BLASTP 2.2.26 [Sep-21-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.


Reference for compositional score matrix adjustment: Altschul, Stephen F., 
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.

Query= 002202
         (954 letters)

Database: nr 
           23,463,169 sequences; 8,064,228,071 total letters

Searching..................................................done



>gi|255575898|ref|XP_002528846.1| conserved hypothetical protein [Ricinus communis]
 gi|223531697|gb|EEF33520.1| conserved hypothetical protein [Ricinus communis]
          Length = 952

 Score = 1513 bits (3917), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 736/952 (77%), Positives = 843/952 (88%), Gaps = 3/952 (0%)

Query: 1   MFEGKILENGLHEEEKEPLNSSFDKVDPAKPASLTWQRKLSTGEIPLSQFTLNWKETVQL 60
           M EG+I  NG  +E++EP +S  +KVDP  PASLTWQRKL++ +I LSQF L+++E  QL
Sbjct: 1   MSEGEIPANGC-QEDREPSDSLLEKVDPGSPASLTWQRKLNSEDIALSQFNLSFQEKFQL 59

Query: 61  APIGVRILCLIREEAAKGKRAFIDPFIKRHLTSSHGVPLGGVGSGSIGRSYRGEFQRWQI 120
           AP+G+R+  LIREE AKG+ + I+PF+KR +TS HG+PLGG+GSGSIGRSY+GEFQRWQ+
Sbjct: 60  APVGIRLWRLIREETAKGRVSIINPFLKRFITSCHGIPLGGIGSGSIGRSYKGEFQRWQL 119

Query: 121 FPRVCEDKPVLANQFSVFVSRSNGQKYSSVLCPKTPEVLKDTTAAGIGSWDWNLKGDKST 180
           FPR+CE+KPVLANQFSVFVSRS+G+KYSSVLCP+ PEVL +   +GIGSWDWNLKGD ST
Sbjct: 120 FPRICEEKPVLANQFSVFVSRSSGEKYSSVLCPRNPEVLMEPAISGIGSWDWNLKGDNST 179

Query: 181 YHALYPRAWTVHEGEPDPELRIVCRQISPIIPHNYKESSYPVSVFTYTIYNSGKTSADIT 240
           YHALYPRAWT+++GEPDPELRIVCRQISPIIPHNYKESSYPVSVFT+T+YNSGKT+AD++
Sbjct: 180 YHALYPRAWTIYDGEPDPELRIVCRQISPIIPHNYKESSYPVSVFTFTLYNSGKTTADVS 239

Query: 241 LLFTWTNSVGGDSEFTGQHYNSKTKMNDGVHAVLLHHRTSHQLPPVTFALAAQETDGVHV 300
           LLFTWTNSVGG+SE++GQH+NS T M DGVHAVLLHH+T+   PPVTFA+AAQET+ VHV
Sbjct: 240 LLFTWTNSVGGNSEYSGQHFNSTTMMEDGVHAVLLHHKTAEGFPPVTFAIAAQETNDVHV 299

Query: 301 SLCPHFVISGNSLGLTAKDMWHEIKEHGSFDRLNSMETSVTSEPGSSIGAAIAASVTVPP 360
           S CP FVISGN  G+TAKDMWHE+KEHGSFD L S  TS  SEPGSSIGAAIAASVT+PP
Sbjct: 300 SKCPRFVISGNCQGITAKDMWHEVKEHGSFDNLKSAGTSGPSEPGSSIGAAIAASVTIPP 359

Query: 361 DSEGQVTFSLAWDCPEVNFMSGKTYYRRYTKFYGTHQNAAANIARDAILEHGSWELQIEA 420
           D+   VTFSL+WDCPEV FM G+TY+RRYTKFY TH +AAA IA DAILEHG WE QI A
Sbjct: 360 DAIRSVTFSLSWDCPEVYFMGGRTYHRRYTKFYSTHGDAAARIAHDAILEHGLWESQIVA 419

Query: 421 WQRPILEDKRLPEWYPITLFNELYYLNAGGAVWTDGSPPVHSLVTIGHRKFSLDWSQSDL 480
           WQRPILEDKRLPEWYPITLFNELYYLN+GG +WTDGSPP H+LV+I   KFSLD S + L
Sbjct: 420 WQRPILEDKRLPEWYPITLFNELYYLNSGGTIWTDGSPPYHNLVSIRGSKFSLDTSGAGL 479

Query: 481 KRIVDVPNQNDTAVNILERMSSILEQIYTPVALNSSFGTNLLQDGEENIGQFLYLEGIEY 540
           K I+DV ++NDTAVNIL RM+S LEQI+  VA NS+FGTNLLQ+GEENIGQFLYLEGIEY
Sbjct: 480 KSIIDVTHENDTAVNILGRMTSALEQIHAHVASNSAFGTNLLQEGEENIGQFLYLEGIEY 539

Query: 541 LMWNTYDVHFYSSFALIMLFPKIQLSIQRDFAAAVMMHDPSKMKLLDDGQWVSRKVLGAV 600
            MWNTYDVHFYSSFAL+MLFPK++LS+QRDFAAAVMMHDPSKM+LL DGQWV RKVLGAV
Sbjct: 540 HMWNTYDVHFYSSFALVMLFPKLELSVQRDFAAAVMMHDPSKMQLLHDGQWVCRKVLGAV 599

Query: 601 PHDIGICDPWFEVNAYCLYDTARWKDLNPKFVLQVYRDVVATGDKKFAKAVWPSVYVAMA 660
           PHDIGI DPW+EVNAY LY+T RWKDLNPKFVLQVYRDVVATGDKKFA+AVWPSVY+AMA
Sbjct: 600 PHDIGINDPWYEVNAYSLYNTDRWKDLNPKFVLQVYRDVVATGDKKFAEAVWPSVYIAMA 659

Query: 661 YMDQFDRDGDGMIENDGFPDQTYDTWSVSGISAYSGGLWVAALQAASALAREVGDRGSED 720
           YMDQFDRDGDGMIENDGFPDQTYDTWSVSG+SAYSGGLWVAALQAASALAREVGD+GSED
Sbjct: 660 YMDQFDRDGDGMIENDGFPDQTYDTWSVSGVSAYSGGLWVAALQAASALAREVGDKGSED 719

Query: 721 YFLFKFQKAKVVYEKLWNGSYFNYDNSGSSQSSSIQADQLAGQWYARACGLLPIVDEDKA 780
           YF  +FQKAK VY+KLWNGSYFNYDNSG   SSSIQADQLAGQWYARACGL PIVD+DKA
Sbjct: 720 YFWARFQKAKDVYDKLWNGSYFNYDNSGGRNSSSIQADQLAGQWYARACGLFPIVDKDKA 779

Query: 781 RSALKKVYNYNVLKVMGGKRGAVNGMLPDGRVDMSSMQSREIWSGVTYAVAASMIHEDLA 840
           RSAL+KVYNYNVLKV  GKRGA+NGMLPDG+VD+SSMQSREIWSGVTYA+AA+MI ED+ 
Sbjct: 780 RSALEKVYNYNVLKVKDGKRGAINGMLPDGKVDLSSMQSREIWSGVTYALAATMIQEDML 839

Query: 841 DIGFQTACGIYEAAWSGTGLGYAFQTPEAWNTDDQYRSLCYMRPLAIWAMQWALTRPKPK 900
           D+ F TA GIYEAAWS  GLGY+FQTPEAWN  DQYRSLCYMRPLAIWAMQWAL+R  PK
Sbjct: 840 DMAFHTASGIYEAAWSERGLGYSFQTPEAWNNVDQYRSLCYMRPLAIWAMQWALSR--PK 897

Query: 901 TLEKQMKPEVTDESLLRYHAGFSKVARLLKLPEEQGAKSLLQSLFDHTCRRM 952
             +++M+ EV ++ LL +HAGF+KVAR L+LPE + +  LLQSLF++TC+++
Sbjct: 898 LEKEEMEMEVNEDYLLPHHAGFTKVARFLRLPEGEESLGLLQSLFNYTCKKL 949


>gi|297739499|emb|CBI29681.3| unnamed protein product [Vitis vinifera]
          Length = 949

 Score = 1470 bits (3805), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 711/952 (74%), Positives = 824/952 (86%), Gaps = 6/952 (0%)

Query: 6   ILENGLHEEEKEPLNSSFD--KVDPAKPASLTWQRKL-STGEIPLSQFTLNWKETVQLAP 62
           + ENGL E E+EP NSS +  KVDP KP SLTWQRKL S G  P+ +F +N +ET  LAP
Sbjct: 1   MCENGLEEGEREPSNSSIEEIKVDPGKPGSLTWQRKLNSDGNAPV-EFKINLRETFHLAP 59

Query: 63  IGVRILCLIREEAAKGKRAFIDPFIKRHLTSSHGVPLGGVGSGSIGRSYRGEFQRWQIFP 122
           IG R+   +REE AKG+ A IDPF+KR+++SS GVPLGG+G+GSIGRSY+GEFQR+Q+FP
Sbjct: 60  IGFRLWRHVREETAKGRGAMIDPFVKRYISSSQGVPLGGIGAGSIGRSYKGEFQRFQLFP 119

Query: 123 RVCEDKPVLANQFSVFVSRSNGQKYSSVLCPKTPEVLKDTTAAGIGSWDWNLKGDKSTYH 182
              E++PVL NQFSVFVSR NG+KYS+VLC ++PE LK+   +GIGSWDWNL G+KSTY 
Sbjct: 120 ITSENEPVLENQFSVFVSRPNGEKYSTVLCRQSPEALKECPPSGIGSWDWNLNGNKSTYL 179

Query: 183 ALYPRAWTVHEGEPDPELRIVCRQISPIIPHNYKESSYPVSVFTYTIYNSGKTSADITLL 242
           ALYPRAWTV++GEPDP L+IVCRQISPIIPHNYKESS+PV+VFT+T++NSGKT+ADITLL
Sbjct: 180 ALYPRAWTVYDGEPDPALKIVCRQISPIIPHNYKESSFPVAVFTFTLFNSGKTAADITLL 239

Query: 243 FTWTNSVGGDSEFTGQHYNSKTKMNDGVHAVLLHHRTSHQLPPVTFALAAQETDGVHVSL 302
           FTW NSVGG S  +GQH NSK  M DGV  VLLHH+T++  PPVT+A+AAQE DGVH+S 
Sbjct: 240 FTWANSVGGVSGLSGQHLNSKFMMKDGVRGVLLHHKTANGRPPVTYAIAAQEMDGVHISE 299

Query: 303 CPHFVISGNSLGLTAKDMWHEIKEHGSFDRLNSMETSVTSEPGSSIGAAIAASVTVPPDS 362
           CP F ISG++ G+TAKDMW+EIKEHGSFDRLNS ETS+ SE GSS+GAA+AAS+T+P DS
Sbjct: 300 CPCFFISGDTPGITAKDMWNEIKEHGSFDRLNSAETSMPSELGSSVGAAVAASLTIPSDS 359

Query: 363 EGQVTFSLAWDCPEVNFMSGKTYYRRYTKFYGTHQNAAANIARDAILEHGSWELQIEAWQ 422
           E  VTFSLAWDCPE+NF   +TYYRRYTKFYGT  +AAA IA DAIL+HG WE QIEAWQ
Sbjct: 360 EQTVTFSLAWDCPEINFSKERTYYRRYTKFYGTRGDAAAKIAHDAILDHGHWESQIEAWQ 419

Query: 423 RPILEDKRLPEWYPITLFNELYYLNAGGAVWTDGSPPVHSLVTIGHRKFSLDWSQSDLKR 482
           +P+LEDKR PEWYPITLFNELYYLN+GG VWTDGSPPVHS  +I  RKFSLD S+SDLK 
Sbjct: 420 KPVLEDKRFPEWYPITLFNELYYLNSGGTVWTDGSPPVHSFTSIIERKFSLDRSRSDLKN 479

Query: 483 IVDVPNQNDTAVNILERMSSILEQIYTPVALNSSFGTNLLQDGEENIGQFLYLEGIEYLM 542
            V + + NDTAV+ILERM+S+LEQ++TPV  NS+FG NLLQ GEENIGQFLYLEG+EY M
Sbjct: 480 TVSISHHNDTAVDILERMTSVLEQVHTPVTSNSAFGPNLLQQGEENIGQFLYLEGVEYFM 539

Query: 543 WNTYDVHFYSSFALIMLFPKIQLSIQRDFAAAVMMHDPSKMKLLDDGQWVSRKVLGAVPH 602
           WNT DVHFYSSFALIMLFPK++LSIQRDFAA+VMMHDPSKMKLL +G+WVSRKVLGAVPH
Sbjct: 540 WNTNDVHFYSSFALIMLFPKLELSIQRDFAASVMMHDPSKMKLLCNGKWVSRKVLGAVPH 599

Query: 603 DIGICDPWFEVNAYCLYDTARWKDLNPKFVLQVYRDVVATGDKKFAKAVWPSVYVAMAYM 662
           D+G  DPWFEVN Y LY+T RWKDLNPKFVLQVYRDVVATGDK FAKAVWPSVYVA+AYM
Sbjct: 600 DVGNYDPWFEVNGYNLYNTDRWKDLNPKFVLQVYRDVVATGDKNFAKAVWPSVYVALAYM 659

Query: 663 DQFDRDGDGMIENDGFPDQTYDTWSVSGISAYSGGLWVAALQAASALAREVGDRGSEDYF 722
           +QFD+DGDGMIEN+GFPDQTYDTWSVSG+SAYSGGLWVAALQAASALAR VGD+GSEDYF
Sbjct: 660 NQFDKDGDGMIENEGFPDQTYDTWSVSGVSAYSGGLWVAALQAASALARVVGDKGSEDYF 719

Query: 723 LFKFQKAKVVYEKLWNGSYFNYDNSGSSQSSSIQADQLAGQWYARACGLLPIVDEDKARS 782
            FKFQKAK VY+KLWNGSYFNYD+S  S SSSIQADQLAGQWYARACGL PIVDEDKA+S
Sbjct: 720 WFKFQKAKGVYQKLWNGSYFNYDDSDGSSSSSIQADQLAGQWYARACGLSPIVDEDKAKS 779

Query: 783 ALKKVYNYNVLKVMGGKRGAVNGMLPDGRVDMSSMQSREIWSGVTYAVAASMIHEDLADI 842
           AL+KVY+YNVLKV GGKRGAVNGMLPDG+VD ++MQSREIWSGVTY VAA+MIHE L D+
Sbjct: 780 ALEKVYHYNVLKVGGGKRGAVNGMLPDGKVDTTTMQSREIWSGVTYGVAATMIHEGLVDM 839

Query: 843 GFQTACGIYEAAWSGTGLGYAFQTPEAWNTDDQYRSLCYMRPLAIWAMQWALTRPKPKTL 902
            FQTA G+YEAAWS  GLGY+FQTPE+WNTDDQYRSLCYMRPLAIWAMQWA ++PK    
Sbjct: 840 AFQTASGVYEAAWSQEGLGYSFQTPESWNTDDQYRSLCYMRPLAIWAMQWAFSQPKLHKY 899

Query: 903 EKQMKPEVTDESLLRYHAGFSKVARLLKLPEEQGAKSLLQSLFDHTCRRMFI 954
           E    PE+ ++SL+  HAGFS+VARLLKLP+E  ++S LQ ++D+TC+RM++
Sbjct: 900 EAN--PEMNEDSLVLQHAGFSRVARLLKLPDEDVSRSALQVIYDYTCKRMWL 949


>gi|359486233|ref|XP_002264575.2| PREDICTED: non-lysosomal glucosylceramidase [Vitis vinifera]
          Length = 960

 Score = 1466 bits (3794), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 711/963 (73%), Positives = 824/963 (85%), Gaps = 17/963 (1%)

Query: 6   ILENGLHEEEKEPLNSSFD-------------KVDPAKPASLTWQRKL-STGEIPLSQFT 51
           + ENGL E E+EP NSS +             KVDP KP SLTWQRKL S G  P+ +F 
Sbjct: 1   MCENGLEEGEREPSNSSIEEVGHVLCYMLGRIKVDPGKPGSLTWQRKLNSDGNAPV-EFK 59

Query: 52  LNWKETVQLAPIGVRILCLIREEAAKGKRAFIDPFIKRHLTSSHGVPLGGVGSGSIGRSY 111
           +N +ET  LAPIG R+   +REE AKG+ A IDPF+KR+++SS GVPLGG+G+GSIGRSY
Sbjct: 60  INLRETFHLAPIGFRLWRHVREETAKGRGAMIDPFVKRYISSSQGVPLGGIGAGSIGRSY 119

Query: 112 RGEFQRWQIFPRVCEDKPVLANQFSVFVSRSNGQKYSSVLCPKTPEVLKDTTAAGIGSWD 171
           +GEFQR+Q+FP   E++PVL NQFSVFVSR NG+KYS+VLC ++PE LK+   +GIGSWD
Sbjct: 120 KGEFQRFQLFPITSENEPVLENQFSVFVSRPNGEKYSTVLCRQSPEALKECPPSGIGSWD 179

Query: 172 WNLKGDKSTYHALYPRAWTVHEGEPDPELRIVCRQISPIIPHNYKESSYPVSVFTYTIYN 231
           WNL G+KSTY ALYPRAWTV++GEPDP L+IVCRQISPIIPHNYKESS+PV+VFT+T++N
Sbjct: 180 WNLNGNKSTYLALYPRAWTVYDGEPDPALKIVCRQISPIIPHNYKESSFPVAVFTFTLFN 239

Query: 232 SGKTSADITLLFTWTNSVGGDSEFTGQHYNSKTKMNDGVHAVLLHHRTSHQLPPVTFALA 291
           SGKT+ADITLLFTW NSVGG S  +GQH NSK  M DGV  VLLHH+T++  PPVT+A+A
Sbjct: 240 SGKTAADITLLFTWANSVGGVSGLSGQHLNSKFMMKDGVRGVLLHHKTANGRPPVTYAIA 299

Query: 292 AQETDGVHVSLCPHFVISGNSLGLTAKDMWHEIKEHGSFDRLNSMETSVTSEPGSSIGAA 351
           AQE DGVH+S CP F ISG++ G+TAKDMW+EIKEHGSFDRLNS ETS+ SE GSS+GAA
Sbjct: 300 AQEMDGVHISECPCFFISGDTPGITAKDMWNEIKEHGSFDRLNSAETSMPSELGSSVGAA 359

Query: 352 IAASVTVPPDSEGQVTFSLAWDCPEVNFMSGKTYYRRYTKFYGTHQNAAANIARDAILEH 411
           +AAS+T+P DSE  VTFSLAWDCPE+NF   +TYYRRYTKFYGT  +AAA IA DAIL+H
Sbjct: 360 VAASLTIPSDSEQTVTFSLAWDCPEINFSKERTYYRRYTKFYGTRGDAAAKIAHDAILDH 419

Query: 412 GSWELQIEAWQRPILEDKRLPEWYPITLFNELYYLNAGGAVWTDGSPPVHSLVTIGHRKF 471
           G WE QIEAWQ+P+LEDKR PEWYPITLFNELYYLN+GG VWTDGSPPVHS  +I  RKF
Sbjct: 420 GHWESQIEAWQKPVLEDKRFPEWYPITLFNELYYLNSGGTVWTDGSPPVHSFTSIIERKF 479

Query: 472 SLDWSQSDLKRIVDVPNQNDTAVNILERMSSILEQIYTPVALNSSFGTNLLQDGEENIGQ 531
           SLD S+SDLK  V + + NDTAV+ILERM+S+LEQ++TPV  NS+FG NLLQ GEENIGQ
Sbjct: 480 SLDRSRSDLKNTVSISHHNDTAVDILERMTSVLEQVHTPVTSNSAFGPNLLQQGEENIGQ 539

Query: 532 FLYLEGIEYLMWNTYDVHFYSSFALIMLFPKIQLSIQRDFAAAVMMHDPSKMKLLDDGQW 591
           FLYLEG+EY MWNT DVHFYSSFALIMLFPK++LSIQRDFAA+VMMHDPSKMKLL +G+W
Sbjct: 540 FLYLEGVEYFMWNTNDVHFYSSFALIMLFPKLELSIQRDFAASVMMHDPSKMKLLCNGKW 599

Query: 592 VSRKVLGAVPHDIGICDPWFEVNAYCLYDTARWKDLNPKFVLQVYRDVVATGDKKFAKAV 651
           VSRKVLGAVPHD+G  DPWFEVN Y LY+T RWKDLNPKFVLQVYRDVVATGDK FAKAV
Sbjct: 600 VSRKVLGAVPHDVGNYDPWFEVNGYNLYNTDRWKDLNPKFVLQVYRDVVATGDKNFAKAV 659

Query: 652 WPSVYVAMAYMDQFDRDGDGMIENDGFPDQTYDTWSVSGISAYSGGLWVAALQAASALAR 711
           WPSVYVA+AYM+QFD+DGDGMIEN+GFPDQTYDTWSVSG+SAYSGGLWVAALQAASALAR
Sbjct: 660 WPSVYVALAYMNQFDKDGDGMIENEGFPDQTYDTWSVSGVSAYSGGLWVAALQAASALAR 719

Query: 712 EVGDRGSEDYFLFKFQKAKVVYEKLWNGSYFNYDNSGSSQSSSIQADQLAGQWYARACGL 771
            VGD+GSEDYF FKFQKAK VY+KLWNGSYFNYD+S  S SSSIQADQLAGQWYARACGL
Sbjct: 720 VVGDKGSEDYFWFKFQKAKGVYQKLWNGSYFNYDDSDGSSSSSIQADQLAGQWYARACGL 779

Query: 772 LPIVDEDKARSALKKVYNYNVLKVMGGKRGAVNGMLPDGRVDMSSMQSREIWSGVTYAVA 831
            PIVDEDKA+SAL+KVY+YNVLKV GGKRGAVNGMLPDG+VD ++MQSREIWSGVTY VA
Sbjct: 780 SPIVDEDKAKSALEKVYHYNVLKVGGGKRGAVNGMLPDGKVDTTTMQSREIWSGVTYGVA 839

Query: 832 ASMIHEDLADIGFQTACGIYEAAWSGTGLGYAFQTPEAWNTDDQYRSLCYMRPLAIWAMQ 891
           A+MIHE L D+ FQTA G+YEAAWS  GLGY+FQTPE+WNTDDQYRSLCYMRPLAIWAMQ
Sbjct: 840 ATMIHEGLVDMAFQTASGVYEAAWSQEGLGYSFQTPESWNTDDQYRSLCYMRPLAIWAMQ 899

Query: 892 WALTRPKPKTLEKQMKPEVTDESLLRYHAGFSKVARLLKLPEEQGAKSLLQSLFDHTCRR 951
           WA ++PK    E    PE+ ++SL+  HAGFS+VARLLKLP+E  ++S LQ ++D+TC+R
Sbjct: 900 WAFSQPKLHKYEAN--PEMNEDSLVLQHAGFSRVARLLKLPDEDVSRSALQVIYDYTCKR 957

Query: 952 MFI 954
           M++
Sbjct: 958 MWL 960


>gi|297795757|ref|XP_002865763.1| hypothetical protein ARALYDRAFT_331392 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297311598|gb|EFH42022.1| hypothetical protein ARALYDRAFT_331392 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 956

 Score = 1439 bits (3725), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 678/958 (70%), Positives = 797/958 (83%), Gaps = 8/958 (0%)

Query: 1   MFEGKILENGLHEEEKEPLNSSFDKVDPAKPASLTWQRKLSTGEIPLSQFTLNWKETVQL 60
           MFE KI+  G   E+ +  NSS  KVDPA PASLTWQRK+ +   P  +F L  KE  QL
Sbjct: 1   MFEEKIITIG---EDVKSFNSSDHKVDPAVPASLTWQRKIDSDAKPPREFNLTAKEIFQL 57

Query: 61  APIGVRILCLIREEAAKGKRAFIDPFIKRHLTSSHGVPLGGVGSGSIGRSYRGEFQRWQI 120
           AP+G+R+  L REEAAKG+ AFIDPF +  +TSSHGVPLGG+G+GSIGRS++GEFQRWQ+
Sbjct: 58  APVGIRLWFLCREEAAKGRLAFIDPFSRHSVTSSHGVPLGGIGAGSIGRSFQGEFQRWQL 117

Query: 121 FPRVCEDKPVLANQFSVFVSRSNGQKYSSVLCPKTPEVLKDTTAAGIGSWDWNLKGDKST 180
           FP  CE +PVLANQFS FVSR+NG+KYS+VLCP+ P++ K  + +GIGSWDWNLKGDKST
Sbjct: 118 FPPKCEYEPVLANQFSAFVSRANGKKYSTVLCPRNPKMGKQDSESGIGSWDWNLKGDKST 177

Query: 181 YHALYPRAWTVHEGEPDPELRIVCRQISPIIPHNYKESSYPVSVFTYTIYNSGKTSADIT 240
           YHALYPR+WT++EGEPDPELRI+CRQ+SP IPHNYKESS+PVSVFT+T++N G T+A++T
Sbjct: 178 YHALYPRSWTMYEGEPDPELRIICRQVSPFIPHNYKESSFPVSVFTFTLHNLGNTTANVT 237

Query: 241 LLFTWTNSVGGDSEFTGQHYNSKTKMNDGVHAVLLHHRTSHQLPPVTFALAAQETDGVHV 300
           LLFTW NSVGGDSEF+G HYNSK  MNDGV  VLLHH+T++ LP +++A++AQ TDGV V
Sbjct: 238 LLFTWANSVGGDSEFSGGHYNSKIMMNDGVQGVLLHHKTANGLPSLSYAISAQATDGVSV 297

Query: 301 SLCPHFVISGNSLGLTAKDMWHEIKEHGSFDRLNSMETSVTSEPGSSIGAAIAASVTVPP 360
           S CP F++SG   G+TAKDMW  +KE+GSFD   + E S+ SE GSSIGAA+AAS+TV P
Sbjct: 298 SACPFFIVSGKQNGITAKDMWQAVKENGSFDHFKASEASMQSENGSSIGAAVAASLTVLP 357

Query: 361 DSEGQVTFSLAWDCPEVNFMSGKTYYRRYTKFYGTHQNAAANIARDAILEHGSWELQIEA 420
                VTFSLAWDCPEV F SGK Y RRYTKFYG H +AAA IA DAIL H  WE  IE 
Sbjct: 358 GESRIVTFSLAWDCPEVQFPSGKIYSRRYTKFYGNHGDAAAQIAYDAILGHSQWESWIED 417

Query: 421 WQRPILEDKRLPEWYPITLFNELYYLNAGGAVWTDGSPPVHSLVTIGHRKFSLDWSQSDL 480
           WQRPILEDKRLP WYP+TLFNELYYLN+GG +WTDGS PVHSL  +  +KFSLD SQ  L
Sbjct: 418 WQRPILEDKRLPAWYPVTLFNELYYLNSGGTLWTDGSSPVHSLAGVREKKFSLDKSQLGL 477

Query: 481 KRIVDVPNQNDTAVNILERMSSILEQIYTPVALNSSFGTNLLQDGEENIGQFLYLEGIEY 540
              ++VP+QNDTA+++L +M+S LEQ++   A NS+FGT LL++GEENIG FLYLEGIEY
Sbjct: 478 NNDIEVPHQNDTAISVLGKMASTLEQLHASTASNSAFGTKLLEEGEENIGHFLYLEGIEY 537

Query: 541 LMWNTYDVHFYSSFALIMLFPKIQLSIQRDFAAAVMMHDPSKMKLLDDGQWVSRKVLGAV 600
            MWNTYDVHFY+SFAL+MLFPK++LSIQRDFAAAVM+HDP+K+K L +GQWV RKVLGAV
Sbjct: 538 RMWNTYDVHFYASFALVMLFPKLELSIQRDFAAAVMLHDPTKVKTLSEGQWVQRKVLGAV 597

Query: 601 PHDIGICDPWFEVNAYCLYDTARWKDLNPKFVLQVYRDVVATGDKKFAKAVWPSVYVAMA 660
           PHD+GI DPWFEVN Y L++T RWKDLNPKFVLQVYRDVVATGDKKFA AVWPSVYVAMA
Sbjct: 598 PHDLGINDPWFEVNGYTLHNTDRWKDLNPKFVLQVYRDVVATGDKKFASAVWPSVYVAMA 657

Query: 661 YMDQFDRDGDGMIENDGFPDQTYDTWSVSGISAYSGGLWVAALQAASALAREVGDRGSED 720
           YM QFD+DGDGMIEN+GFPDQTYDTWS SG+SAY GGLWVAALQAAS LAR VGD+ S+D
Sbjct: 658 YMAQFDKDGDGMIENEGFPDQTYDTWSASGVSAYCGGLWVAALQAASVLARVVGDKNSQD 717

Query: 721 YFLFKFQKAKVVYE-KLWNGSYFNYDNSGSSQSSSIQADQLAGQWYARACGLLPIVDEDK 779
           YF  KFQKAKVVYE KLWNGSYFNYDNSGS  SSSIQADQLAGQWYARA GLLPIVDEDK
Sbjct: 718 YFWSKFQKAKVVYEKKLWNGSYFNYDNSGSQYSSSIQADQLAGQWYARASGLLPIVDEDK 777

Query: 780 ARSALKKVYNYNVLKVMGGKRGAVNGMLPDGRVDMSSMQSREIWSGVTYAVAASMIHEDL 839
           AR+AL+KVYN+NV+K+  GKRGAVNGM P+G+VD +SMQSREIWSGVTYA++A+MI E L
Sbjct: 778 ARTALEKVYNFNVMKIKDGKRGAVNGMHPNGKVDTASMQSREIWSGVTYALSATMIQEGL 837

Query: 840 ADIGFQTACGIYEAAWSGTGLGYAFQTPEAWNTDDQYRSLCYMRPLAIWAMQWALTRPKP 899
            ++ FQTA G+YEAAWS  GLGY+FQTPEAWNT+D+YRSL YMRPLAIWAMQWALT+   
Sbjct: 838 VEMAFQTASGVYEAAWSEAGLGYSFQTPEAWNTNDEYRSLTYMRPLAIWAMQWALTKTSQ 897

Query: 900 KT----LEKQMKPEVTDESLLRYHAGFSKVARLLKLPEEQGAKSLLQSLFDHTCRRMF 953
           K     LE + +PEV   SL+++  GFS+V+RLL LP E  AKS LQ+LFD+TCRR+ 
Sbjct: 898 KQQQLGLEPEQEPEVETNSLMKHDIGFSRVSRLLNLPNEASAKSTLQTLFDYTCRRLM 955


>gi|224129460|ref|XP_002328722.1| predicted protein [Populus trichocarpa]
 gi|222839020|gb|EEE77371.1| predicted protein [Populus trichocarpa]
          Length = 922

 Score = 1436 bits (3718), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 704/930 (75%), Positives = 805/930 (86%), Gaps = 10/930 (1%)

Query: 25  KVDPAKPASLTWQRKLSTGEIPLSQFTLNWKETVQLAPIGVRILCLIREEAAKGKRAFID 84
           KVDP KPA LTWQR+L T E  LSQFTL W+E +++APIG+R+   ++E A K K  FID
Sbjct: 1   KVDPGKPAPLTWQRRLDTSETVLSQFTLTWQEILRMAPIGIRLWRYVKENAKKKKGIFID 60

Query: 85  PFIKRHLTSSHGVPLGGVGSGSIGRSYRGEFQRWQIFPRVCEDKPVLANQFSVFVSRSNG 144
           PF KR++TSSHG+P+GGVGSGSIGRSYRGEFQRWQ+FPRV E+KPVLANQFS+FVSRSNG
Sbjct: 61  PFAKRNVTSSHGIPVGGVGSGSIGRSYRGEFQRWQLFPRV-EEKPVLANQFSIFVSRSNG 119

Query: 145 QKYSSVLCPKTPEVLKDTTAAGIGSWDWNLKGDKSTYHALYPRAWTVHEGEPDPELRIVC 204
           +KY SVLC ++P+ L++   +GI SW+WNLKGD STYHALYPRAWTV+EGEPDPELR+VC
Sbjct: 120 KKYCSVLCSRSPDKLEEPAGSGIESWEWNLKGDNSTYHALYPRAWTVYEGEPDPELRVVC 179

Query: 205 RQISPIIPHNYKESSYPVSVFTYTIYNSGKTSADITLLFTWTNSVGGDSEFTGQHYNSKT 264
           RQISP+IPHNYKESS+PVSVFT+T+YNSG+T+AD+TLLFTW NSVGG SEF+GQH NS  
Sbjct: 180 RQISPVIPHNYKESSFPVSVFTFTLYNSGETAADVTLLFTWANSVGGVSEFSGQHLNSTK 239

Query: 265 KMNDGVHAVLLHHRTSHQLPPVTFALAAQETDGVHVSLCPHFVISGNSLGLTAKDMWHEI 324
            M+DGVH VLLHH+T+++LPP+TFA+AAQET GVHVS CP FVISGNS GLTAK+MW+E+
Sbjct: 240 MMDDGVHCVLLHHKTANELPPLTFAIAAQETPGVHVSKCPSFVISGNSQGLTAKEMWNEV 299

Query: 325 KEHGSFDRLNSMETSVTSEPGSSIGAAIAASVTVPPDSEGQVTFSLAWDCPEVNFMSGKT 384
           KEHGSFD LNS    V SEPGSSIGAAIAA+ TVPPDS   VTFSLAWDCPEV F SG+T
Sbjct: 300 KEHGSFDNLNSSGKPVPSEPGSSIGAAIAATSTVPPDSVCTVTFSLAWDCPEVIFASGRT 359

Query: 385 YYRRYTKFYGTHQNAAANIARDAILEHGSWELQIEAWQRPILEDKRLPEWYPITLFNELY 444
           Y+RRYTKFYGTH +AAANIA DAIL HG W+ QIEAWQRPILEDKRLPEWYP+TLFNELY
Sbjct: 360 YHRRYTKFYGTHGDAAANIAHDAILGHGHWDSQIEAWQRPILEDKRLPEWYPVTLFNELY 419

Query: 445 YLNAGGAVWTDGSPPVHSLVTIGHRKFSLDWSQSDLKRIVDVPNQNDTAVNILERMSSIL 504
           YLN+GG +WTDGS P+HSL T+G +KFSLD + SDL       +Q DT+V+IL RM+S+L
Sbjct: 420 YLNSGGTIWTDGSSPLHSLATVGGKKFSLDRTGSDLG------HQGDTSVDILGRMTSVL 473

Query: 505 EQIYTPVALNSSFGTNLLQDGEENIGQFLYLEGIEYLMWNTYDVHFYSSFALIMLFPKIQ 564
           EQI+TP+A NS+ GTNLLQ+GEEN+GQFLYLEGIEY MWNTYDVHFY+SFALIMLFPK+Q
Sbjct: 474 EQIHTPLATNSALGTNLLQEGEENVGQFLYLEGIEYPMWNTYDVHFYASFALIMLFPKLQ 533

Query: 565 LSIQRDFAAAVMMHDPSKMKLLDDGQWVSRKVLGAVPHDIGICDPWFEVNAYCLYDTARW 624
           LSIQRDFAAAVMMHDPSKM LL DGQ V+RKVLGAVPHDIGI DPWFEVNAY L++T RW
Sbjct: 534 LSIQRDFAAAVMMHDPSKMHLLLDGQCVTRKVLGAVPHDIGIDDPWFEVNAYNLHNTDRW 593

Query: 625 KDLNPKFVLQVYRDVVATGDKKFAKAVWPSVYVAMAYMDQFDRDGDGMIENDGFPDQTYD 684
           KDLNPKFVLQVYRDVVATGDKKFA+A WPSVYVAMAYMDQFD+DGDGMIENDGFPDQTYD
Sbjct: 594 KDLNPKFVLQVYRDVVATGDKKFAQAAWPSVYVAMAYMDQFDKDGDGMIENDGFPDQTYD 653

Query: 685 TWSVSGISAYSGGLWVAALQAASALAREVGDRGSEDYFLFKFQKAKVVYEKLWNGSYFNY 744
           TWSVSG+SAY GGLWVAALQAASALA EVGD+ S +YF F+FQKAKVVY+KLWNGSYFNY
Sbjct: 654 TWSVSGVSAYCGGLWVAALQAASALAWEVGDKDSAEYFWFRFQKAKVVYDKLWNGSYFNY 713

Query: 745 DNSGSSQSSSIQADQLAGQWYARACGLLPIVDEDKARSALKKVYNYNVLKVMGGKRGAVN 804
           D+S    S SIQADQLAGQWYARACGLLPIVDEDKARSAL+K+YNYN LKV  GKRGAVN
Sbjct: 714 DDSNGRNSPSIQADQLAGQWYARACGLLPIVDEDKARSALEKIYNYNFLKVHDGKRGAVN 773

Query: 805 GMLPDGRVDMSSMQSREIWSGVTYAVAASMIHEDLADIGFQTACGIYEAAWSGTGLGYAF 864
           GMLPDG VDMS MQSREIWSGVTYAVAA+M+ E L D+ F TA G+YEAAW+  GLGY+F
Sbjct: 774 GMLPDGTVDMSDMQSREIWSGVTYAVAATMMQEGLMDMAFHTASGVYEAAWAEQGLGYSF 833

Query: 865 QTPEAWNTDDQYRSLCYMRPLAIWAMQWALTRPKPKTLEKQMKPEVTDESLL--RYHAGF 922
           QTPE WNT+ QYRSL YMRPLAIWAMQW L+ PK    E   + ++ D SLL  ++HAGF
Sbjct: 834 QTPEGWNTNGQYRSLGYMRPLAIWAMQWTLSSPKLHKQEMNFQVKLED-SLLGHQHHAGF 892

Query: 923 SKVARLLKLPEEQGAKSLLQSLFDHTCRRM 952
           +KVAR LKLPEE+ + S LQ+LFD+ C++ 
Sbjct: 893 AKVARFLKLPEEESSVSYLQALFDYACKKF 922


>gi|110742221|dbj|BAE99037.1| hypothetical protein [Arabidopsis thaliana]
          Length = 957

 Score = 1420 bits (3677), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 686/959 (71%), Positives = 803/959 (83%), Gaps = 9/959 (0%)

Query: 1   MFEGKILENGLHEEEKEPLNSSFDKVDPAKPASLTWQRKLSTGEIPLSQFTLNWKETVQL 60
           MFE KI++ G   E+ +PLNSS  KVDPA PASLTWQRK+ +      +F L+ KE  QL
Sbjct: 1   MFEEKIMDIG---EDVKPLNSSDTKVDPAVPASLTWQRKIDSDVKAPREFNLSVKEIFQL 57

Query: 61  APIGVRILCLIREEAAKGKRAFIDPFIKRHLTSSHGVPLGGVGSGSIGRSYRGEFQRWQI 120
           AP+G+R+  L REEAAKG+ AFIDPF K  +TSSHGVPLGG+G+GSIGRS++GEFQRWQ+
Sbjct: 58  APVGIRLWFLCREEAAKGRLAFIDPFSKHSVTSSHGVPLGGIGAGSIGRSFKGEFQRWQL 117

Query: 121 FPRVCEDKPVLANQFSVFVSRSNGQKYSSVLCPKTPEVLKDTTAAGIGSWDWNLKGDKST 180
           FP  CED+PVLANQFS FVSR+NG+KYSSVLCP+ P++ K  + +GIGSWDWNLKGDKST
Sbjct: 118 FPPKCEDEPVLANQFSAFVSRANGKKYSSVLCPRNPKLDKQDSESGIGSWDWNLKGDKST 177

Query: 181 YHALYPRAWTVHEGEPDPELRIVCRQISPIIPHNYKESSYPVSVFTYTIYNSGKTSADIT 240
           YHALYPR+WT++EGEPDPELRIVCRQ+SP IPHNYKESS+PVSVFT+T++N G T+AD+T
Sbjct: 178 YHALYPRSWTMYEGEPDPELRIVCRQVSPFIPHNYKESSFPVSVFTFTLHNLGNTTADVT 237

Query: 241 LLFTWTNSVGGDSEFTGQHYNSKTKMNDGVHAVLLHHRTSHQLPPVTFALAAQETDGVHV 300
           LLFTW NSVGGDSEF+G HYNSK  MNDGV  VLLHH+T++ LP +++A++AQ TDGV V
Sbjct: 238 LLFTWANSVGGDSEFSGGHYNSKITMNDGVQGVLLHHKTANGLPSLSYAISAQATDGVSV 297

Query: 301 SLCPHFVISGNSLGLTAKDMWHEIKEHGSFDRLNSMETSVTSEPGSSIGAAIAASVTVPP 360
           S CP F++SG   G+TAKDMW  +KE+GSFD L + E S+ S+ GSSIGAA+AASVTV P
Sbjct: 298 SACPFFIVSGKQDGITAKDMWQAVKENGSFDHLKASEASMQSDHGSSIGAAVAASVTVLP 357

Query: 361 DSEGQVTFSLAWDCPEVNFMSGKTYYRRYTKFYGTHQNAAANIARDAILEHGSWELQIEA 420
                VTFSLAWDCPEV F SGK Y RRYTKFYG + +AAA IA DAIL H  WE  IE 
Sbjct: 358 GESRIVTFSLAWDCPEVQFPSGKIYSRRYTKFYGNNGDAAAQIAHDAILGHSQWESWIED 417

Query: 421 WQRPILEDKRLPEWYPITLFNELYYLNAGGAVWTDGSPPVHSLVTIGHRKFSLDWSQSDL 480
           WQRPILEDKRLP WYP+TLFNELYYLN+GG +WTDGS PVHSL  +  +KFSLD SQ  L
Sbjct: 418 WQRPILEDKRLPAWYPVTLFNELYYLNSGGTLWTDGSSPVHSLAGVREKKFSLDKSQLGL 477

Query: 481 KRIVDVPNQNDTAVNILERMSSILEQIYTPVALNSSFGTNLLQDGEENIGQFLYLEGIEY 540
           K  +DVP+QNDTAV++LE+M+S LE+++     NS+FGT LL++GEENIG FLYLEGIEY
Sbjct: 478 KNDIDVPHQNDTAVSVLEKMASTLEELHASTTSNSAFGTKLLEEGEENIGHFLYLEGIEY 537

Query: 541 LMWNTYDVHFYSSFALIMLFPKIQLSIQRDFAAAVMMHDPSKMKLLDDGQWVSRKVLGAV 600
            MWNTYDVHFY+SFAL+MLFPK++LSIQRDFAAAVM+HDP+K+K L +GQWV RKVLGAV
Sbjct: 538 RMWNTYDVHFYASFALVMLFPKLELSIQRDFAAAVMLHDPTKVKTLSEGQWVQRKVLGAV 597

Query: 601 PHDIGICDPWFEVNAYCLYDTARWKDLNPKFVLQVYRDVVATGDKKFAKAVWPSVYVAMA 660
           PHD+GI DPWFEVN Y L++T RWKDLNPKFVLQVYRDVVATGDKKFA AVWPSVYVAMA
Sbjct: 598 PHDLGINDPWFEVNGYTLHNTDRWKDLNPKFVLQVYRDVVATGDKKFASAVWPSVYVAMA 657

Query: 661 YMDQFDRDGDGMIENDGFPDQTYDTWSVSGISAYSGGLWVAALQAASALAREVGDRGSED 720
           YM QFD+DGDGMIEN+GFPDQTYDTWS SG+SAY GGLWVAALQAASALAR VGD+ S+D
Sbjct: 658 YMAQFDKDGDGMIENEGFPDQTYDTWSASGVSAYCGGLWVAALQAASALARVVGDKNSQD 717

Query: 721 YFLFKFQKAKVVYE-KLWNGSYFNYDNSGSSQSSSIQADQLAGQWYARACGLLPIVDEDK 779
           YF  KFQKAKVVYE KLWNGSYFNYDNSGS  SS+IQADQLAGQWYARA GLLPIVDEDK
Sbjct: 718 YFWSKFQKAKVVYEKKLWNGSYFNYDNSGSQYSSTIQADQLAGQWYARASGLLPIVDEDK 777

Query: 780 ARSALKKVYNYNVLKVMGGKRGAVNGMLPDGRVDMSSMQSREIWSGVTYAVAASMIHEDL 839
           AR+AL+KVYNYNV+K+  GKRGAVNGM P+G+VD +SMQSREIWSGVTYA++A+MI E L
Sbjct: 778 ARTALEKVYNYNVMKIKDGKRGAVNGMHPNGKVDTASMQSREIWSGVTYALSATMIQEGL 837

Query: 840 ADIGFQTACGIYEAAWSGTGLGYAFQTPEAWNTDDQYRSLCYMRPLAIWAMQWALTRPKP 899
            ++ FQTA GIYEAAWS TGLGY+FQTPE+WNT D+YRSL YMRPLAIWAMQWALT+   
Sbjct: 838 VEMAFQTASGIYEAAWSKTGLGYSFQTPESWNTVDEYRSLTYMRPLAIWAMQWALTKTSQ 897

Query: 900 KTLE-----KQMKPEVTDESLLRYHAGFSKVARLLKLPEEQGAKSLLQSLFDHTCRRMF 953
           K  +     +Q +PE+   S +++  GFS+V+RLL LP E  AKS LQ+LFD+TCRRM 
Sbjct: 898 KQEQLGLEPEQQEPELEPSSSMKHDIGFSRVSRLLSLPNEASAKSTLQTLFDYTCRRMM 956


>gi|30695777|ref|NP_199801.2| Beta-glucosidase, GBA2 type family protein [Arabidopsis thaliana]
 gi|27311753|gb|AAO00842.1| putative protein [Arabidopsis thaliana]
 gi|34365725|gb|AAQ65174.1| At5g49900 [Arabidopsis thaliana]
 gi|332008486|gb|AED95869.1| Beta-glucosidase, GBA2 type family protein [Arabidopsis thaliana]
          Length = 957

 Score = 1420 bits (3677), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 686/959 (71%), Positives = 803/959 (83%), Gaps = 9/959 (0%)

Query: 1   MFEGKILENGLHEEEKEPLNSSFDKVDPAKPASLTWQRKLSTGEIPLSQFTLNWKETVQL 60
           MFE KI++ G   E+ +PLNSS  KVDPA PASLTWQRK+ +      +F L+ KE  QL
Sbjct: 1   MFEEKIMDIG---EDVKPLNSSDTKVDPAVPASLTWQRKIDSDVKAPREFNLSVKEIFQL 57

Query: 61  APIGVRILCLIREEAAKGKRAFIDPFIKRHLTSSHGVPLGGVGSGSIGRSYRGEFQRWQI 120
           AP+G+R+  L REEAAKG+ AFIDPF K  +TSSHGVPLGG+G+GSIGRS++GEFQRWQ+
Sbjct: 58  APVGIRLWFLCREEAAKGRLAFIDPFSKHSVTSSHGVPLGGIGAGSIGRSFKGEFQRWQL 117

Query: 121 FPRVCEDKPVLANQFSVFVSRSNGQKYSSVLCPKTPEVLKDTTAAGIGSWDWNLKGDKST 180
           FP  CED+PVLANQFS FVSR+NG+KYSSVLCP+ P++ K  + +GIGSWDWNLKGDKST
Sbjct: 118 FPPKCEDEPVLANQFSAFVSRANGKKYSSVLCPRNPKLDKQDSESGIGSWDWNLKGDKST 177

Query: 181 YHALYPRAWTVHEGEPDPELRIVCRQISPIIPHNYKESSYPVSVFTYTIYNSGKTSADIT 240
           YHALYPR+WT++EGEPDPELRIVCRQ+SP IPHNYKESS+PVSVFT+T++N G T+AD+T
Sbjct: 178 YHALYPRSWTMYEGEPDPELRIVCRQVSPFIPHNYKESSFPVSVFTFTLHNLGNTTADVT 237

Query: 241 LLFTWTNSVGGDSEFTGQHYNSKTKMNDGVHAVLLHHRTSHQLPPVTFALAAQETDGVHV 300
           LLFTW NSVGGDSEF+G HYNSK  MNDGV  VLLHH+T++ LP +++A++AQ TDGV V
Sbjct: 238 LLFTWANSVGGDSEFSGGHYNSKITMNDGVQGVLLHHKTANGLPSLSYAISAQATDGVSV 297

Query: 301 SLCPHFVISGNSLGLTAKDMWHEIKEHGSFDRLNSMETSVTSEPGSSIGAAIAASVTVPP 360
           S CP F++SG   G+TAKDMW  +KE+GSFD L + E S+ S+ GSSIGAA+AASVTV P
Sbjct: 298 SACPFFIVSGKQDGITAKDMWQAVKENGSFDHLKASEASMQSDHGSSIGAAVAASVTVLP 357

Query: 361 DSEGQVTFSLAWDCPEVNFMSGKTYYRRYTKFYGTHQNAAANIARDAILEHGSWELQIEA 420
                VTFSLAWDCPEV F SGK Y RRYTKFYG + +AAA IA DAIL H  WE  IE 
Sbjct: 358 GESRIVTFSLAWDCPEVQFPSGKIYSRRYTKFYGNNGDAAAQIAHDAILGHSQWESWIED 417

Query: 421 WQRPILEDKRLPEWYPITLFNELYYLNAGGAVWTDGSPPVHSLVTIGHRKFSLDWSQSDL 480
           WQRPILEDKRLP WYP+TLFNELYYLN+GG +WTDGS PVHSL  +  +KFSLD SQ  L
Sbjct: 418 WQRPILEDKRLPAWYPVTLFNELYYLNSGGTLWTDGSSPVHSLAGVREKKFSLDKSQLGL 477

Query: 481 KRIVDVPNQNDTAVNILERMSSILEQIYTPVALNSSFGTNLLQDGEENIGQFLYLEGIEY 540
           K  +DVP+QNDTAV++LE+M+S LE+++     NS+FGT LL++GEENIG FLYLEGIEY
Sbjct: 478 KNDIDVPHQNDTAVSVLEKMASTLEELHASTTSNSAFGTKLLEEGEENIGHFLYLEGIEY 537

Query: 541 LMWNTYDVHFYSSFALIMLFPKIQLSIQRDFAAAVMMHDPSKMKLLDDGQWVSRKVLGAV 600
            MWNTYDVHFY+SFAL+MLFPK++LSIQRDFAAAVM+HDP+K+K L +GQWV RKVLGAV
Sbjct: 538 RMWNTYDVHFYASFALVMLFPKLELSIQRDFAAAVMLHDPTKVKTLSEGQWVQRKVLGAV 597

Query: 601 PHDIGICDPWFEVNAYCLYDTARWKDLNPKFVLQVYRDVVATGDKKFAKAVWPSVYVAMA 660
           PHD+GI DPWFEVN Y L++T RWKDLNPKFVLQVYRDVVATGDKKFA AVWPSVYVAMA
Sbjct: 598 PHDLGINDPWFEVNGYTLHNTDRWKDLNPKFVLQVYRDVVATGDKKFASAVWPSVYVAMA 657

Query: 661 YMDQFDRDGDGMIENDGFPDQTYDTWSVSGISAYSGGLWVAALQAASALAREVGDRGSED 720
           YM QFD+DGDGMIEN+GFPDQTYDTWS SG+SAY GGLWVAALQAASALAR VGD+ S+D
Sbjct: 658 YMAQFDKDGDGMIENEGFPDQTYDTWSASGVSAYCGGLWVAALQAASALARVVGDKNSQD 717

Query: 721 YFLFKFQKAKVVYE-KLWNGSYFNYDNSGSSQSSSIQADQLAGQWYARACGLLPIVDEDK 779
           YF  KFQKAKVVYE KLWNGSYFNYDNSGS  SS+IQADQLAGQWYARA GLLPIVDEDK
Sbjct: 718 YFWSKFQKAKVVYEKKLWNGSYFNYDNSGSQYSSTIQADQLAGQWYARASGLLPIVDEDK 777

Query: 780 ARSALKKVYNYNVLKVMGGKRGAVNGMLPDGRVDMSSMQSREIWSGVTYAVAASMIHEDL 839
           AR+AL+KVYNYNV+K+  GKRGAVNGM P+G+VD +SMQSREIWSGVTYA++A+MI E L
Sbjct: 778 ARTALEKVYNYNVMKIKDGKRGAVNGMHPNGKVDTASMQSREIWSGVTYALSATMIQEGL 837

Query: 840 ADIGFQTACGIYEAAWSGTGLGYAFQTPEAWNTDDQYRSLCYMRPLAIWAMQWALTRPKP 899
            ++ FQTA GIYEAAWS TGLGY+FQTPE+WNT D+YRSL YMRPLAIWAMQWALT+   
Sbjct: 838 VEMAFQTASGIYEAAWSETGLGYSFQTPESWNTVDEYRSLTYMRPLAIWAMQWALTKTSQ 897

Query: 900 KTLE-----KQMKPEVTDESLLRYHAGFSKVARLLKLPEEQGAKSLLQSLFDHTCRRMF 953
           K  +     +Q +PE+   S +++  GFS+V+RLL LP E  AKS LQ+LFD+TCRRM 
Sbjct: 898 KQEQLGLEPEQQEPELEPSSSMKHDIGFSRVSRLLSLPNEASAKSTLQTLFDYTCRRMM 956


>gi|224120058|ref|XP_002331126.1| predicted protein [Populus trichocarpa]
 gi|222872854|gb|EEF09985.1| predicted protein [Populus trichocarpa]
          Length = 948

 Score = 1417 bits (3667), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 700/953 (73%), Positives = 800/953 (83%), Gaps = 9/953 (0%)

Query: 1   MFEGKILENGLHEEEKEPLNSSFDKVDPAKPASLTWQRKLSTGEIPLSQFTLNWKETVQL 60
           MFE KI  NG  EE+ EP N   DKVD  KPA LTWQRKL   E  LSQFTL+ +E + +
Sbjct: 1   MFERKISGNGFDEEQGEPSNYPVDKVDAGKPAPLTWQRKLDGDETVLSQFTLSLQEKLLM 60

Query: 61  APIGVRILCLIREEAAKGKRAF-IDPFIKRHLTSSHGVPLGGVGSGSIGRSYRGEFQRWQ 119
           APIG+R+   IREE A+ +  F +DPF KR++TS  G+P+GG+GSGSIGRSY+GEFQRWQ
Sbjct: 61  APIGIRLWRHIREENARKRGGFYMDPFAKRNVTSCLGIPVGGIGSGSIGRSYKGEFQRWQ 120

Query: 120 IFPRVCEDKPVLANQFSVFVSRSNGQKYSSVLCPKTPEVLKDTTAAGIGSWDWNLKGDKS 179
           +FPR+CE+KPVLANQFS+FVSRSNG+KYSSVLC ++P+VL++   +GI SWDWNLKG+ S
Sbjct: 121 LFPRICEEKPVLANQFSIFVSRSNGKKYSSVLCSRSPDVLEEAAGSGIESWDWNLKGNNS 180

Query: 180 TYHALYPRAWTVHEGEPDPELRIVCRQISPIIPHNYKESSYPVSVFTYTIYNSGKTSADI 239
           TYHALYPRAWTV+EGEPDPELR+VCRQISPIIPHNYKESS+P SVFT+ +YNSGKTSAD+
Sbjct: 181 TYHALYPRAWTVYEGEPDPELRVVCRQISPIIPHNYKESSFPASVFTFKLYNSGKTSADV 240

Query: 240 TLLFTWTNSVGGDSEFTGQHYNSKTKMNDGVHAVLLHHRTSHQLPPVTFALAAQETDGVH 299
           TLLFTW NSVGG SEF+GQH NS   M DGVH  LL+H+T++ LP V+FA+AAQET  VH
Sbjct: 241 TLLFTWANSVGGVSEFSGQHLNSTKMMEDGVHCALLNHKTANGLPSVSFAIAAQETPVVH 300

Query: 300 VSLCPHFVISGNSLGLTAKDMWHEIKEHGSFDRLNSMETSVTSEPGSSIGAAIAASVTVP 359
           VS CP FVISGNS G+TAK+MW+E+KEHGSFD LNS    V SEPGSSIGAA+AA+ TVP
Sbjct: 301 VSKCPCFVISGNSQGVTAKEMWNEVKEHGSFDNLNSPGKLVPSEPGSSIGAAVAATSTVP 360

Query: 360 PDSEGQVTFSLAWDCPEVNFMSGKTYYRRYTKFYGTHQNAAANIARDAILEHGSWELQIE 419
           PD    VTFSLAWDCP+V F SG+TY+RRYTKFYGTH +AAANIA DAILEHG WE +IE
Sbjct: 361 PDGVCTVTFSLAWDCPDVKFGSGRTYHRRYTKFYGTHGDAAANIAHDAILEHGLWESEIE 420

Query: 420 AWQRPILEDKRLPEWYPITLFNELYYLNAGGAVWTDGSPPVHSLVTIGHRKFSLDWSQSD 479
           AWQRPILEDKRLPEWY  TLFNELYYLN+GG VWTDGSPP  SL TI   KFSLD + S+
Sbjct: 421 AWQRPILEDKRLPEWYAGTLFNELYYLNSGGTVWTDGSPPFRSLATIEGSKFSLDRAGSN 480

Query: 480 LKRIVDVPNQNDTAVNILERMSSILEQIYTPVALNSSFGTNLLQDGEENIGQFLYLEGIE 539
           L       +Q DTAV+IL RM+S LE+I+TP+  NS+FGTNLLQ+GEENIGQFLYLEGIE
Sbjct: 481 LG------HQGDTAVDILGRMTSALEEIHTPLTTNSAFGTNLLQEGEENIGQFLYLEGIE 534

Query: 540 YLMWNTYDVHFYSSFALIMLFPKIQLSIQRDFAAAVMMHDPSKMKLLDDGQWVSRKVLGA 599
           Y MWNTYDVHFY+SFA IMLFPK+QLSIQRDFAAAVMMHDPS M LL DGQ V RKV+GA
Sbjct: 535 YHMWNTYDVHFYASFAFIMLFPKLQLSIQRDFAAAVMMHDPSMMPLLHDGQRVPRKVIGA 594

Query: 600 VPHDIGICDPWFEVNAYCLYDTARWKDLNPKFVLQVYRDVVATGDKKFAKAVWPSVYVAM 659
           VPHDIGI DPWFEVNAY L++T RWKDLNPKFVLQVYRDV+ATGDKKFA+AVWPSVYVAM
Sbjct: 595 VPHDIGIHDPWFEVNAYNLHNTDRWKDLNPKFVLQVYRDVIATGDKKFARAVWPSVYVAM 654

Query: 660 AYMDQFDRDGDGMIENDGFPDQTYDTWSVSGISAYSGGLWVAALQAASALAREVGDRGSE 719
           AYMDQFDRDGDGMIENDGFPDQTYDTWS+SG+SAY GGLWVAALQAASALA EVGD+ S 
Sbjct: 655 AYMDQFDRDGDGMIENDGFPDQTYDTWSLSGVSAYCGGLWVAALQAASALAWEVGDKDSA 714

Query: 720 DYFLFKFQKAKVVYEKLWNGSYFNYDNSGSSQSSSIQADQLAGQWYARACGLLPIVDEDK 779
           +YF  +FQKAK+VY KLWNGSYFNYDNSG   SSSIQADQLAGQWYARACGL PIVDEDK
Sbjct: 715 EYFWCRFQKAKIVYGKLWNGSYFNYDNSGGINSSSIQADQLAGQWYARACGLSPIVDEDK 774

Query: 780 ARSALKKVYNYNVLKVMGGKRGAVNGMLPDGRVDMSSMQSREIWSGVTYAVAASMIHEDL 839
           AR AL+K+YNYNVLKV  GKRGAVNGMLPDG VD+S +QSREIWSGVTYAVAA+MI E L
Sbjct: 775 ARCALEKIYNYNVLKVQDGKRGAVNGMLPDGTVDLSCLQSREIWSGVTYAVAATMIQEGL 834

Query: 840 ADIGFQTACGIYEAAWSGTGLGYAFQTPEAWNTDDQYRSLCYMRPLAIWAMQWALTRPKP 899
            D+ F TA G+YEA W+  GLGY+FQTPE WNT DQYRSLCYMRPLAIWAMQWAL+R  P
Sbjct: 835 TDMAFHTASGVYEAVWAEQGLGYSFQTPEGWNTTDQYRSLCYMRPLAIWAMQWALSR--P 892

Query: 900 KTLEKQMKPEVTDESLLRYHAGFSKVARLLKLPEEQGAKSLLQSLFDHTCRRM 952
           +   ++MK +  ++S+  +HAGF+KVAR LKLP  + +KS  QSLF++   + 
Sbjct: 893 ELHRQEMKLQAEEDSVPVHHAGFAKVARFLKLPHAESSKSHFQSLFEYATTKF 945


>gi|222423179|dbj|BAH19567.1| AT5G49900 [Arabidopsis thaliana]
          Length = 957

 Score = 1416 bits (3665), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 685/959 (71%), Positives = 801/959 (83%), Gaps = 9/959 (0%)

Query: 1   MFEGKILENGLHEEEKEPLNSSFDKVDPAKPASLTWQRKLSTGEIPLSQFTLNWKETVQL 60
           MFE KI++ G   E+ +PLNSS  KVDPA PASLTWQRK+ +      +F L+ KE  QL
Sbjct: 1   MFEEKIMDIG---EDVKPLNSSDTKVDPAVPASLTWQRKIDSDVKAPREFNLSVKEIFQL 57

Query: 61  APIGVRILCLIREEAAKGKRAFIDPFIKRHLTSSHGVPLGGVGSGSIGRSYRGEFQRWQI 120
           AP+G+R+  L REEAAKG+ AFIDPF K  +TSSHGVPLGG+G+GSIGRS++GEFQRWQ+
Sbjct: 58  APVGIRLWFLCREEAAKGRLAFIDPFSKHSVTSSHGVPLGGIGAGSIGRSFKGEFQRWQL 117

Query: 121 FPRVCEDKPVLANQFSVFVSRSNGQKYSSVLCPKTPEVLKDTTAAGIGSWDWNLKGDKST 180
           FP  CED+PVLANQFS FVSR+NG+KYSSVLCP+ P++ K  + +GIGSWDWNLKGDKST
Sbjct: 118 FPPKCEDEPVLANQFSAFVSRANGKKYSSVLCPRNPKLDKQDSESGIGSWDWNLKGDKST 177

Query: 181 YHALYPRAWTVHEGEPDPELRIVCRQISPIIPHNYKESSYPVSVFTYTIYNSGKTSADIT 240
           YHALYPR+WT++EGEPDPELRIVCRQ+SP IPHNYKESS+PVSVFT+T++N G T+AD+T
Sbjct: 178 YHALYPRSWTMYEGEPDPELRIVCRQVSPFIPHNYKESSFPVSVFTFTLHNLGNTTADVT 237

Query: 241 LLFTWTNSVGGDSEFTGQHYNSKTKMNDGVHAVLLHHRTSHQLPPVTFALAAQETDGVHV 300
           LLFTW NSVGGDSEF+G HYNSK  MNDGV  VLLHH+T++ LP +++A++AQ TDGV V
Sbjct: 238 LLFTWANSVGGDSEFSGGHYNSKITMNDGVQGVLLHHKTANGLPSLSYAISAQATDGVSV 297

Query: 301 SLCPHFVISGNSLGLTAKDMWHEIKEHGSFDRLNSMETSVTSEPGSSIGAAIAASVTVPP 360
           S CP F++SG   G+TAKDMW  +KE+GSFD L + E S+ S+ GSSIGAA+AASVTV P
Sbjct: 298 SACPFFIVSGKQDGITAKDMWQAVKENGSFDHLKASEASMQSDHGSSIGAAVAASVTVLP 357

Query: 361 DSEGQVTFSLAWDCPEVNFMSGKTYYRRYTKFYGTHQNAAANIARDAILEHGSWELQIEA 420
                VTFSLAWDCPEV F SGK Y RRYTKFYG + +AAA IA DAIL H  W   IE 
Sbjct: 358 GESRIVTFSLAWDCPEVQFPSGKIYSRRYTKFYGNNGDAAAQIAHDAILGHSQWGSWIED 417

Query: 421 WQRPILEDKRLPEWYPITLFNELYYLNAGGAVWTDGSPPVHSLVTIGHRKFSLDWSQSDL 480
           WQRPILEDKRLP WYP+TLFNELYYLN+GG +WTDGS PVHSL  +  +KFSLD SQ  L
Sbjct: 418 WQRPILEDKRLPAWYPVTLFNELYYLNSGGTLWTDGSSPVHSLAGVREKKFSLDKSQLGL 477

Query: 481 KRIVDVPNQNDTAVNILERMSSILEQIYTPVALNSSFGTNLLQDGEENIGQFLYLEGIEY 540
           K  +DVP+QNDTAV++LE+M+S LE ++     NS+FGT LL++GEENIG FLYLEGIEY
Sbjct: 478 KNDIDVPHQNDTAVSVLEKMASTLEGLHASTTSNSAFGTKLLEEGEENIGHFLYLEGIEY 537

Query: 541 LMWNTYDVHFYSSFALIMLFPKIQLSIQRDFAAAVMMHDPSKMKLLDDGQWVSRKVLGAV 600
            MWNTYDVHFY+SFAL+MLFPK++LSIQRDFAAAVM+HDP+K+K L +GQWV RKVLGAV
Sbjct: 538 RMWNTYDVHFYASFALVMLFPKLELSIQRDFAAAVMLHDPTKVKTLSEGQWVQRKVLGAV 597

Query: 601 PHDIGICDPWFEVNAYCLYDTARWKDLNPKFVLQVYRDVVATGDKKFAKAVWPSVYVAMA 660
           PHD+GI DPWFEVN Y L++T RWKDLNPKFVLQVYRDVVATGDKKFA AVWPSVYVAMA
Sbjct: 598 PHDLGINDPWFEVNGYTLHNTDRWKDLNPKFVLQVYRDVVATGDKKFASAVWPSVYVAMA 657

Query: 661 YMDQFDRDGDGMIENDGFPDQTYDTWSVSGISAYSGGLWVAALQAASALAREVGDRGSED 720
           YM QFD+DGDGMIEN+GFPDQTYDTWS SG+SAY GGLWVAALQAASALAR VGD+ S+D
Sbjct: 658 YMAQFDKDGDGMIENEGFPDQTYDTWSASGVSAYCGGLWVAALQAASALARVVGDKNSQD 717

Query: 721 YFLFKFQKAKVVYE-KLWNGSYFNYDNSGSSQSSSIQADQLAGQWYARACGLLPIVDEDK 779
           YF  KFQKAKVVYE KLWNGSYFNYDNSGS  SS+IQADQLAGQWYARA GLLPIVDEDK
Sbjct: 718 YFWSKFQKAKVVYEKKLWNGSYFNYDNSGSQYSSTIQADQLAGQWYARASGLLPIVDEDK 777

Query: 780 ARSALKKVYNYNVLKVMGGKRGAVNGMLPDGRVDMSSMQSREIWSGVTYAVAASMIHEDL 839
           AR+AL+KVYNYNV+K+  GKRGAVNGM P+G+VD +SMQSREIWSGVTYA++A+MI E L
Sbjct: 778 ARTALEKVYNYNVMKIKDGKRGAVNGMHPNGKVDTASMQSREIWSGVTYALSATMIQEGL 837

Query: 840 ADIGFQTACGIYEAAWSGTGLGYAFQTPEAWNTDDQYRSLCYMRPLAIWAMQWALTRPKP 899
            ++ FQTA GIYEAAWS TGLGY+FQTPE+WNT D+YRSL YMRPLAIWAMQWALT+   
Sbjct: 838 VEMAFQTASGIYEAAWSETGLGYSFQTPESWNTVDEYRSLTYMRPLAIWAMQWALTKTSQ 897

Query: 900 KTLE-----KQMKPEVTDESLLRYHAGFSKVARLLKLPEEQGAKSLLQSLFDHTCRRMF 953
           K  +     +Q +PE+   S +++  GFS+V+RLL LP E  AKS LQ+LFD+TCRRM 
Sbjct: 898 KQEQLGLEPEQQEPELEPSSSMKHDIGFSRVSRLLSLPNEASAKSTLQTLFDYTCRRMM 956


>gi|225460125|ref|XP_002275782.1| PREDICTED: non-lysosomal glucosylceramidase-like [Vitis vinifera]
          Length = 969

 Score = 1381 bits (3574), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 684/951 (71%), Positives = 797/951 (83%), Gaps = 7/951 (0%)

Query: 7   LENGLHEEE---KEPLNSSFDKVDPAKPASLTWQRKLSTGEIPLSQFTLNWKETVQLAPI 63
           +ENG H+E+   + P+ SS  KV+P KPASLTWQRKL+T    L++F L  +E   LAP+
Sbjct: 17  MENG-HKEDGDMEHPVKSSAHKVNPGKPASLTWQRKLNTKANTLTRFNLKLREIKHLAPL 75

Query: 64  GVRILCLIREEAAKGKRAFIDPFIKRHLTSSHGVPLGGVGSGSIGRSYRGEFQRWQIFPR 123
           GVR+   +  EAAKG+ + IDPF KR +TS HGVPLGG+G GSIGRSYRGEFQR+Q+FPR
Sbjct: 76  GVRLWHHVNAEAAKGRISIIDPFSKRLVTSYHGVPLGGIGGGSIGRSYRGEFQRYQLFPR 135

Query: 124 VCEDKPVLANQFSVFVSRSNGQKYSSVLCPKTPEVLKDTTAAGIGSWDWNLKGDKSTYHA 183
           +CED PVLANQFSVFVSR NG+K S+VLCP+ PEVLK + ++GIGSWDWNL G+  TYHA
Sbjct: 136 ICEDSPVLANQFSVFVSRPNGKKSSTVLCPRNPEVLKGSASSGIGSWDWNLDGESCTYHA 195

Query: 184 LYPRAWTVHEGEPDPELRIVCRQISPIIPHNYKESSYPVSVFTYTIYNSGKTSADITLLF 243
           LYPRAWTV+EGEPDPE+ I+  QISP IPHNYKESS+PVSVF +T+ NSGKTSADITLLF
Sbjct: 196 LYPRAWTVYEGEPDPEISIISSQISPFIPHNYKESSFPVSVFKFTLSNSGKTSADITLLF 255

Query: 244 TWTNSVGGDSEFTGQHYNSKTKMNDGVHAVLLHHRTSHQLPPVTFALAAQETDGVHVSLC 303
           TW NSVGG SEF+G HYNSK K  DGVH VLLHH+T++  PPVTFA+AA+ET  VH+S C
Sbjct: 256 TWANSVGGTSEFSGHHYNSKMKTKDGVHGVLLHHKTANGHPPVTFAIAAEETGDVHISEC 315

Query: 304 PHFVISGNSLGLTAKDMWHEIKEHGSFDRLNSMETSVTSEPGSSIGAAIAASVTVPPDSE 363
           P F+ISGNSLG+TAK+MW EIK+HGSFD L+   +S+ SEPGSSIGAA+AAS+T+PPD+ 
Sbjct: 316 PCFLISGNSLGVTAKEMWQEIKQHGSFDHLDFDGSSMRSEPGSSIGAAVAASLTLPPDTV 375

Query: 364 GQVTFSLAWDCPEVNFMSGKTYYRRYTKFYGTHQNAAANIARDAILEHGSWELQIEAWQR 423
             VTFSLAW CPEV F SGKTY+RRYT+FYGTH +AA  IA DAILEH +W  +IEAWQ 
Sbjct: 376 RTVTFSLAWACPEVRFTSGKTYHRRYTRFYGTHVDAAEEIAHDAILEHANWVSEIEAWQG 435

Query: 424 PILEDKRLPEWYPITLFNELYYLNAGGAVWTDGSPPVHSLVTIGHRKFSLDWSQSDLKRI 483
           PILED+RLPEWY ITLFNELY+LNAGG +WTDG PP+ SL TI   KFSLD S SD K  
Sbjct: 436 PILEDRRLPEWYRITLFNELYFLNAGGTIWTDGLPPMQSLATIEQIKFSLDRSISDPKNT 495

Query: 484 VDVPNQNDTAVNILERMSSILEQIYTPVALNSSFGTNLLQDGEENIGQFLYLEGIEYLMW 543
            D+ +QND+ V IL RM+S+LEQI+ P   NS+FGT LLQ GEEN+GQFLYLEGIEY MW
Sbjct: 496 TDIVHQNDSTVEILGRMTSMLEQIHNPTTSNSAFGTYLLQSGEENVGQFLYLEGIEYHMW 555

Query: 544 NTYDVHFYSSFALIMLFPKIQLSIQRDFAAAVMMHDPSKMKLLDDGQWVSRKVLGAVPHD 603
           NTYDVHFYSSFA+IMLFP+++LSIQRDFAAAVM+HDPS+MK++ DG+WV RKVLGAVPHD
Sbjct: 556 NTYDVHFYSSFAIIMLFPQLELSIQRDFAAAVMVHDPSRMKIMSDGKWVPRKVLGAVPHD 615

Query: 604 IGICDPWFEVNAYCLYDTARWKDLNPKFVLQVYRDVVATGDKKFAKAVWPSVYVAMAYMD 663
           IGI DPWFE+NAY LYDT RWKDLN KFVLQVYRD+VATGDK FA+AVWP+VY+A+A++D
Sbjct: 616 IGISDPWFELNAYNLYDTDRWKDLNSKFVLQVYRDMVATGDKNFARAVWPAVYIAIAFLD 675

Query: 664 QFDRDGDGMIENDGFPDQTYDTWSVSGISAYSGGLWVAALQAASALAREVGDRGSEDYFL 723
           QFD+DGDGMIENDGFPDQTYD WSV+G+SAY GGLWVAALQAASA+AREVGD  + DYF 
Sbjct: 676 QFDKDGDGMIENDGFPDQTYDAWSVTGVSAYCGGLWVAALQAASAMAREVGDSMTADYFW 735

Query: 724 FKFQKAKVVYEKLWNGSYFNYDNSGSSQSSSIQADQLAGQWYARACGLLPIVDEDKARSA 783
           FKFQKAK VY+KLWNGSYFNYDNSG S SSSIQADQLAGQWYARACGL PIVD++KARSA
Sbjct: 736 FKFQKAKAVYDKLWNGSYFNYDNSGGSSSSSIQADQLAGQWYARACGLQPIVDDEKARSA 795

Query: 784 LKKVYNYNVLKVMGGKRGAVNGMLPDGRVDMSSMQSREIWSGVTYAVAASMIHEDLADIG 843
           L+KVYN+NVLKV  GK GAVNGMLPDGRVDMS+MQSREIW+GVTY+VAA+MIHE + +  
Sbjct: 796 LEKVYNFNVLKVKEGKCGAVNGMLPDGRVDMSAMQSREIWAGVTYSVAANMIHEGMVETA 855

Query: 844 FQTACGIYEAAWSGTGLGYAFQTPEAWNTDDQYRSLCYMRPLAIWAMQWALTRPKPKTLE 903
           F TA GIY+AAWS  GLGY+FQTPEAWNTD++YRSLCYMRPLAIWAMQWAL+  KP+   
Sbjct: 856 FNTASGIYDAAWSQEGLGYSFQTPEAWNTDEEYRSLCYMRPLAIWAMQWALS--KPELHN 913

Query: 904 KQMKPEVTDESL-LRYHAGFSKVARLLKLPEEQGAKSLLQSLFDHTCRRMF 953
             MK E    +L   +H GF KVA LLKLPEE+ +KS LQ  FD TCRR++
Sbjct: 914 HDMKHEEGKGTLNFEHHVGFEKVAHLLKLPEEEASKSFLQLFFDLTCRRLY 964


>gi|356520149|ref|XP_003528727.1| PREDICTED: LOW QUALITY PROTEIN: non-lysosomal
           glucosylceramidase-like [Glycine max]
          Length = 944

 Score = 1380 bits (3573), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 673/953 (70%), Positives = 798/953 (83%), Gaps = 13/953 (1%)

Query: 6   ILENGLHEEEKEPLNSSFDK---VDPAKPASLTWQRKLSTGEIPLSQFTLNWKETVQLAP 62
           ++ENG  +++  P   S      V+P KPA LTWQRKL+      SQ +L+ KE + LAP
Sbjct: 1   MVENGFVQDDITPPPKSCSNKKMVEPGKPAGLTWQRKLNNNGNASSQISLSLKEIIHLAP 60

Query: 63  IGVRILCLIREEAAKGKRAFIDPFIKRHLTSSHGVPLGGVGSGSIGRSYRGEFQRWQIFP 122
           IG R+    REEAAKG+   IDPF KRH+T  HGVPLGG+G+GSIGRS+RGEFQRWQ+FP
Sbjct: 61  IGYRLWRHCREEAAKGRGGMIDPFAKRHVTFCHGVPLGGIGAGSIGRSFRGEFQRWQLFP 120

Query: 123 RVCEDKPVLANQFSVFVSRSNGQKYSSVLCPKTPEVLKDTTAAGIGSWDWNLKGDKSTYH 182
            +CE+KPVLANQFSVFVSR +G+KYSSVLCP+ P ++K    +GI SWDWN+ G+ STYH
Sbjct: 121 VICEEKPVLANQFSVFVSRPSGEKYSSVLCPRKPVIIKQNPVSGIESWDWNINGNSSTYH 180

Query: 183 ALYPRAWTVHEGEPDPELRIVCRQISPIIPHNYKESSYPVSVFTYTIYNSGKTSADITLL 242
           ALYPRAWTV+E EPDP LRIVCRQISP+IPHNYKESS+PV+VFT+T+ N G T+AD+TLL
Sbjct: 181 ALYPRAWTVYE-EPDPALRIVCRQISPVIPHNYKESSFPVTVFTFTLKNLGNTTADVTLL 239

Query: 243 FTWTNSVGGDSEFTGQHYNSKTKMNDGVHAVLLHHRTSHQLPPVTFALAAQETDGVHVSL 302
           FTWTNSVGG SEFTG H+NSKT  NDGVHAVLLHH+T+++  PVTFA+AA+ET+ VH+S 
Sbjct: 240 FTWTNSVGGISEFTGDHFNSKTTRNDGVHAVLLHHKTANERSPVTFAIAAEETEDVHISE 299

Query: 303 CPHFVISGNSLGLTAKDMWHEIKEHGSFDRLNSMETSVTSEPGSSIGAAIAASVTVPPDS 362
           CP FVISG    ++AK+MW+E+K+HGSFD LN  ETS  SEPGSSIGAAIAA+VT+P ++
Sbjct: 300 CPVFVISGAYSSISAKNMWNEVKQHGSFDHLNFAETSAPSEPGSSIGAAIAATVTIPSNA 359

Query: 363 EGQVTFSLAWDCPEVNFMSGKTYYRRYTKFYGTHQNAAANIARDAILEHGSWELQIEAWQ 422
           +  VTFSLAWDCPEV F  GKTYYRRYTKFYGT+ +AAA+IA DAI+EH  WE QI+ WQ
Sbjct: 360 QRIVTFSLAWDCPEVKFPEGKTYYRRYTKFYGTNGDAAADIAHDAIMEHCQWEAQIDDWQ 419

Query: 423 RPILEDKRLPEWYPITLFNELYYLNAGGAVWTDGSPPVHSLVTIGHRKFSLDWSQSDLKR 482
           RPILEDKR PEWYP TL NELYYLN+GG +WTDGS PVH LV  G RKFSLD   ++L  
Sbjct: 420 RPILEDKRFPEWYPTTLLNELYYLNSGGTIWTDGSLPVHGLVNTGERKFSLDGFNNNLS- 478

Query: 483 IVDVPNQNDTAVNILERMSSILEQIYTPVALNSSFGTNLLQDGEENIGQFLYLEGIEYLM 542
                ++ND A+NILE  +S++EQ ++P A  S++G NLLQ+GEENIGQFLYLEGIEY M
Sbjct: 479 -----HKNDIAINILEMFNSVVEQTHSPPASKSAYGVNLLQEGEENIGQFLYLEGIEYQM 533

Query: 543 WNTYDVHFYSSFALIMLFPKIQLSIQRDFAAAVMMHDPSKMKLLDDGQWVSRKVLGAVPH 602
           WNTYDVHFYSSF+L+MLFPK++LSIQRDFAAAV+MHDPSKMKLL DGQW  RKVLGAVPH
Sbjct: 534 WNTYDVHFYSSFSLVMLFPKLELSIQRDFAAAVLMHDPSKMKLLYDGQWAPRKVLGAVPH 593

Query: 603 DIGICDPWFEVNAYCLYDTARWKDLNPKFVLQVYRDVVATGDKKFAKAVWPSVYVAMAYM 662
           DIG+ DPWFEVN Y LY+T RWKDLNPKFVLQ+YRDVVATGDKKFA+AVWP+VY+A+AYM
Sbjct: 594 DIGLNDPWFEVNGYNLYNTDRWKDLNPKFVLQIYRDVVATGDKKFAQAVWPAVYIAIAYM 653

Query: 663 DQFDRDGDGMIENDGFPDQTYDTWSVSGISAYSGGLWVAALQAASALAREVGDRGSEDYF 722
           DQFD+DGDGMIEN+GFPDQTYDTWSVSG+SAYSGGLWVAALQAASALAREVGD+GSEDYF
Sbjct: 654 DQFDKDGDGMIENEGFPDQTYDTWSVSGVSAYSGGLWVAALQAASALAREVGDKGSEDYF 713

Query: 723 LFKFQKAKVVYEKLWNGSYFNYDNSGSSQSSSIQADQLAGQWYARACGLLPIVDEDKARS 782
             KFQKAK VYE LWNGSYFNYD+SG S S SIQADQLAGQWYARACGLLPIV+E K+RS
Sbjct: 714 WLKFQKAKSVYENLWNGSYFNYDSSGGSSSLSIQADQLAGQWYARACGLLPIVEEKKSRS 773

Query: 783 ALKKVYNYNVLKVMGGKRGAVNGMLPDGRVDMSSMQSREIWSGVTYAVAASMIHEDLADI 842
           AL+ VYN+NV+KV  G+RGAVNGMLPDG+VDMS+MQSREIWSGVTYA+AA+MI E++ D+
Sbjct: 774 ALQIVYNHNVMKVHXGRRGAVNGMLPDGKVDMSTMQSREIWSGVTYALAATMIQENMIDM 833

Query: 843 GFQTACGIYEAAWSGTGLGYAFQTPEAWNTDDQYRSLCYMRPLAIWAMQWALTRPKPKTL 902
            FQTA G+YE AWS  GLGY+FQTPEAW T D+YRSLCYMRPLAIWAMQW L+R   K +
Sbjct: 834 AFQTAGGVYETAWSNNGLGYSFQTPEAWTTKDEYRSLCYMRPLAIWAMQWELSR--AKHI 891

Query: 903 EKQMKPEVTDESLL-RYHAGFSKVARLLKLPEEQGAKSLLQSLFDHTCRRMFI 954
           + + K ++ ++ +L RYH GFSKVARLLK+ EE  ++SL Q ++D TC+RM++
Sbjct: 892 QHESKSDMKEKDMLSRYHDGFSKVARLLKVKEETDSRSLFQVIYDFTCKRMWV 944


>gi|297741007|emb|CBI31319.3| unnamed protein product [Vitis vinifera]
          Length = 953

 Score = 1380 bits (3571), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 684/951 (71%), Positives = 797/951 (83%), Gaps = 7/951 (0%)

Query: 7   LENGLHEEE---KEPLNSSFDKVDPAKPASLTWQRKLSTGEIPLSQFTLNWKETVQLAPI 63
           +ENG H+E+   + P+ SS  KV+P KPASLTWQRKL+T    L++F L  +E   LAP+
Sbjct: 1   MENG-HKEDGDMEHPVKSSAHKVNPGKPASLTWQRKLNTKANTLTRFNLKLREIKHLAPL 59

Query: 64  GVRILCLIREEAAKGKRAFIDPFIKRHLTSSHGVPLGGVGSGSIGRSYRGEFQRWQIFPR 123
           GVR+   +  EAAKG+ + IDPF KR +TS HGVPLGG+G GSIGRSYRGEFQR+Q+FPR
Sbjct: 60  GVRLWHHVNAEAAKGRISIIDPFSKRLVTSYHGVPLGGIGGGSIGRSYRGEFQRYQLFPR 119

Query: 124 VCEDKPVLANQFSVFVSRSNGQKYSSVLCPKTPEVLKDTTAAGIGSWDWNLKGDKSTYHA 183
           +CED PVLANQFSVFVSR NG+K S+VLCP+ PEVLK + ++GIGSWDWNL G+  TYHA
Sbjct: 120 ICEDSPVLANQFSVFVSRPNGKKSSTVLCPRNPEVLKGSASSGIGSWDWNLDGESCTYHA 179

Query: 184 LYPRAWTVHEGEPDPELRIVCRQISPIIPHNYKESSYPVSVFTYTIYNSGKTSADITLLF 243
           LYPRAWTV+EGEPDPE+ I+  QISP IPHNYKESS+PVSVF +T+ NSGKTSADITLLF
Sbjct: 180 LYPRAWTVYEGEPDPEISIISSQISPFIPHNYKESSFPVSVFKFTLSNSGKTSADITLLF 239

Query: 244 TWTNSVGGDSEFTGQHYNSKTKMNDGVHAVLLHHRTSHQLPPVTFALAAQETDGVHVSLC 303
           TW NSVGG SEF+G HYNSK K  DGVH VLLHH+T++  PPVTFA+AA+ET  VH+S C
Sbjct: 240 TWANSVGGTSEFSGHHYNSKMKTKDGVHGVLLHHKTANGHPPVTFAIAAEETGDVHISEC 299

Query: 304 PHFVISGNSLGLTAKDMWHEIKEHGSFDRLNSMETSVTSEPGSSIGAAIAASVTVPPDSE 363
           P F+ISGNSLG+TAK+MW EIK+HGSFD L+   +S+ SEPGSSIGAA+AAS+T+PPD+ 
Sbjct: 300 PCFLISGNSLGVTAKEMWQEIKQHGSFDHLDFDGSSMRSEPGSSIGAAVAASLTLPPDTV 359

Query: 364 GQVTFSLAWDCPEVNFMSGKTYYRRYTKFYGTHQNAAANIARDAILEHGSWELQIEAWQR 423
             VTFSLAW CPEV F SGKTY+RRYT+FYGTH +AA  IA DAILEH +W  +IEAWQ 
Sbjct: 360 RTVTFSLAWACPEVRFTSGKTYHRRYTRFYGTHVDAAEEIAHDAILEHANWVSEIEAWQG 419

Query: 424 PILEDKRLPEWYPITLFNELYYLNAGGAVWTDGSPPVHSLVTIGHRKFSLDWSQSDLKRI 483
           PILED+RLPEWY ITLFNELY+LNAGG +WTDG PP+ SL TI   KFSLD S SD K  
Sbjct: 420 PILEDRRLPEWYRITLFNELYFLNAGGTIWTDGLPPMQSLATIEQIKFSLDRSISDPKNT 479

Query: 484 VDVPNQNDTAVNILERMSSILEQIYTPVALNSSFGTNLLQDGEENIGQFLYLEGIEYLMW 543
            D+ +QND+ V IL RM+S+LEQI+ P   NS+FGT LLQ GEEN+GQFLYLEGIEY MW
Sbjct: 480 TDIVHQNDSTVEILGRMTSMLEQIHNPTTSNSAFGTYLLQSGEENVGQFLYLEGIEYHMW 539

Query: 544 NTYDVHFYSSFALIMLFPKIQLSIQRDFAAAVMMHDPSKMKLLDDGQWVSRKVLGAVPHD 603
           NTYDVHFYSSFA+IMLFP+++LSIQRDFAAAVM+HDPS+MK++ DG+WV RKVLGAVPHD
Sbjct: 540 NTYDVHFYSSFAIIMLFPQLELSIQRDFAAAVMVHDPSRMKIMSDGKWVPRKVLGAVPHD 599

Query: 604 IGICDPWFEVNAYCLYDTARWKDLNPKFVLQVYRDVVATGDKKFAKAVWPSVYVAMAYMD 663
           IGI DPWFE+NAY LYDT RWKDLN KFVLQVYRD+VATGDK FA+AVWP+VY+A+A++D
Sbjct: 600 IGISDPWFELNAYNLYDTDRWKDLNSKFVLQVYRDMVATGDKNFARAVWPAVYIAIAFLD 659

Query: 664 QFDRDGDGMIENDGFPDQTYDTWSVSGISAYSGGLWVAALQAASALAREVGDRGSEDYFL 723
           QFD+DGDGMIENDGFPDQTYD WSV+G+SAY GGLWVAALQAASA+AREVGD  + DYF 
Sbjct: 660 QFDKDGDGMIENDGFPDQTYDAWSVTGVSAYCGGLWVAALQAASAMAREVGDSMTADYFW 719

Query: 724 FKFQKAKVVYEKLWNGSYFNYDNSGSSQSSSIQADQLAGQWYARACGLLPIVDEDKARSA 783
           FKFQKAK VY+KLWNGSYFNYDNSG S SSSIQADQLAGQWYARACGL PIVD++KARSA
Sbjct: 720 FKFQKAKAVYDKLWNGSYFNYDNSGGSSSSSIQADQLAGQWYARACGLQPIVDDEKARSA 779

Query: 784 LKKVYNYNVLKVMGGKRGAVNGMLPDGRVDMSSMQSREIWSGVTYAVAASMIHEDLADIG 843
           L+KVYN+NVLKV  GK GAVNGMLPDGRVDMS+MQSREIW+GVTY+VAA+MIHE + +  
Sbjct: 780 LEKVYNFNVLKVKEGKCGAVNGMLPDGRVDMSAMQSREIWAGVTYSVAANMIHEGMVETA 839

Query: 844 FQTACGIYEAAWSGTGLGYAFQTPEAWNTDDQYRSLCYMRPLAIWAMQWALTRPKPKTLE 903
           F TA GIY+AAWS  GLGY+FQTPEAWNTD++YRSLCYMRPLAIWAMQWAL+  KP+   
Sbjct: 840 FNTASGIYDAAWSQEGLGYSFQTPEAWNTDEEYRSLCYMRPLAIWAMQWALS--KPELHN 897

Query: 904 KQMKPEVTDESL-LRYHAGFSKVARLLKLPEEQGAKSLLQSLFDHTCRRMF 953
             MK E    +L   +H GF KVA LLKLPEE+ +KS LQ  FD TCRR++
Sbjct: 898 HDMKHEEGKGTLNFEHHVGFEKVAHLLKLPEEEASKSFLQLFFDLTCRRLY 948


>gi|449463809|ref|XP_004149624.1| PREDICTED: non-lysosomal glucosylceramidase-like [Cucumis sativus]
 gi|449494604|ref|XP_004159595.1| PREDICTED: non-lysosomal glucosylceramidase-like [Cucumis sativus]
          Length = 951

 Score = 1354 bits (3504), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 683/956 (71%), Positives = 800/956 (83%), Gaps = 8/956 (0%)

Query: 1   MFEGKILENGLHEEEKEPLNSSFDK---VDPAKPASLTWQRKLSTGEIPLSQFTLNWKET 57
           M EGK L NGL E++ E +N S  +   +DP +  SLTWQRKL+     L QF L+ KE 
Sbjct: 1   MSEGKKLGNGLVEQD-EDINKSLTEKTGIDPGQTPSLTWQRKLNREGTSLWQFRLHLKEI 59

Query: 58  VQLAPIGVRILCLIREEAAKGKRAFIDPFIKRHLTSSHGVPLGGVGSGSIGRSYRGEFQR 117
           + LAP+G R+   IREE+ KG+ A I+PFI+R +T +H +PLGG+GSGSIGRSYRGEFQR
Sbjct: 60  IHLAPVGYRLFRYIREESVKGRGAMINPFIRRRITDTHAIPLGGLGSGSIGRSYRGEFQR 119

Query: 118 WQIFPRVCEDKPVLANQFSVFVSRSNGQKYSSVLCPKTPEVLKDTTAAGIGSWDWNLKGD 177
           WQ+FPR CEDKP+LANQFSVFVSR N +KYS+VLC ++PE  ++  ++GIGSWDWNLKG 
Sbjct: 120 WQLFPRKCEDKPILANQFSVFVSRPNAEKYSTVLCAQSPETQREVESSGIGSWDWNLKGH 179

Query: 178 KSTYHALYPRAWTVHEGEPDPELRIVCRQISPIIPHNYKESSYPVSVFTYTIYNSGKTSA 237
            STYHALYPRAWT+++GEPDPELRIVCRQISP+IPHNYKESSYPVSVFT+T++NSGKT A
Sbjct: 180 SSTYHALYPRAWTIYDGEPDPELRIVCRQISPVIPHNYKESSYPVSVFTFTLHNSGKTDA 239

Query: 238 DITLLFTWTNSVGGDSEFTGQHYNSKTKMNDGVHAVLLHHRTSHQLPPVTFALAAQETDG 297
           D++LLFTW NSVGG SE++G H NS+TK  DGVH VLLHH+T+   PPVT+A+AAQE +G
Sbjct: 240 DVSLLFTWANSVGGLSEYSGNHINSRTKKKDGVHTVLLHHKTASGFPPVTYAIAAQEGNG 299

Query: 298 VHVSLCPHFVISGNSLGLTAKDMWHEIKEHGSFDRLNSMETSVTSEPGSSIGAAIAASVT 357
           VHVS CP FVISGNS G++AKDMW EIKEHGSFDRL   + S+ SE GSSIGAA++ASVT
Sbjct: 300 VHVSDCPCFVISGNSQGISAKDMWLEIKEHGSFDRLKFADMSMPSEVGSSIGAAVSASVT 359

Query: 358 VPPDSEGQVTFSLAWDCPEVNFMSGKTYYRRYTKFYGTHQNAAANIARDAILEHGSWELQ 417
           V  DS   VTFSL+WDCPEVNF  GKTY+RRYTKFYG   +AAA+IARDAILEH  WE Q
Sbjct: 360 VSSDSVRTVTFSLSWDCPEVNFCRGKTYHRRYTKFYGNLGDAAADIARDAILEHHHWESQ 419

Query: 418 IEAWQRPILEDKRLPEWYPITLFNELYYLNAGGAVWTDGSPPVHSLVTIGHRKFSLDWSQ 477
           I+AWQRP+LEDKR P+WYP+TLFNELYYLNAGG +WTDGS P+ SLV+IG R F LD  +
Sbjct: 420 IDAWQRPVLEDKRFPKWYPVTLFNELYYLNAGGTIWTDGSLPIQSLVSIGERGFCLDQCR 479

Query: 478 SDLKRIVDVPNQNDTAVNILERMSSILEQIYTPVALNSSFGTNLLQDGEENIGQFLYLEG 537
           S    I +  ++NDTA +IL RM+S L+++   VA NS+FG NLLQ GEEN+GQFLYLEG
Sbjct: 480 SGDSVIYE-SHENDTANDILGRMTSRLDELRDSVASNSAFGVNLLQKGEENVGQFLYLEG 538

Query: 538 IEYLMWNTYDVHFYSSFALIMLFPKIQLSIQRDFAAAVMMHDPSKMKLLDDGQWVSRKVL 597
           +EY MWNTYDVHFYSSFA+IMLFPK++LSIQRDFAAAVMMHDPSKM+LLD+G+W +R VL
Sbjct: 539 VEYTMWNTYDVHFYSSFAIIMLFPKLELSIQRDFAAAVMMHDPSKMQLLDNGKWEARNVL 598

Query: 598 GAVPHDIGICDPWFEVNAYCLYDTARWKDLNPKFVLQVYRDVVATGDKKFAKAVWPSVYV 657
           GAVPHDIG+ DPWFEVN Y LY+T RWKDLNPKFVLQ+YRDVVATGD KFAKAVWP VY+
Sbjct: 599 GAVPHDIGVNDPWFEVNGYNLYNTDRWKDLNPKFVLQIYRDVVATGDMKFAKAVWPPVYL 658

Query: 658 AMAYMDQFDRDGDGMIENDGFPDQTYDTWSVSGISAYSGGLWVAALQAASALAREVGDRG 717
           A+AYMDQFDRDGDGMIENDGFPDQTYDTWSVSG+SAYSGGLWVAALQAASALAR   ++ 
Sbjct: 659 AIAYMDQFDRDGDGMIENDGFPDQTYDTWSVSGVSAYSGGLWVAALQAASALARVADEKD 718

Query: 718 SEDYFLFKFQKAKVVYEKLWNGSYFNYDNSGSSQSSSIQADQLAGQWYARACGLLPIVDE 777
           +E YF FKFQKAK  YEKLWNGSYFNYD+SG S SSSIQADQLAGQWYARA GLLPIVDE
Sbjct: 719 AEHYFWFKFQKAKRAYEKLWNGSYFNYDSSGGSSSSSIQADQLAGQWYARASGLLPIVDE 778

Query: 778 DKARSALKKVYNYNVLKVMGGKRGAVNGMLPDGRVDMSSMQSREIWSGVTYAVAASMIHE 837
           DKA+SAL KVYNYNV+KV  GKRGAVNGMLPDG +D SSMQSREIWSGVTYAVAASMIHE
Sbjct: 779 DKAKSALAKVYNYNVMKVKDGKRGAVNGMLPDGTIDFSSMQSREIWSGVTYAVAASMIHE 838

Query: 838 DLADIGFQTACGIYEAAWSGTGLGYAFQTPEAWNTDDQYRSLCYMRPLAIWAMQWALTRP 897
           D+ D+ F+TA GI+EAAWS  GLGY FQTPEAW T D+YRSLCYMRPLAIWAMQWA +  
Sbjct: 839 DMTDMAFRTAEGIHEAAWSEDGLGYNFQTPEAWTTTDRYRSLCYMRPLAIWAMQWAFSE- 897

Query: 898 KPKTLEKQMKPEVTDESLLRYHAGFSKVARLLKLPEEQGAKSLLQSLFDHTCRRMF 953
           K   +E ++K E+  +++ R+HA FSKVAR LKLPE+  + S+LQ+++D+T +R F
Sbjct: 898 KVSVIE-ELK-ELDSDAISRHHARFSKVARYLKLPEDGTSSSVLQTVYDYTLKRFF 951


>gi|224146269|ref|XP_002325943.1| predicted protein [Populus trichocarpa]
 gi|222862818|gb|EEF00325.1| predicted protein [Populus trichocarpa]
          Length = 891

 Score = 1304 bits (3374), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 618/899 (68%), Positives = 748/899 (83%), Gaps = 12/899 (1%)

Query: 60  LAPIGVRILCLIREEAAKGKRAFIDPFIKRH-LTSSHGVPLGGVGSGSIGRSYRGEFQRW 118
           + P+G+R+   I++EAAK +RA I  F K+H LTS HG+PLGG+G+GSIGRSYRGEFQ +
Sbjct: 1   MLPMGLRLWRHIKQEAAK-ERATIFDFSKKHILTSDHGIPLGGIGAGSIGRSYRGEFQHF 59

Query: 119 QIFPRVCEDKPVLANQFSVFVSRSNGQKYSSVLCPKTPEVLKDTTAAGIGSWDWNLKGDK 178
           ++FP +CE+ PVLANQFS FVSR +G+ +SSVLC +TP++ K++T +GI SWDWNL G K
Sbjct: 60  RLFPGICEEGPVLANQFSAFVSRPSGETFSSVLCSRTPDIPKESTGSGIESWDWNLNGQK 119

Query: 179 STYHALYPRAWTVHEGEPDPELRIVCRQISPIIPHNYKESSYPVSVFTYTIYNSGKTSAD 238
            TYHAL+PRAWT+++G PDPEL IV RQISP IPHNYKESS+PVSVFT+T+ N GKTSAD
Sbjct: 120 CTYHALFPRAWTIYDGAPDPELTIVSRQISPFIPHNYKESSFPVSVFTFTLSNHGKTSAD 179

Query: 239 ITLLFTWTNSVGGDSEFTGQHYNSKTKMNDGVHAVLLHHRTSHQLPPVTFALAAQETDGV 298
           +TL+FTW NSVGG S  +G H+NSK    DGVHAV LHH+ +++ PPVTFA+AAQET  V
Sbjct: 180 VTLMFTWANSVGGVSGLSGHHFNSKMMTKDGVHAVTLHHKMTNRQPPVTFAIAAQETADV 239

Query: 299 HVSLCPHFVISGNS--LGLTAKDMWHEIKEHGSFDRLNSMETSVTSEPGSSIGAAIAASV 356
           HVS CP F+ISG+S  + +TAKDMW  I+++G+FD++   +TS  SEPGSSIGAAIAASV
Sbjct: 240 HVSECPCFLISGDSQDVSVTAKDMWDVIRKNGTFDQIGCNKTS-PSEPGSSIGAAIAASV 298

Query: 357 TVPPDSEGQVTFSLAWDCPEVNFMSGKTYYRRYTKFYGTHQNAAANIARDAILEHGSWEL 416
           TVP  S   VTFSL+WD PE+ F S ++Y+RRYTKFYGT  +AAANIARDAILEH +WE 
Sbjct: 299 TVPSGSIRTVTFSLSWDIPEIRF-SERSYHRRYTKFYGTLGDAAANIARDAILEHANWES 357

Query: 417 QIEAWQRPILEDKRLPEWYPITLFNELYYLNAGGAVWTDGSPPVHSLVTIGHRKFSLDWS 476
           QIEAWQRPILEDKR+PEWYPITLFNELYYLNAGG +WTD SPP+ +L  +  R+FSL+ S
Sbjct: 358 QIEAWQRPILEDKRVPEWYPITLFNELYYLNAGGTIWTDESPPMQNLTAVRERRFSLERS 417

Query: 477 QSDLKRIVDVPNQNDTAVNILERMSSILEQIYTPVALNSSFGTNLLQDGEENIGQFLYLE 536
            S  K    +  +NDTA+ ILERM+S  EQ++ PV+ NS FG NLLQ+GEENIGQ LYLE
Sbjct: 418 SSGYKNGNGIAYKNDTAIEILERMASTYEQLHNPVSSNSVFGANLLQNGEENIGQLLYLE 477

Query: 537 GIEYLMWNTYDVHFYSSFALIMLFPKIQLSIQRDFAAAVMMHDPSKMKLLDDGQWVSRKV 596
           G EYLMWNTYDVHFY+SFAL+MLFPK++L++QRDFAAAV+MHDPS+M+++ DG+WV RKV
Sbjct: 478 GTEYLMWNTYDVHFYASFALVMLFPKLELNLQRDFAAAVLMHDPSRMQIMSDGKWVPRKV 537

Query: 597 LGAVPHDIGICDPWFEVNAYCLYDTARWKDLNPKFVLQVYRDVVATGDKKFAKAVWPSVY 656
           LGAVPHD+G+ DPWFE+NAY L+ TARWKDLN KFVLQVYRDVVATGDK FA+AVWPSVY
Sbjct: 538 LGAVPHDVGLNDPWFEINAYNLFSTARWKDLNCKFVLQVYRDVVATGDKDFARAVWPSVY 597

Query: 657 VAMAYMDQFDRDGDGMIENDGFPDQTYDTWSVSGISAYSGGLWVAALQAASALAREVGDR 716
           VAMAYMDQFD+DGDGMIEN+G PDQTYDTWSV+G+SAY GGLWVAALQA SA+A EVGD 
Sbjct: 598 VAMAYMDQFDKDGDGMIENEGIPDQTYDTWSVTGVSAYCGGLWVAALQATSAMAHEVGDD 657

Query: 717 GSEDYFLFKFQKAKVVYEKLWNGSYFNYDNSGSSQSSSIQADQLAGQWYARACGLLPIVD 776
           GS +YF  K+QKAK VY KLWNGSYFNYD+SG   + SIQADQLAGQWYARACGL P+VD
Sbjct: 658 GSANYFWIKYQKAKTVYGKLWNGSYFNYDSSG---NKSIQADQLAGQWYARACGLSPVVD 714

Query: 777 EDKARSALKKVYNYNVLKVMGGKRGAVNGMLPDGRVDMSSMQSREIWSGVTYAVAASMIH 836
            DKAR AL+K+Y +NVLKV GG RGAVNGMLPDGRVD+++MQ++EIW GVTYA++ASMI 
Sbjct: 715 GDKARKALEKIYEFNVLKVKGGTRGAVNGMLPDGRVDLTAMQTKEIWPGVTYALSASMIQ 774

Query: 837 EDLADIGFQTACGIYEAAWSGTGLGYAFQTPEAWNTDDQYRSLCYMRPLAIWAMQWALTR 896
           E L ++ FQTA GIY AAWS  GLGY+FQ PE+W+ +DQYRSLCYMRPLAIWAMQWAL+ 
Sbjct: 775 EGLEEMAFQTAVGIYNAAWSEEGLGYSFQIPESWDMNDQYRSLCYMRPLAIWAMQWALS- 833

Query: 897 PKPKTLEKQMKPE-VTDESLLRYHAGFSKVARLLKLPEEQGAKSLLQSLFDHTCRRMFI 954
            KPK  +++M  E + DES L+ HAGFSKVA LL+LPEE+  KS  Q++++ TC+RM  
Sbjct: 834 -KPKDFKEEMHHEGIEDESYLKQHAGFSKVAHLLRLPEEEAPKSFFQAIYEFTCKRMLF 891


>gi|357157041|ref|XP_003577664.1| PREDICTED: non-lysosomal glucosylceramidase-like [Brachypodium
           distachyon]
          Length = 951

 Score = 1291 bits (3341), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 618/950 (65%), Positives = 757/950 (79%), Gaps = 10/950 (1%)

Query: 9   NGLHEEEK------EPLNSSFDKVDPAKPASLTWQRKLSTGEIPLSQFTLNWKETVQLAP 62
           NG+ E  K       P  ++ D VDP +   LTW+ KLS     L  F L W+E V++A 
Sbjct: 4   NGMPEPPKGVSRNSSPSQTNGDSVDPGQLPELTWEHKLSHVRYDLPSFGLKWREAVKMAG 63

Query: 63  IGVRILCLIREEAAKGKRAFIDPFIKRHLTSSHGVPLGGVGSGSIGRSYRGEFQRWQIFP 122
           +G R+   I EE +KG+ A IDP  KR   S  GVPLGG+G+GSIGRSY+GEFQRWQ+FP
Sbjct: 64  LGFRLGQHIVEETSKGRTAIIDPMKKRTAKSGQGVPLGGIGAGSIGRSYKGEFQRWQLFP 123

Query: 123 RVCEDKPVLANQFSVFVSRSNGQKYSSVLCPKTPEVLKDTTAAGIGSWDWNLKGDKSTYH 182
             CEDKPVLANQFS F+SR +G+KYS+VL P  P++ K T  +GIGSWDWNL G KSTYH
Sbjct: 124 GACEDKPVLANQFSAFISRQDGRKYSTVLHPGKPDLPKGTNISGIGSWDWNLSGQKSTYH 183

Query: 183 ALYPRAWTVHEGEPDPELRIVCRQISPIIPHNYKESSYPVSVFTYTIYNSGKTSADITLL 242
           ALYPRAWTV++GEPDP+L+IVCRQ+SPIIPHNY++SSYP +VFT+T+ NSG T+ D+TLL
Sbjct: 184 ALYPRAWTVYDGEPDPDLKIVCRQVSPIIPHNYQQSSYPAAVFTFTVANSGNTAVDVTLL 243

Query: 243 FTWTNSVGGDSEFTGQHYNSKTKMNDGVHAVLLHHRTSHQLPPVTFALAAQETDGVHVSL 302
           FTW NSVGG SE TG H NS     DGVH +LLHHRT+   PPVTFA+AAQE + VH+S 
Sbjct: 244 FTWANSVGGKSELTGYHCNSSMLEKDGVHGILLHHRTADGQPPVTFAIAAQEKEDVHISE 303

Query: 303 CPHFVISGNSLGLTAKDMWHEIKEHGSFDRLNSMETSVTSEPGSSIGAAIAASVTVPPDS 362
           CP+FV+SG+S   TAKDMW+ +KEHGSFD L+ +E S+ S PG+SIGAAIAASV + P S
Sbjct: 304 CPYFVMSGSSDEFTAKDMWNSVKEHGSFDLLDPIEASICSRPGTSIGAAIAASVKLAPQS 363

Query: 363 EGQVTFSLAWDCPEVNFMSGKTYYRRYTKFYGTHQNAAANIARDAILEHGSWELQIEAWQ 422
              V+FSLAW CPEV F SGKTY+RRYTKFYGT  +AAA++A DAI++H SWE QIE WQ
Sbjct: 364 TKDVSFSLAWACPEVKFSSGKTYHRRYTKFYGTDVDAAASLAHDAIVDHSSWEKQIEEWQ 423

Query: 423 RPILEDKRLPEWYPITLFNELYYLNAGGAVWTDGSPPVHSLVTIGHRKFSLDWSQSDLKR 482
            PIL+DKR P WYP+TLFNELYYLNAGG++WTDG PP+ SL  IG +KFSLD S  +   
Sbjct: 424 NPILQDKRFPAWYPVTLFNELYYLNAGGSIWTDGLPPIQSLTAIGGKKFSLDMSNGETDD 483

Query: 483 IVDVPNQNDTAVNILERMSSILEQIYTPVALNSSFGTNLLQDGEENIGQFLYLEGIEYLM 542
           + ++   N+TA +IL++M+SILE+I+  +A NS+ GT LLQ GEENIGQFLYLEGIEY M
Sbjct: 484 VDEMIPHNNTATDILQQMASILERIHASLASNSAIGTTLLQ-GEENIGQFLYLEGIEYYM 542

Query: 543 WNTYDVHFYSSFALIMLFPKIQLSIQRDFAAAVMMHDPSKMKLLDDGQWVSRKVLGAVPH 602
           WNTYDVHFY+SFALIMLFPK+QLSIQRDFAAAVMMHDP K+KLL DG+   RKVLGAVPH
Sbjct: 543 WNTYDVHFYASFALIMLFPKLQLSIQRDFAAAVMMHDPEKLKLLHDGKLAPRKVLGAVPH 602

Query: 603 DIGICDPWFEVNAYCLYDTARWKDLNPKFVLQVYRDVVATGDKKFAKAVWPSVYVAMAYM 662
           D+G+ DPWF+VNAY L++T RWKDLNPKFVLQVYRDVVATG+K FA+AVWPSVY+AMAYM
Sbjct: 603 DLGLYDPWFKVNAYTLHNTDRWKDLNPKFVLQVYRDVVATGNKSFARAVWPSVYMAMAYM 662

Query: 663 DQFDRDGDGMIENDGFPDQTYDTWSVSGISAYSGGLWVAALQAASALAREVGDRGSEDYF 722
           +QFD+D DGMIEN+ FPDQTYD WS++G+SAY GGLWVAALQAASALAREVGD+ SE+ F
Sbjct: 663 EQFDKDKDGMIENEDFPDQTYDVWSMAGVSAYCGGLWVAALQAASALAREVGDKASEELF 722

Query: 723 LFKFQKAKVVYEKLWNGSYFNYDNSGSSQSSSIQADQLAGQWYARACGLLPIVDEDKARS 782
             K++KAK VY KLWNGSYFNYD++G+  S+SI ADQLAGQWYA++CGL  IVD+DK++S
Sbjct: 723 WNKYEKAKSVYGKLWNGSYFNYDDAGTKVSTSIHADQLAGQWYAKSCGLSSIVDKDKSQS 782

Query: 783 ALKKVYNYNVLKVMGGKRGAVNGMLPDGRVDMSSMQSREIWSGVTYAVAASMIHEDLADI 842
           AL+K+Y++NV+K   GKRGA+NGM PDG VDMS+MQSREIW GVTYA+AASMI E + + 
Sbjct: 783 ALEKIYSFNVMKFKDGKRGAMNGMWPDGTVDMSTMQSREIWPGVTYALAASMIQEGMVEE 842

Query: 843 GFQTACGIYEAAWSGTGLGYAFQTPEAWNTDDQYRSLCYMRPLAIWAMQWALTRPKPKTL 902
           GF+TA GIY AAWS  GLGYAFQTPEAW  DD YRSLCYMRPLAIW++QWAL+ PK   L
Sbjct: 843 GFKTAEGIYHAAWSPEGLGYAFQTPEAWTNDDGYRSLCYMRPLAIWSIQWALSSPK---L 899

Query: 903 EKQMKPEVTDESLLRYHAGFSKVARLLKLPEEQGAKSLLQSLFDHTCRRM 952
            K+ + ++  +S  +    ++++A+LL+LPE++ +KS LQ +++    R 
Sbjct: 900 HKEPQRDLAQDSFPKNQFSYARIAKLLQLPEDESSKSFLQVIYEIVRNRF 949


>gi|255569126|ref|XP_002525532.1| conserved hypothetical protein [Ricinus communis]
 gi|223535211|gb|EEF36890.1| conserved hypothetical protein [Ricinus communis]
          Length = 948

 Score = 1283 bits (3319), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 645/956 (67%), Positives = 767/956 (80%), Gaps = 14/956 (1%)

Query: 1   MFEGKILENGLHEEEKEPLNSSFDKVDPAKPASLTWQRKL-STGEIPLSQFTLNWKETVQ 59
           M E +I E G   E+   L+SS  K++P  P SL+W+R L S G +PL  FTL+++E + 
Sbjct: 1   MMENRIKEEG---EKDSYLDSSTQKINPGVPPSLSWERPLNSNGNVPLG-FTLSFREILH 56

Query: 60  LAPIGVRILCLIREEAAKGKRAFIDPFIKRH-LTSSHGVPLGGVGSGSIGRSYRGEFQRW 118
           + PIG+R+    +EE  KG+    D F K+H +T  H VPLGG+G+GSIGRSY+GEFQ +
Sbjct: 57  MLPIGLRLWRYSKEEPTKGRVPIFD-FSKKHVITGDHAVPLGGIGAGSIGRSYKGEFQCF 115

Query: 119 QIFPRVCEDKPVLANQFSVFVSRSNGQKYSSVLCPKTPEVLKDTTAAGIGSWDWNLKGDK 178
           ++ P  CE+ P+LANQFSVFVSR NG+K+SSVLC + PE+  +   +GI SWDWNL G+ 
Sbjct: 116 KLLPLACEEGPILANQFSVFVSRPNGKKFSSVLCSRRPELPTEIKGSGIESWDWNLNGEN 175

Query: 179 STYHALYPRAWTVHEGEPDPELRIVCRQISPIIPHNYKESSYPVSVFTYTIYNSGKTSAD 238
            TYHAL+PRAWT ++G+PDPEL+IV +QISP IPHNYKESS+PVSVFT+T+ N G+TSAD
Sbjct: 176 CTYHALFPRAWTTYKGQPDPELKIVSQQISPFIPHNYKESSFPVSVFTFTLSNFGRTSAD 235

Query: 239 ITLLFTWTNSVGGDSEFTGQHYNSKTKMNDGVHAVLLHHRTSHQLPPVTFALAAQETDGV 298
           +TLLFTW NS+GG S  +  H+NS     DGVH V LHH+T    PP+TFA+AAQET  V
Sbjct: 236 VTLLFTWANSIGGVSGSSRHHFNSSIMKKDGVHTVTLHHKTGDGQPPLTFAIAAQETPDV 295

Query: 299 HVSLCPHFVISGNSLGLTAKDMWHEIKEHGSFDRLNSMETSVTSEPGSSIGAAIAASVTV 358
           HVS CP F+ISGNS G+TAKDMW EIK+HG+FD L+  + S  SE GS IGAAIAA++T+
Sbjct: 296 HVSECPCFLISGNSQGVTAKDMWDEIKKHGTFDHLSYNKIS-PSEGGSCIGAAIAATLTI 354

Query: 359 PPDSEGQVTFSLAWDCPEVNFMSGKTYYRRYTKFYGTHQNAAANIARDAILEHGSWELQI 418
           PPD+    TFSLAWDCPEV F SG+TY+RRYTKFYGT  +AAA+IA DAILEHG WE QI
Sbjct: 355 PPDTIRTATFSLAWDCPEVRF-SGRTYHRRYTKFYGTLGDAAADIAHDAILEHGHWESQI 413

Query: 419 EAWQRPILEDKRLPEWYPITLFNELYYLNAGGAVWTDGSPPVHSLVTIGHRKFSLDWSQS 478
           EAWQRPILEDKRLP+WYPITLFNELYYLNAGG VWTDGSPP+ S   I   K +LD S+S
Sbjct: 414 EAWQRPILEDKRLPKWYPITLFNELYYLNAGGTVWTDGSPPMQSFAAIKGGKLTLDKSRS 473

Query: 479 DLKRIVDVPNQNDTAVNILERMSSILEQIYTPVALNSSFGTNLLQDGEENIGQFLYLEGI 538
           + +    V ++NDTAV IL RM+SI E+++ PV  N++FGT LLQ+ EENIGQFLYLEG 
Sbjct: 474 EFENARPVAHRNDTAVEILNRMASIYEKMHNPVTSNAAFGTYLLQNDEENIGQFLYLEGS 533

Query: 539 EYLMWNTYDVHFYSSFALIMLFPKIQLSIQRDFAAAVMMHDPSKMKLLDDGQWVSRKVLG 598
           EYLMWNTYDVHFYSSFAL+MLFPK++LSIQRDFAAAVMMHDPS+M+++ DG+ V RKVLG
Sbjct: 534 EYLMWNTYDVHFYSSFALLMLFPKLELSIQRDFAAAVMMHDPSRMQIMSDGRRVPRKVLG 593

Query: 599 AVPHDIGICDPWFEVNAYCLYDTARWKDLNPKFVLQVYRDVVATGDKKFAKAVWPSVYVA 658
           AVPHDIG+ DPWFEVNAY L  TARWKDLNPKFVLQ+YRDVVATGDK FA AVWPSVYVA
Sbjct: 594 AVPHDIGLNDPWFEVNAYNLISTARWKDLNPKFVLQIYRDVVATGDKSFALAVWPSVYVA 653

Query: 659 MAYMDQFDRDGDGMIENDGFPDQTYDTWSVSGISAYSGGLWVAALQAASALAREVGDRGS 718
           MAYMDQFD+DGDGMIEN+GFPDQTYDTWSV+G+SAY GGLWVAALQAASALA EVGD  S
Sbjct: 654 MAYMDQFDKDGDGMIENEGFPDQTYDTWSVTGVSAYCGGLWVAALQAASALACEVGDSES 713

Query: 719 EDYFLFKFQKAKVVYEKLWNGSYFNYDNSGSSQSSSIQADQLAGQWYARACGLLPIVDED 778
             +F  K+QKAK VY  LWNGSYFNYD+SG   +SSI ADQLAGQWYARACGL  IVDE+
Sbjct: 714 ASFFWVKYQKAKAVYSTLWNGSYFNYDSSG---NSSIHADQLAGQWYARACGLSSIVDEE 770

Query: 779 KARSALKKVYNYNVLKVMGGKRGAVNGMLPDGRVDMSSMQSREIWSGVTYAVAASMIHED 838
           K RSAL+K+Y +NVLKV  GKRGAVNGMLPDG+VDMS MQSREIW GVTYA++ASMI E 
Sbjct: 771 KVRSALEKIYKFNVLKVKEGKRGAVNGMLPDGKVDMSVMQSREIWPGVTYALSASMIQEG 830

Query: 839 LADIGFQTACGIYEAAWSGTGLGYAFQTPEAWNTDDQYRSLCYMRPLAIWAMQWALTRPK 898
           +A++ FQTA GIYEAAWS  GLGY+FQ PE WNTDDQYRSLCYMRPLAIWAMQWAL++PK
Sbjct: 831 MAEMAFQTASGIYEAAWSQEGLGYSFQIPEGWNTDDQYRSLCYMRPLAIWAMQWALSKPK 890

Query: 899 PKTLEKQMKPEVTDESLL-RYHAGFSKVARLLKLPEEQGA-KSLLQSLFDHTCRRM 952
               E ++     D+ L    +AGFSKVA LLKLP+++G+ KS LQS ++ TCRR+
Sbjct: 891 VFKEEMKLLSLAADDRLHPSQNAGFSKVAHLLKLPDDEGSNKSFLQSAYEFTCRRL 946


>gi|242070593|ref|XP_002450573.1| hypothetical protein SORBIDRAFT_05g007160 [Sorghum bicolor]
 gi|241936416|gb|EES09561.1| hypothetical protein SORBIDRAFT_05g007160 [Sorghum bicolor]
          Length = 951

 Score = 1269 bits (3284), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 607/930 (65%), Positives = 743/930 (79%), Gaps = 10/930 (1%)

Query: 19  LNSSFDKVDPAKPASLTWQRKLSTGEIPLSQFTLNWKETVQLAPIGVRILCLIREEAAKG 78
           +N   +KV   +   LTW+ KL+ G   L  F L W+ET+QLA IG+R+   I EE +KG
Sbjct: 22  MNGDEEKVYSGQLPVLTWEHKLTRGRHDLPPFRLTWRETMQLAGIGLRLSRHILEETSKG 81

Query: 79  KRAFIDPFIKRHLTSSHGVPLGGVGSGSIGRSYRGEFQRWQIFPRVCEDKPVLANQFSVF 138
           + A IDP  KR   S  GVPLGG    SIGRSY+G+FQRWQ+FP  CEDKPVLANQFS F
Sbjct: 82  RIAVIDPMKKRSARSGQGVPLGG----SIGRSYKGDFQRWQLFPGTCEDKPVLANQFSAF 137

Query: 139 VSRSNGQKYSSVLCPKTPEVLKDTTAAGIGSWDWNLKGDKSTYHALYPRAWTVHEGEPDP 198
           +SR +G+KYS+VL P  P++ K +  +GIGSWDWN+ G++STYHALYPRAWTV++GEPDP
Sbjct: 138 ISRQDGRKYSTVLHPGKPDLPKGSDISGIGSWDWNMSGEQSTYHALYPRAWTVYDGEPDP 197

Query: 199 ELRIVCRQISPIIPHNYKESSYPVSVFTYTIYNSGKTSADITLLFTWTNSVGGDSEFTGQ 258
           EL IVCRQISPIIPHNY++SSYPV+VFT+T+ NSG T+AD+TLLFTW NSVGG SE TG 
Sbjct: 198 ELNIVCRQISPIIPHNYQQSSYPVAVFTFTVTNSGHTAADVTLLFTWANSVGGKSELTGY 257

Query: 259 HYNSKTKMNDGVHAVLLHHRTSHQLPPVTFALAAQETDGVHVSLCPHFVISGNSLG--LT 316
           H NS     DGVH +LLHHRT+   PPVTF +AAQE + + +S CP+FVISG+S     T
Sbjct: 258 HSNSSMIEKDGVHGILLHHRTADGQPPVTFVIAAQEKEDILISQCPYFVISGSSASDEFT 317

Query: 317 AKDMWHEIKEHGSFDRLNSMETSVTSEPGSSIGAAIAASVTVPPDSEGQVTFSLAWDCPE 376
           AKDMW+ +KEHGSFD L+ ++TS+ S PGSSIGAAIAASV + P +   ++FSLAW CPE
Sbjct: 318 AKDMWNSVKEHGSFDHLDPIKTSMCSRPGSSIGAAIAASVKLAPKATKDISFSLAWACPE 377

Query: 377 VNFMSGKTYYRRYTKFYGTHQNAAANIARDAILEHGSWELQIEAWQRPILEDKRLPEWYP 436
           V F SGKTY+RRYTKFYGT  +AAA +A DAILEH SWE QI+ WQ PIL+D+R P WYP
Sbjct: 378 VKFSSGKTYHRRYTKFYGTDGDAAATLAHDAILEHASWERQIDEWQDPILQDERFPAWYP 437

Query: 437 ITLFNELYYLNAGGAVWTDGSPPVHSLVTIGHRKFSLDWSQSDLKRIVDVPNQNDTAVNI 496
           +TLFNELYYLNAGG +WTDG PP+ SL  IG +KFSLD    +     ++  QN+TA +I
Sbjct: 438 VTLFNELYYLNAGGTIWTDGLPPIQSLTAIGWKKFSLDMLNGESDDFNEMVRQNNTASDI 497

Query: 497 LERMSSILEQIYTPVALNSSFGTNLLQDGEENIGQFLYLEGIEYLMWNTYDVHFYSSFAL 556
           L RM+S+ E+++  +A NS+ GT LLQ GEENIGQFLYLEGIEY MWNTYDVHFY+SF+L
Sbjct: 498 LHRMASVFERMHASIASNSAIGTTLLQ-GEENIGQFLYLEGIEYYMWNTYDVHFYASFSL 556

Query: 557 IMLFPKIQLSIQRDFAAAVMMHDPSKMKLLDDGQWVSRKVLGAVPHDIGICDPWFEVNAY 616
           IMLFPK+QLS+QRDFAAAVMMHDP K+++L DG+W +RKVLGAVPHD+G+ DPWF+VNAY
Sbjct: 557 IMLFPKLQLSVQRDFAAAVMMHDPEKLRILHDGKWAARKVLGAVPHDLGLYDPWFKVNAY 616

Query: 617 CLYDTARWKDLNPKFVLQVYRDVVATGDKKFAKAVWPSVYVAMAYMDQFDRDGDGMIEND 676
            LY+T RWKDLNPKFVLQVYRDVVATGDK FA+AVWPSVY+AMAYM+QFD+D DGMIEN+
Sbjct: 617 TLYNTDRWKDLNPKFVLQVYRDVVATGDKSFARAVWPSVYIAMAYMEQFDKDKDGMIENE 676

Query: 677 GFPDQTYDTWSVSGISAYSGGLWVAALQAASALAREVGDRGSEDYFLFKFQKAKVVYEKL 736
            FPDQTYD WS++GISAY GGLWVAALQAASALAREVGD+ SE  F  K++KAK VY KL
Sbjct: 677 DFPDQTYDVWSMAGISAYCGGLWVAALQAASALAREVGDKASEKLFWDKYEKAKSVYSKL 736

Query: 737 WNGSYFNYDNSGSSQSSSIQADQLAGQWYARACGLLPIVDEDKARSALKKVYNYNVLKVM 796
           WNGSYF+YD+  +  S+SIQADQLAGQWYA+ACGL PIVD+DKA+SAL+K++++NV+K  
Sbjct: 737 WNGSYFSYDDGDNKVSTSIQADQLAGQWYAKACGLFPIVDKDKAQSALEKIFSFNVMKFK 796

Query: 797 GGKRGAVNGMLPDGRVDMSSMQSREIWSGVTYAVAASMIHEDLADIGFQTACGIYEAAWS 856
            G RGA+NGM PDG VDMS+MQSREIW GVTYA+AASMI E + + GF+TA GIY AAWS
Sbjct: 797 DGTRGAMNGMWPDGTVDMSAMQSREIWPGVTYALAASMIQEGMVEQGFKTAEGIYHAAWS 856

Query: 857 GTGLGYAFQTPEAWNTDDQYRSLCYMRPLAIWAMQWALTRPKPKTLEKQMKPEVTDESLL 916
             GLGY+FQTPEAWN DD+YRSLCYMRPLAIWAMQWAL+ PK   L    + ++  +S  
Sbjct: 857 PEGLGYSFQTPEAWNNDDEYRSLCYMRPLAIWAMQWALSNPK---LHNTPQADIPQDSFP 913

Query: 917 RYHAGFSKVARLLKLPEEQGAKSLLQSLFD 946
           +    ++++ +LL+LPEE+ +KS L+ +++
Sbjct: 914 KNQFAYARITKLLQLPEEETSKSFLRVVYE 943


>gi|115484889|ref|NP_001067588.1| Os11g0242100 [Oryza sativa Japonica Group]
 gi|77549531|gb|ABA92328.1| expressed protein [Oryza sativa Japonica Group]
 gi|113644810|dbj|BAF27951.1| Os11g0242100 [Oryza sativa Japonica Group]
 gi|215704397|dbj|BAG93831.1| unnamed protein product [Oryza sativa Japonica Group]
 gi|222615769|gb|EEE51901.1| hypothetical protein OsJ_33494 [Oryza sativa Japonica Group]
          Length = 950

 Score = 1259 bits (3259), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 616/952 (64%), Positives = 754/952 (79%), Gaps = 11/952 (1%)

Query: 6   ILENGLHEEEK-----EPLNSSFDK-VDPAKPASLTWQRKLSTGEIPLSQFTLNWKETVQ 59
           ++ENG+ E+ K      P   S D  VDP     LTW+ KLS     L  F L W+ET Q
Sbjct: 1   MVENGVLEQPKGVSRNRPRAQSNDHPVDPGYLPELTWEHKLSNIGYDLPSFRLTWRETFQ 60

Query: 60  LAPIGVRILCLIREEAAKGKRAFIDPFIKRHLTSSHGVPLGGVGSGSIGRSYRGEFQRWQ 119
           LA +G+R+   I EE +KG+ A IDP  KR   S  GVPLGG+GSGSIGRSY+GEFQRWQ
Sbjct: 61  LAGLGLRLGRHILEETSKGRAAVIDPMKKRIAKSGQGVPLGGIGSGSIGRSYKGEFQRWQ 120

Query: 120 IFPRVCEDKPVLANQFSVFVSRSNGQKYSSVLCPKTPEVLKDTTAAGIGSWDWNLKGDKS 179
           +FP  CE++PVLANQFS F+SR +G+ YSSVL P  P++ K +  +GIGSWDWN+ G  S
Sbjct: 121 LFPGTCEERPVLANQFSAFISRKDGRNYSSVLHPGKPDLPKGSNISGIGSWDWNMSGQNS 180

Query: 180 TYHALYPRAWTVHEGEPDPELRIVCRQISPIIPHNYKESSYPVSVFTYTIYNSGKTSADI 239
           TYHALYPR+WT++ GEPDP++ IVCRQISPIIPHNY++SSYPVSVFT+T+ NSG T+AD+
Sbjct: 181 TYHALYPRSWTIYNGEPDPDVNIVCRQISPIIPHNYQQSSYPVSVFTFTVTNSGNTAADV 240

Query: 240 TLLFTWTNSVGGDSEFTGQHYNSKTKMNDGVHAVLLHHRTSHQLPPVTFALAAQETDGVH 299
           TLLFTW NSVGG SE TG H NS     DGVH +LLHHRT++  PPVTFA+AAQE + +H
Sbjct: 241 TLLFTWANSVGGKSELTGYHSNSPMIEKDGVHGILLHHRTANGQPPVTFAIAAQEKEDIH 300

Query: 300 VSLCPHFVISGNSLGLTAKDMWHEIKEHGSFDRLNSMETSVTSEPGSSIGAAIAASVTVP 359
           +S CP+F+ISG+S   +AKDMW+ +KE+GSFD L+  +TS+ S+PG SIGAAIAASV +P
Sbjct: 301 ISECPYFIISGSSDAFSAKDMWNYVKENGSFDNLDLTKTSMCSKPGLSIGAAIAASVKLP 360

Query: 360 PDSEGQVTFSLAWDCPEVNFMSGKTYYRRYTKFYGTHQNAAANIARDAILEHGSWELQIE 419
           P +   V+F+LAW CPEV F SGKTY+RRYTKF+GT  +AAA++A DAILEH SWE QIE
Sbjct: 361 PQTTQNVSFALAWACPEVKFSSGKTYHRRYTKFHGTDNDAAASLAHDAILEHNSWERQIE 420

Query: 420 AWQRPILEDKRLPEWYPITLFNELYYLNAGGAVWTDGSPPVHSLVTIGHRKFSLDWSQSD 479
            WQ PIL+D+R P+WYP+TLFNELYYLNAGG +WTDG PP+ SL  IG +KFSLD    D
Sbjct: 421 EWQNPILQDERFPDWYPVTLFNELYYLNAGGTIWTDGLPPIQSLTGIGEKKFSLDMQNGD 480

Query: 480 LKRIVDVPNQNDTAVNILERMSSILEQIYTPVALNSSFGTNLLQDGEENIGQFLYLEGIE 539
                 +  +N+TA +IL +M+S+LE+I+  +  NS+ GT LLQ GEENIGQFLYLEGIE
Sbjct: 481 ADDANGIIPRNNTASDILNQMASVLERIHASMESNSAIGTTLLQ-GEENIGQFLYLEGIE 539

Query: 540 YLMWNTYDVHFYSSFALIMLFPKIQLSIQRDFAAAVMMHDPSKMKLLDDGQWVSRKVLGA 599
           Y MWNTYDVHFY+SF+LIMLFPK+QLSIQRDFAAAV+MHDP K+++L DG+WV+RKVLGA
Sbjct: 540 YYMWNTYDVHFYASFSLIMLFPKLQLSIQRDFAAAVLMHDPEKLRMLHDGKWVARKVLGA 599

Query: 600 VPHDIGICDPWFEVNAYCLYDTARWKDLNPKFVLQVYRDVVATGDKKFAKAVWPSVYVAM 659
           VPHD+G+ DPWF+VNAY LY+T RWKDLNPKFVLQVYRDVVATGDK FA+AVWPSVY+AM
Sbjct: 600 VPHDLGLYDPWFKVNAYTLYNTDRWKDLNPKFVLQVYRDVVATGDKSFARAVWPSVYMAM 659

Query: 660 AYMDQFDRDGDGMIENDGFPDQTYDTWSVSGISAYSGGLWVAALQAASALAREVGDRGSE 719
           AYM+QFDRD DGMIEN+ FPDQTYD WS++GISAY GGLWVAALQAASALA EVGD+ SE
Sbjct: 660 AYMEQFDRDKDGMIENEDFPDQTYDVWSMAGISAYCGGLWVAALQAASALAHEVGDKASE 719

Query: 720 DYFLFKFQKAKVVYEKLWNGSYFNYDNSGSSQSSSIQADQLAGQWYARACGLLPIVDEDK 779
             F  K++KAK VY KLWNGSYFNYD+  +  S+SI ADQLAGQWYA+ACGL PIVD+DK
Sbjct: 720 KLFWDKYEKAKSVYGKLWNGSYFNYDDGDNIMSASIHADQLAGQWYAKACGLFPIVDKDK 779

Query: 780 ARSALKKVYNYNVLKVMGGKRGAVNGMLPDGRVDMSSMQSREIWSGVTYAVAASMIHEDL 839
           A SAL+K+Y++NV+K   GKRGA+NGM PDG VDMS+MQSREIW GVTYA+AA+MI E +
Sbjct: 780 AESALEKIYSFNVMKFKDGKRGAMNGMWPDGTVDMSAMQSREIWPGVTYALAATMIQEGM 839

Query: 840 ADIGFQTACGIYEAAWSGTGLGYAFQTPEAWNTDDQYRSLCYMRPLAIWAMQWALTRPKP 899
            + GF+TA GIY AAWS  GLGY+FQTPEAWN DD+YRSLCYMRPLAIWA+QWAL+ PK 
Sbjct: 840 VEKGFKTAEGIYHAAWSPEGLGYSFQTPEAWNNDDEYRSLCYMRPLAIWAIQWALSNPK- 898

Query: 900 KTLEKQMKPEVTDESLLRYHAGFSKVARLLKLPEEQGAKSLLQSLFDHTCRR 951
             L KQ   ++  +S  +    ++++A+LL LPE++  KS L+ +++    R
Sbjct: 899 --LHKQTA-DIPQDSFPKNQFSYARIAKLLHLPEDESPKSFLRVIYEIVRNR 947


>gi|218185509|gb|EEC67936.1| hypothetical protein OsI_35656 [Oryza sativa Indica Group]
          Length = 950

 Score = 1259 bits (3259), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 616/952 (64%), Positives = 754/952 (79%), Gaps = 11/952 (1%)

Query: 6   ILENGLHEEEK-----EPLNSSFDK-VDPAKPASLTWQRKLSTGEIPLSQFTLNWKETVQ 59
           ++ENG+ E+ K      P   S D  VDP     LTW+ KLS     L  F L W+ET Q
Sbjct: 1   MVENGVLEQPKGVSRNRPRAQSNDHPVDPGYLPELTWEHKLSNIGYDLPSFRLTWRETFQ 60

Query: 60  LAPIGVRILCLIREEAAKGKRAFIDPFIKRHLTSSHGVPLGGVGSGSIGRSYRGEFQRWQ 119
           LA +G+R+   I EE +KG+ A IDP  KR   S  GVPLGG+GSGSIGRSY+GEFQRWQ
Sbjct: 61  LAGLGLRLGRHILEETSKGRAAVIDPMKKRIAKSGQGVPLGGIGSGSIGRSYKGEFQRWQ 120

Query: 120 IFPRVCEDKPVLANQFSVFVSRSNGQKYSSVLCPKTPEVLKDTTAAGIGSWDWNLKGDKS 179
           +FP  CE++PVLANQFS F+SR +G+ YSSVL P  P++ K +  +GIGSWDWN+ G  S
Sbjct: 121 LFPGTCEERPVLANQFSAFISRKDGRNYSSVLHPGKPDLPKGSNISGIGSWDWNMSGQNS 180

Query: 180 TYHALYPRAWTVHEGEPDPELRIVCRQISPIIPHNYKESSYPVSVFTYTIYNSGKTSADI 239
           TYHALYPR+WT++ GEPDP++ IVCRQISPIIPHNY++SSYPVSVFT+T+ NSG T+AD+
Sbjct: 181 TYHALYPRSWTIYNGEPDPDVNIVCRQISPIIPHNYQQSSYPVSVFTFTVTNSGNTAADV 240

Query: 240 TLLFTWTNSVGGDSEFTGQHYNSKTKMNDGVHAVLLHHRTSHQLPPVTFALAAQETDGVH 299
           TLLFTW NSVGG SE TG H NS     DGVH +LLHHRT++  PPVTFA+AAQE + VH
Sbjct: 241 TLLFTWANSVGGKSELTGYHSNSPMIEKDGVHGILLHHRTANGQPPVTFAIAAQEKEDVH 300

Query: 300 VSLCPHFVISGNSLGLTAKDMWHEIKEHGSFDRLNSMETSVTSEPGSSIGAAIAASVTVP 359
           +S CP+F+ISG+S   +AKDMW+ +KE+GSFD L+  +TS+ S+PG SIGAAIAASV +P
Sbjct: 301 ISECPYFIISGSSDAFSAKDMWNYVKENGSFDNLDLTKTSMCSKPGLSIGAAIAASVKLP 360

Query: 360 PDSEGQVTFSLAWDCPEVNFMSGKTYYRRYTKFYGTHQNAAANIARDAILEHGSWELQIE 419
           P +   V+F+LAW CPEV F SGKTY+RRYTKF+GT  +AAA++A DAILEH SWE QIE
Sbjct: 361 PQTTQNVSFALAWACPEVKFSSGKTYHRRYTKFHGTDNDAAASLAHDAILEHNSWERQIE 420

Query: 420 AWQRPILEDKRLPEWYPITLFNELYYLNAGGAVWTDGSPPVHSLVTIGHRKFSLDWSQSD 479
            WQ PIL+D+R P+WYP+TLFNELYYLNAGG +WTDG PP+ SL  IG +KFSLD    D
Sbjct: 421 EWQNPILQDERFPDWYPVTLFNELYYLNAGGTIWTDGLPPIQSLTGIGEKKFSLDMQNGD 480

Query: 480 LKRIVDVPNQNDTAVNILERMSSILEQIYTPVALNSSFGTNLLQDGEENIGQFLYLEGIE 539
                 +  +N+TA +IL +M+S+LE+I+  +  NS+ GT LLQ GEENIGQFLYLEGIE
Sbjct: 481 ADDANGIIPRNNTASDILNQMASVLERIHASMESNSAIGTTLLQ-GEENIGQFLYLEGIE 539

Query: 540 YLMWNTYDVHFYSSFALIMLFPKIQLSIQRDFAAAVMMHDPSKMKLLDDGQWVSRKVLGA 599
           Y MWNTYDVHFY+SF+LIMLFPK+QLSIQRDFAAAV+MHDP K+++L DG+WV+RKVLGA
Sbjct: 540 YYMWNTYDVHFYASFSLIMLFPKLQLSIQRDFAAAVLMHDPEKLRMLHDGKWVARKVLGA 599

Query: 600 VPHDIGICDPWFEVNAYCLYDTARWKDLNPKFVLQVYRDVVATGDKKFAKAVWPSVYVAM 659
           +PHD+G+ DPWF+VNAY LY+T RWKDLNPKFVLQVYRDVVATGDK FA+AVWPSVY+AM
Sbjct: 600 IPHDLGLYDPWFKVNAYTLYNTDRWKDLNPKFVLQVYRDVVATGDKSFARAVWPSVYMAM 659

Query: 660 AYMDQFDRDGDGMIENDGFPDQTYDTWSVSGISAYSGGLWVAALQAASALAREVGDRGSE 719
           AYM+QFDRD DGMIEN+ FPDQTYD WS++GISAY GGLWVAALQAASALA EVGD+ SE
Sbjct: 660 AYMEQFDRDKDGMIENEDFPDQTYDVWSMAGISAYCGGLWVAALQAASALAHEVGDKASE 719

Query: 720 DYFLFKFQKAKVVYEKLWNGSYFNYDNSGSSQSSSIQADQLAGQWYARACGLLPIVDEDK 779
             F  K++KAK VY KLWNGSYFNYD+  +  S+SI ADQLAGQWYA+ACGL PIVD+DK
Sbjct: 720 KLFWDKYEKAKSVYGKLWNGSYFNYDDGDNIMSASIHADQLAGQWYAKACGLFPIVDKDK 779

Query: 780 ARSALKKVYNYNVLKVMGGKRGAVNGMLPDGRVDMSSMQSREIWSGVTYAVAASMIHEDL 839
           A SAL+K+Y++NV+K   GKRGA+NGM PDG VDMS+MQSREIW GVTYA+AA+MI E +
Sbjct: 780 AESALEKIYSFNVMKFKDGKRGAMNGMWPDGTVDMSAMQSREIWPGVTYALAATMIQEGM 839

Query: 840 ADIGFQTACGIYEAAWSGTGLGYAFQTPEAWNTDDQYRSLCYMRPLAIWAMQWALTRPKP 899
            + GF+TA GIY AAWS  GLGY+FQTPEAWN DD+YRSLCYMRPLAIWA+QWAL+ PK 
Sbjct: 840 VEKGFKTAEGIYHAAWSPEGLGYSFQTPEAWNNDDEYRSLCYMRPLAIWAIQWALSNPK- 898

Query: 900 KTLEKQMKPEVTDESLLRYHAGFSKVARLLKLPEEQGAKSLLQSLFDHTCRR 951
             L KQ   ++  +S  +    ++++A+LL LPE++  KS L+ +++    R
Sbjct: 899 --LHKQTA-DIPQDSFPKNQFSYARIAKLLHLPEDESPKSFLRVIYEIVRNR 947


>gi|8777421|dbj|BAA97011.1| unnamed protein product [Arabidopsis thaliana]
          Length = 928

 Score = 1246 bits (3224), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 629/971 (64%), Positives = 745/971 (76%), Gaps = 62/971 (6%)

Query: 1   MFEGKILENGLHEEEKEPLNSSFDKVDPAKPASLTWQRKLSTGEIPLSQFTLNWKETVQL 60
           MFE KI++ G   E+ +PLNSS  KVDPA PASLTWQRK+ +      +F L+ KE  QL
Sbjct: 1   MFEEKIMDIG---EDVKPLNSSDTKVDPAVPASLTWQRKIDSDVKAPREFNLSVKEIFQL 57

Query: 61  APIGVRILCLIREEAAKGKRAFIDPFIKRHLTSSHGVPLGGVGSGSIGRSYRGEFQRWQI 120
           AP+G+R+  L REEAAKG+ AFIDPF K  +TSSHGVPLGG+G+GSIGRS++GEFQRWQ+
Sbjct: 58  APVGIRLWFLCREEAAKGRLAFIDPFSKHSVTSSHGVPLGGIGAGSIGRSFKGEFQRWQL 117

Query: 121 FPRVCEDKPVLANQFSVFVSRSNGQKYSSVLCPKTPEVLKDTTAAGIGSWDWNLKGDKST 180
           FP  CED+PVLANQFS FVSR+NG+KYSSVLCP+ P++ K  + +GIGSWDWNLKGDKST
Sbjct: 118 FPPKCEDEPVLANQFSAFVSRANGKKYSSVLCPRNPKLDKQDSESGIGSWDWNLKGDKST 177

Query: 181 YHALYPRAWTVHEGEPDPELRIVCRQISPIIPHNYKESSYPVSVFTYTIYNSGKTSADIT 240
           YHALYPR+WT++EG                           +  F Y+            
Sbjct: 178 YHALYPRSWTMYEGN--------------------------IYAFEYS------------ 199

Query: 241 LLFTWTNSVGGDSEFTGQH---YNSKTKMNDGVHAVLLHHRTSHQLPPVTFALAAQETDG 297
             FT  N +     F  +    Y + TK       +    +T++ LP +++A++AQ TDG
Sbjct: 200 --FTLVNLIQSLGLFVVKFHPLYLTTTK-KAAFQCLFSLSQTANGLPSLSYAISAQATDG 256

Query: 298 VHVSLCPHFVISGNSLGLTAKDMWHEIKEHGSFDRLNSMETSVTSEPGSSIGAAIAASVT 357
           V VS CP F++SG   G+TAKDMW  +KE+GSFD L + E S+ S+ GSSIGAA+AASVT
Sbjct: 257 VSVSACPFFIVSGKQDGITAKDMWQAVKENGSFDHLKASEASMQSDHGSSIGAAVAASVT 316

Query: 358 VPPDSEGQVTFSLAWDCPEVNFMSGKTY---------YRRYTKFYGTHQNAAANIARDAI 408
           V P     VTFSLAWDCPEV F SGK Y         +RRYTKFYG + +AAA IA DAI
Sbjct: 317 VLPGESRIVTFSLAWDCPEVQFPSGKIYSRQPIKQNLFRRYTKFYGNNGDAAAQIAHDAI 376

Query: 409 LEHGSWELQIEAWQRPILEDKRLPEWYPITLFNELYYLNAGGAVWTDGSPPVHSLVTIGH 468
           L H  WE  IE WQRPILEDKRLP WYP+TLFNELYYLN+GG +WTDGS PVHSL  +  
Sbjct: 377 LGHSQWESWIEDWQRPILEDKRLPAWYPVTLFNELYYLNSGGTLWTDGSSPVHSLAGVRE 436

Query: 469 RKFSLDWSQSDLKRIVDVPNQNDTAVNILERMSSILEQIYTPVALNSSFGTNLLQDGEEN 528
           +KFSLD SQ  LK  +DVP+QNDTAV++LE+M+S LE+++     NS+FGT LL++GEEN
Sbjct: 437 KKFSLDKSQLGLKNDIDVPHQNDTAVSVLEKMASTLEELHASTTSNSAFGTKLLEEGEEN 496

Query: 529 IGQFLYLEGIEYLMWNTYDVHFYSSFALIMLFPKIQLSIQRDFAAAVMMHDPSKMKLLDD 588
           IG FLYLEGIEY MWNTYDVHFY+SFAL+MLFPK++LSIQRDFAAAVM+HDP+K+K L +
Sbjct: 497 IGHFLYLEGIEYRMWNTYDVHFYASFALVMLFPKLELSIQRDFAAAVMLHDPTKVKTLSE 556

Query: 589 GQWVSRKVLGAVPHDIGICDPWFEVNAYCLYDTARWKDLNPKFVLQVYRDVVATGDKKFA 648
           GQWV RKVLGAVPHD+GI DPWFEVN Y L++T RWKDLNPKFVLQVYRDVVATGDKKFA
Sbjct: 557 GQWVQRKVLGAVPHDLGINDPWFEVNGYTLHNTDRWKDLNPKFVLQVYRDVVATGDKKFA 616

Query: 649 KAVWPSVYVAMAYMDQFDRDGDGMIENDGFPDQTYDTWSVSGISAYSGGLWVAALQAASA 708
            AVWPSVYVAMAYM QFD+DGDGMIEN+GFPDQTYDTWS SG+SAY GGLWVAALQAASA
Sbjct: 617 SAVWPSVYVAMAYMAQFDKDGDGMIENEGFPDQTYDTWSASGVSAYCGGLWVAALQAASA 676

Query: 709 LAREVGDRGSEDYFLFKFQKAKVVYE-KLWNGSYFNYDNSGSSQSSSIQADQLAGQWYAR 767
           LAR VGD+ S+DYF  KFQKAKVVYE KLWNGSYFNYDNSGS  SS+IQADQLAGQWYAR
Sbjct: 677 LARVVGDKNSQDYFWSKFQKAKVVYEKKLWNGSYFNYDNSGSQYSSTIQADQLAGQWYAR 736

Query: 768 ACGLLPIVDEDKARSALKKVYNYNVLKVMGGKRGAVNGMLPDGRVDMSSMQSREIWSGVT 827
           A GLLPIVDEDKAR+AL+KVYNYNV+K+  GKRGAVNGM P+G+VD +SMQSREIWSGVT
Sbjct: 737 ASGLLPIVDEDKARTALEKVYNYNVMKIKDGKRGAVNGMHPNGKVDTASMQSREIWSGVT 796

Query: 828 YAVAASMIHEDLADIGFQTACGIYEAAWSGTGLGYAFQTPEAWNTDDQYRSLCYMRPLAI 887
           YA++A+MI E L ++ FQTA GIYEAAWS TGLGY+FQTPE+WNT D+YRSL YMRPLAI
Sbjct: 797 YALSATMIQEGLVEMAFQTASGIYEAAWSETGLGYSFQTPESWNTVDEYRSLTYMRPLAI 856

Query: 888 WAMQWALTRPKPKTLE-----KQMKPEVTDESLLRYHAGFSKVARLLKLPEEQGAKSLLQ 942
           WAMQWALT+   K  +     +Q +PE+   S +++  GFS+V+RLL LP E  AKS LQ
Sbjct: 857 WAMQWALTKTSQKQEQLGLEPEQQEPELEPSSSMKHDIGFSRVSRLLSLPNEASAKSTLQ 916

Query: 943 SLFDHTCRRMF 953
           +LFD+TCRRM 
Sbjct: 917 TLFDYTCRRMM 927


>gi|413920603|gb|AFW60535.1| hypothetical protein ZEAMMB73_659971 [Zea mays]
          Length = 951

 Score = 1244 bits (3220), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 603/930 (64%), Positives = 740/930 (79%), Gaps = 10/930 (1%)

Query: 19  LNSSFDKVDPAKPASLTWQRKLSTGEIPLSQFTLNWKETVQLAPIGVRILCLIREEAAKG 78
           +N   +KV P +   LTW+ KL+     L  F L W+E +QLA IG+R+   I EE +KG
Sbjct: 22  MNGDEEKVYPGQLPVLTWEHKLTHVRHDLPPFRLTWREIMQLAGIGLRLSRHILEETSKG 81

Query: 79  KRAFIDPFIKRHLTSSHGVPLGGVGSGSIGRSYRGEFQRWQIFPRVCEDKPVLANQFSVF 138
           + A IDP  KR   S  GVPLGG    SIGRSY+G+FQRWQ+FP  CED  VLANQFS F
Sbjct: 82  RIAVIDPMKKRAARSGQGVPLGG----SIGRSYKGDFQRWQLFPGSCEDNAVLANQFSAF 137

Query: 139 VSRSNGQKYSSVLCPKTPEVLKDTTAAGIGSWDWNLKGDKSTYHALYPRAWTVHEGEPDP 198
           +SR +G+KYS+VL P  P++ K +  +GIGSWDWN+ G +STYHALYPRAWTV++GEPDP
Sbjct: 138 ISRQDGKKYSTVLHPGKPDLPKGSDISGIGSWDWNMSGQQSTYHALYPRAWTVYDGEPDP 197

Query: 199 ELRIVCRQISPIIPHNYKESSYPVSVFTYTIYNSGKTSADITLLFTWTNSVGGDSEFTGQ 258
           EL IVCRQISPIIPHNY++SSYPV+VFT+T+ NSG T+AD+TLLFTW NSVGG SE TG 
Sbjct: 198 ELNIVCRQISPIIPHNYQQSSYPVAVFTFTVTNSGHTAADVTLLFTWANSVGGKSELTGY 257

Query: 259 HYNSKTKMNDGVHAVLLHHRTSHQLPPVTFALAAQETDGVHVSLCPHFVISGNSLG--LT 316
           H NS     DGVH +LLHHRT+   P VTF +AAQE + + +S CP+FVISG+S     T
Sbjct: 258 HSNSSMIEKDGVHGILLHHRTADGQPHVTFVIAAQEKEDILISECPYFVISGSSASDEFT 317

Query: 317 AKDMWHEIKEHGSFDRLNSMETSVTSEPGSSIGAAIAASVTVPPDSEGQVTFSLAWDCPE 376
           AKDMW+ +KEHGSFD L+ ++TS+ S PGSSIGAAIAASV + P +   ++FSLAW CPE
Sbjct: 318 AKDMWNSVKEHGSFDHLDPIKTSMCSRPGSSIGAAIAASVKLAPKATQDISFSLAWACPE 377

Query: 377 VNFMSGKTYYRRYTKFYGTHQNAAANIARDAILEHGSWELQIEAWQRPILEDKRLPEWYP 436
           V F SGKTY+RRYTKFYGT  +AAA +A DAILEH SWE QIE WQ PIL+D+R P WYP
Sbjct: 378 VKFSSGKTYHRRYTKFYGTDGDAAAALAHDAILEHASWERQIEEWQDPILQDERFPAWYP 437

Query: 437 ITLFNELYYLNAGGAVWTDGSPPVHSLVTIGHRKFSLDWSQSDLKRIVDVPNQNDTAVNI 496
           +TLFNELYYLNAGG +WTDG PP+ SL  +G +KFSLD    +     ++  QN+TA +I
Sbjct: 438 VTLFNELYYLNAGGTIWTDGLPPIQSLTAVGWKKFSLDMLNGESDDFNEMVRQNNTASDI 497

Query: 497 LERMSSILEQIYTPVALNSSFGTNLLQDGEENIGQFLYLEGIEYLMWNTYDVHFYSSFAL 556
           L RM+S+ E+++  +A NS+ GT LLQ GEENIGQFLYLEGIEY MWNTYDVHFY+SF+L
Sbjct: 498 LHRMASVFERMHASIASNSAIGTTLLQ-GEENIGQFLYLEGIEYYMWNTYDVHFYASFSL 556

Query: 557 IMLFPKIQLSIQRDFAAAVMMHDPSKMKLLDDGQWVSRKVLGAVPHDIGICDPWFEVNAY 616
           IMLFPK+QLS+QRDFAAAVMMHDP K+++L DG+W +RKVLGAVPHD+G+ DPWF+VNAY
Sbjct: 557 IMLFPKLQLSVQRDFAAAVMMHDPEKLRILHDGKWAARKVLGAVPHDLGLYDPWFKVNAY 616

Query: 617 CLYDTARWKDLNPKFVLQVYRDVVATGDKKFAKAVWPSVYVAMAYMDQFDRDGDGMIEND 676
            LY+T RWKDLNPKFVLQVYRDVVATGDK FA+AVWPSVY+AMAYM+QFD+D DGMIEN+
Sbjct: 617 TLYNTDRWKDLNPKFVLQVYRDVVATGDKSFARAVWPSVYIAMAYMEQFDKDKDGMIENE 676

Query: 677 GFPDQTYDTWSVSGISAYSGGLWVAALQAASALAREVGDRGSEDYFLFKFQKAKVVYEKL 736
            FPDQTYD WS++GISAY GGLWVAALQAASALAREVGD+ SE  F  K++KAK VY KL
Sbjct: 677 DFPDQTYDVWSMAGISAYCGGLWVAALQAASALAREVGDKASEKLFWDKYEKAKFVYSKL 736

Query: 737 WNGSYFNYDNSGSSQSSSIQADQLAGQWYARACGLLPIVDEDKARSALKKVYNYNVLKVM 796
           WNGSYF+YD+  +  S+SIQADQLAGQWYA+ACGL PIVD+DKA+SAL+KVY++NV+K  
Sbjct: 737 WNGSYFSYDDGDNKMSTSIQADQLAGQWYAKACGLFPIVDKDKAQSALEKVYSFNVMKFK 796

Query: 797 GGKRGAVNGMLPDGRVDMSSMQSREIWSGVTYAVAASMIHEDLADIGFQTACGIYEAAWS 856
           GG RGA+NGM PDG VDMS+MQSREIW GVTYA+AASMI E + + GF+TA G+Y AAWS
Sbjct: 797 GGTRGAMNGMWPDGTVDMSAMQSREIWPGVTYALAASMIQEGMVEQGFKTAEGVYHAAWS 856

Query: 857 GTGLGYAFQTPEAWNTDDQYRSLCYMRPLAIWAMQWALTRPKPKTLEKQMKPEVTDESLL 916
             GLGY+FQTPEAWN DD+YRSLCYMRPLAIWAMQWAL++PK   L    + ++  +S  
Sbjct: 857 PGGLGYSFQTPEAWNNDDEYRSLCYMRPLAIWAMQWALSKPK---LHNTPQADIPQDSFP 913

Query: 917 RYHAGFSKVARLLKLPEEQGAKSLLQSLFD 946
           +    ++++ +LL+LPE++ +KS ++ +++
Sbjct: 914 KNQFAYARITKLLQLPEDESSKSFVRVVYE 943


>gi|449447581|ref|XP_004141546.1| PREDICTED: non-lysosomal glucosylceramidase-like [Cucumis sativus]
          Length = 944

 Score = 1217 bits (3149), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 599/945 (63%), Positives = 729/945 (77%), Gaps = 12/945 (1%)

Query: 15  EKEPLNSSFDKVDPAKPASLTWQRKLS-TGEIPLSQFTLNWKETVQLAPIGVRILCLIRE 73
           EK    +S  +VDP+KP SLTW+RKL  TG+ P S F+    +   +   G R+    +E
Sbjct: 2   EKGGNGASSTEVDPSKPPSLTWKRKLDFTGKSPES-FSFTLTDAWHMGMTGYRLWRNGKE 60

Query: 74  EAAKGKRAFIDPFIKRHLTSSHGVPLGGVGSGSIGRSYRGEFQRWQIFPRVCEDKPVLAN 133
           E AKG+    + F    +T  HGV LGG+G+GSIGRSYRGEFQR+Q+F   CED+PVLAN
Sbjct: 61  EIAKGRIPIYEFFSDVPITCYHGVSLGGIGAGSIGRSYRGEFQRFQMFYGPCEDEPVLAN 120

Query: 134 QFSVFVSRSNGQKYSSVLCPKTPEVLKDTTAAGIGSWDWNLKGDKSTYHALYPRAWTVHE 193
           QFSVFVSR NG K+SSVLC   P+  KD    GI SWDWNL G+ STYHAL+PR+WTV++
Sbjct: 121 QFSVFVSRPNGNKFSSVLCSAKPQKSKDGKQTGIESWDWNLSGENSTYHALFPRSWTVYD 180

Query: 194 GEPDPELRIVCRQISPIIPHNYKESSYPVSVFTYTIYNSGKTSADITLLFTWTNSVGGDS 253
           GEPDP+L+IVCRQ+SPIIPHNYKESS+PVSVFT+ + N G+TSA +TLLFTW NSVGG S
Sbjct: 181 GEPDPDLKIVCRQMSPIIPHNYKESSFPVSVFTFKLSNEGQTSAQVTLLFTWANSVGGKS 240

Query: 254 EFTGQHYNSKTKMNDGVHAVLLHHRTSHQLPPVTFALAAQETDGVHVSLCPHFVISGNSL 313
            FTG H+NSK    DG   VLLHH++++  P VT+ +AA+ TD VHVSLCP FVISG+S 
Sbjct: 241 GFTGHHFNSKMGAEDGAQGVLLHHKSANGRPTVTYGIAAEATDDVHVSLCPCFVISGDSE 300

Query: 314 GLTAKDMWHEIKEHGSFDRLNSMETSVTSEPGSSIGAAIAASVTVPPDSEGQVTFSLAWD 373
           G++AKDMW EIK HGSFD L S+  +  S+PG SIGAA+AA++T+PP S   VTFSLAWD
Sbjct: 301 GISAKDMWQEIKNHGSFDNLGSVGANEGSKPGCSIGAAVAATLTIPPTSARTVTFSLAWD 360

Query: 374 CPEVNFMSGKTYYRRYTKFYGTHQNAAANIARDAILEHGSWELQIEAWQRPILEDKRLPE 433
           CPEV F  GKTY+R+Y+KFYGT  +AA  IARDAI +HG WE +IEAWQRPI+EDKRLP+
Sbjct: 361 CPEVKF-DGKTYHRQYSKFYGTLGDAAEIIARDAISKHGKWEAEIEAWQRPIIEDKRLPD 419

Query: 434 WYPITLFNELYYLNAGGAVWTDGSPPVHSLVTIGHRKFSLDWSQSDLKRIVDVPN---QN 490
           WYP+TLFNELY+LN+GG +WTDG PP+ +L TI H+K+ L+ S+S+L      PN   + 
Sbjct: 420 WYPVTLFNELYFLNSGGTIWTDGLPPLQNLSTISHKKYFLERSKSELNG--GAPNGDHRK 477

Query: 491 DTAVNILERMSSILEQIYTPVA-LNSSFGTNLLQDGEENIGQFLYLEGIEYLMWNTYDVH 549
           D AV+ILERMS IL+Q +      N++ GT LL  GEEN+G  L +EG +YLMWNTYDVH
Sbjct: 478 DVAVDILERMSQILDQTHGGAGPSNAALGTRLLHPGEENVGNLLLVEGSQYLMWNTYDVH 537

Query: 550 FYSSFALIMLFPKIQLSIQRDFAAAVMMHDPSKMKLLDDGQWVSRKVLGAVPHDIGICDP 609
           FYSSFALIMLFPK++LSIQRDFAAAV+MHDP K K + DG WV RKVLGAVPHDIG  DP
Sbjct: 538 FYSSFALIMLFPKLELSIQRDFAAAVLMHDPRKAKTMCDGNWVPRKVLGAVPHDIGFNDP 597

Query: 610 WFEVNAYCLYDTARWKDLNPKFVLQVYRDVVATGDKKFAKAVWPSVYVAMAYMDQFDRDG 669
           W EVNAY L + +RWKDL  KFVLQVYRDVVATGDK FAK+VWPSVYVA+A+M+QFD+D 
Sbjct: 598 WLEVNAYNLLNVSRWKDLGSKFVLQVYRDVVATGDKNFAKSVWPSVYVALAFMEQFDKDK 657

Query: 670 DGMIENDGFPDQTYDTWSVSGISAYSGGLWVAALQAASALAREVGDRGSEDYFLFKFQKA 729
           DGMIEN+GFPDQTYDTW+V G+SAY GGLWVAALQAASALA EV D  +  YF  K+QKA
Sbjct: 658 DGMIENEGFPDQTYDTWTVKGVSAYCGGLWVAALQAASALASEVDDEAAAHYFWIKYQKA 717

Query: 730 KVVYEKLWNGSYFNYDNSGSSQSSSIQADQLAGQWYARACGLLPIVDEDKARSALKKVYN 789
           + VYE LWNGSYFNYDNS    SSSIQADQLAGQWYARACGL PI DE+K R AL+K+YN
Sbjct: 718 RSVYETLWNGSYFNYDNSKGPWSSSIQADQLAGQWYARACGLCPIADEEKIRVALEKIYN 777

Query: 790 YNVLKVMGGKRGAVNGMLPDGRVDMSSMQSREIWSGVTYAVAASMIHEDLADIGFQTACG 849
           +NV+KV GG RGAVNGM PDG VD S +Q +EIW+GVTY+VAA+MI E + + GFQTA G
Sbjct: 778 FNVMKVKGGTRGAVNGMFPDGSVDKSILQPKEIWAGVTYSVAATMIQEGMVETGFQTAMG 837

Query: 850 IYEAAWSGTGLGYAFQTPEAWNTDDQYRSLCYMRPLAIWAMQWA-LTRPKPKTLEKQMKP 908
           IY+AAW+  GLGY+FQTPEAW+ DD++RS+ YMRPLAIWAMQWA ++  KP  +  +   
Sbjct: 838 IYQAAWAQDGLGYSFQTPEAWDVDDRFRSIGYMRPLAIWAMQWAMMSDSKPTKVPTKAFS 897

Query: 909 EVTDESLLRYHAGFSKVARLLKLP--EEQGAKSLLQSLFDHTCRR 951
           E+ + +    HA F KVA LLKLP  ++   +SL+++ +D  C+R
Sbjct: 898 EMQESAFATQHAAFLKVASLLKLPSNDDTARRSLVEAAYDFICKR 942


>gi|297851790|ref|XP_002893776.1| hypothetical protein ARALYDRAFT_473529 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297339618|gb|EFH70035.1| hypothetical protein ARALYDRAFT_473529 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 947

 Score = 1186 bits (3067), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 577/940 (61%), Positives = 716/940 (76%), Gaps = 24/940 (2%)

Query: 24  DKVDPAKPASLTWQRKLSTGEIPLSQFTLNWKETVQLAPIGVRILCLIREEAAKGKRAFI 83
           DK+ P      +W+RKL++     S+F L+ ++ + L P+G R+    ++EAAKG+ +  
Sbjct: 15  DKLPP-----FSWERKLNSQAKTPSEFKLSKRDHLHLFPLGYRLWRHTKDEAAKGRASIF 69

Query: 84  DPFIKRHLTSSHGVPLGGVGSGSIGRSYRGEFQRWQIFPRVCEDKPVLANQFSVFVSRSN 143
           D F K H+T  HGVPLGG+G+GSIGRSY+GEFQ++++FP++CE+ P+L NQFS FVSR  
Sbjct: 70  DIFRKHHITGDHGVPLGGIGAGSIGRSYKGEFQQFKLFPKICEEAPILTNQFSAFVSRPG 129

Query: 144 GQKYSSVLCPKTPEVLKDTTA---------AGIGSWDWNLKGDKSTYHALYPRAWTVHEG 194
           G K+S+VLCP  P+V+KD             GI SWDWN+ G+KSTYHALYPR+WTV++G
Sbjct: 130 GVKHSTVLCPSKPQVIKDNGGYLCKGQVPNIGIESWDWNMTGEKSTYHALYPRSWTVYDG 189

Query: 195 EPDPELRIVCRQISPIIPHNYKESSYPVSVFTYTIYNSGKTSADITLLFTWTNSVGGDSE 254
           EPDPELRIV RQ+SP IPHNY+ESS PVSVF +T+ N+G   A +TLLFTW NSVGG S 
Sbjct: 190 EPDPELRIVSRQVSPFIPHNYEESSLPVSVFNFTVTNTGAEQAIVTLLFTWENSVGGASG 249

Query: 255 FTGQHYNSKTKMNDGVHAVLLHHRTSHQLPPVTFALAAQETDGVHVSLCPHFVISGNSLG 314
            TGQH+NS  K  DGVHAV L H+T++  PPV++A+AA+ET+ V VS CP F++SG +  
Sbjct: 250 LTGQHFNSTMKAKDGVHAVALQHKTANGHPPVSYAIAAKETEDVRVSSCPCFIVSGTTPN 309

Query: 315 -LTAKDMWHEIKEHGSFDRLNSMETSVTSEPGSSIGAAIAASVTVPPDSEGQVTFSLAWD 373
            +TA DMW EIK++ SFD+L S   S  S+PG+SIGAAIAA V VPP  +  VTFSL+WD
Sbjct: 310 KITAGDMWDEIKKNASFDKLTSNACS-PSKPGTSIGAAIAAKVKVPPGCDRTVTFSLSWD 368

Query: 374 CPEVNFMSGKTYYRRYTKFYGTHQNAAANIARDAILEHGSWELQIEAWQRPILEDKRLPE 433
           CPE  F   KTY+RRYT+FYG+  NAA  +A DA+L    WE QIE WQ PIL D  LPE
Sbjct: 369 CPEARF-DEKTYHRRYTRFYGSLGNAAVAMAHDALLNFSEWETQIEEWQAPILADTTLPE 427

Query: 434 WYPITLFNELYYLNAGGAVWTDGSPPVHSLVTIGHRKFSLDWSQSDLKRIVDVPNQNDTA 493
           WY ITLFNELYY N+GG +WTDG PP  SL +IG RK SL  S +D       P+QND A
Sbjct: 428 WYRITLFNELYYFNSGGTMWTDGLPPKQSLDSIGRRKISLSISTTDNTD----PDQNDIA 483

Query: 494 VNILERMSSILEQIYTPVALNSSFGTNLLQDGEENIGQFLYLEGIEYLMWNTYDVHFYSS 553
           ++IL R+ ++  QI+ P++ N++ GT ++Q+  ENIGQFLYLEGI+YLM+NTYDVHFYSS
Sbjct: 484 LDILGRIDAVCSQIHAPLSSNAALGTTMIQNTSENIGQFLYLEGIQYLMYNTYDVHFYSS 543

Query: 554 FALIMLFPKIQLSIQRDFAAAVMMHDPSKMKLLDDGQWVSRKVLGAVPHDIGICDPWFEV 613
           FAL+MLFPK++LSIQRDFAAAV+MHD SK +++  G++V+RK LGAVPHDIG+ DPWFEV
Sbjct: 544 FALLMLFPKLELSIQRDFAAAVLMHDSSKKQVMSSGEFVTRKALGAVPHDIGLNDPWFEV 603

Query: 614 NAYCLYDTARWKDLNPKFVLQVYRDVVATGDKKFAKAVWPSVYVAMAYMDQFDRDGDGMI 673
           NAY L++T RWKDLN KFVLQVYRDVVATGD  FAKAVWPSVY A+AY+DQFD+DGDGMI
Sbjct: 604 NAYNLFNTDRWKDLNSKFVLQVYRDVVATGDLNFAKAVWPSVYTAIAYLDQFDKDGDGMI 663

Query: 674 ENDGFPDQTYDTWSVSGISAYSGGLWVAALQAASALAREVGDRGSEDYFLFKFQKAKVVY 733
           EN+GFPDQTYD WS SG+SAY GGLWVAALQA SALAREVGD G+  YF  K++KA+ VY
Sbjct: 664 ENEGFPDQTYDAWSCSGVSAYCGGLWVAALQAGSALAREVGDNGAAVYFNAKYEKARSVY 723

Query: 734 EKLWNGSYFNYDNSGSSQSSSIQADQLAGQWYARACGLLPIVDEDKARSALKKVYNYNVL 793
           EKLWNGSYFNYDNS S  SSSI ADQLAGQWYARACGL PI  E+  + AL+ VY++NV+
Sbjct: 724 EKLWNGSYFNYDNSRSGSSSSILADQLAGQWYARACGLKPIAKEEWIKKALETVYDFNVM 783

Query: 794 KVMGGKRGAVNGMLPDGRVDMSSMQSREIWSGVTYAVAASMIHEDLADIGFQTACGIYEA 853
           +V  G RGAVNGMLPDGRVD S+M SRE+W+G TY+VAA MI E LAD GF+TA GIYEA
Sbjct: 784 RVRDGTRGAVNGMLPDGRVDTSTMVSREVWAGTTYSVAACMIQEGLADKGFRTASGIYEA 843

Query: 854 AWSGTGLGYAFQTPEAWNTDDQYRSLCYMRPLAIWAMQWALTRPKP-KTLEKQMKPEVTD 912
           AWS  GLG AFQTPEAW T+D++RSLCYMRPLAIW +QWA T P+P +  E  +K +  D
Sbjct: 844 AWSDRGLGCAFQTPEAWTTNDEFRSLCYMRPLAIWGIQWAHTMPQPNREQEVSLKQQEED 903

Query: 913 ES--LLRYHAGFSKVARLLKLPEEQGAKSLLQSLFDHTCR 950
            +  L + HAGF KVA  LK  + +  ++ LQ+ ++   R
Sbjct: 904 ATSVLFQQHAGFIKVAHYLKNTKGKDHRNRLQTAYETFLR 943


>gi|30692911|ref|NP_174631.2| Beta-glucosidase, GBA2 type protein [Arabidopsis thaliana]
 gi|79319136|ref|NP_001031135.1| Beta-glucosidase, GBA2 type protein [Arabidopsis thaliana]
 gi|332193493|gb|AEE31614.1| Beta-glucosidase, GBA2 type protein [Arabidopsis thaliana]
 gi|332193494|gb|AEE31615.1| Beta-glucosidase, GBA2 type protein [Arabidopsis thaliana]
          Length = 947

 Score = 1159 bits (2998), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 572/936 (61%), Positives = 714/936 (76%), Gaps = 24/936 (2%)

Query: 24  DKVDPAKPASLTWQRKLSTGEIPLSQFTLNWKETVQLAPIGVRILCLIREEAAKGKRAFI 83
           DK+ P      +W+RKL++     S+F L+ ++ + L P+G R+    ++EAAKG+ +  
Sbjct: 15  DKLPP-----FSWERKLNSQAKTPSEFKLSKRDHMHLFPLGYRLWRHTKDEAAKGRASIF 69

Query: 84  DPFIKRHLTSSHGVPLGGVGSGSIGRSYRGEFQRWQIFPRVCEDKPVLANQFSVFVSRSN 143
           D F K H+T  HGVPLGG+GSGSIGRSY+GEFQ++++FP++CE+ P+L NQFS FVSR  
Sbjct: 70  DIFRKHHITGDHGVPLGGIGSGSIGRSYKGEFQQFKLFPKICEEAPILTNQFSAFVSRPG 129

Query: 144 GQKYSSVLCPKTPEVLKDTTA---------AGIGSWDWNLKGDKSTYHALYPRAWTVHEG 194
           G K+S+VLCP  P+V+KD             GI SWDWN+ G+KSTYHALYPR+WTV++G
Sbjct: 130 GVKHSTVLCPSKPQVVKDNGGYLCQGQAPNMGIESWDWNMTGEKSTYHALYPRSWTVYDG 189

Query: 195 EPDPELRIVCRQISPIIPHNYKESSYPVSVFTYTIYNSGKTSADITLLFTWTNSVGGDSE 254
           EPDPELRIV RQ+SP IPHNY+ESS PVSVF +T+ N+G   A +TLLFTW NSVGG S 
Sbjct: 190 EPDPELRIVSRQVSPFIPHNYEESSLPVSVFNFTVTNTGAEPAIVTLLFTWENSVGGASG 249

Query: 255 FTGQHYNSKTKMNDGVHAVLLHHRTSHQLPPVTFALAAQETDGVHVSLCPHFVISGNSLG 314
            TGQH+NS  K  DGVHAV L H+T++  PPV++A+AA+ET+ V VS CP F++SG +  
Sbjct: 250 LTGQHFNSTMKAKDGVHAVALQHKTANGHPPVSYAIAAKETEDVRVSSCPCFIVSGTTPN 309

Query: 315 -LTAKDMWHEIKEHGSFDRLNSMETSVTSEPGSSIGAAIAASVTVPPDSEGQVTFSLAWD 373
            +TA DMW EIK++ SFD+L S   S  S+PG+SIGAAIAA V VPP  +  VTFSL+WD
Sbjct: 310 QITAGDMWDEIKKNASFDKLTSNACS-PSKPGTSIGAAIAAKVKVPPGCDRTVTFSLSWD 368

Query: 374 CPEVNFMSGKTYYRRYTKFYGTHQNAAANIARDAILEHGSWELQIEAWQRPILEDKRLPE 433
           CPE  F   KTY+RRYT+FYG+  NAA  +A DA+L    WE QIE WQ P+L D  LPE
Sbjct: 369 CPEARF-DEKTYHRRYTRFYGSLGNAAVAMAHDALLNFPEWETQIEEWQAPVLADTTLPE 427

Query: 434 WYPITLFNELYYLNAGGAVWTDGSPPVHSLVTIGHRKFSLDWSQSDLKRIVDVPNQNDTA 493
           WY +TLFNELYY N+GG +WTDG PP  SL +IG RK SL  S  D     +  +QN+ A
Sbjct: 428 WYRVTLFNELYYFNSGGTMWTDGLPPKQSLDSIGRRKISLGLSTID----KNDQDQNNVA 483

Query: 494 VNILERMSSILEQIYTPVALNSSFGTNLLQDGEENIGQFLYLEGIEYLMWNTYDVHFYSS 553
           ++IL R+ ++  QI+ P+  N++ G  ++Q+  ENIGQFLYLEG++YLM+NTYDVHFYSS
Sbjct: 484 LDILGRIDAVCSQIHAPLTSNAALGNTMIQNTTENIGQFLYLEGVQYLMYNTYDVHFYSS 543

Query: 554 FALIMLFPKIQLSIQRDFAAAVMMHDPSKMKLLDDGQWVSRKVLGAVPHDIGICDPWFEV 613
           FAL+MLFPKI+LSIQRDFAAAV+MHD SK +++  G++V+RKVLGAVPHDIG+ DPWFE+
Sbjct: 544 FALLMLFPKIELSIQRDFAAAVLMHDSSKKQVMSSGEFVTRKVLGAVPHDIGLNDPWFEL 603

Query: 614 NAYCLYDTARWKDLNPKFVLQVYRDVVATGDKKFAKAVWPSVYVAMAYMDQFDRDGDGMI 673
           NAY L++T RWKDLN KFVLQVYRDVVATGD  FAKAVWPSVY A+AY+DQFD+DGDGMI
Sbjct: 604 NAYNLFNTDRWKDLNSKFVLQVYRDVVATGDLNFAKAVWPSVYTAIAYLDQFDKDGDGMI 663

Query: 674 ENDGFPDQTYDTWSVSGISAYSGGLWVAALQAASALAREVGDRGSEDYFLFKFQKAKVVY 733
           EN+GFPDQTYD WS SG+SAY GGLWVAALQA SALARE+GD G+  YF  K++KA+ VY
Sbjct: 664 ENEGFPDQTYDAWSCSGVSAYCGGLWVAALQAGSALAREIGDNGAAVYFNAKYEKARSVY 723

Query: 734 EKLWNGSYFNYDNSGSSQSSSIQADQLAGQWYARACGLLPIVDEDKARSALKKVYNYNVL 793
           EKLWNGSYFNYDNS S  SSSI ADQ+AGQWYARACGL PI  E+  + AL+ VY++NV+
Sbjct: 724 EKLWNGSYFNYDNSRSGSSSSILADQMAGQWYARACGLKPIAKEEWIKKALETVYDFNVM 783

Query: 794 KVMGGKRGAVNGMLPDGRVDMSSMQSREIWSGVTYAVAASMIHEDLADIGFQTACGIYEA 853
           +V  G RGAVNGMLPDGRVD S+M SRE+W+G TY+VAA MI E LAD GF+TA GIYEA
Sbjct: 784 RVRDGTRGAVNGMLPDGRVDTSTMVSREVWAGTTYSVAACMIQEGLADKGFRTASGIYEA 843

Query: 854 AWSGTGLGYAFQTPEAWNTDDQYRSLCYMRPLAIWAMQWALTRPKP-KTLEKQMKPEVTD 912
           AWS  GLG AFQTPEAW T+D+YRSLCYMRPLAIW +QWA T PKP +  E  ++P+  D
Sbjct: 844 AWSDRGLGCAFQTPEAWTTNDEYRSLCYMRPLAIWGIQWAHTMPKPNREQEVSLRPQEED 903

Query: 913 ES--LLRYHAGFSKVARLLKLPEEQGAKSLLQSLFD 946
            +  L + HAGF KVA  LK  + +  ++ LQ+ ++
Sbjct: 904 ATSVLFQQHAGFIKVAHYLKNTKGKDHRNRLQTAYE 939


>gi|28393605|gb|AAO42222.1| unknown protein [Arabidopsis thaliana]
          Length = 947

 Score = 1157 bits (2992), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 571/936 (61%), Positives = 713/936 (76%), Gaps = 24/936 (2%)

Query: 24  DKVDPAKPASLTWQRKLSTGEIPLSQFTLNWKETVQLAPIGVRILCLIREEAAKGKRAFI 83
           DK+ P      +W+RKL++     S+F L+ ++ + L P+G R+    ++EAAKG+ +  
Sbjct: 15  DKLPP-----FSWERKLNSQAKTPSEFKLSKRDHMHLFPLGYRLWRHTKDEAAKGRASIF 69

Query: 84  DPFIKRHLTSSHGVPLGGVGSGSIGRSYRGEFQRWQIFPRVCEDKPVLANQFSVFVSRSN 143
           D F K H+T  HGVPLGG+GSGSIGRSY+GEFQ++++FP++CE+ P+L NQFS FVSR  
Sbjct: 70  DIFRKHHITGDHGVPLGGIGSGSIGRSYKGEFQQFKLFPKICEEAPILTNQFSAFVSRPG 129

Query: 144 GQKYSSVLCPKTPEVLKDTTA---------AGIGSWDWNLKGDKSTYHALYPRAWTVHEG 194
           G K+S+VLCP  P+V+KD             GI SWDWN+ G+KSTYHALYPR+WTV++G
Sbjct: 130 GVKHSTVLCPSKPQVVKDNGGYLCQGQAPNMGIESWDWNMTGEKSTYHALYPRSWTVYDG 189

Query: 195 EPDPELRIVCRQISPIIPHNYKESSYPVSVFTYTIYNSGKTSADITLLFTWTNSVGGDSE 254
           EPDPELRIV RQ+SP IPHNY+ESS PVSVF +T+ N+G   A +TLLFTW NSVGG S 
Sbjct: 190 EPDPELRIVSRQVSPFIPHNYEESSLPVSVFNFTVTNTGAEPAIVTLLFTWENSVGGASG 249

Query: 255 FTGQHYNSKTKMNDGVHAVLLHHRTSHQLPPVTFALAAQETDGVHVSLCPHFVISGNSLG 314
            TGQH+NS  K  DGVHAV L H+T++  PPV++A+AA+ET+ V VS CP F++SG +  
Sbjct: 250 LTGQHFNSTMKAKDGVHAVALQHKTANGHPPVSYAIAAKETEDVRVSSCPCFIVSGTTPN 309

Query: 315 -LTAKDMWHEIKEHGSFDRLNSMETSVTSEPGSSIGAAIAASVTVPPDSEGQVTFSLAWD 373
            +TA DMW EIK++ SFD+L S   S  S+PG+SIGAAIAA V VPP  +  VTFSL+WD
Sbjct: 310 QITAGDMWDEIKKNASFDKLTSNACS-PSKPGTSIGAAIAAKVKVPPGCDRTVTFSLSWD 368

Query: 374 CPEVNFMSGKTYYRRYTKFYGTHQNAAANIARDAILEHGSWELQIEAWQRPILEDKRLPE 433
           CPE  F   KTY+RRYT+FYG+  NAA  +A DA+L    WE QIE WQ P+L D  LPE
Sbjct: 369 CPEARF-DEKTYHRRYTRFYGSLGNAAVAMAHDALLNFPEWETQIEEWQAPVLADTTLPE 427

Query: 434 WYPITLFNELYYLNAGGAVWTDGSPPVHSLVTIGHRKFSLDWSQSDLKRIVDVPNQNDTA 493
           WY +TLFNELYY N+GG +WTDG PP  SL +IG RK SL  S  D     +  +QN+ A
Sbjct: 428 WYRVTLFNELYYFNSGGTMWTDGLPPKQSLDSIGRRKISLGLSTID----KNDQDQNNVA 483

Query: 494 VNILERMSSILEQIYTPVALNSSFGTNLLQDGEENIGQFLYLEGIEYLMWNTYDVHFYSS 553
           ++IL R+ ++  QI+ P+  N++ G  ++Q+  ENIGQFLYLEG++YLM+NTY VHFYSS
Sbjct: 484 LDILGRIDAVCSQIHAPLTSNAALGNTMIQNTTENIGQFLYLEGVQYLMYNTYGVHFYSS 543

Query: 554 FALIMLFPKIQLSIQRDFAAAVMMHDPSKMKLLDDGQWVSRKVLGAVPHDIGICDPWFEV 613
           FAL+MLFPKI+LSIQRDFAAAV+MHD SK +++  G++V+RKVLGAVPHDIG+ DPWFE+
Sbjct: 544 FALLMLFPKIELSIQRDFAAAVLMHDSSKKQVMSSGEFVTRKVLGAVPHDIGLNDPWFEL 603

Query: 614 NAYCLYDTARWKDLNPKFVLQVYRDVVATGDKKFAKAVWPSVYVAMAYMDQFDRDGDGMI 673
           NAY L++T RWKDLN KFVLQVYRDVVATGD  FAKAVWPSVY A+AY+DQFD+DGDGMI
Sbjct: 604 NAYNLFNTDRWKDLNSKFVLQVYRDVVATGDLNFAKAVWPSVYTAIAYLDQFDKDGDGMI 663

Query: 674 ENDGFPDQTYDTWSVSGISAYSGGLWVAALQAASALAREVGDRGSEDYFLFKFQKAKVVY 733
           EN+GFPDQTYD WS SG+SAY GGLWVAALQA SALARE+GD G+  YF  K++KA+ VY
Sbjct: 664 ENEGFPDQTYDAWSCSGVSAYCGGLWVAALQAGSALAREIGDNGAAVYFNAKYEKARSVY 723

Query: 734 EKLWNGSYFNYDNSGSSQSSSIQADQLAGQWYARACGLLPIVDEDKARSALKKVYNYNVL 793
           EKLWNGSYFNYDNS S  SSSI ADQ+AGQWYARACGL PI  E+  + AL+ VY++NV+
Sbjct: 724 EKLWNGSYFNYDNSRSGSSSSILADQMAGQWYARACGLKPIAKEEWIKKALETVYDFNVM 783

Query: 794 KVMGGKRGAVNGMLPDGRVDMSSMQSREIWSGVTYAVAASMIHEDLADIGFQTACGIYEA 853
           +V  G RGAVNGMLPDGRVD S+M SRE+W+G TY+VAA MI E LAD GF+TA GIYEA
Sbjct: 784 RVRDGTRGAVNGMLPDGRVDTSTMVSREVWAGTTYSVAACMIQEGLADKGFRTASGIYEA 843

Query: 854 AWSGTGLGYAFQTPEAWNTDDQYRSLCYMRPLAIWAMQWALTRPKP-KTLEKQMKPEVTD 912
           AWS  GLG AFQTPEAW T+D+YRSLCYMRPLAIW +QWA T PKP +  E  ++P+  D
Sbjct: 844 AWSDRGLGCAFQTPEAWTTNDEYRSLCYMRPLAIWGIQWAHTMPKPNREQEVSLRPQEED 903

Query: 913 ES--LLRYHAGFSKVARLLKLPEEQGAKSLLQSLFD 946
            +  L + HAGF KVA  LK  + +  ++ LQ+ ++
Sbjct: 904 ATSVLFQQHAGFIKVAHYLKNTKGKDHRNRLQTAYE 939


>gi|326489639|dbj|BAK01800.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 797

 Score = 1111 bits (2873), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 527/794 (66%), Positives = 647/794 (81%), Gaps = 5/794 (0%)

Query: 160 KDTTAAGIGSWDWNLKGDKSTYHALYPRAWTVHEGEPDPELRIVCRQISPIIPHNYKESS 219
           + T  +GIGSWDWNL G KSTYHALYPRAWTV++GEPDP+L+IVCRQISPIIPHNY++SS
Sbjct: 6   RGTNISGIGSWDWNLNGGKSTYHALYPRAWTVYDGEPDPDLKIVCRQISPIIPHNYQQSS 65

Query: 220 YPVSVFTYTIYNSGKTSADITLLFTWTNSVGGDSEFTGQHYNSKTKMNDGVHAVLLHHRT 279
           YP +VFT+T+ NSG T+AD+TLLFTW NSVGG SE TG H NS     DGVH +LLHHRT
Sbjct: 66  YPAAVFTFTVTNSGNTAADVTLLFTWANSVGGKSELTGNHSNSSMTEKDGVHGILLHHRT 125

Query: 280 SHQLPPVTFALAAQETDGVHVSLCPHFVISGNSLGLTAKDMWHEIKEHGSFDRLNSMETS 339
           +   PPV FA+AAQE + +H+S CP+FV+SG+S   TAKDMW+ +KEHGSFD L+ ++ S
Sbjct: 126 ADGRPPVAFAIAAQEKEDIHISECPYFVMSGSSDEFTAKDMWNSVKEHGSFDLLDPVKES 185

Query: 340 VTSEPGSSIGAAIAASVTVPPDSEGQVTFSLAWDCPEVNFMSGKTYYRRYTKFYGTHQNA 399
            +S PG+SIGAAIAAS+ + P +   V+FSLAW  PEV F SGKTY+RRYTKFYGT  +A
Sbjct: 186 TSSRPGTSIGAAIAASIKLAPQATQNVSFSLAWASPEVKFCSGKTYHRRYTKFYGTDVDA 245

Query: 400 AANIARDAILEHGSWELQIEAWQRPILEDKRLPEWYPITLFNELYYLNAGGAVWTDGSPP 459
           AA++A DAIL+H SWE QIE WQ PIL+DKR P WYP+TLFNELYYLNAGG +WTDG PP
Sbjct: 246 AASLAHDAILDHSSWETQIENWQHPILQDKRFPAWYPVTLFNELYYLNAGGTIWTDGLPP 305

Query: 460 VHSLVTIGHRKFSLDWSQSDLKRIVDVPNQNDTAVNILERMSSILEQIYTPVALNSSFGT 519
           + SL  IG +KFSLD S  +      +  Q +TA +IL +M+S+LE+I+  +A +S+ GT
Sbjct: 306 IQSLTAIGGKKFSLDMSNEETDDDSQMNPQTNTATDILHQMASVLERIHASLASSSAIGT 365

Query: 520 NLLQDGEENIGQFLYLEGIEYLMWNTYDVHFYSSFALIMLFPKIQLSIQRDFAAAVMMHD 579
            LLQ GEENIGQFLYLEGIEY MWNTYDVHFYSSF+LIMLFPK+QLS+QRDFAAAVM+HD
Sbjct: 366 TLLQ-GEENIGQFLYLEGIEYYMWNTYDVHFYSSFSLIMLFPKLQLSVQRDFAAAVMIHD 424

Query: 580 PSKMKLLDDGQWVSRKVLGAVPHDIGICDPWFEVNAYCLYDTARWKDLNPKFVLQVYRDV 639
           P K+KLL DG+  +RKVLGAVPHD+G+ DPW +VNAY L++T RWKDLNPKFVLQVYRDV
Sbjct: 425 PEKLKLLHDGKLAARKVLGAVPHDLGLYDPWIKVNAYTLHNTDRWKDLNPKFVLQVYRDV 484

Query: 640 VATGDKKFAKAVWPSVYVAMAYMDQFDRDGDGMIENDGFPDQTYDTWSVSGISAYSGGLW 699
           VATGDK FA+AVWPSVY+AMAYM+QFD+D DGMIEN+ FPDQTYD WS++G+SAY GGLW
Sbjct: 485 VATGDKSFARAVWPSVYMAMAYMEQFDKDKDGMIENEDFPDQTYDVWSMAGVSAYCGGLW 544

Query: 700 VAALQAASALAREVGDRGSEDYFLFKFQKAKVVYE-KLWNGSYFNYDNSGSSQSSSIQAD 758
           VAALQAASALA EVGD+ SE  F  K++KAK VY+ KLWNGSYFNYD++G+  S+SI AD
Sbjct: 545 VAALQAASALAHEVGDKASEKLFWNKYEKAKSVYDKKLWNGSYFNYDDAGTKASTSIHAD 604

Query: 759 QLAGQWYARACGLLPIVDEDKARSALKKVYNYNVLKVMGGKRGAVNGMLPDGRVDMSSMQ 818
           QLAGQWYA+ACGL  IVD+DK++SAL+K+Y +NV+K   G RGA+NGM PDG +DMS+MQ
Sbjct: 605 QLAGQWYAKACGLSSIVDKDKSQSALEKIYAFNVMKFKDGNRGAINGMWPDGTLDMSTMQ 664

Query: 819 SREIWSGVTYAVAASMIHEDLADIGFQTACGIYEAAWSGTGLGYAFQTPEAWNTDDQYRS 878
           SREIW GVTYA+AASMI E + D GF+TA GIY AAWS  GLGYAFQTPE+WN DD+YRS
Sbjct: 665 SREIWPGVTYALAASMIQEGMVDEGFKTAEGIYNAAWSTEGLGYAFQTPESWNNDDEYRS 724

Query: 879 LCYMRPLAIWAMQWALTRPKPKTLEKQMKPEVTDESLLRYHAGFSKVARLLKLPEEQGAK 938
           LCYMRPLAIW++QWAL+ PK   L K  + ++T +S  +    ++++A+LL+LPE++ +K
Sbjct: 725 LCYMRPLAIWSIQWALSNPK---LHKAPQTDITQDSFPKNQFSYARIAKLLQLPEDESSK 781

Query: 939 SLLQSLFDHTCRRM 952
           SL + +++    R 
Sbjct: 782 SLPRVIYEIVRNRF 795


>gi|62733283|gb|AAX95400.1| At5g49900 [Oryza sativa Japonica Group]
          Length = 931

 Score = 1088 bits (2814), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 552/911 (60%), Positives = 680/911 (74%), Gaps = 50/911 (5%)

Query: 81  AFIDPFIKRHLTSSHGVPLGGVGSGSIGRSYRGEFQRWQIFPRVCEDKPVLANQFS---- 136
           A IDP  KR   S  GVPLGG+GSGSIGRSY+GEFQRWQ+FP  CE++PVLANQFS    
Sbjct: 28  AVIDPMKKRIAKSGQGVPLGGIGSGSIGRSYKGEFQRWQLFPGTCEERPVLANQFSDRYL 87

Query: 137 --------------VFVSRSNGQKYSSVLCPKTPEVLKDTTAAGIGSWDWNLKGDKSTYH 182
                          F+SR +G+ YSSVL P  P++ K +  +GIGSWDWN+ G  STYH
Sbjct: 88  ATVNVLTVSRDYTQAFISRKDGRNYSSVLHPGKPDLPKGSNISGIGSWDWNMSGQNSTYH 147

Query: 183 ALYPRAWTVHEGEPDPELRIVCRQISPIIPHN-----YKESSYPVSVFTYTIYNSGKTSA 237
           ALYPR+WT++ G        V  Q + I+  N       E    V++ T    NSG T+A
Sbjct: 148 ALYPRSWTIYNGIAVTNNNRVYFQATTILFSNVGFCLLGEPDPDVNIVT----NSGNTAA 203

Query: 238 DITLLFTWTNSVGGDSEFTGQHYNSKTKMNDGVHAVLLHHR-----------------TS 280
           D+TLLFTW   V  +  F   +     +  DGVH +LLHHR                 T+
Sbjct: 204 DVTLLFTWA-FVINEHLFYTFNILMNDREKDGVHGILLHHRLSPKQISICLNDMYESRTA 262

Query: 281 HQLPPVTFALAAQETDGVHVSLCPHFVISGNSLGLTAKDMWHEIKEHGSFDRLNSMETSV 340
           +  PPVTFA+AAQE + +H+S CP+F+ISG+S   +AKDMW+ +KE+GSFD L+  +TS+
Sbjct: 263 NGQPPVTFAIAAQEKEDIHISECPYFIISGSSDAFSAKDMWNYVKENGSFDNLDLTKTSM 322

Query: 341 TSEPGSSIGAAIAASVTVPPDSEGQVTFSLAWDCPEVNFMSGKTYYRRYTKFYGTHQNAA 400
            S+PG SIGAAIAASV +PP +   V+F+LAW CPEV F SGKTY+RRYTKF+GT  +AA
Sbjct: 323 CSKPGLSIGAAIAASVKLPPQTTQNVSFALAWACPEVKFSSGKTYHRRYTKFHGTDNDAA 382

Query: 401 ANIARDAILEHGSWELQIEAWQRPILEDKRLPEWYPITLFNELYYLNAGGAVWTDGSPPV 460
           A++A DAILEH SWE QIE WQ PIL+D+R P+WYP+TLFNELYYLNAGG +WTDG PP+
Sbjct: 383 ASLAHDAILEHNSWERQIEEWQNPILQDERFPDWYPVTLFNELYYLNAGGTIWTDGLPPI 442

Query: 461 HSLVTIGHRKFSLDWSQSDLKRIVDVPNQNDTAVNILERMSSILEQIYTPVALNSSFGTN 520
            SL  IG +KFSLD    D      +  +N+TA +IL +M+S+LE+I+  +  NS+ GT 
Sbjct: 443 QSLTGIGEKKFSLDMQNGDADDANGIIPRNNTASDILNQMASVLERIHASMESNSAIGTT 502

Query: 521 LLQDGEENIGQFLYLEGIEYLMWNTYDVHFYSSFALIMLFPKIQLSIQRDFAAAVMMHDP 580
           LLQ GEENIGQFLYLEGIEY MWNTYDVHFY+SF+LIMLFPK+QLSIQRDFAAAV+MHDP
Sbjct: 503 LLQ-GEENIGQFLYLEGIEYYMWNTYDVHFYASFSLIMLFPKLQLSIQRDFAAAVLMHDP 561

Query: 581 SKMKLLDDGQWVSRKVLGAVPHDIGICDPWFEVNAYCLYDTARWKDLNPKFVLQVYRDVV 640
            K+++L DG+WV+RKVLGAVPHD+G+ DPWF+VNAY LY+T RWKDLNPKFVLQVYRDVV
Sbjct: 562 EKLRMLHDGKWVARKVLGAVPHDLGLYDPWFKVNAYTLYNTDRWKDLNPKFVLQVYRDVV 621

Query: 641 ATGDKKFAKAVWPSVYVAMAYMDQFDRDGDGMIENDGFPDQTYDTWSVSGISAYSGGLWV 700
           ATGDK FA+AVWPSVY+AMAYM+QFDRD DGMIEN+ FPDQTYD WS++GISAY GGLWV
Sbjct: 622 ATGDKSFARAVWPSVYMAMAYMEQFDRDKDGMIENEDFPDQTYDVWSMAGISAYCGGLWV 681

Query: 701 AALQAASALAREVGDRGSEDYFLFKFQKAKVVYEKLWNGSYFNYDNSGSSQSSSIQADQL 760
           AALQAASALA EVGD+ SE  F  K++KAK VY KLWNGSYFNYD+  +  S+SI ADQL
Sbjct: 682 AALQAASALAHEVGDKASEKLFWDKYEKAKSVYGKLWNGSYFNYDDGDNIMSASIHADQL 741

Query: 761 AGQWYARACGLLPIVDEDKARSALKKVYNYNVLKVMGGKRGAVNGMLPDGRVDMSSMQSR 820
           AGQWYA+ACGL PIVD+DKA SAL+K+Y++NV+K   GKRGA+NGM PDG VDMS+MQSR
Sbjct: 742 AGQWYAKACGLFPIVDKDKAESALEKIYSFNVMKFKDGKRGAMNGMWPDGTVDMSAMQSR 801

Query: 821 EIWSGVTYAVAASMIHEDLADIGFQTACGIYEAAWSGTGLGYAFQTPEAWNTDDQYRSLC 880
           EIW GVTYA+AA+MI E + + GF+TA GIY AAWS  GLGY+FQTPEAWN DD+YRSLC
Sbjct: 802 EIWPGVTYALAATMIQEGMVEKGFKTAEGIYHAAWSPEGLGYSFQTPEAWNNDDEYRSLC 861

Query: 881 YMRPLAIWAMQWALTRPKPKTLEKQMKPEVTDESLLRYHAGFSKVARLLKLPEEQGAKSL 940
           YMRPLAIWA+QWAL+ PK   L KQ   ++  +S  +    ++++A+LL LPE++  KS 
Sbjct: 862 YMRPLAIWAIQWALSNPK---LHKQTA-DIPQDSFPKNQFSYARIAKLLHLPEDESPKSF 917

Query: 941 LQSLFDHTCRR 951
           L+ +++    R
Sbjct: 918 LRVIYEIVRNR 928


>gi|20453181|gb|AAM19831.1| AT4g10060/T5L19_190 [Arabidopsis thaliana]
 gi|24111431|gb|AAN46866.1| At4g10060/T5L19_190 [Arabidopsis thaliana]
          Length = 922

 Score = 1087 bits (2810), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 552/934 (59%), Positives = 688/934 (73%), Gaps = 36/934 (3%)

Query: 26  VDPAKPASLTWQRKLSTGEIPLSQFTLNWKETVQLAPIGVRILCLIREEAAKGKRAFIDP 85
           V+  +   +TWQRKL++     S+F ++ ++ + L P+G R+    +EEA KG+ +  D 
Sbjct: 16  VENERLPQVTWQRKLNSKVKNPSEFKMSIRDVLHLFPLGYRLWRYTKEEAKKGRFSMYDI 75

Query: 86  FIKRHLTSSHGVPLGGVGSGSIGRSYRGEFQRWQIFPRVCEDKPVLANQFSVFVSRSNGQ 145
           F KRH+   HGVPLGG+G GSIGRSY+GEFQ++++FP++CE+ P+L NQFSVFVSR  G 
Sbjct: 76  FKKRHVRGDHGVPLGGIGGGSIGRSYKGEFQQFKLFPKICEEAPILTNQFSVFVSRPGGL 135

Query: 146 KYSSVLCPKTPEVLKDTTA-AGIGSWDWNLKGDKSTYHALYPRAWTVHEGEPDPELRIVC 204
            YS+VLCP  P+ +K  T   GI SWDWN++GDKSTYHALYPR+WTV+  EPDPELRIV 
Sbjct: 136 SYSTVLCPTKPKSVKGKTEDLGIESWDWNMEGDKSTYHALYPRSWTVY-NEPDPELRIVS 194

Query: 205 RQISPIIPHNYKESSYPVSVFTYTIYNSGKTSADITLLFTWTNSVGGDSEFTGQHYNSKT 264
           RQ+SP IPHNYKESS PVSVF +T+ N GK  A +TLLFTW NSVGG S  TG+H+NS  
Sbjct: 195 RQVSPFIPHNYKESSLPVSVFEFTMSNLGKEEATVTLLFTWENSVGGASGLTGEHFNSTI 254

Query: 265 KMNDGVHAVLLHHRTSHQLPPVTFALAAQETDGVHVSLCPHFVISGNS-LGLTAKDMWHE 323
              DGVHA++LHH+T +  PPVT+A+AAQET+ VHVS CP F++SG+S   +TAK+MW E
Sbjct: 255 MERDGVHAIVLHHKTGNGHPPVTYAIAAQETEDVHVSECPCFLVSGHSPKEITAKEMWDE 314

Query: 324 IKEHGSFDRLNSMETSVTSEPGSSIGAAIAASVTVPPDSEGQVTFSLAWDCPEVNFMSGK 383
           IK++ SFD LNS E    S PG+SIGAAIAA V VPP  +  VTFSL+WDCPEV F + K
Sbjct: 315 IKKNKSFDELNS-EPGSPSRPGTSIGAAIAAKVKVPPGCDRTVTFSLSWDCPEVRF-NEK 372

Query: 384 TYYRRYTKFYGTHQNAAANIARDAILEHGSWELQIEAWQRPILEDKRLPEWYPITLFNEL 443
           TY+RRYTKFYG   +AA  +ARDA+L +  WE QIEAWQ PIL D  LP+WY +TLFNEL
Sbjct: 373 TYHRRYTKFYGNLGDAAVKMARDALLNYVDWESQIEAWQSPILSDTTLPDWYRVTLFNEL 432

Query: 444 YYLNAGGAVWTDGSPPVHSLVTIGHRKFSLDWSQSDLKRIVDVPNQNDTAVNILERMSSI 503
           YY N+GG +WTDG PP  S+                 +  V    QND  +++ ++++++
Sbjct: 433 YYFNSGGTIWTDGLPPKESIE----------------RSKVTNTEQNDIVIDLFQKINAV 476

Query: 504 LEQIYTPVALNSSFGTNLLQDGEENIGQFLYLEGIEYLMWNTYDVHFYSSFALIMLFPKI 563
            EQIY+P + NS          EENIGQF+YLEGIEYLM+NTYDVHFYSSFAL+ LFPK+
Sbjct: 477 CEQIYSPQSSNS----------EENIGQFIYLEGIEYLMYNTYDVHFYSSFALLSLFPKL 526

Query: 564 QLSIQRDFAAAVMMHDPSKMKLLDDGQWVSRKVLGAVPHDIGICDPWFEVNAYCLYDTAR 623
            LSIQRDFAA V++ DP+K K++  G+WV+RK+LG+VPHDIG+ DPW E+N Y  ++T R
Sbjct: 527 ALSIQRDFAATVLIQDPTKKKIMSSGEWVTRKLLGSVPHDIGLNDPWLELNEYNFFNTDR 586

Query: 624 WKDLNPKFVLQVYRDVVATGDKKFAKAVWPSVYVAMAYMDQFDRDGDGMIENDGFPDQTY 683
           WKDLN KFVLQVYRDVVAT D+ FAKAVWPSVY A+AY+DQFD+D DGMIEN+GFPDQTY
Sbjct: 587 WKDLNAKFVLQVYRDVVATNDQSFAKAVWPSVYTAVAYLDQFDKDEDGMIENEGFPDQTY 646

Query: 684 DTWSVSGISAYSGGLWVAALQAASALAREVGDRGSEDYFLFKFQKAKVVYEKLWNGSYFN 743
           D WSV+G+SAY GGLWVAALQAASA A  VG+     YF  K++KAK+VYEKLWNGSYFN
Sbjct: 647 DAWSVTGVSAYCGGLWVAALQAASAFASIVGENAVAIYFNAKYEKAKIVYEKLWNGSYFN 706

Query: 744 YDNSGSSQSSSIQADQLAGQWYARACGLLPIVDEDKARSALKKVYNYNVLKVMGGKRGAV 803
           YD+SGS  SSSI ADQLAGQWYARACGL PI  E+  + AL+ +Y +NV+KV GG RGAV
Sbjct: 707 YDDSGSGSSSSILADQLAGQWYARACGLKPITKEEWIKKALETIYEFNVMKVKGGTRGAV 766

Query: 804 NGMLPDGRVDMSSMQSREIWSGVTYAVAASMIHEDLADIGFQTACGIYEAAWSGTGLGYA 863
           NGM  +G+VD +S+ S+E+W+G TY+VAA MI E   + GFQTA GIYEA WS  GL  +
Sbjct: 767 NGMSTEGKVDTNSLVSKEVWAGTTYSVAACMIQEGQREKGFQTASGIYEAIWSDRGLSCS 826

Query: 864 FQTPEAWNTDDQYRSLCYMRPLAIWAMQWALTRPKPKTLEKQM----KPEVTDESLLRYH 919
           FQTPEAWN +D+YRSLCYMRPLAIWA+QWALTR +    EKQ       E     LLR H
Sbjct: 827 FQTPEAWNMNDEYRSLCYMRPLAIWAIQWALTRTQSFGEEKQKLVAGDEEEESNLLLRQH 886

Query: 920 AGFSKVARLLKL-PEEQGAKSLLQSLFDHTCRRM 952
            GF  VAR +K+ P     +S LQ  ++   + +
Sbjct: 887 KGFKDVARFVKIVPTSNVHRSRLQHTYETVLKTL 920


>gi|240255774|ref|NP_192744.6| Beta-glucosidase, GBA2 type family protein [Arabidopsis thaliana]
 gi|332657434|gb|AEE82834.1| Beta-glucosidase, GBA2 type family protein [Arabidopsis thaliana]
          Length = 922

 Score = 1087 bits (2810), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 551/926 (59%), Positives = 685/926 (73%), Gaps = 36/926 (3%)

Query: 34  LTWQRKLSTGEIPLSQFTLNWKETVQLAPIGVRILCLIREEAAKGKRAFIDPFIKRHLTS 93
           +TWQRKL++     S+F ++ ++ + L P+G R+    +EEA KG+ +  D F KRH+  
Sbjct: 24  VTWQRKLNSKVKNPSEFKMSIRDVLHLFPLGYRLWRYTKEEAKKGRFSMYDIFKKRHVRG 83

Query: 94  SHGVPLGGVGSGSIGRSYRGEFQRWQIFPRVCEDKPVLANQFSVFVSRSNGQKYSSVLCP 153
            HGVPLGG+G GSIGRSY+GEFQ++++FP++CE+ P+L NQFSVFVSR  G  YS+VLCP
Sbjct: 84  DHGVPLGGIGGGSIGRSYKGEFQQFKLFPKICEEAPILTNQFSVFVSRPGGLSYSTVLCP 143

Query: 154 KTPEVLKDTTA-AGIGSWDWNLKGDKSTYHALYPRAWTVHEGEPDPELRIVCRQISPIIP 212
             P+ +K  T   GI SWDWN++GDKSTYHALYPR+WTV+  EPDPELRIV RQ+SP IP
Sbjct: 144 TKPKSVKGKTEDLGIESWDWNMEGDKSTYHALYPRSWTVY-NEPDPELRIVSRQVSPFIP 202

Query: 213 HNYKESSYPVSVFTYTIYNSGKTSADITLLFTWTNSVGGDSEFTGQHYNSKTKMNDGVHA 272
           HNYKESS PVSVF +T+ N GK  A +TLLFTW NSVGG S  TG+H+NS     DGVHA
Sbjct: 203 HNYKESSLPVSVFEFTMSNLGKEEATVTLLFTWENSVGGASGLTGEHFNSTIMERDGVHA 262

Query: 273 VLLHHRTSHQLPPVTFALAAQETDGVHVSLCPHFVISGNS-LGLTAKDMWHEIKEHGSFD 331
           ++LHH+T +  PPVT+A+AAQET+ VHVS CP F++SG+S   +TAK+MW EIK++ SFD
Sbjct: 263 IVLHHKTGNGHPPVTYAIAAQETEDVHVSECPCFLVSGHSPKEITAKEMWDEIKKNKSFD 322

Query: 332 RLNSMETSVTSEPGSSIGAAIAASVTVPPDSEGQVTFSLAWDCPEVNFMSGKTYYRRYTK 391
            LNS E    S PG+SIGAAIAA V VPP  +  VTFSL+WDCPEV F + KTY+RRYTK
Sbjct: 323 ELNS-EPGSPSRPGTSIGAAIAAKVKVPPGCDRTVTFSLSWDCPEVRF-NEKTYHRRYTK 380

Query: 392 FYGTHQNAAANIARDAILEHGSWELQIEAWQRPILEDKRLPEWYPITLFNELYYLNAGGA 451
           FYG   +AA  +ARDA+L +  WE QIEAWQ PIL D  LP+WY +TLFNELYY N+GG 
Sbjct: 381 FYGNLGDAAVKMARDALLNYVDWESQIEAWQSPILSDTTLPDWYRVTLFNELYYFNSGGT 440

Query: 452 VWTDGSPPVHSLVTIGHRKFSLDWSQSDLKRIVDVPNQNDTAVNILERMSSILEQIYTPV 511
           +WTDG PP  S+                 +  V    QND  +++ ++++++ EQIY+P 
Sbjct: 441 IWTDGLPPKESIE----------------RSKVTNTEQNDIVIDLFQKINAVCEQIYSPQ 484

Query: 512 ALNSSFGTNLLQDGEENIGQFLYLEGIEYLMWNTYDVHFYSSFALIMLFPKIQLSIQRDF 571
           + NS          EENIGQF+YLEGIEYLM+NTYDVHFYSSFAL+ LFPK+ LSIQRDF
Sbjct: 485 SSNS----------EENIGQFIYLEGIEYLMYNTYDVHFYSSFALLSLFPKLALSIQRDF 534

Query: 572 AAAVMMHDPSKMKLLDDGQWVSRKVLGAVPHDIGICDPWFEVNAYCLYDTARWKDLNPKF 631
           AA V++ DP+K K++  G+WV+RK+LG+VPHDIG+ DPW E+N Y  ++T RWKDLN KF
Sbjct: 535 AATVLIQDPTKKKIMSSGEWVTRKLLGSVPHDIGLNDPWLELNEYNFFNTDRWKDLNAKF 594

Query: 632 VLQVYRDVVATGDKKFAKAVWPSVYVAMAYMDQFDRDGDGMIENDGFPDQTYDTWSVSGI 691
           VLQVYRDVVAT D+ FAKAVWPSVY A+AY+DQFD+D DGMIEN+GFPDQTYD WSV+G+
Sbjct: 595 VLQVYRDVVATNDQSFAKAVWPSVYTAVAYLDQFDKDEDGMIENEGFPDQTYDAWSVTGV 654

Query: 692 SAYSGGLWVAALQAASALAREVGDRGSEDYFLFKFQKAKVVYEKLWNGSYFNYDNSGSSQ 751
           SAY GGLWVAALQAASA A  VG+     YF  K++KAK+VYEKLWNGSYFNYD+SGS  
Sbjct: 655 SAYCGGLWVAALQAASAFASIVGENAVAIYFNAKYEKAKIVYEKLWNGSYFNYDDSGSGS 714

Query: 752 SSSIQADQLAGQWYARACGLLPIVDEDKARSALKKVYNYNVLKVMGGKRGAVNGMLPDGR 811
           SSSI ADQLAGQWYARACGL PI  E+  + AL+ +Y +NV+KV GG RGAVNGM  +G+
Sbjct: 715 SSSILADQLAGQWYARACGLKPITKEEWIKKALETIYEFNVMKVKGGTRGAVNGMSTEGK 774

Query: 812 VDMSSMQSREIWSGVTYAVAASMIHEDLADIGFQTACGIYEAAWSGTGLGYAFQTPEAWN 871
           VD +S+ S+E+W+G TY+VAA MI E   + GFQTA GIYEA WS  GL  +FQTPEAWN
Sbjct: 775 VDTNSLVSKEVWAGTTYSVAACMIQEGQREKGFQTASGIYEAIWSDRGLSCSFQTPEAWN 834

Query: 872 TDDQYRSLCYMRPLAIWAMQWALTRPKPKTLEKQM----KPEVTDESLLRYHAGFSKVAR 927
            +D+YRSLCYMRPLAIWA+QWALTR +    EKQ       E     LLR H GF  VAR
Sbjct: 835 MNDEYRSLCYMRPLAIWAIQWALTRTQSFGEEKQKLVAGDEEEESNLLLRQHKGFKDVAR 894

Query: 928 LLKL-PEEQGAKSLLQSLFDHTCRRM 952
            +K+ P     +S LQ  ++   + +
Sbjct: 895 FVKIVPTSNVHRSRLQHTYETVLKTL 920


>gi|297809209|ref|XP_002872488.1| AT4g10060/T5L19_190 [Arabidopsis lyrata subsp. lyrata]
 gi|297318325|gb|EFH48747.1| AT4g10060/T5L19_190 [Arabidopsis lyrata subsp. lyrata]
          Length = 921

 Score = 1075 bits (2779), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 548/919 (59%), Positives = 682/919 (74%), Gaps = 35/919 (3%)

Query: 34  LTWQRKLSTGEIPLSQFTLNWKETVQLAPIGVRILCLIREEAAKGKRAFIDPFIKRHLTS 93
           +TWQRKL++     S+F ++ ++ + L PIG R+    +EEA KG+ +  D F KRH+T 
Sbjct: 24  VTWQRKLNSKVKNPSEFKMSIRDVLHLFPIGYRLWSHTKEEAKKGRASMYDIFKKRHVTG 83

Query: 94  SHGVPLGGVGSGSIGRSYRGEFQRWQIFPRVCEDKPVLANQFSVFVSRSNGQKYSSVLCP 153
            HGVPLGG+G GSIGRSY+GEFQ++++FP+VCE+ P+L NQFSVFV RS G  YSSVLCP
Sbjct: 84  DHGVPLGGIGGGSIGRSYKGEFQQFKLFPKVCEEAPILTNQFSVFVMRSGGVNYSSVLCP 143

Query: 154 KTPEVLKDTTAA-GIGSWDWNLKGDKSTYHALYPRAWTVHEGEPDPELRIVCRQISPIIP 212
             P+ ++  T   GI SWDWN+KG KS YHALYPR+WT++  EPDPELRIV RQISP IP
Sbjct: 144 TKPKSVRGKTEELGIESWDWNMKGYKSMYHALYPRSWTIY-NEPDPELRIVSRQISPFIP 202

Query: 213 HNYKESSYPVSVFTYTIYNSGKTSADITLLFTWTNSVGGDSEFTGQHYNSKTKMNDGVHA 272
           HNYKESS PVSVFT+T+ N GK  A +TLLFTW NSVGG S  TG+H+NS     DGVHA
Sbjct: 203 HNYKESSLPVSVFTFTVTNLGKEEATVTLLFTWENSVGGASGLTGEHFNSTMMERDGVHA 262

Query: 273 VLLHHRTSHQLPPVTFALAAQETDGVHVSLCPHFVISGNS-LGLTAKDMWHEIKEHGSFD 331
           + LHH+TS+  PPVT+A+AAQET+ VHVS C  F++SG+S   +TAK+MW EIK++ SFD
Sbjct: 263 IALHHKTSNGHPPVTYAIAAQETEDVHVSECSCFLVSGHSPKDITAKEMWDEIKKNKSFD 322

Query: 332 RLNSMETSVTSEPGSSIGAAIAASVTVPPDSEGQVTFSLAWDCPEVNFMSGKTYYRRYTK 391
            LNS E    S PG+SIGAAIAA+V VPP  +  V FSL+WDCPEV F + KTY+RRYTK
Sbjct: 323 GLNS-EPGSPSRPGTSIGAAIAANVKVPPGCDRTVMFSLSWDCPEVRF-NEKTYHRRYTK 380

Query: 392 FYGTHQNAAANIARDAILEHGSWELQIEAWQRPILEDKRLPEWYPITLFNELYYLNAGGA 451
           FYG   +AA  +ARDA+L +  WE QIEAWQ PIL D  LP+WY +TLFNELYY N+GG 
Sbjct: 381 FYGNLGDAAVKMARDALLNYVDWESQIEAWQSPILSDTALPDWYRVTLFNELYYFNSGGT 440

Query: 452 VWTDGSPPVHSLVTIGHRKFSLDWSQSDLKRIVDVPNQNDTAVNILERMSSILEQIYTPV 511
           +WTDG PP   +               +  +I++   QN+    + ++++ I  QI++P 
Sbjct: 441 IWTDGLPPKERI---------------ERSKIINT-EQNNIVTELFKKINVICAQIHSPQ 484

Query: 512 ALNSSFGTNLLQDGEENIGQFLYLEGIEYLMWNTYDVHFYSSFALIMLFPKIQLSIQRDF 571
           +LN+          EE+IGQF+YLEGIEYLM+NTYDVHFYSSFAL+ LFPK++LSIQRDF
Sbjct: 485 SLNA----------EEHIGQFIYLEGIEYLMYNTYDVHFYSSFALLSLFPKLELSIQRDF 534

Query: 572 AAAVMMHDPSKMKLLDDGQWVSRKVLGAVPHDIGICDPWFEVNAYCLYDTARWKDLNPKF 631
           AA V+  DP+K +++  G+WV RK+LG+VPHDIG+ DPW E+N Y  ++T RWKDLN KF
Sbjct: 535 AATVLFQDPTKKQIMSSGEWVPRKLLGSVPHDIGLNDPWLELNEYNFFNTDRWKDLNAKF 594

Query: 632 VLQVYRDVVATGDKKFAKAVWPSVYVAMAYMDQFDRDGDGMIENDGFPDQTYDTWSVSGI 691
           VLQVYRDVV+TGD+ FAKAVWPSVY A+AY+DQFD+D D MIEN+GFPDQTYD WSV G+
Sbjct: 595 VLQVYRDVVSTGDQSFAKAVWPSVYTAVAYLDQFDKDEDEMIENEGFPDQTYDAWSVKGV 654

Query: 692 SAYSGGLWVAALQAASALAREVGDRGSEDYFLFKFQKAKVVYEKLWNGSYFNYDNSGSSQ 751
           SAY GGLWVAALQAASA A  VG+ G   YF  K++KAK VYEKLWNGSYFNYD+SGS  
Sbjct: 655 SAYCGGLWVAALQAASAFASIVGENGVAIYFNTKYEKAKSVYEKLWNGSYFNYDDSGSGS 714

Query: 752 SSSIQADQLAGQWYARACGLLPIVDEDKARSALKKVYNYNVLKVMGGKRGAVNGMLPDGR 811
           SSSI ADQLAGQWYARACGL PI  E+  + AL+ +Y +NV+KV GG RGAVNGM PDG+
Sbjct: 715 SSSILADQLAGQWYARACGLKPITKEEWIKKALETIYEFNVMKVKGGTRGAVNGMSPDGQ 774

Query: 812 VDMSSMQSREIWSGVTYAVAASMIHEDLADIGFQTACGIYEAAWSGTGLGYAFQTPEAWN 871
           VD +S+ S+E+W+G TY+VAA MI E   + GFQTA GIYEA WS  GL  +FQTPEAWN
Sbjct: 775 VDTNSLVSKEVWAGTTYSVAACMIQEGQRERGFQTASGIYEAVWSDRGLSCSFQTPEAWN 834

Query: 872 TDDQYRSLCYMRPLAIWAMQWALTRPKPKTLEKQMKPEVTDES---LLRYHAGFSKVARL 928
            +D+YRSLCYMRPLAIW++QWALTR +    EK+   +  +E    LLR H GF  VAR 
Sbjct: 835 MNDEYRSLCYMRPLAIWSIQWALTRTQSFGEEKEKLVKGEEEESDLLLRQHKGFKDVARF 894

Query: 929 LKL-PEEQGAKSLLQSLFD 946
           +K+ P     +S LQ  ++
Sbjct: 895 VKIVPTRNEHRSRLQHTYE 913


>gi|115474457|ref|NP_001060825.1| Os08g0111200 [Oryza sativa Japonica Group]
 gi|42408344|dbj|BAD09496.1| putative Bile acid beta-glucosidase [Oryza sativa Japonica Group]
 gi|42408391|dbj|BAD09542.1| putative Bile acid beta-glucosidase [Oryza sativa Japonica Group]
 gi|113622794|dbj|BAF22739.1| Os08g0111200 [Oryza sativa Japonica Group]
 gi|215701490|dbj|BAG92914.1| unnamed protein product [Oryza sativa Japonica Group]
          Length = 928

 Score = 1040 bits (2689), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 530/934 (56%), Positives = 671/934 (71%), Gaps = 27/934 (2%)

Query: 26  VDPAKPASLTWQRKLSTGEIPLSQFTLNWKETVQLAPIGVRILCLIREEAAKGKRAFIDP 85
           +D  KP +++W+R        ++ F++   + + + P+ +++L L  ++ AKG  +  DP
Sbjct: 15  MDCEKPPAISWERTFDDEGKKVAMFSMTLNDMMAIVPLMIKMLGLNLKDNAKGLASVYDP 74

Query: 86  FIKRHLTSSHGVPLGGVGSGSIGRSYRGEFQRWQIFPRVCEDKPVLANQFSVFVSRSNGQ 145
             K       GVPLGG+G+GSIGRSYRG FQ++QIFP + E+KP+LANQFS F+SR +G+
Sbjct: 75  LKKWMDNCYRGVPLGGIGAGSIGRSYRGYFQQFQIFPSIYEEKPILANQFSAFISRPDGK 134

Query: 146 KYSSVLCPKTPEVLKDTTAAGIGSWDWNLKGDKSTYHALYPRAWTVHEGEPDPELRIVCR 205
           +YS+VL     +VLK    AGIGSWDW LK    TYH L+PR+WTV+ GEPDPE++I CR
Sbjct: 135 RYSTVLSAPNADVLKGIDKAGIGSWDWKLKEKNCTYHGLFPRSWTVYNGEPDPEIKITCR 194

Query: 206 QISPIIPHNYKESSYPVSVFTYTIYNSGKTSADITLLFTWTNSVGGDSEFTGQHYNSKTK 265
           QISP IPHNYKESS+PV+VFT+T++NSG T AD+TLLFTW NSVGG SE TG H NS+  
Sbjct: 195 QISPFIPHNYKESSFPVAVFTFTLHNSGSTPADVTLLFTWANSVGGKSELTGNHKNSRMT 254

Query: 266 MNDGVHAVLLHHRTSHQLPPVTFALAAQETDGVHVSLCPHFVIS-GNSLGLTAKDMWHEI 324
             DG              PPVTFA+A++ETDGV V+ CP F +   NS  +TAKDMW +I
Sbjct: 255 TADG-------------RPPVTFAIASRETDGVRVTGCPRFTMGPSNSGDVTAKDMWDQI 301

Query: 325 KEHGSF--DRLNSMETSVTSEPGSSIGAAIAASVTVPPDSEGQVTFSLAWDCPEVNFMSG 382
            ++GSF  D   +   +  S PGSSIGAA+AA+ TV       V+F+L+W CPEV F +G
Sbjct: 302 NKNGSFVGDGNAAAAATGASRPGSSIGAAVAATTTVAAGGARAVSFALSWSCPEVKFPAG 361

Query: 383 KTYYRRYTKFYGTHQNAAA-NIARDAILEHGSWELQIEAWQRPILEDKRLPEWYPITLFN 441
           +TY+RRYTKF+GT ++AAA  +A DA+LEH  WE QIE WQRPIL+DK LPEWYPITLFN
Sbjct: 362 RTYHRRYTKFHGTDRDAAAERLAHDALLEHMKWESQIEEWQRPILQDKSLPEWYPITLFN 421

Query: 442 ELYYLNAGGAVWTDGSPPVHSLVTIGHRKFSLDWSQSDLKRIVDVPNQNDTAVNILERMS 501
           ELYYLNAGG +WTDG PP ++ ++     F+LD           V N       IL  ++
Sbjct: 422 ELYYLNAGGTIWTDGQPPKNTSLSSATEPFNLD-------TFSTVANGGSAVDGILSTVA 474

Query: 502 SILEQIYTPVALNSSFGTNLLQDGEENIGQFLYLEGIEYLMWNTYDVHFYSSFALIMLFP 561
               +  T  A  ++ GT LL+DGEEN+GQ LYLEG+EY MWNTYDVHFY+SFAL+ LFP
Sbjct: 475 VAAARSNTAAAAAAAMGTALLRDGEENVGQVLYLEGMEYNMWNTYDVHFYASFALLSLFP 534

Query: 562 KIQLSIQRDFAAAVMMHDPSKMKLLDDGQWVSRKVLGAVPHDIGICDPWFEVNAYCLYDT 621
           +++L++QRDF   V++HDP   + L DG  V+RKVLGAVPHD+G+ DPWFEVNAY L+D 
Sbjct: 535 ELELNLQRDFVRGVLLHDPCLRRTL-DGATVARKVLGAVPHDMGLNDPWFEVNAYMLHDP 593

Query: 622 ARWKDLNPKFVLQVYRDVVATGDKKFAKAVWPSVYVAMAYMDQFDRDGDGMIENDGFPDQ 681
            RWKDLNPKFVLQVYRDVVATG+  FA+A WP+VY+AMAYMDQFDRDGDGM+EN+G PDQ
Sbjct: 594 VRWKDLNPKFVLQVYRDVVATGNAGFAEAAWPAVYLAMAYMDQFDRDGDGMVENEGRPDQ 653

Query: 682 TYDTWSVSGISAYSGGLWVAALQAASALAREVGDRGSEDYFLFKFQKAKVVY-EKLWNGS 740
           TYD WSVSG+SAY+GGLWVAALQAA+A+A  VGD  +E YF  ++ +A+ VY ++LWNG 
Sbjct: 654 TYDLWSVSGVSAYTGGLWVAALQAAAAMAGIVGDGAAEAYFRGRYHRARRVYTDELWNGG 713

Query: 741 YFNYDNSGSSQSSSIQADQLAGQWYARACGLLPIVDEDKARSALKKVYNYNVLKVMGGKR 800
           YFNYDNSG + SSSIQADQLAGQWYARACGL PIVD DKAR AL  V +YNV++V GG  
Sbjct: 714 YFNYDNSGGATSSSIQADQLAGQWYARACGLEPIVDGDKARRALATVLDYNVMRVKGGAI 773

Query: 801 GAVNGMLPDGRVDMSSMQSREIWSGVTYAVAASMIHEDLADIGFQTACGIYEAAWSGTGL 860
           GAVNGM PDG VD SS QS+E+W GVTYAVAA+MIHE + +  F+TA GI++A W   G 
Sbjct: 774 GAVNGMRPDGAVDASSTQSKEVWPGVTYAVAAAMIHEGMPEAAFKTAKGIHDAGWGKHGF 833

Query: 861 GYAFQTPEAWNTDDQYRSLCYMRPLAIWAMQWALTRPKPKTLEKQMKPEVTDESLLRYHA 920
           GYAFQTPE+W  D  YR+L YMRPL +WAMQWAL+ P      +      + E       
Sbjct: 834 GYAFQTPESWTADGGYRALHYMRPLGVWAMQWALSPPVLHKEHRVAAVAASPEDAALGQE 893

Query: 921 GFSKVARLLKLPEEQGAKSLLQSLFDHTCRRMFI 954
            F KVA +L+LPEEQ  K +L++L+D T R++ +
Sbjct: 894 KFDKVASMLRLPEEQQHKGILRALYD-TLRQLLL 926


>gi|357144426|ref|XP_003573288.1| PREDICTED: non-lysosomal glucosylceramidase-like isoform 1
           [Brachypodium distachyon]
          Length = 932

 Score = 1022 bits (2643), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 538/938 (57%), Positives = 676/938 (72%), Gaps = 32/938 (3%)

Query: 26  VDPAKPASLTWQRKLSTGEIPLSQFTLNWKETVQLAPIGVRILCLIREEAAKGKRAFIDP 85
           +D A+PA+ TWQRK       +  F++   + + + P+  + L    EE  KG+    DP
Sbjct: 15  IDCAQPATRTWQRKFDDEGKKIEMFSMTMNDMILIVPMIFKGLVRNAEERGKGRDVIYDP 74

Query: 86  FIKRHLTSSHGVPLGGVGSGSIGRSYRGEFQRWQIFPRVCEDKPVLANQFSVFVSRSNGQ 145
           F K       G+PLGG+G+GSIGRSYRG FQ +QIFP + E KP+LANQFS F+SR NG+
Sbjct: 75  FKKWMDNCYRGLPLGGLGAGSIGRSYRGYFQHFQIFPAIYEQKPILANQFSAFISRPNGK 134

Query: 146 KYSSVLCPKTPEVLKDTTAAGIGSWDWNLKGDKSTYHALYPRAWTVHEGEPDPELRIVCR 205
            YS+VL   T +VLK    AGIGSWDW LK    TYHAL+PR+WTV++GEPDP ++I CR
Sbjct: 135 SYSTVLSAPTADVLKGVDKAGIGSWDWKLKEKNCTYHALFPRSWTVYDGEPDPAIKITCR 194

Query: 206 QISPIIPHNYKESSYPVSVFTYTIYNSGKTSADITLLFTWTNSVGGDSEFTGQHYNSKTK 265
           QISP IPHNYKESS+PV VFT+T++N+G T AD+TLLFTW NSVGG SE TG H NSK  
Sbjct: 195 QISPFIPHNYKESSFPVVVFTFTVHNTGSTPADVTLLFTWANSVGGRSELTGNHTNSKMM 254

Query: 266 MNDGVHAVLLHHRTSHQ-LPPVTFALAAQETDGVHVSLCPHFVISGNSLG-LTAKDMWHE 323
             DGVH VLLHHRT++   PPVTFA+A+QET+ V V+ CP F +  +S G  TAKDMW E
Sbjct: 255 ERDGVHGVLLHHRTANDGHPPVTFAIASQETEDVRVTDCPSFTMGSSSSGDFTAKDMWQE 314

Query: 324 IKEHGSFDRLNSMETSVTSEPGSSIGAAIAASVTVPPDSEGQVTFSLAWDCPEVNFMSGK 383
           IK+HGSF  + + +    S+PGSSIGAA+AA+ TVP      V+F+L+W CPEV F +G+
Sbjct: 315 IKKHGSF--ITTDKEPRASKPGSSIGAAVAAATTVPAGGTRVVSFALSWSCPEVKFPAGR 372

Query: 384 TYYRRYTKFYGTHQNAAA-NIARDAILEHGSWELQIEAWQRPILEDKRLPEWYPITLFNE 442
           TY+RRYT F+G  ++AAA ++A DA+LEH  WE +IE WQRPIL+DKRLP+WYP+ LFNE
Sbjct: 373 TYHRRYTTFHGLDRDAAAESLAHDALLEHMDWESKIEEWQRPILQDKRLPDWYPVALFNE 432

Query: 443 LYYLNAGGAVWTDGSPPVHSLVT--IGHRKFSLDWSQSDLKRIVDVPNQNDTAVN-ILER 499
           LYYLNAGG +WTDG PP  S+ +  +G   FSL            +P    TAV+ IL  
Sbjct: 433 LYYLNAGGTIWTDGLPPKMSVSSSGVGTEPFSL--------ATTYLPG---TAVDGILSA 481

Query: 500 MSSILEQIYTPVALNSSFGTNLLQDGEENIGQFLYLEGIEYLMWNTYDVHFYSSFALIML 559
           ++S  E  ++  A    FG  LL DGEEN+GQFLYLE  EY M+NTYDVHFY+SFAL+ L
Sbjct: 482 VASATEDQHSAAA----FGATLLGDGEENVGQFLYLEATEYYMYNTYDVHFYASFALLSL 537

Query: 560 FPKIQLSIQRDFAAAVMMHDPSKMKLLDDGQWVSRKVLGAVPHDIGICDPWFEVNAYCLY 619
           FP+++LS+QRDFA AV++HDP  M  L DG  V RKVLGAVPHDIG+ DPWFE+NAY ++
Sbjct: 538 FPELELSLQRDFARAVLIHDPRPMVTL-DGATVLRKVLGAVPHDIGLNDPWFELNAYMIH 596

Query: 620 DTARWKDLNPKFVLQVYRDVVATGDKKFAKAVWPSVYVAMAYMDQFDRDGDGMIENDGFP 679
           D ARWKDLNPKFVLQVYR VV TG+  FAKA WP+VY+AMAYMDQ+DRDGDGM+EN+G P
Sbjct: 597 DPARWKDLNPKFVLQVYRAVVVTGNAAFAKAAWPAVYLAMAYMDQYDRDGDGMVENEG-P 655

Query: 680 DQTYDTWSVSGISAYSGGLWVAALQAASALAREVGDRGSEDYFLFKFQKAKVVYE-KLWN 738
           DQTYD WSV+G+SAY+GG+WV ALQA +A+AR VGD  +E YF  ++ +AK VY+ +LWN
Sbjct: 656 DQTYDLWSVTGVSAYTGGIWVTALQATAAMARIVGDGDAECYFHGRYLRAKQVYDAELWN 715

Query: 739 GSYFNYDNSGSSQSSSIQADQLAGQWYARACGLLPIVDEDKARSALKKVYNYNVLKVMGG 798
           G+YF YDNSG   S SI ADQLAGQWYA  CGL P+V+EDKARSAL  V +YNV++V GG
Sbjct: 716 GTYFRYDNSGGETSESIMADQLAGQWYAHVCGLEPVVEEDKARSALATVLDYNVMRVKGG 775

Query: 799 KRGAVNGMLPDGRVDMSSMQSREIWSGVTYAVAASMIHEDLADIGFQTACGIYEAAWSGT 858
             GAVNGM PDG +DMSS QS+EIWSG TYAVAA+M+HE + +  F+TA G ++A+W   
Sbjct: 776 AVGAVNGMRPDGGIDMSSTQSKEIWSGTTYAVAAAMVHEGMPEGAFRTAKGAHDASWGKA 835

Query: 859 GLGYAFQTPEAWNTDDQYRSLCYMRPLAIWAMQWALTRPKPKTLEKQMKPEVTDE--SLL 916
           G GYAFQTPEAW  +  YR L YMRPL++WAMQWAL+ P+   L K +            
Sbjct: 836 GFGYAFQTPEAWTAEGGYRGLHYMRPLSVWAMQWALSPPE---LHKDLGLASVSPPGDAA 892

Query: 917 RYHAGFSKVARLLKLPEEQGAKSLLQSLFDHTCRRMFI 954
           R    F KVA +L+LPE+   K    +L+ H+ R++ +
Sbjct: 893 RGREKFDKVAGMLRLPEKVQHKGFFLALY-HSLRQLVL 929


>gi|222639789|gb|EEE67921.1| hypothetical protein OsJ_25789 [Oryza sativa Japonica Group]
          Length = 919

 Score = 1020 bits (2638), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 521/934 (55%), Positives = 663/934 (70%), Gaps = 36/934 (3%)

Query: 26  VDPAKPASLTWQRKLSTGEIPLSQFTLNWKETVQLAPIGVRILCLIREEAAKGKRAFIDP 85
           +D  KP +++W+R        ++ F++   + + + P+ +++L L  ++ AKG  +  DP
Sbjct: 15  MDCEKPPAISWERTFDDEGKKVAMFSMTLNDMMAIVPLMIKMLGLNLKDNAKGLASVYDP 74

Query: 86  FIKRHLTSSHGVPLGGVGSGSIGRSYRGEFQRWQIFPRVCEDKPVLANQFSVFVSRSNGQ 145
             K       GVPLGG+G+GSIGRSYRG FQ++QIFP + E+KP+LANQFS F+SR +G+
Sbjct: 75  LKKWMDNCYRGVPLGGIGAGSIGRSYRGYFQQFQIFPSIYEEKPILANQFSAFISRPDGK 134

Query: 146 KYSSVLCPKTPEVLKDTTAAGIGSWDWNLKGDKSTYHALYPRAWTVHEGEPDPELRIVCR 205
           +YS+VL     +VLK    AGIGSWDW LK    TYH L+PR+WTV+ GEPDPE++I CR
Sbjct: 135 RYSTVLSAPNADVLKGIDKAGIGSWDWKLKEKNCTYHGLFPRSWTVYNGEPDPEIKITCR 194

Query: 206 QISPIIPHNYKESSYPVSVFTYTIYNSGKTSADITLLFTWTNSVGGDSEFTGQHYNSKTK 265
           QISP IPHNYKESS+PV+VFT+T++NSG T AD+TLLFTW NSVGG SE TG H NS+  
Sbjct: 195 QISPFIPHNYKESSFPVAVFTFTLHNSGSTPADVTLLFTWANSVGGKSELTGNHKNSRMT 254

Query: 266 MNDGVHAVLLHHRTSHQLPPVTFALAAQETDGVHVSLCPHFVIS-GNSLGLTAKDMWHEI 324
             DGVH VLLHHRT+   PPVTFA+A++ETDGV V+ CP F +   NS  +TAKDMW +I
Sbjct: 255 ARDGVHGVLLHHRTADGRPPVTFAIASRETDGVRVTGCPRFTMGPSNSGDVTAKDMWDQI 314

Query: 325 KEHGSF--DRLNSMETSVTSEPGSSIGAAIAASVTVPPDSEGQVTFSLAWDCPEVNFMSG 382
            ++GSF  D   +   +  S PGSSIGAA+AA+ TV       V+F+L+W CPEV F +G
Sbjct: 315 NKNGSFVGDGNAAAAATGASRPGSSIGAAVAATTTVAAGGARAVSFALSWSCPEVKFPAG 374

Query: 383 KTYYRRYTKFYGTHQNAAA-NIARDAILEHGSWELQIEAWQRPILEDKRLPEWYPITLFN 441
           +TY+RRYTKF+GT ++AAA  +A DA+LEH  WE QIE WQRPIL+DK LPEWYPITLFN
Sbjct: 375 RTYHRRYTKFHGTDRDAAAERLAHDALLEHMKWESQIEEWQRPILQDKSLPEWYPITLFN 434

Query: 442 ELYYLNAGGAVWTDGSPPVHSLVTIGHRKFSLDWSQSDLKRIVDVPNQNDTAVNILERMS 501
           ELYYLNAGG +WTDG PP ++ ++     F+LD           V N       IL  ++
Sbjct: 435 ELYYLNAGGTIWTDGQPPKNTSLSSATEPFNLD-------TFSTVANGGSAVDGILSTVA 487

Query: 502 SILEQIYTPVALNSSFGTNLLQDGEENIGQFLYLEGIEYLMWNTYDVHFYSSFALIMLFP 561
               +  T  A  ++ GT LL+DGEEN+GQ LYLEG+EY MWNTYDVHFY+SFAL+ LFP
Sbjct: 488 VAAARSNTAAAAAAAMGTALLRDGEENVGQVLYLEGMEYNMWNTYDVHFYASFALLSLFP 547

Query: 562 KIQLSIQRDFAAAVMMHDPSKMKLLDDGQWVSRKVLGAVPHDIGICDPWFEVNAYCLYDT 621
           +++L++QRDF   V++HDP   + L DG  V+RKVLGAVPHD+G+ DPWFEVNAY L+D 
Sbjct: 548 ELELNLQRDFVRGVLLHDPCLRRTL-DGATVARKVLGAVPHDMGLNDPWFEVNAYMLHDP 606

Query: 622 ARWKDLNPKFVLQVYRDVVATGDKKFAKAVWPSVYVAMAYMDQFDRDGDGMIENDGFPDQ 681
            RWKDLNPKFVLQVYRDVVATG+  FA+A WP+VY+AMAYMDQFDRDGDGM+EN+G PDQ
Sbjct: 607 VRWKDLNPKFVLQVYRDVVATGNAGFAEAAWPAVYLAMAYMDQFDRDGDGMVENEGRPDQ 666

Query: 682 TYDTWSVSGISAYSGGLWVAALQAASALAREVGDRGSEDYFLFKFQKAKVVY-EKLWNGS 740
           TYD WSVSG+SAY+GGLWVAALQAA+A+A  VGD  +E YF  ++ +A+ VY ++LWNG 
Sbjct: 667 TYDLWSVSGVSAYTGGLWVAALQAAAAMAGIVGDGAAEAYFRGRYHRARRVYTDELWNGG 726

Query: 741 YFNYDNSGSSQSSSIQADQLAGQWYARACGLLPIVDEDKARSALKKVYNYNVLKVMGGKR 800
           YFNYDNSG + SSSIQADQLAGQWYARACGL PIVD DKAR AL  V +YNV++      
Sbjct: 727 YFNYDNSGGATSSSIQADQLAGQWYARACGLEPIVDGDKARRALATVLDYNVMRT----- 781

Query: 801 GAVNGMLPDGRVDMSSMQSREIWSGVTYAVAASMIHEDLADIGFQTACGIYEAAWSGTGL 860
                             S+E+  GVTYAVAA+MIHE + +  F+TA GI++A W   G 
Sbjct: 782 -----------------HSKEVCPGVTYAVAAAMIHEGMPEAAFKTAKGIHDAGWGKHGF 824

Query: 861 GYAFQTPEAWNTDDQYRSLCYMRPLAIWAMQWALTRPKPKTLEKQMKPEVTDESLLRYHA 920
           GYAFQTPE+W  D  YR+L YMRPL +WAMQWAL+ P      +      + E       
Sbjct: 825 GYAFQTPESWTADGGYRALHYMRPLGVWAMQWALSPPVLHKEHRVAAVAASPEDAALGQE 884

Query: 921 GFSKVARLLKLPEEQGAKSLLQSLFDHTCRRMFI 954
            F KVA +L+LPEEQ  K +L++L+D T R++ +
Sbjct: 885 KFDKVASMLRLPEEQQHKGILRALYD-TLRQLLL 917


>gi|242080271|ref|XP_002444904.1| hypothetical protein SORBIDRAFT_07g001140 [Sorghum bicolor]
 gi|241941254|gb|EES14399.1| hypothetical protein SORBIDRAFT_07g001140 [Sorghum bicolor]
          Length = 929

 Score = 1014 bits (2623), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 526/946 (55%), Positives = 665/946 (70%), Gaps = 50/946 (5%)

Query: 26  VDPAKPASLTWQRKLSTGEIPLSQFTLNWKETVQLAPIGVRILCLIREEAAKGKRAFIDP 85
           +D ++P + TWQR+       ++ ++L   +   +  +GV                  DP
Sbjct: 15  IDCSQPPARTWQRRFDDEGKKVAMYSLTMNDLTAI--VGV-----------------YDP 55

Query: 86  FIKRHLTSSHGVPLGGVGSGSIGRSYRGEFQRWQIFPRVCEDKPVLANQFSVFVSRSNGQ 145
             K       GVPLGG+G+GSIGRSYRG FQ++QIFP   E+KP+LANQFS FVSR NG 
Sbjct: 56  LRKWMDNCYRGVPLGGIGAGSIGRSYRGYFQQFQIFPATNEEKPILANQFSAFVSRPNGT 115

Query: 146 KYSSVLCPKTPEVLKDTTAAGIGSWDWNLKGDKSTYHALYPRAWTVHEGEPDPELRIVCR 205
           KYS+VL   T ++LK+   A IGSWDW LK +K TYHAL+PR+WTV++GEPDPE++I  R
Sbjct: 116 KYSTVLSAPTADLLKEVDKASIGSWDWKLKEEKCTYHALFPRSWTVYDGEPDPEIKITSR 175

Query: 206 QISPIIPHNYKESSYPVSVFTYTIYNSGKTSADITLLFTWTNSVGGDSEFTGQHYNSKTK 265
           QISP IPHNY+ESS+P +VFT+T++NSG T+AD+TLLFTW NSVGG SE TG H NSK  
Sbjct: 176 QISPFIPHNYRESSFPAAVFTFTVHNSGSTAADVTLLFTWANSVGGKSELTGNHVNSKMT 235

Query: 266 MNDGVHAVLLHHRTSHQLPPVTFALAAQETDGVHVSLCPHFVIS-GNSLGLTAKDMWHEI 324
             DGVH VLLHHRT+   PPVTFA+A+QETDGV VS+CP F +    S   TA DMW EI
Sbjct: 236 EQDGVHGVLLHHRTAGGHPPVTFAIASQETDGVRVSVCPSFTMGPSGSRDFTAADMWDEI 295

Query: 325 KEH-GSFDRLNSMETSVTSEPGSSIGAAIAASVTVPPDSEGQVTFSLAWDCPEVNFMSGK 383
           K+H               S+ GSS+GAA+AAS TVP      V+FSLAW CP++ F +G 
Sbjct: 296 KKHGAFGHAGAGNAARAASKAGSSLGAAVAASTTVPAGETRVVSFSLAWACPDIKFPAGS 355

Query: 384 TYYRRYTKFYGTHQNAAA-NIARDAILEHGSWELQIEAWQRPILEDKRLPEWYPITLFNE 442
           TY+RRYTKFYG   +AAA  +A DA+LEH +WE QIE WQRPIL DKRLPEWYP+ LFNE
Sbjct: 356 TYHRRYTKFYGVDADAAAEQLAHDALLEHMNWESQIEEWQRPILNDKRLPEWYPVALFNE 415

Query: 443 LYYLNAGGAVWTDGSPPVHSLVT---IGHRKFSLDWSQSDLKRIVDVPNQNDTAVNILER 499
           LYYLNAGG +WTDG PP  +       G   FS+D        + +      +AV+ + R
Sbjct: 416 LYYLNAGGTIWTDGQPPKKAGFASSLTGTEPFSID------TLLAEDGGTGGSAVDGVVR 469

Query: 500 MSSILEQIYTPVALNSSFGTNLLQDGEENIGQFLYLEGIEYLMWNTYDVHFYSSFALIML 559
             +  +++    +  ++FG  LL+DGE+N+GQFLYLEG+EY M+NTYDVHFY+SFAL+ L
Sbjct: 470 AVASAKELSH--SAAAAFGAALLRDGEDNVGQFLYLEGMEYNMYNTYDVHFYASFALLSL 527

Query: 560 FPKIQLSIQRDFAAAVMMHDPSKMKLLDDGQWVSRKVLGAVPHDIGICDPWFEVNAYCLY 619
           FP ++LS+QRDFA AV++HDP + +   DG+ V RKVLG VPHD+G+ DPW E+N Y L+
Sbjct: 528 FPSLELSLQRDFARAVLLHDPRRRRTF-DGRTVPRKVLGTVPHDVGLNDPWVEMNEYMLH 586

Query: 620 DTARWKDLNPKFVLQVYRDVVATGDKKFAKAVWPSVYVAMAYMDQFDRDGDGMIENDGFP 679
           D A WKDLNPKFVLQVYRD VATGD  FA+A WP+VY+AMAYM QFDRDGDGM+EN+G P
Sbjct: 587 DPALWKDLNPKFVLQVYRDAVATGDAAFAEAAWPAVYIAMAYMHQFDRDGDGMVENEGIP 646

Query: 680 DQTYDTWSVSGISAYSGGLWVAALQAASALAREVGDRGSEDYFLFKFQKAKVVYE-KLWN 738
           DQTYD WSVSG+SAY+GGLWVAAL+AA+ +AR VGDR +E YF  +  +A  VY+ +LWN
Sbjct: 647 DQTYDIWSVSGVSAYTGGLWVAALEAAAGMARVVGDRHAEAYFRGRRDRAARVYDAELWN 706

Query: 739 GSYFNYDNSGSSQ-SSSIQADQLAGQWYARACGLLPIVDEDKARSALKKVYNYNVLKVMG 797
           G+YF YDNSG    S SI ADQ+AGQWYARACG+ P+V+E KARSAL  V +YNV++V G
Sbjct: 707 GTYFRYDNSGGGACSESIMADQMAGQWYARACGMEPVVEEGKARSALATVLDYNVMRVKG 766

Query: 798 GKRGAVNGMLPDGRVDMSSMQSREIWSGVTYAVAASMIHEDLADIGFQTACGIYEAAWSG 857
           G  GAVNGM PDG VD SS QS+E+W GVTYAVAA+M+HE + +  F+TA G ++A W  
Sbjct: 767 GAVGAVNGMRPDGAVDASSGQSKEVWPGVTYAVAAAMVHEGMHEAAFRTAKGAHDAGWGK 826

Query: 858 TGLGYAFQTPEAWNTDDQYRSLCYMRPLAIWAMQWALTRPK---------PKTLEKQMKP 908
            G GYAFQTPEAW  D  YRSL YMRPL IWAMQWAL+ P+          +++E++  P
Sbjct: 827 DGFGYAFQTPEAWTEDGGYRSLHYMRPLGIWAMQWALSPPELHKDLRAPGARSVEEESSP 886

Query: 909 EVTDESLLRYHAGFSKVARLLKLPEEQGAKSLLQSLFDHTCRRMFI 954
              D +L +  A F KVA +L+LPEE+  K  L +++    R+M +
Sbjct: 887 --ADAALGQ--AQFEKVANMLRLPEEKQPKGYLWAIY-KVIRQMVL 927


>gi|357144429|ref|XP_003573289.1| PREDICTED: non-lysosomal glucosylceramidase-like isoform 2
           [Brachypodium distachyon]
          Length = 919

 Score =  996 bits (2575), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 530/938 (56%), Positives = 666/938 (71%), Gaps = 45/938 (4%)

Query: 26  VDPAKPASLTWQRKLSTGEIPLSQFTLNWKETVQLAPIGVRILCLIREEAAKGKRAFIDP 85
           +D A+PA+ TWQRK       +  F++   + + + P+  + L    EE  KG+    DP
Sbjct: 15  IDCAQPATRTWQRKFDDEGKKIEMFSMTMNDMILIVPMIFKGLVRNAEERGKGRDVIYDP 74

Query: 86  FIKRHLTSSHGVPLGGVGSGSIGRSYRGEFQRWQIFPRVCEDKPVLANQFSVFVSRSNGQ 145
           F K       G+PLGG+G+GSIGRSYRG FQ +QIFP + E KP+LANQFS F+SR NG+
Sbjct: 75  FKKWMDNCYRGLPLGGLGAGSIGRSYRGYFQHFQIFPAIYEQKPILANQFSAFISRPNGK 134

Query: 146 KYSSVLCPKTPEVLKDTTAAGIGSWDWNLKGDKSTYHALYPRAWTVHEGEPDPELRIVCR 205
            YS+VL   T +VLK    AGIGSWDW LK    TYHAL+PR+WTV++GEPDP ++I CR
Sbjct: 135 SYSTVLSAPTADVLKGVDKAGIGSWDWKLKEKNCTYHALFPRSWTVYDGEPDPAIKITCR 194

Query: 206 QISPIIPHNYKESSYPVSVFTYTIYNSGKTSADITLLFTWTNSVGGDSEFTGQHYNSK-T 264
           QISP IPHNYKESS+PV VFT+T++N+G T AD+TLLFTW NSVGG SE TG H NSK  
Sbjct: 195 QISPFIPHNYKESSFPVVVFTFTVHNTGSTPADVTLLFTWANSVGGRSELTGNHTNSKMI 254

Query: 265 KMNDGVHAVLLHHRTSHQLPPVTFALAAQETDGVHVSLCPHFVISGNSLG-LTAKDMWHE 323
             NDG              PPVTFA+A+QET+ V V+ CP F +  +S G  TAKDMW E
Sbjct: 255 TANDG-------------HPPVTFAIASQETEDVRVTDCPSFTMGSSSSGDFTAKDMWQE 301

Query: 324 IKEHGSFDRLNSMETSVTSEPGSSIGAAIAASVTVPPDSEGQVTFSLAWDCPEVNFMSGK 383
           IK+HGSF  + + +    S+PGSSIGAA+AA+ TVP      V+F+L+W CPEV F +G+
Sbjct: 302 IKKHGSF--ITTDKEPRASKPGSSIGAAVAAATTVPAGGTRVVSFALSWSCPEVKFPAGR 359

Query: 384 TYYRRYTKFYGTHQNAAA-NIARDAILEHGSWELQIEAWQRPILEDKRLPEWYPITLFNE 442
           TY+RRYT F+G  ++AAA ++A DA+LEH  WE +IE WQRPIL+DKRLP+WYP+ LFNE
Sbjct: 360 TYHRRYTTFHGLDRDAAAESLAHDALLEHMDWESKIEEWQRPILQDKRLPDWYPVALFNE 419

Query: 443 LYYLNAGGAVWTDGSPPVHSLVT--IGHRKFSLDWSQSDLKRIVDVPNQNDTAVN-ILER 499
           LYYLNAGG +WTDG PP  S+ +  +G   FSL            +P    TAV+ IL  
Sbjct: 420 LYYLNAGGTIWTDGLPPKMSVSSSGVGTEPFSL--------ATTYLPG---TAVDGILSA 468

Query: 500 MSSILEQIYTPVALNSSFGTNLLQDGEENIGQFLYLEGIEYLMWNTYDVHFYSSFALIML 559
           ++S  E  ++  A    FG  LL DGEEN+GQFLYLE  EY M+NTYDVHFY+SFAL+ L
Sbjct: 469 VASATEDQHSAAA----FGATLLGDGEENVGQFLYLEATEYYMYNTYDVHFYASFALLSL 524

Query: 560 FPKIQLSIQRDFAAAVMMHDPSKMKLLDDGQWVSRKVLGAVPHDIGICDPWFEVNAYCLY 619
           FP+++LS+QRDFA AV++HDP  M  L DG  V RKVLGAVPHDIG+ DPWFE+NAY ++
Sbjct: 525 FPELELSLQRDFARAVLIHDPRPMVTL-DGATVLRKVLGAVPHDIGLNDPWFELNAYMIH 583

Query: 620 DTARWKDLNPKFVLQVYRDVVATGDKKFAKAVWPSVYVAMAYMDQFDRDGDGMIENDGFP 679
           D ARWKDLNPKFVLQVYR VV TG+  FAKA WP+VY+AMAYMDQ+DRDGDGM+EN+G P
Sbjct: 584 DPARWKDLNPKFVLQVYRAVVVTGNAAFAKAAWPAVYLAMAYMDQYDRDGDGMVENEG-P 642

Query: 680 DQTYDTWSVSGISAYSGGLWVAALQAASALAREVGDRGSEDYFLFKFQKAKVVYE-KLWN 738
           DQTYD WSV+G+SAY+GG+WV ALQA +A+AR VGD  +E YF  ++ +AK VY+ +LWN
Sbjct: 643 DQTYDLWSVTGVSAYTGGIWVTALQATAAMARIVGDGDAECYFHGRYLRAKQVYDAELWN 702

Query: 739 GSYFNYDNSGSSQSSSIQADQLAGQWYARACGLLPIVDEDKARSALKKVYNYNVLKVMGG 798
           G+YF YDNSG   S SI ADQLAGQWYA  CGL P+V+EDKARSAL  V +YNV++V GG
Sbjct: 703 GTYFRYDNSGGETSESIMADQLAGQWYAHVCGLEPVVEEDKARSALATVLDYNVMRVKGG 762

Query: 799 KRGAVNGMLPDGRVDMSSMQSREIWSGVTYAVAASMIHEDLADIGFQTACGIYEAAWSGT 858
             GAVNGM PDG +DMSS QS+EIWSG TYAVAA+M+HE + +  F+TA G ++A+W   
Sbjct: 763 AVGAVNGMRPDGGIDMSSTQSKEIWSGTTYAVAAAMVHEGMPEGAFRTAKGAHDASWGKA 822

Query: 859 GLGYAFQTPEAWNTDDQYRSLCYMRPLAIWAMQWALTRPKPKTLEKQMKPEVTDE--SLL 916
           G GYAFQTPEAW  +  YR L YMRPL++WAMQWAL+ P+   L K +            
Sbjct: 823 GFGYAFQTPEAWTAEGGYRGLHYMRPLSVWAMQWALSPPE---LHKDLGLASVSPPGDAA 879

Query: 917 RYHAGFSKVARLLKLPEEQGAKSLLQSLFDHTCRRMFI 954
           R    F KVA +L+LPE+   K    +L+ H+ R++ +
Sbjct: 880 RGREKFDKVAGMLRLPEKVQHKGFFLALY-HSLRQLVL 916


>gi|326517703|dbj|BAK03770.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 821

 Score =  965 bits (2495), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 490/838 (58%), Positives = 607/838 (72%), Gaps = 40/838 (4%)

Query: 137 VFVSRSNGQKYSSVLCPKTPEVLKDTTAAGIGSWDWNLKGDKSTYHALYPRAWTVHEGEP 196
            FVS   G+ YS+VL   T ++LK    AGIGSWDW LK +   YH L+PR+WTV++GEP
Sbjct: 1   AFVSHPGGKSYSTVLSAPTADLLKGIDKAGIGSWDWKLKEENCNYHGLFPRSWTVYDGEP 60

Query: 197 DPELRIVCRQISPIIPHNYKESSYPVSVFTYTIYNSGKTSADITLLFTWTNSVGGDSEFT 256
           D +++I CRQISP IPHNYKESS+PV+VFT+T+ NSG T AD+TLLFTW NSVGG SE T
Sbjct: 61  DTKIKITCRQISPFIPHNYKESSFPVTVFTFTVQNSGSTPADVTLLFTWANSVGGRSELT 120

Query: 257 GQHYNSKTKMNDGVHAVLLHHRTSHQLPPVTFALAAQETDGVHVSLCPHFVISGNSLG-- 314
           G H NS+ K  DGVH VLL HRT+   PPVTFA+A++ET  V V+ CP F +  +S G  
Sbjct: 121 GNHTNSRIKARDGVHGVLLRHRTADGNPPVTFAIASEETSDVRVTCCPTFAMGPSSPGAE 180

Query: 315 -LTAKDMWHEIKEHGSFDRLNSMETSVTSEPGSSIGAAIAASVTVPPDSEGQVTFSLAWD 373
             TAKDMW E+K  GSF        + +S PGSSIGAA+A++  VP      V+F+L+W 
Sbjct: 181 QFTAKDMWDEVKNRGSFGGDAGGAPTASSRPGSSIGAAVASTTKVPAGDTRVVSFALSWS 240

Query: 374 CPEVNFMSGKTYYRRYTKFYGTHQNAAA-NIARDAILEHGSWELQIEAWQRPILEDKRLP 432
           CPEV F SG+TY+RRYTKF G  ++AAA  +  DA+LEH  WE  IE WQ PIL+D+RLP
Sbjct: 241 CPEVKFPSGRTYHRRYTKFLGLDRDAAAEQLVHDALLEHMKWESLIEEWQTPILQDRRLP 300

Query: 433 EWYPITLFNELYYLNAGGAVWTDGSPP-----VHSLVTIGHRKFSLDWSQSDLKRIVDVP 487
           EWYP+ LFNELYYLNAGG +WTDG PP       S        FSLD  ++         
Sbjct: 301 EWYPVALFNELYYLNAGGTIWTDGMPPKKTSFASSKYGSTAESFSLDGFRA--------- 351

Query: 488 NQNDTAVNILERMSSILEQIYTPVALNSSFGTNLLQDGEENIGQFLYLEGIEYLMWNTYD 547
              D AV+ + R  +  E+   P   +S+FGT LL DGEEN+GQFLYLEG+EY MWNTYD
Sbjct: 352 --GDPAVDGILRAMATAEERLEP---SSAFGTALLGDGEENVGQFLYLEGMEYHMWNTYD 406

Query: 548 VHFYSSFALIMLFPKIQLSIQRDFAAAVMMHDPSKMKLLDDGQWVSRKVLGAVPHDIGIC 607
           VHFY+SF+L+ LFP+I+LS+QRDFA AV++HDP  M+ LD G  V RKVLGAVPHDIG+ 
Sbjct: 407 VHFYASFSLLSLFPEIELSLQRDFARAVLLHDPRPMRTLD-GVDVPRKVLGAVPHDIGLA 465

Query: 608 DPWFEVNAYCLYDTARWKDLNPKFVLQVYRDVVATGDKKFAKAVWPSVYVAMAYMDQFDR 667
           DPWFE+NAY ++D +RWKDLNPKFVLQVYRDV ATG+  FA A WP+VY+AMAYMDQFDR
Sbjct: 466 DPWFELNAYMIHDPSRWKDLNPKFVLQVYRDVAATGNVTFATAAWPAVYLAMAYMDQFDR 525

Query: 668 DGDGMIENDGFPDQTYDTWSVSGISAYSGGLWVAALQAASALAREVGDRGSEDYFLFKFQ 727
           DGDGM+EN+G PDQTYD WSVSG+SAY+GGLW+AALQAA+A+AR VGDRG+E YFL +++
Sbjct: 526 DGDGMVENEGRPDQTYDLWSVSGVSAYTGGLWLAALQAAAAMARIVGDRGAEGYFLERYK 585

Query: 728 KAKVVYE-KLWNGSYFNYDNSGSSQSSSIQADQLAGQWYARACGLLPIVDEDKARSALKK 786
           +A+ VY+ +LWNGSYF+YDNSG + S SI ADQLAGQWYARACGL PIV+E+KARSAL  
Sbjct: 586 RAQRVYDGELWNGSYFDYDNSGCATSKSIMADQLAGQWYARACGLEPIVEEEKARSALGT 645

Query: 787 VYNYNVLKVMGGKRGAVNGMLPDGRVDMSSMQSREIWSGVTYAVAASMIHEDLADIGFQT 846
           V +YNV++V GG  GAVNGM PDG VD SS+QS+E+W GVTY VAA+M+HE + +  F+T
Sbjct: 646 VLDYNVMRVQGGAVGAVNGMRPDGAVDASSLQSKEVWVGVTYGVAAAMLHEGMTEAAFRT 705

Query: 847 ACGIYEAAWSGTGLGYAFQTPEAWNTD--DQYRSLCYMRPLAIWAMQWALTRPK------ 898
           A G ++A W   G GYAFQTPEAW +D    YRSL YMRPL+IWAMQWAL+ P+      
Sbjct: 706 AKGAHDAGWGRDGFGYAFQTPEAWTSDAGGGYRSLHYMRPLSIWAMQWALSPPELHRDLR 765

Query: 899 --PKTLEKQMKPEVTDESLLRYHAGFSKVARLLKLPEEQGAKSLLQSLFDHTCRRMFI 954
             P ++     P   D +  +    F KVA +LKLPEE   K  L+++     R+M +
Sbjct: 766 VVPGSVSAVASPAEVDLAREK----FEKVASMLKLPEEVQHKGYLRAIC-QVLRQMLL 818


>gi|302773445|ref|XP_002970140.1| hypothetical protein SELMODRAFT_92385 [Selaginella moellendorffii]
 gi|300162651|gb|EFJ29264.1| hypothetical protein SELMODRAFT_92385 [Selaginella moellendorffii]
          Length = 983

 Score =  962 bits (2487), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 509/956 (53%), Positives = 667/956 (69%), Gaps = 36/956 (3%)

Query: 27  DPAKPASLTWQRKLSTGEIPLSQFTLNWKETVQLA-PIGVRILCLIREEAAKGKRAFIDP 85
           DP  P +  W+RKL+     L++F + + E ++L   +G+R+   I+ E ++G+   IDP
Sbjct: 27  DPGAPPAHAWRRKLNCHANFLTEFRVTFTEALKLVLGLGLRLWKFIQAERSQGRTPPIDP 86

Query: 86  FIK-RHLTSSHGVPLGGVGSGSIGRSYRGEFQRWQIFPRVCEDKPVLANQFSVFVSRSNG 144
           F +    ++ HGVPLGG+G GSIGR +RGEF+RWQ+ P V  +  VLANQFSVFV+R   
Sbjct: 87  FKRGTKPSACHGVPLGGMGGGSIGRGFRGEFRRWQLLPSVTSEAAVLANQFSVFVTRDKD 146

Query: 145 ----QKYSSVLCPKTPE--VLKDTTAAGIGSWDWNLKGDKSTYHALYPRAWTVHEGEPDP 198
               +K +SVL P+  E  V K   +  I +WDWNL G  STYH L+PRAWT+++GEPDP
Sbjct: 147 HGEKKKIASVLSPRPSELDVGKKAQSPEICAWDWNLDGQNSTYHGLFPRAWTIYDGEPDP 206

Query: 199 ELRIVCRQISPIIPHNYKESSYPVSVFTYTIYNSGKTSADITLLFTWTNSVGGDSEFTGQ 258
           +L+I CRQ+SP IPH+Y+ESS PV VF+Y I N+GK SA ++LLFTW NSVGGDS  +G 
Sbjct: 207 DLKISCRQVSPFIPHDYRESSLPVCVFSYVIVNTGKESASVSLLFTWANSVGGDSHLSGG 266

Query: 259 HYNSKTKMNDGVHAVLLHHRTSHQLPPVTFALAAQETDGVHVSLCPHFVISGNSLGLTAK 318
           H N     +DGV  V+LHHRT+   P V FA+AA++   V VS+CP F++SG    +TAK
Sbjct: 267 HSNKPFLEDDGVAGVVLHHRTAKGQPSVEFAIAARQNADVAVSVCPSFLVSGKGSTMTAK 326

Query: 319 DMWHEIKEHGSFDRLNSMETSVTSEPGSSIGAAIAASVTVPPDSEGQVTFSLAWDCPEVN 378
           +MW++++E G FD     E S+ S   SS+GAA++ASVT+ P  +  + F+L+WD PEV 
Sbjct: 327 EMWNQMREKGCFDTDTVCEPSLPSLVNSSLGAAVSASVTIAPHGKKVIDFALSWDIPEVK 386

Query: 379 FMSGKTYYRRYTKFYGTHQNAAANIARDAILEHGSWELQIEAWQRPILEDKRLPEWYPIT 438
           F  G++Y+RRYT+FYG++ +AA  +  DA+L +  WE +IE WQ P+L+D  LPEWY  T
Sbjct: 387 FPKGRSYHRRYTQFYGSYGDAAPRLVHDALLNYPKWEAEIEKWQEPVLKDANLPEWYRFT 446

Query: 439 LFNELYYLNAGGAVWTDGSPP---VHSLVTIGHRKFSLDWSQSDLKRIVDVPNQ------ 489
           LFNELYYL AGG VWT    P    +   +  + + S+  +  +L  ++ +         
Sbjct: 447 LFNELYYLVAGGTVWTAFVTPSFRSNGFTSRTNSESSITGAVEELVEMIPLNGGTTGNGG 506

Query: 490 -----NDTAVNILERMSSILEQIYTPVALNSS-----FGTNLLQDGEENIGQFLYLEGIE 539
                ++T   I E++   +E       +NS+     +GT LL+DGEEN+GQFLYLE +E
Sbjct: 507 VDAILSETEEEIAEKLVKAMEAAILDKQINSAPRDFPYGTYLLEDGEENVGQFLYLESVE 566

Query: 540 YLMWNTYDVHFYSSFALIMLFPKIQLSIQRDFAAAVMMHDPSKMKLLDDGQWVSRKVLGA 599
           Y+MW TYDVHFY+SFALI LFPK++LSIQRD+AAA + H+P  +K L  G+W  +KVLGA
Sbjct: 567 YVMWCTYDVHFYASFALISLFPKLELSIQRDYAAATLSHNPELVKFLAGGKWGIKKVLGA 626

Query: 600 VPHDIGICDPWFEVNAYCLYDTARWKDLNPKFVLQVYRDVVATGDKKFAKAVWPSVYVAM 659
           VPHD+GI DPW EVNAY ++DT+RWKDLN KFVLQ+YRDVVAT D  FA+  WP+VY AM
Sbjct: 627 VPHDLGIHDPWVEVNAYNIHDTSRWKDLNSKFVLQIYRDVVATEDLNFARVTWPAVYTAM 686

Query: 660 AYMDQFDRDGDGMIENDGFPDQTYDTWSVSGISAYSGGLWVAALQAASALAREVGDRGSE 719
           AYMD+FDRD DGMIENDGFPDQTYDTW+V GISAY GGLW+AALQAA+A+A +V +  + 
Sbjct: 687 AYMDRFDRDRDGMIENDGFPDQTYDTWTVHGISAYCGGLWLAALQAAAAMAEKVNEPNAA 746

Query: 720 DYFLFKFQKAKVVYE-KLWNGSYFNYDNSGSSQSSSIQADQLAGQWYARACGLLPIVDED 778
           +YFL KF+ A+ VY+ KLWNGSYFNYD+  SS S+SIQADQ+AGQWYA A GL P+ D+ 
Sbjct: 747 NYFLRKFRDARKVYDKKLWNGSYFNYDSGSSSNSNSIQADQMAGQWYAWASGLPPLFDDY 806

Query: 779 KARSALKKVYNYNVLKVMGGKRGAVNGMLPDGRVDMSSMQSREIWSGVTYAVAASMIHED 838
           KARSAL+++Y+YNV+KV GGK GAVNGM P+GRVD ++MQSREIWSGVTYA AA+MIHE 
Sbjct: 807 KARSALQRIYDYNVMKVKGGKLGAVNGMHPNGRVDETAMQSREIWSGVTYAAAAAMIHEG 866

Query: 839 LADIGFQTACGIYEAAWSGTGLGYAFQTPEAWNTDDQYRSLCYMRPLAIWAMQWALTRPK 898
           + +  F TA GIY A WS +G GY FQTPEAW TD  +R+L YMRPLAIWAMQWAL  P 
Sbjct: 867 MLEQAFTTAEGIYNAGWSDSGYGYWFQTPEAWTTDGHFRALTYMRPLAIWAMQWALYPPN 926

Query: 899 -----PKTLEKQMKPEVTDESLLRYHAGFSKVARLLKLPEEQGAKSLLQSLFDHTC 949
                P+   K+     +  S   Y AGF+KV+  LK    Q  +  ++SLF   C
Sbjct: 927 CVAAGPRIPSKEESAGGSTSSSYLY-AGFAKVSEALKNAPPQIEQ--IRSLFHRLC 979


>gi|302807226|ref|XP_002985326.1| hypothetical protein SELMODRAFT_121820 [Selaginella moellendorffii]
 gi|300147154|gb|EFJ13820.1| hypothetical protein SELMODRAFT_121820 [Selaginella moellendorffii]
          Length = 983

 Score =  960 bits (2482), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 511/958 (53%), Positives = 663/958 (69%), Gaps = 36/958 (3%)

Query: 27  DPAKPASLTWQRKLSTGEIPLSQFTLNWKETVQLA-PIGVRILCLIREEAAKGKRAFIDP 85
           DP  P +  W+RKL+     L++F + + E ++L   +G+R+   I+ E ++G+   IDP
Sbjct: 27  DPGAPPAHAWRRKLNCHANFLTEFRVTFTEALKLVLGLGLRLWKFIQAERSQGRTPPIDP 86

Query: 86  FIK-RHLTSSHGVPLGGVGSGSIGRSYRGEFQRWQIFPRVCEDKPVLANQFSVFVSRSNG 144
           F +    ++ HGVPLGG+G GSIGR +RGEF+RWQ+ P V  +  VLANQFSVFV+R   
Sbjct: 87  FKRGTKPSACHGVPLGGMGGGSIGRGFRGEFRRWQLLPSVTSEAAVLANQFSVFVTRDKD 146

Query: 145 ----QKYSSVLCPKTPE--VLKDTTAAGIGSWDWNLKGDKSTYHALYPRAWTVHEGEPDP 198
               +K +SVL P+  E  V K   +  I +WDWNL G  STYH L+PRAWT+++GEPDP
Sbjct: 147 HGEKKKIASVLSPRPSELDVGKKAQSPEICAWDWNLDGQNSTYHGLFPRAWTIYDGEPDP 206

Query: 199 ELRIVCRQISPIIPHNYKESSYPVSVFTYTIYNSGKTSADITLLFTWTNSVGGDSEFTGQ 258
           +L+I CRQ+SP IPH+Y+ESS PV VF+Y I N+GK SA ++LLFTW NSVGGD+  +G 
Sbjct: 207 DLKISCRQVSPFIPHDYRESSLPVCVFSYVIVNTGKESASVSLLFTWANSVGGDTHLSGG 266

Query: 259 HYNSKTKMNDGVHAVLLHHRTSHQLPPVTFALAAQETDGVHVSLCPHFVISGNSLGLTAK 318
           H N     +DGV  V+LHHRT+   P V FA+AA++   V VS+CP F++SG    +TAK
Sbjct: 267 HSNKPFLEDDGVAGVVLHHRTAKGQPSVEFAIAARQNADVAVSVCPSFLVSGKGSTMTAK 326

Query: 319 DMWHEIKEHGSFDRLNSMETSVTSEPGSSIGAAIAASVTVPPDSEGQVTFSLAWDCPEVN 378
           +MW++++E G FD     E S+ S   SS+GAA++ASVT+ P  +  V F+L+WD PEV 
Sbjct: 327 EMWNQMREKGCFDTDTVCEPSLPSLVNSSLGAAVSASVTIAPHGKKVVDFALSWDIPEVK 386

Query: 379 FMSGKTYYRRYTKFYGTHQNAAANIARDAILEHGSWELQIEAWQRPILEDKRLPEWYPIT 438
           F  G++Y+RRYT+FYG++ +AA  +  DA+L +  WE +IE WQ P+L+D  LPEWY  T
Sbjct: 387 FPKGRSYHRRYTQFYGSYGDAAPRLVHDALLNYPKWEAEIEKWQEPVLKDANLPEWYKFT 446

Query: 439 LFNELYYLNAGGAVWTDGSPPVHSLVTIGHRKFSLDWSQSDLKRIVD-VPNQNDTAVN-- 495
           LFNELYYL AGG VWT    P         R  S       ++ +V+ +P    T+ N  
Sbjct: 447 LFNELYYLVAGGTVWTAFVTPSFRSNGFTSRTNSESSITGAVEELVEMIPLNGGTSGNGG 506

Query: 496 -----------ILERMSSILEQIYTPVALNSS-----FGTNLLQDGEENIGQFLYLEGIE 539
                      I E++   +E       +NS+     +GT LL+DGEEN+GQFLYLE +E
Sbjct: 507 VDAILSETEEEIAEKLVKAMEAAILDKQINSAPRDFPYGTYLLEDGEENVGQFLYLESVE 566

Query: 540 YLMWNTYDVHFYSSFALIMLFPKIQLSIQRDFAAAVMMHDPSKMKLLDDGQWVSRKVLGA 599
           Y+MW TYDVHFY+SFALI LFPK++LSIQRD+AAA + H+P  +K L  G+W  +KVLGA
Sbjct: 567 YVMWCTYDVHFYASFALISLFPKLELSIQRDYAAATLSHNPELVKFLAGGKWGIKKVLGA 626

Query: 600 VPHDIGICDPWFEVNAYCLYDTARWKDLNPKFVLQVYRDVVATGDKKFAKAVWPSVYVAM 659
           VPHD+GI DPW EVNAY ++DT+RWKDLN KFVLQ+YRDVVAT D  FA+  WP+VY AM
Sbjct: 627 VPHDLGIHDPWVEVNAYNIHDTSRWKDLNSKFVLQIYRDVVATEDLNFARVTWPAVYTAM 686

Query: 660 AYMDQFDRDGDGMIENDGFPDQTYDTWSVSGISAYSGGLWVAALQAASALAREVGDRGSE 719
           AYMD+FDRD DGMIENDGFPDQTYDTW+V GISAY GGLW+AALQAA+A+A +V +  + 
Sbjct: 687 AYMDRFDRDRDGMIENDGFPDQTYDTWTVHGISAYCGGLWLAALQAAAAMAEKVNEPNAA 746

Query: 720 DYFLFKFQKAKVVYE-KLWNGSYFNYDNSGSSQSSSIQADQLAGQWYARACGLLPIVDED 778
           +YFL KF+ A+ VY+ KLWNGSYFNYD+  SS S+SIQADQ+AGQWYA A GL P+ D+ 
Sbjct: 747 NYFLRKFRDARKVYDKKLWNGSYFNYDSGSSSNSNSIQADQMAGQWYAWASGLPPLFDDY 806

Query: 779 KARSALKKVYNYNVLKVMGGKRGAVNGMLPDGRVDMSSMQSREIWSGVTYAVAASMIHED 838
           KARSAL+++Y+YNV+KV GGK GAVNGM P+GRVD ++MQSREIWSGVTYA AA+MIHE 
Sbjct: 807 KARSALQRIYDYNVMKVKGGKLGAVNGMHPNGRVDETAMQSREIWSGVTYAAAAAMIHEG 866

Query: 839 LADIGFQTACGIYEAAWSGTGLGYAFQTPEAWNTDDQYRSLCYMRPLAIWAMQWALTRPK 898
           + +  F TA GIY A WS +G GY FQTPEAW TD  +R+L YMRPLAIWAMQWAL  P 
Sbjct: 867 MLEQAFTTAEGIYNAGWSDSGYGYWFQTPEAWTTDGHFRALTYMRPLAIWAMQWALYPPN 926

Query: 899 -----PKTLEKQMKPEVTDESLLRYHAGFSKVARLLK--LPEEQGAKSLLQSLFDHTC 949
                P+   K+     +  S   Y AGF+KV+  L    P+ +  +SL   L    C
Sbjct: 927 CVAAGPRIPSKEESAGGSTSSSYLY-AGFAKVSEALTNVPPQIEQIRSLFHRLCCCCC 983


>gi|326516802|dbj|BAJ96393.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 970

 Score =  959 bits (2479), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 488/895 (54%), Positives = 630/895 (70%), Gaps = 24/895 (2%)

Query: 26  VDPAKPASLTWQRKLSTGEIPLSQFTLNWKETVQLAPIGVRILCLIREEAAKGKRAFIDP 85
           +D   P    W+R+L++    L +F++ + E +++  +GVR+   +REEA+ G++A IDP
Sbjct: 30  LDGGSPPEQAWRRRLNSHANILKEFSVTFMEAMKMMTLGVRLWSYVREEASHGRKAPIDP 89

Query: 86  FIKRHL--TSSHGVPLGGVGSGSIGRSYRGEFQRWQIFPRVCEDKPVLANQFSVFVSRSN 143
           F +     ++S GVPLGG+GSGSI R +RGEF+ W I P +CE+ PV+ NQFS+FVSR +
Sbjct: 90  FTRERCKPSASQGVPLGGMGSGSISRGFRGEFKNWHIIPGLCENSPVMENQFSIFVSRDS 149

Query: 144 G-QKYSSVLCPKTPEVLKDTTAAGIGSWDWNLKGDKSTYHALYPRAWTVHEGEPDPELRI 202
           G +KYSSVL P   E LK    +GI SWDWNL G  STYHAL+PRAWT+++GEPDP+L+I
Sbjct: 150 GNKKYSSVLAPGHHEGLKKCNDSGISSWDWNLSGQHSTYHALFPRAWTIYDGEPDPDLKI 209

Query: 203 VCRQISPIIPHNYKESSYPVSVFTYTIYNSGKTSADITLLFTWTNSVGGDSEFTGQHYNS 262
            CRQISP IPH+YK+SS P SVF YT+ N+G+  A ++LL TW NS+GG S  +G H+N 
Sbjct: 210 SCRQISPFIPHDYKDSSLPTSVFVYTLVNTGRDRAKVSLLMTWANSIGGFSHHSGGHFNE 269

Query: 263 KTKMNDGVHAVLLHHRTSHQLPPVTFALAAQETDGVHVSLCPHFVISGNSLGLTAKDMWH 322
               +DGV  VLLHH+T+   PPVTF++AA ET  V V++ P F +SG +  ++AK+MW 
Sbjct: 270 PFIGDDGVSGVLLHHKTAKDNPPVTFSIAACETQNVSVTVLPVFGLSGEN-HVSAKEMWD 328

Query: 323 EIKEHGSFDRLN-SMETSVTSEPGSSIGAAIAASVTVPPDSEGQVTFSLAWDCPEVNFMS 381
            + + G F R N +   S+ S  G ++ AA++AS  V P     V F+LAW  P+V F  
Sbjct: 329 TMSKDGHFSRENFNAGCSMPSSSGETLCAAVSASTWVEPHGRCTVAFALAWSSPKVKFQK 388

Query: 382 GKTYYRRYTKFYGTHQNAAA-NIARDAILEHGSWELQIEAWQRPILEDKRLPEWYPITLF 440
           G TY RRYT+FYGT + +++ N+  DA+ ++  WE +IE WQ PIL+D++LPEWY  TLF
Sbjct: 389 GCTYNRRYTEFYGTSERSSSINLVHDALTKYRLWEEEIEKWQDPILKDEKLPEWYKFTLF 448

Query: 441 NELYYLNAGGAVWTDGSPPVHSLVTIGHR-KFSLDWSQSD-------------LKRIVDV 486
           NELY+L AGG VWTDG PP  S  +  ++ K+S   ++S+             +    D+
Sbjct: 449 NELYFLVAGGTVWTDGQPPAISEASPAYQHKYSKKGAKSESVKDNHVKPAAEQVSDGDDL 508

Query: 487 PNQNDTAVN--ILERMSSILEQIYTPVALNSSFGTNLLQDGEENIGQFLYLEGIEYLMWN 544
           PN  + +V+  +    S + EQ  + + L       L +DG EN+G+FLYLEG+EY+MWN
Sbjct: 509 PNGEERSVSTYVAVHGSQMPEQT-SGLGLQEPIPYLLSKDGPENVGKFLYLEGVEYIMWN 567

Query: 545 TYDVHFYSSFALIMLFPKIQLSIQRDFAAAVMMHDPSKMKLLDDGQWVSRKVLGAVPHDI 604
           TYDVHFY+SFAL+ LFPKI+LSIQRDFA AV+  D  ++K L DG    RKV GAVPHD+
Sbjct: 568 TYDVHFYASFALLDLFPKIELSIQRDFADAVLYEDRRRVKFLADGTSGIRKVKGAVPHDL 627

Query: 605 GICDPWFEVNAYCLYDTARWKDLNPKFVLQVYRDVVATGDKKFAKAVWPSVYVAMAYMDQ 664
           G  DPW E+NAY ++DT++WKDLNPKFVLQVYRD  ATGD  F + VWP+V  AM YMDQ
Sbjct: 628 GTHDPWHEMNAYNIHDTSKWKDLNPKFVLQVYRDFAATGDMTFGRDVWPAVSAAMDYMDQ 687

Query: 665 FDRDGDGMIENDGFPDQTYDTWSVSGISAYSGGLWVAALQAASALAREVGDRGSEDYFLF 724
           FDRDGDG+IENDGFPDQTYD W+V GISAY GGLW+AALQAA+ +A  +GDR   + +  
Sbjct: 688 FDRDGDGLIENDGFPDQTYDAWTVHGISAYCGGLWLAALQAAATMAHRLGDRPYAEKYKL 747

Query: 725 KFQKAKVVYE-KLWNGSYFNYDNSGSSQSSSIQADQLAGQWYARACGLLPIVDEDKARSA 783
           KF KAK VYE KLWNGSYFNYD+  SS S SIQADQLAGQWYA + GL PI DE K RSA
Sbjct: 748 KFMKAKAVYEAKLWNGSYFNYDSGTSSNSRSIQADQLAGQWYAASSGLPPIFDEHKIRSA 807

Query: 784 LKKVYNYNVLKVMGGKRGAVNGMLPDGRVDMSSMQSREIWSGVTYAVAASMIHEDLADIG 843
           L+K++ +NV+KV GG+ GAVNGM P G+VD + MQSREIW+GVTY VAA+M+   +   G
Sbjct: 808 LQKIFEFNVMKVKGGRMGAVNGMTPKGKVDETCMQSREIWTGVTYGVAANMLLHGMEHQG 867

Query: 844 FQTACGIYEAAWSGTGLGYAFQTPEAWNTDDQYRSLCYMRPLAIWAMQWALTRPK 898
           F TA GI+ A WS  G GY FQTPE W TD  YRSL YMRPLAIWAMQWAL+ PK
Sbjct: 868 FITAEGIFLAGWSEDGYGYWFQTPEGWTTDGHYRSLVYMRPLAIWAMQWALSPPK 922


>gi|218184729|gb|EEC67156.1| hypothetical protein OsI_34005 [Oryza sativa Indica Group]
          Length = 974

 Score =  957 bits (2474), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 490/898 (54%), Positives = 619/898 (68%), Gaps = 28/898 (3%)

Query: 27  DPAKPASLTWQRKLSTGEIPLSQFTLNWKETVQLAPIGVRILCLIREEAAKGKRAFIDPF 86
           D   P    W+R+L++    L +F++ + E +++  +G+R+   +REEA+ G++A IDPF
Sbjct: 31  DGGSPPEQAWRRRLNSHANLLKEFSVTFMEAMRMMSLGLRLWSYVREEASHGRKAPIDPF 90

Query: 87  IKRHL--TSSHGVPLGGVGSGSIGRSYRGEFQRWQIFPRVCEDKPVLANQFSVFVSRSNG 144
            K     ++S GVPLGG+GSGSI R +RGEF+ W I P +CE  PV+ NQFS+FVSR  G
Sbjct: 91  TKEKCKPSASQGVPLGGMGSGSISRGFRGEFKNWHIIPGLCETSPVMENQFSIFVSRDGG 150

Query: 145 -QKYSSVLCPKTPEVLKDTTAAGIGSWDWNLKGDKSTYHALYPRAWTVHEGEPDPELRIV 203
            +KYSSVL P   E LK    +GI SWDWNL G  STYHAL+PRAWTV++GEPDP+L+I 
Sbjct: 151 NKKYSSVLSPGHHEGLKKCNDSGISSWDWNLSGQHSTYHALFPRAWTVYDGEPDPDLKIS 210

Query: 204 CRQISPIIPHNYKESSYPVSVFTYTIYNSGKTSADITLLFTWTNSVGGDSEFTGQHYNSK 263
           CRQISP IPH+YK+SS P SVF YT+ N+GK  A ++LL TW NS+GG S  +G H+N  
Sbjct: 211 CRQISPFIPHDYKDSSLPTSVFVYTLVNTGKDRAKVSLLMTWANSIGGFSHHSGGHFNEP 270

Query: 264 TKMNDGVHAVLLHHRTSHQLPPVTFALAAQETDGVHVSLCPHFVISGNSLGLTAKDMWHE 323
               DGV  VLLHH+T+   PPVTFA+AA ET  V+V++ P F +SG     +AK MW  
Sbjct: 271 FIAEDGVSGVLLHHKTAKDNPPVTFAIAACETQNVNVTVLPVFGLSGEGHD-SAKQMWDR 329

Query: 324 IKEHGSFDRLN-SMETSVTSEPGSSIGAAIAASVTVPPDSEGQVTFSLAWDCPEVNFMSG 382
           +K++G FDR N    TS+ S  G ++ AA++AS  V P     V F LAW  P++ F  G
Sbjct: 330 MKQNGHFDRENFEAGTSMPSSSGETLCAAVSASTWVEPHGRCTVVFGLAWSSPKIKFQKG 389

Query: 383 KTYYRRYTKFYGTHQNAAANIARDAILEHGSWELQIEAWQRPILEDKRLPEWYPITLFNE 442
            TY RRYT+FYGT + +A N+  DA+ ++  WE +IE WQ PIL+++RLPEWY  TLFNE
Sbjct: 390 CTYNRRYTEFYGTSERSAVNLVHDALTKYRIWEEEIEKWQNPILKNERLPEWYKFTLFNE 449

Query: 443 LYYLNAGGAVWTDGSPPV-------HSLVTIGHRKFSLDWSQSDLK------RIVDVPNQ 489
           LY+L AGG VWTDG PPV        S      ++ + D  Q  +K          V N 
Sbjct: 450 LYFLVAGGTVWTDGQPPVIDEKPSPGSNQQKSSKRGTRDTKQESVKDNHVKLTAEQVTNG 509

Query: 490 NDTAVNILERMSSILEQIYTPVALNSSFGTN--------LLQDGEENIGQFLYLEGIEYL 541
            D A N  E+  S    ++ P    ++ G          L ++G EN+G+FLYLEG+EY+
Sbjct: 510 GDLA-NGEEQSVSKYAAVHGPQMAKATNGLGSQEPIPYLLSKNGPENVGKFLYLEGVEYI 568

Query: 542 MWNTYDVHFYSSFALIMLFPKIQLSIQRDFAAAVMMHDPSKMKLLDDGQWVSRKVLGAVP 601
           MW TYDVHFY+SFAL+ LFPKI+LSIQRDFA AV+  D  +MK L DG    RKV GAVP
Sbjct: 569 MWCTYDVHFYASFALLDLFPKIELSIQRDFANAVLYEDRRRMKFLADGTSGIRKVKGAVP 628

Query: 602 HDIGICDPWFEVNAYCLYDTARWKDLNPKFVLQVYRDVVATGDKKFAKAVWPSVYVAMAY 661
           HD+G  DPW E+NAY ++DT++WKDLNPKFVLQVYRD  ATGD  F + VWP+V  AM Y
Sbjct: 629 HDLGTHDPWHEMNAYNIHDTSKWKDLNPKFVLQVYRDFAATGDMSFGRDVWPAVCAAMDY 688

Query: 662 MDQFDRDGDGMIENDGFPDQTYDTWSVSGISAYSGGLWVAALQAASALAREVGDRGSEDY 721
           M+QFDRDGDG+IENDGFPDQTYD W+V GISAY GGLW+AALQAA+ +A  +GDR   + 
Sbjct: 689 MNQFDRDGDGLIENDGFPDQTYDAWTVHGISAYCGGLWLAALQAAATMAHRLGDRPFAEK 748

Query: 722 FLFKFQKAKVVYE-KLWNGSYFNYDNSGSSQSSSIQADQLAGQWYARACGLLPIVDEDKA 780
           +  KF +AK VYE KLWNGSYFNYD+  SS S SIQADQLAGQWYA + GL P+ DE+K 
Sbjct: 749 YKLKFIQAKAVYEAKLWNGSYFNYDSGTSSNSRSIQADQLAGQWYAASSGLPPLFDENKI 808

Query: 781 RSALKKVYNYNVLKVMGGKRGAVNGMLPDGRVDMSSMQSREIWSGVTYAVAASMIHEDLA 840
           RSAL+K++ +NV+KV GG+ GAVNGM P+G+VD + MQSREIW+GVTY VAA+M+   + 
Sbjct: 809 RSALQKIFEFNVMKVKGGRLGAVNGMTPNGKVDETCMQSREIWTGVTYGVAANMLLHGME 868

Query: 841 DIGFQTACGIYEAAWSGTGLGYAFQTPEAWNTDDQYRSLCYMRPLAIWAMQWALTRPK 898
             GF TA GI+ A WS  G GY FQTPE W  D  YRSL YMRPLAIWAMQWA + PK
Sbjct: 869 HQGFTTAEGIFIAGWSEEGYGYWFQTPEGWTIDGHYRSLIYMRPLAIWAMQWARSPPK 926


>gi|115482480|ref|NP_001064833.1| Os10g0473400 [Oryza sativa Japonica Group]
 gi|110289241|gb|AAP54244.2| expressed protein [Oryza sativa Japonica Group]
 gi|113639442|dbj|BAF26747.1| Os10g0473400 [Oryza sativa Japonica Group]
 gi|215695411|dbj|BAG90602.1| unnamed protein product [Oryza sativa Japonica Group]
          Length = 974

 Score =  954 bits (2466), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 488/898 (54%), Positives = 618/898 (68%), Gaps = 28/898 (3%)

Query: 27  DPAKPASLTWQRKLSTGEIPLSQFTLNWKETVQLAPIGVRILCLIREEAAKGKRAFIDPF 86
           D   P    W+R+L++    L +F++ + E +++  +G+R+   +REEA+ G++A IDPF
Sbjct: 31  DGGSPPEQAWRRRLNSHANLLKEFSVTFMEAMRMMSLGLRLWSYVREEASHGRKAPIDPF 90

Query: 87  IKRHL--TSSHGVPLGGVGSGSIGRSYRGEFQRWQIFPRVCEDKPVLANQFSVFVSRSNG 144
            K     ++S GVPLGG+GSGSI R +RGEF+ W I P +CE  PV+ NQFS+FVSR  G
Sbjct: 91  TKEKCKPSASQGVPLGGMGSGSISRGFRGEFKNWHIIPGLCETSPVMENQFSIFVSRDGG 150

Query: 145 -QKYSSVLCPKTPEVLKDTTAAGIGSWDWNLKGDKSTYHALYPRAWTVHEGEPDPELRIV 203
            +KYSSVL P   E LK    +GI SWDWNL G  STYHAL+PRAWTV++GEPDP+L+I 
Sbjct: 151 NKKYSSVLSPGHHEGLKKCNDSGISSWDWNLSGQHSTYHALFPRAWTVYDGEPDPDLKIS 210

Query: 204 CRQISPIIPHNYKESSYPVSVFTYTIYNSGKTSADITLLFTWTNSVGGDSEFTGQHYNSK 263
           CRQISP IPH+YK+SS P SVF YT+ N+GK  A ++LL TW NS+GG S  +G H+N  
Sbjct: 211 CRQISPFIPHDYKDSSLPTSVFVYTLVNTGKDRAKVSLLMTWANSIGGFSHHSGGHFNEP 270

Query: 264 TKMNDGVHAVLLHHRTSHQLPPVTFALAAQETDGVHVSLCPHFVISGNSLGLTAKDMWHE 323
               DGV  VLLHH+T+   PPVTFA+AA ET  V+V++ P F +SG     +AK MW  
Sbjct: 271 FIAEDGVSGVLLHHKTAKDNPPVTFAIAACETQNVNVTVLPVFGLSGEGHD-SAKQMWDR 329

Query: 324 IKEHGSFDRLN-SMETSVTSEPGSSIGAAIAASVTVPPDSEGQVTFSLAWDCPEVNFMSG 382
           +K++G FDR N    TS+ S  G ++ AA++AS  V P     V F LAW  P++ F  G
Sbjct: 330 MKQNGHFDRENFEAGTSMPSSSGETLCAAVSASTWVEPHGRCTVVFGLAWSSPKIKFQKG 389

Query: 383 KTYYRRYTKFYGTHQNAAANIARDAILEHGSWELQIEAWQRPILEDKRLPEWYPITLFNE 442
            TY RRYT+FYGT + +A N+  DA+ ++  WE +IE WQ PIL++++LPEWY  TLFNE
Sbjct: 390 CTYNRRYTEFYGTSERSAVNLVHDALTKYRIWEEEIEKWQNPILKNEKLPEWYKFTLFNE 449

Query: 443 LYYLNAGGAVWTDGSPPV-------HSLVTIGHRKFSLDWSQSDLK------RIVDVPNQ 489
           LY+L AGG VWTDG PPV        S      ++ + D  Q  +K          V N 
Sbjct: 450 LYFLVAGGTVWTDGQPPVIDEKPSPGSNQQKSSKRGTRDTKQESVKDNHVKLTAEQVTNG 509

Query: 490 NDTAVNILERMSSILEQIYTPVALNSSFGTN--------LLQDGEENIGQFLYLEGIEYL 541
            D A N  E+  S    ++ P    ++ G          L ++G EN+G+FLYLEG+EY+
Sbjct: 510 GDLA-NGEEQSVSKYAAVHGPQMAKATNGLGSQEPIPYLLSKNGPENVGKFLYLEGVEYI 568

Query: 542 MWNTYDVHFYSSFALIMLFPKIQLSIQRDFAAAVMMHDPSKMKLLDDGQWVSRKVLGAVP 601
           MW TYDVHFY+SFAL+ LFPKI+LSIQRDFA AV+  D  +MK L DG    RKV GAVP
Sbjct: 569 MWCTYDVHFYASFALLDLFPKIELSIQRDFANAVLYEDRRRMKFLADGTSGIRKVKGAVP 628

Query: 602 HDIGICDPWFEVNAYCLYDTARWKDLNPKFVLQVYRDVVATGDKKFAKAVWPSVYVAMAY 661
           HD+G  DPW E+NAY ++DT++WKDLNPKFVLQVYRD  ATGD  F + VWP+V   M Y
Sbjct: 629 HDLGTHDPWHEMNAYNIHDTSKWKDLNPKFVLQVYRDFAATGDMSFGRDVWPAVCAVMDY 688

Query: 662 MDQFDRDGDGMIENDGFPDQTYDTWSVSGISAYSGGLWVAALQAASALAREVGDRGSEDY 721
           M+QFDRDGDG+IENDGFPDQTYD W+V GISAY GGLW+AALQAA+ +A  +GDR   + 
Sbjct: 689 MNQFDRDGDGLIENDGFPDQTYDAWTVHGISAYCGGLWLAALQAAATMAHRLGDRPFAEK 748

Query: 722 FLFKFQKAKVVYE-KLWNGSYFNYDNSGSSQSSSIQADQLAGQWYARACGLLPIVDEDKA 780
           +  KF +AK VYE KLWNGSYFNYD+  SS S SIQADQLAGQWYA + GL P+ DE+K 
Sbjct: 749 YKLKFIQAKAVYEAKLWNGSYFNYDSGTSSNSRSIQADQLAGQWYAASSGLPPLFDENKI 808

Query: 781 RSALKKVYNYNVLKVMGGKRGAVNGMLPDGRVDMSSMQSREIWSGVTYAVAASMIHEDLA 840
           RSAL+K++ +NV+KV GG+ GAVNGM P+G+VD + MQSREIW+GVTY VAA+M+   + 
Sbjct: 809 RSALQKIFEFNVMKVKGGRLGAVNGMTPNGKVDETCMQSREIWTGVTYGVAANMLLHGME 868

Query: 841 DIGFQTACGIYEAAWSGTGLGYAFQTPEAWNTDDQYRSLCYMRPLAIWAMQWALTRPK 898
             GF TA GI+ A WS  G GY FQTPE W  D  YRSL YMRPLAIWAMQWA + PK
Sbjct: 869 HQGFTTAEGIFIAGWSEEGYGYWFQTPEGWTIDGHYRSLIYMRPLAIWAMQWARSPPK 926


>gi|168037390|ref|XP_001771187.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162677567|gb|EDQ64036.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 981

 Score =  954 bits (2465), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 514/943 (54%), Positives = 661/943 (70%), Gaps = 49/943 (5%)

Query: 2   FEGKILENGLHEEEKEPLNSSFDKVDPAKPASLTWQRKLSTGEIPLSQFTLNWKETVQLA 61
           F G+     L E   +P  +  D  DP  P++L W+RKL++    L++F + + E V++ 
Sbjct: 30  FHGRRPSWPLAEYVHKPTLAMLDH-DPGAPSALAWRRKLNSHADFLTEFRVTFMEAVRMI 88

Query: 62  PIGVRILCLIREEAAKGKRAFIDPFIKRHLTSS-HGVPLGGVGSGSIGRSYRGEFQRWQI 120
            +G+R+   ++ E A+G+   IDPF +    S+ HGVP GG+G GSIGR +RG+F+RWQ+
Sbjct: 89  TLGMRMWFYVKSERAQGRAPPIDPFNRETKPSACHGVPCGGMGGGSIGRGFRGDFRRWQL 148

Query: 121 FPRVCEDKPVLANQFSVFVSRSNG----QKYSSVLCPKTPEVLKDTTA-AGIGSWDWNLK 175
            P VCE+ PVLA+QFSVFV R  G    +K +SVL P  P+ L DT     + SWDWNL 
Sbjct: 149 IPGVCEEAPVLADQFSVFVKRDKGNGVVKKDASVLYPGRPQELSDTKDDTSVSSWDWNLD 208

Query: 176 GDKSTYHALYPRAWTVHEGEPDPELRIVCRQISPIIPHNYKESSYPVSVFTYTIYNSGKT 235
           G  STYHAL+PRAWT++EGEPDP+L+I CRQ+SP IPHNY+ESS+P  VF+Y + N+GK 
Sbjct: 209 GQNSTYHALFPRAWTIYEGEPDPDLKISCRQVSPFIPHNYEESSFPCCVFSYVVMNTGKQ 268

Query: 236 SADITLLFTWTNSVGGDSEFTGQHYNS--KTKMNDGVHAVLLHHRTSHQLPPVTFALAAQ 293
           +AD+TLLFTW NS+GG+S  TG H+N   +T    G               PVTFA+AA+
Sbjct: 269 AADVTLLFTWANSIGGNSATTGGHFNEPFETGAKGGR--------------PVTFAIAAK 314

Query: 294 ETDGVHVSLCPHFVISGNSLGLTAKDMWHEIKEHGSFDRLN-SMETSVTSEPGSSIGAAI 352
           E   V ++ CP F + G  + +TAKDMW E+KE+GSFD+ N + + S  +  GS+IGAAI
Sbjct: 315 EMRDVTLTTCPCFSLGGKEVEVTAKDMWTEVKENGSFDKENWNFKPSAPTRQGSAIGAAI 374

Query: 353 AASVTVPPDSEGQVTFSLAWDCPEVNFMSGKTYYRRYTKFYGTHQNAAANIARDAILEHG 412
           AA V VPP  +  V FSLAWD PEV F+ GK+Y+RRYT FYGT  +AA+ + RDA+ ++ 
Sbjct: 375 AARVEVPPGEKRDVVFSLAWDSPEVKFLKGKSYHRRYTSFYGTAGDAASKLTRDALRDYR 434

Query: 413 SWELQIEAWQRPILEDKRLPEWYPITLFNELYYLNAGGAVWTDGSPPVHSLVTIGHRKFS 472
            WE+ IE WQ+P+L+D++LPEWY +TLFNELYYL AGG +WT               +  
Sbjct: 435 KWEIAIEDWQQPVLKDEKLPEWYRVTLFNELYYLVAGGTIWT--------------AELE 480

Query: 473 LDWSQSDLKRIVDVPNQNDTAVNILERMSSILEQIYTPVALNS----SFGTNLLQDGEEN 528
           L  SQ + +R+      +      LE +  IL+Q+ T +  +S    + G +LLQ+GEEN
Sbjct: 481 LAASQ-ETERLQSGETADQMQTETLEHL--ILKQMQTAMLGSSEAGIATGPSLLQEGEEN 537

Query: 529 IGQFLYLEGIEYLMWNTYDVHFYSSFALIMLFPKIQLSIQRDFAAAVMMHDPSKMKLLDD 588
           +GQFLYLEGIEY+MWNTYDVHFY+SFAL+ LFPK++L+IQRD AAA +  +P K+K + D
Sbjct: 538 VGQFLYLEGIEYIMWNTYDVHFYASFALLALFPKLELAIQRDVAAATLSQNPEKVKYMAD 597

Query: 589 GQWVSRKVLGAVPHDIGICDPWFEVNAYCLYDTARWKDLNPKFVLQVYRDVVATGDKKFA 648
           G    RKV GAVPHD+G  DPW EVNAY ++DT+ WKDLNPKFVLQVYRDVVATGD++FA
Sbjct: 598 GGVGIRKVFGAVPHDLGQHDPWVEVNAYNIHDTSHWKDLNPKFVLQVYRDVVATGDRQFA 657

Query: 649 KAVWPSVYVAMAYMDQFDRDGDGMIENDGFPDQTYDTWSVSGISAYSGGLWVAALQAASA 708
           KAVWP+VY AMAYMDQFDRD DG+IENDGFPDQTYDTWSV G+SAY GGLW+AALQAA+A
Sbjct: 658 KAVWPAVYAAMAYMDQFDRDRDGLIENDGFPDQTYDTWSVHGVSAYCGGLWIAALQAAAA 717

Query: 709 LAREVGDRGSEDYFLFKFQKAKVVYE-KLWNGSYFNYDNSGSSQSSSIQADQLAGQWYAR 767
           +A  V D+ +  YF  KF +A+ VYE KLWNG YFNYD+  SS S+S+QADQLAGQWYA 
Sbjct: 718 MADLVDDKDAATYFKGKFNQARDVYERKLWNGEYFNYDSGTSSNSNSVQADQLAGQWYAW 777

Query: 768 ACGLLPIVDEDKARSALKKVYNYNVLKVMGGKRGAVNGMLPDGRVDMSSMQSREIWSGVT 827
           A GL P+ D+ KARSAL+KV+++NV+KV GGK GA NGM P+G+VD + MQSREIW+GVT
Sbjct: 778 ASGLPPLFDDYKARSALQKVFDFNVMKVKGGKWGAANGMHPNGKVDETCMQSREIWTGVT 837

Query: 828 YAVAASMIHEDLADIGFQTACGIYEAAWSGTGLGYAFQTPEAWNTDDQYRSLCYMRPLAI 887
           YA +A+MIHE + +  F  A G++ A WS   LGY FQTPEAW  D  +RSL YMRPLAI
Sbjct: 838 YAASAAMIHEGMVEQAFTAAQGVFLAGWS--DLGYWFQTPEAWTIDGYFRSLAYMRPLAI 895

Query: 888 WAMQWALTRPKPKTLEKQMKPEVTDESLLRYHAGFSKVARLLK 930
           WAMQWAL  PK    E    P +   S L  H  FS +A  L+
Sbjct: 896 WAMQWALYPPK-SIAEAPRVPSMDRGSALVTHDRFSSLANALR 937


>gi|357146604|ref|XP_003574050.1| PREDICTED: non-lysosomal glucosylceramidase-like [Brachypodium
           distachyon]
          Length = 962

 Score =  952 bits (2460), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 489/894 (54%), Positives = 623/894 (69%), Gaps = 30/894 (3%)

Query: 26  VDPAKPASLTWQRKLSTGEIPLSQFTLNWKETVQLAPIGVRILCLIREEAAKGKRAFIDP 85
           +D A P    W+R+L++    L +F++ + E +++  +GVR+   +REEA+ G++A IDP
Sbjct: 30  LDGAAPPEQAWRRRLNSHANILKEFSVTFMEAMKMMSLGVRLWSYVREEASHGRKAPIDP 89

Query: 86  FIKRHL--TSSHGVPLGGVGSGSIGRSYRGEFQRWQIFPRVCEDKPVLANQFSVFVSRSN 143
           F +     ++S GVPLGG+GSGSI R +RGEF+ W I P +CE+ PV+ NQFS+FVSR  
Sbjct: 90  FTRESCKPSASQGVPLGGMGSGSISRGFRGEFKNWHIIPGLCENSPVMENQFSIFVSRDG 149

Query: 144 G-QKYSSVLCPKTPEVLKDTTAAGIGSWDWNLKGDKSTYHALYPRAWTVHEGEPDPELRI 202
           G +K SSVL P   + LK  + +GI SWDWNL G  STYHAL+PRAWTV++GEPDP+L+I
Sbjct: 150 GNKKCSSVLAPGHHDGLKKYSDSGISSWDWNLSGQHSTYHALFPRAWTVYDGEPDPDLKI 209

Query: 203 VCRQISPIIPHNYKESSYPVSVFTYTIYNSGKTSADITLLFTWTNSVGGDSEFTGQHYNS 262
            CRQISP IPH+YKESS P SVF YT+ N+G+  A ++LL TW NS+GG S  +G H+N 
Sbjct: 210 SCRQISPFIPHDYKESSLPTSVFVYTLVNTGRDRAKVSLLMTWANSIGGFSHHSGGHFNE 269

Query: 263 KTKMNDGVHAVLLHHRTSHQLPPVTFALAAQETDGVHVSLCPHFVISGNSLGLTAKDMWH 322
                DGV  VLLHH+T+   PPVTFA+AA ET  V+V++ P F +SG +  ++AKDMW 
Sbjct: 270 PFIGEDGVSGVLLHHKTAKDNPPVTFAIAACETQNVNVTVLPVFGLSGEN-HVSAKDMWD 328

Query: 323 EIKEHGSFDRLN-SMETSVTSEPGSSIGAAIAASVTVPPDSEGQVTFSLAWDCPEVNFMS 381
            +K+ G F+  N +   S+ S PG ++ AA+ AS  V P     V F+L+W  P+V F  
Sbjct: 329 IMKKDGHFNLENFNAGCSMPSSPGETLCAAVTASTWVEPHGRCTVAFALSWSSPKVKFQK 388

Query: 382 GKTYYRRYTKFYGTHQNAAANIARDAILEHGSWELQIEAWQRPILEDKRLPEWYPITLFN 441
           G TY RRYT+FYGT + ++ N+  DA+ ++  WE +IE WQ PIL D+RLPEWY  TLFN
Sbjct: 389 GCTYNRRYTEFYGTSERSSINLVHDALTKYRLWEEEIEKWQNPILRDERLPEWYKFTLFN 448

Query: 442 ELYYLNAGGAVWTDGSPPVHSLVTIGHRKFSLDWSQSDLKRIVDVPNQNDTAVNILERMS 501
           ELY+L AGG VWTDG PP      I  +       Q   K+ +    ++++  + L R +
Sbjct: 449 ELYFLVAGGTVWTDGQPPA-----IDEKTNPASNQQKHSKKPIK-DTKSESVKDNLPRPT 502

Query: 502 SILEQIYTPVALNSSF-----GTNLL-----------QDGEENIGQFLYLEGIEYLMWNT 545
           +  EQ++    L +        TN L           +DG EN+G+FLYLEG+EY+MWNT
Sbjct: 503 A--EQVFNGDDLTNGGPQMPEQTNGLRVQEPVPCIHSKDGPENVGKFLYLEGVEYIMWNT 560

Query: 546 YDVHFYSSFALIMLFPKIQLSIQRDFAAAVMMHDPSKMKLLDDGQWVSRKVLGAVPHDIG 605
           YDVHFY+SFAL+ LFPKI+LSIQRDFA AV+  D  ++K L DG    RKV GAVPHD+G
Sbjct: 561 YDVHFYASFALLDLFPKIELSIQRDFADAVLYEDRRRVKFLADGTSGIRKVKGAVPHDLG 620

Query: 606 ICDPWFEVNAYCLYDTARWKDLNPKFVLQVYRDVVATGDKKFAKAVWPSVYVAMAYMDQF 665
             DPW E+NAY ++DT++WKDLNPKFVLQVYRD  ATGD  F + VWP+V  AM YMDQF
Sbjct: 621 THDPWHEMNAYNIHDTSKWKDLNPKFVLQVYRDFAATGDMTFGRDVWPAVCAAMDYMDQF 680

Query: 666 DRDGDGMIENDGFPDQTYDTWSVSGISAYSGGLWVAALQAASALAREVGDRGSEDYFLFK 725
           DRDGDG+IENDGFPDQTYD W+V GISAY G LW+AALQAA+ +A  +GDR   + +  K
Sbjct: 681 DRDGDGLIENDGFPDQTYDAWTVHGISAYCGCLWLAALQAAATMAHRLGDRPYAEKYKLK 740

Query: 726 FQKAKVVYE-KLWNGSYFNYDNSGSSQSSSIQADQLAGQWYARACGLLPIVDEDKARSAL 784
           F KAK VYE KLWNGSYFNYD+  SS S SIQADQLAGQWYA + GL PI DE K RSAL
Sbjct: 741 FIKAKAVYEAKLWNGSYFNYDSGTSSNSRSIQADQLAGQWYAASSGLPPIFDEHKIRSAL 800

Query: 785 KKVYNYNVLKVMGGKRGAVNGMLPDGRVDMSSMQSREIWSGVTYAVAASMIHEDLADIGF 844
           +K++ +NV+KV GG+ GAVNGM P G+VD + MQSREIW+GVTY VAA+M+   +   GF
Sbjct: 801 QKIFEFNVMKVKGGRMGAVNGMTPKGKVDETCMQSREIWTGVTYGVAANMLLHGMEHQGF 860

Query: 845 QTACGIYEAAWSGTGLGYAFQTPEAWNTDDQYRSLCYMRPLAIWAMQWALTRPK 898
            TA GI+ A WS  G GY FQTPE W TD  YRSL YMRPLAIWAMQWAL+ PK
Sbjct: 861 ITAEGIFLAGWSEEGYGYWFQTPEGWTTDGHYRSLIYMRPLAIWAMQWALSPPK 914


>gi|357139475|ref|XP_003571307.1| PREDICTED: non-lysosomal glucosylceramidase-like [Brachypodium
           distachyon]
          Length = 946

 Score =  949 bits (2453), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 510/919 (55%), Positives = 640/919 (69%), Gaps = 26/919 (2%)

Query: 26  VDPAKPASLTWQRKLSTGEIPLSQFTLNWKETVQLAPIGVRILCLIREEAAKGKRAFIDP 85
           +D A+P   TWQRK       ++ F+    + + +AP+ V+++ ++ EE AKGK +  DP
Sbjct: 15  IDCAQPPERTWQRKFDDEGKEIAMFSFTMNDIMTVAPLIVKMMRILVEERAKGKASVYDP 74

Query: 86  FIKRHLTSSHGVPLGGVGSGSIGRSYRGEFQRWQIFPRVCEDKPVLANQFSVFVSRSNGQ 145
           F K       GVPLG +G+GSIGRSYRG FQ +Q+FP + E KP+LANQFS FVSR  G+
Sbjct: 75  FKKWMSNCYRGVPLGTLGAGSIGRSYRGYFQHFQLFPGIYEQKPILANQFSAFVSRPGGK 134

Query: 146 KYSSVLCPKTPEVLKDTTAAGIGSWDWNLKGDKSTYHALYPRAWTVHEGEPDPELRIVCR 205
            YS+VL     EVLK    A IGSWDWNLK    +YH L+PR+WTV++GEPDPE++I CR
Sbjct: 135 SYSTVLSAPPAEVLKGIDKASIGSWDWNLKEKNCSYHGLFPRSWTVYDGEPDPEIKITCR 194

Query: 206 QISPIIPHNYKESSYPVSVFTYTIYNSGKTSADITLLFTWTNSVGGDSEFTGQHYNSKTK 265
           QISP IP NYKESS+PV+VFT+T+ NSG T AD+TLLFTW NSVGG SE TG H NS  +
Sbjct: 195 QISPFIPRNYKESSFPVAVFTFTVQNSGSTPADVTLLFTWANSVGGKSELTGNHSNSTMR 254

Query: 266 MNDGVHAVLLHHRTSHQLPPVTFALAAQETDGVHVSLCPHFVI--SGNSLGLTAKDMWHE 323
             DGVH VLL H T+   PPVTFA+A+Q+T  V V+ CP F +  S  S   TAK+MW E
Sbjct: 255 ARDGVHGVLLRHSTAEGHPPVTFAIASQDTGDVRVTCCPSFSMGPSSKSGEPTAKEMWEE 314

Query: 324 IKEHGSFDRLNSMETSVTSEPGSSIGAAIAASVTVPPDSEGQVTFSLAWDCPEVNFMSGK 383
           IK+HGSF    +  T   S PGSS+GAA+AA+ TV      +V+F+L+W CP V F +G+
Sbjct: 315 IKKHGSFGDA-AAGTGRPSRPGSSVGAAVAAATTVTAGCTREVSFALSWSCPVVKFPAGR 373

Query: 384 TYYRRYTKFYGTHQNAAA-NIARDAILEHGSWELQIEAWQRPILEDKRLPEWYPITLFNE 442
           TY+RR+TKF G  ++AAA  +A  A+LEH  WE QIEAWQRP+L+DK LP+WYP+ LFNE
Sbjct: 374 TYHRRHTKFVGLDRDAAAEQLAHHALLEHMEWERQIEAWQRPVLQDKSLPDWYPVALFNE 433

Query: 443 LYYLNAGGAVWTDGSPPVHSLVTIGHRKFSLDWSQSDLKRIVDVPNQNDTAVNILERMSS 502
           LYYLNAGG +WTDG P   + +          +S +      +  + +  A +IL  M+ 
Sbjct: 434 LYYLNAGGTIWTDGMPSKKTSLVSSSSGTMEPFSLAAFHPDPNATSSSTAADDILLAMAR 493

Query: 503 ILEQIYTPVALNSSFGTNLLQDGEENIGQFLYLEGIEYLMWNTYDVHFYSSFALIMLFPK 562
             E +  P A           + E+ +G+FLYLEG+EY M+NTYDVHFY+SFAL+ LFP 
Sbjct: 494 AEEHL-LPAA-----------EDEKGVGKFLYLEGMEYHMYNTYDVHFYASFALLSLFPD 541

Query: 563 IQLSIQRDFAAAVMMHDPSKMKLLDDGQWVSRKVLGAVPHDIGICDPWFEVNAYCLYDTA 622
           ++LS+QRDFAAAV+ HDP  M  L DG++V RKVLGAVPHDIG+ DPW E+NAY ++D +
Sbjct: 542 LELSLQRDFAAAVLRHDPRLMYTL-DGKFVPRKVLGAVPHDIGLNDPWHELNAYMIHDPS 600

Query: 623 RWKDLNPKFVLQVYRDVVATGDKKFAKAVWPSVYVAMAYMDQFDRDGDGMIENDGFPDQT 682
           RWKDLNPKFVLQVYRDV ATGD  FA++ WPSVYVA+AYMDQFDRD DGM+EN+G PDQT
Sbjct: 601 RWKDLNPKFVLQVYRDVAATGDLDFARSAWPSVYVALAYMDQFDRDRDGMVENEGRPDQT 660

Query: 683 YDTWSVSGISAYSGGLWVAALQAASALAREVGDRGSEDYFLFKFQKAKVVYE-KLWNGSY 741
           YD WSVSG+SAY+GGLWVAAL+AA+A+A  VGD  +E  FL ++ +A  VY+ +LW G Y
Sbjct: 661 YDLWSVSGVSAYTGGLWVAALRAATAMAALVGDLPAEAVFLERYNRANKVYDSELWTGDY 720

Query: 742 FNYDNSGSSQSSSIQADQLAGQWYARACGLLPIVDEDKARSALKKVYNYNVLKVMGGKRG 801
           F YDNSG   S S+ ADQLAGQWYARACGL P+V  DKAR AL  V  +NV++V GG  G
Sbjct: 721 FRYDNSGGGNSESVMADQLAGQWYARACGLEPVVGRDKARRALAAVLEHNVMQVQGGGVG 780

Query: 802 AVNGM-LPD--GRVDMSSMQSREIWSGVTYAVAASMIHEDLADIGFQTACGIYEAAWSGT 858
           AVNG  LP+  G VD SS QS+E+W+G TYAVAA+MI E + + GF  A G Y A W   
Sbjct: 781 AVNGARLPEHGGGVDESSTQSKEVWTGTTYAVAAAMIGEGMREEGFTAAKGAYGAGWGED 840

Query: 859 GLGYAFQTPEAWNTDDQ--YRSLCYMRPLAIWAMQWALTRPKPKTLEKQ---MKPEVTDE 913
           G GYAFQ PE+W  D    YRSL YMRPLA+WAMQWAL+ P P   E +    K + T+E
Sbjct: 841 GYGYAFQMPESWTADGAGGYRSLHYMRPLAVWAMQWALSPPTPVLPELERMAAKSKATEE 900

Query: 914 SLLRYHAGFSKVARLLKLP 932
            +      F KVA LL+LP
Sbjct: 901 EVELAKEKFEKVASLLRLP 919


>gi|16905165|gb|AAL31035.1|AC078948_19 unknown protein [Oryza sativa Japonica Group]
          Length = 967

 Score =  937 bits (2421), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 483/898 (53%), Positives = 612/898 (68%), Gaps = 35/898 (3%)

Query: 27  DPAKPASLTWQRKLSTGEIPLSQFTLNWKETVQLAPIGVRILCLIREEAAKGKRAFIDPF 86
           D   P    W+R+L++    L +F++ + E +++  +G+R+   +REEA+ G++A IDPF
Sbjct: 31  DGGSPPEQAWRRRLNSHANLLKEFSVTFMEAMRMMSLGLRLWSYVREEASHGRKAPIDPF 90

Query: 87  IKRHL--TSSHGVPLGGVGSGSIGRSYRGEFQRWQIFPRVCEDKPVLANQFSVFVSRSNG 144
            K     ++S GVPLGG+GSGSI R +RGEF+ W I P +CE  P+       FVSR  G
Sbjct: 91  TKEKCKPSASQGVPLGGMGSGSISRGFRGEFKNWHIIPGLCETSPI-------FVSRDGG 143

Query: 145 -QKYSSVLCPKTPEVLKDTTAAGIGSWDWNLKGDKSTYHALYPRAWTVHEGEPDPELRIV 203
            +KYSSVL P   E LK    +GI SWDWNL G  STYHAL+PRAWTV++GEPDP+L+I 
Sbjct: 144 NKKYSSVLSPGHHEGLKKCNDSGISSWDWNLSGQHSTYHALFPRAWTVYDGEPDPDLKIS 203

Query: 204 CRQISPIIPHNYKESSYPVSVFTYTIYNSGKTSADITLLFTWTNSVGGDSEFTGQHYNSK 263
           CRQISP IPH+YK+SS P SVF YT+ N+GK  A ++LL TW NS+GG S  +G H+N  
Sbjct: 204 CRQISPFIPHDYKDSSLPTSVFVYTLVNTGKDRAKVSLLMTWANSIGGFSHHSGGHFNEP 263

Query: 264 TKMNDGVHAVLLHHRTSHQLPPVTFALAAQETDGVHVSLCPHFVISGNSLGLTAKDMWHE 323
               DGV  VLLHH+T+   PPVTFA+AA ET  V+V++ P F +SG     +AK MW  
Sbjct: 264 FIAEDGVSGVLLHHKTAKDNPPVTFAIAACETQNVNVTVLPVFGLSGEGHD-SAKQMWDR 322

Query: 324 IKEHGSFDRLN-SMETSVTSEPGSSIGAAIAASVTVPPDSEGQVTFSLAWDCPEVNFMSG 382
           +K++G FDR N    TS+ S  G ++ AA++AS  V P     V F LAW  P++ F  G
Sbjct: 323 MKQNGHFDRENFEAGTSMPSSSGETLCAAVSASTWVEPHGRCTVVFGLAWSSPKIKFQKG 382

Query: 383 KTYYRRYTKFYGTHQNAAANIARDAILEHGSWELQIEAWQRPILEDKRLPEWYPITLFNE 442
            TY RRYT+FYGT + +A N+  DA+ ++  WE +IE WQ PIL++++LPEWY  TLFNE
Sbjct: 383 CTYNRRYTEFYGTSERSAVNLVHDALTKYRIWEEEIEKWQNPILKNEKLPEWYKFTLFNE 442

Query: 443 LYYLNAGGAVWTDGSPPV-------HSLVTIGHRKFSLDWSQSDLK------RIVDVPNQ 489
           LY+L AGG VWTDG PPV        S      ++ + D  Q  +K          V N 
Sbjct: 443 LYFLVAGGTVWTDGQPPVIDEKPSPGSNQQKSSKRGTRDTKQESVKDNHVKLTAEQVTNG 502

Query: 490 NDTAVNILERMSSILEQIYTPVALNSSFGTN--------LLQDGEENIGQFLYLEGIEYL 541
            D A N  E+  S    ++ P    ++ G          L ++G EN+G+FLYLEG+EY+
Sbjct: 503 GDLA-NGEEQSVSKYAAVHGPQMAKATNGLGSQEPIPYLLSKNGPENVGKFLYLEGVEYI 561

Query: 542 MWNTYDVHFYSSFALIMLFPKIQLSIQRDFAAAVMMHDPSKMKLLDDGQWVSRKVLGAVP 601
           MW TYDVHFY+SFAL+ LFPKI+LSIQRDFA AV+  D  +MK L DG    RKV GAVP
Sbjct: 562 MWCTYDVHFYASFALLDLFPKIELSIQRDFANAVLYEDRRRMKFLADGTSGIRKVKGAVP 621

Query: 602 HDIGICDPWFEVNAYCLYDTARWKDLNPKFVLQVYRDVVATGDKKFAKAVWPSVYVAMAY 661
           HD+G  DPW E+NAY ++DT++WKDLNPKFVLQVYRD  ATGD  F + VWP+V   M Y
Sbjct: 622 HDLGTHDPWHEMNAYNIHDTSKWKDLNPKFVLQVYRDFAATGDMSFGRDVWPAVCAVMDY 681

Query: 662 MDQFDRDGDGMIENDGFPDQTYDTWSVSGISAYSGGLWVAALQAASALAREVGDRGSEDY 721
           M+QFDRDGDG+IENDGFPDQTYD W+V GISAY GGLW+AALQAA+ +A  +GDR   + 
Sbjct: 682 MNQFDRDGDGLIENDGFPDQTYDAWTVHGISAYCGGLWLAALQAAATMAHRLGDRPFAEK 741

Query: 722 FLFKFQKAKVVYE-KLWNGSYFNYDNSGSSQSSSIQADQLAGQWYARACGLLPIVDEDKA 780
           +  KF +AK VYE KLWNGSYFNYD+  SS S SIQADQLAGQWYA + GL P+ DE+K 
Sbjct: 742 YKLKFIQAKAVYEAKLWNGSYFNYDSGTSSNSRSIQADQLAGQWYAASSGLPPLFDENKI 801

Query: 781 RSALKKVYNYNVLKVMGGKRGAVNGMLPDGRVDMSSMQSREIWSGVTYAVAASMIHEDLA 840
           RSAL+K++ +NV+KV GG+ GAVNGM P+G+VD + MQSREIW+GVTY VAA+M+   + 
Sbjct: 802 RSALQKIFEFNVMKVKGGRLGAVNGMTPNGKVDETCMQSREIWTGVTYGVAANMLLHGME 861

Query: 841 DIGFQTACGIYEAAWSGTGLGYAFQTPEAWNTDDQYRSLCYMRPLAIWAMQWALTRPK 898
             GF TA GI+ A WS  G GY FQTPE W  D  YRSL YMRPLAIWAMQWA + PK
Sbjct: 862 HQGFTTAEGIFIAGWSEEGYGYWFQTPEGWTIDGHYRSLIYMRPLAIWAMQWARSPPK 919


>gi|357116998|ref|XP_003560263.1| PREDICTED: non-lysosomal glucosylceramidase-like [Brachypodium
           distachyon]
          Length = 932

 Score =  936 bits (2418), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 474/876 (54%), Positives = 611/876 (69%), Gaps = 31/876 (3%)

Query: 27  DPAKPASLTWQRKLSTGEIPLSQFTLNWKETVQLAPIGVRILCLIREEAAKGKRAFIDPF 86
           D   P    W+RKLS+    L +F + ++E  ++  +G+R+   IREEA+ G++A IDPF
Sbjct: 37  DDGSPPEHAWRRKLSSHANRLKEFNVTFREAFKMMKLGLRLWSYIREEASYGRKAPIDPF 96

Query: 87  IKRHL--TSSHGVPLGGVGSGSIGRSYRGEFQRWQIFPRVCEDKPVLANQFSVFVSRSNG 144
            +     ++S GVPLGG+G+GSI R +RGEF+ WQI P  CE  PV+ NQFS+F+ R + 
Sbjct: 97  TRESTKPSASQGVPLGGMGTGSISRGFRGEFKHWQITPGSCEMLPVMENQFSIFIIRGS- 155

Query: 145 QKYSSVLCPKTPEVLKDTTAAGIGSWDWNLKGDKSTYHALYPRAWTVHEGEPDPELRIVC 204
           +KYSSVL P   + LK ++  GI SWDW L+GD+STYHAL+PRAWTV++GEPDPEL++ C
Sbjct: 156 KKYSSVLAPGQHDGLKKSSNDGISSWDWKLRGDRSTYHALFPRAWTVYDGEPDPELKVSC 215

Query: 205 RQISPIIPHNYKESSYPVSVFTYTIYNSGKTSADITLLFTWTNSVGGDSEFTGQHYNSKT 264
           RQISP IPHNY+ESS P SVF YT+ N+GK  A ++L+ TW NS+GG S+ +G H N   
Sbjct: 216 RQISPFIPHNYQESSLPTSVFVYTLVNTGKERAKVSLVMTWANSIGGLSDHSGGHVNEPF 275

Query: 265 KMNDGVHAVLLHHRTSHQLPPVTFALAAQETDGVHVSLCPHFVISGNSLGLTAKDMWHEI 324
              +GV  VLLHH+T++  PPVTFA+AA E   V V++ P F +SG S  +TA++MW  I
Sbjct: 276 IGENGVSGVLLHHKTANNNPPVTFAVAACENQNVDVTVLPVFGLSGES-SVTAREMWGTI 334

Query: 325 KEHGSFDRLN-SMETSVTSEPGSSIGAAIAASVTVPPDSEGQVTFSLAWDCPEVNFMSGK 383
            + G FDR N     S+ S  G ++ AA+ AS  V       V F+LAW  PEV F  G 
Sbjct: 335 VQDGCFDRDNFKAGPSMPSSLGETVCAAVCASTWVEAHGRCTVVFALAWSSPEVKFKKGS 394

Query: 384 TYYRRYTKFYGTHQNAAANIARDAILEHGSWELQIEAWQRPILEDKRLPEWYPITLFNEL 443
           TY+RRYTKFYGT   +A N+ +DA++ +  WE +I+ WQ PIL D+RLPEWY ITLFNEL
Sbjct: 395 TYHRRYTKFYGTSPRSAINLVQDALMSYKHWEEEIDKWQTPILRDERLPEWYKITLFNEL 454

Query: 444 YYLNAGGAVWTDGSPPVHSLVTIGHRKFSLDWSQSDLKRIVDVPNQNDTAVNILERMSSI 503
           Y+L AGG VW D      SL+    +K +     S L    D+P  +    + +      
Sbjct: 455 YFLVAGGTVWIDS----ESLMVDADKKVN-----SSLPEDSDLPLHDSNCNSTVP----- 500

Query: 504 LEQIYTPVALNSSFGTNLLQDGEENIGQFLYLEGIEYLMWNTYDVHFYSSFALIMLFPKI 563
              +  P+ L+         D +EN+G+FLYLEG+EY MW TYDVHFY+SFAL+ LFPKI
Sbjct: 501 ---LIDPMPLDF--------DDKENVGKFLYLEGVEYFMWCTYDVHFYASFALLNLFPKI 549

Query: 564 QLSIQRDFAAAVMMHDPSKMKLLDDGQWVSRKVLGAVPHDIGICDPWFEVNAYCLYDTAR 623
           +LSIQRDFA AV+  D S+++ L DG W +RKV+GAVPHD+G  DPW E+NAY ++DT+R
Sbjct: 550 ELSIQRDFARAVLREDRSRVRFLADGTWGTRKVIGAVPHDLGAHDPWHELNAYNIHDTSR 609

Query: 624 WKDLNPKFVLQVYRDVVATGDKKFAKAVWPSVYVAMAYMDQFDRDGDGMIENDGFPDQTY 683
           WKDLNPKFVLQVYRD  ATGD  F K VWP+V  AM YM+QFDRDGDGMIENDGFPDQTY
Sbjct: 610 WKDLNPKFVLQVYRDFAATGDMSFGKDVWPAVCTAMEYMEQFDRDGDGMIENDGFPDQTY 669

Query: 684 DTWSVSGISAYSGGLWVAALQAASALAREVGDRGSEDYFLFKFQKAKVVYE-KLWNGSYF 742
           D W+V G+SAY G LW+A+LQAA+A+AR +G     +  + KF KAK V+E KLWNGSYF
Sbjct: 670 DAWTVLGVSAYCGCLWLASLQAAAAMARGLGHNDYAERCMVKFAKAKHVFETKLWNGSYF 729

Query: 743 NYDNSGSSQSSSIQADQLAGQWYARACGLLPIVDEDKARSALKKVYNYNVLKVMGGKRGA 802
           NYD+  S  S SIQADQLAGQWYA + GL P+ DE + +  L+K+++YNV++V GG+ GA
Sbjct: 730 NYDSGASYSSRSIQADQLAGQWYAASSGLPPLFDEGRIKCTLQKIFDYNVMRVKGGRMGA 789

Query: 803 VNGMLPDGRVDMSSMQSREIWSGVTYAVAASMIHEDLADIGFQTACGIYEAAWSGTGLGY 862
           VNGM P+G+VD + MQSREIW+GVTY++AA+M+   + D  F TA GI+ A WS  G GY
Sbjct: 790 VNGMYPNGKVDETCMQSREIWTGVTYSLAATMLLHGMEDQAFTTAEGIFLAGWSEEGYGY 849

Query: 863 AFQTPEAWNTDDQYRSLCYMRPLAIWAMQWALTRPK 898
            FQTPEAW  D  YRSL YMRPLAIWAMQ+AL+ PK
Sbjct: 850 WFQTPEAWTIDGHYRSLIYMRPLAIWAMQYALSPPK 885


>gi|225433442|ref|XP_002285674.1| PREDICTED: non-lysosomal glucosylceramidase-like [Vitis vinifera]
          Length = 978

 Score =  929 bits (2402), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 487/959 (50%), Positives = 636/959 (66%), Gaps = 47/959 (4%)

Query: 27  DPAKPASLTWQRKLSTGEIPLSQFTLNWKETVQLAPIGVRILCLIREEAAKGKRAFIDPF 86
           D A P    W+R+L++    L +F++ + E +++  +G+R+   IREEA++G++A IDPF
Sbjct: 31  DSAAPPEQAWRRRLNSHANILKEFSVTFTEAIKMIRLGIRLWSYIREEASQGRKAPIDPF 90

Query: 87  IKRHL--TSSHGVPLGGVGSGSIGRSYRGEFQRWQIFPRVCEDKPVLANQFSVFVSRSNG 144
            +     ++S GVPLGG+GSGSI R +RGEF+ WQI P  C+  P++ANQFS+F+SR  G
Sbjct: 91  TRETCKPSASQGVPLGGMGSGSISRGFRGEFRHWQIVPGTCDASPIMANQFSIFISREGG 150

Query: 145 -QKYSSVLCPKTPEVLKDTTAAGIGSWDWNLKGDKSTYHALYPRAWTVHEGEPDPELRIV 203
            +KY+SVL P   E L  +   GI SW WNL G  STYHAL+PRAWT+++GEPDPEL++ 
Sbjct: 151 NKKYASVLAPGQHEGLGKSGDQGISSWGWNLSGQHSTYHALFPRAWTIYDGEPDPELKVS 210

Query: 204 CRQISPIIPHNYKESSYPVSVFTYTIYNSGKTSADITLLFTWTNSVGGDSEFTGQHYNSK 263
           CRQISP IPHNY++SS P +VF YT+ N+GK  A ++LLFTW NS+GG S  +G H N  
Sbjct: 211 CRQISPFIPHNYRDSSLPTAVFVYTLVNTGKERAKVSLLFTWANSIGGISHLSGDHVNEP 270

Query: 264 TKMNDGVHAVLLHHRTSHQLPPVTFALAAQETDGVHVSLCPHFVISGNSLGLTAKDMWHE 323
               DGV  VLLHH+T+ + PPVTFA+AA ET  V V++ P F +S  S  +TAKDMW +
Sbjct: 271 FIGEDGVSGVLLHHKTAKENPPVTFAIAACETQNVSVTVLPSFGLSEGS-HITAKDMWGK 329

Query: 324 IKEHGSFDRLNSME-TSVTSEPGSSIGAAIAASVTVPPDSEGQVTFSLAWDCPEVNFMSG 382
           + + G FDR N     S+ S PG ++ AA++AS  V P  +  V F+LAW  P+V F+ G
Sbjct: 330 MVQDGQFDRENCYSGRSMPSSPGETLCAAVSASAWVEPHGKCTVAFALAWSSPKVKFLKG 389

Query: 383 KTYYRRYTKFYGTHQNAAANIARDAILEHGSWELQIEAWQRPILEDKRLPEWYPITLFNE 442
            +Y+RRYTK+YGT + AA NI  DA+  +  WE +IE WQ PIL D RLPEWY  TLFNE
Sbjct: 390 SSYHRRYTKYYGTSERAALNIVHDALTNYKQWEEEIEKWQSPILRDDRLPEWYKFTLFNE 449

Query: 443 LYYLNAGGAVWTDGSPP-------VHSLVTIGHRKFSLDWSQSDLKRIVDVPNQNDTAVN 495
           LY+L AGG VW D S P       +H    + +   ++  ++ + +R   V N      +
Sbjct: 450 LYFLVAGGTVWIDSSLPATSSKNSLHQSAAVENTNVNVTVAKGNSRRGAAVENSVTDGYD 509

Query: 496 ILER--MSSILEQIYT----------PVALNS--SFGTNLL---QDGEENIGQFLYLEGI 538
            + R  +    E+I+T          P   NS  S   + L   QD  +++G+FLYLEG+
Sbjct: 510 AISRKGLEYDEEEIHTRNTCEEKPVIPQESNSHHSIHKDTLKDPQDETDDVGRFLYLEGV 569

Query: 539 EYLMWNTYDVHFYSSFALIMLFPKIQLSIQRDFAAAVMMHDPSKMKLLDDGQWVSRKVLG 598
           EY+MW TYDVHFY+SFAL+ LFPKI+LSIQR+FA AV+  D  ++K L +G W  RKV G
Sbjct: 570 EYIMWCTYDVHFYASFALLELFPKIELSIQREFAKAVLSEDGRRVKFLAEGNWGIRKVRG 629

Query: 599 AVPHDIGICDPWFEVNAYCLYDTARWKDLNPKFVLQVYRDVVATGDKKFAKAVWPSVYVA 658
           AVPHD+G  DPW E+NAY ++DT++WKDLNPKFVLQVYRD  AT D  F   VWP+V  A
Sbjct: 630 AVPHDLGTHDPWHEMNAYNIHDTSQWKDLNPKFVLQVYRDFAATRDFSFGADVWPAVRAA 689

Query: 659 MAYMDQFDRDGDGMIENDGFPDQTYDTWSVSGISAYSGGLWVAALQAASALAREVGDRGS 718
           M YM+QFDRD DG+IENDGFPDQTYDTW+V GISAY G LW+AALQAA+A+A ++GD+  
Sbjct: 690 MEYMEQFDRDSDGLIENDGFPDQTYDTWTVHGISAYCGCLWLAALQAAAAMALQLGDKPF 749

Query: 719 EDYFLFKFQKAKVVY-EKLWNGSYFNYDNSGSSQSSSIQADQLAGQWYARACGLLPIVDE 777
            +    KF KAK+V+ EKLWNGSYFNYD+  SS S SIQADQLAGQWY  + GL  + D+
Sbjct: 750 AEKCKSKFFKAKLVFEEKLWNGSYFNYDSGSSSNSKSIQADQLAGQWYTASSGLPSLFDD 809

Query: 778 DKARSALKKVYNYNVLKVMGGKRGAVNGMLPDGRVDMSSMQSREIWSGVTYAVAASMIHE 837
            K +S+L K+Y++NV+KV GGK GAVNGM P+G+VD S MQSREIW+GVTY VAA+MI  
Sbjct: 810 YKIKSSLHKIYDFNVMKVKGGKMGAVNGMHPNGKVDESCMQSREIWTGVTYGVAATMILS 869

Query: 838 DLADIGFQTACGIYEAAWSGTGLGYAFQTPEAWNTDDQYRSLCYMRPLAIWAMQWALTRP 897
            + +  F TA GI+ A WS  G GY FQTPE W  D  +RSL YMRPLAIW MQWAL+ P
Sbjct: 870 GMEEQAFTTAEGIFTAGWSEEGYGYWFQTPEGWTIDGHFRSLIYMRPLAIWGMQWALSMP 929

Query: 898 KPKTLEKQMKPEVTDESLLRYHAGFSKVARLLKLPEEQG-------AKSLLQSLFDHTC 949
           +           + D   + +           +LP E G       AK    S+F  +C
Sbjct: 930 RA----------ILDAPTINFMERIHVSPHNARLPHETGVRKIATKAKCFGNSVFHCSC 978


>gi|168014916|ref|XP_001759997.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162688747|gb|EDQ75122.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 970

 Score =  926 bits (2392), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 496/930 (53%), Positives = 626/930 (67%), Gaps = 49/930 (5%)

Query: 27  DPAKPASLTWQRKLSTGEIPLSQFTLNWKETVQLAPIGVRILCLIREEAAKGKRAFIDPF 86
           DP  P    W+RKL++    L +F + + E  ++  +G+R+   ++ E ++G+   IDPF
Sbjct: 46  DPGAPPHQAWRRKLNSHAEVLKEFRVTFMEGFKMLRLGMRMWTYVQSEKSEGRLPPIDPF 105

Query: 87  IKRHLTSS-HGVPLGGVGSGSIGRSYRGEFQRWQIFPRVCEDKPVLANQFSVFVSRSNGQ 145
            +    S+ HGVP GG+G GSIGR +RGEF+RWQI P VC++ PVLANQFSV VS S G+
Sbjct: 106 RRETKPSACHGVPCGGMGGGSIGRGFRGEFRRWQIVPGVCDEAPVLANQFSVHVSLSGGE 165

Query: 146 K-YSSVLCPKTPEVLK--DTTAAGIGSWDWNLKGDKSTYHALYPRAWTVHEGEPDPELRI 202
           K  SSVL P  P  LK       GI SWDW L G  +TYHAL+PRAWT++EGEPDP L+I
Sbjct: 166 KAVSSVLYPGRPRELKGLKNDENGISSWDWKLDGQNTTYHALFPRAWTIYEGEPDPALKI 225

Query: 203 VCRQISPIIPHNYKESSYPVSVFTYTIYNSGKTSADITLLFTWTNSVGGDSEFTGQHYNS 262
            CRQISP IPHNYKESSYP  VF+Y   N+GK  AD+ LLFTW NS+G  +  TG H N 
Sbjct: 226 SCRQISPFIPHNYKESSYPACVFSYVAVNTGKEVADVNLLFTWANSIGSTTGSTGGHINE 285

Query: 263 K--TKMNDGVHAVLLHHRTSHQLPPVTFALAAQETDGVHVSLCPHFVISGNSLGLTAKDM 320
              T +  G               P+TFA+AA+++  V +S CP F +SG    +TAKD+
Sbjct: 286 TFTTSVKGGQ--------------PLTFAIAAKQSPDVSLSACPCFQVSGEGQSITAKDL 331

Query: 321 WHEIKEHGSF--DRLNSMETSVTSEPGSSIGAAIAASVTVPPDSEGQVTFSLAWDCPEVN 378
           W  +KEHG    D  N++ TS+ S+PGS IGAAIAA V +PP+ +  + F LAWD P V 
Sbjct: 332 WGRMKEHGRLEVDDWNTINTSLPSKPGSVIGAAIAAHVVIPPNEKRTIDFVLAWDSPTVK 391

Query: 379 FMSGKTYYRRYTKFYGTHQNAAANIARDAILEHGSWELQIEAWQRPILEDKRLPEWYPIT 438
           F+ G++Y RRYT FY    NAA  IA +A+  +  WE +IE WQRPILED++LPEWY +T
Sbjct: 392 FLKGRSYLRRYTTFYDADGNAAPRIAHNALCGYRLWEEEIEKWQRPILEDEKLPEWYRVT 451

Query: 439 LFNELYYLNAGGAVWTDGSPPVHSLVTIGHRKFSLDWSQSDLKRIVDVPNQNDTAVNILE 498
           LFNELYYL AGG VWT  S    +  +  H   SL        ++V    + D  +  L 
Sbjct: 452 LFNELYYLVAGGTVWTASSANTSAPGSPLHGDLSLG------GKVVPFSTRRDIDLAALR 505

Query: 499 RMSSILEQIYTPVALN-SSF------GTNLLQDGEENIGQFLYLEGIEYLMWNTYDVHFY 551
           R     + +      N SSF      G  LLQ  EEN+GQFLYLEGIEY+MWNTYDVHFY
Sbjct: 506 RDCKDGDVVAALEGSNKSSFNSRLPSGLALLQAEEENVGQFLYLEGIEYIMWNTYDVHFY 565

Query: 552 SSFALIMLFPKIQLSIQRDFAAAVMMHDPSKMKLLDDGQWVSRKVLGAVPHDIGICDPWF 611
           +SFAL+ LFPK++L+IQRDFAA  + HDP + K +  G+   +KV G+VPHD+G  DPW 
Sbjct: 566 ASFALLSLFPKLELAIQRDFAAGCLAHDPRRTKFMAGGKKGIKKVFGSVPHDLGQHDPWA 625

Query: 612 EVNAYCLYDTARWKDLNPKFVLQVYRDVVATGDKKFAKAVWPSVYVAMAYMDQFDRDGDG 671
           EVNAY ++DT+RWKDLN KFVLQVYRDVVATGDK FAKAVWP+VY AMAY+DQFDRD DG
Sbjct: 626 EVNAYNIHDTSRWKDLNAKFVLQVYRDVVATGDKVFAKAVWPAVYAAMAYLDQFDRDRDG 685

Query: 672 MIENDGFPDQTYDTWSVSGISAYSGGLWVAALQAASALAREVGDRGSEDYFLFKFQKAKV 731
           +IENDGFPDQTYDTW+V G+SAY GGLW+AALQAA A+A  + ++ +  YF  KF +A+ 
Sbjct: 686 LIENDGFPDQTYDTWTVHGVSAYCGGLWIAALQAAEAMAVLLEEKNAARYFQSKFIQARE 745

Query: 732 VY-EKLWNGSYFNYDNSGSSQSSSIQADQLAGQWYARACGLLPIVDEDKARSALKKVYNY 790
            Y +KLWNGSYF+YD+  SS S SIQADQ+AGQWYA A GL P+ D+ KARSAL+K++++
Sbjct: 746 AYLKKLWNGSYFSYDSGNSSNSKSIQADQMAGQWYAWASGLPPLFDDAKARSALQKIFDF 805

Query: 791 NVLKVMGGKRGAVNGMLPDGRVDMSSMQSREIWSGVTYAVAASMIHEDLADIGFQTACGI 850
           NV+KV  G  GAVNGM P+G+VD + MQSRE+W+GVTYA+AA+MIHE + D  F+ A G+
Sbjct: 806 NVMKVKNGHLGAVNGMHPNGKVDETCMQSREVWTGVTYALAAAMIHEGMLDQAFRAAEGV 865

Query: 851 YEAAWSGTGLGYAFQTPEAWNTDDQYRSLCYMRPLAIWAMQWALTRPK-----PKTLEKQ 905
           + A WS   LGY FQTPEAW  D  +RSL YMRPLAIWAMQWA+  P+     P+     
Sbjct: 866 FIAGWS--DLGYWFQTPEAWTMDGYFRSLAYMRPLAIWAMQWAIDPPETIANAPRIPTVD 923

Query: 906 MKPEVTDESLLRYHAGFSKVARLLKLPEEQ 935
             P   D S      GFS +   L+ P  Q
Sbjct: 924 RLPSECDTS------GFSSMQSTLRNPLPQ 947


>gi|413941636|gb|AFW74285.1| hypothetical protein ZEAMMB73_466373 [Zea mays]
          Length = 966

 Score =  925 bits (2390), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 507/965 (52%), Positives = 649/965 (67%), Gaps = 78/965 (8%)

Query: 31  PASLTWQRKLSTGEIPLSQFTLNWKETVQLAPIGVRILCLIREEAAKGKRAFIDPFIKRH 90
           PAS TWQR+       ++ F++   + V+L    V +  L+R+        ++D      
Sbjct: 22  PASRTWQRRFDDEGKNVAMFSMTMSDLVEL----VAVYDLLRK--------WMD------ 63

Query: 91  LTSSHGVPLGGVGSGSIGRSYRGEFQRWQIFPRVCEDKPVLANQFSVFVSRSNGQKYSSV 150
                GVP+GG+GSGSIGRSYRG FQ++QIFP   E+KPVLANQFS FVSR NG KYS+V
Sbjct: 64  -NCYRGVPVGGIGSGSIGRSYRGYFQQFQIFPVTNEEKPVLANQFSAFVSRPNGNKYSTV 122

Query: 151 LCPKTPEVLKDTTAAGIGSWDWNLKGDKSTYHALYPRAWTVHEGEPDPELRIVCRQISPI 210
           L   T ++LK    AGIGSWDW LK D+ TYHAL+PR+WTV++GEPDPE++I CRQISP 
Sbjct: 123 LSAPTADLLKGVGKAGIGSWDWRLKEDRCTYHALFPRSWTVYDGEPDPEIKITCRQISPF 182

Query: 211 IPHNYKESSYPVSVFTYTI-------YNSG--------KTSADITLLFTWTNSVG----- 250
           IPHNY+ESS+P +VFT+T+       + +G        +  A + L  + T SV      
Sbjct: 183 IPHNYRESSFPAAVFTFTVQSVLSVLFKAGEQLWELTCRCDAALHLGRSTTASVAFSSIT 242

Query: 251 GDSEFTGQHYNSKTKMNDGVHAVLLHH-------RTSHQLPPVTFALAAQETDGVHVSLC 303
           G S    +           V+  +L         RT+   PPVTFA+A++ETDGV VS+C
Sbjct: 243 GGSGLIFRGARCYRPCRWLVYTPVLETDADRRQCRTAGGQPPVTFAIASRETDGVRVSVC 302

Query: 304 PHFVISG----NSLGLTAKDMWHEIKEHGSFDRLNSMETSVTSEPGSS----IGAAIAAS 355
           P F I+     +S   TA  MW EIK+HG+F    +   +      S     +GAA+AAS
Sbjct: 303 PSFTIASGGPSSSGEPTAAAMWDEIKKHGAFGHAGAGNGNAARAAASKAGSSVGAAVAAS 362

Query: 356 VTVPPDSEGQVTFSLAWDCPEVNFMSGKTYYRRYTKFYGTHQNAAA-NIARDAILEHGSW 414
             VP  +   V+FSLAW CPEV F +G TYYRRYT+FYGT  +AAA ++A DA+LEH +W
Sbjct: 363 TAVPAGATRVVSFSLAWACPEVKFPAGSTYYRRYTRFYGTDADAAAEHLAHDALLEHMNW 422

Query: 415 ELQIEAWQRPILEDKRLPEWYPITLFNELYYLNAGGAVWTDGSPPVHSLVTIGHRKFSLD 474
           E QIE WQ+PIL D+RLPEWYP+ LFNELYYLNAGG +WT     V  ++T G       
Sbjct: 423 ESQIEEWQKPILHDQRLPEWYPVALFNELYYLNAGGTIWT-----VEEMLTDGQAPKRAG 477

Query: 475 WSQS-------DLKRIVDVPNQNDTAVNILERMSSILEQIYTPVALNSSFGTNLLQDGEE 527
           ++ S        +  ++ V         ++  ++S  E+ ++  A    FG  LL+ GE+
Sbjct: 478 FASSLTATEPFSIDTLLPVAAGGSAVDGVVRAVASATERSHSAAA----FGAALLRGGED 533

Query: 528 NIGQFLYLEGIEYLMWNTYDVHFYSSFALIMLFPKIQLSIQRDFAAAVMMHDPSKMKLLD 587
           N+GQFLYLEG+EY M+NTYDVHFY+SFAL+ LFP+++LS+QRDFA AV++HDP   + LD
Sbjct: 534 NVGQFLYLEGMEYNMYNTYDVHFYASFALLSLFPRLELSLQRDFARAVLLHDPRPRRTLD 593

Query: 588 DGQWVSRKVLGAVPHDIGICDPWFEVNAYCLYDTARWKDLNPKFVLQVYRDVVATGDKKF 647
            G+ V+RKVLGAVPHD+G+ DPW E+NAY L+D ARWKDLNPKFVLQVYRD VATGD  F
Sbjct: 594 -GRTVARKVLGAVPHDVGLNDPWVELNAYMLHDAARWKDLNPKFVLQVYRDAVATGDAAF 652

Query: 648 AKAVWPSVYVAMAYMDQFDRDGDGMIENDGFPDQTYDTWSVSGISAYSGGLWVAALQAAS 707
           A+AVWP+VYVAMAYMDQFDRDGDGM+EN+G PDQTYD WSVSG+SAY+GG+WVAALQAA+
Sbjct: 653 AEAVWPAVYVAMAYMDQFDRDGDGMVENEGVPDQTYDMWSVSGVSAYTGGVWVAALQAAA 712

Query: 708 ALAREVGDRGSEDYFLFKFQKAKVVYE-KLWNGSYFNYDNSGSSQSS-SIQADQLAGQWY 765
            +AR VGDR +E YF  +  +A  VY+ +LW G+YF YDNSG   +S SI ADQLAGQWY
Sbjct: 713 GMARVVGDRPAEAYFRARRDRAARVYDAELWTGAYFRYDNSGGGGASESIMADQLAGQWY 772

Query: 766 ARACGLLPIVDEDKARSALKKVYNYNVLKVMGGKRGAVNGMLPDGRVDMSSMQSREIWSG 825
           ARACGL P+V++ KARSAL  V +YNV++V GG  GAVNGM PDG VD SS QS+E+W G
Sbjct: 773 ARACGLEPVVEDAKARSALATVLDYNVMRVKGGAIGAVNGMRPDGAVDASSTQSKEVWPG 832

Query: 826 VTYAVAASMIHEDLADIGFQTACGIYEAAWSGTGLGYAFQTPEAWNTDDQYRSLCYMRPL 885
           VTYAVAA+M+HE + +  F+TA G ++AAW   G GYAFQTPEAW  D  YRSL YMRPL
Sbjct: 833 VTYAVAAAMVHEGMTEAAFRTAKGAHDAAWGNDGFGYAFQTPEAWTEDGGYRSLHYMRPL 892

Query: 886 AIWAMQWALTRPK----PKTLEKQMKPEVTDESLLRYHAGFSKVARLLKLPEEQGAKSLL 941
            IWAMQWAL+ P+     +T E +   E +        A F KVA +L+LPEEQ  K  L
Sbjct: 893 GIWAMQWALSPPELHRDLRTPEARSSEEPSPADAALGQAQFDKVASMLRLPEEQQPKGYL 952

Query: 942 QSLFD 946
            ++++
Sbjct: 953 WAIYN 957


>gi|356576421|ref|XP_003556330.1| PREDICTED: non-lysosomal glucosylceramidase-like [Glycine max]
          Length = 950

 Score =  920 bits (2379), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 472/885 (53%), Positives = 618/885 (69%), Gaps = 27/885 (3%)

Query: 27  DPAKPASLTWQRKLSTGEIPLSQFTLNWKETVQLAPIGVRILCLIREEAAKGKRAFIDPF 86
           D + P    W+R+L++    L +F + + E +++  +G+RI   +REEA+ G++A IDPF
Sbjct: 31  DSSAPPEQAWRRRLNSHANLLKEFRVTFMEAIKMVRLGIRIWSYVREEASHGRKAPIDPF 90

Query: 87  IKRHL--TSSHGVPLGGVGSGSIGRSYRGEFQRWQIFPRVCEDKPVLANQFSVFVSRSNG 144
            +     ++S GVPLGG+GSGSI R +RGEF++WQI P +CE  PV+ANQFS+F+SR  G
Sbjct: 91  TRESCKPSASQGVPLGGMGSGSISRGFRGEFRQWQIIPSLCEASPVMANQFSIFISREGG 150

Query: 145 QK-YSSVLCPKTPEVLKDTTAAGIGSWDWNLKGDKSTYHALYPRAWTVHEGEPDPELRIV 203
            K ++SVL P   E L      GI SW WNL G  STYHAL+PRAWTV++GEPDPEL+I 
Sbjct: 151 NKNFASVLAPGQHEGLGKPDDQGISSWGWNLSGQHSTYHALFPRAWTVYDGEPDPELKIS 210

Query: 204 CRQISPIIPHNYKESSYPVSVFTYTIYNSGKTSADITLLFTWTNSVGGDSEFTGQHYNSK 263
           CRQISP +PHNY+ESS P +VF YT+ N+GK  A ++LLFTW NS+GG S  +G H N  
Sbjct: 211 CRQISPFVPHNYRESSLPAAVFVYTLVNTGKERAKVSLLFTWANSIGGSSHLSGDHVNEP 270

Query: 264 TKMNDGVHAVLLHHRTSHQLPPVTFALAAQETDGVHVSLCPHFVISGNSLGLTAKDMWHE 323
            K  DGV  VLL+H+T+   PPVTFA+AA ET  V+VS+ P F +S  S   TAK MW +
Sbjct: 271 FKAEDGVSGVLLYHKTAKGNPPVTFAIAACETQNVNVSVLPSFGLSEGS-STTAKGMWSK 329

Query: 324 IKEHGSFDRLN-SMETSVTSEPGSSIGAAIAASVTVPPDSEGQVTFSLAWDCPEVNFMSG 382
           + + G FD+ N +   S+ S PG ++ AA+AAS+ V P  +  V FSLAW  P+V F+ G
Sbjct: 330 MVKDGQFDQENFNSGPSMPSSPGETLCAAVAASMWVEPHGKCTVAFSLAWSSPKVKFVKG 389

Query: 383 KTYYRRYTKFYGTHQNAAANIARDAILEHGSWELQIEAWQRPILEDKRLPEWYPITLFNE 442
            T+ RRYTKFYGT + AAA++A DA+  +  WE +IE WQ PIL+D+ LPEWY  TLFNE
Sbjct: 390 STFNRRYTKFYGTSEKAAADLAHDALTHYNRWEEEIEKWQNPILKDETLPEWYKFTLFNE 449

Query: 443 LYYLNAGGAVWTDGSPPVHSLVTIGHRKFSLDWSQSDLKRIVDVPNQ--NDTAVNILERM 500
           LY+L AGG +W D SP + S +            ++D  R+ ++ N    +T   + +R 
Sbjct: 450 LYFLVAGGTIWID-SPLLSSNM------------RNDQDRVRELENTVVKETEDKMSDRK 496

Query: 501 SSILEQIYTPVALNSSF------GTNLLQDGEENIGQFLYLEGIEYLMWNTYDVHFYSSF 554
            +++E+I      ++           L  D + ++G+FLYLEG+EY+MW TYDVHFY+SF
Sbjct: 497 RTVVERIMDSTCDSAVITGHDPADEKLSGDDDADVGRFLYLEGVEYIMWCTYDVHFYASF 556

Query: 555 ALIMLFPKIQLSIQRDFAAAVMMHDPSKMKLLDDGQWVSRKVLGAVPHDIGICDPWFEVN 614
           AL+ LFPKI+L+IQRDFA AV+  D  K+K L +G W  RKV GAVPHD+G  DPW E+N
Sbjct: 557 ALLELFPKIELNIQRDFARAVLCEDGRKVKFLAEGNWGIRKVYGAVPHDLGTHDPWHEMN 616

Query: 615 AYCLYDTARWKDLNPKFVLQVYRDVVATGDKKFAKAVWPSVYVAMAYMDQFDRDGDGMIE 674
           AY ++DT++WKDLNPKFVLQVYRD   TGD +F   VWP+V  AM YM+QFDRDGDG+IE
Sbjct: 617 AYNIHDTSKWKDLNPKFVLQVYRDFATTGDLEFGVDVWPAVRAAMEYMEQFDRDGDGLIE 676

Query: 675 NDGFPDQTYDTWSVSGISAYSGGLWVAALQAASALAREVGDRGSEDYFLFKFQKAKVVY- 733
           NDGFPDQTYDTW+V G+S Y G LW+AALQAA+A+A ++GDR   +    KF KAK  + 
Sbjct: 677 NDGFPDQTYDTWTVHGVSTYCGCLWLAALQAAAAMALDLGDREFAEKCKRKFLKAKPAFE 736

Query: 734 EKLWNGSYFNYDNSGSSQSSSIQADQLAGQWYARACGLLPIVDEDKARSALKKVYNYNVL 793
           EKLWNG+YFNYD+  S  S SIQADQLAGQWY  + GL P+ ++ K +SAL+KVY++NV+
Sbjct: 737 EKLWNGTYFNYDSGSSGNSKSIQADQLAGQWYTASSGLPPLFEDSKIKSALRKVYDFNVM 796

Query: 794 KVMGGKRGAVNGMLPDGRVDMSSMQSREIWSGVTYAVAASMIHEDLADIGFQTACGIYEA 853
           KV GG+ GAVNGM P+G+VD + MQSRE+W+GVTY +AA+MI   + +  F TA GI+ A
Sbjct: 797 KVKGGRMGAVNGMHPNGKVDETCMQSREVWTGVTYGLAATMILAGMEEEAFATAEGIFLA 856

Query: 854 AWSGTGLGYAFQTPEAWNTDDQYRSLCYMRPLAIWAMQWALTRPK 898
            WS  G GY FQTPEAW  D  YRSL YMRPLAIW MQ+A+ RPK
Sbjct: 857 GWSEDGYGYWFQTPEAWTMDGHYRSLMYMRPLAIWGMQYAINRPK 901


>gi|297831192|ref|XP_002883478.1| hypothetical protein ARALYDRAFT_479911 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297329318|gb|EFH59737.1| hypothetical protein ARALYDRAFT_479911 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 950

 Score =  919 bits (2374), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 470/883 (53%), Positives = 609/883 (68%), Gaps = 20/883 (2%)

Query: 27  DPAKPASLTWQRKLSTGEIPLSQFTLNWKETVQLAPIGVRILCLIREEAAKGKRAFIDPF 86
           D A P    W+R+L+     L +FT+ ++E +++  +G+R+   +REEA+ G++A IDPF
Sbjct: 31  DCAAPPPHAWRRRLNCHANILKEFTITFREAIKMVRLGIRLWSYVREEASHGRKAPIDPF 90

Query: 87  IKRHL--TSSHGVPLGGVGSGSIGRSYRGEFQRWQIFPRVCEDKPVLANQFSVFVSRSNG 144
            K +   ++S GVPLGG+GSGSI R +RGEF++WQI P  C+  P+++NQFS+F+SR  G
Sbjct: 91  TKENCKPSASQGVPLGGMGSGSISRGFRGEFKQWQITPGTCDPSPMMSNQFSIFISRDGG 150

Query: 145 -QKYSSVLCPKTPEVLKDTTAAGIGSWDWNLKGDKSTYHALYPRAWTVHEGEPDPELRIV 203
            +KY+SVL P     L  +   G+ SW WNL G  STYHAL+PRAWT+++GEPDPEL+I 
Sbjct: 151 HKKYASVLSPGQHGSLGKSRDKGLSSWGWNLNGQHSTYHALFPRAWTIYDGEPDPELKIS 210

Query: 204 CRQISPIIPHNYKESSYPVSVFTYTIYNSGKTSADITLLFTWTNSVGGDSEFTGQHYNSK 263
           CRQISP IP+NY++SS P +VF YT+ N+GK  A ++LLFTW NS+GG S  +G H N  
Sbjct: 211 CRQISPFIPNNYRDSSLPAAVFVYTLVNTGKERAKVSLLFTWANSMGGTSHMSGGHVNEP 270

Query: 264 TKMNDGVHAVLLHHRTSHQLPPVTFALAAQETDGVHVSLCPHFVISGNSLGLTAKDMWHE 323
               DGV  VLLHH+T    PPVTFA+AA ET  V+V++ P F +S +S   TAKDMW  
Sbjct: 271 FIGEDGVSGVLLHHKTGKGNPPVTFAIAASETQNVNVTVLPCFGLSEDS-SFTAKDMWDT 329

Query: 324 IKEHGSFDRLN-SMETSVTSEPGSSIGAAIAASVTVPPDSEGQVTFSLAWDCPEVNFMSG 382
           +++ G FD+ N +   S  S  G +I AA++AS  V    +  V+F+L+W  P+V F  G
Sbjct: 330 VEKDGKFDQENFNSGPSTPSLAGDTICAAVSASAWVEAHGKCTVSFALSWSSPKVKFSKG 389

Query: 383 KTYYRRYTKFYGTHQNAAANIARDAILEHGSWELQIEAWQRPILEDKRLPEWYPITLFNE 442
            TY RRYTKFYGT   AA ++  DA+  +  WE  IEAWQ PIL D+RLPEWY  TLFNE
Sbjct: 390 STYDRRYTKFYGTSPRAALDLVHDALTNYKRWEEDIEAWQNPILRDERLPEWYKFTLFNE 449

Query: 443 LYYLNAGGAVWTDGSPPVHSLVTIGHRKFSLDWSQS-----DLK-RIVDVPNQNDTAVNI 496
           LY+L AGG VW D S  +++     H++  L  S       D+  +  D+ N N   V  
Sbjct: 450 LYFLVAGGTVWIDSSS-LNANGNSQHQQSGLGNSDGMAGGLDINDQRNDLANGNSVGVKS 508

Query: 497 LERMSSILEQIYTPVALNSSFGTNLLQDGEENIGQFLYLEGIEYLMWNTYDVHFYSSFAL 556
            + +S+I  +       N  F      D  +++G+FLYLEG+EY+MW TYDVHFY+S+AL
Sbjct: 509 NDEVSAIHNR-------NGLFVDTRHVDDGDDVGRFLYLEGVEYVMWCTYDVHFYASYAL 561

Query: 557 IMLFPKIQLSIQRDFAAAVMMHDPSKMKLLDDGQWVSRKVLGAVPHDIGICDPWFEVNAY 616
           +MLFPKI+L+IQRDFA AV+  D  K+K L +G    RKV GAVPHD+G+ DPW E+NAY
Sbjct: 562 LMLFPKIELNIQRDFAKAVLSEDGRKVKFLAEGNVGIRKVRGAVPHDLGMHDPWNEMNAY 621

Query: 617 CLYDTARWKDLNPKFVLQVYRDVVATGDKKFAKAVWPSVYVAMAYMDQFDRDGDGMIEND 676
            ++DT++WKDLNPKFVLQVYRD  ATGD +F   VWP+V  AM YM+QFDRD D +IEND
Sbjct: 622 NIHDTSKWKDLNPKFVLQVYRDFAATGDYQFGIDVWPAVRAAMEYMEQFDRDNDDLIEND 681

Query: 677 GFPDQTYDTWSVSGISAYSGGLWVAALQAASALAREVGDRGSEDYFLFKFQKAKVVYE-K 735
           GFPDQTYDTW+V G+SAY G LW+AALQAA+A+A ++GD+   +    KF  AK   E K
Sbjct: 682 GFPDQTYDTWTVHGVSAYCGCLWLAALQAAAAMALQIGDKFFAELCKNKFLNAKAALETK 741

Query: 736 LWNGSYFNYDNSGSSQSSSIQADQLAGQWYARACGLLPIVDEDKARSALKKVYNYNVLKV 795
           LWNGSYFNYD+  SS S SIQ DQLAGQWYA + GL PI +E K RS ++K++++NV+K 
Sbjct: 742 LWNGSYFNYDSGASSNSKSIQTDQLAGQWYAASSGLPPIFEESKIRSTMQKIFDFNVMKT 801

Query: 796 MGGKRGAVNGMLPDGRVDMSSMQSREIWSGVTYAVAASMIHEDLADIGFQTACGIYEAAW 855
            GGK GAVNGM PDG+VD + MQSREIW+GVTYA AA+MI   + + GF TA GI+ A W
Sbjct: 802 KGGKMGAVNGMHPDGKVDDTCMQSREIWTGVTYAAAATMILSGMEEQGFTTAEGIFTAGW 861

Query: 856 SGTGLGYAFQTPEAWNTDDQYRSLCYMRPLAIWAMQWALTRPK 898
           S  G GY FQTPE W  D  YRSL YMRPLAIW MQWAL+ PK
Sbjct: 862 SEEGFGYWFQTPEGWTMDGHYRSLIYMRPLAIWGMQWALSLPK 904


>gi|255554158|ref|XP_002518119.1| conserved hypothetical protein [Ricinus communis]
 gi|223542715|gb|EEF44252.1| conserved hypothetical protein [Ricinus communis]
          Length = 968

 Score =  919 bits (2374), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 485/921 (52%), Positives = 623/921 (67%), Gaps = 32/921 (3%)

Query: 27  DPAKPASLTWQRKLSTGEIPLSQFTLNWKETVQLAPIGVRILCLIREEAAKGKRAFIDPF 86
           D A P    W+R+L++    L +F++ + E +++  +G+R+   +REEA+ G++A IDPF
Sbjct: 31  DSAAPPKHAWRRRLNSHANILKEFSVTFTEAIKMVRLGIRLWSYVREEASHGRKAPIDPF 90

Query: 87  IKRHL--TSSHGVPLGGVGSGSIGRSYRGEFQRWQIFPRVCEDKPVLANQFSVFVSRSNG 144
            +     ++S GVPLGG+GSGSI R +RGEF++WQI P +CE  PV+ANQFS+F+SR  G
Sbjct: 91  TRESCKPSASQGVPLGGMGSGSISRGFRGEFRQWQIVPSICEVSPVMANQFSIFISRDGG 150

Query: 145 -QKYSSVLCPKTPEVLKDTTAAGIGSWDWNLKGDKSTYHALYPRAWTVHEGEPDPELRIV 203
            +KY+SVL P   E L      GI SW WNL G  STYHAL+PRAWT+++GEPDPEL+I 
Sbjct: 151 TKKYASVLAPGQHEGLGKDGDQGISSWGWNLSGQHSTYHALFPRAWTIYDGEPDPELKIS 210

Query: 204 CRQISPIIPHNYKESSYPVSVFTYTIYNSGKTSADITLLFTWTNSVGGDSEFTGQHYNSK 263
           CRQISP IPHNY++SS P +VF YT+ NSGK  A ++LLFTW NS+GG S  +G H N  
Sbjct: 211 CRQISPFIPHNYRDSSLPTAVFVYTLVNSGKERAKVSLLFTWANSIGGVSHLSGDHVNEP 270

Query: 264 TKMNDGVHAVLLHHRTSHQLPPVTFALAAQETDGVHVSLCPHFVISGNSLGLTAKDMWHE 323
               DGV  VLLHH+T+   PPVTFA+AA ET  V V++ P F +S  S  +TAKDMW +
Sbjct: 271 FIGEDGVSGVLLHHKTAKGNPPVTFAIAACETQNVSVTVLPSFGLSEES-HITAKDMWSK 329

Query: 324 IKEHGSFDRLN-SMETSVTSEPGSSIGAAIAASVTVPPDSEGQVTFSLAWDCPEVNFMSG 382
           + + G FDR N     ++ S PG ++ AA++AS  V P  +  V F+L+W  P++ F  G
Sbjct: 330 MVQDGQFDRENFDCGPTMPSSPGETLCAAVSASAWVEPHGKCTVAFALSWSSPKIKFSKG 389

Query: 383 KTYYRRYTKFYGTHQNAAANIARDAILEHGSWELQIEAWQRPILEDKRLPEWYPITLFNE 442
            TY+RRYTKFYGT + AA N+  DA+  +  WE +IE WQ PIL+D+RLPEWY  TLFNE
Sbjct: 390 STYHRRYTKFYGTSERAAQNLVHDALKNYKWWEEEIEKWQNPILKDERLPEWYKFTLFNE 449

Query: 443 LYYLNAGGAVWTDGSPPVHSLVTIGHRK-FSLDWSQSDLKRIVDVPNQNDT-----AVNI 496
           LY+L AGG VW D S     L+T   R+  ++D  +  + R      Q  T     A   
Sbjct: 450 LYFLVAGGTVWIDSS-----LLTEDMRETMNVDVIEVQVSRPKGAEKQIATNGYNVATIG 504

Query: 497 LERMSSILEQIY-----TPVA-----LNSSFGTNLL---QDGEENIGQFLYLEGIEYLMW 543
           LE         Y      PV+     LN S   + L   Q+  +++G+FLYLEG+EY+MW
Sbjct: 505 LEEKDGASNGNYPSKDELPVSHENGHLNHSLKLSPLMEWQNNSDDVGRFLYLEGVEYIMW 564

Query: 544 NTYDVHFYSSFALIMLFPKIQLSIQRDFAAAVMMHDPSKMKLLDDGQWVSRKVLGAVPHD 603
            TYDVHFY+SFAL+ LFPKI+L+IQRDFA AV+  D  K+K L +G    RKV GAVPHD
Sbjct: 565 CTYDVHFYASFALLELFPKIELNIQRDFAKAVLSEDGRKVKFLAEGNVGIRKVRGAVPHD 624

Query: 604 IGICDPWFEVNAYCLYDTARWKDLNPKFVLQVYRDVVATGDKKFAKAVWPSVYVAMAYMD 663
           +G  DPW E+NAY ++DT++WKDLNPKFVLQVYRD  AT D  F   VWP+V  AM YM+
Sbjct: 625 LGTHDPWNEMNAYNIHDTSKWKDLNPKFVLQVYRDFAATQDMSFGVDVWPAVRSAMEYME 684

Query: 664 QFDRDGDGMIENDGFPDQTYDTWSVSGISAYSGGLWVAALQAASALAREVGDRGSEDYFL 723
           QFDRDGD +IENDGFPDQTYD W+V G+SAY G LW+AAL+AA+A+A +VGD+   +   
Sbjct: 685 QFDRDGDALIENDGFPDQTYDAWTVHGVSAYCGCLWLAALEAAAAMALQVGDKYFAELCR 744

Query: 724 FKFQKAKVVYE-KLWNGSYFNYDNSGSSQSSSIQADQLAGQWYARACGLLPIVDEDKARS 782
            KF KAK  +E KLWNGSYFNYD+  SS S SIQADQLAGQWY  + GL P+ D+ K +S
Sbjct: 745 SKFVKAKSAFEAKLWNGSYFNYDSGSSSNSKSIQADQLAGQWYVASSGLPPLFDDSKIKS 804

Query: 783 ALKKVYNYNVLKVMGGKRGAVNGMLPDGRVDMSSMQSREIWSGVTYAVAASMIHEDLADI 842
            L+K+Y++NV+KV GG+ GAVNGM P+G+VD + MQSREIW+GVTYAVAA+MI   + D 
Sbjct: 805 TLQKIYDFNVMKVRGGRMGAVNGMHPNGKVDETCMQSREIWTGVTYAVAATMILAGMEDK 864

Query: 843 GFQTACGIYEAAWSGTGLGYAFQTPEAWNTDDQYRSLCYMRPLAIWAMQWALTRPKPKTL 902
            F  A GI+ A WS  G GY FQTPE W TD  +RSL YMRPLAIW MQWAL+   PK +
Sbjct: 865 AFAAAEGIFLAGWSEDGYGYWFQTPEGWTTDGHFRSLIYMRPLAIWGMQWALSL--PKAI 922

Query: 903 EKQMKPEVTDESLLRYHAGFS 923
            +  K  + D  LL     FS
Sbjct: 923 LEAPKINIMDRLLLSPSTRFS 943


>gi|22331303|ref|NP_189060.2| Beta-glucosidase, GBA2 type family protein [Arabidopsis thaliana]
 gi|79313355|ref|NP_001030757.1| Beta-glucosidase, GBA2 type family protein [Arabidopsis thaliana]
 gi|17529232|gb|AAL38843.1| unknown protein [Arabidopsis thaliana]
 gi|110742225|dbj|BAE99039.1| hypothetical protein [Arabidopsis thaliana]
 gi|332643347|gb|AEE76868.1| Beta-glucosidase, GBA2 type family protein [Arabidopsis thaliana]
 gi|332643348|gb|AEE76869.1| Beta-glucosidase, GBA2 type family protein [Arabidopsis thaliana]
          Length = 950

 Score =  910 bits (2352), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 469/883 (53%), Positives = 608/883 (68%), Gaps = 20/883 (2%)

Query: 27  DPAKPASLTWQRKLSTGEIPLSQFTLNWKETVQLAPIGVRILCLIREEAAKGKRAFIDPF 86
           D A P    W+R+L+     L +FT+ ++E +++  +G+R+   +REEA+ G++A IDPF
Sbjct: 31  DSAAPPPHAWRRRLNCHANILKEFTITFREAIKMVRLGIRLWSYVREEASHGRKAPIDPF 90

Query: 87  IKRHL--TSSHGVPLGGVGSGSIGRSYRGEFQRWQIFPRVCEDKPVLANQFSVFVSRSNG 144
            K +   ++S GVPLGG+GSGSI R +RGEF++WQI P  C+  P+++NQFS+F+SR  G
Sbjct: 91  TKENCKPSASQGVPLGGMGSGSISRGFRGEFKQWQITPGTCDPSPMMSNQFSIFISRDGG 150

Query: 145 -QKYSSVLCPKTPEVLKDTTAAGIGSWDWNLKGDKSTYHALYPRAWTVHEGEPDPELRIV 203
            +KY+SVL P     L  +   G+ SW WNL G  STYHAL+PRAWT+++GEPDPEL+I 
Sbjct: 151 HKKYASVLSPGQHGSLGKSRDKGLSSWGWNLNGQHSTYHALFPRAWTIYDGEPDPELKIS 210

Query: 204 CRQISPIIPHNYKESSYPVSVFTYTIYNSGKTSADITLLFTWTNSVGGDSEFTGQHYNSK 263
           CRQISP IP+NY++SS P +VF YT+ N+GK  A ++LLFTW NS+GG S  +G H N  
Sbjct: 211 CRQISPFIPNNYRDSSLPAAVFVYTLVNTGKERAKVSLLFTWANSMGGTSHMSGGHVNEP 270

Query: 264 TKMNDGVHAVLLHHRTSHQLPPVTFALAAQETDGVHVSLCPHFVISGNSLGLTAKDMWHE 323
               DGV  VLLHH+T    PPVTFA+AA ET  V+V++ P F +S +S   TAKDMW  
Sbjct: 271 FIGEDGVSGVLLHHKTGKGNPPVTFAIAASETQNVNVTVLPCFGLSEDS-SFTAKDMWDT 329

Query: 324 IKEHGSFDRLN-SMETSVTSEPGSSIGAAIAASVTVPPDSEGQVTFSLAWDCPEVNFMSG 382
           +++ G FD+ N +   S  S  G +I AA++AS  V    +  V+F+L+W  P+V F  G
Sbjct: 330 MEQDGKFDQENFNSGPSTPSLAGDTICAAVSASAWVEAHGKCTVSFALSWSSPKVKFSKG 389

Query: 383 KTYYRRYTKFYGTHQNAAANIARDAILEHGSWELQIEAWQRPILEDKRLPEWYPITLFNE 442
            TY RRYTKFYGT   AA ++  DA+  +  WE  IEAWQ PIL D+RLPEWY  TLFNE
Sbjct: 390 STYDRRYTKFYGTSPRAALDLVHDALTNYKRWEEDIEAWQNPILRDERLPEWYKFTLFNE 449

Query: 443 LYYLNAGGAVWTDGSPPVHSLVTIGHRKFSLDWSQSDLKRI------VDVPNQNDTAVNI 496
           LY+L AGG VW D S  +++     H++  L  S   +  +       D+ N N   V  
Sbjct: 450 LYFLVAGGTVWIDSSS-LNANGNSQHQQSGLGNSDGKVGGLDINDQRNDLGNGNSVGVKS 508

Query: 497 LERMSSILEQIYTPVALNSSFGTNLLQDGEENIGQFLYLEGIEYLMWNTYDVHFYSSFAL 556
            + +S+I  +       N  F      D  +++G+FLYLEG+EY+MW TYDVHFY+S+AL
Sbjct: 509 NDEVSAIHNR-------NGLFVDTPHVDDGDDVGRFLYLEGVEYVMWCTYDVHFYASYAL 561

Query: 557 IMLFPKIQLSIQRDFAAAVMMHDPSKMKLLDDGQWVSRKVLGAVPHDIGICDPWFEVNAY 616
           +MLFPKI+L+IQRDFA AV+  D  K+K L +G    RKV GAVPHD+G+ DPW E+NAY
Sbjct: 562 LMLFPKIELNIQRDFAKAVLSEDGRKVKFLAEGNVGIRKVRGAVPHDLGMHDPWNEMNAY 621

Query: 617 CLYDTARWKDLNPKFVLQVYRDVVATGDKKFAKAVWPSVYVAMAYMDQFDRDGDGMIEND 676
            ++DT+RWKDLNPKFVLQVYRD  ATGD +F   VWP+V  AM YM+QFDRD D +IEND
Sbjct: 622 NIHDTSRWKDLNPKFVLQVYRDFAATGDYQFGIDVWPAVRAAMEYMEQFDRDNDDLIEND 681

Query: 677 GFPDQTYDTWSVSGISAYSGGLWVAALQAASALAREVGDRGSEDYFLFKFQKAKVVYE-K 735
           GFPDQTYDTW+V G+SAY G LW+AALQAA+A+A ++GD+   +    KF  AK   E K
Sbjct: 682 GFPDQTYDTWTVHGVSAYCGCLWLAALQAAAAMALQIGDKFFAELCKNKFLNAKAALETK 741

Query: 736 LWNGSYFNYDNSGSSQSSSIQADQLAGQWYARACGLLPIVDEDKARSALKKVYNYNVLKV 795
           LWNGSYFNYD+  SS S SIQ DQLAGQWYA + GL P+ +E K RS ++K++++NV+K 
Sbjct: 742 LWNGSYFNYDSGSSSNSKSIQTDQLAGQWYAASSGLPPLFEESKIRSTMQKIFDFNVMKT 801

Query: 796 MGGKRGAVNGMLPDGRVDMSSMQSREIWSGVTYAVAASMIHEDLADIGFQTACGIYEAAW 855
            GGK GAVNGM PDG+VD + MQSREIW+GVTYA AA+MI   + + GF TA GI+ A W
Sbjct: 802 KGGKMGAVNGMHPDGKVDDTCMQSREIWTGVTYAAAATMILSGMEEQGFTTAEGIFTAGW 861

Query: 856 SGTGLGYAFQTPEAWNTDDQYRSLCYMRPLAIWAMQWALTRPK 898
           S  G GY FQTPE W  D  YRSL YMRPLAIW MQWAL+ PK
Sbjct: 862 SEEGFGYWFQTPEGWTMDGHYRSLIYMRPLAIWGMQWALSLPK 904


>gi|449442371|ref|XP_004138955.1| PREDICTED: non-lysosomal glucosylceramidase-like [Cucumis sativus]
 gi|449520930|ref|XP_004167485.1| PREDICTED: non-lysosomal glucosylceramidase-like [Cucumis sativus]
          Length = 993

 Score =  909 bits (2349), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 492/976 (50%), Positives = 649/976 (66%), Gaps = 66/976 (6%)

Query: 27  DPAKPASLTWQRKLSTGEIPLSQFTLNWKETVQLAPIGVRILCLIREEAAKGKRAFIDPF 86
           D A P    W+RKL+     L +F++ + E +++  +G+R+   +REEA++G++A IDPF
Sbjct: 31  DSASPPEQAWRRKLNGHANLLKEFSVTFVEAIKMVRLGIRLWSYVREEASQGRKAPIDPF 90

Query: 87  IKRHL--TSSHGVPLGGVGSGSIGRSYRGEFQRWQIFPRVCEDKPVLANQFSVFVSR-SN 143
            +     ++S GVPLGG+GSGSI R +RGEF++WQI P  CE  PV+ANQFS+FVSR   
Sbjct: 91  TRESCKPSASQGVPLGGMGSGSISRGFRGEFRQWQIIPGTCEASPVMANQFSIFVSRDGG 150

Query: 144 GQKYSSVLCPKTPEVL---KDTTAAGIGSWDWNLKGDKSTYHALYPRAWTVHEGEPDPEL 200
           G+KY+SVL P   E L   +    +GI SW WNL G  STYHAL+PRAWTV++GEPDPEL
Sbjct: 151 GKKYASVLAPGQHEGLGCCRKDGDSGISSWGWNLDGQHSTYHALFPRAWTVYDGEPDPEL 210

Query: 201 RIVCRQISPIIPHNYKESSYPVSVFTYTIYNSGKTSADITLLFTWTNSVGGDSEFTGQHY 260
           ++ CRQISP IPHNY++SS P +VF YT+ N+G+  A ++LLFTW NS+GG+S  +G H 
Sbjct: 211 KVSCRQISPFIPHNYRDSSLPTAVFVYTLVNTGRERAKVSLLFTWANSIGGNSHLSGNHV 270

Query: 261 NSKTKMNDGVHAVLLHH-----------RTSHQLPPVTFALAAQETDGVHVSLCPHFVIS 309
           N      DGV  VLLHH           RT+   PPVTFA+AA ET  V V++ P F +S
Sbjct: 271 NEPFIDEDGVSGVLLHHKQNSKDLCKCFRTAKGNPPVTFAIAACETQNVSVTVLPSFGLS 330

Query: 310 GNSLGLTAKDMWHEIKEHGSFDRLN-SMETSVTSEPGSSIGAAIAASVTVPPDSEGQVTF 368
             S  +TAKDMW ++ + G FDR N S   S+ S PG ++ AA+AAS  V P  +  V F
Sbjct: 331 EGSC-ITAKDMWDKMVQDGQFDRDNFSSGPSMPSSPGETLCAAVAASAWVEPHGKCTVAF 389

Query: 369 SLAWDCPEVNFMSGKTYYRRYTKFYGTHQNAAANIARDAILEHGSWELQIEAWQRPILED 428
           SL+W  P+V F+ G +Y+RRYTKFYGT   AA  +A DA+  +  WE +IE WQRP+L D
Sbjct: 390 SLSWSSPKVKFLKGFSYHRRYTKFYGTSGKAAQTLAHDALTNYKRWEEEIEKWQRPVLMD 449

Query: 429 KRLPEWYPITLFNELYYLNAGGAVWTDGS---------------------PPVHSLVTIG 467
           +RLPEWY  TLFNELY+L AGG VW D S                       V + V+  
Sbjct: 450 ERLPEWYKFTLFNELYFLVAGGTVWIDSSFVGKKASYDQDHLARLKNDDVKAVEAKVSGR 509

Query: 468 HRKFSLDWSQSDLKRI--VDVPNQNDTAVNIL--ERMSSILEQIYTPVALNSSFGT-NLL 522
             + S   + + L     ++  ++N T+ +    + +   L++ YT    + S+ T  +L
Sbjct: 510 GEEVSRTTTTTTLDGFPSIEYDDENSTSSSHASEDELMVPLKRGYT----DRSYQTYKVL 565

Query: 523 QDG--EENIGQFLYLEGIEYLMWNTYDVHFYSSFALIMLFPKIQLSIQRDFAAAVMMHDP 580
           + G  EE++G+FLYLEG+EY+MW TYDVHFY+S+AL+ LFPKI+L+IQRDFA AV+  D 
Sbjct: 566 EPGNTEEDVGRFLYLEGVEYVMWCTYDVHFYASYALLELFPKIELNIQRDFAKAVLSEDG 625

Query: 581 SKMKLLDDGQWVSRKVLGAVPHDIGICDPWFEVNAYCLYDTARWKDLNPKFVLQVYRDVV 640
            K++ L +G++  RKV GAVPHD+G  DPW E+NAY ++DT+RWKDLN KFVLQVYRD  
Sbjct: 626 RKVRFLAEGKYGIRKVRGAVPHDLGTHDPWNEMNAYNIHDTSRWKDLNTKFVLQVYRDFA 685

Query: 641 ATGDKKFAKAVWPSVYVAMAYMDQFDRDGDGMIENDGFPDQTYDTWSVSGISAYSGGLWV 700
           AT D  F   VWPSV  A+ YM+QFDRDGDG+IENDGFPDQTYDTW+V GISAY G LWV
Sbjct: 686 ATRDMSFGVDVWPSVRAAIEYMEQFDRDGDGVIENDGFPDQTYDTWTVHGISAYCGCLWV 745

Query: 701 AALQAASALAREVGDRGSEDYFLFKFQKAKVVYE-KLWNGSYFNYDNSGSSQSSSIQADQ 759
           AALQAA+A+A E+GD+   +    KF KA+ V E +LWNGSYFNYD+  SS S SIQADQ
Sbjct: 746 AALQAAAAMAHELGDQEFAETCKSKFLKARPVLEAELWNGSYFNYDSGSSSNSKSIQADQ 805

Query: 760 LAGQWYARACGLLPIVDEDKARSALKKVYNYNVLKVMGGKRGAVNGMLPDGRVDMSSMQS 819
           LAGQWY  + GL P+ D+ K +SAL+K+Y++NV+KV GG+ GAVNGM P+G++D + MQS
Sbjct: 806 LAGQWYTASSGLPPLFDDFKIKSALRKIYDFNVMKVRGGRMGAVNGMHPNGKIDETCMQS 865

Query: 820 REIWSGVTYAVAASMIHEDLADIGFQTACGIYEAAWSGTGLGYAFQTPEAWNTDDQYRSL 879
           REIW+GVTY VAA+MI   + +  F+TA GI+ A WS  G GY FQTPEAW+TD  YRSL
Sbjct: 866 REIWTGVTYGVAATMILAGMEEEAFKTAEGIFLAGWSEEGFGYWFQTPEAWSTDGHYRSL 925

Query: 880 CYMRPLAIWAMQWALTRPK-----PK-TLEKQMKPEVTDESLLRYHAGFSKVARLLKLPE 933
            YMRPL+IW MQWAL+ PK     PK  +  ++    ++     +  G  ++A       
Sbjct: 926 IYMRPLSIWGMQWALSLPKAILDAPKINVMDRIHVSSSNTKFFNHETGVRRIA------- 978

Query: 934 EQGAKSLLQSLFDHTC 949
              AK    S+F+  C
Sbjct: 979 -TKAKCFGDSVFNCAC 993


>gi|356535470|ref|XP_003536268.1| PREDICTED: non-lysosomal glucosylceramidase-like [Glycine max]
          Length = 949

 Score =  907 bits (2345), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 468/882 (53%), Positives = 614/882 (69%), Gaps = 22/882 (2%)

Query: 27  DPAKPASLTWQRKLSTGEIPLSQFTLNWKETVQLAPIGVRILCLIREEAAKGKRAFIDPF 86
           D + P    W+R+L++    L +F + + E +++  +G+RI   +REEA+ G++A IDPF
Sbjct: 31  DSSAPPEQAWRRRLNSHANLLKEFRVTFTEAIKMVRLGIRIWSYVREEASHGRKAPIDPF 90

Query: 87  IKRHL--TSSHGVPLGGVGSGSIGRSYRGEFQRWQIFPRVCEDKPVLANQFSVFVSRSNG 144
            +     ++S GVPLGG+GSGSI R +RGEF++WQI P +CE  PV++NQFS+F+SR  G
Sbjct: 91  TRESCKPSASQGVPLGGMGSGSISRGFRGEFRQWQIIPSLCEASPVMSNQFSIFISREGG 150

Query: 145 QK-YSSVLCPKTPEVLKDTTAAGIGSWDWNLKGDKSTYHALYPRAWTVHEGEPDPELRIV 203
           +K ++SVL P   E L      GI SW WNL G  STYHAL+PRAWTV++GEPDPEL+I 
Sbjct: 151 KKKFASVLAPGQHEGLGKPDDQGISSWGWNLSGQHSTYHALFPRAWTVYDGEPDPELKIS 210

Query: 204 CRQISPIIPHNYKESSYPVSVFTYTIYNSGKTSADITLLFTWTNSVGGDSEFTGQHYNSK 263
           CRQISP IPHNY+ESS P +VF YT+ N+GK  A ++LLFTW NS+GG S  +G H N  
Sbjct: 211 CRQISPFIPHNYRESSLPAAVFVYTLVNTGKERAKVSLLFTWANSIGGSSHSSGDHVNEP 270

Query: 264 TKMNDGVHAVLLHHRTSHQLPPVTFALAAQETDGVHVSLCPHFVISGNSLGLTAKDMWHE 323
            K  DGV  VLL+H+T+   PPVTFA+AA ET  V+VS+ P F +S  S  +TAK MW +
Sbjct: 271 FKAEDGVSGVLLYHKTAKGNPPVTFAIAACETQNVNVSVLPSFGLSEES-SMTAKHMWSK 329

Query: 324 IKEHGSFDRLN-SMETSVTSEPGSSIGAAIAASVTVPPDSEGQVTFSLAWDCPEVNFMSG 382
           + + G FD+ N +   S+ S PG ++ AA+AAS  V P  +  V FSLAW  P+V F+ G
Sbjct: 330 MVKDGQFDQENFNSGPSMPSSPGETLCAAVAASTWVEPHGKCTVAFSLAWSSPKVKFVKG 389

Query: 383 KTYYRRYTKFYGTHQNAAANIARDAILEHGSWELQIEAWQRPILEDKRLPEWYPITLFNE 442
            T+ RRYTKFYGT + AAA++A DA+  +  WE +IE WQ P+L+D+ LPEWY  TLFNE
Sbjct: 390 STFNRRYTKFYGTSEKAAADLAHDALTHYNRWEEEIEKWQNPVLKDEALPEWYKFTLFNE 449

Query: 443 LYYLNAGGAVWTDGSPPVHSLVTIGHRKFSLDWSQSDLKRIVDVPNQNDTAVNILERMSS 502
           LY+L AGG +W D SP + S +           +  D  R ++     +T   + +R  +
Sbjct: 450 LYFLVAGGTIWID-SPVLSSNMR----------NDQDRVRELESAVVKETEDKMSDRKRT 498

Query: 503 ILEQIY-----TPVALNSSFGTNLLQDGEENIGQFLYLEGIEYLMWNTYDVHFYSSFALI 557
           ++E        + V          L + ++++G+FLYLEG+EY+MW TYDVHFY+SFAL+
Sbjct: 499 VVESTTDSTYDSAVITGHDRADEKLYEDDDDVGRFLYLEGVEYIMWCTYDVHFYASFALL 558

Query: 558 MLFPKIQLSIQRDFAAAVMMHDPSKMKLLDDGQWVSRKVLGAVPHDIGICDPWFEVNAYC 617
            LFP+I+L+IQRDFA AV+  D  K+K L +G W  RKV GAVPHD+G  DPW E+NAY 
Sbjct: 559 ELFPRIELNIQRDFARAVLCEDGRKVKFLAEGNWGIRKVYGAVPHDLGTHDPWHEMNAYN 618

Query: 618 LYDTARWKDLNPKFVLQVYRDVVATGDKKFAKAVWPSVYVAMAYMDQFDRDGDGMIENDG 677
           ++DT++WKDLNPKFVLQVYRD   TGD +F   VWP+V  AM YM+QFDRDGDG+IENDG
Sbjct: 619 IHDTSKWKDLNPKFVLQVYRDFATTGDLQFGVDVWPAVRAAMEYMEQFDRDGDGLIENDG 678

Query: 678 FPDQTYDTWSVSGISAYSGGLWVAALQAASALAREVGDRGSEDYFLFKFQKAKVVY-EKL 736
           FPDQTYDTW+V G+S Y G LW+AALQAA+ +A E+GDR   +    KF KAK  + EKL
Sbjct: 679 FPDQTYDTWTVHGVSTYCGCLWLAALQAAAVMALELGDREFAETCKRKFLKAKPAFEEKL 738

Query: 737 WNGSYFNYDNSGSSQSSSIQADQLAGQWYARACGLLPIVDEDKARSALKKVYNYNVLKVM 796
           WNG+YFNYD+  S  S SIQADQLAGQWY  + GL  + ++ K +SAL+KVY++NV+KV 
Sbjct: 739 WNGTYFNYDSGSSGNSKSIQADQLAGQWYTASSGLPSLFEDSKIKSALRKVYDFNVMKVK 798

Query: 797 GGKRGAVNGMLPDGRVDMSSMQSREIWSGVTYAVAASMIHEDLADIGFQTACGIYEAAWS 856
           GG+ GAVNGM P+G+VD + MQSRE+W+GVTY +AA+MIH  + +  F TA GI+ A WS
Sbjct: 799 GGRMGAVNGMHPNGKVDETCMQSREVWTGVTYGLAATMIHAGMEEEAFTTAEGIFLAGWS 858

Query: 857 GTGLGYAFQTPEAWNTDDQYRSLCYMRPLAIWAMQWALTRPK 898
             G GY FQTPEAW  D  YRSL YMRPLAIW MQ+A+ RPK
Sbjct: 859 EDGYGYWFQTPEAWTMDGHYRSLMYMRPLAIWGMQYAINRPK 900


>gi|224072655|ref|XP_002303825.1| predicted protein [Populus trichocarpa]
 gi|222841257|gb|EEE78804.1| predicted protein [Populus trichocarpa]
          Length = 966

 Score =  905 bits (2338), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 475/952 (49%), Positives = 612/952 (64%), Gaps = 45/952 (4%)

Query: 27  DPAKPASLTWQRKLSTGEIPLSQFTLNWKETVQLAPIGVRILCLIREEAAKGKRAFIDPF 86
           D A P    W+R+L++    L +F++ +KE +Q+  +G+R+   +REEA+ G++A IDPF
Sbjct: 31  DSAAPPEQAWRRRLNSHANILKEFSVTFKEAIQMVRLGIRLWSYVREEASHGRKAPIDPF 90

Query: 87  IKRHL--TSSHGVPLGGVGSGSIGRSYRGEFQRWQIFPRVCEDKPVLANQFSVFVSRSNG 144
            +     ++S GVPLGG+GSGSI R +RGEF++WQI P +CE  P+       F+SR  G
Sbjct: 91  TRESCKPSASQGVPLGGMGSGSISRGFRGEFRQWQIVPGICESSPI-------FISRDGG 143

Query: 145 QK-YSSVLCPKTPEVLKDTTAAGIGSWDWNLKGDKSTYHALYPRAWTVHEGEPDPELRIV 203
            K Y+SVL P   E +      GI SW WNL G  STYHAL+PRAWTV++GEPDPEL+I 
Sbjct: 144 NKNYASVLAPGQHEGIGKAGDQGISSWGWNLSGQHSTYHALFPRAWTVYDGEPDPELKIS 203

Query: 204 CRQISPIIPHNYKESSYPVSVFTYTIYNSGKTSADITLLFTWTNSVGGDSEFTGQHYNSK 263
           CRQISP IPHNY++SS P +VF YT+ N+GK  A ++LLFTW NS+GG S  +G H N  
Sbjct: 204 CRQISPFIPHNYRDSSLPTAVFVYTLVNTGKERAKVSLLFTWANSIGGISHLSGDHVNEP 263

Query: 264 TKMNDGVHAVLLHHRTSHQLPPVTFALAAQETDGVHVSLCPHFVISGNSLGLTAKDMWHE 323
               DGV  VLLHH+  +  PPVTFA+AA ET  V V++ P F +S  S   TAK MW  
Sbjct: 264 FIGEDGVSGVLLHHKQGN--PPVTFAIAACETQNVSVTVLPSFGLSEGSC-TTAKAMWGT 320

Query: 324 IKEHGSFDRLN-SMETSVTSEPGSSIGAAIAASVTVPPDSEGQVTFSLAWDCPEVNFMSG 382
           + + G FDR N +   S+ S PG ++ AA++AS  V P  +  V F+LAW  P++ F+ G
Sbjct: 321 MVQDGHFDRGNFNWGPSMPSSPGETLCAAVSASAWVEPHGKCTVAFALAWSSPKIKFLKG 380

Query: 383 KTYYRRYTKFYGTHQNAAANIARDAILEHGSWELQIEAWQRPILEDKRLPEWYPITLFNE 442
            +Y+RRYTKFYGT + AA N+  DA+  +  WE +IE WQ PIL+D++LPEWY  TLFNE
Sbjct: 381 SSYHRRYTKFYGTSERAAQNLVHDALTNYKQWEEEIEKWQDPILKDEKLPEWYKFTLFNE 440

Query: 443 LYYLNAGGAVWTDGSPPVHSLVTIGHRKFSLDWSQSDLKRIVDVPNQNDTAVNILERMSS 502
           LY+L AGG VW D S          HR   ++   + +K  V  P  N          + 
Sbjct: 441 LYFLVAGGTVWIDSSLSSADTRNGHHRSREVE--TTGIK--VTEPQVNCNGGPDHTTTND 496

Query: 503 ILEQIYTPVALNSSFGTNLLQDGE------------------------ENIGQFLYLEGI 538
                      N +F T  +   E                        +++G+FLYLEG+
Sbjct: 497 HNTTSSEQKENNKAFHTKCICKDESAVSRERGNLDHTLDPFTFLDPLSDDVGRFLYLEGV 556

Query: 539 EYLMWNTYDVHFYSSFALIMLFPKIQLSIQRDFAAAVMMHDPSKMKLLDDGQWVSRKVLG 598
           EY+MW TYDVHFY+SFAL+ LFPKI+L+IQRDFA AV+  D  K++ L DG    RK  G
Sbjct: 557 EYIMWCTYDVHFYASFALLALFPKIELNIQRDFAKAVLSEDGRKVRFLADGSVGIRKARG 616

Query: 599 AVPHDIGICDPWFEVNAYCLYDTARWKDLNPKFVLQVYRDVVATGDKKFAKAVWPSVYVA 658
           AVPHD+G  DPW E+NAY ++DT++WKDLNPKFVLQVYRD  ATGD  F   VWP+V  A
Sbjct: 617 AVPHDLGTHDPWNEMNAYNIHDTSKWKDLNPKFVLQVYRDFAATGDMSFGVDVWPAVRTA 676

Query: 659 MAYMDQFDRDGDGMIENDGFPDQTYDTWSVSGISAYSGGLWVAALQAASALAREVGDRGS 718
           M YM+QFDRD DG++ENDGFPDQTYD W+V G+SAY G LW+A+LQAA+A+A ++GD+  
Sbjct: 677 MEYMEQFDRDDDGLVENDGFPDQTYDAWTVHGVSAYCGCLWLASLQAAAAMAMQLGDKYF 736

Query: 719 EDYFLFKFQKAKVVYE-KLWNGSYFNYDNSGSSQSSSIQADQLAGQWYARACGLLPIVDE 777
            +    KF KAK  +E KLWNGSYFNYD+  S+ S SIQADQLAG+WY  + GL  + D+
Sbjct: 737 AELCKSKFAKAKSAFESKLWNGSYFNYDSGSSNNSKSIQADQLAGEWYMASSGLPSLFDD 796

Query: 778 DKARSALKKVYNYNVLKVMGGKRGAVNGMLPDGRVDMSSMQSREIWSGVTYAVAASMIHE 837
            K RSAL K+Y++NV+KV GGK GAVNGM P+G+VD + MQSREIWSGVTYAVAA+MI  
Sbjct: 797 VKIRSALNKIYDFNVMKVRGGKMGAVNGMHPNGKVDETCMQSREIWSGVTYAVAATMILS 856

Query: 838 DLADIGFQTACGIYEAAWSGTGLGYAFQTPEAWNTDDQYRSLCYMRPLAIWAMQWALTRP 897
            + D  F TA GI+ A WS  G GY FQTPEAW  D  +RSL YMRPLAIW MQWAL+  
Sbjct: 857 GMEDKAFTTAEGIFTAGWSEEGYGYWFQTPEAWTIDGHFRSLIYMRPLAIWGMQWALSL- 915

Query: 898 KPKTLEKQMKPEVTDESLLRYHAGFSKVARLLKLPEEQGAKSLLQSLFDHTC 949
            PK +    K  + + SLL     FS +           A  L  S+F  +C
Sbjct: 916 -PKAILDAPKINIMERSLLSPSTRFSLIGETGVKKIATKANCLGNSVFHCSC 966


>gi|357440847|ref|XP_003590701.1| Non-lysosomal glucosylceramidase [Medicago truncatula]
 gi|355479749|gb|AES60952.1| Non-lysosomal glucosylceramidase [Medicago truncatula]
          Length = 992

 Score =  897 bits (2319), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 487/932 (52%), Positives = 628/932 (67%), Gaps = 71/932 (7%)

Query: 23  FDKVDPAKPASLTWQRKLSTGEIPLSQFTLNWKETVQLAPIGVRILCLIREEAAKGKRAF 82
           FD +D A P    W+R+L++    L +F + + E +++  +G+R+   +REEA+ G++A 
Sbjct: 28  FD-IDNAAPPEQAWRRRLNSHANLLKEFRVTFMEAIKMVRLGIRMWSYVREEASHGRKAP 86

Query: 83  IDPFIKRHL--TSSHGVPLGGVGSGSIGRSYRGEFQRWQIFPRVCEDKPVLANQFSVFVS 140
           IDPF +     ++S GVPLGG+GSGSI R +RGEF++WQI P +CE  PV+ANQFS+FVS
Sbjct: 87  IDPFTRESCKPSASQGVPLGGMGSGSISRGFRGEFRQWQIIPGLCEPSPVMANQFSIFVS 146

Query: 141 RSNGQK-YSSVLCPKTPE---VLKDTTAAGIGSWDWNLKGDKSTYHALYPRAWTVHEGEP 196
           R  G K ++SVL P   E     +     GI SW WNL G  STYHAL+PRAWTV++GEP
Sbjct: 147 REGGNKSFASVLAPGQHEGVGACRKADDQGISSWGWNLNGQHSTYHALFPRAWTVYDGEP 206

Query: 197 DPELRIVCRQISPIIPHNYKESSYPVSVFTYTIYNSGKTSADITLLFTWTNSVGGDSEFT 256
           DPEL+I CRQISP IPHNY+ESS P +VF YT+ N+GK  A ++LLFTW NS+GG+S  +
Sbjct: 207 DPELKISCRQISPFIPHNYRESSLPAAVFVYTLVNTGKERAKVSLLFTWANSIGGNSHLS 266

Query: 257 GQHYNSKTKMNDGVHAVLLHH-------RTSHQLPPVTFALAAQETDGVHVSLCPHFVIS 309
           G H N      DGV  VLL+H       RT+   PPVTF++AA ET  V VS+ P F +S
Sbjct: 267 GDHVNEPFIAEDGVSGVLLYHKQVVEYFRTAKDNPPVTFSIAACETQNVSVSVLPCFGLS 326

Query: 310 GNSLGLTAKDMWHEIKEHGSFDRLN-SMETSVTSEPGSSIGAAIAASVTVPPDSEGQVTF 368
             S  +TAK MW ++ + G FDR N S   S+ S PG ++ AA+AAS  V P  +  V F
Sbjct: 327 DRS-SVTAKGMWTKMVKDGQFDRENFSSGPSMPSSPGETLCAAVAASAWVEPHGKCTVAF 385

Query: 369 SLAWDCPEVNFMSGKTYYRRYTKFYGTHQNAAANIARDAILEHGSWELQIEAWQRPILED 428
           SLAW  P+V F+ G T+ RRYTKFYGT + AA ++A DA+  +  WE +I  WQ PIL+D
Sbjct: 386 SLAWSSPKVKFVKGSTFNRRYTKFYGTSERAAVHLAHDALTHYTRWEEEIAKWQDPILKD 445

Query: 429 KRLPEWYPITLFNELYYLNAGGAVWTDGSPPVHSLVTIGHRKFS---LDWSQSDLKRIV- 484
           ++LPEWY  TLFNELY+L AGG +W D +     L++   R  S   L+ S++ + RI  
Sbjct: 446 EKLPEWYKFTLFNELYFLVAGGTIWIDST-----LLSSNKRNNSQDQLEESENAVVRITE 500

Query: 485 ------------------------------------DVPNQNDTAVNILERMSSILEQIY 508
                                               D+  +N T VN L + +S      
Sbjct: 501 AKVDCRKREVVECTTDNSYDSTAHRGHNHLDEKHNRDISRENGT-VNTLGKGNSA----N 555

Query: 509 TPVALNSSFGTNLLQDGE-ENIGQFLYLEGIEYLMWNTYDVHFYSSFALIMLFPKIQLSI 567
           TP   + S   NL  D + ++ G+FLYLEG+EY+MW TYDVHFY+SFAL+MLFP+I+L+I
Sbjct: 556 TP---HHSTMKNLQHDDDNDDGGRFLYLEGVEYVMWCTYDVHFYASFALLMLFPRIELNI 612

Query: 568 QRDFAAAVMMHDPSKMKLLDDGQWVSRKVLGAVPHDIGICDPWFEVNAYCLYDTARWKDL 627
           QRDFA AV+  D  K+K L +G W +RKV GAVPHD+G  DPW E+NAY ++DT++WKDL
Sbjct: 613 QRDFAQAVLCEDGRKVKFLAEGNWGTRKVYGAVPHDLGTHDPWHEMNAYNIHDTSKWKDL 672

Query: 628 NPKFVLQVYRDVVATGDKKFAKAVWPSVYVAMAYMDQFDRDGDGMIENDGFPDQTYDTWS 687
           NPKFVLQVYRD  ATGD +F   VWP+V  AM YM+QFDRD DG+IENDGFPDQTYDTW+
Sbjct: 673 NPKFVLQVYRDFSATGDLQFGVDVWPAVRAAMEYMEQFDRDADGLIENDGFPDQTYDTWT 732

Query: 688 VSGISAYSGGLWVAALQAASALAREVGDRGSEDYFLFKFQKAKVVYE-KLWNGSYFNYDN 746
           V G+SAY GGLW+AALQAA+A+A ++GDR   +    KF KAK VYE KLWNGSYFNYD+
Sbjct: 733 VHGVSAYCGGLWLAALQAAAAMALQLGDRDFAETCKRKFLKAKPVYEQKLWNGSYFNYDS 792

Query: 747 SGSSQSSSIQADQLAGQWYARACGLLPIVDEDKARSALKKVYNYNVLKVMGGKRGAVNGM 806
             SS S SIQADQLAGQWY  + GL  + D+ K +S+L+KV+++NV+KV GG+ GAVNGM
Sbjct: 793 GSSSNSKSIQADQLAGQWYTASSGLPSLFDDFKIKSSLRKVFDFNVMKVKGGRMGAVNGM 852

Query: 807 LPDGRVDMSSMQSREIWSGVTYAVAASMIHEDLADIGFQTACGIYEAAWSGTGLGYAFQT 866
            P+G+VD + MQSREIW+GVTY VAA+MI   + +  F TA GI+ A WS  G GY FQT
Sbjct: 853 HPNGKVDETCMQSREIWAGVTYGVAATMILAGMEEEAFTTAEGIFLAGWSEEGSGYWFQT 912

Query: 867 PEAWNTDDQYRSLCYMRPLAIWAMQWALTRPK 898
           PEA+  D  YRSL YMRPL+IW MQ+ALT PK
Sbjct: 913 PEAFTIDGHYRSLIYMRPLSIWGMQYALTMPK 944


>gi|11994237|dbj|BAB01359.1| unnamed protein product [Arabidopsis thaliana]
          Length = 937

 Score =  886 bits (2289), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 461/883 (52%), Positives = 599/883 (67%), Gaps = 33/883 (3%)

Query: 27  DPAKPASLTWQRKLSTGEIPLSQFTLNWKETVQLAPIGVRILCLIREEAAKGKRAFIDPF 86
           D A P    W+R+L+     L +FT+ ++E +++  +G+R+   +REEA+ G++A IDPF
Sbjct: 31  DSAAPPPHAWRRRLNCHANILKEFTITFREAIKMVRLGIRLWSYVREEASHGRKAPIDPF 90

Query: 87  IKRHL--TSSHGVPLGGVGSGSIGRSYRGEFQRWQIFPRVCEDKPVLANQFSVFVSRSNG 144
            K +   ++S GVPLGG+GSGSI R +RGEF++WQI P  C+  P+++NQFS+F+SR  G
Sbjct: 91  TKENCKPSASQGVPLGGMGSGSISRGFRGEFKQWQITPGTCDPSPMMSNQFSIFISRDGG 150

Query: 145 -QKYSSVLCPKTPEVLKDTTAAGIGSWDWNLKGDKSTYHALYPRAWTVHEGEPDPELRIV 203
            +KY+SVL P     L  +   G+ SW WNL G  STYHAL+PRAWT+++GEPDPEL+I 
Sbjct: 151 HKKYASVLSPGQHGSLGKSRDKGLSSWGWNLNGQHSTYHALFPRAWTIYDGEPDPELKIS 210

Query: 204 CRQISPIIPHNYKESSYPVSVFTYTIYNSGKTSADITLLFTWTNSVGGDSEFTGQHYNSK 263
           CRQISP IP+NY++SS P +VF YT+ N+GK  A ++LLFTW NS+GG S  +G H N  
Sbjct: 211 CRQISPFIPNNYRDSSLPAAVFVYTLVNTGKERAKVSLLFTWANSMGGTSHMSGGHVNEP 270

Query: 264 TKMNDGVHAVLLHHRTSHQLPPVTFALAAQETDGVHVSLCPHFVISGNSLGLTAKDMWHE 323
                G              PPVTFA+AA ET  V+V++ P F +S +S   TAKDMW  
Sbjct: 271 FMTGKGN-------------PPVTFAIAASETQNVNVTVLPCFGLSEDS-SFTAKDMWDT 316

Query: 324 IKEHGSFDRLN-SMETSVTSEPGSSIGAAIAASVTVPPDSEGQVTFSLAWDCPEVNFMSG 382
           +++ G FD+ N +   S  S  G +I AA++AS  V    +  V+F+L+W  P+V F  G
Sbjct: 317 MEQDGKFDQENFNSGPSTPSLAGDTICAAVSASAWVEAHGKCTVSFALSWSSPKVKFSKG 376

Query: 383 KTYYRRYTKFYGTHQNAAANIARDAILEHGSWELQIEAWQRPILEDKRLPEWYPITLFNE 442
            TY RRYTKFYGT   AA ++  DA+  +  WE  IEAWQ PIL D+RLPEWY  TLFNE
Sbjct: 377 STYDRRYTKFYGTSPRAALDLVHDALTNYKRWEEDIEAWQNPILRDERLPEWYKFTLFNE 436

Query: 443 LYYLNAGGAVWTDGSPPVHSLVTIGHRKFSLDWSQSDLKRI------VDVPNQNDTAVNI 496
           LY+L AGG VW D S  +++     H++  L  S   +  +       D+ N N   V  
Sbjct: 437 LYFLVAGGTVWIDSSS-LNANGNSQHQQSGLGNSDGKVGGLDINDQRNDLGNGNSVGVKS 495

Query: 497 LERMSSILEQIYTPVALNSSFGTNLLQDGEENIGQFLYLEGIEYLMWNTYDVHFYSSFAL 556
            + +S+I  +       N  F      D  +++G+FLYLEG+EY+MW TYDVHFY+S+AL
Sbjct: 496 NDEVSAIHNR-------NGLFVDTPHVDDGDDVGRFLYLEGVEYVMWCTYDVHFYASYAL 548

Query: 557 IMLFPKIQLSIQRDFAAAVMMHDPSKMKLLDDGQWVSRKVLGAVPHDIGICDPWFEVNAY 616
           +MLFPKI+L+IQRDFA AV+  D  K+K L +G    RKV GAVPHD+G+ DPW E+NAY
Sbjct: 549 LMLFPKIELNIQRDFAKAVLSEDGRKVKFLAEGNVGIRKVRGAVPHDLGMHDPWNEMNAY 608

Query: 617 CLYDTARWKDLNPKFVLQVYRDVVATGDKKFAKAVWPSVYVAMAYMDQFDRDGDGMIEND 676
            ++DT+RWKDLNPKFVLQVYRD  ATGD +F   VWP+V  AM YM+QFDRD D +IEND
Sbjct: 609 NIHDTSRWKDLNPKFVLQVYRDFAATGDYQFGIDVWPAVRAAMEYMEQFDRDNDDLIEND 668

Query: 677 GFPDQTYDTWSVSGISAYSGGLWVAALQAASALAREVGDRGSEDYFLFKFQKAKVVYE-K 735
           GFPDQTYDTW+V G+SAY G LW+AALQAA+A+A ++GD+   +    KF  AK   E K
Sbjct: 669 GFPDQTYDTWTVHGVSAYCGCLWLAALQAAAAMALQIGDKFFAELCKNKFLNAKAALETK 728

Query: 736 LWNGSYFNYDNSGSSQSSSIQADQLAGQWYARACGLLPIVDEDKARSALKKVYNYNVLKV 795
           LWNGSYFNYD+  SS S SIQ DQLAGQWYA + GL P+ +E K RS ++K++++NV+K 
Sbjct: 729 LWNGSYFNYDSGSSSNSKSIQTDQLAGQWYAASSGLPPLFEESKIRSTMQKIFDFNVMKT 788

Query: 796 MGGKRGAVNGMLPDGRVDMSSMQSREIWSGVTYAVAASMIHEDLADIGFQTACGIYEAAW 855
            GGK GAVNGM PDG+VD + MQSREIW+GVTYA AA+MI   + + GF TA GI+ A W
Sbjct: 789 KGGKMGAVNGMHPDGKVDDTCMQSREIWTGVTYAAAATMILSGMEEQGFTTAEGIFTAGW 848

Query: 856 SGTGLGYAFQTPEAWNTDDQYRSLCYMRPLAIWAMQWALTRPK 898
           S  G GY FQTPE W  D  YRSL YMRPLAIW MQWAL+ PK
Sbjct: 849 SEEGFGYWFQTPEGWTMDGHYRSLIYMRPLAIWGMQWALSLPK 891


>gi|222612993|gb|EEE51125.1| hypothetical protein OsJ_31868 [Oryza sativa Japonica Group]
          Length = 935

 Score =  882 bits (2278), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 468/898 (52%), Positives = 590/898 (65%), Gaps = 67/898 (7%)

Query: 27  DPAKPASLTWQRKLSTGEIPLSQFTLNWKETVQLAPIGVRILCLIREEAAKGKRAFIDPF 86
           D   P    W+R+L++    L +F++ + E +++                   +A IDPF
Sbjct: 31  DGGSPPEQAWRRRLNSHANLLKEFSVTFMEAMRM-------------------KAPIDPF 71

Query: 87  IKRHL--TSSHGVPLGGVGSGSIGRSYRGEFQRWQIFPRVCEDKPVLANQFSVFVSRSNG 144
            K     ++S GVPLGG+GSGSI R +RGEF+ W I P +CE  P+       FVSR  G
Sbjct: 72  TKEKCKPSASQGVPLGGMGSGSISRGFRGEFKNWHIIPGLCETSPI-------FVSRDGG 124

Query: 145 -QKYSSVLCPKTPEVLKDTTAAGIGSWDWNLKGDKSTYHALYPRAWTVHEGEPDPELRIV 203
            +KYSSVL P   E LK    +GI SWDWNL G  STYHAL+PRAWTV++GEPDP+L+I 
Sbjct: 125 NKKYSSVLSPGHHEGLKKCNDSGISSWDWNLSGQHSTYHALFPRAWTVYDGEPDPDLKIS 184

Query: 204 CRQISPIIPHNYKESSYPVSVFTYTIYNSGKTSADITLLFTWTNSVGGDSEFTGQHYNSK 263
           CRQISP IPH+YK+SS P SVF YT+ N+GK  A ++LL TW NS+GG S  +G H+N  
Sbjct: 185 CRQISPFIPHDYKDSSLPTSVFVYTLVNTGKDRAKVSLLMTWANSIGGFSHHSGGHFNEP 244

Query: 264 TKMNDGVHAVLLHHRTSHQLPPVTFALAAQETDGVHVSLCPHFVISGNSLGLTAKDMWHE 323
                          T+   PPVTFA+AA ET  V+V++ P F +SG     +AK MW  
Sbjct: 245 -------------FITAKDNPPVTFAIAACETQNVNVTVLPVFGLSGEGHD-SAKQMWDR 290

Query: 324 IKEHGSFDRLN-SMETSVTSEPGSSIGAAIAASVTVPPDSEGQVTFSLAWDCPEVNFMSG 382
           +K++G FDR N    TS+ S  G ++ AA++AS  V P     V F LAW  P++ F  G
Sbjct: 291 MKQNGHFDRENFEAGTSMPSSSGETLCAAVSASTWVEPHGRCTVVFGLAWSSPKIKFQKG 350

Query: 383 KTYYRRYTKFYGTHQNAAANIARDAILEHGSWELQIEAWQRPILEDKRLPEWYPITLFNE 442
            TY RRYT+FYGT + +A N+  DA+ ++  WE +IE WQ PIL++++LPEWY  TLFNE
Sbjct: 351 CTYNRRYTEFYGTSERSAVNLVHDALTKYRIWEEEIEKWQNPILKNEKLPEWYKFTLFNE 410

Query: 443 LYYLNAGGAVWTDGSPPV-------HSLVTIGHRKFSLDWSQSDLK------RIVDVPNQ 489
           LY+L AGG VWTDG PPV        S      ++ + D  Q  +K          V N 
Sbjct: 411 LYFLVAGGTVWTDGQPPVIDEKPSPGSNQQKSSKRGTRDTKQESVKDNHVKLTAEQVTNG 470

Query: 490 NDTAVNILERMSSILEQIYTPVALNSSFGTN--------LLQDGEENIGQFLYLEGIEYL 541
            D A N  E+  S    ++ P    ++ G          L ++G EN+G+FLYLEG+EY+
Sbjct: 471 GDLA-NGEEQSVSKYAAVHGPQMAKATNGLGSQEPIPYLLSKNGPENVGKFLYLEGVEYI 529

Query: 542 MWNTYDVHFYSSFALIMLFPKIQLSIQRDFAAAVMMHDPSKMKLLDDGQWVSRKVLGAVP 601
           MW TYDVHFY+SFAL+ LFPKI+LSIQRDFA AV+  D  +MK L DG    RKV GAVP
Sbjct: 530 MWCTYDVHFYASFALLDLFPKIELSIQRDFANAVLYEDRRRMKFLADGTSGIRKVKGAVP 589

Query: 602 HDIGICDPWFEVNAYCLYDTARWKDLNPKFVLQVYRDVVATGDKKFAKAVWPSVYVAMAY 661
           HD+G  DPW E+NAY ++DT++WKDLNPKFVLQVYRD  ATGD  F + VWP+V   M Y
Sbjct: 590 HDLGTHDPWHEMNAYNIHDTSKWKDLNPKFVLQVYRDFAATGDMSFGRDVWPAVCAVMDY 649

Query: 662 MDQFDRDGDGMIENDGFPDQTYDTWSVSGISAYSGGLWVAALQAASALAREVGDRGSEDY 721
           M+QFDRDGDG+IENDGFPDQTYD W+V GISAY GGLW+AALQAA+ +A  +GDR   + 
Sbjct: 650 MNQFDRDGDGLIENDGFPDQTYDAWTVHGISAYCGGLWLAALQAAATMAHRLGDRPFAEK 709

Query: 722 FLFKFQKAKVVYE-KLWNGSYFNYDNSGSSQSSSIQADQLAGQWYARACGLLPIVDEDKA 780
           +  KF +AK VYE KLWNGSYFNYD+  SS S SIQADQLAGQWYA + GL P+ DE+K 
Sbjct: 710 YKLKFIQAKAVYEAKLWNGSYFNYDSGTSSNSRSIQADQLAGQWYAASSGLPPLFDENKI 769

Query: 781 RSALKKVYNYNVLKVMGGKRGAVNGMLPDGRVDMSSMQSREIWSGVTYAVAASMIHEDLA 840
           RSAL+K++ +NV+KV GG+ GAVNGM P+G+VD + MQSREIW+GVTY VAA+M+   + 
Sbjct: 770 RSALQKIFEFNVMKVKGGRLGAVNGMTPNGKVDETCMQSREIWTGVTYGVAANMLLHGME 829

Query: 841 DIGFQTACGIYEAAWSGTGLGYAFQTPEAWNTDDQYRSLCYMRPLAIWAMQWALTRPK 898
             GF TA GI+ A WS  G GY FQTPE W  D  YRSL YMRPLAIWAMQWA + PK
Sbjct: 830 HQGFTTAEGIFIAGWSEEGYGYWFQTPEGWTIDGHYRSLIYMRPLAIWAMQWARSPPK 887


>gi|9665098|gb|AAF97289.1|AC010164_11 Hypothetical protein [Arabidopsis thaliana]
          Length = 790

 Score =  875 bits (2262), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 457/825 (55%), Positives = 569/825 (68%), Gaps = 82/825 (9%)

Query: 138 FVSRSNGQKYSSVLCPKTPEVLKDTTA---------AGIGSWDWNLKGDKSTYHALYPRA 188
           FVSR  G K+S+VLCP  P+V+KD             GI SWDWN+ G+KSTYHALYPR+
Sbjct: 24  FVSRPGGVKHSTVLCPSKPQVVKDNGGYLCQGQAPNMGIESWDWNMTGEKSTYHALYPRS 83

Query: 189 WTVHEG---EPDPELRIVCRQISPIIPHNYKESSYPVSVFTYTIYNSGKTSADITLLFTW 245
           WTV++G   + +    +  R I   +    K     + +    + N+G   A +TLLFTW
Sbjct: 84  WTVYDGIIFDLNVSNHVYYRLIYCKVLMYIK--VMKMFLLDEQVTNTGAEPAIVTLLFTW 141

Query: 246 TNSVGGDSEFTGQHYNSKTKMNDGVHAVLLHHRTSHQLPPVTFALAAQETDGVHVSLCPH 305
            NSVGG S  TGQH+NS  K  +G              PPV++A+AA+ET+ V VS CP 
Sbjct: 142 ENSVGGASGLTGQHFNSTMKTANG-------------HPPVSYAIAAKETEDVRVSSCPC 188

Query: 306 FVISGNSLG-LTAKDMWHEIKEHGSFDRLNSMETSVTSEPGSSIGAAIAASVTVPPDSEG 364
           F++SG +   +TA DMW EIK++ SFD+L S   S  S+PG+SIGAAIAA V VPP  + 
Sbjct: 189 FIVSGTTPNQITAGDMWDEIKKNASFDKLTSNACS-PSKPGTSIGAAIAAKVKVPPGCDR 247

Query: 365 QVTFSLAWDCPEVNFMSGKTYYRRYTKFYGTHQNAAANIARDAILEHGSWELQIEAWQRP 424
            VTFSL+WDCPE  F   KTY+RRYT+FYG+  NAA  +A DA+L               
Sbjct: 248 TVTFSLSWDCPEARF-DEKTYHRRYTRFYGSLGNAAVAMAHDALL--------------- 291

Query: 425 ILEDKRLPEWYPITLFNELYYLNAGGAVWTDGSPPVHSLVTIGHRKFSLDWSQSDLKRIV 484
                                         +G PP  SL +IG RK SL  S  D     
Sbjct: 292 ------------------------------NGLPPKQSLDSIGRRKISLGLSTIDKND-- 319

Query: 485 DVPNQNDTAVNILERMSSILEQIYTPVALNSSFGTNLLQDGEENIGQFLYLEGIEYLMWN 544
              +QN+ A++IL R+ ++  QI+ P+  N++ G  ++Q+  ENIGQFLYLEG++YLM+N
Sbjct: 320 --QDQNNVALDILGRIDAVCSQIHAPLTSNAALGNTMIQNTTENIGQFLYLEGVQYLMYN 377

Query: 545 TYDVHFYSSFALIMLFPKIQLSIQRDFAAAVMMHDPSKMKLLDDGQWVSRKVLGAVPHDI 604
           TYDVHFYSSFAL+MLFPKI+LSIQRDFAAAV+MHD SK +++  G++V+RKVLGAVPHDI
Sbjct: 378 TYDVHFYSSFALLMLFPKIELSIQRDFAAAVLMHDSSKKQVMSSGEFVTRKVLGAVPHDI 437

Query: 605 GICDPWFEVNAYCLYDTARWKDLNPKFVLQVYRDVVATGDKKFAKAVWPSVYVAMAYMDQ 664
           G+ DPWFE+NAY L++T RWKDLN KFVLQVYRDVVATGD  FAKAVWPSVY A+AY+DQ
Sbjct: 438 GLNDPWFELNAYNLFNTDRWKDLNSKFVLQVYRDVVATGDLNFAKAVWPSVYTAIAYLDQ 497

Query: 665 FDRDGDGMIENDGFPDQTYDTWSVSGISAYSGGLWVAALQAASALAREVGDRGSEDYFLF 724
           FD+DGDGMIEN+GFPDQTYD WS SG+SAY GGLWVAALQA SALARE+GD G+  YF  
Sbjct: 498 FDKDGDGMIENEGFPDQTYDAWSCSGVSAYCGGLWVAALQAGSALAREIGDNGAAVYFNA 557

Query: 725 KFQKAKVVYEKLWNGSYFNYDNSGSSQSSSIQADQLAGQWYARACGLLPIVDEDKARSAL 784
           K++KA+ VYEKLWNGSYFNYDNS S  SSSI ADQ+AGQWYARACGL PI  E+  + AL
Sbjct: 558 KYEKARSVYEKLWNGSYFNYDNSRSGSSSSILADQMAGQWYARACGLKPIAKEEWIKKAL 617

Query: 785 KKVYNYNVLKVMGGKRGAVNGMLPDGRVDMSSMQSREIWSGVTYAVAASMIHEDLADIGF 844
           + VY++NV++V  G RGAVNGMLPDGRVD S+M SRE+W+G TY+VAA MI E LAD GF
Sbjct: 618 ETVYDFNVMRVRDGTRGAVNGMLPDGRVDTSTMVSREVWAGTTYSVAACMIQEGLADKGF 677

Query: 845 QTACGIYEAAWSGTGLGYAFQTPEAWNTDDQYRSLCYMRPLAIWAMQWALTRPKP-KTLE 903
           +TA GIYEAAWS  GLG AFQTPEAW T+D+YRSLCYMRPLAIW +QWA T PKP +  E
Sbjct: 678 RTASGIYEAAWSDRGLGCAFQTPEAWTTNDEYRSLCYMRPLAIWGIQWAHTMPKPNREQE 737

Query: 904 KQMKPEVTDES--LLRYHAGFSKVARLLKLPEEQGAKSLLQSLFD 946
             ++P+  D +  L + HAGF KVA  LK  + +  ++ LQ+ ++
Sbjct: 738 VSLRPQEEDATSVLFQQHAGFIKVAHYLKNTKGKDHRNRLQTAYE 782


>gi|302762052|ref|XP_002964448.1| hypothetical protein SELMODRAFT_166634 [Selaginella moellendorffii]
 gi|300168177|gb|EFJ34781.1| hypothetical protein SELMODRAFT_166634 [Selaginella moellendorffii]
          Length = 916

 Score =  858 bits (2217), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 476/937 (50%), Positives = 618/937 (65%), Gaps = 44/937 (4%)

Query: 33  SLTWQRKLSTGEIPLSQFTLNWKETVQLAPIGVRILCLIREEAAKGKRAFIDPFIKRHL- 91
           +L W+R+L T  + L++F    +E +++ P+GVR+    R E  +G+RA IDPF  +   
Sbjct: 5   ALAWKRRLDTPVVELTEFKPTLREILEMLPMGVRLWKFCRSEHEQGRRAPIDPFYPQDTP 64

Query: 92  -TSSHGVPLGGVGSGSIGRSYRGEFQRWQIFPRVCEDKPVLANQFSVFVSR----SNGQK 146
            +S  GVPLGG+G GSIGR +RG+F RWQ+ P VC+   VLA+QFSVF++R       QK
Sbjct: 65  ASSCQGVPLGGIGGGSIGRGFRGDFSRWQLLPTVCQTSAVLADQFSVFITRHSKDGTTQK 124

Query: 147 YSSVLCPKTPEVLKDTTAAGIGSWDWNLKGDKSTYHALYPRAWTVHEGEPDPELRIVCRQ 206
            SSVL P  P      T +GI SW+WNL G KSTY+ L+PRAWT ++GEPDP+++I CRQ
Sbjct: 125 TSSVLFPGEPVFESKETESGISSWNWNLTGVKSTYYGLFPRAWTTYDGEPDPQMKITCRQ 184

Query: 207 ISPIIPHNYKESSYPVSVFTYTIYNSGKTSADITLLFTWTNSVGGDSEFTGQHYNSKTKM 266
           +SP +PHNYKESS+P  VF Y + N+G   A  TLLFTW NS+ G SE +G H NS    
Sbjct: 185 VSPFVPHNYKESSFPSCVFIYELVNTGDEDATATLLFTWANSIAGGSETSGGHENSP--- 241

Query: 267 NDGVHAVLLHHRTSHQLPPVTFALAAQETDGVHVSLCPHFVISGNSLG---LTAKDMWHE 323
                      +  H    VTFA+AA+ETD V  S+CP F+++G+  G    TAKDMW E
Sbjct: 242 ---------FMKEEHS---VTFAIAARETDDVSTSVCPCFLVAGSPEGNSYQTAKDMWSE 289

Query: 324 IKEHGSFDRLNSMETSVTSEPGSSIGAAIAASVTVPPDSEGQVTFSLAWDCPEVNFMSGK 383
           I E G F+  NS  ++  S+P + +GAA+AASV VP      +TF LAWD P+  F+ G 
Sbjct: 290 ITEKGCFNAKNSTMSASLSQPETVVGAAVAASVVVPAHGTKSITFGLAWDSPKATFLGGS 349

Query: 384 TYYRRYTKFYGTHQNAAANIARDAILEHGSWELQIEAWQRPILEDKRLPEWYPITLFNEL 443
           +YYRRYTKFYG   NAA+ +  DAIL +  WE +I AWQ PIL D  LPEWY  TLFNEL
Sbjct: 350 SYYRRYTKFYGKSGNAASKLVHDAILNYKRWESEIHAWQEPILNDHSLPEWYRYTLFNEL 409

Query: 444 YYLNAGGAVWTDGSPPVHSLVTIGHRKFSLDWSQSDLKRIVDVPNQNDTAVNILERMSSI 503
           YYL AGG +WT  +        +    FS    Q+ L + ++    N  +  +L+ +++ 
Sbjct: 410 YYLVAGGTIWTGMNMKRIYTTLLCDLSFSDTCRQACLAQKLNPVKSNGVSEALLDAINAN 469

Query: 504 LEQIYTPVALNSSFGTNLLQDGEENIGQFLYLEGIEYLMWNTYDVHFYSSFALIMLFPKI 563
            E          SF +  +   E+N+GQFLYLEG+EY  WNTYDVHFY+S+AL+ LFP++
Sbjct: 470 SEA--------QSFWSIGIPAVEDNVGQFLYLEGVEYHFWNTYDVHFYASYALLALFPRL 521

Query: 564 QLSIQRDFAAAVMMHDPSKMKLLDDGQWVSRKVLGAVPHDIGICDPWFEVNAYCLYDTAR 623
           +L +QRDFAAA + HDP KM  L  G    R V G+VPHD+G  DPW  +N+Y L+DT++
Sbjct: 522 ELQLQRDFAAATLSHDPEKMFYLAYGNTGIRAVFGSVPHDLGTHDPWNTLNSYNLHDTSK 581

Query: 624 WKDLNPKFVLQVYRDVVATGDKKFAKAVWPSVYVAMAYMDQFDRDGDGMIENDGFPDQTY 683
           WKDLN KFV+QV RD  ATGD +FA+AVWP+ Y AMA  DQFDRDGDGMIENDG PDQTY
Sbjct: 582 WKDLNCKFVVQVNRDAAATGDLEFARAVWPAAYAAMAVTDQFDRDGDGMIENDGVPDQTY 641

Query: 684 DTWSVSGISAYSGGLWVAALQAASALAREVGDRGSEDYFLFKFQKAKVVYEK-LWNGSYF 742
           D W+V G+S+Y GGLWVAALQAASALA  VGD+ S   F  KF +AK VYEK LWNGSYF
Sbjct: 642 DLWTVDGVSSYCGGLWVAALQAASALADLVGDKPSSRMFHGKFLRAKAVYEKSLWNGSYF 701

Query: 743 NYDNSGSSQSSSIQADQLAGQWYARACGLLPIVDEDKARSALKKVYNYNVLKVMGGKRGA 802
           +YD +    S+S+Q+DQLAGQWY+ A GL  IV ED+A SAL  V+  NV++  GG  GA
Sbjct: 702 HYDKN----STSVQSDQLAGQWYSLASGLPGIVREDQATSALGTVFATNVMRYKGGNEGA 757

Query: 803 VNGMLPDGRVDMSSMQSREIWSGVTYAVAASMIHEDLADIGFQTACGIYEAAWSGTGLGY 862
           VNGMLP G  D +S+QSRE+W+G TYAVAA+MI + + + GF+TA G++   WS  G GY
Sbjct: 758 VNGMLPSGGEDRTSLQSREVWAGTTYAVAAAMIQQGMREEGFRTAKGVFLNGWSDQGHGY 817

Query: 863 AFQTPEAWNTDDQYRSLCYMRPLAIWAMQWALTRPKPKTLEKQMKPEVTDESLLRYHA-- 920
           AFQTPEAW+ D +YRSL YMRPL+IWAMQWAL  P+ +  ++    EV D +     A  
Sbjct: 818 AFQTPEAWDNDGKYRSLAYMRPLSIWAMQWALDPPE-QLKQRSCSVEVGDITAQEVEACN 876

Query: 921 -GFSKVARLLKL---PEEQGAKSLLQSLFDHTCRRMF 953
             F+K+A  LK    PE +   S+   +FD+   R++
Sbjct: 877 KSFAKLASQLKELSPPEAKAYSSVWIWMFDYIRSRVY 913


>gi|302787158|ref|XP_002975349.1| hypothetical protein SELMODRAFT_174907 [Selaginella moellendorffii]
 gi|300156923|gb|EFJ23550.1| hypothetical protein SELMODRAFT_174907 [Selaginella moellendorffii]
          Length = 916

 Score =  858 bits (2216), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 477/937 (50%), Positives = 617/937 (65%), Gaps = 44/937 (4%)

Query: 33  SLTWQRKLSTGEIPLSQFTLNWKETVQLAPIGVRILCLIREEAAKGKRAFIDPFIKRHL- 91
           +L W+R+L T  + L +F   ++E +++ P+GVR+    R E  +G+RA IDPF  +   
Sbjct: 5   ALAWKRRLDTPVMELKEFKPTFREILEMLPMGVRLWKFCRSEHEQGRRAPIDPFYPQDTP 64

Query: 92  -TSSHGVPLGGVGSGSIGRSYRGEFQRWQIFPRVCEDKPVLANQFSVFVSR----SNGQK 146
            +S  GVPLGG+G GSIGR +RG+F RWQ+ P VC+   VLA+QFSVF++R       QK
Sbjct: 65  ASSCQGVPLGGIGGGSIGRGFRGDFSRWQLLPTVCQTSAVLADQFSVFITRHSKDGTTQK 124

Query: 147 YSSVLCPKTPEVLKDTTAAGIGSWDWNLKGDKSTYHALYPRAWTVHEGEPDPELRIVCRQ 206
            SSVL P  P    + T +GI SW+WNL G KSTY+ L+PRAWT ++GEPDP+++I CRQ
Sbjct: 125 TSSVLFPGEPVFESEETESGISSWNWNLTGVKSTYYGLFPRAWTTYDGEPDPQMKITCRQ 184

Query: 207 ISPIIPHNYKESSYPVSVFTYTIYNSGKTSADITLLFTWTNSVGGDSEFTGQHYNSKTKM 266
           +SP +PHNYKESS+P  VF Y + N+G   A  TLLFTW NS+ G SE +G H NS    
Sbjct: 185 VSPFLPHNYKESSFPSCVFIYELVNTGDEDATATLLFTWANSIAGGSETSGGHENSP--- 241

Query: 267 NDGVHAVLLHHRTSHQLPPVTFALAAQETDGVHVSLCPHFVISGNSLG---LTAKDMWHE 323
                      +  H    VTFA+AA+ETD V  S+CP F+++G   G    TAKDMW E
Sbjct: 242 ---------FMKEEHS---VTFAIAARETDDVSTSVCPCFLVAGAPEGNSYQTAKDMWSE 289

Query: 324 IKEHGSFDRLNSMETSVTSEPGSSIGAAIAASVTVPPDSEGQVTFSLAWDCPEVNFMSGK 383
           I E G F+  NS  ++  S+P + +GAA+AASV VP      +TF LAWD P+  F+ G 
Sbjct: 290 IIEKGCFNAKNSTMSASLSQPETVVGAAVAASVVVPAHGTKSITFGLAWDSPKATFLGGS 349

Query: 384 TYYRRYTKFYGTHQNAAANIARDAILEHGSWELQIEAWQRPILEDKRLPEWYPITLFNEL 443
           +YYRRYTKFYG   NAA+ +  DAIL +  WE +I AWQ PIL D  LPEWY  TLFNEL
Sbjct: 350 SYYRRYTKFYGKSGNAASKLVHDAILNYKRWESEIHAWQEPILNDHSLPEWYRYTLFNEL 409

Query: 444 YYLNAGGAVWTDGSPPVHSLVTIGHRKFSLDWSQSDLKRIVDVPNQNDTAVNILERMSSI 503
           YYL AGG +WT  +        +    FS    Q+ L + ++    N  +  +L+ +++ 
Sbjct: 410 YYLVAGGTIWTGMNMKRIYTTLLCDLSFSDSCRQACLAQKLNPVKSNGVSEALLDAINAN 469

Query: 504 LEQIYTPVALNSSFGTNLLQDGEENIGQFLYLEGIEYLMWNTYDVHFYSSFALIMLFPKI 563
            E          SF +  +   E+N+GQFLYLEG+EY  WNTYDVHFY+S+AL+ LFP++
Sbjct: 470 SEA--------QSFWSFDIPAVEDNVGQFLYLEGVEYHFWNTYDVHFYASYALLALFPRL 521

Query: 564 QLSIQRDFAAAVMMHDPSKMKLLDDGQWVSRKVLGAVPHDIGICDPWFEVNAYCLYDTAR 623
           +L +QRDFAAA + HDP KM  L  G    R V G+VPHD+G  DPW  +N+Y LYDT++
Sbjct: 522 ELQLQRDFAAATLSHDPEKMFYLAYGNTGIRAVFGSVPHDLGTHDPWNTLNSYNLYDTSK 581

Query: 624 WKDLNPKFVLQVYRDVVATGDKKFAKAVWPSVYVAMAYMDQFDRDGDGMIENDGFPDQTY 683
           WKDLN KFV+QV RD  ATGD  FA+AVWP+ Y AMA+ DQFDRDGDGMIENDG PDQTY
Sbjct: 582 WKDLNCKFVVQVNRDAAATGDLDFARAVWPAAYAAMAFTDQFDRDGDGMIENDGVPDQTY 641

Query: 684 DTWSVSGISAYSGGLWVAALQAASALAREVGDRGSEDYFLFKFQKAKVVYEK-LWNGSYF 742
           D W+V G+S+Y GGLWVAALQAASALA  VGD+ S   F  KF +AK VYEK LWNGSYF
Sbjct: 642 DLWTVDGVSSYCGGLWVAALQAASALADLVGDKPSSRMFHDKFLRAKAVYEKSLWNGSYF 701

Query: 743 NYDNSGSSQSSSIQADQLAGQWYARACGLLPIVDEDKARSALKKVYNYNVLKVMGGKRGA 802
           +YD +    S+S+Q+DQLAGQWY+ A GL  IV ED+A SAL  V+  NV++  GG  GA
Sbjct: 702 HYDKN----STSVQSDQLAGQWYSLASGLPGIVREDQATSALGTVFATNVMRYKGGNEGA 757

Query: 803 VNGMLPDGRVDMSSMQSREIWSGVTYAVAASMIHEDLADIGFQTACGIYEAAWSGTGLGY 862
           VNGMLP G  D +S+QSRE+W G TYAVAA+MI + + + GF+TA G++   WS  G GY
Sbjct: 758 VNGMLPSGGEDRTSLQSREVWVGTTYAVAAAMIQQGMREEGFRTAKGVFLNGWSDQGHGY 817

Query: 863 AFQTPEAWNTDDQYRSLCYMRPLAIWAMQWALTRPKPKTLEKQMKPEVTDESLLRYHA-- 920
           AFQTPEAW+ D +YRSL YMRPL+IWAMQWAL  P+ +  ++    EV D +     A  
Sbjct: 818 AFQTPEAWDNDGKYRSLAYMRPLSIWAMQWALDPPE-QLKQRSCNVEVGDITAQEVEACN 876

Query: 921 -GFSKVARLLKL---PEEQGAKSLLQSLFDHTCRRMF 953
             F+K+A  LK    PE +   S+   +FD+   R++
Sbjct: 877 KSFAKLASQLKELSPPETKAYSSVWIWMFDYIRSRVY 913


>gi|218199525|gb|EEC81952.1| hypothetical protein OsI_25831 [Oryza sativa Indica Group]
          Length = 850

 Score =  822 bits (2124), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 432/876 (49%), Positives = 561/876 (64%), Gaps = 100/876 (11%)

Query: 27  DPAKPASLTWQRKLSTGEIPLSQFTLNWKETVQLAPIGVRILCLIREEAAKGKRAFIDPF 86
           D   P    W+RKLS+    L +F + ++E +++  +G+R+   I+EEA+ G++A IDPF
Sbjct: 23  DDGSPPEYAWRRKLSSHANRLKEFNVTFREAIKMMKLGLRLWSYIQEEASHGRKAPIDPF 82

Query: 87  IKR--HLTSSHGVPLGGVGSGSIGRSYRGEFQRWQIFPRVCEDKPVLANQFSVFVSRSNG 144
            +     ++S GVPLGG+G                                     +SN 
Sbjct: 83  TRESDKPSASQGVPLGGMG-------------------------------------KSND 105

Query: 145 QKYSSVLCPKTPEVLKDTTAAGIGSWDWNLKGDKSTYHALYPRAWTVHEGEPDPELRIVC 204
              SS                    WDW L+GD+STYHAL+PRAWTV++GEPDPEL++ C
Sbjct: 106 DGISS--------------------WDWKLRGDRSTYHALFPRAWTVYDGEPDPELKVSC 145

Query: 205 RQISPIIPHNYKESSYPVSVFTYTIYNSGKTSADITLLFTWTNSVGGDSEFTGQHYNSKT 264
           RQISP IPHNY+ESS P SVF YTI N+GK  A ++L+ TW NS+GG S  +G H N   
Sbjct: 146 RQISPFIPHNYQESSLPTSVFVYTIVNTGKERAKVSLVMTWANSIGGLSHHSGGHVNEP- 204

Query: 265 KMNDGVHAVLLHHRTSHQLPPVTFALAAQETDGVHVSLCPHFVISGNSLGLTAKDMWHEI 324
                         T++  PPVTFA+AA E   V+V++ P F +SG S  +TA++MW  +
Sbjct: 205 ------------FMTANNNPPVTFAVAACENQNVNVTVLPAFGLSGES-SVTAREMWGTL 251

Query: 325 KEHGSFDRLN-SMETSVTSEPGSSIGAAIAASVTVPPDSEGQVTFSLAWDCPEVNFMSGK 383
            + G FDR N S   S+ S  G ++ AA+ AS  V P     V FSLAW  P+V F  G 
Sbjct: 252 SQDGCFDRDNFSAGPSMPSSLGDTVCAAVCASTWVEPHGRCTVVFSLAWSSPKVKFKKGN 311

Query: 384 TYYRRYTKFYGTHQNAAANIARDAILEHGSWELQIEAWQRPILEDKRLPEWYPITLFNEL 443
            YYRRYTKFYGT   +A N+ +D+++++  WE +I+ WQRPIL D+R PEWY +TLFNEL
Sbjct: 312 AYYRRYTKFYGTSPRSAINLVQDSLMKYKHWEEEIDKWQRPILHDERFPEWYKVTLFNEL 371

Query: 444 YYLNAGGAVWTDGSPPVHSLVTIGHRKFSLDWSQSDLKRIVDVPNQNDTAVNILERMSSI 503
           Y+L +GG VW D +  +     + + + S D               ND  ++   R S++
Sbjct: 372 YFLVSGGTVWIDSASLMADADEMMNSRLSED---------------NDLPLHHSSRNSAV 416

Query: 504 LEQIYTPVALNSSFGTNLLQDGEENIGQFLYLEGIEYLMWNTYDVHFYSSFALIMLFPKI 563
               ++P  +          DG EN+G+FLYLEGIEY MW TYDVHFY+SFAL+ LFPKI
Sbjct: 417 PLIGFSPHII----------DGRENVGKFLYLEGIEYFMWCTYDVHFYASFALLELFPKI 466

Query: 564 QLSIQRDFAAAVMMHDPSKMKLLDDGQWVSRKVLGAVPHDIGICDPWFEVNAYCLYDTAR 623
           +LSIQRDFA AV+  D S+++ L DG W +RKV+GAV HD+G  DPW E+NAY ++DT+R
Sbjct: 467 ELSIQRDFATAVLREDKSRVRFLADGTWGTRKVIGAVAHDLGAHDPWHELNAYNIHDTSR 526

Query: 624 WKDLNPKFVLQVYRDVVATGDKKFAKAVWPSVYVAMAYMDQFDRDGDGMIENDGFPDQTY 683
           WKDLNPKFVLQVYRD VATGD  F K VWP+V  AM YM+QFD D DGMIENDGFPDQTY
Sbjct: 527 WKDLNPKFVLQVYRDFVATGDMSFGKDVWPAVCTAMEYMEQFDHDDDGMIENDGFPDQTY 586

Query: 684 DTWSVSGISAYSGGLWVAALQAASALAREVGDRGSEDYFLFKFQKAKVVYE-KLWNGSYF 742
           D W+V G+SAY G LW+AALQAA+ALAR +G     +  + KF KAK V+E KLWNGSYF
Sbjct: 587 DAWTVRGVSAYCGCLWLAALQAAAALARTLGHHDYAERCMLKFAKAKPVFEAKLWNGSYF 646

Query: 743 NYDNSGSSQSSSIQADQLAGQWYARACGLLPIVDEDKARSALKKVYNYNVLKVMGGKRGA 802
           NYD+  S  S SI ADQLAGQWY  + GL P+ DE + +  L+K+++YNV++V GG+ GA
Sbjct: 647 NYDSGTSYNSRSIHADQLAGQWYTASSGLPPLFDEGRIKCTLQKIFDYNVMRVKGGRMGA 706

Query: 803 VNGMLPDGRVDMSSMQSREIWSGVTYAVAASMIHEDLADIGFQTACGIYEAAWSGTGLGY 862
           VNGM P+G+VD + MQSREIW+GVTY++AA+M+   + D  F TA GI+ A WS  G GY
Sbjct: 707 VNGMHPNGKVDETCMQSREIWTGVTYSLAATMLLHGMDDQAFTTAEGIFTAGWSEEGYGY 766

Query: 863 AFQTPEAWNTDDQYRSLCYMRPLAIWAMQWALTRPK 898
            FQTPEAW  D  YRSL YMRPLAIWAMQ+AL+ P+
Sbjct: 767 WFQTPEAWTIDGHYRSLIYMRPLAIWAMQYALSPPR 802


>gi|222636955|gb|EEE67087.1| hypothetical protein OsJ_24071 [Oryza sativa Japonica Group]
          Length = 838

 Score =  803 bits (2075), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 428/876 (48%), Positives = 550/876 (62%), Gaps = 112/876 (12%)

Query: 27  DPAKPASLTWQRKLSTGEIPLSQFTLNWKETVQLAPIGVRILCLIREEAAKGKRAFIDPF 86
           D   P    W+RKLS+    L +F + ++E +++  +G+R+   IREEA+ G++A IDPF
Sbjct: 23  DDGSPPEYAWRRKLSSHANRLKEFNVTFREAIKMMKLGLRLWSYIREEASHGRKAPIDPF 82

Query: 87  IKR--HLTSSHGVPLGGVGSGSIGRSYRGEFQRWQIFPRVCEDKPVLANQFSVFVSRSNG 144
            +     ++S GVPLGG+G                                     +SN 
Sbjct: 83  TRESDKPSASQGVPLGGMG-------------------------------------KSND 105

Query: 145 QKYSSVLCPKTPEVLKDTTAAGIGSWDWNLKGDKSTYHALYPRAWTVHEGEPDPELRIVC 204
              SS                    WDW L+GD+STYHAL+PRAWTV++GEPDPEL++ C
Sbjct: 106 DGISS--------------------WDWKLRGDRSTYHALFPRAWTVYDGEPDPELKVSC 145

Query: 205 RQISPIIPHNYKESSYPVSVFTYTIYNSGKTSADITLLFTWTNSVGGDSEFTGQHYNSKT 264
           RQISP IPHNY+ESS P SVF YTI N+GK  A ++L+ TW NS+GG S  +G H N   
Sbjct: 146 RQISPFIPHNYQESSLPTSVFVYTIVNTGKERAKVSLVMTWANSIGGLSHHSGGHVNEP- 204

Query: 265 KMNDGVHAVLLHHRTSHQLPPVTFALAAQETDGVHVSLCPHFVISGNSLGLTAKDMWHEI 324
                         T++  PPVTFA+AA E   V+V++ P F +SG S  +TA++MW  +
Sbjct: 205 ------------FMTANNNPPVTFAVAACENQNVNVTVLPAFGLSGES-SVTAREMWGTL 251

Query: 325 KEHGSFDRLN-SMETSVTSEPGSSIGAAIAASVTVPPDSEGQVTFSLAWDCPEVNFMSGK 383
            + G FDR N S   S+ S  G ++ AA+ AS  V P     V FSLAW  P+V F  G 
Sbjct: 252 SQDGCFDRDNFSAGPSMPSSLGDTVCAAVCASTWVEPHGRCTVVFSLAWSSPKVKFKKGN 311

Query: 384 TYYRRYTKFYGTHQNAAANIARDAILEHGSWELQIEAWQRPILEDKRLPEWYPITLFNEL 443
            YYRRYTKFYGT  + +            +WE +I+ WQRPIL D+R PEWY +TLFNEL
Sbjct: 312 AYYRRYTKFYGTSPSIS------------NWEEEIDKWQRPILHDERFPEWYKVTLFNEL 359

Query: 444 YYLNAGGAVWTDGSPPVHSLVTIGHRKFSLDWSQSDLKRIVDVPNQNDTAVNILERMSSI 503
           Y+L AGG VW D +  +     + + + S D               ND  ++   R S++
Sbjct: 360 YFLVAGGTVWIDSASLMADADEMMNSRLSED---------------NDLPLHHSSRNSAV 404

Query: 504 LEQIYTPVALNSSFGTNLLQDGEENIGQFLYLEGIEYLMWNTYDVHFYSSFALIMLFPKI 563
               ++P  +          D  EN+G+FLYLEGIEY MW TYDVHFY+SFAL+ LFPKI
Sbjct: 405 PLIGFSPHII----------DDRENVGKFLYLEGIEYFMWCTYDVHFYASFALLELFPKI 454

Query: 564 QLSIQRDFAAAVMMHDPSKMKLLDDGQWVSRKVLGAVPHDIGICDPWFEVNAYCLYDTAR 623
           +LSIQRDFA AV+  D S+++ L DG W +RKV+GAV HD+G  DPW E+NAY ++DT+R
Sbjct: 455 ELSIQRDFATAVLREDKSRVRFLADGTWGTRKVIGAVAHDLGAHDPWHELNAYNIHDTSR 514

Query: 624 WKDLNPKFVLQVYRDVVATGDKKFAKAVWPSVYVAMAYMDQFDRDGDGMIENDGFPDQTY 683
           WKDLNPKFVLQVYRD  ATGD  F K VWP+V  AM YM+QFD D DGMIENDGFPDQTY
Sbjct: 515 WKDLNPKFVLQVYRDFAATGDMSFGKDVWPAVCTAMEYMEQFDHDDDGMIENDGFPDQTY 574

Query: 684 DTWSVSGISAYSGGLWVAALQAASALAREVGDRGSEDYFLFKFQKAKVVYE-KLWNGSYF 742
           D W+V G+SAY G LW+AALQ A+ALAR +G     +  + KF KAK V+E KLWNGSYF
Sbjct: 575 DAWTVRGVSAYCGCLWLAALQTAAALARTLGHHDYAERCMLKFAKAKPVFEAKLWNGSYF 634

Query: 743 NYDNSGSSQSSSIQADQLAGQWYARACGLLPIVDEDKARSALKKVYNYNVLKVMGGKRGA 802
           NYD+  S  S SI ADQLAGQWY  + GL P+ DE + +  L+K+++YNV++V GG+ GA
Sbjct: 635 NYDSGTSYNSRSIHADQLAGQWYTASSGLPPLFDEGRIKCTLQKIFDYNVMRVKGGRMGA 694

Query: 803 VNGMLPDGRVDMSSMQSREIWSGVTYAVAASMIHEDLADIGFQTACGIYEAAWSGTGLGY 862
           VNGM P+G+VD + MQSREIW+GVTY++AA+M+   + D  F TA GI+ A WS  G GY
Sbjct: 695 VNGMHPNGKVDETCMQSREIWTGVTYSLAATMLLHGMDDQAFTTAEGIFTAGWSEEGYGY 754

Query: 863 AFQTPEAWNTDDQYRSLCYMRPLAIWAMQWALTRPK 898
            FQTPEAW  D  YRSL YMRPLAIWAMQ+AL+ P+
Sbjct: 755 WFQTPEAWTIDGHYRSLIYMRPLAIWAMQYALSPPR 790


>gi|4539009|emb|CAB39630.1| putative protein [Arabidopsis thaliana]
 gi|7267702|emb|CAB78129.1| putative protein [Arabidopsis thaliana]
          Length = 750

 Score =  748 bits (1932), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 403/753 (53%), Positives = 504/753 (66%), Gaps = 96/753 (12%)

Query: 221 PVSVFTYTIYNSGKTSADITLLFTWTNSVGGDSEFTGQHYNSKT-KMNDGVHAV------ 273
           P+    +++ N GK  A +TLLFTW NSVGG S  TG+H+NS    +  G+  V      
Sbjct: 71  PILTNQFSMSNLGKEEATVTLLFTWENSVGGASGLTGEHFNSTIIHLKRGLVPVSGRFIA 130

Query: 274 --------LLHHRTSHQLPPVTFALAAQETDGVHVSLCPHFVISGNS-LGLTAKDMWHEI 324
                   +   RT +  PPVT+A+AAQET+ VHVS CP F++SG+S   +TAK+MW EI
Sbjct: 131 SIYCNCDEMKTCRTGNGHPPVTYAIAAQETEDVHVSECPCFLVSGHSPKEITAKEMWDEI 190

Query: 325 KEHGSFDRLNSMETSVTSEPGSSIGAAIAASVTVPPDSEGQVTFSLAWDCPEVNFMSGKT 384
           K++ SFD LNS E    S PG+SIGAAIAA V VPP  +  VTFSL+WDCPEV F + KT
Sbjct: 191 KKNKSFDELNS-EPGSPSRPGTSIGAAIAAKVKVPPGCDRTVTFSLSWDCPEVRF-NEKT 248

Query: 385 YYRRYTKFYGTHQNAAANIARDAILEHGSWELQIEAWQRPILEDKRLPEWYPITLFNELY 444
           Y+R                                               Y +TLFNELY
Sbjct: 249 YHR-----------------------------------------------YRVTLFNELY 261

Query: 445 YLNAGGAVWTDGSPPVHSLVTIGHRKFSLDWSQSDLKRIVDVPNQNDTAVNILERMSSIL 504
           Y N+GG +WTDG PP  S+                 +  V    QND  +++ ++++++ 
Sbjct: 262 YFNSGGTIWTDGLPPKESIE----------------RSKVTNTEQNDIVIDLFQKINAVC 305

Query: 505 EQIYTPVALNSSFGTNLLQDGEENIGQFLYLEGIEYLMWNTYDVHFYSSFALIMLFPKIQ 564
           EQIY+P + NS          EENIGQF+YLEGIEYLM+NTYDVHFYSSFAL+ LFPK+ 
Sbjct: 306 EQIYSPQSSNS----------EENIGQFIYLEGIEYLMYNTYDVHFYSSFALLSLFPKLA 355

Query: 565 LSIQRDFAAAVMMHDPSKMKLLDDGQWVSRKVLGAVPHDIGICDPWFEVNAYCLYDTARW 624
           LSIQRDFAA V++ DP+K K++  G+WV+RK+LG+VPHDIG+ DPW E+N Y  ++T RW
Sbjct: 356 LSIQRDFAATVLIQDPTKKKIMSSGEWVTRKLLGSVPHDIGLNDPWLELNEYNFFNTDRW 415

Query: 625 KDLNPKFVLQVYRDVVATGDKKFAKAVWPSVYVAMAYMDQFDRDGDGMIENDGFPDQTYD 684
           KDLN KFVLQVYRDVVAT D+ FAKAVWPSVY A+AY+DQFD+D DGMIEN+GFPDQTYD
Sbjct: 416 KDLNAKFVLQVYRDVVATNDQSFAKAVWPSVYTAVAYLDQFDKDEDGMIENEGFPDQTYD 475

Query: 685 TWSVSGISAYSGGLWVAALQAASALAREVGDRGSEDYFLFKFQKAKVVYEKLWNGSYFNY 744
            WSV+G+SAY GGLWVAALQAASA A  VG+     YF  K++KAK+VYEKLWNGSYFNY
Sbjct: 476 AWSVTGVSAYCGGLWVAALQAASAFASIVGENAVAIYFNAKYEKAKIVYEKLWNGSYFNY 535

Query: 745 DNSGSSQSSSIQADQLAGQWYARACGLLPIVDEDKARSALKKVYNYNVLKVMGGKRGAVN 804
           D+SGS  SSSI ADQLAGQWYARACGL PI  E+  + AL+ +Y +NV+KV GG RGAVN
Sbjct: 536 DDSGSGSSSSILADQLAGQWYARACGLKPITKEEWIKKALETIYEFNVMKVKGGTRGAVN 595

Query: 805 GMLPDGRVDMSSMQSREIWSGVTYAVAASMIHEDLADIGFQTACGIYEAAWSGTGLGYAF 864
           GM  +G+VD +S+ S+E+W+G TY+VAA MI E   + GFQTA GIYEA WS  GL  +F
Sbjct: 596 GMSTEGKVDTNSLVSKEVWAGTTYSVAACMIQEGQREKGFQTASGIYEAIWSDRGLSCSF 655

Query: 865 QTPEAWNTDDQYRSLCYMRPLAIWAMQWALTRPKPKTLEKQ--MKPEVTDES--LLRYHA 920
           QTPEAWN +D+YRSLCYMRPLAIWA+QWALTR +    EKQ  +  +  +ES  LLR H 
Sbjct: 656 QTPEAWNMNDEYRSLCYMRPLAIWAIQWALTRTQSFGEEKQKLVAGDEEEESNLLLRQHK 715

Query: 921 GFSKVARLLKL-PEEQGAKSLLQSLFDHTCRRM 952
           GF  VAR +K+ P     +S LQ  ++   + +
Sbjct: 716 GFKDVARFVKIVPTSNVHRSRLQHTYETVLKTL 748



 Score = 79.3 bits (194), Expect = 1e-11,   Method: Compositional matrix adjust.
 Identities = 34/66 (51%), Positives = 47/66 (71%), Gaps = 12/66 (18%)

Query: 84  DPFIKRHLTSSHGVPLGGVGS------------GSIGRSYRGEFQRWQIFPRVCEDKPVL 131
           D F KRH+   HGVPLGG+G+            GSIGRSY+GEFQ++++FP++CE+ P+L
Sbjct: 14  DIFKKRHVRGDHGVPLGGIGNVVNFTQTRIFSGGSIGRSYKGEFQQFKLFPKICEEAPIL 73

Query: 132 ANQFSV 137
            NQFS+
Sbjct: 74  TNQFSM 79


>gi|326507888|dbj|BAJ86687.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 676

 Score =  702 bits (1813), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 364/618 (58%), Positives = 451/618 (72%), Gaps = 16/618 (2%)

Query: 265 KMNDGVHAVLLHHRTSHQLPPVTFALAAQETDGVHVSLCPHFVISGNSLG---LTAKDMW 321
           +  DGVH VLLHHRT+   PPVTFA+A+QETD V V+ CP F +  ++ G     AKDMW
Sbjct: 10  RARDGVHGVLLHHRTADGHPPVTFAIASQETDDVRVTCCPSFAMGPSAPGGEQFMAKDMW 69

Query: 322 HEIKEHGSFDRLNSMETSVTSEPGSSIGAAIAASVTVPPDSEGQVTFSLAWDCPEVNFMS 381
            E K  GS         S +S PGSSIGAA+AA+ TVP      V+F+L+W CPEV F +
Sbjct: 70  EEAKNRGSVGVAPG--ASASSRPGSSIGAAVAAATTVPAGGTRAVSFALSWSCPEVKFPA 127

Query: 382 GKTYYRRYTKFYGTHQNAAA-NIARDAILEHGSWELQIEAWQRPILEDKRLPEWYPITLF 440
           G+TY+RRYTKF G  ++AAA  +A DA+LEH  WE  IE WQRP+L DKRLP WYP+ LF
Sbjct: 128 GRTYHRRYTKFLGLDRDAAAERLAHDALLEHMKWESLIEEWQRPVLHDKRLPGWYPVALF 187

Query: 441 NELYYLNAGGAVWTDGSPPVHSLVTIGHRKFSLDWSQSDLKRIVDVPNQNDTAVNILERM 500
           NELYYLNAGG +WTDG PP  +  +    K+     +S         + +     IL  M
Sbjct: 188 NELYYLNAGGTIWTDGLPPKKT--SFASSKYGSTTMESFSLDGFHFHSGDHAPDGILRAM 245

Query: 501 SSILEQIYTPVALNSSFGTNLLQDGEENIGQFLYLEGIEYLMWNTYDVHFYSSFALIMLF 560
           ++  E+       +S+FGT LL DGEE++G+FLYLEG+EY +WNTYDVHFY+SFAL+ LF
Sbjct: 246 ATAEERTEA----SSAFGTALLGDGEESVGRFLYLEGMEYHLWNTYDVHFYASFALLSLF 301

Query: 561 PKIQLSIQRDFAAAVMMHDPSKMKLLDDGQWVSRKVLGAVPHDIGICDPWFEVNAYCLYD 620
           P+I+L +QRDFA AV+ HDP  M+ LD G  V RKVLGAVPHD+G+ DPWF++NAY ++D
Sbjct: 302 PEIELGLQRDFARAVLHHDPRPMRTLD-GATVPRKVLGAVPHDVGLGDPWFQLNAYMIHD 360

Query: 621 TARWKDLNPKFVLQVYRDVVATGDKKFAKAVWPSVYVAMAYMDQFDRDGDGMIENDGFPD 680
            ARWKDLN KFVLQVYRD  ATGD  FA AVWP+VY+AMAYMDQFDRDGDGM+EN+G PD
Sbjct: 361 PARWKDLNTKFVLQVYRDGAATGDAAFATAVWPAVYLAMAYMDQFDRDGDGMVENEGRPD 420

Query: 681 QTYDTWSVSGISAYSGGLWVAALQAASALAREVGDRGSEDYFLFKFQKAKVVYE-KLWNG 739
           QTYD WSVSG+SAY+GGLWVAALQAA+ +AR VGDRGSE YF+ +++KA+ VY+ +LWNG
Sbjct: 421 QTYDFWSVSGVSAYTGGLWVAALQAAAVMARVVGDRGSEGYFVERYEKARRVYDGELWNG 480

Query: 740 SYFNYDNSGSSQSSSIQADQLAGQWYARACGLLPIVDEDKARSALKKVYNYNVLKVMGGK 799
           +YF+YDNSG + S SI ADQLAGQWYARACGL PIV+E+KARSAL  V +YNV++V GG 
Sbjct: 481 AYFDYDNSGGTNSKSIMADQLAGQWYARACGLEPIVEEEKARSALGTVLDYNVMRVQGGA 540

Query: 800 RGAVNGMLPDGRVDMSSMQSREIWSGVTYAVAASMIHEDLADIGFQTACGIYEAAWSGTG 859
            GAVNGM PDG VD SS+QSRE+W GVTY VAA+M HE + +  F+TA G ++A W   G
Sbjct: 541 IGAVNGMRPDGTVDTSSIQSREVWPGVTYGVAAAMAHEGMPEAAFRTAKGAHDAGWGRDG 600

Query: 860 LGYAFQTPEAWNTDDQYR 877
            G   + P+A   D + R
Sbjct: 601 FG--VRVPDAGGVDGRRR 616


>gi|147812668|emb|CAN61857.1| hypothetical protein VITISV_016690 [Vitis vinifera]
          Length = 521

 Score =  692 bits (1785), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 345/531 (64%), Positives = 402/531 (75%), Gaps = 37/531 (6%)

Query: 435 YPITLFNELYYLNAGGAVWTDGSPPVHSLVTIGHRKFSLDWSQSDLKRIVDVPNQNDTAV 494
           Y ITLFNELY+LNAGG +WTDG PP+ SL TI   KFSLD S SD K   D+ +QND+ V
Sbjct: 11  YRITLFNELYFLNAGGTIWTDGLPPMQSLATIEQIKFSLDRSISDPKNTTDIVHQNDSTV 70

Query: 495 NILERMSSILEQIYTPVALNSSFGTNLLQDGEENIGQFLYLEGIEYLMWNTYDVHFYSSF 554
            IL RM+S+LEQI+ P   NS+FGT LLQ GEEN+GQFLYLEGIEY MWNTYDVHFYSSF
Sbjct: 71  EILGRMTSMLEQIHNPTTSNSAFGTYLLQSGEENVGQFLYLEGIEYHMWNTYDVHFYSSF 130

Query: 555 ALIMLFPKIQLSIQRDFAAAVMMHDPSKMKLLDDGQWVSRKVLGAVPHDIGICDPWFEVN 614
           A+IMLFP+++LSIQRDFAAAVM+HDPS+MK++ DG+WV RKVLGAVPHDIGI DPWFE+N
Sbjct: 131 AIIMLFPQLELSIQRDFAAAVMVHDPSRMKIMSDGKWVPRKVLGAVPHDIGISDPWFELN 190

Query: 615 AYCLYDTARWKDLNPKFVLQVYRDVVATGDKKFAKAVWPSVYVAMAYMDQFDRDGDGMIE 674
           AY LYDT RWKDLN KFVLQVYRD+VATGDK FA+AVWP+VY+A+A++DQFD+DGDGMIE
Sbjct: 191 AYNLYDTDRWKDLNSKFVLQVYRDMVATGDKNFARAVWPAVYIAIAFLDQFDKDGDGMIE 250

Query: 675 NDGFPDQTYDTWSVSGISAYSGGLWVAALQAASALAREVGDRGSEDYFLFKFQKAKVVYE 734
           NDG P                       LQAASA+AREVGD  + DYF FKFQKAK VY+
Sbjct: 251 NDGLP-----------------------LQAASAMAREVGDSMTADYFWFKFQKAKAVYD 287

Query: 735 K-----------LWNGSYFNYDNSGSSQSSSIQADQLAGQWYARACGLLPIVDEDKARSA 783
           K           L     F  + S +      Q  +     YARACGL PIVD++KARSA
Sbjct: 288 KDQLAAVIGEGRLLXLFRFLGNPSHTGIDGLFQDHRDWTTRYARACGLQPIVDDEKARSA 347

Query: 784 LKKVYNYNVLKVMGGKRGAVNGMLPDGRVDMSSMQSREIWSGVTYAVAASMIHEDLADIG 843
           L+KVYN+NVLKV  GK GAVNGMLPDGRVDMS+MQSREIW+GVTY+VAA+MIHE + +  
Sbjct: 348 LEKVYNFNVLKVKEGKCGAVNGMLPDGRVDMSAMQSREIWAGVTYSVAANMIHEGMVETA 407

Query: 844 FQTACGIYEAAWSGTGLGYAFQTPEAWNTDDQYRSLCYMRPLAIWAMQWALTRPKPKTLE 903
           F TA GIY+AAWS  GLGY+FQTPEAWNTD++YRSLCYMRPLAIWAMQWAL+  KP+   
Sbjct: 408 FNTASGIYDAAWSQEGLGYSFQTPEAWNTDEEYRSLCYMRPLAIWAMQWALS--KPELHN 465

Query: 904 KQMKPEVTDESL-LRYHAGFSKVARLLKLPEEQGAKSLLQSLFDHTCRRMF 953
             MK E    +L   +H GF KVA LLKLPEE+ +KS LQ  FD TCRR++
Sbjct: 466 HDMKHEEGKGTLNFEHHVGFEKVAHLLKLPEEEASKSFLQLFFDLTCRRLY 516


>gi|449482327|ref|XP_004156248.1| PREDICTED: non-lysosomal glucosylceramidase-like, partial [Cucumis
           sativus]
          Length = 508

 Score =  652 bits (1681), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 325/505 (64%), Positives = 393/505 (77%), Gaps = 9/505 (1%)

Query: 454 TDGSPPVHSLVTIGHRKFSLDWSQSDLKRIVDVPN---QNDTAVNILERMSSILEQIYTP 510
           +DG PP+ +L TI H+K+ L+ S+S+L      PN   + D AV+ILERMS IL+Q +  
Sbjct: 4   SDGLPPLQNLSTISHKKYFLERSKSELNG--GAPNGDHRKDVAVDILERMSQILDQTHGG 61

Query: 511 VA-LNSSFGTNLLQDGEENIGQFLYLEGIEYLMWNTYDVHFYSSFALIMLFPKIQLSIQR 569
               N++ GT LL  GEEN+G  L +EG +YLMWNTYDVHFYSSFALIMLFPK++LSIQR
Sbjct: 62  AGPSNAALGTRLLHPGEENVGNLLLVEGSQYLMWNTYDVHFYSSFALIMLFPKLELSIQR 121

Query: 570 DFAAAVMMHDPSKMKLLDDGQWVSRKVLGAVPHDIGICDPWFEVNAYCLYDTARWKDLNP 629
           DFAAAV+MHDP K K + DG WV RKVLGAVPHDIG  DPW EVNAY L + +RWKDL  
Sbjct: 122 DFAAAVLMHDPRKAKTMCDGNWVPRKVLGAVPHDIGFNDPWLEVNAYNLLNVSRWKDLGS 181

Query: 630 KFVLQVYRDVVATGDKKFAKAVWPSVYVAMAYMDQFDRDGDGMIENDGFPDQTYDTWSVS 689
           KFVLQVYRDVVATGDK FAK+VWPSVYVA+A+M+QFD+D DGMIEN+GFPDQTYDTW+V 
Sbjct: 182 KFVLQVYRDVVATGDKNFAKSVWPSVYVALAFMEQFDKDKDGMIENEGFPDQTYDTWTVK 241

Query: 690 GISAYSGGLWVAALQAASALAREVGDRGSEDYFLFKFQKAKVVYEKLWNGSYFNYDNSGS 749
           G+SAY GGLWVAALQAASALA EV D  +  YF  K+QKA+ VYE LWNGSYFNYDNS  
Sbjct: 242 GVSAYCGGLWVAALQAASALASEVDDEAAAHYFWIKYQKARSVYETLWNGSYFNYDNSKG 301

Query: 750 SQSSSIQADQLAGQWYARACGLLPIVDEDKARSALKKVYNYNVLKVMGGKRGAVNGMLPD 809
             SSSIQADQLAGQWYARACGL PI DE+K R AL+K+YN+NV+KV GG RGAVNGM PD
Sbjct: 302 PWSSSIQADQLAGQWYARACGLCPIADEEKIRVALEKIYNFNVMKVKGGTRGAVNGMFPD 361

Query: 810 GRVDMSSMQSREIWSGVTYAVAASMIHEDLADIGFQTACGIYEAAWSGTGLGYAFQTPEA 869
           G VD S +Q +EIW+GVTY+VAA+MI E + + GFQTA GIY+AAW+  GLGY+FQTPEA
Sbjct: 362 GSVDKSILQPKEIWAGVTYSVAATMIQEGMVETGFQTAMGIYQAAWAQDGLGYSFQTPEA 421

Query: 870 WNTDDQYRSLCYMRPLAIWAMQWA-LTRPKPKTLEKQMKPEVTDESLLRYHAGFSKVARL 928
           W+ DD++RS+ YMRPLAIWAMQWA ++  KP  +  +   E+ + +    HA F KVA L
Sbjct: 422 WDVDDRFRSIGYMRPLAIWAMQWAMMSDSKPTKVPTKAFSEMQESAFATQHAAFLKVASL 481

Query: 929 LKLP--EEQGAKSLLQSLFDHTCRR 951
           LKLP  ++   +SL+++ +D  C+R
Sbjct: 482 LKLPSNDDTARRSLVEAAYDFICKR 506


>gi|226499590|ref|NP_001146319.1| uncharacterized protein LOC100279895 [Zea mays]
 gi|219886613|gb|ACL53681.1| unknown [Zea mays]
          Length = 649

 Score =  643 bits (1659), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 338/602 (56%), Positives = 420/602 (69%), Gaps = 24/602 (3%)

Query: 320 MWHEIKEHGSFDRLN-SMETSVTSEPGSSIGAAIAASVTVPPDSEGQVTFSLAWDCPEVN 378
           MW+ + ++G FDR N S   S+ S PG  + AA++AS  V P     V F+LAW  P+V 
Sbjct: 1   MWNTMLQNGHFDRENFSAGPSMPSSPGQKLCAAVSASTWVEPHGRCTVVFALAWSSPKVK 60

Query: 379 FMSGKTYYRRYTKFYGTHQNAAANIARDAILEHGSWELQIEAWQRPILEDKRLPEWYPIT 438
           F  G TY RRYT+FYGT + +A N+  DA+ ++  WE +IE WQ PIL+D+RLPEWY  T
Sbjct: 61  FQKGCTYNRRYTQFYGTSEKSAVNLVHDALTKYKLWEEEIEKWQNPILKDERLPEWYKFT 120

Query: 439 LFNELYYLNAGGAVWTDGSPPV---HSLVTIGHRKFSL----DWSQSDLK------RIVD 485
           LFNELY+L AGG VWTDG PP           H+K S     D +Q  +K       +  
Sbjct: 121 LFNELYFLVAGGTVWTDGQPPAIDEKKSPGFNHQKSSKRGTKDTNQGSVKDRHVNLAVEQ 180

Query: 486 VP------NQNDTAVNILERM--SSILEQIYTPVALNSSFGTNLLQDGEENIGQFLYLEG 537
           VP      N +D +V+    +  S + EQI   +         + +DG E++G+FLYLEG
Sbjct: 181 VPHGGYMANGDDHSVSKFAAVHGSEMQEQI-NGLKSEEPIPYLISKDGPEHVGKFLYLEG 239

Query: 538 IEYLMWNTYDVHFYSSFALIMLFPKIQLSIQRDFAAAVMMHDPSKMKLLDDGQWVSRKVL 597
           +EY+MWNTYDVHFY+SFAL+ LFPKI+LSIQRDFA AV+  D  K+K L DG    RKV 
Sbjct: 240 VEYIMWNTYDVHFYASFALLDLFPKIELSIQRDFANAVLYEDRRKVKFLADGTSGIRKVK 299

Query: 598 GAVPHDIGICDPWFEVNAYCLYDTARWKDLNPKFVLQVYRDVVATGDKKFAKAVWPSVYV 657
           GAVPHD+G  DPW E+NAY ++DT++WKDLNPKFVLQ+YRD  ATGD +F + VWP+V  
Sbjct: 300 GAVPHDLGTHDPWHEMNAYNIHDTSKWKDLNPKFVLQIYRDFAATGDMQFGRGVWPAVCA 359

Query: 658 AMAYMDQFDRDGDGMIENDGFPDQTYDTWSVSGISAYSGGLWVAALQAASALAREVGDRG 717
           AM YMDQFDRDGDG+IENDGFPDQTYD W+V GISAY G LW+AALQAA+ +A  +GDR 
Sbjct: 360 AMDYMDQFDRDGDGLIENDGFPDQTYDAWTVHGISAYCGCLWLAALQAAATMAHRLGDRH 419

Query: 718 SEDYFLFKFQKAKVVYE-KLWNGSYFNYDNSGSSQSSSIQADQLAGQWYARACGLLPIVD 776
             + +  KF KAK VYE KLWNGSYFNYD+  SS S SIQADQLAGQWYA + GL P+ D
Sbjct: 420 FAEKYKLKFIKAKAVYEAKLWNGSYFNYDSGTSSNSRSIQADQLAGQWYAASSGLPPLFD 479

Query: 777 EDKARSALKKVYNYNVLKVMGGKRGAVNGMLPDGRVDMSSMQSREIWSGVTYAVAASMIH 836
           E K R+AL+K++ +NV+KV GG+ GAVNGM P G+VD + MQSREIW+GVTYAVAA+M+ 
Sbjct: 480 EHKIRTALQKIFEFNVMKVKGGRMGAVNGMTPKGKVDETCMQSREIWTGVTYAVAANMLL 539

Query: 837 EDLADIGFQTACGIYEAAWSGTGLGYAFQTPEAWNTDDQYRSLCYMRPLAIWAMQWALTR 896
             +   GF TA GI+ A WS  G GY FQTPE W TD  YRSL YMRPLAIWA+Q+A++ 
Sbjct: 540 HGMEHQGFTTAEGIFTAGWSEEGYGYWFQTPEGWTTDGHYRSLVYMRPLAIWAIQYAVSP 599

Query: 897 PK 898
           PK
Sbjct: 600 PK 601


>gi|414871104|tpg|DAA49661.1| TPA: hypothetical protein ZEAMMB73_422318 [Zea mays]
          Length = 649

 Score =  642 bits (1657), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 338/602 (56%), Positives = 420/602 (69%), Gaps = 24/602 (3%)

Query: 320 MWHEIKEHGSFDRLN-SMETSVTSEPGSSIGAAIAASVTVPPDSEGQVTFSLAWDCPEVN 378
           MW+ + ++G FDR N S   S+ S PG  + AA++AS  V P     V F+LAW  P+V 
Sbjct: 1   MWNTMLQNGHFDRENFSAGPSMPSSPGQKLCAAVSASTWVEPHGRCTVVFALAWSSPKVK 60

Query: 379 FMSGKTYYRRYTKFYGTHQNAAANIARDAILEHGSWELQIEAWQRPILEDKRLPEWYPIT 438
           F  G TY RRYT+FYGT + +A N+  DA+ ++  WE +IE WQ PIL+D+RLPEWY  T
Sbjct: 61  FQKGCTYNRRYTQFYGTSEKSAVNLVHDALTKYKLWEEEIEKWQNPILKDERLPEWYKFT 120

Query: 439 LFNELYYLNAGGAVWTDGSPPV---HSLVTIGHRKFSL----DWSQSDLK------RIVD 485
           LFNELY+L AGG VWTDG PP           H+K S     D +Q  +K       +  
Sbjct: 121 LFNELYFLVAGGTVWTDGQPPAIDEKKSPGFNHQKSSKRGTKDTNQGSVKDRHVNLAVEQ 180

Query: 486 VP------NQNDTAVNILERM--SSILEQIYTPVALNSSFGTNLLQDGEENIGQFLYLEG 537
           VP      N +D +V+    +  S + EQI   +         + +DG E++G+FLYLEG
Sbjct: 181 VPHGGYMANGDDHSVSKFAAVHGSEMQEQI-NGLKSEEPIPYLISKDGPEHVGKFLYLEG 239

Query: 538 IEYLMWNTYDVHFYSSFALIMLFPKIQLSIQRDFAAAVMMHDPSKMKLLDDGQWVSRKVL 597
           +EY+MWNTYDVHFY+SFAL+ LFPKI+LSIQRDFA AV+  D  K+K L DG    RKV 
Sbjct: 240 VEYIMWNTYDVHFYASFALLDLFPKIELSIQRDFANAVLYEDRRKVKFLADGTSGIRKVK 299

Query: 598 GAVPHDIGICDPWFEVNAYCLYDTARWKDLNPKFVLQVYRDVVATGDKKFAKAVWPSVYV 657
           GAVPHD+G  DPW E+NAY ++DT++WKDLNPKFVLQ+YRD  ATGD +F + VWP+V  
Sbjct: 300 GAVPHDLGTHDPWHEMNAYNIHDTSKWKDLNPKFVLQIYRDFAATGDMQFGRDVWPAVCA 359

Query: 658 AMAYMDQFDRDGDGMIENDGFPDQTYDTWSVSGISAYSGGLWVAALQAASALAREVGDRG 717
           AM YMDQFDRDGDG+IENDGFPDQTYD W+V GISAY G LW+AALQAA+ +A  +GDR 
Sbjct: 360 AMDYMDQFDRDGDGLIENDGFPDQTYDAWTVHGISAYCGCLWLAALQAAATMAHRLGDRH 419

Query: 718 SEDYFLFKFQKAKVVYE-KLWNGSYFNYDNSGSSQSSSIQADQLAGQWYARACGLLPIVD 776
             + +  KF KAK VYE KLWNGSYFNYD+  SS S SIQADQLAGQWYA + GL P+ D
Sbjct: 420 FAEKYKLKFIKAKAVYEAKLWNGSYFNYDSGTSSNSRSIQADQLAGQWYAASSGLPPLFD 479

Query: 777 EDKARSALKKVYNYNVLKVMGGKRGAVNGMLPDGRVDMSSMQSREIWSGVTYAVAASMIH 836
           E K R+AL+K++ +NV+KV GG+ GAVNGM P G+VD + MQSREIW+GVTYAVAA+M+ 
Sbjct: 480 EHKIRTALQKIFEFNVMKVKGGRMGAVNGMTPKGKVDETCMQSREIWTGVTYAVAANMLL 539

Query: 837 EDLADIGFQTACGIYEAAWSGTGLGYAFQTPEAWNTDDQYRSLCYMRPLAIWAMQWALTR 896
             +   GF TA GI+ A WS  G GY FQTPE W TD  YRSL YMRPLAIWA+Q+A++ 
Sbjct: 540 HGMEHQGFTTAEGIFTAGWSEEGYGYWFQTPEGWTTDGHYRSLVYMRPLAIWAIQYAVSP 599

Query: 897 PK 898
           PK
Sbjct: 600 PK 601


>gi|147788372|emb|CAN61187.1| hypothetical protein VITISV_019326 [Vitis vinifera]
          Length = 900

 Score =  629 bits (1621), Expect = e-177,   Method: Compositional matrix adjust.
 Identities = 348/687 (50%), Positives = 444/687 (64%), Gaps = 63/687 (9%)

Query: 274 LLHHRTSHQLPPVTFALAAQETDGVHVSLCPHFVISGNSLGLTAKDMWHEIKEHGSFDRL 333
           +L +RT+ + PPVTFA+AA ET  V V++ P F +S  S  +TAKDMW ++ + G FDR 
Sbjct: 167 VLANRTAKENPPVTFAIAACETQNVSVTVLPSFGLSEGS-HITAKDMWGKMVQDGQFDRE 225

Query: 334 NSME-TSVTSEPGSSIGAAIAASVTVPPDSEGQVTFSLAWDCPEVNFMSGKTYYRRYTKF 392
           N     S+ S PG ++ AA++AS  V P  +  V F+LAW  P+V F+ G +Y+RRYTK+
Sbjct: 226 NCYSGRSMPSSPGETLCAAVSASAWVEPHGKCTVAFALAWSSPKVKFLKGSSYHRRYTKY 285

Query: 393 YGTHQNAAANIARDAILEHGSWELQIEAWQRPILEDKRLPEWYPITLFNELYYLNAGGAV 452
           YGT + AA NI  DA+  +  WE +IE WQ PIL D RLPEWY  TLFNELY+L AGG V
Sbjct: 286 YGTSERAALNIVHDALTNYKQWEEEIEKWQSPILRDDRLPEWYKFTLFNELYFLVAGGTV 345

Query: 453 WTDGSPP-------VHSLVTIGHRKFSLDWSQSDLKRIVDVPNQNDTAVNILER--MSSI 503
           W D S P       +H    + +   ++  ++ + +R   V N      +   R  +   
Sbjct: 346 WIDSSLPATSSKNSLHQSAAVENTNVNVTVAKGNSRRGAAVENSVTDGYDTTSRKGLEYD 405

Query: 504 LEQIYT----------PVALNS--SFGTNLL---QDGEENIGQFLYLEGIEYLMWNTYDV 548
            E+I+T          P   NS  S   + L   QD  +++G+FLYLEG+EY+MW TYDV
Sbjct: 406 EEEIHTRNTCEEKPVIPQESNSHHSIHKDTLKDPQDETDDVGRFLYLEGVEYIMWCTYDV 465

Query: 549 HFYSSFALIMLFPKIQLSIQRDFAAAVMMHDPSKMKLLDDGQWVSRKVLGAVPHDIGICD 608
           HFY+SFAL+ LFPKI+LSIQR+FA AV+  D  ++K L +G W  RKV GAVPHD+G  D
Sbjct: 466 HFYASFALLELFPKIELSIQREFAKAVLSEDGRRVKFLAEGNWGIRKVRGAVPHDLGTHD 525

Query: 609 PWFEVNAYCLYDTARWKDLNPKFVLQVYRDVVATGDKKFAKAVWPSVYVAMAYMDQFDRD 668
           PW E+NAY ++DT++WKDLNPKFVLQVYRD  AT D  F   VWP+V  AM YM+QFDRD
Sbjct: 526 PWHEMNAYNIHDTSQWKDLNPKFVLQVYRDFAATRDFSFGADVWPAVRAAMEYMEQFDRD 585

Query: 669 GDGMIENDGFPDQTYDTWSVSGISAYSGGLWVAALQAASALAREVGDRGSEDYFLFKFQK 728
            DG+IENDGFPDQTYDTW+V GISAY G LW+AALQAA+A+A ++GD+   +    KF K
Sbjct: 586 SDGLIENDGFPDQTYDTWTVHGISAYCGCLWLAALQAAAAMALQLGDKPFAEKCKSKFFK 645

Query: 729 AKVVY-EKLWNGSYFNYDNSGSSQSSSIQADQLAGQWYARACGLLPIVDEDKARSALKKV 787
           AK+V+ EKLWNGSYFNYD+  SS S SIQADQLAGQWY  + GL  + D+ K +S+L K+
Sbjct: 646 AKLVFEEKLWNGSYFNYDSGSSSNSKSIQADQLAGQWYTASSGLPSLFDDCKIKSSLHKI 705

Query: 788 YNYNVLKVMGGKRGAVNGMLPDGRVDMSSMQSREIWSGVTYAVAASMIHEDLADIGFQTA 847
           Y++NV+KV GGK GAVNGM P+G+VD S MQSREIW+GVTY VAA+MI   + +  F TA
Sbjct: 706 YDFNVMKVKGGKMGAVNGMHPNGKVDESCMQSREIWTGVTYGVAATMILSGMEEQAFTTA 765

Query: 848 CGIYEAAWSGTGLG------------------------------------YAFQTPEAWN 871
            GI+ A WS  G G                                    Y FQTPE W 
Sbjct: 766 EGIFTAGWSEEGYGTLDFIDDMHCVVQILNCFVHSGLCVACVILLYCTCRYWFQTPEGWT 825

Query: 872 TDDQYRSLCYMRPLAIWAMQWALTRPK 898
            D  +RSL YMRPLAIW MQWAL+ P+
Sbjct: 826 IDGHFRSLIYMRPLAIWGMQWALSMPR 852


>gi|357440839|ref|XP_003590697.1| Non-lysosomal glucosylceramidase [Medicago truncatula]
 gi|355479745|gb|AES60948.1| Non-lysosomal glucosylceramidase [Medicago truncatula]
          Length = 1103

 Score =  624 bits (1608), Expect = e-175,   Method: Compositional matrix adjust.
 Identities = 353/669 (52%), Positives = 451/669 (67%), Gaps = 57/669 (8%)

Query: 273  VLLHHRTSHQLPPVTFALAAQETDGVHVSLCPHFVISGNSLGLTAKDMWHEIKEHGSFDR 332
            V+ + RT+   PPVTF++AA ET  V VS+ P F +S  S  +TAK MW ++ + G FDR
Sbjct: 401  VVEYFRTAKDNPPVTFSIAACETQNVSVSVLPCFGLSDRS-SVTAKGMWTKMVKDGQFDR 459

Query: 333  LN-SMETSVTSEPGSSIGAAIAASVTVPPDSEGQVTFSLAWDCPEVNFMSGKTYYRRYTK 391
             N S   S+ S PG ++ AA+AAS  V P  +  V FSLAW  P+V F+ G T+ RRYTK
Sbjct: 460  ENFSSGPSMPSSPGETLCAAVAASAWVEPHGKCTVAFSLAWSSPKVKFVKGSTFNRRYTK 519

Query: 392  FYGTHQNAAANIARDAILEHGSWELQIEAWQRPILEDKRLPEWYPITLFNELYYLNAGGA 451
            FYGT + AA ++A DA+  +  WE +I  WQ PIL+D++LPEWY  TLFNELY+L AGG 
Sbjct: 520  FYGTSERAAVHLAHDALTHYTRWEEEIAKWQDPILKDEKLPEWYKFTLFNELYFLVAGGT 579

Query: 452  VWTDGSPPVHSLVTIGHRKFS---LDWSQSDLKRIV------------------------ 484
            +W D +     L++   R  S   L+ S++ + RI                         
Sbjct: 580  IWIDST-----LLSSNKRNNSQDQLEESENAVVRITEAKVDCRKREVVECTTDNSYDSTA 634

Query: 485  -------------DVPNQNDTAVNILERMSSILEQIYTPVALNSSFGTNLLQDGE-ENIG 530
                         D+  +N T VN L + +S      TP   + S   NL  D + ++ G
Sbjct: 635  HRGHNHLDEKHNRDISRENGT-VNTLGKGNSA----NTP---HHSTMKNLQHDDDNDDGG 686

Query: 531  QFLYLEGIEYLMWNTYDVHFYSSFALIMLFPKIQLSIQRDFAAAVMMHDPSKMKLLDDGQ 590
            +FLYLEG+EY+MW TYDVHFY+SFAL+MLFP+I+L+IQRDFA AV+  D  K+K L +G 
Sbjct: 687  RFLYLEGVEYVMWCTYDVHFYASFALLMLFPRIELNIQRDFAQAVLCEDGRKVKFLAEGN 746

Query: 591  WVSRKVLGAVPHDIGICDPWFEVNAYCLYDTARWKDLNPKFVLQVYRDVVATGDKKFAKA 650
            W +RKV GAVPHD+G  DPW E+NAY ++DT++WKDLNPKFVLQVYRD  ATGD +F   
Sbjct: 747  WGTRKVYGAVPHDLGTHDPWHEMNAYNIHDTSKWKDLNPKFVLQVYRDFSATGDLQFGVD 806

Query: 651  VWPSVYVAMAYMDQFDRDGDGMIENDGFPDQTYDTWSVSGISAYSGGLWVAALQAASALA 710
            VWP+V  AM YM+QFDRD DG+IENDGFPDQTYDTW+V G+SAY GGLW+AALQAA+A+A
Sbjct: 807  VWPAVRAAMEYMEQFDRDADGLIENDGFPDQTYDTWTVHGVSAYCGGLWLAALQAAAAMA 866

Query: 711  REVGDRGSEDYFLFKFQKAKVVYE-KLWNGSYFNYDNSGSSQSSSIQADQLAGQWYARAC 769
             ++GDR   +    KF KAK VYE KLWNGSYFNYD+  SS S SIQADQLAGQWY  + 
Sbjct: 867  LQLGDRDFAETCKRKFLKAKPVYEQKLWNGSYFNYDSGSSSNSKSIQADQLAGQWYTASS 926

Query: 770  GLLPIVDEDKARSALKKVYNYNVLKVMGGKRGAVNGMLPDGRVDMSSMQSREIWSGVTYA 829
            GL  + D+ K +S+L+KV+++NV+KV GG+ GAVNGM P+G+VD + MQSREIW+GVTY 
Sbjct: 927  GLPSLFDDFKIKSSLRKVFDFNVMKVKGGRMGAVNGMHPNGKVDETCMQSREIWAGVTYG 986

Query: 830  VAASMIHEDLADIGFQTACGIYEAAWSGTGLGYAFQTPEAWNTDDQYRSLCYMRPLAIWA 889
            VAA+MI   + +  F TA GI+ A WS  G GY FQTPEA+  D  YRSL YMRPL+IW 
Sbjct: 987  VAATMILAGMEEEAFTTAEGIFLAGWSEEGSGYWFQTPEAFTIDGHYRSLIYMRPLSIWG 1046

Query: 890  MQWALTRPK 898
            MQ+ALT PK
Sbjct: 1047 MQYALTMPK 1055



 Score =  263 bits (671), Expect = 5e-67,   Method: Compositional matrix adjust.
 Identities = 136/314 (43%), Positives = 181/314 (57%), Gaps = 59/314 (18%)

Query: 23  FDKVDPAKPASLTWQRKLSTGEIPLSQFTLNWKETVQLAPIGVRILCLIREEAAKGKRAF 82
           FD +D A P    W+R+L++    L +F + + E +++  +G+R+   +REEA+ G++A 
Sbjct: 28  FD-IDNAAPPEQAWRRRLNSHANLLKEFRVTFMEAIKMVRLGIRMWSYVREEASHGRKAP 86

Query: 83  IDPFIKRHL--TSSHGVPLGGVGSGSIGRSYRGEFQRWQIFPRVCEDKPVLANQFSV--- 137
           IDPF +     ++S GVPLGG+GSGSI R +RGEF++WQI P +CE  PV+ANQFSV   
Sbjct: 87  IDPFTRESCKPSASQGVPLGGMGSGSISRGFRGEFRQWQIIPGLCEPSPVMANQFSVSLV 146

Query: 138 -------------------------------------------------FVSRSNGQK-Y 147
                                                            FVSR  G K +
Sbjct: 147 FPCTIDSLYTSLICSQIRSRQNHLWVYFPSIYEDYFVSWGNECSLFAEIFVSREGGNKSF 206

Query: 148 SSVLCPKTPE---VLKDTTAAGIGSWDWNLKGDKSTYHALYPRAWTVHEGEPDPELRIVC 204
           +SVL P   E     +     GI SW WNL G  STYHAL+PRAWTV++GEPDPEL+I C
Sbjct: 207 ASVLAPGQHEGVGACRKADDQGISSWGWNLNGQHSTYHALFPRAWTVYDGEPDPELKISC 266

Query: 205 RQISPIIPHNYKESSYPVSVFTYTIYNSGKTSADITLLFTWTNSVGGDSEFTGQHYNSKT 264
           RQISP IPHNY+ESS P +VF YT+ N+GK  A ++LLFTW NS+GG+S  +G H N   
Sbjct: 267 RQISPFIPHNYRESSLPAAVFVYTLVNTGKERAKVSLLFTWANSIGGNSHLSGDHVNEPF 326

Query: 265 KMNDGVHAVLLHHR 278
              DGV  VLL+H+
Sbjct: 327 IAEDGVSGVLLYHK 340


>gi|125821356|ref|XP_687652.2| PREDICTED: non-lysosomal glucosylceramidase [Danio rerio]
          Length = 851

 Score =  621 bits (1601), Expect = e-175,   Method: Compositional matrix adjust.
 Identities = 358/869 (41%), Positives = 490/869 (56%), Gaps = 79/869 (9%)

Query: 46  PLSQFTLNWKETVQLAPIGVRIL-CLIREEAAKGKRAFIDPFIKRHLTSSHGVPLGGVGS 104
           P     ++    ++   +GVR L    R+   + K  FID F    L   +GVPLGG+G 
Sbjct: 44  PFQAKDVSLSNMLEHVSLGVRYLKWWYRKTQVEKKAPFIDMFRAVPLRQIYGVPLGGIGG 103

Query: 105 GSIGRSYRGEFQRWQIFPRVCEDKPVLANQFSVFVSRSNGQKYSSVLCPKTPEVLKDTTA 164
           GSI R +RGEF RWQ+ P +   K V+ANQF+V + R     Y  VL  + P  L+    
Sbjct: 104 GSITRGWRGEFCRWQLNPGMYHYKTVIANQFTVCLRRDGQTVYQQVLSTERPSTLQ---- 159

Query: 165 AGIGSWDWNLKGDKSTYHALYPRAWTVHEGEPDPELRIVCRQISPIIPHNYKESSYPVSV 224
                W+W   G+ + YH L+PRAWTV+   P   + + CRQ+SP+IPH+YK+SS PV+V
Sbjct: 160 ----GWNWGYCGEHAFYHGLFPRAWTVYN-LPGQNVTLTCRQVSPVIPHDYKDSSLPVAV 214

Query: 225 FTYTIYNSGKTSADITLLFTWTNSVGGDSEFTGQHYNSK---TKMNDGVHAVLLHHRTSH 281
           F + I N    + DI+++FT  N  G   + +G H+N      K  + V  VLLHH+T  
Sbjct: 215 FVWDIENKNDYALDISIMFTMVNGSGQKDDKSGGHWNEPFHLEKEGESVSGVLLHHQTPA 274

Query: 282 QLPPVTFALAAQETDGVHVSLCPHFVISGNSLGLTAKDMWHEIKEHGSFDRLNSMETSVT 341
              P T  +AA++  G  +S    F   G     T   +W ++   G  D  +   +S  
Sbjct: 275 N--PYTLCIAARQKSGQEISHQTAFSPKG-----TCSAVWCDLMTDGRLD--SPTNSSPP 325

Query: 342 SEPGSSIGAAIAASVTVPPDSEGQVTFSLAWDCPEVNFMSG-KTYYRRYTKFYGTHQNAA 400
           +E G  + AA+  S +V  +S   V FSLAWD P++ F S  +TY RRYT++YGT  +AA
Sbjct: 326 TEKGEEVAAALVVSCSVSANSRNSVDFSLAWDMPKITFGSKERTYVRRYTRYYGTKGDAA 385

Query: 401 ANIARDAILEHGSWELQIEAWQRPILEDKRLPEWYPITLFNELYYLNAGGAVWTDGSPPV 460
            +I+  A+  +  WE  IE WQRPIL+D+ LP WY   LFNELY++  GG +W       
Sbjct: 386 PSISHYALTHYSKWEESIEEWQRPILQDRSLPAWYKSALFNELYFVVDGGTIW------- 438

Query: 461 HSLVTIGHRKFSLDWSQSDLKRIVDVPNQNDTAVNILERMSSILEQIYTPVALNSSFGTN 520
                                  V++P   D +  +      +  Q   P  +       
Sbjct: 439 -----------------------VELPEDADISGGLRPEDGGLPAQ---PEVV------- 465

Query: 521 LLQDGEENIGQFLYLEGIEYLMWNTYDVHFYSSFALIMLFPKIQLSIQRDFAAAVMMHDP 580
                 ++ G+F YLEG EY M+NTYDVHFY+SFALIML+PK+ LS+Q D A +V+ HDP
Sbjct: 466 ------KDFGRFAYLEGQEYRMYNTYDVHFYASFALIMLWPKLALSLQYDIAGSVVQHDP 519

Query: 581 SKMKLLDDGQWVSRKVLGAVPHDIGICD--PWFEVNAYCLYDTARWKDLNPKFVLQVYRD 638
            +   L +G++   K  G VPHDIG  D  PW  VNAY ++DTA WKDLN KFVLQVYRD
Sbjct: 520 MERLNLMNGRYSPVKTRGVVPHDIGDPDDEPWVRVNAYLIHDTADWKDLNLKFVLQVYRD 579

Query: 639 VVATGDKKFAKAVWPSVYVAMAYMDQFDRDGDGMIENDGFPDQTYDTWSVSGISAYSGGL 698
              T D+++ K +WP     M    +FD+DGDG+IEN G+ DQTYD W V+G SAY GGL
Sbjct: 580 YHLTQDQQYLKDMWPVCQTVMENELKFDKDGDGLIENSGYADQTYDGWKVTGPSAYCGGL 639

Query: 699 WVAALQAASALAREVGDRGSEDYFLFKFQKAKVVYEK-LWNGSYFNYDNSGSSQSSSIQA 757
           W+A++     +AR +        +    ++    ++K LWNG Y+NYD+SG S S+S+ +
Sbjct: 640 WLASVCMMCKMARVLNCESVYQRYRDILERGSAAFDKLLWNGKYYNYDSSGRSLSNSVMS 699

Query: 758 DQLAGQWYARACGL----LPIVDEDKARSALKKVYNYNVLKVMGGKRGAVNGMLPDGRVD 813
           DQ AG W+ RA GL         ++K  SALK V++ NV+   GG+ GAVNGM P+G  D
Sbjct: 700 DQCAGHWFLRASGLGDDEYQAFPKEKICSALKSVFDLNVMSFAGGQMGAVNGMRPEGVPD 759

Query: 814 MSSMQSREIWSGVTYAVAASMIHEDLADIGFQTACGIYEAAWSGTGLGYAFQTPEAWNTD 873
            SS+QS E+W GV Y +AA+MIHE + + G +TA G Y A W   G+  AFQTPEA+   
Sbjct: 760 RSSVQSDEVWVGVVYGLAATMIHEGMVEEGLRTAEGCYRAVWERMGM--AFQTPEAYCEK 817

Query: 874 DQYRSLCYMRPLAIWAMQWAL-TRPKPKT 901
             YRSL YMRPL+IWAMQ AL  RP   T
Sbjct: 818 GIYRSLAYMRPLSIWAMQLALNNRPNHDT 846


>gi|363744064|ref|XP_003642968.1| PREDICTED: non-lysosomal glucosylceramidase-like [Gallus gallus]
          Length = 853

 Score =  614 bits (1584), Expect = e-173,   Method: Compositional matrix adjust.
 Identities = 355/844 (42%), Positives = 474/844 (56%), Gaps = 80/844 (9%)

Query: 79  KRAFIDPFIKRHLTSSHGVPLGGVGSGSIGRSYRGEFQRWQIFPRVCEDKPVLANQFSVF 138
           K AFID      L   +G PLGG+GSG+I R +RGEF RWQ+ P     + V+A+QF+V 
Sbjct: 73  KSAFIDLLCAVPLQQIYGCPLGGIGSGTITRGWRGEFCRWQLNPGKYHYETVIADQFTVC 132

Query: 139 VSRSNGQKYSSVLCPKTPEVLKDTTAAGIGSWDWNLKGDKSTYHALYPRAWTVHEGEPDP 198
           + R     Y  VL  + P  L+         W+W   G  + YHALYPRAW V+E  P  
Sbjct: 133 LRRKGQTVYQQVLSVEKPSALQ--------GWNWGYCGRYAFYHALYPRAWMVYE-LPGQ 183

Query: 199 ELRIVCRQISPIIPHNYKESSYPVSVFTYTIYNSGKTSADITLLFTWTNSVGGDSEFTGQ 258
            + + CRQISP+IPH+YK+SS PV VF + + N+ +   D++++FT  N  G   + +G 
Sbjct: 184 NVVLTCRQISPVIPHDYKDSSLPVGVFIWEVENNSEEPVDVSIMFTLQNGTGTKGDRSGG 243

Query: 259 HYNSKTKMNDG---VHAVLLHHRTSHQLPPVTFALAAQETDGVHVSLCPHFVISGNSLGL 315
           H+N    + DG   V  VLLHH T   + P TFA+AA+E  G  ++    F  +G     
Sbjct: 244 HWNEPFALQDGGERVAGVLLHHCTP--VNPFTFAVAAREKAGTVITHLTAFDPAG----- 296

Query: 316 TAKDMWHEIKEHGSFDRLNSMETSVTSEPGSSIGAAIAASVTVPPDSEGQVTFSLAWDCP 375
           + +D+W ++ + G  +  +    S  ++ G    AA+ AS TVP    G +  +L WD P
Sbjct: 297 SGRDVWQDLLQDGKLE--SPTGKSKQTQNGEVTAAAVCASCTVPAQGHGTLELALVWDMP 354

Query: 376 EVNFMSG-KTYYRRYTKFYGTHQNAAANIARDAILEHGSWELQIEAWQRPILEDKRLPEW 434
            V+F S  + + RRYT+F+G++ +AA  ++  A+  +  WE +IEAWQ+PILED +LP W
Sbjct: 355 HVHFGSKERLHLRRYTRFFGSNGDAAPALSHYALTHYKEWERKIEAWQKPILEDSQLPSW 414

Query: 435 YPITLFNELYYLNAGGAVWTDGSPPVHSLVTIGHRKFSLDWSQSDLKRIVDVPNQNDTAV 494
           Y   LFNELY++  GG +W D  P                                    
Sbjct: 415 YKSALFNELYFMTDGGTIWLDLPP------------------------------------ 438

Query: 495 NILERMSSILEQIYTPVALNSSFGTNLLQDGEENIGQFLYLEGIEYLMWNTYDVHFYSSF 554
                   + + +  P A   S    +L++     G+F YLEG EY M+NTYDVHFY+SF
Sbjct: 439 ------DCLPQDLQGPGAAKLSHLLPVLRE----YGRFAYLEGQEYRMYNTYDVHFYASF 488

Query: 555 ALIMLFPKIQLSIQRDFAAAVMMHDPSKMKLLDDGQWVSRKVLGAVPHDIGICD--PWFE 612
           AL+ML+PK+Q+S+Q D A  V+  D    + L  G+    KV   VPHDIG  D  PW  
Sbjct: 489 ALVMLWPKLQISLQYDIAVTVLNEDVQPRQYLVCGKTAQVKVKNVVPHDIGDPDDEPWQR 548

Query: 613 VNAYCLYDTARWKDLNPKFVLQVYRDVVATGDKKFAKAVWPSVYVAMAYMDQFDRDGDGM 672
           VNAY ++DTA WKDLN KFVLQVYRD   T D  + + +WP     M    +FD D DG+
Sbjct: 549 VNAYLMHDTANWKDLNLKFVLQVYRDYYLTHDALYLQDMWPVCQAVMESELKFDTDNDGL 608

Query: 673 IENDGFPDQTYDTWSVSGISAYSGGLWVAALQAASALAREVGDRGSEDYFLFKFQKAKVV 732
           IEN G  DQTYD W V G SAY GGLW+AA++    +A  +GD  +   +    QK K  
Sbjct: 609 IENGGIADQTYDAWVVDGASAYCGGLWLAAVRMMCEMAEVLGDTETRQKYDAILQKGKES 668

Query: 733 YEK-LWNGSYFNYDNSGSSQSSSIQADQLAGQWYARACGL----LPIVDEDKARSALKKV 787
           +E+ LWNG Y+NYD+SGSS SSSI +DQ AGQW+  ACGL    + +  +    SALK +
Sbjct: 669 FERLLWNGKYYNYDSSGSSTSSSIMSDQCAGQWFLGACGLDQKEVEVFPKSHIVSALKTI 728

Query: 788 YNYNVLKVMGGKRGAVNGMLPDGRVDMSSMQSREIWSGVTYAVAASMIHEDLADIGFQTA 847
           +  NV+   GGK GAVNGM PDG  D SS+QS E+W GV YA+AA+MI E L   GF TA
Sbjct: 729 FEKNVMSFAGGKMGAVNGMRPDGVPDTSSVQSSEVWVGVVYALAATMIQEGLVQEGFHTA 788

Query: 848 CGIYEAAWSGTGLGYAFQTPEAWNTDDQYRSLCYMRPLAIWAMQWAL---TRPKPKTLEK 904
            G Y   W    LG AFQTPEA+     YRSL YMRPL+IW+MQ AL    R  P     
Sbjct: 789 EGCYRTVWE--HLGMAFQTPEAYCEKKVYRSLAYMRPLSIWSMQLALESKARQAPSPTPP 846

Query: 905 QMKP 908
           Q  P
Sbjct: 847 QKNP 850


>gi|253761389|ref|XP_002489100.1| hypothetical protein SORBIDRAFT_0073s002050 [Sorghum bicolor]
 gi|241947399|gb|EES20544.1| hypothetical protein SORBIDRAFT_0073s002050 [Sorghum bicolor]
          Length = 603

 Score =  611 bits (1575), Expect = e-172,   Method: Compositional matrix adjust.
 Identities = 321/581 (55%), Positives = 399/581 (68%), Gaps = 27/581 (4%)

Query: 320 MWHEIKEHGSFDRLN-SMETSVTSEPGSSIGAAIAASVTVPPDSEGQVTFSLAWDCPEVN 378
           MW  + + GSFDR N S   S+    G ++ AA++AS  V P     V F+LAW  P+V 
Sbjct: 1   MWGTMVQDGSFDRDNFSAGPSMPCLLGDTVCAAVSASTWVEPHGRCTVAFALAWSSPKVK 60

Query: 379 FMSGKTYYRRYTKFYGTHQNAAANIARDAILEHGSWELQIEAWQRPILEDKRLPEWYPIT 438
           F  G TYYRRYTKFYGT   +A N+  DA++++  WE  I+ WQ PIL D+RLPEWY IT
Sbjct: 61  FKKGSTYYRRYTKFYGTSPRSAVNLVEDALMKYKHWEEAIDKWQTPILRDERLPEWYKIT 120

Query: 439 LFNELYYLNAGGAVWTDGSPPVHSLVTIGHRKFSLDWSQSDLKRIVDVPNQNDTAVNILE 498
           LFNELY+L AGG VW D      SLV           S +D K     P  ++   +   
Sbjct: 121 LFNELYFLVAGGTVWIDS----ESLV-----------SDADNKLNPSPPEDSNFPFHDST 165

Query: 499 RMSSILEQIYTPVALNSSFGTNLLQDGEENIGQFLYLEGIEYLMWNTYDVHFYSSFALIM 558
             S++    + P  +          D +EN+G+FLYLEGIEY MW TYDVHFY+SFAL+ 
Sbjct: 166 CNSTVPLIGFDPHEI----------DDKENVGKFLYLEGIEYFMWCTYDVHFYASFALLD 215

Query: 559 LFPKIQLSIQRDFAAAVMMHDPSKMKLLDDGQWVSRKVLGAVPHDIGICDPWFEVNAYCL 618
           LFPKI+LSIQRDFA AV+  D  +++ L DG W  RKV+GAV HD+G  DPW E+NAY +
Sbjct: 216 LFPKIELSIQRDFARAVLREDNRRVRFLADGTWGIRKVIGAVAHDLGAHDPWHELNAYNI 275

Query: 619 YDTARWKDLNPKFVLQVYRDVVATGDKKFAKAVWPSVYVAMAYMDQFDRDGDGMIENDGF 678
           +DT+RWKDLNPKFVLQ+YRD  ATGDK F K VWP+V  AM YM+QFD DGD MIENDGF
Sbjct: 276 HDTSRWKDLNPKFVLQIYRDFAATGDKSFGKDVWPAVCTAMEYMEQFDHDGDDMIENDGF 335

Query: 679 PDQTYDTWSVSGISAYSGGLWVAALQAASALAREVGDRGSEDYFLFKFQKAKVVYE-KLW 737
           PDQTYD W+V G+SAY G LW+AALQAA+ALAR +G     +  + +F KAK V+E +LW
Sbjct: 336 PDQTYDAWTVQGVSAYCGCLWLAALQAAAALARSLGHDDYAERCMTRFAKAKSVFEARLW 395

Query: 738 NGSYFNYDNSGSSQSSSIQADQLAGQWYARACGLLPIVDEDKARSALKKVYNYNVLKVMG 797
           NGSYFNYD+  S  S SIQADQLAGQWY  + GL P+ DED+ +  L+K+++YNV++V G
Sbjct: 396 NGSYFNYDSGTSYNSRSIQADQLAGQWYTASSGLPPLFDEDRIKCTLQKIFDYNVMRVKG 455

Query: 798 GKRGAVNGMLPDGRVDMSSMQSREIWSGVTYAVAASMIHEDLADIGFQTACGIYEAAWSG 857
           G+ GAVNGM P+G+VD + MQSREIW+GVTY++AA+M+   +    F TA GIY A WS 
Sbjct: 456 GRMGAVNGMHPNGKVDDTCMQSREIWTGVTYSLAATMLLHGMEHQAFTTAEGIYIAGWSE 515

Query: 858 TGLGYAFQTPEAWNTDDQYRSLCYMRPLAIWAMQWALTRPK 898
            G GY FQTPEAW  D  YRSL YMRPLAIWAMQWAL+ P+
Sbjct: 516 EGYGYWFQTPEAWTIDGHYRSLIYMRPLAIWAMQWALSPPR 556


>gi|414589150|tpg|DAA39721.1| TPA: hypothetical protein ZEAMMB73_266944 [Zea mays]
          Length = 583

 Score =  608 bits (1568), Expect = e-171,   Method: Compositional matrix adjust.
 Identities = 311/584 (53%), Positives = 390/584 (66%), Gaps = 53/584 (9%)

Query: 320 MWHEIKEHGSFDRLN-SMETSVTSEPGSSIGAAIAASVTVPPDSEGQVTFSLAWDCPEVN 378
           MW  + + GSFD+ N +   S+ S  G ++ AA++AS  V P     V F+LAW  P+V 
Sbjct: 1   MWGTMVQDGSFDQDNLNTGPSMPSLLGDTVCAAVSASTWVEPHGRCAVVFALAWSSPQVK 60

Query: 379 FMSGKTYYRRYTKFYGTHQNAAANIARDAILEHGSWELQIEAWQRPILEDKRLPEWYPIT 438
           F  G TYYRRYTKFYGT   +A N+  DA++++  WE +I+ WQ P+L+D+RLPEWY IT
Sbjct: 61  FKKGSTYYRRYTKFYGTSPRSAVNLVEDALMKYKHWEEEIDKWQTPVLQDERLPEWYKIT 120

Query: 439 LFNELYYLNAGGAVWTDGSPPVHSLVTIGHRKFSLDWSQSDLKRIVDVPNQNDTAVNILE 498
           LFNELY+L AGG VW D      SLV+    K +              P +++ +  I  
Sbjct: 121 LFNELYFLVAGGTVWIDSG----SLVSDADNKSN--------------PEESNLSHEI-- 160

Query: 499 RMSSILEQIYTPVALNSSFGTNLLQDGEENIGQFLYLEGIEYLMWNTYDVHFYSSFALIM 558
                                    D +EN+G+FLYLEGIEY MW TYDVHFY+SFAL+ 
Sbjct: 161 -------------------------DDKENVGKFLYLEGIEYFMWCTYDVHFYASFALLD 195

Query: 559 LFPKIQLSIQRDFAAAVMMHDPSKMKLLDDGQWVSRKVLGAVPHDIGICDPWFEVNAYCL 618
           LFPKI+LSIQRDFA AV+  D  +++ L DG W  RKV+GAV HD+G  DPW E+NAY +
Sbjct: 196 LFPKIELSIQRDFARAVLREDNRRVRFLADGTWGIRKVIGAVAHDLGAHDPWHELNAYNI 255

Query: 619 YDTARWKDLNPKFVLQVYRDVVATGDKKFAKAVWPSVYVAMAYMDQFDRDGDGMIENDGF 678
           +DT+RWKDLNPKFVLQ+YRD  ATGDK F K VWP+V  AM YM+QFD DGDGMIENDGF
Sbjct: 256 HDTSRWKDLNPKFVLQIYRDFAATGDKSFGKDVWPAVCTAMEYMEQFDHDGDGMIENDGF 315

Query: 679 PDQTYDTWSVSGISAYSGGLWVAALQAASALAREVGDRGSEDY---FLFKFQKAKVVYE- 734
           PDQTYD W+V G+SAY G      L A  A A   G  G +DY    + +F KAK V+E 
Sbjct: 316 PDQTYDAWTVQGVSAYCG---CLWLAALQAAAALAGSLGHDDYAERCMTRFVKAKTVFEA 372

Query: 735 KLWNGSYFNYDNSGSSQSSSIQADQLAGQWYARACGLLPIVDEDKARSALKKVYNYNVLK 794
           +LWNGS+FNYD+  S  S SIQADQLAGQWY  + GL P+ DED+ +  L+K+++YNV++
Sbjct: 373 RLWNGSHFNYDSGTSYSSRSIQADQLAGQWYTASSGLPPLFDEDRIKCTLQKIFDYNVMR 432

Query: 795 VMGGKRGAVNGMLPDGRVDMSSMQSREIWSGVTYAVAASMIHEDLADIGFQTACGIYEAA 854
           V GG+ GAVNGM P+G+VD + MQSREIW+GVTY++AA+M+   +    F TA GIY A 
Sbjct: 433 VKGGRMGAVNGMHPNGKVDETCMQSREIWTGVTYSLAATMLLHGMEHQAFTTAEGIYTAG 492

Query: 855 WSGTGLGYAFQTPEAWNTDDQYRSLCYMRPLAIWAMQWALTRPK 898
           WS  G GY FQTPEAW  D  YRSL YMRPLAIWAMQWAL+ P+
Sbjct: 493 WSEEGYGYWFQTPEAWTIDGHYRSLIYMRPLAIWAMQWALSPPR 536


>gi|330844075|ref|XP_003293963.1| hypothetical protein DICPUDRAFT_158887 [Dictyostelium purpureum]
 gi|325075658|gb|EGC29519.1| hypothetical protein DICPUDRAFT_158887 [Dictyostelium purpureum]
          Length = 1205

 Score =  608 bits (1568), Expect = e-171,   Method: Compositional matrix adjust.
 Identities = 340/912 (37%), Positives = 499/912 (54%), Gaps = 108/912 (11%)

Query: 35   TWQRKLSTGEIPLSQFTLNWKETVQLAPIGVRILCLIREEAAKGKRAFIDPFIKRHLTSS 94
            +W R+L         F ++  + ++L  IG R+   +++E + G+   +DPF        
Sbjct: 327  SWARRLDHNFPDHKHFQVSMWQGLKLVGIGYRMWKYVKKENSSGRVPMMDPFNLPKPGPI 386

Query: 95   HGVPLGGVGSGSIGRSYRGEFQRWQIFPRVCEDKPVLANQFSVFV--------------- 139
             GVP+GG+GSGSI R +RG+F RW +   +  ++ V  ++FSV++               
Sbjct: 387  MGVPIGGIGSGSINRGWRGDFVRWNMNSGLVTNETVDVDKFSVYINFDDSNSASSSTGST 446

Query: 140  --------------SRSNGQKYSSVLCPKTPEVLKDTTAAGIGSWDWNLKGDKSTYHALY 185
                          S +  ++ ++VL P  P+         +  W+W LKG+ S Y  L+
Sbjct: 447  PSGTFTPNGPNFDPSSTKSKQKATVLYPGKPK-----NNLNLDVWNWGLKGENSCYFGLF 501

Query: 186  PRAWTVHEGEPDPELRIVCRQISPIIPHNYKESSYPVSVFTYTIYNSGK-TSADITLLFT 244
            PRAWTV+E EP P++++VC+QISP+IPHNY+ESSYPV V+ + I N+ K +SAD++L+ T
Sbjct: 502  PRAWTVYE-EPHPDIKLVCKQISPVIPHNYQESSYPVGVYVWKIENNNKNSSADVSLMLT 560

Query: 245  WTNSVGGDSEFTGQHYNSKTKMND----GVHAVLLHHRTSHQLP----------PVTFAL 290
            W NS+G  S+  G HYN    + D     +  V L HR   +            P+ +++
Sbjct: 561  WQNSIGTKSDQDGGHYNKYFTLGDEPNKKIKGVTLIHRKEQRCAYNSSNIVYHDPMEYSI 620

Query: 291  AAQETDGVHVSLCPHFVISGNSLGLTAKDMWHEIKEHGSFDRLNSMETSVTSEPGSSIGA 350
            A ++   V+++    F  +     L A ++W+   + G    L +++ S  S P   IGA
Sbjct: 621  AVKDDPDVNITFNERFETTSR---LDAANLWYTFNKKG---ELENIQDSRPSAPKKPIGA 674

Query: 351  AIAASVTVPPDSEGQVTFSLAWDCPEVNFMSGKTYYRRYTKFYGTHQNAAANIARDAILE 410
            A+AA V VP      + F +AWD P   F  G  YY+RYTKFYG     +  IA DA+  
Sbjct: 675  AVAAKVKVPAGGSKTIVFCIAWDQPITRFQLGSAYYKRYTKFYGNKGGNSQRIAYDALQN 734

Query: 411  HGSWELQIEAWQRPILEDKRLPEWYPITLFNELYYLNAGGAVWTDGSPPVHSLVTIGHRK 470
            + SW+ QI  WQ PI++D  LP +Y + LFNELYYL  GG VWT GSP            
Sbjct: 735  YKSWDNQIAQWQNPIIQDAELPSFYKMALFNELYYLVDGGTVWTHGSPQ----------- 783

Query: 471  FSLDWSQSDLKRIVDVPNQNDTAVNILERMSSILEQIYTPVALNSSFGTNLLQDGEENIG 530
                                              +Q+     L +      + D   +IG
Sbjct: 784  ----------------------------------DQLPARYPLKTQLKPEDISD-PNHIG 808

Query: 531  QFLYLEGIEYLMWNTYDVHFYSSFALIMLFPKIQLSIQRDFAAAVMMHDPSKMKLLDDGQ 590
            +F YLE +EYLM+NTYDVHFYSS+AL ML+P +++S+Q D A A M+      + +  GQ
Sbjct: 809  RFAYLESLEYLMYNTYDVHFYSSYALAMLWPLLEISLQYDIAEATMLDYGITWEGIHSGQ 868

Query: 591  WVSRKVLGAVPHDIGIC--DPWFEVNAYCLYDTARWKDLNPKFVLQVYRDVVATGDKKFA 648
             + RK    VPHD+G    DPW  VN+Y + D +RWKDL  KFVLQVYRD +   DK F 
Sbjct: 869  QIPRKKRCTVPHDLGNPGEDPWKRVNSYNIQDISRWKDLPSKFVLQVYRDYLVVEDKNFL 928

Query: 649  KAVWPSVY-VAMAYMDQFDRDGDGMIENDGFPDQTYDTWSVSGISAYSGGLWVAALQAAS 707
              V+  V  V    ++ FD D DG+++N+GFPDQTYD W   G SAYSGGLW+A+L+ AS
Sbjct: 929  LQVYNVVEEVIQRTLESFDTDHDGVVDNEGFPDQTYDVWPAVGCSAYSGGLWLASLKVAS 988

Query: 708  ALAREVGDRGSEDYFLFKFQKAKVVY-EKLWNGSYFNYDNSGSSQSSSIQADQLAGQWYA 766
             +A+ +G +  E  +   F+K    Y +KLWNG YF YD S S  + SI +D LAG WY 
Sbjct: 989  EMAKILGFKEDESIYNAIFEKGSASYTKKLWNGYYFKYDCSNSVHADSIMSDMLAGHWYL 1048

Query: 767  RACGLLPIVDEDKARSALKKVYNYNVLKVMGGKRGAVNGMLPDGRVDMSSMQSREIWSGV 826
             +CGL   +  D+A S+L  +  YNV      + GAVNGM P+G VD + +QS E+W G 
Sbjct: 1049 LSCGLPSYMTFDQALSSLSIINEYNVNSYGKERCGAVNGMRPEGVVDNTCLQSSEVWIGT 1108

Query: 827  TYAVAASMIHEDLADIGFQTACGIYEAAWSGTGLGYAFQTPEAWNTDDQYRSLCYMRPLA 886
            ++++AA+MI   +    ++   GI  ++++    G+ +QTPEAW+++  YR+  YMRPL+
Sbjct: 1109 SFSLAATMIQHHMDKDAWELVKGIVNSSYN--RWGFQYQTPEAWDSNGCYRAGAYMRPLS 1166

Query: 887  IWAMQWALTRPK 898
            IW++QWAL + K
Sbjct: 1167 IWSIQWALLKKK 1178


>gi|281212396|gb|EFA86556.1| hypothetical protein PPL_00357 [Polysphondylium pallidum PN500]
          Length = 4775

 Score =  607 bits (1566), Expect = e-171,   Method: Compositional matrix adjust.
 Identities = 346/876 (39%), Positives = 491/876 (56%), Gaps = 85/876 (9%)

Query: 36   WQRKLSTGEIPLSQFTLNWKETVQLAPIGVRILCLIREEAAKGKRAFIDPFIKRHLTSSH 95
            W ++L         F ++  +  +L  +G R+   ++ E + G+   +DPF         
Sbjct: 339  WTKRLDHQFPEHKAFQVSIGQGFKLMGLGYRMWKYVKRENSAGRVPIMDPFNIPKPGPVM 398

Query: 96   GVPLGGVGSGSIGRSYRGEFQRWQIFPRVCEDKPVLANQFSVFVSRSNGQ------KYSS 149
            GVP+GG+GSGSI R +RG+F RW +   +   + V AN+FSV++     Q      K ++
Sbjct: 399  GVPIGGIGSGSITRGWRGDFVRWNLKNGMVNSEVVDANKFSVYIKMEEQQSQPLSTKRAT 458

Query: 150  VLCPKTPEVLKDTTAAGIGSWDWNLKGDKSTYHALYPRAWTVHEGEPDPELRIVCRQISP 209
            VLCP  P+     +   +  W+W+LKGD+S+Y   +PRAWTV+E EP  ++R+VC+Q+SP
Sbjct: 459  VLCPGKPK-----SNFALDVWNWSLKGDRSSYFGQFPRAWTVYE-EPHQDVRLVCKQVSP 512

Query: 210  IIPHNYKESSYPVSVFTYTIYNSGKTSADITLLFTWTNSVGGDSEFTGQHYNSKTKMND- 268
            +IP+NYKESSYP  VF + I N+  T A+++L+FTW NS G   +  G H+N + K  D 
Sbjct: 513  VIPNNYKESSYPCGVFVWKIENTASTDAEVSLMFTWQNSDGTAVDQAGGHHNKRFKYTDE 572

Query: 269  ------GVHAVLLHHRTSHQLP------PVTFALAAQETDGVHVSLCPHFVISGNSLGLT 316
                  GV         S   P      P+  ++  ++   V  S    F  +     L 
Sbjct: 573  QGRSINGVTLTTNRDLKSSVDPKKVYQDPLELSIGVRDDADVQFSFVSRFETTNR---LE 629

Query: 317  AKDMWHEIKEHGSFDRLNSMETSVTSEPGSSIGAAIAASVTVPPDSEGQVTFSLAWDCPE 376
            A ++W+   + G  D   + E S  S P   IGAAIAA V V   +  +V FS+AWD P 
Sbjct: 630  AANLWYSFNKSGVLD---NSEDSRPSAPKKPIGAAIAAKVLVKAGTTREVAFSVAWDTPV 686

Query: 377  VNFMSGKTYYRRYTKFYGTHQNAAANIARDAILEHGSWELQIEAWQRPILEDKRLPEWYP 436
              F SG  YYRRYTKF+GT  N +  IA DA+  +  WE +I  WQ PIL D  LP +Y 
Sbjct: 687  CRFNSGSGYYRRYTKFFGTAGNNSQRIACDAVYNYRKWEEEIVRWQHPILSDPSLPTFYK 746

Query: 437  ITLFNELYYLNAGGAVWTDGSPPVHSLVTIGHRKFSLDWSQSDLKRIVDVPNQNDTAVNI 496
              +FNELYY   GG VWT G+P                  Q + K+IV    Q D     
Sbjct: 747  QAIFNELYYFVDGGTVWTHGAPD----------------DQPNQKKIVSKFQQED----- 785

Query: 497  LERMSSILEQIYTPVALNSSFGTNLLQDGEENIGQFLYLEGIEYLMWNTYDVHFYSSFAL 556
                                     L D    IG+F YLE ++YLM+NTYDVHFY+SFAL
Sbjct: 786  -------------------------LNDPNY-IGRFAYLESVDYLMYNTYDVHFYASFAL 819

Query: 557  IMLFPKIQLSIQRDFAAAVMM-HDPSKMKLLDDGQWVSRKVLGAVPHDIGIC--DPWFEV 613
             ML+P+++LS+Q DFA A ++ +  ++++ +  G+ + RKV GAVPHD+G    DPW  V
Sbjct: 820  AMLWPRLELSLQTDFADATLLDYSDTQVECIQTGKNMPRKVRGAVPHDLGNPGEDPWKRV 879

Query: 614  NAYCLYDTARWKDLNPKFVLQVYRDVVATG-DKKFAKAVWPSVYVAMAYMDQFDRDGDGM 672
            NAY + D +RWKDL  KFVLQ+YRD +  G D+ F   +W  V   +    ++D D DG+
Sbjct: 880  NAYHIQDVSRWKDLPSKFVLQIYRDYLINGNDRTFLLQMWGVVEEVIRRAFEYDIDIDGV 939

Query: 673  IENDGFPDQTYDTWSVSGISAYSGGLWVAALQAASALAREVGDRGSEDYFLFKFQKAKVV 732
            I+N+G PDQTYD WS  G SAYSGGLW+AA++ AS +AR +G +  E  +   F+K K  
Sbjct: 940  IDNEGVPDQTYDAWSALGCSAYSGGLWLAAVKVASEMARILGLKEDETVYKKIFEKGKKS 999

Query: 733  YE-KLWNGSYFNYDNSGSSQSSSIQADQLAGQWYARACGLLPIVDEDKARSALKKVYNYN 791
            Y  KLWNG YFNYD+S +    SI +DQLAG WY  ACGL   +  D+A S+L  +  YN
Sbjct: 1000 YSTKLWNGHYFNYDSSKNPHFDSIMSDQLAGHWYLLACGLPSYITLDQALSSLSIINEYN 1059

Query: 792  VLKVMGGKRGAVNGMLPDGRVDMSSMQSREIWSGVTYAVAASMIHEDLADIGFQTACGIY 851
            V     G  GAVNGM P+G  D +S+QS E+W G +Y +A++M+   + +  ++   G+ 
Sbjct: 1060 VKSYSNGSCGAVNGMRPEGGPDTTSLQSCEVWIGTSYGLASTMLLHFMDNEAWELIKGLV 1119

Query: 852  EAAWSGTGLGYAFQTPEAWNTDDQYRSLCYMRPLAI 887
            ++ ++    G+ +QTPEAW+ +  YR+  YMRPLAI
Sbjct: 1120 DSTYN--KWGFQYQTPEAWDQNGLYRAGTYMRPLAI 1153


>gi|291241331|ref|XP_002740565.1| PREDICTED: hypothetical protein [Saccoglossus kowalevskii]
          Length = 792

 Score =  604 bits (1558), Expect = e-170,   Method: Compositional matrix adjust.
 Identities = 351/887 (39%), Positives = 480/887 (54%), Gaps = 117/887 (13%)

Query: 21  SSFDKVDPAKPASLTWQRKL----STGEIPLSQFTLNWKETVQLAPIGVRILCLIREEAA 76
           S  D   P+      W+ KL    +    PL    L  K+  +L P+  R     R    
Sbjct: 7   SCADSTFPSTVPEFGWRIKLDHVFAEKRAPLRSLRL--KQIPELLPMFFRYAWYCRGVTK 64

Query: 77  KGKRAFIDPFIKRHLTSSHGVPLGGVGSGSIGRSYRGEFQRWQIFPRVCEDKPVLANQFS 136
           KG+R +ID       T S+GVPLGG+G G I R ++G+F RW + P +     V  +QF+
Sbjct: 65  KGRRPYIDQIHPLQSTQSYGVPLGGIGCGCINRGWKGDFCRWSLIPGMYTHDTVQTDQFT 124

Query: 137 VFVSRSNGQKYSSVLCPKTPEVLKDTTAAGIGSWDWNLKGDKSTYHALYPRAWTVHEGEP 196
           V + R     Y  VL    P      T   + +W+W      +TYHALYPRAWTV+   P
Sbjct: 125 VCIRRGGRTTYQQVLSTSRP------TDQNLNTWNWGFNPSCATYHALYPRAWTVYH-LP 177

Query: 197 DPELRIVCRQISPIIPHNYKESSYPVSVFTYTIYNSGKTSADITLLFTWTNSVGGDSEFT 256
              + ++CRQ+SPI+PH+YK++S PV+VF + +YN G    D+T++F+W N  G  S+ +
Sbjct: 178 GQNVTLLCRQVSPILPHDYKDTSLPVAVFVWEVYNHGNEPIDVTIMFSWQNGTGQKSDKS 237

Query: 257 GQHYNSKTKMNDG-VHAVLLHHRTSHQLPPVTFALAAQETDGVHVSLCPHFVISGNSLGL 315
           G  +N    MN+G V  VLLH+   H     T A++A   D V VS    F  SG+ L  
Sbjct: 238 GGRWNKPFTMNEGLVDGVLLHN--GHPTISCTMAISAVCKDDVSVSHKTAFDPSGDGL-- 293

Query: 316 TAKDMWHEIKEHGSFDRLNSMETSVTSEPGSSIGAAIAASVTVPPDSEGQVTFSLAWDCP 375
              ++W ++   G  D  +    S  + PG    AA+AA   V   S     F LAWD P
Sbjct: 294 ---ELWQDLISDGELD--SEPGASFRTVPGELTAAAVAAKCHVAASSSCTAEFCLAWDMP 348

Query: 376 EVNFMSG-KTYYRRYTKFYGTHQNAAANIARDAILEHGSWELQIEAWQRPILEDKRLPEW 434
            V FM+  K + RRYT+++GT  +A  +++  A+ ++  WE +IE WQ PIL+++ LP W
Sbjct: 349 RVQFMAKEKIHNRRYTRWFGTDGDAGPSLSAYALTQYREWEHRIERWQNPILQNESLPSW 408

Query: 435 YPITLFNELYYLNAGGAVWTDGSPPVHSLVTIGHRKFSLDWSQSDLKRIVDVPNQNDTAV 494
           Y   LFNELY+++ GG +W + +   + L+  G                 D+ N      
Sbjct: 409 YKSALFNELYFISDGGTIWIEHNKEHNGLINSG-----------------DINN------ 445

Query: 495 NILERMSSILEQIYTPVALNSSFGTNLLQDGEENIGQFLYLEGIEYLMWNTYDVHFYSSF 554
                  S+++Q                       G+F YLEG EY M+NTYDVHFY+SF
Sbjct: 446 -------SVIKQY----------------------GKFAYLEGHEYRMFNTYDVHFYASF 476

Query: 555 ALIMLFPKIQLSIQRDFAAAVMMHDPSKMKLLDDGQWVSRKVLGAVPHDIGICDPWFEVN 614
           ALIML+P++QLS+Q D               +DD                   +PW  VN
Sbjct: 477 ALIMLWPQLQLSLQYD---------------IDD-------------------EPWVRVN 502

Query: 615 AYCLYDTARWKDLNPKFVLQVYRDVVATGDKKFAKAVWPSVYVAMAYMDQFDRDGDGMIE 674
           AY +YDTA WKDLN KFVLQVYRD   T D  + + +WP V V M     +D DGDG+I+
Sbjct: 503 AYLMYDTADWKDLNLKFVLQVYRDYYVTKDFIYLQDMWPKVKVVMQTSLAWDTDGDGIID 562

Query: 675 NDGFPDQTYDTWSVSGISAYSGGLWVAALQAASALAREVGDRGSEDYFLFKFQKAKVVYE 734
           N G+ DQTYD WSV+G SAY GGLW+AA++A   +A+ + +      +     + K  YE
Sbjct: 563 NSGYADQTYDAWSVTGASAYCGGLWLAAVRAMMEIAKILDEEDQFKKYSDVLIQGKSSYE 622

Query: 735 -KLWNGSYFNYDNSGSSQSSSIQADQLAGQWYARACGLLPIVDE----DKARSALKKVYN 789
            KLWNG Y+NYD+S    S+SI  DQ AGQWY RACGL+   DE    D    AL+ +Y 
Sbjct: 623 KKLWNGKYYNYDSSKQKHSNSIMCDQAAGQWYLRACGLVQENDEVFPKDHILCALRAIYK 682

Query: 790 YNVLKVMGGKRGAVNGMLPDGRVDMSSMQSREIWSGVTYAVAASMIHEDLADIGFQTACG 849
            NV+    G  GAVNGM PDG  D +S QS E+W+GVTYA+AA+MI E L +  F TA G
Sbjct: 683 MNVMGFEKGTMGAVNGMRPDGSPDTTSFQSEEVWTGVTYALAANMIQEGLLEEAFTTASG 742

Query: 850 IYEAAWSGTGLGYAFQTPEAWNTDDQYRSLCYMRPLAIWAMQWALTR 896
           +Y+  +   GLG  FQTPEA+  ++ YRSL YMRPLAIWAMQ+A+ R
Sbjct: 743 VYKTCYERCGLG--FQTPEAYFANNCYRSLGYMRPLAIWAMQYAVER 787


>gi|156356077|ref|XP_001623757.1| predicted protein [Nematostella vectensis]
 gi|156210486|gb|EDO31657.1| predicted protein [Nematostella vectensis]
          Length = 783

 Score =  604 bits (1558), Expect = e-170,   Method: Compositional matrix adjust.
 Identities = 357/849 (42%), Positives = 490/849 (57%), Gaps = 96/849 (11%)

Query: 73  EEAAKGKRAFIDPFIKRHLTSSHGVPLGGVGSGSIGRSYRGEFQRWQIFPRVCEDKPVLA 132
           +E   G+  +IDP         +GVPLGG+G G+IGR +RGEF RWQ+ P +     V A
Sbjct: 6   KEKKNGRVPYIDPLAANPCRQVYGVPLGGIGCGTIGRGWRGEFNRWQLTPGIYSYNYVEA 65

Query: 133 NQFSVFVSRSNGQKYSSVLCPKTPE-VLKDTTAAGIGSWDWNLKGDKSTYHALYPRAWTV 191
           NQF V V +     Y +VL P  P  VL   +  G   W+W   G  + YHALYPRAWT 
Sbjct: 66  NQFVVCVRKKGRTTYQAVLSPNRPNGVLCGRSLQG---WNWGFSGSNAVYHALYPRAWTR 122

Query: 192 HEGEPDPELRIVCRQISPIIPHNYKESSYPVSVFTYTIYNSGKTSADITLLFTWTNSVGG 251
           +E  P  ++ +VCRQ+SP+ PH+YK++S PV+VF ++I N+     +++++F++ N   G
Sbjct: 123 YE-LPGQDIILVCRQVSPVFPHDYKDTSLPVAVFIWSIENNNNEEVEVSIMFSFENG-DG 180

Query: 252 DSEFTGQHYN-----SKTKMNDGVHAVLLHHRTSHQLPPVTFALAAQ---ETDGVHVSLC 303
             + T  HYN     + +     V  VLLHH+  H   P T A+AA+   + + V V+  
Sbjct: 181 TQDLTAGHYNESADTTGSNTTCDVTGVLLHHK--HHKLPYTLAIAAKALHKKELVTVTTK 238

Query: 304 PHFVISGNSLGLTAKDMWHEIKEHGSFDRLNSMETSVTSEPGSSIGAAIAASVTVPPDSE 363
             F           + +W+++ + G  +  +S E S  S    S+ AA+AA+ TV   S 
Sbjct: 239 TWF-----DSRTPCRRVWNDLMDDGKLN--SSTEPSPPSRSDQSLCAAVAATTTVAARSR 291

Query: 364 GQVTFSLAWDCPEVNF-MSGKTYYRRYTKFYGTHQNAAANIARDAILEHGSWELQIEAWQ 422
           G++ F+LAWD P + F  S K +YR YT+++G   NA   +   A++ +  WE +IEAWQ
Sbjct: 292 GELEFALAWDMPVIYFGNSKKRHYRYYTRYFGHQGNAGPALCSHALMSYPDWETKIEAWQ 351

Query: 423 RPILEDKRLPEWYPITLFNELYYLNAGGAVWTDGSPPVHSLVTIGHRKFSLDWSQSDLKR 482
           +PIL+D+ LP WY   LFNELY++  GG VW                             
Sbjct: 352 KPILQDESLPNWYKSALFNELYFVADGGTVW----------------------------- 382

Query: 483 IVDVPNQNDTAVNILERMSSILEQIYTPVALNSSFGTNLLQDGEENIGQFLYLEGIEYLM 542
            +DV +++ T                       S GT    D    IG+F YLEG EY M
Sbjct: 383 -LDVRSESGT-----------------------SKGT---ADIVRKIGRFAYLEGHEYRM 415

Query: 543 WNTYDVHFYSSFALIMLFPKIQLSIQRDFAAAVMMHDPSKMKLLDDGQWVSRKVLGAVPH 602
           +NTYDVHFY+SFAL ML+PK+QLS+Q D A AV + D   +  + +G +  +KV G +PH
Sbjct: 416 YNTYDVHFYASFALAMLWPKLQLSLQYDMAHAVNVSDNQVVMTMMNGHYCRKKVPGCIPH 475

Query: 603 DIGICD--PWFEVNAYCLYDTARWKDLNPKFVLQVYRDVVATGDKKFAKAVWPSVYVAMA 660
           DIG     PW  VNAY ++DT++WKDLN KFVLQVYRD V T D  + + +WP     M 
Sbjct: 476 DIGDPSEAPWDHVNAYHIHDTSKWKDLNLKFVLQVYRDYVFTNDVYYLQDMWPITKTVMT 535

Query: 661 YMDQFDRDGDGMIENDGFPDQTYDTWSVSGISAYSGGLWVAALQAASALAREV---GDRG 717
               +D DGDG+IEN G  DQT+D W V+G SAY GGLW+AAL+  + +A  +    +RG
Sbjct: 536 KSMTYDSDGDGLIENSGLADQTFDAWPVTGPSAYCGGLWLAALRVMAEIATILDFPDERG 595

Query: 718 SEDYFLFKFQKAKVVYEK-LWNGSYFNYDNSGSSQSSSIQADQLAGQWYARACGLLPIVD 776
             +  L + +KA   YE+ LWNG Y+NYD+S S   +SI ADQL+GQWY  AC L    +
Sbjct: 596 KYEKILARGKKA---YERLLWNGKYYNYDSSTSKYHNSIMADQLSGQWYLHACDLAQTSN 652

Query: 777 E-----DKARSALKKVYNYNVLKVMGGKRGAVNGMLPDGRVDMSSMQSREIWSGVTYAVA 831
           +     +   SAL+ V+N+NV+K   G  GAVNG+ PDG++D SS+Q+ E+W+GVTYAVA
Sbjct: 653 DRVFPSENVISALRTVFNFNVMKFQEGTMGAVNGIRPDGQLDTSSLQAEEVWTGVTYAVA 712

Query: 832 ASMIHEDLADIGFQTACGIYEAAWSGTGLGYAFQTPEAWNTDDQYRSLCYMRPLAIWAMQ 891
           ASMI E L D GF+TA GIY   +    LG  FQTPEA   +  YRSL YMRPL+IWAMQ
Sbjct: 713 ASMIQEGLVDEGFKTASGIYNTCFE--RLGMNFQTPEAIVANGNYRSLAYMRPLSIWAMQ 770

Query: 892 WALTRPKPK 900
           WAL + K K
Sbjct: 771 WALEKRKNK 779


>gi|348520215|ref|XP_003447624.1| PREDICTED: non-lysosomal glucosylceramidase-like [Oreochromis
           niloticus]
          Length = 856

 Score =  592 bits (1525), Expect = e-166,   Method: Compositional matrix adjust.
 Identities = 349/856 (40%), Positives = 478/856 (55%), Gaps = 79/856 (9%)

Query: 63  IGVRILCLIREEAAKGKRA-FIDPFIKRHLTSSHGVPLGGVGSGSIGRSYRGEFQRWQIF 121
           +GVR L    ++    K+A FID F  + L   +G PLGG+G G+I R +RGEF RWQ+ 
Sbjct: 62  LGVRYLKFWYKKTQVEKKAPFIDMFGAQPLRQIYGAPLGGIGGGTITRGWRGEFCRWQLN 121

Query: 122 PRVCEDKPVLANQFSVFVSRSNGQKYSSVLCPKTPEVLKDTTAAGIGSWDWNLKGDKSTY 181
           P +   K V+ NQF+V + R     Y  VL  + P  L+         W+W   G+ + Y
Sbjct: 122 PGMYHYKTVIPNQFTVCLRREGKTVYQQVLSVERPPTLQ--------GWNWGYCGEYAFY 173

Query: 182 HALYPRAWTVHEGEPDPELRIVCRQISPIIPHNYKESSYPVSVFTYTIYNSGKTSADITL 241
           HALYPRAWTV++  P   + + CRQISP+IPH+Y++SS PV+V  + I N  +   D+++
Sbjct: 174 HALYPRAWTVYD-LPGQNVTLTCRQISPVIPHDYQDSSLPVAVLVWDIENKNEYDLDVSI 232

Query: 242 LFTWTNSVGGDSEFTGQHYNSK---TKMNDGVHAVLLHHRTSHQLPPVTFALAAQETDGV 298
           +FT  N  G   +  G H+N      K  + V  VLLHH T+  + P T  +AA+E    
Sbjct: 233 MFTMVNGSGQKDDKCGGHWNEPFHLEKEGEAVSGVLLHHCTA--VNPYTLCIAAREQSDR 290

Query: 299 HVSLCPHFVISGNSLGLTAKDMWHEIKEHGSFDRLNSMETSVTSEPGSSIGAAIAASVTV 358
            +S    F   G    L     W+++ + G  D  +   +S  +  G  + AA+A   +V
Sbjct: 291 EISHQTAFSPKGPCFSL-----WNDLIDDGRLD--SPTGSSPPTVKGEKVAAAVAVGCSV 343

Query: 359 PPDSEGQVTFSLAWDCPEVNFMSG-KTYYRRYTKFYGTHQNAAANIARDAILEHGSWELQ 417
           P      + F LAWD P++ F S  K + RRYT+++GT  +A+ +++  A+  +  WE  
Sbjct: 344 PAQGHNSLEFCLAWDMPKMTFGSQEKEHTRRYTRYFGTKGDASPSLSHYALTHYKQWERS 403

Query: 418 IEAWQRPILEDKRLPEWYPITLFNELYYLNAGGAVWTDGSPPVHSLVTIGHRKFSLDWSQ 477
           IE WQRPIL+D  LP WY   LFNELY++  GG VWT+                      
Sbjct: 404 IEEWQRPILQDSSLPSWYKSALFNELYFVADGGTVWTE---------------------- 441

Query: 478 SDLKRIVDVPNQNDTAVNILERMSSILEQIYTPVALNSSFGTNLLQDGEENIGQFLYLEG 537
                   +    D +  I      +  Q   P  +             +  G+F YLEG
Sbjct: 442 --------MTEDADISGGICSEDGGLPAQ---PAVI-------------KEYGRFAYLEG 477

Query: 538 IEYLMWNTYDVHFYSSFALIMLFPKIQLSIQRDFAAAVMMHDPSKMKLLDDGQWVSRKVL 597
            EY M+NTYDVHFYSSFALIML+PK+ LS+Q D A +V+  D ++   L  GQ+   K  
Sbjct: 478 QEYRMYNTYDVHFYSSFALIMLWPKLALSVQYDIAGSVVQQDLTERLYLMSGQYSPVKTK 537

Query: 598 GAVPHDIGICD--PWFEVNAYCLYDTARWKDLNPKFVLQVYRDVVATGDKKFAKAVWPSV 655
             VPHDIG  D  PW  VNAY ++DTA WKDLN KFVLQVYRD   T D+++ + +WP  
Sbjct: 538 NVVPHDIGDPDDEPWMRVNAYLIHDTAGWKDLNLKFVLQVYRDFHLTQDRQYLQDMWPIC 597

Query: 656 YVAMAYMDQFDRDGDGMIENDGFPDQTYDTWSVSGISAYSGGLWVAALQAASALAREVGD 715
              M    +FD DGDG+IEN G+ DQTYD W V+G SAY GG+WVA+L     +AR +  
Sbjct: 598 QAVMESELKFDLDGDGLIENSGYADQTYDGWIVTGPSAYCGGMWVASLCVMCKMARLLDS 657

Query: 716 RGSEDYFLFKFQKAKVVYEK-LWNGSYFNYDNSGSSQSSSIQADQLAGQWYARACGL--- 771
             S  ++     +    +EK LWNG Y+NYD+SG   S+S+ +DQ AG W+ RA GL   
Sbjct: 658 EESYRHYKDLLDRGSAAFEKLLWNGKYYNYDSSGRELSNSVMSDQCAGHWFLRASGLGDG 717

Query: 772 -LPIVDEDKARSALKKVYNYNVLKVMGGKRGAVNGMLPDGRVDMSSMQSREIWSGVTYAV 830
                 ++K +SALK V++ NV+   GG+ GAVNGM P+G  D SS+QS E+W GV Y +
Sbjct: 718 EFQAFPKEKIQSALKSVFDLNVMSFAGGQMGAVNGMRPEGVPDHSSVQSDEVWIGVVYGL 777

Query: 831 AASMIHEDLADIGFQTACGIYEAAWSGTGLGYAFQTPEAWNTDDQYRSLCYMRPLAIWAM 890
           AA+MIHE + + G +TA G Y   W    LG AFQTPEA+   + YRSL YMRPL+IWAM
Sbjct: 778 AATMIHEGMQEEGMRTAEGCYRTVWE--KLGMAFQTPEAYCEKNIYRSLAYMRPLSIWAM 835

Query: 891 QWAL-TRPKPKTLEKQ 905
           Q AL T  K +T   Q
Sbjct: 836 QLALNTSQKDQTTTAQ 851


>gi|414871105|tpg|DAA49662.1| TPA: hypothetical protein ZEAMMB73_422318 [Zea mays]
          Length = 589

 Score =  590 bits (1522), Expect = e-165,   Method: Compositional matrix adjust.
 Identities = 314/568 (55%), Positives = 392/568 (69%), Gaps = 24/568 (4%)

Query: 320 MWHEIKEHGSFDRLN-SMETSVTSEPGSSIGAAIAASVTVPPDSEGQVTFSLAWDCPEVN 378
           MW+ + ++G FDR N S   S+ S PG  + AA++AS  V P     V F+LAW  P+V 
Sbjct: 1   MWNTMLQNGHFDRENFSAGPSMPSSPGQKLCAAVSASTWVEPHGRCTVVFALAWSSPKVK 60

Query: 379 FMSGKTYYRRYTKFYGTHQNAAANIARDAILEHGSWELQIEAWQRPILEDKRLPEWYPIT 438
           F  G TY RRYT+FYGT + +A N+  DA+ ++  WE +IE WQ PIL+D+RLPEWY  T
Sbjct: 61  FQKGCTYNRRYTQFYGTSEKSAVNLVHDALTKYKLWEEEIEKWQNPILKDERLPEWYKFT 120

Query: 439 LFNELYYLNAGGAVWTDGSPPV---HSLVTIGHRKFSL----DWSQSDLK------RIVD 485
           LFNELY+L AGG VWTDG PP           H+K S     D +Q  +K       +  
Sbjct: 121 LFNELYFLVAGGTVWTDGQPPAIDEKKSPGFNHQKSSKRGTKDTNQGSVKDRHVNLAVEQ 180

Query: 486 VP------NQNDTAVNILERM--SSILEQIYTPVALNSSFGTNLLQDGEENIGQFLYLEG 537
           VP      N +D +V+    +  S + EQI   +         + +DG E++G+FLYLEG
Sbjct: 181 VPHGGYMANGDDHSVSKFAAVHGSEMQEQI-NGLKSEEPIPYLISKDGPEHVGKFLYLEG 239

Query: 538 IEYLMWNTYDVHFYSSFALIMLFPKIQLSIQRDFAAAVMMHDPSKMKLLDDGQWVSRKVL 597
           +EY+MWNTYDVHFY+SFAL+ LFPKI+LSIQRDFA AV+  D  K+K L DG    RKV 
Sbjct: 240 VEYIMWNTYDVHFYASFALLDLFPKIELSIQRDFANAVLYEDRRKVKFLADGTSGIRKVK 299

Query: 598 GAVPHDIGICDPWFEVNAYCLYDTARWKDLNPKFVLQVYRDVVATGDKKFAKAVWPSVYV 657
           GAVPHD+G  DPW E+NAY ++DT++WKDLNPKFVLQ+YRD  ATGD +F + VWP+V  
Sbjct: 300 GAVPHDLGTHDPWHEMNAYNIHDTSKWKDLNPKFVLQIYRDFAATGDMQFGRDVWPAVCA 359

Query: 658 AMAYMDQFDRDGDGMIENDGFPDQTYDTWSVSGISAYSGGLWVAALQAASALAREVGDRG 717
           AM YMDQFDRDGDG+IENDGFPDQTYD W+V GISAY G LW+AALQAA+ +A  +GDR 
Sbjct: 360 AMDYMDQFDRDGDGLIENDGFPDQTYDAWTVHGISAYCGCLWLAALQAAATMAHRLGDRH 419

Query: 718 SEDYFLFKFQKAKVVYE-KLWNGSYFNYDNSGSSQSSSIQADQLAGQWYARACGLLPIVD 776
             + +  KF KAK VYE KLWNGSYFNYD+  SS S SIQADQLAGQWYA + GL P+ D
Sbjct: 420 FAEKYKLKFIKAKAVYEAKLWNGSYFNYDSGTSSNSRSIQADQLAGQWYAASSGLPPLFD 479

Query: 777 EDKARSALKKVYNYNVLKVMGGKRGAVNGMLPDGRVDMSSMQSREIWSGVTYAVAASMIH 836
           E K R+AL+K++ +NV+KV GG+ GAVNGM P G+VD + MQSREIW+GVTYAVAA+M+ 
Sbjct: 480 EHKIRTALQKIFEFNVMKVKGGRMGAVNGMTPKGKVDETCMQSREIWTGVTYAVAANMLL 539

Query: 837 EDLADIGFQTACGIYEAAWSGTGLGYAF 864
             +   GF TA GI+ A WS  G GYA 
Sbjct: 540 HGMEHQGFTTAEGIFTAGWSEEGYGYAL 567


>gi|334333160|ref|XP_003341682.1| PREDICTED: LOW QUALITY PROTEIN: non-lysosomal
           glucosylceramidase-like [Monodelphis domestica]
          Length = 950

 Score =  589 bits (1519), Expect = e-165,   Method: Compositional matrix adjust.
 Identities = 342/829 (41%), Positives = 466/829 (56%), Gaps = 80/829 (9%)

Query: 79  KRAFIDPFIKRHLTSSHGVPLGGVGSGSIGRSYRGEFQRWQIFPRVCEDKPVLANQFSVF 138
           K  FID      L   +G PLGG+G G+I R +RG+F RWQ+ P + + + V+A+QF+V 
Sbjct: 138 KIPFIDMLNFLPLRQIYGCPLGGIGGGTITRGWRGQFCRWQLNPGIYQHQTVIADQFTVC 197

Query: 139 VSRSNGQKYSSVLCPKTPEVLKDTTAAGIGSWDWNLKGDKSTYHALYPRAWTVHEGEPDP 198
           + R     Y  VL  + P VL+        SW+W L G  + YHALYPRAWTV++  P  
Sbjct: 198 LRRGGQTVYQQVLSLERPSVLR--------SWNWGLCGFFAFYHALYPRAWTVYQ-LPGQ 248

Query: 199 ELRIVCRQISPIIPHNYKESSYPVSVFTYTIYNSGKTSADITLLFTWTNSVGGDSEFTGQ 258
           ++ + CRQI+PI+PH+Y++SS PV VF + + N+     DI+++F+  N +GG  +  G 
Sbjct: 249 QVTLTCRQITPILPHDYQDSSLPVGVFVWDVENNSDEDLDISIMFSMRNGLGGGHDSPGG 308

Query: 259 HYNSKTKM-NDG--VHAVLLHHRTSHQLPPVTFALAAQETDGVHVSLCPHFVISGNSLGL 315
            +N    + +DG  V  +LLHH       P T ALAA+ T    V+      I+      
Sbjct: 309 LWNEPFCLEHDGETVQGLLLHHPNPPN--PYTMALAARLTANTTVTH-----ITAFDPXC 361

Query: 316 TAKDMWHEIKEHGSFDRLNSMETSVTSEPGSSIGAAIAASVTVPPDSEGQVTFSLAWDCP 375
           T + +W ++      D   + E    ++ G  IG A+ A   +P     Q+ F+LAWD P
Sbjct: 362 TGQKVWQDLLPRXQLDSPLA-EWGPPTQKGQGIGGAVCAGTHLPRRGRCQLEFALAWDMP 420

Query: 376 EVNFMS-GKTYYRRYTKFYGTHQNAAANIARDAILEHGSWELQIEAWQRPILEDKRLPEW 434
           ++ F + G+ +YRRYT+F+G   NAA  +   A+ ++ SWE +IEAWQ P+LED+ LP W
Sbjct: 421 KILFGARGQAHYRRYTRFFGQEGNAAPALCHYALSQYRSWEERIEAWQNPVLEDRSLPAW 480

Query: 435 YPITLFNELYYLNAGGAVWTDGSPPVHSLVTIGHRKFSLDWSQSDLKRIVDVPNQNDTAV 494
           Y   LFNELY+L  GG +W                              V+VP +  T  
Sbjct: 481 YKSALFNELYFLADGGTMW------------------------------VEVPAETTT-- 508

Query: 495 NILERMSSILEQIYTPVALNSSFGTNLLQDGEENIGQFLYLEGIEYLMWNTYDVHFYSSF 554
              E    IL Q               LQ   +  G+F YLEG EY M+NTYDVHFY+SF
Sbjct: 509 ---EEFGGILNQ---------------LQPTLQEYGRFAYLEGQEYRMYNTYDVHFYASF 550

Query: 555 ALIMLFPKIQLSIQRDFAAAVMMHDPSKMKLLDDGQWVSRKVLGAVPHDIGICD--PWFE 612
           ALIML+PK++LS+Q D A A +  D ++ K L  G     K    +PHD+G  D  PW  
Sbjct: 551 ALIMLWPKLELSLQYDMAVATLSEDLTRRKYLMSGVMAPVKKKNVIPHDVGDPDDEPWLR 610

Query: 613 VNAYCLYDTARWKDLNPKFVLQVYRDVVATGDKKFAKAVWPSVYVAMAYMDQFDRDGDGM 672
           +NAY ++DTA WKDLN KFVLQ+YRD   TG++ F + +WP     M    +FD+D DG+
Sbjct: 611 INAYLIHDTADWKDLNLKFVLQIYRDFYLTGNESFLRDMWPVCQAVMESEMKFDKDQDGL 670

Query: 673 IENDGFPDQTYDTWSVSGISAYSGGLWVAALQAASALAREVGDRGSEDYFLFKFQKAKVV 732
           IEN G+PDQTYD W  +G SAY GGLW+AA+     +A   G +  ++ FL    + +  
Sbjct: 671 IENGGYPDQTYDGWITTGPSAYCGGLWLAAVAVMVQMATVCGTQDIQEKFLSILSRGREA 730

Query: 733 YEK-LWNGSYFNYDNSGSSQSSSIQADQLAGQWYARACGL----LPIVDEDKARSALKKV 787
           YE+ LWNG Y+NYD+S   QS  I +DQ AGQW+ RACGL      +        AL+ +
Sbjct: 731 YERLLWNGRYYNYDSSTQPQSRIIMSDQCAGQWFLRACGLGEGDTEVFPTPHVIRALQTI 790

Query: 788 YNYNVLKVMGGKRGAVNGMLPDGRVDMSSMQSREIWSGVTYAVAASMIHEDLADIGFQTA 847
           +  NV    GG  GAVNGM PDG  D SS+QS E+W GV Y +AA+MI E L   GF+TA
Sbjct: 791 FEVNVQGFAGGAMGAVNGMQPDGIPDTSSVQSDEVWVGVVYGLAATMIQEGLIQEGFRTA 850

Query: 848 CGIYEAAWSGTGLGYAFQTPEAWNTDDQYRSLCYMRPLAIWAMQWALTR 896
            G Y   W    LG AFQTPEA+     +RSL YMRPL+IWAMQ AL +
Sbjct: 851 EGCYRTVWE--RLGMAFQTPEAYCQHRVFRSLAYMRPLSIWAMQLALQQ 897


>gi|410925034|ref|XP_003975986.1| PREDICTED: non-lysosomal glucosylceramidase-like [Takifugu
           rubripes]
          Length = 827

 Score =  589 bits (1518), Expect = e-165,   Method: Compositional matrix adjust.
 Identities = 351/876 (40%), Positives = 487/876 (55%), Gaps = 85/876 (9%)

Query: 37  QRKLSTGEIPLSQFTLNWKETVQLAPIGVRILCLIREEAAKGKRA-FIDPFIKRHLTSSH 95
           ++   +G++ LSQ    W+       +G+R L    ++    K+A FID F    L   +
Sbjct: 22  RKPFQSGDVSLSQV---WEHL----GLGIRYLYWWYKKTRVEKKAPFIDIFGAVPLRQIY 74

Query: 96  GVPLGGVGSGSIGRSYRGEFQRWQIFPRVCEDKPVLANQFSVFVSRSNGQKYSSVLCPKT 155
           G PLGG+G G+I R +RGEF RWQ+ P +   K V+ NQF V + R+    Y  VL  + 
Sbjct: 75  GAPLGGIGGGTITRGWRGEFCRWQLKPGMYHYKTVIENQFMVCLRRNGQTVYQQVLSVER 134

Query: 156 PEVLKDTTAAGIGSWDWNLKGDKSTYHALYPRAWTVHEGEPDPELRIVCRQISPIIPHNY 215
           P  L+         W+W   G+ + YHALYPRAW V+   P   + + CRQISPIIPH+Y
Sbjct: 135 PPTLQ--------GWNWGFCGEYAYYHALYPRAWNVYH-LPGQNVTLTCRQISPIIPHDY 185

Query: 216 KESSYPVSVFTYTIYNSGKTSADITLLFTWTNSVGGDSEFTGQHYNSKTKM---NDGVHA 272
           +++S PV+VF + I N    + D++++FT  N  G   + +G H+N   ++    + V  
Sbjct: 186 QDASLPVAVFVWDIENKNDYALDVSIMFTLLNGSGHADDKSGGHWNEPFRLEKNGEAVSG 245

Query: 273 VLLHHRTSHQLPPVTFALAAQETDGVHVSLCPHFVISGNSLGLTAKDMWHEIKEHGSFDR 332
           VLLHH T+  + P T  +AA+E     V+    F   G   GL     W ++   G  D 
Sbjct: 246 VLLHHCTA--VNPYTLCIAAREQPKREVTHQTAFSPKGTCHGL-----WSDLITDGRLD- 297

Query: 333 LNSMETSVTSEPGSSIGAAIAASVTVPPDSEGQVTFSLAWDCPEVNFMSG-KTYYRRYTK 391
            +   +S  +  G  + AA+AA  +V   S G + F LAWD P++ F SG K + RRYT+
Sbjct: 298 -SPTGSSPPTAKGERVAAALAAGCSVATQSRGSLEFCLAWDMPKITFGSGEKEHSRRYTR 356

Query: 392 FYGTHQNAAANIARDAILEHGSWELQIEAWQRPILEDKRLPEWYPITLFNELYYLNAGGA 451
           +YG+  +A   ++  A+  +  WE  IE WQ PIL+D  LP WY   LFNELY++  GG 
Sbjct: 357 YYGSKGDACPLLSHHALTHYRGWEKSIEEWQGPILQDSSLPTWYKSALFNELYFVADGGT 416

Query: 452 VWTDGSPPVHSLVTIGHRKFSLDWSQSDLKRIVDVPNQNDTAVNILERMSSILEQIYTPV 511
           VWT+                        L    DV     +    L    S+++      
Sbjct: 417 VWTE------------------------LAEDADVSGGLRSEEGGLPAQPSVIK------ 446

Query: 512 ALNSSFGTNLLQDGEENIGQFLYLEGIEYLMWNTYDVHFYSSFALIMLFPKIQLSIQRDF 571
                           + G+F YLEG EY M+NTYDVH+Y+SFALIML+PK+ LS+Q D 
Sbjct: 447 ----------------DYGRFAYLEGQEYRMYNTYDVHYYASFALIMLWPKLALSLQYDI 490

Query: 572 AAAVMMHDPSKMKLLDDGQWVSRKVLGAVPHDIGICD--PWFEVNAYCLYDTARWKDLNP 629
           A +V+  DP++   L +G+    K  G VPHDIG  +  PW  VNAY +++TA WKDLN 
Sbjct: 491 AGSVVQQDPTERLYLMNGRCSPVKTKGVVPHDIGDPEDEPWQRVNAYLIHNTADWKDLNL 550

Query: 630 KFVLQVYRDVVATGDKKFAKAVWPSVYVAMAYMDQFDRDGDGMIENDGFPDQTYDTWSVS 689
           KFVLQVYRD   T D ++ + +WP     M    +FD DGDG+IEN GF DQTYD W+V+
Sbjct: 551 KFVLQVYRDFHLTQDSQYLQDMWPICQTVMESALKFDLDGDGLIENSGFADQTYDGWTVT 610

Query: 690 GISAYSGGLWVAALQAASALAREVGDRGSEDYFLFKFQKAKVVYEK-LWNGSYFNYDNSG 748
           G SAY GGLW+A+L     +AR + +     Y+     +  V Y+K LWNG Y+NYD+SG
Sbjct: 611 GPSAYCGGLWLASLVVMCKMARLLENEMKYQYYRDILDRGSVAYDKVLWNGKYYNYDSSG 670

Query: 749 SSQSSSIQADQLAGQWYARACGL----LPIVDEDKARSALKKVYNYNVLKVMGGKRGAVN 804
              S+S+ +DQ AG W+ RA GL         ++K + ALK V++ NV+   GG+ GAVN
Sbjct: 671 KDHSNSVMSDQCAGHWFLRASGLGDGEYQAFPKEKIQRALKTVFDLNVMSFAGGQMGAVN 730

Query: 805 GMLPDGRVDMSSMQSREIWSGVTYAVAASMIHEDLADIGFQTACGIYEAAWSGTGLGYAF 864
           GM P+G  D SS+QS E+W GV Y +AA+MIHE + + G +TA G Y   W    LG AF
Sbjct: 731 GMRPEGVPDRSSVQSDEVWIGVVYGLAATMIHEGMLEEGMRTAEGCYRTVWE--RLGMAF 788

Query: 865 QTPEAWNTDDQYRSLCYMRPLAIWAMQWALTRPKPK 900
           QTPEA+     YRSL YMRPL+IWAMQ AL   + K
Sbjct: 789 QTPEAYCEKGIYRSLAYMRPLSIWAMQLALDASRNK 824


>gi|196002317|ref|XP_002111026.1| hypothetical protein TRIADDRAFT_54543 [Trichoplax adhaerens]
 gi|190586977|gb|EDV27030.1| hypothetical protein TRIADDRAFT_54543 [Trichoplax adhaerens]
          Length = 843

 Score =  588 bits (1517), Expect = e-165,   Method: Compositional matrix adjust.
 Identities = 352/889 (39%), Positives = 492/889 (55%), Gaps = 109/889 (12%)

Query: 55  KETVQLAPIGVRILCLIREEAAKGKRAFIDPFIKRHLTSSHGVPLGGVGSGSIGRSYRGE 114
           K+ +    +G+R    I       +  FID F KR +   +GVP+GG+G G+I R + G+
Sbjct: 28  KQVLVSLKMGLRYGMYILHNRFHSREPFIDLFGKRQVDRLYGVPIGGLGCGTIMRGWNGQ 87

Query: 115 FQRWQIFPRVCEDKPVLANQFSVFVSRSNGQKYSSVLCPKTPEVLKDTTAAGIGSWDWNL 174
           F RWQ+ P       V A+QF+V + ++    Y  VL P  P      ++A +G W W  
Sbjct: 88  FCRWQLRPGWYTYDIVHADQFTVSIRKNGEPSYQQVLSPCRP------SSAQLGDWFWGY 141

Query: 175 KGDKSTYHALYPRAWTVHEGEPDPELRIVCRQISPIIPHNYKESSYPVSVFTYTIYNSGK 234
            G ++TY ALYPRAWT +   P   + + CRQISPIIPHNYK+S  PV  F + I N   
Sbjct: 142 HGSQATYRALYPRAWTTYN-LPGQNVTLTCRQISPIIPHNYKDSCLPVGCFIWEIDNQSD 200

Query: 235 TSADITLLFTWTNSVGGDSEFTGQHYN------SKTKMNDG-VHAVLLHHRTSHQLPPVT 287
              +++++FT+ N  G + +  G H N      S   MN+  +  VLLHHR S  L P T
Sbjct: 201 DEIEVSIMFTFQNGDGSEHDSAGGHQNETFTRSSNENMNNSKISGVLLHHRAS--LQPCT 258

Query: 288 FALAAQETDGVHVSLCPHF-----VISGNSLGLTAKDMWHEIKE----HGSFDRLNSMET 338
           F + A+E        CP       V   NS+    K    EI      HG  +  NS  T
Sbjct: 259 FTICAKE--------CPETGDYPAVSVSNSVSFDPKGSGREISRQLLSHGKLE--NSSNT 308

Query: 339 SVTSEP----GSSIGAAIAASVTVPPDSEGQVTFSLAWDCPEVNFMSGKT-YYRRYTKFY 393
             T  P    G  IGAA+A +  V P  + ++ F+L WD PEV F + +  Y RRYT+F+
Sbjct: 309 KATQSPVSKKGEIIGAAVATTAIVKPQGKAKMEFALTWDQPEVYFDAKEIKYRRRYTRFF 368

Query: 394 ---GTHQNAAANIARDAILEHGSWELQIEAWQRPILEDKRLPEWYPITLFNELYYLNAGG 450
              G+H+  AA++A  A+    SWE +I+ WQ PILE++  P+WY   +FNELYY++  G
Sbjct: 369 NPNGSHR--AADLAVYALQNVKSWEEKIDQWQSPILENESYPDWYKSAIFNELYYMSDSG 426

Query: 451 AVW--TDGSPPVHSLVTIGHRKFSLDWSQSDLKRIVDVPNQNDTAVNILERMSSILEQIY 508
           ++W  TDG                            D  ++ + A + L+ +    +QI+
Sbjct: 427 SIWAETDG----------------------------DEDDETNLA-SCLKEVPGKGKQIH 457

Query: 509 TPVALNSSFGTNLLQDGEENIGQFLYLEGIEYLMWNTYDVHFYSSFALIMLFPKIQLSIQ 568
                             +  G+F YLEG EYLM+NTYDVHFY+SFAL  L+P I+LSIQ
Sbjct: 458 ------------------KEYGRFAYLEGHEYLMYNTYDVHFYASFALAQLWPMIELSIQ 499

Query: 569 RDFAAAVMMHDPSKMKLLDDGQWVSRKVLGAVPHDIG-ICD-PWFEVNAYCLYDTARWKD 626
            DFAA++M  DP    ++  G    RK  G VPHDIG  CD PW  +NAYC+ D   W D
Sbjct: 500 YDFAASIMHEDPELRLIMWTGLRKPRKSYGFVPHDIGEPCDEPWKRINAYCISDVQDWTD 559

Query: 627 LNPKFVLQVYRDVVATGDKKFAKAVWPSVYVAMAYMDQFDRDGDGMIENDGFPDQTYDTW 686
           LN KFVLQ+YRD  ATG+ +F K +WP     M      D DGDG+I+N+G  DQTYD W
Sbjct: 560 LNTKFVLQIYRDYCATGNLEFLKDMWPCAKHVMETAISQDTDGDGIIDNNG-ADQTYDVW 618

Query: 687 SVSGISAYSGGLWVAALQAASALAREVG---DRGSEDYFLFKFQKAKVVYEKLWNGSYFN 743
            + G SAY GG+++AAL     +A  +G   D+      L + + +    +KLWNG Y+ 
Sbjct: 619 RMYGASAYCGGIFLAALYIMYKIADLIGCKEDKAKYSKILLRGRDS--FQQKLWNGDYYL 676

Query: 744 YDNSGSSQSSSIQADQLAGQWYARACGLLPIVDED-----KARSALKKVYNYNVLKVMGG 798
           YD+S  + S SI ADQ+ G WY +ACGL+   ++D     +   AL+KV++YNV+K M G
Sbjct: 677 YDSSKGNHSDSIMADQMCGHWYLQACGLVNNNEDDIFPKSQVDKALRKVFDYNVMKFMNG 736

Query: 799 KRGAVNGMLPDGRVDMSSMQSREIWSGVTYAVAASMIHEDLADIGFQTACGIYEAAWSGT 858
             GAVNGM P+G +D SS+QS E+W+GVTYA+AA++I + +   GF+ A G+Y+  ++  
Sbjct: 737 TWGAVNGMKPNGEIDTSSVQSEEVWTGVTYALAANLIQQGMIKEGFKVAEGMYDTTFNVM 796

Query: 859 GLGYAFQTPEAWNTDDQYRSLCYMRPLAIWAMQWALTRPKPKTLEKQMK 907
           GLG  FQTPEA+   ++YR   YMRPL+IWA++WA      K  +K  K
Sbjct: 797 GLG--FQTPEAYYKINRYRCTGYMRPLSIWAIEWAFNHVFKKNTKKITK 843


>gi|260813884|ref|XP_002601646.1| hypothetical protein BRAFLDRAFT_124315 [Branchiostoma floridae]
 gi|229286945|gb|EEN57658.1| hypothetical protein BRAFLDRAFT_124315 [Branchiostoma floridae]
          Length = 871

 Score =  584 bits (1505), Expect = e-164,   Method: Compositional matrix adjust.
 Identities = 333/834 (39%), Positives = 460/834 (55%), Gaps = 120/834 (14%)

Query: 79  KRAFIDPFIKRHLTSSHGVPLGGVGSGSIGRSYRGEFQRWQIFPRVCEDKPVLANQFSVF 138
           KR FID      L   +GVPLGG+G GSI R ++G+F RW + P +      +ANQF+V 
Sbjct: 129 KRPFIDALNPVQLQQIYGVPLGGLGCGSICRGWKGDFCRWALLPGIYSYDIKMANQFTVC 188

Query: 139 VSRSNGQKYSSVLCPKTPEVLKDTTAAGIGSWDWNLKGDKSTYHALYPRAWTVHEGEPDP 198
           V R     Y  VL  + P  L +        W+W+     + YHALYPRAWTV+E  P  
Sbjct: 189 VRRRGQTTYQQVLSTRHPSCLTE--------WNWSYSPSHAYYHALYPRAWTVYE-LPGQ 239

Query: 199 ELRIVCRQISPIIPHNYKESSYPVSVFTYTIYNSGKTSADITLLFTWTNSVGGDSEFTGQ 258
            + + CRQ+SP++PH+YK+SS P +VF +T  N GK   DI+L+FT+ NS+G D++ +G 
Sbjct: 240 NITLTCRQVSPVLPHDYKDSSIPAAVFVWTAENHGKEEVDISLMFTFQNSIGADTDLSGG 299

Query: 259 HYNSKTKMN----DGVHAVLLHHRTSHQLPPVTFALAAQETDGVHVSLCPHFVISGNSLG 314
           H+N    +N    D V  +LLHH+  H+  P T AL     +G+ V+    F  +G    
Sbjct: 300 HWNEPFSVNEEGVDPVSGILLHHQ--HKTMPCTIALGIPHKEGIRVTGSTSFNPAG---- 353

Query: 315 LTAKDMWHEIKEHGSFDRLNSMETSVT-SEPGSSIGAAIAASVTVPPDSEGQVTFSLAWD 373
            + K++W ++   G    L S E     +  G ++  A+ AS  +P   +    F L+WD
Sbjct: 354 -SGKEIWRDLMYDG---ELGSPEAPTNKTVKGEALAGAVCASFKLPSGGKQDTDFVLSWD 409

Query: 374 CPEVNFM-SGKTYYRRYTKFYGTHQNAAANIARDAILEHGSWELQIEAWQRPILEDKRLP 432
            P + F+ S +TY R YT ++G    AA  +A  A+  +  WE  IE WQRPIL+++  P
Sbjct: 410 MPVIKFVGSCRTYKRWYTGWFGDDGTAAPKLAAYALSHYPEWESTIEEWQRPILDNQSYP 469

Query: 433 EWYPITLFNELYYLNAGGAVWTDGSPPVHSLVTIGHRKFSLDWSQSDLKRIVDVPNQNDT 492
           EWY   LFNELY+++ GG +W                                V   ND 
Sbjct: 470 EWYRSALFNELYFMSDGGTIWL-------------------------------VAEDNDG 498

Query: 493 AVNILERMSSILEQIYTPVALNSSFGTNLLQDGEENIGQFLYLEGIEYLMWNTYDVHFYS 552
            V           +   P  L      ++++      G+F YLEG EY M+NTYDVH Y+
Sbjct: 499 HV-----------ENNLPPWLREKTPAHIME-----YGRFGYLEGHEYRMYNTYDVHHYA 542

Query: 553 SFALIMLFPKIQLSIQRDFAAAVMMHDPSKMKLLDDGQWVSRKVLGAVPHDIGICDPWFE 612
           SFAL ML+P+++ SIQ D+               DD                   +P+  
Sbjct: 543 SFALAMLWPQLEYSIQYDY---------------DD-------------------EPFIH 568

Query: 613 VNAYCLYDTARWKDLNPKFVLQVYRDVVATGDKKFAKAVWPSVYVAMAYMDQFDRDGDGM 672
           VNAY ++DT+ W+DLN KFVLQ++RD     DK F   +WP     M     +D D DGM
Sbjct: 569 VNAYVMHDTSDWRDLNLKFVLQIFRDYFVGRDKGFLAFMWPRAQAVMTKALTWDTDQDGM 628

Query: 673 IENDGFPDQTYDTWSVSGISAYSGGLWVAALQAASALAREVGDRGSEDYFLFKFQKAKVV 732
           I+N G  DQTYD W V+G SAY GGLW+AAL+    +A  +G    E ++L   Q+AK  
Sbjct: 629 IDNSGAADQTYDAWIVTGASAYCGGLWLAALRCMVEIANILGKEEEEQHYLAVMQRAKTA 688

Query: 733 Y-EKLWNGSYFNYDNSGSSQSSSIQADQLAGQWYARACGLLP----IVDEDK-------A 780
           Y EKLWNG Y+NYD SG S S+SI A   +G WY RACGL+P    ++D D+        
Sbjct: 689 YEEKLWNGEYYNYDCSGRSYSNSIMAGAGSGHWYLRACGLVPEDNEVMDSDREVLPLDHV 748

Query: 781 RSALKKVYNYNVLKVMGGKRGAVNGMLPDGRVDMSSMQSREIWSGVTYAVAASMIHEDLA 840
            S+L+KVY++NV++   G  GAVNGM P+G+ D++SMQS E+W+G+TYA+AASMI E + 
Sbjct: 749 ASSLRKVYDFNVMQFHQGTMGAVNGMRPNGKKDLTSMQSEEVWTGITYALAASMIQEGMV 808

Query: 841 DIGFQTACGIYEAAWSGTGLGYAFQTPEAWNTDDQYRSLCYMRPLAIWAMQWAL 894
           + GFQTA G+Y   +     G+AFQTPEA+   D +RSL YMRPLAIWAMQWAL
Sbjct: 809 EEGFQTAYGVYNMCY--LQCGFAFQTPEAYLERDFFRSLGYMRPLAIWAMQWAL 860


>gi|198424051|ref|XP_002127036.1| PREDICTED: similar to Non-lysosomal glucosylceramidase (NLGase)
           (Glucosylceramidase 2) (Beta-glucocerebrosidase 2)
           (Beta-glucosidase 2) [Ciona intestinalis]
          Length = 861

 Score =  583 bits (1502), Expect = e-163,   Method: Compositional matrix adjust.
 Identities = 334/849 (39%), Positives = 481/849 (56%), Gaps = 75/849 (8%)

Query: 63  IGVRILCLIREEAAKGKRAFIDPFIKRHLTSSHGVPLGGVGSGSIGRSYRGEFQRWQIFP 122
           + VR L    +   K K+  +D F +  L   +GVPLGG+GSG+I R ++G+F +WQ+ P
Sbjct: 74  LAVRYLQYYNQLRRKNKKPCMDFFNQHPLRPIYGVPLGGIGSGTINRGWQGDFCKWQLKP 133

Query: 123 RVCEDKPVLANQFSVFVSRSNGQKYSSVLCPKTPEVLKDTTAAGIGSWDWNLKGDKSTYH 182
            +      LA+QF V + R N      VL  K P      +   + +W+W   G  + Y+
Sbjct: 134 GIYTFDCPLADQFIVSIRRHNTVVCQQVLSTKGP------SGEALRTWNWKYCGSYAYYN 187

Query: 183 ALYPRAWTVHEGEPDPELRIVCRQISPIIPHNYKESSYPVSVFTYTIYNSGKTSADITLL 242
            LYPRAWTV++ +    + + CRQISP+ P++YK++S P +VF + + N G    ++T++
Sbjct: 188 GLYPRAWTVYDLQGQ-NIVLTCRQISPVFPNDYKDTSLPAAVFVWDVENRGNEEVEVTIV 246

Query: 243 FTWTNSVGG---DSEFTG--QHYNSKTKMNDGVHAVLLHHRTSHQLPPVTFALAAQETDG 297
            +  N  G      + +G      S T  N+    V++H+ T   +P  T  +AA E  G
Sbjct: 247 MSLENGTGNPEQQKKVSGLWNEPFSNTDHNNKAKGVVMHNTTYRDMP-YTLTVAAAEKPG 305

Query: 298 VHVSLCPHFVISGNSLGLTAKDMWHEIKEHGSFDRLNSM--ETSVTSEPGSSIGAAIAAS 355
           V V+ C  F  + N       ++W+ +   G   RL S   ++S+T+  G     A+AAS
Sbjct: 306 VDVTHCTSFNPTHN-----GAEVWNSLLNTG---RLQSKPGKSSLTAT-GEHTACAVAAS 356

Query: 356 VTVPPDS-EGQVTFSLAWDCPEVNFM-SGKTYYRRYTKFYGTHQNAAANIARDAILEHGS 413
             V P S  G + F+L WD P+++F  +GK Y RRYT+++    NA A IA  A+  + +
Sbjct: 357 CKVKPHSMAGSMDFALTWDMPQIHFGDAGKHYKRRYTRWFNG-DNAGATIASYALDNYQT 415

Query: 414 WELQIEAWQRPILEDKRLPEWYPITLFNELYYLNAGGAVWTDGSPPVHSLVTIGHRKFSL 473
           WE +IEAWQRPIL++K+LP+WY   +FNELYY+  GG VW                    
Sbjct: 416 WEDKIEAWQRPILDNKQLPDWYKSAIFNELYYITDGGTVW-------------------- 455

Query: 474 DWSQSDLKRIVDVPNQNDTAVNILERMSSILEQIYTPVALNSSFGTNLLQDGEENIGQFL 533
                     ++V  + +   N       + E +     ++S        D  +  G+F 
Sbjct: 456 ----------LEVAEEENGTAN-----GKVPETLQNTTKVDSC-------DFLKEYGKFA 493

Query: 534 YLEGIEYLMWNTYDVHFYSSFALIMLFPKIQLSIQRDFAAAVMMHDPSKMKLLDDGQWVS 593
           YLEG EY M+NTYDVHFY+S AL  L+PK++LS+Q D A +++  +P   K L DG    
Sbjct: 494 YLEGQEYKMYNTYDVHFYASIALAYLWPKLELSVQYDIATSILHSNPQPHKYLMDGVTAP 553

Query: 594 RKVLGAVPHDIGICD--PWFEVNAYCLYDTARWKDLNPKFVLQVYRDVVATGDKKFAKAV 651
            K    VPHD+G  +  PW  VN Y ++DTA W+DLNPKFVLQ YRD   T D  F KA+
Sbjct: 554 VKTPNVVPHDVGCPEDEPWLRVNTYFVHDTADWRDLNPKFVLQAYRDYYITKDIDFLKAM 613

Query: 652 WPSVYVAMAYMDQFDRDGDGMIENDGFPDQTYDTWSVSGISAYSGGLWVAALQAASALAR 711
           WP   + M    + D+D DG+IEN G  DQT+D W V+G SAY GGLW+AAL+     A 
Sbjct: 614 WPICKIVMEQSMRHDKDNDGLIENSGAADQTFDGWCVTGPSAYCGGLWLAALRCMEEAAD 673

Query: 712 EVGDRGSEDYFLFKFQKAKVVYE-KLWNGSYFNYDNSGSSQ-SSSIQADQLAGQWYARAC 769
            + ++  +  F     +A   Y+ +LWNG YFNYD S + Q  +S+ ADQ AGQW+  A 
Sbjct: 674 ILHEKDDQVKFHDILVRATKAYQDRLWNGKYFNYDCSNNPQHRTSVMADQCAGQWFLLAS 733

Query: 770 GLLPIVDEDKARSALKKVYNYNVLKVMGGKRGAVNGMLPDGRVDMSSMQSREIWSGVTYA 829
           G+  ++ +D+  S+LK +++ NV +  GG+ GAVNGM P G VD SS+QS E+W+GVTYA
Sbjct: 734 GICDVLPDDQVMSSLKTIFSLNVEQYDGGRHGAVNGMRPHGAVDTSSVQSDEVWTGVTYA 793

Query: 830 VAASMIHEDLADIGFQTACGIYEAAWSGTGLGYAFQTPEAWNTDDQYRSLCYMRPLAIWA 889
           +AA+MIH+ +   GF+TA GIY + W   G+  AFQTPEA   +  YRSL YMRPL+IW 
Sbjct: 794 LAATMIHKGMLKEGFRTASGIYRSCWQRYGM--AFQTPEAMRKNHTYRSLAYMRPLSIWG 851

Query: 890 MQWALTRPK 898
           MQ+AL  PK
Sbjct: 852 MQYALENPK 860


>gi|351707038|gb|EHB09957.1| Non-lysosomal glucosylceramidase [Heterocephalus glaber]
          Length = 922

 Score =  582 bits (1500), Expect = e-163,   Method: Compositional matrix adjust.
 Identities = 349/869 (40%), Positives = 476/869 (54%), Gaps = 83/869 (9%)

Query: 46  PLSQFTLNWKETVQLAPIGVRIL-CLIREEAAKGKRAFIDPFIKRHLTSSHGVPLGGVGS 104
           P     ++    V+   +G+R L    R+   + K+ FID      L   +G PLGG+G 
Sbjct: 92  PFQANNISLSNLVKHLGMGIRYLKWWYRKTQVEKKKPFIDMINCVPLRQIYGCPLGGIGG 151

Query: 105 GSIGRSYRGEFQRWQIFPRVCEDKPVLANQFSVFVSRSNGQKYSSVLCPKTPEVLKDTTA 164
           G+I R +RG+F RWQ+ P + + + V+A+QF+V + R     Y  VL  + P VL     
Sbjct: 152 GTITRGWRGQFCRWQLNPGIYQHQTVIADQFTVCLRREGRTVYQQVLSMERPSVLH---- 207

Query: 165 AGIGSWDWNLKGDKSTYHALYPRAWTVHEGEPDPELRIVCRQISPIIPHNYKESSYPVSV 224
               SW+W L G  + YHALYPRAWTV++  P   + + CRQI+PI+PH+Y++SS PV V
Sbjct: 208 ----SWNWGLCGYFAFYHALYPRAWTVYQ-LPGQNITLTCRQITPILPHDYEDSSLPVGV 262

Query: 225 FTYTIYNSGKTSADITLLFTWTNSVGGDSEFTGQHYNSKTKM-NDG--VHAVLLHHRTSH 281
           F + + N G  + D++++F+  N +GG+ +  G  +N    +  DG  V  +LLHH T  
Sbjct: 263 FVWDVENEGDEAVDVSIMFSMRNGLGGEDDAQGGLWNEPFCLEQDGETVQGLLLHHPT-- 320

Query: 282 QLP-PVTFALAAQETDGVHVSLCPHFVISGNSLGLTAKDMWHEIKEHGSFDR-LNSMETS 339
            LP P T A+AA+ T+   V+      I+      T   +W ++ + G  D        S
Sbjct: 321 -LPNPYTMAVAARLTEDSTVTH-----ITAFDPDSTGHQVWQDLLQDGQLDSPAGDGGRS 374

Query: 340 VTSEPGSSIGAAIAASVTVPPDSEGQVTFSLAWDCPEVNF-MSGKTYYRRYTKFYGTHQN 398
             S+ G+ I  A+  S  +PP     + FSLAWD P + F   G+ +YRRYT+F+G   +
Sbjct: 375 NPSQKGAGIAGALCVSGKLPPRGRCSLEFSLAWDMPRIMFGAKGQVHYRRYTRFFGPDGD 434

Query: 399 AAANIARDAILEHGSWELQIEAWQRPILEDKRLPEWYPITLFNELYYLNAGGAVWTDGSP 458
           AA  ++  A+  +  WE +I AWQ P+L+D+ LP WY   LFNELY+L  GG VW     
Sbjct: 435 AAPALSHYALCHYADWEDRISAWQSPVLDDRSLPAWYKSALFNELYFLADGGTVW----- 489

Query: 459 PVHSLVTIGHRKFSLDWSQSDLKRIVDVPNQNDTAVNILERMSSILEQIYTPVALNSSFG 518
                                    ++VP  +                   P  L  S G
Sbjct: 490 -------------------------LEVPEDS------------------LPEELVGSMG 506

Query: 519 TNLLQDGEENIGQFLYLEGIEYLMWNTYDVHFYSSFALIMLFPKIQLSIQRDFAAAVMMH 578
              L+      G+F YLEG EY M+NTYDVHFY+SFAL+ML+PKI+LS+Q D A A +  
Sbjct: 507 Q--LRPILREYGRFGYLEGQEYRMYNTYDVHFYASFALVMLWPKIELSLQYDMALATLRE 564

Query: 579 DPSKMKLLDDGQWVSRKVLGAVPHDIGICD--PWFEVNAYCLYDTARWKDLNPKFVLQVY 636
           D ++ + L  G     K    +PHDIG  D  PW  VNAY ++DTA WKDLN KFVLQVY
Sbjct: 565 DLTRRRYLMSGVMGPVKRRNVIPHDIGDPDDEPWLRVNAYLIHDTADWKDLNLKFVLQVY 624

Query: 637 RDVVATGDKKFAKAVWPSVYVAMAYMDQFDRDGDGMIENDGFPDQTYDTWSVSGISAYSG 696
           RD   TGD+ F K +WP     M    +FD+D DG+IEN G+ DQTYD W  +G SAY G
Sbjct: 625 RDYYLTGDQGFLKDMWPVCLAVMESEMKFDKDHDGLIENGGYADQTYDGWVTTGPSAYCG 684

Query: 697 GLWVAALQAASALAREVGDRGSEDYFLFKFQKAKVVYEK-LWNGSYFNYDNSGSSQSSSI 755
           GLW+AA+     +A     +  +D F     + +  YE+ LWNG Y+NYD+S   QS S+
Sbjct: 685 GLWLAAVAVTVQMAVLCAAQDVQDKFSSILSRGQEAYERLLWNGRYYNYDSSSQPQSRSV 744

Query: 756 QADQLAGQWYARACGL----LPIVDEDKARSALKKVYNYNVLKVMGGKRGAVNGMLPDGR 811
            +DQ AGQW+ +ACGL      +        AL+ ++  NV    GG  GAVNGM P G 
Sbjct: 745 MSDQCAGQWFLKACGLGEGDTEVFPTTHVVRALQTIFELNVQSFAGGAMGAVNGMQPHGI 804

Query: 812 VDMSSMQSREIWSGVTYAVAASMIHEDLADIGFQTACGIYEAAWSGTGLGYAFQTPEAWN 871
            D SS+QS E+W GV Y +AA+MI E L   GFQTA G Y   W   GL  AFQTPEA+ 
Sbjct: 805 PDRSSVQSDEVWVGVVYGLAATMIQEGLTWEGFQTAEGCYRTVWERLGL--AFQTPEAYC 862

Query: 872 TDDQYRSLCYMRPLAIWAMQWALTRPKPK 900
               +RSL YMRPL+IWAMQ AL + K K
Sbjct: 863 QQQVFRSLAYMRPLSIWAMQLALQQQKHK 891


>gi|405977906|gb|EKC42333.1| Non-lysosomal glucosylceramidase [Crassostrea gigas]
          Length = 1464

 Score =  578 bits (1489), Expect = e-162,   Method: Compositional matrix adjust.
 Identities = 329/853 (38%), Positives = 474/853 (55%), Gaps = 83/853 (9%)

Query: 56   ETVQLAPIGVRILCLIREEAAKGKRAFIDPFIKRHLTSSHGVPLGGVGSGSIGRSYRGEF 115
            + +    +G+R      +   +G++ FID   +    + +G P+GG+G G++GR YRGEF
Sbjct: 675  QVIGFIGLGIRYAKEWWKRRRQGRKMFIDHMDQLFHKAIYGAPIGGIGCGTMGRGYRGEF 734

Query: 116  QRWQIFPRVCEDKPVLANQFSVFVSRSNGQK-YSSVLCPKTPEVLKDTTAAGIGSWDWNL 174
             R+Q+ P + +   + ANQF +   R NGQ  Y  VL  +        ++  + SW+W  
Sbjct: 735  ARFQMVPGIYDHTVIQANQF-ILCIRKNGQTVYQKVLTGQRE------SSKSLRSWEWQA 787

Query: 175  KGDKSTYHALYPRAWTVHEGEPDPELRIVCRQISPIIPHNYKESSYPVSVFTYTIYNSGK 234
              +   YHALYPR+WTV+  + +  LR++CRQ+SPI PH+YK++S+P++ F +T  N+G 
Sbjct: 788  PREGDVYHALYPRSWTVYHLD-EFNLRLICRQVSPIFPHDYKDTSFPMATFIWTAENNGT 846

Query: 235  TSADITLLFTWTNSVGGDSEFTGQHYNSKTKMND---GVHAVLLHHRTSHQLPPVTFALA 291
               D++++FT+ N  G   +  G  +N     N     V  V ++H    Q    T+ + 
Sbjct: 847  EPLDVSIMFTFKNGRGVKDDGNGGCHNHVFDENSEGRQVSGVSINHNI--QDMKCTYCIG 904

Query: 292  AQETDGVHVSLCPHFVISGNSLGLTAKDMWHEIKEHGSFDRLNSMETSVTSEPGSSIGAA 351
                D V V+   +F   G     T +++W ++KE G  +     +T+ T + G    AA
Sbjct: 905  GVHKDDVSVTCKEYFDPRG-----TGQEVWEDLKEDGQLN-TEGGKTTCTGK-GEESAAA 957

Query: 352  IAASVTVPPDSEGQVTFSLAWDCPEVNFMSGKTYY-RRYTKFYGTHQNAAANIARDAILE 410
            +     VPP     + F+L WD P ++F + + +Y RRY++++G    A   ++  A   
Sbjct: 958  VCVKANVPPKQCKDLEFTLTWDMPIIHFKAKERFYARRYSRWFGLDGEAGPRMSSYAAAN 1017

Query: 411  HGSWELQIEAWQRPILEDKRLPEWYPITLFNELYYLNAGGAVWTDGSPPVHSLVTIGHRK 470
            + +WE +IE WQ+P+L++K LP WY   LFNELYY++ GG VW D   PV          
Sbjct: 1018 YSTWEQKIEEWQKPVLQNKNLPAWYKSALFNELYYVSDGGTVWID---PVEG-------- 1066

Query: 471  FSLDWSQSDLKRIVDVPNQNDTAVNILERMSSILEQIYTPVALNSSFGTNLLQDGEENIG 530
                     L R+ D  +                     P+               +   
Sbjct: 1067 -----QGGKLVRVTDPKD--------------------PPIV--------------QEFS 1087

Query: 531  QFLYLEGIEYLMWNTYDVHFYSSFALIMLFPKIQLSIQRDFAAAVMMHDPSKMKLLDDGQ 590
            +F YLEG EY M+NTYDVH Y+SF+LIML+PK+Q+S+Q D A AV   D +++K L  G+
Sbjct: 1088 RFGYLEGHEYRMYNTYDVHHYASFSLIMLWPKLQISLQYDIATAVESEDSTRIKFLTSGE 1147

Query: 591  WVSRKVLGAVPHDIGICD--PWFEVNAYCLYDTARWKDLNPKFVLQVYRDVVATGDKKFA 648
               RK L AVPHD+G  +  PW  VN Y ++ T  WKDLN KFVLQ YRD  AT D  + 
Sbjct: 1148 VGIRKRLHAVPHDMGDPEDEPWNRVNCYVIHPTCNWKDLNMKFVLQTYRDYSATKDDAYL 1207

Query: 649  KAVWPSVYVAMAYMDQFDRDGDGMIENDGFPDQTYDTWSVSGISAYSGGLWVAALQAASA 708
            + ++P     M    ++D D DG+I+N GF DQT+D W+++G SAY GG+W+AAL+    
Sbjct: 1208 RHMYPVAKYVMETAKKWDVDDDGIIDNGGFADQTFDAWTMTGASAYCGGMWLAALRMMIE 1267

Query: 709  LAREVGDRGSEDYFLFK--FQKAKVVYE-KLWNGSYFNYDNSGSSQSSSIQADQLAGQWY 765
            +A  +  +  +D   +K    K KV Y+ KLWNGSY+NYD S      SI ADQLAG W+
Sbjct: 1268 MATIL--KKEDDVLHYKEILDKGKVSYQNKLWNGSYYNYDCSSGGHHDSIMADQLAGHWF 1325

Query: 766  ARACGLL--PIVDEDKARSALKKVYNYNVLKVMGGKRGAVNGMLPDGRVDMSSMQSREIW 823
             +A GL    I   D+ +S+LKK++  NV+   GG  GA+NG  PDG  D+SS QS E W
Sbjct: 1326 LKASGLQDDDIFPPDRVKSSLKKIFENNVMMFEGGNMGAINGTRPDGSKDISSCQSEEFW 1385

Query: 824  SGVTYAVAASMIHEDLADIGFQTACGIYEAAWSGTGLGYAFQTPEAWNTDDQYRSLCYMR 883
             GVTY +AA+MI E L D  FQTA G Y   W   GL  AFQTPEA+ TD+ YRSL YMR
Sbjct: 1386 VGVTYGLAANMIQEGLLDQAFQTAWGAYHVCWEWYGL--AFQTPEAYMTDNIYRSLGYMR 1443

Query: 884  PLAIWAMQWALTR 896
            PLAIW+MQWAL +
Sbjct: 1444 PLAIWSMQWALEK 1456


>gi|354485725|ref|XP_003505033.1| PREDICTED: non-lysosomal glucosylceramidase-like, partial
           [Cricetulus griseus]
          Length = 898

 Score =  576 bits (1485), Expect = e-161,   Method: Compositional matrix adjust.
 Identities = 334/828 (40%), Positives = 464/828 (56%), Gaps = 84/828 (10%)

Query: 72  REEAAKGKRAFIDPFIKRHLTSSHGVPLGGVGSGSIGRSYRGEFQRWQIFPRVCEDKPVL 131
           R+   + K  FID F    L   +G PLGG+G G+I R +RG+F RWQ+ P + + + V+
Sbjct: 101 RKTQVEKKTPFIDMFNSVPLRQIYGCPLGGIGGGTITRGWRGQFCRWQLNPGMYQHQTVI 160

Query: 132 ANQFSVFVSRSNGQKYSSVLCPKTPEVLKDTTAAGIGSWDWNLKGDKSTYHALYPRAWTV 191
           A+QF+V + R+    Y  VL  + P VL+        SW+W L G  + YHALYPRAWTV
Sbjct: 161 ADQFTVCLRRNGRTVYQQVLSLERPSVLR--------SWNWGLCGHFAFYHALYPRAWTV 212

Query: 192 HEGEPDPELRIVCRQISPIIPHNYKESSYPVSVFTYTIYNSGKTSADITLLFTWTNSVGG 251
           ++  P   + + CRQI+PI+PH+Y++S  PV VF + + N G  + D++++F+  N +GG
Sbjct: 213 YQ-LPGQSVTLTCRQITPILPHDYQDSCLPVGVFVWDVENQGDETLDVSIMFSMRNGLGG 271

Query: 252 DSEFTGQHYNSKTKMN-DG--VHAVLLHHRTSHQLPPVTFALAAQETDGVHVSLCPHFVI 308
           + +  G  +N    +  DG  V  +LLHH T     P T A+AA++T   H++       
Sbjct: 272 EDDALGGLWNEPFCLQRDGKTVQGLLLHHPTPPN--PYTMAVAARDTTVTHIT-----AF 324

Query: 309 SGNSLGLTAKDMWHEIKEHGSFDRLNSMETSVTSEPGSSIGAAIAASVTVPPDSEGQVTF 368
             +S   T + +W  + + G  D  +    S  S+ G  +  A+  S  +PP  +  + F
Sbjct: 325 DPDS---TGQQVWKHLLQDGQLD--SPAGQSTPSKKGEGVAGAVCISSKLPPRGQCCLEF 379

Query: 369 SLAWDCPEVNF-MSGKTYYRRYTKFYGTHQNAAANIARDAILEHGSWELQIEAWQRPILE 427
           SLAWD P + F   G+ +YRRYT+F+G+  + A  ++  A+ ++  WE +I AWQ P+L+
Sbjct: 380 SLAWDMPRIMFGAKGQIHYRRYTRFFGSDGDVAPALSHYALCQYADWENRISAWQSPVLD 439

Query: 428 DKRLPEWYPITLFNELYYLNAGGAVWTDGSPPVHSLVTIGHRKFSLDWSQSDLKRIVDVP 487
           D+ LP WY   LFNELY+L  GG VW                              ++VP
Sbjct: 440 DRSLPAWYKSALFNELYFLADGGTVW------------------------------LEVP 469

Query: 488 NQNDTAVNILERMSSILEQIYTPVALNSSFGTNLLQDGEENIGQFLYLEGIEYLMWNTYD 547
             +     + E +   + Q+             +LQD     G+F YLEG EY M+NTYD
Sbjct: 470 EDS-----LPEELGENMYQL-----------RPILQD----YGRFGYLEGQEYRMYNTYD 509

Query: 548 VHFYSSFALIMLFPKIQLSIQRDFAAAVMMHDPSKMKLLDDGQWVSRKVLGAVPHDIGIC 607
           VHFY+SFAL+ML+PK++LS+Q D A A    D ++ + L  G     K    +PHDIG  
Sbjct: 510 VHFYASFALVMLWPKLELSLQYDMALATFKEDLTRRRYLMSGVVAPVKRRNVIPHDIGDP 569

Query: 608 D--PWFEVNAYCLYDTARWKDLNPKFVLQVYRDVVATGDKKFAKAVWPSVYVAMAYMDQF 665
           D  PW  VNAY ++DTA WKDLN KFVLQVYRD   TGD+ F K +WP     M    +F
Sbjct: 570 DDEPWLRVNAYLIHDTADWKDLNLKFVLQVYRDYYLTGDQGFLKDMWPVCLAVMESEMKF 629

Query: 666 DRDGDGMIENDGFPDQTYDTWSVSGISAYSGGLWVAALQAASALAREVGDRGSEDYFLFK 725
           D+D DG+IEN G+ DQTYD W  +G SAY GGLW+AA+     +A   G +  +D F   
Sbjct: 630 DKDQDGLIENGGYADQTYDGWVATGPSAYCGGLWLAAVAVMVQMAVLCGAQDVQDKFSSI 689

Query: 726 FQKAKVVYEK-LWNGSYFNYDNSGSSQSSSIQADQLAGQWYARACGL----LPIVDEDKA 780
             + +  YE+ LWNG Y+NYD+S   QS SI +DQ AGQW+ RACGL      +      
Sbjct: 690 LCRGREAYERLLWNGRYYNYDSSSQPQSRSIMSDQCAGQWFLRACGLGEGDTEVFPTLHV 749

Query: 781 RSALKKVYNYNVLKVMGGKRGAVNGMLPDGRVDMSSMQSREIWSGVTYAVAASMIHEDLA 840
             AL+ ++  NV    GG  GAVNGM P G  D SS+QS E+W GV Y +AA+MI E L 
Sbjct: 750 VRALQTIFELNVQAFAGGAMGAVNGMQPHGVPDRSSVQSDEVWVGVVYGLAATMIQEGLT 809

Query: 841 DIGFQTACGIYEAAWSGTGLGYAFQTPEAWNTDDQYRSLCYMRPLAIW 888
             GF+TA G Y   W   GL  AFQTPEA+     +RSL YMRPL+IW
Sbjct: 810 REGFRTAEGCYRTVWERLGL--AFQTPEAYCQQQVFRSLAYMRPLSIW 855


>gi|73971791|ref|XP_531991.2| PREDICTED: non-lysosomal glucosylceramidase [Canis lupus familiaris]
          Length = 1136

 Score =  576 bits (1485), Expect = e-161,   Method: Compositional matrix adjust.
 Identities = 336/831 (40%), Positives = 463/831 (55%), Gaps = 83/831 (9%)

Query: 72   REEAAKGKRAFIDPFIKRHLTSSHGVPLGGVGSGSIGRSYRGEFQRWQIFPRVCEDKPVL 131
            R+   + K  FID      L   +G PLGG+G G+I R +RG+F RWQ+ P + + + V+
Sbjct: 338  RKTQVEKKTPFIDLINCVPLRQIYGCPLGGIGGGTITRGWRGQFCRWQLNPGMYQHRTVI 397

Query: 132  ANQFSVFVSRSNGQKYSSVLCPKTPEVLKDTTAAGIGSWDWNLKGDKSTYHALYPRAWTV 191
            A+QF+V + R     Y  VL  + P VL+        SW+W L G  + YHALYPRAWTV
Sbjct: 398  ADQFTVCLRREGKTVYQQVLSVERPSVLR--------SWNWGLCGYFAFYHALYPRAWTV 449

Query: 192  HEGEPDPELRIVCRQISPIIPHNYKESSYPVSVFTYTIYNSGKTSADITLLFTWTNSVGG 251
            ++  P   + + CRQI+PI+PH+Y++SS PV VF + + N G  + D++++F+  N +GG
Sbjct: 450  YQ-LPGQNVTLTCRQITPILPHDYQDSSLPVGVFVWDVENEGDEALDVSIMFSMRNGLGG 508

Query: 252  DSEFTGQHYNSKTKM-NDG--VHAVLLHHRTSHQLP-PVTFALAAQETDGVHVSLCPHFV 307
              +  G  +N    +  DG  V  +LLHH     LP P T A+AA+ T    V+      
Sbjct: 509  GDDAIGGLWNEPFCLERDGETVQGLLLHHPA---LPNPYTMAVAARLTADTTVTH----- 560

Query: 308  ISGNSLGLTAKDMWHEIKEHGSFDRLNSMETSVTSEPGSSIGAAIAASVTVPPDSEGQVT 367
            I+      T + +W ++ + G  D  +    S  S+ G+ I  A+  +  +PP  + ++ 
Sbjct: 561  ITAFDPDSTGQQVWQDLLQDGQLD--SPAGRSTPSQKGAGIAGAVCVTGKLPPRGQCRLE 618

Query: 368  FSLAWDCPEVNF-MSGKTYYRRYTKFYGTHQNAAANIARDAILEHGSWELQIEAWQRPIL 426
            FSLAWD P + F   G+ YYRRYT+F+G   NAA  ++  A+ ++  WE +I AWQ P+L
Sbjct: 619  FSLAWDMPRIMFGAKGQVYYRRYTRFFGRDGNAAPALSHYALCQYVDWEEKISAWQSPVL 678

Query: 427  EDKRLPEWYPITLFNELYYLNAGGAVWTDGSPPVHSLVTIGHRKFSLDWSQSDLKRIVDV 486
            +D+ LP WY   LFNELY+L  GG VW                              ++V
Sbjct: 679  DDRSLPAWYKSALFNELYFLADGGTVW------------------------------LEV 708

Query: 487  PNQNDTAVNILERMSSILEQIYTPVALNSSFGTNLLQDGEENIGQFLYLEGIEYLMWNTY 546
            P ++     ++E M  +   +                      G+F YLEG EY M+NTY
Sbjct: 709  P-EDSLPEKLVESMCQLRPIL-------------------REYGRFGYLEGQEYRMYNTY 748

Query: 547  DVHFYSSFALIMLFPKIQLSIQRDFAAAVMMHDPSKMKLLDDGQWVSRKVLGAVPHDIGI 606
            DVHFY+SFAL+ML+PK++LS+Q D A A +  D ++ + L  G     K    +PHDIG 
Sbjct: 749  DVHFYASFALVMLWPKLELSLQYDMAVATLREDLTRRQYLMSGVMAPVKRRNVIPHDIGD 808

Query: 607  CD--PWFEVNAYCLYDTARWKDLNPKFVLQVYRDVVATGDKKFAKAVWPSVYVAMAYMDQ 664
             D  PW  VNAY ++DTA WKDLN KFVLQVYRD   TGD+ F + +WP     M    +
Sbjct: 809  PDDEPWLRVNAYVVHDTADWKDLNLKFVLQVYRDYYLTGDQGFLRDMWPVCLAVMESEMK 868

Query: 665  FDRDGDGMIENDGFPDQTYDTWSVSGISAYSGGLWVAALQAASALAREVGDRGSEDYFLF 724
            FD+D DG+IEN G+ DQTYD W  +G SAY GGLW+AA+     +A   G    +D F  
Sbjct: 869  FDKDQDGLIENGGYADQTYDGWVTTGPSAYCGGLWLAAVAVMVQMAILCGAHDVQDKFSS 928

Query: 725  KFQKAKVVYEK-LWNGSYFNYDNSGSSQSSSIQADQLAGQWYARACGL----LPIVDEDK 779
              ++ +  YE+ LWNG Y+NYD S   QS SI +DQ AGQW+ RA GL      +     
Sbjct: 929  ILRRGQEAYERLLWNGRYYNYDCSPQPQSCSIMSDQCAGQWFLRASGLGEGDTEVFPTRH 988

Query: 780  ARSALKKVYNYNVLKVMGGKRGAVNGMLPDGRVDMSSMQSREIWSGVTYAVAASMIHEDL 839
               AL+ ++ +NV    GG  GAVNGM P G  D SS+QS E+W GV Y +AA+MI E L
Sbjct: 989  VVCALQTIFEFNVRAFAGGAMGAVNGMQPHGVPDRSSVQSDEVWVGVVYGLAATMIQEGL 1048

Query: 840  ADIGFQTACGIYEAAWSGTGLGYAFQTPEAWNTDDQYRSLCYMRPLAIWAM 890
               GFQTA G Y   W    LG AFQTPEA+     +RSL YMRPL+IWAM
Sbjct: 1049 TWEGFQTAEGCYRTVWE--RLGLAFQTPEAYCQQRVFRSLAYMRPLSIWAM 1097


>gi|432920219|ref|XP_004079895.1| PREDICTED: non-lysosomal glucosylceramidase-like [Oryzias latipes]
          Length = 858

 Score =  575 bits (1482), Expect = e-161,   Method: Compositional matrix adjust.
 Identities = 339/845 (40%), Positives = 470/845 (55%), Gaps = 78/845 (9%)

Query: 63  IGVRILCLIREEAAKGKRA-FIDPFIKRHLTSSHGVPLGGVGSGSIGRSYRGEFQRWQIF 121
           +G+R L    ++    K+A FID F    L   +G PLGG+G G+I R +RGEF RWQ+ 
Sbjct: 62  LGIRYLRWWYKKTQVEKKAPFIDMFHALPLRQIYGAPLGGIGGGTITRGWRGEFCRWQLN 121

Query: 122 PRVCEDKPVLANQFSVFVSRSNGQKYSSVLCPKTPEVLKDTTAAGIGSWDWNLKGDKSTY 181
           P +   K V+A+QF+V + R     Y  VL  + P  L+         W+W   G+ + Y
Sbjct: 122 PGMYHYKTVIADQFTVCLRRGGQTVYQQVLSVEHPSTLQ--------GWNWGYCGEYAFY 173

Query: 182 HALYPRAWTVHEGEPDPELRIVCRQISPIIPHNYKESSYPVSVFTYTIYNSGKTSADITL 241
           HALYPRAWTV+   P   + + CRQISP+IPH+YK+SS PV+VF + I N    + ++++
Sbjct: 174 HALYPRAWTVYH-LPGQNVTLTCRQISPVIPHDYKDSSLPVAVFVWDIENKNDYALEVSI 232

Query: 242 LFTWTNSVGGDSEFTGQHYNSK---TKMNDGVHAVLLHHRTSHQLPPVTFALAAQETDGV 298
           +FT  N  G   + +G H+N      K    +  VLLHH T+  + P T  ++A+E    
Sbjct: 233 MFTMVNGSGHKDDSSGGHWNEPFHLEKEGKALSGVLLHHCTA--VNPYTLCISAREQPDR 290

Query: 299 HVSLCPHFVISGNSLGLTAKDMWHEIKEHGSFDRLNSMETSVTSEPGSSIGAAIAASVTV 358
            VS    F  SG   GL     W ++   G  D  +   +S  ++ G  + AA+A   +V
Sbjct: 291 KVSHQTAFSPSGTCSGL-----WSDLMTDGRLDSPSG--SSPPTQKGEKVAAALAVGCSV 343

Query: 359 PPDSEGQVTFSLAWDCPEVNFMS-GKTYYRRYTKFYGTHQNAAANIARDAILEHGSWELQ 417
           P      V F LAWD P + F S  + + RRYT+++G+   A+ +++  A+  +  WE +
Sbjct: 344 PAHGHNDVEFCLAWDMPLITFGSRERKHVRRYTRYFGSKGEASPSLSHYALTHYKEWERR 403

Query: 418 IEAWQRPILEDKRLPEWYPITLFNELYYLNAGGAVWTDGSPPVHSLVTIGHRKFSLDWSQ 477
           IE WQ PIL+D  LP WY   LFNELY++  GG VWT+                      
Sbjct: 404 IEEWQSPILQDSSLPSWYKSALFNELYFVADGGTVWTE---------------------- 441

Query: 478 SDLKRIVDVPNQNDTAVNILERMSSILEQIYTPVALNSSFGTNLLQDGEENIGQFLYLEG 537
             L    DV     +    L     ++++                       G+F YLEG
Sbjct: 442 --LAEDADVSGGVRSEDGGLPAQPDVIKEY----------------------GRFAYLEG 477

Query: 538 IEYLMWNTYDVHFYSSFALIMLFPKIQLSIQRDFAAAVMMHDPSKMKLLDDGQWVSRKVL 597
            EY M+NTYDVHFY+SFALIML+PK+ LS+Q D A +V+  D ++   L  G++   K  
Sbjct: 478 QEYRMYNTYDVHFYASFALIMLWPKLALSVQYDIAGSVVQQDLTERLHLMSGRYSPVKTK 537

Query: 598 GAVPHDIGICD--PWFEVNAYCLYDTARWKDLNPKFVLQVYRDVVATGDKKFAKAVWPSV 655
             VPHDIG  D  PW  +NAY ++DTA WKDLN KFVLQVYRD   T D ++ K +WP  
Sbjct: 538 NVVPHDIGDPDDEPWHRLNAYLIHDTAGWKDLNLKFVLQVYRDFHITQDNQYLKDMWPIC 597

Query: 656 YVAMAYMDQFDRDGDGMIENDGFPDQTYDTWSVSGISAYSGGLWVAALQAASALAREVGD 715
              M    +FD DGDG+IEN G+ DQTYD W+V+G SAY GGLW+A+L     +AR V  
Sbjct: 598 QAVMESEIKFDLDGDGLIENSGYADQTYDGWTVTGPSAYCGGLWLASLCVMCKMARLVDT 657

Query: 716 RGSEDYFLFKFQKAKVVYEK-LWNGSYFNYDNSGSSQSSSIQADQLAGQWY--ARACGL- 771
             +  ++     +    ++K LWNG Y+NYD+SG   S+S+ +DQ AG W+  A ACG  
Sbjct: 658 ERTYQHYKDILDRGSAAFDKLLWNGKYYNYDSSGRYHSNSVMSDQCAGHWFLKASACGEE 717

Query: 772 -LPIVDEDKARSALKKVYNYNVLKVMGGKRGAVNGMLPDGRVDMSSMQSREIWSGVTYAV 830
                 ++K +S LK +++ NV+   GG+ GAVNGM P+G  D SS+QS E+W GV Y +
Sbjct: 718 DYQAFPKEKIQSTLKSIFDLNVMAFAGGQMGAVNGMRPEGVPDRSSVQSDEVWIGVVYGL 777

Query: 831 AASMIHEDLADIGFQTACGIYEAAWSGTGLGYAFQTPEAWNTDDQYRSLCYMRPLAIWAM 890
           AA+MIHE + + G  TA G Y   W    LG AFQTPEA+   + YRSL YMRPL+IWAM
Sbjct: 778 AATMIHEGMREEGMHTAEGCYRTVWE--RLGMAFQTPEAYCEKNIYRSLAYMRPLSIWAM 835

Query: 891 QWALT 895
           Q AL 
Sbjct: 836 QLALN 840


>gi|426220232|ref|XP_004004320.1| PREDICTED: non-lysosomal glucosylceramidase [Ovis aries]
          Length = 918

 Score =  574 bits (1479), Expect = e-161,   Method: Compositional matrix adjust.
 Identities = 336/830 (40%), Positives = 464/830 (55%), Gaps = 81/830 (9%)

Query: 72  REEAAKGKRAFIDPFIKRHLTSSHGVPLGGVGSGSIGRSYRGEFQRWQIFPRVCEDKPVL 131
           R+   + K+ FID      L   +G PLGG+G G+I R +RG+F RWQ+ P + + + V+
Sbjct: 117 RKTQVEKKKPFIDLINSVPLRQIYGCPLGGIGGGTITRGWRGQFCRWQLNPGMYQHQTVI 176

Query: 132 ANQFSVFVSRSNGQKYSSVLCPKTPEVLKDTTAAGIGSWDWNLKGDKSTYHALYPRAWTV 191
           ANQF+V + R     Y  VL  + P VL+        SW+W L G  + YHALYPRAWTV
Sbjct: 177 ANQFTVCLRRRGQTVYQQVLSVERPSVLR--------SWNWGLCGYFAFYHALYPRAWTV 228

Query: 192 HEGEPDPELRIVCRQISPIIPHNYKESSYPVSVFTYTIYNSGKTSADITLLFTWTNSVGG 251
           ++  P   + + CRQI+PI+PH+Y++SS PV VF + + N G  + +++++F+  N +GG
Sbjct: 229 YQ-LPGQNVTLTCRQITPILPHDYQDSSLPVGVFVWDVENEGDEALEVSIMFSMRNGLGG 287

Query: 252 DSEFTGQHYNSKTKMN-DG--VHAVLLHHRTSHQLPPVTFALAAQETDGVHVSLCPHFVI 308
           + +  G  +N    +  DG  V  +LLHH T     P T A+AA+ +    V+    F  
Sbjct: 288 EDDAPGGLWNEPFCLERDGETVQGLLLHHPTPPN--PYTMAVAARLSADTTVTHLTAF-- 343

Query: 309 SGNSLGLTAKDMWHEIKEHGSFDRLNSMETSVTSEPGSSIGAAIAASVTVPPDSEGQVTF 368
             +S G   + +W ++   G  D L     S+ S+ G  I  A+  S  +PP    ++ F
Sbjct: 344 DPDSAG---QQVWQDLLLDGQLDSLAG--KSLPSQKGVGIAGAVCVSGKLPPRGRQRLEF 398

Query: 369 SLAWDCPEVNF-MSGKTYYRRYTKFYGTHQNAAANIARDAILEHGSWELQIEAWQRPILE 427
           SLAWD P + F   G+ YYRRYT+F+G   +AA  ++  A+  +  WE +I AWQ P+LE
Sbjct: 399 SLAWDMPRIMFGAKGRVYYRRYTRFFGPDGDAAPALSHYALSHYADWEEKISAWQNPVLE 458

Query: 428 DKRLPEWYPITLFNELYYLNAGGAVWTDGSPPVHSLVTIGHRKFSLDWSQSDLKRIVDVP 487
           D+ LP WY   LFNELY+L  GG VW                              ++VP
Sbjct: 459 DRSLPAWYKSALFNELYFLADGGTVW------------------------------LEVP 488

Query: 488 NQNDTAVNILERMSSILEQIYTPVALNSSFGTNLLQDGEENIGQFLYLEGIEYLMWNTYD 547
             +                   P AL  S G   L+   +  G+F YLEG EY M+NTYD
Sbjct: 489 EDS------------------LPEALVGSMGQ--LRPLLQEYGRFAYLEGQEYRMYNTYD 528

Query: 548 VHFYSSFALIMLFPKIQLSIQRDFAAAVMMHDPSKMKLLDDGQWVSRKVLGAVPHDIGIC 607
           VHFY+SFAL+ML+PK++LS+Q D A A +  D ++ + L  G     K    +PHD+G  
Sbjct: 529 VHFYASFALVMLWPKLELSLQYDMALATLREDLTQRRFLMSGVTAPVKRRNVIPHDVGDP 588

Query: 608 D--PWFEVNAYCLYDTARWKDLNPKFVLQVYRDVVATGDKKFAKAVWPSVYVAMAYMDQF 665
           D  PW  VNAY ++DT  WKDLN KFVLQVYRD   TGD+ F + +WP     M    +F
Sbjct: 589 DDEPWLRVNAYEIHDTGDWKDLNLKFVLQVYRDYYLTGDQCFLRDMWPVCLAVMESEMKF 648

Query: 666 DRDGDGMIENDGFPDQTYDTWSVSGISAYSGGLWVAALQAASALAREVGDRGSEDYFLFK 725
           D+D DG+IEN G+ DQTYD W  +G SAY GGLW+AA+     +A   G + +++ F   
Sbjct: 649 DKDHDGLIENGGYADQTYDGWVTTGPSAYCGGLWLAAVAVMVQMAALCGAQEAQNKFSSI 708

Query: 726 FQKAKVVYEK-LWNGSYFNYDNSGSSQSSSIQADQLAGQWYARACGL----LPIVDEDKA 780
             + +  Y++ LWNG Y+NYD S   QS SI +DQ AGQW+ RA GL      +    + 
Sbjct: 709 LSRGQEAYDRLLWNGRYYNYDCSSQPQSYSIMSDQCAGQWFLRASGLGEGDSEVFPTQQV 768

Query: 781 RSALKKVYNYNVLKVMGGKRGAVNGMLPDGRVDMSSMQSREIWSGVTYAVAASMIHEDLA 840
             AL+ ++ +NV    GG  GAVNGM P G  D SS+QS E+W GV Y +AA+MI E L 
Sbjct: 769 VRALQTIFEFNVQAFAGGAMGAVNGMQPRGVPDTSSVQSDEVWVGVVYGLAATMIQEGLT 828

Query: 841 DIGFQTACGIYEAAWSGTGLGYAFQTPEAWNTDDQYRSLCYMRPLAIWAM 890
             GF+TA G Y   W   GL  AFQTPEA+     +RSL YMRPL+IWAM
Sbjct: 829 WEGFRTAEGCYRTVWERLGL--AFQTPEAYCQRRVFRSLAYMRPLSIWAM 876


>gi|332228461|ref|XP_003263408.1| PREDICTED: LOW QUALITY PROTEIN: non-lysosomal glucosylceramidase
           [Nomascus leucogenys]
          Length = 919

 Score =  573 bits (1478), Expect = e-160,   Method: Compositional matrix adjust.
 Identities = 334/831 (40%), Positives = 458/831 (55%), Gaps = 87/831 (10%)

Query: 72  REEAAKGKRAFIDPFIKRHLTSSHGVPLGGVGSGSIGRSYRGEFQRWQIFPRVCEDKPVL 131
           R+   + K  FID      L   +G PLGG+G G+I R +RG+F RWQ+ P + + + V+
Sbjct: 119 RKTQVEKKTPFIDMINSVPLRQIYGCPLGGIGGGTITRGWRGQFCRWQLNPGMYQHRTVI 178

Query: 132 ANQFSVFVSRSNGQKYSSVLCPKTPEVLKDTTAAGIGSWDWNLKGDKSTYHALYPRAWTV 191
           A+QF+V + R     Y  VL  + P VL+        SW+W L G     HALYPR WTV
Sbjct: 179 ADQFTVCLRREGQTVYQQVLSLERPSVLR--------SWNWGLCGYFCFLHALYPRXWTV 230

Query: 192 HEGEPDPELRIVCRQISPIIPHNYKESSYPVSVFTYTIYNSGKTSADITLLFTWTNSVGG 251
           ++  P PE  + CRQI+PI+PH+Y++SS PV VF + + N G  + D++++F+  N +GG
Sbjct: 231 YQ-LPGPECSLTCRQITPILPHDYQDSSLPVGVFVWDVENEGDEALDVSIMFSMRNGLGG 289

Query: 252 DSEFTGQHYNSKTKMNDG---VHAVLLHHRTSHQLP-PVTFALAAQETDGVHVSLCPHFV 307
             +  G  +N    +  G   V  +LLHH T   LP P T A+AA+ T    V+      
Sbjct: 290 GDDAPGGLWNEPFCLERGGETVQGLLLHHPT---LPNPYTMAVAARVTADTTVTH----- 341

Query: 308 ISGNSLGLTAKDMWHEIKEHGSFDRLNSMETSVTSEPGSSIGAAIAASVTVPPDSEGQVT 367
           I+      T + +W ++ + G  D  +    S  ++ G  I  A+  S  + P  + ++ 
Sbjct: 342 ITAFDPDSTGQQVWQDLLQDGQLD--SPTGQSTPTQKGVGIAGAVCVSSKLQPRGQCRLE 399

Query: 368 FSLAWDCPEVNF-MSGKTYYRRYTKFYGTHQNAAANIARDAILEHGSWELQIEAWQRPIL 426
           FSLAWD P + F   G+ +YRRYT+F+G   +AA  ++  A+ ++  WE +I AWQ P+L
Sbjct: 400 FSLAWDMPRIMFGAKGQVHYRRYTRFFGQDGDAAPTLSHYALCQYPEWEERISAWQSPVL 459

Query: 427 EDKRLPEWYPITLFNELYYLNAGGAVWTDGSPPVHSLVTIGHRKFSLDWSQSDLKRIVDV 486
           +D+ LP WY   LFNELY+L  GG VW +                               
Sbjct: 460 DDRSLPAWYKSALFNELYFLADGGTVWLE------------------------------- 488

Query: 487 PNQNDTAVNILERMSSILEQIYTPVALNSSFGTNL--LQDGEENIGQFLYLEGIEYLMWN 544
                           +LE      +L    G N+  L+    + G+F YLEG EY M+N
Sbjct: 489 ----------------VLED-----SLPEELGRNMCHLRPTLRDYGRFGYLEGQEYRMYN 527

Query: 545 TYDVHFYSSFALIMLFPKIQLSIQRDFAAAVMMHDPSKMKLLDDGQWVSRKVLGAVPHDI 604
           TYDVHFY+SFALIML+PK++LS+Q D A A +  D ++ + L  G     K    +PHDI
Sbjct: 528 TYDVHFYASFALIMLWPKLELSLQYDMALATLREDLTRRRYLMSGVMAPVKRRNVIPHDI 587

Query: 605 GICD--PWFEVNAYCLYDTARWKDLNPKFVLQVYRDVVATGDKKFAKAVWPSVYVAMAYM 662
           G  D  PW  VNAY ++DTA WKDLN KFVLQVYRD   TGD+ F K +WP     M   
Sbjct: 588 GDPDDEPWLRVNAYLIHDTADWKDLNLKFVLQVYRDYYLTGDQNFLKDMWPVCLAVMESE 647

Query: 663 DQFDRDGDGMIENDGFPDQTYDTWSVSGISAYSGGLWVAALQAASALAREVGDRGSEDYF 722
            +FD+D DG+IEN G+ DQTYD W  +G SAY GGLW+AA+     +A   G +  +D F
Sbjct: 648 MKFDKDHDGLIENGGYADQTYDGWVTTGPSAYCGGLWLAAVAVMVQMAALCGAQDIQDKF 707

Query: 723 LFKFQKAKVVYEK-LWNGSYFNYDNSGSSQSSSIQADQLAGQWYARACGL----LPIVDE 777
                + +  YE+ LWNG Y+NYD+S   QS S+ +DQ AGQW+ +ACGL      +   
Sbjct: 708 SSILSRGQEAYERLLWNGRYYNYDSSSRPQSRSVMSDQCAGQWFLKACGLGEGDTEVFPT 767

Query: 778 DKARSALKKVYNYNVLKVMGGKRGAVNGMLPDGRVDMSSMQSREIWSGVTYAVAASMIHE 837
                AL+ ++  NV    GG  GAVNGM P G  D SS+QS E+W GV Y +AA+MI E
Sbjct: 768 PHVVRALQTIFELNVQAFAGGAMGAVNGMQPHGVPDKSSVQSDEVWVGVVYGLAATMIQE 827

Query: 838 DLADIGFQTACGIYEAAWSGTGLGYAFQTPEAWNTDDQYRSLCYMRPLAIW 888
            L   GFQTA G Y   W   GL  AFQTPEA+     +RSL YMRPL+IW
Sbjct: 828 GLTWEGFQTAEGCYRTVWERLGL--AFQTPEAYCQQRVFRSLAYMRPLSIW 876


>gi|440901933|gb|ELR52793.1| Non-lysosomal glucosylceramidase, partial [Bos grunniens mutus]
          Length = 927

 Score =  573 bits (1476), Expect = e-160,   Method: Compositional matrix adjust.
 Identities = 336/830 (40%), Positives = 464/830 (55%), Gaps = 81/830 (9%)

Query: 72  REEAAKGKRAFIDPFIKRHLTSSHGVPLGGVGSGSIGRSYRGEFQRWQIFPRVCEDKPVL 131
           R+   + K+ FID      L   +G PLGG+G G+I R +RG+F RWQ+ P + + + V+
Sbjct: 126 RKTQVEKKKPFIDLINSVPLRQIYGCPLGGIGGGTITRGWRGQFCRWQLNPGMYQHQTVI 185

Query: 132 ANQFSVFVSRSNGQKYSSVLCPKTPEVLKDTTAAGIGSWDWNLKGDKSTYHALYPRAWTV 191
           ANQF+V + R     Y  VL  + P VL+        SW+W L G  + YHALYPRAWTV
Sbjct: 186 ANQFTVCLRRRGQTVYQQVLSVERPSVLR--------SWNWGLCGYFAFYHALYPRAWTV 237

Query: 192 HEGEPDPELRIVCRQISPIIPHNYKESSYPVSVFTYTIYNSGKTSADITLLFTWTNSVGG 251
           ++  P   + + CRQI+PI+PH+Y++SS PV VF + + N G  + +++++F+  N +GG
Sbjct: 238 YQ-LPGQNVTLTCRQITPILPHDYQDSSLPVGVFVWDVENEGDEALEVSIMFSMRNGLGG 296

Query: 252 DSEFTGQHYNSKTKM-NDG--VHAVLLHHRTSHQLPPVTFALAAQETDGVHVSLCPHFVI 308
           + +  G  +N    + +DG  V  +LLHH T     P T A+AA+ +    V+    F  
Sbjct: 297 EDDAPGGLWNEPFCLEHDGETVQGLLLHHPTPPN--PYTMAVAARLSADTTVTHLTAF-- 352

Query: 309 SGNSLGLTAKDMWHEIKEHGSFDRLNSMETSVTSEPGSSIGAAIAASVTVPPDSEGQVTF 368
             +S G   + +W ++   G  D L     S+ S+ G  I  A+  S  +PP    ++ F
Sbjct: 353 DPDSAG---QQVWQDLLLDGQLDSLAG--KSLPSQKGVGIAGAVCVSGKLPPRGRHRLEF 407

Query: 369 SLAWDCPEVNF-MSGKTYYRRYTKFYGTHQNAAANIARDAILEHGSWELQIEAWQRPILE 427
           SLAWD P + F   G+ YYRRYT+F+G   +AA  ++  A+  +  WE +I AWQ P+LE
Sbjct: 408 SLAWDMPRIMFGAKGQVYYRRYTRFFGPDGDAAPALSHYALRHYADWEEKISAWQSPVLE 467

Query: 428 DKRLPEWYPITLFNELYYLNAGGAVWTDGSPPVHSLVTIGHRKFSLDWSQSDLKRIVDVP 487
           D+ LP WY   LFNELY+L  GG VW                              ++VP
Sbjct: 468 DRSLPAWYKSALFNELYFLADGGTVW------------------------------LEVP 497

Query: 488 NQNDTAVNILERMSSILEQIYTPVALNSSFGTNLLQDGEENIGQFLYLEGIEYLMWNTYD 547
             +                   P  L  S G   L+   +  G+F YLEG EY M+NTYD
Sbjct: 498 EDS------------------LPEELVGSMGQ--LRPLLQEYGRFAYLEGQEYRMYNTYD 537

Query: 548 VHFYSSFALIMLFPKIQLSIQRDFAAAVMMHDPSKMKLLDDGQWVSRKVLGAVPHDIGIC 607
           VHFY+SFALIML+PK++LS+Q D A A +  D ++ + L  G     K    +PHD+G  
Sbjct: 538 VHFYASFALIMLWPKLELSLQYDMALATLREDLTQRRFLMSGVTAPVKRRNVIPHDVGDP 597

Query: 608 D--PWFEVNAYCLYDTARWKDLNPKFVLQVYRDVVATGDKKFAKAVWPSVYVAMAYMDQF 665
           D  PW  VNAY ++DT  WKDLN KFVLQVYRD   TGD+ F + +WP     M    +F
Sbjct: 598 DDEPWLRVNAYEIHDTGDWKDLNLKFVLQVYRDYYLTGDQCFLRDMWPVCLAVMESEMKF 657

Query: 666 DRDGDGMIENDGFPDQTYDTWSVSGISAYSGGLWVAALQAASALAREVGDRGSEDYFLFK 725
           D+D DG+IEN G+ DQTYD W  +G SAY GGLW+AA+     +A   G + +++ F   
Sbjct: 658 DKDHDGLIENGGYADQTYDGWVTTGPSAYCGGLWLAAVAVMVQMAALCGAQEAQNKFSSI 717

Query: 726 FQKAKVVYEK-LWNGSYFNYDNSGSSQSSSIQADQLAGQWYARACGL----LPIVDEDKA 780
             + +  Y++ LWNG Y+NYD S   QS SI +DQ AGQW+ RA GL      +    + 
Sbjct: 718 LSRGQEAYDRLLWNGRYYNYDCSSQPQSCSIMSDQCAGQWFLRASGLGEGDSEVFPTQQV 777

Query: 781 RSALKKVYNYNVLKVMGGKRGAVNGMLPDGRVDMSSMQSREIWSGVTYAVAASMIHEDLA 840
             AL+ ++ +NV    GG  GAVNGM P G  D SS+QS E+W GV Y +AA+MI E L 
Sbjct: 778 VRALQTIFEFNVQAFAGGAMGAVNGMQPRGVPDTSSVQSDEVWVGVVYGLAATMIQEGLT 837

Query: 841 DIGFQTACGIYEAAWSGTGLGYAFQTPEAWNTDDQYRSLCYMRPLAIWAM 890
             GF+TA G Y   W   GL  AFQTPEA+     +RSL YMRPL+IWAM
Sbjct: 838 WEGFRTAEGCYRTVWERLGL--AFQTPEAYCQRRVFRSLAYMRPLSIWAM 885


>gi|410978559|ref|XP_003995657.1| PREDICTED: non-lysosomal glucosylceramidase [Felis catus]
          Length = 917

 Score =  573 bits (1476), Expect = e-160,   Method: Compositional matrix adjust.
 Identities = 340/858 (39%), Positives = 474/858 (55%), Gaps = 88/858 (10%)

Query: 46  PLSQFTLNWKETVQLAPIGVRIL-CLIREEAAKGKRAFIDPFIKRHLTSSHGVPLGGVGS 104
           P +   ++    ++   +G+R L    R+   + K  FID      L   +G PLGG+G 
Sbjct: 92  PFNANNISLSNLIKHLGMGLRYLQWWYRKTQVEKKTPFIDLINCVPLRQIYGCPLGGIGG 151

Query: 105 GSIGRSYRGEFQRWQIFPRVCEDKPVLANQFSVFVSRSNGQKYSSVLCPKTPEVLKDTTA 164
           G+I R +RG+F RWQ+ P + + + V+A+QF+V + R     Y  VL  + P VL+    
Sbjct: 152 GTITRGWRGQFCRWQLNPGMYQHQTVVADQFTVCLRREGTTVYQQVLSVERPSVLR---- 207

Query: 165 AGIGSWDWNLKGDKSTYHALYPRAWTVHEGEPDPELRIVCRQISPIIPHNYKESSYPVSV 224
               SW+W L G  + YHALYPRAWTV++  P   + + CRQI+PI+PH+Y++SS PV V
Sbjct: 208 ----SWNWGLCGYFAFYHALYPRAWTVYQ-LPGQNVTLTCRQITPILPHDYQDSSLPVGV 262

Query: 225 FTYTIYNSGKTSADITLLFTWTNSVGGDSEFTGQHYNSKTKM-NDG--VHAVLLHHRTSH 281
           F + + N G  + D++++F+  N +GG  +  G  +N    +  DG  V  +LLHH T  
Sbjct: 263 FVWDVENEGDEALDVSIMFSMRNGLGGGDDAPGGLWNEPFCLERDGETVQGLLLHHPT-- 320

Query: 282 QLP-PVTFALAAQETDGVHVSLCPHFVISGNSLGLTAKDMWHEIKEHGSFDRLNSMETSV 340
            LP P T A+AA+ T    V+      I+      T + +W ++ + G  D  +    S 
Sbjct: 321 -LPNPYTMAVAARLTADTTVTH-----ITAFDPDSTGQQVWQDLLQDGQLD--SPAGRST 372

Query: 341 TSEPGSSIGAAIAASVTVPPDSEGQVTFSLAWDCPEVNF-MSGKTYYRRYTKFYGTHQNA 399
            S+ G  I  A+  +  +PP  + ++ FSLAWD P + F   G+ YYRRYT+F+G   NA
Sbjct: 373 PSQKGVGIAGAVCVTGKLPPRGQCRLEFSLAWDMPRIMFGAKGQVYYRRYTRFFGRDGNA 432

Query: 400 AANIARDAILEHGSWELQIEAWQRPILEDKRLPEWYPITLFNELYYLNAGGAVWTDGSPP 459
           A  ++  A+  +  WE +I AWQ P+L+D+ LP WY   LFNELY+L  GG VW      
Sbjct: 433 APTLSHYALCRYTDWEEKISAWQSPVLDDRSLPAWYKSALFNELYFLADGGTVW------ 486

Query: 460 VHSLVTIGHRKFSLDWSQSDLKRIVDVPNQN--DTAVNILERMSSILEQIYTPVALNSSF 517
                                   ++VP  +  +  V  + ++  IL++           
Sbjct: 487 ------------------------LEVPEDSLPEELVGSMCQLRPILQEY---------- 512

Query: 518 GTNLLQDGEENIGQFLYLEGIEYLMWNTYDVHFYSSFALIMLFPKIQLSIQRDFAAAVMM 577
                       G+F YLEG EY M+NTYDVHFY+SFAL+ML+PK++LS+Q D A A + 
Sbjct: 513 ------------GRFGYLEGQEYRMYNTYDVHFYASFALVMLWPKLELSLQYDMALATLR 560

Query: 578 HDPSKMKLLDDGQWVSRKVLGAVPHDIGICD--PWFEVNAYCLYDTARWKDLNPKFVLQV 635
            D ++ + L  G     K    +PHDIG  D  PW  VNAY ++DTA WKDLN KFVLQV
Sbjct: 561 EDLTRRQYLMSGVMAPVKRRNVIPHDIGDPDDEPWLRVNAYVVHDTADWKDLNLKFVLQV 620

Query: 636 YRDVVATGDKKFAKAVWPSVYVAMAYMDQFDRDGDGMIENDGFPDQTYDTWSVSGISAYS 695
           YRD   TGD+ F + +WP     M    +FD+D DG+IEN G+ DQTYD W  +G SAY 
Sbjct: 621 YRDYYLTGDQGFLRDMWPVCLAVMESEMKFDKDQDGLIENGGYADQTYDGWVTTGPSAYC 680

Query: 696 GGLWVAALQAASALAREVGDRGSEDYFLFKFQKAKVVYEK-LWNGSYFNYDNSGSSQSSS 754
           GGLW+AA+     +A   G +  +D F    ++ +  YE+ LWNG Y+NYD S   QS S
Sbjct: 681 GGLWLAAVAVMVQMAVLCGAQDVQDKFSSILRRGQEAYERLLWNGRYYNYDCSPPPQSCS 740

Query: 755 IQADQLAGQWYARACGL----LPIVDEDKARSALKKVYNYNVLKVMGGKRGAVNGMLPDG 810
           I +DQ AGQW+ RA GL      +        AL+ ++ +NV    GG  GAVNGM P G
Sbjct: 741 IMSDQCAGQWFLRASGLGEGDTEVFPTPHVVCALQTIFEFNVRAFAGGAMGAVNGMQPHG 800

Query: 811 RVDMSSMQSREIWSGVTYAVAASMIHEDLADIGFQTACGIYEAAWSGTGLGYAFQTPEAW 870
             D SS+QS E+W GV Y +AA+MI E L   GFQTA G Y   W   GL  AFQTPEA+
Sbjct: 801 VPDRSSVQSDEVWVGVVYGLAATMIQEGLTWEGFQTAEGCYRTVWERLGL--AFQTPEAY 858

Query: 871 NTDDQYRSLCYMRPLAIW 888
                +RSL YMRPL+IW
Sbjct: 859 CQQRVFRSLAYMRPLSIW 876


>gi|344271640|ref|XP_003407645.1| PREDICTED: non-lysosomal glucosylceramidase [Loxodonta africana]
          Length = 967

 Score =  573 bits (1476), Expect = e-160,   Method: Compositional matrix adjust.
 Identities = 339/855 (39%), Positives = 460/855 (53%), Gaps = 82/855 (9%)

Query: 46  PLSQFTLNWKETVQLAPIGVRIL-CLIREEAAKGKRAFIDPFIKRHLTSSHGVPLGGVGS 104
           P     ++    ++L  +G R      ++   + K  FID      L   +G PLGG+G 
Sbjct: 144 PFDANNISLSNMIKLVGVGFRYFQWWYQKTRVEKKTPFIDMLNSVPLRQIYGCPLGGIGG 203

Query: 105 GSIGRSYRGEFQRWQIFPRVCEDKPVLANQFSVFVSRSNGQKYSSVLCPKTPEVLKDTTA 164
           G+I R +RG+F RWQ+ P + + + V+A+QF+V + R     Y  VL  + P VL+    
Sbjct: 204 GTITRGWRGQFCRWQLNPGMYQHQTVIADQFTVCLRREGQTVYQQVLSLERPSVLR---- 259

Query: 165 AGIGSWDWNLKGDKSTYHALYPRAWTVHEGEPDPELRIVCRQISPIIPHNYKESSYPVSV 224
               SW+W L G  + YHALYPRAWTV++  P   + + CRQI+PI+PH+Y++SS PV V
Sbjct: 260 ----SWNWGLCGFFAFYHALYPRAWTVYQ-LPGQNVTLTCRQITPILPHDYQDSSLPVGV 314

Query: 225 FTYTIYNSGKTSADITLLFTWTNSVGGDSEFTGQHYNSKTKMN-DG--VHAVLLHHRTSH 281
           F + + N G  + D++++FT  N +GG  +  G  +N    +  DG  V  +LLHH T  
Sbjct: 315 FVWDVENEGDEALDVSIMFTMRNGLGGRDDAPGGLWNEPFCLERDGETVQGLLLHHPTPP 374

Query: 282 QLPPVTFALAAQETDGVHVSLCPHFVISGNSLGLTAKDMWHEIKEHGSFDRLNSMETSVT 341
              P T A+AA+ T    V+      I+      T + +W ++ + G  D  +    S  
Sbjct: 375 N--PYTMAMAARLTADTTVTH-----ITAFDPDSTGQQVWQDLLQDGQLD--SPAGRSTP 425

Query: 342 SEPGSSIGAAIAASVTVPPDSEGQVTFSLAWDCPEVNF-MSGKTYYRRYTKFYGTHQNAA 400
           S+ G  I  A+  +  +PP     + FSLAWD P + F   G+ YYRRYT+F+G   +AA
Sbjct: 426 SQKGVGIAGAVCVAAKLPPRGRCCLEFSLAWDMPRIVFGAKGQVYYRRYTRFFGQDGSAA 485

Query: 401 ANIARDAILEHGSWELQIEAWQRPILEDKRLPEWYPITLFNELYYLNAGGAVWTDGSPPV 460
             ++  A+  +  WE +I AWQ PILED+ LP WY   LFNELY+L  GG VW       
Sbjct: 486 PALSHYALCRYADWEERISAWQSPILEDRSLPAWYKSALFNELYFLADGGTVW------- 538

Query: 461 HSLVTIGHRKFSLDWSQSDLKRIVDVPNQNDTAVNILERMSSILEQIYTPVALNSSFGTN 520
                                  ++VP    T                         G  
Sbjct: 539 -----------------------LEVPEDALTE--------------------EPEGGMR 555

Query: 521 LLQDGEENIGQFLYLEGIEYLMWNTYDVHFYSSFALIMLFPKIQLSIQRDFAAAVMMHDP 580
            L+      G+F YLEG EY M+NTYDVHFY+SFALIML+PK++LS+Q D A A +  D 
Sbjct: 556 QLRPMLREYGRFGYLEGQEYRMYNTYDVHFYASFALIMLWPKLELSLQYDMALATLKEDL 615

Query: 581 SKMKLLDDGQWVSRKVLGAVPHDIGICD--PWFEVNAYCLYDTARWKDLNPKFVLQVYRD 638
           +  + L  G     K    +PHDIG  D  PW  VNAY ++DTA WKDLN KFVLQVYRD
Sbjct: 616 TPRRYLMSGLTAPVKRRNVIPHDIGDPDDEPWLRVNAYVIHDTADWKDLNLKFVLQVYRD 675

Query: 639 VVATGDKKFAKAVWPSVYVAMAYMDQFDRDGDGMIENDGFPDQTYDTWSVSGISAYSGGL 698
              TGD+ F + +WP     M    +FD+D DG+IEN G+ DQTYD W  +G SAY GGL
Sbjct: 676 YYLTGDQGFLRDMWPVCLAVMESEMEFDKDHDGLIENGGYADQTYDGWVTTGPSAYCGGL 735

Query: 699 WVAALQAASALAREVGDRGSEDYFLFKFQKAKVVYEK-LWNGSYFNYDNSGSSQSSSIQA 757
           W+AA+     +A     +  ++ F     + K  YE+ LWNG Y+NYD+S   QS SI +
Sbjct: 736 WLAAVAVMVQMATLCEAQDIQEKFSSILSRGKEAYERLLWNGRYYNYDSSSQPQSRSIMS 795

Query: 758 DQLAGQWYARACGL----LPIVDEDKARSALKKVYNYNVLKVMGGKRGAVNGMLPDGRVD 813
           DQ AGQW+ RACGL      +        AL+ ++ +NV    GG  GAVNGM P G  D
Sbjct: 796 DQCAGQWFLRACGLGEGDTEVFPTPHVVRALQTIFEFNVQAFAGGAMGAVNGMQPHGVPD 855

Query: 814 MSSMQSREIWSGVTYAVAASMIHEDLADIGFQTACGIYEAAWSGTGLGYAFQTPEAWNTD 873
            SS+QS E+W GV Y +AA+MI E L   GF TA G Y   W    LG AFQTPEA+   
Sbjct: 856 TSSVQSDEVWVGVVYGLAATMIQEGLTSKGFWTAEGCYRTVWE--RLGLAFQTPEAYCQQ 913

Query: 874 DQYRSLCYMRPLAIW 888
             +RSL YMRPL+IW
Sbjct: 914 RVFRSLAYMRPLSIW 928


>gi|355567647|gb|EHH23988.1| Non-lysosomal glucosylceramidase [Macaca mulatta]
 gi|355753217|gb|EHH57263.1| Non-lysosomal glucosylceramidase [Macaca fascicularis]
 gi|380789499|gb|AFE66625.1| non-lysosomal glucosylceramidase [Macaca mulatta]
 gi|384943046|gb|AFI35128.1| non-lysosomal glucosylceramidase [Macaca mulatta]
          Length = 927

 Score =  572 bits (1475), Expect = e-160,   Method: Compositional matrix adjust.
 Identities = 338/856 (39%), Positives = 470/856 (54%), Gaps = 84/856 (9%)

Query: 46  PLSQFTLNWKETVQLAPIGVRIL-CLIREEAAKGKRAFIDPFIKRHLTSSHGVPLGGVGS 104
           P     ++ +  ++   +G+R L    R+   + K  FID      L   +G PLGG+G 
Sbjct: 100 PFQANNVSLRNMIKHIGMGLRYLQWWYRKTQVEKKTPFIDMINSVPLRQIYGCPLGGIGG 159

Query: 105 GSIGRSYRGEFQRWQIFPRVCEDKPVLANQFSVFVSRSNGQKYSSVLCPKTPEVLKDTTA 164
           G+I R +RG+F RWQ+ P + + + V+A+QF+V + R     Y  VL  + P VL+    
Sbjct: 160 GTITRGWRGQFCRWQLNPGMYQHRTVIADQFTVCLRREGQTVYQQVLSLERPSVLR---- 215

Query: 165 AGIGSWDWNLKGDKSTYHALYPRAWTVHEGEPDPELRIVCRQISPIIPHNYKESSYPVSV 224
               SW+W L G  + YHALYPRAWTV++  P  ++ + CRQI+PI+PH+Y++SS PV V
Sbjct: 216 ----SWNWGLCGYFAFYHALYPRAWTVYQ-LPGQDVTLTCRQITPILPHDYQDSSLPVGV 270

Query: 225 FTYTIYNSGKTSADITLLFTWTNSVGGDSEFTGQHYNSKTKMNDG---VHAVLLHHRTSH 281
           F + + N G  + D++++F+  N +GG  +  G  +N    +  G   V  +LLHH T  
Sbjct: 271 FVWDVENEGDEALDVSIMFSMRNGLGGGDDAPGGLWNEPFCLEHGGETVQGLLLHHPT-- 328

Query: 282 QLP-PVTFALAAQETDGVHVSLCPHFVISGNSLGLTAKDMWHEIKEHGSFDRLNSMETSV 340
            LP P T A+AA+ T    V+      I+      T + +W ++ + G  D  +    S 
Sbjct: 329 -LPNPYTMAVAARVTADTTVTH-----ITAFDPDSTGQQVWQDLLQDGQLD--SPAGQST 380

Query: 341 TSEPGSSIGAAIAASVTVPPDSEGQVTFSLAWDCPEVNF-MSGKTYYRRYTKFYGTHQNA 399
            ++ G  I  A+  S  + P  + ++ FSLAWD P + F   G+ +YRRYT+F+G   +A
Sbjct: 381 PTQKGVGIAGAVCVSSKLRPRGQCRLEFSLAWDMPRILFGAKGQVHYRRYTRFFGQDGDA 440

Query: 400 AANIARDAILEHGSWELQIEAWQRPILEDKRLPEWYPITLFNELYYLNAGGAVWTDGSPP 459
           A  ++  A+  +  WE +I AWQ P+L+D+ LP WY   LFNELY+L  GG VW +    
Sbjct: 441 APALSHYALCRYAEWEERISAWQSPVLDDRSLPAWYKSALFNELYFLADGGTVWLE---- 496

Query: 460 VHSLVTIGHRKFSLDWSQSDLKRIVDVPNQNDTAVNILERMSSILEQIYTPVALNSSFGT 519
                                                  R  S+ E++      N     
Sbjct: 497 --------------------------------------VREDSLPEEL----GRNMCHLC 514

Query: 520 NLLQDGEENIGQFLYLEGIEYLMWNTYDVHFYSSFALIMLFPKIQLSIQRDFAAAVMMHD 579
             L+D     G+F YLEG EY M+NTYDVHFY+SFALIML+PK++LS+Q D A A +  D
Sbjct: 515 PTLRD----YGRFGYLEGQEYRMYNTYDVHFYASFALIMLWPKLELSLQYDMALATLRED 570

Query: 580 PSKMKLLDDGQWVSRKVLGAVPHDIGICD--PWFEVNAYCLYDTARWKDLNPKFVLQVYR 637
            ++ + L  G     K    +PHDIG  D  PW  VNAY ++DTA WKDLN KFVLQVYR
Sbjct: 571 LTRRQYLMSGVMAPVKRRNVIPHDIGDPDDEPWLRVNAYLIHDTADWKDLNLKFVLQVYR 630

Query: 638 DVVATGDKKFAKAVWPSVYVAMAYMDQFDRDGDGMIENDGFPDQTYDTWSVSGISAYSGG 697
           D   TGD+ F K +WP     M    +FD+D DG+IEN G+ DQTYD W  +G SAY GG
Sbjct: 631 DYYLTGDQNFLKDMWPVCLAVMESEMKFDKDHDGLIENGGYADQTYDGWVTTGPSAYCGG 690

Query: 698 LWVAALQAASALAREVGDRGSEDYFLFKFQKAKVVYEK-LWNGSYFNYDNSGSSQSSSIQ 756
           LW+AA+     +A   G +  +D F     + +  YE+ LWNG Y+NYD+S   QS S+ 
Sbjct: 691 LWLAAVAVMVQMAALCGAQDIQDKFSSILHRGQEAYERLLWNGRYYNYDSSSQPQSRSVM 750

Query: 757 ADQLAGQWYARACGL----LPIVDEDKARSALKKVYNYNVLKVMGGKRGAVNGMLPDGRV 812
           +DQ AGQW+ +ACGL      +        AL+ ++  NV    GG  GAVNGM P G  
Sbjct: 751 SDQCAGQWFLKACGLGEGETEVFPTPHVVRALQTIFELNVQAFAGGAMGAVNGMQPHGVP 810

Query: 813 DMSSMQSREIWSGVTYAVAASMIHEDLADIGFQTACGIYEAAWSGTGLGYAFQTPEAWNT 872
           D SS+QS E+W GV Y +AA+MI E L   GFQTA G Y   W   GL  AFQTPEA+  
Sbjct: 811 DKSSVQSDEVWVGVVYGLAATMIQEGLTWEGFQTAEGCYRTVWERLGL--AFQTPEAYCQ 868

Query: 873 DDQYRSLCYMRPLAIW 888
              +RSL YMRPL+IW
Sbjct: 869 QRVFRSLAYMRPLSIW 884


>gi|109111050|ref|XP_001084705.1| PREDICTED: non-lysosomal glucosylceramidase [Macaca mulatta]
          Length = 927

 Score =  572 bits (1475), Expect = e-160,   Method: Compositional matrix adjust.
 Identities = 338/856 (39%), Positives = 470/856 (54%), Gaps = 84/856 (9%)

Query: 46  PLSQFTLNWKETVQLAPIGVRIL-CLIREEAAKGKRAFIDPFIKRHLTSSHGVPLGGVGS 104
           P     ++ +  ++   +G+R L    R+   + K  FID      L   +G PLGG+G 
Sbjct: 100 PFQANNVSLRNMIKHIGMGLRYLQWWYRKTQVEKKTPFIDMINSVPLRQIYGCPLGGIGG 159

Query: 105 GSIGRSYRGEFQRWQIFPRVCEDKPVLANQFSVFVSRSNGQKYSSVLCPKTPEVLKDTTA 164
           G+I R +RG+F RWQ+ P + + + V+A+QF+V + R     Y  VL  + P VL+    
Sbjct: 160 GTITRGWRGQFCRWQLNPGMYQHQTVIADQFTVCLRREGQTVYQQVLSLERPSVLR---- 215

Query: 165 AGIGSWDWNLKGDKSTYHALYPRAWTVHEGEPDPELRIVCRQISPIIPHNYKESSYPVSV 224
               SW+W L G  + YHALYPRAWTV++  P  ++ + CRQI+PI+PH+Y++SS PV V
Sbjct: 216 ----SWNWGLCGYFAFYHALYPRAWTVYQ-LPGQDVTLTCRQITPILPHDYQDSSLPVGV 270

Query: 225 FTYTIYNSGKTSADITLLFTWTNSVGGDSEFTGQHYNSKTKMNDG---VHAVLLHHRTSH 281
           F + + N G  + D++++F+  N +GG  +  G  +N    +  G   V  +LLHH T  
Sbjct: 271 FVWDVENEGDEALDVSIMFSMRNGLGGGDDAPGGLWNEPFCLEHGGETVQGLLLHHPT-- 328

Query: 282 QLP-PVTFALAAQETDGVHVSLCPHFVISGNSLGLTAKDMWHEIKEHGSFDRLNSMETSV 340
            LP P T A+AA+ T    V+      I+      T + +W ++ + G  D  +    S 
Sbjct: 329 -LPNPYTMAVAARVTADTTVTH-----ITAFDPDSTGQQVWQDLLQDGQLD--SPAGQST 380

Query: 341 TSEPGSSIGAAIAASVTVPPDSEGQVTFSLAWDCPEVNF-MSGKTYYRRYTKFYGTHQNA 399
            ++ G  I  A+  S  + P  + ++ FSLAWD P + F   G+ +YRRYT+F+G   +A
Sbjct: 381 PTQKGVGIAGAVCVSSKLRPRGQCRLEFSLAWDMPRILFGAKGQVHYRRYTRFFGQDGDA 440

Query: 400 AANIARDAILEHGSWELQIEAWQRPILEDKRLPEWYPITLFNELYYLNAGGAVWTDGSPP 459
           A  ++  A+  +  WE +I AWQ P+L+D+ LP WY   LFNELY+L  GG VW +    
Sbjct: 441 APALSHYALCRYAEWEERISAWQSPVLDDRSLPAWYKSALFNELYFLADGGTVWLE---- 496

Query: 460 VHSLVTIGHRKFSLDWSQSDLKRIVDVPNQNDTAVNILERMSSILEQIYTPVALNSSFGT 519
                                                  R  S+ E++      N     
Sbjct: 497 --------------------------------------VREDSLPEEL----GRNMCHLC 514

Query: 520 NLLQDGEENIGQFLYLEGIEYLMWNTYDVHFYSSFALIMLFPKIQLSIQRDFAAAVMMHD 579
             L+D     G+F YLEG EY M+NTYDVHFY+SFALIML+PK++LS+Q D A A +  D
Sbjct: 515 PTLRD----YGRFGYLEGQEYRMYNTYDVHFYASFALIMLWPKLELSLQYDMALATLRED 570

Query: 580 PSKMKLLDDGQWVSRKVLGAVPHDIGICD--PWFEVNAYCLYDTARWKDLNPKFVLQVYR 637
            ++ + L  G     K    +PHDIG  D  PW  VNAY ++DTA WKDLN KFVLQVYR
Sbjct: 571 LTRRQYLMSGVMAPVKRRNVIPHDIGDPDDEPWLRVNAYLIHDTADWKDLNLKFVLQVYR 630

Query: 638 DVVATGDKKFAKAVWPSVYVAMAYMDQFDRDGDGMIENDGFPDQTYDTWSVSGISAYSGG 697
           D   TGD+ F K +WP     M    +FD+D DG+IEN G+ DQTYD W  +G SAY GG
Sbjct: 631 DYYLTGDQNFLKDMWPVCLAVMESEMKFDKDHDGLIENGGYADQTYDGWVTTGPSAYCGG 690

Query: 698 LWVAALQAASALAREVGDRGSEDYFLFKFQKAKVVYEK-LWNGSYFNYDNSGSSQSSSIQ 756
           LW+AA+     +A   G +  +D F     + +  YE+ LWNG Y+NYD+S   QS S+ 
Sbjct: 691 LWLAAVAVMVQMAALCGAQDIQDKFSSILHRGQEAYERLLWNGRYYNYDSSSQPQSRSVM 750

Query: 757 ADQLAGQWYARACGL----LPIVDEDKARSALKKVYNYNVLKVMGGKRGAVNGMLPDGRV 812
           +DQ AGQW+ +ACGL      +        AL+ ++  NV    GG  GAVNGM P G  
Sbjct: 751 SDQCAGQWFLKACGLGEGETEVFPTPHVVRALQTIFELNVQAFAGGAMGAVNGMQPHGVP 810

Query: 813 DMSSMQSREIWSGVTYAVAASMIHEDLADIGFQTACGIYEAAWSGTGLGYAFQTPEAWNT 872
           D SS+QS E+W GV Y +AA+MI E L   GFQTA G Y   W   GL  AFQTPEA+  
Sbjct: 811 DKSSVQSDEVWVGVVYGLAATMIQEGLTWEGFQTAEGCYRTVWERLGL--AFQTPEAYCQ 868

Query: 873 DDQYRSLCYMRPLAIW 888
              +RSL YMRPL+IW
Sbjct: 869 QRVFRSLAYMRPLSIW 884


>gi|194669510|ref|XP_001790122.1| PREDICTED: non-lysosomal glucosylceramidase [Bos taurus]
 gi|297477943|ref|XP_002689726.1| PREDICTED: non-lysosomal glucosylceramidase [Bos taurus]
 gi|296484715|tpg|DAA26830.1| TPA: hypothetical protein BOS_8918 [Bos taurus]
          Length = 918

 Score =  572 bits (1474), Expect = e-160,   Method: Compositional matrix adjust.
 Identities = 336/830 (40%), Positives = 463/830 (55%), Gaps = 81/830 (9%)

Query: 72  REEAAKGKRAFIDPFIKRHLTSSHGVPLGGVGSGSIGRSYRGEFQRWQIFPRVCEDKPVL 131
           R+   + K+ FID      L   +G PLGG+G G+I R +RG+F RWQ+ P + + + V+
Sbjct: 117 RKTQVEKKKPFIDLINSVPLRQIYGCPLGGIGGGTITRGWRGQFCRWQLNPGMYQHQTVI 176

Query: 132 ANQFSVFVSRSNGQKYSSVLCPKTPEVLKDTTAAGIGSWDWNLKGDKSTYHALYPRAWTV 191
           ANQF+V + R     Y  VL  + P VL         SW+W L G  + YHALYPRAWTV
Sbjct: 177 ANQFTVCLRRRGQTVYQQVLSVERPSVL--------CSWNWGLCGYFAFYHALYPRAWTV 228

Query: 192 HEGEPDPELRIVCRQISPIIPHNYKESSYPVSVFTYTIYNSGKTSADITLLFTWTNSVGG 251
           ++  P   + + CRQI+PI+PH+Y++SS PV VF + + N G  + +++++F+  N +GG
Sbjct: 229 YQ-LPGQNVTLTCRQITPILPHDYQDSSLPVGVFVWDVENEGDEALEVSIMFSMRNGLGG 287

Query: 252 DSEFTGQHYNSKTKM-NDG--VHAVLLHHRTSHQLPPVTFALAAQETDGVHVSLCPHFVI 308
           + +  G  +N    + +DG  V  +LLHH T     P T A+AA+ +    V+    F  
Sbjct: 288 EDDAPGGLWNEPFCLEHDGETVQGLLLHHPTPPN--PYTMAVAARLSADTTVTHLTAF-- 343

Query: 309 SGNSLGLTAKDMWHEIKEHGSFDRLNSMETSVTSEPGSSIGAAIAASVTVPPDSEGQVTF 368
             +S G   + +W ++   G  D L     S+ S+ G  I  A+  S  +PP    ++ F
Sbjct: 344 DPDSAG---QQVWQDLLLDGQLDSLAG--KSLPSQKGVGIAGAVCVSGKLPPRGRHRLEF 398

Query: 369 SLAWDCPEVNF-MSGKTYYRRYTKFYGTHQNAAANIARDAILEHGSWELQIEAWQRPILE 427
           SLAWD P + F   G+ YYRRYT+F+G   +AA  ++  A+  +  WE +I AWQ P+LE
Sbjct: 399 SLAWDMPRIMFGAKGQVYYRRYTRFFGPDGDAAPALSHYALRHYADWEEKISAWQSPVLE 458

Query: 428 DKRLPEWYPITLFNELYYLNAGGAVWTDGSPPVHSLVTIGHRKFSLDWSQSDLKRIVDVP 487
           D+ LP WY   LFNELY+L  GG VW                              ++VP
Sbjct: 459 DRSLPAWYKSALFNELYFLADGGTVW------------------------------LEVP 488

Query: 488 NQNDTAVNILERMSSILEQIYTPVALNSSFGTNLLQDGEENIGQFLYLEGIEYLMWNTYD 547
             +                   P  L  S G   L+   +  G+F YLEG EY M+NTYD
Sbjct: 489 EDS------------------LPEELVGSMGQ--LRPLLQEYGRFAYLEGQEYRMYNTYD 528

Query: 548 VHFYSSFALIMLFPKIQLSIQRDFAAAVMMHDPSKMKLLDDGQWVSRKVLGAVPHDIGIC 607
           VHFY+SFALIML+PK++LS+Q D A A +  D ++ + L  G     K    +PHD+G  
Sbjct: 529 VHFYASFALIMLWPKLELSLQYDMALATLREDLTQRRFLMSGVTAPVKRRNVIPHDVGDP 588

Query: 608 D--PWFEVNAYCLYDTARWKDLNPKFVLQVYRDVVATGDKKFAKAVWPSVYVAMAYMDQF 665
           D  PW  VNAY ++DT  WKDLN KFVLQVYRD   TGD+ F + +WP     M    +F
Sbjct: 589 DDEPWLRVNAYEIHDTGDWKDLNLKFVLQVYRDYYLTGDQCFLRDMWPVCLAVMESEMKF 648

Query: 666 DRDGDGMIENDGFPDQTYDTWSVSGISAYSGGLWVAALQAASALAREVGDRGSEDYFLFK 725
           D+D DG+IEN G+ DQTYD W  +G SAY GGLW+AA+     +A   G + +++ F   
Sbjct: 649 DKDHDGLIENGGYADQTYDGWVTTGPSAYCGGLWLAAVAVMVQMAALCGAQEAQNKFSSI 708

Query: 726 FQKAKVVYEK-LWNGSYFNYDNSGSSQSSSIQADQLAGQWYARACGL----LPIVDEDKA 780
             + +  Y++ LWNG Y+NYD S   QS SI +DQ AGQW+ RA GL      +    + 
Sbjct: 709 LSRGQEAYDRLLWNGRYYNYDCSSQPQSCSIMSDQCAGQWFLRASGLGEGDSEVFPTQQV 768

Query: 781 RSALKKVYNYNVLKVMGGKRGAVNGMLPDGRVDMSSMQSREIWSGVTYAVAASMIHEDLA 840
             AL+ ++ +NV    GG  GAVNGM P G  D SS+QS E+W GV Y +AA+MI E L 
Sbjct: 769 VRALQTIFEFNVQAFAGGAMGAVNGMQPCGVPDTSSVQSDEVWVGVVYGLAATMIQEGLT 828

Query: 841 DIGFQTACGIYEAAWSGTGLGYAFQTPEAWNTDDQYRSLCYMRPLAIWAM 890
             GF+TA G Y   W   GL  AFQTPEA+     +RSL YMRPL+IWAM
Sbjct: 829 REGFRTAEGCYRTVWERLGL--AFQTPEAYCQRRVFRSLAYMRPLSIWAM 876


>gi|395855828|ref|XP_003800351.1| PREDICTED: LOW QUALITY PROTEIN: non-lysosomal glucosylceramidase
            [Otolemur garnettii]
          Length = 1147

 Score =  572 bits (1473), Expect = e-160,   Method: Compositional matrix adjust.
 Identities = 343/862 (39%), Positives = 470/862 (54%), Gaps = 90/862 (10%)

Query: 46   PLSQFTLNWKETVQLAPIGVRILCL-IREEAAKGKRAFIDPFIKRHLTSSHGVPLGGVGS 104
            P     L+    ++   +G+R L    R+   + K  FID      L   +G PLGG+G 
Sbjct: 314  PFHANNLSLSNIIKHLGMGLRYLQWWYRKTQVEKKAPFIDMINCVPLRQIYGCPLGGIGG 373

Query: 105  GSIGRSYRGEFQRWQIFP------RVCEDKPVLANQFSVFVSRSNGQKYSSVLCPKTPEV 158
            G+I R +RG+F RWQ+ P       V  D+P+   +F+V + R     Y  VL  + P V
Sbjct: 374  GTITRGWRGQFCRWQLNPGMYQHRTVIADQPISPTKFTVCLRRKGQTVYQQVLSLECPSV 433

Query: 159  LKDTTAAGIGSWDWNLKGDKSTYHALYPRAWTVHEGEPDPELRIVCRQISPIIPHNYKES 218
            L+        SW+W L G  + YHALYPRAWTV++  P   + + CRQI+PI+PH+Y++S
Sbjct: 434  LR--------SWNWGLCGYFAFYHALYPRAWTVYQ-LPGQNVTLTCRQITPILPHDYQDS 484

Query: 219  SYPVSVFTYTIYNSGKTSADITLLFTWTNSVGGDSEFTGQHYNSKTKM-NDG--VHAVLL 275
            S PV VF + + N G  + D++++F+  N +GG  +  G  +N    +  DG  V  +LL
Sbjct: 485  SLPVGVFVWDVENEGDEALDVSIMFSMRNGLGGGDDAPGGLWNEPFCLERDGETVQGLLL 544

Query: 276  HHRTSHQLP-PVTFALAAQETDGVHVSLCPHFVISGNSLGLTAKDMWHEIKEHGSFDRLN 334
            HH T   LP P T A+AA+ T    V+      I+      T + +W ++ + G  D  +
Sbjct: 545  HHPT---LPNPYTMAVAARCTADTTVTH-----ITAFDPDSTGQQVWQDLLQDGQLD--S 594

Query: 335  SMETSVTSEPGSSIGAAIAASVTVPPDSEGQVTFSLAWDCPEVNF-MSGKTYYRRYTKFY 393
                S  S+ G  +  A+  S  +PP  + ++ FSLAWD P++ F   G+ +YRRYT+F+
Sbjct: 595  PAGQSAPSQKGVGVAGAVCVSGKLPPQGQCRLEFSLAWDMPKIMFGAKGQVHYRRYTRFF 654

Query: 394  GTHQNAAANIARDAILEHGSWELQIEAWQRPILEDKRLPEWYPITLFNELYYLNAGGAVW 453
            G   NAA  ++  A+  +  WE +I AWQ P+L+D+ LP WY   LFNELY+L  GG VW
Sbjct: 655  GQDGNAAPALSHYALCRYADWEERISAWQSPVLDDRSLPAWYKSALFNELYFLADGGTVW 714

Query: 454  TDGSPPVHSLVTIGHRKFSLDWSQSDLKRIVDVPNQNDTAVNILERMSSILEQIYTPVAL 513
             +                                           R  SI E++      
Sbjct: 715  LE------------------------------------------VREDSISEEL------ 726

Query: 514  NSSFGTNLLQDGEENIGQFLYLEGIEYLMWNTYDVHFYSSFALIMLFPKIQLSIQRDFAA 573
                G   L     + G+F YLEG EY M+NTYDVHFY+SFALIML+PK++LS+Q D A 
Sbjct: 727  --GGGMCQLHPMLRDYGRFGYLEGQEYRMYNTYDVHFYASFALIMLWPKLELSLQYDMAL 784

Query: 574  AVMMHDPSKMKLLDDGQWVSRKVLGAVPHDIGICD--PWFEVNAYCLYDTARWKDLNPKF 631
            A +  D ++ + L +G     K    +PHDIG  D  PW  VNAY ++DTA WKDLN KF
Sbjct: 785  ATLREDLTRRRYLMNGVVAPVKRRNVIPHDIGDPDDEPWLRVNAYLIHDTADWKDLNLKF 844

Query: 632  VLQVYRDVVATGDKKFAKAVWPSVYVAMAYMDQFDRDGDGMIENDGFPDQTYDTWSVSGI 691
            VLQVYRD   TGD+ F K +WP     M    +FD+D DG+IEN G+ DQTYD W  +G 
Sbjct: 845  VLQVYRDYYLTGDQGFLKDMWPVCLAVMESEMKFDKDHDGLIENGGYADQTYDAWVTTGP 904

Query: 692  SAYSGGLWVAALQAASALAREVGDRGSEDYFLFKFQKAKVVYEK-LWNGSYFNYDNSGSS 750
            SAY GGLW+AA+     +A   G +  +D F    ++ +  YE+ LWNG Y+NYD+S   
Sbjct: 905  SAYCGGLWLAAVAVMVQMAALCGAQDLQDKFSSILRQGQEAYERLLWNGRYYNYDSSSQP 964

Query: 751  QSSSIQADQLAGQWYARACGL----LPIVDEDKARSALKKVYNYNVLKVMGGKRGAVNGM 806
            QS SI +DQ AGQW+ +ACGL      +        AL+ +Y  NV    GG  GAVNGM
Sbjct: 965  QSRSIMSDQCAGQWFLKACGLGEGDTEVFPTQHVVRALQTIYELNVQAFAGGAMGAVNGM 1024

Query: 807  LPDGRVDMSSMQSREIWSGVTYAVAASMIHEDLADIGFQTACGIYEAAWSGTGLGYAFQT 866
             P G  D SS+QS E+W GV Y +AA+MI E L   GF+TA G Y   W    LG AFQT
Sbjct: 1025 QPHGVPDRSSVQSDEVWVGVVYGLAATMIQEGLTWEGFRTAEGCYRTVWE--RLGLAFQT 1082

Query: 867  PEAWNTDDQYRSLCYMRPLAIW 888
            PEA+     +RSL YMRPL+IW
Sbjct: 1083 PEAYCQQRVFRSLAYMRPLSIW 1104


>gi|66801399|ref|XP_629625.1| hypothetical protein DDB_G0292446 [Dictyostelium discoideum AX4]
 gi|60463006|gb|EAL61202.1| hypothetical protein DDB_G0292446 [Dictyostelium discoideum AX4]
          Length = 1302

 Score =  572 bits (1473), Expect = e-160,   Method: Compositional matrix adjust.
 Identities = 338/945 (35%), Positives = 495/945 (52%), Gaps = 142/945 (15%)

Query: 36   WQRKLSTGEIPLSQFTLNWKETVQLAPIGVRILCLIREEAAKGKRAFIDPFIKRHLTSSH 95
            W R+L         F ++  + ++L  +G R+   +++E + G+   +DPF         
Sbjct: 373  WARRLDNNYPEHKHFQVSMWQGLKLVGMGYRMWKYVKKETSSGRVPIMDPFNLPKPGPIM 432

Query: 96   GVPLGGVGSGSIGRSYRGEFQRWQIFPRVCEDKPVLANQFSVFVS--------------- 140
            GVP+GG+GSGSI R ++G+F RW +   +  ++ V  ++FSV+++               
Sbjct: 433  GVPIGGIGSGSINRGWKGDFVRWNLNNGLVSNEIVDVDKFSVYINFDNSNTSSSSSSSSS 492

Query: 141  --------------------------------RSNGQKYSSVLCPKTPEVLKDTTAAGIG 168
                                                ++ + VL P  P+     +   + 
Sbjct: 493  NSSTPPPQFSNNNNNSYNNSYNNNINNNGFDNNLKSKQKAVVLYPGKPK-----SNLNLD 547

Query: 169  SWDWNLKGDKSTYHALYPRAWTVHEGEPDPELRIVCRQISPIIPHNYKESSYPVSVFTYT 228
             W+W LKGD S Y  L+PRAWTV+E EP P++R+ C+Q+SP+IPHNY+ESSYP  V+ + 
Sbjct: 548  VWNWGLKGDNSCYFGLFPRAWTVYE-EPHPDIRLTCKQVSPVIPHNYQESSYPCGVYVWK 606

Query: 229  IYNSGK-TSADITLLFTWTNSVGGDSEFTGQHYNSKTKMNDG------------------ 269
            I N+ + +SAD++L+ TW NS+G  S+  G H+N      D                   
Sbjct: 607  IENNNQYSSADVSLMLTWQNSIGTQSDQDGGHFNKYFTFGDNDQQSPSSSSSSSSGNNKK 666

Query: 270  -VHAVLLHHRTSHQ--LP---------PVTFALAAQETDGVHVSLCPHFVISGNSLGLTA 317
             +  + + H+   +  LP         P+ +++A  +   V +S C  F  +     L A
Sbjct: 667  NIKGITMIHKKEQKCTLPNGNTKVFNDPMEYSIAVMDEPDVEISFCERFETTSR---LDA 723

Query: 318  KDMWHEIKEHGSFDRLNSMETSVTSEPGSSIGAAIAASVTVPPDSEGQVTFSLAWDCPEV 377
             ++W+   + G    L + + S  S P   IGAA+AA V V   S   + F ++WD P  
Sbjct: 724  ANLWYTFNKKG---ELENNKDSRPSAPKKPIGAAVAAKVKVAAGSSKTIVFCISWDAPIT 780

Query: 378  NFMSGKTYYRRYTKFYGTHQNAAANIARDAILEHGSWELQIEAWQRPILEDKRLPEWYPI 437
             F  G  YY+RYTKFYG     +  IA DAI     W+ QI  WQ+PIL D  LP +Y +
Sbjct: 781  RFQMGSAYYKRYTKFYGNQGGNSQRIAYDAINNWKHWDNQIIQWQQPILSDPELPTFYKM 840

Query: 438  TLFNELYYLNAGGAVWTDGSPPVHSLVTIGHRKFSLDWSQSDLKRIVDVPNQNDTAVNIL 497
             LFNELYYL  GG VWT GSP                        I  VP++  T   + 
Sbjct: 841  ALFNELYYLVDGGTVWTHGSP------------------------IDPVPSRYPTKTQLK 876

Query: 498  ERMSSILEQIYTPVALNSSFGTNLLQDGEENIGQFLYLEGIEYLMWNTYDVHFYSSFALI 557
            +  ++                TN       +IG+F YLE +EYLM+NTYDV FY+SFAL 
Sbjct: 877  QEDTT---------------DTN-------HIGRFAYLESLEYLMYNTYDVMFYASFALT 914

Query: 558  MLFPKIQLSIQRDFAAAVMMHDPSKMKLLDDGQWVSRKVLGAVPHDIGIC--DPWFEVNA 615
            ML+P +++S+Q D A A  +      + +  G  + RK  G VPHD+G    DPW  VN+
Sbjct: 915  MLWPLLEISLQYDIADATQLDYGINWEGIHSGHLMPRKKQGTVPHDLGNPGEDPWKRVNS 974

Query: 616  YCLYDTARWKDLNPKFVLQVYRDVVATGDKKFAKAVWPSVY-VAMAYMDQFDRDGDGMIE 674
            YC+ D +RWKDL  KFVLQVYRD +   DK F    +  V  V    ++ FD D DG+++
Sbjct: 975  YCIQDVSRWKDLPSKFVLQVYRDYLVMEDKNFLLQCYNVVEGVIQRTLEHFDFDHDGVVD 1034

Query: 675  NDGFPDQTYDTWSVSGISAYSGGLWVAALQAASALAREVGDRGSEDYFLFKFQK-AKVVY 733
            N+GFPDQTYD W  +G SAYSGGLW+AAL+ A+ +A+ +G    E  +   F+K +K   
Sbjct: 1035 NEGFPDQTYDVWPATGCSAYSGGLWLAALKVAAEIAKILGFNEDESIYNAIFEKGSKSFT 1094

Query: 734  EKLWNGSYFNYDNSGSSQSSSIQADQLAGQWYARACGLLPIVDEDKARSALKKVYNYNVL 793
            +KLWNG YFNYD+S S  S SI +D LAG WY  +CGL   +  D+A S+L  +  YNV 
Sbjct: 1095 KKLWNGHYFNYDSSKSVHSDSIMSDMLAGHWYLLSCGLKSYMTFDQALSSLSIINEYNVK 1154

Query: 794  KVMGGKRGAVNGMLPDGRVDMSSMQSREIWSGVTYAVAASMIHEDLADIGFQTACGIYEA 853
                G  GAVNGM P+G VD + +QS E+W G ++++AA+MI   +    +    GI  +
Sbjct: 1155 SYSKGVCGAVNGMRPEGTVDTTCLQSSEVWIGTSFSLAATMILHHMDSDAWDLVKGIVNS 1214

Query: 854  AWSGTGLGYAFQTPEAWNTDDQYRSLCYMRPLAIWAMQWALTRPK 898
            ++     G+ +QTPEAW+ +  +R+  YMRPL+IW++QWAL + K
Sbjct: 1215 SYQ--KWGFQYQTPEAWDQNGCFRAASYMRPLSIWSIQWALQKRK 1257


>gi|385648282|ref|NP_001245314.1| glucosidase, beta (bile acid) 2 [Sus scrofa]
          Length = 915

 Score =  572 bits (1473), Expect = e-160,   Method: Compositional matrix adjust.
 Identities = 336/831 (40%), Positives = 460/831 (55%), Gaps = 87/831 (10%)

Query: 72  REEAAKGKRAFIDPFIKRHLTSSHGVPLGGVGSGSIGRSYRGEFQRWQIFPRVCEDKPVL 131
           R+   + K  FID      L   +G PLGG+G G+I R +RG+F RWQ+ P +   + V+
Sbjct: 119 RKTRVEKKTPFIDLINCVPLRQIYGCPLGGIGGGTITRGWRGQFCRWQLNPGMYRHQTVI 178

Query: 132 ANQFSVFVSRSNGQKYSSVLCPKTPEVLKDTTAAGIGSWDWNLKGDKSTYHALYPRAWTV 191
           A+QF+V + R     Y  VL  + P VL+        SW+W L G  + YHALYPRAWTV
Sbjct: 179 ADQFTVCLRRKGQTVYQQVLSVEHPSVLR--------SWNWGLCGSFAFYHALYPRAWTV 230

Query: 192 HEGEPDPELRIVCRQISPIIPHNYKESSYPVSVFTYTIYNSGKTSADITLLFTWTNSVGG 251
           ++  P   + + CRQI+PI+PH+Y++SS PV VF +   N G  + D++++F+  N +GG
Sbjct: 231 YQ-LPGQSVTLTCRQITPILPHDYQDSSLPVGVFVWDAENEGDETLDVSIMFSMRNGLGG 289

Query: 252 DSEFTGQHYNSKTKMN-DG--VHAVLLHHRTSHQLP-PVTFALAAQETDGVHVSLCPHFV 307
           D +  G  +N    +  DG  +  +LLHH T   LP P T A+AA+ T    V+    F 
Sbjct: 290 DEDAPGGLWNEPFCLERDGETIQGLLLHHPT---LPNPYTMAMAARLTADTTVTHITAFD 346

Query: 308 ISGNSLGLTAKDMWHEIKEHGSFDRLNSMETSVTSEPGSSIGAAIAASVTVPPDSEGQVT 367
             G     T + +W ++   G  D  +    S  S+ G  I  A+  +  +PP    ++ 
Sbjct: 347 PDG-----TGQQVWQDLLXDGQLD--SPAGQSPPSQKGVGIAGAVCVTTRLPPGGRRRLE 399

Query: 368 FSLAWDCPEVNF-MSGKTYYRRYTKFYGTHQNAAANIARDAILEHGSWELQIEAWQRPIL 426
           FSLAWD P + F   G+ Y+RRYT+F+G   +AA  ++  A+  +  WE +I AWQ P+L
Sbjct: 400 FSLAWDMPRIMFGAKGQVYHRRYTRFFGPDGDAAPALSHHALSRYADWEERISAWQSPVL 459

Query: 427 EDKRLPEWYPITLFNELYYLNAGGAVWTDGSPPVHSLVTIGHRKFSLDWSQSDLKRIVDV 486
           +D+ LP WY   LFNELY+L  GG VW                              ++V
Sbjct: 460 DDRSLPAWYKSALFNELYFLADGGTVW------------------------------LEV 489

Query: 487 PNQN--DTAVNILERMSSILEQIYTPVALNSSFGTNLLQDGEENIGQFLYLEGIEYLMWN 544
           P  +  +  V  + ++  IL++                       G+F YLEG EY M+N
Sbjct: 490 PEDSLPEDLVGSMCQLRPILQEY----------------------GRFGYLEGQEYRMYN 527

Query: 545 TYDVHFYSSFALIMLFPKIQLSIQRDFAAAVMMHDPSKMKLLDDGQWVSRKVLGAVPHDI 604
           TYDVHFY+SFALIML+PK++LS+Q D A A +  D ++ + L  G     K    +PHDI
Sbjct: 528 TYDVHFYASFALIMLWPKLELSLQYDMALATLREDLTERRYLMSGITAPVKRRNVIPHDI 587

Query: 605 GICD--PWFEVNAYCLYDTARWKDLNPKFVLQVYRDVVATGDKKFAKAVWPSVYVAMAYM 662
           G  D  PW  VNAY ++DTA WKDLN KFVLQVYRD   TGD+ F + +WP     M   
Sbjct: 588 GDPDDEPWLRVNAYVVHDTADWKDLNLKFVLQVYRDYYLTGDQVFLRDMWPVCLAVMESE 647

Query: 663 DQFDRDGDGMIENDGFPDQTYDTWSVSGISAYSGGLWVAALQAASALAREVGDRGSEDYF 722
            +FD+D DG+IEN G+ DQTYD W  +G SAY GGLW+AA+     +A   G R  +D F
Sbjct: 648 MKFDKDQDGLIENGGYADQTYDGWVTTGPSAYCGGLWLAAVAVMVQMAALCGTREVQDQF 707

Query: 723 LFKFQKAKVVYEK-LWNGSYFNYDNSGSSQSSSIQADQLAGQWYARACGL----LPIVDE 777
                + +  YE+ LWNG Y+NYD S   Q+ SI +DQ AGQW+ RA GL      +   
Sbjct: 708 SSILSRGREAYERLLWNGRYYNYDCSSQPQAYSIMSDQCAGQWFLRASGLGEGDTEVFPT 767

Query: 778 DKARSALKKVYNYNVLKVMGGKRGAVNGMLPDGRVDMSSMQSREIWSGVTYAVAASMIHE 837
                AL+ ++ +NV    GG  GAVNGM P G  D SS+QS E+W GV YA+AA+MI E
Sbjct: 768 PHVVQALQTIFEFNVQAFAGGAMGAVNGMQPHGVPDRSSVQSDEVWVGVVYALAATMIQE 827

Query: 838 DLADIGFQTACGIYEAAWSGTGLGYAFQTPEAWNTDDQYRSLCYMRPLAIW 888
            L   GF+TA G Y   W   GL  AFQTPEA+     +RSL YMRPL+IW
Sbjct: 828 GLTWEGFRTAEGCYRTVWERLGL--AFQTPEAYCQQRVFRSLAYMRPLSIW 876


>gi|426361725|ref|XP_004048050.1| PREDICTED: non-lysosomal glucosylceramidase isoform 1 [Gorilla
           gorilla gorilla]
          Length = 927

 Score =  572 bits (1473), Expect = e-160,   Method: Compositional matrix adjust.
 Identities = 332/831 (39%), Positives = 459/831 (55%), Gaps = 87/831 (10%)

Query: 72  REEAAKGKRAFIDPFIKRHLTSSHGVPLGGVGSGSIGRSYRGEFQRWQIFPRVCEDKPVL 131
           R+   + K  FID      L   +G PLGG+G G+I R +RG+F RWQ+ P + + + V+
Sbjct: 127 RKTQVEKKTPFIDMINSVPLRQIYGCPLGGIGGGTITRGWRGQFCRWQLNPGMYQHRTVI 186

Query: 132 ANQFSVFVSRSNGQKYSSVLCPKTPEVLKDTTAAGIGSWDWNLKGDKSTYHALYPRAWTV 191
           A+QF+V + R     Y  VL  + P VL+        +W+W L G  + YHALYPRAWTV
Sbjct: 187 ADQFTVCLRREGQTVYQQVLSLERPSVLR--------TWNWGLCGYFAFYHALYPRAWTV 238

Query: 192 HEGEPDPELRIVCRQISPIIPHNYKESSYPVSVFTYTIYNSGKTSADITLLFTWTNSVGG 251
           ++  P   + + CRQI+PI+PH+Y++SS PV VF + + N G  + D++++F+  N +GG
Sbjct: 239 YQ-LPGQNVTLTCRQITPILPHDYQDSSLPVGVFVWDVENEGDEALDVSIMFSMRNGLGG 297

Query: 252 DSEFTGQHYNSK---TKMNDGVHAVLLHHRTSHQLP-PVTFALAAQETDGVHVSLCPHFV 307
             +  G  +N      +  + V  +LLHH T   LP P T A+AA+ T    V+      
Sbjct: 298 GDDAPGGLWNEPFCLERSGETVRGLLLHHPT---LPNPYTMAVAARVTADTTVTH----- 349

Query: 308 ISGNSLGLTAKDMWHEIKEHGSFDRLNSMETSVTSEPGSSIGAAIAASVTVPPDSEGQVT 367
           I+      T + +W ++ + G  D  +    S  ++ G  I  A+  S  + P  + ++ 
Sbjct: 350 ITAFDPDSTGQQVWQDLLQDGQLD--SPTGQSTPTQKGVGIAGAVCVSSKLRPRGQCRLE 407

Query: 368 FSLAWDCPEVNF-MSGKTYYRRYTKFYGTHQNAAANIARDAILEHGSWELQIEAWQRPIL 426
           FSLAWD P + F   G+ +YRRYT+F+G   +AA  ++  A+  +  WE +I AWQ P+L
Sbjct: 408 FSLAWDMPRIMFGAKGQVHYRRYTRFFGQDGDAAPALSHYALCRYAEWEERISAWQSPVL 467

Query: 427 EDKRLPEWYPITLFNELYYLNAGGAVWTDGSPPVHSLVTIGHRKFSLDWSQSDLKRIVDV 486
           +D+ LP WY   LFNELY+L  GG VW +                               
Sbjct: 468 DDRSLPAWYKSALFNELYFLADGGTVWLE------------------------------- 496

Query: 487 PNQNDTAVNILERMSSILEQIYTPVALNSSFGTNL--LQDGEENIGQFLYLEGIEYLMWN 544
                           +LE      +L    G N+  L+    + G+F YLEG EY M+N
Sbjct: 497 ----------------VLED-----SLPEELGRNMCHLRPTLRDYGRFGYLEGQEYRMYN 535

Query: 545 TYDVHFYSSFALIMLFPKIQLSIQRDFAAAVMMHDPSKMKLLDDGQWVSRKVLGAVPHDI 604
           TYDVHFY+SFALIML+PK++LS+Q D A A +  D ++ + L  G     K    +PHDI
Sbjct: 536 TYDVHFYASFALIMLWPKLELSLQYDMALATLREDLTRRRYLMSGVMAPVKRRNVIPHDI 595

Query: 605 GICD--PWFEVNAYCLYDTARWKDLNPKFVLQVYRDVVATGDKKFAKAVWPSVYVAMAYM 662
           G  D  PW  VNAY ++DTA WKDLN KFVLQVYRD   TGD+ F K +WP     M   
Sbjct: 596 GDPDDEPWLRVNAYLIHDTADWKDLNLKFVLQVYRDYYLTGDQNFLKDMWPVCLAVMESE 655

Query: 663 DQFDRDGDGMIENDGFPDQTYDTWSVSGISAYSGGLWVAALQAASALAREVGDRGSEDYF 722
            +FD+D DG+IEN G+ DQTYD W  +G SAY GGLW+AA+     +A   G +  +D F
Sbjct: 656 MKFDKDHDGLIENGGYADQTYDGWVTTGPSAYCGGLWLAAVAVMVQMAALCGAQDIQDKF 715

Query: 723 LFKFQKAKVVYEK-LWNGSYFNYDNSGSSQSSSIQADQLAGQWYARACGL----LPIVDE 777
                + +  YE+ LWNG Y+NYD+S   QS S+ +DQ AGQW+ +ACGL      +   
Sbjct: 716 SSILSRGQEAYERLLWNGRYYNYDSSSRPQSRSVMSDQCAGQWFLKACGLGEGDTEVFPT 775

Query: 778 DKARSALKKVYNYNVLKVMGGKRGAVNGMLPDGRVDMSSMQSREIWSGVTYAVAASMIHE 837
                AL+ ++  NV    GG  GAVNGM P G  D SS+QS E+W GV Y +AA+MI E
Sbjct: 776 QHVVRALQTIFELNVQAFAGGAMGAVNGMQPHGVPDKSSVQSDEVWVGVVYGLAATMIQE 835

Query: 838 DLADIGFQTACGIYEAAWSGTGLGYAFQTPEAWNTDDQYRSLCYMRPLAIW 888
            L   GFQTA G Y   W   GL  AFQTPEA+     +RSL YMRPL+IW
Sbjct: 836 GLTWEGFQTAEGCYRTVWERLGL--AFQTPEAYCQQRVFRSLAYMRPLSIW 884


>gi|297684086|ref|XP_002819683.1| PREDICTED: non-lysosomal glucosylceramidase isoform 1 [Pongo
           abelii]
          Length = 927

 Score =  572 bits (1473), Expect = e-160,   Method: Compositional matrix adjust.
 Identities = 333/824 (40%), Positives = 456/824 (55%), Gaps = 87/824 (10%)

Query: 79  KRAFIDPFIKRHLTSSHGVPLGGVGSGSIGRSYRGEFQRWQIFPRVCEDKPVLANQFSVF 138
           K  FID      L   +G PLGG+G G+I R +RG+F RWQ+ P + + + V+A+QF+V 
Sbjct: 134 KTPFIDMINSVPLRQIYGCPLGGIGGGTITRGWRGQFCRWQLNPGMYQHRTVIADQFTVC 193

Query: 139 VSRSNGQKYSSVLCPKTPEVLKDTTAAGIGSWDWNLKGDKSTYHALYPRAWTVHEGEPDP 198
           + R     Y  VL  + P VL+        SW+W L G  + YHALYPRAWTV++  P  
Sbjct: 194 LRREGQTVYQQVLSLERPTVLR--------SWNWGLCGYFAFYHALYPRAWTVYQ-LPGQ 244

Query: 199 ELRIVCRQISPIIPHNYKESSYPVSVFTYTIYNSGKTSADITLLFTWTNSVGGDSEFTGQ 258
            + + CRQI+PI+PH+Y++SS PV VF + + N G  + D++++F+  N +GG  +  G 
Sbjct: 245 NVTLTCRQITPILPHDYQDSSLPVGVFVWDVENEGDEALDVSIMFSMRNGLGGGDDAPGG 304

Query: 259 HYNSKTKMNDG---VHAVLLHHRTSHQLP-PVTFALAAQETDGVHVSLCPHFVISGNSLG 314
            +N    +  G   V  +LLHH T   LP P T A+AA+ T    V+      I+     
Sbjct: 305 LWNEPFCLERGRETVQGLLLHHPT---LPNPYTMAVAARVTADTTVTH-----ITAFDPD 356

Query: 315 LTAKDMWHEIKEHGSFDRLNSMETSVTSEPGSSIGAAIAASVTVPPDSEGQVTFSLAWDC 374
            T + +W ++ + G  D  +    S  ++ G  I  A+  S  + P  + ++ FSLAWD 
Sbjct: 357 STGQQVWQDLLQDGQLD--SPTGQSTPTQKGVGIAGAVCISSKLRPRGQCRLEFSLAWDM 414

Query: 375 PEVNF-MSGKTYYRRYTKFYGTHQNAAANIARDAILEHGSWELQIEAWQRPILEDKRLPE 433
           P + F   G+ +YRRYT+F+G   +AA  ++  A+  +  WE +I AWQ P+L+D+ LP 
Sbjct: 415 PRIMFGAKGQVHYRRYTRFFGQDGDAAPALSHYALCRYAEWEERISAWQSPVLDDRSLPA 474

Query: 434 WYPITLFNELYYLNAGGAVWTDGSPPVHSLVTIGHRKFSLDWSQSDLKRIVDVPNQNDTA 493
           WY   LFNELY+L  GG VW +                                      
Sbjct: 475 WYKSALFNELYFLADGGTVWLE-------------------------------------- 496

Query: 494 VNILERMSSILEQIYTPVALNSSFGTNL--LQDGEENIGQFLYLEGIEYLMWNTYDVHFY 551
                    +LE      +L    G N+  L+    + G+F YLEG EY M+NTYDVHFY
Sbjct: 497 ---------VLED-----SLPEELGRNMCHLRPTLRDYGRFGYLEGQEYRMYNTYDVHFY 542

Query: 552 SSFALIMLFPKIQLSIQRDFAAAVMMHDPSKMKLLDDGQWVSRKVLGAVPHDIGICD--P 609
           +SFALIML+PK++LS+Q D A A +  D ++ + L  G     K    +PHDIG  D  P
Sbjct: 543 ASFALIMLWPKLELSLQYDMALATLREDLTRRRYLMSGVMAPVKRRNVIPHDIGDPDDEP 602

Query: 610 WFEVNAYCLYDTARWKDLNPKFVLQVYRDVVATGDKKFAKAVWPSVYVAMAYMDQFDRDG 669
           W  VNAY ++DTA WKDLN KFVLQVYRD   TGD+ F K +WP     M    +FD+D 
Sbjct: 603 WLRVNAYLIHDTADWKDLNLKFVLQVYRDYYLTGDQNFLKDMWPVCLAVMESEMKFDKDH 662

Query: 670 DGMIENDGFPDQTYDTWSVSGISAYSGGLWVAALQAASALAREVGDRGSEDYFLFKFQKA 729
           DG+IEN G+ DQTYD W  +G SAY GGLW+AA+     +A   G +  +D F     + 
Sbjct: 663 DGLIENGGYADQTYDGWVTTGPSAYCGGLWLAAVAVMVQMAALCGAQDIQDKFSSILSRG 722

Query: 730 KVVYEK-LWNGSYFNYDNSGSSQSSSIQADQLAGQWYARACGL----LPIVDEDKARSAL 784
           +  YE+ LWNG Y+NYD+S   QS S+ +DQ AGQW+ +ACGL      +        AL
Sbjct: 723 QEAYERLLWNGRYYNYDSSSRPQSRSVMSDQCAGQWFLKACGLGEGDTEVFPTPHVVRAL 782

Query: 785 KKVYNYNVLKVMGGKRGAVNGMLPDGRVDMSSMQSREIWSGVTYAVAASMIHEDLADIGF 844
           + ++  NV    GG  GAVNGM P G  D SS+QS E+W GV Y +AA+MI E L   GF
Sbjct: 783 QTIFELNVQAFAGGAMGAVNGMQPHGVPDKSSVQSDEVWVGVVYGLAATMIQEGLTWEGF 842

Query: 845 QTACGIYEAAWSGTGLGYAFQTPEAWNTDDQYRSLCYMRPLAIW 888
           QTA G Y   W   GL  AFQTPEA+     +RSL YMRPL+IW
Sbjct: 843 QTAEGCYRTVWERLGL--AFQTPEAYCQQRVFRSLAYMRPLSIW 884


>gi|402897018|ref|XP_003911574.1| PREDICTED: non-lysosomal glucosylceramidase [Papio anubis]
          Length = 927

 Score =  572 bits (1473), Expect = e-160,   Method: Compositional matrix adjust.
 Identities = 337/856 (39%), Positives = 469/856 (54%), Gaps = 84/856 (9%)

Query: 46  PLSQFTLNWKETVQLAPIGVRIL-CLIREEAAKGKRAFIDPFIKRHLTSSHGVPLGGVGS 104
           P     ++ +  ++   +G+R L    R+   + K  FID      L   +G PLGG+G 
Sbjct: 100 PFQANNVSLRNMIKHIGMGLRYLQWWYRKTQVEKKTPFIDMINSVPLRQIYGCPLGGIGG 159

Query: 105 GSIGRSYRGEFQRWQIFPRVCEDKPVLANQFSVFVSRSNGQKYSSVLCPKTPEVLKDTTA 164
           G+I R +RG+F RWQ+ P + + + V+A+QF+V + R     Y  VL  + P VL+    
Sbjct: 160 GTITRGWRGQFCRWQLNPGMYQHRTVIADQFTVCLRREGQTVYQQVLSLERPSVLR---- 215

Query: 165 AGIGSWDWNLKGDKSTYHALYPRAWTVHEGEPDPELRIVCRQISPIIPHNYKESSYPVSV 224
               SW+W L G  + YHALYPRAWTV++  P   + + CRQI+PI+PH+Y++SS PV V
Sbjct: 216 ----SWNWGLCGYFAFYHALYPRAWTVYQ-LPGQNVTLTCRQITPILPHDYQDSSLPVGV 270

Query: 225 FTYTIYNSGKTSADITLLFTWTNSVGGDSEFTGQHYNSKTKMNDG---VHAVLLHHRTSH 281
           F + + N G  + D++++F+  N +GG  +  G  +N    +  G   V  +LLHH T  
Sbjct: 271 FVWDVENEGDEALDVSIMFSMRNGLGGGDDAPGGLWNEPFCLEHGGETVQGLLLHHPT-- 328

Query: 282 QLP-PVTFALAAQETDGVHVSLCPHFVISGNSLGLTAKDMWHEIKEHGSFDRLNSMETSV 340
            LP P T A+AA+ T    V+      I+      T + +W ++ + G  D  +    S 
Sbjct: 329 -LPNPYTMAVAARVTADTTVTH-----ITAFDPDSTGQQVWQDLLQDGQLD--SPAGQST 380

Query: 341 TSEPGSSIGAAIAASVTVPPDSEGQVTFSLAWDCPEVNF-MSGKTYYRRYTKFYGTHQNA 399
            ++ G  I  A+  S  + P  + ++ FSLAWD P + F   G+ +YRRYT+F+G   +A
Sbjct: 381 PTQKGVGIAGAVCVSSKLRPRGQCRLEFSLAWDMPRILFGAKGQVHYRRYTRFFGQDGDA 440

Query: 400 AANIARDAILEHGSWELQIEAWQRPILEDKRLPEWYPITLFNELYYLNAGGAVWTDGSPP 459
           A  ++  A+ ++  WE +I AWQ P+L+D+ LP WY   LFNELY+L  GG VW +    
Sbjct: 441 APALSHYALCQYAEWEERISAWQSPVLDDRSLPAWYKSALFNELYFLADGGTVWLE---- 496

Query: 460 VHSLVTIGHRKFSLDWSQSDLKRIVDVPNQNDTAVNILERMSSILEQIYTPVALNSSFGT 519
                                                  R  S+ E++      N     
Sbjct: 497 --------------------------------------VREDSLPEEL----GRNMCHLC 514

Query: 520 NLLQDGEENIGQFLYLEGIEYLMWNTYDVHFYSSFALIMLFPKIQLSIQRDFAAAVMMHD 579
             L+D     G+F YLEG EY M+NTYDVHFY+SFALIML+PK++LS+Q D A A +  D
Sbjct: 515 PTLRD----YGRFGYLEGQEYRMYNTYDVHFYASFALIMLWPKLELSLQYDMALATLRED 570

Query: 580 PSKMKLLDDGQWVSRKVLGAVPHDIGICD--PWFEVNAYCLYDTARWKDLNPKFVLQVYR 637
            ++ + L  G     K    +PHDIG  D  PW  VNAY ++DTA WKDLN KFVLQVYR
Sbjct: 571 LTRRRYLMSGVMAPVKRRNVIPHDIGDPDDEPWLRVNAYLIHDTADWKDLNLKFVLQVYR 630

Query: 638 DVVATGDKKFAKAVWPSVYVAMAYMDQFDRDGDGMIENDGFPDQTYDTWSVSGISAYSGG 697
           D   TGD+ F K +WP     M    +FD+D DG+IEN G+ DQTYD W  +G S Y GG
Sbjct: 631 DYYLTGDQNFLKDMWPVCLAVMESEMKFDKDHDGLIENGGYADQTYDGWVTTGPSVYCGG 690

Query: 698 LWVAALQAASALAREVGDRGSEDYFLFKFQKAKVVYEK-LWNGSYFNYDNSGSSQSSSIQ 756
           LW+AA+     +A   G +  +D F     + +  YE+ LWNG Y+NYD+S   QS S+ 
Sbjct: 691 LWLAAVAVMVQMAALCGAQDIQDKFSSILHRGQEAYERLLWNGRYYNYDSSSQPQSRSVM 750

Query: 757 ADQLAGQWYARACGL----LPIVDEDKARSALKKVYNYNVLKVMGGKRGAVNGMLPDGRV 812
           +DQ AGQW+ +ACGL      +        AL+ ++  NV    GG  GAVNGM P G  
Sbjct: 751 SDQCAGQWFLKACGLGEGETEVFPTPHVVRALQTIFELNVQAFAGGAMGAVNGMQPHGVP 810

Query: 813 DMSSMQSREIWSGVTYAVAASMIHEDLADIGFQTACGIYEAAWSGTGLGYAFQTPEAWNT 872
           D SS+QS E+W GV Y +AA+MI E L   GFQTA G Y   W   GL  AFQTPEA+  
Sbjct: 811 DKSSVQSDEVWVGVVYGLAATMIQEGLTWEGFQTAEGCYRTVWERLGL--AFQTPEAYCQ 868

Query: 873 DDQYRSLCYMRPLAIW 888
              +RSL YMRPL+IW
Sbjct: 869 QRVFRSLAYMRPLSIW 884


>gi|444729906|gb|ELW70309.1| Non-lysosomal glucosylceramidase [Tupaia chinensis]
          Length = 1001

 Score =  571 bits (1472), Expect = e-160,   Method: Compositional matrix adjust.
 Identities = 352/894 (39%), Positives = 488/894 (54%), Gaps = 74/894 (8%)

Query: 46  PLSQFTLNWKETVQLAPIGVRIL-CLIREEAAKGKRAFIDPFIKRHLTSSHGVPLGGVGS 104
           P     ++    V+   +G+R L    R+   + K  FID      L   +G PLGG+G 
Sbjct: 92  PFQANNISLSNLVKHLGMGLRYLQWWYRKTQVEKKTPFIDMINCVPLRQIYGCPLGGIGG 151

Query: 105 GSIGRSYRGEFQRWQIFPRVCEDKPVLANQFSVFVSRSNGQKYSSVLCPKTPEVLKDTTA 164
           G+I R +RG+F RWQ+ P + + + V+A+QF+V + R     Y  VL    P VL+    
Sbjct: 152 GTITRGWRGQFCRWQLNPGMYQHRTVIADQFTVCLRREGQTVYQQVLSLDHPSVLR---- 207

Query: 165 AGIGSWDWNLKGDKSTYHALYPRAWTVHEGEPDPELRIVCRQISPIIPHNYKESSYPVSV 224
               SW+W L G  + YHALYPRAWTV++  P   + + CRQI+PI+PH+Y++SS PV V
Sbjct: 208 ----SWNWGLCGYFAFYHALYPRAWTVYQ-LPGQNVTLTCRQITPILPHDYQDSSLPVGV 262

Query: 225 FTYTIYNSGKTSADITLLFTWTNSVGGDSEFTGQHYNSKTKMN-DG--VHAVLLHHRTSH 281
           F + + N G  + D++++F+  N +GG+ +  G  +N    +  DG  V  +LLHH T  
Sbjct: 263 FVWDVENEGDEALDVSIMFSMRNGLGGEDDAPGGLWNEPFCLERDGETVQGLLLHHPT-- 320

Query: 282 QLP-PVTFALAAQETDGVHVSLCPHFVISGNSLGLTAKDMWHEIKEHGSFDRLNSMETSV 340
            LP P T A+AA+ T    V+    F   G     T + +W ++ + G  D  +    S 
Sbjct: 321 -LPNPYTMAVAARLTANTIVTHITAFDPDG-----TGQQVWQDLLQDGQLD--SPAGRST 372

Query: 341 TSEPGSSIGAAIAASVTVPPDSEGQVTFSLAWDCPEVNF-MSGKTYYRRYTKFYGTHQNA 399
            S+ G  I  A+  S  +PP  +  + FSLAWD P++ F   G+ +YRRYT+F+    +A
Sbjct: 373 PSQKGVGIAGAVCISGKLPPRGQCCLEFSLAWDMPKIVFGAKGQVHYRRYTRFFSPDGDA 432

Query: 400 AANIARDAILEHGSWELQIEAWQRPILEDKRLPEWYPITLFNELYYLNAGGAVWTDGSPP 459
           A  ++  A+  +  WE +I AWQ PIL+D+ LP WY   LFNELY+L  GG VW +   P
Sbjct: 433 APALSHYALCRYADWEERISAWQSPILDDRSLPAWYKSALFNELYFLADGGTVWLE--VP 490

Query: 460 VHSLVT-IGHRKFSLDWSQSDLKRIVDVPNQNDTAVNILER------MSSILEQIY---- 508
             SL   +G     L     D  R   +    +     L R       S++  ++Y    
Sbjct: 491 EDSLPEELGGGMHQLRPILRDYGRFGYLEGMLNPCPLCLRRSLPAWYKSALFNELYFLAD 550

Query: 509 ------------TPVALNSSFGTNLLQDGEENIGQFLYLEGI-------------EYLMW 543
                        P  L    G + L+    + G+F YLEG+             EY M+
Sbjct: 551 GGTVWLEVPEDSLPEELGG--GMHQLRPILRDYGRFGYLEGMAAGGLQYGWGWGQEYRMY 608

Query: 544 NTYDVHFYSSFALIMLFPKIQLSIQRDFAAAVMMHDPSKMKLLDDGQWVSRKVLGAVPHD 603
           NTYDVHFY+SFAL+ML+PK++LS+Q D A A +  D ++ + L  G     K    +PHD
Sbjct: 609 NTYDVHFYASFALVMLWPKLELSLQYDMALATLREDLTRRRYLMSGVIAPVKRRNVIPHD 668

Query: 604 IGICD--PWFEVNAYCLYDTARWKDLNPKFVLQVYRDVVATGDKKFAKAVWPSVYVAMAY 661
           IG  D  PW  VNAY ++DTA WKDLN KFVLQVYRD   TGD+ F K +WP     M  
Sbjct: 669 IGDPDDEPWLRVNAYLIHDTADWKDLNLKFVLQVYRDYYLTGDQGFLKDMWPVCLAVMES 728

Query: 662 MDQFDRDGDGMIENDGFPDQTYDTWSVSGISAYSGGLWVAALQAASALAREVGDRGSEDY 721
             +FD+D DG+IEN G+ DQTYD W  +G SAY GGLW+AA+     +A   G +  ++ 
Sbjct: 729 EMKFDKDHDGLIENGGYADQTYDGWVTTGPSAYCGGLWLAAVAVMVQMAALCGAQDVQEK 788

Query: 722 FLFKFQKAKVVYEK-LWNGSYFNYDNSGSSQSSSIQADQLAGQWYARACGL----LPIVD 776
           F     + +  YE+ LWNG Y+NYD+S   QS SI +DQ AGQW+ +ACGL      +  
Sbjct: 789 FTSILTRGQEAYERLLWNGRYYNYDSSSQPQSRSIMSDQCAGQWFLKACGLGEGDTEVFP 848

Query: 777 EDKARSALKKVYNYNVLKVMGGKRGAVNGMLPDGRVDMSSMQSREIWSGVTYAVAASMIH 836
                 AL+ ++ +NV    GG  GAVNGM P G  D SS+QS E+W GV Y +AA+MI 
Sbjct: 849 TPHVVRALQTIFEFNVQAFAGGAMGAVNGMQPHGVPDRSSVQSDEVWVGVVYGLAATMIQ 908

Query: 837 EDLADIGFQTACGIYEAAWSGTGLGYAFQTPEAWNTDDQYRSLCYMRPLAIWAM 890
           E L   GF+TA G Y   W   GL  AFQTPEA+     +RSL YMRPL+IWAM
Sbjct: 909 EGLTWEGFRTAEGCYRTVWERLGL--AFQTPEAYCQQRVFRSLAYMRPLSIWAM 960


>gi|224090236|ref|XP_002190896.1| PREDICTED: non-lysosomal glucosylceramidase [Taeniopygia guttata]
          Length = 826

 Score =  569 bits (1466), Expect = e-159,   Method: Compositional matrix adjust.
 Identities = 348/838 (41%), Positives = 469/838 (55%), Gaps = 79/838 (9%)

Query: 71  IREEAAKGKRAFIDPFIKRHLTSSHGVPLGGVGSGSIGRSYRGEFQRWQIFPRVCEDKPV 130
           +++   + K AFID      L   +G PLGG+G G+I R +RGEF RWQ+ P +     V
Sbjct: 40  VKKTCIEKKTAFIDLLCAVPLQQIYGCPLGGIGGGTITRGWRGEFCRWQLNPGLYHYGTV 99

Query: 131 LANQFSVFVSRSNGQK-YSSVLCPKTPEVLKDTTAAGIGSWDWNLKGDKSTYHALYPRAW 189
           +ANQF+V + R  GQ  Y  VL  + P  L+         W+W   G  + YHALYPRAW
Sbjct: 100 IANQFTVCL-RCKGQTIYQQVLSMERPSSLQ--------GWNWGYCGHYAFYHALYPRAW 150

Query: 190 TVHEGEPDPELRIVCRQISPIIPHNYKESSYPVSVFTYTIYNSGKTSADITLLFTWTNSV 249
            V+E  P  ++ + CRQ+SP+IPH+YK+SS PV VF + + N      +++++F+  N +
Sbjct: 151 LVYE-LPGQQVVLTCRQVSPVIPHDYKDSSLPVGVFIWEVENRRDEDVEVSIMFSLQNGM 209

Query: 250 GGDSEFTGQHYNSK-TKMNDG--VHAVLLHHRTSHQLPPVTFALAAQETDGVHVSLCPHF 306
           G   + +G H+N   T   DG  V  VLLHH  +  + P T A++A+E  G  V+     
Sbjct: 210 GTKEDGSGGHWNEPFTFQKDGERVAGVLLHHCPA--VNPFTLAISAREKAGTGVT----H 263

Query: 307 VISGNSLGLTAKDMWHEIKEHGSFDRLNSMETSVTSEPGSSIGAAIAASVTVPPDSEGQV 366
           V + N  GL  +++W ++ + G  D  +    S  +E G    AA+ AS  VP      +
Sbjct: 264 VTAFNPTGL-GREVWQDLLQDGRLD--SPAGKSRPTEKGEVTAAAVCASCRVPAHGHKTL 320

Query: 367 TFSLAWDCPEVNFMSG-KTYYRRYTKFYGTHQNAAANIARDAILEHGSWELQIEAWQRPI 425
             +LAWD P V+F S  K + RRYT+F+G+  +AA  ++  A+  +  WE +IEAWQ PI
Sbjct: 321 EMALAWDMPCVHFGSKEKLHVRRYTRFFGSKGDAAPTLSHYALTHYEEWERKIEAWQNPI 380

Query: 426 LEDKRLPEWYPITLFNELYYLNAGGAVWTDGSPPVHSLVTIGHRKFSLDWSQSDLKRIVD 485
           LE+ +LP WY   LFNELY++  GG +W +  P               D    DL+    
Sbjct: 381 LENSQLPAWYKSALFNELYFMTDGGTIWVEVPP---------------DCCAEDLQ---- 421

Query: 486 VPNQNDTAVNILERMSSILEQIYTPVALNSSFGTNLLQDGEENIGQFLYLEGIEYLMWNT 545
                                   P     S    +L++     G+F YLEG EY M+NT
Sbjct: 422 -----------------------GPAGAGLSHLLPVLRE----YGRFAYLEGQEYRMYNT 454

Query: 546 YDVHFYSSFALIMLFPKIQLSIQRDFAAAVMMHDPSKMKLLDDGQWVSRKVLGAVPHDIG 605
           YDVHFY+SFALIML+PK+Q+S+Q D A  V+  D    + L  GQ    K+   VPHDIG
Sbjct: 455 YDVHFYASFALIMLWPKLQISLQYDIAVTVVNEDVQPRQYLMGGQTAQVKMKNVVPHDIG 514

Query: 606 IC--DPWFEVNAYCLYDTARWKDLNPKFVLQVYRDVVATGDKKFAKAVWPSVYVAMAYMD 663
               +PW  VNAY ++DTA WKDLN KFVLQVYRD   T D  + + +WP     M    
Sbjct: 515 DPGDEPWQRVNAYLMHDTADWKDLNLKFVLQVYRDYYLTHDSLYLQDMWPVCQAVMESEL 574

Query: 664 QFDRDGDGMIENDGFPDQTYDTWSVSGISAYSGGLWVAALQAASALAREVGDRGSEDYFL 723
           +FD D DG+IEN GF DQTYD W V    AY GGLW+AA+     +A  +GD      +L
Sbjct: 575 KFDTDNDGLIENGGFADQTYDAWVVIRRCAYCGGLWLAAVCMMCKMAEVLGDTEIRQKYL 634

Query: 724 FKFQKAKVVYEK-LWNGSYFNYDNSGSSQSSSIQADQLAGQWYARACGL----LPIVDED 778
               K K  +E+ LWNG Y+NYD+SGS  SSSI +DQ AGQW+  ACGL      +  + 
Sbjct: 635 GILNKGKEAFERMLWNGKYYNYDSSGSDTSSSIMSDQCAGQWFLGACGLDQGEFEVFPKS 694

Query: 779 KARSALKKVYNYNVLKVMGGKRGAVNGMLPDGRVDMSSMQSREIWSGVTYAVAASMIHED 838
              SAL+ ++  NVL   GG  GAVNGM PDG  D SS+QS E+W GV Y++AA+MI E 
Sbjct: 695 HILSALRTIFEKNVLSFAGGTMGAVNGMRPDGVPDTSSVQSNEVWVGVVYSLAATMIQEG 754

Query: 839 LADIGFQTACGIYEAAWSGTGLGYAFQTPEAWNTDDQYRSLCYMRPLAIWAMQWALTR 896
           + + GF+TA G Y   W    LG AFQTPEA+     YRSL YMRPL+IW+MQ AL R
Sbjct: 755 MVEEGFRTAEGCYRTVWE--RLGMAFQTPEAYREKKVYRSLAYMRPLSIWSMQLALER 810


>gi|301767028|ref|XP_002918945.1| PREDICTED: non-lysosomal glucosylceramidase-like, partial
           [Ailuropoda melanoleuca]
          Length = 928

 Score =  569 bits (1466), Expect = e-159,   Method: Compositional matrix adjust.
 Identities = 335/831 (40%), Positives = 461/831 (55%), Gaps = 87/831 (10%)

Query: 72  REEAAKGKRAFIDPFIKRHLTSSHGVPLGGVGSGSIGRSYRGEFQRWQIFPRVCEDKPVL 131
           R+   + K  FID      L   +G PLGG+G G+I R +RG+F RWQ+ P + + + V+
Sbjct: 130 RKTQVEKKTPFIDLINCVPLRQIYGCPLGGIGGGTITRGWRGQFCRWQLNPGMYQHRTVI 189

Query: 132 ANQFSVFVSRSNGQKYSSVLCPKTPEVLKDTTAAGIGSWDWNLKGDKSTYHALYPRAWTV 191
           A+QF+V + R     Y  VL  + P VL+        SW+W L G  + YHALYPRAWTV
Sbjct: 190 ADQFTVCLRREGKTVYQQVLSVERPSVLR--------SWNWGLCGYFAFYHALYPRAWTV 241

Query: 192 HEGEPDPELRIVCRQISPIIPHNYKESSYPVSVFTYTIYNSGKTSADITLLFTWTNSVGG 251
           ++  P   + + CRQI+PI+PH+Y++SS PV VF + + N G  + D++++F+  N +G 
Sbjct: 242 YQ-LPGQNVTLTCRQITPILPHDYQDSSLPVGVFVWDVENEGDEALDVSIMFSMRNGLGV 300

Query: 252 DSEFTGQHYNSKTKMN-DG--VHAVLLHHRTSHQLP-PVTFALAAQETDGVHVSLCPHFV 307
             +  G  +N    +  DG  V  +LLHH T   LP P T A+AA+ T    V+      
Sbjct: 301 GDDAPGGLWNEPFCLERDGETVQGLLLHHPT---LPNPYTMAVAARLTADTTVTH----- 352

Query: 308 ISGNSLGLTAKDMWHEIKEHGSFDRLNSMETSVTSEPGSSIGAAIAASVTVPPDSEGQVT 367
           I+      T + +W ++ + G  D  +    S  S+ G  I  A+  +  +PP  + ++ 
Sbjct: 353 ITAFDPDSTGQQVWQDLLQDGQLD--SPAGQSTPSQKGVGIAGAVCVTGKLPPRGQCRLE 410

Query: 368 FSLAWDCPEVNF-MSGKTYYRRYTKFYGTHQNAAANIARDAILEHGSWELQIEAWQRPIL 426
           FSLAWD P + F   G+ YYRRYT+F+G   NAA  ++  A+  +  WE +I AWQ PIL
Sbjct: 411 FSLAWDMPRIMFGAKGQVYYRRYTRFFGRDGNAAPALSHYALCRYVDWEEKISAWQSPIL 470

Query: 427 EDKRLPEWYPITLFNELYYLNAGGAVWTDGSPPVHSLVTIGHRKFSLDWSQSDLKRIVDV 486
           +D+ LP WY   LFNELY+L  GG VW                              ++V
Sbjct: 471 DDRSLPAWYKSALFNELYFLADGGTVW------------------------------LEV 500

Query: 487 PNQN--DTAVNILERMSSILEQIYTPVALNSSFGTNLLQDGEENIGQFLYLEGIEYLMWN 544
           P  +  +  V  + ++  IL +                       G+F YLEG EY M+N
Sbjct: 501 PEDSLPEELVGSMCQLRPILREY----------------------GRFGYLEGQEYRMYN 538

Query: 545 TYDVHFYSSFALIMLFPKIQLSIQRDFAAAVMMHDPSKMKLLDDGQWVSRKVLGAVPHDI 604
           TYDVHFY+SFAL+ML+PK++LS+Q D A A +  D ++ + L  G     K    +PHDI
Sbjct: 539 TYDVHFYASFALVMLWPKLELSLQYDMALATLREDLTRRQYLMSGVTAPVKRRNVIPHDI 598

Query: 605 GICD--PWFEVNAYCLYDTARWKDLNPKFVLQVYRDVVATGDKKFAKAVWPSVYVAMAYM 662
           G  D  PW  VNAY ++DTA WKDLN KFVLQVYRD   TGD+ F + +WP     M   
Sbjct: 599 GDPDDEPWLRVNAYVVHDTADWKDLNLKFVLQVYRDYYLTGDQSFLRDMWPVCLAVMESE 658

Query: 663 DQFDRDGDGMIENDGFPDQTYDTWSVSGISAYSGGLWVAALQAASALAREVGDRGSEDYF 722
            +FD+D DG+IEN G+ DQTYD W  +G SAY GGLW+AA+     +A   G +  +D F
Sbjct: 659 MKFDKDQDGLIENGGYADQTYDGWITTGPSAYCGGLWLAAVAVMVQIAVLCGAQDVQDKF 718

Query: 723 LFKFQKAKVVYEK-LWNGSYFNYDNSGSSQSSSIQADQLAGQWYARACGL----LPIVDE 777
               ++ +  YE+ LWNG Y+NYD S   QS SI +DQ AGQW+ +A GL      +   
Sbjct: 719 SSILRRGREAYERLLWNGRYYNYDCSPQPQSCSIMSDQCAGQWFLKASGLGEGDTEVFPT 778

Query: 778 DKARSALKKVYNYNVLKVMGGKRGAVNGMLPDGRVDMSSMQSREIWSGVTYAVAASMIHE 837
                AL+ ++ +NV    GG  GAVNGM P G  D SS+QS E+W GV Y +AA+MI E
Sbjct: 779 PHVVCALQTIFEFNVQAFAGGAMGAVNGMQPHGVPDRSSVQSDEVWVGVVYGLAATMIQE 838

Query: 838 DLADIGFQTACGIYEAAWSGTGLGYAFQTPEAWNTDDQYRSLCYMRPLAIW 888
            L   GFQTA G Y   W   GL  AFQTPEA+     +RSL YMRPL+IW
Sbjct: 839 GLTWEGFQTAEGCYRTVWERLGL--AFQTPEAYCQQRVFRSLAYMRPLSIW 887


>gi|143018484|sp|Q5M868.2|GBA2_RAT RecName: Full=Non-lysosomal glucosylceramidase; Short=NLGase;
           AltName: Full=Beta-glucocerebrosidase 2;
           Short=Beta-glucosidase 2; AltName:
           Full=Glucosylceramidase 2
          Length = 912

 Score =  568 bits (1464), Expect = e-159,   Method: Compositional matrix adjust.
 Identities = 332/828 (40%), Positives = 459/828 (55%), Gaps = 81/828 (9%)

Query: 72  REEAAKGKRAFIDPFIKRHLTSSHGVPLGGVGSGSIGRSYRGEFQRWQIFPRVCEDKPVL 131
           R+   + K  FID F    L   +G PLGG+G G+I R +RG+F RWQ+ P + + + V+
Sbjct: 118 RKTQVEKKTPFIDMFNSVPLRQIYGCPLGGIGGGTITRGWRGQFCRWQLNPGMYQHQTVI 177

Query: 132 ANQFSVFVSRSNGQKYSSVLCPKTPEVLKDTTAAGIGSWDWNLKGDKSTYHALYPRAWTV 191
           A+QF V + R     Y  VL  + P VL+        SW+W L G  + YHALYPRAWTV
Sbjct: 178 ADQFIVCLRRDGRTVYQQVLSLELPSVLR--------SWNWGLCGYFAFYHALYPRAWTV 229

Query: 192 HEGEPDPELRIVCRQISPIIPHNYKESSYPVSVFTYTIYNSGKTSADITLLFTWTNSVGG 251
           ++  P   + + CRQI+PI+PH+Y++SS PV VF + + N G  + D++++F+  N +GG
Sbjct: 230 YQ-LPGQNVTLTCRQITPILPHDYQDSSLPVGVFVWDVENEGDETLDVSIMFSMRNGLGG 288

Query: 252 DSEFTGQHYNSKTKM-NDG--VHAVLLHHRTSHQLPPVTFALAAQETDGVHVSLCPHFVI 308
           + +  G  +N   ++  DG  V  +LLHH T     P T A+AA+     H +       
Sbjct: 289 EDDAAGGLWNEPFRLEQDGTTVQGLLLHHPTPPN--PYTMAVAAR-----HTADTTVTYT 341

Query: 309 SGNSLGLTAKDMWHEIKEHGSFDRLNSMETSVTSEPGSSIGAAIAASVTVPPDSEGQVTF 368
           +      T + +W ++ + G  D  +    S  ++ G  +  A+ AS  + P     + F
Sbjct: 342 TAFDPDSTGQQVWQDLLQDGQLD--SPAGQSTPTQRGEGVAGAVCASSKLLPRGRCCLEF 399

Query: 369 SLAWDCPEVNF-MSGKTYYRRYTKFYGTHQNAAANIARDAILEHGSWELQIEAWQRPILE 427
           SLAWD P + F   G+ +YRRYT+F+G+  + A  ++  A+ ++  WE  I AWQ P+L+
Sbjct: 400 SLAWDMPRIMFGAKGQVHYRRYTRFFGSDGDVAPALSHYALCQYAGWENSISAWQNPVLD 459

Query: 428 DKRLPEWYPITLFNELYYLNAGGAVWTDGSPPVHSLVTIGHRKFSLDWSQSDLKRIVDVP 487
           D+ LP WY   LFNELY+L  GG VW                              ++VP
Sbjct: 460 DRSLPAWYKSALFNELYFLADGGTVW------------------------------LEVP 489

Query: 488 NQNDTAVNILERMSSILEQIYTPVALNSSFGTNLLQDGEENIGQFLYLEGIEYLMWNTYD 547
             +     + E +   + Q+             +LQD     G+F YLEG EY M+NTYD
Sbjct: 490 EDS-----LPEELGGSMYQL-----------RPILQD----YGRFGYLEGQEYRMYNTYD 529

Query: 548 VHFYSSFALIMLFPKIQLSIQRDFAAAVMMHDPSKMKLLDDGQWVSRKVLGAVPHDIGIC 607
           VHFY+SFAL+ML+PK++LS+Q D A A    D ++ + L  G     K    +PHDIG  
Sbjct: 530 VHFYASFALVMLWPKLELSLQYDMALATFKEDLTRRRYLMSGVVAPVKRRNVIPHDIGDP 589

Query: 608 D--PWFEVNAYCLYDTARWKDLNPKFVLQVYRDVVATGDKKFAKAVWPSVYVAMAYMDQF 665
           D  PW  VNAY ++DTA WKDLN KFVLQVYRD   TGD+ F K +WP     M    +F
Sbjct: 590 DDEPWLRVNAYLIHDTADWKDLNLKFVLQVYRDYYLTGDQGFLKDMWPVCLAVMESEMKF 649

Query: 666 DRDGDGMIENDGFPDQTYDTWSVSGISAYSGGLWVAALQAASALAREVGDRGSEDYFLFK 725
           D+D DG+IEN G+ DQTYD W  +G SAY GGLW+AA+     +A   G +  +D F   
Sbjct: 650 DKDQDGLIENGGYADQTYDGWVTTGPSAYCGGLWLAAVAVMVQMAVLCGAQDVQDKFSSI 709

Query: 726 FQKAKVVYEK-LWNGSYFNYDNSGSSQSSSIQADQLAGQWYARACGL----LPIVDEDKA 780
             + +  YE+ LWNG Y+NYD+S   QS S+ +DQ AGQW+ RACGL      +      
Sbjct: 710 LCRGREAYERLLWNGRYYNYDSSSQPQSRSVMSDQCAGQWFLRACGLGEGDTEVFPTLHV 769

Query: 781 RSALKKVYNYNVLKVMGGKRGAVNGMLPDGRVDMSSMQSREIWSGVTYAVAASMIHEDLA 840
             ALK ++  NV    GG  GAVNGM P G  D SS+QS E+W GV Y +AA+MI E L 
Sbjct: 770 VRALKTIFELNVQAFAGGAMGAVNGMQPHGVPDRSSVQSDEVWVGVVYGLAATMIQEGLT 829

Query: 841 DIGFQTACGIYEAAWSGTGLGYAFQTPEAWNTDDQYRSLCYMRPLAIW 888
             GF+TA G Y   W   GL  AFQTPEA+     +RSL YMRPL+IW
Sbjct: 830 WEGFRTAEGCYRTVWERLGL--AFQTPEAYCQQRVFRSLAYMRPLSIW 875


>gi|189217893|ref|NP_001013109.2| non-lysosomal glucosylceramidase [Rattus norvegicus]
 gi|149045757|gb|EDL98757.1| glucosidase beta 2 [Rattus norvegicus]
          Length = 904

 Score =  568 bits (1463), Expect = e-159,   Method: Compositional matrix adjust.
 Identities = 332/828 (40%), Positives = 459/828 (55%), Gaps = 81/828 (9%)

Query: 72  REEAAKGKRAFIDPFIKRHLTSSHGVPLGGVGSGSIGRSYRGEFQRWQIFPRVCEDKPVL 131
           R+   + K  FID F    L   +G PLGG+G G+I R +RG+F RWQ+ P + + + V+
Sbjct: 118 RKTQVEKKTPFIDMFNSVPLRQIYGCPLGGIGGGTITRGWRGQFCRWQLNPGMYQHQTVI 177

Query: 132 ANQFSVFVSRSNGQKYSSVLCPKTPEVLKDTTAAGIGSWDWNLKGDKSTYHALYPRAWTV 191
           A+QF V + R     Y  VL  + P VL+        SW+W L G  + YHALYPRAWTV
Sbjct: 178 ADQFIVCLRRDGRTVYQQVLSLELPSVLR--------SWNWGLCGYFAFYHALYPRAWTV 229

Query: 192 HEGEPDPELRIVCRQISPIIPHNYKESSYPVSVFTYTIYNSGKTSADITLLFTWTNSVGG 251
           ++  P   + + CRQI+PI+PH+Y++SS PV VF + + N G  + D++++F+  N +GG
Sbjct: 230 YQ-LPGQNVTLTCRQITPILPHDYQDSSLPVGVFVWDVENEGDETLDVSIMFSMRNGLGG 288

Query: 252 DSEFTGQHYNSKTKM-NDG--VHAVLLHHRTSHQLPPVTFALAAQETDGVHVSLCPHFVI 308
           + +  G  +N   ++  DG  V  +LLHH T     P T A+AA+     H +       
Sbjct: 289 EDDAAGGLWNEPFRLEQDGTTVQGLLLHHPTPPN--PYTMAVAAR-----HTADTTVTYT 341

Query: 309 SGNSLGLTAKDMWHEIKEHGSFDRLNSMETSVTSEPGSSIGAAIAASVTVPPDSEGQVTF 368
           +      T + +W ++ + G  D  +    S  ++ G  +  A+ AS  + P     + F
Sbjct: 342 TAFDPDSTGQQVWQDLLQDGQLD--SPAGQSTPTQRGEGVAGAVCASSKLLPRGRCCLEF 399

Query: 369 SLAWDCPEVNF-MSGKTYYRRYTKFYGTHQNAAANIARDAILEHGSWELQIEAWQRPILE 427
           SLAWD P + F   G+ +YRRYT+F+G+  + A  ++  A+ ++  WE  I AWQ P+L+
Sbjct: 400 SLAWDMPRIMFGAKGQVHYRRYTRFFGSDGDVAPALSHYALCQYAGWENSISAWQNPVLD 459

Query: 428 DKRLPEWYPITLFNELYYLNAGGAVWTDGSPPVHSLVTIGHRKFSLDWSQSDLKRIVDVP 487
           D+ LP WY   LFNELY+L  GG VW                              ++VP
Sbjct: 460 DRSLPAWYKSALFNELYFLADGGTVW------------------------------LEVP 489

Query: 488 NQNDTAVNILERMSSILEQIYTPVALNSSFGTNLLQDGEENIGQFLYLEGIEYLMWNTYD 547
             +     + E +   + Q+             +LQD     G+F YLEG EY M+NTYD
Sbjct: 490 EDS-----LPEELGGSMYQL-----------RPILQD----YGRFGYLEGQEYRMYNTYD 529

Query: 548 VHFYSSFALIMLFPKIQLSIQRDFAAAVMMHDPSKMKLLDDGQWVSRKVLGAVPHDIGIC 607
           VHFY+SFAL+ML+PK++LS+Q D A A    D ++ + L  G     K    +PHDIG  
Sbjct: 530 VHFYASFALVMLWPKLELSLQYDMALATFKEDLTRRRYLMSGVVAPVKRRNVIPHDIGDP 589

Query: 608 D--PWFEVNAYCLYDTARWKDLNPKFVLQVYRDVVATGDKKFAKAVWPSVYVAMAYMDQF 665
           D  PW  VNAY ++DTA WKDLN KFVLQVYRD   TGD+ F K +WP     M    +F
Sbjct: 590 DDEPWLRVNAYLIHDTADWKDLNLKFVLQVYRDYYLTGDQGFLKDMWPVCLAVMESEMKF 649

Query: 666 DRDGDGMIENDGFPDQTYDTWSVSGISAYSGGLWVAALQAASALAREVGDRGSEDYFLFK 725
           D+D DG+IEN G+ DQTYD W  +G SAY GGLW+AA+     +A   G +  +D F   
Sbjct: 650 DKDQDGLIENGGYADQTYDGWVTTGPSAYCGGLWLAAVAVMVQMAVLCGAQDVQDKFSSI 709

Query: 726 FQKAKVVYEK-LWNGSYFNYDNSGSSQSSSIQADQLAGQWYARACGL----LPIVDEDKA 780
             + +  YE+ LWNG Y+NYD+S   QS S+ +DQ AGQW+ RACGL      +      
Sbjct: 710 LCRGREAYERLLWNGRYYNYDSSSQPQSRSVMSDQCAGQWFLRACGLGEGDTEVFPTLHV 769

Query: 781 RSALKKVYNYNVLKVMGGKRGAVNGMLPDGRVDMSSMQSREIWSGVTYAVAASMIHEDLA 840
             ALK ++  NV    GG  GAVNGM P G  D SS+QS E+W GV Y +AA+MI E L 
Sbjct: 770 VRALKTIFELNVQAFAGGAMGAVNGMQPHGVPDRSSVQSDEVWVGVVYGLAATMIQEGLT 829

Query: 841 DIGFQTACGIYEAAWSGTGLGYAFQTPEAWNTDDQYRSLCYMRPLAIW 888
             GF+TA G Y   W   GL  AFQTPEA+     +RSL YMRPL+IW
Sbjct: 830 WEGFRTAEGCYRTVWERLGL--AFQTPEAYCQQRVFRSLAYMRPLSIW 875


>gi|281339788|gb|EFB15372.1| hypothetical protein PANDA_007478 [Ailuropoda melanoleuca]
          Length = 923

 Score =  568 bits (1463), Expect = e-159,   Method: Compositional matrix adjust.
 Identities = 335/831 (40%), Positives = 461/831 (55%), Gaps = 87/831 (10%)

Query: 72  REEAAKGKRAFIDPFIKRHLTSSHGVPLGGVGSGSIGRSYRGEFQRWQIFPRVCEDKPVL 131
           R+   + K  FID      L   +G PLGG+G G+I R +RG+F RWQ+ P + + + V+
Sbjct: 127 RKTQVEKKTPFIDLINCVPLRQIYGCPLGGIGGGTITRGWRGQFCRWQLNPGMYQHRTVI 186

Query: 132 ANQFSVFVSRSNGQKYSSVLCPKTPEVLKDTTAAGIGSWDWNLKGDKSTYHALYPRAWTV 191
           A+QF+V + R     Y  VL  + P VL+        SW+W L G  + YHALYPRAWTV
Sbjct: 187 ADQFTVCLRREGKTVYQQVLSVERPSVLR--------SWNWGLCGYFAFYHALYPRAWTV 238

Query: 192 HEGEPDPELRIVCRQISPIIPHNYKESSYPVSVFTYTIYNSGKTSADITLLFTWTNSVGG 251
           ++  P   + + CRQI+PI+PH+Y++SS PV VF + + N G  + D++++F+  N +G 
Sbjct: 239 YQ-LPGQNVTLTCRQITPILPHDYQDSSLPVGVFVWDVENEGDEALDVSIMFSMRNGLGV 297

Query: 252 DSEFTGQHYNSKTKM-NDG--VHAVLLHHRTSHQLP-PVTFALAAQETDGVHVSLCPHFV 307
             +  G  +N    +  DG  V  +LLHH T   LP P T A+AA+ T    V+      
Sbjct: 298 GDDAPGGLWNEPFCLERDGETVQGLLLHHPT---LPNPYTMAVAARLTADTTVTH----- 349

Query: 308 ISGNSLGLTAKDMWHEIKEHGSFDRLNSMETSVTSEPGSSIGAAIAASVTVPPDSEGQVT 367
           I+      T + +W ++ + G  D  +    S  S+ G  I  A+  +  +PP  + ++ 
Sbjct: 350 ITAFDPDSTGQQVWQDLLQDGQLD--SPAGQSTPSQKGVGIAGAVCVTGKLPPRGQCRLE 407

Query: 368 FSLAWDCPEVNF-MSGKTYYRRYTKFYGTHQNAAANIARDAILEHGSWELQIEAWQRPIL 426
           FSLAWD P + F   G+ YYRRYT+F+G   NAA  ++  A+  +  WE +I AWQ PIL
Sbjct: 408 FSLAWDMPRIMFGAKGQVYYRRYTRFFGRDGNAAPALSHYALCRYVDWEEKISAWQSPIL 467

Query: 427 EDKRLPEWYPITLFNELYYLNAGGAVWTDGSPPVHSLVTIGHRKFSLDWSQSDLKRIVDV 486
           +D+ LP WY   LFNELY+L  GG VW                              ++V
Sbjct: 468 DDRSLPAWYKSALFNELYFLADGGTVW------------------------------LEV 497

Query: 487 PNQN--DTAVNILERMSSILEQIYTPVALNSSFGTNLLQDGEENIGQFLYLEGIEYLMWN 544
           P  +  +  V  + ++  IL +                       G+F YLEG EY M+N
Sbjct: 498 PEDSLPEELVGSMCQLRPILREY----------------------GRFGYLEGQEYRMYN 535

Query: 545 TYDVHFYSSFALIMLFPKIQLSIQRDFAAAVMMHDPSKMKLLDDGQWVSRKVLGAVPHDI 604
           TYDVHFY+SFAL+ML+PK++LS+Q D A A +  D ++ + L  G     K    +PHDI
Sbjct: 536 TYDVHFYASFALVMLWPKLELSLQYDMALATLREDLTRRQYLMSGVTAPVKRRNVIPHDI 595

Query: 605 GICD--PWFEVNAYCLYDTARWKDLNPKFVLQVYRDVVATGDKKFAKAVWPSVYVAMAYM 662
           G  D  PW  VNAY ++DTA WKDLN KFVLQVYRD   TGD+ F + +WP     M   
Sbjct: 596 GDPDDEPWLRVNAYVVHDTADWKDLNLKFVLQVYRDYYLTGDQSFLRDMWPVCLAVMESE 655

Query: 663 DQFDRDGDGMIENDGFPDQTYDTWSVSGISAYSGGLWVAALQAASALAREVGDRGSEDYF 722
            +FD+D DG+IEN G+ DQTYD W  +G SAY GGLW+AA+     +A   G +  +D F
Sbjct: 656 MKFDKDQDGLIENGGYADQTYDGWITTGPSAYCGGLWLAAVAVMVQIAVLCGAQDVQDKF 715

Query: 723 LFKFQKAKVVYEK-LWNGSYFNYDNSGSSQSSSIQADQLAGQWYARACGL----LPIVDE 777
               ++ +  YE+ LWNG Y+NYD S   QS SI +DQ AGQW+ +A GL      +   
Sbjct: 716 SSILRRGREAYERLLWNGRYYNYDCSPQPQSCSIMSDQCAGQWFLKASGLGEGDTEVFPT 775

Query: 778 DKARSALKKVYNYNVLKVMGGKRGAVNGMLPDGRVDMSSMQSREIWSGVTYAVAASMIHE 837
                AL+ ++ +NV    GG  GAVNGM P G  D SS+QS E+W GV Y +AA+MI E
Sbjct: 776 PHVVCALQTIFEFNVQAFAGGAMGAVNGMQPHGVPDRSSVQSDEVWVGVVYGLAATMIQE 835

Query: 838 DLADIGFQTACGIYEAAWSGTGLGYAFQTPEAWNTDDQYRSLCYMRPLAIW 888
            L   GFQTA G Y   W   GL  AFQTPEA+     +RSL YMRPL+IW
Sbjct: 836 GLTWEGFQTAEGCYRTVWERLGL--AFQTPEAYCQQRVFRSLAYMRPLSIW 884


>gi|449526389|ref|XP_004170196.1| PREDICTED: non-lysosomal glucosylceramidase-like, partial [Cucumis
           sativus]
          Length = 440

 Score =  567 bits (1462), Expect = e-158,   Method: Compositional matrix adjust.
 Identities = 274/441 (62%), Positives = 337/441 (76%), Gaps = 3/441 (0%)

Query: 15  EKEPLNSSFDKVDPAKPASLTWQRKLS-TGEIPLSQFTLNWKETVQLAPIGVRILCLIRE 73
           EK    +S  +VDP+KP SLTW+RKL  TG+ P S F+    +   +   G R+    +E
Sbjct: 2   EKGGNGASSTEVDPSKPPSLTWKRKLDFTGKSPES-FSFTLTDAWHMGMTGYRLWRNGKE 60

Query: 74  EAAKGKRAFIDPFIKRHLTSSHGVPLGGVGSGSIGRSYRGEFQRWQIFPRVCEDKPVLAN 133
           E AKG+    + F    +T  HGV LGG+G+GSIGRSYRGEFQR+Q+F   CED+PVLAN
Sbjct: 61  EIAKGRIPIYEFFSDVPITCYHGVSLGGIGAGSIGRSYRGEFQRFQMFYGPCEDEPVLAN 120

Query: 134 QFSVFVSRSNGQKYSSVLCPKTPEVLKDTTAAGIGSWDWNLKGDKSTYHALYPRAWTVHE 193
           QFSVFVSR NG K+SSVLC   P+  KD    GI SWDWNL G+ STYHAL+PR+WTV++
Sbjct: 121 QFSVFVSRPNGNKFSSVLCSAKPQKSKDGKQTGIESWDWNLSGENSTYHALFPRSWTVYD 180

Query: 194 GEPDPELRIVCRQISPIIPHNYKESSYPVSVFTYTIYNSGKTSADITLLFTWTNSVGGDS 253
           GEPDP+L+IVCRQ+SPIIPHNYKESS+PVSVFT+ + N G+TSA +TLLFTW NSVGG S
Sbjct: 181 GEPDPDLKIVCRQMSPIIPHNYKESSFPVSVFTFKLSNEGQTSAQVTLLFTWANSVGGKS 240

Query: 254 EFTGQHYNSKTKMNDGVHAVLLHHRTSHQLPPVTFALAAQETDGVHVSLCPHFVISGNSL 313
            FTG H+NSK    DG   VLLHH++++  P VT+ +AA+ TD VHVSLCP FVISG+S 
Sbjct: 241 GFTGHHFNSKMGAEDGAQGVLLHHKSANGRPTVTYGIAAEATDDVHVSLCPCFVISGDSE 300

Query: 314 GLTAKDMWHEIKEHGSFDRLNSMETSVTSEPGSSIGAAIAASVTVPPDSEGQVTFSLAWD 373
           G++AKDMW EIK HGSFD L S+  +  S+PG SIGAA+AA++T+PP S   VTFSLAWD
Sbjct: 301 GISAKDMWQEIKNHGSFDNLGSVGANEGSKPGCSIGAAVAATLTIPPTSARTVTFSLAWD 360

Query: 374 CPEVNFMSGKTYYRRYTKFYGTHQNAAANIARDAILEHGSWELQIEAWQRPILEDKRLPE 433
           CPEV F  GKTY+R+Y+KFYGT  +AA  IARDAI +HG WE +IEAWQRPI+EDKRLP+
Sbjct: 361 CPEVKF-DGKTYHRQYSKFYGTLGDAAEIIARDAISKHGKWEAEIEAWQRPIIEDKRLPD 419

Query: 434 WYPITLFNELYYLNAGGAVWT 454
           WYP+TLFNELY+LN+GG +WT
Sbjct: 420 WYPVTLFNELYFLNSGGTIWT 440


>gi|50511011|dbj|BAD32491.1| mKIAA1605 protein [Mus musculus]
          Length = 940

 Score =  565 bits (1457), Expect = e-158,   Method: Compositional matrix adjust.
 Identities = 335/855 (39%), Positives = 471/855 (55%), Gaps = 82/855 (9%)

Query: 46  PLSQFTLNWKETVQLAPIGVRIL-CLIREEAAKGKRAFIDPFIKRHLTSSHGVPLGGVGS 104
           P     ++    V+   +G+R L    R+   + K  FID      L   +G PLGG+G 
Sbjct: 113 PFQANNISLSNLVKHLGMGLRYLKWWYRKTHVEKKTPFIDMLNSLPLRQIYGCPLGGIGG 172

Query: 105 GSIGRSYRGEFQRWQIFPRVCEDKPVLANQFSVFVSRSNGQKYSSVLCPKTPEVLKDTTA 164
           G+I R +RG+F RWQ+ P + + + V+A+QF V + R     Y  VL  + P VL+    
Sbjct: 173 GTITRGWRGQFCRWQLNPGMYQHQTVIADQFIVCLRRDGRTVYQQVLSLELPNVLR---- 228

Query: 165 AGIGSWDWNLKGDKSTYHALYPRAWTVHEGEPDPELRIVCRQISPIIPHNYKESSYPVSV 224
               SW+W L G  + YHALYPRAWTV++  P   + + CRQ++PI+PH+Y++SS PV V
Sbjct: 229 ----SWNWGLCGYFAFYHALYPRAWTVYQ-LPGQNVTLTCRQVTPILPHDYQDSSLPVGV 283

Query: 225 FTYTIYNSGKTSADITLLFTWTNSVGGDSEFTGQHYNSKTKMNDG---VHAVLLHHRTSH 281
           F + + N G  + D+++ F+  N +GG+ +  G  +N   ++  G   V  +LLHH T  
Sbjct: 284 FVWDVENEGDETLDVSITFSMRNGLGGEDDAAGSLWNEPFRLEQGGTTVQGLLLHHPTPP 343

Query: 282 QLPPVTFALAAQETDGVHVSLCPHFVISGNSLGLTAKDMWHEIKEHGSFDRLNSMETSVT 341
              P T A+AA+ T    V+    F  +G     T + +W ++ + G  D  +    S  
Sbjct: 344 N--PYTMAVAARCTADTTVTHTTAFDPNG-----TGQQVWQDLLQDGQLD--SPAGQSTP 394

Query: 342 SEPGSSIGAAIAASVTVPPDSEGQVTFSLAWDCPEVNF-MSGKTYYRRYTKFYGTHQNAA 400
           ++ G  I  A+  S  + P S   + FSLAWD P++ F    + +YRRYT+F+G+  + A
Sbjct: 395 TQKGEGIAGAVCVSSKLLPRSRCCLEFSLAWDMPKIMFGAKSQVHYRRYTRFFGSDGDVA 454

Query: 401 ANIARDAILEHGSWELQIEAWQRPILEDKRLPEWYPITLFNELYYLNAGGAVWTDGSPPV 460
             ++  A+  +  WE +I AWQ P+L+D+ LP WY   LFNELY+L  GG VW       
Sbjct: 455 PALSHYALCHYADWEDRISAWQNPVLDDRTLPAWYKSALFNELYFLADGGTVW------- 507

Query: 461 HSLVTIGHRKFSLDWSQSDLKRIVDVPNQNDTAVNILERMSSILEQIYTPVALNSSFGTN 520
                                  ++VP     A ++ E +   + Q+            +
Sbjct: 508 -----------------------LEVP-----ADSLPEGLGGSMRQL-----------RS 528

Query: 521 LLQDGEENIGQFLYLEGIEYLMWNTYDVHFYSSFALIMLFPKIQLSIQRDFAAAVMMHDP 580
            LQD     G+F YLEG EY M+NTYDVHFY+SFAL+ML+PK++LS+Q D A A +  D 
Sbjct: 529 TLQD----YGRFGYLEGQEYRMYNTYDVHFYASFALVMLWPKLELSLQYDMALATLKEDL 584

Query: 581 SKMKLLDDGQWVSRKVLGAVPHDIGICD--PWFEVNAYCLYDTARWKDLNPKFVLQVYRD 638
           ++ + L  G     K    +PHDIG  D  PW  VNAY ++DTA WKDLN KFVLQ+YRD
Sbjct: 585 TRRRYLMSGVVAPVKRRNVIPHDIGDPDDEPWLRVNAYLIHDTADWKDLNLKFVLQIYRD 644

Query: 639 VVATGDKKFAKAVWPSVYVAMAYMDQFDRDGDGMIENDGFPDQTYDTWSVSGISAYSGGL 698
              TGD+ F + +WP     M    +FD+D DG+IEN G+ DQTYD W  +G SAY GGL
Sbjct: 645 YYLTGDQGFLEDMWPVCLAVMESEMKFDKDQDGLIENGGYADQTYDAWVTTGPSAYCGGL 704

Query: 699 WVAALQAASALAREVGDRGSEDYFLFKFQKAKVVYEK-LWNGSYFNYDNSGSSQSSSIQA 757
           W+AA+     +A   G +  ++ F     + +  YE+ LWNG Y+NYD+S   QS SI +
Sbjct: 705 WLAAVAVMVQMAVLCGAQDVQERFASILCRGREAYERLLWNGRYYNYDSSSHPQSRSIMS 764

Query: 758 DQLAGQWYARACGL----LPIVDEDKARSALKKVYNYNVLKVMGGKRGAVNGMLPDGRVD 813
           DQ AGQW+ RACGL      +        AL+ ++  NV    GG  GAVNGM P G  D
Sbjct: 765 DQCAGQWFLRACGLGEGDTEVFPTLHVVRALQTIFELNVQAFAGGAMGAVNGMHPHGVPD 824

Query: 814 MSSMQSREIWSGVTYAVAASMIHEDLADIGFQTACGIYEAAWSGTGLGYAFQTPEAWNTD 873
            SS+QS E+W GV Y +AA+MI E L   GF+TA G Y   W   GL  AFQTPEA+   
Sbjct: 825 RSSVQSDEVWVGVVYGLAATMIQEGLTWEGFRTAEGCYRTVWERLGL--AFQTPEAYCQQ 882

Query: 874 DQYRSLCYMRPLAIW 888
             +RSL YMRPL+IW
Sbjct: 883 QVFRSLAYMRPLSIW 897


>gi|426361727|ref|XP_004048051.1| PREDICTED: non-lysosomal glucosylceramidase isoform 2 [Gorilla
           gorilla gorilla]
          Length = 933

 Score =  565 bits (1457), Expect = e-158,   Method: Compositional matrix adjust.
 Identities = 332/837 (39%), Positives = 457/837 (54%), Gaps = 93/837 (11%)

Query: 72  REEAAKGKRAFIDPFIKRHLTSSHGVPLGGVGSGSIGRSYRGEFQRWQIFP------RVC 125
           R+   + K  FID      L   +G PLGG+G G+I R +RG+F RWQ+ P       V 
Sbjct: 127 RKTQVEKKTPFIDMINSVPLRQIYGCPLGGIGGGTITRGWRGQFCRWQLNPGMYQHRTVI 186

Query: 126 EDKPVLANQFSVFVSRSNGQKYSSVLCPKTPEVLKDTTAAGIGSWDWNLKGDKSTYHALY 185
            D+P+   +F+V + R     Y  VL  + P VL+        +W+W L G  + YHALY
Sbjct: 187 ADQPICPLKFTVCLRREGQTVYQQVLSLERPSVLR--------TWNWGLCGYFAFYHALY 238

Query: 186 PRAWTVHEGEPDPELRIVCRQISPIIPHNYKESSYPVSVFTYTIYNSGKTSADITLLFTW 245
           PRAWTV++  P   + + CRQI+PI+PH+Y++SS PV VF + + N G  + D++++F+ 
Sbjct: 239 PRAWTVYQ-LPGQNVTLTCRQITPILPHDYQDSSLPVGVFVWDVENEGDEALDVSIMFSM 297

Query: 246 TNSVGGDSEFTGQHYNSK---TKMNDGVHAVLLHHRTSHQLP-PVTFALAAQETDGVHVS 301
            N +GG  +  G  +N      +  + V  +LLHH T   LP P T A+AA+ T    V+
Sbjct: 298 RNGLGGGDDAPGGLWNEPFCLERSGETVRGLLLHHPT---LPNPYTMAVAARVTADTTVT 354

Query: 302 LCPHFVISGNSLGLTAKDMWHEIKEHGSFDRLNSMETSVTSEPGSSIGAAIAASVTVPPD 361
                 I+      T + +W ++ + G  D  +    S  ++ G  I  A+  S  + P 
Sbjct: 355 H-----ITAFDPDSTGQQVWQDLLQDGQLD--SPTGQSTPTQKGVGIAGAVCVSSKLRPR 407

Query: 362 SEGQVTFSLAWDCPEVNF-MSGKTYYRRYTKFYGTHQNAAANIARDAILEHGSWELQIEA 420
            + ++ FSLAWD P + F   G+ +YRRYT+F+G   +AA  ++  A+  +  WE +I A
Sbjct: 408 GQCRLEFSLAWDMPRIMFGAKGQVHYRRYTRFFGQDGDAAPALSHYALCRYAEWEERISA 467

Query: 421 WQRPILEDKRLPEWYPITLFNELYYLNAGGAVWTDGSPPVHSLVTIGHRKFSLDWSQSDL 480
           WQ P+L+D+ LP WY   LFNELY+L  GG VW +                         
Sbjct: 468 WQSPVLDDRSLPAWYKSALFNELYFLADGGTVWLE------------------------- 502

Query: 481 KRIVDVPNQNDTAVNILERMSSILEQIYTPVALNSSFGTNL--LQDGEENIGQFLYLEGI 538
                                 +LE      +L    G N+  L+    + G+F YLEG 
Sbjct: 503 ----------------------VLED-----SLPEELGRNMCHLRPTLRDYGRFGYLEGQ 535

Query: 539 EYLMWNTYDVHFYSSFALIMLFPKIQLSIQRDFAAAVMMHDPSKMKLLDDGQWVSRKVLG 598
           EY M+NTYDVHFY+SFALIML+PK++LS+Q D A A +  D ++ + L  G     K   
Sbjct: 536 EYRMYNTYDVHFYASFALIMLWPKLELSLQYDMALATLREDLTRRRYLMSGVMAPVKRRN 595

Query: 599 AVPHDIGICD--PWFEVNAYCLYDTARWKDLNPKFVLQVYRDVVATGDKKFAKAVWPSVY 656
            +PHDIG  D  PW  VNAY ++DTA WKDLN KFVLQVYRD   TGD+ F K +WP   
Sbjct: 596 VIPHDIGDPDDEPWLRVNAYLIHDTADWKDLNLKFVLQVYRDYYLTGDQNFLKDMWPVCL 655

Query: 657 VAMAYMDQFDRDGDGMIENDGFPDQTYDTWSVSGISAYSGGLWVAALQAASALAREVGDR 716
             M    +FD+D DG+IEN G+ DQTYD W  +G SAY GGLW+AA+     +A   G +
Sbjct: 656 AVMESEMKFDKDHDGLIENGGYADQTYDGWVTTGPSAYCGGLWLAAVAVMVQMAALCGAQ 715

Query: 717 GSEDYFLFKFQKAKVVYEK-LWNGSYFNYDNSGSSQSSSIQADQLAGQWYARACGL---- 771
             +D F     + +  YE+ LWNG Y+NYD+S   QS S+ +DQ AGQW+ +ACGL    
Sbjct: 716 DIQDKFSSILSRGQEAYERLLWNGRYYNYDSSSRPQSRSVMSDQCAGQWFLKACGLGEGD 775

Query: 772 LPIVDEDKARSALKKVYNYNVLKVMGGKRGAVNGMLPDGRVDMSSMQSREIWSGVTYAVA 831
             +        AL+ ++  NV    GG  GAVNGM P G  D SS+QS E+W GV Y +A
Sbjct: 776 TEVFPTQHVVRALQTIFELNVQAFAGGAMGAVNGMQPHGVPDKSSVQSDEVWVGVVYGLA 835

Query: 832 ASMIHEDLADIGFQTACGIYEAAWSGTGLGYAFQTPEAWNTDDQYRSLCYMRPLAIW 888
           A+MI E L   GFQTA G Y   W   GL  AFQTPEA+     +RSL YMRPL+IW
Sbjct: 836 ATMIQEGLTWEGFQTAEGCYRTVWERLGL--AFQTPEAYCQQRVFRSLAYMRPLSIW 890


>gi|395740303|ref|XP_003777396.1| PREDICTED: non-lysosomal glucosylceramidase isoform 2 [Pongo
           abelii]
          Length = 933

 Score =  565 bits (1457), Expect = e-158,   Method: Compositional matrix adjust.
 Identities = 334/830 (40%), Positives = 454/830 (54%), Gaps = 93/830 (11%)

Query: 79  KRAFIDPFIKRHLTSSHGVPLGGVGSGSIGRSYRGEFQRWQIFP------RVCEDKPVLA 132
           K  FID      L   +G PLGG+G G+I R +RG+F RWQ+ P       V  D+PV  
Sbjct: 134 KTPFIDMINSVPLRQIYGCPLGGIGGGTITRGWRGQFCRWQLNPGMYQHRTVIADQPVCP 193

Query: 133 NQFSVFVSRSNGQKYSSVLCPKTPEVLKDTTAAGIGSWDWNLKGDKSTYHALYPRAWTVH 192
            +F+V + R     Y  VL  + P VL+        SW+W L G  + YHALYPRAWTV+
Sbjct: 194 LKFTVCLRREGQTVYQQVLSLERPTVLR--------SWNWGLCGYFAFYHALYPRAWTVY 245

Query: 193 EGEPDPELRIVCRQISPIIPHNYKESSYPVSVFTYTIYNSGKTSADITLLFTWTNSVGGD 252
           +  P   + + CRQI+PI+PH+Y++SS PV VF + + N G  + D++++F+  N +GG 
Sbjct: 246 Q-LPGQNVTLTCRQITPILPHDYQDSSLPVGVFVWDVENEGDEALDVSIMFSMRNGLGGG 304

Query: 253 SEFTGQHYNSKTKMNDG---VHAVLLHHRTSHQLP-PVTFALAAQETDGVHVSLCPHFVI 308
            +  G  +N    +  G   V  +LLHH T   LP P T A+AA+ T    V+      I
Sbjct: 305 DDAPGGLWNEPFCLERGRETVQGLLLHHPT---LPNPYTMAVAARVTADTTVTH-----I 356

Query: 309 SGNSLGLTAKDMWHEIKEHGSFDRLNSMETSVTSEPGSSIGAAIAASVTVPPDSEGQVTF 368
           +      T + +W ++ + G  D  +    S  ++ G  I  A+  S  + P  + ++ F
Sbjct: 357 TAFDPDSTGQQVWQDLLQDGQLD--SPTGQSTPTQKGVGIAGAVCISSKLRPRGQCRLEF 414

Query: 369 SLAWDCPEVNF-MSGKTYYRRYTKFYGTHQNAAANIARDAILEHGSWELQIEAWQRPILE 427
           SLAWD P + F   G+ +YRRYT+F+G   +AA  ++  A+  +  WE +I AWQ P+L+
Sbjct: 415 SLAWDMPRIMFGAKGQVHYRRYTRFFGQDGDAAPALSHYALCRYAEWEERISAWQSPVLD 474

Query: 428 DKRLPEWYPITLFNELYYLNAGGAVWTDGSPPVHSLVTIGHRKFSLDWSQSDLKRIVDVP 487
           D+ LP WY   LFNELY+L  GG VW +                                
Sbjct: 475 DRSLPAWYKSALFNELYFLADGGTVWLE-------------------------------- 502

Query: 488 NQNDTAVNILERMSSILEQIYTPVALNSSFGTNL--LQDGEENIGQFLYLEGIEYLMWNT 545
                          +LE      +L    G N+  L+    + G+F YLEG EY M+NT
Sbjct: 503 ---------------VLED-----SLPEELGRNMCHLRPTLRDYGRFGYLEGQEYRMYNT 542

Query: 546 YDVHFYSSFALIMLFPKIQLSIQRDFAAAVMMHDPSKMKLLDDGQWVSRKVLGAVPHDIG 605
           YDVHFY+SFALIML+PK++LS+Q D A A +  D ++ + L  G     K    +PHDIG
Sbjct: 543 YDVHFYASFALIMLWPKLELSLQYDMALATLREDLTRRRYLMSGVMAPVKRRNVIPHDIG 602

Query: 606 ICD--PWFEVNAYCLYDTARWKDLNPKFVLQVYRDVVATGDKKFAKAVWPSVYVAMAYMD 663
             D  PW  VNAY ++DTA WKDLN KFVLQVYRD   TGD+ F K +WP     M    
Sbjct: 603 DPDDEPWLRVNAYLIHDTADWKDLNLKFVLQVYRDYYLTGDQNFLKDMWPVCLAVMESEM 662

Query: 664 QFDRDGDGMIENDGFPDQTYDTWSVSGISAYSGGLWVAALQAASALAREVGDRGSEDYFL 723
           +FD+D DG+IEN G+ DQTYD W  +G SAY GGLW+AA+     +A   G +  +D F 
Sbjct: 663 KFDKDHDGLIENGGYADQTYDGWVTTGPSAYCGGLWLAAVAVMVQMAALCGAQDIQDKFS 722

Query: 724 FKFQKAKVVYEK-LWNGSYFNYDNSGSSQSSSIQADQLAGQWYARACGL----LPIVDED 778
               + +  YE+ LWNG Y+NYD+S   QS S+ +DQ AGQW+ +ACGL      +    
Sbjct: 723 SILSRGQEAYERLLWNGRYYNYDSSSRPQSRSVMSDQCAGQWFLKACGLGEGDTEVFPTP 782

Query: 779 KARSALKKVYNYNVLKVMGGKRGAVNGMLPDGRVDMSSMQSREIWSGVTYAVAASMIHED 838
               AL+ ++  NV    GG  GAVNGM P G  D SS+QS E+W GV Y +AA+MI E 
Sbjct: 783 HVVRALQTIFELNVQAFAGGAMGAVNGMQPHGVPDKSSVQSDEVWVGVVYGLAATMIQEG 842

Query: 839 LADIGFQTACGIYEAAWSGTGLGYAFQTPEAWNTDDQYRSLCYMRPLAIW 888
           L   GFQTA G Y   W   GL  AFQTPEA+     +RSL YMRPL+IW
Sbjct: 843 LTWEGFQTAEGCYRTVWERLGL--AFQTPEAYCQQRVFRSLAYMRPLSIW 890


>gi|148670514|gb|EDL02461.1| glucosidase beta 2, isoform CRA_a [Mus musculus]
          Length = 918

 Score =  565 bits (1457), Expect = e-158,   Method: Compositional matrix adjust.
 Identities = 335/855 (39%), Positives = 471/855 (55%), Gaps = 82/855 (9%)

Query: 46  PLSQFTLNWKETVQLAPIGVRIL-CLIREEAAKGKRAFIDPFIKRHLTSSHGVPLGGVGS 104
           P     ++    V+   +G+R L    R+   + K  FID      L   +G PLGG+G 
Sbjct: 91  PFQANNISLSNLVKHLGMGLRYLKWWYRKTHVEKKTPFIDMLNSLPLRQIYGCPLGGIGG 150

Query: 105 GSIGRSYRGEFQRWQIFPRVCEDKPVLANQFSVFVSRSNGQKYSSVLCPKTPEVLKDTTA 164
           G+I R +RG+F RWQ+ P + + + V+A+QF V + R     Y  VL  + P VL+    
Sbjct: 151 GTITRGWRGQFCRWQLNPGMYQHQTVIADQFIVCLRRDGRTVYQQVLSLELPNVLR---- 206

Query: 165 AGIGSWDWNLKGDKSTYHALYPRAWTVHEGEPDPELRIVCRQISPIIPHNYKESSYPVSV 224
               SW+W L G  + YHALYPRAWTV++  P   + + CRQ++PI+PH+Y++SS PV V
Sbjct: 207 ----SWNWGLCGYFAFYHALYPRAWTVYQ-LPGQNVTLTCRQVTPILPHDYQDSSLPVGV 261

Query: 225 FTYTIYNSGKTSADITLLFTWTNSVGGDSEFTGQHYNSKTKMNDG---VHAVLLHHRTSH 281
           F + + N G  + D+++ F+  N +GG+ +  G  +N   ++  G   V  +LLHH T  
Sbjct: 262 FVWDVENEGDETLDVSITFSMRNGLGGEDDAAGSLWNEPFRLEQGGTTVQGLLLHHPTPP 321

Query: 282 QLPPVTFALAAQETDGVHVSLCPHFVISGNSLGLTAKDMWHEIKEHGSFDRLNSMETSVT 341
              P T A+AA+ T    V+    F  +G     T + +W ++ + G  D  +    S  
Sbjct: 322 N--PYTMAVAARCTADTTVTHTTAFDPNG-----TGQQVWQDLLQDGQLD--SPAGQSTP 372

Query: 342 SEPGSSIGAAIAASVTVPPDSEGQVTFSLAWDCPEVNF-MSGKTYYRRYTKFYGTHQNAA 400
           ++ G  I  A+  S  + P S   + FSLAWD P++ F    + +YRRYT+F+G+  + A
Sbjct: 373 TQKGEGIAGAVCVSSKLLPRSRCCLEFSLAWDMPKIMFGAKSQVHYRRYTRFFGSDGDVA 432

Query: 401 ANIARDAILEHGSWELQIEAWQRPILEDKRLPEWYPITLFNELYYLNAGGAVWTDGSPPV 460
             ++  A+  +  WE +I AWQ P+L+D+ LP WY   LFNELY+L  GG VW       
Sbjct: 433 PALSHYALCHYADWEDRISAWQNPVLDDRTLPAWYKSALFNELYFLADGGTVW------- 485

Query: 461 HSLVTIGHRKFSLDWSQSDLKRIVDVPNQNDTAVNILERMSSILEQIYTPVALNSSFGTN 520
                                  ++VP     A ++ E +   + Q+            +
Sbjct: 486 -----------------------LEVP-----ADSLPEGLGGSMRQL-----------RS 506

Query: 521 LLQDGEENIGQFLYLEGIEYLMWNTYDVHFYSSFALIMLFPKIQLSIQRDFAAAVMMHDP 580
            LQD     G+F YLEG EY M+NTYDVHFY+SFAL+ML+PK++LS+Q D A A +  D 
Sbjct: 507 TLQD----YGRFGYLEGQEYRMYNTYDVHFYASFALVMLWPKLELSLQYDMALATLKEDL 562

Query: 581 SKMKLLDDGQWVSRKVLGAVPHDIGICD--PWFEVNAYCLYDTARWKDLNPKFVLQVYRD 638
           ++ + L  G     K    +PHDIG  D  PW  VNAY ++DTA WKDLN KFVLQ+YRD
Sbjct: 563 TRRRYLMSGVVAPVKRRNVIPHDIGDPDDEPWLRVNAYLIHDTADWKDLNLKFVLQIYRD 622

Query: 639 VVATGDKKFAKAVWPSVYVAMAYMDQFDRDGDGMIENDGFPDQTYDTWSVSGISAYSGGL 698
              TGD+ F + +WP     M    +FD+D DG+IEN G+ DQTYD W  +G SAY GGL
Sbjct: 623 YYLTGDQGFLEDMWPVCLAVMESEMKFDKDQDGLIENGGYADQTYDAWVTTGPSAYCGGL 682

Query: 699 WVAALQAASALAREVGDRGSEDYFLFKFQKAKVVYEK-LWNGSYFNYDNSGSSQSSSIQA 757
           W+AA+     +A   G +  ++ F     + +  YE+ LWNG Y+NYD+S   QS SI +
Sbjct: 683 WLAAVAVMVQMAVLCGAQDVQERFASILCRGREAYERLLWNGRYYNYDSSSHPQSRSIMS 742

Query: 758 DQLAGQWYARACGL----LPIVDEDKARSALKKVYNYNVLKVMGGKRGAVNGMLPDGRVD 813
           DQ AGQW+ RACGL      +        AL+ ++  NV    GG  GAVNGM P G  D
Sbjct: 743 DQCAGQWFLRACGLGEGDTEVFPTLHVVRALQTIFELNVQAFAGGAMGAVNGMHPHGVPD 802

Query: 814 MSSMQSREIWSGVTYAVAASMIHEDLADIGFQTACGIYEAAWSGTGLGYAFQTPEAWNTD 873
            SS+QS E+W GV Y +AA+MI E L   GF+TA G Y   W   GL  AFQTPEA+   
Sbjct: 803 RSSVQSDEVWVGVVYGLAATMIQEGLTWEGFRTAEGCYRTVWERLGL--AFQTPEAYCQQ 860

Query: 874 DQYRSLCYMRPLAIW 888
             +RSL YMRPL+IW
Sbjct: 861 QVFRSLAYMRPLSIW 875


>gi|338720173|ref|XP_001497796.3| PREDICTED: non-lysosomal glucosylceramidase [Equus caballus]
          Length = 983

 Score =  565 bits (1456), Expect = e-158,   Method: Compositional matrix adjust.
 Identities = 332/822 (40%), Positives = 458/822 (55%), Gaps = 83/822 (10%)

Query: 79  KRAFIDPFIKRHLTSSHGVPLGGVGSGSIGRSYRGEFQRWQIFPRVCEDKPVLANQFSVF 138
           K  FID      L   +G PLGG+G G+I R +RG+F RWQ+ P + + + V+A+QF+V 
Sbjct: 190 KTPFIDFMNCVPLRQIYGCPLGGIGGGTITRGWRGQFCRWQLNPGMYQHRTVIADQFTVC 249

Query: 139 VSRSNGQKYSSVLCPKTPEVLKDTTAAGIGSWDWNLKGDKSTYHALYPRAWTVHEGEPDP 198
           + R     Y  VL  + P VL+        SW+W L G  + YHALYPRAWTV++  P  
Sbjct: 250 LRREGQTVYQQVLSVERPRVLR--------SWNWGLCGYFAFYHALYPRAWTVYQ-LPGQ 300

Query: 199 ELRIVCRQISPIIPHNYKESSYPVSVFTYTIYNSGKTSADITLLFTWTNSVGGDSEFTGQ 258
            + + CRQI+PI+PH+Y++SS PV VF + + N G  + D++++F+  N +GG  +  G 
Sbjct: 301 NVTLTCRQITPILPHDYQDSSLPVGVFVWDVENEGDEALDVSIMFSMRNGLGGGDDAPGG 360

Query: 259 HYNSKTKM-NDG--VHAVLLHHRTSHQLP-PVTFALAAQETDGVHVSLCPHFVISGNSLG 314
            +N    +  DG     +LLHH T   LP P T A+AA+ T    V+      I+     
Sbjct: 361 LWNEPFCLEQDGETTQGLLLHHPT---LPNPYTMAVAARLTADTTVTH-----ITAFDPD 412

Query: 315 LTAKDMWHEIKEHGSFDRLNSMETSVTSEPGSSIGAAIAASVTVPPDSEGQVTFSLAWDC 374
            T + +W ++ + G  D  +    S  ++ G  I  A+  +  + P    ++ FSLAWD 
Sbjct: 413 STGQQVWQDLLQDGQLD--SPAGPSTPTQKGVGIAGAVCVAGKLSPRGRCRLEFSLAWDM 470

Query: 375 PEVNF-MSGKTYYRRYTKFYGTHQNAAANIARDAILEHGSWELQIEAWQRPILEDKRLPE 433
           P + F   G+ YYRRYT+F+G   +AA  ++  A+  +  WE +I AWQ PIL+D+ LP 
Sbjct: 471 PRIMFGAKGQDYYRRYTRFFGPDGDAAPALSHYALCRYADWEKRISAWQSPILDDRSLPA 530

Query: 434 WYPITLFNELYYLNAGGAVWTDGSPPVHSLVTIGHRKFSLDWSQSDLKRIVDVPNQNDTA 493
           WY   LFNELY+L  GG VW                              ++VP  +   
Sbjct: 531 WYKSALFNELYFLADGGTVW------------------------------LEVPEDS--- 557

Query: 494 VNILERMSSILEQIYTPVALNSSFGTNLLQDGEENIGQFLYLEGIEYLMWNTYDVHFYSS 553
             + E ++  + Q+             +LQ+     G+F YLEG EY M+NTYDVHFY+S
Sbjct: 558 --LPEELAGSMCQL-----------RPILQE----YGRFGYLEGQEYRMYNTYDVHFYAS 600

Query: 554 FALIMLFPKIQLSIQRDFAAAVMMHDPSKMKLLDDGQWVSRKVLGAVPHDIGICD--PWF 611
           FALIML+PK++LS+Q D A A +  D ++ + L  G     K    +PHDIG  D  PW 
Sbjct: 601 FALIMLWPKLELSLQYDMALATLGEDLTRRRYLMSGVMAPVKKRNVIPHDIGDPDDEPWL 660

Query: 612 EVNAYCLYDTARWKDLNPKFVLQVYRDVVATGDKKFAKAVWPSVYVAMAYMDQFDRDGDG 671
            VNAY ++DTA WKDLN KFVLQ+YRD   TGD+ F + +WP     M    +FD+D DG
Sbjct: 661 RVNAYLIHDTADWKDLNLKFVLQIYRDYYLTGDQGFLRDMWPVCLAVMESEMKFDKDQDG 720

Query: 672 MIENDGFPDQTYDTWSVSGISAYSGGLWVAALQAASALAREVGDRGSEDYFLFKFQKAKV 731
           +IEN G+ DQTYD W  +G SAY GGLW+AA+     +A   G +  +D F     + + 
Sbjct: 721 LIENGGYADQTYDGWVTTGPSAYCGGLWLAAVAVMVQMAALCGAQDVQDRFSSILSRGQE 780

Query: 732 VYEK-LWNGSYFNYDNSGSSQSSSIQADQLAGQWYARACGL----LPIVDEDKARSALKK 786
            YE+ LWNG Y+NYD S   QS SI +DQ AGQW+ RACGL      +        AL+ 
Sbjct: 781 AYERLLWNGRYYNYDCSSQPQSRSIMSDQCAGQWFLRACGLGEGDTEVFPTRHIVRALQT 840

Query: 787 VYNYNVLKVMGGKRGAVNGMLPDGRVDMSSMQSREIWSGVTYAVAASMIHEDLADIGFQT 846
           ++  NV    GG  GAVNGM P G  D +S+QS E+W GV Y +AA+MI E L   GF+T
Sbjct: 841 IFELNVQAFAGGAMGAVNGMQPHGVPDRTSVQSDEVWVGVVYGLAATMIQEGLTWEGFRT 900

Query: 847 ACGIYEAAWSGTGLGYAFQTPEAWNTDDQYRSLCYMRPLAIW 888
           A G Y   W    LG AFQTPEA+     +RSL YMRPL+IW
Sbjct: 901 AEGCYRTVWE--RLGLAFQTPEAYCQQQVFRSLAYMRPLSIW 940


>gi|240120073|ref|NP_766280.2| non-lysosomal glucosylceramidase [Mus musculus]
 gi|143018443|sp|Q69ZF3.2|GBA2_MOUSE RecName: Full=Non-lysosomal glucosylceramidase; Short=NLGase;
           AltName: Full=Beta-glucocerebrosidase 2;
           Short=Beta-glucosidase 2; AltName:
           Full=Glucosylceramidase 2
 gi|34784135|gb|AAH56935.1| Glucosidase beta 2 [Mus musculus]
          Length = 918

 Score =  565 bits (1456), Expect = e-158,   Method: Compositional matrix adjust.
 Identities = 335/855 (39%), Positives = 471/855 (55%), Gaps = 82/855 (9%)

Query: 46  PLSQFTLNWKETVQLAPIGVRIL-CLIREEAAKGKRAFIDPFIKRHLTSSHGVPLGGVGS 104
           P     ++    V+   +G+R L    R+   + K  FID      L   +G PLGG+G 
Sbjct: 91  PFQANNISLSNLVKHLGMGLRYLKWWYRKTHVEKKTPFIDMLNSLPLRQIYGCPLGGIGG 150

Query: 105 GSIGRSYRGEFQRWQIFPRVCEDKPVLANQFSVFVSRSNGQKYSSVLCPKTPEVLKDTTA 164
           G+I R +RG+F RWQ+ P + + + V+A+QF V + R     Y  VL  + P VL+    
Sbjct: 151 GTITRGWRGQFCRWQLNPGMYQHQTVIADQFIVCLRRDGRTVYQQVLSLELPNVLR---- 206

Query: 165 AGIGSWDWNLKGDKSTYHALYPRAWTVHEGEPDPELRIVCRQISPIIPHNYKESSYPVSV 224
               SW+W L G  + YHALYPRAWTV++  P   + + CRQ++PI+PH+Y++SS PV V
Sbjct: 207 ----SWNWGLCGYFAFYHALYPRAWTVYQ-LPGQNVTLTCRQVTPILPHDYQDSSLPVGV 261

Query: 225 FTYTIYNSGKTSADITLLFTWTNSVGGDSEFTGQHYNSKTKMNDG---VHAVLLHHRTSH 281
           F + + N G  + D+++ F+  N +GG+ +  G  +N   ++  G   V  +LLHH T  
Sbjct: 262 FVWDVENEGDETLDVSITFSMRNGLGGEDDAAGSLWNEPFRLEQGGTTVQGLLLHHPTPP 321

Query: 282 QLPPVTFALAAQETDGVHVSLCPHFVISGNSLGLTAKDMWHEIKEHGSFDRLNSMETSVT 341
              P T A+AA+ T    V+    F  +G     T + +W ++ + G  D  +    S  
Sbjct: 322 N--PYTMAVAARCTADTTVTHTTAFDPNG-----TGQQVWQDLLQDGQLD--SPAGQSTP 372

Query: 342 SEPGSSIGAAIAASVTVPPDSEGQVTFSLAWDCPEVNF-MSGKTYYRRYTKFYGTHQNAA 400
           ++ G  I  A+  S  + P S   + FSLAWD P++ F    + +YRRYT+F+G+  + A
Sbjct: 373 TQKGEGIAGAVCVSSKLLPRSRCCLEFSLAWDMPKIMFGAKSQVHYRRYTRFFGSDGDVA 432

Query: 401 ANIARDAILEHGSWELQIEAWQRPILEDKRLPEWYPITLFNELYYLNAGGAVWTDGSPPV 460
             ++  A+  +  WE +I AWQ P+L+D+ LP WY   LFNELY+L  GG VW       
Sbjct: 433 PALSHYALCHYADWEDRISAWQNPVLDDRTLPAWYKSALFNELYFLADGGTVW------- 485

Query: 461 HSLVTIGHRKFSLDWSQSDLKRIVDVPNQNDTAVNILERMSSILEQIYTPVALNSSFGTN 520
                                  ++VP     A ++ E +   + Q+            +
Sbjct: 486 -----------------------LEVP-----ADSLPEGLGGSMRQL-----------RS 506

Query: 521 LLQDGEENIGQFLYLEGIEYLMWNTYDVHFYSSFALIMLFPKIQLSIQRDFAAAVMMHDP 580
            LQD     G+F YLEG EY M+NTYDVHFY+SFAL+ML+PK++LS+Q D A A +  D 
Sbjct: 507 TLQD----YGRFGYLEGQEYRMYNTYDVHFYASFALVMLWPKLELSLQYDMALATLKEDL 562

Query: 581 SKMKLLDDGQWVSRKVLGAVPHDIGICD--PWFEVNAYCLYDTARWKDLNPKFVLQVYRD 638
           ++ + L  G     K    +PHDIG  D  PW  VNAY ++DTA WKDLN KFVLQ+YRD
Sbjct: 563 TRRRYLMSGVVAPVKRRNVIPHDIGDPDDEPWLRVNAYLIHDTADWKDLNLKFVLQIYRD 622

Query: 639 VVATGDKKFAKAVWPSVYVAMAYMDQFDRDGDGMIENDGFPDQTYDTWSVSGISAYSGGL 698
              TGD+ F + +WP     M    +FD+D DG+IEN G+ DQTYD W  +G SAY GGL
Sbjct: 623 YYLTGDQGFLEDMWPVCLAVMESEMKFDKDQDGLIENGGYADQTYDAWVTTGPSAYCGGL 682

Query: 699 WVAALQAASALAREVGDRGSEDYFLFKFQKAKVVYEK-LWNGSYFNYDNSGSSQSSSIQA 757
           W+AA+     +A   G +  ++ F     + +  YE+ LWNG Y+NYD+S   QS SI +
Sbjct: 683 WLAAVAVMVQMAVLCGAQDVQERFASILCRGREAYERLLWNGRYYNYDSSSHPQSRSIMS 742

Query: 758 DQLAGQWYARACGL----LPIVDEDKARSALKKVYNYNVLKVMGGKRGAVNGMLPDGRVD 813
           DQ AGQW+ RACGL      +        AL+ ++  NV    GG  GAVNGM P G  D
Sbjct: 743 DQCAGQWFLRACGLGEGDTEVFPTLHVVRALQTIFELNVQAFAGGAMGAVNGMHPHGVPD 802

Query: 814 MSSMQSREIWSGVTYAVAASMIHEDLADIGFQTACGIYEAAWSGTGLGYAFQTPEAWNTD 873
            SS+QS E+W GV Y +AA+MI E L   GF+TA G Y   W   GL  AFQTPEA+   
Sbjct: 803 RSSVQSDEVWVGVVYGLAATMIQEGLTWEGFRTAEGCYRTVWERLGL--AFQTPEAYCQQ 860

Query: 874 DQYRSLCYMRPLAIW 888
             +RSL YMRPL+IW
Sbjct: 861 QVFRSLAYMRPLSIW 875


>gi|26354314|dbj|BAC40785.1| unnamed protein product [Mus musculus]
          Length = 918

 Score =  565 bits (1455), Expect = e-158,   Method: Compositional matrix adjust.
 Identities = 335/855 (39%), Positives = 471/855 (55%), Gaps = 82/855 (9%)

Query: 46  PLSQFTLNWKETVQLAPIGVRIL-CLIREEAAKGKRAFIDPFIKRHLTSSHGVPLGGVGS 104
           P     ++    V+   +G+R L    R+   + K  FID      L   +G PLGG+G 
Sbjct: 91  PFQANNISLSNLVKHLGMGLRYLKWWYRKTHVEKKTPFIDMLNSLPLRQIYGCPLGGIGG 150

Query: 105 GSIGRSYRGEFQRWQIFPRVCEDKPVLANQFSVFVSRSNGQKYSSVLCPKTPEVLKDTTA 164
           G+I R +RG+F RWQ+ P + + + V+A+QF V + R     Y  VL  + P VL+    
Sbjct: 151 GTITRGWRGQFCRWQLNPGMYQHQTVIADQFIVCLRRDGRTVYQQVLSLELPNVLR---- 206

Query: 165 AGIGSWDWNLKGDKSTYHALYPRAWTVHEGEPDPELRIVCRQISPIIPHNYKESSYPVSV 224
               SW+W L G  + YHALYPRAWTV++  P   + + CRQ++PI+PH+Y++SS PV V
Sbjct: 207 ----SWNWGLCGYFAFYHALYPRAWTVYQ-LPGQNVTLTCRQVTPILPHDYQDSSLPVGV 261

Query: 225 FTYTIYNSGKTSADITLLFTWTNSVGGDSEFTGQHYNSKTKMNDG---VHAVLLHHRTSH 281
           F + + N G  + D+++ F+  N +GG+ +  G  +N   ++  G   V  +LLHH T  
Sbjct: 262 FVWDVENEGDETLDVSITFSMRNGLGGEDDAAGSLWNEPFRLEQGGTTVQGLLLHHPTPP 321

Query: 282 QLPPVTFALAAQETDGVHVSLCPHFVISGNSLGLTAKDMWHEIKEHGSFDRLNSMETSVT 341
              P T A+AA+ T    V+    F  +G     T + +W ++ + G  D  +    S  
Sbjct: 322 N--PYTMAVAARCTADTTVTHTTAFDPNG-----TGQQVWQDLLQDGQLD--SPAGQSTP 372

Query: 342 SEPGSSIGAAIAASVTVPPDSEGQVTFSLAWDCPEVNF-MSGKTYYRRYTKFYGTHQNAA 400
           ++ G  I  A+  S  + P S   + FSLAWD P++ F    + +YRRYT+F+G+  + A
Sbjct: 373 TQKGEGIAGAVCISSKLLPRSRCCLEFSLAWDMPKIMFGAKSQVHYRRYTRFFGSDGDVA 432

Query: 401 ANIARDAILEHGSWELQIEAWQRPILEDKRLPEWYPITLFNELYYLNAGGAVWTDGSPPV 460
             ++  A+  +  WE +I AWQ P+L+D+ LP WY   LFNELY+L  GG VW       
Sbjct: 433 PALSHYALCHYADWEDRISAWQNPVLDDRTLPAWYKSALFNELYFLADGGTVW------- 485

Query: 461 HSLVTIGHRKFSLDWSQSDLKRIVDVPNQNDTAVNILERMSSILEQIYTPVALNSSFGTN 520
                                  ++VP     A ++ E +   + Q+            +
Sbjct: 486 -----------------------LEVP-----ADSLPEGLGGSMRQL-----------RS 506

Query: 521 LLQDGEENIGQFLYLEGIEYLMWNTYDVHFYSSFALIMLFPKIQLSIQRDFAAAVMMHDP 580
            LQD     G+F YLEG EY M+NTYDVHFY+SFAL+ML+PK++LS+Q D A A +  D 
Sbjct: 507 TLQD----YGRFGYLEGQEYRMYNTYDVHFYASFALVMLWPKLELSLQYDMALATLKEDL 562

Query: 581 SKMKLLDDGQWVSRKVLGAVPHDIGICD--PWFEVNAYCLYDTARWKDLNPKFVLQVYRD 638
           ++ + L  G     K    +PHDIG  D  PW  VNAY ++DTA WKDLN KFVLQ+YRD
Sbjct: 563 TRRRYLMSGVVAPVKRRNVIPHDIGDPDDEPWLRVNAYLIHDTADWKDLNLKFVLQIYRD 622

Query: 639 VVATGDKKFAKAVWPSVYVAMAYMDQFDRDGDGMIENDGFPDQTYDTWSVSGISAYSGGL 698
              TGD+ F + +WP     M    +FD+D DG+IEN G+ DQTYD W  +G SAY GGL
Sbjct: 623 YYLTGDQGFLEDMWPVCLAVMESEMKFDKDQDGLIENGGYADQTYDAWVTTGPSAYCGGL 682

Query: 699 WVAALQAASALAREVGDRGSEDYFLFKFQKAKVVYEK-LWNGSYFNYDNSGSSQSSSIQA 757
           W+AA+     +A   G +  ++ F     + +  YE+ LWNG Y+NYD+S   QS SI +
Sbjct: 683 WLAAVAVMVQMAVLCGAQDVQERFASILCRGREAYERLLWNGRYYNYDSSSHPQSRSIMS 742

Query: 758 DQLAGQWYARACGL----LPIVDEDKARSALKKVYNYNVLKVMGGKRGAVNGMLPDGRVD 813
           DQ AGQW+ RACGL      +        AL+ ++  NV    GG  GAVNGM P G  D
Sbjct: 743 DQCAGQWFLRACGLGEGDTEVFPTLHVVRALQTIFELNVQAFAGGAMGAVNGMHPHGVPD 802

Query: 814 MSSMQSREIWSGVTYAVAASMIHEDLADIGFQTACGIYEAAWSGTGLGYAFQTPEAWNTD 873
            SS+QS E+W GV Y +AA+MI E L   GF+TA G Y   W   GL  AFQTPEA+   
Sbjct: 803 RSSVQSDEVWVGVVYGLAATMIQEGLTWEGFRTAEGCYRTVWERLGL--AFQTPEAYCQQ 860

Query: 874 DQYRSLCYMRPLAIW 888
             +RSL YMRPL+IW
Sbjct: 861 QVFRSLAYMRPLSIW 875


>gi|395515264|ref|XP_003761826.1| PREDICTED: non-lysosomal glucosylceramidase [Sarcophilus harrisii]
          Length = 954

 Score =  563 bits (1451), Expect = e-157,   Method: Compositional matrix adjust.
 Identities = 336/863 (38%), Positives = 472/863 (54%), Gaps = 82/863 (9%)

Query: 46  PLSQFTLNWKETVQLAPIGVRIL-CLIREEAAKGKRAFIDPFIKRHLTSSHGVPLGGVGS 104
           P +   ++   T++   +G R L    ++   + K  FID      L   +G PLGG+G 
Sbjct: 109 PFNANNVSLSNTLKHLGMGFRYLKWWYQKTQVEKKTPFIDMLNCLPLRQIYGCPLGGIGG 168

Query: 105 GSIGRSYRGEFQRWQIFPRVCEDKPVLANQFSVFVSRSNGQKYSSVLCPKTPEVLKDTTA 164
           G+I R +RG+F RWQ+ P + + + V+A+QF+V + R     Y  VL  + P VL+    
Sbjct: 169 GTITRGWRGQFCRWQLNPGIYQHRTVMADQFTVCLRREGQTVYQQVLSLERPNVLR---- 224

Query: 165 AGIGSWDWNLKGDKSTYHALYPRAWTVHEGEPDPELRIVCRQISPIIPHNYKESSYPVSV 224
               SW+W L G  + YHALYPRAWTV++  P  ++ + C QI+PI+PH+Y++SS PV V
Sbjct: 225 ----SWNWGLCGFFAFYHALYPRAWTVYQ-LPGQQVTLTCLQITPILPHDYQDSSLPVGV 279

Query: 225 FTYTIYNSGKTSADITLLFTWTNSVGGDSEFTGQHYNSKTKM-NDG--VHAVLLHHRTSH 281
           F + + N+G    +++++F+  N +GG  +  G  +N    +  DG  V  VLLHH    
Sbjct: 280 FVWNVENTGDEDLEVSIMFSMRNGLGGGHDNLGGLWNEPFCLERDGETVQGVLLHHPNPP 339

Query: 282 QLPPVTFALAAQETDGVHVSLCPHFVISGNSLGLTAKDMWHEIKEHGSFDRLNSMETSVT 341
              P T ALAA+ T    V+    F   G     T + +W ++   G  D  + +  S  
Sbjct: 340 N--PYTMALAARRTANTTVTHITAFDPDG-----TGQKVWQDLLLDGQLD--SPLGPSPP 390

Query: 342 SEPGSSIGAAIAASVTVPPDSEGQVTFSLAWDCPEVNF-MSGKTYYRRYTKFYGTHQNAA 400
           ++    IG A+ A   +P     Q+ F+L+WD P+V F   G+ +YRRYT+F+G   NAA
Sbjct: 391 TQKRQGIGGAVCAGAHLPRRGRCQLEFALSWDMPKVLFGAKGQAHYRRYTRFFGQEGNAA 450

Query: 401 ANIARDAILEHGSWELQIEAWQRPILEDKRLPEWYPITLFNELYYLNAGGAVWTDGSPPV 460
             +   A+ ++ SWE +I AWQ P+LED+ LP WY   LFNELY+L  GG VW      +
Sbjct: 451 PALCHYALTQYRSWEEKIIAWQSPVLEDRTLPAWYKSALFNELYFLADGGTVW------L 504

Query: 461 HSLVTIGHRKFSLDWSQSDLKRIVDVPNQNDTAVNILERMSSILEQIYTPVALNSSFGTN 520
                    +FS                       +L ++   L++          +G  
Sbjct: 505 EVTEETITEEFS----------------------GMLNQLRPTLQE----------YGRF 532

Query: 521 LLQDGEENIGQFLYLEGIEYLMWNTYDVHFYSSFALIMLFPKIQLSIQRDFAAAVMMHDP 580
              +G+E            Y M+NTYDVHFY+SFALIML+PK++LS+Q D A A +  D 
Sbjct: 533 AYLEGQE------------YHMYNTYDVHFYASFALIMLWPKLELSLQYDMAVATLSEDL 580

Query: 581 SKMKLLDDGQWVSRKVLGAVPHDIGICD--PWFEVNAYCLYDTARWKDLNPKFVLQVYRD 638
           ++ K L  G     K    +PHD+G  D  PW  +NAY ++DTA WKDLN KFVLQ+YRD
Sbjct: 581 TRRKYLMSGVIAPVKKKNVIPHDVGDPDDEPWLRINAYLIHDTADWKDLNLKFVLQIYRD 640

Query: 639 VVATGDKKFAKAVWPSVYVAMAYMDQFDRDGDGMIENDGFPDQTYDTWSVSGISAYSGGL 698
              TG++ F + +WP     M    +FD+D DG+IEN G+ DQTYD W  +G SAY GGL
Sbjct: 641 FYLTGNESFLRDMWPVCQAVMESEMKFDKDQDGLIENGGYADQTYDGWITTGPSAYCGGL 700

Query: 699 WVAALQAASALAREVGDRGSEDYFLFKFQKAKVVYEK-LWNGSYFNYDNSGSSQSSSIQA 757
           W+AA+     +A   G +  ++ FL    + +  YE+ LWNG Y+NYD+S   QS  I +
Sbjct: 701 WLAAVAVMVQMAAVCGAQDVQEKFLSILSRGREAYERLLWNGRYYNYDSSSQPQSRIIMS 760

Query: 758 DQLAGQWYARACGL----LPIVDEDKARSALKKVYNYNVLKVMGGKRGAVNGMLPDGRVD 813
           DQ AGQW+ RACGL      +        AL+ ++  NV    GG  GAVNGM PDG  D
Sbjct: 761 DQCAGQWFLRACGLGKGDTEVFPSAHVIRALQTIFELNVQGFAGGAMGAVNGMHPDGVPD 820

Query: 814 MSSMQSREIWSGVTYAVAASMIHEDLADIGFQTACGIYEAAWSGTGLGYAFQTPEAWNTD 873
            SS+QS E+W GV Y +AA+MI E L   GF TA G Y   W    LG AFQTPEA+   
Sbjct: 821 TSSVQSDEVWVGVVYGLAATMIQEGLVQEGFLTAEGCYRTVWE--RLGMAFQTPEAYCQR 878

Query: 874 DQYRSLCYMRPLAIWAMQWALTR 896
             +RSL YMRPL+IWAMQ AL +
Sbjct: 879 RVFRSLAYMRPLSIWAMQLALQQ 901


>gi|24308251|ref|NP_065995.1| non-lysosomal glucosylceramidase [Homo sapiens]
 gi|143018392|sp|Q9HCG7.2|GBA2_HUMAN RecName: Full=Non-lysosomal glucosylceramidase; Short=NLGase;
           AltName: Full=Beta-glucocerebrosidase 2;
           Short=Beta-glucosidase 2; AltName:
           Full=Glucosylceramidase 2
 gi|15030210|gb|AAH11363.1| Glucosidase, beta (bile acid) 2 [Homo sapiens]
 gi|16215453|emb|CAC83792.1| bile acid beta-glucosidase [Homo sapiens]
 gi|21739906|emb|CAD38976.1| hypothetical protein [Homo sapiens]
 gi|119578752|gb|EAW58348.1| glucosidase, beta (bile acid) 2, isoform CRA_d [Homo sapiens]
 gi|119578753|gb|EAW58349.1| glucosidase, beta (bile acid) 2, isoform CRA_d [Homo sapiens]
          Length = 927

 Score =  561 bits (1446), Expect = e-157,   Method: Compositional matrix adjust.
 Identities = 333/831 (40%), Positives = 459/831 (55%), Gaps = 87/831 (10%)

Query: 72  REEAAKGKRAFIDPFIKRHLTSSHGVPLGGVGSGSIGRSYRGEFQRWQIFPRVCEDKPVL 131
           R+   + K  FID      L   +G PLGG+G G+I R +RG+F RWQ+ P + + + V+
Sbjct: 127 RKTHVEKKTPFIDMINSVPLRQIYGCPLGGIGGGTITRGWRGQFCRWQLNPGMYQHRTVI 186

Query: 132 ANQFSVFVSRSNGQKYSSVLCPKTPEVLKDTTAAGIGSWDWNLKGDKSTYHALYPRAWTV 191
           A+QF+V + R     Y  VL  + P VL+        SW+W L G  + YHALYPRAWTV
Sbjct: 187 ADQFTVCLRREGQTVYQQVLSLERPSVLR--------SWNWGLCGYFAFYHALYPRAWTV 238

Query: 192 HEGEPDPELRIVCRQISPIIPHNYKESSYPVSVFTYTIYNSGKTSADITLLFTWTNSVGG 251
           ++  P   + + CRQI+PI+PH+Y++SS PV VF + + N G  + D++++F+  N +GG
Sbjct: 239 YQ-LPGQNVTLTCRQITPILPHDYQDSSLPVGVFVWDVENEGDEALDVSIMFSMRNGLGG 297

Query: 252 DSEFTGQHYNSK---TKMNDGVHAVLLHHRTSHQLP-PVTFALAAQETDGVHVSLCPHFV 307
             +  G  +N      +  + V  +LLHH T   LP P T A+AA+ T    V+      
Sbjct: 298 GDDAPGGLWNEPFCLERSGETVRGLLLHHPT---LPNPYTMAVAARVTAATTVTH----- 349

Query: 308 ISGNSLGLTAKDMWHEIKEHGSFDRLNSMETSVTSEPGSSIGAAIAASVTVPPDSEGQVT 367
           I+      T + +W ++ + G  D  +    S  ++ G  I  A+  S  + P  + ++ 
Sbjct: 350 ITAFDPDSTGQQVWQDLLQDGQLD--SPTGQSTPTQKGVGIAGAVCVSSKLRPRGQCRLE 407

Query: 368 FSLAWDCPEVNF-MSGKTYYRRYTKFYGTHQNAAANIARDAILEHGSWELQIEAWQRPIL 426
           FSLAWD P + F   G+ +YRRYT+F+G   +AA  ++  A+  +  WE +I AWQ P+L
Sbjct: 408 FSLAWDMPRIMFGAKGQVHYRRYTRFFGQDGDAAPALSHYALCRYAEWEERISAWQSPVL 467

Query: 427 EDKRLPEWYPITLFNELYYLNAGGAVWTDGSPPVHSLVTIGHRKFSLDWSQSDLKRIVDV 486
           +D+ LP WY   LFNELY+L  GG VW +                               
Sbjct: 468 DDRSLPAWYKSALFNELYFLADGGTVWLE------------------------------- 496

Query: 487 PNQNDTAVNILERMSSILEQIYTPVALNSSFGTNL--LQDGEENIGQFLYLEGIEYLMWN 544
                           +LE      +L    G N+  L+    + G+F YLEG EY M+N
Sbjct: 497 ----------------VLED-----SLPEELGRNMCHLRPTLRDYGRFGYLEGQEYRMYN 535

Query: 545 TYDVHFYSSFALIMLFPKIQLSIQRDFAAAVMMHDPSKMKLLDDGQWVSRKVLGAVPHDI 604
           TYDVHFY+SFALIML+PK++LS+Q D A A +  D ++ + L  G     K    +PHDI
Sbjct: 536 TYDVHFYASFALIMLWPKLELSLQYDMALATLREDLTRRRYLMSGVMAPVKRRNVIPHDI 595

Query: 605 GICD--PWFEVNAYCLYDTARWKDLNPKFVLQVYRDVVATGDKKFAKAVWPSVYVAMAYM 662
           G  D  PW  VNAY ++DTA WKDLN KFVLQVYRD   TGD+ F K +WP     M   
Sbjct: 596 GDPDDEPWLRVNAYLIHDTADWKDLNLKFVLQVYRDYYLTGDQNFLKDMWPVCLAVMESE 655

Query: 663 DQFDRDGDGMIENDGFPDQTYDTWSVSGISAYSGGLWVAALQAASALAREVGDRGSEDYF 722
            +FD+D DG+IEN G+ DQTYD W  +G SAY GGLW+AA+     +A   G +  +D F
Sbjct: 656 MKFDKDHDGLIENGGYADQTYDGWVTTGPSAYCGGLWLAAVAVMVQMAALCGAQDIQDKF 715

Query: 723 LFKFQKAKVVYEK-LWNGSYFNYDNSGSSQSSSIQADQLAGQWYARACGL----LPIVDE 777
                + +  YE+ LWNG Y+NYD+S   QS S+ +DQ AGQW+ +ACGL      +   
Sbjct: 716 SSILSRGQEAYERLLWNGRYYNYDSSSRPQSRSVMSDQCAGQWFLKACGLGEGDTEVFPT 775

Query: 778 DKARSALKKVYNYNVLKVMGGKRGAVNGMLPDGRVDMSSMQSREIWSGVTYAVAASMIHE 837
                AL+ ++  NV    GG  GAVNGM P G  D SS+QS E+W GV Y +AA+MI E
Sbjct: 776 QHVVRALQTIFELNVQAFAGGAMGAVNGMQPHGVPDKSSVQSDEVWVGVVYGLAATMIQE 835

Query: 838 DLADIGFQTACGIYEAAWSGTGLGYAFQTPEAWNTDDQYRSLCYMRPLAIW 888
            L   GFQTA G Y   W   GL  AFQTPEA+     +RSL YMRPL+IW
Sbjct: 836 GLTWEGFQTAEGCYRTVWERLGL--AFQTPEAYCQQRVFRSLAYMRPLSIW 884


>gi|432110769|gb|ELK34246.1| Non-lysosomal glucosylceramidase [Myotis davidii]
          Length = 868

 Score =  561 bits (1446), Expect = e-157,   Method: Compositional matrix adjust.
 Identities = 328/832 (39%), Positives = 448/832 (53%), Gaps = 109/832 (13%)

Query: 79  KRAFIDPFIKRHLTSSHGVPLGGVGSGSIGRSYRGEFQRWQIFPRVCEDKPVLANQFSVF 138
           K+ FID      L   +G PLGG+G G+I R +RG+F RWQ+ P + + + V+A+QF+V 
Sbjct: 126 KKPFIDLINSVPLRQIYGCPLGGIGGGTITRGWRGQFCRWQLNPGMYQHRTVIADQFTVC 185

Query: 139 VSRSNGQKYSSVLCPKTPEVLKDTTAAGIGSWDWNLKGDKSTYHALYPRAWTVHEGEPDP 198
           + R     Y  VL  + P VL+        SW+W L G  + YHALYPRAWTV++  P  
Sbjct: 186 LRREGQTVYQQVLSVERPSVLR--------SWNWGLCGFFAFYHALYPRAWTVYQ-LPGQ 236

Query: 199 ELRIVCRQISPIIPHNYKESSYPVSVFTYTIYNSGKTSADITLLFTWTNSVGGDSEFTGQ 258
            + + CRQI+P++PH+Y++SS PV VF + + N G  + D++++F+  N +GG  + +G 
Sbjct: 237 NVTLTCRQITPVLPHDYQDSSLPVGVFVWDVENEGDEALDVSIMFSMRNGLGGGDDASGG 296

Query: 259 HYNSKTKMNDGVHAVLLHHRTSHQLPPVTFALAAQETDGVHVSLCPHFVISGNSLGLTAK 318
            +N            L H   +H        + A + DG                  T +
Sbjct: 297 LWNEPF--------CLEHTTVTH--------ITAFDPDG------------------TGQ 322

Query: 319 DMWHEIKEHGSFDRLNSMETSVTSEPGSSIGAAIAASVTVPPDSEGQVTFSLAWDCPEVN 378
            +W ++ + G  D  +    S  S+ G  I  A+  +  +PP    ++ FSLAWD P + 
Sbjct: 323 QVWQDLLQDGQLD--SPAGRSTLSQKGVGIAGAVCVTSKLPPRGRCRMEFSLAWDMPRIM 380

Query: 379 F-MSGKTYYRRYTKFYGTHQNAAANIARDAILEHGSWELQIEAWQRPILEDKRLPEWYPI 437
           F   G+ YYRRYT+F+G+  +AA  ++  A+  +  WE +I AWQ P+L+D+ LP WY  
Sbjct: 381 FGAQGQVYYRRYTRFFGSDGDAAPALSHYALCRYADWEERISAWQSPVLDDRSLPAWYKS 440

Query: 438 TLFNELYYLNAGGAVWTDGSPPVHSLVTIGHRKFSLDWSQSDLKRIVDVPNQNDTAVNIL 497
            LFNELY+L  GG VW                              ++VP          
Sbjct: 441 ALFNELYFLADGGTVW------------------------------LEVPED-------- 462

Query: 498 ERMSSILEQIYTPVALNSSFGTNLLQDGE--ENIGQFLYLEGIEYLMWNTYDVHFYSSFA 555
                         +L    G N+ Q        G+F YLEG EY M+NTYDVHFY+SFA
Sbjct: 463 --------------SLPEELGGNMCQLRPLLTEYGRFGYLEGQEYRMYNTYDVHFYASFA 508

Query: 556 LIMLFPKIQLSIQRDFAAAVMMHDPSKMKLLDDGQWVSRKVLGAVPHDIGICD--PWFEV 613
           LIML+PK++LS+Q D A A +  D +  + L  G     K    +PHDIG  D  PW  V
Sbjct: 509 LIMLWPKLELSLQYDMALATLGEDLTWRRYLMSGVMAPVKRRNVIPHDIGDPDDEPWLRV 568

Query: 614 NAYCLYDTARWKDLNPKFVLQVYRDVVATGDKKFAKAVWPSVYVAMAYMDQFDRDGDGMI 673
           NAY ++DTA WKDLN KFVLQVYRD   TGD+ F + +WP     M    +FD+D DG+I
Sbjct: 569 NAYVIHDTADWKDLNLKFVLQVYRDYYLTGDQGFLRDMWPVCLAVMESEMKFDKDQDGLI 628

Query: 674 ENDGFPDQTYDTWSVSGISAYSGGLWVAALQAASALAREVGDRGSEDYFLFKFQKAKVVY 733
           EN G+ DQTYD W  +G SAY GGLW+AA+     +A   G +  ++ F     + +  Y
Sbjct: 629 ENGGYADQTYDAWVTTGPSAYCGGLWLAAVAVMVQMAALCGTQDIQEKFSSILSRGQQAY 688

Query: 734 EK-LWNGSYFNYDNSGSSQSSSIQADQLAGQWYARACGL----LPIVDEDKARSALKKVY 788
           E+ LWNG Y+NYD S   QS SI +DQ AGQW+ RA GL      +        AL+ ++
Sbjct: 689 ERLLWNGRYYNYDCSSRPQSCSIMSDQCAGQWFLRASGLGEGETEVFPTQHVVRALQTIF 748

Query: 789 NYNVLKVMGGKRGAVNGMLPDGRVDMSSMQSREIWSGVTYAVAASMIHEDLADIGFQTAC 848
            +NV    GG  GAVNGM P G  D SS+QS E+W GV Y +AA+MI E L   GFQTA 
Sbjct: 749 EFNVQAFAGGTMGAVNGMQPQGVPDRSSVQSDEVWVGVVYGLAATMIQEGLTWEGFQTAE 808

Query: 849 GIYEAAWSGTGLGYAFQTPEAWNTDDQYRSLCYMRPLAIWAMQWALTRPKPK 900
           G Y   W   GLG  FQTPEA+     +RSL YMRPL+IWAMQ AL +   K
Sbjct: 809 GCYRTVWERLGLG--FQTPEAYCQKRVFRSLAYMRPLSIWAMQLALQQQHKK 858


>gi|397519481|ref|XP_003829887.1| PREDICTED: non-lysosomal glucosylceramidase isoform 1 [Pan
           paniscus]
          Length = 927

 Score =  561 bits (1445), Expect = e-157,   Method: Compositional matrix adjust.
 Identities = 334/831 (40%), Positives = 458/831 (55%), Gaps = 87/831 (10%)

Query: 72  REEAAKGKRAFIDPFIKRHLTSSHGVPLGGVGSGSIGRSYRGEFQRWQIFPRVCEDKPVL 131
           R+   + K  FID      L   +G PLGG+G G+I R +RG+F RWQ+ P + + + V+
Sbjct: 127 RKTQVEKKTPFIDMINSVPLRQIYGCPLGGIGGGTITRGWRGQFCRWQLNPGMYQHRTVI 186

Query: 132 ANQFSVFVSRSNGQKYSSVLCPKTPEVLKDTTAAGIGSWDWNLKGDKSTYHALYPRAWTV 191
           A+QF+V + R     Y  VL  + P VL+        SW+W L G  + YHALYPRAWTV
Sbjct: 187 ADQFTVCLRREGQTVYQQVLSLERPSVLR--------SWNWGLCGYFAFYHALYPRAWTV 238

Query: 192 HEGEPDPELRIVCRQISPIIPHNYKESSYPVSVFTYTIYNSGKTSADITLLFTWTNSVGG 251
           ++  P   + + CRQI+PI+PH+Y++SS PV VF + + N G  + D++++F+  N +GG
Sbjct: 239 YQ-LPGQNVTLTCRQITPILPHDYQDSSLPVGVFVWDVENEGDEALDVSIMFSMRNGLGG 297

Query: 252 DSEFTGQHYNSK---TKMNDGVHAVLLHHRTSHQLP-PVTFALAAQETDGVHVSLCPHFV 307
             +  G  +N      +  + V  +LLHH T   LP P T A+AA+ T    V+      
Sbjct: 298 GDDAPGGLWNEPFCLERSGETVRGLLLHHPT---LPNPYTMAVAARVTAATTVTH----- 349

Query: 308 ISGNSLGLTAKDMWHEIKEHGSFDRLNSMETSVTSEPGSSIGAAIAASVTVPPDSEGQVT 367
           I+      T + +W ++ + G  D  +    S  ++ G  I  A+  S  + P  + ++ 
Sbjct: 350 ITAFDPDSTGQQVWQDLLQDGQLD--SPTGQSTPTQKGVGIAGAVCVSSKLRPRGQCRLE 407

Query: 368 FSLAWDCPEVNF-MSGKTYYRRYTKFYGTHQNAAANIARDAILEHGSWELQIEAWQRPIL 426
           FSLAWD P + F   G+ +YRRYT+F+G   +AA  ++  A+  +  WE +I AWQ P+L
Sbjct: 408 FSLAWDMPRIMFGAKGQVHYRRYTRFFGQDGDAAPALSHYALCRYTEWEERISAWQSPVL 467

Query: 427 EDKRLPEWYPITLFNELYYLNAGGAVWTDGSPPVHSLVTIGHRKFSLDWSQSDLKRIVDV 486
           +D+ LP WY   LFNELY+L  GG VW +                               
Sbjct: 468 DDRSLPAWYKSALFNELYFLADGGTVWLE------------------------------- 496

Query: 487 PNQNDTAVNILERMSSILEQIYTPVALNSSFGTNL--LQDGEENIGQFLYLEGIEYLMWN 544
                           +LE      +L    G N+  L     + G+F YLEG EY M+N
Sbjct: 497 ----------------VLED-----SLPEELGRNMCHLHPTLRDYGRFGYLEGQEYRMYN 535

Query: 545 TYDVHFYSSFALIMLFPKIQLSIQRDFAAAVMMHDPSKMKLLDDGQWVSRKVLGAVPHDI 604
           TYDVHFY+SFALIML+PK++LS+Q D A A +  D ++ + L  G     K    +PHDI
Sbjct: 536 TYDVHFYASFALIMLWPKLELSLQYDMALATLREDLTRRRYLMSGVMAPVKRRNVIPHDI 595

Query: 605 GICD--PWFEVNAYCLYDTARWKDLNPKFVLQVYRDVVATGDKKFAKAVWPSVYVAMAYM 662
           G  D  PW  VNAY ++DTA WKDLN KFVLQVYRD   TGD+ F K +WP     M   
Sbjct: 596 GDPDDEPWLRVNAYLIHDTADWKDLNLKFVLQVYRDYYLTGDQNFLKDMWPVCLAVMESE 655

Query: 663 DQFDRDGDGMIENDGFPDQTYDTWSVSGISAYSGGLWVAALQAASALAREVGDRGSEDYF 722
            +FD+D DG+IEN G+ DQTYD W  +G SAY GGLW+AA+     +A   G +  +D F
Sbjct: 656 MKFDKDHDGLIENGGYADQTYDGWVTTGPSAYCGGLWLAAVAVMVQMAALCGAQDIQDKF 715

Query: 723 LFKFQKAKVVYEK-LWNGSYFNYDNSGSSQSSSIQADQLAGQWYARACGL----LPIVDE 777
                + +  YE+ LWNG Y+NYD+S   QS S+ +DQ AGQW+ +ACGL      +   
Sbjct: 716 SSILSRGQEAYERLLWNGRYYNYDSSSRPQSRSVMSDQCAGQWFLKACGLGEGDTEVFPT 775

Query: 778 DKARSALKKVYNYNVLKVMGGKRGAVNGMLPDGRVDMSSMQSREIWSGVTYAVAASMIHE 837
                AL+ V+  NV    GG  GAVNGM P G  D SS+QS E+W GV Y +AA+MI E
Sbjct: 776 QHVVRALQTVFELNVQAFAGGAMGAVNGMQPHGVPDKSSVQSDEVWVGVVYGLAATMIQE 835

Query: 838 DLADIGFQTACGIYEAAWSGTGLGYAFQTPEAWNTDDQYRSLCYMRPLAIW 888
            L   GFQTA G Y   W   GL  AFQTPEA+     +RSL YMRPL+IW
Sbjct: 836 GLTWEGFQTAEGCYRTVWERLGL--AFQTPEAYCQQRVFRSLAYMRPLSIW 884


>gi|14042883|dbj|BAB55430.1| unnamed protein product [Homo sapiens]
          Length = 927

 Score =  561 bits (1445), Expect = e-157,   Method: Compositional matrix adjust.
 Identities = 332/831 (39%), Positives = 459/831 (55%), Gaps = 87/831 (10%)

Query: 72  REEAAKGKRAFIDPFIKRHLTSSHGVPLGGVGSGSIGRSYRGEFQRWQIFPRVCEDKPVL 131
           R+   + K  FID      L   +G PLGG+G G+I R +RG+F RWQ+ P + + + V+
Sbjct: 127 RKTHVEKKTPFIDMINSVPLRQIYGCPLGGIGGGTITRGWRGQFCRWQLNPGMYQHRTVI 186

Query: 132 ANQFSVFVSRSNGQKYSSVLCPKTPEVLKDTTAAGIGSWDWNLKGDKSTYHALYPRAWTV 191
           A+QF+V + R     Y  VL  + P VL+        SW+W L+G  + YHALYPRAWTV
Sbjct: 187 ADQFTVCLRREGQTVYQQVLSLERPSVLR--------SWNWGLRGYFAFYHALYPRAWTV 238

Query: 192 HEGEPDPELRIVCRQISPIIPHNYKESSYPVSVFTYTIYNSGKTSADITLLFTWTNSVGG 251
           ++  P   + + CRQI+PI+PH+Y++SS PV VF + + N G  + D++++F+  N +GG
Sbjct: 239 YQ-LPGQNVTLTCRQITPILPHDYQDSSLPVGVFVWDVENEGDEALDVSIMFSMRNGLGG 297

Query: 252 DSEFTGQHYNSK---TKMNDGVHAVLLHHRTSHQLP-PVTFALAAQETDGVHVSLCPHFV 307
             +  G  +N      +  + V  +LLHH T   LP P T A+AA+ T    V+      
Sbjct: 298 GDDAPGGLWNEPFCLERSGETVRGLLLHHPT---LPNPYTMAVAARVTAATTVTH----- 349

Query: 308 ISGNSLGLTAKDMWHEIKEHGSFDRLNSMETSVTSEPGSSIGAAIAASVTVPPDSEGQVT 367
           I+      T + +W ++ + G  D  +    S  ++ G  I  A+  S  + P  + ++ 
Sbjct: 350 ITAFDPDSTGQQVWQDLLQDGQLD--SPTGQSTPTQKGVGIAGAVCVSSKLRPRGQCRLE 407

Query: 368 FSLAWDCPEVNF-MSGKTYYRRYTKFYGTHQNAAANIARDAILEHGSWELQIEAWQRPIL 426
           FSLAWD P + F   G+ +YRRYT+F+G   +AA  ++  A+  +  WE +I AWQ P+L
Sbjct: 408 FSLAWDMPRIMFGAKGQVHYRRYTRFFGQDGDAAPALSHYALCRYAEWEERISAWQSPVL 467

Query: 427 EDKRLPEWYPITLFNELYYLNAGGAVWTDGSPPVHSLVTIGHRKFSLDWSQSDLKRIVDV 486
           +D+ LP WY   LF ELY+L  GG VW +                               
Sbjct: 468 DDRSLPAWYKSALFKELYFLADGGTVWLE------------------------------- 496

Query: 487 PNQNDTAVNILERMSSILEQIYTPVALNSSFGTNL--LQDGEENIGQFLYLEGIEYLMWN 544
                           +LE      +L    G N+  L+    + G+F YLEG EY M+N
Sbjct: 497 ----------------VLED-----SLPEELGRNMCHLRPTLRDYGRFGYLEGQEYRMYN 535

Query: 545 TYDVHFYSSFALIMLFPKIQLSIQRDFAAAVMMHDPSKMKLLDDGQWVSRKVLGAVPHDI 604
           TYDVHFY+SFALIML+PK++LS+Q D A A +  D ++ + L  G     K    +PHDI
Sbjct: 536 TYDVHFYASFALIMLWPKLELSLQYDMALATLREDLTRRRYLMSGVMAPVKRRNVIPHDI 595

Query: 605 GICD--PWFEVNAYCLYDTARWKDLNPKFVLQVYRDVVATGDKKFAKAVWPSVYVAMAYM 662
           G  D  PW  VNAY ++DTA WKDLN KFVLQVYRD   TGD+ F K +WP     M   
Sbjct: 596 GDPDDEPWLRVNAYLIHDTADWKDLNLKFVLQVYRDYYLTGDQNFLKDMWPVCLAVMESE 655

Query: 663 DQFDRDGDGMIENDGFPDQTYDTWSVSGISAYSGGLWVAALQAASALAREVGDRGSEDYF 722
            +FD+D DG+IEN G+ DQTYD W  +G SAY GGLW+AA+     +A   G +  +D F
Sbjct: 656 MKFDKDHDGLIENGGYADQTYDGWVTTGPSAYCGGLWLAAVAVMVQMAALCGAQDIQDKF 715

Query: 723 LFKFQKAKVVYEK-LWNGSYFNYDNSGSSQSSSIQADQLAGQWYARACGL----LPIVDE 777
                + +  YE+ LWNG Y+NYD+S   QS S+ +DQ AGQW+ +ACGL      +   
Sbjct: 716 SSILSRGQEAYERLLWNGRYYNYDSSSRPQSRSVMSDQCAGQWFLKACGLGEGDTEVFPT 775

Query: 778 DKARSALKKVYNYNVLKVMGGKRGAVNGMLPDGRVDMSSMQSREIWSGVTYAVAASMIHE 837
                AL+ ++  NV    GG  GAVNGM P G  D SS+QS E+W GV Y +AA+MI E
Sbjct: 776 QHVVRALQTIFELNVQAFAGGAMGAVNGMQPHGVPDKSSVQSDEVWVGVVYGLAATMIQE 835

Query: 838 DLADIGFQTACGIYEAAWSGTGLGYAFQTPEAWNTDDQYRSLCYMRPLAIW 888
            L   GFQTA G Y   W   GL  AFQTPEA+     +RSL YMRPL+IW
Sbjct: 836 GLTWEGFQTAEGCYRTVWERLGL--AFQTPEAYCQQRVFRSLAYMRPLSIW 884


>gi|55633173|ref|XP_520565.1| PREDICTED: non-lysosomal glucosylceramidase isoform 6 [Pan
           troglodytes]
 gi|410227830|gb|JAA11134.1| talin 1 [Pan troglodytes]
          Length = 927

 Score =  560 bits (1444), Expect = e-156,   Method: Compositional matrix adjust.
 Identities = 333/831 (40%), Positives = 458/831 (55%), Gaps = 87/831 (10%)

Query: 72  REEAAKGKRAFIDPFIKRHLTSSHGVPLGGVGSGSIGRSYRGEFQRWQIFPRVCEDKPVL 131
           R+   + K  FID      L   +G PLGG+G G+I R +RG+F RWQ+ P + + + V+
Sbjct: 127 RKTQVEKKTPFIDMINSVPLRQIYGCPLGGIGGGTITRGWRGQFCRWQLNPGMYQHRTVI 186

Query: 132 ANQFSVFVSRSNGQKYSSVLCPKTPEVLKDTTAAGIGSWDWNLKGDKSTYHALYPRAWTV 191
           A+QF+V + R     Y  VL  + P VL+        SW+W L G  + YHALYPRAWTV
Sbjct: 187 ADQFTVCLRREGQTVYQQVLSLERPSVLR--------SWNWGLCGYFAFYHALYPRAWTV 238

Query: 192 HEGEPDPELRIVCRQISPIIPHNYKESSYPVSVFTYTIYNSGKTSADITLLFTWTNSVGG 251
           ++  P   + + CRQI+PI+PH+Y++SS PV VF + + N G  + D++++F+  N +GG
Sbjct: 239 YQ-LPGQNVTLTCRQITPILPHDYQDSSLPVGVFVWDVENEGDEALDVSIMFSMRNGLGG 297

Query: 252 DSEFTGQHYNSK---TKMNDGVHAVLLHHRTSHQLP-PVTFALAAQETDGVHVSLCPHFV 307
             +  G  +N      +  + V  +LLHH T   LP P T A+AA+ T    V+      
Sbjct: 298 GDDAPGGLWNEPFCLERSGETVRGLLLHHPT---LPNPYTMAVAARVTAATTVTH----- 349

Query: 308 ISGNSLGLTAKDMWHEIKEHGSFDRLNSMETSVTSEPGSSIGAAIAASVTVPPDSEGQVT 367
           I+      T + +W ++ + G  D  +    S  +  G  I  A+  S  + P  + ++ 
Sbjct: 350 ITAFDPDSTGQQVWQDLLQDGQLD--SPTGQSTPTHKGVGIAGAVCVSSKLRPRGQCRLE 407

Query: 368 FSLAWDCPEVNF-MSGKTYYRRYTKFYGTHQNAAANIARDAILEHGSWELQIEAWQRPIL 426
           FSLAWD P + F   G+ +YRRYT+F+G   +AA  ++  A+  +  WE +I AWQ P+L
Sbjct: 408 FSLAWDMPRIMFGAKGQVHYRRYTRFFGQDGDAAPALSHYALCRYTEWEERISAWQSPVL 467

Query: 427 EDKRLPEWYPITLFNELYYLNAGGAVWTDGSPPVHSLVTIGHRKFSLDWSQSDLKRIVDV 486
           +D+ LP WY   LFNELY+L  GG VW +                               
Sbjct: 468 DDRSLPAWYKSALFNELYFLADGGTVWLE------------------------------- 496

Query: 487 PNQNDTAVNILERMSSILEQIYTPVALNSSFGTNL--LQDGEENIGQFLYLEGIEYLMWN 544
                           +LE      +L    G N+  L+    + G+F YLEG EY M+N
Sbjct: 497 ----------------VLED-----SLPEELGRNMCHLRPTLRDYGRFGYLEGQEYRMYN 535

Query: 545 TYDVHFYSSFALIMLFPKIQLSIQRDFAAAVMMHDPSKMKLLDDGQWVSRKVLGAVPHDI 604
           TYDVHFY+SFALIML+PK++LS+Q D A A +  D ++ + L  G     K    +PHDI
Sbjct: 536 TYDVHFYASFALIMLWPKLELSLQYDMALATLREDLTRRRYLMSGVMAPVKRRNVIPHDI 595

Query: 605 GICD--PWFEVNAYCLYDTARWKDLNPKFVLQVYRDVVATGDKKFAKAVWPSVYVAMAYM 662
           G  D  PW  VNAY ++DTA WKDLN KFVLQVYRD   TGD+ F K +WP     M   
Sbjct: 596 GDPDDEPWLRVNAYLIHDTADWKDLNLKFVLQVYRDYYLTGDQNFLKDMWPVCLAVMESE 655

Query: 663 DQFDRDGDGMIENDGFPDQTYDTWSVSGISAYSGGLWVAALQAASALAREVGDRGSEDYF 722
            +FD+D DG+IEN G+ DQTYD W  +G SAY GGLW+AA+     +A   G +  +D F
Sbjct: 656 MKFDKDHDGLIENGGYADQTYDGWVTTGPSAYCGGLWLAAVAVMVQMAALCGAQDIQDKF 715

Query: 723 LFKFQKAKVVYEK-LWNGSYFNYDNSGSSQSSSIQADQLAGQWYARACGL----LPIVDE 777
                + +  YE+ LWNG Y+NYD+S   QS S+ +DQ AGQW+ +ACGL      +   
Sbjct: 716 SSILSRGQEAYERLLWNGRYYNYDSSSRPQSRSVMSDQCAGQWFLKACGLGEGDTEVFPT 775

Query: 778 DKARSALKKVYNYNVLKVMGGKRGAVNGMLPDGRVDMSSMQSREIWSGVTYAVAASMIHE 837
                AL+ ++  NV    GG  GAVNGM P G  D SS+QS E+W GV Y +AA+MI E
Sbjct: 776 QHVVRALQTIFELNVQAFAGGAMGAVNGMQPHGVPDKSSVQSDEVWVGVVYGLAATMIQE 835

Query: 838 DLADIGFQTACGIYEAAWSGTGLGYAFQTPEAWNTDDQYRSLCYMRPLAIW 888
            L   GFQTA G Y   W   GL  AFQTPEA+     +RSL YMRPL+IW
Sbjct: 836 GLTWEGFQTAEGCYRTVWERLGL--AFQTPEAYCQQRVFRSLAYMRPLSIW 884


>gi|413920604|gb|AFW60536.1| hypothetical protein ZEAMMB73_659971 [Zea mays]
          Length = 459

 Score =  558 bits (1438), Expect = e-156,   Method: Compositional matrix adjust.
 Identities = 275/442 (62%), Positives = 333/442 (75%), Gaps = 6/442 (1%)

Query: 19  LNSSFDKVDPAKPASLTWQRKLSTGEIPLSQFTLNWKETVQLAPIGVRILCLIREEAAKG 78
           +N   +KV P +   LTW+ KL+     L  F L W+E +QLA IG+R+   I EE +KG
Sbjct: 22  MNGDEEKVYPGQLPVLTWEHKLTHVRHDLPPFRLTWREIMQLAGIGLRLSRHILEETSKG 81

Query: 79  KRAFIDPFIKRHLTSSHGVPLGGVGSGSIGRSYRGEFQRWQIFPRVCEDKPVLANQFSVF 138
           + A IDP  KR   S  GVPLGG    SIGRSY+G+FQRWQ+FP  CED  VLANQFS F
Sbjct: 82  RIAVIDPMKKRAARSGQGVPLGG----SIGRSYKGDFQRWQLFPGSCEDNAVLANQFSAF 137

Query: 139 VSRSNGQKYSSVLCPKTPEVLKDTTAAGIGSWDWNLKGDKSTYHALYPRAWTVHEGEPDP 198
           +SR +G+KYS+VL P  P++ K +  +GIGSWDWN+ G +STYHALYPRAWTV++GEPDP
Sbjct: 138 ISRQDGKKYSTVLHPGKPDLPKGSDISGIGSWDWNMSGQQSTYHALYPRAWTVYDGEPDP 197

Query: 199 ELRIVCRQISPIIPHNYKESSYPVSVFTYTIYNSGKTSADITLLFTWTNSVGGDSEFTGQ 258
           EL IVCRQISPIIPHNY++SSYPV+VFT+T+ NSG T+AD+TLLFTW NSVGG SE TG 
Sbjct: 198 ELNIVCRQISPIIPHNYQQSSYPVAVFTFTVTNSGHTAADVTLLFTWANSVGGKSELTGY 257

Query: 259 HYNSKTKMNDGVHAVLLHHRTSHQLPPVTFALAAQETDGVHVSLCPHFVISGNSLG--LT 316
           H NS     DGVH +LLHHRT+   P VTF +AAQE + + +S CP+FVISG+S     T
Sbjct: 258 HSNSSMIEKDGVHGILLHHRTADGQPHVTFVIAAQEKEDILISECPYFVISGSSASDEFT 317

Query: 317 AKDMWHEIKEHGSFDRLNSMETSVTSEPGSSIGAAIAASVTVPPDSEGQVTFSLAWDCPE 376
           AKDMW+ +KEHGSFD L+ ++TS+ S PGSSIGAAIAASV + P +   ++FSLAW CPE
Sbjct: 318 AKDMWNSVKEHGSFDHLDPIKTSMCSRPGSSIGAAIAASVKLAPKATQDISFSLAWACPE 377

Query: 377 VNFMSGKTYYRRYTKFYGTHQNAAANIARDAILEHGSWELQIEAWQRPILEDKRLPEWYP 436
           V F SGKTY+RRYTKFYGT  +AAA +A DAILEH SWE QIE WQ PIL+D+R P WYP
Sbjct: 378 VKFSSGKTYHRRYTKFYGTDGDAAAALAHDAILEHASWERQIEEWQDPILQDERFPAWYP 437

Query: 437 ITLFNELYYLNAGGAVWTDGSP 458
           +TLFNELYYLNAGG +WT   P
Sbjct: 438 VTLFNELYYLNAGGTIWTGMHP 459


>gi|390458158|ref|XP_002743170.2| PREDICTED: non-lysosomal glucosylceramidase [Callithrix jacchus]
          Length = 1065

 Score =  557 bits (1435), Expect = e-155,   Method: Compositional matrix adjust.
 Identities = 340/858 (39%), Positives = 473/858 (55%), Gaps = 86/858 (10%)

Query: 72   REEAAKGKRAFIDPFIKRHLTSSHGVPLGGVGSGSIGRSYRGEFQRWQIFPRVCEDKPVL 131
            R+   + K  FID      L   +G PLGG+G G+I R +RG+F RWQ+ P +   + V+
Sbjct: 269  RKTQVEKKTPFIDMINSVPLRQIYGCPLGGIGGGTITRGWRGQFCRWQLNPGMYRHQTVI 328

Query: 132  ANQFSVFVSRSNGQKYSSVLCPKTPEVLKDTTAAGIGSWDWNLKGDKSTYHALYPRAWTV 191
            A+QF+V + R     Y  VL  + P VL+        SW+W L G  + YHALYPRAWTV
Sbjct: 329  ADQFTVCLRREGQTVYQQVLSLERPSVLR--------SWNWGLCGYFAFYHALYPRAWTV 380

Query: 192  HEGEPDPELRIVCRQISPIIPHNYKESSYPVSVFTYTIYNSGKTSADITLLFTWTNSVGG 251
            ++  P   + + CRQI+PI+PH+Y++SS PV VF + + N G    D++++F+  N +GG
Sbjct: 381  YQ-LPGQNVTLTCRQITPILPHDYQDSSLPVGVFVWDVENEGDEDLDVSIMFSMRNGLGG 439

Query: 252  DSEFTGQHYNSKTKM-NDG--VHAVLLHHRTSHQLP-PVTFALAAQETDGVHVSLCPHFV 307
              +  G  +N    +  DG  V  +LLHH T   LP P T A+AA+ T    V+    F 
Sbjct: 440  GDDAPGGLWNEPFHLERDGKTVQGLLLHHPT---LPNPYTMAVAARVTADTTVTHITAFD 496

Query: 308  ISGNSLGLTAKDMWHEIKEHGSFDRLNSMETSVTSEPGSSIGAAIAASVTVPPDSEGQVT 367
              G     T + +W ++ + G  D  +    S  ++ G  +  A+  S  + P  + ++ 
Sbjct: 497  PDG-----TGQQVWQDLLQDGQLD--SPAGQSTPTQKGVGVAGAVCVSSKLRPRGQCRLE 549

Query: 368  FSLAWDCPEVNF-MSGKTYYRRYTKFYGTHQNAAANIARDAILEHGSWELQIEAWQRPIL 426
            FSLAWD P + F   G+ +YRRYT+F+G   +AA  ++  A+ ++  WE +I AWQ P+L
Sbjct: 550  FSLAWDMPRIMFGAKGQVHYRRYTRFFGQDGDAAPALSHYALCQYTEWEERISAWQNPVL 609

Query: 427  EDKRLPEWYPITLFNELYYLNAGGAVWTDGSPPVHSLVTIGHRKFSLDWSQSDLKRIVDV 486
            +D+ LP WY   LFNELY+L  GG VW +                          R   +
Sbjct: 610  DDRLLPAWYKSALFNELYFLADGGTVWLE-------------------------VREDSL 644

Query: 487  PNQNDTAVNILERMSSILEQIYTPVALNSSFGTNLLQDGEENIGQFLYLEGIEYLMWNTY 546
            P + D   N+     ++ +           +G     +G+E            Y M+NTY
Sbjct: 645  PQELDG--NMCHLRPTLRD-----------YGRFGYLEGQE------------YRMYNTY 679

Query: 547  DVHFYSSFALIMLFPKIQLSIQRDFAAAVMMHDPSKMKLLDDGQWVSRKVLGAVPHDIGI 606
            DVHFY+SFALIML+PK++LS+Q D A A +  D ++ + L  G     K    +PHDIG 
Sbjct: 680  DVHFYASFALIMLWPKLELSLQYDMALATLREDLTRRRYLMSGVMAPVKRRNVIPHDIGD 739

Query: 607  CD--PWFEVNAYCLYDTARWKDLNPKFVLQVYRDVVATGDKKFAKAVWPSVYVAMAYMDQ 664
             D  PW  VNAY ++DTA WKDLN KFVLQVYRD   TGD+ F K +WP     M    +
Sbjct: 740  PDDEPWLRVNAYLIHDTADWKDLNLKFVLQVYRDYYLTGDQNFLKDMWPVCLAVMESEMK 799

Query: 665  FDRDGDGMIENDGFPDQTYDTWSVSGISAYSGGLWVAALQAASALAREVGDRGSEDYFLF 724
            FD+D DG+IEN G+ DQTYD W  +G SAY GGLW+AA+     +A   G +  +D F  
Sbjct: 800  FDKDHDGLIENGGYADQTYDGWVTTGPSAYCGGLWLAAVAVMVQMAAVCGAQDIQDKFSS 859

Query: 725  KFQKAKVVYEK-LWNGSYFNYDNSGSSQSSSIQADQLAGQWYARACGL----LPIVDEDK 779
               + +  YE+ LWNG Y+NYD+S   QS S+ +DQ AGQW+ +ACGL      +     
Sbjct: 860  ILSRGQEAYERLLWNGRYYNYDSSSQPQSRSVMSDQCAGQWFLKACGLGEGDTEVFPTPH 919

Query: 780  ARSALKKVYNYNVLKVMGGKRGAVNGMLPDGRVDMSSMQSREIWSGVTYAVAASMIHEDL 839
               AL+ ++  NV    GG  GAVNGM P G  D SS+QS E+W GV Y +AA+MI E L
Sbjct: 920  VVRALQTIFELNVQAFAGGAMGAVNGMQPHGVPDKSSVQSDEVWVGVVYGLAATMIQEGL 979

Query: 840  ADIGFQTACGIYEAAWSGTGLGYAFQTPEAWNTDDQYRSLCYMRPLAIWAMQWALTRPKP 899
               GFQTA G Y   W   GL  AFQTPEA+     +RSL YMRPL+IWAMQ AL + + 
Sbjct: 980  TWEGFQTAEGCYRTVWERLGL--AFQTPEAYCQQRVFRSLAYMRPLSIWAMQLALQQQQH 1037

Query: 900  KTLEKQMKPEVTDESLLR 917
            K   K  +P+V  ++ LR
Sbjct: 1038 K---KASRPKVKQDTGLR 1052


>gi|221041110|dbj|BAH12232.1| unnamed protein product [Homo sapiens]
          Length = 933

 Score =  555 bits (1431), Expect = e-155,   Method: Compositional matrix adjust.
 Identities = 333/837 (39%), Positives = 457/837 (54%), Gaps = 93/837 (11%)

Query: 72  REEAAKGKRAFIDPFIKRHLTSSHGVPLGGVGSGSIGRSYRGEFQRWQIFP------RVC 125
           R+   + K  FID      L   +G PLGG+G G+I R +RG+F RWQ+ P       V 
Sbjct: 127 RKTHVEKKTPFIDMINSVPLRQIYGCPLGGIGGGTITRGWRGQFCRWQLNPGMYQHRTVI 186

Query: 126 EDKPVLANQFSVFVSRSNGQKYSSVLCPKTPEVLKDTTAAGIGSWDWNLKGDKSTYHALY 185
            D+P+   +F+V + R     Y  VL  + P VL+        SW+W L G  + YHALY
Sbjct: 187 ADQPICPLKFTVCLRRERQTVYQQVLSLERPSVLR--------SWNWGLCGYFAFYHALY 238

Query: 186 PRAWTVHEGEPDPELRIVCRQISPIIPHNYKESSYPVSVFTYTIYNSGKTSADITLLFTW 245
           PRAWTV++  P   + + CRQI+PI+PH+Y++SS PV VF + + N G  + D++++F+ 
Sbjct: 239 PRAWTVYQ-LPGQNVTLTCRQITPILPHDYQDSSLPVGVFVWDVENEGDEALDVSIMFSM 297

Query: 246 TNSVGGDSEFTGQHYNSK---TKMNDGVHAVLLHHRTSHQLP-PVTFALAAQETDGVHVS 301
            N +GG  +  G  +N      +  + V  +LLHH T   LP P T A+AA+ T    V+
Sbjct: 298 RNGLGGGDDAPGGLWNEPFCLERSGETVRGLLLHHPT---LPNPYTMAVAARVTAATTVT 354

Query: 302 LCPHFVISGNSLGLTAKDMWHEIKEHGSFDRLNSMETSVTSEPGSSIGAAIAASVTVPPD 361
                 I+      T + +W ++ + G  D  +    S  ++ G  I  A+  S  + P 
Sbjct: 355 H-----ITAFDPDSTGQQVWQDLLQDGQLD--SPTGQSTPTQKGVGIAGAVCVSSKLRPR 407

Query: 362 SEGQVTFSLAWDCPEVNF-MSGKTYYRRYTKFYGTHQNAAANIARDAILEHGSWELQIEA 420
            + ++ FSLAWD P + F   G+ +YRRYT+F+G   +AA  ++  A+  +  WE +I A
Sbjct: 408 GQCRLEFSLAWDMPRIMFGAKGQVHYRRYTRFFGQDGDAAPALSHYALCRYAEWEERISA 467

Query: 421 WQRPILEDKRLPEWYPITLFNELYYLNAGGAVWTDGSPPVHSLVTIGHRKFSLDWSQSDL 480
           WQ P+L+D+ LP WY   LFNELY+L  GG VW +                         
Sbjct: 468 WQSPVLDDRSLPAWYKSALFNELYFLADGGTVWLE------------------------- 502

Query: 481 KRIVDVPNQNDTAVNILERMSSILEQIYTPVALNSSFGTNL--LQDGEENIGQFLYLEGI 538
                                 +LE      +L    G N+  L+    + G+F YLEG 
Sbjct: 503 ----------------------VLED-----SLPEELGRNMCHLRPTLRDYGRFGYLEGQ 535

Query: 539 EYLMWNTYDVHFYSSFALIMLFPKIQLSIQRDFAAAVMMHDPSKMKLLDDGQWVSRKVLG 598
           EY M+NTYDVHFY+SFALIML+PK++LS+Q D A A +  D ++ + L  G     K   
Sbjct: 536 EYRMYNTYDVHFYASFALIMLWPKLELSLQYDMALATLREDLTRRRYLMSGVMAPVKRRN 595

Query: 599 AVPHDIGICD--PWFEVNAYCLYDTARWKDLNPKFVLQVYRDVVATGDKKFAKAVWPSVY 656
            +PHDIG  D  PW  VNAY ++DTA WKDLN KFVLQVYRD   TGD+ F K +WP   
Sbjct: 596 VIPHDIGDPDDEPWLRVNAYLIHDTADWKDLNLKFVLQVYRDYYLTGDQNFLKDMWPVCL 655

Query: 657 VAMAYMDQFDRDGDGMIENDGFPDQTYDTWSVSGISAYSGGLWVAALQAASALAREVGDR 716
             M    +FD+D DG+IEN G+ DQTYD W  +G SAY GGLW+AA+     +A   G +
Sbjct: 656 AVMESEMKFDKDHDGLIENGGYADQTYDGWVTTGPSAYCGGLWLAAVAVMVQMAALCGAQ 715

Query: 717 GSEDYFLFKFQKAKVVYEK-LWNGSYFNYDNSGSSQSSSIQADQLAGQWYARACGL---- 771
             +D F     + +  YE+ LWNG Y+NYD+S   QS S+ +DQ AGQW+ +ACGL    
Sbjct: 716 DIQDKFSSILSRGQEAYERLLWNGRYYNYDSSSRPQSRSVMSDQCAGQWFLKACGLGEGD 775

Query: 772 LPIVDEDKARSALKKVYNYNVLKVMGGKRGAVNGMLPDGRVDMSSMQSREIWSGVTYAVA 831
             +        AL+ ++  NV    GG  GAVNGM P G  D SS+QS E+W GV Y +A
Sbjct: 776 TEVFPTQHVVRALQTIFELNVQAFAGGAMGAVNGMQPHGVPDKSSVQSDEVWVGVVYGLA 835

Query: 832 ASMIHEDLADIGFQTACGIYEAAWSGTGLGYAFQTPEAWNTDDQYRSLCYMRPLAIW 888
           A+MI E L   GFQTA G Y   W   GL  AFQTPEA+     +RSL YMRPL+IW
Sbjct: 836 ATMIQEGLTWEGFQTAEGCYRTVWERLGL--AFQTPEAYCQQRVFRSLAYMRPLSIW 890


>gi|332831849|ref|XP_001167923.2| PREDICTED: non-lysosomal glucosylceramidase isoform 4 [Pan
           troglodytes]
          Length = 933

 Score =  555 bits (1429), Expect = e-155,   Method: Compositional matrix adjust.
 Identities = 333/837 (39%), Positives = 457/837 (54%), Gaps = 93/837 (11%)

Query: 72  REEAAKGKRAFIDPFIKRHLTSSHGVPLGGVGSGSIGRSYRGEFQRWQIFP------RVC 125
           R+   + K  FID      L   +G PLGG+G G+I R +RG+F RWQ+ P       V 
Sbjct: 127 RKTQVEKKTPFIDMINSVPLRQIYGCPLGGIGGGTITRGWRGQFCRWQLNPGMYQHRTVI 186

Query: 126 EDKPVLANQFSVFVSRSNGQKYSSVLCPKTPEVLKDTTAAGIGSWDWNLKGDKSTYHALY 185
            D+P+ + +F+V + R     Y  VL  + P VL+        SW+W L G  + YHALY
Sbjct: 187 ADQPICSLKFTVCLRREGQTVYQQVLSLERPSVLR--------SWNWGLCGYFAFYHALY 238

Query: 186 PRAWTVHEGEPDPELRIVCRQISPIIPHNYKESSYPVSVFTYTIYNSGKTSADITLLFTW 245
           PRAWTV++  P   + + CRQI+PI+PH+Y++SS PV VF + + N G  + D++++F+ 
Sbjct: 239 PRAWTVYQ-LPGQNVTLTCRQITPILPHDYQDSSLPVGVFVWDVENEGDEALDVSIMFSM 297

Query: 246 TNSVGGDSEFTGQHYNSK---TKMNDGVHAVLLHHRTSHQLP-PVTFALAAQETDGVHVS 301
            N +GG  +  G  +N      +  + V  +LLHH T   LP P T A+AA+ T    V+
Sbjct: 298 RNGLGGGDDAPGGLWNEPFCLERSGETVRGLLLHHPT---LPNPYTMAVAARVTAATTVT 354

Query: 302 LCPHFVISGNSLGLTAKDMWHEIKEHGSFDRLNSMETSVTSEPGSSIGAAIAASVTVPPD 361
                 I+      T + +W ++ + G  D  +    S  +  G  I  A+  S  + P 
Sbjct: 355 H-----ITAFDPDSTGQQVWQDLLQDGQLD--SPTGQSTPTHKGVGIAGAVCVSSKLRPR 407

Query: 362 SEGQVTFSLAWDCPEVNF-MSGKTYYRRYTKFYGTHQNAAANIARDAILEHGSWELQIEA 420
            + ++ FSLAWD P + F   G+ +YRRYT+F+G   +AA  ++  A+  +  WE +I A
Sbjct: 408 GQCRLEFSLAWDMPRIMFGAKGQVHYRRYTRFFGQDGDAAPALSHYALCRYTEWEERISA 467

Query: 421 WQRPILEDKRLPEWYPITLFNELYYLNAGGAVWTDGSPPVHSLVTIGHRKFSLDWSQSDL 480
           WQ P+L+D+ LP WY   LFNELY+L  GG VW +                         
Sbjct: 468 WQSPVLDDRSLPAWYKSALFNELYFLADGGTVWLE------------------------- 502

Query: 481 KRIVDVPNQNDTAVNILERMSSILEQIYTPVALNSSFGTNL--LQDGEENIGQFLYLEGI 538
                                 +LE      +L    G N+  L+    + G+F YLEG 
Sbjct: 503 ----------------------VLED-----SLPEELGRNMCHLRPTLRDYGRFGYLEGQ 535

Query: 539 EYLMWNTYDVHFYSSFALIMLFPKIQLSIQRDFAAAVMMHDPSKMKLLDDGQWVSRKVLG 598
           EY M+NTYDVHFY+SFALIML+PK++LS+Q D A A +  D ++ + L  G     K   
Sbjct: 536 EYRMYNTYDVHFYASFALIMLWPKLELSLQYDMALATLREDLTRRRYLMSGVMAPVKRRN 595

Query: 599 AVPHDIGICD--PWFEVNAYCLYDTARWKDLNPKFVLQVYRDVVATGDKKFAKAVWPSVY 656
            +PHDIG  D  PW  VNAY ++DTA WKDLN KFVLQVYRD   TGD+ F K +WP   
Sbjct: 596 VIPHDIGDPDDEPWLRVNAYLIHDTADWKDLNLKFVLQVYRDYYLTGDQNFLKDMWPVCL 655

Query: 657 VAMAYMDQFDRDGDGMIENDGFPDQTYDTWSVSGISAYSGGLWVAALQAASALAREVGDR 716
             M    +FD+D DG+IEN G+ DQTYD W  +G SAY GGLW+AA+     +A   G +
Sbjct: 656 AVMESEMKFDKDHDGLIENGGYADQTYDGWVTTGPSAYCGGLWLAAVAVMVQMAALCGAQ 715

Query: 717 GSEDYFLFKFQKAKVVYEK-LWNGSYFNYDNSGSSQSSSIQADQLAGQWYARACGL---- 771
             +D F     + +  YE+ LWNG Y+NYD+S   QS S+ +DQ AGQW+ +ACGL    
Sbjct: 716 DIQDKFSSILSRGQEAYERLLWNGRYYNYDSSSRPQSRSVMSDQCAGQWFLKACGLGEGD 775

Query: 772 LPIVDEDKARSALKKVYNYNVLKVMGGKRGAVNGMLPDGRVDMSSMQSREIWSGVTYAVA 831
             +        AL+ ++  NV    GG  GAVNGM P G  D SS+QS E+W GV Y +A
Sbjct: 776 TEVFPTQHVVRALQTIFELNVQAFAGGAMGAVNGMQPHGVPDKSSVQSDEVWVGVVYGLA 835

Query: 832 ASMIHEDLADIGFQTACGIYEAAWSGTGLGYAFQTPEAWNTDDQYRSLCYMRPLAIW 888
           A+MI E L   GFQTA G Y   W   GL  AFQTPEA+     +RSL YMRPL+IW
Sbjct: 836 ATMIQEGLTWEGFQTAEGCYRTVWERLGL--AFQTPEAYCQQRVFRSLAYMRPLSIW 890


>gi|397519483|ref|XP_003829888.1| PREDICTED: non-lysosomal glucosylceramidase isoform 2 [Pan
           paniscus]
          Length = 755

 Score =  536 bits (1382), Expect = e-149,   Method: Compositional matrix adjust.
 Identities = 312/779 (40%), Positives = 427/779 (54%), Gaps = 87/779 (11%)

Query: 124 VCEDKPVLANQFSVFVSRSNGQKYSSVLCPKTPEVLKDTTAAGIGSWDWNLKGDKSTYHA 183
           V  D+P+ + +F+V + R     Y  VL  + P VL+        SW+W L G  + YHA
Sbjct: 7   VIADQPICSLKFTVCLRREGQTVYQQVLSLERPSVLR--------SWNWGLCGYFAFYHA 58

Query: 184 LYPRAWTVHEGEPDPELRIVCRQISPIIPHNYKESSYPVSVFTYTIYNSGKTSADITLLF 243
           LYPRAWTV++  P   + + CRQI+PI+PH+Y++SS PV VF + + N G  + D++++F
Sbjct: 59  LYPRAWTVYQ-LPGQNVTLTCRQITPILPHDYQDSSLPVGVFVWDVENEGDEALDVSIMF 117

Query: 244 TWTNSVGGDSEFTGQHYNSK---TKMNDGVHAVLLHHRTSHQLP-PVTFALAAQETDGVH 299
           +  N +GG  +  G  +N      +  + V  +LLHH T   LP P T A+AA+ T    
Sbjct: 118 SMRNGLGGGDDAPGGLWNEPFCLERSGETVRGLLLHHPT---LPNPYTMAVAARVTAATT 174

Query: 300 VSLCPHFVISGNSLGLTAKDMWHEIKEHGSFDRLNSMETSVTSEPGSSIGAAIAASVTVP 359
           V+      I+      T + +W ++ + G  D  +    S  ++ G  I  A+  S  + 
Sbjct: 175 VTH-----ITAFDPDSTGQQVWQDLLQDGQLD--SPTGQSTPTQKGVGIAGAVCVSSKLR 227

Query: 360 PDSEGQVTFSLAWDCPEVNF-MSGKTYYRRYTKFYGTHQNAAANIARDAILEHGSWELQI 418
           P  + ++ FSLAWD P + F   G+ +YRRYT+F+G   +AA  ++  A+  +  WE +I
Sbjct: 228 PRGQCRLEFSLAWDMPRIMFGAKGQVHYRRYTRFFGQDGDAAPALSHYALCRYTEWEERI 287

Query: 419 EAWQRPILEDKRLPEWYPITLFNELYYLNAGGAVWTDGSPPVHSLVTIGHRKFSLDWSQS 478
            AWQ P+L+D+ LP WY   LFNELY+L  GG VW +                       
Sbjct: 288 SAWQSPVLDDRSLPAWYKSALFNELYFLADGGTVWLE----------------------- 324

Query: 479 DLKRIVDVPNQNDTAVNILERMSSILEQIYTPVALNSSFGTNL--LQDGEENIGQFLYLE 536
                                   +LE      +L    G N+  L     + G+F YLE
Sbjct: 325 ------------------------VLED-----SLPEELGRNMCHLHPTLRDYGRFGYLE 355

Query: 537 GIEYLMWNTYDVHFYSSFALIMLFPKIQLSIQRDFAAAVMMHDPSKMKLLDDGQWVSRKV 596
           G EY M+NTYDVHFY+SFALIML+PK++LS+Q D A A +  D ++ + L  G     K 
Sbjct: 356 GQEYRMYNTYDVHFYASFALIMLWPKLELSLQYDMALATLREDLTRRRYLMSGVMAPVKR 415

Query: 597 LGAVPHDIGICD--PWFEVNAYCLYDTARWKDLNPKFVLQVYRDVVATGDKKFAKAVWPS 654
              +PHDIG  D  PW  VNAY ++DTA WKDLN KFVLQVYRD   TGD+ F K +WP 
Sbjct: 416 RNVIPHDIGDPDDEPWLRVNAYLIHDTADWKDLNLKFVLQVYRDYYLTGDQNFLKDMWPV 475

Query: 655 VYVAMAYMDQFDRDGDGMIENDGFPDQTYDTWSVSGISAYSGGLWVAALQAASALAREVG 714
               M    +FD+D DG+IEN G+ DQTYD W  +G SAY GGLW+AA+     +A   G
Sbjct: 476 CLAVMESEMKFDKDHDGLIENGGYADQTYDGWVTTGPSAYCGGLWLAAVAVMVQMAALCG 535

Query: 715 DRGSEDYFLFKFQKAKVVYEK-LWNGSYFNYDNSGSSQSSSIQADQLAGQWYARACGL-- 771
            +  +D F     + +  YE+ LWNG Y+NYD+S   QS S+ +DQ AGQW+ +ACGL  
Sbjct: 536 AQDIQDKFSSILSRGQEAYERLLWNGRYYNYDSSSRPQSRSVMSDQCAGQWFLKACGLGE 595

Query: 772 --LPIVDEDKARSALKKVYNYNVLKVMGGKRGAVNGMLPDGRVDMSSMQSREIWSGVTYA 829
               +        AL+ V+  NV    GG  GAVNGM P G  D SS+QS E+W GV Y 
Sbjct: 596 GDTEVFPTQHVVRALQTVFELNVQAFAGGAMGAVNGMQPHGVPDKSSVQSDEVWVGVVYG 655

Query: 830 VAASMIHEDLADIGFQTACGIYEAAWSGTGLGYAFQTPEAWNTDDQYRSLCYMRPLAIW 888
           +AA+MI E L   GFQTA G Y   W   GL  AFQTPEA+     +RSL YMRPL+IW
Sbjct: 656 LAATMIQEGLTWEGFQTAEGCYRTVWERLGL--AFQTPEAYCQQRVFRSLAYMRPLSIW 712


>gi|194386992|dbj|BAG59862.1| unnamed protein product [Homo sapiens]
          Length = 755

 Score =  536 bits (1380), Expect = e-149,   Method: Compositional matrix adjust.
 Identities = 311/779 (39%), Positives = 427/779 (54%), Gaps = 87/779 (11%)

Query: 124 VCEDKPVLANQFSVFVSRSNGQKYSSVLCPKTPEVLKDTTAAGIGSWDWNLKGDKSTYHA 183
           V  D+P+   +F+V + R     Y  VL  + P VL+        SW+W L G  + YHA
Sbjct: 7   VIADQPICPLKFTVCLRREGQTVYQQVLSLERPSVLR--------SWNWGLCGYFAFYHA 58

Query: 184 LYPRAWTVHEGEPDPELRIVCRQISPIIPHNYKESSYPVSVFTYTIYNSGKTSADITLLF 243
           LYPRAWTV++  P   + + CRQI+PI+PH+Y++SS PV VF + + N G  + D++++F
Sbjct: 59  LYPRAWTVYQ-LPGQNVTLTCRQITPILPHDYQDSSLPVGVFVWDVENEGDEALDVSIMF 117

Query: 244 TWTNSVGGDSEFTGQHYNSK---TKMNDGVHAVLLHHRTSHQLP-PVTFALAAQETDGVH 299
           +  N +GG  +  G  +N      +  + V  +LLHH T   LP P T A+AA+ T    
Sbjct: 118 SMRNGLGGGDDAPGGLWNEPFCLERSGETVRGLLLHHPT---LPNPHTMAVAARVTAATT 174

Query: 300 VSLCPHFVISGNSLGLTAKDMWHEIKEHGSFDRLNSMETSVTSEPGSSIGAAIAASVTVP 359
           V+      I+      T + +W ++ + G  D  +    S  ++ G  I  A+  S  + 
Sbjct: 175 VTH-----ITAFDPDSTGQQVWQDLLQDGQLD--SPTGQSTPTQKGVGIAGAVCVSSKLR 227

Query: 360 PDSEGQVTFSLAWDCPEVNF-MSGKTYYRRYTKFYGTHQNAAANIARDAILEHGSWELQI 418
           P  + ++ FSLAWD P + F   G+ +YRRYT+F+G   +AA  ++  A+  +  WE +I
Sbjct: 228 PRGQCRLEFSLAWDMPRIMFGAKGQVHYRRYTRFFGQDGDAAPALSHYALCRYAEWEERI 287

Query: 419 EAWQRPILEDKRLPEWYPITLFNELYYLNAGGAVWTDGSPPVHSLVTIGHRKFSLDWSQS 478
            AWQ P+L+D+ LP WY   LFNELY+L  GG VW +                       
Sbjct: 288 SAWQSPVLDDRSLPAWYKSALFNELYFLADGGTVWLE----------------------- 324

Query: 479 DLKRIVDVPNQNDTAVNILERMSSILEQIYTPVALNSSFGTNL--LQDGEENIGQFLYLE 536
                                   +LE      +L    G N+  L+    + G+F YLE
Sbjct: 325 ------------------------VLED-----SLPEELGRNMCHLRPTLRDYGRFGYLE 355

Query: 537 GIEYLMWNTYDVHFYSSFALIMLFPKIQLSIQRDFAAAVMMHDPSKMKLLDDGQWVSRKV 596
           G EY M+NTYDVHFY+SFALIML+PK++LS+Q D A A +  D ++ + L  G     K 
Sbjct: 356 GQEYRMYNTYDVHFYASFALIMLWPKLELSLQYDMALATLREDLTRRRYLMSGVMAPVKR 415

Query: 597 LGAVPHDIGICD--PWFEVNAYCLYDTARWKDLNPKFVLQVYRDVVATGDKKFAKAVWPS 654
              +PHDIG  D  PW  VNAY ++DTA WKDLN KFVLQVYRD   TGD+ F K +WP 
Sbjct: 416 RNVIPHDIGDPDDEPWLRVNAYLIHDTADWKDLNLKFVLQVYRDYYLTGDQNFLKDMWPV 475

Query: 655 VYVAMAYMDQFDRDGDGMIENDGFPDQTYDTWSVSGISAYSGGLWVAALQAASALAREVG 714
               M    +FD+D DG+IEN G+ DQTYD W  +G SAY GGLW+AA+     +A   G
Sbjct: 476 CLAVMESEMKFDKDHDGLIENGGYADQTYDGWVTTGPSAYCGGLWLAAVAVMVQMAALCG 535

Query: 715 DRGSEDYFLFKFQKAKVVYEK-LWNGSYFNYDNSGSSQSSSIQADQLAGQWYARACGL-- 771
            +  +D F     + +  YE+ LWNG Y+NYD+S   QS S+ +DQ AGQW+ +ACGL  
Sbjct: 536 AQDIQDKFSSILSRGQEAYERLLWNGRYYNYDSSSRPQSRSVMSDQCAGQWFLKACGLGE 595

Query: 772 --LPIVDEDKARSALKKVYNYNVLKVMGGKRGAVNGMLPDGRVDMSSMQSREIWSGVTYA 829
               +        AL+ ++  NV    GG  GAVNGM P G  D SS+QS E+W GV Y 
Sbjct: 596 GDTEVFPTQHVVRALQTIFELNVQAFAGGAMGAVNGMQPHGVPDKSSVQSDEVWVGVVYG 655

Query: 830 VAASMIHEDLADIGFQTACGIYEAAWSGTGLGYAFQTPEAWNTDDQYRSLCYMRPLAIW 888
           +AA+MI E L   GFQTA G Y   W   GL  AFQTPEA+     +RSL YMRPL+IW
Sbjct: 656 LAATMIQEGLTWEGFQTAEGCYRTVWERLGL--AFQTPEAYCQQRVFRSLAYMRPLSIW 712


>gi|440799031|gb|ELR20092.1| glucosylceramidase [Acanthamoeba castellanii str. Neff]
          Length = 860

 Score =  530 bits (1364), Expect = e-147,   Method: Compositional matrix adjust.
 Identities = 359/905 (39%), Positives = 488/905 (53%), Gaps = 138/905 (15%)

Query: 36  WQRKLSTGEIPLSQF---TLNWKETVQLAPIGVRILCLIREEAAKGKRAFIDPFIKRH-- 90
           WQR+LS  E P  Q    T     T++  P+G R+    REE  +G R   DP +     
Sbjct: 36  WQRRLS--EQPTRQLRPLTAPLLHTLRNLPLGFRLWRQAREE--RGGR---DPVLDASGG 88

Query: 91  LTSSHGVPLGGVGSGSIGRSYRGEFQRWQIFPRVCEDKPVL-ANQFSVFVSRSNGQKYSS 149
             SS G+P GG+G G +GRS+RG+F+   I P      PV+ ANQFSVF+        ++
Sbjct: 89  QASSSGLPAGGMGGGCVGRSWRGDFEPGLILPNGRGQAPVVPANQFSVFIEPDGEDSEAT 148

Query: 150 VLCPKTPEVLKDTTAAGIGSWDWNLKGDKSTYHALYPRAWTVHEGEPDPELRIVCRQISP 209
           VL P  P   +      +G+WDW+  G++S Y AL+PRAWTV++ EP+P LR+  RQISP
Sbjct: 149 VLNPGIPPTDR------LGAWDWSFVGEESYYSALFPRAWTVYD-EPEPRLRLTSRQISP 201

Query: 210 IIPHNYKESSYPVSVFTYTIYN-SGKTSADITLLFTWTNSVGGDSEFTGQHYNSKTKMND 268
           +I +NYKESS P  VF +T+ N   +     +L+FTW N  G  S+  G H N+      
Sbjct: 202 VIGNNYKESSLPACVFVWTLENLDQEDEITASLMFTWQNGTGAASDALGGHANTPFTRPH 261

Query: 269 GVHAVLLHH---------RTSHQLPPVTFALAAQETDG-VHVSLCPHFVISGNSLGLTAK 318
               V+ H          +T     P++FA+AA   D   H+++      + NS G    
Sbjct: 262 PERVVMDHTQWRVDERGGKTHASSEPLSFAIAASSPDSDAHITV--RSCWATNSDG---A 316

Query: 319 DMWHEIKEHGSFDRLNSMETSVTSEPGSSIGAAIAASVTVPPDSEGQVTFSLAWDCPEVN 378
           ++W + K  G+ +  N       S+ G ++GAAIA  VT+PP    QV F+LAWD P V 
Sbjct: 317 EVWKDFKRSGALEE-NWDALGKASQEGQTLGAAIAVRVTIPPGEHRQVEFALAWDHPVVE 375

Query: 379 FMSGKTYYRRYTKFYGTHQNAAANIARDAILEHGSWELQIEAWQRPILEDKRLPEWYPIT 438
           F +G+ Y RRYT+FY  H ++AA +A DA++++  WE  IE WQ PIL+DKRLP      
Sbjct: 376 FGAGRRYTRRYTRFYSQHGDSAAQLACDALVQYPQWERLIEQWQAPILDDKRLP------ 429

Query: 439 LFNELYYLNAGGAVWTDGSPPVHSLVTIGHRKFSLDWSQSDLKRIVDVPNQNDTAVNILE 498
                     GG VW                       Q +L    + P +     +  E
Sbjct: 430 --------GCGGTVWL----------------------QDELSAKEEKPRREKRETDGQE 459

Query: 499 RMSSILEQIYTPVALNSSFGTNLLQDGEENIGQFLYLEGIEYLMWNTYDVHFYSSFALIM 558
              S                         ++G++  LE  E+ ++N  D  FY+SFALI 
Sbjct: 460 EEQS-------------------------DVGRWACLESQEHRLYNASDALFYASFALIA 494

Query: 559 LFPKIQLSIQRDFAAAVMMHDPSKMKLLDDGQWVSRKVLGAVPHDIGIC--DPWFEVNAY 616
            +P++QL++QR+ AAA +  D   + +L  G+    KV GAVP+D+G    DPW  +NAY
Sbjct: 495 NWPQLQLALQREVAAATLREDSRMVDVLATGRQAPNKVRGAVPYDMGTPADDPWVALNAY 554

Query: 617 CLYDTARWK---------------------DLNPKFVLQVYRDVVATGDKKFAKAVWPSV 655
            +++T+ WK                     DLN KFVL VYRD VAT DK+F   VWP+V
Sbjct: 555 NIHETSSWKGKLGLLPASHLSPSTELTEAADLNSKFVLGVYRDFVATQDKRFLADVWPAV 614

Query: 656 YVAMAYMDQFDRDGDGMIENDGFPDQTYDTWSVSGISAYSGGLWVAALQAASAL--AREV 713
             A+ ++ QFDR+ DGMIEN+G PDQTY +W        +GGLWVAAL AA A+  A ++
Sbjct: 615 DAALRHLQQFDRNKDGMIENEGLPDQTYHSW--------TGGLWVAALSAACAMAQALQL 666

Query: 714 GDRGSEDYFLFKFQKAKVVY-EKLWNGS-YFNYDNSGSSQSSSIQADQLAGQWYARACGL 771
            D     ++    Q A  VY E LWN S YFNYD S S+   SIQADQL GQWYARACGL
Sbjct: 667 ADDSVAQHYRDLHQSAARVYNESLWNPSGYFNYDTSTSTHHDSIQADQLCGQWYARACGL 726

Query: 772 LPIVDEDKARSALKKVYNYNVLKVMGGKRGAVNGMLPDGRVDMSSMQSREIWSGVTYAVA 831
             I  + +  +ALK V+  N  +   GKRGAVNGM PDGRVD + + S E W G T+A+A
Sbjct: 727 PSIAPDKRIVTALKTVFANNFNE---GKRGAVNGMRPDGRVDYACLHSSEAWPGTTFALA 783

Query: 832 ASMIHEDLADIGFQTACGIYEAAWSGTGLGYAFQTPEAWNTDDQYRSLCYMRPLAIWAMQ 891
           A M+ E L D  F TA G+  + +    LGY F+TPEAW+    YR+L +MRPLAIWAMQ
Sbjct: 784 ACMMGEGLVDQAFITAAGVCGSIYE--DLGYWFRTPEAWDGSGNYRALGHMRPLAIWAMQ 841

Query: 892 WALTR 896
           WA T+
Sbjct: 842 WAWTQ 846


>gi|413920605|gb|AFW60537.1| hypothetical protein ZEAMMB73_659971, partial [Zea mays]
          Length = 449

 Score =  521 bits (1341), Expect = e-145,   Method: Compositional matrix adjust.
 Identities = 260/427 (60%), Positives = 317/427 (74%), Gaps = 6/427 (1%)

Query: 19  LNSSFDKVDPAKPASLTWQRKLSTGEIPLSQFTLNWKETVQLAPIGVRILCLIREEAAKG 78
           +N   +KV P +   LTW+ KL+     L  F L W+E +QLA IG+R+   I EE +KG
Sbjct: 22  MNGDEEKVYPGQLPVLTWEHKLTHVRHDLPPFRLTWREIMQLAGIGLRLSRHILEETSKG 81

Query: 79  KRAFIDPFIKRHLTSSHGVPLGGVGSGSIGRSYRGEFQRWQIFPRVCEDKPVLANQFSVF 138
           + A IDP  KR   S  GVPLGG    SIGRSY+G+FQRWQ+FP  CED  VLANQFS F
Sbjct: 82  RIAVIDPMKKRAARSGQGVPLGG----SIGRSYKGDFQRWQLFPGSCEDNAVLANQFSAF 137

Query: 139 VSRSNGQKYSSVLCPKTPEVLKDTTAAGIGSWDWNLKGDKSTYHALYPRAWTVHEGEPDP 198
           +SR +G+KYS+VL P  P++ K +  +GIGSWDWN+ G +STYHALYPRAWTV++GEPDP
Sbjct: 138 ISRQDGKKYSTVLHPGKPDLPKGSDISGIGSWDWNMSGQQSTYHALYPRAWTVYDGEPDP 197

Query: 199 ELRIVCRQISPIIPHNYKESSYPVSVFTYTIYNSGKTSADITLLFTWTNSVGGDSEFTGQ 258
           EL IVCRQISPIIPHNY++SSYPV+VFT+T+ NSG T+AD+TLLFTW NSVGG SE TG 
Sbjct: 198 ELNIVCRQISPIIPHNYQQSSYPVAVFTFTVTNSGHTAADVTLLFTWANSVGGKSELTGY 257

Query: 259 HYNSKTKMNDGVHAVLLHHRTSHQLPPVTFALAAQETDGVHVSLCPHFVISGNSLG--LT 316
           H NS     DGVH +LLHHRT+   P VTF +AAQE + + +S CP+FVISG+S     T
Sbjct: 258 HSNSSMIEKDGVHGILLHHRTADGQPHVTFVIAAQEKEDILISECPYFVISGSSASDEFT 317

Query: 317 AKDMWHEIKEHGSFDRLNSMETSVTSEPGSSIGAAIAASVTVPPDSEGQVTFSLAWDCPE 376
           AKDMW+ +KEHGSFD L+ ++TS+ S PGSSIGAAIAASV + P +   ++FSLAW CPE
Sbjct: 318 AKDMWNSVKEHGSFDHLDPIKTSMCSRPGSSIGAAIAASVKLAPKATQDISFSLAWACPE 377

Query: 377 VNFMSGKTYYRRYTKFYGTHQNAAANIARDAILEHGSWELQIEAWQRPILEDKRLPEWYP 436
           V F SGKTY+RRYTKFYGT  +AAA +A DAILEH SWE QIE WQ PIL+D+R P W+ 
Sbjct: 378 VKFSSGKTYHRRYTKFYGTDGDAAAALAHDAILEHASWERQIEEWQDPILQDERFPAWFV 437

Query: 437 ITLFNEL 443
            TL   L
Sbjct: 438 PTLIIHL 444


>gi|119578751|gb|EAW58347.1| glucosidase, beta (bile acid) 2, isoform CRA_c [Homo sapiens]
 gi|168270622|dbj|BAG10104.1| non-lysosomal glucosylceramidase [synthetic construct]
          Length = 877

 Score =  514 bits (1323), Expect = e-142,   Method: Compositional matrix adjust.
 Identities = 306/782 (39%), Positives = 429/782 (54%), Gaps = 85/782 (10%)

Query: 72  REEAAKGKRAFIDPFIKRHLTSSHGVPLGGVGSGSIGRSYRGEFQRWQIFPRVCEDKPVL 131
           R+   + K  FID      L   +G PLGG+G G+I R +RG+F RWQ+ P + + + V+
Sbjct: 127 RKTHVEKKTPFIDMINSVPLRQIYGCPLGGIGGGTITRGWRGQFCRWQLNPGMYQHRTVI 186

Query: 132 ANQFSVFVSRSNGQKYSSVLCPKTPEVLKDTTAAGIGSWDWNLKGDKSTYHALYPRAWTV 191
           A+QF+V + R     Y  VL  + P VL+        SW+W L G  + YHALYPRAWTV
Sbjct: 187 ADQFTVCLRREGQTVYQQVLSLERPSVLR--------SWNWGLCGYFAFYHALYPRAWTV 238

Query: 192 HEGEPDPELRIVCRQISPIIPHNYKESSYPVSVFTYTIYNSGKTSADITLLFTWTNSVGG 251
           ++  P   + + CRQI+PI+PH+Y++SS PV VF + + N G  + D++++F+  N +GG
Sbjct: 239 YQ-LPGQNVTLTCRQITPILPHDYQDSSLPVGVFVWDVENEGDEALDVSIMFSMRNGLGG 297

Query: 252 DSEFTGQHYNSK---TKMNDGVHAVLLHHRTSHQLP-PVTFALAAQETDGVHVSLCPHFV 307
             +  G  +N      +  + V  +LLHH T   LP P T A+AA+ T    V+      
Sbjct: 298 GDDAPGGLWNEPFCLERSGETVRGLLLHHPT---LPNPYTMAVAARVTAATTVTH----- 349

Query: 308 ISGNSLGLTAKDMWHEIKEHGSFDRLNSMETSVTSEPGSSIGAAIAASVTVPPDSEGQVT 367
           I+      T + +W ++ + G  D  +    S  ++ G  I  A+  S  + P  + ++ 
Sbjct: 350 ITAFDPDSTGQQVWQDLLQDGQLD--SPTGQSTPTQKGVGIAGAVCVSSKLRPRGQCRLE 407

Query: 368 FSLAWDCPEVNF-MSGKTYYRRYTKFYGTHQNAAANIARDAILEHGSWELQIEAWQRPIL 426
           FSLAWD P + F   G+ +YRRYT+F+G   +AA  ++  A+  +  WE +I AWQ P+L
Sbjct: 408 FSLAWDMPRIMFGAKGQVHYRRYTRFFGQDGDAAPALSHYALCRYAEWEERISAWQSPVL 467

Query: 427 EDKRLPEWYPITLFNELYYLNAGGAVWTDGSPPVHSLVTIGHRKFSLDWSQSDLKRIVDV 486
           +D+ LP WY   LFNELY+L  GG VW +                               
Sbjct: 468 DDRSLPAWYKSALFNELYFLADGGTVWLE------------------------------- 496

Query: 487 PNQNDTAVNILERMSSILEQIYTPVALNSSFGTNL--LQDGEENIGQFLYLEGIEYLMWN 544
                           +LE      +L    G N+  L+    + G+F YLEG EY M+N
Sbjct: 497 ----------------VLED-----SLPEELGRNMCHLRPTLRDYGRFGYLEGQEYRMYN 535

Query: 545 TYDVHFYSSFALIMLFPKIQLSIQRDFAAAVMMHDPSKMKLLDDGQWVSRKVLGAVPHDI 604
           TYDVHFY+SFALIML+PK++LS+Q D A A +  D ++ + L  G     K    +PHDI
Sbjct: 536 TYDVHFYASFALIMLWPKLELSLQYDMALATLREDLTRRRYLMSGVMAPVKRRNVIPHDI 595

Query: 605 GICD--PWFEVNAYCLYDTARWKDLNPKFVLQVYRDVVATGDKKFAKAVWPSVYVAMAYM 662
           G  D  PW  VNAY ++DTA WKDLN KFVLQVYRD   TGD+ F K +WP     M   
Sbjct: 596 GDPDDEPWLRVNAYLIHDTADWKDLNLKFVLQVYRDYYLTGDQNFLKDMWPVCLAVMESE 655

Query: 663 DQFDRDGDGMIENDGFPDQTYDTWSVSGISAYSGGLWVAALQAASALAREVGDRGSEDYF 722
            +FD+D DG+IEN G+ DQTYD W  +G SAY GGLW+AA+     +A   G +  +D F
Sbjct: 656 MKFDKDHDGLIENGGYADQTYDGWVTTGPSAYCGGLWLAAVAVMVQMAALCGAQDIQDKF 715

Query: 723 LFKFQKAKVVYEK-LWNGSYFNYDNSGSSQSSSIQADQLAGQWYARACGL----LPIVDE 777
                + +  YE+ LWNG Y+NYD+S   QS S+ +DQ AGQW+ +ACGL      +   
Sbjct: 716 SSILSRGQEAYERLLWNGRYYNYDSSSRPQSRSVMSDQCAGQWFLKACGLGEGDTEVFPT 775

Query: 778 DKARSALKKVYNYNVLKVMGGKRGAVNGMLPDGRVDMSSMQSREIWSGVTYAVAASMIHE 837
                AL+ ++  NV    GG  GAVNGM P G  D SS+QS E+W GV Y +AA+MI E
Sbjct: 776 QHVVRALQTIFELNVQAFAGGAMGAVNGMQPHGVPDKSSVQSDEVWVGVVYGLAATMIQE 835

Query: 838 DL 839
            L
Sbjct: 836 LL 837


>gi|10047285|dbj|BAB13431.1| KIAA1605 protein [Homo sapiens]
          Length = 922

 Score =  513 bits (1322), Expect = e-142,   Method: Compositional matrix adjust.
 Identities = 306/782 (39%), Positives = 429/782 (54%), Gaps = 85/782 (10%)

Query: 72  REEAAKGKRAFIDPFIKRHLTSSHGVPLGGVGSGSIGRSYRGEFQRWQIFPRVCEDKPVL 131
           R+   + K  FID      L   +G PLGG+G G+I R +RG+F RWQ+ P + + + V+
Sbjct: 172 RKTHVEKKTPFIDMINSVPLRQIYGCPLGGIGGGTITRGWRGQFCRWQLNPGMYQHRTVI 231

Query: 132 ANQFSVFVSRSNGQKYSSVLCPKTPEVLKDTTAAGIGSWDWNLKGDKSTYHALYPRAWTV 191
           A+QF+V + R     Y  VL  + P VL+        SW+W L G  + YHALYPRAWTV
Sbjct: 232 ADQFTVCLRREGQTVYQQVLSLERPSVLR--------SWNWGLCGYFAFYHALYPRAWTV 283

Query: 192 HEGEPDPELRIVCRQISPIIPHNYKESSYPVSVFTYTIYNSGKTSADITLLFTWTNSVGG 251
           ++  P   + + CRQI+PI+PH+Y++SS PV VF + + N G  + D++++F+  N +GG
Sbjct: 284 YQ-LPGQNVTLTCRQITPILPHDYQDSSLPVGVFVWDVENEGDEALDVSIMFSMRNGLGG 342

Query: 252 DSEFTGQHYNSK---TKMNDGVHAVLLHHRTSHQLP-PVTFALAAQETDGVHVSLCPHFV 307
             +  G  +N      +  + V  +LLHH T   LP P T A+AA+ T    V+      
Sbjct: 343 GDDAPGGLWNEPFCLERSGETVRGLLLHHPT---LPNPYTMAVAARVTAATTVTH----- 394

Query: 308 ISGNSLGLTAKDMWHEIKEHGSFDRLNSMETSVTSEPGSSIGAAIAASVTVPPDSEGQVT 367
           I+      T + +W ++ + G  D  +    S  ++ G  I  A+  S  + P  + ++ 
Sbjct: 395 ITAFDPDSTGQQVWQDLLQDGQLD--SPTGQSTPTQKGVGIAGAVCVSSKLRPRGQCRLE 452

Query: 368 FSLAWDCPEVNF-MSGKTYYRRYTKFYGTHQNAAANIARDAILEHGSWELQIEAWQRPIL 426
           FSLAWD P + F   G+ +YRRYT+F+G   +AA  ++  A+  +  WE +I AWQ P+L
Sbjct: 453 FSLAWDMPRIMFGAKGQVHYRRYTRFFGQDGDAAPALSHYALCRYAEWEERISAWQSPVL 512

Query: 427 EDKRLPEWYPITLFNELYYLNAGGAVWTDGSPPVHSLVTIGHRKFSLDWSQSDLKRIVDV 486
           +D+ LP WY   LFNELY+L  GG VW +                               
Sbjct: 513 DDRSLPAWYKSALFNELYFLADGGTVWLE------------------------------- 541

Query: 487 PNQNDTAVNILERMSSILEQIYTPVALNSSFGTNL--LQDGEENIGQFLYLEGIEYLMWN 544
                           +LE      +L    G N+  L+    + G+F YLEG EY M+N
Sbjct: 542 ----------------VLED-----SLPEELGRNMCHLRPTLRDYGRFGYLEGQEYRMYN 580

Query: 545 TYDVHFYSSFALIMLFPKIQLSIQRDFAAAVMMHDPSKMKLLDDGQWVSRKVLGAVPHDI 604
           TYDVHFY+SFALIML+PK++LS+Q D A A +  D ++ + L  G     K    +PHDI
Sbjct: 581 TYDVHFYASFALIMLWPKLELSLQYDMALATLREDLTRRRYLMSGVMAPVKRRNVIPHDI 640

Query: 605 GICD--PWFEVNAYCLYDTARWKDLNPKFVLQVYRDVVATGDKKFAKAVWPSVYVAMAYM 662
           G  D  PW  VNAY ++DTA WKDLN KFVLQVYRD   TGD+ F K +WP     M   
Sbjct: 641 GDPDDEPWLRVNAYLIHDTADWKDLNLKFVLQVYRDYYLTGDQNFLKDMWPVCLAVMESE 700

Query: 663 DQFDRDGDGMIENDGFPDQTYDTWSVSGISAYSGGLWVAALQAASALAREVGDRGSEDYF 722
            +FD+D DG+IEN G+ DQTYD W  +G SAY GGLW+AA+     +A   G +  +D F
Sbjct: 701 MKFDKDHDGLIENGGYADQTYDGWVTTGPSAYCGGLWLAAVAVMVQMAALCGAQDIQDKF 760

Query: 723 LFKFQKAKVVYEK-LWNGSYFNYDNSGSSQSSSIQADQLAGQWYARACGL----LPIVDE 777
                + +  YE+ LWNG Y+NYD+S   QS S+ +DQ AGQW+ +ACGL      +   
Sbjct: 761 SSILSRGQEAYERLLWNGRYYNYDSSSRPQSRSVMSDQCAGQWFLKACGLGEGDTEVFPT 820

Query: 778 DKARSALKKVYNYNVLKVMGGKRGAVNGMLPDGRVDMSSMQSREIWSGVTYAVAASMIHE 837
                AL+ ++  NV    GG  GAVNGM P G  D SS+QS E+W GV Y +AA+MI E
Sbjct: 821 QHVVRALQTIFELNVQAFAGGAMGAVNGMQPHGVPDKSSVQSDEVWVGVVYGLAATMIQE 880

Query: 838 DL 839
            L
Sbjct: 881 LL 882


>gi|403306662|ref|XP_003943842.1| PREDICTED: non-lysosomal glucosylceramidase isoform 1 [Saimiri
           boliviensis boliviensis]
          Length = 923

 Score =  501 bits (1291), Expect = e-139,   Method: Compositional matrix adjust.
 Identities = 315/832 (37%), Positives = 436/832 (52%), Gaps = 89/832 (10%)

Query: 72  REEAAKGKRAFIDPFIKRHLTSSHGVPLGGVGSGSIGRSYRGEFQRWQIFPRVCEDKPVL 131
           R+   + K  FID      L   +G PLGG+G G+I R +RG+F RWQ+ P + + + V+
Sbjct: 127 RKTQVEKKTPFIDMINSVPLRQIYGCPLGGIGGGTITRGWRGQFCRWQLNPGMYQHRTVI 186

Query: 132 ANQFSVFVSRSNGQKYSSVLCPKTPEVLKDTTAAGIGSWDWNLKGDKSTYHALYPRAWTV 191
           A+QF+V + R     Y  VL  + P VL+        SW+W L G  + YHALYPRAWTV
Sbjct: 187 ADQFTVCLRREGQTVYQQVLSLERPNVLR--------SWNWGLCGYFAFYHALYPRAWTV 238

Query: 192 HEGEPDPELRIVCRQISPIIPHNYKESSYPVSVFTYTIYNSGKTSADITLLFTWTNSVGG 251
           ++  P   + + CRQI+PI+PH+Y++SS PV VF + + N G  + D++++F+  N +GG
Sbjct: 239 YQ-LPGQNVTLTCRQITPILPHDYQDSSLPVGVFVWDVENEGDEALDVSIMFSMRNGLGG 297

Query: 252 DSEFTGQHYNSKTKM-NDG--VHAVLLHHRTSHQLP-PVTFALAAQETDGVHVSLCPHFV 307
             +  G  +N    +  DG  V  +LLHH T   LP P T A AA+ T    V+    F 
Sbjct: 298 GDDAPGGLWNEPFHLERDGETVQGLLLHHPT---LPNPYTMAAAARVTADTMVTHITAFD 354

Query: 308 ISGNSLGLTAKDMWHEIKEHGSFDRLNSMETSVTSEPGSSIGAAIAASVTVPPDSEGQVT 367
             G     T + +W ++ + G  D  +    S  ++ G  +  A+  S  + P  +  + 
Sbjct: 355 PDG-----TGQQVWQDLLQDGQLD--SPAGQSTPTQKGVGVAGAVCVSSKLRPRGQCCLE 407

Query: 368 FSLAWDCPEVNF-MSGKTYYRRYTKFYGTHQNAAANIARDAILEHGSWELQIEAWQRPIL 426
           FSLAWD P + F   G+ +YRRYT+F+G   +AA  ++  A+  +  WE +I AWQ P+L
Sbjct: 408 FSLAWDMPRIMFGAKGQVHYRRYTRFFGQDGDAAPALSHYALSRYTEWEERISAWQSPVL 467

Query: 427 EDKRLPEWYPITLFNELYYLNAGGAVW---TDGSPPVHSLVTIGHRKFSLDWSQSDLKRI 483
           +D+ LP WY   LFNELY+L  GG VW    + S P      + H + +L     D  R 
Sbjct: 468 DDRSLPAWYKSALFNELYFLADGGTVWLEVREDSLPQELDGNMCHLRPTL----RDYGRF 523

Query: 484 VDVPNQNDTAVNILERMSSILEQIYTPVALNSSFGTNLLQDGEENIGQFLYLEGIEYLMW 543
             +  Q     N  +            V   +SF   +L                    W
Sbjct: 524 GYLEGQEYRMYNTYD------------VHFYASFALIML--------------------W 551

Query: 544 NTYDVHFYSSFALIMLFPKIQLSIQRDFAAAVMMHDPSKMKLLDDGQWVSRKVLGAVPHD 603
              ++      AL  L  +  L+ +R   + VM   P K +               +PHD
Sbjct: 552 PKLELSLQYDMALATL--REDLTRRRYLMSGVMA--PVKRR-------------NVIPHD 594

Query: 604 IGICD--PWFEVNAYCLYDTARWKDLNPKFVLQVYRDVVATGDKKFAKAVWPSVYVAMAY 661
           IG  D  PW  VNAY ++DTA WKDLN KFVLQVYRD   TGD+ F K +WP     M  
Sbjct: 595 IGDPDDEPWLRVNAYLIHDTADWKDLNLKFVLQVYRDYYLTGDQNFLKDMWPVCLAVMES 654

Query: 662 MDQFDRDGDGMIENDGFPDQTYDTWSVSGISAYSGGLWVAALQAASALAREVGDRGSEDY 721
             +FD+D DG+IEN G+ DQTYD W  +G SAY GGLW+AA+     +A   G +  +D 
Sbjct: 655 EMKFDKDHDGLIENGGYADQTYDGWVTTGPSAYCGGLWLAAVAVMVQMAAVCGAQDIQDK 714

Query: 722 FLFKFQKAKVVYEK-LWNGSYFNYDNSGSSQSSSIQADQLAGQWYARACGL----LPIVD 776
           F     + +  YE+ LWNG Y+NYD+S   QS S+ +DQ AGQW+ +ACGL      +  
Sbjct: 715 FSSILSRGQEAYERLLWNGRYYNYDSSSQPQSRSVMSDQCAGQWFLKACGLGEGDTEVFP 774

Query: 777 EDKARSALKKVYNYNVLKVMGGKRGAVNGMLPDGRVDMSSMQSREIWSGVTYAVAASMIH 836
                 AL+ ++  NV    GG  GAVNGM P G  D SS+QS E+W GV Y +AA+MI 
Sbjct: 775 TPHVVRALQTIFELNVQAFAGGAMGAVNGMQPHGVPDKSSVQSDEVWVGVVYGLAATMIQ 834

Query: 837 EDLADIGFQTACGIYEAAWSGTGLGYAFQTPEAWNTDDQYRSLCYMRPLAIW 888
           E L   GFQTA G Y   W   GL  AFQTPEA+     +RSL YMRPL+IW
Sbjct: 835 EGLTWEGFQTAEGCYRTVWERLGL--AFQTPEAYCQQRVFRSLAYMRPLSIW 884


>gi|431902810|gb|ELK09025.1| Non-lysosomal glucosylceramidase [Pteropus alecto]
          Length = 970

 Score =  501 bits (1289), Expect = e-138,   Method: Compositional matrix adjust.
 Identities = 295/777 (37%), Positives = 418/777 (53%), Gaps = 103/777 (13%)

Query: 72  REEAAKGKRAFIDPFIKRHLTSSHGVPLGGVGSGSIGRSYRGEFQRWQIFPRVCEDKPVL 131
           R+   + K+ FID      L   +G PLGG+G G+I R +RG+F RWQ+ P + + + V+
Sbjct: 119 RKTQVEKKKPFIDLINSVPLKQIYGCPLGGIGGGTITRGWRGQFCRWQLNPGMYQHRTVI 178

Query: 132 ANQFSVFVSRSNGQKYSSVLCPKTPEVLKDTTAAGIGSWDWNLKGDKSTYHALYPRAWTV 191
           A+QF+V + R     Y  VL  + P VL+        SW+W L G  + YHALYPRAWTV
Sbjct: 179 ADQFTVCLRREGRTVYQQVLSVERPSVLR--------SWNWGLCGYFAFYHALYPRAWTV 230

Query: 192 HEGEPDPELRIVCRQISPIIPHNYKESSYPVSVFTYTIYNSGKTSADITLLFTWTNSVGG 251
           ++  P  ++ + CRQI+PI+PH+Y++SS PV +F + + N G  + D++++F+  N +GG
Sbjct: 231 YQ-LPGQKVTLTCRQITPILPHDYQDSSLPVGIFVWDVENEGDEALDVSIMFSMRNGLGG 289

Query: 252 DSEFTGQHYNSKTKMN-DGVHAVLLHHRTSHQLPPVTFALAAQETDGVHVSLCPHFVISG 310
             + +G  +N    +  DG  A                     +T   H++         
Sbjct: 290 GDDASGGLWNEPFCLERDGETA---------------------DTTITHIT-----AFDP 323

Query: 311 NSLGLTAKDMWHEIKEHGSFDRLNSMETSVTSEPGSSIGAAIAASVTVPPDSEGQVTFSL 370
           +S   T + +W ++ + G  D  +    S  S+ G  I  A+  +  +PP  +  + FSL
Sbjct: 324 DS---TGQQVWQDLLQDGQLD--SPAGRSTPSQKGVGIAGAVCVTGKLPPRGQCCLEFSL 378

Query: 371 AWDCPEVNF-MSGKTYYRRYTKFYGTHQNAAANIARDAILEHGSWELQIEAWQRPILEDK 429
           AWD P + F   G+ YYRRYT+F+G+  +AA  ++  A+  +  WE +I AWQ P+L+D+
Sbjct: 379 AWDMPRIMFGAKGQVYYRRYTRFFGSDGDAAPALSHYALCRYTDWEERISAWQSPVLDDR 438

Query: 430 RLPEWYPITLFNELYYLNAGGAVWTDGSPPVHSLVTIGHRKFSLDWSQSDLKRIVDVPNQ 489
            LP WY   LFNELY+L  GG +W                              ++VP  
Sbjct: 439 SLPSWYKSALFNELYFLADGGTIW------------------------------LEVPED 468

Query: 490 N--DTAVNILERMSSILEQIYTPVALNSSFGTNLLQDGEENIGQFLYLEGIEYLMWNTYD 547
              +  V  + ++  IL++                       G+F YLEG EY M+NTYD
Sbjct: 469 ALPEELVGNMCQLRPILQEY----------------------GRFGYLEGQEYRMYNTYD 506

Query: 548 VHFYSSFALIMLFPKIQLSIQRDFAAAVMMHDPSKMKLLDDGQWVSRKVLGAVPHDIGIC 607
           VHFY+SFALIML+PK++LS+Q D A A +  D S  + L  G     K    +PHDIG  
Sbjct: 507 VHFYASFALIMLWPKLELSLQYDMALATLREDLSWRQYLMSGVVAPVKRRNVIPHDIGDP 566

Query: 608 D--PWFEVNAYCLYDTARWKDLNPKFVLQVYRDVVATGDKKFAKAVWPSVYVAMAYMDQF 665
           D  PW  VNAY ++DTA WKDLN KFVLQVYRD   TGD+ F + +WP     M    +F
Sbjct: 567 DDEPWLRVNAYMIHDTADWKDLNLKFVLQVYRDYYLTGDQGFLRDMWPVCLAVMESEMKF 626

Query: 666 DRDGDGMIENDGFPDQTYDTWSVSGISAYSGGLWVAALQAASALAREVGDRGSEDYFLFK 725
           D+D DG+IEN G+ DQTYD W  +G SAY GGLW+AA+     +A   G +   D F   
Sbjct: 627 DKDQDGLIENGGYADQTYDAWVTTGPSAYCGGLWLAAVAVMVQMAALCGTQDILDKFSSI 686

Query: 726 FQKAKVVYEK-LWNGSYFNYDNSGSSQSSSIQADQLAGQWYARACGL----LPIVDEDKA 780
             + +  YE+ LWNG Y+NYD S   QS SI +DQ AGQW+ RA GL      +      
Sbjct: 687 LSRGREAYERLLWNGHYYNYDCSSQPQSYSIMSDQCAGQWFLRASGLGEGDTEVFPTPHV 746

Query: 781 RSALKKVYNYNVLKVMGGKRGAVNGMLPDGRVDMSSMQSREIWSGVTYAVAASMIHE 837
             AL+ ++ +NV    GG  GAVNGM P G  D SS+QS E+W GV Y +AA+MI E
Sbjct: 747 VCALQTIFKFNVQAFAGGAMGAVNGMQPHGVPDRSSVQSDEVWVGVVYGLAATMIQE 803



 Score = 47.4 bits (111), Expect = 0.042,   Method: Compositional matrix adjust.
 Identities = 23/41 (56%), Positives = 26/41 (63%), Gaps = 2/41 (4%)

Query: 850 IYEAAWSGTGLGYAFQTPEAWNTDDQYRSLCYMRPLAIWAM 890
            Y   W   GL  AFQTPEA+     +RSL YMRPL+IWAM
Sbjct: 890 CYRTVWERLGL--AFQTPEAYCQHQVFRSLAYMRPLSIWAM 928


>gi|10440626|gb|AAG16864.1|AC069145_13 unknown protein [Oryza sativa Japonica Group]
          Length = 444

 Score =  497 bits (1279), Expect = e-137,   Method: Compositional matrix adjust.
 Identities = 248/379 (65%), Positives = 296/379 (78%), Gaps = 1/379 (0%)

Query: 521 LLQDGEENIGQFLYLEGIEYLMWNTYDVHFYSSFALIMLFPKIQLSIQRDFAAAVMMHDP 580
           L ++G EN+G+FLYLEG+EY+MW TYDVHFY+SFAL+ LFPKI+LSIQRDFA AV+  D 
Sbjct: 18  LSKNGPENVGKFLYLEGVEYIMWCTYDVHFYASFALLDLFPKIELSIQRDFANAVLYEDR 77

Query: 581 SKMKLLDDGQWVSRKVLGAVPHDIGICDPWFEVNAYCLYDTARWKDLNPKFVLQVYRDVV 640
            +MK L DG    RKV GAVPHD+G  DPW E+NAY ++DT++WKDLNPKFVLQVYRD  
Sbjct: 78  RRMKFLADGTSGIRKVKGAVPHDLGTHDPWHEMNAYNIHDTSKWKDLNPKFVLQVYRDFA 137

Query: 641 ATGDKKFAKAVWPSVYVAMAYMDQFDRDGDGMIENDGFPDQTYDTWSVSGISAYSGGLWV 700
           ATGD  F + VWP+V   M YM+QFDRDGDG+IENDGFPDQTYD W+V GISAY GGLW+
Sbjct: 138 ATGDMSFGRDVWPAVCAVMDYMNQFDRDGDGLIENDGFPDQTYDAWTVHGISAYCGGLWL 197

Query: 701 AALQAASALAREVGDRGSEDYFLFKFQKAKVVYE-KLWNGSYFNYDNSGSSQSSSIQADQ 759
           AALQAA+ +A  +GDR   + +  KF +AK VYE KLWNGSYFNYD+  SS S SIQADQ
Sbjct: 198 AALQAAATMAHRLGDRPFAEKYKLKFIQAKAVYEAKLWNGSYFNYDSGTSSNSRSIQADQ 257

Query: 760 LAGQWYARACGLLPIVDEDKARSALKKVYNYNVLKVMGGKRGAVNGMLPDGRVDMSSMQS 819
           LAGQWYA + GL P+ DE+K RSAL+K++ +NV+KV GG+ GAVNGM P+G+VD + MQS
Sbjct: 258 LAGQWYAASSGLPPLFDENKIRSALQKIFEFNVMKVKGGRLGAVNGMTPNGKVDETCMQS 317

Query: 820 REIWSGVTYAVAASMIHEDLADIGFQTACGIYEAAWSGTGLGYAFQTPEAWNTDDQYRSL 879
           REIW+GVTY VAA+M+   +   GF TA GI+ A WS  G GY FQTPE W  D  YRSL
Sbjct: 318 REIWTGVTYGVAANMLLHGMEHQGFTTAEGIFIAGWSEEGYGYWFQTPEGWTIDGHYRSL 377

Query: 880 CYMRPLAIWAMQWALTRPK 898
            YMRPLAIWAMQWA + PK
Sbjct: 378 IYMRPLAIWAMQWARSPPK 396


>gi|403306664|ref|XP_003943843.1| PREDICTED: non-lysosomal glucosylceramidase isoform 2 [Saimiri
           boliviensis boliviensis]
          Length = 929

 Score =  495 bits (1274), Expect = e-137,   Method: Compositional matrix adjust.
 Identities = 315/838 (37%), Positives = 434/838 (51%), Gaps = 95/838 (11%)

Query: 72  REEAAKGKRAFIDPFIKRHLTSSHGVPLGGVGSGSIGRSYRGEFQRWQIFP------RVC 125
           R+   + K  FID      L   +G PLGG+G G+I R +RG+F RWQ+ P       V 
Sbjct: 127 RKTQVEKKTPFIDMINSVPLRQIYGCPLGGIGGGTITRGWRGQFCRWQLNPGMYQHRTVI 186

Query: 126 EDKPVLANQFSVFVSRSNGQKYSSVLCPKTPEVLKDTTAAGIGSWDWNLKGDKSTYHALY 185
            D+P+   +F+V + R     Y  VL  + P VL+        SW+W L G  + YHALY
Sbjct: 187 ADQPICPLKFTVCLRREGQTVYQQVLSLERPNVLR--------SWNWGLCGYFAFYHALY 238

Query: 186 PRAWTVHEGEPDPELRIVCRQISPIIPHNYKESSYPVSVFTYTIYNSGKTSADITLLFTW 245
           PRAWTV++  P   + + CRQI+PI+PH+Y++SS PV VF + + N G  + D++++F+ 
Sbjct: 239 PRAWTVYQ-LPGQNVTLTCRQITPILPHDYQDSSLPVGVFVWDVENEGDEALDVSIMFSM 297

Query: 246 TNSVGGDSEFTGQHYNSKTKM-NDG--VHAVLLHHRTSHQLP-PVTFALAAQETDGVHVS 301
            N +GG  +  G  +N    +  DG  V  +LLHH T   LP P T A AA+ T    V+
Sbjct: 298 RNGLGGGDDAPGGLWNEPFHLERDGETVQGLLLHHPT---LPNPYTMAAAARVTADTMVT 354

Query: 302 LCPHFVISGNSLGLTAKDMWHEIKEHGSFDRLNSMETSVTSEPGSSIGAAIAASVTVPPD 361
               F   G     T + +W ++ + G  D  +    S  ++ G  +  A+  S  + P 
Sbjct: 355 HITAFDPDG-----TGQQVWQDLLQDGQLD--SPAGQSTPTQKGVGVAGAVCVSSKLRPR 407

Query: 362 SEGQVTFSLAWDCPEVNF-MSGKTYYRRYTKFYGTHQNAAANIARDAILEHGSWELQIEA 420
            +  + FSLAWD P + F   G+ +YRRYT+F+G   +AA  ++  A+  +  WE +I A
Sbjct: 408 GQCCLEFSLAWDMPRIMFGAKGQVHYRRYTRFFGQDGDAAPALSHYALSRYTEWEERISA 467

Query: 421 WQRPILEDKRLPEWYPITLFNELYYLNAGGAVW---TDGSPPVHSLVTIGHRKFSLDWSQ 477
           WQ P+L+D+ LP WY   LFNELY+L  GG VW    + S P      + H + +L    
Sbjct: 468 WQSPVLDDRSLPAWYKSALFNELYFLADGGTVWLEVREDSLPQELDGNMCHLRPTL---- 523

Query: 478 SDLKRIVDVPNQNDTAVNILERMSSILEQIYTPVALNSSFGTNLLQDGEENIGQFLYLEG 537
            D  R   +  Q     N  +            V   +SF   +L               
Sbjct: 524 RDYGRFGYLEGQEYRMYNTYD------------VHFYASFALIML--------------- 556

Query: 538 IEYLMWNTYDVHFYSSFALIMLFPKIQLSIQRDFAAAVMMHDPSKMKLLDDGQWVSRKVL 597
                W   ++      AL  L  +  L+ +R   + VM   P K +             
Sbjct: 557 -----WPKLELSLQYDMALATL--REDLTRRRYLMSGVMA--PVKRR------------- 594

Query: 598 GAVPHDIGICD--PWFEVNAYCLYDTARWKDLNPKFVLQVYRDVVATGDKKFAKAVWPSV 655
             +PHDIG  D  PW  VNAY ++DTA WKDLN KFVLQVYRD   TGD+ F K +WP  
Sbjct: 595 NVIPHDIGDPDDEPWLRVNAYLIHDTADWKDLNLKFVLQVYRDYYLTGDQNFLKDMWPVC 654

Query: 656 YVAMAYMDQFDRDGDGMIENDGFPDQTYDTWSVSGISAYSGGLWVAALQAASALAREVGD 715
              M    +FD+D DG+IEN G+ DQTYD W  +G SAY GGLW+AA+     +A   G 
Sbjct: 655 LAVMESEMKFDKDHDGLIENGGYADQTYDGWVTTGPSAYCGGLWLAAVAVMVQMAAVCGA 714

Query: 716 RGSEDYFLFKFQKAKVVYEK-LWNGSYFNYDNSGSSQSSSIQADQLAGQWYARACGL--- 771
           +  +D F     + +  YE+ LWNG Y+NYD+S   QS S+ +DQ AGQW+ +ACGL   
Sbjct: 715 QDIQDKFSSILSRGQEAYERLLWNGRYYNYDSSSQPQSRSVMSDQCAGQWFLKACGLGEG 774

Query: 772 -LPIVDEDKARSALKKVYNYNVLKVMGGKRGAVNGMLPDGRVDMSSMQSREIWSGVTYAV 830
              +        AL+ ++  NV    GG  GAVNGM P G  D SS+QS E+W GV Y +
Sbjct: 775 DTEVFPTPHVVRALQTIFELNVQAFAGGAMGAVNGMQPHGVPDKSSVQSDEVWVGVVYGL 834

Query: 831 AASMIHEDLADIGFQTACGIYEAAWSGTGLGYAFQTPEAWNTDDQYRSLCYMRPLAIW 888
           AA+MI E L   GFQTA G Y   W   GL  AFQTPEA+     +RSL YMRPL+IW
Sbjct: 835 AATMIQEGLTWEGFQTAEGCYRTVWERLGL--AFQTPEAYCQQRVFRSLAYMRPLSIW 890


>gi|443709232|gb|ELU03988.1| hypothetical protein CAPTEDRAFT_186839 [Capitella teleta]
          Length = 802

 Score =  489 bits (1258), Expect = e-135,   Method: Compositional matrix adjust.
 Identities = 309/844 (36%), Positives = 454/844 (53%), Gaps = 130/844 (15%)

Query: 77  KGKRAFIDPFIKRHLTSSHGVPLGGVGSGSIGRSYRGEFQRWQIFPRVCEDKPVLANQFS 136
           +GK+ FID          +G P+GG+G GSIGR +RGEF R+Q+ P +   K V A+QF 
Sbjct: 68  QGKKMFIDNLNLVPWKPIYGAPIGGIGCGSIGRGFRGEFCRFQMIPGMYTHKTVEADQFI 127

Query: 137 VFVSRSNGQKYSSVLCPKTPEVLKDTTAAGIGSWDWNLKGDKSTYHALYPRAWTVHEGEP 196
           + V +SN   +  VL P     LK      + SW W     ++TYHALYPR+WT +E  P
Sbjct: 128 LTVRKSNETIHQCVLSP-----LK-LKGKRLSSWRWGFPSQQATYHALYPRSWTRYE-IP 180

Query: 197 DPELRIVCRQISPIIPHNYKESSYPVSVFTYTIYNSGKTSADITLLFTWTNSVGGDSEFT 256
           + +L ++CRQISPI P++Y+E+ +P SVF + + N G  + DI+L FT+ N  G   + +
Sbjct: 181 ECDLTLICRQISPIFPNDYQETCFPGSVFVWEVINKGSDAVDISLTFTFKNGQGVKEDRS 240

Query: 257 GQHYNSK-----TKMNDGVHAVLLHHRTSHQLPPVTFALAAQETDGVHVSLCPHFVISGN 311
           G  ++       T  +D V  V +H + +    P  + +AA+   G++VS   +F     
Sbjct: 241 GGVWSRAFQEGITGTDDCVSGVEIHQKFNDL--PCVYGVAAKHGPGINVSEKSYFNPQS- 297

Query: 312 SLGLTAKDMWHEIKEHGSFDRLNSMETSVTSEPGSSIGAAIAASVTVPPDSEGQVTFSLA 371
               T +++W+++   GS  +   +  S T++ G  +  A+ A  T+ P S   +TFSL 
Sbjct: 298 ----TGEEVWNDLLAEGSL-KSTPVSGSETAK-GEELACAVCAKTTISPYSHTSLTFSLT 351

Query: 372 WDCPEVNFMSGKT-YYRRYTKFYGTHQNAAANIARDAILEHGSWELQIEAWQRPILEDKR 430
           WD P ++F + +  Y+RRY +F+G    A   +A  A+    SWE +I  WQ+P L+  +
Sbjct: 352 WDMPIIHFGAKRNKYHRRYARFFGHDGCATPRLAAHALNHFTSWEEKIAEWQQPTLKANK 411

Query: 431 LPEWYPITLFNELYYLNAGGAVWTDGSPPVHSLVTIGHRKFSLDWSQSDLKRIVDVPNQN 490
           LP WY   LFNE Y+++ GG+VW D                            +DV +  
Sbjct: 412 LPAWYKSALFNETYFISDGGSVWVDS---------------------------IDVKD-- 442

Query: 491 DTAVNILERMSSILEQIYTPVALNSSFGTNLLQDGEENIGQFLYLEGIEYLMWNTYDVHF 550
                      ++ E  +        FG     +G E            Y M+NT DVH+
Sbjct: 443 -----------AVSEHPFV-----QEFGKFAYLEGHE------------YRMYNTLDVHY 474

Query: 551 YSSFALIMLFPKIQLSIQRDFAAAVMMHDPSKMKLLDDGQWVSRKVLGAVPHDIGICDPW 610
           Y+SFAL+ L+P +QLS+Q D               +D+                   +PW
Sbjct: 475 YASFALMKLWPNLQLSLQYD---------------IDN-------------------EPW 500

Query: 611 FEVNAYCLYDTARWKDLNPKFVLQVYRDVVATGDKKFAKAVWPSVYVAMAYMDQFDRDGD 670
             +NAY ++ T  WKDLNPKFVLQVYRD V T D+++   ++P     M +  ++D D D
Sbjct: 501 SRLNAYIIHPTNEWKDLNPKFVLQVYRDYVFTKDEQYLYEMYPQCKAVMDHSLRWDVDRD 560

Query: 671 GMIENDGFPDQTYDTWSVSGISAYSGGLWVAALQAASALAREVGDRGSEDYFLFKFQK-- 728
           G+I+N GF DQTYD W+V+G SAY GGLW+AA++A + +A  +G    ED  L K+++  
Sbjct: 561 GIIDNSGFADQTYDAWTVTGASAYCGGLWLAAVKAFTEMAHRLG--VVED--LMKYEEIL 616

Query: 729 ---AKVVYEKLWNG----SYFNYDNSGSSQSSSIQADQLAGQWYARACGLL--PIVDEDK 779
               K   EKLWNG     Y+NYD S S    SI ADQ+ GQW+ +ACG+    +     
Sbjct: 617 ERGKKNFEEKLWNGWVDCEYYNYDCSESGHHDSIMADQMCGQWFLKACGVADNAVFPSAN 676

Query: 780 ARSALKKVYNYNVLKVMGGKRGAVNGMLPDGRVDMSSMQSREIWSGVTYAVAASMIHEDL 839
            + AL+ V+ +NVL   GG+ GA+NGM P+ + D++S QS E W+GVTYA+ A+MI   +
Sbjct: 677 VKRALEAVFKHNVLPFDGGRMGAINGMRPNAKKDVTSCQSDEFWTGVTYALGATMIQVGM 736

Query: 840 ADIGFQTACGIYEAAWSGTGLGYAFQTPEAWNTDDQYRSLCYMRPLAIWAMQWALTRPKP 899
            D GFQTA G Y   W   GL  AFQTPEA+  + ++RSL YMRPLAIWA+Q A+ +  P
Sbjct: 737 IDKGFQTAYGAYHTCWERYGL--AFQTPEAYFDNRRFRSLGYMRPLAIWAIQHAVEKFHP 794

Query: 900 KTLE 903
           +  E
Sbjct: 795 QFFE 798


>gi|297741934|emb|CBI33369.3| unnamed protein product [Vitis vinifera]
          Length = 508

 Score =  471 bits (1211), Expect = e-129,   Method: Compositional matrix adjust.
 Identities = 227/452 (50%), Positives = 306/452 (67%), Gaps = 5/452 (1%)

Query: 27  DPAKPASLTWQRKLSTGEIPLSQFTLNWKETVQLAPIGVRILCLIREEAAKGKRAFIDPF 86
           D A P    W+R+L++    L +F++ + E +++  +G+R+   IREEA++G++A IDPF
Sbjct: 31  DSAAPPEQAWRRRLNSHANILKEFSVTFTEAIKMIRLGIRLWSYIREEASQGRKAPIDPF 90

Query: 87  IKRHL--TSSHGVPLGGVGSGSIGRSYRGEFQRWQIFPRVCEDKPVLANQFSVFVSRSNG 144
            +     ++S GVPLGG+GSGSI R +RGEF+ WQI P  C+  P++ANQFS+F+SR  G
Sbjct: 91  TRETCKPSASQGVPLGGMGSGSISRGFRGEFRHWQIVPGTCDASPIMANQFSIFISREGG 150

Query: 145 -QKYSSVLCPKTPEVLKDTTAAGIGSWDWNLKGDKSTYHALYPRAWTVHEGEPDPELRIV 203
            +KY+SVL P   E L  +   GI SW WNL G  STYHAL+PRAWT+++GEPDPEL++ 
Sbjct: 151 NKKYASVLAPGQHEGLGKSGDQGISSWGWNLSGQHSTYHALFPRAWTIYDGEPDPELKVS 210

Query: 204 CRQISPIIPHNYKESSYPVSVFTYTIYNSGKTSADITLLFTWTNSVGGDSEFTGQHYNSK 263
           CRQISP IPHNY++SS P +VF YT+ N+GK  A ++LLFTW NS+GG S  +G H N  
Sbjct: 211 CRQISPFIPHNYRDSSLPTAVFVYTLVNTGKERAKVSLLFTWANSIGGISHLSGDHVNEP 270

Query: 264 TKMNDGVHAVLLHHRTSHQLPPVTFALAAQETDGVHVSLCPHFVISGNSLGLTAKDMWHE 323
               DGV  VLLHH+T+ + PPVTFA+AA ET  V V++ P F +S  S  +TAKDMW +
Sbjct: 271 FIGEDGVSGVLLHHKTAKENPPVTFAIAACETQNVSVTVLPSFGLSEGS-HITAKDMWGK 329

Query: 324 IKEHGSFDRLNSME-TSVTSEPGSSIGAAIAASVTVPPDSEGQVTFSLAWDCPEVNFMSG 382
           + + G FDR N     S+ S PG ++ AA++AS  V P  +  V F+LAW  P+V F+ G
Sbjct: 330 MVQDGQFDRENCYSGRSMPSSPGETLCAAVSASAWVEPHGKCTVAFALAWSSPKVKFLKG 389

Query: 383 KTYYRRYTKFYGTHQNAAANIARDAILEHGSWELQIEAWQRPILEDKRLPEWYPITLFNE 442
            +Y+RRYTK+YGT + AA NI  DA+  +  WE +IE WQ PIL D RLPEWY  TLFNE
Sbjct: 390 SSYHRRYTKYYGTSERAALNIVHDALTNYKQWEEEIEKWQSPILRDDRLPEWYKFTLFNE 449

Query: 443 LYYLNAGGAVWTDGSPPVHSLVTIGHRKFSLD 474
           LY+L AGG VW D S P  S     H+  +++
Sbjct: 450 LYFLVAGGTVWIDSSLPATSSKNSLHQSAAVE 481


>gi|324500616|gb|ADY40283.1| Non-lysosomal glucosylceramidase [Ascaris suum]
          Length = 890

 Score =  470 bits (1210), Expect = e-129,   Method: Compositional matrix adjust.
 Identities = 323/886 (36%), Positives = 458/886 (51%), Gaps = 117/886 (13%)

Query: 44  EIPLSQFTLNWKETVQLAPIGVRILCLIREEAAKGKRAFIDPFIKRHLTSSHGVPLGGVG 103
            +P ++ T+  ++     P   R      +   K ++ FI+ F        +GVP GG+G
Sbjct: 30  RVPFNRPTI--RQVYDSLPFVRRYFIYWLKHVGKKEKLFINTFQPLKHKPHYGVPCGGIG 87

Query: 104 SGSIGRSYRGEFQRWQIFPRVCEDKP--VLANQFSVFVSRSNGQK----YSSVLC--PKT 155
            GSIGR +RG F ++ + P + E K   + A+QF V + R NG+     Y  VLC  P T
Sbjct: 88  CGSIGRDFRGGFCKFGLRPGLIEHKVDVLRADQFIVTLRRKNGEGFATVYQKVLCAAPTT 147

Query: 156 PEVLKDTTAAGIGSWDWNLKGDKSTYHALYPRAWTVHEGEPDPELRIVCRQISPIIPHNY 215
           P      T+  + +WD++   D  TY  LYPR+WT ++   +  + IV RQISP+IP +Y
Sbjct: 148 P-----ATSTELNAWDFSFPPDCLTYRGLYPRSWTRYDIR-EYRVSIVMRQISPVIPGDY 201

Query: 216 KESSYPVSVFTYTIYNSGKTSADITLLFTWTNSVG----------GDSEFTGQHYNSKTK 265
           ++SS PVSVF + + N G     +++ FT+ N  G            S F  +H +S+  
Sbjct: 202 RDSSLPVSVFVFDVENGGDEDLYVSITFTFRNGTGYHKWSQEGNCTSSLFESKHAHSR-- 259

Query: 266 MNDGVHAVLLHHRTSHQLPPVTFALAAQETDGVHVSLCPHFVISGNSLGLTAKDMWHEIK 325
               V  V L H  S      T+A+AA       VS C  F  S      +  ++W  + 
Sbjct: 260 ----VEGVCLEHTISRMR--CTYAVAANCNQRTTVSRCAAFDPSK-----SGDEIWQSLW 308

Query: 326 EHGSFDRLNSMETSVTSEPGSSIGAAIAASVTVPPDS--EGQVTFSLAWDCPEVNFMSGK 383
             G    +  +       P   +  A+ A   V  +S  E  + F+LAWD P V F + K
Sbjct: 309 STGHLQNVGEVT------PTRELAIAVCAQTIVKANSKAEAPMEFALAWDMPIVQFGARK 362

Query: 384 -TYYRRYTKFYGTHQNAAANIARDAILEHGSWELQIEAWQRPILEDKRLPEWYPITLFNE 442
             Y RRYT+F+G   +AA  +   ++  +  WE +I  WQ P+L  + LP+WY   LFNE
Sbjct: 363 RNYRRRYTRFFGDRGDAAPALCAHSLTVYEQWERRINDWQDPVLNHRNLPDWYKSALFNE 422

Query: 443 LYYLNAGGAVWTDGSPPVHSLVTIGHRKFSLDWSQSDLKRIVDVPNQNDTAVNILERMSS 502
           LY++  GG+VW +               F  DW+                          
Sbjct: 423 LYFMTDGGSVWFE---------------FEEDWACE------------------------ 443

Query: 503 ILEQIYTPVALNSSFGTNLLQDGEENIGQFLYLEGIEYLMWNTYDVHFYSSFALIMLFPK 562
             EQ  +P ++       L++D     G+F YLE  EY M NTYDVHFY+SFAL  L+P 
Sbjct: 444 --EQHLSPYSIA------LMKD----FGRFGYLESWEYRMVNTYDVHFYASFALAQLWPS 491

Query: 563 IQLSIQRDFAAAVMMHDPSKMKLLDDGQWVSRKVLGAVPHDIG--ICDPWFEVNAYCLYD 620
           ++  IQ +F   +   D   +    +G    +K    VPHD+G    DPW   NAY ++D
Sbjct: 492 LEHVIQAEFTDQIEHFDGKPVIYHMEGNRGPKKSRTRVPHDLGNPAVDPWLLTNAYVMHD 551

Query: 621 TARWKDLNPKFVLQVYRD--VVATGDKKFAKAVWPSVYVAMAY-MDQFDRDGDGMIENDG 677
           T +WKDLN KF+L  +RD   V   D++F   VWP+V   +   +  +D+DGDGMIEN G
Sbjct: 552 TGKWKDLNLKFILISFRDYTCVLGKDQRFLNLVWPAVKSLINEGLKNWDQDGDGMIENFG 611

Query: 678 FPDQTYDTWSVSGISAYSGGLWVAALQAASALAREVGDRGSEDYFLFKFQKAKVVY-EKL 736
             DQTYD W +SG+SAY G LW+AAL+AA  +AREVGD      +     +A+  Y  KL
Sbjct: 612 AADQTYDAWKMSGVSAYCGCLWLAALRAALVMAREVGDSACAHRYAETLSRAREAYITKL 671

Query: 737 WNGSYFNYDNSGSSQSSSIQADQLAGQWYARACG------LLPIVDEDKARSALKKVYNY 790
           WNG Y+N+D    S+ + I ADQL G W+ ++        LLP       +SAL  +YNY
Sbjct: 672 WNGEYYNFDERSRSRHT-IMADQLCGYWFLQSISPELADELLPAA---HVKSALNIIYNY 727

Query: 791 NVLKVMGGKRGAVNGMLPDGRVDMSSMQSREIWSGVTYAVAASMIHEDLADIGFQTACGI 850
           NVL+  GG+ GAVNGM  DG VD + +Q+ E+W+GVTYA+AA +I +     GF TA G 
Sbjct: 728 NVLRFAGGRLGAVNGMRRDGTVDRNYIQADEMWTGVTYALAAFLIQQGAVQRGFDTAWGC 787

Query: 851 YEAAWSGTGLGYAFQTPEAWNTDDQYRSLCYMRPLAIWAMQWALTR 896
           Y+  ++  GL Y  QTPEA      YR++ YMRPL+IWAMQWAL R
Sbjct: 788 YDGCFNRFGLQY--QTPEALYEQRFYRAIGYMRPLSIWAMQWALIR 831


>gi|392918721|ref|NP_504047.2| Protein R08F11.1 [Caenorhabditis elegans]
 gi|387910741|emb|CCD72306.2| Protein R08F11.1 [Caenorhabditis elegans]
          Length = 830

 Score =  464 bits (1194), Expect = e-127,   Method: Compositional matrix adjust.
 Identities = 295/869 (33%), Positives = 439/869 (50%), Gaps = 105/869 (12%)

Query: 55  KETVQLAPIGVRILCLIREEAAKGKRAFIDPF-IKRHLTSSHGVPLGGVGSGSIGRSYRG 113
           K +++L P+ VR+      E   G+ AFID F + RH T + GVPLGG+G GSIG  +RG
Sbjct: 36  KHSIKLIPLFVRVALHTFVEWWNGREAFIDIFNVFRHFTYT-GVPLGGIGCGSIGTDFRG 94

Query: 114 EFQRWQIFPRVCEDKPVL-ANQFSVFVSRSNGQK--YSSVL-CPKTPEVLKDTTAAGIGS 169
            F R+ I P + E       NQF V V      +  Y S+L C + P       A  +  
Sbjct: 95  GFNRFSIIPGIKEQTETQKCNQFIVTVHSKKTFELIYQSILSCAEFP-------ATVLPK 147

Query: 170 WDWNLKGDKSTYHALYPRAWTVHEGEPDPELRIVCRQISPIIPHNYKESSYPVSVFTYTI 229
           WD  +  +   Y  L+PRAW          + +V   +SP+IP +Y +SS P++ F + +
Sbjct: 148 WDTTIPAEDVRYRGLFPRAWQEFR-LGSSGVTVVVEHLSPVIPGDYSDSSLPLANFEFHV 206

Query: 230 YNSGKTSADITLLFTWTNSVGGDSEFTGQHYNSKTKMNDGVHAVLLHHRTSHQLPPVTFA 289
           +N      ++++  ++ N  G           S+    D +    L H       PVT+A
Sbjct: 207 FNDSYEEVEVSITMSFRNGTGNRKWNDENLCQSQKIQKDTMVVRTLAHTVKGM--PVTYA 264

Query: 290 LAAQETDGVHVSLCPHFVISGNSLGLTAKDMWHEIKEHGSFDRLNSMETSVTSEPGSSIG 349
           +  +E +G  V+ C   +   N    T   +W +++ +G     + +     S+P   +G
Sbjct: 265 IGTEEKNGSKVTTC---LFDPNG---TGGRLWSDLEAYGHLSSYDHL----PSKP-KELG 313

Query: 350 AAIAASVTVPPDSEGQVTFSLAWDCPEVNFMSGKTYY-RRYTKFYG--THQNAAANIARD 406
            A+ +S  VPPD      FSL W  P+V+F + + +Y RRY +F+         A I R 
Sbjct: 314 IAVCSSFFVPPDGAHNTQFSLTWYMPQVHFGTAERFYNRRYCRFFNGPDADEVTAAICRH 373

Query: 407 AILEHGSWELQIEAWQRPILEDKRLPEWYPITLFNELYYLNAGGAVWTDGSPPVHSLVTI 466
            +    +W+ +IE WQ P+L D++LPEWY   +FNELYY+  G  VW +  P        
Sbjct: 374 GLQNFSTWQKKIEDWQAPVLNDQKLPEWYKSAIFNELYYIVDGSTVWFEYDP-------- 425

Query: 467 GHRKFSLDWSQSDLKRIVDVPNQNDTAVNILERMSSILEQIYTPVALNSSFGTNLLQDGE 526
                  DW   +                    MS   E+ +                  
Sbjct: 426 -------DWKTEE------------------SLMSEQTEKQF------------------ 442

Query: 527 ENIGQFLYLEGIEYLMWNTYDVHFYSSFALIMLFPKIQLSIQRDFAAAVMMHDPSKMKLL 586
           +  G+F Y+E  EY M NTYDVHFYSS+A++  +P+I++S+Q DFA  +   D +    L
Sbjct: 443 KEYGRFGYMESWEYFMINTYDVHFYSSWAILKNWPQIEMSMQLDFADQMDRVDNNIATSL 502

Query: 587 DDGQWVSRKVLGAVPHDIG-------------ICDPWFEVNAYCLYDTARWKDLNPKFVL 633
            DG+ ++ K +  +PHD+G             + DPW   NAY L+DT RWKDLN KFV+
Sbjct: 503 ADGEQMTIKSVDRIPHDMGHPTKGCIQEKRLSVADPWIHTNAYILHDTGRWKDLNLKFVI 562

Query: 634 QVYRD--VVATGDKK---FAKAVWPSVYVAMAYMDQFDRDGDGMIENDGFPDQTYDTWSV 688
             YRD  ++  G +K       +     +    ++ +D+D DGMIENDGF DQTYD W +
Sbjct: 563 SCYRDWKLIELGSEKGQVLEFFLGKCTKIVDGALECWDKDNDGMIENDGFADQTYDVWKM 622

Query: 689 SGISAYSGGLWVAALQAASALAREVGDRGSEDYFLFKFQKAKVVY-EKLWNGSYFNYDNS 747
           +G SAY G LW+AAL +   + ++ G      ++  K + A   Y  KLWNG++F +D  
Sbjct: 623 TGTSAYCGSLWIAALSSYIEMLKQSG--LPTKHYEEKLEMAYDAYIGKLWNGTFFKFDEL 680

Query: 748 GSSQSSSIQADQLAGQWYARACGLLPIVDEDKARSALKKVYNYNVLKVMGGKRGAVNGML 807
               S  + ADQL G W   A      + +DK +SAL  ++ YNV     G+ GAVNG L
Sbjct: 681 -PENSKIVMADQLCGFWAMTAMDEPVQISKDKMKSALDTIFKYNVQMYNNGRCGAVNGYL 739

Query: 808 PDGRVDMSSMQSREIWSGVTYAVAASMIHEDLADIGFQTACGIYEAAWSGTGLGYAFQTP 867
              RVD SS+QS E+W+G+TYA++A MI + + +  F+T+ G++E+ W    L Y  QTP
Sbjct: 740 TSERVDGSSIQSEEVWAGITYALSAMMIEKGMDEQAFKTSEGLFESIWHRFPLQY--QTP 797

Query: 868 EAWNTDDQYRSLCYMRPLAIWAMQWALTR 896
           EA  +D  YR+L YMRPL+IWA+Q AL R
Sbjct: 798 EAITSDGMYRALGYMRPLSIWAIQHALDR 826


>gi|71992715|ref|NP_001021680.1| Protein HPO-13, isoform a [Caenorhabditis elegans]
 gi|18376555|emb|CAD21661.1| Protein HPO-13, isoform a [Caenorhabditis elegans]
          Length = 959

 Score =  462 bits (1190), Expect = e-127,   Method: Compositional matrix adjust.
 Identities = 292/849 (34%), Positives = 447/849 (52%), Gaps = 104/849 (12%)

Query: 79  KRAFIDPFIKRHLTSSHGVPLGGVGSGSIGRSYRGEFQRWQIFPRVCEDKP--VLANQFS 136
           ++ FI+ F        +GVP GG+G G+IGR +RG F ++ + P + E K   V A+QF 
Sbjct: 119 EKLFINTFQPLKHKPYYGVPCGGIGCGAIGRDFRGGFCKFSLRPGLVEHKVDVVAADQFI 178

Query: 137 VFVSRSNGQKYSSVLCPKTPEVLKDTTAAGIGSWDWNLKGDKSTYHALYPRAWTVHEGEP 196
           + V  ++   Y  VL     +      +  + +WD+        Y  L+PR+WT     P
Sbjct: 179 LSVRENDRCIYQKVLSAADVQ----RPSGQLSTWDFKFPKKNVHYRGLFPRSWTTFRV-P 233

Query: 197 DPELRIVCRQISPIIPHNYKESSYPVSVFTYTIYNSGKTSAD--ITLLFTWTNSVGG--- 251
           + EL +V RQ+SP++PHNY++++YPV +F   + N G  S +  I++ FT+ N  G    
Sbjct: 234 ELELTVVIRQVSPVLPHNYEDTTYPVCLFLIDVENGGSASREYEISVAFTFRNGTGNRRW 293

Query: 252 --DSEFTGQHYNSK-------TKMNDGVHAVLLHHRTSHQLPPVTFALAAQETDGVHVSL 302
             ++E +G  + ++       T   + +  V L+H  S    P T+ LA    +    ++
Sbjct: 294 EREAECSGVKFETEEEGLKPTTTAQEKISGVSLYHTISSM--PCTYGLATTHREHTTTTV 351

Query: 303 CPHFVISGNSLGLTAKDMWHEIKEHGSFDRLNSMETSVTSEPGSSIGAAIAASVTVPPDS 362
           C  F  S N   L     W+ +K+ G  D  +  E  + +     +  A+    T+PP+S
Sbjct: 352 CERFDPSRNGAAL-----WNHLKQTG--DVPSCEEECLIN--AREMAVAVCNRFTLPPNS 402

Query: 363 EGQVTFSLAWDCPEVNFMS-GKTYYRRYTKFYGTHQ--NAAANIARDAILEHGSWELQIE 419
                ++L+WD P+V+F S  ++Y+RRYT+F+   +  +AA ++   A+     WE +IE
Sbjct: 403 TKTHDYALSWDMPKVHFGSVARSYHRRYTRFFEASEAGSAADSLCVRALRLKDKWESEIE 462

Query: 420 AWQRPILEDKRLPEWYPITLFNELYYLNAGGAVWTDGSPPVHSLVTIGHRKFSLDWSQSD 479
           AWQ PIL  K+LP+WY   +FNELY++  GG VW +               F  +W++ +
Sbjct: 463 AWQSPILNHKKLPDWYKSAIFNELYFITDGGTVWFE---------------FDENWAEHE 507

Query: 480 LKRIVDVPNQNDTAVNILERMSSILEQIYTPVALNSSFGTNLLQDGEENIGQFLYLEGIE 539
                                               +  ++  +D  + IG+F YLE  E
Sbjct: 508 ------------------------------------THLSHYTKDKMKEIGRFGYLESWE 531

Query: 540 YLMWNTYDVHFYSSFALIMLFPKIQLSIQRDFAAAVMMHDPSKMKLLDDGQWVSRKVLGA 599
           Y M NTYDVHFY+SFAL  L+P+++++IQ +F   V        +   +G W + K    
Sbjct: 532 YRMVNTYDVHFYASFALAELWPELEITIQSEFTDQVYHSIEKPTRFHMEGDWANVKTASR 591

Query: 600 VPHDIG--ICDPWFEVNAYCLYDTARWKDLNPKFVLQVYRD--VVATGDKKFAKAVWPSV 655
           VPHD+G    +PW   NAY ++DT +WKDLN K+VL  +RD  V++   + F    WP+V
Sbjct: 592 VPHDLGNPADEPWIATNAYVMHDTGKWKDLNMKYVLTSWRDYVVLSNEHEDFLFHTWPAV 651

Query: 656 YVAM-AYMDQFDRDGDGMIENDGFPDQTYDTWSVSGISAYSGGLWVAALQAASALAREVG 714
            + M   ++ +D+DGDGMIEN G  DQTYD W + G+SAY G LW+A+L+ A  +A  + 
Sbjct: 652 RMIMLEALENWDQDGDGMIENFGKADQTYDAWQMEGVSAYCGSLWLASLRVAIEMAGLMK 711

Query: 715 DRGSEDYFLFKFQKAKVVY-EKLWNGSYFNYDNSGSSQSSSIQADQLAGQWYARACG--- 770
           D  ++  F    +KAK V+ + LW G+YF +     S+ + + ADQL G W+ ++     
Sbjct: 712 DDETQQLFRNTLEKAKKVFIDTLWTGTYFRFCERSRSRET-VMADQLCGYWFLQSVSPEL 770

Query: 771 ---LLPIVDEDKARSALKKVYNYNVLKVMGGKRGAVNGMLPDGRVDMSSMQSREIWSGVT 827
              LLP       RSAL  +Y  NV K   G+ GAVNGM P G VD   +Q+ E+W+GVT
Sbjct: 771 ADELLP---NHMVRSALDTIYRLNVCKFGNGRMGAVNGMKPSGVVDREYIQADEMWTGVT 827

Query: 828 YAVAASMIHEDLADIGFQTACGIYEAAWSGTGLGYAFQTPEAWNTDDQYRSLCYMRPLAI 887
           YAVA+ +I +   +  F TA G Y   +  TGL Y  QTPEA      YR++ YMRPL+I
Sbjct: 828 YAVASLLIQQGEVEKAFHTASGSYLTCFEETGLQY--QTPEALYESKFYRAIGYMRPLSI 885

Query: 888 WAMQWALTR 896
           WAMQW+L +
Sbjct: 886 WAMQWSLKK 894


>gi|71992723|ref|NP_001021681.1| Protein HPO-13, isoform b [Caenorhabditis elegans]
 gi|50511739|emb|CAH04764.1| Protein HPO-13, isoform b [Caenorhabditis elegans]
          Length = 930

 Score =  462 bits (1189), Expect = e-127,   Method: Compositional matrix adjust.
 Identities = 292/849 (34%), Positives = 447/849 (52%), Gaps = 104/849 (12%)

Query: 79  KRAFIDPFIKRHLTSSHGVPLGGVGSGSIGRSYRGEFQRWQIFPRVCEDKP--VLANQFS 136
           ++ FI+ F        +GVP GG+G G+IGR +RG F ++ + P + E K   V A+QF 
Sbjct: 90  EKLFINTFQPLKHKPYYGVPCGGIGCGAIGRDFRGGFCKFSLRPGLVEHKVDVVAADQFI 149

Query: 137 VFVSRSNGQKYSSVLCPKTPEVLKDTTAAGIGSWDWNLKGDKSTYHALYPRAWTVHEGEP 196
           + V  ++   Y  VL     +      +  + +WD+        Y  L+PR+WT     P
Sbjct: 150 LSVRENDRCIYQKVLSAADVQ----RPSGQLSTWDFKFPKKNVHYRGLFPRSWTTFRV-P 204

Query: 197 DPELRIVCRQISPIIPHNYKESSYPVSVFTYTIYNSGKTSAD--ITLLFTWTNSVGG--- 251
           + EL +V RQ+SP++PHNY++++YPV +F   + N G  S +  I++ FT+ N  G    
Sbjct: 205 ELELTVVIRQVSPVLPHNYEDTTYPVCLFLIDVENGGSASREYEISVAFTFRNGTGNRRW 264

Query: 252 --DSEFTGQHYNSK-------TKMNDGVHAVLLHHRTSHQLPPVTFALAAQETDGVHVSL 302
             ++E +G  + ++       T   + +  V L+H  S    P T+ LA    +    ++
Sbjct: 265 EREAECSGVKFETEEEGLKPTTTAQEKISGVSLYHTISSM--PCTYGLATTHREHTTTTV 322

Query: 303 CPHFVISGNSLGLTAKDMWHEIKEHGSFDRLNSMETSVTSEPGSSIGAAIAASVTVPPDS 362
           C  F  S N   L     W+ +K+ G  D  +  E  + +     +  A+    T+PP+S
Sbjct: 323 CERFDPSRNGAAL-----WNHLKQTG--DVPSCEEECLIN--AREMAVAVCNRFTLPPNS 373

Query: 363 EGQVTFSLAWDCPEVNFMS-GKTYYRRYTKFYGTHQ--NAAANIARDAILEHGSWELQIE 419
                ++L+WD P+V+F S  ++Y+RRYT+F+   +  +AA ++   A+     WE +IE
Sbjct: 374 TKTHDYALSWDMPKVHFGSVARSYHRRYTRFFEASEAGSAADSLCVRALRLKDKWESEIE 433

Query: 420 AWQRPILEDKRLPEWYPITLFNELYYLNAGGAVWTDGSPPVHSLVTIGHRKFSLDWSQSD 479
           AWQ PIL  K+LP+WY   +FNELY++  GG VW +               F  +W++ +
Sbjct: 434 AWQSPILNHKKLPDWYKSAIFNELYFITDGGTVWFE---------------FDENWAEHE 478

Query: 480 LKRIVDVPNQNDTAVNILERMSSILEQIYTPVALNSSFGTNLLQDGEENIGQFLYLEGIE 539
                                               +  ++  +D  + IG+F YLE  E
Sbjct: 479 ------------------------------------THLSHYTKDKMKEIGRFGYLESWE 502

Query: 540 YLMWNTYDVHFYSSFALIMLFPKIQLSIQRDFAAAVMMHDPSKMKLLDDGQWVSRKVLGA 599
           Y M NTYDVHFY+SFAL  L+P+++++IQ +F   V        +   +G W + K    
Sbjct: 503 YRMVNTYDVHFYASFALAELWPELEITIQSEFTDQVYHSIEKPTRFHMEGDWANVKTASR 562

Query: 600 VPHDIG--ICDPWFEVNAYCLYDTARWKDLNPKFVLQVYRD--VVATGDKKFAKAVWPSV 655
           VPHD+G    +PW   NAY ++DT +WKDLN K+VL  +RD  V++   + F    WP+V
Sbjct: 563 VPHDLGNPADEPWIATNAYVMHDTGKWKDLNMKYVLTSWRDYVVLSNEHEDFLFHTWPAV 622

Query: 656 YVAM-AYMDQFDRDGDGMIENDGFPDQTYDTWSVSGISAYSGGLWVAALQAASALAREVG 714
            + M   ++ +D+DGDGMIEN G  DQTYD W + G+SAY G LW+A+L+ A  +A  + 
Sbjct: 623 RMIMLEALENWDQDGDGMIENFGKADQTYDAWQMEGVSAYCGSLWLASLRVAIEMAGLMK 682

Query: 715 DRGSEDYFLFKFQKAKVVY-EKLWNGSYFNYDNSGSSQSSSIQADQLAGQWYARACG--- 770
           D  ++  F    +KAK V+ + LW G+YF +     S+ + + ADQL G W+ ++     
Sbjct: 683 DDETQQLFRNTLEKAKKVFIDTLWTGTYFRFCERSRSRET-VMADQLCGYWFLQSVSPEL 741

Query: 771 ---LLPIVDEDKARSALKKVYNYNVLKVMGGKRGAVNGMLPDGRVDMSSMQSREIWSGVT 827
              LLP       RSAL  +Y  NV K   G+ GAVNGM P G VD   +Q+ E+W+GVT
Sbjct: 742 ADELLP---NHMVRSALDTIYRLNVCKFGNGRMGAVNGMKPSGVVDREYIQADEMWTGVT 798

Query: 828 YAVAASMIHEDLADIGFQTACGIYEAAWSGTGLGYAFQTPEAWNTDDQYRSLCYMRPLAI 887
           YAVA+ +I +   +  F TA G Y   +  TGL Y  QTPEA      YR++ YMRPL+I
Sbjct: 799 YAVASLLIQQGEVEKAFHTASGSYLTCFEETGLQY--QTPEALYESKFYRAIGYMRPLSI 856

Query: 888 WAMQWALTR 896
           WAMQW+L +
Sbjct: 857 WAMQWSLKK 865


>gi|71992731|ref|NP_001021682.1| Protein HPO-13, isoform c [Caenorhabditis elegans]
 gi|15130776|emb|CAC48128.1| Protein HPO-13, isoform c [Caenorhabditis elegans]
          Length = 922

 Score =  462 bits (1189), Expect = e-127,   Method: Compositional matrix adjust.
 Identities = 292/849 (34%), Positives = 447/849 (52%), Gaps = 104/849 (12%)

Query: 79  KRAFIDPFIKRHLTSSHGVPLGGVGSGSIGRSYRGEFQRWQIFPRVCEDKP--VLANQFS 136
           ++ FI+ F        +GVP GG+G G+IGR +RG F ++ + P + E K   V A+QF 
Sbjct: 82  EKLFINTFQPLKHKPYYGVPCGGIGCGAIGRDFRGGFCKFSLRPGLVEHKVDVVAADQFI 141

Query: 137 VFVSRSNGQKYSSVLCPKTPEVLKDTTAAGIGSWDWNLKGDKSTYHALYPRAWTVHEGEP 196
           + V  ++   Y  VL     +      +  + +WD+        Y  L+PR+WT     P
Sbjct: 142 LSVRENDRCIYQKVLSAADVQ----RPSGQLSTWDFKFPKKNVHYRGLFPRSWTTFRV-P 196

Query: 197 DPELRIVCRQISPIIPHNYKESSYPVSVFTYTIYNSGKTSAD--ITLLFTWTNSVGG--- 251
           + EL +V RQ+SP++PHNY++++YPV +F   + N G  S +  I++ FT+ N  G    
Sbjct: 197 ELELTVVIRQVSPVLPHNYEDTTYPVCLFLIDVENGGSASREYEISVAFTFRNGTGNRRW 256

Query: 252 --DSEFTGQHYNSK-------TKMNDGVHAVLLHHRTSHQLPPVTFALAAQETDGVHVSL 302
             ++E +G  + ++       T   + +  V L+H  S    P T+ LA    +    ++
Sbjct: 257 EREAECSGVKFETEEEGLKPTTTAQEKISGVSLYHTISSM--PCTYGLATTHREHTTTTV 314

Query: 303 CPHFVISGNSLGLTAKDMWHEIKEHGSFDRLNSMETSVTSEPGSSIGAAIAASVTVPPDS 362
           C  F  S N   L     W+ +K+ G  D  +  E  + +     +  A+    T+PP+S
Sbjct: 315 CERFDPSRNGAAL-----WNHLKQTG--DVPSCEEECLIN--AREMAVAVCNRFTLPPNS 365

Query: 363 EGQVTFSLAWDCPEVNFMS-GKTYYRRYTKFYGTHQ--NAAANIARDAILEHGSWELQIE 419
                ++L+WD P+V+F S  ++Y+RRYT+F+   +  +AA ++   A+     WE +IE
Sbjct: 366 TKTHDYALSWDMPKVHFGSVARSYHRRYTRFFEASEAGSAADSLCVRALRLKDKWESEIE 425

Query: 420 AWQRPILEDKRLPEWYPITLFNELYYLNAGGAVWTDGSPPVHSLVTIGHRKFSLDWSQSD 479
           AWQ PIL  K+LP+WY   +FNELY++  GG VW +               F  +W++ +
Sbjct: 426 AWQSPILNHKKLPDWYKSAIFNELYFITDGGTVWFE---------------FDENWAEHE 470

Query: 480 LKRIVDVPNQNDTAVNILERMSSILEQIYTPVALNSSFGTNLLQDGEENIGQFLYLEGIE 539
                                               +  ++  +D  + IG+F YLE  E
Sbjct: 471 ------------------------------------THLSHYTKDKMKEIGRFGYLESWE 494

Query: 540 YLMWNTYDVHFYSSFALIMLFPKIQLSIQRDFAAAVMMHDPSKMKLLDDGQWVSRKVLGA 599
           Y M NTYDVHFY+SFAL  L+P+++++IQ +F   V        +   +G W + K    
Sbjct: 495 YRMVNTYDVHFYASFALAELWPELEITIQSEFTDQVYHSIEKPTRFHMEGDWANVKTASR 554

Query: 600 VPHDIG--ICDPWFEVNAYCLYDTARWKDLNPKFVLQVYRD--VVATGDKKFAKAVWPSV 655
           VPHD+G    +PW   NAY ++DT +WKDLN K+VL  +RD  V++   + F    WP+V
Sbjct: 555 VPHDLGNPADEPWIATNAYVMHDTGKWKDLNMKYVLTSWRDYVVLSNEHEDFLFHTWPAV 614

Query: 656 YVAM-AYMDQFDRDGDGMIENDGFPDQTYDTWSVSGISAYSGGLWVAALQAASALAREVG 714
            + M   ++ +D+DGDGMIEN G  DQTYD W + G+SAY G LW+A+L+ A  +A  + 
Sbjct: 615 RMIMLEALENWDQDGDGMIENFGKADQTYDAWQMEGVSAYCGSLWLASLRVAIEMAGLMK 674

Query: 715 DRGSEDYFLFKFQKAKVVY-EKLWNGSYFNYDNSGSSQSSSIQADQLAGQWYARACG--- 770
           D  ++  F    +KAK V+ + LW G+YF +     S+ + + ADQL G W+ ++     
Sbjct: 675 DDETQQLFRNTLEKAKKVFIDTLWTGTYFRFCERSRSRET-VMADQLCGYWFLQSVSPEL 733

Query: 771 ---LLPIVDEDKARSALKKVYNYNVLKVMGGKRGAVNGMLPDGRVDMSSMQSREIWSGVT 827
              LLP       RSAL  +Y  NV K   G+ GAVNGM P G VD   +Q+ E+W+GVT
Sbjct: 734 ADELLP---NHMVRSALDTIYRLNVCKFGNGRMGAVNGMKPSGVVDREYIQADEMWTGVT 790

Query: 828 YAVAASMIHEDLADIGFQTACGIYEAAWSGTGLGYAFQTPEAWNTDDQYRSLCYMRPLAI 887
           YAVA+ +I +   +  F TA G Y   +  TGL Y  QTPEA      YR++ YMRPL+I
Sbjct: 791 YAVASLLIQQGEVEKAFHTASGSYLTCFEETGLQY--QTPEALYESKFYRAIGYMRPLSI 848

Query: 888 WAMQWALTR 896
           WAMQW+L +
Sbjct: 849 WAMQWSLKK 857


>gi|341901617|gb|EGT57552.1| hypothetical protein CAEBREN_15139 [Caenorhabditis brenneri]
          Length = 925

 Score =  462 bits (1188), Expect = e-127,   Method: Compositional matrix adjust.
 Identities = 288/841 (34%), Positives = 439/841 (52%), Gaps = 98/841 (11%)

Query: 79  KRAFIDPFIKRHLTSSHGVPLGGVGSGSIGRSYRGEFQRWQIFPRVCEDKP--VLANQFS 136
           ++ FI+ F        +GVP GG+G G+IGR +RG F ++ + P + E K   V A+QF 
Sbjct: 81  EKLFINTFQPLKHKPYYGVPCGGIGCGAIGRDFRGGFCKFSLRPGLVEHKVDVVAADQFI 140

Query: 137 VFVSRSNGQKYSSVLCPKTPEVLKDTTAAGIGSWDWNLKGDKSTYHALYPRAWTVHEGEP 196
           + V  ++   Y  VL         D   + + SWD+        Y  L+PR+WT     P
Sbjct: 141 LSVRENDTCIYQKVLSAA------DVHRSQLSSWDFGFPKKHLHYRGLFPRSWTTFRV-P 193

Query: 197 DPELRIVCRQISPIIPHNYKESSYPVSVFTYTIYNSGKTSA-DITLLFTWTNSVGG---- 251
           +  L +  RQ+SP++PHNY+E++YPV +F   + N+    A D+++ FT+ N  G     
Sbjct: 194 ELGLTVTIRQVSPVLPHNYQETTYPVCLFLVDVENTSNNRAYDVSIAFTFRNGTGNRRWD 253

Query: 252 -DSEFTGQHYNSKTKMNDGVHAVLLHHRTSHQLPPVTFALAAQETDGVHVSLCPHFVISG 310
            +SE +   ++S     + V  V L+H  S    P T+ LA+      H+S+C  F  S 
Sbjct: 254 RESECSAGRFDSIHSDRESVSGVSLYHTISSM--PCTYGLASTHKKDSHLSICERFDPSR 311

Query: 311 NSLGLTAKDMWHEIKEHGSFDRLNSMETSVTSEPGSSIGAAIAASVTVPPDSEGQVTFSL 370
           +         W+ +K+ G              +    +  ++     +PP S     ++L
Sbjct: 312 SGAAF-----WNHLKQTGDVPEFEEE----CPQNAREMAVSVCNRFHLPPQSSKTHDYAL 362

Query: 371 AWDCPEVNFMS-GKTYYRRYTKFY-GTHQNAAAN-IARDAILEHGSWELQIEAWQRPILE 427
           +WD P+V+F S  ++Y+RRYT+F+ G+   + A+ +   A+L    WE +IE WQ PIL 
Sbjct: 363 SWDMPKVHFGSVARSYHRRYTRFFEGSEAGSVADSLCVRALLARNQWEEEIEKWQSPILT 422

Query: 428 DKRLPEWYPITLFNELYYLNAGGAVWTDGSPPVHSLVTIGHRKFSLDWSQSDLKRIVDVP 487
             +LP+WY   +FNELY++  GG VW +               F  +W+           
Sbjct: 423 HPKLPDWYKSAIFNELYFMVDGGTVWFE---------------FDENWAD---------- 457

Query: 488 NQNDTAVNILERMSSILEQIYTPVALNSSFGTNLLQDGEENIGQFLYLEGIEYLMWNTYD 547
           ++   +V   E+M                          +NIG+F YLE  EY M NTYD
Sbjct: 458 HEAHLSVYTKEKM--------------------------KNIGRFGYLESWEYRMVNTYD 491

Query: 548 VHFYSSFALIMLFPKIQLSIQRDFAAAVMMHDPSKMKLLDDGQWVSRKVLGAVPHDIG-- 605
           VHFY+S+AL  L+P+++L++Q +F   V        +   +G W + K    VPHD+G  
Sbjct: 492 VHFYASYALAELWPQLELTVQSEFTDQVYHSIEKPTRFHMEGDWANVKTASRVPHDLGNP 551

Query: 606 ICDPWFEVNAYCLYDTARWKDLNPKFVLQVYRD--VVATGDKKFAKAVWPSVYVAM-AYM 662
             +PW   NAY ++DT +WKDLN K+VL  +RD  V++   ++F    WP++ + M   +
Sbjct: 552 ADEPWIATNAYVMHDTGKWKDLNMKYVLTSWRDYVVLSEAHQEFLHHTWPAIKMIMLEAL 611

Query: 663 DQFDRDGDGMIENDGFPDQTYDTWSVSGISAYSGGLWVAALQAASALAREVGDRGSEDYF 722
           + +D++GDGMIEN G  DQTYD W + G+SAY G LW+A+L+ +  +A  V D  +E+ F
Sbjct: 612 ENWDQNGDGMIENFGKADQTYDAWQMEGVSAYCGSLWLASLRVSVEMAAIVKDEITENLF 671

Query: 723 LFKFQKAKVVY-EKLWNGSYFNYDNSGSSQSSSIQADQLAGQWYARACG------LLPIV 775
               +KAK V+ + LW G+YF +     S+  ++ ADQL G W+ ++        LLP  
Sbjct: 672 KGTLEKAKKVFIDLLWTGTYFRFCERSRSR-ETVMADQLCGYWFLQSVSPEMADDLLP-- 728

Query: 776 DEDKARSALKKVYNYNVLKVMGGKRGAVNGMLPDGRVDMSSMQSREIWSGVTYAVAASMI 835
                RSAL  +Y  NV K   G+ GAVNGM P G VD   +Q+ E+W+GVTYAVA+ +I
Sbjct: 729 -NHMVRSALDTIYRLNVCKFGNGQMGAVNGMKPSGVVDREYIQADEMWTGVTYAVASLLI 787

Query: 836 HEDLADIGFQTACGIYEAAWSGTGLGYAFQTPEAWNTDDQYRSLCYMRPLAIWAMQWALT 895
            +   +  F TA G Y   +   GL Y  QTPEA      YR++ YMRPL+IWAMQW+L 
Sbjct: 788 QQGEVEKAFHTASGSYLTCFEQAGLQY--QTPEALYESKFYRAIGYMRPLSIWAMQWSLK 845

Query: 896 R 896
           +
Sbjct: 846 K 846


>gi|169642399|gb|AAI60640.1| LOC100145361 protein [Xenopus (Silurana) tropicalis]
          Length = 571

 Score =  456 bits (1172), Expect = e-125,   Method: Compositional matrix adjust.
 Identities = 254/611 (41%), Positives = 343/611 (56%), Gaps = 65/611 (10%)

Query: 293 QETDGVHVSLCPHFVISGNSLGLTAKDMWHEIKEHGSFDRLNSME-TSVTSEPGSSIGAA 351
           +E  GV VS C  F  +G       +++W ++ E G   RLNS    S ++  G    AA
Sbjct: 1   RERPGVCVSHCTQFDPTG-----LGQEVWKDLLEDG---RLNSAPGPSPSTAKGKKTAAA 52

Query: 352 IAASVTVPPDSEGQVTFSLAWDCPEVNFMSG-KTYYRRYTKFYGTHQNAAANIARDAILE 410
           +AA  +VPP S   + F L+WD P + F +G + ++RRYT+F+G   +AA  +    +  
Sbjct: 53  VAAGCSVPPRSRETLEFCLSWDMPNIRFGAGQREHHRRYTRFFGCQGDAAPALCHYGLTH 112

Query: 411 HGSWELQIEAWQRPILEDKRLPEWYPITLFNELYYLNAGGAVWTDGSPPVHSLVTIGHRK 470
              WE +IEAWQ PIL++  LP WY   LFNELY++  GG VW +               
Sbjct: 113 FVGWEQKIEAWQEPILQNGDLPAWYKSALFNELYFMADGGTVWVE--------------- 157

Query: 471 FSLDWSQSDLKRIVDVPNQNDTAVNILERMSSILEQIYTPVALNSSFGTNLLQDGEENIG 530
                        V   + +D   N++ R    L                 ++D  +  G
Sbjct: 158 -------------VSAADDDD---NLMRRGCKDLPG---------------MKDVLQEYG 186

Query: 531 QFLYLEGIEYLMWNTYDVHFYSSFALIMLFPKIQLSIQRDFAAAVMMHDPSKMKLLDDGQ 590
           +F YLEG EY M+NTYDVHFY+SFALIML+P++++S+Q D AA+V+  DP   K L  G 
Sbjct: 187 RFAYLEGQEYRMYNTYDVHFYASFALIMLWPQLEISLQYDMAASVLQEDPELRKYLMSGT 246

Query: 591 WVSRKVLGAVPHDIGICD--PWFEVNAYCLYDTARWKDLNPKFVLQVYRDVVATGDKKFA 648
               K    VPHD+G  D  PW ++NAY ++DTA WKDLN KFVLQVYRD   T D  + 
Sbjct: 247 VAPVKSRSVVPHDVGDPDDEPWQKLNAYLIHDTANWKDLNLKFVLQVYRDYHITKDTTYL 306

Query: 649 KAVWPSVYVAMAYMDQFDRDGDGMIENDGFPDQTYDTWSVSGISAYSGGLWVAALQAASA 708
           + +W      M    +FD DGDG+IEN GF DQTYD W ++G S+Y GGLW+AA+     
Sbjct: 307 RDMWSVCQTVMETSLKFDEDGDGLIENSGFADQTYDDWVMTGPSSYCGGLWLAAVCMMCK 366

Query: 709 LAREVGDRGSEDYFLFKFQKAKVVYEK-LWNGSYFNYDNSGSSQSSSIQADQLAGQWYAR 767
           +A  +GD  +++ F     + K  +EK LWNG Y+N+D      S+S+  DQ AG W+ R
Sbjct: 367 MAEVLGDGAAQEKFTDILHRGKAAFEKLLWNGKYYNFDCGDQPYSNSVMVDQCAGNWFLR 426

Query: 768 ACGL----LPIVDEDKARSALKKVYNYNVLKVMGGKRGAVNGMLPDGRVDMSSMQSREIW 823
           ACGL      +  +D   SAL+ ++  NV +   G+ GAVNGM PDG  D SS+QS E+W
Sbjct: 427 ACGLGAGESEVFPKDHVVSALRTIFELNVKQFADGQMGAVNGMRPDGTTDTSSVQSDEVW 486

Query: 824 SGVTYAVAASMIHEDLADIGFQTACGIYEAAWSGTGLGYAFQTPEAWNTDDQYRSLCYMR 883
            GV Y +AA+MIHE L   GF TA G Y   W    LG +FQTPEA+     +RSL YMR
Sbjct: 487 IGVVYGLAATMIHEGLVQEGFATAEGCYRTVWE--RLGMSFQTPEAYCEKQVFRSLAYMR 544

Query: 884 PLAIWAMQWAL 894
           PL+IWAMQ AL
Sbjct: 545 PLSIWAMQLAL 555


>gi|308485740|ref|XP_003105068.1| hypothetical protein CRE_20734 [Caenorhabditis remanei]
 gi|308257013|gb|EFP00966.1| hypothetical protein CRE_20734 [Caenorhabditis remanei]
          Length = 933

 Score =  452 bits (1163), Expect = e-124,   Method: Compositional matrix adjust.
 Identities = 285/861 (33%), Positives = 447/861 (51%), Gaps = 114/861 (13%)

Query: 79  KRAFIDPFIKRHLTSSHGVPLGGVGSGSIGRSYRGEFQRWQIFPRVCEDKP--VLANQFS 136
           ++ FI+ F        +GVP GG+G G+IGR +RG F ++ + P + E K   V A+QF 
Sbjct: 81  EKLFINTFQPLKHKPYYGVPCGGIGCGAIGRDFRGGFCKFSLRPGLVEHKVDVVAADQFI 140

Query: 137 VFVSRSNGQKYSSVLCPKTPEVLKDTTAAG--IGSWDWNLKGDKSTYHALYPRAWTVHEG 194
           + V  ++   Y  VL         D   +G  + +WD+        Y  L+PR+WT    
Sbjct: 141 ISVRENDTCIYQKVLSAA------DVHRSGNQLSTWDFTFPKKNVNYRGLFPRSWTTFRI 194

Query: 195 EPDPELRIVCRQISPIIPHNYKESSYPVSVFTYTIYNSGKTSA-DITLLFTWTNSVGG-- 251
            P+  + +  RQ+SP++PHNY++++YPV +F   + N+  +   ++++ FT+ N  G   
Sbjct: 195 -PELGITVTIRQVSPVLPHNYEDTTYPVCLFVIDVENTTTSKEYEVSIAFTFRNGTGNRR 253

Query: 252 ---DSEFTGQHYN----SKTKMNDG---VHAVLLHHRTSHQLPPVTFALAAQETDGVHVS 301
              +SE +   +     S ++ N G   V  + L+H  S    P T+ LA    +G  +S
Sbjct: 254 WERESECSAVKFETPGPSDSESNPGQEKVTGISLYHTISSM--PCTYGLATTHKNGSILS 311

Query: 302 LCPHFVISGNSLGLTAKDMWHEIKEHGS-----------FDRLNSMETSVTSEPGSSIGA 350
           +C  F  + N         W+ +++ G            F ++NS +    +     +  
Sbjct: 312 VCERFDPARNGSAF-----WNHLQQTGDVPSCVRILHVFFHKINSFQEEECAVNAREMAV 366

Query: 351 AIAASVTVPPDSEGQVTFSLAWDCPEVNFMS-GKTYYRRYTKFYGTHQ--NAAANIARDA 407
           A+     + P ++    ++L+WD P+V+F S  ++Y+RRYT+F+   +   AA ++   A
Sbjct: 367 AVCNRFRLSPGTKKTHDYALSWDMPKVHFGSVARSYHRRYTRFFDAAEAGTAADSLCVRA 426

Query: 408 ILEHGSWELQIEAWQRPILEDKRLPEWYPITLFNELYYLNAGGAVWTDGSPPVHSLVTIG 467
           +     WE +IE WQ PIL  ++LP+WY   +FNELY++  GG VW +            
Sbjct: 427 LHLKNKWEEEIEKWQLPILNHEKLPDWYKSAIFNELYFITDGGTVWFE------------ 474

Query: 468 HRKFSLDWSQSDLKRIVDVPNQNDTAVNILERMSSILEQIYTPVALNSSFGTNLLQDGEE 527
              F   W++ ++            +    E+M SI                        
Sbjct: 475 ---FDEQWAEHEMH----------LSEYTKEKMKSI------------------------ 497

Query: 528 NIGQFLYLEGIEYLMWNTYDVHFYSSFALIMLFPKIQLSIQRDFAAAVMMHDPSKMKLLD 587
             G+F YLE  EY M NTYDVHFY+S+AL  L+P+++L++Q +F   V        +   
Sbjct: 498 --GRFGYLESWEYRMVNTYDVHFYASYALAELWPQLELTVQSEFTDQVYHSIEKPTRFHM 555

Query: 588 DGQWVSRKVLGAVPHDIG--ICDPWFEVNAYCLYDTARWKDLNPKFVLQVYRDVVATGD- 644
           +G W + K    VPHD+G    +PW   NAY ++DT +WKDLN K+VL  +RD V   + 
Sbjct: 556 EGDWANVKTASRVPHDLGNPADEPWIATNAYVMHDTGKWKDLNMKYVLTSWRDYVVLSEE 615

Query: 645 -KKFAKAVWPSVYVAM-AYMDQFDRDGDGMIENDGFPDQTYDTWSVSGISAYSGGLWVAA 702
            ++F    WP+V + M   ++ +D++GDGMIEN G  DQTYD W + G+SAY G LW+A+
Sbjct: 616 HQEFLFHTWPAVKMIMLEALENWDQNGDGMIENFGKADQTYDAWQMEGVSAYCGSLWLAS 675

Query: 703 LQAASALAREVGDRGSEDYFLFKFQKAKVVY-EKLWNGSYFNYDNSGSSQSSSIQADQLA 761
           L+ +  +AR +G+  +E+ F     KAK V+ + LW G+YF +     S+  ++ ADQL 
Sbjct: 676 LRVSIEMARLMGEEATENLFRVTLNKAKKVFIDTLWTGTYFRFCERSRSR-ETVMADQLC 734

Query: 762 GQWYARACG------LLPIVDEDKARSALKKVYNYNVLKVMGGKRGAVNGMLPDGRVDMS 815
           G W+ ++        LLP       RSAL  +Y  NV +   G+ GAVNGM P+G VD  
Sbjct: 735 GYWFLQSVSPEMVDDLLP---NHMVRSALDTIYRLNVCRFGNGQMGAVNGMKPNGVVDRE 791

Query: 816 SMQSREIWSGVTYAVAASMIHEDLADIGFQTACGIYEAAWSGTGLGYAFQTPEAWNTDDQ 875
            +Q+ E+W+GVTYAVA+ +I +   +  F TA G Y   +  TGL Y  QTPEA      
Sbjct: 792 YIQADEMWTGVTYAVASLLIQQGEVEKAFHTASGSYLTCFEQTGLQY--QTPEALYESKF 849

Query: 876 YRSLCYMRPLAIWAMQWALTR 896
           YR++ YMRPL+IWAMQW+L +
Sbjct: 850 YRAIGYMRPLSIWAMQWSLKK 870


>gi|268569852|ref|XP_002640631.1| Hypothetical protein CBG08749 [Caenorhabditis briggsae]
          Length = 899

 Score =  449 bits (1155), Expect = e-123,   Method: Compositional matrix adjust.
 Identities = 289/858 (33%), Positives = 450/858 (52%), Gaps = 96/858 (11%)

Query: 79  KRAFIDPFIKRHLTSSHGVPLGGVGSGSIGRSYRGEFQRWQIFPRVCEDKP--VLANQFS 136
           ++ FI+ F        +GVP GG+G G+IGR +RG F ++ + P + E K   V A+QF 
Sbjct: 75  EKLFINTFQPLKHKPYYGVPCGGIGCGAIGRDFRGGFCKFSLRPGLVEHKVDVVAADQFI 134

Query: 137 VFVSRSNGQKYSSVLCPKTPEVLKDTTAAGIGSWDWNLKGDKSTYHALYPRAWTVHEGEP 196
           + V  ++   Y  VL     +V +  ++  + +WD+        Y  L+PR+WT     P
Sbjct: 135 LSVRENDRCIYQKVL--SAADVHR--SSGQLSTWDFGFPKKNVHYRGLFPRSWTTFRI-P 189

Query: 197 DPELRIVCRQISPIIPHNYKESSYPVSVFTYTIYNS-GKTSADITLLFTWTNSVGG---- 251
           D  L +V RQ+SP++P+NY++++YPV +F   + N+      D+++ FT+ N  G     
Sbjct: 190 DLGLTVVIRQVSPVLPNNYEDTTYPVCLFLIDVENTTNDREYDVSVAFTFRNGTGNRRWE 249

Query: 252 -DSEFTGQHYNSKTKMNDGVHAVLLHHRTSHQLPPVTFALAAQETDGVHVSLCPHFVISG 310
            ++E +   + +     + V  V L+HR S      T+ LA+  +    +S+C  F  S 
Sbjct: 250 REAECSAVMFETPEDSAEQVTGVSLYHRISGM--SCTYGLASTHSKDSVLSVCERFDPSK 307

Query: 311 NSLGLTAKDMWHEIKEHGSFDRLNSMETSVTSEPGSSIGAAIAASVTVPPDSEGQVTFSL 370
           +   L     W+ + + G    L   E ++ +     +  A++   T+PP S     ++L
Sbjct: 308 SGGAL-----WNHLMQTGDVPSLEE-ECAINAR---EMAVAVSNRFTLPPLSSKTHDYAL 358

Query: 371 AWDCPEVNFMS-GKTYYRRYTKFYGTHQ--NAAANIARDAILEHGSWELQIEAWQRPILE 427
           +WD P+V+F S G++Y+RRYT+F+   +  +AA ++   A+     WE +IE WQ PIL 
Sbjct: 359 SWDMPKVHFGSTGRSYHRRYTRFFEASEAGSAADSLCVRALRLRHKWEEEIEKWQSPILT 418

Query: 428 DKRLPEWYPITLFNELYYLNAGGAVWTDGSPPVHSLVTIGHRKFSLDWSQSDLKRIVDVP 487
             +LP+WY   +FNELY++  GG VW +               F  +W            
Sbjct: 419 HDKLPDWYKSAIFNELYFITDGGTVWFE---------------FDENW------------ 451

Query: 488 NQNDTAVNILERMSSILEQIYTPVALNSSFGTNLLQDGEENIGQFLYLEGIEYLMWNTYD 547
                                   A + +  +N  ++  +NIG+F YLE  EY M NTYD
Sbjct: 452 ------------------------ADHEAHLSNYTKEKMKNIGRFGYLESWEYRMVNTYD 487

Query: 548 VHFYSSFALIMLFPKIQLSIQRDFAAAVMMHDPSKMKLLDDGQWVSRKVLGAVPHDIG-- 605
           VHFY+S+AL  L+P+++L++Q +F   V        +   +G W + K    VPHD+G  
Sbjct: 488 VHFYASYALAELWPELELTVQSEFTDQVYHSIEKPTRFHMEGDWANVKTASRVPHDLGNP 547

Query: 606 ICDPWFEVNAYCLYDTARWKDLNPKFVLQVYRDVVATGD--KKFAKAVWPSVYVAM-AYM 662
             +PW   NAY ++DT +WKDLN K+VL  +RD V   +  ++F    WP+V + M   +
Sbjct: 548 ADEPWIATNAYVMHDTGKWKDLNMKYVLTSWRDYVVLSEEHQEFLHHTWPAVKMIMLEAL 607

Query: 663 DQFDRDGDGMIENDGFPDQTYDTWSVSGISAYSGGLWVAALQAASALAREVGDRGSEDYF 722
             +D++GDGMIEN G  DQTYD W + G+SAY G LW+A+L+ +  +A  + D  ++  F
Sbjct: 608 ANWDQNGDGMIENFGKADQTYDAWQMEGVSAYCGSLWLASLRVSIEMAGLLEDGETKKLF 667

Query: 723 LFKFQKAKVVY-EKLWNGSYFNYDNSGSSQSSSIQADQLAGQWYARACG------LLPIV 775
           L    KAK V+ + LW G+YF +     S+  ++ ADQL G W+ ++        LLP  
Sbjct: 668 LETLNKAKKVFVDVLWTGTYFRFCERSRSR-ETVMADQLCGYWFLQSVSPEMADDLLP-- 724

Query: 776 DEDKARSALKKVYNYNVLKVMGGKRGAVNGMLPDGRVDMSSMQSREIWSGVTYAVAASMI 835
                RSAL  +Y  NV +   G+ GAVNGM P G VD   +Q+ E+W+GVTYAVA+ +I
Sbjct: 725 -SHMVRSALDTIYRLNVCQFGNGQMGAVNGMKPSGVVDREYIQADEMWTGVTYAVASLLI 783

Query: 836 HEDLADIGFQTACGIYEAAWSGTGLGYAFQTPEAWNTDDQYRSLCYMRPLAIWAMQWALT 895
            +   +  F TA G Y   +  TGL Y  QTPEA      YR++ YMRPL+IWAMQW+L 
Sbjct: 784 QQGEVEKAFHTASGSYLTCFEQTGLQY--QTPEALYESKFYRAIGYMRPLSIWAMQWSLK 841

Query: 896 RPKPKTLEKQMKPEVTDE 913
           +       K+  P++  E
Sbjct: 842 KHCGLNTSKERLPKLPSE 859


>gi|341888271|gb|EGT44206.1| hypothetical protein CAEBREN_07747 [Caenorhabditis brenneri]
          Length = 820

 Score =  449 bits (1155), Expect = e-123,   Method: Compositional matrix adjust.
 Identities = 295/863 (34%), Positives = 437/863 (50%), Gaps = 95/863 (11%)

Query: 55  KETVQLAPIGVRILCLIREEAAKGKRAFIDPF-IKRHLTSSHGVPLGGVGSGSIGRSYRG 113
           K +++L P+ VR+      E   G+ AFID F + RH T + GVPLGG+GSGSIG  +RG
Sbjct: 36  KHSIKLIPLFVRVALHTFVEWWNGREAFIDIFNVFRHFTFT-GVPLGGIGSGSIGTDFRG 94

Query: 114 EFQRWQIFPRVCEDKPVL-ANQF--SVFVSRSNGQKYSSVL-CPKTPEVLKDTTAAGIGS 169
            F R+ I P + E       NQF  SV   ++    Y S+L C + P  +       +  
Sbjct: 95  GFNRFSIIPGIKEQTDTQKCNQFIASVHSKKTFELIYQSILSCAEFPSTV-------LPK 147

Query: 170 WDWNLKGDKSTYHALYPRAWTVHEGEPDPELRIVCRQISPIIPHNYKESSYPVSVFTYTI 229
           WD  +  +   Y  L+PRAW          + +V   +SP+IP++Y +SS P++ F + +
Sbjct: 148 WDSTIPAEDVRYRGLFPRAWQEFR-LGSSGITVVIEHLSPVIPNDYSDSSLPLANFDFHV 206

Query: 230 YNSGKTSADITLLFTWTNSVGGDSEFTGQHYNSKTKMNDGVHAVLLHHRTSHQLPPVTFA 289
           +N      ++++   + N  G + ++  +H  +  K+      V     T   +P VT+A
Sbjct: 207 FNDSPEEVEVSITMAFRNGTG-NRKWNDEHLCTSQKVQKETQVVRTLSHTVKGMP-VTYA 264

Query: 290 LAAQETDGVHVSLCPHFVISGNSLGLTAKDMWHEIKEHGSFDRLNSMETSVTSEPGSSIG 349
           +  +E +G  V+ C   +   N  G     +W +++ +G     +++           +G
Sbjct: 265 IGTEEKNGAKVTTC---LFDPNGTG---GRLWSDLEAYGHLSSYDNLPPK-----PKELG 313

Query: 350 AAIAASVTVPPDSEGQVTFSLAWDCPEVNFMSGKTYY-RRYTKFYG--THQNAAANIARD 406
            A+ +S  VPPD      FSL+W  PEV+F + + +Y RRY +F+        AA I R 
Sbjct: 314 IAVCSSFFVPPDGSHTTQFSLSWYMPEVHFGTAERFYNRRYCRFFNGPDADEVAAAICRH 373

Query: 407 AILEHGSWELQIEAWQRPILEDKRLPEWYPITLFNELYYLNAGGAVWTDGSPPVHSLVTI 466
            +     W+  IE WQ PI+ +++LPEWY   +FNELYY+  G  VW +  P        
Sbjct: 374 GLQNIEKWQEAIEKWQNPIITNQKLPEWYRSAIFNELYYIVDGSTVWFEYDP-------- 425

Query: 467 GHRKFSLDWSQSDLKRIVDVPNQNDTAVNILERMSSILEQIYTPVALNSSFGTNLLQDGE 526
                  DW                T  N++   +    + Y                  
Sbjct: 426 -------DWR---------------TEENLISEQTEKQFREY------------------ 445

Query: 527 ENIGQFLYLEGIEYLMWNTYDVHFYSSFALIMLFPKIQLSIQRDFAAAVMMHDPSKMKLL 586
              G+F Y+E  EY M NTYDVHFYSS+A++  +P+I++S+Q DFA  V   D      L
Sbjct: 446 ---GRFGYMESWEYFMLNTYDVHFYSSWAILKNWPEIEMSMQLDFADQVDRIDNGTATSL 502

Query: 587 DDGQWVSRKVLGAVPHDIG--ICDPWFEVNAYCLYDTARWKDLNPKFVLQVYRDVVATGD 644
            DG  +  K    +PHD+G  + DPW   NAY L+DT  WKDLN KFV+  YRD      
Sbjct: 503 ADGDQMLIKSYDRIPHDMGHPMADPWLHTNAYILHDTGSWKDLNLKFVISCYRDYKMIAK 562

Query: 645 KKFAKAVWPSVYVAMAY------MDQFDRDGDGMIENDGFPDQTYDTWSVSGISAYSGGL 698
            +  K      ++          ++ +D+D DGMIENDGF DQTYD W ++G SAY G L
Sbjct: 563 DREDKDQLLEFFLGKCTKIVEDALESWDKDKDGMIENDGFADQTYDVWKMTGTSAYCGSL 622

Query: 699 WVAALQAASALAREVGDRGSEDYFLFKFQKAKVVYE-KLWNGSYFNYDNSGSSQSSSIQA 757
           W+AAL     + ++ G      ++  K   A   Y  KLWNG +F +D      S  + A
Sbjct: 623 WLAALTCYIEMLKQAG--SPSKHYEGKLLNAYDAYTTKLWNGRFFKFDEL-PDNSKIVMA 679

Query: 758 DQLAGQWYARACGLLPIVDEDKARSALKKVYNYNVLKVMGGKRGAVNGMLPDGRVDMSSM 817
           DQL G W   A      V E+K + AL+ ++ YNV     GK GAVNG L   RVD SS+
Sbjct: 680 DQLCGFWALTAMDEPVQVSEEKMKLALETIFKYNVQMYNDGKCGAVNGFLTSERVDGSSI 739

Query: 818 QSREIWSGVTYAVAASMIHEDLADIGFQTACGIYEAAWSGTGLGYAFQTPEAWNTDDQYR 877
           QS E+W+G+TYA++A MI +++ ++ F+T+ G++E+ W+   L Y  QTPEA  +D  YR
Sbjct: 740 QSEEVWAGITYALSAMMIEKNMDEMAFKTSEGLFESIWNRFPLQY--QTPEAITSDGMYR 797

Query: 878 SLCYMRPLAIWAMQWALTRPKPK 900
           +L YMRPL+IWA+Q AL R   K
Sbjct: 798 ALGYMRPLSIWAIQHALDRRHKK 820


>gi|224028315|gb|ACN33233.1| unknown [Zea mays]
          Length = 384

 Score =  448 bits (1153), Expect = e-123,   Method: Compositional matrix adjust.
 Identities = 224/343 (65%), Positives = 270/343 (78%), Gaps = 1/343 (0%)

Query: 523 QDGEENIGQFLYLEGIEYLMWNTYDVHFYSSFALIMLFPKIQLSIQRDFAAAVMMHDPSK 582
           +DG E++G+FLYLEG+EY+MWNTYDVHFY+SFAL+ LFPKI+LSIQRDFA AV+  D  K
Sbjct: 20  KDGPEHVGKFLYLEGVEYIMWNTYDVHFYASFALLDLFPKIELSIQRDFANAVLYEDRRK 79

Query: 583 MKLLDDGQWVSRKVLGAVPHDIGICDPWFEVNAYCLYDTARWKDLNPKFVLQVYRDVVAT 642
           +K L DG    RKV GAVPHD+G  DPW E+NAY ++DT++WKDLNPKFVLQ+YRD  AT
Sbjct: 80  VKFLADGTSGIRKVKGAVPHDLGTHDPWHEMNAYNIHDTSKWKDLNPKFVLQIYRDFAAT 139

Query: 643 GDKKFAKAVWPSVYVAMAYMDQFDRDGDGMIENDGFPDQTYDTWSVSGISAYSGGLWVAA 702
           GD +F + VWP+V  AM YMDQFDRDGDG+IENDGFPDQTYD W+V GISAY G LW+AA
Sbjct: 140 GDMQFGRDVWPAVCAAMDYMDQFDRDGDGLIENDGFPDQTYDAWTVHGISAYCGCLWLAA 199

Query: 703 LQAASALAREVGDRGSEDYFLFKFQKAKVVYE-KLWNGSYFNYDNSGSSQSSSIQADQLA 761
           LQAA+ +A  +GDR   + +  KF KAK VYE KLWNGSYFNYD+  SS S SIQADQLA
Sbjct: 200 LQAAATMAHRLGDRHFAEKYKLKFIKAKAVYEAKLWNGSYFNYDSGTSSNSRSIQADQLA 259

Query: 762 GQWYARACGLLPIVDEDKARSALKKVYNYNVLKVMGGKRGAVNGMLPDGRVDMSSMQSRE 821
           GQWYA + GL P+ DE K R+AL+K++ +NV+KV GG+ GAVNGM P G+VD + MQSRE
Sbjct: 260 GQWYAASSGLPPLFDEHKIRTALQKIFEFNVMKVKGGRMGAVNGMTPKGKVDETCMQSRE 319

Query: 822 IWSGVTYAVAASMIHEDLADIGFQTACGIYEAAWSGTGLGYAF 864
           IW+GVTYAVAA+M+   +   GF TA GI+ A WS  G GYA 
Sbjct: 320 IWTGVTYAVAANMLLHGMEHQGFTTAEGIFTAGWSEEGYGYAL 362


>gi|308471410|ref|XP_003097936.1| hypothetical protein CRE_12977 [Caenorhabditis remanei]
 gi|308239241|gb|EFO83193.1| hypothetical protein CRE_12977 [Caenorhabditis remanei]
          Length = 845

 Score =  446 bits (1147), Expect = e-122,   Method: Compositional matrix adjust.
 Identities = 292/874 (33%), Positives = 444/874 (50%), Gaps = 92/874 (10%)

Query: 55  KETVQLAPIGVRILCLIREEAAKGKRAFIDPF-IKRHLTSSHGVPLGGVGSGSIGRSYRG 113
           K +++L P+ VR+      E   G+ AFID F + RH T + GVPLGG+GSGSIG  +RG
Sbjct: 36  KHSIKLIPLFVRVALHTFVEWWNGREAFIDIFNVFRHFTYT-GVPLGGIGSGSIGTDFRG 94

Query: 114 EFQRWQIFPRVCEDKPVL-ANQF--SVFVSRSNGQKYSSVL-CPKTPEVLKDTTAAGIGS 169
            F R+ I P + E       NQF  SV   ++    Y S+L C + P       A  +  
Sbjct: 95  GFNRFSIIPGIKEQTETQKCNQFIASVHSKKTFELIYQSILSCAEFP-------ATVLPK 147

Query: 170 WDWNLKGDKSTYHALYPRAWTVHEGEPDPELRIVCRQISPIIPHNYKESSYPVSVFTYTI 229
           WD  +  +   Y  L+PR W          + +V   +S +IP +Y +SS P++ F + +
Sbjct: 148 WDTTIPAEDVRYRGLFPRGWQEFR-LGSSGITVVIESLSSVIPGDYSDSSLPLANFDFHV 206

Query: 230 YNSGKTSADITLLFTWTNSVGGDSEFTGQHYNSKTKMNDGVHAVLLHHRTSHQLPPVTFA 289
           +N      ++++   + N  G + ++  +H     K+      V     T   +P VT+A
Sbjct: 207 FNDSPEEVEVSITLAFRNGTG-NRKWNDEHLCQSQKVQRDTMVVRTLSHTVKGMP-VTYA 264

Query: 290 LAAQETDGVHVSLCPHFVISGNSLGLTAKDMWHEIKEHGSFDRLNSMETSVTSEPGSSIG 349
           +  +E +G  V+ C   +   N  G     +W +++ +G     + +           +G
Sbjct: 265 IGTEEKNGAKVTTC---LFDPNGTG---GRLWSDLEAYGHLSSYDHLPPK-----PKELG 313

Query: 350 AAIAASVTVPPDSEGQVTFSLAWDCPEVNFMSGKTYY-RRYTKFYG--THQNAAANIARD 406
            A+ +S  VPPD      FSL+W  P+V+F + + +Y RRY +F+        AA+I R 
Sbjct: 314 IAVCSSFFVPPDGSHTTQFSLSWYMPQVHFGTAERFYNRRYCRFFNGPDADEVAASICRH 373

Query: 407 AILEHGSWELQIEAWQRPILEDKRLPEWYPITLFNELYYLNAGGAVWTDGSPPVHSLVTI 466
            +     W+  IE WQ P++ D++LP+WY   +FNELYY+  G  VW +  P        
Sbjct: 374 GLQNISKWQEAIEKWQTPVINDEKLPDWYRSAIFNELYYIVDGSTVWFEYDP-------- 425

Query: 467 GHRKFSLDWSQSDLKRIVDVPNQNDTAVNILERMSSILEQIYTPVALNSSFGTNLLQDGE 526
                  +W   +      +    +       R   +  + ++ V L +           
Sbjct: 426 -------EWKTDESL----ISEHTEKHFKEYGRFGYMECEFFSLVELRT----------- 463

Query: 527 ENIGQFLYLEGIEYLMWNTYDVHFYSSFALIMLFPKIQLSIQRDFAAAVMMHDPSKMKLL 586
                F +L   EY M NTYDVHFYSS+A++  +P+I++S+Q DFA  V   D      L
Sbjct: 464 ----LFFFLTAWEYFMLNTYDVHFYSSWAILKNWPQIEMSMQLDFADQVDRIDNGTATSL 519

Query: 587 DDGQWVSRKVLGAVPHDIG-------------ICDPWFEVNAYCLYDTARWKDLNPKFVL 633
            DG+ +S K    +PHD+G             + DPW   NAY L+DT  WKDLN KFV+
Sbjct: 520 ADGEQMSIKSYDRIPHDMGHPTKYCIQEHRLSVADPWLHTNAYILHDTGCWKDLNLKFVI 579

Query: 634 QVYRD--VVATGDKKFAKA-VWPSVYVAMAYMDQFDRDGDGMIENDGFPDQTYDTWSVSG 690
             YRD  +++ GD++  +  +     +    ++ +D+D DGMIENDGF DQTYD W ++G
Sbjct: 580 SCYRDWKMISNGDQEILEFFIGKCTKIVDGALESWDKDQDGMIENDGFADQTYDVWKMTG 639

Query: 691 ISAYSGGLWVAALQAASALAREVGDRGSEDYFLFKFQKAKVVYE----KLWNGSYFNYDN 746
            SAY G LW+AAL     + ++ G         F  +K  + YE    KLWNG +F +D 
Sbjct: 640 TSAYCGSLWLAALTCYIQMLKKAGSPTK-----FYDEKLLMAYEAYTTKLWNGKFFKFDE 694

Query: 747 SGSSQSSSIQADQLAGQWYARACGLLPIVDEDKARSALKKVYNYNVLKVMGGKRGAVNGM 806
                S  + ADQL G W   A      V E+K +SAL+ ++ YNV    GGK GAVNG 
Sbjct: 695 L-PDNSKIVMADQLCGFWALTAMDEPVQVSEEKMKSALETIFKYNVQMYDGGKCGAVNGY 753

Query: 807 LPDGRVDMSSMQSREIWSGVTYAVAASMIHEDLADIGFQTACGIYEAAWSGTGLGYAFQT 866
           L   RVD SS+QS E+W+G+TYA++A MI + + ++ F+T+ G++++ W+   L   FQT
Sbjct: 754 LTSERVDGSSIQSEEVWAGITYALSAMMIEKGMDEMAFKTSEGLFDSIWNRFPL--QFQT 811

Query: 867 PEAWNTDDQYRSLCYMRPLAIWAMQWALTRPKPK 900
           PEA  +D  YR+L YMRPL+IWA+Q AL +   K
Sbjct: 812 PEAITSDGMYRALGYMRPLSIWAIQHALEKRNRK 845


>gi|341899054|gb|EGT54989.1| hypothetical protein CAEBREN_31176 [Caenorhabditis brenneri]
          Length = 831

 Score =  445 bits (1145), Expect = e-122,   Method: Compositional matrix adjust.
 Identities = 294/873 (33%), Positives = 436/873 (49%), Gaps = 104/873 (11%)

Query: 55  KETVQLAPIGVRILCLIREEAAKGKRAFIDPF-IKRHLTSSHGVPLGGVGSGSIGRSYRG 113
           K +++L P+ VR+      E   G+ AFID F + RH T + GVPLGG+GSGSIG  +RG
Sbjct: 36  KHSIKLIPLFVRVALHTFVEWWNGREAFIDIFNVFRHFTFT-GVPLGGIGSGSIGTDFRG 94

Query: 114 EFQRWQIFPRVCEDKPVL-ANQF--SVFVSRSNGQKYSSVL-CPKTPEVLKDTTAAGIGS 169
            F R+ I P + E       NQF  SV   ++    Y S+L C + P  +       +  
Sbjct: 95  GFNRFSIIPGIKEQTDTQKCNQFIASVHSKKTFELIYQSILSCAEFPSTV-------LPK 147

Query: 170 WDWNLKGDKSTYHALYPRAWTVHEGEPDPELRIVCRQISPIIPHNYKESSYPVSVFTYTI 229
           WD  +  +   Y  L+PRAW          + +V   +SP+IP++Y +SS P++ F + +
Sbjct: 148 WDSTIPAEDVRYRGLFPRAWQEFR-LGSSGITVVIEHLSPVIPNDYSDSSLPLANFDFHV 206

Query: 230 YNSGKTSADITLLFTWTNSVGGDSEFTGQHYNSKTKMNDGVHAVLLHHRTSHQLPPVTFA 289
           +N      ++++   + N  G + ++  +H  +  K+      V     T   +P VT+A
Sbjct: 207 FNDSPEEVEVSITMAFRNGTG-NRKWNDEHLCTSQKVQKETQVVRTLSHTVKGMP-VTYA 264

Query: 290 LAAQETDGVHVSLCPHFVISGNSLGLTAKDMWHEIKEHGSFDRLNSMETSVTSEPGSSIG 349
           +  +E +G  V+ C   +   N  G     +W +++ +G     +++           +G
Sbjct: 265 IGTEEKNGAKVTTC---LFDPNGTG---GRLWSDLEAYGHLSSYDNLPPK-----PKELG 313

Query: 350 AAIAASVTVPPDSEGQVTFSLAWDCPEVNFMSGKTYY-RRYTKFYG--THQNAAANIARD 406
            A+ +S  VPPD      FSL+W  PEV+F + + +Y RRY +F+        AA I R 
Sbjct: 314 IAVCSSFFVPPDGSHTTQFSLSWYMPEVHFGTAERFYNRRYCRFFNGPDADEVAAAICRH 373

Query: 407 AILEHGSWELQIEAWQRPILEDKRLPEWYPITLFNELYYLNAGGAVWTDGSPPVHSLVTI 466
            +     W+  IE WQ PI+ +++LPEWY   +FNELYY+  G  VW +  P        
Sbjct: 374 GLQNIEKWQEAIEKWQNPIINNQKLPEWYRSAIFNELYYIVDGSTVWFEYDP-------- 425

Query: 467 GHRKFSLDWSQSDLKRIVDVPNQNDTAVNILERMSSILEQIYTPVALNSSFGTNLLQDGE 526
                  DW                T  N++   +    + Y                  
Sbjct: 426 -------DWR---------------TEENLISEQTEKQFREY------------------ 445

Query: 527 ENIGQFLYLEGIEYLMWNTYDVHFYSSFALIMLFPKIQLSIQRDFAAAVMMHDPSKMKLL 586
              G+F Y+E  EY M NTYDVHFYSS+A++  +P+I++S+Q DFA  V   D      L
Sbjct: 446 ---GRFGYMESWEYFMLNTYDVHFYSSWAILKNWPEIEMSMQLDFADQVDRIDNGTATSL 502

Query: 587 DDGQWVSRKVLGAVPHDIG-------------ICDPWFEVNAYCLYDTARWKDLNPKFVL 633
            DG  +  K    +PHD+G             + DPW   NAY L+DT  WKDLN KFV+
Sbjct: 503 ADGDQMLIKSYDRIPHDMGHPTKYCIQEHRLSVADPWLHTNAYILHDTGSWKDLNLKFVI 562

Query: 634 QVYRDVVATGDKKFAKAVWPSVYVAMAY------MDQFDRDGDGMIENDGFPDQTYDTWS 687
             YRD       +  K      ++          ++ +D+D DGMIENDGF DQTYD W 
Sbjct: 563 SCYRDYKMIAKDREDKDQLLEFFLGKCTKIVEDALESWDKDKDGMIENDGFADQTYDVWK 622

Query: 688 VSGISAYSGGLWVAALQAASALAREVGDRGSEDYFLFKFQKAKVVYEKLWNGSYFNYDNS 747
           ++G SAY G LW+AAL     + ++ G   S++Y             KLWNG +F +D  
Sbjct: 623 MTGTSAYCGSLWLAALTCYIEMLKQAGS-PSKNYEGKLLNAYDAYITKLWNGRFFKFDEL 681

Query: 748 GSSQSSSIQADQLAGQWYARACGLLPIVDEDKARSALKKVYNYNVLKVMGGKRGAVNGML 807
               S  + ADQL G W   A      V E+K + AL+ ++ YNV     GK GAVNG L
Sbjct: 682 -PDNSKIVMADQLCGFWALTAMDEPVQVSEEKMKLALETIFKYNVQMYNDGKCGAVNGFL 740

Query: 808 PDGRVDMSSMQSREIWSGVTYAVAASMIHEDLADIGFQTACGIYEAAWSGTGLGYAFQTP 867
              RVD SS+QS E+W+G+TYA++A MI +++ ++ F+T+ G++E+ W+   L Y  QTP
Sbjct: 741 TSERVDGSSIQSEEVWAGITYALSAMMIEKNMDEMAFKTSEGLFESIWNRFPLQY--QTP 798

Query: 868 EAWNTDDQYRSLCYMRPLAIWAMQWALTRPKPK 900
           EA  +D  YR+L YMRPL+IWA+Q AL R   K
Sbjct: 799 EAITSDGMYRALGYMRPLSIWAIQHALDRRHKK 831


>gi|268555576|ref|XP_002635777.1| Hypothetical protein CBG10433 [Caenorhabditis briggsae]
          Length = 821

 Score =  444 bits (1143), Expect = e-122,   Method: Compositional matrix adjust.
 Identities = 293/864 (33%), Positives = 435/864 (50%), Gaps = 97/864 (11%)

Query: 55  KETVQLAPIGVRILCLIREEAAKGKRAFIDPF-IKRHLTSSHGVPLGGVGSGSIGRSYRG 113
           K +++L P+ VR+      E   G+ AFID F + RH T + GVPLGG+GSGSIG  +RG
Sbjct: 36  KHSIKLIPLFVRVALHTFVEWWNGREAFIDIFNVFRHFTYT-GVPLGGIGSGSIGTDFRG 94

Query: 114 EFQRWQIFPRVCEDKPVL-ANQF--SVFVSRSNGQKYSSVL-CPKTPEVLKDTTAAGIGS 169
            F R+ I P + E       NQF  SV   ++    Y S+L C + P       A  +  
Sbjct: 95  GFNRFSIIPGIKEQTETQKCNQFIASVHSKKTFELIYQSILSCAEFP-------ATVLPK 147

Query: 170 WDWNLKGDKSTYHALYPRAWTVHEGEPDPELRIVCRQISPIIPHNYKESSYPVSVFTYTI 229
           WD  +  +   Y  L+PRAW          + +V   +SP+IP +Y +SS P++ F + +
Sbjct: 148 WDSTIPAEDVRYRGLFPRAWQEFRLGTSG-ITVVVEHLSPVIPEDYSDSSLPLANFDFHV 206

Query: 230 YNSGKTSADITLLFTWTNSVGGDSEFTGQHYNSKTKMNDGVHAVLLHHRTSHQLPPVTFA 289
           +N      ++++  ++ N  G           S     D +    L H       PVT+A
Sbjct: 207 FNDSPEEVEVSITMSFRNGTGNKKWNDESLCQSHKVQKDTMVVRTLSHTVKGM--PVTYA 264

Query: 290 LAAQETDGVHVSLCPHFVISGNSLGLTAKDMWHEIKEHGSFDRLNSMETSVTSEPGSSIG 349
           +  +E +G  V+ C   +   N  G     +W +++ +G     + +           +G
Sbjct: 265 IGTEEKNGAKVTTC---LFDPNGTG---GRLWSDLEAYGHLSSYDHLPPK-----PKELG 313

Query: 350 AAIAASVTVPPDSEGQVTFSLAWDCPEVNFMSGKTYY-RRYTKFYG--THQNAAANIARD 406
            A+ +S  VPPD      FSLAW  P+V+F + + +Y RRY +F+         A+I R 
Sbjct: 314 IAVCSSFFVPPDGSHTTQFSLAWYMPQVHFGTAERFYNRRYCRFFNGPDADEVGASICRH 373

Query: 407 AILEHGSWELQIEAWQRPILEDKRLPEWYPITLFNELYYLNAGGAVWTDGSPPVHSLVTI 466
            +    +W+  I  WQ PI+ D++LPEWY   +FNELYY+  G  VW +  P        
Sbjct: 374 GLQNIETWQEAIRKWQDPIINDEKLPEWYRSAIFNELYYIVDGSTVWFEYDP-------- 425

Query: 467 GHRKFSLDWSQSDLKRIVDVPNQNDTAVNILERMSSILEQIYTPVALNSSFGTNLLQDGE 526
                  +W   +              + I E      +Q                    
Sbjct: 426 -------EWRTDE--------------ILISEETEKQFKQY------------------- 445

Query: 527 ENIGQFLYLEGIEYLMWNTYDVHFYSSFALIMLFPKIQLSIQRDFAAAVMMHDPSKMKLL 586
              G+F Y+E  EY M NTYDVHFYSS+A++  +P+I++S+Q DFA  V   D      L
Sbjct: 446 ---GRFGYMESWEYFMINTYDVHFYSSWAILKNWPQIEMSMQMDFADQVDRVDYGTATSL 502

Query: 587 DDGQWVSRKVLGAVPHDIG--ICDPWFEVNAYCLYDTARWKDLNPKFVLQVYRD--VVAT 642
            DG  ++ K    +PHD+G  + DPW   NAY L+DT  WKDLN KFV+  YRD  ++A 
Sbjct: 503 ADGDTMTIKSYDRIPHDMGHPMADPWIHTNAYILHDTGHWKDLNLKFVISCYRDWKMIAE 562

Query: 643 GDKKFAKAV----WPSVYVAMAYMDQFDRDGDGMIENDGFPDQTYDTWSVSGISAYSGGL 698
           G++   + +         +    ++ +D+D DGMIENDGF DQTYD W ++G SAY G L
Sbjct: 563 GEEDSQEILEFFLGKCTKIVDGALENWDKDKDGMIENDGFADQTYDVWKMTGTSAYCGSL 622

Query: 699 WVAALQAASALAREVG--DRGSEDYFLFKFQKAKVVYEKLWNGSYFNYDNSGSSQSSSIQ 756
           W+AAL +   + +  G   +  E+  L  ++       KLWNG +F +D      S  + 
Sbjct: 623 WIAALTSYIQMLKRAGIPTKDYEEKLLMAYEAYTT---KLWNGKFFKFDEL-PDNSKIVM 678

Query: 757 ADQLAGQWYARACGLLPIVDEDKARSALKKVYNYNVLKVMGGKRGAVNGMLPDGRVDMSS 816
           ADQL G W   A      V E K +SAL+ ++ YNV    GGK GAVNG L   RVD SS
Sbjct: 679 ADQLCGFWALTAMDEPIQVSEGKIQSALETIFKYNVEMYDGGKCGAVNGFLASERVDGSS 738

Query: 817 MQSREIWSGVTYAVAASMIHEDLADIGFQTACGIYEAAWSGTGLGYAFQTPEAWNTDDQY 876
           +QS E+W+G+TY+++A MI + + ++ F+T+ G++ + W+   L Y  QTPEA   D  Y
Sbjct: 739 IQSEEVWAGITYSLSAMMIEKGMDEMAFKTSEGLFHSIWNRFPLQY--QTPEAITADGMY 796

Query: 877 RSLCYMRPLAIWAMQWALTRPKPK 900
           R+L YMRPL+IWA+Q AL + K +
Sbjct: 797 RALGYMRPLSIWAIQHALDKKKNR 820


>gi|358254610|dbj|GAA55931.1| non-lysosomal glucosylceramidase [Clonorchis sinensis]
          Length = 1200

 Score =  443 bits (1139), Expect = e-121,   Method: Compositional matrix adjust.
 Identities = 291/853 (34%), Positives = 422/853 (49%), Gaps = 93/853 (10%)

Query: 71  IREEAAKGKRAFIDPFIKRHLTSSHGVPLGGVGSGSIGRSYRGEFQRWQIFPRVCEDKPV 130
           +R+   + +  F+DP         +GVP+GG+G G+IGR +RGEF R  + P +    P 
Sbjct: 63  LRKRLIEKRLPFLDPSQHVPWKPIYGVPMGGIGCGAIGRGFRGEFVRSSLIPGMYSYDPQ 122

Query: 131 LANQFSVFVSRSNGQKYSSVLCPKTPEVLKDTTAAGIGSWDWNLKGDKSTYHALYPRAWT 190
            A+QF + + +     +  VL P+T        A G+ SW+W      + Y  LYPR+WT
Sbjct: 123 PADQFILTIRKGGETLFHQVLSPQTS---VPAFANGLRSWNWGFSAQHAHYIGLYPRSWT 179

Query: 191 VHEGEPDPELRIVCRQISPIIPHNYKESSYPVSVFTYTIYNSGKTSADITLLFTWTN-SV 249
           V+E      L +VC Q+SP++PH+YK S  P  VF +T++N G  + ++T+ F+W   S 
Sbjct: 180 VYEVA-HFNLILVCHQVSPVLPHDYKASCLPTCVFVWTVFNFGSEALEVTISFSWHGPSP 238

Query: 250 GGDSEFTGQHYNSKTKMNDGVHAVLLHHRTSHQLPPVTFALAAQETDGVHVSLCPHFVIS 309
              S     +  S  K  +G+ +V L      +  PV   L+A+    +H+ + P   IS
Sbjct: 239 APRSAPNSNNQVSNHKQPNGLSSVELTQPLVKE--PVADYLSARIQFAMHLLVGPSGYIS 296

Query: 310 GNSLGLTAKDMWHEIKEHGSFDRLNSMETSVTSEPGSSIGAAIAASVTVPPDSEGQVTFS 369
             S G T  + W  ++      +L  M ++    P +    A A  ++     + Q+ F+
Sbjct: 297 YVS-GYTITN-WPTLRRRTP--KLAVMVSATCIVPPARSANASAGDLS--EAGQSQLEFA 350

Query: 370 LAWDCPEVNFMSGKTYYRR-YTKFYGT-HQNAAANIARDAILEHGSWELQIEAWQRPILE 427
           + W  P V F   K  YRR Y +++       A  +   A+     W  +IE WQ P+L 
Sbjct: 351 VVWHSPLVWFRGNKVIYRRRYARWFPEPGMLGAERLLESALSNWKDWVERIEQWQNPVLH 410

Query: 428 DKRLPEWYPITLFNELYYLNAGGAVWTDGSPPVHSLVTIGHRKFSLDWSQSDLKRIVDVP 487
           +   P+WY   LFNELYYL  GG VW D   P H      H      W     +  V   
Sbjct: 411 NSSFPDWYKSALFNELYYLTDGGTVWLD---PNH------HGGADFKWQSWKYRSRV--- 458

Query: 488 NQNDTAVNILERMSSILEQIYTPVALNSSFGTNLLQDGEENIGQFLYLEGIEYLMWNTYD 547
                                                  + +G F YLEG EY M+NT D
Sbjct: 459 --------------------------------------AQEMGLFAYLEGHEYRMYNTLD 480

Query: 548 VHFYSSFALIMLFPKIQLSIQRDFAAAVMMHDPSKMKLLDDGQWVSRKVLGAVPHDIGI- 606
           VH+ SS+ALI L+PK+QL++  D A   +  D +++  +  G++  R    AVPHD G  
Sbjct: 481 VHYNSSWALIKLWPKLQLAVLLDCADLAIEEDQTQLYFIHQGRYGIRSTESAVPHDFGDP 540

Query: 607 -CDPWFEVNAYCLYDTARWKDLNPKFVLQVYRDVVATGDKKFAKAVWPSV-YVAMAYMDQ 664
             +PW + NAY +Y T  WKDLNPKFVLQV+RD   T D  +   + P V  +    +  
Sbjct: 541 EGEPWRDANAYVMYPTKDWKDLNPKFVLQVWRDWKLTQDNDYLLYMLPIVNRIIQVSLQS 600

Query: 665 FDRDGDGMIENDGFPDQTYDTWSVSGISAYSGGLWVA-----------ALQAASALAREV 713
           +D +GDGMIEN GFPDQTYDTW+  GISAY+GGLW+A           AL A S L   +
Sbjct: 601 WDSNGDGMIENSGFPDQTYDTWNAQGISAYTGGLWLACLYSACEMLEYALSANSPLKAFL 660

Query: 714 GDRGSED---------YFLFKFQKAKVVYEK-LWNGSYFNYDNSGSSQSSSIQADQLAGQ 763
               SED               + AK VY++ LW G+Y+ Y N+      +I ADQL+G 
Sbjct: 661 IASNSEDEVSWPQVQKELQILLESAKTVYDRALWTGTYYIYQNTPVGNHEAIMADQLSGH 720

Query: 764 WYARACGLLP--IVDEDKARSALKKVYNYNVLKVMGGKRGAVNGMLPDGRVDMSSMQSRE 821
            + R  G  P  I+  +    ALK + + N   +  G  G +NG LP  + D+S++Q+ E
Sbjct: 721 GFLRVGGAPPGAILPSEHVIMALKTIQSTNWESIQNGSLGVINGGLPGAKRDLSNVQAEE 780

Query: 822 IWSGVTYAVAASMIHEDLADIGFQTACGIYEAAWSGTGLGYAFQTPEAWNTDDQYRSLCY 881
            W GV Y++AA+MI E ++  GF  A   Y   ++  GL Y  QTPEA+  D ++R   Y
Sbjct: 781 FWVGVNYSLAATMILEGMSQEGFSLAEACYNTIYNRFGLQY--QTPEAYMIDGRFRCPGY 838

Query: 882 MRPLAIWAMQWAL 894
           MRPLAIW++Q AL
Sbjct: 839 MRPLAIWSIQQAL 851


>gi|326528739|dbj|BAJ97391.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 424

 Score =  436 bits (1121), Expect = e-119,   Method: Compositional matrix adjust.
 Identities = 230/410 (56%), Positives = 292/410 (71%), Gaps = 2/410 (0%)

Query: 26  VDPAKPASLTWQRKLSTGEIPLSQFTLNWKETVQLAPIGVRILCLIREEAAKGKRAFIDP 85
           ++ A+PA+ TWQRK       +  F++   + + + P+ ++ L +  ++  KG+    DP
Sbjct: 15  INCAQPATRTWQRKFDDEGKKIELFSMTMNDMISIIPMILKGLMINADQRGKGRDILYDP 74

Query: 86  FIKRHLTSSHGVPLGGVGSGSIGRSYRGEFQRWQIFPRVCEDKPVLANQFSVFVSRSNGQ 145
           F K       G+PLGG+G+GSIGRSYRG FQ +QIFP + E++P+LANQFS FVSR NG+
Sbjct: 75  FRKWMDNCYRGLPLGGLGAGSIGRSYRGYFQHFQIFPALYEERPILANQFSAFVSRPNGK 134

Query: 146 KYSSVLCPKTPEVLKDTTAAGIGSWDWNLKGDKSTYHALYPRAWTVHEGEPDPELRIVCR 205
            YS+VL   T +  K     GIGSWDW LK    TYHAL+PR+WTV++GEPDPE+ I CR
Sbjct: 135 SYSTVLSAPTADAFKGVDKVGIGSWDWKLKEKNCTYHALFPRSWTVYDGEPDPEINITCR 194

Query: 206 QISPIIPHNYKESSYPVSVFTYTIYNSGKTSADITLLFTWTNSVGGDSEFTGQHYNSKTK 265
           QISPIIPHNYKESS+PV+VFT+T+ NSG T+AD+TLLFTWTNSVGG SE TG H NSK  
Sbjct: 195 QISPIIPHNYKESSFPVAVFTFTVQNSGSTAADVTLLFTWTNSVGGRSELTGNHTNSKMI 254

Query: 266 MNDGVHAVLLHHRTSHQLPPVTFALAAQETDGVHVSLCPHFVI-SGNSLGLTAKDMWHEI 324
             DGV  VLL HRT+   PPVTFA+A+QET+ V V+ CP F + S +S   TAKDMW EI
Sbjct: 255 ERDGVQGVLLRHRTADGHPPVTFAIASQETEEVRVTDCPFFTMGSSDSGDFTAKDMWEEI 314

Query: 325 KEHGSFDRLNSMETSVTSEPGSSIGAAIAASVTVPPDSEGQVTFSLAWDCPEVNFMSGKT 384
           K+HGSF    + +    S+PGSSIGAA+AA+ TVP      V+F+L+W CPEV F  GKT
Sbjct: 315 KQHGSFSETRTDKEPRASKPGSSIGAAVAAATTVPAGGTRVVSFALSWSCPEVKFPDGKT 374

Query: 385 YYRRYTKFYGTHQNAAA-NIARDAILEHGSWELQIEAWQRPILEDKRLPE 433
           Y+RRYTKF G   +AAA ++A DA+LEH  WE +IE WQRPIL+DKRLPE
Sbjct: 375 YHRRYTKFCGLDGDAAAESLAHDALLEHMDWESKIEEWQRPILQDKRLPE 424


>gi|355689821|gb|AER98956.1| glucosidase, beta 2 [Mustela putorius furo]
          Length = 550

 Score =  433 bits (1113), Expect = e-118,   Method: Compositional matrix adjust.
 Identities = 240/560 (42%), Positives = 315/560 (56%), Gaps = 64/560 (11%)

Query: 339 SVTSEPGSSIGAAIAASVTVPPDSEGQVTFSLAWDCPEVNF-MSGKTYYRRYTKFYGTHQ 397
           S  S+ G  I  A+  +  +PP  + ++ FSLAWD P + F   G+ YYRRYT+F+G   
Sbjct: 1   STPSQKGVGIAGAVCVTGKLPPRGQCRLEFSLAWDMPRIMFGAKGQVYYRRYTRFFGRDG 60

Query: 398 NAAANIARDAILEHGSWELQIEAWQRPILEDKRLPEWYPITLFNELYYLNAGGAVWTDGS 457
           NAA  ++  A+  +  WE +I AWQ P+L+D+ LP WY   LFNELY+L  GG VW    
Sbjct: 61  NAAPALSHYALCRYIDWEEKISAWQSPVLDDRSLPAWYKSALFNELYFLADGGTVW---- 116

Query: 458 PPVHSLVTIGHRKFSLDWSQSDLKRIVDVPNQN--DTAVNILERMSSILEQIYTPVALNS 515
                                     ++VP  +  D  V  + ++  IL++         
Sbjct: 117 --------------------------LEVPEDSLPDELVGSMCQLRPILQEY-------- 142

Query: 516 SFGTNLLQDGEENIGQFLYLEGIEYLMWNTYDVHFYSSFALIMLFPKIQLSIQRDFAAAV 575
                         G+F YLEG EY M+NTYDVHFY+SFALIML+PK++LS+Q D A A 
Sbjct: 143 --------------GRFGYLEGQEYRMYNTYDVHFYASFALIMLWPKLELSLQYDMALAT 188

Query: 576 MMHDPSKMKLLDDGQWVSRKVLGAVPHDIGICD--PWFEVNAYCLYDTARWKDLNPKFVL 633
           +  D ++ + L  G     K    +PHDIG  D  PW  VNAY ++DTA WKDLN KFVL
Sbjct: 189 LREDLTRRQYLMSGVMAPVKRRNVIPHDIGDPDDEPWLRVNAYVVHDTADWKDLNLKFVL 248

Query: 634 QVYRDVVATGDKKFAKAVWPSVYVAMAYMDQFDRDGDGMIENDGFPDQTYDTWSVSGISA 693
           QVYRD   TGD+ F + +WP     M    +FD+D DG+IEN G+ DQTYD W  +G SA
Sbjct: 249 QVYRDYHLTGDQSFLRDMWPVCLAVMESEMKFDKDQDGLIENGGYADQTYDGWITTGPSA 308

Query: 694 YSGGLWVAALQAASALAREVGDRGSEDYFLFKFQKAKVVYEK-LWNGSYFNYDNSGSSQS 752
           Y GGLW+AA+     +A   G +  +D F    ++ +  YE+ LWNG Y+NYD S   QS
Sbjct: 309 YCGGLWLAAVAVMVQMAVLCGAKDIQDKFSSILRRGQEAYERLLWNGRYYNYDCSPQPQS 368

Query: 753 SSIQADQLAGQWYARACGL----LPIVDEDKARSALKKVYNYNVLKVMGGKRGAVNGMLP 808
            SI +DQ AGQW+ +A GL      +        AL+ ++ +NV    GG  GAVNGM P
Sbjct: 369 CSIMSDQCAGQWFLKASGLGEGDTEVFPTQHVVCALQTIFEFNVRAFAGGAMGAVNGMQP 428

Query: 809 DGRVDMSSMQSREIWSGVTYAVAASMIHEDLADIGFQTACGIYEAAWSGTGLGYAFQTPE 868
            G  D SS+QS E+W GV Y +AA+MI E L   GFQTA G Y   W   GL  AFQTPE
Sbjct: 429 HGVPDRSSVQSDEVWVGVVYGLAATMIQEGLTWEGFQTAEGCYRTVWERLGL--AFQTPE 486

Query: 869 AWNTDDQYRSLCYMRPLAIW 888
           A+     +RSL YMRPL+IW
Sbjct: 487 AYCQQRVFRSLAYMRPLSIW 506


>gi|37522029|ref|NP_925406.1| hypothetical protein gll2460 [Gloeobacter violaceus PCC 7421]
 gi|35213028|dbj|BAC90401.1| gll2460 [Gloeobacter violaceus PCC 7421]
          Length = 867

 Score =  424 bits (1091), Expect = e-116,   Method: Compositional matrix adjust.
 Identities = 290/851 (34%), Positives = 408/851 (47%), Gaps = 133/851 (15%)

Query: 95  HGVPLGGVGSGSIGRSYRGEFQRWQI--FPRVCEDKPVLANQFSVFVSRSNGQKYSSVLC 152
           HG+PLGG G+G IGRS+RG+F  W +         +PV   QF++F      +  + VLC
Sbjct: 79  HGMPLGGFGAGCIGRSHRGDFTLWHLDSGDHYFRSQPVC--QFALFEQSEGAKPRAKVLC 136

Query: 153 PKTPEVLKDTTAAGIGSWDWNLKGDKSTYHALYPRAWTVHEGEPDPELRIVCRQISPIIP 212
            + P          +G+WDW       TY ALYPR+W  +EG  +  +++  +Q SPIIP
Sbjct: 137 TEKP------ADGSLGAWDWTYPKGAGTYAALYPRSWFTYEGFFNSHVQL--KQFSPIIP 188

Query: 213 HNYKESSYPVSVFTYTIYNSGKTSADITLLFTWTNSVGGDSEFTGQHYNSKTKMND---- 268
           +NYK++SYPV+VF +T +N       +++L +W N VG    FT Q  +   K+ D    
Sbjct: 189 NNYKQTSYPVAVFEWTAFNPTDQPVTLSILVSWENMVGW---FTNQFKSRAVKVRDDDTP 245

Query: 269 -------------------------GVHAVLLHHRTSHQLP-PVTFALAAQ-----ETDG 297
                                    G+        T  Q P P T A++ +     E DG
Sbjct: 246 VYDYEPYWGESTGNFNSAREENAYRGLTMSRASGYTEQQKPAPATAAVSERPFVTAEGDG 305

Query: 298 -----------VHVSLCPHFVISGNSLGLTAKDMWHEIKEHGSFDRLNSMETSVTSEPGS 346
                      V ++    F  +GN L     ++W      G   RL  + +   +  G 
Sbjct: 306 QFVIATTREPDVEITYHTRFDPTGNGL-----EVWQPFATTG---RLGDLVSVRPAAKGE 357

Query: 347 SIGAAIAASVTVPPDSEGQVTFSLAWDCPEVNFMSGKTYYRRYTKFYGTHQNAAANIARD 406
            I AA+A  +T+ P     V  +LAWD P + F  G   YR YTK +      A  IA +
Sbjct: 358 RIAAAMAVRLTLKPGELRVVPMALAWDLPIMEFAQGVRRYRYYTKHFDKSGRNAWKIATE 417

Query: 407 AILEHGSWELQIEAWQRPILEDKRLPEWYPITLFNELYYLNAGGAVWTDGSPPVHSLVTI 466
           A+  +  W+ QI+ WQ P+L D  LP+WY   LFNELYYL +GG +W +G  P       
Sbjct: 418 ALDNYADWDRQIQEWQNPVLADPSLPDWYKQALFNELYYLTSGGTLWENGPVP------- 470

Query: 467 GHRKFSLDWSQSDLKRIVDVPNQNDTAVNILERMSSILEQIYTPVALNSSFGTNLLQDGE 526
                                          E+++S+ +    P                
Sbjct: 471 -------------------------------EKITSLADIAKAP---------------- 483

Query: 527 ENIGQFLYLEGIEYLMWNTYDVHFYSSFALIMLFPKIQLSIQRDFAAAVMMHDPSKMKLL 586
              G+F  LE I+Y  +++ DV  Y SFAL++ +P+I  ++ R +A AV   D S   + 
Sbjct: 484 PGQGRFAVLESIDYRWYDSLDVRLYGSFALLINWPEIDKAVLRSYADAVPDEDDSTRIIG 543

Query: 587 DDGQWVSRKVLGAVPHDIGIC--DPWFEVNAYCLYDTARWKDLNPKFVLQVYRDVVATG- 643
                  RK   A+PHD+G    DPW + N     D   WKDL   FVLQVYRD V TG 
Sbjct: 544 YTKTVAKRKSRDALPHDLGAPNEDPWIKNNYTTYQDCNLWKDLPCDFVLQVYRDYVDTGK 603

Query: 644 -DKKFAKAVWPSVYVAMAYMDQ-FDRDGDGMIENDGFPDQTYDTWSVSGISAYSGGLWVA 701
            D  F +  WP+   A+  +   FD+DGDG  EN G PD TYD W + GISAY GGLW+A
Sbjct: 604 RDLGFVRYCWPASKAALVRLKTTFDQDGDGFPENGGPPDCTYDAWPLKGISAYCGGLWLA 663

Query: 702 ALQAASALAREVGDRGSEDYFLFKFQKAKVVYEK-LWNGSYFNYDNSGSSQSSSIQADQL 760
           AL AA  + + VGD  +   F   + +A+ +YEK LWNG Y+  D S S  S +I ADQL
Sbjct: 664 ALLAAVEMGKLVGDAAAVQQFGAWYAQAQPLYEKTLWNGRYYRID-SQSKNSEAIMADQL 722

Query: 761 AGQWYARACGLLPIVDEDKARSALKKVYNYNVLKVMGGKRGAVNGMLPDGRVDMSSMQSR 820
            G++YA+ C L  IV E +ARSAL+ VY    +K  GG+ G  NG   DG     +    
Sbjct: 723 CGEYYAQVCNLADIVPEPQARSALETVYQTCFVKFYGGRFGCANGTNADGSFIGDTEHPS 782

Query: 821 EIWSGVTYAVAASMIHEDLADIGFQTACGIYEAAWSGTGLGYAFQTPEAWNTDDQYRSLC 880
           E+W+G+ + +AA MI   +   G   A  +    + G   G  F+TPEA      +R+  
Sbjct: 783 EVWTGINFGLAAFMIRNGMRREGMAIAEAVVANVYGG---GLQFRTPEALTPARTFRACM 839

Query: 881 YMRPLAIWAMQ 891
           Y+RP+AIWAMQ
Sbjct: 840 YLRPMAIWAMQ 850


>gi|147812670|emb|CAN61859.1| hypothetical protein VITISV_016692 [Vitis vinifera]
          Length = 305

 Score =  404 bits (1038), Expect = e-109,   Method: Compositional matrix adjust.
 Identities = 211/323 (65%), Positives = 250/323 (77%), Gaps = 23/323 (7%)

Query: 7   LENGLHEEE---KEPLNSSFDKVDPAKPASLTWQRKLSTGEIPLSQFTLNWKETVQLAPI 63
           +ENG H+E+   + P+ SS  KV+P KPASLTWQRKL+T    L++F L  +E   L  I
Sbjct: 1   MENG-HKEDGDMEHPVKSSAHKVNPGKPASLTWQRKLNTKANTLTRFNLKLREIKHLISI 59

Query: 64  GVRILCLIREEAAKGKRAFIDPFIKRHLTSSHGVPLGGVGSGSIGRSYRGEFQRWQIFPR 123
                              IDPF KR +TS HGVPLGG+G GSIGRSYRGEFQR+Q+FPR
Sbjct: 60  -------------------IDPFSKRLVTSYHGVPLGGIGGGSIGRSYRGEFQRYQLFPR 100

Query: 124 VCEDKPVLANQFSVFVSRSNGQKYSSVLCPKTPEVLKDTTAAGIGSWDWNLKGDKSTYHA 183
           +CED PVLANQFSVFVSR NG+K S+VLCP+ PEVLK + ++GIGSWDWNL G+  TYHA
Sbjct: 101 ICEDSPVLANQFSVFVSRPNGKKSSTVLCPRNPEVLKGSASSGIGSWDWNLDGESCTYHA 160

Query: 184 LYPRAWTVHEGEPDPELRIVCRQISPIIPHNYKESSYPVSVFTYTIYNSGKTSADITLLF 243
           LYPRAWTV+EGEPDPE+ I+  QISP IPHNYKESS+PVSVF +T+ NSGKTSADITLLF
Sbjct: 161 LYPRAWTVYEGEPDPEISIISSQISPFIPHNYKESSFPVSVFKFTLSNSGKTSADITLLF 220

Query: 244 TWTNSVGGDSEFTGQHYNSKTKMNDGVHAVLLHHRTSHQLPPVTFALAAQETDGVHVSLC 303
           TW NSVGG SEF+G HYNSK K  DGVH VLLHH+T++  PPVTFA+AA+ET  VH+S C
Sbjct: 221 TWANSVGGTSEFSGHHYNSKMKTKDGVHGVLLHHKTANGHPPVTFAIAAEETGDVHISEC 280

Query: 304 PHFVISGNSLGLTAKDMWHEIKE 326
           P F+ISGNSLG+TAK+MW EIK+
Sbjct: 281 PCFLISGNSLGVTAKEMWQEIKQ 303


>gi|343959578|dbj|BAK63646.1| bile acid beta-glucosidase [Pan troglodytes]
          Length = 514

 Score =  404 bits (1037), Expect = e-109,   Method: Compositional matrix adjust.
 Identities = 225/524 (42%), Positives = 292/524 (55%), Gaps = 64/524 (12%)

Query: 375 PEVNF-MSGKTYYRRYTKFYGTHQNAAANIARDAILEHGSWELQIEAWQRPILEDKRLPE 433
           P + F   G+ +YRRYT+F+G   +AA  ++  A+  +  WE +I AWQ P+L+D+ LP 
Sbjct: 2   PRIMFGAKGQVHYRRYTRFFGQDGDAAPALSHYALCRYTEWEERISAWQSPVLDDRSLPA 61

Query: 434 WYPITLFNELYYLNAGGAVWTDGSPPVHSLVTIGHRKFSLDWSQSDLKRIVDVPNQNDTA 493
           WY   LFNELY+L  GG VW +                                      
Sbjct: 62  WYKSALFNELYFLADGGTVWLE-------------------------------------- 83

Query: 494 VNILERMSSILEQIYTPVALNSSFGTNL--LQDGEENIGQFLYLEGIEYLMWNTYDVHFY 551
                    +LE      +L    G N+  L+    + G+F YLEG EY M+NTYDVHFY
Sbjct: 84  ---------VLED-----SLPEELGRNMCHLRPTLRDYGRFGYLEGQEYRMYNTYDVHFY 129

Query: 552 SSFALIMLFPKIQLSIQRDFAAAVMMHDPSKMKLLDDGQWVSRKVLGAVPHDIGICD--P 609
           +SFALIML+PK++LS+Q D A A +  D ++ + L  G     K    +P DIG  D  P
Sbjct: 130 ASFALIMLWPKLELSLQYDMALATLREDLTRRRYLMSGVMAPVKRRNVIPRDIGDPDDEP 189

Query: 610 WFEVNAYCLYDTARWKDLNPKFVLQVYRDVVATGDKKFAKAVWPSVYVAMAYMDQFDRDG 669
           W  VNAY ++DTA WKDLN KFVLQVYRD   TGD+ F K +WP     M    +FD+D 
Sbjct: 190 WLRVNAYLIHDTADWKDLNLKFVLQVYRDYYLTGDQNFLKDMWPVCLAVMESEMKFDKDH 249

Query: 670 DGMIENDGFPDQTYDTWSVSGISAYSGGLWVAALQAASALAREVGDRGSEDYFLFKFQKA 729
           DG+IEN G+ DQTYD W  +G SAY GGLW+AA+     +A   G +  +D F     + 
Sbjct: 250 DGLIENGGYADQTYDGWVTTGPSAYCGGLWLAAVAVMVQMAALCGAQDIQDKFSSILSRG 309

Query: 730 KVVYEK-LWNGSYFNYDNSGSSQSSSIQADQLAGQWYARACGL----LPIVDEDKARSAL 784
           +  YE+ LWNG Y+NYD+S   QS S+ +DQ AGQW+ +ACGL      +        AL
Sbjct: 310 QEAYERLLWNGRYYNYDSSSRPQSRSVMSDQCAGQWFLKACGLGEGDTEVFPTQHVVRAL 369

Query: 785 KKVYNYNVLKVMGGKRGAVNGMLPDGRVDMSSMQSREIWSGVTYAVAASMIHEDLADIGF 844
           + ++  NV    GG  GAVNGM P G  D SS+QS E+W GV Y +AA+MI E L   GF
Sbjct: 370 QTIFELNVQAFAGGAMGAVNGMQPHGVPDKSSVQSDEVWVGVVYGLAATMIQEGLTWEGF 429

Query: 845 QTACGIYEAAWSGTGLGYAFQTPEAWNTDDQYRSLCYMRPLAIW 888
           QTA G Y   W   GL  AFQTPEA+     +RSL YMRPL+IW
Sbjct: 430 QTAEGCYRTVWERLGL--AFQTPEAYCQQRVFRSLAYMRPLSIW 471


>gi|47212590|emb|CAG12815.1| unnamed protein product [Tetraodon nigroviridis]
          Length = 777

 Score =  386 bits (991), Expect = e-104,   Method: Compositional matrix adjust.
 Identities = 243/660 (36%), Positives = 354/660 (53%), Gaps = 78/660 (11%)

Query: 37  QRKLSTGEIPLSQFTLNWKETVQLAPIGVRILCLIREEAAKGKRA-FIDPFIKRHLTSSH 95
           ++   +G++ LSQ        ++   +G+R L    ++    K+A FID F    L   +
Sbjct: 31  RKPFQSGDVSLSQ-------VLKHLGLGIRYLRWWYKKTRVEKKAPFIDMFGALPLRQIY 83

Query: 96  GVPLGGVGSGSIGRSYRGEFQRWQIFPRVCEDKPVLANQFSVFVSRSNGQKYSSVLCPKT 155
           G PLGG+G G+I R +RGEF RWQ+ P     K V+ NQF+V + R     Y  VL  + 
Sbjct: 84  GAPLGGIGGGTITRGWRGEFCRWQLNPGKYHYKTVIENQFTVCLRRDGQTVYQQVLSVER 143

Query: 156 PEVLKDTTAAGIGSWDWNLKGDKSTYHALYPRAWTVHEGEPDPELRIVCRQISPIIPHNY 215
           P  L+         W+W   G+ + YHALYPRAWTV+   P   + + CRQISP+IPH+Y
Sbjct: 144 PPTLQ--------GWNWGFCGEYAYYHALYPRAWTVYH-LPGQNVTLTCRQISPVIPHDY 194

Query: 216 KESSYPVSVFTYTIYNSGKTSADITLLFTWTNSVGGDSEFTGQHYNSKTKMN---DGVHA 272
           +++S PV+VF + + N    + D++++FT  N  G   + +G H+N   ++    + V  
Sbjct: 195 QDASLPVAVFVWDVENKNDYALDVSIMFTALNGSGHRDDKSGGHWNQPFRLEKEGEAVSG 254

Query: 273 VLLHHRTSHQLPPVTFALAAQETDGVHVSLCPHFVISGNSLGLTAKDMWHEIKEHGSFDR 332
           VLLHH T+  + P T  +AA+E     VS    F   G   GL     W+++   G  D 
Sbjct: 255 VLLHHCTA--VNPYTLCIAAREQPDRDVSHQTAFSPKGTCNGL-----WNDLITDGRLD- 306

Query: 333 LNSMETSVTSEPGSSIGAAIAASVTVPPDSEGQVTFSLAWDCPEVNFMSG-KTYYRRYTK 391
            +   +S  +  G  + AA+AA   V   +   + FSLAWD P++ F +G + + RRYT+
Sbjct: 307 -SPTGSSPPTAKGERVAAALAAGCAVSAQNRNCLEFSLAWDMPKITFGAGEQQHSRRYTR 365

Query: 392 FYGTHQNAAANIARDAILEHGSWELQIEAWQRPILEDKRLPEWYPITLFNELYYLNAGGA 451
           +YG+  +A+  ++  A+ ++  WE +IE WQ P+L+D  LP WY   LFNELY++  GG 
Sbjct: 366 YYGSKGDASPLLSHHALTQYRGWEKRIEEWQTPVLQDSSLPAWYKSALFNELYFVADGGT 425

Query: 452 VWTDGSPPVHSLVTIGHRKFSLDWSQSDLKRIVDVPNQNDTAVNILERMSSILEQIYTPV 511
           VWT                        +L    DV          L    +++++     
Sbjct: 426 VWT------------------------ELAEDADVSGGLRPEDGGLPAQPAVIKE----- 456

Query: 512 ALNSSFGTNLLQDGEENIGQFLYLEGIEYLMWNTYDVHFYSSFALIMLFPKIQLSIQRDF 571
                             G+F YLEG EY M+NTYDVH+Y+SFALIML+PK+ LS+Q D 
Sbjct: 457 -----------------YGRFAYLEGQEYRMYNTYDVHYYASFALIMLWPKLALSLQYDI 499

Query: 572 AAAVMMHDPSKMKLLDDGQWVSRKVLGAVPHDIGIC--DPWFEVNAYCLYDTARWKDLNP 629
           A +V+  DP++   L +G+    K  G VPHDIG    +PW  VNAY ++DTA WKDLN 
Sbjct: 500 AGSVVQQDPTERLYLMNGRCSPVKTKGVVPHDIGDPEDEPWQRVNAYLIHDTAGWKDLNL 559

Query: 630 KFVLQVYRDVVATGDKKFAKAVWPSVYVAMAYMDQFDRDGDGMIENDGFPDQTYDTWSVS 689
           KFVLQVYRD   T D ++ + +WP   +      QFD DGDG+IEN G+ DQTYD W+ S
Sbjct: 560 KFVLQVYRDFHLTQDAQYLRDMWPICEMVTESELQFDLDGDGLIENSGYADQTYDGWAAS 619



 Score =  126 bits (317), Expect = 6e-26,   Method: Compositional matrix adjust.
 Identities = 63/118 (53%), Positives = 81/118 (68%), Gaps = 2/118 (1%)

Query: 777 EDKARSALKKVYNYNVLKVMGGKRGAVNGMLPDGRVDMSSMQSREIWSGVTYAVAASMIH 836
           ++K ++ALK +++ NV+   GG+ GAVNGM P+G  D SS+QS E+W GV Y +AA+MIH
Sbjct: 661 KEKIQTALKSIFDLNVMSFAGGQMGAVNGMRPEGVPDRSSVQSDEVWIGVVYGLAATMIH 720

Query: 837 EDLADIGFQTACGIYEAAWSGTGLGYAFQTPEAWNTDDQYRSLCYMRPLAIWAMQWAL 894
           E + + G +TA G Y   W    LG AFQTPEA+     YRSL YMRPL IWAMQ AL
Sbjct: 721 EGMLEEGMRTAEGCYRTVWE--RLGMAFQTPEAYCEKGIYRSLAYMRPLCIWAMQLAL 776


>gi|353232147|emb|CCD79502.1| bile acid beta-glucosidase-related [Schistosoma mansoni]
          Length = 900

 Score =  384 bits (987), Expect = e-103,   Method: Compositional matrix adjust.
 Identities = 276/868 (31%), Positives = 425/868 (48%), Gaps = 109/868 (12%)

Query: 71  IREEAAKGKRAFIDPFIKRHLTSSHGVPLGGVGSGSIGRSYRGEFQRWQIFPRVCEDKPV 130
           I++   + +  FID          +GVP+GG+GSGSIGR YRGEF R  + P +      
Sbjct: 77  IKKRFIQHRLPFIDAVSHVPWRPIYGVPIGGIGSGSIGRGYRGEFCRSSLIPGMYSYDVQ 136

Query: 131 LANQFSVFVSRSNGQKYSSVLCPKTPEVLKDTTAAGIGSWDWNLKGDKSTYHALYPRAWT 190
             +QF V V ++    Y  VL P T      + A  +  W W    +K  Y  LYPR+WT
Sbjct: 137 PVDQFIVTVRKNGVTVYHQVLSPLTKP---PSNAKKLKKWKWGFDPEKGHYLGLYPRSWT 193

Query: 191 VHEGEPDPELRIVCRQISPIIPHNYKESSYPVSVFTYTIYNSGKTSADITLLFTWTNSVG 250
           V+E  P+ +L +V +QISP+IPH+Y+ +  PV+VF + + +  K +  +T+ F+W     
Sbjct: 194 VYE-IPELQLVLVYQQISPVIPHDYQTTCLPVAVFYWKVISWNKENLKVTITFSWHGPNH 252

Query: 251 GDSEFTGQHYNSKTKMNDGVHAVLLHHRTSHQLPPVTFALAAQETDGVHVSLCPHFVISG 310
                +     S+   N   +    H  T+       F   +Q +             +G
Sbjct: 253 RKRTKSSDFSKSEDGNNSSENTTNTHVSTAEN----AFPTCSQRSTAFS---------TG 299

Query: 311 NSLGLTAKDMWHEIKEHGSFDRLN--------SMETSVTSEPGSSIGAAIAASVTVP--- 359
           + L L  +      K+  +F  L+        ++     ++    +  A++A+  VP   
Sbjct: 300 DFLKLNCQG-----KKLKTFTDLSWLGWLVGYTVPLDSKAKKSPKLAIAVSATTNVPRCN 354

Query: 360 -----PDSEGQVTFSLAWDCPEVNFMSGKTYY-RRYTKFYGTHQNAAANIA-RDAILEHG 412
                  S+ ++ F++ W  P V F +G   Y RRY +++   + + A +    AI    
Sbjct: 355 ITDNLNPSQSELEFAITWHSPIVKFRTGDVVYTRRYVRWFPVDRISGAKLLLAHAIDNWR 414

Query: 413 SWELQIEAWQRPILEDKRLPEWYPITLFNELYYLNAGGAVWTDGSPPVHSLVTIGHRKFS 472
            W  +IE WQ PIL ++ LP WY   LFNELYYL+ GG VW D   P+            
Sbjct: 415 QWVQKIEDWQNPILSNQSLPSWYKSALFNELYYLSDGGTVWLD---PIQ----------- 460

Query: 473 LDWSQSDLKRIVDVPNQNDTAVNILERMSSILEQIYTPVALNSSFGTNLLQDGEENIGQF 532
           +D  +SDL  + +     D  V+  +  + +                         IG F
Sbjct: 461 VDCFKSDLLNLTE-----DMKVDSWDHRARL----------------------SREIGLF 493

Query: 533 LYLEGIEYLMWNTYDVHFYSSFALIMLFPKIQLSIQRDFAAAVMMHDPSKMKLLDDGQWV 592
            YLEG EY M+NTYDVH Y+S+ALI L+PKIQL++  D A   +  D + +  + +G+ +
Sbjct: 494 GYLEGHEYRMYNTYDVHHYASWALIKLWPKIQLALNYDCADLTIAEDSTSVYYIGNGKAL 553

Query: 593 SRKVLGAVPHDIGICD--PWFEVNAYCLYDTARWKDLNPKFVLQVYRDVVATGDKKFAKA 650
            R    AV HD G  +  PW   NAY ++ T  WKDLN KF+LQV+RD   T D ++   
Sbjct: 554 FRSSECAVVHDFGDPEDEPWRCTNAYIMFPTDTWKDLNSKFILQVWRDWRITQDHQYLLY 613

Query: 651 VWPSVY-VAMAYMDQFDRDGDGMIENDGFPDQTYDTWSVSGISAYSGGLWVAALQAA--- 706
           + P V  +    +  +D D DG+IEN GFPDQTYD W+  G++AY+GG+W++ L A    
Sbjct: 614 MLPIVLRILRKSLVAWDSDDDGLIENSGFPDQTYDVWTAKGLTAYTGGMWLSCLYATFDM 673

Query: 707 --------SALAREVGDRGSE---------DYFLFKFQKAKVVYE-KLWNGSYFNYDNSG 748
                   S +  ++ +   +         D     F KA+  Y  KLW GSY+ +    
Sbjct: 674 LSWCLKSDSPVYDQIVNNTDDTQRSWSEIKDEVQTLFTKARDSYNAKLWTGSYYAFQTYC 733

Query: 749 SSQSSSIQADQLAGQWYARACGLLP--IVDEDKARSALKKVYNYNVLKVMGGKRGAVNGM 806
           + +   I ADQL+G W++R  G+ P  I+ ++     L+ + N N   +  G  GA+NG 
Sbjct: 734 TPRREVIMADQLSGYWFSRITGVPPNLILPKNHVVKTLQTISNCNWHGIKNGTIGAINGC 793

Query: 807 LPDGRVDMSSMQSREIWSGVTYAVAASMIHEDLADIGFQTACGIYEAAWSGTGLGYAFQT 866
           LP  + D+SS+Q+ E W  V Y++++ MI E + D G       Y+  ++  GL Y  QT
Sbjct: 794 LPVCKPDLSSVQAEEFWVAVNYSLSSLMIAEGMIDEGLALGEKCYDTIYNLYGLQY--QT 851

Query: 867 PEAWNTDDQYRSLCYMRPLAIWAMQWAL 894
           PEA+ +D ++R   YMR LAIW++Q AL
Sbjct: 852 PEAYMSDGRFRCPGYMRALAIWSIQQAL 879


>gi|256082537|ref|XP_002577511.1| bile acid beta-glucosidase-related [Schistosoma mansoni]
          Length = 892

 Score =  381 bits (978), Expect = e-102,   Method: Compositional matrix adjust.
 Identities = 277/868 (31%), Positives = 422/868 (48%), Gaps = 117/868 (13%)

Query: 71  IREEAAKGKRAFIDPFIKRHLTSSHGVPLGGVGSGSIGRSYRGEFQRWQIFPRVCEDKPV 130
           I++   + +  FID          +GVP+GG+GSGSIGR YRGEF R  + P +      
Sbjct: 77  IKKRFIQHRLPFIDAVSHVPWRPIYGVPIGGIGSGSIGRGYRGEFCRSSLIPGMYSYDVQ 136

Query: 131 LANQFSVFVSRSNGQKYSSVLCPKTPEVLKDTTAAGIGSWDWNLKGDKSTYHALYPRAWT 190
             +QF V V ++    Y  VL P T      + A  +  W W    +K  Y  LYPR+WT
Sbjct: 137 PVDQFIVTVRKNGVTVYHQVLSPLTKP---PSNAKKLKKWKWGFDPEKGHYLGLYPRSWT 193

Query: 191 VHEGEPDPELRIVCRQISPIIPHNYKESSYPVSVFTYTIYNSGKTSADITLLFTWTNSVG 250
           V+E  P+ +L +V +QISP+IPH+Y+ +  PV+VF + + +  K +  +T+ F+W     
Sbjct: 194 VYE-IPELQLVLVYQQISPVIPHDYQTTCLPVAVFYWKVISWNKENLKVTITFSWHGPNH 252

Query: 251 GDSEFTGQHYNSKTKMNDGVHAVLLHHRTSHQLPPVTFALAAQETDGVHVSLCPHFVISG 310
                +     S+   N   +    H  T+       F   +Q +             +G
Sbjct: 253 RKRTKSSDFSKSEDGNNSSENTTNTHVSTAEN----AFPTCSQRSTAFS---------TG 299

Query: 311 NSLGLTAKDMWHEIKEHGSFDRLN--------SMETSVTSEPGSSIGAAIAASVTVP--- 359
           + L L  +      K+  +F  L+        ++     ++    +  A++A+  VP   
Sbjct: 300 DFLKLNCQG-----KKLKTFTDLSWLGWLVGYTVPLDSKAKKSPKLAIAVSATTNVPRCN 354

Query: 360 -----PDSEGQVTFSLAWDCPEVNFMSGKTYY-RRYTKFYGTHQNAAANIA-RDAILEHG 412
                  S+ ++ F++ W  P V F +G   Y RRY +++   + + A +    AI    
Sbjct: 355 ITDNLNPSQSELEFAITWHSPIVKFRTGDVVYTRRYVRWFPVDRISGAKLLLAHAIDNWR 414

Query: 413 SWELQIEAWQRPILEDKRLPEWYPITLFNELYYLNAGGAVWTDGSPPVHSLVTIGHRKFS 472
            W  +IE WQ PIL ++ LP WY   LFNELYYL+ GG VW D   P+            
Sbjct: 415 QWVQKIEDWQNPILSNQSLPSWYKSALFNELYYLSDGGTVWLD---PI------------ 459

Query: 473 LDWSQSDLKRIVDVPNQNDTAVNILERMSSILEQIYTPVALNSSFGTNLLQDGEENIGQF 532
               Q DL   + V + +  A     R+S                           IG F
Sbjct: 460 ----QVDLTEDMKVDSWDHRA-----RLS-------------------------REIGLF 485

Query: 533 LYLEGIEYLMWNTYDVHFYSSFALIMLFPKIQLSIQRDFAAAVMMHDPSKMKLLDDGQWV 592
            YLEG EY M+NTYDVH Y+S+ALI L+PKIQL++  D A   +  D + +  + +G+ +
Sbjct: 486 GYLEGHEYRMYNTYDVHHYASWALIKLWPKIQLALNYDCADLTIAEDSTSVYYIGNGKAL 545

Query: 593 SRKVLGAVPHDIGICD--PWFEVNAYCLYDTARWKDLNPKFVLQVYRDVVATGDKKFAKA 650
            R    AV HD G  +  PW   NAY ++ T  WKDLN KF+LQV+RD   T D ++   
Sbjct: 546 FRSSECAVVHDFGDPEDEPWRCTNAYIMFPTDTWKDLNSKFILQVWRDWRITQDHQYLLY 605

Query: 651 VWPSVY-VAMAYMDQFDRDGDGMIENDGFPDQTYDTWSVSGISAYSGGLWVAALQAA--- 706
           + P V  +    +  +D D DG+IEN GFPDQTYD W+  G++AY+GG+W++ L A    
Sbjct: 606 MLPIVLRILRKSLVAWDSDDDGLIENSGFPDQTYDVWTAKGLTAYTGGMWLSCLYATFDM 665

Query: 707 --------SALAREVGDRGSE---------DYFLFKFQKAKVVYE-KLWNGSYFNYDNSG 748
                   S +  ++ +   +         D     F KA+  Y  KLW GSY+ +    
Sbjct: 666 LSWCLKSDSPVYDQIVNNTDDTQRSWSEIKDEVQTLFTKARDSYNAKLWTGSYYAFQTYC 725

Query: 749 SSQSSSIQADQLAGQWYARACGLLP--IVDEDKARSALKKVYNYNVLKVMGGKRGAVNGM 806
           + +   I ADQL+G W++R  G+ P  I+ ++     L+ + N N   +  G  GA+NG 
Sbjct: 726 TPRREVIMADQLSGYWFSRITGVPPNLILPKNHVVKTLQTISNCNWHGIKNGTIGAINGC 785

Query: 807 LPDGRVDMSSMQSREIWSGVTYAVAASMIHEDLADIGFQTACGIYEAAWSGTGLGYAFQT 866
           LP  + D+SS+Q+ E W  V Y++++ MI E + D G       Y+  ++  GL Y  QT
Sbjct: 786 LPVCKPDLSSVQAEEFWVAVNYSLSSLMIAEGMIDEGLALGEKCYDTIYNLYGLQY--QT 843

Query: 867 PEAWNTDDQYRSLCYMRPLAIWAMQWAL 894
           PEA+ +D ++R   YMR LAIW++Q AL
Sbjct: 844 PEAYMSDGRFRCPGYMRALAIWSIQQAL 871


>gi|312382731|gb|EFR28085.1| hypothetical protein AND_04399 [Anopheles darlingi]
          Length = 777

 Score =  369 bits (947), Expect = 5e-99,   Method: Compositional matrix adjust.
 Identities = 245/788 (31%), Positives = 395/788 (50%), Gaps = 89/788 (11%)

Query: 153 PKTPEVLKDTTAAGIGSWDWNLKGDKSTYHALYPRAWTVHEGEPDPELRIVCRQISPIIP 212
           PKTP          +GSW+  L   + +Y ALYPRAW+ ++   +  +R+V RQISPIIP
Sbjct: 31  PKTP----------LGSWESALDASRCSYTALYPRAWSEYD-LSEYGVRLVQRQISPIIP 79

Query: 213 HNYKESSYPVSVFTYTIYNSGKTSADITLLFTWTNSVGGDSEFTGQHYNSKTKMNDGVHA 272
           H+YKESS P +VF +T+ N       +T+ FT+ N  G   +       +          
Sbjct: 80  HDYKESSMPCAVFVWTVENVCDRDRQVTITFTFKNGTGTKKQDAEGGSETGAFTQGSARG 139

Query: 273 VLLHHRTSHQLPPVTFALAAQETDGVHVSLCPHFVISGNSLGLTAKDMWHEIKEHGSFDR 332
           V +    +      T+ L+ + +  ++++ C  F  SG     T + +W+++KEHG    
Sbjct: 140 VSIRQTIADM--ACTYCLSCRSSSEINLTRCERFDPSG-----TGEKLWNDLKEHGHLTE 192

Query: 333 LNSMETSVTSEPGSSIGAAIAASVTVPPDSEGQVTFSLAWDCPEVNFMS-GKTYYRRYTK 391
            +  ET  T +    +  A++  + V P +  ++ F+L WD P+V+F   GK Y+R YTK
Sbjct: 193 QSVDETVKTKD----VAVAVSGQILVQPGTTSELQFALVWDMPKVHFQKRGKEYHRYYTK 248

Query: 392 FYGTHQNAAANIARDAILEHGSWELQIEAWQRPILEDKRLPEWYPITLFNELYYLNAGGA 451
           ++G   +A   ++  A+  +G WE  I+ WQRPIL+D  LP+WY   +FNELY++  GG+
Sbjct: 249 YFGKTGDAGPTMSDYALTNYGKWERLIDEWQRPILDDADLPDWYKSAIFNELYFIADGGS 308

Query: 452 VW---TDGSPPVHSLVTIGHRKFSLDWSQSDLKRIVDV------------PN-----QND 491
           VW    D   P         R   L+  +  +    DV            PN     Q D
Sbjct: 309 VWLTMEDSDLPYDDPRLAYGRYSYLEGHEYRMYTTYDVHFYASHALASLWPNLQVSIQYD 368

Query: 492 TAVNILERMSSILEQIY----TPVALNSSFGTNLLQDGEENIGQFLYLEGIEYLMWNTYD 547
              +I   +S   + +Y     P  + +S   +L    EE            + + N Y 
Sbjct: 369 YKDSIGREISEGRKHLYDGKVIPRKIQNSVPHDLGDPAEE-----------PFDLINAYP 417

Query: 548 VHFYSSFALIMLFPKIQLSIQRDFA-----AAVMMHDPSKMKLLDDGQWVSRKVLGAVPH 602
           +H  S +  + +  K  L + RD+      A +   + SK   +   +++ ++ +    +
Sbjct: 418 IHDVSEWRDLNI--KFILQVYRDYYTLNHYAQLNAENASKFSSI---EFIDKESM----Y 468

Query: 603 DIGICDPWFEVNAYCLYDTARWKDLNPK----FVLQVYRDVVATGDKKFAKAVWPSVYVA 658
           D  I D    V+     +    K  N K    ++ +    V       + KA++P+  + 
Sbjct: 469 DTYIQDNRHRVSTPPATNGTENKAANRKSASMYINEANGKVYLMDAMTYLKAMYPACKLV 528

Query: 659 MAYMDQFDRDGDGMIENDGFPDQTYDTWSVSGISAYSGGLWVAALQAASALAREVGDRGS 718
           +    ++D+D DG+IEN   PDQTYD+W + G SAY GGLW+A+L   + +A  +     
Sbjct: 529 LERSREWDKDDDGLIENSRSPDQTYDSWVMDGPSAYCGGLWLASLHCMATMASLLDQTED 588

Query: 719 EDYFLFKFQKAKVVY-EKLWNGSYFNYDNSGSSQSSSIQADQLAGQWYARACGL-LPIVD 776
              +     K +  + EKLWNG+Y+ +D   +S+ +SI +DQL G WY RACG    +  
Sbjct: 589 AANYQAILDKGRRSFEEKLWNGTYYKFDAQSASK-NSIMSDQLCGHWYLRACGFDYDVFP 647

Query: 777 EDKARSALKKVYNYNVLKVMGGKRGAVNGMLP------DGRVDMSSMQSREIWSGVTYAV 830
           ++  R A++ +Y  NV++  GG  GAVNG +P      +GR D+S++Q  E+W+GVTYA+
Sbjct: 648 KENVRLAMRTIYENNVMRFCGGNLGAVNGYVPNAQPNKEGRPDVSNIQGEEVWTGVTYAL 707

Query: 831 AASMIHEDLADIGFQTACGIYEAAWSGTGLGYAFQTPEAWNTDDQYRSLCYMRPLAIWAM 890
           AA+MIHE + +  FQTA G+Y +     G+   F+TPEA   +  YR++ YMRPL+IW+M
Sbjct: 708 AATMIHEGMFEEAFQTAGGLYRSLSEKIGMN--FETPEAVYAERHYRAIGYMRPLSIWSM 765

Query: 891 Q--WALTR 896
           Q  W L +
Sbjct: 766 QTAWELRK 773


>gi|327288418|ref|XP_003228923.1| PREDICTED: non-lysosomal glucosylceramidase-like, partial [Anolis
           carolinensis]
          Length = 667

 Score =  362 bits (929), Expect = 5e-97,   Method: Compositional matrix adjust.
 Identities = 199/456 (43%), Positives = 257/456 (56%), Gaps = 56/456 (12%)

Query: 389 YTKFYGTHQNAAANIARDAILEHGSWELQIEAWQRPILEDKRLPEWYPITLFNELYYLNA 448
           YT+F+G   +A   +A   +  +  WE +IEAWQRPILED  LP WY   LFNELY+L+ 
Sbjct: 228 YTRFFGREGDACPALAHYMLTHYPEWEKRIEAWQRPILEDSDLPAWYKSALFNELYFLSD 287

Query: 449 GGAVWTDGSPPVHSLVTIGHRKFSLDWSQSDLKRIVDVPNQNDTAVNILERMSSILEQIY 508
           GG +W +  P                          D P                     
Sbjct: 288 GGTLWVEFQPE-------------------------DAP--------------------- 301

Query: 509 TPVALNSSFGTNLLQDGEENIGQFLYLEGIEYLMWNTYDVHFYSSFALIMLFPKIQLSIQ 568
              +L +  G + L    +  G+F YLEG EY M+NTYDVHFY+SFAL ML+PK++LS+Q
Sbjct: 302 ---SLAAGQGLSGLLPVLKEYGRFAYLEGQEYRMYNTYDVHFYASFALAMLWPKLELSLQ 358

Query: 569 RDFAAAVMMHDPSKMKLLDDGQWVSRKVLGAVPHDIGICD--PWFEVNAYCLYDTARWKD 626
            D A AV+  DP     L +GQ    K+   VPHDIG  D  PW  VNAY ++DTA WKD
Sbjct: 359 YDMAVAVLSEDPRPRVYLMNGQTAQVKLRNVVPHDIGEPDEEPWQRVNAYLIHDTADWKD 418

Query: 627 LNPKFVLQVYRDVVATGDKKFAKAVWPSVYVAMAYMDQFDRDGDGMIENDGFPDQTYDTW 686
           LN KFVLQVYRD   T D  + + +WP     M    +FD DGDG+IEN GF DQTYD W
Sbjct: 419 LNLKFVLQVYRDFFLTEDCTYLRDMWPVCQAVMESELKFDTDGDGLIENSGFADQTYDAW 478

Query: 687 SVSGISAYSGGLWVAALQAASALAREVGDRGSEDYFLFKFQKAKVVYEK-LWNGSYFNYD 745
            V+G SAY GGLW+AA+     +A  +G+  +   +          +E+ LWNG Y+NYD
Sbjct: 479 VVTGASAYCGGLWLAAVCMMCRMAEILGEGAALQRYSAILSSGAAAFERLLWNGRYYNYD 538

Query: 746 NSGSSQSSSIQADQLAGQWYARACGL----LPIVDEDKARSALKKVYNYNVLKVMGGKRG 801
           + G   S+SI +DQLAGQW+ RACGL      +  ++  +SAL+ +Y  NVL    G  G
Sbjct: 539 SGGGPSSNSIMSDQLAGQWFLRACGLGEGKYQVFPQEHVQSALRTIYEMNVLGFSEGAMG 598

Query: 802 AVNGMLPDGRVDMSSMQSREIWSGVTYAVAASMIHE 837
           AVNGM PDG  D SS+QS E+W GV Y++AA+MI E
Sbjct: 599 AVNGMRPDGVPDTSSVQSDEVWVGVVYSLAATMIQE 634



 Score =  147 bits (372), Expect = 2e-32,   Method: Compositional matrix adjust.
 Identities = 84/226 (37%), Positives = 128/226 (56%), Gaps = 14/226 (6%)

Query: 72  REEAAKGKRAFIDPFIKRHLTSSHGVPLGGVGSGSIGRSYRGEFQRWQIFPRVCEDKPVL 131
           R+   + K+ FID F  + L   +G PLGG G G+I R ++G+F RWQ+ P +   K V+
Sbjct: 7   RKTRLEKKQPFIDLFNSQPLRQIYGCPLGGFGGGTITRGWKGDFCRWQLNPGLYHYKRVV 66

Query: 132 ANQFSVFVSRSNGQKYSSVLCPKTPEVLKDTTAAGIGSWDWNLKGDKSTYHALYPRAWTV 191
           A+QF+V + R     Y  VL  + P  L+         W+W+  G  + YHALYPRAW+V
Sbjct: 67  ADQFTVCLRRKGQTAYQQVLSVEQPSCLQ--------GWNWSFCGHYAFYHALYPRAWSV 118

Query: 192 HEGEPDPELRIVCRQISPIIPHNYKESSYPVSVFTYTIYNSGKTSADITLLFTWTNSVGG 251
           ++  P  +L + CRQ++PIIPH+YKE+S PV VF + + N      +++++FT+ N    
Sbjct: 119 YQ-LPGQDLVLTCRQVTPIIPHDYKETSLPVGVFVWEVENLRDEEVEVSIMFTFQNGTET 177

Query: 252 DSEFTGQHYNSKTKMNDGVHA---VLLHHRTSHQLPPVTFALAAQE 294
             +  G H+N    + +G H    VLLHH     +   T A+AA+E
Sbjct: 178 KEDRRGGHWNEPFALREGGHCVRGVLLHH--CLPVNGYTLAIAARE 221


>gi|428200465|ref|YP_007079054.1| putative bile acid beta-glucosidase [Pleurocapsa sp. PCC 7327]
 gi|427977897|gb|AFY75497.1| putative bile acid beta-glucosidase [Pleurocapsa sp. PCC 7327]
          Length = 808

 Score =  362 bits (929), Expect = 5e-97,   Method: Compositional matrix adjust.
 Identities = 283/856 (33%), Positives = 400/856 (46%), Gaps = 143/856 (16%)

Query: 95  HGVPLGGVGSGSIGRSYRGEFQRWQIFPRVCEDKPVLANQFSVFVSRSNGQKYSSVLCPK 154
           HG+PLGG G+G IGRS RG+F  W +       + + A QFSVF     G+  +  +C +
Sbjct: 41  HGMPLGGFGAGCIGRSSRGDFNLWHLDGGEHVFRSLPACQFSVFEQPEGGEAQAYAMCTE 100

Query: 155 TPEVLKDTTAAGIGSWDWNLKGDKSTYHALYPRAWTVHEGEPDPELRIVCRQISPIIPHN 214
            PE   D T   +  W W  K  K TYHALYPR+W  ++G    EL  VC Q SPI   N
Sbjct: 101 PPE---DGT---LSRWLWYPK-KKGTYHALYPRSWYEYKGVFQSEL--VCEQFSPICAGN 151

Query: 215 YKESSYPVSVFTYTIYNSGKTSADITLLFTWTNSVG----------------GDSEFTGQ 258
           Y+E+SYPV++F +T +N       ++++ TW N+VG                G  E+  Q
Sbjct: 152 YQETSYPVAIFEWTAHNPTDKPITLSIMLTWQNTVGWFSNAIKSPSIQVRDDGSPEYEYQ 211

Query: 259 --------HYNSKTKMNDGVHAVL----LHH-RTSHQLPPVTFALAAQETDGVHVSLCPH 305
                   ++N     N  V  +L    LH          + FA     T  +  S    
Sbjct: 212 PRWGDSTGNFNQWIVDNFRVGCLLNRIRLHEDEIGEGEGQICFASITNPT--IEASYLGR 269

Query: 306 FVISGNSLGLTAKDMWHEIKEHGSFDRLNSMETSVTSEPGSSIGAAIAASVTVPPDSEGQ 365
           +  +G+       ++W     +GS   L   E    +EPG  I AA+A   TV P    +
Sbjct: 270 WNPNGD-----GSEVWDCFAANGS---LPDTEDETPAEPGEQIAAAMAIRFTVRPGKTRK 321

Query: 366 VTFSLAWDCPEVNFMSGKTYYRRYTKFYGTHQNAAANIARDAILEHGSWELQIEAWQRPI 425
           + F LAWD P   F  G  Y RRYT F+G     A  I R A L+HG      + W+   
Sbjct: 322 IPFILAWDLPITEFAEGVKYCRRYTDFFGRTGTNAWAIVRTA-LKHG------DMWK--- 371

Query: 426 LEDKRLPEWYPITLFNELYYLNAGGAVWTDGSPPVHSLVTIGHRKFSLDWSQSDLKRIVD 485
              +R+ EW                                          QS + +  D
Sbjct: 372 ---ERIKEW------------------------------------------QSPILKRED 386

Query: 486 VPNQNDTAVNILERMSSILEQIYTPVALNSSFGTNLLQDGEENIGQFLYLEGIEYLMWNT 545
           +P+    A         +  ++Y  +A   +  T   +D  + IGQF  LE ++Y  + +
Sbjct: 387 LPDWFKMA---------LFNELYL-LADGGTLWTAATED--DPIGQFGVLECMDYRWYES 434

Query: 546 YDVHFYSSFALIMLFPKIQLSIQRDFAAAVMMHDPSKMKLLDDGQWVSRKVLGAVPHDIG 605
            DV  Y SF L+ML+ ++  ++   FA A+   D +   +  +     RK  GA PHD+G
Sbjct: 435 LDVRLYGSFGLMMLWSRLDKAVLEAFARAIPTGDDTPRIIGYNKAPAIRKAKGATPHDLG 494

Query: 606 ICD--PWFEVNAYCLYDTARWKDLNPKFVLQVYRDVVATG--DKKFAKAVWPSVYVAMAY 661
             +  PW + N     D   WKDL   FVLQVYRD V TG  D  F    W S+   +AY
Sbjct: 495 APNEHPWEKTNYTSYQDCNLWKDLGSDFVLQVYRDFVLTGAKDVAFLWECWQSIVETLAY 554

Query: 662 MDQFDRDGDGMIENDGFPDQTYDTWSVSGISAYSGGLWVAALQAASALAREV-------G 714
           + +FDRDGDG+ EN G PDQT+D W + GISAY GGLW+AAL+AA A+ + +        
Sbjct: 555 LKKFDRDGDGIPENSGAPDQTFDDWRLRGISAYCGGLWIAALEAAIAIGKILIENPPSNP 614

Query: 715 DRGSEDY----------FLFKFQKAKVVY-EKLWNGSYFNYDNSGSSQSSSIQADQLAGQ 763
           D   E+Y          +    Q A+ +Y EKLWNG Y++ D+   S S  + ADQL GQ
Sbjct: 615 DLELENYPQSLQNTIQTYQNWLQGARSLYHEKLWNGEYYHLDS--ESGSDIVMADQLCGQ 672

Query: 764 WYARACGLLPIVDEDKARSALKKVYNYNVLKVMGGKRGAVNGMLPDGRV-DMSSMQSREI 822
           +YAR  GL  +V+   A SALKKVY    LK   GK GA NG+ PDG   D  +    E+
Sbjct: 673 FYARLLGLPDVVERQYAESALKKVYEACFLKFQDGKFGAANGVKPDGSPEDPKATHPLEV 732

Query: 823 WSGVTYAVAASMIHEDLADIGFQTACGIYEAAWSGTGLGYAFQTPEAWNTDDQYRSLCYM 882
           W+G+ + +AA ++   + D  F+ A  +    +     G  F+TPEA      +R+  Y+
Sbjct: 733 WTGINFGLAAFLLQMGMKDEAFKLAETVVRQVYDN---GLQFRTPEAITAVGTFRASHYL 789

Query: 883 RPLAIWAMQWALTRPK 898
           R +AIW +   LT  K
Sbjct: 790 RAMAIWGIYGVLTEFK 805


>gi|148670516|gb|EDL02463.1| glucosidase beta 2, isoform CRA_c [Mus musculus]
          Length = 467

 Score =  356 bits (914), Expect = 3e-95,   Method: Compositional matrix adjust.
 Identities = 200/467 (42%), Positives = 262/467 (56%), Gaps = 59/467 (12%)

Query: 429 KRLPEWYPITLFNELYYLNAGGAVWTDGSPPVHSLVTIGHRKFSLDWSQSDLKRIVDVPN 488
           + LP WY   LFNELY+L  GG VW                              ++VP 
Sbjct: 10  RTLPAWYKSALFNELYFLADGGTVW------------------------------LEVP- 38

Query: 489 QNDTAVNILERMSSILEQIYTPVALNSSFGTNLLQDGEENIGQFLYLEGIEYLMWNTYDV 548
               A ++ E +   + Q+ + +               ++ G+F YLEG EY M+NTYDV
Sbjct: 39  ----ADSLPEGLGGSMRQLRSTL---------------QDYGRFGYLEGQEYRMYNTYDV 79

Query: 549 HFYSSFALIMLFPKIQLSIQRDFAAAVMMHDPSKMKLLDDGQWVSRKVLGAVPHDIGICD 608
           HFY+SFAL+ML+PK++LS+Q D A A +  D ++ + L  G     K    +PHDIG  D
Sbjct: 80  HFYASFALVMLWPKLELSLQYDMALATLKEDLTRRRYLMSGVVAPVKRRNVIPHDIGDPD 139

Query: 609 --PWFEVNAYCLYDTARWKDLNPKFVLQVYRDVVATGDKKFAKAVWPSVYVAMAYMDQFD 666
             PW  VNAY ++DTA WKDLN KFVLQ+YRD   TGD+ F + +WP     M    +FD
Sbjct: 140 DEPWLRVNAYLIHDTADWKDLNLKFVLQIYRDYYLTGDQGFLEDMWPVCLAVMESEMKFD 199

Query: 667 RDGDGMIENDGFPDQTYDTWSVSGISAYSGGLWVAALQAASALAREVGDRGSEDYFLFKF 726
           +D DG+IEN G+ DQTYD W  +G SAY GGLW+AA+     +A   G +  ++ F    
Sbjct: 200 KDQDGLIENGGYADQTYDAWVTTGPSAYCGGLWLAAVAVMVQMAVLCGAQDVQERFASIL 259

Query: 727 QKAKVVYEK-LWNGSYFNYDNSGSSQSSSIQADQLAGQWYARACGL----LPIVDEDKAR 781
            + +  YE+ LWNG Y+NYD+S   QS SI +DQ AGQW+ RACGL      +       
Sbjct: 260 CRGREAYERLLWNGRYYNYDSSSHPQSRSIMSDQCAGQWFLRACGLGEGDTEVFPTLHVV 319

Query: 782 SALKKVYNYNVLKVMGGKRGAVNGMLPDGRVDMSSMQSREIWSGVTYAVAASMIHEDLAD 841
            AL+ ++  NV    GG  GAVNGM P G  D SS+QS E+W GV Y +AA+MI E L  
Sbjct: 320 RALQTIFELNVQAFAGGAMGAVNGMHPHGVPDRSSVQSDEVWVGVVYGLAATMIQEGLTW 379

Query: 842 IGFQTACGIYEAAWSGTGLGYAFQTPEAWNTDDQYRSLCYMRPLAIW 888
            GF+TA G Y   W   GL  AFQTPEA+     +RSL YMRPL+IW
Sbjct: 380 EGFRTAEGCYRTVWERLGL--AFQTPEAYCQQQVFRSLAYMRPLSIW 424


>gi|428223619|ref|YP_007107716.1| glucosylceramidase [Geitlerinema sp. PCC 7407]
 gi|427983520|gb|AFY64664.1| Glucosylceramidase [Geitlerinema sp. PCC 7407]
          Length = 821

 Score =  354 bits (909), Expect = 1e-94,   Method: Compositional matrix adjust.
 Identities = 276/865 (31%), Positives = 387/865 (44%), Gaps = 160/865 (18%)

Query: 95  HGVPLGGVGSGSIGRSYRGEFQRWQIFPRVCEDKPVLANQFSVFVSRSNGQKYSSVLCPK 154
           HG+PLGG G+G +GRS RG+F  W I       + + A QFSVF   ++G   +  LC +
Sbjct: 39  HGMPLGGFGAGCLGRSPRGDFNLWHIDGGEHVFQSMPACQFSVFEQGADGTSQAYALCSE 98

Query: 155 TPEVLKDTTAAGIGSWDW----NLKGDKSTYHALYPRAWTVHEGEPDPELRIVCRQISPI 210
            P       A  +G W W      +    TY ALYPR+   +EG      R+ C Q SPI
Sbjct: 99  AP-------AESLGQWQWYPASMAERSTGTYSALYPRSRFDYEGVF--RARLACEQFSPI 149

Query: 211 IPHNYKESSYPVSVFTYTIYNSGKTSADITLLFTWTNSVGGDSEFTGQHYNSKTKMND-- 268
              NY+ESSYPV+VF +T +N       +++L TW N VG    FT    + + K+ D  
Sbjct: 150 WADNYQESSYPVAVFEWTAHNPTDQLLTLSILLTWQNMVGW---FTNALKSPEVKVRDDG 206

Query: 269 -----------------------GVHAVLLHHRTSHQLPPV----TFALAAQ-ETDGVHV 300
                                  G     L  R      P      +A+A Q E  G  +
Sbjct: 207 SPVYEYQPRLGESEGNFNQWIAEGAWVGCLMDRAGRAEEPQEGEGQWAIATQCEGPGTEI 266

Query: 301 SLCPHFVISGNSLGLTAKDMWHEIKEHGSFDRLNSMETSVTSEPGSS---IGAAIAASVT 357
                +  +G+       ++W       +F +  ++E  +   P      IG AIA   T
Sbjct: 267 FYHTRWNPAGD-----GGELWR------TFAQTGALENRIDGTPAQGEEQIGGAIALRFT 315

Query: 358 VPPDSEGQVTFSLAWDCPEVNFMSGKTYYRRYTKFYGTHQNAAANIARDAILEHGSWELQ 417
           + P    +V F LAWD P   F +G   YRRYT F+G        IAR A+ E+      
Sbjct: 316 LAPGETRRVPFVLAWDFPVTEFAAGTAAYRRYTDFFGRSGRNGWAIARTALEEY------ 369

Query: 418 IEAWQRPILEDKRLPEWYPITLFNELYYLNAGGAVWTDGSPPVHSLVTIGHRKFSLDWSQ 477
             AWQ  I                                                 W Q
Sbjct: 370 -SAWQARIAA-----------------------------------------------WQQ 381

Query: 478 SDLKRIVDVPNQNDTAVNILERMSSILEQIYTPVALNSSFGTNLLQDGEENIGQFLYLEG 537
             L R  D+PN    A         +  ++Y   +  + +     +D    +GQF  LE 
Sbjct: 382 PILDR-ADLPNWFKMA---------LFNELYDLTSGGTLWSAASDRD---PVGQFAVLEC 428

Query: 538 IEYLMWNTYDVHFYSSFALIMLFPKIQLSIQRDFAAAVMMHDPSKMKLLD-------DGQ 590
           I+Y  + + DV  Y SF L+ML+P+++ S+ R FA A+   D  + +++        +  
Sbjct: 429 IDYRWYESLDVRLYGSFGLLMLWPELEKSVMRAFARAIPTAD-DRQRVIGYYYTQGTESP 487

Query: 591 WVSRKVLGAVPHDIGICD--PWFEVNAYCLYDTARWKDLNPKFVLQVYRDVVATG--DKK 646
              RK L A PHD+G  +  PW + N     D   WKDL   FVLQVYRD   TG  D  
Sbjct: 488 LALRKALDATPHDLGAPNEHPWEKTNYTSYQDCNLWKDLGSDFVLQVYRDFRLTGSTDDA 547

Query: 647 FAKAVWPSVYVAMAYMDQFDRDGDGMIENDGFPDQTYDTWSVSGISAYSGGLWVAALQAA 706
           F    WP+V  A+AY+  FD+DGDG+ EN G PDQT+D W + G+SAY GGLW+AAL+AA
Sbjct: 548 FIAECWPAVTRALAYLKAFDQDGDGIPENSGAPDQTFDDWRLQGVSAYCGGLWIAALEAA 607

Query: 707 SALAREVGDRGSED---------------YFLFKFQKAKVVYEKLWNGSYFNYDNSGSSQ 751
            AL   +      D               Y  +  Q   V  + LWNG Y+  D+   S 
Sbjct: 608 IALGERLQQHPVSDHPELSQGAIAHHLSTYRTWLAQSRAVYDDALWNGRYYRLDS--GSG 665

Query: 752 SSSIQADQLAGQWYARACGLLPIVDEDKARSALKKVYNYNVLKVMGGKRGAVNGMLPDGR 811
           S  + ADQL GQ+YAR   L  +V  D+A SALK VY+   L+   GK GA NG+ PDG 
Sbjct: 666 SEVVMADQLCGQFYARLLDLPDVVASDRALSALKTVYDSCFLRFHDGKFGAANGVKPDGS 725

Query: 812 VDM-SSMQSREIWSGVTYAVAASMIHEDLADIGFQTACGIYEAAWSGTGLGYAFQTPEAW 870
            +   +    E+W+G+ + +AA ++   L D GF+    + E  +     G  F+TPEA 
Sbjct: 726 PENPKATHPLEVWTGINFGIAAFLVQMGLRDEGFRLTQAVVEQIYE---HGLQFRTPEAI 782

Query: 871 NTDDQYRSLCYMRPLAIWAMQWALT 895
                +R+  Y+RP+AIWA+   LT
Sbjct: 783 TAVGTFRACHYLRPMAIWAVYGVLT 807


>gi|326934726|ref|XP_003213436.1| PREDICTED: non-lysosomal glucosylceramidase-like, partial
           [Meleagris gallopavo]
          Length = 639

 Score =  351 bits (901), Expect = 1e-93,   Method: Compositional matrix adjust.
 Identities = 204/413 (49%), Positives = 251/413 (60%), Gaps = 21/413 (5%)

Query: 508 YTPVALNSSFGTNLLQDG--EENIGQFLYLEGIEYLMWNTYDVHFYSSFALIMLFPKIQL 565
           + P         +LLQDG  E  IGQ       EY M+NTYDVHFY+SFAL+ML+PK+Q+
Sbjct: 233 FDPAGSGRDVWQDLLQDGKLESPIGQ-------EYRMYNTYDVHFYASFALVMLWPKLQI 285

Query: 566 SIQRDFAAAVMMHDPSKMKLLDDGQWVSRKVLGAVPHDIGICD--PWFEVNAYCLYDTAR 623
           S+Q D A  V+  D    + L  G+    K+   VPHDIG  D  PW  VNAY ++DTA 
Sbjct: 286 SLQYDIAVTVLNEDMQPRQYLVCGKTAQVKLKNVVPHDIGDPDDEPWQRVNAYLMHDTAN 345

Query: 624 WKDLNPKFVLQVYRDVVATGDKKFAKAVWPSVYVAMAYMDQFDRDGDGMIENDGFPDQTY 683
           WKDLN KFVLQVYRD   T D  + + +WP     M    +FD D DG+IEN G  DQTY
Sbjct: 346 WKDLNLKFVLQVYRDYYLTHDALYLQDMWPVCQAVMESELKFDTDNDGLIENGGTADQTY 405

Query: 684 DTWSVSGISAYSGGLWVAALQAASALAREVGDRGSEDYFLFKFQKAKVVYEK-LWNGSYF 742
           D W V+G SAY GGLW+AA++    +A  +GD  ++  +    QK K  +E+ LWNG Y+
Sbjct: 406 DAWVVNGASAYCGGLWLAAVRMMCEMAEVLGDAETQQKYGAILQKGKESFERLLWNGKYY 465

Query: 743 NYDNSGSSQSSSIQADQLAGQWYARACGL----LPIVDEDKARSALKKVYNYNVLKVMGG 798
           NYD+SGSS SSSI +DQ AGQW+  ACGL    L +  +    SALK ++  NV+   GG
Sbjct: 466 NYDSSGSSTSSSIMSDQCAGQWFLGACGLDQKELEVFPKSHIVSALKTIFEKNVMSFAGG 525

Query: 799 KRGAVNGMLPDGRVDMSSMQSREIWSGVTYAVAASMIHEDLADIGFQTACGIYEAAWSGT 858
             GAVNGM PDG  D SS+QS E+W GV YA+AA+MI E L   GF TA G Y   W   
Sbjct: 526 TMGAVNGMRPDGVPDTSSVQSSEVWVGVVYALAATMIQEGLVQEGFHTAEGCYRTVWE-- 583

Query: 859 GLGYAFQTPEAWNTDDQYRSLCYMRPLAIWAMQWAL---TRPKPKTLEKQMKP 908
            LG AFQTPEA+     YRSL YMRPL+IW+MQ AL   TR  P     Q  P
Sbjct: 584 NLGMAFQTPEAYCEKKVYRSLAYMRPLSIWSMQLALESKTRRPPSPTPTQKNP 636



 Score =  156 bits (395), Expect = 5e-35,   Method: Compositional matrix adjust.
 Identities = 95/256 (37%), Positives = 139/256 (54%), Gaps = 19/256 (7%)

Query: 79  KRAFIDPFIKRHLTSSHGVPLGGVGSGSIGRSYRGEFQRWQIFPRVCEDKPVLANQFSVF 138
           K AFID      L   +G PLGG+G G+I R +RGEF RWQ+ P     + V+ +QF+V 
Sbjct: 14  KSAFIDLLCAVPLQQIYGCPLGGIGGGTITRGWRGEFCRWQLNPGKYHYETVITDQFTVC 73

Query: 139 VSRSNGQKYSSVLCPKTPEVLKDTTAAGIGSWDWNLKGDKSTYHALYPRAWTVHEGEPDP 198
           + R     Y  VL  + P  L+         W+W   G  + YHALYPRAW V+E  P  
Sbjct: 74  LRRKGQTVYQQVLSVEKPSALQ--------GWNWGYCGRYAFYHALYPRAWMVYE-LPGQ 124

Query: 199 ELRIVCRQISPIIPHNYKESSYPVSVFTYTIYNSGKTSADITLLFTWTNSVGGDSEFTGQ 258
            + + CRQISP+IPH+YK+SS PV VF + + N+ +   D++++FT  N  G   + +G 
Sbjct: 125 NVVLTCRQISPVIPHDYKDSSLPVGVFIWEVENNSEEPVDVSIMFTLQNGTGTKGDGSGG 184

Query: 259 HYNSKTKMNDG---VHAVLLHHRTSHQLPPVTFALAAQETDGVHVSLCPHFVISGNSLGL 315
           H+N    + DG   V  VLLHH T   + P TFA+AA    G  ++    F  +G     
Sbjct: 185 HWNEPFALQDGGERVTGVLLHHCTP--INPFTFAVAAXXQAGTVITHLTAFDPAG----- 237

Query: 316 TAKDMWHEIKEHGSFD 331
           + +D+W ++ + G  +
Sbjct: 238 SGRDVWQDLLQDGKLE 253


>gi|443710865|gb|ELU04898.1| hypothetical protein CAPTEDRAFT_216398 [Capitella teleta]
          Length = 449

 Score =  341 bits (874), Expect = 1e-90,   Method: Compositional matrix adjust.
 Identities = 204/541 (37%), Positives = 289/541 (53%), Gaps = 109/541 (20%)

Query: 375 PEVNFMSGKT-YYRRYTKFYGTHQNAAANIARDAILEHGSWELQIEAWQRPILEDKRLPE 433
           P ++F + +  Y+RRY +F+G    A   +A  A+    SWE +I  WQ+P L+  +LP 
Sbjct: 2   PIIHFGAKRNKYHRRYARFFGHDGCATPRLAAHALNHFTSWEEKIAEWQQPTLKANKLPA 61

Query: 434 WYPITLFNELYYLNAGGAVWTDGSPPVHSLVTIGHRKFSLDWSQSDLKRIVDVPNQNDTA 493
           WY   LFNE Y+++ GG+VW D                            +DV +     
Sbjct: 62  WYKSALFNETYFISDGGSVWVDS---------------------------IDVKD----- 89

Query: 494 VNILERMSSILEQIYTPVALNSSFGTNLLQDGEENIGQFLYLEGIEYLMWNTYDVHFYSS 553
                   ++ E  +        FG     +G E            Y M+NT DVH+Y+S
Sbjct: 90  --------AVSEHPFV-----QEFGKFAYLEGHE------------YRMYNTLDVHYYAS 124

Query: 554 FALIMLFPKIQLSIQRDFAAAVMMHDPSKMKLLDDGQWVSRKVLGAVPHDIGICDPWFEV 613
           FAL+ L+P +QLS+Q D               +D+                   +PW  +
Sbjct: 125 FALMKLWPNLQLSLQYD---------------IDN-------------------EPWSRL 150

Query: 614 NAYCLYDTARWKDLNPKFVLQVYRDVVATGDKKFAKAVWPSVYVAMAYMDQFDRDGDGMI 673
           NAY ++ T  WKDLNPKFVLQVYRD V T D+++   ++P     M +  ++D D DG+I
Sbjct: 151 NAYIIHPTNEWKDLNPKFVLQVYRDYVFTKDEQYLYEMYPQCKAVMDHSLRWDVDRDGII 210

Query: 674 ENDGFPDQTYDTWSVSGISAYSGGLWVAALQAASALAREVGDRGSEDYFLFKFQK----- 728
           +N GF DQTYD W+V+G SAY GGLW+AA++A + +A  +G    ED  L K+++     
Sbjct: 211 DNSGFADQTYDAWTVTGASAYCGGLWLAAVKAFTEMAHRLG--VVED--LMKYEEILERG 266

Query: 729 AKVVYEKLWNG----SYFNYDNSGSSQSSSIQADQLAGQWYARACGLL--PIVDEDKARS 782
            K   EKLWNG     Y+NYD S S    SI ADQ+ GQW+ +ACG+    +      + 
Sbjct: 267 KKNFEEKLWNGWVDCEYYNYDCSESGHHDSIMADQMCGQWFLKACGVADNAVFPSANVKR 326

Query: 783 ALKKVYNYNVLKVMGGKRGAVNGMLPDGRVDMSSMQSREIWSGVTYAVAASMIHEDLADI 842
           AL+ V+ +NVL   GG+ GA+NGM P+ + D++S QS E W+GVTYA+ A+MI   + D 
Sbjct: 327 ALEAVFKHNVLPFDGGRMGAINGMRPNAKKDVTSCQSDEFWTGVTYALGATMIQVGMIDK 386

Query: 843 GFQTACGIYEAAWSGTGLGYAFQTPEAWNTDDQYRSLCYMRPLAIWAMQWALTRPKPKTL 902
           GFQTA G Y   W   GL  AFQTPEA+  + ++RSL YMRPLAIWA+Q A+ +  P+  
Sbjct: 387 GFQTAYGAYHTCWERYGL--AFQTPEAYFDNRRFRSLGYMRPLAIWAIQHAVEKFHPQFF 444

Query: 903 E 903
           E
Sbjct: 445 E 445


>gi|449275451|gb|EMC84314.1| Non-lysosomal glucosylceramidase [Columba livia]
          Length = 692

 Score =  337 bits (864), Expect = 2e-89,   Method: Compositional matrix adjust.
 Identities = 185/365 (50%), Positives = 233/365 (63%), Gaps = 7/365 (1%)

Query: 535 LEGIEYLMWNTYDVHFYSSFALIMLFPKIQLSIQRDFAAAVMMHDPSKMKLLDDGQWVSR 594
           L G EY M+NTYDVHFY+SFALIML+PK+Q+S+Q D A AV+  D    + L  GQ    
Sbjct: 314 LPGQEYRMYNTYDVHFYASFALIMLWPKLQISLQYDIAVAVVNEDVQPRQYLMGGQTAPV 373

Query: 595 KVLGAVPHDIG--ICDPWFEVNAYCLYDTARWKDLNPKFVLQVYRDVVATGDKKFAKAVW 652
           K    VPHDIG    +PW  VNAY ++DTA WKDLN KFVLQVYRD   T D  + + +W
Sbjct: 374 KTKNVVPHDIGDPADEPWQRVNAYLIHDTANWKDLNLKFVLQVYRDYYLTHDSLYLRDMW 433

Query: 653 PSVYVAMAYMDQFDRDGDGMIENDGFPDQTYDTWSVSGISAYSGGLWVAALQAASALARE 712
           P     M    +FD D DG+IEN GF DQTYD W V+G SAY GGLW+AA+     +A  
Sbjct: 434 PVCQAVMESELKFDMDNDGLIENGGFADQTYDAWVVNGASAYCGGLWLAAVCMMCKMAEV 493

Query: 713 VGDRGSEDYFLFKFQKAKVVYEK-LWNGSYFNYDNSGSSQSSSIQADQLAGQWYARACGL 771
           +GD   +  ++   +K K  +E+ LWNG Y+NYD+SGS+ SSSI +DQ AG  + +    
Sbjct: 494 LGDAEIQQKYMDILRKGKETFERLLWNGKYYNYDSSGSNTSSSIMSDQCAG--HPKPPLS 551

Query: 772 LPIVDEDKARSALKKVYNYNVLKVMGGKRGAVNGMLPDGRVDMSSMQSREIWSGVTYAVA 831
           + +  +    SALK ++  NV+   GG  GAVNGM P G  D SS+QS E+W GV YA+A
Sbjct: 552 MQVFPKSHVVSALKTIFEKNVMGFAGGTMGAVNGMRPSGVPDTSSVQSNEVWVGVVYALA 611

Query: 832 ASMIHEDLADIGFQTACGIYEAAWSGTGLGYAFQTPEAWNTDDQYRSLCYMRPLAIWAMQ 891
           A+MI E L + GF+TA G Y   W    LG AFQTPEA+     YRSL YMRPL+IW+MQ
Sbjct: 612 ATMIQEGLVEEGFRTAEGCYRTVWE--QLGMAFQTPEAYREKKVYRSLAYMRPLSIWSMQ 669

Query: 892 WALTR 896
            AL R
Sbjct: 670 LALER 674



 Score =  121 bits (304), Expect = 2e-24,   Method: Compositional matrix adjust.
 Identities = 73/202 (36%), Positives = 109/202 (53%), Gaps = 21/202 (10%)

Query: 134 QFSVFVSRSNGQK-YSSVLCPKTPEVLKDTTAAGIGSWDWNLKGDKSTYHALYPRAWTVH 192
           QF+V + R  GQ  Y  VL  + P  L+         W+W   G  + YHALYPRAW V+
Sbjct: 6   QFTVCL-RCKGQTVYQQVLSVERPSTLQ--------GWNWGYCGHYAFYHALYPRAWMVY 56

Query: 193 EGEPDPELRIVCRQISPIIPHNYKESSYPVSVFTYTIYNSGKTSADITLLFTWTNSVGGD 252
           E  P   + + CRQISP+IPH+YK+SS PV VF + + N      D++++F+  N  G  
Sbjct: 57  E-LPGQNVVLTCRQISPVIPHDYKDSSLPVGVFIWEVENGRDEDVDVSIMFSLQNGTGTK 115

Query: 253 SEFTGQHYNSK---TKMNDGVHAVLLHHRTSHQLPPVTFALAAQETDGVHVSLCPHFVIS 309
            + +G H+N      K  + V  VLLHH T   + P TFA++A+E  GV V+    F  +
Sbjct: 116 EDRSGGHWNEPFTFEKEGERVAGVLLHHCT--HVNPFTFAISAREKAGVVVTHLTAFNPA 173

Query: 310 GNSLGLTAKDMWHEIKEHGSFD 331
           G     + +++W ++ + G  D
Sbjct: 174 G-----SGREVWQDLLQDGQLD 190


>gi|328869576|gb|EGG17953.1| hypothetical protein DFA_06619 [Dictyostelium fasciculatum]
          Length = 1200

 Score =  337 bits (863), Expect = 2e-89,   Method: Compositional matrix adjust.
 Identities = 177/399 (44%), Positives = 246/399 (61%), Gaps = 30/399 (7%)

Query: 526  EENIGQFLYLEGIEYLMWNTYDVHFYSSFALIMLFPKIQLSIQRDFAAAVMM-HDPSKMK 584
            ++ IGQF YLE  EYLM+NTYDVHFY+SFAL  L+P ++ S+Q   A A M  +    ++
Sbjct: 790  DKEIGQFAYLESQEYLMYNTYDVHFYASFALATLWPSLEFSLQSGIADATMEDYGGETVE 849

Query: 585  LLDDGQWVSRKVLGAVPHDIGIC--DPWFEVNAYCLYDTARWKDLNPKFVLQVYRDVVAT 642
             +  G+ + RK+ G VPHDIG    DPW  VNAY + D +RWKDL  KF+LQVYRD + T
Sbjct: 850  CIHSGKQIPRKLRGTVPHDIGNPGEDPWKRVNAYNIQDISRWKDLPCKFILQVYRDYLLT 909

Query: 643  ------------------------GDKKFAKAVWPSVYVAMAYMDQFDRDGDGMIENDGF 678
                                    GD+ F   +W +V   +    +FD D DG+I+N+GF
Sbjct: 910  SVDSSGQKSNATSSSFGQEYIPLDGDRGFLLQMWGTVEEVIRKAFEFDTDDDGVIDNEGF 969

Query: 679  PDQTYDTWSVSGISAYSGGLWVAALQAASALAREVGDRGSEDYFLFKFQKAKVVY-EKLW 737
            PDQTYDTWS SG SAY+GGLW+AA++A SA+AR +G R  E+ +   F+K K  Y +KLW
Sbjct: 970  PDQTYDTWSASGCSAYTGGLWLAAIKATSAMARILGLRDDEEVYSKLFEKGKKSYNKKLW 1029

Query: 738  NGSYFNYDNSGSSQSSSIQADQLAGQWYARACGLLPIVDEDKARSALKKVYNYNVLKVMG 797
            NG YFNYD+S  S  +SI ADQLAG WY  +CGL   +  D+A S L  +  YN+     
Sbjct: 1030 NGHYFNYDSSKQSHYNSIMADQLAGHWYLTSCGLSSYMTLDQALSTLSIINEYNIKSYSN 1089

Query: 798  GKRGAVNGMLPDGRVDMSSMQSREIWSGVTYAVAASMIHEDLADIGFQTACGIYEAAWSG 857
            G  GAVNG+ P   VD + +QS E+W G +Y++A++ +   +    +    G+  +++  
Sbjct: 1090 GSCGAVNGISPLAIVDQTCLQSSEVWIGTSYSLASTFLLHYMDKEAWSLIKGLVNSSYE- 1148

Query: 858  TGLGYAFQTPEAWNTDDQYRSLCYMRPLAIWAMQWALTR 896
               G+ +QTPEAW+ +  +R+  YMRPLAIW++QWAL+R
Sbjct: 1149 -KWGFQYQTPEAWDMNGSFRASTYMRPLAIWSVQWALSR 1186


>gi|254410084|ref|ZP_05023864.1| conserved hypothetical protein [Coleofasciculus chthonoplastes PCC
           7420]
 gi|196183120|gb|EDX78104.1| conserved hypothetical protein [Coleofasciculus chthonoplastes PCC
           7420]
          Length = 814

 Score =  335 bits (860), Expect = 6e-89,   Method: Compositional matrix adjust.
 Identities = 268/864 (31%), Positives = 385/864 (44%), Gaps = 153/864 (17%)

Query: 95  HGVPLGGVGSGSIGRSYRGEFQRWQIFPRVCEDKPVLANQFSVFVSRSNGQKYSSVLCPK 154
           HG+PLGG G+G +GRS RG+F  W +       K + A QFSVF    N    +  LC +
Sbjct: 41  HGMPLGGFGAGCLGRSPRGDFNLWHLDGGEHTFKRLPACQFSVFEQPENQDAQAYALCTE 100

Query: 155 TPEVLKDTTAAGIGSWDW-----NLKGDKS------TYHALYPRAWTVHEGEPDPELRIV 203
            PE         +G W W     N++          TYHALYPR+W V+E     EL   
Sbjct: 101 PPE------DGSLGKWQWYPTSPNVETRHGASLRTGTYHALYPRSWFVYENVFQTEL--T 152

Query: 204 CRQISPIIPHNYKESSYPVSVFTYTIYNSGKTSADITLLFTWTNSVGGDSEFTGQHYNSK 263
           C+Q SPI    Y+ESSYPV+VF +T +N       ++++ TW N VG    FT    + +
Sbjct: 153 CQQFSPIWADCYQESSYPVAVFEWTAHNPTDQPITLSIMLTWQNMVGW---FTNAIQHPQ 209

Query: 264 TKMNDGVHAVLLHH------------------RTSHQLPPVTFALAAQETDGVHVSLCPH 305
            ++ D    V  +                   R    L  V      +E +G    LC  
Sbjct: 210 VQVRDDGSPVYEYQPRWGNSTGNYNQWIVDNFRVGCLLDRVRLRDEIEEGEG---QLCFA 266

Query: 306 FVISG----------NSLGLTAKDMWHEIKEHGSFDRLNSMETSVTSEPGSSIGAAIAAS 355
            V +           N  G    ++W+     GS   L+ ++    +EPG  I  AIA  
Sbjct: 267 TVTNPSQEVFYHCRWNPTG-DGSEVWNTFASDGS---LSDVQDETPAEPGEQIAGAIAVR 322

Query: 356 VTVPPDSEGQVTFSLAWDCPEVNFMSGKTYYRRYTKFYGTHQNAAANIARDAILEHGSWE 415
           V + P    ++ F LAWD P   F SG  YYRRYT F+G   N A ++ R A L+H    
Sbjct: 323 VRLRPGRTRKIPFILAWDFPVTEFASGVKYYRRYTDFFGRTGNNAWSMVRTA-LKHS--- 378

Query: 416 LQIEAWQRPILEDKRLPEWYPITLFNELYYLNAGGAVWTDGSPPVHSLVTIGHRKFSLDW 475
              + WQ  IL                                                W
Sbjct: 379 ---DLWQERILS-----------------------------------------------W 388

Query: 476 SQSDLKRIVDVPNQNDTAVNILERMSSILEQIYTPVALNSSFGTNLLQDGEEN-IGQFLY 534
               L+R  D+PN    A             ++  + L S+ GT      E + +G+F  
Sbjct: 389 QNPILQR-HDLPNWFKMA-------------LFNELYLLSAGGTIWSSANERDPVGKFGV 434

Query: 535 LEGIEYLMWNTYDVHFYSSFALIMLFPKIQLSIQRDFAAAVMMHDPSKMKLLDDGQWVSR 594
           LE  +Y  + + DV  Y SFAL ML+ ++  ++   FA A+   D +   +  +     R
Sbjct: 435 LESFDYRWYESLDVRLYGSFALAMLWSRLDKAVLEAFARAIPTSDDTPRIIGYNQATAIR 494

Query: 595 KVLGAVPHDIGICD--PWFEVNAYCLYDTARWKDLNPKFVLQVYRDVVATGDK--KFAKA 650
           K   A PHD+G  +  PW + N     D  +WKDL   FVLQVYR  V TG K  +F   
Sbjct: 495 KAADATPHDLGAANEHPWEKTNYTSYQDCNQWKDLPCDFVLQVYRTFVFTGSKDIEFLAE 554

Query: 651 VWPSVYVAMAYMDQFDRDGDGMIENDGFPDQTYDTWSVSGISAYSGGLWVA--------- 701
            W ++  A+AY+  FD DGDG+ EN G PDQT+D W + G+SAY GGLW+A         
Sbjct: 555 CWSAIVRALAYLKSFDHDGDGIPENSGAPDQTFDDWQLQGVSAYCGGLWLAALEAASAIA 614

Query: 702 -------ALQAASALAREVGDRGSEDYFLFKFQKAKVVY-EKLWNGSYFNYDNSGSSQSS 753
                   L +A  L      + + + +    Q+A+ +Y EKLWNG Y+  D+   S S 
Sbjct: 615 QILLDNYTLVSAINLPDPESIQHTLNTYHKWLQQAQPIYQEKLWNGQYYRLDS--DSDSD 672

Query: 754 SIQADQLAGQWYARACGLLPIVDEDKARSALKKVYNYNVLKVMGGKRGAVNGMLPDGRVD 813
            + ADQL GQ+YA+  GL  IV  +  +SALK VY    LK    + GA NG+ PDG  +
Sbjct: 673 VVMADQLCGQFYAQLLGLPDIVPPECTQSALKTVYESCFLKFHNAQFGAANGVKPDGSPE 732

Query: 814 M-SSMQSREIWSGVTYAVAASMIHEDLADIGFQTACGIYEAAWSGTGLGYAFQTPEAWNT 872
              +    E+W+G+ + +AA +I   + D   +    +    +     G  F+TPEA   
Sbjct: 733 HPDATHPLEVWTGINFGLAAFLIQLGMKDEALKLTESVVRQVYEN---GLQFRTPEAITP 789

Query: 873 DDQYRSLCYMRPLAIWAMQWALTR 896
              +R+  Y+R +AIWA+   L++
Sbjct: 790 VGTFRAGHYLRAMAIWAVYLTLSQ 813


>gi|156554493|ref|XP_001604835.1| PREDICTED: non-lysosomal glucosylceramidase-like [Nasonia
           vitripennis]
          Length = 826

 Score =  332 bits (850), Expect = 7e-88,   Method: Compositional matrix adjust.
 Identities = 171/386 (44%), Positives = 239/386 (61%), Gaps = 8/386 (2%)

Query: 519 TNLLQ--DGEENIGQFLYLEGIEYLMWNTYDVHFYSSFALIMLFPKIQLSIQRDFAAAVM 576
           +N LQ  D     G+F YLEG EY M+NTYDVHFY++FAL  L+P ++  IQ +F  +V 
Sbjct: 422 SNELQPDDPRRRYGRFAYLEGHEYRMYNTYDVHFYAAFALAQLWPNLEACIQYEFRDSVH 481

Query: 577 MHDPSKMKLLDDGQWVSRKVLGAVPHDIGIC--DPWFEVNAYCLYDTARWKDLNPKFVLQ 634
             D  +++ L DG    RKV G+VPHDIG    +P+  +NAY ++D ++W+DLN KFVL 
Sbjct: 482 AEDQQRLQGLYDGSKRHRKVKGSVPHDIGDPGEEPFDLINAYPIHDVSQWRDLNSKFVLS 541

Query: 635 VYRDVVATGDKKFAKAVWPSVYVAMAYMDQFDRDGDGMIENDGFPDQTYDTWSVSGISAY 694
            YR      + +  +  W ++ + + +   FD D DG+IEN GFPDQTYD W +SG SAY
Sbjct: 542 CYRIYFFNRNLEQLRDFWSTIKLVLEHSLTFDEDNDGLIENGGFPDQTYDCWVMSGPSAY 601

Query: 695 SGGLWVAALQAASALAREVGDRGSEDYFLFKFQKAKVVYE-KLWNGSYFNYDNSGSSQSS 753
            GGLW+AAL  A  +A  +G+   E  +     + KV ++ KLWNG Y+N+D        
Sbjct: 602 CGGLWIAALHCAVKMAELLGESEDEARYKGILDRGKVAFQDKLWNGKYYNFDCGKDESKL 661

Query: 754 SIQADQLAGQWYARACGL-LPIVDEDKARSALKKVYNYNVLKVMGGKRGAVNGMLPDGRV 812
           SI +DQL G W  RACG    +  +D+ RS+L+ V+  NV+K   G++GAVNG  P G +
Sbjct: 662 SIMSDQLCGHWLLRACGFSYEVFPQDRVRSSLETVFQNNVMKYKNGQQGAVNGFSPTGSI 721

Query: 813 DMSSMQSREIWSGVTYAVAASMIHEDLADIGFQTACGIYEAAWSGTGLGYAFQTPEAWNT 872
           D S +QS E+W+GV Y +AA +IHE + +  F+TA G+Y   +   G+G  F+TPEA   
Sbjct: 722 DYSCIQSEEMWTGVAYGLAALLIHEGMIEEAFRTAEGVYRTVYEKIGMG--FETPEALYE 779

Query: 873 DDQYRSLCYMRPLAIWAMQWALTRPK 898
              YR++ YMRPL+IWAMQ A T  K
Sbjct: 780 HKVYRAIGYMRPLSIWAMQHAWTMRK 805



 Score =  222 bits (566), Expect = 6e-55,   Method: Compositional matrix adjust.
 Identities = 139/405 (34%), Positives = 220/405 (54%), Gaps = 24/405 (5%)

Query: 53  NWKETVQLAPIGVRILCLIREEAAKGKRAFIDPFIKRHLTSSHGVPLGGVGSGSIGRSYR 112
            +K+T+Q+ P+ VR +      +  G+   +D F  RH    +GVP+GG+G GSIGR +R
Sbjct: 32  KFKQTIQMMPLAVRYVFYYTNVSRYGRPVLMDYFNMRHGQQIYGVPIGGLGGGSIGRGFR 91

Query: 113 GEFQRWQIFPRVCEDKPVLANQFSVFVSRSNGQK-YSSVLCPKT-PEVLKDTTAAGIGSW 170
           GEF R+ + P +     + ANQF + V   NG   Y+ VL  KT P+ L+        SW
Sbjct: 92  GEFCRYALLPGLYSYHVIPANQFILTVRDVNGNTIYNQVLSTKTKPKHLR--------SW 143

Query: 171 DWNLKGDKSTYHALYPRAWTVHEGEPDPELRIVCRQISPIIPHNYKESSYPVSVFTYTIY 230
            W  +G+K++Y  LYPR+WT +E E + EL++ CRQ+SP+IPH+YK+SS P +VF + + 
Sbjct: 144 KWGFEGNKASYTGLYPRSWTTYEVE-ELELKLTCRQVSPVIPHDYKDSSLPCAVFVWDVV 202

Query: 231 NSGKTSADITLLFTWTNSVGGDSEFTGQHYNSKTKMNDGVHAVLLHHRTSHQLPPVTFAL 290
           N       I + FT+ +      +  G  +    ++++     ++H        P T+A+
Sbjct: 203 NRSNRDLRIAITFTFQSGCASKEDAQGDKWTEFFELDES-RGTMIHQEFRGM--PCTYAI 259

Query: 291 AAQ-ETDGVHVSLCPHFVISGNSLGLTAKDMWHEIKEHGSFDRLNSMETSVTSEPGSSIG 349
           +++   D V V+    F   GN   L     W  ++  G FD   S E  ++S+    I 
Sbjct: 260 SSRTRPDDVGVTRMIGFDPRGNGAVL-----WQRLQNAGRFD--ESAEPRISSKTRKPIA 312

Query: 350 AAIAASVTVPPDSEGQVTFSLAWDCPEVNFM-SGKTYYRRYTKFYGTHQNAAANIARDAI 408
            A+ A   V   S G + FSLAW  P++ F    K + R YT+F+ + + A   I R A+
Sbjct: 313 TAVCADTLVKAGSSGFLEFSLAWHMPKIQFFFKKKDHLRYYTQFF-SGEKAGPEICRYAL 371

Query: 409 LEHGSWELQIEAWQRPILEDKRLPEWYPITLFNELYYLNAGGAVW 453
            ++  WE +IE WQ+ +LED  LP+WY   +FNELY+++ GG++W
Sbjct: 372 SQYERWEDEIERWQKSVLEDPDLPDWYKSAIFNELYFVSDGGSLW 416


>gi|345289099|gb|AEN81041.1| AT1G33700-like protein, partial [Capsella rubella]
          Length = 191

 Score =  332 bits (850), Expect = 8e-88,   Method: Compositional matrix adjust.
 Identities = 152/190 (80%), Positives = 173/190 (91%)

Query: 527 ENIGQFLYLEGIEYLMWNTYDVHFYSSFALIMLFPKIQLSIQRDFAAAVMMHDPSKMKLL 586
           ENIGQFLYLEG +YLM+NTYDVHFYSSFAL+MLFPK++LSIQRDFAAAV+MHD S+ +++
Sbjct: 2   ENIGQFLYLEGTQYLMYNTYDVHFYSSFALLMLFPKLELSIQRDFAAAVLMHDSSRKQVM 61

Query: 587 DDGQWVSRKVLGAVPHDIGICDPWFEVNAYCLYDTARWKDLNPKFVLQVYRDVVATGDKK 646
             G++V+RKVLGAVPHDIG+ DPWFEVNAY L++T RWKDLN KFVLQVYRDVVATGD  
Sbjct: 62  SSGEFVTRKVLGAVPHDIGLNDPWFEVNAYNLFNTDRWKDLNSKFVLQVYRDVVATGDLN 121

Query: 647 FAKAVWPSVYVAMAYMDQFDRDGDGMIENDGFPDQTYDTWSVSGISAYSGGLWVAALQAA 706
           FAKAVWPSVY A+AY+DQFD+DGDGMIENDGFPDQTYD WS SG+SAY GGLWVAALQA 
Sbjct: 122 FAKAVWPSVYTAIAYLDQFDKDGDGMIENDGFPDQTYDAWSCSGVSAYCGGLWVAALQAG 181

Query: 707 SALAREVGDR 716
           SALARE+GD 
Sbjct: 182 SALAREIGDN 191


>gi|390344694|ref|XP_001200450.2| PREDICTED: uncharacterized protein LOC764213 [Strongylocentrotus
           purpuratus]
          Length = 1283

 Score =  331 bits (848), Expect = 1e-87,   Method: Compositional matrix adjust.
 Identities = 194/558 (34%), Positives = 286/558 (51%), Gaps = 78/558 (13%)

Query: 49  QFTLNWKETVQLAPIGVRILCLIREEAAK--------GKRAFIDPFIKRHLTSSHGVPLG 100
           +F+  W+    + P  ++   L     A+        GKR FID          +GVP+G
Sbjct: 99  KFSEKWRPLTMVRPHQIKSFVLPSVRYAQMHLKMWRGGKRPFIDVLHPMKFKPVYGVPVG 158

Query: 101 GVGSGSIGRSYRGEFQRWQIFPRVCEDKPVLANQFSVFVSRSNGQKYSSVLCPKTPEVLK 160
           G+G G+I R ++G+F RW + P       V  NQF+V V R     Y  VL P  P   K
Sbjct: 159 GIGCGAIDRGWKGDFCRWSLSPGRYTYDNVELNQFTVCVRRGGKTVYQQVLSPNKPHN-K 217

Query: 161 DTTAAGIGSWDWNLKGDKSTYHALYPRAWTVHEGEPDPELRIVCRQISPIIPHNYKESSY 220
           D   A    W+W   G++++YH LYPRAWT +   P   + +VCRQISP+ PHNY+E+S 
Sbjct: 218 DLAQA----WEWQFPGNRASYHGLYPRAWTKYL-LPGQNITLVCRQISPVFPHNYQETSL 272

Query: 221 PVSVFTYTIYNSGKTSADITLLFTWTNSVGGDSEFTGQHYNSKTKMNDGVHAVLLHHRTS 280
           PV VF + +YN G  +AD++++FTW N  G  ++  G  +N+    +     V+ H    
Sbjct: 273 PVGVFVWDVYNHGTAAADVSIMFTWKNGTGTSADKKGGRWNTGFS-SASTSGVMFHD--G 329

Query: 281 HQLPPVTFALAAQETDGVHVSLCPHFVISGNSLGLTAKDMWHEIKEHGSFDRLNSME-TS 339
           H   P T  +AA + D V V+ C  F  S        K++W+++KE G   +LN+ +  +
Sbjct: 330 HPTQPCTMCIAAADGDDVTVTTCASFNPSSQ-----LKELWNDLKEDG---QLNTTKGPT 381

Query: 340 VTSEPGSSIGAAIAASVTVPPDSEGQVTFSLAWDCPEVNFMSGKTYYRRYTKFYGTHQNA 399
             +E G ++  A+ +S  + P+ +G + F+L+WD P+V F S  +Y+RRYT+++G   ++
Sbjct: 382 ERTEKGETLAGAVVSSRHIEPNQQGLLEFALSWDMPKVQFDSC-SYFRRYTRWFGRDGDS 440

Query: 400 AANIARDAILEHGSWELQIEAWQRPILEDKRLPEWYPITLFNELYYLNAGGAVWTDGSPP 459
           A  +   A+ E+G WE  I  WQ PILE+ RLP WY   LFNELY++  GG +W D    
Sbjct: 441 ALALCSHALEEYGDWEKNINEWQNPILENDRLPAWYKSALFNELYFMTDGGGIWVDVDD- 499

Query: 460 VHSLVTIGHRKFSLDWSQSDLKRIVDVPNQNDTAVNILERMSSILEQIYTPVALNSSFGT 519
                                    + PN  +   N   R SS++ +             
Sbjct: 500 -------------------------EYPNGEN---NAHTRASSLVRE------------- 518

Query: 520 NLLQDGEENIGQFLYLEGIEYLMWNTYDVHFYSSFALIMLFPKIQLSIQRDFAAAVMMHD 579
                     G+F YLEG EY M+NTYDVHFY+SFALIML+PK++LS+Q DF   V   D
Sbjct: 519 ---------YGRFGYLEGHEYRMYNTYDVHFYASFALIMLWPKLELSMQYDFGEYVDHKD 569

Query: 580 PSKMKLLDDGQWVSRKVL 597
               + L  G    R+VL
Sbjct: 570 EDSFQDLSSGLRSIRRVL 587


>gi|115633999|ref|XP_780575.2| PREDICTED: non-lysosomal glucosylceramidase-like, partial
           [Strongylocentrotus purpuratus]
          Length = 597

 Score =  330 bits (847), Expect = 2e-87,   Method: Compositional matrix adjust.
 Identities = 194/558 (34%), Positives = 286/558 (51%), Gaps = 78/558 (13%)

Query: 49  QFTLNWKETVQLAPIGVRILCLIREEAAK--------GKRAFIDPFIKRHLTSSHGVPLG 100
           +F+  W+    + P  ++   L     A+        GKR FID          +GVP+G
Sbjct: 99  KFSEKWRPLTMVRPHQIKSFVLPSVRYAQMHLKMWRGGKRPFIDVLHPMKFKPVYGVPVG 158

Query: 101 GVGSGSIGRSYRGEFQRWQIFPRVCEDKPVLANQFSVFVSRSNGQKYSSVLCPKTPEVLK 160
           G+G G+I R ++G+F RW + P       V  NQF+V V R     Y  VL P  P   K
Sbjct: 159 GIGCGAIDRGWKGDFCRWSLSPGRYTYDNVELNQFTVCVRRGGKTVYQQVLSPNKPHN-K 217

Query: 161 DTTAAGIGSWDWNLKGDKSTYHALYPRAWTVHEGEPDPELRIVCRQISPIIPHNYKESSY 220
           D   A    W+W   G++++YH LYPRAWT +   P   + +VCRQISP+ PHNY+E+S 
Sbjct: 218 DLAQA----WEWQFPGNRASYHGLYPRAWTKYL-LPGQNITLVCRQISPVFPHNYQETSL 272

Query: 221 PVSVFTYTIYNSGKTSADITLLFTWTNSVGGDSEFTGQHYNSKTKMNDGVHAVLLHHRTS 280
           PV VF + +YN G  +AD++++FTW N  G  ++  G  +N+    +     V+ H    
Sbjct: 273 PVGVFVWDVYNHGTAAADVSIMFTWKNGTGTSADKKGGRWNTGFS-SASTSGVMFH--DG 329

Query: 281 HQLPPVTFALAAQETDGVHVSLCPHFVISGNSLGLTAKDMWHEIKEHGSFDRLNSME-TS 339
           H   P T  +AA + D V V+ C  F  S        K++W+++KE G   +LN+ +  +
Sbjct: 330 HPTQPCTMCIAAADGDDVTVTTCASFNPSSQ-----LKELWNDLKEDG---QLNTTKGPT 381

Query: 340 VTSEPGSSIGAAIAASVTVPPDSEGQVTFSLAWDCPEVNFMSGKTYYRRYTKFYGTHQNA 399
             +E G ++  A+ +S  + P+ +G + F+L+WD P+V F S  +Y+RRYT+++G   ++
Sbjct: 382 ERTEKGETLAGAVVSSRHIEPNQQGLLEFALSWDMPKVQFDSC-SYFRRYTRWFGRDGDS 440

Query: 400 AANIARDAILEHGSWELQIEAWQRPILEDKRLPEWYPITLFNELYYLNAGGAVWTDGSPP 459
           A  +   A+ E+G WE  I  WQ PILE+ RLP WY   LFNELY++  GG +W D    
Sbjct: 441 ALALCSHALEEYGDWEKNINEWQNPILENDRLPAWYKSALFNELYFMTDGGGIWVDVDD- 499

Query: 460 VHSLVTIGHRKFSLDWSQSDLKRIVDVPNQNDTAVNILERMSSILEQIYTPVALNSSFGT 519
                                    + PN  +   N   R SS++ +             
Sbjct: 500 -------------------------EYPNGEN---NAHTRASSLVRE------------- 518

Query: 520 NLLQDGEENIGQFLYLEGIEYLMWNTYDVHFYSSFALIMLFPKIQLSIQRDFAAAVMMHD 579
                     G+F YLEG EY M+NTYDVHFY+SFALIML+PK++LS+Q DF   V   D
Sbjct: 519 ---------YGRFGYLEGHEYRMYNTYDVHFYASFALIMLWPKLELSMQYDFGEYVDHKD 569

Query: 580 PSKMKLLDDGQWVSRKVL 597
               + L  G    R+VL
Sbjct: 570 EDSFQDLSSGLRSIRRVL 587


>gi|345289089|gb|AEN81036.1| AT1G33700-like protein, partial [Capsella rubella]
 gi|345289091|gb|AEN81037.1| AT1G33700-like protein, partial [Capsella rubella]
 gi|345289093|gb|AEN81038.1| AT1G33700-like protein, partial [Capsella rubella]
 gi|345289095|gb|AEN81039.1| AT1G33700-like protein, partial [Capsella rubella]
 gi|345289097|gb|AEN81040.1| AT1G33700-like protein, partial [Capsella rubella]
 gi|345289101|gb|AEN81042.1| AT1G33700-like protein, partial [Capsella rubella]
 gi|345289103|gb|AEN81043.1| AT1G33700-like protein, partial [Capsella rubella]
          Length = 191

 Score =  330 bits (846), Expect = 2e-87,   Method: Compositional matrix adjust.
 Identities = 151/190 (79%), Positives = 173/190 (91%)

Query: 527 ENIGQFLYLEGIEYLMWNTYDVHFYSSFALIMLFPKIQLSIQRDFAAAVMMHDPSKMKLL 586
           ENIGQFLYLEG +YLM+NTYDVHFYSSFAL+MLFPK++LSIQRDFAAAV+MHD S+ +++
Sbjct: 2   ENIGQFLYLEGTQYLMYNTYDVHFYSSFALLMLFPKLELSIQRDFAAAVLMHDSSRKQVM 61

Query: 587 DDGQWVSRKVLGAVPHDIGICDPWFEVNAYCLYDTARWKDLNPKFVLQVYRDVVATGDKK 646
             G++V+RKVLGAVPHDIG+ DPWFEVNAY L++T RWKDLN KFVLQVYRDVVATGD  
Sbjct: 62  SSGEFVTRKVLGAVPHDIGLNDPWFEVNAYNLFNTDRWKDLNSKFVLQVYRDVVATGDLN 121

Query: 647 FAKAVWPSVYVAMAYMDQFDRDGDGMIENDGFPDQTYDTWSVSGISAYSGGLWVAALQAA 706
           FAKAVWPSVY A+AY+DQFD+DGDGMIEN+GFPDQTYD WS SG+SAY GGLWVAALQA 
Sbjct: 122 FAKAVWPSVYTAIAYLDQFDKDGDGMIENEGFPDQTYDAWSCSGVSAYCGGLWVAALQAG 181

Query: 707 SALAREVGDR 716
           SALARE+GD 
Sbjct: 182 SALAREIGDN 191


>gi|333898016|ref|YP_004471890.1| glucosylceramidase [Thermoanaerobacterium xylanolyticum LX-11]
 gi|333113281|gb|AEF18218.1| Glucosylceramidase [Thermoanaerobacterium xylanolyticum LX-11]
          Length = 806

 Score =  330 bits (846), Expect = 2e-87,   Method: Compositional matrix adjust.
 Identities = 170/367 (46%), Positives = 236/367 (64%), Gaps = 10/367 (2%)

Query: 532 FLYLEGIEYLMWNTYDVHFYSSFALIMLFPKIQLSIQRDFAAAVMMHDPSKMKLLDDGQW 591
           F  LE  +Y  + T DV FY SF L+ML+P I+  + R FA  + + D S+ K+  +G  
Sbjct: 437 FGLLECFDYNYYETLDVRFYGSFPLVMLWPDIEKQVMRQFADTINVQDSSEFKVGSNGAM 496

Query: 592 VSRKVLGAVPHDIG--ICDPWFEVNAYCLYDTARWKDLNPKFVLQVYRDVVATG--DKKF 647
             +KV G +PHD+G     PW ++NAY   +   WKDLN K+VL VYRD V TG  DK+F
Sbjct: 497 AVKKVQGMIPHDLGSSYALPWIKINAYDWQNPNIWKDLNSKYVLLVYRDYVLTGKTDKEF 556

Query: 648 AKAVWPSVYVAMAYMDQFDRDGDGMIENDGFPDQTYDTWSVSGISAYSGGLWVAALQAAS 707
            K  W SV  A+  + + D+D DG+ +N+G PDQTYDTWS+ G SAY G LW+AAL+AA 
Sbjct: 557 LKYTWKSVKTALDKLKEMDKDNDGIPDNEGIPDQTYDTWSMKGTSAYCGSLWLAALKAAQ 616

Query: 708 ALAREVGDRGSEDYFLFK--FQKAKVVYEK-LWNGSYFNYDNSGSSQSSSIQADQLAGQW 764
            + + + D  +E Y  +   ++ A+  +EK LWNG Y+N+D   S    SI ADQLAGQW
Sbjct: 617 EIGKVLKD--NEAYIKYNEWYKIAQQNFEKELWNGEYYNFDTE-SDHKDSIMADQLAGQW 673

Query: 765 YARACGLLPIVDEDKARSALKKVYNYNVLKVMGGKRGAVNGMLPDGRVDMSSMQSREIWS 824
           YA    L  I+ +D  + ALKK+Y +NV+K   GK GAVNGM PDG VD S +Q++E+W+
Sbjct: 674 YADILRLGDILPKDHVQKALKKIYEFNVMKFENGKMGAVNGMRPDGIVDESDIQAQEVWT 733

Query: 825 GVTYAVAASMIHEDLADIGFQTACGIYEAAWSGTGLGYAFQTPEAWNTDDQYRSLCYMRP 884
           GVTYA+A+ M +  + +  + TA G+Y+  +  +G GY F+TPEAW  D  YR+  YMRP
Sbjct: 734 GVTYALASFMKYRGMTEEAYNTAYGVYKMTYDKSGKGYWFRTPEAWTKDGNYRASMYMRP 793

Query: 885 LAIWAMQ 891
           L+IW+M+
Sbjct: 794 LSIWSME 800



 Score =  238 bits (608), Expect = 1e-59,   Method: Compositional matrix adjust.
 Identities = 131/374 (35%), Positives = 202/374 (54%), Gaps = 32/374 (8%)

Query: 95  HGVPLGGVGSGSIGRSYRGEFQRWQIFPRVCEDKPVLANQFSVFVS-RSNGQKYSSVLCP 153
            G PLGG G+G+IGR+Y G F RW +     +   V ANQFSVF     N    + VL  
Sbjct: 71  QGAPLGGFGAGTIGRTYNGGFSRWHLEIGKNKYTTVYANQFSVFQKVEGNKDGVAQVLYA 130

Query: 154 KTPEVLKDTTAAGIGSWDWNLKGDKSTYHALYPRAWTVHEGEPDPELRIVCRQISPIIPH 213
             PE         + SW W+   +   Y+ALYP +W  +  +  P +++  +Q SPIIP+
Sbjct: 131 GEPE------NGYLSSWKWDYPKESGMYYALYPNSWYTYTNKDLP-VQLAVKQFSPIIPY 183

Query: 214 NYKESSYPVSVFTYTIYNSGKTSADITLLFTWTNSVGGDSEFTGQHYN-----------S 262
           NYKE+SYPV+VF +T YN    + D++++FTW N +G    F G+  N            
Sbjct: 184 NYKETSYPVAVFKWTAYNPTNKNVDVSIMFTWQNMIG----FFGKQVNVNSGNFNKIIKD 239

Query: 263 KTKMNDGVHAVLLHHRTSHQLPPVTFALAAQETDGVHVSLCPHFVISGNSLGLTAKDMWH 322
           K+K ++ V AV+ +    ++     +++  ++  GV +S    FV +G+       D+WH
Sbjct: 240 KSKDSEIVAAVMGNISNDNEEWNGEYSIGVKKVPGVDISYKAKFVTTGD-----GSDLWH 294

Query: 323 EIKEHGSFDRLNSMETSVTSEPGSSIGAAIAASVTVPPDSEGQVTFSLAWDCPEVNFMSG 382
           E  ++G  D  N  + + T + G  IG+AIA +  + P    +V F+L+WD P + F  G
Sbjct: 295 EFSKNGILD--NKDDETPTKQDG--IGSAIAVNFKLQPGQTIEVPFALSWDLPIMKFGGG 350

Query: 383 KTYYRRYTKFYGTHQNAAANIARDAILEHGSWELQIEAWQRPILEDKRLPEWYPITLFNE 442
             +Y+ YTK++G +   +  I ++A+  +  WE  I+ WQ+PIL +K  P+WY   LFNE
Sbjct: 351 DKWYKMYTKYFGKNGKNSFAILKEALNNYQKWEKMIDDWQKPILSNKSKPDWYKTALFNE 410

Query: 443 LYYLNAGGAVWTDG 456
           LYYL  GG  W +G
Sbjct: 411 LYYLADGGTAWENG 424


>gi|427725201|ref|YP_007072478.1| glucosylceramidase [Leptolyngbya sp. PCC 7376]
 gi|427356921|gb|AFY39644.1| Glucosylceramidase [Leptolyngbya sp. PCC 7376]
          Length = 807

 Score =  330 bits (845), Expect = 3e-87,   Method: Compositional matrix adjust.
 Identities = 255/858 (29%), Positives = 387/858 (45%), Gaps = 148/858 (17%)

Query: 95  HGVPLGGVGSGSIGRSYRGEFQRWQIFPRVCEDKPVLANQFSVFVSRSNGQKYSSVLCPK 154
           HG+PLGG G+G IGRS RG+F  W +       +   A QF V+     G+  +  +  +
Sbjct: 41  HGMPLGGFGAGCIGRSPRGDFNLWHLDAGEHTFRNAGACQFGVYEYVEGGEAQAYAMATE 100

Query: 155 TPEVLKDTTAAGIGSWDWNLKGDKSTYHALYPRAWTVHEGEPDPELRIVCRQISPIIPHN 214
            P+   D T   + SWDW    +K TY ALYPR+W  ++   + EL   C Q SPI+  +
Sbjct: 101 PPD---DGT---LESWDW-YPTEKGTYSALYPRSWYEYKDVFNVEL--TCEQFSPIVAES 151

Query: 215 YKESSYPVSVFTYTIYNSGKTSADITLLFTWTNSVG------------------------ 250
           Y+E+SYP ++F ++ +N       ++++ +W N VG                        
Sbjct: 152 YQETSYPTAIFEWSAHNPTDKPITVSIMMSWQNMVGWFTKATKTPKVELRDDGSPVYEYQ 211

Query: 251 ---GDSEFT-GQHYNSKTKMNDGVHAVLLHHRTSHQLPPVTFALAAQETDGVHVSLCPHF 306
              GDS     Q    + ++   +  V  H + S     + FA     T  + V     +
Sbjct: 212 SRWGDSTGNLNQWIQDRYRVGCLLTRVRPHDQISEGEGQICFATVTNPT--LEVFYHNRW 269

Query: 307 VISGNSLGLTAKDMWHEIKEHGSFDRLNSMETSVTSEPGSSIGAAIAASVTVPPDSEGQV 366
             SG+      +D+W+    +GS   L  +E    + PG  I  AIA   TV P    ++
Sbjct: 270 NPSGD-----GRDVWNYFAGNGS---LPDVEDETPASPGEQIAGAIAVRFTVRPGRTKKI 321

Query: 367 TFSLAWDCPEVNFMSGKTYYRRYTKFYGTHQNAAANIARDAILEHGSWELQIEAWQRPIL 426
            F LAWD P + F    TY+RRYT F+G                H  W +   A +   L
Sbjct: 322 PFFLAWDFPVMEFAPNITYFRRYTDFFGRSG-------------HNVWSIIRTAMKHSDL 368

Query: 427 EDKRLPEWYPITLFNELYYLNAGGAVWTDGSPPVHSLVTIGHRKFSLDWSQSDLKRIVDV 486
             +R+ EW    L                                    S++DL     +
Sbjct: 369 WKERIEEWQKPFL------------------------------------SRADLPDWFKM 392

Query: 487 PNQNDTAVNILERMSSILEQIYTPVALNSSFGTNLLQDGEENIGQFLYLEGIEYLMWNTY 546
                          S+  ++Y  +A   S  T   +D  + +GQF  LE ++Y  + + 
Sbjct: 393 ---------------SLCNELYL-LAQGGSLWTAATED--DPVGQFGVLECLDYSWYESL 434

Query: 547 DVHFYSSFALIMLFPKIQLSIQRDFAAAVMMHDPSKM-----KLLD-DGQWVSRKVLGAV 600
           DV  Y SFAL ML+PK+  S+   FA A+   D ++      KL+  D    +RK  GA 
Sbjct: 435 DVRMYGSFALAMLWPKLDKSVLEAFARAIPSSDDTQRAIGYNKLMGYDNFMAARKRAGAT 494

Query: 601 PHDIGICD--PWFEVNAYCLYDTARWKDLNPKFVLQVYRDVVATG--DKKFAKAVWPSVY 656
           PHD+G  +  PW   N     D   WKDL   FVLQVYRD V TG  D +F  + W  V 
Sbjct: 495 PHDLGAPNEHPWEATNYTAYQDCNLWKDLGADFVLQVYRDFVMTGSKDTEFLWSCWEGVV 554

Query: 657 VAMAYMDQFDRDGDGMIENDGFPDQTYDTWSVSGISAYSGGLW----------VAALQAA 706
            A+ Y+  FD D DG+ EN G PDQT+D W + GISAY GGLW             L+  
Sbjct: 555 QALDYLKTFDLDCDGIPENGGSPDQTFDDWKLKGISAYCGGLWIAALEAAIAIANILKNK 614

Query: 707 SALAREVGDRGSEDYF---LFKFQ----KAKVVYEK-LWNGSYFNYDNSGSSQSSSIQAD 758
           +++  E  ++ S + F   + K+Q    +++ +Y+  LWNGSY+  D    S S  + +D
Sbjct: 615 ASIYEEHQNQVSTEEFEAAIAKYQDWLKQSRPLYQDTLWNGSYYTLDT--GSDSKVVMSD 672

Query: 759 QLAGQWYARACGLLPIVDEDKARSALKKVYNYNVLKVMGGKRGAVNGMLPDGRVDMSS-M 817
           QL G++Y +   L  +  E++    L+ +Y    LK   G+ GA NG+ PDG  +  +  
Sbjct: 673 QLCGEFYTQLLNLPNVNPENRTAITLQTIYEACFLKFHDGQFGAANGLNPDGSPEKENDT 732

Query: 818 QSREIWSGVTYAVAASMIHEDLADIGFQTACGIYEAAWSGTGLGYAFQTPEAWNTDDQYR 877
              E+W+G+ + +AA MI   + D   +    +    +S    G  F+TPEA   ++ +R
Sbjct: 733 HPLEVWTGINFGIAAFMIRNGMKDEALKMTEAVIHQVYSN---GLQFRTPEAITANNTFR 789

Query: 878 SLCYMRPLAIWAMQWALT 895
           +  Y+R + IWA+  ALT
Sbjct: 790 ACHYLRAMGIWAIYDALT 807


>gi|324504438|gb|ADY41918.1| Non-lysosomal glucosylceramidase [Ascaris suum]
          Length = 534

 Score =  329 bits (843), Expect = 5e-87,   Method: Compositional matrix adjust.
 Identities = 217/576 (37%), Positives = 289/576 (50%), Gaps = 71/576 (12%)

Query: 342 SEPGSSIGAAIAASVTVPPDSEGQVTFSLAWDCPEVNFMSGKTYYRRYTK-FYGTHQNAA 400
           +E    +G AI A       S  +V F LAWD P V F +G+  YRR    F+       
Sbjct: 11  TEGEVDLGVAICAEFPAKCKSSAEVEFVLAWDMPIVKFGAGRRQYRRRYARFFPDASKRV 70

Query: 401 ANIARDAILEHGSWELQIEAWQRPILEDKRLPEWYPITLFNELYYLNAGGAVWTDGSPPV 460
             +   A++    WE +I+AWQ+ IL D  LP+WY   LFNE Y+L  GG  W       
Sbjct: 71  EQMCSRALMSRIEWERKIDAWQQRILSDDSLPDWYKSALFNESYFLTDGGTCW------- 123

Query: 461 HSLVTIGHRKFSLDWSQSDLKRIVDVPNQNDTAVNILERMSSILEQIYTPVALNSSFGTN 520
                     F  D    D  R  +    N++A    E                      
Sbjct: 124 ----------FEYD----DEWRSTERQMSNESAKYFKE---------------------- 147

Query: 521 LLQDGEENIGQFLYLEGIEYLMWNTYDVHFYSSFALIMLFPKIQLSIQRDFAAAVMMHDP 580
                    G+F YLE  EY M NTYDVHFYSSFAL+  +P I+L+IQ DFA  V+    
Sbjct: 148 --------FGRFAYLEAWEYYMLNTYDVHFYSSFALLENWPLIELAIQLDFADQVLSSCD 199

Query: 581 SKMKLLDDGQWVSRKVLGAVPHDIG--ICDPWFEVNAYCLYDTARWKDLNPKFVLQVYRD 638
            K   +++    + K LG +PHD+G  + +PW  +NAY L DT  WKDLN KFVL  YRD
Sbjct: 200 RKSVNINESTRTAVKRLGRLPHDLGNPMDEPWLHLNAYALSDTCEWKDLNLKFVLTCYRD 259

Query: 639 ------VVATGDKKFAKAVWPSVYV----AMAYMDQFDRDGDGMIENDGFPDQTYDTWSV 688
                 +    D +    +    Y      +A    +D DGD +IEN G PDQTYD WS+
Sbjct: 260 YEKIVKIYFNDDNEMKGCLLRRFYDLSSGIIADAKAWDVDGDDLIENAGQPDQTYDVWSM 319

Query: 689 SGISAYSGGLWVAALQAASALAREVGDRGSEDYFLFKFQKAKVVYE-KLWNGSYFNYDNS 747
            G SAY GGLW+ AL+    +A  +G+      F  K   A+  YE KLWNG YF++D  
Sbjct: 320 HGSSAYCGGLWLCALECVRRMALTLGEVVDAQKFANKLNNARKAYERKLWNGKYFDFDEH 379

Query: 748 GSSQSSSIQADQLAGQWYARAC-GLLP--IVDEDKARSALKKVYNYNVLKVMGGKRGAVN 804
            S+   SI ADQL G W+     G +   I+   +  ++LK ++ YNV K   G+ G VN
Sbjct: 380 -STDHKSIMADQLCGFWFMCITDGKVDDVIITRQQICASLKTIFEYNVEKFANGQLGPVN 438

Query: 805 GMLPDGRVDMSSMQSREIWSGVTYAVAASMIHEDLADIGFQTACGIYEAAWSGTGLGYAF 864
            M+P G VD + +QS E+W GV YA+A+  +  +  +  F+TA G Y + W   GL Y  
Sbjct: 439 AMMPSGVVDSTGIQSEEVWGGVAYALASFHLLVEENESAFKTAEGWYRSCWERYGLQY-- 496

Query: 865 QTPEAWNTDDQYRSLCYMRPLAIWAMQWALTRPKPK 900
           Q+PEA N    YR++ YMRPLAIWAMQ AL   + K
Sbjct: 497 QSPEAINESSYYRAIGYMRPLAIWAMQSALDALRNK 532


>gi|195033785|ref|XP_001988762.1| GH11342 [Drosophila grimshawi]
 gi|193904762|gb|EDW03629.1| GH11342 [Drosophila grimshawi]
          Length = 869

 Score =  328 bits (841), Expect = 8e-87,   Method: Compositional matrix adjust.
 Identities = 274/928 (29%), Positives = 429/928 (46%), Gaps = 181/928 (19%)

Query: 77  KGKRAFIDPFIKRHLTSSHGVPLGGVGSGSIGRSYRGEFQRWQIFPRVCEDKPVLANQFS 136
           +G+R ++D +   +    +GVP+GG+G GSIGR Y GEF R+Q+ P + E   V ANQF 
Sbjct: 13  EGRRVYMDYYYMENGKQMYGVPIGGIGGGSIGRGYAGEFCRFQMRPGIYEYNVVQANQFI 72

Query: 137 VFVSRSNGQK-YSSVLC----------PKTPEVLKDTTAAG----------------IGS 169
           V +    G   + S+L           P   +   D   A                 + +
Sbjct: 73  VTIKDHKGCTIFQSLLSKCSTKSIKAKPNNNDAEADEAEAQKSKCQLPNCSSRSRQPLSA 132

Query: 170 WDWNLKGDKSTYHALYPRAWT----VHEGEPDPELRIVCRQISPIIPHNYKESSYPVSVF 225
           W  N++  + +Y  LYPR+WT     H G     +R++CRQISP+IPH+YKESS P +VF
Sbjct: 133 WHSNIEDARCSYTGLYPRSWTEYDLSHYG-----VRLICRQISPVIPHDYKESSLPCAVF 187

Query: 226 TYTIYNSGKTSADITLLFTWTNSVGGDSEFTGQHYNSKTKMNDGVHAVLLHHRTSHQLPP 285
            ++  N       +++ FT+ N  G   +  G+  N+++++    +A  +  R +    P
Sbjct: 188 VWSAQNVCDHERKVSITFTFKNGTGNKKQ-DGEG-NAESQLISEGNAKGVAIRQTIGDMP 245

Query: 286 VTFALAAQETDGVHVSLCPHFVISGNSLGLTAKDMWHEIKEHGSFDRLNSMETSVTSEPG 345
            ++ LA +    + ++ CP F  +GN      + +W ++KEHG      + E+  T +  
Sbjct: 246 CSYNLACRVLPEISITRCPQFDPAGN-----GEQLWMQLKEHGQLSEQPTAESLKTKD-- 298

Query: 346 SSIGAAIAASVTVPPDSEGQVTFSLAWDCPEVNFMSG-KTYYRRYTKFYGTHQNAAANIA 404
             IG A+ A + + P +   + F LAWD P++ F    +T+ R YTK++     +   I 
Sbjct: 299 --IGVAVCAHLALKPKTSHDLEFVLAWDMPKIQFPRKLQTHTRYYTKYFDDSGESGPRIC 356

Query: 405 RDAILEHGSWELQIEAWQRPILEDKRLPEWYPITLFNELYYLNAGGAVWTDGSPPVHSLV 464
             A+  + SWE              RL + +   + N+            DG P  +   
Sbjct: 357 EYALKHYASWE--------------RLIDAWQRPILND------------DGLPDWYKCA 390

Query: 465 TIGHRKFSLDWSQSDLKRIVDVPNQNDTAVNILERMSSILEQIYTPVALNSSFGTNLLQD 524
                 F  D     LK         DT++               P+A +         D
Sbjct: 391 IFNQLYFISDGGTLWLK--------CDTSLG-------------KPLAFD---------D 420

Query: 525 GEENIGQFLYLEGIEYLMWNTYDVHFYSSFALIMLFPK----IQLSIQRDFAAAV----- 575
                G+F YLEG EY M+NTYDVHFY+S AL  L+P     +Q   +   AA +     
Sbjct: 421 PRLAYGRFGYLEGHEYRMYNTYDVHFYASPALAHLWPNLQVSLQYDFKDAIAAELSDTRK 480

Query: 576 MMHD----PSKMK----------------------LLDDGQW--VSRKVLGAVPHDIGIC 607
           M++D    P K+K                      + D  +W  ++ K +  V  D  + 
Sbjct: 481 MLYDGKVMPRKVKNCVPHDLGDPDEEPFTLINCYNIHDVNEWKDLNTKFVLQVYRDYYVL 540

Query: 608 DPWFEVNA-----------------YCLY-------DTARWKDLNPK----FVLQVYRDV 639
           +   +  A                 Y LY       ++A  K  N K    ++ +    V
Sbjct: 541 NELAQAQADNASKFSSIEFIDKESLYELYTQDNKRKNSADEKQQNRKSASMYINETNGKV 600

Query: 640 VATGDKKFAKAVWPSVYVAMAYMDQFDRDGDGMIENDGFPDQTYDTWSVSGISAYSGGLW 699
                  + KA++ +    M    ++D+D DG+IEN   PDQTYD+W + G SAY  GLW
Sbjct: 601 YLMDAMSYLKAMYGACKAIMERTIEYDKDNDGLIENTKMPDQTYDSWVMDGPSAYCAGLW 660

Query: 700 VAALQAASALAREVGDRGSEDYFLFKFQKAK-VVYEKLWNGSYFNYDNSGSSQSSSIQAD 758
           +AALQ  S +A  +        +    ++ K  + EKLWNGSY+ +D S  +   +I AD
Sbjct: 661 LAALQTMSVMATLLDQPNDCLRYQDILERGKNSLEEKLWNGSYYRFDQS-HNHRDTIMAD 719

Query: 759 QLAGQWYARACGL-LPIVDEDKARSALKKVYNYNVLKVMGGKRGAVNGMLPD-------G 810
           QL G WY ++CG    I  ++  R+ALK++Y+ NV+    G  GA NG + +       G
Sbjct: 720 QLCGHWYLKSCGFDYEIYPKENVRTALKRIYDNNVMGFHEGNIGAANGFIANANEPSKPG 779

Query: 811 RVDMSSMQSREIWSGVTYAVAASMIHEDLADIGFQTACGIYEAAWSGTGLGYAFQTPEAW 870
            VD S++Q+ E+W GV YA+AA+MI E + +  FQTA G+Y+      G+   F+TPEA 
Sbjct: 780 HVDNSNIQAEEVWPGVVYALAATMIQEGMFEEAFQTAGGMYKTISQRIGMN--FETPEAL 837

Query: 871 NTDDQYRSLCYMRPLAIWAMQWALTRPK 898
             + +YRS+ YMRPL+IW+MQ A  R +
Sbjct: 838 YGEKRYRSIGYMRPLSIWSMQVAWERRR 865


>gi|242015137|ref|XP_002428230.1| conserved hypothetical protein [Pediculus humanus corporis]
 gi|212512791|gb|EEB15492.1| conserved hypothetical protein [Pediculus humanus corporis]
          Length = 812

 Score =  328 bits (840), Expect = 1e-86,   Method: Compositional matrix adjust.
 Identities = 170/368 (46%), Positives = 237/368 (64%), Gaps = 7/368 (1%)

Query: 530 GQFLYLEGIEYLMWNTYDVHFYSSFALIMLFPKIQLSIQRDFAAAVMMHDPSKMKLLDDG 589
           G+F YLEG EY M+NTYDVHFY+SFAL ML+P +Q  +QRDF   V   D S+   L +G
Sbjct: 441 GRFAYLEGHEYRMYNTYDVHFYASFALAMLWPNLQSCVQRDFCDTVPQEDRSRQWHLFNG 500

Query: 590 QWVSRKVLGAVPHDIG--ICDPWFEVNAYCLYDTARWKDLNPKFVLQVYRDVVATGDKKF 647
               RKV  +VPHD+G    +P+ ++N+Y ++D + W+DL+ KFVLQ YRD     D   
Sbjct: 501 SIGYRKVKNSVPHDLGDPFEEPFLKINSYPIHDVSEWRDLDLKFVLQSYRDYFLNNDLNQ 560

Query: 648 AKAVWPSVYVAMAYMDQFDRDGDGMIENDGFPDQTYDTWSVSGISAYSGGLWVAALQAAS 707
            K +W SV V M    ++D+DGDG+IEN G PDQTYD W++ G SAY G LW+AALQ   
Sbjct: 561 LKYLWKSVCVLMNNALRWDKDGDGLIENVGEPDQTYDAWTMEGPSAYCGSLWLAALQCTC 620

Query: 708 ALAREVGDRGSEDYFLFKFQKAKVVYE-KLWNGSYFNYDNSGSSQSSSIQADQLAGQWYA 766
            +A  + D  + + F    + A+  ++ KLWNG Y+N+D S     ++I +DQL G WY 
Sbjct: 621 KMALILSDVDTYNKFSETLKMAQQSFDKKLWNGKYYNFDCS-EKWGTTIMSDQLCGLWYL 679

Query: 767 RACGLL-PIVDEDKARSALKKVYNYNVLKVMGGKRGAVNGMLPDGRVDMSSMQSREIWSG 825
             CG+   ++   K  S+L+ +Y  NVLK+  G  G VNGMLP+G +D+ ++QS EIW G
Sbjct: 680 YCCGIKNEVLPPAKVNSSLRCIYENNVLKLGKGNLGCVNGMLPNGEIDIHTLQSEEIWVG 739

Query: 826 VTYAVAASMIHEDLADIGFQTACGIYEAAWSGTGLGYAFQTPEAWNTDDQYRSLCYMRPL 885
           V YA+A+ MI E + + G++TA GIYE  +   G+G  F+TPEA     +YR++ YMRPL
Sbjct: 740 VVYALASLMIFEGMVEEGWKTARGIYETVYDRIGMG--FETPEALRESKKYRAIGYMRPL 797

Query: 886 AIWAMQWA 893
           +IW+MQ A
Sbjct: 798 SIWSMQLA 805



 Score =  219 bits (558), Expect = 5e-54,   Method: Compositional matrix adjust.
 Identities = 140/402 (34%), Positives = 219/402 (54%), Gaps = 24/402 (5%)

Query: 60  LAPIGVRILCLIREEAAKGKRAFIDPFIKRHLTSSHGVPLGGVGSGSIGRSYRGEFQRWQ 119
           + P+ +R L        KG+R  +D     +    +GVPLGG+GSGSIGR ++GEF R+Q
Sbjct: 39  MIPLIIRYLIYSIYILCKGRRPLMDYMKNLNGKQMYGVPLGGIGSGSIGRGFKGEFCRYQ 98

Query: 120 IFPRVCEDKPVLANQFSVFVSRSNGQK-YSSVL-CPKTPEVLKDTTAAGIGSWDWNLKGD 177
           + P   E + +  NQF + V    G   +  VL C + P+       + + +WDW L G+
Sbjct: 99  LRPGNYEYETIYGNQFILTVRDGEGNAVFQKVLNCGRAPK-------SCLKNWDWTLNGE 151

Query: 178 KSTYHALYPRAWTVHEGEPDPELRIVCRQISPIIPHNYKESSYPVSVFTYTIYNSGKTSA 237
            + Y  LYPR+WT +  E + +LR++C+QISP+IP+NYK+S  P S+F + I N      
Sbjct: 152 NANYVGLYPRSWTEYNIE-EQKLRLICKQISPVIPNNYKDSCIPGSMFIWHIENQSNEVK 210

Query: 238 DITLLFTWTNSVG--GDSEFTGQHYNSKTKMNDG-VHAVLLHHRTSHQLPPVTFALAAQE 294
           D+ + FT+ N +G  GD +        K   ++G +  VL++H  S      TF ++  E
Sbjct: 211 DVNITFTFQNGIGEKGDKKKGCWSQMFKENCSNGNILGVLINHVISSM--NYTFGISVCE 268

Query: 295 TDGVHVSLCPHFVISGNSLGLTAKDMWHEIKEHGSFDRLNSMETSVTSEPGSSIGAAIAA 354
             GVHVS    +    N LG   KD+W++IK  G  ++ +  + S  +  G  + +A+ A
Sbjct: 269 KAGVHVSCLKKW----NPLG-NGKDIWNKIKT-GKLEKFD-FDPSHHTANGEQLASAVCA 321

Query: 355 SVTVPPDSEGQVTFSLAWDCPEVNFMSGKT-YYRRYTKFYGTHQNAAANIARDAILEHGS 413
              +P  S   + F+L WD P++NF   K  Y++ YT+F+G  +N    ++      + +
Sbjct: 322 QTLIPGKSVSTLEFNLIWDMPKINFHENKKQYWKYYTRFFGKEKNGPK-LSAYCFENYKN 380

Query: 414 WELQIEAWQRPILEDKRLPEWYPITLFNELYYLNAGGAVWTD 455
           W+L+I+ WQ PIL D  LP WY   LFNE Y+++ GG VW D
Sbjct: 381 WDLEIDKWQNPILNDDNLPSWYKSALFNETYFVSDGGTVWLD 422


>gi|328709489|ref|XP_001943760.2| PREDICTED: non-lysosomal glucosylceramidase-like [Acyrthosiphon
           pisum]
          Length = 776

 Score =  324 bits (830), Expect = 2e-85,   Method: Compositional matrix adjust.
 Identities = 176/380 (46%), Positives = 231/380 (60%), Gaps = 12/380 (3%)

Query: 522 LQDGEENIGQFLYLEGIEYLMWNTYDVHFYSSFALIMLFPKIQLSIQRDFAAAVMMHDPS 581
           + D     G F YLEG EY M+NTYDVHFY+SF L  L+PK+Q  IQ  F   +      
Sbjct: 396 VTDPRYQYGYFGYLEGHEYRMYNTYDVHFYASFTLAKLWPKLQACIQYKFRDTIDNALNI 455

Query: 582 KMKLLDDGQWVSRKVLGAVPHDIGIC--DPWFEVNAYCLYDTARWKDLNPKFVLQVYRDV 639
             K L  G    RK   +VPHD+G    DP+  VNAY +++ + W+DLN KFVLQ YRD 
Sbjct: 456 TRKHLSCGTRGLRKYKYSVPHDLGDPEEDPFNLVNAYLVHNVSEWRDLNLKFVLQCYRDY 515

Query: 640 VATGDKKFAKAVWPSVYVAMAYMDQFDRDGDGMIENDGFPDQTYDTWSVSGISAYSGGLW 699
              GD K+   ++P +   M     +D DGDGMIEN G  DQT+DTW ++GISAY GGLW
Sbjct: 516 TMFGDNKYLADMYPQIKNVMEKSLSWDTDGDGMIENSGTADQTFDTWIMTGISAYCGGLW 575

Query: 700 VAALQAASALAREVGDRGSEDYFLFKFQKAKVVYE-KLWNGSYFNYDNSGSSQSSSIQAD 758
           +AAL     ++  +      + +     KAK  +   LWNG YFN+D+SG   S SI AD
Sbjct: 576 LAALYCTVQISNILKQDDISEKYSTILNKAKESFNAALWNGGYFNFDSSGQHHSKSIMAD 635

Query: 759 QLAGQWYARACG----LLPIVDEDKARSALKKVYNYNVLKVMGGKRGAVNGMLPDGRVDM 814
           QL G+WY + CG    +LP+   D+ + AL  +Y  NVL    GK GAVNGM+PDG  D 
Sbjct: 636 QLCGEWYLKCCGVNEEVLPV---DRVKKALHTIYKMNVLGFNDGKMGAVNGMMPDGNPDT 692

Query: 815 SSMQSREIWSGVTYAVAASMIHEDLADIGFQTACGIYEAAWSGTGLGYAFQTPEAWNTDD 874
            S+QS E+W+GVTYA+++ MI   L D GF TA GIY   ++  G+  A+QTPEA  + +
Sbjct: 693 FSLQSEEVWTGVTYALSSLMIAHGLRDEGFNTAKGIYNTVYNNIGM--AYQTPEAIYSKN 750

Query: 875 QYRSLCYMRPLAIWAMQWAL 894
            YRS+ YMRPL+IW++Q AL
Sbjct: 751 AYRSVGYMRPLSIWSIQTAL 770



 Score =  156 bits (395), Expect = 4e-35,   Method: Compositional matrix adjust.
 Identities = 122/409 (29%), Positives = 189/409 (46%), Gaps = 75/409 (18%)

Query: 55  KETVQLAPIGVRILCLIREEAAKGKRAFID---PFIKRHLTSSHGVPLGGVGSGSIGRSY 111
           K+ +QL P+ +R +  +     KG++  +D   P   +H+   +GVPLGG+G G+IGR +
Sbjct: 40  KQILQLLPLIIRYVFYMLLCYLKGQKPIMDFLFPIKSKHM---YGVPLGGIGGGTIGRGF 96

Query: 112 RGEFQRWQIFPRVCEDKPVLANQFSVFVSRS-NGQKYSSVL-CPKTPEVLKDTTAAGIGS 169
           +GEF R+Q+ P + E + V ANQF V +  S N   Y++VL C K            + +
Sbjct: 97  KGEFCRYQVVPGMYEYESVQANQFIVNIQNSENITVYNNVLSCLK-------CNNKTLYN 149

Query: 170 WDWNLKGDKSTYHALYPRAWTVHEGEPDPELRIVCRQISPIIPHNYKESSYPVSVFTYTI 229
           W W+   ++  Y  LYPR+WT +   P+  +R++C+QISPIIPHNYK+S  P  VF +T+
Sbjct: 150 WKWDFPSNQCKYTGLYPRSWTEYL-IPEFHIRLICKQISPIIPHNYKDSCIPGGVFIWTV 208

Query: 230 YNSGKTSADITLLFTWTNSVGGDSEFTGQHYNSKTKMNDGVHAVLLHHRTSHQLPPVTFA 289
            N+      I+L FT+ +  G  S                      +H       P    
Sbjct: 209 ENNSDQQYSISLTFTFASGNGISS----------------------YHSDEPSCEPFNLN 246

Query: 290 LAAQETDGVHVSLC-----PHFVISGNSLGLTAKDMWHEIKEHGSFDRLNSMETSVTSEP 344
               E  G  +  C       +VI G             IKE  +F +LN        +P
Sbjct: 247 KDGTEVAGSLIHNCFKEMKCTYVIGG------------LIKEGAAFSQLN-------FDP 287

Query: 345 GSSIGAAIAASVTVPPDSEGQVTFSLAWDCPEVNFMSGKTYYRRYTKFYGTHQNAAANIA 404
             + G  I+ S+             L  +  ++       Y + YTKF+  +  A   I 
Sbjct: 288 KGN-GMTISKSL------------HLFGELRDMQLGPSNIYSKYYTKFFDKNGTAGPEIM 334

Query: 405 RDAILEHGSWELQIEAWQRPILEDKRLPEWYPITLFNELYYLNAGGAVW 453
             ++     WE +IE WQ PIL +  LP+WY   +FNE+Y++  GG +W
Sbjct: 335 HYSMQNFKKWEQEIEKWQNPILNNPLLPDWYKSAIFNEVYFVADGGTIW 383


>gi|307151416|ref|YP_003886800.1| glucosylceramidase [Cyanothece sp. PCC 7822]
 gi|306981644|gb|ADN13525.1| Glucosylceramidase [Cyanothece sp. PCC 7822]
          Length = 805

 Score =  322 bits (825), Expect = 6e-85,   Method: Compositional matrix adjust.
 Identities = 257/864 (29%), Positives = 394/864 (45%), Gaps = 141/864 (16%)

Query: 80  RAFIDPFIKRHLTSS-----HGVPLGGVGSGSIGRSYRGEFQRWQIFPRVCEDKPVLANQ 134
           +A+ +P+  R+ ++      HGVPLGG G+G IGRS +G+F  W +       + + A Q
Sbjct: 21  KAWDNPYTVRYNSNLDDGPWHGVPLGGFGAGCIGRSPKGDFNLWHLDGGEHIFRTLAACQ 80

Query: 135 FSVFVSRSNGQKYSSVLCPKTPEVLKDTTAAGIGSWDWNLKGDKSTYHALYPRAWTVHEG 194
           FS+F     G+  +  L  + PE   D T   +  W W    +K TYHALYPR+W  +EG
Sbjct: 81  FSIFEQPEGGEAQAYALSTEPPE---DGT---LSRWSW-YPTEKGTYHALYPRSWYEYEG 133

Query: 195 EPDPELRIVCRQISPIIPHNYKESSYPVSVFTYTIYNSGKTSADITLLFTWTNSVGGDSE 254
               + +I+C Q SPI   NY+E+SYPV++F +TI+N       ++++ +W N+VG    
Sbjct: 134 VF--QTKIICEQYSPIWAGNYQETSYPVAIFAWTIHNPTDRPITLSIMMSWQNTVGW--- 188

Query: 255 FTGQHYNSKTKMNDGVHAVLLHH------------------RTSHQLPPVTFALAAQETD 296
           F+      + K+ D    V  +                   R    L  +      QE +
Sbjct: 189 FSNAIKTPEVKVRDDGSPVYEYQPRWGDSTGNYNQWIVDNFRVGCLLNRIQPHPQVQEGE 248

Query: 297 G-------VHVSLCPHFVISGNSLGLTAKDMWHEIKEHGSFDRLNSMETSVTSEPGSSIG 349
           G       ++ S+   +    N +G    ++W     +G    L   E    + PG  I 
Sbjct: 249 GQLCIASILNPSVEVFYNTRWNPMG-DGSEIWDYFAMNG---MLGDYENESAAAPGEQIA 304

Query: 350 AAIAASVTVPPDSEGQVTFSLAWDCPEVNFMSGKTYYRRYTKFYGTHQNAAANIARDAIL 409
           AA+A   T+PP                     GKT   R   F                 
Sbjct: 305 AAMAIRFTIPP---------------------GKT---RLIPF----------------- 323

Query: 410 EHGSWELQIEAWQRPILEDKRLPEWYPITLFNELYYLNAGGAVWTDGSPPVHSLVTIGHR 469
                   I AW  PI E  +   +Y    + + +  N G  VW+     + ++     R
Sbjct: 324 --------ILAWDLPITEFSQGINYY--RRYTDFFGRN-GQNVWSVVRTALKNVDI--WR 370

Query: 470 KFSLDWSQSDLKRIVDVPNQNDTAVNILERMSSILEQIYTPVALNSSFGTNLLQDGEENI 529
              +DW QS + +  D+P+    A         +  ++Y  +A   +  T   +D  + +
Sbjct: 371 DRIIDW-QSPILKDQDLPDWFKMA---------LFNELYL-LADGGTLWTAATED--DPV 417

Query: 530 GQFLYLEGIEYLMWNTYDVHFYSSFALIMLFPKIQLSIQRDFAAAVMMHDPSKMKLLDDG 589
           GQF  LE I+Y  + + DV  Y SFAL ML+P++  ++   FA A+   D +   +  +G
Sbjct: 418 GQFAVLECIDYRWYESLDVRLYGSFALGMLWPRLDKAVMEAFARAIPQADDTPRIIGYNG 477

Query: 590 QWVSRKVLGAVPHDIGICD--PWFEVNAYCLYDTARWKDLNPKFVLQVYRDVVATG--DK 645
               RK  GA PHD+G  +  PW + N     D   WKDL   FVL +YR  + TG  ++
Sbjct: 478 AKAIRKAKGATPHDLGAPNEHPWEKTNYTSYQDCNLWKDLGSDFVLLLYRSFLWTGGNNQ 537

Query: 646 KFAKAVWPSVYVAMAYMDQFDRDGDGMIENDGFPDQTYDTWSVSGISAYSGGLWVAALQA 705
            +    W SV   +AY+  FD DGDG+ EN G PDQT+D W + GISAY G LW+AAL+A
Sbjct: 538 DYLWEHWHSVTETLAYLKAFDLDGDGIPENSGAPDQTFDDWQLRGISAYCGALWIAALEA 597

Query: 706 ASAL------------------AREVGDRGSEDYFLFKFQKAKVVYEKLWNGSYFNYDNS 747
           A  +                  A E  +     Y  +  Q   + ++ LWNG Y+  D+ 
Sbjct: 598 AIKMGEILLQRPPMNPQLQPQNAAESIEEAITTYRHWLQQARSLYHDTLWNGEYYRLDS- 656

Query: 748 GSSQSSSIQADQLAGQWYARACGLLPIVDEDKARSALKKVYNYNVLKVMGGKRGAVNGML 807
             S S  + +DQL GQ+YAR  GL  +V+     SAL+K+Y    LK  GGK GA NG+ 
Sbjct: 657 -ESGSDVVMSDQLCGQFYARLLGLPDVVEPQYTTSALRKIYEACFLKFHGGKYGAANGVK 715

Query: 808 PDGRVDM-SSMQSREIWSGVTYAVAASMIHEDLADIGFQTACGIYEAAWSGTGLGYAFQT 866
           PDG  +   +    E+W+G+ + +AA MI   + +  F+    + +  +     G  F+T
Sbjct: 716 PDGTPENPDATHPLEVWTGINFGLAAFMIQMGMKEEAFKITEAVVKQVYEN---GLQFRT 772

Query: 867 PEAWNTDDQYRSLCYMRPLAIWAM 890
           PEA      +R+  Y+R +AIWA+
Sbjct: 773 PEAITAVGTFRASHYLRAMAIWAV 796


>gi|158299437|ref|XP_319575.4| AGAP008830-PA [Anopheles gambiae str. PEST]
 gi|157013519|gb|EAA14848.4| AGAP008830-PA [Anopheles gambiae str. PEST]
          Length = 913

 Score =  315 bits (807), Expect = 8e-83,   Method: Compositional matrix adjust.
 Identities = 194/636 (30%), Positives = 317/636 (49%), Gaps = 94/636 (14%)

Query: 53  NWKETVQLAPIGVRILCLIREEAAKGKRAFIDPFIKRHLTSSHGVPLGGVGSGSIGRSYR 112
           ++++ + + P+ +R +    + A +G++  +D +   +    +G PLGG+G+G+IGR + 
Sbjct: 35  SFRQILSMVPLALRYIPYYWKVAREGRQVLMDYWYTENGKQIYGAPLGGIGAGTIGRGFA 94

Query: 113 GEFQRWQIFPRVCEDKPVLANQFSVFVSRSNGQK-YSSVLCPKTPEVLKDTTA------- 164
           GEF R+Q+ P + E   V ANQF V +    G   + S+L      V K  +        
Sbjct: 95  GEFCRYQLKPGLYEYNTVHANQFIVTIKDETGATIFHSLLSTYRSYVQKIDSHRLAKAFV 154

Query: 165 ----AG---------IGSWDWNLKGDKSTYHALYPRAWTVHEGEPDPELRIVCRQISPII 211
               AG         + SW+  L   + +Y ALYPRAW+ ++   +  +++V RQISPII
Sbjct: 155 LRVWAGFISYRPKTPLASWESGLDASRCSYTALYPRAWSEYD-LSEHGVKLVQRQISPII 213

Query: 212 PHNYKESSYPVSVFTYTIYNSGKTSADITLLFTWTNSVGGDSEFTGQHYNSKTKMNDGVH 271
           PH+YKESS P +VF +T+ N       +TL FT+ N  G   +       +         
Sbjct: 214 PHDYKESSLPCAVFVWTVENVCDKDRQVTLTFTFKNGTGTKKQDAEGGSETSAFTQGNAR 273

Query: 272 AVLLHHRTSHQLPPVTFALAAQETDGVHVSLCPHFVISGNSLGLTAKDMWHEIKEHGSFD 331
            V +    +    P T+ ++ + +  ++++ C  F  +GN      + +W+++KE+G   
Sbjct: 274 GVSIRQTIAEM--PCTYCVSCRSSSEINLTRCERFDPTGN-----GEKLWNDLKENGHLT 326

Query: 332 RLNSMETSVTSEPGSSIGAAIAASVTVPPDSEGQVTFSLAWDCPEVNFMS-GKTYYRRYT 390
             ++ ET  + +    +  A++  + V P +  Q+ F+L WD P+V+F   GK Y+R YT
Sbjct: 327 EKSNDETLKSKD----VAVAVSGQILVQPGTTSQLEFALVWDMPKVHFQKRGKEYHRYYT 382

Query: 391 KFYGTHQNAAANIARDAILEHGSWELQIEAWQRPILEDKRLPEWYPITLFNELYYLNAGG 450
           K++G   +A   I+  A+  +G WE  I+ WQRPILED  LP+WY   +FNELY++  GG
Sbjct: 383 KYFGKSGDAGPQISDYALNNYGKWERLIDEWQRPILEDADLPDWYKSAIFNELYFIADGG 442

Query: 451 AVWTDGSPPVHSLVTIGHRKFSLDWSQSDLKRIVDVPNQNDTAVNILERMSSILEQIYTP 510
           +VW                 F++D  Q+DL                              
Sbjct: 443 SVW-----------------FTMD-DQTDLP----------------------------- 455

Query: 511 VALNSSFGTNLLQDGEENIGQFLYLEGIEYLMWNTYDVHFYSSFALIMLFPKIQLSIQRD 570
                        D     G++ YLEG EY M+ TYDVHFY+S AL  L+P +Q+SIQ D
Sbjct: 456 -----------FDDPRLAYGRYSYLEGHEYRMYTTYDVHFYASHALASLWPNLQVSIQYD 504

Query: 571 FAAAVMMHDPSKMKLLDDGQWVSRKVLGAVPHDIG--ICDPWFEVNAYCLYDTARWKDLN 628
           +  ++        K L DG+ + RK+  +VPHD+G    +P+  +NAY ++D + W+DLN
Sbjct: 505 YKDSIEREITEGRKHLYDGKVIPRKIKNSVPHDLGDPAEEPFDLINAYPIHDVSEWRDLN 564

Query: 629 PKFVLQVYRDVVATGDKKFAKAVWPSVYVAMAYMDQ 664
            KF+LQVYRD           A   S + ++ ++D+
Sbjct: 565 LKFILQVYRDYYTLNHYAQLNAENASKFSSIEFIDK 600



 Score =  219 bits (558), Expect = 6e-54,   Method: Compositional matrix adjust.
 Identities = 110/255 (43%), Positives = 164/255 (64%), Gaps = 11/255 (4%)

Query: 647 FAKAVWPSVYVAMAYMDQFDRDGDGMIENDGFPDQTYDTWSVSGISAYSGGLWVAALQAA 706
           + KA++P+  V + +  ++D+DGDG+IEN   PDQTYDTW + G SAY GGLW+A+L   
Sbjct: 653 YLKAMYPACRVVLEHSLEWDKDGDGLIENSKAPDQTYDTWVMDGPSAYCGGLWLASLHCM 712

Query: 707 SALAREVGDRGSEDYFLFKFQKAKVVY-EKLWNGSYFNYDNSGSSQSSSIQADQLAGQWY 765
           SA+A  +      D +     K +  + EKLWNG+Y+ +D   +S+ +SI +DQL G WY
Sbjct: 713 SAMASLLDQNEDSDRYKAVLDKGRASFEEKLWNGTYYRFDAQSASK-NSIMSDQLCGHWY 771

Query: 766 ARACGL-LPIVDEDKARSALKKVYNYNVLKVMGGKRGAVNGMLP------DGRVDMSSMQ 818
            R+CG    +  ++  R A++ +Y  NV++  GG+ GAVNG +P      DGR D  S+Q
Sbjct: 772 LRSCGFDYDVFPKENVRLAMRTIYENNVMRFCGGQLGAVNGYVPSGQPNKDGRPDTVSIQ 831

Query: 819 SREIWSGVTYAVAASMIHEDLADIGFQTACGIYEAAWSGTGLGYAFQTPEAWNTDDQYRS 878
             E+W+GVTYA+A++MIHE + +  F+TA G+Y+A     G+   F+TPEA   +  YR+
Sbjct: 832 GEEVWTGVTYALASTMIHEGMFEEAFKTAGGLYQALSERIGMN--FETPEAVYAERHYRA 889

Query: 879 LCYMRPLAIWAMQWA 893
           + YMRPL+IW+MQ A
Sbjct: 890 IGYMRPLSIWSMQTA 904


>gi|189239183|ref|XP_966847.2| PREDICTED: similar to bile acid beta-glucosidase, putative
           [Tribolium castaneum]
 gi|270010940|gb|EFA07388.1| hypothetical protein TcasGA2_TC016367 [Tribolium castaneum]
          Length = 818

 Score =  315 bits (806), Expect = 9e-83,   Method: Compositional matrix adjust.
 Identities = 167/373 (44%), Positives = 236/373 (63%), Gaps = 12/373 (3%)

Query: 530 GQFLYLEGIEYLMWNTYDVHFYSSFALIMLFPKIQLSIQRDFAAAVMMHDPSKMKLLDDG 589
           G+F  LEG EY M+NTYDVHFY+SF+ ++ FP +Q  +Q D   A+    P K+K+L DG
Sbjct: 442 GKFALLEGHEYKMYNTYDVHFYASFSFVLNFPLLQSILQYDMRDAIFTEIPDKVKMLYDG 501

Query: 590 QWVSRKVLGAVPHDIGIC--DPWFEVNAYCLYDTARWKDLNPKFVLQVYRDVVATG--DK 645
           +   RKV   VPHDIG    +P+  +N+Y ++D ++W+DLN KFVLQV+RD   TG  D+
Sbjct: 502 EVCERKVPNTVPHDIGDPGEEPFILLNSYPIHDVSQWRDLNSKFVLQVFRDAFITGLDDR 561

Query: 646 KFA--KAVWPSVYVAMAYMDQFDRDGDGMIENDGFPDQTYDTWSVSGISAYSGGLWVAAL 703
             A    ++ + Y  M     FD DGDG+IEN G PDQT+DTW ++G SAY GGLW+AAL
Sbjct: 562 SIAYLNDMYNACYTVMHKSLDFDVDGDGLIENSGSPDQTFDTWVMTGASAYCGGLWLAAL 621

Query: 704 QAASALAREVGDRGSEDYFLFKFQKAKVVYE-KLWNGSYFNYDNSGSSQSSSIQADQLAG 762
            A + +A  +     ++ F     K    +E KLWNG  +N+D S   +  SI ADQL G
Sbjct: 622 FAMTKIADALQKTEDKEKFQELLDKGTAAFERKLWNGKCYNFDCS-DKECRSIMADQLCG 680

Query: 763 QWYARACGL-LPIVDEDKARSALKKVYNYNVLKVMGGKRGAVNGMLPDGRVDMSSMQSRE 821
           QWY R+CG    +  +D+ +++LK +Y  NV     G+ GAVNG + DG +D  ++QS+E
Sbjct: 681 QWYLRSCGFNYEVFPQDRVKTSLKTIYENNVQSFCDGRMGAVNGFI-DGVIDEFTIQSQE 739

Query: 822 IWSGVTYAVAASMIHEDLADIGFQTACGIYEAAWSGTGLGYAFQTPEAWNTDDQYRSLCY 881
           +W+GVTYA+AASM+ E +    F TA G++++     GL  +F TPEA      YR++ Y
Sbjct: 740 VWTGVTYALAASMLQEGMRTEAFNTAGGMFKSMSERFGL--SFDTPEALYAAKYYRAIGY 797

Query: 882 MRPLAIWAMQWAL 894
           MRPL+IW+MQ A+
Sbjct: 798 MRPLSIWSMQLAI 810



 Score =  184 bits (468), Expect = 1e-43,   Method: Compositional matrix adjust.
 Identities = 126/398 (31%), Positives = 195/398 (48%), Gaps = 24/398 (6%)

Query: 60  LAPIGVRILCLIREEAAKGKRAFIDPFIKRHLTSSHGVPLGGVGSGSIGRSYRGEFQRWQ 119
           + P+ +R +         GK   +D          +GVP+GG+G G+IGR YRGEF R+Q
Sbjct: 46  MLPLILRYIWYYIVHRIMGKSIVMDYVHTLRAKQIYGVPIGGIGCGTIGRGYRGEFCRFQ 105

Query: 120 IFPRVCEDKPVLANQFSVFVSRSNGQKYSSVLCPKTPEVLKDTTAAGIGSWDWNLKGDKS 179
           + P + E   V ANQF V +   + +     L    P          + SW+  + G + 
Sbjct: 106 LKPGLYEYNTVDANQFIVTIKDGHNETIFHSLLSTFP-------TKSLKSWESLIDGSEC 158

Query: 180 TYHALYPRAWTVHEGEPDPELRIVCRQISPIIPHNYKESSYPVSVFTYTIYNSGKTSADI 239
            Y  LYPR+WT ++      + + CRQI+PIIPHNYK+SS P ++F + + N  +    +
Sbjct: 159 FYTGLYPRSWTEYDLSK-YGVMLTCRQITPIIPHNYKDSSLPCAMFVWDVKNISEEDRTV 217

Query: 240 TLLFTWTNSVGGDSEFTGQHYNSKTKMNDGVHAVLLHHRTSHQLPPVTFALAAQETDGVH 299
           T+ FT+ N VG  S       +SK         V+L+H         +++LAA+    + 
Sbjct: 218 TIAFTFKNGVGDKSRDRSSTCSSKAFTLADSEGVILYHTIDKM--QCSYSLAAKTNPQID 275

Query: 300 VSLCPHFVISGNSLGLTAKDMWHEIKEHGSFDRLNSMETSVTSEPGSSIGAAIAASVTVP 359
           +S C +F    NS G+     W ++K +GSFD+++        +    +   IA   T+ 
Sbjct: 276 ISKCLYF--DPNSDGIKP---WIQLKNNGSFDKISDKNYG---QIFGEMACGIATKATIR 327

Query: 360 PDSEGQVTFSLAWDCPEVNFMSGKTYYRRYTKFYGTH---QNAAANIARDAILEHGSWEL 416
             +      +L WD P V F   +   ++YT++Y  H    N    I R A   + +WE 
Sbjct: 328 AGNTVTSEMALVWDMPSVQFPKKQ---KKYTRYYTKHFDGDNTGLRIVRYAFENYQNWER 384

Query: 417 QIEAWQRPILEDKRLPEWYPITLFNELYYLNAGGAVWT 454
            I  WQR IL+D+ LP+WY   LFNE YY++ GGA+W 
Sbjct: 385 AIYDWQRRILDDENLPDWYKSALFNESYYISDGGAIWV 422


>gi|427419169|ref|ZP_18909352.1| putative bile acid beta-glucosidase [Leptolyngbya sp. PCC 7375]
 gi|425761882|gb|EKV02735.1| putative bile acid beta-glucosidase [Leptolyngbya sp. PCC 7375]
          Length = 835

 Score =  311 bits (796), Expect = 1e-81,   Method: Compositional matrix adjust.
 Identities = 256/909 (28%), Positives = 383/909 (42%), Gaps = 202/909 (22%)

Query: 85  PFIKRHLTS-----SHGVPLGGVGSGSIGRSYRGEFQRWQIFPRVCEDKPVLANQFSVFV 139
           P+I R+ ++      HG+PLGG+G+G IGR+  G F  W +       +   A QF+V+ 
Sbjct: 22  PYIVRYASNLDDGPEHGMPLGGLGAGCIGRAPNGTFNLWHLDGGEHTFEHFPACQFAVYE 81

Query: 140 SRSNGQKYSSVLCPKTPEVLKDTTAAGIGSWDWNLKGDKS-------------------- 179
             +   +  ++           T   G+  W W    ++S                    
Sbjct: 82  EAAGASQTYAMA----------TAGDGLERWQWYPAENRSVGTAHPPGQPGNDTTSTIKG 131

Query: 180 -TYHALYPRAWTVHEGEPDPELRIVCRQISPIIPHNYKESSYPVSVFTYTIYNSGKTSAD 238
            TY+ALYPR+W  +       L   C Q SPI P NY+ESSYPV+VF +T +N       
Sbjct: 132 GTYNALYPRSWYSYADVFKANL--TCEQFSPIWPDNYQESSYPVAVFVWTAHNPTDQPIT 189

Query: 239 ITLLFTWTNSVG--GDSEFTGQ-----------HYNSKTKMNDGVHAVLLHHRTSHQL-- 283
           +++L +W N VG   +SE + +            Y    + + G    L+    +  +  
Sbjct: 190 LSILLSWQNMVGWFTNSESSPEIQQREDGSPFYDYGPPLRQSAGNFNQLVSDERAIGMVM 249

Query: 284 ----------PPVTFALAAQETDGV--HVSLCPHFVISGNSLGLTAKDMWHEIKEHGSFD 331
                         F +A   T+G+  H    PH             D+W     HG+  
Sbjct: 250 DGAWTDSPGEGDGQFCIAVPATEGITYHTRWNPHG---------DGGDLWESFAGHGTLS 300

Query: 332 RLNSMETSVTSEPGSSIGAAIAASVTVPPDSEGQVTFSLAWDCPEVNFMSGKTYYRRYTK 391
            +    ++V  E    +G AIA   T+ P    ++ F L WD P   F +GK  YRRYT 
Sbjct: 301 NVADTTSAVKDE---QVGGAIAIKFTLAPGETQELPFVLTWDLPVTEFATGKLAYRRYTD 357

Query: 392 FYGTHQNAAANIARDAILEHGSWELQIEAWQRPILEDKRLPEWYPITLFNELYYLNAGGA 451
           F       A  +A++A+  +                    P W                 
Sbjct: 358 FCDCSGRNAWQLAQEALANY--------------------PTW----------------- 380

Query: 452 VWTDGSPPVHSLVTIGHRKFSLDWSQSDLKRIVDVPNQNDTAV-NILERMSSILEQIYTP 510
                            R+  +DW Q  L R  D+P+    A+ N L  M+S        
Sbjct: 381 -----------------RQNIIDWQQPILDR-PDLPDWFKMALFNELYDMAS-------- 414

Query: 511 VALNSSFGTNLLQDGEE--NIGQFLYLEGIEYLMWNTYDVHFYSSFALIMLFPKIQLSIQ 568
                  G  L     +   +GQF  LE I+Y  + + DV  Y  F+ +ML+P+++ +I 
Sbjct: 415 -------GGTLWSAASDIDPVGQFGVLECIDYRWYESLDVRLYGGFSTLMLWPELEKAII 467

Query: 569 RDFAAAVMMHDPSK------MKLLDDGQWVSRKVLGAVPHDIGICD--PWFEVNAYCLYD 620
           R FA A+   DP K        + +      RK+ GA PHD+G  +  PW   N     D
Sbjct: 468 RAFARAIPTEDPKKRIIGYYYTVGESEHLAPRKLKGATPHDLGAPNENPWIATNYTSYQD 527

Query: 621 TARWKDLNPKFVLQVYRDVVATG--DKKFAKAVWPSVYVAMAYMDQFDRDGDGMIENDGF 678
             +WKDL   FVLQVYR  V TG  D +F    WP+V   + Y+  FD DGDG+ EN G 
Sbjct: 528 CNQWKDLPSDFVLQVYRAFVLTGGDDLEFLAECWPAVTQTLQYLKLFDLDGDGVPENGGA 587

Query: 679 PDQTYDTWSVSGISAYSGGLWVAALQAASALAREVGD--RGSEDYFLFKFQ------KAK 730
           PDQT+D W + G+SAY GGLW+AAL+AA  +   +    + S  + + K Q      + K
Sbjct: 588 PDQTFDDWQLKGLSAYCGGLWLAALEAAIKMGDLLQQHRKISGSFAIVKSQYQRWIKQGK 647

Query: 731 VVYE-KLWNGSYFNYDNSGSSQSSSIQADQLAGQWYARACGLLPIVDEDKARSALKKVYN 789
           +VY+ KLW GS++  D+   S S  + ADQL GQ+ AR   L  IVD++    AL  +Y+
Sbjct: 648 IVYQNKLWTGSFYRLDS--ESNSDVVMADQLCGQFCARLMDLPDIVDQEFIDIALDTIYD 705

Query: 790 YNVLKV---------------------------MGGKRGAVNGMLPDGRVDM-SSMQSRE 821
              +K                            +G   GA NG+ PDG  +        E
Sbjct: 706 ACFVKFNEYAAKHSTPQNQKFEGSQAGFFSAAQLGMAIGAANGVKPDGSPENPDDTHQLE 765

Query: 822 IWSGVTYAVAASMIHEDLADIGFQTACGIYEAAWSGTGLGYAFQTPEAWNTDDQYRSLCY 881
           +W+G+ + VAA +      D        + +  ++    G  F+TPEA      +R+  Y
Sbjct: 766 VWTGINFGVAAFLAQMGKRDQAMAITEAVVQQIYN---YGLQFRTPEAITAMGTFRACHY 822

Query: 882 MRPLAIWAM 890
           +RP+AIW +
Sbjct: 823 LRPMAIWGL 831


>gi|254423918|ref|ZP_05037636.1| conserved hypothetical protein [Synechococcus sp. PCC 7335]
 gi|196191407|gb|EDX86371.1| conserved hypothetical protein [Synechococcus sp. PCC 7335]
          Length = 828

 Score =  306 bits (784), Expect = 4e-80,   Method: Compositional matrix adjust.
 Identities = 263/903 (29%), Positives = 390/903 (43%), Gaps = 204/903 (22%)

Query: 85  PFIKRHLTS-----SHGVPLGGVGSGSIGRSYRGEFQRWQI--FPRVCEDKPVLANQFSV 137
           P+I R+ ++      HG PLGG G+G IGRS  G F  W +     V +  P    QFSV
Sbjct: 26  PYIVRYASNLDDGPEHGAPLGGFGAGCIGRSPHGTFNLWHLDGGEHVFDSFP--GCQFSV 83

Query: 138 FVSRSNGQKYSSVLCPKTPEVLKDTTAAGIGSWDWNLKG----DKSTYHALYPRAWTVHE 193
           F   ++G+  +  +    PE         + SW W   G    +  TYHALYPR+W ++E
Sbjct: 84  FEQSADGETKAYAMSSDKPET-------ELASWQWYPAGKTTENTGTYHALYPRSWYLYE 136

Query: 194 GEPDPELRIVCRQISPIIPHNYKESSYPVSVFTYTIYNSGKTSADITLLFTWTNSVGGDS 253
                +L   C Q+SPI  +NY ESSYP + F +T +N       ++++ +W N VG   
Sbjct: 137 NIFKADL--TCEQLSPIWANNYIESSYPTATFIWTAHNHTDQPLTLSIMLSWENMVGW-- 192

Query: 254 EFTGQHYNSKTKMNDGVHAVLLHHRTSHQLPPVTFALAAQETDGVHVSL-CPH-----FV 307
            FT    NS+T          +  R     P   +    +E+ G + +L C        V
Sbjct: 193 -FT----NSETSPE-------IQQREDGS-PFYDYVPLLRESKGNYNTLVCEEGVARGVV 239

Query: 308 ISGNSLGLTAK-----------------------------DMWHEIKEHGSFDRLNSMET 338
           + GN  G  A+                             ++W++    G   RL+++E 
Sbjct: 240 LDGNWTGSPAEGQGQLAIAAHHLEAEVSYHTRWHPDGDGSELWNDFSTTG---RLSNLE- 295

Query: 339 SVTSEPGSSIGAAIAASVTVPPDSEGQVTFSLAWDCPEVNFMSGKTYYRRYTKFYGTHQN 398
            V    G  IG AIA   T+ P    Q+ F+L+WD P   F  GK  YRRYT F+     
Sbjct: 296 DVDQSGGERIGGAIALRFTLAPGETKQIPFTLSWDLPVTEFAKGKRAYRRYTDFFDRSGR 355

Query: 399 AAANIARDAILEHGSWELQIEAWQRPILEDKRLPEWYPITLFNELYYLNAGGAVWTDGSP 458
            A  IA  A+  +         WQ+ I+                                
Sbjct: 356 NAWVIASTALNNYA-------IWQQNII-------------------------------- 376

Query: 459 PVHSLVTIGHRKFSLDWSQSDLKRIVDVPNQNDTAVNILERMSSILEQIYTPVALNSSFG 518
                          +W Q  L R  D+PN    A         +  ++Y   +  + + 
Sbjct: 377 ---------------NWQQPILDR-TDLPNWFKMA---------LFNELYDLASGGTLWS 411

Query: 519 TNLLQDGEENIGQFLYLEGIEYLMWNTYDVHFYSSFALIMLFPKIQLSIQRDFAAAVMMH 578
                D    +GQF  LE I+Y  + + DV  Y  +AL++L+P+++ SI R FA A+   
Sbjct: 412 AASKSD---PVGQFGVLECIDYRWYESLDVRLYGGYALLLLWPELEKSILRAFARAIPTA 468

Query: 579 DPSKMKLL-------DDGQWVSRKVLGAVPHDIGICD--PWFEVNAYCLYDTARWKDLNP 629
           + +K +++       +      RK+ GA PHD+G  +  PW   N     D  +WKDL+ 
Sbjct: 469 N-AKERIIGYYYTVGEADHMAPRKLKGATPHDLGAPNENPWMATNYTSYQDCNQWKDLSS 527

Query: 630 KFVLQVYRDVVATG--DKKFAKAVWPSVYVAMAYMDQFDRDGDGMIENDGFPDQTYDTWS 687
            FVLQVYR   +TG  D  F    WP++   + Y   FD DGDG+IEN G PDQT+D W 
Sbjct: 528 DFVLQVYRAFESTGSVDMDFLLDCWPAIVETLGYTKLFDTDGDGLIENSGAPDQTFDDWK 587

Query: 688 VSGISAYSGGLWVAALQAASALAREVGD----RGSEDYFLFKF----QKAKVVYEK-LWN 738
           + GISAY GGLW+AAL AA  +   + +     G+    L ++    Q  +  Y+K LWN
Sbjct: 588 LQGISAYCGGLWIAALSAAIKIGEILQESKRVEGTIVILLAQYRRWWQHGRTAYQKQLWN 647

Query: 739 GSYFNYDNSGSSQSSSIQADQLAGQWYARACGLLPIVDEDKARSALKKVYNYNVLKV--- 795
           G Y+  D    S S  + ADQL GQ+ A   GL  +VD +     L+ +Y    ++    
Sbjct: 648 GEYYRLDT--GSGSDVVMADQLCGQFCASTMGLPDVVDSEFVEPTLRAIYEACFVRFNQY 705

Query: 796 -----------------------MGGKRGAVNGMLPDGR-VDMSSMQSREIWSGVTYAVA 831
                                  +G K G  NG+ PDG   +       E+W+G+ + +A
Sbjct: 706 TAQLGPQNQKFEGAQTGYFSASELGVKVGCANGVRPDGSPQNPDDTHQLEVWTGINFGLA 765

Query: 832 ASMIHE----DLADIGFQTACGIYEAAWSGTGLGYAFQTPEAWNTDDQYRSLCYMRPLAI 887
           A + HE    +  +I       +YE        G  F+TPEA      +R+  Y+RP+AI
Sbjct: 766 AFLAHEGKLHEAMEITEAVVRQVYEH-------GLQFRTPEAITAVGTFRACHYLRPMAI 818

Query: 888 WAM 890
           WA+
Sbjct: 819 WAV 821


>gi|301628717|ref|XP_002943495.1| PREDICTED: non-lysosomal glucosylceramidase [Xenopus (Silurana)
           tropicalis]
          Length = 692

 Score =  296 bits (759), Expect = 3e-77,   Method: Compositional matrix adjust.
 Identities = 179/471 (38%), Positives = 237/471 (50%), Gaps = 100/471 (21%)

Query: 429 KRLPEWYPITLFNELYYLNAGGAVWTDGSPPVHSLVTIGHRKFSLDWSQSDLKRIVDVPN 488
           K LP WY   LFNELY++  GG VW +                            V   +
Sbjct: 301 KDLPAWYKSALFNELYFMADGGTVWVE----------------------------VSAAD 332

Query: 489 QNDTAVNILERMSSILEQIYTPVALNSSFGTNLLQDGEENIGQFLYLEGIEYLMWNTYDV 548
            +D   N++ R    L  +            ++LQ+     G+F YLEG EY M+NTYDV
Sbjct: 333 DDD---NLMRRGCKDLPGM-----------KDVLQE----YGRFAYLEGQEYRMYNTYDV 374

Query: 549 HFYSSFALIMLFPKIQLSIQRDFAAAVMMHDPSKMKLLDDGQWVSRKVLGAVPHDIGICD 608
           HFY+SFALIML+P++++S+Q D AA+V+  DP   K L  G                   
Sbjct: 375 HFYASFALIMLWPQLEISLQYDMAASVLQEDPELRKYLMSGTVAP--------------- 419

Query: 609 PWFEVNAYCLYDTARWKDLNPKFVLQVYRDVVATGDKKFAKAVWPSVYVAMAYMDQFDRD 668
               V +   Y +       P   LQ                        M    +FD D
Sbjct: 420 ----VKSRSYYYSG------PSLALQT----------------------VMETSLKFDED 447

Query: 669 GDGMIENDGFPDQTYDTWSVSGISAYSGGLWVAALQAASALAREVGDRGSEDYFLFKFQK 728
           GDG+IEN GF DQTYD W ++G S+Y GGLW+AA+     +A  +GD  +++ F     +
Sbjct: 448 GDGLIENSGFADQTYDDWVMTGPSSYCGGLWLAAVCMMCKMAEVLGDGAAQEKFTDILHR 507

Query: 729 AKVVYEKL-WNGSYFNYDNSGSSQSSSIQADQLAGQWYARACGL----LPIVDEDKARSA 783
            K  +EKL WNG Y+N+D      S+S+  DQ AG W+ RACGL      +  +D   SA
Sbjct: 508 GKAAFEKLLWNGKYYNFDCGDQPYSNSVMVDQCAGNWFLRACGLGAGESEVFPKDHVVSA 567

Query: 784 LKKVYNYNVLKVMGGKRGAVNGMLPDGRVDMSSMQSREIWSGVTYAVAASMIHEDLADIG 843
           L+ ++  NV +   G+ GAVNGM PDG  D SS+QS E+W GV Y +AA+MIHE L   G
Sbjct: 568 LRTIFELNVKQFADGQMGAVNGMRPDGTTDTSSVQSDEVWIGVVYGLAATMIHEGLVQEG 627

Query: 844 FQTACGIYEAAWSGTGLGYAFQTPEAWNTDDQYRSLCYMRPLAIWAMQWAL 894
           F TA G Y   W    LG +FQTPEA+     +RSL YMRPL+IWAMQ AL
Sbjct: 628 FATAEGCYRTVWE--RLGMSFQTPEAYCEKQVFRSLAYMRPLSIWAMQLAL 676


>gi|159896834|ref|YP_001543081.1| hypothetical protein Haur_0301 [Herpetosiphon aurantiacus DSM 785]
 gi|159889873|gb|ABX02953.1| protein of unknown function DUF608 [Herpetosiphon aurantiacus DSM
           785]
          Length = 774

 Score =  295 bits (754), Expect = 1e-76,   Method: Compositional matrix adjust.
 Identities = 159/371 (42%), Positives = 228/371 (61%), Gaps = 6/371 (1%)

Query: 527 ENIGQFLYLEGIEYLMWNTYDVHFYSSFALIMLFPKIQLSIQRDFAAAVMMHDPSKMKLL 586
           +++G F YLE  +Y  + T DV FYSS++++ L+P+++      F+  V   D + + ++
Sbjct: 400 DDVGLFSYLECYDYPFYGTLDVSFYSSWSILALWPELERGEILAFSKTVNDADDTVVTIV 459

Query: 587 DDGQWVSRKVLGAVPHDIGIC--DPWFEVNAYCLYDTARWKDLNPKFVLQVYRDVVATGD 644
                  RK  GA+PHD+G     P  + NAY   D   WKDLN K++L++YRDV    D
Sbjct: 460 ATQVPAIRKAAGALPHDLGAPKEQPLIKTNAYDFQDINNWKDLNLKYILRIYRDVSLWND 519

Query: 645 KKFAKAVWPSVYVAMAYMDQFDRDGDGMIENDGFPDQTYDTWSVSGISAYSGGLWVAALQ 704
           +   +A W ++  A+ Y+ QFD DGDG++++ G  DQTYDTW++SG ++YS  L + AL+
Sbjct: 520 QAMLEATWDTIPTALEYVHQFDSDGDGLLDHSG-ADQTYDTWAMSGAASYSASLLICALE 578

Query: 705 AASALAREVGDRGSEDYFLFKFQKAKVVYE-KLWNGSYFNYDNSGSSQSSSIQADQLAGQ 763
           AA  LA+ +GD    D +      A+  +E KLWNG+YF Y  + +     I ADQL GQ
Sbjct: 579 AAIRLAQRMGDHAQADAWSEWLAAARQSFETKLWNGTYFRYHTADTDLREVIMADQLVGQ 638

Query: 764 WYARACGLLPIVDEDKARSALKKVYNYNVLKVMGGKRGAVNGMLPDGRVDMSSMQSREIW 823
           WYA A GL  +   +  RSAL+ VY +NV++   G  GAVNGM PDG VD SS Q+ E+W
Sbjct: 639 WYAGAIGLPAVAPREMIRSALQTVYRFNVMQYANGALGAVNGMHPDGTVDTSSNQASEVW 698

Query: 824 SGVTYAVAASMIHEDLADIGFQTACGIYEAAWSGTGLGYAFQTPEAWNTDDQYRSLCYMR 883
           SG TYA+AA M+ E L   G+QTA G Y A ++  GL   F+TPEAW  +  +R+  YMR
Sbjct: 699 SGTTYAIAAMMLQEGLDLEGWQTAWGAYNATYNELGLW--FRTPEAWGIERTFRASMYMR 756

Query: 884 PLAIWAMQWAL 894
           P +IWA++ AL
Sbjct: 757 PQSIWAIEHAL 767



 Score =  214 bits (545), Expect = 2e-52,   Method: Compositional matrix adjust.
 Identities = 128/367 (34%), Positives = 193/367 (52%), Gaps = 26/367 (7%)

Query: 95  HGVPLGGVGSGSIGRSYRGEFQRWQIFPRVCEDKPVLANQFSVFVSRSNGQKYSSVLCPK 154
           HG+P+GG+GSG+IGR++RG++ RW +       + V  NQ+SVF  ++  Q+ + VLC  
Sbjct: 43  HGLPIGGMGSGAIGRNFRGDWSRWHLEVGKHVHRSVWPNQWSVFW-QTASQQAAQVLCTT 101

Query: 155 TPEVLKDTTAAGIGSWDWNLKGDKSTYHALYPRAWTVHEGEPDPELRIVCRQISPIIPHN 214
            P+  +      + SW+WN       YHAL+PRAW  ++  PD  L +V  Q SP++  N
Sbjct: 102 QPDTDE------LSSWNWNYPVGAGNYHALFPRAWFDYQ-HPDWPLELVQEQFSPVLAGN 154

Query: 215 YKESSYPVSVFTYTIYNSGKTSADITLLFTWTNSVGGDS---EFTGQHYNSKTKMNDGVH 271
            KESS+PV VFT+ + N G  +  + L+ TW ++   ++       QH    +  NDG  
Sbjct: 155 LKESSFPVGVFTWRVTNRGSETVRLGLMLTWEHTRAVEAAGLSLQRQH----SAWNDGNT 210

Query: 272 AVLLHHRTSHQLPPV---TFALAAQETDGVHVSLCPHFVISGNSLGLTAKDMWHEIKEHG 328
           + +   +TS Q       T+ALA Q  +   VS    + ++ ++  L     W +    G
Sbjct: 211 SGVTLTQTSDQALSSHNGTWALAVQAPESASVSQWTCWDVAQDAAAL-----WQDFASDG 265

Query: 329 SFDRLNSMETSVTSEPGSSIGAAIAASVTVPPDSEGQVTFSLAWDCPEVNFMSGKTYYRR 388
              +L    TS           AIA ++ + P +   + FSLAWD P V F     +Y+R
Sbjct: 266 ---QLADYPTSQKVAADQRSATAIAVTLELAPGASAVIPFSLAWDFPIVEFADQSRWYKR 322

Query: 389 YTKFYGTHQNAAANIARDAILEHGSWELQIEAWQRPILEDKRLPEWYPITLFNELYYLNA 448
           YT+F+GT+ + A  +A  ++    +W   IEAWQ PIL D + P WY   LFNELYYL  
Sbjct: 323 YTRFWGTNGDQAQALAVASLTNADAWRTAIEAWQNPILADDQRPFWYKSALFNELYYLVD 382

Query: 449 GGAVWTD 455
           GG +W D
Sbjct: 383 GGTLWVD 389


>gi|390344669|ref|XP_001200431.2| PREDICTED: non-lysosomal glucosylceramidase-like, partial
           [Strongylocentrotus purpuratus]
          Length = 347

 Score =  291 bits (745), Expect = 1e-75,   Method: Compositional matrix adjust.
 Identities = 152/325 (46%), Positives = 199/325 (61%), Gaps = 9/325 (2%)

Query: 579 DPSKMKLLDDGQWVSRKVLGAVPHDIGIC--DPWFEVNAYCLYDTARWKDLNPKFVLQVY 636
           D    + L  G    RK  G VPHD G    DPW ++N Y  +DTA WKDLN K VL VY
Sbjct: 13  DEDSFQDLSSGLRSIRKSAGCVPHDSGNPEEDPWLKLNCYWFHDTAEWKDLNLKMVLMVY 72

Query: 637 RDVVATGDKKFAKAVWPSVYVAMAYMDQFDRDGDGMIENDGFPDQTYDTWSVSGISAYSG 696
           RD  AT D++F   +WP     MA  +  D+DGDG+I++ G  DQTYD W+  G SAY G
Sbjct: 73  RDYFATKDREFLDFMWPKCMTVMAKANSQDKDGDGLIDHFGTADQTYDVWTAKGCSAYCG 132

Query: 697 GLWVAALQAASALAREVGDRGSEDYFLFKFQKAKVVYE-KLWNGSYFNYDNSGSSQSSSI 755
           GL++AAL+    +A  +GD  ++  +    Q+ K  YE KLWNG YFNYD+S  S   SI
Sbjct: 133 GLYLAALKCMCEMADLLGDDKAKRQYSEILQRGKATYEKKLWNGRYFNYDSSDQSYHDSI 192

Query: 756 QADQLAGQWYARACGLLPIVDE----DKARSALKKVYNYNVLKVMGGKRGAVNGMLPDGR 811
            ADQ  G WY RAC L+P  D+    D  RSAL+ +++ NV+ V  G  GA+NG+ P G+
Sbjct: 193 MADQTCGHWYLRACDLVPERDQVFQVDHVRSALRTIFDMNVMGVKEGNFGAMNGIRPSGK 252

Query: 812 VDMSSMQSREIWSGVTYAVAASMIHEDLADIGFQTACGIYEAAWSGTGLGYAFQTPEAWN 871
            D +S+Q  E+W G TY +A +MI E + + GF+TA G Y   +   GL  A+Q PEA+ 
Sbjct: 253 PDHTSLQGEEVWVGTTYGLAGNMIQEGMWEEGFRTAKGCYTTCYEQAGL--AYQVPEAYM 310

Query: 872 TDDQYRSLCYMRPLAIWAMQWALTR 896
           +   YRSL YMRPLAIWAMQWA+ +
Sbjct: 311 SKKIYRSLGYMRPLAIWAMQWAVEK 335


>gi|94968715|ref|YP_590763.1| hypothetical protein Acid345_1688 [Candidatus Koribacter versatilis
           Ellin345]
 gi|94550765|gb|ABF40689.1| protein of unknown function DUF608 [Candidatus Koribacter
           versatilis Ellin345]
          Length = 811

 Score =  290 bits (741), Expect = 3e-75,   Method: Compositional matrix adjust.
 Identities = 165/376 (43%), Positives = 229/376 (60%), Gaps = 24/376 (6%)

Query: 532 FLYLEGIEYLMWNTYDVHFYSSFALIMLFPKIQLSIQRDFAAAVMMHDPSKMKLLDDGQW 591
           F YLE  +Y  + T DV FY SF L   +P+I+    R++A  +    P  +       W
Sbjct: 438 FSYLECFDYAYYGTLDVRFYGSFPLAKFWPEIEKQEMREYADTI----PENIDQKYIWAW 493

Query: 592 VS----------RKVLGAVPHDIG--ICDPWFEVNAYCLYDTARWKDLNPKFVLQVYRDV 639
            S          RKV G+ PHD+G  + D     N Y     + WKDLN K+VL V+RD 
Sbjct: 494 KSDNEHKLEPTQRKVAGSAPHDLGSPLEDALILPNQYNYQSVSFWKDLNSKYVLMVWRDY 553

Query: 640 VATG--DKKFAKAVWPSVYVAMAYMDQFDRDGDGMIENDGFPDQTYDTWSVSGISAYSGG 697
           V +G  DK F +  + +V  +M ++ QFD++ DG+IENDG+PDQTYD WS  G SAYSG 
Sbjct: 554 VMSGSKDKAFLQYNYNAVKQSMQFLRQFDKNNDGLIENDGYPDQTYDNWSARGESAYSGS 613

Query: 698 LWVAALQAASALAREVGDR--GSEDYFLFKFQKAKVVYEKLWNGSYFNYDNSGSSQSSSI 755
           L++AAL+A   +A+ +GD+   ++   LFK  +   V +KLWNG+YFNYD  GSS   +I
Sbjct: 614 LYLAALRATEEMAKLLGDQRTAADTAALFKRAQDSFV-KKLWNGTYFNYD-VGSSYKDAI 671

Query: 756 QADQLAGQWYARACGLLPIVDEDKARSALKKVYNYNVLKVMGGKRGAVNGMLPDGRVDMS 815
            A+QLAGQWYA   GL  +V  D   SALKKVY++NV+K+  G  GA+NG+   G V   
Sbjct: 672 MAEQLAGQWYASLTGLGDLVPRDMQHSALKKVYDFNVMKLQNGTMGALNGISASGEVLKD 731

Query: 816 SMQSREIWSGVTYAVAASMIHEDLADIGFQTACGIYEAAWSGTGLGYAFQTPEAWNTDDQ 875
           + Q+ E+W+GVT+AVAA+M+   L + GF TA G+Y   +     GY F+TPEA++T+  
Sbjct: 732 NEQTEEVWTGVTFAVAATMLQNGLREEGFNTAKGVYNVVYDQK--GYWFRTPEAYDTNGM 789

Query: 876 YRSLCYMRPLAIWAMQ 891
           YR+  YMRP AIW+M+
Sbjct: 790 YRASMYMRPGAIWSME 805



 Score =  210 bits (534), Expect = 4e-51,   Method: Compositional matrix adjust.
 Identities = 125/386 (32%), Positives = 191/386 (49%), Gaps = 41/386 (10%)

Query: 95  HGVPLGGVGSGSIGRSYRGEFQRWQIFPRVCEDKPVLANQFSVFVSRSNGQKYSSVLCPK 154
            G P+GG G+G+  R+YRG F+RW +   + + + V ANQF+VF         + VL  +
Sbjct: 61  QGAPVGGFGAGTFSRTYRGNFERWHVKAGIHKYQNVPANQFAVFAQADGDAAVAQVLSTQ 120

Query: 155 TPEVLKDTTAAGIGSWDWNLKGDKSTYHALYPRAWTVHEGEPDPELRIVCRQISPIIPHN 214
            PE         + SW+W        Y ALYP++W  ++  P   + +   Q SP+IP N
Sbjct: 121 KPE------HGELSSWNWTYPVGAGDYAALYPKSWFAYK-SPQLPITLTLEQFSPVIPDN 173

Query: 215 YKESSYPVSVFTYTIYNSGKTSADITLLFTWTNSVGGDSEFTG-----------QHYNSK 263
           YKESSYPV+V+ + + N  +    +++LF+WTN VG   + T             HY S+
Sbjct: 174 YKESSYPVAVYNWYVTNPSQKRVTVSILFSWTNMVGWFRDTTTGFGAALNSQDINHYKSE 233

Query: 264 TKMNDGVHAVLLHHRTSHQLPPVT------FALAAQETDGVHVSLCPHFVISGNSLGLTA 317
              +  +  ++     S    PV+      FA+A+    GV +S    +  SG+      
Sbjct: 234 AIKSSQMQGIVFDRLRSE---PVSDEWDGQFAIASLAGSGVEISYLTTYRPSGD-----G 285

Query: 318 KDMWHEIKEHGSFDRLNSMETSVTSEPGSSIGAAIAASVTVPPDSEGQVTFSLAWDCPEV 377
            D+W    + G   RL ++   + S  G S+  AIA   T+ P  +  +  +LAWD P +
Sbjct: 286 SDVWTPFAKDG---RLPNVAAGLASS-GDSLAGAIAVRFTLAPGEKRTIPMALAWDLPFI 341

Query: 378 NFMSGKTYYRRYTKFYGTHQNAAANIARDAILEHGSWELQIEAWQRPILEDKRLPEWYPI 437
            F SG+ + R YT+F+      A  +AR A+ +   W  QI+AWQ+P + D+  P WY  
Sbjct: 342 QFGSGRKWARHYTRFFDRFGTNAWKLARTALEQGNDWSRQIDAWQKPYISDESKPLWYRG 401

Query: 438 TLFNELYYLNAGGAVW-----TDGSP 458
            LFNELY L  GG +W     T G+P
Sbjct: 402 MLFNELYDLVDGGTMWAHEVGTPGNP 427


>gi|148670515|gb|EDL02462.1| glucosidase beta 2, isoform CRA_b [Mus musculus]
          Length = 377

 Score =  289 bits (740), Expect = 4e-75,   Method: Compositional matrix adjust.
 Identities = 148/308 (48%), Positives = 192/308 (62%), Gaps = 7/308 (2%)

Query: 537 GIEYLMWNTYDVHFYSSFALIMLFPKIQLSIQRDFAAAVMMHDPSKMKLLDDGQWVSRKV 596
           G EY M+NTYDVHFY+SFAL+ML+PK++LS+Q D A A +  D ++ + L  G     K 
Sbjct: 19  GQEYRMYNTYDVHFYASFALVMLWPKLELSLQYDMALATLKEDLTRRRYLMSGVVAPVKR 78

Query: 597 LGAVPHDIGICD--PWFEVNAYCLYDTARWKDLNPKFVLQVYRDVVATGDKKFAKAVWPS 654
              +PHDIG  D  PW  VNAY ++DTA WKDLN KFVLQ+YRD   TGD+ F + +WP 
Sbjct: 79  RNVIPHDIGDPDDEPWLRVNAYLIHDTADWKDLNLKFVLQIYRDYYLTGDQGFLEDMWPV 138

Query: 655 VYVAMAYMDQFDRDGDGMIENDGFPDQTYDTWSVSGISAYSGGLWVAALQAASALAREVG 714
               M    +FD+D DG+IEN G+ DQTYD W  +G SAY GGLW+AA+     +A   G
Sbjct: 139 CLAVMESEMKFDKDQDGLIENGGYADQTYDAWVTTGPSAYCGGLWLAAVAVMVQMAVLCG 198

Query: 715 DRGSEDYFLFKFQKAKVVYEK-LWNGSYFNYDNSGSSQSSSIQADQLAGQWYARACGL-- 771
            +  ++ F     + +  YE+ LWNG Y+NYD+S   QS SI +DQ AGQW+ RACGL  
Sbjct: 199 AQDVQERFASILCRGREAYERLLWNGRYYNYDSSSHPQSRSIMSDQCAGQWFLRACGLGE 258

Query: 772 --LPIVDEDKARSALKKVYNYNVLKVMGGKRGAVNGMLPDGRVDMSSMQSREIWSGVTYA 829
               +        AL+ ++  NV    GG  GAVNGM P G  D SS+QS E+W GV Y 
Sbjct: 259 GDTEVFPTLHVVRALQTIFELNVQAFAGGAMGAVNGMHPHGVPDRSSVQSDEVWVGVVYG 318

Query: 830 VAASMIHE 837
           +AA+MI E
Sbjct: 319 LAATMIQE 326


>gi|374311163|ref|YP_005057593.1| glucosylceramidase [Granulicella mallensis MP5ACTX8]
 gi|358753173|gb|AEU36563.1| Glucosylceramidase [Granulicella mallensis MP5ACTX8]
          Length = 811

 Score =  287 bits (735), Expect = 2e-74,   Method: Compositional matrix adjust.
 Identities = 165/380 (43%), Positives = 220/380 (57%), Gaps = 18/380 (4%)

Query: 532 FLYLEGIEYLMWNTYDVHFYSSFALIMLFPKIQLSIQRDFAAAVMMHDPSKMKLLDDGQW 591
           F  LE  +Y  + T DV FY+S  L+  +P I   + R+FA  V    P +   +   Q 
Sbjct: 439 FALLECFDYAYYGTLDVRFYASLPLLKFWPDIDKQVLREFADTVPKEWPEQGLWVWKTQQ 498

Query: 592 VS------RKVLGAVPHDIGI--CDPWFEVNAYCLYDTARWKDLNPKFVLQVYRDVVATG 643
           V       RK +GAVPHD+G+   DP+  VN     DT  WKDLN KFVL VYRD V TG
Sbjct: 499 VGEPITHKRKKIGAVPHDLGVPEGDPFVAVNEPGWQDTNDWKDLNSKFVLMVYRDYVLTG 558

Query: 644 --DKKFAKAVWPSVYVAMAYMDQFDRDGDGMIENDGFPDQTYDTWSVSGISAYSGGLWVA 701
             D  F +  WP+V  A+ Y+ QFD  G G+ EN G+PDQTYD W V G+SAYSGGLW+A
Sbjct: 559 RKDTAFLRETWPAVKAAIEYLRQFDHGG-GVPENSGYPDQTYDDWVVRGVSAYSGGLWLA 617

Query: 702 ALQAASALAREVGD-RGSEDYFLFKFQKAKVVYEKLWNGSYFNYDNSGSSQSSSIQADQL 760
           AL+A    AR VGD + + +Y     +  K    +LWNG YF YD S  S+   IQADQL
Sbjct: 618 ALRAGEETARVVGDTKTTAEYHALFLKGQKTYISQLWNGEYFRYDTSSESK-DDIQADQL 676

Query: 761 AGQWYARACGLLPIVDEDKARSALKKVYNYNVLKVMGGKRGAVNGMLPDGRVDMSSMQSR 820
           AGQWYA   GL  IV      SA KK+++ NV+K   G+ GA NGM  DG + +++ +++
Sbjct: 677 AGQWYANLTGLGEIVPHTMQVSAAKKIFDVNVMKFGHGEMGAANGMTADGAI-LTNAEAK 735

Query: 821 EIWSGVTYAVAASMIHEDLADIGFQTACGIYEAAWSGTGLGYAFQTPEAWNTDDQYRSLC 880
           E+W G T   A  ++ E + D  ++T  G+Y   +     GY F+TPEAW+    +R+  
Sbjct: 736 EVWVGTTLGYAGLLMQEGMNDEAWKTTRGLYHVIYEDK--GYWFRTPEAWDITGNFRAGM 793

Query: 881 YMRPLAIWAMQWALTRPKPK 900
           YMRP AIWA++  +T P  K
Sbjct: 794 YMRPTAIWALE--MTSPTAK 811



 Score =  197 bits (500), Expect = 3e-47,   Method: Compositional matrix adjust.
 Identities = 142/438 (32%), Positives = 201/438 (45%), Gaps = 51/438 (11%)

Query: 51  TLNWKETVQLA-PIGVRILCLIREEAAKG---------KRAFIDPFIKRHLTSS------ 94
           T  W  T  LA PI    LCL     A G         KRA   P     +T +      
Sbjct: 4   TSKWISTRGLAFPILAFTLCLSMASTAWGGDGIPKAAWKRALGLPLENPGVTRAAGDIDD 63

Query: 95  ---HGVPLGGVGSGSIGRSYRGEFQRWQIFPRVCEDKPVLANQFSVFV-SRSNGQKYSSV 150
               GVP+GG G+G+  RSYRG+F RW I   V + +PV ANQF++F  S  +    S  
Sbjct: 64  GYWQGVPVGGFGAGTFSRSYRGDFARWHIKAGVHKYEPVYANQFAMFQQSEGDAHGTSQA 123

Query: 151 LCPKTPEVLKDTTAAGIGSWDWNLKGDKSTYHALYPRAWTVHEGEPDPELRIVCRQISPI 210
           L    P      +   + SW W+       Y+ L+P+AW  ++ +  P   +   Q SP+
Sbjct: 124 LMNGHP------SNGELSSWQWDYPVGAGDYYGLFPKAWFDYKWDKFPA-HVTLEQFSPV 176

Query: 211 IPHNYKESSYPVSVFTYTIYNSGKTSADITLLFTWTNSVGGDSEFT---------GQH-- 259
           +P+NY+ESSYPV+V+ +   N    +  +++L +WTN  G    +T         G H  
Sbjct: 177 LPNNYRESSYPVAVYRWHAENPTNKTVVVSVLLSWTNMSGWFRTYTRDFKGAPNQGNHNE 236

Query: 260 YNSKTKMNDGVHAVLLHHRTSHQLPPV----TFALAAQETDGVHVSLCPHFVISGNSLGL 315
           Y S+   + G    ++  R      P      FA+AA E+ GV VS    +  SG+    
Sbjct: 237 YVSEKVGDAGTMKGIVFDRNRAGTAPNDWDGQFAIAAMESPGVEVSYQTTYQASGD---- 292

Query: 316 TAKDMWHEIKEHGSFDRLNSMETSVTSEPGSSIGAAIAASVTVPPDSEGQVTFSLAWDCP 375
             K +W    + G   RL +   S  S+    +  AIA   T+ P  +  V   ++WD  
Sbjct: 293 -GKAVWSAFSKDG---RLANDGKSWVSDK-EKLAGAIALRFTLRPGEKKVVPMVVSWDFS 347

Query: 376 EVNFMSGKTYYRRYTKFYGTHQNAAANIARDAILEHGSWELQIEAWQRPILEDKRLPEWY 435
            + F  G+ + RRYT FYGT    A  IARD +L    W  +I+ WQ P + D+  P WY
Sbjct: 348 LIQFGEGRKWDRRYTDFYGTSGQNAWKIARDGLLHATEWSDEIDKWQAPYITDESKPLWY 407

Query: 436 PITLFNELYYLNAGGAVW 453
              LFNELY L  GG  W
Sbjct: 408 RGMLFNELYALTDGGTFW 425


>gi|353232146|emb|CCD79501.1| bile acid beta-glucosidase-related [Schistosoma mansoni]
          Length = 725

 Score =  287 bits (735), Expect = 2e-74,   Method: Compositional matrix adjust.
 Identities = 209/657 (31%), Positives = 319/657 (48%), Gaps = 84/657 (12%)

Query: 71  IREEAAKGKRAFIDPFIKRHLTSSHGVPLGGVGSGSIGRSYRGEFQRWQIFPRVCEDKPV 130
           I++   + +  FID          +GVP+GG+GSGSIGR YRGEF R  + P +      
Sbjct: 77  IKKRFIQHRLPFIDAVSHVPWRPIYGVPIGGIGSGSIGRGYRGEFCRSSLIPGMYSYDVQ 136

Query: 131 LANQFSVFVSRSNGQKYSSVLCPKTPEVLKDTTAAGIGSWDWNLKGDKSTYHALYPRAWT 190
             +QF V V ++    Y  VL P T      + A  +  W W    +K  Y  LYPR+WT
Sbjct: 137 PVDQFIVTVRKNGVTVYHQVLSPLTKP---PSNAKKLKKWKWGFDPEKGHYLGLYPRSWT 193

Query: 191 VHEGEPDPELRIVCRQISPIIPHNYKESSYPVSVFTYTIYNSGKTSADITLLFTWTNSVG 250
           V+E  P+ +L +V +QISP+IPH+Y+ +  PV+VF + + +  K +  +T+ F+W     
Sbjct: 194 VYEI-PELQLVLVYQQISPVIPHDYQTTCLPVAVFYWKVISWNKENLKVTITFSWHGPNH 252

Query: 251 GDSEFTGQHYNSKTKMNDGVHAVLLHHRTSHQLPPVTFALAAQETDGVHVSLCPHFVISG 310
                +     S+   N   +    H  T+       F   +Q +             +G
Sbjct: 253 RKRTKSSDFSKSEDGNNSSENTTNTHVSTAEN----AFPTCSQRSTAFS---------TG 299

Query: 311 NSLGLTAKDMWHEIKEHGSFDRLN--------SMETSVTSEPGSSIGAAIAASVTVP--- 359
           + L L  +      K+  +F  L+        ++     ++    +  A++A+  VP   
Sbjct: 300 DFLKLNCQG-----KKLKTFTDLSWLGWLVGYTVPLDSKAKKSPKLAIAVSATTNVPRCN 354

Query: 360 -----PDSEGQVTFSLAWDCPEVNFMSGKTYY-RRYTKFYGTHQNAAANIA-RDAILEHG 412
                  S+ ++ F++ W  P V F +G   Y RRY +++   + + A +    AI    
Sbjct: 355 ITDNLNPSQSELEFAITWHSPIVKFRTGDVVYTRRYVRWFPVDRISGAKLLLAHAIDNWR 414

Query: 413 SWELQIEAWQRPILEDKRLPEWYPITLFNELYYLNAGGAVWTDGSPPVHSLVTIGHRKFS 472
            W  +IE WQ PIL ++ LP WY   LFNELYYL+ GG VW D   P+            
Sbjct: 415 QWVQKIEDWQNPILSNQSLPSWYKSALFNELYYLSDGGTVWLD---PIQ----------- 460

Query: 473 LDWSQSDLKRIVDVPNQNDTAVNILERMSSILEQIYTPVALNSSFGTNLLQDGEENIGQF 532
           +D  +SDL  + +     D  V+  +  + +                         IG F
Sbjct: 461 VDCFKSDLLNLTE-----DMKVDSWDHRARL----------------------SREIGLF 493

Query: 533 LYLEGIEYLMWNTYDVHFYSSFALIMLFPKIQLSIQRDFAAAVMMHDPSKMKLLDDGQWV 592
            YLEG EY M+NTYDVH Y+S+ALI L+PKIQL++  D A   +  D + +  + +G+ +
Sbjct: 494 GYLEGHEYRMYNTYDVHHYASWALIKLWPKIQLALNYDCADLTIAEDSTSVYYIGNGKAL 553

Query: 593 SRKVLGAVPHDIGIC--DPWFEVNAYCLYDTARWKDLNPKFVLQVYRDVVATGDKKFAKA 650
            R    AV HD G    +PW   NAY ++ T  WKDLN KF+LQV+RD   T D ++   
Sbjct: 554 FRSSECAVVHDFGDPEDEPWRCTNAYIMFPTDTWKDLNSKFILQVWRDWRITQDHQYLLY 613

Query: 651 VWPSVY-VAMAYMDQFDRDGDGMIENDGFPDQTYDTWSVSGISAYSGGLWVAALQAA 706
           + P V  +    +  +D D DG+IEN GFPDQTYD W+  G++AY+GG+W++ L A 
Sbjct: 614 MLPIVLRILRKSLVAWDSDDDGLIENSGFPDQTYDVWTAKGLTAYTGGMWLSCLYAT 670


>gi|345293661|gb|AEN83322.1| AT5G49900-like protein, partial [Capsella grandiflora]
 gi|345293663|gb|AEN83323.1| AT5G49900-like protein, partial [Capsella grandiflora]
 gi|345293665|gb|AEN83324.1| AT5G49900-like protein, partial [Capsella grandiflora]
          Length = 156

 Score =  285 bits (728), Expect = 1e-73,   Method: Compositional matrix adjust.
 Identities = 130/155 (83%), Positives = 144/155 (92%)

Query: 540 YLMWNTYDVHFYSSFALIMLFPKIQLSIQRDFAAAVMMHDPSKMKLLDDGQWVSRKVLGA 599
           Y MWNTYDVHFY+SFAL+MLFPK++LSIQRDFAAAVM+HDP+K+K L +GQWV RKVLGA
Sbjct: 1   YRMWNTYDVHFYASFALVMLFPKLELSIQRDFAAAVMLHDPTKVKTLSEGQWVQRKVLGA 60

Query: 600 VPHDIGICDPWFEVNAYCLYDTARWKDLNPKFVLQVYRDVVATGDKKFAKAVWPSVYVAM 659
           VPHD+GI DPWFEVN Y L++T RWKDLNPKFVLQVYRDVVATGDKKFA AVWPSVYVAM
Sbjct: 61  VPHDLGINDPWFEVNGYSLHNTDRWKDLNPKFVLQVYRDVVATGDKKFAVAVWPSVYVAM 120

Query: 660 AYMDQFDRDGDGMIENDGFPDQTYDTWSVSGISAY 694
           AYM QFD+DGDGMIEN+GFPDQTYDTWS SG+SAY
Sbjct: 121 AYMAQFDKDGDGMIENEGFPDQTYDTWSASGVSAY 155


>gi|345293667|gb|AEN83325.1| AT5G49900-like protein, partial [Capsella rubella]
 gi|345293669|gb|AEN83326.1| AT5G49900-like protein, partial [Capsella rubella]
 gi|345293671|gb|AEN83327.1| AT5G49900-like protein, partial [Capsella rubella]
 gi|345293673|gb|AEN83328.1| AT5G49900-like protein, partial [Capsella rubella]
 gi|345293675|gb|AEN83329.1| AT5G49900-like protein, partial [Capsella rubella]
          Length = 156

 Score =  284 bits (726), Expect = 2e-73,   Method: Compositional matrix adjust.
 Identities = 130/155 (83%), Positives = 144/155 (92%)

Query: 540 YLMWNTYDVHFYSSFALIMLFPKIQLSIQRDFAAAVMMHDPSKMKLLDDGQWVSRKVLGA 599
           Y MWNTYDVHFY+SFAL+MLFPK++LSIQRDFAAAVM+HDP+K+K L +GQWV RKVLGA
Sbjct: 1   YRMWNTYDVHFYASFALVMLFPKLELSIQRDFAAAVMLHDPTKVKTLSEGQWVQRKVLGA 60

Query: 600 VPHDIGICDPWFEVNAYCLYDTARWKDLNPKFVLQVYRDVVATGDKKFAKAVWPSVYVAM 659
           VPHD+GI DPWFEVN Y L++T RWKDLNPKFVLQVYRDVVATGDKKFA AVWPSVYVAM
Sbjct: 61  VPHDLGINDPWFEVNGYNLHNTDRWKDLNPKFVLQVYRDVVATGDKKFAVAVWPSVYVAM 120

Query: 660 AYMDQFDRDGDGMIENDGFPDQTYDTWSVSGISAY 694
           AYM QFD+DGDGMIEN+GFPDQTYDTWS SG+SAY
Sbjct: 121 AYMAQFDKDGDGMIENEGFPDQTYDTWSASGVSAY 155


>gi|449674846|ref|XP_002159451.2| PREDICTED: non-lysosomal glucosylceramidase-like, partial [Hydra
           magnipapillata]
          Length = 751

 Score =  283 bits (725), Expect = 2e-73,   Method: Compositional matrix adjust.
 Identities = 168/428 (39%), Positives = 238/428 (55%), Gaps = 62/428 (14%)

Query: 317 AKDMWHEIKEHGSFDRLNSMETSVTSEPGSSIGAAIAASVTVPPDSEGQVTFSLAWDCPE 376
           A ++W ++ + G  D  N    S  S PG  + AA+AA  TV   ++  + F L+WD P 
Sbjct: 381 AAELWEDLNDDGVLD--NKRGDSCYSYPGQLLAAAVAAKTTVHAGAKQDLKFVLSWDQPF 438

Query: 377 VNFMS--GKTYYRRYTKFYGTHQNAAANIARDAILEHGSWELQIEAWQRPILEDKRLPEW 434
           V F S  G +Y RRYT+F+G++ NA+ ++   A+     WE  IE WQ PIL D  LP W
Sbjct: 439 VTFGSDSGLSYCRRYTRFFGSNGNASPSLCCYALQNIDFWEDNIERWQNPILNDSELPAW 498

Query: 435 YPITLFNELYYLNAGGAVWTDGSPPVHSLVTIGHRKFSLDWSQSDLKRIVDVPNQNDTAV 494
           +   LFNELY+++ GG VW                              +D+  + D+  
Sbjct: 499 FKSALFNELYFISDGGTVW------------------------------LDIKRKKDSHG 528

Query: 495 NILERMSSILEQIYTPVALNSSFGTNLLQDGEENIGQFLYLEGIEYLMWNTYDVHFYSSF 554
           N +    S L ++Y                     G+F YLEG EY M+NTYDVHFY+S+
Sbjct: 529 NFI----SDLVKLY---------------------GRFAYLEGHEYRMYNTYDVHFYASW 563

Query: 555 ALIMLFPKIQLSIQRDFAAAVMMHDPSKMKLLDDGQWVSRKVLGAVPHDIGICD--PWFE 612
           AL +L+P +Q+S+Q D   +V+  D  K  ++ +G    RKV+G VPHD+G  D  P+  
Sbjct: 564 ALALLWPNLQISMQYDICNSVLNVDDEKRPIMMEGGIAPRKVIGCVPHDLGDPDDEPFVR 623

Query: 613 VNAYCLYDTARWKDLNPKFVLQVYRDVVATGDKKFAKAVWPSVYVAMAYMDQFDRDGDGM 672
           VNAYC++DT++WKDLN  FVLQVYRD   T +K+F   +WP++  AMA+    D DGDG+
Sbjct: 624 VNAYCIHDTSQWKDLNLHFVLQVYRDFYITKNKQFLMDMWPAMKAAMAHSLAQDIDGDGI 683

Query: 673 IENDGFPDQTYDTWSVSGISAYSGGLWVAALQAASALAREVGDRGSEDYFLFKFQKAKVV 732
           IEN G  D T+DTW V+G SAY+GGLW++AL+    +A  +G   S   +    ++ K  
Sbjct: 684 IENSGIADSTFDTWVVTGPSAYTGGLWLSALRCIIEIAHILGLSQSISKYKSILERGKKS 743

Query: 733 YE-KLWNG 739
           YE KLWNG
Sbjct: 744 YEKKLWNG 751



 Score =  163 bits (413), Expect = 4e-37,   Method: Compositional matrix adjust.
 Identities = 78/194 (40%), Positives = 114/194 (58%), Gaps = 3/194 (1%)

Query: 70  LIREEAAKGKRAFIDPFIKRHLTSSHGVPLGGVGSGSIGRSYRGEFQRWQIFPRVCEDKP 129
             R+   KGK  FIDP+        +GVPLGG+G+GSIGR ++G+F RWQ+ P +     
Sbjct: 3   FTRKVKKKGKLPFIDPYAVSACKQIYGVPLGGIGAGSIGRGWKGDFNRWQLRPGIYTYDT 62

Query: 130 VLANQFSVFVSRSNGQKYSSVLCPKTPEVLKDTTAAGIGSWDWNLKGDKSTYHALYPRAW 189
           V  NQF+V + +     Y  VL P  P  +K+  +  + SW W+  G +  YHALYPRAW
Sbjct: 63  VDVNQFTVCIRKCGKTIYQQVLAPYYPVAIKNVES--LQSWTWSFDGGRGYYHALYPRAW 120

Query: 190 TVHEGEPDPELRIVCRQISPIIPHNYKESSYPVSVFTYTIYNSGKTSADITLLFTWTNSV 249
           TV+    +  +++ CRQ+SPI P NYK++S PV VF +TI N G    D++++ T+ N  
Sbjct: 121 TVYH-ITEHNIKLTCRQVSPIFPDNYKDTSLPVGVFVWTIENIGMHDLDVSIMLTFQNGD 179

Query: 250 GGDSEFTGQHYNSK 263
           G + +  G H N +
Sbjct: 180 GTNEDCVGGHNNEE 193


>gi|256082539|ref|XP_002577512.1| bile acid beta-glucosidase-related [Schistosoma mansoni]
          Length = 717

 Score =  283 bits (725), Expect = 2e-73,   Method: Compositional matrix adjust.
 Identities = 210/657 (31%), Positives = 316/657 (48%), Gaps = 92/657 (14%)

Query: 71  IREEAAKGKRAFIDPFIKRHLTSSHGVPLGGVGSGSIGRSYRGEFQRWQIFPRVCEDKPV 130
           I++   + +  FID          +GVP+GG+GSGSIGR YRGEF R  + P +      
Sbjct: 77  IKKRFIQHRLPFIDAVSHVPWRPIYGVPIGGIGSGSIGRGYRGEFCRSSLIPGMYSYDVQ 136

Query: 131 LANQFSVFVSRSNGQKYSSVLCPKTPEVLKDTTAAGIGSWDWNLKGDKSTYHALYPRAWT 190
             +QF V V ++    Y  VL P T      + A  +  W W    +K  Y  LYPR+WT
Sbjct: 137 PVDQFIVTVRKNGVTVYHQVLSPLTKP---PSNAKKLKKWKWGFDPEKGHYLGLYPRSWT 193

Query: 191 VHEGEPDPELRIVCRQISPIIPHNYKESSYPVSVFTYTIYNSGKTSADITLLFTWTNSVG 250
           V+E  P+ +L +V +QISP+IPH+Y+ +  PV+VF + + +  K +  +T+ F+W     
Sbjct: 194 VYEI-PELQLVLVYQQISPVIPHDYQTTCLPVAVFYWKVISWNKENLKVTITFSWHGPNH 252

Query: 251 GDSEFTGQHYNSKTKMNDGVHAVLLHHRTSHQLPPVTFALAAQETDGVHVSLCPHFVISG 310
                +     S+   N   +    H  T+       F   +Q +             +G
Sbjct: 253 RKRTKSSDFSKSEDGNNSSENTTNTHVSTAEN----AFPTCSQRSTAFS---------TG 299

Query: 311 NSLGLTAKDMWHEIKEHGSFDRLN--------SMETSVTSEPGSSIGAAIAASVTVPP-- 360
           + L L  +      K+  +F  L+        ++     ++    +  A++A+  VP   
Sbjct: 300 DFLKLNCQG-----KKLKTFTDLSWLGWLVGYTVPLDSKAKKSPKLAIAVSATTNVPRCN 354

Query: 361 ------DSEGQVTFSLAWDCPEVNFMSGKTYY-RRYTKFYGTHQNAAANIA-RDAILEHG 412
                  S+ ++ F++ W  P V F +G   Y RRY +++   + + A +    AI    
Sbjct: 355 ITDNLNPSQSELEFAITWHSPIVKFRTGDVVYTRRYVRWFPVDRISGAKLLLAHAIDNWR 414

Query: 413 SWELQIEAWQRPILEDKRLPEWYPITLFNELYYLNAGGAVWTDGSPPVHSLVTIGHRKFS 472
            W  +IE WQ PIL ++ LP WY   LFNELYYL+ GG VW D   P+            
Sbjct: 415 QWVQKIEDWQNPILSNQSLPSWYKSALFNELYYLSDGGTVWLD---PI------------ 459

Query: 473 LDWSQSDLKRIVDVPNQNDTAVNILERMSSILEQIYTPVALNSSFGTNLLQDGEENIGQF 532
               Q DL   + V + +  A     R+S                           IG F
Sbjct: 460 ----QVDLTEDMKVDSWDHRA-----RLS-------------------------REIGLF 485

Query: 533 LYLEGIEYLMWNTYDVHFYSSFALIMLFPKIQLSIQRDFAAAVMMHDPSKMKLLDDGQWV 592
            YLEG EY M+NTYDVH Y+S+ALI L+PKIQL++  D A   +  D + +  + +G+ +
Sbjct: 486 GYLEGHEYRMYNTYDVHHYASWALIKLWPKIQLALNYDCADLTIAEDSTSVYYIGNGKAL 545

Query: 593 SRKVLGAVPHDIGIC--DPWFEVNAYCLYDTARWKDLNPKFVLQVYRDVVATGDKKFAKA 650
            R    AV HD G    +PW   NAY ++ T  WKDLN KF+LQV+RD   T D ++   
Sbjct: 546 FRSSECAVVHDFGDPEDEPWRCTNAYIMFPTDTWKDLNSKFILQVWRDWRITQDHQYLLY 605

Query: 651 VWPSVY-VAMAYMDQFDRDGDGMIENDGFPDQTYDTWSVSGISAYSGGLWVAALQAA 706
           + P V  +    +  +D D DG+IEN GFPDQTYD W+  G++AY+GG+W++ L A 
Sbjct: 606 MLPIVLRILRKSLVAWDSDDDGLIENSGFPDQTYDVWTAKGLTAYTGGMWLSCLYAT 662


>gi|290990718|ref|XP_002677983.1| predicted protein [Naegleria gruberi]
 gi|284091593|gb|EFC45239.1| predicted protein [Naegleria gruberi]
          Length = 937

 Score =  283 bits (725), Expect = 2e-73,   Method: Compositional matrix adjust.
 Identities = 156/389 (40%), Positives = 223/389 (57%), Gaps = 24/389 (6%)

Query: 532 FLYLEGIEYLMWNTYDVHFYSSFALIMLFPKIQLSIQRDFAAAVMMHDPSKMKLLDDGQW 591
           F YLEG EYLM NTYDVHFY+S A+IM +P+IQLSIQRD   AV   +  ++        
Sbjct: 550 FFYLEGHEYLMCNTYDVHFYASHAMIMNWPEIQLSIQRDIIRAVREENNQEVTFYLSNMK 609

Query: 592 VSRKVLGAVPHDIGIC--DPWFEVNAYCLYDTARWKDLNPKFVLQVYRDVVATGDKKFAK 649
             RKV G+VPHD+G     PW  VNAY   D  +WKDLN +F+L ++RD + T +++F +
Sbjct: 610 NRRKVKGSVPHDVGTPYERPWKLVNAYNFQDVNKWKDLNSQFILTIFRDFMITKNREFLE 669

Query: 650 AVWPSVYVAMAY-MDQFDRDGDGMIENDGFPDQTYDTWSVSGISAYSGGLWVAALQAASA 708
             +PS+ +A+ Y +  FD+DGDG+IEN+ FPD TYD W V+G+SAYSG LW+A+L+A   
Sbjct: 670 EAFPSIILALNYSLTNFDKDGDGLIENESFPDTTYDAWKVTGVSAYSGMLWIASLKAILE 729

Query: 709 LAREVGDRGSEDYFLFKFQKAKVVYE-KLWN--GSYFNYDNSGSSQSSSIQADQLAGQW- 764
           +A+E+ D  +        +     +E KLW+    Y++YD S   Q  SI +D L G + 
Sbjct: 730 MAKELDDENTRQKVEPLIESGLKTFESKLWDETNQYYHYDASTEPQHDSIMSDHLHGTFM 789

Query: 765 ------YARACG-------LLPIVDEDKARSALKKVYNYNVLKVMGGKR--GAVNGMLPD 809
                 Y +  G       L+P  + ++   +L K+   N +K     +  G VNGM P 
Sbjct: 790 LLLIGNYVKNSGRCEDSHYLIPF-NMERVSKSLNKILENNFVKYQQITKLGGCVNGMRPT 848

Query: 810 GRVDMSSMQSREIWSGVTYAVAASMIHEDLADIGFQTACGIYEAAWSGTGLGYAFQTPEA 869
             VD +S+QSRE+W+G +Y VAA  I     + G      +++  WS     + FQTPEA
Sbjct: 849 QEVDKTSLQSREMWTGTSYVVAALTILLGRREEGIDLMKSVFDKCWSSNS-AFWFQTPEA 907

Query: 870 WNTDDQYRSLCYMRPLAIWAMQWALTRPK 898
                +YR+L Y RPL+IW+   A+   K
Sbjct: 908 VTETGEYRALGYCRPLSIWSCLSAMMNKK 936



 Score =  119 bits (297), Expect = 1e-23,   Method: Compositional matrix adjust.
 Identities = 121/466 (25%), Positives = 203/466 (43%), Gaps = 67/466 (14%)

Query: 40  LSTGEIPLSQFTLNWKETVQLAPIGVRILCL--IREEAAKGK---------RAFIDPFIK 88
           L    IPL   T +  E     P  +R L +  + E+  K +         RA +DP  K
Sbjct: 87  LQKTTIPLRAQTYSIGEMFHFLPGALRFLTMSYVPEKLIKKRETPFDFIMRRAVVDPRRK 146

Query: 89  RHLTSSHGVPLGGVG-SGSIGRSYRGEFQRWQIFPRVCEDKPVLANQFSVFVSRSNGQKY 147
                  G  +  +G +G   RS+    Q   I+     D  +  NQFS+ V R NG+  
Sbjct: 147 GPPIGGIGAGMFTLGMNGWFTRSHLIFNQTGGIYS----DTVLPVNQFSLRVER-NGEIE 201

Query: 148 SSVLCP----KTPEVLKDTTAAGIGSW----DWNLKGDKSTY----HALYPRAWTVHEGE 195
           S VL P    K   + KD       +W    D   + +K  +    ++L+P++WTV++  
Sbjct: 202 SFVLNPICEFKDGTMDKDLIKKLKHNWMFINDERRRKNKMQFSQQVYSLFPKSWTVYQT- 260

Query: 196 PDPELRIVCRQISPIIPHNYKESSYPVSVFTYTIYNSGKTSADITLLFTWTNSVGGDSEF 255
            D +  ++C + SP+  HNY+ESS+P   F +TI N+    A+++LLFT    +  + +F
Sbjct: 261 SDEKFTVICERYSPVWAHNYQESSFPTCNFEWTIINNTDEEANVSLLFT----IQSEEKF 316

Query: 256 TGQHYNSKTKMN--DGVHAVLLHHRTSHQLP---PVTFALAAQETDGVHVSLCPHFVISG 310
             ++  S    N  D V  + +    +        + F + A  + G  ++    F  + 
Sbjct: 317 RKKNEQSIIARNETDSVKCLTIETTVTDSFSHKDTIQFCIGANTSLG-SITFLDEFNGTN 375

Query: 311 NSLGLTAKDMWHEIKEHGSF-----DRLNSMETSVTSEPGSSIGAAIAASVTVPPDSEGQ 365
            S      D+W E   +G F     +          S+       AI + V +P +S  Q
Sbjct: 376 ES---HLHDLWQEFNNNGEFSTEFCENFAKRHDWTKSQSKHLSCHAICSKVQIPSNSSKQ 432

Query: 366 VTFSLAWDCPEVNFMSGKT----YYRRYTKF-------YGTHQNAAA-NIARDAILEHGS 413
           V F+L W+ P V F  G T    Y R YT+F          H++ A   + + ++     
Sbjct: 433 VCFNLNWNAPFVRF--GPTMEFRYLRHYTQFCTDAISSITQHKDTACFEMCKRSLDSSKQ 490

Query: 414 WELQIEAWQRPILEDKRLPEWYPI---TLFNELYYLNAGGAVWTDG 456
           W+ +I  W + +++  ++ + +P     LFNELYY+  GG +WT+G
Sbjct: 491 WKDEISQWHQGVID--QVGKDHPTAVAALFNELYYIVDGGTLWTNG 534


>gi|320105716|ref|YP_004181306.1| glucosylceramidase [Terriglobus saanensis SP1PR4]
 gi|319924237|gb|ADV81312.1| Glucosylceramidase [Terriglobus saanensis SP1PR4]
          Length = 794

 Score =  275 bits (704), Expect = 7e-71,   Method: Compositional matrix adjust.
 Identities = 159/377 (42%), Positives = 216/377 (57%), Gaps = 28/377 (7%)

Query: 532 FLYLEGIEYLMWNTYDVHFYSSFALIMLFPKIQLSIQRDFAAAVMMHDPSKMKLLDDGQW 591
           F  LE  +Y  + T DV FY+S  L+  +P+I   + R+FA  V      + +  + G W
Sbjct: 423 FALLECFDYAYYGTLDVRFYASLPLLKFWPEIDKRVLREFAETV------EKEWSEKGLW 476

Query: 592 V------------SRKVLGAVPHDIGI--CDPWFEVNAYCLYDTARWKDLNPKFVLQVYR 637
           V             RK +GAVPHD+G+   DP++ VN     DT  WKDLN KFVL VYR
Sbjct: 477 VWKTAETGSPVLHKRKKVGAVPHDLGVPEGDPFYVVNEPGWQDTNDWKDLNSKFVLMVYR 536

Query: 638 DVVATG--DKKFAKAVWPSVYVAMAYMDQFDRDGDGMIENDGFPDQTYDTWSVSGISAYS 695
           D V TG  DK F +  WP+V  A+ Y+ QFD  G G+ EN G+PDQTYD+W V G+SAYS
Sbjct: 537 DYVLTGRTDKAFLQETWPAVKAAIQYLRQFDH-GRGVPENSGYPDQTYDSWVVRGVSAYS 595

Query: 696 GGLWVAALQAASALAREVGDRGSEDYFLFKFQKAKVVY-EKLWNGSYFNYDNSGSSQSSS 754
           GGLW+ AL+A    AR +GD  ++  +   F  A+  Y   LWNG YF YD   S++   
Sbjct: 596 GGLWLGALRAGEETARVLGDWTAQAEYQALFAMAQKTYVATLWNGEYFLYDTDASNK-GV 654

Query: 755 IQADQLAGQWYARACGLLPIVDEDKARSALKKVYNYNVLKVMGGKRGAVNGMLPDGRVDM 814
           IQADQLAGQWYA   GL  +V     RS LKK+Y +NV +   G  GAVNGM  DG V +
Sbjct: 655 IQADQLAGQWYAHMTGLGDLVPRAMQRSVLKKIYAFNVNQFGDGNLGAVNGMNADGTV-V 713

Query: 815 SSMQSREIWSGVTYAVAASMIHEDLADIGFQTACGIYEAAWSGTGLGYAFQTPEAWNTDD 874
            ++++RE+W+G T   AA +    + +  +    G++   +     GY F+TPEAW+   
Sbjct: 714 DNVEAREVWAGTTLGYAALLKSLGIEEESYHVVHGLFHVIYESK--GYWFRTPEAWDVTG 771

Query: 875 QYRSLCYMRPLAIWAMQ 891
            +R+  YMRP  +W M+
Sbjct: 772 NFRASMYMRPAGLWGME 788



 Score =  199 bits (505), Expect = 8e-48,   Method: Compositional matrix adjust.
 Identities = 126/370 (34%), Positives = 178/370 (48%), Gaps = 27/370 (7%)

Query: 95  HGVPLGGVGSGSIGRSYRGEFQRWQIFPRVCEDKPVLANQFSVFVSRSNGQKYSSVLCPK 154
            G P+GG G+G+  RSYRG+F RW I   V +     AN+F++F      Q+        
Sbjct: 56  QGAPVGGFGAGTFSRSYRGDFARWHIKAGVHKYDVSYANEFAMF------QQVEGEATGV 109

Query: 155 TPEVLKDTTAAG-IGSWDWNLKGDKSTYHALYPRAWTVHEGEPDPELRIVCRQISPIIPH 213
              +L D    G + SW W+       Y ALYP++W  +  E  P   +   Q SP+IP 
Sbjct: 110 AKVLLNDHPKQGELASWSWDYPVGAGEYAALYPKSWYDYRWEKFPA-HVTLEQYSPVIPD 168

Query: 214 NYKESSYPVSVFTYTIYNSGKTSADITLLFTWTNSVGGDSEFTG------QHYNSKTKMN 267
           NY+ESSYP +V+ +   N    +  +++L +WTN  G    FT          N  T  N
Sbjct: 169 NYRESSYPTAVYRWHAENPTDKTVTVSVLLSWTNMSGWFRTFTHDFKGAPNQGNYDTYRN 228

Query: 268 DGVHAVLLHHRTSHQLPPV----TFALAAQETDGVHVSLCPHFVISGNSLGLTAKDMWHE 323
           +     ++  R+   + P      FA+AA ET GV V+    F  +G+      K +W  
Sbjct: 229 EAGMKGIVFDRSRAGVAPNEWDGQFAIAAMETPGVEVTYQTSFQAAGD-----GKAVWSH 283

Query: 324 IKEHGSFDRLNSMETSVTSEPGSSIGAAIAASVTVPPDSEGQVTFSLAWDCPEVNFMSGK 383
               G   RL + +TS  S+ G  +  AIA   T+ P  +  V   +AWD P V F  G+
Sbjct: 284 FARDG---RLPNDDTSWVSD-GEKLAGAIAVRFTLKPGEKKVVPMVIAWDLPVVQFGEGR 339

Query: 384 TYYRRYTKFYGTHQNAAANIARDAILEHGSWELQIEAWQRPILEDKRLPEWYPITLFNEL 443
            + R+YT FYGT    A  IAR+ +L    W   I+AWQ+P + D+  P WY   LFNEL
Sbjct: 340 QWNRKYTDFYGTDGKNAWKIAREGLLHAQEWSDAIDAWQKPTITDESKPLWYRGMLFNEL 399

Query: 444 YYLNAGGAVW 453
           Y L  GG  W
Sbjct: 400 YALTDGGTFW 409


>gi|195115764|ref|XP_002002426.1| GI12858 [Drosophila mojavensis]
 gi|193913001|gb|EDW11868.1| GI12858 [Drosophila mojavensis]
          Length = 877

 Score =  273 bits (697), Expect = 5e-70,   Method: Compositional matrix adjust.
 Identities = 173/463 (37%), Positives = 242/463 (52%), Gaps = 78/463 (16%)

Query: 511 VALNSSFGTNL-LQDGEENIGQFLYLEGIEYLMWNTYDVHFYSSFALIMLFPKIQLSIQR 569
           +  +SS G  L   D     G+F YLEG EY M+NTYDVHFY+S AL  L+P +Q+S+Q 
Sbjct: 414 LKCDSSLGNQLAFDDPRLAYGRFGYLEGHEYRMYNTYDVHFYASPALAHLWPNLQVSLQY 473

Query: 570 DFAAAVMMHDPSKMKLLDDGQWVSRKVLGAVPHDIGICD--PWFEVNAYCLYDTARWKDL 627
           DF  A+        K+L DG+ + RK+   VPHD+G  D  P+  +N Y ++D   WKDL
Sbjct: 474 DFKDAIGAELNDTRKMLYDGKVMPRKIKNCVPHDLGDPDEEPFTLINCYNIHDVNDWKDL 533

Query: 628 NPKFVLQVYRDVVATGDKKFAKAVWPSVYVAMAYMD------QFDRDG------------ 669
           N KFVLQVYRD     +   A+A   S + ++ ++D      Q+ +D             
Sbjct: 534 NTKFVLQVYRDYYVLNELAQAQADNASKFSSIEFIDKESLYEQYSQDNNRKSSADGKQQN 593

Query: 670 ------------------DGMI---------------------ENDGF------PDQTYD 684
                             D M+                     +NDG       PDQTYD
Sbjct: 594 RKSASMYINETNGKVYLMDAMVYLKSMYGACKAIMERTIEYDKDNDGLIENTKMPDQTYD 653

Query: 685 TWSVSGISAYSGGLWVAALQAASALAREVGDRGSEDYFLFKFQKAK-VVYEKLWNGSYFN 743
            W + G SAY  GLW+AALQA S +A  +        +    +K K  + EKLWNGSY+ 
Sbjct: 654 AWVMDGPSAYCSGLWLAALQAMSVMATLLDQPNDCLRYQDILEKGKRSLEEKLWNGSYYR 713

Query: 744 YDNSGSSQSSSIQADQLAGQWYARACGL-LPIVDEDKARSALKKVYNYNVLKVMGGKRGA 802
           +D S  +   +I ADQL G WY ++CG    I  ++  R+ALK++Y+ NV+    G  GA
Sbjct: 714 FDLS-HNHRDTIMADQLCGHWYLKSCGFDYEIYPKENVRTALKRIYDNNVMGFHDGNIGA 772

Query: 803 VNGMLPD-------GRVDMSSMQSREIWSGVTYAVAASMIHEDLADIGFQTACGIYEAAW 855
            NG + +       G VD S++Q+ E+W GV YA+AA+MI E + D  FQTA G+Y+   
Sbjct: 773 ANGFIANVSEPSKPGHVDNSNIQAEEVWPGVVYALAATMIQEGMFDEAFQTAGGMYKTLS 832

Query: 856 SGTGLGYAFQTPEAWNTDDQYRSLCYMRPLAIWAMQWALTRPK 898
              G+ Y  +TPEA   + +YRS+ YMRPL+IW+MQ A  R +
Sbjct: 833 QRIGMNY--ETPEALYGEKRYRSIGYMRPLSIWSMQVAWERRR 873



 Score =  192 bits (487), Expect = 9e-46,   Method: Compositional matrix adjust.
 Identities = 130/417 (31%), Positives = 208/417 (49%), Gaps = 56/417 (13%)

Query: 77  KGKRAFIDPFIKRHLTSSHGVPLGGVGSGSIGRSYRGEFQRWQIFPRVCEDKPVLANQFS 136
           +G+R ++D +   +    +GVP+GG+G GSIGR Y GEF R+Q+ P + E   V ANQF 
Sbjct: 13  EGRRVYMDYYYLENGKQIYGVPIGGIGGGSIGRGYAGEFCRFQLRPGIYEYNVVHANQFI 72

Query: 137 VFVSRSNGQK-YSSVL--CP--------KTPEVLKDTTAAG------------------- 166
           V +    G   + S+L  C         KTP    +   A                    
Sbjct: 73  VTIKDHKGCTIFQSLLSRCSTKSFKAKTKTPSTNNNDIDASTDEAEAQSTKCQLPHCSCS 132

Query: 167 -----IGSWDWNLKGDKSTYHALYPRAWT----VHEGEPDPELRIVCRQISPIIPHNYKE 217
                + +W  N++  + +Y  LYPRAWT     H G     +R+VCRQISP+IPH+YKE
Sbjct: 133 RNRQPLSTWHSNIEDSRCSYTGLYPRAWTEYDLSHYG-----VRLVCRQISPVIPHDYKE 187

Query: 218 SSYPVSVFTYTIYNSGKTSADITLLFTWTNSVGGDSEFTGQHYNSKTKMNDGVHAVLLHH 277
           SS P +VF +++ N       +++ FT+ N  G   +    +  ++         V +  
Sbjct: 188 SSLPCAVFVWSVQNVCDQERKVSITFTFKNGTGNKKQDAEGNPEAQLIAEGNAKGVAIRQ 247

Query: 278 RTSHQLPPVTFALAAQETDGVHVSLCPHFVISGNSLGLTAKDMWHEIKEHGSFDRLNSME 337
             +    P ++ LA +    + ++ CP F  +G     + + +W ++KE+G  +   + E
Sbjct: 248 TIADM--PCSYNLACRVLPEISITRCPQFDPAG-----SGEQLWAQLKEYGQLNEQPTSE 300

Query: 338 TSVTSEPGSSIGAAIAASVTVPPDSEGQVTFSLAWDCPEVNFMSG-KTYYRRYTKFYGTH 396
           T  T +    IG A+ A + + P +  ++ F LAWD P++ F    +T+ R YTK++   
Sbjct: 301 TLKTKD----IGVAVCAQLALKPQASHELEFVLAWDMPKIQFPRKLQTHTRYYTKYFDDS 356

Query: 397 QNAAANIARDAILEHGSWELQIEAWQRPILEDKRLPEWYPITLFNELYYLNAGGAVW 453
                 I   A+  + SWE  I+AWQRPIL D+ LP+WY   +FN+LYY++ GG +W
Sbjct: 357 GECGPRICEYALKHYASWERLIDAWQRPILNDEGLPDWYKCAIFNQLYYISDGGTIW 413


>gi|255073149|ref|XP_002500249.1| hypothetical protein MICPUN_55788 [Micromonas sp. RCC299]
 gi|226515511|gb|ACO61507.1| hypothetical protein MICPUN_55788 [Micromonas sp. RCC299]
          Length = 954

 Score =  271 bits (692), Expect = 2e-69,   Method: Compositional matrix adjust.
 Identities = 263/934 (28%), Positives = 410/934 (43%), Gaps = 159/934 (17%)

Query: 96  GVPLGGVGSGSIGRSYRGEFQRWQIFPRVCEDKPVLANQFSVFVSRSNGQKYSSVLCPKT 155
           G P+GG+G+G++GR+YRG+F RW +       +P      ++F +R +G   ++VL    
Sbjct: 63  GAPIGGMGAGTMGRTYRGDFARWHMRVGGHTHRPA---AHTLFAARVDGT--ATVLSALP 117

Query: 156 PEVLKDTTAAGIGSWDWNLK------GDKSTYHALYPRAWTVHEGE---PDPELRIVCRQ 206
              + +    G GS   + +      G   TYHALYPR+W  ++     P   +R+   Q
Sbjct: 118 SHAVMEGVPGGGGSDRTHSRRVLPPDGTGGTYHALYPRSWYEYDPSALAPGGRVRLSQTQ 177

Query: 207 ISPIIPHNYKESSYPVSVFTYTIYNSGKTSADITLLFTWTNSVG---------------- 250
            SP+IP  Y+++S PV VF +   N G   AD+ L F++ N +                 
Sbjct: 178 FSPVIPGEYRDASMPVGVFRFLASNEGDVPADVALAFSFENVLATTDVAPVDVGPPRGAW 237

Query: 251 --GDSEFTGQHYNSKTKMNDGVHA----VLLHHRTSHQLPPVTFALA---AQETDGVHVS 301
                E T   +        GVH     V+   R+  +      A+A     E  G++V+
Sbjct: 238 RLAPHELTHSTFERSGGRVAGVHMHTDDVVYGTRSPSEPWHGGIAVAVGGNGELGGLNVT 297

Query: 302 LCPHFVISGNSLGLTAKDMWHEIKEHGSFDR-----LNSMETSVTSEPGSSIGAAIAASV 356
             P +  +G++  + +K  W   +  G F R      N  +  + + P S+   A++A+ 
Sbjct: 298 TLPLY-DAGDADSIESK-FWRPFERAGGFGRDESAAANEHDGGLPAGPLSAC--AVSAAF 353

Query: 357 TVPPDSEGQVTFSLAWDCPEVNFMSGKTYYRRYTKFYGT--HQN------AAANIARDAI 408
           T+ P     VTF+ AWD P  +F  G  + +RYT+++G+  H+       AA +IA +A+
Sbjct: 354 TLAPGERRNVTFAAAWDLPASSFPRGVRWAKRYTRYHGSFIHRRDVPRGAAAPDIATEAL 413

Query: 409 LEHGSWELQIEAWQRPILEDK----------------------------RLPEWYPITLF 440
           L    WE  I  WQR  ++                              R P W    +F
Sbjct: 414 LRADEWEGDIARWQRGFVQSAAAKDANLDDPAGGRGRNYPGRVRDRPTLRRPAWLVGAMF 473

Query: 441 NELYYLNAGGAVWTDGSPPVHSLVTIGHRKFSLDWSQSDLKRIVDVPNQNDTAVNILERM 500
           NELYYL  GG +W  G P     +T+   +  ++ S++          ++    +  +  
Sbjct: 474 NELYYLVDGGTLW--GRP-----LTVAGLEGDVEESEASAAAPGGWRGEDGEGGDAQD-- 524

Query: 501 SSILEQIYTPVALNSS-FGTNLLQDGEENIGQFLYLEGIEYLMWNTYDVHFYSSFALIML 559
                     VA N   +G  L        G+F  LE  +  ++N  DVHFY S+ L ML
Sbjct: 525 ----------VAGNGGEWGGTL--------GRFGLLESFDAPVYNALDVHFYGSWPLAML 566

Query: 560 FPKIQLSIQRDFAAAVMMHDPSKMKL----LDDG---QWVSRKVLGAVPHDIGICDPWFE 612
           +P + L++  D AA V   D  +  L     D G   +   RKV G+ P D+G     F+
Sbjct: 567 WPGLDLAVLADLAAGVDSEDDEERALSWNARDAGRAPEMRKRKVRGSAPRDLGAP---FD 623

Query: 613 V------NAYCLYDTARWKDLNPKFVLQVYRDVVATGDKK------------FAKAVWPS 654
                  NA+ L D  RWKDL PK +L + R     G  +              +   P 
Sbjct: 624 APLTSSPNAFDLTDVNRWKDLAPKLMLTLARAHALRGGFRGPNDPGGIPRDVLRRMFRPC 683

Query: 655 VYVAMAYMDQFDRDGDGMIEND----GFPDQTYDTWSVSG-ISAYSGGLWVAALQAASAL 709
                A + +FD +GDG+IE+D      PD ++  W   G  SAY+GGLW+AAL+A + L
Sbjct: 684 YESLAAQLRRFDGNGDGLIEHDDDDEAGPDHSFVAWRARGDTSAYAGGLWLAALRAGAGL 743

Query: 710 AREVGDRGSEDYFLFKFQKAKVVYEK-LWNG------SYFNYDNSGSSQSSSIQADQLAG 762
           AR++ +  + D      + A V ++  LW G       Y+ +D SG+       A Q+ G
Sbjct: 744 ARDLNETHARDSLEQTMRAAAVAHDAALWLGDDDGGRGYYRFDASGTEGGDVSSAAQVMG 803

Query: 763 QWYARACGLLPIVDEDKARSALKKVYNYNVLKVMGGKRGAVNGMLPDGRVDMSSMQSREI 822
           +W     G   ++   K R+AL  V   NV    G            G VD  S  S E+
Sbjct: 804 EWALAVIGARGVLPARKVRAALASVARLNVAAAGGAGGAVNGAAA-SGSVDAVSAHSNEV 862

Query: 823 WSGVTYAVAASMIHEDLADIGFQTACGIYEAAWSGTGLGYAFQTPEAWNTDDQYRSLCYM 882
           W GV+YAVA+ M+   + D  ++ A G+    +     G AF+TPEAW+    +RS    
Sbjct: 863 WPGVSYAVASHMLLAGMDDEAWELARGVAAGTYE---RGLAFRTPEAWDARGGFRSAMSQ 919

Query: 883 RPLAIWAMQWALT---RPKPKTLEKQMKPEVTDE 913
           R  ++WA++ ALT   R +    EK  +  V DE
Sbjct: 920 RAGSVWAIEHALTLRHRREVAAWEKAGRGRVKDE 953


>gi|170036781|ref|XP_001846240.1| bile acid beta-glucosidase [Culex quinquefasciatus]
 gi|167879683|gb|EDS43066.1| bile acid beta-glucosidase [Culex quinquefasciatus]
          Length = 913

 Score =  261 bits (668), Expect = 1e-66,   Method: Compositional matrix adjust.
 Identities = 162/444 (36%), Positives = 238/444 (53%), Gaps = 80/444 (18%)

Query: 530 GQFLYLEGIEYLMWNTYDVHFYSSFALIMLFPKIQLSIQRDFAAAVMMHDPSKMKLLDDG 589
           G++ YLEG EY M+NTYDVHFY+S AL  L+P +Q+S+Q D+  AV        K L DG
Sbjct: 471 GRYAYLEGHEYRMYNTYDVHFYASHALASLWPNLQVSLQYDYKDAVQREITEGRKHLYDG 530

Query: 590 QWVSRK------------------VLGAVP-HDIG------------ICDPWFEVNAYCL 618
           + + RK                  ++ A P HD+             +   ++ +N Y  
Sbjct: 531 KVIPRKIKSSVPHDLGDPDEEPFELINAYPIHDVSEWKDLNTKFILQVYRDYYTLNHYAQ 590

Query: 619 --------YDTARWKDLNPKFVLQVY--RDVVATGDK----------------------- 645
                   + +  + D    F + V   R+ V+  DK                       
Sbjct: 591 LNADNASKFSSIEFIDKESMFEMYVQDNRNKVSPEDKAQNRKSASMYINETNGKVYLMDA 650

Query: 646 -KFAKAVWPSVYVAMAYMDQFDRDGDGMIENDGFPDQTYDTWSVSGISAYSGGLWVAALQ 704
             + K+++P+    +    ++D+D DG+IEN   PDQTYDTW + G S+Y GGLW+A+L 
Sbjct: 651 MTYLKSMYPACKQVLEKTLEWDKDNDGLIENSKSPDQTYDTWVMDGPSSYCGGLWLASLH 710

Query: 705 AASALAREVGDRGSEDYFLFK--FQKAKVVY-EKLWNGSYFNYDNSGSSQSSSIQADQLA 761
             S +A  +    SED   ++   +K K  + EKLWNGSY+ +D S SS  +SI +DQL 
Sbjct: 711 CMSVMANLLDQ--SEDCIKYREILEKGKASFEEKLWNGSYYKFD-SQSSSKNSIMSDQLC 767

Query: 762 GQWYARACGL-LPIVDEDKARSALKKVYNYNVLKVMGGKRGAVNGMLP------DGRVDM 814
           G WY R CG    IV ++  R+ALK +Y+ NV++  GG  GAVNG +P      DGR D 
Sbjct: 768 GHWYLRCCGFDYDIVPKENVRTALKTIYDNNVMRFCGGNMGAVNGYIPSSQPNKDGRADS 827

Query: 815 SSMQSREIWSGVTYAVAASMIHEDLADIGFQTACGIYEAAWSGTGLGYAFQTPEAWNTDD 874
           S++Q+ E+W+GVTYA+A++MIHE +    FQTA G+Y+       +G +F+TPEA   + 
Sbjct: 828 STLQAEEVWTGVTYALASTMIHEGMFTEAFQTAGGLYQTL--SEKIGMSFETPEALYAER 885

Query: 875 QYRSLCYMRPLAIWAMQWALTRPK 898
            YR++ YMRPL+IW+MQ A  + K
Sbjct: 886 HYRAIGYMRPLSIWSMQTAWEQKK 909



 Score =  171 bits (434), Expect = 2e-39,   Method: Compositional matrix adjust.
 Identities = 126/424 (29%), Positives = 214/424 (50%), Gaps = 50/424 (11%)

Query: 60  LAPIGVRILCLIREEAAKGKRAFIDPFIKRHLTSSHGVPLGGVGSGSIGRSYRGEFQRWQ 119
           + P+ +R +    + + +G++  +D +   +    +G P+GG+G+G+IGR + GEF R+Q
Sbjct: 50  IIPMALRYIPYYWKVSREGRQVLMDYWYTENGKQIYGAPIGGIGAGTIGRGFAGEFCRFQ 109

Query: 120 IFPRVCEDKPVLANQFSVFVS---------------RSNGQKYSSVLCPKTPEVL----- 159
           + P + E   V ANQF V +                R+   +   V  P    +L     
Sbjct: 110 MRPGLYEYNTVHANQFIVTIKDENNVTIFQSLLSSYRTKDDEVHDVCAPNQTTLLHGLMP 169

Query: 160 -KDTTAAGIGSWDWNLKGDKSTYHALYPRAWTVHEGEPDPELRIVCRQISPIIPHNYKES 218
            +      + SWD  +   K +Y ALYPRAW+ ++   +  +++V RQISPIIPH+YKES
Sbjct: 170 CETRPKTPLASWDSTINSSKCSYTALYPRAWSEYDLS-EHGIKLVQRQISPIIPHDYKES 228

Query: 219 SYPVSVFTYTIYNSGKTSADITLLFTWTNSVGGDSEFTGQHYNSKTKMNDGVHAVLLHHR 278
           S PV+VF +T+ N       +T+ FT+ N  G          N K             H 
Sbjct: 229 SLPVAVFAWTVENVCGKDRHVTITFTFKNGTG----------NKKQDAEGNAETAAFAHG 278

Query: 279 TSHQLP--------PVTFALAAQETDGVHVSLCPHFVISGNSLGLTAKDMWHEIKEHGSF 330
           T+  +         P T+ ++ + +  ++++ C  F  SGN      + +W+++KE+G  
Sbjct: 279 TAKGVSIKQTIADLPCTYCVSCKTSSEINLTRCEKFDPSGN-----GEKLWNDLKENGHL 333

Query: 331 DRLNSMETSVTSEPGSSIGAAIAASVTVPPDSEGQVTFSLAWDCPEVNFMSGKTYYRRYT 390
              +  E+  T +    +  A++A + VP +S  ++ FSL WD P+++F      Y +Y 
Sbjct: 334 TEKSGDESLKTKD----VAVAVSAQILVPSESSSELEFSLVWDMPKIHFTKKMKEYYKYY 389

Query: 391 KF-YGTHQNAAANIARDAILEHGSWELQIEAWQRPILEDKRLPEWYPITLFNELYYLNAG 449
              +G + +A   I+  A+  +G WE  I+ WQRPIL+D  LP+WY   +FNELY++  G
Sbjct: 390 TKYFGKNGDAGPVISDYALRNYGKWESLIDEWQRPILDDSDLPDWYKSAIFNELYFIADG 449

Query: 450 GAVW 453
           G++W
Sbjct: 450 GSIW 453


>gi|339252284|ref|XP_003371365.1| AP-1 complex subunit gamma-1 [Trichinella spiralis]
 gi|316968412|gb|EFV52690.1| AP-1 complex subunit gamma-1 [Trichinella spiralis]
          Length = 1615

 Score =  252 bits (643), Expect = 8e-64,   Method: Compositional matrix adjust.
 Identities = 144/361 (39%), Positives = 198/361 (54%), Gaps = 46/361 (12%)

Query: 539  EYLMWNTYDVHFYSSFALIMLFPKIQLSIQRDFAAAVMMHDPSKMKLLDDGQWVSRKVLG 598
            EY M+NTYDVHFY+SFAL  LFP+I+ SIQ +FA        + +  +  G +V      
Sbjct: 1286 EYRMYNTYDVHFYASFALAKLFPQIEHSIQAEFADQFYNVTQTNVYHISPGTYVF----- 1340

Query: 599  AVPHDIGICDPWFEVNAYCLYDTARWKDLNPKFVLQVYRD-VVATGDKKFAKAVWPSVY- 656
                     +PW EVNAY  +DT+ WKDLN KFVL  YRD ++    + F + VWP+V  
Sbjct: 1341 ---------EPWLEVNAYAFHDTSEWKDLNLKFVLTSYRDYLMLNKSESFLRHVWPAVKE 1391

Query: 657  VAMAYMDQFDRDGDGMIENDGFPDQTYDTWSVSGISAYSGGLWVAALQAASALAREVGDR 716
            +    +  +D DGD ++ENDG  DQTYD W +          W   ++ A  L       
Sbjct: 1392 IIETALQNWDHDGDKLVENDGTCDQTYDAWCMH---------WTLLVELAKRL------- 1435

Query: 717  GSEDYFLFKFQKAKVVYEKLW--NGSYFNYDNSGSSQSSSIQADQLAGQWYARACGL-LP 773
                    + QK  ++ E+ W   G+Y+ +D   S  SS++ ADQL G  Y + CGL   
Sbjct: 1436 --------EMQKKSLISEQCWMMRGTYYQFDQQ-SCNSSTVMADQLCGYAYLKICGLQTD 1486

Query: 774  IVDEDKARSALKKVYNYNVLKVMGGKRGAVNGMLPDGRVDMSSMQSREIWSGVTYAVAAS 833
            I+  D  +  L+ +YN NV     G  GAVNGML  G  D SS+Q+ E W+GVTY ++A 
Sbjct: 1487 ILPLDNVKKVLETIYNLNVCSFGNGTLGAVNGMLYSGEKDTSSLQADEAWTGVTYFLSAH 1546

Query: 834  MIHEDLADIGFQTACGIYEAAWSGTGLGYAFQTPEAWNTDDQYRSLCYMRPLAIWAMQWA 893
            MI E   + GF TA GIY++ +   G+ Y  QTPEA      +R++ YMRPL+IWA+QW 
Sbjct: 1547 MISEGFVEQGFSTASGIYKSCFESFGMHY--QTPEALYEKKWFRAVGYMRPLSIWAIQWY 1604

Query: 894  L 894
            L
Sbjct: 1605 L 1605



 Score =  176 bits (447), Expect = 4e-41,   Method: Compositional matrix adjust.
 Identities = 117/386 (30%), Positives = 186/386 (48%), Gaps = 22/386 (5%)

Query: 77   KGKRAFIDPFIKRHLTSSHGVPLGGVGSGSIGRSYRGEFQRWQIFPRVCED--KPVLANQ 134
            +G    ID F    L   +GVP G +GSGSIGR   G F ++ + P + E     V A+ 
Sbjct: 899  RGTSMTIDSFRPLQLKPIYGVPCGTIGSGSIGRHILGGFCKFSLVPGIVEHDVSSVPADN 958

Query: 135  FSVFVSRSNGQKYSSVLCPKTPEVLKDTTAAGIGSWDWNLKGDKSTYHALYPRAWTVHEG 194
            F V V  +    Y  VL       +K      + SW +N   +  +Y   YP +WT ++ 
Sbjct: 959  FIVTVRDAGVTIYQQVL-----SAVKVENKKFLSSWLFNFPAECMSYLGRYPMSWTTYKL 1013

Query: 195  EPDPELRIVCRQISPIIPHNYKESSYPVSVFTYTIYNSGKTSADITLLFTWTNSVG-GDS 253
             P   L ++C Q SP+IPHNYK+S  P+++F +T+ +    S+D+T+  T+T   G G  
Sbjct: 1014 -PCHNLILICHQYSPVIPHNYKDSCLPLTLFKWTVMSEESLSSDLTVSITFTFRNGTGRP 1072

Query: 254  EFTGQHYNSKT----KMNDGVHAVLLHHRTSHQLPPVTFALAAQETDGVHVSLCPHFVIS 309
                QH         +M+  VH V +    +       + +AA  T  + V+   H   +
Sbjct: 1073 RVDDQHQMRANSFLERMSSSVHCVGVSLHQTIDGMNCDYCVAALNTKSISVT---HSTFN 1129

Query: 310  GNSLGLTAKDMWHEIKEHGSFDRLNSMETSVTSEPGSSIGAAIAASVTVPPDSEGQVTFS 369
                G    ++W  + E G     ++ E  V  +   +I  +   ++   P  EG + FS
Sbjct: 1130 PEGCG---SEIWKSLYETGRLSNESTDEFVVVEDRPLAIAVSCCTNIR-SPCQEG-IVFS 1184

Query: 370  LAWDCPEVNFMSGK-TYYRRYTKFYGTHQNAAANIARDAILEHGSWELQIEAWQRPILED 428
            L W  PE+ F + + TY RRY+ ++    +AA  ++  A+     +  +I+ WQ PIL++
Sbjct: 1185 LVWHMPEIYFGNKRRTYLRRYSDWFDESSDAAKRLSIYAMENREIFLSKIDEWQEPILKN 1244

Query: 429  KRLPEWYPITLFNELYYLNAGGAVWT 454
              LP+WY   +FNELYYL  GG+VW 
Sbjct: 1245 DNLPKWYKSAIFNELYYLTDGGSVWV 1270


>gi|440680630|ref|YP_007155425.1| Glucosylceramidase [Anabaena cylindrica PCC 7122]
 gi|428677749|gb|AFZ56515.1| Glucosylceramidase [Anabaena cylindrica PCC 7122]
          Length = 802

 Score =  251 bits (642), Expect = 1e-63,   Method: Compositional matrix adjust.
 Identities = 147/385 (38%), Positives = 216/385 (56%), Gaps = 29/385 (7%)

Query: 529 IGQFLYLEGIEYLMWNTYDVHFYSSFALIMLFPKIQLSIQRDFAAAVMMHDPSKMKLLD- 587
           IGQF  LE ++Y  + + DV  Y SFAL+MLFP+++ S+ R FA A+  H    ++++  
Sbjct: 420 IGQFAVLECLDYRWYESLDVRLYGSFALLMLFPELEKSVIRAFARAIP-HSDDYLRVIGY 478

Query: 588 ------DGQWVSRKVLGAVPHDIGICDP--WFEVNAYCLYDTARWKDLNPKFVLQVYRDV 639
                 D     RKV GA PHD+G  +   W + N  C  D   WKDL   FVLQVYRD 
Sbjct: 479 YYTIGADTTTAVRKVAGATPHDLGAPNEHVWEKTNYTCYQDCNLWKDLGSDFVLQVYRDF 538

Query: 640 VATG--DKKFAKAVWPSVYVAMAYMDQFDRDGDGMIENDGFPDQTYDTWSVSGISAYSGG 697
           + TG  D +F    W ++   + Y+  FD DGDG+ EN G PDQT+D W + G+SAY GG
Sbjct: 539 IFTGGQDVEFLADCWDAIVQTLDYLKTFDLDGDGIPENSGAPDQTFDDWRLQGVSAYCGG 598

Query: 698 LWVAALQAASALAREVGD-RGSEDYFLFKFQKA----------KVVYEKLWNGSYFNYDN 746
           LW+AAL++A A+   + + RG+E+    + QK+           +  EKLWNG Y+  D+
Sbjct: 599 LWLAALESAIAICDILTNRRGAENAEKIRSQKSVYQSWLQQSLPIYQEKLWNGQYYRLDS 658

Query: 747 SGSSQSSSIQADQLAGQWYARACGLLPIVDEDKARSALKKVYNYNVLKVMGGKRGAVNGM 806
              S S+ + ADQL GQ+YAR  GL  IV  D A SAL+ VY+   +K   G+ GA NG+
Sbjct: 659 --ESGSNVVMADQLCGQFYARLLGLPDIVPNDNALSALQTVYDACFVKFCNGQFGAANGV 716

Query: 807 LPDGRVDM-SSMQSREIWSGVTYAVAASMIHEDLADIGFQTACGIYEAAWSGTGLGYAFQ 865
            PDG  +   +    E+W+G+ + +AA +I   + +   +    + +  +     G  F+
Sbjct: 717 RPDGSPENPQATHPLEVWTGINFGLAAFLIQMGMKNEAMRLTQAVVQQIYDN---GLQFR 773

Query: 866 TPEAWNTDDQYRSLCYMRPLAIWAM 890
           TPEA  +   +R+  Y+R +AIWA+
Sbjct: 774 TPEAITSAGTFRASTYLRAMAIWAI 798



 Score =  192 bits (487), Expect = 9e-46,   Method: Compositional matrix adjust.
 Identities = 131/392 (33%), Positives = 189/392 (48%), Gaps = 45/392 (11%)

Query: 95  HGVPLGGVGSGSIGRSYRGEFQRWQIFPRVCEDKPVLANQFSVFVSRSNGQKYSSVLCPK 154
           HG+PLGG G+G IGRS RG+F  W I           A QFS+F S     K +  L  +
Sbjct: 40  HGMPLGGFGAGCIGRSSRGDFNLWHIDGGEHIFNTFPACQFSIFESTGISTK-AYALSSQ 98

Query: 155 TPEVLKDTTAAGIGSWDW----NLKGDKSTYHALYPRAWTVHEGEPDPELRIVCRQISPI 210
            PE   D T   + +W W    +   +   YHALYPR+W V+E     +L   C Q SPI
Sbjct: 99  PPE---DGT---LQTWQWYPASSTTEETGKYHALYPRSWFVYENVFQTDL--TCEQFSPI 150

Query: 211 IPHNYKESSYPVSVFTYTIYNSGKTSADITLLFTWTNSVGGDSEFTGQHYNSKTKMNDGV 270
             +NY+E+SYPV+VF +  +N       ++++ TW N VG    FT    + + ++ D  
Sbjct: 151 WANNYQETSYPVAVFVWKAHNPTNAPITLSIMLTWENMVGW---FTNALKSPQVQIRDDG 207

Query: 271 HAVLLHH---------------RTSHQ---LPPVTFALAAQETDGV-------HVSLCPH 305
             V  +                 T H    L  V       E DG        H  +   
Sbjct: 208 SPVYEYQPRLGASQGNYNQIVENTDHFSCFLGRVASGEPVGEGDGSWSIATPKHPQVEIF 267

Query: 306 FVISGNSLGLTAKDMWHEIKEHGSFDRLNSMETSVTSEPGSSIGAAIAASVTVPPDSEGQ 365
           +    +S+G T  D+W      GS   L + + +  ++  + +GAAIA   T+ P    +
Sbjct: 268 YHTRWDSVG-TGADIWQSFATDGS---LANNQDATPADENTRLGAAIAIRFTLQPGETLE 323

Query: 366 VTFSLAWDCPEVNFMSGKTYYRRYTKFYGTHQNAAANIARDAILEHGSWELQIEAWQRPI 425
           + F+LAWD P   F +G  YYRRYT F+G +   A  IA   +LE+ +W   I  WQ PI
Sbjct: 324 IPFTLAWDFPVTEFSAGINYYRRYTDFFGRNGENAGKIATTGLLEYQNWRSHILTWQNPI 383

Query: 426 LEDKRLPEWYPITLFNELYYLNAGGAVWTDGS 457
           L+ + LP+W+ + LFNELY L +GG +W+  S
Sbjct: 384 LDREDLPKWFKMALFNELYDLTSGGTLWSAAS 415


>gi|434407175|ref|YP_007150060.1| putative bile acid beta-glucosidase [Cylindrospermum stagnale PCC
           7417]
 gi|428261430|gb|AFZ27380.1| putative bile acid beta-glucosidase [Cylindrospermum stagnale PCC
           7417]
          Length = 816

 Score =  250 bits (639), Expect = 3e-63,   Method: Compositional matrix adjust.
 Identities = 146/385 (37%), Positives = 211/385 (54%), Gaps = 28/385 (7%)

Query: 529 IGQFLYLEGIEYLMWNTYDVHFYSSFALIMLFPKIQLSIQRDFAAAVMMHDPSK------ 582
            GQF  LE ++Y  + + DV  Y SFAL+MLFP+++ S+ R FA A+   D  +      
Sbjct: 428 FGQFAVLECLDYRWYESLDVRLYGSFALLMLFPELEKSVIRAFARAIPQSDDHQRIIGYY 487

Query: 583 MKLLDDGQWVSRKVLGAVPHDIGICDP--WFEVNAYCLYDTARWKDLNPKFVLQVYRDVV 640
             +  D     RK+ GA PHD+G  +   W + N  C  D   WKDL   FVLQVYRD +
Sbjct: 488 YTIGADTTTAVRKIAGATPHDLGAPNEHVWEKTNYTCYQDCNLWKDLGSDFVLQVYRDFL 547

Query: 641 ATG--DKKFAKAVWPSVYVAMAYMDQFDRDGDGMIENDGFPDQTYDTWSVSGISAYSGGL 698
            TG  D +F    W ++   + Y+  FD+DGDG+ EN G PDQT+D W + G+SAY GGL
Sbjct: 548 FTGADDVEFLADCWDAIVQTLDYLKGFDKDGDGIPENSGAPDQTFDDWRLQGVSAYCGGL 607

Query: 699 WVAALQAASAL----------AREVGDRGSED--YFLFKFQKAKVVYEKLWNGSYFNYDN 746
           W+AAL+ A A+          A+   +  ++   Y  +  Q   V  EKLWNG Y+  D+
Sbjct: 608 WLAALECAIAICDILLTNRQDAKNAKNLETQRSIYGAWLAQSLTVYEEKLWNGQYYRLDS 667

Query: 747 SGSSQSSSIQADQLAGQWYARACGLLPIVDEDKARSALKKVYNYNVLKVMGGKRGAVNGM 806
              S +  + ADQL GQ+YAR  GL  IV  D+A SAL+ VY+   LK   G+ GA NG+
Sbjct: 668 --ESGTEVVMADQLCGQFYARLLGLPDIVKCDRALSALQTVYDACFLKFFDGQFGAANGV 725

Query: 807 LPDGRVDM-SSMQSREIWSGVTYAVAASMIHEDLADIGFQTACGIYEAAWSGTGLGYAFQ 865
            PDG  +  ++    E+W+G+ + +AA ++   + D   +    +    +     G  F+
Sbjct: 726 RPDGVPENPNATHPLEVWTGINFGLAAFLVQMGMQDEALRLTQAVVSQIYDN---GLQFR 782

Query: 866 TPEAWNTDDQYRSLCYMRPLAIWAM 890
           TPEA  T   +R+  Y+RP+AIWA+
Sbjct: 783 TPEAITTTGTFRASTYLRPMAIWAI 807



 Score =  195 bits (495), Expect = 1e-46,   Method: Compositional matrix adjust.
 Identities = 139/401 (34%), Positives = 197/401 (49%), Gaps = 55/401 (13%)

Query: 95  HGVPLGGVGSGSIGRSYRGEFQRWQIFPRVCEDKPVLANQFSVFVSR-SNGQKYSSVLCP 153
           HG+PLGG G+G IGRS RG+F  W I       K + A QFSVF S  ++ Q Y+  L  
Sbjct: 40  HGMPLGGFGAGCIGRSSRGDFNLWHIDGGEHTFKNIPACQFSVFESHGASSQAYA--LST 97

Query: 154 KTPEVLKDTTAAGIGSWDWNLKG------------DKSTYHALYPRAWTVHEGEPDPELR 201
           + PE   D T   + +W W  +             D  TYHALYPR+W V+E     +L 
Sbjct: 98  EPPE---DGT---LSAWQWYPRSAESKAGKMPTPQDTGTYHALYPRSWFVYENVFQAQLS 151

Query: 202 IVCRQISPIIPHNYKESSYPVSVFTYTIYNSGKTSADITLLFTWTNSVGGDSEFTGQHYN 261
             C Q SP+  +NY+E+SYPV+VF +  +N       ++++ TW N VG    FT    +
Sbjct: 152 --CEQFSPVWANNYQETSYPVAVFNWKAHNPTDAPITLSIMLTWENMVGW---FTNALKS 206

Query: 262 SKTKMNDGVHAVLLHHR-------TSHQLPPVTFALAA-----------QETDGV-HVSL 302
            K ++ D    V  +           HQL   T  +             QE DG   ++ 
Sbjct: 207 PKVQIRDDGSPVYEYQPRLGKSQGNYHQLIENTQQVGCVLGRVASDAPVQEGDGTWCIAT 266

Query: 303 CPHFVIS------GNSLGLTAKDMWHEIKEHGSFDRLNSMETSVTSEPGSSIGAAIAASV 356
             H +I        N +G T  ++W      GS    N ++ +  +E  + IGAA+A   
Sbjct: 267 RKHPLIEVFHHTRWNPVG-TGDEVWQSFAADGSLP--NYVDETAAAE-NTRIGAALAVRF 322

Query: 357 TVPPDSEGQVTFSLAWDCPEVNFMSGKTYYRRYTKFYGTHQNAAANIARDAILEHGSWEL 416
           T+ P    +V F LAWD P   F +G  YYRRYT F+G     A  IA  ++ E+ +W  
Sbjct: 323 TLQPGECLEVPFVLAWDFPVTEFAAGVNYYRRYTDFFGRSGQNAEAIAFLSLQEYQNWRS 382

Query: 417 QIEAWQRPILEDKRLPEWYPITLFNELYYLNAGGAVWTDGS 457
           QI+ WQ+PIL  + LP W+ + LFNELY L +GG +W+  S
Sbjct: 383 QIQNWQKPILVREDLPPWFKMALFNELYDLTSGGTLWSAAS 423


>gi|298490121|ref|YP_003720298.1| glucosylceramidase ['Nostoc azollae' 0708]
 gi|298232039|gb|ADI63175.1| Glucosylceramidase ['Nostoc azollae' 0708]
          Length = 806

 Score =  249 bits (637), Expect = 4e-63,   Method: Compositional matrix adjust.
 Identities = 147/388 (37%), Positives = 219/388 (56%), Gaps = 32/388 (8%)

Query: 529 IGQFLYLEGIEYLMWNTYDVHFYSSFALIMLFPKIQLSIQRDFAAAVMMHDPSKMKLLD- 587
           IGQF  LE ++Y  + + DV  Y SFAL++LFP+++ S+ R FA A+  H    ++++  
Sbjct: 420 IGQFAVLECLDYRWYESLDVRLYGSFALLILFPELEKSVIRAFARAIP-HSDDHLRVIGY 478

Query: 588 ------DGQWVSRKVLGAVPHDIGICDPWF--EVNAYCLYDTARWKDLNPKFVLQVYRDV 639
                 + +   RKV  A PHD+G  + W   + N  C  D   WKDL+  FVLQVYRD 
Sbjct: 479 YYTIGAETKKAIRKVAKATPHDLGAPNEWVWEKTNYTCYQDCNLWKDLSSDFVLQVYRDF 538

Query: 640 VATG--DKKFAKAVWPSVYVAMAYMDQFDRDGDGMIENDGFPDQTYDTWSVSGISAYSGG 697
           + TG  D +F    W ++   + Y+  FD DGDG+ EN G PDQT+D W ++G+SAY GG
Sbjct: 539 LFTGSHDVQFLVDCWDAIVQTLDYLKTFDLDGDGIPENSGAPDQTFDDWLLNGVSAYCGG 598

Query: 698 LWVAALQAASALAREVGD-RGSED------------YFLFKFQKAKVVYE-KLWNGSYFN 743
           LW+AAL++A A+A  + + RG+E+             +    Q+++ +YE KLWNG Y+ 
Sbjct: 599 LWLAALESAIAIADILTNHRGTENTEKTDKLRSQKSIYQSWLQQSRPIYEQKLWNGQYYR 658

Query: 744 YDNSGSSQSSSIQADQLAGQWYARACGLLPIVDEDKARSALKKVYNYNVLKVMGGKRGAV 803
            D+   S S  + ADQL GQ+Y R  GL  IV  + A SAL+ VY    +K   GK GA 
Sbjct: 659 LDS--ESGSDVVMADQLCGQFYTRLLGLPDIVPSNHAHSALQTVYEACFVKFYDGKFGAA 716

Query: 804 NGMLPDGRVDM-SSMQSREIWSGVTYAVAASMIHEDLADIGFQTACGIYEAAWSGTGLGY 862
           NG+LPDG  +   +    E+W+G+ + +AA ++   + +   Q    + E  +     G 
Sbjct: 717 NGVLPDGSPENPKATHPLEVWTGINFGLAAFLVQMGMKNEAMQLTQAVVEQIYDN---GL 773

Query: 863 AFQTPEAWNTDDQYRSLCYMRPLAIWAM 890
            F+TPEA      +R+  Y+RP+AIWA+
Sbjct: 774 QFRTPEAITPVGTFRASTYLRPMAIWAI 801



 Score =  192 bits (489), Expect = 6e-46,   Method: Compositional matrix adjust.
 Identities = 132/409 (32%), Positives = 195/409 (47%), Gaps = 54/409 (13%)

Query: 85  PFIKRHLTSS-----HGVPLGGVGSGSIGRSYRGEFQRWQIFPRVCEDKPVLANQFSVFV 139
           P+  R++++      HG+PLGG G+G IGRS RG+F  W I       K   + QFSVF 
Sbjct: 25  PYTVRYVSNIDDGPWHGMPLGGFGAGCIGRSSRGDFNLWHIDGGEHLFKSFPSCQFSVFE 84

Query: 140 SRSNGQKYSSVLCPKTPEVLKDTTAAGIGSWDW----NLKGDKSTYHALYPRAWTVHEGE 195
           S  +  K  ++    + E+ +D T   + SW W           TYHALYPR+W V+E  
Sbjct: 85  STGSSSKAYAL----STEIPEDGT---LQSWQWYPPSTTTQATGTYHALYPRSWFVYENV 137

Query: 196 PDPELRIVCRQISPIIPHNYKESSYPVSVFTYTIYNSGKTSADITLLFTWTNSVG----- 250
              + ++ C Q SPI  +NY+E+SYPV+VF +  +N       ++++ TW N VG     
Sbjct: 138 L--QTQLTCEQFSPIWANNYQETSYPVAVFVWKAHNPTNAPITLSIMLTWENIVGWFTNA 195

Query: 251 -----------GDSEFTGQHYNSKTKMNDGVHAVLLHHRTSHQLPPVTFALAAQETDGVH 299
                      G   +  Q    ++K N   H V         L  V   L  QE DG  
Sbjct: 196 LKSPDLQIRDDGSPVYEYQPRLGESKGNYN-HIVENTEFFGCFLGRVPTGLPIQEGDGSW 254

Query: 300 VSLCP-----------HFVISGNSLGLTAKDMWHEIKEHGSFDRLNSMETSVTSEPGSSI 348
               P           HF    N +G +  D+W      G    L +   +  ++  + +
Sbjct: 255 CIATPKNPRMEVFYHTHF----NPVG-SGADLWESFSTDG---YLPNYLNATPADADTRL 306

Query: 349 GAAIAASVTVPPDSEGQVTFSLAWDCPEVNFMSGKTYYRRYTKFYGTHQNAAANIARDAI 408
           GAA+A   T+ P    ++ F+LAWD P   F  G  YYRRYT F+  + + A  I    +
Sbjct: 307 GAALAVRCTLQPGETLEIPFTLAWDFPVTEFAEGINYYRRYTDFFSRYGDNAFAITSRGL 366

Query: 409 LEHGSWELQIEAWQRPILEDKRLPEWYPITLFNELYYLNAGGAVWTDGS 457
            ++  W  QI+ WQ+PIL+ + LPEW+ + LFNELY L +GG +W+  S
Sbjct: 367 EQYQVWRAQIQNWQKPILDREDLPEWFKMALFNELYDLTSGGTLWSAAS 415


>gi|425435596|ref|ZP_18816046.1| Similar to tr|P73619|P73619 [Microcystis aeruginosa PCC 9432]
 gi|389679839|emb|CCH91411.1| Similar to tr|P73619|P73619 [Microcystis aeruginosa PCC 9432]
          Length = 784

 Score =  249 bits (636), Expect = 5e-63,   Method: Compositional matrix adjust.
 Identities = 142/379 (37%), Positives = 209/379 (55%), Gaps = 17/379 (4%)

Query: 526 EENIGQFLYLEGIEYLMWNTYDVHFYSSFALIMLFPKIQLSIQRDFAAAVMMHDPSKMKL 585
           ++ IGQF  LE I+Y  + + DV  Y SF L+ML+P+++ ++   FA A+  HD +   +
Sbjct: 412 QDPIGQFGVLECIDYRWYESLDVRLYGSFGLLMLWPRLEKAVMEAFARAIPTHDDTPRII 471

Query: 586 LDDGQWVSRKVLGAVPHDIGICD--PWFEVNAYCLYDTARWKDLNPKFVLQVYRDVVATG 643
             +     RK   A PHD+G  +  PW + N     D   WKDL   FVL VYRD + TG
Sbjct: 472 GYNQASAIRKAKNATPHDLGAPNEHPWQKSNYTSYQDCNLWKDLGSDFVLLVYRDYLLTG 531

Query: 644 --DKKFAKAVWPSVYVAMAYMDQFDRDGDGMIENDGFPDQTYDTWSVSGISAYSGGLWVA 701
             D+ F +  WPS+ + + Y+  FD D DG+ EN G PDQT+D W + GISAY GGLW+A
Sbjct: 532 AKDEDFLRECWPSIVLTLQYLKTFDLDNDGIPENSGAPDQTFDDWRLQGISAYCGGLWIA 591

Query: 702 ALQAASALAREVGDRGSEDYFLFK--FQKAKVVY-EKLWNGSYFNYDNSGSSQSSSIQAD 758
           AL+A   +   +G    ED  +F    Q+++ +Y + LWNG Y+N D+   S S  +  D
Sbjct: 592 ALEATLKIGAILG----EDTAIFASWLQQSRAIYHQTLWNGEYYNLDS--GSGSDVVMTD 645

Query: 759 QLAGQWYARACGLLPIVDEDKARSALKKVYNYNVLKVMGGKRGAVNGMLPDGRVDM-SSM 817
           QL GQ+YAR   L  +V+    +SAL K+Y    LK  GGK GA NG+ PDG  +   + 
Sbjct: 646 QLCGQFYARLLALPDVVENQYTQSALSKIYEACFLKFHGGKFGAANGLKPDGTPENPEAT 705

Query: 818 QSREIWSGVTYAVAASMIHEDLADIGFQTACGIYEAAWSGTGLGYAFQTPEAWNTDDQYR 877
              E+W G+ + + A ++   +   G+Q    +    +     G  F+TPEA  +   +R
Sbjct: 706 HPLEVWVGINFGLVAFLLQMGMEKQGWQITEAVVRQIYEN---GLQFRTPEAITSVGTFR 762

Query: 878 SLCYMRPLAIWAMQWALTR 896
           +  Y+R +AIWA+   LT 
Sbjct: 763 ACHYLRAMAIWAIYGVLTE 781



 Score =  185 bits (469), Expect = 1e-43,   Method: Compositional matrix adjust.
 Identities = 116/389 (29%), Positives = 181/389 (46%), Gaps = 47/389 (12%)

Query: 93  SSHGVPLGGVGSGSIGRSYRGEFQRWQIFPRVCEDKPVLANQFSVFVSRSNGQKYSSVLC 152
           S HG+PLGG G+G IGRS +G+F  W +         + A QFS+F    +G   +  + 
Sbjct: 39  SWHGMPLGGFGAGCIGRSPKGDFNLWHLDGGEHSFNSLAACQFSIFEQTEDGNAQAYAMA 98

Query: 153 PKTPEVLKDTTAAGIGSWDWNLKGDKSTYHALYPRAWTVHEGEPDPELRIVCRQISPIIP 212
            ++P      T   +  W W    +K  Y ALYPR+W  +E +   + +++C Q SP+  
Sbjct: 99  TESP------TDGTLSRWSW-YPAEKGNYSALYPRSW--YEYKDVFQTQLICEQFSPVWA 149

Query: 213 HNYKESSYPVSVFTYTIYNSGKTSADITLLFTWTNSVG---------------------- 250
           HNY+ESSYP+++F +T +N       I+++ TW N++G                      
Sbjct: 150 HNYQESSYPIALFEWTAHNPTDKPITISIMLTWQNTIGWFTNAIKSPVVKVRDDGSPVYE 209

Query: 251 -----GDSEFTGQHYNSKTKMNDGVHAVLLHHRTSHQLPPVTFALAAQETDGVHVSLCPH 305
                G+SE    +YN     N  V  ++     +        A+A+     + V     
Sbjct: 210 YQSKWGESE---GNYNQWIVDNFRVGCLMRRDYENPGEGDGQMAIASITNPSLEVFYLGK 266

Query: 306 FVISGNSLGLTAKDMWHEIKEHGSFDRLNSMETSVTSEPGSSIGAAIAASVTVPPDSEGQ 365
           +  +G+       ++W     +GS   L  +E    ++ G     A+A   T+ P    +
Sbjct: 267 WNPAGD-----GGEVWDYFAMNGS---LPDIEDETPAQKGEQTATAMAIRFTIRPGKTRK 318

Query: 366 VTFSLAWDCPEVNFMSGKTYYRRYTKFYGTHQNAAANIARDAILEHGSWELQIEAWQRPI 425
           + F LAWD P   F  G  YYRRYT F+G +   A  + R A+ +  +W   I  WQ+PI
Sbjct: 319 IPFILAWDLPVTEFAQGINYYRRYTDFFGRNGQNAWAMVRTALKQGDTWRENIINWQKPI 378

Query: 426 LEDKRLPEWYPITLFNELYYLNAGGAVWT 454
           LE   L +W+ + LFNELY L  GG +WT
Sbjct: 379 LERSDLADWFKMALFNELYLLTDGGTLWT 407


>gi|443669591|ref|ZP_21134796.1| hypothetical protein C789_5336 [Microcystis aeruginosa DIANCHI905]
 gi|159029778|emb|CAO87856.1| unnamed protein product [Microcystis aeruginosa PCC 7806]
 gi|443330115|gb|ELS44858.1| hypothetical protein C789_5336 [Microcystis aeruginosa DIANCHI905]
          Length = 784

 Score =  249 bits (636), Expect = 5e-63,   Method: Compositional matrix adjust.
 Identities = 153/423 (36%), Positives = 225/423 (53%), Gaps = 32/423 (7%)

Query: 495 NILERMSSILEQI----YTPVALNSSFGTNLLQDG---------EENIGQFLYLEGIEYL 541
           NI+     ILE+I    +  +AL +     LL DG         ++ IGQF  LE I+Y 
Sbjct: 370 NIINWQKPILERIDLADWFKMALFNEL--YLLTDGGTLWTAATEQDPIGQFGVLECIDYR 427

Query: 542 MWNTYDVHFYSSFALIMLFPKIQLSIQRDFAAAVMMHDPSKMKLLDDGQWVSRKVLGAVP 601
            + + DV  Y SF L+ML+P+++ ++   FA A+   D +   +  +     RK   A P
Sbjct: 428 WYESLDVRLYGSFGLLMLWPRLEKAVMEAFARAIPSSDDTPRIIGYNQASAIRKAKNATP 487

Query: 602 HDIGICD--PWFEVNAYCLYDTARWKDLNPKFVLQVYRDVVATG--DKKFAKAVWPSVYV 657
           HD+G  +  PW + N     D   WKDL   FVL VYRD + TG  D+ F +  WPS+ +
Sbjct: 488 HDLGAPNEHPWQKSNYTSYQDCNLWKDLGSDFVLLVYRDYLLTGAKDEDFLRECWPSIVL 547

Query: 658 AMAYMDQFDRDGDGMIENDGFPDQTYDTWSVSGISAYSGGLWVAALQAASALAREVGDRG 717
            + Y+  FD D DG+ EN G PDQT+D W + GISAY GGLW+AAL+A   +   +G   
Sbjct: 548 TLQYLKTFDLDNDGIPENSGAPDQTFDDWRLQGISAYCGGLWIAALEATLKIGEIIG--- 604

Query: 718 SEDYFLFK--FQKAKVVY-EKLWNGSYFNYDNSGSSQSSSIQADQLAGQWYARACGLLPI 774
            ED  +F    Q+++ +Y + LWNG Y+N D+   S S  +  DQL GQ+YAR   L  +
Sbjct: 605 -EDTAIFTSWLQQSRAIYHQTLWNGEYYNLDS--GSGSDVVMTDQLCGQFYARLLALPDV 661

Query: 775 VDEDKARSALKKVYNYNVLKVMGGKRGAVNGMLPDGRVDM-SSMQSREIWSGVTYAVAAS 833
           V+    +SAL K+Y    LK  GGK GA NG+ PDG  +   +    E+W G+ + + A 
Sbjct: 662 VENQYTQSALSKIYQACFLKFHGGKFGAANGLKPDGTPENPEATHPLEVWVGINFGLVAF 721

Query: 834 MIHEDLADIGFQTACGIYEAAWSGTGLGYAFQTPEAWNTDDQYRSLCYMRPLAIWAMQWA 893
           ++   +   G+Q    +    +     G  F+TPEA  +   +R+  Y+R +AIWA+   
Sbjct: 722 LLQMGMEKQGWQITEAVVRQVYEN---GLQFRTPEAITSVGTFRACHYLRAMAIWAIYGV 778

Query: 894 LTR 896
           LT 
Sbjct: 779 LTE 781



 Score =  188 bits (478), Expect = 1e-44,   Method: Compositional matrix adjust.
 Identities = 119/387 (30%), Positives = 183/387 (47%), Gaps = 47/387 (12%)

Query: 95  HGVPLGGVGSGSIGRSYRGEFQRWQIFPRVCEDKPVLANQFSVFVSRSNGQKYSSVLCPK 154
           HG+PLGG G+G IGRS +G+F  W +         + A QFS+F    +G   +  +  +
Sbjct: 41  HGMPLGGFGAGCIGRSPKGDFNLWHLDGGEHSFNSLAACQFSIFEQTEDGNAQAYAMATE 100

Query: 155 TPEVLKDTTAAGIGSWDWNLKGDKSTYHALYPRAWTVHEGEPDPELRIVCRQISPIIPHN 214
           +P   KD T   +  W W    +K TY ALYPR+W  +E +   + +++C Q SP+  HN
Sbjct: 101 SP---KDGT---LSRWSW-YPAEKGTYSALYPRSW--YEYKDVFQTQLICEQFSPVWAHN 151

Query: 215 YKESSYPVSVFTYTIYNSGKTSADITLLFTWTNSVG------------------------ 250
           Y+ESSYP+++F +T++N       I+++ TW N++G                        
Sbjct: 152 YQESSYPIALFEWTVHNPTDKPITISIMLTWQNTIGWFTNAIKSPVVKVRDDGSPVYEYQ 211

Query: 251 ---GDSEFTGQHYNSKTKMNDGVHAVLLHHRTSHQLPPVTFALAAQETDGVHVSLCPHFV 307
              G+SE    +YN     N  V  ++     +        A+A+     + V     + 
Sbjct: 212 SKWGESE---GNYNQWIVDNFRVGCLMRRDYENPGEGDGQMAIASITNPSLEVFYLGKWN 268

Query: 308 ISGNSLGLTAKDMWHEIKEHGSFDRLNSMETSVTSEPGSSIGAAIAASVTVPPDSEGQVT 367
            +G+       ++W     +GS   L  +E    +E G     A+A   T+ P    ++ 
Sbjct: 269 PAGD-----GGEVWDYFAMNGS---LPDIEDETPAEKGKQTATAMAIRFTIRPGKTRKIP 320

Query: 368 FSLAWDCPEVNFMSGKTYYRRYTKFYGTHQNAAANIARDAILEHGSWELQIEAWQRPILE 427
           F LAWD P   F  G  YYRRYT F+G +   A  + R A+ +   W   I  WQ+PILE
Sbjct: 321 FILAWDLPVTEFAQGINYYRRYTDFFGRNGQNAWAMVRTALKQGDKWRENIINWQKPILE 380

Query: 428 DKRLPEWYPITLFNELYYLNAGGAVWT 454
              L +W+ + LFNELY L  GG +WT
Sbjct: 381 RIDLADWFKMALFNELYLLTDGGTLWT 407


>gi|443320525|ref|ZP_21049620.1| putative bile acid beta-glucosidase [Gloeocapsa sp. PCC 73106]
 gi|442789771|gb|ELR99409.1| putative bile acid beta-glucosidase [Gloeocapsa sp. PCC 73106]
          Length = 806

 Score =  248 bits (633), Expect = 1e-62,   Method: Compositional matrix adjust.
 Identities = 154/403 (38%), Positives = 216/403 (53%), Gaps = 30/403 (7%)

Query: 518 GTNLLQDGEEN--IGQFLYLEGIEYLMWNTYDVHFYSSFALIMLFPKIQLSIQRDFAAAV 575
           G  L     EN  IGQF  LE I+Y  + + DV  Y SFAL ML+P++  +I   FA A+
Sbjct: 404 GGTLWTAATENDPIGQFAVLECIDYRWYESLDVRLYGSFALAMLWPRLDKAILEAFARAI 463

Query: 576 MMHDPSKMKLLDDGQWVSRKVLGAVPHDIGICD--PWFEVNAYCLYDTARWKDLNPKFVL 633
              D S   +  +     RK+ GA PHD+G  +  PW + N     D   WKDL+  FVL
Sbjct: 464 PQSDESMRVIGYNQAEAQRKIAGATPHDLGAPNEHPWEKTNYTSYQDCNLWKDLSSDFVL 523

Query: 634 QVYRDVVATG--DKKFAKAVWPSVYVAMAYMDQFDRDGDGMIENDGFPDQTYDTWSVSGI 691
           QVYRD V TG  D  F    WPS+   + Y+  FD DGDG+ EN G PDQT+D W + GI
Sbjct: 524 QVYRDFVLTGKEDTDFLWECWPSITQTLNYLKSFDLDGDGIPENAGAPDQTFDDWKLEGI 583

Query: 692 SAYSGGLWVAALQAASALAREVGDR-------GSEDY-----FLFKFQKA------KVVY 733
           SAY GGLW+AAL+A+ A+   + D           DY        K  ++       + +
Sbjct: 584 SAYCGGLWIAALEASIAIGAVLLDNPPMNPQLQPSDYPEGIEATLKTHQSWLTLARSIYH 643

Query: 734 EKLWNGSYFNYDNSGSSQSSSIQADQLAGQWYARACGLLPIVDEDKARSALKKVYNYNVL 793
           ++LWNG Y++ D    S S  + ADQLAGQ+YA+  GL  +V+++  +S LK +Y    L
Sbjct: 644 QQLWNGQYYSLDT--KSGSVVVMADQLAGQFYAQLLGLPDVVEKEYVQSTLKTIYQSCFL 701

Query: 794 KVMGGKRGAVNGMLPDGR-VDMSSMQSREIWSGVTYAVAASMIHEDLADIGFQTACGIYE 852
               GK GA NG+  DG   +  S    E+W+G+ +A+ A +IH  + D G + A  I +
Sbjct: 702 NFHQGKIGAANGVKLDGTGENPQSTHPLEVWTGINFALGAFLIHSGMRDEGLKLAETIVK 761

Query: 853 AAWSGTGLGYAFQTPEAWNTDDQYRSLCYMRPLAIWAMQWALT 895
             +     G  F+TPEA  T   +R+  Y+R +AIWA+ + L 
Sbjct: 762 QIYD---QGLQFRTPEALTTSGTFRASHYLRAMAIWAIDYTLN 801



 Score =  195 bits (495), Expect = 1e-46,   Method: Compositional matrix adjust.
 Identities = 131/391 (33%), Positives = 181/391 (46%), Gaps = 53/391 (13%)

Query: 95  HGVPLGGVGSGSIGRSYRGEFQRWQIFPRVCEDKPVLANQFSVFVSRSNGQKYSSVLCPK 154
           HG+PLGG G+G +GRS RG+F  W         + + A QFSVF     G+  +  +C K
Sbjct: 41  HGMPLGGFGAGCVGRSPRGDFNLWHFDGGEHIFRSLDACQFSVFEHPEQGEVKAYGMCTK 100

Query: 155 TPEVLKDTTAAGIGSWDWNLKGDKSTYHALYPRAWTVHEGEPDPELRIVCRQISPIIPHN 214
            PE   D T   +  W W  + D+ TYHALYPR+W  ++G    EL  VC Q SPI   N
Sbjct: 101 PPE---DGT---LSRWRWYPE-DQGTYHALYPRSWFEYQGVFATEL--VCTQFSPIWAKN 151

Query: 215 YKESSYPVSVFTYTIYNSGKTSADITLLFTWTNSVG------------------------ 250
           Y+ESSYP+++F +T YN       ++++ TW N VG                        
Sbjct: 152 YQESSYPLAIFEWTAYNPTDQPITLSIMLTWQNVVGWFTNAIKSPEIRIRDDGSPVYQYQ 211

Query: 251 ---GDSEFTGQHYNSKTKMNDGVHAVLLHHRTSH----QLPPVTFALAAQETDGVHVSLC 303
              GDS  TG   N    + DG     L +R       Q     F +A+     + V   
Sbjct: 212 PRWGDS--TG---NYNQWIQDGSRVGCLFNRIQPHEELQEGEGQFCIASILNPSLEVFYQ 266

Query: 304 PHFVISGNSLGLTAKDMWHEIKEHGSFDRLNSMETSVTSEPGSSIGAAIAASVTVPPDSE 363
             +   G+       D+W +    GS   L   +    +EPG  I  AIA   TV P   
Sbjct: 267 TRWHPRGD-----GSDVWDDFARDGS---LGDRQDETPAEPGEQIAGAIAVRFTVRPGKT 318

Query: 364 GQVTFSLAWDCPEVNFMSGKTYYRRYTKFYGTHQNAAANIARDAILEHGSWELQIEAWQR 423
             + F L WD P + F  G  YYRRYT F+G +   A ++ R A+     W+ +I+ WQ 
Sbjct: 319 RLIPFVLTWDLPILEFAPGVNYYRRYTDFFGRNGQNAWSMVRTALKHGDMWQDKIDQWQS 378

Query: 424 PILEDKRLPEWYPITLFNELYYLNAGGAVWT 454
           P+++    P W  + LFNELY L  GG +WT
Sbjct: 379 PVVQRSDYPTWLKMALFNELYLLTQGGTLWT 409


>gi|257060089|ref|YP_003137977.1| glucosylceramidase [Cyanothece sp. PCC 8802]
 gi|256590255|gb|ACV01142.1| Glucosylceramidase [Cyanothece sp. PCC 8802]
          Length = 805

 Score =  247 bits (630), Expect = 3e-62,   Method: Compositional matrix adjust.
 Identities = 149/407 (36%), Positives = 213/407 (52%), Gaps = 30/407 (7%)

Query: 518 GTNLLQDGEEN--IGQFLYLEGIEYLMWNTYDVHFYSSFALIMLFPKIQLSIQRDFAAAV 575
           G  L     EN  +GQF  LE ++Y  + + DV  Y SF L+ML+P+++ S+   FA A+
Sbjct: 404 GGTLWTAASENDPVGQFGVLECLDYRWYESLDVRLYGSFGLMMLWPRLEKSVLEAFARAI 463

Query: 576 MMHDPSKMKLLDDGQWVSRKVLGAVPHDIGICD--PWFEVNAYCLYDTARWKDLNPKFVL 633
             HD +   +  +     RK  GA PHD+G  +  PW + N     D   WKDL   FVL
Sbjct: 464 PTHDDTPRIIGYNQAKAIRKAKGATPHDLGAPNEHPWQKTNYTSYQDCNLWKDLGSDFVL 523

Query: 634 QVYRDVVATG--DKKFAKAVWPSVYVAMAYMDQFDRDGDGMIENDGFPDQTYDTWSVSGI 691
           QVYRD + TG  D  F    WP++   + Y+  FD D DG+ EN G PDQT+D W + GI
Sbjct: 524 QVYRDYLLTGSDDTDFLWECWPAITETLDYLKTFDLDNDGIPENSGAPDQTFDDWKLQGI 583

Query: 692 SAYSGGLWVAALQAASALA----------REVGDRGS--------EDYFLFKFQKAKVVY 733
           SAY GGLW+AAL+AA  +A           E+  R +        +++  +  Q   + +
Sbjct: 584 SAYCGGLWIAALEAAIKIAEILLKNVPTTEELQSRNNPESIKHYVKNHRDWLEQSRSIYH 643

Query: 734 EKLWNGSYFNYDNSGSSQSSSIQADQLAGQWYARACGLLPIVDEDKARSALKKVYNYNVL 793
           + LWNG Y+  D+   S S  + ADQL GQ+YAR   L  +V+     SAL KVY    L
Sbjct: 644 DTLWNGEYYKLDS--QSGSDVVMADQLCGQFYARLLNLPDVVETQYTESALNKVYEACFL 701

Query: 794 KVMGGKRGAVNGMLPDGRV-DMSSMQSREIWSGVTYAVAASMIHEDLADIGFQTACGIYE 852
           K   GK GA NGM PDG   D +S   +E+W+G+ + +AA ++     D  F+    + +
Sbjct: 702 KFQDGKYGAANGMKPDGTPEDPNSTHPQEVWTGINFGLAAFLLQMGRKDAAFKLTEAVVK 761

Query: 853 AAWSGTGLGYAFQTPEAWNTDDQYRSLCYMRPLAIWAMQWALTRPKP 899
             +     G  F+TPEA      +R+  Y+R +AIW +   LT  +P
Sbjct: 762 QVYEN---GLQFRTPEAITAVGTFRASHYLRAMAIWGIYGILTHFRP 805



 Score =  195 bits (495), Expect = 1e-46,   Method: Compositional matrix adjust.
 Identities = 128/389 (32%), Positives = 181/389 (46%), Gaps = 41/389 (10%)

Query: 94  SHGVPLGGVGSGSIGRSYRGEFQRWQIFPRVCEDKPVLANQFSVFVSRSNGQKYSSVLCP 153
           +HG+PLGG GSG IGRS  G+F  W +       + + + QFSVF            L  
Sbjct: 40  NHGMPLGGFGSGCIGRSPSGDFNLWHLDGGEHVFRSIPSCQFSVFEQPEGEDAQVYALST 99

Query: 154 KTPEVLKDTTAAGIGSWDWNLKGDKSTYHALYPRAWTVHEGEPDPELRIVCRQISPIIPH 213
           + PE         +  W W    +K TYHALYPR+W  +EG    +L  +C Q SPIIP+
Sbjct: 100 QPPE------EGTLSRWSW-YPPEKGTYHALYPRSWYQYEGVFKSQL--ICEQFSPIIPN 150

Query: 214 NYKESSYPVSVFTYTIYNSGKTSADITLLFTWTNSVG----------------GDSEFTG 257
           NY+E+SYP+ +F +T +N       ++++ TW N VG                G  E+  
Sbjct: 151 NYQETSYPIGIFEWTAHNPTDKPITLSIMVTWQNIVGWFTNAIKSPEITVRDDGSPEYQY 210

Query: 258 QHY--NSKTKMNDGVHAVLLHHRTSHQLPPVTFALAAQETDG-------VHVSLCPHFVI 308
           Q    +S    N  V     + R    L  +      QE +G        + S+   ++ 
Sbjct: 211 QPRWGDSTGNFNQWVQD---NFRVGFILNRIQLHDQIQEGEGQICIASVTNPSVEVFYLG 267

Query: 309 SGNSLGLTAKDMWHEIKEHGSFDRLNSMETSVTSEPGSSIGAAIAASVTVPPDSEGQVTF 368
             N  G    ++W     +GS   L   E    +EPG  I  A+A   T+ P    ++ F
Sbjct: 268 KWNPKG-DGSEVWDYFAMNGS---LPDTEDETPAEPGEQIAVAMAIRFTIRPGRTKKIPF 323

Query: 369 SLAWDCPEVNFMSGKTYYRRYTKFYGTHQNAAANIARDAILEHGSWELQIEAWQRPILED 428
            LAWD P   F  G  YYRRYT FYG +      + R A+     W  ++E WQ PIL+ 
Sbjct: 324 ILAWDFPVTEFAPGVQYYRRYTDFYGRNGKNGWAMVRTALKHSDVWYERVEEWQSPILKR 383

Query: 429 KRLPEWYPITLFNELYYLNAGGAVWTDGS 457
           + LP+W+ + LFNELY L  GG +WT  S
Sbjct: 384 EDLPDWFKMALFNELYLLTDGGTLWTAAS 412


>gi|422303239|ref|ZP_16390593.1| Similar to tr|P73619|P73619 [Microcystis aeruginosa PCC 9806]
 gi|389791807|emb|CCI12404.1| Similar to tr|P73619|P73619 [Microcystis aeruginosa PCC 9806]
          Length = 784

 Score =  247 bits (630), Expect = 3e-62,   Method: Compositional matrix adjust.
 Identities = 141/379 (37%), Positives = 208/379 (54%), Gaps = 17/379 (4%)

Query: 526 EENIGQFLYLEGIEYLMWNTYDVHFYSSFALIMLFPKIQLSIQRDFAAAVMMHDPSKMKL 585
           ++ IGQF  LE I+Y  + + DV  Y SF L+ML+P+++ ++   FA A+   D +   +
Sbjct: 412 QDPIGQFGVLECIDYRWYESLDVRLYGSFGLLMLWPRLEKAVMEAFARAIPSSDDTPRII 471

Query: 586 LDDGQWVSRKVLGAVPHDIGICD--PWFEVNAYCLYDTARWKDLNPKFVLQVYRDVVATG 643
             +     RK   A PHD+G  +  PW + N     D   WKDL   FVL VYRD + TG
Sbjct: 472 GYNQASAIRKAKNATPHDLGAPNEHPWQKSNYTSYQDCNLWKDLGSDFVLLVYRDYLLTG 531

Query: 644 --DKKFAKAVWPSVYVAMAYMDQFDRDGDGMIENDGFPDQTYDTWSVSGISAYSGGLWVA 701
             D+ F +  WPS+ + + Y+  FD D DG+ EN G PDQT+D W + GISAY GGLW+A
Sbjct: 532 AKDEDFLRECWPSIVLTLQYLKTFDLDNDGIPENSGAPDQTFDDWRLQGISAYCGGLWIA 591

Query: 702 ALQAASALAREVGDRGSEDYFLFK--FQKAKVVY-EKLWNGSYFNYDNSGSSQSSSIQAD 758
           AL+A   +   +G    ED  +F    Q+++ +Y + LWNG Y+N D+   S S  +  D
Sbjct: 592 ALEATIKIGEIIG----EDTAIFTSWLQQSRAIYHQTLWNGEYYNLDS--GSGSDVVMTD 645

Query: 759 QLAGQWYARACGLLPIVDEDKARSALKKVYNYNVLKVMGGKRGAVNGMLPDGRVDM-SSM 817
           QL GQ+YAR   L  +V+    +SAL K+Y    LK  GGK GA NG+ PDG  +   + 
Sbjct: 646 QLCGQFYARLLSLPDVVENQYTQSALSKIYQSCFLKFHGGKFGAANGLKPDGTPENPEAT 705

Query: 818 QSREIWSGVTYAVAASMIHEDLADIGFQTACGIYEAAWSGTGLGYAFQTPEAWNTDDQYR 877
              E+W G+ + + A ++   +   G+Q    +    +     G  F+TPEA  +   +R
Sbjct: 706 HPLEVWVGINFGLVAFLMQMGMEKQGWQITEAVVRQVYEN---GLQFRTPEAITSVGTFR 762

Query: 878 SLCYMRPLAIWAMQWALTR 896
           +  Y+R +AIWA+   LT 
Sbjct: 763 ACHYLRAMAIWAIYGVLTE 781



 Score =  184 bits (467), Expect = 2e-43,   Method: Compositional matrix adjust.
 Identities = 119/383 (31%), Positives = 179/383 (46%), Gaps = 39/383 (10%)

Query: 95  HGVPLGGVGSGSIGRSYRGEFQRWQIFPRVCEDKPVLANQFSVFVSRSNGQKYSSVLCPK 154
           HG+PLGG G+G IGRS +G+F  W +         + A QFS+F    +G   +  +  +
Sbjct: 41  HGMPLGGFGAGCIGRSPKGDFNLWHLDGGEHSFNSLAACQFSIFEQTEDGSAQAYAMATE 100

Query: 155 TPEVLKDTTAAGIGSWDWNLKGDKSTYHALYPRAWTVHEGEPDPELRIVCRQISPIIPHN 214
           +P      T   +  W W    +K  Y ALYPR+W  +E +   + +++C Q SP+  HN
Sbjct: 101 SP------TDGTLSRWSW-YPAEKGNYSALYPRSW--YEYKDVFQTQLICEQFSPVWAHN 151

Query: 215 YKESSYPVSVFTYTIYNSGKTSADITLLFTWTNSVGGDSEFTGQHYNSKTKMNDGVHAVL 274
           Y+ESSYP+++F +T +N       I+++ TW N++G    FT    +   K+ D    V 
Sbjct: 152 YQESSYPIALFEWTAHNPTDKPITISIMLTWQNTIGW---FTNAIKSPVVKVRDDGSPVY 208

Query: 275 LHHR-------TSHQLPPVTFALAA---------QETDG-------VHVSLCPHFVISGN 311
            +           +Q     F +            E DG        + SL   ++   N
Sbjct: 209 EYQSKWGESEGNYNQWIVDNFRVGCLMGRDYENPGEGDGQMAIASITNPSLEVFYLGKWN 268

Query: 312 SLGLTAKDMWHEIKEHGSFDRLNSMETSVTSEPGSSIGAAIAASVTVPPDSEGQVTFSLA 371
             G    ++W     +GS   L  +E    +E G     A+A   T+ P    ++ F LA
Sbjct: 269 PAG-DGGEVWDYFAMNGS---LPDIEDETPAEKGEQTATAMAIRFTIRPGKTRKIPFILA 324

Query: 372 WDCPEVNFMSGKTYYRRYTKFYGTHQNAAANIARDAILEHGSWELQIEAWQRPILEDKRL 431
           WD P   F  G  YYRRYT F+G +   A  + R A+    +W   I  WQ+PILE   L
Sbjct: 325 WDLPVTEFAQGINYYRRYTDFFGRNGQNAWAMVRTALKHGDTWRENIINWQKPILERSDL 384

Query: 432 PEWYPITLFNELYYLNAGGAVWT 454
            +W+ + LFNELY L  GG +WT
Sbjct: 385 ADWFKMALFNELYLLTDGGTLWT 407


>gi|425442611|ref|ZP_18822852.1| Similar to tr|P73619|P73619 [Microcystis aeruginosa PCC 9717]
 gi|389716273|emb|CCH99457.1| Similar to tr|P73619|P73619 [Microcystis aeruginosa PCC 9717]
          Length = 784

 Score =  246 bits (629), Expect = 3e-62,   Method: Compositional matrix adjust.
 Identities = 140/379 (36%), Positives = 209/379 (55%), Gaps = 17/379 (4%)

Query: 526 EENIGQFLYLEGIEYLMWNTYDVHFYSSFALIMLFPKIQLSIQRDFAAAVMMHDPSKMKL 585
           ++ IGQF  LE I+Y  + + DV  Y SF L+ML+P+++ ++   F+ A+   D +   +
Sbjct: 412 QDPIGQFGVLECIDYRWYESLDVRLYGSFGLLMLWPRLEKAVMEAFSRAIPSSDDTPRII 471

Query: 586 LDDGQWVSRKVLGAVPHDIGICD--PWFEVNAYCLYDTARWKDLNPKFVLQVYRDVVATG 643
             +     RK   A PHD+G  +  PW + N     D   WKDL   FVL VYRD + TG
Sbjct: 472 GYNQASAIRKAKNATPHDLGAPNEHPWQKSNYTSYQDCNLWKDLGSDFVLLVYRDYLLTG 531

Query: 644 --DKKFAKAVWPSVYVAMAYMDQFDRDGDGMIENDGFPDQTYDTWSVSGISAYSGGLWVA 701
             D+ F +  WPS+ + + Y+  FD D DG+ EN G PDQT+D W + GISAY GGLW+A
Sbjct: 532 AKDEDFLRECWPSIVLTLQYLKTFDLDNDGIPENSGAPDQTFDDWRLQGISAYCGGLWIA 591

Query: 702 ALQAASALAREVGDRGSEDYFLFK--FQKAKVVY-EKLWNGSYFNYDNSGSSQSSSIQAD 758
           AL+A   +   +G    ED  +F    Q+++ +Y + LWNG Y+N D+   S S  +  D
Sbjct: 592 ALEATIKIGEIIG----EDTAIFTSWLQQSRAIYHQTLWNGEYYNLDS--GSGSDVVMTD 645

Query: 759 QLAGQWYARACGLLPIVDEDKARSALKKVYNYNVLKVMGGKRGAVNGMLPDGRVDM-SSM 817
           QL GQ+YAR   L  +V+    +SAL K+Y    LK  GGK GA NG+ PDG  +   + 
Sbjct: 646 QLCGQFYARLLSLPDVVENQYTQSALSKIYQACFLKFHGGKFGAANGLKPDGTPENPEAT 705

Query: 818 QSREIWSGVTYAVAASMIHEDLADIGFQTACGIYEAAWSGTGLGYAFQTPEAWNTDDQYR 877
              E+W G+ + + A ++   +   G+Q    + +  +     G  F+TPEA  +   +R
Sbjct: 706 HPLEVWVGINFGLVAFLLQMGMEKQGWQITEAVVKQVYEN---GLQFRTPEAITSVGTFR 762

Query: 878 SLCYMRPLAIWAMQWALTR 896
           +  Y+R +AIWA+   LT 
Sbjct: 763 ACHYLRAMAIWAIYGVLTE 781



 Score =  185 bits (470), Expect = 9e-44,   Method: Compositional matrix adjust.
 Identities = 121/383 (31%), Positives = 182/383 (47%), Gaps = 39/383 (10%)

Query: 95  HGVPLGGVGSGSIGRSYRGEFQRWQIFPRVCEDKPVLANQFSVFVSRSNGQKYSSVLCPK 154
           HG+PLGG G+G IGRS +G+F  W +         + A QFS+F    +G   +  +  +
Sbjct: 41  HGMPLGGFGAGCIGRSPKGDFNLWHLDGGEHSFNSLAACQFSIFEQTEDGSAQAYAMATE 100

Query: 155 TPEVLKDTTAAGIGSWDWNLKGDKSTYHALYPRAWTVHEGEPDPELRIVCRQISPIIPHN 214
           +P   KD T   +  W W    +K +Y ALYPR+W  +E +   + +++C Q SP+  HN
Sbjct: 101 SP---KDGT---LSRWSW-YPAEKGSYSALYPRSW--YEYKDVFQTQLICEQFSPVWAHN 151

Query: 215 YKESSYPVSVFTYTIYNSGKTSADITLLFTWTNSVGGDSEFTGQHYNSKTKMNDGVHAVL 274
           Y+ESSYP+++F +T +N       I+++ TW N++G    FT    +   K+ D    V 
Sbjct: 152 YQESSYPIALFEWTAHNPTDKPITISIMLTWQNTIGW---FTNAIKSPVVKVRDDGSPVY 208

Query: 275 LHHR-------TSHQLPPVTFALAA---------QETDG-------VHVSLCPHFVISGN 311
            +           +Q     F +            E DG        + SL   ++   N
Sbjct: 209 EYQSKWGESEGNYNQWIVDNFRVGCLMGRDYENPGEGDGQMAIASITNPSLEVFYLGKWN 268

Query: 312 SLGLTAKDMWHEIKEHGSFDRLNSMETSVTSEPGSSIGAAIAASVTVPPDSEGQVTFSLA 371
             G    ++W     +GS   L  +E    +E G     A+A   T+ P    ++ F LA
Sbjct: 269 PAG-DGGEVWDYFAMNGS---LPDIEDETPAEKGEQTATAMAIRFTIRPGKTRKIPFILA 324

Query: 372 WDCPEVNFMSGKTYYRRYTKFYGTHQNAAANIARDAILEHGSWELQIEAWQRPILEDKRL 431
           WD P   F  G  YYRRYT F+G +   A  + R A+    +W   I  WQ+PILE   L
Sbjct: 325 WDLPVTEFAQGINYYRRYTDFFGRNGQNAWAMVRTALKHGDTWRENIINWQKPILERSDL 384

Query: 432 PEWYPITLFNELYYLNAGGAVWT 454
            +W+ + LFNELY L  GG +WT
Sbjct: 385 ADWFKMALFNELYLLTDGGTLWT 407


>gi|425447791|ref|ZP_18827773.1| Similar to tr|P73619|P73619 [Microcystis aeruginosa PCC 9443]
 gi|389731620|emb|CCI04385.1| Similar to tr|P73619|P73619 [Microcystis aeruginosa PCC 9443]
          Length = 784

 Score =  246 bits (628), Expect = 4e-62,   Method: Compositional matrix adjust.
 Identities = 141/379 (37%), Positives = 209/379 (55%), Gaps = 17/379 (4%)

Query: 526 EENIGQFLYLEGIEYLMWNTYDVHFYSSFALIMLFPKIQLSIQRDFAAAVMMHDPSKMKL 585
           ++ IGQF  LE I+Y  + + DV  Y SF L+ML+P+++ ++   FA A+   D +   +
Sbjct: 412 QDPIGQFGVLECIDYRWYESLDVRLYGSFGLLMLWPRLEKAVMEAFARAIPSSDDTPRII 471

Query: 586 LDDGQWVSRKVLGAVPHDIGICD--PWFEVNAYCLYDTARWKDLNPKFVLQVYRDVVATG 643
             +     RK   A PHD+G  +  PW + N     D   WKDL   FVL VYRD + TG
Sbjct: 472 GYNQASAIRKAKNATPHDLGAPNEHPWQKSNYTSYQDCNLWKDLGSDFVLLVYRDYLLTG 531

Query: 644 --DKKFAKAVWPSVYVAMAYMDQFDRDGDGMIENDGFPDQTYDTWSVSGISAYSGGLWVA 701
             D+ F +  WPS+ + + Y+  FD D DG+ EN G PDQT+D W + GISAY GGLW+A
Sbjct: 532 AKDEDFLRECWPSIVLTLQYLKTFDLDNDGIPENSGAPDQTFDDWRLQGISAYCGGLWIA 591

Query: 702 ALQAASALAREVGDRGSEDYFLFK--FQKAKVVY-EKLWNGSYFNYDNSGSSQSSSIQAD 758
           AL+A   +   +G    ED  +F    Q+++ +Y + LWNG Y+N D+   S S  +  D
Sbjct: 592 ALEATLKIGAILG----EDTAIFASWLQQSRAIYHQTLWNGEYYNLDS--GSGSDVVMTD 645

Query: 759 QLAGQWYARACGLLPIVDEDKARSALKKVYNYNVLKVMGGKRGAVNGMLPDGRVDM-SSM 817
           QL GQ+YAR   L  +V+    +SAL K+Y    LK  GGK GA NG+ PDG  +   + 
Sbjct: 646 QLCGQFYARLLSLPDVVENQYTQSALSKIYEACFLKFHGGKFGAANGLKPDGTPENPEAT 705

Query: 818 QSREIWSGVTYAVAASMIHEDLADIGFQTACGIYEAAWSGTGLGYAFQTPEAWNTDDQYR 877
              E+W G+ + + A ++   +   G+Q    + +  +     G  F+TPEA  +   +R
Sbjct: 706 HPLEVWVGINFGLVAFLLQMGMEKQGWQITEAVVQQIYEN---GLQFRTPEAITSVGTFR 762

Query: 878 SLCYMRPLAIWAMQWALTR 896
           +  Y+R +AIWA+   LT 
Sbjct: 763 ACHYLRAMAIWAIYGVLTE 781



 Score =  169 bits (427), Expect = 9e-39,   Method: Compositional matrix adjust.
 Identities = 116/387 (29%), Positives = 181/387 (46%), Gaps = 47/387 (12%)

Query: 95  HGVPLGGVGSGSIGRSYRGEFQRWQIFPRVCEDKPVLANQFSVFVSRSNGQKYSSVLCPK 154
           HG+PLGG G G IGRS +G+F  W +         + A QFS+F    +G   +  +  +
Sbjct: 41  HGMPLGGFGGGCIGRSPKGDFNLWHLDGGEHSFNSLAACQFSIFEQTEDGSAQAYAMATE 100

Query: 155 TPEVLKDTTAAGIGSWDWNLKGDKSTYHALYPRAWTVHEGEPDPELRIVCRQISPIIPHN 214
           +P+       A +  W W    +K TY ALYPR+W  +E +   + +++C Q SP+  HN
Sbjct: 101 SPKY------ATLSRWLW-YPAEKGTYSALYPRSW--YEYKDVFQTQLICEQFSPVWAHN 151

Query: 215 YKESSYPVSVFTYTIYNSGKTSADITLLFTWTNSVG------------------------ 250
           Y+ESSYP+++F +T++N       I+++ TW N++G                        
Sbjct: 152 YQESSYPIALFEWTVHNPTDKPITISIMLTWQNTIGWFTNAIKSPVVKVRDDGSPVYEYQ 211

Query: 251 ---GDSEFTGQHYNSKTKMNDGVHAVLLHHRTSHQLPPVTFALAAQETDGVHVSLCPHFV 307
              G+SE    +YN     N  V  ++     +        A+A+     + V     + 
Sbjct: 212 SKWGESE---GNYNQWIVDNFRVGCLMRRDYENPGEGDGQMAIASITNPSLEVFYLGKWN 268

Query: 308 ISGNSLGLTAKDMWHEIKEHGSFDRLNSMETSVTSEPGSSIGAAIAASVTVPPDSEGQVT 367
            +G+       ++W     +GS   L  +E    ++ G     A+A   T+ P    ++ 
Sbjct: 269 PAGD-----GSEVWDYFAMNGS---LPDIEDETPAQKGEQTATAMAIRFTIRPGKTRKIP 320

Query: 368 FSLAWDCPEVNFMSGKTYYRRYTKFYGTHQNAAANIARDAILEHGSWELQIEAWQRPILE 427
           F LAWD P   F  G  YYRRYT F+G +   A  + R A+    +W   I  WQ+PILE
Sbjct: 321 FILAWDLPVTEFAQGINYYRRYTDFFGRNGQNAWAMVRTALKHGDTWRENIINWQKPILE 380

Query: 428 DKRLPEWYPITLFNELYYLNAGGAVWT 454
              L +W+ + LFNELY L  GG +WT
Sbjct: 381 RSDLADWFKMALFNELYLLADGGTLWT 407


>gi|425450263|ref|ZP_18830094.1| Similar to tr|P73619|P73619 [Microcystis aeruginosa PCC 7941]
 gi|389768962|emb|CCI06046.1| Similar to tr|P73619|P73619 [Microcystis aeruginosa PCC 7941]
          Length = 784

 Score =  246 bits (627), Expect = 5e-62,   Method: Compositional matrix adjust.
 Identities = 141/379 (37%), Positives = 208/379 (54%), Gaps = 17/379 (4%)

Query: 526 EENIGQFLYLEGIEYLMWNTYDVHFYSSFALIMLFPKIQLSIQRDFAAAVMMHDPSKMKL 585
           ++ IGQF  LE I+Y  + + DV  Y SF L+ML+P+++ ++   FA A+   D +   +
Sbjct: 412 QDPIGQFGVLECIDYRWYESLDVRLYGSFGLLMLWPRLEKAVMEAFARAIPSSDDTPRII 471

Query: 586 LDDGQWVSRKVLGAVPHDIGICD--PWFEVNAYCLYDTARWKDLNPKFVLQVYRDVVATG 643
             +     RK   A PHD+G  +  PW + N     D   WKDL   FVL VYRD + TG
Sbjct: 472 GYNQASAIRKAKNATPHDLGAPNEHPWQKSNYTSYQDCNLWKDLGSDFVLLVYRDYLLTG 531

Query: 644 --DKKFAKAVWPSVYVAMAYMDQFDRDGDGMIENDGFPDQTYDTWSVSGISAYSGGLWVA 701
             D+ F +  WPS+ + + Y+  FD D DG+ EN G PDQT+D W + GISAY GGLW+A
Sbjct: 532 AKDEDFLRECWPSIVLTLQYLKTFDLDNDGIPENSGAPDQTFDDWRLQGISAYCGGLWIA 591

Query: 702 ALQAASALAREVGDRGSEDYFLFK--FQKAKVVY-EKLWNGSYFNYDNSGSSQSSSIQAD 758
           AL+A   +   +G    ED  +F    Q+++ +Y + LWNG Y+N D+   S S  +  D
Sbjct: 592 ALEATIKIGAILG----EDTAIFASWLQQSRAIYHQTLWNGEYYNLDS--GSGSDVVMTD 645

Query: 759 QLAGQWYARACGLLPIVDEDKARSALKKVYNYNVLKVMGGKRGAVNGMLPDGRVDM-SSM 817
           QL GQ+YAR   L  +V+    +SAL K+Y    LK  GGK GA NG+ PDG  +   + 
Sbjct: 646 QLCGQFYARLLALPDVVENQYTQSALSKIYQACFLKFHGGKFGAANGLKPDGTPENPEAT 705

Query: 818 QSREIWSGVTYAVAASMIHEDLADIGFQTACGIYEAAWSGTGLGYAFQTPEAWNTDDQYR 877
              E+W G+ + + A ++   +   G+Q    +    +     G  F+TPEA  +   +R
Sbjct: 706 HPLEVWVGINFGLVAFLLQMGMEKQGWQITQAVVRQIYEN---GLQFRTPEAITSVGTFR 762

Query: 878 SLCYMRPLAIWAMQWALTR 896
           +  Y+R +AIWA+   LT 
Sbjct: 763 ACHYLRAMAIWAIYGVLTE 781



 Score =  182 bits (462), Expect = 8e-43,   Method: Compositional matrix adjust.
 Identities = 114/387 (29%), Positives = 180/387 (46%), Gaps = 47/387 (12%)

Query: 95  HGVPLGGVGSGSIGRSYRGEFQRWQIFPRVCEDKPVLANQFSVFVSRSNGQKYSSVLCPK 154
           HG+PLGG G+G IGRS +G+F  W +         + A QFS+F    +G   +  +  +
Sbjct: 41  HGMPLGGFGAGCIGRSPKGDFNLWHLDGGEHSFNSLPACQFSIFEQTEDGSAQAYAMATE 100

Query: 155 TPEVLKDTTAAGIGSWDWNLKGDKSTYHALYPRAWTVHEGEPDPELRIVCRQISPIIPHN 214
           +P      T   +  W W    +K  Y ALYPR+W  +E +   + +++C Q SP+  HN
Sbjct: 101 SP------TDGTLSRWSW-YPAEKGNYGALYPRSW--YEYKDVFQTQLICEQFSPVWAHN 151

Query: 215 YKESSYPVSVFTYTIYNSGKTSADITLLFTWTNSVG------------------------ 250
           Y+ESSYP+++F +T +N       I+++ TW N++G                        
Sbjct: 152 YQESSYPIALFEWTAHNPTDKPITISIMLTWQNTIGWFTNAIKSPVVKVRDDGSPVYEYQ 211

Query: 251 ---GDSEFTGQHYNSKTKMNDGVHAVLLHHRTSHQLPPVTFALAAQETDGVHVSLCPHFV 307
              G+SE    +YN     N  V  ++     +        A+A+     + V     + 
Sbjct: 212 SKWGESE---GNYNQWIVDNFRVGCLMRRDYENPGEGDGQMAIASITNPSLEVFYLGKWN 268

Query: 308 ISGNSLGLTAKDMWHEIKEHGSFDRLNSMETSVTSEPGSSIGAAIAASVTVPPDSEGQVT 367
            +G+       ++W     +GS   L  ++    ++ G     A+A   T+ P    ++ 
Sbjct: 269 PAGD-----GGEVWDYFAMNGS---LPDIDDETPAQKGEQTATAMAIRFTIRPGKTRKIP 320

Query: 368 FSLAWDCPEVNFMSGKTYYRRYTKFYGTHQNAAANIARDAILEHGSWELQIEAWQRPILE 427
           F LAWD P   F  G  YYRRYT F+G +   A  + R A+ +  +W   I  WQ+PILE
Sbjct: 321 FILAWDLPVTEFAQGINYYRRYTDFFGRNGQNAWAMVRTALKQGDTWRENIINWQKPILE 380

Query: 428 DKRLPEWYPITLFNELYYLNAGGAVWT 454
              L +W+ + LFNELY L  GG +WT
Sbjct: 381 RSDLADWFKMALFNELYLLTDGGTLWT 407


>gi|218247014|ref|YP_002372385.1| hypothetical protein PCC8801_2201 [Cyanothece sp. PCC 8801]
 gi|218167492|gb|ACK66229.1| protein of unknown function DUF608 [Cyanothece sp. PCC 8801]
          Length = 805

 Score =  246 bits (627), Expect = 5e-62,   Method: Compositional matrix adjust.
 Identities = 148/407 (36%), Positives = 212/407 (52%), Gaps = 30/407 (7%)

Query: 518 GTNLLQDGEEN--IGQFLYLEGIEYLMWNTYDVHFYSSFALIMLFPKIQLSIQRDFAAAV 575
           G  L     EN  +GQF  LE ++Y  + + DV  Y SF L+ML+P+++ S+   FA A+
Sbjct: 404 GGTLWTAASENDPVGQFGVLECLDYRWYESLDVRLYGSFGLMMLWPRLEKSVLEAFARAI 463

Query: 576 MMHDPSKMKLLDDGQWVSRKVLGAVPHDIGICD--PWFEVNAYCLYDTARWKDLNPKFVL 633
             HD +   +  +     RK  GA PHD+G  +  PW + N     D   WKDL   FVL
Sbjct: 464 PTHDDTPRIIGYNQAKAIRKAKGATPHDLGAPNEHPWQKTNYTSYQDCNLWKDLGSDFVL 523

Query: 634 QVYRDVVATG--DKKFAKAVWPSVYVAMAYMDQFDRDGDGMIENDGFPDQTYDTWSVSGI 691
           QVYRD + TG  D  F    WP++   + Y+  FD D DG+ EN G PDQT+D W + GI
Sbjct: 524 QVYRDYLLTGSDDTDFLWECWPAITETLDYLKTFDLDNDGIPENSGAPDQTFDDWKLQGI 583

Query: 692 SAYSGGLWVAALQAASALA----------REVGDRGS--------EDYFLFKFQKAKVVY 733
           SAY GGLW+AAL+AA  +A           E+  R +        +++  +  Q   + +
Sbjct: 584 SAYCGGLWIAALEAAIKIAEILLKNVPTTEELQSRNNPESIKHYVKNHRDWLEQSRSIYH 643

Query: 734 EKLWNGSYFNYDNSGSSQSSSIQADQLAGQWYARACGLLPIVDEDKARSALKKVYNYNVL 793
           + LWNG Y+  D+   S S  + ADQL GQ+YAR      +V+     SAL KVY    L
Sbjct: 644 DTLWNGEYYKLDS--QSGSDVVMADQLCGQFYARLLNFPDVVETQYTESALNKVYEACFL 701

Query: 794 KVMGGKRGAVNGMLPDGRV-DMSSMQSREIWSGVTYAVAASMIHEDLADIGFQTACGIYE 852
           K   GK GA NGM PDG   D +S   +E+W+G+ + +AA ++     D  F+    + +
Sbjct: 702 KFQDGKYGAANGMKPDGTPEDPNSTHPQEVWTGINFGLAAFLLQMGRKDAAFKLTEAVVK 761

Query: 853 AAWSGTGLGYAFQTPEAWNTDDQYRSLCYMRPLAIWAMQWALTRPKP 899
             +     G  F+TPEA      +R+  Y+R +AIW +   LT  +P
Sbjct: 762 QVYEN---GLQFRTPEAITAVGTFRASHYLRAMAIWGIYGILTHFRP 805



 Score =  195 bits (495), Expect = 1e-46,   Method: Compositional matrix adjust.
 Identities = 128/389 (32%), Positives = 181/389 (46%), Gaps = 41/389 (10%)

Query: 94  SHGVPLGGVGSGSIGRSYRGEFQRWQIFPRVCEDKPVLANQFSVFVSRSNGQKYSSVLCP 153
           +HG+PLGG GSG IGRS  G+F  W +       + + + QFSVF            L  
Sbjct: 40  NHGMPLGGFGSGCIGRSPSGDFNLWHLDGGEHVFRSIPSCQFSVFEQPEGEDAQVYALST 99

Query: 154 KTPEVLKDTTAAGIGSWDWNLKGDKSTYHALYPRAWTVHEGEPDPELRIVCRQISPIIPH 213
           + PE         +  W W    +K TYHALYPR+W  +EG    +L  +C Q SPIIP+
Sbjct: 100 QPPE------EGTLSRWSW-YPPEKGTYHALYPRSWYQYEGVFKSQL--ICEQFSPIIPN 150

Query: 214 NYKESSYPVSVFTYTIYNSGKTSADITLLFTWTNSVG----------------GDSEFTG 257
           NY+E+SYP+ +F +T +N       ++++ TW N VG                G  E+  
Sbjct: 151 NYQETSYPIGIFEWTAHNPTDKPITLSIMVTWQNIVGWFTNAIKSPEITVRDDGSPEYQY 210

Query: 258 QHY--NSKTKMNDGVHAVLLHHRTSHQLPPVTFALAAQETDG-------VHVSLCPHFVI 308
           Q    +S    N  V     + R    L  +      QE +G        + S+   ++ 
Sbjct: 211 QPRWGDSTGNFNQWVQD---NFRVGFILNRIQLHDEIQEGEGQICIASVTNPSVEVFYLG 267

Query: 309 SGNSLGLTAKDMWHEIKEHGSFDRLNSMETSVTSEPGSSIGAAIAASVTVPPDSEGQVTF 368
             N  G    ++W     +GS   L   E    +EPG  I  A+A   T+ P    ++ F
Sbjct: 268 KWNPKG-DGSEVWDYFAMNGS---LPDTEDETPAEPGEQIAVAMAIRFTIRPGRTKKIPF 323

Query: 369 SLAWDCPEVNFMSGKTYYRRYTKFYGTHQNAAANIARDAILEHGSWELQIEAWQRPILED 428
            LAWD P   F  G  YYRRYT FYG +      + R A+     W  ++E WQ PIL+ 
Sbjct: 324 ILAWDFPVTEFAPGVQYYRRYTDFYGRNGKNGWAMVRTALKHSDVWYERVEEWQSPILKR 383

Query: 429 KRLPEWYPITLFNELYYLNAGGAVWTDGS 457
           + LP+W+ + LFNELY L  GG +WT  S
Sbjct: 384 EDLPDWFKMALFNELYLLTDGGTLWTAAS 412


>gi|425471589|ref|ZP_18850441.1| Glycosidase-like protein [Microcystis aeruginosa PCC 9701]
 gi|389882505|emb|CCI37035.1| Glycosidase-like protein [Microcystis aeruginosa PCC 9701]
          Length = 784

 Score =  246 bits (627), Expect = 6e-62,   Method: Compositional matrix adjust.
 Identities = 139/374 (37%), Positives = 206/374 (55%), Gaps = 13/374 (3%)

Query: 529 IGQFLYLEGIEYLMWNTYDVHFYSSFALIMLFPKIQLSIQRDFAAAVMMHDPSKMKLLDD 588
           IGQF  LE I+Y  + + DV  Y SF L+ML+P+++ ++   FA A+   D +   +  +
Sbjct: 415 IGQFGVLECIDYRWYESLDVRLYGSFGLLMLWPRLEKAVMEAFARAIPSSDDTPRIIGYN 474

Query: 589 GQWVSRKVLGAVPHDIGICD--PWFEVNAYCLYDTARWKDLNPKFVLQVYRDVVATG--D 644
                RK   A PHD+G  +  PW + N     D   WKDL   FVL VYRD + TG  D
Sbjct: 475 QASAIRKAKNATPHDLGAPNEHPWQKSNYTSYQDCNLWKDLGSDFVLLVYRDYLLTGAKD 534

Query: 645 KKFAKAVWPSVYVAMAYMDQFDRDGDGMIENDGFPDQTYDTWSVSGISAYSGGLWVAALQ 704
           + F +  WPS+ + + Y+  FD D DG+ EN G PDQT+D W + GISAY GGLW+AAL+
Sbjct: 535 EDFLRECWPSIVLTLQYLKTFDLDNDGIPENSGAPDQTFDDWRLQGISAYCGGLWIAALE 594

Query: 705 AASALAREVGDRGSEDYFLFKFQKAKVVY-EKLWNGSYFNYDNSGSSQSSSIQADQLAGQ 763
           A   +   +G+    + F    Q+++ +Y + LWNG Y+N D+   S S  +  DQL GQ
Sbjct: 595 ATIKIGAILGE--DTEIFASWLQQSRAIYHQTLWNGEYYNLDS--GSGSDVVMTDQLCGQ 650

Query: 764 WYARACGLLPIVDEDKARSALKKVYNYNVLKVMGGKRGAVNGMLPDGRVDM-SSMQSREI 822
           +YAR   L  +V+    +SAL K+Y    LK  GGK GA NG+ PDG  +   +    E+
Sbjct: 651 FYARLLALPDVVENQYTQSALGKIYQACFLKFHGGKFGAANGLKPDGTPENPQATHPLEV 710

Query: 823 WSGVTYAVAASMIHEDLADIGFQTACGIYEAAWSGTGLGYAFQTPEAWNTDDQYRSLCYM 882
           W G+ + + A ++   +   G+Q    + +  +     G  F+TPEA  +   +R+  Y+
Sbjct: 711 WVGINFGLVAFLLQMGMEKQGWQITEAVVKQVYEN---GLQFRTPEAITSVGTFRACHYL 767

Query: 883 RPLAIWAMQWALTR 896
           R +AIWA+   LT 
Sbjct: 768 RAMAIWAIYGILTE 781



 Score =  185 bits (470), Expect = 9e-44,   Method: Compositional matrix adjust.
 Identities = 120/383 (31%), Positives = 179/383 (46%), Gaps = 39/383 (10%)

Query: 95  HGVPLGGVGSGSIGRSYRGEFQRWQIFPRVCEDKPVLANQFSVFVSRSNGQKYSSVLCPK 154
           HG+PLGG G+G IGRS +G+F  W +         + A QFS+F    +G   +  +  +
Sbjct: 41  HGMPLGGFGAGCIGRSPKGDFNLWHLDGGEHSFNSLAACQFSIFEQTEDGSAQAYAMATE 100

Query: 155 TPEVLKDTTAAGIGSWDWNLKGDKSTYHALYPRAWTVHEGEPDPELRIVCRQISPIIPHN 214
            P      T   +  W W    +K TY ALYPR+W  +E +   + +++C Q SP+  HN
Sbjct: 101 RP------TDGTLSRWSW-YPAEKGTYSALYPRSW--YEYKDVFQTQLICEQFSPVWAHN 151

Query: 215 YKESSYPVSVFTYTIYNSGKTSADITLLFTWTNSVGGDSEFTGQHYNSKTKMNDGVHAVL 274
           Y+ESSYP+++F +T +N       I+++ TW N++G    FT    +   K+ D    V 
Sbjct: 152 YQESSYPIALFEWTAHNPTDKPITISIMLTWQNTIGW---FTNAIKSPVVKVRDDGSPVY 208

Query: 275 LHHR-------TSHQLPPVTFALAA---------QETDG-------VHVSLCPHFVISGN 311
            +           +Q     F +            E DG        + SL   ++   N
Sbjct: 209 EYQSKWGESEGNYNQWIVDNFRVGCLMRRDYENPGEGDGQMAIATITNPSLEVFYLGKWN 268

Query: 312 SLGLTAKDMWHEIKEHGSFDRLNSMETSVTSEPGSSIGAAIAASVTVPPDSEGQVTFSLA 371
             G    ++W     +GS   L  +E    +E G     A+A   T+ P    ++ F LA
Sbjct: 269 PAG-DGGEVWDYFAMNGS---LPDIEDETPAEKGEQTATAMAIRFTIRPGKTRKIPFILA 324

Query: 372 WDCPEVNFMSGKTYYRRYTKFYGTHQNAAANIARDAILEHGSWELQIEAWQRPILEDKRL 431
           WD P   F  G  YYRRYT F+G +   A  + R A+    +W   I  WQ+PILE   L
Sbjct: 325 WDLPVTEFAQGINYYRRYTDFFGRNGQNAWAMVRTALKHGDTWRENIINWQKPILERSDL 384

Query: 432 PEWYPITLFNELYYLNAGGAVWT 454
            +W+ + LFNELY L  GG +WT
Sbjct: 385 ADWFKMALFNELYLLTDGGTLWT 407


>gi|425464233|ref|ZP_18843555.1| Glycosidase-like protein [Microcystis aeruginosa PCC 9809]
 gi|389833806|emb|CCI21373.1| Glycosidase-like protein [Microcystis aeruginosa PCC 9809]
          Length = 784

 Score =  245 bits (626), Expect = 8e-62,   Method: Compositional matrix adjust.
 Identities = 140/379 (36%), Positives = 209/379 (55%), Gaps = 17/379 (4%)

Query: 526 EENIGQFLYLEGIEYLMWNTYDVHFYSSFALIMLFPKIQLSIQRDFAAAVMMHDPSKMKL 585
           ++ IGQF  LE I+Y  + + DV  Y SF L+ML+P+++ ++   FA A+   D +   +
Sbjct: 412 QDPIGQFGVLECIDYRWYESLDVRLYGSFGLLMLWPRLEKAVMEAFARAIPSSDDTPRII 471

Query: 586 LDDGQWVSRKVLGAVPHDIGICD--PWFEVNAYCLYDTARWKDLNPKFVLQVYRDVVATG 643
             +     RK   A PHD+G  +  PW + N     D   WKDL   FVL VYRD + TG
Sbjct: 472 GYNQASAIRKAKNATPHDLGAPNEHPWQKSNYTSYQDCNLWKDLGSDFVLLVYRDYLLTG 531

Query: 644 --DKKFAKAVWPSVYVAMAYMDQFDRDGDGMIENDGFPDQTYDTWSVSGISAYSGGLWVA 701
             D+ F +  WPS+ + + Y+  FD D DG+ EN G PDQT+D W + GISAY GGLW+A
Sbjct: 532 AKDEDFLRECWPSIVLTLQYLKTFDLDNDGIPENSGAPDQTFDDWRLQGISAYCGGLWIA 591

Query: 702 ALQAASALAREVGDRGSEDYFLFK--FQKAKVVY-EKLWNGSYFNYDNSGSSQSSSIQAD 758
           AL+A   +   +G    ED  +F    Q+++ +Y + LWNG Y+N D+   S S  +  D
Sbjct: 592 ALEATIKIGAILG----EDTAIFASWLQQSRTIYHQTLWNGEYYNLDS--GSGSDVVMTD 645

Query: 759 QLAGQWYARACGLLPIVDEDKARSALKKVYNYNVLKVMGGKRGAVNGMLPDGRVDM-SSM 817
           QL GQ+Y+R   L  +V+    +SAL K+Y    LK  GGK GA NG+ PDG  +   + 
Sbjct: 646 QLCGQFYSRLLALPDVVENQYTQSALSKIYQACFLKFHGGKFGAANGLKPDGTPENPQAT 705

Query: 818 QSREIWSGVTYAVAASMIHEDLADIGFQTACGIYEAAWSGTGLGYAFQTPEAWNTDDQYR 877
              E+W G+ + + A ++   +   G+Q    + +  +     G  F+TPEA  +   +R
Sbjct: 706 HPLEVWVGINFGLVAFLMQMGMEKQGWQITEAVVKQVYEN---GLQFRTPEAITSVGTFR 762

Query: 878 SLCYMRPLAIWAMQWALTR 896
           +  Y+R +AIWA+   LT 
Sbjct: 763 ACHYLRAMAIWAIYGVLTE 781



 Score =  185 bits (469), Expect = 1e-43,   Method: Compositional matrix adjust.
 Identities = 120/383 (31%), Positives = 179/383 (46%), Gaps = 39/383 (10%)

Query: 95  HGVPLGGVGSGSIGRSYRGEFQRWQIFPRVCEDKPVLANQFSVFVSRSNGQKYSSVLCPK 154
           HG+PLGG G+G IGRS +G+F  W +         + A QFS+F    +G   +  +  +
Sbjct: 41  HGMPLGGFGAGCIGRSPKGDFNLWHLDGGEHSFNSLAACQFSIFEQTEDGSAQAYAMATE 100

Query: 155 TPEVLKDTTAAGIGSWDWNLKGDKSTYHALYPRAWTVHEGEPDPELRIVCRQISPIIPHN 214
           +P      T   +  W W    +K TY ALYPR+W  +E +   + +++C Q SP+  HN
Sbjct: 101 SP------TDGTLSRWSW-YPAEKGTYSALYPRSW--YEYKDVFQTQLICEQFSPVWAHN 151

Query: 215 YKESSYPVSVFTYTIYNSGKTSADITLLFTWTNSVGGDSEFTGQHYNSKTKMNDGVHAVL 274
           Y+ESSYP+++F +T +N       I+++ TW N++G    FT    +   K+ D    V 
Sbjct: 152 YQESSYPIALFEWTAHNPTDKPITISIMLTWQNTIGW---FTNAIKSPVVKVRDDGSPVY 208

Query: 275 LHHR-------TSHQLPPVTFALAA---------QETDG-------VHVSLCPHFVISGN 311
            +           +Q     F +            E DG        + SL   ++   N
Sbjct: 209 EYQSKWGESEGNYNQWIVDNFRVGCLMGRDYENPGEGDGQMAIASITNPSLEVFYLGKWN 268

Query: 312 SLGLTAKDMWHEIKEHGSFDRLNSMETSVTSEPGSSIGAAIAASVTVPPDSEGQVTFSLA 371
             G    ++W     +GS   L  +E    +E G     A+A   T+ P    ++ F LA
Sbjct: 269 PAG-DGGEVWDYFAMNGS---LPDIEDETPAEKGEQTATAMAIRFTIRPGKTRKIPFILA 324

Query: 372 WDCPEVNFMSGKTYYRRYTKFYGTHQNAAANIARDAILEHGSWELQIEAWQRPILEDKRL 431
           WD P   F  G  YYRRYT F+G +   A  + R A+    +W   I  WQ+PILE   L
Sbjct: 325 WDLPVTEFAQGINYYRRYTDFFGRNGQNAWAMVRTALKHGDTWRENIINWQKPILERTDL 384

Query: 432 PEWYPITLFNELYYLNAGGAVWT 454
             W+ + LFNELY L  GG +WT
Sbjct: 385 AAWFKMALFNELYLLTDGGTLWT 407


>gi|425455967|ref|ZP_18835678.1| Similar to tr|P73619|P73619 [Microcystis aeruginosa PCC 9807]
 gi|389803038|emb|CCI17984.1| Similar to tr|P73619|P73619 [Microcystis aeruginosa PCC 9807]
          Length = 784

 Score =  245 bits (625), Expect = 1e-61,   Method: Compositional matrix adjust.
 Identities = 141/379 (37%), Positives = 208/379 (54%), Gaps = 17/379 (4%)

Query: 526 EENIGQFLYLEGIEYLMWNTYDVHFYSSFALIMLFPKIQLSIQRDFAAAVMMHDPSKMKL 585
           ++ IGQF  LE I+Y  + + DV  Y SF L+ML+P+++ ++   FA A+   D +   +
Sbjct: 412 QDPIGQFGVLECIDYRWYESLDVRLYGSFGLLMLWPRLEKAVMEAFARAIPSSDDTPRII 471

Query: 586 LDDGQWVSRKVLGAVPHDIGICD--PWFEVNAYCLYDTARWKDLNPKFVLQVYRDVVATG 643
             +     RK   A PHD+G  +  PW + N     D   WKDL   FVL VYRD + TG
Sbjct: 472 GYNQASAIRKAKNATPHDLGAPNEHPWQKSNYTSYQDCNLWKDLGSDFVLLVYRDYLLTG 531

Query: 644 --DKKFAKAVWPSVYVAMAYMDQFDRDGDGMIENDGFPDQTYDTWSVSGISAYSGGLWVA 701
             D+ F +  WPS+ + + Y+  FD D DG+ EN G PDQT+D W + GISAY GGLW+A
Sbjct: 532 AKDEDFLRECWPSIVLTLQYLKTFDLDNDGIPENSGAPDQTFDDWRLQGISAYCGGLWIA 591

Query: 702 ALQAASALAREVGDRGSEDYFLFK--FQKAKVVY-EKLWNGSYFNYDNSGSSQSSSIQAD 758
           AL+A   +   +G    ED  +F    Q+++ +Y + LWNG Y+N D+   S S  +  D
Sbjct: 592 ALEATLKIGAILG----EDTAIFASWLQQSRAIYHDTLWNGEYYNLDS--GSGSDVVMTD 645

Query: 759 QLAGQWYARACGLLPIVDEDKARSALKKVYNYNVLKVMGGKRGAVNGMLPDGRVDM-SSM 817
           QL GQ+YAR   L  +V+    +SAL K+Y    LK  GGK GA NG+ PDG  +   + 
Sbjct: 646 QLCGQFYARLLSLPDVVENQYTQSALSKIYEACFLKFHGGKFGAANGLKPDGTPENPEAT 705

Query: 818 QSREIWSGVTYAVAASMIHEDLADIGFQTACGIYEAAWSGTGLGYAFQTPEAWNTDDQYR 877
              E+W G+ + + A ++   +   G+Q    +    +     G  F+TPEA  +   +R
Sbjct: 706 HPLEVWVGINFGLVAFLLQMGMEKQGWQITEAVVRQIYEN---GLQFRTPEAITSVGTFR 762

Query: 878 SLCYMRPLAIWAMQWALTR 896
           +  Y+R +AIWA+   LT 
Sbjct: 763 ACHYLRAMAIWAIYGVLTE 781



 Score =  187 bits (475), Expect = 3e-44,   Method: Compositional matrix adjust.
 Identities = 118/387 (30%), Positives = 183/387 (47%), Gaps = 47/387 (12%)

Query: 95  HGVPLGGVGSGSIGRSYRGEFQRWQIFPRVCEDKPVLANQFSVFVSRSNGQKYSSVLCPK 154
           HG+PLGG G+G IGRS +G+F  W +         + A QFS+F    +G   +  +  +
Sbjct: 41  HGMPLGGFGAGCIGRSPKGDFNLWHLDGGEHSFNSLAACQFSIFEQTEDGSAQAYAMATE 100

Query: 155 TPEVLKDTTAAGIGSWDWNLKGDKSTYHALYPRAWTVHEGEPDPELRIVCRQISPIIPHN 214
           +P   KD T   +  W W    +K TY ALYPR+W  +E +   + +++C Q SP+  HN
Sbjct: 101 SP---KDGT---LSRWSW-YPAEKGTYSALYPRSW--YEYKDVFQTQLICEQFSPVWAHN 151

Query: 215 YKESSYPVSVFTYTIYNSGKTSADITLLFTWTNSVG------------------------ 250
           Y+ESSYP+++F +T++N       I+++ TW N++G                        
Sbjct: 152 YQESSYPIALFEWTVHNPTDKPITISIMLTWQNTIGWFTNAIKSPVVKVRDDGSPVYEYQ 211

Query: 251 ---GDSEFTGQHYNSKTKMNDGVHAVLLHHRTSHQLPPVTFALAAQETDGVHVSLCPHFV 307
              G+SE    +YN     N  V  ++     +        A+A+     + V     + 
Sbjct: 212 SKWGESE---GNYNQWIVDNFRVGCLMRRDYENPGEGDGQMAIASITNPSLEVFYLGKWN 268

Query: 308 ISGNSLGLTAKDMWHEIKEHGSFDRLNSMETSVTSEPGSSIGAAIAASVTVPPDSEGQVT 367
            +G+       ++W     +GS   L  +E    ++ G     A+A   T+ P    ++ 
Sbjct: 269 PAGD-----GSEVWDYFAMNGS---LPDIEDETPAQKGEQTATAMAIRFTIRPGKTRKIP 320

Query: 368 FSLAWDCPEVNFMSGKTYYRRYTKFYGTHQNAAANIARDAILEHGSWELQIEAWQRPILE 427
           F LAWD P   F  G  YYRRYT F+G +   A  + R A+    +W   I  WQ+PILE
Sbjct: 321 FILAWDLPVTEFAQGINYYRRYTDFFGRNGQNAWAMVRTALKHGDTWRENIINWQKPILE 380

Query: 428 DKRLPEWYPITLFNELYYLNAGGAVWT 454
              L +W+ + LFNELY L  GG +WT
Sbjct: 381 RTDLADWFKMALFNELYLLADGGTLWT 407


>gi|440753145|ref|ZP_20932348.1| hypothetical protein O53_1523 [Microcystis aeruginosa TAIHU98]
 gi|440177638|gb|ELP56911.1| hypothetical protein O53_1523 [Microcystis aeruginosa TAIHU98]
          Length = 789

 Score =  245 bits (625), Expect = 1e-61,   Method: Compositional matrix adjust.
 Identities = 139/373 (37%), Positives = 207/373 (55%), Gaps = 17/373 (4%)

Query: 526 EENIGQFLYLEGIEYLMWNTYDVHFYSSFALIMLFPKIQLSIQRDFAAAVMMHDPSKMKL 585
           ++ IGQF  LE I+Y  + + DV  Y SF L+ML+P+++ ++   FA A+   D +   +
Sbjct: 412 QDAIGQFGVLECIDYRWYESLDVRLYGSFGLLMLWPRLEKAVMEAFARAIPSSDDTPRII 471

Query: 586 LDDGQWVSRKVLGAVPHDIGICD--PWFEVNAYCLYDTARWKDLNPKFVLQVYRDVVATG 643
             +     RK   A PHD+G  +  PW + N     D   WKDL   FVL VYRD + TG
Sbjct: 472 GYNQASAIRKAKNATPHDLGAPNEHPWQKSNYTSYQDCNLWKDLGSDFVLLVYRDYLLTG 531

Query: 644 --DKKFAKAVWPSVYVAMAYMDQFDRDGDGMIENDGFPDQTYDTWSVSGISAYSGGLWVA 701
             D+ F +  WPS+ + + Y+  FD D DG+ EN G PDQT+D W + GISAY GGLW+A
Sbjct: 532 AKDEDFLRECWPSIVLTLQYLKTFDLDNDGIPENSGAPDQTFDDWRLQGISAYCGGLWIA 591

Query: 702 ALQAASALAREVGDRGSEDYFLFK--FQKAKVVY-EKLWNGSYFNYDNSGSSQSSSIQAD 758
           AL+A   +   +G    ED  +F    Q+++ +Y + LWNG Y+N D+   S S  +  D
Sbjct: 592 ALEATLKIGAILG----EDTAIFASWLQQSRAIYHQTLWNGEYYNLDS--GSGSDVVMTD 645

Query: 759 QLAGQWYARACGLLPIVDEDKARSALKKVYNYNVLKVMGGKRGAVNGMLPDGRVDM-SSM 817
           QL GQ+YAR   L  +V+    +SAL K+Y    LK  GGK GA NG+ PDG  +   + 
Sbjct: 646 QLCGQFYARLLALPDVVENQYTQSALSKIYQACFLKFHGGKFGAANGLKPDGTPENPEAT 705

Query: 818 QSREIWSGVTYAVAASMIHEDLADIGFQTACGIYEAAWSGTGLGYAFQTPEAWNTDDQYR 877
              E+W G+ + + A ++   +   G+Q    + +  +     G  F+TPEA  +   +R
Sbjct: 706 HPLEVWVGINFGLVAFLLQMGMEKQGWQITEAVVKQVYEN---GLQFRTPEAITSVGTFR 762

Query: 878 SLCYMRPLAIWAM 890
           +  Y+R +AIWA+
Sbjct: 763 ACHYLRAMAIWAI 775



 Score =  185 bits (469), Expect = 1e-43,   Method: Compositional matrix adjust.
 Identities = 116/387 (29%), Positives = 179/387 (46%), Gaps = 47/387 (12%)

Query: 95  HGVPLGGVGSGSIGRSYRGEFQRWQIFPRVCEDKPVLANQFSVFVSRSNGQKYSSVLCPK 154
           HG+PLGG G+G IGRS +G+F  W +         + A QFS+F    +G   +  +  +
Sbjct: 41  HGMPLGGFGAGCIGRSPKGDFNLWHLDGGEHSFNSLAACQFSIFEQTEDGNAQAYAMATE 100

Query: 155 TPEVLKDTTAAGIGSWDWNLKGDKSTYHALYPRAWTVHEGEPDPELRIVCRQISPIIPHN 214
           +P      T   +  W W    +K  Y ALYPR+W  +E +   + +++C Q SP+  HN
Sbjct: 101 SP------TDGTLSRWSW-YPAEKGNYSALYPRSW--YEYKDVFQTQLICEQFSPVWAHN 151

Query: 215 YKESSYPVSVFTYTIYNSGKTSADITLLFTWTNSVG------------------------ 250
           Y+ESSYP+++F +T +N       I+++ TW N++G                        
Sbjct: 152 YQESSYPIALFEWTAHNPTDKPITISIMLTWQNTIGWFTNAIKSPVVKVRDDGSPVYEYQ 211

Query: 251 ---GDSEFTGQHYNSKTKMNDGVHAVLLHHRTSHQLPPVTFALAAQETDGVHVSLCPHFV 307
              G+SE    +YN     N  V  ++     +        A+A+     + V     + 
Sbjct: 212 SKWGESE---GNYNQWIVDNFRVGCLMRRDYENPGEGDGQMAIASITNPSLEVFYLGKWN 268

Query: 308 ISGNSLGLTAKDMWHEIKEHGSFDRLNSMETSVTSEPGSSIGAAIAASVTVPPDSEGQVT 367
            +G+       ++W     +GS   L  +E    +E G     A+A   T+ P    ++ 
Sbjct: 269 PAGD-----GGEVWDYFAMNGS---LPDIEDETPAEKGEQTATAMAIRFTIRPGKTRKIP 320

Query: 368 FSLAWDCPEVNFMSGKTYYRRYTKFYGTHQNAAANIARDAILEHGSWELQIEAWQRPILE 427
           F LAWD P   F  G  YYRRYT F+G +   A  + R A+    +W   I  WQ+PILE
Sbjct: 321 FILAWDLPVTEFAQGINYYRRYTDFFGRNGQNAWAMVRTALKHGDTWRENIINWQKPILE 380

Query: 428 DKRLPEWYPITLFNELYYLNAGGAVWT 454
              L +W+ + LFNELY L  GG +WT
Sbjct: 381 RSDLADWFKMALFNELYLLTDGGTLWT 407


>gi|166363582|ref|YP_001655855.1| glycosidase-like protein [Microcystis aeruginosa NIES-843]
 gi|166085955|dbj|BAG00663.1| glycosidase-like protein [Microcystis aeruginosa NIES-843]
          Length = 784

 Score =  244 bits (624), Expect = 1e-61,   Method: Compositional matrix adjust.
 Identities = 138/374 (36%), Positives = 206/374 (55%), Gaps = 13/374 (3%)

Query: 529 IGQFLYLEGIEYLMWNTYDVHFYSSFALIMLFPKIQLSIQRDFAAAVMMHDPSKMKLLDD 588
           IGQF  LE I+Y  + + DV  Y SF L+ML+P+++ ++   FA A+   D +   +  +
Sbjct: 415 IGQFGVLECIDYRWYESLDVRLYGSFGLLMLWPRLEKAVMEAFARAIPSSDDTPRIIGYN 474

Query: 589 GQWVSRKVLGAVPHDIGICD--PWFEVNAYCLYDTARWKDLNPKFVLQVYRDVVATG--D 644
                RK   A PHD+G  +  PW + N     D   WKDL   FVL VYRD + TG  D
Sbjct: 475 QASAIRKTKNATPHDLGAPNEHPWQKSNYTSYQDCNLWKDLGSDFVLLVYRDYLLTGAKD 534

Query: 645 KKFAKAVWPSVYVAMAYMDQFDRDGDGMIENDGFPDQTYDTWSVSGISAYSGGLWVAALQ 704
           + F +  WPS+ + + Y+  FD D DG+ EN G PDQT+D W + GISAY GGLW+AAL+
Sbjct: 535 EDFLRECWPSIVLTLQYLKTFDLDKDGIPENSGAPDQTFDDWRLQGISAYCGGLWIAALE 594

Query: 705 AASALAREVGDRGSEDYFLFKFQKAKVVY-EKLWNGSYFNYDNSGSSQSSSIQADQLAGQ 763
           A   +   +G+    + F    Q+++ +Y + LWNG Y+N D+   S S  +  DQL GQ
Sbjct: 595 ATLKIGAILGE--DTEIFASWLQQSRAIYHQTLWNGEYYNLDS--GSGSDVVMTDQLCGQ 650

Query: 764 WYARACGLLPIVDEDKARSALKKVYNYNVLKVMGGKRGAVNGMLPDGRVDM-SSMQSREI 822
           +Y+R   L  +V+    +SAL K+Y    LK  GGK GA NG+ PDG  +   +    E+
Sbjct: 651 FYSRLLALPDVVENQYTQSALSKIYQACFLKFHGGKFGAANGLKPDGTPENPQATHPLEV 710

Query: 823 WSGVTYAVAASMIHEDLADIGFQTACGIYEAAWSGTGLGYAFQTPEAWNTDDQYRSLCYM 882
           W G+ + + A ++   +   G+Q    + +  +     G  F+TPEA  +   +R+  Y+
Sbjct: 711 WVGINFGLVAFLMQMGMEKQGWQITEAVVKQVYEN---GLQFRTPEAITSVGTFRACHYL 767

Query: 883 RPLAIWAMQWALTR 896
           R +AIWA+   LT 
Sbjct: 768 RAMAIWAIYGVLTE 781



 Score =  184 bits (466), Expect = 3e-43,   Method: Compositional matrix adjust.
 Identities = 119/381 (31%), Positives = 180/381 (47%), Gaps = 35/381 (9%)

Query: 95  HGVPLGGVGSGSIGRSYRGEFQRWQIFPRVCEDKPVLANQFSVFVSRSNGQKYSSVLCPK 154
           HG+PLGG G+G IGRS +G+F  W +         + A QFS+F    +G   +  +  +
Sbjct: 41  HGMPLGGFGAGCIGRSPKGDFNLWHLDGGEHSFNSLAACQFSIFEQTEDGSAQAYAMATE 100

Query: 155 TPEVLKDTTAAGIGSWDWNLKGDKSTYHALYPRAWTVHEGEPDPELRIVCRQISPIIPHN 214
           +P      T   +  W W    +K TY ALYPR+W  +E +   + +++C Q SP+  HN
Sbjct: 101 SP------TDGTLSRWSW-YPAEKGTYSALYPRSW--YEYKDVFQTQLICEQFSPVWAHN 151

Query: 215 YKESSYPVSVFTYTIYNSGKTSADITLLFTWTNSVG-------------GDSEFTGQHYN 261
           Y+ESSYP+++F +T +N       I+++ TW N++G              D       Y 
Sbjct: 152 YQESSYPIALFEWTAHNPTDKPITISIMLTWQNTIGWFTNGIKSPVVKVRDDGSPVYEYQ 211

Query: 262 SKTKMNDGVH--------AVLLHHRTSHQLPPVTFALAAQETDGVHVSLCPHFVISGNSL 313
           SK   ++G +         V    R  ++ P       A  T   + SL   ++   N  
Sbjct: 212 SKWGESEGNYNQWIVDNFRVGCLMRRDYENPGEGDGQMAIATI-TNPSLEVFYLGKWNPA 270

Query: 314 GLTAKDMWHEIKEHGSFDRLNSMETSVTSEPGSSIGAAIAASVTVPPDSEGQVTFSLAWD 373
           G    ++W     +GS   L  +E    ++ G     A+A   T+ P    ++ F LAWD
Sbjct: 271 G-DGGEVWDYFAMNGS---LPDIEDETPAQKGEQTATAMAIRFTIRPGKTRKIPFILAWD 326

Query: 374 CPEVNFMSGKTYYRRYTKFYGTHQNAAANIARDAILEHGSWELQIEAWQRPILEDKRLPE 433
            P   F  G  YYRRYT F+G +   A  + R A+    +W   I  WQ+PILE   L +
Sbjct: 327 LPVNEFAQGINYYRRYTDFFGRNGQNAWAMVRTALKHGDTWRENIINWQKPILERSDLAD 386

Query: 434 WYPITLFNELYYLNAGGAVWT 454
           W+ + LFNELY L  GG +WT
Sbjct: 387 WFKMALFNELYLLTDGGTLWT 407


>gi|425460273|ref|ZP_18839755.1| Similar to tr|P73619|P73619 [Microcystis aeruginosa PCC 9808]
 gi|389827051|emb|CCI21967.1| Similar to tr|P73619|P73619 [Microcystis aeruginosa PCC 9808]
          Length = 784

 Score =  242 bits (618), Expect = 6e-61,   Method: Compositional matrix adjust.
 Identities = 140/379 (36%), Positives = 207/379 (54%), Gaps = 17/379 (4%)

Query: 526 EENIGQFLYLEGIEYLMWNTYDVHFYSSFALIMLFPKIQLSIQRDFAAAVMMHDPSKMKL 585
           ++ IGQF  LE I+Y  + + DV  Y SF L+ML+P+++ ++   FA A+   D +   +
Sbjct: 412 QDPIGQFGVLECIDYRWYESLDVRLYGSFGLLMLWPRLEKAVMEAFARAIPSSDDTTRII 471

Query: 586 LDDGQWVSRKVLGAVPHDIGICD--PWFEVNAYCLYDTARWKDLNPKFVLQVYRDVVATG 643
             +     RK   A PHD+G  +  PW + N     D   WKDL   FVL VYRD + TG
Sbjct: 472 GYNQASAIRKAKNATPHDLGAPNEHPWQKSNYTSYQDCNLWKDLGSDFVLLVYRDYLLTG 531

Query: 644 --DKKFAKAVWPSVYVAMAYMDQFDRDGDGMIENDGFPDQTYDTWSVSGISAYSGGLWVA 701
             D+ F +  W S+ + + Y+  FD D DG+ EN G PDQT+D W + GISAY GGLW+A
Sbjct: 532 AKDEDFLRECWSSIVLTLQYLKTFDLDNDGIPENSGAPDQTFDDWRLQGISAYCGGLWIA 591

Query: 702 ALQAASALAREVGDRGSEDYFLFK--FQKAKVVY-EKLWNGSYFNYDNSGSSQSSSIQAD 758
           AL+A   +   +G    ED  +F    Q+++ +Y + LWNG Y+N D+   S S  +  D
Sbjct: 592 ALEATLKIGAILG----EDTAIFASWLQQSRAIYHQTLWNGEYYNLDS--GSGSDVVMTD 645

Query: 759 QLAGQWYARACGLLPIVDEDKARSALKKVYNYNVLKVMGGKRGAVNGMLPDGRVDM-SSM 817
           QL GQ+YAR   L  +V+    +SAL K+Y    LK  GGK GA NG+ PDG  +   + 
Sbjct: 646 QLCGQFYARLLALPDVVENQYTQSALSKIYEACFLKFHGGKFGAANGLKPDGTPENPEAT 705

Query: 818 QSREIWSGVTYAVAASMIHEDLADIGFQTACGIYEAAWSGTGLGYAFQTPEAWNTDDQYR 877
              E+W G+ + + A ++   +   G+Q    +    +     G  F+TPEA  +   +R
Sbjct: 706 HPLEVWVGINFGLVAFLLQMGMEKQGWQITEAVVRQIYEN---GLQFRTPEAITSVGTFR 762

Query: 878 SLCYMRPLAIWAMQWALTR 896
           +  Y+R +AIWA+   LT 
Sbjct: 763 ACHYLRAMAIWAIYGVLTE 781



 Score =  188 bits (477), Expect = 1e-44,   Method: Compositional matrix adjust.
 Identities = 118/389 (30%), Positives = 182/389 (46%), Gaps = 47/389 (12%)

Query: 93  SSHGVPLGGVGSGSIGRSYRGEFQRWQIFPRVCEDKPVLANQFSVFVSRSNGQKYSSVLC 152
           S HG+PLGG G+G IGRS +G+F  W +         + A QFS+F    +G   +  + 
Sbjct: 39  SWHGMPLGGFGAGCIGRSPKGDFNLWHLDGGEHSFNSLAACQFSIFEQTEDGNAQAYAMA 98

Query: 153 PKTPEVLKDTTAAGIGSWDWNLKGDKSTYHALYPRAWTVHEGEPDPELRIVCRQISPIIP 212
            ++P      T   +  W W    +K TY ALYPR+W  +E +   + +++C Q SP+  
Sbjct: 99  TESP------TDGTLSRWSW-YPAEKGTYSALYPRSW--YEYKDVFQTQLICEQFSPVWA 149

Query: 213 HNYKESSYPVSVFTYTIYNSGKTSADITLLFTWTNSVG---------------------- 250
           HNY+ESSYP+++F +T +N       I+++ TW N++G                      
Sbjct: 150 HNYQESSYPIALFEWTAHNPTDKPITISIMLTWQNTIGWFTNAIKSPVVKVRDDGSPVYE 209

Query: 251 -----GDSEFTGQHYNSKTKMNDGVHAVLLHHRTSHQLPPVTFALAAQETDGVHVSLCPH 305
                G+SE    +YN     N  V  ++     +        A+A+     + V     
Sbjct: 210 YQSKWGESE---GNYNQWIVDNFRVGCLMRRDYENPGEGDGQMAIASITNPSLEVFYLGK 266

Query: 306 FVISGNSLGLTAKDMWHEIKEHGSFDRLNSMETSVTSEPGSSIGAAIAASVTVPPDSEGQ 365
           +  +G+       ++W     +GS   L  +E    +E G     A+A   T+ P    +
Sbjct: 267 WNPAGD-----GGEVWDYFAMNGS---LPDIEDETPAEKGEQTATAMAIRFTIRPGKTRK 318

Query: 366 VTFSLAWDCPEVNFMSGKTYYRRYTKFYGTHQNAAANIARDAILEHGSWELQIEAWQRPI 425
           + F LAWD P   F  G  YYRRYT F+G +   A  + R A+ +  +W   I  WQ+PI
Sbjct: 319 IPFILAWDLPVTEFAQGINYYRRYTDFFGRNGQNAWAMVRTALKQGDTWRENIINWQKPI 378

Query: 426 LEDKRLPEWYPITLFNELYYLNAGGAVWT 454
           LE   L +W+ + LFNELY L  GG +WT
Sbjct: 379 LERSDLADWFKMALFNELYLLTDGGTLWT 407


>gi|416392732|ref|ZP_11685894.1| hypothetical protein CWATWH0003_2700 [Crocosphaera watsonii WH
           0003]
 gi|357263592|gb|EHJ12579.1| hypothetical protein CWATWH0003_2700 [Crocosphaera watsonii WH
           0003]
          Length = 804

 Score =  242 bits (618), Expect = 6e-61,   Method: Compositional matrix adjust.
 Identities = 142/390 (36%), Positives = 208/390 (53%), Gaps = 28/390 (7%)

Query: 529 IGQFLYLEGIEYLMWNTYDVHFYSSFALIMLFPKIQLSIQRDFAAAVMMHDPSKMKLLDD 588
           +GQF  LE ++Y  + + DV  Y SF L+ML+P+++ ++   FA A+   D +   +  +
Sbjct: 417 VGQFGVLECLDYCWYESLDVRLYGSFGLLMLWPRLEKAVMEAFARAIPSSDDTIRIIGYN 476

Query: 589 GQWVSRKVLGAVPHDIGICD--PWFEVNAYCLYDTARWKDLNPKFVLQVYRDVVATG--D 644
                RKV GA PHD+G  +  PW + N     D   WKDL   FVLQVYRD + TG  D
Sbjct: 477 QAEGFRKVKGATPHDLGAPNEHPWEKTNYTSYQDCNLWKDLGSDFVLQVYRDFLLTGSDD 536

Query: 645 KKFAKAVWPSVYVAMAYMDQFDRDGDGMIENDGFPDQTYDTWSVSGISAYSGGLWVAALQ 704
             F    W ++   + Y+ +FD D DG+ EN G PDQT+D W +SGISAY GGLW+AAL+
Sbjct: 537 IDFLWECWAAIPETLNYLKEFDLDNDGIPENSGAPDQTFDDWKLSGISAYCGGLWIAALE 596

Query: 705 AASALAR------------------EVGDRGSEDYFLFKFQKAKVVYEKLWNGSYFNYDN 746
           AA  +                    E  ++  + Y  +  Q   + +  LWNG Y+N D+
Sbjct: 597 AAIKIGEILLENSPNNPQLEPENYPESIEKELDKYNNWLQQSRAIYHSTLWNGEYYNLDS 656

Query: 747 SGSSQSSSIQADQLAGQWYARACGLLPIVDEDKARSALKKVYNYNVLKVMGGKRGAVNGM 806
              S S  + ADQL GQ+YAR  GL  +V+ +   SAL KVY    LK   GK GA NGM
Sbjct: 657 --KSGSDIVMADQLCGQFYARLLGLSDVVESEYTLSALNKVYEACFLKFQNGKYGAANGM 714

Query: 807 LPDGRV-DMSSMQSREIWSGVTYAVAASMIHEDLADIGFQTACGIYEAAWSGTGLGYAFQ 865
            PDG   D +S   +E+W+G+ + +A+ +I   + +   +    + +  +     G  F+
Sbjct: 715 KPDGTPEDPNSTHPQEVWTGINFGIASFLIQMGMKNEALKLTETVVKQVYEN---GLQFR 771

Query: 866 TPEAWNTDDQYRSLCYMRPLAIWAMQWALT 895
           TPEA      YR+  Y+R +AIW + + L+
Sbjct: 772 TPEAITAVGTYRACHYLRAMAIWGVYYQLS 801



 Score =  193 bits (490), Expect = 4e-46,   Method: Compositional matrix adjust.
 Identities = 130/389 (33%), Positives = 179/389 (46%), Gaps = 43/389 (11%)

Query: 95  HGVPLGGVGSGSIGRSYRGEFQRWQIFPRVCEDKPVLANQFSVFVSRSNGQKYSSVLCPK 154
           HG+PLGG G+G IGRS RG+F  W +       K + + QFS+F    +G   +  L  +
Sbjct: 41  HGMPLGGFGAGCIGRSPRGDFNLWHLDGGEHIFKSIPSCQFSIFEQLEDGTAQAYALATE 100

Query: 155 TPEVLKDTTAAGIGSWDWNLKGDKSTYHALYPRAWTVHEGEPDPELRIVCRQISPIIPHN 214
            PE   D T   +  W W    +K TY ALYP +W  +EG    E  I C Q SPIIP N
Sbjct: 101 PPE---DKT---LSRWSW-YPTEKGTYSALYPCSWYQYEGVFQAE--IYCEQFSPIIPDN 151

Query: 215 YKESSYPVSVFTYTIYNSGKTSADITLLFTWTNSVG----------------GDSEFTGQ 258
           Y+E+SYP+ VF +T+ N       ++++ TW N VG                G  E+  Q
Sbjct: 152 YQETSYPIGVFEWTVKNPTDKPITLSIMLTWQNIVGWFTNAIKSPEITVRDDGSPEYEYQ 211

Query: 259 --------HYNSKTKMNDGVHAVL--LHHRTSHQLPPVTFALAAQETDGVHVSLCPHFVI 308
                   ++N   + N  V  +L  L      Q       +A+     V V     +  
Sbjct: 212 PRWGHSTGNFNQWVQDNFRVGFILNRLQPHEQVQEGEGQICIASVTNPSVEVFYLGKWNP 271

Query: 309 SGNSLGLTAKDMWHEIKEHGSFDRLNSMETSVTSEPGSSIGAAIAASVTVPPDSEGQVTF 368
            G+       ++W      GS   L   E    +EPG  I AA+A   TV P    ++ F
Sbjct: 272 DGD-----GSEVWDYFAMDGS---LPDQEDETPAEPGEQIAAAMAIRFTVKPGRVKKIPF 323

Query: 369 SLAWDCPEVNFMSGKTYYRRYTKFYGTHQNAAANIARDAILEHGSWELQIEAWQRPILED 428
            LAWD P   F  G  YYRRYT F+G + N    + R A+     W  ++E W   IL++
Sbjct: 324 ILAWDLPVTEFTPGVQYYRRYTDFFGRNGNNGWAMVRTALKHSYVWRDKVEEWHNSILKN 383

Query: 429 KRLPEWYPITLFNELYYLNAGGAVWTDGS 457
             LP+W  + LFNELY L  GG +WT  S
Sbjct: 384 DNLPDWLKMALFNELYLLTDGGTLWTAAS 412


>gi|126657976|ref|ZP_01729128.1| hypothetical protein CY0110_05157 [Cyanothece sp. CCY0110]
 gi|126620614|gb|EAZ91331.1| hypothetical protein CY0110_05157 [Cyanothece sp. CCY0110]
          Length = 804

 Score =  242 bits (618), Expect = 6e-61,   Method: Compositional matrix adjust.
 Identities = 144/393 (36%), Positives = 210/393 (53%), Gaps = 28/393 (7%)

Query: 529 IGQFLYLEGIEYLMWNTYDVHFYSSFALIMLFPKIQLSIQRDFAAAVMMHDPSKMKLLDD 588
           IGQF  LE ++Y  + + DV  Y SF L+ML+P+++ ++   FA A+   D +   +  +
Sbjct: 417 IGQFGVLECMDYRWYESLDVRLYGSFGLLMLWPRLEKAVIEAFARAIPSSDDTIRIIGYN 476

Query: 589 GQWVSRKVLGAVPHDIGICD--PWFEVNAYCLYDTARWKDLNPKFVLQVYRDVVATG--D 644
                RK  GA PHD+G  +  PW + N     D   WKDL   FVLQVYRD + TG  D
Sbjct: 477 QAEGIRKAKGATPHDLGAPNEHPWEKTNYTSYQDCNLWKDLGSDFVLQVYRDFLLTGADD 536

Query: 645 KKFAKAVWPSVYVAMAYMDQFDRDGDGMIENDGFPDQTYDTWSVSGISAYSGGLWVAALQ 704
            +F    W +V  A+ Y+ +FD D DG+ EN G PDQT+D W + G+SAY G LW+AAL+
Sbjct: 537 IEFLWECWEAVPEALNYLKEFDLDNDGIPENSGAPDQTFDDWELRGMSAYCGALWIAALE 596

Query: 705 AASALAR------------------EVGDRGSEDYFLFKFQKAKVVYEKLWNGSYFNYDN 746
           AA  +                    E  ++  + Y  +  Q   V +  LWNG Y+  D+
Sbjct: 597 AAIKIGEILLENRPNNPQLEPKNYPECIEKELDKYQNWLQQSRSVYHSTLWNGEYYKLDS 656

Query: 747 SGSSQSSSIQADQLAGQWYARACGLLPIVDEDKARSALKKVYNYNVLKVMGGKRGAVNGM 806
              S S  + ADQL+GQ+YAR  GL  +V+   A SALKKVY    LK   GK GA NGM
Sbjct: 657 --ESGSDVVMADQLSGQFYARLLGLPDVVENQYALSALKKVYEACFLKFQNGKYGAANGM 714

Query: 807 LPDGRV-DMSSMQSREIWSGVTYAVAASMIHEDLADIGFQTACGIYEAAWSGTGLGYAFQ 865
            PDG   D +S   +E+W+G+ + +A+ +I   + D   +    + +  +     G  F+
Sbjct: 715 KPDGTPEDPNSTHPQEVWTGINFGLASFLIQMGMKDEALKLTEAVVKQVYEN---GLQFR 771

Query: 866 TPEAWNTDDQYRSLCYMRPLAIWAMQWALTRPK 898
           TPEA      +R+  Y+R +AIW + + L++ K
Sbjct: 772 TPEAITAVGTFRACHYLRAMAIWGIYYQLSKTK 804



 Score =  193 bits (490), Expect = 5e-46,   Method: Compositional matrix adjust.
 Identities = 129/389 (33%), Positives = 181/389 (46%), Gaps = 43/389 (11%)

Query: 95  HGVPLGGVGSGSIGRSYRGEFQRWQIFPRVCEDKPVLANQFSVFVSRSNGQKYSSVLCPK 154
           HG+PLGG G+G IGRS RG+F  W +       K + + QFS+F    +G   +  L  +
Sbjct: 41  HGMPLGGFGAGCIGRSPRGDFNLWHLDGGEHIFKSIPSCQFSIFEQLEDGTAQAYALSTE 100

Query: 155 TPEVLKDTTAAGIGSWDWNLKGDKSTYHALYPRAWTVHEGEPDPELRIVCRQISPIIPHN 214
            PE   D T   +  W W  + +K +Y ALYP +W  +EG    E  I C Q SPIIP+N
Sbjct: 101 PPE---DNT---LSRWSWYPQ-EKGSYSALYPCSWYQYEGVFQTE--IYCEQFSPIIPNN 151

Query: 215 YKESSYPVSVFTYTIYNSGKTSADITLLFTWTNSVG----------------GDSEFTGQ 258
           Y+E+SYP+ VF +TI N       ++++ TW N VG                G  E+  Q
Sbjct: 152 YQETSYPIGVFEWTIKNPTDKPITLSIMLTWQNIVGWFTNAIKSPEITVRDDGSPEYEYQ 211

Query: 259 --------HYNSKTKMNDGVHAVL--LHHRTSHQLPPVTFALAAQETDGVHVSLCPHFVI 308
                   ++N   + N  V  +L  L      Q       +A+     V V     +  
Sbjct: 212 PRWGHSTGNFNQWIQDNFRVGFILNRLQPHEQVQEGEGQICIASVTNPSVEVFYLGKWNP 271

Query: 309 SGNSLGLTAKDMWHEIKEHGSFDRLNSMETSVTSEPGSSIGAAIAASVTVPPDSEGQVTF 368
           +G+       ++W     +GS   L   E    +EPG  I AA+A   T+ P    ++ F
Sbjct: 272 NGD-----GSEVWDYFAMNGS---LPDQEDETPAEPGEQIAAAMAIRFTIKPGRVKKIPF 323

Query: 369 SLAWDCPEVNFMSGKTYYRRYTKFYGTHQNAAANIARDAILEHGSWELQIEAWQRPILED 428
            LAWD P   F  G   YRRYT F+G +      + R A+     W  Q+E W   IL+ 
Sbjct: 324 ILAWDLPVTEFAQGIQSYRRYTDFFGRNGQNGWAMVRTALKHSDVWRDQVEEWHNSILKQ 383

Query: 429 KRLPEWYPITLFNELYYLNAGGAVWTDGS 457
           K LP W+ + LFNELY L  GG +WT  S
Sbjct: 384 KDLPNWFKMALFNELYLLTDGGTIWTAAS 412


>gi|427709805|ref|YP_007052182.1| glucosylceramidase [Nostoc sp. PCC 7107]
 gi|427362310|gb|AFY45032.1| Glucosylceramidase [Nostoc sp. PCC 7107]
          Length = 802

 Score =  242 bits (618), Expect = 7e-61,   Method: Compositional matrix adjust.
 Identities = 146/388 (37%), Positives = 215/388 (55%), Gaps = 25/388 (6%)

Query: 529 IGQFLYLEGIEYLMWNTYDVHFYSSFALIMLFPKIQLSIQRDFAAAVMMHDPSK------ 582
           IGQF  LE ++Y  + + DV  Y SF L++LFP+++ S+ R FA  +   D  +      
Sbjct: 420 IGQFAVLECLDYRWYESLDVRLYGSFGLLLLFPELEKSVMRAFARGIPQSDDHQRIIGYY 479

Query: 583 MKLLDDGQWVSRKVLGAVPHDIGICDP--WFEVNAYCLYDTARWKDLNPKFVLQVYRDVV 640
             +  D    +RKV GA PHD+G  +   W + N  C  D   WKDL+  FVLQVYRD +
Sbjct: 480 YTIGADTTTAARKVAGATPHDLGAPNEHVWEKTNYTCYQDCNLWKDLSCDFVLQVYRDYL 539

Query: 641 ATG--DKKFAKAVWPSVYVAMAYMDQFDRDGDGMIENDGFPDQTYDTWSVSGISAYSGGL 698
            TG  D +F    W +V   + Y+  FD+DGDG+ EN G PDQT+D W + G+SAY GGL
Sbjct: 540 LTGADDVEFLADCWDAVVQTLDYLKTFDKDGDGIPENSGAPDQTFDDWRLLGVSAYCGGL 599

Query: 699 WVAALQAASALAR---------EVGDRGSEDYFLFKFQKAKVVYEKLWNGSYFNYDNSGS 749
           W+AAL+AA A++          E  ++    Y ++  Q   +  EKLWNG Y+  D+   
Sbjct: 600 WLAALEAAIAISDILITNHKVVESAEKQKSIYEVWLAQSRPIYQEKLWNGQYYRLDS--E 657

Query: 750 SQSSSIQADQLAGQWYARACGLLPIVDEDKARSALKKVYNYNVLKVMGGKRGAVNGMLPD 809
           S S+ + ADQL GQ+YAR   L  IV  D+A SALK VY+   LK   G+ GA NG+LPD
Sbjct: 658 SGSAVVMADQLCGQFYARLLELPDIVPSDRALSALKTVYDACFLKFQNGEFGAANGVLPD 717

Query: 810 GRVDM-SSMQSREIWSGVTYAVAASMIHEDLADIGFQTACGIYEAAWSGTGLGYAFQTPE 868
           G  +   +    E+W+G+ + +AA ++   + D   +    + E  ++    G  F+TPE
Sbjct: 718 GSPENPKATHPLEVWTGINFGLAAFLVQMGMQDEALRLTQAVVEQVYNN---GLQFRTPE 774

Query: 869 AWNTDDQYRSLCYMRPLAIWAMQWALTR 896
           A      +R+  Y+R +AIW +   +++
Sbjct: 775 AITATGTFRASTYLRAMAIWGIYLVMSK 802



 Score =  196 bits (499), Expect = 4e-47,   Method: Compositional matrix adjust.
 Identities = 135/391 (34%), Positives = 188/391 (48%), Gaps = 43/391 (10%)

Query: 95  HGVPLGGVGSGSIGRSYRGEFQRWQIFPRVCEDKPVLANQFSVFVSRSNGQKYSSVLCPK 154
           HG+PLGG G+G IGRS RG+F  W I       + V A QFSVF S       SS     
Sbjct: 40  HGMPLGGFGAGCIGRSSRGDFNLWHIDGGEHIFQNVPACQFSVFESYGT----SSQAYAL 95

Query: 155 TPEVLKDTTAAGIGSWDWNLKGDKST----YHALYPRAWTVHEGEPDPELRIVCRQISPI 210
           + + L+D +   + +W W     +ST    YHALYPR+W V+E     +L   C Q SPI
Sbjct: 96  STQSLEDGS---LNAWQWYPASSESTSTGTYHALYPRSWFVYENVFQSQL--TCEQFSPI 150

Query: 211 IPHNYKESSYPVSVFTYTIYNSGKTSADITLLFTWTNSVGGDSEFTGQHYNSKTKMNDGV 270
             +NY+E+SYPV+VF +  +N       ++++ TW N VG    FT    + + K+ D  
Sbjct: 151 WANNYQETSYPVAVFLWKAHNPSNAPITLSIMLTWENMVGW---FTNALKSPEVKIRDDG 207

Query: 271 HAVLLHHRTSHQ------------------LPPVTFALAAQETDG---VHVSLCPHFVIS 309
             V  +     +                  L  V      QE DG   +     P   I 
Sbjct: 208 SPVYEYQPRWGESQGNYNQVVENSEYFGCFLGRVGITAPVQEGDGSWCIATVKHPQVEIF 267

Query: 310 GNSLGL---TAKDMWHEIKEHGSFDRLNSMETSVTSEPGSSIGAAIAASVTVPPDSEGQV 366
            +S      T  ++W      GS    N ++T+   E    +G+AIA    V P    ++
Sbjct: 268 HHSKWHPEGTGDEVWQSFAADGSLP--NYLDTNPVGE-NQRLGSAIAVRFNVQPGETLEI 324

Query: 367 TFSLAWDCPEVNFMSGKTYYRRYTKFYGTHQNAAANIARDAILEHGSWELQIEAWQRPIL 426
            F LAWDCP   F +G  YYRRYT F+G   N A  IA  A+  + +W  QI++WQ PIL
Sbjct: 325 PFVLAWDCPVTEFAAGVNYYRRYTDFFGKDGNQAWEIATTALGNYQTWRSQIQSWQAPIL 384

Query: 427 EDKRLPEWYPITLFNELYYLNAGGAVWTDGS 457
             + LP W+ + LFNELY L +GG +W+  S
Sbjct: 385 NREDLPAWFKMALFNELYDLTSGGTLWSAAS 415


>gi|390437958|ref|ZP_10226465.1| Glycosidase-like protein [Microcystis sp. T1-4]
 gi|389838618|emb|CCI30589.1| Glycosidase-like protein [Microcystis sp. T1-4]
          Length = 784

 Score =  242 bits (617), Expect = 8e-61,   Method: Compositional matrix adjust.
 Identities = 140/379 (36%), Positives = 207/379 (54%), Gaps = 17/379 (4%)

Query: 526 EENIGQFLYLEGIEYLMWNTYDVHFYSSFALIMLFPKIQLSIQRDFAAAVMMHDPSKMKL 585
           ++ IGQF  LE I+Y  + + DV  Y SF L+ML+P+++ ++   FA A+   D +   +
Sbjct: 412 QDPIGQFGVLECIDYRWYESLDVRLYGSFGLLMLWPRLEKAVMEAFARAIPSSDDTPRII 471

Query: 586 LDDGQWVSRKVLGAVPHDIGICD--PWFEVNAYCLYDTARWKDLNPKFVLQVYRDVVATG 643
             +     RK   A PHD+G  +  PW + N     D   WKDL   FVL VYRD + TG
Sbjct: 472 GYNQASAIRKAKNATPHDLGAPNEHPWQKSNYTSYQDCNLWKDLGSDFVLLVYRDYLLTG 531

Query: 644 --DKKFAKAVWPSVYVAMAYMDQFDRDGDGMIENDGFPDQTYDTWSVSGISAYSGGLWVA 701
             D+ F +  WPS+ + + Y+  FD D DG+ EN G PDQT+D W + GISAY GGLW+A
Sbjct: 532 AKDEDFLRECWPSIVLTLQYLKTFDLDKDGIPENSGAPDQTFDDWRLQGISAYCGGLWIA 591

Query: 702 ALQAASALAREVGDRGSEDYFLFK--FQKAKVVY-EKLWNGSYFNYDNSGSSQSSSIQAD 758
           AL+A   +   +G    ED  +F    Q+++ +Y + LWNG Y+N D+   S S  +  D
Sbjct: 592 ALEATLKIGAILG----EDTAIFAGWLQQSRAIYHQTLWNGEYYNLDS--GSGSDVVMTD 645

Query: 759 QLAGQWYARACGLLPIVDEDKARSALKKVYNYNVLKVMGGKRGAVNGMLPDGRVDM-SSM 817
           QL GQ+YAR   L  +V+    +SAL K+Y    LK   GK GA NG+ PDG  +   + 
Sbjct: 646 QLCGQFYARLLALPDVVENQYTQSALSKIYQACFLKFHEGKFGAANGLKPDGTPENPQAT 705

Query: 818 QSREIWSGVTYAVAASMIHEDLADIGFQTACGIYEAAWSGTGLGYAFQTPEAWNTDDQYR 877
              E+W G+ + + A ++   +   G+Q    +    +     G  F+TPEA  +   +R
Sbjct: 706 HPLEVWVGINFGLVAFLMQMGMEKQGWQITEAVVRQVYEN---GLQFRTPEAITSVGTFR 762

Query: 878 SLCYMRPLAIWAMQWALTR 896
           +  Y+R +AIWA+   LT 
Sbjct: 763 ACHYLRAMAIWAIYGILTE 781



 Score =  184 bits (466), Expect = 3e-43,   Method: Compositional matrix adjust.
 Identities = 115/387 (29%), Positives = 179/387 (46%), Gaps = 47/387 (12%)

Query: 95  HGVPLGGVGSGSIGRSYRGEFQRWQIFPRVCEDKPVLANQFSVFVSRSNGQKYSSVLCPK 154
           HG+PLGG G+G IGRS +G+F  W +         + A QFS+F    +G   +  +  +
Sbjct: 41  HGMPLGGFGAGCIGRSPKGDFNLWHLDGGEHSFNSLAACQFSIFEQTEDGSAQAYAMATE 100

Query: 155 TPEVLKDTTAAGIGSWDWNLKGDKSTYHALYPRAWTVHEGEPDPELRIVCRQISPIIPHN 214
           +P      T   +  W W    +K  Y ALYPR+W  +E +   + +++C Q SP+  HN
Sbjct: 101 SP------TDGTLSRWSW-YPAEKGNYSALYPRSW--YEYKDVFQTQLICEQFSPVWAHN 151

Query: 215 YKESSYPVSVFTYTIYNSGKTSADITLLFTWTNSVG------------------------ 250
           Y+ESSYP+++F +T +N       I+++ TW N++G                        
Sbjct: 152 YQESSYPIALFEWTAHNPTDKPITISIMLTWQNTIGWFTNAIKSPVVKVRDDGSPVYEYQ 211

Query: 251 ---GDSEFTGQHYNSKTKMNDGVHAVLLHHRTSHQLPPVTFALAAQETDGVHVSLCPHFV 307
              G+SE    +YN     N  V  ++     +        A+A+     + V     + 
Sbjct: 212 SKWGESE---GNYNQWIVDNFRVGCLMRRDYENPGEGDGQMAIASITNPSLEVFYLGKWN 268

Query: 308 ISGNSLGLTAKDMWHEIKEHGSFDRLNSMETSVTSEPGSSIGAAIAASVTVPPDSEGQVT 367
            +G+       ++W     +GS   L  +E    ++ G     A+A   T+ P    ++ 
Sbjct: 269 PAGD-----GGEVWDYFAMNGS---LPDIEDETPAQKGEQTATAMAIRFTIRPGKTRKIP 320

Query: 368 FSLAWDCPEVNFMSGKTYYRRYTKFYGTHQNAAANIARDAILEHGSWELQIEAWQRPILE 427
           F LAWD P   F  G  YYRRYT F+G +   A  + R A+    +W   I  WQ+PILE
Sbjct: 321 FILAWDLPVTEFAQGINYYRRYTDFFGRNGQNAWAMVRTALKHGDTWRENIINWQKPILE 380

Query: 428 DKRLPEWYPITLFNELYYLNAGGAVWT 454
              L +W+ + LFNELY L  GG +WT
Sbjct: 381 RSDLADWFKMALFNELYLLTDGGTLWT 407


>gi|119510228|ref|ZP_01629365.1| hypothetical protein N9414_13977 [Nodularia spumigena CCY9414]
 gi|119465077|gb|EAW45977.1| hypothetical protein N9414_13977 [Nodularia spumigena CCY9414]
          Length = 814

 Score =  241 bits (616), Expect = 1e-60,   Method: Compositional matrix adjust.
 Identities = 148/382 (38%), Positives = 211/382 (55%), Gaps = 25/382 (6%)

Query: 529 IGQFLYLEGIEYLMWNTYDVHFYSSFALIMLFPKIQLSIQRDFAAAVMMHDPSK------ 582
           IGQF  LE ++Y  + + DV  Y SF L+ LFP+++ S+ R FA A+   D  +      
Sbjct: 420 IGQFAVLECLDYRWYESLDVRLYGSFGLLHLFPELEKSVIRAFARAIPHSDNHQRVIGYY 479

Query: 583 MKLLDDGQWVSRKVLGAVPHDIGICDP--WFEVNAYCLYDTARWKDLNPKFVLQVYRDVV 640
             +  D     RKV GA PHD+G  +   W + N  C  D   WKDL   FVLQVYRD +
Sbjct: 480 YTIGADTTTAVRKVAGATPHDLGAPNEHVWEKTNYTCYQDCNLWKDLGSDFVLQVYRDFL 539

Query: 641 ATG--DKKFAKAVWPSVYVAMAYMDQFDRDGDGMIENDGFPDQTYDTWSVSGISAYSGGL 698
            TG  D +F    W  +   + Y+  FD DGDG+ EN G PDQT+D W + G+SAY GGL
Sbjct: 540 LTGADDVQFLADCWAGIVQTLDYLKSFDLDGDGIPENSGAPDQTFDDWRLQGVSAYCGGL 599

Query: 699 WVAALQAASALAREV--GDRGSED-------YFLFKFQKAKVVYEKLWNGSYFNYDNSGS 749
           W+AAL+AA A++  +    RG+ED       Y ++  Q   +  EKLWNG YF  D+   
Sbjct: 600 WLAALEAAIAISDILLTNHRGAEDTEELGQRYRVWLAQSRTIYQEKLWNGQYFRLDS--E 657

Query: 750 SQSSSIQADQLAGQWYARACGLLPIVDEDKARSALKKVYNYNVLKVMGGKRGAVNGMLPD 809
           S S  + ADQL GQ+YAR  GL  IV  D+A SAL+ VY+   +K   G+ GA NG+ PD
Sbjct: 658 SGSDVVMADQLCGQFYARLLGLPDIVPSDRALSALQTVYHACFVKFCNGEFGAANGVRPD 717

Query: 810 GRVDM-SSMQSREIWSGVTYAVAASMIHEDLADIGFQTACGIYEAAWSGTGLGYAFQTPE 868
           G  +  ++    E+W+G+ + +AA ++   + D   +    + +  +     G  F+TPE
Sbjct: 718 GSAENPNATHPLEVWTGINFGLAAFLVQMGMQDEALKLTGAVVQQIYHN---GLQFRTPE 774

Query: 869 AWNTDDQYRSLCYMRPLAIWAM 890
           A      +R+  Y+R +AIWA+
Sbjct: 775 AITASGTFRASTYLRAMAIWAI 796



 Score =  191 bits (486), Expect = 1e-45,   Method: Compositional matrix adjust.
 Identities = 130/392 (33%), Positives = 187/392 (47%), Gaps = 45/392 (11%)

Query: 95  HGVPLGGVGSGSIGRSYRGEFQRWQIFPRVCEDKPVLANQFSVFVSR-SNGQKYSSVLCP 153
           HG+PLGG G+G IGRS RG+F  W I       + + A QFSVF S  ++ Q Y+  L  
Sbjct: 40  HGMPLGGFGAGCIGRSSRGDFNLWHIDGGEHTFQNIPACQFSVFESDGTSAQAYA--LAT 97

Query: 154 KTPEVLKDTTAAGIGSWDW----NLKGDKSTYHALYPRAWTVHEGEPDPELRIVCRQISP 209
           + PE         + +W W           TYHALYPR+W V+E     +L   C Q SP
Sbjct: 98  EAPE------NGSLSAWKWYPASTATQSTGTYHALYPRSWFVYENVLQAQL--TCEQFSP 149

Query: 210 IIPHNYKESSYPVSVFTYTIYNSGKTSADITLLFTWTNSVGGDSEFTGQHYNSKTKMNDG 269
           I   NY+ESSYPV+VF +  +N       ++++ TW N VG    FT    + + ++ D 
Sbjct: 150 IWAENYQESSYPVAVFNWKAHNPTNAPITLSIMLTWQNMVGW---FTNALKSPEVRIRDD 206

Query: 270 VHAVLLHHR------------TSHQLPPVTFALAAQETDGV---HVSLC------PHFVI 308
              V  +              T +      F     + D V     S C      P F +
Sbjct: 207 GSPVYEYQPRLGDSQDNYNCITENTENFSCFCSRVVDDDSVAEGDGSWCIATIKHPQFKV 266

Query: 309 SGNSL---GLTAKDMWHEIKEHGSFDRLNSMETSVTSEPGSSIGAAIAASVTVPPDSEGQ 365
             N+      T  D+W      GS   L + + +  +   + IG AIA   T+ P     
Sbjct: 267 FYNTCWNPAGTGADVWETFAADGS---LTNYQDATPALENTQIGVAIALRFTLQPGETLA 323

Query: 366 VTFSLAWDCPEVNFMSGKTYYRRYTKFYGTHQNAAANIARDAILEHGSWELQIEAWQRPI 425
           + F L+WD P   F +G  YYRRYT F+G   + A  IA  A+ ++ +W   IE WQ+PI
Sbjct: 324 IPFVLSWDFPVTEFAAGINYYRRYTDFFGRGGDHAWAIASTALTQYQTWYKNIENWQQPI 383

Query: 426 LEDKRLPEWYPITLFNELYYLNAGGAVWTDGS 457
           ++ + LP+W+ + LFNELY L +GG +W+  S
Sbjct: 384 IDREDLPDWFKMALFNELYDLTSGGTLWSAAS 415


>gi|428308994|ref|YP_007119971.1| bile acid beta-glucosidase [Microcoleus sp. PCC 7113]
 gi|428250606|gb|AFZ16565.1| putative bile acid beta-glucosidase [Microcoleus sp. PCC 7113]
          Length = 819

 Score =  241 bits (616), Expect = 1e-60,   Method: Compositional matrix adjust.
 Identities = 147/390 (37%), Positives = 212/390 (54%), Gaps = 28/390 (7%)

Query: 524 DGEENIGQFLYLEGIEYLMWNTYDVHFYSSFALIMLFPKIQLSIQRDFAAAVMMHDPSKM 583
           D  + IGQF  LE ++Y  + + DV  Y SFAL ML+P++  ++   FA A+ + D +  
Sbjct: 429 DDRDPIGQFGVLECLDYRWYESLDVRLYGSFALAMLWPRLDKAVLEAFARAIPIGDKTPR 488

Query: 584 KLLDDGQWVSRKVLGAVPHDIGICD--PWFEVNAYCLYDTARWKDLNPKFVLQVYRDVVA 641
            +  +     RK   A PHD+G  +  PW + N     D  +WKDL   FVLQVYRD V 
Sbjct: 489 IIGYNQAQAIRKAADATPHDLGAPNEHPWEQTNYTSYQDCNQWKDLPCDFVLQVYRDFVF 548

Query: 642 TG--DKKFAKAVWPSVYVAMAYMDQFDRDGDGMIENDGFPDQTYDTWSVSGISAYSGGLW 699
           TG  D +F    W SV   +AY+  FD+DGDG+ EN G PDQT+D W + GISAY GGLW
Sbjct: 549 TGANDTEFLWECWSSVVKTLAYLKTFDKDGDGIPENSGAPDQTFDDWRLQGISAYCGGLW 608

Query: 700 VAALQAASALAR-------------EVGDRGS--EDYFLFK--FQKAKVVY-EKLWNGSY 741
           +AAL+AA A+               EV D  S  E   L++   ++++ +Y EKLWNG Y
Sbjct: 609 LAALEAAIAMGEILINHYPPTSELIEVPDPESIQETLKLYRQWLEQSQPIYQEKLWNGQY 668

Query: 742 FNYDNSGSSQSSSIQADQLAGQWYARACGLLPIVDEDKARSALKKVYNYNVLKVMGGKRG 801
           +  D+   S S  +  DQL GQ+YA+  GL  IV  +  +SALK VY+   LK    + G
Sbjct: 669 YRLDS--ESGSDIVMTDQLCGQFYAQLLGLPDIVPPECVQSALKSVYDSCFLKFQDAQLG 726

Query: 802 AVNGMLPDGRVDM-SSMQSREIWSGVTYAVAASMIHEDLADIGFQTACGIYEAAWSGTGL 860
           A NG+ PDG  +   +    E+W+G+ + +AA ++ + +     Q    +    +     
Sbjct: 727 AANGVRPDGSPENPDATHPLEVWTGINFGLAAFLVQQGMKQEALQLTEAVVRQVYEN--- 783

Query: 861 GYAFQTPEAWNTDDQYRSLCYMRPLAIWAM 890
           G  F+TPEA      +R+  Y+R +AIWA+
Sbjct: 784 GLQFRTPEAITAAGTFRASHYLRAMAIWAV 813



 Score =  203 bits (516), Expect = 5e-49,   Method: Compositional matrix adjust.
 Identities = 134/401 (33%), Positives = 185/401 (46%), Gaps = 56/401 (13%)

Query: 95  HGVPLGGVGSGSIGRSYRGEFQRWQIFPRVCEDKPVLANQFSVFVSRSNGQKYSSVLCPK 154
           HG+PLGG G+G IGRS RG+F  W +       K + A QFSVF    N  K +  LC +
Sbjct: 41  HGMPLGGFGAGCIGRSPRGDFNLWHLDGGEHSFKSLPACQFSVFEQSENAPKQAYALCTE 100

Query: 155 TPEVLKDTTAAGIGSWDW-----NLKGDKSTYHALYPRAWTVHEGEPDPELRIVCRQISP 209
            P+   D T   +G+W W       KG+  TY ALYP++W  +E     EL   C Q SP
Sbjct: 101 PPQ---DGT---LGTWQWYPTSQQAKGNTGTYQALYPQSWFTYENVFQTEL--TCEQFSP 152

Query: 210 IIPHNYKESSYPVSVFTYTIYNSGKTSADITLLFTWTNSVGGDSEFTGQHYNSKTKMNDG 269
           I    Y+E+SYPV+VF +T +N       ++++ TW N VG    FT    + + K+ D 
Sbjct: 153 IWAGCYQETSYPVAVFEWTAHNPTDAPITLSIMLTWQNIVGW---FTNAIKSPQVKVRDD 209

Query: 270 VHAV--------------------------LLHHRTSHQLPPVTFALAAQETDGVHVSLC 303
              V                          +L     H    V     A E       LC
Sbjct: 210 GSPVYEYQPRWGESTGSYNQWIVDNFRVGCILDRVRLHDSEAVADFRGASEIQEGEGQLC 269

Query: 304 PHFVISGNSLGL----------TAKDMWHEIKEHGSFDRLNSMETSVTSEPGSSIGAAIA 353
               ++  S+ +             ++W      GS   L+ M+    + PG  I AAIA
Sbjct: 270 -FATVTNPSMEVFYHTRWNPAGNGAEVWQSFAADGS---LSDMQDETAAAPGEQIAAAIA 325

Query: 354 ASVTVPPDSEGQVTFSLAWDCPEVNFMSGKTYYRRYTKFYGTHQNAAANIARDAILEHGS 413
              T+ P    ++ F LAWD P   F  G  YYRRYT F+G   N A +I R A+     
Sbjct: 326 IRFTIRPGRTRKIPFILAWDFPVTEFSPGVNYYRRYTDFFGRTGNNAWSIVRTALKHADL 385

Query: 414 WELQIEAWQRPILEDKRLPEWYPITLFNELYYLNAGGAVWT 454
           W+ +I++WQ PIL+ + LP W+ + LFNELY L  GG +WT
Sbjct: 386 WKEKIQSWQEPILKREDLPHWFKMALFNELYLLADGGTLWT 426


>gi|67922703|ref|ZP_00516206.1| Protein of unknown function DUF608 [Crocosphaera watsonii WH 8501]
 gi|67855413|gb|EAM50669.1| Protein of unknown function DUF608 [Crocosphaera watsonii WH 8501]
          Length = 804

 Score =  241 bits (615), Expect = 1e-60,   Method: Compositional matrix adjust.
 Identities = 142/390 (36%), Positives = 207/390 (53%), Gaps = 28/390 (7%)

Query: 529 IGQFLYLEGIEYLMWNTYDVHFYSSFALIMLFPKIQLSIQRDFAAAVMMHDPSKMKLLDD 588
           +GQF  LE ++Y  + + DV  Y SF L+ML+P ++ ++   FA A+   D +   +  +
Sbjct: 417 VGQFGVLECLDYCWYESLDVRLYGSFGLLMLWPGLEKAVMEAFARAIPSSDDTIRIIGYN 476

Query: 589 GQWVSRKVLGAVPHDIGICD--PWFEVNAYCLYDTARWKDLNPKFVLQVYRDVVATG--D 644
                RKV GA PHD+G  +  PW + N     D   WKDL   FVLQVYRD + TG  D
Sbjct: 477 QAEGFRKVKGATPHDLGAPNEHPWEKTNYTSYQDCNLWKDLGSDFVLQVYRDFLLTGSDD 536

Query: 645 KKFAKAVWPSVYVAMAYMDQFDRDGDGMIENDGFPDQTYDTWSVSGISAYSGGLWVAALQ 704
             F    W ++   + Y+ +FD D DG+ EN G PDQT+D W +SGISAY GGLW+AAL+
Sbjct: 537 IDFLWECWAAIPETLNYLKEFDLDNDGIPENSGAPDQTFDDWKLSGISAYCGGLWIAALE 596

Query: 705 AASALAR------------------EVGDRGSEDYFLFKFQKAKVVYEKLWNGSYFNYDN 746
           AA  +                    E  ++  + Y  +  Q   + +  LWNG Y+N D+
Sbjct: 597 AAIKIGEILLENSPNNPQLEPENYPESIEKELDKYNNWLQQSRAIYHSTLWNGEYYNLDS 656

Query: 747 SGSSQSSSIQADQLAGQWYARACGLLPIVDEDKARSALKKVYNYNVLKVMGGKRGAVNGM 806
              S S  + ADQL GQ+YAR  GL  +V+ +   SAL KVY    LK   GK GA NGM
Sbjct: 657 --KSGSDIVMADQLCGQFYARLLGLSDVVESEYTLSALNKVYEACFLKFQNGKYGAANGM 714

Query: 807 LPDGRV-DMSSMQSREIWSGVTYAVAASMIHEDLADIGFQTACGIYEAAWSGTGLGYAFQ 865
            PDG   D +S   +E+W+G+ + +A+ +I   + +   +    + +  +     G  F+
Sbjct: 715 KPDGTPEDPNSTHPQEVWTGINFGIASFLIQMGMKNEALKLTETVVKQVYEN---GLQFR 771

Query: 866 TPEAWNTDDQYRSLCYMRPLAIWAMQWALT 895
           TPEA      YR+  Y+R +AIW + + L+
Sbjct: 772 TPEAITAVGTYRACHYLRAMAIWGVYYQLS 801



 Score =  193 bits (490), Expect = 5e-46,   Method: Compositional matrix adjust.
 Identities = 130/389 (33%), Positives = 179/389 (46%), Gaps = 43/389 (11%)

Query: 95  HGVPLGGVGSGSIGRSYRGEFQRWQIFPRVCEDKPVLANQFSVFVSRSNGQKYSSVLCPK 154
           HG+PLGG G+G IGRS RG+F  W +       K + + QFS+F    +G   +  L  +
Sbjct: 41  HGMPLGGFGAGCIGRSPRGDFNLWHLDGGEHIFKSIPSCQFSIFEQLEDGTAQAYALATE 100

Query: 155 TPEVLKDTTAAGIGSWDWNLKGDKSTYHALYPRAWTVHEGEPDPELRIVCRQISPIIPHN 214
            PE   D T   +  W W    +K TY ALYP +W  +EG    E  I C Q SPIIP N
Sbjct: 101 PPE---DKT---LSRWSW-YPTEKGTYSALYPCSWYQYEGVFQAE--IYCEQFSPIIPDN 151

Query: 215 YKESSYPVSVFTYTIYNSGKTSADITLLFTWTNSVG----------------GDSEFTGQ 258
           Y+E+SYP+ VF +T+ N       ++++ TW N VG                G  E+  Q
Sbjct: 152 YQETSYPIGVFEWTVKNPTDKPITLSIMLTWQNIVGWFTNAIKSPEITVRDDGSPEYEYQ 211

Query: 259 --------HYNSKTKMNDGVHAVL--LHHRTSHQLPPVTFALAAQETDGVHVSLCPHFVI 308
                   ++N   + N  V  +L  L      Q       +A+     V V     +  
Sbjct: 212 PRWGHSTGNFNQWVQDNFRVGFILNRLQPHEQVQEGEGQICIASVTNPSVEVFNLGKWNP 271

Query: 309 SGNSLGLTAKDMWHEIKEHGSFDRLNSMETSVTSEPGSSIGAAIAASVTVPPDSEGQVTF 368
            G+       ++W      GS   L   E    +EPG  I AA+A   TV P    ++ F
Sbjct: 272 DGD-----GSEVWDYFAMDGS---LPDQEDETPAEPGEQIAAAMAIRFTVKPGRVKKIPF 323

Query: 369 SLAWDCPEVNFMSGKTYYRRYTKFYGTHQNAAANIARDAILEHGSWELQIEAWQRPILED 428
            LAWD P   F  G  YYRRYT F+G + N    + R A+     W  ++E W   IL++
Sbjct: 324 ILAWDLPVTEFTPGVQYYRRYTDFFGRNGNNGWAMVRTALKHSDVWRDKVEEWHNSILKN 383

Query: 429 KRLPEWYPITLFNELYYLNAGGAVWTDGS 457
             LP+W  + LFNELY L  GG +WT  S
Sbjct: 384 DNLPDWLKMALFNELYLLTDGGTLWTAAS 412


>gi|162452686|ref|YP_001615053.1| hypothetical protein sce4410 [Sorangium cellulosum So ce56]
 gi|161163268|emb|CAN94573.1| hypothetical protein sce4410 [Sorangium cellulosum So ce56]
          Length = 782

 Score =  241 bits (614), Expect = 2e-60,   Method: Compositional matrix adjust.
 Identities = 149/364 (40%), Positives = 194/364 (53%), Gaps = 19/364 (5%)

Query: 532 FLYLEGIEYLMWNTYDVHFYSSFALIMLFPKIQLSIQRDFAAAVMMHDPSKMKLLDDGQW 591
           F  LE   Y  + T DV FY S+AL  L+P I     R F  +V      +         
Sbjct: 427 FTSLESYIYPFYGTSDVRFYGSWALFSLWPDIDKQEVRQFCDSVTTTRSDR--------- 477

Query: 592 VSRKVLGAVPHDIGICDPWFEV-NAYCLYDTARWKDLNPKFVLQVYRDVVATG--DKKFA 648
              + +G   HD G  +  F   NAY   D+  WKDLN K VL VYRD   TG  DK F 
Sbjct: 478 --PRAIGTTAHDFGDNESVFRKWNAYTYRDSTNWKDLNSKLVLMVYRDWALTGKTDKAFL 535

Query: 649 KAVWPSVYVAMAYMDQFDRDGDGMIENDGFPDQTYDTWSVSGISAYSGGLWVAALQAASA 708
              WP+V +AM  +   D DGDG+ E++G  DQTYD   + G +AY GGL++AA +AA  
Sbjct: 536 DYCWPAVKMAMQKVKGQDGDGDGLPESNGI-DQTYDDMDLHGNTAYCGGLFIAASEAAKE 594

Query: 709 LAREVGDRGSEDYFLFKFQKAKVVYE-KLWNGSYFNYDNSGSSQSSSIQADQLAGQWYAR 767
           LA  VGD      +    +++K  +E KLW GSY+  D +GS  ++ I +DQLAGQWYAR
Sbjct: 595 LATAVGDASLASTYQTWVEQSKGGFESKLWTGSYYKID-TGSRDTNRIMSDQLAGQWYAR 653

Query: 768 ACGLLPIVDEDKARSALKKVYNYNVLKVMGGKRGAVNGMLPDGRVDMSSMQSREIWSGVT 827
           A GL PIVD   A SA  K+Y  N  +  GG RG VN M  +G VD +S Q+RE W G +
Sbjct: 654 ALGLPPIVDPSHAASAFTKIYENNFKRFDGGTRGVVNVMTANGSVDGTSNQTRECWVGTS 713

Query: 828 YAVAASMIHEDLADIGFQTACGIYEAAWSGTGLGYAFQTPEAWNTDDQYRSLCYMRPLAI 887
           + V A MI E LA    +    + +  W   GL   F+TPEAW  +   R+  YMR   +
Sbjct: 714 WGVVAGMIQEGLAAQAGEIGASLVDTIWKTDGL--WFRTPEAWEGNGSIRAPYYMRATTL 771

Query: 888 WAMQ 891
           WA +
Sbjct: 772 WAAK 775



 Score =  139 bits (351), Expect = 6e-30,   Method: Compositional matrix adjust.
 Identities = 110/368 (29%), Positives = 162/368 (44%), Gaps = 56/368 (15%)

Query: 96  GVPLGGVGSGSIGRSYRGEFQRWQIFPRVCEDKPVLANQFSVFV-SRSNGQKYSSVLCPK 154
           G P+GG G+G+I   + G F + ++   +  +   + N    F+  ++ G         +
Sbjct: 97  GAPIGGFGAGTITWRFDGRFYKGRL--DIGSNDQNVDNSAGFFMYQKAEGGD------AQ 148

Query: 155 TPEVLKDTTAAGIGSWDWNLKGDKSTYHALYPRAWTVHEGEPDPELRIVCRQISPIIPHN 214
           T  + KD    G           ++TY +L+PR+W  + G   P  +    Q SPIIP +
Sbjct: 149 TKRLDKDGLGQG-----------QATYSSLFPRSWVDYHGAAFP-CKAKVEQYSPIIPGD 196

Query: 215 YKESSYPVSVFTYTIYNSGKTSADITLLFTWTNSVGGDSEFTGQHYNSKTKMNDGVHAVL 274
           YK++SYPV V+ + I N      D+ ++ TW N  GG         N  T    G    L
Sbjct: 197 YKKTSYPVGVYRWEITNPTGAPCDVAVMLTWKNDHGG---------NGATVQTSGDDVGL 247

Query: 275 LHHRTSHQLPPVT-----FALAAQETDGVHVSLCPHFVISGNSLGLTAKDMWHEIKEHGS 329
           +  R      P +     F LA++   GV VS       S +S+      +  E+   G+
Sbjct: 248 VLTRGGGA--PSSEGQGEFTLASRNAPGVVVSYQ-----SADSVAA----LQSELANDGA 296

Query: 330 FDRLNSMETSVTSEPGSSIGAAIAASVTVPPDSEGQVTFSLAWDCPEVNFMSGKTYYRRY 389
                       +  G S+G AIA   TV P     V   LAWD P     SG  +YR Y
Sbjct: 297 L---------ANTTGGHSLG-AIAFKATVEPGKSVIVPIVLAWDIPVTQAGSGDRWYRAY 346

Query: 390 TKFYGTHQNAAANIARDAILEHGSWELQIEAWQRPILEDKRLPEWYPITLFNELYYLNAG 449
           T+ +G   NA+  IA +A+ EH  W   I+ WQ  +L+D + PEW    LFNELYY    
Sbjct: 347 TRHFGRTGNASWTIAAEALAEHEPWLWAIQDWQASVLDDPKYPEWLKGALFNELYYYAIA 406

Query: 450 GAVWTDGS 457
           G  W  G+
Sbjct: 407 GTYWEAGA 414


>gi|427727368|ref|YP_007073605.1| putative bile acid beta-glucosidase [Nostoc sp. PCC 7524]
 gi|427363287|gb|AFY46008.1| putative bile acid beta-glucosidase [Nostoc sp. PCC 7524]
          Length = 803

 Score =  241 bits (614), Expect = 2e-60,   Method: Compositional matrix adjust.
 Identities = 143/382 (37%), Positives = 211/382 (55%), Gaps = 26/382 (6%)

Query: 529 IGQFLYLEGIEYLMWNTYDVHFYSSFALIMLFPKIQLSIQRDFAAAVMMHDPSK------ 582
           IGQF  LE ++Y  + + DV  Y SF L++LFP+++ S+ R FA A+   D +       
Sbjct: 420 IGQFAVLECLDYRWYESLDVRLYGSFGLVLLFPELEKSVMRAFARAIPQGDDTPRVIGYY 479

Query: 583 MKLLDDGQWVSRKVLGAVPHDIGICDP--WFEVNAYCLYDTARWKDLNPKFVLQVYRDVV 640
             +  +     RK +GA PHD+G  +   W + N     D   WKDL   FVLQVYRD +
Sbjct: 480 YTIGAESPVAVRKAVGATPHDLGAPNEHVWQKTNYTSYQDCNLWKDLGCDFVLQVYRDYL 539

Query: 641 ATG--DKKFAKAVWPSVYVAMAYMDQFDRDGDGMIENDGFPDQTYDTWSVSGISAYSGGL 698
            TG  D +F +  W ++   + Y+  FD DGDG+ EN G PDQT+D W + G+SAY GGL
Sbjct: 540 FTGANDVEFLQDCWDAIVQTLDYVKTFDLDGDGIPENSGAPDQTFDDWRLQGVSAYCGGL 599

Query: 699 WVAALQAASALAREVGDRGSEDYFLFK--------FQKAKVVY-EKLWNGSYFNYDNSGS 749
           W+AAL+AA A++ E+  +   D  L K         +K+  +Y EKLWNG Y+  D+  +
Sbjct: 600 WMAALEAAIAIS-EILSQHQPDSALLKQKSIYATWLEKSLPIYQEKLWNGQYYRLDS--N 656

Query: 750 SQSSSIQADQLAGQWYARACGLLPIVDEDKARSALKKVYNYNVLKVMGGKRGAVNGMLPD 809
           S S  + ADQL GQ+YA    L  IV  D+A SAL+ VY+   LK   G+ GA NG+ PD
Sbjct: 657 SGSDVVMADQLCGQFYAALLNLPDIVPGDRAISALRTVYDACFLKFYNGQFGAANGVRPD 716

Query: 810 GRVDM-SSMQSREIWSGVTYAVAASMIHEDLADIGFQTACGIYEAAWSGTGLGYAFQTPE 868
           G  +   +    E+W+G+ + +AA ++   + D   +    + +  +S    G  F+TPE
Sbjct: 717 GSPENPQATHPLEVWTGINFGLAAFLVQMGMKDEALRLTEAVVQQIYSN---GLQFRTPE 773

Query: 869 AWNTDDQYRSLCYMRPLAIWAM 890
           A      +R+  Y+R +AIW +
Sbjct: 774 AITPSGTFRASTYLRAMAIWGV 795



 Score =  162 bits (410), Expect = 8e-37,   Method: Compositional matrix adjust.
 Identities = 128/386 (33%), Positives = 188/386 (48%), Gaps = 39/386 (10%)

Query: 95  HGVPLGGVGSGSIGRSYRGEFQRWQIFPRVCEDKPVLANQFSVFVSR-SNGQKYSSVLCP 153
           HG+PLGG G+G IGRS RG+F  W I       + + A QFSVF S  ++ Q Y+  L  
Sbjct: 40  HGMPLGGFGAGCIGRSSRGDFNLWHIDGGEHLFQNIPACQFSVFESHGTSSQAYA--LAT 97

Query: 154 KTPEVLKDTTAAGIGSWDW----NLKGDKSTYHALYPRAWTVHEGEPDPELRIVCRQISP 209
             PE         + +W W        +   YHALYPR+W V+E     +L   C Q SP
Sbjct: 98  TAPE------DGSLQAWQWYPASTATQETGAYHALYPRSWFVYENVLQAQL--TCEQFSP 149

Query: 210 IIPHNYKESSYPVSVFTYTIYNSGKTSADITLLFTWTNSVGGDSEFTGQHYNSKTKM-ND 268
           +  HNY+E+SYPV+VF +  +N       ++++ TW N  G    FT    + + K+ +D
Sbjct: 150 VYAHNYQETSYPVAVFVWKAHNPTNAPITLSIMLTWQNMTGW---FTNALKSPEVKVRDD 206

Query: 269 GVHAVLLHHRTSHQLPPVTFALAAQETDGVHVSLC----PHFVISGNSLGLTAKDMWHEI 324
           G        R         + + + ++ G  +       P     G+   +T K    E+
Sbjct: 207 GSPVYEYQPRLGESQGNYNYLVESPQSLGCFLGRVGITEPLQEGEGSWCIVTQKHPQVEV 266

Query: 325 KEHG-------------SFDRLNSMETSVTSEPGSS---IGAAIAASVTVPPDSEGQVTF 368
             H              SF    S+   + + P      IG AIA   T+ P    ++ F
Sbjct: 267 FHHTRWNAAGKGDEIWQSFAADGSLPDYIDTTPAGENEQIGTAIAVRFTLQPGENLEIPF 326

Query: 369 SLAWDCPEVNFMSGKTYYRRYTKFYGTHQNAAANIARDAILEHGSWELQIEAWQRPILED 428
            LAWD P   F +G  Y+RRYT F+GT+ N A  IA  A+ ++ +W+ QI+AWQ+PI+E 
Sbjct: 327 VLAWDFPVTEFAAGVNYHRRYTDFFGTNGNNAWAIATLALEQYQTWQQQIQAWQQPIIER 386

Query: 429 KRLPEWYPITLFNELYYLNAGGAVWT 454
           + LP W  + LFNELY L +GG +W+
Sbjct: 387 EDLPNWLKMALFNELYDLTSGGTLWS 412


>gi|427716943|ref|YP_007064937.1| glucosylceramidase [Calothrix sp. PCC 7507]
 gi|427349379|gb|AFY32103.1| Glucosylceramidase [Calothrix sp. PCC 7507]
          Length = 800

 Score =  240 bits (613), Expect = 2e-60,   Method: Compositional matrix adjust.
 Identities = 149/383 (38%), Positives = 211/383 (55%), Gaps = 26/383 (6%)

Query: 529 IGQFLYLEGIEYLMWNTYDVHFYSSFALIMLFPKIQLSIQRDFAAAVMMHDPSK------ 582
           +GQF  LE ++Y  + + DV  Y SFAL+MLFP+++ S+ R FA A+   D  +      
Sbjct: 415 VGQFAVLECLDYRWYESLDVRLYGSFALLMLFPELEKSVIRAFARAIPHSDAHQRVIGYY 474

Query: 583 MKLLDDGQWVSRKVLGAVPHDIGICDP--WFEVNAYCLYDTARWKDLNPKFVLQVYRDVV 640
             +  D     RKV GA PHD+G  +   W + N  C  D   WKDL   FVLQVYRD +
Sbjct: 475 YTIGADTTTAVRKVPGATPHDLGAPNEHVWEKTNYTCYQDCNLWKDLGSDFVLQVYRDFL 534

Query: 641 ATG--DKKFAKAVWPSVYVAMAYMDQFDRDGDGMIENDGFPDQTYDTWSVSGISAYSGGL 698
            TG  D +F    WP++   + Y+  FD+D DG+ EN G PDQT+D W + G+SAY GGL
Sbjct: 535 LTGADDVEFLTDCWPAIVETLNYLKTFDQDDDGIPENSGAPDQTFDDWRLQGVSAYCGGL 594

Query: 699 WVAALQAASA-----LAREVGDRGS-----EDYFLFKFQKAKVVYEKLWNGSYFNYDNSG 748
           W+AAL+AA A     L + + D G+       Y  +  Q   +  EKLWNG Y+  D+  
Sbjct: 595 WLAALEAAIAISNILLNQNLADLGNLALQKSIYETWLAQSRPIYEEKLWNGQYYRLDS-- 652

Query: 749 SSQSSSIQADQLAGQWYARACGLLPIVDEDKARSALKKVYNYNVLKVMGGKRGAVNGMLP 808
            S S  + ADQL GQ+YAR  GL  IV  D+A SAL+ VY    LK   G+ GA NG+ P
Sbjct: 653 KSGSDVVMADQLCGQFYARLLGLPDIVASDRALSALQTVYEACFLKFYDGQFGAANGVRP 712

Query: 809 DGRVDM-SSMQSREIWSGVTYAVAASMIHEDLADIGFQTACGIYEAAWSGTGLGYAFQTP 867
           DG  +   +    E+W+G+ + +AA ++   + D   +    + +  +     G  F+TP
Sbjct: 713 DGSPENPQATHPLEVWTGINFGLAAFLVQMGMQDEALKLTQAVVKQIYDN---GLQFRTP 769

Query: 868 EAWNTDDQYRSLCYMRPLAIWAM 890
           EA  T   +R+  Y+R +AIWA+
Sbjct: 770 EAITTVGTFRASTYLRAMAIWAI 792



 Score =  201 bits (510), Expect = 2e-48,   Method: Compositional matrix adjust.
 Identities = 134/386 (34%), Positives = 192/386 (49%), Gaps = 38/386 (9%)

Query: 95  HGVPLGGVGSGSIGRSYRGEFQRWQIFPRVCEDKPVLANQFSVFVSR-SNGQKYSSVLCP 153
           HG+PLGG G+G IGRS RG+F  W I       + + A QFSVF S  ++ Q Y+  L  
Sbjct: 40  HGMPLGGFGAGCIGRSSRGDFNLWHIDGGEHTFQNIPACQFSVFESHGTSSQAYA--LST 97

Query: 154 KTPEVLKDTTAAGIGSWDWNLKGDKSTYHALYPRAWTVHEGEPDPELRIVCRQISPIIPH 213
           + PE   D + AG   W W       TYHALYPR+W V+E     +L   C Q SP+  +
Sbjct: 98  QAPE---DGSLAG---WKW-YPASGGTYHALYPRSWFVYENVFQAQL--TCEQFSPVWAN 148

Query: 214 NYKESSYPVSVFTYTIYNSGKTSADITLLFTWTNSVG-------------GDSEFTGQHY 260
           NY+E+SYPV++F +  +N       ++++ TW N VG              D       Y
Sbjct: 149 NYQETSYPVAIFLWKAHNPTDAPITLSIMLTWENMVGWFTNALKSPEVRIRDDGSPVYEY 208

Query: 261 NSKTKMNDGVHAVLLHH--RTSHQLPPVTFALAAQETDGV-------HVSLCPHFVISGN 311
             +   + G +  L+ +  +    L  V    +  E DG        H  +        N
Sbjct: 209 QPRLGESQGNYNQLIENSGQVGCFLSQVDRDASVAEGDGSWCIATLKHPQIEVFHHTRWN 268

Query: 312 SLGLTAKDMWHEIKEHGSFDRLNSMETSVTSEPGSSIGAAIAASVTVPPDSEGQVTFSLA 371
            +G T +D+W      GS   L + + +  +   + IG AIA   T+ P    ++ FSLA
Sbjct: 269 PVG-TGEDVWQSFAADGS---LPNYQDATPAIENTRIGGAIAIRFTLQPGQTIEIPFSLA 324

Query: 372 WDCPEVNFMSGKTYYRRYTKFYGTHQNAAANIARDAILEHGSWELQIEAWQRPILEDKRL 431
           WD P   F +G  YYRRYT F+G     A  IA  A+ E+ +W  QI+ WQ+PILE + L
Sbjct: 325 WDFPVTEFAAGVNYYRRYTDFFGRSGENAWAIASTALQEYQNWRSQIQRWQKPILEREDL 384

Query: 432 PEWYPITLFNELYYLNAGGAVWTDGS 457
           P+W+ + LFNELY L +GG +W+  S
Sbjct: 385 PQWFKMALFNELYDLTSGGTLWSAAS 410


>gi|172036524|ref|YP_001803025.1| hypothetical protein cce_1609 [Cyanothece sp. ATCC 51142]
 gi|354553307|ref|ZP_08972614.1| Glucosylceramidase [Cyanothece sp. ATCC 51472]
 gi|171697978|gb|ACB50959.1| unknown [Cyanothece sp. ATCC 51142]
 gi|353555137|gb|EHC24526.1| Glucosylceramidase [Cyanothece sp. ATCC 51472]
          Length = 818

 Score =  239 bits (609), Expect = 8e-60,   Method: Compositional matrix adjust.
 Identities = 140/385 (36%), Positives = 206/385 (53%), Gaps = 28/385 (7%)

Query: 529 IGQFLYLEGIEYLMWNTYDVHFYSSFALIMLFPKIQLSIQRDFAAAVMMHDPSKMKLLDD 588
           IGQF  LE ++Y  + + DV  Y SF L+ML+P+++ ++   FA A+   D +   +  +
Sbjct: 431 IGQFGVLESMDYRWYESLDVRLYGSFGLLMLWPRLEKAVMEAFARAIPNSDDTVRIIGYN 490

Query: 589 GQWVSRKVLGAVPHDIGICD--PWFEVNAYCLYDTARWKDLNPKFVLQVYRDVVATG--D 644
                RK  GA PHD+G  +  PW + N     D   WKDL   FVLQVYRD + TG  D
Sbjct: 491 QAEGIRKTKGATPHDLGAPNEHPWEKTNYTSYQDCNLWKDLGSDFVLQVYRDFLLTGSDD 550

Query: 645 KKFAKAVWPSVYVAMAYMDQFDRDGDGMIENDGFPDQTYDTWSVSGISAYSGGLWVAALQ 704
            +F    W ++   + Y+  FD D DG+ EN G PDQT+D W + GISAY G LW+AAL+
Sbjct: 551 IEFLWECWEAIPETLNYLKAFDLDNDGIPENSGAPDQTFDDWELRGISAYCGALWIAALE 610

Query: 705 AASALAREVGDRGSE-----------------DYFLFKFQKAKVVYEK-LWNGSYFNYDN 746
           AA  +   + +   +                 D +    Q+++ +Y   LWNG Y+  D+
Sbjct: 611 AAIKIGEILLENAPQNPQLEPENYPKSIEKELDKYQQWLQQSRAIYHSTLWNGEYYKLDS 670

Query: 747 SGSSQSSSIQADQLAGQWYARACGLLPIVDEDKARSALKKVYNYNVLKVMGGKRGAVNGM 806
              S S  + ADQL+GQ+YAR  GL  +V++  A SALKKVY    LK   GK GA NGM
Sbjct: 671 --ESGSDVVMADQLSGQFYARLLGLPDVVEQQYALSALKKVYEACFLKFQNGKYGAANGM 728

Query: 807 LPDGRV-DMSSMQSREIWSGVTYAVAASMIHEDLADIGFQTACGIYEAAWSGTGLGYAFQ 865
            PDG   D +S   +E+W+G+ + +A+ +I   + D   +    + +  +     G  F+
Sbjct: 729 KPDGTPEDPNSTHPQEVWTGINFGLASFLIQMGMKDEALKLTEAVVKQVYEN---GLQFR 785

Query: 866 TPEAWNTDDQYRSLCYMRPLAIWAM 890
           TPEA      +R+  Y+R +AIW +
Sbjct: 786 TPEAITAVGTFRACHYLRAMAIWGV 810



 Score =  201 bits (511), Expect = 2e-48,   Method: Compositional matrix adjust.
 Identities = 133/389 (34%), Positives = 181/389 (46%), Gaps = 43/389 (11%)

Query: 95  HGVPLGGVGSGSIGRSYRGEFQRWQIFPRVCEDKPVLANQFSVFVSRSNGQKYSSVLCPK 154
           HG+P+GG GSG IGRS RG+F  W +       K + + QFS+F    NG   +  L  +
Sbjct: 55  HGMPIGGFGSGCIGRSPRGDFNLWHLDGGEHIFKSIPSCQFSIFEQPENGTAQAYALATE 114

Query: 155 TPEVLKDTTAAGIGSWDWNLKGDKSTYHALYPRAWTVHEGEPDPELRIVCRQISPIIPHN 214
            PE   D T   +  W W    +K TY ALYP +W  +EG    E  I C Q SPIIP N
Sbjct: 115 PPE---DNT---LSRWAW-YPTEKGTYSALYPCSWYQYEGVFQAE--IYCEQFSPIIPDN 165

Query: 215 YKESSYPVSVFTYTIYNSGKTSADITLLFTWTNSVG----------------GDSEFTGQ 258
           Y+E+SYP+ VF +T+ N       ++++ TW N VG                G  E+  Q
Sbjct: 166 YQETSYPIGVFEWTVKNPTDKPITLSIMLTWQNIVGWFTNAIKSPEITVRDDGSPEYEYQ 225

Query: 259 --------HYNSKTKMNDGVHAVL--LHHRTSHQLPPVTFALAAQETDGVHVSLCPHFVI 308
                   ++N   + N  V  +L  L      Q       +A+     V V     +  
Sbjct: 226 PRWGHSTGNFNQWIQDNFRVGFILNRLQPHQQVQEGEGQICIASVTNPSVEVFYLGKWNP 285

Query: 309 SGNSLGLTAKDMWHEIKEHGSFDRLNSMETSVTSEPGSSIGAAIAASVTVPPDSEGQVTF 368
           +G+       ++W     +GS   L   E    +EPG  I AA+A   TV P    ++ F
Sbjct: 286 NGD-----GSEVWDHFAMNGS---LPDQEDETPAEPGEQIAAAMAIRFTVKPGRVKKIPF 337

Query: 369 SLAWDCPEVNFMSGKTYYRRYTKFYGTHQNAAANIARDAILEHGSWELQIEAWQRPILED 428
            LAWD P   F  G  YYRRYT F+G +      + R A+     W  +IE WQ PIL  
Sbjct: 338 ILAWDLPVTEFAQGIQYYRRYTDFFGRNGQNGWAMVRTALKHSDVWREKIEEWQNPILNR 397

Query: 429 KRLPEWYPITLFNELYYLNAGGAVWTDGS 457
             LP+W+ + LFNELY L  GG +WT  S
Sbjct: 398 DDLPDWFKMALFNELYLLTDGGTLWTAAS 426


>gi|332711224|ref|ZP_08431157.1| putative bile acid beta-glucosidase [Moorea producens 3L]
 gi|332350038|gb|EGJ29645.1| putative bile acid beta-glucosidase [Moorea producens 3L]
          Length = 815

 Score =  238 bits (608), Expect = 8e-60,   Method: Compositional matrix adjust.
 Identities = 145/385 (37%), Positives = 208/385 (54%), Gaps = 28/385 (7%)

Query: 529 IGQFLYLEGIEYLMWNTYDVHFYSSFALIMLFPKIQLSIQRDFAAAVMMHDPSKMKLLDD 588
           IGQF  LE ++Y  + + DV  Y SFAL ML+PK+  ++   FA A+   D +   +  +
Sbjct: 427 IGQFGVLECLDYRWYESLDVRLYGSFALAMLWPKLDKAVLEAFARAIPTSDETLRIIGYN 486

Query: 589 GQWVSRKVLGAVPHDIGICD--PWFEVNAYCLYDTARWKDLNPKFVLQVYRDVV--ATGD 644
                RK+  A PHD+G  +  PW + N     D  +WKDL   FVLQVYRD V  +  D
Sbjct: 487 QAQAVRKIADATPHDLGAPNEHPWQQTNYTSYQDCNQWKDLASDFVLQVYRDFVFYSATD 546

Query: 645 KKFAKAVWPSVYVAMAYMDQFDRDGDGMIENDGFPDQTYDTWSVSGISAYSGGLWVAALQ 704
            +F    W S+  A+AY+ +FD+DGDG+ EN G PDQT+D W + G+SAY GGLW+AAL+
Sbjct: 547 IEFLWECWSSIVKALAYLKRFDQDGDGIPENYGAPDQTFDDWRLQGVSAYCGGLWLAALE 606

Query: 705 AASALAR-------------EVGDRGS-----EDYFLFKFQKAKVVYEKLWNGSYFNYDN 746
           +A A+               EV +  S     + Y  +  Q   +  EKLWNG Y+  D+
Sbjct: 607 SAIAIGEILTKNYPPTSPFVEVPELDSIQDTIDCYRQWLAQAMPIYQEKLWNGQYYQLDS 666

Query: 747 SGSSQSSSIQADQLAGQWYARACGLLPIVDEDKARSALKKVYNYNVLKVMGGKRGAVNGM 806
              S S  + ADQL GQ+YA+  GL  IV  +   SALK VY+   LK   G+ GAVNG+
Sbjct: 667 --ESGSDVVMADQLCGQFYAQLLGLPDIVPPECTLSALKTVYDACFLKFHQGQLGAVNGV 724

Query: 807 LPDGRV-DMSSMQSREIWSGVTYAVAASMIHEDLADIGFQTACGIYEAAWSGTGLGYAFQ 865
            PDG   D  +    E+W+G+ + +AA +I + + D   +    +    +     G  F+
Sbjct: 725 RPDGTPEDPDATHPMEVWTGINFGLAAFLIQQGMKDEALRITEAVVRQVYDH---GLQFR 781

Query: 866 TPEAWNTDDQYRSLCYMRPLAIWAM 890
           TPEA      +R+  Y+R + IWA+
Sbjct: 782 TPEAITAAGTFRASHYLRAMGIWAV 806



 Score =  201 bits (510), Expect = 2e-48,   Method: Compositional matrix adjust.
 Identities = 137/397 (34%), Positives = 184/397 (46%), Gaps = 55/397 (13%)

Query: 95  HGVPLGGVGSGSIGRSYRGEFQRWQIFPRVCEDKPVLANQFSVFVSRSNGQKYSSVLCPK 154
           HG+PLGG G+G IGRS RG+F  W +       K + A QFSVF    +    +  LC +
Sbjct: 41  HGMPLGGFGAGCIGRSPRGDFNLWHLDGGEHIFKSLPACQFSVFEQPEHSLAQAYALCTQ 100

Query: 155 TPEVLKDTTAAGIGSWDWNLKGDKST---------YHALYPRAWTVHEGEPDPELRIVCR 205
            PE  K      +GSW+W     KS          Y+ALYPR+W  ++     EL   C 
Sbjct: 101 PPEDGK------LGSWEWYPTQGKSEENSSEVSGHYYALYPRSWFTYQNVFQTEL--TCE 152

Query: 206 QISPIIPHNYKESSYPVSVFTYTIYNSGKTSADITLLFTWTNSVGGDSEFTGQHYNSKTK 265
           Q SPI    Y+ESSYPV+VF +T +N   T   ++++ TW N VG    FT    N + +
Sbjct: 153 QFSPIWAGCYQESSYPVAVFEWTAHNPTDTPITVSIMLTWQNMVGW---FTNGIKNPEVR 209

Query: 266 MNDGVHAV------------------LLHHRTSHQLPPVTFALAAQETDGVHVSLCPHFV 307
           + D    V                  + H+R    L  V      +E +G    LC   V
Sbjct: 210 VRDDGSPVYEYQPRWGDSTGNLNQWIVDHYRVGFVLDRVRLGDDIKEGEG---QLCLATV 266

Query: 308 ISG----------NSLGLTAKDMWHEIKEHGSFDRLNSMETSVTSEPGSSIGAAIAASVT 357
            +           N +G  A ++W      GS   L   +    + PG  I  AIA   T
Sbjct: 267 TNPSMEVFYHGRWNPVGDGA-EVWDTFATDGS---LVDHQDETPAAPGEQIACAIAIRFT 322

Query: 358 VPPDSEGQVTFSLAWDCPEVNFMSGKTYYRRYTKFYGTHQNAAANIARDAILEHGSWELQ 417
           + P    ++ F LAWD P   F  G  YYRRYT F+G + N    + R A+  H  W  Q
Sbjct: 323 LRPGKTRKIPFILAWDFPVTEFKLGINYYRRYTDFFGRNGNNGWPMVRTALKHHDLWREQ 382

Query: 418 IEAWQRPILEDKRLPEWYPITLFNELYYLNAGGAVWT 454
           IE+WQ PIL  + LP W+ + LFNELY L  GG +WT
Sbjct: 383 IESWQEPILNQEDLPNWFKMALFNELYLLTDGGTLWT 419


>gi|12005896|gb|AAG44660.1|AF258662_1 AD035 [Homo sapiens]
          Length = 367

 Score =  238 bits (608), Expect = 9e-60,   Method: Compositional matrix adjust.
 Identities = 153/421 (36%), Positives = 215/421 (51%), Gaps = 71/421 (16%)

Query: 242 LFTWTNSVGGDSEFTGQHYNSK---TKMNDGVHAVLLHHRTSHQLP-PVTFALAAQETDG 297
           +F+  N +GG  +  G  +N      +  + V  +LLHH T   LP P T A+AA+ T  
Sbjct: 1   MFSMRNGLGGGDDAPGGLWNEPFCLERSGETVRGLLLHHPT---LPNPYTMAVAARVTAA 57

Query: 298 VHVSLCPHFVISGNSLGLTAKDMWHEIKEHGSFDRLNSMETSVTSEPGSSIGAAIAASVT 357
             V+      I+      T + +W ++ + G  D  +    S  ++ G  I  A+  S  
Sbjct: 58  TTVT-----HITAFDPDSTGQQVWQDLLQDGQLD--SPTGQSTPTQKGVGIAGAVCVSSK 110

Query: 358 VPPDSEGQVTFSLAWDCPEVNF-MSGKTYYRRYTKFYGTHQNAAANIARDAILEHGSWEL 416
           + P  + ++ FSLAWD P + F   G+ +YRRYT+F+G   +AA  ++  A+  +  WE 
Sbjct: 111 LRPRGQCRLEFSLAWDMPRIMFGAKGQVHYRRYTRFFGQDGDAAPALSHYALCRYAEWEE 170

Query: 417 QIEAWQRPILEDKRLPEWYPITLFNELYYLNAGGAVWTDGSPPVHSLVTIGHRKFSLDWS 476
           +I AWQ P+L+D+ LP WY   LFNELY+L  GG VW +                     
Sbjct: 171 RISAWQSPVLDDRSLPAWYKSALFNELYFLADGGTVWLE--------------------- 209

Query: 477 QSDLKRIVDVPNQNDTAVNILERMSSILEQIYTPVALNSSFGTNL--LQDGEENIGQFLY 534
                                     +LE      +L    G N+  L+    + G+F Y
Sbjct: 210 --------------------------VLED-----SLPEELGRNMCHLRPTLRDYGRFGY 238

Query: 535 LEGIEYLMWNTYDVHFYSSFALIMLFPKIQLSIQRDFAAAVMMHDPSKMKLLDDGQWVSR 594
           LEG EY M+NTYDVHFY+SFALIML+PK++LS+Q D A A +  D ++ + L  G     
Sbjct: 239 LEGQEYRMYNTYDVHFYASFALIMLWPKLELSLQYDMALATLREDLTRRRYLMSGVMAPV 298

Query: 595 KVLGAVPHDIGICD--PWFEVNAYCLYDTARWKDLNPKFVLQVYRDVVATGDKKFAKAVW 652
           K    +PHDIG  D  PW  VNAY ++DTA WKDLN KFVLQVYRD   TGD+ F K +W
Sbjct: 299 KRRNVIPHDIGDPDDEPWLRVNAYLIHDTADWKDLNLKFVLQVYRDYYLTGDQNFLKDMW 358

Query: 653 P 653
           P
Sbjct: 359 P 359


>gi|428770169|ref|YP_007161959.1| glucosylceramidase [Cyanobacterium aponinum PCC 10605]
 gi|428684448|gb|AFZ53915.1| Glucosylceramidase [Cyanobacterium aponinum PCC 10605]
          Length = 802

 Score =  238 bits (608), Expect = 1e-59,   Method: Compositional matrix adjust.
 Identities = 148/406 (36%), Positives = 213/406 (52%), Gaps = 30/406 (7%)

Query: 516 SFGTNLLQDGEEN--IGQFLYLEGIEYLMWNTYDVHFYSSFALIMLFPKIQLSIQRDFAA 573
           S G +L     EN  +GQF  LE I+Y  + + DV  Y SF LIM++P++  SI   F+ 
Sbjct: 401 SEGGSLWTAATENDPVGQFGVLECIDYRWYESLDVRLYGSFPLIMMWPRLDKSIMEAFSR 460

Query: 574 AVMMHDPSKMKLLDDGQWVSRKVLGAVPHDIGICD--PWFEVNAYCLYDTARWKDLNPKF 631
           A+   D +   +  +     RK  GA PHD+G  +  PW + N     D   WKDL   F
Sbjct: 461 AIPTEDKTPRIIGYNQTQAIRKAKGATPHDLGAPNEHPWEKSNYTSYQDCNLWKDLGCDF 520

Query: 632 VLQVYRDVVATG--DKKFAKAVWPSVYVAMAYMDQFDRDGDGMIENDGFPDQTYDTWSVS 689
           VL VYRD + TG  D+ F    W SV   + Y+  FD DGDG+ EN G PDQT+D W + 
Sbjct: 521 VLLVYRDYLFTGGLDQDFLWECWDSVVETLHYVKTFDLDGDGIPENSGAPDQTFDDWRLQ 580

Query: 690 GISAYSGGLWVAALQAASALAREV-----------------GDRGSEDYFLFKFQKAKVV 732
           GISAY GGLW+A L+AA A+   +                 G + +   F     +A+ +
Sbjct: 581 GISAYCGGLWIAGLEAAIAIGNTLIANPPMNPSLQPDDYPQGIKDAIAVFQQWLTQARSL 640

Query: 733 Y-EKLWNGSYFNYDNSGSSQSSSIQADQLAGQWYARACGLLPIVDEDKARSALKKVYNYN 791
           Y E LWNG Y+  D+   S S  + ADQL GQ+YA+  GL  +V+    +S L+KVY+  
Sbjct: 641 YHETLWNGEYYRLDS--ESGSDVVMADQLCGQFYAQLLGLPDVVESKYVKSTLRKVYDAC 698

Query: 792 VLKVMGGKRGAVNGMLPDGR-VDMSSMQSREIWSGVTYAVAASMIHEDLADIGFQTACGI 850
            LK   GK G  NG+ PDG+ V  +     E+W+G+ Y + A MI   +   G + A  +
Sbjct: 699 FLKFHDGKFGIANGVKPDGKPVKENDTHPLEVWTGINYGIVAFMILNGMKQEGLRVAETV 758

Query: 851 YEAAWSGTGLGYAFQTPEAWNTDDQYRSLCYMRPLAIWAMQWALTR 896
            +  +     G  F+TPEA      +R+  Y+R +A+WA+ ++L +
Sbjct: 759 VKQVYEN---GLQFRTPEAITAVGTFRASHYLRAMAVWAIYYSLEK 801



 Score =  177 bits (450), Expect = 2e-41,   Method: Compositional matrix adjust.
 Identities = 129/387 (33%), Positives = 187/387 (48%), Gaps = 41/387 (10%)

Query: 93  SSHGVPLGGVGSGSIGRSYRGEFQRWQIFPRVCEDKPVLANQFSVFVSRSNGQKYSSVLC 152
           ++HG+PLGG G G +GR   GEF  W I       + + A QFSVF            + 
Sbjct: 38  ANHGMPLGGFGGGCVGRGVNGEFNLWHIDGGNHIYQSLPACQFSVFEQVEGEPPQVYAMS 97

Query: 153 PKTPEVLKDTTAAGIGSWDWNLKGDKSTYHALYPRAWTVHEGEPDPELRIVCRQISPIIP 212
             +P      T   + SW W  +G K TY ALYPR+W  ++G    + R++C Q SPI  
Sbjct: 98  TNSP------TDGSLSSWAWYPQG-KGTYSALYPRSWFDYQGVF--KSRLICEQFSPIWA 148

Query: 213 HNYKESSYPVSVFTYTIYNSGKTSADITLLFTWTNSVG------------------GDSE 254
            +Y+ESSYP++VF ++++N       ++++FTW NSVG                   + E
Sbjct: 149 GSYQESSYPLAVFEWSVHNPTDKEITLSIMFTWQNSVGWFTNAIKSPTVKVRDDGSPEYE 208

Query: 255 FTGQHYNSKTKMNDGVHAVLLHHRTSHQLPPVTFALAAQETDG-------VHVSLCPHFV 307
           +  +  +S   +N  +     + R       +      QE +G        + SL   + 
Sbjct: 209 YRPKWGDSTGNLNQWIQD---NFRVGCLFDRIRMEEQPQEGEGQWAISSITNPSLEVFYH 265

Query: 308 ISGNSLGLTAKDMWHEIKEHGSFDRLNSMETSVTSEPGSSIGAAIAASVTVPPDSEGQVT 367
              N  G  A D+W     +GS   L   +    +EPG  I  A+A   TV P    ++ 
Sbjct: 266 SRWNPKGDGA-DIWDYFAMNGS---LPDYQDETPAEPGEQIAGAMAIRFTVKPGKTKKIP 321

Query: 368 FSLAWDCPEVNFMSGKTYYRRYTKFYGTHQNAAANIARDAILEHGSWELQIEAWQRPILE 427
           F LAWD P   F  G TYYRRYT F+G +   A  I R A+  H  W+ +I+ WQ+PIL+
Sbjct: 322 FILAWDFPVTEFAQGITYYRRYTDFFGRNGRNAWAIVRTALKHHDMWQNRIKEWQKPILD 381

Query: 428 DKRLPEWYPITLFNELYYLNAGGAVWT 454
              LPEW  + LFNELY L+ GG++WT
Sbjct: 382 RGDLPEWLKMALFNELYLLSEGGSLWT 408


>gi|78185056|ref|YP_377491.1| hypothetical protein Syncc9902_1489 [Synechococcus sp. CC9902]
 gi|78169350|gb|ABB26447.1| conserved hypothetical protein [Synechococcus sp. CC9902]
          Length = 823

 Score =  238 bits (607), Expect = 1e-59,   Method: Compositional matrix adjust.
 Identities = 223/855 (26%), Positives = 358/855 (41%), Gaps = 144/855 (16%)

Query: 94  SHGVPLGGVGSGSIGRSYRGEFQRWQIFPRVCEDKPVLANQFSVFVSRSNGQKYSSVLCP 153
           +HG+PLGG+G+G  GR+  G    W +         +   QFS++  + +  +  ++   
Sbjct: 44  NHGMPLGGLGAGCFGRAPDGNTNLWHLDGGEHWFGTLPDCQFSLWEKQGDDVRAHALAVA 103

Query: 154 KTPEVLKDTTAAGIGSWDW----NLKGDKSTYHALYPRAWTVHEGEPDPELR--IVCRQI 207
              +  +   AA + +W W      K    TY A YP +WT ++G      R  ++C   
Sbjct: 104 PEGDASRPDAAASLPAWSWYPASTNKRCTGTYAARYPLSWTHYDGV----FRSGVLCEAF 159

Query: 208 SPIIPHNYKESSYPVSVFTYTIYNSGKTSADITLLFTWTNSVG----------------G 251
           SPI+P +Y+ SSYPV+VF + + N      +++L+ +W N+VG                G
Sbjct: 160 SPILPGDYQRSSYPVAVFCWQLANPTDQPLELSLMVSWRNTVGWFTNTDASAEVHFRDDG 219

Query: 252 DSEFT-------GQHYNSKTKMNDGVHAVLLHHRTSH--QLPPVTFALAA-QETDGVHVS 301
             E         G    ++     G+  VL+  R+S   Q     + LA   +  GV V 
Sbjct: 220 SPEHNYSPAIGRGAGQRNRAHSESGLTGVLMEGRSSEPIQEGEGQWCLAVPDDLAGVEVL 279

Query: 302 LCPHFVISGNSLGLTAKDMWHEIKEHGSFDRLNSMETSVTSEPGSSIGAAIAASVTVPPD 361
            C  +  SG+       D+W      G   R+   +    S  G    AA+     + P 
Sbjct: 280 RCSRWDPSGD-----GSDIWTPFSAEG---RIPESDNDRESRAGEQASAAMVVKFILAPG 331

Query: 362 SEGQVTFSLAWDCPEVNFMSGKTYYRRYTKFYGTHQNAAANIARDAILEHGSWELQIEAW 421
              ++   +                                          SW+L I A+
Sbjct: 332 ESREIPVVI------------------------------------------SWDLPITAF 349

Query: 422 QRPILEDKRLPEWYPITLFNELYYLNAGGAVWTDGSPPVHSLVTIGHRKFSLDWSQSDLK 481
                + +R  +++     N           W+D    + +            W Q  L+
Sbjct: 350 ASGCADRRRYTDFFGSDGRNAAAIAAEALRDWSDWRGRIEA------------WQQPVLE 397

Query: 482 RIVDVPNQNDTAVNILERMSSILEQIYTPVALNSSFGTNLLQDGEENIGQFLYLEGIEYL 541
           R          A+    RM+ +  ++Y   +  S +     QD     G+F  LE ++Y 
Sbjct: 398 R---------KALPEPVRMA-LFNELYDLCSGGSLWTAATAQD---PYGRFGVLECLDYA 444

Query: 542 MWNTYDVHFYSSFALIMLFPKIQLSIQRDFAAAVMMHDPSKMKL---LDDGQW---VSRK 595
            + + DV  Y S AL+ L+P++  ++ R F+ A+  HD ++  +      G+      RK
Sbjct: 445 WYESLDVRLYGSLALLQLWPELDKAVLRSFSRAIPAHDATQRPIGWYFTQGRGRVEADRK 504

Query: 596 VLGAVPHDIGICDPW-FEVNAYCLY-DTARWKDLNPKFVLQVYR--DVVATG-DKKFAKA 650
           V GA PHD+G  +   F+   Y  Y D   WKDL   FVLQV+R   +  TG D  F   
Sbjct: 505 VAGATPHDLGAPNELPFDATNYTAYQDCNLWKDLASDFVLQVWRTFQLAPTGEDLNFLAD 564

Query: 651 VWPSVYVAMAYMDQFDRDGDGMIENDGFPDQTYDTWSVSGISAYSGGLWVAA-------- 702
            WPS   A+ Y+  FD + DG+ +N G PDQT+D W + G+SAY G LW+AA        
Sbjct: 565 CWPSAVRALDYLKGFDVNDDGLPDNGGAPDQTFDDWPLKGVSAYCGALWIAALEAALAMA 624

Query: 703 --LQAASALAREVGDRGSEDYFLFKFQKAKVVYEK-LWNGSYFNYDNSGSSQSSSIQADQ 759
             LQ    L   V  R     F    ++++  +++ LWNG Y+  D    S +  + ADQ
Sbjct: 625 QRLQLDLGLDTTVEQR----RFGQWLEQSRSNFDRLLWNGEYYEID--AESGTPVVMADQ 678

Query: 760 LAGQWYARACGLLPIVDEDKARSALKKVYNYNVLKVMGGKRGAVNGMLPDGR-VDMSSMQ 818
           L G +YAR  GL  +V +  A S L  V +    +  GG  G  NG+  DG  +D +   
Sbjct: 679 LCGDFYARLLGLPSVVSDANALSTLNVVRDACFERFEGGTLGVANGLRRDGTPLDPNGTH 738

Query: 819 SREIWSGVTYAVAASMIHEDLADIGFQTACGIYEAAWSGTGLGYAFQTPEAWNTDDQYRS 878
             E+W+G+ + +A+       +D        +    +SG   G  F+TPEA    + +R+
Sbjct: 739 PLEVWTGINFGIASYFRLMGQSDTALAITGAVVNQVYSG---GLQFRTPEAITAVNTFRA 795

Query: 879 LCYMRPLAIWAMQWA 893
             Y+R +AIW + WA
Sbjct: 796 CHYLRAMAIWGL-WA 809


>gi|195397814|ref|XP_002057523.1| GJ18180 [Drosophila virilis]
 gi|194141177|gb|EDW57596.1| GJ18180 [Drosophila virilis]
          Length = 869

 Score =  238 bits (606), Expect = 1e-59,   Method: Compositional matrix adjust.
 Identities = 185/623 (29%), Positives = 290/623 (46%), Gaps = 108/623 (17%)

Query: 77  KGKRAFIDPFIKRHLTSSHGVPLGGVGSGSIGRSYRGEFQRWQIFPRVCEDKPVLANQFS 136
           +G+R ++D +   +    +GVP+GG+G GSIGR Y GEF R+Q+ P + E   V ANQF 
Sbjct: 13  EGRRVYMDYYYMENGKQIYGVPIGGIGGGSIGRGYAGEFCRFQMRPGIYEYNVVHANQFI 72

Query: 137 VFVSRSNGQK-YSSVLCPKTPEVLKDTTAAG--------------------------IGS 169
           V +    G   + S+L   + +  K  T                             + +
Sbjct: 73  VTIKDHKGCTIFQSLLSRCSTKSFKAKTNNNDAEADEADAQKTKCQLPNCSSRPRQPLSA 132

Query: 170 WDWNLKGDKSTYHALYPRAWT----VHEGEPDPELRIVCRQISPIIPHNYKESSYPVSVF 225
           W  N++  + +Y  LYPRAWT     H G     +R++CRQISP+IPH+YKESS P +VF
Sbjct: 133 WHSNIEDSRCSYTGLYPRAWTEYDLSHYG-----VRLICRQISPVIPHDYKESSLPCAVF 187

Query: 226 TYTIYNSGKTSADITLLFTWTNSVGGDSEFTGQHYNSKTKMNDGVHAVLLHHRTSHQLPP 285
            +++ N       +++ FT+ N  G   +    +  S+         V +    +    P
Sbjct: 188 VWSVQNVCDQERKVSITFTFKNGTGNKKQDADGNAESQLISEGNAKGVAIRQNIADM--P 245

Query: 286 VTFALAAQETDGVHVSLCPHFVISGNSLGLTAKDMWHEIKEHGSFDRLNSMETSVTSEPG 345
            ++ LA +    + ++ CP F  +GN      + +W ++KEHG      + ET  + +  
Sbjct: 246 CSYNLACRVLPEISITRCPQFDPAGN-----GEQLWAQLKEHGQLSEQPTAETLRSKD-- 298

Query: 346 SSIGAAIAASVTVPPDSEGQVTFSLAWDCPEVNFMSG-KTYYRRYTKFYGTHQNAAANIA 404
             IG A+ A + + P +  ++ F LAWD P++ F    +T+ R YTK++     +   I 
Sbjct: 299 --IGVAVCAQLALKPQAYHELEFVLAWDMPKIQFPRKLQTHTRYYTKYFDDSGESGPRIC 356

Query: 405 RDAILEHGSWELQIEAWQRPILEDKRLPEWYPITLFNELYYLNAGGAVWTDGSPPVHSLV 464
             A+  + SWE              RL + +   + N+            DG P      
Sbjct: 357 EYALKHYASWE--------------RLIDAWQRPILND------------DGLP------ 384

Query: 465 TIGHRKFSLDWSQSDLKRIVDVPNQNDTAVNILERMSSILEQIYTPVALNSSFGTNL-LQ 523
                    DW +                  I  ++  I +     +  +SS G +L   
Sbjct: 385 ---------DWYK----------------CAIFNQLYFISDGGTIWLKCDSSLGQSLAFD 419

Query: 524 DGEENIGQFLYLEGIEYLMWNTYDVHFYSSFALIMLFPKIQLSIQRDFAAAVMMHDPSKM 583
           D     G+F YLEG EY M+NTYDVHFY+S AL  L+P +Q+S+Q DF  A+        
Sbjct: 420 DPRLAYGRFGYLEGHEYRMYNTYDVHFYASPALAHLWPNLQVSLQYDFKDAIAAELNDTR 479

Query: 584 KLLDDGQWVSRKVLGAVPHDIGICD--PWFEVNAYCLYDTARWKDLNPKFVLQVYRDVVA 641
           K+L DG+ + RKV   VPHD+G  D  P+  +N Y ++D   WKDLN KFVLQVYRD   
Sbjct: 480 KMLYDGKVMPRKVKNCVPHDLGDPDEEPFTLINCYNIHDVNDWKDLNTKFVLQVYRDYYV 539

Query: 642 TGDKKFAKAVWPSVYVAMAYMDQ 664
             +   A+A   S + ++ ++D+
Sbjct: 540 LNELAQAQADNASKFSSIEFIDK 562



 Score =  196 bits (499), Expect = 4e-47,   Method: Compositional matrix adjust.
 Identities = 107/261 (40%), Positives = 156/261 (59%), Gaps = 12/261 (4%)

Query: 647 FAKAVWPSVYVAMAYMDQFDRDGDGMIENDGFPDQTYDTWSVSGISAYSGGLWVAALQAA 706
           + KA++ +    M    ++D+D DG+IEN   PDQTYD+W + G SAY  GLW+AALQ  
Sbjct: 608 YLKAMYGACKAIMERTIEYDKDNDGLIENTKMPDQTYDSWVMDGPSAYCAGLWLAALQTM 667

Query: 707 SALAREVGDRGSEDYFLFKFQKAK-VVYEKLWNGSYFNYDNSGSSQSSSIQADQLAGQWY 765
           S +A  +        +    +K K  + EKLWNGSY+ +D S  +   +I ADQL G WY
Sbjct: 668 SVMATLLDQPNDCLRYQDILEKGKHSLEEKLWNGSYYRFDLS-HNHRDTIMADQLCGHWY 726

Query: 766 ARACGL-LPIVDEDKARSALKKVYNYNVLKVMGGKRGAVNGMLPD-------GRVDMSSM 817
            ++CG    I  ++  R+ALK++Y+ NV+    G  GA NG + +       G VD S++
Sbjct: 727 LKSCGFDYEIYPKENVRTALKRIYDNNVMGFHEGNIGAANGFIANASEPSKPGHVDNSNI 786

Query: 818 QSREIWSGVTYAVAASMIHEDLADIGFQTACGIYEAAWSGTGLGYAFQTPEAWNTDDQYR 877
           Q+ E+W GV YA+AA+MI E + +  FQTA G+Y+      G+ Y  +TPEA   + +YR
Sbjct: 787 QAEEVWPGVVYALAATMIQEGMFEEAFQTAGGMYKTISQRIGMNY--ETPEALYGEKRYR 844

Query: 878 SLCYMRPLAIWAMQWALTRPK 898
           S+ YMRPL+IW+MQ A  R +
Sbjct: 845 SIGYMRPLSIWSMQVAWERRR 865


>gi|22297737|ref|NP_680984.1| hypothetical protein tlr0193 [Thermosynechococcus elongatus BP-1]
 gi|22293914|dbj|BAC07746.1| tlr0193 [Thermosynechococcus elongatus BP-1]
          Length = 806

 Score =  238 bits (606), Expect = 1e-59,   Method: Compositional matrix adjust.
 Identities = 146/378 (38%), Positives = 216/378 (57%), Gaps = 19/378 (5%)

Query: 526 EENIGQFLYLEGIEYLMWNTYDVHFYSSFALIMLFPKIQLSIQRDFAAAVMMHDPSKMKL 585
            + +GQF  LE ++Y  + + DV  Y SFAL+ L+P+++ S+ R FA A+   DP+ +++
Sbjct: 425 RDPVGQFAVLECLDYRWYESLDVRLYGSFALLQLWPELEKSVMRAFARAIPTADPT-LRI 483

Query: 586 L--------DDGQWVSRKVLGAVPHDIGICD--PWFEVNAYCLYDTARWKDLNPKFVLQV 635
           +        +      RK+  AVPHD+G  +  PW + N     D   WKDL   FVL V
Sbjct: 484 IGYFYRGDPETAYKAPRKLANAVPHDLGAPNEHPWEKTNYTAYQDCNLWKDLASDFVLLV 543

Query: 636 YRDVVATG--DKKFAKAVWPSVYVAMAYMDQFDRDGDGMIENDGFPDQTYDTWSVSGISA 693
           YRD + TG  D  FA+  WP+V  A+ ++ QFD+DGDG+ EN G PDQTYD W + G+SA
Sbjct: 544 YRDFLFTGGTDLNFARECWPAVVAALDHLKQFDQDGDGLPENGGAPDQTYDDWKLQGVSA 603

Query: 694 YSGGLWVAALQAASALAREVGDRGSEDYFLFKFQKAKVVYEKLWNGSYFNYDNSGSSQSS 753
           Y GGLW+AAL+AA AL   +     E Y  +  Q     ++ LWNG Y+  D    S S 
Sbjct: 604 YCGGLWLAALEAAIALGTLLQQPQVEIYRQWLSQARPRYHQLLWNGEYYRLDT--GSGSD 661

Query: 754 SIQADQLAGQWYARACGLLPIVDEDKARSALKKVYNYNVLKVMGGKRGAVNGMLPDGRVD 813
            I ADQL GQ+YA+  GL+ IV  D    AL+K+Y+   LK   G+ GA NG+LP+G+ +
Sbjct: 662 VIMADQLCGQFYAQLLGLVDIVPPDCCDRALRKIYDTCFLKFHNGQFGAANGLLPNGQPE 721

Query: 814 MS-SMQSREIWSGVTYAVAASMIHEDLADIGFQTACGIYEAAWSGTGLGYAFQTPEAWNT 872
              +    E+W+G+ + +AA +    + D  ++ A  +    +     G  F+TPEA   
Sbjct: 722 NPHATHPLEVWTGINFGLAAFLWQRGMIDEAWRLAEVVVRQIYEN---GLQFRTPEAITA 778

Query: 873 DDQYRSLCYMRPLAIWAM 890
           +  +R+  Y+RP+AIWA+
Sbjct: 779 NGTFRACMYLRPMAIWAL 796



 Score =  171 bits (433), Expect = 2e-39,   Method: Compositional matrix adjust.
 Identities = 127/395 (32%), Positives = 176/395 (44%), Gaps = 59/395 (14%)

Query: 95  HGVPLGGVGSGSIGRSYRGEFQRWQIFPRVCEDKPVLANQFSVFVSRSNGQKYSSVLCPK 154
           +G+PLGG G+G IGRS  G+F  W +         + A QFS+F      Q Y+    PK
Sbjct: 56  YGMPLGGFGAGCIGRSSAGDFNLWHVDGGEHIFGTLPACQFSLFEQGEQTQAYALGSAPK 115

Query: 155 TPEVLKDTTAAGIGSWDWNLKGDKSTYHALYPRAWTVHEGEPDPELRIVCRQISPIIPHN 214
                       + SW W   G K TY   YPR+W V+EG      +I C Q SPI+PHN
Sbjct: 116 D---------GRLSSWQWYPAG-KGTYAVRYPRSWFVYEGVF--RAQITCEQFSPILPHN 163

Query: 215 YKESSYPVSVFTYTIYNSGKTSADITLLFTWTNSVG----------------GDSEFTGQ 258
           Y+E+SYPV+VF +T  N    S  ++L+ +W N+VG                G   +T  
Sbjct: 164 YQETSYPVAVFLWTFSNPTDQSLTLSLMLSWQNTVGWFCNTTPSSAIAIRDDGSPVYT-- 221

Query: 259 HYNSKTKMNDGVHAVLLH---------HRTSHQLPPVTFALAAQETDGVHVSLCPHFVIS 309
            Y  +   +DG    L+           R  H  PP       QE DG   +L P  +  
Sbjct: 222 -YTPRWGQSDGNFNELIQTESFQGWRLRRMPHPNPP-------QEGDGEWAALIPTGL-- 271

Query: 310 GNSLGLT-------AKDMWHEIKEHGSFDRLNSMETSVTSEPGSSIGAAIAASVTVPPDS 362
           G   G +          +W      GS   +N    +        + AA A   ++ P  
Sbjct: 272 GEFFGCSRWQPEGDGAHLWQSFSVDGSLPFVNDPTPAAAG---EQVAAAFALRFSLAPGE 328

Query: 363 EGQVTFSLAWDCPEVNFMSGKTYYRRYTKFYGTHQNAAANIARDAILEHGSWELQIEAWQ 422
             Q+   LAWD P   F  G  YYRRYT F   H   A  +A  A+  + +W+ QI  WQ
Sbjct: 329 RKQIPVVLAWDFPVTEFGKGVIYYRRYTDFCDRHGTNAVTLAAQALAAYATWQEQIRTWQ 388

Query: 423 RPILEDKRLPEWYPITLFNELYYLNAGGAVWTDGS 457
            PIL     P+W+ + L NELY L++GG++W+  S
Sbjct: 389 APILSHPDWPDWFKMALCNELYVLSSGGSLWSAAS 423


>gi|428210868|ref|YP_007084012.1| putative bile acid beta-glucosidase [Oscillatoria acuminata PCC
           6304]
 gi|427999249|gb|AFY80092.1| putative bile acid beta-glucosidase [Oscillatoria acuminata PCC
           6304]
          Length = 820

 Score =  237 bits (605), Expect = 2e-59,   Method: Compositional matrix adjust.
 Identities = 149/400 (37%), Positives = 213/400 (53%), Gaps = 37/400 (9%)

Query: 523 QDGEENIGQFLYLEGIEYLMWNTYDVHFYSSFALIMLFPKIQLSIQRDFAAAVMMHDPSK 582
           QD EE  G+F  LE I+Y  + + DV  Y SFAL+ML+P+++ S+ R FA A+   D + 
Sbjct: 415 QDAEEPRGKFAVLECIDYRWYESLDVRLYGSFALLMLWPELEKSVMRGFARAIPTGDDTP 474

Query: 583 MKLLDDGQWVSRKVLGAVPHDIGICD--PWFEVNAYCLYDTARWKDLNPKFVLQVYRDVV 640
             +  +     RKV  A PHD+G  +  PW + N     D   WKDL   F L VYRD +
Sbjct: 475 RIIGYNQAAAIRKVTDATPHDLGAPNEHPWVKTNYTSYQDCNLWKDLPCDFALLVYRDFL 534

Query: 641 ATG--DKKFAKAVWPSVYVAMAYMDQFDRDGDGMIENDGFPDQTYDTWSVSGISAYSGGL 698
            TG  D +F    WPS+  A+AY+  FD DGDG+ EN G PDQT+D W + G+SAY GGL
Sbjct: 535 LTGGDDYEFLWDCWPSMVQAIAYLKTFDTDGDGIPENGGAPDQTFDDWPMRGVSAYCGGL 594

Query: 699 WVAAL--------------------------QAASALAREVGDRGSEDYFLFKFQKAKVV 732
           W+AAL                          Q  S L      + +   +    ++++ +
Sbjct: 595 WIAALQAAIAIGEVILNNPPHSGVFTDTIYEQKRSQLLEPATIQATLSTYQDWLEQSRPL 654

Query: 733 YE-KLWNGSYFNYDNSGSSQSSSIQADQLAGQWYARACGLLPIVDEDKARSALKKVYNYN 791
           Y+ KLWNG+Y+  D+   S S  + ADQL GQ+YAR  GL  IV  D A SALK VY   
Sbjct: 655 YQHKLWNGTYYRLDS--ESGSEVVMADQLCGQFYARLLGLADIVPIDCAESALKTVYESC 712

Query: 792 VLKVMGGKRGAVNGMLPDGR-VDMSSMQSREIWSGVTYAVAASMIHEDLADIGFQTACGI 850
            LK   G+ GA NG+L DG  V+  +    E+W+G+ + +AA ++   + + GF+    +
Sbjct: 713 FLKFHNGEFGAANGVLIDGSPVNPDATHPLEVWTGINFGLAAFLLQMGMKEEGFKLTETV 772

Query: 851 YEAAWSGTGLGYAFQTPEAWNTDDQYRSLCYMRPLAIWAM 890
            +  +     G  F+TPEA      +R+  Y+R +AIWA+
Sbjct: 773 IQQVYEN---GLQFRTPEAITAVGTFRASHYLRAMAIWAV 809



 Score =  180 bits (457), Expect = 3e-42,   Method: Compositional matrix adjust.
 Identities = 121/385 (31%), Positives = 180/385 (46%), Gaps = 37/385 (9%)

Query: 95  HGVPLGGVGSGSIGRSYRGEFQRWQIFPRVCEDKPVLANQFSVFVSRSNGQKYSSVLCPK 154
           HG+PLGG G+G IGRS RG+F  W +       + + A QFS+F    N Q+++  LC +
Sbjct: 41  HGMPLGGFGAGCIGRSPRGDFNLWHLDGGEHTFQSLPACQFSIFEETQN-QRHAYALCTE 99

Query: 155 TPEVLKDTTAAGIGSWDW----NLKGDKSTYHALYPRAWTVHEGEPDPELRIVCRQISPI 210
            P      +   + SW W      +    TYHALYPR+W V++     +L   C Q SPI
Sbjct: 100 PP------SDGSLNSWQWYPASTEENPTGTYHALYPRSWFVYQNVLKAQL--TCEQFSPI 151

Query: 211 IPHNYKESSYPVSVFTYTIYNSGKTSADITLLFTWTNSVG-------------GDSEFTG 257
           +P  Y+E SYP+++F +T YN       +++L TW N VG              D     
Sbjct: 152 LPGKYQECSYPLAIFEWTAYNPTDEPITLSILLTWENVVGWFTNAIKSPTVKVRDDGSPV 211

Query: 258 QHYNSKTKMNDGVHAVLLH--HRTSHQLPPVTFALAAQETDG---VHVSLCPHFVISGNS 312
             Y SK   + G     +   HR    +         QE +G   +  +  P      N+
Sbjct: 212 YEYQSKWGQSQGNSNRYIEDFHRIGCIMAGGHLGEVPQEGEGELAIATATNPFLDTFKNT 271

Query: 313 L---GLTAKDMWHEIKEHGSFDRLNSMETSVTSEPGSSIGAAIAASVTVPPDSEGQVTFS 369
                 + K++W +    GS   L+  + +  ++ G  IG A+A   T+ P    ++ F 
Sbjct: 272 RWNPTGSGKEVWEQFSHDGSL--LDCTDETPAAK-GERIGVALAVRFTIRPGKTRKIPFI 328

Query: 370 LAWDCPEVNFMSGKTYYRRYTKFYGTHQNAAANIARDAILEHGSWELQIEAWQRPILEDK 429
           LAWD P   F +G   YRRYT F+G H   + ++ R A+     W   I  WQ+ IL+ +
Sbjct: 329 LAWDFPVTEFAAGVWEYRRYTDFFGRHGKNSWSMIRTALKHSEMWRENIITWQQSILDRE 388

Query: 430 RLPEWYPITLFNELYYLNAGGAVWT 454
            L +   + LFNELY L  GG +WT
Sbjct: 389 DLSDSLKMALFNELYILTDGGTLWT 413


>gi|288901049|gb|ADC68237.1| hypothetical protein [Dendrobium nobile]
          Length = 207

 Score =  237 bits (605), Expect = 2e-59,   Method: Compositional matrix adjust.
 Identities = 123/193 (63%), Positives = 152/193 (78%), Gaps = 1/193 (0%)

Query: 644 DKKFAKAVWPSVYVAMAYMDQFDRDGDGMIENDGFPDQTYDTWSVSGISAYSGGLWVAAL 703
           D  F + VWP+V  AM YM+QFDRDGDG+IENDGFPDQTYD W+V GISAY G LW+A+L
Sbjct: 6   DMAFGREVWPAVCAAMDYMNQFDRDGDGLIENDGFPDQTYDAWTVHGISAYCGCLWLASL 65

Query: 704 QAASALAREVGDRGSEDYFLFKFQKAKVVYE-KLWNGSYFNYDNSGSSQSSSIQADQLAG 762
           QAA+A+A  +GD    +    KF  AK  +E KLWNGSYFNYD+  SS S SIQADQLAG
Sbjct: 66  QAAAAMAHRLGDHAYAEKCTIKFLTAKHAFESKLWNGSYFNYDSGNSSNSKSIQADQLAG 125

Query: 763 QWYARACGLLPIVDEDKARSALKKVYNYNVLKVMGGKRGAVNGMLPDGRVDMSSMQSREI 822
           QWYA + GL P+ DE+K RS L+K++++NV+KV GG+ GAVNGM P+G+VD S MQSREI
Sbjct: 126 QWYAASSGLPPLFDENKIRSTLQKIFDFNVMKVRGGRIGAVNGMHPNGKVDESCMQSREI 185

Query: 823 WSGVTYAVAASMI 835
           W+GVTYA++A+MI
Sbjct: 186 WTGVTYALSATMI 198


>gi|434400779|ref|YP_007134783.1| Glucosylceramidase [Stanieria cyanosphaera PCC 7437]
 gi|428271876|gb|AFZ37817.1| Glucosylceramidase [Stanieria cyanosphaera PCC 7437]
          Length = 804

 Score =  237 bits (604), Expect = 2e-59,   Method: Compositional matrix adjust.
 Identities = 149/385 (38%), Positives = 210/385 (54%), Gaps = 28/385 (7%)

Query: 529 IGQFLYLEGIEYLMWNTYDVHFYSSFALIMLFPKIQLSIQRDFAAAVMMHDPSKMKLLDD 588
           IGQF  LE ++Y  + + DV  Y SF L+ML+ K+  S+   FA A+   D +   +  +
Sbjct: 417 IGQFGVLECMDYRWYESLDVRLYGSFGLMMLWSKLDKSVLEAFARAIPKGDDTPRIIGYN 476

Query: 589 GQWVSRKVLGAVPHDIGICD--PWFEVNAYCLYDTARWKDLNPKFVLQVYRDVVATG--D 644
                RK  GA PHD+G  +  PW   N     D   WKDL   FVLQVYRD V TG  D
Sbjct: 477 RAQAIRKAAGATPHDLGAPNEHPWQATNYTSYQDCNLWKDLPSDFVLQVYRDYVLTGSKD 536

Query: 645 KKFAKAVWPSVYVAMAYMDQFDRDGDGMIENDGFPDQTYDTWSVSGISAYSGGLWVAALQ 704
            +F    W S+ V +AY+ +FD DGDG+ EN G PDQT+D W + GISAY GGLW+AAL+
Sbjct: 537 SEFLWECWDSIVVTLAYLKEFDLDGDGIPENSGAPDQTFDDWRLQGISAYCGGLWIAALE 596

Query: 705 AASALAREV-----------------GDRGSEDYFLFKFQKAKVVY-EKLWNGSYFNYDN 746
           AA A+ + +                 G     +++    Q+A+ +Y + LWNG Y+  D+
Sbjct: 597 AAIAIGQILMTHPPMNPNLQPPDYPQGIETVINHYRVWLQQARSLYHDTLWNGEYYRLDS 656

Query: 747 SGSSQSSSIQADQLAGQWYARACGLLPIVDEDKARSALKKVYNYNVLKVMGGKRGAVNGM 806
              S S  + ADQL GQ+YAR  GL  +V+E   +SALKK+Y    LK   GK GA NG+
Sbjct: 657 --QSNSDVVMADQLCGQFYARLLGLPDVVEEQYTKSALKKIYQACFLKFHNGKYGAANGV 714

Query: 807 LPDGR-VDMSSMQSREIWSGVTYAVAASMIHEDLADIGFQTACGIYEAAWSGTGLGYAFQ 865
           LPDG  V+ +     E+W+G+ + +AA M+   +    FQ    + +  +     G  F+
Sbjct: 715 LPDGSPVNPNDTHPLEVWTGINFGLAAFMLQMGMKAEAFQLTETVVKQVYEN---GLQFR 771

Query: 866 TPEAWNTDDQYRSLCYMRPLAIWAM 890
           TPEA      +R+  Y+R +AIW +
Sbjct: 772 TPEAITAVGTFRASHYLRAMAIWGI 796



 Score =  202 bits (513), Expect = 1e-48,   Method: Compositional matrix adjust.
 Identities = 132/384 (34%), Positives = 186/384 (48%), Gaps = 40/384 (10%)

Query: 95  HGVPLGGVGSGSIGRSYRGEFQRWQIFPRVCEDKPVLANQFSVFVSRSNGQKYSSVLCPK 154
           HG+PLGG G+G IGRS+RG+F  W +       + + A QFSVF S S  +  +  LC +
Sbjct: 42  HGMPLGGFGAGCIGRSHRGDFNLWHLDGGEHVFRSLPACQFSVFES-SEAKTQAYALCTE 100

Query: 155 TPEVLKDTTAAGIGSWDWNLKGDKSTYHALYPRAWTVHEGEPDPELRIVCRQISPIIPHN 214
            PE   D T   +  W W  K +K  YHALYPR+W   E +   + ++ C Q SPI   N
Sbjct: 101 PPE---DGT---LSRWQWYPK-EKGNYHALYPRSW--FEYQDVFQSQLTCEQFSPIWAEN 151

Query: 215 YKESSYPVSVFTYTIYNSGKTSADITLLFTWTNSVGGDSEFTGQHYNSKTKMNDGVHAVL 274
           Y+ESSYPV++F + ++N       ++++ TW N+VG    FT    +   K+ D    V 
Sbjct: 152 YQESSYPVAIFDWNVHNPTDKPITLSIMLTWQNTVGW---FTNAIKSPTVKVRDDGSPVY 208

Query: 275 LH------------------HRTSHQLPPVTFALAAQETDG------VHVSLCPHFVISG 310
            +                  +R    L  +      QE +G      V       F +S 
Sbjct: 209 EYQPKWGDSTGNFNQWIQDNYRVGCLLNRIQPHAEIQEGEGQIAIATVTNPTLEVFYLSR 268

Query: 311 NSLGLTAKDMWHEIKEHGSFDRLNSMETSVTSEPGSSIGAAIAASVTVPPDSEGQVTFSL 370
                   D+W      GS   L   +    ++PG  I +AIA   TV P    Q+ F L
Sbjct: 269 WHPDGDGADVWDYFAADGS---LPDRQDETPADPGEQIASAIAVRFTVNPGKTRQIPFVL 325

Query: 371 AWDCPEVNFMSGKTYYRRYTKFYGTHQNAAANIARDAILEHGSWELQIEAWQRPILEDKR 430
           AWD P   F  G  YYRRYT F+G   N A +I + A+     W+  I+AWQ PIL+ + 
Sbjct: 326 AWDLPVTEFAQGIKYYRRYTDFFGRTGNNAWSIVKTALKHSDWWQESIQAWQEPILKRQD 385

Query: 431 LPEWYPITLFNELYYLNAGGAVWT 454
           LP+W+ + LFNELY L  GG +W+
Sbjct: 386 LPDWFKMALFNELYLLTQGGTLWS 409


>gi|428773649|ref|YP_007165437.1| glucosylceramidase [Cyanobacterium stanieri PCC 7202]
 gi|428687928|gb|AFZ47788.1| Glucosylceramidase [Cyanobacterium stanieri PCC 7202]
          Length = 800

 Score =  237 bits (604), Expect = 3e-59,   Method: Compositional matrix adjust.
 Identities = 141/393 (35%), Positives = 209/393 (53%), Gaps = 28/393 (7%)

Query: 526 EENIGQFLYLEGIEYLMWNTYDVHFYSSFALIMLFPKIQLSIQRDFAAAVMMHDPSKMKL 585
           ++ +GQF  LE I+Y  + + DV  Y SF L+ LFP+++ S+   F+ A+   D +   +
Sbjct: 413 DDPVGQFGVLECIDYRWYESLDVRLYGSFGLLQLFPRLEKSVMEAFSRAIPTADDTPRII 472

Query: 586 LDDGQWVSRKVLGAVPHDIGICD--PWFEVNAYCLYDTARWKDLNPKFVLQVYRDVVATG 643
             +G    RK  GA PHD+G  +  PW + N     D   WKDL   FVL VYRD + TG
Sbjct: 473 GYNGASAIRKAKGATPHDLGAPNEHPWIKSNYTSYQDCNLWKDLGSDFVLLVYRDYLYTG 532

Query: 644 --DKKFAKAVWPSVYVAMAYMDQFDRDGDGMIENDGFPDQTYDTWSVSGISAYSGGLWVA 701
             D+ F    W SV   + Y+  FD D DG+ EN G PDQT+D W + GISAY GGLW+ 
Sbjct: 533 KNDQDFLWECWDSVVETLHYVKGFDLDNDGIPENSGAPDQTFDDWRLQGISAYCGGLWIC 592

Query: 702 ALQAASALAREVGD-----------------RGSEDYFLFKFQKAKVVY-EKLWNGSYFN 743
           AL+AA A+   + D                 +G+   F     +++ +Y + LWNG Y+N
Sbjct: 593 ALEAAIAIGTILLDNPPVNPALEKENAQENIQGAIALFRQWLHQSRAIYHDTLWNGEYYN 652

Query: 744 YDNSGSSQSSSIQADQLAGQWYARACGLLPIVDEDKARSALKKVYNYNVLKVMGGKRGAV 803
            D+   S S  + ADQL GQ+YA+   L  +V+++   + L K+Y+   LK   G  G  
Sbjct: 653 LDS--KSGSDVVMADQLCGQYYAQLLKLPDVVEKEYVTATLNKIYDACYLKFHNGTLGIA 710

Query: 804 NGMLPDGRVDMSS-MQSREIWSGVTYAVAASMIHEDLADIGFQTACGIYEAAWSGTGLGY 862
           NG+ PDG  +  +     EIW+G+ Y + A MI   + + G + A  +    +S    G 
Sbjct: 711 NGVKPDGSPEKENDTHPLEIWTGINYGIVAFMILNGMKEEGLKVAETVVNQVYSN---GL 767

Query: 863 AFQTPEAWNTDDQYRSLCYMRPLAIWAMQWALT 895
            F+TPEA    + +R+  Y+R +AIWA+  AL+
Sbjct: 768 QFRTPEAITAVNTFRASHYLRAMAIWAIYQALS 800



 Score =  187 bits (474), Expect = 3e-44,   Method: Compositional matrix adjust.
 Identities = 121/402 (30%), Positives = 185/402 (46%), Gaps = 48/402 (11%)

Query: 84  DPFIKRHLTS-----SHGVPLGGVGSGSIGRSYRGEFQRWQIFPRVCEDKPVLANQFSVF 138
           +P+  R+ ++     +HG+PLGG G+G IGR+  G+F  W I       + + A QF+VF
Sbjct: 24  NPYTVRYESNLDDGPNHGMPLGGFGAGCIGRAPSGDFNLWHIDGGNHTFETLPACQFAVF 83

Query: 139 VSRSNGQKYSSVLCPKTPEVLKDTTAAGIGSWDWNLKGDKSTYHALYPRAWTVHEGEPDP 198
                       L    PE         + SW W  +G K TY ALYPR+W  +EG    
Sbjct: 84  EHSEGEDAQVYALSTNAPE------DGSLSSWSWYPQG-KGTYSALYPRSWFSYEGVFKS 136

Query: 199 ELRIVCRQISPIIPHNYKESSYPVSVFTYTIYNSGKTSADITLLFTWTNSVG-------- 250
           +L  VC Q SP+   +Y+ESSYP++VF ++I+N       ++++ +W N+VG        
Sbjct: 137 QL--VCEQFSPVWAGSYQESSYPMAVFEWSIHNPTDKPITMSIMVSWQNTVGWFTNAIKS 194

Query: 251 ----------GDSEFTGQHYNSKTKMN----DGVHAVLLHHRTSHQLPPV----TFALAA 292
                      + E+  +  NS   +N    DG     L  R   +  P      +A+A 
Sbjct: 195 PTVKVRDDGSPEYEYKPRWGNSTGNLNQWIQDGFRVGCLFDRIRLEDEPTEGEGQWAIAT 254

Query: 293 QETDGVHVSLCPHFVISGNSLGLTAKDMWHEIKEHGSFDRLNSMETSVTSEPGSSIGAAI 352
                + V     +   G+       ++W     +GS   L   +    +EPG  I  A+
Sbjct: 255 TTNPTLEVFYHTRWNPKGD-----GSEIWDRFAMNGS---LPDFQDETPAEPGEQIAGAM 306

Query: 353 AASVTVPPDSEGQVTFSLAWDCPEVNFMSGKTYYRRYTKFYGTHQNAAANIARDAILEHG 412
           A   T+ P    ++ F L WD P   F  G  Y+RRYT F+G +     ++ R  +  H 
Sbjct: 307 AIRFTIRPGKTKKIPFILGWDFPVTEFAQGVNYFRRYTDFFGRNGRNVWSMIRTGLKHHD 366

Query: 413 SWELQIEAWQRPILEDKRLPEWYPITLFNELYYLNAGGAVWT 454
            W+ +++ WQ PIL    LP+W  + LFNELY L  GG +WT
Sbjct: 367 MWQNKVKMWQEPILSRSNLPDWLKMALFNELYLLTQGGTLWT 408


>gi|443325334|ref|ZP_21054034.1| putative bile acid beta-glucosidase [Xenococcus sp. PCC 7305]
 gi|442795063|gb|ELS04450.1| putative bile acid beta-glucosidase [Xenococcus sp. PCC 7305]
          Length = 808

 Score =  236 bits (602), Expect = 5e-59,   Method: Compositional matrix adjust.
 Identities = 142/385 (36%), Positives = 205/385 (53%), Gaps = 28/385 (7%)

Query: 529 IGQFLYLEGIEYLMWNTYDVHFYSSFALIMLFPKIQLSIQRDFAAAVMMHDPSKMKLLDD 588
           +GQF   E ++Y  + + DV  Y SF L+ML+ K+  ++   FA A+   D     +  +
Sbjct: 421 VGQFCVCECMDYRWYESLDVRLYGSFGLMMLWSKLDKAVLEAFARAIPKSDDKPRIIGYN 480

Query: 589 GQWVSRKVLGAVPHDIGICD--PWFEVNAYCLYDTARWKDLNPKFVLQVYRDVVAT-GDK 645
           G    RK  GA PHD+G  +  PW   N     D  +WKDL   FVLQVYRD + T GD 
Sbjct: 481 GATAIRKAAGATPHDLGAPNEHPWEATNYTSYQDCNQWKDLPSDFVLQVYRDYLFTEGDN 540

Query: 646 -KFAKAVWPSVYVAMAYMDQFDRDGDGMIENDGFPDQTYDTWSVSGISAYSGGLWVAALQ 704
            +F    W SV   +AY+ +FD D DG+ EN G PDQT+D W + GIS+Y GGLW+AAL+
Sbjct: 541 TEFLWECWDSVVCTLAYLKEFDEDNDGIPENSGAPDQTFDDWRLQGISSYCGGLWIAALE 600

Query: 705 AASALAR------------EVGD--RGSE----DYFLFKFQKAKVVYEKLWNGSYFNYDN 746
           AA  +              +  D  +G E    DY  +  +  ++ ++ LWNG Y+  D+
Sbjct: 601 AAIKIGEILIANPTKNPELQTADFPQGIEKTISDYREWLQRSRELYHDTLWNGEYYRLDS 660

Query: 747 SGSSQSSSIQADQLAGQWYARACGLLPIVDEDKARSALKKVYNYNVLKVMGGKRGAVNGM 806
              S S  + ADQL GQ+YAR  GL  +V+    +SAL K+Y+   LK   GK GA NG+
Sbjct: 661 --KSNSEVVMADQLCGQFYARLLGLPDVVENQYTKSALSKIYDACFLKFHDGKFGAANGV 718

Query: 807 LPDGR-VDMSSMQSREIWSGVTYAVAASMIHEDLADIGFQTACGIYEAAWSGTGLGYAFQ 865
           LPDG  V+       E+W+G+ + +AA M+   + D  F+    +    +     G  F+
Sbjct: 719 LPDGSAVNPDDTHPLEVWTGINFGLAAFMVQMGMKDEAFRLTETVVRQVYEN---GLQFR 775

Query: 866 TPEAWNTDDQYRSLCYMRPLAIWAM 890
           TPEA      +R+  Y+R +AIW +
Sbjct: 776 TPEAITAVGTFRASHYLRAMAIWGV 800



 Score =  200 bits (509), Expect = 2e-48,   Method: Compositional matrix adjust.
 Identities = 129/388 (33%), Positives = 185/388 (47%), Gaps = 39/388 (10%)

Query: 95  HGVPLGGVGSGSIGRSYRGEFQRWQIFPRVCEDKPVLANQFSVFVSRSNGQKYS-SVLCP 153
           HG PLGG G+G IGRS +G+F  W +         + A QFSVF S + G+  + ++ C 
Sbjct: 37  HGAPLGGFGAGCIGRSPKGDFNLWHLDGGEHLLNSIPACQFSVFESVAGGETQAFAMACE 96

Query: 154 KTPEVLK-DTTAAGIGSWDWNLKGDKSTYHALYPRAWTVHEGEPDPELRIVCRQISPIIP 212
              E+ K       +  W+W  + ++  Y ALYPR+W  ++G    EL   C Q SPI  
Sbjct: 97  DDGEIGKWGDREESLSRWNWYPQ-EQGVYAALYPRSWYEYQGVFASEL--TCEQFSPIWK 153

Query: 213 HNYKESSYPVSVFTYTIYNSGKTSADITLLFTWTNSVG----------------GDSEFT 256
            NY+E+SYPV++F +T +N       ++++ TW N+VG                G  E+ 
Sbjct: 154 ENYQETSYPVAIFDWTAHNPTDKPITLSIMLTWQNTVGWFTNAIKSPEVKVRDDGSPEYE 213

Query: 257 GQ--------HYNSKTKMNDGVHAVL--LHHRTSHQLPPVTFALAAQETDGVHVSLCPHF 306
            Q        +YN   + +  V  +L  +   +  Q      A+A      V V     +
Sbjct: 214 YQPKWGDSTGNYNQWIQDSYRVGCILNRVQPHSEIQEGEGQIAIATVANPAVEVFYLSRW 273

Query: 307 VISGNSLGLTAKDMWHEIKEHGSFDRLNSMETSVTSEPGSSIGAAIAASVTVPPDSEGQV 366
              G+       ++W      GS   L   E    ++PG  I  AIA  VT+PP    ++
Sbjct: 274 NPEGD-----GSEVWDSFAADGS---LPDKEDETPADPGEQIACAIAVRVTIPPGKTKKI 325

Query: 367 TFSLAWDCPEVNFMSGKTYYRRYTKFYGTHQNAAANIARDAILEHGSWELQIEAWQRPIL 426
            F LAWD P   F  G  YYRRYT F+G   N   +I + A+     W  QIEAWQ PIL
Sbjct: 326 PFILAWDFPVTEFAQGINYYRRYTDFFGRTGNNVWSIVKTALKHSDWWREQIEAWQNPIL 385

Query: 427 EDKRLPEWYPITLFNELYYLNAGGAVWT 454
           E   LP+W+ + LFNELY L  GG +W+
Sbjct: 386 ERPDLPDWFKMALFNELYLLTQGGTLWS 413


>gi|186681980|ref|YP_001865176.1| hypothetical protein Npun_R1544 [Nostoc punctiforme PCC 73102]
 gi|186464432|gb|ACC80233.1| protein of unknown function DUF608 [Nostoc punctiforme PCC 73102]
          Length = 801

 Score =  236 bits (601), Expect = 6e-59,   Method: Compositional matrix adjust.
 Identities = 147/383 (38%), Positives = 207/383 (54%), Gaps = 26/383 (6%)

Query: 529 IGQFLYLEGIEYLMWNTYDVHFYSSFALIMLFPKIQLSIQRDFAAAVMMHDPSK------ 582
           IGQF  LE ++Y  + + DV  Y SFAL+MLFP+++ S+ R FA A+   D +       
Sbjct: 419 IGQFAVLECLDYRWYESLDVRLYGSFALLMLFPELEKSVMRAFARAIPQGDDTPRIIGYY 478

Query: 583 MKLLDDGQWVSRKVLGAVPHDIGICDP--WFEVNAYCLYDTARWKDLNPKFVLQVYRDVV 640
           M +  +     RKV GA PHD+G  +   W + N     D   WKDL   FVLQVYRD +
Sbjct: 479 MTIKAESPIAVRKVAGATPHDLGAPNEHVWEKTNYTSYQDCNLWKDLGSDFVLQVYRDFL 538

Query: 641 ATG--DKKFAKAVWPSVYVAMAYMDQFDRDGDGMIENDGFPDQTYDTWSVSGISAYSGGL 698
            TG  D +F    W ++   + Y+  FD DGDG+ EN G PDQT+D W + G+SAY GGL
Sbjct: 539 LTGADDVEFLADCWNAIVQTLDYLKTFDLDGDGIPENSGAPDQTFDDWRLQGVSAYCGGL 598

Query: 699 WVAALQAASA-----LAREVGDRGS-----EDYFLFKFQKAKVVYEKLWNGSYFNYDNSG 748
           W+AAL+AA A     L  ++GD G        Y  +  Q   +  EKLWNG Y+  D+  
Sbjct: 599 WLAALEAAIAISDLLLTHKLGDLGGLAVQKSIYETWLQQSLPIYQEKLWNGQYYQLDS-- 656

Query: 749 SSQSSSIQADQLAGQWYARACGLLPIVDEDKARSALKKVYNYNVLKVMGGKRGAVNGMLP 808
            S S  + ADQL GQ+YAR   L  IV  D+A SALK VY+   LK   G+ G  NG+ P
Sbjct: 657 QSGSDVVMADQLCGQFYARLLDLPDIVPSDRALSALKTVYDACFLKFCNGEFGTANGVRP 716

Query: 809 DGRVDM-SSMQSREIWSGVTYAVAASMIHEDLADIGFQTACGIYEAAWSGTGLGYAFQTP 867
           DG  +   +    E+W+G+ + +AA ++   + D   +    + +  +     G  F+TP
Sbjct: 717 DGSPENPKATHPLEVWTGINFGLAAFLVQMGMKDEALRLTQAVVQQIYEN---GLQFRTP 773

Query: 868 EAWNTDDQYRSLCYMRPLAIWAM 890
           EA      +R+  Y+R +AIW +
Sbjct: 774 EAITAAGTFRASTYLRAMAIWGI 796



 Score =  193 bits (491), Expect = 4e-46,   Method: Compositional matrix adjust.
 Identities = 134/392 (34%), Positives = 193/392 (49%), Gaps = 46/392 (11%)

Query: 95  HGVPLGGVGSGSIGRSYRGEFQRWQIFPRVCEDKPVLANQFSVFVSR-SNGQKYSSVLCP 153
           HG+PLGG G+G IGRS RG+F  W I       K V A QFSVF S  ++ Q Y+  L  
Sbjct: 40  HGMPLGGFGAGCIGRSSRGDFNLWHIDGGEHTFKNVPACQFSVFESNGTSTQAYA--LST 97

Query: 154 KTPEVLKDTTAAGIGSWDW--NLKGDKST--YHALYPRAWTVHEGEPDPELRIVCRQISP 209
           + P          + +W W   L   + T  YHALYPR+W V+E     +L   C Q SP
Sbjct: 98  QGP------NDGALKAWQWYPTLPSTEGTGNYHALYPRSWFVYENIFQAQL--TCEQFSP 149

Query: 210 IIPHNYKESSYPVSVFTYTIYNSGKTSADITLLFTWTNSVGGDSEFTGQHYNSKTKMNDG 269
           +   NY+E+SYPV++F +  +N       ++++ TW N VG    FT    + + ++ D 
Sbjct: 150 VWAGNYQETSYPVAIFLWNAHNPTNAPITLSIMLTWQNMVGW---FTNALKSPEVRVRDD 206

Query: 270 VHAVLLHH-------RTSHQL--PPVTFAL--------AAQETDGV-------HVSLCPH 305
              V  +           +Q+   P  F          A QE DG        H  +   
Sbjct: 207 GSPVYEYQPRWGESQGNYNQIVENPQQFGCLLSRVGSDALQEGDGTWCIATLKHPQVEIF 266

Query: 306 FVISGNSLGLTAKDMWHEIKEHGSFDRLNSMETSVTSEPGSSIGAAIAASVTVPPDSEGQ 365
           +    N+ G T  ++W    + GS    N ++ +   E G  +GAA+A   T+ P    +
Sbjct: 267 YHTRWNAEG-TGDEVWQSFAKDGSLS--NYIDANPVGE-GEQLGAAMALRFTLQPGETLE 322

Query: 366 VTFSLAWDCPEVNFMSGKTYYRRYTKFYGTHQNAAANIARDAILEHGSWELQIEAWQRPI 425
           + F LAWD P   F +G  YYRRYT F+    N A  IA  A+ E+ +W  QI+ WQ+PI
Sbjct: 323 IPFVLAWDLPITEFAAGVNYYRRYTDFFDRSGNNAWAIASTALQEYQTWRSQIQTWQQPI 382

Query: 426 LEDKRLPEWYPITLFNELYYLNAGGAVWTDGS 457
           L+ + LP W+ + LFNELY L +GG +W+  S
Sbjct: 383 LDREDLPNWFKMALFNELYDLTSGGTLWSAAS 414


>gi|334116939|ref|ZP_08491031.1| Glucosylceramidase [Microcoleus vaginatus FGP-2]
 gi|333461759|gb|EGK90364.1| Glucosylceramidase [Microcoleus vaginatus FGP-2]
          Length = 818

 Score =  235 bits (600), Expect = 8e-59,   Method: Compositional matrix adjust.
 Identities = 148/390 (37%), Positives = 212/390 (54%), Gaps = 20/390 (5%)

Query: 524 DGEENIGQFLYLEGIEYLMWNTYDVHFYSSFALIMLFPKIQLSIQRDFAAAVMMHDPSKM 583
           D  +  GQF  LE I+Y  + + DV  Y SFAL+ML+P ++ S+   FA A+   D +  
Sbjct: 434 DERDPKGQFAVLECIDYRWYESLDVRLYGSFALLMLWPDLEKSVVLAFARAISAGDDTPR 493

Query: 584 KLLDDGQWVSRKVLGAVPHDIGICD--PWFEVNAYCLYDTARWKDLNPKFVLQVYRDVVA 641
            +  +     RK +GA PHD+G  +  PW + N     D   WKDL   FVLQVYRD + 
Sbjct: 494 IIGWNQASAIRKAVGATPHDLGAPNEHPWEKTNYTSYQDCNLWKDLPCDFVLQVYRDFLL 553

Query: 642 TG--DKKFAKAVWPSVYVAMAYMDQFDRDGDGMIENDGFPDQTYDTWSVSGISAYSGGLW 699
           TG  D +F +  WP++  A+AY+  FD DGDG+ EN G PDQT+D W + GISAY GGLW
Sbjct: 554 TGGTDSEFLQECWPAIVEALAYLKTFDEDGDGIPENSGAPDQTFDDWQMRGISAYCGGLW 613

Query: 700 VAALQAASALAREVGDRGSED----------YFLFKFQKAKVVYEKLWNGSYFNYDNSGS 749
           +AAL+AA A+   + D   +           Y  +  Q      EKLWNG Y+  D+   
Sbjct: 614 LAALEAAIAIGEILEDAAPQIIPNPQAKIALYHSWLLQARPSYQEKLWNGQYYRLDS--E 671

Query: 750 SQSSSIQADQLAGQWYARACGLLPIVDEDKARSALKKVYNYNVLKVMGGKRGAVNGMLPD 809
           S S  + ADQL GQ+YA+  GL  IV  + A SAL+ VY    LK   G+ GA NG++ +
Sbjct: 672 SNSEVVMADQLCGQFYAKLLGLPDIVPPECAVSALETVYESCFLKFNEGEFGAANGVMLN 731

Query: 810 GRVDM-SSMQSREIWSGVTYAVAASMIHEDLADIGFQTACGIYEAAWSGTGLGYAFQTPE 868
           G  +  ++    E+W+G+ + +AA ++   + D  F+    +    +     G  F+TPE
Sbjct: 732 GSPENPNATHPLEVWTGINFGLAAFLVQMGMEDKAFKLTDAVVNQIYEN---GLQFRTPE 788

Query: 869 AWNTDDQYRSLCYMRPLAIWAMQWALTRPK 898
           A      +R+  Y+R +AIWA+   LT  K
Sbjct: 789 AITAGGTFRASHYLRAMAIWAIYGVLTNFK 818



 Score =  188 bits (477), Expect = 1e-44,   Method: Compositional matrix adjust.
 Identities = 134/410 (32%), Positives = 189/410 (46%), Gaps = 70/410 (17%)

Query: 95  HGVPLGGVGSGSIGRSYRGEFQRWQIFPRVCEDKPVLANQFSVFVSRSNGQKYSSVLCPK 154
           HG+PLGG G+G IGRS RG+F  W I       + + A QFSVF   + G+K +  L  +
Sbjct: 42  HGMPLGGFGAGCIGRSSRGDFNLWHIDGGEHIFRSLPACQFSVF-EETQGKKQAFALSTE 100

Query: 155 TPEVLKDTTAAGIGSWDW---------------------NLKGDKSTYHALYPRAWTVHE 193
            P      T   + SW W                            +YHALYPR+W V+E
Sbjct: 101 PP------TDGSLSSWKWYPCEGRRKKEEGRREEEERTGEESNSSGSYHALYPRSWFVYE 154

Query: 194 GEPDPELRIVCRQISPIIPHNYKESSYPVSVFTYTIYNSGKTSADITLLFTWTNSVGGDS 253
                +L   C Q SPI+P+NY+ESSYPV++F +  +N       ++++ TW N VG   
Sbjct: 155 NVFHAQLS--CEQFSPILPNNYQESSYPVAMFEWVAHNPTDAPITLSIMLTWENIVGW-- 210

Query: 254 EFTGQHYNSKTKMND----------------GVHAVLLH--HRTSHQLPPVTFALAAQET 295
            FT  + + K  + D                G   +L+   HR    +  +T     QE 
Sbjct: 211 -FTNANKSPKVMVRDDGSPFYDYQPRWGESAGNLNLLVEDFHRIGCVMTRLTAEDEPQEG 269

Query: 296 DG-----------VHVSLCPHFVISGNSLGLTAKDMWHEIKEHGSFDRLNSMETSVTSEP 344
           DG           V V     +  +GN      +D+W      G F   ++ ET+  +  
Sbjct: 270 DGQWAIATITNPAVEVFYQTRWNPAGN-----GEDIWRNFSAEG-FLLDDNDETAAAA-- 321

Query: 345 GSSIGAAIAASVTVPPDSEGQVTFSLAWDCPEVNFMSGKTYYRRYTKFYGTHQNAAANIA 404
           G  +  AIA   T+ P    ++ F +AWD P   F SG  YYRRYT F+G +   A  I 
Sbjct: 322 GEQLACAIAVRFTLRPGKTRKIPFIIAWDLPVTEFASGVWYYRRYTDFFGRNGKNAWPII 381

Query: 405 RDAILEHGSWELQIEAWQRPILEDKRLPEWYPITLFNELYYLNAGGAVWT 454
           R A+    +W   IEAWQ+PILE K L + + + LFNELY L  GG +W+
Sbjct: 382 RTAMKHSDTWRENIEAWQKPILERKDLSDSFKMALFNELYALTDGGTLWS 431


>gi|147788373|emb|CAN61188.1| hypothetical protein VITISV_019327 [Vitis vinifera]
          Length = 550

 Score =  235 bits (599), Expect = 1e-58,   Method: Compositional matrix adjust.
 Identities = 108/217 (49%), Positives = 152/217 (70%), Gaps = 3/217 (1%)

Query: 27  DPAKPASLTWQRKLSTGEIPLSQFTLNWKETVQLAPIGVRILCLIREEAAKGKRAFIDPF 86
           D A P    W+R+L++    L +F++ + E +++  +G+R+   IREEA++G++A IDPF
Sbjct: 31  DSAAPPEQAWRRRLNSHANILKEFSVTFTEAIKMIRLGIRLWSYIREEASQGRKAPIDPF 90

Query: 87  IKRHL--TSSHGVPLGGVGSGSIGRSYRGEFQRWQIFPRVCEDKPVLANQFSVFVSRSNG 144
            +     ++S GVPLGG+GSGSI R +RGEF+ WQI P  C+  P++ANQFS+F+SR  G
Sbjct: 91  TRETCKPSASQGVPLGGMGSGSISRGFRGEFRHWQIVPGTCDASPIMANQFSIFISREGG 150

Query: 145 -QKYSSVLCPKTPEVLKDTTAAGIGSWDWNLKGDKSTYHALYPRAWTVHEGEPDPELRIV 203
            +KY+SVL P   E L  +   GI SW WNL G  STYHAL+PRAWT+++GEPDPEL++ 
Sbjct: 151 NKKYASVLAPGQHEGLGKSGDQGISSWGWNLSGQHSTYHALFPRAWTIYDGEPDPELKVS 210

Query: 204 CRQISPIIPHNYKESSYPVSVFTYTIYNSGKTSADIT 240
           CRQISP IPHNY++SS P +VF YT   + + S+  T
Sbjct: 211 CRQISPFIPHNYRDSSLPTAVFVYTPLTTFEVSSKFT 247


>gi|428317101|ref|YP_007114983.1| Glucosylceramidase [Oscillatoria nigro-viridis PCC 7112]
 gi|428240781|gb|AFZ06567.1| Glucosylceramidase [Oscillatoria nigro-viridis PCC 7112]
          Length = 809

 Score =  234 bits (598), Expect = 1e-58,   Method: Compositional matrix adjust.
 Identities = 148/390 (37%), Positives = 213/390 (54%), Gaps = 20/390 (5%)

Query: 524 DGEENIGQFLYLEGIEYLMWNTYDVHFYSSFALIMLFPKIQLSIQRDFAAAVMMHDPSKM 583
           D  +  GQF  LE I+Y  + + DV  Y SFAL+ML+P ++ S+   FA A+   D +  
Sbjct: 425 DERDPKGQFAVLECIDYRWYESLDVRLYGSFALLMLWPDLEKSVVLAFARAISAGDDTPR 484

Query: 584 KLLDDGQWVSRKVLGAVPHDIGICD--PWFEVNAYCLYDTARWKDLNPKFVLQVYRDVVA 641
            +  +     RK +GA PHD+G  +  PW + N     D   WKDL   FVLQVYRD + 
Sbjct: 485 IIGWNQASAIRKAVGATPHDLGAPNEHPWEKTNYTSYQDCNLWKDLPCDFVLQVYRDFLL 544

Query: 642 TG--DKKFAKAVWPSVYVAMAYMDQFDRDGDGMIENDGFPDQTYDTWSVSGISAYSGGLW 699
           TG  D +F +  WP++  A+AY+  FD DGDG+ EN G PDQT+D W + GISAY GGLW
Sbjct: 545 TGGTDCEFLQECWPAIVEALAYLKTFDEDGDGIPENSGAPDQTFDDWQMRGISAYCGGLW 604

Query: 700 VAALQAASALAREVGDRGSED----------YFLFKFQKAKVVYEKLWNGSYFNYDNSGS 749
           +AAL+AA A+   + D   E           Y  +  Q      EKLWNG Y+  D+   
Sbjct: 605 LAALEAAIAIGEILEDAAPEIIPNPQAKIALYHSWLLQARPSYQEKLWNGQYYRLDS--E 662

Query: 750 SQSSSIQADQLAGQWYARACGLLPIVDEDKARSALKKVYNYNVLKVMGGKRGAVNGMLPD 809
           S S  + ADQL GQ+YA+  GL  IV  + A SAL+ VY    LK   G+ GA NG++ +
Sbjct: 663 SNSEVVMADQLCGQFYAKLLGLPDIVPPECAVSALETVYESCFLKFNDGEFGAANGVMLN 722

Query: 810 GRVDM-SSMQSREIWSGVTYAVAASMIHEDLADIGFQTACGIYEAAWSGTGLGYAFQTPE 868
           G  +  ++    E+W+G+ + +AA ++   + +  F+    + +  +     G  F+TPE
Sbjct: 723 GSPENPNATHPLEVWTGINFGLAAFLVQMGMEEKAFKLTDAVVKQIYEN---GLQFRTPE 779

Query: 869 AWNTDDQYRSLCYMRPLAIWAMQWALTRPK 898
           A      +R+  Y+R +AIWA+   LT  K
Sbjct: 780 AITAGGTFRASHYLRAMAIWAIYGVLTNFK 809



 Score =  192 bits (489), Expect = 5e-46,   Method: Compositional matrix adjust.
 Identities = 135/402 (33%), Positives = 189/402 (47%), Gaps = 62/402 (15%)

Query: 95  HGVPLGGVGSGSIGRSYRGEFQRWQIFPRVCEDKPVLANQFSVFVSRSNGQKYSSVLCPK 154
           HG+PLGG G+G IGRS RG+F  W I       + + A QFSVF   + G+K +  L  +
Sbjct: 41  HGMPLGGFGAGCIGRSSRGDFNLWHIDGGEHIFRSLPACQFSVF-EETQGKKQAFALSTE 99

Query: 155 TPEVLKDTTAAGIGSWDW-------------NLKGDKSTYHALYPRAWTVHEGEPDPELR 201
            P      T   + SW W               +    TYHALYPR+W V+E     +L 
Sbjct: 100 PP------TDGSLSSWKWYPSEGRRKKEEGRREESTSGTYHALYPRSWFVYENVFQAQLS 153

Query: 202 IVCRQISPIIPHNYKESSYPVSVFTYTIYNSGKTSADITLLFTWTNSVGGDSEFTGQHYN 261
             C Q SPI+P+NY+ESSYPV++F +  +N       ++++ TW N VG    FT  + +
Sbjct: 154 --CEQFSPILPNNYQESSYPVAMFEWVAHNPTDAPITLSIMLTWENIVGW---FTNANKS 208

Query: 262 SKTKMND----------------GVHAVLLH--HRTSHQLPPVTFALAAQETDG------ 297
            K K+ D                G   +L+   HR    +  +T     QE DG      
Sbjct: 209 PKVKVRDDGSPFYDYQPRWGESAGNLNLLVEDFHRIGCVMTRLTADDEPQEGDGQWAIAT 268

Query: 298 -----VHVSLCPHFVISGNSLGLTAKDMWHEIKEHGSFDRLNSMETSVTSEPGSSIGAAI 352
                V V     +  +GN       D+W      G F   +  ET+  +  G  +  AI
Sbjct: 269 ITNPAVEVFYQTRWNPAGN-----GDDIWRNFSAEG-FLLDDRDETAAAA--GEQLACAI 320

Query: 353 AASVTVPPDSEGQVTFSLAWDCPEVNFMSGKTYYRRYTKFYGTHQNAAANIARDAILEHG 412
           A   T+ P    ++ F +AWD P   F SG  YYRRYT F+G +   A  I R A+    
Sbjct: 321 AVRFTIRPGKTRKIPFIIAWDLPVTEFASGVWYYRRYTDFFGRNGKNAWPIIRTAMKHSD 380

Query: 413 SWELQIEAWQRPILEDKRLPEWYPITLFNELYYLNAGGAVWT 454
           +W   IEAWQ+PILE + L + + + LFNELY L  GG +W+
Sbjct: 381 TWRENIEAWQKPILERQDLSDSFKMALFNELYALTDGGTLWS 422


>gi|443311911|ref|ZP_21041533.1| putative bile acid beta-glucosidase [Synechocystis sp. PCC 7509]
 gi|442777986|gb|ELR88257.1| putative bile acid beta-glucosidase [Synechocystis sp. PCC 7509]
          Length = 804

 Score =  234 bits (597), Expect = 2e-58,   Method: Compositional matrix adjust.
 Identities = 141/393 (35%), Positives = 205/393 (52%), Gaps = 31/393 (7%)

Query: 524 DGEENIGQFLYLEGIEYLMWNTYDVHFYSSFALIMLFPKIQLSIQRDFAAAVMMHDPSKM 583
           D  + +GQF  LE ++Y  + + DV  Y SF L+ML+ K+  SI R +A A+   D +  
Sbjct: 409 DERDPVGQFAVLECLDYRWYESLDVRLYGSFGLLMLWSKLDKSILRAYARAISTSDRTLR 468

Query: 584 KLLDDGQWVSRKVLGAVPHDIGICD--PWFEVNAYCLYDTARWKDLNPKFVLQVYRDVVA 641
           ++  +     RKV  A PHD+G  +  PW + N     D   WKDL   FVLQVYRD V 
Sbjct: 469 EIGYNQATGLRKVANATPHDLGAPNEHPWEKTNYTSYQDCNLWKDLPSDFVLQVYRDFVL 528

Query: 642 TG--DKKFAKAVWPSVYVAMAYMDQFDRDGDGMIENDGFPDQTYDTWSVSGISAYSGGLW 699
           TG  D +     W ++   + Y+  FD D DG+ EN G PDQT+D W + G+SAY GGLW
Sbjct: 529 TGSTDWELLWDCWGAIVQTLNYLKTFDLDNDGIPENSGAPDQTFDDWRLQGVSAYCGGLW 588

Query: 700 VAALQAASALAREVGDRGSED---------------------YFLFKFQKAKVVYEKLWN 738
           +A L++A A+ + +     E                      Y  +  Q   V  EKLWN
Sbjct: 589 LAGLESAIAIGKTLISYPQEHPLGEMLTSPLNQPPIEETIGIYQTWLEQSRPVYQEKLWN 648

Query: 739 GSYFNYDNSGSSQSSSIQADQLAGQWYARACGLLPIVDEDKARSALKKVYNYNVLKVMGG 798
           G Y+  D+   S S  + ADQL GQ+YAR  G+  IV+ D+  S+LK VY    LK   G
Sbjct: 649 GQYYKLDS--ESGSDVVMADQLCGQFYARLLGVPDIVESDRTVSSLKTVYEACFLKFHNG 706

Query: 799 KRGAVNGMLPDGRVDM-SSMQSREIWSGVTYAVAASMIHEDLADIGFQTACGIYEAAWSG 857
             GA NG+  DG  +   +    E+W+G+ + +AA ++  D+ +  F+    + E  ++ 
Sbjct: 707 AFGAANGLKIDGSPENPDATHPLEVWTGINFGLAAFLLQMDMKEEAFKLTQTVVEQIYNN 766

Query: 858 TGLGYAFQTPEAWNTDDQYRSLCYMRPLAIWAM 890
              G  F+TPEA      +R+  Y+RP+AIWA+
Sbjct: 767 ---GLQFRTPEAITAVGTFRACHYLRPMAIWAV 796



 Score =  186 bits (472), Expect = 5e-44,   Method: Compositional matrix adjust.
 Identities = 133/399 (33%), Positives = 202/399 (50%), Gaps = 46/399 (11%)

Query: 95  HGVPLGGVGSGSIGRSYRGEFQRWQIFPRVCEDKPVLANQFSVFVSRSNGQKYSSVLCPK 154
           HG+PLGG G+G IGRS +G+F  W I       K + A QFSVF  +SN    +  LC +
Sbjct: 41  HGMPLGGFGAGCIGRSSKGDFNLWHIDGGEHTFKSIPACQFSVFEQQSN---LAYALCTE 97

Query: 155 TPEVLKDTTAAGIGSWDWNLKGDKSTYHALYPRAWTVHEGEPDPELRIVCRQISPIIPHN 214
            PE         + +W W  + +K TYHALYPR+W V+E     +L   C Q SPI+P N
Sbjct: 98  APE------DGSLKAWQWYPQ-NKGTYHALYPRSWFVYENVFLSQL--TCEQFSPILPDN 148

Query: 215 YKESSYPVSVFTYTIYNSGKTSADITLLFTWTNSVGGDSEFTGQHYNSKTKMNDGVHAVL 274
           Y+E+SYPV++F +  +N       ++++ TW N VG    FT    + + ++ D    V 
Sbjct: 149 YRETSYPVAIFVWKAHNPTDQPLTLSIMLTWQNMVGW---FTNSIKSPQVRVRDDGSPVY 205

Query: 275 L-----------HHRTSHQLPPVTFALAAQETDGVHV----------SLCPHFVISG--- 310
                       +++       +   LA   +D V +          S+ P + IS    
Sbjct: 206 EYEPRLGQSAGNYNKLVETADSLGCILARVGSDDVALEGIGQWVIATSINPKYQISHHTR 265

Query: 311 -NSLGLTAKDMWHEIKEHGSFDRLNSMETSVTSEPGSSIGAAIAASVTVPPDSEGQVTFS 369
            N  G T  D+W      GS    N ++ +  ++ G  + +AIA   T+ P    ++ F 
Sbjct: 266 WNPTG-TGADVWQSFSTDGSLP--NYIDETPIAD-GEQLASAIAIKFTLEPGESLEIPFV 321

Query: 370 LAWDCPEVNFMSGKTYYRRYTKFYGTHQNAAANIARDAILEHGSWELQIEAWQRPILEDK 429
           LAWD P   F  G  YYRRYT F+G + + A  IA +A+  + +W+ Q++AWQ+PI++  
Sbjct: 322 LAWDFPITEFAPGVNYYRRYTDFFGRNGSNAEAIAFNALQHYQTWQQQVQAWQQPIIDRT 381

Query: 430 RLPEWYPITLFNELYYLNAGGAVWT--DGSPPVHSLVTI 466
            LP+W+ + LFNELY L +GG +W+  D   PV     +
Sbjct: 382 DLPDWFKMALFNELYDLTSGGTLWSAADERDPVGQFAVL 420


>gi|434389621|ref|YP_007100232.1| putative bile acid beta-glucosidase [Chamaesiphon minutus PCC 6605]
 gi|428020611|gb|AFY96705.1| putative bile acid beta-glucosidase [Chamaesiphon minutus PCC 6605]
          Length = 818

 Score =  234 bits (597), Expect = 2e-58,   Method: Compositional matrix adjust.
 Identities = 144/400 (36%), Positives = 211/400 (52%), Gaps = 38/400 (9%)

Query: 524 DGEENIGQFLYLEGIEYLMWNTYDVHFYSSFALIMLFPKIQLSIQRDFAAAVMMHDPSK- 582
           D  + +GQF  LE ++Y  + + DV  Y SF++++L+PK++ SI   FA A+   D +K 
Sbjct: 418 DDRDPVGQFGVLECLDYRWYESLDVRLYGSFSVLLLWPKLEKSILIAFARAIDTEDTNKR 477

Query: 583 ----------------MKLLDDGQWVSRKVLGAVPHDIGICD--PWFEVNAYCLYDTARW 624
                             +  D     RK L A PHD+G  +  PW + N  C  D   W
Sbjct: 478 LIGYYVQTAFGYEDRPQNVDRDAMMAPRKTLHATPHDLGAPNEHPWEKTNYTCYQDCNLW 537

Query: 625 KDLNPKFVLQVYRDVVATGDK--KFAKAVWPSVYVAMAYMDQFDRDGDGMIENDGFPDQT 682
           KDL   FVLQVYR  + TG K  +F    W ++   +AY+ QFD D DG+IEN G PDQT
Sbjct: 538 KDLGSDFVLQVYRAYIFTGKKDLEFLAECWTAIVDTLAYLKQFDLDNDGIIENSGAPDQT 597

Query: 683 YDTWSVSGISAYSGGLWVAALQAASALAR--------EVGDRGSEDYFLFK--FQKAKVV 732
           +D W + GISAY GGLW+AAL+AA ++          E+  + +E    ++      K +
Sbjct: 598 FDDWRLQGISAYCGGLWLAALEAAISIGNILTQQYHGELTPKPAEVIPQYQAWLDSGKPI 657

Query: 733 Y-EKLWNGSYFNYDNSGSSQSSSIQADQLAGQWYARACGLLPIVDEDKARSALKKVYNYN 791
           Y ++LWNGSY+  D    S S  + ADQL GQ+Y +   L  IV  D A++AL+ VY   
Sbjct: 658 YQQQLWNGSYYRIDT--GSGSEVVMADQLCGQFYTQLLELDDIVPPDCAKTALQTVYQSC 715

Query: 792 VLKVMGGKRGAVNGMLPDGR-VDMSSMQSREIWSGVTYAVAASMIHEDLADIGFQTACGI 850
            L   GGK GA NG+  DG   +       E+W+G+ + +AA ++   + +   + A  I
Sbjct: 716 FLNFQGGKFGAANGVKIDGSPSNPKDTHPLEVWTGINFGIAAFLVQMGMQEEALKLAETI 775

Query: 851 YEAAWSGTGLGYAFQTPEAWNTDDQYRSLCYMRPLAIWAM 890
            E  +S    G  F+TPEA  +   +R+  Y+R +AIW +
Sbjct: 776 VEQVYSN---GLQFRTPEAITSAGTFRASHYLRAMAIWGI 812



 Score =  182 bits (461), Expect = 1e-42,   Method: Compositional matrix adjust.
 Identities = 124/398 (31%), Positives = 181/398 (45%), Gaps = 47/398 (11%)

Query: 95  HGVPLGGVGSGSIGRSYRGEFQRWQIFPRVCEDKPVLANQFSVFVSRSNGQKYSSVLCPK 154
           HG+PLGG G+G IGRS+RGEF  W I       K + A QFSVF   +NG   +  +  +
Sbjct: 41  HGMPLGGFGAGCIGRSHRGEFNLWHIDGGEHVFKNLSACQFSVFEQLANGDSQAYAMSTQ 100

Query: 155 TPEVLKDTTAAGIGSWDWNLKGDKS-----TYHALYPRAWTVHEGEPDPELRIVCRQISP 209
            P          +  W W  +   +     +Y ALYPR+W  +E     +L   C Q SP
Sbjct: 101 AP------GDGSLNRWQWYPQTTPTGQSTGSYQALYPRSWYSYENVFQAQL--TCEQFSP 152

Query: 210 IIPHNYKESSYPVSVFTYTIYNSGKTSADITLLFTWTNSVGGDSEFTGQHYNSKTKMNDG 269
           I   NY+E+SYP++VF +T +N       ++++ TW N+VG    F     +SK ++ D 
Sbjct: 153 IWADNYQETSYPIAVFEWTAHNPTDKPITLSIMLTWENTVGW---FANTSPSSKVEIRDD 209

Query: 270 VHAV------------------LLHHRTSHQLPPVTFALAAQETDG-------VHVSLCP 304
              V                  + ++R    L     +    E +G        + SL  
Sbjct: 210 GSPVYDYQPRIGESEGNFNQWIVDNYRVGCLLDKAHKSDIPSEGEGQWSIASIANPSLEV 269

Query: 305 HFVISGNSLGLTAKDMWHEIKEHGSFDRLNSMETSVTSEPGSSIGAAIAASVTVPPDSEG 364
            +    N +G    ++W      GS   ++    +   E    I  AI    TV P    
Sbjct: 270 FYHSRWNPIG-NGGEIWDSFAADGSLPDIDDESEAKQRE---RIATAITIRFTVKPGQTK 325

Query: 365 QVTFSLAWDCPEVNFMSGKTYYRRYTKFYGTHQNAAANIARDAILEHGSWELQIEAWQRP 424
           ++ F LAWD P   F  G  YYRRYT F+G     +  + R A+    +W+ QI  WQ+P
Sbjct: 326 KIPFILAWDFPVTEFGKGINYYRRYTDFFGRSGENSWAMVRTALKHSDTWQEQIINWQQP 385

Query: 425 ILEDKRLPEWYPITLFNELYYLNAGGAVWT--DGSPPV 460
           IL+   LP+W+ + LFNELY L  GG +W+  D   PV
Sbjct: 386 ILDRSDLPDWFKMALFNELYLLTDGGTIWSAADDRDPV 423


>gi|16330256|ref|NP_440984.1| hypothetical protein sll1775 [Synechocystis sp. PCC 6803]
 gi|383321997|ref|YP_005382850.1| hypothetical protein SYNGTI_1088 [Synechocystis sp. PCC 6803
           substr. GT-I]
 gi|383325166|ref|YP_005386019.1| hypothetical protein SYNPCCP_1087 [Synechocystis sp. PCC 6803
           substr. PCC-P]
 gi|383491050|ref|YP_005408726.1| hypothetical protein SYNPCCN_1087 [Synechocystis sp. PCC 6803
           substr. PCC-N]
 gi|384436317|ref|YP_005651041.1| hypothetical protein SYNGTS_1088 [Synechocystis sp. PCC 6803]
 gi|451814414|ref|YP_007450866.1| hypothetical protein MYO_110980 [Synechocystis sp. PCC 6803]
 gi|1652745|dbj|BAA17664.1| sll1775 [Synechocystis sp. PCC 6803]
 gi|339273349|dbj|BAK49836.1| hypothetical protein SYNGTS_1088 [Synechocystis sp. PCC 6803]
 gi|359271316|dbj|BAL28835.1| hypothetical protein SYNGTI_1088 [Synechocystis sp. PCC 6803
           substr. GT-I]
 gi|359274486|dbj|BAL32004.1| hypothetical protein SYNPCCN_1087 [Synechocystis sp. PCC 6803
           substr. PCC-N]
 gi|359277656|dbj|BAL35173.1| hypothetical protein SYNPCCP_1087 [Synechocystis sp. PCC 6803
           substr. PCC-P]
 gi|407958168|dbj|BAM51408.1| hypothetical protein BEST7613_2477 [Bacillus subtilis BEST7613]
 gi|451780383|gb|AGF51352.1| hypothetical protein MYO_110980 [Synechocystis sp. PCC 6803]
          Length = 817

 Score =  234 bits (596), Expect = 2e-58,   Method: Compositional matrix adjust.
 Identities = 142/394 (36%), Positives = 205/394 (52%), Gaps = 26/394 (6%)

Query: 518 GTNLLQDGEEN--IGQFLYLEGIEYLMWNTYDVHFYSSFALIMLFPKIQLSIQRDFAAAV 575
           G  L     EN  +GQF   E ++Y  + + DV  Y SF L+M +PK++ ++   FA A+
Sbjct: 404 GGTLWTAATENDPVGQFGVCECMDYRWYESLDVRLYGSFGLLMHWPKLEQAVMLAFARAI 463

Query: 576 MMHDPSKMKLLDDGQWVSRKVLGAVPHDIGICD--PWFEVNAYCLYDTARWKDLNPKFVL 633
              DP +  +  +     RKV  A PHD+G  +  PW   N     D   WKDL   FVL
Sbjct: 464 PTEDPKERVIGYNLSLAPRKVKNATPHDLGAPNEHPWERSNYTAYQDCNLWKDLGSDFVL 523

Query: 634 QVYRD--VVATGDKKFAKAVWPSVYVAMAYMDQFDRDGDGMIENDGFPDQTYDTWSVSGI 691
            VYRD  ++   D +F    WPS+  A+ Y+  FD D DG+ EN G PDQT+D W + GI
Sbjct: 524 LVYRDFLLLPDADGEFLGECWPSIVAALDYLKGFDLDNDGIPENSGAPDQTFDDWRLQGI 583

Query: 692 SAYSGGLWVAALQAASALAREVGDRGSE--DYFLFKFQKAKVVYE------------KLW 737
           SAY GGLW+AAL+AA  + + + +   +  D    + Q A   YE             LW
Sbjct: 584 SAYCGGLWIAALEAALVMGQYLLENPPQQTDLDAMEIQTAIARYETWLAQARSLYQDTLW 643

Query: 738 NGSYFNYDNSGSSQSSSIQADQLAGQWYARACGLLPIVDEDKARSALKKVYNYNVLKVMG 797
           NG Y+N D+   S S  + ADQL GQ+YAR   L  +V+   A  AL KVY+   LK   
Sbjct: 644 NGEYYNLDS--QSGSKVVMADQLCGQFYARLLNLPDVVEHHYAEKALAKVYDTCFLKFAH 701

Query: 798 GKRGAVNGMLPDGRV-DMSSMQSREIWSGVTYAVAASMIHEDLADIGFQTACGIYEAAWS 856
           G+ GA NG+LPDG   + +     E+W+G+ + +AA +I   + +   +    +    ++
Sbjct: 702 GELGAANGLLPDGSPQNPNDTHPLEVWTGINFGLAAFLIQMGMKEEALKMTETVVNQVYN 761

Query: 857 GTGLGYAFQTPEAWNTDDQYRSLCYMRPLAIWAM 890
               G  F+TPEA      +R+  Y+R +AIWA+
Sbjct: 762 N---GLQFRTPEAITAVGTFRASHYLRAMAIWAV 792



 Score =  198 bits (504), Expect = 9e-48,   Method: Compositional matrix adjust.
 Identities = 132/401 (32%), Positives = 190/401 (47%), Gaps = 48/401 (11%)

Query: 85  PFIKRHLTS-----SHGVPLGGVGSGSIGRSYRGEFQRWQIFPRVCEDKPVLANQFSVFV 139
           P+  R+ ++      HG+PLGG G+G IGRS  G+F  W +       K + A QFSV+ 
Sbjct: 26  PYTVRYASNLDDGPDHGMPLGGFGAGCIGRSPTGKFNLWHLDGGNHIFKNLPACQFSVYE 85

Query: 140 SRSNGQKYSSVLCPKTPEVLKDTTAAGIGSWDWNLKGDKSTYHALYPRAWTVHEGEPDPE 199
             +  +  +  +  + PE         +  W W  K +K  Y ALYPR+W  ++     E
Sbjct: 86  QVAGEEAKAYAMATEAPE------DGSLDRWQWYPK-EKGEYSALYPRSWYEYKDVFKSE 138

Query: 200 LRIVCRQISPIIPHNYKESSYPVSVFTYTIYNSGKTSADITLLFTWTNSVG--------- 250
           L  +C Q SP+   NY+ESSYP++VF +T++N       ++++ TW NSVG         
Sbjct: 139 L--ICEQFSPVWGGNYQESSYPLAVFDWTVHNPTDKPITLSIMLTWQNSVGWFTNAIKSP 196

Query: 251 -------GDSEFTGQHY------NSKTKMNDGVHAVLLHHRTS-HQL---PPVTFALAAQ 293
                  G  E+  Q        N    + DG     L +R   H+          +A+ 
Sbjct: 197 TVKLRDDGSPEYEYQPRWGDSTGNFNQWIQDGFRVGCLFNRVQPHEALREGEGQMCIASV 256

Query: 294 ETDGVHVSLCPHFVISGNSLGLTAKDMWHEIKEHGSFDRLNSMETSVTSEPGSSIGAAIA 353
               V V     +  SG+       D+W     +GS   L  ++    +EPG  I  AI 
Sbjct: 257 SNPSVEVFYHNRWNPSGD-----GADVWDWFAGNGS---LPDIQDETAAEPGEQIAGAIC 308

Query: 354 ASVTVPPDSEGQVTFSLAWDCPEVNFMSGKTYYRRYTKFYGTHQNAAANIARDAILEHGS 413
              TV P    ++ F LAWD P   F  G TYYRRYT FYG +     ++ R A+    +
Sbjct: 309 LRFTVRPGKTKKIPFLLAWDLPITEFAQGVTYYRRYTDFYGRNGKNVWSMIRTALKHSDT 368

Query: 414 WELQIEAWQRPILEDKRLPEWYPITLFNELYYLNAGGAVWT 454
           W  +IEAWQ PILE+ RLP+W  ++LFNELY L  GG +WT
Sbjct: 369 WRKKIEAWQAPILENPRLPDWLKMSLFNELYLLTQGGTLWT 409


>gi|17230308|ref|NP_486856.1| hypothetical protein alr2816 [Nostoc sp. PCC 7120]
 gi|17131910|dbj|BAB74515.1| alr2816 [Nostoc sp. PCC 7120]
          Length = 804

 Score =  234 bits (596), Expect = 2e-58,   Method: Compositional matrix adjust.
 Identities = 141/385 (36%), Positives = 209/385 (54%), Gaps = 28/385 (7%)

Query: 529 IGQFLYLEGIEYLMWNTYDVHFYSSFALIMLFPKIQLSIQRDFAAAVMMHDPSK------ 582
           IGQF  LE ++Y  + + DV  Y SF L+ LFP+++ ++ R FA A+   D +       
Sbjct: 420 IGQFAVLECLDYRWYESLDVRLYGSFGLLQLFPELEKAVMRAFARAIPQGDDTPRVIGYY 479

Query: 583 MKLLDDGQWVSRKVLGAVPHDIGICDP--WFEVNAYCLYDTARWKDLNPKFVLQVYRDVV 640
             +  +     RK  GA PHD+G  +   W + N     D   WKDL   FVLQVYRD +
Sbjct: 480 YTIGAESPIAVRKTPGATPHDLGAPNEHVWEKTNYTSYQDCNLWKDLGCDFVLQVYRDFL 539

Query: 641 ATG--DKKFAKAVWPSVYVAMAYMDQFDRDGDGMIENDGFPDQTYDTWSVSGISAYSGGL 698
            TG  D +F +  W ++   + Y+  FD DGDG+ EN G PDQT+D W + G+SAY GGL
Sbjct: 540 LTGADDVQFLRDCWDAIVETLDYVKTFDLDGDGIPENSGAPDQTFDDWRLQGVSAYCGGL 599

Query: 699 W----VAALQAASALAREVGDRGSEDYFLFK-------FQKAKVVY-EKLWNGSYFNYDN 746
           W     AA+  +  L +   D  ++++ L++         K+  +Y EKLWNG Y+  D+
Sbjct: 600 WLAALAAAIAISDILLQNHQDSATKEHLLYQKSTYETWLTKSLPIYQEKLWNGKYYRLDS 659

Query: 747 SGSSQSSSIQADQLAGQWYARACGLLPIVDEDKARSALKKVYNYNVLKVMGGKRGAVNGM 806
              S S  + ADQL GQ+YA    L  IV  D+A SAL+ VY+   LK   G+ GA NG+
Sbjct: 660 --ESGSDVVMADQLCGQFYANLLELPDIVPSDRAISALQTVYDACFLKFYDGQFGAANGV 717

Query: 807 LPDGRVDM-SSMQSREIWSGVTYAVAASMIHEDLADIGFQTACGIYEAAWSGTGLGYAFQ 865
            PDG  +   +    E+W+G+ + +AA ++   + D GF+    + E  ++    G  F+
Sbjct: 718 RPDGSPENPKATHPLEVWTGINFGLAAFLVQMGMKDEGFRLTQAVVEQIYNN---GLQFR 774

Query: 866 TPEAWNTDDQYRSLCYMRPLAIWAM 890
           TPEA      +R+  Y+RP+AIWA+
Sbjct: 775 TPEAITAAGTFRASTYLRPMAIWAI 799



 Score =  200 bits (509), Expect = 3e-48,   Method: Compositional matrix adjust.
 Identities = 136/396 (34%), Positives = 196/396 (49%), Gaps = 51/396 (12%)

Query: 95  HGVPLGGVGSGSIGRSYRGEFQRWQI--FPRVCEDKPVLANQFSVF-VSRSNGQKYSSVL 151
           HG+PLGG G+G IGRS RG+F  W I     + ++ P  A QFSVF  + ++ Q Y+   
Sbjct: 40  HGMPLGGFGAGCIGRSSRGDFNLWHIDGGEHIFQNFP--ACQFSVFEANGTSSQAYALST 97

Query: 152 CPKTPEVLKDTTAAGIGSWDWNLKGDKS----TYHALYPRAWTVHEGEPDPELRIVCRQI 207
            P   E LK        SW W     ++    TYHALYPR+W V+E     EL   C Q 
Sbjct: 98  QPTDDESLK--------SWQWYPASSETQTTGTYHALYPRSWFVYENVFQAEL--TCEQF 147

Query: 208 SPIIPHNYKESSYPVSVFTYTIYNSGKTSADITLLFTWTNSVGGDSEFTGQHYNSKTKMN 267
           SPI  +NY+E+SYPV+VF +  +N       ++++ TW N VG    FT    +   ++ 
Sbjct: 148 SPIWANNYQETSYPVAVFAWQAHNPTNAPITLSIMLTWQNMVGW---FTNALKSPDVRVR 204

Query: 268 D---------------GVHAVLLHHRTSHQ---LPPVTFALAAQETDGV-------HVSL 302
           D               G +   L     H    L  V      QE +G        H  +
Sbjct: 205 DDGSPVYEYQPRLGESGGNYNYLDESPQHLGCFLGRVGMTEPLQEGEGSWCIATLKHSQV 264

Query: 303 CPHFVISGNSLGLTAKDMWHEIKEHGSFDRLNSMETSVTSEPGSSIGAAIAASVTVPPDS 362
              +    N +G T +++W      GS    N ++TS  +E    +GAAIA   T+ P  
Sbjct: 265 EVFYHTRWNPVG-TGEEVWQSFAADGSL--ANHIDTSSVTE-NEQLGAAIAVRFTLQPGE 320

Query: 363 EGQVTFSLAWDCPEVNFMSGKTYYRRYTKFYGTHQNAAANIARDAILEHGSWELQIEAWQ 422
             ++ F ++WD P   F +G  YYRRYT F+G   + A  IA  A+ ++ +W+ QIE+WQ
Sbjct: 321 TLEIPFVVSWDLPVTEFAAGVNYYRRYTDFFGKTGDHAWAIATTALEQYKTWQQQIESWQ 380

Query: 423 RPILEDKRLPEWYPITLFNELYYLNAGGAVWTDGSP 458
            PIL    LP+W+ + LFNELY L +GG +W+  +P
Sbjct: 381 DPILNRPDLPDWFKMALFNELYDLTSGGTLWSAATP 416


>gi|225593155|gb|ACN96073.1| pedicted bile acid beta-glucosidase [Fischerella sp. MV11]
          Length = 812

 Score =  233 bits (595), Expect = 3e-58,   Method: Compositional matrix adjust.
 Identities = 142/392 (36%), Positives = 205/392 (52%), Gaps = 35/392 (8%)

Query: 529 IGQFLYLEGIEYLMWNTYDVHFYSSFALIMLFPKIQLSIQRDFAAAVMMHDPSK------ 582
           IGQF  LE ++Y  + + DV  Y SFAL+MLFP+++ ++ R FA A+ + D +       
Sbjct: 421 IGQFAVLECLDYRWYESLDVRLYGSFALLMLFPELEKAVMRAFARAIPVSDSTTRVIGYY 480

Query: 583 MKLLDDGQWVSRKVLGAVPHDIGICDP--WFEVNAYCLYDTARWKDLNPKFVLQVYRDVV 640
             +  +  +  RKV GA PHD+G  +   W + N     D   WKDL   FVLQVYRD +
Sbjct: 481 YTVGAESPFAIRKVAGATPHDLGAPNEHVWEKTNYTSYQDCNLWKDLGCDFVLQVYRDFL 540

Query: 641 ATG--DKKFAKAVWPSVYVAMAYMDQFDRDGDGMIENDGFPDQTYDTWSVSGISAYSGGL 698
            TG  D  F    W ++   + Y+  FD D DG+ EN G PDQT+D W + G+SAY GGL
Sbjct: 541 FTGADDITFLAECWNAIVQTLDYLKTFDIDNDGIPENSGAPDQTFDDWRLQGVSAYCGGL 600

Query: 699 W--------------VAALQAASALAREVGDRG----SEDYFLFKFQKAKVVYE-KLWNG 739
           W              +   +      + +GD G     +  +    Q+++ +YE KLWNG
Sbjct: 601 WLAALEAAIAICDVLITNRRDTENAEKFLGDLGDLAVQKSIYETWLQQSRPIYEAKLWNG 660

Query: 740 SYFNYDNSGSSQSSSIQADQLAGQWYARACGLLPIVDEDKARSALKKVYNYNVLKVMGGK 799
            Y+  D+   S S  + ADQL GQ+YAR  GL  IV  D+A SALK VYN   LK   G+
Sbjct: 661 QYYRLDS--ESGSDVVMADQLCGQFYARLLGLPDIVKRDRALSALKTVYNACFLKFHHGQ 718

Query: 800 RGAVNGMLPDGRVDM-SSMQSREIWSGVTYAVAASMIHEDLADIGFQTACGIYEAAWSGT 858
            GA NG+ PDG  +   +    E+W+G+ + +AA M+   + D   +    +    +   
Sbjct: 719 FGAANGLRPDGSPENPQATHPLEVWTGINFGLAAFMVQMGMKDEAMKLTEAVVRQVYEN- 777

Query: 859 GLGYAFQTPEAWNTDDQYRSLCYMRPLAIWAM 890
             G  F+TPEA      +R+  Y+R +AIWA+
Sbjct: 778 --GLQFRTPEAITAVGTFRASTYLRAMAIWAI 807



 Score =  204 bits (519), Expect = 2e-49,   Method: Compositional matrix adjust.
 Identities = 133/389 (34%), Positives = 190/389 (48%), Gaps = 44/389 (11%)

Query: 95  HGVPLGGVGSGSIGRSYRGEFQRWQIFPRVCEDKPVLANQFSVFVSRSNGQKYSSVLCPK 154
           HG+PLGG G+G IGRS RG+F  W I       K +   QFSVF  R+     +  LC +
Sbjct: 40  HGMPLGGFGAGCIGRSSRGDFNLWHIDGGEHVFKNIPGCQFSVF-ERTEAGTQAYALCTQ 98

Query: 155 TPEVLKDTTAAGIGSWDW-----NLKGDKSTYHALYPRAWTVHEGEPDPELRIVCRQISP 209
            PE         + +W W     N +G   TYHALYPR+W V+E     +L   C Q SP
Sbjct: 99  PPE------DGSLAAWQWYPTIKNEEGITGTYHALYPRSWFVYEQVYKTQL--TCEQFSP 150

Query: 210 IIPHNYKESSYPVSVFTYTIYNSGKTSADITLLFTWTNSVGGDSEFTGQHYNSKTKMNDG 269
           +  +NY+E+SYPV++F +  +N    S  ++++ TW N VG    FT    + + +M D 
Sbjct: 151 LWANNYQETSYPVAIFVWKAHNPTNASIALSIMLTWENMVGW---FTNALKSPEVRMRDD 207

Query: 270 VHAVL---------------LHHRTSH---QLPPVTFALAAQETDG---VHVSLCPHFVI 308
              V                L   + H    L  V      QE +G   +     P   I
Sbjct: 208 GSPVYEYQPRLGESKDNFNQLVENSEHIGCILGRVGVDQPIQEGEGQWCIATRKHPKLEI 267

Query: 309 SGNSLGL---TAKDMWHEIKEHGSFDRLNSMETSVTSEPGSSIGAAIAASVTVPPDSEGQ 365
             +S      T +D+W    + GS        ++V    G  I AA+A   T+ P    +
Sbjct: 268 FYHSRWQPRGTGEDVWQSFAQDGSLPNYLDHASAVE---GEQIAAALAVRFTLQPGETIE 324

Query: 366 VTFSLAWDCPEVNFMSGKTYYRRYTKFYGTHQNAAANIARDAILEHGSWELQIEAWQRPI 425
           + F L WD P   F +G  YYRRYT F+G     A  IA  A+ ++ +W+ QI+AWQ+PI
Sbjct: 325 IPFVLTWDFPVTEFAAGINYYRRYTDFFGNDGKNAWAIALIALKQYQAWQAQIQAWQKPI 384

Query: 426 LEDKRLPEWYPITLFNELYYLNAGGAVWT 454
           L+   LP+W+ + LFNELY L+AGG +W+
Sbjct: 385 LDRADLPDWFKMALFNELYDLSAGGTLWS 413


>gi|119493962|ref|ZP_01624522.1| hypothetical protein L8106_04106 [Lyngbya sp. PCC 8106]
 gi|119452283|gb|EAW33479.1| hypothetical protein L8106_04106 [Lyngbya sp. PCC 8106]
          Length = 799

 Score =  233 bits (595), Expect = 3e-58,   Method: Compositional matrix adjust.
 Identities = 141/391 (36%), Positives = 210/391 (53%), Gaps = 23/391 (5%)

Query: 518 GTNLLQDGEEN--IGQFLYLEGIEYLMWNTYDVHFYSSFALIMLFPKIQLSIQRDFAAAV 575
           G  L    +EN   GQF  LE ++Y  + + DV  Y SF L++L+P++  S+   FA ++
Sbjct: 408 GGTLWSAADENNPKGQFAVLECLDYRWYESLDVRLYGSFGLLLLWPELDKSVLLAFARSI 467

Query: 576 MMHDPSKMKLLDDGQWVSRKVLGAVPHDIGICD--PWFEVNAYCLYDTARWKDLNPKFVL 633
              D +   +  +     RK  GA PHD+G  +  PW + N     D   WKDL   FVL
Sbjct: 468 SKGDKTPRIIGYNQAEAIRKATGATPHDLGAPNEHPWEKTNYTSYQDCNLWKDLPCDFVL 527

Query: 634 QVYRDVVATG--DKKFAKAVWPSVYVAMAYMDQFDRDGDGMIENDGFPDQTYDTWSVSGI 691
           QVYRD V TG  D +F +  W ++   + Y+  FD+D DG+ EN G PDQT+D W + G+
Sbjct: 528 QVYRDFVLTGSEDYEFLQDCWLAIVETLDYLKIFDKDEDGIPENSGAPDQTFDDWKLQGV 587

Query: 692 SAYSGGLWVAALQAASALAR-----EVGDRGSEDY------FLFKFQKAKVVYEKLWNGS 740
           SAY GGLW+AAL+AA  + +     E  D  S         +    +KAK +Y+ LWNG 
Sbjct: 588 SAYCGGLWLAALEAAIEIGKCLQKTEFSDNVSPSISDKISTYETWLEKAKPIYQNLWNGQ 647

Query: 741 YFNYDNSGSSQSSSIQADQLAGQWYARACGLLPIVDEDKARSALKKVYNYNVLKVMGGKR 800
           YF  D+   S S  + ADQL GQ+YA+  GL  IV  +K  SAL+ +Y         G+ 
Sbjct: 648 YFQLDS--QSGSDVVMADQLCGQFYAKLLGLEDIVSPEKTISALQTIYQSCFQNFYQGQF 705

Query: 801 GAVNGMLPDGR-VDMSSMQSREIWSGVTYAVAASMIHEDLADIGFQTACGIYEAAWSGTG 859
           GA NG+  +G  ++ +     E+W+G+ + +AA +I   + + GF+ A  + +  +    
Sbjct: 706 GAANGVRINGEPINPNDTHPLEVWTGINFGLAAFLIQMGMKEEGFKLAEVVVQQIYEN-- 763

Query: 860 LGYAFQTPEAWNTDDQYRSLCYMRPLAIWAM 890
            G  F+TPEA      +R+  Y+R +AIWA+
Sbjct: 764 -GLQFRTPEAITAAGTFRASHYLRAMAIWAI 793



 Score =  197 bits (502), Expect = 2e-47,   Method: Compositional matrix adjust.
 Identities = 123/386 (31%), Positives = 194/386 (50%), Gaps = 39/386 (10%)

Query: 95  HGVPLGGVGSGSIGRSYRGEFQRWQIFPRVCEDKPVLANQFSVFVSRSNGQKYSSVLCPK 154
           HG+PLGG G+G IGRS RG+F  W +       + + A QFS+F    NG+  +  LC +
Sbjct: 41  HGMPLGGFGAGCIGRSSRGDFNLWHLDGGEHIFRSLPACQFSIF-EEVNGKTQTYALCTE 99

Query: 155 TPEVLKDTTAAGIGSWDWNLKGDKS----TYHALYPRAWTVHEGEPDPELRIVCRQISPI 210
            PE         + +W+W      S    TYHALYPR+W V+E     +L   C Q SP+
Sbjct: 100 PPE------DESLSAWNWYPASSNSQKTGTYHALYPRSWFVYENVFKAQLS--CEQFSPV 151

Query: 211 IPHNYKESSYPVSVFTYTIYNSGKTSADITLLFTWTNSVGGDSEFTGQH----------- 259
             +NY+E+SYP++VF +  +N       +++L TW N++G  +    Q            
Sbjct: 152 WANNYQETSYPIAVFEWIAHNPTDEPITLSILLTWQNTIGWFTNALEQPLVQVRDDGSPI 211

Query: 260 YNSKTKMNDGV----HAVLLHHRTSHQLPPVTFALAAQETDG-------VHVSLCPHFVI 308
           Y  + +  + V    H V   HR    +  ++ +   +E +G        + ++  ++  
Sbjct: 212 YEYQPRWGNSVGNFNHLVEDFHRIGCLMSRLSASEEVREGEGQMAIATITNPAVEVYYHT 271

Query: 309 SGNSLGLTAKDMWHEIKEHGSFDRLNSMETSVTSEPGSSIGAAIAASVTVPPDSEGQVTF 368
           + N++G T +++W+   + G+   L   E+   +  G  IG A+A   T+ P    ++ F
Sbjct: 272 AWNTVG-TGEEIWNYFSQDGT---LLDEESEHPATEGRQIGVALAVKFTIRPGKTRKIPF 327

Query: 369 SLAWDCPEVNFMSGKTYYRRYTKFYGTHQNAAANIARDAILEHGSWELQIEAWQRPILED 428
            L+WD P   F SG  YYRRYT F+G +   A ++ R A+     W   I+AWQ PIL  
Sbjct: 328 LLSWDLPVTEFESGVGYYRRYTDFFGRNGKNAWSMIRTAMKHSDVWLDNIKAWQNPILNR 387

Query: 429 KRLPEWYPITLFNELYYLNAGGAVWT 454
           + LP W+ + LFNELY L  GG +W+
Sbjct: 388 QDLPSWFKMALFNELYLLTDGGTLWS 413


>gi|427712531|ref|YP_007061155.1| putative bile acid beta-glucosidase [Synechococcus sp. PCC 6312]
 gi|427376660|gb|AFY60612.1| putative bile acid beta-glucosidase [Synechococcus sp. PCC 6312]
          Length = 799

 Score =  233 bits (594), Expect = 4e-58,   Method: Compositional matrix adjust.
 Identities = 144/381 (37%), Positives = 211/381 (55%), Gaps = 25/381 (6%)

Query: 529 IGQFLYLEGIEYLMWNTYDVHFYSSFALIMLFPKIQLSIQRDFAAAVMMHDPSKMKLL-- 586
           +GQ+  LE ++Y  + + DV  Y SFAL+ L+P+++  + R FA A+   DP K++++  
Sbjct: 414 LGQYAVLECLDYRWYESLDVRLYGSFALLYLWPELEKGVMRAFARAIPTEDP-KLRIIGY 472

Query: 587 ------DDGQWVSRKVLGAVPHDIGICD--PWFEVNAYCLYDTARWKDLNPKFVLQVYRD 638
                 +      RK+  A PHD+G  +  PW   N     D   WKDL   FVL VYRD
Sbjct: 473 YYQGDPETAHRAPRKIAHATPHDLGAPNEHPWQASNYTSYQDCNLWKDLASDFVLLVYRD 532

Query: 639 VVATG--DKKFAKAVWPSVYVAMAYMDQFDRDGDGMIENDGFPDQTYDTWSVSGISAYSG 696
            + TG  D +F    WP++  ++ Y+  FD DGDG+ EN G PDQTYD W + G+SAY G
Sbjct: 533 YLLTGATDLEFLTDCWPAIVSSLDYLKTFDTDGDGLPENSGAPDQTYDDWQLKGVSAYCG 592

Query: 697 GLWVAALQAASALAREVGDRGS------EDYFLFKFQKAKVVYEKLWNGSYFNYDNSGSS 750
           GLW+AAL+AA A+A+ +  +G+      E +  +  Q       KLWNG ++  D+  +S
Sbjct: 593 GLWIAALEAALAIAQILEKQGNNTKLEQEKWQAWLSQSRPTYQAKLWNGQFYRLDSDSNS 652

Query: 751 QSSSIQADQLAGQWYARACGLLPIVDEDKARSALKKVYNYNVLKVMGGKRGAVNGMLPDG 810
           Q   + ADQL GQ+YAR   L  +V  + A SALK VY+   LK   G+ GA NG+LPDG
Sbjct: 653 Q--VVMADQLCGQYYARLLNLPDVVPIECAESALKTVYDVCFLKFHEGQFGAANGLLPDG 710

Query: 811 RVDM-SSMQSREIWSGVTYAVAASMIHEDLADIGFQTACGIYEAAWSGTGLGYAFQTPEA 869
             +  ++    E+W G+ Y +AA +    L D  ++    +    +     G  F+TPEA
Sbjct: 711 SPEKPNATHPLEVWIGINYGLAAFLYQMGLEDGAWRLTQTVIRQIYDN---GLQFRTPEA 767

Query: 870 WNTDDQYRSLCYMRPLAIWAM 890
                 +R+  Y+RP+AIWAM
Sbjct: 768 ITAVGTFRACMYLRPMAIWAM 788



 Score =  183 bits (465), Expect = 4e-43,   Method: Compositional matrix adjust.
 Identities = 126/394 (31%), Positives = 184/394 (46%), Gaps = 53/394 (13%)

Query: 95  HGVPLGGVGSGSIGRSYRGEFQRWQIFPRVCEDKPVLANQFSVFVSRSNGQKYSSVLCPK 154
           HG+PLGG G+G IGRS RG+F  W +       + +   QFSVF  ++     +  LC +
Sbjct: 38  HGMPLGGFGAGCIGRSSRGDFNLWHLDGGEHIFQTMPGCQFSVF-EQAGDHSQAYALCTE 96

Query: 155 TPEVLKDTTAAGIGSWDWNLKGDKSTYHALYPRAWTVHEGEPDPELRIVCRQISPIIPHN 214
            P      +   +G+W+W     K TY ALYPR+W  ++G    +  + C Q SPIIP N
Sbjct: 97  PP------SDGTLGAWNWYPTA-KGTYGALYPRSWFQYQGVF--QAGLTCEQFSPIIPDN 147

Query: 215 YKESSYPVSVFTYTIYNSGKTSADITLLFTWTNSVGGDSEFT------------------ 256
           Y+E+SYPV+VF +  +N       ++++ +W N VG  S  T                  
Sbjct: 148 YQETSYPVAVFVWAAHNPTDQPITLSIMLSWQNLVGWFSNATPASEIRIRDDGSPVYDYT 207

Query: 257 ---GQ---HYNSKTKMNDGVHAVLLHHRTSHQLPPVTFALAAQETDGVHVSLCPHFVISG 310
              GQ   +YNS  +       +L   R  H         A +E +G      P      
Sbjct: 208 PRLGQSQGNYNSFHQSQTQASCLLQRQRDPH--------TAIKEGEGEWAFSVP----VR 255

Query: 311 NSLGLTAKDMWHEIKEHG----SFDRLNSMETSVTSEPGS---SIGAAIAASVTVPPDSE 363
           + L +     W+ + + G    SF R  S+     ++P S    IGA +A   T+ P   
Sbjct: 256 DDLEIFYHTRWNPVGDGGELWSSFSRDGSLSNLENNQPASDIEQIGATLAVRFTLAPGES 315

Query: 364 GQVTFSLAWDCPEVNFMSGKTYYRRYTKFYGTHQNAAANIARDAILEHGSWELQIEAWQR 423
            ++ F LAWD P   F     Y+RRYT F+      A  IA+ A+  +  W  QI AWQ+
Sbjct: 316 KEIPFGLAWDLPVTEFAPNVNYFRRYTDFWNRSGTNAETIAQVALENYQKWSEQIGAWQQ 375

Query: 424 PILEDKRLPEWYPITLFNELYYLNAGGAVWTDGS 457
           PILE    P+ + + L NELY L +GG++WT  S
Sbjct: 376 PILEHPDWPDDFKMALINELYDLTSGGSLWTAAS 409


>gi|414077215|ref|YP_006996533.1| beta-glucosidase [Anabaena sp. 90]
 gi|413970631|gb|AFW94720.1| putative beta-glucosidase [Anabaena sp. 90]
          Length = 795

 Score =  233 bits (593), Expect = 5e-58,   Method: Compositional matrix adjust.
 Identities = 143/383 (37%), Positives = 212/383 (55%), Gaps = 26/383 (6%)

Query: 529 IGQFLYLEGIEYLMWNTYDVHFYSSFALIMLFPKIQLSIQRDFAAAVMMHDPSK------ 582
           +GQF  LE ++Y  + + DV  Y SF L+MLFP+++ S+ R FA A+   D         
Sbjct: 413 VGQFAVLECLDYRWYESLDVRLYGSFGLLMLFPELEKSVIRAFARAIPQSDDRTRIIGYY 472

Query: 583 MKLLDDGQWVSRKVLGAVPHDIGICDP--WFEVNAYCLYDTARWKDLNPKFVLQVYRDVV 640
           + +  +     RK+ GA PHD+G  +   W + N     D   WKDL   FVLQVYRD +
Sbjct: 473 LTIKSESPNAVRKIAGATPHDLGAPNEHVWEKTNYTSYQDCNLWKDLGCDFVLQVYRDFL 532

Query: 641 ATG--DKKFAKAVWPSVYVAMAYMDQFDRDGDGMIENDGFPDQTYDTWSVSGISAYSGGL 698
            TG  D +F    W ++   + Y+  FD DGD + EN G PDQT+D W + G+SAY GGL
Sbjct: 533 FTGANDLQFLADCWDAIVQTLDYVKTFDLDGDSIPENSGAPDQTFDDWRLQGVSAYCGGL 592

Query: 699 WVAALQAASALAREVGD-RGSED---------YFLFKFQKAKVVYEKLWNGSYFNYDNSG 748
           W+AAL+AA A+++ + + RG+ED         Y  +  Q   +  EKLWNG Y+  D+  
Sbjct: 593 WLAALEAAIAISKILTNRRGAEDAEVEVQGSIYQGWLSQSKPIYQEKLWNGEYYRLDS-- 650

Query: 749 SSQSSSIQADQLAGQWYARACGLLPIVDEDKARSALKKVYNYNVLKVMGGKRGAVNGMLP 808
            S S  + ADQL GQ+YAR   L  IV  D+A SAL KVY+   LK   G+ GA NG+LP
Sbjct: 651 DSGSDVVMADQLCGQFYARLLSLPDIVPSDRALSALTKVYDSCFLKFQNGEFGAANGVLP 710

Query: 809 DGRVDM-SSMQSREIWSGVTYAVAASMIHEDLADIGFQTACGIYEAAWSGTGLGYAFQTP 867
           +G  +  +S    E+W+G+ + +AA ++  ++ +   +    +    +     G  F+TP
Sbjct: 711 NGLPENPNSTHPLEVWTGINFGLAAFLVQMNMKNEAMRLTEAVVRQIYDN---GLQFRTP 767

Query: 868 EAWNTDDQYRSLCYMRPLAIWAM 890
           EA   +  +R+  Y+R +AIW +
Sbjct: 768 EAITANGTFRASTYLRAMAIWGI 790



 Score =  198 bits (503), Expect = 1e-47,   Method: Compositional matrix adjust.
 Identities = 137/391 (35%), Positives = 189/391 (48%), Gaps = 48/391 (12%)

Query: 95  HGVPLGGVGSGSIGRSYRGEFQRWQIFPRVCEDKPVLANQFSVFVSRSNGQKYSSVLCPK 154
           HG+PLGG G+G IGRS RG+F  W I       K V A QFS+F S   GQ Y+    P 
Sbjct: 38  HGMPLGGFGAGCIGRSSRGDFNLWHIDGGEHTFKTVPACQFSIFES---GQAYALSTQPP 94

Query: 155 TPEVLKDTTAAGIGSWDW---NLKGDKSTYHALYPRAWTVHEGEPDPELRIVCRQISPII 211
               L+        SW W   + + +  TYHALYPR+W V+E     +L   C Q SPI 
Sbjct: 95  EDHTLQ--------SWQWYPQSQEQNSGTYHALYPRSWFVYENVLKTQL--TCEQFSPIW 144

Query: 212 PHNYKESSYPVSVFTYTIYNSGKTSADITLLFTWTNSVGGDSEFTGQHYNSKTKMND--- 268
             NY+ESSYPV+VF +  +N       I+++ TW N VG    FT    + + ++ D   
Sbjct: 145 ADNYQESSYPVAVFLWKAHNPTNAPITISIMLTWENMVGW---FTNTLKSPEVRVRDDGS 201

Query: 269 -------------GVHAVLL--HHRTSHQLPPVTFALAAQETDG---VHVSLCPHFVI-- 308
                        G +  L   H      L  V+     QE DG   +  +  P   I  
Sbjct: 202 PVYEYLPRWGESQGNYNCLAEDHQYFGCVLGQVSDHQTVQEGDGTWCIATAKNPQVEIFY 261

Query: 309 --SGNSLGLTAKDMWHEIKEHGSFDRLNSMETSVTSEPGSSIGAAIAASVTVPPDSEGQV 366
               N +G   +D+W     +GS    N ++ +   E  + +GAAIA   T+ P    +V
Sbjct: 262 HSRFNPVG-NGEDVWQSFSANGSLP--NYIDETPADE-NTRLGAAIAVRFTLQPGENLEV 317

Query: 367 TFSLAWDCPEVNFMSGKTYYRRYTKFYGTHQNAAANIARDAILEHGSWELQIEAWQRPIL 426
              L+WD P   F +G  YYRRYT F+G +   A  IA  A+ E+ +W   I+ WQ PIL
Sbjct: 318 PLVLSWDFPVTEFAAGVNYYRRYTDFFGCNGENAWQIATTALKEYQNWRSLIQDWQNPIL 377

Query: 427 EDKRLPEWYPITLFNELYYLNAGGAVWTDGS 457
               LP+W+ + LFNELY L +GG +W+  S
Sbjct: 378 AKSDLPDWFKMALFNELYDLTSGGTLWSAAS 408


>gi|354566897|ref|ZP_08986068.1| Glucosylceramidase [Fischerella sp. JSC-11]
 gi|353544556|gb|EHC14010.1| Glucosylceramidase [Fischerella sp. JSC-11]
          Length = 812

 Score =  232 bits (591), Expect = 9e-58,   Method: Compositional matrix adjust.
 Identities = 145/392 (36%), Positives = 205/392 (52%), Gaps = 35/392 (8%)

Query: 529 IGQFLYLEGIEYLMWNTYDVHFYSSFALIMLFPKIQLSIQRDFAAAVMMHDPSK------ 582
           IGQF  LE ++Y  + + DV  Y SFAL+MLFP+++ ++ R FA A+ + D +       
Sbjct: 421 IGQFAVLECLDYRWYESLDVRLYGSFALLMLFPELEKAVMRAFARAIPVSDQTTRVIGYY 480

Query: 583 MKLLDDGQWVSRKVLGAVPHDIGICDP--WFEVNAYCLYDTARWKDLNPKFVLQVYRDVV 640
             +  +  +  RKV GA PHD+G  +   W + N     D   WKDL   FVLQVYRD +
Sbjct: 481 YTVGAESPFAIRKVAGATPHDLGAPNEHVWEKTNYTSYQDCNLWKDLGCDFVLQVYRDFL 540

Query: 641 ATG--DKKFAKAVWPSVYVAMAYMDQFDRDGDGMIENDGFPDQTYDTWSVSGISAYSGGL 698
            TG  D  F    W ++   + Y+  FD D DG+ EN G PDQT+D W + G+SAY GGL
Sbjct: 541 FTGADDITFLAECWNAIVETLDYLKTFDIDNDGIPENSGAPDQTFDDWRLQGVSAYCGGL 600

Query: 699 W----VAALQAASALAREVGDRGSEDYFLFK--------------FQKAKVVYE-KLWNG 739
           W     AA+     L     D  + + FL                 Q+++ +YE KLWNG
Sbjct: 601 WLAALEAAIAICDVLITNRRDTDNAENFLGDLGDLAVQKSIYETWLQQSRPIYEAKLWNG 660

Query: 740 SYFNYDNSGSSQSSSIQADQLAGQWYARACGLLPIVDEDKARSALKKVYNYNVLKVMGGK 799
            Y+  D+   S S  + ADQL GQ+YAR  GL  IV  D+A SALK VYN   LK   G+
Sbjct: 661 QYYRLDS--ESGSDVVMADQLCGQFYARLLGLPDIVKRDRALSALKTVYNACFLKFHHGQ 718

Query: 800 RGAVNGMLPDGRVDM-SSMQSREIWSGVTYAVAASMIHEDLADIGFQTACGIYEAAWSGT 858
            GA NG+ PDG  +   +    E+W+G+ + +AA M+   + D   +    +    +   
Sbjct: 719 FGAANGLRPDGSPENPQATHPLEVWTGINFGLAAFMVQMGMKDEAMKLTETVVRQIYEN- 777

Query: 859 GLGYAFQTPEAWNTDDQYRSLCYMRPLAIWAM 890
             G  F+TPEA      +R+  Y+R +AIWA+
Sbjct: 778 --GLQFRTPEAITAVGTFRASTYLRAMAIWAI 807



 Score =  207 bits (527), Expect = 2e-50,   Method: Compositional matrix adjust.
 Identities = 136/389 (34%), Positives = 190/389 (48%), Gaps = 44/389 (11%)

Query: 95  HGVPLGGVGSGSIGRSYRGEFQRWQIFPRVCEDKPVLANQFSVFVSRSNGQKYSSVLCPK 154
           HG+PLGG G+G IGRS RG+F  W I       K +   QFSVF  R+     +  LC +
Sbjct: 40  HGMPLGGFGAGCIGRSSRGDFNLWHIDGGEHVFKNIPGCQFSVF-ERTGAGTQAYALCTQ 98

Query: 155 TPEVLKDTTAAGIGSWDW-----NLKGDKSTYHALYPRAWTVHEGEPDPELRIVCRQISP 209
            PE         + +W W     N +G   TYHALYPR+W V+E     +L   C Q SP
Sbjct: 99  PPE------DGSLAAWQWYPTIKNEEGITGTYHALYPRSWFVYEQVYKTQL--TCEQFSP 150

Query: 210 IIPHNYKESSYPVSVFTYTIYNSGKTSADITLLFTWTNSVGGDSEFTGQHYNSKTKMNDG 269
           I  +NY+E+SYPV++F +  +N    S  ++++ TW N VG    FT    + + KM D 
Sbjct: 151 IWANNYQETSYPVAIFVWKAHNPTNASIALSIMLTWENMVGW---FTNALKSPEVKMRDD 207

Query: 270 VHAVL---------------LHHRTSH---QLPPVTFALAAQETDG---VHVSLCPHFVI 308
              V                L   + H    L  V      QE +G   +     P   I
Sbjct: 208 GSPVYEYQPPLGESKDNFNQLVENSEHIGCVLGRVGVDQPIQEGEGQWCIATRKHPKLEI 267

Query: 309 SGNSLGL---TAKDMWHEIKEHGSFDRLNSMETSVTSEPGSSIGAAIAASVTVPPDSEGQ 365
             +S      T +D+W    + GS        ++V    G  I AA+A   T+ P    +
Sbjct: 268 FYHSRWQPRGTGEDVWQSFAQDGSLPNYLDHTSAVE---GEQIAAALAVRFTLQPGETLE 324

Query: 366 VTFSLAWDCPEVNFMSGKTYYRRYTKFYGTHQNAAANIARDAILEHGSWELQIEAWQRPI 425
           + F L WD P   F +G  YYRRYT F+G     A  IA  A+ ++ +W+ QI+AWQ+PI
Sbjct: 325 IPFVLTWDFPVTEFAAGINYYRRYTDFFGNDGKNAWAIALIALKQYQAWQAQIQAWQKPI 384

Query: 426 LEDKRLPEWYPITLFNELYYLNAGGAVWT 454
           LE   LP+W+ + LFNELY L+AGG +W+
Sbjct: 385 LERADLPDWFKMALFNELYDLSAGGTLWS 413


>gi|7470943|pir||T31071 conserved hypothetical protein - Anabaena sp. (fragment)
 gi|2828799|gb|AAC38131.1|AAC38131 unknown [Nostoc sp. PCC 7120]
          Length = 439

 Score =  231 bits (589), Expect = 1e-57,   Method: Compositional matrix adjust.
 Identities = 141/385 (36%), Positives = 208/385 (54%), Gaps = 28/385 (7%)

Query: 529 IGQFLYLEGIEYLMWNTYDVHFYSSFALIMLFPKIQLSIQRDFAAAVMMHDPSK------ 582
           IGQF  LE ++Y  + + DV  Y SF L+ LFP+++ ++ R FA A+   D +       
Sbjct: 55  IGQFAVLECLDYRWYESLDVRLYGSFGLLQLFPELEKAVMRAFARAIPQGDDTPRVIGYY 114

Query: 583 MKLLDDGQWVSRKVLGAVPHDIGICDP--WFEVNAYCLYDTARWKDLNPKFVLQVYRDVV 640
             +  +     RK  GA PHD+G  +   W + N     D   WKDL   FVLQVYRD +
Sbjct: 115 YTIGAESPIAVRKTPGATPHDLGAPNEHVWEKTNYTSYQDCNLWKDLGCDFVLQVYRDFL 174

Query: 641 ATG--DKKFAKAVWPSVYVAMAYMDQFDRDGDGMIENDGFPDQTYDTWSVSGISAYSGGL 698
            TG  D +F +  W ++   + Y+  FD DGDG+ EN G PDQT+D W + G+SAY GGL
Sbjct: 175 LTGADDVQFLRDCWDAIVETLDYVKTFDLDGDGIPENSGAPDQTFDDWRLQGVSAYCGGL 234

Query: 699 W----VAALQAASALAREVGDRGSEDYFLFK-------FQKAKVVY-EKLWNGSYFNYDN 746
           W     AA+  +  L +   D  ++ + L++         K+  +Y EKLWNG Y+  D+
Sbjct: 235 WLAALAAAIAISDILLQNHQDSATKGHLLYQKSTYETWLTKSLPIYQEKLWNGKYYRLDS 294

Query: 747 SGSSQSSSIQADQLAGQWYARACGLLPIVDEDKARSALKKVYNYNVLKVMGGKRGAVNGM 806
              S S  + ADQL GQ+YA    L  IV  D+A SAL+ VY+   LK   G+ GA NG+
Sbjct: 295 --ESGSDVVMADQLCGQFYANLLELPDIVPSDRAISALQTVYDACFLKFYDGQFGAANGV 352

Query: 807 LPDGRVDM-SSMQSREIWSGVTYAVAASMIHEDLADIGFQTACGIYEAAWSGTGLGYAFQ 865
            PDG  +   +    E+W+G+ + +AA ++   + D GF+    + E  ++    G  F+
Sbjct: 353 RPDGSPENPKATHPLEVWTGINFGLAAFLVQMGMKDEGFRLTQAVVEQIYNN---GLQFR 409

Query: 866 TPEAWNTDDQYRSLCYMRPLAIWAM 890
           TPEA      +R+  Y+RP+AIWA+
Sbjct: 410 TPEAITAAGTFRASTYLRPMAIWAI 434



 Score = 59.7 bits (143), Expect = 8e-06,   Method: Compositional matrix adjust.
 Identities = 23/49 (46%), Positives = 35/49 (71%)

Query: 410 EHGSWELQIEAWQRPILEDKRLPEWYPITLFNELYYLNAGGAVWTDGSP 458
           ++ +W+ QIE+WQ PIL    LP+W+ + LFNELY L +GG +W+  +P
Sbjct: 3   QYKTWQQQIESWQDPILNRPDLPDWFKMALFNELYDLTSGGTLWSAATP 51


>gi|81299663|ref|YP_399871.1| hypothetical protein Synpcc7942_0854 [Synechococcus elongatus PCC
           7942]
 gi|81168544|gb|ABB56884.1| conserved hypothetical protein [Synechococcus elongatus PCC 7942]
          Length = 798

 Score =  229 bits (584), Expect = 6e-57,   Method: Compositional matrix adjust.
 Identities = 140/389 (35%), Positives = 203/389 (52%), Gaps = 26/389 (6%)

Query: 529 IGQFLYLEGIEYLMWNTYDVHFYSSFALIMLFPKIQLSIQRDFAAAVMMHDPSKMKLLDD 588
           +GQF  LE ++Y  + + DV  Y +F L+ML+P+++ ++ R FA A+   D  + +++  
Sbjct: 412 VGQFAVLECLDYAWYESLDVRLYGAFGLLMLWPELEKAVMRAFARAIPQAD-DRTRVI-- 468

Query: 589 GQWVS---------RKVLGAVPHDIGICD--PWFEVNAYCLYDTARWKDLNPKFVLQVYR 637
           G W +         RK+  A PHD+G  +  PW + N     D   WKDL   FVLQV+R
Sbjct: 469 GYWFTIGQESPLAKRKLANATPHDLGAPNESPWLQTNYTGYQDCNLWKDLGCDFVLQVWR 528

Query: 638 DVVATG--DKKFAKAVWPSVYVAMAYMDQFDRDGDGMIENDGFPDQTYDTWSVSGISAYS 695
           D   TG  D+ F    WP+   A+ Y+  FD DGDG+ EN G PDQT+D W + GISAY 
Sbjct: 529 DYQLTGSSDRGFLSDCWPAAVAALRYLKDFDLDGDGIPENSGAPDQTFDDWRLQGISAYC 588

Query: 696 GGLW----VAALQAASALAREVGDRGSEDYFLFKFQKAKVVYEKLWNGSYFNYDNSGSSQ 751
           GGLW     AAL  A  L     DR  +D+  +  Q   + ++ LWNG Y+  D+   SQ
Sbjct: 589 GGLWIAALEAALAIADVLELSAEDRDRQDFQSWLAQARSLYHDTLWNGRYYQLDSGSGSQ 648

Query: 752 SSSIQADQLAGQWYARACGLLPIVDEDKARSALKKVYNYNVLKVMGGKRGAVNGMLPDGR 811
              + ADQL G +Y+R   L P+   + A+S    VY     K   G+ G  NG+LPDG 
Sbjct: 649 --VVMADQLCGDFYSRLLQLPPVAPLEAAQSTADMVYEACFQKFHSGQFGLANGLLPDGS 706

Query: 812 -VDMSSMQSREIWSGVTYAVAASMIHEDLADIGFQTACGIYEAAWSGTGLGYAFQTPEAW 870
            VD       E+W+G+ + +AA  +     D  F+    +    +     G  F+TPEA 
Sbjct: 707 PVDPKGTHPLEVWTGINFGIAAYWLLLGHRDRCFEVTETVIRQIYDN---GLQFRTPEAI 763

Query: 871 NTDDQYRSLCYMRPLAIWAMQWALTRPKP 899
             +  +R+  Y+RP+AIWA+   LT   P
Sbjct: 764 TANATFRASHYLRPMAIWAVYGVLTNFSP 792



 Score =  168 bits (425), Expect = 2e-38,   Method: Compositional matrix adjust.
 Identities = 121/396 (30%), Positives = 184/396 (46%), Gaps = 57/396 (14%)

Query: 95  HGVPLGGVGSGSIGRSYRGEFQRWQIFPRVCEDKPVLANQFSVFVSRSNGQKYSSVLCPK 154
           HG PLGG G+G  GRS RG+   W +         + A QF+V+ S +     S+     
Sbjct: 36  HGAPLGGFGAGCWGRSPRGDVTLWHLDGGEHWYGSIPACQFAVYESGTGAYALST----- 90

Query: 155 TPEVLKDTTAAGIGSWDW--NLKGDKST--YHALYPRAWTVHEGEPDPELRIVCRQISPI 210
             E   D +   + SW+W      ++ST  Y ALYPR+   ++   + E  I CRQ SPI
Sbjct: 91  --EAPSDGS---LSSWNWYPASTAERSTGEYSALYPRSQFSYQQVFEAE--IHCRQFSPI 143

Query: 211 IPHNYKESSYPVSVFTYTIYNSGKTSADITLLFTWTNSVGGDSEFTGQHYNSKTKMNDGV 270
           +PH+Y+ +SYP ++F + ++N       I++L +W N  G    FT  +   +    D  
Sbjct: 144 LPHDYQATSYPTAIFRWQLHNPSDRPLTISILLSWENLCGW---FTNTNKAPEVVYRDDG 200

Query: 271 HAV-------------------------LLHHRTSHQLPPV---TFALAAQETDGVHVSL 302
             V                         LL  +T  Q P      +A+A  E +G+ +  
Sbjct: 201 SPVYDYVPALGQSVGNLNQRIAGEGWQGLLLDQTRSQDPEEGDGQWAIAIAEAEGLEIFR 260

Query: 303 CPHFVISGNSLGLTAKDMWHEIKEHGSF-DRLNSMETSVTSEPGSSIGAAIAASVTVPPD 361
           C  +  +G+       ++W      GS  DR +S   +     G  +  AIA  +T+ P 
Sbjct: 261 CDRWDPTGD-----GSELWQSFALDGSIPDRQDSQPAAA----GERLAGAIAVRLTLQPG 311

Query: 362 SEGQVTFSLAWDCPEVNFMSGKTYYRRYTKFYGTHQNAAANIARDAILEHGSWELQIEAW 421
              ++ FS+AWD P   F +G   +RRYT F+G     AA IA   +  +  WE  I AW
Sbjct: 312 ESREIPFSIAWDLPVTEFAAGVKAFRRYTDFFGRDGRNAAAIAATGLKHYDEWEQAIAAW 371

Query: 422 QRPILEDKRLPEWYPITLFNELYYLNAGGAVWTDGS 457
           Q+PIL+   L + + + LFNELY L +GG++WT  S
Sbjct: 372 QQPILDRDDLSDTFKLGLFNELYDLCSGGSLWTAAS 407


>gi|366162568|ref|ZP_09462323.1| hypothetical protein AcelC_02756 [Acetivibrio cellulolyticus CD2]
          Length = 842

 Score =  228 bits (582), Expect = 1e-56,   Method: Compositional matrix adjust.
 Identities = 148/367 (40%), Positives = 190/367 (51%), Gaps = 20/367 (5%)

Query: 532 FLYLEGIEYLMWNTYDVHFYSSFALIMLFPKIQLSIQRDFAAAVMMHDPSKMKLLDDGQW 591
           F +LE  +Y  + T DV FY S+ LI+++P I     R FA +V      K         
Sbjct: 401 FSHLECYKYDFYGTSDVRFYGSWPLILMWPDIDKQAVRQFADSVYHTRTDK--------- 451

Query: 592 VSRKVLGAVPHDIGICDPWFEV-NAYCLYDTARWKDLNPKFVLQVYRDVVATG--DKKFA 648
                +G   HDIG  +  FE  NAY   D+  WKDLN K VL VYRD   TG  D  F 
Sbjct: 452 --PAAIGTTAHDIGNANNVFETWNAYVYRDSTNWKDLNSKLVLMVYRDWALTGKTDMDFL 509

Query: 649 KAVWPSVYVAMAYMDQFDRDGDGMIENDGFPDQTYDTWSVSGISAYSGGLWVAALQAASA 708
           K  W  V  AM  +   D DGD +  + G  DQTYD   + G +AY GGL+VAA  AA  
Sbjct: 510 KYCWIPVQKAMDKVKSQDSDGDYLPNSSGI-DQTYDDMQLQGNTAYCGGLFVAACNAAKE 568

Query: 709 LAREVGDRGSEDYFLFKFQKAKVVY-EKLWNGSYFNYDNSGSSQSSSIQADQLAGQWYAR 767
           +A  +GD      +   ++K+K  Y  KLWNGSYF  D +GS  +S I +DQL G WYA+
Sbjct: 569 IATAMGDSTKATQYQSWYEKSKENYIAKLWNGSYFKID-TGSYDTSRIMSDQLCGHWYAK 627

Query: 768 ACGLLPIVDEDKARSALKKVYNYNVLKVMGGKRGAVNGMLPDGRVDMSSMQSREIWSGVT 827
           ACGL  IV ++ A+ +  KVY +N  K   G  G VN M  DG VDMS  Q+ E W G +
Sbjct: 628 ACGLEGIVPDEYAKKSYSKVYEFNNKKFDNGLHGFVNIMKADGNVDMSHAQTAEAWVGTS 687

Query: 828 YAVAASMIHEDLADIGFQTACGIYEAAWSGTGLGYAFQTPEAWNTD-DQYRSLCYMRPLA 886
           Y V A MI E L     Q    + +  W    +   F+TPEAW     + R+  YMR   
Sbjct: 688 YGVIAGMIQEGLETEASQVGSNLADTVWKTNDM--WFRTPEAWRQGVSEVRAPYYMRANC 745

Query: 887 IWAMQWA 893
           IWA++ A
Sbjct: 746 IWAVKHA 752



 Score =  157 bits (396), Expect = 4e-35,   Method: Compositional matrix adjust.
 Identities = 109/370 (29%), Positives = 176/370 (47%), Gaps = 59/370 (15%)

Query: 96  GVPLGGVGSGSIGRSYRGEFQRWQIFPRVCEDKPVLANQFSVFVSRSNGQKYSSVLCPKT 155
           G P+GG G+G+I   + G F + ++     E        F ++  + +GQ  S       
Sbjct: 70  GAPIGGFGAGTITWKFDGSFTKGRLDIGSNELSSDADCAFYMY-QKPSGQNIS------- 121

Query: 156 PEVLKDTTAAGIGSWDWNLKGDKSTYHALYPRAWTVHEGEPDPELRIVCRQISPIIPHNY 215
              +K   A  IG+        ++TY++L+P++W  + G  +   +    Q SPIIP NY
Sbjct: 122 ---MKKLDAGSIGA-------GQATYYSLFPKSWVDYSGS-NFYCKAKVTQFSPIIPGNY 170

Query: 216 KESSYPVSVFTYTIYNSGKTSADITLLFTWTNSVGGDSEFTGQHYNSKTKMNDGVHAVLL 275
           + +SYPV V+ + I N G +S D+ ++ +W N  GG         NSK     G+  +L 
Sbjct: 171 ETTSYPVGVYKWEITNPGSSSCDVAIMLSWKNEFGGSFAEGQTSGNSK-----GI--ILR 223

Query: 276 HHRTSHQL--PPVTFALAAQETDGVHVSLCPHFVISGNSLGLTAKDMWHEIKEHGSFDRL 333
            + TS+        F+LAA++ DGV V+                         +GS    
Sbjct: 224 RNGTSNATMENQGEFSLAAEQKDGVKVT-------------------------YGSASGT 258

Query: 334 NSMETSVTSEP--GSSIGA----AIAASVTVPPDSEGQVTFSLAWDCPEVNFMSGKTYYR 387
           +++ +   ++   G ++G+    A+A   T+ P     V  +L+WD P     SG  +YR
Sbjct: 259 STLSSDFGTDGLLGDTVGSNNIGAVAFKATLEPGQTITVPVALSWDIPISQAGSGSKWYR 318

Query: 388 RYTKFYGTHQNAAANIARDAILEHGSWELQIEAWQRPILEDKRLPEWYPITLFNELYYLN 447
           +YT+FYG     +  IA+DAI  + SWE  I++WQ  +L D + P+W   T+FNELYY  
Sbjct: 319 KYTRFYGRSGLNSWKIAKDAIDNYSSWESSIDSWQNSVLNDTKYPQWLKTTMFNELYYYF 378

Query: 448 AGGAVWTDGS 457
            GG  W  G+
Sbjct: 379 TGGTYWEAGA 388


>gi|284929778|ref|YP_003422300.1| putative bile acid beta-glucosidase [cyanobacterium UCYN-A]
 gi|284810222|gb|ADB95919.1| predicted bile acid beta-glucosidase [cyanobacterium UCYN-A]
          Length = 803

 Score =  227 bits (579), Expect = 2e-56,   Method: Compositional matrix adjust.
 Identities = 135/391 (34%), Positives = 206/391 (52%), Gaps = 28/391 (7%)

Query: 529 IGQFLYLEGIEYLMWNTYDVHFYSSFALIMLFPKIQLSIQRDFAAAVMMHDPSKMKLLDD 588
           +G+F  LE I+Y  + + DV  Y SF L+ L+P ++ ++ + FA ++   D +   +  D
Sbjct: 417 VGKFGVLECIDYRWYESLDVRLYGSFGLLALWPSLEKAVIKTFAESISKSDHTLRLIGYD 476

Query: 589 GQWVSRKVLGAVPHDIGICD--PWFEVNAYCLYDTARWKDLNPKFVLQVYRDVVATG--D 644
                RKV GAVPHD+G  +  PW E N     D   WKDL   FVLQVYRD + T   D
Sbjct: 477 RSLAPRKVKGAVPHDLGAPNEHPWEETNYTSYQDCNLWKDLGSDFVLQVYRDFLLTDSQD 536

Query: 645 KKFAKAVWPSVYVAMAYMDQFDRDGDGMIENDGFPDQTYDTWSVSGISAYSGGLWVAALQ 704
             F +  WP++   + Y+ +FD D DG+ EN G PDQT+D W + GIS+Y G LW+ AL+
Sbjct: 537 INFLQECWPAIVETLDYLKKFDLDDDGIPENSGAPDQTFDDWQLHGISSYCGALWITALE 596

Query: 705 AASALAREVGDRGSEDYF-------------LFKF----QKAKVVY-EKLWNGSYFNYDN 746
           AA  + + +      +               L K+    +K++++Y E LWNG Y+  D+
Sbjct: 597 AAIGIGKILLKNSENNSHPDLQNSLAFIQENLTKYDDWLKKSRLIYHETLWNGEYYQLDS 656

Query: 747 SGSSQSSSIQADQLAGQWYARACGLLPIVDEDKARSALKKVYNYNVLKVMGGKRGAVNGM 806
              S S +I  DQL GQ+YAR   L  +VD     SAL K+Y    L    G+ GA NG+
Sbjct: 657 --KSGSDTIMTDQLCGQFYARLLNLPDVVDTSCTISALSKIYEACFLNFYDGQYGAANGV 714

Query: 807 LPDGRVDM-SSMQSREIWSGVTYAVAASMIHEDLADIGFQTACGIYEAAWSGTGLGYAFQ 865
             DG  +  +S   +E+W G+ + +AA +I   + +   +    + E  ++    G  F+
Sbjct: 715 RSDGTPENPNSTHPQEVWVGINFGLAAFLIQMGMKNQALELTKVVVEQIYAN---GLQFR 771

Query: 866 TPEAWNTDDQYRSLCYMRPLAIWAMQWALTR 896
           TPEA      +R+  Y+R +AIW + + L +
Sbjct: 772 TPEAITPVGTFRASHYLRAMAIWGVYYQLKK 802



 Score =  199 bits (505), Expect = 9e-48,   Method: Compositional matrix adjust.
 Identities = 128/391 (32%), Positives = 193/391 (49%), Gaps = 47/391 (12%)

Query: 95  HGVPLGGVGSGSIGRSYRGEFQRWQIFPRVCEDKPVLANQFSVFVSRSNGQKYSSVLCPK 154
           HG+P+GG G+G IGRS  G+F  W +       K + + QFS+F    +G+  +  L  K
Sbjct: 41  HGMPIGGFGAGCIGRSSAGDFNLWHLDGGEHVFKTISSCQFSIFEELKDGKTKAYALSTK 100

Query: 155 TPEVLKDTTAAGIGSWDWNLKGDKSTYHALYPRAWTVHEGEPDPELRIVCRQISPIIPHN 214
            PE   D T   +  W+W  K +K TY+ALYPR+W  +EG    E +IVC Q SPIIP N
Sbjct: 101 EPE---DKT---LSKWNWYPK-NKGTYYALYPRSWYQYEGIF--EAKIVCEQFSPIIPDN 151

Query: 215 YKESSYPVSVFTYTIYNSGKTSADITLLFTWTNSVG----------------GDSE---- 254
           Y+E+SYP+ +F +T+ N       ++++ TW N +G                G  E    
Sbjct: 152 YQETSYPIGIFEWTVSNPTDQPITLSIMLTWQNMIGWFTNSIKTPKIEVRDDGSPEYEYS 211

Query: 255 ----FTGQHYNSKTKMNDGVHAVLLHHRTSHQLPP----VTFALAAQETDGVHVSLCPHF 306
               F+  ++N   + +     +L   R +  +      + FA  A + D +  S    +
Sbjct: 212 PQLGFSQGNFNKWKQSSSQFGFILDRLRVNDSISEGEGQICFA-TAHDPD-LEFSYLGKW 269

Query: 307 VISGNSLGLTAKDMWHEIKEHGSFDRLNSMETSVTSEPGSSIGAAIAASVTVPPDSEGQV 366
             SGN       ++W      G+    N+++ ++ ++P   I AA+A    + P    ++
Sbjct: 270 NPSGN-----GSEVWDHFSASGTLP--NTLDETI-AKPDEQIAAAMAVKFVIKPGEVKKI 321

Query: 367 TFSLAWDCPEVNFMSGKTYYRRYTKFYGTHQNAAANIARDAILEHGSWELQIEAWQRPIL 426
            F+LAWD P   F  G  Y+RRYT F+G +   A  IA   +     W  QI  +Q PIL
Sbjct: 322 PFTLAWDLPITEFSLGNQYHRRYTDFFGINGKNAWMIAETGLKSLNIWREQILEYQLPIL 381

Query: 427 EDKRLPEWYPITLFNELYYLNAGGAVWTDGS 457
            DK LP+W+ + LFNELY L  GG +WT  S
Sbjct: 382 NDKDLPDWFKMALFNELYLLTDGGTLWTTSS 412


>gi|218437284|ref|YP_002375613.1| hypothetical protein PCC7424_0277 [Cyanothece sp. PCC 7424]
 gi|218170012|gb|ACK68745.1| protein of unknown function DUF608 [Cyanothece sp. PCC 7424]
          Length = 801

 Score =  227 bits (578), Expect = 3e-56,   Method: Compositional matrix adjust.
 Identities = 147/429 (34%), Positives = 216/429 (50%), Gaps = 39/429 (9%)

Query: 496 ILERMSSILEQIYTPVALNSSFGTNL--LQDG---------EENIGQFLYLEGIEYLMWN 544
           I+E  + IL+Q   P     +    L  L DG         ++ +GQF  LE I+Y  + 
Sbjct: 373 IIEWQAPILKQKELPDWFKMALFNELYLLADGGTLWTAATEDDPVGQFAVLECIDYRWYE 432

Query: 545 TYDVHFYSSFALIMLFPKIQLSIQRDFAAAVMMHDPSKMKLLDDGQWVSRKVLGAVPHDI 604
           + DV  Y SF L ML+P++  ++   FA A+   D +   +  +G    RK  GA PHD+
Sbjct: 433 SIDVRLYGSFGLGMLWPRLDKAVMEAFARAIPKGDDTPRIIGYNGAKAIRKAKGATPHDL 492

Query: 605 GICD--PWFEVNAYCLYDTARWKDLNPKFVLQVYRDVVATG--DKKFAKAVWPSVYVAMA 660
           G  +  PW + N     D   WKDL   FVL +YR  + TG  ++ F    W SV   +A
Sbjct: 493 GAPNEHPWEKTNYTSYQDCNLWKDLGCDFVLLLYRSFLWTGANNQDFLWEHWDSVTQTIA 552

Query: 661 YMDQFDRDGDGMIENDGFPDQTYDTWSVSGISAYSGGLWVAALQAASALAREVGDR---- 716
           Y+ +FD DGDG+ EN G PDQT+D W + GISAY G LW+A L+AA  +   +  R    
Sbjct: 553 YVKKFDLDGDGIPENSGAPDQTFDDWQLRGISAYCGALWIAGLEAAIKIGETLLQRPPMN 612

Query: 717 ---GSED-----------YFLFKFQKAKVVYEKLWNGSYFNYDNSGSSQSSSIQADQLAG 762
               SE+           Y  +  Q   + ++ LW G Y+  D+   S S  + +DQL G
Sbjct: 613 PQLQSENAPEKIEIAIATYRQWLHQARALYHDTLWTGEYYRLDS--ESGSDVVMSDQLCG 670

Query: 763 QWYARACGLLPIVDEDKARSALKKVYNYNVLKVMGGKRGAVNGMLPDGRVDM-SSMQSRE 821
           Q+YAR  GL  +V+     SAL+K+Y    L+  GGK GA NG+ PDG  +   +    E
Sbjct: 671 QFYARLLGLPDVVEPQYTLSALRKIYESCFLQFYGGKYGAANGVRPDGSPENPDATHPLE 730

Query: 822 IWSGVTYAVAASMIHEDLADIGFQTACGIYEAAWSGTGLGYAFQTPEAWNTDDQYRSLCY 881
           +W+G+ + +AA MI   +   G      + +  +     G  F+TPEA      +R+  Y
Sbjct: 731 VWTGINFGLAAFMIQMGMKQEGLSITEAVVKQIYEN---GLQFRTPEAITAAGTFRASHY 787

Query: 882 MRPLAIWAM 890
           +R +AIWA+
Sbjct: 788 LRAMAIWAI 796



 Score =  209 bits (531), Expect = 8e-51,   Method: Compositional matrix adjust.
 Identities = 130/384 (33%), Positives = 191/384 (49%), Gaps = 39/384 (10%)

Query: 95  HGVPLGGVGSGSIGRSYRGEFQRWQIFPRVCEDKPVLANQFSVFVSRSNGQKYSSVLCPK 154
           HG+PLGG G+G IGRS RG+F  W +       + +   QFS+F     G+  +  LC +
Sbjct: 41  HGMPLGGFGAGCIGRSPRGDFNLWHLDGGEHTFRNLPPCQFSIFEHPEGGEAQAYALCTQ 100

Query: 155 TPEVLKDTTAAGIGSWDWNLKGDKSTYHALYPRAWTVHEGEPDPELRIVCRQISPIIPHN 214
            PE   D T   +  W W  +  K TYHALYPR+W  +EG    + +I+C Q SPI+  N
Sbjct: 101 PPE---DGT---LSRWSWYPQ-KKGTYHALYPRSWYEYEGVF--QSQIICEQYSPILADN 151

Query: 215 YKESSYPVSVFTYTIYNSGKTSADITLLFTWTNSVGGDS------------------EFT 256
           Y+ESSYP+++F +TIYN       I+++ +W N+VG  S                  E+ 
Sbjct: 152 YQESSYPLAIFEWTIYNPSDRPVTISIMMSWQNTVGWFSNAIKTPEVKVRDDGSPVYEYQ 211

Query: 257 GQHYNSKTKMNDGVHAVLLHHRTSHQLPPVTFALAAQETDG---VHVSLCPHFVISGNSL 313
            +  +S    N     ++ ++R    L  +      +E +G   +   L P+  +  N+ 
Sbjct: 212 PRWGDSTGNFN---QWIVDNYRVGCLLNRIQPHPQVEEGEGQVCIASILNPNLEVFYNTR 268

Query: 314 GLTAKD---MWHEIKEHGSFDRLNSMETSVTSEPGSSIGAAIAASVTVPPDSEGQVTFSL 370
                D   +W     +GS   L   +    + PG  I AA+A   T+PP    ++ F L
Sbjct: 269 WNPQGDGSQIWDYFAMNGS---LGDYQNESPAAPGEQIAAAMAIRFTIPPGKTRKIPFIL 325

Query: 371 AWDCPEVNFMSGKTYYRRYTKFYGTHQNAAANIARDAILEHGSWELQIEAWQRPILEDKR 430
           AWD P   F  G  YYRRYT F+G +     ++ R A+  H  W  +I  WQ PIL+ K 
Sbjct: 326 AWDLPITEFAQGINYYRRYTDFFGRNGQNVWSVVRTALKHHEMWRDRIIEWQAPILKQKE 385

Query: 431 LPEWYPITLFNELYYLNAGGAVWT 454
           LP+W+ + LFNELY L  GG +WT
Sbjct: 386 LPDWFKMALFNELYLLADGGTLWT 409


>gi|300866990|ref|ZP_07111661.1| conserved hypothetical protein [Oscillatoria sp. PCC 6506]
 gi|300335025|emb|CBN56827.1| conserved hypothetical protein [Oscillatoria sp. PCC 6506]
          Length = 863

 Score =  227 bits (578), Expect = 3e-56,   Method: Compositional matrix adjust.
 Identities = 142/408 (34%), Positives = 215/408 (52%), Gaps = 42/408 (10%)

Query: 524 DGEENIGQFLYLEGIEYLMWNTYDVHFYSSFALIMLFPKIQLSIQRDFAAAVMMHDPSKM 583
           D  +  GQF  LE I+Y  + + DV  Y SFAL+ML+P++  ++   FA A+   D +  
Sbjct: 456 DERDPKGQFAVLECIDYRWYESLDVRLYGSFALLMLWPELDKAVLLAFARAIFTGDDTPR 515

Query: 584 KLLDDGQWVSRKVLGAVPHDIGICD--PWFEVNAYCLYDTARWKDLNPKFVLQVYRDVVA 641
            +  +     RK+ GA PHD+G  +  PW + N     D   WKDL   FVLQVYRD + 
Sbjct: 516 VIGWNQASAVRKIAGATPHDLGAPNEHPWEKTNYTSYQDCNLWKDLPCDFVLQVYRDFLL 575

Query: 642 TG--DKKFAKAVWPSVYVAMAYMDQFDRDGDGMIENDGFPDQTYDTWSVSGISAYSGGLW 699
           TG  D +F +  WP++  A+AY+  FD DGDG+ EN G PDQT+D W + G+SAY GGLW
Sbjct: 576 TGSTDYEFLQECWPAIVEALAYLKTFDLDGDGIPENSGAPDQTFDDWQMRGVSAYCGGLW 635

Query: 700 V----------AALQAASALAREVG-------DRGSEDYFLFK--------------FQK 728
           +          + L    +   E+        ++  +D  LF                ++
Sbjct: 636 LAALEAAIAIGSLLIKHQSSVEELNQSPITNFNKDFKDVSLFPITNYQLPITNYQNWLEQ 695

Query: 729 AKVVYE-KLWNGSYFNYDNSGSSQSSSIQADQLAGQWYARACGLLPIVDEDKARSALKKV 787
           AK +Y+ +LWNG Y+  D+   S S  + ADQL GQ+YAR  GL  IV E+ A SAL+ V
Sbjct: 696 AKPIYQQRLWNGQYYRLDSE--SGSDVVMADQLCGQFYARLLGLPDIVPEECAISALETV 753

Query: 788 YNYNVLKVMGGKRGAVNGMLPDGRVDM-SSMQSREIWSGVTYAVAASMIHEDLADIGFQT 846
           Y    +K   G+ GA NG++ DG  +   +    E+W+G+ + +AA M+   + +   + 
Sbjct: 754 YESCFIKFNQGEFGAANGVMLDGSPEKPGATHPLEVWTGINFGLAAFMMQMGMKEKALEL 813

Query: 847 ACGIYEAAWSGTGLGYAFQTPEAWNTDDQYRSLCYMRPLAIWAMQWAL 894
              + +  ++    G  F+TPEA      +R+  Y+R +AIWA+ + L
Sbjct: 814 TEVVVKQIYAN---GLQFRTPEAITATGTFRASHYLRAMAIWAIYYEL 858



 Score =  176 bits (446), Expect = 5e-41,   Method: Compositional matrix adjust.
 Identities = 124/401 (30%), Positives = 187/401 (46%), Gaps = 54/401 (13%)

Query: 95  HGVPLGGVGSGSIGRSYRGEFQRWQIFPRVCEDKPVLANQFSVFVSRSNGQKYSSVLCPK 154
           HG+PLGG G+G IGRS+RG+F  W I       + + A QFSVF    +G+K +  L  +
Sbjct: 66  HGMPLGGFGAGCIGRSHRGDFNLWHIDGGEHVFRSLPACQFSVF-EEIDGKKQAYALSTE 124

Query: 155 TPEVLKDTTAAGIGSWDW-----NLKGDK--------------STYHALYPRAWTVHEGE 195
            P      T   + +W W      +K +K               +YHALYPR+W V+E  
Sbjct: 125 PP------TDGSLSAWKWYPSELKIKNEKLKSEEQNSDSSLSTGSYHALYPRSWFVYENV 178

Query: 196 PDPELRIVCRQISPIIPHNYKESSYPVSVFTYTIYNSGKTSADITLLFTWTNSVG----- 250
              +L   C Q SPI   NY+ESSYPV++F +  +N       ++++ TW N+VG     
Sbjct: 179 FQAQLS--CEQFSPIWAGNYQESSYPVAIFEWIAHNPTDRPITLSIMLTWQNTVGWFTNA 236

Query: 251 --------GDSEFTGQHYNSKTKMNDGVHAVLLH--HRTSHQLPPVTFALAAQETDG--- 297
                    D       YNS+   + G    L+   HR    +  ++ +   +E +G   
Sbjct: 237 NKSPEIKVRDDGSPVYEYNSRWGESGGNFNRLVEDFHRIGCVMTRLSASEEVKEGEGQWA 296

Query: 298 ----VHVSLCPHFVISGNSLGLTAKDMWHEIKEHGSFDRLNSMETSVTSEPGSSIGAAIA 353
                + ++   +    N  G + +D+W    + G F   +  ET      G  +  AIA
Sbjct: 297 IATVANAAVEVFYQTRWNPAG-SGEDIWRPFSQEG-FLLDDPDETPAAG--GEQLACAIA 352

Query: 354 ASVTVPPDSEGQVTFSLAWDCPEVNFMSGKTYYRRYTKFYGTHQNAAANIARDAILEHGS 413
              T+ P    ++ F LAWD P   F     YYRRYT F+G +     +I R A+    +
Sbjct: 353 VRFTIRPGKTRKIPFILAWDFPITEFAPEVWYYRRYTDFFGRNGKNVWSIIRTAMKHSDT 412

Query: 414 WELQIEAWQRPILEDKRLPEWYPITLFNELYYLNAGGAVWT 454
           W   I+AWQ+PIL  + LP+   + +FNELY L  GG +W+
Sbjct: 413 WRENIQAWQQPILAREDLPDIIKMAMFNELYALTDGGTLWS 453


>gi|220909879|ref|YP_002485190.1| hypothetical protein Cyan7425_4519 [Cyanothece sp. PCC 7425]
 gi|219866490|gb|ACL46829.1| protein of unknown function DUF608 [Cyanothece sp. PCC 7425]
          Length = 814

 Score =  226 bits (577), Expect = 4e-56,   Method: Compositional matrix adjust.
 Identities = 150/410 (36%), Positives = 216/410 (52%), Gaps = 46/410 (11%)

Query: 524 DGEENIGQFLYLEGIEYLMWNTYDVHFYSSFALIMLFPKIQLSIQRDFAAAVMMHD---- 579
           D  + IGQF  LE ++Y  + + DV  Y SFAL++L+P+++ ++   FA A+   D    
Sbjct: 414 DERDPIGQFAVLECLDYRWYESLDVRLYGSFALLLLWPRLEKAVMEAFARAIPTQDDRTR 473

Query: 580 ----------PSKMKLLDDGQWVSRKVLGAVPHDIGICD--PWFEVNAYCLYDTARWKDL 627
                     PS   L         K+  A PHD+G  +  PW + N     D   WKDL
Sbjct: 474 VIGYYYTQGMPSPTAL--------HKIANATPHDLGAPNEHPWEKTNYTSYQDCNLWKDL 525

Query: 628 NPKFVLQVYRDVVATG--DKKFAKAVWPSVYVAMAYMDQFDRDGDGMIENDGFPDQTYDT 685
              FV+ VYRD + TG  D  F    WP+++  +AY+  FD DGDG+ EN G PDQT+D 
Sbjct: 526 PADFVILVYRDYLLTGGTDYDFLTTCWPAIHATLAYLKTFDLDGDGIPENSGAPDQTFDD 585

Query: 686 WSVSGISAYSGGLWVAALQAASA----LAREVGDRGSEDYFLFKFQ----KAKVVYEK-L 736
           W + GISAY GGLW+AAL+AA A    L  +  D       L  +Q    +++ +Y++ L
Sbjct: 586 WRLQGISAYCGGLWIAALEAAIAIIKLLLVQAPDLPHLQESLTTYQTWLAQSRPLYQQTL 645

Query: 737 WNGSYFNYDNSGSSQSSSIQADQLAGQWYARACGLLPIVDEDKARSALKKVYNYNVLKVM 796
           WNG Y+  D+   S S  + ADQL GQ+YAR  GL  +V ED AR+AL  VY    L+  
Sbjct: 646 WNGQYYRLDS--GSGSDVVMADQLCGQFYARLVGLPDVVPEDCARAALDMVYQTCFLQFH 703

Query: 797 GGKRGAVNGMLPDGRVDM-SSMQSREIWSGVTYAVAASMIHEDLADIGFQ-TACGIYEAA 854
            G  GA NG+ P G  +   +    E+W+G+ + +AA ++         Q TA  +Y+  
Sbjct: 704 NGTLGAANGLRPGGLAEKPDATHPLEVWTGINFGLAAFLVQMGRKAEAMQLTAAVVYQI- 762

Query: 855 WSGTGLGYAFQTPEAWNTDDQYRSLCYMRPLAIWAMQWALT---RPKPKT 901
                 G  F+TPEA   +  +R+  Y+RP+AIWA+   LT    P P++
Sbjct: 763 ---YEHGLQFRTPEAITAEQTFRAGMYLRPMAIWAIYHLLTSHANPNPES 809



 Score =  196 bits (499), Expect = 5e-47,   Method: Compositional matrix adjust.
 Identities = 136/390 (34%), Positives = 191/390 (48%), Gaps = 50/390 (12%)

Query: 95  HGVPLGGVGSGSIGRSYRGEFQRWQIFPRVCEDKPVLANQFSVF--VSRSNGQKYS-SVL 151
           HG+PLGG G+G +GRS+RG+F  W I       + + A QFSVF  V +S G+ YS S +
Sbjct: 42  HGMPLGGFGAGCVGRSHRGDFNLWHIDGGEHVFQSLPACQFSVFEQVGQSPGKAYSLSTV 101

Query: 152 CPKTPEVLKDTTAAGIGSWDWNLKGDKSTYHALYPRAWTVHEGEPDPELRIVCRQISPII 211
            P+            + SW W  +   S YHALYPR+W V++     +L  VC Q SPI+
Sbjct: 102 APED---------GSLNSWQWYPQSGGS-YHALYPRSWFVYQNVFQADL--VCEQFSPIL 149

Query: 212 PHNYKESSYPVSVFTYTIYNSGKTSADITLLFTWTNSVGGDSEFTGQHYNSKTKMNDGVH 271
             NY+E+SYPV+VF +T +N       ++L+ TW N VG    FT    +   K+ D   
Sbjct: 150 TENYQETSYPVAVFHWTAHNPTDQPLTLSLMLTWQNVVGW---FTNADKSPVVKVRDDGS 206

Query: 272 AVLLH-----------HRTSHQLPPVTFAL--------AAQETDG---VHVSLCPH---- 305
            V  +           +R       V F L        A  E +G   V     P+    
Sbjct: 207 PVYTYQPRLGESQGNFNRVIDDNGRVGFVLDRLRSEEAAIAEGEGQWAVATFASPYTSEV 266

Query: 306 -FVISGNSLGLTAKDMWHEIKEHGSFDRLNSMETSVTSEPGSSIGAAIAASVTVPPDSEG 364
            +    N +G    ++W      GS   L + +    +  G  I AAIA   T+PP    
Sbjct: 267 FYYTRWNPVG-DGSELWQTFAADGS---LANCDDETPAGVGEQIAAAIAVRFTLPPGQTR 322

Query: 365 QVTFSLAWDCPEVNFMSGKTYYRRYTKFYGTHQNAAANIARDAILEHGSWELQIEAWQRP 424
           Q+ F LAWD P   F +G  Y+RRYT F+G   +    IAR  +  H +W  QI+AWQ+P
Sbjct: 323 QIPFVLAWDFPVTEFAAGVNYFRRYTDFFGRTGDHGWAIARTGLKMHQTWTDQIQAWQQP 382

Query: 425 ILEDKRLPEWYPITLFNELYYLNAGGAVWT 454
           IL D   P+W+ + L NELY L +GG +W+
Sbjct: 383 IL-DSNEPDWFKMALLNELYDLTSGGTLWS 411


>gi|56750695|ref|YP_171396.1| hypothetical protein syc0686_c [Synechococcus elongatus PCC 6301]
 gi|56685654|dbj|BAD78876.1| hypothetical protein [Synechococcus elongatus PCC 6301]
          Length = 798

 Score =  226 bits (577), Expect = 4e-56,   Method: Compositional matrix adjust.
 Identities = 139/389 (35%), Positives = 202/389 (51%), Gaps = 26/389 (6%)

Query: 529 IGQFLYLEGIEYLMWNTYDVHFYSSFALIMLFPKIQLSIQRDFAAAVMMHDPSKMKLLDD 588
           +GQF  LE ++Y  + + DV  Y +F L+ML+P+++ ++ R FA A+   D  + +++  
Sbjct: 412 VGQFAVLECLDYAWYESLDVRLYGAFGLLMLWPELEKAVMRAFARAIPQAD-DRTRVI-- 468

Query: 589 GQWVS---------RKVLGAVPHDIGICD--PWFEVNAYCLYDTARWKDLNPKFVLQVYR 637
           G W +         RK+  A PHD+G  +  PW + N     D   WKDL   FVLQV+R
Sbjct: 469 GYWFTIGQESPLAKRKLANATPHDLGAPNESPWLQTNYTGYQDCNLWKDLGCDFVLQVWR 528

Query: 638 DVVATG--DKKFAKAVWPSVYVAMAYMDQFDRDGDGMIENDGFPDQTYDTWSVSGISAYS 695
           D   TG  D+ F    WP+   A+ Y+  FD DGDG+ EN G PDQT+D W + GISAY 
Sbjct: 529 DYQLTGSSDRGFLSDCWPAAVAALRYLKDFDLDGDGIPENSGAPDQTFDDWRLQGISAYC 588

Query: 696 GGLW----VAALQAASALAREVGDRGSEDYFLFKFQKAKVVYEKLWNGSYFNYDNSGSSQ 751
           GGLW     AAL  A  L     DR  +D+  +  Q   + ++ LWNG Y+  D+   SQ
Sbjct: 589 GGLWIAALEAALAIADVLELSAEDRDRQDFQSWLAQARSLYHDTLWNGRYYQLDSGSGSQ 648

Query: 752 SSSIQADQLAGQWYARACGLLPIVDEDKARSALKKVYNYNVLKVMGGKRGAVNGMLPDGR 811
              + ADQL G +Y+R   L P+   + A+S    VY     K   G+ G  NG+LPDG 
Sbjct: 649 --VVMADQLCGDFYSRLLQLPPVAPLEAAQSTADMVYEACFQKFHSGQFGLANGLLPDGS 706

Query: 812 -VDMSSMQSREIWSGVTYAVAASMIHEDLADIGFQTACGIYEAAWSGTGLGYAFQTPEAW 870
            V        E+W+G+ + +AA  +     D  F+    +    +     G  F+TPEA 
Sbjct: 707 PVGPKGTHPLEVWTGINFGIAAYWLLLGHRDRCFEVTETVIRQIYDN---GLQFRTPEAI 763

Query: 871 NTDDQYRSLCYMRPLAIWAMQWALTRPKP 899
             +  +R+  Y+RP+AIWA+   LT   P
Sbjct: 764 TANATFRASHYLRPMAIWAVYGVLTNFSP 792



 Score =  167 bits (424), Expect = 2e-38,   Method: Compositional matrix adjust.
 Identities = 121/396 (30%), Positives = 184/396 (46%), Gaps = 57/396 (14%)

Query: 95  HGVPLGGVGSGSIGRSYRGEFQRWQIFPRVCEDKPVLANQFSVFVSRSNGQKYSSVLCPK 154
           HG PLGG G+G  GRS RG+   W +         + A QF+V+ S +     S+     
Sbjct: 36  HGAPLGGFGAGCWGRSPRGDVTLWHLDGGEHWYGSIPACQFAVYESGTGAYALST----- 90

Query: 155 TPEVLKDTTAAGIGSWDW--NLKGDKST--YHALYPRAWTVHEGEPDPELRIVCRQISPI 210
             E   D +   + SW+W      ++ST  Y ALYPR+   ++   + E  I CRQ SPI
Sbjct: 91  --EAPSDGS---LSSWNWYPASTAERSTGEYSALYPRSQFSYQQVFEAE--IHCRQFSPI 143

Query: 211 IPHNYKESSYPVSVFTYTIYNSGKTSADITLLFTWTNSVGGDSEFTGQHYNSKTKMNDGV 270
           +PH+Y+ +SYP ++F + ++N       I++L +W N  G    FT  +   +    D  
Sbjct: 144 LPHDYQATSYPTAIFRWQLHNPSDRPLTISILLSWENLCGW---FTNTNKAPEVVYRDDG 200

Query: 271 HAV-------------------------LLHHRTSHQLPPV---TFALAAQETDGVHVSL 302
             V                         LL  +T  Q P      +A+A  E +G+ +  
Sbjct: 201 SPVYDYVPALGQSVGNLNQRIAGEGWQGLLLDQTRSQDPEEGDGQWAIAIAEAEGLEIFR 260

Query: 303 CPHFVISGNSLGLTAKDMWHEIKEHGSF-DRLNSMETSVTSEPGSSIGAAIAASVTVPPD 361
           C  +  +G+       ++W      GS  DR +S   +     G  +  AIA  +T+ P 
Sbjct: 261 CDRWDPTGD-----GSELWQSFALDGSIPDRQDSQPAAA----GERLAGAIAVRLTLQPG 311

Query: 362 SEGQVTFSLAWDCPEVNFMSGKTYYRRYTKFYGTHQNAAANIARDAILEHGSWELQIEAW 421
              ++ FS+AWD P   F +G   +RRYT F+G     AA IA   +  +  WE  I AW
Sbjct: 312 ESREIPFSIAWDLPVTEFAAGVKAFRRYTDFFGRDGRNAAAIAATGLKHYDEWEQAIAAW 371

Query: 422 QRPILEDKRLPEWYPITLFNELYYLNAGGAVWTDGS 457
           Q+PIL+   L + + + LFNELY L +GG++WT  S
Sbjct: 372 QQPILDRDDLSDTFKLGLFNELYDLCSGGSLWTAAS 407


>gi|428209901|ref|YP_007094254.1| glucosylceramidase [Chroococcidiopsis thermalis PCC 7203]
 gi|428011822|gb|AFY90385.1| Glucosylceramidase [Chroococcidiopsis thermalis PCC 7203]
          Length = 812

 Score =  226 bits (577), Expect = 4e-56,   Method: Compositional matrix adjust.
 Identities = 142/389 (36%), Positives = 216/389 (55%), Gaps = 32/389 (8%)

Query: 529 IGQFLYLEGIEYLMWNTYDVHFYSSFALIMLFPKIQLSIQRDFAAAVMMHDPSK------ 582
           IGQF  LE I+Y  + + DV  Y SFAL++LFP+++ ++ R FA A+   D  +      
Sbjct: 422 IGQFAILECIDYRWYESLDVRLYGSFALLILFPELEKAVMRAFARAIPASDNHRRLIGYY 481

Query: 583 MKLLDDGQWVSRKVLGAVPHDIGICDP--WFEVNAYCLYDTARWKDLNPKFVLQVYRDVV 640
           + +  +     RKV GA PHD+G  +   W + N     D   WKDL   FVLQVYRD +
Sbjct: 482 LTIGAEMPLAPRKVAGATPHDLGAPNEHVWEKTNYTAYQDCNLWKDLPSDFVLQVYRDYL 541

Query: 641 ATG--DKKFAKAVWPSVYVAMAYMDQFDRDGDGMIENDGFPDQTYDTWSVSGISAYSGGL 698
            TG  D +F  + W ++  A+ Y+  FD+DGDG+ EN G PDQT+D W + G+SAY GGL
Sbjct: 542 LTGATDVEFLASCWEAIVQALNYLKTFDKDGDGIPENSGAPDQTFDDWKLQGVSAYCGGL 601

Query: 699 WVAALQAASALAREVGD---------------RGSEDYFLFKFQKAKVVY-EKLWNGSYF 742
           W+AAL+AA A+ + + +               + + D +     +++ +Y EKLWNG Y+
Sbjct: 602 WLAALEAAIAIGKILLNNVERLHVTSLHVTSLQETIDTYQSWLSRSRPIYQEKLWNGQYY 661

Query: 743 NYDNSGSSQSSSIQADQLAGQWYARACGLLPIVDEDKARSALKKVYNYNVLKVMGGKRGA 802
             D+  +S S  + ADQL GQ+YAR   L  IV ++ A SALK +Y+   LK   GK GA
Sbjct: 662 RLDS--NSGSDVVMADQLCGQFYARLLKLPDIVPQECALSALKTIYDACFLKFHQGKFGA 719

Query: 803 VNGMLPDGRVDM-SSMQSREIWSGVTYAVAASMIHEDLADIGFQTACGIYEAAWSGTGLG 861
            NG+ P+G  +  ++    E+W+G+ + +AA ++   +     Q    +    +     G
Sbjct: 720 ANGLKPNGSPENPNATHPLEVWTGINFGLAAFLLQMGMKAEALQLTETVVNQIYEN---G 776

Query: 862 YAFQTPEAWNTDDQYRSLCYMRPLAIWAM 890
             F+TPEA  +   +R+  Y+R +AIWA+
Sbjct: 777 LQFRTPEAITSVGTFRASYYLRAMAIWAI 805



 Score =  202 bits (514), Expect = 7e-49,   Method: Compositional matrix adjust.
 Identities = 133/394 (33%), Positives = 195/394 (49%), Gaps = 54/394 (13%)

Query: 95  HGVPLGGVGSGSIGRSYRGEFQRWQIFPRVCEDKPVLANQFSVF--VSRSNGQKYSSVLC 152
           HG+PLGG G+G IGRS RG+F  WQ+       +P+ A QFSVF  VS  +  + +  LC
Sbjct: 41  HGMPLGGFGAGCIGRSSRGDFNVWQVDGGEHIFQPMPACQFSVFESVSERSASRQAYALC 100

Query: 153 PKTPEVLKDTTAAGIGSWDW-NLKGDK----STYHALYPRAWTVHEGEPDPELRIVCRQI 207
            + P+        G+ +W W   +G++     TYHALYPR+W V+EG    +L   C Q 
Sbjct: 101 TEAPD-------DGLQNWQWYPSRGEQHAHTGTYHALYPRSWFVYEGVFQSQL--TCEQF 151

Query: 208 SPIIPHNYKESSYPVSVFTYTIYNSGKTSADITLLFTWTNSVGGDSEFTGQHYNSKTKMN 267
           SPI  +NY+ESSYPV VF +T +N       ++++ +W N  G    FT    +S+ K+ 
Sbjct: 152 SPIWANNYQESSYPVGVFVWTAHNPTNAPMTLSIMLSWQNMTGW---FTNASKSSEVKIR 208

Query: 268 DGVHAV-------------------------LLHHRTSHQLPP--VTFALAAQETDGVHV 300
           D    V                         +L    S  +      + +A  +  GV V
Sbjct: 209 DDGSPVYEYKSRWGESQGNFNRLVESEYVGCVLGRDNSEDVEEGDGQWCIATAKQPGVEV 268

Query: 301 SLCPHFVISGNSLGLTAKDMWHEIKEHGSFDRLNSMETSVTSEPGSSIGAAIAASVTVPP 360
                +    N +G T +D+W    + G+   L +++    +  G  I  AIA   T+ P
Sbjct: 269 FHHTRW----NPVG-TGEDIWTSFAQSGA---LPNIQDETPAIEGEQIAGAIAIRFTLQP 320

Query: 361 DSEGQVTFSLAWDCPEVNFMSGKTYYRRYTKFYGTHQNAAANIARDAILEHGSWELQIEA 420
                + F L WD P   F +G  YYRR+T F+      A  IAR A+  + SW+ QI+A
Sbjct: 321 GETRTIPFVLTWDFPVTEFAAGVKYYRRHTDFFDRTGKNAWAIARTALQAYQSWQSQIQA 380

Query: 421 WQRPILEDKRLPEWYPITLFNELYYLNAGGAVWT 454
           WQ+PIL+   LP W+ + LFNELY L +GG +W+
Sbjct: 381 WQQPILDKTDLPNWFKMALFNELYDLTSGGTLWS 414


>gi|75907319|ref|YP_321615.1| hypothetical protein Ava_1096 [Anabaena variabilis ATCC 29413]
 gi|75701044|gb|ABA20720.1| Protein of unknown function DUF608 [Anabaena variabilis ATCC 29413]
          Length = 804

 Score =  225 bits (574), Expect = 7e-56,   Method: Compositional matrix adjust.
 Identities = 139/385 (36%), Positives = 206/385 (53%), Gaps = 28/385 (7%)

Query: 529 IGQFLYLEGIEYLMWNTYDVHFYSSFALIMLFPKIQLSIQRDFAAAVMMHDPSK------ 582
           IGQF  LE ++Y  + + DV  Y SF L+ LFP+++ ++ R FA A+   D +       
Sbjct: 420 IGQFAVLECLDYRWYESLDVRLYGSFGLLQLFPELEKAVMRAFARAIPQGDDTPRVIGYY 479

Query: 583 MKLLDDGQWVSRKVLGAVPHDIGICDP--WFEVNAYCLYDTARWKDLNPKFVLQVYRDVV 640
             +  +     RK  GA PHD+G  +   W + N     D   WKDL   FVLQVYRD +
Sbjct: 480 YTIGAESPIAVRKTPGATPHDLGAPNEHVWEKTNYTSYQDCNLWKDLGCDFVLQVYRDFL 539

Query: 641 ATG--DKKFAKAVWPSVYVAMAYMDQFDRDGDGMIENDGFPDQTYDTWSVSGISAYSGGL 698
            TG  D +F +  W ++   + Y+  FD DGDG+ EN G PDQT+D W + G+SAY GGL
Sbjct: 540 LTGADDVQFLRDCWDAIVETLDYVKTFDLDGDGIPENSGAPDQTFDDWRLQGVSAYCGGL 599

Query: 699 WV----AALQAASALAREVGDRGSEDYFLFK-------FQKAKVVY-EKLWNGSYFNYDN 746
           W+    AA+  +  L +   D  +++  L +         K+  +Y EKLWNG Y+  D+
Sbjct: 600 WMAALAAAIAISDILLQNHQDSETKEKLLLQKSTYETWLTKSLPIYQEKLWNGKYYRLDS 659

Query: 747 SGSSQSSSIQADQLAGQWYARACGLLPIVDEDKARSALKKVYNYNVLKVMGGKRGAVNGM 806
              S S  + ADQL GQ+YA    L  IV  D+A SAL+ VY+   LK   G+ GA NG+
Sbjct: 660 --ESGSDVVMADQLCGQFYANLLELPDIVPSDRAISALQTVYDACFLKFYDGQFGAANGV 717

Query: 807 LPDGRVDM-SSMQSREIWSGVTYAVAASMIHEDLADIGFQTACGIYEAAWSGTGLGYAFQ 865
            PDG  +   +    E+W+G+ + +AA ++   + D GF+    +    ++    G  F+
Sbjct: 718 RPDGSPENPKATHPLEVWTGINFGLAAFLVQMGMKDEGFRLTQAVVAQIYNN---GLQFR 774

Query: 866 TPEAWNTDDQYRSLCYMRPLAIWAM 890
           TPEA      +R+  Y+R +AIWA+
Sbjct: 775 TPEAITAAGTFRASTYLRAMAIWAI 799



 Score =  186 bits (472), Expect = 5e-44,   Method: Compositional matrix adjust.
 Identities = 141/398 (35%), Positives = 193/398 (48%), Gaps = 55/398 (13%)

Query: 95  HGVPLGGVGSGSIGRSYRGEFQRWQI--FPRVCEDKPVLANQFSVFVSRSNGQKYSSVLC 152
           HG+PLGG G+G IGRS RG+F  W I     + ++ P  A QFSVF   SNG    +   
Sbjct: 40  HGMPLGGFGAGCIGRSSRGDFNLWHIDGGEHIFQNFP--ACQFSVF--ESNGTSSQAYAL 95

Query: 153 PKTPEVLKDTTAAGIGSWDW----NLKGDKSTYHALYPRAWTVHEGEPDPELRIVCRQIS 208
              P     T    + SW W           TYHALYPR+W V+E     EL   C Q S
Sbjct: 96  STQP-----TDDGSLKSWQWYPASTATQTTGTYHALYPRSWFVYENVFQAEL--TCEQFS 148

Query: 209 PIIPHNYKESSYPVSVFTYTIYNSGKTSADITLLFTWTNSVGGDSEFTGQHYNSKTKMND 268
           PI  +NY+E+SYPV+VF +  +N       ++++ TW N VG    FT    +   ++ D
Sbjct: 149 PIWANNYQETSYPVAVFVWQAHNPTNAPITLSIMLTWQNMVGW---FTNALKSPDVRVRD 205

Query: 269 ---------------GVHAVLLHHRTSHQ---LPPVTFALAAQETDGVHVSLC------P 304
                          G +   L     H    L  V  A   QE +G   S C      P
Sbjct: 206 DGSPVYEYQPRLGESGGNYNYLDESPQHLGCFLGRVGMAEPLQEGEG---SWCIVTRKHP 262

Query: 305 HFVI----SGNSLGLTAKDMWHEIKEHGSFDRLNSMETSVTSEPGSSIGAAIAASVTVPP 360
              I      N +G T +++W      GS    N ++TS  SE    +GAAIA   T+ P
Sbjct: 263 QVEIFHHTRWNPVG-TGEEVWQSFAADGSL--ANYIDTSPVSE-NEQLGAAIAVRFTLQP 318

Query: 361 DSEGQVTFSLAWDCPEVNFMSGKTYYRRYTKFYGTHQNAAANIARDAILEHGSWELQIEA 420
               ++ F ++WD P   F +G  YYRRYT F+G   N A  IA  A+ ++ +W+ QI+A
Sbjct: 319 GETLEIPFVVSWDLPVTEFAAGVNYYRRYTDFFGKSGNHAWAIATIALEQYQTWQQQIQA 378

Query: 421 WQRPILEDKRLPEWYPITLFNELYYLNAGGAVWTDGSP 458
           WQ PIL    LP+W+ + LFNELY L +GG +W+  +P
Sbjct: 379 WQDPILNRDDLPDWFKMALFNELYDLTSGGTLWSAATP 416


>gi|295829678|gb|ADG38508.1| AT3G24180-like protein [Neslia paniculata]
          Length = 190

 Score =  225 bits (574), Expect = 8e-56,   Method: Compositional matrix adjust.
 Identities = 119/190 (62%), Positives = 146/190 (76%), Gaps = 1/190 (0%)

Query: 647 FAKAVWPSVYVAMAYMDQFDRDGDGMIENDGFPDQTYDTWSVSGISAYSGGLWVAALQAA 706
           F   VWP+V  AM YM+QFDRD D +IENDGFPDQTYDTW+V G+SAY G LW+AALQAA
Sbjct: 1   FGIDVWPAVRAAMEYMEQFDRDNDDLIENDGFPDQTYDTWTVHGVSAYCGCLWLAALQAA 60

Query: 707 SALAREVGDRGSEDYFLFKFQKAKVVYEK-LWNGSYFNYDNSGSSQSSSIQADQLAGQWY 765
           +A+A ++GD+   +    KF  AK   EK LWNGSYFNYD+  SS S SIQ DQLAGQWY
Sbjct: 61  AAMALQIGDKFFAELCKNKFLNAKAALEKKLWNGSYFNYDSGASSNSKSIQTDQLAGQWY 120

Query: 766 ARACGLLPIVDEDKARSALKKVYNYNVLKVMGGKRGAVNGMLPDGRVDMSSMQSREIWSG 825
           A + GL PI +E K RS ++K++++NV+K  GG+ GAVNGM PDG+VD + MQSREIW+G
Sbjct: 121 AASSGLPPIFEESKIRSTMQKIFDFNVMKTKGGRMGAVNGMHPDGKVDETCMQSREIWTG 180

Query: 826 VTYAVAASMI 835
           VTYA AA+MI
Sbjct: 181 VTYAAAATMI 190


>gi|295829670|gb|ADG38504.1| AT3G24180-like protein [Capsella grandiflora]
 gi|295829672|gb|ADG38505.1| AT3G24180-like protein [Capsella grandiflora]
 gi|295829674|gb|ADG38506.1| AT3G24180-like protein [Capsella grandiflora]
 gi|295829676|gb|ADG38507.1| AT3G24180-like protein [Capsella grandiflora]
          Length = 190

 Score =  225 bits (573), Expect = 1e-55,   Method: Compositional matrix adjust.
 Identities = 119/190 (62%), Positives = 146/190 (76%), Gaps = 1/190 (0%)

Query: 647 FAKAVWPSVYVAMAYMDQFDRDGDGMIENDGFPDQTYDTWSVSGISAYSGGLWVAALQAA 706
           F   VWP+V  AM YM+QFDRD D +IENDGFPDQTYDTW+V G+SAY G LW+AALQAA
Sbjct: 1   FGIDVWPAVRAAMEYMEQFDRDNDDLIENDGFPDQTYDTWTVHGVSAYCGCLWLAALQAA 60

Query: 707 SALAREVGDRGSEDYFLFKFQKAKVVYEK-LWNGSYFNYDNSGSSQSSSIQADQLAGQWY 765
           +A+A ++GD+   +    KF  AK   EK LWNGSYFNYD+  SS S SIQ DQLAGQWY
Sbjct: 61  AAMALQIGDKFFAELCKNKFLNAKAALEKKLWNGSYFNYDSGASSNSKSIQTDQLAGQWY 120

Query: 766 ARACGLLPIVDEDKARSALKKVYNYNVLKVMGGKRGAVNGMLPDGRVDMSSMQSREIWSG 825
           A + GL PI +E K +S ++K++++NV+K  GGK GAVNGM PDG+VD + MQSREIW+G
Sbjct: 121 AASSGLPPIFEESKIKSTMQKIFDFNVMKTKGGKMGAVNGMHPDGKVDDTCMQSREIWTG 180

Query: 826 VTYAVAASMI 835
           VTYA AA+MI
Sbjct: 181 VTYAAAATMI 190


>gi|170079447|ref|YP_001736085.1| hypothetical protein SYNPCC7002_A2862 [Synechococcus sp. PCC 7002]
 gi|169887116|gb|ACB00830.1| Conserved hypothetical protein (DUF608) [Synechococcus sp. PCC
           7002]
          Length = 803

 Score =  224 bits (572), Expect = 1e-55,   Method: Compositional matrix adjust.
 Identities = 146/404 (36%), Positives = 211/404 (52%), Gaps = 34/404 (8%)

Query: 518 GTNLLQDGEEN--IGQFLYLEGIEYLMWNTYDVHFYSSFALIMLFPKIQLSIQRDFAAAV 575
           G +L     EN  +GQF  LE ++Y  + + DV  Y SF L ML+P++  ++   FA A+
Sbjct: 404 GGSLWTAATENDPVGQFGVLECLDYCWYESLDVRMYGSFGLTMLWPRLDKAVLEAFARAI 463

Query: 576 MMHDPSKMKLLDDGQWVSRKVLGAVPHDIGICD--PWFEVNAYCLYDTARWKDLNPKFVL 633
              D ++  +  +    +RKV GA PHD+G  +  PW   N     D   WKDL   FVL
Sbjct: 464 PTSDDTQRMIGYNKTMATRKVAGATPHDLGAPNEHPWEATNYTAYQDCNLWKDLGSDFVL 523

Query: 634 QVYRDVVATG--DKKFAKAVWPSVYVAMAYMDQFDRDGDGMIENDGFPDQTYDTWSVSGI 691
           QVYRD V TG  D +F  + W SV  A+ Y+  FD DGDG+ EN G PDQT+D W + GI
Sbjct: 524 QVYRDFVMTGAEDTEFLWSCWGSVLTALDYLKTFDLDGDGIPENGGAPDQTFDDWKLKGI 583

Query: 692 SAYSGGLWVAALQAASALAREVGDRGSEDYF--------LFKFQKAKVVYEK-------- 735
           SAY GGLW+AAL+AA A+A+ +  + S  +F          +F++A   Y+         
Sbjct: 584 SAYCGGLWIAALEAAIAMAKIL--QQSPSHFEAHQNQVSAAEFEQAIATYQTWLDQARPL 641

Query: 736 ----LWNGSYFNYDNSGSSQSSSIQADQLAGQWYARACGLLPIVDEDKARSALKKVYNYN 791
               LWNGSY+  D    S S  + ADQL GQ+Y +   L  +   D+  + L  +Y   
Sbjct: 642 YQDILWNGSYYTLDT--GSNSRVVMADQLCGQYYTQLLSLPDVNPGDRTATTLATIYTAC 699

Query: 792 VLKVMGGKRGAVNGMLPDGRVDMSS-MQSREIWSGVTYAVAASMIHEDLADIGFQTACGI 850
             K  GG+ GA NG+ PDG  +  +     E+W+G+ + +AA MI   +     +    +
Sbjct: 700 FEKFHGGQFGAANGLNPDGTPEKENDTHPLEVWTGINFGIAALMIRNGMQTEALRMVEAV 759

Query: 851 YEAAWSGTGLGYAFQTPEAWNTDDQYRSLCYMRPLAIWAMQWAL 894
            E  +     G  F+TPEA   +  +R+  Y+R + IWA+  AL
Sbjct: 760 VEQVYDN---GLQFRTPEAITANGTFRACHYLRAMGIWAIYDAL 800



 Score =  189 bits (480), Expect = 6e-45,   Method: Compositional matrix adjust.
 Identities = 130/388 (33%), Positives = 178/388 (45%), Gaps = 47/388 (12%)

Query: 95  HGVPLGGVGSGSIGRSYRGEFQRWQIFPRVCEDKPVLANQFSVFVSRSNGQKYSSVLCPK 154
           HG+PLGG G+G IGRS RG+F  W +       +   A QFSVF         +  +   
Sbjct: 41  HGMPLGGFGAGCIGRSPRGDFNLWHLDAGEHTFRNAGACQFSVFEQPEGEAAQAYAMSTV 100

Query: 155 TPEVLKDTTAAGIGSWDWNLKGDKSTYHALYPRAWTVHEGEPDPELRIVCRQISPIIPHN 214
            PE   D T   + SW W     K TY ALYPR+W  ++G    EL   C Q SPII  N
Sbjct: 101 APE---DGT---LDSWQW-YPAKKGTYAALYPRSWYEYQGAFAAEL--TCEQFSPIIAQN 151

Query: 215 YKESSYPVSVFTYTIYNSGKTSADITLLFTWTNSVG------------------------ 250
           Y+E+SYPV+VF +T +N       ++++ TW N VG                        
Sbjct: 152 YQETSYPVAVFEWTAHNPTDKPITVSIMMTWQNMVGWFTKAQKTPEVVLRDDGSPVYEYQ 211

Query: 251 ---GDSEFT-GQHYNSKTKMNDGVHAVLLHHRTSHQLPPVTFALAAQETDGVHVSLCPHF 306
              GDS     Q    + ++   ++ V  H   S     + FA     T      L   +
Sbjct: 212 SRWGDSTGNLNQWVQDRYRVGCLLNRVRPHADLSEGEGQLCFATVTNPT------LEVFY 265

Query: 307 VISGNSLGLTAKDMWHEIKEHGSFDRLNSMETSVTSEPGSSIGAAIAASVTVPPDSEGQV 366
               N +G   +D+W     +GS   L  +E    + PG  I  AIA   T+ P    ++
Sbjct: 266 HNRWNPVG-DGRDVWDYFAGNGS---LPDIEDETPAAPGEQIAGAIAVRFTLRPGKTKKI 321

Query: 367 TFSLAWDCPEVNFMSGKTYYRRYTKFYGTHQNAAANIARDAILEHGSWELQIEAWQRPIL 426
            F LAWD P + F    TY+RRYT F+G   N    + R A+     W+ +I AWQ+PIL
Sbjct: 322 PFFLAWDLPVMEFAPDITYFRRYTDFFGRSGNNVWTMIRTAMKHSDLWKERIAAWQQPIL 381

Query: 427 EDKRLPEWYPITLFNELYYLNAGGAVWT 454
               LP+W  + LFNELY L  GG++WT
Sbjct: 382 GRDDLPDWLKMALFNELYLLTQGGSLWT 409


>gi|345291087|gb|AEN82035.1| AT3G24180-like protein, partial [Capsella rubella]
 gi|345291089|gb|AEN82036.1| AT3G24180-like protein, partial [Capsella rubella]
 gi|345291091|gb|AEN82037.1| AT3G24180-like protein, partial [Capsella rubella]
 gi|345291093|gb|AEN82038.1| AT3G24180-like protein, partial [Capsella rubella]
 gi|345291095|gb|AEN82039.1| AT3G24180-like protein, partial [Capsella rubella]
 gi|345291097|gb|AEN82040.1| AT3G24180-like protein, partial [Capsella rubella]
 gi|345291099|gb|AEN82041.1| AT3G24180-like protein, partial [Capsella rubella]
 gi|345291101|gb|AEN82042.1| AT3G24180-like protein, partial [Capsella rubella]
          Length = 191

 Score =  224 bits (572), Expect = 1e-55,   Method: Compositional matrix adjust.
 Identities = 119/190 (62%), Positives = 146/190 (76%), Gaps = 1/190 (0%)

Query: 647 FAKAVWPSVYVAMAYMDQFDRDGDGMIENDGFPDQTYDTWSVSGISAYSGGLWVAALQAA 706
           F   VWP+V  AM YM+QFDRD D +IENDGFPDQTYDTW+V G+SAY G LW+AALQAA
Sbjct: 1   FGIDVWPAVRAAMEYMEQFDRDNDDLIENDGFPDQTYDTWTVHGVSAYCGCLWLAALQAA 60

Query: 707 SALAREVGDRGSEDYFLFKFQKAKVVYEK-LWNGSYFNYDNSGSSQSSSIQADQLAGQWY 765
           +A+A ++GD+   +    KF  AK   EK LWNGSYFNYD+  SS S SIQ DQLAGQWY
Sbjct: 61  AAMALQIGDKFFAELCKNKFLNAKAALEKKLWNGSYFNYDSGASSNSKSIQTDQLAGQWY 120

Query: 766 ARACGLLPIVDEDKARSALKKVYNYNVLKVMGGKRGAVNGMLPDGRVDMSSMQSREIWSG 825
           A + GL PI +E K +S ++K++++NV+K  GGK GAVNGM PDG+VD + MQSREIW+G
Sbjct: 121 AASSGLPPIFEESKIKSTMQKIFDFNVMKTKGGKMGAVNGMHPDGKVDDTCMQSREIWTG 180

Query: 826 VTYAVAASMI 835
           VTYA AA+MI
Sbjct: 181 VTYAAAATMI 190


>gi|282897193|ref|ZP_06305195.1| Protein of unknown function DUF608 [Raphidiopsis brookii D9]
 gi|281197845|gb|EFA72739.1| Protein of unknown function DUF608 [Raphidiopsis brookii D9]
          Length = 798

 Score =  224 bits (570), Expect = 2e-55,   Method: Compositional matrix adjust.
 Identities = 152/402 (37%), Positives = 218/402 (54%), Gaps = 42/402 (10%)

Query: 518 GTNLLQDGEEN--IGQFLYLEGIEYLMWNTYDVHFYSSFALIMLFPKIQLSIQRDFAAAV 575
           G  L   G E   +GQF  LE ++Y  + + DV  Y SF L++LFP+++ ++ R FA A+
Sbjct: 405 GGTLWSAGTEKDPMGQFAVLECLDYRWYESLDVRLYGSFGLLILFPELEKAVIRAFARAI 464

Query: 576 MMHD--------------PSKMKLLDDGQWVSRKVLGAVPHDIGICDPW-FEVNAYCLY- 619
              D              PS + L        RKV GA PHD+G  + W +E   Y  Y 
Sbjct: 465 PQSDDRSRIIGYYLTINSPSPVAL--------RKVAGATPHDLGAPNEWVWEQTNYTSYQ 516

Query: 620 DTARWKDLNPKFVLQVYRDVVATG--DKKFAKAVWPSVYVAMAYMDQFDRDGDGMIENDG 677
           D   WKDL   FVLQVYRD + TG  D +F    W  + + + Y+ QFD D DG+ EN G
Sbjct: 517 DCNLWKDLGCDFVLQVYRDFLFTGGNDIEFLVDCWYGIVLTLDYLKQFDIDADGIPENSG 576

Query: 678 FPDQTYDTWSVSGISAYSGGLWVAALQAASALAR------EVGDRGSED--YFLFKFQKA 729
            PDQT+D W +SG+SAY G LW+AAL+AA A+        E+ + G +   Y  +  Q  
Sbjct: 577 APDQTFDDWRLSGVSAYCGALWLAALEAAIAICDVLMNHPELSNVGEQRSIYENWLNQSL 636

Query: 730 KVVYEKLWNGSYFNYDNSGSSQSSSIQADQLAGQWYARACGLLPIVDEDKARSALKKVYN 789
            V  +KLWNG ++  D+   S S+ + ADQL+GQ+YAR   L  IV +D+A SAL  +Y+
Sbjct: 637 PVYQQKLWNGKFYRLDS--ESGSNVVMADQLSGQFYARLLNLPDIVPQDRALSALTTIYD 694

Query: 790 YNVLKVMGGKRGAVNGMLPDGRVDM-SSMQSREIWSGVTYAVAASMIHEDLADIGFQTAC 848
              LK   GK GA NG+LPDG  +   +    E+W+G+ + +AA ++   + D G +   
Sbjct: 695 ACFLKFQDGKFGAANGVLPDGSPENPQATHPLEVWTGINFGLAAFLLQMGMKDQGLRLTQ 754

Query: 849 GIYEAAWSGTGLGYAFQTPEAWNTDDQYRSLCYMRPLAIWAM 890
            +    +     G  F+TPEA      +R+  Y+RP+AIWA+
Sbjct: 755 AVVRQIYDH---GLQFRTPEAITAAGTFRASTYLRPMAIWAI 793



 Score =  209 bits (531), Expect = 9e-51,   Method: Compositional matrix adjust.
 Identities = 138/400 (34%), Positives = 197/400 (49%), Gaps = 61/400 (15%)

Query: 95  HGVPLGGVGSGSIGRSYRGEFQRWQIFPRVCEDKPVLANQFSVFVSRSNGQKYSSVLCPK 154
           HG+PLGG G+G IGRS RG+F  W I       K V   QFS+F S   GQ Y+  L  +
Sbjct: 38  HGMPLGGFGAGCIGRSSRGDFNLWHIDGGEHVFKNVPGCQFSIFTS---GQAYA--LSTQ 92

Query: 155 TPEVLKDTTAAGIGSWDW---------NLKGDKSTYHALYPRAWTVHEGEPDPELRIVCR 205
            PE   D T   + +W W           K D  TYHALYPR+W V+E      L+  C 
Sbjct: 93  PPE---DNT---LQTWQWYPNVVHSSTGTKTDGGTYHALYPRSWFVYENLF--PLQFTCE 144

Query: 206 QISPIIPHNYKESSYPVSVFTYTIYNSGKTSADITLLFTWTNSVGGDSEFTGQHYNSKTK 265
           Q SP+  HNY+E+SYPV+VF + I+N        +++ +W N  G    FT    + + K
Sbjct: 145 QFSPVWAHNYRETSYPVAVFLWNIHNPTNQPITASIMLSWENMTGW---FTNALKSPQVK 201

Query: 266 M--------------------------NDGVHAVLLHHRTSH--QLPPVTFALAAQETDG 297
           +                          ND     +L   T+H  Q    T+ +A  +   
Sbjct: 202 IRDDGSPVYEYQPRWGESQGNYNWLAENDQYLGCVLGRATNHPVQEGDGTWCIATVKNPQ 261

Query: 298 VHVSLCPHFVISGNSLGLTAKDMWHEIKEHGSFDRLNSMETSVTSEPGSSIGAAIAASVT 357
           V +     F   GN      +++W +  ++GS   L++ +    ++  S +GAA+A   T
Sbjct: 262 VELFYHCQFNPVGN-----GEELWRDFSQNGS---LSNYQDETPADVNSRLGAAVAVRFT 313

Query: 358 VPPDSEGQVTFSLAWDCPEVNFMSGKTYYRRYTKFYGTHQNAAANIARDAILEHGSWELQ 417
           + P     V F L+WD P   F +G  YYRRYT F+GT  + A  IA  A+ E  +W   
Sbjct: 314 ISPGETFTVPFVLSWDFPVTEFAAGVNYYRRYTDFFGTTGDNAWQIATCALKECYNWRSH 373

Query: 418 IEAWQRPILEDKRLPEWYPITLFNELYYLNAGGAVWTDGS 457
           IE+WQ+PILE + LP W+ + LFNELY L +GG +W+ G+
Sbjct: 374 IESWQKPILEREDLPSWFKMALFNELYDLTSGGTLWSAGT 413


>gi|254431466|ref|ZP_05045169.1| conserved hypothetical protein [Cyanobium sp. PCC 7001]
 gi|197625919|gb|EDY38478.1| conserved hypothetical protein [Cyanobium sp. PCC 7001]
          Length = 837

 Score =  223 bits (568), Expect = 3e-55,   Method: Compositional matrix adjust.
 Identities = 138/387 (35%), Positives = 207/387 (53%), Gaps = 25/387 (6%)

Query: 526 EENIGQFLYLEGIEYLMWNTYDVHFYSSFALIMLFPKIQLSIQRDFAAAVMMHDPSKMKL 585
           E+ +G+F  LE ++Y  + + DV  Y SFAL+ L+P++  ++ R FA A+   DP+   +
Sbjct: 440 EDPVGRFGVLECLDYAWYESLDVRLYGSFALLQLWPELDKAVLRSFARAIPAADPTPRPI 499

Query: 586 ---LDDGQW---VSRKVLGAVPHDIGICD--PWFEVNAYCLYDTARWKDLNPKFVLQVYR 637
                 G+      RKV GA PHD+G  +  PW   N     D   WKDL   FVLQV+R
Sbjct: 500 GWYFTQGKGRVEAPRKVAGATPHDLGAPNERPWDATNYTAYQDCNLWKDLASDFVLQVWR 559

Query: 638 D--VVATG-DKKFAKAVWPSVYVAMAYMDQFDRDGDGMIENDGFPDQTYDTWSVSGISAY 694
              +  TG D +F    WP+   A+ Y+  FD + DG+ +N G PDQT+D W + G+SAY
Sbjct: 560 TFRLAPTGEDLRFLADCWPAAVEALRYLKGFDANSDGLPDNGGAPDQTFDDWPLKGVSAY 619

Query: 695 SGGLWVAALQAASALAR----EVG-DRGSEDY-FLFKFQKAKVVYEK-LWNGSYFNYDNS 747
            G LW+AAL+AA A+ +    E+G D  S+   F    ++++  +++ LWNG Y+N D  
Sbjct: 620 CGALWIAALEAALAMGQRLQLELGLDTSSQQRDFSSWLEQSRANFDRLLWNGEYYNID-- 677

Query: 748 GSSQSSSIQADQLAGQWYARACGLLPIVDEDKARSALKKVYNYNVLKVMGGKRGAVNGML 807
             S +  + ADQL G +YAR   L P+V E++ARSAL+ +         GG  G  NG+ 
Sbjct: 678 ADSGTPVVMADQLCGDFYARLLELPPVVAEERARSALRAIREACFEGFQGGSLGVANGLR 737

Query: 808 PDGR-VDMSSMQSREIWSGVTYAVAASMIHEDLADIGFQTACGIYEAAWSGTGLGYAFQT 866
            DG  +D       E+W+G+ + +AA        D        +    + G   G  F+T
Sbjct: 738 RDGTPLDPDGTHPLEVWTGINFGLAAYYRLMGETDTALAITGAVVHQVYGG---GLQFRT 794

Query: 867 PEAWNTDDQYRSLCYMRPLAIWAMQWA 893
           PEA    + +R+  Y+R +AIWA+ WA
Sbjct: 795 PEAITAVNTFRACHYLRAMAIWAL-WA 820



 Score =  157 bits (398), Expect = 2e-35,   Method: Compositional matrix adjust.
 Identities = 118/413 (28%), Positives = 185/413 (44%), Gaps = 49/413 (11%)

Query: 85  PFIKRHLTS-----SHGVPLGGVGSGSIGRSYRGEFQRWQIFPRVCEDKPVLANQFSVFV 139
           P+  R+ ++     +HG+PLGG G+G IGR+  G F  W +         +   QF++F 
Sbjct: 37  PYTVRYASNLDDGPNHGMPLGGFGAGCIGRAPDGSFNLWNLDGGEHWFGTLPDCQFALFE 96

Query: 140 SRSNGQKYSSVLCPKTPEVLKDTTAAGIGSWDW----NLKGDKSTYHALYPRAWTVHEGE 195
              NG +  ++      +  +      + +W W           TY A YP +WT   G 
Sbjct: 97  RDGNGSRAHALAVAPEADASRPDAGPPLSAWSWYPASTADTSTGTYAARYPLSWTSFSGV 156

Query: 196 PDPELRIVCRQISPIIPHNYKESSYPVSVFTYTIYNSGKTSADITLLFTWTNSVG----- 250
                R+ C   SPI+P +Y+ SSYPV+VFT+++ N  +   +++LL +W N+ G     
Sbjct: 157 F--SARVGCEAFSPILPGDYERSSYPVAVFTWSLSNPTRRPLELSLLLSWRNTTGWFTNT 214

Query: 251 -----------GDSEFT-----GQHYNSKTK-MNDG-VHAVLLHHRTSHQLPP------- 285
                      G  E       G+    + + ++DG +  VLL    S  +         
Sbjct: 215 DPAAAVHFRDDGSPEHNYVPAIGRSEGQRNRWVDDGPLKGVLLEGPISDPIAEGQGQWCI 274

Query: 286 VTFALAAQETDGVHVSLCPHFVISGNSLGLTAKDMWHEIKEHGSFDRLNSMETSVTSEPG 345
            T A       G+ +  C  +  +G+       ++W      GS    N+   S  ++P 
Sbjct: 275 ATDAGLEARHPGLRIHRCSRWNPAGD-----GAELWESFARDGSIPDSNNDRHSGANDPA 329

Query: 346 SSIGAAIAASVTVPPDSEGQVTFSLAWDCPEVNFMSGKTYYRRYTKFYGTHQNAAANIAR 405
           S   AA+A  + + P +   +   ++WD P   F +G    RRYT  +G     AA IA 
Sbjct: 330 S---AALAVQLRLEPGASIDIPLVISWDLPVTAFATGSRALRRYTDHFGADGTNAAAIAA 386

Query: 406 DAILEHGSWELQIEAWQRPILEDKRLPEWYPITLFNELYYLNAGGAVWTDGSP 458
           +A+    SW  QI AWQ+P+LE   LPE   + LFNELY L +GG +WT  SP
Sbjct: 387 EALGSWPSWREQIAAWQQPVLERTDLPEPLRMALFNELYDLASGGTLWTAASP 439


>gi|434391596|ref|YP_007126543.1| Glucosylceramidase [Gloeocapsa sp. PCC 7428]
 gi|428263437|gb|AFZ29383.1| Glucosylceramidase [Gloeocapsa sp. PCC 7428]
          Length = 809

 Score =  223 bits (568), Expect = 4e-55,   Method: Compositional matrix adjust.
 Identities = 145/405 (35%), Positives = 210/405 (51%), Gaps = 39/405 (9%)

Query: 524 DGEENIGQFLYLEGIEYLMWNTYDVHFYSSFALIMLFPKIQLSIQRDFAAAVMMHDPSKM 583
           D  + +GQF  LE  +Y  + + DV  Y SFAL+ML+ K++ S+ R FA A+   D  + 
Sbjct: 410 DDRDPVGQFAVLECFDYRWYESLDVRLYGSFALLMLWSKLEKSVIRAFARAIGQRD-DRT 468

Query: 584 KLLD-------DGQWVSRKVLGAVPHDIGICD--PWFEVNAYCLYDTARWKDLNPKFVLQ 634
           +++        +     RKV GA PHD+G  +  PW   N     D   WKDL+  FVLQ
Sbjct: 469 RVIGYYYTQGLESPTALRKVAGATPHDLGAPNEHPWEATNYTSYQDCNLWKDLSCDFVLQ 528

Query: 635 VYRDVVATG--DKKFAKAVWPSVYVAMAYMDQFDRDGDGMIENDGFPDQTYDTWSVSGIS 692
           VYRD V TG  D +     W +V   + Y+  FD D DG+ EN G PDQT+D W + G+S
Sbjct: 529 VYRDFVLTGGTDWELLWDCWSAVVQTLTYLKTFDLDEDGIPENSGAPDQTFDDWRLQGVS 588

Query: 693 AYSGGLWVAAL----------------QAASALAREVGDRG----SEDYFLFKFQKAKVV 732
           AY GGLW+AAL                  AS +     D      + D F     +++ +
Sbjct: 589 AYCGGLWLAALEAAIAIGKTLLSYPDDHPASKILASAPDYPPIPETLDVFQSWLMRSRPI 648

Query: 733 Y-EKLWNGSYFNYDNSGSSQSSSIQADQLAGQWYARACGLLPIVDEDKARSALKKVYNYN 791
           Y EKLWNG Y+  D+   S S  + ADQL GQ+YAR  GL  IV  + A  ALK VY+  
Sbjct: 649 YQEKLWNGQYYRLDS--KSGSDVVMADQLCGQFYARLLGLPDIVPPECATQALKTVYDAC 706

Query: 792 VLKVMGGKRGAVNGMLPDGRVDM-SSMQSREIWSGVTYAVAASMIHEDLADIGFQTACGI 850
            +K   G+ GA NG+ PDG  +  ++    E+W+G+ + +AA ++   +    ++    +
Sbjct: 707 FVKFHNGQFGAANGLKPDGLPENPNATHPLEVWTGINFGLAAFLMQMGMKSEAWRITQAV 766

Query: 851 YEAAWSGTGLGYAFQTPEAWNTDDQYRSLCYMRPLAIWAMQWALT 895
            +  +     G  F+TPEA      +R+  Y+RP+AIWA+   LT
Sbjct: 767 VQQVYDN---GLQFRTPEAITAKGTFRACHYLRPMAIWAIYRVLT 808



 Score =  172 bits (435), Expect = 1e-39,   Method: Compositional matrix adjust.
 Identities = 131/414 (31%), Positives = 196/414 (47%), Gaps = 52/414 (12%)

Query: 95  HGVPLGGVGSGSIGRSYRGEFQRWQIFPRVCEDKPVLANQFSVFVSRSNGQKYSSVLCPK 154
           HG+PLGG G+G IGR+ RG+F  W I       K + A QFS+F  +++  + +  LC  
Sbjct: 40  HGMPLGGFGAGCIGRASRGDFNLWHIDGGEHTFKSLPACQFSIFEQQASTSQ-AFALCTL 98

Query: 155 TPEVLKDTTAAGIGSWDWNLK-GDKSTYHALYPRAWTVHEGEPDPELRIVCRQISPIIPH 213
            P    D T   + +W W  +  +   YHALYPR+W V+E      L   C Q SPI+P 
Sbjct: 99  PP---ADGT---LTTWQWYPQHCENGVYHALYPRSWFVYENVFQAHLS--CEQFSPILPE 150

Query: 214 NYKESSYPVSVFTYTIYNSGKTSADITLLFTWTNSVGGDSEFTGQHYNSKTKMNDG---- 269
           NY+E+SYPV+VF +  +N       ++++ TW N+VG    F     N + +M D     
Sbjct: 151 NYQETSYPVAVFVWRAHNPTNAPLTLSIMLTWQNTVGW---FQNAIKNPEIRMRDDGSPV 207

Query: 270 -----------------------VHAVLLHHRTSHQLPPVTFALAAQETDGVHVSLCPHF 306
                                     VL    +  Q     +A+A  +   + V     +
Sbjct: 208 YEYEPCLGNNLGNFNRLIAKDNLCGCVLERVASDLQDGIGQWAIATPKQPNIEVFYHTRW 267

Query: 307 VISGNSLGLTAKDMWHEIKEHGSFDRLNSMETSVTSEPGSSIGAAIAASVTVPPDSEGQV 366
             SG+       ++WH     GS   L + E   ++     + AAIA   T+ P    ++
Sbjct: 268 NPSGD-----GAEIWHSFARDGS---LPNTEDKTSASEQEQLAAAIAVRFTLQPGETREL 319

Query: 367 TFSLAWDCPEVNFMSGKTYYRRYTKFYGTHQNAAANIARDAILEHGSWELQIEAWQRPIL 426
            F++AWD P   F  G  Y+RRYT F+G   + A  IA+ A+  + SW+ QI+AWQ+PIL
Sbjct: 320 PFAIAWDFPITEFAPGVKYFRRYTDFFGRGGDNAVAIAQTALQNYQSWQQQIQAWQQPIL 379

Query: 427 EDKRLPEWYPITLFNELYYLNAGGAVWT--DGSPPVHSLVTIGHRKFSLDWSQS 478
           +   LP  + + LFNELY L +GG +W+  D   PV     +    F   W +S
Sbjct: 380 DRDDLPSVFKMALFNELYDLTSGGTLWSAADDRDPVGQFAVL--ECFDYRWYES 431


>gi|428307493|ref|YP_007144318.1| glucosylceramidase [Crinalium epipsammum PCC 9333]
 gi|428249028|gb|AFZ14808.1| Glucosylceramidase [Crinalium epipsammum PCC 9333]
          Length = 811

 Score =  222 bits (565), Expect = 8e-55,   Method: Compositional matrix adjust.
 Identities = 146/391 (37%), Positives = 211/391 (53%), Gaps = 32/391 (8%)

Query: 527 ENIGQFLYLEGIEYLMWNTYDVHFYSSFALIMLFPKIQLSIQRDFAAAVMMHDPSKMKLL 586
           + IGQF  LE ++Y  + + DV  Y SFAL ML+PK+  ++   +A A+   D +   + 
Sbjct: 421 DGIGQFGVLECLDYRWYESLDVRLYGSFALAMLWPKLDKAVLEAYARAIPTSDNTPRIIG 480

Query: 587 DDGQWVSRKVLGAVPHDIGICD--PWFEVNAYCLYDTARWKDLNPKFVLQVYRDVVATG- 643
            +     RK   A PHD+G  +  PW + N     D   WKDL   FVLQVYRD + TG 
Sbjct: 481 YNQASAIRKAANATPHDLGAPNEHPWEKTNYTSYQDCNLWKDLGSDFVLQVYRDYLLTGA 540

Query: 644 -DKKFAKAVWPSVYVAMAYMDQFDRDGDGMIENDGFPDQTYDTWSVSGISAYSGGLWVAA 702
            D +F    W ++   +AY+ +FD DGDG+ EN G PDQT+D W + G+SAY GGLW+AA
Sbjct: 541 TDIEFLSECWSAIVKTLAYLKEFDLDGDGIPENSGAPDQTFDDWRLQGVSAYCGGLWLAA 600

Query: 703 LQAASALAREVGDRGSE----DY------FLFKFQKAKVVY-EKLWNGSYFNYDNSGSSQ 751
           L+AA A+   + + G E    D+      +    +++K +Y EKLWNG Y+  D+   S 
Sbjct: 601 LEAAIAIGEILINHGVELEDGDFESAIATYKSWLEQSKPLYQEKLWNGQYYQLDS--ESG 658

Query: 752 SSSIQADQLAGQWYARACGLLPIVDEDKARSALKKVYNYNVLK-----------VMGGKR 800
           S+ + ADQL GQ+YAR   L  IV  D   SALK VY+   LK           +     
Sbjct: 659 SNVVMADQLCGQFYARLLNLPDIVPNDCVESALKTVYDSCFLKFNQKLSQHSESLTPTFI 718

Query: 801 GAVNGMLPDGRVDM-SSMQSREIWSGVTYAVAASMIHEDLADIGFQTACGIYEAAWSGTG 859
           GA NG+L DG  +   +    E+W+G+ + +AA MI   + D  FQ    + +  ++   
Sbjct: 719 GAANGVLADGSPENPQATHPLEVWTGINFGLAAFMIQMGMKDEAFQLTEAVVQQIYNN-- 776

Query: 860 LGYAFQTPEAWNTDDQYRSLCYMRPLAIWAM 890
            G  F+TPEA      +R+  Y+R +AIWA+
Sbjct: 777 -GLQFRTPEAITAAGTFRASHYLRAMAIWAI 806



 Score =  210 bits (535), Expect = 3e-51,   Method: Compositional matrix adjust.
 Identities = 139/393 (35%), Positives = 190/393 (48%), Gaps = 45/393 (11%)

Query: 95  HGVPLGGVGSGSIGRSYRGEFQRWQIFPRVCEDKPVLANQFSVFVSRSNGQKYSSVLCPK 154
           HG+PLGG G+G +GRS RG+F  W I       K + A QFSVF    N    +  +C +
Sbjct: 41  HGMPLGGFGAGCLGRSSRGDFNLWHIDGGEHTFKSLPACQFSVFEQAGNSAPQAYAMCTE 100

Query: 155 TPEVLKDTTAAGIGSWDWNLKGDKST-----YHALYPRAWTVHEGEPDPELRIVCRQISP 209
            PE   D T   + +W W  K  K T     YHALYPR+W V++G    +L   C Q SP
Sbjct: 101 PPE---DGT---LSTWQWYPKTGKGTESSGTYHALYPRSWFVYDGVFQAQL--TCEQFSP 152

Query: 210 IIPHNYKESSYPVSVFTYTIYNSGKTSADITLLFTWTNSVGGDSEFTGQHYNSKTKMNDG 269
           I   NY+ESSYPV VF +T +N       ++++ TW NSVG    FT      + K+ D 
Sbjct: 153 IWADNYQESSYPVGVFEWTAHNPTDQPITLSIMLTWQNSVGW---FTNAIKTPQVKVRDD 209

Query: 270 VHAV------------------LLHHRTSHQLPPVTFALAAQETDGVHVSLCPHFVISGN 311
              V                  + H R    L  V      QE +G          I+  
Sbjct: 210 GSPVYEYQPRWKDSTGNLNQWIVDHFRAGCLLDRVRIHDEVQEGEGQWAIA----TITNP 265

Query: 312 SLGLTAKDMWH------EIKEHGSFD-RLNSMETSVTSEPGSSIGAAIAASVTVPPDSEG 364
           S+ +     W+      EI ++ +FD  L   E    ++PG  I AA+    TV P    
Sbjct: 266 SVEVFYHTRWNPAGSGEEIWDNFAFDGSLRDKEDETPADPGEQIAAAMTIRFTVRPGRTR 325

Query: 365 QVTFSLAWDCPEVNFMSGKTYYRRYTKFYGTHQNAAANIARDAILEHGSWELQIEAWQRP 424
           ++ F LAWD P   F +G  YYRRYT F+G   N A +I R A+    +W+ QI++WQ  
Sbjct: 326 KIPFVLAWDFPVTEFATGVNYYRRYTDFFGRTGNNAWSIVRTALKHDDTWKEQIQSWQEL 385

Query: 425 ILEDKRLPEWYPITLFNELYYLNAGGAVWTDGS 457
            L++  LP+W+ + LFNELY L  GG +WT  S
Sbjct: 386 TLKNANLPDWFKMALFNELYILADGGTLWTAAS 418


>gi|282900195|ref|ZP_06308151.1| protein of unknown function DUF608 [Cylindrospermopsis raciborskii
           CS-505]
 gi|281194944|gb|EFA69885.1| protein of unknown function DUF608 [Cylindrospermopsis raciborskii
           CS-505]
          Length = 789

 Score =  222 bits (565), Expect = 1e-54,   Method: Compositional matrix adjust.
 Identities = 152/402 (37%), Positives = 218/402 (54%), Gaps = 42/402 (10%)

Query: 518 GTNLLQDGEEN--IGQFLYLEGIEYLMWNTYDVHFYSSFALIMLFPKIQLSIQRDFAAAV 575
           G  L   G E   +GQF  LE ++Y  + + DV  Y SF L++LFP+++ ++ R FA A+
Sbjct: 396 GGTLWSAGTEKDPMGQFAVLECLDYRWYESLDVRLYGSFGLLILFPELEKAVIRAFARAI 455

Query: 576 MMHD--------------PSKMKLLDDGQWVSRKVLGAVPHDIGICDPW-FEVNAYCLY- 619
              D              PS M L        RKV GA PHD+G  + W +E   Y  Y 
Sbjct: 456 PQSDDRSRIIGYYLTINSPSPMAL--------RKVAGATPHDLGAPNEWVWEQTNYTSYQ 507

Query: 620 DTARWKDLNPKFVLQVYRDVVATG--DKKFAKAVWPSVYVAMAYMDQFDRDGDGMIENDG 677
           D   WKDL   FVLQVYRD + TG  D +F    W  + + + Y+ QFD DGDG+ EN G
Sbjct: 508 DCNLWKDLGCDFVLQVYRDFLFTGANDIEFLVDCWHGMVLTLDYLKQFDMDGDGIPENSG 567

Query: 678 FPDQTYDTWSVSGISAYSGGLWVAALQAASALAR------EVGDRGSED--YFLFKFQKA 729
            PDQT+D W +SG+SAY G LW+AAL+AA A+        E+ + G +   Y  +  Q  
Sbjct: 568 APDQTFDDWRLSGVSAYCGALWLAALEAAIAICDVLINRPELPNVGEQKSIYEHWLNQSL 627

Query: 730 KVVYEKLWNGSYFNYDNSGSSQSSSIQADQLAGQWYARACGLLPIVDEDKARSALKKVYN 789
            V  +KLWNG ++  D+   S S+ + ADQL+GQ+YAR   L  IV +D+A SAL  +Y+
Sbjct: 628 PVYQQKLWNGKFYRLDS--ESGSNVVMADQLSGQFYARLLNLPDIVPKDRALSALTTIYD 685

Query: 790 YNVLKVMGGKRGAVNGMLPDGRVDM-SSMQSREIWSGVTYAVAASMIHEDLADIGFQTAC 848
              LK   GK GA NG+L DG  +   +    E+W+G+ + +AA ++   + + G +   
Sbjct: 686 ACFLKFQDGKFGAANGVLLDGSPENPQATHPLEVWTGINFGLAAFLLQMGMKEQGLRLTE 745

Query: 849 GIYEAAWSGTGLGYAFQTPEAWNTDDQYRSLCYMRPLAIWAM 890
            +    +     G  F+TPEA      +R+  Y+RP+AIWA+
Sbjct: 746 AVVRQVYEH---GLQFRTPEAITAAGTFRASTYLRPMAIWAV 784



 Score =  215 bits (547), Expect = 1e-52,   Method: Compositional matrix adjust.
 Identities = 138/391 (35%), Positives = 196/391 (50%), Gaps = 52/391 (13%)

Query: 95  HGVPLGGVGSGSIGRSYRGEFQRWQIFPRVCEDKPVLANQFSVFVSRSNGQKYSSVLCPK 154
           HG+PLGG G+G IGRS RG+F  W I       K V   QFS+F S   GQ Y+  L  +
Sbjct: 38  HGMPLGGFGAGCIGRSSRGDFNLWHIDGGEHVFKNVPGCQFSIFTS---GQAYA--LSTQ 92

Query: 155 TPEVLKDTTAAGIGSWDWNLKGDKSTYHALYPRAWTVHEGEPDPELRIVCRQISPIIPHN 214
            PE   D T   + +W W    D  TYHALYPR+W V+E      L+  C Q SP+  HN
Sbjct: 93  PPE---DNT---LQTWQWYPNTDGGTYHALYPRSWFVYENVF--PLQFTCEQFSPVWAHN 144

Query: 215 YKESSYPVSVFTYTIYNSGKTSADITLLFTWTNSVGGDSEFTGQHYNSKTKM-------- 266
           Y+E+SYPV+VF + I+N        +++ +W N  G    FT    + + K+        
Sbjct: 145 YRETSYPVAVFLWNIHNPTNQPLTASIMLSWENMTGW---FTNALKSPQVKIRDDGSPVY 201

Query: 267 ------------------NDGVHAVLLHHRTSH--QLPPVTFALAAQETDGVHVSLCPHF 306
                             ND     +L   T H  Q    T+ +A  +   V +     F
Sbjct: 202 EYQPRWGESQGNYNWLAENDQYLGCVLGRATDHPIQEGDGTWCIATVKNPQVELFYHCQF 261

Query: 307 VISGNSLGLTAKDMWHEIKEHGSFDRLNSMETSVTSEPGSSIGAAIAASVTVPPDSEGQV 366
              GN      +++W +  ++GS   L++ +    ++  S +GAA+A   T+PP     V
Sbjct: 262 NPLGN-----GEEIWRDFSQNGS---LSNYQDETPADVNSRLGAAVAVRFTLPPGETLTV 313

Query: 367 TFSLAWDCPEVNFMSGKTYYRRYTKFYGTHQNAAANIARDAILEHGSWELQIEAWQRPIL 426
            F L+WD P   F +G  YYRRYT F+GT  + A  IA  A+ E  +W   IE+WQ+PIL
Sbjct: 314 PFVLSWDFPVTEFAAGVNYYRRYTDFFGTTGDNAWQIATCALKECYNWRSHIESWQKPIL 373

Query: 427 EDKRLPEWYPITLFNELYYLNAGGAVWTDGS 457
           E + LP W+ + LFNELY L +GG +W+ G+
Sbjct: 374 EREDLPSWFKMALFNELYDLTSGGTLWSAGT 404


>gi|428299644|ref|YP_007137950.1| glucosylceramidase [Calothrix sp. PCC 6303]
 gi|428236188|gb|AFZ01978.1| Glucosylceramidase [Calothrix sp. PCC 6303]
          Length = 804

 Score =  221 bits (563), Expect = 2e-54,   Method: Compositional matrix adjust.
 Identities = 135/386 (34%), Positives = 204/386 (52%), Gaps = 27/386 (6%)

Query: 526 EENIGQFLYLEGIEYLMWNTYDVHFYSSFALIMLFPKIQLSIQRDFAAAVMMHDPSKMKL 585
           ++ +GQF  LE ++Y  + + DV  Y SF L+MLFP+++ ++ R FA A+   D  K ++
Sbjct: 415 KDPVGQFAVLECLDYRWYESLDVRLYGSFGLLMLFPELEKAVMRAFARAIPASD-EKTRV 473

Query: 586 LD-------DGQWVSRKVLGAVPHDIGICDP--WFEVNAYCLYDTARWKDLNPKFVLQVY 636
           +        +     RK  GA PHD+G  +   W + N     D   WKDL   FVLQVY
Sbjct: 474 IGYYYTIGAESPLAIRKAAGATPHDLGAPNEHVWEKTNYTSYQDCNLWKDLGSDFVLQVY 533

Query: 637 RDVVATGDK--KFAKAVWPSVYVAMAYMDQFDRDGDGMIENDGFPDQTYDTWSVSGISAY 694
           RD V TG K  +F    W ++ + + Y+  FD+D DG+ EN G PDQT+D W + G+SAY
Sbjct: 534 RDFVFTGSKDIEFLAECWDAIALTLEYLKAFDKDHDGIPENSGAPDQTFDDWKLQGVSAY 593

Query: 695 SGGLW----VAALQAASALAREVGDR----GSEDYFLFKFQKAKVVY-EKLWNGSYFNYD 745
            GGLW     AA+     L    GD+      +  +     K++ +Y +KLWNG Y+  D
Sbjct: 594 CGGLWLAALEAAIAICDILLENHGDQEKLTAQKIIYTDWLAKSRPIYQQKLWNGQYYRLD 653

Query: 746 NSGSSQSSSIQADQLAGQWYARACGLLPIVDEDKARSALKKVYNYNVLKVMGGKRGAVNG 805
           +   S S  + ADQL GQ+YAR   L  IV  D A SA++ VY    +    GK GA NG
Sbjct: 654 S--ESGSDVVMADQLCGQFYARLLKLADIVPHDCAISAVRTVYEACFINFNDGKFGAANG 711

Query: 806 MLPDGRVDM-SSMQSREIWSGVTYAVAASMIHEDLADIGFQTACGIYEAAWSGTGLGYAF 864
           + PDG  +   +    E+W+G+ + +AA ++   + D  ++    + +  +     G  F
Sbjct: 712 LRPDGSPENPKATHPLEVWTGINFGLAAFLMQMGMQDEAWKLTEVVVQQIYDN---GLQF 768

Query: 865 QTPEAWNTDDQYRSLCYMRPLAIWAM 890
           +TPEA      +R+  Y+R +AIW +
Sbjct: 769 RTPEAITATGTFRACTYLRAMAIWGI 794



 Score =  177 bits (450), Expect = 2e-41,   Method: Compositional matrix adjust.
 Identities = 128/387 (33%), Positives = 182/387 (47%), Gaps = 35/387 (9%)

Query: 95  HGVPLGGVGSGSIGRSYRGEFQRWQIFPRVCEDKPVLANQFSVFVSRSNGQKYSSVLCPK 154
           HG+PLGG G+G IGRS+ G+F  W I       K + A QFS+F S       SS     
Sbjct: 40  HGMPLGGFGAGCIGRSHGGDFNFWHIDGGEHVFKNIPACQFSLFESVGE----SSQAYAL 95

Query: 155 TPEVLKDTTAAGIGSWDW----NLKGDKSTYHALYPRAWTVHEGEPDPELRIVCRQISPI 210
                 D T   + SW +    + + +  TYHALYPR+W  + G    +  I C+Q SPI
Sbjct: 96  ATAAPADGT---LQSWKYYPQSSPEQNTGTYHALYPRSWFEYGGVY--QANITCQQFSPI 150

Query: 211 IPHNYKESSYPVSVFTYTIYNSGKTSADITLLFTWTNSVGGDSEFTGQHYNSKTKM-NDG 269
              NY+ESSYPV+VF + ++N       ++++ TW N VG    FT    + +  + +DG
Sbjct: 151 WADNYQESSYPVAVFLWNLHNPTNAPITLSIMLTWENMVGW---FTNTSKSPEVIVRDDG 207

Query: 270 VHAVLLHHRTSHQLPPVTFALAAQETDGVHVSLCPHFVIS---GNSLGLTAKDMWHEIKE 326
                   R           +    T GV +S     V+    G    LT+K    EI  
Sbjct: 208 SPVYEYQPRLGQSQGNFNEIVDNGNTIGVVLSGISGEVVQEGDGQWCFLTSKQAGVEIFS 267

Query: 327 HG-------------SFDRLNSMETSVTSEPGSS--IGAAIAASVTVPPDSEGQVTFSLA 371
           H              SF    S+     + P +   +  A+A  +T+ P    +V F L 
Sbjct: 268 HSRWNPTGNGEEIWQSFAADGSLSNYQDATPATDERLAGALAVRLTLQPGETLEVPFVLT 327

Query: 372 WDCPEVNFMSGKTYYRRYTKFYGTHQNAAANIARDAILEHGSWELQIEAWQRPILEDKRL 431
           WD P   F +G  Y RRYT F+G++   A  IA  A+ ++  W  +I+ WQ+PIL    L
Sbjct: 328 WDFPVTEFAAGVNYNRRYTDFFGSNGKNALQIAETALEQYRDWLEKIQTWQKPILNRHDL 387

Query: 432 PEWYPITLFNELYYLNAGGAVWTDGSP 458
           P+W  + LFNELY L AGG +W+  SP
Sbjct: 388 PDWMKMALFNELYDLTAGGTLWSAASP 414


>gi|113477064|ref|YP_723125.1| hypothetical protein Tery_3568 [Trichodesmium erythraeum IMS101]
 gi|110168112|gb|ABG52652.1| protein of unknown function DUF608 [Trichodesmium erythraeum
           IMS101]
          Length = 816

 Score =  221 bits (562), Expect = 2e-54,   Method: Compositional matrix adjust.
 Identities = 150/396 (37%), Positives = 216/396 (54%), Gaps = 44/396 (11%)

Query: 530 GQFLYLEGIEYLMWNTYDVHFYSSFALIMLFPKIQLSIQRDFAAAVMMHDPSKMKLLDDG 589
           GQF  LE ++Y  + + DV  Y SF L++L+P+++ S+   FA AV   D +   +  + 
Sbjct: 422 GQFAVLECLDYRWYESLDVRLYGSFGLLILWPELEKSVLVAFARAVSTADDTLRIIGYNQ 481

Query: 590 QWVSRKVLGAVPHDIGICD--PWFEVNAYCLYDTARWKDLNPKFVLQVYRDVVATG--DK 645
               RKV GA PHD+G  +  PW   N     D  +WKDL+  FVLQVYRD + TG  D 
Sbjct: 482 VSAVRKVAGATPHDLGAPNEHPWEMTNYTSYQDCNQWKDLSSDFVLQVYRDFLLTGADDY 541

Query: 646 KFAKAVWPSVYVAMAYMDQFDRDGDGMIENDGFPDQTYDTWSVSGISAYSGGLWVAALQA 705
           +F    W ++   +AY+  FD+D DG+ EN+G PDQT+D W + G+SAY GGLW+AAL+A
Sbjct: 542 EFLWQSWSAITETLAYLKGFDKDNDGIPENEGAPDQTFDDWQLRGVSAYCGGLWLAALEA 601

Query: 706 ASALA-------REVG-----------DRGSED-----YFLFK--FQKAKVVY-EKLWNG 739
           A A+        RE+            D+ S D      +L++   +K   +Y EKLWNG
Sbjct: 602 AIAIGKVLIEHPREIPYYPPKGFYSEVDKNSVDAINNQVYLYQGWLKKGLPIYQEKLWNG 661

Query: 740 SYFNYDNSGSSQSSSIQADQLAGQWYARACGLLPIVDEDKARSALKKVYNYNVLKVMGGK 799
            Y+  D+   S S  + ADQL+GQ+YA+   L  IV  + A SALK VYN        GK
Sbjct: 662 EYYRLDS--ESNSEVVMADQLSGQFYAKLLNLEDIVPAECALSALKTVYNSCFKNFHNGK 719

Query: 800 RGAVNGMLPDGRVDM-SSMQSREIWSGVTYAVAASMIH----EDLADIGFQTACGIYEAA 854
            GA NG+LPDG  +  ++    E+W+G+ + +AA M+     ++  +I       IYE  
Sbjct: 720 FGAANGVLPDGSPENPNATHPLEVWTGINFGLAAFMVQIGMKKEALEITEVVVGQIYEN- 778

Query: 855 WSGTGLGYAFQTPEAWNTDDQYRSLCYMRPLAIWAM 890
                 G  F+TPEA      +R+  Y+R +AIWA+
Sbjct: 779 ------GLQFRTPEAITVMGTFRASHYLRAMAIWAI 808



 Score =  201 bits (512), Expect = 1e-48,   Method: Compositional matrix adjust.
 Identities = 129/394 (32%), Positives = 196/394 (49%), Gaps = 49/394 (12%)

Query: 95  HGVPLGGVGSGSIGRSYRGEFQRWQI--FPRVCEDKPVLANQFSVFVSRSNGQKYSSVLC 152
           HG+PLGG G+G+ GRS RG+F  W +     + ++ P  A QFSVF   + GQK +  L 
Sbjct: 41  HGMPLGGFGAGATGRSPRGDFNLWHLDGGEHIYQNLP--ACQFSVF-EETKGQKQAYALS 97

Query: 153 PKTPEVLKDTTAAGIGSWDWNLK---GDKS-TYHALYPRAWTVHEGEPDPELRIVCRQIS 208
            + P      T   + +W W  +   G K+ TYHALYPR+W V+E     +L   C Q S
Sbjct: 98  TELP------TDGSLSAWQWYPREKAGLKTGTYHALYPRSWFVYENVFTAQLS--CEQFS 149

Query: 209 PIIPHNYKESSYPVSVFTYTIYNSGKTSADITLLFTWTNSVGGDSEFTGQHYNSKTKMND 268
           PI+  NY+E+SYP+++F +T +N    +  ++++ +W N+VG    FT      + ++ D
Sbjct: 150 PILAGNYQETSYPIAIFEWTAHNPTDEAITLSIMLSWQNTVGW---FTNSVKTPEVRVRD 206

Query: 269 GVHAVLLH------------------HRTSHQLPPVTFALAAQETDG-------VHVSLC 303
               V  +                  HR    +  ++ A    E +G       ++  + 
Sbjct: 207 DGSPVYEYKPRWGESTDNFNLLVEDFHRIGCTMTKLSIADEPAEGEGQMAIATFINAGME 266

Query: 304 PHFVISGNSLGLTAKDMWHEIKEHGSFDRLNSMETSVTSEPGSSIGAAIAASVTVPPDSE 363
             +    N  G T +D+WH     GS   L   E  + +  G  IG AI+   T+ P   
Sbjct: 267 VFYHTRWNPTG-TGEDIWHYFALDGS---LIDEENELPATEGEQIGVAISVRFTIRPGKN 322

Query: 364 GQVTFSLAWDCPEVNFMSGKTYYRRYTKFYGTHQNAAANIARDAILEHGSWELQIEAWQR 423
            ++ F L WD P  +F +G +YYRRYT FYG +   A ++ R A+  + +W   IEAWQ 
Sbjct: 323 RKIPFFLVWDLPVTDFGNGVSYYRRYTDFYGRNGKNAWSMIRTAMKHYQTWRENIEAWQN 382

Query: 424 PILEDKRLPEWYPITLFNELYYLNAGGAVWTDGS 457
           PIL+ + LP W  + L NELY L +GG +W   S
Sbjct: 383 PILQREDLPNWLKMALLNELYDLTSGGTIWAAAS 416


>gi|209526014|ref|ZP_03274547.1| protein of unknown function DUF608 [Arthrospira maxima CS-328]
 gi|376002240|ref|ZP_09780079.1| conserved hypothetical protein [Arthrospira sp. PCC 8005]
 gi|423067988|ref|ZP_17056778.1| hypothetical protein SPLC1_S630380 [Arthrospira platensis C1]
 gi|209493540|gb|EDZ93862.1| protein of unknown function DUF608 [Arthrospira maxima CS-328]
 gi|375329367|emb|CCE15832.1| conserved hypothetical protein [Arthrospira sp. PCC 8005]
 gi|406710526|gb|EKD05737.1| hypothetical protein SPLC1_S630380 [Arthrospira platensis C1]
          Length = 799

 Score =  220 bits (560), Expect = 3e-54,   Method: Compositional matrix adjust.
 Identities = 141/389 (36%), Positives = 210/389 (53%), Gaps = 20/389 (5%)

Query: 524 DGEENIGQFLYLEGIEYLMWNTYDVHFYSSFALIMLFPKIQLSIQRDFAAAVMMHDPSKM 583
           D ++  G+F  LE ++Y  + + DV  Y SF L+M +P+++ S+ R FA A+   D +  
Sbjct: 416 DEDQPWGRFAVLECLDYRWYESLDVRLYGSFGLLMSWPELEKSVIRTFAKAIATADHTPR 475

Query: 584 KLLDDGQWVSRKVLGAVPHDIGICD--PWFEVNAYCLYDTARWKDLNPKFVLQVYRDVVA 641
            +  +     RK  GA PHD+G  +  PW + N     D   WKDL   FVLQVYRD + 
Sbjct: 476 IIGYNQASAIRKAAGATPHDLGAPNEHPWEKTNYTSYQDCNLWKDLPCDFVLQVYRDYLL 535

Query: 642 TG--DKKFAKAVWPSVYVAMAYMDQFDRDGDGMIENDGFPDQTYDTWSVSGISAYSGGLW 699
           TG  D +F    WP++   + Y+  FDRD DG+ EN G PDQT+D W + GISAY GGLW
Sbjct: 536 TGADDIQFLVECWPAIVETLDYLKTFDRDQDGIPENGGAPDQTFDDWRLVGISAYCGGLW 595

Query: 700 VAALQAASALA-------REVGDRGSEDYFLFK--FQKAKVVYEK-LWNGSYFNYDNSGS 749
           +AAL+AA  +A       R++    ++    +    + AK +Y++ LWNGSY+N D+   
Sbjct: 596 LAALEAAIEIANILLSHNRDLTPDTTKAIATWTNWLETAKPLYDQTLWNGSYYNLDS--Q 653

Query: 750 SQSSSIQADQLAGQWYARACGLLPIVDEDKARSALKKVYNYNVLKVMGGKRGAVNGMLPD 809
           S S  + ADQL GQ+YA   GL  IV   +A+ AL+ +Y         GK GA NG+   
Sbjct: 654 SGSDVVMADQLCGQFYAALLGLPDIVPPHRAQVALQTIYQACFQNFHNGKFGAANGVRTT 713

Query: 810 GR-VDMSSMQSREIWSGVTYAVAASMIHEDLADIGFQTACGIYEAAWSGTGLGYAFQTPE 868
           G  ++ +     E+W+G+ + + A  I   + +  FQ A  +    +     G  F+TPE
Sbjct: 714 GEPMNPNDTHPLEVWTGINFGLGAFFIQMGMKEEAFQLAEAVITQVYQN---GLQFRTPE 770

Query: 869 AWNTDDQYRSLCYMRPLAIWAMQWALTRP 897
           A      +R+  Y+R +AIWAM + +  P
Sbjct: 771 AITAAGTFRASHYLRAMAIWAMYYQIQHP 799



 Score =  187 bits (476), Expect = 2e-44,   Method: Compositional matrix adjust.
 Identities = 122/393 (31%), Positives = 185/393 (47%), Gaps = 53/393 (13%)

Query: 95  HGVPLGGVGSGSIGRSYRGEFQRWQIFPRVCEDKPVLANQFSVFVSRSNGQKYSSVLCPK 154
           HG+PLGG G+G IGRS+RG+F  W +       +P+   QFSVF    +G++ +  L  +
Sbjct: 41  HGMPLGGFGAGCIGRSHRGDFNLWHLDGGEHIFRPLPGCQFSVF-EEIDGKRQAYALSTQ 99

Query: 155 TPEVLKDTTAAGIGSWDWNLKGDK----STYHALYPRAWTVHEGEPDPELRIVCRQISPI 210
            P+         + +W W    D+     TY+ALYPR+W V+E     + ++ C Q SP+
Sbjct: 100 PPQ------DGSLSAWSWYPTSDQHTQTGTYYALYPRSWFVYENVF--QTQLTCEQFSPV 151

Query: 211 IPHNYKESSYPVSVFTYTIYNSGKTSADITLLFTWTNSVGGDSEFTGQHYNSKTKMNDGV 270
              NY+E+SYP++VF +  +N       +++L TW N++G    FT    N   K+ D  
Sbjct: 152 WGGNYQETSYPIAVFEWIAHNPTDKLIILSILLTWENTIGW---FTNSLENPTVKVRDDG 208

Query: 271 HAVLLH------------------HRTSHQLPPVT-----------FALAAQETDGVHVS 301
             V  +                  HR    +  +             A+A      V V 
Sbjct: 209 SPVYEYQPRWGDSQDNLNRFVEDFHRVGCVMTRLNINDQPREGEGQIAIATISNPAVEVY 268

Query: 302 LCPHFVISGNSLGLTAKDMWHEIKEHGSFDRLNSMETSVTSEPGSSIGAAIAASVTVPPD 361
               +  +GN       D+W    + GS   +     +V    G  IGAA+A    + P 
Sbjct: 269 YHNRWNPNGNGY-----DIWEYFSKDGSLIDIQDDRPAVE---GERIGAALAVRCILRPG 320

Query: 362 SEGQVTFSLAWDCPEVNFMSGKTYYRRYTKFYGTHQNAAANIARDAILEHGSWELQIEAW 421
              ++ F +AWD P   F +G +YYR YT F+G +   A ++ R A+    +W   IEAW
Sbjct: 321 KIRKIPFFIAWDLPVCEFEAGVSYYRHYTDFFGRNGRNAWSMIRTAMKHSDTWRENIEAW 380

Query: 422 QRPILEDKRLPEWYPITLFNELYYLNAGGAVWT 454
           Q PIL  + LP W+ + LFNELY L  GG++WT
Sbjct: 381 QNPILNRQDLPPWFKMALFNELYLLTDGGSLWT 413


>gi|427733995|ref|YP_007053539.1| putative bile acid beta-glucosidase [Rivularia sp. PCC 7116]
 gi|427369036|gb|AFY52992.1| putative bile acid beta-glucosidase [Rivularia sp. PCC 7116]
          Length = 799

 Score =  218 bits (556), Expect = 1e-53,   Method: Compositional matrix adjust.
 Identities = 139/383 (36%), Positives = 206/383 (53%), Gaps = 27/383 (7%)

Query: 529 IGQFLYLEGIEYLMWNTYDVHFYSSFALIMLFPKIQLSIQRDFAAAVMMHDPSKMKLLD- 587
           IGQF  LE ++Y  + + DV  Y SF L+ LFP+++ ++ R FA A+   D  K +++  
Sbjct: 419 IGQFAVLECLDYRWYESLDVRLYGSFGLLNLFPELEKAVMRAFARAIPESD-EKTRVIGY 477

Query: 588 ------DGQWVSRKVLGAVPHDIGICDP--WFEVNAYCLYDTARWKDLNPKFVLQVYRDV 639
                 +     RK  GA PHD+G  +   W + N     D  +WKDL   FVLQVYRD 
Sbjct: 478 YYTIGAESPLAVRKAAGATPHDLGAPNEHVWEKTNYTSYQDCNQWKDLGCDFVLQVYRDF 537

Query: 640 VATG--DKKFAKAVWPSVYVAMAYMDQFDRDGDGMIENDGFPDQTYDTWSVSGISAYSGG 697
           + TG  D +F    W ++   + Y+  FD+DGDG+ EN G PDQT+D W + G+SAY GG
Sbjct: 538 IFTGSNDIEFLADCWNAIVETLNYLKAFDKDGDGIPENSGAPDQTFDDWRLLGVSAYCGG 597

Query: 698 LWVAALQAASALAREVGDR--------GSEDYFLFKFQKAKVVY-EKLWNGSYFNYDNSG 748
           LW+AAL+AA A+A  +  +          +  +    +K++ VY EKLWNG ++  D+  
Sbjct: 598 LWLAALEAAIAIAEVLSSKQVQVENLAAQKSIYEGWLEKSRPVYQEKLWNGKFYRLDS-- 655

Query: 749 SSQSSSIQADQLAGQWYARACGLLPIVDEDKARSALKKVYNYNVLKVMGGKRGAVNGMLP 808
            S S  + ADQL GQ+YAR   L  IV  +   SAL  VY+   LK   GK GA NG+  
Sbjct: 656 ESGSDVVMADQLCGQFYARLLKLPDIVPPECTMSALNSVYDACFLKFNQGKLGAANGLRL 715

Query: 809 DGRVDM-SSMQSREIWSGVTYAVAASMIHEDLADIGFQTACGIYEAAWSGTGLGYAFQTP 867
           DG  +   +    E+W+G+ + +AA ++   +    F+    +    +     G  F+TP
Sbjct: 716 DGSPENPDATHPLEVWTGINFGIAAFLVQMGMKSEAFKLTEAVVNQIYEH---GLQFRTP 772

Query: 868 EAWNTDDQYRSLCYMRPLAIWAM 890
           EA      +R+  Y+R +AIWA+
Sbjct: 773 EAITATRNFRACVYLRAMAIWAI 795



 Score =  205 bits (521), Expect = 1e-49,   Method: Compositional matrix adjust.
 Identities = 137/390 (35%), Positives = 197/390 (50%), Gaps = 42/390 (10%)

Query: 95  HGVPLGGVGSGSIGRSYRGEFQRWQIFPRVCEDKPVLANQFSVFVSRSNGQKYSSVLCPK 154
            G+PLGG G+G IGRS RG+F  W I         + A QFS+F S SN     +V    
Sbjct: 40  QGMPLGGFGAGCIGRSSRGDFNLWHIDGGEHTFGNIPACQFSLFESSSN----RNVAYAL 95

Query: 155 TPEVLKDTTAAGIGSWDW---NLKGDKS-TYHALYPRAWTVHEGEPDPELRIVCRQISPI 210
           + E   D+T   + +W+W     KGD S TYHALYPR+W V+E     + R+ C Q SPI
Sbjct: 96  STEPSNDST---LSAWEWYPSRSKGDSSGTYHALYPRSWFVYENVF--QSRLSCEQFSPI 150

Query: 211 IPHNYKESSYPVSVFTYTIYNSGKTSADITLLFTWTNSVGGDSEFTGQHYNSKTKMNDGV 270
              NY+ESSYPV++F +   N       ++++ TW N VG    FT    + + +M D  
Sbjct: 151 WADNYQESSYPVAIFLWNASNPTSEPVTLSIMLTWQNMVGW---FTNTLKSPEVEMRDDG 207

Query: 271 HAVLLHH--------------RTSHQLPPVTFALAAQETDGVHVSLC------PHFVISG 310
             V  +                 S  +  V   +   E      S C      P   I  
Sbjct: 208 SPVYDYQPNLGESKGNFNRLVTNSEYIGCVLDKIGTAEPIEGEGSWCIATAKHPKVEIFH 267

Query: 311 NSL---GLTAKDMWHEIKEHGSFDRLNSMETSVTSEPGSSIGAAIAASVTVPPDSEGQVT 367
           ++      T +D+W    + GS   ++    ++++E    +GAAIA   T+ P    Q+ 
Sbjct: 268 HNRWNPTSTGEDIWRSFAKTGSLANISDETPAISNE---QVGAAIAVRFTLQPGESLQIP 324

Query: 368 FSLAWDCPEVNFMSGKTYYRRYTKFYGTHQNAAANIARDAILEHGSWELQIEAWQRPILE 427
           F++AWD P   F+SG TY RRYT F+G +   A NIA  A+ E+ +W  Q++AWQ+PI+E
Sbjct: 325 FAVAWDFPITEFVSGVTYKRRYTDFFGDNGKNAWNIATTALEEYQNWLSQVQAWQQPIVE 384

Query: 428 DKRLPEWYPITLFNELYYLNAGGAVWTDGS 457
              LP+W+ + LFNELY L AGG +W+  S
Sbjct: 385 RPDLPDWFKMALFNELYDLTAGGTLWSAAS 414


>gi|157119417|ref|XP_001653371.1| bile acid beta-glucosidase, putative [Aedes aegypti]
 gi|108883154|gb|EAT47379.1| AAEL001478-PA, partial [Aedes aegypti]
          Length = 898

 Score =  217 bits (552), Expect = 3e-53,   Method: Compositional matrix adjust.
 Identities = 109/260 (41%), Positives = 163/260 (62%), Gaps = 11/260 (4%)

Query: 647 FAKAVWPSVYVAMAYMDQFDRDGDGMIENDGFPDQTYDTWSVSGISAYSGGLWVAALQAA 706
           + K+++P+    +    ++D+D DG+IEN   PDQTYDTW + G SAY GGLW+A+L   
Sbjct: 638 YLKSMYPACKQVLEKTLEWDKDNDGIIENSKCPDQTYDTWVMDGPSAYCGGLWLASLHCM 697

Query: 707 SALAREVGDRGSEDYFLFKFQKAKVVY-EKLWNGSYFNYDNSGSSQSSSIQADQLAGQWY 765
           +A+A  +      + F    +K K  + EKLWNG+Y+ +D   +S+ ++I +DQL G WY
Sbjct: 698 TAMANLLDQNDDCNKFREILEKGKTSFEEKLWNGTYYKFDGQSASK-NTIMSDQLCGHWY 756

Query: 766 ARACGL-LPIVDEDKARSALKKVYNYNVLKVMGGKRGAVNGMLP------DGRVDMSSMQ 818
            R CG    I  ++  R+ALK +Y+ NV++  GG  GAVNG +P      DGR D S++Q
Sbjct: 757 LRCCGFDYDIFPKENVRTALKTIYDNNVMRFCGGNMGAVNGYVPSTQPNKDGRADASTVQ 816

Query: 819 SREIWSGVTYAVAASMIHEDLADIGFQTACGIYEAAWSGTGLGYAFQTPEAWNTDDQYRS 878
             E+W+GVTYA+A++MIHE +    FQTA G+Y+       +G  F+TPEA   +  YR+
Sbjct: 817 GEEVWTGVTYALASTMIHEGMFTEAFQTAGGMYQTL--SEKIGMNFETPEALYAERHYRA 874

Query: 879 LCYMRPLAIWAMQWALTRPK 898
           + YMRPL+IW++Q A  + K
Sbjct: 875 IGYMRPLSIWSLQTAWEQKK 894



 Score =  170 bits (431), Expect = 3e-39,   Method: Compositional matrix adjust.
 Identities = 118/364 (32%), Positives = 187/364 (51%), Gaps = 33/364 (9%)

Query: 111 YRGEFQRWQIFPRVCEDKPVLANQFSVFVSRSNGQK-YSSVL------CPKTPEVLKDTT 163
           + GEF R+Q+ P + E   V ANQF V +   N    + S+L      C    +   D  
Sbjct: 87  FAGEFCRFQLKPGLYEYNTVHANQFIVTIKDENNVTIFQSLLSSYRYVCTWCNDRYSDLF 146

Query: 164 AAGIG-------------SWDWNLKGDKSTYHALYPRAWTVHEGEPDPELRIVCRQISPI 210
           A  +G             SW+  +   K  Y ALYPRAW+ ++   D  +++V RQISPI
Sbjct: 147 ARKLGLVPIEIRPKNPLSSWETYINSSKCNYTALYPRAWSEYDLS-DYGIKLVQRQISPI 205

Query: 211 IPHNYKESSYPVSVFTYTIYNSGKTSADITLLFTWTNSVGGDSEFTGQHYNSKTKMNDGV 270
           IPH+YKESS P +VF +T+ N       +T+ FT+ N  G   +      NS+T      
Sbjct: 206 IPHDYKESSLPAAVFVWTVENVCGKDRQVTITFTFKNGTGNKKQ--DAEGNSETAEFSHG 263

Query: 271 HAVLLHHRTSHQLPPVTFALAAQETDGVHVSLCPHFVISGNSLGLTAKDMWHEIKEHGSF 330
            A  +  + S      T+ ++ + +  ++++ C  F  +GN      + +W+++KE+G  
Sbjct: 264 SAKGVSIKQSIVDMQSTYCISCKTSSEINLTRCKKFDPTGN-----GEKLWNDLKENGQL 318

Query: 331 DRLNSMETSVTSEPGSSIGAAIAASVTVPPDSEGQVTFSLAWDCPEVNF-MSGKTYYRRY 389
             + S+E ++ S+    +  A++A + V       + FSL WD P++ F    K YY+ Y
Sbjct: 319 TEV-SVEENLKSK---DVAVAVSAQILVQSTGSSDLEFSLVWDMPKIRFGKKTKEYYKYY 374

Query: 390 TKFYGTHQNAAANIARDAILEHGSWELQIEAWQRPILEDKRLPEWYPITLFNELYYLNAG 449
           TK++G   +A  +I+  A+   G WE  I  WQRPIL+D  LP+WY   +FNELY++  G
Sbjct: 375 TKYFGKSGDAGPSISDYALQNFGKWETLINEWQRPILDDNDLPDWYKSAIFNELYFIADG 434

Query: 450 GAVW 453
           G+VW
Sbjct: 435 GSVW 438



 Score =  115 bits (287), Expect = 2e-22,   Method: Compositional matrix adjust.
 Identities = 58/137 (42%), Positives = 85/137 (62%), Gaps = 2/137 (1%)

Query: 530 GQFLYLEGIEYLMWNTYDVHFYSSFALIMLFPKIQLSIQRDFAAAVMMHDPSKMKLLDDG 589
           G+F YLEG EY M+NTYDVHFY+S AL  L+P +Q+S+Q D+   +        K L DG
Sbjct: 456 GRFAYLEGHEYRMYNTYDVHFYASHALASLWPNLQVSLQYDYKDTINREISEGRKHLYDG 515

Query: 590 QWVSRKVLGAVPHDIGICD--PWFEVNAYCLYDTARWKDLNPKFVLQVYRDVVATGDKKF 647
           + + RK+  +VPHD+G  D  P+  +NAY ++D + W+DLN KF+LQVYRD         
Sbjct: 516 KIIPRKIKNSVPHDLGDPDEEPFDLINAYPIHDVSEWRDLNTKFILQVYRDYYTLNRYAQ 575

Query: 648 AKAVWPSVYVAMAYMDQ 664
             A   S + ++ ++D+
Sbjct: 576 QNAENASKFSSIEFIDK 592


>gi|188570649|gb|ACD64360.1| hypothetical protein [Helianthus petiolaris]
          Length = 210

 Score =  217 bits (552), Expect = 3e-53,   Method: Compositional matrix adjust.
 Identities = 127/209 (60%), Positives = 157/209 (75%), Gaps = 1/209 (0%)

Query: 690 GISAYSGGLWVAALQAASALAREVGDRGSEDYFLFKFQKAKVVYE-KLWNGSYFNYDNSG 748
           GISAY G LW+AALQAA+A+A E+ D+ +   +  KF KAK  +E KLWNGSYFNYD+  
Sbjct: 2   GISAYCGCLWLAALQAAAAMAVELNDKFAAQKYKKKFLKAKSSFESKLWNGSYFNYDSGS 61

Query: 749 SSQSSSIQADQLAGQWYARACGLLPIVDEDKARSALKKVYNYNVLKVMGGKRGAVNGMLP 808
           SS S SIQADQLAGQWY  + GL  + D+DK +S+L+K++++NV+KV GG+ GAVNGM P
Sbjct: 62  SSNSKSIQADQLAGQWYTASAGLPNLFDDDKIQSSLQKIFDFNVMKVRGGRMGAVNGMHP 121

Query: 809 DGRVDMSSMQSREIWSGVTYAVAASMIHEDLADIGFQTACGIYEAAWSGTGLGYAFQTPE 868
           +G+VD + MQSRE+W+GVTY VAA+MIH  + D  F TA GI+ A WS  G GYAFQTPE
Sbjct: 122 NGKVDETCMQSREVWTGVTYGVAATMIHAGMEDNAFTTAEGIFIAGWSEEGFGYAFQTPE 181

Query: 869 AWNTDDQYRSLCYMRPLAIWAMQWALTRP 897
            W  D  YRSL YMRPLAIW MQ AL +P
Sbjct: 182 GWTMDGSYRSLVYMRPLAIWGMQQALEKP 210


>gi|409991122|ref|ZP_11274412.1| hypothetical protein APPUASWS_08895 [Arthrospira platensis str.
           Paraca]
 gi|291568810|dbj|BAI91082.1| hypothetical protein [Arthrospira platensis NIES-39]
 gi|409938009|gb|EKN79383.1| hypothetical protein APPUASWS_08895 [Arthrospira platensis str.
           Paraca]
          Length = 799

 Score =  216 bits (551), Expect = 3e-53,   Method: Compositional matrix adjust.
 Identities = 143/389 (36%), Positives = 208/389 (53%), Gaps = 20/389 (5%)

Query: 524 DGEENIGQFLYLEGIEYLMWNTYDVHFYSSFALIMLFPKIQLSIQRDFAAAVMMHDPSKM 583
           D +E  G+F  LE ++Y  + + DV  Y SF L+M +P+++ S+ R FA A+   D +  
Sbjct: 416 DEDEPWGRFAVLECLDYRWYESLDVRLYGSFGLLMWWPELEKSVMRAFAKAIATDDDTPR 475

Query: 584 KLLDDGQWVSRKVLGAVPHDIGICD--PWFEVNAYCLYDTARWKDLNPKFVLQVYRDVVA 641
            +  +     RK  GA PHD+G  +  PW + N     D   WKDL   FVLQVYRD + 
Sbjct: 476 IIGYNQASAIRKAAGATPHDLGAPNEHPWEKTNYTSYQDCNLWKDLPCDFVLQVYRDYLL 535

Query: 642 TG--DKKFAKAVWPSVYVAMAYMDQFDRDGDGMIENDGFPDQTYDTWSVSGISAYSGGLW 699
           TG  D +F    WP++   + Y+  FDRD D + EN G PDQT+D W + GISAY GGLW
Sbjct: 536 TGADDIQFLVECWPAIVQTLDYLKTFDRDRDCIPENGGAPDQTFDDWRMMGISAYCGGLW 595

Query: 700 VAALQAASALAREVGDRGSE---------DYFLFKFQKAKVVYEK-LWNGSYFNYDNSGS 749
           +AAL+AA A+A  +    S+           +    + AK +Y++ LWNGSY+N D+   
Sbjct: 596 LAALEAAIAIANILLSHHSDITPDTTKAIATWTNWLETAKPLYDQTLWNGSYYNLDS--Q 653

Query: 750 SQSSSIQADQLAGQWYARACGLLPIVDEDKARSALKKVYNYNVLKVMGGKRGAVNGMLPD 809
           S S  + ADQL GQ+YA   GL  IV   +AR AL+ +Y         GK GA NG+   
Sbjct: 654 SGSDVVMADQLCGQFYAALLGLPDIVPHHRARVALQTIYQACFQNFHNGKFGAANGVRTT 713

Query: 810 GR-VDMSSMQSREIWSGVTYAVAASMIHEDLADIGFQTACGIYEAAWSGTGLGYAFQTPE 868
           G  ++       E+W+G+ + + A +I   + +  FQ A  +    +     G  F+TPE
Sbjct: 714 GEPINPQDTHPLEVWTGINFGLGAFLIQMGMKEEAFQLAEAVITQVYQN---GLQFRTPE 770

Query: 869 AWNTDDQYRSLCYMRPLAIWAMQWALTRP 897
           A      +R+  Y+R +AIWAM + +  P
Sbjct: 771 AITAAGTFRASHYLRAMAIWAMYYQIQHP 799



 Score =  194 bits (493), Expect = 2e-46,   Method: Compositional matrix adjust.
 Identities = 128/393 (32%), Positives = 186/393 (47%), Gaps = 53/393 (13%)

Query: 95  HGVPLGGVGSGSIGRSYRGEFQRWQIFPRVCEDKPVLANQFSVFVSRSNGQKYSSVLCPK 154
           HG+PLGG G+G IGRS+RG+F  W +       +P+   QFSVF    +G++ +  L  +
Sbjct: 41  HGMPLGGFGAGCIGRSHRGDFNLWHLDGGEHIFRPLPGCQFSVF-EEIDGKRQAYALSTQ 99

Query: 155 TPEVLKDTTAAGIGSWDWNLKGDK----STYHALYPRAWTVHEGEPDPELRIVCRQISPI 210
            PE         + +W+W    D+     TYHALYPR+W V+E     + ++ C Q SP+
Sbjct: 100 PPE------DGSLSTWNWYPSSDQHTQTGTYHALYPRSWFVYENVF--QTQLTCEQFSPV 151

Query: 211 IPHNYKESSYPVSVFTYTIYNSGKTSADITLLFTWTNSVGGDSEFTGQHYNSKTKMNDG- 269
              NY+E+SYPV+VF +  +N       +++L TW N++G    FT +      K+ D  
Sbjct: 152 WGGNYQETSYPVAVFEWIAHNPTDKLIILSILLTWENTIGW---FTNRLDTPAVKVRDDG 208

Query: 270 -----------------VHAVLLHHRTSHQLPPVTFALAAQETDG-----------VHVS 301
                             H V   HR    +  +       E DG           V V 
Sbjct: 209 SPVYEYQPRWGDSQDNCNHFVEDFHRVGCVMTRLNINDQLGEGDGQMAIATITNPVVEVY 268

Query: 302 LCPHFVISGNSLGLTAKDMWHEIKEHGSFDRLNSMETSVTSEPGSSIGAAIAASVTVPPD 361
               +  SGN       D+W    + GS   L  +     +  G  IGAA+A    + P 
Sbjct: 269 CHNRWRPSGNGY-----DIWEYFSQDGS---LIDIHDDGPALEGERIGAALAVRCILRPG 320

Query: 362 SEGQVTFSLAWDCPEVNFMSGKTYYRRYTKFYGTHQNAAANIARDAILEHGSWELQIEAW 421
              ++ F +AWD P   F +G TYYR YT F+G +   A ++ R A+    +W   IEAW
Sbjct: 321 KTRKIPFFIAWDLPVTEFEAGVTYYRHYTDFFGRNGRNAWSMIRTAMKHSDTWRENIEAW 380

Query: 422 QRPILEDKRLPEWYPITLFNELYYLNAGGAVWT 454
           Q PIL  + LP W+ + LFNELY L  GG++WT
Sbjct: 381 QNPILNRQDLPPWFKMALFNELYLLTDGGSLWT 413


>gi|357630375|gb|EHJ78536.1| hypothetical protein KGM_16417 [Danaus plexippus]
          Length = 912

 Score =  216 bits (551), Expect = 4e-53,   Method: Compositional matrix adjust.
 Identities = 123/297 (41%), Positives = 174/297 (58%), Gaps = 10/297 (3%)

Query: 625 KDLNPKFVLQVYRDVVATGDKKFAKAVWPSVYVAMAYMDQFDRDGDGMIENDGFPDQTYD 684
           +D   K   Q Y  VV    K++   ++PS    +     +DRDGDG+IEN GFPDQTYD
Sbjct: 621 EDCGEKLQPQEYAAVVWNS-KQYLSDMYPSCVTLLRRGLDWDRDGDGLIENGGFPDQTYD 679

Query: 685 TWSVSGISAYSGGLWVAALQAASALAREVGDRGSEDYFLFKFQKAKVVYE-KLWNGSYFN 743
            W ++G SAY GGLWVA++ A  A+A+ +G    E  F    +KA+  YE KLWNGSY+ 
Sbjct: 680 AWVMTGPSAYCGGLWVASVSAVHAMAKILGFTDDEKEFSTLLEKARDSYERKLWNGSYYK 739

Query: 744 YDNSGSSQSSSIQADQLAGQWYARACGLL-PIVDEDKARSALKKVYNYNVLKVMGGKRGA 802
           +D +    S  + ADQLAGQW+ RA G   P+  E   + AL  +Y  NV + + G+ GA
Sbjct: 740 FD-TKPCNSEVVMADQLAGQWFLRASGWTEPVFPEANVKKALHTIYENNVQRFLNGRMGA 798

Query: 803 VNGML--PDGRVDMSSMQSREIWSGVTYAVAASMIHEDLADIGFQTACGIYEAAWSGTGL 860
           VNG +  P   +D +++QS E+W+GVTY +AA MI+E + +  F TA G+Y        +
Sbjct: 799 VNGFVRGPRPGIDTTAIQSEEVWTGVTYGLAALMIYEGMHEQAFSTAGGLYNTL---MKM 855

Query: 861 GYAFQTPEAWNTDDQYRSLCYMRPLAIWAMQWA-LTRPKPKTLEKQMKPEVTDESLL 916
           G AF+TPEA   +  +RS+ YMRPL+IW+M  A +T+P         +P    E+ L
Sbjct: 856 GLAFETPEALYENGNHRSVAYMRPLSIWSMYHAIITKPPQHVSTSNGQPHQAKENHL 912



 Score =  203 bits (516), Expect = 4e-49,   Method: Compositional matrix adjust.
 Identities = 162/585 (27%), Positives = 258/585 (44%), Gaps = 108/585 (18%)

Query: 111 YRGEFQRWQIFPRVCEDKPVLANQFSVFVSRSNGQK-YSSVLC--PKTPEVLKDTTAAGI 167
           ++GEF R+Q++P + E   V   QF V +  +  +  + SVL    K  +VL        
Sbjct: 51  FKGEFCRFQLYPGLYEYVTVPECQFIVNIRDAKKETIFQSVLSTYSKPKKVLP------- 103

Query: 168 GSWDWNLKGDKSTYHALYPRAWTVHEGEPDPELRIVCRQISPIIPHNYKESSYPVSVFTY 227
            SW+WN+KG    Y ALYPRAWT ++      ++++CRQISP+IPHNYK+SS P +VF +
Sbjct: 104 -SWEWNIKGADCEYTALYPRAWTTYD-LTKYGIKLICRQISPVIPHNYKDSSLPCAVFVF 161

Query: 228 TIYNSGKTSADITLLFTWTNSVGGDSEFTGQHYNSKTKMNDGVHAVLLHHRTSHQLPPVT 287
           +  N      D+++ FTWT  +G   +  G   + +    +    V L  + +    P T
Sbjct: 162 SAKNISNEQRDVSITFTWTECLGESKKKGGCKLDLERYSAENTCGVTLEQKIADT--PCT 219

Query: 288 FALAAQETDG--VHVSLCPHFVISGNSLGLTAKDMWHEIKEHGSF--------------- 330
           F +A  + D   +H   C       NS   T+  +W  +K+HG                 
Sbjct: 220 FTIAKNKPDNETIHEGYC-----LWNSTK-TSGYVWECLKKHGRLEPDPSQTPPPPKVDK 273

Query: 331 -DRLNSMETSVTSEPGSSIGAAIAASVTVPPDSEGQVTFSLAWDCPEVNFMS-GKTYYRR 388
            D+ N  +     +   +   A+++++++ P   G   F L WD P + +    K + R 
Sbjct: 274 TDKDNKEDKEKAKKIYKNDSIALSSTISLDPRGSGSTEFCLVWDMPVIKYKKDSKVHKRY 333

Query: 389 YTKFYGTHQNAAANIARDAILEHGSWELQIEAWQRPILEDKRLPEWYPITLFNELYYLNA 448
           YTK++G+   A A+IA  A+  +  WE Q+  WQ PIL +  +P+W    L NELY++  
Sbjct: 334 YTKYFGSDGIAGASIASYALRSYRRWEKQLAEWQDPILNNSCIPDWLKSALMNELYFVAD 393

Query: 449 GGAVWTDGSPPVHSLVTIGHRKFSLDWSQSDLKRIVDVPNQNDTAVNILERMSSILEQIY 508
           GG +W D                + D+  +D       P         LE     +   Y
Sbjct: 394 GGTIWFD---------------VAEDYPDTD-------PRHEFGVFGYLEGHEYRMYNTY 431

Query: 509 TPVALNSSFGTNLLQDGEENIGQFLYLEGIEYLMWNTYDVHFYSSFALIMLFPKIQLSIQ 568
             V   +SF    L    + + Q+++ E I              S  +  L P       
Sbjct: 432 D-VHFYASFALAQLWPNLQVVLQYMFRESI--------------SVEINKLRPS------ 470

Query: 569 RDFAAAVMMHDPSKMKLLDDGQWVSRKVLGAVPHDIGICD--PWFEVNAYCLYDTARWKD 626
                   ++D    K          K   ++PHD+G  D  P+  +NAY ++D + W+D
Sbjct: 471 --------LYDGKSCKF---------KTKDSIPHDLGEPDGVPFSHINAYNIHDVSEWRD 513

Query: 627 LNPKFVLQVYRD--VVATGDKKFAKAVWPSVYVAMAYMDQFDRDG 669
           LN KF+LQV RD  ++   +  F        Y +MAY+D   RDG
Sbjct: 514 LNLKFILQVMRDYRLLRGHNAPFNN----ERYHSMAYIDDI-RDG 553


>gi|188570531|gb|ACD64301.1| hypothetical protein [Helianthus annuus]
          Length = 210

 Score =  216 bits (550), Expect = 5e-53,   Method: Compositional matrix adjust.
 Identities = 127/209 (60%), Positives = 156/209 (74%), Gaps = 1/209 (0%)

Query: 690 GISAYSGGLWVAALQAASALAREVGDRGSEDYFLFKFQKAKVVYE-KLWNGSYFNYDNSG 748
           GISAY G LW+AALQAA+A+A E+ D+ +   +  KF KAK  +E KLWNGSYFNYD+  
Sbjct: 2   GISAYCGCLWLAALQAAAAMAVELNDKFAAQKYKKKFLKAKSSFESKLWNGSYFNYDSGS 61

Query: 749 SSQSSSIQADQLAGQWYARACGLLPIVDEDKARSALKKVYNYNVLKVMGGKRGAVNGMLP 808
           SS S SIQADQLAGQWY  + GL  +  +DK +S+L+K++N+NV+KV GG+ GAVNGM P
Sbjct: 62  SSNSKSIQADQLAGQWYTASAGLPNLFGDDKIQSSLQKIFNFNVMKVRGGRMGAVNGMHP 121

Query: 809 DGRVDMSSMQSREIWSGVTYAVAASMIHEDLADIGFQTACGIYEAAWSGTGLGYAFQTPE 868
           +G+VD + MQSRE+W+GVTY VAA+MIH  + D  F TA GI+ A WS  G GYAFQTPE
Sbjct: 122 NGKVDETCMQSREVWTGVTYGVAATMIHAGMEDNAFTTAEGIFIAGWSEEGFGYAFQTPE 181

Query: 869 AWNTDDQYRSLCYMRPLAIWAMQWALTRP 897
            W  D  YRSL YMRPLAIW MQ AL +P
Sbjct: 182 GWTMDGSYRSLVYMRPLAIWGMQQALEKP 210


>gi|188570671|gb|ACD64371.1| hypothetical protein [Bahiopsis reticulata]
 gi|188570673|gb|ACD64372.1| hypothetical protein [Bahiopsis reticulata]
          Length = 210

 Score =  216 bits (549), Expect = 7e-53,   Method: Compositional matrix adjust.
 Identities = 127/209 (60%), Positives = 155/209 (74%), Gaps = 1/209 (0%)

Query: 690 GISAYSGGLWVAALQAASALAREVGDRGSEDYFLFKFQKAKVVYE-KLWNGSYFNYDNSG 748
           GISAY G LW+AALQAA+A+A E+ D+ S   +  KF KAK  +E KLWNGSYFNYD+  
Sbjct: 2   GISAYCGCLWLAALQAAAAMAVELNDKFSAQKYKKKFMKAKTSFESKLWNGSYFNYDSGS 61

Query: 749 SSQSSSIQADQLAGQWYARACGLLPIVDEDKARSALKKVYNYNVLKVMGGKRGAVNGMLP 808
           SS S SIQADQLAGQWY  + GL  +  +DK +S+L+K++++NV+KV GG+ GAVNGM P
Sbjct: 62  SSNSKSIQADQLAGQWYTASAGLPNLFGDDKIQSSLQKIFDFNVMKVRGGRMGAVNGMHP 121

Query: 809 DGRVDMSSMQSREIWSGVTYAVAASMIHEDLADIGFQTACGIYEAAWSGTGLGYAFQTPE 868
           +G+VD + MQSRE+W+GVTY VAA+MIH  + D  F TA GI+ A WS  G GYAFQTPE
Sbjct: 122 NGKVDETCMQSREVWTGVTYGVAATMIHAGMEDSAFTTAEGIFTAGWSEEGFGYAFQTPE 181

Query: 869 AWNTDDQYRSLCYMRPLAIWAMQWALTRP 897
            W  D  YRSL YMRPLAIW MQ AL  P
Sbjct: 182 GWTMDGSYRSLVYMRPLAIWGMQQALEAP 210


>gi|292384326|gb|ADE21416.1| unknown [Helianthus neglectus]
          Length = 210

 Score =  216 bits (549), Expect = 7e-53,   Method: Compositional matrix adjust.
 Identities = 127/209 (60%), Positives = 156/209 (74%), Gaps = 1/209 (0%)

Query: 690 GISAYSGGLWVAALQAASALAREVGDRGSEDYFLFKFQKAKVVYE-KLWNGSYFNYDNSG 748
           GISAY G LW+AALQAA+A+A E+ D+ +   +  KF KAK  +E KLWNGSYFNYD+  
Sbjct: 2   GISAYCGCLWLAALQAAAAMAVELNDKFAAQKYKKKFLKAKSSFESKLWNGSYFNYDSGS 61

Query: 749 SSQSSSIQADQLAGQWYARACGLLPIVDEDKARSALKKVYNYNVLKVMGGKRGAVNGMLP 808
           SS S SIQADQLAGQWY  + GL  +  EDK +S+L+K++++NV+KV GG+ GAVNGM P
Sbjct: 62  SSNSKSIQADQLAGQWYTASAGLPNLFGEDKIQSSLQKIFDFNVMKVRGGRMGAVNGMHP 121

Query: 809 DGRVDMSSMQSREIWSGVTYAVAASMIHEDLADIGFQTACGIYEAAWSGTGLGYAFQTPE 868
           +G+VD + MQSRE+W+GVTY VAA+MIH  + D  F TA GI+ A WS  G GYAFQTPE
Sbjct: 122 NGKVDETCMQSREVWTGVTYGVAATMIHAGMEDNAFTTAEGIFIAGWSEEGFGYAFQTPE 181

Query: 869 AWNTDDQYRSLCYMRPLAIWAMQWALTRP 897
            W  D  YRSL YMRPLAIW MQ AL +P
Sbjct: 182 GWTMDGSYRSLVYMRPLAIWGMQQALEKP 210


>gi|188570621|gb|ACD64346.1| hypothetical protein [Helianthus petiolaris]
 gi|188570623|gb|ACD64347.1| hypothetical protein [Helianthus petiolaris]
          Length = 210

 Score =  215 bits (548), Expect = 9e-53,   Method: Compositional matrix adjust.
 Identities = 126/209 (60%), Positives = 157/209 (75%), Gaps = 1/209 (0%)

Query: 690 GISAYSGGLWVAALQAASALAREVGDRGSEDYFLFKFQKAKVVYE-KLWNGSYFNYDNSG 748
           GISAY G LW+AALQAA+A+A E+ D+ +   +  KF KAK  +E KLWNGSYFNYD+  
Sbjct: 2   GISAYCGCLWLAALQAAAAMAVELNDKFAAQKYKKKFLKAKSSFESKLWNGSYFNYDSGS 61

Query: 749 SSQSSSIQADQLAGQWYARACGLLPIVDEDKARSALKKVYNYNVLKVMGGKRGAVNGMLP 808
           SS S SIQADQLAGQWY  + GL  +  +DK +S+L+K++++NV+KV GG+ GAVNGM P
Sbjct: 62  SSNSKSIQADQLAGQWYTASAGLPNLFGDDKIQSSLQKIFDFNVMKVRGGRMGAVNGMHP 121

Query: 809 DGRVDMSSMQSREIWSGVTYAVAASMIHEDLADIGFQTACGIYEAAWSGTGLGYAFQTPE 868
           +G+VD + MQSRE+W+GVTY VAA+MIH ++ D  F TA GI+ A WS  G GYAFQTPE
Sbjct: 122 NGKVDETCMQSREVWTGVTYGVAATMIHAEMEDNAFTTAEGIFIAGWSEEGFGYAFQTPE 181

Query: 869 AWNTDDQYRSLCYMRPLAIWAMQWALTRP 897
            W  D  YRSL YMRPLAIW MQ AL +P
Sbjct: 182 GWTMDGSYRSLVYMRPLAIWGMQQALEKP 210


>gi|188570553|gb|ACD64312.1| hypothetical protein [Helianthus annuus]
          Length = 210

 Score =  214 bits (545), Expect = 2e-52,   Method: Compositional matrix adjust.
 Identities = 126/209 (60%), Positives = 156/209 (74%), Gaps = 1/209 (0%)

Query: 690 GISAYSGGLWVAALQAASALAREVGDRGSEDYFLFKFQKAKVVYE-KLWNGSYFNYDNSG 748
           GISAY G LW+AALQAA+A+A E+ D+ +   +  KF KAK  +E KLWNGSYFNYD+  
Sbjct: 2   GISAYCGCLWLAALQAATAMAVELNDKFAAQKYKKKFLKAKSSFESKLWNGSYFNYDSGS 61

Query: 749 SSQSSSIQADQLAGQWYARACGLLPIVDEDKARSALKKVYNYNVLKVMGGKRGAVNGMLP 808
           SS S SIQADQLAGQWY  + GL  +  +DK +S+L+K++++NV+KV GG+ GAVNGM P
Sbjct: 62  SSNSKSIQADQLAGQWYTASAGLPNLFGDDKIQSSLQKIFDFNVMKVRGGRMGAVNGMHP 121

Query: 809 DGRVDMSSMQSREIWSGVTYAVAASMIHEDLADIGFQTACGIYEAAWSGTGLGYAFQTPE 868
           +G+VD + MQSRE+W+GVTY VAA+MIH  + D  F TA GI+ A WS  G GYAFQTPE
Sbjct: 122 NGKVDETCMQSREVWTGVTYGVAATMIHAGMEDNAFTTAEGIFIAGWSEEGFGYAFQTPE 181

Query: 869 AWNTDDQYRSLCYMRPLAIWAMQWALTRP 897
            W  D  YRSL YMRPLAIW MQ AL +P
Sbjct: 182 GWTMDGSYRSLVYMRPLAIWGMQQALEKP 210


>gi|188570503|gb|ACD64287.1| hypothetical protein [Helianthus annuus]
 gi|188570505|gb|ACD64288.1| hypothetical protein [Helianthus annuus]
 gi|188570507|gb|ACD64289.1| hypothetical protein [Helianthus annuus]
 gi|188570509|gb|ACD64290.1| hypothetical protein [Helianthus annuus]
 gi|188570513|gb|ACD64292.1| hypothetical protein [Helianthus annuus]
 gi|188570515|gb|ACD64293.1| hypothetical protein [Helianthus annuus]
 gi|188570517|gb|ACD64294.1| hypothetical protein [Helianthus annuus]
 gi|188570519|gb|ACD64295.1| hypothetical protein [Helianthus annuus]
 gi|188570521|gb|ACD64296.1| hypothetical protein [Helianthus annuus]
 gi|188570523|gb|ACD64297.1| hypothetical protein [Helianthus annuus]
 gi|188570525|gb|ACD64298.1| hypothetical protein [Helianthus annuus]
 gi|188570527|gb|ACD64299.1| hypothetical protein [Helianthus annuus]
 gi|188570529|gb|ACD64300.1| hypothetical protein [Helianthus annuus]
 gi|188570533|gb|ACD64302.1| hypothetical protein [Helianthus annuus]
 gi|188570535|gb|ACD64303.1| hypothetical protein [Helianthus annuus]
 gi|188570537|gb|ACD64304.1| hypothetical protein [Helianthus annuus]
 gi|188570539|gb|ACD64305.1| hypothetical protein [Helianthus annuus]
 gi|188570541|gb|ACD64306.1| hypothetical protein [Helianthus annuus]
 gi|188570543|gb|ACD64307.1| hypothetical protein [Helianthus annuus]
 gi|188570545|gb|ACD64308.1| hypothetical protein [Helianthus annuus]
 gi|188570547|gb|ACD64309.1| hypothetical protein [Helianthus annuus]
 gi|188570549|gb|ACD64310.1| hypothetical protein [Helianthus annuus]
 gi|188570551|gb|ACD64311.1| hypothetical protein [Helianthus annuus]
 gi|188570555|gb|ACD64313.1| hypothetical protein [Helianthus annuus]
 gi|188570557|gb|ACD64314.1| hypothetical protein [Helianthus annuus]
 gi|188570559|gb|ACD64315.1| hypothetical protein [Helianthus annuus]
 gi|188570561|gb|ACD64316.1| hypothetical protein [Helianthus annuus]
 gi|188570563|gb|ACD64317.1| hypothetical protein [Helianthus annuus]
 gi|188570565|gb|ACD64318.1| hypothetical protein [Helianthus annuus]
 gi|188570567|gb|ACD64319.1| hypothetical protein [Helianthus annuus]
 gi|188570569|gb|ACD64320.1| hypothetical protein [Helianthus annuus]
 gi|188570571|gb|ACD64321.1| hypothetical protein [Helianthus annuus]
 gi|188570573|gb|ACD64322.1| hypothetical protein [Helianthus annuus]
 gi|188570575|gb|ACD64323.1| hypothetical protein [Helianthus annuus]
 gi|188570577|gb|ACD64324.1| hypothetical protein [Helianthus annuus]
 gi|188570579|gb|ACD64325.1| hypothetical protein [Helianthus annuus]
 gi|188570581|gb|ACD64326.1| hypothetical protein [Helianthus annuus]
 gi|188570583|gb|ACD64327.1| hypothetical protein [Helianthus annuus]
 gi|188570585|gb|ACD64328.1| hypothetical protein [Helianthus annuus]
 gi|188570587|gb|ACD64329.1| hypothetical protein [Helianthus annuus]
 gi|188570589|gb|ACD64330.1| hypothetical protein [Helianthus annuus]
 gi|188570591|gb|ACD64331.1| hypothetical protein [Helianthus annuus]
 gi|188570593|gb|ACD64332.1| hypothetical protein [Helianthus annuus]
 gi|188570595|gb|ACD64333.1| hypothetical protein [Helianthus annuus]
 gi|188570597|gb|ACD64334.1| hypothetical protein [Helianthus annuus]
 gi|188570599|gb|ACD64335.1| hypothetical protein [Helianthus annuus]
 gi|188570601|gb|ACD64336.1| hypothetical protein [Helianthus annuus]
 gi|188570603|gb|ACD64337.1| hypothetical protein [Helianthus petiolaris]
 gi|188570605|gb|ACD64338.1| hypothetical protein [Helianthus petiolaris]
 gi|188570607|gb|ACD64339.1| hypothetical protein [Helianthus petiolaris]
 gi|188570609|gb|ACD64340.1| hypothetical protein [Helianthus petiolaris]
 gi|188570611|gb|ACD64341.1| hypothetical protein [Helianthus petiolaris]
 gi|188570613|gb|ACD64342.1| hypothetical protein [Helianthus petiolaris]
 gi|188570615|gb|ACD64343.1| hypothetical protein [Helianthus petiolaris]
 gi|188570617|gb|ACD64344.1| hypothetical protein [Helianthus petiolaris]
 gi|188570619|gb|ACD64345.1| hypothetical protein [Helianthus petiolaris]
 gi|188570625|gb|ACD64348.1| hypothetical protein [Helianthus petiolaris]
 gi|188570627|gb|ACD64349.1| hypothetical protein [Helianthus petiolaris]
 gi|188570629|gb|ACD64350.1| hypothetical protein [Helianthus petiolaris]
 gi|188570633|gb|ACD64352.1| hypothetical protein [Helianthus petiolaris]
 gi|188570637|gb|ACD64354.1| hypothetical protein [Helianthus petiolaris]
 gi|188570641|gb|ACD64356.1| hypothetical protein [Helianthus petiolaris]
 gi|188570643|gb|ACD64357.1| hypothetical protein [Helianthus petiolaris]
 gi|188570645|gb|ACD64358.1| hypothetical protein [Helianthus petiolaris]
 gi|188570651|gb|ACD64361.1| hypothetical protein [Helianthus petiolaris]
 gi|188570653|gb|ACD64362.1| hypothetical protein [Helianthus petiolaris]
 gi|188570655|gb|ACD64363.1| hypothetical protein [Helianthus petiolaris]
 gi|188570657|gb|ACD64364.1| hypothetical protein [Helianthus petiolaris]
 gi|188570659|gb|ACD64365.1| hypothetical protein [Helianthus petiolaris]
 gi|188570661|gb|ACD64366.1| hypothetical protein [Helianthus petiolaris]
 gi|188570663|gb|ACD64367.1| hypothetical protein [Helianthus petiolaris]
 gi|188570665|gb|ACD64368.1| hypothetical protein [Helianthus petiolaris]
 gi|292384298|gb|ADE21402.1| unknown [Helianthus neglectus]
 gi|292384300|gb|ADE21403.1| unknown [Helianthus neglectus]
 gi|292384302|gb|ADE21404.1| unknown [Helianthus neglectus]
 gi|292384304|gb|ADE21405.1| unknown [Helianthus neglectus]
 gi|292384312|gb|ADE21409.1| unknown [Helianthus neglectus]
 gi|292384318|gb|ADE21412.1| unknown [Helianthus neglectus]
 gi|292384320|gb|ADE21413.1| unknown [Helianthus neglectus]
 gi|292384328|gb|ADE21417.1| unknown [Helianthus neglectus]
          Length = 210

 Score =  214 bits (545), Expect = 2e-52,   Method: Compositional matrix adjust.
 Identities = 126/209 (60%), Positives = 156/209 (74%), Gaps = 1/209 (0%)

Query: 690 GISAYSGGLWVAALQAASALAREVGDRGSEDYFLFKFQKAKVVYE-KLWNGSYFNYDNSG 748
           GISAY G LW+AALQAA+A+A E+ D+ +   +  KF KAK  +E KLWNGSYFNYD+  
Sbjct: 2   GISAYCGCLWLAALQAAAAMAVELNDKFAAQKYKKKFLKAKSSFESKLWNGSYFNYDSGS 61

Query: 749 SSQSSSIQADQLAGQWYARACGLLPIVDEDKARSALKKVYNYNVLKVMGGKRGAVNGMLP 808
           SS S SIQADQLAGQWY  + GL  +  +DK +S+L+K++++NV+KV GG+ GAVNGM P
Sbjct: 62  SSNSKSIQADQLAGQWYTASAGLPNLFGDDKIQSSLQKIFDFNVMKVRGGRMGAVNGMHP 121

Query: 809 DGRVDMSSMQSREIWSGVTYAVAASMIHEDLADIGFQTACGIYEAAWSGTGLGYAFQTPE 868
           +G+VD + MQSRE+W+GVTY VAA+MIH  + D  F TA GI+ A WS  G GYAFQTPE
Sbjct: 122 NGKVDETCMQSREVWTGVTYGVAATMIHAGMEDNAFTTAEGIFIAGWSEEGFGYAFQTPE 181

Query: 869 AWNTDDQYRSLCYMRPLAIWAMQWALTRP 897
            W  D  YRSL YMRPLAIW MQ AL +P
Sbjct: 182 GWTMDGSYRSLVYMRPLAIWGMQQALEKP 210


>gi|188570631|gb|ACD64351.1| hypothetical protein [Helianthus petiolaris]
          Length = 210

 Score =  214 bits (544), Expect = 2e-52,   Method: Compositional matrix adjust.
 Identities = 126/209 (60%), Positives = 156/209 (74%), Gaps = 1/209 (0%)

Query: 690 GISAYSGGLWVAALQAASALAREVGDRGSEDYFLFKFQKAKVVYE-KLWNGSYFNYDNSG 748
           GISAY G LW+AALQAA+A+A E+ D+ +   +  KF KAK  +E KLWNGSYFNYD+  
Sbjct: 2   GISAYCGCLWLAALQAAAAMAVELNDKLAAQKYKKKFLKAKSSFESKLWNGSYFNYDSGS 61

Query: 749 SSQSSSIQADQLAGQWYARACGLLPIVDEDKARSALKKVYNYNVLKVMGGKRGAVNGMLP 808
           SS S SIQADQLAGQWY  + GL  +  +DK +S+L+K++++NV+KV GG+ GAVNGM P
Sbjct: 62  SSNSKSIQADQLAGQWYTASAGLPNLFGDDKIQSSLQKIFDFNVMKVRGGRMGAVNGMHP 121

Query: 809 DGRVDMSSMQSREIWSGVTYAVAASMIHEDLADIGFQTACGIYEAAWSGTGLGYAFQTPE 868
           +G+VD + MQSRE+W+GVTY VAA+MIH  + D  F TA GI+ A WS  G GYAFQTPE
Sbjct: 122 NGKVDETCMQSREVWTGVTYGVAATMIHAGMEDNAFTTAEGIFIAGWSEEGFGYAFQTPE 181

Query: 869 AWNTDDQYRSLCYMRPLAIWAMQWALTRP 897
            W  D  YRSL YMRPLAIW MQ AL +P
Sbjct: 182 GWTMDGSYRSLVYMRPLAIWGMQQALEKP 210


>gi|444729905|gb|ELW70308.1| Non-lysosomal glucosylceramidase [Tupaia chinensis]
          Length = 279

 Score =  213 bits (542), Expect = 5e-52,   Method: Compositional matrix adjust.
 Identities = 114/240 (47%), Positives = 147/240 (61%), Gaps = 10/240 (4%)

Query: 659 MAYMD---QFDRDGDGMIENDGFPDQTYDTWSVSGISAYSGGLWVAALQAASALAREVGD 715
           MA M+   +FD+D DG+IEN G+ DQTYD W  +G SAY GGLW+AA+     +A   G 
Sbjct: 1   MAVMESEMKFDKDHDGLIENGGYADQTYDGWVTTGPSAYCGGLWLAAVAVMVQMAALCGA 60

Query: 716 RGSEDYFLFKFQKAKVVYEKL-WNGSYFNYDNSGSSQSSSIQADQLAGQWYARACGL--- 771
           +  ++ F     + +  YE+L WNG Y+NYD+S   QS SI +DQ AGQW+ +ACGL   
Sbjct: 61  QDVQEKFTSILTRGQEAYERLLWNGRYYNYDSSSQPQSRSIMSDQCAGQWFLKACGLGEG 120

Query: 772 -LPIVDEDKARSALKKVYNYNVLKVMGGKRGAVNGMLPDGRVDMSSMQSREIWSGVTYAV 830
              +        AL+ ++ +NV    GG  GAVNGM P G  D SS+QS E+W GV Y +
Sbjct: 121 DTEVFPTPHVVRALQTIFEFNVQAFAGGAMGAVNGMQPHGVPDRSSVQSDEVWVGVVYGL 180

Query: 831 AASMIHEDLADIGFQTACGIYEAAWSGTGLGYAFQTPEAWNTDDQYRSLCYMRPLAIWAM 890
           AA+MI E L   GF+TA G Y   W    LG AFQTPEA+     +RSL YMRPL+IWAM
Sbjct: 181 AATMIQEGLTWEGFRTAEGCYRTVWE--RLGLAFQTPEAYCQQRVFRSLAYMRPLSIWAM 238


>gi|292384310|gb|ADE21408.1| unknown [Helianthus neglectus]
          Length = 210

 Score =  213 bits (541), Expect = 6e-52,   Method: Compositional matrix adjust.
 Identities = 126/209 (60%), Positives = 156/209 (74%), Gaps = 1/209 (0%)

Query: 690 GISAYSGGLWVAALQAASALAREVGDRGSEDYFLFKFQKAKVVYE-KLWNGSYFNYDNSG 748
           GISAY G LW+AALQAA+A+A E+ D+ +   +  KF KAK  +E KLWNGSYFNYD+  
Sbjct: 2   GISAYCGCLWLAALQAAAAMAVELNDKFAAQKYKKKFFKAKSSFESKLWNGSYFNYDSGS 61

Query: 749 SSQSSSIQADQLAGQWYARACGLLPIVDEDKARSALKKVYNYNVLKVMGGKRGAVNGMLP 808
           SS S SIQADQLAGQWY  + GL  +  +DK +S+L+K++++NV+KV GG+ GAVNGM P
Sbjct: 62  SSNSKSIQADQLAGQWYTASAGLPNLFGDDKIQSSLQKIFDFNVMKVRGGRMGAVNGMHP 121

Query: 809 DGRVDMSSMQSREIWSGVTYAVAASMIHEDLADIGFQTACGIYEAAWSGTGLGYAFQTPE 868
           +G+VD + MQSRE+W+GVTY VAA+MIH  + D  F TA GI+ A WS  G GYAFQTPE
Sbjct: 122 NGKVDETCMQSREVWTGVTYGVAATMIHAGMEDNAFTTAEGIFIAGWSEEGFGYAFQTPE 181

Query: 869 AWNTDDQYRSLCYMRPLAIWAMQWALTRP 897
            W  D  YRSL YMRPLAIW MQ AL +P
Sbjct: 182 GWTMDGSYRSLVYMRPLAIWGMQQALEKP 210


>gi|188570647|gb|ACD64359.1| hypothetical protein [Helianthus petiolaris]
          Length = 210

 Score =  212 bits (540), Expect = 6e-52,   Method: Compositional matrix adjust.
 Identities = 126/209 (60%), Positives = 155/209 (74%), Gaps = 1/209 (0%)

Query: 690 GISAYSGGLWVAALQAASALAREVGDRGSEDYFLFKFQKAKVVYE-KLWNGSYFNYDNSG 748
           GISAY G LW+AALQAA+A+A E+ D+ +   +  KF KAK  +E KLWNGSYFNYD+  
Sbjct: 2   GISAYCGCLWLAALQAAAAMAVELNDKFAAQKYKKKFLKAKSSFESKLWNGSYFNYDSGS 61

Query: 749 SSQSSSIQADQLAGQWYARACGLLPIVDEDKARSALKKVYNYNVLKVMGGKRGAVNGMLP 808
           SS S SIQADQLAGQWY  + GL  +  +DK  S+L+K++++NV+KV GG+ GAVNGM P
Sbjct: 62  SSNSKSIQADQLAGQWYTASAGLPNLFGDDKIPSSLQKIFDFNVMKVRGGRMGAVNGMHP 121

Query: 809 DGRVDMSSMQSREIWSGVTYAVAASMIHEDLADIGFQTACGIYEAAWSGTGLGYAFQTPE 868
           +G+VD + MQSRE+W+GVTY VAA+MIH  + D  F TA GI+ A WS  G GYAFQTPE
Sbjct: 122 NGKVDETCMQSREVWTGVTYGVAATMIHAGMEDNAFTTAEGIFIAGWSEEGFGYAFQTPE 181

Query: 869 AWNTDDQYRSLCYMRPLAIWAMQWALTRP 897
            W  D  YRSL YMRPLAIW MQ AL +P
Sbjct: 182 GWTMDGSYRSLVYMRPLAIWGMQQALEKP 210


>gi|188570511|gb|ACD64291.1| hypothetical protein [Helianthus annuus]
          Length = 210

 Score =  212 bits (540), Expect = 7e-52,   Method: Compositional matrix adjust.
 Identities = 125/209 (59%), Positives = 156/209 (74%), Gaps = 1/209 (0%)

Query: 690 GISAYSGGLWVAALQAASALAREVGDRGSEDYFLFKFQKAKVVYE-KLWNGSYFNYDNSG 748
           GISAY G LW+AALQAA+A+A E+ D+ +   +  KF KAK  +E KLWNGSYFNYD+  
Sbjct: 2   GISAYCGCLWLAALQAAAAMAVELNDKFAAQKYKKKFLKAKSSFESKLWNGSYFNYDSGS 61

Query: 749 SSQSSSIQADQLAGQWYARACGLLPIVDEDKARSALKKVYNYNVLKVMGGKRGAVNGMLP 808
           SS S SIQADQLAGQWY  + GL  +  +DK +S+L+K++++NV+KV GG+ GAVNGM P
Sbjct: 62  SSNSKSIQADQLAGQWYTASAGLPNLFGDDKIQSSLQKIFDFNVMKVRGGRMGAVNGMHP 121

Query: 809 DGRVDMSSMQSREIWSGVTYAVAASMIHEDLADIGFQTACGIYEAAWSGTGLGYAFQTPE 868
           +G+VD + MQSRE+W+GVTY VAA+MIH  + D  F TA GI+ A WS  G GYAFQTPE
Sbjct: 122 NGKVDETCMQSREVWTGVTYGVAATMIHAGMEDNAFTTAEGIFIAGWSEEGFGYAFQTPE 181

Query: 869 AWNTDDQYRSLCYMRPLAIWAMQWALTRP 897
            W  D  YRSL YMRPLAIW +Q AL +P
Sbjct: 182 GWTMDGSYRSLVYMRPLAIWGIQQALEKP 210


>gi|188570635|gb|ACD64353.1| hypothetical protein [Helianthus petiolaris]
 gi|188570639|gb|ACD64355.1| hypothetical protein [Helianthus petiolaris]
          Length = 211

 Score =  212 bits (539), Expect = 8e-52,   Method: Compositional matrix adjust.
 Identities = 102/163 (62%), Positives = 124/163 (76%)

Query: 735 KLWNGSYFNYDNSGSSQSSSIQADQLAGQWYARACGLLPIVDEDKARSALKKVYNYNVLK 794
           KLWNGSYFNYD+  SS S SIQADQLAGQWY  + GL  +  +DK +S+L+K++++NV+K
Sbjct: 49  KLWNGSYFNYDSGSSSNSKSIQADQLAGQWYTASAGLPNLFGDDKIQSSLQKIFDFNVMK 108

Query: 795 VMGGKRGAVNGMLPDGRVDMSSMQSREIWSGVTYAVAASMIHEDLADIGFQTACGIYEAA 854
           V GG+ GAVNGM P+G+VD + MQSRE+W+GVTY VAA+MIH  + D  F TA GI+ A 
Sbjct: 109 VRGGRMGAVNGMHPNGKVDETCMQSREVWTGVTYGVAATMIHAGMEDNAFTTAEGIFIAG 168

Query: 855 WSGTGLGYAFQTPEAWNTDDQYRSLCYMRPLAIWAMQWALTRP 897
           WS  G GYAFQTPE W  D  YRSL YMRPLAIW MQ AL +P
Sbjct: 169 WSEEGFGYAFQTPEGWTMDGSYRSLVYMRPLAIWGMQQALEKP 211


>gi|292384324|gb|ADE21415.1| unknown [Helianthus neglectus]
          Length = 210

 Score =  212 bits (539), Expect = 8e-52,   Method: Compositional matrix adjust.
 Identities = 102/163 (62%), Positives = 124/163 (76%)

Query: 735 KLWNGSYFNYDNSGSSQSSSIQADQLAGQWYARACGLLPIVDEDKARSALKKVYNYNVLK 794
           KLWNGSYFNYD+  SS S SIQADQLAGQWY  + GL  +  +DK +S+L+K++++NV+K
Sbjct: 48  KLWNGSYFNYDSGSSSNSKSIQADQLAGQWYTASAGLPNLFGDDKIQSSLQKIFDFNVMK 107

Query: 795 VMGGKRGAVNGMLPDGRVDMSSMQSREIWSGVTYAVAASMIHEDLADIGFQTACGIYEAA 854
           V GG+ GAVNGM P+G+VD + MQSRE+W+GVTY VAA+MIH  + D  F TA GI+ A 
Sbjct: 108 VRGGRMGAVNGMHPNGKVDETCMQSREVWTGVTYGVAATMIHAGMEDNAFTTAEGIFIAG 167

Query: 855 WSGTGLGYAFQTPEAWNTDDQYRSLCYMRPLAIWAMQWALTRP 897
           WS  G GYAFQTPE W  D  YRSL YMRPLAIW MQ AL +P
Sbjct: 168 WSEEGFGYAFQTPEGWTMDGSYRSLVYMRPLAIWGMQQALEKP 210


>gi|292384322|gb|ADE21414.1| unknown [Helianthus neglectus]
          Length = 207

 Score =  212 bits (539), Expect = 1e-51,   Method: Compositional matrix adjust.
 Identities = 102/163 (62%), Positives = 124/163 (76%)

Query: 735 KLWNGSYFNYDNSGSSQSSSIQADQLAGQWYARACGLLPIVDEDKARSALKKVYNYNVLK 794
           KLWNGSYFNYD+  SS S SIQADQLAGQWY  + GL  +  +DK +S+L+K++++NV+K
Sbjct: 45  KLWNGSYFNYDSGSSSNSKSIQADQLAGQWYTASAGLPNLFGDDKIQSSLQKIFDFNVMK 104

Query: 795 VMGGKRGAVNGMLPDGRVDMSSMQSREIWSGVTYAVAASMIHEDLADIGFQTACGIYEAA 854
           V GG+ GAVNGM P+G+VD + MQSRE+W+GVTY VAA+MIH  + D  F TA GI+ A 
Sbjct: 105 VRGGRMGAVNGMHPNGKVDETCMQSREVWTGVTYGVAATMIHAGMEDNAFTTAEGIFIAG 164

Query: 855 WSGTGLGYAFQTPEAWNTDDQYRSLCYMRPLAIWAMQWALTRP 897
           WS  G GYAFQTPE W  D  YRSL YMRPLAIW MQ AL +P
Sbjct: 165 WSEEGFGYAFQTPEGWTMDGSYRSLVYMRPLAIWGMQQALEKP 207


>gi|443318654|ref|ZP_21047901.1| putative bile acid beta-glucosidase [Leptolyngbya sp. PCC 6406]
 gi|442781756|gb|ELR91849.1| putative bile acid beta-glucosidase [Leptolyngbya sp. PCC 6406]
          Length = 857

 Score =  211 bits (538), Expect = 1e-51,   Method: Compositional matrix adjust.
 Identities = 135/409 (33%), Positives = 200/409 (48%), Gaps = 52/409 (12%)

Query: 529 IGQFLYLEGIEYLMWNTYDVHFYSSFALIMLFPKIQLSIQRDFAAAVMMHDPSK------ 582
           +GQF  LE ++Y  + + DV  Y  FA ++L+P+++ ++ R FA A+   D         
Sbjct: 439 VGQFGVLECVDYRWYESLDVRLYGGFATLILWPELEKAVIRAFARAIPTADDHPRIIGYY 498

Query: 583 MKLLDDGQWVSRKVLGAVPHDIGICD--PWFEVNAYCLYDTARWKDLNPKFVLQVYRDVV 640
             + +   +  RK+ GA PHD+G  +  P+ + N     D  +WKDL   FV+QVYR   
Sbjct: 499 YTIGETDHYAPRKLPGATPHDLGAPNEHPFVKTNYTSYQDCNQWKDLPSDFVIQVYRAYK 558

Query: 641 ATG--DKKFAKAVWPSVYVAMAYMDQFDRDGDGMIENDGFPDQTYDTWSVSGISAYSGGL 698
            TG  D  F    WP+V   + Y+ +FD D DG+ EN G PDQT+D W + G+SAY GGL
Sbjct: 559 MTGATDGDFLADCWPAVTETLKYLKRFDTDSDGIPENGGAPDQTFDDWQLKGLSAYCGGL 618

Query: 699 WVAALQAASALAREVGDRGS---------EDYFLFKFQKAKVVYEKLWNGSYFNYDNSGS 749
           W+AAL++A A+A  +   G            Y  +  Q  KV +++LWNG Y+  +    
Sbjct: 619 WMAALESAIAIADTLTAAGRAPGNTPILVTQYRRWLEQSRKVYHKRLWNGRYYRLET--G 676

Query: 750 SQSSSIQADQLAGQWYARACGLLPIVDEDKARSALKKVYNYNVLKVM------------- 796
           S SS + ADQL GQ+ AR   L  +V+E   +SAL  +Y+   LK               
Sbjct: 677 SDSSVVMADQLCGQFCARLMELPDLVEEKYVQSALAAIYDACFLKFNNYAATQARPQEQK 736

Query: 797 --------------GGKRGAVNGMLPDGRV-DMSSMQSREIWSGVTYAVAASMIHEDLAD 841
                         G   GA NG+LPDG   D       E+W+G+ + +A         D
Sbjct: 737 FIGSQTGTFRSAQPGIPIGAANGVLPDGSPEDPDGTHQLEVWTGINFGLATFFAQMGQVD 796

Query: 842 IGFQTACGIYEAAWSGTGLGYAFQTPEAWNTDDQYRSLCYMRPLAIWAM 890
             F     + +  ++    G  F+TPEA      +R+  Y+RPLAIW +
Sbjct: 797 KAFAITEAVVQQIYT---HGLQFRTPEAITALGTFRACHYLRPLAIWGL 842



 Score =  171 bits (433), Expect = 2e-39,   Method: Compositional matrix adjust.
 Identities = 129/410 (31%), Positives = 184/410 (44%), Gaps = 67/410 (16%)

Query: 95  HGVPLGGVGSGSIGRSYRGEFQRWQIFPRVCEDKPVLANQFSVFVSRSNGQKYSSVLCPK 154
           HG PLGG G+G IGRS  G+F  W +       +   A QFSVF    + Q+ +  L  +
Sbjct: 39  HGAPLGGFGAGCIGRSPHGDFNLWHLDGGEHTFQTFPACQFSVFEQWGD-QRQAYALSTE 97

Query: 155 TPEVLKDTTAAGIGSWDW------------NLKGDKS-----------TYHALYPRAWTV 191
            P      T   +G W W              K   S           TYHALYPR+W V
Sbjct: 98  AP------TDGTLGQWQWYPATLQNSNQNSKFKIQNSELPASPPLSTGTYHALYPRSWYV 151

Query: 192 HEGEPDPELRIVCRQISPIIPHNYKESSYPVSVFTYTIYNSGKTSADITLLFTWTNSVG- 250
           +E     +L   C Q SPI   NY+E+SYPV+VF +T +N       +++L +W N VG 
Sbjct: 152 YENVFQAQL--TCEQFSPIWARNYQEASYPVAVFLWTAHNPTDQPITLSILLSWQNMVGW 209

Query: 251 ---------------GDSEF-----TGQHYNSKTKMNDGVHAV--LLHHRTSHQLPPV-- 286
                          G   +      GQ   +  ++     AV  +++   S  + P   
Sbjct: 210 FTNTAKSPQVLQRDDGSPYYDYVSAIGQSAGNANRLVQTGAAVGYVMNGAISPGITPTEG 269

Query: 287 --TFALAAQETDGVHVSLCPHFVISGNSLGLTAKDMWHEIKEHGSFDRLNSMETSVTSEP 344
              +A+A     GV V     +    N +G    D+W+     GS   +NS +    +  
Sbjct: 270 EGQWAIATLTQPGVEV----FYHTRWNPVG-DGGDLWNGFAIDGSL--VNSGDDRPAAA- 321

Query: 345 GSSIGAAIAASVTVPPDSEGQVTFSLAWDCPEVNFMSGKTYYRRYTKFYGTHQNAAANIA 404
           G  IG AIA   T+ P    Q+  +L WD P   + +G   YRRYT F+G     A  IA
Sbjct: 322 GEQIGGAIAVRFTLQPGETRQMPMTLVWDLPVTEYAAGVADYRRYTDFFGREGQNAWVIA 381

Query: 405 RDAILEHGSWELQIEAWQRPILEDKRLPEWYPITLFNELYYLNAGGAVWT 454
           + A+  + +W  QI  WQ+PIL+    P+W+ + LFNELY L +GG +WT
Sbjct: 382 QTALANYTTWRQQIINWQQPILDQPDFPDWFKMALFNELYDLTSGGTLWT 431


>gi|292384308|gb|ADE21407.1| unknown [Helianthus neglectus]
          Length = 205

 Score =  211 bits (537), Expect = 2e-51,   Method: Compositional matrix adjust.
 Identities = 102/160 (63%), Positives = 123/160 (76%)

Query: 735 KLWNGSYFNYDNSGSSQSSSIQADQLAGQWYARACGLLPIVDEDKARSALKKVYNYNVLK 794
           KLWNGSYFNYD+  SS S SIQADQLAGQWY  + GL  + D+DK +S+L+K++++NV+K
Sbjct: 45  KLWNGSYFNYDSGSSSNSKSIQADQLAGQWYTASAGLPNLFDDDKIQSSLQKIFDFNVMK 104

Query: 795 VMGGKRGAVNGMLPDGRVDMSSMQSREIWSGVTYAVAASMIHEDLADIGFQTACGIYEAA 854
           V GG+ GAVNGM P+G+VD + MQSRE+W+GVTY VAA+MIH  + D  F TA GI+ A 
Sbjct: 105 VRGGRMGAVNGMHPNGKVDETCMQSREVWTGVTYGVAATMIHAGMEDNAFTTAEGIFIAG 164

Query: 855 WSGTGLGYAFQTPEAWNTDDQYRSLCYMRPLAIWAMQWAL 894
           WS  G GYAFQTPE W  D  YRSL YMRPLAIW MQ AL
Sbjct: 165 WSEEGFGYAFQTPEGWTMDGSYRSLVYMRPLAIWGMQQAL 204


>gi|195436933|ref|XP_002066400.1| GK18271 [Drosophila willistoni]
 gi|194162485|gb|EDW77386.1| GK18271 [Drosophila willistoni]
          Length = 955

 Score =  210 bits (535), Expect = 2e-51,   Method: Compositional matrix adjust.
 Identities = 140/427 (32%), Positives = 219/427 (51%), Gaps = 44/427 (10%)

Query: 55  KETVQLAPIGVRILCLIREEAAKGKRAFIDPFIKRHLTSSHGVPLGGVGSGSIGRSYRGE 114
           K+T+ + P+  R      + + +G+R ++D +   +    +GVP+GG+G G+IGR Y GE
Sbjct: 81  KQTLPMVPLVCRYAAYYWKVSREGRRVYMDYYYMENGKQIYGVPIGGIGGGTIGRGYAGE 140

Query: 115 FQRWQIFPRVCEDKPVLANQFSVFVSRSNG--------QKYSSVLCPKTPEVLKDTTAAG 166
           F R+Q+ P + E   VLANQF V +    G         K SS++     E  +D  +  
Sbjct: 141 FCRFQMRPGIYEYNTVLANQFIVTIKDQKGCTIFQSLLSKCSSIINNNDTEADEDEKSRT 200

Query: 167 ---------------IGSWDWNLKGDKSTYHALYPRAWT----VHEGEPDPELRIVCRQI 207
                          + +W  N+   + +Y  LYPR+WT     H G     +R++CRQI
Sbjct: 201 KCQLPNCSSSRSRQPLSTWHSNIDDSRCSYTGLYPRSWTEYDLSHYG-----VRLICRQI 255

Query: 208 SPIIPHNYKESSYPVSVFTYTIYNSGKTSADITLLFTWTNSVGGDSEFTGQHYNSKTKMN 267
           SP+IPH+YKESS P +VF ++  N       +++ FT+ N  G   +       S+  +N
Sbjct: 256 SPVIPHDYKESSLPCAVFVWSAENVCDQERKVSITFTFKNGTGNKKQDAEGGAESQL-IN 314

Query: 268 DGVHAVLLHHRTSHQLPPVTFALAAQETDGVHVSLCPHFVISGNSLGLTAKDMWHEIKEH 327
           +G HA  +  R      P ++ LA +    + ++ CP F  +GN   L     W ++KEH
Sbjct: 315 EG-HAKGVAIRQKIADMPCSYNLACRVLPDISITRCPQFDPAGNGEAL-----WAQLKEH 368

Query: 328 GSFDRLNSMETSVTSEPGSSIGAAIAASVTVPPDSEGQVTFSLAWDCPEVNFMSG-KTYY 386
           G      + ET  T +    IG A+ A + + P +  ++ F LAWD P++ F    KT+ 
Sbjct: 369 GQLSDEPTQETLKTKD----IGVAVCAHIALKPQASHELEFVLAWDMPKIQFPRKLKTHT 424

Query: 387 RRYTKFYGTHQNAAANIARDAILEHGSWELQIEAWQRPILEDKRLPEWYPITLFNELYYL 446
           R YTK++     A   I   A+  + SWE  I+AWQRPIL D+ LPEWY   +FN+LY++
Sbjct: 425 RYYTKYFDDSGEAGPRICEYALKHYQSWERLIDAWQRPILNDEGLPEWYKCAIFNQLYFI 484

Query: 447 NAGGAVW 453
           + GG +W
Sbjct: 485 SDGGTIW 491



 Score =  202 bits (513), Expect = 1e-48,   Method: Compositional matrix adjust.
 Identities = 109/261 (41%), Positives = 158/261 (60%), Gaps = 12/261 (4%)

Query: 647 FAKAVWPSVYVAMAYMDQFDRDGDGMIENDGFPDQTYDTWSVSGISAYSGGLWVAALQAA 706
           + K+++ +    M    ++D+D DG+IEN   PDQTYD+W + G SAY  GLW+A+LQA 
Sbjct: 694 YLKSMYAACKAIMERTIEYDKDNDGLIENTKMPDQTYDSWVMDGPSAYCSGLWLASLQAM 753

Query: 707 SALAREVGDRGSEDYFLFKFQKAK-VVYEKLWNGSYFNYDNSGSSQSSSIQADQLAGQWY 765
           SA+A  +        +    +K K  + EKLWNGSY+ +D S  S   +I ADQL G WY
Sbjct: 754 SAMATILDQPNDCLRYQDILEKGKHSLEEKLWNGSYYRFDLS-HSHRDTIMADQLCGHWY 812

Query: 766 ARACGL-LPIVDEDKARSALKKVYNYNVLKVMGGKRGAVNGMLPD-------GRVDMSSM 817
            ++CG    I  ++  R+ALK++Y+ NV+    G  GA NG + +       G VD S++
Sbjct: 813 LKSCGFDYEIYPKENVRTALKRIYDNNVMGFHDGNIGAANGFIANVSEPSKAGHVDNSNI 872

Query: 818 QSREIWSGVTYAVAASMIHEDLADIGFQTACGIYEAAWSGTGLGYAFQTPEAWNTDDQYR 877
           Q+ E+W GV YA+AA+MI E + D  FQTA G+Y+      G+   F+TPEA   + +YR
Sbjct: 873 QAEEVWPGVVYALAATMIQEGMFDEAFQTAGGMYKTISQRIGMN--FETPEALYGEKRYR 930

Query: 878 SLCYMRPLAIWAMQWALTRPK 898
           S+ YMRPL+IW+MQ A  R +
Sbjct: 931 SIGYMRPLSIWSMQVAWERRR 951



 Score =  122 bits (305), Expect = 1e-24,   Method: Compositional matrix adjust.
 Identities = 68/157 (43%), Positives = 94/157 (59%), Gaps = 3/157 (1%)

Query: 511 VALNSSFGTNLLQDGEE-NIGQFLYLEGIEYLMWNTYDVHFYSSFALIMLFPKIQLSIQR 569
           +  +SS G  L  D      G+F YLEG EY M+NTYDVHFY+S AL  L+P +Q+S+Q 
Sbjct: 492 LKCDSSLGKELAYDDPRLAYGRFGYLEGHEYRMYNTYDVHFYASPALAHLWPNLQVSLQF 551

Query: 570 DFAAAVMMHDPSKMKLLDDGQWVSRKVLGAVPHDIGICD--PWFEVNAYCLYDTARWKDL 627
           DF  A+        K+L DG+ + RKV   VPHD+G  D  P+  +N Y ++D   WKDL
Sbjct: 552 DFKDAIAAELNDTRKMLYDGKIMPRKVKNCVPHDLGDPDEEPFTLINCYNIHDVNEWKDL 611

Query: 628 NPKFVLQVYRDVVATGDKKFAKAVWPSVYVAMAYMDQ 664
           N KFVLQVYRD     +   A+A   S + ++ ++D+
Sbjct: 612 NTKFVLQVYRDYYVLNELAQAQADNASKFSSIEFIDK 648


>gi|188570667|gb|ACD64369.1| hypothetical protein [Bahiopsis lanata]
 gi|188570669|gb|ACD64370.1| hypothetical protein [Bahiopsis lanata]
          Length = 208

 Score =  210 bits (534), Expect = 4e-51,   Method: Compositional matrix adjust.
 Identities = 102/163 (62%), Positives = 122/163 (74%)

Query: 735 KLWNGSYFNYDNSGSSQSSSIQADQLAGQWYARACGLLPIVDEDKARSALKKVYNYNVLK 794
           KLWNGSYFNYD+  SS S SIQADQLAGQWY  + GL  +  +DK +S+L K++++NV+K
Sbjct: 46  KLWNGSYFNYDSGSSSNSKSIQADQLAGQWYTASAGLPNLFGDDKIQSSLHKIFDFNVMK 105

Query: 795 VMGGKRGAVNGMLPDGRVDMSSMQSREIWSGVTYAVAASMIHEDLADIGFQTACGIYEAA 854
           V GG+ GAVNGM P+G+VD + MQSRE+W+GVTY VAA+MIH  + D  F TA GI+ A 
Sbjct: 106 VRGGRMGAVNGMHPNGKVDETCMQSREVWTGVTYGVAATMIHAGMEDKAFTTAEGIFTAG 165

Query: 855 WSGTGLGYAFQTPEAWNTDDQYRSLCYMRPLAIWAMQWALTRP 897
           WS  G GYAFQTPE W  D  YRSL YMRPLAIW MQ AL  P
Sbjct: 166 WSEEGFGYAFQTPEGWTMDGCYRSLVYMRPLAIWGMQQALEAP 208


>gi|292384306|gb|ADE21406.1| unknown [Helianthus neglectus]
          Length = 205

 Score =  208 bits (530), Expect = 1e-50,   Method: Compositional matrix adjust.
 Identities = 101/160 (63%), Positives = 122/160 (76%)

Query: 735 KLWNGSYFNYDNSGSSQSSSIQADQLAGQWYARACGLLPIVDEDKARSALKKVYNYNVLK 794
           KLWNGSYFNYD+  SS S SIQADQLAGQWY  + GL  +  +DK +S+L+K++++NV+K
Sbjct: 45  KLWNGSYFNYDSGSSSNSKSIQADQLAGQWYTASAGLPNLFGDDKIQSSLQKIFDFNVMK 104

Query: 795 VMGGKRGAVNGMLPDGRVDMSSMQSREIWSGVTYAVAASMIHEDLADIGFQTACGIYEAA 854
           V GG+ GAVNGM P+G+VD + MQSRE+W+GVTY VAA+MIH  + D  F TA GI+ A 
Sbjct: 105 VRGGRMGAVNGMHPNGKVDETCMQSREVWTGVTYGVAATMIHAGMEDNAFTTAEGIFIAG 164

Query: 855 WSGTGLGYAFQTPEAWNTDDQYRSLCYMRPLAIWAMQWAL 894
           WS  G GYAFQTPE W  D  YRSL YMRPLAIW MQ AL
Sbjct: 165 WSEEGFGYAFQTPEGWTMDGSYRSLVYMRPLAIWGMQQAL 204


>gi|292384314|gb|ADE21410.1| unknown [Helianthus neglectus]
          Length = 205

 Score =  208 bits (530), Expect = 1e-50,   Method: Compositional matrix adjust.
 Identities = 101/160 (63%), Positives = 122/160 (76%)

Query: 735 KLWNGSYFNYDNSGSSQSSSIQADQLAGQWYARACGLLPIVDEDKARSALKKVYNYNVLK 794
           KLWNGSYFNYD+  SS S SIQADQLAGQWY  + GL  +  +DK +S+L+K++++NV+K
Sbjct: 45  KLWNGSYFNYDSGSSSNSKSIQADQLAGQWYTASAGLPNLFGDDKIQSSLQKIFDFNVMK 104

Query: 795 VMGGKRGAVNGMLPDGRVDMSSMQSREIWSGVTYAVAASMIHEDLADIGFQTACGIYEAA 854
           V GG+ GAVNGM P+G+VD + MQSRE+W+GVTY VAA+MIH  + D  F TA GI+ A 
Sbjct: 105 VRGGRMGAVNGMHPNGKVDETCMQSREVWTGVTYGVAATMIHAGMEDNAFTTAEGIFIAG 164

Query: 855 WSGTGLGYAFQTPEAWNTDDQYRSLCYMRPLAIWAMQWAL 894
           WS  G GYAFQTPE W  D  YRSL YMRPLAIW MQ AL
Sbjct: 165 WSEEGFGYAFQTPEGWTMDGSYRSLVYMRPLAIWGMQQAL 204


>gi|292384316|gb|ADE21411.1| unknown [Helianthus neglectus]
          Length = 205

 Score =  208 bits (529), Expect = 1e-50,   Method: Compositional matrix adjust.
 Identities = 101/160 (63%), Positives = 122/160 (76%)

Query: 735 KLWNGSYFNYDNSGSSQSSSIQADQLAGQWYARACGLLPIVDEDKARSALKKVYNYNVLK 794
           KLWNGSYFNYD+  SS S SIQADQLAGQWY  + GL  +  +DK +S+L+K++++NV+K
Sbjct: 45  KLWNGSYFNYDSGSSSNSKSIQADQLAGQWYTASAGLPNLFGDDKIQSSLQKIFDFNVMK 104

Query: 795 VMGGKRGAVNGMLPDGRVDMSSMQSREIWSGVTYAVAASMIHEDLADIGFQTACGIYEAA 854
           V GG+ GAVNGM P+G+VD + MQSRE+W+GVTY VAA+MIH  + D  F TA GI+ A 
Sbjct: 105 VRGGRMGAVNGMHPNGKVDETCMQSREVWTGVTYGVAATMIHAGMEDNAFTTAEGIFIAG 164

Query: 855 WSGTGLGYAFQTPEAWNTDDQYRSLCYMRPLAIWAMQWAL 894
           WS  G GYAFQTPE W  D  YRSL YMRPLAIW MQ AL
Sbjct: 165 WSEEGFGYAFQTPEGWTMDGSYRSLVYMRPLAIWGMQQAL 204


>gi|303285310|ref|XP_003061945.1| hypothetical protein MICPUCDRAFT_48400 [Micromonas pusilla CCMP1545]
 gi|226456356|gb|EEH53657.1| hypothetical protein MICPUCDRAFT_48400 [Micromonas pusilla CCMP1545]
          Length = 1073

 Score =  206 bits (525), Expect = 4e-50,   Method: Compositional matrix adjust.
 Identities = 249/995 (25%), Positives = 394/995 (39%), Gaps = 227/995 (22%)

Query: 93   SSHGVPLGGVGSGSIGRSYRGEFQRWQIFPRVCEDKPVLANQFSVFVSRSNGQKYSSVL- 151
            + HG PLGG+GSGSIGRSY G+F RW +       + V A+ F+    R NG   ++VL 
Sbjct: 74   ARHGAPLGGLGSGSIGRSYLGDFSRWHLKVGSHTHR-VAAHTFAA--CRVNG--VATVLS 128

Query: 152  ----------CPKTPEVLKDTTAAGIGSWDWNLKGDKSTYHALYPRAWTVHEGEPD---P 198
                       P + +                  G    Y ALYPRAW  +E  PD   P
Sbjct: 129  ASPAHASLEGVPSSEDDDDSPPPRTRRRRTLPEDGIGGNYTALYPRAW--YEYAPDALHP 186

Query: 199  ELRIVCRQI--SPIIPHNYKESSYPVSVFTYTIYNSGKTSADITLLFTWTN--------- 247
            + R+V  Q+  SP++P +Y+ S+ P  V  +   N G+ +A+++++ ++ +         
Sbjct: 187  DARVVVSQVQFSPVLPGDYRASAVPTGVLRFVARNDGRATANVSIMLSFESVLASADVAP 246

Query: 248  -SVGGDSE------------------------------FTGQHYNSKTKMNDGVHAVLLH 276
              V G  +                                G  +     + DG       
Sbjct: 247  LDVSGPRDAWRVASEPTTHETFFSSRESRREGRRRRRIIVGGAHAHTVGVTDGTFTPSAD 306

Query: 277  HRTSHQLPPVTFALAAQETDGVHVSLCPHF-VISGNSLGLTAKDMWHEIKEHGSFDRLNS 335
            H           A+AA+  DGV VS+   +  + G+++       W   + +G F     
Sbjct: 307  HHGG-------VAIAAEGGDGVEVSVLREYDTLDGDAVAAA----WDAFERNGGFAEDED 355

Query: 336  METSV----------------TSEPGSSIGAAIAASVTVPPDSEGQVTFSLAWDCPEVNF 379
             E                    + PG++  AA++A  ++ P     + F++AWD P   F
Sbjct: 356  EEDEAAAEEDAASAASASAAKVASPGAA--AAVSAKFSLKPGERRSIAFAIAWDLPVATF 413

Query: 380  MSGKTYYRRYTKFYGTHQNAAANIARDAILEHGSWELQIEAWQRPILEDK---------R 430
              G  + +RY          AA++A  A+    SWE ++ AWQ P ++           R
Sbjct: 414  PRGPAFLKRYAWRRRRRGGGAADVAMSALRRAASWERRVRAWQAPYVDSAANADSKSPVR 473

Query: 431  LPEWYPITLFNELYYLNAGGAVWTDGSPPVHSLVTIGHRKFSLDWSQSDLKRIVDVPNQ- 489
             P W+   LFNELYYL  GG +W  G P     +++G           D +    +    
Sbjct: 474  RPGWFVTALFNELYYLVDGGTLW--GRP-----LSVGGLDGDEFDDDEDDEVEDALDGDS 526

Query: 490  -NDTAVNIL--------ERMSSILEQIYTPVALNSSFGTNLLQDGEE---NIGQFLYLEG 537
             +D  V +L        ER S     +      +     +++ +G      +G+F     
Sbjct: 527  ADDAEVRLLPIRPRSRGERRSLRRLGLLGGEDGDGGDARDVIGNGGAWGGTLGRFGSASD 586

Query: 538  IEYLMWNTYDVHFYSSFALIMLFPKIQLSIQRDFAAAVMMHDPSKMKLL----------- 586
            ++   +N  D +FY S+ L  L+P + L++  D A AV   D ++ ++L           
Sbjct: 587  VDRATYNALDEYFYGSWPLATLWPGLDLAVIGDLAHAVDAADDTERRVLWRALGSAKKSA 646

Query: 587  ---------DDGQWVSRKVLGAVPHDIGICD--PWFEV-NAYCLYDTARWKDLNPKFVLQ 634
                        Q   RK+ G  P D+G     P     NA    D+    DL PK +L 
Sbjct: 647  GGTGATTTTIASQTKRRKIRGVAPRDLGTSRDAPLTSAPNALSDVDSNALLDLAPKLMLL 706

Query: 635  VYRDVVATG---------------DKKFAKAVWPSVYVAMA-YMDQFDRDGDGMIEN--- 675
            V R     G               ++   + V+   Y A+A  +   D +GDG+ E+   
Sbjct: 707  VARAHALRGGFRGPEAMASFGGGLERDALRRVFEPAYRALATQLRARDLNGDGLPEHGDA 766

Query: 676  DGFPDQTYDTWSVSG-ISAYSGGLWVAALQAASALAREVGDRGSEDYFLFKFQ-KAKVVY 733
            D      YD W VSG  S+Y GGLW+AAL+A + +AR++ +  +        +  AK   
Sbjct: 767  DAPTLSGYDRWVVSGNASSYGGGLWLAALRAGAGIARDLDETDARTSLEETLRIAAKSFD 826

Query: 734  EKLWNG------------------------SYFNYDNSGSSQSSSIQADQLAGQW---YA 766
            E LW G                         Y+  D SG+     + A Q+ G+W     
Sbjct: 827  EALWRGGGGGGGGGVRKSQSRSRRGTHALEGYYAADASGTDAGDVVLAGQVMGEWALGMI 886

Query: 767  RACGLLPIVDEDKARSALKKVYNYNVLKVMGGKR-----------------GAVNGMLPD 809
            RA G+L   D  K R+AL   Y  NV K  G  R                 GAVNG   D
Sbjct: 887  RAPGVL---DPRKVRAALSTTYERNV-KAFGRARGGSGSGSDDEPSTYVPSGAVNGARVD 942

Query: 810  GRVDMSSM----------QSREIWSGVTYAVAASMIHEDLADIGFQTACGIYEAAWSGTG 859
               D +++          Q+RE   G +YA+A+ +I    +D G+  A G Y   +    
Sbjct: 943  DATDAAALGVGAGDGIPAQARESRVGQSYALASHLILAGFSDEGWDVARGAYRVTYED-- 1000

Query: 860  LGYAFQTPEAWNTDDQYRSLCYMRPLAIWAMQWAL 894
             G+AF+TPE ++ + ++R     R  A+WA++ +L
Sbjct: 1001 -GFAFRTPEIFDAERRFRGAISGRAGAVWAIERSL 1034


>gi|45552032|ref|NP_788055.2| CG33090 [Drosophila melanogaster]
 gi|74876618|sp|Q7KT91.1|C3390_DROME RecName: Full=Non-lysosomal glucosylceramidase; Short=NLGase
 gi|45445134|gb|AAO41192.2| CG33090 [Drosophila melanogaster]
 gi|201065481|gb|ACH92150.1| FI02015p [Drosophila melanogaster]
          Length = 948

 Score =  206 bits (523), Expect = 6e-50,   Method: Compositional matrix adjust.
 Identities = 112/263 (42%), Positives = 160/263 (60%), Gaps = 12/263 (4%)

Query: 647 FAKAVWPSVYVAMAYMDQFDRDGDGMIENDGFPDQTYDTWSVSGISAYSGGLWVAALQAA 706
           + KA++ S    M    ++D+D DG+IEN   PDQTYD+W + G SAY  GLW+AALQA 
Sbjct: 687 YLKAMYASCKAIMERTIEYDKDNDGLIENTKMPDQTYDSWVMDGPSAYCSGLWLAALQAM 746

Query: 707 SALAREVGDRGSEDYFLFKFQKAK-VVYEKLWNGSYFNYDNSGSSQSSSIQADQLAGQWY 765
           SA+A  +        +    +K K  + EKLWNGSY+ +D S  S   +I ADQL G WY
Sbjct: 747 SAMATILDQPNDCLRYQDILEKGKRSLEEKLWNGSYYRFDLS-HSHRDTIMADQLCGHWY 805

Query: 766 ARACGL-LPIVDEDKARSALKKVYNYNVLKVMGGKRGAVNGMLPD-------GRVDMSSM 817
            ++CG    I  ++  R+ALK++Y+ NV+    G  GA NG + +       G VD S++
Sbjct: 806 LKSCGFDYEIYPKENVRTALKRIYDNNVMGFHEGNIGAANGFIANASEPTKPGHVDNSNI 865

Query: 818 QSREIWSGVTYAVAASMIHEDLADIGFQTACGIYEAAWSGTGLGYAFQTPEAWNTDDQYR 877
           Q+ E+W GV YA+AA+MI E + +  FQTA G+Y+      G+   F+TPEA   + +YR
Sbjct: 866 QAEEVWPGVVYALAATMIQEGMFEEAFQTAGGMYKTLSQRIGMN--FETPEALYGEKRYR 923

Query: 878 SLCYMRPLAIWAMQWALTRPKPK 900
           S+ YMRPL+IW+MQ AL R + +
Sbjct: 924 SIGYMRPLSIWSMQVALERRRAQ 946



 Score =  200 bits (508), Expect = 3e-48,   Method: Compositional matrix adjust.
 Identities = 127/425 (29%), Positives = 211/425 (49%), Gaps = 42/425 (9%)

Query: 55  KETVQLAPIGVRILCLIREEAAKGKRAFIDPFIKRHLTSSHGVPLGGVGSGSIGRSYRGE 114
           ++T+ + P+  R      + + +G+R ++D +   +    +GVP+GG+G G+IGR Y GE
Sbjct: 76  RQTLPMVPLVCRYAAYYWKVSREGRRVYMDYYYMENGKQIYGVPIGGIGGGTIGRGYAGE 135

Query: 115 FQRWQIFPRVCEDKPVLANQFSVFVSRSNGQKYSSVLCPKTPEVLKDTTAAG-------- 166
           F R+Q+ P + E   VLANQF V +    G      L  K     K +   G        
Sbjct: 136 FCRFQMRPGIYEYNVVLANQFIVTIKDPKGCTIFQSLLSKCSTRDKTSDPDGDPDGERTK 195

Query: 167 -------------IGSWDWNLKGDKSTYHALYPRAWT----VHEGEPDPELRIVCRQISP 209
                        + +W  N++  + +Y  LYPR+WT     H G     +R+ CRQ+SP
Sbjct: 196 CQLPNCSSRAKQPLSAWHSNIEDTRCSYTGLYPRSWTEYDLSHYG-----VRLTCRQVSP 250

Query: 210 IIPHNYKESSYPVSVFTYTIYNSGKTSADITLLFTWTNSVGGDSEFTGQHYNSKTKMNDG 269
           +IPH Y+ESS P +VF +++ N       +++ FT+ N  G   +       S+      
Sbjct: 251 VIPHEYRESSLPCAVFVWSVENVCDQERKVSITFTFKNGTGNKKQDAEGGAESQLISEGN 310

Query: 270 VHAVLLHHRTSHQLPPVTFALAAQETDGVHVSLCPHFVISGNSLGLTAKDMWHEIKEHGS 329
              V +  + S    P ++ LA +    + ++ CP F  +GN      + +W ++KEHG 
Sbjct: 311 AKGVSIRQKISEM--PCSYNLACRVLPEISITRCPQFDPAGN-----GEQLWAQLKEHGQ 363

Query: 330 FDRLNSMETSVTSEPGSSIGAAIAASVTVPPDSEGQVTFSLAWDCPEVNF-MSGKTYYRR 388
                + E   T +    IG A+   V + P +   + F LAWD P++ F    +T+ R 
Sbjct: 364 LSEHPTSEALKTKD----IGVAVCGQVALKPMASHDLEFVLAWDMPKIQFPRKMQTHTRY 419

Query: 389 YTKFYGTHQNAAANIARDAILEHGSWELQIEAWQRPILEDKRLPEWYPITLFNELYYLNA 448
           YTK++    ++   I   A+ ++ +WE  I+AWQRPIL D+ LP+WY   +FN+LY+++ 
Sbjct: 420 YTKYFDDSGDSGPRICEYALRQYSTWERLIDAWQRPILNDETLPDWYKCAIFNQLYFISD 479

Query: 449 GGAVW 453
           GG +W
Sbjct: 480 GGTIW 484


>gi|288558766|gb|ADC53514.1| MIP16949p [Drosophila melanogaster]
          Length = 432

 Score =  206 bits (523), Expect = 6e-50,   Method: Compositional matrix adjust.
 Identities = 112/263 (42%), Positives = 160/263 (60%), Gaps = 12/263 (4%)

Query: 647 FAKAVWPSVYVAMAYMDQFDRDGDGMIENDGFPDQTYDTWSVSGISAYSGGLWVAALQAA 706
           + KA++ S    M    ++D+D DG+IEN   PDQTYD+W + G SAY  GLW+AALQA 
Sbjct: 171 YLKAMYASCKAIMERTIEYDKDNDGLIENTKMPDQTYDSWVMDGPSAYCSGLWLAALQAM 230

Query: 707 SALAREVGDRGSEDYFLFKFQKAK-VVYEKLWNGSYFNYDNSGSSQSSSIQADQLAGQWY 765
           SA+A  +        +    +K K  + EKLWNGSY+ +D S  S   +I ADQL G WY
Sbjct: 231 SAMATILDQPNDCLRYQDILEKGKRSLEEKLWNGSYYRFDLS-HSHRDTIMADQLCGHWY 289

Query: 766 ARACGL-LPIVDEDKARSALKKVYNYNVLKVMGGKRGAVNGMLPD-------GRVDMSSM 817
            ++CG    I  ++  R+ALK++Y+ NV+    G  GA NG + +       G VD S++
Sbjct: 290 LKSCGFDYEIYPKENVRTALKRIYDNNVMGFHEGNIGAANGFIANASEPTKPGHVDNSNI 349

Query: 818 QSREIWSGVTYAVAASMIHEDLADIGFQTACGIYEAAWSGTGLGYAFQTPEAWNTDDQYR 877
           Q+ E+W GV YA+AA+MI E + +  FQTA G+Y+      G+   F+TPEA   + +YR
Sbjct: 350 QAEEVWPGVVYALAATMIQEGMFEEAFQTAGGMYKTLSQRIGMN--FETPEALYGEKRYR 407

Query: 878 SLCYMRPLAIWAMQWALTRPKPK 900
           S+ YMRPL+IW+MQ AL R + +
Sbjct: 408 SIGYMRPLSIWSMQVALERRRAQ 430



 Score =  103 bits (258), Expect = 3e-19,   Method: Compositional matrix adjust.
 Identities = 54/125 (43%), Positives = 78/125 (62%), Gaps = 2/125 (1%)

Query: 542 MWNTYDVHFYSSFALIMLFPKIQLSIQRDFAAAVMMHDPSKMKLLDDGQWVSRKVLGAVP 601
           M+NTYDVHFY+S AL  L+P +Q+S+Q DF  A+        K+L DG+ + RKV   VP
Sbjct: 1   MYNTYDVHFYASPALAHLWPNLQVSLQYDFKDAIAAELNDTRKMLYDGKVMPRKVKNCVP 60

Query: 602 HDIGICD--PWFEVNAYCLYDTARWKDLNPKFVLQVYRDVVATGDKKFAKAVWPSVYVAM 659
           HD+G  D  P+  +N Y ++D   WKDLN KFVLQVYRD     +   A++   S + ++
Sbjct: 61  HDLGDPDEEPFTLINCYNIHDVNDWKDLNTKFVLQVYRDYYVLNELAQAQSDNASKFSSI 120

Query: 660 AYMDQ 664
            ++D+
Sbjct: 121 EFIDK 125


>gi|195338407|ref|XP_002035816.1| GM15538 [Drosophila sechellia]
 gi|194129696|gb|EDW51739.1| GM15538 [Drosophila sechellia]
          Length = 947

 Score =  204 bits (519), Expect = 2e-49,   Method: Compositional matrix adjust.
 Identities = 111/263 (42%), Positives = 160/263 (60%), Gaps = 12/263 (4%)

Query: 647 FAKAVWPSVYVAMAYMDQFDRDGDGMIENDGFPDQTYDTWSVSGISAYSGGLWVAALQAA 706
           + KA++ S    +    ++D+D DG+IEN   PDQTYD+W + G SAY  GLW+AALQA 
Sbjct: 686 YLKAMYASCKAILERTIEYDKDNDGLIENTKMPDQTYDSWVMDGPSAYCSGLWLAALQAM 745

Query: 707 SALAREVGDRGSEDYFLFKFQKAK-VVYEKLWNGSYFNYDNSGSSQSSSIQADQLAGQWY 765
           SA+A  +        +    +K K  + EKLWNGSY+ +D S  S   +I ADQL G WY
Sbjct: 746 SAMATILDQPNDCLRYQDILEKGKRSLEEKLWNGSYYRFDLS-HSHRDTIMADQLCGHWY 804

Query: 766 ARACGL-LPIVDEDKARSALKKVYNYNVLKVMGGKRGAVNGMLPD-------GRVDMSSM 817
            ++CG    I  ++  R+ALK++Y+ NV+    G  GA NG + +       G VD S++
Sbjct: 805 LKSCGFDYEIYPKENVRTALKRIYDNNVMGFHEGNIGAANGFIANAGEPTKPGHVDNSNI 864

Query: 818 QSREIWSGVTYAVAASMIHEDLADIGFQTACGIYEAAWSGTGLGYAFQTPEAWNTDDQYR 877
           Q+ E+W GV YA+AA+MI E + +  FQTA G+Y+      G+   F+TPEA   + +YR
Sbjct: 865 QAEEVWPGVVYALAATMIQEGMFEEAFQTAGGMYKTLSQRIGMN--FETPEALYGEKRYR 922

Query: 878 SLCYMRPLAIWAMQWALTRPKPK 900
           S+ YMRPL+IW+MQ AL R + +
Sbjct: 923 SIGYMRPLSIWSMQVALERRRAQ 945



 Score =  197 bits (501), Expect = 2e-47,   Method: Compositional matrix adjust.
 Identities = 129/425 (30%), Positives = 212/425 (49%), Gaps = 42/425 (9%)

Query: 55  KETVQLAPIGVRILCLIREEAAKGKRAFIDPFIKRHLTSSHGVPLGGVGSGSIGRSYRGE 114
           ++T+ L P+  R      + + +G+R ++D +   +    +GVP+GG+G G+IGR Y GE
Sbjct: 75  RQTLPLVPLVCRYAAYYWKVSREGRRVYMDYYYMENGKQIYGVPIGGIGGGTIGRGYAGE 134

Query: 115 FQRWQIFPRVCEDKPVLANQFSVFVSRSNGQKYSSVLCPK--TPEVLKDTTAAGIG---- 168
           F R+Q+ P + E   V ANQF V +    G      L  K  T +   D  A   G    
Sbjct: 135 FCRFQMRPGIYEYNVVQANQFIVTIKDPKGCTIFQSLLSKCSTRDKTSDPDADPDGERTK 194

Query: 169 ---------------SWDWNLKGDKSTYHALYPRAWT----VHEGEPDPELRIVCRQISP 209
                          +W  N++  + +Y  LYPR+WT     H G     +R+ CRQ+SP
Sbjct: 195 CQLPNCSSRAKQPLSAWHSNIEDSRCSYTGLYPRSWTEYDLSHYG-----VRLTCRQVSP 249

Query: 210 IIPHNYKESSYPVSVFTYTIYNSGKTSADITLLFTWTNSVGGDSEFTGQHYNSKTKMNDG 269
           +IPH Y+ESS P +VF +++ N       +++ FT+ N  G   +       S+      
Sbjct: 250 VIPHEYRESSQPCAVFVWSVENVCDQERKVSITFTFKNGTGNKKQDAEGGAESQLISEGN 309

Query: 270 VHAVLLHHRTSHQLPPVTFALAAQETDGVHVSLCPHFVISGNSLGLTAKDMWHEIKEHGS 329
              V +  + +    P ++ LA +    + ++ CP F  +GN      + +W ++KEHG 
Sbjct: 310 AKGVSIRQKIADM--PCSYNLACRVLPEISITRCPQFDPAGN-----GEQLWAQLKEHGQ 362

Query: 330 FDRLNSMETSVTSEPGSSIGAAIAASVTVPPDSEGQVTFSLAWDCPEVNF-MSGKTYYRR 388
                + E   T +    IG A+ A V + P +   + F LAWD P++ F    +T+ R 
Sbjct: 363 LSEQPTSEALKTKD----IGVAVCAQVALKPMASHDLEFVLAWDMPKIQFPRKMQTHTRY 418

Query: 389 YTKFYGTHQNAAANIARDAILEHGSWELQIEAWQRPILEDKRLPEWYPITLFNELYYLNA 448
           YTK++    ++   I   A+ ++ +WE  I+AWQRPIL D+ LP+WY   +FN+LY+++ 
Sbjct: 419 YTKYFDDSGDSGPRICEYALRQYSTWERLIDAWQRPILNDETLPDWYKCAIFNQLYFISD 478

Query: 449 GGAVW 453
           GG +W
Sbjct: 479 GGTIW 483



 Score =  120 bits (302), Expect = 3e-24,   Method: Compositional matrix adjust.
 Identities = 68/157 (43%), Positives = 93/157 (59%), Gaps = 3/157 (1%)

Query: 511 VALNSSFGTNL-LQDGEENIGQFLYLEGIEYLMWNTYDVHFYSSFALIMLFPKIQLSIQR 569
           +   SS G  L   D     G+F YLEG EY M+NTYDVHFY+S AL  L+P +Q+S+Q 
Sbjct: 484 LKCESSLGKELSYDDPRLAYGRFGYLEGHEYRMYNTYDVHFYASPALAHLWPNLQVSLQY 543

Query: 570 DFAAAVMMHDPSKMKLLDDGQWVSRKVLGAVPHDIGICD--PWFEVNAYCLYDTARWKDL 627
           DF  A+        K+L DG+ + RKV   VPHD+G  D  P+  +N Y ++D   WKDL
Sbjct: 544 DFKDAIAAELNDTRKMLYDGKVMPRKVKNCVPHDLGDPDEEPFTLINCYNIHDVNDWKDL 603

Query: 628 NPKFVLQVYRDVVATGDKKFAKAVWPSVYVAMAYMDQ 664
           N KFVLQVYRD     +   A+A   S + ++ ++D+
Sbjct: 604 NTKFVLQVYRDYYVLNELAQAQADNASKFSSIEFIDK 640


>gi|198473190|ref|XP_002133205.1| GA28784 [Drosophila pseudoobscura pseudoobscura]
 gi|198139343|gb|EDY70607.1| GA28784 [Drosophila pseudoobscura pseudoobscura]
          Length = 865

 Score =  204 bits (519), Expect = 2e-49,   Method: Compositional matrix adjust.
 Identities = 113/261 (43%), Positives = 158/261 (60%), Gaps = 12/261 (4%)

Query: 647 FAKAVWPSVYVAMAYMDQFDRDGDGMIENDGFPDQTYDTWSVSGISAYSGGLWVAALQAA 706
           + KA++ +    M    ++D+D DG+IEN   PDQTYD+W + G SAY  GLW+AALQA 
Sbjct: 604 YLKAMYGACKAIMERTIEYDKDNDGLIENTKMPDQTYDSWVMDGPSAYCSGLWLAALQAM 663

Query: 707 SALAREVGDRGSEDYFLFKFQKAK-VVYEKLWNGSYFNYDNSGSSQSSSIQADQLAGQWY 765
           SA+A  +        F    +K K  + EKLWNGSY+ +D S  S   SI ADQL G WY
Sbjct: 664 SAMATILDQPNDCLRFQDILEKGKRSLEEKLWNGSYYRFDLS-PSHRDSIMADQLCGHWY 722

Query: 766 ARACGL-LPIVDEDKARSALKKVYNYNVLKVMGGKRGAVNGMLPD-------GRVDMSSM 817
            ++CG    I  ++  R+ALK +Y+ NV+    G  GA NG + +       G VD S++
Sbjct: 723 LKSCGFDYEIYPKENVRTALKTIYDNNVMGFHEGNIGAANGFIANAADPSKPGHVDNSNI 782

Query: 818 QSREIWSGVTYAVAASMIHEDLADIGFQTACGIYEAAWSGTGLGYAFQTPEAWNTDDQYR 877
           Q+ E+W GV YA+AA+MI E + +  FQTA G+Y+      G+   F+TPEA   + +YR
Sbjct: 783 QAEEVWPGVVYALAATMIQEGMFEEAFQTAGGMYKTLSERIGMN--FETPEALYGEKRYR 840

Query: 878 SLCYMRPLAIWAMQWALTRPK 898
           S+ YMRPL+IW+MQ AL R +
Sbjct: 841 SIGYMRPLSIWSMQVALERRR 861



 Score =  189 bits (479), Expect = 9e-45,   Method: Compositional matrix adjust.
 Identities = 122/404 (30%), Positives = 202/404 (50%), Gaps = 43/404 (10%)

Query: 77  KGKRAFIDPFIKRHLTSSHGVPLGGVGSGSIGRSYRGEFQRWQIFPRVCEDKPVLANQFS 136
           +G+R ++D +   +    +GVP+GG+G G+IGR + GEF R+Q+ P + E   V ANQF 
Sbjct: 14  EGRRVYMDYYYMENGKQIYGVPIGGIGGGTIGRGFAGEFCRFQMRPGIYEYNVVQANQFI 73

Query: 137 VFVSRSNGQKYSSVLCPKTPEVLK-----DTTAAG-----------------IGSWDWNL 174
           V +    G      L  K   + +     D  A G                 + +W  N+
Sbjct: 74  VTIKDQKGCTIFQSLLSKCSTMSRGNNGSDPDADGEKTKCQLPNCSSRPKQPLSAWHSNI 133

Query: 175 KGDKSTYHALYPRAWT----VHEGEPDPELRIVCRQISPIIPHNYKESSYPVSVFTYTIY 230
           +  + +Y  LYPR+WT     H G     +R+VCRQ+SP+IPH Y++SS P +VF +++ 
Sbjct: 134 EDARCSYTGLYPRSWTEYDLSHYG-----VRLVCRQVSPVIPHEYRDSSLPCAVFVWSVE 188

Query: 231 NSGKTSADITLLFTWTNSVGGDSEFTGQHYNSKTKMNDGVHAVLLHHRTSHQLPPVTFAL 290
           N       +++ F++ N  G   +       S+         V +  + +    P ++ L
Sbjct: 189 NVCDQERIVSITFSFKNGTGNKKQDAEGGAESQLISAGNAKGVAIRQKIADM--PCSYNL 246

Query: 291 AAQETDGVHVSLCPHFVISGNSLGLTAKDMWHEIKEHGSFDRLNSMETSVTSEPGSSIGA 350
           A +    + ++ CP F  +G     + + +W ++KEHG      + ET  T +    IG 
Sbjct: 247 ACRVLPEISITRCPQFDPAG-----SGEQLWAQLKEHGQLSEQPTGETLKTKD----IGV 297

Query: 351 AIAASVTVPPDSEGQVTFSLAWDCPEVNF-MSGKTYYRRYTKFYGTHQNAAANIARDAIL 409
           A+ A + + P +   + F LAWD P + F    +T+ R YTK++     +   I   A+ 
Sbjct: 298 AVCAQLALKPQASHDLEFVLAWDMPRIQFPRKMQTHMRYYTKYFDDSGESGPKICEYALK 357

Query: 410 EHGSWELQIEAWQRPILEDKRLPEWYPITLFNELYYLNAGGAVW 453
           ++ SWE  I+AWQRPIL D  LP+WY   +FN+LY+++ GG +W
Sbjct: 358 QYPSWERLIDAWQRPILSDDTLPDWYKCAIFNQLYFISDGGTIW 401



 Score =  120 bits (301), Expect = 3e-24,   Method: Compositional matrix adjust.
 Identities = 67/157 (42%), Positives = 94/157 (59%), Gaps = 3/157 (1%)

Query: 511 VALNSSFGTNLLQDGEE-NIGQFLYLEGIEYLMWNTYDVHFYSSFALIMLFPKIQLSIQR 569
           +  +SS G  L  D      G+F YLEG EY M+NTYDVHFY+S AL  L+P +Q+S+Q 
Sbjct: 402 LKCDSSLGKELAYDDPRLAYGRFGYLEGHEYRMYNTYDVHFYASPALAHLWPNLQVSLQY 461

Query: 570 DFAAAVMMHDPSKMKLLDDGQWVSRKVLGAVPHDIGICD--PWFEVNAYCLYDTARWKDL 627
           DF  A+        K+L DG+ + RKV   VPHD+G  D  P+  +N Y ++D   WKDL
Sbjct: 462 DFKDAIAAELSDTRKMLYDGKVMPRKVKNCVPHDLGDPDEEPFTLINCYNIHDVNDWKDL 521

Query: 628 NPKFVLQVYRDVVATGDKKFAKAVWPSVYVAMAYMDQ 664
           N KFVLQVYRD     +   ++A   S + ++ ++D+
Sbjct: 522 NTKFVLQVYRDYYVLNELAQSQADNASKFSSIEFIDK 558


>gi|195164866|ref|XP_002023267.1| GL21266 [Drosophila persimilis]
 gi|194105352|gb|EDW27395.1| GL21266 [Drosophila persimilis]
          Length = 867

 Score =  204 bits (519), Expect = 2e-49,   Method: Compositional matrix adjust.
 Identities = 113/261 (43%), Positives = 158/261 (60%), Gaps = 12/261 (4%)

Query: 647 FAKAVWPSVYVAMAYMDQFDRDGDGMIENDGFPDQTYDTWSVSGISAYSGGLWVAALQAA 706
           + KA++ +    M    ++D+D DG+IEN   PDQTYD+W + G SAY  GLW+AALQA 
Sbjct: 606 YLKAMYGACKAIMERTIEYDKDNDGLIENTKMPDQTYDSWVMDGPSAYCSGLWLAALQAM 665

Query: 707 SALAREVGDRGSEDYFLFKFQKAK-VVYEKLWNGSYFNYDNSGSSQSSSIQADQLAGQWY 765
           SA+A  +        F    +K K  + EKLWNGSY+ +D S  S   SI ADQL G WY
Sbjct: 666 SAMATILDQPNDCLRFQDILEKGKRSLEEKLWNGSYYRFDLS-PSHRDSIMADQLCGHWY 724

Query: 766 ARACGL-LPIVDEDKARSALKKVYNYNVLKVMGGKRGAVNGMLPD-------GRVDMSSM 817
            ++CG    I  ++  R+ALK +Y+ NV+    G  GA NG + +       G VD S++
Sbjct: 725 LKSCGFDYEIYPKENVRTALKTIYDNNVMGFHEGNIGAANGFIANAADPSKPGHVDNSNI 784

Query: 818 QSREIWSGVTYAVAASMIHEDLADIGFQTACGIYEAAWSGTGLGYAFQTPEAWNTDDQYR 877
           Q+ E+W GV YA+AA+MI E + +  FQTA G+Y+      G+   F+TPEA   + +YR
Sbjct: 785 QAEEVWPGVVYALAATMIQEGMFEEAFQTAGGMYKTLSERIGMN--FETPEALYGEKRYR 842

Query: 878 SLCYMRPLAIWAMQWALTRPK 898
           S+ YMRPL+IW+MQ AL R +
Sbjct: 843 SIGYMRPLSIWSMQVALERRR 863



 Score =  188 bits (478), Expect = 1e-44,   Method: Compositional matrix adjust.
 Identities = 122/404 (30%), Positives = 202/404 (50%), Gaps = 43/404 (10%)

Query: 77  KGKRAFIDPFIKRHLTSSHGVPLGGVGSGSIGRSYRGEFQRWQIFPRVCEDKPVLANQFS 136
           +G+R ++D +   +    +GVP+GG+G G+IGR + GEF R+Q+ P + E   V ANQF 
Sbjct: 16  EGRRVYMDYYYMENGKQIYGVPIGGIGGGTIGRGFAGEFCRFQMRPGIYEYNVVQANQFI 75

Query: 137 VFVSRSNGQKYSSVLCPKTPEVLK-----DTTAAG-----------------IGSWDWNL 174
           V +    G      L  K   + +     D  A G                 + +W  N+
Sbjct: 76  VTIKDQKGCTIFQSLLSKCSTMSRGNNGSDPDADGEKTKCQLPNCSSRPKQPLSAWHSNI 135

Query: 175 KGDKSTYHALYPRAWT----VHEGEPDPELRIVCRQISPIIPHNYKESSYPVSVFTYTIY 230
           +  + +Y  LYPR+WT     H G     +R+VCRQ+SP+IPH Y++SS P +VF +++ 
Sbjct: 136 EDARCSYTGLYPRSWTEYDLSHYG-----VRLVCRQVSPVIPHEYRDSSLPCAVFVWSVE 190

Query: 231 NSGKTSADITLLFTWTNSVGGDSEFTGQHYNSKTKMNDGVHAVLLHHRTSHQLPPVTFAL 290
           N       +++ F++ N  G   +       S+         V +  + +    P ++ L
Sbjct: 191 NVCDQERIVSITFSFKNGTGNKKQDAEGGAESQLISAGNAKGVAIRQKIADM--PCSYNL 248

Query: 291 AAQETDGVHVSLCPHFVISGNSLGLTAKDMWHEIKEHGSFDRLNSMETSVTSEPGSSIGA 350
           A +    + ++ CP F  +G     + + +W ++KEHG      + ET  T +    IG 
Sbjct: 249 ACRVLPEISITRCPQFDPAG-----SGEQLWAQLKEHGQLSEQPTGETLKTKD----IGV 299

Query: 351 AIAASVTVPPDSEGQVTFSLAWDCPEVNF-MSGKTYYRRYTKFYGTHQNAAANIARDAIL 409
           A+ A + + P +   + F LAWD P + F    +T+ R YTK++     +   I   A+ 
Sbjct: 300 AVCAQLALKPQASHDLEFVLAWDMPRIQFPRKMQTHMRYYTKYFDDSGESGPKICEYALK 359

Query: 410 EHGSWELQIEAWQRPILEDKRLPEWYPITLFNELYYLNAGGAVW 453
           ++ SWE  I+AWQRPIL D  LP+WY   +FN+LY+++ GG +W
Sbjct: 360 QYPSWERLIDAWQRPILSDDTLPDWYKCAIFNQLYFISDGGTIW 403



 Score =  120 bits (301), Expect = 3e-24,   Method: Compositional matrix adjust.
 Identities = 67/157 (42%), Positives = 94/157 (59%), Gaps = 3/157 (1%)

Query: 511 VALNSSFGTNLLQDGEE-NIGQFLYLEGIEYLMWNTYDVHFYSSFALIMLFPKIQLSIQR 569
           +  +SS G  L  D      G+F YLEG EY M+NTYDVHFY+S AL  L+P +Q+S+Q 
Sbjct: 404 LKCDSSLGKELAYDDPRLAYGRFGYLEGHEYRMYNTYDVHFYASPALAHLWPNLQVSLQY 463

Query: 570 DFAAAVMMHDPSKMKLLDDGQWVSRKVLGAVPHDIGICD--PWFEVNAYCLYDTARWKDL 627
           DF  A+        K+L DG+ + RKV   VPHD+G  D  P+  +N Y ++D   WKDL
Sbjct: 464 DFKDAIAAELSDTRKMLYDGKVMPRKVKNCVPHDLGDPDEEPFTLINCYNIHDVNDWKDL 523

Query: 628 NPKFVLQVYRDVVATGDKKFAKAVWPSVYVAMAYMDQ 664
           N KFVLQVYRD     +   ++A   S + ++ ++D+
Sbjct: 524 NTKFVLQVYRDYYVLNELAQSQADNASKFSSIEFIDK 560


>gi|195474003|ref|XP_002089281.1| GE19029 [Drosophila yakuba]
 gi|194175382|gb|EDW88993.1| GE19029 [Drosophila yakuba]
          Length = 947

 Score =  204 bits (518), Expect = 3e-49,   Method: Compositional matrix adjust.
 Identities = 111/261 (42%), Positives = 158/261 (60%), Gaps = 12/261 (4%)

Query: 647 FAKAVWPSVYVAMAYMDQFDRDGDGMIENDGFPDQTYDTWSVSGISAYSGGLWVAALQAA 706
           + KA++      M    ++D+D DG+IEN   PDQTYD+W + G SAY  GLW+AALQA 
Sbjct: 686 YLKAMYAPCKAIMERTIEYDKDNDGLIENTKMPDQTYDSWVMDGPSAYCSGLWLAALQAM 745

Query: 707 SALAREVGDRGSEDYFLFKFQKAK-VVYEKLWNGSYFNYDNSGSSQSSSIQADQLAGQWY 765
           SA+A  +        +    +K K  + EKLWNGSY+ +D S  S   +I ADQL G WY
Sbjct: 746 SAMATILDQPNDCLRYQDILEKGKRSLEEKLWNGSYYRFDLS-HSHRDTIMADQLCGHWY 804

Query: 766 ARACGL-LPIVDEDKARSALKKVYNYNVLKVMGGKRGAVNGMLPD-------GRVDMSSM 817
            ++CG    I  ++  R+ALK++Y+ NV+    G  GA NG + +       G VD S++
Sbjct: 805 LKSCGFDYEIYPKENVRTALKRIYDNNVMGFHEGNIGAANGFIANAGEPTKPGHVDNSNI 864

Query: 818 QSREIWSGVTYAVAASMIHEDLADIGFQTACGIYEAAWSGTGLGYAFQTPEAWNTDDQYR 877
           Q+ E+W GV YA+AA+MI E + +  FQTA G+Y+      G+   F+TPEA   + +YR
Sbjct: 865 QAEEVWPGVVYALAATMIQEGMFEEAFQTAGGMYKTLSQRIGMN--FETPEALYGEKRYR 922

Query: 878 SLCYMRPLAIWAMQWALTRPK 898
           S+ YMRPL+IW+MQ AL R +
Sbjct: 923 SIGYMRPLSIWSMQVALERRR 943



 Score =  195 bits (495), Expect = 1e-46,   Method: Compositional matrix adjust.
 Identities = 124/425 (29%), Positives = 211/425 (49%), Gaps = 42/425 (9%)

Query: 55  KETVQLAPIGVRILCLIREEAAKGKRAFIDPFIKRHLTSSHGVPLGGVGSGSIGRSYRGE 114
           ++T+ + P+  R      + + +G+R ++D +   +    +GVP+GG+G G+IGR Y GE
Sbjct: 75  RQTLPMVPLVCRYAAYYWKVSREGRRVYMDYYYMENGKQIYGVPIGGIGGGTIGRGYAGE 134

Query: 115 FQRWQIFPRVCEDKPVLANQFSVFVSRSNGQKYSSVLCPKT--------PEVLKD----- 161
           F R+Q+ P + E   V ANQF V +    G      L  K         P+   D     
Sbjct: 135 FCRFQMRPGIYEYNVVQANQFIVTIKDPKGCTIFQSLLSKCSTRDKSSDPDADPDGERTK 194

Query: 162 --------TTAAGIGSWDWNLKGDKSTYHALYPRAWT----VHEGEPDPELRIVCRQISP 209
                        + +W  N++  + +Y  LYPR+WT     H G     +R+ CRQ+SP
Sbjct: 195 CQLPNCSSRAKQPLSAWHSNIEDSRCSYTGLYPRSWTEYDLSHYG-----VRLTCRQVSP 249

Query: 210 IIPHNYKESSYPVSVFTYTIYNSGKTSADITLLFTWTNSVGGDSEFTGQHYNSKTKMNDG 269
           +IPH Y++SS P +VF +++ N       +++ F++ N  G   +       S+      
Sbjct: 250 VIPHEYRDSSLPCAVFVWSVENVCDQERKVSITFSFKNGTGNKKQDAEGGAESQLISEGN 309

Query: 270 VHAVLLHHRTSHQLPPVTFALAAQETDGVHVSLCPHFVISGNSLGLTAKDMWHEIKEHGS 329
              V +  + +    P ++ LA +    + ++ CP F  +GN      + +W ++KEHG 
Sbjct: 310 AKGVSIRQKIAEM--PCSYNLACRVLPEISITRCPQFDPAGN-----GEQLWAQLKEHGQ 362

Query: 330 FDRLNSMETSVTSEPGSSIGAAIAASVTVPPDSEGQVTFSLAWDCPEVNF-MSGKTYYRR 388
                + E   T +    IG A+ A V + P +   + F LAWD P++ F    +T+ R 
Sbjct: 363 LSEQPTSEALKTKD----IGVAVCAQVALKPMASHDLEFVLAWDMPKIQFPRKMQTHTRY 418

Query: 389 YTKFYGTHQNAAANIARDAILEHGSWELQIEAWQRPILEDKRLPEWYPITLFNELYYLNA 448
           YTK++    ++   I   A+ ++ +WE  I+AWQRPIL D+ LP+WY   +FN+LY+++ 
Sbjct: 419 YTKYFDASGDSGPKICEYALRQYSTWERLIDAWQRPILNDESLPDWYKCAIFNQLYFISD 478

Query: 449 GGAVW 453
           GG +W
Sbjct: 479 GGTIW 483



 Score =  122 bits (306), Expect = 1e-24,   Method: Compositional matrix adjust.
 Identities = 68/157 (43%), Positives = 94/157 (59%), Gaps = 3/157 (1%)

Query: 511 VALNSSFGTNLLQDGEE-NIGQFLYLEGIEYLMWNTYDVHFYSSFALIMLFPKIQLSIQR 569
           +  +SS G  L  D      G+F YLEG EY M+NTYDVHFY+S AL  L+P +Q+S+Q 
Sbjct: 484 LKCDSSLGKELTYDDPRLAYGRFGYLEGHEYRMYNTYDVHFYASPALAHLWPNLQVSLQY 543

Query: 570 DFAAAVMMHDPSKMKLLDDGQWVSRKVLGAVPHDIGICD--PWFEVNAYCLYDTARWKDL 627
           DF  A+        K+L DG+ + RKV   VPHD+G  D  P+  +N Y ++D   WKDL
Sbjct: 544 DFKDAIAAELSDTRKMLYDGKVMPRKVKNCVPHDLGDPDEEPFTLINCYNIHDVNDWKDL 603

Query: 628 NPKFVLQVYRDVVATGDKKFAKAVWPSVYVAMAYMDQ 664
           N KFVLQVYRD     +   A+A   S + ++ ++D+
Sbjct: 604 NTKFVLQVYRDYYVLNELAQAQADNASKFSSIEFIDK 640


>gi|194760205|ref|XP_001962332.1| GF15415 [Drosophila ananassae]
 gi|190616029|gb|EDV31553.1| GF15415 [Drosophila ananassae]
          Length = 894

 Score =  204 bits (518), Expect = 3e-49,   Method: Compositional matrix adjust.
 Identities = 111/263 (42%), Positives = 159/263 (60%), Gaps = 12/263 (4%)

Query: 647 FAKAVWPSVYVAMAYMDQFDRDGDGMIENDGFPDQTYDTWSVSGISAYSGGLWVAALQAA 706
           + KA++ S    M    ++D+D DG+IEN   PDQTYD+W + G S+Y  GLW+AALQA 
Sbjct: 633 YLKAMYASCKAIMERTIEYDKDNDGLIENTKMPDQTYDSWVMDGPSSYCSGLWLAALQAM 692

Query: 707 SALAREVGDRGSEDYFLFKFQKAK-VVYEKLWNGSYFNYDNSGSSQSSSIQADQLAGQWY 765
           SA+A  +        +    +K K  + EKLWNGSY+ +D S      +I ADQL G WY
Sbjct: 693 SAMATILDQPNDCMRYQDILEKGKRSLEEKLWNGSYYRFDLS-HGHRDTIMADQLCGHWY 751

Query: 766 ARACGL-LPIVDEDKARSALKKVYNYNVLKVMGGKRGAVNGMLPD-------GRVDMSSM 817
            ++CG    I  ++  R+ALK++Y+ NV+    G  GA NG + +       G VD S++
Sbjct: 752 LKSCGFDYEIYPKENVRTALKRIYDNNVMGFHDGNIGAANGFIANASDPSKPGHVDNSNI 811

Query: 818 QSREIWSGVTYAVAASMIHEDLADIGFQTACGIYEAAWSGTGLGYAFQTPEAWNTDDQYR 877
           QS E+W GV YA+AA+MI E + +  FQTA G+Y+      G+   F+TPEA   + +YR
Sbjct: 812 QSEEVWPGVVYALAATMIQEGMFEEAFQTAGGMYKTISQRIGMN--FETPEALYGEKRYR 869

Query: 878 SLCYMRPLAIWAMQWALTRPKPK 900
           S+ YMRPL+IW+MQ AL R + +
Sbjct: 870 SIGYMRPLSIWSMQVALERRRAQ 892



 Score =  198 bits (504), Expect = 1e-47,   Method: Compositional matrix adjust.
 Identities = 129/406 (31%), Positives = 206/406 (50%), Gaps = 45/406 (11%)

Query: 77  KGKRAFIDPFIKRHLTSSHGVPLGGVGSGSIGRSYRGEFQRWQIFPRVCEDKPVLANQFS 136
           +G+R ++D +   +    +GVP+GG+G G+IGR Y GEF R+Q+ P + E   V ANQF 
Sbjct: 41  EGRRVYMDYYYMENGKQIYGVPIGGIGGGTIGRGYAGEFCRFQMRPGIYEYNVVQANQFI 100

Query: 137 VFVSRSNGQK-YSSVL--C----------PKTPEV-----------LKDTTAAGIGSWDW 172
           V +    G   + S+L  C          P  P+                T   + +W  
Sbjct: 101 VTIKDPKGCTIFQSLLSKCSTRGSSGGGNPADPDADGEKTKCQLPNCSSRTKQPLSAWHS 160

Query: 173 NLKGDKSTYHALYPRAWT----VHEGEPDPELRIVCRQISPIIPHNYKESSYPVSVFTYT 228
           N++  +  Y  LYPR+WT     H G     +R+VCRQ+SP+IPH Y+ESS P +VF + 
Sbjct: 161 NIEDSRCQYTGLYPRSWTEYDLSHYG-----VRLVCRQVSPVIPHEYRESSLPCAVFVWD 215

Query: 229 IYNSGKTSADITLLFTWTNSVGGDSEFTGQHYNSKTKMNDGVHAVLLHHRTSHQLPPVTF 288
           + N       +++ FT+ N  G   +       S+         V +  + +    P ++
Sbjct: 216 VENVCDQERKVSITFTFKNGTGNKKQDAEGGAESQLISEGNAKGVAIRQKIAEM--PCSY 273

Query: 289 ALAAQETDGVHVSLCPHFVISGNSLGLTAKDMWHEIKEHGSFDRLNSMETSVTSEPGSSI 348
            LA +    + +S CP F  +GN      + +W ++KEHG  +   + ET  T +    I
Sbjct: 274 NLACRVRPEISISRCPQFDPAGN-----GEQLWAQLKEHGQLNEQPTPETLKTKD----I 324

Query: 349 GAAIAASVTVPPDSEGQVTFSLAWDCPEVNFMSG-KTYYRRYTKFYGTHQNAAANIARDA 407
           G A+ A V + P +  ++ F LAWD P++ F    +T+ R YTK++     +   I   A
Sbjct: 325 GVAVCAQVALKPKASHKLEFVLAWDMPKIQFPRKLQTHTRYYTKYFDDSGESGPKICEYA 384

Query: 408 ILEHGSWELQIEAWQRPILEDKRLPEWYPITLFNELYYLNAGGAVW 453
           + ++ SWE  I+AWQRPIL D+ LP+WY   +FN+LY+++ GG +W
Sbjct: 385 LKQYPSWERLIDAWQRPILNDETLPDWYKCAIFNQLYFISDGGTIW 430



 Score =  119 bits (299), Expect = 6e-24,   Method: Compositional matrix adjust.
 Identities = 63/137 (45%), Positives = 87/137 (63%), Gaps = 2/137 (1%)

Query: 530 GQFLYLEGIEYLMWNTYDVHFYSSFALIMLFPKIQLSIQRDFAAAVMMHDPSKMKLLDDG 589
           G+F YLEG EY M+NTYDVHFY+S AL  L+P +Q+S+Q DF  A+        K+L DG
Sbjct: 451 GRFGYLEGHEYRMYNTYDVHFYASPALAHLWPNLQVSLQYDFKDAIAADLTDTRKMLYDG 510

Query: 590 QWVSRKVLGAVPHDIGICD--PWFEVNAYCLYDTARWKDLNPKFVLQVYRDVVATGDKKF 647
           + + RKV   VPHD+G  D  P+  +N Y ++D   WKDLN KFVLQVYRD     +   
Sbjct: 511 KVMPRKVKNCVPHDLGDPDEEPFTLINCYNIHDVNDWKDLNTKFVLQVYRDYYVLNELAQ 570

Query: 648 AKAVWPSVYVAMAYMDQ 664
           A+A   S + ++ ++D+
Sbjct: 571 AQADNASKFSSIEFIDK 587


>gi|194860149|ref|XP_001969520.1| GG23916 [Drosophila erecta]
 gi|190661387|gb|EDV58579.1| GG23916 [Drosophila erecta]
          Length = 947

 Score =  203 bits (516), Expect = 4e-49,   Method: Compositional matrix adjust.
 Identities = 111/261 (42%), Positives = 158/261 (60%), Gaps = 12/261 (4%)

Query: 647 FAKAVWPSVYVAMAYMDQFDRDGDGMIENDGFPDQTYDTWSVSGISAYSGGLWVAALQAA 706
           + KA++      M    ++D+D DG+IEN   PDQTYD+W + G SAY  GLW+AALQA 
Sbjct: 686 YLKAMYAPCKAIMERTIEYDKDNDGLIENTKMPDQTYDSWVMDGPSAYCSGLWLAALQAM 745

Query: 707 SALAREVGDRGSEDYFLFKFQKAK-VVYEKLWNGSYFNYDNSGSSQSSSIQADQLAGQWY 765
           SA+A  +        +    +K K  + EKLWNGSY+ +D S  S   +I ADQL G WY
Sbjct: 746 SAMATILDQPNDCLRYQDILEKGKRSLDEKLWNGSYYRFDLS-HSHRDTIMADQLCGHWY 804

Query: 766 ARACGL-LPIVDEDKARSALKKVYNYNVLKVMGGKRGAVNGMLPD-------GRVDMSSM 817
            ++CG    I  ++  R+ALK++Y+ NV+    G  GA NG + +       G VD S++
Sbjct: 805 LKSCGFDYEIYPKENVRTALKRIYDNNVMGFHEGNIGAANGFIANAGEPTKPGHVDNSNI 864

Query: 818 QSREIWSGVTYAVAASMIHEDLADIGFQTACGIYEAAWSGTGLGYAFQTPEAWNTDDQYR 877
           Q+ E+W GV YA+AA+MI E + +  FQTA G+Y+      G+   F+TPEA   + +YR
Sbjct: 865 QAEEVWPGVVYALAATMIQEGMFEEAFQTAGGMYKTLSQRIGMN--FETPEALYGEKRYR 922

Query: 878 SLCYMRPLAIWAMQWALTRPK 898
           S+ YMRPL+IW+MQ AL R +
Sbjct: 923 SIGYMRPLSIWSMQVALERRR 943



 Score =  198 bits (503), Expect = 1e-47,   Method: Compositional matrix adjust.
 Identities = 128/425 (30%), Positives = 213/425 (50%), Gaps = 42/425 (9%)

Query: 55  KETVQLAPIGVRILCLIREEAAKGKRAFIDPFIKRHLTSSHGVPLGGVGSGSIGRSYRGE 114
           ++T+ + P+  R      + + +G+R ++D +   +    +GVP+GG+G G+IGR Y GE
Sbjct: 75  RQTLPMVPLVCRYAAYYWKVSREGRRVYMDYYYMENGKQIYGVPIGGIGGGTIGRGYAGE 134

Query: 115 FQRWQIFPRVCEDKPVLANQFSVFVSRSNGQKYSSVLCPK--TPEVLKDTTAAGIG---- 168
           F R+Q+ P + E   V ANQF V +    G      L  K  T +   D  A   G    
Sbjct: 135 FCRFQMRPGIYEYNVVQANQFIVTIKDPKGCTIFQSLLSKCSTRDKTSDPDADPDGERTK 194

Query: 169 ---------------SWDWNLKGDKSTYHALYPRAWT----VHEGEPDPELRIVCRQISP 209
                          +W  N++  + +Y  LYPR+WT     H G     +R++CRQ+SP
Sbjct: 195 CQLPNCSSRAKQPLSAWHSNIEDSRCSYTGLYPRSWTEYDLSHYG-----VRLICRQVSP 249

Query: 210 IIPHNYKESSYPVSVFTYTIYNSGKTSADITLLFTWTNSVGGDSEFTGQHYNSKTKMNDG 269
           +IPH Y+ESS P +VF +++ N       +++ F++ N  G   +       S+      
Sbjct: 250 VIPHEYRESSLPCAVFVWSVENVCDQERKVSITFSFKNGTGNKKQDAEGGAESQLISEGN 309

Query: 270 VHAVLLHHRTSHQLPPVTFALAAQETDGVHVSLCPHFVISGNSLGLTAKDMWHEIKEHGS 329
              V +  + +    P ++ LA +    + ++ CP F  +GN      + +W ++KEHG 
Sbjct: 310 AKGVSIRQKIAEM--PCSYNLACRVLPEISITRCPQFDPAGN-----GEPLWAQLKEHGQ 362

Query: 330 FDRLNSMETSVTSEPGSSIGAAIAASVTVPPDSEGQVTFSLAWDCPEVNF-MSGKTYYRR 388
                + E   T +    IG A+ A V + P +   + F LAWD P++ F    +T+ R 
Sbjct: 363 LSEQPTSEALKTKD----IGVAVCAQVALKPMASHDLEFVLAWDMPKIQFPRKMQTHTRY 418

Query: 389 YTKFYGTHQNAAANIARDAILEHGSWELQIEAWQRPILEDKRLPEWYPITLFNELYYLNA 448
           YTK++    +A   I   A+ ++ +WE  I+AWQRPIL D+ LP+WY   +FN+LY+++ 
Sbjct: 419 YTKYFDDSGDAGPKICEYALRQYSTWERLIDAWQRPILNDESLPDWYKCAIFNQLYFISD 478

Query: 449 GGAVW 453
           GG +W
Sbjct: 479 GGTIW 483



 Score =  121 bits (304), Expect = 2e-24,   Method: Compositional matrix adjust.
 Identities = 68/157 (43%), Positives = 94/157 (59%), Gaps = 3/157 (1%)

Query: 511 VALNSSFGTNLLQDGEE-NIGQFLYLEGIEYLMWNTYDVHFYSSFALIMLFPKIQLSIQR 569
           +  +SS G  L  D      G+F YLEG EY M+NTYDVHFY+S AL  L+P +Q+S+Q 
Sbjct: 484 LKCDSSLGKELAYDDPRLAYGRFGYLEGHEYRMYNTYDVHFYASPALAHLWPNLQVSLQY 543

Query: 570 DFAAAVMMHDPSKMKLLDDGQWVSRKVLGAVPHDIGICD--PWFEVNAYCLYDTARWKDL 627
           DF  A+        K+L DG+ + RKV   VPHD+G  D  P+  +N Y ++D   WKDL
Sbjct: 544 DFKDAIAAELGDTRKMLYDGKVMPRKVKNCVPHDLGDPDEEPFTLINCYNIHDVNDWKDL 603

Query: 628 NPKFVLQVYRDVVATGDKKFAKAVWPSVYVAMAYMDQ 664
           N KFVLQVYRD     +   A+A   S + ++ ++D+
Sbjct: 604 NTKFVLQVYRDYYVLNELAQAQADNASKFSSIEFIDK 640


>gi|87125468|ref|ZP_01081313.1| hypothetical protein RS9917_01806 [Synechococcus sp. RS9917]
 gi|86166768|gb|EAQ68030.1| hypothetical protein RS9917_01806 [Synechococcus sp. RS9917]
          Length = 832

 Score =  202 bits (515), Expect = 5e-49,   Method: Compositional matrix adjust.
 Identities = 127/383 (33%), Positives = 196/383 (51%), Gaps = 25/383 (6%)

Query: 530 GQFLYLEGIEYLMWNTYDVHFYSSFALIMLFPKIQLSIQRDFAAAVMMHDPSKMKL---L 586
           G+F  LE I+Y  + + DV  Y S AL+ L+P++  ++ R FA A+   D ++  +    
Sbjct: 442 GRFGVLECIDYAWYESLDVRLYGSLALLQLWPELDKAVLRSFARAIPAADATQRPIGWYF 501

Query: 587 DDGQW---VSRKVLGAVPHDIGICD--PWFEVNAYCLYDTARWKDLNPKFVLQVYR--DV 639
             G+      RKV GA PHD+G  +  PW   N     D   WKDL   +VLQV+R   +
Sbjct: 502 TQGKGRVEADRKVKGATPHDLGAPNENPWDATNYTAYQDCNLWKDLASDYVLQVWRTFKL 561

Query: 640 VATG-DKKFAKAVWPSVYVAMAYMDQFDRDGDGMIENDGFPDQTYDTWSVSGISAYSGGL 698
             +G D  F    WP+   A+ Y+  FD + DG+ +N G PDQT+D W + G+SAY G L
Sbjct: 562 SPSGEDINFLAECWPAAVEALTYLKGFDTNNDGLPDNGGAPDQTFDDWPLKGVSAYCGAL 621

Query: 699 WVAALQAASALAREVG-----DRGSEDY-FLFKFQKAKVVYEK-LWNGSYFNYDNSGSSQ 751
           W+AAL+AA A+ +++      D  +E + F    ++++  ++  LWNG Y++ D    S 
Sbjct: 622 WIAALEAALAMGQQLQLATGLDTAAEQHTFSTWLEQSRANFDTLLWNGEYYDID--AESG 679

Query: 752 SSSIQADQLAGQWYARACGLLPIVDEDKARSALKKVYNYNVLKVMGGKRGAVNGMLPDGR 811
           +  + ADQL G +YAR   L  +V +   RS LK V         GG  G  NG+  DG 
Sbjct: 680 TPVVMADQLCGDFYARLLKLPAVVSDANCRSTLKAVKESCFEAFQGGSLGVANGLRRDGT 739

Query: 812 -VDMSSMQSREIWSGVTYAVAASMIHEDLADIGFQTACGIYEAAWSGTGLGYAFQTPEAW 870
            +D       E+W+G+ + +A+     D           +    ++G   G  F+TPEA 
Sbjct: 740 PLDPQGTHPLEVWTGINFGIASYYRLMDDTSTALAITSAVVNQVYAG---GLQFRTPEAI 796

Query: 871 NTDDQYRSLCYMRPLAIWAMQWA 893
              + +R+  Y+R +AIW + WA
Sbjct: 797 TAVNTFRACHYLRAMAIWGL-WA 818



 Score =  168 bits (426), Expect = 1e-38,   Method: Compositional matrix adjust.
 Identities = 116/408 (28%), Positives = 188/408 (46%), Gaps = 44/408 (10%)

Query: 85  PFIKRHLTS-----SHGVPLGGVGSGSIGRSYRGEFQRWQIFPRVCEDKPVLANQFSVFV 139
           P+  R+ ++     +HG+PLGG G+G  GR+  G F  W +         +   QF++F 
Sbjct: 40  PYTVRYASNLDDGPNHGMPLGGFGAGCFGRAPDGSFNLWHLDGGEHWFGALPDCQFALFE 99

Query: 140 SRSNGQKYSSVLCPKTPEVLKDTTAAGIGSWDW----NLKGDKSTYHALYPRAWTVHEGE 195
              +  +  ++      +  +      + +WDW      +    TY A YP +WT ++G 
Sbjct: 100 RNGSSTRAHALAVKPEADASRPDGGEPLAAWDWYPASTPERSTGTYAARYPLSWTHYDGV 159

Query: 196 PDPELRIVCRQISPIIPHNYKESSYPVSVFTYTIYNSGKTSADITLLFTWTNSVG----- 250
            D E+R  C   SPI+P +Y+ +SYPV+VF +T+ N      D++LL +W N+ G     
Sbjct: 160 YDAEVR--CEAFSPILPGDYQRTSYPVAVFVWTLRNPTPKPLDLSLLLSWRNTSGWFTNT 217

Query: 251 --------GDSEFTGQHYNSKTKMNDG----------VHAVLLHHRTSHQLPPV--TFAL 290
                    D      +Y       DG          +  V+L    S+ +      + +
Sbjct: 218 DASAEVHFRDDGSPEHNYAPAIGTTDGQRNRWIDDGNLKGVVLEGNVSNPIAEGEGQWCI 277

Query: 291 AAQETDGVHVSLCPHFVISGNSLGLTAKDMWHEIKEHGSFDRLNSMETSVTSEPGSSIGA 350
           A  E  GV +  C  +   G+       ++W   +  GS    N+   S  ++P   + A
Sbjct: 278 ATAEQPGVTIQRCSRWNPHGD-----GNELWSSFRADGSIPDSNNDRRSGHNDP---LSA 329

Query: 351 AIAASVTVPPDSEGQVTFSLAWDCPEVNFMSGKTYYRRYTKFYGTHQNAAANIARDAILE 410
           A+A    + P    ++   ++WD P   F SG    RRYT F+G   + AA IA +A+ +
Sbjct: 330 ALAVQCQLAPGQSLEIPVVISWDLPVTGFASGSQALRRYTDFFGAEGHQAAAIAAEALRD 389

Query: 411 HGSWELQIEAWQRPILEDKRLPEWYPITLFNELYYLNAGGAVWTDGSP 458
              W  QIEAWQ+P+LE   LPE   + LFNELY L +GG++W+  +P
Sbjct: 390 WRDWRQQIEAWQQPVLERSDLPEPVRMALFNELYDLCSGGSLWSAATP 437


>gi|195579186|ref|XP_002079443.1| GD23959 [Drosophila simulans]
 gi|194191452|gb|EDX05028.1| GD23959 [Drosophila simulans]
          Length = 947

 Score =  200 bits (508), Expect = 3e-48,   Method: Compositional matrix adjust.
 Identities = 110/263 (41%), Positives = 159/263 (60%), Gaps = 12/263 (4%)

Query: 647 FAKAVWPSVYVAMAYMDQFDRDGDGMIENDGFPDQTYDTWSVSGISAYSGGLWVAALQAA 706
           + KA++ S    +    ++D+D DG+IEN   PDQTYD+W + G SAY  GLW+AALQA 
Sbjct: 686 YLKAMYASCKAILERTIEYDKDNDGLIENTKMPDQTYDSWVMDGPSAYCSGLWLAALQAM 745

Query: 707 SALAREVGDRGSEDYFLFKFQKAK-VVYEKLWNGSYFNYDNSGSSQSSSIQADQLAGQWY 765
           SA+A  +        +    +K K  + EKLWNGSY+ +D S  S   +I ADQL G WY
Sbjct: 746 SAMATILDQPNDCLRYQDILEKGKRSLEEKLWNGSYYRFDLS-HSHRDTIMADQLCGHWY 804

Query: 766 ARACGL-LPIVDEDKARSALKKVYNYNVLKVMGGKRGAVNGMLPD-------GRVDMSSM 817
            ++CG    I  ++  R+ALK++Y+ NV+    G  GA NG + +       G VD S++
Sbjct: 805 LKSCGFDYEIYPKENVRTALKRIYDNNVMGFHEGNIGAANGFIANAGEPTKPGHVDNSNI 864

Query: 818 QSREIWSGVTYAVAASMIHEDLADIGFQTACGIYEAAWSGTGLGYAFQTPEAWNTDDQYR 877
           Q+ E+W GV YA+AA+MI E + +  FQTA G+Y+      G+   F+TPEA   + +YR
Sbjct: 865 QAEEVWPGVVYALAATMIQEGMFEEAFQTAGGMYKTLSQRIGMN--FETPEALYGEKRYR 922

Query: 878 SLCYMRPLAIWAMQWALTRPKPK 900
           S+ YMRPL+I +MQ AL R + +
Sbjct: 923 SIGYMRPLSILSMQVALERRRAQ 945



 Score =  197 bits (501), Expect = 2e-47,   Method: Compositional matrix adjust.
 Identities = 129/425 (30%), Positives = 212/425 (49%), Gaps = 42/425 (9%)

Query: 55  KETVQLAPIGVRILCLIREEAAKGKRAFIDPFIKRHLTSSHGVPLGGVGSGSIGRSYRGE 114
           ++T+ L P+  R      + + +G+R ++D +   +    +GVP+GG+G G+IGR Y GE
Sbjct: 75  RQTLPLVPLVCRYAAYYWKVSREGRRVYMDYYYMENGKQIYGVPIGGIGGGTIGRGYAGE 134

Query: 115 FQRWQIFPRVCEDKPVLANQFSVFVSRSNGQKYSSVLCPK--TPEVLKDTTAAGIG---- 168
           F R+Q+ P + E   V ANQF V +    G      L  K  T +   D  A   G    
Sbjct: 135 FCRFQMRPGIYEYNVVQANQFIVTIKDPKGCTIFQSLLSKCSTRDKTSDPDADPDGERTK 194

Query: 169 ---------------SWDWNLKGDKSTYHALYPRAWT----VHEGEPDPELRIVCRQISP 209
                          +W  N++  + +Y  LYPR+WT     H G     +R+ CRQ+SP
Sbjct: 195 CQLPNCSSRAKQPLSAWHSNIEDSRCSYTGLYPRSWTEYDLSHYG-----VRLTCRQVSP 249

Query: 210 IIPHNYKESSYPVSVFTYTIYNSGKTSADITLLFTWTNSVGGDSEFTGQHYNSKTKMNDG 269
           +IPH Y+ESS P +VF +++ N       +++ FT+ N  G   +       S+      
Sbjct: 250 VIPHEYRESSQPCAVFVWSVENVCDQERKVSITFTFKNGTGNKKQDAEGGAESQLISEGN 309

Query: 270 VHAVLLHHRTSHQLPPVTFALAAQETDGVHVSLCPHFVISGNSLGLTAKDMWHEIKEHGS 329
              V +  + +    P ++ LA +    + ++ CP F  +GN      + +W ++KEHG 
Sbjct: 310 AKGVSIRQKIADM--PCSYNLACRVLPEISITRCPQFDPAGN-----GEQLWAQLKEHGQ 362

Query: 330 FDRLNSMETSVTSEPGSSIGAAIAASVTVPPDSEGQVTFSLAWDCPEVNF-MSGKTYYRR 388
                + E   T +    IG A+ A V + P +   + F LAWD P++ F    +T+ R 
Sbjct: 363 LSEQPTSEALKTKD----IGVAVCAQVALKPMASHDLEFVLAWDMPKIQFPRKMQTHTRY 418

Query: 389 YTKFYGTHQNAAANIARDAILEHGSWELQIEAWQRPILEDKRLPEWYPITLFNELYYLNA 448
           YTK++    ++   I   A+ ++ +WE  I+AWQRPIL D+ LP+WY   +FN+LY+++ 
Sbjct: 419 YTKYFDDSGDSGPRICEYALRQYSTWERLIDAWQRPILNDETLPDWYKCAIFNQLYFISD 478

Query: 449 GGAVW 453
           GG +W
Sbjct: 479 GGTIW 483



 Score =  121 bits (303), Expect = 2e-24,   Method: Compositional matrix adjust.
 Identities = 68/157 (43%), Positives = 93/157 (59%), Gaps = 3/157 (1%)

Query: 511 VALNSSFGTNL-LQDGEENIGQFLYLEGIEYLMWNTYDVHFYSSFALIMLFPKIQLSIQR 569
           +   SS G  L   D     G+F YLEG EY M+NTYDVHFY+S AL  L+P +Q+S+Q 
Sbjct: 484 LKCESSLGKELSYDDPRLAYGRFGYLEGHEYRMYNTYDVHFYASPALAHLWPNLQVSLQY 543

Query: 570 DFAAAVMMHDPSKMKLLDDGQWVSRKVLGAVPHDIGICD--PWFEVNAYCLYDTARWKDL 627
           DF  A+        K+L DG+ + RKV   VPHD+G  D  P+  +N Y ++D   WKDL
Sbjct: 544 DFKDAIAAELNDTRKMLYDGKVMPRKVKNCVPHDLGDPDEEPFTLINCYNIHDVNDWKDL 603

Query: 628 NPKFVLQVYRDVVATGDKKFAKAVWPSVYVAMAYMDQ 664
           N KFVLQVYRD     +   A+A   S + ++ ++D+
Sbjct: 604 NTKFVLQVYRDYYVLNELAQAQADNASKFSSIEFIDK 640


>gi|85857488|gb|ABC86280.1| RE08574p [Drosophila melanogaster]
          Length = 544

 Score =  199 bits (505), Expect = 8e-48,   Method: Compositional matrix adjust.
 Identities = 127/425 (29%), Positives = 211/425 (49%), Gaps = 42/425 (9%)

Query: 55  KETVQLAPIGVRILCLIREEAAKGKRAFIDPFIKRHLTSSHGVPLGGVGSGSIGRSYRGE 114
           ++T+ + P+  R      + + +G+R ++D +   +    +GVP+GG+G G+IGR Y GE
Sbjct: 76  RQTLPMVPLVCRYAAYYWKVSREGRRVYMDYYYMENGKQIYGVPIGGIGGGTIGRGYAGE 135

Query: 115 FQRWQIFPRVCEDKPVLANQFSVFVSRSNGQKYSSVLCPKTPEVLKDTTAAG-------- 166
           F R+Q+ P + E   VLANQF V +    G      L  K     K +   G        
Sbjct: 136 FCRFQMRPGIYEYNVVLANQFIVTIKDPKGCTIFQSLLSKCSTRDKTSDPDGDPDGERTK 195

Query: 167 -------------IGSWDWNLKGDKSTYHALYPRAWT----VHEGEPDPELRIVCRQISP 209
                        + +W  N++  + +Y  LYPR+WT     H G     +R+ CRQ+SP
Sbjct: 196 CQLPNCSSRAKQPLSAWHSNIEDTRCSYTGLYPRSWTEYDLSHYG-----VRLTCRQVSP 250

Query: 210 IIPHNYKESSYPVSVFTYTIYNSGKTSADITLLFTWTNSVGGDSEFTGQHYNSKTKMNDG 269
           +IPH Y+ESS P +VF +++ N       +++ FT+ N  G   +       S+      
Sbjct: 251 VIPHEYRESSLPCAVFVWSVENVCDQERKVSITFTFKNGTGNKKQDAEGGAESQLISEGN 310

Query: 270 VHAVLLHHRTSHQLPPVTFALAAQETDGVHVSLCPHFVISGNSLGLTAKDMWHEIKEHGS 329
              V +  + S    P ++ LA +    + ++ CP F  +GN      + +W ++KEHG 
Sbjct: 311 AKGVSIRQKISEM--PCSYNLACRVLPEISITRCPQFDPAGN-----GEQLWAQLKEHGQ 363

Query: 330 FDRLNSMETSVTSEPGSSIGAAIAASVTVPPDSEGQVTFSLAWDCPEVNF-MSGKTYYRR 388
                + E   T +    IG A+   V + P +   + F LAWD P++ F    +T+ R 
Sbjct: 364 LSEHPTSEALKTKD----IGVAVCGQVALKPMASHDLEFVLAWDMPKIQFPRKMQTHTRY 419

Query: 389 YTKFYGTHQNAAANIARDAILEHGSWELQIEAWQRPILEDKRLPEWYPITLFNELYYLNA 448
           YTK++    ++   I   A+ ++ +WE  I+AWQRPIL D+ LP+WY   +FN+LY+++ 
Sbjct: 420 YTKYFDDSGDSGPRICEYALRQYSTWERLIDAWQRPILNDETLPDWYKCAIFNQLYFISD 479

Query: 449 GGAVW 453
           GG +W
Sbjct: 480 GGTIW 484


>gi|411117353|ref|ZP_11389840.1| putative bile acid beta-glucosidase [Oscillatoriales cyanobacterium
           JSC-12]
 gi|410713456|gb|EKQ70957.1| putative bile acid beta-glucosidase [Oscillatoriales cyanobacterium
           JSC-12]
          Length = 907

 Score =  199 bits (505), Expect = 8e-48,   Method: Compositional matrix adjust.
 Identities = 128/409 (31%), Positives = 192/409 (46%), Gaps = 70/409 (17%)

Query: 95  HGVPLGGVGSGSIGRSYRGEFQRWQIFPRVCEDKPVLANQFSVFVS-RSNGQKYSSVLCP 153
           HG+PLGG G+G IGRS++G+F  W I       +P+ A QFS+F     + Q Y+  LC 
Sbjct: 41  HGMPLGGFGAGCIGRSHQGDFNLWHIDGGEHVFQPMPACQFSIFEQVEQDAQAYA--LCT 98

Query: 154 KTPEVLKDTTAAGIGSWDWN--------------------LKGDKSTYHALYPRAWTVHE 193
           + P      T   + +W W                     L     TYHALYPR+W  ++
Sbjct: 99  EAP------TDGSLAAWSWYPTVEDADELGETSLDQARSLLSPHTGTYHALYPRSWFTYQ 152

Query: 194 GEPDPELRIVCRQISPIIPHNYKESSYPVSVFTYTIYNSGKTSADITLLFTWTNSVGGDS 253
                +L   C Q SPI+PHNY+E+SYPV++F +T++N       ++L+ TW N  G   
Sbjct: 153 NVFLADL--TCEQFSPILPHNYQETSYPVAIFEWTVHNPTNQPLTLSLMLTWQNMAGW-- 208

Query: 254 EFTGQHYNSKTKMND-------------------------GVH-AVLLHHRTSHQLPP-- 285
            FT    +   +M D                         G H A+++    + +L    
Sbjct: 209 -FTNASKSPDVQMRDDGSPVYDYQPRLYESAGNWNQIVKEGQHVAIVMAGARNQELEEGD 267

Query: 286 VTFALAAQETDGVHVSLCPHFVISGNSLGLTAKDMWHEIKEHGSFDRLNSMETSVTSEPG 345
             +A+A +      V    H+   G+       D+W    ++GS   +++   +   E  
Sbjct: 268 GQWAIAVKLPTQAEVFYHTHWNPVGD-----GADIWQSFSQNGSLTNVSNNTPAAKDE-- 320

Query: 346 SSIGAAIAASVTVPPDSEGQVTFSLAWDCPEVNFMSGKTYYRRYTKFYGTHQNAAANIAR 405
             IG AIA    + P    Q+ F L+WD P + F  G TY++RYT F+      A  IA+
Sbjct: 321 -QIGGAIAVRFMLQPGETWQIPFCLSWDFPVIEFAEGVTYFKRYTDFFDRSGQNAWTIAQ 379

Query: 406 DAILEHGSWELQIEAWQRPILEDKRLPEWYPITLFNELYYLNAGGAVWT 454
            A+  + +W+ QI  WQ+PILE   LP+W+ + LFNELY L  GG +WT
Sbjct: 380 TALNHYKTWQEQIHTWQQPILERNDLPDWFKMALFNELYNLTDGGTLWT 428



 Score =  177 bits (449), Expect = 2e-41,   Method: Compositional matrix adjust.
 Identities = 107/295 (36%), Positives = 159/295 (53%), Gaps = 38/295 (12%)

Query: 529 IGQFLYLEGIEYLMWNTYDVHFYSSFALIMLFPKIQLSIQRDFAAAVMMHDPSKMKLLD- 587
           +GQF  LE I+Y  + + DV  Y SFAL+MLFP+++ ++ R FA A+   D  + +++  
Sbjct: 436 VGQFAVLECIDYRWYESLDVRLYGSFALLMLFPELEKAVIRAFARAIPASD-DRTRIIGY 494

Query: 588 ------DGQWVSRKVLGAVPHDIGICDP--WFEVNAYCLYDTARWKDLNPKFVLQVYRDV 639
                 +     RKV GA PHD+G  +   W + N     D   WKDL   FVLQVYRD 
Sbjct: 495 YYTIGAESPQAVRKVAGATPHDLGAPNEHVWQKTNYTSYQDCNLWKDLPCDFVLQVYRDF 554

Query: 640 VATGDKK--FAKAVWPSVYVAMAYMDQFDRDGDGMIENDGFPDQTYDTWSVSGISAYSGG 697
           + TG K   F +  W ++   + Y+  FD D DG+ EN G PDQT+D W + GISAY GG
Sbjct: 555 LYTGSKDIAFLQECWSAIAQTLHYLKTFDYDFDGIPENSGAPDQTFDDWKLDGISAYCGG 614

Query: 698 LW-----------------VAALQAASALAREVGDRGSEDYFLFKFQKAKVVY-EKLWNG 739
           LW                    ++  + L+ ++G+      F    ++A+ +Y + LWNG
Sbjct: 615 LWIAALEAALAIGRILAEYTTNMRQLATLSNDIGN------FQVWLEQARSIYHDTLWNG 668

Query: 740 SYFNYDNSGSSQSSSIQADQLAGQWYARACGLLPIVDEDKARSALKKVYNYNVLK 794
            Y+  D+  +S S  + ADQL GQ+YAR   L  +VD + A++ALK +Y+   +K
Sbjct: 669 QYYRLDS--NSCSEVVMADQLCGQFYARLLKLPDVVDLNCAQTALKTIYDTCFIK 721



 Score = 54.7 bits (130), Expect = 3e-04,   Method: Compositional matrix adjust.
 Identities = 28/91 (30%), Positives = 49/91 (53%), Gaps = 4/91 (4%)

Query: 801 GAVNGMLPDGRVDM-SSMQSREIWSGVTYAVAASMIHEDLADIGFQTACGIYEAAWSGTG 859
           GA NG+ PDG  +  +S    E+W+G+ + +AA ++   + D   +    + +  ++   
Sbjct: 811 GAANGVKPDGSPENPNSTHPLEVWTGINFGLAAFLVQMGMRDEAMELTQAVVQQIYTN-- 868

Query: 860 LGYAFQTPEAWNTDDQYRSLCYMRPLAIWAM 890
            G  F+TPEA      +R+  Y+R LAIWA+
Sbjct: 869 -GLQFRTPEAITPSGTFRACHYLRALAIWAI 898


>gi|148240031|ref|YP_001225418.1| hypothetical protein SynWH7803_1695 [Synechococcus sp. WH 7803]
 gi|147848570|emb|CAK24121.1| Conserved hypothetical protein [Synechococcus sp. WH 7803]
          Length = 833

 Score =  197 bits (500), Expect = 3e-47,   Method: Compositional matrix adjust.
 Identities = 138/388 (35%), Positives = 210/388 (54%), Gaps = 25/388 (6%)

Query: 525 GEENIGQFLYLEGIEYLMWNTYDVHFYSSFALIMLFPKIQLSIQRDFAAAVMMHDPSKMK 584
           G++  G+F  LE ++Y  + + DV  Y SFAL+ L+P++  ++ R FA A+   D ++  
Sbjct: 438 GKDPHGRFGVLECLDYAWYESLDVRLYGSFALLQLWPELDKAVLRSFARAIPAADATQRP 497

Query: 585 L---LDDGQW---VSRKVLGAVPHDIGICD--PWFEVNAYCLYDTARWKDLNPKFVLQVY 636
           +      G+      RKV GA PHD+G  +  P+   N     D   WKDL   FVLQV+
Sbjct: 498 IGWYFTQGRGRVEADRKVKGATPHDLGAPNEVPFDATNYTAYQDCNLWKDLASDFVLQVW 557

Query: 637 R--DVVATG-DKKFAKAVWPSVYVAMAYMDQFDRDGDGMIENDGFPDQTYDTWSVSGISA 693
           R   +  TG D KF    WP+   A+ Y+ +FD + DG+ +N G PDQT+D W + G+SA
Sbjct: 558 RTFKLAPTGEDLKFLADCWPAAVQALHYLKRFDVNNDGLPDNGGAPDQTFDDWPLKGVSA 617

Query: 694 YSGGLWVAALQAASA----LAREVG-DRGSEDY-FLFKFQKAKVVYEK-LWNGSYFNYDN 746
           Y G LW+AAL+AA A    L +E+G D GS+ + F    ++++  +++ LWNG Y++ D 
Sbjct: 618 YCGALWIAALEAALAMAQRLQQELGLDTGSDQHTFSAWLEQSRSNFDRLLWNGEYYDID- 676

Query: 747 SGSSQSSSIQADQLAGQWYARACGLLPIVDEDKARSALKKVYNYNVLKVMGGKRGAVNGM 806
              S +  + ADQL G +YAR  GL P+V E  +RS LK V      K  GG  G  NG+
Sbjct: 677 -AESGTPVVMADQLCGDFYARLLGLEPVVSEANSRSTLKAVKESCFEKFQGGSLGVANGL 735

Query: 807 LPDGR-VDMSSMQSREIWSGVTYAVAASMIHEDLADIGFQTACGIYEAAWSGTGLGYAFQ 865
             DG  +D +     E+W+G+ + +A+                 + +  +SG   G  F+
Sbjct: 736 RRDGTPLDPNGTHPLEVWTGINFGIASYYRLMGEGQTAEAICSAVVDQVYSG---GLQFR 792

Query: 866 TPEAWNTDDQYRSLCYMRPLAIWAMQWA 893
           TPEA    + +R+  Y+R +AIW + WA
Sbjct: 793 TPEAITAVNTFRACHYLRAMAIWGL-WA 819



 Score =  179 bits (453), Expect = 8e-42,   Method: Compositional matrix adjust.
 Identities = 122/393 (31%), Positives = 190/393 (48%), Gaps = 40/393 (10%)

Query: 95  HGVPLGGVGSGSIGRSYRGEFQRWQIFPRVCEDKPVLANQFSVFVSRSNGQKYSSVLCPK 154
           HG+PLGG G+G IGRS RG+F  W +         +   QF++F S  + Q+  ++    
Sbjct: 55  HGMPLGGFGAGCIGRSSRGDFNLWHLDGGEHWYGTIPDCQFALFESNGSSQRAHALAVKP 114

Query: 155 TPEVLKDTTAAGIGSWDW----NLKGDKSTYHALYPRAWTVHEGEPDPELRIVCRQISPI 210
             +  +  +A  + +WDW      +    TY A YP +WT +EG  D E  + C   SPI
Sbjct: 115 EADASRPNSAEPLAAWDWYPASTPERSTGTYAARYPLSWTHYEGVYDAE--VSCEAFSPI 172

Query: 211 IPHNYKESSYPVSVFTYTIYNSGKTSADITLLFTWTNSVG----------------GDSE 254
           +P +Y+ +SYPV+VF +T+ N      D++LL +W N+VG                G  E
Sbjct: 173 LPGDYQRTSYPVAVFRWTLRNPSDQPLDLSLLLSWRNTVGWFFNTDPSAEVHFRDDGSPE 232

Query: 255 FT-----GQHYNSKTKMND--GVHAVLLHHRTSHQLPPVT--FALAAQET-DGVHVSLCP 304
                  GQ    + +  D  GV  ++L  + S  +      + LA  +  +GV V  C 
Sbjct: 233 HNYAPAIGQGEGQRNRWLDQEGVRGIVLEGQRSEPIGEGQGQWCLAVPDALEGVEVLRCS 292

Query: 305 HFVISGNSLGLTAKDMWHEIKEHGSFDRLNSMETSVTSEPGSSIGAAIAASVTVPPDSEG 364
            +  +G+      +++W      G   R+ +     +S  G    AAIA   T+ P    
Sbjct: 293 RWDPTGD-----GQELWQSFAAEG---RIPNSNNDRSSRAGEQASAAIALKFTLAPGESR 344

Query: 365 QVTFSLAWDCPEVNFMSGKTYYRRYTKFYGTHQNAAANIARDAILEHGSWELQIEAWQRP 424
           ++  +++WD P   F +G    RRYT F+G     AA +A +A+ +   W  QI+AWQ P
Sbjct: 345 EIPIAISWDLPVTAFATGVRDLRRYTDFFGADGCNAAALAAEALRDWRQWRDQIDAWQAP 404

Query: 425 ILEDKRLPEWYPITLFNELYYLNAGGAVWTDGS 457
           +L  + LPE   + LFNELY L +GG++WT  S
Sbjct: 405 VLARQELPEALRMALFNELYDLASGGSLWTAAS 437


>gi|318042148|ref|ZP_07974104.1| hypothetical protein SCB01_10580 [Synechococcus sp. CB0101]
          Length = 836

 Score =  195 bits (496), Expect = 8e-47,   Method: Compositional matrix adjust.
 Identities = 134/387 (34%), Positives = 205/387 (52%), Gaps = 25/387 (6%)

Query: 526 EENIGQFLYLEGIEYLMWNTYDVHFYSSFALIMLFPKIQLSIQRDFAAAVMMHDPSKMKL 585
           E+ +G+F  LE ++Y  + + DV  Y S AL+ L+P++  ++ R FA A+   DPS   +
Sbjct: 442 EDPVGRFGVLECLDYAWYESLDVRLYGSLALLQLWPELDKAVLRSFARAIPAADPSPRPI 501

Query: 586 ---LDDGQW---VSRKVLGAVPHDIGICD--PWFEVNAYCLYDTARWKDLNPKFVLQVYR 637
                 G+     +RKV GA PHD+G  +  PW   N     D   WKDL   +VLQV+R
Sbjct: 502 GWYFTQGKGRVEAARKVKGATPHDLGAPNERPWDATNYTAYQDCNLWKDLASDYVLQVWR 561

Query: 638 --DVVATG-DKKFAKAVWPSVYVAMAYMDQFDRDGDGMIENDGFPDQTYDTWSVSGISAY 694
              +   G D  F    WP+   A+ Y+  FD + DG+ +N G PDQT+D W + G+SAY
Sbjct: 562 TFKLAPNGEDINFLAECWPAAVEALHYLKGFDANNDGLPDNGGAPDQTFDDWPLKGVSAY 621

Query: 695 SGGLWVAALQAASALAREVG-----DRGSEDY-FLFKFQKAKVVYEK-LWNGSYFNYDNS 747
            G LW+AAL+AA A+A+++      D  SE + F    ++++  ++  LWNG Y++ D  
Sbjct: 622 CGALWIAALEAALAIAQQLQLSTGLDTASEQHTFSSWLEQSRANFDHLLWNGEYYDID-- 679

Query: 748 GSSQSSSIQADQLAGQWYARACGLLPIVDEDKARSALKKVYNYNVLKVMGGKRGAVNGML 807
             S +  + ADQL G +YAR  GL  +V E  +RS LK V      +  GGK G  NG+ 
Sbjct: 680 AESGTPVVMADQLCGDFYARLLGLPAVVSEANSRSTLKAVKEACFEQFEGGKLGVANGLR 739

Query: 808 PDGR-VDMSSMQSREIWSGVTYAVAASMIHEDLADIGFQTACGIYEAAWSGTGLGYAFQT 866
            DG  +D +     E+W+G+ + +A+        D        +    ++G   G  F+T
Sbjct: 740 RDGTPLDPNGTHPLEVWTGINFGIASYYRLMGDTDTALAITAAVVNQVYAG---GLQFRT 796

Query: 867 PEAWNTDDQYRSLCYMRPLAIWAMQWA 893
           PEA    + +R+  Y+R +AIW + WA
Sbjct: 797 PEAITAVNTFRACHYLRAMAIWGV-WA 822



 Score =  177 bits (448), Expect = 3e-41,   Method: Compositional matrix adjust.
 Identities = 124/412 (30%), Positives = 198/412 (48%), Gaps = 54/412 (13%)

Query: 85  PFIKRHLTS-----SHGVPLGGVGSGSIGRSYRGEFQRWQIFPRVCEDKPVLANQFSVFV 139
           P+  R+ ++     +HG+PLGG G+G IGR+  G F  W +         +   QF++F 
Sbjct: 44  PYTVRYASNLDDGPNHGMPLGGFGAGCIGRAPDGSFNLWHLDGGEHWFGVLPDCQFALF- 102

Query: 140 SRSNGQKYSSVLCPKTPE--VLKDTTAAGIGSWDW----NLKGDKSTYHALYPRAWTVHE 193
             SNGQ+  +      PE    +    A + SW W      +    TY A YP +W+ ++
Sbjct: 103 -ESNGQQSRAHALAVQPERDASRPDAGAPLPSWSWYPASTAERTTGTYAARYPLSWSHYQ 161

Query: 194 GEPDPELRIVCRQISPIIPHNYKESSYPVSVFTYTIYNSGKTSADITLLFTWTNSVG--- 250
           G  D E  + C+  SPI+P NY+++SYP++VF +T++N  +   D++LL +W N+ G   
Sbjct: 162 GVFDAE--VSCQAFSPILPGNYQQTSYPLAVFVWTLHNPTRKPLDLSLLLSWRNTSGWFT 219

Query: 251 -------------GDSEFT-----GQHYNSKTK-MNDG-VHAVLLHHRTSHQLPPVT--- 287
                        G  E       G     + + ++DG +  V+L    S+   PV    
Sbjct: 220 NTDSSAEVHFRDDGSPEHNYAPAIGTTEGQRNRWVDDGPLKGVVLERDVSN---PVAEGE 276

Query: 288 --FALAAQETDGVHVSLCPHFVISGNSLGLTAKDMWHEIKEHGSFDRLNSMETSVTSEPG 345
             + +A  E  GV +  C  +   G+       ++W     +GS    N+   S  S+P 
Sbjct: 277 GQWCIATGEQPGVRIQRCSRWNPHGD-----GSELWSSFSANGSIPDSNNDRRSSQSDPA 331

Query: 346 SSIGAAIAASVTVPPDSEGQVTFSLAWDCPEVNFMSGKTYYRRYTKFYGTHQNAAANIAR 405
           S   AA+A    + P    ++   ++WD P   F +G +  RRYT F+G+    A  IA 
Sbjct: 332 S---AALAVQCRLEPGQSIEIPVVISWDLPVTAFATGTSALRRYTDFFGSSGGNATAIAA 388

Query: 406 DAILEHGSWELQIEAWQRPILEDKRLPEWYPITLFNELYYLNAGGAVWTDGS 457
           + + +   W  QIEAWQ+P+LE   LPE   + LFNELY L +GG++W+  S
Sbjct: 389 EGLRDWRRWREQIEAWQQPVLERSDLPEPLRMALFNELYDLCSGGSLWSAAS 440


>gi|87301122|ref|ZP_01083963.1| hypothetical protein WH5701_14591 [Synechococcus sp. WH 5701]
 gi|87284090|gb|EAQ76043.1| hypothetical protein WH5701_14591 [Synechococcus sp. WH 5701]
          Length = 830

 Score =  192 bits (489), Expect = 7e-46,   Method: Compositional matrix adjust.
 Identities = 135/386 (34%), Positives = 206/386 (53%), Gaps = 25/386 (6%)

Query: 527 ENIGQFLYLEGIEYLMWNTYDVHFYSSFALIMLFPKIQLSIQRDFAAAVMMHDPSKMKL- 585
           + +G+F  LE ++Y  + + DV  Y SFAL+ L+P++  ++ R FA A+   D +   + 
Sbjct: 436 DPVGRFGVLECLDYAWYESLDVRLYGSFALLQLWPELDKAVLRSFARAIPAADATPRPIG 495

Query: 586 --LDDGQW---VSRKVLGAVPHDIGICD--PWFEVNAYCLYDTARWKDLNPKFVLQVYR- 637
                G+      RKV GA PHD+G  +  P+   N     D   WKDL   FVLQV+R 
Sbjct: 496 WYFTQGRGRVEAPRKVAGATPHDLGAPNERPFDATNYTAYQDCNLWKDLASDFVLQVWRT 555

Query: 638 -DVVATG-DKKFAKAVWPSVYVAMAYMDQFDRDGDGMIENDGFPDQTYDTWSVSGISAYS 695
             +  TG D +F    WP+V  A+ Y+ QFD + DG+ +N G PDQT+D W + G+SAY 
Sbjct: 556 FSLAPTGEDLRFLADCWPAVVEALRYLKQFDVNDDGLPDNGGAPDQTFDDWPLQGVSAYC 615

Query: 696 GGLWVAALQAASA----LAREVG-DRGSEDY-FLFKFQKAKVVYEK-LWNGSYFNYDNSG 748
           G LW+AAL+AA A    L  ++G D G+E   F    + ++  +++ LWNG Y+  D   
Sbjct: 616 GALWIAALEAALAMAQRLQLDLGLDTGAEQREFGSWLEPSRANFDRLLWNGEYYRID--A 673

Query: 749 SSQSSSIQADQLAGQWYARACGLLPIVDEDKARSALKKVYNYNVLKVMGGKRGAVNGMLP 808
            S +  + ADQL G +YAR  GL  +V +++A S+LK +      +  GG+ G  NG+  
Sbjct: 674 GSGTPVVMADQLCGDFYARLLGLPQVVSDERALSSLKAIREACFERFEGGRLGVANGLRR 733

Query: 809 DGR-VDMSSMQSREIWSGVTYAVAASMIHEDLADIGFQTACGIYEAAWSGTGLGYAFQTP 867
           DG  +D       E+W+G+ + +AA        D        +    ++G   G  F+TP
Sbjct: 734 DGTPLDPKGTHPLEVWTGINFGLAAYYRLMGQTDTALAITSAVVGQVYAG---GLQFRTP 790

Query: 868 EAWNTDDQYRSLCYMRPLAIWAMQWA 893
           EA      +R+  Y+R +AIWA+ WA
Sbjct: 791 EAITAVGTFRACHYLRAMAIWAL-WA 815



 Score =  176 bits (445), Expect = 7e-41,   Method: Compositional matrix adjust.
 Identities = 123/405 (30%), Positives = 191/405 (47%), Gaps = 43/405 (10%)

Query: 85  PFIKRHLTS-----SHGVPLGGVGSGSIGRSYRGEFQRWQIFPRVCEDKPVLANQFSVFV 139
           P+  R+ ++     +HG PLGG G+G IGR   G F  W +         +   QF++F 
Sbjct: 40  PYTVRYASNLDDGPNHGAPLGGFGAGCIGRGPDGSFNLWHLDGGEHWFGVLPDCQFALFE 99

Query: 140 SRSNGQKYSSV-LCPKTPEVLKDTTAAGIGSWDWNLKGDKSTYHALYPRAWTVHEGEPDP 198
                 +  ++ + P+  +   + +   +G W W L+G+  TY A YP  W  ++     
Sbjct: 100 HDGTASRARALAMAPERDDSRPEASERPLGGWQW-LEGEAGTYAARYPLHWFDYQDNFHT 158

Query: 199 ELRIVCRQISPIIPHNYKESSYPVSVFTYTIYNSGKTSADITLLFTWTNSVG-------- 250
           E  + C   SPI+P +Y+ +SYPV+VF + + NS K   D++LL +W N+VG        
Sbjct: 159 E--VSCEAFSPILPGDYQRTSYPVAVFRWRLANSTKKPLDLSLLLSWRNTVGWFTNTDPS 216

Query: 251 --------GDSEFT-----GQHYNSKTKMND--GVHAVLLHHRTSHQLPPV--TFALAA- 292
                   G  E       G     + ++ D  G+  VLL    S  +      + LA  
Sbjct: 217 AAVTFRDDGSPEHNYLPAIGAGVGQRNRLIDAPGLKGVLLEGAPSQPIAEGEGQWCLAVP 276

Query: 293 QETDGVHVSLCPHFVISGNSLGLTAKDMWHEIKEHGSFDRLNSMETSVTSEPGSSIGAAI 352
            + +GV V  C  +  SG+       ++W      GS    N+   S  SE  S   AAI
Sbjct: 277 DQLEGVEVMRCSRWDPSGD-----GAEIWEPFARDGSIPDSNNDRRSTGSEQAS---AAI 328

Query: 353 AASVTVPPDSEGQVTFSLAWDCPEVNFMSGKTYYRRYTKFYGTHQNAAANIARDAILEHG 412
           A  + + P +E ++   ++WD P   F +G +  RRYT F+G    +A  IA +A+ +  
Sbjct: 329 ALRLRLEPGAEIELPVVISWDLPVTAFATGTSALRRYTDFFGAAGTSAVAIAAEALRDWR 388

Query: 413 SWELQIEAWQRPILEDKRLPEWYPITLFNELYYLNAGGAVWTDGS 457
            W   IEAWQ+P+LE   LPE   + L NELY L +GG++WT  S
Sbjct: 389 QWRDAIEAWQQPVLERSDLPEPLRMALLNELYDLTSGGSLWTAAS 433


>gi|427702545|ref|YP_007045767.1| bile acid beta-glucosidase [Cyanobium gracile PCC 6307]
 gi|427345713|gb|AFY28426.1| putative bile acid beta-glucosidase [Cyanobium gracile PCC 6307]
          Length = 835

 Score =  192 bits (487), Expect = 1e-45,   Method: Compositional matrix adjust.
 Identities = 127/386 (32%), Positives = 198/386 (51%), Gaps = 25/386 (6%)

Query: 527 ENIGQFLYLEGIEYLMWNTYDVHFYSSFALIMLFPKIQLSIQRDFAAAVMMHDPSKMKL- 585
           + +G+F  LE ++Y  + + DV  Y SFAL+ L+P++  ++ R FA A+   DP+   + 
Sbjct: 441 DPVGRFGVLECLDYAWYESLDVRLYGSFALLQLWPELDKAVLRSFARAIPAADPTPRPIG 500

Query: 586 --LDDGQW---VSRKVLGAVPHDIGICD--PWFEVNAYCLYDTARWKDLNPKFVLQVYR- 637
                G+     +RKV GA PHD+G  +  P+   N     D   WKDL   +VLQV+R 
Sbjct: 501 WYFTQGRGRVEAARKVAGATPHDLGAPNERPFDATNYTAYQDCNLWKDLASDYVLQVWRT 560

Query: 638 -DVVATG-DKKFAKAVWPSVYVAMAYMDQFDRDGDGMIENDGFPDQTYDTWSVSGISAYS 695
             +  +G D +F    WP+   A+ Y+ QFD + DG+ +N G PDQT+D W + G+SAY 
Sbjct: 561 YSLAPSGPDIRFLADCWPAAVTALDYLKQFDVNDDGLPDNGGAPDQTFDDWPLQGVSAYC 620

Query: 696 G----GLWVAALQAASALAREVGDRGSEDYFLFK--FQKAKVVYEK-LWNGSYFNYDNSG 748
           G        AAL     L  E+G   +E+   F    ++++  ++  LWNG Y+  D   
Sbjct: 621 GALWIAALEAALAIGQRLQLELGLDTAEEQRRFGGWLEQSRSHFDTLLWNGEYYAID--A 678

Query: 749 SSQSSSIQADQLAGQWYARACGLLPIVDEDKARSALKKVYNYNVLKVMGGKRGAVNGMLP 808
            S +  + ADQL G +YAR   L P+V +++A S+LK +         GG+ G  NG+  
Sbjct: 679 GSGTPVVMADQLCGDFYARLLALPPVVADERALSSLKAIRQACFESFEGGRLGVANGLRR 738

Query: 809 DGR-VDMSSMQSREIWSGVTYAVAASMIHEDLADIGFQTACGIYEAAWSGTGLGYAFQTP 867
           DG  +D       E+W+G+ + +AA       +   F     +    + G   G  F+TP
Sbjct: 739 DGTPLDPEGTHPLEVWTGINFGLAAYYRLMGQSSTAFAITGAVVRQVYEG---GLQFRTP 795

Query: 868 EAWNTDDQYRSLCYMRPLAIWAMQWA 893
           EA      +R+  Y+R +AIWA+ WA
Sbjct: 796 EAITAAGTFRACHYLRAMAIWAL-WA 820



 Score =  134 bits (338), Expect = 2e-28,   Method: Compositional matrix adjust.
 Identities = 107/413 (25%), Positives = 179/413 (43%), Gaps = 49/413 (11%)

Query: 85  PFIKRHLTS-----SHGVPLGGVGSGSIGRSYRGEFQRWQIFPRVCEDKPVLANQFSVFV 139
           P+  R+ ++     +HG+PLGG G+G +GR   G F  W +         +   QF++F 
Sbjct: 37  PYTVRYASNLDDGPNHGMPLGGFGAGCLGRGLDGAFNLWHLDGGEHWFGVLPDCQFALFE 96

Query: 140 SRSNGQKYSSVLCPKTPEVLKDTTAAGIGSWDW----NLKGDKSTYHALYPRAWTVHEGE 195
              +G++  ++      +  +  +   + SW+W      +    T    YP   + H   
Sbjct: 97  QDGDGRRAHALATAPQADDSRPGSGPPLASWNWYPAATPEQPSGTVAVRYPI--SSHHYA 154

Query: 196 PDPELRIVCRQISPIIPHNYKESSYPVSVFTYTIYNSGKTSADITLLFTWTNSVG----- 250
                 + C   SPI+P +Y+ +SYPV+VF +T  N      +++LL +W N+VG     
Sbjct: 155 AVFSADVRCEAFSPILPGDYRRTSYPVAVFAWTFSNPTNQPLELSLLLSWRNTVGWFTNT 214

Query: 251 -----------GDSEFT-----GQHYNSKTKMNDG--VHAVLLHHRTSHQLPP------- 285
                      G  E       G+    + +  DG  +  +LL    +  L         
Sbjct: 215 DPAAEVHFRDDGSPEHNYVPAIGRGKGQRNRWVDGAGLKGLLLEGERTQPLAEGEGQWCL 274

Query: 286 VTFALAAQETDGVHVSLCPHFVISGNSLGLTAKDMWHEIKEHGSFDRLNSMETSVTSEPG 345
                AA    GV V  C  +  SG+       ++W      GS    ++   S   +P 
Sbjct: 275 AVPDEAALNHPGVEVFRCSRWDPSGD-----GAELWEPFSRDGSIPDSDNDRRSTGDDP- 328

Query: 346 SSIGAAIAASVTVPPDSEGQVTFSLAWDCPEVNFMSGKTYYRRYTKFYGTHQNAAANIAR 405
             + AA+A    + P +  ++   ++WD P   F +G    RRYT F+    ++AA IA 
Sbjct: 329 --LSAALAVRFRLEPGASLEIPVVISWDLPVTAFATGTRALRRYTDFFSAEGDSAAAIAG 386

Query: 406 DAILEHGSWELQIEAWQRPILEDKRLPEWYPITLFNELYYLNAGGAVWTDGSP 458
           +A+ +   W+  IEAWQ P+++   L E   + L NELY L +GG++WT  SP
Sbjct: 387 EALADWRDWQAAIEAWQAPVVQRHDLAEPLRMALLNELYDLASGGSLWTAASP 439


>gi|317969095|ref|ZP_07970485.1| hypothetical protein SCB02_06140 [Synechococcus sp. CB0205]
          Length = 837

 Score =  191 bits (484), Expect = 2e-45,   Method: Compositional matrix adjust.
 Identities = 130/387 (33%), Positives = 199/387 (51%), Gaps = 25/387 (6%)

Query: 526 EENIGQFLYLEGIEYLMWNTYDVHFYSSFALIMLFPKIQLSIQRDFAAAVMMHDPSKMKL 585
           E+ +G+F  LE ++Y  + + DV  Y S AL+ L+P++  ++ R FA A+   D +   +
Sbjct: 443 EDPVGRFGVLECLDYAWYESLDVRLYGSLALLQLWPELDKAVLRSFARAIPAADATPRPI 502

Query: 586 ---LDDGQW---VSRKVLGAVPHDIGICD--PWFEVNAYCLYDTARWKDLNPKFVLQVYR 637
                 G+     +RKV GA PHD+G  +  PW   N     D   WKDL   +VLQV+R
Sbjct: 503 GWYFTQGKGRVEAARKVKGATPHDLGAPNEKPWDATNYTAYQDCNLWKDLASDYVLQVWR 562

Query: 638 --DVVATG-DKKFAKAVWPSVYVAMAYMDQFDRDGDGMIENDGFPDQTYDTWSVSGISAY 694
              +   G D +F    WP+   A+ Y+  FD + DG+ +N G PDQT+D W + G+SAY
Sbjct: 563 TFKLAPNGEDLRFLADCWPAAVEALRYLKTFDINNDGLPDNGGAPDQTFDDWPLKGVSAY 622

Query: 695 SGGLWVAALQAASA------LAREVGDRGSEDYFLFKFQKAKVVYEK-LWNGSYFNYDNS 747
            G LW+AAL AA A      L+  +     +  F     +A+  +++ LWNG Y++ D  
Sbjct: 623 CGALWIAALDAALAIAQQLQLSTGLDTSAEQQEFSAWLAQARGNFDRLLWNGEYYDID-- 680

Query: 748 GSSQSSSIQADQLAGQWYARACGLLPIVDEDKARSALKKVYNYNVLKVMGGKRGAVNGML 807
             S +  + ADQL G +YAR  GL P+V E  +RS L  V +    K  GG  G  NG+ 
Sbjct: 681 AESGTPVVMADQLCGDFYARLLGLEPVVSEANSRSTLNAVKDSCFEKFEGGSLGVANGLR 740

Query: 808 PDGR-VDMSSMQSREIWSGVTYAVAASMIHEDLADIGFQTACGIYEAAWSGTGLGYAFQT 866
            DG  +D +     E+W+G+ + +A+        +        +    +SG   G  F+T
Sbjct: 741 RDGTPLDPNGTHPLEVWTGINFGIASYFRLMGETETALAITSAVVNQVYSG---GLQFRT 797

Query: 867 PEAWNTDDQYRSLCYMRPLAIWAMQWA 893
           PEA    + +R+  Y+R +AIW + WA
Sbjct: 798 PEAITAVNTFRACHYLRAMAIWGV-WA 823



 Score =  182 bits (463), Expect = 6e-43,   Method: Compositional matrix adjust.
 Identities = 126/424 (29%), Positives = 201/424 (47%), Gaps = 54/424 (12%)

Query: 75  AAKGKRAF----IDPFIKRHLTS-----SHGVPLGGVGSGSIGRSYRGEFQRWQIFPRVC 125
           AA   RAF      P+  R+ ++     +HG+PLGG G+G IGR+  G F  W +     
Sbjct: 31  AASWSRAFGQGWASPYTVRYASNLDDGPNHGMPLGGFGAGCIGRAPDGSFNLWHLDGGEH 90

Query: 126 EDKPVLANQFSVFVSRSNGQKYSSVLCPKTPEVLKDTTAAGIGSWDW----NLKGDKSTY 181
               +   QFS+F S    Q+  ++      +  +   A  + SW W    + +    TY
Sbjct: 91  WFGVLPDCQFSLFESNGTEQRAHALAVKPNRDDSRPEAADPLSSWSWYPASSEQRQTGTY 150

Query: 182 HALYPRAWTVHEGEPDPELRIVCRQISPIIPHNYKESSYPVSVFTYTIYNSGKTSADITL 241
            A YP +W+ ++G  + E  + C   SPI+P NY+++SYP++ F +T+ N  + + D++L
Sbjct: 151 AARYPLSWSHYQGVFEAE--VSCEAFSPILPGNYQQTSYPLATFVWTLRNPTRQALDLSL 208

Query: 242 LFTWTNSVG----------------GDSEFTGQHYNSKTK------MNDG-VHAVLLHHR 278
           + +W N+ G                G  E        K++      ++DG +  V+L   
Sbjct: 209 MLSWRNTCGWFTNTDSSAEVHFRDDGSPEHNYAPAIGKSEGHRNDWIDDGSLKGVVLGGD 268

Query: 279 TSHQLPPVT-----FALAAQETDGVHVSLCPHFVISGNSLGLTAKDMWHEIKEHGSFDRL 333
            S    PV      + +A QE  GV +  C  +   G+       D+W      GS    
Sbjct: 269 VSS---PVAEGEGQWCIATQEQAGVSIQRCSRWNPQGD-----GSDLWGSFSRDGSIPES 320

Query: 334 NSMETSVTSEPGSSIGAAIAASVTVPPDSEGQVTFSLAWDCPEVNFMSGKTYYRRYTKFY 393
           N+   S  +EP   + AA+A    + P    ++   ++WD P  +F +G +  RRYT F+
Sbjct: 321 NNNRRSGAAEP---VSAALAVRCRLEPGQSIEIPVVISWDLPVTSFATGTSALRRYTDFF 377

Query: 394 GTHQNAAANIARDAILEHGSWELQIEAWQRPILEDKRLPEWYPITLFNELYYLNAGGAVW 453
           G     AA IA + + +  SW  QIEAWQ+P+LE   LPE   + LFNELY L +GG++W
Sbjct: 378 GASGKNAAAIAAEGLRDWRSWRQQIEAWQQPVLERSDLPEPLRMALFNELYDLCSGGSLW 437

Query: 454 TDGS 457
           +  S
Sbjct: 438 SAAS 441


>gi|88809081|ref|ZP_01124590.1| hypothetical protein WH7805_05296 [Synechococcus sp. WH 7805]
 gi|88787023|gb|EAR18181.1| hypothetical protein WH7805_05296 [Synechococcus sp. WH 7805]
          Length = 849

 Score =  190 bits (483), Expect = 3e-45,   Method: Compositional matrix adjust.
 Identities = 135/388 (34%), Positives = 205/388 (52%), Gaps = 25/388 (6%)

Query: 525 GEENIGQFLYLEGIEYLMWNTYDVHFYSSFALIMLFPKIQLSIQRDFAAAVMMHDPSKMK 584
           G++  G+F  LE ++Y  + + DV  Y S  L+ L+P++  ++ R FA A+   D ++  
Sbjct: 453 GKDPHGRFGVLECLDYAWYESLDVRLYGSLGLLQLWPELDKAVLRSFARAIPAADATQRP 512

Query: 585 L---LDDGQW---VSRKVLGAVPHDIGICD--PWFEVNAYCLYDTARWKDLNPKFVLQVY 636
           +      G+      RKV GA PHD+G  +  P+   N     D   WKDL   FVLQV+
Sbjct: 513 IGWYFTQGRGRVEADRKVEGATPHDLGAPNEVPFDATNYTAYQDCNLWKDLASDFVLQVW 572

Query: 637 R--DVVATG-DKKFAKAVWPSVYVAMAYMDQFDRDGDGMIENDGFPDQTYDTWSVSGISA 693
           R   +  TG D  F    WP+   A+ Y+ +FD + DG+ +N G PDQT+D W + G+SA
Sbjct: 573 RTFKLAPTGEDLTFLADCWPAAVQALHYLKRFDVNNDGLPDNGGAPDQTFDDWPLKGVSA 632

Query: 694 YSGGLWVAALQAASA----LAREVG-DRGSEDY-FLFKFQKAKVVYEK-LWNGSYFNYDN 746
           Y G LW+AAL+AA A    L +E+G D GSE + F    ++++  ++  LWNG Y++ D 
Sbjct: 633 YCGALWIAALEAALAMAQRLQQELGLDTGSEQHTFSNWLEQSRANFDALLWNGEYYDID- 691

Query: 747 SGSSQSSSIQADQLAGQWYARACGLLPIVDEDKARSALKKVYNYNVLKVMGGKRGAVNGM 806
              S +  + ADQL G +YAR  GL P+V +  +RS LK V      K  GG  G  NG+
Sbjct: 692 -AESGTPVVMADQLCGDFYARLLGLEPVVSDANSRSTLKAVKESCFEKFQGGSLGVANGL 750

Query: 807 LPDGR-VDMSSMQSREIWSGVTYAVAASMIHEDLADIGFQTACGIYEAAWSGTGLGYAFQ 865
             DG  +D +     E+W+G+ + +A+                 +    +SG   G  F+
Sbjct: 751 RRDGTPLDPNGTHPLEVWTGINFGIASYYQLMGEGQTAEAICSAVVHQVYSG---GLQFR 807

Query: 866 TPEAWNTDDQYRSLCYMRPLAIWAMQWA 893
           TPEA    + +R+  Y+R +AIW + WA
Sbjct: 808 TPEAITAVNTFRACHYLRAMAIWGL-WA 834



 Score =  189 bits (479), Expect = 8e-45,   Method: Compositional matrix adjust.
 Identities = 122/393 (31%), Positives = 194/393 (49%), Gaps = 40/393 (10%)

Query: 95  HGVPLGGVGSGSIGRSYRGEFQRWQIFPRVCEDKPVLANQFSVFVSRSNGQKYSSVLCPK 154
           HG+PLGG G+G IGRS RG+F  W +         +   QFS+F +  + ++  ++    
Sbjct: 70  HGMPLGGFGAGCIGRSSRGDFNLWHLDGGEHWYGTIPDCQFSLFENNGSSKRAHALALKP 129

Query: 155 TPEVLKDTTAAGIGSWDW--NLKGDKST--YHALYPRAWTVHEGEPDPELRIVCRQISPI 210
             +  +  +   + +W+W     G++ST  Y A YP +WT +EG  D E++  C   SPI
Sbjct: 130 EADASRSNSGEPLAAWNWYPASTGEQSTGTYAARYPLSWTHYEGVFDAEVQ--CEAFSPI 187

Query: 211 IPHNYKESSYPVSVFTYTIYNSGKTSADITLLFTWTNSVG----------------GDSE 254
           +P +Y+ +SYPV+VF +T+ N      D++LL +W N+VG                G  E
Sbjct: 188 LPGDYQRTSYPVAVFRWTLRNPTDQPLDLSLLLSWRNTVGWFFNTDPSAEVHFRDDGSPE 247

Query: 255 FT-------GQHYNSKTKMNDGVHAVLLHHRTSHQLPPVT--FALAAQET-DGVHVSLCP 304
                    G+   ++    +G+  ++L  + S  +      + LA  +T DGV V  C 
Sbjct: 248 HNYAPAIGHGEGQRNRWLDQEGIRGIVLEGQRSEPIGEGQGQWCLAVPDTLDGVEVLRCS 307

Query: 305 HFVISGNSLGLTAKDMWHEIKEHGSFDRLNSMETSVTSEPGSSIGAAIAASVTVPPDSEG 364
            +  +G+      +++W      G   R+ +      S  G    AAIA + T+ P    
Sbjct: 308 RWDPTGD-----GQELWQPFAAEG---RIPNSNNDRASRAGEHASAAIALTFTLAPGESR 359

Query: 365 QVTFSLAWDCPEVNFMSGKTYYRRYTKFYGTHQNAAANIARDAILEHGSWELQIEAWQRP 424
           ++  +++WD P   F SG    RRYT F+G     AA +A +A+ +   W  QI+AWQ P
Sbjct: 360 EIPIAISWDLPVTAFASGVRDLRRYTDFFGADGRNAAAMATEALRDWRHWRDQIDAWQAP 419

Query: 425 ILEDKRLPEWYPITLFNELYYLNAGGAVWTDGS 457
           +L  K LPE   + LFNELY L +GG++WT  S
Sbjct: 420 VLARKELPEALRMALFNELYDLASGGSLWTAAS 452


>gi|113953483|ref|YP_730032.1| hypothetical protein sync_0818 [Synechococcus sp. CC9311]
 gi|113880834|gb|ABI45792.1| conserved hypothetical protein [Synechococcus sp. CC9311]
          Length = 833

 Score =  189 bits (481), Expect = 5e-45,   Method: Compositional matrix adjust.
 Identities = 135/383 (35%), Positives = 202/383 (52%), Gaps = 25/383 (6%)

Query: 530 GQFLYLEGIEYLMWNTYDVHFYSSFALIMLFPKIQLSIQRDFAAAVMMHDPSKMKL---L 586
           G+F  LE ++Y  + + DV  Y SFAL+ L+P++  ++ R FA A+   D ++  +    
Sbjct: 443 GRFGVLECLDYAWYESLDVRLYGSFALLQLWPELDKAVIRSFARAIPAADATQRPIGWYF 502

Query: 587 DDGQW---VSRKVLGAVPHDIGICD--PWFEVNAYCLYDTARWKDLNPKFVLQVYR--DV 639
             G+      RKV GA PHD+G  +  P+   N     D   WKDL   +VLQV+R   +
Sbjct: 503 TQGRGRVEADRKVKGATPHDLGAPNEVPFDATNYTAYQDCNLWKDLASDYVLQVWRTFKL 562

Query: 640 VATG-DKKFAKAVWPSVYVAMAYMDQFDRDGDGMIENDGFPDQTYDTWSVSGISAYSGGL 698
             TG D  F    WP+   A+ Y+ QFD + DG+ +N G PDQT+D W + G+SAY G L
Sbjct: 563 APTGEDLSFLAECWPAAVQALHYLKQFDVNDDGLPDNGGAPDQTFDDWPLKGVSAYCGAL 622

Query: 699 WVAALQAASA----LAREVG-DRGSEDY-FLFKFQKAKVVYEK-LWNGSYFNYDNSGSSQ 751
           W+AAL+AA A    L  E+G D G E + F    ++++  ++K LWNG Y++ D    S 
Sbjct: 623 WIAALEAALAMAQRLQLELGLDTGDEQHTFSGWLEQSRANFDKLLWNGEYYDID--AESG 680

Query: 752 SSSIQADQLAGQWYARACGLLPIVDEDKARSALKKVYNYNVLKVMGGKRGAVNGMLPDGR 811
           +  + ADQL G +YAR  GL P+V E  +RS LK V         GG  G  NG+  DG 
Sbjct: 681 TPVVMADQLCGDFYARLLGLPPVVSEANSRSTLKAVKEACFDNFAGGSLGVANGLRRDGT 740

Query: 812 -VDMSSMQSREIWSGVTYAVAASMIHEDLADIGFQTACGIYEAAWSGTGLGYAFQTPEAW 870
            +D +     E+W+G+ + +A+                 +    ++G   G  F+TPEA 
Sbjct: 741 PLDPNGTHPLEVWTGINFGIASYYRLMGEGKTAEAICSAVVTQVYAG---GLQFRTPEAI 797

Query: 871 NTDDQYRSLCYMRPLAIWAMQWA 893
              + +R+  Y+R +AIW + WA
Sbjct: 798 TAVNTFRACHYLRAMAIWGL-WA 819



 Score =  175 bits (443), Expect = 1e-40,   Method: Compositional matrix adjust.
 Identities = 134/447 (29%), Positives = 206/447 (46%), Gaps = 65/447 (14%)

Query: 60  LAPIGVRILC--LIREEAAKG--------KRAF----IDPFIKRHLTSS-----HGVPLG 100
           +AP+G+  L   L R  + KG         R+F      PF  R+ ++      HG+PLG
Sbjct: 1   MAPLGLSALRSRLSRGSSGKGWQPPEASWSRSFGLGWQTPFTVRYASNLDDGPWHGMPLG 60

Query: 101 GVGSGSIGRSYRGEFQRWQIFPRVCEDKPVLANQFSVFVSRSNGQKYSSVLCPKTPEVLK 160
           G G+G IGRS RG+F  W +         +   QF+++  + +  +  ++      +  +
Sbjct: 61  GFGAGCIGRSSRGDFNLWHLDGGEHWYGSIPDCQFALWEKQGDQVRVHALATEPARDDSR 120

Query: 161 DTTAAGIGSWDW----NLKGDKSTYHALYPRAWTVHEGEPDPELRIVCRQISPIIPHNYK 216
             +   + SW W      +    TY A YP +W  ++G    E  + C   SPI+P +Y+
Sbjct: 121 PESGKPLSSWQWYPASTEENSTGTYAARYPLSWNHYKGVFRAE--VSCEAFSPILPGDYQ 178

Query: 217 ESSYPVSVFTYTIYNSGKTSADITLLFTWTNSVG----------------GDSEFT---- 256
            SSYPV+VF +T+ N      +++LL +W N+VG                G  E      
Sbjct: 179 RSSYPVAVFRWTLTNPTNKPLEVSLLLSWRNTVGWFTNTDASAEVHFRDDGSPEHNYAPA 238

Query: 257 ---GQHYNSKTKMNDGVHAVLLHHRTSHQLPPVT-----FALAAQET-DGVHVSLCPHFV 307
              G+  +++    DG+  VLL  + S    PV      + LA  +T +GV +  C  + 
Sbjct: 239 IGDGEGQSNRWIDGDGLSGVLLDGKRST---PVAEGEGQWCLALPDTLEGVELMRCSRWD 295

Query: 308 ISGNSLGLTAKDMWHEIKEHGSFDRLNSMETSVTSEPGSSIGAAIAASVTVPPDSEGQVT 367
            S +       ++W      G     N+   S   E  S   AAIA   T+ P    ++ 
Sbjct: 296 PSSD-----GAELWQPFAADGVIPDSNNDRASRKGEHAS---AAIAVKFTLAPGETREIP 347

Query: 368 FSLAWDCPEVNFMSGKTYYRRYTKFYGTHQNAAANIARDAILEHGSWELQIEAWQRPILE 427
            +++WD P  +F +G    RRY  FYG     AA IA +A+ +  SW  QIEAWQ P+L 
Sbjct: 348 VAISWDLPVTSFATGVRDLRRYCDFYGVDGCHAAAIASEALRDWRSWHQQIEAWQEPVLA 407

Query: 428 DKRLPEWYPITLFNELYYLNAGGAVWT 454
            K LPE   + LFNELY L +GG++WT
Sbjct: 408 RKELPEELRMALFNELYDLASGGSLWT 434


>gi|295828610|gb|ADG37974.1| AT1G33700-like protein [Capsella grandiflora]
          Length = 105

 Score =  189 bits (481), Expect = 6e-45,   Method: Compositional matrix adjust.
 Identities = 88/105 (83%), Positives = 95/105 (90%)

Query: 608 DPWFEVNAYCLYDTARWKDLNPKFVLQVYRDVVATGDKKFAKAVWPSVYVAMAYMDQFDR 667
           DPWFEVNAY L++T RWKDLN KFVLQVYRDVVATGD  FAKAVWPSVY A+AY+DQFD+
Sbjct: 1   DPWFEVNAYNLFNTNRWKDLNSKFVLQVYRDVVATGDLNFAKAVWPSVYTAIAYLDQFDK 60

Query: 668 DGDGMIENDGFPDQTYDTWSVSGISAYSGGLWVAALQAASALARE 712
           DGDGMIEN+GFPDQTYD WS SG+SAY GGLWVAALQA SALARE
Sbjct: 61  DGDGMIENEGFPDQTYDAWSCSGVSAYCGGLWVAALQAGSALARE 105


>gi|295828604|gb|ADG37971.1| AT1G33700-like protein [Capsella grandiflora]
 gi|295828608|gb|ADG37973.1| AT1G33700-like protein [Capsella grandiflora]
          Length = 105

 Score =  189 bits (480), Expect = 7e-45,   Method: Compositional matrix adjust.
 Identities = 88/105 (83%), Positives = 95/105 (90%)

Query: 608 DPWFEVNAYCLYDTARWKDLNPKFVLQVYRDVVATGDKKFAKAVWPSVYVAMAYMDQFDR 667
           DPWFEVNAY L++T RWKDLN KFVLQVYRDVVATGD  FAKAVWPSVY A+AY+DQFD+
Sbjct: 1   DPWFEVNAYNLFNTBRWKDLNSKFVLQVYRDVVATGDLNFAKAVWPSVYTAIAYLDQFDK 60

Query: 668 DGDGMIENDGFPDQTYDTWSVSGISAYSGGLWVAALQAASALARE 712
           DGDGMIEN+GFPDQTYD WS SG+SAY GGLWVAALQA SALARE
Sbjct: 61  DGDGMIENEGFPDQTYDAWSCSGVSAYCGGLWVAALQAGSALARE 105


>gi|295828602|gb|ADG37970.1| AT1G33700-like protein [Capsella grandiflora]
 gi|295828606|gb|ADG37972.1| AT1G33700-like protein [Capsella grandiflora]
 gi|295828612|gb|ADG37975.1| AT1G33700-like protein [Capsella grandiflora]
 gi|295828614|gb|ADG37976.1| AT1G33700-like protein [Neslia paniculata]
          Length = 105

 Score =  189 bits (479), Expect = 7e-45,   Method: Compositional matrix adjust.
 Identities = 88/105 (83%), Positives = 95/105 (90%)

Query: 608 DPWFEVNAYCLYDTARWKDLNPKFVLQVYRDVVATGDKKFAKAVWPSVYVAMAYMDQFDR 667
           DPWFEVNAY L++T RWKDLN KFVLQVYRDVVATGD  FAKAVWPSVY A+AY+DQFD+
Sbjct: 1   DPWFEVNAYNLFNTDRWKDLNSKFVLQVYRDVVATGDLNFAKAVWPSVYTAIAYLDQFDK 60

Query: 668 DGDGMIENDGFPDQTYDTWSVSGISAYSGGLWVAALQAASALARE 712
           DGDGMIEN+GFPDQTYD WS SG+SAY GGLWVAALQA SALARE
Sbjct: 61  DGDGMIENEGFPDQTYDAWSCSGVSAYCGGLWVAALQAGSALARE 105


>gi|116073260|ref|ZP_01470522.1| hypothetical protein RS9916_32457 [Synechococcus sp. RS9916]
 gi|116068565|gb|EAU74317.1| hypothetical protein RS9916_32457 [Synechococcus sp. RS9916]
          Length = 832

 Score =  188 bits (477), Expect = 1e-44,   Method: Compositional matrix adjust.
 Identities = 129/391 (32%), Positives = 196/391 (50%), Gaps = 33/391 (8%)

Query: 526 EENIGQFLYLEGIEYLMWNTYDVHFYSSFALIMLFPKIQLSIQRDFAAAVMMHDPSKMKL 585
           E+  G+F  LE ++Y  + + DV  Y S AL+ L+P++  ++ R FA A+   D ++  +
Sbjct: 438 EDPHGRFGVLECLDYAWYESLDVRLYGSLALLQLWPELDKAVLRSFARAIPAADATQRPI 497

Query: 586 ---LDDGQW---VSRKVLGAVPHDIGICD--PWFEVNAYCLYDTARWKDLNPKFVLQVYR 637
                 G+      RKV GA PHD+G  +  PW   N     D   WKDL   +VLQV+R
Sbjct: 498 GWYFTQGKGRVEADRKVKGATPHDLGAPNELPWDATNYTAYQDCNLWKDLGSDYVLQVWR 557

Query: 638 --DVVATG-DKKFAKAVWPSVYVAMAYMDQFDRDGDGMIENDGFPDQTYDTWSVSGISAY 694
              +  +G D +F    WPS   A+ Y+  FD + DG+ +N G PDQT+D W + G+SAY
Sbjct: 558 TFKLSPSGEDVRFLADCWPSAVEALRYLKTFDVNDDGLPDNGGAPDQTFDDWPLKGVSAY 617

Query: 695 SGGLWVA----------ALQAASALAREVGDRGSEDYFLFKFQKAKVVYEK-LWNGSYFN 743
            G LW+A           LQ  + L      R     F    ++++  +++ LWNG Y++
Sbjct: 618 CGALWIAALEAALAIAQTLQLKTGLDTSAEQR----EFSGWLEQSRGNFDRLLWNGEYYD 673

Query: 744 YDNSGSSQSSSIQADQLAGQWYARACGLLPIVDEDKARSALKKVYNYNVLKVMGGKRGAV 803
            D    S +  + ADQL G +YAR  GL P+V E  +RS LK V      K  GGK G  
Sbjct: 674 ID--AESGTPVVMADQLCGDFYARLLGLEPVVSEANSRSTLKAVREACFEKFDGGKLGVA 731

Query: 804 NGMLPDGR-VDMSSMQSREIWSGVTYAVAASMIHEDLADIGFQTACGIYEAAWSGTGLGY 862
           NG+  DG  +D +     E+W+G+ + +A+                 +    ++G   G 
Sbjct: 732 NGLRRDGTPLDPNGTHPLEVWTGINFGIASYYRLMGEGQTAEAICSAVVNQVYAG---GL 788

Query: 863 AFQTPEAWNTDDQYRSLCYMRPLAIWAMQWA 893
            F+TPEA    + +R+  Y+R +AIW + WA
Sbjct: 789 QFRTPEAITAVNTFRACHYLRAMAIWGL-WA 818



 Score =  181 bits (460), Expect = 1e-42,   Method: Compositional matrix adjust.
 Identities = 129/450 (28%), Positives = 214/450 (47%), Gaps = 64/450 (14%)

Query: 60  LAPIGVRILCLIREEAAKGKR--------------AFIDPFIKRHLTS-----SHGVPLG 100
           +AP+G+  L  +     + KR               +  P+  R+ ++     +HG+PLG
Sbjct: 1   MAPLGLSALKSLLGRGKQAKRWQPPEACWSRPFGLGWDKPYTVRYASNLDDGPNHGMPLG 60

Query: 101 GVGSGSIGRSYRGEFQRWQIFPRVCEDKPVLANQFSVFVSRSNGQKYSSVLCPKTPEVLK 160
           G G+G IGR+  G    W +         +   QF++F S  +G++  ++    + +  +
Sbjct: 61  GFGAGCIGRAPDGNINLWHLDGGEHWFGVLPDCQFALFESNGSGKRAHALAVKPSADASR 120

Query: 161 DTTAAGIGSWDW----NLKGDKSTYHALYPRAWTVHEGEPDPELRIVCRQISPIIPHNYK 216
             +   + +WDW      +    TY A YP +WT +EG  D E+R  C   SPI+P +Y+
Sbjct: 121 PESGEPLPAWDWYPASTAERSTGTYAARYPLSWTSYEGVYDAEVR--CEAFSPILPGDYQ 178

Query: 217 ESSYPVSVFTYTIYNSGKTSADITLLFTWTNSVG----------------GDSEFT---- 256
            +SYPV+VF +T++N      D++LL +W N+ G                G  E      
Sbjct: 179 RTSYPVAVFVWTLHNPTDQPLDLSLLLSWRNTTGWFTNTDASAEVHFRDDGSPEHNYAPA 238

Query: 257 -GQHYNSKTK-MNDG-VHAVLLHHRTSHQLPPVT-----FALAAQETDGVHVSLCPHFVI 308
            G+    + + ++DG +  V+L    S+   PV      + +A  E  GV +  C  +  
Sbjct: 239 IGKTVGQRNRWVDDGNLKGVVLEGNVSN---PVAEGEGQWCIATTEQPGVSIQRCSRWNP 295

Query: 309 SGNSLGLTAKDMWHEIKEHGSFDRLNSMETSVTSEPGSSIGAAIAASVTVPPDSEGQVTF 368
           SG+       ++W+     GS    N+  TS  ++P   + AA+A    + P    ++  
Sbjct: 296 SGD-----GSELWNSFSADGSIPESNNDRTSGANDP---LSAALAVQCQLAPGQSLEIPV 347

Query: 369 SLAWDCPEVNFMSGKTYYRRYTKFYGTHQNAAANIARDAILEHGSWELQIEAWQRPILED 428
            ++WD P   F +G    RRYT F+GT  + AA IA +A+ +   W  QI AWQ+P+L+ 
Sbjct: 348 MISWDLPVTAFATGSQALRRYTDFFGTGGDQAAAIAAEALRDWKQWREQIGAWQQPVLQR 407

Query: 429 KRLPEWYPITLFNELYYLNAGGAVWTDGSP 458
             LPE   + LFNELY L +GG++W+  SP
Sbjct: 408 SELPEPLRMALFNELYDLCSGGSLWSAASP 437


>gi|324503709|gb|ADY41606.1| Non-lysosomal glucosylceramidase [Ascaris suum]
          Length = 261

 Score =  186 bits (472), Expect = 5e-44,   Method: Compositional matrix adjust.
 Identities = 106/240 (44%), Positives = 142/240 (59%), Gaps = 7/240 (2%)

Query: 665 FDRDGDGMIENDGFPDQTYDTWSVSGISAYSGGLWVAALQAASALAREVGDRGSEDYFLF 724
           +D DGD +IEN G PDQTYD WS+ G SAY GGLW+ AL+    +A  +G+      F  
Sbjct: 23  WDVDGDDLIENAGQPDQTYDVWSMHGSSAYCGGLWLCALECVRRMALTLGEVVDAQKFAN 82

Query: 725 KFQKAKVVYE-KLWNGSYFNYDNSGSSQSSSIQADQLAGQWYARAC-GLLP--IVDEDKA 780
           K   A+  YE KLWNG YF++D   S+   SI ADQL G W+     G +   I+   + 
Sbjct: 83  KLNNARKAYERKLWNGKYFDFDEH-STDHKSIMADQLCGFWFMCITDGKVDDVIITRQQI 141

Query: 781 RSALKKVYNYNVLKVMGGKRGAVNGMLPDGRVDMSSMQSREIWSGVTYAVAASMIHEDLA 840
            ++LK ++ YNV K   G+ G VN M+P G VD + +QS E+W GV YA+A+  +  +  
Sbjct: 142 CASLKTIFEYNVEKFANGQLGPVNAMMPSGVVDSTGIQSEEVWGGVAYALASFHLLVEEN 201

Query: 841 DIGFQTACGIYEAAWSGTGLGYAFQTPEAWNTDDQYRSLCYMRPLAIWAMQWALTRPKPK 900
           +  F+TA G Y + W   GL Y  Q+PEA N    YR++ YMRPLAIWAMQ AL   + K
Sbjct: 202 ESAFKTAEGWYRSCWERYGLQY--QSPEAINESSYYRAIGYMRPLAIWAMQSALDALRNK 259


>gi|260434763|ref|ZP_05788733.1| conserved hypothetical protein [Synechococcus sp. WH 8109]
 gi|260412637|gb|EEX05933.1| conserved hypothetical protein [Synechococcus sp. WH 8109]
          Length = 832

 Score =  185 bits (470), Expect = 1e-43,   Method: Compositional matrix adjust.
 Identities = 128/391 (32%), Positives = 195/391 (49%), Gaps = 33/391 (8%)

Query: 526 EENIGQFLYLEGIEYLMWNTYDVHFYSSFALIMLFPKIQLSIQRDFAAAVMMHDPSKMKL 585
           E+  G+F  LE ++Y  + + DV  Y S AL+ L+P++  ++ R FA A+   D ++  +
Sbjct: 438 EDPYGRFGVLECLDYAWYESLDVRLYGSLALLQLWPELDKAVLRSFARAIPAADATQRPI 497

Query: 586 ---LDDGQW---VSRKVLGAVPHDIGICD--PWFEVNAYCLYDTARWKDLNPKFVLQVYR 637
                 G+      RKV GA PHD+G  +  PW   N     D   WKDL   FVLQV+R
Sbjct: 498 GWYFTQGKGRVEADRKVKGATPHDLGAPNEIPWDATNYTAYQDCNLWKDLGSDFVLQVWR 557

Query: 638 D--VVATG-DKKFAKAVWPSVYVAMAYMDQFDRDGDGMIENDGFPDQTYDTWSVSGISAY 694
              +  +G D +F    WP+   A+ ++  FD + DG+ +N G PDQT+D W + G+SAY
Sbjct: 558 SFKLAPSGEDIRFLADCWPAAVEALRFLKTFDVNNDGLPDNGGAPDQTFDDWPLKGVSAY 617

Query: 695 SGGLWVA----------ALQAASALAREVGDRGSEDYFLFKFQKAKVVYEK-LWNGSYFN 743
            G LW+A           LQ ++ L      R     F    ++++  ++K LWNG Y++
Sbjct: 618 CGALWIAALEAALAIAQTLQLSTGLDTSAEQR----EFSGWLEQSRGNFDKLLWNGEYYD 673

Query: 744 YDNSGSSQSSSIQADQLAGQWYARACGLLPIVDEDKARSALKKVYNYNVLKVMGGKRGAV 803
            D    S +  + ADQL G +YAR  GL P+V +  +RS LK V         GG  G  
Sbjct: 674 ID--AESGTPVVMADQLCGDFYARLLGLPPVVSDSNSRSTLKVVKEACFEAFDGGSLGVA 731

Query: 804 NGMLPDGR-VDMSSMQSREIWSGVTYAVAASMIHEDLADIGFQTACGIYEAAWSGTGLGY 862
           NG+  DG  +D +     E+W+G+ + +A+                 + E  +SG   G 
Sbjct: 732 NGLRRDGTPLDPNGTHPLEVWTGINFGIASYYRLMGDKQTAQAICSAVVEQVYSG---GL 788

Query: 863 AFQTPEAWNTDDQYRSLCYMRPLAIWAMQWA 893
            F+TPEA    + +R+  Y+R +AIW   WA
Sbjct: 789 QFRTPEAITAVNTFRACHYLRAMAIWGF-WA 818



 Score =  178 bits (451), Expect = 2e-41,   Method: Compositional matrix adjust.
 Identities = 118/411 (28%), Positives = 192/411 (46%), Gaps = 50/411 (12%)

Query: 85  PFIKRHLTS-----SHGVPLGGVGSGSIGRSYRGEFQRWQIFPRVCEDKPVLANQFSVFV 139
           P+  R+ ++     +HG+PLGG G+G  GR+  G    W +         +   QF++F 
Sbjct: 40  PYTVRYASNLDDGPNHGMPLGGFGAGCFGRAPDGNINLWHLDGGEHWFGVLPDCQFALFE 99

Query: 140 SRSNGQKYSSVLCPKTPEVLKDTTAAGIGSWDWNLKG--DKST--YHALYPRAWTVHEGE 195
               G++  ++      +  +      + +W+W      D+ST  Y A YP +WT +EG 
Sbjct: 100 GNGRGKRAHALAVQPDKDASRQEAGQPLKAWEWYPASTPDRSTGTYAARYPFSWTSYEGV 159

Query: 196 PDPELRIVCRQISPIIPHNYKESSYPVSVFTYTIYNSGKTSADITLLFTWTNSVG----- 250
            D E+R  C   SPI+P +Y+ +SYPV+VF +T+ N      D++LL  W N+ G     
Sbjct: 160 YDAEVR--CEAFSPILPGDYQRTSYPVAVFIWTLRNPTDQPLDLSLLLCWRNTTGWFTNT 217

Query: 251 -----------GDSEFT-----GQHYNSKTKMND--GVHAVLLHHRTSHQLPPVT----- 287
                      G  E       G     + +  D   +  V+L    S+   PV      
Sbjct: 218 DASAEVHFRDDGSPEHNYAPAIGSTAGQRNRCIDYGSLKGVVLEGNASN---PVAEGEGQ 274

Query: 288 FALAAQETDGVHVSLCPHFVISGNSLGLTAKDMWHEIKEHGSFDRLNSMETSVTSEPGSS 347
           + +AA E  GV +  C  +  SG+      +++W      GS    N+   S + +P   
Sbjct: 275 WCIAAAEQPGVTIQRCSRWNPSGD-----GRELWDSFSADGSIPESNNDRRSGSDDP--- 326

Query: 348 IGAAIAASVTVPPDSEGQVTFSLAWDCPEVNFMSGKTYYRRYTKFYGTHQNAAANIARDA 407
           + AA+A    + P    ++   ++WD P   F +G    RRYT F+    N A  IA ++
Sbjct: 327 LSAALAVQCQLAPGQSIEIPLVISWDLPVTGFATGSQALRRYTDFFAADANQAVAIAAES 386

Query: 408 ILEHGSWELQIEAWQRPILEDKRLPEWYPITLFNELYYLNAGGAVWTDGSP 458
           + +  SW  +I+AWQ P+L+ + LPE   + LFNELY L +GG++W+  SP
Sbjct: 387 LRDWRSWRQKIDAWQEPVLQRQDLPEPLRMALFNELYDLCSGGSLWSAASP 437


>gi|352096901|ref|ZP_08957615.1| Glucosylceramidase [Synechococcus sp. WH 8016]
 gi|351675485|gb|EHA58646.1| Glucosylceramidase [Synechococcus sp. WH 8016]
          Length = 833

 Score =  183 bits (465), Expect = 3e-43,   Method: Compositional matrix adjust.
 Identities = 133/383 (34%), Positives = 200/383 (52%), Gaps = 25/383 (6%)

Query: 530 GQFLYLEGIEYLMWNTYDVHFYSSFALIMLFPKIQLSIQRDFAAAVMMHDPSKMKL---L 586
           G+F  LE ++Y  + + DV  Y SFAL+ L+P++  ++ R FA A+   D ++  +    
Sbjct: 443 GRFGVLECLDYAWYESLDVRLYGSFALLQLWPELDKAVIRSFARAIPAADATQRPIGWYF 502

Query: 587 DDGQW---VSRKVLGAVPHDIGICD--PWFEVNAYCLYDTARWKDLNPKFVLQVYR--DV 639
             G+      RKV GA PHD+G  +  P+   N     D   WKDL   +VLQV+R   +
Sbjct: 503 TQGRGRVEADRKVKGATPHDLGAPNEVPFDATNYTAYQDCNLWKDLASDYVLQVWRTFKL 562

Query: 640 VATG-DKKFAKAVWPSVYVAMAYMDQFDRDGDGMIENDGFPDQTYDTWSVSGISAYSGGL 698
             TG D  F    WP+   A+ Y+ QFD + DG+ +N G PDQT+D W + G+SAY G L
Sbjct: 563 APTGEDLSFLAECWPAAVQALHYLKQFDVNDDGLPDNGGAPDQTFDDWPLKGVSAYCGAL 622

Query: 699 WVAALQAASA----LAREVG-DRGSEDY-FLFKFQKAKVVYEK-LWNGSYFNYDNSGSSQ 751
           W+AAL+AA A    L  E+G D G E + F    ++++  ++  LWNG Y++ D    S 
Sbjct: 623 WIAALEAALAMAQRLQLELGLDTGDEQHTFSGWLEQSRANFDTLLWNGEYYDID--AESG 680

Query: 752 SSSIQADQLAGQWYARACGLLPIVDEDKARSALKKVYNYNVLKVMGGKRGAVNGMLPDGR 811
           +  + ADQL G +YAR   L P+V E  +RS LK V         GG  G  NG+  DG 
Sbjct: 681 TPVVMADQLCGDFYARLLELPPVVSEANSRSTLKAVKEACFDNFAGGLLGVANGLRRDGT 740

Query: 812 -VDMSSMQSREIWSGVTYAVAASMIHEDLADIGFQTACGIYEAAWSGTGLGYAFQTPEAW 870
            +D +     E+W+G+ + +A+                 +    ++G   G  F+TPEA 
Sbjct: 741 PLDPNGTHPLEVWTGINFGIASYYRLMGEGPTAEAICSAVVTQVYAG---GLQFRTPEAI 797

Query: 871 NTDDQYRSLCYMRPLAIWAMQWA 893
              + +R+  Y+R +AIW + WA
Sbjct: 798 TAVNTFRACHYLRAMAIWGL-WA 819



 Score =  172 bits (435), Expect = 1e-39,   Method: Compositional matrix adjust.
 Identities = 136/444 (30%), Positives = 208/444 (46%), Gaps = 59/444 (13%)

Query: 60  LAPIGVRILC--LIREEAAKG--------KRAF----IDPFIKRHLTSS-----HGVPLG 100
           +AP+G+  L   L R  + KG         R+F      PF  R+ ++      HG+PLG
Sbjct: 1   MAPLGLSALRSRLSRGSSGKGWQPPEASWSRSFGLGWSSPFTVRYASNLDDGPWHGMPLG 60

Query: 101 GVGSGSIGRSYRGEFQRWQIFPRVCEDKPVLANQFSVFVSRSNGQKYSSVLCPKTPEVLK 160
           G G+G IGRS RG+F  W +         +   QF+++  + +  +  ++    T +  +
Sbjct: 61  GFGAGCIGRSSRGDFNLWHLDGGEHWYGSIPDCQFALWEKQDDQVRVHALATEPTRDDSR 120

Query: 161 DTTAAGIGSWDW----NLKGDKSTYHALYPRAWTVHEGEPDPELRIVCRQISPIIPHNYK 216
             +   + SW W      +    TY A YP +W  +EG    E  + C   SPI+P +Y+
Sbjct: 121 PESGKPLSSWQWYPASTEESSTGTYAARYPLSWNHYEGVFRAE--VSCEAFSPILPGDYQ 178

Query: 217 ESSYPVSVFTYTIYNSGKTSADITLLFTWTNSVG----------------GDSEFT---- 256
            SSYPV+VF +T+ N      +++LL +W N+VG                G  E      
Sbjct: 179 RSSYPVAVFRWTLTNPTTKPLEVSLLLSWRNTVGWFTNTDSSAEVHFRDDGSPEHNYAPA 238

Query: 257 ---GQHYNSKTKMNDGVHAVLLHHRTSHQLPPV--TFALAAQET-DGVHVSLCPHFVISG 310
              G+  +++    DG+  VLL  + S  L      + LA  +  DGV +  C  +  SG
Sbjct: 239 IGVGEGQSNRWIDGDGLCGVLLDGKRSTPLAEGEGQWCLALPDRLDGVELMRCSRWDPSG 298

Query: 311 NSLGLTAKDMWHEIKEHGSFDRLNSMETSVTSEPGSSIGAAIAASVTVPPDSEGQVTFSL 370
           +       ++W      G     N+   S   E  S   AAIA  +T+ P    ++  ++
Sbjct: 299 D-----GAELWQPFAADGLIPDSNNDRASRAGEHAS---AAIAVKLTLAPGETREIPVAI 350

Query: 371 AWDCPEVNFMSGKTYYRRYTKFYGTHQNAAANIARDAILEHGSWELQIEAWQRPILEDKR 430
           +WD P  +F +G    RRY+ FYG     AA IA +A+ +  SW  QIEAWQ P+L  K 
Sbjct: 351 SWDLPVTSFATGVRDLRRYSDFYGADGCHAAAIAAEALRDWRSWHEQIEAWQAPVLARKE 410

Query: 431 LPEWYPITLFNELYYLNAGGAVWT 454
           LPE   + LFNELY L +GG++WT
Sbjct: 411 LPEELRMALFNELYDLASGGSLWT 434


>gi|78212458|ref|YP_381237.1| hypothetical protein Syncc9605_0920 [Synechococcus sp. CC9605]
 gi|78196917|gb|ABB34682.1| conserved hypothetical protein [Synechococcus sp. CC9605]
          Length = 832

 Score =  182 bits (461), Expect = 1e-42,   Method: Compositional matrix adjust.
 Identities = 127/391 (32%), Positives = 194/391 (49%), Gaps = 33/391 (8%)

Query: 526 EENIGQFLYLEGIEYLMWNTYDVHFYSSFALIMLFPKIQLSIQRDFAAAVMMHDPSKMKL 585
           E+  G+F  LE ++Y  + + DV  Y S AL+ L+P++  ++ R FA A+   D ++  +
Sbjct: 438 EDPYGRFGVLECLDYAWYESLDVRLYGSLALLQLWPELDKAVLRSFARAIPAADATQRPI 497

Query: 586 ---LDDGQW---VSRKVLGAVPHDIGICD--PWFEVNAYCLYDTARWKDLNPKFVLQVYR 637
                 G+      RKV GA PHD+G  +  PW   N     D   WKDL   FVLQV+R
Sbjct: 498 GWYFTQGKGRVEADRKVKGATPHDLGAPNEIPWDATNYTAYQDCNLWKDLGSDFVLQVWR 557

Query: 638 --DVVATG-DKKFAKAVWPSVYVAMAYMDQFDRDGDGMIENDGFPDQTYDTWSVSGISAY 694
              +  +G D +F    WP+   A+ Y+  FD + DG+ +N G PDQT+D W + G+SAY
Sbjct: 558 TFKLAPSGEDIRFLADCWPAAVEALRYLKTFDVNNDGLPDNGGAPDQTFDDWPLKGVSAY 617

Query: 695 SGGLWVA----------ALQAASALAREVGDRGSEDYFLFKFQKAKVVYEK-LWNGSYFN 743
            G LW+A           LQ ++ L      R     F    ++++  ++K LWNG Y++
Sbjct: 618 CGALWIAALEAALAIAQTLQLSTGLDTSAEQR----EFSGWLEQSRGNFDKLLWNGEYYD 673

Query: 744 YDNSGSSQSSSIQADQLAGQWYARACGLLPIVDEDKARSALKKVYNYNVLKVMGGKRGAV 803
            D    S +  + ADQL G +YAR   L P+V +  + S LK V         GG  G  
Sbjct: 674 ID--AESGTPVVMADQLCGDFYARLLALPPVVSDANSLSTLKAVKEACFEAFDGGSLGVA 731

Query: 804 NGMLPDGR-VDMSSMQSREIWSGVTYAVAASMIHEDLADIGFQTACGIYEAAWSGTGLGY 862
           NG+  DG  +D ++    E+W+G+ + +A+                 + E  +S    G 
Sbjct: 732 NGLRRDGTPLDPNATHPLEVWTGINFGIASYYRLMGDKQTAQSICSAVVEQVYSS---GL 788

Query: 863 AFQTPEAWNTDDQYRSLCYMRPLAIWAMQWA 893
            F+TPEA    + YR+  Y+R +AIW + WA
Sbjct: 789 QFRTPEAITAVNTYRACHYLRAMAIWGL-WA 818



 Score =  176 bits (445), Expect = 7e-41,   Method: Compositional matrix adjust.
 Identities = 118/414 (28%), Positives = 189/414 (45%), Gaps = 56/414 (13%)

Query: 85  PFIKRHLTS-----SHGVPLGGVGSGSIGRSYRGEFQRWQIFPRVCEDKPVLANQFSVFV 139
           P+  R+ ++     +HG+PLGG G+G  GR+  G    W +         +   QF++F 
Sbjct: 40  PYTVRYASNLDDGPNHGMPLGGFGAGCFGRAPDGNINLWHLDGGEHWFGVLPDCQFALFE 99

Query: 140 SRSNGQKYSSVLCPKTPEVLKDTTAAGIGSWDWNLKG--DKST--YHALYPRAWTVHEGE 195
               G++  ++      +  +      +  W+W      D+ST  Y A YP +WT +EG 
Sbjct: 100 GNGRGKRAHALAVQPEQDASRPEAGQPLKVWEWYPASTPDRSTGTYAARYPLSWTSYEGV 159

Query: 196 PDPELRIVCRQISPIIPHNYKESSYPVSVFTYTIYNSGKTSADITLLFTWTNSVGGDSEF 255
            D E+R  C   SPI+P +Y+ +SYPV VF +T+ N      D++LL +W N+ G    F
Sbjct: 160 YDAEVR--CEAFSPILPGDYQRTSYPVVVFVWTLRNPTDQPLDLSLLLSWRNTTG---WF 214

Query: 256 TGQHYNSKTKMND--------------------------GVHAVLLHHRTSHQLPPVT-- 287
           T    +++    D                           +  VLL    S    PV   
Sbjct: 215 TNTDASAEVHFRDDGSPEHNYAPAIGSTAGQRNCCIDDGSLKGVLLEGNVSD---PVAEG 271

Query: 288 ---FALAAQETDGVHVSLCPHFVISGNSLGLTAKDMWHEIKEHGSFDRLNSMETSVTSEP 344
              + +A  E  GV +  C  +  SG+      +++W      GS    N+   S + +P
Sbjct: 272 EGQWCIATAEQPGVTIQRCSRWNPSGD-----GRELWDSFSADGSIPESNNDRRSGSDDP 326

Query: 345 GSSIGAAIAASVTVPPDSEGQVTFSLAWDCPEVNFMSGKTYYRRYTKFYGTHQNAAANIA 404
              + AA+A    + P    ++   ++WD P   F +G    RRYT F+    N A  IA
Sbjct: 327 ---LSAALAVQCQLAPGQSIEIPLVISWDLPVTAFATGSQALRRYTDFFAADANQAVAIA 383

Query: 405 RDAILEHGSWELQIEAWQRPILEDKRLPEWYPITLFNELYYLNAGGAVWTDGSP 458
            +A+ +   W  QIEAWQ+P+L+ + LP+   + LFNELY L +GG++W+  SP
Sbjct: 384 AEALRDWRRWRQQIEAWQQPVLQRQDLPKPLRMALFNELYDLCSGGSLWSAASP 437


>gi|33862646|ref|NP_894206.1| hypothetical protein PMT0373 [Prochlorococcus marinus str. MIT
           9313]
 gi|33634562|emb|CAE20548.1| conserved hypothetical protein [Prochlorococcus marinus str. MIT
           9313]
          Length = 837

 Score =  181 bits (459), Expect = 2e-42,   Method: Compositional matrix adjust.
 Identities = 132/385 (34%), Positives = 204/385 (52%), Gaps = 27/385 (7%)

Query: 529 IGQFLYLEGIEYLMWNTYDVHFYSSFALIMLFPKIQLSIQRDFAAAVMMHDPSKMKL--- 585
           +G+F  LE  +Y  + + DV  Y S AL+ L+P++  ++ R FA A+   D +   +   
Sbjct: 442 VGRFGVLECFDYAWYESLDVRLYGSLALLQLWPELDKAVLRSFARAIPAADATPRPIGWY 501

Query: 586 LDDGQW---VSRKVLGAVPHDIGICD--PWFEVNAYCLYDTARWKDLNPKFVLQVYRDVV 640
              G+      RK   A PHD+G  +  P+   N     D   WKDL   +VLQV+R  +
Sbjct: 502 FTQGRGRVEAPRKRAAATPHDLGAPNESPFDATNYTAYQDCNLWKDLASDYVLQVWRTFL 561

Query: 641 ATG---DKKFAKAVWPSVYVAMAYMDQFDRDGDGMIENDGFPDQTYDTWSVSGISAYSGG 697
            +    D  F    WP+V  A++Y+ +FD + DG+ +N G PDQT+D W + G+SAY G 
Sbjct: 562 LSPNGEDLSFLAECWPAVVQALSYLKRFDVNHDGLPDNGGAPDQTFDDWPLQGVSAYCGA 621

Query: 698 LWVAALQAASA----LAREVGDRGSEDYFLFK--FQKAKVVYEK-LWNGSYFNYDNSGSS 750
           LW+AAL+AA A    L  ++G   +E+   F    ++++V +++ LWNG Y+  D    S
Sbjct: 622 LWIAALEAALAMAQRLQLDLGLNTAEEQHQFSGWLEQSRVNFDRLLWNGEYYKID--AES 679

Query: 751 QSSSIQADQLAGQWYARACGLLPIVDEDKARSALKKVYNYNVLKVMGGKRGAVNGMLPDG 810
            +  + ADQL G +YAR  GL  +V ++++RS+L  V         GG+ G  NG+  DG
Sbjct: 680 GTPVVMADQLCGDFYARLLGLPSVVADERSRSSLNAVKEACFEGFEGGRLGVANGLCRDG 739

Query: 811 R-VDMSSMQSREIWSGVTYAVAASMIHEDLADIGFQTACGIYEAAWSGT-GLGYAFQTPE 868
             +D       E+W+G+ + +AA   +  + D    TA  I  A  +   G G  F+TPE
Sbjct: 740 MPLDPKGTHPLEVWTGINFGLAA--YYRLMGDA--TTATAICSAVVNQVYGGGLQFRTPE 795

Query: 869 AWNTDDQYRSLCYMRPLAIWAMQWA 893
           A      YR+  Y+R +AIWA+ WA
Sbjct: 796 AITAVKTYRACHYLRAMAIWAL-WA 819



 Score =  148 bits (374), Expect = 1e-32,   Method: Compositional matrix adjust.
 Identities = 128/428 (29%), Positives = 201/428 (46%), Gaps = 59/428 (13%)

Query: 75  AAKGKRAF----IDPFIKRHLTSS-----HGVPLGGVGSGSIGRSYRGEFQRWQIFPRVC 125
           AA   RAF      P+  R+ ++      HG+PLGG G+G IGRS RG+F  W +     
Sbjct: 26  AASWSRAFGLPWQQPYTVRYASNLDDGPWHGMPLGGFGAGCIGRSPRGDFNLWNLDGGEH 85

Query: 126 EDKPVLANQFSVFVSRSNG-QKYSSVLCPKT----PEVLKDTTAAGIGSWDW----NLKG 176
               +   QF++F  + +  + ++    P+T    PE+ K      + SW W      + 
Sbjct: 86  WQGSIPDCQFALFERQGDQVRAHALATAPQTDDSQPELDKP-----LSSWSWYPASTEQR 140

Query: 177 DKSTYHALYPRAWTVHEGEPDPELRIVCRQISPIIPHNYKESSYPVSVFTYTIYNSGKTS 236
              +Y A YP +WT ++G    EL  VC   SPI+P +Y+ +SYP++VF + + N    S
Sbjct: 141 KTGSYSARYPLSWTHYKGVFAAEL--VCEAFSPILPGDYRRTSYPLAVFRWQLQNPTSQS 198

Query: 237 ADITLLFTWTNSVG----------------GDSEFT-------GQHYNSKTKMNDGVHAV 273
            +++LL +W N+ G                G  E +       G+   ++     G+  V
Sbjct: 199 LELSLLLSWRNTCGWFTNTDPSASVHFRDDGSPEHSYVPAIGRGEGQCNRWIDQPGLIGV 258

Query: 274 LLHHRTSHQLPPVT--FALAAQE-TDGVHVSLCPHFVISGNSLGLTAKDMWHEIKEHGSF 330
           L+    +  L      + LA  +   GV V  C  +  SG+       ++W      G+ 
Sbjct: 259 LMEGERADPLAEGQGQWCLAVPDHLPGVEVMRCSRWDPSGD-----GSELWSSFASEGTI 313

Query: 331 DRLNSMETSVTSEPGSSIGAAIAASVTVPPDSEGQVTFSLAWDCPEVNFMSGKTYYRRYT 390
              N+   S+  E  S   AA+A  VT+ P    ++   ++WD P   F +G    RRYT
Sbjct: 314 PNSNTAHKSLAGEQTS---AALAVKVTLAPGESLEIPVVISWDLPVTAFATGVRDLRRYT 370

Query: 391 KFYGTHQNAAANIARDAILEHGSWELQIEAWQRPILEDKRLPEWYPITLFNELYYLNAGG 450
            F+G+   AAA +A +A+ +   W  QI+AWQ P+L  + LPE   + LFNELY L +GG
Sbjct: 371 DFHGSDGQAAAALAAEALSDWSDWREQIDAWQAPVLAREDLPERLRMALFNELYDLASGG 430

Query: 451 AVWTDGSP 458
           ++WT   P
Sbjct: 431 SLWTAARP 438


>gi|33866127|ref|NP_897686.1| hypothetical protein SYNW1593 [Synechococcus sp. WH 8102]
 gi|33639102|emb|CAE08108.1| conserved hypothetical protein [Synechococcus sp. WH 8102]
          Length = 833

 Score =  180 bits (456), Expect = 4e-42,   Method: Compositional matrix adjust.
 Identities = 130/383 (33%), Positives = 201/383 (52%), Gaps = 25/383 (6%)

Query: 530 GQFLYLEGIEYLMWNTYDVHFYSSFALIMLFPKIQLSIQRDFAAAVMMHDPSKMKL---L 586
           G+F  LE ++Y  + + DV  Y S AL+ L+P++  ++ R FA A+   D S+  +    
Sbjct: 443 GRFGVLECLDYAWYESLDVRLYGSLALLQLWPELDKAVLRSFARAIPAADGSQRPIGWYF 502

Query: 587 DDGQW---VSRKVLGAVPHDIGICD--PWFEVNAYCLYDTARWKDLNPKFVLQVYRD--V 639
             G+      RKV GA PHD+G  +  P+   N     D   WKDL   ++LQV+R   +
Sbjct: 503 TQGRGRVEADRKVAGATPHDLGAPNERPFDATNYTAYQDCNLWKDLASDYMLQVWRTFRL 562

Query: 640 VATG-DKKFAKAVWPSVYVAMAYMDQFDRDGDGMIENDGFPDQTYDTWSVSGISAYSGGL 698
             TG D +F    WPS   A+ Y+  FD + DG+ +N G PDQT+D W + G+SAY G L
Sbjct: 563 APTGEDLRFLAECWPSAVQALDYLKGFDVNNDGLPDNGGAPDQTFDDWPLQGVSAYCGAL 622

Query: 699 WVAALQAASA----LAREVG-DRGSED-YFLFKFQKAKVVYEK-LWNGSYFNYDNSGSSQ 751
           W+AAL+AA A    L  ++G D  +E   F    ++++  ++  LWNG Y+  D    S 
Sbjct: 623 WIAALEAALAMAQRLQLDLGLDTAAEQRRFGAWLEQSRANFDDLLWNGEYYRID--AESG 680

Query: 752 SSSIQADQLAGQWYARACGLLPIVDEDKARSALKKVYNYNVLKVMGGKRGAVNGMLPDGR 811
           +  + ADQL G +YAR  GL P+V + +ARS L+ V      +   G+ G  NG+  DG 
Sbjct: 681 TPVVMADQLCGDFYARLLGLPPVVSDARARSTLQAVKEACFERFESGRLGVANGLRRDGS 740

Query: 812 -VDMSSMQSREIWSGVTYAVAASMIHEDLADIGFQTACGIYEAAWSGTGLGYAFQTPEAW 870
            +D +     E+W+G+ + +A+       +         +    +SG   G  F+TPEA 
Sbjct: 741 PLDPNGTHPLEVWTGINFGLASYYRLMGESKTALAICSAVVGQVYSG---GLQFRTPEAI 797

Query: 871 NTDDQYRSLCYMRPLAIWAMQWA 893
              + +R+  Y+R +AIW + WA
Sbjct: 798 TAVNTFRACHYLRAMAIWGL-WA 819



 Score =  161 bits (408), Expect = 2e-36,   Method: Compositional matrix adjust.
 Identities = 119/409 (29%), Positives = 188/409 (45%), Gaps = 45/409 (11%)

Query: 85  PFIKRHLTS-----SHGVPLGGVGSGSIGRSYRGEFQRWQIFPRVCEDKPVLANQFSVFV 139
           P+  R+ ++      HG+PLGG G+G +GRS  G F  W +         +   QF+++ 
Sbjct: 40  PYTVRYASNLDDGPDHGMPLGGFGAGCLGRSPSGAFNLWNLDGGEHWFGSIPDCQFALWE 99

Query: 140 SRSNGQKYSSVLCPKTPEVLKDTTAAGIGSWDW---NLKGDKS-TYHALYPRAWTVHEGE 195
            +    +  ++      +    +    + +W W   + +G  + TY A YP +WT HE  
Sbjct: 100 QQGEDVRAHALATAPDRDDTDPSGGTPLSAWQWYPASTQGRSTGTYAARYPLSWTHHEDV 159

Query: 196 PDPELRIVCRQISPIIPHNYKESSYPVSVFTYTIYNSGKTSADITLLFTWTNSVG----- 250
                 ++C   SPI+P +Y+ +SYPV+VF +   N      +++L+  W N+VG     
Sbjct: 160 Y--RAGVLCEAFSPILPGDYQRTSYPVAVFRWQFSNPTDQPLELSLMLCWRNTVGWFTNT 217

Query: 251 -----------GDSEFT-------GQHYNSKTKMNDGVHAVLLHHRTSHQLPPV--TFAL 290
                      G  E         G+   ++     G   VLL  + S  L      + L
Sbjct: 218 DAAAEVHFRDDGSPEHNYAPAIGRGEGQRNRQVDQPGFSGVLLEGQPSMPLAEGEGQWCL 277

Query: 291 A-AQETDGVHVSLCPHFVISGNSLGLTAKDMWHEIKEHGSFDRLNSMETSVTSEPGSSIG 349
           A   + DGVHV  C  +  SG+       ++W      G   R+        S  G    
Sbjct: 278 ALPDDLDGVHVMRCSRWDPSGD-----GSELWTPFAADG---RIPDSNNDRASRSGEQAS 329

Query: 350 AAIAASVTVPPDSEGQVTFSLAWDCPEVNFMSGKTYYRRYTKFYGTHQNAAANIARDAIL 409
           AAIA   T+ P +  ++   ++WD P   F SG +  RRYT F+G+  + AA IA +A+ 
Sbjct: 330 AAIAVKFTLAPGASREIPLVISWDLPVTAFASGSSALRRYTDFFGSSGSNAAAIAAEALR 389

Query: 410 EHGSWELQIEAWQRPILEDKRLPEWYPITLFNELYYLNAGGAVWTDGSP 458
           +   W  QI+AWQ+P+L   +LPE   + LFNELY L +GG++WT  SP
Sbjct: 390 DWRQWREQIDAWQQPVLARMQLPEPLRMALFNELYDLCSGGSLWTAASP 438


>gi|124023625|ref|YP_001017932.1| bile acid beta-glucosidase [Prochlorococcus marinus str. MIT 9303]
 gi|123963911|gb|ABM78667.1| Predicted bile acid beta-glucosidase [Prochlorococcus marinus str.
           MIT 9303]
          Length = 837

 Score =  178 bits (451), Expect = 1e-41,   Method: Compositional matrix adjust.
 Identities = 130/385 (33%), Positives = 202/385 (52%), Gaps = 27/385 (7%)

Query: 529 IGQFLYLEGIEYLMWNTYDVHFYSSFALIMLFPKIQLSIQRDFAAAVMMHDPSKMKL--- 585
           +G+F  LE  +Y  + + DV  Y S AL+ L+P++  ++ R FA A+   D +   +   
Sbjct: 442 VGRFGVLECFDYAWYESLDVRLYGSLALLQLWPELDKAVLRSFARAIPAADATPRPIGWY 501

Query: 586 LDDGQW---VSRKVLGAVPHDIGICD--PWFEVNAYCLYDTARWKDLNPKFVLQVYRDVV 640
              G+      RK   A PHD+G  +  P+   N     D   WKDL   +VLQV+R  +
Sbjct: 502 FTQGRGRVEAPRKRAAATPHDLGAPNESPFDATNYTAYQDCNLWKDLASDYVLQVWRTFL 561

Query: 641 ATG---DKKFAKAVWPSVYVAMAYMDQFDRDGDGMIENDGFPDQTYDTWSVSGISAYSGG 697
            +    D  F    WP+   A++Y+ +FD + DG+ +N G PDQT+D W + G+SAY G 
Sbjct: 562 LSPNGEDLSFLAECWPAAVQALSYLKRFDVNHDGLPDNGGAPDQTFDDWPLQGVSAYCGA 621

Query: 698 LWVAALQAASA----LAREVGDRGSEDYFLFK--FQKAKVVYEK-LWNGSYFNYDNSGSS 750
           LW+AAL+AA A    L  ++G   +E+   F    ++++  +++ LWNG Y+  D    S
Sbjct: 622 LWIAALEAALAMAQRLQLDLGLNTAEEQHQFSGWLEQSRANFDRLLWNGEYYKID--AES 679

Query: 751 QSSSIQADQLAGQWYARACGLLPIVDEDKARSALKKVYNYNVLKVMGGKRGAVNGMLPDG 810
            +  + ADQL G +YAR  GL  +V ++++RS+L  V         GG+ G  NG+  DG
Sbjct: 680 GTPVVMADQLCGDFYARLLGLPSVVADERSRSSLNAVKEACFEGFEGGRLGVANGLCRDG 739

Query: 811 R-VDMSSMQSREIWSGVTYAVAASMIHEDLADIGFQTACGIYEAAWSGT-GLGYAFQTPE 868
             +D       E+W+G+ + +AA   +  + D    TA  I  A  +   G G  F+TPE
Sbjct: 740 MPLDPKGTHPLEVWTGINFGLAA--YYRLMGDA--TTATAICSAVVNQVYGGGLQFRTPE 795

Query: 869 AWNTDDQYRSLCYMRPLAIWAMQWA 893
           A      YR+  Y+R +AIWA+ WA
Sbjct: 796 AITAVKTYRACHYLRAMAIWAL-WA 819



 Score =  148 bits (373), Expect = 2e-32,   Method: Compositional matrix adjust.
 Identities = 128/428 (29%), Positives = 200/428 (46%), Gaps = 59/428 (13%)

Query: 75  AAKGKRAF----IDPFIKRHLTSS-----HGVPLGGVGSGSIGRSYRGEFQRWQIFPRVC 125
           AA   RAF      P+  R+ ++      HG+PLGG G+G IGRS RG+F  W +     
Sbjct: 26  AASWSRAFGLAWQQPYTVRYASNLDDGPWHGMPLGGFGAGCIGRSPRGDFNLWNLDAGEH 85

Query: 126 EDKPVLANQFSVFVSRSNG-QKYSSVLCPKT----PEVLKDTTAAGIGSWDW----NLKG 176
               +   QF++F  + +  + ++    P+T    PE+ K      + SW W      + 
Sbjct: 86  WQGSIPDCQFALFERQGDQVRAHALATAPQTDDSQPELDKP-----LSSWSWYPASTEQR 140

Query: 177 DKSTYHALYPRAWTVHEGEPDPELRIVCRQISPIIPHNYKESSYPVSVFTYTIYNSGKTS 236
              +Y A YP +WT ++G    EL  VC   SPI+P +Y+ +SYP++VF + + N    S
Sbjct: 141 TTGSYSARYPLSWTHYKGVFAAEL--VCEAFSPILPGDYRRTSYPLAVFRWQLQNPTSQS 198

Query: 237 ADITLLFTWTNSVG----------------GDSEFT-------GQHYNSKTKMNDGVHAV 273
            +++LL +W N+ G                G  E +       G+   ++     G+  V
Sbjct: 199 LELSLLLSWRNTCGWFTNTDPSASVHFRDDGSPEHSYVPAIGRGEGQCNRWIDQPGLIGV 258

Query: 274 LLHHRTSHQLPPVT--FALAAQE-TDGVHVSLCPHFVISGNSLGLTAKDMWHEIKEHGSF 330
           L+    +  L      + LA  +   GV V  C  +  SG+       ++W      G+ 
Sbjct: 259 LMEGERADPLAEGQGQWCLAVPDHLPGVEVMRCSRWDPSGD-----GSELWSSFASEGTI 313

Query: 331 DRLNSMETSVTSEPGSSIGAAIAASVTVPPDSEGQVTFSLAWDCPEVNFMSGKTYYRRYT 390
              N    S+  E  S   AA+A  VT+ P    ++   ++WD P   F +G    RRYT
Sbjct: 314 PNSNDTHKSLAGEQTS---AALAVKVTLAPGESLEIPVVISWDLPVTAFATGVRDLRRYT 370

Query: 391 KFYGTHQNAAANIARDAILEHGSWELQIEAWQRPILEDKRLPEWYPITLFNELYYLNAGG 450
            F+G+   AAA +A +A+ +   W  QI+AWQ P+L  + LPE   + LFNELY L +GG
Sbjct: 371 DFHGSDGQAAAALAAEALRDWPDWREQIDAWQAPVLAREDLPERLRMALFNELYDLASGG 430

Query: 451 AVWTDGSP 458
           ++WT   P
Sbjct: 431 SLWTAARP 438


>gi|116072378|ref|ZP_01469645.1| hypothetical protein BL107_11341 [Synechococcus sp. BL107]
 gi|116064900|gb|EAU70659.1| hypothetical protein BL107_11341 [Synechococcus sp. BL107]
          Length = 820

 Score =  177 bits (449), Expect = 3e-41,   Method: Compositional matrix adjust.
 Identities = 127/387 (32%), Positives = 191/387 (49%), Gaps = 33/387 (8%)

Query: 530 GQFLYLEGIEYLMWNTYDVHFYSSFALIMLFPKIQLSIQRDFAAAVMMHDPSKMKL---L 586
           G+F  LE ++Y  + + DV  Y SFAL+ L+P++  ++ R F+ A+   D ++  +    
Sbjct: 430 GRFGVLECLDYAWYESLDVRLYGSFALLQLWPELDKAVLRSFSRAIPARDATQRPIGWYF 489

Query: 587 DDGQW---VSRKVLGAVPHDIGICD--PWFEVNAYCLYDTARWKDLNPKFVLQVYR--DV 639
             G+      RKV GA PHD+G  +  P+   N     D   WKDL   FVLQV+R   +
Sbjct: 490 TQGRGRVEADRKVAGATPHDLGAPNEVPFDATNYTAYQDCNLWKDLASDFVLQVWRTFQL 549

Query: 640 VATG-DKKFAKAVWPSVYVAMAYMDQFDRDGDGMIENDGFPDQTYDTWSVSGISAYSGGL 698
             TG D  F    WPS   A+ Y+  FD + DG+ +N G PDQT+D W + G+SAY G L
Sbjct: 550 APTGEDLNFLAECWPSAVQALDYLKGFDVNDDGLPDNGGAPDQTFDDWPLKGVSAYCGAL 609

Query: 699 WVAA----------LQAASALAREVGDRGSEDYFLFKFQKAKVVYEK-LWNGSYFNYDNS 747
           W+AA          LQ    L   V  R     F    ++++  +++ LWNG Y+  D  
Sbjct: 610 WIAALEAALAMAQRLQLDLGLDTTVEQR----RFGQWLEQSRSNFDRLLWNGEYYEID-- 663

Query: 748 GSSQSSSIQADQLAGQWYARACGLLPIVDEDKARSALKKVYNYNVLKVMGGKRGAVNGML 807
             S +  + ADQL G +YAR  GL  +V +  A S L  V      +   G  G  NG+ 
Sbjct: 664 AESGTPVVMADQLCGDFYARLLGLPSVVSDANALSTLNVVREACFERFEAGTLGVANGLR 723

Query: 808 PDGR-VDMSSMQSREIWSGVTYAVAASMIHEDLADIGFQTACGIYEAAWSGTGLGYAFQT 866
            DG  +D +     E+W+G+ + +A+       +D        +    +SG   G  F+T
Sbjct: 724 RDGTPLDPNGTHPLEVWTGINFGIASYFRLMGQSDTALAITGAVVNQVYSG---GLQFRT 780

Query: 867 PEAWNTDDQYRSLCYMRPLAIWAMQWA 893
           PEA    + +R+  Y+R +AIW + WA
Sbjct: 781 PEAITAVNTFRACHYLRAMAIWGL-WA 806



 Score =  158 bits (400), Expect = 1e-35,   Method: Compositional matrix adjust.
 Identities = 116/393 (29%), Positives = 181/393 (46%), Gaps = 44/393 (11%)

Query: 94  SHGVPLGGVGSGSIGRSYRGEFQRWQIFPRVCEDKPVLANQFSVFVSRSNGQKYSSVLCP 153
           +HG+PLGG G+G  GR+  G    W +         +   QFS++  + +  +  ++   
Sbjct: 41  NHGMPLGGFGAGCFGRAPDGNTNLWHLDGGEHWFGTLPDCQFSLWEKQGDDLRAHALAVS 100

Query: 154 KTPEVLKDTTAAGIGSWDW----NLKGDKSTYHALYPRAWTVHEGEPDPELR--IVCRQI 207
              +  +   AA + +W W      +    TY A YP +WT ++G      R  ++C   
Sbjct: 101 PEGDASRPDAAASLPAWSWYPASTDQCSTGTYAARYPLSWTHYDGV----FRSGVLCEAF 156

Query: 208 SPIIPHNYKESSYPVSVFTYTIYNSGKTSADITLLFTWTNSVG----------------G 251
           SPI+P +Y+ SSYPV+VF + + N      +++L+ +W N+VG                G
Sbjct: 157 SPILPGDYQRSSYPVAVFRWQLANPTDQPLELSLMVSWRNTVGWFTNTDAAAEVHFRDDG 216

Query: 252 DSEFT-------GQHYNSKTKMNDGVHAVLLHHRTSHQLPPV--TFALAA-QETDGVHVS 301
             E         G+   ++     GV  VLL  R+S  +      + LA   + DGV V 
Sbjct: 217 SPEHNYSPAIGRGEGQRNRAINEPGVTGVLLEGRSSEPIAEGEGQWCLAVPDDLDGVDVL 276

Query: 302 LCPHFVISGNSLGLTAKDMWHEIKEHGSFDRLNSMETSVTSEPGSSIGAAIAASVTVPPD 361
            C  +   G+       D+W      G   R+   +    S  G    AA+    T+ P 
Sbjct: 277 RCSRWDPCGD-----GSDIWTPFAAEG---RIPESDNDRESRAGEQASAAMVVKFTLAPG 328

Query: 362 SEGQVTFSLAWDCPEVNFMSGKTYYRRYTKFYGTHQNAAANIARDAILEHGSWELQIEAW 421
              ++   ++WD P   F SG    RRYT F+G+    AA IA +A+ +   W  +IEAW
Sbjct: 329 ESREIPVVISWDLPITAFASGCADRRRYTDFFGSDGRNAAAIAAEALRDWSDWRGRIEAW 388

Query: 422 QRPILEDKRLPEWYPITLFNELYYLNAGGAVWT 454
           Q+P+LE K LPE   + LFNELY L +GG++WT
Sbjct: 389 QQPVLERKALPEPVRMALFNELYDLCSGGSLWT 421


>gi|344251774|gb|EGW07878.1| Non-lysosomal glucosylceramidase [Cricetulus griseus]
          Length = 226

 Score =  159 bits (401), Expect = 1e-35,   Method: Compositional matrix adjust.
 Identities = 86/185 (46%), Positives = 108/185 (58%), Gaps = 7/185 (3%)

Query: 709 LAREVGDRGSEDYFLFKFQKAKVVYEKL-WNGSYFNYDNSGSSQSSSIQADQLAGQWYAR 767
           +A   G +  +D F     + +  YE+L WNG Y+NYD+S   QS SI +DQ AGQW+ R
Sbjct: 1   MAVLCGAQDVQDKFSSILCRGREAYERLLWNGRYYNYDSSSQPQSRSIMSDQCAGQWFLR 60

Query: 768 ACGL----LPIVDEDKARSALKKVYNYNVLKVMGGKRGAVNGMLPDGRVDMSSMQSREIW 823
           ACGL      +        AL+ ++  NV    GG  GAVNGM P G  D SS+QS E+W
Sbjct: 61  ACGLGEGDTEVFPTLHVVRALQTIFELNVQAFAGGAMGAVNGMQPHGVPDRSSVQSDEVW 120

Query: 824 SGVTYAVAASMIHEDLADIGFQTACGIYEAAWSGTGLGYAFQTPEAWNTDDQYRSLCYMR 883
            GV Y +AA+MI E L   GF+TA G Y   W    LG AFQTPEA+     +RSL YMR
Sbjct: 121 VGVVYGLAATMIQEGLTREGFRTAEGCYRTVWE--RLGLAFQTPEAYCQQQVFRSLAYMR 178

Query: 884 PLAIW 888
           PL+IW
Sbjct: 179 PLSIW 183


>gi|16183333|gb|AAL13692.1| GH26380p [Drosophila melanogaster]
          Length = 391

 Score =  157 bits (396), Expect = 3e-35,   Method: Compositional matrix adjust.
 Identities = 85/203 (41%), Positives = 122/203 (60%), Gaps = 10/203 (4%)

Query: 647 FAKAVWPSVYVAMAYMDQFDRDGDGMIENDGFPDQTYDTWSVSGISAYSGGLWVAALQAA 706
           + KA++ S    M    ++D+D DG+IEN   PDQTYD+W + G SAY  GLW+AALQA 
Sbjct: 171 YLKAMYASCKAIMERTIEYDKDNDGLIENTKMPDQTYDSWVMDGPSAYCSGLWLAALQAM 230

Query: 707 SALAREVGDRGSEDYFLFKFQKAK-VVYEKLWNGSYFNYDNSGSSQSSSIQADQLAGQWY 765
           SA+A  +        +    +K K  + EKLWNGSY+ +D S  S   +I ADQL G WY
Sbjct: 231 SAMATILDQPNDCLRYQDILEKGKRSLEEKLWNGSYYRFDLS-HSHRDTIMADQLCGHWY 289

Query: 766 ARACGL-LPIVDEDKARSALKKVYNYNVLKVMGGKRGAVNGMLPD-------GRVDMSSM 817
            ++CG    I  ++  R+ALK++Y+ NV+    G  GA NG + +       G VD S++
Sbjct: 290 LKSCGFDYEIYPKENVRTALKRIYDNNVMGFHEGNIGAANGFIANASEPTKPGHVDNSNI 349

Query: 818 QSREIWSGVTYAVAASMIHEDLA 840
           Q+ E+W GV YA+AA+MI E ++
Sbjct: 350 QAEEVWPGVVYALAATMIQEGIS 372



 Score =  103 bits (258), Expect = 3e-19,   Method: Compositional matrix adjust.
 Identities = 54/125 (43%), Positives = 78/125 (62%), Gaps = 2/125 (1%)

Query: 542 MWNTYDVHFYSSFALIMLFPKIQLSIQRDFAAAVMMHDPSKMKLLDDGQWVSRKVLGAVP 601
           M+NTYDVHFY+S AL  L+P +Q+S+Q DF  A+        K+L DG+ + RKV   VP
Sbjct: 1   MYNTYDVHFYASPALAHLWPNLQVSLQYDFKDAIAAELNDTRKMLYDGKVMPRKVKNCVP 60

Query: 602 HDIGICD--PWFEVNAYCLYDTARWKDLNPKFVLQVYRDVVATGDKKFAKAVWPSVYVAM 659
           HD+G  D  P+  +N Y ++D   WKDLN KFVLQVYRD     +   A++   S + ++
Sbjct: 61  HDLGDPDEEPFTLINCYNIHDVNDWKDLNTKFVLQVYRDYYVLNELAQAQSDNASKFSSI 120

Query: 660 AYMDQ 664
            ++D+
Sbjct: 121 EFIDK 125


>gi|226444526|gb|ACO57837.1| unknown [Helianthus annuus]
 gi|226444528|gb|ACO57838.1| unknown [Helianthus annuus]
 gi|226444560|gb|ACO57854.1| unknown [Helianthus petiolaris]
 gi|226444568|gb|ACO57858.1| unknown [Helianthus petiolaris]
          Length = 103

 Score =  141 bits (355), Expect = 2e-30,   Method: Compositional matrix adjust.
 Identities = 66/103 (64%), Positives = 76/103 (73%)

Query: 794 KVMGGKRGAVNGMLPDGRVDMSSMQSREIWSGVTYAVAASMIHEDLADIGFQTACGIYEA 853
           KV GG+ GAVNGM P+G+VD + MQSRE+W+GVTY VAA+MIH  + D  F TA GI+ A
Sbjct: 1   KVRGGRMGAVNGMHPNGKVDETCMQSREVWTGVTYGVAATMIHAGMEDKAFTTAEGIFIA 60

Query: 854 AWSGTGLGYAFQTPEAWNTDDQYRSLCYMRPLAIWAMQWALTR 896
            WS  G GYAFQTPE W  D  YRSL YMRPLAIW MQ AL +
Sbjct: 61  GWSEEGFGYAFQTPEGWTMDGSYRSLVYMRPLAIWGMQQALEK 103


>gi|226444514|gb|ACO57831.1| unknown [Helianthus annuus]
 gi|226444516|gb|ACO57832.1| unknown [Helianthus annuus]
 gi|226444518|gb|ACO57833.1| unknown [Helianthus annuus]
 gi|226444520|gb|ACO57834.1| unknown [Helianthus annuus]
 gi|226444522|gb|ACO57835.1| unknown [Helianthus annuus]
 gi|226444530|gb|ACO57839.1| unknown [Helianthus annuus]
 gi|226444532|gb|ACO57840.1| unknown [Helianthus annuus]
 gi|226444534|gb|ACO57841.1| unknown [Helianthus annuus]
 gi|226444536|gb|ACO57842.1| unknown [Helianthus annuus]
 gi|226444538|gb|ACO57843.1| unknown [Helianthus annuus]
 gi|226444540|gb|ACO57844.1| unknown [Helianthus annuus]
 gi|226444542|gb|ACO57845.1| unknown [Helianthus annuus]
 gi|226444544|gb|ACO57846.1| unknown [Helianthus annuus]
 gi|226444546|gb|ACO57847.1| unknown [Helianthus petiolaris]
 gi|226444548|gb|ACO57848.1| unknown [Helianthus petiolaris]
 gi|226444550|gb|ACO57849.1| unknown [Helianthus petiolaris]
 gi|226444552|gb|ACO57850.1| unknown [Helianthus petiolaris]
 gi|226444554|gb|ACO57851.1| unknown [Helianthus petiolaris]
 gi|226444556|gb|ACO57852.1| unknown [Helianthus petiolaris]
 gi|226444558|gb|ACO57853.1| unknown [Helianthus petiolaris]
 gi|226444562|gb|ACO57855.1| unknown [Helianthus petiolaris]
 gi|226444564|gb|ACO57856.1| unknown [Helianthus petiolaris]
 gi|226444566|gb|ACO57857.1| unknown [Helianthus petiolaris]
 gi|226444570|gb|ACO57859.1| unknown [Helianthus argophyllus]
 gi|226444572|gb|ACO57860.1| unknown [Helianthus argophyllus]
 gi|226444576|gb|ACO57862.1| unknown [Helianthus argophyllus]
 gi|226444578|gb|ACO57863.1| unknown [Helianthus argophyllus]
 gi|226444580|gb|ACO57864.1| unknown [Helianthus argophyllus]
 gi|226444582|gb|ACO57865.1| unknown [Helianthus argophyllus]
 gi|226444584|gb|ACO57866.1| unknown [Helianthus argophyllus]
 gi|226444586|gb|ACO57867.1| unknown [Helianthus argophyllus]
 gi|226444588|gb|ACO57868.1| unknown [Helianthus argophyllus]
 gi|226444590|gb|ACO57869.1| unknown [Helianthus argophyllus]
 gi|226444592|gb|ACO57870.1| unknown [Helianthus argophyllus]
          Length = 103

 Score =  140 bits (354), Expect = 2e-30,   Method: Compositional matrix adjust.
 Identities = 66/103 (64%), Positives = 76/103 (73%)

Query: 794 KVMGGKRGAVNGMLPDGRVDMSSMQSREIWSGVTYAVAASMIHEDLADIGFQTACGIYEA 853
           KV GG+ GAVNGM P+G+VD + MQSRE+W+GVTY VAA+MIH  + D  F TA GI+ A
Sbjct: 1   KVRGGRMGAVNGMHPNGKVDETCMQSREVWTGVTYGVAATMIHAGMEDNAFTTAEGIFIA 60

Query: 854 AWSGTGLGYAFQTPEAWNTDDQYRSLCYMRPLAIWAMQWALTR 896
            WS  G GYAFQTPE W  D  YRSL YMRPLAIW MQ AL +
Sbjct: 61  GWSEEGFGYAFQTPEGWTMDGSYRSLVYMRPLAIWGMQQALEK 103


>gi|226444524|gb|ACO57836.1| unknown [Helianthus annuus]
          Length = 103

 Score =  140 bits (353), Expect = 4e-30,   Method: Compositional matrix adjust.
 Identities = 66/101 (65%), Positives = 75/101 (74%)

Query: 794 KVMGGKRGAVNGMLPDGRVDMSSMQSREIWSGVTYAVAASMIHEDLADIGFQTACGIYEA 853
           KV GG+ GAVNGM P+G+VD + MQSRE+W+GVTY VAA+MIH  + D  F TA GI+ A
Sbjct: 1   KVRGGRMGAVNGMHPNGKVDETCMQSREVWTGVTYGVAATMIHAGMEDNAFTTAEGIFIA 60

Query: 854 AWSGTGLGYAFQTPEAWNTDDQYRSLCYMRPLAIWAMQWAL 894
            WS  G GYAFQTPE W  D  YRSL YMRPLAIW MQ AL
Sbjct: 61  GWSEEGFGYAFQTPEGWTMDGSYRSLVYMRPLAIWGMQQAL 101


>gi|119578750|gb|EAW58346.1| glucosidase, beta (bile acid) 2, isoform CRA_b [Homo sapiens]
          Length = 370

 Score =  139 bits (349), Expect = 1e-29,   Method: Compositional matrix adjust.
 Identities = 81/211 (38%), Positives = 123/211 (58%), Gaps = 12/211 (5%)

Query: 72  REEAAKGKRAFIDPFIKRHLTSSHGVPLGGVGSGSIGRSYRGEFQRWQIFPRVCEDKPVL 131
           R+   + K  FID      L   +G PLGG+G G+I R +RG+F RWQ+ P + + + V+
Sbjct: 127 RKTHVEKKTPFIDMINSVPLRQIYGCPLGGIGGGTITRGWRGQFCRWQLNPGMYQHRTVI 186

Query: 132 ANQFSVFVSRSNGQKYSSVLCPKTPEVLKDTTAAGIGSWDWNLKGDKSTYHALYPRAWTV 191
           A+QF+V + R     Y  VL  + P VL+        SW+W L G  + YHALYPRAWTV
Sbjct: 187 ADQFTVCLRREGQTVYQQVLSLERPSVLR--------SWNWGLCGYFAFYHALYPRAWTV 238

Query: 192 HEGEPDPELRIVCRQISPIIPHNYKESSYPVSVFTYTIYNSGKTSADITLLFTWTNSVGG 251
           ++  P   + + CRQI+PI+PH+Y++SS PV VF + + N G  + D++++F+  N +GG
Sbjct: 239 YQ-LPGQNVTLTCRQITPILPHDYQDSSLPVGVFVWDVENEGDEALDVSIMFSMRNGLGG 297

Query: 252 DSEFTGQHYNSK---TKMNDGVHAVLLHHRT 279
             +  G  +N      +  + V  +LLHH T
Sbjct: 298 GDDAPGGLWNEPFCLERSGETVRGLLLHHPT 328


>gi|449691194|ref|XP_004212588.1| PREDICTED: non-lysosomal glucosylceramidase-like, partial [Hydra
           magnipapillata]
          Length = 169

 Score =  138 bits (348), Expect = 1e-29,   Method: Compositional matrix adjust.
 Identities = 72/162 (44%), Positives = 100/162 (61%), Gaps = 7/162 (4%)

Query: 740 SYFNYDNSGSSQSSSIQADQLAGQWYARACGLLPIVDE-----DKARSALKKVYNYNVLK 794
           +Y+NYD S +   + I A Q AGQWY  AC L    D+     ++  S LK ++  NV++
Sbjct: 1   TYYNYDGSENDYHNCIMAGQCAGQWYLHACDLGQTEDDLVFRPEQVHSTLKVIFQNNVMR 60

Query: 795 VMGGKRGAVNGMLPDGRVDMSSMQSREIWSGVTYAVAASMIHEDLADIGFQTACGIYEAA 854
              G  GAVNGM  DG VD +S+Q+ E+W+GVTYA+AA+MI   +   GF+TA GIY   
Sbjct: 61  FEDGFMGAVNGMRADGTVDKNSLQAEEVWTGVTYALAATMIQHGMVREGFRTAEGIYRTV 120

Query: 855 WSGTGLGYAFQTPEAWNTDDQYRSLCYMRPLAIWAMQWALTR 896
           +   GLG  FQTPEA   +  +RS  YMRPL+IW++  A ++
Sbjct: 121 YEKWGLG--FQTPEAITANKTFRSRGYMRPLSIWSIYHAYSK 160


>gi|226444574|gb|ACO57861.1| unknown [Helianthus argophyllus]
          Length = 103

 Score =  138 bits (348), Expect = 1e-29,   Method: Compositional matrix adjust.
 Identities = 65/103 (63%), Positives = 75/103 (72%)

Query: 794 KVMGGKRGAVNGMLPDGRVDMSSMQSREIWSGVTYAVAASMIHEDLADIGFQTACGIYEA 853
           KV GG+ GAVNGM P+G+VD + MQSRE+W+GVTY VAA+MIH  + D  F TA GI+ A
Sbjct: 1   KVRGGRMGAVNGMHPNGKVDETCMQSREVWTGVTYGVAATMIHAGMEDNAFTTAEGIFIA 60

Query: 854 AWSGTGLGYAFQTPEAWNTDDQYRSLCYMRPLAIWAMQWALTR 896
            WS  G GYAFQTPE W  D  YRSL YMRPLAIW  Q AL +
Sbjct: 61  GWSEEGFGYAFQTPEGWTMDGSYRSLVYMRPLAIWGKQQALEK 103


>gi|148242847|ref|YP_001228004.1| hypothetical protein SynRCC307_1748 [Synechococcus sp. RCC307]
 gi|147851157|emb|CAK28651.1| Conserved hypothetical protein [Synechococcus sp. RCC307]
          Length = 606

 Score =  135 bits (341), Expect = 8e-29,   Method: Compositional matrix adjust.
 Identities = 111/393 (28%), Positives = 172/393 (43%), Gaps = 40/393 (10%)

Query: 95  HGVPLGGVGSGSIGRSYRGEFQRWQIFPRVCEDKPVLANQFSVFVSRSNGQKYSSVLCPK 154
           HG PLGG G+G IGRS +G+F  W I         +   QF++F    +  +  ++    
Sbjct: 54  HGAPLGGFGAGCIGRSSKGDFNLWNIDGGEHWFGSLPDCQFALFEDNGSDTRAHALAVAP 113

Query: 155 TPEVLKDTTAAGIGSWDW----NLKGDKSTYHALYPRAWTVHEGEPDPELRIVCRQISPI 210
             +  K    A + SW W      +    +Y A YP  W  + G    E    C   SPI
Sbjct: 114 EADSSKPEGEAPLSSWSWYPASTAEKPTGSYSARYPLNWFHYSGVFGCE--ASCESFSPI 171

Query: 211 IPHNYKESSYPVSVFTYTIYNSGKTSADITLLFTWTNSVG----------------GDSE 254
           IP NY++++ PV+VF +   N  +    ++LL +W N+ G                G  E
Sbjct: 172 IPGNYRDTALPVAVFRWRFRNPSRKPLKLSLLLSWRNTTGWFCNTDPSAAVHFRDDGSPE 231

Query: 255 FT-------GQHYNSKTKMNDGVHAVLLHHRTSHQLPPVT--FALAA-QETDGVHVSLCP 304
                     Q   ++     G+  V++    S  +      + +A  +E +GV +  C 
Sbjct: 232 HNYLPAIGATQGQRNRAINAPGLRGVVMEGPMSEPIAEGQGQWCIAVPEELEGVELMRCS 291

Query: 305 HFVISGNSLGLTAKDMWHEIKEHGSFDRLNSMETSVTSEPGSSIGAAIAASVTVPPDSEG 364
            +   G+       ++W      GS    N+   S  ++P   + AA+A    + P +E 
Sbjct: 292 RWNPHGD-----GAEIWQRFAADGSVPESNNDRRSGNNDP---VSAAMALRFELAPGAEL 343

Query: 365 QVTFSLAWDCPEVNFMSGKTYYRRYTKFYGTHQNAAANIARDAILEHGSWELQIEAWQRP 424
           ++   ++WD P   F +G    RRYT F+G   + AA IA DA+L    W  QIE WQ P
Sbjct: 344 ELPVVISWDLPVTAFATGSQCLRRYTDFFGPSGDQAAAIAADALLRWPRWRQQIEEWQAP 403

Query: 425 ILEDKRLPEWYPITLFNELYYLNAGGAVWTDGS 457
           +L    LPE   + L NELY L +GG +W+  S
Sbjct: 404 VLLRDDLPEELRMALCNELYDLASGGTLWSAAS 436



 Score = 47.4 bits (111), Expect = 0.044,   Method: Compositional matrix adjust.
 Identities = 20/57 (35%), Positives = 34/57 (59%)

Query: 529 IGQFLYLEGIEYLMWNTYDVHFYSSFALIMLFPKIQLSIQRDFAAAVMMHDPSKMKL 585
           +G+F  LE ++Y  + + DV  Y SFAL+ L+P +  S+ R FA A+   D ++  +
Sbjct: 441 VGRFGVLECLDYAWYESLDVRLYGSFALLQLWPALDRSVLRSFARAIPAADATQRPI 497


>gi|332796021|ref|YP_004457521.1| hypothetical protein Ahos_0332 [Acidianus hospitalis W1]
 gi|332693756|gb|AEE93223.1| conserved hypothetical protein [Acidianus hospitalis W1]
          Length = 652

 Score =  129 bits (324), Expect = 7e-27,   Method: Compositional matrix adjust.
 Identities = 95/346 (27%), Positives = 160/346 (46%), Gaps = 37/346 (10%)

Query: 553 SFALIMLFPKIQLSIQRDFAAAVMMHDPSKMKLLDDGQWVSRKVLGAVPHDIGICDPWFE 612
           SFAL+  FP++   +   F   +                    + G VPHD+G  +   E
Sbjct: 334 SFALLEYFPELVKKMDEHFGNFI--------------------IDGEVPHDLG--EESIE 371

Query: 613 VNAYCLYDTARWKDLNPKFVLQVYRDVVATGDKKFAKAVWPSVYVAMAYMDQFDRDGDGM 672
              Y       W DL P ++L +YRD   T D  F K  +  +   + ++ + D D DG+
Sbjct: 372 NPIYGASYPYSWNDLGPTWILMIYRDYKFTNDLSFLKRNYNKMKEVIDWLIKKDEDNDGI 431

Query: 673 IENDGFPDQTYDTWSVSGISAYSGGLWVAALQAASALAREVGDRGSEDYFLFK--FQKAK 730
            ++ G  D +YD   + G S+Y G L++ AL+A    ++ +    S DY  ++    KAK
Sbjct: 432 PDSKGGFDNSYDGTYMYGTSSYIGSLFLCALKAFIESSKIL----SYDYSKYEEILNKAK 487

Query: 731 VVYEKLWNGSYFNYDNSGSSQSSSIQADQLAGQWYARACGLLPIVDEDKARSALKKVYNY 790
              E LWNG YF        ++ S    QL G+++    GL  ++DEDK ++ALK +Y +
Sbjct: 488 SSLESLWNGRYFINWKYKDQKNESCLNSQLLGEFWCNLLGLGNVIDEDKIKTALKYIYEH 547

Query: 791 NVLKVMGGKRGAVNGMLPDGRVDMSSMQSREIWSGVTYAVAASMIHEDLADIGFQTACGI 850
           N       K   VN + PDG +D S+ Q +  W  +++A+A+ MI + + + G + A   
Sbjct: 548 N---GKASKYCLVNSVNPDGSIDESTDQMKSCWPRISFAIASHMIMKGMINEGIEIA--- 601

Query: 851 YEAAWSGTGLGYAFQTPEAWN--TDDQYRSLCYMRPLAIWAMQWAL 894
               W      Y +  P   N    + +    Y+  L+I+ +++A+
Sbjct: 602 -RKEWETISSRYPYNQPSKINAFNGEHFGLPYYIGSLSIYLVKYAI 646


>gi|156333782|ref|XP_001619412.1| hypothetical protein NEMVEDRAFT_v1g224207 [Nematostella vectensis]
 gi|156202556|gb|EDO27312.1| predicted protein [Nematostella vectensis]
          Length = 173

 Score =  127 bits (319), Expect = 3e-26,   Method: Compositional matrix adjust.
 Identities = 63/103 (61%), Positives = 75/103 (72%), Gaps = 2/103 (1%)

Query: 798 GKRGAVNGMLPDGRVDMSSMQSREIWSGVTYAVAASMIHEDLADIGFQTACGIYEAAWSG 857
           G  GAVNG+ PDG++D SS+Q+ E+W+GVTYAVAASMI E L D GF+TA GIY   +  
Sbjct: 52  GTMGAVNGIRPDGQLDTSSLQAEEVWTGVTYAVAASMIQEGLVDEGFKTASGIYNTCFE- 110

Query: 858 TGLGYAFQTPEAWNTDDQYRSLCYMRPLAIWAMQWALTRPKPK 900
             LG  FQTPEA   +  YRSL YMRPL+IWAMQWAL + K K
Sbjct: 111 -RLGMNFQTPEAIVANGNYRSLAYMRPLSIWAMQWALEKRKNK 152



 Score = 65.1 bits (157), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 41/109 (37%), Positives = 55/109 (50%), Gaps = 14/109 (12%)

Query: 609 PWFEVNAYCLYDTARWKDLNPKFVLQVYRDVVATGDKKFAKAVWPSVYVAMAYMDQFDRD 668
           PW  VNAY ++DT++WKDLN KFVLQVYRD V T D  + + +WP             ++
Sbjct: 3   PWDHVNAYHIHDTSKWKDLNLKFVLQVYRDYVFTNDVYYLQDMWP-----------ITKE 51

Query: 669 GDGMIENDGFPDQTYDTWSVSGISAYSGGLWVAALQAASALAREVGDRG 717
           G     N   PD   DT S+     ++G   V    AAS +   + D G
Sbjct: 52  GTMGAVNGIRPDGQLDTSSLQAEEVWTG---VTYAVAASMIQEGLVDEG 97


>gi|422908697|ref|ZP_16943374.1| hypothetical protein VCHE09_0171 [Vibrio cholerae HE-09]
 gi|341639159|gb|EGS63785.1| hypothetical protein VCHE09_0171 [Vibrio cholerae HE-09]
          Length = 981

 Score =  127 bits (318), Expect = 4e-26,   Method: Compositional matrix adjust.
 Identities = 167/640 (26%), Positives = 261/640 (40%), Gaps = 101/640 (15%)

Query: 104 SGSIGRSYRGEFQRWQIF------PRVCEDKPVLANQFSVFVSRSNGQKYSSVLCPKTPE 157
           SG    + +  F+RWQI         +  D   L  Q  V +   NG     +L   +  
Sbjct: 94  SGLFYLNNKSNFERWQIEFSDKTQQAITNDPSSLETQLLVLIDFFNG-----LLVNNSAR 148

Query: 158 VLKDTTAAGIGSWDWNLKGDKSTYHALYPRAWTVHEGEPDPELRIVCRQ-ISPIIPHNYK 216
            L  T  A  GS   ++      Y ALYP A   ++G    E   + RQ ISPI+ ++  
Sbjct: 149 ALSLT--ANNGSSVESIDPSDIEYRALYPIAEYEYQG---LESVALTRQVISPIVKNDPM 203

Query: 217 ESSYPVSVFTYTIYNSGKTSADITLLFTWTNSVGGDSEFTGQHYNSKTKMNDGVHAVLLH 276
             S P+    + + N    +  +TL+    N +G          ++  K  +GV     H
Sbjct: 204 LCSLPMHWNHFEMSNRSDKTQLVTLVQPLRNLIG----------STYKKAREGVQDSCCH 253

Query: 277 ---HRTSHQLPPVTFALAAQETDGVHVSLCPHFVISGNSLGLTAKDMWHEIKEHGSFDRL 333
              +  S Q  PVT + A  E  G  VS+  +    G+  G     +      H  F+  
Sbjct: 254 LVQNAISQQHQPVTISGAGFEFHG--VSMTSNSPYDGDISGEVLYGV------HTQFEG- 304

Query: 334 NSMETSVTSEPGSSIGAAIAASVTVPPDSEGQVTFSLAWDCPEVNFMSGKTYYRRYTKFY 393
                          GAA   +VTV P    QV  S A    E    +G+T        Y
Sbjct: 305 ---------------GAAEQVTVTVKP----QVYCSKADQQVERALKTGRTNAHFDRGIY 345

Query: 394 GTHQNAAANIARDAILEHGSWELQIEAWQRPILEDKRLPEWYPITLFNELYYLNAGGAVW 453
              +  +A +     L+ G   + +   Q        L  W     +   +   +     
Sbjct: 346 SGREPLSALVCVQVTLKPGE-TIALRFAQVMDHSKILLDGWESEKAYARYFDAQSRAQSI 404

Query: 454 TDGSPP----VHSLVTIGHRKFSLDWSQSDLKRIVDVPNQNDTAVNILERMSSILEQIYT 509
            + + P    +   +    R+F  + SQ+     VD    ND+A+             + 
Sbjct: 405 LNQTLPRLSEIEQNIISQQREFLDNASQT----FVD----NDSALR------------FA 444

Query: 510 PVALNS-SFGTNLLQDGEENIGQFLYLEGIEYLMWNTYDVHFYSSFALIMLFPKIQLSIQ 568
            +A+NS SF        +EN+   L  E ++Y  +N+ DV+FY SF+L+ L P++   + 
Sbjct: 445 TMAMNSLSFLAESTVWDKENV--LLVKECVDYPFFNSLDVYFYGSFSLLYLLPELDGCVM 502

Query: 569 RDFAAAVMMHDPSKMK-----------LLDDGQWVSRKVLGAVPHDIGICDPW-FEVNAY 616
           + FA A++  D    +           L+D+    SR   GAV HD+G   P+  + +AY
Sbjct: 503 KHFAKAILNSDEKLRRYWEYEDKPYAELVDEKYQGSRAERGAVIHDLG--SPFDIDPDAY 560

Query: 617 CLYDTARWKDLNPKFVLQVYRDVVATGDKKFAKAVWPSVYVAMAYMDQFDRDGDGMIEND 676
             ++   WKDL PKF+L VYR    TGD++  ++ W SV  ++ Y+     DGD +    
Sbjct: 561 SWHNVKEWKDLAPKFILMVYRHWQKTGDREVVESCWSSVKESIDYLINLIEDGDTLPLTR 620

Query: 677 GFPDQTYDTWSVSGISAYSGGLWVAALQAASALAREVGDR 716
           G  D T+D  +  G+S Y G LWVA L+AAS LA  VG+ 
Sbjct: 621 G-TDDTFDNLASHGVSIYCGSLWVAGLRAASELALLVGEE 659


>gi|84386772|ref|ZP_00989797.1| hypothetical protein V12B01_19101 [Vibrio splendidus 12B01]
 gi|84378300|gb|EAP95158.1| hypothetical protein V12B01_19101 [Vibrio splendidus 12B01]
          Length = 908

 Score =  126 bits (316), Expect = 6e-26,   Method: Compositional matrix adjust.
 Identities = 93/309 (30%), Positives = 154/309 (49%), Gaps = 41/309 (13%)

Query: 486 VPNQNDTAVNILERMSSILEQI------------YTPVALNS-SF-GTNLLQDGEENIGQ 531
           +P        I+E+ ++ LEQ             Y  +A+NS SF   + + D E+   +
Sbjct: 337 LPKLETIEQQIVEQQTTFLEQAQNKISQPESALRYATMAMNSLSFLAESTVWDKED---K 393

Query: 532 FLYLEGIEYLMWNTYDVHFYSSFALIMLFPKIQLSIQRDFAAAVMMHDPSKMK------- 584
           FL  E ++Y  +N+ DV+FY SF L+ L P++   + ++F+ A++  D ++ +       
Sbjct: 394 FLVKECVDYPFFNSLDVYFYGSFTLLYLLPELDGCVMKEFSKAILAEDFTERRYWEYEAT 453

Query: 585 ----LLDDGQWVSRKVLGAVPHDIGICDPW-FEVNAYCLYDTARWKDLNPKFVLQVYRDV 639
               L+DD     R + GAV HD+G   P+  + +AY  ++   WKDL PK++L VYR  
Sbjct: 454 PNAELIDDKYQGVRAIRGAVIHDLG--SPFDIQPDAYSWHNVKEWKDLAPKYILMVYRHY 511

Query: 640 VATGDKKFAKAVWPSVYVAMAYMDQFDRDGDGMIENDGFPDQTYDTWSVSGISAYSGGLW 699
             T D    K  W +V  ++ ++     +GD +    G  D T+D  +  GIS Y   LW
Sbjct: 512 QNTQDISVVKECWQAVTESIDFLSNLIAEGDDLPLTRG-TDDTFDNLASHGISIYCASLW 570

Query: 700 VAALQAASALAREVGDRGSEDYFLFKFQKAKVVYEK-LWNGS--YFNYDNSGSSQSSSIQ 756
           VA LQAAS LA+ +G+    + +L + +KA    E+ LW+    Y+++        + +Q
Sbjct: 571 VAGLQAASELAQLMGESERANGYLTRSKKALATVEQSLWDDKEGYYHF------FVTPVQ 624

Query: 757 ADQLAGQWY 765
           A  L G  Y
Sbjct: 625 AKHLTGAGY 633


>gi|148976285|ref|ZP_01813009.1| hypothetical protein VSWAT3_18783 [Vibrionales bacterium SWAT-3]
 gi|145964379|gb|EDK29634.1| hypothetical protein VSWAT3_18783 [Vibrionales bacterium SWAT-3]
          Length = 982

 Score =  126 bits (316), Expect = 6e-26,   Method: Compositional matrix adjust.
 Identities = 92/299 (30%), Positives = 153/299 (51%), Gaps = 41/299 (13%)

Query: 496 ILERMSSILEQI------------YTPVALNS-SF-GTNLLQDGEENIGQFLYLEGIEYL 541
           I+E+ ++ LEQ             Y  +A+NS SF   + + D E+   +FL  E ++Y 
Sbjct: 421 IVEQQTAFLEQAQSKISQPESALRYATMAMNSLSFLAESTVWDKED---KFLVKECVDYP 477

Query: 542 MWNTYDVHFYSSFALIMLFPKIQLSIQRDFAAAVMMHDPSKMK-----------LLDDGQ 590
            +N+ DV+FY SF+L+ L P++   + ++F+ A++  D ++ +           L+D+  
Sbjct: 478 FFNSLDVYFYGSFSLLYLLPELDGCVMKEFSKAILAEDFTQRRYWEYEATPNAELIDEKY 537

Query: 591 WVSRKVLGAVPHDIGICDPW-FEVNAYCLYDTARWKDLNPKFVLQVYRDVVATGDKKFAK 649
              R + GAV HD+G   P+  + +AY  ++   WKDL PK++L VYR    T D    K
Sbjct: 538 QGVRAIRGAVIHDLG--SPFDIQPDAYSWHNVKEWKDLAPKYILMVYRHYQNTQDMSVVK 595

Query: 650 AVWPSVYVAMAYMDQFDRDGDGMIENDGFPDQTYDTWSVSGISAYSGGLWVAALQAASAL 709
             W +V  ++ ++     DGD +    G  D T+D  +  GIS Y   LWVA LQAAS L
Sbjct: 596 ECWQAVTESIDFLSNLIADGDDLPLTRG-TDDTFDNLASHGISIYCASLWVAGLQAASEL 654

Query: 710 AREVGDRGSEDYFLFKFQKAKVVYEK-LWNGS--YFNYDNSGSSQSSSIQADQLAGQWY 765
           A+ +G+      +L + +KA    E+ LW+    Y+++        + +QA  L G+ Y
Sbjct: 655 AKLMGEHDLGAGYLTRSKKALATVEQSLWDEKEGYYHF------FVTPVQAKHLTGEGY 707


>gi|86144926|ref|ZP_01063258.1| hypothetical protein MED222_10953 [Vibrio sp. MED222]
 gi|85837825|gb|EAQ55937.1| hypothetical protein MED222_10953 [Vibrio sp. MED222]
          Length = 1024

 Score =  125 bits (313), Expect = 2e-25,   Method: Compositional matrix adjust.
 Identities = 92/301 (30%), Positives = 152/301 (50%), Gaps = 31/301 (10%)

Query: 508 YTPVALNS-SF-GTNLLQDGEENIGQFLYLEGIEYLMWNTYDVHFYSSFALIMLFPKIQL 565
           Y  +A+NS SF   + + D E+   +FL  E ++Y  +N+ DV+FY SF+L+ L P++  
Sbjct: 487 YATMAMNSLSFLAESTVWDKED---KFLVKECVDYPFFNSLDVYFYGSFSLLYLLPELDG 543

Query: 566 SIQRDFAAAVMMHDPSKMK-----------LLDDGQWVSRKVLGAVPHDIGICDPW-FEV 613
            + ++F+ A++  D ++ +           L+DD     R + GAV HD+G   P+  + 
Sbjct: 544 CVMKEFSKAILAEDFTERRYWEYEATPNAELIDDKYQGVRAIRGAVIHDLG--SPFDIQP 601

Query: 614 NAYCLYDTARWKDLNPKFVLQVYRDVVATGDKKFAKAVWPSVYVAMAYMDQFDRDGDGMI 673
           +AY  ++   WKDL PK++L VYR    T D    K  W +V  ++ ++     +GD + 
Sbjct: 602 DAYSWHNVKEWKDLAPKYILMVYRHYQNTQDISVVKECWQAVTESIDFLSNLIAEGDDLP 661

Query: 674 ENDGFPDQTYDTWSVSGISAYSGGLWVAALQAASALAREVGDRGSEDYFLFKFQKAKVVY 733
              G  D T+D  +  GIS Y   LWVA LQAAS LA+ +G+      +L + +KA    
Sbjct: 662 LTRG-TDDTFDNLASHGISIYCASLWVAGLQAASELAQLMGESELASGYLTRSKKALATV 720

Query: 734 EK-LWNGS--YFNYDNSGSSQSSSIQADQLAGQWYA--RACGLLPIVDEDKARSALKKVY 788
           E+ LW+    Y+++        + +QA  L G+ Y      GL+   D    ++ L    
Sbjct: 721 EQSLWDDKEGYYHF------FVTPVQAKHLTGEGYQALETLGLILTGDAIADKNTLNTYL 774

Query: 789 N 789
           N
Sbjct: 775 N 775


>gi|218677125|ref|YP_002395944.1| hypothetical protein VS_II1384 [Vibrio splendidus LGP32]
 gi|218325393|emb|CAV27494.1| hypothetical protein VS_II1384 [Vibrio splendidus LGP32]
          Length = 1069

 Score =  124 bits (311), Expect = 3e-25,   Method: Compositional matrix adjust.
 Identities = 91/299 (30%), Positives = 152/299 (50%), Gaps = 41/299 (13%)

Query: 496 ILERMSSILEQI------------YTPVALNS-SF-GTNLLQDGEENIGQFLYLEGIEYL 541
           I+E+ ++ LEQ             Y  +A+NS SF   + + D E+   +FL  E ++Y 
Sbjct: 508 IVEQQTTFLEQAQSKISQPESALRYATMAMNSLSFLAESTVWDKED---KFLVKECVDYP 564

Query: 542 MWNTYDVHFYSSFALIMLFPKIQLSIQRDFAAAVMMHDPSKMK-----------LLDDGQ 590
            +N+ DV+FY SF+L+ L P++   + ++F+ A++  D ++ +           L+DD  
Sbjct: 565 FFNSLDVYFYGSFSLLYLLPELDGCVMKEFSKAILAEDFTQRRYWEYEATPNAELIDDKY 624

Query: 591 WVSRKVLGAVPHDIGICDPW-FEVNAYCLYDTARWKDLNPKFVLQVYRDVVATGDKKFAK 649
              R + GAV HD+G   P+  + +AY  ++   WKDL PK++L VYR    T D    K
Sbjct: 625 QGVRAIRGAVIHDLG--SPFDIQPDAYSWHNVKEWKDLAPKYILMVYRHYQNTQDISVVK 682

Query: 650 AVWPSVYVAMAYMDQFDRDGDGMIENDGFPDQTYDTWSVSGISAYSGGLWVAALQAASAL 709
             W +V  ++ ++     +GD +    G  D T+D  +  GIS Y   LWVA LQAAS L
Sbjct: 683 ECWQAVTESIDFLSNLIAEGDDLPLTRG-TDDTFDNLASHGISIYCASLWVAGLQAASEL 741

Query: 710 AREVGDRGSEDYFLFKFQKAKVVYEK-LWNGS--YFNYDNSGSSQSSSIQADQLAGQWY 765
           A+ + +      +L + +KA    E+ LW+    Y+++        + +QA  L G+ Y
Sbjct: 742 AKLMNESELAAGYLTRSKKALATVEQSLWDDKEGYYHF------FVTPVQAKHLTGEGY 794


>gi|417950561|ref|ZP_12593680.1| hypothetical protein VISP3789_13285 [Vibrio splendidus ATCC 33789]
 gi|342806239|gb|EGU41472.1| hypothetical protein VISP3789_13285 [Vibrio splendidus ATCC 33789]
          Length = 1046

 Score =  124 bits (310), Expect = 3e-25,   Method: Compositional matrix adjust.
 Identities = 91/299 (30%), Positives = 153/299 (51%), Gaps = 41/299 (13%)

Query: 496 ILERMSSILEQI------------YTPVALNS-SF-GTNLLQDGEENIGQFLYLEGIEYL 541
           I+E+ ++ LEQ             Y  +A+NS SF   + + D E+   +FL  E ++Y 
Sbjct: 485 IVEQQTAFLEQAQSKISQPESALRYATMAMNSLSFLAESTVWDKED---KFLVKECVDYP 541

Query: 542 MWNTYDVHFYSSFALIMLFPKIQLSIQRDFAAAVMMHDPSKMK-----------LLDDGQ 590
            +N+ DV+FY SF+L+ L P++   + ++F+ A++  D ++ +           L+D+  
Sbjct: 542 FFNSLDVYFYGSFSLLYLLPELDGCVMKEFSKAILAEDFTQRRYWEYEATPNAELIDEKY 601

Query: 591 WVSRKVLGAVPHDIGICDPW-FEVNAYCLYDTARWKDLNPKFVLQVYRDVVATGDKKFAK 649
              R + GAV HD+G   P+  + +AY  ++   WKDL PK++L VYR    T D    K
Sbjct: 602 QGVRAIRGAVIHDLG--SPFDIQPDAYSWHNVKEWKDLAPKYILMVYRHYQKTQDISVVK 659

Query: 650 AVWPSVYVAMAYMDQFDRDGDGMIENDGFPDQTYDTWSVSGISAYSGGLWVAALQAASAL 709
             W +V  ++ ++     +GD +    G  D T+D  +  GIS Y   LWVA LQAAS L
Sbjct: 660 ECWQAVTESIDFLSNLIAEGDDLPLTRG-TDDTFDNLASHGISIYCASLWVAGLQAASEL 718

Query: 710 AREVGDRGSEDYFLFKFQKAKVVYEK-LWNGS--YFNYDNSGSSQSSSIQADQLAGQWY 765
           A+ +G+      +L + +KA    E+ LW+    Y+++        + +QA  L G+ Y
Sbjct: 719 AKLMGEHDLGAGYLTRSKKALATVEQSLWDEKEGYYHF------FVTPVQAKHLTGEGY 771


>gi|47196687|emb|CAF89164.1| unnamed protein product [Tetraodon nigroviridis]
          Length = 239

 Score =  123 bits (308), Expect = 5e-25,   Method: Compositional matrix adjust.
 Identities = 67/156 (42%), Positives = 91/156 (58%), Gaps = 20/156 (12%)

Query: 534 YLEGIEYLMWNTYDVHFYSSFALIMLFPKIQLSIQRDFAAAVMMHDPSKMKLLDDGQWVS 593
           +  G EY M+NTYDVH+Y+SFALIML+PK+ LS+Q D A +V+  DP++   L +G+   
Sbjct: 1   FFPGQEYRMYNTYDVHYYASFALIMLWPKLALSLQYDIAGSVVQQDPTERLYLMNGRCSP 60

Query: 594 RKVLGAVPHDIGICDPWFEVNAYCLYDTARWKDLNPKFVLQVYRDVVATGDKKFAKAVWP 653
            K  G VPHDIG  DP          +   W+        +VYRD   T D ++ + +WP
Sbjct: 61  VKTKGVVPHDIG--DP----------EDEPWQ--------RVYRDFHLTQDAQYLRDMWP 100

Query: 654 SVYVAMAYMDQFDRDGDGMIENDGFPDQTYDTWSVS 689
              +      QFD DGDG+IEN G+ DQTYD W+ S
Sbjct: 101 ICEMVTESELQFDLDGDGLIENSGYADQTYDGWAAS 136



 Score = 72.8 bits (177), Expect = 8e-10,   Method: Compositional matrix adjust.
 Identities = 32/61 (52%), Positives = 46/61 (75%)

Query: 777 EDKARSALKKVYNYNVLKVMGGKRGAVNGMLPDGRVDMSSMQSREIWSGVTYAVAASMIH 836
           ++K ++ALK +++ NV+   GG+ GAVNGM P+G  D SS+QS E+W GV Y +AA+MIH
Sbjct: 178 KEKIQTALKSIFDLNVMSFAGGQMGAVNGMRPEGVPDRSSVQSDEVWIGVVYGLAATMIH 237

Query: 837 E 837
           E
Sbjct: 238 E 238


>gi|15922862|ref|NP_378531.1| hypothetical protein ST2526 [Sulfolobus tokodaii str. 7]
 gi|15623653|dbj|BAB67640.1| putative glycosidase [Sulfolobus tokodaii str. 7]
          Length = 646

 Score =  122 bits (307), Expect = 6e-25,   Method: Compositional matrix adjust.
 Identities = 79/252 (31%), Positives = 127/252 (50%), Gaps = 11/252 (4%)

Query: 598 GAVPHDIGICDPWFEVNAYCLYDTARWKDLNPKFVLQVYRDVVATGDKKFAKAVWPSVYV 657
           G VPHD+G  +   E   Y       W DL   +VL +YRD   T D  F K  +  +  
Sbjct: 354 GEVPHDLG--EESIEAPIYGASYGYPWTDLGSTWVLMIYRDYKFTNDLAFLKRNYRKMKE 411

Query: 658 AMAYMDQFDRDGDGMIENDGFPDQTYDTWSVSGISAYSGGLWVAALQAASALAREVGDRG 717
            + ++   D+D D + ++ G  D +YD   + G S+Y G +++ AL+A  + ++ +G   
Sbjct: 412 VIDWLISLDKDKDCIPDSKGGFDNSYDGTYMYGASSYVGSMFLCALRAFISASKILG--- 468

Query: 718 SEDYFLFK--FQKAKVVYEKLWNGSYFNYDNSGSSQSSSIQADQLAGQWYARACGLLPIV 775
             +Y +++   ++  + +  LWNG YF    S + +  S  + Q+ GQ++    GL PI+
Sbjct: 469 -MEYSIYEDCLRRGIMTFNSLWNGKYFIAWKSDNRKKESCMSSQILGQFWCDILGLEPII 527

Query: 776 DEDKARSALKKVYNYNVLKVMGGKRGAVNGMLPDGRVDMSSMQSREIWSGVTYAVAASMI 835
           DEDK   AL+ +Y  N       K   VN + PDG +D  + Q R  WS V +AV+A MI
Sbjct: 528 DEDKIVQALRSIYELN---GKASKFCLVNSVNPDGSIDTETDQMRSCWSRVAFAVSAHMI 584

Query: 836 HEDLADIGFQTA 847
              L + G + A
Sbjct: 585 IRGLKNEGIEIA 596


>gi|374921957|gb|AFA26156.1| non-lysosomal glucosylceramidase-like protein, partial [Lolium
           perenne]
          Length = 109

 Score =  122 bits (307), Expect = 8e-25,   Method: Compositional matrix adjust.
 Identities = 72/109 (66%), Positives = 84/109 (77%), Gaps = 1/109 (0%)

Query: 657 VAMAYMDQFDRDGDGMIENDGFPDQTYDTWSVSGISAYSGGLWVAALQAASALAREVGDR 716
            AM YM+QFDRD D MIENDGFPDQTYD W+V G+SAY G LW+A+LQAA+A+AR +G  
Sbjct: 1   TAMEYMEQFDRDDDSMIENDGFPDQTYDAWTVLGVSAYCGCLWLASLQAAAAMARSLGHA 60

Query: 717 GSEDYFLFKFQKAKVVYE-KLWNGSYFNYDNSGSSQSSSIQADQLAGQW 764
              +  + KF KAK V+E KLWNGSYFNYD+  S  S SIQADQLAGQW
Sbjct: 61  DYAERCMVKFAKAKHVFEAKLWNGSYFNYDSGTSYSSRSIQADQLAGQW 109


>gi|355689818|gb|AER98955.1| glucosidase, beta 2 [Mustela putorius furo]
          Length = 157

 Score =  121 bits (303), Expect = 2e-24,   Method: Compositional matrix adjust.
 Identities = 64/166 (38%), Positives = 98/166 (59%), Gaps = 12/166 (7%)

Query: 117 RWQIFPRVCEDKPVLANQFSVFVSRSNGQKYSSVLCPKTPEVLKDTTAAGIGSWDWNLKG 176
           RWQ+ P + + + V+A+QF+V + R     Y  VL  + P VL+        SW+W L G
Sbjct: 1   RWQLNPGMYQHRTVIADQFTVCLRREGKTVYQQVLSVERPSVLR--------SWNWGLCG 52

Query: 177 DKSTYHALYPRAWTVHEGEPDPELRIVCRQISPIIPHNYKESSYPVSVFTYTIYNSGKTS 236
             + YHALYPRAWTV++  P   + + CRQI+PI+PH+Y++SS PV VF + + N G  +
Sbjct: 53  YFAFYHALYPRAWTVYQ-LPGQNVTLTCRQITPILPHDYQDSSLPVGVFVWDVENEGDEA 111

Query: 237 ADITLLFTWTNSVGGDSEFTGQHYNSKTKM-NDG--VHAVLLHHRT 279
            D++++F+  N +G   +  G  +N    +  DG  V  +LLHH T
Sbjct: 112 LDVSIMFSMRNGLGVGDDAPGGLWNEPFCLERDGETVQGLLLHHPT 157


>gi|284173198|ref|ZP_06387167.1| hypothetical protein Ssol98_00875 [Sulfolobus solfataricus 98/2]
          Length = 647

 Score =  121 bits (303), Expect = 2e-24,   Method: Compositional matrix adjust.
 Identities = 87/304 (28%), Positives = 143/304 (47%), Gaps = 18/304 (5%)

Query: 598 GAVPHDIG---ICDPWFEVNAYCLYDTARWKDLNPKFVLQVYRDVVATGDKKFAKAVWPS 654
           G  PHD+G   I DP +   +Y  +    W DL P  VL +YRD V T +++  +  +  
Sbjct: 355 GETPHDVGEESIEDPIYGA-SYPYW----WTDLGPTLVLMLYRDYVFTSNREILEKNYNK 409

Query: 655 VYVAMAYMDQFDRDGDGMIENDGFPDQTYDTWSVSGISAYSGGLWVAALQAASALAREVG 714
           +   + ++ + D D D + ++ G  D +YD   + G S+Y   ++++AL A   ++  + 
Sbjct: 410 IKEIIDWLIRKDMDNDCIPDSKGGYDNSYDGTHMYGASSYIASMFLSALTAFIKMSEILD 469

Query: 715 DRGSEDYFLFKFQKAKVVYEKLWNGSYFNYDNSGSSQSSSIQADQLAGQWYARACGLLPI 774
            +  + Y+ F  +  K  +  LWNG YF        +++S    QL GQ++    GL PI
Sbjct: 470 VKIDDKYYRF-LECGKKTFNSLWNGKYFILWKKNDEENTSCLNSQLLGQFWCDILGLPPI 528

Query: 775 VDEDKARSALKKVYNYNVLKVMGGKRGAVNGMLPDGRVDMSSMQSREIWSGVTYAVAASM 834
            D DK  +AL+ +Y  N       K    N +  DG VD S+ Q R  W  V++AVAA M
Sbjct: 529 TDHDKINTALRSIYELN---FKASKYCLTNAVREDGSVDSSTAQLRSCWPRVSFAVAAHM 585

Query: 835 IHEDLADIGFQTACGIYEAAWSGTGLGYAF-QTPEAWNTDDQYRSL-CYMRPLAIWAMQW 892
           I   +   G + A    +  W        F Q+      D +Y  L  Y+   ++W ++ 
Sbjct: 586 ILRGMVKEGIEVA----KREWETIKELNPFDQSSRIDAIDGKYVGLMSYIGSTSVWLLKL 641

Query: 893 ALTR 896
           AL +
Sbjct: 642 ALDK 645


>gi|15899397|ref|NP_344002.1| hypothetical protein SSO2674 [Sulfolobus solfataricus P2]
 gi|384433011|ref|YP_005642369.1| hypothetical protein [Sulfolobus solfataricus 98/2]
 gi|13815992|gb|AAK42792.1| Conserved hypothetical protein [Sulfolobus solfataricus P2]
 gi|261601165|gb|ACX90768.1| protein of unknown function DUF608 [Sulfolobus solfataricus 98/2]
          Length = 648

 Score =  120 bits (302), Expect = 2e-24,   Method: Compositional matrix adjust.
 Identities = 87/304 (28%), Positives = 143/304 (47%), Gaps = 18/304 (5%)

Query: 598 GAVPHDIG---ICDPWFEVNAYCLYDTARWKDLNPKFVLQVYRDVVATGDKKFAKAVWPS 654
           G  PHD+G   I DP +   +Y  +    W DL P  VL +YRD V T +++  +  +  
Sbjct: 356 GETPHDVGEESIEDPIYGA-SYPYW----WTDLGPTLVLMLYRDYVFTSNREILEKNYNK 410

Query: 655 VYVAMAYMDQFDRDGDGMIENDGFPDQTYDTWSVSGISAYSGGLWVAALQAASALAREVG 714
           +   + ++ + D D D + ++ G  D +YD   + G S+Y   ++++AL A   ++  + 
Sbjct: 411 IKEIIDWLIRKDMDNDCIPDSKGGYDNSYDGTHMYGASSYIASMFLSALTAFIKMSEILD 470

Query: 715 DRGSEDYFLFKFQKAKVVYEKLWNGSYFNYDNSGSSQSSSIQADQLAGQWYARACGLLPI 774
            +  + Y+ F  +  K  +  LWNG YF        +++S    QL GQ++    GL PI
Sbjct: 471 VKIDDKYYRF-LECGKKTFNSLWNGKYFILWKKNDEENTSCLNSQLLGQFWCDILGLPPI 529

Query: 775 VDEDKARSALKKVYNYNVLKVMGGKRGAVNGMLPDGRVDMSSMQSREIWSGVTYAVAASM 834
            D DK  +AL+ +Y  N       K    N +  DG VD S+ Q R  W  V++AVAA M
Sbjct: 530 TDHDKINTALRSIYELN---FKASKYCLTNAVREDGSVDSSTAQLRSCWPRVSFAVAAHM 586

Query: 835 IHEDLADIGFQTACGIYEAAWSGTGLGYAF-QTPEAWNTDDQYRSL-CYMRPLAIWAMQW 892
           I   +   G + A    +  W        F Q+      D +Y  L  Y+   ++W ++ 
Sbjct: 587 ILRGMVKEGIEVA----KREWETIKELNPFDQSSRIDAIDGKYVGLMSYIGSTSVWLLKL 642

Query: 893 ALTR 896
           AL +
Sbjct: 643 ALDK 646


>gi|159042199|ref|YP_001541451.1| hypothetical protein Cmaq_1638 [Caldivirga maquilingensis IC-167]
 gi|157921034|gb|ABW02461.1| protein of unknown function DUF608 [Caldivirga maquilingensis
           IC-167]
          Length = 661

 Score =  120 bits (301), Expect = 3e-24,   Method: Compositional matrix adjust.
 Identities = 90/298 (30%), Positives = 138/298 (46%), Gaps = 32/298 (10%)

Query: 553 SFALIMLFPKIQLSIQRDFAAAVMMHDPSKMKLLDDGQWVSRKVLGAVPHDIG---ICDP 609
           SFAL+ LFP   L+++ D      +H+                  G VPHD+G   I DP
Sbjct: 339 SFALLDLFP--DLTVKMDELLGFYIHN------------------GEVPHDLGEESIEDP 378

Query: 610 WFEVNAYCLYDTARWKDLNPKFVLQVYRDVVATGDKKFAKAVWPSVYVAMAYMDQFDRDG 669
            +   A  LY    W DL   ++L +YRD + TG+ +  +     +   + ++   D DG
Sbjct: 379 IY--GASYLYP---WNDLGSTWILMIYRDYLLTGNVEVLRRNIDKMREVIDWLISRDYDG 433

Query: 670 DGMIENDGFPDQTYDTWSVSGISAYSGGLWVAALQAASALAREVGDRGSEDYFLFKFQKA 729
           D + ++ G  D +YD  ++ G S+Y   L++ +LQA    A  +G R S D +     K 
Sbjct: 434 DCIPDSRGGFDNSYDGTNMYGASSYIASLFLCSLQAFIKSAEVLGVRLS-DRYESCLSKG 492

Query: 730 KVVYEKLWNGSYFNYDNSGSSQSSSIQADQLAGQWYARACGLLPIVDEDKARSALKKVYN 789
           +     LWNG YF    S  + + S    QL GQ++     L P+VDEDK + AL+ +Y 
Sbjct: 493 RETLNSLWNGRYFMAWKSSGNSNESCMNSQLLGQFWCDFLKLPPVVDEDKIKVALRSIYE 552

Query: 790 YNVLKVMGGKRGAVNGMLPDGRVDMSSMQSREIWSGVTYAVAASMIHEDLADIGFQTA 847
            N            N + P G +D SS Q R  W  V++ V A M+   + + G + A
Sbjct: 553 LN---HKSSPHCLPNSVKPSGEIDTSSGQMRSCWPRVSFVVTAHMVLRGMVNEGLEIA 607


>gi|407069309|ref|ZP_11100147.1| hypothetical protein VcycZ_07137 [Vibrio cyclitrophicus ZF14]
          Length = 1024

 Score =  119 bits (299), Expect = 6e-24,   Method: Compositional matrix adjust.
 Identities = 85/275 (30%), Positives = 142/275 (51%), Gaps = 29/275 (10%)

Query: 508 YTPVALNS-SF-GTNLLQDGEENIGQFLYLEGIEYLMWNTYDVHFYSSFALIMLFPKIQL 565
           Y  +A+NS SF   + + D E+   +FL  E ++Y  +N+ DV+FY SF+L+ L P++  
Sbjct: 487 YATMAMNSLSFLAESTVWDKED---KFLVKECVDYPFFNSLDVYFYGSFSLLYLLPELDG 543

Query: 566 SIQRDFAAAVMMHDPSKMK-----------LLDDGQWVSRKVLGAVPHDIGICDPW-FEV 613
            + ++F+ A++  D ++ +           L+D+     R + GAV HD+G   P+  + 
Sbjct: 544 CVMKEFSKAILAEDFTQRRYWEYEATPNAELIDEKYQGVRAIRGAVIHDLG--SPFDIQP 601

Query: 614 NAYCLYDTARWKDLNPKFVLQVYRDVVATGDKKFAKAVWPSVYVAMAYMDQFDRDGDGMI 673
           +AY  ++   WKDL PK++L VYR    T D    K  W +V  ++ ++     +GD + 
Sbjct: 602 DAYSWHNVKEWKDLAPKYILMVYRHYQNTQDLSVVKECWQAVTESIEFLSNLVAEGDDLP 661

Query: 674 ENDGFPDQTYDTWSVSGISAYSGGLWVAALQAASALAREVGDRGSEDYFLFKFQKAKVVY 733
              G  D T+D  +  GIS Y   LWVA LQAA+ LA  + +      +L + +KA    
Sbjct: 662 LTRG-TDDTFDNLASHGISIYCASLWVAGLQAANELALLMNENELAADYLARSKKALATV 720

Query: 734 EK-LWNGS--YFNYDNSGSSQSSSIQADQLAGQWY 765
           E+ LW+    Y+++        + +QA  L GQ Y
Sbjct: 721 EQSLWDDKEGYYHF------FVTPVQAKHLTGQGY 749



 Score = 43.5 bits (101), Expect = 0.57,   Method: Compositional matrix adjust.
 Identities = 34/117 (29%), Positives = 56/117 (47%), Gaps = 10/117 (8%)

Query: 758 DQLAGQWYARACGLLPIVDEDKARSALKKVYNYNVLKVMGGKRGAVNGMLPDGRVDMSSM 817
           D L    Y +  GL  I  ED+ + AL  VY +N  ++   K G  N  L DG     + 
Sbjct: 821 DALLADSYLKLIGLEGIFPEDRIQRALDYVYKHN-FEINSPKLGVANMTLADGS-PHEAF 878

Query: 818 QSREIWSGVTYAVAASMIHEDLADIGFQTAC---GIYEAAWSGTGLGYAFQTPEAWN 871
           Q++++W GV ++VA ++   +LA    Q       +Y A +  + + +A   PE +N
Sbjct: 879 QAQDVWIGVQFSVATAL---NLAGKSQQAETLMDTVYTALYDYSKIPFA--APEGFN 930


>gi|284997264|ref|YP_003419031.1| hypothetical protein [Sulfolobus islandicus L.D.8.5]
 gi|284445159|gb|ADB86661.1| protein of unknown function DUF608 [Sulfolobus islandicus L.D.8.5]
          Length = 740

 Score =  117 bits (293), Expect = 3e-23,   Method: Compositional matrix adjust.
 Identities = 72/235 (30%), Positives = 121/235 (51%), Gaps = 6/235 (2%)

Query: 598 GAVPHDIGICDPWFEVNAYCLYDTARWKDLNPKFVLQVYRDVVATGDKKFAKAVWPSVYV 657
           G VPHD+GI      V+        +WKD NP F+L VYR    T D  F K ++P +  
Sbjct: 450 GYVPHDLGIHSLDHPVDGTT--SPPKWKDTNPTFILLVYRYYKFTNDLDFLKEMYPKMLK 507

Query: 658 AMAYMDQFDRDGDGMIENDGFPDQTYDTWSVSGISAYSGGLWVAALQAASALAREVGDRG 717
           AM +    D+D DG+ E +G  D  +D  SV GI +Y+  +++A++ A    A+ + D  
Sbjct: 508 AMKWELTQDKDKDGVPELEGQGDTGFDAMSVKGIDSYTTSIYIASIIALKETAKILKDND 567

Query: 718 SEDYFLFKFQKAKVVYEKLWNGSYFNYDNSGSSQSSSIQADQLAGQWYARACGLLPIVDE 777
           + +      +K++ ++ +L+NG YF+    G  +   I   QL G+W++   GL PI++E
Sbjct: 568 TLNEMTILLEKSRKIFSRLFNGKYFD-PWIGEPEIKGIFLGQLVGEWWSEILGLEPIIEE 626

Query: 778 DKARSALKKVYNYNVLKVMGGKRGAVNGMLPDGRVDMSSMQSREIWSGVTYAVAA 832
           +K  SAL+ + + N            N +  DG++   S QS   W  + +A+++
Sbjct: 627 EKISSALEAMLSIN---ANSSNFCTPNIVSEDGKIIDISPQSYSSWPRLVFAISS 678



 Score = 47.0 bits (110), Expect = 0.052,   Method: Compositional matrix adjust.
 Identities = 72/360 (20%), Positives = 130/360 (36%), Gaps = 48/360 (13%)

Query: 96  GVPLGGVGSGSIGRSYRGEFQRWQIFPRVCEDKPVLANQFSVFVSRSNGQKYSSVLCPKT 155
           GVPLGG+G+G I  + +G+     IF       P +   F + +   +G  +        
Sbjct: 74  GVPLGGIGAGKIEINNKGKVVNVTIFNNKGFPIPNIRG-FHILIKPDDGNPFF------- 125

Query: 156 PEVLKDTTAAGIGSWDWNLKGDKSTYHALYPRAWTVHEGEPDPELRIVCRQISPIIPHNY 215
             + KD     +     +L+ D   Y  LYP A+ +  G  D ++R      SP+IP N 
Sbjct: 126 --LEKDLPFISLK----HLESDILIYEGLYPFAYLI--GIKD-KIRAKLTAFSPLIPRNI 176

Query: 216 KESSYPVSVFTYTIYNSGKTSADITLLFTWTNSVGGDSEFTGQHYNSKTKMNDGVHAVLL 275
            +S+ P   F+  I   G  S  I + F             G +   +   +D    +  
Sbjct: 177 NDSTLPAVGFSLKI--EGSNSGLIAISF---------PNIVGSNPIGRVNRSDEDKIIFT 225

Query: 276 HHRTSHQLP-PVTFALAAQETDGV---HVSLCPHFVISGNSL--GLTAKDMWHEIKEHGS 329
           + R+    P      L    TD +   ++++ P   +   S       +D W  I     
Sbjct: 226 NLRSIDTDPRKGEIVLMGDTTDLITQYNINVKPEIALKNKSWKEKFENEDPWKSIINGEK 285

Query: 330 FDRLNSMETSVTSEPGSSIGAAIAASVTVPPDSEGQVTFSLAWDCPEVNFMSGKTYYRRY 389
           +       + +  +P        A  +    + + ++ F L+W      ++ GK     Y
Sbjct: 286 YKEDVHEVSGLWDDP--------AGMLITRYNKDSEIKFVLSW------YIRGKHVLYPY 331

Query: 390 TKFYGTHQNAAANIARDAILEHGSWELQIEAWQRPILEDKRLPEWYPITLFNELYYLNAG 449
             +Y  + + A  IA            Q   + + +  +  LPEW    + N  Y L++ 
Sbjct: 332 GTYYQRNFSNAIEIAEYFYKNFERLRNQSTNFHKIVYNNTDLPEWLKDAIINSAYILSSN 391


>gi|327310049|ref|YP_004336946.1| hypothetical protein TUZN_0133 [Thermoproteus uzoniensis 768-20]
 gi|326946528|gb|AEA11634.1| hypothetical protein TUZN_0133 [Thermoproteus uzoniensis 768-20]
          Length = 660

 Score =  117 bits (292), Expect = 4e-23,   Method: Compositional matrix adjust.
 Identities = 88/299 (29%), Positives = 138/299 (46%), Gaps = 32/299 (10%)

Query: 552 SSFALIMLFPKIQLSIQRDFAAAVMMHDPSKMKLLDDGQWVSRKVLGAVPHDIG---ICD 608
           + FAL+ LFP + +     FA                      +  G +PHD G   I D
Sbjct: 334 AGFALLELFPDLVVRADEVFAMY--------------------RRGGEIPHDYGEESIED 373

Query: 609 PWFEVNAYCLYDTARWKDLNPKFVLQVYRDVVATGDKKFAKAVWPSVYVAMAYMDQFDRD 668
           P +       Y T  WKDL   +VL +YRD   TGD +  +    ++  A+ ++   DRD
Sbjct: 374 PIYGAT----YLTP-WKDLASTWVLMIYRDYYYTGDVEILRRNIDAMKDAVDWLMSLDRD 428

Query: 669 GDGMIENDGFPDQTYDTWSVSGISAYSGGLWVAALQAASALAREVGDRGSEDYFLFKFQK 728
           GD + ++ G  D +YD  ++ G  AY   L++ AL A    A  +G +  E  +L   ++
Sbjct: 429 GDCVPDSRGRNDNSYDGSNMYGRVAYVASLFLCALTAYIKAAERLGLK-PEPKYLECLER 487

Query: 729 AKVVYEKLWNGSYFNYDNSGSSQSSSIQADQLAGQWYARACGLLPIVDEDKARSALKKVY 788
            K   ++LWNG Y+     G  +  +  + QL GQ++     L PI  +DK  +AL+ +Y
Sbjct: 488 GKRSLDELWNGRYYVAWADGPRRKDACMSSQLLGQFWCDMLDLPPIAPDDKVAAALRSIY 547

Query: 789 NYNVLKVMGGKRGAVNGMLPDGRVDMSSMQSREIWSGVTYAVAASMIHEDLADIGFQTA 847
               L +   K    N + PDG+ D  + Q R  W+ V +AVAA M+   +   G + A
Sbjct: 548 E---LGMRASKYCIPNSVTPDGKPDEETPQLRSCWTRVNFAVAAHMLLRGMEREGMEVA 603


>gi|385774575|ref|YP_005647144.1| hypothetical protein [Sulfolobus islandicus HVE10/4]
 gi|323478691|gb|ADX83929.1| conserved hypothetical protein [Sulfolobus islandicus HVE10/4]
          Length = 661

 Score =  115 bits (289), Expect = 9e-23,   Method: Compositional matrix adjust.
 Identities = 77/253 (30%), Positives = 120/253 (47%), Gaps = 12/253 (4%)

Query: 598 GAVPHDIG---ICDPWFEVNAYCLYDTARWKDLNPKFVLQVYRDVVATGDKKFAKAVWPS 654
           G  PHDIG   I DP +   +Y  +    W DL P  VL +YRD V T D+   +  +  
Sbjct: 369 GEAPHDIGEESIEDPIYGA-SYPYW----WTDLGPTLVLMLYRDYVFTSDRGILEKNYNK 423

Query: 655 VYVAMAYMDQFDRDGDGMIENDGFPDQTYDTWSVSGISAYSGGLWVAALQAASALAREVG 714
           +   + ++ + D D D + ++ G  D +YD   + G S+Y   ++++AL A   ++  + 
Sbjct: 424 IKEIIDWLIRKDMDNDCIPDSKGGYDNSYDGTHMYGASSYVASMFLSALAAFIKISEILD 483

Query: 715 DRGSEDYFLFKFQKAKVVYEKLWNGSYFNYDNSGSSQSSSIQADQLAGQWYARACGLLPI 774
            +    Y+ F  +  K  +  LWNG YF        ++ S    QL GQ++    GL PI
Sbjct: 484 VKIDTKYYRF-LECGKKTFNSLWNGKYFVLWKKNDEENRSCLNSQLLGQFWCDILGLPPI 542

Query: 775 VDEDKARSALKKVYNYNVLKVMGGKRGAVNGMLPDGRVDMSSMQSREIWSGVTYAVAASM 834
            D DK  +AL+ +Y  N       K    N +  D  +D S+ Q R  W  V++AVAA M
Sbjct: 543 TDNDKINTALRSIYELN---FRASKYCLTNAVKEDRDIDTSTGQLRSCWPRVSFAVAAHM 599

Query: 835 IHEDLADIGFQTA 847
           I   +   G + A
Sbjct: 600 ILRGMVKEGMEVA 612


>gi|385777223|ref|YP_005649791.1| hypothetical protein [Sulfolobus islandicus REY15A]
 gi|323475970|gb|ADX86576.1| conserved hypothetical protein [Sulfolobus islandicus REY15A]
          Length = 661

 Score =  115 bits (289), Expect = 9e-23,   Method: Compositional matrix adjust.
 Identities = 77/253 (30%), Positives = 120/253 (47%), Gaps = 12/253 (4%)

Query: 598 GAVPHDIG---ICDPWFEVNAYCLYDTARWKDLNPKFVLQVYRDVVATGDKKFAKAVWPS 654
           G  PHDIG   I DP +   +Y  +    W DL P  VL +YRD V T D+   +  +  
Sbjct: 369 GEAPHDIGEESIEDPIYGA-SYPYW----WTDLGPTLVLMLYRDYVFTSDRGILEKNYNK 423

Query: 655 VYVAMAYMDQFDRDGDGMIENDGFPDQTYDTWSVSGISAYSGGLWVAALQAASALAREVG 714
           +   + ++ + D D D + ++ G  D +YD   + G S+Y   ++++AL A   ++  + 
Sbjct: 424 IKEIIDWLIRKDMDNDCIPDSKGGYDNSYDGTHMYGASSYVASMFLSALAAFIKISEILD 483

Query: 715 DRGSEDYFLFKFQKAKVVYEKLWNGSYFNYDNSGSSQSSSIQADQLAGQWYARACGLLPI 774
            +    Y+ F  +  K  +  LWNG YF        ++ S    QL GQ++    GL PI
Sbjct: 484 VKIDTKYYRF-LECGKKTFNSLWNGKYFVLWKKNDEENRSCLNSQLLGQFWCDILGLPPI 542

Query: 775 VDEDKARSALKKVYNYNVLKVMGGKRGAVNGMLPDGRVDMSSMQSREIWSGVTYAVAASM 834
            D DK  +AL+ +Y  N       K    N +  D  +D S+ Q R  W  V++AVAA M
Sbjct: 543 TDNDKINTALRSIYELN---FRASKYCLTNAVKEDRDIDTSTGQLRSCWPRVSFAVAAHM 599

Query: 835 IHEDLADIGFQTA 847
           I   +   G + A
Sbjct: 600 ILRGMVKEGMEVA 612


>gi|227831601|ref|YP_002833381.1| hypothetical protein LS215_2801 [Sulfolobus islandicus L.S.2.15]
 gi|229580553|ref|YP_002838953.1| hypothetical protein YG5714_2812 [Sulfolobus islandicus Y.G.57.14]
 gi|238621059|ref|YP_002915885.1| hypothetical protein M164_2625 [Sulfolobus islandicus M.16.4]
 gi|284999153|ref|YP_003420921.1| hypothetical protein [Sulfolobus islandicus L.D.8.5]
 gi|227458049|gb|ACP36736.1| protein of unknown function DUF608 [Sulfolobus islandicus L.S.2.15]
 gi|228011269|gb|ACP47031.1| protein of unknown function DUF608 [Sulfolobus islandicus
           Y.G.57.14]
 gi|238382129|gb|ACR43217.1| protein of unknown function DUF608 [Sulfolobus islandicus M.16.4]
 gi|284447049|gb|ADB88551.1| protein of unknown function DUF608 [Sulfolobus islandicus L.D.8.5]
          Length = 661

 Score =  115 bits (288), Expect = 1e-22,   Method: Compositional matrix adjust.
 Identities = 77/253 (30%), Positives = 120/253 (47%), Gaps = 12/253 (4%)

Query: 598 GAVPHDIG---ICDPWFEVNAYCLYDTARWKDLNPKFVLQVYRDVVATGDKKFAKAVWPS 654
           G  PHDIG   I DP +   +Y  +    W DL P  VL +YRD V T D+   +  +  
Sbjct: 369 GEAPHDIGEESIEDPIYGA-SYPYW----WTDLGPTLVLMLYRDYVFTSDRGILEKNYNK 423

Query: 655 VYVAMAYMDQFDRDGDGMIENDGFPDQTYDTWSVSGISAYSGGLWVAALQAASALAREVG 714
           +   + ++ + D D D + ++ G  D +YD   + G S+Y   ++++AL A   ++  + 
Sbjct: 424 IKEIIDWLIRKDMDNDCIPDSKGGYDNSYDGTHMYGASSYVASMFLSALAAFIKISEILD 483

Query: 715 DRGSEDYFLFKFQKAKVVYEKLWNGSYFNYDNSGSSQSSSIQADQLAGQWYARACGLLPI 774
            +    Y+ F  +  K  +  LWNG YF        ++ S    QL GQ++    GL PI
Sbjct: 484 VKIDTKYYRF-LECGKKTFNSLWNGKYFVLWKKNDEENRSCLNSQLLGQFWCDILGLPPI 542

Query: 775 VDEDKARSALKKVYNYNVLKVMGGKRGAVNGMLPDGRVDMSSMQSREIWSGVTYAVAASM 834
            D DK  +AL+ +Y  N       K    N +  D  +D S+ Q R  W  V++AVAA M
Sbjct: 543 TDNDKINTALRSIYELN---FRASKYCLTNAVKEDRDIDTSTGQLRSCWPRVSFAVAAHM 599

Query: 835 IHEDLADIGFQTA 847
           I   +   G + A
Sbjct: 600 ILRGMVKEGMEVA 612


>gi|227828867|ref|YP_002830647.1| hypothetical protein M1425_2640 [Sulfolobus islandicus M.14.25]
 gi|227460663|gb|ACP39349.1| protein of unknown function DUF608 [Sulfolobus islandicus M.14.25]
          Length = 661

 Score =  115 bits (288), Expect = 1e-22,   Method: Compositional matrix adjust.
 Identities = 77/253 (30%), Positives = 120/253 (47%), Gaps = 12/253 (4%)

Query: 598 GAVPHDIG---ICDPWFEVNAYCLYDTARWKDLNPKFVLQVYRDVVATGDKKFAKAVWPS 654
           G  PHDIG   I DP +   +Y  +    W DL P  VL +YRD V T D+   +  +  
Sbjct: 369 GEAPHDIGEESIEDPIYGA-SYPYW----WTDLGPTLVLMLYRDYVFTSDRGILEKNYNK 423

Query: 655 VYVAMAYMDQFDRDGDGMIENDGFPDQTYDTWSVSGISAYSGGLWVAALQAASALAREVG 714
           +   + ++ + D D D + ++ G  D +YD   + G S+Y   ++++AL A   ++  + 
Sbjct: 424 IKEIIDWLIRKDMDNDCIPDSKGGYDNSYDGTHMYGASSYVASMFLSALAAFIKISEILD 483

Query: 715 DRGSEDYFLFKFQKAKVVYEKLWNGSYFNYDNSGSSQSSSIQADQLAGQWYARACGLLPI 774
            +    Y+ F  +  K  +  LWNG YF        ++ S    QL GQ++    GL PI
Sbjct: 484 VKIDTKYYRF-LECGKKTFNSLWNGKYFVLWKKNDEENRSCLNSQLLGQFWCDILGLPPI 542

Query: 775 VDEDKARSALKKVYNYNVLKVMGGKRGAVNGMLPDGRVDMSSMQSREIWSGVTYAVAASM 834
            D DK  +AL+ +Y  N       K    N +  D  +D S+ Q R  W  V++AVAA M
Sbjct: 543 TDNDKINTALRSIYELN---FRASKYCLTNAVKEDRDIDTSTGQLRSCWPRVSFAVAAHM 599

Query: 835 IHEDLADIGFQTA 847
           I   +   G + A
Sbjct: 600 ILRGMVKEGMEVA 612


>gi|229586074|ref|YP_002844576.1| hypothetical protein M1627_2694 [Sulfolobus islandicus M.16.27]
 gi|228021124|gb|ACP56531.1| protein of unknown function DUF608 [Sulfolobus islandicus M.16.27]
          Length = 661

 Score =  115 bits (288), Expect = 1e-22,   Method: Compositional matrix adjust.
 Identities = 77/253 (30%), Positives = 120/253 (47%), Gaps = 12/253 (4%)

Query: 598 GAVPHDIG---ICDPWFEVNAYCLYDTARWKDLNPKFVLQVYRDVVATGDKKFAKAVWPS 654
           G  PHDIG   I DP +   +Y  +    W DL P  VL +YRD V T D+   +  +  
Sbjct: 369 GEAPHDIGEESIEDPIYGA-SYPYW----WTDLGPTLVLMLYRDYVFTSDRGILEKNYNK 423

Query: 655 VYVAMAYMDQFDRDGDGMIENDGFPDQTYDTWSVSGISAYSGGLWVAALQAASALAREVG 714
           +   + ++ + D D D + ++ G  D +YD   + G S+Y   ++++AL A   ++  + 
Sbjct: 424 IKEIIDWLIRKDMDNDCIPDSKGGYDNSYDGTHMYGASSYVASMFLSALAAFIKISEILD 483

Query: 715 DRGSEDYFLFKFQKAKVVYEKLWNGSYFNYDNSGSSQSSSIQADQLAGQWYARACGLLPI 774
            +    Y+ F  +  K  +  LWNG YF        ++ S    QL GQ++    GL PI
Sbjct: 484 VKIDTKYYRF-LECGKKTFNSLWNGKYFVLWKKNDEENRSCLNSQLLGQFWCDILGLPPI 542

Query: 775 VDEDKARSALKKVYNYNVLKVMGGKRGAVNGMLPDGRVDMSSMQSREIWSGVTYAVAASM 834
            D DK  +AL+ +Y  N       K    N +  D  +D S+ Q R  W  V++AVAA M
Sbjct: 543 TDNDKINTALRSIYELN---FRASKYCLTNAVKEDRDIDTSTGQLRSCWPRVSFAVAAHM 599

Query: 835 IHEDLADIGFQTA 847
           I   +   G + A
Sbjct: 600 ILRGMVKEGMEVA 612


>gi|229583411|ref|YP_002841810.1| hypothetical protein YN1551_3001 [Sulfolobus islandicus Y.N.15.51]
 gi|228014127|gb|ACP49888.1| protein of unknown function DUF608 [Sulfolobus islandicus
           Y.N.15.51]
          Length = 661

 Score =  115 bits (287), Expect = 1e-22,   Method: Compositional matrix adjust.
 Identities = 76/253 (30%), Positives = 120/253 (47%), Gaps = 12/253 (4%)

Query: 598 GAVPHDIG---ICDPWFEVNAYCLYDTARWKDLNPKFVLQVYRDVVATGDKKFAKAVWPS 654
           G  PHDIG   I DP +   +Y  +    W DL P  VL +YRD + T D+   +  +  
Sbjct: 369 GEAPHDIGEESIEDPIYGA-SYPYW----WTDLGPTLVLMLYRDYIFTSDRGILEKNYNK 423

Query: 655 VYVAMAYMDQFDRDGDGMIENDGFPDQTYDTWSVSGISAYSGGLWVAALQAASALAREVG 714
           +   + ++ + D D D + ++ G  D +YD   + G S+Y   ++++AL A   ++  + 
Sbjct: 424 IKEIIDWLIRKDMDNDCIPDSKGGYDNSYDGTHMYGASSYVASMFLSALAAFIKISEILD 483

Query: 715 DRGSEDYFLFKFQKAKVVYEKLWNGSYFNYDNSGSSQSSSIQADQLAGQWYARACGLLPI 774
            +    Y+ F  +  K  +  LWNG YF        ++ S    QL GQ++    GL PI
Sbjct: 484 IKIDTKYYRF-LECGKKTFNSLWNGKYFVLWKKNDEENRSCLNSQLLGQFWCDILGLPPI 542

Query: 775 VDEDKARSALKKVYNYNVLKVMGGKRGAVNGMLPDGRVDMSSMQSREIWSGVTYAVAASM 834
            D DK  +AL+ +Y  N       K    N +  D  +D S+ Q R  W  V++AVAA M
Sbjct: 543 TDNDKINTALRSIYELN---FRASKYCLTNAVKEDRDIDTSTGQLRSCWPRVSFAVAAHM 599

Query: 835 IHEDLADIGFQTA 847
           I   +   G + A
Sbjct: 600 ILRGMVKEGMEVA 612


>gi|194359640|gb|ACF57667.1| bile acid beta-glucosidase [Ctenopharyngodon idella]
          Length = 89

 Score =  112 bits (279), Expect = 1e-21,   Method: Compositional matrix adjust.
 Identities = 52/89 (58%), Positives = 63/89 (70%)

Query: 609 PWFEVNAYCLYDTARWKDLNPKFVLQVYRDVVATGDKKFAKAVWPSVYVAMAYMDQFDRD 668
           PW  VNAY +YDTA WKDLN KFVLQV+RD   T D+++ K +WP     M    +FD+D
Sbjct: 1   PWVRVNAYLIYDTADWKDLNLKFVLQVFRDYHLTQDEQYLKDMWPICQTVMETELKFDKD 60

Query: 669 GDGMIENDGFPDQTYDTWSVSGISAYSGG 697
           GDG+IEN G+ DQTYD W V+G SAY GG
Sbjct: 61  GDGLIENSGYADQTYDGWKVTGPSAYCGG 89


>gi|159041841|ref|YP_001541093.1| hypothetical protein Cmaq_1276 [Caldivirga maquilingensis IC-167]
 gi|157920676|gb|ABW02103.1| protein of unknown function DUF608 [Caldivirga maquilingensis
           IC-167]
          Length = 688

 Score =  110 bits (275), Expect = 4e-21,   Method: Compositional matrix adjust.
 Identities = 80/270 (29%), Positives = 128/270 (47%), Gaps = 35/270 (12%)

Query: 530 GQFLYLEGIEY------LMWNTYDVHFYSSFALIMLFPKIQLSIQRDFAAAVMMHDPSKM 583
           G+F  LEG+E       L    Y+     S  ++++FP+++ S+ R F  A M  D    
Sbjct: 336 GRFSILEGVEVCPCHGTLAGACYET---GSLPVVLMFPELEKSLLRQFTEA-MRSD---- 387

Query: 584 KLLDDGQWVSRKVLGAVPHDIGICDPWFEVNAYCLYDTA--RWKDLNPKFVLQVYRDVVA 641
                         G +PH +GI    + ++      TA  RWKDLN  ++L V+R    
Sbjct: 388 --------------GYIPHSLGI----YSLDHIEDGTTAPPRWKDLNSTYILLVHRYFKR 429

Query: 642 TGDKKFAKAVWPSVYVAMAYMDQFDRDGDGMIENDGFPDQTYDTWSVSGISAYSGGLWVA 701
           + D +F K ++P +  A  ++   D+DGDG+ E  G  D  +D  SV G  +Y+  LW+A
Sbjct: 430 SNDVEFIKEIYPKLIKAFEWVLVQDKDGDGVPELSGDGDTGFDAMSVKGFDSYTTSLWIA 489

Query: 702 ALQAASALAREVGDRGSEDYFLFKFQKAKVVYEKLWNGSYFNYDNSGSSQSSSIQADQLA 761
           AL     LA+ +GD+ +         KA+  Y + W G  F   +      +S  A Q+ 
Sbjct: 490 ALMVMGELAKLMGDQATLSKVESTLLKARDSYNRRWLGDRFKAWDEPDMGKASFLA-QIW 548

Query: 762 GQWYARACGLLPIVDEDKARSALKKVYNYN 791
           G+W++   GL  I DEDK ++A+  +   N
Sbjct: 549 GEWWSLMLGLGHITDEDKVKAAMGTIIRVN 578


>gi|148242848|ref|YP_001228005.1| hypothetical protein SynRCC307_1749 [Synechococcus sp. RCC307]
 gi|147851158|emb|CAK28652.1| Conserved hypothetical protein [Synechococcus sp. RCC307]
          Length = 220

 Score =  108 bits (270), Expect = 1e-20,   Method: Composition-based stats.
 Identities = 72/211 (34%), Positives = 112/211 (53%), Gaps = 14/211 (6%)

Query: 691 ISAYSGGLWVAALQAASA----LAREVG-DRGSEDY-FLFKFQKAKVVYEKL-WNGSYFN 743
           +SAY G LW+AAL+AA A    LA E+G D  +E   F    Q+++  +++L WNG ++ 
Sbjct: 1   MSAYCGALWIAALEAALAMGQTLALELGVDTAAEQRDFSQWLQQSRGNFDQLLWNGEFYK 60

Query: 744 YDNSGSSQSSSIQADQLAGQWYARACGLLPIVDEDKARSALKKVYNYNVLKVMGGKRGAV 803
            D    S +  + ADQL G +YAR  GL P+V E  ARS+L+ +         GG+ G  
Sbjct: 61  ID--AESGTPVVMADQLCGDFYARLLGLPPVVSEANARSSLQAIKEACFEGFAGGRLGVA 118

Query: 804 NGMLPDGR-VDMSSMQSREIWSGVTYAVAASMIHEDLADIGFQTACGIYEAAWSGTGLGY 862
           NG+  DG  +D +     E+W+G+ + +AA        +        +    ++G   G 
Sbjct: 119 NGLRRDGTPLDPNGTHPLEVWTGINFGLAAYYRLMGETNTAVAITQAVVNQVYAG---GL 175

Query: 863 AFQTPEAWNTDDQYRSLCYMRPLAIWAMQWA 893
            F+TPEA    + +R+  Y+R +AIWA+ WA
Sbjct: 176 QFRTPEALTGQNTFRACHYLRAMAIWAL-WA 205


>gi|15922946|ref|NP_378615.1| hypothetical protein ST2609 [Sulfolobus tokodaii str. 7]
 gi|15623737|dbj|BAB67724.1| putative glycosidase [Sulfolobus tokodaii str. 7]
          Length = 661

 Score =  107 bits (268), Expect = 2e-20,   Method: Compositional matrix adjust.
 Identities = 76/248 (30%), Positives = 129/248 (52%), Gaps = 24/248 (9%)

Query: 598 GAVPHDIGICDPWFEVNAYCLYDTA--RWKDLNPKFVLQVYRDVVATGDKKFAKAVWPSV 655
           G VPHD+G C     +++     TA  +WKDLNP ++L +YR    TGD +F K+V+  V
Sbjct: 364 GYVPHDLGYCS----LDSPTDGTTAPPKWKDLNPTYILLIYRYYKLTGDIEFLKSVYDKV 419

Query: 656 YVAMAYMDQFDRDGDGMIENDGFPDQTYDTWSVSGISAYSGGLWVAALQAASALAREVGD 715
             A  +  +F R G      +G  D  +D   + GI++Y+  L++A+L A   +++  GD
Sbjct: 420 KKAFEWELKFSRYG-----LEGKMDSAFDVTPIKGINSYTLSLYIASLFAMREISKTAGD 474

Query: 716 RGSEDYFLFKFQKAKVVYEKLWNGSYFNYDNSGSSQSSSIQADQLAGQWYARACGLLPIV 775
             + D    + ++AK  +EK++NG YF    +      ++   Q+ G+W+    GL PI 
Sbjct: 475 NLNLDE---QIKEAKEAFEKMFNGKYF---IAWEGMEDAVFLAQVFGEWWTTLLGLEPIA 528

Query: 776 DEDKARSALKKVYNYNVLKVMGG--KRGAVNGMLPDGRVDMSSMQSREIWSGVTYAVAAS 833
           DE+K +SAL+      ++KV G   K    N +  DG+V   S Q+   W  + +A+   
Sbjct: 529 DEEKIKSALRW-----IIKVNGNASKYCTPNLVKEDGKVVSLSPQTYSSWPRLVFAICWL 583

Query: 834 MIHEDLAD 841
            I + +++
Sbjct: 584 SIEKGISE 591


>gi|261417086|ref|YP_003250769.1| glucosylceramidase [Fibrobacter succinogenes subsp. succinogenes
           S85]
 gi|385791890|ref|YP_005823013.1| hypothetical protein FSU_3272 [Fibrobacter succinogenes subsp.
           succinogenes S85]
 gi|261373542|gb|ACX76287.1| Glucosylceramidase [Fibrobacter succinogenes subsp. succinogenes
           S85]
 gi|302325406|gb|ADL24607.1| conserved domain protein [Fibrobacter succinogenes subsp.
           succinogenes S85]
          Length = 1047

 Score =  107 bits (266), Expect = 4e-20,   Method: Compositional matrix adjust.
 Identities = 76/229 (33%), Positives = 114/229 (49%), Gaps = 18/229 (7%)

Query: 531 QFLYLEGIEYLMWNTYDVHFYSSFALIMLFPKIQLSIQRDFAAAVMMHDPSKMK------ 584
           +FL  E  +Y  +N+ DV+FY SF+L+ L P++   + + F  A++  + ++ +      
Sbjct: 521 RFLVRECADYPFFNSLDVYFYGSFSLLALMPRLDGVVMKRFGDAILAVNNNRRRHHEFVN 580

Query: 585 -----LLDDGQWVSRKVLGAVPHDIGICDPW-FEVNAYCLYDTARWKDLNPKFVLQVYRD 638
                L D      R V GAV HD+G   P+  E +AY  ++   WKDL PK+VL VYR 
Sbjct: 581 LPYADLPDPKLEGPRAVRGAVIHDLG--SPFDAEPDAYDWHNVKEWKDLAPKYVLMVYRH 638

Query: 639 VVATGDKKFAKAVWPSVYVAMAYMDQFDRDGDGMIENDGFPDQTYDTWSVSGISAYSGGL 698
              T D +       +VY AM Y+++    G+      G  D T+D     GIS Y G L
Sbjct: 639 YHKTKDMQCLADCKEAVYAAMEYLEKMVNPGENFPLTHG-TDDTFDNLCSYGISVYCGSL 697

Query: 699 WVAALQAASALAREVGDRGSEDYFLFKFQKA-KVVYEKLW--NGSYFNY 744
           W+A L+AA+ +A  +GD      +  K + A K   E LW  N  YF +
Sbjct: 698 WIAGLRAAAKIAELLGDNEQAAKWNEKSEAANKEFTESLWDENEGYFRF 746



 Score = 47.0 bits (110), Expect = 0.046,   Method: Compositional matrix adjust.
 Identities = 39/162 (24%), Positives = 70/162 (43%), Gaps = 17/162 (10%)

Query: 749 SSQSSSIQADQLAGQWYARACGLLPIVDEDKARSALKKVYNYNVLKVMGGKRGAVNGMLP 808
           ++    + AD +    Y R   L PI D  KA+S L +++N N  K      GA N +  
Sbjct: 830 ANDCDDVFADTMLADTYLRLLDLEPITDSAKAKSNLLRIFNTN-YKANSPLIGAANLVRK 888

Query: 809 DGR-VDMSSMQSREIWSGVTYAVAASMIHEDLADIGFQTACGIYEAAWSGTGLGYA---F 864
           DG  +D  + Q+ ++W G+ Y++  +M+       G +    +   +  G     A   F
Sbjct: 889 DGSPLDEFNFQAHDVWIGIQYSIMTAMMFH-----GLEKEASVLADSMIGNLYDEARVPF 943

Query: 865 QTPEAWNTDDQYRSLCYMRPLAIWAMQWALTRPKPKTLEKQM 906
             PE +N        C + P A+ A ++ L+      + K++
Sbjct: 944 AAPEGFNGS------CRLHPEALVA-KFGLSATAADKMHKEL 978


>gi|15921102|ref|NP_376771.1| hypothetical protein ST0868 [Sulfolobus tokodaii str. 7]
 gi|15621887|dbj|BAB65880.1| putative glycosidase [Sulfolobus tokodaii str. 7]
          Length = 683

 Score =  102 bits (254), Expect = 1e-18,   Method: Compositional matrix adjust.
 Identities = 72/239 (30%), Positives = 118/239 (49%), Gaps = 23/239 (9%)

Query: 553 SFALIMLFPKIQLSIQRDFAAAVMMHDPSKMKLLDDGQWVSRKVLGAVPHDIGICDPWFE 612
           S  ++++FP ++ S    F           + + +DG          +PHD+G     F+
Sbjct: 360 SLPVVLMFPMLEKSTIEQFI----------LNMREDGY---------IPHDLGTYS--FD 398

Query: 613 VNAYCLYDTARWKDLNPKFVLQVYRDVVATGDKKFAKAVWPSVYVAMAYMDQFDRDGDGM 672
             +       +WKD N  FVL VYR  + T DK+F K+V+P V  AM+++   D+DGDG+
Sbjct: 399 APSDGTTAPPKWKDTNTTFVLMVYRYYLRTKDKEFLKSVYPYVKKAMSWIISKDKDGDGL 458

Query: 673 IENDGFPDQTYDTWSVSGISAYSGGLWVAALQAASALAREVGDRGSEDYFLFKFQKAKVV 732
            E DG  DQ +D   + G+ +Y   +++AAL+A   +A  VGD  S    L    K+ ++
Sbjct: 459 PEVDGSTDQGFDCVPIEGVCSYISTVYIAALEAMIKIAEIVGDSTSYYSSLLSKAKSSLM 518

Query: 733 YEKLWNGSYFNYDNSGSSQSSSIQADQLAGQWYARACGLLPIVDEDKARSALKKVYNYN 791
             KL++G  F       +   ++ + Q+ GQW+A    L  + D     SAL ++Y  N
Sbjct: 519 --KLFDGKKFIPWTGKPNHHKAVFSAQIFGQWWAYLLDLDVVADRSAILSALDEIYRVN 575


>gi|332796035|ref|YP_004457535.1| bile acid beta-glucosidase [Acidianus hospitalis W1]
 gi|332693770|gb|AEE93237.1| bile acid beta-glucosidase [Acidianus hospitalis W1]
          Length = 840

 Score =  101 bits (252), Expect = 2e-18,   Method: Compositional matrix adjust.
 Identities = 98/407 (24%), Positives = 165/407 (40%), Gaps = 96/407 (23%)

Query: 524 DGEENIGQFLYLEGIEYLMWNTYDVHFYSSFALIMLFPKIQLSIQRDFAAAV-------- 575
           D  +   Q+ Y +G  +   NT DV  YS   ++ LFP++   I   F   +        
Sbjct: 336 DKRKTANQYPYTDGPLHTALNTIDVLTYSLPTILNLFPELAEKIILQFKDKLIEENTPEH 395

Query: 576 ----MMHDPSKMKLLD----------DGQWVSRKV-----------LGAVPHDIGICDPW 610
               +  + ++ K L+          DG+ +   +            G VPH       +
Sbjct: 396 VVYSLSFEENREKFLEKLSKDPSLPTDGEKLYSTINEIVKETGKDPKGRVPH-------F 448

Query: 611 FEVNAYCLYDTARWKDLNPKFVLQVYRDVVATGDKKFAKAVWPSV------------YVA 658
           F  N     D     DLNP+F+L  Y     TGD  F K ++P +            Y  
Sbjct: 449 FTDNLRV--DEYHRVDLNPEFILMTYLIAKTTGDLNFLKEIFPKMKEALESTMKTQTYDG 506

Query: 659 MAY------------------MDQFDRDGDGMIENDGFP--DQTYDTWSVSGISAYSGGL 698
           + Y                  + + D +   ++ ++  P   QT+D WS+ GI++++  L
Sbjct: 507 LIYHTLPAGLEWLRYVNNKLNLPRGDNNSASILGHNLIPLSMQTFDDWSMIGITSFTSIL 566

Query: 699 WVAALQAASALAREVGDRGSEDYFLFKFQKAKVVYEKLWNGSYFN--YDNSGSSQSSSIQ 756
           W++++QA +     +    S DY        K + + LWNG YF+  YD     +  +  
Sbjct: 567 WISSIQAVNDACSNLKINCSYDY----ESLVKKLIDYLWNGEYFDLWYDPKSKMRDKACN 622

Query: 757 ADQLAGQWYARACGLLPIVDEDKARSALKKVYNYNVLKVMGGKRGAVNGMLPDGRVDM-- 814
           A Q+ G WY+   G L  +D+   ++ LK +  YN    +  + G +NG  P+G   +  
Sbjct: 623 ASQILGHWYSTLLG-LRFLDDSLVKTTLKSIVKYN----LKEEEGLLNGAYPNGYRPLKR 677

Query: 815 ---------SSMQSREIWSGVTYAVAASMIHEDLADIGFQTACGIYE 852
                    ++ Q    WSGV + VA+ +I+E L D G +    IYE
Sbjct: 678 NYQNQLNLPATTQIDTPWSGVEFYVASHLIYEKLRDEGEKILRNIYE 724


>gi|327309977|ref|YP_004336874.1| hypothetical protein TUZN_0057 [Thermoproteus uzoniensis 768-20]
 gi|326946456|gb|AEA11562.1| hypothetical protein TUZN_0057 [Thermoproteus uzoniensis 768-20]
          Length = 694

 Score =  100 bits (250), Expect = 3e-18,   Method: Compositional matrix adjust.
 Identities = 86/286 (30%), Positives = 132/286 (46%), Gaps = 32/286 (11%)

Query: 553 SFALIMLFPKIQLSIQRDFAAAVMMHDPSKMKLLDDGQWVSRKVLGAVPHDIGICDPWFE 612
           S  ++++FP+++    R F   +  H      + DDG          VPHD+G       
Sbjct: 364 SLPVVLMFPQLE----RSFLEQLARH------MRDDGY---------VPHDLGT----HS 400

Query: 613 VNAYCLYDTA--RWKDLNPKFVLQVYRDVVATGDKKFAKAVWPSVYVAMAYMDQFDRDGD 670
           ++A     TA  RWKD NP F+L VYR    TGD  F K+V+  +  A  +M   DRDGD
Sbjct: 401 LDAPSDGTTAPPRWKDTNPTFILLVYRYYKRTGDIDFVKSVYGRLVAAYRWMRGQDRDGD 460

Query: 671 GMIENDGFPDQTYDTWSVSGISAYSGGLWVAALQAASALAREVGDRGSEDYFLFKFQKAK 730
           G+ E +G  D  +D   + G SAY+  LW+AAL A   LA  VGD  + +      ++A+
Sbjct: 461 GLPELEGSGDSGFDCAKIRGPSAYTSSLWIAALVALRDLAEAVGDSKTAEEAGSDLERAR 520

Query: 731 VVYEKLWNG-SYFNYDNSGSSQSSSIQADQLAGQWYARACGLLPIVDEDKARSALKKVYN 789
             +E+ ++G SY  +D  G+ + +   A  +   W   A         D+  SAL  V  
Sbjct: 521 NSFERSFDGVSYKAWD--GAPEGAVFAALAVGDWWLYLAGSREGAARRDRLASALSAVLR 578

Query: 790 YNVLKVMGGKRGAVNGMLPDGRVDMSSMQSREI---WSGVTYAVAA 832
            N          A  G  P+   ++S    R++   W  + ++VAA
Sbjct: 579 VNGSASANCAPNASTGR-PEADGEISERVGRQLTASWPRLIFSVAA 623



 Score = 42.4 bits (98), Expect = 1.2,   Method: Compositional matrix adjust.
 Identities = 79/372 (21%), Positives = 133/372 (35%), Gaps = 57/372 (15%)

Query: 96  GVPLGGVGSGSIGRSYRGEFQRWQIFPRVCEDKPVLANQFSVFVSRSNGQKYSSVLCPKT 155
           GVPLGG+G+G +    +G       F             F VFV   +  K+   L  + 
Sbjct: 13  GVPLGGIGAGKVELDNKGRLVN-ATFANNWSRPIRELRGFHVFVYAEDLDKH---LFAQD 68

Query: 156 PEVLKDTTAAGIGSWDWNLKGDKSTYHALYPRAWTVHEGEPDPELRIVCRQISPIIPHNY 215
             VL      G   +       ++ Y   +P  W          L +     S ++P + 
Sbjct: 69  VSVLGTLPTLGSAGFV------EAEYEGRWPFVWLRGR---RGGLEVEVEGFSAVVPGDL 119

Query: 216 KESSYPVSVFTYTIYNSGKTSADITLLFTWTNSVGGDSEFTGQHYNSKTKMNDGVHAVL- 274
           K+SS P +     +  S +  A +++     N VG          N+  ++N+ V     
Sbjct: 120 KDSSLPAAGLAVRVRGSRRGIAAVSI----PNVVGA---------NNVGRINEPVPGGFK 166

Query: 275 LHHRTSHQLPPVTFALAAQETDGVHVSLCPHFVISGN--------SLGLTAKDMWHEIKE 326
             +  +H+  P    LA    D   V++ P F +           +LG   +  W  +  
Sbjct: 167 ARNARAHEYDPQNGELALLAKD--PVAVIPQFNLDVPRSEAPKVCALGPEDERPWLYLMG 224

Query: 327 HGSFDRLNSMETSVTSEPGSSIGAAIAASVTVPPDSEGQVTFSLAW--DCPEVNFMSGKT 384
              F    +  +    +P   + A+            G+  F L+W  D P   +  G  
Sbjct: 225 KADFGPRVAEASGGGEQPAGVVAASYEG---------GEARFVLSWYFDKPWALYPYGH- 274

Query: 385 YYRRYTKFYGTHQNAAANIARDAILEHGSWELQIEAWQRPILEDKRLPEWYPITLFNELY 444
           YY  + K        AA++ R  + E+     +   WQ  ++ D  LPEW    + N  Y
Sbjct: 275 YYENFFK-------DAADVGRYFLDEYDRLRRRTAEWQEGLV-DPALPEWLRDAVINSAY 326

Query: 445 YLNAGGAVWTDG 456
            L+AG  +  DG
Sbjct: 327 ILSAGTWLTKDG 338


>gi|284045431|ref|YP_003395771.1| hypothetical protein Cwoe_3979 [Conexibacter woesei DSM 14684]
 gi|283949652|gb|ADB52396.1| protein of unknown function DUF608 [Conexibacter woesei DSM 14684]
          Length = 870

 Score = 97.8 bits (242), Expect = 2e-17,   Method: Compositional matrix adjust.
 Identities = 72/234 (30%), Positives = 119/234 (50%), Gaps = 25/234 (10%)

Query: 632 VLQVYRDVVATGDKKFAKAVWPSVYVAMAYMDQ-FDRDGDGMIENDGFPDQTYDTWSVSG 690
           VL+ YR+    G + +    W ++   M ++D+ ++  GDG++   G    TYD  ++ G
Sbjct: 509 VLKTYREARQGGGEAWLADRWDALVRLMDHVDRTWNSAGDGVLR--GEQPCTYDI-ALHG 565

Query: 691 ISAYSGGLWVAALQAASALAREVGDRGSEDYFLFKFQKAKVVYEKLWNGSYFNYDNSGSS 750
            +   GGLW+AAL++   +A  +G  G+  Y + +F++A+  YEKL+NG Y+    +G  
Sbjct: 566 PNLLIGGLWLAALRSMEEIALRLGVAGAAGYRV-RFEQARGGYEKLFNGEYYAQPVTGEP 624

Query: 751 Q--SSSIQADQLAGQWYARACGLLPIVDEDKARSALKKVYNYNVLKVMGGK--------- 799
                   +DQL GQW+A    L  ++D ++ RSAL+ +  +N+ +  G +         
Sbjct: 625 HDFGDGCLSDQLLGQWWAHQLELGHLLDPERVRSALRAIVAHNLREGFGARVADEQPPGH 684

Query: 800 ----RGAVNGML-----PDGRVDMSSMQSREIWSGVTYAVAASMIHEDLADIGF 844
                G  +G++       GR D+      E+WSGV YAVAA  I E L D G 
Sbjct: 685 RVFADGEDSGLVVCSWPRGGRPDVPLRYCDEVWSGVEYAVAAHCIDEGLEDEGL 738


>gi|238619389|ref|YP_002914214.1| hypothetical protein M164_0933 [Sulfolobus islandicus M.16.4]
 gi|238380458|gb|ACR41546.1| protein of unknown function DUF608 [Sulfolobus islandicus M.16.4]
          Length = 661

 Score = 97.1 bits (240), Expect = 4e-17,   Method: Compositional matrix adjust.
 Identities = 77/295 (26%), Positives = 133/295 (45%), Gaps = 31/295 (10%)

Query: 551 YSSFALIMLFPKIQLSIQRDFAAAVMMHDPSKMKLLDDGQWVSRKVLGAVPHDIGICDPW 610
           + S  +I++FP+++ S  +   + V                      G VPHD+G     
Sbjct: 350 FGSLPVILMFPELEKSFLKLLISYVRND-------------------GYVPHDLGFHSLD 390

Query: 611 FEVNAYCLYDTARWKDLNPKFVLQVYRDVVATGDKKFAKAVWPSVYVAMAYMDQFDRDGD 670
             ++        +WKD+NP  VL VYR    T D  F K V+P++   M +  +  RDG 
Sbjct: 391 SPIDGTT--SPPKWKDMNPSLVLLVYRYFKFTNDIDFLKEVYPTIVKVMNWELRQCRDGL 448

Query: 671 GMIENDGFPDQTYDTWSVSGISAYSGGLWVAALQAASALAREVGDRGSEDYFLFKFQKAK 730
             +E  G  D  +D   + G  +Y+  L++A+L A   +A+ VGD     +   K   A+
Sbjct: 449 PFME--GEMDNAFDATIIKGHDSYTSSLFIASLIAMREIAKLVGDSNYVGFINEKLSVAR 506

Query: 731 VVYEKLWNGSYFN-YDNSGSSQSSSIQADQLAGQWYARACGLLPIVDEDKARSALKKVYN 789
             + K++NG YF  +D  G  ++S +   QL G+W+     L  IVDE+  +SAL+ +  
Sbjct: 507 EAFRKMFNGKYFKAWD--GVDKASFLA--QLYGEWFTTLLELENIVDENMIKSALESIIR 562

Query: 790 YNVLKVMGGKRGAVNGMLPDGRVDMSSMQSREIWSGVTYAVAASMIHEDLADIGF 844
            N            N +  +G++   S+Q+   W  + +A+      + + D+ F
Sbjct: 563 LN---GNASPYCVPNLVDENGKIVNLSVQTYSSWPRLVFAICWLAYKKGVGDLSF 614


>gi|229584458|ref|YP_002842959.1| hypothetical protein M1627_0991 [Sulfolobus islandicus M.16.27]
 gi|228019507|gb|ACP54914.1| protein of unknown function DUF608 [Sulfolobus islandicus M.16.27]
          Length = 661

 Score = 97.1 bits (240), Expect = 4e-17,   Method: Compositional matrix adjust.
 Identities = 77/295 (26%), Positives = 133/295 (45%), Gaps = 31/295 (10%)

Query: 551 YSSFALIMLFPKIQLSIQRDFAAAVMMHDPSKMKLLDDGQWVSRKVLGAVPHDIGICDPW 610
           + S  +I++FP+++ S  +   + V                      G VPHD+G     
Sbjct: 350 FGSLPVILMFPELEKSFLKLLISYVRND-------------------GYVPHDLGFHSLD 390

Query: 611 FEVNAYCLYDTARWKDLNPKFVLQVYRDVVATGDKKFAKAVWPSVYVAMAYMDQFDRDGD 670
             ++        +WKD+NP  VL VYR    T D  F K V+P++   M +  +  RDG 
Sbjct: 391 SPIDGTT--SPPKWKDMNPSLVLLVYRYFKFTNDIDFLKEVYPTIVKVMDWELRQCRDGL 448

Query: 671 GMIENDGFPDQTYDTWSVSGISAYSGGLWVAALQAASALAREVGDRGSEDYFLFKFQKAK 730
             +E  G  D  +D   + G  +Y+  L++A+L A   +A+ VGD     +   K   A+
Sbjct: 449 PFME--GEMDNAFDATIIKGHDSYTSSLFIASLIAMREIAKLVGDSNYVGFINEKLSVAR 506

Query: 731 VVYEKLWNGSYFN-YDNSGSSQSSSIQADQLAGQWYARACGLLPIVDEDKARSALKKVYN 789
             + K++NG YF  +D  G  ++S +   QL G+W+     L  IVDE+  +SAL+ +  
Sbjct: 507 EAFRKMFNGKYFKAWD--GVDKASFLA--QLYGEWFTTLLELENIVDENMIKSALESIIR 562

Query: 790 YNVLKVMGGKRGAVNGMLPDGRVDMSSMQSREIWSGVTYAVAASMIHEDLADIGF 844
            N            N +  +G++   S+Q+   W  + +A+      + + D+ F
Sbjct: 563 LN---GNASPYCVPNLVDENGKIVNLSVQTYSSWPRLVFAICWLAYKKGVGDLSF 614


>gi|238620760|ref|YP_002915586.1| hypothetical protein M164_2319 [Sulfolobus islandicus M.16.4]
 gi|238381830|gb|ACR42918.1| protein of unknown function DUF608 [Sulfolobus islandicus M.16.4]
          Length = 818

 Score = 97.1 bits (240), Expect = 5e-17,   Method: Compositional matrix adjust.
 Identities = 106/437 (24%), Positives = 181/437 (41%), Gaps = 80/437 (18%)

Query: 529 IGQFLYLEGIEYLMWNTYDVHFYSSFALIMLFPKIQLSIQRDFAAAVMMHDPS------- 581
           +G++ Y +G E    NT DV  Y+   +++LFP +  +I +D +   +  D         
Sbjct: 347 VGKYPYTDGPENTALNTIDVLLYALPGVMLLFPDLAKNIVKDLSNRALKEDTPEYVIFSL 406

Query: 582 ---------KMKLLDDGQWVS--RKVLGAVP---HDIGICDPWFEVNAYCLY----DTAR 623
                    K +++ D    +  +K+ G +    ++ G  DP   +  Y  Y    DT  
Sbjct: 407 AFPENLMKYKEEIMKDPTISTDLKKLYGTIKRIANETG-KDPKGRMPHYIRYSLTVDTYE 465

Query: 624 WKDLNPKFVLQVYRDVVATGDKKFAKAVWPSV-----------------YVAMAYMDQFD 666
             D+NP+FVL  Y     TGD++  K+V+                    Y+ +    ++ 
Sbjct: 466 RIDINPEFVLLYYLIAKYTGDRELLKSVYEVARNAIESIMRTQTVDGLPYLTLPSGIEWI 525

Query: 667 RDGDGMIE-ND----------GFPDQTYDTWSVSGISAYSGGLWVAALQAASALAREVGD 715
           R+ + M+  ND              QT D WS  G S Y   LW++AL+A +  ++ + +
Sbjct: 526 RNVNSMLRANDAHKILGYHSLALSMQTLDDWSWLGFSPYVSFLWISALEALNEASKLLNN 585

Query: 716 RGSEDYFLFKFQKAKVVYEK----LWNGSYFN--YDNSGSSQSSSIQADQLAGQWYARAC 769
              ++Y      + K + EK    LWNG Y+   YD   + +  S  A Q+ G WY +  
Sbjct: 586 --PQNY------EVKELIEKVNKYLWNGEYYMDWYDPISNLRDDSSNASQITGDWYVQML 637

Query: 770 GLLPIVDEDKARSALKKVYNYNVLKVMGGKRGAVNGMLPDGRVDMSSMQSREIWSGVTYA 829
            L   +D ++ +S    +  YN     G + G+ N  +    V + S+QS+  WSGV Y 
Sbjct: 638 DLPEFLDYERRKSVFSSIMKYNYSAEEGVRNGSSNDDITPLGVKL-SIQSKAPWSGVEYY 696

Query: 830 VAASMIHEDLADIGFQTACGIYEAAWSGTGLGYAFQTPEAWNTDDQYRSLCYMRPLAIWA 889
           +A+ M      +   +    +YE       +   F     W          YMRPL   +
Sbjct: 697 LASHMFSSGFDEYAKKILRNVYERY----EIAGNFWNHIEWGAR-------YMRPLVALS 745

Query: 890 MQWALTRPKPKTLEKQM 906
           M +A+   K   L K++
Sbjct: 746 MIYAIEGMKVNMLNKEV 762


>gi|227827237|ref|YP_002829016.1| hypothetical protein M1425_0924 [Sulfolobus islandicus M.14.25]
 gi|227459032|gb|ACP37718.1| protein of unknown function DUF608 [Sulfolobus islandicus M.14.25]
          Length = 661

 Score = 97.1 bits (240), Expect = 5e-17,   Method: Compositional matrix adjust.
 Identities = 77/295 (26%), Positives = 133/295 (45%), Gaps = 31/295 (10%)

Query: 551 YSSFALIMLFPKIQLSIQRDFAAAVMMHDPSKMKLLDDGQWVSRKVLGAVPHDIGICDPW 610
           + S  +I++FP+++ S  +   + V                      G VPHD+G     
Sbjct: 350 FGSLPVILMFPELEKSFLKLLISYVRND-------------------GYVPHDLGFHSLD 390

Query: 611 FEVNAYCLYDTARWKDLNPKFVLQVYRDVVATGDKKFAKAVWPSVYVAMAYMDQFDRDGD 670
             ++        +WKD+NP  VL VYR    T D  F K V+P++   M +  +  RDG 
Sbjct: 391 SPIDGTT--SPPKWKDMNPSLVLLVYRYFKFTNDIDFLKEVYPTIVKVMDWELRQCRDGL 448

Query: 671 GMIENDGFPDQTYDTWSVSGISAYSGGLWVAALQAASALAREVGDRGSEDYFLFKFQKAK 730
             +E  G  D  +D   + G  +Y+  L++A+L A   +A+ VGD     +   K   A+
Sbjct: 449 PFME--GEMDNAFDATIIKGHDSYTSSLFIASLIAMREIAKLVGDSNYVGFINEKLSVAR 506

Query: 731 VVYEKLWNGSYFN-YDNSGSSQSSSIQADQLAGQWYARACGLLPIVDEDKARSALKKVYN 789
             + K++NG YF  +D  G  ++S +   QL G+W+     L  IVDE+  +SAL+ +  
Sbjct: 507 EAFRKMFNGKYFKAWD--GVDKASFLA--QLYGEWFTTLLELENIVDENMIKSALESIIR 562

Query: 790 YNVLKVMGGKRGAVNGMLPDGRVDMSSMQSREIWSGVTYAVAASMIHEDLADIGF 844
            N            N +  +G++   S+Q+   W  + +A+      + + D+ F
Sbjct: 563 LN---GNASPYCVPNLVDENGKIVNLSVQTYSSWPRLVFAICWLAYKKGVGDLSF 614


>gi|15899746|ref|NP_344351.1| hypothetical protein SSO3039 [Sulfolobus solfataricus P2]
 gi|13816440|gb|AAK43141.1| Hypothetical protein SSO3039 [Sulfolobus solfataricus P2]
          Length = 803

 Score = 96.7 bits (239), Expect = 6e-17,   Method: Compositional matrix adjust.
 Identities = 93/390 (23%), Positives = 159/390 (40%), Gaps = 83/390 (21%)

Query: 529 IGQFLYLEGIEYLMWNTYDVHFYSSFALIMLFPKIQLSIQRDFAAAVMMHD--------- 579
           +G++ Y  G E    NT DV  Y+   +++LFP++  +I RD +   +  D         
Sbjct: 332 VGKYPYTGGPENTALNTIDVLLYALPGVMLLFPELAKNIVRDLSNRALKEDTPEYVLFSL 391

Query: 580 --PSKM------------------KLLDDGQWVSRKV----LGAVPHDIGICDPWFEVNA 615
             P  +                  KL D  + + ++      G +PH I           
Sbjct: 392 AFPENLMKYKEEVKKDPTISTDLKKLYDTIKKIIKETGKDPKGRMPHYIRHS-------- 443

Query: 616 YCLYDTARWKDLNPKFVLQVYRDVVATGDKKFAKAVWPSV-----------------YVA 658
               DT    D+NP+FVL  Y     TGD++F  +V+                    Y+ 
Sbjct: 444 -LTVDTYERVDINPEFVLLYYLIAKYTGDREFLTSVYEVARNAIESIMRTQTLDGLPYLT 502

Query: 659 MAYMDQFDRDGDGMIEND-----------GFPDQTYDTWSVSGISAYSGGLWVAALQAAS 707
           +    ++ R  + M++                 QT D WS  G S Y   LW+AAL+A +
Sbjct: 503 LPSGIEWMRHVNNMLKVSDAHKILGYHTLALSMQTLDDWSWLGFSPYVSFLWIAALEALN 562

Query: 708 ALAREVGDRGSEDYFLFKFQKAKV---VYEKLWNGSYFN--YDNSGSSQSSSIQADQLAG 762
             ++ + +       L  ++  ++   V + LWNG Y+   YD   + +  S+ A Q+ G
Sbjct: 563 EASKLLNN-------LHNYEVKELTEKVNKYLWNGEYYINWYDPISNLRDDSLNASQMTG 615

Query: 763 QWYARACGLLPIVDEDKARSALKKVYNYNVLKVMGGKRGAVNGMLPDGRVDMSSMQSREI 822
            WY +  GL   +D +K +S    +  YN  +  G K G+ N  +    V + S+QS+  
Sbjct: 616 DWYVQLLGLPEFLDHEKRKSIFSSIIKYNYTEEEGVKNGSSNKEITPLGVKL-SVQSKTP 674

Query: 823 WSGVTYAVAASMIHEDLADIGFQTACGIYE 852
           WSG+ Y +A+ M +    +   +    +YE
Sbjct: 675 WSGIEYYLASHMFYNGFDEYAKKILRNVYE 704


>gi|284174574|ref|ZP_06388543.1| hypothetical protein Ssol98_07941 [Sulfolobus solfataricus 98/2]
 gi|356934758|gb|AET42959.1| hypothetical protein [Sulfolobus solfataricus 98/2]
          Length = 803

 Score = 96.7 bits (239), Expect = 6e-17,   Method: Compositional matrix adjust.
 Identities = 93/390 (23%), Positives = 159/390 (40%), Gaps = 83/390 (21%)

Query: 529 IGQFLYLEGIEYLMWNTYDVHFYSSFALIMLFPKIQLSIQRDFAAAVMMHD--------- 579
           +G++ Y  G E    NT DV  Y+   +++LFP++  +I RD +   +  D         
Sbjct: 332 VGKYPYTGGPENTALNTIDVLLYALPGVMLLFPELAKNIVRDLSNRALKEDTPEYVLFSL 391

Query: 580 --PSKM------------------KLLDDGQWVSRKV----LGAVPHDIGICDPWFEVNA 615
             P  +                  KL D  + + ++      G +PH I           
Sbjct: 392 AFPENLMKYKEEVKKDPTISTDLKKLYDTIKKIIKETGKDPKGRMPHYIRHS-------- 443

Query: 616 YCLYDTARWKDLNPKFVLQVYRDVVATGDKKFAKAVWPSV-----------------YVA 658
               DT    D+NP+FVL  Y     TGD++F  +V+                    Y+ 
Sbjct: 444 -LTVDTYERVDINPEFVLLYYLIAKYTGDREFLTSVYEVARNAIESIMRTQTLDGLPYLT 502

Query: 659 MAYMDQFDRDGDGMIEND-----------GFPDQTYDTWSVSGISAYSGGLWVAALQAAS 707
           +    ++ R  + M++                 QT D WS  G S Y   LW+AAL+A +
Sbjct: 503 LPSGIEWMRHVNNMLKVSDAHKILGYHTLALSMQTLDDWSWLGFSPYVSFLWIAALEALN 562

Query: 708 ALAREVGDRGSEDYFLFKFQKAKV---VYEKLWNGSYFN--YDNSGSSQSSSIQADQLAG 762
             ++ + +       L  ++  ++   V + LWNG Y+   YD   + +  S+ A Q+ G
Sbjct: 563 EASKLLNN-------LHNYEVKELTEKVNKYLWNGEYYINWYDPISNLRDDSLNASQMTG 615

Query: 763 QWYARACGLLPIVDEDKARSALKKVYNYNVLKVMGGKRGAVNGMLPDGRVDMSSMQSREI 822
            WY +  GL   +D +K +S    +  YN  +  G K G+ N  +    V + S+QS+  
Sbjct: 616 DWYVQLLGLPEFLDHEKRKSIFSSIIKYNYTEEEGVKNGSSNKEITPLGVKL-SVQSKTP 674

Query: 823 WSGVTYAVAASMIHEDLADIGFQTACGIYE 852
           WSG+ Y +A+ M +    +   +    +YE
Sbjct: 675 WSGIEYYLASHMFYNGFDEYAKKILRNVYE 704


>gi|149196161|ref|ZP_01873217.1| hypothetical protein LNTAR_23484 [Lentisphaera araneosa HTCC2155]
 gi|149141008|gb|EDM29405.1| hypothetical protein LNTAR_23484 [Lentisphaera araneosa HTCC2155]
          Length = 1052

 Score = 96.3 bits (238), Expect = 7e-17,   Method: Compositional matrix adjust.
 Identities = 63/193 (32%), Positives = 96/193 (49%), Gaps = 15/193 (7%)

Query: 531 QFLYLEGIEYLMWNTYDVHFYSSFALIMLFPKIQLSIQRDFAAAVMMHDPS--------- 581
           +FL  E  +Y  +N+ DV+FY SF+++ L P++  +  R F  A+M  +           
Sbjct: 519 KFLVRECSDYPFFNSLDVYFYGSFSMMWLLPQVDTNTMRCFRDAIMAENDELRRFYVYLE 578

Query: 582 --KMKLLDDGQWVSRKVLGAVPHDIGI-CDPWFEVNAYCLYDTARWKDLNPKFVLQVYRD 638
               KL        R   GAV HD+G   DP    +AY  ++ A WKDL PK++L + R+
Sbjct: 579 LPNAKLPHPKYEGPRARKGAVIHDLGSPFDP--RPDAYNWHNVAEWKDLAPKYILMLLRN 636

Query: 639 VVATGDKKFAKAVWPSVYVAMAYMDQFDRDGDGMIENDGFPDQTYDTWSVSGISAYSGGL 698
              T D    +  W SV  ++ Y+     +G  +   +G  D T+D  S  GI+ Y G L
Sbjct: 637 YHFTKDVYLLEECWDSVQASLQYLKDMILEGHSIPLTNG-TDDTFDNLSSFGITLYCGSL 695

Query: 699 WVAALQAASALAR 711
           WVA L+AA  +A+
Sbjct: 696 WVAGLKAAGEIAK 708


>gi|385774246|ref|YP_005646813.1| bile acid beta-glucosidase [Sulfolobus islandicus HVE10/4]
 gi|323478361|gb|ADX83599.1| bile acid beta-glucosidase [Sulfolobus islandicus HVE10/4]
          Length = 818

 Score = 95.9 bits (237), Expect = 1e-16,   Method: Compositional matrix adjust.
 Identities = 105/437 (24%), Positives = 179/437 (40%), Gaps = 80/437 (18%)

Query: 529 IGQFLYLEGIEYLMWNTYDVHFYSSFALIMLFPKIQLSIQRDFAAAVMMHDPS------- 581
           +G++ Y +G E    NT DV  Y+   +++LFP +  +I +D +   +  D         
Sbjct: 347 VGKYPYTDGPENTALNTIDVLLYALPGVLLLFPDLAKNIVKDLSNRALKEDTPEYVIFSL 406

Query: 582 ---------KMKLLDDGQWVS--RKVLGAVP---HDIGICDPWFEVNAYCLY----DTAR 623
                    K +++ D    +  +K+ G +    ++ G  DP   +  Y  Y    DT  
Sbjct: 407 AFPENLMKYKEEIMKDPTISTDLKKLYGTIKRIANETG-KDPKGRMPHYIRYSLTVDTYE 465

Query: 624 WKDLNPKFVLQVYRDVVATGDKKFAKAVW------------------------PSVYVAM 659
             D+NP+FVL  Y     TGD++  K+V+                        PS    +
Sbjct: 466 RIDINPEFVLLYYLIAKYTGDRELLKSVYEVARNAIESIMRTQTMDGLPYLTLPSGIEWI 525

Query: 660 AYMDQFDRDGDGM----IENDGFPDQTYDTWSVSGISAYSGGLWVAALQAASALAREVGD 715
            Y++   R  D        +     QT D WS  G S Y   LW++AL+A +  ++ + +
Sbjct: 526 RYVNSMLRANDAHKILGYHSLALSMQTLDDWSWLGFSPYVSFLWISALEALNEASKLLNN 585

Query: 716 RGSEDYFLFKFQKAKVVYEK----LWNGSYFN--YDNSGSSQSSSIQADQLAGQWYARAC 769
              ++Y      + K + EK    LWNG Y+   YD   + +  S  A Q+ G WY +  
Sbjct: 586 --PQNY------EVKELIEKVNKYLWNGEYYMDWYDPISNLRDDSSNASQITGDWYVQML 637

Query: 770 GLLPIVDEDKARSALKKVYNYNVLKVMGGKRGAVNGMLPDGRVDMSSMQSREIWSGVTYA 829
            L   +D ++ +S    +  YN     G + G+ N  +    V + S+QS+  WSGV Y 
Sbjct: 638 DLPEFLDYERRKSVFSSIMKYNYSGEEGVRNGSSNDDITPLGVKL-SIQSKAPWSGVEYY 696

Query: 830 VAASMIHEDLADIGFQTACGIYEAAWSGTGLGYAFQTPEAWNTDDQYRSLCYMRPLAIWA 889
           +A+ M +    +   +    +YE       +  +F     W          YMRPL   +
Sbjct: 697 LASHMFYSGFDEYAKKILRNVYERY----EIAGSFWNHIEWGAR-------YMRPLVALS 745

Query: 890 MQWALTRPKPKTLEKQM 906
           M +A+   K   L  ++
Sbjct: 746 MIYAIEGMKVNMLNNEV 762


>gi|385776902|ref|YP_005649470.1| bile acid beta-glucosidase [Sulfolobus islandicus REY15A]
 gi|323475650|gb|ADX86256.1| bile acid beta-glucosidase [Sulfolobus islandicus REY15A]
          Length = 818

 Score = 95.5 bits (236), Expect = 1e-16,   Method: Compositional matrix adjust.
 Identities = 105/437 (24%), Positives = 179/437 (40%), Gaps = 80/437 (18%)

Query: 529 IGQFLYLEGIEYLMWNTYDVHFYSSFALIMLFPKIQLSIQRDFAAAVMMHDPS------- 581
           +G++ Y +G E    NT DV  Y+   +++LFP +  +I +D +   +  D         
Sbjct: 347 VGKYPYTDGPENTALNTIDVLLYALPGVMLLFPDLAKNIVKDLSNRALKEDTPEYVIFSL 406

Query: 582 ---------KMKLLDDGQWVS--RKVLGAVP---HDIGICDPWFEVNAYCLY----DTAR 623
                    K +++ D    +  +K+ G +    ++ G  DP   +  Y  Y    DT  
Sbjct: 407 AFPENLMKYKEEIMKDPTISTDLKKLYGTIKRIANETG-KDPKGRMPHYIRYSLTVDTYE 465

Query: 624 WKDLNPKFVLQVYRDVVATGDKKFAKAVW------------------------PSVYVAM 659
             D+NP+FVL  Y     TGD++  K+V+                        PS    +
Sbjct: 466 RIDINPEFVLLYYLIAKYTGDRELLKSVYEVARNAIESIMRTQTMDGLPYLTLPSGIEWI 525

Query: 660 AYMDQFDRDGDGM----IENDGFPDQTYDTWSVSGISAYSGGLWVAALQAASALAREVGD 715
            Y++   R  D        +     QT D WS  G S Y   LW++AL+A +  ++ + +
Sbjct: 526 RYVNSMLRANDAHKILGYHSLALSMQTLDDWSWLGFSPYVSFLWISALEALNEASKLLNN 585

Query: 716 RGSEDYFLFKFQKAKVVYEK----LWNGSYFN--YDNSGSSQSSSIQADQLAGQWYARAC 769
              ++Y      + K + EK    LWNG Y+   YD   + +  S  A Q+ G WY +  
Sbjct: 586 --PQNY------EVKELIEKVNKYLWNGEYYMDWYDPISNLRDDSSNASQITGDWYVQML 637

Query: 770 GLLPIVDEDKARSALKKVYNYNVLKVMGGKRGAVNGMLPDGRVDMSSMQSREIWSGVTYA 829
            L   +D ++ +S    +  YN     G + G+ N  +    V + S+QS+  WSGV Y 
Sbjct: 638 DLPEFLDYERRKSVFSSIMKYNYSGEEGVRNGSSNDDITPLGVKL-SIQSKAPWSGVEYY 696

Query: 830 VAASMIHEDLADIGFQTACGIYEAAWSGTGLGYAFQTPEAWNTDDQYRSLCYMRPLAIWA 889
           +A+ M +    +   +    +YE       +  +F     W          YMRPL   +
Sbjct: 697 LASHMFYSGFDEYAKKILRNVYERY----EIAGSFWNHIEWGAR-------YMRPLVALS 745

Query: 890 MQWALTRPKPKTLEKQM 906
           M +A+   K   L  ++
Sbjct: 746 MIYAIEGMKVNMLNNEV 762


>gi|284998829|ref|YP_003420597.1| hypothetical protein [Sulfolobus islandicus L.D.8.5]
 gi|284446725|gb|ADB88227.1| hypothetical protein LD85_2606 [Sulfolobus islandicus L.D.8.5]
          Length = 803

 Score = 95.5 bits (236), Expect = 1e-16,   Method: Compositional matrix adjust.
 Identities = 104/436 (23%), Positives = 175/436 (40%), Gaps = 78/436 (17%)

Query: 529 IGQFLYLEGIEYLMWNTYDVHFYSSFALIMLFPKIQLSIQRDFAAAVMMHDPS------- 581
           +G++ Y +G E    NT DV  Y+   +++LFP +  +I +D +   +  D         
Sbjct: 332 VGKYPYTDGPENTALNTIDVLLYALPGVMLLFPDLAKNIVKDLSNRALKEDTPEYVIFSL 391

Query: 582 ---------KMKLLDDGQWVS--RKVLGAVPHDIGIC--DPWFEVNAYCLY----DTARW 624
                    K +++ D    +  +K+ G +   +     DP   +  Y  Y    DT   
Sbjct: 392 AFPENLMKYKEEIMKDPTISTDLKKLYGTIKRIVNETGKDPKGRMPHYIRYSLTVDTYER 451

Query: 625 KDLNPKFVLQVYRDVVATGDKKFAKAVW------------------------PSVYVAMA 660
            D+NP+FVL  Y     TGD++  K+V+                        PS    + 
Sbjct: 452 IDINPEFVLLYYLIAKYTGDRELLKSVYEVARNAIESIMRTQTMDGLPYLTLPSGIEWIR 511

Query: 661 YMDQFDRDGDGM----IENDGFPDQTYDTWSVSGISAYSGGLWVAALQAASALAREVGDR 716
           Y++   R  D        +     QT D WS  G S Y   LW++AL+A +  ++ + + 
Sbjct: 512 YVNSMLRANDAHKILGYHSLALSMQTLDDWSWLGFSPYVSFLWISALEALNEASKLLSN- 570

Query: 717 GSEDYFLFKFQKAKVVYEK----LWNGSYFN--YDNSGSSQSSSIQADQLAGQWYARACG 770
             ++Y      + K + EK    LWNG Y+   YD   + +  S  A Q+ G WY +   
Sbjct: 571 -PQNY------EVKELIEKVNKYLWNGEYYMDWYDPISNLRDDSSNASQITGDWYVQMLD 623

Query: 771 LLPIVDEDKARSALKKVYNYNVLKVMGGKRGAVNGMLPDGRVDMSSMQSREIWSGVTYAV 830
           L   +D ++ +S    +  YN     G + G+ N  +    V + S+QS+  WSGV Y +
Sbjct: 624 LPEFLDYERRKSVFSSIMKYNYSGEEGVRNGSSNDDITPLGVKL-SIQSKAPWSGVEYYL 682

Query: 831 AASMIHEDLADIGFQTACGIYEAAWSGTGLGYAFQTPEAWNTDDQYRSLCYMRPLAIWAM 890
           A  M +    +   +    +YE       +   F     W          YMRPL   +M
Sbjct: 683 ALHMFYSGFDEYAKKILRNVYERY----EIAGNFWNHIEWGAR-------YMRPLVALSM 731

Query: 891 QWALTRPKPKTLEKQM 906
            +A+   K   L  ++
Sbjct: 732 IYAIEGMKVNMLNNEV 747


>gi|227831316|ref|YP_002833096.1| hypothetical protein LS215_2489 [Sulfolobus islandicus L.S.2.15]
 gi|229580221|ref|YP_002838621.1| hypothetical protein YG5714_2457 [Sulfolobus islandicus Y.G.57.14]
 gi|227457764|gb|ACP36451.1| protein of unknown function DUF608 [Sulfolobus islandicus L.S.2.15]
 gi|228010937|gb|ACP46699.1| protein of unknown function DUF608 [Sulfolobus islandicus
           Y.G.57.14]
          Length = 818

 Score = 95.5 bits (236), Expect = 1e-16,   Method: Compositional matrix adjust.
 Identities = 104/436 (23%), Positives = 175/436 (40%), Gaps = 78/436 (17%)

Query: 529 IGQFLYLEGIEYLMWNTYDVHFYSSFALIMLFPKIQLSIQRDFAAAVMMHDPS------- 581
           +G++ Y +G E    NT DV  Y+   +++LFP +  +I +D +   +  D         
Sbjct: 347 VGKYPYTDGPENTALNTIDVLLYALPGVMLLFPDLAKNIVKDLSNRALKEDTPEYVIFSL 406

Query: 582 ---------KMKLLDDGQWVS--RKVLGAVPHDIGIC--DPWFEVNAYCLY----DTARW 624
                    K +++ D    +  +K+ G +   +     DP   +  Y  Y    DT   
Sbjct: 407 AFPENLMKYKEEIMKDPTISTDLKKLYGTIKRIVNETGKDPKGRMPHYIRYSLTVDTYER 466

Query: 625 KDLNPKFVLQVYRDVVATGDKKFAKAVW------------------------PSVYVAMA 660
            D+NP+FVL  Y     TGD++  K+V+                        PS    + 
Sbjct: 467 IDINPEFVLLYYLIAKYTGDRELLKSVYEVARNAIESIMRTQTMDGLPYLTLPSGIEWIR 526

Query: 661 YMDQFDRDGDGM----IENDGFPDQTYDTWSVSGISAYSGGLWVAALQAASALAREVGDR 716
           Y++   R  D        +     QT D WS  G S Y   LW++AL+A +  ++ + + 
Sbjct: 527 YVNSMLRANDAHKILGYHSLALSMQTLDDWSWLGFSPYVSFLWISALEALNEASKLLSN- 585

Query: 717 GSEDYFLFKFQKAKVVYEK----LWNGSYFN--YDNSGSSQSSSIQADQLAGQWYARACG 770
             ++Y      + K + EK    LWNG Y+   YD   + +  S  A Q+ G WY +   
Sbjct: 586 -PQNY------EVKELIEKVNKYLWNGEYYMDWYDPISNLRDDSSNASQITGDWYVQMLD 638

Query: 771 LLPIVDEDKARSALKKVYNYNVLKVMGGKRGAVNGMLPDGRVDMSSMQSREIWSGVTYAV 830
           L   +D ++ +S    +  YN     G + G+ N  +    V + S+QS+  WSGV Y +
Sbjct: 639 LPEFLDYERRKSVFSSIMKYNYSGEEGVRNGSSNDDITPLGVKL-SIQSKAPWSGVEYYL 697

Query: 831 AASMIHEDLADIGFQTACGIYEAAWSGTGLGYAFQTPEAWNTDDQYRSLCYMRPLAIWAM 890
           A  M +    +   +    +YE       +   F     W          YMRPL   +M
Sbjct: 698 ALHMFYSGFDEYAKKILRNVYERY----EIAGNFWNHIEWGAR-------YMRPLVALSM 746

Query: 891 QWALTRPKPKTLEKQM 906
            +A+   K   L  ++
Sbjct: 747 IYAIEGMKVNMLNNEV 762


>gi|227829616|ref|YP_002831395.1| hypothetical protein LS215_0672 [Sulfolobus islandicus L.S.2.15]
 gi|227456063|gb|ACP34750.1| protein of unknown function DUF608 [Sulfolobus islandicus L.S.2.15]
          Length = 661

 Score = 95.5 bits (236), Expect = 1e-16,   Method: Compositional matrix adjust.
 Identities = 71/250 (28%), Positives = 121/250 (48%), Gaps = 16/250 (6%)

Query: 598 GAVPHDIGICDPWFEVNAYCLYDTARWKDLNPKFVLQVYRDVVATGDKKFAKAVWPSVYV 657
           G VPHD+G       ++        +WKD+NP  +L VYR    T D  F K V+P++  
Sbjct: 378 GYVPHDLGFHSLDSPIDGTT--SPPKWKDMNPSLILLVYRYFKFTNDIDFLKEVYPTIVK 435

Query: 658 AMAYMDQFDRDGDGMIENDGFPDQTYDTWSVSGISAYSGGLWVAALQAASALAREVGDRG 717
            M +  +  RDG   +E  G  D  +D   + G  +Y+  L++A+L A   +A+ VGD  
Sbjct: 436 VMDWELRQCRDGLPFME--GEMDNAFDATIIKGHDSYTSSLFIASLIAMREIAKLVGDSN 493

Query: 718 SEDYFLFKFQKAKVVYEKLWNGSYFN-YDNSGSSQSSSIQADQLAGQWYARACGLLPIVD 776
              +   K   A+  + K++NG YF  +D  G  ++S +   QL G+W+     L  IVD
Sbjct: 494 YVGFINEKLNVAREAFRKMFNGKYFKAWD--GVDKASFLA--QLYGEWFTTLLELENIVD 549

Query: 777 EDKARSALKKVYNYNVLKVMGGKRGAVNGMLPD--GRVDMSSMQSREIWSGVTYAVAASM 834
           E+  +SAL+     +++++ G         L D  G++   S+Q+   W  + +A+    
Sbjct: 550 ENMIKSALE-----SIIRLNGNASPYCVPNLVDENGKIVNLSVQTYSSWPRLVFAICWLA 604

Query: 835 IHEDLADIGF 844
             + + D+ F
Sbjct: 605 YKKGVGDLSF 614


>gi|160934323|ref|ZP_02081710.1| hypothetical protein CLOLEP_03194 [Clostridium leptum DSM 753]
 gi|156866996|gb|EDO60368.1| hypothetical protein CLOLEP_03194 [Clostridium leptum DSM 753]
          Length = 806

 Score = 95.5 bits (236), Expect = 1e-16,   Method: Compositional matrix adjust.
 Identities = 168/796 (21%), Positives = 294/796 (36%), Gaps = 151/796 (18%)

Query: 99  LGGVGSGSIGRSYRGEFQRWQIFPRVCEDKPVLANQFSVFVSRSNGQKYSSVLCPKTPEV 158
           LGG+G+G++    RG +  W+IF +  +        F+++    N Q    +L  +    
Sbjct: 22  LGGIGTGNVSVGTRGNYCDWEIFGQPGKGIHFPYTFFALWAKSKNTQPIVRILESRIQPP 81

Query: 159 LKDTTAAGIGSWDWNLKG----DKSTYHALYPRAWTVHEGEPDPELRIVCRQISPIIPHN 214
              T + G  S D  L G    ++S++ A YP      E    P L +     +P IP N
Sbjct: 82  Y--TCSHGFISAD--LAGLPRFERSSFSAAYPFVNVALEDSKLP-LSVELEAFTPFIPLN 136

Query: 215 YKESSYPVSVFTYTIYNSGKTSADITLLFTWTNSVGGDSEFTGQHYNSKTKMNDGVHAVL 274
            K+S  P +V  Y + N+   + + T+  +  N+VG    F G        +   V    
Sbjct: 137 AKDSGIPGAVIRYKVKNNSGEALEATVASSVANAVG----FNGYDLFDNLLVKRPVKNQY 192

Query: 275 LHHRTSHQLPPVTFALAAQE-TDGVHVSLCPHFVISGNSLGLTA------KDMWHEIKEH 327
           L       L   +  LA  + T+G    +     ++     L+        D W +  E 
Sbjct: 193 LDQDQMKGLLYTSPELAEDDFTNGSIAVMTGEDSVTVKEEWLSGAWWDGIHDFWDDFSED 252

Query: 328 GSFD---RLNSMETSVTSEPGSSIGAAIAASVTVPPDSEGQVTFSLAWDCPEVNFMSGKT 384
           G  +        E+         IG+ +    T+P   E    F L W            
Sbjct: 253 GRLEFHSEFQGKESKFNDLSDLRIGS-LGVCKTIPAYGEAVFEFILTW------------ 299

Query: 385 YYRRYTKFYGTHQNAAANIARDAILEHGSW-ELQIEAWQRPILEDKRLPEWYPITLFNEL 443
           Y+    K++G H        ++  +EH  + + Q  AW             Y + L+  L
Sbjct: 300 YFPNRPKYWGGHVCPENRNKKE--IEHNYYADFQKNAWD------------YGVYLYKNL 345

Query: 444 YYLNAGGAVWTDGSPPVHSLVTIGHRKFSLDWSQSDLKRIVDVPNQNDTAVNILERMSSI 503
             L                      R F+    +S L   V            +E +++ 
Sbjct: 346 ESLEKAS------------------RSFTKALYESTLPDYV------------IEALANN 375

Query: 504 LEQIYTPVALNSSFGTNLLQDGEENIGQFLYLEGIEYLMWNTYDVHFYSSFALIMLFPKI 563
           +  + +P     S GT L  +G  +  Q    EG    +WN      Y+  A   LFP++
Sbjct: 376 ITVLRSPTCFRISDGTFLSWEG--SFPQHGSCEGNCTHVWN------YAQTA-AFLFPEL 426

Query: 564 QLSIQRDFAAAVMMHDPSKM-----KLLDDGQWVSRKVLGAVPHDIGICDPWFEVNAYCL 618
           + S++R     +   D  +M     ++  D +W   +++ AV   +G             
Sbjct: 427 EQSMRRT-EFLLETDDTGRMAFRTLQVFGDKKW---EMIPAVDGQMGT------------ 470

Query: 619 YDTARWKDLNPKFVLQVYRDVVATGDKKFAKAVWPSVYVAMAYMDQF-DRDGDGMIENDG 677
                        +L++YRD          K  W    +A+ +   + D D D +++   
Sbjct: 471 -------------ILRLYRDWKLGAGDALLKETWEKAALALDFAFTYWDSDKDCVLD--- 514

Query: 678 FPDQTYDTWSVS--GISAYSGGLWVAALQAASALAREVGDRGSEDYFLFKFQKAKVVYEK 735
             DQ ++T+ +   G ++ S  L++AAL+A   +A  +GD    + +   F+K     + 
Sbjct: 515 --DQQHNTYDIEFYGPNSLSNSLFLAALKAGKEMAAYLGDTAHAEKYAEAFRKGSEKMDS 572

Query: 736 L-WNGSYF-----NYDNSGSSQSSSIQADQLAGQWYARACGLLPIVDEDKARSALKKVYN 789
           L W G Y+     + +       +   ADQ+ GQ  A    L  ++ E+  + A+  V+ 
Sbjct: 573 LLWGGDYYIQVIDDINAYKYQYGTGCLADQVFGQMLAHLNHLGYVLPEEHIKKAIHSVFQ 632

Query: 790 YNVLKVM------------GGKRGAVNGMLPDG-RVDMSSMQSREIWSGVTYAVAASMIH 836
           YN  + M              ++G V    P G R  +  + S E+WSG+ Y VAA +I 
Sbjct: 633 YNFKETMEDHVNVQRTYALNQEKGLVLCSWPQGGRPKLPFVYSDEVWSGIEYQVAAHLIF 692

Query: 837 EDLADIGFQTACGIYE 852
           E   + G      + E
Sbjct: 693 EGFVEEGLAVVKAVRE 708


>gi|229581136|ref|YP_002839535.1| hypothetical protein YN1551_0454 [Sulfolobus islandicus Y.N.15.51]
 gi|228011852|gb|ACP47613.1| protein of unknown function DUF608 [Sulfolobus islandicus
           Y.N.15.51]
          Length = 816

 Score = 95.1 bits (235), Expect = 1e-16,   Method: Compositional matrix adjust.
 Identities = 104/436 (23%), Positives = 175/436 (40%), Gaps = 78/436 (17%)

Query: 529 IGQFLYLEGIEYLMWNTYDVHFYSSFALIMLFPKIQLSIQRDFAAAVMMHDPS------- 581
           +G++ Y +G E    NT DV  Y+   +++LFP +  +I +D +   +  D         
Sbjct: 345 VGKYPYTDGPENTALNTIDVLLYALPGVMLLFPDLAKNIVKDLSNRALKEDTPEYVIFSL 404

Query: 582 ---------KMKLLDDGQWVS--RKVLGAVPHDIGIC--DPWFEVNAYCLY----DTARW 624
                    K +++ D    +  +K+ G +   +     DP   +  Y  Y    DT   
Sbjct: 405 AFPENLMKYKEEIMKDPTISTDLKKLYGTIKRIVNETGKDPKGRMPHYIRYSLTVDTYER 464

Query: 625 KDLNPKFVLQVYRDVVATGDKKFAKAVW------------------------PSVYVAMA 660
            D+NP+FVL  Y     TGD++  K+V+                        PS    + 
Sbjct: 465 IDINPEFVLLYYLIAKYTGDRELLKSVYEVARNAIESIMRTQTMDGLPYLTLPSGIEWIR 524

Query: 661 YMDQFDRDGDGM----IENDGFPDQTYDTWSVSGISAYSGGLWVAALQAASALAREVGDR 716
           Y++   R  D        +     QT D WS  G S Y   LW++AL+A +  ++ + + 
Sbjct: 525 YVNSMLRANDAHKILGYHSLALSMQTLDDWSWLGFSPYVSFLWISALEALNEASKLLSN- 583

Query: 717 GSEDYFLFKFQKAKVVYEK----LWNGSYFN--YDNSGSSQSSSIQADQLAGQWYARACG 770
             ++Y      + K + EK    LWNG Y+   YD   + +  S  A Q+ G WY +   
Sbjct: 584 -PQNY------EVKELIEKVNKYLWNGEYYMDWYDPISNLRDDSSNASQITGDWYVQMLD 636

Query: 771 LLPIVDEDKARSALKKVYNYNVLKVMGGKRGAVNGMLPDGRVDMSSMQSREIWSGVTYAV 830
           L   +D ++ +S    +  YN     G + G+ N  +    V + S+QS+  WSGV Y +
Sbjct: 637 LPEFLDYERRKSVFSSIMKYNYSGEEGVRNGSSNDDITPLGVKL-SIQSKAPWSGVEYYL 695

Query: 831 AASMIHEDLADIGFQTACGIYEAAWSGTGLGYAFQTPEAWNTDDQYRSLCYMRPLAIWAM 890
           A  M +    +   +    +YE       +   F     W          YMRPL   +M
Sbjct: 696 ALHMFYSGFDEYAKKILRNVYERY----EIAGNFWNHIEWGAR-------YMRPLVALSM 744

Query: 891 QWALTRPKPKTLEKQM 906
            +A+   K   L  ++
Sbjct: 745 IYAIEGMKVNMLNNEV 760


>gi|15898746|ref|NP_343351.1| hypothetical protein SSO1948 [Sulfolobus solfataricus P2]
 gi|229578524|ref|YP_002836922.1| hypothetical protein YG5714_0710 [Sulfolobus islandicus Y.G.57.14]
 gi|229582969|ref|YP_002841368.1| hypothetical protein YN1551_2495 [Sulfolobus islandicus Y.N.15.51]
 gi|284173504|ref|ZP_06387473.1| hypothetical protein Ssol98_02460 [Sulfolobus solfataricus 98/2]
 gi|384435081|ref|YP_005644439.1| hypothetical protein [Sulfolobus solfataricus 98/2]
 gi|13815223|gb|AAK42141.1| Hypothetical protein SSO1948 [Sulfolobus solfataricus P2]
 gi|228009238|gb|ACP45000.1| protein of unknown function DUF608 [Sulfolobus islandicus
           Y.G.57.14]
 gi|228013685|gb|ACP49446.1| protein of unknown function DUF608 [Sulfolobus islandicus
           Y.N.15.51]
 gi|261603235|gb|ACX92838.1| protein of unknown function DUF608 [Sulfolobus solfataricus 98/2]
          Length = 661

 Score = 95.1 bits (235), Expect = 2e-16,   Method: Compositional matrix adjust.
 Identities = 70/248 (28%), Positives = 118/248 (47%), Gaps = 12/248 (4%)

Query: 598 GAVPHDIGICDPWFEVNAYCLYDTARWKDLNPKFVLQVYRDVVATGDKKFAKAVWPSVYV 657
           G VPHD+G       ++        +WKD+NP  +L VYR    T D  F K V+P++  
Sbjct: 378 GYVPHDLGFHSLDSPIDGTT--SPPKWKDMNPSLILLVYRYFKFTNDIDFLKEVYPTIVK 435

Query: 658 AMAYMDQFDRDGDGMIENDGFPDQTYDTWSVSGISAYSGGLWVAALQAASALAREVGDRG 717
            M +  +  RDG   +E  G  D  +D   + G  +Y+  L++A+L A   +A+ VGD  
Sbjct: 436 VMDWELRQCRDGLPFME--GEMDNAFDATIIKGHDSYTSSLFIASLIAMREIAKLVGDSN 493

Query: 718 SEDYFLFKFQKAKVVYEKLWNGSYFN-YDNSGSSQSSSIQADQLAGQWYARACGLLPIVD 776
              +   K   A+  + K++NG YF  +D  G  ++S +   QL G+W+     L  IVD
Sbjct: 494 YVGFINEKLNVAREAFRKMFNGKYFKAWD--GVDKASFLA--QLYGEWFTTLLELENIVD 549

Query: 777 EDKARSALKKVYNYNVLKVMGGKRGAVNGMLPDGRVDMSSMQSREIWSGVTYAVAASMIH 836
           E+  +SAL+ +   N            N +  +G++   S+Q+   W  + +A+      
Sbjct: 550 ENMIKSALESIIRLN---GNASPYCVPNLVDENGKIVNLSVQTYSSWPRLVFAICWLAYK 606

Query: 837 EDLADIGF 844
           + + D+ F
Sbjct: 607 KGVGDLSF 614


>gi|227828566|ref|YP_002830346.1| hypothetical protein M1425_2317 [Sulfolobus islandicus M.14.25]
 gi|227460362|gb|ACP39048.1| protein of unknown function DUF608 [Sulfolobus islandicus M.14.25]
          Length = 818

 Score = 95.1 bits (235), Expect = 2e-16,   Method: Compositional matrix adjust.
 Identities = 107/440 (24%), Positives = 177/440 (40%), Gaps = 86/440 (19%)

Query: 529 IGQFLYLEGIEYLMWNTYDVHFYSSFALIMLFPKIQLSIQRDFAAAVMMHDPS------- 581
           +G++ Y +G E    NT DV  Y+   +++LFP +  +I +D +   +  D         
Sbjct: 347 VGKYPYTDGPENTALNTIDVLLYALPGVMLLFPDLAKNIVKDLSNRALKEDTPEYVIFSL 406

Query: 582 ---------KMKLLDDGQWVS--RKVLGAVP---HDIGICDPWFEVNAYCLY----DTAR 623
                    K +++ D    +  +K+ G +    ++ G  DP   +  Y  Y    DT  
Sbjct: 407 AFPENLMKYKEEIMKDPTISTDLKKLYGTIKRIANETG-KDPKGRMPHYIRYSLTVDTYE 465

Query: 624 WKDLNPKFVLQVYRDVVATGDKKFAKAVW------------------------PSVYVAM 659
             D+NP+FVL  Y     TGD++  K+V+                        PS    +
Sbjct: 466 RIDINPEFVLLYYLIAKYTGDRELLKSVYEVARNAIESIMRTQTMDGLPYLTLPSGIEWI 525

Query: 660 AYMDQFDRDGDGM----IENDGFPDQTYDTWSVSGISAYSGGLWVAALQAASALAREVGD 715
            Y++   R  D        +     QT D WS  G S Y   LW++AL+A +  ++ + +
Sbjct: 526 RYVNSMLRANDAHKILGYHSLALSMQTLDDWSWLGFSPYVSFLWISALEALNEASKLLNN 585

Query: 716 RGSEDYFLFKFQKAKVVYEK----LWNGSYFN--YDNSGSSQSSSIQADQLAGQWYARAC 769
              ++Y      + K + EK    LWNG Y+   YD   + +  S  A Q+ G WY +  
Sbjct: 586 --PQNY------EVKELIEKVDKYLWNGEYYMDWYDPISNLRDDSSNASQITGDWYVQML 637

Query: 770 GLLPIVDEDKARSALKKVYNYNVLKVMGGKRGAVNGMLPDGRVDMS---SMQSREIWSGV 826
            L   +D ++ +S    +  YN      G+ G  NG   D    +    S+QS+  WSGV
Sbjct: 638 DLPEFLDYERRKSVFSSIMKYN----YSGEEGVRNGSSNDDITPLGIKLSIQSKAPWSGV 693

Query: 827 TYAVAASMIHEDLADIGFQTACGIYEAAWSGTGLGYAFQTPEAWNTDDQYRSLCYMRPLA 886
            Y +A+ M      +   +    +YE       +   F     W          YMRPL 
Sbjct: 694 EYYLASHMFSSGFDEYAKKILRNVYERY----EIAGNFWNHIEWGAR-------YMRPLV 742

Query: 887 IWAMQWALTRPKPKTLEKQM 906
             +M +A+   K   L K++
Sbjct: 743 ALSMIYAIEGMKVNMLNKEV 762


>gi|229585796|ref|YP_002844298.1| hypothetical protein M1627_2396 [Sulfolobus islandicus M.16.27]
 gi|228020846|gb|ACP56253.1| protein of unknown function DUF608 [Sulfolobus islandicus M.16.27]
          Length = 818

 Score = 95.1 bits (235), Expect = 2e-16,   Method: Compositional matrix adjust.
 Identities = 107/440 (24%), Positives = 177/440 (40%), Gaps = 86/440 (19%)

Query: 529 IGQFLYLEGIEYLMWNTYDVHFYSSFALIMLFPKIQLSIQRDFAAAVMMHDPS------- 581
           +G++ Y +G E    NT DV  Y+   +++LFP +  +I +D +   +  D         
Sbjct: 347 VGKYPYTDGPENTALNTIDVLLYALPGVMLLFPDLAKNIVKDLSNRALKEDTPEYVIFSL 406

Query: 582 ---------KMKLLDDGQWVS--RKVLGAVP---HDIGICDPWFEVNAYCLY----DTAR 623
                    K +++ D    +  +K+ G +    ++ G  DP   +  Y  Y    DT  
Sbjct: 407 AFPENLMKYKEEIMKDPTISTDLKKLYGTIKRIANETG-KDPKGRMPHYIRYSLTVDTYE 465

Query: 624 WKDLNPKFVLQVYRDVVATGDKKFAKAVW------------------------PSVYVAM 659
             D+NP+FVL  Y     TGD++  K+V+                        PS    +
Sbjct: 466 RIDINPEFVLLYYLIAKYTGDRELLKSVYEVARNAIESIMRTQTMDGLPYLTLPSGIEWI 525

Query: 660 AYMDQFDRDGDGM----IENDGFPDQTYDTWSVSGISAYSGGLWVAALQAASALAREVGD 715
            Y++   R  D        +     QT D WS  G S Y   LW++AL+A +  ++ + +
Sbjct: 526 RYVNSMLRANDAHKILGYHSLALSMQTLDDWSWLGFSPYVSFLWISALEALNEASKLLNN 585

Query: 716 RGSEDYFLFKFQKAKVVYEK----LWNGSYFN--YDNSGSSQSSSIQADQLAGQWYARAC 769
              ++Y      + K + EK    LWNG Y+   YD   + +  S  A Q+ G WY +  
Sbjct: 586 --PQNY------EVKELIEKVNKYLWNGEYYMDWYDPISNLRDDSSNASQITGDWYVQML 637

Query: 770 GLLPIVDEDKARSALKKVYNYNVLKVMGGKRGAVNGMLPDGRVDMS---SMQSREIWSGV 826
            L   +D ++ +S    +  YN      G+ G  NG   D    +    S+QS+  WSGV
Sbjct: 638 DLPEFLDYERRKSVFSSIMKYN----YSGEEGVRNGSSNDDITPLGIKLSIQSKAPWSGV 693

Query: 827 TYAVAASMIHEDLADIGFQTACGIYEAAWSGTGLGYAFQTPEAWNTDDQYRSLCYMRPLA 886
            Y +A+ M      +   +    +YE       +   F     W          YMRPL 
Sbjct: 694 EYYLASHMFSSGFDEYAKKILRNVYERY----EIAGNFWNHIEWGAR-------YMRPLV 742

Query: 887 IWAMQWALTRPKPKTLEKQM 906
             +M +A+   K   L K++
Sbjct: 743 ALSMIYAIEGMKVNMLNKEV 762


>gi|385775519|ref|YP_005648087.1| hypothetical protein [Sulfolobus islandicus REY15A]
 gi|323474267|gb|ADX84873.1| conserved hypothetical protein [Sulfolobus islandicus REY15A]
          Length = 661

 Score = 95.1 bits (235), Expect = 2e-16,   Method: Compositional matrix adjust.
 Identities = 76/295 (25%), Positives = 132/295 (44%), Gaps = 31/295 (10%)

Query: 551 YSSFALIMLFPKIQLSIQRDFAAAVMMHDPSKMKLLDDGQWVSRKVLGAVPHDIGICDPW 610
           + S  +I++FP+++ S  +   + V                      G VPHD+G     
Sbjct: 350 FGSLPVILMFPELEKSFLKLLISYVRND-------------------GYVPHDLGFHSLD 390

Query: 611 FEVNAYCLYDTARWKDLNPKFVLQVYRDVVATGDKKFAKAVWPSVYVAMAYMDQFDRDGD 670
             ++        +WKD+NP  VL VYR    T D  F K V+P++   M +  +  RDG 
Sbjct: 391 SPIDGTT--SPPKWKDMNPSLVLLVYRYFKFTNDIDFLKEVYPTIVKVMDWELRQCRDGL 448

Query: 671 GMIENDGFPDQTYDTWSVSGISAYSGGLWVAALQAASALAREVGDRGSEDYFLFKFQKAK 730
             +E  G  D  +D   + G  +Y+  L++A+L A   +A+ VGD     +   K   A+
Sbjct: 449 PFME--GEMDNAFDATIIKGHDSYTSSLFIASLIAMREIAKLVGDSNYVGFINEKLSVAR 506

Query: 731 VVYEKLWNGSYFN-YDNSGSSQSSSIQADQLAGQWYARACGLLPIVDEDKARSALKKVYN 789
             + K++NG YF  +D  G  ++S +   QL G+W+     L  I DE+  +SAL+ +  
Sbjct: 507 EAFRKMFNGKYFKAWD--GVDKASFLA--QLYGEWFTTLLELENIADENMIKSALESIIR 562

Query: 790 YNVLKVMGGKRGAVNGMLPDGRVDMSSMQSREIWSGVTYAVAASMIHEDLADIGF 844
            N            N +  +G++   S+Q+   W  + +A+      + + D+ F
Sbjct: 563 LN---GNASPYCVPNLVDENGKIVNLSVQTYSSWPRLVFAICWLAYKKGVGDLSF 614


>gi|15898194|ref|NP_342799.1| hypothetical protein SSO1353 [Sulfolobus solfataricus P2]
 gi|13814563|gb|AAK41589.1| Hypothetical protein SSO1353 [Sulfolobus solfataricus P2]
          Length = 663

 Score = 95.1 bits (235), Expect = 2e-16,   Method: Compositional matrix adjust.
 Identities = 78/297 (26%), Positives = 137/297 (46%), Gaps = 35/297 (11%)

Query: 551 YSSFALIMLFPKIQLSIQRDFAAAVMMHDPSKMKLLDDGQWVSRKVLGAVPHDIGICDPW 610
           + S  +I++FP+++    + F   ++ H      + +DG          VPHD+G     
Sbjct: 352 FGSLPVILMFPELE----KSFLKLLIRH------IREDGY---------VPHDLGYHSLD 392

Query: 611 FEVNAYCLYDTARWKDLNPKFVLQVYRDVVATGDKKFAKAVWPSVYVAMAYMDQFDRDGD 670
             ++        RWKD+NP  +L VYR    T D +F K V+P   + +  MD   R   
Sbjct: 393 SPIDGTT--SPPRWKDMNPSLILLVYRYFKFTNDIEFLKEVYP---ILVKVMDWELRQCK 447

Query: 671 GMIE-NDGFPDQTYDTWSVSGISAYSGGLWVAALQAASALAREVGDRGSEDYFLFKFQKA 729
           G +   +G  D  +D   + G  +Y+  L++ +L A   +A+ VGD    D+   K   A
Sbjct: 448 GNLPFMEGEMDNAFDATIIKGHDSYTSSLFIGSLIAMREIAKLVGDSNYVDFISEKLSSA 507

Query: 730 KVVYEKLWNGSYFN-YDNSGSSQSSSIQADQLAGQWYARACGLLPIVDEDKARSALKKVY 788
           +  + +++NG YF  +D   S  ++S  A QL G+W+    GL  IV+ED  + AL+ + 
Sbjct: 508 REAFRRMFNGRYFKAWD---SVDNASFLA-QLYGEWFTTLVGLEDIVEEDIIKKALESII 563

Query: 789 NYNVLKVMGGKRGAVNGMLPD-GRVDMSSMQSREIWSGVTYAVAASMIHEDLADIGF 844
             N           V  ++ D G++   S+Q+   W  + +A+      + + D+ F
Sbjct: 564 RLN----GNASPHCVPNLVDDNGKIVGLSVQTYSSWPRMVFAICWLAYKKGVGDLSF 616


>gi|302348409|ref|YP_003816047.1| trehalase [Acidilobus saccharovorans 345-15]
 gi|302328821|gb|ADL19016.1| Putative trehalase [Acidilobus saccharovorans 345-15]
          Length = 928

 Score = 94.0 bits (232), Expect = 4e-16,   Method: Compositional matrix adjust.
 Identities = 93/365 (25%), Positives = 161/365 (44%), Gaps = 69/365 (18%)

Query: 547 DVHFYSSFALIMLFPKIQLSIQRDFAAA--VMMHDPSKMK--LLDDGQWVSRKVLGAVPH 602
           D H+Y  + L  L     +S+ R+       + +DP+K    + +  +   +   G VPH
Sbjct: 433 DKHYY--YELFALGFHENMSLYREMLRKDPSIQNDPAKFSEAIAEVVRRTGKDPTGRVPH 490

Query: 603 DIGICDPWFEVNAYCLYDTARWKDLNPKFVLQVYRDVVATGDKKFAKAVWPSVYVAM-AY 661
                       ++ L D+    DL P+F+L       ATGD+++ +++W  V   +   
Sbjct: 491 SF--------RASFNLIDSYDRNDLMPEFLLIALLPYYATGDREYLRSLWNDVLTVIDGT 542

Query: 662 MDQFDRDGDGMIEN----------------------DGF------------PD------Q 681
             Q D  G G+IE+                       GF            P+       
Sbjct: 543 RRQHDSRGTGLIEHYMPSDYEGLSRVAQDAAEKGLVPGFYGGNVVKLLFSGPEFVMNSVN 602

Query: 682 TYDTWSVSGISAYSGGLWVAALQAASALAREVGDRGSEDYFLFKFQKAKVVYEKLWNGSY 741
           T+DT+S+ G+++++G LW A+L+AA+  A   G   +E+      +  + + + LWNGSY
Sbjct: 603 TFDTFSLLGVASFTGDLWAASLRAAAEAASREGLARAEELRQLSLRAYESMVKLLWNGSY 662

Query: 742 FN--YDNSGSSQSSSIQADQLAGQWYARACGLLPIVDEDKARSALKKVYNYNVLKVMGGK 799
           F+  YD     +  +I + QL G+WY +  GL   VD +K RSAL++VY  N  +  G  
Sbjct: 663 FDDWYDPESGLRDRAILSAQLTGEWYLQVLGLGDGVDSEKVRSALREVYRANFRRWEGLL 722

Query: 800 RGAVNGM----------LPDGR--VDMSSMQSREIWSGVTYAVAASMIHEDLADIGFQTA 847
            G   G            P+G   ++  S Q+   W+GV + VA+ M++E + +   +  
Sbjct: 723 NGTYPGTPRPSMVGDVEEPNGTKILNRVSSQADTPWTGVEFGVASQMLYEGMVEEAMEVL 782

Query: 848 CGIYE 852
             I++
Sbjct: 783 RSIHD 787


>gi|402589212|gb|EJW83144.1| hypothetical protein WUBG_05948, partial [Wuchereria bancrofti]
          Length = 174

 Score = 93.6 bits (231), Expect = 4e-16,   Method: Composition-based stats.
 Identities = 57/159 (35%), Positives = 84/159 (52%), Gaps = 14/159 (8%)

Query: 97  VPLGGVGSGSIGRSYRGEFQRWQIFPRVCEDK--PVLANQFSVFVSRSNGQKYSSVLCPK 154
           VP G +GSGSIGR +RG F ++ + P + E K   V ANQF+   +      Y  VLC +
Sbjct: 1   VPCGTIGSGSIGRDFRGGFCKFGLRPGITEQKIGVVKANQFNP--NNLWETVYQKVLCAR 58

Query: 155 TPEVLKDTTAAGIG---SWDWNLKGDKSTYHALYPRAWTVHEGEPDPELRIVCRQISPII 211
           +      + ++G G   SWD++   DK  Y  LYPR WT +   P+    +  +Q+SP+I
Sbjct: 59  S------SVSSGRGELTSWDFSFPPDKLIYRGLYPRFWTYYS-IPEFNFILCIQQVSPVI 111

Query: 212 PHNYKESSYPVSVFTYTIYNSGKTSADITLLFTWTNSVG 250
           P+NY+ES  PV++F     N         + FT+ N  G
Sbjct: 112 PNNYEESLLPVTLFIIDAENQSSVDLQAAVAFTFRNGTG 150


>gi|385772401|ref|YP_005644967.1| hypothetical protein [Sulfolobus islandicus HVE10/4]
 gi|323476515|gb|ADX81753.1| conserved hypothetical protein [Sulfolobus islandicus HVE10/4]
          Length = 661

 Score = 93.2 bits (230), Expect = 5e-16,   Method: Compositional matrix adjust.
 Identities = 75/295 (25%), Positives = 132/295 (44%), Gaps = 31/295 (10%)

Query: 551 YSSFALIMLFPKIQLSIQRDFAAAVMMHDPSKMKLLDDGQWVSRKVLGAVPHDIGICDPW 610
           + S  +I++FP+++ S  +   + V                      G VPHD+G     
Sbjct: 350 FGSLPVILMFPELEKSFLKLLISYVRND-------------------GYVPHDLGFHSLD 390

Query: 611 FEVNAYCLYDTARWKDLNPKFVLQVYRDVVATGDKKFAKAVWPSVYVAMAYMDQFDRDGD 670
             ++        +WKD+NP  VL VYR    T D  F K V+P++   M +  +  RDG 
Sbjct: 391 SPIDGTT--SPPKWKDMNPSLVLLVYRYFKFTNDIDFLKEVYPTIVKVMDWELRQCRDGL 448

Query: 671 GMIENDGFPDQTYDTWSVSGISAYSGGLWVAALQAASALAREVGDRGSEDYFLFKFQKAK 730
             +E  G  D  +D   + G  +Y+  L++A+L A   +A+ VGD     +   K   A+
Sbjct: 449 PFME--GEMDNAFDATIIKGHDSYTSSLFIASLIAMREIAKLVGDSNYVGFINEKLSVAR 506

Query: 731 VVYEKLWNGSYFN-YDNSGSSQSSSIQADQLAGQWYARACGLLPIVDEDKARSALKKVYN 789
             + K++NG YF  +D  G  ++S +   QL G+W+     L  I +E+  +SAL+ +  
Sbjct: 507 EAFRKMFNGKYFKAWD--GVDKASFLA--QLYGEWFTTLLELENIANENMIKSALESIIR 562

Query: 790 YNVLKVMGGKRGAVNGMLPDGRVDMSSMQSREIWSGVTYAVAASMIHEDLADIGF 844
            N            N +  +G++   S+Q+   W  + +A+      + + D+ F
Sbjct: 563 LN---GNASPYCVPNLVDENGKIVNLSVQTYSSWPRLVFAICWLAYKKGVGDLSF 614


>gi|297725625|ref|NP_001175176.1| Os07g0444000 [Oryza sativa Japonica Group]
 gi|255677726|dbj|BAH93904.1| Os07g0444000 [Oryza sativa Japonica Group]
          Length = 87

 Score = 93.2 bits (230), Expect = 7e-16,   Method: Compositional matrix adjust.
 Identities = 44/77 (57%), Positives = 57/77 (74%), Gaps = 2/77 (2%)

Query: 63  IGVRILCLIREEAAKGKRAFIDPFIKR--HLTSSHGVPLGGVGSGSIGRSYRGEFQRWQI 120
           +G+R+   IREEA+ G++A IDPF +     ++S GVPLGG+GSGSI R +RGEF+ WQI
Sbjct: 3   LGLRLWSYIREEASHGRKAPIDPFTRESDKPSASQGVPLGGMGSGSISRGFRGEFKHWQI 62

Query: 121 FPRVCEDKPVLANQFSV 137
            P  CE  PV+ANQFSV
Sbjct: 63  IPGSCEMSPVMANQFSV 79


>gi|374815490|ref|ZP_09719227.1| hypothetical protein TpriZ_16630 [Treponema primitia ZAS-1]
          Length = 814

 Score = 90.1 bits (222), Expect = 5e-15,   Method: Compositional matrix adjust.
 Identities = 87/323 (26%), Positives = 140/323 (43%), Gaps = 51/323 (15%)

Query: 551 YSSFALIMLFPKIQLSIQR-DFAAAVMMHDPSKMKLLDDGQWVSRKVLGAVPHDIGICDP 609
           Y + A+  LFP+++ S++R DF          + + + +    S   L AV   +G    
Sbjct: 421 YYAQAMAYLFPELEQSMRRTDFLTETNDEGEMQFRAMRELNGASWGFLPAVDGQMGT--- 477

Query: 610 WFEVNAYCLYDTARWKDLNPKFVLQVYRDVVATGDKKFAKAVWPSVYVAMAY-MDQFDRD 668
                                 ++++YRD + +GD  F K +WP   +A+ Y +  +D D
Sbjct: 478 ----------------------IVRLYRDWLISGDDAFLKELWPKAKLALDYGIRTWDTD 515

Query: 669 GDGMIENDGFPDQTYDTWSVSGISAYSGGLWVAALQAASALAREVGDRGSEDYFLFKFQK 728
           GD ++  DG     YD     GI       ++AAL+AA  +A  + D G+ + +   +  
Sbjct: 516 GDFVL--DGMKHVDYDV-EFYGIDPLGNLCYLAALKAAVKIAEYLKDEGAHNRYQTIYDT 572

Query: 729 AKVVYEK-LWN--GSYF-----NYDNSGSSQSSSIQADQLAGQWYARACGLLPIVDEDKA 780
           A V  +K +W+    YF     + D       +   ADQL GQ+YA   GL P+VD    
Sbjct: 573 ASVQADKFMWDSKAGYFIQVLKDVDEFKYQHGTGCLADQLIGQFYAHLTGLGPLVDAGHI 632

Query: 781 RSALKKVYNYNVLKVM----GGKRG-AVN---GMLP-----DGRVDMSSMQSREIWSGVT 827
           + A + ++ YN +         +RG AVN   G+L       GR       S E WS   
Sbjct: 633 KQAAQSIFKYNFISDFTDHPNMQRGYAVNDDKGLLMTTWPLGGRPRYPFFYSEEAWSRTE 692

Query: 828 YAVAASMIHEDLADIGFQTACGI 850
           Y VA+++I+E   D G   A  +
Sbjct: 693 YHVASTLIYEGFLDEGLTIAKAV 715


>gi|76156398|gb|AAX27603.2| SJCHGC03819 protein [Schistosoma japonicum]
          Length = 192

 Score = 89.7 bits (221), Expect = 7e-15,   Method: Composition-based stats.
 Identities = 62/157 (39%), Positives = 93/157 (59%), Gaps = 8/157 (5%)

Query: 71  IREEAAKGKRAFIDPFIKRHLTSSHGVPLGGVGSGSIGRSYRGEFQRWQIFPRV-CED-K 128
           I++   + +  FID          +GVP+GG+G GSIGR +RGEF R  + P + C D +
Sbjct: 34  IKKRYIEHRLPFIDAISHVPWRPIYGVPIGGIGCGSIGRGFRGEFCRSSLIPGIYCYDIQ 93

Query: 129 PVLANQFSVFVSRSNGQKYSSVLCPKTPEVLKDTTAAGIGSWDWNLKGDKSTYHALYPRA 188
           PV  +QF V V ++N   Y+ VL P T      +   G+  W W  + +   Y  LYPR+
Sbjct: 94  PV--DQFIVTVRKNNVIIYNQVLSPLTK---PPSNGKGLRKWIWGFRPENGYYIGLYPRS 148

Query: 189 WTVHEGEPDPELRIVCRQISPIIPHNYKESSYPVSVF 225
           WTV+E  P+ +L +VC+QISP+IPH+Y+ +  PV++F
Sbjct: 149 WTVYE-IPELQLTLVCQQISPVIPHDYQVTCLPVAIF 184


>gi|327311335|ref|YP_004338232.1| hypothetical protein TUZN_1447 [Thermoproteus uzoniensis 768-20]
 gi|326947814|gb|AEA12920.1| hypothetical protein TUZN_1447 [Thermoproteus uzoniensis 768-20]
          Length = 892

 Score = 88.6 bits (218), Expect = 2e-14,   Method: Compositional matrix adjust.
 Identities = 74/228 (32%), Positives = 104/228 (45%), Gaps = 35/228 (15%)

Query: 681 QTYDTWSVSGISAYSGGLWVAALQAASALAREVGDRGSEDYFLFKFQKAKVVYEKLWNGS 740
           QT+DTW+  G+S+Y   LW AAL+A    A+  G R  E Y     ++A    E+LWNG 
Sbjct: 576 QTFDTWAFYGVSSYVLFLWAAALKAMVEGAKLTG-RSPEKYAEL-LRRAFEGLERLWNGE 633

Query: 741 YFN--YDNSGSSQSSSIQADQLAGQWYARACGLLPIVDEDKARSALKKVYNYNVLKVMGG 798
           YF+  +D     +  +  A QL GQ  A    L  + D+ +  SAL+ V  YN    +  
Sbjct: 634 YFDLWWDPVTGERDRASMAAQLFGQLLAHVADLGYLSDKQRVISALRAVAKYN----LAP 689

Query: 799 KRGAVNGMLPDGRVD----------------MSSMQSREIWSGVTYAVAASMIHEDLADI 842
             G +NGM PD R                  + + Q    W+GV YAVA  M +E L + 
Sbjct: 690 DEGLINGMYPDRRRPSFVGPTLYENFTRGPYLPTWQMDTPWTGVEYAVAGHMFYEGLVEE 749

Query: 843 GFQTACGIYEAAWSGTGLGYAFQTPEAWNTDDQYRSLCYMRPLAIWAM 890
           G      ++E    G   G+ +   E W T        YMRPL+ WA+
Sbjct: 750 GTAVLKALHERYERG---GHYWNHIE-WGTH-------YMRPLSAWAV 786


>gi|269925332|ref|YP_003321955.1| hypothetical protein Tter_0211 [Thermobaculum terrenum ATCC
           BAA-798]
 gi|269788992|gb|ACZ41133.1| protein of unknown function DUF608 [Thermobaculum terrenum ATCC
           BAA-798]
          Length = 811

 Score = 85.9 bits (211), Expect = 1e-13,   Method: Compositional matrix adjust.
 Identities = 69/229 (30%), Positives = 107/229 (46%), Gaps = 22/229 (9%)

Query: 632 VLQVYRDVVATGDKKFAKAVWPSVYVAMAYMDQF-DRDGDGMIENDGFPDQTYDTWSVSG 690
           V++VYR+   +GD  F K +WP V  A+ Y   + D+D D ++E  G    TYD     G
Sbjct: 471 VIRVYREWKFSGDDAFLKELWPKVKSAVNYSSLYWDKDQDFILE--GRQHNTYDI-EFYG 527

Query: 691 ISAYSGGLWVAALQAASALAREVGDRGSEDYFLFKFQKAKVVYEKLWNGSYF-----NYD 745
            +  +G L++ AL+AA  +A+ +G   +  Y       +K + + LWNG Y+     + +
Sbjct: 528 PNPLTGILFLGALRAAEEMAKYLGSESASSYAQAFEASSKKLDQLLWNGEYYIQKLDDPN 587

Query: 746 NSGSSQSSSIQADQLAGQWYARACGLLPIVDEDKARSALKKVYNYNV------------L 793
                 +S   +DQL GQ  A   GL  ++ ++    AL+ ++ YN             +
Sbjct: 588 EHRYQHASGCLSDQLFGQTLASLTGLGYLLPKEHISRALESIFAYNFKPNFWNHTNTQRV 647

Query: 794 KVMGGKRGAVNGMLPDG-RVDMSSMQSREIWSGVTYAVAASMIHEDLAD 841
             +G   G V    P G R       S E+WSG  Y VA  MI+E L D
Sbjct: 648 YALGDDAGLVMCTWPFGDRPSFPFPYSDEVWSGTEYQVATLMIYEGLLD 696



 Score = 67.0 bits (162), Expect = 5e-08,   Method: Compositional matrix adjust.
 Identities = 67/295 (22%), Positives = 115/295 (38%), Gaps = 23/295 (7%)

Query: 93  SSHGVPLGGVGSGSIGRSYRGEFQRWQIFPRVCEDKPVLANQFSVFVSRSNGQKYSSVL- 151
           S+   PLGG+G+G++    RGE + W+IF R  +   +    F+++         + VL 
Sbjct: 22  SAVAFPLGGIGTGNVSLGARGELRDWEIFNRPAKGFSLPNTYFALWARPEGESPVTKVLQ 81

Query: 152 -CPKTPEV----LKDTTAAGIGSWDWNLKGDKSTYHALYPRAWTVHEGEPDPELRIVCRQ 206
             P  P         T+  G+  +   L      +   YP AW   E + D  +++    
Sbjct: 82  SLPPPPHTQSHGYHPTSGLGLPHFTSTL------FRGEYPIAWVYFE-DSDLPIKVTLES 134

Query: 207 ISPIIPHNYKESSYPVSVFTYTIYNSGKTSADITLLFTWTNSVGG--DSEFTGQHYNSKT 264
            +P +P N ++S  P +   Y + N      D+ +  + TN VG      F G       
Sbjct: 135 FTPFVPLNAEDSGIPGAYLIYRVENISPKRIDVAIAASITNPVGEVRRDRFGGFSVERGG 194

Query: 265 KMND----GVHAVLLHHRTSHQLPPVTFALAAQETDGVHVSLCPHFVISGNSLGLTAKDM 320
             ND    G    L     S+ L  + +   +  T   HV+    ++ S  +   + ++ 
Sbjct: 195 NANDFRRQGRILGLFLRSNSYPLYDLRYGDLSLVTMNSHVTFKRVWLPSRWACD-SVEEF 253

Query: 321 WHEIKEHGSFDRLNSMETSVTSEPGSSIGAAIAASVTVPPDSEGQVTFSLAWDCP 375
           W +    G   +L  +     SE G +   +IAA  T+ P       F L+W  P
Sbjct: 254 WRDFSTDG---KLEDLGIEDPSEEGQTYTGSIAAMETIAPGEVKDFVFILSWFFP 305


>gi|10440618|gb|AAG16856.1|AC069145_5 hypothetical protein [Oryza sativa Japonica Group]
          Length = 143

 Score = 85.5 bits (210), Expect = 1e-13,   Method: Compositional matrix adjust.
 Identities = 45/118 (38%), Positives = 65/118 (55%), Gaps = 21/118 (17%)

Query: 27  DPAKPASLTWQRKLSTGEIPLSQFTLNWKETVQLAPIGVRILCLIREEAAKGKRAFIDPF 86
           D   P    W+R+L++    L +F++ + E +++                   +A IDPF
Sbjct: 31  DGGSPPEQAWRRRLNSHANLLKEFSVTFMEAMRM-------------------KAPIDPF 71

Query: 87  IKRHL--TSSHGVPLGGVGSGSIGRSYRGEFQRWQIFPRVCEDKPVLANQFSVFVSRS 142
            K     ++S GVPLGG+GSGSI R +RGEF+ W I P +CE  PV+ NQFSV +S S
Sbjct: 72  TKEKCKPSASQGVPLGGMGSGSISRGFRGEFKNWHIIPGLCETSPVMENQFSVMLSSS 129


>gi|297741933|emb|CBI33368.3| unnamed protein product [Vitis vinifera]
          Length = 108

 Score = 82.8 bits (203), Expect = 8e-13,   Method: Composition-based stats.
 Identities = 35/56 (62%), Positives = 39/56 (69%)

Query: 843 GFQTACGIYEAAWSGTGLGYAFQTPEAWNTDDQYRSLCYMRPLAIWAMQWALTRPK 898
            F TA GI+ A WS  G GY FQTPE W  D  +RSL YMRPLAIW MQWAL+ P+
Sbjct: 5   AFTTAEGIFTAGWSEEGYGYWFQTPEGWTIDGHFRSLIYMRPLAIWGMQWALSMPR 60


>gi|160931643|ref|ZP_02079037.1| hypothetical protein CLOLEP_00474 [Clostridium leptum DSM 753]
 gi|156869288|gb|EDO62660.1| hypothetical protein CLOLEP_00474 [Clostridium leptum DSM 753]
          Length = 804

 Score = 82.4 bits (202), Expect = 1e-12,   Method: Compositional matrix adjust.
 Identities = 65/235 (27%), Positives = 106/235 (45%), Gaps = 24/235 (10%)

Query: 632 VLQVYRDVVATGDKKFAKAVWPSVYVAMAYMDQF-DRDGDGMIENDGFPDQTYDTWSVSG 690
           +L+VYR+   +GD  F K +W  V  A+ +   + D + D ++  DG    TYD     G
Sbjct: 477 ILRVYREWKISGDDDFLKKIWNQVVSALEFSASYWDSNQDCVL--DGQQHNTYDI-EFYG 533

Query: 691 ISAYSGGLWVAALQAASALAREVGD-RGSEDYFLFKFQKAKVVYEKLWNGSYFNYDNSGS 749
           +++    ++ AAL+A + +A  +G+   S+ +   +    K + E L+NG Y+     G 
Sbjct: 534 VNSLGNSIYYAALKAGAEMAEYLGEHERSQKWRSMEQAGCKRMDEMLFNGEYYRQVTDGD 593

Query: 750 SQSSSIQ------ADQLAGQWYARACGLLPIVDEDKARSALKKVYNYNVLKVMGGKRGAV 803
                 Q      +DQL GQ  A   GL  +  ED  +SA+  +Y YN  + MG  +   
Sbjct: 594 IDEYKYQYGEGCLSDQLLGQTLAHLYGLGHLFPEDHVKSAVFAIYKYNFKERMGSHKSLQ 653

Query: 804 NGML-------------PDGRVDMSSMQSREIWSGVTYAVAASMIHEDLADIGFQ 845
            G                 GR     + S E+W+G+ Y VAA +I+E     G +
Sbjct: 654 RGYAYQDEPGLLLCSWPSGGRPKQPFVYSDEVWTGIEYQVAAGLIYEGFLQEGLE 708



 Score = 53.1 bits (126), Expect = 8e-04,   Method: Compositional matrix adjust.
 Identities = 75/291 (25%), Positives = 116/291 (39%), Gaps = 21/291 (7%)

Query: 99  LGGVGSGSIGRSYRGEFQRWQIFPRVCEDKPVLANQFSVFVSRSNGQKYSSVLCPKTPEV 158
           LGG+G+G+I  + RGE   ++IF R  +   +  + F+++ S  +G+  + VL  K P+ 
Sbjct: 24  LGGIGTGNISLNARGELCDFEIFNRQSKGLKLPYSFFALWYSLPSGESDALVLEAK-PQG 82

Query: 159 LKDTTAAGIGSWDWNL-KGDKSTYHALYPRAWTVHEGEPDPELRIVCRQISPIIPHNYKE 217
           L D       S    L + D S + + YP A    + +  P   +  +  +P IP +   
Sbjct: 83  LCDAPMGVAPSLVPGLPRFDSSCFSSAYPFANIQLKKDGLP-FEVTLQAYTPFIPLDDVN 141

Query: 218 SSYPVSVFTYTIYNSGKTSADITLLFTWTNSVGGDSEFT---------GQHYNSKTKMND 268
           S  P   F Y + N         +  T  N+ G DS  T         G+  N   K N 
Sbjct: 142 SGIPGIRFVYRLKNLLDQPVKAAVSATMPNACGFDSFSTDGLNKLYLKGRPENRVVKKNG 201

Query: 269 GVHAVLLHHRTSHQLPPVTFALAAQETDGVHVSLCPHFVISGNSLGLTAKDMWHEIKEHG 328
                        QL   T AL   E +G   S  P ++  G   G  A++ W ++ E G
Sbjct: 202 LQGICYSTDLEKEQLGFGTMALLTDEQEG--FSAKPRWLNEGWWDG--AEEYWQDLMEDG 257

Query: 329 SFDRLNSMETSVT----SEPGSSIGAAIAASVTVPPDSEGQVTFSLAWDCP 375
               + S    V+    S   S+IG +IA    + P       F L W  P
Sbjct: 258 CLGEVVSGGKPVSRINPSGEESAIG-SIAVQKVLQPGESKDFVFYLTWHFP 307


>gi|15922864|ref|NP_378533.1| hypothetical protein ST2529 [Sulfolobus tokodaii str. 7]
 gi|15623655|dbj|BAB67642.1| putative glycosidase [Sulfolobus tokodaii str. 7]
          Length = 836

 Score = 79.0 bits (193), Expect = 1e-11,   Method: Compositional matrix adjust.
 Identities = 90/401 (22%), Positives = 161/401 (40%), Gaps = 89/401 (22%)

Query: 526 EENIGQFLYLEGIEYLMWNTYDVHFYSSFALIMLFPKIQLSIQRDFAAAVMMHDPSKMKL 585
           E  +G+  Y +G      NT DV  Y  + L +L+P++  ++    + + + +D      
Sbjct: 333 ERKVGEHPYTDGPVNTALNTIDVSTYFIYTLTVLYPQLAKNLLLTSSRSALSYDNPLYIF 392

Query: 586 L------DDGQWVSRKVLG-AVPHDIGIC-------------DPWFEVNAYCL----YDT 621
                  +  ++V R +   ++P  I                DP   +  Y      +D 
Sbjct: 393 YSLAMPENRSKYVERVIKDPSIPSSIEKLLQTVKEIAKETGKDPKGRIAHYIYRDLKFDE 452

Query: 622 ARWKDLNPKFVLQVYRDVVATGDKKFAKAVWPSVYVAMAYMDQ--------FDRDGDGM- 672
               DLNP+FVL        TGD +F K+++P    AM  + +        + R   G  
Sbjct: 453 YGRNDLNPEFVLMWGLVSKYTGDIEFMKSLYPVAKEAMESVLRTHSYEGLIYSRLPSGFE 512

Query: 673 --------------------------IENDGFPDQTYDTWSVSGISAYSGGLWVAALQAA 706
                                     I++      T+D W+  GI+++   L ++AL+  
Sbjct: 513 WNRQVFSYFKDVNIYDNLFLVVSLLGIDSFHMSVNTFDDWTTIGINSFVSLLGISALKIL 572

Query: 707 SALAREVGDRGSEDYFLFKFQKAKVVYEK-LWNGSYFN--YDNSGSSQSSSIQADQLAGQ 763
           + L    GD+       +  + A  +YE  LWNG YF+  YD     +  + QA QL G+
Sbjct: 573 NELG---GDK-------YNVENALSLYESMLWNGEYFDLWYDPISGYRDKTCQASQLLGE 622

Query: 764 WYARACGLLPIVDEDKARSALKKVYNYNVLKVMGGKRGAVNGMLPDG------------R 811
           +Y    G   ++D +K R  L  +  YN+ +    + G +NG  PDG            +
Sbjct: 623 FYLNLLGY-SLLDREKTRKTLLSIVKYNLKE----EEGVINGAYPDGYRPLMREYENPLK 677

Query: 812 VDMSSMQSREIWSGVTYAVAASMIHEDLADIGFQTACGIYE 852
           +  +S+     WSGV + +A+ +I+E + D   +    +Y+
Sbjct: 678 IKEASIHQDTPWSGVEFYLASHLIYEKMIDEAKKVLKEVYD 718


>gi|150009713|ref|YP_001304456.1| hypothetical protein BDI_3128 [Parabacteroides distasonis ATCC
           8503]
 gi|149938137|gb|ABR44834.1| conserved hypothetical protein [Parabacteroides distasonis ATCC
           8503]
          Length = 901

 Score = 77.8 bits (190), Expect = 3e-11,   Method: Compositional matrix adjust.
 Identities = 93/421 (22%), Positives = 161/421 (38%), Gaps = 69/421 (16%)

Query: 70  LIREEAAKGKRAFIDPFIKRHLTSSHGVPLGGVGSGSIGRSYRGEFQRWQIFPRVCEDKP 129
            I  +   G R   D     +   + GVPLGG+G G++  +  G F R  +       + 
Sbjct: 34  FIPSKGLLGGRTDFDYLKLENHPGNSGVPLGGIGVGNVEFAPDGRFARIGLNNIHLPIRK 93

Query: 130 VLANQFSVFVSRSNGQKYSSVLCPKTPEVLKDTTAAGIGSWDWNLKGDKSTYH-ALYPRA 188
             A+ FS++     G K ++V+      +++D+         + L G +  Y+  L+PR 
Sbjct: 94  STASFFSLWYK---GAKETAVV-----RLVRDSLP------QYGLLGVEDIYYTGLFPRV 139

Query: 189 WTVHEGEPDPELRIVCRQISPIIPHNYKESSYPVSVFTYTIYNSGKTSADITLLFTWTNS 248
             +  G   P +    R  SP+IPH+ K SS PV  F   + +  +   ++++ F+W + 
Sbjct: 140 -EMDFGNSIPGINATIRAFSPLIPHDVKNSSLPVVFFDVDLVS--QEGGEVSIAFSWEDF 196

Query: 249 VGG-----------DSEFTGQHYNSKTK------------------------MNDGVHAV 273
           +G            D +  GQ  NS                           +       
Sbjct: 197 IGRGIKEPDDIKGMDGQLFGQERNSLCNGEAWPERPLEQTYSESWSCGKMQGIRQFASGP 256

Query: 274 LLHHRTSHQLPPVTFALAAQETDGVHVSLCPHFVISGNSLGLTAKDMWHEIKEHGSFDRL 333
           L   R + Q      A+ A+  + V VS+ P F I          + W   K+ G F+ +
Sbjct: 257 LKPKRANFQNYVNEVAVLAESREDVEVSVLPSFNIK------ETHEAWDLFKQEGRFEDI 310

Query: 334 NSMETSVTSEPGSSI-GAAIAASVTVPPDSEGQVTFSLAWDCPEVNFMSGK---TYY--- 386
            S + S+ S PG  + G+A+A    +    +  V F L W  PE+          YY   
Sbjct: 311 -SKDVSLLSIPGKQMGGSAVALKTKLKAKEKKTVRFMLVWFYPEMMIDRENDPLEYYWVG 369

Query: 387 -RRYTKFYGTHQNAAANIARDAILEHGSWELQIEAWQRPILEDKRLPEWYPITLFNELYY 445
              Y +++  + N    +          +  +   WQ P+L +  +P+WY   L N  Y 
Sbjct: 370 GSDYGRYFHNYFNRLEQLVDYGYGNRERFREKTTEWQLPVL-NSTMPDWYKFKLINSGYV 428

Query: 446 L 446
           +
Sbjct: 429 I 429



 Score = 57.0 bits (136), Expect = 5e-05,   Method: Compositional matrix adjust.
 Identities = 42/168 (25%), Positives = 81/168 (48%), Gaps = 10/168 (5%)

Query: 632 VLQVYRDVVATGDKKFAKAVWPSVYVAMAYMDQFDRDGDGMIENDGFPDQTYDTWSVSGI 691
           ++Q+ +D   TGD  +       VY  M ++     +G     N      TYD ++   +
Sbjct: 523 IIQLVKDYEQTGDFHYLSRYAGRVYNGMDFLKSLMPEG----LNIPIGGTTYDDFTHPPV 578

Query: 692 SAYSGGLWVAALQAASALAREVGDRGSEDYFLFKFQKA-KVVYEKLWNGSYFNY----DN 746
            +Y   +++A L+AA  +A+ + D   E+ +  +F K  K +   LWNG +F+Y    D 
Sbjct: 579 YSYGASIYLATLRAAQVVAQAMKDTVRENEYKEQFDKTQKDMIRLLWNGRFFSYGCELDG 638

Query: 747 SGSSQSSSIQADQLAGQWYARACGLLPIVDEDKARSALKKVYNYNVLK 794
           SG  + + +   QL GQ+ +R CG   ++  +  +++L   +  ++ K
Sbjct: 639 SGR-KDNLLFTGQLGGQFVSRYCGWGDVIPMEMTKASLVSQFKISLSK 685


>gi|332796016|ref|YP_004457516.1| hypothetical protein Ahos_0327 [Acidianus hospitalis W1]
 gi|332693751|gb|AEE93218.1| conserved hypothetical protein [Acidianus hospitalis W1]
          Length = 662

 Score = 75.5 bits (184), Expect = 1e-10,   Method: Compositional matrix adjust.
 Identities = 57/194 (29%), Positives = 84/194 (43%), Gaps = 16/194 (8%)

Query: 598 GAVPHDIGICDPWFEVNAYCLYDTARWKDLNPKFVLQVYRDVVATGDKKFAKAVWPSVYV 657
           G VPHD+GI    F+          RWKDLNP F+L VYR    TGD  F +  +P +  
Sbjct: 383 GYVPHDLGIWS--FDSPIEGTTAPPRWKDLNPTFILLVYRYFKFTGDVDFLRETYPKMLK 440

Query: 658 AMAYMDQFDRDGDGMIENDGFPDQTYDTWSVSGISAYSGGLWVAALQAASALAREVGDRG 717
           A  +M    R     IE +G  D  +D   + G +     L++A+  A     + + ++ 
Sbjct: 441 AYEWM--LTRS----IEAEGSGDTAFDVLPIKGKNPMLLTLFIASALALRETKKVLNEKD 494

Query: 718 SEDYFLFKFQKAKVVYEKLWNGSYFNYDNSGSSQSSSIQADQLAGQWYARACGLLPIVDE 777
                     K + +   L+NG YF            I   QL G+W+    GL  + DE
Sbjct: 495 EST----DLSKLREMLNSLYNGKYF----IAWEGQEGIFMAQLLGEWWTELLGLENVTDE 546

Query: 778 DKARSALKKVYNYN 791
           +K  SAL+ +   N
Sbjct: 547 EKISSALRYMLEVN 560



 Score = 50.8 bits (120), Expect = 0.004,   Method: Compositional matrix adjust.
 Identities = 78/368 (21%), Positives = 148/368 (40%), Gaps = 59/368 (16%)

Query: 96  GVPLGGVGSGSIGRSYRGEFQRWQIFPRVCEDKPVLANQ-FSVFVSRSNGQKYSSVLCPK 154
           GVPLGG+G+G +  + RG+     +F  +    P+ + + F +F+S   G  +       
Sbjct: 13  GVPLGGIGTGKLEINNRGKLVNITVFGNLS--SPIKSVRGFHIFISPDEGDPFF------ 64

Query: 155 TPEVLKDTTAAGIGSWDWNLKGDKSTYHALYPRAWTVHEGEPDPELRIVCRQISPIIPHN 214
              + KD     +       + D+  Y  +YP  +   +     E+++     SPIIP N
Sbjct: 65  ---LEKDLRIRNMR------EVDELYYEGVYPFCYLTGK---KGEIKVTLEAFSPIIPRN 112

Query: 215 YKESSYPVSVFTYTIYNSGKTSADITLLFTWTNSVGGDSEFTGQHYNSKTKMNDGVHAVL 274
            K+S+ P    +  +  S K     T+  ++ N VG  S   G+      K+ +G+  + 
Sbjct: 113 IKDSNIPAVGLSIKVSGSKKG----TIYLSFPNFVGSTS--IGR---INRKVENGI--IF 161

Query: 275 LHHRTSHQLPPVTFALAAQETDGVHVSLCPHFVISGNSLGLTAKDMWHEI--KEHGSFDR 332
            + RT+   P        +   G+  S  P  +IS  ++ + A++ +     K+    D 
Sbjct: 162 TNIRTNEYDP-------RRGEVGIFSSY-PTLIISQYNINVDAREAFKTFIYKDLYEDDN 213

Query: 333 LNSMETSVTSEPGSSIG-----AAIAASVTVPPDSEGQVTFSLAWDCPEVNFMSGKTYYR 387
           +      V S+    +G     A I AS     +   ++ F +AW      +   K+Y  
Sbjct: 214 VWRNINKVKSDEHEVVGFWDDPAGIIASNY---EENKEIRFVIAW------YSRAKSYQY 264

Query: 388 RYTKFYGTHQNAAANIARDAILEHGSWELQIEAWQRPILEDKRLPEWYPITLFNELYYLN 447
            Y  +Y    +    +A+  +  +     +   W+  I+ +  LP+W    + N  Y L 
Sbjct: 265 PYGFYYHNRFSGVLEVAQYFMKNYEELRKRTIEWKERIM-NINLPDWLKDAIINSSYILT 323

Query: 448 AGGAVWTD 455
           +  + W D
Sbjct: 324 S--STWLD 329


>gi|326436882|gb|EGD82452.1| hypothetical protein PTSG_11966 [Salpingoeca sp. ATCC 50818]
          Length = 1720

 Score = 73.9 bits (180), Expect = 4e-10,   Method: Compositional matrix adjust.
 Identities = 86/320 (26%), Positives = 128/320 (40%), Gaps = 42/320 (13%)

Query: 97   VPLGGVGSGSIGRSYRGEFQRWQIFPRVCEDKPVLANQFSVFVSRSNGQKYSSVLCPKTP 156
             PLG  G G I  +  G  ++WQI  R+C D  V  + F+++ +   G   + VL     
Sbjct: 885  CPLGPCGGGCIALAGDGGLRQWQILNRICHDAHVPDSFFALYTNDGKGGVDARVL----- 939

Query: 157  EVLKDTTAAGIGSWDWNLKGDKSTYHALYPRAWTVHEGEPDPELRIVCRQISPIIPHNYK 216
            ++LKD            ++G        Y    +++ G     ++     + P IP + K
Sbjct: 940  QLLKDLPHLSA----IEIEGKYPVMDVDYT---SLNNG-----IKASMECMCPCIPLDSK 987

Query: 217  ESSYPVSVFTYTIYNSGKTSADITLLFTWTNSVGGD--------SEFTGQHYNSKTKMN- 267
             S  PV  F +T+ N+   + D++LL +  N VG D        +EF G   N   KM  
Sbjct: 988  NSGLPVIFFNFTLTNTTSDAVDVSLLMSQHNLVGWDGETAIENETEFAGYGGNVNAKMTM 1047

Query: 268  DGVHAVLLHHRTSHQLPPVT--FALAAQETDGVHVSLCPHFVISGNSLGLTAKDMWHEIK 325
            DGV AV +         P     A+AA   D   VS    +           K +W    
Sbjct: 1048 DGVTAVDMTSTQFRDTDPQAGHIAIAALTGDSSEVSTMLQYD--------AHKTLWSHFA 1099

Query: 326  EHGSFDRLNSMETSVTSEPGSSIGAAIAASVTVPPDSEGQVTFSLAWDCPE--VNFMSGK 383
            + G   +L     S  S  G +   A+  S T+   +   +TF LAW  P   VN+ S K
Sbjct: 1100 KTG---KLPGQGQSTPSPSGKTWNGAVCVSRTLAAGATTTITFVLAWHFPNRYVNW-SQK 1155

Query: 384  TYYRRYTKFYGTHQNAAANI 403
             +Y   TK      NA AN 
Sbjct: 1156 GFYIDDTKSQYWLGNAYANF 1175



 Score = 52.4 bits (124), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 46/180 (25%), Positives = 85/180 (47%), Gaps = 14/180 (7%)

Query: 632  VLQVYRDVVATGDKKFAKAVWPSVYVAMA-YMDQFDRDGDGMIENDGFPDQTYDTWSVSG 690
            V++ YR+V+      + K  +  V   M  ++   D  GDG++   G    TYD  ++ G
Sbjct: 1328 VIKTYREVLMGAPSSWFKDKYHQVLKIMERWLIIMDAQGDGVVR--GAQPSTYDV-ALHG 1384

Query: 691  ISAYSGGLWVAALQAASALAREVGDRGSEDYFLFKFQKAKVVYEKLW--NGSYFN----- 743
             +++ G L++ AL+AA  +A    D  S   F  +F+ +   Y+KL   NG ++      
Sbjct: 1385 ANSFIGSLYMCALRAAEEMAVLQNDPDSAAKFRARFKLSVANYDKLCFTNGKWYTQVVDP 1444

Query: 744  -YDNSGSSQSSSIQADQLAGQWYARACGLLPIVDEDKARSALKKVYNYNVLKVMGGKRGA 802
             +D +   +S+ +  D L G+W+A+  G+  ++      S L+  + YN +      R A
Sbjct: 1445 AHDTNIIGESTFV--DSLLGEWWAQFLGIGAMLPPAHVLSTLQNCFKYNHVDAFDPARQA 1502


>gi|158316208|ref|YP_001508716.1| hypothetical protein Franean1_4431 [Frankia sp. EAN1pec]
 gi|158111613|gb|ABW13810.1| protein of unknown function DUF608 [Frankia sp. EAN1pec]
          Length = 934

 Score = 73.6 bits (179), Expect = 5e-10,   Method: Compositional matrix adjust.
 Identities = 70/309 (22%), Positives = 129/309 (41%), Gaps = 37/309 (11%)

Query: 172 WN-LKGDKSTYHALYPRAWTVHEGEPDPELRIVCRQISPIIPHNYKESSYPVSVFTYTIY 230
           WN LK    +Y  L+P  +  + G    + ++  +  SPI+ +  + +S PV+ F   + 
Sbjct: 184 WNTLKPGDGSYAVLFPFGYMTYSGF---QSKVSTKIWSPIVANEDERTSMPVAFFDMLMN 240

Query: 231 NSGKTSADITLLFTWTNS--VGGDSEFTGQHYNSKTKMNDGVHAVLLHHRTSHQLPPVT- 287
           N       I+++ T+ N+      S  TG +         G+  V L   +    P    
Sbjct: 241 NPTAKPIKISVMLTFPNAPAFATGSVRTGLYSRFDRDSASGIGGVTLGSDSPENTPDTVT 300

Query: 288 --FALAAQETDGVHVSLCPHFVISGNSLGLTAKDMWHEIKEHGSFDRLNSMETSVTSEPG 345
             + +AA    G  ++ C  +  SG+       D++      G+  +L + +   ++   
Sbjct: 301 SEWTIAAHPFAGQTLTYCTSWDGSGD-----GSDIYAPFSAAGADGKLPNGDIDQSAS-- 353

Query: 346 SSIGAAIAASVTVPPDSEGQVTFSLAWDCPEVNF-----MSGKTYYRRYTKFYGTHQNAA 400
                A+A ++T+ PD    V F+L+WD P+V +      +   + RRYT F G   +  
Sbjct: 354 ---AGAVAVALTLEPDQTQTVRFALSWDFPQVYYDGEDATTRAVWMRRYTAFLGGKTSRT 410

Query: 401 AN-------------IARDAILEHGSWELQIEAWQRPILEDKRLPEWYPITLFNELYYLN 447
            +             IAR  +  +      +E+W +PI E+ ++P W      NELY++ 
Sbjct: 411 NDYVQDSYPFRQGFTIARKELARYDDSLAAVESWWKPIAENPQVPPWLRKASLNELYHMI 470

Query: 448 AGGAVWTDG 456
             G+ W  G
Sbjct: 471 FNGSFWESG 479



 Score = 58.2 bits (139), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 80/319 (25%), Positives = 128/319 (40%), Gaps = 52/319 (16%)

Query: 624 WKDLNPKFVLQVYRDVVATGDKKFAKAVWPSVYVAMAYMDQFDRDGDGMIENDGF--PDQ 681
           ++D   K + + Y  +    D    +  +  +  A+ Y   F   G  +  +     P  
Sbjct: 613 FRDCPHKLIYRTYALIKFYDDDDLLEYGYAPMLKALTYSQFFRPTGSHLPADPASNNPPN 672

Query: 682 TYDTWSVSGISAYSGGLWVAALQAASALAREVGDRGSEDYFLFKFQK--------AKVVY 733
           T D   V+G   Y+ GL++ +LQ  S L  +    G  D    + QK        AK  +
Sbjct: 673 TMDQAVVNGHGIYNCGLYLLSLQILSTLTPQAARLGV-DEATPEIQKELDEELAAAKEEF 731

Query: 734 EKL-WN---GSYFNYDNSGS--SQSSSIQA-------------DQLAGQWYARACGLLPI 774
           E++ WN   G Y   D +G    ++ +I+              +  AGQ  A   GL  +
Sbjct: 732 ERIFWNPATGRYRYCDGTGGIGDRTGTIRGRFKPVPPPDAIWLESFAGQLVAMELGLPDV 791

Query: 775 VDEDKARSALKKVYNYNVL--KVMGGKRGAVNGMLPDGRVDMSSMQSREI---WSGVTYA 829
           VD D AR+ LK   +  V      G   G    + PD  +  SS+++ EI     G+ + 
Sbjct: 792 VDLDHARTHLKNTLDSFVRFRDPEGNLMGGPIILKPDFSIYPSSLRTTEINEVIPGIAFL 851

Query: 830 VAASM-----------IHEDLADIGFQTACGIYEAAWSGTGLGYAFQTPEAWNTDDQY-- 876
            AA             I E    +G   A  IY+   +G    YAF TPE+W  DD +  
Sbjct: 852 AAAGAFRIGAKVKDKDITEKALKLGEGCALQIYDIESNG----YAFATPESWFVDDHHIS 907

Query: 877 RSLCYMRPLAIWAMQWALT 895
           R   Y R  ++W++  A++
Sbjct: 908 RFPGYTRTRSVWSLYDAVS 926


>gi|269839491|ref|YP_003324183.1| hypothetical protein Tter_2468 [Thermobaculum terrenum ATCC
           BAA-798]
 gi|269791221|gb|ACZ43361.1| protein of unknown function DUF608 [Thermobaculum terrenum ATCC
           BAA-798]
          Length = 807

 Score = 71.2 bits (173), Expect = 3e-09,   Method: Compositional matrix adjust.
 Identities = 68/295 (23%), Positives = 119/295 (40%), Gaps = 35/295 (11%)

Query: 98  PLGGVGSGSIGRSYRGEFQRWQIFPRVCEDKPVLANQFSVFVSRSNGQKYSSVL-CPKTP 156
           PLGG+G+G++    RG+ + W+I  R  +   +    F+++   ++G+  + VL  P  P
Sbjct: 18  PLGGIGTGNVSLGARGDLRDWEICNRPAKGNRLPNTFFALWTRTASGRCVTRVLEGPIQP 77

Query: 157 -----EVLKDTTAAGIGSWDWNLKGDKSTYHALYPRAWTVHEGEPDPELRIVCRQISPII 211
                     +T AG+       +   S +   YP AW V   +P   +R+     +P+I
Sbjct: 78  PHDGSHGYHPSTGAGLP------RCRHSRFRGEYPLAW-VELADPALPVRVSLEAYTPLI 130

Query: 212 PHNYKESSYPVSVFTYTIYNSGKTSADITLLFTWTNSVGGDS-----EFTGQHYNSKTKM 266
           P   ++S  P ++ +YT+ N+      +TL+ + TN VGG +      + G   N     
Sbjct: 131 PLCPEDSGIPCAILSYTVENTSDEVVALTLVGSLTNPVGGVTFDRFGNYRGAGGNQNEYR 190

Query: 267 NDGVHAVLLHHRTSHQLPPVTFALAAQETDGVHVSLCPHFVISGNSLGLTA------KDM 320
              V+  L    T+     + +      TD        H VI+     L        ++ 
Sbjct: 191 ESEVYRGLWLRSTTLPSDDLRYGDMTLATD--------HPVITYKRAWLRGAWWDYLREF 242

Query: 321 WHEIKEHGSFDRLNSMETSVTSEPGSSIGAAIAASVTVPPDSEGQVTFSLAWDCP 375
           W ++ E G   RL  +     SEPG +   ++     + P       F L W  P
Sbjct: 243 WQDLSEDG---RLEDLGYESPSEPGKTDTCSLGLRDELAPGETHTYRFVLTWYFP 294



 Score = 67.4 bits (163), Expect = 4e-08,   Method: Compositional matrix adjust.
 Identities = 72/232 (31%), Positives = 107/232 (46%), Gaps = 20/232 (8%)

Query: 632 VLQVYRDVVATGDKKFAKAVWPSVYVAMAYMDQFDRDGDGMIENDGFPDQTYDTWSVSGI 691
           +L+V R+   +GD ++ + VWP V  A+AY      DGDG    DG    TYD     G 
Sbjct: 462 ILRVLREWRLSGDMEWLRRVWPGVKSALAYASA-QWDGDGDGVLDGRQHNTYDV-EFYGP 519

Query: 692 SAYSGGLWVAALQAASALAREVGDRGSEDYFLFKFQK-AKVVYEKLWNGSYF----NYDN 746
           +   G  ++AAL+A   LA  +G+          F++ +  + E LWNG Y+    + D 
Sbjct: 520 NPLCGLYYLAALRAVEELAEVLGEANLARRCREVFERGSHRLDELLWNGEYYEQLGDEDA 579

Query: 747 SGSSQSSSIQADQLAGQWYARACGLLPIVDEDKARSALKKVYNYNVL-----KVMGGKRG 801
                     ADQL GQ  A   G+  +V E+  R AL+ ++ YN        V   +  
Sbjct: 580 WPYQHGKGCLADQLLGQLQAHVLGMGYLVPEEHVRQALRSIWRYNFRADLSEHVNCQRTF 639

Query: 802 AVN---GML-----PDGRVDMSSMQSREIWSGVTYAVAASMIHEDLADIGFQ 845
           A+N   G+L       G+       S E+++G  Y VAA MI+E L + G Q
Sbjct: 640 ALNDEAGLLMCTWPRGGQPTFPFPYSDEVFTGSEYQVAAHMIYEGLLEEGLQ 691


>gi|383764296|ref|YP_005443278.1| hypothetical protein CLDAP_33410 [Caldilinea aerophila DSM 14535 =
           NBRC 104270]
 gi|381384564|dbj|BAM01381.1| hypothetical protein CLDAP_33410 [Caldilinea aerophila DSM 14535 =
           NBRC 104270]
          Length = 905

 Score = 70.9 bits (172), Expect = 4e-09,   Method: Compositional matrix adjust.
 Identities = 72/272 (26%), Positives = 117/272 (43%), Gaps = 60/272 (22%)

Query: 632 VLQVYRDVVATGDKKFAKAVWPSVYVAMAY------MDQFDRDGDGMIENDGFPDQTYDT 685
           VL+ YRD    GD ++ + +WPSV  ++ +      +D++D +  G++   G    T D 
Sbjct: 494 VLKTYRDWKICGDNEWLRRLWPSVKKSLEFAWHPNNIDRWDPEKTGVLW--GRQHHTLDM 551

Query: 686 WSVSGISAYSGGLWVAALQAASALAREVGDRGSEDYFLFKFQKAKVVYE-KLWNGSY--- 741
             + G +++   +++AAL+A + +A  +GD  +   +   F + K   +  L+NG Y   
Sbjct: 552 -ELFGPNSWLTSMYLAALKAGAEMAEYLGDEAAAIEYREIFARGKKWADANLFNGEYYIQ 610

Query: 742 ------------FNYDNSGSSQSSSIQA-------------------DQLAGQWYARACG 770
                       F  D S     S+++A                   DQ+ GQW+A   G
Sbjct: 611 RINLHDRSIVEAFADDESVLIGGSTLEAYWDDEHGEIKYQIGEGSSIDQVLGQWHASLYG 670

Query: 771 LLPIVDEDKARSALKKVYNYNVLKVMG------------GKRGAVNGMLPD--GRVDMSS 816
           L  I D  + RSAL  +Y YN + VMG             + G V    P    +  + +
Sbjct: 671 LGEIFDPTQVRSALAAIYKYNFIPVMGEVYNPCRIYCLNDEGGLVICAWPPHVAKPTIPA 730

Query: 817 MQSREIWSGVTYAVAASMIHEDLADIGFQTAC 848
             S+E  +G  YA A+ MI   L D G  T C
Sbjct: 731 PYSQETMNGFEYAAASHMIMNGLVDEG--TTC 760


>gi|410096859|ref|ZP_11291844.1| hypothetical protein HMPREF1076_01022 [Parabacteroides goldsteinii
           CL02T12C30]
 gi|409225476|gb|EKN18395.1| hypothetical protein HMPREF1076_01022 [Parabacteroides goldsteinii
           CL02T12C30]
          Length = 847

 Score = 70.5 bits (171), Expect = 4e-09,   Method: Compositional matrix adjust.
 Identities = 77/324 (23%), Positives = 129/324 (39%), Gaps = 49/324 (15%)

Query: 97  VPLGGVGSGSIGRSYRGEFQRWQIFPRVCEDKPVLANQFSVFVSRSNGQKYSSVLCPKTP 156
           +PLGG+G+G++    RGE + W+I         V A ++S   + +N   ++    P+  
Sbjct: 43  LPLGGIGTGTVSLGGRGELRDWEIMN-------VPAKKYSTVTTGNNAPFFAIYAKPQNQ 95

Query: 157 EVLKDTTAAGIGSWDW-NLKGDKSTYHAL-----------YPRAWTVHEGEPDPELRIVC 204
           E      A  +   ++ + +G    +H L           YP    VH  + D  +++  
Sbjct: 96  EATTTLLAGPLYPQEYLHYEGRPVNHHGLPRFAQASFDAAYPFG-QVHLSDKDLPVKVTI 154

Query: 205 RQISPIIPHNYKESSYPVSVFTYTIYNSGKTSADITLLFTWTNSVGGD-SEF-------- 255
           +  +P+IP + + S  PV+V +Y + N      +I +  +  N +G D S+F        
Sbjct: 155 KGFNPLIPGDAEASGLPVAVLSYEVTNITSQPMEIAICGSMRNFIGKDGSKFRTDWKGDY 214

Query: 256 --TGQHYN-SKTKMNDGVHAVLLHHRTSHQLPPV--TFALAAQETDGVHVSLCPHFVISG 310
             TG   N +K   N G+  + L+     +  P   T AL  Q T GV            
Sbjct: 215 IPTGVKDNKNKYVENKGIKGIYLYSDGVDKNDPAWGTVALTTQATSGVSYRTSSKSDNWN 274

Query: 311 NSLGLTAKDMWHEIKEHGSFDRLNSMETSVTSEPGSSIGAAIAASVTVPPDSEGQVTFSL 370
           N +     + W +    G     N  E     +P     A+++   TV P S    TF +
Sbjct: 275 NGI----LNFWDDFSADGMLTERNKQEDE---DPM----ASLSVKKTVKPQSTETFTFYI 323

Query: 371 AWDCPEVNFMS----GKTYYRRYT 390
            W+ P     S    G  Y R+YT
Sbjct: 324 TWNFPNRKAWSSTVVGNYYSRQYT 347



 Score = 66.6 bits (161), Expect = 6e-08,   Method: Compositional matrix adjust.
 Identities = 59/249 (23%), Positives = 109/249 (43%), Gaps = 51/249 (20%)

Query: 632 VLQVYRDVVATGDKKFAKAVWPSVYVAMAYM---DQFDRDGDGMIENDGFPDQTYDTWSV 688
           ++++YR+   +GD  F K  W  +   ++Y      +D + DG++E  G    T D  + 
Sbjct: 486 IMKIYREWQLSGDNDFLKNNWGQIKKVLSYAWTEKGWDGNQDGVME--GSQHNTMDV-NY 542

Query: 689 SGISAYSGGLWVAALQAASALAREVGDRGSEDYFLFKFQKAKVVYEK--------LWNGS 740
            G +   G  ++ AL+AA  +A  + D+G         QK + ++E+        L+NG 
Sbjct: 543 FGPNPQMGFWYMGALKAAEKMAIAMKDKGFA-------QKCRTLFEQGSNWMDKNLFNGE 595

Query: 741 YFNYD------------NSGSSQSSSIQ------ADQLAGQWYARACGLLPIVDEDKARS 782
           Y+ +             N+ + +  S Q       DQL GQ+ +  CGL  + ++D  ++
Sbjct: 596 YYEHKITDPETFEYLDMNNPNVKIPSFQLGPGCLVDQLVGQYMSHLCGLGYLGNKDHIQT 655

Query: 783 ALKKVYNYNVLK------------VMGGKRGAVNGMLPDGRVDMSSMQSREIWSGVTYAV 830
            +  +  YN +             VMG + G +    P GR+++      E+ +G  Y  
Sbjct: 656 TMNSIMKYNYVSDFSRHFNNMRSYVMGNESGLLMASWPKGRLEVPFPYFAEVMTGFEYCA 715

Query: 831 AASMIHEDL 839
           A  MI+E +
Sbjct: 716 AVGMIYEGM 724


>gi|224025536|ref|ZP_03643902.1| hypothetical protein BACCOPRO_02276 [Bacteroides coprophilus DSM
           18228]
 gi|224018772|gb|EEF76770.1| hypothetical protein BACCOPRO_02276 [Bacteroides coprophilus DSM
           18228]
          Length = 846

 Score = 70.1 bits (170), Expect = 5e-09,   Method: Compositional matrix adjust.
 Identities = 62/251 (24%), Positives = 107/251 (42%), Gaps = 51/251 (20%)

Query: 632 VLQVYRDVVATGDKKFAKAVWPSVYVAMAYM---DQFDRDGDGMIENDGFPDQTYDTWSV 688
           +++ YRD   +GD +F K  W  V   ++Y      +D + DG++E  G    T D  + 
Sbjct: 484 IMKFYRDWQLSGDSRFLKENWAQVKKVLSYAWTEHGWDDNQDGVME--GSQHNTMDV-NY 540

Query: 689 SGISAYSGGLWVAALQAASALAREVGDRGSEDYFLFKFQKAKVVYEK--------LWNGS 740
            G +   G  ++ AL+AA  +A  + DR  E       +K + ++E         L+NG 
Sbjct: 541 FGPNPQMGFWYLGALKAAEKMAHAMKDRAFE-------KKCRALFENGSEWLDNNLFNGE 593

Query: 741 YFNYD------------NSGSSQSSSIQ------ADQLAGQWYARACGLLPIVDEDKARS 782
           Y+ +             N   ++    Q       DQL GQ+ A  CGL  +  ++  R+
Sbjct: 594 YYEHKITDPETFEFLDMNDPDTKIPDFQLGKGCLVDQLVGQYMAHICGLGYLAKKEHIRT 653

Query: 783 ALKKVYNYNVLK------------VMGGKRGAVNGMLPDGRVDMSSMQSREIWSGVTYAV 830
            LK +  YN  +            V+G + G +    P GR+++      E+ +G  Y  
Sbjct: 654 TLKSIMRYNFKEDFSRHFNNMRSYVLGDEAGLLMASWPKGRLEVPFPYFAEVMTGFEYCA 713

Query: 831 AASMIHEDLAD 841
           A  MI+E + +
Sbjct: 714 ATGMIYEGMEE 724



 Score = 58.9 bits (141), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 65/342 (19%), Positives = 134/342 (39%), Gaps = 59/342 (17%)

Query: 87  IKRHLTSSH----GVPLGGVGSGSIGRSYRGEFQRWQIFPRVCEDKPVLANQFSVFVSRS 142
           + RH    H     +PLGG+G+G++    RGE + W+I         V   ++S   + +
Sbjct: 27  VLRHYDQQHLYQIALPLGGIGTGTVSLGGRGELRDWEIMN-------VPGKKYSTVTTGN 79

Query: 143 NGQKYSSVLCPKTPEVLKDTTAAGIGSWDW--------NLKG----DKSTYHALYPRAWT 190
           N   ++    P+  E      A  +   ++        N  G      S++ A YP    
Sbjct: 80  NAPFFAIYTKPEKGEAQTTLLAGALYDQEYLHYEGRPVNHHGLPRFKNSSFDAAYPFG-Q 138

Query: 191 VHEGEPDPELRIVCRQISPIIPHNYKESSYPVSVFTYTIYNSGKTSADITLLFTWTNSVG 250
           VH  +    + +  +  +P++P + + S  P++V  Y + N+ +T  D+++  +  N +G
Sbjct: 139 VHLSDKGIPVEVTVKGFNPLVPGDAEASGIPIAVLCYEVTNTSQTPVDVSICGSIRNFIG 198

Query: 251 GD-----SEFTGQHYNSKTKMNDGVHAVLLHHRTSHQLPPVTF-ALAAQETDGVHVSLCP 304
            D     +++ G +     K N  +      ++++  L  + F +   ++TD  + ++  
Sbjct: 199 KDGSKFVTDWKGDYIPVGCKDNRNI------YKSAQGLQGIYFCSEGVEKTDPAYGNMT- 251

Query: 305 HFVISGNSLGLTAKD-------------MWHEIKEHGSFDRLNSMETSVTSEPGSSIGAA 351
             +++    G+T +               W +    G    L  M     ++P     A+
Sbjct: 252 --LVTNAPEGVTYRTSSKADDWSNGILGFWDDFSADG---ELTEMSKQYDADP----MAS 302

Query: 352 IAASVTVPPDSEGQVTFSLAWDCPEVNFMSGKTYYRRYTKFY 393
           +A    + P      TF L W+ P     S       Y++ Y
Sbjct: 303 LAVKKKLAPGETRSFTFFLTWNFPNRKAWSSSVIGNYYSQQY 344


>gi|320109368|ref|YP_004184958.1| hypothetical protein AciPR4_4219 [Terriglobus saanensis SP1PR4]
 gi|319927889|gb|ADV84964.1| hypothetical protein AciPR4_4219 [Terriglobus saanensis SP1PR4]
          Length = 898

 Score = 70.1 bits (170), Expect = 5e-09,   Method: Compositional matrix adjust.
 Identities = 56/167 (33%), Positives = 86/167 (51%), Gaps = 10/167 (5%)

Query: 632 VLQVYRDVVATGDKKFAKAVWPSVYVAMAYMDQFDRDGDGMIENDGFPDQTYDTWSVSGI 691
           V+Q+ +D   TGD  F K     +    AYM    RD D  I   G   QTYD +    I
Sbjct: 532 VIQLAKDYQQTGDDTFIKKNADHIRRGFAYMKAQIRD-DSFIPVGG---QTYDDFPHPEI 587

Query: 692 SAYSGGLWVAALQAASALAREVGDR--GSEDYFLFKFQKAKVVYEKLWNGSYFNY--DNS 747
           S Y+G +++AAL+A   L   +GD     E    FK  +A +++  LWNG +F Y  D  
Sbjct: 588 SVYTGTVYLAALRAGVVLGDALGDTQMSHEAETQFKQTQAGLIH-ALWNGRFFAYGTDIG 646

Query: 748 GS-SQSSSIQADQLAGQWYARACGLLPIVDEDKARSALKKVYNYNVL 793
           G+  +   + + QLAGQ+ +R  G   ++  D+A+SA+ +  + +VL
Sbjct: 647 GAHRRDDRLFSGQLAGQFLSRYAGWGDVLPHDQAKSAIYEQLHTSVL 693


>gi|283835266|ref|ZP_06355007.1| conserved hypothetical protein [Citrobacter youngae ATCC 29220]
 gi|291068424|gb|EFE06533.1| conserved hypothetical protein [Citrobacter youngae ATCC 29220]
          Length = 896

 Score = 69.3 bits (168), Expect = 1e-08,   Method: Compositional matrix adjust.
 Identities = 68/272 (25%), Positives = 119/272 (43%), Gaps = 67/272 (24%)

Query: 632 VLQVYRDVVATGDKKFAKAVWPSVYVAMAYM------DQFDRDGDGMIENDGFPDQTYDT 685
           VL +YRD +  GD+ + +  W +V   + +       DQ+D +  G++   G    T D 
Sbjct: 470 VLAIYRDWLLCGDRAWLEEKWSAVRDMLRFAWNPDNPDQWDPEQRGVLT--GRQHHTLDM 527

Query: 686 WSVSGISAYSGGLWVAALQAASALAREVGDRGSEDYFLFK--FQKAKV-VYEKLWNGSYF 742
             + G + +  GL++AAL+A  A+A  +G   ++D  L++   ++ +V + E+L+NG YF
Sbjct: 528 -ELFGPNPWLTGLYIAALEAGCAIANHLGH--NDDAMLWQDIARRGRVTLRERLFNGEYF 584

Query: 743 ----------------------NYDNSGS---------------SQSSSIQADQLAGQWY 765
                                 +Y  SG                   S    DQL GQW+
Sbjct: 585 QQVIDLANRDTLAPWRTSAESADYGLSGDVWQQYWCEENHEVKYQFGSGCHIDQLMGQWF 644

Query: 766 ARACGLLPIVDEDKARSALKKVYNYNVLKV-----------MGGKRGAVNGMLPDGRVDM 814
           A  CGL P+ D  + + AL+ ++ YN +             +  ++G      P G  DM
Sbjct: 645 AGLCGLPPVFDPGQTQRALRALWQYNFVSARHHTNPCRLYGLNDEQGVSICSWPTG--DM 702

Query: 815 SSMQ---SREIWSGVTYAVAASMIHEDLADIG 843
            ++    ++E+ +G  Y +AA M+   +A  G
Sbjct: 703 PAIPVPYAQEMMTGFEYQLAALMVASGMATEG 734



 Score = 48.5 bits (114), Expect = 0.016,   Method: Compositional matrix adjust.
 Identities = 68/294 (23%), Positives = 117/294 (39%), Gaps = 31/294 (10%)

Query: 98  PLGGVGSGSIGRSYRGEFQRWQIFPRVCEDKPVLANQFSVFVSRSNGQKYSSVL------ 151
           PLGG+G+G I  +  G     +IF    +        F+V   R+     + +L      
Sbjct: 15  PLGGIGTGCISLAGNGHLTDVEIFNHPDKGYHHGYTHFAVKAERNGALVDARILQGDWQG 74

Query: 152 --CPK---TPEVLKDTTAAGIGSWDWNLKGDKSTYHALYPRAWTVHEGE-PDPELRIVCR 205
             C +      + K  T  G+G     L G     H+ +   + + +    DPE     R
Sbjct: 75  NACGQWQIHDRLRKRHTGYGVGPDAETLAGMPHFAHSEFDGEFPMAQMSFSDPEFPGNIR 134

Query: 206 QIS--PIIPHNYKESSYPVSVFTYTIYNSGKTSADITLLFTWTNSVGGDSEFTGQHYNSK 263
            ++  P IP +   SS P++ F   + N+   + D TL  +  N +  DS       +++
Sbjct: 135 MLAFNPFIPCDEDNSSLPLAWFELEVENTTDAALDYTLCLSLCNGLAPDST------HNR 188

Query: 264 TKMNDGVHAVLLHHRTSHQLPPVTFALAAQETDGVHVSLCPHFVISG--NSLGLTAKDMW 321
            ++ +  H + L  R      P  +A  A  TD   VS   ++      +SL +  +D  
Sbjct: 189 HEIREDRHMMTLTQRHYPADDP-RYAELATTTDAQDVSWQTYWYRGDWFDSLTMFWRDFC 247

Query: 322 HEIKEHGSFDRLNSMETSVTSEPGSSIGAAIAASVTVPPDSEGQVTFSLAWDCP 375
            +        RL   E +  S+PG +  A++AA + +     G+V F   W CP
Sbjct: 248 GQ-------PRLPEREYT-QSQPGFADHASLAAHLHLAAGERGRVRFWTGWYCP 293


>gi|224535965|ref|ZP_03676504.1| hypothetical protein BACCELL_00829 [Bacteroides cellulosilyticus
           DSM 14838]
 gi|224522420|gb|EEF91525.1| hypothetical protein BACCELL_00829 [Bacteroides cellulosilyticus
           DSM 14838]
          Length = 899

 Score = 68.6 bits (166), Expect = 1e-08,   Method: Compositional matrix adjust.
 Identities = 91/396 (22%), Positives = 159/396 (40%), Gaps = 76/396 (19%)

Query: 96  GVPLGGVGSGSIGRSYRGEFQRWQIFPRVCEDKPVLANQFSVFVSRSNGQKYSSVLCPKT 155
           GVPLGG+G G+I  +  G+F R  +       K    + FS++  + N ++  +V   + 
Sbjct: 64  GVPLGGIGVGNINFAPSGKFTRIGMNNIHTPIKRSEHSFFSLWTRKGNEKE--AVRLVRD 121

Query: 156 PEVLKDTTAAGIGSWDWNLKGDKST-YHALYPRAWTVHEGEPDPELRI--VCRQISPIIP 212
             VL            + +KG + T Y  L+P   T      D  L++  V R  S ++P
Sbjct: 122 NHVL------------YGMKGVEHTQYKGLFP---TAELSYADNNLKVTPVIRAYSGLVP 166

Query: 213 HNYKESSYPVSVFTYTIYNSGKTSADITLLFTWTNSVG--GDSEF-----TGQHYNS-KT 264
           HN K+SS PV  F   + +     A   L F+W + +G   D +       GQ  +  + 
Sbjct: 167 HNVKDSSLPVVWFEVDLISQEDMEA--ALAFSWEDFIGLFNDPKSLEGFDNGQLLSEGRA 224

Query: 265 KMNDG--------------------VHAVLLHHRTSHQLPPVTF-------ALAAQETDG 297
            +N+G                    +  ++ +   S Q   +TF        +A +E   
Sbjct: 225 NINNGENWPLREKAKTYVEPYQMGSLKGLIQYAADSLQPRKLTFQNYVNQVVIAVEEEK- 283

Query: 298 VHVSLCPHFVISGNSLGLTAKDMWHEIKEHGSFDRLNSMETSVTSEPGSSIGAAIAASVT 357
            +VS  P +         +  + W + + +G F   +  +  +T +  +S  +A+A    
Sbjct: 284 -NVSYLPTY--------RSNSEAWDQFRNNGEFTS-SLTKNVLTEQSQTSSASALAVKTQ 333

Query: 358 VPPDSEGQVTFSLAWDCPEVNFMSGK----TYY---RRYTKFYGTHQNAAANIARDAILE 410
           +    +  + F LAW  PE+   +      +Y+     Y K+Y  + N   ++   A+  
Sbjct: 334 LKAGQKKTIRFMLAWYAPELQIDAAALPIGSYWPCGADYNKYYHNYFNCMNSMVSYAVSN 393

Query: 411 HGSWELQIEAWQRPILEDKRLPEWYPITLFNELYYL 446
                 Q   WQ P+LE   LP+WY   L N  Y +
Sbjct: 394 RARIARQTTEWQIPVLESS-LPDWYKFKLINSGYVI 428



 Score = 59.3 bits (142), Expect = 9e-06,   Method: Compositional matrix adjust.
 Identities = 44/157 (28%), Positives = 78/157 (49%), Gaps = 8/157 (5%)

Query: 632 VLQVYRDVVATGDKKFAKAVWPSVYVAMAYMDQFDRDGDGMIENDGFPDQTYDTWSVSGI 691
           ++Q+ +D   TGD ++ KA    +  AM ++  + R   G     G    TYD ++   +
Sbjct: 522 IIQLVKDYEQTGDTEYLKAHLTGLKRAMKFL--YSRMPQGSTIPVG--PTTYDDFTHPPL 577

Query: 692 SAYSGGLWVAALQAASALAREVGDRGSEDYFLFKFQKA-KVVYEKLWNGSYFNY--DNSG 748
            +Y  G+W+  L+A  A+ + +GD         +F  + K   EKLWNG +F Y  +  G
Sbjct: 578 YSYYAGVWLTTLKAYEAIGKAIGDESIVKQAQQQFATSQKEALEKLWNGRFFAYGCEPDG 637

Query: 749 SSQSSSIQ-ADQLAGQWYARACGLLPIVDEDKARSAL 784
           S +  ++    QLAGQ+ +R CG   +   D  ++++
Sbjct: 638 SKRLDNVLFTGQLAGQFLSRYCGWGDVYPMDIVKASM 674


>gi|89068658|ref|ZP_01156050.1| hypothetical protein OG2516_16434 [Oceanicola granulosus HTCC2516]
 gi|89045845|gb|EAR51906.1| hypothetical protein OG2516_16434 [Oceanicola granulosus HTCC2516]
          Length = 876

 Score = 68.6 bits (166), Expect = 2e-08,   Method: Compositional matrix adjust.
 Identities = 94/375 (25%), Positives = 144/375 (38%), Gaps = 54/375 (14%)

Query: 92  TSSHGV-PLGGVGSGSIGRSYRGEFQRWQIFPRVCEDKPVLANQFSVFVSRS-------- 142
           TS + V PLGG+G+G++G    G  Q W+IF R  +      + F+V   R+        
Sbjct: 13  TSPYAVFPLGGIGAGTVGIGADGRLQDWEIFNRPAKGSVNGFSHFAVRAERAGEVVDTRV 72

Query: 143 -----NGQKYSSVLCPKTPEVLKDTTAAGIGSWDWNLKG----DKSTYHALYPRAWTVHE 193
                 G +  S L P       +    G G     + G      S+    YP A    E
Sbjct: 73  LHGPFTGNRTGSFLAP-------EFNGYGFGPRRQQMTGFPSFRSSSLAGPYPAADLTFE 125

Query: 194 GEPDPELRIVCRQISPIIPHNYKESSYPVSVFTYTIYNSGKTSADITLLFTWTNSVGGDS 253
            +P     +  R +SP +P   ++SS PV++F  T  N G T  D TL            
Sbjct: 126 -DPAFPGAVDMRALSPFVPLEDRDSSMPVAMFEVTFNNPGDTPLDYTLFGCLA------F 178

Query: 254 EFTGQHYNSKTKMNDGVHAVLLHHRTSHQLPP--VTFALAAQETDGVHVSLCPHFVISGN 311
           +F     N++  +  G   +L    T  +  P     +LA    D  H     H +  GN
Sbjct: 179 DFVDLEGNAEVDIATGDRTILTGRTTLGEDHPDHRQLSLATDHDDTSH----QHHLYRGN 234

Query: 312 SLGLTAKDMWHEIKEHGSF-DR-LNSMETSVTSEPGSSI-GAAIAASVTVPPDSEGQVTF 368
               T +  W +I   G   DR   S ++SV +    +I  + +AA VTVP      V  
Sbjct: 235 WFD-TMEVYWADIARGGRLRDRKYPSAKSSVANNQADAIEHSLLAAHVTVPAGESRTVRM 293

Query: 369 SLAWDCPEVNFMSGKTY-----------YRRYTKFYGTHQNAAANIARDAILEHGSWELQ 417
           +L+W  P        T+              +  +Y T  ++A  +AR A+        +
Sbjct: 294 ALSWYVPNFRKYWLTTFPMACCKPSAEVQTHWKNYYATQWSSAEEVARVALDGWADLRDR 353

Query: 418 IEAWQRPILEDKRLP 432
             A+ R +L D  LP
Sbjct: 354 TLAF-RNVLSDTTLP 367



 Score = 57.0 bits (136), Expect = 5e-05,   Method: Compositional matrix adjust.
 Identities = 48/209 (22%), Positives = 92/209 (44%), Gaps = 46/209 (22%)

Query: 632 VLQVYRDVVATGDKKFAKAVWPSVYVAMAYM------DQFDRDGDGMIENDGFPDQTYDT 685
           V++ YRD   +GD  + +++WPS+  A+ +       D++D D  G++   G    T D 
Sbjct: 474 VMKAYRDWKLSGDDAWLRSLWPSIKSAIEFAWHPDNPDRWDPDQTGVLS--GRQHHTLDM 531

Query: 686 WSVSGISAYSGGLWVAALQAASALAREVGDRGSEDYFLFKFQKAKV-VYEKLWNGSYFN- 743
             + G +++  G ++ AL A + +A  VG+  +   +   F+K K  V E+L+NG +F  
Sbjct: 532 -ELFGPNSWLAGFYLGALSAGAEMAEFVGEADTAAQYRALFEKGKAWVGEELFNGEWFAQ 590

Query: 744 ------------YDNSGSSQS-----------------------SSIQADQLAGQWYARA 768
                       +   G   S                       +    DQ+  +W+AR 
Sbjct: 591 KIDIADRDTLAPFQEPGGDASHISGSIFEQYWSEEHGELKYQIGAGCAIDQVLAEWHARL 650

Query: 769 CGLLPIVDEDKARSALKKVYNYNVLKVMG 797
            GL  + D +   +A++ +Y +N ++ +G
Sbjct: 651 YGLDHLYDPNDFATAVRAIYEHNFVERLG 679


>gi|423347418|ref|ZP_17325105.1| hypothetical protein HMPREF1060_02777 [Parabacteroides merdae
           CL03T12C32]
 gi|409217877|gb|EKN10851.1| hypothetical protein HMPREF1060_02777 [Parabacteroides merdae
           CL03T12C32]
          Length = 837

 Score = 68.6 bits (166), Expect = 2e-08,   Method: Compositional matrix adjust.
 Identities = 62/243 (25%), Positives = 106/243 (43%), Gaps = 39/243 (16%)

Query: 632 VLQVYRDVVATGDKKFAKAVWPSVYVAMAYM---DQFDRDGDGMIENDGFPDQTYDTWSV 688
           V+++YRD   +GD +F +  W  V   +AY      +D + DG++E  G    T D  + 
Sbjct: 476 VMKIYRDWQLSGDDEFLQKNWGQVKKVLAYAWTDKGWDGNQDGVME--GLQHNTMDV-NY 532

Query: 689 SGISAYSGGLWVAALQAASALAREVGDR--GSEDYFLFKFQKAKVVYEKLWNGSYFNYDN 746
            G +   G  ++ AL+AA  +A  + D+    +   LF+ Q +  +   L+NG Y+ +  
Sbjct: 533 FGPNPQMGFWYMGALKAAEKMALAMKDKTFAKKCNTLFR-QGSTWMDANLFNGEYYEHKI 591

Query: 747 SGSSQSSSIQ------------------ADQLAGQWYARACGLLPIVDEDKARSALKKVY 788
           +       +                    DQL GQ+ A  CGL  + D++  R+ L  + 
Sbjct: 592 TDPETFEYLDMRNPDVKVPPFQLGKGCLVDQLVGQYMAHICGLGYLGDKEHIRTTLGSIM 651

Query: 789 NYNVLK------------VMGGKRGAVNGMLPDGRVDMSSMQSREIWSGVTYAVAASMIH 836
            YN +K            VMG + G +    P GR+++      E+ +G  Y  A  MI+
Sbjct: 652 KYNYVKDFSRHFNNMRSYVMGDESGLLMASWPKGRLEVPFPYFAEVMTGFEYCAAVGMIY 711

Query: 837 EDL 839
           E +
Sbjct: 712 EGM 714



 Score = 62.8 bits (151), Expect = 8e-07,   Method: Compositional matrix adjust.
 Identities = 64/305 (20%), Positives = 121/305 (39%), Gaps = 45/305 (14%)

Query: 97  VPLGGVGSGSIGRSYRGEFQRWQIFPRVCEDKPVLANQFSVFVSRSNGQKYSSVLCPKTP 156
           +PLGG+G+G++    RGE + W+I         V   ++S   + +N   ++    P+T 
Sbjct: 33  LPLGGIGTGTVSLGGRGELRDWEIM-------NVPGKKYSTVTTGNNAPFFAIYAKPQTG 85

Query: 157 EVLKDTTAAGIGSWDW--------NLKG----DKSTYHALYPRAWTVHEGEPDPELRIVC 204
           E +    A  +   ++        N  G      +T+ A YP    VH  +    +++  
Sbjct: 86  EAMTTLLAGPLYPQEYLHYEGRPVNHHGMPRFADATFEAAYPFG-QVHLSDRQLPVKVTV 144

Query: 205 RQISPIIPHNYKESSYPVSVFTYTIYNSGKTSADITLLFTWTNSVGGD-----SEFTGQH 259
           +  +P++P +   S  P++V  Y + N+     ++ +  +  N +G D     +++ G +
Sbjct: 145 KGFNPLVPGDADASGMPIAVLAYEVTNTTDRPLEVAVCGSMRNFIGKDGSKFRTDWKGDY 204

Query: 260 YNSKTKMN-------DGVHAVLLHHRTSHQLPPV--TFALAAQETDGVHVSLCPHFVISG 310
             +  K N       +G+  + L+     +  P   T AL  Q  +GV            
Sbjct: 205 IPTGAKDNKNQFRKKNGLQGLYLYSDGVDKNDPAYGTVALTTQAVEGVTYRTSSRADDWN 264

Query: 311 NSLGLTAKDMWHEIKEHGSFDRLNSMETSVTSEPGSSIGAAIAASVTVPPDSEGQVTFSL 370
           N +     + W +    G     +  E     +P +S    +A   TV P S    TF L
Sbjct: 265 NGI----LNFWDDFSADGELTERDRQE---DEDPMAS----LAVKKTVGPGSTETFTFYL 313

Query: 371 AWDCP 375
            W+ P
Sbjct: 314 TWNFP 318


>gi|154490957|ref|ZP_02030898.1| hypothetical protein PARMER_00874 [Parabacteroides merdae ATCC
           43184]
 gi|423724888|ref|ZP_17699030.1| hypothetical protein HMPREF1078_02927 [Parabacteroides merdae
           CL09T00C40]
 gi|154088705|gb|EDN87749.1| hypothetical protein PARMER_00874 [Parabacteroides merdae ATCC
           43184]
 gi|409236060|gb|EKN28869.1| hypothetical protein HMPREF1078_02927 [Parabacteroides merdae
           CL09T00C40]
          Length = 837

 Score = 68.6 bits (166), Expect = 2e-08,   Method: Compositional matrix adjust.
 Identities = 62/243 (25%), Positives = 106/243 (43%), Gaps = 39/243 (16%)

Query: 632 VLQVYRDVVATGDKKFAKAVWPSVYVAMAYM---DQFDRDGDGMIENDGFPDQTYDTWSV 688
           V+++YRD   +GD +F +  W  V   +AY      +D + DG++E  G    T D  + 
Sbjct: 476 VMKIYRDWQLSGDDEFLQKNWGQVKKVLAYAWTDKGWDGNQDGVME--GLQHNTMDV-NY 532

Query: 689 SGISAYSGGLWVAALQAASALAREVGDR--GSEDYFLFKFQKAKVVYEKLWNGSYFNYDN 746
            G +   G  ++ AL+AA  +A  + D+    +   LF+ Q +  +   L+NG Y+ +  
Sbjct: 533 FGPNPQMGFWYMGALKAAEKMALAMKDKTFAKKCNTLFR-QGSTWMDANLFNGEYYEHKI 591

Query: 747 SGSSQSSSIQ------------------ADQLAGQWYARACGLLPIVDEDKARSALKKVY 788
           +       +                    DQL GQ+ A  CGL  + D++  R+ L  + 
Sbjct: 592 TDPETFEYLDMRNPDVKVPPFQLGKGCLVDQLVGQYMAHICGLGYLGDKEHIRTTLGSIM 651

Query: 789 NYNVLK------------VMGGKRGAVNGMLPDGRVDMSSMQSREIWSGVTYAVAASMIH 836
            YN +K            VMG + G +    P GR+++      E+ +G  Y  A  MI+
Sbjct: 652 KYNYVKDFSRHFNNMRSYVMGDESGLLMASWPKGRLEVPFPYFAEVMTGFEYCAAVGMIY 711

Query: 837 EDL 839
           E +
Sbjct: 712 EGM 714



 Score = 62.8 bits (151), Expect = 8e-07,   Method: Compositional matrix adjust.
 Identities = 64/305 (20%), Positives = 121/305 (39%), Gaps = 45/305 (14%)

Query: 97  VPLGGVGSGSIGRSYRGEFQRWQIFPRVCEDKPVLANQFSVFVSRSNGQKYSSVLCPKTP 156
           +PLGG+G+G++    RGE + W+I         V   ++S   + +N   ++    P+T 
Sbjct: 33  LPLGGIGTGTVSLGGRGELRDWEIM-------NVPGKKYSTVTTGNNAPFFAIYAKPQTG 85

Query: 157 EVLKDTTAAGIGSWDW--------NLKG----DKSTYHALYPRAWTVHEGEPDPELRIVC 204
           E +    A  +   ++        N  G      +T+ A YP    VH  +    +++  
Sbjct: 86  EAMTTLLAGPLYPQEYLHYEGRPVNHHGMPRFADATFEAAYPFG-QVHLSDRQLPVKVTV 144

Query: 205 RQISPIIPHNYKESSYPVSVFTYTIYNSGKTSADITLLFTWTNSVGGD-----SEFTGQH 259
           +  +P++P +   S  P++V  Y + N+     ++ +  +  N +G D     +++ G +
Sbjct: 145 KGFNPLVPGDADASGMPIAVLAYEVTNTTDRPLEVAVCGSMRNFIGKDGSKFRTDWKGDY 204

Query: 260 YNSKTKMN-------DGVHAVLLHHRTSHQLPPV--TFALAAQETDGVHVSLCPHFVISG 310
             +  K N       +G+  + L+     +  P   T AL  Q  +GV            
Sbjct: 205 IPTGAKDNKNQFRKKNGLQGLYLYSDGVDKNDPAYGTVALTTQAVEGVTYRTSSRADNWN 264

Query: 311 NSLGLTAKDMWHEIKEHGSFDRLNSMETSVTSEPGSSIGAAIAASVTVPPDSEGQVTFSL 370
           N +     + W +    G     +  E     +P +S    +A   TV P S    TF L
Sbjct: 265 NGI----LNFWDDFSADGELTERDRQE---DEDPMAS----LAVKKTVGPGSTETFTFYL 313

Query: 371 AWDCP 375
            W+ P
Sbjct: 314 TWNFP 318


>gi|348173841|ref|ZP_08880735.1| hypothetical protein SspiN1_25464 [Saccharopolyspora spinosa NRRL
           18395]
          Length = 995

 Score = 67.8 bits (164), Expect = 3e-08,   Method: Compositional matrix adjust.
 Identities = 73/254 (28%), Positives = 122/254 (48%), Gaps = 41/254 (16%)

Query: 632 VLQVYRDVVATGDKKFAKAVWPSVYVAMAYMDQFDRDGDGMIENDGFPDQTYD-TWSVSG 690
           +L++YR+   + D ++ + VWP    A+ Y+   D + DG++  DG    T D TW   G
Sbjct: 608 ILRIYREHQMSPDSQWLQRVWPRTRQAVEYLIASDAEPDGIL--DGAQPNTQDATW--FG 663

Query: 691 ISAYSGGLWVAALQAASALAREVGDRGSEDYFLFKFQK-----AKVVYEKLWNGSYFNYD 745
            +++  GL++AAL+A +A+A EVGD    D F  +  +      +V+   L+NG YF ++
Sbjct: 664 RNSWLSGLYLAALRAGAAMAGEVGD----DAFARRCTELATSGTEVIVRDLFNGEYFVHE 719

Query: 746 ----NSGSSQSS-SIQADQLAGQWYARACGLLPIVDEDKARSALKKVYNYNVL------- 793
               + GS  ++    ADQL G  +A   GL  ++  +  RSAL+ ++ +N +       
Sbjct: 720 LDPAHPGSVNTNRGCFADQLLGPSWATQLGLPRVLPPEPTRSALRSIWRHNFVPRPMEYR 779

Query: 794 ---KVMGGK-------RGAVNGMLPDGRVDM-----SSMQSREIWSGVTYAVAASMIHED 838
               + GG+          V    P+G  D      S     E W G+ Y VAA +I E 
Sbjct: 780 QNSPIEGGRIFYDADVPALVMCTWPNGGGDEAGDNWSVSYFNEAWHGIEYQVAAHLIAEG 839

Query: 839 LADIGFQTACGIYE 852
           + D G   A  +++
Sbjct: 840 MVDEGLAVARSVHD 853



 Score = 46.2 bits (108), Expect = 0.093,   Method: Compositional matrix adjust.
 Identities = 36/158 (22%), Positives = 63/158 (39%), Gaps = 17/158 (10%)

Query: 96  GVPLGGVGSGSIGRSYRGEFQRWQIFPRVCEDKPVLANQFSVFVSRSNGQKYSSVLCPKT 155
           G+P+GG+G G +     G    W +     +++   AN         N   ++  + P +
Sbjct: 28  GLPIGGIGCGQLHLGGDGRLWLWDV-----DNRTAPAN--------INDLHFTRPMLPSS 74

Query: 156 PE----VLKDTTAAGIGSWDWNLKGDKSTYHALYPRAWTVHEGEPDPELRIVCRQISPII 211
           P      ++ T    + +   + +G      A  P A TV   +P   +++     SP I
Sbjct: 75  PVEQGFAVRVTDGDDVRTRWLDARGFPDATFAGRPPAATVDYADPQEPVQVRLDACSPFI 134

Query: 212 PHNYKESSYPVSVFTYTIYNSGKTSADITLLFTWTNSV 249
           P    +SSYP     YT  N+G  + D+ +     N V
Sbjct: 135 PTEIDDSSYPAVFLDYTATNTGPATVDVAIAGFLANPV 172


>gi|332665583|ref|YP_004448371.1| hypothetical protein [Haliscomenobacter hydrossis DSM 1100]
 gi|332334397|gb|AEE51498.1| protein of unknown function DUF608 [Haliscomenobacter hydrossis DSM
           1100]
          Length = 857

 Score = 67.8 bits (164), Expect = 3e-08,   Method: Compositional matrix adjust.
 Identities = 80/312 (25%), Positives = 129/312 (41%), Gaps = 53/312 (16%)

Query: 632 VLQVYRDVVATGDKKFAKAVWPSVYVAMAYM---DQFDRDGDGMIENDGFPDQTYDTWSV 688
           V++ YR+ + +GD ++ K  W  V  AM+Y      +D + DG+ E  G    T D    
Sbjct: 491 VMRFYREWILSGDNEWLKKHWARVKAAMSYAWVPGGWDANQDGVQE--GKQHNTMDV-DY 547

Query: 689 SGISAYSGGLWVAALQAASALAREVGD----RGSEDYFLFKFQKAKVVYEKLWNGSYFNY 744
            G +      +  AL+A++A+AR + D    +  E       + +K V E L+NG Y+ +
Sbjct: 548 YGPNPQMQFWYFGALKASAAMARAMNDPVFAKQCEQVLA---KGSKWVDENLFNGEYYEH 604

Query: 745 -----------DNSGSSQS-------SSIQADQLAGQWYARACGLLPIVDEDKARSALKK 786
                      D S  S +            DQ+ GQ+ A  CGL  +  ++  ++AL+ 
Sbjct: 605 KITDPKTFEFLDRSNPSTAIPNFQLGEGCLLDQVVGQYMAHVCGLGYLAKKENIQTALQT 664

Query: 787 VYNYNVLK------------VMGGKRGAVNGMLPDGRVDMSSMQSREIWSGVTYAVAASM 834
           V  YN +             VM  + G +    P GR+ +      E  SG  YA A  M
Sbjct: 665 VMKYNFVPRFDNVFNNMRSYVMDKESGLIMASWPKGRLKVPFPYFAESMSGFEYAAAIGM 724

Query: 835 IHEDLADIGFQTACGIYEAAWSGTGLGYAFQTPEAWNTDDQYRSLCYMRPLAIWAMQWAL 894
           ++E   + G Q    I +  + G      F  PE  +         Y R +A W+   AL
Sbjct: 725 LYEGQTEAGLQCIQSIRD-RFDGEKRN-PFDEPECGHH--------YARAMASWSAVLAL 774

Query: 895 TRPKPKTLEKQM 906
           +  +   +EK M
Sbjct: 775 SGFQYSGVEKSM 786



 Score = 47.8 bits (112), Expect = 0.032,   Method: Compositional matrix adjust.
 Identities = 65/317 (20%), Positives = 124/317 (39%), Gaps = 45/317 (14%)

Query: 87  IKRHLTSSH----GVPLGGVGSGSIGRSYRGEFQRWQIFPRVCEDKPVL--ANQ---FSV 137
           + +H    H     +PLGG+G+G++    RGE + W+I  +  +    +   NQ   F+V
Sbjct: 25  VLKHYDQDHLLNIALPLGGIGTGTVSLGGRGELRDWEIMNKPGKKFSTITDGNQAPFFAV 84

Query: 138 FVSRSNGQKYS-SVLCPKTPEVLKDTTAAGIGSWDWNLKG----DKSTYHALYPRAWTVH 192
           +V        S ++L P  P   +      +     N  G      +++ A YP    V+
Sbjct: 85  YVKPIGKTAQSKALLGPFHPSEYQHYEGRPV-----NHHGMPRFSVASFDAAYPFG-QVN 138

Query: 193 EGEPDPELRIVCRQISPIIPHNYKESSYPVSVFTYTIYNSGKTSADITLLFTWTNSVG-- 250
             +    + +  +  +P+IP +   SS P+++  Y + N+     ++ +  T  N +G  
Sbjct: 139 LSDDVLPISVKIKGFNPLIPGDVAASSIPLAILKYEVTNNTDAPLEVAICGTMRNFIGID 198

Query: 251 ---------GDSEFTGQHYN-SKTKMNDGVHAVLLHHRTSHQLPPV--TFALAAQETDGV 298
                    GD  + G   N ++ + +  +  + +  +      P   T AL+  E + V
Sbjct: 199 GSKQSKNWKGDLVYLGAKNNQNEYRESPDLKGIFMSSQGVDNQDPAWGTIALSTAERELV 258

Query: 299 HVSLCPHFVISGNSLGLTAKDMWHEIKEHGSFDRLNSMETSVTSEPGSSIGAAIAASVTV 358
                     + N    +  D W +    G    L+  +  V   P     A++A   T+
Sbjct: 259 SFRRSS----TPNDWENSILDFWDDFSADGV---LSDKDKLVDDNP----MASLAVKKTI 307

Query: 359 PPDSEGQVTFSLAWDCP 375
           P  S    TF ++W  P
Sbjct: 308 PARSSLTFTFYISWHFP 324


>gi|374373788|ref|ZP_09631448.1| hypothetical protein NiasoDRAFT_2604 [Niabella soli DSM 19437]
 gi|373234761|gb|EHP54554.1| hypothetical protein NiasoDRAFT_2604 [Niabella soli DSM 19437]
          Length = 897

 Score = 67.4 bits (163), Expect = 4e-08,   Method: Compositional matrix adjust.
 Identities = 52/176 (29%), Positives = 86/176 (48%), Gaps = 16/176 (9%)

Query: 632 VLQVYRDVVATGDKKFAKAVWPSVYVAMAYMDQFDRDGDGMIENDGFPDQTYDTWSVSGI 691
           VL  YR+   + D  F K  W  +  A  ++   DR+ DGM E+    +     W   G 
Sbjct: 504 VLGCYREHQMSADDGFLKRNWDKIKRATQFIINQDRNKDGM-EDTPLENTLDAIWD--GE 560

Query: 692 SAYSGGLWVAALQAASALAREVGDRG----SEDYFLFKFQKAKVVYEKLWNGSYFNY--- 744
            A+  GL +AA++A  A+A E+ D       +DY     + ++ + ++L+NG YF +   
Sbjct: 561 IAWLVGLCIAAVKAGQAMAEEMNDTAFAAVCKDYVQ---KGSRHMSDQLFNGEYFIHRPD 617

Query: 745 ---DNSGSSQSSSIQADQLAGQWYARACGLLPIVDEDKARSALKKVYNYNVLKVMG 797
                 G    ++   DQ+ GQ +A   GL  ++D+ +  SALK ++ YN +K MG
Sbjct: 618 PVKGRKGLGSYNTCHIDQVLGQSWAFQVGLGRVIDQQQTLSALKALWKYNFIKDMG 673



 Score = 50.1 bits (118), Expect = 0.005,   Method: Compositional matrix adjust.
 Identities = 83/376 (22%), Positives = 136/376 (36%), Gaps = 69/376 (18%)

Query: 96  GVPLGGVGSGSIGRSYRGEFQRWQIFPRVCED--------KPVLANQFSVFVSRSNGQKY 147
           G+P+GG+ +G +  S  G    W +F    E         K  +  +     SR  G   
Sbjct: 69  GMPVGGINAGGMYLSGDGRLWLWDVFNLNQEGINPVDIPWKTSVQGELRKVRSRDGG--- 125

Query: 148 SSVLCPKTPEVL--------------KDTTAAGIGSWDWNLKGDKSTYHALYPRAWTVHE 193
           S +  PK  E+               K+ T   +   DW    D+  + A YP A T+  
Sbjct: 126 SYIAPPKAKELRPLDQGFALKLEYEGKEQTRQ-LREEDW----DEVRFEATYPVA-TITY 179

Query: 194 GEPDPELRIVCRQISPIIPHNYKESSYPVSVFTYTIYNSGKTSADITLLFTWTNSVGGDS 253
              D  ++ V +  SP IP +   S  PVSVFT TI N+GK     TL     N     +
Sbjct: 180 SSKDIPVQAVLQAYSPFIPLDTDRSGLPVSVFTITIKNAGKHPVKATLAGWLENKAALHT 239

Query: 254 EFTGQHYNSKTKMND----GVHAVLLHHRTSHQLPPVTF-----ALAAQETDGVHVSLCP 304
               +     T   +    GV++ L      +Q    +F      LAA + D V ++   
Sbjct: 240 AAVNRLGRKNTTFKEEQITGVYSTLEKTAFGNQQADKSFDYGTLCLAALDRDAVAIA--- 296

Query: 305 HFVISGNSLGLTAKDMWHEIKEHGSFDRLNSMETSVTSEPGSSIGAAIAASVTVPPDSEG 364
                         D+   I     F     ++ +V +   +  G+ I  +VT   +   
Sbjct: 297 --------------DI-EAIDNARLFTSTTKVDQAVKAAQQTLTGSVI-TTVTAAANESA 340

Query: 365 QVTFSLAWDCPEVNFMSGKTYYRR-----YTKFYGTHQNAAANIARDAILEHGSWELQIE 419
            V++ ++W  P +    GK    +     Y   +   +  A+ +A +     G+  L  +
Sbjct: 341 TVSYIISWHTPNLMIGDGKVLPEKDRGQYYDNQFKNAKEVASYVAANFKELSGTTLLWRD 400

Query: 420 AWQRPILEDKRLPEWY 435
            W      D  LP W+
Sbjct: 401 TWY-----DATLPYWF 411


>gi|423226023|ref|ZP_17212489.1| hypothetical protein HMPREF1062_04675 [Bacteroides cellulosilyticus
           CL02T12C19]
 gi|392630880|gb|EIY24861.1| hypothetical protein HMPREF1062_04675 [Bacteroides cellulosilyticus
           CL02T12C19]
          Length = 899

 Score = 67.4 bits (163), Expect = 4e-08,   Method: Compositional matrix adjust.
 Identities = 92/397 (23%), Positives = 161/397 (40%), Gaps = 78/397 (19%)

Query: 96  GVPLGGVGSGSIGRSYRGEFQRWQIFPRVCEDKPVLANQFSVFVSRSNGQKYSSVLCPKT 155
           GVPLGG+G G+I  +  G+F R  +       K    + FS++  + N ++  +V   + 
Sbjct: 64  GVPLGGIGVGNINFAPSGKFTRIGMNNIHTPIKRSEHSFFSLWTRKGNEKE--AVRLVRD 121

Query: 156 PEVLKDTTAAGIGSWDWNLKGDKST-YHALYPRAWTVHEGEPDPELRI--VCRQISPIIP 212
             VL            + +KG + T Y  L+P   T      D  L++  V R  S ++P
Sbjct: 122 NHVL------------YGMKGVEHTQYKGLFP---TAELSYADNNLKVTPVIRAYSGLVP 166

Query: 213 HNYKESSYPVSVFTYTIYNSGKTSADITLLFTWTNSVG--GDSEF-----TGQHYNS-KT 264
           HN K+SS PV  F   +    +   +  L F+W + +G   D +       GQ  +  + 
Sbjct: 167 HNVKDSSLPVVWFEVDLI--AQEDMEAALAFSWEDFIGLFNDPKSLEGFDNGQLLSEGRA 224

Query: 265 KMNDG--------------------VHAVLLHHRTSHQLPPVTF-------ALAAQETDG 297
            +N+G                    +  ++ +   S Q   +TF        +A +E   
Sbjct: 225 NINNGENWPLREKAKTYVEPYQMGSLKGLIQYAADSLQPRKLTFQNYVNQVVIAVEEEK- 283

Query: 298 VHVSLCPHFVISGNSLGLTAKDMWHEIKEHGSFDRLNSMETSVTSEPGSSIGAAIAASVT 357
            +VS  P +         +  + W + + +G F   +S+  +V +E   +  A++ A  T
Sbjct: 284 -NVSYLPAY--------RSNSEAWEQFRNNGEFT--SSLTKNVLTEQSQTSSASVLAVKT 332

Query: 358 -VPPDSEGQVTFSLAWDCPEVNFMSGK----TYY---RRYTKFYGTHQNAAANIARDAIL 409
            +    +  + F LAW  PE+   +      +Y+     Y K+Y  + N   ++   A+ 
Sbjct: 333 QLKAGQKKTIRFMLAWYAPELQIDAAALPIGSYWPCGADYNKYYHNYFNNMNSMVSYAVS 392

Query: 410 EHGSWELQIEAWQRPILEDKRLPEWYPITLFNELYYL 446
                  Q   WQ P+LE   LP+WY   L N  Y +
Sbjct: 393 NRARIARQTTEWQIPVLESS-LPDWYKFKLINSGYVI 428



 Score = 59.3 bits (142), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 44/157 (28%), Positives = 78/157 (49%), Gaps = 8/157 (5%)

Query: 632 VLQVYRDVVATGDKKFAKAVWPSVYVAMAYMDQFDRDGDGMIENDGFPDQTYDTWSVSGI 691
           ++Q+ +D   TGD ++ KA    +  AM ++  + R   G     G    TYD ++   +
Sbjct: 522 IIQLVKDYEQTGDTEYLKAHLTGLKRAMKFL--YSRMPQGSTIPVG--PTTYDDFTHPPL 577

Query: 692 SAYSGGLWVAALQAASALAREVGDRGSEDYFLFKFQKA-KVVYEKLWNGSYFNY--DNSG 748
            +Y  G+W+  L+A  A+ + +GD         +F  + K   EKLWNG +F Y  +  G
Sbjct: 578 YSYYAGVWLTTLKAYEAIGKAIGDESIVKQAQQQFATSQKEALEKLWNGRFFAYGCEPDG 637

Query: 749 SSQSSSIQ-ADQLAGQWYARACGLLPIVDEDKARSAL 784
           S +  ++    QLAGQ+ +R CG   +   D  ++++
Sbjct: 638 SKRLDNVLFTGQLAGQFLSRYCGWGDVYPMDIVKASM 674


>gi|116624646|ref|YP_826802.1| hypothetical protein Acid_5570 [Candidatus Solibacter usitatus
           Ellin6076]
 gi|116227808|gb|ABJ86517.1| protein of unknown function DUF608 [Candidatus Solibacter usitatus
           Ellin6076]
          Length = 844

 Score = 67.0 bits (162), Expect = 5e-08,   Method: Compositional matrix adjust.
 Identities = 52/192 (27%), Positives = 83/192 (43%), Gaps = 30/192 (15%)

Query: 632 VLQVYRDVVATGDKKFAKAVWPSVYVAMAYM---DQFDRDGDGMIENDGFPDQTYDTWSV 688
           ++  Y D   +GD  + + +WP V  A+ +      +D D DG++E  G    TYD    
Sbjct: 480 IMHAYLDWSLSGDDAWLREMWPRVKKAIEFAWIPGGWDGDRDGVLE--GAQHNTYDV-EF 536

Query: 689 SGISAYSGGLWVAALQAASALAREVGDRGSEDYFLFKFQKAKV-VYEKLWNGSYFNYDNS 747
            G +   G  ++ AL+A   +AR VGD  S   +   F+  +  +  KL+NG Y+     
Sbjct: 537 YGPNPMCGIYYLGALRAGEEMARAVGDTASAAEYRRLFESGRQWIDAKLFNGEYYIQQIR 596

Query: 748 GSSQSSSIQA-----------------------DQLAGQWYARACGLLPIVDEDKARSAL 784
           G ++    +A                       DQL GQ+ A   GL P+V  +  R  L
Sbjct: 597 GVAKDKVAKAIIGTMGSDDTENPQYQVGAGCLVDQLVGQYMAEVAGLGPLVSPENIRKTL 656

Query: 785 KKVYNYNVLKVM 796
           + +Y YN  + M
Sbjct: 657 QSIYRYNYKRSM 668



 Score = 58.2 bits (139), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 55/226 (24%), Positives = 99/226 (43%), Gaps = 22/226 (9%)

Query: 86  FIKRHLTSSHGVPLGGVGSGSIGRSYRGEFQRWQIFPRVCEDKPVLANQFSVFVSRSNGQ 145
           F  R LT+    PLGGV +GSI    RG+ + W+IF R  +         +++V     +
Sbjct: 36  FSGRKLTAI-AFPLGGVCAGSISLGGRGQLRDWEIFNRSDKGNAPSYAFPAIWVQSGARK 94

Query: 146 KYSSVLCPKTPEVLKDTTAAGIGSWDWNLKGDKSTYHALYPRAWTVHEGEPDPELRIVCR 205
             + VL  +     +     G  S    ++   +T+   +P A         P +++   
Sbjct: 95  PVARVLESRIQPPYEGQNGLGTRSAPGLVRLAGATFTGEFPLARIDFTDAALP-VKVSLE 153

Query: 206 QISPIIPHNYKESSYPVSVFTYTIYNSGKTSADITLLFTWTNSVGGDSEFTGQHYNSKTK 265
             +P IPH   +S  PV+V  Y + N G  +A +++ ++  + V       G+   + T+
Sbjct: 154 AFTPFIPHEPDDSGLPVAVLRYRVSNPGAAAAKVSIAWSIDSPV-------GRTQAADTR 206

Query: 266 MND-----GVHAVLLHHRTSHQLPPV-----TFALAAQETDGVHVS 301
            N+     G+  +++   T+ +LP       +FALAA  T G  V+
Sbjct: 207 TNEYREAGGLAGLMM---TNAELPAADALKGSFALAAMGTAGARVT 249


>gi|405382777|ref|ZP_11036555.1| putative bile acid beta-glucosidase [Rhizobium sp. CF142]
 gi|397320811|gb|EJJ25241.1| putative bile acid beta-glucosidase [Rhizobium sp. CF142]
          Length = 863

 Score = 67.0 bits (162), Expect = 5e-08,   Method: Compositional matrix adjust.
 Identities = 65/274 (23%), Positives = 109/274 (39%), Gaps = 59/274 (21%)

Query: 632 VLQVYRDVVATGDKKFAKAVWPSVYVAMAYM------DQFDRDGDGMIENDGFPDQTYDT 685
           +++ YRD    GD  + +  WP+V  A+ Y       D++D D  G++   G   QT D 
Sbjct: 475 IIKTYRDWKLCGDTDWLRRYWPNVKRAIEYAWSSENPDRWDPDQTGILS--GRQHQTLDM 532

Query: 686 WSVSGISAYSGGLWVAALQAASALAREVGDRG-SEDYFLFKFQKAKVVYEKLWNGSYF-- 742
             + G +++   ++VAAL   S +A  +GD   SE         A  +  +L+NG +F  
Sbjct: 533 -ELFGPNSWLASMYVAALLGVSEMAAVLGDTELSEKTARMGKAGAAYINSELFNGRWFIQ 591

Query: 743 ---------------------------------NYDNSGSSQSSSIQADQLAGQWYARAC 769
                                             +            +DQ+ GQW+A   
Sbjct: 592 KVDLSDKGVLIPFDVGRAAGVLADGFMETYWSEEFQELKYQMGEGCISDQILGQWHAEVA 651

Query: 770 GLLPIVDEDKARSALKKVYNYNVLKVM------------GGKRGAVNGMLPDG-RVDM-S 815
           G+   +D DK R+ALK V+  N    +              + G +    P+G R  M +
Sbjct: 652 GIGAFLDADKVRTALKSVHRNNFRPTLEDHFNPCRNYAYEDEAGLLIATYPEGIRQPMVA 711

Query: 816 SMQSREIWSGVTYAVAASMIHEDLADIGFQTACG 849
           +  + E+W+G+ Y  A+ +I   L + G    CG
Sbjct: 712 APYAEEVWTGIEYMAASHLIMHGLIEEGMDIVCG 745


>gi|253761387|ref|XP_002489099.1| hypothetical protein SORBIDRAFT_0073s002040 [Sorghum bicolor]
 gi|241947398|gb|EES20543.1| hypothetical protein SORBIDRAFT_0073s002040 [Sorghum bicolor]
          Length = 175

 Score = 67.0 bits (162), Expect = 5e-08,   Method: Compositional matrix adjust.
 Identities = 31/79 (39%), Positives = 50/79 (63%), Gaps = 2/79 (2%)

Query: 27  DPAKPASLTWQRKLSTGEIPLSQFTLNWKETVQLAPIGVRILCLIREEAAKGKRAFIDPF 86
           D   P    W+RKLS+    L +F + ++E +++  +G+R+   IREEA+ G++A IDPF
Sbjct: 31  DDGSPPEHAWRRKLSSHANRLKEFNVTFREAIRMMKLGLRLWSYIREEASHGRKAPIDPF 90

Query: 87  IKR--HLTSSHGVPLGGVG 103
            +     ++S GVPLGG+G
Sbjct: 91  TREGNRPSASQGVPLGGMG 109



 Score = 57.8 bits (138), Expect = 3e-05,   Method: Compositional matrix adjust.
 Identities = 23/29 (79%), Positives = 27/29 (93%)

Query: 166 GIGSWDWNLKGDKSTYHALYPRAWTVHEG 194
           GI SWDW LKGD+STYHAL+PRAWTV++G
Sbjct: 115 GISSWDWKLKGDRSTYHALFPRAWTVYDG 143


>gi|423217835|ref|ZP_17204331.1| hypothetical protein HMPREF1061_01104 [Bacteroides caccae
           CL03T12C61]
 gi|392627338|gb|EIY21373.1| hypothetical protein HMPREF1061_01104 [Bacteroides caccae
           CL03T12C61]
          Length = 851

 Score = 65.9 bits (159), Expect = 1e-07,   Method: Compositional matrix adjust.
 Identities = 57/244 (23%), Positives = 106/244 (43%), Gaps = 37/244 (15%)

Query: 632 VLQVYRDVVATGDKKFAKAVWPSVYVAMAYM---DQFDRDGDGMIENDGFPDQTYDTWSV 688
           ++++YR+   +GD  F K  W  V   ++Y      +D + DG++E  G    T D  + 
Sbjct: 488 IMKIYREWQLSGDNDFLKNNWEQVKKVLSYAWIEKGWDGNQDGVME--GSQHNTMDV-NY 544

Query: 689 SGISAYSGGLWVAALQAASALAREVGDRGSEDYFLFKFQK-AKVVYEKLWNGSYFNYDNS 747
            G +   G  ++ AL+AA  ++  + D+         F+K ++ + E L+NG Y+ +  +
Sbjct: 545 FGPNPQMGFWYMGALKAAEKMSIAMKDKNFAKKCRTLFEKGSEWMDENLFNGEYYEHKIT 604

Query: 748 ------------------GSSQSSSIQADQLAGQWYARACGLLPIVDEDKARSALKKVYN 789
                             G         DQL GQ+ A  CGL  + D+   ++ +K +  
Sbjct: 605 DPKTFEFLDMNDPDVKIPGFQLGQGCLVDQLVGQYMAHLCGLGYLGDKKNIQTTMKSIMK 664

Query: 790 YNVLK------------VMGGKRGAVNGMLPDGRVDMSSMQSREIWSGVTYAVAASMIHE 837
           YN ++            VMG + G +    P GR+++      E+ +G  Y  A  M++E
Sbjct: 665 YNFVEDFSRHFNNMRSYVMGDEAGLLMASWPKGRLEVPFPYFSEVMTGFEYCAAVGMLYE 724

Query: 838 DLAD 841
            + +
Sbjct: 725 GMEE 728



 Score = 57.4 bits (137), Expect = 4e-05,   Method: Compositional matrix adjust.
 Identities = 75/373 (20%), Positives = 145/373 (38%), Gaps = 49/373 (13%)

Query: 60  LAPIGVRILCLIREEAAKGKRA---FIDPFIKRHLTSSHGVPLGGVGSGSIGRSYRGEFQ 116
           L  IG  +  +I  +A + K      +  + K HL     +PLGG+G+G++    RGE +
Sbjct: 6   LIAIGALLSGIITVQAQEWKPQQWPVLKKYDKEHLYQI-ALPLGGIGTGTVSLGGRGELR 64

Query: 117 RWQIFPRVCEDKPVLANQFSVFVSRSNGQKYSSVLCPKTPEVLKDTTAAGIGSWDW---- 172
            W+I         V A ++S   + +N   ++    P+  +      A  + S ++    
Sbjct: 65  DWEIM-------NVPAKKYSTVTTGNNAPFFAIYAKPQNQKATTTLLAGPLYSHEYLHYE 117

Query: 173 ----NLKG----DKSTYHALYPRAWTVHEGEPDPELRIVCRQISPIIPHNYKESSYPVSV 224
               N  G    +++++ A YP    V   + +  +++  +  +P IP +   S  PV+V
Sbjct: 118 GRPVNHHGLPRFEQASFEAAYPFG-QVSLSDSNLPVKVKVKGFNPFIPGDADASGLPVAV 176

Query: 225 FTYTIYNSGKTSADITLLFTWTNSVGGD-----SEFTGQHYNSKTKMN-------DGVHA 272
            +Y + N+     ++++  +  N +G D     +++ G +  +  K N       + +  
Sbjct: 177 LSYEVTNTTDQPMNVSICGSIRNFIGKDGSKFRTDWKGDYIPTGAKNNKNKYVESNNIKG 236

Query: 273 VLLHHRTSHQLPPV--TFALAAQETDGVHVSLCPHFVISGNSLGLTAKDMWHEIKEHGSF 330
           + L+     +      T AL  Q  +GV            NS+     + W +    G  
Sbjct: 237 IYLYSDGVDKDDSAWGTVALTTQAVNGVTYRTSSKADNWNNSI----LNFWDDFSADG-- 290

Query: 331 DRLNSMETSVTSEPGSSIGAAIAASVTVPPDSEGQVTFSLAWDCPEVNFMSGKTYYRRYT 390
                M T  + +      A+++ + T+ P S    TF + W  P     S       Y+
Sbjct: 291 -----MLTERSKQEDEDPMASLSVNKTIAPQSTETFTFYITWSFPNRKAWSSVVVGNYYS 345

Query: 391 KFYGTHQNAAANI 403
             Y     AA  I
Sbjct: 346 NQYPDAWKAAETI 358


>gi|153807893|ref|ZP_01960561.1| hypothetical protein BACCAC_02179 [Bacteroides caccae ATCC 43185]
 gi|149129502|gb|EDM20716.1| hypothetical protein BACCAC_02179 [Bacteroides caccae ATCC 43185]
          Length = 851

 Score = 65.9 bits (159), Expect = 1e-07,   Method: Compositional matrix adjust.
 Identities = 57/244 (23%), Positives = 106/244 (43%), Gaps = 37/244 (15%)

Query: 632 VLQVYRDVVATGDKKFAKAVWPSVYVAMAYM---DQFDRDGDGMIENDGFPDQTYDTWSV 688
           ++++YR+   +GD  F K  W  V   ++Y      +D + DG++E  G    T D  + 
Sbjct: 488 IMKIYREWQLSGDNDFLKNNWEQVKKVLSYAWIEKGWDGNQDGVME--GSQHNTMDV-NY 544

Query: 689 SGISAYSGGLWVAALQAASALAREVGDRGSEDYFLFKFQK-AKVVYEKLWNGSYFNYDNS 747
            G +   G  ++ AL+AA  ++  + D+         F+K ++ + E L+NG Y+ +  +
Sbjct: 545 FGPNPQMGFWYMGALKAAEKMSIAMKDKNFAKKCRTLFEKGSEWMDENLFNGEYYEHKIT 604

Query: 748 ------------------GSSQSSSIQADQLAGQWYARACGLLPIVDEDKARSALKKVYN 789
                             G         DQL GQ+ A  CGL  + D+   ++ +K +  
Sbjct: 605 DPKTFEFLDMNDPDVKIPGFQLGQGCLVDQLVGQYMAHLCGLGYLGDKKNIQTTMKSIMK 664

Query: 790 YNVLK------------VMGGKRGAVNGMLPDGRVDMSSMQSREIWSGVTYAVAASMIHE 837
           YN ++            VMG + G +    P GR+++      E+ +G  Y  A  M++E
Sbjct: 665 YNFVEDFSRHFNNMRSYVMGDEAGLLMASWPKGRLEVPFPYFSEVMTGFEYCAAVGMLYE 724

Query: 838 DLAD 841
            + +
Sbjct: 725 GMEE 728



 Score = 56.6 bits (135), Expect = 6e-05,   Method: Compositional matrix adjust.
 Identities = 63/319 (19%), Positives = 128/319 (40%), Gaps = 46/319 (14%)

Query: 83  IDPFIKRHLTSSHGVPLGGVGSGSIGRSYRGEFQRWQIFPRVCEDKPVLANQFSVFVSRS 142
           +  + K HL     +PLGG+G+G++    RGE + W+I         V A ++S   + +
Sbjct: 32  LKKYDKEHLYQI-ALPLGGIGTGTVSLGGRGELRDWEIM-------NVPAKKYSTVTTGN 83

Query: 143 NGQKYSSVLCPKTPEVLKDTTAAGIGSWDW--------NLKG----DKSTYHALYPRAWT 190
           N   ++    P+  +      A  + S ++        N  G    +++++ A YP    
Sbjct: 84  NAPFFAIYAKPQNQKATTTLLAGPLYSHEYLHYEGRPVNHHGLPRFEQASFEAAYPFG-Q 142

Query: 191 VHEGEPDPELRIVCRQISPIIPHNYKESSYPVSVFTYTIYNSGKTSADITLLFTWTNSVG 250
           V   + +  +++  +  +P IP +   S  PV+V +Y + N+     ++++  +  N +G
Sbjct: 143 VSLSDSNLPVKVKVKGFNPFIPGDADASGLPVAVLSYEVTNTTDQPMNVSICGSIRNFIG 202

Query: 251 GD-----SEFTGQHYNSKTKMN-------DGVHAVLLHHRTSHQLPPV--TFALAAQETD 296
            D     +++ G +  +  K N       + +  + L+     +      T AL  Q  +
Sbjct: 203 KDGSKFRTDWKGDYIPTGAKNNKNKYVESNNIKGIYLYSDGVDKDDSAWGTVALTTQAVN 262

Query: 297 GVHVSLCPHFVISGNSLGLTAKDMWHEIKEHGSFDRLNSMETSVTSEPGSSIGAAIAASV 356
           GV            NS+     + W +    G       M T  + +      A+++ + 
Sbjct: 263 GVTYRTSSKADNWNNSI----LNFWDDFSADG-------MLTERSKQEDEDPMASLSVNK 311

Query: 357 TVPPDSEGQVTFSLAWDCP 375
           T+ P S    TF + W  P
Sbjct: 312 TIAPQSTETFTFYITWSFP 330


>gi|333382634|ref|ZP_08474302.1| hypothetical protein HMPREF9455_02468 [Dysgonomonas gadei ATCC
           BAA-286]
 gi|332828576|gb|EGK01276.1| hypothetical protein HMPREF9455_02468 [Dysgonomonas gadei ATCC
           BAA-286]
          Length = 859

 Score = 65.9 bits (159), Expect = 1e-07,   Method: Compositional matrix adjust.
 Identities = 63/262 (24%), Positives = 108/262 (41%), Gaps = 53/262 (20%)

Query: 632 VLQVYRDVVATGDKKFAKAVWPSVYVAMAY-MDQFDRDGDGMIENDGFPDQTYDTWSVSG 690
           ++++YR+   +GD  + K ++P V  +M Y +  +D    G IE         + W   G
Sbjct: 495 IMKIYREWRISGDNNWLKEIYPFVKASMDYCIKTWDSREVGAIEEPHHNTYDIEFWGADG 554

Query: 691 ISAYSGGLWVAALQAASALAREVGDRGSEDYFLFKFQKAKVVYE-KLWNGSYF------- 742
           +       +  ALQA   +   +G+  S    L    K+K+  E KL++G YF       
Sbjct: 555 MIT---SFYTGALQAMCLMGEYLGEDISRYEAL--MDKSKIYMETKLFDGEYFIQNIQWK 609

Query: 743 --NYD--------NSGSSQSS---------------SIQADQLAGQWYARACGLLPIVDE 777
             N D        NSG S  +                  +D + G W +  CG+  I+D+
Sbjct: 610 GLNADDPTKVQAFNSGYSDEALKILEAEGPKYQYGKGCLSDGILGSWMSLVCGMPEIIDK 669

Query: 778 DKARSALKKVYNYNVLK-------------VMGGKRGAVNGMLPD-GRVDMSSMQSREIW 823
            K  S L  V+ YN+                +G + G +    P  G++ +  + S E+W
Sbjct: 670 TKVTSHLNAVHKYNLKHDLREHANPQRPTFALGNEGGLLLCSWPKGGKLQLPFVYSDEVW 729

Query: 824 SGVTYAVAASMIHEDLADIGFQ 845
           +G+ Y VAA ++ E   + G +
Sbjct: 730 TGIEYQVAAHLMFEGEVEKGLE 751


>gi|336426603|ref|ZP_08606612.1| hypothetical protein HMPREF0994_02618 [Lachnospiraceae bacterium
           3_1_57FAA_CT1]
 gi|336010619|gb|EGN40601.1| hypothetical protein HMPREF0994_02618 [Lachnospiraceae bacterium
           3_1_57FAA_CT1]
          Length = 850

 Score = 65.5 bits (158), Expect = 1e-07,   Method: Compositional matrix adjust.
 Identities = 69/257 (26%), Positives = 112/257 (43%), Gaps = 48/257 (18%)

Query: 632 VLQVYRDVVATGDKKFAKAVWPSVYVAMAYM------DQFDRDGDGMIENDGFPDQTYDT 685
           V+  YRD   +GD+++ +  W SV  A+AY       D +DR   G++  DG    T D 
Sbjct: 452 VMLTYRDYKISGDREWLEGKWESVKKAIAYAWSEENPDAWDRKKQGVL--DGRQHHTLDM 509

Query: 686 WSVSGISAYSGGLWVAALQAASALAREVGDRGSE-DYFLFKFQKAKVVYEKLWNGSYF-- 742
             + G +A+  G +V AL+AA+ +A  +GD  +E +Y          + ++L+NG YF  
Sbjct: 510 -ELFGPNAWLQGFYVGALEAAAEMAAILGDGEAEAEYRALAAGGKAYLNKELFNGEYFYQ 568

Query: 743 -----------------NYDNSGSSQ-----SSSIQADQLAGQWYARACGLLPIVDEDKA 780
                             Y N    Q     +     DQ+  QW+   CGL  I +++K 
Sbjct: 569 KIDVCDRSAVDRFNASKEYWNEEKGQIKYQIAEGCGIDQVLAQWHGNLCGLPEIFEKEKV 628

Query: 781 RSALKKVYNYNVLKVMGG------------KRGAVNGMLPDG--RVDMSSMQSREIWSGV 826
           R AL  ++ YN    +G             + GA+    P+G  +  +    ++E   G+
Sbjct: 629 RKALASIFRYNYKPALGDIYNPWRIFAIDEESGAIMCEWPEGTDKPAIPLTYAQECMDGM 688

Query: 827 TYAVAASMIHEDLADIG 843
            Y  AA MI E + + G
Sbjct: 689 QYQAAAHMIQEGMTEEG 705


>gi|160932020|ref|ZP_02079412.1| hypothetical protein CLOLEP_00853 [Clostridium leptum DSM 753]
 gi|156869062|gb|EDO62434.1| hypothetical protein CLOLEP_00853 [Clostridium leptum DSM 753]
          Length = 822

 Score = 65.5 bits (158), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 65/233 (27%), Positives = 112/233 (48%), Gaps = 23/233 (9%)

Query: 632 VLQVYRDVVATGDKKFAKAVWPSVYVAMAY-MDQFDRDGDGMIENDGFPDQTYDTWSVSG 690
           V+++YR+   +GD +  +  W  V  +M Y +  +D DGD +   D     TYDT  + G
Sbjct: 491 VVRLYREWRLSGDNRLIEDCWDGVVRSMEYAIRTWDTDGDFV--PDSAQHVTYDT-ELYG 547

Query: 691 ISAYSGGLWVAALQAASALAREVGDRGSEDYFLFKFQK-AKVVYEKLWNGSYF-----NY 744
           +++    L+ AAL AA+ +A  +GD+     +    +K AK++ EK WNG Y+     + 
Sbjct: 548 MTSMVSTLFFAALIAAAEMAEYMGDQERARRYRDGAEKGAKLMDEKCWNGEYYIQLIDDV 607

Query: 745 DNSGSSQSSSIQADQLAGQWYARACGLLPIVDEDKARSALKKVYNYNVLK---------- 794
           D           +DQL GQ+ A+  GL  ++ ++  + A + ++ YN ++          
Sbjct: 608 DKYRYQYGKGCLSDQLLGQFLAQEAGLGYVLPKNHVKKAAESIFRYNFIERASDFGHVQR 667

Query: 795 --VMGGKRGAVNGMLPDG-RVDMSSMQSREIWSGVTYAVAASMIHEDLADIGF 844
             +   + G      P G R     +   E+W+G+ Y VAA +I EDL + G 
Sbjct: 668 AYIFNDEMGLTPCTWPLGERPRFPFIYFGEVWTGIEYEVAALLIREDLLEEGL 720



 Score = 50.8 bits (120), Expect = 0.004,   Method: Compositional matrix adjust.
 Identities = 58/254 (22%), Positives = 97/254 (38%), Gaps = 28/254 (11%)

Query: 93  SSHGVPLGGVGSGSIGRSYRGEFQRWQIFPRVCEDKPVLANQFSVFVSRSNGQKYSSVLC 152
           S+    LGG+G+G++    RGE + WQIF    +   +    F++      G+  S +L 
Sbjct: 26  SAAAFLLGGIGTGNVSVGARGELRSWQIFNEPGQFNFLPYTFFALRAEVEGGEAVSKILE 85

Query: 153 PKT------PEVLKDTTAAGIGSWDWNLKGDKSTYHALYPRAWTVHEGEPDPELRIVCRQ 206
            K       PE        G+  +      +KST  A YP      E +  P +++    
Sbjct: 86  SKLNPPFTHPEGTLRCELGGLPRF------EKSTMSAEYPFVRVKLEDKQVP-VQVEMEA 138

Query: 207 ISPIIPHNYKESSYPVSVFTYTIYNSGKTSADITLLFTWTNSVGGDS-------EFTGQH 259
            +P IP N  +S  P +   Y + N       +++  +  N+VG +        +  G  
Sbjct: 139 FTPFIPLNADDSGIPGAYLNYRVKNPTNQPVTVSIAGSLANAVGFEKYDIWNHMKLNGNP 198

Query: 260 YNSKTKMNDGVHAVLLHHRT--SHQLPPVTFALAAQETDGVHVSLCPHFVISGNSLGLTA 317
            N   + N G+  +           +   T  LA  E D   V+  P +     + G  A
Sbjct: 199 VNEYQERN-GLKGLFFRGENVPEKHIANGTMCLATSEDD---VTAKPLWFQGEWTDG--A 252

Query: 318 KDMWHEIKEHGSFD 331
           +D W +    GS D
Sbjct: 253 QDFWDDFISDGSLD 266


>gi|160890984|ref|ZP_02071987.1| hypothetical protein BACUNI_03431 [Bacteroides uniformis ATCC 8492]
 gi|156859205|gb|EDO52636.1| hypothetical protein BACUNI_03431 [Bacteroides uniformis ATCC 8492]
          Length = 858

 Score = 65.5 bits (158), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 63/261 (24%), Positives = 104/261 (39%), Gaps = 51/261 (19%)

Query: 632 VLQVYRDVVATGDKKFAKAVWPSVYVAMAY-MDQFDRDGDGMIENDGFPDQTYDTWSVSG 690
           +++VYR+    GD  + K ++P+V  +M Y +  +D    G +E         + W  +G
Sbjct: 498 IMKVYREWRIMGDINWLKMMYPAVKKSMDYCIRTWDPREVGAVEEPHHNTYDIEFWGANG 557

Query: 691 ISAYSGGLWVAALQAASALAREVGDRGSEDYFLFKFQKAKVVYEKLWNGSYF--NYDNSG 748
           +       +V ALQA   + + VGD  S    L    KA  +  KL++G YF  N   +G
Sbjct: 558 MIT---SFYVGALQAICLMGKTVGDDISRYEELMNKSKA-YLESKLFDGEYFIQNIQWTG 613

Query: 749 SSQSSSIQA------------------------------DQLAGQWYARACGLLPIVDED 778
                 + A                              D + G W    CG+  +VD  
Sbjct: 614 LDAPDPVDAQSFVTHYTPEALKILQEEGPKYQYGKGCLSDGILGSWMTLVCGMPEVVDRL 673

Query: 779 KARSALKKVYNYNVLKVMGGK---------RGAVNGML-----PDGRVDMSSMQSREIWS 824
           K +S L  V+ YN  K +             G   G+L       G++ +  + S E+W+
Sbjct: 674 KVKSHLISVHKYNFKKSLSNHVNPQRSTFALGEDGGLLLCTWPKGGKLKLPFVYSNEVWT 733

Query: 825 GVTYAVAASMIHEDLADIGFQ 845
           G+ Y VAA ++ E   + G +
Sbjct: 734 GIEYQVAAHLMFEGEVEKGLE 754


>gi|270294306|ref|ZP_06200508.1| conserved hypothetical protein [Bacteroides sp. D20]
 gi|270275773|gb|EFA21633.1| conserved hypothetical protein [Bacteroides sp. D20]
          Length = 851

 Score = 65.1 bits (157), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 63/261 (24%), Positives = 104/261 (39%), Gaps = 51/261 (19%)

Query: 632 VLQVYRDVVATGDKKFAKAVWPSVYVAMAY-MDQFDRDGDGMIENDGFPDQTYDTWSVSG 690
           +++VYR+    GD  + K ++P+V  +M Y +  +D    G +E         + W  +G
Sbjct: 491 IMKVYREWRIMGDINWLKMMYPAVKKSMDYCIRTWDPREVGAVEEPHHNTYDIEFWGANG 550

Query: 691 ISAYSGGLWVAALQAASALAREVGDRGSEDYFLFKFQKAKVVYEKLWNGSYF--NYDNSG 748
           +       +V ALQA   + + VGD  S    L    KA  +  KL++G YF  N   +G
Sbjct: 551 MIT---SFYVGALQAICLMGKTVGDDISRYEELMNKSKA-YLESKLFDGEYFIQNIQWTG 606

Query: 749 SSQSSSIQA------------------------------DQLAGQWYARACGLLPIVDED 778
                 + A                              D + G W    CG+  +VD  
Sbjct: 607 LDAPDPVDAQSFVTHYTPEALKILQEEGPKYQYGKGCLSDGILGSWMTLVCGMPEVVDRL 666

Query: 779 KARSALKKVYNYNVLKVMGGK---------RGAVNGML-----PDGRVDMSSMQSREIWS 824
           K +S L  V+ YN  K +             G   G+L       G++ +  + S E+W+
Sbjct: 667 KVKSHLISVHKYNFKKSLSNHVNPQRSTFALGEDGGLLLCTWPKGGKLKLPFVYSNEVWT 726

Query: 825 GVTYAVAASMIHEDLADIGFQ 845
           G+ Y VAA ++ E   + G +
Sbjct: 727 GIEYQVAAHLMFEGEVEKGLE 747


>gi|317480931|ref|ZP_07940011.1| hypothetical protein HMPREF1007_03130 [Bacteroides sp. 4_1_36]
 gi|316902824|gb|EFV24698.1| hypothetical protein HMPREF1007_03130 [Bacteroides sp. 4_1_36]
          Length = 851

 Score = 65.1 bits (157), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 63/261 (24%), Positives = 104/261 (39%), Gaps = 51/261 (19%)

Query: 632 VLQVYRDVVATGDKKFAKAVWPSVYVAMAY-MDQFDRDGDGMIENDGFPDQTYDTWSVSG 690
           +++VYR+    GD  + K ++P+V  +M Y +  +D    G +E         + W  +G
Sbjct: 491 IMKVYREWRIMGDINWLKMMYPAVKKSMDYCIRTWDPREVGAVEEPHHNTYDIEFWGANG 550

Query: 691 ISAYSGGLWVAALQAASALAREVGDRGSEDYFLFKFQKAKVVYEKLWNGSYF--NYDNSG 748
           +       +V ALQA   + + VGD  S    L    KA  +  KL++G YF  N   +G
Sbjct: 551 MIT---SFYVGALQAICLMGKTVGDDISRYEELMNKSKA-YLESKLFDGEYFIQNIQWTG 606

Query: 749 SSQSSSIQA------------------------------DQLAGQWYARACGLLPIVDED 778
                 + A                              D + G W    CG+  +VD  
Sbjct: 607 LDAPDPVDAQSFVTHYTPEALKILQEEGPKYQYGKGCLSDGILGSWMTLVCGMPEVVDRL 666

Query: 779 KARSALKKVYNYNVLKVMGGK---------RGAVNGML-----PDGRVDMSSMQSREIWS 824
           K +S L  V+ YN  K +             G   G+L       G++ +  + S E+W+
Sbjct: 667 KVKSHLISVHKYNFKKSLSNHVNPQRSTFALGEDGGLLLCTWPKGGKLKLPFVYSNEVWT 726

Query: 825 GVTYAVAASMIHEDLADIGFQ 845
           G+ Y VAA ++ E   + G +
Sbjct: 727 GIEYQVAAHLMFEGEVEKGLE 747


>gi|284041275|ref|YP_003391205.1| hypothetical protein Slin_6449 [Spirosoma linguale DSM 74]
 gi|283820568|gb|ADB42406.1| protein of unknown function DUF608 [Spirosoma linguale DSM 74]
          Length = 857

 Score = 63.9 bits (154), Expect = 4e-07,   Method: Compositional matrix adjust.
 Identities = 62/252 (24%), Positives = 100/252 (39%), Gaps = 42/252 (16%)

Query: 632 VLQVYRDVVATGDKKFAKAVWPSVYVAMAYM---DQFDRDGDGMIENDGFPDQTYDTWSV 688
           +++ YRD   +GD  F +  WP V  A+A+      +D + DG++E  G    T D    
Sbjct: 484 IMKFYRDWQLSGDTDFLRRYWPQVKQALAFAWVPGGWDANRDGVME--GAQHNTMDV-EY 540

Query: 689 SGISAYSGGLWVAALQAASALAREVGDRGSEDYFLFKFQKAKVVYE-KLWNGSYFNY--- 744
            G +      ++ AL+AASA+A  + DR         FQK     +  L+NG Y+     
Sbjct: 541 FGPNPQMQLWYLGALKAASAMATAMNDRSFAQTCDQIFQKGSTWTDANLFNGDYYEQQVI 600

Query: 745 --------------------DNSGSSQSSSIQADQLAGQWYARACGLLPIVDEDKARSAL 784
                               ++     +     DQL GQ+ A  CGL  +V+ D  ++ L
Sbjct: 601 SPAGHLIAKGTFSGFNKVEQNDPAYQLAKGCLVDQLVGQFMAHVCGLGYLVNPDHIKTTL 660

Query: 785 KKVYNYNVLKVM------------GGKRGAVNGMLPDGRVDMSSMQSREIWSGVTYAVAA 832
           K +  YN    M            G +   +    P GR  +      E+ +G  Y  A 
Sbjct: 661 KSILKYNYRASMTDHFNNMRSYALGDEASLLMASWPKGRPKVPFPYFSEVMTGFEYTAAI 720

Query: 833 SMIHEDLADIGF 844
            M++E+  + G 
Sbjct: 721 GMLYENQTEPGL 732



 Score = 57.4 bits (137), Expect = 4e-05,   Method: Compositional matrix adjust.
 Identities = 75/347 (21%), Positives = 136/347 (39%), Gaps = 43/347 (12%)

Query: 85  PFIKR----HLTSSHGVPLGGVGSGSIGRSYRGEFQRWQIFPRVCEDKPVL-----ANQF 135
           P +KR    HLT    +PLGG+G+G++    RG+ Q W+I  R  +    +     A  F
Sbjct: 29  PVLKRYDQQHLTRI-ALPLGGIGTGTVSLGGRGQLQDWEIMNRPAKGYSTVLTGNTAPFF 87

Query: 136 SVFVSRSNGQKYSSVLCPKTPEVLKDTTAAGIGSWDWNLKGDKSTYHALYPRAWTVHEGE 195
           ++ V  +  +K  ++L P      +      +         + S +   YP    V+  +
Sbjct: 88  AIHVLET--KKTKALLGPLESHEYQHMEGRPVDHHGLPRFANAS-FETAYPFG-QVNLSD 143

Query: 196 PDPELRIVCRQISPIIPHNYKESSYPVSVFTYTIYNSGKTSADITLLFTWTNSVG----- 250
               + +  +  +P++P +   S  P++V TY + N+G     +++  +  N +G     
Sbjct: 144 ATMPVSVRIKGFNPLVPTDADASGLPIAVLTYEVTNTGNQPITVSVCGSMHNFIGRDGSK 203

Query: 251 ------GDSEFTGQHYN-SKTKMNDGVHAVLLHHRTSHQLPPV--TFALAAQETDGVHVS 301
                 G+ E  G   N ++ +  +G+  + ++     +      T AL   E+ GV   
Sbjct: 204 NHKSWKGELEPQGAKQNRNQFRQENGLTGIYMYSDGVDKADAAWGTVALTTAESQGVSYR 263

Query: 302 LCPHFVISGNSLGLTAKDMWHEIKEHGSFDRLNSMETSVTSEPGSSIGAAIAASVTVPPD 361
                    N+L     D W +  + G   +L    T    +P     A++A   T+ P+
Sbjct: 264 TSSVTNAWENAL----LDFWDDFSDDG---KLTEKTTPADEDP----MASLAVQKTIGPN 312

Query: 362 SEGQVTFSLAWDCPEVNFMSGKTYYRRYTKFYGTHQNAAANIARDAI 408
                TF L W  P   F   K    R   +Y T  N A  +A   +
Sbjct: 313 QTQAFTFFLTWHFPN-RFGWSK---ERVGNYYTTQYNDAWAVADKTV 355


>gi|284997210|ref|YP_003418977.1| hypothetical protein [Sulfolobus islandicus L.D.8.5]
 gi|284445105|gb|ADB86607.1| hypothetical protein LD85_0902 [Sulfolobus islandicus L.D.8.5]
          Length = 498

 Score = 63.9 bits (154), Expect = 4e-07,   Method: Compositional matrix adjust.
 Identities = 44/164 (26%), Positives = 75/164 (45%), Gaps = 23/164 (14%)

Query: 551 YSSFALIMLFPKIQLSIQRDFAAAVMMHDPSKMKLLDDGQWVSRKVLGAVPHDIGICDPW 610
           + S  +I++FP+++ S  +   + V                   +  G VPHD+G     
Sbjct: 350 FGSLPVILMFPELEKSFLKLLISYV-------------------RSDGYVPHDLGFHSLD 390

Query: 611 FEVNAYCLYDTARWKDLNPKFVLQVYRDVVATGDKKFAKAVWPSVYVAMAYMDQFDRDGD 670
             ++        +WKD+NP  +L VYR    T D  F K V+P++   M +  +  RDG 
Sbjct: 391 SPIDGTT--SPPKWKDMNPSLILLVYRYFKFTNDIDFLKEVYPTIVKVMDWELRQCRDGL 448

Query: 671 GMIENDGFPDQTYDTWSVSGISAYSGGLWVAALQAASALAREVG 714
             +E  G  D  +D   + G  +Y+  L++A+L A   +A+ VG
Sbjct: 449 PFME--GEMDNAFDATIIKGHDSYTSSLFIASLIAMREIAKLVG 490


>gi|326798039|ref|YP_004315858.1| hypothetical protein [Sphingobacterium sp. 21]
 gi|326548803|gb|ADZ77188.1| hypothetical protein Sph21_0609 [Sphingobacterium sp. 21]
          Length = 877

 Score = 62.4 bits (150), Expect = 1e-06,   Method: Compositional matrix adjust.
 Identities = 61/266 (22%), Positives = 111/266 (41%), Gaps = 53/266 (19%)

Query: 632 VLQVYRDVVATGDKKFAKAVWPSVYVAMAYMDQ-FDRDGDGMIENDGFPDQTYDTWSVSG 690
           +++VYRD   +GD ++ K ++P V  ++ Y  Q +D +G G++E         + W  +G
Sbjct: 517 IMKVYRDWRISGDDEWMKKLYPMVQKSLDYCIQTWDPNGKGILEEPHHNTYDIEFWGPNG 576

Query: 691 ISAYSGGLWVAALQAASALAREVGDRGSEDYFLFKFQKAKVVYEKLWNGSYF-------- 742
           +       ++ AL+A   + R           L+   +A  + ++L+NG YF        
Sbjct: 577 MCT---SFYLGALKALIEMGRYQRKNMKRYETLYDSGRA-YMEDQLYNGEYFIQKIQWTG 632

Query: 743 -NYDNSGSSQSSSIQ-----------------------ADQLAGQWYARACGLLPIVDED 778
               +   +QS   Q                       +D + G W A  CGL   +D++
Sbjct: 633 LKAPDPTKAQSFHTQYTPEAIEILKKEGPKYQYGTGCLSDGVIGAWMAAMCGLENPLDQE 692

Query: 779 KARSALKKVYNYNVLK-------------VMGGKRGAVNGMLPD-GRVDMSSMQSREIWS 824
           K  S L+ V+ YN+                MG + G +    P  G + +  + S E+W+
Sbjct: 693 KVNSHLRAVHQYNLKNDLTDHINPQRPTYAMGHEGGLLLCTWPKGGMLSLPFVYSNEVWT 752

Query: 825 GVTYAVAASMIHEDLADIGFQ--TAC 848
           G+ Y VA+ ++     D G +  +AC
Sbjct: 753 GIEYQVASHLMQIGETDKGLEIISAC 778


>gi|402589209|gb|EJW83141.1| hypothetical protein WUBG_05945 [Wuchereria bancrofti]
          Length = 90

 Score = 62.4 bits (150), Expect = 1e-06,   Method: Compositional matrix adjust.
 Identities = 37/88 (42%), Positives = 51/88 (57%), Gaps = 12/88 (13%)

Query: 745 DNSGSSQSSSIQADQLAGQW--YARACGLLPIVDEDKARSALKKVYNYNVLKVMGGKRGA 802
           +NSG+       ADQ    W   +RA  LLP+      +SAL ++Y+ N+L   GG+ GA
Sbjct: 3   ENSGT-------ADQTYDAWKLLSRARALLPL---SHIKSALNQIYSSNILNFAGGRLGA 52

Query: 803 VNGMLPDGRVDMSSMQSREIWSGVTYAV 830
           VNGM  DG V    +Q+ E+W GVTYA+
Sbjct: 53  VNGMRKDGTVGRRHLQADEMWVGVTYAL 80


>gi|410638454|ref|ZP_11349015.1| hypothetical protein GLIP_3609 [Glaciecola lipolytica E3]
 gi|410141863|dbj|GAC16220.1| hypothetical protein GLIP_3609 [Glaciecola lipolytica E3]
          Length = 1124

 Score = 62.4 bits (150), Expect = 1e-06,   Method: Compositional matrix adjust.
 Identities = 62/287 (21%), Positives = 127/287 (44%), Gaps = 27/287 (9%)

Query: 632  VLQVYRDVVATGDKKFAKAVWPSVYVAMAY-MDQFDRDGDGMIENDGFPDQTYDTWSVSG 690
            +L  YR+   + +  + K+ WP V  A+ + ++++D +  G + N      T D  +++G
Sbjct: 726  ILNTYREHQCSANDSWLKSQWPRVKKAVEWGINEWDPNRIGFMSNT--QHNTLDG-NITG 782

Query: 691  ISAYSGGLWVAALQAASALAREVGD-RGSEDYFLFKFQKAKVVYEKLWNGSYFNYDNSGS 749
             S++ G L++ +L+AA+ +A  VG+ + +++Y   +    K+  ++LWNG Y+  +    
Sbjct: 783  CSSWIGSLYLTSLEAAARMADIVGEPQTAKEYRAIRESGKKLQNKRLWNGEYYIQEEGKE 842

Query: 750  SQSSSIQA---DQLAGQWYARACGLLPIVDEDKARSALKKVYNYNVLKVMGGKR------ 800
                 +     DQL G+W+A    +      ++A+ A+  +  YN      G+       
Sbjct: 843  RFQDYLDGCHIDQLLGEWWADQLNIDRNYPRERAQQAMSALLKYNFRADFYGQSLKPRQY 902

Query: 801  -----GAVNGML-PDGRVDMSSMQ-SREIWSGVTYAVAASMIHEDLADIG---FQTACGI 850
                 G +  +  P G+  +  M+   E+ +G  Y  A +M+   + D G    +     
Sbjct: 903  CEIEDGGMKMITWPRGQQPIPGMKYGDEVMTGFEYGAAVTMMQNGMIDEGLMLLKVVADR 962

Query: 851  YEAAWSGTGLGYAFQTPEAWNTDDQYRSLC---YMRPLAIWAMQWAL 894
            Y+      G+      P  ++ +      C   Y R L++W+   AL
Sbjct: 963  YDGRLRTEGVSNVVNGPWGYSGNPYGDDECGKYYGRSLSVWSALTAL 1009



 Score = 49.3 bits (116), Expect = 0.011,   Method: Compositional matrix adjust.
 Identities = 35/171 (20%), Positives = 69/171 (40%), Gaps = 14/171 (8%)

Query: 202 IVCRQISPIIPHNYKESSYPVSVFTYTIYNSGKTSADITLLFTWTNSVGGDSEFTGQHYN 261
           +     +P IP + + SS P +++  T+ N+       ++L +  N++G      G+  +
Sbjct: 409 VALEVFNPFIPMDLQNSSIPCAIYQVTVKNTSTDEVTTSVLSSQKNALGYTEGEKGEFGS 468

Query: 262 SKTKMNDGVHAVLLHHRTSHQLPPVTFALAAQETDGVHVSLCPHFVISGNSLGLTAKDMW 321
           ++  +    +  LLH   +   P     L  Q  D            +G +   +  +  
Sbjct: 469 NQNTLLKKGNTTLLHMTRAGSHPKADMVLLTQAADA-----------TGTTNWTSPAEFK 517

Query: 322 HEIKEHGSFDRLNSMETSVTSEPGSSIGAAIAASVTVPPDSEGQVTFSLAW 372
            E  ++G     N+   S  S  G ++  A++A +T+ P     VTF L W
Sbjct: 518 REFTDNGGLTGPNN---SPISRAGQTVNGALSAPLTLAPGESKSVTFVLTW 565


>gi|156383694|ref|XP_001632968.1| predicted protein [Nematostella vectensis]
 gi|156220031|gb|EDO40905.1| predicted protein [Nematostella vectensis]
          Length = 1641

 Score = 60.5 bits (145), Expect = 4e-06,   Method: Compositional matrix adjust.
 Identities = 54/188 (28%), Positives = 87/188 (46%), Gaps = 19/188 (10%)

Query: 626 DLNPKFVLQVYRDVVATGDKKFAKAVWPSVYVAMAYMDQFDRDGDGM---IEN--DGFPD 680
           D+NP F+ QVY   + TG+KK    +WP++      M  +  +G  +   + N  D    
Sbjct: 676 DVNPAFLSQVYHLYMWTGNKKLLDDLWPAIKDVTHAMVTWTTNGTSLPLRMTNTYDILQF 735

Query: 681 QTYDTWSVSGISAYSGGLWVAALQAASALAREVGDRGSEDYFLFKFQKAKVVYEK-LWN- 738
           Q YD      +S Y+  + +  L  A A+ +   +R        + +KA    ++ LW+ 
Sbjct: 736 QRYD------VSMYNSVMHLLGLTVARAIGQVRKERKFVKDLTLRIRKATSAIDRLLWDE 789

Query: 739 --GSYFNYDNSGSSQSSSIQADQLAGQWYARACGLLPIVDEDKARSALKKVYNYN----V 792
             G Y  + +     +S++ AD L GQ +A   GL  ++DE+K  S L+K    N     
Sbjct: 790 KRGYYRAWWDHAQPSTSALMADSLYGQVWAYTLGLGNLLDENKMASHLRKEAELNDTPFG 849

Query: 793 LKVMGGKR 800
           LKVM G R
Sbjct: 850 LKVMSGLR 857


>gi|229820204|ref|YP_002881730.1| hypothetical protein Bcav_1712 [Beutenbergia cavernae DSM 12333]
 gi|229566117|gb|ACQ79968.1| protein of unknown function DUF608 [Beutenbergia cavernae DSM
           12333]
          Length = 865

 Score = 60.5 bits (145), Expect = 5e-06,   Method: Compositional matrix adjust.
 Identities = 67/256 (26%), Positives = 111/256 (43%), Gaps = 40/256 (15%)

Query: 632 VLQVYRDVV--ATGDKKFAKAVWPSVYVAMAYM-DQFDRDGDGMIENDGFPDQTYDTWSV 688
           +L+ YR+V   A G    A A WP + + + ++  ++D  G G++   G    T+D   +
Sbjct: 509 LLKTYREVRSGAIGLGHLA-ASWPRLMLLLDHVRGRWDPSGSGVLR--GVQPSTHDI-DL 564

Query: 689 SGISAYSGGLWVAALQAASALA-REVGDRGSEDYFLFK-------------FQKAKVVYE 734
            G++ + G LW+AAL+A   +A R   D    D                  F++    Y+
Sbjct: 565 RGVNTFMGTLWLAALRACEEMALRLAADPHHADALGGATLGRELAAELHALFERGSAAYD 624

Query: 735 KL-WNGSYFNY----DNSGSSQ-SSSIQADQLAGQWYARACGLLPIVDEDKARSALKKVY 788
           ++ + G Y+      D     Q      +DQL GQW+A    L  ++     RSAL+ V 
Sbjct: 625 EICFTGEYYRQVLEPDEPTQFQWGDGCLSDQLIGQWWAHLLDLGHVLPAAHVRSALRAVV 684

Query: 789 NYNVLKVMGG--------KRGAVNGML-----PDGRVDMSSMQSREIWSGVTYAVAASMI 835
            +N+ +   G          G   G+L       GR ++++    E+WSGV Y VAA  +
Sbjct: 685 GHNLRRGFDGWEHTQRVYADGDDVGLLMCTWPHGGRPEVATRYCDEVWSGVEYEVAAHCL 744

Query: 836 HEDLADIGFQTACGIY 851
            E L   G     G++
Sbjct: 745 LEGLTAEGRAILDGLW 760



 Score = 48.9 bits (115), Expect = 0.012,   Method: Compositional matrix adjust.
 Identities = 86/353 (24%), Positives = 133/353 (37%), Gaps = 58/353 (16%)

Query: 97  VPLGGVGSGSIGRSYRGEFQRWQIFPRVCEDKPVLANQFSVFVSR-----SNGQKYSSVL 151
           +PLGG+G+G++     G  ++WQ+         +  + F+V VS+        +   +  
Sbjct: 17  MPLGGIGTGNVAIGADGGLRQWQLHNIGNHHGDLPGSLFAVRVSQWEPPLDEVRVLQAPA 76

Query: 152 CP----KTPEVLKDTTAAGIGSWDWNLKG----DKSTYHALYPRAWTVHEGEPDPELRIV 203
            P    +TP V  D     +  W   L        +T  A YP A   ++    P L I 
Sbjct: 77  APAGNTRTPMVDDDH----VPDWQRELAAAHGVTATTISATYPVARVAYDLGEVP-LAIE 131

Query: 204 CRQISPIIPHNYKESSYPVSVFTYTIYNSGKTSADITLLFTWTNSVGGD-------SEFT 256
               +P++P +   SS PV++FT+T+ N+G       L     N+VG D       + F 
Sbjct: 132 LEATTPLVPLDEAASSLPVAMFTFTLRNAGTFPVHGYLAGALQNAVGYDGVTPIDGTRFR 191

Query: 257 GQHYNSKTKMNDGVHAVLLHHRTSHQLPPV-----TFALAAQETDGVHVSLCPHFVISGN 311
           G   N+           L+   T+  LPP      + ALA    D    S  P +  +  
Sbjct: 192 GYGGNTNRVRRRASWTDLV--MTNDSLPPEHPGAGSMALAC---DDPRASALPAYSGTEE 246

Query: 312 SLG-LTAKDMWHEIKEHGSFDRLNSMETSVTSEPGSSIG---------AAIAASVTVPPD 361
            L  L ++ M  +      F        S    PG++ G          A+A+   V P 
Sbjct: 247 LLAFLRSRSMGGDTAR---FTGAPWAADSHAEGPGAARGPSPAAETWLGALASRFHVEPG 303

Query: 362 SEGQVTFSLAWDCPE--VNFMS--------GKTYYRRYTKFYGTHQNAAANIA 404
               V F +AW  P   VNF          G T +     +   H +AAA IA
Sbjct: 304 ESTTVRFLMAWHFPNRYVNFTQFGPERPEWGPTRFWIGNHYTLAHDDAAAVIA 356


>gi|406663141|ref|ZP_11071211.1| putative bile acid beta-glucosidase [Cecembia lonarensis LW9]
 gi|405552803|gb|EKB48138.1| putative bile acid beta-glucosidase [Cecembia lonarensis LW9]
          Length = 937

 Score = 60.5 bits (145), Expect = 5e-06,   Method: Compositional matrix adjust.
 Identities = 74/352 (21%), Positives = 143/352 (40%), Gaps = 24/352 (6%)

Query: 96  GVPLGGVGSGSIGRSYRGEFQRWQIFPRVCEDKPVLA--NQFSVFVSRSNGQKYSSVLCP 153
           G+P+GG+ +G++     G+   W IF R  E    +   N     V   +G  Y   + P
Sbjct: 108 GMPVGGLCTGTVYIGGDGKLWLWDIFNRQKEGVKSVTHDNWHGQRVRPRDGANYIFPMSP 167

Query: 154 KTPEV----LKDTTAAGIGSWDWNLKG-DKSTYHALYPRAWTVHEGEPDPELRIVCRQIS 208
           + P      L+ +    I     + KG +  ++ A YP A   +     P + +  +  S
Sbjct: 168 EYPFEQGFGLRISQNNKIWERSLDFKGFEDISFKAQYPIAEISYRDSQMP-IEVDLQAFS 226

Query: 209 PIIPHNYKESSYPVSVFTYTIYNSGKTSADITLLFTWTNSVGGDSEFTGQHYNSKTKMND 268
           P IP + K SS P +   +TI N+ K      L     N++   ++   +   S +   +
Sbjct: 227 PFIPLDVKNSSNPAAFMHFTITNTSKEEVKAELSGWLENAILRFTKDAEETELSNSIEKE 286

Query: 269 GVHAVLLHHRTSHQLPPVTFALAAQETDGVHVSLCPHFVISGNSLGLTAKDMWHEIKEHG 328
             + VL     S  +   +  +   + D   +SL    +I  +S  + + ++   +K + 
Sbjct: 287 SGNTVLTCSARSKNIHNFSTTIVQNQRDFGTMSL---MLIGNDSSTVGSAEI--PVKSNL 341

Query: 329 SFDRLNSMETSVTSEPGSSIGAAIAASVTVPPDSEGQVTFSLAWDCPEVNF-----MSGK 383
            F   N+ ET+ T+E G  +   ++ +V + P    ++TF ++W  P +        +G+
Sbjct: 342 LFPE-NTTETA-TAEFGEQLCGGVSKTVRLNPGESQKITFIISWHFPNIELPQNDRSTGE 399

Query: 384 TYYRRYTKFYGTHQNAAANIARDAILEHGSWELQIEAWQRPILEDKRLPEWY 435
              R YT  +   +  A  +A +    +         W+    E+  LP W+
Sbjct: 400 NKGRHYTNHFDDSREVAMELAAN----YQKLSYLTNLWREAFYENSSLPHWF 447



 Score = 58.2 bits (139), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 72/259 (27%), Positives = 111/259 (42%), Gaps = 46/259 (17%)

Query: 632 VLQVYRDVVATGDKKFAKAVWPSVYVAMAYMDQFDRD----GDGMIENDGFPDQTYDTWS 687
           VL+ YRD   + D  F K  W ++ +A+ Y+   D++     +GMI   G    T D   
Sbjct: 540 VLRAYRDYQMSEDDTFLKNNWDNIKLALKYLIDLDKEYGETANGMIY--GEQHNTLDAEW 597

Query: 688 VSGISAYSGGLWVAALQAASALAREVGDRGSEDYFLFKFQKAKVVYEKLWNGSYF----- 742
              I A +  L++AAL AA  +A+  GD  SE  +L   +  +   E L+NG YF     
Sbjct: 598 FGYIPAITS-LYLAALAAAVEMAKATGDISSEKEYLSILESGRKNIESLFNGEYFIQEED 656

Query: 743 --NYDNSGSSQSSSIQADQLAGQWYARACGLLPIVDEDKARSALKKVYNYNVLKVMG--- 797
             + D  G  +   I  DQ+ GQ +A    L  + ++   +S+L  ++ YN +  MG   
Sbjct: 657 PDHLDAIGIGKGCYI--DQVFGQGWAFQLNLGRLYNKKMIQSSLDSLWKYNFVPDMGPLR 714

Query: 798 --------GKRGAVNGMLPDGRVDMSSMQ---SREIW-------------SGVTYAVAAS 833
                   G+  A++G   D  + M +      RE W             +G  Y  A+ 
Sbjct: 715 DSLPPQIAGRPYAIHG---DAGLVMCTWPLGGKREDWQRHWQFGYFNECMTGFEYQAASH 771

Query: 834 MIHEDLADIGFQTACGIYE 852
           MI ED  D G      I+E
Sbjct: 772 MIWEDKLDEGLSIIKAIHE 790


>gi|402583425|gb|EJW77369.1| hypothetical protein WUBG_11725, partial [Wuchereria bancrofti]
          Length = 109

 Score = 58.5 bits (140), Expect = 1e-05,   Method: Composition-based stats.
 Identities = 28/60 (46%), Positives = 39/60 (65%), Gaps = 3/60 (5%)

Query: 771 LLPIVDEDKARSALKKVYNYNVLKVMGGKRGAVNGMLPDGRVDMSSMQSREIWSGVTYAV 830
           LLP+      +SAL ++Y+ N+L   GG+ GAVNGM  DG V    +Q+ E+W GVTYA+
Sbjct: 1   LLPL---SHIKSALNQIYSSNILNFAGGRLGAVNGMRKDGTVGRRHLQADEMWVGVTYAL 57


>gi|429739681|ref|ZP_19273428.1| hypothetical protein HMPREF9151_01886 [Prevotella saccharolytica
           F0055]
 gi|429156127|gb|EKX98766.1| hypothetical protein HMPREF9151_01886 [Prevotella saccharolytica
           F0055]
          Length = 989

 Score = 58.5 bits (140), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 56/262 (21%), Positives = 107/262 (40%), Gaps = 52/262 (19%)

Query: 632 VLQVYRDVVATGDKKFAKAVWPSVYVAMAY-MDQFDRDGDGMIENDGFPDQTYDTWSVSG 690
           +++ YRD   +GD ++ K ++P++  ++ Y +  +D    G IE         + W   G
Sbjct: 633 IMKAYRDWRISGDTEWLKKLYPAIKASLDYCIRTWDPRQKGCIEEPHHNTYDIEFWGADG 692

Query: 691 ISAYSGGLWVAALQAASALAREVGDRGSEDYFLFKFQKAKVVYEKLWNGSYF-------- 742
           +       +V AL+A   ++  +  R S  Y     +  +++  +L+NG Y+        
Sbjct: 693 MCT---SFYVGALEAFIRMSDAL-HRPSTPYRKLLKKSVELMNTQLFNGEYYIQKTQWTG 748

Query: 743 --------------NYDNSGSSQ-----------SSSIQADQLAGQWYARACGLLPIVDE 777
                         NY N  + Q            +   +D + G W A  CGL   +++
Sbjct: 749 LRAKSPTEVLSFGSNYSNPEALQLLQAEGPKYQYGTGCLSDGVIGMWMASVCGLPEPLNQ 808

Query: 778 DKARSALKKVYNYNVLK-------------VMGGKRGAVNGMLPDGRV-DMSSMQSREIW 823
            +  S L  VY YNV +              +G + G +    P G+   +  + S E+W
Sbjct: 809 AQTLSHLNAVYKYNVREDLYRHYNTQRPSYALGHEGGLLLCTWPKGKKPTLPFVYSNEVW 868

Query: 824 SGVTYAVAASMIHEDLADIGFQ 845
           +G+ Y VA+ ++ +   D G +
Sbjct: 869 TGIEYQVASHLLMKGEIDKGLK 890


>gi|282877348|ref|ZP_06286172.1| conserved hypothetical protein [Prevotella buccalis ATCC 35310]
 gi|281300533|gb|EFA92878.1| conserved hypothetical protein [Prevotella buccalis ATCC 35310]
          Length = 997

 Score = 58.2 bits (139), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 53/262 (20%), Positives = 107/262 (40%), Gaps = 52/262 (19%)

Query: 632 VLQVYRDVVATGDKKFAKAVWPSVYVAMAY-MDQFDRDGDGMIENDGFPDQTYDTWSVSG 690
           +++VYRD   +G+ ++ K+++P+V  ++ Y +  +D    G +          + W   G
Sbjct: 638 IMKVYRDWRISGNDEWLKSLYPAVKASLDYCIRTWDPHEKGSLTEPHHNTYDIEFWGADG 697

Query: 691 ISAYSGGLWVAALQAASALAREVGDRGSEDYFLFKFQKAKVVYEKLWNGSYF-------- 742
           +       +  AL+A   ++  +  +  +   L K +    + + L+NG Y+        
Sbjct: 698 MCT---SFYTGALEAFVQMSEHLNKQAGKYKKLLK-RSISCMDKDLFNGEYYFQKTQWTG 753

Query: 743 --------------NYDNSGSSQ-----------SSSIQADQLAGQWYARACGLLPIVDE 777
                         NY ++ + +            +   +D + G W A  CGL   +++
Sbjct: 754 LEAKSPTEIPSFGSNYRHAEAQELLKTEGPKYQYDTGCLSDGILGMWMASVCGLPEPLNQ 813

Query: 778 DKARSALKKVYNYNVLK-------------VMGGKRGAVNGMLPDG-RVDMSSMQSREIW 823
              +S L  VY YN+ K              MGG  G +    P G +  +  + S E+W
Sbjct: 814 THVKSHLNAVYQYNLRKNMYNHYNTQRPSYAMGGDGGLLLCTWPKGAKPTLPFVYSNEVW 873

Query: 824 SGVTYAVAASMIHEDLADIGFQ 845
           +G+ Y VA+ +I +   + G +
Sbjct: 874 TGIEYQVASHLIMKGEVEKGLE 895


>gi|332981217|ref|YP_004462658.1| hypothetical protein Mahau_0633 [Mahella australiensis 50-1 BON]
 gi|332698895|gb|AEE95836.1| protein of unknown function DUF608 [Mahella australiensis 50-1 BON]
          Length = 880

 Score = 58.2 bits (139), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 66/276 (23%), Positives = 114/276 (41%), Gaps = 58/276 (21%)

Query: 632 VLQVYRDVVATGDKKFAKAVWPSVYVAMAYM------DQFDRDGDGMIENDGFPDQTYDT 685
           V++ YR+   +GD ++ K++WP V  ++ Y       D++D +  G++   G    T D 
Sbjct: 469 VIKTYREWKISGDDEWLKSLWPRVKKSIEYAWVDTNEDRWDPEQKGVLT--GRQHHTLDM 526

Query: 686 WSVSGISAYSGGLWVAALQAASALAREVGDRGSEDYFLFKFQKAKV-VYEKLWNGSY--- 741
             + G +++  G ++AAL+A + +A  VGD  +   +   F++ K      L+NG Y   
Sbjct: 527 -ELFGPNSWLTGFYLAALKAGAEMASYVGDEQAAQRYTDIFRRGKAWADSNLFNGEYYYH 585

Query: 742 ------------FNYDNSGS-SQSSSIQA-------------------DQLAGQWYARAC 769
                       F+ D +G+    S+I+A                   DQ+  QW+A   
Sbjct: 586 LIDLQDKSILERFDTDKTGTLIGDSTIKAYWDEERGQIKYQVGEGCIIDQVLAQWHANLI 645

Query: 770 GLLPIVDEDKARSALKKVYNYNVLKV-----------MGGKRGAVNGMLPDGRVD--MSS 816
           GL  I D    R A+  VY  N  K+           +  + G +    P GR    +S 
Sbjct: 646 GLGEIFDPGNTRKAVYAVYKNNFKKMREVFNPCRVFSLNDESGMIICSYPAGRKKPAISV 705

Query: 817 MQSREIWSGVTYAVAASMIHEDLADIGFQTACGIYE 852
             ++E  +G  Y  A  +I +     G +    I E
Sbjct: 706 PYAQETMNGFEYQAAEHLIQKGFIREGEEVVRAIRE 741


>gi|156383620|ref|XP_001632931.1| predicted protein [Nematostella vectensis]
 gi|156219994|gb|EDO40868.1| predicted protein [Nematostella vectensis]
          Length = 1090

 Score = 57.8 bits (138), Expect = 3e-05,   Method: Compositional matrix adjust.
 Identities = 57/233 (24%), Positives = 101/233 (43%), Gaps = 23/233 (9%)

Query: 632 VLQVYRDVVATGDKKFAKAVWPSVYVAMA-YMDQFDRDGDGMIENDGFPDQTYDTWSVSG 690
           VL+ YR+V     K +   +WP V   M  +M   D +G G+I   G    TYD  ++ G
Sbjct: 640 VLKTYREVRQGAPKTWFNKMWPQVKRIMKRWMTDLD-NGTGVIP--GPQPNTYDC-TLYG 695

Query: 691 ISAYSGGLWVAALQAASALAREVGDRGSEDYFLFKFQKAKVVYEK-LWNGSYF----NYD 745
           ++ Y  G ++A L+AA  +A+   +      +  +F+      +K L+NGS++    + +
Sbjct: 696 VNVYISGYYLAGLRAAEKMAKIQNEPELAATYHQRFELGSAKIDKTLFNGSWYIQEVDSE 755

Query: 746 NSGSSQSSSIQADQLAGQWYARACGLLPIVDEDKARSALKKVYNYNVLKVMGGKRGAVNG 805
           +  +    +   D L GQW+A A GL  I+ +   +S L+ ++  N ++       A   
Sbjct: 756 HPVNVVGDATWVDCLVGQWWAHALGLGYILSQKNIQSTLQNIFLKNHVERFVPATQAPRQ 815

Query: 806 MLPDGRVDMSSM-------------QSREIWSGVTYAVAASMIHEDLADIGFQ 845
              +    +                 S   W+G+ Y  A    +E L DIG +
Sbjct: 816 FFDERDAGLRICVYPNDLPKSPLLYHSEGAWTGLEYEYAELCFYEGLNDIGLK 868



 Score = 47.8 bits (112), Expect = 0.028,   Method: Compositional matrix adjust.
 Identities = 44/175 (25%), Positives = 74/175 (42%), Gaps = 27/175 (15%)

Query: 96  GVPLGGVGSGSIGRSYRGEFQRWQIFPRVCEDKPVLANQFSVFVSRSNGQKYSSVLCPK- 154
            +PLG +G GS+  +  G  ++WQI   V     V  + F++ V ++N  K  ++     
Sbjct: 162 ALPLGALGGGSVALAGDGGLRQWQISGAVNHTAHVPNSFFAIRVEQTNSVKAVALQSNTW 221

Query: 155 ----------------TPEVLKDTTAAGIGSWDWNLKGDKS-TYHALYPRAWTVHEGEPD 197
                            PE  KD            + G K+ T  A YP A   +  +  
Sbjct: 222 YDQTGFKPAAYISDHVVPEASKDLLKT--------IPGVKTITVGAKYPIAEVSYLSDEV 273

Query: 198 PELRIVCRQISPIIPHNYKESSYPVSVFTYTIYNSGKTSADITLLFTWTNSVGGD 252
           P +++     SP IP + K S  PV +F +++ N+  + A ++LL +  N  G D
Sbjct: 274 P-VQVHLEAFSPCIPLDSKNSGIPVIMFNFSVTNTTSSDAQVSLLGSLQNIAGWD 327


>gi|340616465|ref|YP_004734918.1| hypothetical protein zobellia_454 [Zobellia galactanivorans]
 gi|339731262|emb|CAZ94527.1| Conserved hypothetical protein [Zobellia galactanivorans]
          Length = 923

 Score = 57.8 bits (138), Expect = 3e-05,   Method: Compositional matrix adjust.
 Identities = 62/274 (22%), Positives = 110/274 (40%), Gaps = 61/274 (22%)

Query: 626 DLNPKFVLQVYRDVVATGDKKFAKAVWPSVYVAMAYMDQFDRDGDGM-IENDGFPDQTYD 684
           D    +VL+ YR+ + + + +F   VW  V +A+ Y     +DG  + +E +G  +    
Sbjct: 495 DAQSGYVLKAYREHLNSPNNEFLDTVWEKVKMAIGY--HIFKDGAEIGLEPNGVLEGEQT 552

Query: 685 TWSVS--GISAYSGGLWVAALQAASALAREVG--DRGSEDYFLFKFQKAKVVYEKLWNGS 740
            W     G + Y+  L++AAL+AA  +A+  G  +  S  + LF+    K + E++WNG 
Sbjct: 553 FWDPMWYGPNPYNNTLYLAALRAAEEMAKVKGEPELASRYHKLFE-SGQKFMNEEMWNGE 611

Query: 741 Y-----------------------------------FNYDNSGSSQSSSIQADQLAGQWY 765
           +                                   FN   SG    ++  A QL GQ +
Sbjct: 612 FYIHLYPVGLNGNVGIKNGFKLPSEIDDNAQSYIKAFNEGRSGYFPGTACDAQQLFGQNW 671

Query: 766 ARACGLLPIVDEDKARSALKKVYNYN----VLKVMGGKRGAVNGMLPDGRVDM------- 814
           A   GL  I+  +K+  A K ++ YN    +  +   ++     +   G   M       
Sbjct: 672 ANQLGLGYILSPEKSEIAAKSIFKYNWTPDIATIYDFEKPNARVLAAKGEAAMINGGWTK 731

Query: 815 -------SSMQSREIWSGVTYAVAASMIHEDLAD 841
                  ++     +W+G+ Y  A  MI+E L +
Sbjct: 732 EKPKSFENTHDKTNVWTGLEYEAACDMINEGLLE 765


>gi|410867532|ref|YP_006982143.1| hypothetical protein PACID_30230 [Propionibacterium acidipropionici
           ATCC 4875]
 gi|410824173|gb|AFV90788.1| hypothetical protein PACID_30230 [Propionibacterium acidipropionici
           ATCC 4875]
          Length = 845

 Score = 57.8 bits (138), Expect = 3e-05,   Method: Compositional matrix adjust.
 Identities = 88/361 (24%), Positives = 151/361 (41%), Gaps = 54/361 (14%)

Query: 526 EENIGQFLYLEGI---EYLMWNTYDVHFYSSFALIMLFPKIQLSIQRDFAAAVMMHDPSK 582
            ++ G+F   EG+      MW T DV +        ++         ++A A     PS 
Sbjct: 396 RDHEGRFFGFEGVLGASTAMW-TGDVGYSCPLNCTHVW---------NYAQAASALWPSL 445

Query: 583 MKLLDDGQW-VSRKVLGAVPHDIGICDPWFEVNAYCLYDTARWKDLNPKF------VLQV 635
              + D ++ V +   G++PH + I   W E     L+D       +P        VL+ 
Sbjct: 446 EASMRDTEFEVMQAPDGSLPHRV-IAPVWLEQ----LWDVEIGGPDHPALDGMLGAVLKT 500

Query: 636 YRDVV--ATGDKKFAKAVWPSVYVAMAYMD-QFDRDGDGMIENDGFPDQTYDTWSVSGIS 692
           YR+V   A G   + +  W ++   M ++  ++D +  G++   G    T+D  S+ G +
Sbjct: 501 YREVRRGAVG-AGWLRDRWNNLCRLMDHVAARWDPECSGLLT--GEQPSTHDI-SLYGAN 556

Query: 693 AYSGGLWVAALQAASALAREVGDRGSEDYFLFKFQKAKVVYEK-LWNGSYF-----NYDN 746
            + G LW+AAL+A+  +A  +G       +  +F  +   Y+  L+ G Y+         
Sbjct: 557 MFMGSLWLAALRASEEMALILGATDRAMGWRERFVLSSAAYDTALFRGGYYVQAADETHG 616

Query: 747 SGSSQSSSIQADQLAGQWYARACGLLPIVDEDKARSALKKVYNYNV-----------LKV 795
                     ADQL GQW+A    L  ++  +  RSAL+ V  +N+              
Sbjct: 617 DDDEFGDGCLADQLIGQWWAHLLDLGHLLPAEHVRSALQAVVAHNLNPADETRHSQRAYA 676

Query: 796 MGGKRGAVNGMLP-DGRVDMSSMQSREIWSGVTYAVAASMIHEDLADIGFQTACGIYEAA 854
           + G++G V    P  GR   ++    E+W+G  Y VAA  + E L   G +    I +A 
Sbjct: 677 VRGEQGLVMCSWPRGGRPARATRYCDEVWTGCEYEVAALCLAEGLEKDGHR----ILDAI 732

Query: 855 W 855
           W
Sbjct: 733 W 733


>gi|312086024|ref|XP_003144914.1| hypothetical protein LOAG_09338 [Loa loa]
 gi|307759923|gb|EFO19157.1| hypothetical protein LOAG_09338, partial [Loa loa]
          Length = 84

 Score = 57.0 bits (136), Expect = 5e-05,   Method: Composition-based stats.
 Identities = 28/76 (36%), Positives = 41/76 (53%)

Query: 670 DGMIENDGFPDQTYDTWSVSGISAYSGGLWVAALQAASALAREVGDRGSEDYFLFKFQKA 729
           DGMIEN G  DQTYDT  ++G+S Y G LW+  L A  + A E+ D  S + +     +A
Sbjct: 9   DGMIENSGTADQTYDTLKMNGVSTYCGSLWLTLLHAEVSFASEIDDIRSGNLYTGFLLRA 68

Query: 730 KVVYEKLWNGSYFNYD 745
             ++E  +   Y   +
Sbjct: 69  GDIFESEYYNIYLQME 84


>gi|312373134|gb|EFR20945.1| hypothetical protein AND_18252 [Anopheles darlingi]
          Length = 734

 Score = 57.0 bits (136), Expect = 5e-05,   Method: Compositional matrix adjust.
 Identities = 26/68 (38%), Positives = 40/68 (58%)

Query: 77  KGKRAFIDPFIKRHLTSSHGVPLGGVGSGSIGRSYRGEFQRWQIFPRVCEDKPVLANQFS 136
           +G++  +D     +    +G PLGG+G+G+IGR + GEF R+Q+ P + E   V ANQF 
Sbjct: 655 EGRQVLMDYLYTENAKQIYGAPLGGIGAGTIGRGFAGEFCRYQLRPGLYEYNTVHANQFI 714

Query: 137 VFVSRSNG 144
           V +    G
Sbjct: 715 VTIKDETG 722


>gi|340616467|ref|YP_004734920.1| hypothetical protein zobellia_456 [Zobellia galactanivorans]
 gi|339731264|emb|CAZ94529.1| Conserved hypothetical protein [Zobellia galactanivorans]
          Length = 822

 Score = 56.6 bits (135), Expect = 6e-05,   Method: Compositional matrix adjust.
 Identities = 53/192 (27%), Positives = 87/192 (45%), Gaps = 27/192 (14%)

Query: 626 DLNPKFVLQVYRDVVATGDKKFAKAVWPSVYVAMAYMDQFDRDGDGMIENDGFPDQTYDT 685
           D +  F+L VY   +  GD++F   +WP V  A  +  Q +R         G PD+   T
Sbjct: 476 DCSTSFILSVYASHIWDGDEEFVNEMWPHVKRAAQW--QINRS-----RKFGLPDRLAAT 528

Query: 686 WSVSG-----ISAYSGGLWVAALQAASALAREVGDRGSEDYF-LFKFQKA---KVVYEKL 736
           + +SG     + +Y+  + +AA++AA+ LA+E   R  ED   L+K   A     +    
Sbjct: 529 YDLSGFGKKDLVSYNAFMHLAAMKAATELAKE---REKEDLVSLYKKNIAAAQNTLKAHF 585

Query: 737 WNGSYF-NYDNSGSSQSSSIQADQLAGQWYARACGLLPIVDEDKARSALK-------KVY 788
           W G+YF N+ N    ++  I  D   GQ ++   GL  +++ +  +S L+         Y
Sbjct: 586 WTGTYFRNWWNKDEPKNDDIHIDTQFGQVWSYLLGLGDVIEPELMKSHLEAEVVAADSPY 645

Query: 789 NYNVLKVMGGKR 800
              VL   G  R
Sbjct: 646 GMKVLAYAGNDR 657


>gi|338212680|ref|YP_004656735.1| hypothetical protein [Runella slithyformis DSM 19594]
 gi|336306501|gb|AEI49603.1| hypothetical protein Runsl_3226 [Runella slithyformis DSM 19594]
          Length = 855

 Score = 56.2 bits (134), Expect = 8e-05,   Method: Compositional matrix adjust.
 Identities = 59/258 (22%), Positives = 99/258 (38%), Gaps = 42/258 (16%)

Query: 632 VLQVYRDVVATGDKKFAKAVWPSVYVAMAYM---DQFDRDGDGMIENDGFPDQTYDTWSV 688
           +++ YRD   +G+  F +  WP +  A+A+    D +D + DG++  DG    T D    
Sbjct: 484 IMKFYRDWQLSGNTAFLQKNWPKIKRALAFAWVPDGWDGNLDGVM--DGAQHNTMDV-EY 540

Query: 689 SGISAYSGGLWVAALQAASALAREVGDRGSEDYFLFKFQKAKV-VYEKLWNGSYFNYD-- 745
            G +      ++ AL+AA  +A  + D          FQ       ++L+NG Y+     
Sbjct: 541 FGPNPQMQIWYLGALKAAEKMAEALHDAAFAQQCKTLFQSGSTWADQQLFNGEYYEQQVM 600

Query: 746 ---------------NSGSSQSSSIQ------ADQLAGQWYARACGLLPIVDEDKARSAL 784
                          NS        Q       DQL GQ+ A  CGL  +V     +++L
Sbjct: 601 SPVGRKFAKGTVSGWNSTQGDFPPFQLAKGCLVDQLVGQFMAHVCGLGYLVKPQNVQTSL 660

Query: 785 KKVYNYNVLKV------------MGGKRGAVNGMLPDGRVDMSSMQSREIWSGVTYAVAA 832
           K +  YN                +  +   +    P GR  +      E+ +G  Y  A 
Sbjct: 661 KSILKYNYRATLTEHFNNMRSYALADEPALLMASWPKGRPTVPFPYFAEVMTGFEYTAAI 720

Query: 833 SMIHEDLADIGFQTACGI 850
            M++E++ D G +    I
Sbjct: 721 GMLYENMTDDGLKCIQNI 738


>gi|361067543|gb|AEW08083.1| Pinus taeda anonymous locus 0_18489_02 genomic sequence
 gi|383162873|gb|AFG64132.1| Pinus taeda anonymous locus 0_18489_02 genomic sequence
 gi|383162874|gb|AFG64133.1| Pinus taeda anonymous locus 0_18489_02 genomic sequence
 gi|383162875|gb|AFG64134.1| Pinus taeda anonymous locus 0_18489_02 genomic sequence
 gi|383162876|gb|AFG64135.1| Pinus taeda anonymous locus 0_18489_02 genomic sequence
 gi|383162877|gb|AFG64136.1| Pinus taeda anonymous locus 0_18489_02 genomic sequence
 gi|383162878|gb|AFG64137.1| Pinus taeda anonymous locus 0_18489_02 genomic sequence
 gi|383162879|gb|AFG64138.1| Pinus taeda anonymous locus 0_18489_02 genomic sequence
 gi|383162880|gb|AFG64139.1| Pinus taeda anonymous locus 0_18489_02 genomic sequence
 gi|383162881|gb|AFG64140.1| Pinus taeda anonymous locus 0_18489_02 genomic sequence
 gi|383162882|gb|AFG64141.1| Pinus taeda anonymous locus 0_18489_02 genomic sequence
          Length = 79

 Score = 55.1 bits (131), Expect = 2e-04,   Method: Composition-based stats.
 Identities = 30/62 (48%), Positives = 35/62 (56%), Gaps = 1/62 (1%)

Query: 875 QYRSLCYMRPLAIWAMQWALTRPKPKTLEKQMKPEVTDESLLRYHAGFSKVARLLKLPEE 934
            YRSL YMRPLAIWAMQWAL  PK   LE      +  E   ++HAG + VA   K   +
Sbjct: 2   HYRSLIYMRPLAIWAMQWALNPPK-AVLEAPKINLMEREHPSQFHAGLAAVAEAFKHVHQ 60

Query: 935 QG 936
            G
Sbjct: 61  NG 62


>gi|156376779|ref|XP_001630536.1| predicted protein [Nematostella vectensis]
 gi|156217559|gb|EDO38473.1| predicted protein [Nematostella vectensis]
          Length = 666

 Score = 54.3 bits (129), Expect = 4e-04,   Method: Compositional matrix adjust.
 Identities = 48/175 (27%), Positives = 79/175 (45%), Gaps = 15/175 (8%)

Query: 626 DLNPKFVLQVYRDVVATGDKKFAKAVWPSVYVAMAYMDQFDRDGDGMIENDGFPDQTYDT 685
           D +  FV++VY     TGD    + +WP+V  A+ ++      G G+      P +   T
Sbjct: 277 DSSALFVVEVYHYFKWTGDSALLRRLWPNVKRAVTWLIGESTKGTGL------PFRKVST 330

Query: 686 WSVSGISA-----YSGGLWVAALQAASALAREVGDRGSEDYFLFKFQKA-KVVYEKLWNG 739
           + + GI       Y  G+++ AL+AA  LA  + D+           +A ++V   LWN 
Sbjct: 331 YDLVGIDKYDHVMYEAGVFLLALRAAQHLAVHLKDKPFNATTTHALIRATQLVDVTLWNE 390

Query: 740 S---YFNYDNSGSSQSSSIQADQLAGQWYARACGLLPIVDEDKARSALKKVYNYN 791
               Y  + +S    S  I +D L GQ +A   GL  ++ E+K +  L+    YN
Sbjct: 391 ERQFYRAWWDSEYGASDWIMSDSLYGQVWANTLGLGKLLPEEKMKVHLQNEIKYN 445


>gi|270294313|ref|ZP_06200515.1| conserved hypothetical protein [Bacteroides sp. D20]
 gi|317480939|ref|ZP_07940019.1| hypothetical protein HMPREF1007_03138 [Bacteroides sp. 4_1_36]
 gi|270275780|gb|EFA21640.1| conserved hypothetical protein [Bacteroides sp. D20]
 gi|316902832|gb|EFV24706.1| hypothetical protein HMPREF1007_03138 [Bacteroides sp. 4_1_36]
          Length = 853

 Score = 52.4 bits (124), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 56/261 (21%), Positives = 105/261 (40%), Gaps = 53/261 (20%)

Query: 632 VLQVYRDVVATGDKKFAKAVWPSVYVAMAY-MDQFDRDGDGMIENDGFPDQTYDTWSVSG 690
           +++VYRD    G+ ++ K ++  V  ++ Y ++ +D    G+IE         + W  SG
Sbjct: 491 IMKVYRDWHIYGNDEWLKLIYSYVQNSLDYCINTWDPKRKGVIEEPHHNTYDIEFWGPSG 550

Query: 691 -ISAYSGGLWVAALQAASALAREVGDRGSEDYFLFKFQKAKVVYEKLWNGSYF------- 742
            I++Y    +  ALQA  A+   + ++   +Y     +    +  +L++G YF       
Sbjct: 551 MINSY----YTGALQAFVAMGEHL-EKDMTEYRELLDKSIDYMENQLYDGEYFIQNIRWK 605

Query: 743 ---------------NYDNSG----------SSQSSSIQADQLAGQWYARACGLLPIVDE 777
                          NY   G                  +D + G W +  CGL   +D 
Sbjct: 606 ELQASDPTKVQSVNSNYSKEGLDLLEKEGPKYQYGKGCLSDGVVGAWLSLVCGLDEAIDR 665

Query: 778 DKARSALKKVYNYNVLK-------------VMGGKRGAVNGMLPD-GRVDMSSMQSREIW 823
            K  S L  V+ YN+ +              +G + G +    P  G++ +  + S E+W
Sbjct: 666 KKILSHLLSVHKYNLKRNLRKHVNPQRSTFALGDEGGLLLCSWPKGGKLQLPFVYSNEVW 725

Query: 824 SGVTYAVAASMIHEDLADIGF 844
           +G+ Y VA+ ++ E   + G 
Sbjct: 726 TGIEYQVASHLMFEGEVEKGL 746


>gi|160890992|ref|ZP_02071995.1| hypothetical protein BACUNI_03439 [Bacteroides uniformis ATCC 8492]
 gi|156859213|gb|EDO52644.1| hypothetical protein BACUNI_03439 [Bacteroides uniformis ATCC 8492]
          Length = 831

 Score = 52.0 bits (123), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 56/261 (21%), Positives = 105/261 (40%), Gaps = 53/261 (20%)

Query: 632 VLQVYRDVVATGDKKFAKAVWPSVYVAMAY-MDQFDRDGDGMIENDGFPDQTYDTWSVSG 690
           +++VYRD    G+ ++ K ++  V  ++ Y ++ +D    G+IE         + W  SG
Sbjct: 469 IMKVYRDWHIYGNDEWLKLIYSYVQNSLDYCINTWDPKRKGVIEEPHHNTYDIEFWGPSG 528

Query: 691 -ISAYSGGLWVAALQAASALAREVGDRGSEDYFLFKFQKAKVVYEKLWNGSYF------- 742
            I++Y    +  ALQA  A+   + ++   +Y     +    +  +L++G YF       
Sbjct: 529 MINSY----YTGALQAFVAMGEHL-EKDMTEYRELLDKSIDYMENQLYDGEYFIQNIRWK 583

Query: 743 ---------------NYDNSG----------SSQSSSIQADQLAGQWYARACGLLPIVDE 777
                          NY   G                  +D + G W +  CGL   +D 
Sbjct: 584 ELQASDPTKVQSVNSNYSKEGLDLLEKEGPKYQYGKGCLSDGVVGAWLSLVCGLDEAIDR 643

Query: 778 DKARSALKKVYNYNVLK-------------VMGGKRGAVNGMLPD-GRVDMSSMQSREIW 823
            K  S L  V+ YN+ +              +G + G +    P  G++ +  + S E+W
Sbjct: 644 KKILSHLLSVHKYNLKRNLRKHVNPQRSTFALGDEGGLLLCSWPKGGKLQLPFVYSNEVW 703

Query: 824 SGVTYAVAASMIHEDLADIGF 844
           +G+ Y VA+ ++ E   + G 
Sbjct: 704 TGIEYQVASHLMFEGEVEKGL 724


>gi|160887666|ref|ZP_02068669.1| hypothetical protein BACUNI_00066 [Bacteroides uniformis ATCC 8492]
 gi|156862844|gb|EDO56275.1| hypothetical protein BACUNI_00066 [Bacteroides uniformis ATCC 8492]
          Length = 992

 Score = 51.2 bits (121), Expect = 0.003,   Method: Compositional matrix adjust.
 Identities = 53/260 (20%), Positives = 107/260 (41%), Gaps = 51/260 (19%)

Query: 632 VLQVYRDVVATGDKKFAKAVWPSVYVAMAY-MDQFDRDGDGMIENDGFPDQTYDTWSVSG 690
           +++VYR+   +GD ++ K ++P+V  ++ Y +  +D    G +E         + W   G
Sbjct: 634 IMKVYREWRISGDTQWMKDLFPAVKKSLDYCIRTWDPLHKGYLEEPHHNTYDIEFWGPDG 693

Query: 691 ISAYSGGLWVAALQAASALAREVGDRGSEDYFLFKFQKAKVVYEKLWNGSYF-------- 742
           +       ++ AL A   + +E+  +  ++Y     +  K +   L++G YF        
Sbjct: 694 MCT---SFYLGALTAFIEMGKEL-KQPVKEYTALLSKGKKYMETALFDGEYFIQKIQWEG 749

Query: 743 ----------NYDNSGSSQS--------------SSIQADQLAGQWYARACGLLPIVDED 778
                     ++  S S ++              +   +D + G W A  CGL  ++D +
Sbjct: 750 LQAPNPVDVMSFGGSYSEEALKLLKEEGPKYQYGTGCLSDGILGMWMASVCGLDEVLDNE 809

Query: 779 KARSALKKVYNYNV-------------LKVMGGKRGAVNGMLPD-GRVDMSSMQSREIWS 824
           K RS L  V+ YN+             +   G   G +    P  G + +  + S E+W+
Sbjct: 810 KVRSHLVAVHKYNLKHDLIDHFNPQRPVYACGKDGGLLLCTWPKGGMLSLPFVYSNEVWT 869

Query: 825 GVTYAVAASMIHEDLADIGF 844
           G+ Y VA+ ++ +   + G 
Sbjct: 870 GIEYQVASHLMMKGEVEKGL 889


>gi|317478064|ref|ZP_07937244.1| hypothetical protein HMPREF1007_00360 [Bacteroides sp. 4_1_36]
 gi|316905767|gb|EFV27541.1| hypothetical protein HMPREF1007_00360 [Bacteroides sp. 4_1_36]
          Length = 970

 Score = 51.2 bits (121), Expect = 0.003,   Method: Compositional matrix adjust.
 Identities = 54/260 (20%), Positives = 109/260 (41%), Gaps = 51/260 (19%)

Query: 632 VLQVYRDVVATGDKKFAKAVWPSVYVAMAY-MDQFDRDGDGMIENDGFPDQTYDTWSVSG 690
           +++VYR+   +GD ++ K ++P+V  ++ Y +  +D    G +E         + W   G
Sbjct: 612 IMKVYREWRISGDTQWMKDLFPAVKKSLDYCIRTWDPLHKGYLEEPHHNTYDIEFWGPDG 671

Query: 691 ISAYSGGLWVAALQAASALAREVGDRGSEDYFLFKFQKAKVVYEKLWNGSYF-------- 742
           +       ++ AL A   + +E+  +  ++Y     +  K +   L++G YF        
Sbjct: 672 MCT---SFYLGALTAFIEMGKEL-KQPVKEYTALLSKGKKYMETALFDGEYFIQKIQWEG 727

Query: 743 ----------NYDNSGSSQS--------------SSIQADQLAGQWYARACGLLPIVDED 778
                     ++  S S ++              +   +D + G W A  CGL  ++D +
Sbjct: 728 LQAPNPVDVMSFGGSYSEEALKLLKEEGPKYQYGTGCLSDGILGMWMASVCGLDEVLDNE 787

Query: 779 KARSALKKVYNYN----VLKVMGGKR-----GAVNGML-----PDGRVDMSSMQSREIWS 824
           K RS L  V+ YN    ++     +R     G   G+L       G + +  + S E+W+
Sbjct: 788 KVRSHLVAVHKYNLKHDLIDHFNPQRPVYACGKDGGLLLCTWPKGGMLSLPFVYSNEVWT 847

Query: 825 GVTYAVAASMIHEDLADIGF 844
           G+ Y VA+ ++ +   + G 
Sbjct: 848 GIEYQVASHLMMKGEVEKGL 867


>gi|270295078|ref|ZP_06201279.1| conserved hypothetical protein [Bacteroides sp. D20]
 gi|270274325|gb|EFA20186.1| conserved hypothetical protein [Bacteroides sp. D20]
          Length = 970

 Score = 50.8 bits (120), Expect = 0.003,   Method: Compositional matrix adjust.
 Identities = 53/260 (20%), Positives = 107/260 (41%), Gaps = 51/260 (19%)

Query: 632 VLQVYRDVVATGDKKFAKAVWPSVYVAMAY-MDQFDRDGDGMIENDGFPDQTYDTWSVSG 690
           +++VYR+   +GD ++ K ++P+V  ++ Y +  +D    G +E         + W   G
Sbjct: 612 IMKVYREWRISGDTQWMKDLFPAVKKSLDYCIRTWDPLHKGYLEEPHHNTYDIEFWGPDG 671

Query: 691 ISAYSGGLWVAALQAASALAREVGDRGSEDYFLFKFQKAKVVYEKLWNGSYF-------- 742
           +       ++ AL A   + +E+  +  ++Y     +  K +   L++G YF        
Sbjct: 672 MCT---SFYLGALTAFIEMGKEL-KQPVKEYTALLSKGKKYMETALFDGEYFIQKIQWEG 727

Query: 743 ----------NYDNSGSSQS--------------SSIQADQLAGQWYARACGLLPIVDED 778
                     ++  S S ++              +   +D + G W A  CGL  ++D +
Sbjct: 728 LQAPNPVDVMSFGGSYSDEALKLLKEEGPKYQYGTGCLSDGILGMWMASVCGLDEVLDNE 787

Query: 779 KARSALKKVYNYNV-------------LKVMGGKRGAVNGMLPD-GRVDMSSMQSREIWS 824
           K RS L  V+ YN+             +   G   G +    P  G + +  + S E+W+
Sbjct: 788 KVRSHLVAVHKYNLKHDLVDHFNPQRPVYACGKDGGLLLCTWPKGGMLSLPFVYSNEVWT 847

Query: 825 GVTYAVAASMIHEDLADIGF 844
           G+ Y VA+ ++ +   + G 
Sbjct: 848 GIEYQVASHLMMKGEVEKGL 867


>gi|384263211|ref|YP_005418399.1| amylo-alpha-16-glucosidase [Rhodospirillum photometricum DSM 122]
 gi|378404313|emb|CCG09429.1| Amylo-alpha-16-glucosidase [Rhodospirillum photometricum DSM 122]
          Length = 728

 Score = 48.9 bits (115), Expect = 0.015,   Method: Compositional matrix adjust.
 Identities = 27/62 (43%), Positives = 36/62 (58%), Gaps = 6/62 (9%)

Query: 626 DLNPKFVLQVYRDVVATGDKKFAKAVWPSVYVAMAYMDQF-DRDGDGMIE-----NDGFP 679
           D  P FV      V  TGD  FA+A+WP+V  A+A++D   DRDGDG +E     +DG  
Sbjct: 397 DATPLFVALAGAYVERTGDGAFARALWPAVDAALAWIDHHGDRDGDGFVEYGRRCDDGLV 456

Query: 680 DQ 681
           +Q
Sbjct: 457 NQ 458


>gi|29348317|ref|NP_811820.1| hypothetical protein BT_2908 [Bacteroides thetaiotaomicron
           VPI-5482]
 gi|29340221|gb|AAO78014.1| conserved hypothetical protein [Bacteroides thetaiotaomicron
           VPI-5482]
          Length = 848

 Score = 48.1 bits (113), Expect = 0.023,   Method: Compositional matrix adjust.
 Identities = 57/270 (21%), Positives = 105/270 (38%), Gaps = 61/270 (22%)

Query: 632 VLQVYRDVVATGDKKFAKAVWPSVYVAMAY-MDQFDRDGDGMIENDGFPDQTYDTWSVSG 690
           +++VYR+   +G+ +F  +++P V  ++ Y +  +D    G IE         + W   G
Sbjct: 490 IMKVYREWRISGENEFLISMYPKVKKSLDYCISTWDPRRVGSIEEPHHNTYDIEFWGPDG 549

Query: 691 I--SAYSGGLWVAALQAASALAR--EVGDRGSEDYFLFKFQKAKVVYEKLWNGSYF---- 742
           +  S Y G L        SA  R  E  D+   +Y     +  K     L+NG YF    
Sbjct: 550 MHNSFYYGAL--------SAFIRMSEFLDKDVTEYKKLLKKGRKFTETGLFNGEYFIQKI 601

Query: 743 --------------NYDNSGSSQSSSIQ--------------ADQLAGQWYARACGLLPI 774
                         ++ +S S ++  I               +D + G W +R CG+   
Sbjct: 602 EWRGLNAKDPTVAQSFHSSYSPEAKEILEKEGPKYQYGNGCLSDGVLGSWLSRMCGMEET 661

Query: 775 VDEDKARSALKKVYNYNVLKVMGGKRGAVNGMLP--------------DGRVDMSSMQSR 820
           ++ +K +S L  V+ YN  K +                           G++ +  + S 
Sbjct: 662 LNTEKVKSHLLSVHRYNFKKDLTDHANPQRSPYALGKEGGLLLGSWPKGGKLSLPFVYSN 721

Query: 821 EIWSGVTYAVAASMIHEDLADIGFQT--AC 848
           E+W+G+ Y VA+ ++ +   + G +   AC
Sbjct: 722 EVWTGIEYQVASHLMLQGEVEKGLEIVRAC 751


>gi|255013072|ref|ZP_05285198.1| hypothetical protein B2_04148 [Bacteroides sp. 2_1_7]
 gi|410102535|ref|ZP_11297461.1| hypothetical protein HMPREF0999_01233 [Parabacteroides sp. D25]
 gi|409238607|gb|EKN31398.1| hypothetical protein HMPREF0999_01233 [Parabacteroides sp. D25]
          Length = 620

 Score = 47.8 bits (112), Expect = 0.027,   Method: Compositional matrix adjust.
 Identities = 38/145 (26%), Positives = 64/145 (44%), Gaps = 8/145 (5%)

Query: 626 DLNPKFVLQVYRDVVATGDKKFAKAVWPSVYVAMAYMDQFDRDGDGMIENDGFPDQTYDT 685
           D+N  F+L++ R      +K+     WP++  AM ++   D D  G+     F     D 
Sbjct: 346 DINAFFILRIARFYRYYHNKQDLINYWPAMKNAMNWLISRDTDKIGLPAQVSFWGDWKDV 405

Query: 686 WSVSG--ISAYSGGLWVAALQAASALAREVGDRGSEDYFLFKFQKAKVVYEK------LW 737
             + G   S ++  ++++AL+    LA E GD+   +Y+   +QK      K      LW
Sbjct: 406 KGIEGRKYSPFTCLIYLSALKEMVYLAEECGDQAGANYYQSAYQKGYKTMNKDVREGGLW 465

Query: 738 NGSYFNYDNSGSSQSSSIQADQLAG 762
           NG+Y+       S +  I  DQ  G
Sbjct: 466 NGNYYCQIWKDGSVNDRILQDQTIG 490


>gi|428168440|gb|EKX37385.1| hypothetical protein GUITHDRAFT_145081 [Guillardia theta CCMP2712]
          Length = 887

 Score = 47.0 bits (110), Expect = 0.047,   Method: Compositional matrix adjust.
 Identities = 53/226 (23%), Positives = 86/226 (38%), Gaps = 62/226 (27%)

Query: 548 VHFYSSFALIMLFPKIQLSIQRDFAAAVMMHDPSKMKLLDDGQWVSRKVLGAVPHDIGIC 607
           +H Y S  L +LFP++                              R +L   P D    
Sbjct: 528 LHLYRSIPLALLFPQL-----------------------------VRNILDTGPID---- 554

Query: 608 DPWFEVNAYCLYDTARWKDLNPKFVLQVYRDVVATGDKKFAKAVWPSVYVAMAYMDQFDR 667
           +P F   +  + D+      NP FVL V    ++TGD++          V + +++    
Sbjct: 555 EPKFPCGSRMMGDS------NPGFVLSVLNLFMSTGDRR----------VLLDFLEPV-- 596

Query: 668 DGDGMIENDGFPDQTYDTWS-----VSGISAYSGGLWVAALQAASALAREVGDRGSEDYF 722
               M    G P    +T+      +  +SAY+  L++A L AA  +A  V D   + Y 
Sbjct: 597 ----MSSRFGLPRHLQNTYDWFHFHLKDVSAYNAVLYLAQLAAAERIAALVQDDSFQHYC 652

Query: 723 LFKFQKAK-VVYEKLWNGSYFN-YDNSGSSQSSSIQADQLAGQWYA 766
                KA+  V E LWNG +   + N       ++  + L GQ +A
Sbjct: 653 QLHLSKAQEKVKELLWNGRFLRAWWNEEGELVEALHTETLYGQLWA 698


>gi|427384889|ref|ZP_18881394.1| hypothetical protein HMPREF9447_02427 [Bacteroides oleiciplenus YIT
           12058]
 gi|425728150|gb|EKU91009.1| hypothetical protein HMPREF9447_02427 [Bacteroides oleiciplenus YIT
           12058]
          Length = 998

 Score = 47.0 bits (110), Expect = 0.052,   Method: Compositional matrix adjust.
 Identities = 50/254 (19%), Positives = 106/254 (41%), Gaps = 57/254 (22%)

Query: 632 VLQVYRDVVATGDKKFAKAVWPSVYVAMAY-MDQFDRDGDGMIENDGFPDQTYDTWSVSG 690
           +++VYR+   +G+ ++ K ++P+V  ++ Y +  +D    G +E         + W   G
Sbjct: 640 IMKVYREWRISGNTQWMKDLFPAVKKSLDYCIRTWDPLQKGYLEEPHHNTYDIEFWGPDG 699

Query: 691 ISAYSGGLWVAALQAASALAREVGDRGSEDYFLFKFQKAKVVYEKLWNGSYF-------- 742
           +       ++ AL A   +++ +  + ++ Y +   +  K +   L++G YF        
Sbjct: 700 MCT---SFYLGALTAFVEMSKAL-KQSTKKYDILLAKGKKYMENTLFDGEYFIQKIQWEG 755

Query: 743 ----------NYDNSGSSQS--------------SSIQADQLAGQWYARACGLLPIVDED 778
                     ++    S ++              S   +D + G W A  CGL  +++ +
Sbjct: 756 LQAPNPVDVISFGGGYSDEAMKLLKKEGPKYQYGSGCLSDGILGMWMASVCGLNEVLNNE 815

Query: 779 KARSALKKVYNYNVLKVM------------GGKRGAVNGML-----PDGRVDMSSMQSRE 821
           K +S L  V+ YN+   +             GK G   G+L       G + +  + S E
Sbjct: 816 KVKSHLVSVHKYNLKDNLVDHFNPQRPAYACGKEG---GLLLCTWPKGGMLSLPFVYSNE 872

Query: 822 IWSGVTYAVAASMI 835
           +W+G+ Y VA+ ++
Sbjct: 873 VWTGIEYQVASHLM 886


>gi|383124512|ref|ZP_09945175.1| hypothetical protein BSIG_1742 [Bacteroides sp. 1_1_6]
 gi|251841336|gb|EES69417.1| hypothetical protein BSIG_1742 [Bacteroides sp. 1_1_6]
          Length = 848

 Score = 46.6 bits (109), Expect = 0.067,   Method: Compositional matrix adjust.
 Identities = 56/270 (20%), Positives = 105/270 (38%), Gaps = 61/270 (22%)

Query: 632 VLQVYRDVVATGDKKFAKAVWPSVYVAMAY-MDQFDRDGDGMIENDGFPDQTYDTWSVSG 690
           +++VYR+   +G+ +F  +++P V  ++ Y +  +D    G IE         + W   G
Sbjct: 490 IMKVYREWRISGENEFLISMYPKVKKSLDYCISTWDPRRVGSIEEPHHNTYDIEFWGPDG 549

Query: 691 I--SAYSGGLWVAALQAASALAR--EVGDRGSEDYFLFKFQKAKVVYEKLWNGSYF---- 742
           +  S Y G L        SA  R  E  D+   +Y     +  K     L+NG YF    
Sbjct: 550 MHNSFYYGAL--------SAFIRMSEFLDKDVTEYKKLLKKGRKFTETGLFNGEYFIQKI 601

Query: 743 --------------NYDNSGSSQSSSIQ--------------ADQLAGQWYARACGLLPI 774
                         ++ +S S ++  I               +D + G W +R CG+   
Sbjct: 602 EWRGLNAKDPTVAQSFHSSYSPEAKEILEKEGPKYQYGNGCLSDGVLGSWLSRMCGMEET 661

Query: 775 VDEDKARSALKKVYNYNVLKVMGGKRGAVNGMLP--------------DGRVDMSSMQSR 820
           ++ +K +S L  V+ YN  K +                           G++ +  + S 
Sbjct: 662 LNTEKVKSHLLSVHRYNFKKDLTDHANPQRSPYALGKEGGLLLGSWPKGGKLSLPFVYSN 721

Query: 821 EIWSGVTYAVAASMIHEDLADIGFQT--AC 848
           ++W+G+ Y VA+ ++ +   + G +   AC
Sbjct: 722 KVWTGIEYQVASHLMLQGEVEKGLEIVRAC 751


>gi|220925399|ref|YP_002500701.1| amylo-alpha-16-glucosidase [Methylobacterium nodulans ORS 2060]
 gi|219950006|gb|ACL60398.1| Amylo-alpha-16-glucosidase [Methylobacterium nodulans ORS 2060]
          Length = 765

 Score = 46.6 bits (109), Expect = 0.072,   Method: Compositional matrix adjust.
 Identities = 35/130 (26%), Positives = 59/130 (45%), Gaps = 18/130 (13%)

Query: 626 DLNPKFVLQVYRDVVATGDKKFAKAVWPSVYVAMAYMDQF-DRDGDGMIE---------- 674
           D  P FV+   +    TGDK   +A+WP++  A+ ++D + DRDGDG +E          
Sbjct: 413 DATPLFVMLAGQYFEQTGDKDTIQAIWPNILAALEWIDTYGDRDGDGFVEYARETDKGLA 472

Query: 675 NDGFPDQTYDTWSVSGISAYS-------GGLWVAALQAASALAREVGDRGSEDYFLFKFQ 727
           N G+ D     +   G  A          G   AA +AAS LA  +G+    +      +
Sbjct: 473 NQGWKDSHDSIFHADGRMAQGPIALCEVQGYVYAAKRAASKLAATLGNAALAERLKDAAE 532

Query: 728 KAKVVYEKLW 737
           + +V +++ +
Sbjct: 533 RLRVAFDEAF 542


>gi|374372806|ref|ZP_09630467.1| hypothetical protein NiasoDRAFT_3452 [Niabella soli DSM 19437]
 gi|373234882|gb|EHP54674.1| hypothetical protein NiasoDRAFT_3452 [Niabella soli DSM 19437]
          Length = 872

 Score = 46.2 bits (108), Expect = 0.076,   Method: Compositional matrix adjust.
 Identities = 57/252 (22%), Positives = 104/252 (41%), Gaps = 53/252 (21%)

Query: 632 VLQVYRDVVATGDKKFAKAVWPSVYVAMAY-MDQFDRDGDGMIENDGFPDQTYDTWSVSG 690
           +++V+R+   +G+  + K ++P V  ++ Y ++ +D    G++E         + W   G
Sbjct: 517 IMKVHREWRISGNHTWLKKIYPLVKRSLDYCIETWDPGKKGVLEEPHHNTYDIEFW---G 573

Query: 691 ISAYSGGLWVAALQAASALAREVGDRGSEDYFLFKFQKAKVVYE-KLWNGSYF------- 742
                   ++ ALQ+  A+ + + +  +    L K  K K+  E +L+NG +F       
Sbjct: 574 PDPMCTTFYLGALQSIIAMGQFLKEDITGYSSLLK--KGKMFLETRLFNGEFFFQQIQWE 631

Query: 743 --NYDNSGSSQS-----------------------SSIQADQLAGQWYARACGLLPIVDE 777
                N   +QS                           +D + G W A  CGL   VD+
Sbjct: 632 GLKAQNPARAQSFGGNYSQEAIEILKKEGPKYQYGKGCLSDGVLGSWIAAMCGLEDPVDK 691

Query: 778 DKARSALKKVYNYNV---LK----------VMGGKRGAVNGMLPD-GRVDMSSMQSREIW 823
            K    L  V+ YN+   LK           MG + G +    P  G++ +  + S E+W
Sbjct: 692 RKITGHLLSVHKYNLKATLKDHVNPQRPAYAMGHEGGLLLCTWPKGGKLSLPFVYSNEVW 751

Query: 824 SGVTYAVAASMI 835
           +G+ Y VAA ++
Sbjct: 752 TGIEYQVAAHLM 763


>gi|154246650|ref|YP_001417608.1| amylo-alpha-16-glucosidase [Xanthobacter autotrophicus Py2]
 gi|154160735|gb|ABS67951.1| Amylo-alpha-16-glucosidase [Xanthobacter autotrophicus Py2]
          Length = 733

 Score = 46.2 bits (108), Expect = 0.081,   Method: Compositional matrix adjust.
 Identities = 39/140 (27%), Positives = 67/140 (47%), Gaps = 30/140 (21%)

Query: 626 DLNPKFVLQVYRDVVATGDKKFAKAVWPSVYVAMAYMDQF-DRDGDGMIE---------- 674
           D  P FV+     V  TGD   A+ +WP+V  A+ ++  + DRDGDG +E          
Sbjct: 381 DSTPLFVMLAGAYVERTGDLDTARLLWPAVAAALGWITDYGDRDGDGFVEYGRRSGDGLV 440

Query: 675 NDGFPDQTYDTWSVSG-----------ISAYSGGLWVAALQAASALAREVGDRGSEDYFL 723
           N G+ D     +   G           + AY  G W    QAA+++AR +G   + D  +
Sbjct: 441 NQGWKDSHDSVFHADGRLARGPIALVEVQAYVFGAW----QAAASIARRLGQ--APDAMV 494

Query: 724 FKFQKAKVVYEKLWNGSYFN 743
           F+ +KA ++ +  ++ ++F+
Sbjct: 495 FE-RKADLLRDA-FDAAFFD 512


>gi|393766406|ref|ZP_10354962.1| amylo-alpha-16-glucosidase [Methylobacterium sp. GXF4]
 gi|392728187|gb|EIZ85496.1| amylo-alpha-16-glucosidase [Methylobacterium sp. GXF4]
          Length = 780

 Score = 46.2 bits (108), Expect = 0.096,   Method: Compositional matrix adjust.
 Identities = 21/50 (42%), Positives = 33/50 (66%), Gaps = 1/50 (2%)

Query: 626 DLNPKFVLQVYRDVVATGDKKFAKAVWPSVYVAMAYMDQF-DRDGDGMIE 674
           D  P FV+  +     TGD +  +A+WPS+ +A+ +MD++ DRDGDG +E
Sbjct: 428 DGTPLFVMLAWEYYAVTGDLETIRAIWPSLELALEWMDRYGDRDGDGFVE 477


>gi|170749555|ref|YP_001755815.1| amylo-alpha-16-glucosidase [Methylobacterium radiotolerans JCM
           2831]
 gi|170656077|gb|ACB25132.1| Amylo-alpha-16-glucosidase [Methylobacterium radiotolerans JCM
           2831]
          Length = 781

 Score = 45.4 bits (106), Expect = 0.16,   Method: Compositional matrix adjust.
 Identities = 20/50 (40%), Positives = 33/50 (66%), Gaps = 1/50 (2%)

Query: 626 DLNPKFVLQVYRDVVATGDKKFAKAVWPSVYVAMAYMDQF-DRDGDGMIE 674
           D  P FV+  +     TGD +  +A+WP++ +A+ +MD++ DRDGDG +E
Sbjct: 429 DGTPLFVMLAWEYYAVTGDLETVRAIWPALELALEWMDRYGDRDGDGFVE 478


>gi|431796995|ref|YP_007223899.1| glycogen debranching protein [Echinicola vietnamensis DSM 17526]
 gi|430787760|gb|AGA77889.1| glycogen debranching enzyme [Echinicola vietnamensis DSM 17526]
          Length = 1213

 Score = 44.7 bits (104), Expect = 0.24,   Method: Compositional matrix adjust.
 Identities = 43/141 (30%), Positives = 65/141 (46%), Gaps = 16/141 (11%)

Query: 626 DLNPKFVLQVYRDVVATGDKKFAKAVWPSVYVAMAYMDQ-FDRDGDGMIENDGFPDQTYD 684
           D+N  F  Q+ R    TGDK + +AVWPS+   +A+  + FD DGDG+       +    
Sbjct: 419 DMNQVFFDQLLRHFDWTGDKAYLQAVWPSIERHLAWEKRNFDPDGDGLY------NAYAS 472

Query: 685 TWSVSGISAYSGGLWVA------ALQAASALAREVGDRGSEDYFLFKFQKAKVVYEKLW- 737
            W+   +   SGG+  A      A + A+ LA  +G+ G + Y        K + E LW 
Sbjct: 473 IWASDALQYNSGGVAHASAYNYFANKKAAELAAFIGEDGKQ-YAAEAEAILKAMNETLWL 531

Query: 738 -NGSYFNYDNSGSSQSSSIQA 757
            +G Y  Y +    Q+   QA
Sbjct: 532 PSGWYAEYKDFLGPQNLHPQA 552


>gi|217978466|ref|YP_002362613.1| amylo-alpha-16-glucosidase [Methylocella silvestris BL2]
 gi|217503842|gb|ACK51251.1| Amylo-alpha-16-glucosidase [Methylocella silvestris BL2]
          Length = 729

 Score = 44.3 bits (103), Expect = 0.38,   Method: Compositional matrix adjust.
 Identities = 34/108 (31%), Positives = 47/108 (43%), Gaps = 18/108 (16%)

Query: 626 DLNPKFVLQVYRDVVATGDKKFAKAVWPSVYVAMAYMDQF-DRDGDGMIE---------- 674
           D  P FV+        TGD +    +WP++  A+ ++D+F DRDGDG +E          
Sbjct: 377 DATPLFVMLAGMYFERTGDSETIAKIWPNILAALRWIDEFGDRDGDGFVEYLRETENGLA 436

Query: 675 NDGFPDQTYDTWSVSGISAYS-------GGLWVAALQAASALAREVGD 715
           N G+ D     +   G  A          G   AA  AA  LAR +GD
Sbjct: 437 NQGWKDSHDSIFHADGSLAEGPIALCEVQGYVYAAKSAAGRLARLMGD 484


>gi|170060875|ref|XP_001865995.1| conserved hypothetical protein [Culex quinquefasciatus]
 gi|167879232|gb|EDS42615.1| conserved hypothetical protein [Culex quinquefasciatus]
          Length = 183

 Score = 44.3 bits (103), Expect = 0.38,   Method: Compositional matrix adjust.
 Identities = 33/110 (30%), Positives = 45/110 (40%), Gaps = 31/110 (28%)

Query: 604 IGICDPWFE--VNAYCLYDTARWKDLNPKFVLQVYRDVVATGDKKFAKAVWPSVYV---- 657
           + I + W    V AY ++D   WKDLN KF+LQVYRD       ++A       ++    
Sbjct: 43  LKIVEVWLRGTVRAYPIHDVYEWKDLNSKFILQVYRDYYTQLKAEYAGKFSSIEFINKES 102

Query: 658 ------------------------AMAYMDQFDRDGDGMIENDGFPDQTY 683
                                   A  Y  ++D D DG+IEN   PD TY
Sbjct: 103 FRNRMSRTERDVLLEENVSGLQAGAGEYPLEWDMDNDGLIENKP-PDLTY 151


>gi|384915661|ref|ZP_10015873.1| Glycogen debranching enzyme [Methylacidiphilum fumariolicum SolV]
 gi|384526964|emb|CCG91744.1| Glycogen debranching enzyme [Methylacidiphilum fumariolicum SolV]
          Length = 733

 Score = 43.9 bits (102), Expect = 0.43,   Method: Compositional matrix adjust.
 Identities = 67/310 (21%), Positives = 120/310 (38%), Gaps = 56/310 (18%)

Query: 565 LSIQRDFAAAVMMHDPSKMKLLDDGQWVSRKVLGAVPHDIGICDPWFEVNAYCLYDTARW 624
             I+ D     ++H+  + ++ + G+   RK  G++             +A CL      
Sbjct: 345 FDIEADAEPGKILHETREGEMANLGEIPFRKYYGSI-------------DATCL------ 385

Query: 625 KDLNPKFVLQVYRDVVATGDKKFAKAVWPSVYVAMAYMDQF-DRDGDGMIE--------- 674
                 F+L        T D+KF + +WPS+ +A+ ++D++ D DGDG +E         
Sbjct: 386 ------FLLLAAEYYQRTADRKFIEEIWPSIELALLWIDRYGDVDGDGFVEYRRRSSRGL 439

Query: 675 -NDGFPD------QTYDTWSVSGISAYS-GGLWVAALQAASALAREVGDRGSEDYFLFKF 726
            N G+ D      Q   ++ V  I+     G   AA +  + LA  +G     D  L + 
Sbjct: 440 TNQGWKDSFDSVFQKDGSFPVGPIALCEVQGFVFAAKKKIAYLASVLGQISLHDRLLKEA 499

Query: 727 QKAKVVYEK-LWNGSYFNYDNS--GSSQSSSIQADQLAGQWYARACGLLPIVDEDKARSA 783
           +  K  +EK  W     +Y  +  G  +   + A        A  C    I  +DKA   
Sbjct: 500 EILKEKFEKEFWVEELSSYALALDGRKEKCKVLASN------AGLCLYTGIAKQDKAELI 553

Query: 784 LKKVYNYNVLKVMGGKRGAVNGMLPDGRVDMSSMQSREIWSGVTYAVAASMIHEDLADIG 843
            K + +       G +  A   +    R +  S  +  +W   T  +AA M+   + ++ 
Sbjct: 554 GKLLMSDKFFTGWGIRTVASTEI----RYNPISYHNGSVWPFDTAFIAAGMMSYGMKEMA 609

Query: 844 FQTACGIYEA 853
                G++EA
Sbjct: 610 INLFKGLFEA 619


>gi|170743420|ref|YP_001772075.1| amylo-alpha-16-glucosidase [Methylobacterium sp. 4-46]
 gi|168197694|gb|ACA19641.1| Amylo-alpha-16-glucosidase [Methylobacterium sp. 4-46]
          Length = 764

 Score = 43.5 bits (101), Expect = 0.61,   Method: Compositional matrix adjust.
 Identities = 33/107 (30%), Positives = 49/107 (45%), Gaps = 18/107 (16%)

Query: 626 DLNPKFVLQVYRDVVATGDKKFAKAVWPSVYVAMAYMDQF-DRDGDGMIE---------- 674
           D  P FV+   +    TGD+   +A+WP++  A+ ++D + DRDGDG +E          
Sbjct: 412 DATPLFVMLAGQYFEQTGDRATVQAIWPNILAALEWIDTYGDRDGDGFVEYARETDKGLA 471

Query: 675 NDGFPDQTYDTWSVSGISAYS-------GGLWVAALQAASALAREVG 714
           N G+ D     +   G  A          G   AA +AAS LA  +G
Sbjct: 472 NQGWKDSHDSIFHADGRMAQGPIALCEVQGYVYAAKRAASKLAGVLG 518


>gi|86604990|ref|YP_473753.1| glycogen debranching enzyme family protein [Synechococcus sp.
           JA-3-3Ab]
 gi|86553532|gb|ABC98490.1| glycogen debranching enzyme family protein [Synechococcus sp.
           JA-3-3Ab]
          Length = 721

 Score = 42.7 bits (99), Expect = 0.93,   Method: Compositional matrix adjust.
 Identities = 20/53 (37%), Positives = 32/53 (60%), Gaps = 1/53 (1%)

Query: 626 DLNPKFVLQVYRDVVATGDKKFAKAVWPSVYVAMAYMDQF-DRDGDGMIENDG 677
           D  P FV+   R V  +GD +  + +WP++  A+++MD + D DGDG +E  G
Sbjct: 370 DATPLFVVLAGRYVRRSGDLELLRQLWPNIEAALSWMDTYGDPDGDGFLEYAG 422


>gi|383768111|ref|YP_005447094.1| hypothetical protein PSMK_30380 [Phycisphaera mikurensis NBRC
           102666]
 gi|381388381|dbj|BAM05197.1| hypothetical protein PSMK_30380 [Phycisphaera mikurensis NBRC
           102666]
          Length = 823

 Score = 42.4 bits (98), Expect = 1.4,   Method: Compositional matrix adjust.
 Identities = 53/211 (25%), Positives = 82/211 (38%), Gaps = 33/211 (15%)

Query: 541 LMWNTYDVHFYSSFALIMLFPKIQLSIQRDFAAAVMMHDPSKMKLLDDGQWVSRKVLGAV 600
           L+ +T D  + SS   + L+P +     R +A  +  H  S   +L              
Sbjct: 410 LLHSTVDAEYNSSLFALCLWPDLLALQLRQWAERLEPHAASGGAILQ------------- 456

Query: 601 PHDIGICDPWFEVNAYCLYDTARWKDLNPKFVLQVYRDVVATGDKKFAKAVWPSVYVAMA 660
            HD+G  +          Y  A  ++ N   +LQ Y     TGD+  A+A+  +     A
Sbjct: 457 -HDLGSGND--ASGQASRYRMAVEENSNFLLLLQTY--TRWTGDRSVARALAATASSLTA 511

Query: 661 YMDQFDRDGDGMIENDGFPDQTYDTWSVSGISA--------YSGGLWVAALQAASALARE 712
           Y+   DR      ENDGFPD   +   + G +A        Y     +AALQA++ L R 
Sbjct: 512 YLLWCDR------ENDGFPDDPGENTMIDGPAALRLARRPTYPAVKRLAALQASADLLRL 565

Query: 713 VG-DRGSEDYFLFKFQKAKVVYEKLWNGSYF 742
            G D  + D        A  +    W G ++
Sbjct: 566 AGQDDAARDTESIADADAAKIESAAWLGDHY 596


>gi|326797904|ref|YP_004315723.1| hypothetical protein [Sphingobacterium sp. 21]
 gi|326548668|gb|ADZ77053.1| hypothetical protein Sph21_0471 [Sphingobacterium sp. 21]
          Length = 1132

 Score = 42.4 bits (98), Expect = 1.4,   Method: Compositional matrix adjust.
 Identities = 39/123 (31%), Positives = 58/123 (47%), Gaps = 22/123 (17%)

Query: 626 DLNPKFVLQVYRDVVATGDKKFAKAVWPSVYVAMAYMDQ-FDRDGDGMIE-------NDG 677
           D+N  F  Q+Y     TGD  F KA WP++   +A+  + FD+DGDG+ +       +D 
Sbjct: 416 DMNLVFFDQLYNHFQYTGDTAFIKASWPAIKRHLAWEKRNFDQDGDGLYDAYCAIWASDA 475

Query: 678 FPDQTYDTWSVSGISAYSGGLWVAALQAASALAREVGDRGSEDYFLFKFQKAKV---VYE 734
                Y    V+  SAY+        + A+ LA  VG    ED   +K + AK+   V +
Sbjct: 476 L---QYSGGGVTHTSAYN----YRGFEMAAKLATIVG----EDPRPYKQEAAKIKEAVNK 524

Query: 735 KLW 737
            LW
Sbjct: 525 HLW 527


>gi|16263864|ref|NP_436656.1| hypothetical protein SM_b20116 [Sinorhizobium meliloti 1021]
 gi|334319999|ref|YP_004556628.1| amylo-alpha-16-glucosidase [Sinorhizobium meliloti AK83]
 gi|384539727|ref|YP_005723811.1| hypothetical protein SM11_pD1478 [Sinorhizobium meliloti SM11]
 gi|407724161|ref|YP_006843822.1| hypothetical protein BN406_06540 [Sinorhizobium meliloti Rm41]
 gi|15139988|emb|CAC48516.1| CONSERVED HYPOTHETICAL PROTEIN [Sinorhizobium meliloti 1021]
 gi|334097738|gb|AEG55748.1| Amylo-alpha-16-glucosidase [Sinorhizobium meliloti AK83]
 gi|336038380|gb|AEH84310.1| conserved hypothetical protein [Sinorhizobium meliloti SM11]
 gi|407324221|emb|CCM72822.1| hypothetical protein BN406_06540 [Sinorhizobium meliloti Rm41]
          Length = 734

 Score = 42.0 bits (97), Expect = 1.5,   Method: Compositional matrix adjust.
 Identities = 33/107 (30%), Positives = 55/107 (51%), Gaps = 18/107 (16%)

Query: 626 DLNPKFVLQVYRDVVATGDKKFAKAVWPSVYVAMAYMDQF-DRDGDGMIE-----NDGFP 679
           D  P F++     +  TGD    + +WP+V  A+ ++D+F DRDGDG +E       G  
Sbjct: 382 DSTPLFIMLAGAYLDRTGDLDTVRNLWPNVVAALDWIDRFGDRDGDGFVEYGSRTAKGLV 441

Query: 680 DQ----TYDTWSVSGISAYSGGLWVA--------ALQAASALAREVG 714
           +Q    ++D+   +G +  SG +  A        A QAA+ L+R++G
Sbjct: 442 NQCWKDSHDSIFHAGGNLASGPIATAEVQAYVFGAWQAAARLSRKLG 488


>gi|384533984|ref|YP_005716648.1| amylo-alpha-16-glucosidase [Sinorhizobium meliloti BL225C]
 gi|333816160|gb|AEG08827.1| Amylo-alpha-16-glucosidase [Sinorhizobium meliloti BL225C]
          Length = 734

 Score = 42.0 bits (97), Expect = 1.5,   Method: Compositional matrix adjust.
 Identities = 33/107 (30%), Positives = 55/107 (51%), Gaps = 18/107 (16%)

Query: 626 DLNPKFVLQVYRDVVATGDKKFAKAVWPSVYVAMAYMDQF-DRDGDGMIE-----NDGFP 679
           D  P F++     +  TGD    + +WP+V  A+ ++D+F DRDGDG +E       G  
Sbjct: 382 DSTPLFIMLAGAYLDRTGDLDTVRNLWPNVVAALDWIDRFGDRDGDGFVEYGSRTAKGLV 441

Query: 680 DQ----TYDTWSVSGISAYSGGLWVA--------ALQAASALAREVG 714
           +Q    ++D+   +G +  SG +  A        A QAA+ L+R++G
Sbjct: 442 NQCWKDSHDSIFHAGGNLASGPIATAEVQAYVFGAWQAAARLSRKLG 488


>gi|418404986|ref|ZP_12978419.1| amylo-alpha-16-glucosidase [Sinorhizobium meliloti CCNWSX0020]
 gi|359501068|gb|EHK73697.1| amylo-alpha-16-glucosidase [Sinorhizobium meliloti CCNWSX0020]
          Length = 734

 Score = 42.0 bits (97), Expect = 1.5,   Method: Compositional matrix adjust.
 Identities = 33/107 (30%), Positives = 55/107 (51%), Gaps = 18/107 (16%)

Query: 626 DLNPKFVLQVYRDVVATGDKKFAKAVWPSVYVAMAYMDQF-DRDGDGMIE-----NDGFP 679
           D  P F++     +  TGD    + +WP+V  A+ ++D+F DRDGDG +E       G  
Sbjct: 382 DSTPLFIMLAGAYLDRTGDLDTVRNLWPNVVAALDWIDRFGDRDGDGFVEYGSRTAKGLV 441

Query: 680 DQ----TYDTWSVSGISAYSGGLWVA--------ALQAASALAREVG 714
           +Q    ++D+   +G +  SG +  A        A QAA+ L+R++G
Sbjct: 442 NQCWKDSHDSIFHAGGNLASGPIATAEVQAYVFGAWQAAARLSRKLG 488


>gi|300770983|ref|ZP_07080860.1| conserved hypothetical protein [Sphingobacterium spiritivorum ATCC
           33861]
 gi|300762256|gb|EFK59075.1| conserved hypothetical protein [Sphingobacterium spiritivorum ATCC
           33861]
          Length = 1205

 Score = 42.0 bits (97), Expect = 1.5,   Method: Compositional matrix adjust.
 Identities = 26/73 (35%), Positives = 38/73 (52%), Gaps = 8/73 (10%)

Query: 626 DLNPKFVLQVYRDVVATGDKKFAKAVWPSVYVAMAYMDQ-FDRDGDGMIENDGFPDQTYD 684
           D+N  F  Q++     TGD+ F K VWP++   +A+  + FD DGDG+       D    
Sbjct: 418 DMNLVFFDQLFTHFQYTGDRTFLKEVWPAIKRHLAWEKRNFDMDGDGLY------DAYCA 471

Query: 685 TWSVSGISAYSGG 697
            W+  G+  YSGG
Sbjct: 472 IWASDGLQ-YSGG 483


>gi|227537591|ref|ZP_03967640.1| conserved hypothetical protein [Sphingobacterium spiritivorum ATCC
           33300]
 gi|227242593|gb|EEI92608.1| conserved hypothetical protein [Sphingobacterium spiritivorum ATCC
           33300]
          Length = 1205

 Score = 42.0 bits (97), Expect = 1.6,   Method: Compositional matrix adjust.
 Identities = 26/73 (35%), Positives = 38/73 (52%), Gaps = 8/73 (10%)

Query: 626 DLNPKFVLQVYRDVVATGDKKFAKAVWPSVYVAMAYMDQ-FDRDGDGMIENDGFPDQTYD 684
           D+N  F  Q++     TGD+ F K VWP++   +A+  + FD DGDG+       D    
Sbjct: 418 DMNLVFFDQLFTHFQYTGDRTFLKEVWPAIKRHLAWEKRNFDMDGDGLY------DAYCA 471

Query: 685 TWSVSGISAYSGG 697
            W+  G+  YSGG
Sbjct: 472 IWASDGLQ-YSGG 483


>gi|433611718|ref|YP_007195179.1| Glycogen debranching enzyme [Sinorhizobium meliloti GR4]
 gi|429556660|gb|AGA11580.1| Glycogen debranching enzyme [Sinorhizobium meliloti GR4]
          Length = 734

 Score = 42.0 bits (97), Expect = 1.7,   Method: Compositional matrix adjust.
 Identities = 33/107 (30%), Positives = 54/107 (50%), Gaps = 18/107 (16%)

Query: 626 DLNPKFVLQVYRDVVATGDKKFAKAVWPSVYVAMAYMDQF-DRDGDGMIE-----NDGFP 679
           D  P F++     +  TGD    + +WP+V  A+ ++D+F DRDGDG +E       G  
Sbjct: 382 DSTPLFIMLAGAYLDRTGDLDTVRNLWPNVVAALDWIDRFGDRDGDGFVEYGSRTAKGLV 441

Query: 680 DQ----TYDTWSVSGISAYSGGLWVA--------ALQAASALAREVG 714
           +Q    ++D+   +G +  SG +  A        A QAA+ L+R +G
Sbjct: 442 NQCWKDSHDSIFHAGGNLASGPIATAEVQAYVFGAWQAAARLSRRLG 488


>gi|429221453|ref|YP_007173779.1| glycogen debranching protein [Deinococcus peraridilitoris DSM
           19664]
 gi|429132316|gb|AFZ69330.1| glycogen debranching enzyme [Deinococcus peraridilitoris DSM 19664]
          Length = 430

 Score = 42.0 bits (97), Expect = 1.7,   Method: Compositional matrix adjust.
 Identities = 31/121 (25%), Positives = 54/121 (44%), Gaps = 5/121 (4%)

Query: 626 DLNPKFVLQVYRDVVATGDKKFAKAVWPSVYVAMAYMDQFDRDGDGMIENDGFPDQTYDT 685
           D N  FVL  Y     T D  F +A WP +  A+ ++   D +  G++E     D   D 
Sbjct: 99  DANMWFVLGHYACHRRTNDTAFLRAAWPRIQAALLWLRYQDMNDCGLLEAPEAADWA-DL 157

Query: 686 WSVSGISAYSGGLWVAALQAASALAREVGDRGSEDYFLFKFQKAKVV----YEKLWNGSY 741
           ++    + Y+  L+V  L+AA  L+  +G+ G+         + K+      ++ W+G  
Sbjct: 158 YATRYNTLYANALYVGTLEAAGYLSAALGEDGAAHRVRANDVRRKINLLLWLDRPWDGQR 217

Query: 742 F 742
           F
Sbjct: 218 F 218


>gi|398838157|ref|ZP_10595439.1| glycogen debranching enzyme [Pseudomonas sp. GM102]
 gi|398116719|gb|EJM06477.1| glycogen debranching enzyme [Pseudomonas sp. GM102]
          Length = 1066

 Score = 42.0 bits (97), Expect = 1.8,   Method: Compositional matrix adjust.
 Identities = 31/127 (24%), Positives = 55/127 (43%), Gaps = 26/127 (20%)

Query: 626 DLNPKFVLQVYRDVVATGDKKFAKAVWPSVYVAMAYMDQF-DRDGDGMIE---------- 674
           D  P FV+     +  T D    + +WP++  A+++++++ DRDGDG +E          
Sbjct: 714 DSTPLFVMLAAAYLERTNDLGTLRQLWPNIEAALSWIEEYGDRDGDGFVEYGRQSAEGLI 773

Query: 675 NDGFPDQTYDTWSVSG-----------ISAYSGGLWVAALQAASALAREVGDRGSEDYFL 723
           N G+ D     +  +G           + AY  G W      A+ +AR +GD        
Sbjct: 774 NQGWKDSHDSVFHANGQLAVGPIAIVEVQAYVYGAWT----GAAKIARRLGDPDRAQRLK 829

Query: 724 FKFQKAK 730
           +K Q+ +
Sbjct: 830 YKAQRLR 836


>gi|268317317|ref|YP_003291036.1| amylo-alpha-16-glucosidase [Rhodothermus marinus DSM 4252]
 gi|262334851|gb|ACY48648.1| Amylo-alpha-16-glucosidase [Rhodothermus marinus DSM 4252]
          Length = 868

 Score = 42.0 bits (97), Expect = 1.8,   Method: Compositional matrix adjust.
 Identities = 34/128 (26%), Positives = 55/128 (42%), Gaps = 19/128 (14%)

Query: 598 GAVPHDI----GICDPWFEVNAYCLYDTARWKDLNPKFVLQVYRDVVATGDKKFAKAVWP 653
           G +PH+I     + D WFE   Y         D  P F++       A+GD  + +  W 
Sbjct: 363 GKIPHEISQSAALID-WFEDYPYPWASA----DATPLFIIAHADYWQASGDLDYIREHWD 417

Query: 654 SVYVAMAYMDQFDRDGDGMIENDGFPDQTYDTWSVSGI------SAYSGGLWVAALQAAS 707
           ++  A  +    D DG+ ++EN G        W   G+        Y  G+W+AAL+   
Sbjct: 418 ALVRAYRFTAGTDTDGNDLVENTG----VGHGWVEGGLLYPPHEELYQQGVWLAALEGME 473

Query: 708 ALAREVGD 715
           A+A  +G+
Sbjct: 474 AMATALGE 481


>gi|427409761|ref|ZP_18899963.1| hypothetical protein HMPREF9718_02437 [Sphingobium yanoikuyae ATCC
           51230]
 gi|425711894|gb|EKU74909.1| hypothetical protein HMPREF9718_02437 [Sphingobium yanoikuyae ATCC
           51230]
          Length = 739

 Score = 41.6 bits (96), Expect = 2.0,   Method: Compositional matrix adjust.
 Identities = 36/140 (25%), Positives = 61/140 (43%), Gaps = 30/140 (21%)

Query: 626 DLNPKFVLQVYRDVVATGDKKFAKAVWPSVYVAMAYMDQF-DRDGDGMIE---------- 674
           D  P FV+     +  TGD ++   + P++   + ++D   DRDGDG +E          
Sbjct: 387 DSTPLFVMLAGAYLERTGDLEYLAGLLPNIEAGLRWIDTHGDRDGDGFVEYGRLTDQGLQ 446

Query: 675 NDGFPDQTYDTWSVSG-----------ISAYSGGLWVAALQAASALAREVGDRGSEDYFL 723
           N G+ D     + V G           + AY  G W    QAAS + R +GD    + +L
Sbjct: 447 NQGWKDSHDSIFHVDGSIARGPIALAEVQAYVYGAW----QAASLIYRAMGDLDRSETYL 502

Query: 724 FKFQKAKVVYEKLWNGSYFN 743
            +     V  ++ ++ S+F+
Sbjct: 503 AR----AVDLQQRFDASFFD 518


>gi|398384505|ref|ZP_10542535.1| glycogen debranching enzyme [Sphingobium sp. AP49]
 gi|397722664|gb|EJK83200.1| glycogen debranching enzyme [Sphingobium sp. AP49]
          Length = 738

 Score = 41.6 bits (96), Expect = 2.0,   Method: Compositional matrix adjust.
 Identities = 38/129 (29%), Positives = 52/129 (40%), Gaps = 25/129 (19%)

Query: 626 DLNPKFVLQVYRDVVATGDKKFAKAVWPSVYVAMAYMDQF-DRDGDGMIE---------- 674
           D  P FV+     +  TGD  F  A+ P V  A+A++D   DRDGDG +E          
Sbjct: 386 DSTPLFVMLAGAYLERTGDAAFLAAILPHVEAALAWIDAHGDRDGDGFVEYGRLTDQGLQ 445

Query: 675 NDGFPDQTYDTWSVSG-----------ISAYSGGLWVAALQAASALARE---VGDRGSED 720
           N G+ D     +   G           + AY  G W AA    +A+ +E    G R   D
Sbjct: 446 NQGWKDSHDSIFHADGSIARGPIALAEVQAYVYGAWQAAGAIFAAVGQEDKATGYRARAD 505

Query: 721 YFLFKFQKA 729
               +F  A
Sbjct: 506 DLRTRFDAA 514


>gi|159040682|ref|YP_001539934.1| glycoside hydrolase 15-like protein [Caldivirga maquilingensis
           IC-167]
 gi|157919517|gb|ABW00944.1| glycoside hydrolase 15-related [Caldivirga maquilingensis IC-167]
          Length = 604

 Score = 41.6 bits (96), Expect = 2.0,   Method: Compositional matrix adjust.
 Identities = 34/119 (28%), Positives = 49/119 (41%), Gaps = 9/119 (7%)

Query: 626 DLNPKFVLQVYRDVVATGDKKFAKAVWPSVYVAMAYMDQFDRDGDGMIENDGFPDQTYDT 685
           D     VL  Y     T D++F K  W  V  A  Y+          + +DG    T   
Sbjct: 346 DETASVVLATYVHFKLTLDRQFLKNAWVMVRKAAEYLA-------ANVSDDGLTTPTVGP 398

Query: 686 WSVS-GISAYSGGLWVAALQAASALAREVGDRG-SEDYFLFKFQKAKVVYEKLWNGSYF 742
           W    G+  Y+     AAL +AS LA E+GDR  + ++  +          + WNGS+F
Sbjct: 399 WEEHLGVHTYTNASVYAALSSASYLAGEIGDRNRAVEWGRYASVIRSTTLNQAWNGSFF 457


>gi|334345035|ref|YP_004553587.1| amylo-alpha-16-glucosidase [Sphingobium chlorophenolicum L-1]
 gi|334101657|gb|AEG49081.1| Amylo-alpha-16-glucosidase [Sphingobium chlorophenolicum L-1]
          Length = 744

 Score = 41.6 bits (96), Expect = 2.1,   Method: Compositional matrix adjust.
 Identities = 22/57 (38%), Positives = 31/57 (54%), Gaps = 1/57 (1%)

Query: 626 DLNPKFVLQVYRDVVATGDKKFAKAVWPSVYVAMAYMDQF-DRDGDGMIENDGFPDQ 681
           D  P FV+     +  TGD  F  ++ P V  A+A++D+  DRDGDG +E     DQ
Sbjct: 392 DSTPLFVMLAGAYLERTGDSAFIASILPHVRAALAWIDEHGDRDGDGFVEYGRLTDQ 448


>gi|345303570|ref|YP_004825472.1| Amylo-alpha-16-glucosidase [Rhodothermus marinus SG0.5JP17-172]
 gi|345112803|gb|AEN73635.1| Amylo-alpha-16-glucosidase [Rhodothermus marinus SG0.5JP17-172]
          Length = 868

 Score = 41.6 bits (96), Expect = 2.1,   Method: Compositional matrix adjust.
 Identities = 34/128 (26%), Positives = 55/128 (42%), Gaps = 19/128 (14%)

Query: 598 GAVPHDI----GICDPWFEVNAYCLYDTARWKDLNPKFVLQVYRDVVATGDKKFAKAVWP 653
           G +PH+I     + D WFE   Y         D  P F++       A+GD  + +  W 
Sbjct: 363 GKIPHEISQSAALID-WFEDYPYPWASA----DATPLFIIAHADYWQASGDLDYIREHWD 417

Query: 654 SVYVAMAYMDQFDRDGDGMIENDGFPDQTYDTWSVSGI------SAYSGGLWVAALQAAS 707
           ++  A  +    D DG+ ++EN G        W   G+        Y  G+W+AAL+   
Sbjct: 418 ALVRAYRFTAGTDTDGNDLVENTG----VGHGWVEGGLLYPPHEELYQQGVWLAALEGME 473

Query: 708 ALAREVGD 715
           A+A  +G+
Sbjct: 474 AMATALGE 481


>gi|337288158|ref|YP_004627630.1| alpha-L-rhamnosidase [Thermodesulfobacterium sp. OPB45]
 gi|334901896|gb|AEH22702.1| alpha-L-rhamnosidase [Thermodesulfobacterium geofontis OPF15]
          Length = 707

 Score = 41.6 bits (96), Expect = 2.1,   Method: Compositional matrix adjust.
 Identities = 34/110 (30%), Positives = 52/110 (47%), Gaps = 21/110 (19%)

Query: 626 DLNPKFVLQVYRDVVATGDKKFAKAVWPSVYVAMAYMDQF-DRDGDGMIENDGFPDQ--- 681
           D  P FV+   +    TGD  F K++W ++ +A++++D + D DGDG +E   FP +   
Sbjct: 358 DATPLFVILASKYFERTGDIDFIKSIWKNIVLAISWIDNYGDVDGDGFVEY--FPSKKGL 415

Query: 682 TYDTWSVSGISA-YSGGLWV--------------AALQAASALAREVGDR 716
           T   W  S  S  Y  G  V               A + AS LAR +G++
Sbjct: 416 TNKGWKDSEDSIFYEDGTLVKPPVALVEVQGYVYKAKREASKLARILGEK 465


>gi|300787634|ref|YP_003767925.1| alpha-L-rhamnosidase [Amycolatopsis mediterranei U32]
 gi|384151022|ref|YP_005533838.1| alpha-L-rhamnosidase [Amycolatopsis mediterranei S699]
 gi|399539517|ref|YP_006552179.1| alpha-L-rhamnosidase [Amycolatopsis mediterranei S699]
 gi|299797148|gb|ADJ47523.1| alpha-L-rhamnosidase [Amycolatopsis mediterranei U32]
 gi|340529176|gb|AEK44381.1| alpha-L-rhamnosidase [Amycolatopsis mediterranei S699]
 gi|398320287|gb|AFO79234.1| alpha-L-rhamnosidase [Amycolatopsis mediterranei S699]
          Length = 726

 Score = 41.6 bits (96), Expect = 2.1,   Method: Compositional matrix adjust.
 Identities = 45/151 (29%), Positives = 72/151 (47%), Gaps = 13/151 (8%)

Query: 642 TGDKKFAKAVWPSVYVAMAYMDQFDRDGDGMIENDGFPDQTYDTWSVSG-ISAYSGGLWV 700
           TGDK +  AV P++   +AY D F R+ D ++ ++   D+ Y   + +G ++ YS    +
Sbjct: 384 TGDKAYVTAVLPALRKELAYNDSF-RNADNLVVSN---DRDYWQATQTGEVTKYSIDYSI 439

Query: 701 AALQAASALAREVGDRGSEDYFLFKFQKAKV-VYEKLWNGSYFNYDNSGSSQSSSIQ-AD 758
             L+  S L R VG     D +  K    K  V  KLWN +   Y  S +     ++ A+
Sbjct: 440 -VLREMSWLERAVGSAALADGYDTKADAIKTAVMSKLWNPALGAYGQSSAKPDVLVEDAN 498

Query: 759 QLAGQWYARACGLLPIVDEDKARSALKKVYN 789
            LA Q+     G +P   E K  +ALK ++ 
Sbjct: 499 ALALQY-----GFVPAGQEAKVLAALKTLWT 524


>gi|86608269|ref|YP_477031.1| glycogen debranching enzyme family protein [Synechococcus sp.
           JA-2-3B'a(2-13)]
 gi|86556811|gb|ABD01768.1| glycogen debranching enzyme family protein [Synechococcus sp.
           JA-2-3B'a(2-13)]
          Length = 721

 Score = 41.6 bits (96), Expect = 2.2,   Method: Compositional matrix adjust.
 Identities = 20/53 (37%), Positives = 31/53 (58%), Gaps = 1/53 (1%)

Query: 626 DLNPKFVLQVYRDVVATGDKKFAKAVWPSVYVAMAYMDQF-DRDGDGMIENDG 677
           D  P FV+   R V  +GD +  + +WP++  A+ +MD + D DGDG +E  G
Sbjct: 370 DSTPLFVVLAGRYVRRSGDLELLRQLWPNIEAALHWMDTYGDADGDGFLEYAG 422


>gi|373456251|ref|ZP_09548018.1| Amylo-alpha-16-glucosidase [Caldithrix abyssi DSM 13497]
 gi|371717915|gb|EHO39686.1| Amylo-alpha-16-glucosidase [Caldithrix abyssi DSM 13497]
          Length = 1123

 Score = 41.2 bits (95), Expect = 2.7,   Method: Compositional matrix adjust.
 Identities = 35/138 (25%), Positives = 60/138 (43%), Gaps = 8/138 (5%)

Query: 619 YDTARWKDLNPKFVLQVYRDVVATGDKKFAKAVWPSVYVAMAYMDQFDRDGDGMIENDGF 678
           YD A   D  P +V+     + A+GD  F +  W  +  A+ ++   D DGD +IEN   
Sbjct: 647 YDAA---DATPLYVILAAHYLKASGDLTFIQQSWSHIKRAIDFLYSTDTDGDMLIENKNV 703

Query: 679 PDQTYDTWSVSGISA--YSGGLWVAALQAASALAREVGDRGSEDYFLFKFQKA-KVVYEK 735
                +   + G +   Y  GLW   L+ A+ LA+++  +     +    +K  +++   
Sbjct: 704 GHGWVEGGKLWGANPTFYLAGLWAQTLKDAAYLAKKLKLQDLSRRYAADAEKVIRILNRD 763

Query: 736 LWN--GSYFNYDNSGSSQ 751
            WN    +F Y    S Q
Sbjct: 764 FWNEKDDFFYYGKIKSCQ 781


>gi|374624040|ref|ZP_09696525.1| Amylo-alpha-1,6-glucosidase [Ectothiorhodospira sp. PHS-1]
 gi|373943126|gb|EHQ53671.1| Amylo-alpha-1,6-glucosidase [Ectothiorhodospira sp. PHS-1]
          Length = 722

 Score = 41.2 bits (95), Expect = 2.8,   Method: Compositional matrix adjust.
 Identities = 21/62 (33%), Positives = 34/62 (54%), Gaps = 6/62 (9%)

Query: 626 DLNPKFVLQVYRDVVATGDKKFAKAVWPSVYVAMAYMDQF-DRDGDGMIE-----NDGFP 679
           D  P F++   R    T D+ F   +WP +  A+A++D++ D DGDG +E     +DG  
Sbjct: 370 DATPLFIVLAERYYRHTADRSFIAGLWPHIERALAWIDRYGDVDGDGFVEYHRESDDGLV 429

Query: 680 DQ 681
           +Q
Sbjct: 430 NQ 431


>gi|150376166|ref|YP_001312762.1| amylo-alpha-16-glucosidase [Sinorhizobium medicae WSM419]
 gi|150030713|gb|ABR62829.1| Amylo-alpha-16-glucosidase [Sinorhizobium medicae WSM419]
          Length = 734

 Score = 41.2 bits (95), Expect = 3.2,   Method: Compositional matrix adjust.
 Identities = 18/50 (36%), Positives = 30/50 (60%), Gaps = 1/50 (2%)

Query: 626 DLNPKFVLQVYRDVVATGDKKFAKAVWPSVYVAMAYMDQF-DRDGDGMIE 674
           D  P F++     +  TGD    + +WP++  A+ ++D+F DRDGDG +E
Sbjct: 382 DSTPLFIMLAGAYLDRTGDTDTVRGLWPNIVAALDWIDRFGDRDGDGFVE 431


>gi|75676297|ref|YP_318718.1| amylo-alpha-1,6-glucosidase [Nitrobacter winogradskyi Nb-255]
 gi|74421167|gb|ABA05366.1| amylo-alpha-1,6-glucosidase [Nitrobacter winogradskyi Nb-255]
          Length = 741

 Score = 40.8 bits (94), Expect = 3.5,   Method: Compositional matrix adjust.
 Identities = 30/106 (28%), Positives = 45/106 (42%), Gaps = 22/106 (20%)

Query: 626 DLNPKFVLQVYRDVVATGDKKFAKAVWPSVYVAMAYMDQF-DRDGDGMIE---------- 674
           D  P FV+     +  TGD    + +WP +  A+ +++Q+ DRDGDG +E          
Sbjct: 389 DSTPLFVVLAGAYLDRTGDVVTIRRLWPHIEAALTWIEQYGDRDGDGFVEYHRQTSEGLI 448

Query: 675 NDGFPDQTYDTWSVSG-----------ISAYSGGLWVAALQAASAL 709
           N G+ D     +   G           + AY  G W AA + A  L
Sbjct: 449 NQGWKDSYDSVFHADGTFAKGPIALVEVQAYVYGAWCAAAKIAEQL 494


>gi|398384993|ref|ZP_10543020.1| glycogen debranching enzyme [Sphingobium sp. AP49]
 gi|397721792|gb|EJK82339.1| glycogen debranching enzyme [Sphingobium sp. AP49]
          Length = 739

 Score = 40.8 bits (94), Expect = 3.8,   Method: Compositional matrix adjust.
 Identities = 36/140 (25%), Positives = 60/140 (42%), Gaps = 30/140 (21%)

Query: 626 DLNPKFVLQVYRDVVATGDKKFAKAVWPSVYVAMAYMDQF-DRDGDGMIE---------- 674
           D  P FV+     +  TGD +F   + P++  A+ ++D   DRDGDG +E          
Sbjct: 387 DSTPLFVMLAGAYLERTGDLQFLARILPNIKSALQWIDNHGDRDGDGFVEYGRLTDQGLQ 446

Query: 675 NDGFPDQTYDTWSVSG-----------ISAYSGGLWVAALQAASALAREVGDRGSEDYFL 723
           N G+ D     +   G           + AY  G W    QAAS++ R  GD    + +L
Sbjct: 447 NQGWKDSHDSIFHADGSIAKGPIALAEVQAYVYGAW----QAASSIYRAKGDLDRSESYL 502

Query: 724 FKFQKAKVVYEKLWNGSYFN 743
            +     V  +  ++ ++F+
Sbjct: 503 AR----AVDLQDRFDATFFD 518


>gi|163851052|ref|YP_001639095.1| amylo-alpha-16-glucosidase [Methylobacterium extorquens PA1]
 gi|163662657|gb|ABY30024.1| Amylo-alpha-16-glucosidase [Methylobacterium extorquens PA1]
          Length = 778

 Score = 40.4 bits (93), Expect = 4.8,   Method: Compositional matrix adjust.
 Identities = 19/50 (38%), Positives = 31/50 (62%), Gaps = 1/50 (2%)

Query: 626 DLNPKFVLQVYRDVVATGDKKFAKAVWPSVYVAMAYMDQF-DRDGDGMIE 674
           D  P FV+   +    TGD +  +A+WP + +A+ ++D + DRDGDG +E
Sbjct: 426 DGTPLFVMLAAQYHEVTGDLETIRAIWPQLKLALEWIDTYGDRDGDGFVE 475


>gi|110636016|ref|YP_676224.1| amylo-alpha-1,6-glucosidase [Chelativorans sp. BNC1]
 gi|110287000|gb|ABG65059.1| Amylo-alpha-1,6-glucosidase [Chelativorans sp. BNC1]
          Length = 734

 Score = 40.4 bits (93), Expect = 5.1,   Method: Compositional matrix adjust.
 Identities = 18/50 (36%), Positives = 33/50 (66%), Gaps = 1/50 (2%)

Query: 626 DLNPKFVLQVYRDVVATGDKKFAKAVWPSVYVAMAYMDQF-DRDGDGMIE 674
           D  P FV+     +  TGD +  K++WP++  A+++++++ DRDGDG +E
Sbjct: 382 DATPLFVMLAGAYLDRTGDLETLKSLWPNIEAALSWIEEYGDRDGDGFVE 431


>gi|323495701|ref|ZP_08100771.1| glycogen debranching enzyme [Vibrio sinaloensis DSM 21326]
 gi|323319168|gb|EGA72109.1| glycogen debranching enzyme [Vibrio sinaloensis DSM 21326]
          Length = 924

 Score = 40.0 bits (92), Expect = 5.9,   Method: Compositional matrix adjust.
 Identities = 44/173 (25%), Positives = 74/173 (42%), Gaps = 22/173 (12%)

Query: 573 AAVMMHDPSKMKLLDDGQWVSRKVLGAVPHDIGICDPWFEVNAYCLYDTARWKDLNPKFV 632
           A  ++ + + M+L DD      ++   V     I           +Y+T    D  P  +
Sbjct: 443 AKAIIENFASMQLQDDTSLNDGRIPNRVTSKTNI-----------IYNTT---DGTPWMI 488

Query: 633 LQVYRDVVATGDKKFAKAVWPSVYVAMAYMDQFDRDGDGMIEN---DGFPDQTYD---TW 686
            ++   +  +GD  FAK ++P V   +A +++   D DG+I++   D + D   D    W
Sbjct: 489 REILEYINYSGDIDFAKRIYPVVQRFIAGVERNYLDQDGLIKHRHPDTWMDAKIDGQIPW 548

Query: 687 SVSGISAYS-GGLWVAALQAASALAREVGDRGSEDYFLFKFQKAKVVY-EKLW 737
           S  G  A     LW  AL  A  LA+   D    D+ +   Q+AK  + +K W
Sbjct: 549 SPRGPKANDIQALWFDALHVAQTLAQLNQDDVYADHVVSLAQQAKQSFIDKFW 601


>gi|449672634|ref|XP_002169551.2| PREDICTED: uncharacterized protein LOC100201428 [Hydra
           magnipapillata]
          Length = 502

 Score = 40.0 bits (92), Expect = 6.3,   Method: Compositional matrix adjust.
 Identities = 42/166 (25%), Positives = 73/166 (43%), Gaps = 9/166 (5%)

Query: 626 DLNPKFVLQVYRDVVATGDKKFAKAVWPSVYVAMAYMDQFDRDGDGMIENDGFPDQTYDT 685
           D+N  F++Q       + D  F   ++P    AM ++ Q    G G+      PD TYD 
Sbjct: 107 DVNVGFIVQALLMYKWSNDTVFFDEIYPFTVRAMNWLMQDATKGTGLPYRK--PD-TYDL 163

Query: 686 WSVSGI--SAYSGGLWVAALQAASALAREVGDRGSEDYFLFKFQKA-KVVYEKLWN---G 739
           + +      AY+   ++ AL A  ++A    D  +         +A K +  ++WN   G
Sbjct: 164 FDLEKYDHCAYNSISYLLALHAMKSMASMQKDNNTIKNVNAALARATKQLEVEMWNEKKG 223

Query: 740 SYFNYDNSGSSQSSSIQADQLAGQWYARACGLLPIVDEDKARSALK 785
            Y  + +      + + +D L GQ +A   GL  ++D+DK +  LK
Sbjct: 224 FYHAWFDDEWGSPTWLMSDVLYGQVWAYTLGLGDLLDKDKMKKHLK 269


>gi|15921050|ref|NP_376719.1| hypothetical protein ST0815 [Sulfolobus tokodaii str. 7]
 gi|342306303|dbj|BAK54392.1| glucan 1,4-alpha-glucosidase [Sulfolobus tokodaii str. 7]
          Length = 615

 Score = 40.0 bits (92), Expect = 7.0,   Method: Compositional matrix adjust.
 Identities = 36/147 (24%), Positives = 64/147 (43%), Gaps = 20/147 (13%)

Query: 634 QVYRDVVATGD--KKFAKAVWPSVYVAMAYMDQFDRDGDGMIENDGFPDQTYDTWSVS-G 690
           Q+Y+D+    D  K F K   P++   M Y++            DG P  ++D W    G
Sbjct: 378 QIYKDIDELIDIFKNFLK---PAIRFLMRYVE------------DGLPKPSFDLWEERYG 422

Query: 691 ISAYSGGLWVAALQAASALAREVGDR--GSEDYFLFKFQKAKVVYEKLWNGSYFNYDNSG 748
           I  Y+      AL +AS LAR++GD    S+   + ++ K +V+   + NG +    +  
Sbjct: 423 IHIYTVSTVYGALVSASELARDIGDEVLASDMLDVAEYMKEEVLRRMVHNGRFIRRIDEN 482

Query: 749 SSQSSSIQADQLAGQWYARACGLLPIV 775
            ++   I A   +  ++       PI+
Sbjct: 483 GNKDLVIDASMYSPYFFGMVDVRDPIM 509


>gi|240138186|ref|YP_002962658.1| hypothetical protein MexAM1_META1p1519 [Methylobacterium extorquens
           AM1]
 gi|418061390|ref|ZP_12699252.1| Amylo-alpha-16-glucosidase [Methylobacterium extorquens DSM 13060]
 gi|240008155|gb|ACS39381.1| conserved hypothetical protein, putative glycosyltransferase
           [Methylobacterium extorquens AM1]
 gi|373565054|gb|EHP91121.1| Amylo-alpha-16-glucosidase [Methylobacterium extorquens DSM 13060]
          Length = 778

 Score = 39.7 bits (91), Expect = 7.4,   Method: Compositional matrix adjust.
 Identities = 19/50 (38%), Positives = 30/50 (60%), Gaps = 1/50 (2%)

Query: 626 DLNPKFVLQVYRDVVATGDKKFAKAVWPSVYVAMAYMDQF-DRDGDGMIE 674
           D  P FV+   +    TGD    +A+WP + +A+ ++D + DRDGDG +E
Sbjct: 426 DGTPLFVMLAAQYHEVTGDLDTIRAIWPQLKLALEWIDTYGDRDGDGFVE 475


>gi|218529882|ref|YP_002420698.1| amylo-alpha-16-glucosidase [Methylobacterium extorquens CM4]
 gi|218522185|gb|ACK82770.1| Amylo-alpha-16-glucosidase [Methylobacterium extorquens CM4]
          Length = 778

 Score = 39.7 bits (91), Expect = 7.5,   Method: Compositional matrix adjust.
 Identities = 19/50 (38%), Positives = 30/50 (60%), Gaps = 1/50 (2%)

Query: 626 DLNPKFVLQVYRDVVATGDKKFAKAVWPSVYVAMAYMDQF-DRDGDGMIE 674
           D  P FV+   +    TGD    +A+WP + +A+ ++D + DRDGDG +E
Sbjct: 426 DGTPLFVMLAAQYHEVTGDLDTIRAIWPQLKLALEWIDTYGDRDGDGFVE 475


>gi|294012573|ref|YP_003546033.1| amylo-alpha-1,6-glucosidase [Sphingobium japonicum UT26S]
 gi|292675903|dbj|BAI97421.1| amylo-alpha-1,6-glucosidase [Sphingobium japonicum UT26S]
          Length = 749

 Score = 39.7 bits (91), Expect = 7.7,   Method: Compositional matrix adjust.
 Identities = 21/57 (36%), Positives = 31/57 (54%), Gaps = 1/57 (1%)

Query: 626 DLNPKFVLQVYRDVVATGDKKFAKAVWPSVYVAMAYMDQF-DRDGDGMIENDGFPDQ 681
           D  P FV+     +  TGD  F  ++ P V  A++++D+  DRDGDG +E     DQ
Sbjct: 397 DSTPLFVMLAGAYLERTGDSGFIASILPHVEAALSWIDEHGDRDGDGFVEYGRLTDQ 453


>gi|254560746|ref|YP_003067841.1| hypothetical protein METDI2293 [Methylobacterium extorquens DM4]
 gi|254268024|emb|CAX23895.1| conserved hypothetical protein, putative glycosyltransferase
           [Methylobacterium extorquens DM4]
          Length = 778

 Score = 39.7 bits (91), Expect = 7.7,   Method: Compositional matrix adjust.
 Identities = 19/50 (38%), Positives = 30/50 (60%), Gaps = 1/50 (2%)

Query: 626 DLNPKFVLQVYRDVVATGDKKFAKAVWPSVYVAMAYMDQF-DRDGDGMIE 674
           D  P FV+   +    TGD    +A+WP + +A+ ++D + DRDGDG +E
Sbjct: 426 DGTPLFVMLAAQYHEVTGDLDTIRAIWPQLKLALEWIDTYGDRDGDGFVE 475


  Database: nr
    Posted date:  Mar 3, 2013 10:45 PM
  Number of letters in database: 999,999,864
  Number of sequences in database:  2,912,245
  
  Database: /local_scratch/syshi//blastdatabase/nr.01
    Posted date:  Mar 3, 2013 10:52 PM
  Number of letters in database: 999,999,666
  Number of sequences in database:  2,912,720
  
  Database: /local_scratch/syshi//blastdatabase/nr.02
    Posted date:  Mar 3, 2013 10:58 PM
  Number of letters in database: 999,999,938
  Number of sequences in database:  3,014,250
  
  Database: /local_scratch/syshi//blastdatabase/nr.03
    Posted date:  Mar 3, 2013 11:03 PM
  Number of letters in database: 999,999,780
  Number of sequences in database:  2,805,020
  
  Database: /local_scratch/syshi//blastdatabase/nr.04
    Posted date:  Mar 3, 2013 11:08 PM
  Number of letters in database: 999,999,551
  Number of sequences in database:  2,816,253
  
  Database: /local_scratch/syshi//blastdatabase/nr.05
    Posted date:  Mar 3, 2013 11:13 PM
  Number of letters in database: 999,999,897
  Number of sequences in database:  2,981,387
  
  Database: /local_scratch/syshi//blastdatabase/nr.06
    Posted date:  Mar 3, 2013 11:18 PM
  Number of letters in database: 999,999,649
  Number of sequences in database:  2,911,476
  
  Database: /local_scratch/syshi//blastdatabase/nr.07
    Posted date:  Mar 3, 2013 11:24 PM
  Number of letters in database: 999,999,452
  Number of sequences in database:  2,920,260
  
  Database: /local_scratch/syshi//blastdatabase/nr.08
    Posted date:  Mar 3, 2013 11:25 PM
  Number of letters in database: 64,230,274
  Number of sequences in database:  189,558
  
Lambda     K      H
   0.318    0.134    0.417 

Lambda     K      H
   0.267   0.0410    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 16,363,448,587
Number of Sequences: 23463169
Number of extensions: 722419741
Number of successful extensions: 1540419
Number of sequences better than 100.0: 552
Number of HSP's better than 100.0 without gapping: 412
Number of HSP's successfully gapped in prelim test: 140
Number of HSP's that attempted gapping in prelim test: 1536609
Number of HSP's gapped (non-prelim): 1273
length of query: 954
length of database: 8,064,228,071
effective HSP length: 152
effective length of query: 802
effective length of database: 8,792,793,679
effective search space: 7051820530558
effective search space used: 7051820530558
T: 11
A: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)
S2: 82 (36.2 bits)