BLASTP 2.2.26 [Sep-21-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.


Reference for compositional score matrix adjustment: Altschul, Stephen F., 
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.

Query= 038924
         (920 letters)

Database: nr 
           23,463,169 sequences; 8,064,228,071 total letters

Searching..................................................done



>gi|224146269|ref|XP_002325943.1| predicted protein [Populus trichocarpa]
 gi|222862818|gb|EEF00325.1| predicted protein [Populus trichocarpa]
          Length = 891

 Score = 1375 bits (3559), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 641/891 (71%), Positives = 745/891 (83%), Gaps = 34/891 (3%)

Query: 55  IGFRLYRYSKQEEAKGKVPVFNVFRKHHITSDQGIPLGGIGAGSIGRSYRGEFQRFKLFH 114
           +G RL+R+ KQE AK +  +F+  +KH +TSD GIPLGGIGAGSIGRSYRGEFQ F+LF 
Sbjct: 4   MGLRLWRHIKQEAAKERATIFDFSKKHILTSDHGIPLGGIGAGSIGRSYRGEFQHFRLFP 63

Query: 115 GICDDAPVLANQFSVFVSRPNGEKFSSVLCPRSPGVPKKNTDSGIESWDWNLKGENCTYH 174
           GIC++ PVLANQFS FVSRP+GE FSSVLC R+P +PK++T SGIESWDWNL G+ CTYH
Sbjct: 64  GICEEGPVLANQFSAFVSRPSGETFSSVLCSRTPDIPKESTGSGIESWDWNLNGQKCTYH 123

Query: 175 ALFPRAWTVYDGEPDPELRIVCRQISPFIPHNYKESSFPASVFTFTLSNSGQTSADVTLL 234
           ALFPRAWT+YDG PDPEL IV RQISPFIPHNYKESSFP SVFTFTLSN G+TSADVTL+
Sbjct: 124 ALFPRAWTIYDGAPDPELTIVSRQISPFIPHNYKESSFPVSVFTFTLSNHGKTSADVTLM 183

Query: 235 FTWANSVAGDSGLSGHHFNSKTMTKDGVHGLTLHHRTANGRPPVTFAVAAEETADVHVSE 294
           FTWANSV G SGLSGHHFNSK MTKDGVH +TLHH+  N +PPVTFA+AA+ETADVHVSE
Sbjct: 184 FTWANSVGGVSGLSGHHFNSKMMTKDGVHAVTLHHKMTNRQPPVTFAIAAQETADVHVSE 243

Query: 295 CPCFLLSGNSK--GITAKDMWNEIKKHGSFDHLDNDKTSPSEPGSSIGAAIAASLTIPSG 352
           CPCFL+SG+S+   +TAKDMW+ I+K+G+FD +  +KTSPSEPGSSIGAAIAAS+T+PSG
Sbjct: 244 CPCFLISGDSQDVSVTAKDMWDVIRKNGTFDQIGCNKTSPSEPGSSIGAAIAASVTVPSG 303

Query: 353 STRSVTFSLAWDCPEVKFFEKVYHRRYTKFYGTLGDSAARIAHDAILEHAKWECEIEAWQ 412
           S R+VTFSL+WD PE++F E+ YHRRYTKFYGTLGD+AA IA DAILEHA WE +IEAWQ
Sbjct: 304 SIRTVTFSLSWDIPEIRFSERSYHRRYTKFYGTLGDAAANIARDAILEHANWESQIEAWQ 363

Query: 413 RPILEDKRFPEWYPITLFNELYYLNAGGTIWTDGSPPMQSLATIRERKFSLDTPS----- 467
           RPILEDKR PEWYPITLFNELYYLNAGGTIWTD SPPMQ+L  +RER+FSL+  S     
Sbjct: 364 RPILEDKRVPEWYPITLFNELYYLNAGGTIWTDESPPMQNLTAVRERRFSLERSSSGYKN 423

Query: 468 -----------------------KIHTPASSDTALGTRLLENGEENIGQFLYLEGAEYVM 504
                                  ++H P SS++  G  LL+NGEENIGQ LYLEG EY+M
Sbjct: 424 GNGIAYKNDTAIEILERMASTYEQLHNPVSSNSVFGANLLQNGEENIGQLLYLEGTEYLM 483

Query: 505 YNTYDVHFYSSFALVMLFPKLELSIQRDFAAAVMMHDPGTMKIMSDGKWVARKCLGAVPH 564
           +NTYDVHFY+SFALVMLFPKLEL++QRDFAAAV+MHDP  M+IMSDGKWV RK LGAVPH
Sbjct: 484 WNTYDVHFYASFALVMLFPKLELNLQRDFAAAVLMHDPSRMQIMSDGKWVPRKVLGAVPH 543

Query: 565 DIGLDDPWFEINSYNLFNSSRWKDLNSKFVLQVYRDFVATGDKNFARAVWPSVYIAMAYM 624
           D+GL+DPWFEIN+YNLF+++RWKDLN KFVLQVYRD VATGDK+FARAVWPSVY+AMAYM
Sbjct: 544 DVGLNDPWFEINAYNLFSTARWKDLNCKFVLQVYRDVVATGDKDFARAVWPSVYVAMAYM 603

Query: 625 EQFDKDGDGMIENEGFPDQTYDAWSANGVSAYCGGLWVAALQAASALANDVGDHASASYF 684
           +QFDKDGDGMIENEG PDQTYD WS  GVSAYCGGLWVAALQA SA+A++VGD  SA+YF
Sbjct: 604 DQFDKDGDGMIENEGIPDQTYDTWSVTGVSAYCGGLWVAALQATSAMAHEVGDDGSANYF 663

Query: 685 WVRYQKAKAVYDSLWNGSYFNYDNSDGSSSTSIQADQLAGQWYARACGLLPIADEAKVKK 744
           W++YQKAK VY  LWNGSYFNYD+   S + SIQADQLAGQWYARACGL P+ D  K +K
Sbjct: 664 WIKYQKAKTVYGKLWNGSYFNYDS---SGNKSIQADQLAGQWYARACGLSPVVDGDKARK 720

Query: 745 ALTKIYDFNVLKVKGGMCGAMNGMQPDGRIDMSGLQAREIWPGVTYGLAASMIQEEMVDM 804
           AL KIY+FNVLKVKGG  GA+NGM PDGR+D++ +Q +EIWPGVTY L+ASMIQE + +M
Sbjct: 721 ALEKIYEFNVLKVKGGTRGAVNGMLPDGRVDLTAMQTKEIWPGVTYALSASMIQEGLEEM 780

Query: 805 AFQTAAGVYEVAWSEDGLGYSFQTPESWNNNDEYRSLCYMRPLTIWAMQWALTKPKLSRQ 864
           AFQTA G+Y  AWSE+GLGYSFQ PESW+ ND+YRSLCYMRPL IWAMQWAL+KPK  ++
Sbjct: 781 AFQTAVGIYNAAWSEEGLGYSFQIPESWDMNDQYRSLCYMRPLAIWAMQWALSKPKDFKE 840

Query: 865 EIKHEISDRDSSYLEQHAAFSKVASLLKLPKEEASKGFLKVVYDFTIGRVL 915
           E+ HE    D SYL+QHA FSKVA LL+LP+EEA K F + +Y+FT  R+L
Sbjct: 841 EMHHE-GIEDESYLKQHAGFSKVAHLLRLPEEEAPKSFFQAIYEFTCKRML 890


>gi|255569126|ref|XP_002525532.1| conserved hypothetical protein [Ricinus communis]
 gi|223535211|gb|EEF36890.1| conserved hypothetical protein [Ricinus communis]
          Length = 948

 Score = 1359 bits (3517), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 666/948 (70%), Positives = 770/948 (81%), Gaps = 37/948 (3%)

Query: 1   MENGVKEEGENAA----SMPKVDPGQPPQLTWQRKLDSNVKVPTGFTLSFQEFRHLAHIG 56
           MEN +KEEGE  +    S  K++PG PP L+W+R L+SN  VP GFTLSF+E  H+  IG
Sbjct: 2   MENRIKEEGEKDSYLDSSTQKINPGVPPSLSWERPLNSNGNVPLGFTLSFREILHMLPIG 61

Query: 57  FRLYRYSKQEEAKGKVPVFNVFRKHHITSDQGIPLGGIGAGSIGRSYRGEFQRFKLFHGI 116
            RL+RYSK+E  KG+VP+F+  +KH IT D  +PLGGIGAGSIGRSY+GEFQ FKL    
Sbjct: 62  LRLWRYSKEEPTKGRVPIFDFSKKHVITGDHAVPLGGIGAGSIGRSYKGEFQCFKLLPLA 121

Query: 117 CDDAPVLANQFSVFVSRPNGEKFSSVLCPRSPGVPKKNTDSGIESWDWNLKGENCTYHAL 176
           C++ P+LANQFSVFVSRPNG+KFSSVLC R P +P +   SGIESWDWNL GENCTYHAL
Sbjct: 122 CEEGPILANQFSVFVSRPNGKKFSSVLCSRRPELPTEIKGSGIESWDWNLNGENCTYHAL 181

Query: 177 FPRAWTVYDGEPDPELRIVCRQISPFIPHNYKESSFPASVFTFTLSNSGQTSADVTLLFT 236
           FPRAWT Y G+PDPEL+IV +QISPFIPHNYKESSFP SVFTFTLSN G+TSADVTLLFT
Sbjct: 182 FPRAWTTYKGQPDPELKIVSQQISPFIPHNYKESSFPVSVFTFTLSNFGRTSADVTLLFT 241

Query: 237 WANSVAGDSGLSGHHFNSKTMTKDGVHGLTLHHRTANGRPPVTFAVAAEETADVHVSECP 296
           WANS+ G SG S HHFNS  M KDGVH +TLHH+T +G+PP+TFA+AA+ET DVHVSECP
Sbjct: 242 WANSIGGVSGSSRHHFNSSIMKKDGVHTVTLHHKTGDGQPPLTFAIAAQETPDVHVSECP 301

Query: 297 CFLLSGNSKGITAKDMWNEIKKHGSFDHLDNDKTSPSEPGSSIGAAIAASLTIPSGSTRS 356
           CFL+SGNS+G+TAKDMW+EIKKHG+FDHL  +K SPSE GS IGAAIAA+LTIP  + R+
Sbjct: 302 CFLISGNSQGVTAKDMWDEIKKHGTFDHLSYNKISPSEGGSCIGAAIAATLTIPPDTIRT 361

Query: 357 VTFSLAWDCPEVKFFEKVYHRRYTKFYGTLGDSAARIAHDAILEHAKWECEIEAWQRPIL 416
            TFSLAWDCPEV+F  + YHRRYTKFYGTLGD+AA IAHDAILEH  WE +IEAWQRPIL
Sbjct: 362 ATFSLAWDCPEVRFSGRTYHRRYTKFYGTLGDAAADIAHDAILEHGHWESQIEAWQRPIL 421

Query: 417 EDKRFPEWYPITLFNELYYLNAGGTIWTDGSPPMQSLATIRERKFSLDTP---------- 466
           EDKR P+WYPITLFNELYYLNAGGT+WTDGSPPMQS A I+  K +LD            
Sbjct: 422 EDKRLPKWYPITLFNELYYLNAGGTVWTDGSPPMQSFAAIKGGKLTLDKSRSEFENARPV 481

Query: 467 ------------------SKIHTPASSDTALGTRLLENGEENIGQFLYLEGAEYVMYNTY 508
                              K+H P +S+ A GT LL+N EENIGQFLYLEG+EY+M+NTY
Sbjct: 482 AHRNDTAVEILNRMASIYEKMHNPVTSNAAFGTYLLQNDEENIGQFLYLEGSEYLMWNTY 541

Query: 509 DVHFYSSFALVMLFPKLELSIQRDFAAAVMMHDPGTMKIMSDGKWVARKCLGAVPHDIGL 568
           DVHFYSSFAL+MLFPKLELSIQRDFAAAVMMHDP  M+IMSDG+ V RK LGAVPHDIGL
Sbjct: 542 DVHFYSSFALLMLFPKLELSIQRDFAAAVMMHDPSRMQIMSDGRRVPRKVLGAVPHDIGL 601

Query: 569 DDPWFEINSYNLFNSSRWKDLNSKFVLQVYRDFVATGDKNFARAVWPSVYIAMAYMEQFD 628
           +DPWFE+N+YNL +++RWKDLN KFVLQ+YRD VATGDK+FA AVWPSVY+AMAYM+QFD
Sbjct: 602 NDPWFEVNAYNLISTARWKDLNPKFVLQIYRDVVATGDKSFALAVWPSVYVAMAYMDQFD 661

Query: 629 KDGDGMIENEGFPDQTYDAWSANGVSAYCGGLWVAALQAASALANDVGDHASASYFWVRY 688
           KDGDGMIENEGFPDQTYD WS  GVSAYCGGLWVAALQAASALA +VGD  SAS+FWV+Y
Sbjct: 662 KDGDGMIENEGFPDQTYDTWSVTGVSAYCGGLWVAALQAASALACEVGDSESASFFWVKY 721

Query: 689 QKAKAVYDSLWNGSYFNYDNSDGSSSTSIQADQLAGQWYARACGLLPIADEAKVKKALTK 748
           QKAKAVY +LWNGSYFNYD+S  S   SI ADQLAGQWYARACGL  I DE KV+ AL K
Sbjct: 722 QKAKAVYSTLWNGSYFNYDSSGNS---SIHADQLAGQWYARACGLSSIVDEEKVRSALEK 778

Query: 749 IYDFNVLKVKGGMCGAMNGMQPDGRIDMSGLQAREIWPGVTYGLAASMIQEEMVDMAFQT 808
           IY FNVLKVK G  GA+NGM PDG++DMS +Q+REIWPGVTY L+ASMIQE M +MAFQT
Sbjct: 779 IYKFNVLKVKEGKRGAVNGMLPDGKVDMSVMQSREIWPGVTYALSASMIQEGMAEMAFQT 838

Query: 809 AAGVYEVAWSEDGLGYSFQTPESWNNNDEYRSLCYMRPLTIWAMQWALTKPKLSRQEIK- 867
           A+G+YE AWS++GLGYSFQ PE WN +D+YRSLCYMRPL IWAMQWAL+KPK+ ++E+K 
Sbjct: 839 ASGIYEAAWSQEGLGYSFQIPEGWNTDDQYRSLCYMRPLAIWAMQWALSKPKVFKEEMKL 898

Query: 868 HEISDRDSSYLEQHAAFSKVASLLKLPKEEAS-KGFLKVVYDFTIGRV 914
             ++  D  +  Q+A FSKVA LLKLP +E S K FL+  Y+FT  R+
Sbjct: 899 LSLAADDRLHPSQNAGFSKVAHLLKLPDDEGSNKSFLQSAYEFTCRRL 946


>gi|225460125|ref|XP_002275782.1| PREDICTED: non-lysosomal glucosylceramidase-like [Vitis vinifera]
          Length = 969

 Score = 1328 bits (3437), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 657/954 (68%), Positives = 770/954 (80%), Gaps = 35/954 (3%)

Query: 1   MENGVKEEGEN----AASMPKVDPGQPPQLTWQRKLDSNVKVPTGFTLSFQEFRHLAHIG 56
           MENG KE+G+      +S  KV+PG+P  LTWQRKL++     T F L  +E +HLA +G
Sbjct: 17  MENGHKEDGDMEHPVKSSAHKVNPGKPASLTWQRKLNTKANTLTRFNLKLREIKHLAPLG 76

Query: 57  FRLYRYSKQEEAKGKVPVFNVFRKHHITSDQGIPLGGIGAGSIGRSYRGEFQRFKLFHGI 116
            RL+ +   E AKG++ + + F K  +TS  G+PLGGIG GSIGRSYRGEFQR++LF  I
Sbjct: 77  VRLWHHVNAEAAKGRISIIDPFSKRLVTSYHGVPLGGIGGGSIGRSYRGEFQRYQLFPRI 136

Query: 117 CDDAPVLANQFSVFVSRPNGEKFSSVLCPRSPGVPKKNTDSGIESWDWNLKGENCTYHAL 176
           C+D+PVLANQFSVFVSRPNG+K S+VLCPR+P V K +  SGI SWDWNL GE+CTYHAL
Sbjct: 137 CEDSPVLANQFSVFVSRPNGKKSSTVLCPRNPEVLKGSASSGIGSWDWNLDGESCTYHAL 196

Query: 177 FPRAWTVYDGEPDPELRIVCRQISPFIPHNYKESSFPASVFTFTLSNSGQTSADVTLLFT 236
           +PRAWTVY+GEPDPE+ I+  QISPFIPHNYKESSFP SVF FTLSNSG+TSAD+TLLFT
Sbjct: 197 YPRAWTVYEGEPDPEISIISSQISPFIPHNYKESSFPVSVFKFTLSNSGKTSADITLLFT 256

Query: 237 WANSVAGDSGLSGHHFNSKTMTKDGVHGLTLHHRTANGRPPVTFAVAAEETADVHVSECP 296
           WANSV G S  SGHH+NSK  TKDGVHG+ LHH+TANG PPVTFA+AAEET DVH+SECP
Sbjct: 257 WANSVGGTSEFSGHHYNSKMKTKDGVHGVLLHHKTANGHPPVTFAIAAEETGDVHISECP 316

Query: 297 CFLLSGNSKGITAKDMWNEIKKHGSFDHLDNDKTS-PSEPGSSIGAAIAASLTIPSGSTR 355
           CFL+SGNS G+TAK+MW EIK+HGSFDHLD D +S  SEPGSSIGAA+AASLT+P  + R
Sbjct: 317 CFLISGNSLGVTAKEMWQEIKQHGSFDHLDFDGSSMRSEPGSSIGAAVAASLTLPPDTVR 376

Query: 356 SVTFSLAWDCPEVKFFE-KVYHRRYTKFYGTLGDSAARIAHDAILEHAKWECEIEAWQRP 414
           +VTFSLAW CPEV+F   K YHRRYT+FYGT  D+A  IAHDAILEHA W  EIEAWQ P
Sbjct: 377 TVTFSLAWACPEVRFTSGKTYHRRYTRFYGTHVDAAEEIAHDAILEHANWVSEIEAWQGP 436

Query: 415 ILEDKRFPEWYPITLFNELYYLNAGGTIWTDGSPPMQSLATIRERKFSLDTP-------- 466
           ILED+R PEWY ITLFNELY+LNAGGTIWTDG PPMQSLATI + KFSLD          
Sbjct: 437 ILEDRRLPEWYRITLFNELYFLNAGGTIWTDGLPPMQSLATIEQIKFSLDRSISDPKNTT 496

Query: 467 --------------------SKIHTPASSDTALGTRLLENGEENIGQFLYLEGAEYVMYN 506
                                +IH P +S++A GT LL++GEEN+GQFLYLEG EY M+N
Sbjct: 497 DIVHQNDSTVEILGRMTSMLEQIHNPTTSNSAFGTYLLQSGEENVGQFLYLEGIEYHMWN 556

Query: 507 TYDVHFYSSFALVMLFPKLELSIQRDFAAAVMMHDPGTMKIMSDGKWVARKCLGAVPHDI 566
           TYDVHFYSSFA++MLFP+LELSIQRDFAAAVM+HDP  MKIMSDGKWV RK LGAVPHDI
Sbjct: 557 TYDVHFYSSFAIIMLFPQLELSIQRDFAAAVMVHDPSRMKIMSDGKWVPRKVLGAVPHDI 616

Query: 567 GLDDPWFEINSYNLFNSSRWKDLNSKFVLQVYRDFVATGDKNFARAVWPSVYIAMAYMEQ 626
           G+ DPWFE+N+YNL+++ RWKDLNSKFVLQVYRD VATGDKNFARAVWP+VYIA+A+++Q
Sbjct: 617 GISDPWFELNAYNLYDTDRWKDLNSKFVLQVYRDMVATGDKNFARAVWPAVYIAIAFLDQ 676

Query: 627 FDKDGDGMIENEGFPDQTYDAWSANGVSAYCGGLWVAALQAASALANDVGDHASASYFWV 686
           FDKDGDGMIEN+GFPDQTYDAWS  GVSAYCGGLWVAALQAASA+A +VGD  +A YFW 
Sbjct: 677 FDKDGDGMIENDGFPDQTYDAWSVTGVSAYCGGLWVAALQAASAMAREVGDSMTADYFWF 736

Query: 687 RYQKAKAVYDSLWNGSYFNYDNSDGSSSTSIQADQLAGQWYARACGLLPIADEAKVKKAL 746
           ++QKAKAVYD LWNGSYFNYDNS GSSS+SIQADQLAGQWYARACGL PI D+ K + AL
Sbjct: 737 KFQKAKAVYDKLWNGSYFNYDNSGGSSSSSIQADQLAGQWYARACGLQPIVDDEKARSAL 796

Query: 747 TKIYDFNVLKVKGGMCGAMNGMQPDGRIDMSGLQAREIWPGVTYGLAASMIQEEMVDMAF 806
            K+Y+FNVLKVK G CGA+NGM PDGR+DMS +Q+REIW GVTY +AA+MI E MV+ AF
Sbjct: 797 EKVYNFNVLKVKEGKCGAVNGMLPDGRVDMSAMQSREIWAGVTYSVAANMIHEGMVETAF 856

Query: 807 QTAAGVYEVAWSEDGLGYSFQTPESWNNNDEYRSLCYMRPLTIWAMQWALTKPKLSRQEI 866
            TA+G+Y+ AWS++GLGYSFQTPE+WN ++EYRSLCYMRPL IWAMQWAL+KP+L   ++
Sbjct: 857 NTASGIYDAAWSQEGLGYSFQTPEAWNTDEEYRSLCYMRPLAIWAMQWALSKPELHNHDM 916

Query: 867 KHEISDRDSSYLEQHAAFSKVASLLKLPKEEASKGFLKVVYDFTIGRVLCHNLN 920
           KHE   + +   E H  F KVA LLKLP+EEASK FL++ +D T  R+    LN
Sbjct: 917 KHE-EGKGTLNFEHHVGFEKVAHLLKLPEEEASKSFLQLFFDLTCRRLYTLRLN 969


>gi|297741007|emb|CBI31319.3| unnamed protein product [Vitis vinifera]
          Length = 953

 Score = 1328 bits (3437), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 657/954 (68%), Positives = 770/954 (80%), Gaps = 35/954 (3%)

Query: 1   MENGVKEEGEN----AASMPKVDPGQPPQLTWQRKLDSNVKVPTGFTLSFQEFRHLAHIG 56
           MENG KE+G+      +S  KV+PG+P  LTWQRKL++     T F L  +E +HLA +G
Sbjct: 1   MENGHKEDGDMEHPVKSSAHKVNPGKPASLTWQRKLNTKANTLTRFNLKLREIKHLAPLG 60

Query: 57  FRLYRYSKQEEAKGKVPVFNVFRKHHITSDQGIPLGGIGAGSIGRSYRGEFQRFKLFHGI 116
            RL+ +   E AKG++ + + F K  +TS  G+PLGGIG GSIGRSYRGEFQR++LF  I
Sbjct: 61  VRLWHHVNAEAAKGRISIIDPFSKRLVTSYHGVPLGGIGGGSIGRSYRGEFQRYQLFPRI 120

Query: 117 CDDAPVLANQFSVFVSRPNGEKFSSVLCPRSPGVPKKNTDSGIESWDWNLKGENCTYHAL 176
           C+D+PVLANQFSVFVSRPNG+K S+VLCPR+P V K +  SGI SWDWNL GE+CTYHAL
Sbjct: 121 CEDSPVLANQFSVFVSRPNGKKSSTVLCPRNPEVLKGSASSGIGSWDWNLDGESCTYHAL 180

Query: 177 FPRAWTVYDGEPDPELRIVCRQISPFIPHNYKESSFPASVFTFTLSNSGQTSADVTLLFT 236
           +PRAWTVY+GEPDPE+ I+  QISPFIPHNYKESSFP SVF FTLSNSG+TSAD+TLLFT
Sbjct: 181 YPRAWTVYEGEPDPEISIISSQISPFIPHNYKESSFPVSVFKFTLSNSGKTSADITLLFT 240

Query: 237 WANSVAGDSGLSGHHFNSKTMTKDGVHGLTLHHRTANGRPPVTFAVAAEETADVHVSECP 296
           WANSV G S  SGHH+NSK  TKDGVHG+ LHH+TANG PPVTFA+AAEET DVH+SECP
Sbjct: 241 WANSVGGTSEFSGHHYNSKMKTKDGVHGVLLHHKTANGHPPVTFAIAAEETGDVHISECP 300

Query: 297 CFLLSGNSKGITAKDMWNEIKKHGSFDHLDNDKTS-PSEPGSSIGAAIAASLTIPSGSTR 355
           CFL+SGNS G+TAK+MW EIK+HGSFDHLD D +S  SEPGSSIGAA+AASLT+P  + R
Sbjct: 301 CFLISGNSLGVTAKEMWQEIKQHGSFDHLDFDGSSMRSEPGSSIGAAVAASLTLPPDTVR 360

Query: 356 SVTFSLAWDCPEVKFFE-KVYHRRYTKFYGTLGDSAARIAHDAILEHAKWECEIEAWQRP 414
           +VTFSLAW CPEV+F   K YHRRYT+FYGT  D+A  IAHDAILEHA W  EIEAWQ P
Sbjct: 361 TVTFSLAWACPEVRFTSGKTYHRRYTRFYGTHVDAAEEIAHDAILEHANWVSEIEAWQGP 420

Query: 415 ILEDKRFPEWYPITLFNELYYLNAGGTIWTDGSPPMQSLATIRERKFSLDTP-------- 466
           ILED+R PEWY ITLFNELY+LNAGGTIWTDG PPMQSLATI + KFSLD          
Sbjct: 421 ILEDRRLPEWYRITLFNELYFLNAGGTIWTDGLPPMQSLATIEQIKFSLDRSISDPKNTT 480

Query: 467 --------------------SKIHTPASSDTALGTRLLENGEENIGQFLYLEGAEYVMYN 506
                                +IH P +S++A GT LL++GEEN+GQFLYLEG EY M+N
Sbjct: 481 DIVHQNDSTVEILGRMTSMLEQIHNPTTSNSAFGTYLLQSGEENVGQFLYLEGIEYHMWN 540

Query: 507 TYDVHFYSSFALVMLFPKLELSIQRDFAAAVMMHDPGTMKIMSDGKWVARKCLGAVPHDI 566
           TYDVHFYSSFA++MLFP+LELSIQRDFAAAVM+HDP  MKIMSDGKWV RK LGAVPHDI
Sbjct: 541 TYDVHFYSSFAIIMLFPQLELSIQRDFAAAVMVHDPSRMKIMSDGKWVPRKVLGAVPHDI 600

Query: 567 GLDDPWFEINSYNLFNSSRWKDLNSKFVLQVYRDFVATGDKNFARAVWPSVYIAMAYMEQ 626
           G+ DPWFE+N+YNL+++ RWKDLNSKFVLQVYRD VATGDKNFARAVWP+VYIA+A+++Q
Sbjct: 601 GISDPWFELNAYNLYDTDRWKDLNSKFVLQVYRDMVATGDKNFARAVWPAVYIAIAFLDQ 660

Query: 627 FDKDGDGMIENEGFPDQTYDAWSANGVSAYCGGLWVAALQAASALANDVGDHASASYFWV 686
           FDKDGDGMIEN+GFPDQTYDAWS  GVSAYCGGLWVAALQAASA+A +VGD  +A YFW 
Sbjct: 661 FDKDGDGMIENDGFPDQTYDAWSVTGVSAYCGGLWVAALQAASAMAREVGDSMTADYFWF 720

Query: 687 RYQKAKAVYDSLWNGSYFNYDNSDGSSSTSIQADQLAGQWYARACGLLPIADEAKVKKAL 746
           ++QKAKAVYD LWNGSYFNYDNS GSSS+SIQADQLAGQWYARACGL PI D+ K + AL
Sbjct: 721 KFQKAKAVYDKLWNGSYFNYDNSGGSSSSSIQADQLAGQWYARACGLQPIVDDEKARSAL 780

Query: 747 TKIYDFNVLKVKGGMCGAMNGMQPDGRIDMSGLQAREIWPGVTYGLAASMIQEEMVDMAF 806
            K+Y+FNVLKVK G CGA+NGM PDGR+DMS +Q+REIW GVTY +AA+MI E MV+ AF
Sbjct: 781 EKVYNFNVLKVKEGKCGAVNGMLPDGRVDMSAMQSREIWAGVTYSVAANMIHEGMVETAF 840

Query: 807 QTAAGVYEVAWSEDGLGYSFQTPESWNNNDEYRSLCYMRPLTIWAMQWALTKPKLSRQEI 866
            TA+G+Y+ AWS++GLGYSFQTPE+WN ++EYRSLCYMRPL IWAMQWAL+KP+L   ++
Sbjct: 841 NTASGIYDAAWSQEGLGYSFQTPEAWNTDEEYRSLCYMRPLAIWAMQWALSKPELHNHDM 900

Query: 867 KHEISDRDSSYLEQHAAFSKVASLLKLPKEEASKGFLKVVYDFTIGRVLCHNLN 920
           KHE   + +   E H  F KVA LLKLP+EEASK FL++ +D T  R+    LN
Sbjct: 901 KHE-EGKGTLNFEHHVGFEKVAHLLKLPEEEASKSFLQLFFDLTCRRLYTLRLN 953


>gi|297739499|emb|CBI29681.3| unnamed protein product [Vitis vinifera]
          Length = 949

 Score = 1316 bits (3406), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 640/944 (67%), Positives = 759/944 (80%), Gaps = 32/944 (3%)

Query: 1   MENGVKEEGENAASMPKVDPGQPPQLTWQRKLDSNVKVPTGFTLSFQEFRHLAHIGFRLY 60
           +E G +E   ++    KVDPG+P  LTWQRKL+S+   P  F ++ +E  HLA IGFRL+
Sbjct: 6   LEEGEREPSNSSIEEIKVDPGKPGSLTWQRKLNSDGNAPVEFKINLRETFHLAPIGFRLW 65

Query: 61  RYSKQEEAKGKVPVFNVFRKHHITSDQGIPLGGIGAGSIGRSYRGEFQRFKLFHGICDDA 120
           R+ ++E AKG+  + + F K +I+S QG+PLGGIGAGSIGRSY+GEFQRF+LF    ++ 
Sbjct: 66  RHVREETAKGRGAMIDPFVKRYISSSQGVPLGGIGAGSIGRSYKGEFQRFQLFPITSENE 125

Query: 121 PVLANQFSVFVSRPNGEKFSSVLCPRSPGVPKKNTDSGIESWDWNLKGENCTYHALFPRA 180
           PVL NQFSVFVSRPNGEK+S+VLC +SP   K+   SGI SWDWNL G   TY AL+PRA
Sbjct: 126 PVLENQFSVFVSRPNGEKYSTVLCRQSPEALKECPPSGIGSWDWNLNGNKSTYLALYPRA 185

Query: 181 WTVYDGEPDPELRIVCRQISPFIPHNYKESSFPASVFTFTLSNSGQTSADVTLLFTWANS 240
           WTVYDGEPDP L+IVCRQISP IPHNYKESSFP +VFTFTL NSG+T+AD+TLLFTWANS
Sbjct: 186 WTVYDGEPDPALKIVCRQISPIIPHNYKESSFPVAVFTFTLFNSGKTAADITLLFTWANS 245

Query: 241 VAGDSGLSGHHFNSKTMTKDGVHGLTLHHRTANGRPPVTFAVAAEETADVHVSECPCFLL 300
           V G SGLSG H NSK M KDGV G+ LHH+TANGRPPVT+A+AA+E   VH+SECPCF +
Sbjct: 246 VGGVSGLSGQHLNSKFMMKDGVRGVLLHHKTANGRPPVTYAIAAQEMDGVHISECPCFFI 305

Query: 301 SGNSKGITAKDMWNEIKKHGSFDHLDNDKTS-PSEPGSSIGAAIAASLTIPSGSTRSVTF 359
           SG++ GITAKDMWNEIK+HGSFD L++ +TS PSE GSS+GAA+AASLTIPS S ++VTF
Sbjct: 306 SGDTPGITAKDMWNEIKEHGSFDRLNSAETSMPSELGSSVGAAVAASLTIPSDSEQTVTF 365

Query: 360 SLAWDCPEVKFF-EKVYHRRYTKFYGTLGDSAARIAHDAILEHAKWECEIEAWQRPILED 418
           SLAWDCPE+ F  E+ Y+RRYTKFYGT GD+AA+IAHDAIL+H  WE +IEAWQ+P+LED
Sbjct: 366 SLAWDCPEINFSKERTYYRRYTKFYGTRGDAAAKIAHDAILDHGHWESQIEAWQKPVLED 425

Query: 419 KRFPEWYPITLFNELYYLNAGGTIWTDGSPPMQSLATIRERKFSLDTP------------ 466
           KRFPEWYPITLFNELYYLN+GGT+WTDGSPP+ S  +I ERKFSLD              
Sbjct: 426 KRFPEWYPITLFNELYYLNSGGTVWTDGSPPVHSFTSIIERKFSLDRSRSDLKNTVSISH 485

Query: 467 ----------------SKIHTPASSDTALGTRLLENGEENIGQFLYLEGAEYVMYNTYDV 510
                            ++HTP +S++A G  LL+ GEENIGQFLYLEG EY M+NT DV
Sbjct: 486 HNDTAVDILERMTSVLEQVHTPVTSNSAFGPNLLQQGEENIGQFLYLEGVEYFMWNTNDV 545

Query: 511 HFYSSFALVMLFPKLELSIQRDFAAAVMMHDPGTMKIMSDGKWVARKCLGAVPHDIGLDD 570
           HFYSSFAL+MLFPKLELSIQRDFAA+VMMHDP  MK++ +GKWV+RK LGAVPHD+G  D
Sbjct: 546 HFYSSFALIMLFPKLELSIQRDFAASVMMHDPSKMKLLCNGKWVSRKVLGAVPHDVGNYD 605

Query: 571 PWFEINSYNLFNSSRWKDLNSKFVLQVYRDFVATGDKNFARAVWPSVYIAMAYMEQFDKD 630
           PWFE+N YNL+N+ RWKDLN KFVLQVYRD VATGDKNFA+AVWPSVY+A+AYM QFDKD
Sbjct: 606 PWFEVNGYNLYNTDRWKDLNPKFVLQVYRDVVATGDKNFAKAVWPSVYVALAYMNQFDKD 665

Query: 631 GDGMIENEGFPDQTYDAWSANGVSAYCGGLWVAALQAASALANDVGDHASASYFWVRYQK 690
           GDGMIENEGFPDQTYD WS +GVSAY GGLWVAALQAASALA  VGD  S  YFW ++QK
Sbjct: 666 GDGMIENEGFPDQTYDTWSVSGVSAYSGGLWVAALQAASALARVVGDKGSEDYFWFKFQK 725

Query: 691 AKAVYDSLWNGSYFNYDNSDGSSSTSIQADQLAGQWYARACGLLPIADEAKVKKALTKIY 750
           AK VY  LWNGSYFNYD+SDGSSS+SIQADQLAGQWYARACGL PI DE K K AL K+Y
Sbjct: 726 AKGVYQKLWNGSYFNYDDSDGSSSSSIQADQLAGQWYARACGLSPIVDEDKAKSALEKVY 785

Query: 751 DFNVLKVKGGMCGAMNGMQPDGRIDMSGLQAREIWPGVTYGLAASMIQEEMVDMAFQTAA 810
            +NVLKV GG  GA+NGM PDG++D + +Q+REIW GVTYG+AA+MI E +VDMAFQTA+
Sbjct: 786 HYNVLKVGGGKRGAVNGMLPDGKVDTTTMQSREIWSGVTYGVAATMIHEGLVDMAFQTAS 845

Query: 811 GVYEVAWSEDGLGYSFQTPESWNNNDEYRSLCYMRPLTIWAMQWALTKPKLSRQEIKHEI 870
           GVYE AWS++GLGYSFQTPESWN +D+YRSLCYMRPL IWAMQWA ++PKL + E   E+
Sbjct: 846 GVYEAAWSQEGLGYSFQTPESWNTDDQYRSLCYMRPLAIWAMQWAFSQPKLHKYEANPEM 905

Query: 871 SDRDSSYLEQHAAFSKVASLLKLPKEEASKGFLKVVYDFTIGRV 914
           +  + S + QHA FS+VA LLKLP E+ S+  L+V+YD+T  R+
Sbjct: 906 N--EDSLVLQHAGFSRVARLLKLPDEDVSRSALQVIYDYTCKRM 947


>gi|359486233|ref|XP_002264575.2| PREDICTED: non-lysosomal glucosylceramidase [Vitis vinifera]
          Length = 960

 Score = 1311 bits (3393), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 637/928 (68%), Positives = 752/928 (81%), Gaps = 32/928 (3%)

Query: 17  KVDPGQPPQLTWQRKLDSNVKVPTGFTLSFQEFRHLAHIGFRLYRYSKQEEAKGKVPVFN 76
           KVDPG+P  LTWQRKL+S+   P  F ++ +E  HLA IGFRL+R+ ++E AKG+  + +
Sbjct: 33  KVDPGKPGSLTWQRKLNSDGNAPVEFKINLRETFHLAPIGFRLWRHVREETAKGRGAMID 92

Query: 77  VFRKHHITSDQGIPLGGIGAGSIGRSYRGEFQRFKLFHGICDDAPVLANQFSVFVSRPNG 136
            F K +I+S QG+PLGGIGAGSIGRSY+GEFQRF+LF    ++ PVL NQFSVFVSRPNG
Sbjct: 93  PFVKRYISSSQGVPLGGIGAGSIGRSYKGEFQRFQLFPITSENEPVLENQFSVFVSRPNG 152

Query: 137 EKFSSVLCPRSPGVPKKNTDSGIESWDWNLKGENCTYHALFPRAWTVYDGEPDPELRIVC 196
           EK+S+VLC +SP   K+   SGI SWDWNL G   TY AL+PRAWTVYDGEPDP L+IVC
Sbjct: 153 EKYSTVLCRQSPEALKECPPSGIGSWDWNLNGNKSTYLALYPRAWTVYDGEPDPALKIVC 212

Query: 197 RQISPFIPHNYKESSFPASVFTFTLSNSGQTSADVTLLFTWANSVAGDSGLSGHHFNSKT 256
           RQISP IPHNYKESSFP +VFTFTL NSG+T+AD+TLLFTWANSV G SGLSG H NSK 
Sbjct: 213 RQISPIIPHNYKESSFPVAVFTFTLFNSGKTAADITLLFTWANSVGGVSGLSGQHLNSKF 272

Query: 257 MTKDGVHGLTLHHRTANGRPPVTFAVAAEETADVHVSECPCFLLSGNSKGITAKDMWNEI 316
           M KDGV G+ LHH+TANGRPPVT+A+AA+E   VH+SECPCF +SG++ GITAKDMWNEI
Sbjct: 273 MMKDGVRGVLLHHKTANGRPPVTYAIAAQEMDGVHISECPCFFISGDTPGITAKDMWNEI 332

Query: 317 KKHGSFDHLDNDKTS-PSEPGSSIGAAIAASLTIPSGSTRSVTFSLAWDCPEVKFF-EKV 374
           K+HGSFD L++ +TS PSE GSS+GAA+AASLTIPS S ++VTFSLAWDCPE+ F  E+ 
Sbjct: 333 KEHGSFDRLNSAETSMPSELGSSVGAAVAASLTIPSDSEQTVTFSLAWDCPEINFSKERT 392

Query: 375 YHRRYTKFYGTLGDSAARIAHDAILEHAKWECEIEAWQRPILEDKRFPEWYPITLFNELY 434
           Y+RRYTKFYGT GD+AA+IAHDAIL+H  WE +IEAWQ+P+LEDKRFPEWYPITLFNELY
Sbjct: 393 YYRRYTKFYGTRGDAAAKIAHDAILDHGHWESQIEAWQKPVLEDKRFPEWYPITLFNELY 452

Query: 435 YLNAGGTIWTDGSPPMQSLATIRERKFSLDTP---------------------------- 466
           YLN+GGT+WTDGSPP+ S  +I ERKFSLD                              
Sbjct: 453 YLNSGGTVWTDGSPPVHSFTSIIERKFSLDRSRSDLKNTVSISHHNDTAVDILERMTSVL 512

Query: 467 SKIHTPASSDTALGTRLLENGEENIGQFLYLEGAEYVMYNTYDVHFYSSFALVMLFPKLE 526
            ++HTP +S++A G  LL+ GEENIGQFLYLEG EY M+NT DVHFYSSFAL+MLFPKLE
Sbjct: 513 EQVHTPVTSNSAFGPNLLQQGEENIGQFLYLEGVEYFMWNTNDVHFYSSFALIMLFPKLE 572

Query: 527 LSIQRDFAAAVMMHDPGTMKIMSDGKWVARKCLGAVPHDIGLDDPWFEINSYNLFNSSRW 586
           LSIQRDFAA+VMMHDP  MK++ +GKWV+RK LGAVPHD+G  DPWFE+N YNL+N+ RW
Sbjct: 573 LSIQRDFAASVMMHDPSKMKLLCNGKWVSRKVLGAVPHDVGNYDPWFEVNGYNLYNTDRW 632

Query: 587 KDLNSKFVLQVYRDFVATGDKNFARAVWPSVYIAMAYMEQFDKDGDGMIENEGFPDQTYD 646
           KDLN KFVLQVYRD VATGDKNFA+AVWPSVY+A+AYM QFDKDGDGMIENEGFPDQTYD
Sbjct: 633 KDLNPKFVLQVYRDVVATGDKNFAKAVWPSVYVALAYMNQFDKDGDGMIENEGFPDQTYD 692

Query: 647 AWSANGVSAYCGGLWVAALQAASALANDVGDHASASYFWVRYQKAKAVYDSLWNGSYFNY 706
            WS +GVSAY GGLWVAALQAASALA  VGD  S  YFW ++QKAK VY  LWNGSYFNY
Sbjct: 693 TWSVSGVSAYSGGLWVAALQAASALARVVGDKGSEDYFWFKFQKAKGVYQKLWNGSYFNY 752

Query: 707 DNSDGSSSTSIQADQLAGQWYARACGLLPIADEAKVKKALTKIYDFNVLKVKGGMCGAMN 766
           D+SDGSSS+SIQADQLAGQWYARACGL PI DE K K AL K+Y +NVLKV GG  GA+N
Sbjct: 753 DDSDGSSSSSIQADQLAGQWYARACGLSPIVDEDKAKSALEKVYHYNVLKVGGGKRGAVN 812

Query: 767 GMQPDGRIDMSGLQAREIWPGVTYGLAASMIQEEMVDMAFQTAAGVYEVAWSEDGLGYSF 826
           GM PDG++D + +Q+REIW GVTYG+AA+MI E +VDMAFQTA+GVYE AWS++GLGYSF
Sbjct: 813 GMLPDGKVDTTTMQSREIWSGVTYGVAATMIHEGLVDMAFQTASGVYEAAWSQEGLGYSF 872

Query: 827 QTPESWNNNDEYRSLCYMRPLTIWAMQWALTKPKLSRQEIKHEISDRDSSYLEQHAAFSK 886
           QTPESWN +D+YRSLCYMRPL IWAMQWA ++PKL + E   E++  + S + QHA FS+
Sbjct: 873 QTPESWNTDDQYRSLCYMRPLAIWAMQWAFSQPKLHKYEANPEMN--EDSLVLQHAGFSR 930

Query: 887 VASLLKLPKEEASKGFLKVVYDFTIGRV 914
           VA LLKLP E+ S+  L+V+YD+T  R+
Sbjct: 931 VARLLKLPDEDVSRSALQVIYDYTCKRM 958


>gi|255575898|ref|XP_002528846.1| conserved hypothetical protein [Ricinus communis]
 gi|223531697|gb|EEF33520.1| conserved hypothetical protein [Ricinus communis]
          Length = 952

 Score = 1298 bits (3359), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 643/943 (68%), Positives = 760/943 (80%), Gaps = 33/943 (3%)

Query: 3   NGVKEEGENAASM-PKVDPGQPPQLTWQRKLDSNVKVPTGFTLSFQEFRHLAHIGFRLYR 61
           NG +E+ E + S+  KVDPG P  LTWQRKL+S     + F LSFQE   LA +G RL+R
Sbjct: 9   NGCQEDREPSDSLLEKVDPGSPASLTWQRKLNSEDIALSQFNLSFQEKFQLAPVGIRLWR 68

Query: 62  YSKQEEAKGKVPVFNVFRKHHITSDQGIPLGGIGAGSIGRSYRGEFQRFKLFHGICDDAP 121
             ++E AKG+V + N F K  ITS  GIPLGGIG+GSIGRSY+GEFQR++LF  IC++ P
Sbjct: 69  LIREETAKGRVSIINPFLKRFITSCHGIPLGGIGSGSIGRSYKGEFQRWQLFPRICEEKP 128

Query: 122 VLANQFSVFVSRPNGEKFSSVLCPRSPGVPKKNTDSGIESWDWNLKGENCTYHALFPRAW 181
           VLANQFSVFVSR +GEK+SSVLCPR+P V  +   SGI SWDWNLKG+N TYHAL+PRAW
Sbjct: 129 VLANQFSVFVSRSSGEKYSSVLCPRNPEVLMEPAISGIGSWDWNLKGDNSTYHALYPRAW 188

Query: 182 TVYDGEPDPELRIVCRQISPFIPHNYKESSFPASVFTFTLSNSGQTSADVTLLFTWANSV 241
           T+YDGEPDPELRIVCRQISP IPHNYKESS+P SVFTFTL NSG+T+ADV+LLFTW NSV
Sbjct: 189 TIYDGEPDPELRIVCRQISPIIPHNYKESSYPVSVFTFTLYNSGKTTADVSLLFTWTNSV 248

Query: 242 AGDSGLSGHHFNSKTMTKDGVHGLTLHHRTANGRPPVTFAVAAEETADVHVSECPCFLLS 301
            G+S  SG HFNS TM +DGVH + LHH+TA G PPVTFA+AA+ET DVHVS+CP F++S
Sbjct: 249 GGNSEYSGQHFNSTTMMEDGVHAVLLHHKTAEGFPPVTFAIAAQETNDVHVSKCPRFVIS 308

Query: 302 GNSKGITAKDMWNEIKKHGSFDHLDNDKTS-PSEPGSSIGAAIAASLTIPSGSTRSVTFS 360
           GN +GITAKDMW+E+K+HGSFD+L +  TS PSEPGSSIGAAIAAS+TIP  + RSVTFS
Sbjct: 309 GNCQGITAKDMWHEVKEHGSFDNLKSAGTSGPSEPGSSIGAAIAASVTIPPDAIRSVTFS 368

Query: 361 LAWDCPEVKFF-EKVYHRRYTKFYGTLGDSAARIAHDAILEHAKWECEIEAWQRPILEDK 419
           L+WDCPEV F   + YHRRYTKFY T GD+AARIAHDAILEH  WE +I AWQRPILEDK
Sbjct: 369 LSWDCPEVYFMGGRTYHRRYTKFYSTHGDAAARIAHDAILEHGLWESQIVAWQRPILEDK 428

Query: 420 RFPEWYPITLFNELYYLNAGGTIWTDGSPPMQSLATIRERKFSLDTP------------- 466
           R PEWYPITLFNELYYLN+GGTIWTDGSPP  +L +IR  KFSLDT              
Sbjct: 429 RLPEWYPITLFNELYYLNSGGTIWTDGSPPYHNLVSIRGSKFSLDTSGAGLKSIIDVTHE 488

Query: 467 ---------------SKIHTPASSDTALGTRLLENGEENIGQFLYLEGAEYVMYNTYDVH 511
                           +IH   +S++A GT LL+ GEENIGQFLYLEG EY M+NTYDVH
Sbjct: 489 NDTAVNILGRMTSALEQIHAHVASNSAFGTNLLQEGEENIGQFLYLEGIEYHMWNTYDVH 548

Query: 512 FYSSFALVMLFPKLELSIQRDFAAAVMMHDPGTMKIMSDGKWVARKCLGAVPHDIGLDDP 571
           FYSSFALVMLFPKLELS+QRDFAAAVMMHDP  M+++ DG+WV RK LGAVPHDIG++DP
Sbjct: 549 FYSSFALVMLFPKLELSVQRDFAAAVMMHDPSKMQLLHDGQWVCRKVLGAVPHDIGINDP 608

Query: 572 WFEINSYNLFNSSRWKDLNSKFVLQVYRDFVATGDKNFARAVWPSVYIAMAYMEQFDKDG 631
           W+E+N+Y+L+N+ RWKDLN KFVLQVYRD VATGDK FA AVWPSVYIAMAYM+QFD+DG
Sbjct: 609 WYEVNAYSLYNTDRWKDLNPKFVLQVYRDVVATGDKKFAEAVWPSVYIAMAYMDQFDRDG 668

Query: 632 DGMIENEGFPDQTYDAWSANGVSAYCGGLWVAALQAASALANDVGDHASASYFWVRYQKA 691
           DGMIEN+GFPDQTYD WS +GVSAY GGLWVAALQAASALA +VGD  S  YFW R+QKA
Sbjct: 669 DGMIENDGFPDQTYDTWSVSGVSAYSGGLWVAALQAASALAREVGDKGSEDYFWARFQKA 728

Query: 692 KAVYDSLWNGSYFNYDNSDGSSSTSIQADQLAGQWYARACGLLPIADEAKVKKALTKIYD 751
           K VYD LWNGSYFNYDNS G +S+SIQADQLAGQWYARACGL PI D+ K + AL K+Y+
Sbjct: 729 KDVYDKLWNGSYFNYDNSGGRNSSSIQADQLAGQWYARACGLFPIVDKDKARSALEKVYN 788

Query: 752 FNVLKVKGGMCGAMNGMQPDGRIDMSGLQAREIWPGVTYGLAASMIQEEMVDMAFQTAAG 811
           +NVLKVK G  GA+NGM PDG++D+S +Q+REIW GVTY LAA+MIQE+M+DMAF TA+G
Sbjct: 789 YNVLKVKDGKRGAINGMLPDGKVDLSSMQSREIWSGVTYALAATMIQEDMLDMAFHTASG 848

Query: 812 VYEVAWSEDGLGYSFQTPESWNNNDEYRSLCYMRPLTIWAMQWALTKPKLSRQEIKHEIS 871
           +YE AWSE GLGYSFQTPE+WNN D+YRSLCYMRPL IWAMQWAL++PKL ++E++ E++
Sbjct: 849 IYEAAWSERGLGYSFQTPEAWNNVDQYRSLCYMRPLAIWAMQWALSRPKLEKEEMEMEVN 908

Query: 872 DRDSSYLEQHAAFSKVASLLKLPKEEASKGFLKVVYDFTIGRV 914
             +   L  HA F+KVA  L+LP+ E S G L+ ++++T  ++
Sbjct: 909 --EDYLLPHHAGFTKVARFLRLPEGEESLGLLQSLFNYTCKKL 949


>gi|449447581|ref|XP_004141546.1| PREDICTED: non-lysosomal glucosylceramidase-like [Cucumis sativus]
          Length = 944

 Score = 1269 bits (3285), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 613/941 (65%), Positives = 731/941 (77%), Gaps = 34/941 (3%)

Query: 7   EEGENAASMPKVDPGQPPQLTWQRKLDSNVKVPTGFTLSFQEFRHLAHIGFRLYRYSKQE 66
           E+G N AS  +VDP +PP LTW+RKLD   K P  F+ +  +  H+   G+RL+R  K+E
Sbjct: 2   EKGGNGASSTEVDPSKPPSLTWKRKLDFTGKSPESFSFTLTDAWHMGMTGYRLWRNGKEE 61

Query: 67  EAKGKVPVFNVFRKHHITSDQGIPLGGIGAGSIGRSYRGEFQRFKLFHGICDDAPVLANQ 126
            AKG++P++  F    IT   G+ LGGIGAGSIGRSYRGEFQRF++F+G C+D PVLANQ
Sbjct: 62  IAKGRIPIYEFFSDVPITCYHGVSLGGIGAGSIGRSYRGEFQRFQMFYGPCEDEPVLANQ 121

Query: 127 FSVFVSRPNGEKFSSVLCPRSPGVPKKNTDSGIESWDWNLKGENCTYHALFPRAWTVYDG 186
           FSVFVSRPNG KFSSVLC   P   K    +GIESWDWNL GEN TYHALFPR+WTVYDG
Sbjct: 122 FSVFVSRPNGNKFSSVLCSAKPQKSKDGKQTGIESWDWNLSGENSTYHALFPRSWTVYDG 181

Query: 187 EPDPELRIVCRQISPFIPHNYKESSFPASVFTFTLSNSGQTSADVTLLFTWANSVAGDSG 246
           EPDP+L+IVCRQ+SP IPHNYKESSFP SVFTF LSN GQTSA VTLLFTWANSV G SG
Sbjct: 182 EPDPDLKIVCRQMSPIIPHNYKESSFPVSVFTFKLSNEGQTSAQVTLLFTWANSVGGKSG 241

Query: 247 LSGHHFNSKTMTKDGVHGLTLHHRTANGRPPVTFAVAAEETADVHVSECPCFLLSGNSKG 306
            +GHHFNSK   +DG  G+ LHH++ANGRP VT+ +AAE T DVHVS CPCF++SG+S+G
Sbjct: 242 FTGHHFNSKMGAEDGAQGVLLHHKSANGRPTVTYGIAAEATDDVHVSLCPCFVISGDSEG 301

Query: 307 ITAKDMWNEIKKHGSFDHLDN-DKTSPSEPGSSIGAAIAASLTIPSGSTRSVTFSLAWDC 365
           I+AKDMW EIK HGSFD+L +      S+PG SIGAA+AA+LTIP  S R+VTFSLAWDC
Sbjct: 302 ISAKDMWQEIKNHGSFDNLGSVGANEGSKPGCSIGAAVAATLTIPPTSARTVTFSLAWDC 361

Query: 366 PEVKFFEKVYHRRYTKFYGTLGDSAARIAHDAILEHAKWECEIEAWQRPILEDKRFPEWY 425
           PEVKF  K YHR+Y+KFYGTLGD+A  IA DAI +H KWE EIEAWQRPI+EDKR P+WY
Sbjct: 362 PEVKFDGKTYHRQYSKFYGTLGDAAEIIARDAISKHGKWEAEIEAWQRPIIEDKRLPDWY 421

Query: 426 PITLFNELYYLNAGGTIWTDGSPPMQSLATIRERKFSLD---------TPSKIHT----- 471
           P+TLFNELY+LN+GGTIWTDG PP+Q+L+TI  +K+ L+          P+  H      
Sbjct: 422 PVTLFNELYFLNSGGTIWTDGLPPLQNLSTISHKKYFLERSKSELNGGAPNGDHRKDVAV 481

Query: 472 ----------------PASSDTALGTRLLENGEENIGQFLYLEGAEYVMYNTYDVHFYSS 515
                              S+ ALGTRLL  GEEN+G  L +EG++Y+M+NTYDVHFYSS
Sbjct: 482 DILERMSQILDQTHGGAGPSNAALGTRLLHPGEENVGNLLLVEGSQYLMWNTYDVHFYSS 541

Query: 516 FALVMLFPKLELSIQRDFAAAVMMHDPGTMKIMSDGKWVARKCLGAVPHDIGLDDPWFEI 575
           FAL+MLFPKLELSIQRDFAAAV+MHDP   K M DG WV RK LGAVPHDIG +DPW E+
Sbjct: 542 FALIMLFPKLELSIQRDFAAAVLMHDPRKAKTMCDGNWVPRKVLGAVPHDIGFNDPWLEV 601

Query: 576 NSYNLFNSSRWKDLNSKFVLQVYRDFVATGDKNFARAVWPSVYIAMAYMEQFDKDGDGMI 635
           N+YNL N SRWKDL SKFVLQVYRD VATGDKNFA++VWPSVY+A+A+MEQFDKD DGMI
Sbjct: 602 NAYNLLNVSRWKDLGSKFVLQVYRDVVATGDKNFAKSVWPSVYVALAFMEQFDKDKDGMI 661

Query: 636 ENEGFPDQTYDAWSANGVSAYCGGLWVAALQAASALANDVGDHASASYFWVRYQKAKAVY 695
           ENEGFPDQTYD W+  GVSAYCGGLWVAALQAASALA++V D A+A YFW++YQKA++VY
Sbjct: 662 ENEGFPDQTYDTWTVKGVSAYCGGLWVAALQAASALASEVDDEAAAHYFWIKYQKARSVY 721

Query: 696 DSLWNGSYFNYDNSDGSSSTSIQADQLAGQWYARACGLLPIADEAKVKKALTKIYDFNVL 755
           ++LWNGSYFNYDNS G  S+SIQADQLAGQWYARACGL PIADE K++ AL KIY+FNV+
Sbjct: 722 ETLWNGSYFNYDNSKGPWSSSIQADQLAGQWYARACGLCPIADEEKIRVALEKIYNFNVM 781

Query: 756 KVKGGMCGAMNGMQPDGRIDMSGLQAREIWPGVTYGLAASMIQEEMVDMAFQTAAGVYEV 815
           KVKGG  GA+NGM PDG +D S LQ +EIW GVTY +AA+MIQE MV+  FQTA G+Y+ 
Sbjct: 782 KVKGGTRGAVNGMFPDGSVDKSILQPKEIWAGVTYSVAATMIQEGMVETGFQTAMGIYQA 841

Query: 816 AWSEDGLGYSFQTPESWNNNDEYRSLCYMRPLTIWAMQWA-LTKPKLSRQEIKHEISDRD 874
           AW++DGLGYSFQTPE+W+ +D +RS+ YMRPL IWAMQWA ++  K ++   K     ++
Sbjct: 842 AWAQDGLGYSFQTPEAWDVDDRFRSIGYMRPLAIWAMQWAMMSDSKPTKVPTKAFSEMQE 901

Query: 875 SSYLEQHAAFSKVASLLKLPKEE--ASKGFLKVVYDFTIGR 913
           S++  QHAAF KVASLLKLP  +  A +  ++  YDF   R
Sbjct: 902 SAFATQHAAFLKVASLLKLPSNDDTARRSLVEAAYDFICKR 942


>gi|297795757|ref|XP_002865763.1| hypothetical protein ARALYDRAFT_331392 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297311598|gb|EFH42022.1| hypothetical protein ARALYDRAFT_331392 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 956

 Score = 1268 bits (3280), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 599/934 (64%), Positives = 726/934 (77%), Gaps = 35/934 (3%)

Query: 17  KVDPGQPPQLTWQRKLDSNVKVPTGFTLSFQEFRHLAHIGFRLYRYSKQEEAKGKVPVFN 76
           KVDP  P  LTWQRK+DS+ K P  F L+ +E   LA +G RL+   ++E AKG++   +
Sbjct: 22  KVDPAVPASLTWQRKIDSDAKPPREFNLTAKEIFQLAPVGIRLWFLCREEAAKGRLAFID 81

Query: 77  VFRKHHITSDQGIPLGGIGAGSIGRSYRGEFQRFKLFHGICDDAPVLANQFSVFVSRPNG 136
            F +H +TS  G+PLGGIGAGSIGRS++GEFQR++LF   C+  PVLANQFS FVSR NG
Sbjct: 82  PFSRHSVTSSHGVPLGGIGAGSIGRSFQGEFQRWQLFPPKCEYEPVLANQFSAFVSRANG 141

Query: 137 EKFSSVLCPRSPGVPKKNTDSGIESWDWNLKGENCTYHALFPRAWTVYDGEPDPELRIVC 196
           +K+S+VLCPR+P + K++++SGI SWDWNLKG+  TYHAL+PR+WT+Y+GEPDPELRI+C
Sbjct: 142 KKYSTVLCPRNPKMGKQDSESGIGSWDWNLKGDKSTYHALYPRSWTMYEGEPDPELRIIC 201

Query: 197 RQISPFIPHNYKESSFPASVFTFTLSNSGQTSADVTLLFTWANSVAGDSGLSGHHFNSKT 256
           RQ+SPFIPHNYKESSFP SVFTFTL N G T+A+VTLLFTWANSV GDS  SG H+NSK 
Sbjct: 202 RQVSPFIPHNYKESSFPVSVFTFTLHNLGNTTANVTLLFTWANSVGGDSEFSGGHYNSKI 261

Query: 257 MTKDGVHGLTLHHRTANGRPPVTFAVAAEETADVHVSECPCFLLSGNSKGITAKDMWNEI 316
           M  DGV G+ LHH+TANG P +++A++A+ T  V VS CP F++SG   GITAKDMW  +
Sbjct: 262 MMNDGVQGVLLHHKTANGLPSLSYAISAQATDGVSVSACPFFIVSGKQNGITAKDMWQAV 321

Query: 317 KKHGSFDHLDNDKTS-PSEPGSSIGAAIAASLTIPSGSTRSVTFSLAWDCPEVKFFE-KV 374
           K++GSFDH    + S  SE GSSIGAA+AASLT+  G +R VTFSLAWDCPEV+F   K+
Sbjct: 322 KENGSFDHFKASEASMQSENGSSIGAAVAASLTVLPGESRIVTFSLAWDCPEVQFPSGKI 381

Query: 375 YHRRYTKFYGTLGDSAARIAHDAILEHAKWECEIEAWQRPILEDKRFPEWYPITLFNELY 434
           Y RRYTKFYG  GD+AA+IA+DAIL H++WE  IE WQRPILEDKR P WYP+TLFNELY
Sbjct: 382 YSRRYTKFYGNHGDAAAQIAYDAILGHSQWESWIEDWQRPILEDKRLPAWYPVTLFNELY 441

Query: 435 YLNAGGTIWTDGSPPMQSLATIRERKFSLD----------------------------TP 466
           YLN+GGT+WTDGS P+ SLA +RE+KFSLD                            T 
Sbjct: 442 YLNSGGTLWTDGSSPVHSLAGVREKKFSLDKSQLGLNNDIEVPHQNDTAISVLGKMASTL 501

Query: 467 SKIHTPASSDTALGTRLLENGEENIGQFLYLEGAEYVMYNTYDVHFYSSFALVMLFPKLE 526
            ++H   +S++A GT+LLE GEENIG FLYLEG EY M+NTYDVHFY+SFALVMLFPKLE
Sbjct: 502 EQLHASTASNSAFGTKLLEEGEENIGHFLYLEGIEYRMWNTYDVHFYASFALVMLFPKLE 561

Query: 527 LSIQRDFAAAVMMHDPGTMKIMSDGKWVARKCLGAVPHDIGLDDPWFEINSYNLFNSSRW 586
           LSIQRDFAAAVM+HDP  +K +S+G+WV RK LGAVPHD+G++DPWFE+N Y L N+ RW
Sbjct: 562 LSIQRDFAAAVMLHDPTKVKTLSEGQWVQRKVLGAVPHDLGINDPWFEVNGYTLHNTDRW 621

Query: 587 KDLNSKFVLQVYRDFVATGDKNFARAVWPSVYIAMAYMEQFDKDGDGMIENEGFPDQTYD 646
           KDLN KFVLQVYRD VATGDK FA AVWPSVY+AMAYM QFDKDGDGMIENEGFPDQTYD
Sbjct: 622 KDLNPKFVLQVYRDVVATGDKKFASAVWPSVYVAMAYMAQFDKDGDGMIENEGFPDQTYD 681

Query: 647 AWSANGVSAYCGGLWVAALQAASALANDVGDHASASYFWVRYQKAKAVYD-SLWNGSYFN 705
            WSA+GVSAYCGGLWVAALQAAS LA  VGD  S  YFW ++QKAK VY+  LWNGSYFN
Sbjct: 682 TWSASGVSAYCGGLWVAALQAASVLARVVGDKNSQDYFWSKFQKAKVVYEKKLWNGSYFN 741

Query: 706 YDNSDGSSSTSIQADQLAGQWYARACGLLPIADEAKVKKALTKIYDFNVLKVKGGMCGAM 765
           YDNS    S+SIQADQLAGQWYARA GLLPI DE K + AL K+Y+FNV+K+K G  GA+
Sbjct: 742 YDNSGSQYSSSIQADQLAGQWYARASGLLPIVDEDKARTALEKVYNFNVMKIKDGKRGAV 801

Query: 766 NGMQPDGRIDMSGLQAREIWPGVTYGLAASMIQEEMVDMAFQTAAGVYEVAWSEDGLGYS 825
           NGM P+G++D + +Q+REIW GVTY L+A+MIQE +V+MAFQTA+GVYE AWSE GLGYS
Sbjct: 802 NGMHPNGKVDTASMQSREIWSGVTYALSATMIQEGLVEMAFQTASGVYEAAWSEAGLGYS 861

Query: 826 FQTPESWNNNDEYRSLCYMRPLTIWAMQWALTKPKLSRQEI----KHEISDRDSSYLEQH 881
           FQTPE+WN NDEYRSL YMRPL IWAMQWALTK    +Q++    + E     +S ++  
Sbjct: 862 FQTPEAWNTNDEYRSLTYMRPLAIWAMQWALTKTSQKQQQLGLEPEQEPEVETNSLMKHD 921

Query: 882 AAFSKVASLLKLPKEEASKGFLKVVYDFTIGRVL 915
             FS+V+ LL LP E ++K  L+ ++D+T  R++
Sbjct: 922 IGFSRVSRLLNLPNEASAKSTLQTLFDYTCRRLM 955


>gi|224129460|ref|XP_002328722.1| predicted protein [Populus trichocarpa]
 gi|222839020|gb|EEE77371.1| predicted protein [Populus trichocarpa]
          Length = 922

 Score = 1263 bits (3267), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 620/922 (67%), Positives = 738/922 (80%), Gaps = 26/922 (2%)

Query: 17  KVDPGQPPQLTWQRKLDSNVKVPTGFTLSFQEFRHLAHIGFRLYRYSKQEEAKGKVPVFN 76
           KVDPG+P  LTWQR+LD++  V + FTL++QE   +A IG RL+RY K+   K K    +
Sbjct: 1   KVDPGKPAPLTWQRRLDTSETVLSQFTLTWQEILRMAPIGIRLWRYVKENAKKKKGIFID 60

Query: 77  VFRKHHITSDQGIPLGGIGAGSIGRSYRGEFQRFKLFHGICDDAPVLANQFSVFVSRPNG 136
            F K ++TS  GIP+GG+G+GSIGRSYRGEFQR++LF  + ++ PVLANQFS+FVSR NG
Sbjct: 61  PFAKRNVTSSHGIPVGGVGSGSIGRSYRGEFQRWQLFPRV-EEKPVLANQFSIFVSRSNG 119

Query: 137 EKFSSVLCPRSPGVPKKNTDSGIESWDWNLKGENCTYHALFPRAWTVYDGEPDPELRIVC 196
           +K+ SVLC RSP   ++   SGIESW+WNLKG+N TYHAL+PRAWTVY+GEPDPELR+VC
Sbjct: 120 KKYCSVLCSRSPDKLEEPAGSGIESWEWNLKGDNSTYHALYPRAWTVYEGEPDPELRVVC 179

Query: 197 RQISPFIPHNYKESSFPASVFTFTLSNSGQTSADVTLLFTWANSVAGDSGLSGHHFNSKT 256
           RQISP IPHNYKESSFP SVFTFTL NSG+T+ADVTLLFTWANSV G S  SG H NS  
Sbjct: 180 RQISPVIPHNYKESSFPVSVFTFTLYNSGETAADVTLLFTWANSVGGVSEFSGQHLNSTK 239

Query: 257 MTKDGVHGLTLHHRTANGRPPVTFAVAAEETADVHVSECPCFLLSGNSKGITAKDMWNEI 316
           M  DGVH + LHH+TAN  PP+TFA+AA+ET  VHVS+CP F++SGNS+G+TAK+MWNE+
Sbjct: 240 MMDDGVHCVLLHHKTANELPPLTFAIAAQETPGVHVSKCPSFVISGNSQGLTAKEMWNEV 299

Query: 317 KKHGSFDHLDND-KTSPSEPGSSIGAAIAASLTIPSGSTRSVTFSLAWDCPEVKFFE-KV 374
           K+HGSFD+L++  K  PSEPGSSIGAAIAA+ T+P  S  +VTFSLAWDCPEV F   + 
Sbjct: 300 KEHGSFDNLNSSGKPVPSEPGSSIGAAIAATSTVPPDSVCTVTFSLAWDCPEVIFASGRT 359

Query: 375 YHRRYTKFYGTLGDSAARIAHDAILEHAKWECEIEAWQRPILEDKRFPEWYPITLFNELY 434
           YHRRYTKFYGT GD+AA IAHDAIL H  W+ +IEAWQRPILEDKR PEWYP+TLFNELY
Sbjct: 360 YHRRYTKFYGTHGDAAANIAHDAILGHGHWDSQIEAWQRPILEDKRLPEWYPVTLFNELY 419

Query: 435 YLNAGGTIWTDGSPPMQSLATIRERKFSLDTP----------------------SKIHTP 472
           YLN+GGTIWTDGS P+ SLAT+  +KFSLD                         +IHTP
Sbjct: 420 YLNSGGTIWTDGSSPLHSLATVGGKKFSLDRTGSDLGHQGDTSVDILGRMTSVLEQIHTP 479

Query: 473 ASSDTALGTRLLENGEENIGQFLYLEGAEYVMYNTYDVHFYSSFALVMLFPKLELSIQRD 532
            ++++ALGT LL+ GEEN+GQFLYLEG EY M+NTYDVHFY+SFAL+MLFPKL+LSIQRD
Sbjct: 480 LATNSALGTNLLQEGEENVGQFLYLEGIEYPMWNTYDVHFYASFALIMLFPKLQLSIQRD 539

Query: 533 FAAAVMMHDPGTMKIMSDGKWVARKCLGAVPHDIGLDDPWFEINSYNLFNSSRWKDLNSK 592
           FAAAVMMHDP  M ++ DG+ V RK LGAVPHDIG+DDPWFE+N+YNL N+ RWKDLN K
Sbjct: 540 FAAAVMMHDPSKMHLLLDGQCVTRKVLGAVPHDIGIDDPWFEVNAYNLHNTDRWKDLNPK 599

Query: 593 FVLQVYRDFVATGDKNFARAVWPSVYIAMAYMEQFDKDGDGMIENEGFPDQTYDAWSANG 652
           FVLQVYRD VATGDK FA+A WPSVY+AMAYM+QFDKDGDGMIEN+GFPDQTYD WS +G
Sbjct: 600 FVLQVYRDVVATGDKKFAQAAWPSVYVAMAYMDQFDKDGDGMIENDGFPDQTYDTWSVSG 659

Query: 653 VSAYCGGLWVAALQAASALANDVGDHASASYFWVRYQKAKAVYDSLWNGSYFNYDNSDGS 712
           VSAYCGGLWVAALQAASALA +VGD  SA YFW R+QKAK VYD LWNGSYFNYD+S+G 
Sbjct: 660 VSAYCGGLWVAALQAASALAWEVGDKDSAEYFWFRFQKAKVVYDKLWNGSYFNYDDSNGR 719

Query: 713 SSTSIQADQLAGQWYARACGLLPIADEAKVKKALTKIYDFNVLKVKGGMCGAMNGMQPDG 772
           +S SIQADQLAGQWYARACGLLPI DE K + AL KIY++N LKV  G  GA+NGM PDG
Sbjct: 720 NSPSIQADQLAGQWYARACGLLPIVDEDKARSALEKIYNYNFLKVHDGKRGAVNGMLPDG 779

Query: 773 RIDMSGLQAREIWPGVTYGLAASMIQEEMVDMAFQTAAGVYEVAWSEDGLGYSFQTPESW 832
            +DMS +Q+REIW GVTY +AA+M+QE ++DMAF TA+GVYE AW+E GLGYSFQTPE W
Sbjct: 780 TVDMSDMQSREIWSGVTYAVAATMMQEGLMDMAFHTASGVYEAAWAEQGLGYSFQTPEGW 839

Query: 833 NNNDEYRSLCYMRPLTIWAMQWALTKPKLSRQEIKHEISDRDSSYLEQ-HAAFSKVASLL 891
           N N +YRSL YMRPL IWAMQW L+ PKL +QE+  ++   DS    Q HA F+KVA  L
Sbjct: 840 NTNGQYRSLGYMRPLAIWAMQWTLSSPKLHKQEMNFQVKLEDSLLGHQHHAGFAKVARFL 899

Query: 892 KLPKEEASKGFLKVVYDFTIGR 913
           KLP+EE+S  +L+ ++D+   +
Sbjct: 900 KLPEEESSVSYLQALFDYACKK 921


>gi|224120058|ref|XP_002331126.1| predicted protein [Populus trichocarpa]
 gi|222872854|gb|EEF09985.1| predicted protein [Populus trichocarpa]
          Length = 948

 Score = 1262 bits (3266), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 627/938 (66%), Positives = 742/938 (79%), Gaps = 29/938 (3%)

Query: 3   NGVKEEGENAASMP--KVDPGQPPQLTWQRKLDSNVKVPTGFTLSFQEFRHLAHIGFRLY 60
           NG  EE    ++ P  KVD G+P  LTWQRKLD +  V + FTLS QE   +A IG RL+
Sbjct: 9   NGFDEEQGEPSNYPVDKVDAGKPAPLTWQRKLDGDETVLSQFTLSLQEKLLMAPIGIRLW 68

Query: 61  RYSKQEEAKGKVPVF-NVFRKHHITSDQGIPLGGIGAGSIGRSYRGEFQRFKLFHGICDD 119
           R+ ++E A+ +   + + F K ++TS  GIP+GGIG+GSIGRSY+GEFQR++LF  IC++
Sbjct: 69  RHIREENARKRGGFYMDPFAKRNVTSCLGIPVGGIGSGSIGRSYKGEFQRWQLFPRICEE 128

Query: 120 APVLANQFSVFVSRPNGEKFSSVLCPRSPGVPKKNTDSGIESWDWNLKGENCTYHALFPR 179
            PVLANQFS+FVSR NG+K+SSVLC RSP V ++   SGIESWDWNLKG N TYHAL+PR
Sbjct: 129 KPVLANQFSIFVSRSNGKKYSSVLCSRSPDVLEEAAGSGIESWDWNLKGNNSTYHALYPR 188

Query: 180 AWTVYDGEPDPELRIVCRQISPFIPHNYKESSFPASVFTFTLSNSGQTSADVTLLFTWAN 239
           AWTVY+GEPDPELR+VCRQISP IPHNYKESSFPASVFTF L NSG+TSADVTLLFTWAN
Sbjct: 189 AWTVYEGEPDPELRVVCRQISPIIPHNYKESSFPASVFTFKLYNSGKTSADVTLLFTWAN 248

Query: 240 SVAGDSGLSGHHFNSKTMTKDGVHGLTLHHRTANGRPPVTFAVAAEETADVHVSECPCFL 299
           SV G S  SG H NS  M +DGVH   L+H+TANG P V+FA+AA+ET  VHVS+CPCF+
Sbjct: 249 SVGGVSEFSGQHLNSTKMMEDGVHCALLNHKTANGLPSVSFAIAAQETPVVHVSKCPCFV 308

Query: 300 LSGNSKGITAKDMWNEIKKHGSFDHLDN-DKTSPSEPGSSIGAAIAASLTIPSGSTRSVT 358
           +SGNS+G+TAK+MWNE+K+HGSFD+L++  K  PSEPGSSIGAA+AA+ T+P     +VT
Sbjct: 309 ISGNSQGVTAKEMWNEVKEHGSFDNLNSPGKLVPSEPGSSIGAAVAATSTVPPDGVCTVT 368

Query: 359 FSLAWDCPEVKFFE-KVYHRRYTKFYGTLGDSAARIAHDAILEHAKWECEIEAWQRPILE 417
           FSLAWDCP+VKF   + YHRRYTKFYGT GD+AA IAHDAILEH  WE EIEAWQRPILE
Sbjct: 369 FSLAWDCPDVKFGSGRTYHRRYTKFYGTHGDAAANIAHDAILEHGLWESEIEAWQRPILE 428

Query: 418 DKRFPEWYPITLFNELYYLNAGGTIWTDGSPPMQSLATIRERKFSLDTP----------- 466
           DKR PEWY  TLFNELYYLN+GGT+WTDGSPP +SLATI   KFSLD             
Sbjct: 429 DKRLPEWYAGTLFNELYYLNSGGTVWTDGSPPFRSLATIEGSKFSLDRAGSNLGHQGDTA 488

Query: 467 -----------SKIHTPASSDTALGTRLLENGEENIGQFLYLEGAEYVMYNTYDVHFYSS 515
                       +IHTP ++++A GT LL+ GEENIGQFLYLEG EY M+NTYDVHFY+S
Sbjct: 489 VDILGRMTSALEEIHTPLTTNSAFGTNLLQEGEENIGQFLYLEGIEYHMWNTYDVHFYAS 548

Query: 516 FALVMLFPKLELSIQRDFAAAVMMHDPGTMKIMSDGKWVARKCLGAVPHDIGLDDPWFEI 575
           FA +MLFPKL+LSIQRDFAAAVMMHDP  M ++ DG+ V RK +GAVPHDIG+ DPWFE+
Sbjct: 549 FAFIMLFPKLQLSIQRDFAAAVMMHDPSMMPLLHDGQRVPRKVIGAVPHDIGIHDPWFEV 608

Query: 576 NSYNLFNSSRWKDLNSKFVLQVYRDFVATGDKNFARAVWPSVYIAMAYMEQFDKDGDGMI 635
           N+YNL N+ RWKDLN KFVLQVYRD +ATGDK FARAVWPSVY+AMAYM+QFD+DGDGMI
Sbjct: 609 NAYNLHNTDRWKDLNPKFVLQVYRDVIATGDKKFARAVWPSVYVAMAYMDQFDRDGDGMI 668

Query: 636 ENEGFPDQTYDAWSANGVSAYCGGLWVAALQAASALANDVGDHASASYFWVRYQKAKAVY 695
           EN+GFPDQTYD WS +GVSAYCGGLWVAALQAASALA +VGD  SA YFW R+QKAK VY
Sbjct: 669 ENDGFPDQTYDTWSLSGVSAYCGGLWVAALQAASALAWEVGDKDSAEYFWCRFQKAKIVY 728

Query: 696 DSLWNGSYFNYDNSDGSSSTSIQADQLAGQWYARACGLLPIADEAKVKKALTKIYDFNVL 755
             LWNGSYFNYDNS G +S+SIQADQLAGQWYARACGL PI DE K + AL KIY++NVL
Sbjct: 729 GKLWNGSYFNYDNSGGINSSSIQADQLAGQWYARACGLSPIVDEDKARCALEKIYNYNVL 788

Query: 756 KVKGGMCGAMNGMQPDGRIDMSGLQAREIWPGVTYGLAASMIQEEMVDMAFQTAAGVYEV 815
           KV+ G  GA+NGM PDG +D+S LQ+REIW GVTY +AA+MIQE + DMAF TA+GVYE 
Sbjct: 789 KVQDGKRGAVNGMLPDGTVDLSCLQSREIWSGVTYAVAATMIQEGLTDMAFHTASGVYEA 848

Query: 816 AWSEDGLGYSFQTPESWNNNDEYRSLCYMRPLTIWAMQWALTKPKLSRQEIKHEISDRDS 875
            W+E GLGYSFQTPE WN  D+YRSLCYMRPL IWAMQWAL++P+L RQE+K  +   + 
Sbjct: 849 VWAEQGLGYSFQTPEGWNTTDQYRSLCYMRPLAIWAMQWALSRPELHRQEMK--LQAEED 906

Query: 876 SYLEQHAAFSKVASLLKLPKEEASKGFLKVVYDFTIGR 913
           S    HA F+KVA  LKLP  E+SK   + ++++   +
Sbjct: 907 SVPVHHAGFAKVARFLKLPHAESSKSHFQSLFEYATTK 944


>gi|356520149|ref|XP_003528727.1| PREDICTED: LOW QUALITY PROTEIN: non-lysosomal
           glucosylceramidase-like [Glycine max]
          Length = 944

 Score = 1261 bits (3264), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 625/923 (67%), Positives = 737/923 (79%), Gaps = 30/923 (3%)

Query: 18  VDPGQPPQLTWQRKLDSNVKVPTGFTLSFQEFRHLAHIGFRLYRYSKQEEAKGKVPVFNV 77
           V+PG+P  LTWQRKL++N    +  +LS +E  HLA IG+RL+R+ ++E AKG+  + + 
Sbjct: 24  VEPGKPAGLTWQRKLNNNGNASSQISLSLKEIIHLAPIGYRLWRHCREEAAKGRGGMIDP 83

Query: 78  FRKHHITSDQGIPLGGIGAGSIGRSYRGEFQRFKLFHGICDDAPVLANQFSVFVSRPNGE 137
           F K H+T   G+PLGGIGAGSIGRS+RGEFQR++LF  IC++ PVLANQFSVFVSRP+GE
Sbjct: 84  FAKRHVTFCHGVPLGGIGAGSIGRSFRGEFQRWQLFPVICEEKPVLANQFSVFVSRPSGE 143

Query: 138 KFSSVLCPRSPGVPKKNTDSGIESWDWNLKGENCTYHALFPRAWTVYDGEPDPELRIVCR 197
           K+SSVLCPR P + K+N  SGIESWDWN+ G + TYHAL+PRAWTVY+ EPDP LRIVCR
Sbjct: 144 KYSSVLCPRKPVIIKQNPVSGIESWDWNINGNSSTYHALYPRAWTVYE-EPDPALRIVCR 202

Query: 198 QISPFIPHNYKESSFPASVFTFTLSNSGQTSADVTLLFTWANSVAGDSGLSGHHFNSKTM 257
           QISP IPHNYKESSFP +VFTFTL N G T+ADVTLLFTW NSV G S  +G HFNSKT 
Sbjct: 203 QISPVIPHNYKESSFPVTVFTFTLKNLGNTTADVTLLFTWTNSVGGISEFTGDHFNSKTT 262

Query: 258 TKDGVHGLTLHHRTANGRPPVTFAVAAEETADVHVSECPCFLLSGNSKGITAKDMWNEIK 317
             DGVH + LHH+TAN R PVTFA+AAEET DVH+SECP F++SG    I+AK+MWNE+K
Sbjct: 263 RNDGVHAVLLHHKTANERSPVTFAIAAEETEDVHISECPVFVISGAYSSISAKNMWNEVK 322

Query: 318 KHGSFDHLDNDKTS-PSEPGSSIGAAIAASLTIPSGSTRSVTFSLAWDCPEVKFFE-KVY 375
           +HGSFDHL+  +TS PSEPGSSIGAAIAA++TIPS + R VTFSLAWDCPEVKF E K Y
Sbjct: 323 QHGSFDHLNFAETSAPSEPGSSIGAAIAATVTIPSNAQRIVTFSLAWDCPEVKFPEGKTY 382

Query: 376 HRRYTKFYGTLGDSAARIAHDAILEHAKWECEIEAWQRPILEDKRFPEWYPITLFNELYY 435
           +RRYTKFYGT GD+AA IAHDAI+EH +WE +I+ WQRPILEDKRFPEWYP TL NELYY
Sbjct: 383 YRRYTKFYGTNGDAAADIAHDAIMEHCQWEAQIDDWQRPILEDKRFPEWYPTTLLNELYY 442

Query: 436 LNAGGTIWTDGSPPMQSLATIRERKFSLD----------------------TPSKIHTPA 473
           LN+GGTIWTDGS P+  L    ERKFSLD                         + H+P 
Sbjct: 443 LNSGGTIWTDGSLPVHGLVNTGERKFSLDGFNNNLSHKNDIAINILEMFNSVVEQTHSPP 502

Query: 474 SSDTALGTRLLENGEENIGQFLYLEGAEYVMYNTYDVHFYSSFALVMLFPKLELSIQRDF 533
           +S +A G  LL+ GEENIGQFLYLEG EY M+NTYDVHFYSSF+LVMLFPKLELSIQRDF
Sbjct: 503 ASKSAYGVNLLQEGEENIGQFLYLEGIEYQMWNTYDVHFYSSFSLVMLFPKLELSIQRDF 562

Query: 534 AAAVMMHDPGTMKIMSDGKWVARKCLGAVPHDIGLDDPWFEINSYNLFNSSRWKDLNSKF 593
           AAAV+MHDP  MK++ DG+W  RK LGAVPHDIGL+DPWFE+N YNL+N+ RWKDLN KF
Sbjct: 563 AAAVLMHDPSKMKLLYDGQWAPRKVLGAVPHDIGLNDPWFEVNGYNLYNTDRWKDLNPKF 622

Query: 594 VLQVYRDFVATGDKNFARAVWPSVYIAMAYMEQFDKDGDGMIENEGFPDQTYDAWSANGV 653
           VLQ+YRD VATGDK FA+AVWP+VYIA+AYM+QFDKDGDGMIENEGFPDQTYD WS +GV
Sbjct: 623 VLQIYRDVVATGDKKFAQAVWPAVYIAIAYMDQFDKDGDGMIENEGFPDQTYDTWSVSGV 682

Query: 654 SAYCGGLWVAALQAASALANDVGDHASASYFWVRYQKAKAVYDSLWNGSYFNYDNSDGSS 713
           SAY GGLWVAALQAASALA +VGD  S  YFW+++QKAK+VY++LWNGSYFNYD+S GSS
Sbjct: 683 SAYSGGLWVAALQAASALAREVGDKGSEDYFWLKFQKAKSVYENLWNGSYFNYDSSGGSS 742

Query: 714 STSIQADQLAGQWYARACGLLPIADEAKVKKALTKIYDFNVLKVKGGMCGAMNGMQPDGR 773
           S SIQADQLAGQWYARACGLLPI +E K + AL  +Y+ NV+KV  G  GA+NGM PDG+
Sbjct: 743 SLSIQADQLAGQWYARACGLLPIVEEKKSRSALQIVYNHNVMKVHXGRRGAVNGMLPDGK 802

Query: 774 IDMSGLQAREIWPGVTYGLAASMIQEEMVDMAFQTAAGVYEVAWSEDGLGYSFQTPESWN 833
           +DMS +Q+REIW GVTY LAA+MIQE M+DMAFQTA GVYE AWS +GLGYSFQTPE+W 
Sbjct: 803 VDMSTMQSREIWSGVTYALAATMIQENMIDMAFQTAGGVYETAWSNNGLGYSFQTPEAWT 862

Query: 834 NNDEYRSLCYMRPLTIWAMQWALTKPKLSRQEIKHEISDRD--SSYLEQHAAFSKVASLL 891
             DEYRSLCYMRPL IWAMQW L++ K  + E K ++ ++D  S Y   H  FSKVA LL
Sbjct: 863 TKDEYRSLCYMRPLAIWAMQWELSRAKHIQHESKSDMKEKDMLSRY---HDGFSKVARLL 919

Query: 892 KLPKEEASKGFLKVVYDFTIGRV 914
           K+ +E  S+   +V+YDFT  R+
Sbjct: 920 KVKEETDSRSLFQVIYDFTCKRM 942


>gi|30695777|ref|NP_199801.2| Beta-glucosidase, GBA2 type family protein [Arabidopsis thaliana]
 gi|27311753|gb|AAO00842.1| putative protein [Arabidopsis thaliana]
 gi|34365725|gb|AAQ65174.1| At5g49900 [Arabidopsis thaliana]
 gi|332008486|gb|AED95869.1| Beta-glucosidase, GBA2 type family protein [Arabidopsis thaliana]
          Length = 957

 Score = 1252 bits (3240), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 607/955 (63%), Positives = 736/955 (77%), Gaps = 40/955 (4%)

Query: 1   MENGVKEEGENA----ASMPKVDPGQPPQLTWQRKLDSNVKVPTGFTLSFQEFRHLAHIG 56
            E  + + GE+     +S  KVDP  P  LTWQRK+DS+VK P  F LS +E   LA +G
Sbjct: 2   FEEKIMDIGEDVKPLNSSDTKVDPAVPASLTWQRKIDSDVKAPREFNLSVKEIFQLAPVG 61

Query: 57  FRLYRYSKQEEAKGKVPVFNVFRKHHITSDQGIPLGGIGAGSIGRSYRGEFQRFKLFHGI 116
            RL+   ++E AKG++   + F KH +TS  G+PLGGIGAGSIGRS++GEFQR++LF   
Sbjct: 62  IRLWFLCREEAAKGRLAFIDPFSKHSVTSSHGVPLGGIGAGSIGRSFKGEFQRWQLFPPK 121

Query: 117 CDDAPVLANQFSVFVSRPNGEKFSSVLCPRSPGVPKKNTDSGIESWDWNLKGENCTYHAL 176
           C+D PVLANQFS FVSR NG+K+SSVLCPR+P + K++++SGI SWDWNLKG+  TYHAL
Sbjct: 122 CEDEPVLANQFSAFVSRANGKKYSSVLCPRNPKLDKQDSESGIGSWDWNLKGDKSTYHAL 181

Query: 177 FPRAWTVYDGEPDPELRIVCRQISPFIPHNYKESSFPASVFTFTLSNSGQTSADVTLLFT 236
           +PR+WT+Y+GEPDPELRIVCRQ+SPFIPHNYKESSFP SVFTFTL N G T+ADVTLLFT
Sbjct: 182 YPRSWTMYEGEPDPELRIVCRQVSPFIPHNYKESSFPVSVFTFTLHNLGNTTADVTLLFT 241

Query: 237 WANSVAGDSGLSGHHFNSKTMTKDGVHGLTLHHRTANGRPPVTFAVAAEETADVHVSECP 296
           WANSV GDS  SG H+NSK    DGV G+ LHH+TANG P +++A++A+ T  V VS CP
Sbjct: 242 WANSVGGDSEFSGGHYNSKITMNDGVQGVLLHHKTANGLPSLSYAISAQATDGVSVSACP 301

Query: 297 CFLLSGNSKGITAKDMWNEIKKHGSFDHLDNDKTS-PSEPGSSIGAAIAASLTIPSGSTR 355
            F++SG   GITAKDMW  +K++GSFDHL   + S  S+ GSSIGAA+AAS+T+  G +R
Sbjct: 302 FFIVSGKQDGITAKDMWQAVKENGSFDHLKASEASMQSDHGSSIGAAVAASVTVLPGESR 361

Query: 356 SVTFSLAWDCPEVKFFE-KVYHRRYTKFYGTLGDSAARIAHDAILEHAKWECEIEAWQRP 414
            VTFSLAWDCPEV+F   K+Y RRYTKFYG  GD+AA+IAHDAIL H++WE  IE WQRP
Sbjct: 362 IVTFSLAWDCPEVQFPSGKIYSRRYTKFYGNNGDAAAQIAHDAILGHSQWESWIEDWQRP 421

Query: 415 ILEDKRFPEWYPITLFNELYYLNAGGTIWTDGSPPMQSLATIRERKFSLD---------- 464
           ILEDKR P WYP+TLFNELYYLN+GGT+WTDGS P+ SLA +RE+KFSLD          
Sbjct: 422 ILEDKRLPAWYPVTLFNELYYLNSGGTLWTDGSSPVHSLAGVREKKFSLDKSQLGLKNDI 481

Query: 465 ------------------TPSKIHTPASSDTALGTRLLENGEENIGQFLYLEGAEYVMYN 506
                             T  ++H   +S++A GT+LLE GEENIG FLYLEG EY M+N
Sbjct: 482 DVPHQNDTAVSVLEKMASTLEELHASTTSNSAFGTKLLEEGEENIGHFLYLEGIEYRMWN 541

Query: 507 TYDVHFYSSFALVMLFPKLELSIQRDFAAAVMMHDPGTMKIMSDGKWVARKCLGAVPHDI 566
           TYDVHFY+SFALVMLFPKLELSIQRDFAAAVM+HDP  +K +S+G+WV RK LGAVPHD+
Sbjct: 542 TYDVHFYASFALVMLFPKLELSIQRDFAAAVMLHDPTKVKTLSEGQWVQRKVLGAVPHDL 601

Query: 567 GLDDPWFEINSYNLFNSSRWKDLNSKFVLQVYRDFVATGDKNFARAVWPSVYIAMAYMEQ 626
           G++DPWFE+N Y L N+ RWKDLN KFVLQVYRD VATGDK FA AVWPSVY+AMAYM Q
Sbjct: 602 GINDPWFEVNGYTLHNTDRWKDLNPKFVLQVYRDVVATGDKKFASAVWPSVYVAMAYMAQ 661

Query: 627 FDKDGDGMIENEGFPDQTYDAWSANGVSAYCGGLWVAALQAASALANDVGDHASASYFWV 686
           FDKDGDGMIENEGFPDQTYD WSA+GVSAYCGGLWVAALQAASALA  VGD  S  YFW 
Sbjct: 662 FDKDGDGMIENEGFPDQTYDTWSASGVSAYCGGLWVAALQAASALARVVGDKNSQDYFWS 721

Query: 687 RYQKAKAVYD-SLWNGSYFNYDNSDGSSSTSIQADQLAGQWYARACGLLPIADEAKVKKA 745
           ++QKAK VY+  LWNGSYFNYDNS    S++IQADQLAGQWYARA GLLPI DE K + A
Sbjct: 722 KFQKAKVVYEKKLWNGSYFNYDNSGSQYSSTIQADQLAGQWYARASGLLPIVDEDKARTA 781

Query: 746 LTKIYDFNVLKVKGGMCGAMNGMQPDGRIDMSGLQAREIWPGVTYGLAASMIQEEMVDMA 805
           L K+Y++NV+K+K G  GA+NGM P+G++D + +Q+REIW GVTY L+A+MIQE +V+MA
Sbjct: 782 LEKVYNYNVMKIKDGKRGAVNGMHPNGKVDTASMQSREIWSGVTYALSATMIQEGLVEMA 841

Query: 806 FQTAAGVYEVAWSEDGLGYSFQTPESWNNNDEYRSLCYMRPLTIWAMQWALTKPKLSRQE 865
           FQTA+G+YE AWSE GLGYSFQTPESWN  DEYRSL YMRPL IWAMQWALTK    +++
Sbjct: 842 FQTASGIYEAAWSETGLGYSFQTPESWNTVDEYRSLTYMRPLAIWAMQWALTKTSQKQEQ 901

Query: 866 I-----KHEISDRDSSYLEQHAAFSKVASLLKLPKEEASKGFLKVVYDFTIGRVL 915
           +     + E     SS ++    FS+V+ LL LP E ++K  L+ ++D+T  R++
Sbjct: 902 LGLEPEQQEPELEPSSSMKHDIGFSRVSRLLSLPNEASAKSTLQTLFDYTCRRMM 956


>gi|110742221|dbj|BAE99037.1| hypothetical protein [Arabidopsis thaliana]
          Length = 957

 Score = 1251 bits (3236), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 606/955 (63%), Positives = 736/955 (77%), Gaps = 40/955 (4%)

Query: 1   MENGVKEEGENA----ASMPKVDPGQPPQLTWQRKLDSNVKVPTGFTLSFQEFRHLAHIG 56
            E  + + GE+     +S  KVDP  P  LTWQRK+DS+VK P  F LS +E   LA +G
Sbjct: 2   FEEKIMDIGEDVKPLNSSDTKVDPAVPASLTWQRKIDSDVKAPREFNLSVKEIFQLAPVG 61

Query: 57  FRLYRYSKQEEAKGKVPVFNVFRKHHITSDQGIPLGGIGAGSIGRSYRGEFQRFKLFHGI 116
            RL+   ++E AKG++   + F KH +TS  G+PLGGIGAGSIGRS++GEFQR++LF   
Sbjct: 62  IRLWFLCREEAAKGRLAFIDPFSKHSVTSSHGVPLGGIGAGSIGRSFKGEFQRWQLFPPK 121

Query: 117 CDDAPVLANQFSVFVSRPNGEKFSSVLCPRSPGVPKKNTDSGIESWDWNLKGENCTYHAL 176
           C+D PVLANQFS FVSR NG+K+SSVLCPR+P + K++++SGI SWDWNLKG+  TYHAL
Sbjct: 122 CEDEPVLANQFSAFVSRANGKKYSSVLCPRNPKLDKQDSESGIGSWDWNLKGDKSTYHAL 181

Query: 177 FPRAWTVYDGEPDPELRIVCRQISPFIPHNYKESSFPASVFTFTLSNSGQTSADVTLLFT 236
           +PR+WT+Y+GEPDPELRIVCRQ+SPFIPHNYKESSFP SVFTFTL N G T+ADVTLLFT
Sbjct: 182 YPRSWTMYEGEPDPELRIVCRQVSPFIPHNYKESSFPVSVFTFTLHNLGNTTADVTLLFT 241

Query: 237 WANSVAGDSGLSGHHFNSKTMTKDGVHGLTLHHRTANGRPPVTFAVAAEETADVHVSECP 296
           WANSV GDS  SG H+NSK    DGV G+ LHH+TANG P +++A++A+ T  V VS CP
Sbjct: 242 WANSVGGDSEFSGGHYNSKITMNDGVQGVLLHHKTANGLPSLSYAISAQATDGVSVSACP 301

Query: 297 CFLLSGNSKGITAKDMWNEIKKHGSFDHLDNDKTS-PSEPGSSIGAAIAASLTIPSGSTR 355
            F++SG   GITAKDMW  +K++GSFDHL   + S  S+ GSSIGAA+AAS+T+  G +R
Sbjct: 302 FFIVSGKQDGITAKDMWQAVKENGSFDHLKASEASMQSDHGSSIGAAVAASVTVLPGESR 361

Query: 356 SVTFSLAWDCPEVKFFE-KVYHRRYTKFYGTLGDSAARIAHDAILEHAKWECEIEAWQRP 414
            VTFSLAWDCPEV+F   K+Y RRYTKFYG  GD+AA+IAHDAIL H++WE  IE WQRP
Sbjct: 362 IVTFSLAWDCPEVQFPSGKIYSRRYTKFYGNNGDAAAQIAHDAILGHSQWESWIEDWQRP 421

Query: 415 ILEDKRFPEWYPITLFNELYYLNAGGTIWTDGSPPMQSLATIRERKFSLD---------- 464
           ILEDKR P WYP+TLFNELYYLN+GGT+WTDGS P+ SLA +RE+KFSLD          
Sbjct: 422 ILEDKRLPAWYPVTLFNELYYLNSGGTLWTDGSSPVHSLAGVREKKFSLDKSQLGLKNDI 481

Query: 465 ------------------TPSKIHTPASSDTALGTRLLENGEENIGQFLYLEGAEYVMYN 506
                             T  ++H   +S++A GT+LLE GEENIG FLYLEG EY M+N
Sbjct: 482 DVPHQNDTAVSVLEKMASTLEELHASTTSNSAFGTKLLEEGEENIGHFLYLEGIEYRMWN 541

Query: 507 TYDVHFYSSFALVMLFPKLELSIQRDFAAAVMMHDPGTMKIMSDGKWVARKCLGAVPHDI 566
           TYDVHFY+SFALVMLFPKLELSIQRDFAAAVM+HDP  +K +S+G+WV RK LGAVPHD+
Sbjct: 542 TYDVHFYASFALVMLFPKLELSIQRDFAAAVMLHDPTKVKTLSEGQWVQRKVLGAVPHDL 601

Query: 567 GLDDPWFEINSYNLFNSSRWKDLNSKFVLQVYRDFVATGDKNFARAVWPSVYIAMAYMEQ 626
           G++DPWFE+N Y L N+ RWKDLN KFVLQVYRD VATGDK FA AVWPSVY+AMAYM Q
Sbjct: 602 GINDPWFEVNGYTLHNTDRWKDLNPKFVLQVYRDVVATGDKKFASAVWPSVYVAMAYMAQ 661

Query: 627 FDKDGDGMIENEGFPDQTYDAWSANGVSAYCGGLWVAALQAASALANDVGDHASASYFWV 686
           FDKDGDGMIENEGFPDQTYD WSA+GVSAYCGGLWVAALQAASALA  VGD  S  YFW 
Sbjct: 662 FDKDGDGMIENEGFPDQTYDTWSASGVSAYCGGLWVAALQAASALARVVGDKNSQDYFWS 721

Query: 687 RYQKAKAVYD-SLWNGSYFNYDNSDGSSSTSIQADQLAGQWYARACGLLPIADEAKVKKA 745
           ++QKAK VY+  LWNGSYFNYDNS    S++IQADQLAGQWYARA GLLPI DE K + A
Sbjct: 722 KFQKAKVVYEKKLWNGSYFNYDNSGSQYSSTIQADQLAGQWYARASGLLPIVDEDKARTA 781

Query: 746 LTKIYDFNVLKVKGGMCGAMNGMQPDGRIDMSGLQAREIWPGVTYGLAASMIQEEMVDMA 805
           L K+Y++NV+K+K G  GA+NGM P+G++D + +Q+REIW GVTY L+A+MIQE +V+MA
Sbjct: 782 LEKVYNYNVMKIKDGKRGAVNGMHPNGKVDTASMQSREIWSGVTYALSATMIQEGLVEMA 841

Query: 806 FQTAAGVYEVAWSEDGLGYSFQTPESWNNNDEYRSLCYMRPLTIWAMQWALTKPKLSRQE 865
           FQTA+G+YE AWS+ GLGYSFQTPESWN  DEYRSL YMRPL IWAMQWALTK    +++
Sbjct: 842 FQTASGIYEAAWSKTGLGYSFQTPESWNTVDEYRSLTYMRPLAIWAMQWALTKTSQKQEQ 901

Query: 866 I-----KHEISDRDSSYLEQHAAFSKVASLLKLPKEEASKGFLKVVYDFTIGRVL 915
           +     + E     SS ++    FS+V+ LL LP E ++K  L+ ++D+T  R++
Sbjct: 902 LGLEPEQQEPELEPSSSMKHDIGFSRVSRLLSLPNEASAKSTLQTLFDYTCRRMM 956


>gi|222423179|dbj|BAH19567.1| AT5G49900 [Arabidopsis thaliana]
          Length = 957

 Score = 1248 bits (3229), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 606/955 (63%), Positives = 734/955 (76%), Gaps = 40/955 (4%)

Query: 1   MENGVKEEGENA----ASMPKVDPGQPPQLTWQRKLDSNVKVPTGFTLSFQEFRHLAHIG 56
            E  + + GE+     +S  KVDP  P  LTWQRK+DS+VK P  F LS +E   LA +G
Sbjct: 2   FEEKIMDIGEDVKPLNSSDTKVDPAVPASLTWQRKIDSDVKAPREFNLSVKEIFQLAPVG 61

Query: 57  FRLYRYSKQEEAKGKVPVFNVFRKHHITSDQGIPLGGIGAGSIGRSYRGEFQRFKLFHGI 116
            RL+   ++E AKG++   + F KH +TS  G+PLGGIGAGSIGRS++GEFQR++LF   
Sbjct: 62  IRLWFLCREEAAKGRLAFIDPFSKHSVTSSHGVPLGGIGAGSIGRSFKGEFQRWQLFPPK 121

Query: 117 CDDAPVLANQFSVFVSRPNGEKFSSVLCPRSPGVPKKNTDSGIESWDWNLKGENCTYHAL 176
           C+D PVLANQFS FVSR NG+K+SSVLCPR+P + K++++SGI SWDWNLKG+  TYHAL
Sbjct: 122 CEDEPVLANQFSAFVSRANGKKYSSVLCPRNPKLDKQDSESGIGSWDWNLKGDKSTYHAL 181

Query: 177 FPRAWTVYDGEPDPELRIVCRQISPFIPHNYKESSFPASVFTFTLSNSGQTSADVTLLFT 236
           +PR+WT+Y+GEPDPELRIVCRQ+SPFIPHNYKESSFP SVFTFTL N G T+ADVTLLFT
Sbjct: 182 YPRSWTMYEGEPDPELRIVCRQVSPFIPHNYKESSFPVSVFTFTLHNLGNTTADVTLLFT 241

Query: 237 WANSVAGDSGLSGHHFNSKTMTKDGVHGLTLHHRTANGRPPVTFAVAAEETADVHVSECP 296
           WANSV GDS  SG H+NSK    DGV G+ LHH+TANG P +++A++A+ T  V VS CP
Sbjct: 242 WANSVGGDSEFSGGHYNSKITMNDGVQGVLLHHKTANGLPSLSYAISAQATDGVSVSACP 301

Query: 297 CFLLSGNSKGITAKDMWNEIKKHGSFDHLDNDKTS-PSEPGSSIGAAIAASLTIPSGSTR 355
            F++SG   GITAKDMW  +K++GSFDHL   + S  S+ GSSIGAA+AAS+T+  G +R
Sbjct: 302 FFIVSGKQDGITAKDMWQAVKENGSFDHLKASEASMQSDHGSSIGAAVAASVTVLPGESR 361

Query: 356 SVTFSLAWDCPEVKFFE-KVYHRRYTKFYGTLGDSAARIAHDAILEHAKWECEIEAWQRP 414
            VTFSLAWDCPEV+F   K+Y RRYTKFYG  GD+AA+IAHDAIL H++W   IE WQRP
Sbjct: 362 IVTFSLAWDCPEVQFPSGKIYSRRYTKFYGNNGDAAAQIAHDAILGHSQWGSWIEDWQRP 421

Query: 415 ILEDKRFPEWYPITLFNELYYLNAGGTIWTDGSPPMQSLATIRERKFSLD---------- 464
           ILEDKR P WYP+TLFNELYYLN+GGT+WTDGS P+ SLA +RE+KFSLD          
Sbjct: 422 ILEDKRLPAWYPVTLFNELYYLNSGGTLWTDGSSPVHSLAGVREKKFSLDKSQLGLKNDI 481

Query: 465 ------------------TPSKIHTPASSDTALGTRLLENGEENIGQFLYLEGAEYVMYN 506
                             T   +H   +S++A GT+LLE GEENIG FLYLEG EY M+N
Sbjct: 482 DVPHQNDTAVSVLEKMASTLEGLHASTTSNSAFGTKLLEEGEENIGHFLYLEGIEYRMWN 541

Query: 507 TYDVHFYSSFALVMLFPKLELSIQRDFAAAVMMHDPGTMKIMSDGKWVARKCLGAVPHDI 566
           TYDVHFY+SFALVMLFPKLELSIQRDFAAAVM+HDP  +K +S+G+WV RK LGAVPHD+
Sbjct: 542 TYDVHFYASFALVMLFPKLELSIQRDFAAAVMLHDPTKVKTLSEGQWVQRKVLGAVPHDL 601

Query: 567 GLDDPWFEINSYNLFNSSRWKDLNSKFVLQVYRDFVATGDKNFARAVWPSVYIAMAYMEQ 626
           G++DPWFE+N Y L N+ RWKDLN KFVLQVYRD VATGDK FA AVWPSVY+AMAYM Q
Sbjct: 602 GINDPWFEVNGYTLHNTDRWKDLNPKFVLQVYRDVVATGDKKFASAVWPSVYVAMAYMAQ 661

Query: 627 FDKDGDGMIENEGFPDQTYDAWSANGVSAYCGGLWVAALQAASALANDVGDHASASYFWV 686
           FDKDGDGMIENEGFPDQTYD WSA+GVSAYCGGLWVAALQAASALA  VGD  S  YFW 
Sbjct: 662 FDKDGDGMIENEGFPDQTYDTWSASGVSAYCGGLWVAALQAASALARVVGDKNSQDYFWS 721

Query: 687 RYQKAKAVYD-SLWNGSYFNYDNSDGSSSTSIQADQLAGQWYARACGLLPIADEAKVKKA 745
           ++QKAK VY+  LWNGSYFNYDNS    S++IQADQLAGQWYARA GLLPI DE K + A
Sbjct: 722 KFQKAKVVYEKKLWNGSYFNYDNSGSQYSSTIQADQLAGQWYARASGLLPIVDEDKARTA 781

Query: 746 LTKIYDFNVLKVKGGMCGAMNGMQPDGRIDMSGLQAREIWPGVTYGLAASMIQEEMVDMA 805
           L K+Y++NV+K+K G  GA+NGM P+G++D + +Q+REIW GVTY L+A+MIQE +V+MA
Sbjct: 782 LEKVYNYNVMKIKDGKRGAVNGMHPNGKVDTASMQSREIWSGVTYALSATMIQEGLVEMA 841

Query: 806 FQTAAGVYEVAWSEDGLGYSFQTPESWNNNDEYRSLCYMRPLTIWAMQWALTKPKLSRQE 865
           FQTA+G+YE AWSE GLGYSFQTPESWN  DEYRSL YMRPL IWAMQWALTK    +++
Sbjct: 842 FQTASGIYEAAWSETGLGYSFQTPESWNTVDEYRSLTYMRPLAIWAMQWALTKTSQKQEQ 901

Query: 866 I-----KHEISDRDSSYLEQHAAFSKVASLLKLPKEEASKGFLKVVYDFTIGRVL 915
           +     + E     SS ++    FS+V+ LL LP E ++K  L+ ++D+T  R++
Sbjct: 902 LGLEPEQQEPELEPSSSMKHDIGFSRVSRLLSLPNEASAKSTLQTLFDYTCRRMM 956


>gi|297851790|ref|XP_002893776.1| hypothetical protein ARALYDRAFT_473529 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297339618|gb|EFH70035.1| hypothetical protein ARALYDRAFT_473529 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 947

 Score = 1230 bits (3182), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 585/935 (62%), Positives = 713/935 (76%), Gaps = 39/935 (4%)

Query: 18  VDPGQPPQLTWQRKLDSNVKVPTGFTLSFQEFRHLAHIGFRLYRYSKQEEAKGKVPVFNV 77
           +D   PP  +W+RKL+S  K P+ F LS ++  HL  +G+RL+R++K E AKG+  +F++
Sbjct: 13  IDDKLPP-FSWERKLNSQAKTPSEFKLSKRDHLHLFPLGYRLWRHTKDEAAKGRASIFDI 71

Query: 78  FRKHHITSDQGIPLGGIGAGSIGRSYRGEFQRFKLFHGICDDAPVLANQFSVFVSRPNGE 137
           FRKHHIT D G+PLGGIGAGSIGRSY+GEFQ+FKLF  IC++AP+L NQFS FVSRP G 
Sbjct: 72  FRKHHITGDHGVPLGGIGAGSIGRSYKGEFQQFKLFPKICEEAPILTNQFSAFVSRPGGV 131

Query: 138 KFSSVLCPRSPGVPKKN---------TDSGIESWDWNLKGENCTYHALFPRAWTVYDGEP 188
           K S+VLCP  P V K N          + GIESWDWN+ GE  TYHAL+PR+WTVYDGEP
Sbjct: 132 KHSTVLCPSKPQVIKDNGGYLCKGQVPNIGIESWDWNMTGEKSTYHALYPRSWTVYDGEP 191

Query: 189 DPELRIVCRQISPFIPHNYKESSFPASVFTFTLSNSGQTSADVTLLFTWANSVAGDSGLS 248
           DPELRIV RQ+SPFIPHNY+ESS P SVF FT++N+G   A VTLLFTW NSV G SGL+
Sbjct: 192 DPELRIVSRQVSPFIPHNYEESSLPVSVFNFTVTNTGAEQAIVTLLFTWENSVGGASGLT 251

Query: 249 GHHFNSKTMTKDGVHGLTLHHRTANGRPPVTFAVAAEETADVHVSECPCFLLSGNSKG-I 307
           G HFNS    KDGVH + L H+TANG PPV++A+AA+ET DV VS CPCF++SG +   I
Sbjct: 252 GQHFNSTMKAKDGVHAVALQHKTANGHPPVSYAIAAKETEDVRVSSCPCFIVSGTTPNKI 311

Query: 308 TAKDMWNEIKKHGSFDHLDNDKTSPSEPGSSIGAAIAASLTIPSGSTRSVTFSLAWDCPE 367
           TA DMW+EIKK+ SFD L ++  SPS+PG+SIGAAIAA + +P G  R+VTFSL+WDCPE
Sbjct: 312 TAGDMWDEIKKNASFDKLTSNACSPSKPGTSIGAAIAAKVKVPPGCDRTVTFSLSWDCPE 371

Query: 368 VKFFEKVYHRRYTKFYGTLGDSAARIAHDAILEHAKWECEIEAWQRPILEDKRFPEWYPI 427
            +F EK YHRRYT+FYG+LG++A  +AHDA+L  ++WE +IE WQ PIL D   PEWY I
Sbjct: 372 ARFDEKTYHRRYTRFYGSLGNAAVAMAHDALLNFSEWETQIEEWQAPILADTTLPEWYRI 431

Query: 428 TLFNELYYLNAGGTIWTDGSPPMQSLATIRERKFSLDTP--------------------- 466
           TLFNELYY N+GGT+WTDG PP QSL +I  RK SL                        
Sbjct: 432 TLFNELYYFNSGGTMWTDGLPPKQSLDSIGRRKISLSISTTDNTDPDQNDIALDILGRID 491

Query: 467 ---SKIHTPASSDTALGTRLLENGEENIGQFLYLEGAEYVMYNTYDVHFYSSFALVMLFP 523
              S+IH P SS+ ALGT +++N  ENIGQFLYLEG +Y+MYNTYDVHFYSSFAL+MLFP
Sbjct: 492 AVCSQIHAPLSSNAALGTTMIQNTSENIGQFLYLEGIQYLMYNTYDVHFYSSFALLMLFP 551

Query: 524 KLELSIQRDFAAAVMMHDPGTMKIMSDGKWVARKCLGAVPHDIGLDDPWFEINSYNLFNS 583
           KLELSIQRDFAAAV+MHD    ++MS G++V RK LGAVPHDIGL+DPWFE+N+YNLFN+
Sbjct: 552 KLELSIQRDFAAAVLMHDSSKKQVMSSGEFVTRKALGAVPHDIGLNDPWFEVNAYNLFNT 611

Query: 584 SRWKDLNSKFVLQVYRDFVATGDKNFARAVWPSVYIAMAYMEQFDKDGDGMIENEGFPDQ 643
            RWKDLNSKFVLQVYRD VATGD NFA+AVWPSVY A+AY++QFDKDGDGMIENEGFPDQ
Sbjct: 612 DRWKDLNSKFVLQVYRDVVATGDLNFAKAVWPSVYTAIAYLDQFDKDGDGMIENEGFPDQ 671

Query: 644 TYDAWSANGVSAYCGGLWVAALQAASALANDVGDHASASYFWVRYQKAKAVYDSLWNGSY 703
           TYDAWS +GVSAYCGGLWVAALQA SALA +VGD+ +A YF  +Y+KA++VY+ LWNGSY
Sbjct: 672 TYDAWSCSGVSAYCGGLWVAALQAGSALAREVGDNGAAVYFNAKYEKARSVYEKLWNGSY 731

Query: 704 FNYDNSDGSSSTSIQADQLAGQWYARACGLLPIADEAKVKKALTKIYDFNVLKVKGGMCG 763
           FNYDNS   SS+SI ADQLAGQWYARACGL PIA E  +KKAL  +YDFNV++V+ G  G
Sbjct: 732 FNYDNSRSGSSSSILADQLAGQWYARACGLKPIAKEEWIKKALETVYDFNVMRVRDGTRG 791

Query: 764 AMNGMQPDGRIDMSGLQAREIWPGVTYGLAASMIQEEMVDMAFQTAAGVYEVAWSEDGLG 823
           A+NGM PDGR+D S + +RE+W G TY +AA MIQE + D  F+TA+G+YE AWS+ GLG
Sbjct: 792 AVNGMLPDGRVDTSTMVSREVWAGTTYSVAACMIQEGLADKGFRTASGIYEAAWSDRGLG 851

Query: 824 YSFQTPESWNNNDEYRSLCYMRPLTIWAMQWALTKPKLSRQE---IKHEISDRDSSYLEQ 880
            +FQTPE+W  NDE+RSLCYMRPL IW +QWA T P+ +R++   +K +  D  S   +Q
Sbjct: 852 CAFQTPEAWTTNDEFRSLCYMRPLAIWGIQWAHTMPQPNREQEVSLKQQEEDATSVLFQQ 911

Query: 881 HAAFSKVASLLKLPKEEASKGFLKVVYDFTIGRVL 915
           HA F KVA  LK  K +  +  L+  Y+ T  RV+
Sbjct: 912 HAGFIKVAHYLKNTKGKDHRNRLQTAYE-TFLRVI 945


>gi|242070593|ref|XP_002450573.1| hypothetical protein SORBIDRAFT_05g007160 [Sorghum bicolor]
 gi|241936416|gb|EES09561.1| hypothetical protein SORBIDRAFT_05g007160 [Sorghum bicolor]
          Length = 951

 Score = 1214 bits (3140), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 594/932 (63%), Positives = 713/932 (76%), Gaps = 46/932 (4%)

Query: 17  KVDPGQPPQLTWQRKLDSNVKVPTGFTLSFQEFRHLAHIGFRLYRYSKQEEAKGKVPVFN 76
           KV  GQ P LTW+ KL         F L+++E   LA IG RL R+  +E +KG++ V +
Sbjct: 28  KVYSGQLPVLTWEHKLTRGRHDLPPFRLTWRETMQLAGIGLRLSRHILEETSKGRIAVID 87

Query: 77  VFRKHHITSDQGIPLGGIGAGSIGRSYRGEFQRFKLFHGICDDAPVLANQFSVFVSRPNG 136
             +K    S QG+PLGG    SIGRSY+G+FQR++LF G C+D PVLANQFS F+SR +G
Sbjct: 88  PMKKRSARSGQGVPLGG----SIGRSYKGDFQRWQLFPGTCEDKPVLANQFSAFISRQDG 143

Query: 137 EKFSSVLCPRSPGVPKKNTDSGIESWDWNLKGENCTYHALFPRAWTVYDGEPDPELRIVC 196
            K+S+VL P  P +PK +  SGI SWDWN+ GE  TYHAL+PRAWTVYDGEPDPEL IVC
Sbjct: 144 RKYSTVLHPGKPDLPKGSDISGIGSWDWNMSGEQSTYHALYPRAWTVYDGEPDPELNIVC 203

Query: 197 RQISPFIPHNYKESSFPASVFTFTLSNSGQTSADVTLLFTWANSVAGDSGLSGHHFNSKT 256
           RQISP IPHNY++SS+P +VFTFT++NSG T+ADVTLLFTWANSV G S L+G+H NS  
Sbjct: 204 RQISPIIPHNYQQSSYPVAVFTFTVTNSGHTAADVTLLFTWANSVGGKSELTGYHSNSSM 263

Query: 257 MTKDGVHGLTLHHRTANGRPPVTFAVAAEETADVHVSECPCFLLSGNSKG--ITAKDMWN 314
           + KDGVHG+ LHHRTA+G+PPVTF +AA+E  D+ +S+CP F++SG+S     TAKDMWN
Sbjct: 264 IEKDGVHGILLHHRTADGQPPVTFVIAAQEKEDILISQCPYFVISGSSASDEFTAKDMWN 323

Query: 315 EIKKHGSFDHLDNDKTSP-SEPGSSIGAAIAASLTIPSGSTRSVTFSLAWDCPEVKFFE- 372
            +K+HGSFDHLD  KTS  S PGSSIGAAIAAS+ +   +T+ ++FSLAW CPEVKF   
Sbjct: 324 SVKEHGSFDHLDPIKTSMCSRPGSSIGAAIAASVKLAPKATKDISFSLAWACPEVKFSSG 383

Query: 373 KVYHRRYTKFYGTLGDSAARIAHDAILEHAKWECEIEAWQRPILEDKRFPEWYPITLFNE 432
           K YHRRYTKFYGT GD+AA +AHDAILEHA WE +I+ WQ PIL+D+RFP WYP+TLFNE
Sbjct: 384 KTYHRRYTKFYGTDGDAAATLAHDAILEHASWERQIDEWQDPILQDERFPAWYPVTLFNE 443

Query: 433 LYYLNAGGTIWTDGSPPMQSLATIRERKFSLDTPS------------------------- 467
           LYYLNAGGTIWTDG PP+QSL  I  +KFSLD  +                         
Sbjct: 444 LYYLNAGGTIWTDGLPPIQSLTAIGWKKFSLDMLNGESDDFNEMVRQNNTASDILHRMAS 503

Query: 468 ---KIHTPASSDTALGTRLLENGEENIGQFLYLEGAEYVMYNTYDVHFYSSFALVMLFPK 524
              ++H   +S++A+GT LL+ GEENIGQFLYLEG EY M+NTYDVHFY+SF+L+MLFPK
Sbjct: 504 VFERMHASIASNSAIGTTLLQ-GEENIGQFLYLEGIEYYMWNTYDVHFYASFSLIMLFPK 562

Query: 525 LELSIQRDFAAAVMMHDPGTMKIMSDGKWVARKCLGAVPHDIGLDDPWFEINSYNLFNSS 584
           L+LS+QRDFAAAVMMHDP  ++I+ DGKW ARK LGAVPHD+GL DPWF++N+Y L+N+ 
Sbjct: 563 LQLSVQRDFAAAVMMHDPEKLRILHDGKWAARKVLGAVPHDLGLYDPWFKVNAYTLYNTD 622

Query: 585 RWKDLNSKFVLQVYRDFVATGDKNFARAVWPSVYIAMAYMEQFDKDGDGMIENEGFPDQT 644
           RWKDLN KFVLQVYRD VATGDK+FARAVWPSVYIAMAYMEQFDKD DGMIENE FPDQT
Sbjct: 623 RWKDLNPKFVLQVYRDVVATGDKSFARAVWPSVYIAMAYMEQFDKDKDGMIENEDFPDQT 682

Query: 645 YDAWSANGVSAYCGGLWVAALQAASALANDVGDHASASYFWVRYQKAKAVYDSLWNGSYF 704
           YD WS  G+SAYCGGLWVAALQAASALA +VGD AS   FW +Y+KAK+VY  LWNGSYF
Sbjct: 683 YDVWSMAGISAYCGGLWVAALQAASALAREVGDKASEKLFWDKYEKAKSVYSKLWNGSYF 742

Query: 705 NYDNSDGSSSTSIQADQLAGQWYARACGLLPIADEAKVKKALTKIYDFNVLKVKGGMCGA 764
           +YD+ D   STSIQADQLAGQWYA+ACGL PI D+ K + AL KI+ FNV+K K G  GA
Sbjct: 743 SYDDGDNKVSTSIQADQLAGQWYAKACGLFPIVDKDKAQSALEKIFSFNVMKFKDGTRGA 802

Query: 765 MNGMQPDGRIDMSGLQAREIWPGVTYGLAASMIQEEMVDMAFQTAAGVYEVAWSEDGLGY 824
           MNGM PDG +DMS +Q+REIWPGVTY LAASMIQE MV+  F+TA G+Y  AWS +GLGY
Sbjct: 803 MNGMWPDGTVDMSAMQSREIWPGVTYALAASMIQEGMVEQGFKTAEGIYHAAWSPEGLGY 862

Query: 825 SFQTPESWNNNDEYRSLCYMRPLTIWAMQWALTKPKLSRQEIKHEISDRD---SSYLEQH 881
           SFQTPE+WNN+DEYRSLCYMRPL IWAMQWAL+ PKL      H     D    S+ +  
Sbjct: 863 SFQTPEAWNNDDEYRSLCYMRPLAIWAMQWALSNPKL------HNTPQADIPQDSFPKNQ 916

Query: 882 AAFSKVASLLKLPKEEASKGFLKVVYDFTIGR 913
            A++++  LL+LP+EE SK FL+VVY+    R
Sbjct: 917 FAYARITKLLQLPEEETSKSFLRVVYEIIRNR 948


>gi|357157041|ref|XP_003577664.1| PREDICTED: non-lysosomal glucosylceramidase-like [Brachypodium
           distachyon]
          Length = 951

 Score = 1206 bits (3120), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 587/928 (63%), Positives = 721/928 (77%), Gaps = 38/928 (4%)

Query: 18  VDPGQPPQLTWQRKLDSNVK--VPTGFTLSFQEFRHLAHIGFRLYRYSKQEEAKGKVPVF 75
           VDPGQ P+LTW+ KL S+V+  +P+ F L ++E   +A +GFRL ++  +E +KG+  + 
Sbjct: 27  VDPGQLPELTWEHKL-SHVRYDLPS-FGLKWREAVKMAGLGFRLGQHIVEETSKGRTAII 84

Query: 76  NVFRKHHITSDQGIPLGGIGAGSIGRSYRGEFQRFKLFHGICDDAPVLANQFSVFVSRPN 135
           +  +K    S QG+PLGGIGAGSIGRSY+GEFQR++LF G C+D PVLANQFS F+SR +
Sbjct: 85  DPMKKRTAKSGQGVPLGGIGAGSIGRSYKGEFQRWQLFPGACEDKPVLANQFSAFISRQD 144

Query: 136 GEKFSSVLCPRSPGVPKKNTDSGIESWDWNLKGENCTYHALFPRAWTVYDGEPDPELRIV 195
           G K+S+VL P  P +PK    SGI SWDWNL G+  TYHAL+PRAWTVYDGEPDP+L+IV
Sbjct: 145 GRKYSTVLHPGKPDLPKGTNISGIGSWDWNLSGQKSTYHALYPRAWTVYDGEPDPDLKIV 204

Query: 196 CRQISPFIPHNYKESSFPASVFTFTLSNSGQTSADVTLLFTWANSVAGDSGLSGHHFNSK 255
           CRQ+SP IPHNY++SS+PA+VFTFT++NSG T+ DVTLLFTWANSV G S L+G+H NS 
Sbjct: 205 CRQVSPIIPHNYQQSSYPAAVFTFTVANSGNTAVDVTLLFTWANSVGGKSELTGYHCNSS 264

Query: 256 TMTKDGVHGLTLHHRTANGRPPVTFAVAAEETADVHVSECPCFLLSGNSKGITAKDMWNE 315
            + KDGVHG+ LHHRTA+G+PPVTFA+AA+E  DVH+SECP F++SG+S   TAKDMWN 
Sbjct: 265 MLEKDGVHGILLHHRTADGQPPVTFAIAAQEKEDVHISECPYFVMSGSSDEFTAKDMWNS 324

Query: 316 IKKHGSFDHLDNDKTSP-SEPGSSIGAAIAASLTIPSGSTRSVTFSLAWDCPEVKFFE-K 373
           +K+HGSFD LD  + S  S PG+SIGAAIAAS+ +   ST+ V+FSLAW CPEVKF   K
Sbjct: 325 VKEHGSFDLLDPIEASICSRPGTSIGAAIAASVKLAPQSTKDVSFSLAWACPEVKFSSGK 384

Query: 374 VYHRRYTKFYGTLGDSAARIAHDAILEHAKWECEIEAWQRPILEDKRFPEWYPITLFNEL 433
            YHRRYTKFYGT  D+AA +AHDAI++H+ WE +IE WQ PIL+DKRFP WYP+TLFNEL
Sbjct: 385 TYHRRYTKFYGTDVDAAASLAHDAIVDHSSWEKQIEEWQNPILQDKRFPAWYPVTLFNEL 444

Query: 434 YYLNAGGTIWTDGSPPMQSLATIRERKFSLDTPS-------------------------- 467
           YYLNAGG+IWTDG PP+QSL  I  +KFSLD  +                          
Sbjct: 445 YYLNAGGSIWTDGLPPIQSLTAIGGKKFSLDMSNGETDDVDEMIPHNNTATDILQQMASI 504

Query: 468 --KIHTPASSDTALGTRLLENGEENIGQFLYLEGAEYVMYNTYDVHFYSSFALVMLFPKL 525
             +IH   +S++A+GT LL+ GEENIGQFLYLEG EY M+NTYDVHFY+SFAL+MLFPKL
Sbjct: 505 LERIHASLASNSAIGTTLLQ-GEENIGQFLYLEGIEYYMWNTYDVHFYASFALIMLFPKL 563

Query: 526 ELSIQRDFAAAVMMHDPGTMKIMSDGKWVARKCLGAVPHDIGLDDPWFEINSYNLFNSSR 585
           +LSIQRDFAAAVMMHDP  +K++ DGK   RK LGAVPHD+GL DPWF++N+Y L N+ R
Sbjct: 564 QLSIQRDFAAAVMMHDPEKLKLLHDGKLAPRKVLGAVPHDLGLYDPWFKVNAYTLHNTDR 623

Query: 586 WKDLNSKFVLQVYRDFVATGDKNFARAVWPSVYIAMAYMEQFDKDGDGMIENEGFPDQTY 645
           WKDLN KFVLQVYRD VATG+K+FARAVWPSVY+AMAYMEQFDKD DGMIENE FPDQTY
Sbjct: 624 WKDLNPKFVLQVYRDVVATGNKSFARAVWPSVYMAMAYMEQFDKDKDGMIENEDFPDQTY 683

Query: 646 DAWSANGVSAYCGGLWVAALQAASALANDVGDHASASYFWVRYQKAKAVYDSLWNGSYFN 705
           D WS  GVSAYCGGLWVAALQAASALA +VGD AS   FW +Y+KAK+VY  LWNGSYFN
Sbjct: 684 DVWSMAGVSAYCGGLWVAALQAASALAREVGDKASEELFWNKYEKAKSVYGKLWNGSYFN 743

Query: 706 YDNSDGSSSTSIQADQLAGQWYARACGLLPIADEAKVKKALTKIYDFNVLKVKGGMCGAM 765
           YD++    STSI ADQLAGQWYA++CGL  I D+ K + AL KIY FNV+K K G  GAM
Sbjct: 744 YDDAGTKVSTSIHADQLAGQWYAKSCGLSSIVDKDKSQSALEKIYSFNVMKFKDGKRGAM 803

Query: 766 NGMQPDGRIDMSGLQAREIWPGVTYGLAASMIQEEMVDMAFQTAAGVYEVAWSEDGLGYS 825
           NGM PDG +DMS +Q+REIWPGVTY LAASMIQE MV+  F+TA G+Y  AWS +GLGY+
Sbjct: 804 NGMWPDGTVDMSTMQSREIWPGVTYALAASMIQEGMVEEGFKTAEGIYHAAWSPEGLGYA 863

Query: 826 FQTPESWNNNDEYRSLCYMRPLTIWAMQWALTKPKLSRQEIKHEISDRDSSYLEQHAAFS 885
           FQTPE+W N+D YRSLCYMRPL IW++QWAL+ PKL ++  +    D   S+ +   +++
Sbjct: 864 FQTPEAWTNDDGYRSLCYMRPLAIWSIQWALSSPKLHKEPQRDLAQD---SFPKNQFSYA 920

Query: 886 KVASLLKLPKEEASKGFLKVVYDFTIGR 913
           ++A LL+LP++E+SK FL+V+Y+    R
Sbjct: 921 RIAKLLQLPEDESSKSFLQVIYEIVRNR 948


>gi|449463809|ref|XP_004149624.1| PREDICTED: non-lysosomal glucosylceramidase-like [Cucumis sativus]
 gi|449494604|ref|XP_004159595.1| PREDICTED: non-lysosomal glucosylceramidase-like [Cucumis sativus]
          Length = 951

 Score = 1202 bits (3111), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 607/946 (64%), Positives = 726/946 (76%), Gaps = 36/946 (3%)

Query: 1   MENGVKEEGENA----ASMPKVDPGQPPQLTWQRKLDSNVKVPTGFTLSFQEFRHLAHIG 56
           + NG+ E+ E+          +DPGQ P LTWQRKL+        F L  +E  HLA +G
Sbjct: 7   LGNGLVEQDEDINKSLTEKTGIDPGQTPSLTWQRKLNREGTSLWQFRLHLKEIIHLAPVG 66

Query: 57  FRLYRYSKQEEAKGKVPVFNVFRKHHITSDQGIPLGGIGAGSIGRSYRGEFQRFKLFHGI 116
           +RL+RY ++E  KG+  + N F +  IT    IPLGG+G+GSIGRSYRGEFQR++LF   
Sbjct: 67  YRLFRYIREESVKGRGAMINPFIRRRITDTHAIPLGGLGSGSIGRSYRGEFQRWQLFPRK 126

Query: 117 CDDAPVLANQFSVFVSRPNGEKFSSVLCPRSPGVPKKNTDSGIESWDWNLKGENCTYHAL 176
           C+D P+LANQFSVFVSRPN EK+S+VLC +SP   ++   SGI SWDWNLKG + TYHAL
Sbjct: 127 CEDKPILANQFSVFVSRPNAEKYSTVLCAQSPETQREVESSGIGSWDWNLKGHSSTYHAL 186

Query: 177 FPRAWTVYDGEPDPELRIVCRQISPFIPHNYKESSFPASVFTFTLSNSGQTSADVTLLFT 236
           +PRAWT+YDGEPDPELRIVCRQISP IPHNYKESS+P SVFTFTL NSG+T ADV+LLFT
Sbjct: 187 YPRAWTIYDGEPDPELRIVCRQISPVIPHNYKESSYPVSVFTFTLHNSGKTDADVSLLFT 246

Query: 237 WANSVAGDSGLSGHHFNSKTMTKDGVHGLTLHHRTANGRPPVTFAVAAEETADVHVSECP 296
           WANSV G S  SG+H NS+T  KDGVH + LHH+TA+G PPVT+A+AA+E   VHVS+CP
Sbjct: 247 WANSVGGLSEYSGNHINSRTKKKDGVHTVLLHHKTASGFPPVTYAIAAQEGNGVHVSDCP 306

Query: 297 CFLLSGNSKGITAKDMWNEIKKHGSFDHLD-NDKTSPSEPGSSIGAAIAASLTIPSGSTR 355
           CF++SGNS+GI+AKDMW EIK+HGSFD L   D + PSE GSSIGAA++AS+T+ S S R
Sbjct: 307 CFVISGNSQGISAKDMWLEIKEHGSFDRLKFADMSMPSEVGSSIGAAVSASVTVSSDSVR 366

Query: 356 SVTFSLAWDCPEVKFFE-KVYHRRYTKFYGTLGDSAARIAHDAILEHAKWECEIEAWQRP 414
           +VTFSL+WDCPEV F   K YHRRYTKFYG LGD+AA IA DAILEH  WE +I+AWQRP
Sbjct: 367 TVTFSLSWDCPEVNFCRGKTYHRRYTKFYGNLGDAAADIARDAILEHHHWESQIDAWQRP 426

Query: 415 ILEDKRFPEWYPITLFNELYYLNAGGTIWTDGSPPMQSLATIRERKFSLDTP-------- 466
           +LEDKRFP+WYP+TLFNELYYLNAGGTIWTDGS P+QSL +I ER F LD          
Sbjct: 427 VLEDKRFPKWYPVTLFNELYYLNAGGTIWTDGSLPIQSLVSIGERGFCLDQCRSGDSVIY 486

Query: 467 -------------------SKIHTPASSDTALGTRLLENGEENIGQFLYLEGAEYVMYNT 507
                               ++    +S++A G  LL+ GEEN+GQFLYLEG EY M+NT
Sbjct: 487 ESHENDTANDILGRMTSRLDELRDSVASNSAFGVNLLQKGEENVGQFLYLEGVEYTMWNT 546

Query: 508 YDVHFYSSFALVMLFPKLELSIQRDFAAAVMMHDPGTMKIMSDGKWVARKCLGAVPHDIG 567
           YDVHFYSSFA++MLFPKLELSIQRDFAAAVMMHDP  M+++ +GKW AR  LGAVPHDIG
Sbjct: 547 YDVHFYSSFAIIMLFPKLELSIQRDFAAAVMMHDPSKMQLLDNGKWEARNVLGAVPHDIG 606

Query: 568 LDDPWFEINSYNLFNSSRWKDLNSKFVLQVYRDFVATGDKNFARAVWPSVYIAMAYMEQF 627
           ++DPWFE+N YNL+N+ RWKDLN KFVLQ+YRD VATGD  FA+AVWP VY+A+AYM+QF
Sbjct: 607 VNDPWFEVNGYNLYNTDRWKDLNPKFVLQIYRDVVATGDMKFAKAVWPPVYLAIAYMDQF 666

Query: 628 DKDGDGMIENEGFPDQTYDAWSANGVSAYCGGLWVAALQAASALANDVGDHASASYFWVR 687
           D+DGDGMIEN+GFPDQTYD WS +GVSAY GGLWVAALQAASALA    +  +  YFW +
Sbjct: 667 DRDGDGMIENDGFPDQTYDTWSVSGVSAYSGGLWVAALQAASALARVADEKDAEHYFWFK 726

Query: 688 YQKAKAVYDSLWNGSYFNYDNSDGSSSTSIQADQLAGQWYARACGLLPIADEAKVKKALT 747
           +QKAK  Y+ LWNGSYFNYD+S GSSS+SIQADQLAGQWYARA GLLPI DE K K AL 
Sbjct: 727 FQKAKRAYEKLWNGSYFNYDSSGGSSSSSIQADQLAGQWYARASGLLPIVDEDKAKSALA 786

Query: 748 KIYDFNVLKVKGGMCGAMNGMQPDGRIDMSGLQAREIWPGVTYGLAASMIQEEMVDMAFQ 807
           K+Y++NV+KVK G  GA+NGM PDG ID S +Q+REIW GVTY +AASMI E+M DMAF+
Sbjct: 787 KVYNYNVMKVKDGKRGAVNGMLPDGTIDFSSMQSREIWSGVTYAVAASMIHEDMTDMAFR 846

Query: 808 TAAGVYEVAWSEDGLGYSFQTPESWNNNDEYRSLCYMRPLTIWAMQWALTKPKLSRQEIK 867
           TA G++E AWSEDGLGY+FQTPE+W   D YRSLCYMRPL IWAMQWA ++     +E+K
Sbjct: 847 TAEGIHEAAWSEDGLGYNFQTPEAWTTTDRYRSLCYMRPLAIWAMQWAFSEKVSVIEELK 906

Query: 868 HEISDRDSSYLEQHAAFSKVASLLKLPKEEASKGFLKVVYDFTIGR 913
              SD  S +   HA FSKVA  LKLP++  S   L+ VYD+T+ R
Sbjct: 907 ELDSDAISRH---HARFSKVARYLKLPEDGTSSSVLQTVYDYTLKR 949


>gi|30692911|ref|NP_174631.2| Beta-glucosidase, GBA2 type protein [Arabidopsis thaliana]
 gi|79319136|ref|NP_001031135.1| Beta-glucosidase, GBA2 type protein [Arabidopsis thaliana]
 gi|332193493|gb|AEE31614.1| Beta-glucosidase, GBA2 type protein [Arabidopsis thaliana]
 gi|332193494|gb|AEE31615.1| Beta-glucosidase, GBA2 type protein [Arabidopsis thaliana]
          Length = 947

 Score = 1201 bits (3106), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 575/928 (61%), Positives = 707/928 (76%), Gaps = 38/928 (4%)

Query: 18  VDPGQPPQLTWQRKLDSNVKVPTGFTLSFQEFRHLAHIGFRLYRYSKQEEAKGKVPVFNV 77
           +D   PP  +W+RKL+S  K P+ F LS ++  HL  +G+RL+R++K E AKG+  +F++
Sbjct: 13  IDDKLPP-FSWERKLNSQAKTPSEFKLSKRDHMHLFPLGYRLWRHTKDEAAKGRASIFDI 71

Query: 78  FRKHHITSDQGIPLGGIGAGSIGRSYRGEFQRFKLFHGICDDAPVLANQFSVFVSRPNGE 137
           FRKHHIT D G+PLGGIG+GSIGRSY+GEFQ+FKLF  IC++AP+L NQFS FVSRP G 
Sbjct: 72  FRKHHITGDHGVPLGGIGSGSIGRSYKGEFQQFKLFPKICEEAPILTNQFSAFVSRPGGV 131

Query: 138 KFSSVLCPRSPGVPKKN---------TDSGIESWDWNLKGENCTYHALFPRAWTVYDGEP 188
           K S+VLCP  P V K N          + GIESWDWN+ GE  TYHAL+PR+WTVYDGEP
Sbjct: 132 KHSTVLCPSKPQVVKDNGGYLCQGQAPNMGIESWDWNMTGEKSTYHALYPRSWTVYDGEP 191

Query: 189 DPELRIVCRQISPFIPHNYKESSFPASVFTFTLSNSGQTSADVTLLFTWANSVAGDSGLS 248
           DPELRIV RQ+SPFIPHNY+ESS P SVF FT++N+G   A VTLLFTW NSV G SGL+
Sbjct: 192 DPELRIVSRQVSPFIPHNYEESSLPVSVFNFTVTNTGAEPAIVTLLFTWENSVGGASGLT 251

Query: 249 GHHFNSKTMTKDGVHGLTLHHRTANGRPPVTFAVAAEETADVHVSECPCFLLSGNSKG-I 307
           G HFNS    KDGVH + L H+TANG PPV++A+AA+ET DV VS CPCF++SG +   I
Sbjct: 252 GQHFNSTMKAKDGVHAVALQHKTANGHPPVSYAIAAKETEDVRVSSCPCFIVSGTTPNQI 311

Query: 308 TAKDMWNEIKKHGSFDHLDNDKTSPSEPGSSIGAAIAASLTIPSGSTRSVTFSLAWDCPE 367
           TA DMW+EIKK+ SFD L ++  SPS+PG+SIGAAIAA + +P G  R+VTFSL+WDCPE
Sbjct: 312 TAGDMWDEIKKNASFDKLTSNACSPSKPGTSIGAAIAAKVKVPPGCDRTVTFSLSWDCPE 371

Query: 368 VKFFEKVYHRRYTKFYGTLGDSAARIAHDAILEHAKWECEIEAWQRPILEDKRFPEWYPI 427
            +F EK YHRRYT+FYG+LG++A  +AHDA+L   +WE +IE WQ P+L D   PEWY +
Sbjct: 372 ARFDEKTYHRRYTRFYGSLGNAAVAMAHDALLNFPEWETQIEEWQAPVLADTTLPEWYRV 431

Query: 428 TLFNELYYLNAGGTIWTDGSPPMQSLATIRERKFSLDTP--------------------- 466
           TLFNELYY N+GGT+WTDG PP QSL +I  RK SL                        
Sbjct: 432 TLFNELYYFNSGGTMWTDGLPPKQSLDSIGRRKISLGLSTIDKNDQDQNNVALDILGRID 491

Query: 467 ---SKIHTPASSDTALGTRLLENGEENIGQFLYLEGAEYVMYNTYDVHFYSSFALVMLFP 523
              S+IH P +S+ ALG  +++N  ENIGQFLYLEG +Y+MYNTYDVHFYSSFAL+MLFP
Sbjct: 492 AVCSQIHAPLTSNAALGNTMIQNTTENIGQFLYLEGVQYLMYNTYDVHFYSSFALLMLFP 551

Query: 524 KLELSIQRDFAAAVMMHDPGTMKIMSDGKWVARKCLGAVPHDIGLDDPWFEINSYNLFNS 583
           K+ELSIQRDFAAAV+MHD    ++MS G++V RK LGAVPHDIGL+DPWFE+N+YNLFN+
Sbjct: 552 KIELSIQRDFAAAVLMHDSSKKQVMSSGEFVTRKVLGAVPHDIGLNDPWFELNAYNLFNT 611

Query: 584 SRWKDLNSKFVLQVYRDFVATGDKNFARAVWPSVYIAMAYMEQFDKDGDGMIENEGFPDQ 643
            RWKDLNSKFVLQVYRD VATGD NFA+AVWPSVY A+AY++QFDKDGDGMIENEGFPDQ
Sbjct: 612 DRWKDLNSKFVLQVYRDVVATGDLNFAKAVWPSVYTAIAYLDQFDKDGDGMIENEGFPDQ 671

Query: 644 TYDAWSANGVSAYCGGLWVAALQAASALANDVGDHASASYFWVRYQKAKAVYDSLWNGSY 703
           TYDAWS +GVSAYCGGLWVAALQA SALA ++GD+ +A YF  +Y+KA++VY+ LWNGSY
Sbjct: 672 TYDAWSCSGVSAYCGGLWVAALQAGSALAREIGDNGAAVYFNAKYEKARSVYEKLWNGSY 731

Query: 704 FNYDNSDGSSSTSIQADQLAGQWYARACGLLPIADEAKVKKALTKIYDFNVLKVKGGMCG 763
           FNYDNS   SS+SI ADQ+AGQWYARACGL PIA E  +KKAL  +YDFNV++V+ G  G
Sbjct: 732 FNYDNSRSGSSSSILADQMAGQWYARACGLKPIAKEEWIKKALETVYDFNVMRVRDGTRG 791

Query: 764 AMNGMQPDGRIDMSGLQAREIWPGVTYGLAASMIQEEMVDMAFQTAAGVYEVAWSEDGLG 823
           A+NGM PDGR+D S + +RE+W G TY +AA MIQE + D  F+TA+G+YE AWS+ GLG
Sbjct: 792 AVNGMLPDGRVDTSTMVSREVWAGTTYSVAACMIQEGLADKGFRTASGIYEAAWSDRGLG 851

Query: 824 YSFQTPESWNNNDEYRSLCYMRPLTIWAMQWALTKPKLSRQE---IKHEISDRDSSYLEQ 880
            +FQTPE+W  NDEYRSLCYMRPL IW +QWA T PK +R++   ++ +  D  S   +Q
Sbjct: 852 CAFQTPEAWTTNDEYRSLCYMRPLAIWGIQWAHTMPKPNREQEVSLRPQEEDATSVLFQQ 911

Query: 881 HAAFSKVASLLKLPKEEASKGFLKVVYD 908
           HA F KVA  LK  K +  +  L+  Y+
Sbjct: 912 HAGFIKVAHYLKNTKGKDHRNRLQTAYE 939


>gi|115484889|ref|NP_001067588.1| Os11g0242100 [Oryza sativa Japonica Group]
 gi|77549531|gb|ABA92328.1| expressed protein [Oryza sativa Japonica Group]
 gi|113644810|dbj|BAF27951.1| Os11g0242100 [Oryza sativa Japonica Group]
 gi|215704397|dbj|BAG93831.1| unnamed protein product [Oryza sativa Japonica Group]
 gi|222615769|gb|EEE51901.1| hypothetical protein OsJ_33494 [Oryza sativa Japonica Group]
          Length = 950

 Score = 1201 bits (3106), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 595/955 (62%), Positives = 741/955 (77%), Gaps = 51/955 (5%)

Query: 1   MENGVKEEGENAA-SMPK-------VDPGQPPQLTWQRKLDSNV--KVPTGFTLSFQEFR 50
           +ENGV E+ +  + + P+       VDPG  P+LTW+ KL SN+   +P+ F L+++E  
Sbjct: 2   VENGVLEQPKGVSRNRPRAQSNDHPVDPGYLPELTWEHKL-SNIGYDLPS-FRLTWRETF 59

Query: 51  HLAHIGFRLYRYSKQEEAKGKVPVFNVFRKHHITSDQGIPLGGIGAGSIGRSYRGEFQRF 110
            LA +G RL R+  +E +KG+  V +  +K    S QG+PLGGIG+GSIGRSY+GEFQR+
Sbjct: 60  QLAGLGLRLGRHILEETSKGRAAVIDPMKKRIAKSGQGVPLGGIGSGSIGRSYKGEFQRW 119

Query: 111 KLFHGICDDAPVLANQFSVFVSRPNGEKFSSVLCPRSPGVPKKNTDSGIESWDWNLKGEN 170
           +LF G C++ PVLANQFS F+SR +G  +SSVL P  P +PK +  SGI SWDWN+ G+N
Sbjct: 120 QLFPGTCEERPVLANQFSAFISRKDGRNYSSVLHPGKPDLPKGSNISGIGSWDWNMSGQN 179

Query: 171 CTYHALFPRAWTVYDGEPDPELRIVCRQISPFIPHNYKESSFPASVFTFTLSNSGQTSAD 230
            TYHAL+PR+WT+Y+GEPDP++ IVCRQISP IPHNY++SS+P SVFTFT++NSG T+AD
Sbjct: 180 STYHALYPRSWTIYNGEPDPDVNIVCRQISPIIPHNYQQSSYPVSVFTFTVTNSGNTAAD 239

Query: 231 VTLLFTWANSVAGDSGLSGHHFNSKTMTKDGVHGLTLHHRTANGRPPVTFAVAAEETADV 290
           VTLLFTWANSV G S L+G+H NS  + KDGVHG+ LHHRTANG+PPVTFA+AA+E  D+
Sbjct: 240 VTLLFTWANSVGGKSELTGYHSNSPMIEKDGVHGILLHHRTANGQPPVTFAIAAQEKEDI 299

Query: 291 HVSECPCFLLSGNSKGITAKDMWNEIKKHGSFDHLDNDKTSP-SEPGSSIGAAIAASLTI 349
           H+SECP F++SG+S   +AKDMWN +K++GSFD+LD  KTS  S+PG SIGAAIAAS+ +
Sbjct: 300 HISECPYFIISGSSDAFSAKDMWNYVKENGSFDNLDLTKTSMCSKPGLSIGAAIAASVKL 359

Query: 350 PSGSTRSVTFSLAWDCPEVKFFE-KVYHRRYTKFYGTLGDSAARIAHDAILEHAKWECEI 408
           P  +T++V+F+LAW CPEVKF   K YHRRYTKF+GT  D+AA +AHDAILEH  WE +I
Sbjct: 360 PPQTTQNVSFALAWACPEVKFSSGKTYHRRYTKFHGTDNDAAASLAHDAILEHNSWERQI 419

Query: 409 EAWQRPILEDKRFPEWYPITLFNELYYLNAGGTIWTDGSPPMQSLATIRERKFSLDTPS- 467
           E WQ PIL+D+RFP+WYP+TLFNELYYLNAGGTIWTDG PP+QSL  I E+KFSLD  + 
Sbjct: 420 EEWQNPILQDERFPDWYPVTLFNELYYLNAGGTIWTDGLPPIQSLTGIGEKKFSLDMQNG 479

Query: 468 ---------------------------KIHTPASSDTALGTRLLENGEENIGQFLYLEGA 500
                                      +IH    S++A+GT LL+ GEENIGQFLYLEG 
Sbjct: 480 DADDANGIIPRNNTASDILNQMASVLERIHASMESNSAIGTTLLQ-GEENIGQFLYLEGI 538

Query: 501 EYVMYNTYDVHFYSSFALVMLFPKLELSIQRDFAAAVMMHDPGTMKIMSDGKWVARKCLG 560
           EY M+NTYDVHFY+SF+L+MLFPKL+LSIQRDFAAAV+MHDP  ++++ DGKWVARK LG
Sbjct: 539 EYYMWNTYDVHFYASFSLIMLFPKLQLSIQRDFAAAVLMHDPEKLRMLHDGKWVARKVLG 598

Query: 561 AVPHDIGLDDPWFEINSYNLFNSSRWKDLNSKFVLQVYRDFVATGDKNFARAVWPSVYIA 620
           AVPHD+GL DPWF++N+Y L+N+ RWKDLN KFVLQVYRD VATGDK+FARAVWPSVY+A
Sbjct: 599 AVPHDLGLYDPWFKVNAYTLYNTDRWKDLNPKFVLQVYRDVVATGDKSFARAVWPSVYMA 658

Query: 621 MAYMEQFDKDGDGMIENEGFPDQTYDAWSANGVSAYCGGLWVAALQAASALANDVGDHAS 680
           MAYMEQFD+D DGMIENE FPDQTYD WS  G+SAYCGGLWVAALQAASALA++VGD AS
Sbjct: 659 MAYMEQFDRDKDGMIENEDFPDQTYDVWSMAGISAYCGGLWVAALQAASALAHEVGDKAS 718

Query: 681 ASYFWVRYQKAKAVYDSLWNGSYFNYDNSDGSSSTSIQADQLAGQWYARACGLLPIADEA 740
              FW +Y+KAK+VY  LWNGSYFNYD+ D   S SI ADQLAGQWYA+ACGL PI D+ 
Sbjct: 719 EKLFWDKYEKAKSVYGKLWNGSYFNYDDGDNIMSASIHADQLAGQWYAKACGLFPIVDKD 778

Query: 741 KVKKALTKIYDFNVLKVKGGMCGAMNGMQPDGRIDMSGLQAREIWPGVTYGLAASMIQEE 800
           K + AL KIY FNV+K K G  GAMNGM PDG +DMS +Q+REIWPGVTY LAA+MIQE 
Sbjct: 779 KAESALEKIYSFNVMKFKDGKRGAMNGMWPDGTVDMSAMQSREIWPGVTYALAATMIQEG 838

Query: 801 MVDMAFQTAAGVYEVAWSEDGLGYSFQTPESWNNNDEYRSLCYMRPLTIWAMQWALTKPK 860
           MV+  F+TA G+Y  AWS +GLGYSFQTPE+WNN+DEYRSLCYMRPL IWA+QWAL+ PK
Sbjct: 839 MVEKGFKTAEGIYHAAWSPEGLGYSFQTPEAWNNDDEYRSLCYMRPLAIWAIQWALSNPK 898

Query: 861 LSRQ--EIKHEISDRDSSYLEQHAAFSKVASLLKLPKEEASKGFLKVVYDFTIGR 913
           L +Q  +I  +      S+ +   +++++A LL LP++E+ K FL+V+Y+    R
Sbjct: 899 LHKQTADIPQD------SFPKNQFSYARIAKLLHLPEDESPKSFLRVIYEIVRNR 947


>gi|218185509|gb|EEC67936.1| hypothetical protein OsI_35656 [Oryza sativa Indica Group]
          Length = 950

 Score = 1201 bits (3106), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 595/955 (62%), Positives = 741/955 (77%), Gaps = 51/955 (5%)

Query: 1   MENGVKEEGENAA-SMPK-------VDPGQPPQLTWQRKLDSNV--KVPTGFTLSFQEFR 50
           +ENGV E+ +  + + P+       VDPG  P+LTW+ KL SN+   +P+ F L+++E  
Sbjct: 2   VENGVLEQPKGVSRNRPRAQSNDHPVDPGYLPELTWEHKL-SNIGYDLPS-FRLTWRETF 59

Query: 51  HLAHIGFRLYRYSKQEEAKGKVPVFNVFRKHHITSDQGIPLGGIGAGSIGRSYRGEFQRF 110
            LA +G RL R+  +E +KG+  V +  +K    S QG+PLGGIG+GSIGRSY+GEFQR+
Sbjct: 60  QLAGLGLRLGRHILEETSKGRAAVIDPMKKRIAKSGQGVPLGGIGSGSIGRSYKGEFQRW 119

Query: 111 KLFHGICDDAPVLANQFSVFVSRPNGEKFSSVLCPRSPGVPKKNTDSGIESWDWNLKGEN 170
           +LF G C++ PVLANQFS F+SR +G  +SSVL P  P +PK +  SGI SWDWN+ G+N
Sbjct: 120 QLFPGTCEERPVLANQFSAFISRKDGRNYSSVLHPGKPDLPKGSNISGIGSWDWNMSGQN 179

Query: 171 CTYHALFPRAWTVYDGEPDPELRIVCRQISPFIPHNYKESSFPASVFTFTLSNSGQTSAD 230
            TYHAL+PR+WT+Y+GEPDP++ IVCRQISP IPHNY++SS+P SVFTFT++NSG T+AD
Sbjct: 180 STYHALYPRSWTIYNGEPDPDVNIVCRQISPIIPHNYQQSSYPVSVFTFTVTNSGNTAAD 239

Query: 231 VTLLFTWANSVAGDSGLSGHHFNSKTMTKDGVHGLTLHHRTANGRPPVTFAVAAEETADV 290
           VTLLFTWANSV G S L+G+H NS  + KDGVHG+ LHHRTANG+PPVTFA+AA+E  DV
Sbjct: 240 VTLLFTWANSVGGKSELTGYHSNSPMIEKDGVHGILLHHRTANGQPPVTFAIAAQEKEDV 299

Query: 291 HVSECPCFLLSGNSKGITAKDMWNEIKKHGSFDHLDNDKTSP-SEPGSSIGAAIAASLTI 349
           H+SECP F++SG+S   +AKDMWN +K++GSFD+LD  KTS  S+PG SIGAAIAAS+ +
Sbjct: 300 HISECPYFIISGSSDAFSAKDMWNYVKENGSFDNLDLTKTSMCSKPGLSIGAAIAASVKL 359

Query: 350 PSGSTRSVTFSLAWDCPEVKFFE-KVYHRRYTKFYGTLGDSAARIAHDAILEHAKWECEI 408
           P  +T++V+F+LAW CPEVKF   K YHRRYTKF+GT  D+AA +AHDAILEH  WE +I
Sbjct: 360 PPQTTQNVSFALAWACPEVKFSSGKTYHRRYTKFHGTDNDAAASLAHDAILEHNSWERQI 419

Query: 409 EAWQRPILEDKRFPEWYPITLFNELYYLNAGGTIWTDGSPPMQSLATIRERKFSLDTPS- 467
           E WQ PIL+D+RFP+WYP+TLFNELYYLNAGGTIWTDG PP+QSL  I E+KFSLD  + 
Sbjct: 420 EEWQNPILQDERFPDWYPVTLFNELYYLNAGGTIWTDGLPPIQSLTGIGEKKFSLDMQNG 479

Query: 468 ---------------------------KIHTPASSDTALGTRLLENGEENIGQFLYLEGA 500
                                      +IH    S++A+GT LL+ GEENIGQFLYLEG 
Sbjct: 480 DADDANGIIPRNNTASDILNQMASVLERIHASMESNSAIGTTLLQ-GEENIGQFLYLEGI 538

Query: 501 EYVMYNTYDVHFYSSFALVMLFPKLELSIQRDFAAAVMMHDPGTMKIMSDGKWVARKCLG 560
           EY M+NTYDVHFY+SF+L+MLFPKL+LSIQRDFAAAV+MHDP  ++++ DGKWVARK LG
Sbjct: 539 EYYMWNTYDVHFYASFSLIMLFPKLQLSIQRDFAAAVLMHDPEKLRMLHDGKWVARKVLG 598

Query: 561 AVPHDIGLDDPWFEINSYNLFNSSRWKDLNSKFVLQVYRDFVATGDKNFARAVWPSVYIA 620
           A+PHD+GL DPWF++N+Y L+N+ RWKDLN KFVLQVYRD VATGDK+FARAVWPSVY+A
Sbjct: 599 AIPHDLGLYDPWFKVNAYTLYNTDRWKDLNPKFVLQVYRDVVATGDKSFARAVWPSVYMA 658

Query: 621 MAYMEQFDKDGDGMIENEGFPDQTYDAWSANGVSAYCGGLWVAALQAASALANDVGDHAS 680
           MAYMEQFD+D DGMIENE FPDQTYD WS  G+SAYCGGLWVAALQAASALA++VGD AS
Sbjct: 659 MAYMEQFDRDKDGMIENEDFPDQTYDVWSMAGISAYCGGLWVAALQAASALAHEVGDKAS 718

Query: 681 ASYFWVRYQKAKAVYDSLWNGSYFNYDNSDGSSSTSIQADQLAGQWYARACGLLPIADEA 740
              FW +Y+KAK+VY  LWNGSYFNYD+ D   S SI ADQLAGQWYA+ACGL PI D+ 
Sbjct: 719 EKLFWDKYEKAKSVYGKLWNGSYFNYDDGDNIMSASIHADQLAGQWYAKACGLFPIVDKD 778

Query: 741 KVKKALTKIYDFNVLKVKGGMCGAMNGMQPDGRIDMSGLQAREIWPGVTYGLAASMIQEE 800
           K + AL KIY FNV+K K G  GAMNGM PDG +DMS +Q+REIWPGVTY LAA+MIQE 
Sbjct: 779 KAESALEKIYSFNVMKFKDGKRGAMNGMWPDGTVDMSAMQSREIWPGVTYALAATMIQEG 838

Query: 801 MVDMAFQTAAGVYEVAWSEDGLGYSFQTPESWNNNDEYRSLCYMRPLTIWAMQWALTKPK 860
           MV+  F+TA G+Y  AWS +GLGYSFQTPE+WNN+DEYRSLCYMRPL IWA+QWAL+ PK
Sbjct: 839 MVEKGFKTAEGIYHAAWSPEGLGYSFQTPEAWNNDDEYRSLCYMRPLAIWAIQWALSNPK 898

Query: 861 LSRQ--EIKHEISDRDSSYLEQHAAFSKVASLLKLPKEEASKGFLKVVYDFTIGR 913
           L +Q  +I  +      S+ +   +++++A LL LP++E+ K FL+V+Y+    R
Sbjct: 899 LHKQTADIPQD------SFPKNQFSYARIAKLLHLPEDESPKSFLRVIYEIVRNR 947


>gi|28393605|gb|AAO42222.1| unknown protein [Arabidopsis thaliana]
          Length = 947

 Score = 1197 bits (3098), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 574/928 (61%), Positives = 706/928 (76%), Gaps = 38/928 (4%)

Query: 18  VDPGQPPQLTWQRKLDSNVKVPTGFTLSFQEFRHLAHIGFRLYRYSKQEEAKGKVPVFNV 77
           +D   PP  +W+RKL+S  K P+ F LS ++  HL  +G+RL+R++K E AKG+  +F++
Sbjct: 13  IDDKLPP-FSWERKLNSQAKTPSEFKLSKRDHMHLFPLGYRLWRHTKDEAAKGRASIFDI 71

Query: 78  FRKHHITSDQGIPLGGIGAGSIGRSYRGEFQRFKLFHGICDDAPVLANQFSVFVSRPNGE 137
           FRKHHIT D G+PLGGIG+GSIGRSY+GEFQ+FKLF  IC++AP+L NQFS FVSRP G 
Sbjct: 72  FRKHHITGDHGVPLGGIGSGSIGRSYKGEFQQFKLFPKICEEAPILTNQFSAFVSRPGGV 131

Query: 138 KFSSVLCPRSPGVPKKN---------TDSGIESWDWNLKGENCTYHALFPRAWTVYDGEP 188
           K S+VLCP  P V K N          + GIESWDWN+ GE  TYHAL+PR+WTVYDGEP
Sbjct: 132 KHSTVLCPSKPQVVKDNGGYLCQGQAPNMGIESWDWNMTGEKSTYHALYPRSWTVYDGEP 191

Query: 189 DPELRIVCRQISPFIPHNYKESSFPASVFTFTLSNSGQTSADVTLLFTWANSVAGDSGLS 248
           DPELRIV RQ+SPFIPHNY+ESS P SVF FT++N+G   A VTLLFTW NSV G SGL+
Sbjct: 192 DPELRIVSRQVSPFIPHNYEESSLPVSVFNFTVTNTGAEPAIVTLLFTWENSVGGASGLT 251

Query: 249 GHHFNSKTMTKDGVHGLTLHHRTANGRPPVTFAVAAEETADVHVSECPCFLLSGNSKG-I 307
           G HFNS    KDGVH + L H+TANG PPV++A+AA+ET DV VS CPCF++SG +   I
Sbjct: 252 GQHFNSTMKAKDGVHAVALQHKTANGHPPVSYAIAAKETEDVRVSSCPCFIVSGTTPNQI 311

Query: 308 TAKDMWNEIKKHGSFDHLDNDKTSPSEPGSSIGAAIAASLTIPSGSTRSVTFSLAWDCPE 367
           TA DMW+EIKK+ SFD L ++  SPS+PG+SIGAAIAA + +P G  R+VTFSL+WDCPE
Sbjct: 312 TAGDMWDEIKKNASFDKLTSNACSPSKPGTSIGAAIAAKVKVPPGCDRTVTFSLSWDCPE 371

Query: 368 VKFFEKVYHRRYTKFYGTLGDSAARIAHDAILEHAKWECEIEAWQRPILEDKRFPEWYPI 427
            +F EK YHRRYT+FYG+LG++A  +AHDA+L   +WE +IE WQ P+L D   PEWY +
Sbjct: 372 ARFDEKTYHRRYTRFYGSLGNAAVAMAHDALLNFPEWETQIEEWQAPVLADTTLPEWYRV 431

Query: 428 TLFNELYYLNAGGTIWTDGSPPMQSLATIRERKFSLDTP--------------------- 466
           TLFNELYY N+GGT+WTDG PP QSL +I  RK SL                        
Sbjct: 432 TLFNELYYFNSGGTMWTDGLPPKQSLDSIGRRKISLGLSTIDKNDQDQNNVALDILGRID 491

Query: 467 ---SKIHTPASSDTALGTRLLENGEENIGQFLYLEGAEYVMYNTYDVHFYSSFALVMLFP 523
              S+IH P +S+ ALG  +++N  ENIGQFLYLEG +Y+MYNTY VHFYSSFAL+MLFP
Sbjct: 492 AVCSQIHAPLTSNAALGNTMIQNTTENIGQFLYLEGVQYLMYNTYGVHFYSSFALLMLFP 551

Query: 524 KLELSIQRDFAAAVMMHDPGTMKIMSDGKWVARKCLGAVPHDIGLDDPWFEINSYNLFNS 583
           K+ELSIQRDFAAAV+MHD    ++MS G++V RK LGAVPHDIGL+DPWFE+N+YNLFN+
Sbjct: 552 KIELSIQRDFAAAVLMHDSSKKQVMSSGEFVTRKVLGAVPHDIGLNDPWFELNAYNLFNT 611

Query: 584 SRWKDLNSKFVLQVYRDFVATGDKNFARAVWPSVYIAMAYMEQFDKDGDGMIENEGFPDQ 643
            RWKDLNSKFVLQVYRD VATGD NFA+AVWPSVY A+AY++QFDKDGDGMIENEGFPDQ
Sbjct: 612 DRWKDLNSKFVLQVYRDVVATGDLNFAKAVWPSVYTAIAYLDQFDKDGDGMIENEGFPDQ 671

Query: 644 TYDAWSANGVSAYCGGLWVAALQAASALANDVGDHASASYFWVRYQKAKAVYDSLWNGSY 703
           TYDAWS +GVSAYCGGLWVAALQA SALA ++GD+ +A YF  +Y+KA++VY+ LWNGSY
Sbjct: 672 TYDAWSCSGVSAYCGGLWVAALQAGSALAREIGDNGAAVYFNAKYEKARSVYEKLWNGSY 731

Query: 704 FNYDNSDGSSSTSIQADQLAGQWYARACGLLPIADEAKVKKALTKIYDFNVLKVKGGMCG 763
           FNYDNS   SS+SI ADQ+AGQWYARACGL PIA E  +KKAL  +YDFNV++V+ G  G
Sbjct: 732 FNYDNSRSGSSSSILADQMAGQWYARACGLKPIAKEEWIKKALETVYDFNVMRVRDGTRG 791

Query: 764 AMNGMQPDGRIDMSGLQAREIWPGVTYGLAASMIQEEMVDMAFQTAAGVYEVAWSEDGLG 823
           A+NGM PDGR+D S + +RE+W G TY +AA MIQE + D  F+TA+G+YE AWS+ GLG
Sbjct: 792 AVNGMLPDGRVDTSTMVSREVWAGTTYSVAACMIQEGLADKGFRTASGIYEAAWSDRGLG 851

Query: 824 YSFQTPESWNNNDEYRSLCYMRPLTIWAMQWALTKPKLSRQE---IKHEISDRDSSYLEQ 880
            +FQTPE+W  NDEYRSLCYMRPL IW +QWA T PK +R++   ++ +  D  S   +Q
Sbjct: 852 CAFQTPEAWTTNDEYRSLCYMRPLAIWGIQWAHTMPKPNREQEVSLRPQEEDATSVLFQQ 911

Query: 881 HAAFSKVASLLKLPKEEASKGFLKVVYD 908
           HA F KVA  LK  K +  +  L+  Y+
Sbjct: 912 HAGFIKVAHYLKNTKGKDHRNRLQTAYE 939


>gi|413920603|gb|AFW60535.1| hypothetical protein ZEAMMB73_659971 [Zea mays]
          Length = 951

 Score = 1190 bits (3079), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 594/932 (63%), Positives = 714/932 (76%), Gaps = 46/932 (4%)

Query: 17  KVDPGQPPQLTWQRKLDSNVKVPTGFTLSFQEFRHLAHIGFRLYRYSKQEEAKGKVPVFN 76
           KV PGQ P LTW+ KL         F L+++E   LA IG RL R+  +E +KG++ V +
Sbjct: 28  KVYPGQLPVLTWEHKLTHVRHDLPPFRLTWREIMQLAGIGLRLSRHILEETSKGRIAVID 87

Query: 77  VFRKHHITSDQGIPLGGIGAGSIGRSYRGEFQRFKLFHGICDDAPVLANQFSVFVSRPNG 136
             +K    S QG+PLGG    SIGRSY+G+FQR++LF G C+D  VLANQFS F+SR +G
Sbjct: 88  PMKKRAARSGQGVPLGG----SIGRSYKGDFQRWQLFPGSCEDNAVLANQFSAFISRQDG 143

Query: 137 EKFSSVLCPRSPGVPKKNTDSGIESWDWNLKGENCTYHALFPRAWTVYDGEPDPELRIVC 196
           +K+S+VL P  P +PK +  SGI SWDWN+ G+  TYHAL+PRAWTVYDGEPDPEL IVC
Sbjct: 144 KKYSTVLHPGKPDLPKGSDISGIGSWDWNMSGQQSTYHALYPRAWTVYDGEPDPELNIVC 203

Query: 197 RQISPFIPHNYKESSFPASVFTFTLSNSGQTSADVTLLFTWANSVAGDSGLSGHHFNSKT 256
           RQISP IPHNY++SS+P +VFTFT++NSG T+ADVTLLFTWANSV G S L+G+H NS  
Sbjct: 204 RQISPIIPHNYQQSSYPVAVFTFTVTNSGHTAADVTLLFTWANSVGGKSELTGYHSNSSM 263

Query: 257 MTKDGVHGLTLHHRTANGRPPVTFAVAAEETADVHVSECPCFLLSGNSKG--ITAKDMWN 314
           + KDGVHG+ LHHRTA+G+P VTF +AA+E  D+ +SECP F++SG+S     TAKDMWN
Sbjct: 264 IEKDGVHGILLHHRTADGQPHVTFVIAAQEKEDILISECPYFVISGSSASDEFTAKDMWN 323

Query: 315 EIKKHGSFDHLDNDKTSP-SEPGSSIGAAIAASLTIPSGSTRSVTFSLAWDCPEVKFFE- 372
            +K+HGSFDHLD  KTS  S PGSSIGAAIAAS+ +   +T+ ++FSLAW CPEVKF   
Sbjct: 324 SVKEHGSFDHLDPIKTSMCSRPGSSIGAAIAASVKLAPKATQDISFSLAWACPEVKFSSG 383

Query: 373 KVYHRRYTKFYGTLGDSAARIAHDAILEHAKWECEIEAWQRPILEDKRFPEWYPITLFNE 432
           K YHRRYTKFYGT GD+AA +AHDAILEHA WE +IE WQ PIL+D+RFP WYP+TLFNE
Sbjct: 384 KTYHRRYTKFYGTDGDAAAALAHDAILEHASWERQIEEWQDPILQDERFPAWYPVTLFNE 443

Query: 433 LYYLNAGGTIWTDGSPPMQSLATIRERKFSLDTPS------------------------- 467
           LYYLNAGGTIWTDG PP+QSL  +  +KFSLD  +                         
Sbjct: 444 LYYLNAGGTIWTDGLPPIQSLTAVGWKKFSLDMLNGESDDFNEMVRQNNTASDILHRMAS 503

Query: 468 ---KIHTPASSDTALGTRLLENGEENIGQFLYLEGAEYVMYNTYDVHFYSSFALVMLFPK 524
              ++H   +S++A+GT LL+ GEENIGQFLYLEG EY M+NTYDVHFY+SF+L+MLFPK
Sbjct: 504 VFERMHASIASNSAIGTTLLQ-GEENIGQFLYLEGIEYYMWNTYDVHFYASFSLIMLFPK 562

Query: 525 LELSIQRDFAAAVMMHDPGTMKIMSDGKWVARKCLGAVPHDIGLDDPWFEINSYNLFNSS 584
           L+LS+QRDFAAAVMMHDP  ++I+ DGKW ARK LGAVPHD+GL DPWF++N+Y L+N+ 
Sbjct: 563 LQLSVQRDFAAAVMMHDPEKLRILHDGKWAARKVLGAVPHDLGLYDPWFKVNAYTLYNTD 622

Query: 585 RWKDLNSKFVLQVYRDFVATGDKNFARAVWPSVYIAMAYMEQFDKDGDGMIENEGFPDQT 644
           RWKDLN KFVLQVYRD VATGDK+FARAVWPSVYIAMAYMEQFDKD DGMIENE FPDQT
Sbjct: 623 RWKDLNPKFVLQVYRDVVATGDKSFARAVWPSVYIAMAYMEQFDKDKDGMIENEDFPDQT 682

Query: 645 YDAWSANGVSAYCGGLWVAALQAASALANDVGDHASASYFWVRYQKAKAVYDSLWNGSYF 704
           YD WS  G+SAYCGGLWVAALQAASALA +VGD AS   FW +Y+KAK VY  LWNGSYF
Sbjct: 683 YDVWSMAGISAYCGGLWVAALQAASALAREVGDKASEKLFWDKYEKAKFVYSKLWNGSYF 742

Query: 705 NYDNSDGSSSTSIQADQLAGQWYARACGLLPIADEAKVKKALTKIYDFNVLKVKGGMCGA 764
           +YD+ D   STSIQADQLAGQWYA+ACGL PI D+ K + AL K+Y FNV+K KGG  GA
Sbjct: 743 SYDDGDNKMSTSIQADQLAGQWYAKACGLFPIVDKDKAQSALEKVYSFNVMKFKGGTRGA 802

Query: 765 MNGMQPDGRIDMSGLQAREIWPGVTYGLAASMIQEEMVDMAFQTAAGVYEVAWSEDGLGY 824
           MNGM PDG +DMS +Q+REIWPGVTY LAASMIQE MV+  F+TA GVY  AWS  GLGY
Sbjct: 803 MNGMWPDGTVDMSAMQSREIWPGVTYALAASMIQEGMVEQGFKTAEGVYHAAWSPGGLGY 862

Query: 825 SFQTPESWNNNDEYRSLCYMRPLTIWAMQWALTKPKLSRQEIKHEISDRD---SSYLEQH 881
           SFQTPE+WNN+DEYRSLCYMRPL IWAMQWAL+KPKL      H     D    S+ +  
Sbjct: 863 SFQTPEAWNNDDEYRSLCYMRPLAIWAMQWALSKPKL------HNTPQADIPQDSFPKNQ 916

Query: 882 AAFSKVASLLKLPKEEASKGFLKVVYDFTIGR 913
            A++++  LL+LP++E+SK F++VVY+    R
Sbjct: 917 FAYARITKLLQLPEDESSKSFVRVVYEIIRNR 948


>gi|20453181|gb|AAM19831.1| AT4g10060/T5L19_190 [Arabidopsis thaliana]
 gi|24111431|gb|AAN46866.1| At4g10060/T5L19_190 [Arabidopsis thaliana]
          Length = 922

 Score = 1146 bits (2965), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 567/930 (60%), Positives = 697/930 (74%), Gaps = 32/930 (3%)

Query: 1   MENGVKEEGENAASMPKVDPGQPPQLTWQRKLDSNVKVPTGFTLSFQEFRHLAHIGFRLY 60
           ME     E E    M  V+  + PQ+TWQRKL+S VK P+ F +S ++  HL  +G+RL+
Sbjct: 1   MEKNGHTESELQTQM--VENERLPQVTWQRKLNSKVKNPSEFKMSIRDVLHLFPLGYRLW 58

Query: 61  RYSKQEEAKGKVPVFNVFRKHHITSDQGIPLGGIGAGSIGRSYRGEFQRFKLFHGICDDA 120
           RY+K+E  KG+  ++++F+K H+  D G+PLGGIG GSIGRSY+GEFQ+FKLF  IC++A
Sbjct: 59  RYTKEEAKKGRFSMYDIFKKRHVRGDHGVPLGGIGGGSIGRSYKGEFQQFKLFPKICEEA 118

Query: 121 PVLANQFSVFVSRPNGEKFSSVLCPRSP-GVPKKNTDSGIESWDWNLKGENCTYHALFPR 179
           P+L NQFSVFVSRP G  +S+VLCP  P  V  K  D GIESWDWN++G+  TYHAL+PR
Sbjct: 119 PILTNQFSVFVSRPGGLSYSTVLCPTKPKSVKGKTEDLGIESWDWNMEGDKSTYHALYPR 178

Query: 180 AWTVYDGEPDPELRIVCRQISPFIPHNYKESSFPASVFTFTLSNSGQTSADVTLLFTWAN 239
           +WTVY+ EPDPELRIV RQ+SPFIPHNYKESS P SVF FT+SN G+  A VTLLFTW N
Sbjct: 179 SWTVYN-EPDPELRIVSRQVSPFIPHNYKESSLPVSVFEFTMSNLGKEEATVTLLFTWEN 237

Query: 240 SVAGDSGLSGHHFNSKTMTKDGVHGLTLHHRTANGRPPVTFAVAAEETADVHVSECPCFL 299
           SV G SGL+G HFNS  M +DGVH + LHH+T NG PPVT+A+AA+ET DVHVSECPCFL
Sbjct: 238 SVGGASGLTGEHFNSTIMERDGVHAIVLHHKTGNGHPPVTYAIAAQETEDVHVSECPCFL 297

Query: 300 LSGNS-KGITAKDMWNEIKKHGSFDHLDNDKTSPSEPGSSIGAAIAASLTIPSGSTRSVT 358
           +SG+S K ITAK+MW+EIKK+ SFD L+++  SPS PG+SIGAAIAA + +P G  R+VT
Sbjct: 298 VSGHSPKEITAKEMWDEIKKNKSFDELNSEPGSPSRPGTSIGAAIAAKVKVPPGCDRTVT 357

Query: 359 FSLAWDCPEVKFFEKVYHRRYTKFYGTLGDSAARIAHDAILEHAKWECEIEAWQRPILED 418
           FSL+WDCPEV+F EK YHRRYTKFYG LGD+A ++A DA+L +  WE +IEAWQ PIL D
Sbjct: 358 FSLSWDCPEVRFNEKTYHRRYTKFYGNLGDAAVKMARDALLNYVDWESQIEAWQSPILSD 417

Query: 419 KRFPEWYPITLFNELYYLNAGGTIWTDGSPPMQSL-----ATIRERKFSLDTPSKI---- 469
              P+WY +TLFNELYY N+GGTIWTDG PP +S+         +    +D   KI    
Sbjct: 418 TTLPDWYRVTLFNELYYFNSGGTIWTDGLPPKESIERSKVTNTEQNDIVIDLFQKINAVC 477

Query: 470 ---HTPASSDTALGTRLLENGEENIGQFLYLEGAEYVMYNTYDVHFYSSFALVMLFPKLE 526
              ++P SS          N EENIGQF+YLEG EY+MYNTYDVHFYSSFAL+ LFPKL 
Sbjct: 478 EQIYSPQSS----------NSEENIGQFIYLEGIEYLMYNTYDVHFYSSFALLSLFPKLA 527

Query: 527 LSIQRDFAAAVMMHDPGTMKIMSDGKWVARKCLGAVPHDIGLDDPWFEINSYNLFNSSRW 586
           LSIQRDFAA V++ DP   KIMS G+WV RK LG+VPHDIGL+DPW E+N YN FN+ RW
Sbjct: 528 LSIQRDFAATVLIQDPTKKKIMSSGEWVTRKLLGSVPHDIGLNDPWLELNEYNFFNTDRW 587

Query: 587 KDLNSKFVLQVYRDFVATGDKNFARAVWPSVYIAMAYMEQFDKDGDGMIENEGFPDQTYD 646
           KDLN+KFVLQVYRD VAT D++FA+AVWPSVY A+AY++QFDKD DGMIENEGFPDQTYD
Sbjct: 588 KDLNAKFVLQVYRDVVATNDQSFAKAVWPSVYTAVAYLDQFDKDEDGMIENEGFPDQTYD 647

Query: 647 AWSANGVSAYCGGLWVAALQAASALANDVGDHASASYFWVRYQKAKAVYDSLWNGSYFNY 706
           AWS  GVSAYCGGLWVAALQAASA A+ VG++A A YF  +Y+KAK VY+ LWNGSYFNY
Sbjct: 648 AWSVTGVSAYCGGLWVAALQAASAFASIVGENAVAIYFNAKYEKAKIVYEKLWNGSYFNY 707

Query: 707 DNSDGSSSTSIQADQLAGQWYARACGLLPIADEAKVKKALTKIYDFNVLKVKGGMCGAMN 766
           D+S   SS+SI ADQLAGQWYARACGL PI  E  +KKAL  IY+FNV+KVKGG  GA+N
Sbjct: 708 DDSGSGSSSSILADQLAGQWYARACGLKPITKEEWIKKALETIYEFNVMKVKGGTRGAVN 767

Query: 767 GMQPDGRIDMSGLQAREIWPGVTYGLAASMIQEEMVDMAFQTAAGVYEVAWSEDGLGYSF 826
           GM  +G++D + L ++E+W G TY +AA MIQE   +  FQTA+G+YE  WS+ GL  SF
Sbjct: 768 GMSTEGKVDTNSLVSKEVWAGTTYSVAACMIQEGQREKGFQTASGIYEAIWSDRGLSCSF 827

Query: 827 QTPESWNNNDEYRSLCYMRPLTIWAMQWALTKPKLSRQEIKHEIS----DRDSSYLEQHA 882
           QTPE+WN NDEYRSLCYMRPL IWA+QWALT+ +   +E +  ++    +  +  L QH 
Sbjct: 828 QTPEAWNMNDEYRSLCYMRPLAIWAIQWALTRTQSFGEEKQKLVAGDEEEESNLLLRQHK 887

Query: 883 AFSKVASLLKL-PKEEASKGFLKVVYDFTI 911
            F  VA  +K+ P     +  L+  Y+  +
Sbjct: 888 GFKDVARFVKIVPTSNVHRSRLQHTYETVL 917


>gi|240255774|ref|NP_192744.6| Beta-glucosidase, GBA2 type family protein [Arabidopsis thaliana]
 gi|332657434|gb|AEE82834.1| Beta-glucosidase, GBA2 type family protein [Arabidopsis thaliana]
          Length = 922

 Score = 1144 bits (2960), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 567/930 (60%), Positives = 696/930 (74%), Gaps = 32/930 (3%)

Query: 1   MENGVKEEGENAASMPKVDPGQPPQLTWQRKLDSNVKVPTGFTLSFQEFRHLAHIGFRLY 60
           ME     E E    M  V   + PQ+TWQRKL+S VK P+ F +S ++  HL  +G+RL+
Sbjct: 1   MEKNGHTESELQTQM--VGNERLPQVTWQRKLNSKVKNPSEFKMSIRDVLHLFPLGYRLW 58

Query: 61  RYSKQEEAKGKVPVFNVFRKHHITSDQGIPLGGIGAGSIGRSYRGEFQRFKLFHGICDDA 120
           RY+K+E  KG+  ++++F+K H+  D G+PLGGIG GSIGRSY+GEFQ+FKLF  IC++A
Sbjct: 59  RYTKEEAKKGRFSMYDIFKKRHVRGDHGVPLGGIGGGSIGRSYKGEFQQFKLFPKICEEA 118

Query: 121 PVLANQFSVFVSRPNGEKFSSVLCPRSP-GVPKKNTDSGIESWDWNLKGENCTYHALFPR 179
           P+L NQFSVFVSRP G  +S+VLCP  P  V  K  D GIESWDWN++G+  TYHAL+PR
Sbjct: 119 PILTNQFSVFVSRPGGLSYSTVLCPTKPKSVKGKTEDLGIESWDWNMEGDKSTYHALYPR 178

Query: 180 AWTVYDGEPDPELRIVCRQISPFIPHNYKESSFPASVFTFTLSNSGQTSADVTLLFTWAN 239
           +WTVY+ EPDPELRIV RQ+SPFIPHNYKESS P SVF FT+SN G+  A VTLLFTW N
Sbjct: 179 SWTVYN-EPDPELRIVSRQVSPFIPHNYKESSLPVSVFEFTMSNLGKEEATVTLLFTWEN 237

Query: 240 SVAGDSGLSGHHFNSKTMTKDGVHGLTLHHRTANGRPPVTFAVAAEETADVHVSECPCFL 299
           SV G SGL+G HFNS  M +DGVH + LHH+T NG PPVT+A+AA+ET DVHVSECPCFL
Sbjct: 238 SVGGASGLTGEHFNSTIMERDGVHAIVLHHKTGNGHPPVTYAIAAQETEDVHVSECPCFL 297

Query: 300 LSGNS-KGITAKDMWNEIKKHGSFDHLDNDKTSPSEPGSSIGAAIAASLTIPSGSTRSVT 358
           +SG+S K ITAK+MW+EIKK+ SFD L+++  SPS PG+SIGAAIAA + +P G  R+VT
Sbjct: 298 VSGHSPKEITAKEMWDEIKKNKSFDELNSEPGSPSRPGTSIGAAIAAKVKVPPGCDRTVT 357

Query: 359 FSLAWDCPEVKFFEKVYHRRYTKFYGTLGDSAARIAHDAILEHAKWECEIEAWQRPILED 418
           FSL+WDCPEV+F EK YHRRYTKFYG LGD+A ++A DA+L +  WE +IEAWQ PIL D
Sbjct: 358 FSLSWDCPEVRFNEKTYHRRYTKFYGNLGDAAVKMARDALLNYVDWESQIEAWQSPILSD 417

Query: 419 KRFPEWYPITLFNELYYLNAGGTIWTDGSPPMQSL-----ATIRERKFSLDTPSKI---- 469
              P+WY +TLFNELYY N+GGTIWTDG PP +S+         +    +D   KI    
Sbjct: 418 TTLPDWYRVTLFNELYYFNSGGTIWTDGLPPKESIERSKVTNTEQNDIVIDLFQKINAVC 477

Query: 470 ---HTPASSDTALGTRLLENGEENIGQFLYLEGAEYVMYNTYDVHFYSSFALVMLFPKLE 526
              ++P SS          N EENIGQF+YLEG EY+MYNTYDVHFYSSFAL+ LFPKL 
Sbjct: 478 EQIYSPQSS----------NSEENIGQFIYLEGIEYLMYNTYDVHFYSSFALLSLFPKLA 527

Query: 527 LSIQRDFAAAVMMHDPGTMKIMSDGKWVARKCLGAVPHDIGLDDPWFEINSYNLFNSSRW 586
           LSIQRDFAA V++ DP   KIMS G+WV RK LG+VPHDIGL+DPW E+N YN FN+ RW
Sbjct: 528 LSIQRDFAATVLIQDPTKKKIMSSGEWVTRKLLGSVPHDIGLNDPWLELNEYNFFNTDRW 587

Query: 587 KDLNSKFVLQVYRDFVATGDKNFARAVWPSVYIAMAYMEQFDKDGDGMIENEGFPDQTYD 646
           KDLN+KFVLQVYRD VAT D++FA+AVWPSVY A+AY++QFDKD DGMIENEGFPDQTYD
Sbjct: 588 KDLNAKFVLQVYRDVVATNDQSFAKAVWPSVYTAVAYLDQFDKDEDGMIENEGFPDQTYD 647

Query: 647 AWSANGVSAYCGGLWVAALQAASALANDVGDHASASYFWVRYQKAKAVYDSLWNGSYFNY 706
           AWS  GVSAYCGGLWVAALQAASA A+ VG++A A YF  +Y+KAK VY+ LWNGSYFNY
Sbjct: 648 AWSVTGVSAYCGGLWVAALQAASAFASIVGENAVAIYFNAKYEKAKIVYEKLWNGSYFNY 707

Query: 707 DNSDGSSSTSIQADQLAGQWYARACGLLPIADEAKVKKALTKIYDFNVLKVKGGMCGAMN 766
           D+S   SS+SI ADQLAGQWYARACGL PI  E  +KKAL  IY+FNV+KVKGG  GA+N
Sbjct: 708 DDSGSGSSSSILADQLAGQWYARACGLKPITKEEWIKKALETIYEFNVMKVKGGTRGAVN 767

Query: 767 GMQPDGRIDMSGLQAREIWPGVTYGLAASMIQEEMVDMAFQTAAGVYEVAWSEDGLGYSF 826
           GM  +G++D + L ++E+W G TY +AA MIQE   +  FQTA+G+YE  WS+ GL  SF
Sbjct: 768 GMSTEGKVDTNSLVSKEVWAGTTYSVAACMIQEGQREKGFQTASGIYEAIWSDRGLSCSF 827

Query: 827 QTPESWNNNDEYRSLCYMRPLTIWAMQWALTKPKLSRQEIKHEIS----DRDSSYLEQHA 882
           QTPE+WN NDEYRSLCYMRPL IWA+QWALT+ +   +E +  ++    +  +  L QH 
Sbjct: 828 QTPEAWNMNDEYRSLCYMRPLAIWAIQWALTRTQSFGEEKQKLVAGDEEEESNLLLRQHK 887

Query: 883 AFSKVASLLKL-PKEEASKGFLKVVYDFTI 911
            F  VA  +K+ P     +  L+  Y+  +
Sbjct: 888 GFKDVARFVKIVPTSNVHRSRLQHTYETVL 917


>gi|297809209|ref|XP_002872488.1| AT4g10060/T5L19_190 [Arabidopsis lyrata subsp. lyrata]
 gi|297318325|gb|EFH48747.1| AT4g10060/T5L19_190 [Arabidopsis lyrata subsp. lyrata]
          Length = 921

 Score = 1127 bits (2915), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 557/908 (61%), Positives = 687/908 (75%), Gaps = 39/908 (4%)

Query: 24  PQLTWQRKLDSNVKVPTGFTLSFQEFRHLAHIGFRLYRYSKQEEAKGKVPVFNVFRKHHI 83
           PQ+TWQRKL+S VK P+ F +S ++  HL  IG+RL+ ++K+E  KG+  ++++F+K H+
Sbjct: 22  PQVTWQRKLNSKVKNPSEFKMSIRDVLHLFPIGYRLWSHTKEEAKKGRASMYDIFKKRHV 81

Query: 84  TSDQGIPLGGIGAGSIGRSYRGEFQRFKLFHGICDDAPVLANQFSVFVSRPNGEKFSSVL 143
           T D G+PLGGIG GSIGRSY+GEFQ+FKLF  +C++AP+L NQFSVFV R  G  +SSVL
Sbjct: 82  TGDHGVPLGGIGGGSIGRSYKGEFQQFKLFPKVCEEAPILTNQFSVFVMRSGGVNYSSVL 141

Query: 144 CPRSP-GVPKKNTDSGIESWDWNLKGENCTYHALFPRAWTVYDGEPDPELRIVCRQISPF 202
           CP  P  V  K  + GIESWDWN+KG    YHAL+PR+WT+Y+ EPDPELRIV RQISPF
Sbjct: 142 CPTKPKSVRGKTEELGIESWDWNMKGYKSMYHALYPRSWTIYN-EPDPELRIVSRQISPF 200

Query: 203 IPHNYKESSFPASVFTFTLSNSGQTSADVTLLFTWANSVAGDSGLSGHHFNSKTMTKDGV 262
           IPHNYKESS P SVFTFT++N G+  A VTLLFTW NSV G SGL+G HFNS  M +DGV
Sbjct: 201 IPHNYKESSLPVSVFTFTVTNLGKEEATVTLLFTWENSVGGASGLTGEHFNSTMMERDGV 260

Query: 263 HGLTLHHRTANGRPPVTFAVAAEETADVHVSECPCFLLSGNS-KGITAKDMWNEIKKHGS 321
           H + LHH+T+NG PPVT+A+AA+ET DVHVSEC CFL+SG+S K ITAK+MW+EIKK+ S
Sbjct: 261 HAIALHHKTSNGHPPVTYAIAAQETEDVHVSECSCFLVSGHSPKDITAKEMWDEIKKNKS 320

Query: 322 FDHLDNDKTSPSEPGSSIGAAIAASLTIPSGSTRSVTFSLAWDCPEVKFFEKVYHRRYTK 381
           FD L+++  SPS PG+SIGAAIAA++ +P G  R+V FSL+WDCPEV+F EK YHRRYTK
Sbjct: 321 FDGLNSEPGSPSRPGTSIGAAIAANVKVPPGCDRTVMFSLSWDCPEVRFNEKTYHRRYTK 380

Query: 382 FYGTLGDSAARIAHDAILEHAKWECEIEAWQRPILEDKRFPEWYPITLFNELYYLNAGGT 441
           FYG LGD+A ++A DA+L +  WE +IEAWQ PIL D   P+WY +TLFNELYY N+GGT
Sbjct: 381 FYGNLGDAAVKMARDALLNYVDWESQIEAWQSPILSDTALPDWYRVTLFNELYYFNSGGT 440

Query: 442 IWTDGSPPMQSLATIRERKFSLDTP----------------SKIHTPASSDTALGTRLLE 485
           IWTDG PP + +    ER   ++T                 ++IH+P S           
Sbjct: 441 IWTDGLPPKERI----ERSKIINTEQNNIVTELFKKINVICAQIHSPQSL---------- 486

Query: 486 NGEENIGQFLYLEGAEYVMYNTYDVHFYSSFALVMLFPKLELSIQRDFAAAVMMHDPGTM 545
           N EE+IGQF+YLEG EY+MYNTYDVHFYSSFAL+ LFPKLELSIQRDFAA V+  DP   
Sbjct: 487 NAEEHIGQFIYLEGIEYLMYNTYDVHFYSSFALLSLFPKLELSIQRDFAATVLFQDPTKK 546

Query: 546 KIMSDGKWVARKCLGAVPHDIGLDDPWFEINSYNLFNSSRWKDLNSKFVLQVYRDFVATG 605
           +IMS G+WV RK LG+VPHDIGL+DPW E+N YN FN+ RWKDLN+KFVLQVYRD V+TG
Sbjct: 547 QIMSSGEWVPRKLLGSVPHDIGLNDPWLELNEYNFFNTDRWKDLNAKFVLQVYRDVVSTG 606

Query: 606 DKNFARAVWPSVYIAMAYMEQFDKDGDGMIENEGFPDQTYDAWSANGVSAYCGGLWVAAL 665
           D++FA+AVWPSVY A+AY++QFDKD D MIENEGFPDQTYDAWS  GVSAYCGGLWVAAL
Sbjct: 607 DQSFAKAVWPSVYTAVAYLDQFDKDEDEMIENEGFPDQTYDAWSVKGVSAYCGGLWVAAL 666

Query: 666 QAASALANDVGDHASASYFWVRYQKAKAVYDSLWNGSYFNYDNSDGSSSTSIQADQLAGQ 725
           QAASA A+ VG++  A YF  +Y+KAK+VY+ LWNGSYFNYD+S   SS+SI ADQLAGQ
Sbjct: 667 QAASAFASIVGENGVAIYFNTKYEKAKSVYEKLWNGSYFNYDDSGSGSSSSILADQLAGQ 726

Query: 726 WYARACGLLPIADEAKVKKALTKIYDFNVLKVKGGMCGAMNGMQPDGRIDMSGLQAREIW 785
           WYARACGL PI  E  +KKAL  IY+FNV+KVKGG  GA+NGM PDG++D + L ++E+W
Sbjct: 727 WYARACGLKPITKEEWIKKALETIYEFNVMKVKGGTRGAVNGMSPDGQVDTNSLVSKEVW 786

Query: 786 PGVTYGLAASMIQEEMVDMAFQTAAGVYEVAWSEDGLGYSFQTPESWNNNDEYRSLCYMR 845
            G TY +AA MIQE   +  FQTA+G+YE  WS+ GL  SFQTPE+WN NDEYRSLCYMR
Sbjct: 787 AGTTYSVAACMIQEGQRERGFQTASGIYEAVWSDRGLSCSFQTPEAWNMNDEYRSLCYMR 846

Query: 846 PLTIWAMQWALTKPKLSRQE----IKHEISDRDSSYLEQHAAFSKVASLLKL-PKEEASK 900
           PL IW++QWALT+ +   +E    +K E  + D   L QH  F  VA  +K+ P     +
Sbjct: 847 PLAIWSIQWALTRTQSFGEEKEKLVKGEEEESD-LLLRQHKGFKDVARFVKIVPTRNEHR 905

Query: 901 GFLKVVYD 908
             L+  Y+
Sbjct: 906 SRLQHTYE 913


>gi|8777421|dbj|BAA97011.1| unnamed protein product [Arabidopsis thaliana]
          Length = 928

 Score = 1092 bits (2824), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 553/964 (57%), Positives = 683/964 (70%), Gaps = 87/964 (9%)

Query: 1   MENGVKEEGENA----ASMPKVDPGQPPQLTWQRKLDSNVKVPTGFTLSFQEFRHLAHIG 56
            E  + + GE+     +S  KVDP  P  LTWQRK+DS+VK P  F LS +E   LA +G
Sbjct: 2   FEEKIMDIGEDVKPLNSSDTKVDPAVPASLTWQRKIDSDVKAPREFNLSVKEIFQLAPVG 61

Query: 57  FRLYRYSKQEEAKGKVPVFNVFRKHHITSDQGIPLGGIGAGSIGRSYRGEFQRFKLFHGI 116
            RL+   ++E AKG++   + F KH +TS  G+PLGGIGAGSIGRS++GEFQR++LF   
Sbjct: 62  IRLWFLCREEAAKGRLAFIDPFSKHSVTSSHGVPLGGIGAGSIGRSFKGEFQRWQLFPPK 121

Query: 117 CDDAPVLANQFSVFVSRPNGEKFSSVLCPRSPGVPKKNTDSGIESWDWNLKGENCTYHAL 176
           C+D PVLANQFS FVSR NG+K+SSVLCPR+P + K++++SGI SWDWNLKG+  TYHAL
Sbjct: 122 CEDEPVLANQFSAFVSRANGKKYSSVLCPRNPKLDKQDSESGIGSWDWNLKGDKSTYHAL 181

Query: 177 FPRAWTVYDGEPDPELRIVCRQISPFIPHNYKESSFPASVFTFTLSNSGQTSADVTLLFT 236
           +PR+WT+Y+G                        +  A  ++FTL N  Q+     + F 
Sbjct: 182 YPRSWTMYEG------------------------NIYAFEYSFTLVNLIQSLGLFVVKF- 216

Query: 237 WANSVAGDSGLSGHHFNSKTMTKDGVHGLTLHHRTANGRPPVTFAVAAEETADVHVSECP 296
                        H     T  K     L    +TANG P +++A++A+ T  V VS CP
Sbjct: 217 -------------HPLYLTTTKKAAFQCLFSLSQTANGLPSLSYAISAQATDGVSVSACP 263

Query: 297 CFLLSGNSKGITAKDMWNEIKKHGSFDHLDNDKTS-PSEPGSSIGAAIAASLTIPSGSTR 355
            F++SG   GITAKDMW  +K++GSFDHL   + S  S+ GSSIGAA+AAS+T+  G +R
Sbjct: 264 FFIVSGKQDGITAKDMWQAVKENGSFDHLKASEASMQSDHGSSIGAAVAASVTVLPGESR 323

Query: 356 SVTFSLAWDCPEVKFFE-KVYHR---------RYTKFYGTLGDSAARIAHDAILEHAKWE 405
            VTFSLAWDCPEV+F   K+Y R         RYTKFYG  GD+AA+IAHDAIL H++WE
Sbjct: 324 IVTFSLAWDCPEVQFPSGKIYSRQPIKQNLFRRYTKFYGNNGDAAAQIAHDAILGHSQWE 383

Query: 406 CEIEAWQRPILEDKRFPEWYPITLFNELYYLNAGGTIWTDGSPPMQSLATIRERKFSLD- 464
             IE WQRPILEDKR P WYP+TLFNELYYLN+GGT+WTDGS P+ SLA +RE+KFSLD 
Sbjct: 384 SWIEDWQRPILEDKRLPAWYPVTLFNELYYLNSGGTLWTDGSSPVHSLAGVREKKFSLDK 443

Query: 465 ---------------------------TPSKIHTPASSDTALGTRLLENGEENIGQFLYL 497
                                      T  ++H   +S++A GT+LLE GEENIG FLYL
Sbjct: 444 SQLGLKNDIDVPHQNDTAVSVLEKMASTLEELHASTTSNSAFGTKLLEEGEENIGHFLYL 503

Query: 498 EGAEYVMYNTYDVHFYSSFALVMLFPKLELSIQRDFAAAVMMHDPGTMKIMSDGKWVARK 557
           EG EY M+NTYDVHFY+SFALVMLFPKLELSIQRDFAAAVM+HDP  +K +S+G+WV RK
Sbjct: 504 EGIEYRMWNTYDVHFYASFALVMLFPKLELSIQRDFAAAVMLHDPTKVKTLSEGQWVQRK 563

Query: 558 CLGAVPHDIGLDDPWFEINSYNLFNSSRWKDLNSKFVLQVYRDFVATGDKNFARAVWPSV 617
            LGAVPHD+G++DPWFE+N Y L N+ RWKDLN KFVLQVYRD VATGDK FA AVWPSV
Sbjct: 564 VLGAVPHDLGINDPWFEVNGYTLHNTDRWKDLNPKFVLQVYRDVVATGDKKFASAVWPSV 623

Query: 618 YIAMAYMEQFDKDGDGMIENEGFPDQTYDAWSANGVSAYCGGLWVAALQAASALANDVGD 677
           Y+AMAYM QFDKDGDGMIENEGFPDQTYD WSA+GVSAYCGGLWVAALQAASALA  VGD
Sbjct: 624 YVAMAYMAQFDKDGDGMIENEGFPDQTYDTWSASGVSAYCGGLWVAALQAASALARVVGD 683

Query: 678 HASASYFWVRYQKAKAVYD-SLWNGSYFNYDNSDGSSSTSIQADQLAGQWYARACGLLPI 736
             S  YFW ++QKAK VY+  LWNGSYFNYDNS    S++IQADQLAGQWYARA GLLPI
Sbjct: 684 KNSQDYFWSKFQKAKVVYEKKLWNGSYFNYDNSGSQYSSTIQADQLAGQWYARASGLLPI 743

Query: 737 ADEAKVKKALTKIYDFNVLKVKGGMCGAMNGMQPDGRIDMSGLQAREIWPGVTYGLAASM 796
            DE K + AL K+Y++NV+K+K G  GA+NGM P+G++D + +Q+REIW GVTY L+A+M
Sbjct: 744 VDEDKARTALEKVYNYNVMKIKDGKRGAVNGMHPNGKVDTASMQSREIWSGVTYALSATM 803

Query: 797 IQEEMVDMAFQTAAGVYEVAWSEDGLGYSFQTPESWNNNDEYRSLCYMRPLTIWAMQWAL 856
           IQE +V+MAFQTA+G+YE AWSE GLGYSFQTPESWN  DEYRSL YMRPL IWAMQWAL
Sbjct: 804 IQEGLVEMAFQTASGIYEAAWSETGLGYSFQTPESWNTVDEYRSLTYMRPLAIWAMQWAL 863

Query: 857 TKPKLSRQEI-----KHEISDRDSSYLEQHAAFSKVASLLKLPKEEASKGFLKVVYDFTI 911
           TK    ++++     + E     SS ++    FS+V+ LL LP E ++K  L+ ++D+T 
Sbjct: 864 TKTSQKQEQLGLEPEQQEPELEPSSSMKHDIGFSRVSRLLSLPNEASAKSTLQTLFDYTC 923

Query: 912 GRVL 915
            R++
Sbjct: 924 RRMM 927


>gi|326489639|dbj|BAK01800.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 797

 Score = 1063 bits (2748), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 518/788 (65%), Positives = 618/788 (78%), Gaps = 35/788 (4%)

Query: 157 SGIESWDWNLKGENCTYHALFPRAWTVYDGEPDPELRIVCRQISPFIPHNYKESSFPASV 216
           SGI SWDWNL G   TYHAL+PRAWTVYDGEPDP+L+IVCRQISP IPHNY++SS+PA+V
Sbjct: 11  SGIGSWDWNLNGGKSTYHALYPRAWTVYDGEPDPDLKIVCRQISPIIPHNYQQSSYPAAV 70

Query: 217 FTFTLSNSGQTSADVTLLFTWANSVAGDSGLSGHHFNSKTMTKDGVHGLTLHHRTANGRP 276
           FTFT++NSG T+ADVTLLFTWANSV G S L+G+H NS    KDGVHG+ LHHRTA+GRP
Sbjct: 71  FTFTVTNSGNTAADVTLLFTWANSVGGKSELTGNHSNSSMTEKDGVHGILLHHRTADGRP 130

Query: 277 PVTFAVAAEETADVHVSECPCFLLSGNSKGITAKDMWNEIKKHGSFDHLDNDKTS-PSEP 335
           PV FA+AA+E  D+H+SECP F++SG+S   TAKDMWN +K+HGSFD LD  K S  S P
Sbjct: 131 PVAFAIAAQEKEDIHISECPYFVMSGSSDEFTAKDMWNSVKEHGSFDLLDPVKESTSSRP 190

Query: 336 GSSIGAAIAASLTIPSGSTRSVTFSLAWDCPEVKFFE-KVYHRRYTKFYGTLGDSAARIA 394
           G+SIGAAIAAS+ +   +T++V+FSLAW  PEVKF   K YHRRYTKFYGT  D+AA +A
Sbjct: 191 GTSIGAAIAASIKLAPQATQNVSFSLAWASPEVKFCSGKTYHRRYTKFYGTDVDAAASLA 250

Query: 395 HDAILEHAKWECEIEAWQRPILEDKRFPEWYPITLFNELYYLNAGGTIWTDGSPPMQSLA 454
           HDAIL+H+ WE +IE WQ PIL+DKRFP WYP+TLFNELYYLNAGGTIWTDG PP+QSL 
Sbjct: 251 HDAILDHSSWETQIENWQHPILQDKRFPAWYPVTLFNELYYLNAGGTIWTDGLPPIQSLT 310

Query: 455 TIRERKFSLDTPS----------------------------KIHTPASSDTALGTRLLEN 486
            I  +KFSLD  +                            +IH   +S +A+GT LL+ 
Sbjct: 311 AIGGKKFSLDMSNEETDDDSQMNPQTNTATDILHQMASVLERIHASLASSSAIGTTLLQ- 369

Query: 487 GEENIGQFLYLEGAEYVMYNTYDVHFYSSFALVMLFPKLELSIQRDFAAAVMMHDPGTMK 546
           GEENIGQFLYLEG EY M+NTYDVHFYSSF+L+MLFPKL+LS+QRDFAAAVM+HDP  +K
Sbjct: 370 GEENIGQFLYLEGIEYYMWNTYDVHFYSSFSLIMLFPKLQLSVQRDFAAAVMIHDPEKLK 429

Query: 547 IMSDGKWVARKCLGAVPHDIGLDDPWFEINSYNLFNSSRWKDLNSKFVLQVYRDFVATGD 606
           ++ DGK  ARK LGAVPHD+GL DPW ++N+Y L N+ RWKDLN KFVLQVYRD VATGD
Sbjct: 430 LLHDGKLAARKVLGAVPHDLGLYDPWIKVNAYTLHNTDRWKDLNPKFVLQVYRDVVATGD 489

Query: 607 KNFARAVWPSVYIAMAYMEQFDKDGDGMIENEGFPDQTYDAWSANGVSAYCGGLWVAALQ 666
           K+FARAVWPSVY+AMAYMEQFDKD DGMIENE FPDQTYD WS  GVSAYCGGLWVAALQ
Sbjct: 490 KSFARAVWPSVYMAMAYMEQFDKDKDGMIENEDFPDQTYDVWSMAGVSAYCGGLWVAALQ 549

Query: 667 AASALANDVGDHASASYFWVRYQKAKAVYD-SLWNGSYFNYDNSDGSSSTSIQADQLAGQ 725
           AASALA++VGD AS   FW +Y+KAK+VYD  LWNGSYFNYD++   +STSI ADQLAGQ
Sbjct: 550 AASALAHEVGDKASEKLFWNKYEKAKSVYDKKLWNGSYFNYDDAGTKASTSIHADQLAGQ 609

Query: 726 WYARACGLLPIADEAKVKKALTKIYDFNVLKVKGGMCGAMNGMQPDGRIDMSGLQAREIW 785
           WYA+ACGL  I D+ K + AL KIY FNV+K K G  GA+NGM PDG +DMS +Q+REIW
Sbjct: 610 WYAKACGLSSIVDKDKSQSALEKIYAFNVMKFKDGNRGAINGMWPDGTLDMSTMQSREIW 669

Query: 786 PGVTYGLAASMIQEEMVDMAFQTAAGVYEVAWSEDGLGYSFQTPESWNNNDEYRSLCYMR 845
           PGVTY LAASMIQE MVD  F+TA G+Y  AWS +GLGY+FQTPESWNN+DEYRSLCYMR
Sbjct: 670 PGVTYALAASMIQEGMVDEGFKTAEGIYNAAWSTEGLGYAFQTPESWNNDDEYRSLCYMR 729

Query: 846 PLTIWAMQWALTKPKLSRQEIKHEISDRDSSYLEQHAAFSKVASLLKLPKEEASKGFLKV 905
           PL IW++QWAL+ PKL +        D   S+ +   +++++A LL+LP++E+SK   +V
Sbjct: 730 PLAIWSIQWALSNPKLHKAPQTDITQD---SFPKNQFSYARIAKLLQLPEDESSKSLPRV 786

Query: 906 VYDFTIGR 913
           +Y+    R
Sbjct: 787 IYEIVRNR 794


>gi|62733283|gb|AAX95400.1| At5g49900 [Oryza sativa Japonica Group]
          Length = 931

 Score = 1037 bits (2681), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 535/921 (58%), Positives = 661/921 (71%), Gaps = 84/921 (9%)

Query: 65  QEEAKGKVPVFNVFRKHHITSDQGIPLGGIGAGSIGRSYRGEFQRFKLFHGICDDAPVLA 124
           Q++   +  V +  +K    S QG+PLGGIG+GSIGRSY+GEFQR++LF G C++ PVLA
Sbjct: 20  QKDVFIQAAVIDPMKKRIAKSGQGVPLGGIGSGSIGRSYKGEFQRWQLFPGTCEERPVLA 79

Query: 125 NQFS------------------VFVSRPNGEKFSSVLCPRSPGVPKKNTDSGIESWDWNL 166
           NQFS                   F+SR +G  +SSVL P  P +PK +  SGI SWDWN+
Sbjct: 80  NQFSDRYLATVNVLTVSRDYTQAFISRKDGRNYSSVLHPGKPDLPKGSNISGIGSWDWNM 139

Query: 167 KGENCTYHALFPRAWTVYDGEPDPELRIVCRQISPFIPHN-----YKESSFPASVFTFTL 221
            G+N TYHAL+PR+WT+Y+G        V  Q +  +  N       E     ++ T   
Sbjct: 140 SGQNSTYHALYPRSWTIYNGIAVTNNNRVYFQATTILFSNVGFCLLGEPDPDVNIVT--- 196

Query: 222 SNSGQTSADVTLLFTWANSVAGDSGLSGHHFNSKTMTKDGVHGLTLHHR----------- 270
            NSG T+ADVTLLFTWA  V  +      +       KDGVHG+ LHHR           
Sbjct: 197 -NSGNTAADVTLLFTWA-FVINEHLFYTFNILMNDREKDGVHGILLHHRLSPKQISICLN 254

Query: 271 ------TANGRPPVTFAVAAEETADVHVSECPCFLLSGNSKGITAKDMWNEIKKHGSFDH 324
                 TANG+PPVTFA+AA+E  D+H+SECP F++SG+S   +AKDMWN +K++GSFD+
Sbjct: 255 DMYESRTANGQPPVTFAIAAQEKEDIHISECPYFIISGSSDAFSAKDMWNYVKENGSFDN 314

Query: 325 LDNDKTSP-SEPGSSIGAAIAASLTIPSGSTRSVTFSLAWDCPEVKFFE-KVYHRRYTKF 382
           LD  KTS  S+PG SIGAAIAAS+ +P  +T++V+F+LAW CPEVKF   K YHRRYTKF
Sbjct: 315 LDLTKTSMCSKPGLSIGAAIAASVKLPPQTTQNVSFALAWACPEVKFSSGKTYHRRYTKF 374

Query: 383 YGTLGDSAARIAHDAILEHAKWECEIEAWQRPILEDKRFPEWYPITLFNELYYLNAGGTI 442
           +GT  D+AA +AHDAILEH  WE +IE WQ PIL+D+RFP+WYP+TLFNELYYLNAGGTI
Sbjct: 375 HGTDNDAAASLAHDAILEHNSWERQIEEWQNPILQDERFPDWYPVTLFNELYYLNAGGTI 434

Query: 443 WTDGSPPMQSLATIRERKFSLDTPS----------------------------KIHTPAS 474
           WTDG PP+QSL  I E+KFSLD  +                            +IH    
Sbjct: 435 WTDGLPPIQSLTGIGEKKFSLDMQNGDADDANGIIPRNNTASDILNQMASVLERIHASME 494

Query: 475 SDTALGTRLLENGEENIGQFLYLEGAEYVMYNTYDVHFYSSFALVMLFPKLELSIQRDFA 534
           S++A+GT LL+ GEENIGQFLYLEG EY M+NTYDVHFY+SF+L+MLFPKL+LSIQRDFA
Sbjct: 495 SNSAIGTTLLQ-GEENIGQFLYLEGIEYYMWNTYDVHFYASFSLIMLFPKLQLSIQRDFA 553

Query: 535 AAVMMHDPGTMKIMSDGKWVARKCLGAVPHDIGLDDPWFEINSYNLFNSSRWKDLNSKFV 594
           AAV+MHDP  ++++ DGKWVARK LGAVPHD+GL DPWF++N+Y L+N+ RWKDLN KFV
Sbjct: 554 AAVLMHDPEKLRMLHDGKWVARKVLGAVPHDLGLYDPWFKVNAYTLYNTDRWKDLNPKFV 613

Query: 595 LQVYRDFVATGDKNFARAVWPSVYIAMAYMEQFDKDGDGMIENEGFPDQTYDAWSANGVS 654
           LQVYRD VATGDK+FARAVWPSVY+AMAYMEQFD+D DGMIENE FPDQTYD WS  G+S
Sbjct: 614 LQVYRDVVATGDKSFARAVWPSVYMAMAYMEQFDRDKDGMIENEDFPDQTYDVWSMAGIS 673

Query: 655 AYCGGLWVAALQAASALANDVGDHASASYFWVRYQKAKAVYDSLWNGSYFNYDNSDGSSS 714
           AYCGGLWVAALQAASALA++VGD AS   FW +Y+KAK+VY  LWNGSYFNYD+ D   S
Sbjct: 674 AYCGGLWVAALQAASALAHEVGDKASEKLFWDKYEKAKSVYGKLWNGSYFNYDDGDNIMS 733

Query: 715 TSIQADQLAGQWYARACGLLPIADEAKVKKALTKIYDFNVLKVKGGMCGAMNGMQPDGRI 774
            SI ADQLAGQWYA+ACGL PI D+ K + AL KIY FNV+K K G  GAMNGM PDG +
Sbjct: 734 ASIHADQLAGQWYAKACGLFPIVDKDKAESALEKIYSFNVMKFKDGKRGAMNGMWPDGTV 793

Query: 775 DMSGLQAREIWPGVTYGLAASMIQEEMVDMAFQTAAGVYEVAWSEDGLGYSFQTPESWNN 834
           DMS +Q+REIWPGVTY LAA+MIQE MV+  F+TA G+Y  AWS +GLGYSFQTPE+WNN
Sbjct: 794 DMSAMQSREIWPGVTYALAATMIQEGMVEKGFKTAEGIYHAAWSPEGLGYSFQTPEAWNN 853

Query: 835 NDEYRSLCYMRPLTIWAMQWALTKPKLSRQ--EIKHEISDRDSSYLEQHAAFSKVASLLK 892
           +DEYRSLCYMRPL IWA+QWAL+ PKL +Q  +I  +      S+ +   +++++A LL 
Sbjct: 854 DDEYRSLCYMRPLAIWAIQWALSNPKLHKQTADIPQD------SFPKNQFSYARIAKLLH 907

Query: 893 LPKEEASKGFLKVVYDFTIGR 913
           LP++E+ K FL+V+Y+    R
Sbjct: 908 LPEDESPKSFLRVIYEIVRNR 928


>gi|115474457|ref|NP_001060825.1| Os08g0111200 [Oryza sativa Japonica Group]
 gi|42408344|dbj|BAD09496.1| putative Bile acid beta-glucosidase [Oryza sativa Japonica Group]
 gi|42408391|dbj|BAD09542.1| putative Bile acid beta-glucosidase [Oryza sativa Japonica Group]
 gi|113622794|dbj|BAF22739.1| Os08g0111200 [Oryza sativa Japonica Group]
 gi|215701490|dbj|BAG92914.1| unnamed protein product [Oryza sativa Japonica Group]
          Length = 928

 Score = 1023 bits (2644), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 520/929 (55%), Positives = 663/929 (71%), Gaps = 42/929 (4%)

Query: 8   EGENAASMPKVDPGQPPQLTWQRKLDSNVKVPTGFTLSFQEFRHLAHIGFRLYRYSKQEE 67
           E E   ++  +D  +PP ++W+R  D   K    F+++  +   +  +  ++   + ++ 
Sbjct: 5   ELEETKAVLHMDCEKPPAISWERTFDDEGKKVAMFSMTLNDMMAIVPLMIKMLGLNLKDN 64

Query: 68  AKGKVPVFNVFRKHHITSDQGIPLGGIGAGSIGRSYRGEFQRFKLFHGICDDAPVLANQF 127
           AKG   V++  +K      +G+PLGGIGAGSIGRSYRG FQ+F++F  I ++ P+LANQF
Sbjct: 65  AKGLASVYDPLKKWMDNCYRGVPLGGIGAGSIGRSYRGYFQQFQIFPSIYEEKPILANQF 124

Query: 128 SVFVSRPNGEKFSSVLCPRSPGVPKKNTDSGIESWDWNLKGENCTYHALFPRAWTVYDGE 187
           S F+SRP+G+++S+VL   +  V K    +GI SWDW LK +NCTYH LFPR+WTVY+GE
Sbjct: 125 SAFISRPDGKRYSTVLSAPNADVLKGIDKAGIGSWDWKLKEKNCTYHGLFPRSWTVYNGE 184

Query: 188 PDPELRIVCRQISPFIPHNYKESSFPASVFTFTLSNSGQTSADVTLLFTWANSVAGDSGL 247
           PDPE++I CRQISPFIPHNYKESSFP +VFTFTL NSG T ADVTLLFTWANSV G S L
Sbjct: 185 PDPEIKITCRQISPFIPHNYKESSFPVAVFTFTLHNSGSTPADVTLLFTWANSVGGKSEL 244

Query: 248 SGHHFNSKTMTKDGVHGLTLHHRTANGRPPVTFAVAAEETADVHVSECPCFLLS-GNSKG 306
           +G+H NS+ MT            TA+GRPPVTFA+A+ ET  V V+ CP F +   NS  
Sbjct: 245 TGNHKNSR-MT------------TADGRPPVTFAIASRETDGVRVTGCPRFTMGPSNSGD 291

Query: 307 ITAKDMWNEIKKHGSFDHLDN---DKTSPSEPGSSIGAAIAASLTIPSGSTRSVTFSLAW 363
           +TAKDMW++I K+GSF    N     T  S PGSSIGAA+AA+ T+ +G  R+V+F+L+W
Sbjct: 292 VTAKDMWDQINKNGSFVGDGNAAAAATGASRPGSSIGAAVAATTTVAAGGARAVSFALSW 351

Query: 364 DCPEVKF-FEKVYHRRYTKFYGTLGDSAA-RIAHDAILEHAKWECEIEAWQRPILEDKRF 421
            CPEVKF   + YHRRYTKF+GT  D+AA R+AHDA+LEH KWE +IE WQRPIL+DK  
Sbjct: 352 SCPEVKFPAGRTYHRRYTKFHGTDRDAAAERLAHDALLEHMKWESQIEEWQRPILQDKSL 411

Query: 422 PEWYPITLFNELYYLNAGGTIWTDGSPPMQSLATIRERKFSLDTPSKIHTPASS-DTALG 480
           PEWYPITLFNELYYLNAGGTIWTDG PP  +  +     F+LDT S +    S+ D  L 
Sbjct: 412 PEWYPITLFNELYYLNAGGTIWTDGQPPKNTSLSSATEPFNLDTFSTVANGGSAVDGILS 471

Query: 481 T--------------------RLLENGEENIGQFLYLEGAEYVMYNTYDVHFYSSFALVM 520
           T                     LL +GEEN+GQ LYLEG EY M+NTYDVHFY+SFAL+ 
Sbjct: 472 TVAVAAARSNTAAAAAAAMGTALLRDGEENVGQVLYLEGMEYNMWNTYDVHFYASFALLS 531

Query: 521 LFPKLELSIQRDFAAAVMMHDPGTMKIMSDGKWVARKCLGAVPHDIGLDDPWFEINSYNL 580
           LFP+LEL++QRDF   V++HDP   + + DG  VARK LGAVPHD+GL+DPWFE+N+Y L
Sbjct: 532 LFPELELNLQRDFVRGVLLHDPCLRRTL-DGATVARKVLGAVPHDMGLNDPWFEVNAYML 590

Query: 581 FNSSRWKDLNSKFVLQVYRDFVATGDKNFARAVWPSVYIAMAYMEQFDKDGDGMIENEGF 640
            +  RWKDLN KFVLQVYRD VATG+  FA A WP+VY+AMAYM+QFD+DGDGM+ENEG 
Sbjct: 591 HDPVRWKDLNPKFVLQVYRDVVATGNAGFAEAAWPAVYLAMAYMDQFDRDGDGMVENEGR 650

Query: 641 PDQTYDAWSANGVSAYCGGLWVAALQAASALANDVGDHASASYFWVRYQKAKAVY-DSLW 699
           PDQTYD WS +GVSAY GGLWVAALQAA+A+A  VGD A+ +YF  RY +A+ VY D LW
Sbjct: 651 PDQTYDLWSVSGVSAYTGGLWVAALQAAAAMAGIVGDGAAEAYFRGRYHRARRVYTDELW 710

Query: 700 NGSYFNYDNSDGSSSTSIQADQLAGQWYARACGLLPIADEAKVKKALTKIYDFNVLKVKG 759
           NG YFNYDNS G++S+SIQADQLAGQWYARACGL PI D  K ++AL  + D+NV++VKG
Sbjct: 711 NGGYFNYDNSGGATSSSIQADQLAGQWYARACGLEPIVDGDKARRALATVLDYNVMRVKG 770

Query: 760 GMCGAMNGMQPDGRIDMSGLQAREIWPGVTYGLAASMIQEEMVDMAFQTAAGVYEVAWSE 819
           G  GA+NGM+PDG +D S  Q++E+WPGVTY +AA+MI E M + AF+TA G+++  W +
Sbjct: 771 GAIGAVNGMRPDGAVDASSTQSKEVWPGVTYAVAAAMIHEGMPEAAFKTAKGIHDAGWGK 830

Query: 820 DGLGYSFQTPESWNNNDEYRSLCYMRPLTIWAMQWALTKPKLSRQEIKHEISDRDSSYLE 879
            G GY+FQTPESW  +  YR+L YMRPL +WAMQWAL+ P L ++     ++        
Sbjct: 831 HGFGYAFQTPESWTADGGYRALHYMRPLGVWAMQWALSPPVLHKEHRVAAVAASPEDAAL 890

Query: 880 QHAAFSKVASLLKLPKEEASKGFLKVVYD 908
               F KVAS+L+LP+E+  KG L+ +YD
Sbjct: 891 GQEKFDKVASMLRLPEEQQHKGILRALYD 919


>gi|242080271|ref|XP_002444904.1| hypothetical protein SORBIDRAFT_07g001140 [Sorghum bicolor]
 gi|241941254|gb|EES14399.1| hypothetical protein SORBIDRAFT_07g001140 [Sorghum bicolor]
          Length = 929

 Score = 1016 bits (2627), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 523/946 (55%), Positives = 654/946 (69%), Gaps = 59/946 (6%)

Query: 7   EEGENAASMPKVDPGQPPQLTWQRKLDSNVKVPTGFTLSFQEFRHLAHIGFRLYRYSKQE 66
           EE + A  +  +D  QPP  TWQR+ D   K    ++L+  +   +              
Sbjct: 4   EELQQAKVLSSIDCSQPPARTWQRRFDDEGKKVAMYSLTMNDLTAI-------------- 49

Query: 67  EAKGKVPVFNVFRKHHITSDQGIPLGGIGAGSIGRSYRGEFQRFKLFHGICDDAPVLANQ 126
                V V++  RK      +G+PLGGIGAGSIGRSYRG FQ+F++F    ++ P+LANQ
Sbjct: 50  -----VGVYDPLRKWMDNCYRGVPLGGIGAGSIGRSYRGYFQQFQIFPATNEEKPILANQ 104

Query: 127 FSVFVSRPNGEKFSSVLCPRSPGVPKKNTDSGIESWDWNLKGENCTYHALFPRAWTVYDG 186
           FS FVSRPNG K+S+VL   +  + K+   + I SWDW LK E CTYHALFPR+WTVYDG
Sbjct: 105 FSAFVSRPNGTKYSTVLSAPTADLLKEVDKASIGSWDWKLKEEKCTYHALFPRSWTVYDG 164

Query: 187 EPDPELRIVCRQISPFIPHNYKESSFPASVFTFTLSNSGQTSADVTLLFTWANSVAGDSG 246
           EPDPE++I  RQISPFIPHNY+ESSFPA+VFTFT+ NSG T+ADVTLLFTWANSV G S 
Sbjct: 165 EPDPEIKITSRQISPFIPHNYRESSFPAAVFTFTVHNSGSTAADVTLLFTWANSVGGKSE 224

Query: 247 LSGHHFNSKTMTKDGVHGLTLHHRTANGRPPVTFAVAAEETADVHVSECPCFLLS-GNSK 305
           L+G+H NSK   +DGVHG+ LHHRTA G PPVTFA+A++ET  V VS CP F +    S+
Sbjct: 225 LTGNHVNSKMTEQDGVHGVLLHHRTAGGHPPVTFAIASQETDGVRVSVCPSFTMGPSGSR 284

Query: 306 GITAKDMWNEIKKH--GSFDHLDNDKTSPSEPGSSIGAAIAASLTIPSGSTRSVTFSLAW 363
             TA DMW+EIKKH         N   + S+ GSS+GAA+AAS T+P+G TR V+FSLAW
Sbjct: 285 DFTAADMWDEIKKHGAFGHAGAGNAARAASKAGSSLGAAVAASTTVPAGETRVVSFSLAW 344

Query: 364 DCPEVKF-FEKVYHRRYTKFYGTLGDSAA-RIAHDAILEHAKWECEIEAWQRPILEDKRF 421
            CP++KF     YHRRYTKFYG   D+AA ++AHDA+LEH  WE +IE WQRPIL DKR 
Sbjct: 345 ACPDIKFPAGSTYHRRYTKFYGVDADAAAEQLAHDALLEHMNWESQIEEWQRPILNDKRL 404

Query: 422 PEWYPITLFNELYYLNAGGTIWTDGSPPMQ---SLATIRERKFSLDT------------- 465
           PEWYP+ LFNELYYLNAGGTIWTDG PP +   + +      FS+DT             
Sbjct: 405 PEWYPVALFNELYYLNAGGTIWTDGQPPKKAGFASSLTGTEPFSIDTLLAEDGGTGGSAV 464

Query: 466 -------PSKIHTPASSDTALGTRLLENGEENIGQFLYLEGAEYVMYNTYDVHFYSSFAL 518
                   S      S+  A G  LL +GE+N+GQFLYLEG EY MYNTYDVHFY+SFAL
Sbjct: 465 DGVVRAVASAKELSHSAAAAFGAALLRDGEDNVGQFLYLEGMEYNMYNTYDVHFYASFAL 524

Query: 519 VMLFPKLELSIQRDFAAAVMMHDPGTMKIMSDGKWVARKCLGAVPHDIGLDDPWFEINSY 578
           + LFP LELS+QRDFA AV++HDP   +   DG+ V RK LG VPHD+GL+DPW E+N Y
Sbjct: 525 LSLFPSLELSLQRDFARAVLLHDPRRRRTF-DGRTVPRKVLGTVPHDVGLNDPWVEMNEY 583

Query: 579 NLFNSSRWKDLNSKFVLQVYRDFVATGDKNFARAVWPSVYIAMAYMEQFDKDGDGMIENE 638
            L + + WKDLN KFVLQVYRD VATGD  FA A WP+VYIAMAYM QFD+DGDGM+ENE
Sbjct: 584 MLHDPALWKDLNPKFVLQVYRDAVATGDAAFAEAAWPAVYIAMAYMHQFDRDGDGMVENE 643

Query: 639 GFPDQTYDAWSANGVSAYCGGLWVAALQAASALANDVGDHASASYFWVRYQKAKAVYDS- 697
           G PDQTYD WS +GVSAY GGLWVAAL+AA+ +A  VGD  + +YF  R  +A  VYD+ 
Sbjct: 644 GIPDQTYDIWSVSGVSAYTGGLWVAALEAAAGMARVVGDRHAEAYFRGRRDRAARVYDAE 703

Query: 698 LWNGSYFNYDNSDGSS-STSIQADQLAGQWYARACGLLPIADEAKVKKALTKIYDFNVLK 756
           LWNG+YF YDNS G + S SI ADQ+AGQWYARACG+ P+ +E K + AL  + D+NV++
Sbjct: 704 LWNGTYFRYDNSGGGACSESIMADQMAGQWYARACGMEPVVEEGKARSALATVLDYNVMR 763

Query: 757 VKGGMCGAMNGMQPDGRIDMSGLQAREIWPGVTYGLAASMIQEEMVDMAFQTAAGVYEVA 816
           VKGG  GA+NGM+PDG +D S  Q++E+WPGVTY +AA+M+ E M + AF+TA G ++  
Sbjct: 764 VKGGAVGAVNGMRPDGAVDASSGQSKEVWPGVTYAVAAAMVHEGMHEAAFRTAKGAHDAG 823

Query: 817 WSEDGLGYSFQTPESWNNNDEYRSLCYMRPLTIWAMQWALTKPKLSR-------QEIKHE 869
           W +DG GY+FQTPE+W  +  YRSL YMRPL IWAMQWAL+ P+L +       + ++ E
Sbjct: 824 WGKDGFGYAFQTPEAWTEDGGYRSLHYMRPLGIWAMQWALSPPELHKDLRAPGARSVEEE 883

Query: 870 ISDRDSSYLEQHAAFSKVASLLKLPKEEASKGFLKVVYDFTIGRVL 915
            S  D++     A F KVA++L+LP+E+  KG+L  +Y      VL
Sbjct: 884 SSPADAAL--GQAQFEKVANMLRLPEEKQPKGYLWAIYKVIRQMVL 927


>gi|357144426|ref|XP_003573288.1| PREDICTED: non-lysosomal glucosylceramidase-like isoform 1
           [Brachypodium distachyon]
          Length = 932

 Score = 1005 bits (2599), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 524/924 (56%), Positives = 661/924 (71%), Gaps = 27/924 (2%)

Query: 5   VKEEGENAASMPKVDPGQPPQLTWQRKLDSNVKVPTGFTLSFQEFRHLAHIGFRLYRYSK 64
           +K E   A S+  +D  QP   TWQRK D   K    F+++  +   +  + F+    + 
Sbjct: 4   LKLEQTKAHSL--IDCAQPATRTWQRKFDDEGKKIEMFSMTMNDMILIVPMIFKGLVRNA 61

Query: 65  QEEAKGKVPVFNVFRKHHITSDQGIPLGGIGAGSIGRSYRGEFQRFKLFHGICDDAPVLA 124
           +E  KG+  +++ F+K      +G+PLGG+GAGSIGRSYRG FQ F++F  I +  P+LA
Sbjct: 62  EERGKGRDVIYDPFKKWMDNCYRGLPLGGLGAGSIGRSYRGYFQHFQIFPAIYEQKPILA 121

Query: 125 NQFSVFVSRPNGEKFSSVLCPRSPGVPKKNTDSGIESWDWNLKGENCTYHALFPRAWTVY 184
           NQFS F+SRPNG+ +S+VL   +  V K    +GI SWDW LK +NCTYHALFPR+WTVY
Sbjct: 122 NQFSAFISRPNGKSYSTVLSAPTADVLKGVDKAGIGSWDWKLKEKNCTYHALFPRSWTVY 181

Query: 185 DGEPDPELRIVCRQISPFIPHNYKESSFPASVFTFTLSNSGQTSADVTLLFTWANSVAGD 244
           DGEPDP ++I CRQISPFIPHNYKESSFP  VFTFT+ N+G T ADVTLLFTWANSV G 
Sbjct: 182 DGEPDPAIKITCRQISPFIPHNYKESSFPVVVFTFTVHNTGSTPADVTLLFTWANSVGGR 241

Query: 245 SGLSGHHFNSKTMTKDGVHGLTLHHRTAN-GRPPVTFAVAAEETADVHVSECPCFLLSGN 303
           S L+G+H NSK M +DGVHG+ LHHRTAN G PPVTFA+A++ET DV V++CP F +  +
Sbjct: 242 SELTGNHTNSKMMERDGVHGVLLHHRTANDGHPPVTFAIASQETEDVRVTDCPSFTMGSS 301

Query: 304 SKG-ITAKDMWNEIKKHGSFDHLDNDKTSPSEPGSSIGAAIAASLTIPSGSTRSVTFSLA 362
           S G  TAKDMW EIKKHGSF   D +  + S+PGSSIGAA+AA+ T+P+G TR V+F+L+
Sbjct: 302 SSGDFTAKDMWQEIKKHGSFITTDKEPRA-SKPGSSIGAAVAAATTVPAGGTRVVSFALS 360

Query: 363 WDCPEVKF-FEKVYHRRYTKFYGTLGDSAAR-IAHDAILEHAKWECEIEAWQRPILEDKR 420
           W CPEVKF   + YHRRYT F+G   D+AA  +AHDA+LEH  WE +IE WQRPIL+DKR
Sbjct: 361 WSCPEVKFPAGRTYHRRYTTFHGLDRDAAAESLAHDALLEHMDWESKIEEWQRPILQDKR 420

Query: 421 FPEWYPITLFNELYYLNAGGTIWTDGSPPMQSLAT--IRERKFSLDTP-------SKIHT 471
            P+WYP+ LFNELYYLNAGGTIWTDG PP  S+++  +    FSL T          I +
Sbjct: 421 LPDWYPVALFNELYYLNAGGTIWTDGLPPKMSVSSSGVGTEPFSLATTYLPGTAVDGILS 480

Query: 472 PASSDT-------ALGTRLLENGEENIGQFLYLEGAEYVMYNTYDVHFYSSFALVMLFPK 524
             +S T       A G  LL +GEEN+GQFLYLE  EY MYNTYDVHFY+SFAL+ LFP+
Sbjct: 481 AVASATEDQHSAAAFGATLLGDGEENVGQFLYLEATEYYMYNTYDVHFYASFALLSLFPE 540

Query: 525 LELSIQRDFAAAVMMHDPGTMKIMSDGKWVARKCLGAVPHDIGLDDPWFEINSYNLFNSS 584
           LELS+QRDFA AV++HDP  M +  DG  V RK LGAVPHDIGL+DPWFE+N+Y + + +
Sbjct: 541 LELSLQRDFARAVLIHDPRPM-VTLDGATVLRKVLGAVPHDIGLNDPWFELNAYMIHDPA 599

Query: 585 RWKDLNSKFVLQVYRDFVATGDKNFARAVWPSVYIAMAYMEQFDKDGDGMIENEGFPDQT 644
           RWKDLN KFVLQVYR  V TG+  FA+A WP+VY+AMAYM+Q+D+DGDGM+ENEG PDQT
Sbjct: 600 RWKDLNPKFVLQVYRAVVVTGNAAFAKAAWPAVYLAMAYMDQYDRDGDGMVENEG-PDQT 658

Query: 645 YDAWSANGVSAYCGGLWVAALQAASALANDVGDHASASYFWVRYQKAKAVYDS-LWNGSY 703
           YD WS  GVSAY GG+WV ALQA +A+A  VGD  +  YF  RY +AK VYD+ LWNG+Y
Sbjct: 659 YDLWSVTGVSAYTGGIWVTALQATAAMARIVGDGDAECYFHGRYLRAKQVYDAELWNGTY 718

Query: 704 FNYDNSDGSSSTSIQADQLAGQWYARACGLLPIADEAKVKKALTKIYDFNVLKVKGGMCG 763
           F YDNS G +S SI ADQLAGQWYA  CGL P+ +E K + AL  + D+NV++VKGG  G
Sbjct: 719 FRYDNSGGETSESIMADQLAGQWYAHVCGLEPVVEEDKARSALATVLDYNVMRVKGGAVG 778

Query: 764 AMNGMQPDGRIDMSGLQAREIWPGVTYGLAASMIQEEMVDMAFQTAAGVYEVAWSEDGLG 823
           A+NGM+PDG IDMS  Q++EIW G TY +AA+M+ E M + AF+TA G ++ +W + G G
Sbjct: 779 AVNGMRPDGGIDMSSTQSKEIWSGTTYAVAAAMVHEGMPEGAFRTAKGAHDASWGKAGFG 838

Query: 824 YSFQTPESWNNNDEYRSLCYMRPLTIWAMQWALTKPKLSRQEIKHEISDRDSSYLEQHAA 883
           Y+FQTPE+W     YR L YMRPL++WAMQWAL+ P+L +      +S    +       
Sbjct: 839 YAFQTPEAWTAEGGYRGLHYMRPLSVWAMQWALSPPELHKDLGLASVSPPGDA-ARGREK 897

Query: 884 FSKVASLLKLPKEEASKGFLKVVY 907
           F KVA +L+LP++   KGF   +Y
Sbjct: 898 FDKVAGMLRLPEKVQHKGFFLALY 921


>gi|222639789|gb|EEE67921.1| hypothetical protein OsJ_25789 [Oryza sativa Japonica Group]
          Length = 919

 Score = 1004 bits (2596), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 511/929 (55%), Positives = 653/929 (70%), Gaps = 51/929 (5%)

Query: 8   EGENAASMPKVDPGQPPQLTWQRKLDSNVKVPTGFTLSFQEFRHLAHIGFRLYRYSKQEE 67
           E E   ++  +D  +PP ++W+R  D   K    F+++  +   +  +  ++   + ++ 
Sbjct: 5   ELEETKAVLHMDCEKPPAISWERTFDDEGKKVAMFSMTLNDMMAIVPLMIKMLGLNLKDN 64

Query: 68  AKGKVPVFNVFRKHHITSDQGIPLGGIGAGSIGRSYRGEFQRFKLFHGICDDAPVLANQF 127
           AKG   V++  +K      +G+PLGGIGAGSIGRSYRG FQ+F++F  I ++ P+LANQF
Sbjct: 65  AKGLASVYDPLKKWMDNCYRGVPLGGIGAGSIGRSYRGYFQQFQIFPSIYEEKPILANQF 124

Query: 128 SVFVSRPNGEKFSSVLCPRSPGVPKKNTDSGIESWDWNLKGENCTYHALFPRAWTVYDGE 187
           S F+SRP+G+++S+VL   +  V K    +GI SWDW LK +NCTYH LFPR+WTVY+GE
Sbjct: 125 SAFISRPDGKRYSTVLSAPNADVLKGIDKAGIGSWDWKLKEKNCTYHGLFPRSWTVYNGE 184

Query: 188 PDPELRIVCRQISPFIPHNYKESSFPASVFTFTLSNSGQTSADVTLLFTWANSVAGDSGL 247
           PDPE++I CRQISPFIPHNYKESSFP +VFTFTL NSG T ADVTLLFTWANSV G S L
Sbjct: 185 PDPEIKITCRQISPFIPHNYKESSFPVAVFTFTLHNSGSTPADVTLLFTWANSVGGKSEL 244

Query: 248 SGHHFNSKTMTKDGVHGLTLHHRTANGRPPVTFAVAAEETADVHVSECPCFLLS-GNSKG 306
           +G+H NS+   +DGVHG+ LHHRTA+GRPPVTFA+A+ ET  V V+ CP F +   NS  
Sbjct: 245 TGNHKNSRMTARDGVHGVLLHHRTADGRPPVTFAIASRETDGVRVTGCPRFTMGPSNSGD 304

Query: 307 ITAKDMWNEIKKHGSFDHLDN---DKTSPSEPGSSIGAAIAASLTIPSGSTRSVTFSLAW 363
           +TAKDMW++I K+GSF    N     T  S PGSSIGAA+AA+ T+ +G  R+V+F+L+W
Sbjct: 305 VTAKDMWDQINKNGSFVGDGNAAAAATGASRPGSSIGAAVAATTTVAAGGARAVSFALSW 364

Query: 364 DCPEVKF-FEKVYHRRYTKFYGTLGDSAA-RIAHDAILEHAKWECEIEAWQRPILEDKRF 421
            CPEVKF   + YHRRYTKF+GT  D+AA R+AHDA+LEH KWE +IE WQRPIL+DK  
Sbjct: 365 SCPEVKFPAGRTYHRRYTKFHGTDRDAAAERLAHDALLEHMKWESQIEEWQRPILQDKSL 424

Query: 422 PEWYPITLFNELYYLNAGGTIWTDGSPPMQSLATIRERKFSLDTPSKIHTPASS-DTALG 480
           PEWYPITLFNELYYLNAGGTIWTDG PP  +  +     F+LDT S +    S+ D  L 
Sbjct: 425 PEWYPITLFNELYYLNAGGTIWTDGQPPKNTSLSSATEPFNLDTFSTVANGGSAVDGILS 484

Query: 481 T--------------------RLLENGEENIGQFLYLEGAEYVMYNTYDVHFYSSFALVM 520
           T                     LL +GEEN+GQ LYLEG EY M+NTYDVHFY+SFAL+ 
Sbjct: 485 TVAVAAARSNTAAAAAAAMGTALLRDGEENVGQVLYLEGMEYNMWNTYDVHFYASFALLS 544

Query: 521 LFPKLELSIQRDFAAAVMMHDPGTMKIMSDGKWVARKCLGAVPHDIGLDDPWFEINSYNL 580
           LFP+LEL++QRDF   V++HDP   + + DG  VARK LGAVPHD+GL+DPWFE+N+Y L
Sbjct: 545 LFPELELNLQRDFVRGVLLHDPCLRRTL-DGATVARKVLGAVPHDMGLNDPWFEVNAYML 603

Query: 581 FNSSRWKDLNSKFVLQVYRDFVATGDKNFARAVWPSVYIAMAYMEQFDKDGDGMIENEGF 640
            +  RWKDLN KFVLQVYRD VATG+  FA A WP+VY+AMAYM+QFD+DGDGM+ENEG 
Sbjct: 604 HDPVRWKDLNPKFVLQVYRDVVATGNAGFAEAAWPAVYLAMAYMDQFDRDGDGMVENEGR 663

Query: 641 PDQTYDAWSANGVSAYCGGLWVAALQAASALANDVGDHASASYFWVRYQKAKAVY-DSLW 699
           PDQTYD WS +GVSAY GGLWVAALQAA+A+A  VGD A+ +YF  RY +A+ VY D LW
Sbjct: 664 PDQTYDLWSVSGVSAYTGGLWVAALQAAAAMAGIVGDGAAEAYFRGRYHRARRVYTDELW 723

Query: 700 NGSYFNYDNSDGSSSTSIQADQLAGQWYARACGLLPIADEAKVKKALTKIYDFNVLKVKG 759
           NG YFNYDNS G++S+SIQADQLAGQWYARACGL PI D  K ++AL  + D+NV++   
Sbjct: 724 NGGYFNYDNSGGATSSSIQADQLAGQWYARACGLEPIVDGDKARRALATVLDYNVMRT-- 781

Query: 760 GMCGAMNGMQPDGRIDMSGLQAREIWPGVTYGLAASMIQEEMVDMAFQTAAGVYEVAWSE 819
                                ++E+ PGVTY +AA+MI E M + AF+TA G+++  W +
Sbjct: 782 --------------------HSKEVCPGVTYAVAAAMIHEGMPEAAFKTAKGIHDAGWGK 821

Query: 820 DGLGYSFQTPESWNNNDEYRSLCYMRPLTIWAMQWALTKPKLSRQEIKHEISDRDSSYLE 879
            G GY+FQTPESW  +  YR+L YMRPL +WAMQWAL+ P L ++     ++        
Sbjct: 822 HGFGYAFQTPESWTADGGYRALHYMRPLGVWAMQWALSPPVLHKEHRVAAVAASPEDAAL 881

Query: 880 QHAAFSKVASLLKLPKEEASKGFLKVVYD 908
               F KVAS+L+LP+E+  KG L+ +YD
Sbjct: 882 GQEKFDKVASMLRLPEEQQHKGILRALYD 910


>gi|357144429|ref|XP_003573289.1| PREDICTED: non-lysosomal glucosylceramidase-like isoform 2
           [Brachypodium distachyon]
          Length = 919

 Score =  975 bits (2520), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 512/923 (55%), Positives = 650/923 (70%), Gaps = 38/923 (4%)

Query: 5   VKEEGENAASMPKVDPGQPPQLTWQRKLDSNVKVPTGFTLSFQEFRHLAHIGFRLYRYSK 64
           +K E   A S+  +D  QP   TWQRK D   K    F+++  +   +  + F+    + 
Sbjct: 4   LKLEQTKAHSL--IDCAQPATRTWQRKFDDEGKKIEMFSMTMNDMILIVPMIFKGLVRNA 61

Query: 65  QEEAKGKVPVFNVFRKHHITSDQGIPLGGIGAGSIGRSYRGEFQRFKLFHGICDDAPVLA 124
           +E  KG+  +++ F+K      +G+PLGG+GAGSIGRSYRG FQ F++F  I +  P+LA
Sbjct: 62  EERGKGRDVIYDPFKKWMDNCYRGLPLGGLGAGSIGRSYRGYFQHFQIFPAIYEQKPILA 121

Query: 125 NQFSVFVSRPNGEKFSSVLCPRSPGVPKKNTDSGIESWDWNLKGENCTYHALFPRAWTVY 184
           NQFS F+SRPNG+ +S+VL   +  V K    +GI SWDW LK +NCTYHALFPR+WTVY
Sbjct: 122 NQFSAFISRPNGKSYSTVLSAPTADVLKGVDKAGIGSWDWKLKEKNCTYHALFPRSWTVY 181

Query: 185 DGEPDPELRIVCRQISPFIPHNYKESSFPASVFTFTLSNSGQTSADVTLLFTWANSVAGD 244
           DGEPDP ++I CRQISPFIPHNYKESSFP  VFTFT+ N+G T ADVTLLFTWANSV G 
Sbjct: 182 DGEPDPAIKITCRQISPFIPHNYKESSFPVVVFTFTVHNTGSTPADVTLLFTWANSVGGR 241

Query: 245 SGLSGHHFNSKTMTKDGVHGLTLHHRTANGRPPVTFAVAAEETADVHVSECPCFLLSGNS 304
           S L+G+H NSK +T +            +G PPVTFA+A++ET DV V++CP F +  +S
Sbjct: 242 SELTGNHTNSKMITAN------------DGHPPVTFAIASQETEDVRVTDCPSFTMGSSS 289

Query: 305 KG-ITAKDMWNEIKKHGSFDHLDNDKTSPSEPGSSIGAAIAASLTIPSGSTRSVTFSLAW 363
            G  TAKDMW EIKKHGSF   D +  + S+PGSSIGAA+AA+ T+P+G TR V+F+L+W
Sbjct: 290 SGDFTAKDMWQEIKKHGSFITTDKEPRA-SKPGSSIGAAVAAATTVPAGGTRVVSFALSW 348

Query: 364 DCPEVKF-FEKVYHRRYTKFYGTLGDSAAR-IAHDAILEHAKWECEIEAWQRPILEDKRF 421
            CPEVKF   + YHRRYT F+G   D+AA  +AHDA+LEH  WE +IE WQRPIL+DKR 
Sbjct: 349 SCPEVKFPAGRTYHRRYTTFHGLDRDAAAESLAHDALLEHMDWESKIEEWQRPILQDKRL 408

Query: 422 PEWYPITLFNELYYLNAGGTIWTDGSPPMQSLAT--IRERKFSLDTP-------SKIHTP 472
           P+WYP+ LFNELYYLNAGGTIWTDG PP  S+++  +    FSL T          I + 
Sbjct: 409 PDWYPVALFNELYYLNAGGTIWTDGLPPKMSVSSSGVGTEPFSLATTYLPGTAVDGILSA 468

Query: 473 ASSDT-------ALGTRLLENGEENIGQFLYLEGAEYVMYNTYDVHFYSSFALVMLFPKL 525
            +S T       A G  LL +GEEN+GQFLYLE  EY MYNTYDVHFY+SFAL+ LFP+L
Sbjct: 469 VASATEDQHSAAAFGATLLGDGEENVGQFLYLEATEYYMYNTYDVHFYASFALLSLFPEL 528

Query: 526 ELSIQRDFAAAVMMHDPGTMKIMSDGKWVARKCLGAVPHDIGLDDPWFEINSYNLFNSSR 585
           ELS+QRDFA AV++HDP  M +  DG  V RK LGAVPHDIGL+DPWFE+N+Y + + +R
Sbjct: 529 ELSLQRDFARAVLIHDPRPM-VTLDGATVLRKVLGAVPHDIGLNDPWFELNAYMIHDPAR 587

Query: 586 WKDLNSKFVLQVYRDFVATGDKNFARAVWPSVYIAMAYMEQFDKDGDGMIENEGFPDQTY 645
           WKDLN KFVLQVYR  V TG+  FA+A WP+VY+AMAYM+Q+D+DGDGM+ENEG PDQTY
Sbjct: 588 WKDLNPKFVLQVYRAVVVTGNAAFAKAAWPAVYLAMAYMDQYDRDGDGMVENEG-PDQTY 646

Query: 646 DAWSANGVSAYCGGLWVAALQAASALANDVGDHASASYFWVRYQKAKAVYDS-LWNGSYF 704
           D WS  GVSAY GG+WV ALQA +A+A  VGD  +  YF  RY +AK VYD+ LWNG+YF
Sbjct: 647 DLWSVTGVSAYTGGIWVTALQATAAMARIVGDGDAECYFHGRYLRAKQVYDAELWNGTYF 706

Query: 705 NYDNSDGSSSTSIQADQLAGQWYARACGLLPIADEAKVKKALTKIYDFNVLKVKGGMCGA 764
            YDNS G +S SI ADQLAGQWYA  CGL P+ +E K + AL  + D+NV++VKGG  GA
Sbjct: 707 RYDNSGGETSESIMADQLAGQWYAHVCGLEPVVEEDKARSALATVLDYNVMRVKGGAVGA 766

Query: 765 MNGMQPDGRIDMSGLQAREIWPGVTYGLAASMIQEEMVDMAFQTAAGVYEVAWSEDGLGY 824
           +NGM+PDG IDMS  Q++EIW G TY +AA+M+ E M + AF+TA G ++ +W + G GY
Sbjct: 767 VNGMRPDGGIDMSSTQSKEIWSGTTYAVAAAMVHEGMPEGAFRTAKGAHDASWGKAGFGY 826

Query: 825 SFQTPESWNNNDEYRSLCYMRPLTIWAMQWALTKPKLSRQEIKHEISDRDSSYLEQHAAF 884
           +FQTPE+W     YR L YMRPL++WAMQWAL+ P+L +      +S    +       F
Sbjct: 827 AFQTPEAWTAEGGYRGLHYMRPLSVWAMQWALSPPELHKDLGLASVSPPGDA-ARGREKF 885

Query: 885 SKVASLLKLPKEEASKGFLKVVY 907
            KVA +L+LP++   KGF   +Y
Sbjct: 886 DKVAGMLRLPEKVQHKGFFLALY 908


>gi|357139475|ref|XP_003571307.1| PREDICTED: non-lysosomal glucosylceramidase-like [Brachypodium
           distachyon]
          Length = 946

 Score =  946 bits (2444), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 512/935 (54%), Positives = 643/935 (68%), Gaps = 36/935 (3%)

Query: 8   EGENAASMPKVDPGQPPQLTWQRKLDSNVKVPTGFTLSFQEFRHLAHIGFRLYRYSKQEE 67
           E E   +   +D  QPP+ TWQRK D   K    F+ +  +   +A +  ++ R   +E 
Sbjct: 5   ELEQTKAHSHIDCAQPPERTWQRKFDDEGKEIAMFSFTMNDIMTVAPLIVKMMRILVEER 64

Query: 68  AKGKVPVFNVFRKHHITSDQGIPLGGIGAGSIGRSYRGEFQRFKLFHGICDDAPVLANQF 127
           AKGK  V++ F+K      +G+PLG +GAGSIGRSYRG FQ F+LF GI +  P+LANQF
Sbjct: 65  AKGKASVYDPFKKWMSNCYRGVPLGTLGAGSIGRSYRGYFQHFQLFPGIYEQKPILANQF 124

Query: 128 SVFVSRPNGEKFSSVLCPRSPGVPKKNTDSGIESWDWNLKGENCTYHALFPRAWTVYDGE 187
           S FVSRP G+ +S+VL      V K    + I SWDWNLK +NC+YH LFPR+WTVYDGE
Sbjct: 125 SAFVSRPGGKSYSTVLSAPPAEVLKGIDKASIGSWDWNLKEKNCSYHGLFPRSWTVYDGE 184

Query: 188 PDPELRIVCRQISPFIPHNYKESSFPASVFTFTLSNSGQTSADVTLLFTWANSVAGDSGL 247
           PDPE++I CRQISPFIP NYKESSFP +VFTFT+ NSG T ADVTLLFTWANSV G S L
Sbjct: 185 PDPEIKITCRQISPFIPRNYKESSFPVAVFTFTVQNSGSTPADVTLLFTWANSVGGKSEL 244

Query: 248 SGHHFNSKTMTKDGVHGLTLHHRTANGRPPVTFAVAAEETADVHVSECPCFLLSGNSKG- 306
           +G+H NS    +DGVHG+ L H TA G PPVTFA+A+++T DV V+ CP F +  +SK  
Sbjct: 245 TGNHSNSTMRARDGVHGVLLRHSTAEGHPPVTFAIASQDTGDVRVTCCPSFSMGPSSKSG 304

Query: 307 -ITAKDMWNEIKKHGSFDHLDNDKTSPSEPGSSIGAAIAASLTIPSGSTRSVTFSLAWDC 365
             TAK+MW EIKKHGSF         PS PGSS+GAA+AA+ T+ +G TR V+F+L+W C
Sbjct: 305 EPTAKEMWEEIKKHGSFGDAAAGTGRPSRPGSSVGAAVAAATTVTAGCTREVSFALSWSC 364

Query: 366 PEVKF-FEKVYHRRYTKFYGTLGDSAA-RIAHDAILEHAKWECEIEAWQRPILEDKRFPE 423
           P VKF   + YHRR+TKF G   D+AA ++AH A+LEH +WE +IEAWQRP+L+DK  P+
Sbjct: 365 PVVKFPAGRTYHRRHTKFVGLDRDAAAEQLAHHALLEHMEWERQIEAWQRPVLQDKSLPD 424

Query: 424 WYPITLFNELYYLNAGGTIWTDGSP----PMQSLATIRERKFSLDTPSKIHTPASSDTAL 479
           WYP+ LFNELYYLNAGGTIWTDG P     + S ++     FSL          SS TA 
Sbjct: 425 WYPVALFNELYYLNAGGTIWTDGMPSKKTSLVSSSSGTMEPFSLAAFHPDPNATSSSTAA 484

Query: 480 GTRLL------------ENGEENIGQFLYLEGAEYVMYNTYDVHFYSSFALVMLFPKLEL 527
              LL               E+ +G+FLYLEG EY MYNTYDVHFY+SFAL+ LFP LEL
Sbjct: 485 DDILLAMARAEEHLLPAAEDEKGVGKFLYLEGMEYHMYNTYDVHFYASFALLSLFPDLEL 544

Query: 528 SIQRDFAAAVMMHDPGTMKIMSDGKWVARKCLGAVPHDIGLDDPWFEINSYNLFNSSRWK 587
           S+QRDFAAAV+ HDP  M  + DGK+V RK LGAVPHDIGL+DPW E+N+Y + + SRWK
Sbjct: 545 SLQRDFAAAVLRHDPRLMYTL-DGKFVPRKVLGAVPHDIGLNDPWHELNAYMIHDPSRWK 603

Query: 588 DLNSKFVLQVYRDFVATGDKNFARAVWPSVYIAMAYMEQFDKDGDGMIENEGFPDQTYDA 647
           DLN KFVLQVYRD  ATGD +FAR+ WPSVY+A+AYM+QFD+D DGM+ENEG PDQTYD 
Sbjct: 604 DLNPKFVLQVYRDVAATGDLDFARSAWPSVYVALAYMDQFDRDRDGMVENEGRPDQTYDL 663

Query: 648 WSANGVSAYCGGLWVAALQAASALANDVGDHASASYFWVRYQKAKAVYDS-LWNGSYFNY 706
           WS +GVSAY GGLWVAAL+AA+A+A  VGD  + + F  RY +A  VYDS LW G YF Y
Sbjct: 664 WSVSGVSAYTGGLWVAALRAATAMAALVGDLPAEAVFLERYNRANKVYDSELWTGDYFRY 723

Query: 707 DNSDGSSSTSIQADQLAGQWYARACGLLPIADEAKVKKALTKIYDFNVLKVKGGMCGAMN 766
           DNS G +S S+ ADQLAGQWYARACGL P+    K ++AL  + + NV++V+GG  GA+N
Sbjct: 724 DNSGGGNSESVMADQLAGQWYARACGLEPVVGRDKARRALAAVLEHNVMQVQGGGVGAVN 783

Query: 767 GMQ-PD--GRIDMSGLQAREIWPGVTYGLAASMIQEEMVDMAFQTAAGVYEVAWSEDGLG 823
           G + P+  G +D S  Q++E+W G TY +AA+MI E M +  F  A G Y   W EDG G
Sbjct: 784 GARLPEHGGGVDESSTQSKEVWTGTTYAVAAAMIGEGMREEGFTAAKGAYGAGWGEDGYG 843

Query: 824 YSFQTPESWNNNDE--YRSLCYMRPLTIWAMQWALTK-----PKLSRQEIKHEISDRDSS 876
           Y+FQ PESW  +    YRSL YMRPL +WAMQWAL+      P+L R   K + ++ +  
Sbjct: 844 YAFQMPESWTADGAGGYRSLHYMRPLAVWAMQWALSPPTPVLPELERMAAKSKATEEEVE 903

Query: 877 YLEQHAAFSKVASLLKLP--KEEASKGFLKVVYDF 909
             ++   F KVASLL+LP  +++  KG+L+  Y  
Sbjct: 904 LAKEK--FEKVASLLRLPEEEKKKEKGYLRAFYQI 936


>gi|326517703|dbj|BAK03770.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 821

 Score =  942 bits (2434), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 478/812 (58%), Positives = 595/812 (73%), Gaps = 39/812 (4%)

Query: 130 FVSRPNGEKFSSVLCPRSPGVPKKNTDSGIESWDWNLKGENCTYHALFPRAWTVYDGEPD 189
           FVS P G+ +S+VL   +  + K    +GI SWDW LK ENC YH LFPR+WTVYDGEPD
Sbjct: 2   FVSHPGGKSYSTVLSAPTADLLKGIDKAGIGSWDWKLKEENCNYHGLFPRSWTVYDGEPD 61

Query: 190 PELRIVCRQISPFIPHNYKESSFPASVFTFTLSNSGQTSADVTLLFTWANSVAGDSGLSG 249
            +++I CRQISPFIPHNYKESSFP +VFTFT+ NSG T ADVTLLFTWANSV G S L+G
Sbjct: 62  TKIKITCRQISPFIPHNYKESSFPVTVFTFTVQNSGSTPADVTLLFTWANSVGGRSELTG 121

Query: 250 HHFNSKTMTKDGVHGLTLHHRTANGRPPVTFAVAAEETADVHVSECPCFLLSGNSKG--- 306
           +H NS+   +DGVHG+ L HRTA+G PPVTFA+A+EET+DV V+ CP F +  +S G   
Sbjct: 122 NHTNSRIKARDGVHGVLLRHRTADGNPPVTFAIASEETSDVRVTCCPTFAMGPSSPGAEQ 181

Query: 307 ITAKDMWNEIKKHGSF-DHLDNDKTSPSEPGSSIGAAIAASLTIPSGSTRSVTFSLAWDC 365
            TAKDMW+E+K  GSF        T+ S PGSSIGAA+A++  +P+G TR V+F+L+W C
Sbjct: 182 FTAKDMWDEVKNRGSFGGDAGGAPTASSRPGSSIGAAVASTTKVPAGDTRVVSFALSWSC 241

Query: 366 PEVKFFE-KVYHRRYTKFYGTLGDSAA-RIAHDAILEHAKWECEIEAWQRPILEDKRFPE 423
           PEVKF   + YHRRYTKF G   D+AA ++ HDA+LEH KWE  IE WQ PIL+D+R PE
Sbjct: 242 PEVKFPSGRTYHRRYTKFLGLDRDAAAEQLVHDALLEHMKWESLIEEWQTPILQDRRLPE 301

Query: 424 WYPITLFNELYYLNAGGTIWTDGSPPMQ-SLATIR----ERKFSLD-------------- 464
           WYP+ LFNELYYLNAGGTIWTDG PP + S A+ +       FSLD              
Sbjct: 302 WYPVALFNELYYLNAGGTIWTDGMPPKKTSFASSKYGSTAESFSLDGFRAGDPAVDGILR 361

Query: 465 ---TPSKIHTPASSDTALGTRLLENGEENIGQFLYLEGAEYVMYNTYDVHFYSSFALVML 521
              T  +   P+S   A GT LL +GEEN+GQFLYLEG EY M+NTYDVHFY+SF+L+ L
Sbjct: 362 AMATAEERLEPSS---AFGTALLGDGEENVGQFLYLEGMEYHMWNTYDVHFYASFSLLSL 418

Query: 522 FPKLELSIQRDFAAAVMMHDPGTMKIMSDGKWVARKCLGAVPHDIGLDDPWFEINSYNLF 581
           FP++ELS+QRDFA AV++HDP  M+ + DG  V RK LGAVPHDIGL DPWFE+N+Y + 
Sbjct: 419 FPEIELSLQRDFARAVLLHDPRPMRTL-DGVDVPRKVLGAVPHDIGLADPWFELNAYMIH 477

Query: 582 NSSRWKDLNSKFVLQVYRDFVATGDKNFARAVWPSVYIAMAYMEQFDKDGDGMIENEGFP 641
           + SRWKDLN KFVLQVYRD  ATG+  FA A WP+VY+AMAYM+QFD+DGDGM+ENEG P
Sbjct: 478 DPSRWKDLNPKFVLQVYRDVAATGNVTFATAAWPAVYLAMAYMDQFDRDGDGMVENEGRP 537

Query: 642 DQTYDAWSANGVSAYCGGLWVAALQAASALANDVGDHASASYFWVRYQKAKAVYDS-LWN 700
           DQTYD WS +GVSAY GGLW+AALQAA+A+A  VGD  +  YF  RY++A+ VYD  LWN
Sbjct: 538 DQTYDLWSVSGVSAYTGGLWLAALQAAAAMARIVGDRGAEGYFLERYKRAQRVYDGELWN 597

Query: 701 GSYFNYDNSDGSSSTSIQADQLAGQWYARACGLLPIADEAKVKKALTKIYDFNVLKVKGG 760
           GSYF+YDNS  ++S SI ADQLAGQWYARACGL PI +E K + AL  + D+NV++V+GG
Sbjct: 598 GSYFDYDNSGCATSKSIMADQLAGQWYARACGLEPIVEEEKARSALGTVLDYNVMRVQGG 657

Query: 761 MCGAMNGMQPDGRIDMSGLQAREIWPGVTYGLAASMIQEEMVDMAFQTAAGVYEVAWSED 820
             GA+NGM+PDG +D S LQ++E+W GVTYG+AA+M+ E M + AF+TA G ++  W  D
Sbjct: 658 AVGAVNGMRPDGAVDASSLQSKEVWVGVTYGVAAAMLHEGMTEAAFRTAKGAHDAGWGRD 717

Query: 821 GLGYSFQTPESWNNN--DEYRSLCYMRPLTIWAMQWALTKPKLSR--QEIKHEISDRDSS 876
           G GY+FQTPE+W ++    YRSL YMRPL+IWAMQWAL+ P+L R  + +   +S   S 
Sbjct: 718 GFGYAFQTPEAWTSDAGGGYRSLHYMRPLSIWAMQWALSPPELHRDLRVVPGSVSAVASP 777

Query: 877 YLEQHA--AFSKVASLLKLPKEEASKGFLKVV 906
                A   F KVAS+LKLP+E   KG+L+ +
Sbjct: 778 AEVDLAREKFEKVASMLKLPEEVQHKGYLRAI 809


>gi|302807226|ref|XP_002985326.1| hypothetical protein SELMODRAFT_121820 [Selaginella moellendorffii]
 gi|300147154|gb|EFJ13820.1| hypothetical protein SELMODRAFT_121820 [Selaginella moellendorffii]
          Length = 983

 Score =  929 bits (2401), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 486/935 (51%), Positives = 637/935 (68%), Gaps = 62/935 (6%)

Query: 19  DPGQPPQLTWQRKLDSNVKVPTGFTLSFQE-FRHLAHIGFRLYRYSKQEEAKGKVPVFNV 77
           DPG PP   W+RKL+ +    T F ++F E  + +  +G RL+++ + E ++G+ P  + 
Sbjct: 27  DPGAPPAHAWRRKLNCHANFLTEFRVTFTEALKLVLGLGLRLWKFIQAERSQGRTPPIDP 86

Query: 78  FRKHHITSD-QGIPLGGIGAGSIGRSYRGEFQRFKLFHGICDDAPVLANQFSVFVSRP-- 134
           F++    S   G+PLGG+G GSIGR +RGEF+R++L   +  +A VLANQFSVFV+R   
Sbjct: 87  FKRGTKPSACHGVPLGGMGGGSIGRGFRGEFRRWQLLPSVTSEAAVLANQFSVFVTRDKD 146

Query: 135 NGEK--FSSVLCPRSP--GVPKKNTDSGIESWDWNLKGENCTYHALFPRAWTVYDGEPDP 190
           +GEK   +SVL PR     V KK     I +WDWNL G+N TYH LFPRAWT+YDGEPDP
Sbjct: 147 HGEKKKIASVLSPRPSELDVGKKAQSPEICAWDWNLDGQNSTYHGLFPRAWTIYDGEPDP 206

Query: 191 ELRIVCRQISPFIPHNYKESSFPASVFTFTLSNSGQTSADVTLLFTWANSVAGDSGLSGH 250
           +L+I CRQ+SPFIPH+Y+ESS P  VF++ + N+G+ SA V+LLFTWANSV GD+ LSG 
Sbjct: 207 DLKISCRQVSPFIPHDYRESSLPVCVFSYVIVNTGKESASVSLLFTWANSVGGDTHLSGG 266

Query: 251 HFNSKTMTKDGVHGLTLHHRTANGRPPVTFAVAAEETADVHVSECPCFLLSGNSKGITAK 310
           H N   +  DGV G+ LHHRTA G+P V FA+AA + ADV VS CP FL+SG    +TAK
Sbjct: 267 HSNKPFLEDDGVAGVVLHHRTAKGQPSVEFAIAARQNADVAVSVCPSFLVSGKGSTMTAK 326

Query: 311 DMWNEIKKHGSFD-HLDNDKTSPSEPGSSIGAAIAASLTIPSGSTRSVTFSLAWDCPEVK 369
           +MWN++++ G FD     + + PS   SS+GAA++AS+TI     + V F+L+WD PEVK
Sbjct: 327 EMWNQMREKGCFDTDTVCEPSLPSLVNSSLGAAVSASVTIAPHGKKVVDFALSWDIPEVK 386

Query: 370 FFE-KVYHRRYTKFYGTLGDSAARIAHDAILEHAKWECEIEAWQRPILEDKRFPEWYPIT 428
           F + + YHRRYT+FYG+ GD+A R+ HDA+L + KWE EIE WQ P+L+D   PEWY  T
Sbjct: 387 FPKGRSYHRRYTQFYGSYGDAAPRLVHDALLNYPKWEAEIEKWQEPVLKDANLPEWYKFT 446

Query: 429 LFNELYYLNAGGTIW---------------------------------------TDGSPP 449
           LFNELYYL AGGT+W                                       T G+  
Sbjct: 447 LFNELYYLVAGGTVWTAFVTPSFRSNGFTSRTNSESSITGAVEELVEMIPLNGGTSGNGG 506

Query: 450 MQSLATIRERKFSLDTPSKIHTP--------ASSDTALGTRLLENGEENIGQFLYLEGAE 501
           + ++ +  E + +      +           A  D   GT LLE+GEEN+GQFLYLE  E
Sbjct: 507 VDAILSETEEEIAEKLVKAMEAAILDKQINSAPRDFPYGTYLLEDGEENVGQFLYLESVE 566

Query: 502 YVMYNTYDVHFYSSFALVMLFPKLELSIQRDFAAAVMMHDPGTMKIMSDGKWVARKCLGA 561
           YVM+ TYDVHFY+SFAL+ LFPKLELSIQRD+AAA + H+P  +K ++ GKW  +K LGA
Sbjct: 567 YVMWCTYDVHFYASFALISLFPKLELSIQRDYAAATLSHNPELVKFLAGGKWGIKKVLGA 626

Query: 562 VPHDIGLDDPWFEINSYNLFNSSRWKDLNSKFVLQVYRDFVATGDKNFARAVWPSVYIAM 621
           VPHD+G+ DPW E+N+YN+ ++SRWKDLNSKFVLQ+YRD VAT D NFAR  WP+VY AM
Sbjct: 627 VPHDLGIHDPWVEVNAYNIHDTSRWKDLNSKFVLQIYRDVVATEDLNFARVTWPAVYTAM 686

Query: 622 AYMEQFDKDGDGMIENEGFPDQTYDAWSANGVSAYCGGLWVAALQAASALANDVGDHASA 681
           AYM++FD+D DGMIEN+GFPDQTYD W+ +G+SAYCGGLW+AALQAA+A+A  V +  +A
Sbjct: 687 AYMDRFDRDRDGMIENDGFPDQTYDTWTVHGISAYCGGLWLAALQAAAAMAEKVNEPNAA 746

Query: 682 SYFWVRYQKAKAVYD-SLWNGSYFNYDNSDGSSSTSIQADQLAGQWYARACGLLPIADEA 740
           +YF  +++ A+ VYD  LWNGSYFNYD+   S+S SIQADQ+AGQWYA A GL P+ D+ 
Sbjct: 747 NYFLRKFRDARKVYDKKLWNGSYFNYDSGSSSNSNSIQADQMAGQWYAWASGLPPLFDDY 806

Query: 741 KVKKALTKIYDFNVLKVKGGMCGAMNGMQPDGRIDMSGLQAREIWPGVTYGLAASMIQEE 800
           K + AL +IYD+NV+KVKGG  GA+NGM P+GR+D + +Q+REIW GVTY  AA+MI E 
Sbjct: 807 KARSALQRIYDYNVMKVKGGKLGAVNGMHPNGRVDETAMQSREIWSGVTYAAAAAMIHEG 866

Query: 801 MVDMAFQTAAGVYEVAWSEDGLGYSFQTPESWNNNDEYRSLCYMRPLTIWAMQWALTKPK 860
           M++ AF TA G+Y   WS+ G GY FQTPE+W  +  +R+L YMRPL IWAMQWAL  P 
Sbjct: 867 MLEQAFTTAEGIYNAGWSDSGYGYWFQTPEAWTTDGHFRALTYMRPLAIWAMQWALYPPN 926

Query: 861 L----SRQEIKHEISDRDSSYLEQHAAFSKVASLL 891
                 R   K E +   +S    +A F+KV+  L
Sbjct: 927 CVAAGPRIPSKEESAGGSTSSSYLYAGFAKVSEAL 961


>gi|168037390|ref|XP_001771187.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162677567|gb|EDQ64036.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 981

 Score =  925 bits (2390), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 494/934 (52%), Positives = 645/934 (69%), Gaps = 52/934 (5%)

Query: 19  DPGQPPQLTWQRKLDSNVKVPTGFTLSFQEFRHLAHIGFRLYRYSKQEEAKGKVPVFNVF 78
           DPG P  L W+RKL+S+    T F ++F E   +  +G R++ Y K E A+G+ P  + F
Sbjct: 54  DPGAPSALAWRRKLNSHADFLTEFRVTFMEAVRMITLGMRMWFYVKSERAQGRAPPIDPF 113

Query: 79  -RKHHITSDQGIPLGGIGAGSIGRSYRGEFQRFKLFHGICDDAPVLANQFSVFVSRPNG- 136
            R+   ++  G+P GG+G GSIGR +RG+F+R++L  G+C++APVLA+QFSVFV R  G 
Sbjct: 114 NRETKPSACHGVPCGGMGGGSIGRGFRGDFRRWQLIPGVCEEAPVLADQFSVFVKRDKGN 173

Query: 137 ---EKFSSVLCPRSPG-VPKKNTDSGIESWDWNLKGENCTYHALFPRAWTVYDGEPDPEL 192
              +K +SVL P  P  +     D+ + SWDWNL G+N TYHALFPRAWT+Y+GEPDP+L
Sbjct: 174 GVVKKDASVLYPGRPQELSDTKDDTSVSSWDWNLDGQNSTYHALFPRAWTIYEGEPDPDL 233

Query: 193 RIVCRQISPFIPHNYKESSFPASVFTFTLSNSGQTSADVTLLFTWANSVAGDSGLSGHHF 252
           +I CRQ+SPFIPHNY+ESSFP  VF++ + N+G+ +ADVTLLFTWANS+ G+S  +G HF
Sbjct: 234 KISCRQVSPFIPHNYEESSFPCCVFSYVVMNTGKQAADVTLLFTWANSIGGNSATTGGHF 293

Query: 253 NSKTMTKDGVHGLTLHHRTANGRPPVTFAVAAEETADVHVSECPCFLLSGNSKGITAKDM 312
           N    T             A G  PVTFA+AA+E  DV ++ CPCF L G    +TAKDM
Sbjct: 294 NEPFETG------------AKGGRPVTFAIAAKEMRDVTLTTCPCFSLGGKEVEVTAKDM 341

Query: 313 WNEIKKHGSFD--HLDNDKTSPSEPGSSIGAAIAASLTIPSGSTRSVTFSLAWDCPEVKF 370
           W E+K++GSFD  + +   ++P+  GS+IGAAIAA + +P G  R V FSLAWD PEVKF
Sbjct: 342 WTEVKENGSFDKENWNFKPSAPTRQGSAIGAAIAARVEVPPGEKRDVVFSLAWDSPEVKF 401

Query: 371 FE-KVYHRRYTKFYGTLGDSAARIAHDAILEHAKWECEIEAWQRPILEDKRFPEWYPITL 429
            + K YHRRYT FYGT GD+A+++  DA+ ++ KWE  IE WQ+P+L+D++ PEWY +TL
Sbjct: 402 LKGKSYHRRYTSFYGTAGDAASKLTRDALRDYRKWEIAIEDWQQPVLKDEKLPEWYRVTL 461

Query: 430 FNELYYLNAGGTIWTDGSPPMQSLATIRERKFSLDTPSKIHT-----------------P 472
           FNELYYL AGGTIWT       S  T  ER  S +T  ++ T                  
Sbjct: 462 FNELYYLVAGGTIWTAELELAASQET--ERLQSGETADQMQTETLEHLILKQMQTAMLGS 519

Query: 473 ASSDTALGTRLLENGEENIGQFLYLEGAEYVMYNTYDVHFYSSFALVMLFPKLELSIQRD 532
           + +  A G  LL+ GEEN+GQFLYLEG EY+M+NTYDVHFY+SFAL+ LFPKLEL+IQRD
Sbjct: 520 SEAGIATGPSLLQEGEENVGQFLYLEGIEYIMWNTYDVHFYASFALLALFPKLELAIQRD 579

Query: 533 FAAAVMMHDPGTMKIMSDGKWVARKCLGAVPHDIGLDDPWFEINSYNLFNSSRWKDLNSK 592
            AAA +  +P  +K M+DG    RK  GAVPHD+G  DPW E+N+YN+ ++S WKDLN K
Sbjct: 580 VAAATLSQNPEKVKYMADGGVGIRKVFGAVPHDLGQHDPWVEVNAYNIHDTSHWKDLNPK 639

Query: 593 FVLQVYRDFVATGDKNFARAVWPSVYIAMAYMEQFDKDGDGMIENEGFPDQTYDAWSANG 652
           FVLQVYRD VATGD+ FA+AVWP+VY AMAYM+QFD+D DG+IEN+GFPDQTYD WS +G
Sbjct: 640 FVLQVYRDVVATGDRQFAKAVWPAVYAAMAYMDQFDRDRDGLIENDGFPDQTYDTWSVHG 699

Query: 653 VSAYCGGLWVAALQAASALANDVGDHASASYFWVRYQKAKAVYD-SLWNGSYFNYDNSDG 711
           VSAYCGGLW+AALQAA+A+A+ V D  +A+YF  ++ +A+ VY+  LWNG YFNYD+   
Sbjct: 700 VSAYCGGLWIAALQAAAAMADLVDDKDAATYFKGKFNQARDVYERKLWNGEYFNYDSGTS 759

Query: 712 SSSTSIQADQLAGQWYARACGLLPIADEAKVKKALTKIYDFNVLKVKGGMCGAMNGMQPD 771
           S+S S+QADQLAGQWYA A GL P+ D+ K + AL K++DFNV+KVKGG  GA NGM P+
Sbjct: 760 SNSNSVQADQLAGQWYAWASGLPPLFDDYKARSALQKVFDFNVMKVKGGKWGAANGMHPN 819

Query: 772 GRIDMSGLQAREIWPGVTYGLAASMIQEEMVDMAFQTAAGVYEVAWSEDGLGYSFQTPES 831
           G++D + +Q+REIW GVTY  +A+MI E MV+ AF  A GV+   WS+  LGY FQTPE+
Sbjct: 820 GKVDETCMQSREIWTGVTYAASAAMIHEGMVEQAFTAAQGVFLAGWSD--LGYWFQTPEA 877

Query: 832 WNNNDEYRSLCYMRPLTIWAMQWALTKPKLSRQEIKHEISDRDSSYLEQHAAFSKVA--- 888
           W  +  +RSL YMRPL IWAMQWAL  PK   +  +    DR S+ L  H  FS +A   
Sbjct: 878 WTIDGYFRSLAYMRPLAIWAMQWALYPPKSIAEAPRVPSMDRGSA-LVTHDRFSSLANAL 936

Query: 889 ---SLLKLPKEEASKGFLKVVYDFTIGRVLCHNL 919
              SL +L K     GF  V++   +G V+   +
Sbjct: 937 RSTSLPQLRKTHDPAGF--VLHIILLGGVVLSTI 968


>gi|413941636|gb|AFW74285.1| hypothetical protein ZEAMMB73_466373 [Zea mays]
          Length = 966

 Score =  923 bits (2385), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 506/983 (51%), Positives = 643/983 (65%), Gaps = 96/983 (9%)

Query: 7   EEGENAASMPKVD-PGQPP-QLTWQRKLDSNVKVPTGFTLSFQEFRHLAHIGFRLYRYSK 64
           +E E A  +P  D P QPP   TWQR+ D   K    F+++  +   L            
Sbjct: 4   QESEQAKVLPGTDCPQQPPASRTWQRRFDDEGKNVAMFSMTMSDLVEL------------ 51

Query: 65  QEEAKGKVPVFNVFRKHHITSDQGIPLGGIGAGSIGRSYRGEFQRFKLFHGICDDAPVLA 124
                  V V+++ RK      +G+P+GGIG+GSIGRSYRG FQ+F++F    ++ PVLA
Sbjct: 52  -------VAVYDLLRKWMDNCYRGVPVGGIGSGSIGRSYRGYFQQFQIFPVTNEEKPVLA 104

Query: 125 NQFSVFVSRPNGEKFSSVLCPRSPGVPKKNTDSGIESWDWNLKGENCTYHALFPRAWTVY 184
           NQFS FVSRPNG K+S+VL   +  + K    +GI SWDW LK + CTYHALFPR+WTVY
Sbjct: 105 NQFSAFVSRPNGNKYSTVLSAPTADLLKGVGKAGIGSWDWRLKEDRCTYHALFPRSWTVY 164

Query: 185 DGEPDPELRIVCRQISPFIPHNYKESSFPASVFTFT-------LSNSGQTSADVTLLFTW 237
           DGEPDPE++I CRQISPFIPHNY+ESSFPA+VFTFT       L  +G+   ++T     
Sbjct: 165 DGEPDPEIKITCRQISPFIPHNYRESSFPAAVFTFTVQSVLSVLFKAGEQLWELTCRCDA 224

Query: 238 A-------------NSVAGDSGLSGHHFNSKTMTKDGVHGLTLHH-------RTANGRPP 277
           A             +S+ G SGL           +  V+   L         RTA G+PP
Sbjct: 225 ALHLGRSTTASVAFSSITGGSGLIFRGARCYRPCRWLVYTPVLETDADRRQCRTAGGQPP 284

Query: 278 VTFAVAAEETADVHVSECPCFLLSG----NSKGITAKDMWNEIKKHGSFDHLDNDKTSPS 333
           VTFA+A+ ET  V VS CP F ++     +S   TA  MW+EIKKHG+F H      + +
Sbjct: 285 VTFAIASRETDGVRVSVCPSFTIASGGPSSSGEPTAAAMWDEIKKHGAFGHAGAGNGNAA 344

Query: 334 EPGSS-----IGAAIAASLTIPSGSTRSVTFSLAWDCPEVKF-FEKVYHRRYTKFYGTLG 387
              +S     +GAA+AAS  +P+G+TR V+FSLAW CPEVKF     Y+RRYT+FYGT  
Sbjct: 345 RAAASKAGSSVGAAVAASTAVPAGATRVVSFSLAWACPEVKFPAGSTYYRRYTRFYGTDA 404

Query: 388 DSAAR-IAHDAILEHAKWECEIEAWQRPILEDKRFPEWYPITLFNELYYLNAGGTIWT-- 444
           D+AA  +AHDA+LEH  WE +IE WQ+PIL D+R PEWYP+ LFNELYYLNAGGTIWT  
Sbjct: 405 DAAAEHLAHDALLEHMNWESQIEEWQKPILHDQRLPEWYPVALFNELYYLNAGGTIWTVE 464

Query: 445 ----DGSPPMQS---LATIRERKFSLDTPSKIHTPAS-----------------SDTALG 480
               DG  P ++    +      FS+DT   +    S                 S  A G
Sbjct: 465 EMLTDGQAPKRAGFASSLTATEPFSIDTLLPVAAGGSAVDGVVRAVASATERSHSAAAFG 524

Query: 481 TRLLENGEENIGQFLYLEGAEYVMYNTYDVHFYSSFALVMLFPKLELSIQRDFAAAVMMH 540
             LL  GE+N+GQFLYLEG EY MYNTYDVHFY+SFAL+ LFP+LELS+QRDFA AV++H
Sbjct: 525 AALLRGGEDNVGQFLYLEGMEYNMYNTYDVHFYASFALLSLFPRLELSLQRDFARAVLLH 584

Query: 541 DPGTMKIMSDGKWVARKCLGAVPHDIGLDDPWFEINSYNLFNSSRWKDLNSKFVLQVYRD 600
           DP   + + DG+ VARK LGAVPHD+GL+DPW E+N+Y L +++RWKDLN KFVLQVYRD
Sbjct: 585 DPRPRRTL-DGRTVARKVLGAVPHDVGLNDPWVELNAYMLHDAARWKDLNPKFVLQVYRD 643

Query: 601 FVATGDKNFARAVWPSVYIAMAYMEQFDKDGDGMIENEGFPDQTYDAWSANGVSAYCGGL 660
            VATGD  FA AVWP+VY+AMAYM+QFD+DGDGM+ENEG PDQTYD WS +GVSAY GG+
Sbjct: 644 AVATGDAAFAEAVWPAVYVAMAYMDQFDRDGDGMVENEGVPDQTYDMWSVSGVSAYTGGV 703

Query: 661 WVAALQAASALANDVGDHASASYFWVRYQKAKAVYDS-LWNGSYFNYDNS-DGSSSTSIQ 718
           WVAALQAA+ +A  VGD  + +YF  R  +A  VYD+ LW G+YF YDNS  G +S SI 
Sbjct: 704 WVAALQAAAGMARVVGDRPAEAYFRARRDRAARVYDAELWTGAYFRYDNSGGGGASESIM 763

Query: 719 ADQLAGQWYARACGLLPIADEAKVKKALTKIYDFNVLKVKGGMCGAMNGMQPDGRIDMSG 778
           ADQLAGQWYARACGL P+ ++AK + AL  + D+NV++VKGG  GA+NGM+PDG +D S 
Sbjct: 764 ADQLAGQWYARACGLEPVVEDAKARSALATVLDYNVMRVKGGAIGAVNGMRPDGAVDASS 823

Query: 779 LQAREIWPGVTYGLAASMIQEEMVDMAFQTAAGVYEVAWSEDGLGYSFQTPESWNNNDEY 838
            Q++E+WPGVTY +AA+M+ E M + AF+TA G ++ AW  DG GY+FQTPE+W  +  Y
Sbjct: 824 TQSKEVWPGVTYAVAAAMVHEGMTEAAFRTAKGAHDAAWGNDGFGYAFQTPEAWTEDGGY 883

Query: 839 RSLCYMRPLTIWAMQWALTKPKLSRQ------EIKHEISDRDSSYLEQHAAFSKVASLLK 892
           RSL YMRPL IWAMQWAL+ P+L R           E S  D++     A F KVAS+L+
Sbjct: 884 RSLHYMRPLGIWAMQWALSPPELHRDLRTPEARSSEEPSPADAAL--GQAQFDKVASMLR 941

Query: 893 LPKEEASKGFLKVVYDFTIGRVL 915
           LP+E+  KG+L  +Y+     VL
Sbjct: 942 LPEEQQPKGYLWAIYNLVRQIVL 964


>gi|357146604|ref|XP_003574050.1| PREDICTED: non-lysosomal glucosylceramidase-like [Brachypodium
           distachyon]
          Length = 962

 Score =  922 bits (2382), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 473/923 (51%), Positives = 629/923 (68%), Gaps = 44/923 (4%)

Query: 9   GENAASMPKVDPGQPPQLTWQRKLDSNVKVPTGFTLSFQEFRHLAHIGFRLYRYSKQEEA 68
           G +A  +  +D   PP+  W+R+L+S+  +   F+++F E   +  +G RL+ Y ++E +
Sbjct: 21  GRSALQLLDLDGAAPPEQAWRRRLNSHANILKEFSVTFMEAMKMMSLGVRLWSYVREEAS 80

Query: 69  KGKVPVFNVFRKHHI--TSDQGIPLGGIGAGSIGRSYRGEFQRFKLFHGICDDAPVLANQ 126
            G+    + F +     ++ QG+PLGG+G+GSI R +RGEF+ + +  G+C+++PV+ NQ
Sbjct: 81  HGRKAPIDPFTRESCKPSASQGVPLGGMGSGSISRGFRGEFKNWHIIPGLCENSPVMENQ 140

Query: 127 FSVFVSRPNG-EKFSSVLCPRSPGVPKKNTDSGIESWDWNLKGENCTYHALFPRAWTVYD 185
           FS+FVSR  G +K SSVL P      KK +DSGI SWDWNL G++ TYHALFPRAWTVYD
Sbjct: 141 FSIFVSRDGGNKKCSSVLAPGHHDGLKKYSDSGISSWDWNLSGQHSTYHALFPRAWTVYD 200

Query: 186 GEPDPELRIVCRQISPFIPHNYKESSFPASVFTFTLSNSGQTSADVTLLFTWANSVAGDS 245
           GEPDP+L+I CRQISPFIPH+YKESS P SVF +TL N+G+  A V+LL TWANS+ G S
Sbjct: 201 GEPDPDLKISCRQISPFIPHDYKESSLPTSVFVYTLVNTGRDRAKVSLLMTWANSIGGFS 260

Query: 246 GLSGHHFNSKTMTKDGVHGLTLHHRTANGRPPVTFAVAAEETADVHVSECPCFLLSGNSK 305
             SG HFN   + +DGV G+ LHH+TA   PPVTFA+AA ET +V+V+  P F LSG + 
Sbjct: 261 HHSGGHFNEPFIGEDGVSGVLLHHKTAKDNPPVTFAIAACETQNVNVTVLPVFGLSGENH 320

Query: 306 GITAKDMWNEIKKHGSF--DHLDNDKTSPSEPGSSIGAAIAASLTIPSGSTRSVTFSLAW 363
            ++AKDMW+ +KK G F  ++ +   + PS PG ++ AA+ AS  +      +V F+L+W
Sbjct: 321 -VSAKDMWDIMKKDGHFNLENFNAGCSMPSSPGETLCAAVTASTWVEPHGRCTVAFALSW 379

Query: 364 DCPEVKFFEK-VYHRRYTKFYGTLGDSAARIAHDAILEHAKWECEIEAWQRPILEDKRFP 422
             P+VKF +   Y+RRYT+FYGT   S+  + HDA+ ++  WE EIE WQ PIL D+R P
Sbjct: 380 SSPKVKFQKGCTYNRRYTEFYGTSERSSINLVHDALTKYRLWEEEIEKWQNPILRDERLP 439

Query: 423 EWYPITLFNELYYLNAGGTIWTDGSPPM-------------QSLATIRERKF-----SLD 464
           EWY  TLFNELY+L AGGT+WTDG PP               S   I++ K      +L 
Sbjct: 440 EWYKFTLFNELYFLVAGGTVWTDGQPPAIDEKTNPASNQQKHSKKPIKDTKSESVKDNLP 499

Query: 465 TPSKIHT----------PASSDTALGTRLLE--------NGEENIGQFLYLEGAEYVMYN 506
            P+              P   +   G R+ E        +G EN+G+FLYLEG EY+M+N
Sbjct: 500 RPTAEQVFNGDDLTNGGPQMPEQTNGLRVQEPVPCIHSKDGPENVGKFLYLEGVEYIMWN 559

Query: 507 TYDVHFYSSFALVMLFPKLELSIQRDFAAAVMMHDPGTMKIMSDGKWVARKCLGAVPHDI 566
           TYDVHFY+SFAL+ LFPK+ELSIQRDFA AV+  D   +K ++DG    RK  GAVPHD+
Sbjct: 560 TYDVHFYASFALLDLFPKIELSIQRDFADAVLYEDRRRVKFLADGTSGIRKVKGAVPHDL 619

Query: 567 GLDDPWFEINSYNLFNSSRWKDLNSKFVLQVYRDFVATGDKNFARAVWPSVYIAMAYMEQ 626
           G  DPW E+N+YN+ ++S+WKDLN KFVLQVYRDF ATGD  F R VWP+V  AM YM+Q
Sbjct: 620 GTHDPWHEMNAYNIHDTSKWKDLNPKFVLQVYRDFAATGDMTFGRDVWPAVCAAMDYMDQ 679

Query: 627 FDKDGDGMIENEGFPDQTYDAWSANGVSAYCGGLWVAALQAASALANDVGDHASASYFWV 686
           FD+DGDG+IEN+GFPDQTYDAW+ +G+SAYCG LW+AALQAA+ +A+ +GD   A  + +
Sbjct: 680 FDRDGDGLIENDGFPDQTYDAWTVHGISAYCGCLWLAALQAAATMAHRLGDRPYAEKYKL 739

Query: 687 RYQKAKAVYDS-LWNGSYFNYDNSDGSSSTSIQADQLAGQWYARACGLLPIADEAKVKKA 745
           ++ KAKAVY++ LWNGSYFNYD+   S+S SIQADQLAGQWYA + GL PI DE K++ A
Sbjct: 740 KFIKAKAVYEAKLWNGSYFNYDSGTSSNSRSIQADQLAGQWYAASSGLPPIFDEHKIRSA 799

Query: 746 LTKIYDFNVLKVKGGMCGAMNGMQPDGRIDMSGLQAREIWPGVTYGLAASMIQEEMVDMA 805
           L KI++FNV+KVKGG  GA+NGM P G++D + +Q+REIW GVTYG+AA+M+   M    
Sbjct: 800 LQKIFEFNVMKVKGGRMGAVNGMTPKGKVDETCMQSREIWTGVTYGVAANMLLHGMEHQG 859

Query: 806 FQTAAGVYEVAWSEDGLGYSFQTPESWNNNDEYRSLCYMRPLTIWAMQWALTKPKLSRQE 865
           F TA G++   WSE+G GY FQTPE W  +  YRSL YMRPL IWAMQWAL+ PK   + 
Sbjct: 860 FITAEGIFLAGWSEEGYGYWFQTPEGWTTDGHYRSLIYMRPLAIWAMQWALSPPKAILEA 919

Query: 866 IKHEISDRDSSYLEQHAAFSKVA 888
            K  + DR     +   A S+++
Sbjct: 920 PKVNLMDRIHVSPQAARAISEIS 942


>gi|168014916|ref|XP_001759997.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162688747|gb|EDQ75122.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 970

 Score =  919 bits (2375), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 486/933 (52%), Positives = 634/933 (67%), Gaps = 55/933 (5%)

Query: 19  DPGQPPQLTWQRKLDSNVKVPTGFTLSFQEFRHLAHIGFRLYRYSKQEEAKGKVPVFNVF 78
           DPG PP   W+RKL+S+ +V   F ++F E   +  +G R++ Y + E+++G++P  + F
Sbjct: 46  DPGAPPHQAWRRKLNSHAEVLKEFRVTFMEGFKMLRLGMRMWTYVQSEKSEGRLPPIDPF 105

Query: 79  RKHHITSD-QGIPLGGIGAGSIGRSYRGEFQRFKLFHGICDDAPVLANQFSVFVSRPNGE 137
           R+    S   G+P GG+G GSIGR +RGEF+R+++  G+CD+APVLANQFSV VS   GE
Sbjct: 106 RRETKPSACHGVPCGGMGGGSIGRGFRGEFRRWQIVPGVCDEAPVLANQFSVHVSLSGGE 165

Query: 138 K-FSSVLCPRSPGVPK--KNTDSGIESWDWNLKGENCTYHALFPRAWTVYDGEPDPELRI 194
           K  SSVL P  P   K  KN ++GI SWDW L G+N TYHALFPRAWT+Y+GEPDP L+I
Sbjct: 166 KAVSSVLYPGRPRELKGLKNDENGISSWDWKLDGQNTTYHALFPRAWTIYEGEPDPALKI 225

Query: 195 VCRQISPFIPHNYKESSFPASVFTFTLSNSGQTSADVTLLFTWANSVAGDSGLSGHHFNS 254
            CRQISPFIPHNYKESS+PA VF++   N+G+  ADV LLFTWANS+   +G +G H N 
Sbjct: 226 SCRQISPFIPHNYKESSYPACVFSYVAVNTGKEVADVNLLFTWANSIGSTTGSTGGHINE 285

Query: 255 KTMTKDGVHGLTLHHRTANGRPPVTFAVAAEETADVHVSECPCFLLSGNSKGITAKDMWN 314
              T            +  G  P+TFA+AA+++ DV +S CPCF +SG  + ITAKD+W 
Sbjct: 286 TFTT------------SVKGGQPLTFAIAAKQSPDVSLSACPCFQVSGEGQSITAKDLWG 333

Query: 315 EIKKHGSF--DHLDNDKTS-PSEPGSSIGAAIAASLTIPSGSTRSVTFSLAWDCPEVKFF 371
            +K+HG    D  +   TS PS+PGS IGAAIAA + IP    R++ F LAWD P VKF 
Sbjct: 334 RMKEHGRLEVDDWNTINTSLPSKPGSVIGAAIAAHVVIPPNEKRTIDFVLAWDSPTVKFL 393

Query: 372 E-KVYHRRYTKFYGTLGDSAARIAHDAILEHAKWECEIEAWQRPILEDKRFPEWYPITLF 430
           + + Y RRYT FY   G++A RIAH+A+  +  WE EIE WQRPILED++ PEWY +TLF
Sbjct: 394 KGRSYLRRYTTFYDADGNAAPRIAHNALCGYRLWEEEIEKWQRPILEDEKLPEWYRVTLF 453

Query: 431 NELYYLNAGGTIWT--------DGSP------------PMQS-----LATIRERKFSLDT 465
           NELYYL AGGT+WT         GSP            P  +     LA +R      D 
Sbjct: 454 NELYYLVAGGTVWTASSANTSAPGSPLHGDLSLGGKVVPFSTRRDIDLAALRRDCKDGDV 513

Query: 466 PSKIH----TPASSDTALGTRLLENGEENIGQFLYLEGAEYVMYNTYDVHFYSSFALVML 521
            + +     +  +S    G  LL+  EEN+GQFLYLEG EY+M+NTYDVHFY+SFAL+ L
Sbjct: 514 VAALEGSNKSSFNSRLPSGLALLQAEEENVGQFLYLEGIEYIMWNTYDVHFYASFALLSL 573

Query: 522 FPKLELSIQRDFAAAVMMHDPGTMKIMSDGKWVARKCLGAVPHDIGLDDPWFEINSYNLF 581
           FPKLEL+IQRDFAA  + HDP   K M+ GK   +K  G+VPHD+G  DPW E+N+YN+ 
Sbjct: 574 FPKLELAIQRDFAAGCLAHDPRRTKFMAGGKKGIKKVFGSVPHDLGQHDPWAEVNAYNIH 633

Query: 582 NSSRWKDLNSKFVLQVYRDFVATGDKNFARAVWPSVYIAMAYMEQFDKDGDGMIENEGFP 641
           ++SRWKDLN+KFVLQVYRD VATGDK FA+AVWP+VY AMAY++QFD+D DG+IEN+GFP
Sbjct: 634 DTSRWKDLNAKFVLQVYRDVVATGDKVFAKAVWPAVYAAMAYLDQFDRDRDGLIENDGFP 693

Query: 642 DQTYDAWSANGVSAYCGGLWVAALQAASALANDVGDHASASYFWVRYQKAKAVY-DSLWN 700
           DQTYD W+ +GVSAYCGGLW+AALQAA A+A  + +  +A YF  ++ +A+  Y   LWN
Sbjct: 694 DQTYDTWTVHGVSAYCGGLWIAALQAAEAMAVLLEEKNAARYFQSKFIQAREAYLKKLWN 753

Query: 701 GSYFNYDNSDGSSSTSIQADQLAGQWYARACGLLPIADEAKVKKALTKIYDFNVLKVKGG 760
           GSYF+YD+ + S+S SIQADQ+AGQWYA A GL P+ D+AK + AL KI+DFNV+KVK G
Sbjct: 754 GSYFSYDSGNSSNSKSIQADQMAGQWYAWASGLPPLFDDAKARSALQKIFDFNVMKVKNG 813

Query: 761 MCGAMNGMQPDGRIDMSGLQAREIWPGVTYGLAASMIQEEMVDMAFQTAAGVYEVAWSED 820
             GA+NGM P+G++D + +Q+RE+W GVTY LAA+MI E M+D AF+ A GV+   WS+ 
Sbjct: 814 HLGAVNGMHPNGKVDETCMQSREVWTGVTYALAAAMIHEGMLDQAFRAAEGVFIAGWSD- 872

Query: 821 GLGYSFQTPESWNNNDEYRSLCYMRPLTIWAMQWALTKPKLSRQEIKHEISDRDSSYLEQ 880
            LGY FQTPE+W  +  +RSL YMRPL IWAMQWA+  P+      +    DR  S  + 
Sbjct: 873 -LGYWFQTPEAWTMDGYFRSLAYMRPLAIWAMQWAIDPPETIANAPRIPTVDRLPSECDT 931

Query: 881 HAAFSKVASLLKLPKEEASKGFLKVVYDFTIGR 913
            + FS + S L+ P  +      +  Y+F  G+
Sbjct: 932 -SGFSSMQSTLRNPLPQLRP--FRRAYNFLRGK 961


>gi|218184729|gb|EEC67156.1| hypothetical protein OsI_34005 [Oryza sativa Indica Group]
          Length = 974

 Score =  917 bits (2369), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 470/920 (51%), Positives = 622/920 (67%), Gaps = 56/920 (6%)

Query: 9   GENAASMPKVDPGQPPQLTWQRKLDSNVKVPTGFTLSFQEFRHLAHIGFRLYRYSKQEEA 68
           G  A  +   D G PP+  W+R+L+S+  +   F+++F E   +  +G RL+ Y ++E +
Sbjct: 21  GRTALQLLDFDGGSPPEQAWRRRLNSHANLLKEFSVTFMEAMRMMSLGLRLWSYVREEAS 80

Query: 69  KGKVPVFNVFRKHHI--TSDQGIPLGGIGAGSIGRSYRGEFQRFKLFHGICDDAPVLANQ 126
            G+    + F K     ++ QG+PLGG+G+GSI R +RGEF+ + +  G+C+ +PV+ NQ
Sbjct: 81  HGRKAPIDPFTKEKCKPSASQGVPLGGMGSGSISRGFRGEFKNWHIIPGLCETSPVMENQ 140

Query: 127 FSVFVSRPNG-EKFSSVLCPRSPGVPKKNTDSGIESWDWNLKGENCTYHALFPRAWTVYD 185
           FS+FVSR  G +K+SSVL P      KK  DSGI SWDWNL G++ TYHALFPRAWTVYD
Sbjct: 141 FSIFVSRDGGNKKYSSVLSPGHHEGLKKCNDSGISSWDWNLSGQHSTYHALFPRAWTVYD 200

Query: 186 GEPDPELRIVCRQISPFIPHNYKESSFPASVFTFTLSNSGQTSADVTLLFTWANSVAGDS 245
           GEPDP+L+I CRQISPFIPH+YK+SS P SVF +TL N+G+  A V+LL TWANS+ G S
Sbjct: 201 GEPDPDLKISCRQISPFIPHDYKDSSLPTSVFVYTLVNTGKDRAKVSLLMTWANSIGGFS 260

Query: 246 GLSGHHFNSKTMTKDGVHGLTLHHRTANGRPPVTFAVAAEETADVHVSECPCFLLSGNSK 305
             SG HFN   + +DGV G+ LHH+TA   PPVTFA+AA ET +V+V+  P F LSG   
Sbjct: 261 HHSGGHFNEPFIAEDGVSGVLLHHKTAKDNPPVTFAIAACETQNVNVTVLPVFGLSGEGH 320

Query: 306 GITAKDMWNEIKKHGSFD--HLDNDKTSPSEPGSSIGAAIAASLTIPSGSTRSVTFSLAW 363
             +AK MW+ +K++G FD  + +   + PS  G ++ AA++AS  +      +V F LAW
Sbjct: 321 D-SAKQMWDRMKQNGHFDRENFEAGTSMPSSSGETLCAAVSASTWVEPHGRCTVVFGLAW 379

Query: 364 DCPEVKFFEK-VYHRRYTKFYGTLGDSAARIAHDAILEHAKWECEIEAWQRPILEDKRFP 422
             P++KF +   Y+RRYT+FYGT   SA  + HDA+ ++  WE EIE WQ PIL+++R P
Sbjct: 380 SSPKIKFQKGCTYNRRYTEFYGTSERSAVNLVHDALTKYRIWEEEIEKWQNPILKNERLP 439

Query: 423 EWYPITLFNELYYLNAGGTIWTDGSPPM-------------QSLATIRERKF-------- 461
           EWY  TLFNELY+L AGGT+WTDG PP+              S    R+ K         
Sbjct: 440 EWYKFTLFNELYFLVAGGTVWTDGQPPVIDEKPSPGSNQQKSSKRGTRDTKQESVKDNHV 499

Query: 462 -------------------SLDTPSKIHTP--ASSDTALGTR------LLENGEENIGQF 494
                              S+   + +H P  A +   LG++      L +NG EN+G+F
Sbjct: 500 KLTAEQVTNGGDLANGEEQSVSKYAAVHGPQMAKATNGLGSQEPIPYLLSKNGPENVGKF 559

Query: 495 LYLEGAEYVMYNTYDVHFYSSFALVMLFPKLELSIQRDFAAAVMMHDPGTMKIMSDGKWV 554
           LYLEG EY+M+ TYDVHFY+SFAL+ LFPK+ELSIQRDFA AV+  D   MK ++DG   
Sbjct: 560 LYLEGVEYIMWCTYDVHFYASFALLDLFPKIELSIQRDFANAVLYEDRRRMKFLADGTSG 619

Query: 555 ARKCLGAVPHDIGLDDPWFEINSYNLFNSSRWKDLNSKFVLQVYRDFVATGDKNFARAVW 614
            RK  GAVPHD+G  DPW E+N+YN+ ++S+WKDLN KFVLQVYRDF ATGD +F R VW
Sbjct: 620 IRKVKGAVPHDLGTHDPWHEMNAYNIHDTSKWKDLNPKFVLQVYRDFAATGDMSFGRDVW 679

Query: 615 PSVYIAMAYMEQFDKDGDGMIENEGFPDQTYDAWSANGVSAYCGGLWVAALQAASALAND 674
           P+V  AM YM QFD+DGDG+IEN+GFPDQTYDAW+ +G+SAYCGGLW+AALQAA+ +A+ 
Sbjct: 680 PAVCAAMDYMNQFDRDGDGLIENDGFPDQTYDAWTVHGISAYCGGLWLAALQAAATMAHR 739

Query: 675 VGDHASASYFWVRYQKAKAVYDS-LWNGSYFNYDNSDGSSSTSIQADQLAGQWYARACGL 733
           +GD   A  + +++ +AKAVY++ LWNGSYFNYD+   S+S SIQADQLAGQWYA + GL
Sbjct: 740 LGDRPFAEKYKLKFIQAKAVYEAKLWNGSYFNYDSGTSSNSRSIQADQLAGQWYAASSGL 799

Query: 734 LPIADEAKVKKALTKIYDFNVLKVKGGMCGAMNGMQPDGRIDMSGLQAREIWPGVTYGLA 793
            P+ DE K++ AL KI++FNV+KVKGG  GA+NGM P+G++D + +Q+REIW GVTYG+A
Sbjct: 800 PPLFDENKIRSALQKIFEFNVMKVKGGRLGAVNGMTPNGKVDETCMQSREIWTGVTYGVA 859

Query: 794 ASMIQEEMVDMAFQTAAGVYEVAWSEDGLGYSFQTPESWNNNDEYRSLCYMRPLTIWAMQ 853
           A+M+   M    F TA G++   WSE+G GY FQTPE W  +  YRSL YMRPL IWAMQ
Sbjct: 860 ANMLLHGMEHQGFTTAEGIFIAGWSEEGYGYWFQTPEGWTIDGHYRSLIYMRPLAIWAMQ 919

Query: 854 WALTKPKLSRQEIKHEISDR 873
           WA + PK      K  + DR
Sbjct: 920 WARSPPKAILDAPKVNLMDR 939


>gi|326516802|dbj|BAJ96393.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 970

 Score =  916 bits (2367), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 468/916 (51%), Positives = 624/916 (68%), Gaps = 52/916 (5%)

Query: 9   GENAASMPKVDPGQPPQLTWQRKLDSNVKVPTGFTLSFQEFRHLAHIGFRLYRYSKQEEA 68
           G  A  +  +D G PP+  W+R+L+S+  +   F+++F E   +  +G RL+ Y ++E +
Sbjct: 21  GRTALQLLDLDGGSPPEQAWRRRLNSHANILKEFSVTFMEAMKMMTLGVRLWSYVREEAS 80

Query: 69  KGKVPVFNVFRKHHI--TSDQGIPLGGIGAGSIGRSYRGEFQRFKLFHGICDDAPVLANQ 126
            G+    + F +     ++ QG+PLGG+G+GSI R +RGEF+ + +  G+C+++PV+ NQ
Sbjct: 81  HGRKAPIDPFTRERCKPSASQGVPLGGMGSGSISRGFRGEFKNWHIIPGLCENSPVMENQ 140

Query: 127 FSVFVSRPNG-EKFSSVLCPRSPGVPKKNTDSGIESWDWNLKGENCTYHALFPRAWTVYD 185
           FS+FVSR +G +K+SSVL P      KK  DSGI SWDWNL G++ TYHALFPRAWT+YD
Sbjct: 141 FSIFVSRDSGNKKYSSVLAPGHHEGLKKCNDSGISSWDWNLSGQHSTYHALFPRAWTIYD 200

Query: 186 GEPDPELRIVCRQISPFIPHNYKESSFPASVFTFTLSNSGQTSADVTLLFTWANSVAGDS 245
           GEPDP+L+I CRQISPFIPH+YK+SS P SVF +TL N+G+  A V+LL TWANS+ G S
Sbjct: 201 GEPDPDLKISCRQISPFIPHDYKDSSLPTSVFVYTLVNTGRDRAKVSLLMTWANSIGGFS 260

Query: 246 GLSGHHFNSKTMTKDGVHGLTLHHRTANGRPPVTFAVAAEETADVHVSECPCFLLSGNSK 305
             SG HFN   +  DGV G+ LHH+TA   PPVTF++AA ET +V V+  P F LSG + 
Sbjct: 261 HHSGGHFNEPFIGDDGVSGVLLHHKTAKDNPPVTFSIAACETQNVSVTVLPVFGLSGENH 320

Query: 306 GITAKDMWNEIKKHGSF--DHLDNDKTSPSEPGSSIGAAIAASLTIPSGSTRSVTFSLAW 363
            ++AK+MW+ + K G F  ++ +   + PS  G ++ AA++AS  +      +V F+LAW
Sbjct: 321 -VSAKEMWDTMSKDGHFSRENFNAGCSMPSSSGETLCAAVSASTWVEPHGRCTVAFALAW 379

Query: 364 DCPEVKFFEK-VYHRRYTKFYGTLGDSAA-RIAHDAILEHAKWECEIEAWQRPILEDKRF 421
             P+VKF +   Y+RRYT+FYGT   S++  + HDA+ ++  WE EIE WQ PIL+D++ 
Sbjct: 380 SSPKVKFQKGCTYNRRYTEFYGTSERSSSINLVHDALTKYRLWEEEIEKWQDPILKDEKL 439

Query: 422 PEWYPITLFNELYYLNAGGTIWTDGSPPMQSLAT-IRERKFS------------------ 462
           PEWY  TLFNELY+L AGGT+WTDG PP  S A+   + K+S                  
Sbjct: 440 PEWYKFTLFNELYFLVAGGTVWTDGQPPAISEASPAYQHKYSKKGAKSESVKDNHVKPAA 499

Query: 463 ----------------LDTPSKIHTPASSDTALGTRLLE--------NGEENIGQFLYLE 498
                           + T   +H     +   G  L E        +G EN+G+FLYLE
Sbjct: 500 EQVSDGDDLPNGEERSVSTYVAVHGSQMPEQTSGLGLQEPIPYLLSKDGPENVGKFLYLE 559

Query: 499 GAEYVMYNTYDVHFYSSFALVMLFPKLELSIQRDFAAAVMMHDPGTMKIMSDGKWVARKC 558
           G EY+M+NTYDVHFY+SFAL+ LFPK+ELSIQRDFA AV+  D   +K ++DG    RK 
Sbjct: 560 GVEYIMWNTYDVHFYASFALLDLFPKIELSIQRDFADAVLYEDRRRVKFLADGTSGIRKV 619

Query: 559 LGAVPHDIGLDDPWFEINSYNLFNSSRWKDLNSKFVLQVYRDFVATGDKNFARAVWPSVY 618
            GAVPHD+G  DPW E+N+YN+ ++S+WKDLN KFVLQVYRDF ATGD  F R VWP+V 
Sbjct: 620 KGAVPHDLGTHDPWHEMNAYNIHDTSKWKDLNPKFVLQVYRDFAATGDMTFGRDVWPAVS 679

Query: 619 IAMAYMEQFDKDGDGMIENEGFPDQTYDAWSANGVSAYCGGLWVAALQAASALANDVGDH 678
            AM YM+QFD+DGDG+IEN+GFPDQTYDAW+ +G+SAYCGGLW+AALQAA+ +A+ +GD 
Sbjct: 680 AAMDYMDQFDRDGDGLIENDGFPDQTYDAWTVHGISAYCGGLWLAALQAAATMAHRLGDR 739

Query: 679 ASASYFWVRYQKAKAVYDS-LWNGSYFNYDNSDGSSSTSIQADQLAGQWYARACGLLPIA 737
             A  + +++ KAKAVY++ LWNGSYFNYD+   S+S SIQADQLAGQWYA + GL PI 
Sbjct: 740 PYAEKYKLKFMKAKAVYEAKLWNGSYFNYDSGTSSNSRSIQADQLAGQWYAASSGLPPIF 799

Query: 738 DEAKVKKALTKIYDFNVLKVKGGMCGAMNGMQPDGRIDMSGLQAREIWPGVTYGLAASMI 797
           DE K++ AL KI++FNV+KVKGG  GA+NGM P G++D + +Q+REIW GVTYG+AA+M+
Sbjct: 800 DEHKIRSALQKIFEFNVMKVKGGRMGAVNGMTPKGKVDETCMQSREIWTGVTYGVAANML 859

Query: 798 QEEMVDMAFQTAAGVYEVAWSEDGLGYSFQTPESWNNNDEYRSLCYMRPLTIWAMQWALT 857
              M    F TA G++   WSEDG GY FQTPE W  +  YRSL YMRPL IWAMQWAL+
Sbjct: 860 LHGMEHQGFITAEGIFLAGWSEDGYGYWFQTPEGWTTDGHYRSLVYMRPLAIWAMQWALS 919

Query: 858 KPKLSRQEIKHEISDR 873
            PK   +  K  + DR
Sbjct: 920 PPKAILEAPKVNLMDR 935


>gi|115482480|ref|NP_001064833.1| Os10g0473400 [Oryza sativa Japonica Group]
 gi|110289241|gb|AAP54244.2| expressed protein [Oryza sativa Japonica Group]
 gi|113639442|dbj|BAF26747.1| Os10g0473400 [Oryza sativa Japonica Group]
 gi|215695411|dbj|BAG90602.1| unnamed protein product [Oryza sativa Japonica Group]
          Length = 974

 Score =  914 bits (2362), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 468/920 (50%), Positives = 621/920 (67%), Gaps = 56/920 (6%)

Query: 9   GENAASMPKVDPGQPPQLTWQRKLDSNVKVPTGFTLSFQEFRHLAHIGFRLYRYSKQEEA 68
           G  A  +   D G PP+  W+R+L+S+  +   F+++F E   +  +G RL+ Y ++E +
Sbjct: 21  GRTALQLLDFDGGSPPEQAWRRRLNSHANLLKEFSVTFMEAMRMMSLGLRLWSYVREEAS 80

Query: 69  KGKVPVFNVFRKHHI--TSDQGIPLGGIGAGSIGRSYRGEFQRFKLFHGICDDAPVLANQ 126
            G+    + F K     ++ QG+PLGG+G+GSI R +RGEF+ + +  G+C+ +PV+ NQ
Sbjct: 81  HGRKAPIDPFTKEKCKPSASQGVPLGGMGSGSISRGFRGEFKNWHIIPGLCETSPVMENQ 140

Query: 127 FSVFVSRPNG-EKFSSVLCPRSPGVPKKNTDSGIESWDWNLKGENCTYHALFPRAWTVYD 185
           FS+FVSR  G +K+SSVL P      KK  DSGI SWDWNL G++ TYHALFPRAWTVYD
Sbjct: 141 FSIFVSRDGGNKKYSSVLSPGHHEGLKKCNDSGISSWDWNLSGQHSTYHALFPRAWTVYD 200

Query: 186 GEPDPELRIVCRQISPFIPHNYKESSFPASVFTFTLSNSGQTSADVTLLFTWANSVAGDS 245
           GEPDP+L+I CRQISPFIPH+YK+SS P SVF +TL N+G+  A V+LL TWANS+ G S
Sbjct: 201 GEPDPDLKISCRQISPFIPHDYKDSSLPTSVFVYTLVNTGKDRAKVSLLMTWANSIGGFS 260

Query: 246 GLSGHHFNSKTMTKDGVHGLTLHHRTANGRPPVTFAVAAEETADVHVSECPCFLLSGNSK 305
             SG HFN   + +DGV G+ LHH+TA   PPVTFA+AA ET +V+V+  P F LSG   
Sbjct: 261 HHSGGHFNEPFIAEDGVSGVLLHHKTAKDNPPVTFAIAACETQNVNVTVLPVFGLSGEGH 320

Query: 306 GITAKDMWNEIKKHGSFD--HLDNDKTSPSEPGSSIGAAIAASLTIPSGSTRSVTFSLAW 363
             +AK MW+ +K++G FD  + +   + PS  G ++ AA++AS  +      +V F LAW
Sbjct: 321 D-SAKQMWDRMKQNGHFDRENFEAGTSMPSSSGETLCAAVSASTWVEPHGRCTVVFGLAW 379

Query: 364 DCPEVKFFEK-VYHRRYTKFYGTLGDSAARIAHDAILEHAKWECEIEAWQRPILEDKRFP 422
             P++KF +   Y+RRYT+FYGT   SA  + HDA+ ++  WE EIE WQ PIL++++ P
Sbjct: 380 SSPKIKFQKGCTYNRRYTEFYGTSERSAVNLVHDALTKYRIWEEEIEKWQNPILKNEKLP 439

Query: 423 EWYPITLFNELYYLNAGGTIWTDGSPPM-------------QSLATIRERKF-------- 461
           EWY  TLFNELY+L AGGT+WTDG PP+              S    R+ K         
Sbjct: 440 EWYKFTLFNELYFLVAGGTVWTDGQPPVIDEKPSPGSNQQKSSKRGTRDTKQESVKDNHV 499

Query: 462 -------------------SLDTPSKIHTP--ASSDTALGTR------LLENGEENIGQF 494
                              S+   + +H P  A +   LG++      L +NG EN+G+F
Sbjct: 500 KLTAEQVTNGGDLANGEEQSVSKYAAVHGPQMAKATNGLGSQEPIPYLLSKNGPENVGKF 559

Query: 495 LYLEGAEYVMYNTYDVHFYSSFALVMLFPKLELSIQRDFAAAVMMHDPGTMKIMSDGKWV 554
           LYLEG EY+M+ TYDVHFY+SFAL+ LFPK+ELSIQRDFA AV+  D   MK ++DG   
Sbjct: 560 LYLEGVEYIMWCTYDVHFYASFALLDLFPKIELSIQRDFANAVLYEDRRRMKFLADGTSG 619

Query: 555 ARKCLGAVPHDIGLDDPWFEINSYNLFNSSRWKDLNSKFVLQVYRDFVATGDKNFARAVW 614
            RK  GAVPHD+G  DPW E+N+YN+ ++S+WKDLN KFVLQVYRDF ATGD +F R VW
Sbjct: 620 IRKVKGAVPHDLGTHDPWHEMNAYNIHDTSKWKDLNPKFVLQVYRDFAATGDMSFGRDVW 679

Query: 615 PSVYIAMAYMEQFDKDGDGMIENEGFPDQTYDAWSANGVSAYCGGLWVAALQAASALAND 674
           P+V   M YM QFD+DGDG+IEN+GFPDQTYDAW+ +G+SAYCGGLW+AALQAA+ +A+ 
Sbjct: 680 PAVCAVMDYMNQFDRDGDGLIENDGFPDQTYDAWTVHGISAYCGGLWLAALQAAATMAHR 739

Query: 675 VGDHASASYFWVRYQKAKAVYDS-LWNGSYFNYDNSDGSSSTSIQADQLAGQWYARACGL 733
           +GD   A  + +++ +AKAVY++ LWNGSYFNYD+   S+S SIQADQLAGQWYA + GL
Sbjct: 740 LGDRPFAEKYKLKFIQAKAVYEAKLWNGSYFNYDSGTSSNSRSIQADQLAGQWYAASSGL 799

Query: 734 LPIADEAKVKKALTKIYDFNVLKVKGGMCGAMNGMQPDGRIDMSGLQAREIWPGVTYGLA 793
            P+ DE K++ AL KI++FNV+KVKGG  GA+NGM P+G++D + +Q+REIW GVTYG+A
Sbjct: 800 PPLFDENKIRSALQKIFEFNVMKVKGGRLGAVNGMTPNGKVDETCMQSREIWTGVTYGVA 859

Query: 794 ASMIQEEMVDMAFQTAAGVYEVAWSEDGLGYSFQTPESWNNNDEYRSLCYMRPLTIWAMQ 853
           A+M+   M    F TA G++   WSE+G GY FQTPE W  +  YRSL YMRPL IWAMQ
Sbjct: 860 ANMLLHGMEHQGFTTAEGIFIAGWSEEGYGYWFQTPEGWTIDGHYRSLIYMRPLAIWAMQ 919

Query: 854 WALTKPKLSRQEIKHEISDR 873
           WA + PK      K  + DR
Sbjct: 920 WARSPPKAILDAPKVNLMDR 939


>gi|302773445|ref|XP_002970140.1| hypothetical protein SELMODRAFT_92385 [Selaginella moellendorffii]
 gi|300162651|gb|EFJ29264.1| hypothetical protein SELMODRAFT_92385 [Selaginella moellendorffii]
          Length = 983

 Score =  909 bits (2349), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 481/941 (51%), Positives = 630/941 (66%), Gaps = 72/941 (7%)

Query: 19  DPGQPPQLTWQRKLDSNVKVPTGFTLSFQE-FRHLAHIGFRLYRYSKQEEAKGKVPVFNV 77
           DPG PP   W+RKL+ +    T F ++F E  + +  +G RL+++ + E ++G+ P  + 
Sbjct: 27  DPGAPPAHAWRRKLNCHANFLTEFRVTFTEALKLVLGLGLRLWKFIQAERSQGRTPPIDP 86

Query: 78  FRKHHITSD-QGIPLGGIGAGSIGRSYRGEFQRFKLFHGICDDAPVLANQFSVFVSRP-- 134
           F++    S   G+PLGG+G GSIGR +RGEF+R++L   +  +A VLANQFSVFV+R   
Sbjct: 87  FKRGTKPSACHGVPLGGMGGGSIGRGFRGEFRRWQLLPSVTSEAAVLANQFSVFVTRDKD 146

Query: 135 NGEK--FSSVLCPRSP--GVPKKNTDSGIESWDWNLKGENCTYHALFPRAWTVYDGEPDP 190
           +GEK   +SVL PR     V KK     I +WDWNL G+N TYH LFPRAWT+YDGEPDP
Sbjct: 147 HGEKKKIASVLSPRPSELDVGKKAQSPEICAWDWNLDGQNSTYHGLFPRAWTIYDGEPDP 206

Query: 191 ELRIVCRQISPFIPHNYKESSFPASVFTFTLSNSGQTSADVTLLFTWANSVAGDSGLSGH 250
           +L+I CRQ+SPFIPH+Y+ESS P  VF++ + N+G+ SA V+LLFTWANSV GDS LSG 
Sbjct: 207 DLKISCRQVSPFIPHDYRESSLPVCVFSYVIVNTGKESASVSLLFTWANSVGGDSHLSGG 266

Query: 251 HFNSKTMTKDGVHGLTLHHRTANGRPPVTFAVAAEETADVHVSECPCFLLSGNSKGITAK 310
           H N   +  DGV G+ LHHRTA G+P V FA+AA + ADV VS CP FL+SG    +TAK
Sbjct: 267 HSNKPFLEDDGVAGVVLHHRTAKGQPSVEFAIAARQNADVAVSVCPSFLVSGKGSTMTAK 326

Query: 311 DMWNEIKKHGSFD-HLDNDKTSPSEPGSSIGAAIAASLTIPSGSTRSVTFSLAWDCPEVK 369
           +MWN++++ G FD     + + PS   SS+GAA++AS+TI     + + F+L+WD PEVK
Sbjct: 327 EMWNQMREKGCFDTDTVCEPSLPSLVNSSLGAAVSASVTIAPHGKKVIDFALSWDIPEVK 386

Query: 370 FFE-KVYHRRYTKFYGTLGDSAARIAHDAILEHAKWECEIEAWQRPILEDKRFPEWYPIT 428
           F + + YHRRYT+FYG+ GD+A R+ HDA+L + KWE EIE WQ P+L+D   PEWY  T
Sbjct: 387 FPKGRSYHRRYTQFYGSYGDAAPRLVHDALLNYPKWEAEIEKWQEPVLKDANLPEWYRFT 446

Query: 429 LFNELYYLNAGGTIWTDGSPPMQSLATIRERKFSLDTPSKIHTPASSDTALGT------- 481
           LFNELYYL AGGT+WT    P     + R   F+  T S+     + +  +         
Sbjct: 447 LFNELYYLVAGGTVWTAFVTP-----SFRSNGFTSRTNSESSITGAVEELVEMIPLNGGT 501

Query: 482 -------RLLENGEENIGQ--------------------------------------FLY 496
                   +L   EE I +                                      FLY
Sbjct: 502 TGNGGVDAILSETEEEIAEKLVKAMEAAILDKQINSAPRDFPYGTYLLEDGEENVGQFLY 561

Query: 497 LEGAEYVMYNTYDVHFYSSFALVMLFPKLELSIQRDFAAAVMMHDPGTMKIMSDGKWVAR 556
           LE  EYVM+ TYDVHFY+SFAL+ LFPKLELSIQRD+AAA + H+P  +K ++ GKW  +
Sbjct: 562 LESVEYVMWCTYDVHFYASFALISLFPKLELSIQRDYAAATLSHNPELVKFLAGGKWGIK 621

Query: 557 KCLGAVPHDIGLDDPWFEINSYNLFNSSRWKDLNSKFVLQVYRDFVATGDKNFARAVWPS 616
           K LGAVPHD+G+ DPW E+N+YN+ ++SRWKDLNSKFVLQ+YRD VAT D NFAR  WP+
Sbjct: 622 KVLGAVPHDLGIHDPWVEVNAYNIHDTSRWKDLNSKFVLQIYRDVVATEDLNFARVTWPA 681

Query: 617 VYIAMAYMEQFDKDGDGMIENEGFPDQTYDAWSANGVSAYCGGLWVAALQAASALANDVG 676
           VY AMAYM++FD+D DGMIEN+GFPDQTYD W+ +G+SAYCGGLW+AALQAA+A+A  V 
Sbjct: 682 VYTAMAYMDRFDRDRDGMIENDGFPDQTYDTWTVHGISAYCGGLWLAALQAAAAMAEKVN 741

Query: 677 DHASASYFWVRYQKAKAVYD-SLWNGSYFNYDNSDGSSSTSIQADQLAGQWYARACGLLP 735
           +  +A+YF  +++ A+ VYD  LWNGSYFNYD+   S+S SIQADQ+AGQWYA A GL P
Sbjct: 742 EPNAANYFLRKFRDARKVYDKKLWNGSYFNYDSGSSSNSNSIQADQMAGQWYAWASGLPP 801

Query: 736 IADEAKVKKALTKIYDFNVLKVKGGMCGAMNGMQPDGRIDMSGLQAREIWPGVTYGLAAS 795
           + D+ K + AL +IYD+NV+KVKGG  GA+NGM P+GR+D + +Q+REIW GVTY  AA+
Sbjct: 802 LFDDYKARSALQRIYDYNVMKVKGGKLGAVNGMHPNGRVDETAMQSREIWSGVTYAAAAA 861

Query: 796 MIQEEMVDMAFQTAAGVYEVAWSEDGLGYSFQTPESWNNNDEYRSLCYMRPLTIWAMQWA 855
           MI E M++ AF TA G+Y   WS+ G GY FQTPE+W  +  +R+L YMRPL IWAMQWA
Sbjct: 862 MIHEGMLEQAFTTAEGIYNAGWSDSGYGYWFQTPEAWTTDGHFRALTYMRPLAIWAMQWA 921

Query: 856 LTKPKL----SRQEIKHEISDRDSSYLEQHAAFSKVASLLK 892
           L  P       R   K E +   +S    +A F+KV+  LK
Sbjct: 922 LYPPNCVAAGPRIPSKEESAGGSTSSSYLYAGFAKVSEALK 962


>gi|357116998|ref|XP_003560263.1| PREDICTED: non-lysosomal glucosylceramidase-like [Brachypodium
           distachyon]
          Length = 932

 Score =  907 bits (2345), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 461/869 (53%), Positives = 599/869 (68%), Gaps = 21/869 (2%)

Query: 19  DPGQPPQLTWQRKLDSNVKVPTGFTLSFQEFRHLAHIGFRLYRYSKQEEAKGKVPVFNVF 78
           D G PP+  W+RKL S+      F ++F+E   +  +G RL+ Y ++E + G+    + F
Sbjct: 37  DDGSPPEHAWRRKLSSHANRLKEFNVTFREAFKMMKLGLRLWSYIREEASYGRKAPIDPF 96

Query: 79  RKHHI--TSDQGIPLGGIGAGSIGRSYRGEFQRFKLFHGICDDAPVLANQFSVFVSRPNG 136
            +     ++ QG+PLGG+G GSI R +RGEF+ +++  G C+  PV+ NQFS+F+ R   
Sbjct: 97  TRESTKPSASQGVPLGGMGTGSISRGFRGEFKHWQITPGSCEMLPVMENQFSIFIIR-GS 155

Query: 137 EKFSSVLCPRSPGVPKKNTDSGIESWDWNLKGENCTYHALFPRAWTVYDGEPDPELRIVC 196
           +K+SSVL P      KK+++ GI SWDW L+G+  TYHALFPRAWTVYDGEPDPEL++ C
Sbjct: 156 KKYSSVLAPGQHDGLKKSSNDGISSWDWKLRGDRSTYHALFPRAWTVYDGEPDPELKVSC 215

Query: 197 RQISPFIPHNYKESSFPASVFTFTLSNSGQTSADVTLLFTWANSVAGDSGLSGHHFNSKT 256
           RQISPFIPHNY+ESS P SVF +TL N+G+  A V+L+ TWANS+ G S  SG H N   
Sbjct: 216 RQISPFIPHNYQESSLPTSVFVYTLVNTGKERAKVSLVMTWANSIGGLSDHSGGHVNEPF 275

Query: 257 MTKDGVHGLTLHHRTANGRPPVTFAVAAEETADVHVSECPCFLLSGNSKGITAKDMWNEI 316
           + ++GV G+ LHH+TAN  PPVTFAVAA E  +V V+  P F LSG S  +TA++MW  I
Sbjct: 276 IGENGVSGVLLHHKTANNNPPVTFAVAACENQNVDVTVLPVFGLSGESS-VTAREMWGTI 334

Query: 317 KKHGSFDHLDNDKTSPSEP---GSSIGAAIAASLTIPSGSTRSVTFSLAWDCPEVKFFE- 372
            + G FD  DN K  PS P   G ++ AA+ AS  + +    +V F+LAW  PEVKF + 
Sbjct: 335 VQDGCFDR-DNFKAGPSMPSSLGETVCAAVCASTWVEAHGRCTVVFALAWSSPEVKFKKG 393

Query: 373 KVYHRRYTKFYGTLGDSAARIAHDAILEHAKWECEIEAWQRPILEDKRFPEWYPITLFNE 432
             YHRRYTKFYGT   SA  +  DA++ +  WE EI+ WQ PIL D+R PEWY ITLFNE
Sbjct: 394 STYHRRYTKFYGTSPRSAINLVQDALMSYKHWEEEIDKWQTPILRDERLPEWYKITLFNE 453

Query: 433 LYYLNAGGTIWTDGSPPMQSLATIRERKFSLDTPSKIHTPASSDTALGTRLLEN------ 486
           LY+L AGGT+W D     +SL    ++K +   P     P        T  L +      
Sbjct: 454 LYFLVAGGTVWIDS----ESLMVDADKKVNSSLPEDSDLPLHDSNCNSTVPLIDPMPLDF 509

Query: 487 -GEENIGQFLYLEGAEYVMYNTYDVHFYSSFALVMLFPKLELSIQRDFAAAVMMHDPGTM 545
             +EN+G+FLYLEG EY M+ TYDVHFY+SFAL+ LFPK+ELSIQRDFA AV+  D   +
Sbjct: 510 DDKENVGKFLYLEGVEYFMWCTYDVHFYASFALLNLFPKIELSIQRDFARAVLREDRSRV 569

Query: 546 KIMSDGKWVARKCLGAVPHDIGLDDPWFEINSYNLFNSSRWKDLNSKFVLQVYRDFVATG 605
           + ++DG W  RK +GAVPHD+G  DPW E+N+YN+ ++SRWKDLN KFVLQVYRDF ATG
Sbjct: 570 RFLADGTWGTRKVIGAVPHDLGAHDPWHELNAYNIHDTSRWKDLNPKFVLQVYRDFAATG 629

Query: 606 DKNFARAVWPSVYIAMAYMEQFDKDGDGMIENEGFPDQTYDAWSANGVSAYCGGLWVAAL 665
           D +F + VWP+V  AM YMEQFD+DGDGMIEN+GFPDQTYDAW+  GVSAYCG LW+A+L
Sbjct: 630 DMSFGKDVWPAVCTAMEYMEQFDRDGDGMIENDGFPDQTYDAWTVLGVSAYCGCLWLASL 689

Query: 666 QAASALANDVGDHASASYFWVRYQKAKAVYDS-LWNGSYFNYDNSDGSSSTSIQADQLAG 724
           QAA+A+A  +G +  A    V++ KAK V+++ LWNGSYFNYD+    SS SIQADQLAG
Sbjct: 690 QAAAAMARGLGHNDYAERCMVKFAKAKHVFETKLWNGSYFNYDSGASYSSRSIQADQLAG 749

Query: 725 QWYARACGLLPIADEAKVKKALTKIYDFNVLKVKGGMCGAMNGMQPDGRIDMSGLQAREI 784
           QWYA + GL P+ DE ++K  L KI+D+NV++VKGG  GA+NGM P+G++D + +Q+REI
Sbjct: 750 QWYAASSGLPPLFDEGRIKCTLQKIFDYNVMRVKGGRMGAVNGMYPNGKVDETCMQSREI 809

Query: 785 WPGVTYGLAASMIQEEMVDMAFQTAAGVYEVAWSEDGLGYSFQTPESWNNNDEYRSLCYM 844
           W GVTY LAA+M+   M D AF TA G++   WSE+G GY FQTPE+W  +  YRSL YM
Sbjct: 810 WTGVTYSLAATMLLHGMEDQAFTTAEGIFLAGWSEEGYGYWFQTPEAWTIDGHYRSLIYM 869

Query: 845 RPLTIWAMQWALTKPKLSRQEIKHEISDR 873
           RPL IWAMQ+AL+ PK   +  K    DR
Sbjct: 870 RPLAIWAMQYALSPPKAILEAPKVNTMDR 898


>gi|16905165|gb|AAL31035.1|AC078948_19 unknown protein [Oryza sativa Japonica Group]
          Length = 967

 Score =  897 bits (2317), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 463/920 (50%), Positives = 615/920 (66%), Gaps = 63/920 (6%)

Query: 9   GENAASMPKVDPGQPPQLTWQRKLDSNVKVPTGFTLSFQEFRHLAHIGFRLYRYSKQEEA 68
           G  A  +   D G PP+  W+R+L+S+  +   F+++F E   +  +G RL+ Y ++E +
Sbjct: 21  GRTALQLLDFDGGSPPEQAWRRRLNSHANLLKEFSVTFMEAMRMMSLGLRLWSYVREEAS 80

Query: 69  KGKVPVFNVFRKHHI--TSDQGIPLGGIGAGSIGRSYRGEFQRFKLFHGICDDAPVLANQ 126
            G+    + F K     ++ QG+PLGG+G+GSI R +RGEF+ + +  G+C+ +P+    
Sbjct: 81  HGRKAPIDPFTKEKCKPSASQGVPLGGMGSGSISRGFRGEFKNWHIIPGLCETSPI---- 136

Query: 127 FSVFVSRPNG-EKFSSVLCPRSPGVPKKNTDSGIESWDWNLKGENCTYHALFPRAWTVYD 185
              FVSR  G +K+SSVL P      KK  DSGI SWDWNL G++ TYHALFPRAWTVYD
Sbjct: 137 ---FVSRDGGNKKYSSVLSPGHHEGLKKCNDSGISSWDWNLSGQHSTYHALFPRAWTVYD 193

Query: 186 GEPDPELRIVCRQISPFIPHNYKESSFPASVFTFTLSNSGQTSADVTLLFTWANSVAGDS 245
           GEPDP+L+I CRQISPFIPH+YK+SS P SVF +TL N+G+  A V+LL TWANS+ G S
Sbjct: 194 GEPDPDLKISCRQISPFIPHDYKDSSLPTSVFVYTLVNTGKDRAKVSLLMTWANSIGGFS 253

Query: 246 GLSGHHFNSKTMTKDGVHGLTLHHRTANGRPPVTFAVAAEETADVHVSECPCFLLSGNSK 305
             SG HFN   + +DGV G+ LHH+TA   PPVTFA+AA ET +V+V+  P F LSG   
Sbjct: 254 HHSGGHFNEPFIAEDGVSGVLLHHKTAKDNPPVTFAIAACETQNVNVTVLPVFGLSGEGH 313

Query: 306 GITAKDMWNEIKKHGSFD--HLDNDKTSPSEPGSSIGAAIAASLTIPSGSTRSVTFSLAW 363
             +AK MW+ +K++G FD  + +   + PS  G ++ AA++AS  +      +V F LAW
Sbjct: 314 D-SAKQMWDRMKQNGHFDRENFEAGTSMPSSSGETLCAAVSASTWVEPHGRCTVVFGLAW 372

Query: 364 DCPEVKFFEK-VYHRRYTKFYGTLGDSAARIAHDAILEHAKWECEIEAWQRPILEDKRFP 422
             P++KF +   Y+RRYT+FYGT   SA  + HDA+ ++  WE EIE WQ PIL++++ P
Sbjct: 373 SSPKIKFQKGCTYNRRYTEFYGTSERSAVNLVHDALTKYRIWEEEIEKWQNPILKNEKLP 432

Query: 423 EWYPITLFNELYYLNAGGTIWTDGSPPM-------------QSLATIRERKF-------- 461
           EWY  TLFNELY+L AGGT+WTDG PP+              S    R+ K         
Sbjct: 433 EWYKFTLFNELYFLVAGGTVWTDGQPPVIDEKPSPGSNQQKSSKRGTRDTKQESVKDNHV 492

Query: 462 -------------------SLDTPSKIHTP--ASSDTALGTR------LLENGEENIGQF 494
                              S+   + +H P  A +   LG++      L +NG EN+G+F
Sbjct: 493 KLTAEQVTNGGDLANGEEQSVSKYAAVHGPQMAKATNGLGSQEPIPYLLSKNGPENVGKF 552

Query: 495 LYLEGAEYVMYNTYDVHFYSSFALVMLFPKLELSIQRDFAAAVMMHDPGTMKIMSDGKWV 554
           LYLEG EY+M+ TYDVHFY+SFAL+ LFPK+ELSIQRDFA AV+  D   MK ++DG   
Sbjct: 553 LYLEGVEYIMWCTYDVHFYASFALLDLFPKIELSIQRDFANAVLYEDRRRMKFLADGTSG 612

Query: 555 ARKCLGAVPHDIGLDDPWFEINSYNLFNSSRWKDLNSKFVLQVYRDFVATGDKNFARAVW 614
            RK  GAVPHD+G  DPW E+N+YN+ ++S+WKDLN KFVLQVYRDF ATGD +F R VW
Sbjct: 613 IRKVKGAVPHDLGTHDPWHEMNAYNIHDTSKWKDLNPKFVLQVYRDFAATGDMSFGRDVW 672

Query: 615 PSVYIAMAYMEQFDKDGDGMIENEGFPDQTYDAWSANGVSAYCGGLWVAALQAASALAND 674
           P+V   M YM QFD+DGDG+IEN+GFPDQTYDAW+ +G+SAYCGGLW+AALQAA+ +A+ 
Sbjct: 673 PAVCAVMDYMNQFDRDGDGLIENDGFPDQTYDAWTVHGISAYCGGLWLAALQAAATMAHR 732

Query: 675 VGDHASASYFWVRYQKAKAVYDS-LWNGSYFNYDNSDGSSSTSIQADQLAGQWYARACGL 733
           +GD   A  + +++ +AKAVY++ LWNGSYFNYD+   S+S SIQADQLAGQWYA + GL
Sbjct: 733 LGDRPFAEKYKLKFIQAKAVYEAKLWNGSYFNYDSGTSSNSRSIQADQLAGQWYAASSGL 792

Query: 734 LPIADEAKVKKALTKIYDFNVLKVKGGMCGAMNGMQPDGRIDMSGLQAREIWPGVTYGLA 793
            P+ DE K++ AL KI++FNV+KVKGG  GA+NGM P+G++D + +Q+REIW GVTYG+A
Sbjct: 793 PPLFDENKIRSALQKIFEFNVMKVKGGRLGAVNGMTPNGKVDETCMQSREIWTGVTYGVA 852

Query: 794 ASMIQEEMVDMAFQTAAGVYEVAWSEDGLGYSFQTPESWNNNDEYRSLCYMRPLTIWAMQ 853
           A+M+   M    F TA G++   WSE+G GY FQTPE W  +  YRSL YMRPL IWAMQ
Sbjct: 853 ANMLLHGMEHQGFTTAEGIFIAGWSEEGYGYWFQTPEGWTIDGHYRSLIYMRPLAIWAMQ 912

Query: 854 WALTKPKLSRQEIKHEISDR 873
           WA + PK      K  + DR
Sbjct: 913 WARSPPKAILDAPKVNLMDR 932


>gi|297831192|ref|XP_002883478.1| hypothetical protein ARALYDRAFT_479911 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297329318|gb|EFH59737.1| hypothetical protein ARALYDRAFT_479911 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 950

 Score =  893 bits (2307), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 446/889 (50%), Positives = 611/889 (68%), Gaps = 36/889 (4%)

Query: 19  DPGQPPQLTWQRKLDSNVKVPTGFTLSFQEFRHLAHIGFRLYRYSKQEEAKGKVPVFNVF 78
           D   PP   W+R+L+ +  +   FT++F+E   +  +G RL+ Y ++E + G+    + F
Sbjct: 31  DCAAPPPHAWRRRLNCHANILKEFTITFREAIKMVRLGIRLWSYVREEASHGRKAPIDPF 90

Query: 79  RKHHI--TSDQGIPLGGIGAGSIGRSYRGEFQRFKLFHGICDDAPVLANQFSVFVSRPNG 136
            K +   ++ QG+PLGG+G+GSI R +RGEF+++++  G CD +P+++NQFS+F+SR  G
Sbjct: 91  TKENCKPSASQGVPLGGMGSGSISRGFRGEFKQWQITPGTCDPSPMMSNQFSIFISRDGG 150

Query: 137 -EKFSSVLCPRSPGVPKKNTDSGIESWDWNLKGENCTYHALFPRAWTVYDGEPDPELRIV 195
            +K++SVL P   G   K+ D G+ SW WNL G++ TYHALFPRAWT+YDGEPDPEL+I 
Sbjct: 151 HKKYASVLSPGQHGSLGKSRDKGLSSWGWNLNGQHSTYHALFPRAWTIYDGEPDPELKIS 210

Query: 196 CRQISPFIPHNYKESSFPASVFTFTLSNSGQTSADVTLLFTWANSVAGDSGLSGHHFNSK 255
           CRQISPFIP+NY++SS PA+VF +TL N+G+  A V+LLFTWANS+ G S +SG H N  
Sbjct: 211 CRQISPFIPNNYRDSSLPAAVFVYTLVNTGKERAKVSLLFTWANSMGGTSHMSGGHVNEP 270

Query: 256 TMTKDGVHGLTLHHRTANGRPPVTFAVAAEETADVHVSECPCFLLSGNSKGITAKDMWNE 315
            + +DGV G+ LHH+T  G PPVTFA+AA ET +V+V+  PCF LS +S   TAKDMW+ 
Sbjct: 271 FIGEDGVSGVLLHHKTGKGNPPVTFAIAASETQNVNVTVLPCFGLSEDSS-FTAKDMWDT 329

Query: 316 IKKHGSFD--HLDNDKTSPSEPGSSIGAAIAASLTIPSGSTRSVTFSLAWDCPEVKFFE- 372
           ++K G FD  + ++  ++PS  G +I AA++AS  + +    +V+F+L+W  P+VKF + 
Sbjct: 330 VEKDGKFDQENFNSGPSTPSLAGDTICAAVSASAWVEAHGKCTVSFALSWSSPKVKFSKG 389

Query: 373 KVYHRRYTKFYGTLGDSAARIAHDAILEHAKWECEIEAWQRPILEDKRFPEWYPITLFNE 432
             Y RRYTKFYGT   +A  + HDA+  + +WE +IEAWQ PIL D+R PEWY  TLFNE
Sbjct: 390 STYDRRYTKFYGTSPRAALDLVHDALTNYKRWEEDIEAWQNPILRDERLPEWYKFTLFNE 449

Query: 433 LYYLNAGGTIWTDGSP------------------PMQSLATIRERKFSLDTPSKIHTPAS 474
           LY+L AGGT+W D S                    M     I +++  L   + +   ++
Sbjct: 450 LYFLVAGGTVWIDSSSLNANGNSQHQQSGLGNSDGMAGGLDINDQRNDLANGNSVGVKSN 509

Query: 475 SDTA---------LGTRLLENGEENIGQFLYLEGAEYVMYNTYDVHFYSSFALVMLFPKL 525
            + +         + TR +++G++ +G+FLYLEG EYVM+ TYDVHFY+S+AL+MLFPK+
Sbjct: 510 DEVSAIHNRNGLFVDTRHVDDGDD-VGRFLYLEGVEYVMWCTYDVHFYASYALLMLFPKI 568

Query: 526 ELSIQRDFAAAVMMHDPGTMKIMSDGKWVARKCLGAVPHDIGLDDPWFEINSYNLFNSSR 585
           EL+IQRDFA AV+  D   +K +++G    RK  GAVPHD+G+ DPW E+N+YN+ ++S+
Sbjct: 569 ELNIQRDFAKAVLSEDGRKVKFLAEGNVGIRKVRGAVPHDLGMHDPWNEMNAYNIHDTSK 628

Query: 586 WKDLNSKFVLQVYRDFVATGDKNFARAVWPSVYIAMAYMEQFDKDGDGMIENEGFPDQTY 645
           WKDLN KFVLQVYRDF ATGD  F   VWP+V  AM YMEQFD+D D +IEN+GFPDQTY
Sbjct: 629 WKDLNPKFVLQVYRDFAATGDYQFGIDVWPAVRAAMEYMEQFDRDNDDLIENDGFPDQTY 688

Query: 646 DAWSANGVSAYCGGLWVAALQAASALANDVGDHASASYFWVRYQKAKAVYDS-LWNGSYF 704
           D W+ +GVSAYCG LW+AALQAA+A+A  +GD   A     ++  AKA  ++ LWNGSYF
Sbjct: 689 DTWTVHGVSAYCGCLWLAALQAAAAMALQIGDKFFAELCKNKFLNAKAALETKLWNGSYF 748

Query: 705 NYDNSDGSSSTSIQADQLAGQWYARACGLLPIADEAKVKKALTKIYDFNVLKVKGGMCGA 764
           NYD+   S+S SIQ DQLAGQWYA + GL PI +E+K++  + KI+DFNV+K KGG  GA
Sbjct: 749 NYDSGASSNSKSIQTDQLAGQWYAASSGLPPIFEESKIRSTMQKIFDFNVMKTKGGKMGA 808

Query: 765 MNGMQPDGRIDMSGLQAREIWPGVTYGLAASMIQEEMVDMAFQTAAGVYEVAWSEDGLGY 824
           +NGM PDG++D + +Q+REIW GVTY  AA+MI   M +  F TA G++   WSE+G GY
Sbjct: 809 VNGMHPDGKVDDTCMQSREIWTGVTYAAAATMILSGMEEQGFTTAEGIFTAGWSEEGFGY 868

Query: 825 SFQTPESWNNNDEYRSLCYMRPLTIWAMQWALTKPKLSRQEIKHEISDR 873
            FQTPE W  +  YRSL YMRPL IW MQWAL+ PK      +  + DR
Sbjct: 869 WFQTPEGWTMDGHYRSLIYMRPLAIWGMQWALSLPKAILDAPQINMMDR 917


>gi|255554158|ref|XP_002518119.1| conserved hypothetical protein [Ricinus communis]
 gi|223542715|gb|EEF44252.1| conserved hypothetical protein [Ricinus communis]
          Length = 968

 Score =  892 bits (2304), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 458/904 (50%), Positives = 607/904 (67%), Gaps = 50/904 (5%)

Query: 19  DPGQPPQLTWQRKLDSNVKVPTGFTLSFQEFRHLAHIGFRLYRYSKQEEAKGKVPVFNVF 78
           D   PP+  W+R+L+S+  +   F+++F E   +  +G RL+ Y ++E + G+    + F
Sbjct: 31  DSAAPPKHAWRRRLNSHANILKEFSVTFTEAIKMVRLGIRLWSYVREEASHGRKAPIDPF 90

Query: 79  RKHHI--TSDQGIPLGGIGAGSIGRSYRGEFQRFKLFHGICDDAPVLANQFSVFVSRPNG 136
            +     ++ QG+PLGG+G+GSI R +RGEF+++++   IC+ +PV+ANQFS+F+SR  G
Sbjct: 91  TRESCKPSASQGVPLGGMGSGSISRGFRGEFRQWQIVPSICEVSPVMANQFSIFISRDGG 150

Query: 137 -EKFSSVLCPRSPGVPKKNTDSGIESWDWNLKGENCTYHALFPRAWTVYDGEPDPELRIV 195
            +K++SVL P       K+ D GI SW WNL G++ TYHALFPRAWT+YDGEPDPEL+I 
Sbjct: 151 TKKYASVLAPGQHEGLGKDGDQGISSWGWNLSGQHSTYHALFPRAWTIYDGEPDPELKIS 210

Query: 196 CRQISPFIPHNYKESSFPASVFTFTLSNSGQTSADVTLLFTWANSVAGDSGLSGHHFNSK 255
           CRQISPFIPHNY++SS P +VF +TL NSG+  A V+LLFTWANS+ G S LSG H N  
Sbjct: 211 CRQISPFIPHNYRDSSLPTAVFVYTLVNSGKERAKVSLLFTWANSIGGVSHLSGDHVNEP 270

Query: 256 TMTKDGVHGLTLHHRTANGRPPVTFAVAAEETADVHVSECPCFLLSGNSKGITAKDMWNE 315
            + +DGV G+ LHH+TA G PPVTFA+AA ET +V V+  P F LS  S  ITAKDMW++
Sbjct: 271 FIGEDGVSGVLLHHKTAKGNPPVTFAIAACETQNVSVTVLPSFGLSEESH-ITAKDMWSK 329

Query: 316 IKKHGSFD--HLDNDKTSPSEPGSSIGAAIAASLTIPSGSTRSVTFSLAWDCPEVKFFE- 372
           + + G FD  + D   T PS PG ++ AA++AS  +      +V F+L+W  P++KF + 
Sbjct: 330 MVQDGQFDRENFDCGPTMPSSPGETLCAAVSASAWVEPHGKCTVAFALSWSSPKIKFSKG 389

Query: 373 KVYHRRYTKFYGTLGDSAARIAHDAILEHAKWECEIEAWQRPILEDKRFPEWYPITLFNE 432
             YHRRYTKFYGT   +A  + HDA+  +  WE EIE WQ PIL+D+R PEWY  TLFNE
Sbjct: 390 STYHRRYTKFYGTSERAAQNLVHDALKNYKWWEEEIEKWQNPILKDERLPEWYKFTLFNE 449

Query: 433 LYYLNAGGTIWTDGS--------------------PPMQSLATIRERKFSLDT------- 465
           LY+L AGGT+W D S                     P  +   I    +++ T       
Sbjct: 450 LYFLVAGGTVWIDSSLLTEDMRETMNVDVIEVQVSRPKGAEKQIATNGYNVATIGLEEKD 509

Query: 466 -------PSKIHTPASSDTALGTRLL--------ENGEENIGQFLYLEGAEYVMYNTYDV 510
                  PSK   P S +       L        +N  +++G+FLYLEG EY+M+ TYDV
Sbjct: 510 GASNGNYPSKDELPVSHENGHLNHSLKLSPLMEWQNNSDDVGRFLYLEGVEYIMWCTYDV 569

Query: 511 HFYSSFALVMLFPKLELSIQRDFAAAVMMHDPGTMKIMSDGKWVARKCLGAVPHDIGLDD 570
           HFY+SFAL+ LFPK+EL+IQRDFA AV+  D   +K +++G    RK  GAVPHD+G  D
Sbjct: 570 HFYASFALLELFPKIELNIQRDFAKAVLSEDGRKVKFLAEGNVGIRKVRGAVPHDLGTHD 629

Query: 571 PWFEINSYNLFNSSRWKDLNSKFVLQVYRDFVATGDKNFARAVWPSVYIAMAYMEQFDKD 630
           PW E+N+YN+ ++S+WKDLN KFVLQVYRDF AT D +F   VWP+V  AM YMEQFD+D
Sbjct: 630 PWNEMNAYNIHDTSKWKDLNPKFVLQVYRDFAATQDMSFGVDVWPAVRSAMEYMEQFDRD 689

Query: 631 GDGMIENEGFPDQTYDAWSANGVSAYCGGLWVAALQAASALANDVGDHASASYFWVRYQK 690
           GD +IEN+GFPDQTYDAW+ +GVSAYCG LW+AAL+AA+A+A  VGD   A     ++ K
Sbjct: 690 GDALIENDGFPDQTYDAWTVHGVSAYCGCLWLAALEAAAAMALQVGDKYFAELCRSKFVK 749

Query: 691 AKAVYDS-LWNGSYFNYDNSDGSSSTSIQADQLAGQWYARACGLLPIADEAKVKKALTKI 749
           AK+ +++ LWNGSYFNYD+   S+S SIQADQLAGQWY  + GL P+ D++K+K  L KI
Sbjct: 750 AKSAFEAKLWNGSYFNYDSGSSSNSKSIQADQLAGQWYVASSGLPPLFDDSKIKSTLQKI 809

Query: 750 YDFNVLKVKGGMCGAMNGMQPDGRIDMSGLQAREIWPGVTYGLAASMIQEEMVDMAFQTA 809
           YDFNV+KV+GG  GA+NGM P+G++D + +Q+REIW GVTY +AA+MI   M D AF  A
Sbjct: 810 YDFNVMKVRGGRMGAVNGMHPNGKVDETCMQSREIWTGVTYAVAATMILAGMEDKAFAAA 869

Query: 810 AGVYEVAWSEDGLGYSFQTPESWNNNDEYRSLCYMRPLTIWAMQWALTKPKLSRQEIKHE 869
            G++   WSEDG GY FQTPE W  +  +RSL YMRPL IW MQWAL+ PK   +  K  
Sbjct: 870 EGIFLAGWSEDGYGYWFQTPEGWTTDGHFRSLIYMRPLAIWGMQWALSLPKAILEAPKIN 929

Query: 870 ISDR 873
           I DR
Sbjct: 930 IMDR 933


>gi|9665098|gb|AAF97289.1|AC010164_11 Hypothetical protein [Arabidopsis thaliana]
          Length = 790

 Score =  890 bits (2299), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 452/823 (54%), Positives = 559/823 (67%), Gaps = 108/823 (13%)

Query: 130 FVSRPNGEKFSSVLCPRSPGVPKKN---------TDSGIESWDWNLKGENCTYHALFPRA 180
           FVSRP G K S+VLCP  P V K N          + GIESWDWN+ GE  TYHAL+PR+
Sbjct: 24  FVSRPGGVKHSTVLCPSKPQVVKDNGGYLCQGQAPNMGIESWDWNMTGEKSTYHALYPRS 83

Query: 181 WTVYDG-------EPDPELRIVCRQISPFIPHNYKESSFPASVFTFTLSNSGQTSADVTL 233
           WTVYDG             R++  ++  +I            +    ++N+G   A VTL
Sbjct: 84  WTVYDGIIFDLNVSNHVYYRLIYCKVLMYI------KVMKMFLLDEQVTNTGAEPAIVTL 137

Query: 234 LFTWANSVAGDSGLSGHHFNSKTMTKDGVHGLTLHHRTANGRPPVTFAVAAEETADVHVS 293
           LFTW NSV G SGL+G HFNS TM            +TANG PPV++A+AA+ET DV VS
Sbjct: 138 LFTWENSVGGASGLTGQHFNS-TM------------KTANGHPPVSYAIAAKETEDVRVS 184

Query: 294 ECPCFLLSGNSKG-ITAKDMWNEIKKHGSFDHLDNDKTSPSEPGSSIGAAIAASLTIPSG 352
            CPCF++SG +   ITA DMW+EIKK+ SFD L ++  SPS+PG+SIGAAIAA + +P G
Sbjct: 185 SCPCFIVSGTTPNQITAGDMWDEIKKNASFDKLTSNACSPSKPGTSIGAAIAAKVKVPPG 244

Query: 353 STRSVTFSLAWDCPEVKFFEKVYHRRYTKFYGTLGDSAARIAHDAILEHAKWECEIEAWQ 412
             R+VTFSL+WDCPE +F EK YHRRYT+FYG+LG++A  +AHDA+L             
Sbjct: 245 CDRTVTFSLSWDCPEARFDEKTYHRRYTRFYGSLGNAAVAMAHDALL------------- 291

Query: 413 RPILEDKRFPEWYPITLFNELYYLNAGGTIWTDGSPPMQSLATIRERKFSLD-------- 464
                                           +G PP QSL +I  RK SL         
Sbjct: 292 --------------------------------NGLPPKQSLDSIGRRKISLGLSTIDKND 319

Query: 465 ----------------TPSKIHTPASSDTALGTRLLENGEENIGQFLYLEGAEYVMYNTY 508
                             S+IH P +S+ ALG  +++N  ENIGQFLYLEG +Y+MYNTY
Sbjct: 320 QDQNNVALDILGRIDAVCSQIHAPLTSNAALGNTMIQNTTENIGQFLYLEGVQYLMYNTY 379

Query: 509 DVHFYSSFALVMLFPKLELSIQRDFAAAVMMHDPGTMKIMSDGKWVARKCLGAVPHDIGL 568
           DVHFYSSFAL+MLFPK+ELSIQRDFAAAV+MHD    ++MS G++V RK LGAVPHDIGL
Sbjct: 380 DVHFYSSFALLMLFPKIELSIQRDFAAAVLMHDSSKKQVMSSGEFVTRKVLGAVPHDIGL 439

Query: 569 DDPWFEINSYNLFNSSRWKDLNSKFVLQVYRDFVATGDKNFARAVWPSVYIAMAYMEQFD 628
           +DPWFE+N+YNLFN+ RWKDLNSKFVLQVYRD VATGD NFA+AVWPSVY A+AY++QFD
Sbjct: 440 NDPWFELNAYNLFNTDRWKDLNSKFVLQVYRDVVATGDLNFAKAVWPSVYTAIAYLDQFD 499

Query: 629 KDGDGMIENEGFPDQTYDAWSANGVSAYCGGLWVAALQAASALANDVGDHASASYFWVRY 688
           KDGDGMIENEGFPDQTYDAWS +GVSAYCGGLWVAALQA SALA ++GD+ +A YF  +Y
Sbjct: 500 KDGDGMIENEGFPDQTYDAWSCSGVSAYCGGLWVAALQAGSALAREIGDNGAAVYFNAKY 559

Query: 689 QKAKAVYDSLWNGSYFNYDNSDGSSSTSIQADQLAGQWYARACGLLPIADEAKVKKALTK 748
           +KA++VY+ LWNGSYFNYDNS   SS+SI ADQ+AGQWYARACGL PIA E  +KKAL  
Sbjct: 560 EKARSVYEKLWNGSYFNYDNSRSGSSSSILADQMAGQWYARACGLKPIAKEEWIKKALET 619

Query: 749 IYDFNVLKVKGGMCGAMNGMQPDGRIDMSGLQAREIWPGVTYGLAASMIQEEMVDMAFQT 808
           +YDFNV++V+ G  GA+NGM PDGR+D S + +RE+W G TY +AA MIQE + D  F+T
Sbjct: 620 VYDFNVMRVRDGTRGAVNGMLPDGRVDTSTMVSREVWAGTTYSVAACMIQEGLADKGFRT 679

Query: 809 AAGVYEVAWSEDGLGYSFQTPESWNNNDEYRSLCYMRPLTIWAMQWALTKPKLSRQE--- 865
           A+G+YE AWS+ GLG +FQTPE+W  NDEYRSLCYMRPL IW +QWA T PK +R++   
Sbjct: 680 ASGIYEAAWSDRGLGCAFQTPEAWTTNDEYRSLCYMRPLAIWGIQWAHTMPKPNREQEVS 739

Query: 866 IKHEISDRDSSYLEQHAAFSKVASLLKLPKEEASKGFLKVVYD 908
           ++ +  D  S   +QHA F KVA  LK  K +  +  L+  Y+
Sbjct: 740 LRPQEEDATSVLFQQHAGFIKVAHYLKNTKGKDHRNRLQTAYE 782


>gi|356576421|ref|XP_003556330.1| PREDICTED: non-lysosomal glucosylceramidase-like [Glycine max]
          Length = 950

 Score =  884 bits (2284), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 454/885 (51%), Positives = 609/885 (68%), Gaps = 31/885 (3%)

Query: 19  DPGQPPQLTWQRKLDSNVKVPTGFTLSFQEFRHLAHIGFRLYRYSKQEEAKGKVPVFNVF 78
           D   PP+  W+R+L+S+  +   F ++F E   +  +G R++ Y ++E + G+    + F
Sbjct: 31  DSSAPPEQAWRRRLNSHANLLKEFRVTFMEAIKMVRLGIRIWSYVREEASHGRKAPIDPF 90

Query: 79  RKHHI--TSDQGIPLGGIGAGSIGRSYRGEFQRFKLFHGICDDAPVLANQFSVFVSRPNG 136
            +     ++ QG+PLGG+G+GSI R +RGEF+++++   +C+ +PV+ANQFS+F+SR  G
Sbjct: 91  TRESCKPSASQGVPLGGMGSGSISRGFRGEFRQWQIIPSLCEASPVMANQFSIFISREGG 150

Query: 137 EK-FSSVLCPRSPGVPKKNTDSGIESWDWNLKGENCTYHALFPRAWTVYDGEPDPELRIV 195
            K F+SVL P       K  D GI SW WNL G++ TYHALFPRAWTVYDGEPDPEL+I 
Sbjct: 151 NKNFASVLAPGQHEGLGKPDDQGISSWGWNLSGQHSTYHALFPRAWTVYDGEPDPELKIS 210

Query: 196 CRQISPFIPHNYKESSFPASVFTFTLSNSGQTSADVTLLFTWANSVAGDSGLSGHHFNSK 255
           CRQISPF+PHNY+ESS PA+VF +TL N+G+  A V+LLFTWANS+ G S LSG H N  
Sbjct: 211 CRQISPFVPHNYRESSLPAAVFVYTLVNTGKERAKVSLLFTWANSIGGSSHLSGDHVNEP 270

Query: 256 TMTKDGVHGLTLHHRTANGRPPVTFAVAAEETADVHVSECPCFLLSGNSKGITAKDMWNE 315
              +DGV G+ L+H+TA G PPVTFA+AA ET +V+VS  P F LS  S   TAK MW++
Sbjct: 271 FKAEDGVSGVLLYHKTAKGNPPVTFAIAACETQNVNVSVLPSFGLSEGSS-TTAKGMWSK 329

Query: 316 IKKHGSFD--HLDNDKTSPSEPGSSIGAAIAASLTIPSGSTRSVTFSLAWDCPEVKFFE- 372
           + K G FD  + ++  + PS PG ++ AA+AAS+ +      +V FSLAW  P+VKF + 
Sbjct: 330 MVKDGQFDQENFNSGPSMPSSPGETLCAAVAASMWVEPHGKCTVAFSLAWSSPKVKFVKG 389

Query: 373 KVYHRRYTKFYGTLGDSAARIAHDAILEHAKWECEIEAWQRPILEDKRFPEWYPITLFNE 432
             ++RRYTKFYGT   +AA +AHDA+  + +WE EIE WQ PIL+D+  PEWY  TLFNE
Sbjct: 390 STFNRRYTKFYGTSEKAAADLAHDALTHYNRWEEEIEKWQNPILKDETLPEWYKFTLFNE 449

Query: 433 LYYLNAGGTIWTDGSPPMQSLATIRERKFSLDTPSKIHTP----------------ASSD 476
           LY+L AGGTIW D      ++   ++R   L+      T                 ++ D
Sbjct: 450 LYFLVAGGTIWIDSPLLSSNMRNDQDRVRELENTVVKETEDKMSDRKRTVVERIMDSTCD 509

Query: 477 TALGT-------RLLENGEENIGQFLYLEGAEYVMYNTYDVHFYSSFALVMLFPKLELSI 529
           +A+ T       +L  + + ++G+FLYLEG EY+M+ TYDVHFY+SFAL+ LFPK+EL+I
Sbjct: 510 SAVITGHDPADEKLSGDDDADVGRFLYLEGVEYIMWCTYDVHFYASFALLELFPKIELNI 569

Query: 530 QRDFAAAVMMHDPGTMKIMSDGKWVARKCLGAVPHDIGLDDPWFEINSYNLFNSSRWKDL 589
           QRDFA AV+  D   +K +++G W  RK  GAVPHD+G  DPW E+N+YN+ ++S+WKDL
Sbjct: 570 QRDFARAVLCEDGRKVKFLAEGNWGIRKVYGAVPHDLGTHDPWHEMNAYNIHDTSKWKDL 629

Query: 590 NSKFVLQVYRDFVATGDKNFARAVWPSVYIAMAYMEQFDKDGDGMIENEGFPDQTYDAWS 649
           N KFVLQVYRDF  TGD  F   VWP+V  AM YMEQFD+DGDG+IEN+GFPDQTYD W+
Sbjct: 630 NPKFVLQVYRDFATTGDLEFGVDVWPAVRAAMEYMEQFDRDGDGLIENDGFPDQTYDTWT 689

Query: 650 ANGVSAYCGGLWVAALQAASALANDVGDHASASYFWVRYQKAKAVY-DSLWNGSYFNYDN 708
            +GVS YCG LW+AALQAA+A+A D+GD   A     ++ KAK  + + LWNG+YFNYD+
Sbjct: 690 VHGVSTYCGCLWLAALQAAAAMALDLGDREFAEKCKRKFLKAKPAFEEKLWNGTYFNYDS 749

Query: 709 SDGSSSTSIQADQLAGQWYARACGLLPIADEAKVKKALTKIYDFNVLKVKGGMCGAMNGM 768
               +S SIQADQLAGQWY  + GL P+ +++K+K AL K+YDFNV+KVKGG  GA+NGM
Sbjct: 750 GSSGNSKSIQADQLAGQWYTASSGLPPLFEDSKIKSALRKVYDFNVMKVKGGRMGAVNGM 809

Query: 769 QPDGRIDMSGLQAREIWPGVTYGLAASMIQEEMVDMAFQTAAGVYEVAWSEDGLGYSFQT 828
            P+G++D + +Q+RE+W GVTYGLAA+MI   M + AF TA G++   WSEDG GY FQT
Sbjct: 810 HPNGKVDETCMQSREVWTGVTYGLAATMILAGMEEEAFATAEGIFLAGWSEDGYGYWFQT 869

Query: 829 PESWNNNDEYRSLCYMRPLTIWAMQWALTKPKLSRQEIKHEISDR 873
           PE+W  +  YRSL YMRPL IW MQ+A+ +PK   +  K  I DR
Sbjct: 870 PEAWTMDGHYRSLMYMRPLAIWGMQYAINRPKAILEAPKINIMDR 914


>gi|22331303|ref|NP_189060.2| Beta-glucosidase, GBA2 type family protein [Arabidopsis thaliana]
 gi|79313355|ref|NP_001030757.1| Beta-glucosidase, GBA2 type family protein [Arabidopsis thaliana]
 gi|17529232|gb|AAL38843.1| unknown protein [Arabidopsis thaliana]
 gi|110742225|dbj|BAE99039.1| hypothetical protein [Arabidopsis thaliana]
 gi|332643347|gb|AEE76868.1| Beta-glucosidase, GBA2 type family protein [Arabidopsis thaliana]
 gi|332643348|gb|AEE76869.1| Beta-glucosidase, GBA2 type family protein [Arabidopsis thaliana]
          Length = 950

 Score =  880 bits (2274), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 444/889 (49%), Positives = 610/889 (68%), Gaps = 36/889 (4%)

Query: 19  DPGQPPQLTWQRKLDSNVKVPTGFTLSFQEFRHLAHIGFRLYRYSKQEEAKGKVPVFNVF 78
           D   PP   W+R+L+ +  +   FT++F+E   +  +G RL+ Y ++E + G+    + F
Sbjct: 31  DSAAPPPHAWRRRLNCHANILKEFTITFREAIKMVRLGIRLWSYVREEASHGRKAPIDPF 90

Query: 79  RKHHI--TSDQGIPLGGIGAGSIGRSYRGEFQRFKLFHGICDDAPVLANQFSVFVSRPNG 136
            K +   ++ QG+PLGG+G+GSI R +RGEF+++++  G CD +P+++NQFS+F+SR  G
Sbjct: 91  TKENCKPSASQGVPLGGMGSGSISRGFRGEFKQWQITPGTCDPSPMMSNQFSIFISRDGG 150

Query: 137 -EKFSSVLCPRSPGVPKKNTDSGIESWDWNLKGENCTYHALFPRAWTVYDGEPDPELRIV 195
            +K++SVL P   G   K+ D G+ SW WNL G++ TYHALFPRAWT+YDGEPDPEL+I 
Sbjct: 151 HKKYASVLSPGQHGSLGKSRDKGLSSWGWNLNGQHSTYHALFPRAWTIYDGEPDPELKIS 210

Query: 196 CRQISPFIPHNYKESSFPASVFTFTLSNSGQTSADVTLLFTWANSVAGDSGLSGHHFNSK 255
           CRQISPFIP+NY++SS PA+VF +TL N+G+  A V+LLFTWANS+ G S +SG H N  
Sbjct: 211 CRQISPFIPNNYRDSSLPAAVFVYTLVNTGKERAKVSLLFTWANSMGGTSHMSGGHVNEP 270

Query: 256 TMTKDGVHGLTLHHRTANGRPPVTFAVAAEETADVHVSECPCFLLSGNSKGITAKDMWNE 315
            + +DGV G+ LHH+T  G PPVTFA+AA ET +V+V+  PCF LS +S   TAKDMW+ 
Sbjct: 271 FIGEDGVSGVLLHHKTGKGNPPVTFAIAASETQNVNVTVLPCFGLSEDSS-FTAKDMWDT 329

Query: 316 IKKHGSFD--HLDNDKTSPSEPGSSIGAAIAASLTIPSGSTRSVTFSLAWDCPEVKFFE- 372
           +++ G FD  + ++  ++PS  G +I AA++AS  + +    +V+F+L+W  P+VKF + 
Sbjct: 330 MEQDGKFDQENFNSGPSTPSLAGDTICAAVSASAWVEAHGKCTVSFALSWSSPKVKFSKG 389

Query: 373 KVYHRRYTKFYGTLGDSAARIAHDAILEHAKWECEIEAWQRPILEDKRFPEWYPITLFNE 432
             Y RRYTKFYGT   +A  + HDA+  + +WE +IEAWQ PIL D+R PEWY  TLFNE
Sbjct: 390 STYDRRYTKFYGTSPRAALDLVHDALTNYKRWEEDIEAWQNPILRDERLPEWYKFTLFNE 449

Query: 433 LYYLNAGGTIWTDGSP---------PMQSLAT---------IRERKFSLDTPSKIHTPAS 474
           LY+L AGGT+W D S              L           I +++  L   + +   ++
Sbjct: 450 LYFLVAGGTVWIDSSSLNANGNSQHQQSGLGNSDGKVGGLDINDQRNDLGNGNSVGVKSN 509

Query: 475 SDTA---------LGTRLLENGEENIGQFLYLEGAEYVMYNTYDVHFYSSFALVMLFPKL 525
            + +         + T  +++G++ +G+FLYLEG EYVM+ TYDVHFY+S+AL+MLFPK+
Sbjct: 510 DEVSAIHNRNGLFVDTPHVDDGDD-VGRFLYLEGVEYVMWCTYDVHFYASYALLMLFPKI 568

Query: 526 ELSIQRDFAAAVMMHDPGTMKIMSDGKWVARKCLGAVPHDIGLDDPWFEINSYNLFNSSR 585
           EL+IQRDFA AV+  D   +K +++G    RK  GAVPHD+G+ DPW E+N+YN+ ++SR
Sbjct: 569 ELNIQRDFAKAVLSEDGRKVKFLAEGNVGIRKVRGAVPHDLGMHDPWNEMNAYNIHDTSR 628

Query: 586 WKDLNSKFVLQVYRDFVATGDKNFARAVWPSVYIAMAYMEQFDKDGDGMIENEGFPDQTY 645
           WKDLN KFVLQVYRDF ATGD  F   VWP+V  AM YMEQFD+D D +IEN+GFPDQTY
Sbjct: 629 WKDLNPKFVLQVYRDFAATGDYQFGIDVWPAVRAAMEYMEQFDRDNDDLIENDGFPDQTY 688

Query: 646 DAWSANGVSAYCGGLWVAALQAASALANDVGDHASASYFWVRYQKAKAVYDS-LWNGSYF 704
           D W+ +GVSAYCG LW+AALQAA+A+A  +GD   A     ++  AKA  ++ LWNGSYF
Sbjct: 689 DTWTVHGVSAYCGCLWLAALQAAAAMALQIGDKFFAELCKNKFLNAKAALETKLWNGSYF 748

Query: 705 NYDNSDGSSSTSIQADQLAGQWYARACGLLPIADEAKVKKALTKIYDFNVLKVKGGMCGA 764
           NYD+   S+S SIQ DQLAGQWYA + GL P+ +E+K++  + KI+DFNV+K KGG  GA
Sbjct: 749 NYDSGSSSNSKSIQTDQLAGQWYAASSGLPPLFEESKIRSTMQKIFDFNVMKTKGGKMGA 808

Query: 765 MNGMQPDGRIDMSGLQAREIWPGVTYGLAASMIQEEMVDMAFQTAAGVYEVAWSEDGLGY 824
           +NGM PDG++D + +Q+REIW GVTY  AA+MI   M +  F TA G++   WSE+G GY
Sbjct: 809 VNGMHPDGKVDDTCMQSREIWTGVTYAAAATMILSGMEEQGFTTAEGIFTAGWSEEGFGY 868

Query: 825 SFQTPESWNNNDEYRSLCYMRPLTIWAMQWALTKPKLSRQEIKHEISDR 873
            FQTPE W  +  YRSL YMRPL IW MQWAL+ PK      +  + DR
Sbjct: 869 WFQTPEGWTMDGHYRSLIYMRPLAIWGMQWALSLPKAILDAPQINMMDR 917


>gi|225433442|ref|XP_002285674.1| PREDICTED: non-lysosomal glucosylceramidase-like [Vitis vinifera]
          Length = 978

 Score =  878 bits (2268), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 451/901 (50%), Positives = 600/901 (66%), Gaps = 60/901 (6%)

Query: 19  DPGQPPQLTWQRKLDSNVKVPTGFTLSFQEFRHLAHIGFRLYRYSKQEEAKGKVPVFNVF 78
           D   PP+  W+R+L+S+  +   F+++F E   +  +G RL+ Y ++E ++G+    + F
Sbjct: 31  DSAAPPEQAWRRRLNSHANILKEFSVTFTEAIKMIRLGIRLWSYIREEASQGRKAPIDPF 90

Query: 79  RKHHI--TSDQGIPLGGIGAGSIGRSYRGEFQRFKLFHGICDDAPVLANQFSVFVSRPNG 136
            +     ++ QG+PLGG+G+GSI R +RGEF+ +++  G CD +P++ANQFS+F+SR  G
Sbjct: 91  TRETCKPSASQGVPLGGMGSGSISRGFRGEFRHWQIVPGTCDASPIMANQFSIFISREGG 150

Query: 137 -EKFSSVLCPRSPGVPKKNTDSGIESWDWNLKGENCTYHALFPRAWTVYDGEPDPELRIV 195
            +K++SVL P       K+ D GI SW WNL G++ TYHALFPRAWT+YDGEPDPEL++ 
Sbjct: 151 NKKYASVLAPGQHEGLGKSGDQGISSWGWNLSGQHSTYHALFPRAWTIYDGEPDPELKVS 210

Query: 196 CRQISPFIPHNYKESSFPASVFTFTLSNSGQTSADVTLLFTWANSVAGDSGLSGHHFNSK 255
           CRQISPFIPHNY++SS P +VF +TL N+G+  A V+LLFTWANS+ G S LSG H N  
Sbjct: 211 CRQISPFIPHNYRDSSLPTAVFVYTLVNTGKERAKVSLLFTWANSIGGISHLSGDHVNEP 270

Query: 256 TMTKDGVHGLTLHHRTANGRPPVTFAVAAEETADVHVSECPCFLLSGNSKGITAKDMWNE 315
            + +DGV G+ LHH+TA   PPVTFA+AA ET +V V+  P F LS  S  ITAKDMW +
Sbjct: 271 FIGEDGVSGVLLHHKTAKENPPVTFAIAACETQNVSVTVLPSFGLSEGSH-ITAKDMWGK 329

Query: 316 IKKHGSFDHLD--NDKTSPSEPGSSIGAAIAASLTIPSGSTRSVTFSLAWDCPEVKFFE- 372
           + + G FD  +  + ++ PS PG ++ AA++AS  +      +V F+LAW  P+VKF + 
Sbjct: 330 MVQDGQFDRENCYSGRSMPSSPGETLCAAVSASAWVEPHGKCTVAFALAWSSPKVKFLKG 389

Query: 373 KVYHRRYTKFYGTLGDSAARIAHDAILEHAKWECEIEAWQRPILEDKRFPEWYPITLFNE 432
             YHRRYTK+YGT   +A  I HDA+  + +WE EIE WQ PIL D R PEWY  TLFNE
Sbjct: 390 SSYHRRYTKYYGTSERAALNIVHDALTNYKQWEEEIEKWQSPILRDDRLPEWYKFTLFNE 449

Query: 433 LYYLNAGGTIWTDGSPPMQSLATIRERKFSLDTPSKIHTPASSDT--------------- 477
           LY+L AGGT+W D S P  S      +  +++  +   T A  ++               
Sbjct: 450 LYFLVAGGTVWIDSSLPATSSKNSLHQSAAVENTNVNVTVAKGNSRRGAAVENSVTDGYD 509

Query: 478 ALGTRLLENGEENI-------------------------------------GQFLYLEGA 500
           A+  + LE  EE I                                     G+FLYLEG 
Sbjct: 510 AISRKGLEYDEEEIHTRNTCEEKPVIPQESNSHHSIHKDTLKDPQDETDDVGRFLYLEGV 569

Query: 501 EYVMYNTYDVHFYSSFALVMLFPKLELSIQRDFAAAVMMHDPGTMKIMSDGKWVARKCLG 560
           EY+M+ TYDVHFY+SFAL+ LFPK+ELSIQR+FA AV+  D   +K +++G W  RK  G
Sbjct: 570 EYIMWCTYDVHFYASFALLELFPKIELSIQREFAKAVLSEDGRRVKFLAEGNWGIRKVRG 629

Query: 561 AVPHDIGLDDPWFEINSYNLFNSSRWKDLNSKFVLQVYRDFVATGDKNFARAVWPSVYIA 620
           AVPHD+G  DPW E+N+YN+ ++S+WKDLN KFVLQVYRDF AT D +F   VWP+V  A
Sbjct: 630 AVPHDLGTHDPWHEMNAYNIHDTSQWKDLNPKFVLQVYRDFAATRDFSFGADVWPAVRAA 689

Query: 621 MAYMEQFDKDGDGMIENEGFPDQTYDAWSANGVSAYCGGLWVAALQAASALANDVGDHAS 680
           M YMEQFD+D DG+IEN+GFPDQTYD W+ +G+SAYCG LW+AALQAA+A+A  +GD   
Sbjct: 690 MEYMEQFDRDSDGLIENDGFPDQTYDTWTVHGISAYCGCLWLAALQAAAAMALQLGDKPF 749

Query: 681 ASYFWVRYQKAKAVY-DSLWNGSYFNYDNSDGSSSTSIQADQLAGQWYARACGLLPIADE 739
           A     ++ KAK V+ + LWNGSYFNYD+   S+S SIQADQLAGQWY  + GL  + D+
Sbjct: 750 AEKCKSKFFKAKLVFEEKLWNGSYFNYDSGSSSNSKSIQADQLAGQWYTASSGLPSLFDD 809

Query: 740 AKVKKALTKIYDFNVLKVKGGMCGAMNGMQPDGRIDMSGLQAREIWPGVTYGLAASMIQE 799
            K+K +L KIYDFNV+KVKGG  GA+NGM P+G++D S +Q+REIW GVTYG+AA+MI  
Sbjct: 810 YKIKSSLHKIYDFNVMKVKGGKMGAVNGMHPNGKVDESCMQSREIWTGVTYGVAATMILS 869

Query: 800 EMVDMAFQTAAGVYEVAWSEDGLGYSFQTPESWNNNDEYRSLCYMRPLTIWAMQWALTKP 859
            M + AF TA G++   WSE+G GY FQTPE W  +  +RSL YMRPL IW MQWAL+ P
Sbjct: 870 GMEEQAFTTAEGIFTAGWSEEGYGYWFQTPEGWTIDGHFRSLIYMRPLAIWGMQWALSMP 929

Query: 860 K 860
           +
Sbjct: 930 R 930


>gi|356535470|ref|XP_003536268.1| PREDICTED: non-lysosomal glucosylceramidase-like [Glycine max]
          Length = 949

 Score =  871 bits (2251), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 448/885 (50%), Positives = 609/885 (68%), Gaps = 32/885 (3%)

Query: 19  DPGQPPQLTWQRKLDSNVKVPTGFTLSFQEFRHLAHIGFRLYRYSKQEEAKGKVPVFNVF 78
           D   PP+  W+R+L+S+  +   F ++F E   +  +G R++ Y ++E + G+    + F
Sbjct: 31  DSSAPPEQAWRRRLNSHANLLKEFRVTFTEAIKMVRLGIRIWSYVREEASHGRKAPIDPF 90

Query: 79  RKHHI--TSDQGIPLGGIGAGSIGRSYRGEFQRFKLFHGICDDAPVLANQFSVFVSRPNG 136
            +     ++ QG+PLGG+G+GSI R +RGEF+++++   +C+ +PV++NQFS+F+SR  G
Sbjct: 91  TRESCKPSASQGVPLGGMGSGSISRGFRGEFRQWQIIPSLCEASPVMSNQFSIFISREGG 150

Query: 137 EK-FSSVLCPRSPGVPKKNTDSGIESWDWNLKGENCTYHALFPRAWTVYDGEPDPELRIV 195
           +K F+SVL P       K  D GI SW WNL G++ TYHALFPRAWTVYDGEPDPEL+I 
Sbjct: 151 KKKFASVLAPGQHEGLGKPDDQGISSWGWNLSGQHSTYHALFPRAWTVYDGEPDPELKIS 210

Query: 196 CRQISPFIPHNYKESSFPASVFTFTLSNSGQTSADVTLLFTWANSVAGDSGLSGHHFNSK 255
           CRQISPFIPHNY+ESS PA+VF +TL N+G+  A V+LLFTWANS+ G S  SG H N  
Sbjct: 211 CRQISPFIPHNYRESSLPAAVFVYTLVNTGKERAKVSLLFTWANSIGGSSHSSGDHVNEP 270

Query: 256 TMTKDGVHGLTLHHRTANGRPPVTFAVAAEETADVHVSECPCFLLSGNSKGITAKDMWNE 315
              +DGV G+ L+H+TA G PPVTFA+AA ET +V+VS  P F LS  S  +TAK MW++
Sbjct: 271 FKAEDGVSGVLLYHKTAKGNPPVTFAIAACETQNVNVSVLPSFGLSEESS-MTAKHMWSK 329

Query: 316 IKKHGSFD--HLDNDKTSPSEPGSSIGAAIAASLTIPSGSTRSVTFSLAWDCPEVKFFE- 372
           + K G FD  + ++  + PS PG ++ AA+AAS  +      +V FSLAW  P+VKF + 
Sbjct: 330 MVKDGQFDQENFNSGPSMPSSPGETLCAAVAASTWVEPHGKCTVAFSLAWSSPKVKFVKG 389

Query: 373 KVYHRRYTKFYGTLGDSAARIAHDAILEHAKWECEIEAWQRPILEDKRFPEWYPITLFNE 432
             ++RRYTKFYGT   +AA +AHDA+  + +WE EIE WQ P+L+D+  PEWY  TLFNE
Sbjct: 390 STFNRRYTKFYGTSEKAAADLAHDALTHYNRWEEEIEKWQNPVLKDEALPEWYKFTLFNE 449

Query: 433 LYYLNAGGTIWTDGSPPMQS-------------LATIRERKFSLDTPSKIHTPASSDTAL 479
           LY+L AGGTIW D SP + S              A ++E +  +    +    +++D+  
Sbjct: 450 LYFLVAGGTIWID-SPVLSSNMRNDQDRVRELESAVVKETEDKMSDRKRTVVESTTDSTY 508

Query: 480 GTRL----------LENGEENIGQFLYLEGAEYVMYNTYDVHFYSSFALVMLFPKLELSI 529
            + +          L   ++++G+FLYLEG EY+M+ TYDVHFY+SFAL+ LFP++EL+I
Sbjct: 509 DSAVITGHDRADEKLYEDDDDVGRFLYLEGVEYIMWCTYDVHFYASFALLELFPRIELNI 568

Query: 530 QRDFAAAVMMHDPGTMKIMSDGKWVARKCLGAVPHDIGLDDPWFEINSYNLFNSSRWKDL 589
           QRDFA AV+  D   +K +++G W  RK  GAVPHD+G  DPW E+N+YN+ ++S+WKDL
Sbjct: 569 QRDFARAVLCEDGRKVKFLAEGNWGIRKVYGAVPHDLGTHDPWHEMNAYNIHDTSKWKDL 628

Query: 590 NSKFVLQVYRDFVATGDKNFARAVWPSVYIAMAYMEQFDKDGDGMIENEGFPDQTYDAWS 649
           N KFVLQVYRDF  TGD  F   VWP+V  AM YMEQFD+DGDG+IEN+GFPDQTYD W+
Sbjct: 629 NPKFVLQVYRDFATTGDLQFGVDVWPAVRAAMEYMEQFDRDGDGLIENDGFPDQTYDTWT 688

Query: 650 ANGVSAYCGGLWVAALQAASALANDVGDHASASYFWVRYQKAKAVY-DSLWNGSYFNYDN 708
            +GVS YCG LW+AALQAA+ +A ++GD   A     ++ KAK  + + LWNG+YFNYD+
Sbjct: 689 VHGVSTYCGCLWLAALQAAAVMALELGDREFAETCKRKFLKAKPAFEEKLWNGTYFNYDS 748

Query: 709 SDGSSSTSIQADQLAGQWYARACGLLPIADEAKVKKALTKIYDFNVLKVKGGMCGAMNGM 768
               +S SIQADQLAGQWY  + GL  + +++K+K AL K+YDFNV+KVKGG  GA+NGM
Sbjct: 749 GSSGNSKSIQADQLAGQWYTASSGLPSLFEDSKIKSALRKVYDFNVMKVKGGRMGAVNGM 808

Query: 769 QPDGRIDMSGLQAREIWPGVTYGLAASMIQEEMVDMAFQTAAGVYEVAWSEDGLGYSFQT 828
            P+G++D + +Q+RE+W GVTYGLAA+MI   M + AF TA G++   WSEDG GY FQT
Sbjct: 809 HPNGKVDETCMQSREVWTGVTYGLAATMIHAGMEEEAFTTAEGIFLAGWSEDGYGYWFQT 868

Query: 829 PESWNNNDEYRSLCYMRPLTIWAMQWALTKPKLSRQEIKHEISDR 873
           PE+W  +  YRSL YMRPL IW MQ+A+ +PK   +  K  I DR
Sbjct: 869 PEAWTMDGHYRSLMYMRPLAIWGMQYAINRPKAILEAPKINIMDR 913


>gi|224072655|ref|XP_002303825.1| predicted protein [Populus trichocarpa]
 gi|222841257|gb|EEE78804.1| predicted protein [Populus trichocarpa]
          Length = 966

 Score =  860 bits (2221), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 445/934 (47%), Positives = 601/934 (64%), Gaps = 67/934 (7%)

Query: 10  ENAASMPKVDPGQPPQLTWQRKLDSNVKVPTGFTLSFQEFRHLAHIGFRLYRYSKQEEAK 69
            N   +   D   PP+  W+R+L+S+  +   F+++F+E   +  +G RL+ Y ++E + 
Sbjct: 22  RNTLQLFDFDSAAPPEQAWRRRLNSHANILKEFSVTFKEAIQMVRLGIRLWSYVREEASH 81

Query: 70  GKVPVFNVFRKHHI--TSDQGIPLGGIGAGSIGRSYRGEFQRFKLFHGICDDAPVLANQF 127
           G+    + F +     ++ QG+PLGG+G+GSI R +RGEF+++++  GIC+ +P+     
Sbjct: 82  GRKAPIDPFTRESCKPSASQGVPLGGMGSGSISRGFRGEFRQWQIVPGICESSPI----- 136

Query: 128 SVFVSRPNGEK-FSSVLCPRSPGVPKKNTDSGIESWDWNLKGENCTYHALFPRAWTVYDG 186
             F+SR  G K ++SVL P       K  D GI SW WNL G++ TYHALFPRAWTVYDG
Sbjct: 137 --FISRDGGNKNYASVLAPGQHEGIGKAGDQGISSWGWNLSGQHSTYHALFPRAWTVYDG 194

Query: 187 EPDPELRIVCRQISPFIPHNYKESSFPASVFTFTLSNSGQTSADVTLLFTWANSVAGDSG 246
           EPDPEL+I CRQISPFIPHNY++SS P +VF +TL N+G+  A V+LLFTWANS+ G S 
Sbjct: 195 EPDPELKISCRQISPFIPHNYRDSSLPTAVFVYTLVNTGKERAKVSLLFTWANSIGGISH 254

Query: 247 LSGHHFNSKTMTKDGVHGLTLHHRTANGRPPVTFAVAAEETADVHVSECPCFLLSGNSKG 306
           LSG H N   + +DGV G+ LHH+  N  PPVTFA+AA ET +V V+  P F LS  S  
Sbjct: 255 LSGDHVNEPFIGEDGVSGVLLHHKQGN--PPVTFAIAACETQNVSVTVLPSFGLSEGS-C 311

Query: 307 ITAKDMWNEIKKHGSFD--HLDNDKTSPSEPGSSIGAAIAASLTIPSGSTRSVTFSLAWD 364
            TAK MW  + + G FD  + +   + PS PG ++ AA++AS  +      +V F+LAW 
Sbjct: 312 TTAKAMWGTMVQDGHFDRGNFNWGPSMPSSPGETLCAAVSASAWVEPHGKCTVAFALAWS 371

Query: 365 CPEVKFFE-KVYHRRYTKFYGTLGDSAARIAHDAILEHAKWECEIEAWQRPILEDKRFPE 423
            P++KF +   YHRRYTKFYGT   +A  + HDA+  + +WE EIE WQ PIL+D++ PE
Sbjct: 372 SPKIKFLKGSSYHRRYTKFYGTSERAAQNLVHDALTNYKQWEEEIEKWQDPILKDEKLPE 431

Query: 424 WYPITLFNELYYLNAGGTIWTDGSPPMQSLATIRERKFSLDTPS-KIHTP---------- 472
           WY  TLFNELY+L AGGT+W D S           R   ++T   K+  P          
Sbjct: 432 WYKFTLFNELYFLVAGGTVWIDSSLSSADTRNGHHRSREVETTGIKVTEPQVNCNGGPDH 491

Query: 473 -------------ASSDTALGTRLLENGE------------------------ENIGQFL 495
                          ++ A  T+ +   E                        +++G+FL
Sbjct: 492 TTTNDHNTTSSEQKENNKAFHTKCICKDESAVSRERGNLDHTLDPFTFLDPLSDDVGRFL 551

Query: 496 YLEGAEYVMYNTYDVHFYSSFALVMLFPKLELSIQRDFAAAVMMHDPGTMKIMSDGKWVA 555
           YLEG EY+M+ TYDVHFY+SFAL+ LFPK+EL+IQRDFA AV+  D   ++ ++DG    
Sbjct: 552 YLEGVEYIMWCTYDVHFYASFALLALFPKIELNIQRDFAKAVLSEDGRKVRFLADGSVGI 611

Query: 556 RKCLGAVPHDIGLDDPWFEINSYNLFNSSRWKDLNSKFVLQVYRDFVATGDKNFARAVWP 615
           RK  GAVPHD+G  DPW E+N+YN+ ++S+WKDLN KFVLQVYRDF ATGD +F   VWP
Sbjct: 612 RKARGAVPHDLGTHDPWNEMNAYNIHDTSKWKDLNPKFVLQVYRDFAATGDMSFGVDVWP 671

Query: 616 SVYIAMAYMEQFDKDGDGMIENEGFPDQTYDAWSANGVSAYCGGLWVAALQAASALANDV 675
           +V  AM YMEQFD+D DG++EN+GFPDQTYDAW+ +GVSAYCG LW+A+LQAA+A+A  +
Sbjct: 672 AVRTAMEYMEQFDRDDDGLVENDGFPDQTYDAWTVHGVSAYCGCLWLASLQAAAAMAMQL 731

Query: 676 GDHASASYFWVRYQKAKAVYDS-LWNGSYFNYDNSDGSSSTSIQADQLAGQWYARACGLL 734
           GD   A     ++ KAK+ ++S LWNGSYFNYD+   ++S SIQADQLAG+WY  + GL 
Sbjct: 732 GDKYFAELCKSKFAKAKSAFESKLWNGSYFNYDSGSSNNSKSIQADQLAGEWYMASSGLP 791

Query: 735 PIADEAKVKKALTKIYDFNVLKVKGGMCGAMNGMQPDGRIDMSGLQAREIWPGVTYGLAA 794
            + D+ K++ AL KIYDFNV+KV+GG  GA+NGM P+G++D + +Q+REIW GVTY +AA
Sbjct: 792 SLFDDVKIRSALNKIYDFNVMKVRGGKMGAVNGMHPNGKVDETCMQSREIWSGVTYAVAA 851

Query: 795 SMIQEEMVDMAFQTAAGVYEVAWSEDGLGYSFQTPESWNNNDEYRSLCYMRPLTIWAMQW 854
           +MI   M D AF TA G++   WSE+G GY FQTPE+W  +  +RSL YMRPL IW MQW
Sbjct: 852 TMILSGMEDKAFTTAEGIFTAGWSEEGYGYWFQTPEAWTIDGHFRSLIYMRPLAIWGMQW 911

Query: 855 ALTKPKLSRQEIKHEISDRDSSYLEQHAAFSKVA 888
           AL+ PK      K  I +R  S L     FS + 
Sbjct: 912 ALSLPKAILDAPKINIMER--SLLSPSTRFSLIG 943


>gi|11994237|dbj|BAB01359.1| unnamed protein product [Arabidopsis thaliana]
          Length = 937

 Score =  857 bits (2213), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 438/889 (49%), Positives = 600/889 (67%), Gaps = 49/889 (5%)

Query: 19  DPGQPPQLTWQRKLDSNVKVPTGFTLSFQEFRHLAHIGFRLYRYSKQEEAKGKVPVFNVF 78
           D   PP   W+R+L+ +  +   FT++F+E   +  +G RL+ Y ++E + G+    + F
Sbjct: 31  DSAAPPPHAWRRRLNCHANILKEFTITFREAIKMVRLGIRLWSYVREEASHGRKAPIDPF 90

Query: 79  RKHHI--TSDQGIPLGGIGAGSIGRSYRGEFQRFKLFHGICDDAPVLANQFSVFVSRPNG 136
            K +   ++ QG+PLGG+G+GSI R +RGEF+++++  G CD +P+++NQFS+F+SR  G
Sbjct: 91  TKENCKPSASQGVPLGGMGSGSISRGFRGEFKQWQITPGTCDPSPMMSNQFSIFISRDGG 150

Query: 137 -EKFSSVLCPRSPGVPKKNTDSGIESWDWNLKGENCTYHALFPRAWTVYDGEPDPELRIV 195
            +K++SVL P   G   K+ D G+ SW WNL G++ TYHALFPRAWT+YDGEPDPEL+I 
Sbjct: 151 HKKYASVLSPGQHGSLGKSRDKGLSSWGWNLNGQHSTYHALFPRAWTIYDGEPDPELKIS 210

Query: 196 CRQISPFIPHNYKESSFPASVFTFTLSNSGQTSADVTLLFTWANSVAGDSGLSGHHFNSK 255
           CRQISPFIP+NY++SS PA+VF +TL N+G+  A V+LLFTWANS+ G S +SG H N  
Sbjct: 211 CRQISPFIPNNYRDSSLPAAVFVYTLVNTGKERAKVSLLFTWANSMGGTSHMSGGHVNEP 270

Query: 256 TMTKDGVHGLTLHHRTANGRPPVTFAVAAEETADVHVSECPCFLLSGNSKGITAKDMWNE 315
            MT               G PPVTFA+AA ET +V+V+  PCF LS +S   TAKDMW+ 
Sbjct: 271 FMT-------------GKGNPPVTFAIAASETQNVNVTVLPCFGLSEDSS-FTAKDMWDT 316

Query: 316 IKKHGSFD--HLDNDKTSPSEPGSSIGAAIAASLTIPSGSTRSVTFSLAWDCPEVKFFE- 372
           +++ G FD  + ++  ++PS  G +I AA++AS  + +    +V+F+L+W  P+VKF + 
Sbjct: 317 MEQDGKFDQENFNSGPSTPSLAGDTICAAVSASAWVEAHGKCTVSFALSWSSPKVKFSKG 376

Query: 373 KVYHRRYTKFYGTLGDSAARIAHDAILEHAKWECEIEAWQRPILEDKRFPEWYPITLFNE 432
             Y RRYTKFYGT   +A  + HDA+  + +WE +IEAWQ PIL D+R PEWY  TLFNE
Sbjct: 377 STYDRRYTKFYGTSPRAALDLVHDALTNYKRWEEDIEAWQNPILRDERLPEWYKFTLFNE 436

Query: 433 LYYLNAGGTIWTDGSP---------PMQSLAT---------IRERKFSLDTPSKIHTPAS 474
           LY+L AGGT+W D S              L           I +++  L   + +   ++
Sbjct: 437 LYFLVAGGTVWIDSSSLNANGNSQHQQSGLGNSDGKVGGLDINDQRNDLGNGNSVGVKSN 496

Query: 475 SDTA---------LGTRLLENGEENIGQFLYLEGAEYVMYNTYDVHFYSSFALVMLFPKL 525
            + +         + T  +++G++ +G+FLYLEG EYVM+ TYDVHFY+S+AL+MLFPK+
Sbjct: 497 DEVSAIHNRNGLFVDTPHVDDGDD-VGRFLYLEGVEYVMWCTYDVHFYASYALLMLFPKI 555

Query: 526 ELSIQRDFAAAVMMHDPGTMKIMSDGKWVARKCLGAVPHDIGLDDPWFEINSYNLFNSSR 585
           EL+IQRDFA AV+  D   +K +++G    RK  GAVPHD+G+ DPW E+N+YN+ ++SR
Sbjct: 556 ELNIQRDFAKAVLSEDGRKVKFLAEGNVGIRKVRGAVPHDLGMHDPWNEMNAYNIHDTSR 615

Query: 586 WKDLNSKFVLQVYRDFVATGDKNFARAVWPSVYIAMAYMEQFDKDGDGMIENEGFPDQTY 645
           WKDLN KFVLQVYRDF ATGD  F   VWP+V  AM YMEQFD+D D +IEN+GFPDQTY
Sbjct: 616 WKDLNPKFVLQVYRDFAATGDYQFGIDVWPAVRAAMEYMEQFDRDNDDLIENDGFPDQTY 675

Query: 646 DAWSANGVSAYCGGLWVAALQAASALANDVGDHASASYFWVRYQKAKAVYDS-LWNGSYF 704
           D W+ +GVSAYCG LW+AALQAA+A+A  +GD   A     ++  AKA  ++ LWNGSYF
Sbjct: 676 DTWTVHGVSAYCGCLWLAALQAAAAMALQIGDKFFAELCKNKFLNAKAALETKLWNGSYF 735

Query: 705 NYDNSDGSSSTSIQADQLAGQWYARACGLLPIADEAKVKKALTKIYDFNVLKVKGGMCGA 764
           NYD+   S+S SIQ DQLAGQWYA + GL P+ +E+K++  + KI+DFNV+K KGG  GA
Sbjct: 736 NYDSGSSSNSKSIQTDQLAGQWYAASSGLPPLFEESKIRSTMQKIFDFNVMKTKGGKMGA 795

Query: 765 MNGMQPDGRIDMSGLQAREIWPGVTYGLAASMIQEEMVDMAFQTAAGVYEVAWSEDGLGY 824
           +NGM PDG++D + +Q+REIW GVTY  AA+MI   M +  F TA G++   WSE+G GY
Sbjct: 796 VNGMHPDGKVDDTCMQSREIWTGVTYAAAATMILSGMEEQGFTTAEGIFTAGWSEEGFGY 855

Query: 825 SFQTPESWNNNDEYRSLCYMRPLTIWAMQWALTKPKLSRQEIKHEISDR 873
            FQTPE W  +  YRSL YMRPL IW MQWAL+ PK      +  + DR
Sbjct: 856 WFQTPEGWTMDGHYRSLIYMRPLAIWGMQWALSLPKAILDAPQINMMDR 904


>gi|222612993|gb|EEE51125.1| hypothetical protein OsJ_31868 [Oryza sativa Japonica Group]
          Length = 935

 Score =  847 bits (2187), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 452/920 (49%), Positives = 596/920 (64%), Gaps = 95/920 (10%)

Query: 9   GENAASMPKVDPGQPPQLTWQRKLDSNVKVPTGFTLSFQEFRHLAHIGFRLYRYSKQEEA 68
           G  A  +   D G PP+  W+R+L+S+  +   F+++F E   +                
Sbjct: 21  GRTALQLLDFDGGSPPEQAWRRRLNSHANLLKEFSVTFMEAMRM---------------- 64

Query: 69  KGKVPVFNVFRKHHI--TSDQGIPLGGIGAGSIGRSYRGEFQRFKLFHGICDDAPVLANQ 126
             K P+ + F K     ++ QG+PLGG+G+GSI R +RGEF+ + +  G+C+ +P+    
Sbjct: 65  --KAPI-DPFTKEKCKPSASQGVPLGGMGSGSISRGFRGEFKNWHIIPGLCETSPI---- 117

Query: 127 FSVFVSRPNG-EKFSSVLCPRSPGVPKKNTDSGIESWDWNLKGENCTYHALFPRAWTVYD 185
              FVSR  G +K+SSVL P      KK  DSGI SWDWNL G++ TYHALFPRAWTVYD
Sbjct: 118 ---FVSRDGGNKKYSSVLSPGHHEGLKKCNDSGISSWDWNLSGQHSTYHALFPRAWTVYD 174

Query: 186 GEPDPELRIVCRQISPFIPHNYKESSFPASVFTFTLSNSGQTSADVTLLFTWANSVAGDS 245
           GEPDP+L+I CRQISPFIPH+YK+SS P SVF +TL N+G+  A V+LL TWANS+ G S
Sbjct: 175 GEPDPDLKISCRQISPFIPHDYKDSSLPTSVFVYTLVNTGKDRAKVSLLMTWANSIGGFS 234

Query: 246 GLSGHHFNSKTMTKDGVHGLTLHHRTANGRPPVTFAVAAEETADVHVSECPCFLLSGNSK 305
             SG HFN   +T             A   PPVTFA+AA ET +V+V+  P F LSG   
Sbjct: 235 HHSGGHFNEPFIT-------------AKDNPPVTFAIAACETQNVNVTVLPVFGLSGEGH 281

Query: 306 GITAKDMWNEIKKHGSFD--HLDNDKTSPSEPGSSIGAAIAASLTIPSGSTRSVTFSLAW 363
             +AK MW+ +K++G FD  + +   + PS  G ++ AA++AS  +      +V F LAW
Sbjct: 282 D-SAKQMWDRMKQNGHFDRENFEAGTSMPSSSGETLCAAVSASTWVEPHGRCTVVFGLAW 340

Query: 364 DCPEVKFFEK-VYHRRYTKFYGTLGDSAARIAHDAILEHAKWECEIEAWQRPILEDKRFP 422
             P++KF +   Y+RRYT+FYGT   SA  + HDA+ ++  WE EIE WQ PIL++++ P
Sbjct: 341 SSPKIKFQKGCTYNRRYTEFYGTSERSAVNLVHDALTKYRIWEEEIEKWQNPILKNEKLP 400

Query: 423 EWYPITLFNELYYLNAGGTIWTDGSPPM-------------QSLATIRERKF-------- 461
           EWY  TLFNELY+L AGGT+WTDG PP+              S    R+ K         
Sbjct: 401 EWYKFTLFNELYFLVAGGTVWTDGQPPVIDEKPSPGSNQQKSSKRGTRDTKQESVKDNHV 460

Query: 462 -------------------SLDTPSKIHTP--ASSDTALGTR------LLENGEENIGQF 494
                              S+   + +H P  A +   LG++      L +NG EN+G+F
Sbjct: 461 KLTAEQVTNGGDLANGEEQSVSKYAAVHGPQMAKATNGLGSQEPIPYLLSKNGPENVGKF 520

Query: 495 LYLEGAEYVMYNTYDVHFYSSFALVMLFPKLELSIQRDFAAAVMMHDPGTMKIMSDGKWV 554
           LYLEG EY+M+ TYDVHFY+SFAL+ LFPK+ELSIQRDFA AV+  D   MK ++DG   
Sbjct: 521 LYLEGVEYIMWCTYDVHFYASFALLDLFPKIELSIQRDFANAVLYEDRRRMKFLADGTSG 580

Query: 555 ARKCLGAVPHDIGLDDPWFEINSYNLFNSSRWKDLNSKFVLQVYRDFVATGDKNFARAVW 614
            RK  GAVPHD+G  DPW E+N+YN+ ++S+WKDLN KFVLQVYRDF ATGD +F R VW
Sbjct: 581 IRKVKGAVPHDLGTHDPWHEMNAYNIHDTSKWKDLNPKFVLQVYRDFAATGDMSFGRDVW 640

Query: 615 PSVYIAMAYMEQFDKDGDGMIENEGFPDQTYDAWSANGVSAYCGGLWVAALQAASALAND 674
           P+V   M YM QFD+DGDG+IEN+GFPDQTYDAW+ +G+SAYCGGLW+AALQAA+ +A+ 
Sbjct: 641 PAVCAVMDYMNQFDRDGDGLIENDGFPDQTYDAWTVHGISAYCGGLWLAALQAAATMAHR 700

Query: 675 VGDHASASYFWVRYQKAKAVYDS-LWNGSYFNYDNSDGSSSTSIQADQLAGQWYARACGL 733
           +GD   A  + +++ +AKAVY++ LWNGSYFNYD+   S+S SIQADQLAGQWYA + GL
Sbjct: 701 LGDRPFAEKYKLKFIQAKAVYEAKLWNGSYFNYDSGTSSNSRSIQADQLAGQWYAASSGL 760

Query: 734 LPIADEAKVKKALTKIYDFNVLKVKGGMCGAMNGMQPDGRIDMSGLQAREIWPGVTYGLA 793
            P+ DE K++ AL KI++FNV+KVKGG  GA+NGM P+G++D + +Q+REIW GVTYG+A
Sbjct: 761 PPLFDENKIRSALQKIFEFNVMKVKGGRLGAVNGMTPNGKVDETCMQSREIWTGVTYGVA 820

Query: 794 ASMIQEEMVDMAFQTAAGVYEVAWSEDGLGYSFQTPESWNNNDEYRSLCYMRPLTIWAMQ 853
           A+M+   M    F TA G++   WSE+G GY FQTPE W  +  YRSL YMRPL IWAMQ
Sbjct: 821 ANMLLHGMEHQGFTTAEGIFIAGWSEEGYGYWFQTPEGWTIDGHYRSLIYMRPLAIWAMQ 880

Query: 854 WALTKPKLSRQEIKHEISDR 873
           WA + PK      K  + DR
Sbjct: 881 WARSPPKAILDAPKVNLMDR 900


>gi|302762052|ref|XP_002964448.1| hypothetical protein SELMODRAFT_166634 [Selaginella moellendorffii]
 gi|300168177|gb|EFJ34781.1| hypothetical protein SELMODRAFT_166634 [Selaginella moellendorffii]
          Length = 916

 Score =  816 bits (2109), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 463/929 (49%), Positives = 613/929 (65%), Gaps = 58/929 (6%)

Query: 24  PQLTWQRKLDSNVKVPTGFTLSFQEFRHLAHIGFRLYRYSKQEEAKGKVPVFNVFRKHHI 83
           P L W+R+LD+ V   T F  + +E   +  +G RL+++ + E  +G+    + F     
Sbjct: 4   PALAWKRRLDTPVVELTEFKPTLREILEMLPMGVRLWKFCRSEHEQGRRAPIDPFYPQDT 63

Query: 84  --TSDQGIPLGGIGAGSIGRSYRGEFQRFKLFHGICDDAPVLANQFSVFVSRPNG----E 137
             +S QG+PLGGIG GSIGR +RG+F R++L   +C  + VLA+QFSVF++R +     +
Sbjct: 64  PASSCQGVPLGGIGGGSIGRGFRGDFSRWQLLPTVCQTSAVLADQFSVFITRHSKDGTTQ 123

Query: 138 KFSSVLCPRSPGVPKKNTDSGIESWDWNLKGENCTYHALFPRAWTVYDGEPDPELRIVCR 197
           K SSVL P  P    K T+SGI SW+WNL G   TY+ LFPRAWT YDGEPDP+++I CR
Sbjct: 124 KTSSVLFPGEPVFESKETESGISSWNWNLTGVKSTYYGLFPRAWTTYDGEPDPQMKITCR 183

Query: 198 QISPFIPHNYKESSFPASVFTFTLSNSGQTSADVTLLFTWANSVAGDSGLSGHHFNSKTM 257
           Q+SPF+PHNYKESSFP+ VF + L N+G   A  TLLFTWANS+AG S  SG H NS  M
Sbjct: 184 QVSPFVPHNYKESSFPSCVFIYELVNTGDEDATATLLFTWANSIAGGSETSGGHENSPFM 243

Query: 258 TKDGVHGLTLHHRTANGRPPVTFAVAAEETADVHVSECPCFLLSGNSKG---ITAKDMWN 314
            ++                 VTFA+AA ET DV  S CPCFL++G+ +G    TAKDMW+
Sbjct: 244 KEEH---------------SVTFAIAARETDDVSTSVCPCFLVAGSPEGNSYQTAKDMWS 288

Query: 315 EIKKHGSFDHLDNDKT-SPSEPGSSIGAAIAASLTIPSGSTRSVTFSLAWDCPEVKFFE- 372
           EI + G F+  ++  + S S+P + +GAA+AAS+ +P+  T+S+TF LAWD P+  F   
Sbjct: 289 EITEKGCFNAKNSTMSASLSQPETVVGAAVAASVVVPAHGTKSITFGLAWDSPKATFLGG 348

Query: 373 KVYHRRYTKFYGTLGDSAARIAHDAILEHAKWECEIEAWQRPILEDKRFPEWYPITLFNE 432
             Y+RRYTKFYG  G++A+++ HDAIL + +WE EI AWQ PIL D   PEWY  TLFNE
Sbjct: 349 SSYYRRYTKFYGKSGNAASKLVHDAILNYKRWESEIHAWQEPILNDHSLPEWYRYTLFNE 408

Query: 433 LYYLNAGGTIWTDGSPPMQSLATIRERKFSLDTP-----SKIHTPASSD---TALGTRLL 484
           LYYL AGGTIWT  +        + +  FS DT      ++   P  S+    AL   + 
Sbjct: 409 LYYLVAGGTIWTGMNMKRIYTTLLCDLSFS-DTCRQACLAQKLNPVKSNGVSEALLDAIN 467

Query: 485 ENGE-------------ENIGQFLYLEGAEYVMYNTYDVHFYSSFALVMLFPKLELSIQR 531
            N E             +N+GQFLYLEG EY  +NTYDVHFY+S+AL+ LFP+LEL +QR
Sbjct: 468 ANSEAQSFWSIGIPAVEDNVGQFLYLEGVEYHFWNTYDVHFYASYALLALFPRLELQLQR 527

Query: 532 DFAAAVMMHDPGTMKIMSDGKWVARKCLGAVPHDIGLDDPWFEINSYNLFNSSRWKDLNS 591
           DFAAA + HDP  M  ++ G    R   G+VPHD+G  DPW  +NSYNL ++S+WKDLN 
Sbjct: 528 DFAAATLSHDPEKMFYLAYGNTGIRAVFGSVPHDLGTHDPWNTLNSYNLHDTSKWKDLNC 587

Query: 592 KFVLQVYRDFVATGDKNFARAVWPSVYIAMAYMEQFDKDGDGMIENEGFPDQTYDAWSAN 651
           KFV+QV RD  ATGD  FARAVWP+ Y AMA  +QFD+DGDGMIEN+G PDQTYD W+ +
Sbjct: 588 KFVVQVNRDAAATGDLEFARAVWPAAYAAMAVTDQFDRDGDGMIENDGVPDQTYDLWTVD 647

Query: 652 GVSAYCGGLWVAALQAASALANDVGDHASASYFWVRYQKAKAVYD-SLWNGSYFNYDNSD 710
           GVS+YCGGLWVAALQAASALA+ VGD  S+  F  ++ +AKAVY+ SLWNGSYF+YD   
Sbjct: 648 GVSSYCGGLWVAALQAASALADLVGDKPSSRMFHGKFLRAKAVYEKSLWNGSYFHYDK-- 705

Query: 711 GSSSTSIQADQLAGQWYARACGLLPIADEAKVKKALTKIYDFNVLKVKGGMCGAMNGMQP 770
             +STS+Q+DQLAGQWY+ A GL  I  E +   AL  ++  NV++ KGG  GA+NGM P
Sbjct: 706 --NSTSVQSDQLAGQWYSLASGLPGIVREDQATSALGTVFATNVMRYKGGNEGAVNGMLP 763

Query: 771 DGRIDMSGLQAREIWPGVTYGLAASMIQEEMVDMAFQTAAGVYEVAWSEDGLGYSFQTPE 830
            G  D + LQ+RE+W G TY +AA+MIQ+ M +  F+TA GV+   WS+ G GY+FQTPE
Sbjct: 764 SGGEDRTSLQSREVWAGTTYAVAAAMIQQGMREEGFRTAKGVFLNGWSDQGHGYAFQTPE 823

Query: 831 SWNNNDEYRSLCYMRPLTIWAMQWALTKP-KLSRQEIKHEISDRDSSYLEQ-HAAFSKVA 888
           +W+N+ +YRSL YMRPL+IWAMQWAL  P +L ++    E+ D  +  +E  + +F+K+A
Sbjct: 824 AWDNDGKYRSLAYMRPLSIWAMQWALDPPEQLKQRSCSVEVGDITAQEVEACNKSFAKLA 883

Query: 889 SLLKLPKEEASKGFLKV---VYDFTIGRV 914
           S LK      +K +  V   ++D+   RV
Sbjct: 884 SQLKELSPPEAKAYSSVWIWMFDYIRSRV 912


>gi|302787158|ref|XP_002975349.1| hypothetical protein SELMODRAFT_174907 [Selaginella moellendorffii]
 gi|300156923|gb|EFJ23550.1| hypothetical protein SELMODRAFT_174907 [Selaginella moellendorffii]
          Length = 916

 Score =  816 bits (2108), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 460/928 (49%), Positives = 613/928 (66%), Gaps = 56/928 (6%)

Query: 24  PQLTWQRKLDSNVKVPTGFTLSFQEFRHLAHIGFRLYRYSKQEEAKGKVPVFNVFRKHHI 83
           P L W+R+LD+ V     F  +F+E   +  +G RL+++ + E  +G+    + F     
Sbjct: 4   PALAWKRRLDTPVMELKEFKPTFREILEMLPMGVRLWKFCRSEHEQGRRAPIDPFYPQDT 63

Query: 84  --TSDQGIPLGGIGAGSIGRSYRGEFQRFKLFHGICDDAPVLANQFSVFVSRPNG----E 137
             +S QG+PLGGIG GSIGR +RG+F R++L   +C  + VLA+QFSVF++R +     +
Sbjct: 64  PASSCQGVPLGGIGGGSIGRGFRGDFSRWQLLPTVCQTSAVLADQFSVFITRHSKDGTTQ 123

Query: 138 KFSSVLCPRSPGVPKKNTDSGIESWDWNLKGENCTYHALFPRAWTVYDGEPDPELRIVCR 197
           K SSVL P  P    + T+SGI SW+WNL G   TY+ LFPRAWT YDGEPDP+++I CR
Sbjct: 124 KTSSVLFPGEPVFESEETESGISSWNWNLTGVKSTYYGLFPRAWTTYDGEPDPQMKITCR 183

Query: 198 QISPFIPHNYKESSFPASVFTFTLSNSGQTSADVTLLFTWANSVAGDSGLSGHHFNSKTM 257
           Q+SPF+PHNYKESSFP+ VF + L N+G   A  TLLFTWANS+AG S  SG H NS  M
Sbjct: 184 QVSPFLPHNYKESSFPSCVFIYELVNTGDEDATATLLFTWANSIAGGSETSGGHENSPFM 243

Query: 258 TKDGVHGLTLHHRTANGRPPVTFAVAAEETADVHVSECPCFLLSGNSKG---ITAKDMWN 314
            ++                 VTFA+AA ET DV  S CPCFL++G  +G    TAKDMW+
Sbjct: 244 KEEH---------------SVTFAIAARETDDVSTSVCPCFLVAGAPEGNSYQTAKDMWS 288

Query: 315 EIKKHGSFDHLDNDKT-SPSEPGSSIGAAIAASLTIPSGSTRSVTFSLAWDCPEVKFFE- 372
           EI + G F+  ++  + S S+P + +GAA+AAS+ +P+  T+S+TF LAWD P+  F   
Sbjct: 289 EIIEKGCFNAKNSTMSASLSQPETVVGAAVAASVVVPAHGTKSITFGLAWDSPKATFLGG 348

Query: 373 KVYHRRYTKFYGTLGDSAARIAHDAILEHAKWECEIEAWQRPILEDKRFPEWYPITLFNE 432
             Y+RRYTKFYG  G++A+++ HDAIL + +WE EI AWQ PIL D   PEWY  TLFNE
Sbjct: 349 SSYYRRYTKFYGKSGNAASKLVHDAILNYKRWESEIHAWQEPILNDHSLPEWYRYTLFNE 408

Query: 433 LYYLNAGGTIWTDGSPPMQSLATIRERKFSLDTP----SKIHTPASSD---TALGTRLLE 485
           LYYL AGGTIWT  +        + +  FS        ++   P  S+    AL   +  
Sbjct: 409 LYYLVAGGTIWTGMNMKRIYTTLLCDLSFSDSCRQACLAQKLNPVKSNGVSEALLDAINA 468

Query: 486 NGE-------------ENIGQFLYLEGAEYVMYNTYDVHFYSSFALVMLFPKLELSIQRD 532
           N E             +N+GQFLYLEG EY  +NTYDVHFY+S+AL+ LFP+LEL +QRD
Sbjct: 469 NSEAQSFWSFDIPAVEDNVGQFLYLEGVEYHFWNTYDVHFYASYALLALFPRLELQLQRD 528

Query: 533 FAAAVMMHDPGTMKIMSDGKWVARKCLGAVPHDIGLDDPWFEINSYNLFNSSRWKDLNSK 592
           FAAA + HDP  M  ++ G    R   G+VPHD+G  DPW  +NSYNL+++S+WKDLN K
Sbjct: 529 FAAATLSHDPEKMFYLAYGNTGIRAVFGSVPHDLGTHDPWNTLNSYNLYDTSKWKDLNCK 588

Query: 593 FVLQVYRDFVATGDKNFARAVWPSVYIAMAYMEQFDKDGDGMIENEGFPDQTYDAWSANG 652
           FV+QV RD  ATGD +FARAVWP+ Y AMA+ +QFD+DGDGMIEN+G PDQTYD W+ +G
Sbjct: 589 FVVQVNRDAAATGDLDFARAVWPAAYAAMAFTDQFDRDGDGMIENDGVPDQTYDLWTVDG 648

Query: 653 VSAYCGGLWVAALQAASALANDVGDHASASYFWVRYQKAKAVYD-SLWNGSYFNYDNSDG 711
           VS+YCGGLWVAALQAASALA+ VGD  S+  F  ++ +AKAVY+ SLWNGSYF+YD    
Sbjct: 649 VSSYCGGLWVAALQAASALADLVGDKPSSRMFHDKFLRAKAVYEKSLWNGSYFHYDK--- 705

Query: 712 SSSTSIQADQLAGQWYARACGLLPIADEAKVKKALTKIYDFNVLKVKGGMCGAMNGMQPD 771
            +STS+Q+DQLAGQWY+ A GL  I  E +   AL  ++  NV++ KGG  GA+NGM P 
Sbjct: 706 -NSTSVQSDQLAGQWYSLASGLPGIVREDQATSALGTVFATNVMRYKGGNEGAVNGMLPS 764

Query: 772 GRIDMSGLQAREIWPGVTYGLAASMIQEEMVDMAFQTAAGVYEVAWSEDGLGYSFQTPES 831
           G  D + LQ+RE+W G TY +AA+MIQ+ M +  F+TA GV+   WS+ G GY+FQTPE+
Sbjct: 765 GGEDRTSLQSREVWVGTTYAVAAAMIQQGMREEGFRTAKGVFLNGWSDQGHGYAFQTPEA 824

Query: 832 WNNNDEYRSLCYMRPLTIWAMQWALTKP-KLSRQEIKHEISDRDSSYLEQ-HAAFSKVAS 889
           W+N+ +YRSL YMRPL+IWAMQWAL  P +L ++    E+ D  +  +E  + +F+K+AS
Sbjct: 825 WDNDGKYRSLAYMRPLSIWAMQWALDPPEQLKQRSCNVEVGDITAQEVEACNKSFAKLAS 884

Query: 890 LLKLPKEEASKGFLKV---VYDFTIGRV 914
            LK      +K +  V   ++D+   RV
Sbjct: 885 QLKELSPPETKAYSSVWIWMFDYIRSRV 912


>gi|218199525|gb|EEC81952.1| hypothetical protein OsI_25831 [Oryza sativa Indica Group]
          Length = 850

 Score =  785 bits (2027), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 403/742 (54%), Positives = 515/742 (69%), Gaps = 29/742 (3%)

Query: 147 SPGVP----KKNTDSGIESWDWNLKGENCTYHALFPRAWTVYDGEPDPELRIVCRQISPF 202
           S GVP     K+ D GI SWDW L+G+  TYHALFPRAWTVYDGEPDPEL++ CRQISPF
Sbjct: 92  SQGVPLGGMGKSNDDGISSWDWKLRGDRSTYHALFPRAWTVYDGEPDPELKVSCRQISPF 151

Query: 203 IPHNYKESSFPASVFTFTLSNSGQTSADVTLLFTWANSVAGDSGLSGHHFNSKTMTKDGV 262
           IPHNY+ESS P SVF +T+ N+G+  A V+L+ TWANS+ G S  SG H N   MT    
Sbjct: 152 IPHNYQESSLPTSVFVYTIVNTGKERAKVSLVMTWANSIGGLSHHSGGHVNEPFMT---- 207

Query: 263 HGLTLHHRTANGRPPVTFAVAAEETADVHVSECPCFLLSGNSKGITAKDMWNEIKKHGSF 322
                    AN  PPVTFAVAA E  +V+V+  P F LSG S  +TA++MW  + + G F
Sbjct: 208 ---------ANNNPPVTFAVAACENQNVNVTVLPAFGLSGESS-VTAREMWGTLSQDGCF 257

Query: 323 DHLDNDKTSPSEP---GSSIGAAIAASLTIPSGSTRSVTFSLAWDCPEVKFFE-KVYHRR 378
           D  DN    PS P   G ++ AA+ AS  +      +V FSLAW  P+VKF +   Y+RR
Sbjct: 258 DR-DNFSAGPSMPSSLGDTVCAAVCASTWVEPHGRCTVVFSLAWSSPKVKFKKGNAYYRR 316

Query: 379 YTKFYGTLGDSAARIAHDAILEHAKWECEIEAWQRPILEDKRFPEWYPITLFNELYYLNA 438
           YTKFYGT   SA  +  D+++++  WE EI+ WQRPIL D+RFPEWY +TLFNELY+L +
Sbjct: 317 YTKFYGTSPRSAINLVQDSLMKYKHWEEEIDKWQRPILHDERFPEWYKVTLFNELYFLVS 376

Query: 439 GGTIWTDGSPPMQSLATIRERKFSLDTPSKIHTPASSDTALG----TRLLENGEENIGQF 494
           GGT+W D +  M     +   + S D    +H  +S ++A+     +  + +G EN+G+F
Sbjct: 377 GGTVWIDSASLMADADEMMNSRLSEDNDLPLHH-SSRNSAVPLIGFSPHIIDGRENVGKF 435

Query: 495 LYLEGAEYVMYNTYDVHFYSSFALVMLFPKLELSIQRDFAAAVMMHDPGTMKIMSDGKWV 554
           LYLEG EY M+ TYDVHFY+SFAL+ LFPK+ELSIQRDFA AV+  D   ++ ++DG W 
Sbjct: 436 LYLEGIEYFMWCTYDVHFYASFALLELFPKIELSIQRDFATAVLREDKSRVRFLADGTWG 495

Query: 555 ARKCLGAVPHDIGLDDPWFEINSYNLFNSSRWKDLNSKFVLQVYRDFVATGDKNFARAVW 614
            RK +GAV HD+G  DPW E+N+YN+ ++SRWKDLN KFVLQVYRDFVATGD +F + VW
Sbjct: 496 TRKVIGAVAHDLGAHDPWHELNAYNIHDTSRWKDLNPKFVLQVYRDFVATGDMSFGKDVW 555

Query: 615 PSVYIAMAYMEQFDKDGDGMIENEGFPDQTYDAWSANGVSAYCGGLWVAALQAASALAND 674
           P+V  AM YMEQFD D DGMIEN+GFPDQTYDAW+  GVSAYCG LW+AALQAA+ALA  
Sbjct: 556 PAVCTAMEYMEQFDHDDDGMIENDGFPDQTYDAWTVRGVSAYCGCLWLAALQAAAALART 615

Query: 675 VGDHASASYFWVRYQKAKAVYDS-LWNGSYFNYDNSDGSSSTSIQADQLAGQWYARACGL 733
           +G H  A    +++ KAK V+++ LWNGSYFNYD+    +S SI ADQLAGQWY  + GL
Sbjct: 616 LGHHDYAERCMLKFAKAKPVFEAKLWNGSYFNYDSGTSYNSRSIHADQLAGQWYTASSGL 675

Query: 734 LPIADEAKVKKALTKIYDFNVLKVKGGMCGAMNGMQPDGRIDMSGLQAREIWPGVTYGLA 793
            P+ DE ++K  L KI+D+NV++VKGG  GA+NGM P+G++D + +Q+REIW GVTY LA
Sbjct: 676 PPLFDEGRIKCTLQKIFDYNVMRVKGGRMGAVNGMHPNGKVDETCMQSREIWTGVTYSLA 735

Query: 794 ASMIQEEMVDMAFQTAAGVYEVAWSEDGLGYSFQTPESWNNNDEYRSLCYMRPLTIWAMQ 853
           A+M+   M D AF TA G++   WSE+G GY FQTPE+W  +  YRSL YMRPL IWAMQ
Sbjct: 736 ATMLLHGMDDQAFTTAEGIFTAGWSEEGYGYWFQTPEAWTIDGHYRSLIYMRPLAIWAMQ 795

Query: 854 WALTKPKLSRQEIKHEISDRDS 875
           +AL+ P+   +  K    DR S
Sbjct: 796 YALSPPRTILEAPKVNTMDRTS 817



 Score = 57.4 bits (137), Expect = 4e-05,   Method: Compositional matrix adjust.
 Identities = 27/86 (31%), Positives = 47/86 (54%), Gaps = 2/86 (2%)

Query: 12  AASMPKVDPGQPPQLTWQRKLDSNVKVPTGFTLSFQEFRHLAHIGFRLYRYSKQEEAKGK 71
           A  +   D G PP+  W+RKL S+      F ++F+E   +  +G RL+ Y ++E + G+
Sbjct: 16  AGDLLDFDDGSPPEYAWRRKLSSHANRLKEFNVTFREAIKMMKLGLRLWSYIQEEASHGR 75

Query: 72  VPVFNVFRKH--HITSDQGIPLGGIG 95
               + F +     ++ QG+PLGG+G
Sbjct: 76  KAPIDPFTRESDKPSASQGVPLGGMG 101


>gi|4539009|emb|CAB39630.1| putative protein [Arabidopsis thaliana]
 gi|7267702|emb|CAB78129.1| putative protein [Arabidopsis thaliana]
          Length = 750

 Score =  772 bits (1993), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 405/729 (55%), Positives = 494/729 (67%), Gaps = 90/729 (12%)

Query: 213 PASVFTFTLSNSGQTSADVTLLFTWANSVAGDSGLSGHHFNSKTM-TKDGVHGLTLHH-- 269
           P     F++SN G+  A VTLLFTW NSV G SGL+G HFNS  +  K G+  ++     
Sbjct: 71  PILTNQFSMSNLGKEEATVTLLFTWENSVGGASGLTGEHFNSTIIHLKRGLVPVSGRFIA 130

Query: 270 ------------RTANGRPPVTFAVAAEETADVHVSECPCFLLSGNS-KGITAKDMWNEI 316
                       RT NG PPVT+A+AA+ET DVHVSECPCFL+SG+S K ITAK+MW+EI
Sbjct: 131 SIYCNCDEMKTCRTGNGHPPVTYAIAAQETEDVHVSECPCFLVSGHSPKEITAKEMWDEI 190

Query: 317 KKHGSFDHLDNDKTSPSEPGSSIGAAIAASLTIPSGSTRSVTFSLAWDCPEVKFFEKVYH 376
           KK+ SFD L+++  SPS PG+SIGAAIAA + +P G  R+VTFSL+WDCPEV+F EK YH
Sbjct: 191 KKNKSFDELNSEPGSPSRPGTSIGAAIAAKVKVPPGCDRTVTFSLSWDCPEVRFNEKTYH 250

Query: 377 RRYTKFYGTLGDSAARIAHDAILEHAKWECEIEAWQRPILEDKRFPEWYPITLFNELYYL 436
           R                                               Y +TLFNELYY 
Sbjct: 251 R-----------------------------------------------YRVTLFNELYYF 263

Query: 437 NAGGTIWTDGSPPMQSL-----ATIRERKFSLDTPSKI-------HTPASSDTALGTRLL 484
           N+GGTIWTDG PP +S+         +    +D   KI       ++P SS         
Sbjct: 264 NSGGTIWTDGLPPKESIERSKVTNTEQNDIVIDLFQKINAVCEQIYSPQSS--------- 314

Query: 485 ENGEENIGQFLYLEGAEYVMYNTYDVHFYSSFALVMLFPKLELSIQRDFAAAVMMHDPGT 544
            N EENIGQF+YLEG EY+MYNTYDVHFYSSFAL+ LFPKL LSIQRDFAA V++ DP  
Sbjct: 315 -NSEENIGQFIYLEGIEYLMYNTYDVHFYSSFALLSLFPKLALSIQRDFAATVLIQDPTK 373

Query: 545 MKIMSDGKWVARKCLGAVPHDIGLDDPWFEINSYNLFNSSRWKDLNSKFVLQVYRDFVAT 604
            KIMS G+WV RK LG+VPHDIGL+DPW E+N YN FN+ RWKDLN+KFVLQVYRD VAT
Sbjct: 374 KKIMSSGEWVTRKLLGSVPHDIGLNDPWLELNEYNFFNTDRWKDLNAKFVLQVYRDVVAT 433

Query: 605 GDKNFARAVWPSVYIAMAYMEQFDKDGDGMIENEGFPDQTYDAWSANGVSAYCGGLWVAA 664
            D++FA+AVWPSVY A+AY++QFDKD DGMIENEGFPDQTYDAWS  GVSAYCGGLWVAA
Sbjct: 434 NDQSFAKAVWPSVYTAVAYLDQFDKDEDGMIENEGFPDQTYDAWSVTGVSAYCGGLWVAA 493

Query: 665 LQAASALANDVGDHASASYFWVRYQKAKAVYDSLWNGSYFNYDNSDGSSSTSIQADQLAG 724
           LQAASA A+ VG++A A YF  +Y+KAK VY+ LWNGSYFNYD+S   SS+SI ADQLAG
Sbjct: 494 LQAASAFASIVGENAVAIYFNAKYEKAKIVYEKLWNGSYFNYDDSGSGSSSSILADQLAG 553

Query: 725 QWYARACGLLPIADEAKVKKALTKIYDFNVLKVKGGMCGAMNGMQPDGRIDMSGLQAREI 784
           QWYARACGL PI  E  +KKAL  IY+FNV+KVKGG  GA+NGM  +G++D + L ++E+
Sbjct: 554 QWYARACGLKPITKEEWIKKALETIYEFNVMKVKGGTRGAVNGMSTEGKVDTNSLVSKEV 613

Query: 785 WPGVTYGLAASMIQEEMVDMAFQTAAGVYEVAWSEDGLGYSFQTPESWNNNDEYRSLCYM 844
           W G TY +AA MIQE   +  FQTA+G+YE  WS+ GL  SFQTPE+WN NDEYRSLCYM
Sbjct: 614 WAGTTYSVAACMIQEGQREKGFQTASGIYEAIWSDRGLSCSFQTPEAWNMNDEYRSLCYM 673

Query: 845 RPLTIWAMQWALTKPKLSRQEIKHEIS----DRDSSYLEQHAAFSKVASLLKL-PKEEAS 899
           RPL IWA+QWALT+ +   +E +  ++    +  +  L QH  F  VA  +K+ P     
Sbjct: 674 RPLAIWAIQWALTRTQSFGEEKQKLVAGDEEEESNLLLRQHKGFKDVARFVKIVPTSNVH 733

Query: 900 KGFLKVVYD 908
           +  L+  Y+
Sbjct: 734 RSRLQHTYE 742



 Score = 80.9 bits (198), Expect = 3e-12,   Method: Compositional matrix adjust.
 Identities = 35/68 (51%), Positives = 48/68 (70%), Gaps = 12/68 (17%)

Query: 74  VFNVFRKHHITSDQGIPLGGIG------------AGSIGRSYRGEFQRFKLFHGICDDAP 121
           ++++F+K H+  D G+PLGGIG             GSIGRSY+GEFQ+FKLF  IC++AP
Sbjct: 12  MYDIFKKRHVRGDHGVPLGGIGNVVNFTQTRIFSGGSIGRSYKGEFQQFKLFPKICEEAP 71

Query: 122 VLANQFSV 129
           +L NQFS+
Sbjct: 72  ILTNQFSM 79


>gi|222636955|gb|EEE67087.1| hypothetical protein OsJ_24071 [Oryza sativa Japonica Group]
          Length = 838

 Score =  765 bits (1976), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 399/742 (53%), Positives = 507/742 (68%), Gaps = 41/742 (5%)

Query: 147 SPGVP----KKNTDSGIESWDWNLKGENCTYHALFPRAWTVYDGEPDPELRIVCRQISPF 202
           S GVP     K+ D GI SWDW L+G+  TYHALFPRAWTVYDGEPDPEL++ CRQISPF
Sbjct: 92  SQGVPLGGMGKSNDDGISSWDWKLRGDRSTYHALFPRAWTVYDGEPDPELKVSCRQISPF 151

Query: 203 IPHNYKESSFPASVFTFTLSNSGQTSADVTLLFTWANSVAGDSGLSGHHFNSKTMTKDGV 262
           IPHNY+ESS P SVF +T+ N+G+  A V+L+ TWANS+ G S  SG H N   MT    
Sbjct: 152 IPHNYQESSLPTSVFVYTIVNTGKERAKVSLVMTWANSIGGLSHHSGGHVNEPFMT---- 207

Query: 263 HGLTLHHRTANGRPPVTFAVAAEETADVHVSECPCFLLSGNSKGITAKDMWNEIKKHGSF 322
                    AN  PPVTFAVAA E  +V+V+  P F LSG S  +TA++MW  + + G F
Sbjct: 208 ---------ANNNPPVTFAVAACENQNVNVTVLPAFGLSGESS-VTAREMWGTLSQDGCF 257

Query: 323 DHLDNDKTSPSEP---GSSIGAAIAASLTIPSGSTRSVTFSLAWDCPEVKFFE-KVYHRR 378
           D  DN    PS P   G ++ AA+ AS  +      +V FSLAW  P+VKF +   Y+RR
Sbjct: 258 DR-DNFSAGPSMPSSLGDTVCAAVCASTWVEPHGRCTVVFSLAWSSPKVKFKKGNAYYRR 316

Query: 379 YTKFYGTLGDSAARIAHDAILEHAKWECEIEAWQRPILEDKRFPEWYPITLFNELYYLNA 438
           YTKFYGT    +  I++        WE EI+ WQRPIL D+RFPEWY +TLFNELY+L A
Sbjct: 317 YTKFYGT----SPSISN--------WEEEIDKWQRPILHDERFPEWYKVTLFNELYFLVA 364

Query: 439 GGTIWTDGSPPMQSLATIRERKFSLDTPSKIHTPASSDTALG----TRLLENGEENIGQF 494
           GGT+W D +  M     +   + S D    +H  +S ++A+     +  + +  EN+G+F
Sbjct: 365 GGTVWIDSASLMADADEMMNSRLSEDNDLPLHH-SSRNSAVPLIGFSPHIIDDRENVGKF 423

Query: 495 LYLEGAEYVMYNTYDVHFYSSFALVMLFPKLELSIQRDFAAAVMMHDPGTMKIMSDGKWV 554
           LYLEG EY M+ TYDVHFY+SFAL+ LFPK+ELSIQRDFA AV+  D   ++ ++DG W 
Sbjct: 424 LYLEGIEYFMWCTYDVHFYASFALLELFPKIELSIQRDFATAVLREDKSRVRFLADGTWG 483

Query: 555 ARKCLGAVPHDIGLDDPWFEINSYNLFNSSRWKDLNSKFVLQVYRDFVATGDKNFARAVW 614
            RK +GAV HD+G  DPW E+N+YN+ ++SRWKDLN KFVLQVYRDF ATGD +F + VW
Sbjct: 484 TRKVIGAVAHDLGAHDPWHELNAYNIHDTSRWKDLNPKFVLQVYRDFAATGDMSFGKDVW 543

Query: 615 PSVYIAMAYMEQFDKDGDGMIENEGFPDQTYDAWSANGVSAYCGGLWVAALQAASALAND 674
           P+V  AM YMEQFD D DGMIEN+GFPDQTYDAW+  GVSAYCG LW+AALQ A+ALA  
Sbjct: 544 PAVCTAMEYMEQFDHDDDGMIENDGFPDQTYDAWTVRGVSAYCGCLWLAALQTAAALART 603

Query: 675 VGDHASASYFWVRYQKAKAVYDS-LWNGSYFNYDNSDGSSSTSIQADQLAGQWYARACGL 733
           +G H  A    +++ KAK V+++ LWNGSYFNYD+    +S SI ADQLAGQWY  + GL
Sbjct: 604 LGHHDYAERCMLKFAKAKPVFEAKLWNGSYFNYDSGTSYNSRSIHADQLAGQWYTASSGL 663

Query: 734 LPIADEAKVKKALTKIYDFNVLKVKGGMCGAMNGMQPDGRIDMSGLQAREIWPGVTYGLA 793
            P+ DE ++K  L KI+D+NV++VKGG  GA+NGM P+G++D + +Q+REIW GVTY LA
Sbjct: 664 PPLFDEGRIKCTLQKIFDYNVMRVKGGRMGAVNGMHPNGKVDETCMQSREIWTGVTYSLA 723

Query: 794 ASMIQEEMVDMAFQTAAGVYEVAWSEDGLGYSFQTPESWNNNDEYRSLCYMRPLTIWAMQ 853
           A+M+   M D AF TA G++   WSE+G GY FQTPE+W  +  YRSL YMRPL IWAMQ
Sbjct: 724 ATMLLHGMDDQAFTTAEGIFTAGWSEEGYGYWFQTPEAWTIDGHYRSLIYMRPLAIWAMQ 783

Query: 854 WALTKPKLSRQEIKHEISDRDS 875
           +AL+ P+   +  K    DR S
Sbjct: 784 YALSPPRTILEAPKVNTMDRTS 805



 Score = 57.8 bits (138), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 27/86 (31%), Positives = 47/86 (54%), Gaps = 2/86 (2%)

Query: 12  AASMPKVDPGQPPQLTWQRKLDSNVKVPTGFTLSFQEFRHLAHIGFRLYRYSKQEEAKGK 71
           A  +   D G PP+  W+RKL S+      F ++F+E   +  +G RL+ Y ++E + G+
Sbjct: 16  AGDLLDFDDGSPPEYAWRRKLSSHANRLKEFNVTFREAIKMMKLGLRLWSYIREEASHGR 75

Query: 72  VPVFNVFRKH--HITSDQGIPLGGIG 95
               + F +     ++ QG+PLGG+G
Sbjct: 76  KAPIDPFTRESDKPSASQGVPLGGMG 101


>gi|326507888|dbj|BAJ86687.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 676

 Score =  686 bits (1770), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 350/600 (58%), Positives = 437/600 (72%), Gaps = 30/600 (5%)

Query: 252 FNSKTMTKDGVHGLTLHHRTANGRPPVTFAVAAEETADVHVSECPCFLLSGNSKG---IT 308
           F      +DGVHG+ LHHRTA+G PPVTFA+A++ET DV V+ CP F +  ++ G     
Sbjct: 5   FLGACRARDGVHGVLLHHRTADGHPPVTFAIASQETDDVRVTCCPSFAMGPSAPGGEQFM 64

Query: 309 AKDMWNEIKKHGSFDHLDNDKTSPSEPGSSIGAAIAASLTIPSGSTRSVTFSLAWDCPEV 368
           AKDMW E K  GS   +    ++ S PGSSIGAA+AA+ T+P+G TR+V+F+L+W CPEV
Sbjct: 65  AKDMWEEAKNRGSVG-VAPGASASSRPGSSIGAAVAAATTVPAGGTRAVSFALSWSCPEV 123

Query: 369 KF-FEKVYHRRYTKFYGTLGDSAA-RIAHDAILEHAKWECEIEAWQRPILEDKRFPEWYP 426
           KF   + YHRRYTKF G   D+AA R+AHDA+LEH KWE  IE WQRP+L DKR P WYP
Sbjct: 124 KFPAGRTYHRRYTKFLGLDRDAAAERLAHDALLEHMKWESLIEEWQRPVLHDKRLPGWYP 183

Query: 427 ITLFNELYYLNAGGTIWTDGSPPMQS------LATIRERKFSLDT---PSKIHTP----- 472
           + LFNELYYLNAGGTIWTDG PP ++        +     FSLD     S  H P     
Sbjct: 184 VALFNELYYLNAGGTIWTDGLPPKKTSFASSKYGSTTMESFSLDGFHFHSGDHAPDGILR 243

Query: 473 --------ASSDTALGTRLLENGEENIGQFLYLEGAEYVMYNTYDVHFYSSFALVMLFPK 524
                     + +A GT LL +GEE++G+FLYLEG EY ++NTYDVHFY+SFAL+ LFP+
Sbjct: 244 AMATAEERTEASSAFGTALLGDGEESVGRFLYLEGMEYHLWNTYDVHFYASFALLSLFPE 303

Query: 525 LELSIQRDFAAAVMMHDPGTMKIMSDGKWVARKCLGAVPHDIGLDDPWFEINSYNLFNSS 584
           +EL +QRDFA AV+ HDP  M+ + DG  V RK LGAVPHD+GL DPWF++N+Y + + +
Sbjct: 304 IELGLQRDFARAVLHHDPRPMRTL-DGATVPRKVLGAVPHDVGLGDPWFQLNAYMIHDPA 362

Query: 585 RWKDLNSKFVLQVYRDFVATGDKNFARAVWPSVYIAMAYMEQFDKDGDGMIENEGFPDQT 644
           RWKDLN+KFVLQVYRD  ATGD  FA AVWP+VY+AMAYM+QFD+DGDGM+ENEG PDQT
Sbjct: 363 RWKDLNTKFVLQVYRDGAATGDAAFATAVWPAVYLAMAYMDQFDRDGDGMVENEGRPDQT 422

Query: 645 YDAWSANGVSAYCGGLWVAALQAASALANDVGDHASASYFWVRYQKAKAVYDS-LWNGSY 703
           YD WS +GVSAY GGLWVAALQAA+ +A  VGD  S  YF  RY+KA+ VYD  LWNG+Y
Sbjct: 423 YDFWSVSGVSAYTGGLWVAALQAAAVMARVVGDRGSEGYFVERYEKARRVYDGELWNGAY 482

Query: 704 FNYDNSDGSSSTSIQADQLAGQWYARACGLLPIADEAKVKKALTKIYDFNVLKVKGGMCG 763
           F+YDNS G++S SI ADQLAGQWYARACGL PI +E K + AL  + D+NV++V+GG  G
Sbjct: 483 FDYDNSGGTNSKSIMADQLAGQWYARACGLEPIVEEEKARSALGTVLDYNVMRVQGGAIG 542

Query: 764 AMNGMQPDGRIDMSGLQAREIWPGVTYGLAASMIQEEMVDMAFQTAAGVYEVAWSEDGLG 823
           A+NGM+PDG +D S +Q+RE+WPGVTYG+AA+M  E M + AF+TA G ++  W  DG G
Sbjct: 543 AVNGMRPDGTVDTSSIQSREVWPGVTYGVAAAMAHEGMPEAAFRTAKGAHDAGWGRDGFG 602


>gi|330844075|ref|XP_003293963.1| hypothetical protein DICPUDRAFT_158887 [Dictyostelium purpureum]
 gi|325075658|gb|EGC29519.1| hypothetical protein DICPUDRAFT_158887 [Dictyostelium purpureum]
          Length = 1205

 Score =  655 bits (1689), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 354/886 (39%), Positives = 517/886 (58%), Gaps = 80/886 (9%)

Query: 24   PQLTWQRKLDSNVKVPTGFTLSFQEFRHLAHIGFRLYRYSKQEEAKGKVPVFNVFRKHHI 83
            PQ +W R+LD N      F +S  +   L  IG+R+++Y K+E + G+VP+ + F     
Sbjct: 324  PQCSWARRLDHNFPDHKHFQVSMWQGLKLVGIGYRMWKYVKKENSSGRVPMMDPFNLPKP 383

Query: 84   TSDQGIPLGGIGAGSIGRSYRGEFQRFKLFHGICDDAPVLANQFSVFVS----------- 132
                G+P+GGIG+GSI R +RG+F R+ +  G+  +  V  ++FSV+++           
Sbjct: 384  GPIMGVPIGGIGSGSINRGWRGDFVRWNMNSGLVTNETVDVDKFSVYINFDDSNSASSST 443

Query: 133  --------RPNGEKF--SSVLCPRS-----PGVPKKNTDSGIESWDWNLKGENCTYHALF 177
                     PNG  F  SS    +      PG PK N +  ++ W+W LKGEN  Y  LF
Sbjct: 444  GSTPSGTFTPNGPNFDPSSTKSKQKATVLYPGKPKNNLN--LDVWNWGLKGENSCYFGLF 501

Query: 178  PRAWTVYDGEPDPELRIVCRQISPFIPHNYKESSFPASVFTFTLSNSGQ-TSADVTLLFT 236
            PRAWTVY+ EP P++++VC+QISP IPHNY+ESS+P  V+ + + N+ + +SADV+L+ T
Sbjct: 502  PRAWTVYE-EPHPDIKLVCKQISPVIPHNYQESSYPVGVYVWKIENNNKNSSADVSLMLT 560

Query: 237  WANSVAGDSGLSGHHFNSKTMTKD----GVHGLTLHHRTANG----------RPPVTFAV 282
            W NS+   S   G H+N      D     + G+TL HR                P+ +++
Sbjct: 561  WQNSIGTKSDQDGGHYNKYFTLGDEPNKKIKGVTLIHRKEQRCAYNSSNIVYHDPMEYSI 620

Query: 283  AAEETADVHVSECPCFLLSGNSKGITAKDMWNEIKKHGSFDHLDNDKTSPSEPGSSIGAA 342
            A ++  DV+++    F  +     + A ++W    K G  +++ + +  PS P   IGAA
Sbjct: 621  AVKDDPDVNITFNERFETTSR---LDAANLWYTFNKKGELENIQDSR--PSAPKKPIGAA 675

Query: 343  IAASLTIPSGSTRSVTFSLAWDCPEVKF-FEKVYHRRYTKFYGTLGDSAARIAHDAILEH 401
            +AA + +P+G ++++ F +AWD P  +F     Y++RYTKFYG  G ++ RIA+DA+  +
Sbjct: 676  VAAKVKVPAGGSKTIVFCIAWDQPITRFQLGSAYYKRYTKFYGNKGGNSQRIAYDALQNY 735

Query: 402  AKWECEIEAWQRPILEDKRFPEWYPITLFNELYYLNAGGTIWTDGSPPMQSLATIRERKF 461
              W+ +I  WQ PI++D   P +Y + LFNELYYL  GGT+WT GSP  Q  A     ++
Sbjct: 736  KSWDNQIAQWQNPIIQDAELPSFYKMALFNELYYLVDGGTVWTHGSPQDQLPA-----RY 790

Query: 462  SLDT---PSKIHTPASSDTALGTRLLENGEENIGQFLYLEGAEYVMYNTYDVHFYSSFAL 518
             L T   P  I  P                 +IG+F YLE  EY+MYNTYDVHFYSS+AL
Sbjct: 791  PLKTQLKPEDISDP----------------NHIGRFAYLESLEYLMYNTYDVHFYSSYAL 834

Query: 519  VMLFPKLELSIQRDFAAAVMMHDPGTMKIMSDGKWVARKCLGAVPHDIGL--DDPWFEIN 576
             ML+P LE+S+Q D A A M+    T + +  G+ + RK    VPHD+G   +DPW  +N
Sbjct: 835  AMLWPLLEISLQYDIAEATMLDYGITWEGIHSGQQIPRKKRCTVPHDLGNPGEDPWKRVN 894

Query: 577  SYNLFNSSRWKDLNSKFVLQVYRDFVATGDKNFARAVWPSVY-IAMAYMEQFDKDGDGMI 635
            SYN+ + SRWKDL SKFVLQVYRD++   DKNF   V+  V  +    +E FD D DG++
Sbjct: 895  SYNIQDISRWKDLPSKFVLQVYRDYLVVEDKNFLLQVYNVVEEVIQRTLESFDTDHDGVV 954

Query: 636  ENEGFPDQTYDAWSANGVSAYCGGLWVAALQAASALANDVGDHASASYFWVRYQKAKAVY 695
            +NEGFPDQTYD W A G SAY GGLW+A+L+ AS +A  +G     S +   ++K  A Y
Sbjct: 955  DNEGFPDQTYDVWPAVGCSAYSGGLWLASLKVASEMAKILGFKEDESIYNAIFEKGSASY 1014

Query: 696  -DSLWNGSYFNYDNSDGSSSTSIQADQLAGQWYARACGLLPIADEAKVKKALTKIYDFNV 754
               LWNG YF YD S+   + SI +D LAG WY  +CGL       +   +L+ I ++NV
Sbjct: 1015 TKKLWNGYYFKYDCSNSVHADSIMSDMLAGHWYLLSCGLPSYMTFDQALSSLSIINEYNV 1074

Query: 755  LKVKGGMCGAMNGMQPDGRIDMSGLQAREIWPGVTYGLAASMIQEEMVDMAFQTAAGVYE 814
                   CGA+NGM+P+G +D + LQ+ E+W G ++ LAA+MIQ  M   A++   G+  
Sbjct: 1075 NSYGKERCGAVNGMRPEGVVDNTCLQSSEVWIGTSFSLAATMIQHHMDKDAWELVKGIVN 1134

Query: 815  VAWSEDGLGYSFQTPESWNNNDEYRSLCYMRPLTIWAMQWALTKPK 860
             +++    G+ +QTPE+W++N  YR+  YMRPL+IW++QWAL K K
Sbjct: 1135 SSYNR--WGFQYQTPEAWDSNGCYRAGAYMRPLSIWSIQWALLKKK 1178


>gi|281212396|gb|EFA86556.1| hypothetical protein PPL_00357 [Polysphondylium pallidum PN500]
          Length = 4775

 Score =  649 bits (1675), Expect = 0.0,   Method: Composition-based stats.
 Identities = 347/853 (40%), Positives = 506/853 (59%), Gaps = 69/853 (8%)

Query: 28   WQRKLDSNVKVPTGFTLSFQEFRHLAHIGFRLYRYSKQEEAKGKVPVFNVFRKHHITSDQ 87
            W ++LD        F +S  +   L  +G+R+++Y K+E + G+VP+ + F         
Sbjct: 339  WTKRLDHQFPEHKAFQVSIGQGFKLMGLGYRMWKYVKRENSAGRVPIMDPFNIPKPGPVM 398

Query: 88   GIPLGGIGAGSIGRSYRGEFQRFKLFHGICDDAPVLANQFSVFV------SRPNGEKFSS 141
            G+P+GGIG+GSI R +RG+F R+ L +G+ +   V AN+FSV++      S+P   K ++
Sbjct: 399  GVPIGGIGSGSITRGWRGDFVRWNLKNGMVNSEVVDANKFSVYIKMEEQQSQPLSTKRAT 458

Query: 142  VLCPRSPGVPKKNTDSGIESWDWNLKGENCTYHALFPRAWTVYDGEPDPELRIVCRQISP 201
            VLCP   G PK N    ++ W+W+LKG+  +Y   FPRAWTVY+ EP  ++R+VC+Q+SP
Sbjct: 459  VLCP---GKPKSNF--ALDVWNWSLKGDRSSYFGQFPRAWTVYE-EPHQDVRLVCKQVSP 512

Query: 202  FIPHNYKESSFPASVFTFTLSNSGQTSADVTLLFTWANSVAGDSGLSGHHFNSKTMTKD- 260
             IP+NYKESS+P  VF + + N+  T A+V+L+FTW NS       +G H N +    D 
Sbjct: 513  VIPNNYKESSYPCGVFVWKIENTASTDAEVSLMFTWQNSDGTAVDQAGGHHNKRFKYTDE 572

Query: 261  ---GVHGLTLHHRTANG------------RPPVTFAVAAEETADVHVSECPCFLLSGNSK 305
                ++G+TL   T N             + P+  ++   + ADV  S    F     + 
Sbjct: 573  QGRSINGVTL---TTNRDLKSSVDPKKVYQDPLELSIGVRDDADVQFSFVSRF---ETTN 626

Query: 306  GITAKDMWNEIKKHGSFDHLDNDKTSPSEPGSSIGAAIAASLTIPSGSTRSVTFSLAWDC 365
             + A ++W    K G  D  +++ + PS P   IGAAIAA + + +G+TR V FS+AWD 
Sbjct: 627  RLEAANLWYSFNKSGVLD--NSEDSRPSAPKKPIGAAIAAKVLVKAGTTREVAFSVAWDT 684

Query: 366  PEVKFFEKV-YHRRYTKFYGTLGDSAARIAHDAILEHAKWECEIEAWQRPILEDKRFPEW 424
            P  +F     Y+RRYTKF+GT G+++ RIA DA+  + KWE EI  WQ PIL D   P +
Sbjct: 685  PVCRFNSGSGYYRRYTKFFGTAGNNSQRIACDAVYNYRKWEEEIVRWQHPILSDPSLPTF 744

Query: 425  YPITLFNELYYLNAGGTIWTDGSP---PMQSLATIRERKFSLDTPSKIHTPASSDTALGT 481
            Y   +FNELYY   GGT+WT G+P   P Q     + ++  L+ P+ I            
Sbjct: 745  YKQAIFNELYYFVDGGTVWTHGAPDDQPNQKKIVSKFQQEDLNDPNYI------------ 792

Query: 482  RLLENGEENIGQFLYLEGAEYVMYNTYDVHFYSSFALVMLFPKLELSIQRDFAAAVMMHD 541
                      G+F YLE  +Y+MYNTYDVHFY+SFAL ML+P+LELS+Q DFA A ++  
Sbjct: 793  ----------GRFAYLESVDYLMYNTYDVHFYASFALAMLWPRLELSLQTDFADATLLDY 842

Query: 542  PGT-MKIMSDGKWVARKCLGAVPHDIGL--DDPWFEINSYNLFNSSRWKDLNSKFVLQVY 598
              T ++ +  GK + RK  GAVPHD+G   +DPW  +N+Y++ + SRWKDL SKFVLQ+Y
Sbjct: 843  SDTQVECIQTGKNMPRKVRGAVPHDLGNPGEDPWKRVNAYHIQDVSRWKDLPSKFVLQIY 902

Query: 599  RDFVATG-DKNFARAVWPSVYIAMAYMEQFDKDGDGMIENEGFPDQTYDAWSANGVSAYC 657
            RD++  G D+ F   +W  V   +    ++D D DG+I+NEG PDQTYDAWSA G SAY 
Sbjct: 903  RDYLINGNDRTFLLQMWGVVEEVIRRAFEYDIDIDGVIDNEGVPDQTYDAWSALGCSAYS 962

Query: 658  GGLWVAALQAASALANDVGDHASASYFWVRYQKAKAVYDS-LWNGSYFNYDNSDGSSSTS 716
            GGLW+AA++ AS +A  +G     + +   ++K K  Y + LWNG YFNYD+S      S
Sbjct: 963  GGLWLAAVKVASEMARILGLKEDETVYKKIFEKGKKSYSTKLWNGHYFNYDSSKNPHFDS 1022

Query: 717  IQADQLAGQWYARACGLLPIADEAKVKKALTKIYDFNVLKVKGGMCGAMNGMQPDGRIDM 776
            I +DQLAG WY  ACGL       +   +L+ I ++NV     G CGA+NGM+P+G  D 
Sbjct: 1023 IMSDQLAGHWYLLACGLPSYITLDQALSSLSIINEYNVKSYSNGSCGAVNGMRPEGGPDT 1082

Query: 777  SGLQAREIWPGVTYGLAASMIQEEMVDMAFQTAAGVYEVAWSEDGLGYSFQTPESWNNND 836
            + LQ+ E+W G +YGLA++M+   M + A++   G+ +  +++   G+ +QTPE+W+ N 
Sbjct: 1083 TSLQSCEVWIGTSYGLASTMLLHFMDNEAWELIKGLVDSTYNK--WGFQYQTPEAWDQNG 1140

Query: 837  EYRSLCYMRPLTI 849
             YR+  YMRPL I
Sbjct: 1141 LYRAGTYMRPLAI 1153


>gi|147812668|emb|CAN61857.1| hypothetical protein VITISV_016690 [Vitis vinifera]
          Length = 521

 Score =  639 bits (1649), Expect = e-180,   Method: Compositional matrix adjust.
 Identities = 320/535 (59%), Positives = 383/535 (71%), Gaps = 63/535 (11%)

Query: 425 YPITLFNELYYLNAGGTIWTDGSPPMQSLATIRERKFSLDTP------------------ 466
           Y ITLFNELY+LNAGGTIWTDG PPMQSLATI + KFSLD                    
Sbjct: 11  YRITLFNELYFLNAGGTIWTDGLPPMQSLATIEQIKFSLDRSISDPKNTTDIVHQNDSTV 70

Query: 467 ----------SKIHTPASSDTALGTRLLENGEENIGQFLYLEGAEYVMYNTYDVHFYSSF 516
                      +IH P +S++A GT LL++GEEN+GQFLYLEG EY M+NTYDVHFYSSF
Sbjct: 71  EILGRMTSMLEQIHNPTTSNSAFGTYLLQSGEENVGQFLYLEGIEYHMWNTYDVHFYSSF 130

Query: 517 ALVMLFPKLELSIQRDFAAAVMMHDPGTMKIMSDGKWVARKCLGAVPHDIGLDDPWFEIN 576
           A++MLFP+LELSIQRDFAAAVM+HDP  MKIMSDGKWV RK LGAVPHDIG+ DPWFE+N
Sbjct: 131 AIIMLFPQLELSIQRDFAAAVMVHDPSRMKIMSDGKWVPRKVLGAVPHDIGISDPWFELN 190

Query: 577 SYNLFNSSRWKDLNSKFVLQVYRDFVATGDKNFARAVWPSVYIAMAYMEQFDKDGDGMIE 636
           +YNL+++ RWKDLNSKFVLQVYRD VATGDKNFARAVWP+VYIA+A+++QFDKDGDGMIE
Sbjct: 191 AYNLYDTDRWKDLNSKFVLQVYRDMVATGDKNFARAVWPAVYIAIAFLDQFDKDGDGMIE 250

Query: 637 NEGFPDQTYDAWSANGVSAYCGGLWVAALQAASALANDVGDHASASYFWVRYQKAKAVYD 696
           N+G P                       LQAASA+A +VGD  +A YFW ++QKAKAVYD
Sbjct: 251 NDGLP-----------------------LQAASAMAREVGDSMTADYFWFKFQKAKAVYD 287

Query: 697 S------LWNGSYFNYDNSDGS-SSTSIQAD-QLAGQW---YARACGLLPIADEAKVKKA 745
                  +  G         G+ S T I    Q    W   YARACGL PI D+ K + A
Sbjct: 288 KDQLAAVIGEGRLLXLFRFLGNPSHTGIDGLFQDHRDWTTRYARACGLQPIVDDEKARSA 347

Query: 746 LTKIYDFNVLKVKGGMCGAMNGMQPDGRIDMSGLQAREIWPGVTYGLAASMIQEEMVDMA 805
           L K+Y+FNVLKVK G CGA+NGM PDGR+DMS +Q+REIW GVTY +AA+MI E MV+ A
Sbjct: 348 LEKVYNFNVLKVKEGKCGAVNGMLPDGRVDMSAMQSREIWAGVTYSVAANMIHEGMVETA 407

Query: 806 FQTAAGVYEVAWSEDGLGYSFQTPESWNNNDEYRSLCYMRPLTIWAMQWALTKPKLSRQE 865
           F TA+G+Y+ AWS++GLGYSFQTPE+WN ++EYRSLCYMRPL IWAMQWAL+KP+L   +
Sbjct: 408 FNTASGIYDAAWSQEGLGYSFQTPEAWNTDEEYRSLCYMRPLAIWAMQWALSKPELHNHD 467

Query: 866 IKHEISDRDSSYLEQHAAFSKVASLLKLPKEEASKGFLKVVYDFTIGRVLCHNLN 920
           +KHE   + +   E H  F KVA LLKLP+EEASK FL++ +D T  R+    LN
Sbjct: 468 MKHE-EGKGTLNFEHHVGFEKVAHLLKLPEEEASKSFLQLFFDLTCRRLYTLRLN 521


>gi|449526389|ref|XP_004170196.1| PREDICTED: non-lysosomal glucosylceramidase-like, partial [Cucumis
           sativus]
          Length = 440

 Score =  637 bits (1644), Expect = e-180,   Method: Compositional matrix adjust.
 Identities = 297/439 (67%), Positives = 349/439 (79%), Gaps = 1/439 (0%)

Query: 7   EEGENAASMPKVDPGQPPQLTWQRKLDSNVKVPTGFTLSFQEFRHLAHIGFRLYRYSKQE 66
           E+G N AS  +VDP +PP LTW+RKLD   K P  F+ +  +  H+   G+RL+R  K+E
Sbjct: 2   EKGGNGASSTEVDPSKPPSLTWKRKLDFTGKSPESFSFTLTDAWHMGMTGYRLWRNGKEE 61

Query: 67  EAKGKVPVFNVFRKHHITSDQGIPLGGIGAGSIGRSYRGEFQRFKLFHGICDDAPVLANQ 126
            AKG++P++  F    IT   G+ LGGIGAGSIGRSYRGEFQRF++F+G C+D PVLANQ
Sbjct: 62  IAKGRIPIYEFFSDVPITCYHGVSLGGIGAGSIGRSYRGEFQRFQMFYGPCEDEPVLANQ 121

Query: 127 FSVFVSRPNGEKFSSVLCPRSPGVPKKNTDSGIESWDWNLKGENCTYHALFPRAWTVYDG 186
           FSVFVSRPNG KFSSVLC   P   K    +GIESWDWNL GEN TYHALFPR+WTVYDG
Sbjct: 122 FSVFVSRPNGNKFSSVLCSAKPQKSKDGKQTGIESWDWNLSGENSTYHALFPRSWTVYDG 181

Query: 187 EPDPELRIVCRQISPFIPHNYKESSFPASVFTFTLSNSGQTSADVTLLFTWANSVAGDSG 246
           EPDP+L+IVCRQ+SP IPHNYKESSFP SVFTF LSN GQTSA VTLLFTWANSV G SG
Sbjct: 182 EPDPDLKIVCRQMSPIIPHNYKESSFPVSVFTFKLSNEGQTSAQVTLLFTWANSVGGKSG 241

Query: 247 LSGHHFNSKTMTKDGVHGLTLHHRTANGRPPVTFAVAAEETADVHVSECPCFLLSGNSKG 306
            +GHHFNSK   +DG  G+ LHH++ANGRP VT+ +AAE T DVHVS CPCF++SG+S+G
Sbjct: 242 FTGHHFNSKMGAEDGAQGVLLHHKSANGRPTVTYGIAAEATDDVHVSLCPCFVISGDSEG 301

Query: 307 ITAKDMWNEIKKHGSFDHLDN-DKTSPSEPGSSIGAAIAASLTIPSGSTRSVTFSLAWDC 365
           I+AKDMW EIK HGSFD+L +      S+PG SIGAA+AA+LTIP  S R+VTFSLAWDC
Sbjct: 302 ISAKDMWQEIKNHGSFDNLGSVGANEGSKPGCSIGAAVAATLTIPPTSARTVTFSLAWDC 361

Query: 366 PEVKFFEKVYHRRYTKFYGTLGDSAARIAHDAILEHAKWECEIEAWQRPILEDKRFPEWY 425
           PEVKF  K YHR+Y+KFYGTLGD+A  IA DAI +H KWE EIEAWQRPI+EDKR P+WY
Sbjct: 362 PEVKFDGKTYHRQYSKFYGTLGDAAEIIARDAISKHGKWEAEIEAWQRPIIEDKRLPDWY 421

Query: 426 PITLFNELYYLNAGGTIWT 444
           P+TLFNELY+LN+GGTIWT
Sbjct: 422 PVTLFNELYFLNSGGTIWT 440


>gi|449482327|ref|XP_004156248.1| PREDICTED: non-lysosomal glucosylceramidase-like, partial [Cucumis
           sativus]
          Length = 508

 Score =  634 bits (1635), Expect = e-179,   Method: Compositional matrix adjust.
 Identities = 316/503 (62%), Positives = 383/503 (76%), Gaps = 33/503 (6%)

Query: 444 TDGSPPMQSLATIRERKFSLD---------TPSKIHT---------------------PA 473
           +DG PP+Q+L+TI  +K+ L+          P+  H                        
Sbjct: 4   SDGLPPLQNLSTISHKKYFLERSKSELNGGAPNGDHRKDVAVDILERMSQILDQTHGGAG 63

Query: 474 SSDTALGTRLLENGEENIGQFLYLEGAEYVMYNTYDVHFYSSFALVMLFPKLELSIQRDF 533
            S+ ALGTRLL  GEEN+G  L +EG++Y+M+NTYDVHFYSSFAL+MLFPKLELSIQRDF
Sbjct: 64  PSNAALGTRLLHPGEENVGNLLLVEGSQYLMWNTYDVHFYSSFALIMLFPKLELSIQRDF 123

Query: 534 AAAVMMHDPGTMKIMSDGKWVARKCLGAVPHDIGLDDPWFEINSYNLFNSSRWKDLNSKF 593
           AAAV+MHDP   K M DG WV RK LGAVPHDIG +DPW E+N+YNL N SRWKDL SKF
Sbjct: 124 AAAVLMHDPRKAKTMCDGNWVPRKVLGAVPHDIGFNDPWLEVNAYNLLNVSRWKDLGSKF 183

Query: 594 VLQVYRDFVATGDKNFARAVWPSVYIAMAYMEQFDKDGDGMIENEGFPDQTYDAWSANGV 653
           VLQVYRD VATGDKNFA++VWPSVY+A+A+MEQFDKD DGMIENEGFPDQTYD W+  GV
Sbjct: 184 VLQVYRDVVATGDKNFAKSVWPSVYVALAFMEQFDKDKDGMIENEGFPDQTYDTWTVKGV 243

Query: 654 SAYCGGLWVAALQAASALANDVGDHASASYFWVRYQKAKAVYDSLWNGSYFNYDNSDGSS 713
           SAYCGGLWVAALQAASALA++V D A+A YFW++YQKA++VY++LWNGSYFNYDNS G  
Sbjct: 244 SAYCGGLWVAALQAASALASEVDDEAAAHYFWIKYQKARSVYETLWNGSYFNYDNSKGPW 303

Query: 714 STSIQADQLAGQWYARACGLLPIADEAKVKKALTKIYDFNVLKVKGGMCGAMNGMQPDGR 773
           S+SIQADQLAGQWYARACGL PIADE K++ AL KIY+FNV+KVKGG  GA+NGM PDG 
Sbjct: 304 SSSIQADQLAGQWYARACGLCPIADEEKIRVALEKIYNFNVMKVKGGTRGAVNGMFPDGS 363

Query: 774 IDMSGLQAREIWPGVTYGLAASMIQEEMVDMAFQTAAGVYEVAWSEDGLGYSFQTPESWN 833
           +D S LQ +EIW GVTY +AA+MIQE MV+  FQTA G+Y+ AW++DGLGYSFQTPE+W+
Sbjct: 364 VDKSILQPKEIWAGVTYSVAATMIQEGMVETGFQTAMGIYQAAWAQDGLGYSFQTPEAWD 423

Query: 834 NNDEYRSLCYMRPLTIWAMQWA-LTKPKLSRQEIKHEISDRDSSYLEQHAAFSKVASLLK 892
            +D +RS+ YMRPL IWAMQWA ++  K ++   K     ++S++  QHAAF KVASLLK
Sbjct: 424 VDDRFRSIGYMRPLAIWAMQWAMMSDSKPTKVPTKAFSEMQESAFATQHAAFLKVASLLK 483

Query: 893 LPKEE--ASKGFLKVVYDFTIGR 913
           LP  +  A +  ++  YDF   R
Sbjct: 484 LPSNDDTARRSLVEAAYDFICKR 506


>gi|328869576|gb|EGG17953.1| hypothetical protein DFA_06619 [Dictyostelium fasciculatum]
          Length = 1200

 Score =  618 bits (1593), Expect = e-174,   Method: Compositional matrix adjust.
 Identities = 346/908 (38%), Positives = 509/908 (56%), Gaps = 81/908 (8%)

Query: 2    ENGVKEEGENAASMPK---VDPGQPPQLTWQRKLDSNVKVPTGFTLSFQEFRHLAHIGFR 58
            EN   EE +  + M K   +    PP   W R L++       F +SF +   L  +G+R
Sbjct: 309  ENSTFEEEDIQSEMEKWQRLGEDIPPH-CWTRALNNQFPDHKKFQVSFGQGIKLMGLGYR 367

Query: 59   LYRYSKQEEAKGKVPVFNVFRKHHITSDQGIPLGGIGAGSIGRSYRGEFQRFKLFHGICD 118
            +++Y K+E   G++P+ + F   +     G+P+GGIG GSI R +RG+F R+ L +G+  
Sbjct: 368  MWKYVKREMNAGRLPIMDPFNMPNPGPIMGVPIGGIGCGSITRGWRGDFVRWNLSNGVVT 427

Query: 119  DAPVLANQFSVFVSRPN---------------GEKFSSVLCPRSPGVPKKNTDSGIESWD 163
            +  V AN FSVF+   N                 K ++VLC    G P+ N +  ++ W+
Sbjct: 428  NKVVDANNFSVFIKMNNQGSTSSSSSSSSSSATNKLATVLCQ---GKPRNNQN--LDVWN 482

Query: 164  WNLKGENCTYHALFPRAWTVYDGEPDPELRIVCRQISPFIPHNYKESSFPASVFTFTLSN 223
            W LKG+  +Y  +FPRAWTVY+ EP P++++VC+Q+SP I HNY+ESS+P +VF + + N
Sbjct: 483  WALKGDKSSYFGMFPRAWTVYE-EPHPDIKLVCKQVSPVIAHNYQESSYPCAVFVWKIDN 541

Query: 224  SGQTSADVTLLFTWANSVAGDSG--LSGHHFNSKTMTKDGVHGLTLH-HRTANGRPPVTF 280
            S   SAD++L+FTW N    DS   L GH+  S ++      G+TL+ +R+A+    +  
Sbjct: 542  SAPVSADISLMFTWQNGDGTDSTDQLGGHYNRSFSLDNGKFRGITLNTNRSASQNSSMEL 601

Query: 281  AVAAEETADVHVSECPCFLLSGNSKGITAKDMWNEIKKHGSFDHLDNDKTSPSEPGSSIG 340
            ++ A    D   S          +  + A ++W    K+G  D+  +D T P++    IG
Sbjct: 602  SIGA--LCDEPGSNFSYVSRFETTNRLEAANLWYSFNKNGVLDN--SDDTRPNQAKKPIG 657

Query: 341  AAIAASLTIPSGSTRSVTFSLAWDCPEVKFF--EKVYHRRYTKFYGTLGDSAARIAHDAI 398
            AAIA  + + + S+R + F++AWD P   F   + +Y RRYTKF+GT G+++  IAH A 
Sbjct: 658  AAIAYKVRVEAQSSRQIVFAVAWDSPYCTFNSGKSIYPRRYTKFFGTSGNNSQSIAHYAS 717

Query: 399  LEHAKWECEIEAWQRPILEDKRFPEWYPITLFNELYYLNAGGTIWTDGSPPMQSLATIRE 458
            L +  WE +I +WQ PIL+D + P +Y   LFNELY+L  GG+IWT       +      
Sbjct: 718  LNYQNWEQQINSWQNPILQDPQLPSFYKKALFNELYFLVDGGSIWTINQNNQNNNQNNII 777

Query: 459  RKFSLDTPSKIHTPASSDTALGTRLLENGEENIGQFLYLEGAEYVMYNTYDVHFYSSFAL 518
               + +  + I+                 ++ IGQF YLE  EY+MYNTYDVHFY+SFAL
Sbjct: 778  DNNNNNNNNYIY-----------------DKEIGQFAYLESQEYLMYNTYDVHFYASFAL 820

Query: 519  VMLFPKLELSIQRDFAAAVMMHDPG-TMKIMSDGKWVARKCLGAVPHDIGL--DDPWFEI 575
              L+P LE S+Q   A A M    G T++ +  GK + RK  G VPHDIG   +DPW  +
Sbjct: 821  ATLWPSLEFSLQSGIADATMEDYGGETVECIHSGKQIPRKLRGTVPHDIGNPGEDPWKRV 880

Query: 576  NSYNLFNSSRWKDLNSKFVLQVYRDFVAT------------------------GDKNFAR 611
            N+YN+ + SRWKDL  KF+LQVYRD++ T                        GD+ F  
Sbjct: 881  NAYNIQDISRWKDLPCKFILQVYRDYLLTSVDSSGQKSNATSSSFGQEYIPLDGDRGFLL 940

Query: 612  AVWPSVYIAMAYMEQFDKDGDGMIENEGFPDQTYDAWSANGVSAYCGGLWVAALQAASAL 671
             +W +V   +    +FD D DG+I+NEGFPDQTYD WSA+G SAY GGLW+AA++A SA+
Sbjct: 941  QMWGTVEEVIRKAFEFDTDDDGVIDNEGFPDQTYDTWSASGCSAYTGGLWLAAIKATSAM 1000

Query: 672  ANDVGDHASASYFWVRYQKAKAVYDS-LWNGSYFNYDNSDGSSSTSIQADQLAGQWYARA 730
            A  +G       +   ++K K  Y+  LWNG YFNYD+S  S   SI ADQLAG WY  +
Sbjct: 1001 ARILGLRDDEEVYSKLFEKGKKSYNKKLWNGHYFNYDSSKQSHYNSIMADQLAGHWYLTS 1060

Query: 731  CGLLPIADEAKVKKALTKIYDFNVLKVKGGMCGAMNGMQPDGRIDMSGLQAREIWPGVTY 790
            CGL       +    L+ I ++N+     G CGA+NG+ P   +D + LQ+ E+W G +Y
Sbjct: 1061 CGLSSYMTLDQALSTLSIINEYNIKSYSNGSCGAVNGISPLAIVDQTCLQSSEVWIGTSY 1120

Query: 791  GLAASMIQEEMVDMAFQTAAGVYEVAWSEDGLGYSFQTPESWNNNDEYRSLCYMRPLTIW 850
             LA++ +   M   A+    G+  V  S +  G+ +QTPE+W+ N  +R+  YMRPL IW
Sbjct: 1121 SLASTFLLHYMDKEAWSLIKGL--VNSSYEKWGFQYQTPEAWDMNGSFRASTYMRPLAIW 1178

Query: 851  AMQWALTK 858
            ++QWAL++
Sbjct: 1179 SVQWALSR 1186


>gi|125821356|ref|XP_687652.2| PREDICTED: non-lysosomal glucosylceramidase [Danio rerio]
          Length = 851

 Score =  616 bits (1588), Expect = e-173,   Method: Compositional matrix adjust.
 Identities = 338/822 (41%), Positives = 474/822 (57%), Gaps = 64/822 (7%)

Query: 55  IGFRLYRY-SKQEEAKGKVPVFNVFRKHHITSDQGIPLGGIGAGSIGRSYRGEFQRFKLF 113
           +G R  ++  ++ + + K P  ++FR   +    G+PLGGIG GSI R +RGEF R++L 
Sbjct: 61  LGVRYLKWWYRKTQVEKKAPFIDMFRAVPLRQIYGVPLGGIGGGSITRGWRGEFCRWQLN 120

Query: 114 HGICDDAPVLANQFSVFVSRPNGEKFSSVLCPRSPGVPKKNTDSGIESWDWNLKGENCTY 173
            G+     V+ANQF+V + R     +  VL    P        S ++ W+W   GE+  Y
Sbjct: 121 PGMYHYKTVIANQFTVCLRRDGQTVYQQVLSTERP--------STLQGWNWGYCGEHAFY 172

Query: 174 HALFPRAWTVYDGEPDPELRIVCRQISPFIPHNYKESSFPASVFTFTLSNSGQTSADVTL 233
           H LFPRAWTVY+  P   + + CRQ+SP IPH+YK+SS P +VF + + N    + D+++
Sbjct: 173 HGLFPRAWTVYN-LPGQNVTLTCRQVSPVIPHDYKDSSLPVAVFVWDIENKNDYALDISI 231

Query: 234 LFTWANSVAGDSGLSGHHFNSK-TMTKDG--VHGLTLHHRTANGRPPVTFAVAAEETADV 290
           +FT  N        SG H+N    + K+G  V G+ LHH+T     P T  +AA + +  
Sbjct: 232 MFTMVNGSGQKDDKSGGHWNEPFHLEKEGESVSGVLLHHQTPAN--PYTLCIAARQKSGQ 289

Query: 291 HVSECPCFLLSGNSKGITAKDMWNEIKKHGSFDHLDNDKTSPSEPGSSIGAAIAASLTIP 350
            +S    F   G     T   +W ++   G  D   N  + P+E G  + AA+  S ++ 
Sbjct: 290 EISHQTAFSPKG-----TCSAVWCDLMTDGRLDSPTN-SSPPTEKGEEVAAALVVSCSVS 343

Query: 351 SGSTRSVTFSLAWDCPEVKF--FEKVYHRRYTKFYGTLGDSAARIAHDAILEHAKWECEI 408
           + S  SV FSLAWD P++ F   E+ Y RRYT++YGT GD+A  I+H A+  ++KWE  I
Sbjct: 344 ANSRNSVDFSLAWDMPKITFGSKERTYVRRYTRYYGTKGDAAPSISHYALTHYSKWEESI 403

Query: 409 EAWQRPILEDKRFPEWYPITLFNELYYLNAGGTIWTDGSPPMQSLATIRERKFSLDTPSK 468
           E WQRPIL+D+  P WY   LFNELY++  GGTIW                         
Sbjct: 404 EEWQRPILQDRSLPAWYKSALFNELYFVVDGGTIW------------------------- 438

Query: 469 IHTPASSDTALGTRLLENG-------EENIGQFLYLEGAEYVMYNTYDVHFYSSFALVML 521
           +  P  +D + G R  + G        ++ G+F YLEG EY MYNTYDVHFY+SFAL+ML
Sbjct: 439 VELPEDADISGGLRPEDGGLPAQPEVVKDFGRFAYLEGQEYRMYNTYDVHFYASFALIML 498

Query: 522 FPKLELSIQRDFAAAVMMHDPGTMKIMSDGKWVARKCLGAVPHDIGL--DDPWFEINSYN 579
           +PKL LS+Q D A +V+ HDP     + +G++   K  G VPHDIG   D+PW  +N+Y 
Sbjct: 499 WPKLALSLQYDIAGSVVQHDPMERLNLMNGRYSPVKTRGVVPHDIGDPDDEPWVRVNAYL 558

Query: 580 LFNSSRWKDLNSKFVLQVYRDFVATGDKNFARAVWPSVYIAMAYMEQFDKDGDGMIENEG 639
           + +++ WKDLN KFVLQVYRD+  T D+ + + +WP     M    +FDKDGDG+IEN G
Sbjct: 559 IHDTADWKDLNLKFVLQVYRDYHLTQDQQYLKDMWPVCQTVMENELKFDKDGDGLIENSG 618

Query: 640 FPDQTYDAWSANGVSAYCGGLWVAALQAASALANDVGDHASASYFWVRYQKAKAVYDS-L 698
           + DQTYD W   G SAYCGGLW+A++     +A  +   +    +    ++  A +D  L
Sbjct: 619 YADQTYDGWKVTGPSAYCGGLWLASVCMMCKMARVLNCESVYQRYRDILERGSAAFDKLL 678

Query: 699 WNGSYFNYDNSDGSSSTSIQADQLAGQWYARACGLLPIADEA----KVKKALTKIYDFNV 754
           WNG Y+NYD+S  S S S+ +DQ AG W+ RA GL     +A    K+  AL  ++D NV
Sbjct: 679 WNGKYYNYDSSGRSLSNSVMSDQCAGHWFLRASGLGDDEYQAFPKEKICSALKSVFDLNV 738

Query: 755 LKVKGGMCGAMNGMQPDGRIDMSGLQAREIWPGVTYGLAASMIQEEMVDMAFQTAAGVYE 814
           +   GG  GA+NGM+P+G  D S +Q+ E+W GV YGLAA+MI E MV+   +TA G Y 
Sbjct: 739 MSFAGGQMGAVNGMRPEGVPDRSSVQSDEVWVGVVYGLAATMIHEGMVEEGLRTAEGCYR 798

Query: 815 VAWSEDGLGYSFQTPESWNNNDEYRSLCYMRPLTIWAMQWAL 856
             W    +G +FQTPE++     YRSL YMRPL+IWAMQ AL
Sbjct: 799 AVWER--MGMAFQTPEAYCEKGIYRSLAYMRPLSIWAMQLAL 838


>gi|66801399|ref|XP_629625.1| hypothetical protein DDB_G0292446 [Dictyostelium discoideum AX4]
 gi|60463006|gb|EAL61202.1| hypothetical protein DDB_G0292446 [Dictyostelium discoideum AX4]
          Length = 1302

 Score =  615 bits (1586), Expect = e-173,   Method: Compositional matrix adjust.
 Identities = 346/917 (37%), Positives = 510/917 (55%), Gaps = 108/917 (11%)

Query: 24   PQLTWQRKLDSNVKVPTGFTLSFQEFRHLAHIGFRLYRYSKQEEAKGKVPVFNVFRKHHI 83
            PQ  W R+LD+N      F +S  +   L  +G+R+++Y K+E + G+VP+ + F     
Sbjct: 369  PQYAWARRLDNNYPEHKHFQVSMWQGLKLVGMGYRMWKYVKKETSSGRVPIMDPFNLPKP 428

Query: 84   TSDQGIPLGGIGAGSIGRSYRGEFQRFKLFHGICDDAPVLANQFSVFVSRP--------- 134
                G+P+GGIG+GSI R ++G+F R+ L +G+  +  V  ++FSV+++           
Sbjct: 429  GPIMGVPIGGIGSGSINRGWKGDFVRWNLNNGLVSNEIVDVDKFSVYINFDNSNTSSSSS 488

Query: 135  -----------------------------------NGEKFSSVLCPRSPGVPKKNTDSGI 159
                                               N  K         PG PK N +  +
Sbjct: 489  SSSSNSSTPPPQFSNNNNNSYNNSYNNNINNNGFDNNLKSKQKAVVLYPGKPKSNLN--L 546

Query: 160  ESWDWNLKGENCTYHALFPRAWTVYDGEPDPELRIVCRQISPFIPHNYKESSFPASVFTF 219
            + W+W LKG+N  Y  LFPRAWTVY+ EP P++R+ C+Q+SP IPHNY+ESS+P  V+ +
Sbjct: 547  DVWNWGLKGDNSCYFGLFPRAWTVYE-EPHPDIRLTCKQVSPVIPHNYQESSYPCGVYVW 605

Query: 220  TLSNSGQ-TSADVTLLFTWANSVAGDSGLSGHHFNS-------------------KTMTK 259
             + N+ Q +SADV+L+ TW NS+   S   G HFN                        K
Sbjct: 606  KIENNNQYSSADVSLMLTWQNSIGTQSDQDGGHFNKYFTFGDNDQQSPSSSSSSSSGNNK 665

Query: 260  DGVHGLTLHHRTA------NGRP-----PVTFAVAAEETADVHVSECPCFLLSGNSKGIT 308
              + G+T+ H+        NG       P+ +++A  +  DV +S C  F  +     + 
Sbjct: 666  KNIKGITMIHKKEQKCTLPNGNTKVFNDPMEYSIAVMDEPDVEISFCERFETTSR---LD 722

Query: 309  AKDMWNEIKKHGSFDHLDNDKTSPSEPGSSIGAAIAASLTIPSGSTRSVTFSLAWDCPEV 368
            A ++W    K G  +  +N  + PS P   IGAA+AA + + +GS++++ F ++WD P  
Sbjct: 723  AANLWYTFNKKGELE--NNKDSRPSAPKKPIGAAVAAKVKVAAGSSKTIVFCISWDAPIT 780

Query: 369  KF-FEKVYHRRYTKFYGTLGDSAARIAHDAILEHAKWECEIEAWQRPILEDKRFPEWYPI 427
            +F     Y++RYTKFYG  G ++ RIA+DAI     W+ +I  WQ+PIL D   P +Y +
Sbjct: 781  RFQMGSAYYKRYTKFYGNQGGNSQRIAYDAINNWKHWDNQIIQWQQPILSDPELPTFYKM 840

Query: 428  TLFNELYYLNAGGTIWTDGSPPMQSLATIRERKFSLDTPSKIHTPASSDTALGTRLLENG 487
             LFNELYYL  GGT+WT GSP    +  +  R     T +++    ++DT          
Sbjct: 841  ALFNELYYLVDGGTVWTHGSP----IDPVPSR---YPTKTQLKQEDTTDT---------- 883

Query: 488  EENIGQFLYLEGAEYVMYNTYDVHFYSSFALVMLFPKLELSIQRDFAAAVMMHDPGTMKI 547
              +IG+F YLE  EY+MYNTYDV FY+SFAL ML+P LE+S+Q D A A  +      + 
Sbjct: 884  -NHIGRFAYLESLEYLMYNTYDVMFYASFALTMLWPLLEISLQYDIADATQLDYGINWEG 942

Query: 548  MSDGKWVARKCLGAVPHDIGL--DDPWFEINSYNLFNSSRWKDLNSKFVLQVYRDFVATG 605
            +  G  + RK  G VPHD+G   +DPW  +NSY + + SRWKDL SKFVLQVYRD++   
Sbjct: 943  IHSGHLMPRKKQGTVPHDLGNPGEDPWKRVNSYCIQDVSRWKDLPSKFVLQVYRDYLVME 1002

Query: 606  DKNFARAVWPSVY-IAMAYMEQFDKDGDGMIENEGFPDQTYDAWSANGVSAYCGGLWVAA 664
            DKNF    +  V  +    +E FD D DG+++NEGFPDQTYD W A G SAY GGLW+AA
Sbjct: 1003 DKNFLLQCYNVVEGVIQRTLEHFDFDHDGVVDNEGFPDQTYDVWPATGCSAYSGGLWLAA 1062

Query: 665  LQAASALANDVGDHASASYFWVRYQK-AKAVYDSLWNGSYFNYDNSDGSSSTSIQADQLA 723
            L+ A+ +A  +G +   S +   ++K +K+    LWNG YFNYD+S    S SI +D LA
Sbjct: 1063 LKVAAEIAKILGFNEDESIYNAIFEKGSKSFTKKLWNGHYFNYDSSKSVHSDSIMSDMLA 1122

Query: 724  GQWYARACGLLPIADEAKVKKALTKIYDFNVLKVKGGMCGAMNGMQPDGRIDMSGLQARE 783
            G WY  +CGL       +   +L+ I ++NV     G+CGA+NGM+P+G +D + LQ+ E
Sbjct: 1123 GHWYLLSCGLKSYMTFDQALSSLSIINEYNVKSYSKGVCGAVNGMRPEGTVDTTCLQSSE 1182

Query: 784  IWPGVTYGLAASMIQEEMVDMAFQTAAGVYEVAWSEDGLGYSFQTPESWNNNDEYRSLCY 843
            +W G ++ LAA+MI   M   A+    G+   ++ +   G+ +QTPE+W+ N  +R+  Y
Sbjct: 1183 VWIGTSFSLAATMILHHMDSDAWDLVKGIVNSSYQK--WGFQYQTPEAWDQNGCFRAASY 1240

Query: 844  MRPLTIWAMQWALTKPK 860
            MRPL+IW++QWAL K K
Sbjct: 1241 MRPLSIWSIQWALQKRK 1257


>gi|363744064|ref|XP_003642968.1| PREDICTED: non-lysosomal glucosylceramidase-like [Gallus gallus]
          Length = 853

 Score =  603 bits (1554), Expect = e-169,   Method: Compositional matrix adjust.
 Identities = 334/820 (40%), Positives = 469/820 (57%), Gaps = 52/820 (6%)

Query: 52  LAHIGFRLYRYS---KQEEAKGKVPVFNVFRKHHITSDQGIPLGGIGAGSIGRSYRGEFQ 108
           L H+G  L  +    K+   + K    ++     +    G PLGGIG+G+I R +RGEF 
Sbjct: 51  LRHVGLALRYFKWWYKKTRIEKKSAFIDLLCAVPLQQIYGCPLGGIGSGTITRGWRGEFC 110

Query: 109 RFKLFHGICDDAPVLANQFSVFVSRPNGEKFSSVLCPRSPGVPKKNTDSGIESWDWNLKG 168
           R++L  G      V+A+QF+V + R     +  VL    P        S ++ W+W   G
Sbjct: 111 RWQLNPGKYHYETVIADQFTVCLRRKGQTVYQQVLSVEKP--------SALQGWNWGYCG 162

Query: 169 ENCTYHALFPRAWTVYDGEPDPELRIVCRQISPFIPHNYKESSFPASVFTFTLSNSGQTS 228
               YHAL+PRAW VY+  P   + + CRQISP IPH+YK+SS P  VF + + N+ +  
Sbjct: 163 RYAFYHALYPRAWMVYE-LPGQNVVLTCRQISPVIPHDYKDSSLPVGVFIWEVENNSEEP 221

Query: 229 ADVTLLFTWANSVAGDSGLSGHHFNSKTMTKDG---VHGLTLHHRTANGRPPVTFAVAAE 285
            DV+++FT  N        SG H+N     +DG   V G+ LHH T     P TFAVAA 
Sbjct: 222 VDVSIMFTLQNGTGTKGDRSGGHWNEPFALQDGGERVAGVLLHHCTPVN--PFTFAVAAR 279

Query: 286 ETADVHVSECPCFLLSGNSKGITAKDMWNEIKKHGSFDHLDNDKTSPSEPGSSIGAAIAA 345
           E A   ++    F  +G     + +D+W ++ + G  +     K+  ++ G    AA+ A
Sbjct: 280 EKAGTVITHLTAFDPAG-----SGRDVWQDLLQDGKLES-PTGKSKQTQNGEVTAAAVCA 333

Query: 346 SLTIPSGSTRSVTFSLAWDCPEVKF--FEKVYHRRYTKFYGTLGDSAARIAHDAILEHAK 403
           S T+P+    ++  +L WD P V F   E+++ RRYT+F+G+ GD+A  ++H A+  + +
Sbjct: 334 SCTVPAQGHGTLELALVWDMPHVHFGSKERLHLRRYTRFFGSNGDAAPALSHYALTHYKE 393

Query: 404 WECEIEAWQRPILEDKRFPEWYPITLFNELYYLNAGGTIWTDGSPPMQSLATIRERKFSL 463
           WE +IEAWQ+PILED + P WY   LFNELY++  GGTIW D  P               
Sbjct: 394 WERKIEAWQKPILEDSQLPSWYKSALFNELYFMTDGGTIWLDLPPDC------------- 440

Query: 464 DTPSKIHTPASSDTALGTRLLENGEENIGQFLYLEGAEYVMYNTYDVHFYSSFALVMLFP 523
             P  +  P ++  +    +L       G+F YLEG EY MYNTYDVHFY+SFALVML+P
Sbjct: 441 -LPQDLQGPGAAKLSHLLPVLRE----YGRFAYLEGQEYRMYNTYDVHFYASFALVMLWP 495

Query: 524 KLELSIQRDFAAAVMMHDPGTMKIMSDGKWVARKCLGAVPHDIGL--DDPWFEINSYNLF 581
           KL++S+Q D A  V+  D    + +  GK    K    VPHDIG   D+PW  +N+Y + 
Sbjct: 496 KLQISLQYDIAVTVLNEDVQPRQYLVCGKTAQVKVKNVVPHDIGDPDDEPWQRVNAYLMH 555

Query: 582 NSSRWKDLNSKFVLQVYRDFVATGDKNFARAVWPSVYIAMAYMEQFDKDGDGMIENEGFP 641
           +++ WKDLN KFVLQVYRD+  T D  + + +WP     M    +FD D DG+IEN G  
Sbjct: 556 DTANWKDLNLKFVLQVYRDYYLTHDALYLQDMWPVCQAVMESELKFDTDNDGLIENGGIA 615

Query: 642 DQTYDAWSANGVSAYCGGLWVAALQAASALANDVGDHASASYFWVRYQKAKAVYDS-LWN 700
           DQTYDAW  +G SAYCGGLW+AA++    +A  +GD  +   +    QK K  ++  LWN
Sbjct: 616 DQTYDAWVVDGASAYCGGLWLAAVRMMCEMAEVLGDTETRQKYDAILQKGKESFERLLWN 675

Query: 701 GSYFNYDNSDGSSSTSIQADQLAGQWYARACGL----LPIADEAKVKKALTKIYDFNVLK 756
           G Y+NYD+S  S+S+SI +DQ AGQW+  ACGL    + +  ++ +  AL  I++ NV+ 
Sbjct: 676 GKYYNYDSSGSSTSSSIMSDQCAGQWFLGACGLDQKEVEVFPKSHIVSALKTIFEKNVMS 735

Query: 757 VKGGMCGAMNGMQPDGRIDMSGLQAREIWPGVTYGLAASMIQEEMVDMAFQTAAGVYEVA 816
             GG  GA+NGM+PDG  D S +Q+ E+W GV Y LAA+MIQE +V   F TA G Y   
Sbjct: 736 FAGGKMGAVNGMRPDGVPDTSSVQSSEVWVGVVYALAATMIQEGLVQEGFHTAEGCYRTV 795

Query: 817 WSEDGLGYSFQTPESWNNNDEYRSLCYMRPLTIWAMQWAL 856
           W    LG +FQTPE++     YRSL YMRPL+IW+MQ AL
Sbjct: 796 WEH--LGMAFQTPEAYCEKKVYRSLAYMRPLSIWSMQLAL 833


>gi|156356077|ref|XP_001623757.1| predicted protein [Nematostella vectensis]
 gi|156210486|gb|EDO31657.1| predicted protein [Nematostella vectensis]
          Length = 783

 Score =  600 bits (1547), Expect = e-168,   Method: Compositional matrix adjust.
 Identities = 338/829 (40%), Positives = 481/829 (58%), Gaps = 74/829 (8%)

Query: 61  RYSKQEEAKGKVPVFNVFRKHHITSDQGIPLGGIGAGSIGRSYRGEFQRFKLFHGICDDA 120
           +Y+ +E+  G+VP  +    +      G+PLGGIG G+IGR +RGEF R++L  GI    
Sbjct: 2   KYATKEKKNGRVPYIDPLAANPCRQVYGVPLGGIGCGTIGRGWRGEFNRWQLTPGIYSYN 61

Query: 121 PVLANQFSVFVSRPNGEKFSSVLCPRSP-GVPKKNTDSGIESWDWNLKGENCTYHALFPR 179
            V ANQF V V +     + +VL P  P GV        ++ W+W   G N  YHAL+PR
Sbjct: 62  YVEANQFVVCVRKKGRTTYQAVLSPNRPNGVL---CGRSLQGWNWGFSGSNAVYHALYPR 118

Query: 180 AWTVYDGEPDPELRIVCRQISPFIPHNYKESSFPASVFTFTLSNSGQTSADVTLLFTWAN 239
           AWT Y+  P  ++ +VCRQ+SP  PH+YK++S P +VF +++ N+     +V+++F++ N
Sbjct: 119 AWTRYE-LPGQDIILVCRQVSPVFPHDYKDTSLPVAVFIWSIENNNNEEVEVSIMFSFEN 177

Query: 240 SVAGDSGLSGHHFN-----SKTMTKDGVHGLTLHHRTANGRPPVTFAVAAEETADVHVSE 294
              G   L+  H+N     + + T   V G+ LHH+  + + P T A+AA+    +H  E
Sbjct: 178 G-DGTQDLTAGHYNESADTTGSNTTCDVTGVLLHHK--HHKLPYTLAIAAKA---LHKKE 231

Query: 295 CPCFLLSGNSKG-----ITAKDMWNEIKKHGSFDHLDNDKTSPSEPGSSIGAAIAASLTI 349
               L++  +K         + +WN++   G      N  T PS P  S  +  AA    
Sbjct: 232 ----LVTVTTKTWFDSRTPCRRVWNDLMDDGKL----NSSTEPSPPSRSDQSLCAAVAAT 283

Query: 350 PSGSTRS---VTFSLAWDCPEVKF--FEKVYHRRYTKFYGTLGDSAARIAHDAILEHAKW 404
            + + RS   + F+LAWD P + F   +K ++R YT+++G  G++   +   A++ +  W
Sbjct: 284 TTVAARSRGELEFALAWDMPVIYFGNSKKRHYRYYTRYFGHQGNAGPALCSHALMSYPDW 343

Query: 405 ECEIEAWQRPILEDKRFPEWYPITLFNELYYLNAGGTIWTDGSPPMQSLATIRERKFSLD 464
           E +IEAWQ+PIL+D+  P WY   LFNELY++  GGT+W D                   
Sbjct: 344 ETKIEAWQKPILQDESLPNWYKSALFNELYFVADGGTVWLD------------------- 384

Query: 465 TPSKIHTPASSDTALGTRLLENGEENIGQFLYLEGAEYVMYNTYDVHFYSSFALVMLFPK 524
                   + S T+ GT  +      IG+F YLEG EY MYNTYDVHFY+SFAL ML+PK
Sbjct: 385 ------VRSESGTSKGTADIV---RKIGRFAYLEGHEYRMYNTYDVHFYASFALAMLWPK 435

Query: 525 LELSIQRDFAAAVMMHDPGTMKIMSDGKWVARKCLGAVPHDIG--LDDPWFEINSYNLFN 582
           L+LS+Q D A AV + D   +  M +G +  +K  G +PHDIG   + PW  +N+Y++ +
Sbjct: 436 LQLSLQYDMAHAVNVSDNQVVMTMMNGHYCRKKVPGCIPHDIGDPSEAPWDHVNAYHIHD 495

Query: 583 SSRWKDLNSKFVLQVYRDFVATGDKNFARAVWPSVYIAMAYMEQFDKDGDGMIENEGFPD 642
           +S+WKDLN KFVLQVYRD+V T D  + + +WP     M     +D DGDG+IEN G  D
Sbjct: 496 TSKWKDLNLKFVLQVYRDYVFTNDVYYLQDMWPITKTVMTKSMTYDSDGDGLIENSGLAD 555

Query: 643 QTYDAWSANGVSAYCGGLWVAALQAASALAN--DVGDHASASYFWVRYQKAKAVYDSLWN 700
           QT+DAW   G SAYCGGLW+AAL+  + +A   D  D     Y  +  +  KA    LWN
Sbjct: 556 QTFDAWPVTGPSAYCGGLWLAALRVMAEIATILDFPDE-RGKYEKILARGKKAYERLLWN 614

Query: 701 GSYFNYDNSDGSSSTSIQADQLAGQWYARACGLLPIADE-----AKVKKALTKIYDFNVL 755
           G Y+NYD+S      SI ADQL+GQWY  AC L   +++       V  AL  +++FNV+
Sbjct: 615 GKYYNYDSSTSKYHNSIMADQLSGQWYLHACDLAQTSNDRVFPSENVISALRTVFNFNVM 674

Query: 756 KVKGGMCGAMNGMQPDGRIDMSGLQAREIWPGVTYGLAASMIQEEMVDMAFQTAAGVYEV 815
           K + G  GA+NG++PDG++D S LQA E+W GVTY +AASMIQE +VD  F+TA+G+Y  
Sbjct: 675 KFQEGTMGAVNGIRPDGQLDTSSLQAEEVWTGVTYAVAASMIQEGLVDEGFKTASGIYNT 734

Query: 816 AWSEDGLGYSFQTPESWNNNDEYRSLCYMRPLTIWAMQWALTKPKLSRQ 864
            +    LG +FQTPE+   N  YRSL YMRPL+IWAMQWAL K K  R+
Sbjct: 735 CFER--LGMNFQTPEAIVANGNYRSLAYMRPLSIWAMQWALEKRKNKRE 781


>gi|226499590|ref|NP_001146319.1| uncharacterized protein LOC100279895 [Zea mays]
 gi|219886613|gb|ACL53681.1| unknown [Zea mays]
          Length = 649

 Score =  597 bits (1540), Expect = e-168,   Method: Compositional matrix adjust.
 Identities = 306/614 (49%), Positives = 407/614 (66%), Gaps = 52/614 (8%)

Query: 312 MWNEIKKHGSFD--HLDNDKTSPSEPGSSIGAAIAASLTIPSGSTRSVTFSLAWDCPEVK 369
           MWN + ++G FD  +     + PS PG  + AA++AS  +      +V F+LAW  P+VK
Sbjct: 1   MWNTMLQNGHFDRENFSAGPSMPSSPGQKLCAAVSASTWVEPHGRCTVVFALAWSSPKVK 60

Query: 370 FFEK-VYHRRYTKFYGTLGDSAARIAHDAILEHAKWECEIEAWQRPILEDKRFPEWYPIT 428
           F +   Y+RRYT+FYGT   SA  + HDA+ ++  WE EIE WQ PIL+D+R PEWY  T
Sbjct: 61  FQKGCTYNRRYTQFYGTSEKSAVNLVHDALTKYKLWEEEIEKWQNPILKDERLPEWYKFT 120

Query: 429 LFNELYYLNAGGTIWTDGSPPM---------------------QSLATIRERKFSLDTPS 467
           LFNELY+L AGGT+WTDG PP                       +  ++++R  +L    
Sbjct: 121 LFNELYFLVAGGTVWTDGQPPAIDEKKSPGFNHQKSSKRGTKDTNQGSVKDRHVNLAVEQ 180

Query: 468 KIH---------------------------TPASSDTALGTRLLENGEENIGQFLYLEGA 500
             H                               S+  +   + ++G E++G+FLYLEG 
Sbjct: 181 VPHGGYMANGDDHSVSKFAAVHGSEMQEQINGLKSEEPIPYLISKDGPEHVGKFLYLEGV 240

Query: 501 EYVMYNTYDVHFYSSFALVMLFPKLELSIQRDFAAAVMMHDPGTMKIMSDGKWVARKCLG 560
           EY+M+NTYDVHFY+SFAL+ LFPK+ELSIQRDFA AV+  D   +K ++DG    RK  G
Sbjct: 241 EYIMWNTYDVHFYASFALLDLFPKIELSIQRDFANAVLYEDRRKVKFLADGTSGIRKVKG 300

Query: 561 AVPHDIGLDDPWFEINSYNLFNSSRWKDLNSKFVLQVYRDFVATGDKNFARAVWPSVYIA 620
           AVPHD+G  DPW E+N+YN+ ++S+WKDLN KFVLQ+YRDF ATGD  F R VWP+V  A
Sbjct: 301 AVPHDLGTHDPWHEMNAYNIHDTSKWKDLNPKFVLQIYRDFAATGDMQFGRGVWPAVCAA 360

Query: 621 MAYMEQFDKDGDGMIENEGFPDQTYDAWSANGVSAYCGGLWVAALQAASALANDVGDHAS 680
           M YM+QFD+DGDG+IEN+GFPDQTYDAW+ +G+SAYCG LW+AALQAA+ +A+ +GD   
Sbjct: 361 MDYMDQFDRDGDGLIENDGFPDQTYDAWTVHGISAYCGCLWLAALQAAATMAHRLGDRHF 420

Query: 681 ASYFWVRYQKAKAVYDS-LWNGSYFNYDNSDGSSSTSIQADQLAGQWYARACGLLPIADE 739
           A  + +++ KAKAVY++ LWNGSYFNYD+   S+S SIQADQLAGQWYA + GL P+ DE
Sbjct: 421 AEKYKLKFIKAKAVYEAKLWNGSYFNYDSGTSSNSRSIQADQLAGQWYAASSGLPPLFDE 480

Query: 740 AKVKKALTKIYDFNVLKVKGGMCGAMNGMQPDGRIDMSGLQAREIWPGVTYGLAASMIQE 799
            K++ AL KI++FNV+KVKGG  GA+NGM P G++D + +Q+REIW GVTY +AA+M+  
Sbjct: 481 HKIRTALQKIFEFNVMKVKGGRMGAVNGMTPKGKVDETCMQSREIWTGVTYAVAANMLLH 540

Query: 800 EMVDMAFQTAAGVYEVAWSEDGLGYSFQTPESWNNNDEYRSLCYMRPLTIWAMQWALTKP 859
            M    F TA G++   WSE+G GY FQTPE W  +  YRSL YMRPL IWA+Q+A++ P
Sbjct: 541 GMEHQGFTTAEGIFTAGWSEEGYGYWFQTPEGWTTDGHYRSLVYMRPLAIWAIQYAVSPP 600

Query: 860 KLSRQEIKHEISDR 873
           K   +  K  + DR
Sbjct: 601 KAILEAPKVNLMDR 614


>gi|414871104|tpg|DAA49661.1| TPA: hypothetical protein ZEAMMB73_422318 [Zea mays]
          Length = 649

 Score =  597 bits (1538), Expect = e-167,   Method: Compositional matrix adjust.
 Identities = 306/614 (49%), Positives = 407/614 (66%), Gaps = 52/614 (8%)

Query: 312 MWNEIKKHGSFD--HLDNDKTSPSEPGSSIGAAIAASLTIPSGSTRSVTFSLAWDCPEVK 369
           MWN + ++G FD  +     + PS PG  + AA++AS  +      +V F+LAW  P+VK
Sbjct: 1   MWNTMLQNGHFDRENFSAGPSMPSSPGQKLCAAVSASTWVEPHGRCTVVFALAWSSPKVK 60

Query: 370 FFEK-VYHRRYTKFYGTLGDSAARIAHDAILEHAKWECEIEAWQRPILEDKRFPEWYPIT 428
           F +   Y+RRYT+FYGT   SA  + HDA+ ++  WE EIE WQ PIL+D+R PEWY  T
Sbjct: 61  FQKGCTYNRRYTQFYGTSEKSAVNLVHDALTKYKLWEEEIEKWQNPILKDERLPEWYKFT 120

Query: 429 LFNELYYLNAGGTIWTDGSPPM---------------------QSLATIRERKFSLDTPS 467
           LFNELY+L AGGT+WTDG PP                       +  ++++R  +L    
Sbjct: 121 LFNELYFLVAGGTVWTDGQPPAIDEKKSPGFNHQKSSKRGTKDTNQGSVKDRHVNLAVEQ 180

Query: 468 KIH---------------------------TPASSDTALGTRLLENGEENIGQFLYLEGA 500
             H                               S+  +   + ++G E++G+FLYLEG 
Sbjct: 181 VPHGGYMANGDDHSVSKFAAVHGSEMQEQINGLKSEEPIPYLISKDGPEHVGKFLYLEGV 240

Query: 501 EYVMYNTYDVHFYSSFALVMLFPKLELSIQRDFAAAVMMHDPGTMKIMSDGKWVARKCLG 560
           EY+M+NTYDVHFY+SFAL+ LFPK+ELSIQRDFA AV+  D   +K ++DG    RK  G
Sbjct: 241 EYIMWNTYDVHFYASFALLDLFPKIELSIQRDFANAVLYEDRRKVKFLADGTSGIRKVKG 300

Query: 561 AVPHDIGLDDPWFEINSYNLFNSSRWKDLNSKFVLQVYRDFVATGDKNFARAVWPSVYIA 620
           AVPHD+G  DPW E+N+YN+ ++S+WKDLN KFVLQ+YRDF ATGD  F R VWP+V  A
Sbjct: 301 AVPHDLGTHDPWHEMNAYNIHDTSKWKDLNPKFVLQIYRDFAATGDMQFGRDVWPAVCAA 360

Query: 621 MAYMEQFDKDGDGMIENEGFPDQTYDAWSANGVSAYCGGLWVAALQAASALANDVGDHAS 680
           M YM+QFD+DGDG+IEN+GFPDQTYDAW+ +G+SAYCG LW+AALQAA+ +A+ +GD   
Sbjct: 361 MDYMDQFDRDGDGLIENDGFPDQTYDAWTVHGISAYCGCLWLAALQAAATMAHRLGDRHF 420

Query: 681 ASYFWVRYQKAKAVYDS-LWNGSYFNYDNSDGSSSTSIQADQLAGQWYARACGLLPIADE 739
           A  + +++ KAKAVY++ LWNGSYFNYD+   S+S SIQADQLAGQWYA + GL P+ DE
Sbjct: 421 AEKYKLKFIKAKAVYEAKLWNGSYFNYDSGTSSNSRSIQADQLAGQWYAASSGLPPLFDE 480

Query: 740 AKVKKALTKIYDFNVLKVKGGMCGAMNGMQPDGRIDMSGLQAREIWPGVTYGLAASMIQE 799
            K++ AL KI++FNV+KVKGG  GA+NGM P G++D + +Q+REIW GVTY +AA+M+  
Sbjct: 481 HKIRTALQKIFEFNVMKVKGGRMGAVNGMTPKGKVDETCMQSREIWTGVTYAVAANMLLH 540

Query: 800 EMVDMAFQTAAGVYEVAWSEDGLGYSFQTPESWNNNDEYRSLCYMRPLTIWAMQWALTKP 859
            M    F TA G++   WSE+G GY FQTPE W  +  YRSL YMRPL IWA+Q+A++ P
Sbjct: 541 GMEHQGFTTAEGIFTAGWSEEGYGYWFQTPEGWTTDGHYRSLVYMRPLAIWAIQYAVSPP 600

Query: 860 KLSRQEIKHEISDR 873
           K   +  K  + DR
Sbjct: 601 KAILEAPKVNLMDR 614


>gi|414589150|tpg|DAA39721.1| TPA: hypothetical protein ZEAMMB73_266944 [Zea mays]
          Length = 583

 Score =  593 bits (1529), Expect = e-166,   Method: Compositional matrix adjust.
 Identities = 307/567 (54%), Positives = 394/567 (69%), Gaps = 23/567 (4%)

Query: 312 MWNEIKKHGSFDHLDNDKTSPSEP---GSSIGAAIAASLTIPSGSTRSVTFSLAWDCPEV 368
           MW  + + GSFD  DN  T PS P   G ++ AA++AS  +      +V F+LAW  P+V
Sbjct: 1   MWGTMVQDGSFDQ-DNLNTGPSMPSLLGDTVCAAVSASTWVEPHGRCAVVFALAWSSPQV 59

Query: 369 KFFE-KVYHRRYTKFYGTLGDSAARIAHDAILEHAKWECEIEAWQRPILEDKRFPEWYPI 427
           KF +   Y+RRYTKFYGT   SA  +  DA++++  WE EI+ WQ P+L+D+R PEWY I
Sbjct: 60  KFKKGSTYYRRYTKFYGTSPRSAVNLVEDALMKYKHWEEEIDKWQTPVLQDERLPEWYKI 119

Query: 428 TLFNELYYLNAGGTIWTDGSPPMQSLATIRERKFSLDTPSKIHTPASSDTALGTRLLENG 487
           TLFNELY+L AGGT+W D      SL +  + K + +  +  H               + 
Sbjct: 120 TLFNELYFLVAGGTVWIDSG----SLVSDADNKSNPEESNLSHEI-------------DD 162

Query: 488 EENIGQFLYLEGAEYVMYNTYDVHFYSSFALVMLFPKLELSIQRDFAAAVMMHDPGTMKI 547
           +EN+G+FLYLEG EY M+ TYDVHFY+SFAL+ LFPK+ELSIQRDFA AV+  D   ++ 
Sbjct: 163 KENVGKFLYLEGIEYFMWCTYDVHFYASFALLDLFPKIELSIQRDFARAVLREDNRRVRF 222

Query: 548 MSDGKWVARKCLGAVPHDIGLDDPWFEINSYNLFNSSRWKDLNSKFVLQVYRDFVATGDK 607
           ++DG W  RK +GAV HD+G  DPW E+N+YN+ ++SRWKDLN KFVLQ+YRDF ATGDK
Sbjct: 223 LADGTWGIRKVIGAVAHDLGAHDPWHELNAYNIHDTSRWKDLNPKFVLQIYRDFAATGDK 282

Query: 608 NFARAVWPSVYIAMAYMEQFDKDGDGMIENEGFPDQTYDAWSANGVSAYCGGLWVAALQA 667
           +F + VWP+V  AM YMEQFD DGDGMIEN+GFPDQTYDAW+  GVSAYCG LW+AALQA
Sbjct: 283 SFGKDVWPAVCTAMEYMEQFDHDGDGMIENDGFPDQTYDAWTVQGVSAYCGCLWLAALQA 342

Query: 668 ASALANDVGDHASASYFWVRYQKAKAVYDS-LWNGSYFNYDNSDGSSSTSIQADQLAGQW 726
           A+ALA  +G    A     R+ KAK V+++ LWNGS+FNYD+    SS SIQADQLAGQW
Sbjct: 343 AAALAGSLGHDDYAERCMTRFVKAKTVFEARLWNGSHFNYDSGTSYSSRSIQADQLAGQW 402

Query: 727 YARACGLLPIADEAKVKKALTKIYDFNVLKVKGGMCGAMNGMQPDGRIDMSGLQAREIWP 786
           Y  + GL P+ DE ++K  L KI+D+NV++VKGG  GA+NGM P+G++D + +Q+REIW 
Sbjct: 403 YTASSGLPPLFDEDRIKCTLQKIFDYNVMRVKGGRMGAVNGMHPNGKVDETCMQSREIWT 462

Query: 787 GVTYGLAASMIQEEMVDMAFQTAAGVYEVAWSEDGLGYSFQTPESWNNNDEYRSLCYMRP 846
           GVTY LAA+M+   M   AF TA G+Y   WSE+G GY FQTPE+W  +  YRSL YMRP
Sbjct: 463 GVTYSLAATMLLHGMEHQAFTTAEGIYTAGWSEEGYGYWFQTPEAWTIDGHYRSLIYMRP 522

Query: 847 LTIWAMQWALTKPKLSRQEIKHEISDR 873
           L IWAMQWAL+ P+   +  K    DR
Sbjct: 523 LAIWAMQWALSPPRSILEAPKVNTMDR 549


>gi|147788372|emb|CAN61187.1| hypothetical protein VITISV_019326 [Vitis vinifera]
          Length = 900

 Score =  592 bits (1526), Expect = e-166,   Method: Compositional matrix adjust.
 Identities = 330/697 (47%), Positives = 426/697 (61%), Gaps = 93/697 (13%)

Query: 256 TMTKDGVHGLTLHHRTANGRPPVTFAVAAEETADVHVSECPCFLLSGNSKGITAKDMWNE 315
           T+ +    G  L +RTA   PPVTFA+AA ET +V V+  P F LS  S  ITAKDMW +
Sbjct: 157 TLDRLQKRGWVLANRTAKENPPVTFAIAACETQNVSVTVLPSFGLSEGSH-ITAKDMWGK 215

Query: 316 IKKHGSFDHLD--NDKTSPSEPGSSIGAAIAASLTIPSGSTRSVTFSLAWDCPEVKFFE- 372
           + + G FD  +  + ++ PS PG ++ AA++AS  +      +V F+LAW  P+VKF + 
Sbjct: 216 MVQDGQFDRENCYSGRSMPSSPGETLCAAVSASAWVEPHGKCTVAFALAWSSPKVKFLKG 275

Query: 373 KVYHRRYTKFYGTLGDSAARIAHDAILEHAKWECEIEAWQRPILEDKRFPEWYPITLFNE 432
             YHRRYTK+YGT   +A  I HDA+  + +WE EIE WQ PIL D R PEWY  TLFNE
Sbjct: 276 SSYHRRYTKYYGTSERAALNIVHDALTNYKQWEEEIEKWQSPILRDDRLPEWYKFTLFNE 335

Query: 433 LYYLNAGGTIWTDGSPPMQSLATIRERKFSLDTPSKIHTPASSDTALGTRL--------- 483
           LY+L AGGT+W D S P  S      +  +++  +   T A  ++  G  +         
Sbjct: 336 LYFLVAGGTVWIDSSLPATSSKNSLHQSAAVENTNVNVTVAKGNSRRGAAVENSVTDGYD 395

Query: 484 ------LENGEENI-------------------------------------GQFLYLEGA 500
                 LE  EE I                                     G+FLYLEG 
Sbjct: 396 TTSRKGLEYDEEEIHTRNTCEEKPVIPQESNSHHSIHKDTLKDPQDETDDVGRFLYLEGV 455

Query: 501 EYVMYNTYDVHFYSSFALVMLFPKLELSIQRDFAAAVMMHDPGTMKIMSDGKWVARKCLG 560
           EY+M+ TYDVHFY+SFAL+ LFPK+ELSIQR+FA AV+  D   +K +++G W  RK  G
Sbjct: 456 EYIMWCTYDVHFYASFALLELFPKIELSIQREFAKAVLSEDGRRVKFLAEGNWGIRKVRG 515

Query: 561 AVPHDIGLDDPWFEINSYNLFNSSRWKDLNSKFVLQVYRDFVATGDKNFARAVWPSVYIA 620
           AVPHD+G  DPW E+N+YN+ ++S+WKDLN KFVLQVYRDF AT D +F   VWP+V  A
Sbjct: 516 AVPHDLGTHDPWHEMNAYNIHDTSQWKDLNPKFVLQVYRDFAATRDFSFGADVWPAVRAA 575

Query: 621 MAYMEQFDKDGDGMIENEGFPDQTYDAWSANGVSAYCGGLWVAALQAASALANDVGDHAS 680
           M YMEQFD+D DG+IEN+GFPDQTYD W+ +G+SAYCG LW+AALQAA+A+A  +GD   
Sbjct: 576 MEYMEQFDRDSDGLIENDGFPDQTYDTWTVHGISAYCGCLWLAALQAAAAMALQLGDKPF 635

Query: 681 ASYFWVRYQKAKAVY-DSLWNGSYFNYDNSDGSSSTSIQADQLAGQWYARACGLLPIADE 739
           A     ++ KAK V+ + LWNGSYFNYD+   S+S SIQADQLAGQWY  + GL  + D+
Sbjct: 636 AEKCKSKFFKAKLVFEEKLWNGSYFNYDSGSSSNSKSIQADQLAGQWYTASSGLPSLFDD 695

Query: 740 AKVKKALTKIYDFNVLKVKGGMCGAMNGMQPDGRIDMSGLQAREIWPGVTYGLAASMIQE 799
            K+K +L KIYDFNV+KVKGG  GA+NGM P+G++D S +Q+REIW GVTYG+AA+MI  
Sbjct: 696 CKIKSSLHKIYDFNVMKVKGGKMGAVNGMHPNGKVDESCMQSREIWTGVTYGVAATMILS 755

Query: 800 EMVDMAFQTAAGVYEVAWSEDGLG------------------------------------ 823
            M + AF TA G++   WSE+G G                                    
Sbjct: 756 GMEEQAFTTAEGIFTAGWSEEGYGTLDFIDDMHCVVQILNCFVHSGLCVACVILLYCTCR 815

Query: 824 YSFQTPESWNNNDEYRSLCYMRPLTIWAMQWALTKPK 860
           Y FQTPE W  +  +RSL YMRPL IW MQWAL+ P+
Sbjct: 816 YWFQTPEGWTIDGHFRSLIYMRPLAIWGMQWALSMPR 852


>gi|253761389|ref|XP_002489100.1| hypothetical protein SORBIDRAFT_0073s002050 [Sorghum bicolor]
 gi|241947399|gb|EES20544.1| hypothetical protein SORBIDRAFT_0073s002050 [Sorghum bicolor]
          Length = 603

 Score =  591 bits (1524), Expect = e-166,   Method: Compositional matrix adjust.
 Identities = 309/574 (53%), Positives = 395/574 (68%), Gaps = 17/574 (2%)

Query: 312 MWNEIKKHGSFDHLDNDKTSPSEP---GSSIGAAIAASLTIPSGSTRSVTFSLAWDCPEV 368
           MW  + + GSFD  DN    PS P   G ++ AA++AS  +      +V F+LAW  P+V
Sbjct: 1   MWGTMVQDGSFDR-DNFSAGPSMPCLLGDTVCAAVSASTWVEPHGRCTVAFALAWSSPKV 59

Query: 369 KFFE-KVYHRRYTKFYGTLGDSAARIAHDAILEHAKWECEIEAWQRPILEDKRFPEWYPI 427
           KF +   Y+RRYTKFYGT   SA  +  DA++++  WE  I+ WQ PIL D+R PEWY I
Sbjct: 60  KFKKGSTYYRRYTKFYGTSPRSAVNLVEDALMKYKHWEEAIDKWQTPILRDERLPEWYKI 119

Query: 428 TLFNELYYLNAGGTIWTDGSPPMQSLATIRERKFSLDTPSKIHTPASSDTALGTRLLE-- 485
           TLFNELY+L AGGT+W D     +SL +  + K +   P   + P    T   T  L   
Sbjct: 120 TLFNELYFLVAGGTVWIDS----ESLVSDADNKLNPSPPEDSNFPFHDSTCNSTVPLIGF 175

Query: 486 -----NGEENIGQFLYLEGAEYVMYNTYDVHFYSSFALVMLFPKLELSIQRDFAAAVMMH 540
                + +EN+G+FLYLEG EY M+ TYDVHFY+SFAL+ LFPK+ELSIQRDFA AV+  
Sbjct: 176 DPHEIDDKENVGKFLYLEGIEYFMWCTYDVHFYASFALLDLFPKIELSIQRDFARAVLRE 235

Query: 541 DPGTMKIMSDGKWVARKCLGAVPHDIGLDDPWFEINSYNLFNSSRWKDLNSKFVLQVYRD 600
           D   ++ ++DG W  RK +GAV HD+G  DPW E+N+YN+ ++SRWKDLN KFVLQ+YRD
Sbjct: 236 DNRRVRFLADGTWGIRKVIGAVAHDLGAHDPWHELNAYNIHDTSRWKDLNPKFVLQIYRD 295

Query: 601 FVATGDKNFARAVWPSVYIAMAYMEQFDKDGDGMIENEGFPDQTYDAWSANGVSAYCGGL 660
           F ATGDK+F + VWP+V  AM YMEQFD DGD MIEN+GFPDQTYDAW+  GVSAYCG L
Sbjct: 296 FAATGDKSFGKDVWPAVCTAMEYMEQFDHDGDDMIENDGFPDQTYDAWTVQGVSAYCGCL 355

Query: 661 WVAALQAASALANDVGDHASASYFWVRYQKAKAVYDS-LWNGSYFNYDNSDGSSSTSIQA 719
           W+AALQAA+ALA  +G    A     R+ KAK+V+++ LWNGSYFNYD+    +S SIQA
Sbjct: 356 WLAALQAAAALARSLGHDDYAERCMTRFAKAKSVFEARLWNGSYFNYDSGTSYNSRSIQA 415

Query: 720 DQLAGQWYARACGLLPIADEAKVKKALTKIYDFNVLKVKGGMCGAMNGMQPDGRIDMSGL 779
           DQLAGQWY  + GL P+ DE ++K  L KI+D+NV++VKGG  GA+NGM P+G++D + +
Sbjct: 416 DQLAGQWYTASSGLPPLFDEDRIKCTLQKIFDYNVMRVKGGRMGAVNGMHPNGKVDDTCM 475

Query: 780 QAREIWPGVTYGLAASMIQEEMVDMAFQTAAGVYEVAWSEDGLGYSFQTPESWNNNDEYR 839
           Q+REIW GVTY LAA+M+   M   AF TA G+Y   WSE+G GY FQTPE+W  +  YR
Sbjct: 476 QSREIWTGVTYSLAATMLLHGMEHQAFTTAEGIYIAGWSEEGYGYWFQTPEAWTIDGHYR 535

Query: 840 SLCYMRPLTIWAMQWALTKPKLSRQEIKHEISDR 873
           SL YMRPL IWAMQWAL+ P+   +  K    DR
Sbjct: 536 SLIYMRPLAIWAMQWALSPPRSILEAPKVNTMDR 569


>gi|405977906|gb|EKC42333.1| Non-lysosomal glucosylceramidase [Crassostrea gigas]
          Length = 1464

 Score =  590 bits (1520), Expect = e-165,   Method: Compositional matrix adjust.
 Identities = 328/863 (38%), Positives = 483/863 (55%), Gaps = 55/863 (6%)

Query: 10   ENAASMPKVDPGQPPQLTWQRKL--DSNVKVPTGFTLSFQEFRHLAHIGFRLYRYSKQEE 67
            +N+ S PK      P   W+  L  +  +K        F +      +G R  +   +  
Sbjct: 635  DNSISTPKTRIEGIPDFGWRVNLNYECGIKCKPFTQPRFSQVIGFIGLGIRYAKEWWKRR 694

Query: 68   AKGKVPVFNVFRKHHITSDQGIPLGGIGAGSIGRSYRGEFQRFKLFHGICDDAPVLANQF 127
             +G+    +   +    +  G P+GGIG G++GR YRGEF RF++  GI D   + ANQF
Sbjct: 695  RQGRKMFIDHMDQLFHKAIYGAPIGGIGCGTMGRGYRGEFARFQMVPGIYDHTVIQANQF 754

Query: 128  SVFVSRPNGEK-FSSVLCPRSPGVPKKNTDSGIESWDWNLKGENCTYHALFPRAWTVYDG 186
             +   R NG+  +  VL        ++ +   + SW+W    E   YHAL+PR+WTVY  
Sbjct: 755  -ILCIRKNGQTVYQKVL------TGQRESSKSLRSWEWQAPREGDVYHALYPRSWTVYHL 807

Query: 187  EPDPELRIVCRQISPFIPHNYKESSFPASVFTFTLSNSGQTSADVTLLFTWANSVA---- 242
            + +  LR++CRQ+SP  PH+YK++SFP + F +T  N+G    DV+++FT+ N       
Sbjct: 808  D-EFNLRLICRQVSPIFPHDYKDTSFPMATFIWTAENNGTEPLDVSIMFTFKNGRGVKDD 866

Query: 243  GDSGLSGHHFNSKTMTKDGVHGLTLHHRTANGRPPVTFAVAAEETADVHVSECPCFLLSG 302
            G+ G   H F+  +  +  V G++++H   + +   T+ +      DV V+    F   G
Sbjct: 867  GNGGCHNHVFDENSEGRQ-VSGVSINHNIQDMK--CTYCIGGVHKDDVSVTCKEYFDPRG 923

Query: 303  NSKGITAKDMWNEIKKHGSFDHLDNDKTSPSEPGSSIGAAIAASLTIPSGSTRSVTFSLA 362
                 T +++W ++K+ G  +  +  KT+ +  G    AA+     +P    + + F+L 
Sbjct: 924  -----TGQEVWEDLKEDGQLN-TEGGKTTCTGKGEESAAAVCVKANVPPKQCKDLEFTLT 977

Query: 363  WDCPEVKFF--EKVYHRRYTKFYGTLGDSAARIAHDAILEHAKWECEIEAWQRPILEDKR 420
            WD P + F   E+ Y RRY++++G  G++  R++  A   ++ WE +IE WQ+P+L++K 
Sbjct: 978  WDMPIIHFKAKERFYARRYSRWFGLDGEAGPRMSSYAAANYSTWEQKIEEWQKPVLQNKN 1037

Query: 421  FPEWYPITLFNELYYLNAGGTIWTDGSPPMQSLATIRERKFSLDTPSKIHTPASSDTALG 480
             P WY   LFNELYY++ GGT+W D   P++       R      P  +           
Sbjct: 1038 LPAWYKSALFNELYYVSDGGTVWID---PVEGQGGKLVRVTDPKDPPIV----------- 1083

Query: 481  TRLLENGEENIGQFLYLEGAEYVMYNTYDVHFYSSFALVMLFPKLELSIQRDFAAAVMMH 540
                    +   +F YLEG EY MYNTYDVH Y+SF+L+ML+PKL++S+Q D A AV   
Sbjct: 1084 --------QEFSRFGYLEGHEYRMYNTYDVHHYASFSLIMLWPKLQISLQYDIATAVESE 1135

Query: 541  DPGTMKIMSDGKWVARKCLGAVPHDIG--LDDPWFEINSYNLFNSSRWKDLNSKFVLQVY 598
            D   +K ++ G+   RK L AVPHD+G   D+PW  +N Y +  +  WKDLN KFVLQ Y
Sbjct: 1136 DSTRIKFLTSGEVGIRKRLHAVPHDMGDPEDEPWNRVNCYVIHPTCNWKDLNMKFVLQTY 1195

Query: 599  RDFVATGDKNFARAVWPSVYIAMAYMEQFDKDGDGMIENEGFPDQTYDAWSANGVSAYCG 658
            RD+ AT D  + R ++P     M   +++D D DG+I+N GF DQT+DAW+  G SAYCG
Sbjct: 1196 RDYSATKDDAYLRHMYPVAKYVMETAKKWDVDDDGIIDNGGFADQTFDAWTMTGASAYCG 1255

Query: 659  GLWVAALQAASALANDVGDHASASYFWVRYQKAKAVY-DSLWNGSYFNYDNSDGSSSTSI 717
            G+W+AAL+    +A  +       ++     K K  Y + LWNGSY+NYD S G    SI
Sbjct: 1256 GMWLAALRMMIEMATILKKEDDVLHYKEILDKGKVSYQNKLWNGSYYNYDCSSGGHHDSI 1315

Query: 718  QADQLAGQWYARACGLL--PIADEAKVKKALTKIYDFNVLKVKGGMCGAMNGMQPDGRID 775
             ADQLAG W+ +A GL    I    +VK +L KI++ NV+  +GG  GA+NG +PDG  D
Sbjct: 1316 MADQLAGHWFLKASGLQDDDIFPPDRVKSSLKKIFENNVMMFEGGNMGAINGTRPDGSKD 1375

Query: 776  MSGLQAREIWPGVTYGLAASMIQEEMVDMAFQTAAGVYEVAWSEDGLGYSFQTPESWNNN 835
            +S  Q+ E W GVTYGLAA+MIQE ++D AFQTA G Y V W   GL  +FQTPE++  +
Sbjct: 1376 ISSCQSEEFWVGVTYGLAANMIQEGLLDQAFQTAWGAYHVCWEWYGL--AFQTPEAYMTD 1433

Query: 836  DEYRSLCYMRPLTIWAMQWALTK 858
            + YRSL YMRPL IW+MQWAL K
Sbjct: 1434 NIYRSLGYMRPLAIWSMQWALEK 1456


>gi|344271640|ref|XP_003407645.1| PREDICTED: non-lysosomal glucosylceramidase [Loxodonta africana]
          Length = 967

 Score =  588 bits (1516), Expect = e-165,   Method: Compositional matrix adjust.
 Identities = 335/820 (40%), Positives = 451/820 (55%), Gaps = 54/820 (6%)

Query: 44  LSFQEFRHLAHIGFRLYRYSKQE-EAKGKVPVFNVFRKHHITSDQGIPLGGIGAGSIGRS 102
           +S      L  +GFR +++  Q+   + K P  ++     +    G PLGGIG G+I R 
Sbjct: 150 ISLSNMIKLVGVGFRYFQWWYQKTRVEKKTPFIDMLNSVPLRQIYGCPLGGIGGGTITRG 209

Query: 103 YRGEFQRFKLFHGICDDAPVLANQFSVFVSRPNGEKFSSVLCPRSPGVPKKNTDSGIESW 162
           +RG+F R++L  G+     V+A+QF+V + R     +  VL    P V        + SW
Sbjct: 210 WRGQFCRWQLNPGMYQHQTVIADQFTVCLRREGQTVYQQVLSLERPSV--------LRSW 261

Query: 163 DWNLKGENCTYHALFPRAWTVYDGEPDPELRIVCRQISPFIPHNYKESSFPASVFTFTLS 222
           +W L G    YHAL+PRAWTVY   P   + + CRQI+P +PH+Y++SS P  VF + + 
Sbjct: 262 NWGLCGFFAFYHALYPRAWTVYQ-LPGQNVTLTCRQITPILPHDYQDSSLPVGVFVWDVE 320

Query: 223 NSGQTSADVTLLFTWANSVAGDSGLSGHHFNSK-TMTKDG--VHGLTLHHRTANGRPPVT 279
           N G  + DV+++FT  N + G     G  +N    + +DG  V GL LHH T     P T
Sbjct: 321 NEGDEALDVSIMFTMRNGLGGRDDAPGGLWNEPFCLERDGETVQGLLLHHPTPPN--PYT 378

Query: 280 FAVAAEETADVHVSECPCFLLSGNSKGITAKDMWNEIKKHGSFDHLDNDKTSPSEPGSSI 339
            A+AA  TAD  V+    F         T + +W ++ + G  D     +++PS+ G  I
Sbjct: 379 MAMAARLTADTTVTHITAF-----DPDSTGQQVWQDLLQDGQLDS-PAGRSTPSQKGVGI 432

Query: 340 GAAIAASLTIPSGSTRSVTFSLAWDCPEVKFFEK--VYHRRYTKFYGTLGDSAARIAHDA 397
             A+  +  +P      + FSLAWD P + F  K  VY+RRYT+F+G  G +A  ++H A
Sbjct: 433 AGAVCVAAKLPPRGRCCLEFSLAWDMPRIVFGAKGQVYYRRYTRFFGQDGSAAPALSHYA 492

Query: 398 ILEHAKWECEIEAWQRPILEDKRFPEWYPITLFNELYYLNAGGTIWTDGSPPMQSLATIR 457
           +  +A WE  I AWQ PILED+  P WY   LFNELY+L  GGT+W              
Sbjct: 493 LCRYADWEERISAWQSPILEDRSLPAWYKSALFNELYFLADGGTVW-------------- 538

Query: 458 ERKFSLDTPSKIHTPASSDTALGTRLLENGEENIGQFLYLEGAEYVMYNTYDVHFYSSFA 517
                L+ P    T    +   G R L       G+F YLEG EY MYNTYDVHFY+SFA
Sbjct: 539 -----LEVPEDALT---EEPEGGMRQLRPMLREYGRFGYLEGQEYRMYNTYDVHFYASFA 590

Query: 518 LVMLFPKLELSIQRDFAAAVMMHDPGTMKIMSDGKWVARKCLGAVPHDIGL--DDPWFEI 575
           L+ML+PKLELS+Q D A A +  D    + +  G     K    +PHDIG   D+PW  +
Sbjct: 591 LIMLWPKLELSLQYDMALATLKEDLTPRRYLMSGLTAPVKRRNVIPHDIGDPDDEPWLRV 650

Query: 576 NSYNLFNSSRWKDLNSKFVLQVYRDFVATGDKNFARAVWPSVYIAMAYMEQFDKDGDGMI 635
           N+Y + +++ WKDLN KFVLQVYRD+  TGD+ F R +WP     M    +FDKD DG+I
Sbjct: 651 NAYVIHDTADWKDLNLKFVLQVYRDYYLTGDQGFLRDMWPVCLAVMESEMEFDKDHDGLI 710

Query: 636 ENEGFPDQTYDAWSANGVSAYCGGLWVAALQAASALANDVGDHASASYFWVRYQKAKAVY 695
           EN G+ DQTYD W   G SAYCGGLW+AA+     +A           F     + K  Y
Sbjct: 711 ENGGYADQTYDGWVTTGPSAYCGGLWLAAVAVMVQMATLCEAQDIQEKFSSILSRGKEAY 770

Query: 696 DS-LWNGSYFNYDNSDGSSSTSIQADQLAGQWYARACGL----LPIADEAKVKKALTKIY 750
           +  LWNG Y+NYD+S    S SI +DQ AGQW+ RACGL      +     V +AL  I+
Sbjct: 771 ERLLWNGRYYNYDSSSQPQSRSIMSDQCAGQWFLRACGLGEGDTEVFPTPHVVRALQTIF 830

Query: 751 DFNVLKVKGGMCGAMNGMQPDGRIDMSGLQAREIWPGVTYGLAASMIQEEMVDMAFQTAA 810
           +FNV    GG  GA+NGMQP G  D S +Q+ E+W GV YGLAA+MIQE +    F TA 
Sbjct: 831 EFNVQAFAGGAMGAVNGMQPHGVPDTSSVQSDEVWVGVVYGLAATMIQEGLTSKGFWTAE 890

Query: 811 GVYEVAWSEDGLGYSFQTPESWNNNDEYRSLCYMRPLTIW 850
           G Y   W    LG +FQTPE++     +RSL YMRPL+IW
Sbjct: 891 GCYRTVWER--LGLAFQTPEAYCQQRVFRSLAYMRPLSIW 928


>gi|334333160|ref|XP_003341682.1| PREDICTED: LOW QUALITY PROTEIN: non-lysosomal
           glucosylceramidase-like [Monodelphis domestica]
          Length = 950

 Score =  587 bits (1514), Expect = e-165,   Method: Compositional matrix adjust.
 Identities = 332/827 (40%), Positives = 463/827 (55%), Gaps = 60/827 (7%)

Query: 48  EFRHLAHIGFRLYRYSKQE-EAKGKVPVFNVFRKHHITSDQGIPLGGIGAGSIGRSYRGE 106
           + +HL  +GFR  ++  Q+ + + K+P  ++     +    G PLGGIG G+I R +RG+
Sbjct: 115 KMKHLG-MGFRYLKWWYQKTQVEKKIPFIDMLNFLPLRQIYGCPLGGIGGGTITRGWRGQ 173

Query: 107 FQRFKLFHGICDDAPVLANQFSVFVSRPNGEKFSSVLCPRSPGVPKKNTDSGIESWDWNL 166
           F R++L  GI     V+A+QF+V + R     +  VL    P V        + SW+W L
Sbjct: 174 FCRWQLNPGIYQHQTVIADQFTVCLRRGGQTVYQQVLSLERPSV--------LRSWNWGL 225

Query: 167 KGENCTYHALFPRAWTVYDGEPDPELRIVCRQISPFIPHNYKESSFPASVFTFTLSNSGQ 226
            G    YHAL+PRAWTVY   P  ++ + CRQI+P +PH+Y++SS P  VF + + N+  
Sbjct: 226 CGFFAFYHALYPRAWTVYQ-LPGQQVTLTCRQITPILPHDYQDSSLPVGVFVWDVENNSD 284

Query: 227 TSADVTLLFTWANSVAGDSGLSGHHFNSK-TMTKDG--VHGLTLHHRTANGRPPVTFAVA 283
              D++++F+  N + G     G  +N    +  DG  V GL LHH   N   P T A+A
Sbjct: 285 EDLDISIMFSMRNGLGGGHDSPGGLWNEPFCLEHDGETVQGLLLHH--PNPPNPYTMALA 342

Query: 284 AEETADVHVSECPCFLLSGNSKGITAKDMWNEIKKHGSFDHLDNDKTSPSEPGSSIGAAI 343
           A  TA+  V+    F         T + +W ++      D    +   P++ G  IG A+
Sbjct: 343 ARLTANTTVTHITAF-----DPXCTGQKVWQDLLPRXQLDSPLAEWGPPTQKGQGIGGAV 397

Query: 344 AASLTIPSGSTRSVTFSLAWDCPEVKFFEK--VYHRRYTKFYGTLGDSAARIAHDAILEH 401
            A   +P      + F+LAWD P++ F  +   ++RRYT+F+G  G++A  + H A+ ++
Sbjct: 398 CAGTHLPRRGRCQLEFALAWDMPKILFGARGQAHYRRYTRFFGQEGNAAPALCHYALSQY 457

Query: 402 AKWECEIEAWQRPILEDKRFPEWYPITLFNELYYLNAGGTIWTDGSPPMQSLATIRERKF 461
             WE  IEAWQ P+LED+  P WY   LFNELY+L  GGT+W                  
Sbjct: 458 RSWEERIEAWQNPVLEDRSLPAWYKSALFNELYFLADGGTMW------------------ 499

Query: 462 SLDTPSKIHTPASSDTA-LGTRL--LENGEENIGQFLYLEGAEYVMYNTYDVHFYSSFAL 518
                  +  PA + T   G  L  L+   +  G+F YLEG EY MYNTYDVHFY+SFAL
Sbjct: 500 -------VEVPAETTTEEFGGILNQLQPTLQEYGRFAYLEGQEYRMYNTYDVHFYASFAL 552

Query: 519 VMLFPKLELSIQRDFAAAVMMHDPGTMKIMSDGKWVARKCLGAVPHDIGL--DDPWFEIN 576
           +ML+PKLELS+Q D A A +  D    K +  G     K    +PHD+G   D+PW  IN
Sbjct: 553 IMLWPKLELSLQYDMAVATLSEDLTRRKYLMSGVMAPVKKKNVIPHDVGDPDDEPWLRIN 612

Query: 577 SYNLFNSSRWKDLNSKFVLQVYRDFVATGDKNFARAVWPSVYIAMAYMEQFDKDGDGMIE 636
           +Y + +++ WKDLN KFVLQ+YRDF  TG+++F R +WP     M    +FDKD DG+IE
Sbjct: 613 AYLIHDTADWKDLNLKFVLQIYRDFYLTGNESFLRDMWPVCQAVMESEMKFDKDQDGLIE 672

Query: 637 NEGFPDQTYDAWSANGVSAYCGGLWVAALQAASALANDVGDHASASYFWVRYQKAKAVYD 696
           N G+PDQTYD W   G SAYCGGLW+AA+     +A   G       F     + +  Y+
Sbjct: 673 NGGYPDQTYDGWITTGPSAYCGGLWLAAVAVMVQMATVCGTQDIQEKFLSILSRGREAYE 732

Query: 697 S-LWNGSYFNYDNSDGSSSTSIQADQLAGQWYARACGL----LPIADEAKVKKALTKIYD 751
             LWNG Y+NYD+S    S  I +DQ AGQW+ RACGL      +     V +AL  I++
Sbjct: 733 RLLWNGRYYNYDSSTQPQSRIIMSDQCAGQWFLRACGLGEGDTEVFPTPHVIRALQTIFE 792

Query: 752 FNVLKVKGGMCGAMNGMQPDGRIDMSGLQAREIWPGVTYGLAASMIQEEMVDMAFQTAAG 811
            NV    GG  GA+NGMQPDG  D S +Q+ E+W GV YGLAA+MIQE ++   F+TA G
Sbjct: 793 VNVQGFAGGAMGAVNGMQPDGIPDTSSVQSDEVWVGVVYGLAATMIQEGLIQEGFRTAEG 852

Query: 812 VYEVAWSEDGLGYSFQTPESWNNNDEYRSLCYMRPLTIWAMQWALTK 858
            Y   W    LG +FQTPE++  +  +RSL YMRPL+IWAMQ AL +
Sbjct: 853 CYRTVWER--LGMAFQTPEAYCQHRVFRSLAYMRPLSIWAMQLALQQ 897


>gi|426220232|ref|XP_004004320.1| PREDICTED: non-lysosomal glucosylceramidase [Ovis aries]
          Length = 918

 Score =  586 bits (1511), Expect = e-164,   Method: Compositional matrix adjust.
 Identities = 339/817 (41%), Positives = 461/817 (56%), Gaps = 55/817 (6%)

Query: 49  FRHLAHIGFR-LYRYSKQEEAKGKVPVFNVFRKHHITSDQGIPLGGIGAGSIGRSYRGEF 107
            +HL  +G R L  + ++ + + K P  ++     +    G PLGGIG G+I R +RG+F
Sbjct: 102 IKHLG-MGLRYLQWWYRKTQVEKKKPFIDLINSVPLRQIYGCPLGGIGGGTITRGWRGQF 160

Query: 108 QRFKLFHGICDDAPVLANQFSVFVSRPNGEKFSSVLCPRSPGVPKKNTDSGIESWDWNLK 167
            R++L  G+     V+ANQF+V + R     +  VL    P V        + SW+W L 
Sbjct: 161 CRWQLNPGMYQHQTVIANQFTVCLRRRGQTVYQQVLSVERPSV--------LRSWNWGLC 212

Query: 168 GENCTYHALFPRAWTVYDGEPDPELRIVCRQISPFIPHNYKESSFPASVFTFTLSNSGQT 227
           G    YHAL+PRAWTVY   P   + + CRQI+P +PH+Y++SS P  VF + + N G  
Sbjct: 213 GYFAFYHALYPRAWTVYQ-LPGQNVTLTCRQITPILPHDYQDSSLPVGVFVWDVENEGDE 271

Query: 228 SADVTLLFTWANSVAGDSGLSGHHFNSK-TMTKDG--VHGLTLHHRTANGRPPVTFAVAA 284
           + +V+++F+  N + G+    G  +N    + +DG  V GL LHH T     P T AVAA
Sbjct: 272 ALEVSIMFSMRNGLGGEDDAPGGLWNEPFCLERDGETVQGLLLHHPTPPN--PYTMAVAA 329

Query: 285 EETADVHVSECPCFLLSGNSKGITAKDMWNEIKKHGSFDHLDNDKTSPSEPGSSIGAAIA 344
             +AD  V+    F    +S G   + +W ++   G  D L   K+ PS+ G  I  A+ 
Sbjct: 330 RLSADTTVTHLTAF--DPDSAG---QQVWQDLLLDGQLDSLAG-KSLPSQKGVGIAGAVC 383

Query: 345 ASLTIPSGSTRSVTFSLAWDCPEVKFFEK--VYHRRYTKFYGTLGDSAARIAHDAILEHA 402
            S  +P    + + FSLAWD P + F  K  VY+RRYT+F+G  GD+A  ++H A+  +A
Sbjct: 384 VSGKLPPRGRQRLEFSLAWDMPRIMFGAKGRVYYRRYTRFFGPDGDAAPALSHYALSHYA 443

Query: 403 KWECEIEAWQRPILEDKRFPEWYPITLFNELYYLNAGGTIWTDGSPPMQSLATIRERKFS 462
            WE +I AWQ P+LED+  P WY   LFNELY+L  GGT+W                   
Sbjct: 444 DWEEKISAWQNPVLEDRSLPAWYKSALFNELYFLADGGTVW------------------- 484

Query: 463 LDTPSKIHTPASSDTALGTRLLENGEENIGQFLYLEGAEYVMYNTYDVHFYSSFALVMLF 522
           L+ P     P +   ++G   L    +  G+F YLEG EY MYNTYDVHFY+SFALVML+
Sbjct: 485 LEVPED-SLPEALVGSMGQ--LRPLLQEYGRFAYLEGQEYRMYNTYDVHFYASFALVMLW 541

Query: 523 PKLELSIQRDFAAAVMMHDPGTMKIMSDGKWVARKCLGAVPHDIGL--DDPWFEINSYNL 580
           PKLELS+Q D A A +  D    + +  G     K    +PHD+G   D+PW  +N+Y +
Sbjct: 542 PKLELSLQYDMALATLREDLTQRRFLMSGVTAPVKRRNVIPHDVGDPDDEPWLRVNAYEI 601

Query: 581 FNSSRWKDLNSKFVLQVYRDFVATGDKNFARAVWPSVYIAMAYMEQFDKDGDGMIENEGF 640
            ++  WKDLN KFVLQVYRD+  TGD+ F R +WP     M    +FDKD DG+IEN G+
Sbjct: 602 HDTGDWKDLNLKFVLQVYRDYYLTGDQCFLRDMWPVCLAVMESEMKFDKDHDGLIENGGY 661

Query: 641 PDQTYDAWSANGVSAYCGGLWVAALQAASALANDVGDHASASYFWVRYQKAKAVYDS-LW 699
            DQTYD W   G SAYCGGLW+AA+     +A   G   + + F     + +  YD  LW
Sbjct: 662 ADQTYDGWVTTGPSAYCGGLWLAAVAVMVQMAALCGAQEAQNKFSSILSRGQEAYDRLLW 721

Query: 700 NGSYFNYDNSDGSSSTSIQADQLAGQWYARACGLLPIADEA----KVKKALTKIYDFNVL 755
           NG Y+NYD S    S SI +DQ AGQW+ RA GL     E     +V +AL  I++FNV 
Sbjct: 722 NGRYYNYDCSSQPQSYSIMSDQCAGQWFLRASGLGEGDSEVFPTQQVVRALQTIFEFNVQ 781

Query: 756 KVKGGMCGAMNGMQPDGRIDMSGLQAREIWPGVTYGLAASMIQEEMVDMAFQTAAGVYEV 815
              GG  GA+NGMQP G  D S +Q+ E+W GV YGLAA+MIQE +    F+TA G Y  
Sbjct: 782 AFAGGAMGAVNGMQPRGVPDTSSVQSDEVWVGVVYGLAATMIQEGLTWEGFRTAEGCYRT 841

Query: 816 AWSEDGLGYSFQTPESWNNNDEYRSLCYMRPLTIWAM 852
            W    LG +FQTPE++     +RSL YMRPL+IWAM
Sbjct: 842 VWER--LGLAFQTPEAYCQRRVFRSLAYMRPLSIWAM 876


>gi|410925034|ref|XP_003975986.1| PREDICTED: non-lysosomal glucosylceramidase-like [Takifugu
           rubripes]
          Length = 827

 Score =  586 bits (1510), Expect = e-164,   Method: Compositional matrix adjust.
 Identities = 334/824 (40%), Positives = 462/824 (56%), Gaps = 53/824 (6%)

Query: 47  QEFRHLAHIGFR-LYRYSKQEEAKGKVPVFNVFRKHHITSDQGIPLGGIGAGSIGRSYRG 105
           Q + HL  +G R LY + K+   + K P  ++F    +    G PLGGIG G+I R +RG
Sbjct: 34  QVWEHLG-LGIRYLYWWYKKTRVEKKAPFIDIFGAVPLRQIYGAPLGGIGGGTITRGWRG 92

Query: 106 EFQRFKLFHGICDDAPVLANQFSVFVSRPNGEKFSSVLCPRSPGVPKKNTDSGIESWDWN 165
           EF R++L  G+     V+ NQF V + R     +  VL    P          ++ W+W 
Sbjct: 93  EFCRWQLKPGMYHYKTVIENQFMVCLRRNGQTVYQQVLSVERPPT--------LQGWNWG 144

Query: 166 LKGENCTYHALFPRAWTVYDGEPDPELRIVCRQISPFIPHNYKESSFPASVFTFTLSNSG 225
             GE   YHAL+PRAW VY   P   + + CRQISP IPH+Y+++S P +VF + + N  
Sbjct: 145 FCGEYAYYHALYPRAWNVYH-LPGQNVTLTCRQISPIIPHDYQDASLPVAVFVWDIENKN 203

Query: 226 QTSADVTLLFTWANSVAGDSGLSGHHFNSK-TMTKDG--VHGLTLHHRTANGRPPVTFAV 282
             + DV+++FT  N        SG H+N    + K+G  V G+ LHH TA    P T  +
Sbjct: 204 DYALDVSIMFTLLNGSGHADDKSGGHWNEPFRLEKNGEAVSGVLLHHCTAVN--PYTLCI 261

Query: 283 AAEETADVHVSECPCFLLSGNSKGITAKDMWNEIKKHGSFDHLDNDKTSPSEPGSSIGAA 342
           AA E     V+    F   G   G     +W+++   G  D      + P+  G  + AA
Sbjct: 262 AAREQPKREVTHQTAFSPKGTCHG-----LWSDLITDGRLDS-PTGSSPPTAKGERVAAA 315

Query: 343 IAASLTIPSGSTRSVTFSLAWDCPEVKFF--EKVYHRRYTKFYGTLGDSAARIAHDAILE 400
           +AA  ++ + S  S+ F LAWD P++ F   EK + RRYT++YG+ GD+   ++H A+  
Sbjct: 316 LAAGCSVATQSRGSLEFCLAWDMPKITFGSGEKEHSRRYTRYYGSKGDACPLLSHHALTH 375

Query: 401 HAKWECEIEAWQRPILEDKRFPEWYPITLFNELYYLNAGGTIWTDGSPPMQSLATIRERK 460
           +  WE  IE WQ PIL+D   P WY   LFNELY++  GGT+WT+ +        +R  +
Sbjct: 376 YRGWEKSIEEWQGPILQDSSLPTWYKSALFNELYFVADGGTVWTELAEDADVSGGLRSEE 435

Query: 461 FSLDT-PSKIHTPASSDTALGTRLLENGEENIGQFLYLEGAEYVMYNTYDVHFYSSFALV 519
             L   PS I                   ++ G+F YLEG EY MYNTYDVH+Y+SFAL+
Sbjct: 436 GGLPAQPSVI-------------------KDYGRFAYLEGQEYRMYNTYDVHYYASFALI 476

Query: 520 MLFPKLELSIQRDFAAAVMMHDPGTMKIMSDGKWVARKCLGAVPHDIG--LDDPWFEINS 577
           ML+PKL LS+Q D A +V+  DP     + +G+    K  G VPHDIG   D+PW  +N+
Sbjct: 477 MLWPKLALSLQYDIAGSVVQQDPTERLYLMNGRCSPVKTKGVVPHDIGDPEDEPWQRVNA 536

Query: 578 YNLFNSSRWKDLNSKFVLQVYRDFVATGDKNFARAVWPSVYIAMAYMEQFDKDGDGMIEN 637
           Y + N++ WKDLN KFVLQVYRDF  T D  + + +WP     M    +FD DGDG+IEN
Sbjct: 537 YLIHNTADWKDLNLKFVLQVYRDFHLTQDSQYLQDMWPICQTVMESALKFDLDGDGLIEN 596

Query: 638 EGFPDQTYDAWSANGVSAYCGGLWVAALQAASALANDVGDHASASYFWVRYQKAKAVYDS 697
            GF DQTYD W+  G SAYCGGLW+A+L     +A  + +     Y+     +    YD 
Sbjct: 597 SGFADQTYDGWTVTGPSAYCGGLWLASLVVMCKMARLLENEMKYQYYRDILDRGSVAYDK 656

Query: 698 -LWNGSYFNYDNSDGSSSTSIQADQLAGQWYARACGLLPIADEA----KVKKALTKIYDF 752
            LWNG Y+NYD+S    S S+ +DQ AG W+ RA GL     +A    K+++AL  ++D 
Sbjct: 657 VLWNGKYYNYDSSGKDHSNSVMSDQCAGHWFLRASGLGDGEYQAFPKEKIQRALKTVFDL 716

Query: 753 NVLKVKGGMCGAMNGMQPDGRIDMSGLQAREIWPGVTYGLAASMIQEEMVDMAFQTAAGV 812
           NV+   GG  GA+NGM+P+G  D S +Q+ E+W GV YGLAA+MI E M++   +TA G 
Sbjct: 717 NVMSFAGGQMGAVNGMRPEGVPDRSSVQSDEVWIGVVYGLAATMIHEGMLEEGMRTAEGC 776

Query: 813 YEVAWSEDGLGYSFQTPESWNNNDEYRSLCYMRPLTIWAMQWAL 856
           Y   W    LG +FQTPE++     YRSL YMRPL+IWAMQ AL
Sbjct: 777 YRTVWER--LGMAFQTPEAYCEKGIYRSLAYMRPLSIWAMQLAL 818


>gi|291241331|ref|XP_002740565.1| PREDICTED: hypothetical protein [Saccoglossus kowalevskii]
          Length = 792

 Score =  584 bits (1506), Expect = e-164,   Method: Compositional matrix adjust.
 Identities = 327/852 (38%), Positives = 472/852 (55%), Gaps = 91/852 (10%)

Query: 20  PGQPPQLTWQRKLD---SNVKVPTGFTLSFQEFRHLAHIGFRLYRYSKQEEAKGKVPVFN 76
           P   P+  W+ KLD   +  + P   +L  ++   L  + FR   Y +    KG+ P  +
Sbjct: 14  PSTVPEFGWRIKLDHVFAEKRAPLR-SLRLKQIPELLPMFFRYAWYCRGVTKKGRRPYID 72

Query: 77  VFRKHHITSDQGIPLGGIGAGSIGRSYRGEFQRFKLFHGICDDAPVLANQFSVFVSRPNG 136
                  T   G+PLGGIG G I R ++G+F R+ L  G+     V  +QF+V + R   
Sbjct: 73  QIHPLQSTQSYGVPLGGIGCGCINRGWKGDFCRWSLIPGMYTHDTVQTDQFTVCIRRGGR 132

Query: 137 EKFSSVLCPRSPGVPKKNTDSGIESWDWNLKGENCTYHALFPRAWTVYDGEPDPELRIVC 196
             +  VL    P      TD  + +W+W       TYHAL+PRAWTVY   P   + ++C
Sbjct: 133 TTYQQVLSTSRP------TDQNLNTWNWGFNPSCATYHALYPRAWTVYH-LPGQNVTLLC 185

Query: 197 RQISPFIPHNYKESSFPASVFTFTLSNSGQTSADVTLLFTWANSVAGDSGLSGHHFNSK- 255
           RQ+SP +PH+YK++S P +VF + + N G    DVT++F+W N     S  SG  +N   
Sbjct: 186 RQVSPILPHDYKDTSLPVAVFVWEVYNHGNEPIDVTIMFSWQNGTGQKSDKSGGRWNKPF 245

Query: 256 TMTKDGVHGLTLHHRTANGRPPV--TFAVAAEETADVHVSECPCFLLSGNSKGITAKDMW 313
           TM +  V G+ LH    NG P +  T A++A    DV VS    F  SG+       ++W
Sbjct: 246 TMNEGLVDGVLLH----NGHPTISCTMAISAVCKDDVSVSHKTAFDPSGD-----GLELW 296

Query: 314 NEIKKHGSFDHLDNDKTSPSEPGSSIGAAIAASLTIPSGSTRSVTFSLAWDCPEVKFF-- 371
            ++   G  D  +   +  + PG    AA+AA   + + S+ +  F LAWD P V+F   
Sbjct: 297 QDLISDGELDS-EPGASFRTVPGELTAAAVAAKCHVAASSSCTAEFCLAWDMPRVQFMAK 355

Query: 372 EKVYHRRYTKFYGTLGDSAARIAHDAILEHAKWECEIEAWQRPILEDKRFPEWYPITLFN 431
           EK+++RRYT+++GT GD+   ++  A+ ++ +WE  IE WQ PIL+++  P WY   LFN
Sbjct: 356 EKIHNRRYTRWFGTDGDAGPSLSAYALTQYREWEHRIERWQNPILQNESLPSWYKSALFN 415

Query: 432 ELYYLNAGGTIWTDGSPPMQSLATIRERKFSLDTPSKIHTPASSDTALGTRLLENGEENI 491
           ELY+++ GGTIW + +     L               I++   +++ +         +  
Sbjct: 416 ELYFISDGGTIWIEHNKEHNGL---------------INSGDINNSVI---------KQY 451

Query: 492 GQFLYLEGAEYVMYNTYDVHFYSSFALVMLFPKLELSIQRDFAAAVMMHDPGTMKIMSDG 551
           G+F YLEG EY M+NTYDVHFY+SFAL+ML+P+L+LS+Q D                   
Sbjct: 452 GKFAYLEGHEYRMFNTYDVHFYASFALIMLWPQLQLSLQYDID----------------- 494

Query: 552 KWVARKCLGAVPHDIGLDDPWFEINSYNLFNSSRWKDLNSKFVLQVYRDFVATGDKNFAR 611
                            D+PW  +N+Y +++++ WKDLN KFVLQVYRD+  T D  + +
Sbjct: 495 -----------------DEPWVRVNAYLMYDTADWKDLNLKFVLQVYRDYYVTKDFIYLQ 537

Query: 612 AVWPSVYIAMAYMEQFDKDGDGMIENEGFPDQTYDAWSANGVSAYCGGLWVAALQAASAL 671
            +WP V + M     +D DGDG+I+N G+ DQTYDAWS  G SAYCGGLW+AA++A   +
Sbjct: 538 DMWPKVKVVMQTSLAWDTDGDGIIDNSGYADQTYDAWSVTGASAYCGGLWLAAVRAMMEI 597

Query: 672 ANDVGDHASASYFWVRYQKAKAVYD-SLWNGSYFNYDNSDGSSSTSIQADQLAGQWYARA 730
           A  + +      +     + K+ Y+  LWNG Y+NYD+S    S SI  DQ AGQWY RA
Sbjct: 598 AKILDEEDQFKKYSDVLIQGKSSYEKKLWNGKYYNYDSSKQKHSNSIMCDQAAGQWYLRA 657

Query: 731 CGLLPIADEAKVKK----ALTKIYDFNVLKVKGGMCGAMNGMQPDGRIDMSGLQAREIWP 786
           CGL+   DE   K     AL  IY  NV+  + G  GA+NGM+PDG  D +  Q+ E+W 
Sbjct: 658 CGLVQENDEVFPKDHILCALRAIYKMNVMGFEKGTMGAVNGMRPDGSPDTTSFQSEEVWT 717

Query: 787 GVTYGLAASMIQEEMVDMAFQTAAGVYEVAWSEDGLGYSFQTPESWNNNDEYRSLCYMRP 846
           GVTY LAA+MIQE +++ AF TA+GVY+  +   GLG  FQTPE++  N+ YRSL YMRP
Sbjct: 718 GVTYALAANMIQEGLLEEAFTTASGVYKTCYERCGLG--FQTPEAYFANNCYRSLGYMRP 775

Query: 847 LTIWAMQWALTK 858
           L IWAMQ+A+ +
Sbjct: 776 LAIWAMQYAVER 787


>gi|440901933|gb|ELR52793.1| Non-lysosomal glucosylceramidase, partial [Bos grunniens mutus]
          Length = 927

 Score =  583 bits (1503), Expect = e-163,   Method: Compositional matrix adjust.
 Identities = 338/817 (41%), Positives = 458/817 (56%), Gaps = 55/817 (6%)

Query: 49  FRHLAHIGFR-LYRYSKQEEAKGKVPVFNVFRKHHITSDQGIPLGGIGAGSIGRSYRGEF 107
            +HL  +G R L  + ++ + + K P  ++     +    G PLGGIG G+I R +RG+F
Sbjct: 111 IKHLG-MGLRYLQWWYRKTQVEKKKPFIDLINSVPLRQIYGCPLGGIGGGTITRGWRGQF 169

Query: 108 QRFKLFHGICDDAPVLANQFSVFVSRPNGEKFSSVLCPRSPGVPKKNTDSGIESWDWNLK 167
            R++L  G+     V+ANQF+V + R     +  VL    P V        + SW+W L 
Sbjct: 170 CRWQLNPGMYQHQTVIANQFTVCLRRRGQTVYQQVLSVERPSV--------LRSWNWGLC 221

Query: 168 GENCTYHALFPRAWTVYDGEPDPELRIVCRQISPFIPHNYKESSFPASVFTFTLSNSGQT 227
           G    YHAL+PRAWTVY   P   + + CRQI+P +PH+Y++SS P  VF + + N G  
Sbjct: 222 GYFAFYHALYPRAWTVYQ-LPGQNVTLTCRQITPILPHDYQDSSLPVGVFVWDVENEGDE 280

Query: 228 SADVTLLFTWANSVAGDSGLSGHHFNSK-TMTKDG--VHGLTLHHRTANGRPPVTFAVAA 284
           + +V+++F+  N + G+    G  +N    +  DG  V GL LHH T     P T AVAA
Sbjct: 281 ALEVSIMFSMRNGLGGEDDAPGGLWNEPFCLEHDGETVQGLLLHHPTPPN--PYTMAVAA 338

Query: 285 EETADVHVSECPCFLLSGNSKGITAKDMWNEIKKHGSFDHLDNDKTSPSEPGSSIGAAIA 344
             +AD  V+    F    +S G   + +W ++   G  D L   K+ PS+ G  I  A+ 
Sbjct: 339 RLSADTTVTHLTAF--DPDSAG---QQVWQDLLLDGQLDSLAG-KSLPSQKGVGIAGAVC 392

Query: 345 ASLTIPSGSTRSVTFSLAWDCPEVKFFEK--VYHRRYTKFYGTLGDSAARIAHDAILEHA 402
            S  +P      + FSLAWD P + F  K  VY+RRYT+F+G  GD+A  ++H A+  +A
Sbjct: 393 VSGKLPPRGRHRLEFSLAWDMPRIMFGAKGQVYYRRYTRFFGPDGDAAPALSHYALRHYA 452

Query: 403 KWECEIEAWQRPILEDKRFPEWYPITLFNELYYLNAGGTIWTDGSPPMQSLATIRERKFS 462
            WE +I AWQ P+LED+  P WY   LFNELY+L  GGT+W                   
Sbjct: 453 DWEEKISAWQSPVLEDRSLPAWYKSALFNELYFLADGGTVW------------------- 493

Query: 463 LDTPSKIHTPASSDTALGTRLLENGEENIGQFLYLEGAEYVMYNTYDVHFYSSFALVMLF 522
           L+ P     P     ++G   L    +  G+F YLEG EY MYNTYDVHFY+SFAL+ML+
Sbjct: 494 LEVPED-SLPEELVGSMGQ--LRPLLQEYGRFAYLEGQEYRMYNTYDVHFYASFALIMLW 550

Query: 523 PKLELSIQRDFAAAVMMHDPGTMKIMSDGKWVARKCLGAVPHDIG--LDDPWFEINSYNL 580
           PKLELS+Q D A A +  D    + +  G     K    +PHD+G   D+PW  +N+Y +
Sbjct: 551 PKLELSLQYDMALATLREDLTQRRFLMSGVTAPVKRRNVIPHDVGDPDDEPWLRVNAYEI 610

Query: 581 FNSSRWKDLNSKFVLQVYRDFVATGDKNFARAVWPSVYIAMAYMEQFDKDGDGMIENEGF 640
            ++  WKDLN KFVLQVYRD+  TGD+ F R +WP     M    +FDKD DG+IEN G+
Sbjct: 611 HDTGDWKDLNLKFVLQVYRDYYLTGDQCFLRDMWPVCLAVMESEMKFDKDHDGLIENGGY 670

Query: 641 PDQTYDAWSANGVSAYCGGLWVAALQAASALANDVGDHASASYFWVRYQKAKAVYDS-LW 699
            DQTYD W   G SAYCGGLW+AA+     +A   G   + + F     + +  YD  LW
Sbjct: 671 ADQTYDGWVTTGPSAYCGGLWLAAVAVMVQMAALCGAQEAQNKFSSILSRGQEAYDRLLW 730

Query: 700 NGSYFNYDNSDGSSSTSIQADQLAGQWYARACGLLPIADEA----KVKKALTKIYDFNVL 755
           NG Y+NYD S    S SI +DQ AGQW+ RA GL     E     +V +AL  I++FNV 
Sbjct: 731 NGRYYNYDCSSQPQSCSIMSDQCAGQWFLRASGLGEGDSEVFPTQQVVRALQTIFEFNVQ 790

Query: 756 KVKGGMCGAMNGMQPDGRIDMSGLQAREIWPGVTYGLAASMIQEEMVDMAFQTAAGVYEV 815
              GG  GA+NGMQP G  D S +Q+ E+W GV YGLAA+MIQE +    F+TA G Y  
Sbjct: 791 AFAGGAMGAVNGMQPRGVPDTSSVQSDEVWVGVVYGLAATMIQEGLTWEGFRTAEGCYRT 850

Query: 816 AWSEDGLGYSFQTPESWNNNDEYRSLCYMRPLTIWAM 852
            W    LG +FQTPE++     +RSL YMRPL+IWAM
Sbjct: 851 VWER--LGLAFQTPEAYCQRRVFRSLAYMRPLSIWAM 885


>gi|194669510|ref|XP_001790122.1| PREDICTED: non-lysosomal glucosylceramidase [Bos taurus]
 gi|297477943|ref|XP_002689726.1| PREDICTED: non-lysosomal glucosylceramidase [Bos taurus]
 gi|296484715|tpg|DAA26830.1| TPA: hypothetical protein BOS_8918 [Bos taurus]
          Length = 918

 Score =  583 bits (1503), Expect = e-163,   Method: Compositional matrix adjust.
 Identities = 338/817 (41%), Positives = 458/817 (56%), Gaps = 55/817 (6%)

Query: 49  FRHLAHIGFR-LYRYSKQEEAKGKVPVFNVFRKHHITSDQGIPLGGIGAGSIGRSYRGEF 107
            +HL  +G R L  + ++ + + K P  ++     +    G PLGGIG G+I R +RG+F
Sbjct: 102 IKHLG-MGLRYLQWWYRKTQVEKKKPFIDLINSVPLRQIYGCPLGGIGGGTITRGWRGQF 160

Query: 108 QRFKLFHGICDDAPVLANQFSVFVSRPNGEKFSSVLCPRSPGVPKKNTDSGIESWDWNLK 167
            R++L  G+     V+ANQF+V + R     +  VL    P V        + SW+W L 
Sbjct: 161 CRWQLNPGMYQHQTVIANQFTVCLRRRGQTVYQQVLSVERPSV--------LCSWNWGLC 212

Query: 168 GENCTYHALFPRAWTVYDGEPDPELRIVCRQISPFIPHNYKESSFPASVFTFTLSNSGQT 227
           G    YHAL+PRAWTVY   P   + + CRQI+P +PH+Y++SS P  VF + + N G  
Sbjct: 213 GYFAFYHALYPRAWTVYQ-LPGQNVTLTCRQITPILPHDYQDSSLPVGVFVWDVENEGDE 271

Query: 228 SADVTLLFTWANSVAGDSGLSGHHFNSK-TMTKDG--VHGLTLHHRTANGRPPVTFAVAA 284
           + +V+++F+  N + G+    G  +N    +  DG  V GL LHH T     P T AVAA
Sbjct: 272 ALEVSIMFSMRNGLGGEDDAPGGLWNEPFCLEHDGETVQGLLLHHPTPPN--PYTMAVAA 329

Query: 285 EETADVHVSECPCFLLSGNSKGITAKDMWNEIKKHGSFDHLDNDKTSPSEPGSSIGAAIA 344
             +AD  V+    F    +S G   + +W ++   G  D L   K+ PS+ G  I  A+ 
Sbjct: 330 RLSADTTVTHLTAF--DPDSAG---QQVWQDLLLDGQLDSLAG-KSLPSQKGVGIAGAVC 383

Query: 345 ASLTIPSGSTRSVTFSLAWDCPEVKFFEK--VYHRRYTKFYGTLGDSAARIAHDAILEHA 402
            S  +P      + FSLAWD P + F  K  VY+RRYT+F+G  GD+A  ++H A+  +A
Sbjct: 384 VSGKLPPRGRHRLEFSLAWDMPRIMFGAKGQVYYRRYTRFFGPDGDAAPALSHYALRHYA 443

Query: 403 KWECEIEAWQRPILEDKRFPEWYPITLFNELYYLNAGGTIWTDGSPPMQSLATIRERKFS 462
            WE +I AWQ P+LED+  P WY   LFNELY+L  GGT+W                   
Sbjct: 444 DWEEKISAWQSPVLEDRSLPAWYKSALFNELYFLADGGTVW------------------- 484

Query: 463 LDTPSKIHTPASSDTALGTRLLENGEENIGQFLYLEGAEYVMYNTYDVHFYSSFALVMLF 522
           L+ P     P     ++G   L    +  G+F YLEG EY MYNTYDVHFY+SFAL+ML+
Sbjct: 485 LEVPED-SLPEELVGSMGQ--LRPLLQEYGRFAYLEGQEYRMYNTYDVHFYASFALIMLW 541

Query: 523 PKLELSIQRDFAAAVMMHDPGTMKIMSDGKWVARKCLGAVPHDIG--LDDPWFEINSYNL 580
           PKLELS+Q D A A +  D    + +  G     K    +PHD+G   D+PW  +N+Y +
Sbjct: 542 PKLELSLQYDMALATLREDLTQRRFLMSGVTAPVKRRNVIPHDVGDPDDEPWLRVNAYEI 601

Query: 581 FNSSRWKDLNSKFVLQVYRDFVATGDKNFARAVWPSVYIAMAYMEQFDKDGDGMIENEGF 640
            ++  WKDLN KFVLQVYRD+  TGD+ F R +WP     M    +FDKD DG+IEN G+
Sbjct: 602 HDTGDWKDLNLKFVLQVYRDYYLTGDQCFLRDMWPVCLAVMESEMKFDKDHDGLIENGGY 661

Query: 641 PDQTYDAWSANGVSAYCGGLWVAALQAASALANDVGDHASASYFWVRYQKAKAVYDS-LW 699
            DQTYD W   G SAYCGGLW+AA+     +A   G   + + F     + +  YD  LW
Sbjct: 662 ADQTYDGWVTTGPSAYCGGLWLAAVAVMVQMAALCGAQEAQNKFSSILSRGQEAYDRLLW 721

Query: 700 NGSYFNYDNSDGSSSTSIQADQLAGQWYARACGLLPIADEA----KVKKALTKIYDFNVL 755
           NG Y+NYD S    S SI +DQ AGQW+ RA GL     E     +V +AL  I++FNV 
Sbjct: 722 NGRYYNYDCSSQPQSCSIMSDQCAGQWFLRASGLGEGDSEVFPTQQVVRALQTIFEFNVQ 781

Query: 756 KVKGGMCGAMNGMQPDGRIDMSGLQAREIWPGVTYGLAASMIQEEMVDMAFQTAAGVYEV 815
              GG  GA+NGMQP G  D S +Q+ E+W GV YGLAA+MIQE +    F+TA G Y  
Sbjct: 782 AFAGGAMGAVNGMQPCGVPDTSSVQSDEVWVGVVYGLAATMIQEGLTREGFRTAEGCYRT 841

Query: 816 AWSEDGLGYSFQTPESWNNNDEYRSLCYMRPLTIWAM 852
            W    LG +FQTPE++     +RSL YMRPL+IWAM
Sbjct: 842 VWER--LGLAFQTPEAYCQRRVFRSLAYMRPLSIWAM 876


>gi|351707038|gb|EHB09957.1| Non-lysosomal glucosylceramidase [Heterocephalus glaber]
          Length = 922

 Score =  582 bits (1501), Expect = e-163,   Method: Compositional matrix adjust.
 Identities = 339/830 (40%), Positives = 464/830 (55%), Gaps = 64/830 (7%)

Query: 50  RHLAHIGFRLYRY-SKQEEAKGKVPVFNVFRKHHITSDQGIPLGGIGAGSIGRSYRGEFQ 108
           +HL  +G R  ++  ++ + + K P  ++     +    G PLGGIG G+I R +RG+F 
Sbjct: 105 KHLG-MGIRYLKWWYRKTQVEKKKPFIDMINCVPLRQIYGCPLGGIGGGTITRGWRGQFC 163

Query: 109 RFKLFHGICDDAPVLANQFSVFVSRPNGEKFSSVLCPRSPGVPKKNTDSGIESWDWNLKG 168
           R++L  GI     V+A+QF+V + R     +  VL    P V        + SW+W L G
Sbjct: 164 RWQLNPGIYQHQTVIADQFTVCLRREGRTVYQQVLSMERPSV--------LHSWNWGLCG 215

Query: 169 ENCTYHALFPRAWTVYDGEPDPELRIVCRQISPFIPHNYKESSFPASVFTFTLSNSGQTS 228
               YHAL+PRAWTVY   P   + + CRQI+P +PH+Y++SS P  VF + + N G  +
Sbjct: 216 YFAFYHALYPRAWTVYQ-LPGQNITLTCRQITPILPHDYEDSSLPVGVFVWDVENEGDEA 274

Query: 229 ADVTLLFTWANSVAGDSGLSGHHFNSK-TMTKDG--VHGLTLHHRTANGRPPVTFAVAAE 285
            DV+++F+  N + G+    G  +N    + +DG  V GL LHH T     P T AVAA 
Sbjct: 275 VDVSIMFSMRNGLGGEDDAQGGLWNEPFCLEQDGETVQGLLLHHPTLPN--PYTMAVAAR 332

Query: 286 ETADVHVSECPCFLLSGNSKGITAKDMWNEIKKHGSFDHL--DNDKTSPSEPGSSIGAAI 343
            T D  V+    F         T   +W ++ + G  D    D  +++PS+ G+ I  A+
Sbjct: 333 LTEDSTVTHITAF-----DPDSTGHQVWQDLLQDGQLDSPAGDGGRSNPSQKGAGIAGAL 387

Query: 344 AASLTIPSGSTRSVTFSLAWDCPEVKFFEK--VYHRRYTKFYGTLGDSAARIAHDAILEH 401
             S  +P     S+ FSLAWD P + F  K  V++RRYT+F+G  GD+A  ++H A+  +
Sbjct: 388 CVSGKLPPRGRCSLEFSLAWDMPRIMFGAKGQVHYRRYTRFFGPDGDAAPALSHYALCHY 447

Query: 402 AKWECEIEAWQRPILEDKRFPEWYPITLFNELYYLNAGGTIWTDGSPPMQSLATIRERKF 461
           A WE  I AWQ P+L+D+  P WY   LFNELY+L  GGT+W                  
Sbjct: 448 ADWEDRISAWQSPVLDDRSLPAWYKSALFNELYFLADGGTVW------------------ 489

Query: 462 SLDTPSKIHTPASSDTALGTRLLENGEENIGQFLYLEGAEYVMYNTYDVHFYSSFALVML 521
            L+ P     P     ++G   L       G+F YLEG EY MYNTYDVHFY+SFALVML
Sbjct: 490 -LEVPED-SLPEELVGSMGQ--LRPILREYGRFGYLEGQEYRMYNTYDVHFYASFALVML 545

Query: 522 FPKLELSIQRDFAAAVMMHDPGTMKIMSDGKWVARKCLGAVPHDIG--LDDPWFEINSYN 579
           +PK+ELS+Q D A A +  D    + +  G     K    +PHDIG   D+PW  +N+Y 
Sbjct: 546 WPKIELSLQYDMALATLREDLTRRRYLMSGVMGPVKRRNVIPHDIGDPDDEPWLRVNAYL 605

Query: 580 LFNSSRWKDLNSKFVLQVYRDFVATGDKNFARAVWPSVYIAMAYMEQFDKDGDGMIENEG 639
           + +++ WKDLN KFVLQVYRD+  TGD+ F + +WP     M    +FDKD DG+IEN G
Sbjct: 606 IHDTADWKDLNLKFVLQVYRDYYLTGDQGFLKDMWPVCLAVMESEMKFDKDHDGLIENGG 665

Query: 640 FPDQTYDAWSANGVSAYCGGLWVAAL-----QAASALANDVGDHASASYFWVRYQKAKAV 694
           + DQTYD W   G SAYCGGLW+AA+      A    A DV D  S+    +  +  +A 
Sbjct: 666 YADQTYDGWVTTGPSAYCGGLWLAAVAVTVQMAVLCAAQDVQDKFSS----ILSRGQEAY 721

Query: 695 YDSLWNGSYFNYDNSDGSSSTSIQADQLAGQWYARACGL----LPIADEAKVKKALTKIY 750
              LWNG Y+NYD+S    S S+ +DQ AGQW+ +ACGL      +     V +AL  I+
Sbjct: 722 ERLLWNGRYYNYDSSSQPQSRSVMSDQCAGQWFLKACGLGEGDTEVFPTTHVVRALQTIF 781

Query: 751 DFNVLKVKGGMCGAMNGMQPDGRIDMSGLQAREIWPGVTYGLAASMIQEEMVDMAFQTAA 810
           + NV    GG  GA+NGMQP G  D S +Q+ E+W GV YGLAA+MIQE +    FQTA 
Sbjct: 782 ELNVQSFAGGAMGAVNGMQPHGIPDRSSVQSDEVWVGVVYGLAATMIQEGLTWEGFQTAE 841

Query: 811 GVYEVAWSEDGLGYSFQTPESWNNNDEYRSLCYMRPLTIWAMQWALTKPK 860
           G Y   W    LG +FQTPE++     +RSL YMRPL+IWAMQ AL + K
Sbjct: 842 GCYRTVWER--LGLAFQTPEAYCQQQVFRSLAYMRPLSIWAMQLALQQQK 889


>gi|260813884|ref|XP_002601646.1| hypothetical protein BRAFLDRAFT_124315 [Branchiostoma floridae]
 gi|229286945|gb|EEN57658.1| hypothetical protein BRAFLDRAFT_124315 [Branchiostoma floridae]
          Length = 871

 Score =  581 bits (1498), Expect = e-163,   Method: Compositional matrix adjust.
 Identities = 322/841 (38%), Positives = 461/841 (54%), Gaps = 100/841 (11%)

Query: 46  FQEFRHLAHIGFRLYRYSKQEEAKGKVPVFNVFRKHHITSDQGIPLGGIGAGSIGRSYRG 105
           F++ + L  + +R + +  +++   K P  +      +    G+PLGG+G GSI R ++G
Sbjct: 104 FKQLKDLYGVFYRYFFWWMKKKTGHKRPFIDALNPVQLQQIYGVPLGGLGCGSICRGWKG 163

Query: 106 EFQRFKLFHGICDDAPVLANQFSVFVSRPNGEKFSSVLCPRSPGVPKKNTDSGIESWDWN 165
           +F R+ L  GI      +ANQF+V V R     +  VL  R P        S +  W+W+
Sbjct: 164 DFCRWALLPGIYSYDIKMANQFTVCVRRRGQTTYQQVLSTRHP--------SCLTEWNWS 215

Query: 166 LKGENCTYHALFPRAWTVYDGEPDPELRIVCRQISPFIPHNYKESSFPASVFTFTLSNSG 225
               +  YHAL+PRAWTVY+  P   + + CRQ+SP +PH+YK+SS PA+VF +T  N G
Sbjct: 216 YSPSHAYYHALYPRAWTVYE-LPGQNITLTCRQVSPVLPHDYKDSSIPAAVFVWTAENHG 274

Query: 226 QTSADVTLLFTWANSVAGDSGLSGHHFNSKTMTK----DGVHGLTLHHRTANGRPPVTFA 281
           +   D++L+FT+ NS+  D+ LSG H+N          D V G+ LHH+  +   P T A
Sbjct: 275 KEEVDISLMFTFQNSIGADTDLSGGHWNEPFSVNEEGVDPVSGILLHHQ--HKTMPCTIA 332

Query: 282 VAAEETADVHVSECPCFLLSGNSKGITAKDMWNEIKKHGSFDHLDNDKTSPSEPGSSIGA 341
           +       + V+    F  +G     + K++W ++   G     +   T+ +  G ++  
Sbjct: 333 LGIPHKEGIRVTGSTSFNPAG-----SGKEIWRDLMYDGELGSPEA-PTNKTVKGEALAG 386

Query: 342 AIAASLTIPSGSTRSVTFSLAWDCPEVKFFE--KVYHRRYTKFYGTLGDSAARIAHDAIL 399
           A+ AS  +PSG  +   F L+WD P +KF    + Y R YT ++G  G +A ++A  A+ 
Sbjct: 387 AVCASFKLPSGGKQDTDFVLSWDMPVIKFVGSCRTYKRWYTGWFGDDGTAAPKLAAYALS 446

Query: 400 EHAKWECEIEAWQRPILEDKRFPEWYPITLFNELYYLNAGGTIW-----TDGSPPMQSLA 454
            + +WE  IE WQRPIL+++ +PEWY   LFNELY+++ GGTIW      DG        
Sbjct: 447 HYPEWESTIEEWQRPILDNQSYPEWYRSALFNELYFMSDGGTIWLVAEDNDGHVENNLPP 506

Query: 455 TIRERKFSLDTPSKIHTPASSDTALGTRLLENGEENIGQFLYLEGAEYVMYNTYDVHFYS 514
            +RE+     TP+ I                      G+F YLEG EY MYNTYDVH Y+
Sbjct: 507 WLREK-----TPAHIM-------------------EYGRFGYLEGHEYRMYNTYDVHHYA 542

Query: 515 SFALVMLFPKLELSIQRDFAAAVMMHDPGTMKIMSDGKWVARKCLGAVPHDIGLDDPWFE 574
           SFAL ML+P+LE SIQ D+                                   D+P+  
Sbjct: 543 SFALAMLWPQLEYSIQYDYD----------------------------------DEPFIH 568

Query: 575 INSYNLFNSSRWKDLNSKFVLQVYRDFVATGDKNFARAVWPSVYIAMAYMEQFDKDGDGM 634
           +N+Y + ++S W+DLN KFVLQ++RD+    DK F   +WP     M     +D D DGM
Sbjct: 569 VNAYVMHDTSDWRDLNLKFVLQIFRDYFVGRDKGFLAFMWPRAQAVMTKALTWDTDQDGM 628

Query: 635 IENEGFPDQTYDAWSANGVSAYCGGLWVAALQAASALANDVGDHASASYFWVRYQKAKAV 694
           I+N G  DQTYDAW   G SAYCGGLW+AAL+    +AN +G      ++    Q+AK  
Sbjct: 629 IDNSGAADQTYDAWIVTGASAYCGGLWLAALRCMVEIANILGKEEEEQHYLAVMQRAKTA 688

Query: 695 Y-DSLWNGSYFNYDNSDGSSSTSIQADQLAGQWYARACGLLPIADEA-----------KV 742
           Y + LWNG Y+NYD S  S S SI A   +G WY RACGL+P  +E             V
Sbjct: 689 YEEKLWNGEYYNYDCSGRSYSNSIMAGAGSGHWYLRACGLVPEDNEVMDSDREVLPLDHV 748

Query: 743 KKALTKIYDFNVLKVKGGMCGAMNGMQPDGRIDMSGLQAREIWPGVTYGLAASMIQEEMV 802
             +L K+YDFNV++   G  GA+NGM+P+G+ D++ +Q+ E+W G+TY LAASMIQE MV
Sbjct: 749 ASSLRKVYDFNVMQFHQGTMGAVNGMRPNGKKDLTSMQSEEVWTGITYALAASMIQEGMV 808

Query: 803 DMAFQTAAGVYEVAWSEDGLGYSFQTPESWNNNDEYRSLCYMRPLTIWAMQWALTKPKLS 862
           +  FQTA GVY + + +   G++FQTPE++   D +RSL YMRPL IWAMQWAL     +
Sbjct: 809 EEGFQTAYGVYNMCYLQ--CGFAFQTPEAYLERDFFRSLGYMRPLAIWAMQWALESRHRA 866

Query: 863 R 863
           R
Sbjct: 867 R 867


>gi|348520215|ref|XP_003447624.1| PREDICTED: non-lysosomal glucosylceramidase-like [Oreochromis
           niloticus]
          Length = 856

 Score =  579 bits (1492), Expect = e-162,   Method: Compositional matrix adjust.
 Identities = 331/819 (40%), Positives = 458/819 (55%), Gaps = 52/819 (6%)

Query: 53  AHIGF---RLYRYSKQEEAKGKVPVFNVFRKHHITSDQGIPLGGIGAGSIGRSYRGEFQR 109
            HIG     L  + K+ + + K P  ++F    +    G PLGGIG G+I R +RGEF R
Sbjct: 58  GHIGLGVRYLKFWYKKTQVEKKAPFIDMFGAQPLRQIYGAPLGGIGGGTITRGWRGEFCR 117

Query: 110 FKLFHGICDDAPVLANQFSVFVSRPNGEKFSSVLCPRSPGVPKKNTDSGIESWDWNLKGE 169
           ++L  G+     V+ NQF+V + R     +  VL    P          ++ W+W   GE
Sbjct: 118 WQLNPGMYHYKTVIPNQFTVCLRREGKTVYQQVLSVERPPT--------LQGWNWGYCGE 169

Query: 170 NCTYHALFPRAWTVYDGEPDPELRIVCRQISPFIPHNYKESSFPASVFTFTLSNSGQTSA 229
              YHAL+PRAWTVYD  P   + + CRQISP IPH+Y++SS P +V  + + N  +   
Sbjct: 170 YAFYHALYPRAWTVYD-LPGQNVTLTCRQISPVIPHDYQDSSLPVAVLVWDIENKNEYDL 228

Query: 230 DVTLLFTWANSVAGDSGLSGHHFNSK-TMTKDG--VHGLTLHHRTANGRPPVTFAVAAEE 286
           DV+++FT  N         G H+N    + K+G  V G+ LHH TA    P T  +AA E
Sbjct: 229 DVSIMFTMVNGSGQKDDKCGGHWNEPFHLEKEGEAVSGVLLHHCTAVN--PYTLCIAARE 286

Query: 287 TADVHVSECPCFLLSGNSKGITAKDMWNEIKKHGSFDHLDNDKTSPSEPGSSIGAAIAAS 346
            +D  +S    F   G         +WN++   G  D      + P+  G  + AA+A  
Sbjct: 287 QSDREISHQTAFSPKG-----PCFSLWNDLIDDGRLDS-PTGSSPPTVKGEKVAAAVAVG 340

Query: 347 LTIPSGSTRSVTFSLAWDCPEVKFF--EKVYHRRYTKFYGTLGDSAARIAHDAILEHAKW 404
            ++P+    S+ F LAWD P++ F   EK + RRYT+++GT GD++  ++H A+  + +W
Sbjct: 341 CSVPAQGHNSLEFCLAWDMPKMTFGSQEKEHTRRYTRYFGTKGDASPSLSHYALTHYKQW 400

Query: 405 ECEIEAWQRPILEDKRFPEWYPITLFNELYYLNAGGTIWTDGSPPMQSLATIRERKFSLD 464
           E  IE WQRPIL+D   P WY   LFNELY++  GGT+WT+    M   A I     S D
Sbjct: 401 ERSIEEWQRPILQDSSLPSWYKSALFNELYFVADGGTVWTE----MTEDADISGGICSED 456

Query: 465 TPSKIHTPASSDTALGTRLLENGEENIGQFLYLEGAEYVMYNTYDVHFYSSFALVMLFPK 524
                  PA               +  G+F YLEG EY MYNTYDVHFYSSFAL+ML+PK
Sbjct: 457 G----GLPAQPAVI----------KEYGRFAYLEGQEYRMYNTYDVHFYSSFALIMLWPK 502

Query: 525 LELSIQRDFAAAVMMHDPGTMKIMSDGKWVARKCLGAVPHDIGL--DDPWFEINSYNLFN 582
           L LS+Q D A +V+  D      +  G++   K    VPHDIG   D+PW  +N+Y + +
Sbjct: 503 LALSVQYDIAGSVVQQDLTERLYLMSGQYSPVKTKNVVPHDIGDPDDEPWMRVNAYLIHD 562

Query: 583 SSRWKDLNSKFVLQVYRDFVATGDKNFARAVWPSVYIAMAYMEQFDKDGDGMIENEGFPD 642
           ++ WKDLN KFVLQVYRDF  T D+ + + +WP     M    +FD DGDG+IEN G+ D
Sbjct: 563 TAGWKDLNLKFVLQVYRDFHLTQDRQYLQDMWPICQAVMESELKFDLDGDGLIENSGYAD 622

Query: 643 QTYDAWSANGVSAYCGGLWVAALQAASALANDVGDHASASYFWVRYQKAKAVYDS-LWNG 701
           QTYD W   G SAYCGG+WVA+L     +A  +    S  ++     +  A ++  LWNG
Sbjct: 623 QTYDGWIVTGPSAYCGGMWVASLCVMCKMARLLDSEESYRHYKDLLDRGSAAFEKLLWNG 682

Query: 702 SYFNYDNSDGSSSTSIQADQLAGQWYARACGL----LPIADEAKVKKALTKIYDFNVLKV 757
            Y+NYD+S    S S+ +DQ AG W+ RA GL         + K++ AL  ++D NV+  
Sbjct: 683 KYYNYDSSGRELSNSVMSDQCAGHWFLRASGLGDGEFQAFPKEKIQSALKSVFDLNVMSF 742

Query: 758 KGGMCGAMNGMQPDGRIDMSGLQAREIWPGVTYGLAASMIQEEMVDMAFQTAAGVYEVAW 817
            GG  GA+NGM+P+G  D S +Q+ E+W GV YGLAA+MI E M +   +TA G Y   W
Sbjct: 743 AGGQMGAVNGMRPEGVPDHSSVQSDEVWIGVVYGLAATMIHEGMQEEGMRTAEGCYRTVW 802

Query: 818 SEDGLGYSFQTPESWNNNDEYRSLCYMRPLTIWAMQWAL 856
            +  LG +FQTPE++   + YRSL YMRPL+IWAMQ AL
Sbjct: 803 EK--LGMAFQTPEAYCEKNIYRSLAYMRPLSIWAMQLAL 839


>gi|73971791|ref|XP_531991.2| PREDICTED: non-lysosomal glucosylceramidase [Canis lupus familiaris]
          Length = 1136

 Score =  579 bits (1492), Expect = e-162,   Method: Compositional matrix adjust.
 Identities = 337/817 (41%), Positives = 458/817 (56%), Gaps = 55/817 (6%)

Query: 49   FRHLAHIGFR-LYRYSKQEEAKGKVPVFNVFRKHHITSDQGIPLGGIGAGSIGRSYRGEF 107
             +HL  +G R L  + ++ + + K P  ++     +    G PLGGIG G+I R +RG+F
Sbjct: 323  IKHLG-MGLRYLQWWYRKTQVEKKTPFIDLINCVPLRQIYGCPLGGIGGGTITRGWRGQF 381

Query: 108  QRFKLFHGICDDAPVLANQFSVFVSRPNGEKFSSVLCPRSPGVPKKNTDSGIESWDWNLK 167
             R++L  G+     V+A+QF+V + R     +  VL    P V        + SW+W L 
Sbjct: 382  CRWQLNPGMYQHRTVIADQFTVCLRREGKTVYQQVLSVERPSV--------LRSWNWGLC 433

Query: 168  GENCTYHALFPRAWTVYDGEPDPELRIVCRQISPFIPHNYKESSFPASVFTFTLSNSGQT 227
            G    YHAL+PRAWTVY   P   + + CRQI+P +PH+Y++SS P  VF + + N G  
Sbjct: 434  GYFAFYHALYPRAWTVYQ-LPGQNVTLTCRQITPILPHDYQDSSLPVGVFVWDVENEGDE 492

Query: 228  SADVTLLFTWANSVAG-DSGLSGHHFNSKTMTKDG--VHGLTLHHRTANGRPPVTFAVAA 284
            + DV+++F+  N + G D  + G       + +DG  V GL LHH       P T AVAA
Sbjct: 493  ALDVSIMFSMRNGLGGGDDAIGGLWNEPFCLERDGETVQGLLLHHPALPN--PYTMAVAA 550

Query: 285  EETADVHVSECPCFLLSGNSKGITAKDMWNEIKKHGSFDHLDNDKTSPSEPGSSIGAAIA 344
              TAD  V+    F         T + +W ++ + G  D     +++PS+ G+ I  A+ 
Sbjct: 551  RLTADTTVTHITAF-----DPDSTGQQVWQDLLQDGQLDS-PAGRSTPSQKGAGIAGAVC 604

Query: 345  ASLTIPSGSTRSVTFSLAWDCPEVKFFEK--VYHRRYTKFYGTLGDSAARIAHDAILEHA 402
             +  +P      + FSLAWD P + F  K  VY+RRYT+F+G  G++A  ++H A+ ++ 
Sbjct: 605  VTGKLPPRGQCRLEFSLAWDMPRIMFGAKGQVYYRRYTRFFGRDGNAAPALSHYALCQYV 664

Query: 403  KWECEIEAWQRPILEDKRFPEWYPITLFNELYYLNAGGTIWTDGSPPMQSLATIRERKFS 462
             WE +I AWQ P+L+D+  P WY   LFNELY+L  GGT+W +   P  SL         
Sbjct: 665  DWEEKISAWQSPVLDDRSLPAWYKSALFNELYFLADGGTVWLE--VPEDSL--------- 713

Query: 463  LDTPSKIHTPASSDTALGTRLLENGEENIGQFLYLEGAEYVMYNTYDVHFYSSFALVMLF 522
               P K+     S   L   L E      G+F YLEG EY MYNTYDVHFY+SFALVML+
Sbjct: 714  ---PEKL---VESMCQLRPILRE-----YGRFGYLEGQEYRMYNTYDVHFYASFALVMLW 762

Query: 523  PKLELSIQRDFAAAVMMHDPGTMKIMSDGKWVARKCLGAVPHDIGL--DDPWFEINSYNL 580
            PKLELS+Q D A A +  D    + +  G     K    +PHDIG   D+PW  +N+Y +
Sbjct: 763  PKLELSLQYDMAVATLREDLTRRQYLMSGVMAPVKRRNVIPHDIGDPDDEPWLRVNAYVV 822

Query: 581  FNSSRWKDLNSKFVLQVYRDFVATGDKNFARAVWPSVYIAMAYMEQFDKDGDGMIENEGF 640
             +++ WKDLN KFVLQVYRD+  TGD+ F R +WP     M    +FDKD DG+IEN G+
Sbjct: 823  HDTADWKDLNLKFVLQVYRDYYLTGDQGFLRDMWPVCLAVMESEMKFDKDQDGLIENGGY 882

Query: 641  PDQTYDAWSANGVSAYCGGLWVAALQAASALANDVGDHASASYFWVRYQKAKAVYDS-LW 699
             DQTYD W   G SAYCGGLW+AA+     +A   G H     F    ++ +  Y+  LW
Sbjct: 883  ADQTYDGWVTTGPSAYCGGLWLAAVAVMVQMAILCGAHDVQDKFSSILRRGQEAYERLLW 942

Query: 700  NGSYFNYDNSDGSSSTSIQADQLAGQWYARACGL----LPIADEAKVKKALTKIYDFNVL 755
            NG Y+NYD S    S SI +DQ AGQW+ RA GL      +     V  AL  I++FNV 
Sbjct: 943  NGRYYNYDCSPQPQSCSIMSDQCAGQWFLRASGLGEGDTEVFPTRHVVCALQTIFEFNVR 1002

Query: 756  KVKGGMCGAMNGMQPDGRIDMSGLQAREIWPGVTYGLAASMIQEEMVDMAFQTAAGVYEV 815
               GG  GA+NGMQP G  D S +Q+ E+W GV YGLAA+MIQE +    FQTA G Y  
Sbjct: 1003 AFAGGAMGAVNGMQPHGVPDRSSVQSDEVWVGVVYGLAATMIQEGLTWEGFQTAEGCYRT 1062

Query: 816  AWSEDGLGYSFQTPESWNNNDEYRSLCYMRPLTIWAM 852
             W    LG +FQTPE++     +RSL YMRPL+IWAM
Sbjct: 1063 VWER--LGLAFQTPEAYCQQRVFRSLAYMRPLSIWAM 1097


>gi|385648282|ref|NP_001245314.1| glucosidase, beta (bile acid) 2 [Sus scrofa]
          Length = 915

 Score =  576 bits (1485), Expect = e-161,   Method: Compositional matrix adjust.
 Identities = 333/817 (40%), Positives = 448/817 (54%), Gaps = 62/817 (7%)

Query: 52  LAHIGFRL------YRYSKQEEAKGKVPVFNVFRKHHITSDQGIPLGGIGAGSIGRSYRG 105
           + H G  L      YR ++ E+   K P  ++     +    G PLGGIG G+I R +RG
Sbjct: 104 IKHFGMGLRYLQWWYRKTRVEK---KTPFIDLINCVPLRQIYGCPLGGIGGGTITRGWRG 160

Query: 106 EFQRFKLFHGICDDAPVLANQFSVFVSRPNGEKFSSVLCPRSPGVPKKNTDSGIESWDWN 165
           +F R++L  G+     V+A+QF+V + R     +  VL    P V        + SW+W 
Sbjct: 161 QFCRWQLNPGMYRHQTVIADQFTVCLRRKGQTVYQQVLSVEHPSV--------LRSWNWG 212

Query: 166 LKGENCTYHALFPRAWTVYDGEPDPELRIVCRQISPFIPHNYKESSFPASVFTFTLSNSG 225
           L G    YHAL+PRAWTVY   P   + + CRQI+P +PH+Y++SS P  VF +   N G
Sbjct: 213 LCGSFAFYHALYPRAWTVYQ-LPGQSVTLTCRQITPILPHDYQDSSLPVGVFVWDAENEG 271

Query: 226 QTSADVTLLFTWANSVAGDSGLSGHHFNSK-TMTKDG--VHGLTLHHRTANGRPPVTFAV 282
             + DV+++F+  N + GD    G  +N    + +DG  + GL LHH T     P T A+
Sbjct: 272 DETLDVSIMFSMRNGLGGDEDAPGGLWNEPFCLERDGETIQGLLLHHPTLPN--PYTMAM 329

Query: 283 AAEETADVHVSECPCFLLSGNSKGITAKDMWNEIKKHGSFDHLDNDKTSPSEPGSSIGAA 342
           AA  TAD  V+    F   G     T + +W ++   G  D     ++ PS+ G  I  A
Sbjct: 330 AARLTADTTVTHITAFDPDG-----TGQQVWQDLLXDGQLDS-PAGQSPPSQKGVGIAGA 383

Query: 343 IAASLTIPSGSTRSVTFSLAWDCPEVKFFEK--VYHRRYTKFYGTLGDSAARIAHDAILE 400
           +  +  +P G  R + FSLAWD P + F  K  VYHRRYT+F+G  GD+A  ++H A+  
Sbjct: 384 VCVTTRLPPGGRRRLEFSLAWDMPRIMFGAKGQVYHRRYTRFFGPDGDAAPALSHHALSR 443

Query: 401 HAKWECEIEAWQRPILEDKRFPEWYPITLFNELYYLNAGGTIWTDGSPPMQSLATIRERK 460
           +A WE  I AWQ P+L+D+  P WY   LFNELY+L  GGT+W +   P  SL       
Sbjct: 444 YADWEERISAWQSPVLDDRSLPAWYKSALFNELYFLADGGTVWLE--VPEDSLP------ 495

Query: 461 FSLDTPSKIHTPASSDTALGTRLLENGEENIGQFLYLEGAEYVMYNTYDVHFYSSFALVM 520
                          D       L    +  G+F YLEG EY MYNTYDVHFY+SFAL+M
Sbjct: 496 --------------EDLVGSMCQLRPILQEYGRFGYLEGQEYRMYNTYDVHFYASFALIM 541

Query: 521 LFPKLELSIQRDFAAAVMMHDPGTMKIMSDGKWVARKCLGAVPHDIGL--DDPWFEINSY 578
           L+PKLELS+Q D A A +  D    + +  G     K    +PHDIG   D+PW  +N+Y
Sbjct: 542 LWPKLELSLQYDMALATLREDLTERRYLMSGITAPVKRRNVIPHDIGDPDDEPWLRVNAY 601

Query: 579 NLFNSSRWKDLNSKFVLQVYRDFVATGDKNFARAVWPSVYIAMAYMEQFDKDGDGMIENE 638
            + +++ WKDLN KFVLQVYRD+  TGD+ F R +WP     M    +FDKD DG+IEN 
Sbjct: 602 VVHDTADWKDLNLKFVLQVYRDYYLTGDQVFLRDMWPVCLAVMESEMKFDKDQDGLIENG 661

Query: 639 GFPDQTYDAWSANGVSAYCGGLWVAALQAASALANDVGDHASASYFWVRYQKAKAVYDS- 697
           G+ DQTYD W   G SAYCGGLW+AA+     +A   G       F     + +  Y+  
Sbjct: 662 GYADQTYDGWVTTGPSAYCGGLWLAAVAVMVQMAALCGTREVQDQFSSILSRGREAYERL 721

Query: 698 LWNGSYFNYDNSDGSSSTSIQADQLAGQWYARACGL----LPIADEAKVKKALTKIYDFN 753
           LWNG Y+NYD S    + SI +DQ AGQW+ RA GL      +     V +AL  I++FN
Sbjct: 722 LWNGRYYNYDCSSQPQAYSIMSDQCAGQWFLRASGLGEGDTEVFPTPHVVQALQTIFEFN 781

Query: 754 VLKVKGGMCGAMNGMQPDGRIDMSGLQAREIWPGVTYGLAASMIQEEMVDMAFQTAAGVY 813
           V    GG  GA+NGMQP G  D S +Q+ E+W GV Y LAA+MIQE +    F+TA G Y
Sbjct: 782 VQAFAGGAMGAVNGMQPHGVPDRSSVQSDEVWVGVVYALAATMIQEGLTWEGFRTAEGCY 841

Query: 814 EVAWSEDGLGYSFQTPESWNNNDEYRSLCYMRPLTIW 850
              W    LG +FQTPE++     +RSL YMRPL+IW
Sbjct: 842 RTVWER--LGLAFQTPEAYCQQRVFRSLAYMRPLSIW 876


>gi|332228461|ref|XP_003263408.1| PREDICTED: LOW QUALITY PROTEIN: non-lysosomal glucosylceramidase
           [Nomascus leucogenys]
          Length = 919

 Score =  576 bits (1484), Expect = e-161,   Method: Compositional matrix adjust.
 Identities = 331/818 (40%), Positives = 454/818 (55%), Gaps = 64/818 (7%)

Query: 52  LAHIGFRLYRY----SKQEEAKGKVPVFNVFRKHHITSDQGIPLGGIGAGSIGRSYRGEF 107
           + HIG  L RY     ++ + + K P  ++     +    G PLGGIG G+I R +RG+F
Sbjct: 104 IKHIGMGL-RYLQWWYRKTQVEKKTPFIDMINSVPLRQIYGCPLGGIGGGTITRGWRGQF 162

Query: 108 QRFKLFHGICDDAPVLANQFSVFVSRPNGEKFSSVLCPRSPGVPKKNTDSGIESWDWNLK 167
            R++L  G+     V+A+QF+V + R     +  VL    P V        + SW+W L 
Sbjct: 163 CRWQLNPGMYQHRTVIADQFTVCLRREGQTVYQQVLSLERPSV--------LRSWNWGLC 214

Query: 168 GENCTYHALFPRAWTVYDGEPDPELRIVCRQISPFIPHNYKESSFPASVFTFTLSNSGQT 227
           G  C  HAL+PR WTVY   P PE  + CRQI+P +PH+Y++SS P  VF + + N G  
Sbjct: 215 GYFCFLHALYPRXWTVYQ-LPGPECSLTCRQITPILPHDYQDSSLPVGVFVWDVENEGDE 273

Query: 228 SADVTLLFTWANSVAGDSGLSGHHFNSKTMTKDG---VHGLTLHHRTANGRPPVTFAVAA 284
           + DV+++F+  N + G     G  +N     + G   V GL LHH T     P T AVAA
Sbjct: 274 ALDVSIMFSMRNGLGGGDDAPGGLWNEPFCLERGGETVQGLLLHHPTLPN--PYTMAVAA 331

Query: 285 EETADVHVSECPCFLLSGNSKGITAKDMWNEIKKHGSFDHLDNDKTSPSEPGSSIGAAIA 344
             TAD  V+    F         T + +W ++ + G  D     +++P++ G  I  A+ 
Sbjct: 332 RVTADTTVTHITAF-----DPDSTGQQVWQDLLQDGQLDS-PTGQSTPTQKGVGIAGAVC 385

Query: 345 ASLTI-PSGSTRSVTFSLAWDCPEVKFFEK--VYHRRYTKFYGTLGDSAARIAHDAILEH 401
            S  + P G  R + FSLAWD P + F  K  V++RRYT+F+G  GD+A  ++H A+ ++
Sbjct: 386 VSSKLQPRGQCR-LEFSLAWDMPRIMFGAKGQVHYRRYTRFFGQDGDAAPTLSHYALCQY 444

Query: 402 AKWECEIEAWQRPILEDKRFPEWYPITLFNELYYLNAGGTIWTDGSPPMQSLATIRERKF 461
            +WE  I AWQ P+L+D+  P WY   LFNELY+L  GGT+W         L  + +   
Sbjct: 445 PEWEERISAWQSPVLDDRSLPAWYKSALFNELYFLADGGTVW---------LEVLED--- 492

Query: 462 SLDTPSKIHTPASSDTALGTRL--LENGEENIGQFLYLEGAEYVMYNTYDVHFYSSFALV 519
                       S    LG  +  L     + G+F YLEG EY MYNTYDVHFY+SFAL+
Sbjct: 493 ------------SLPEELGRNMCHLRPTLRDYGRFGYLEGQEYRMYNTYDVHFYASFALI 540

Query: 520 MLFPKLELSIQRDFAAAVMMHDPGTMKIMSDGKWVARKCLGAVPHDIGL--DDPWFEINS 577
           ML+PKLELS+Q D A A +  D    + +  G     K    +PHDIG   D+PW  +N+
Sbjct: 541 MLWPKLELSLQYDMALATLREDLTRRRYLMSGVMAPVKRRNVIPHDIGDPDDEPWLRVNA 600

Query: 578 YNLFNSSRWKDLNSKFVLQVYRDFVATGDKNFARAVWPSVYIAMAYMEQFDKDGDGMIEN 637
           Y + +++ WKDLN KFVLQVYRD+  TGD+NF + +WP     M    +FDKD DG+IEN
Sbjct: 601 YLIHDTADWKDLNLKFVLQVYRDYYLTGDQNFLKDMWPVCLAVMESEMKFDKDHDGLIEN 660

Query: 638 EGFPDQTYDAWSANGVSAYCGGLWVAALQAASALANDVGDHASASYFWVRYQKAKAVYDS 697
            G+ DQTYD W   G SAYCGGLW+AA+     +A   G       F     + +  Y+ 
Sbjct: 661 GGYADQTYDGWVTTGPSAYCGGLWLAAVAVMVQMAALCGAQDIQDKFSSILSRGQEAYER 720

Query: 698 -LWNGSYFNYDNSDGSSSTSIQADQLAGQWYARACGL----LPIADEAKVKKALTKIYDF 752
            LWNG Y+NYD+S    S S+ +DQ AGQW+ +ACGL      +     V +AL  I++ 
Sbjct: 721 LLWNGRYYNYDSSSRPQSRSVMSDQCAGQWFLKACGLGEGDTEVFPTPHVVRALQTIFEL 780

Query: 753 NVLKVKGGMCGAMNGMQPDGRIDMSGLQAREIWPGVTYGLAASMIQEEMVDMAFQTAAGV 812
           NV    GG  GA+NGMQP G  D S +Q+ E+W GV YGLAA+MIQE +    FQTA G 
Sbjct: 781 NVQAFAGGAMGAVNGMQPHGVPDKSSVQSDEVWVGVVYGLAATMIQEGLTWEGFQTAEGC 840

Query: 813 YEVAWSEDGLGYSFQTPESWNNNDEYRSLCYMRPLTIW 850
           Y   W    LG +FQTPE++     +RSL YMRPL+IW
Sbjct: 841 YRTVWER--LGLAFQTPEAYCQQRVFRSLAYMRPLSIW 876


>gi|390458158|ref|XP_002743170.2| PREDICTED: non-lysosomal glucosylceramidase [Callithrix jacchus]
          Length = 1065

 Score =  575 bits (1483), Expect = e-161,   Method: Compositional matrix adjust.
 Identities = 341/842 (40%), Positives = 470/842 (55%), Gaps = 63/842 (7%)

Query: 52   LAHIGFRLYRY----SKQEEAKGKVPVFNVFRKHHITSDQGIPLGGIGAGSIGRSYRGEF 107
            + HIG  L RY     ++ + + K P  ++     +    G PLGGIG G+I R +RG+F
Sbjct: 254  IKHIGMGL-RYLQWWYRKTQVEKKTPFIDMINSVPLRQIYGCPLGGIGGGTITRGWRGQF 312

Query: 108  QRFKLFHGICDDAPVLANQFSVFVSRPNGEKFSSVLCPRSPGVPKKNTDSGIESWDWNLK 167
             R++L  G+     V+A+QF+V + R     +  VL    P V        + SW+W L 
Sbjct: 313  CRWQLNPGMYRHQTVIADQFTVCLRREGQTVYQQVLSLERPSV--------LRSWNWGLC 364

Query: 168  GENCTYHALFPRAWTVYDGEPDPELRIVCRQISPFIPHNYKESSFPASVFTFTLSNSGQT 227
            G    YHAL+PRAWTVY   P   + + CRQI+P +PH+Y++SS P  VF + + N G  
Sbjct: 365  GYFAFYHALYPRAWTVYQ-LPGQNVTLTCRQITPILPHDYQDSSLPVGVFVWDVENEGDE 423

Query: 228  SADVTLLFTWANSVAGDSGLSGHHFNSK-TMTKDG--VHGLTLHHRTANGRPPVTFAVAA 284
              DV+++F+  N + G     G  +N    + +DG  V GL LHH T     P T AVAA
Sbjct: 424  DLDVSIMFSMRNGLGGGDDAPGGLWNEPFHLERDGKTVQGLLLHHPTLPN--PYTMAVAA 481

Query: 285  EETADVHVSECPCFLLSGNSKGITAKDMWNEIKKHGSFDHLDNDKTSPSEPGSSIGAAIA 344
              TAD  V+    F   G     T + +W ++ + G  D     +++P++ G  +  A+ 
Sbjct: 482  RVTADTTVTHITAFDPDG-----TGQQVWQDLLQDGQLDS-PAGQSTPTQKGVGVAGAVC 535

Query: 345  ASLTI-PSGSTRSVTFSLAWDCPEVKFFEK--VYHRRYTKFYGTLGDSAARIAHDAILEH 401
             S  + P G  R + FSLAWD P + F  K  V++RRYT+F+G  GD+A  ++H A+ ++
Sbjct: 536  VSSKLRPRGQCR-LEFSLAWDMPRIMFGAKGQVHYRRYTRFFGQDGDAAPALSHYALCQY 594

Query: 402  AKWECEIEAWQRPILEDKRFPEWYPITLFNELYYLNAGGTIWTDGSPPMQSLATIRERKF 461
             +WE  I AWQ P+L+D+  P WY   LFNELY+L  GGT+W +          +RE   
Sbjct: 595  TEWEERISAWQNPVLDDRLLPAWYKSALFNELYFLADGGTVWLE----------VREDSL 644

Query: 462  SLDTPSKIHTPASSDTALGTRLLENGEENIGQFLYLEGAEYVMYNTYDVHFYSSFALVML 521
                      P   D  +    L     + G+F YLEG EY MYNTYDVHFY+SFAL+ML
Sbjct: 645  ----------PQELDGNMCH--LRPTLRDYGRFGYLEGQEYRMYNTYDVHFYASFALIML 692

Query: 522  FPKLELSIQRDFAAAVMMHDPGTMKIMSDGKWVARKCLGAVPHDIGL--DDPWFEINSYN 579
            +PKLELS+Q D A A +  D    + +  G     K    +PHDIG   D+PW  +N+Y 
Sbjct: 693  WPKLELSLQYDMALATLREDLTRRRYLMSGVMAPVKRRNVIPHDIGDPDDEPWLRVNAYL 752

Query: 580  LFNSSRWKDLNSKFVLQVYRDFVATGDKNFARAVWPSVYIAMAYMEQFDKDGDGMIENEG 639
            + +++ WKDLN KFVLQVYRD+  TGD+NF + +WP     M    +FDKD DG+IEN G
Sbjct: 753  IHDTADWKDLNLKFVLQVYRDYYLTGDQNFLKDMWPVCLAVMESEMKFDKDHDGLIENGG 812

Query: 640  FPDQTYDAWSANGVSAYCGGLWVAALQAASALANDVGDHASASYFWVRYQKAKAVYDS-L 698
            + DQTYD W   G SAYCGGLW+AA+     +A   G       F     + +  Y+  L
Sbjct: 813  YADQTYDGWVTTGPSAYCGGLWLAAVAVMVQMAAVCGAQDIQDKFSSILSRGQEAYERLL 872

Query: 699  WNGSYFNYDNSDGSSSTSIQADQLAGQWYARACGL----LPIADEAKVKKALTKIYDFNV 754
            WNG Y+NYD+S    S S+ +DQ AGQW+ +ACGL      +     V +AL  I++ NV
Sbjct: 873  WNGRYYNYDSSSQPQSRSVMSDQCAGQWFLKACGLGEGDTEVFPTPHVVRALQTIFELNV 932

Query: 755  LKVKGGMCGAMNGMQPDGRIDMSGLQAREIWPGVTYGLAASMIQEEMVDMAFQTAAGVYE 814
                GG  GA+NGMQP G  D S +Q+ E+W GV YGLAA+MIQE +    FQTA G Y 
Sbjct: 933  QAFAGGAMGAVNGMQPHGVPDKSSVQSDEVWVGVVYGLAATMIQEGLTWEGFQTAEGCYR 992

Query: 815  VAWSEDGLGYSFQTPESWNNNDEYRSLCYMRPLTIWAMQWALTK---PKLSRQEIKHEIS 871
              W    LG +FQTPE++     +RSL YMRPL+IWAMQ AL +    K SR ++K +  
Sbjct: 993  TVWER--LGLAFQTPEAYCQQRVFRSLAYMRPLSIWAMQLALQQQQHKKASRPKVKQDTG 1050

Query: 872  DR 873
             R
Sbjct: 1051 LR 1052


>gi|198424051|ref|XP_002127036.1| PREDICTED: similar to Non-lysosomal glucosylceramidase (NLGase)
           (Glucosylceramidase 2) (Beta-glucocerebrosidase 2)
           (Beta-glucosidase 2) [Ciona intestinalis]
          Length = 861

 Score =  575 bits (1481), Expect = e-161,   Method: Compositional matrix adjust.
 Identities = 327/853 (38%), Positives = 468/853 (54%), Gaps = 49/853 (5%)

Query: 24  PQLTWQRKLDSNVKV-PTGFTLS-FQEFRHLAHIGFRLYRYSKQEEAKGKVPVFNVFRKH 81
           P+  W+  L    K  PT ++     +   +  +  R  +Y  Q   K K P  + F +H
Sbjct: 41  PEHGWKIALSHKFKTKPTPWSFPPLTQIPSVLGLAVRYLQYYNQLRRKNKKPCMDFFNQH 100

Query: 82  HITSDQGIPLGGIGAGSIGRSYRGEFQRFKLFHGICDDAPVLANQFSVFVSRPNGEKFSS 141
            +    G+PLGGIG+G+I R ++G+F +++L  GI      LA+QF V + R N      
Sbjct: 101 PLRPIYGVPLGGIGSGTINRGWQGDFCKWQLKPGIYTFDCPLADQFIVSIRRHNTVVCQQ 160

Query: 142 VLCPRSPGVPKKNTDSGIESWDWNLKGENCTYHALFPRAWTVYDGEPDPELRIVCRQISP 201
           VL  + P      +   + +W+W   G    Y+ L+PRAWTVYD +    + + CRQISP
Sbjct: 161 VLSTKGP------SGEALRTWNWKYCGSYAYYNGLYPRAWTVYDLQGQ-NIVLTCRQISP 213

Query: 202 FIPHNYKESSFPASVFTFTLSNSGQTSADVTLLFTWANSVAGD------SGLSGHHFNSK 255
             P++YK++S PA+VF + + N G    +VT++ +  N           SGL    F S 
Sbjct: 214 VFPNDYKDTSLPAAVFVWDVENRGNEEVEVTIVMSLENGTGNPEQQKKVSGLWNEPF-SN 272

Query: 256 TMTKDGVHGLTLHHRTANGRPPVTFAVAAEETADVHVSECPCFLLSGNSKGITAKDMWNE 315
           T   +   G+ +H+ T    P  T  VAA E   V V+ C  F  + N       ++WN 
Sbjct: 273 TDHNNKAKGVVMHNTTYRDMP-YTLTVAAAEKPGVDVTHCTSFNPTHN-----GAEVWNS 326

Query: 316 IKKHGSFDHLDNDKTSPSEPGSSIGAAIAASLTI-PSGSTRSVTFSLAWDCPEVKFFE-- 372
           +   G        K+S +  G     A+AAS  + P     S+ F+L WD P++ F +  
Sbjct: 327 LLNTGRLQSKPG-KSSLTATGEHTACAVAASCKVKPHSMAGSMDFALTWDMPQIHFGDAG 385

Query: 373 KVYHRRYTKFYGTLGDSA-ARIAHDAILEHAKWECEIEAWQRPILEDKRFPEWYPITLFN 431
           K Y RRYT+++   GD+A A IA  A+  +  WE +IEAWQRPIL++K+ P+WY   +FN
Sbjct: 386 KHYKRRYTRWFN--GDNAGATIASYALDNYQTWEDKIEAWQRPILDNKQLPDWYKSAIFN 443

Query: 432 ELYYLNAGGTIWTDGSPPMQSLATIRERKFSLDTPSKIHTPASSDTALGTRLLENGEENI 491
           ELYY+  GGT+W + +      A  +        P  +      D+        +  +  
Sbjct: 444 ELYYITDGGTVWLEVAEEENGTANGK-------VPETLQNTTKVDSC-------DFLKEY 489

Query: 492 GQFLYLEGAEYVMYNTYDVHFYSSFALVMLFPKLELSIQRDFAAAVMMHDPGTMKIMSDG 551
           G+F YLEG EY MYNTYDVHFY+S AL  L+PKLELS+Q D A +++  +P   K + DG
Sbjct: 490 GKFAYLEGQEYKMYNTYDVHFYASIALAYLWPKLELSVQYDIATSILHSNPQPHKYLMDG 549

Query: 552 KWVARKCLGAVPHDIGL--DDPWFEINSYNLFNSSRWKDLNSKFVLQVYRDFVATGDKNF 609
                K    VPHD+G   D+PW  +N+Y + +++ W+DLN KFVLQ YRD+  T D +F
Sbjct: 550 VTAPVKTPNVVPHDVGCPEDEPWLRVNTYFVHDTADWRDLNPKFVLQAYRDYYITKDIDF 609

Query: 610 ARAVWPSVYIAMAYMEQFDKDGDGMIENEGFPDQTYDAWSANGVSAYCGGLWVAALQAAS 669
            +A+WP   I M    + DKD DG+IEN G  DQT+D W   G SAYCGGLW+AAL+   
Sbjct: 610 LKAMWPICKIVMEQSMRHDKDNDGLIENSGAADQTFDGWCVTGPSAYCGGLWLAALRCME 669

Query: 670 ALANDVGDHASASYFW-VRYQKAKAVYDSLWNGSYFNYDNSDG-SSSTSIQADQLAGQWY 727
             A+ + +      F  +  +  KA  D LWNG YFNYD S+     TS+ ADQ AGQW+
Sbjct: 670 EAADILHEKDDQVKFHDILVRATKAYQDRLWNGKYFNYDCSNNPQHRTSVMADQCAGQWF 729

Query: 728 ARACGLLPIADEAKVKKALTKIYDFNVLKVKGGMCGAMNGMQPDGRIDMSGLQAREIWPG 787
             A G+  +  + +V  +L  I+  NV +  GG  GA+NGM+P G +D S +Q+ E+W G
Sbjct: 730 LLASGICDVLPDDQVMSSLKTIFSLNVEQYDGGRHGAVNGMRPHGAVDTSSVQSDEVWTG 789

Query: 788 VTYGLAASMIQEEMVDMAFQTAAGVYEVAWSEDGLGYSFQTPESWNNNDEYRSLCYMRPL 847
           VTY LAA+MI + M+   F+TA+G+Y   W   G+  +FQTPE+   N  YRSL YMRPL
Sbjct: 790 VTYALAATMIHKGMLKEGFRTASGIYRSCWQRYGM--AFQTPEAMRKNHTYRSLAYMRPL 847

Query: 848 TIWAMQWALTKPK 860
           +IW MQ+AL  PK
Sbjct: 848 SIWGMQYALENPK 860


>gi|402897018|ref|XP_003911574.1| PREDICTED: non-lysosomal glucosylceramidase [Papio anubis]
          Length = 927

 Score =  574 bits (1480), Expect = e-161,   Method: Compositional matrix adjust.
 Identities = 342/859 (39%), Positives = 473/859 (55%), Gaps = 67/859 (7%)

Query: 52  LAHIGFRLYRY----SKQEEAKGKVPVFNVFRKHHITSDQGIPLGGIGAGSIGRSYRGEF 107
           + HIG  L RY     ++ + + K P  ++     +    G PLGGIG G+I R +RG+F
Sbjct: 112 IKHIGMGL-RYLQWWYRKTQVEKKTPFIDMINSVPLRQIYGCPLGGIGGGTITRGWRGQF 170

Query: 108 QRFKLFHGICDDAPVLANQFSVFVSRPNGEKFSSVLCPRSPGVPKKNTDSGIESWDWNLK 167
            R++L  G+     V+A+QF+V + R     +  VL    P V        + SW+W L 
Sbjct: 171 CRWQLNPGMYQHRTVIADQFTVCLRREGQTVYQQVLSLERPSV--------LRSWNWGLC 222

Query: 168 GENCTYHALFPRAWTVYDGEPDPELRIVCRQISPFIPHNYKESSFPASVFTFTLSNSGQT 227
           G    YHAL+PRAWTVY   P   + + CRQI+P +PH+Y++SS P  VF + + N G  
Sbjct: 223 GYFAFYHALYPRAWTVYQ-LPGQNVTLTCRQITPILPHDYQDSSLPVGVFVWDVENEGDE 281

Query: 228 SADVTLLFTWANSVAGDSGLSGHHFNSKTMTKDG---VHGLTLHHRTANGRPPVTFAVAA 284
           + DV+++F+  N + G     G  +N     + G   V GL LHH T     P T AVAA
Sbjct: 282 ALDVSIMFSMRNGLGGGDDAPGGLWNEPFCLEHGGETVQGLLLHHPTLPN--PYTMAVAA 339

Query: 285 EETADVHVSECPCFLLSGNSKGITAKDMWNEIKKHGSFDHLDNDKTSPSEPGSSIGAAIA 344
             TAD  V+    F         T + +W ++ + G  D     +++P++ G  I  A+ 
Sbjct: 340 RVTADTTVTHITAF-----DPDSTGQQVWQDLLQDGQLDS-PAGQSTPTQKGVGIAGAVC 393

Query: 345 ASLTI-PSGSTRSVTFSLAWDCPEVKFFEK--VYHRRYTKFYGTLGDSAARIAHDAILEH 401
            S  + P G  R + FSLAWD P + F  K  V++RRYT+F+G  GD+A  ++H A+ ++
Sbjct: 394 VSSKLRPRGQCR-LEFSLAWDMPRILFGAKGQVHYRRYTRFFGQDGDAAPALSHYALCQY 452

Query: 402 AKWECEIEAWQRPILEDKRFPEWYPITLFNELYYLNAGGTIWTDGSPPMQSLATIRERKF 461
           A+WE  I AWQ P+L+D+  P WY   LFNELY+L  GGT+W +          +RE   
Sbjct: 453 AEWEERISAWQSPVLDDRSLPAWYKSALFNELYFLADGGTVWLE----------VREDSL 502

Query: 462 SLDTPSKIHTPASSDTALGTRL--LENGEENIGQFLYLEGAEYVMYNTYDVHFYSSFALV 519
             +              LG  +  L     + G+F YLEG EY MYNTYDVHFY+SFAL+
Sbjct: 503 PEE--------------LGRNMCHLCPTLRDYGRFGYLEGQEYRMYNTYDVHFYASFALI 548

Query: 520 MLFPKLELSIQRDFAAAVMMHDPGTMKIMSDGKWVARKCLGAVPHDIGL--DDPWFEINS 577
           ML+PKLELS+Q D A A +  D    + +  G     K    +PHDIG   D+PW  +N+
Sbjct: 549 MLWPKLELSLQYDMALATLREDLTRRRYLMSGVMAPVKRRNVIPHDIGDPDDEPWLRVNA 608

Query: 578 YNLFNSSRWKDLNSKFVLQVYRDFVATGDKNFARAVWPSVYIAMAYMEQFDKDGDGMIEN 637
           Y + +++ WKDLN KFVLQVYRD+  TGD+NF + +WP     M    +FDKD DG+IEN
Sbjct: 609 YLIHDTADWKDLNLKFVLQVYRDYYLTGDQNFLKDMWPVCLAVMESEMKFDKDHDGLIEN 668

Query: 638 EGFPDQTYDAWSANGVSAYCGGLWVAALQAASALANDVGDHASASYFWVRYQKAKAVYDS 697
            G+ DQTYD W   G S YCGGLW+AA+     +A   G       F     + +  Y+ 
Sbjct: 669 GGYADQTYDGWVTTGPSVYCGGLWLAAVAVMVQMAALCGAQDIQDKFSSILHRGQEAYER 728

Query: 698 -LWNGSYFNYDNSDGSSSTSIQADQLAGQWYARACGL----LPIADEAKVKKALTKIYDF 752
            LWNG Y+NYD+S    S S+ +DQ AGQW+ +ACGL      +     V +AL  I++ 
Sbjct: 729 LLWNGRYYNYDSSSQPQSRSVMSDQCAGQWFLKACGLGEGETEVFPTPHVVRALQTIFEL 788

Query: 753 NVLKVKGGMCGAMNGMQPDGRIDMSGLQAREIWPGVTYGLAASMIQEEMVDMAFQTAAGV 812
           NV    GG  GA+NGMQP G  D S +Q+ E+W GV YGLAA+MIQE +    FQTA G 
Sbjct: 789 NVQAFAGGAMGAVNGMQPHGVPDKSSVQSDEVWVGVVYGLAATMIQEGLTWEGFQTAEGC 848

Query: 813 YEVAWSEDGLGYSFQTPESWNNNDEYRSLCYMRPLTIWAMQWALTK---PKLSRQEIKHE 869
           Y   W    LG +FQTPE++     +RSL YMRPL+IWAMQ AL +    K SR +++  
Sbjct: 849 YRTVWER--LGLAFQTPEAYCQQRVFRSLAYMRPLSIWAMQLALQQQQHKKASRPKVEQG 906

Query: 870 ISDRDSSYLEQHAAFSKVA 888
              R     E   A + V+
Sbjct: 907 TGLRTGPMFEPKEAMANVS 925


>gi|109111050|ref|XP_001084705.1| PREDICTED: non-lysosomal glucosylceramidase [Macaca mulatta]
          Length = 927

 Score =  574 bits (1480), Expect = e-161,   Method: Compositional matrix adjust.
 Identities = 331/818 (40%), Positives = 456/818 (55%), Gaps = 64/818 (7%)

Query: 52  LAHIGFRLYRY----SKQEEAKGKVPVFNVFRKHHITSDQGIPLGGIGAGSIGRSYRGEF 107
           + HIG  L RY     ++ + + K P  ++     +    G PLGGIG G+I R +RG+F
Sbjct: 112 IKHIGMGL-RYLQWWYRKTQVEKKTPFIDMINSVPLRQIYGCPLGGIGGGTITRGWRGQF 170

Query: 108 QRFKLFHGICDDAPVLANQFSVFVSRPNGEKFSSVLCPRSPGVPKKNTDSGIESWDWNLK 167
            R++L  G+     V+A+QF+V + R     +  VL    P V        + SW+W L 
Sbjct: 171 CRWQLNPGMYQHQTVIADQFTVCLRREGQTVYQQVLSLERPSV--------LRSWNWGLC 222

Query: 168 GENCTYHALFPRAWTVYDGEPDPELRIVCRQISPFIPHNYKESSFPASVFTFTLSNSGQT 227
           G    YHAL+PRAWTVY   P  ++ + CRQI+P +PH+Y++SS P  VF + + N G  
Sbjct: 223 GYFAFYHALYPRAWTVYQ-LPGQDVTLTCRQITPILPHDYQDSSLPVGVFVWDVENEGDE 281

Query: 228 SADVTLLFTWANSVAGDSGLSGHHFNSKTMTKDG---VHGLTLHHRTANGRPPVTFAVAA 284
           + DV+++F+  N + G     G  +N     + G   V GL LHH T     P T AVAA
Sbjct: 282 ALDVSIMFSMRNGLGGGDDAPGGLWNEPFCLEHGGETVQGLLLHHPTLPN--PYTMAVAA 339

Query: 285 EETADVHVSECPCFLLSGNSKGITAKDMWNEIKKHGSFDHLDNDKTSPSEPGSSIGAAIA 344
             TAD  V+    F         T + +W ++ + G  D     +++P++ G  I  A+ 
Sbjct: 340 RVTADTTVTHITAF-----DPDSTGQQVWQDLLQDGQLDS-PAGQSTPTQKGVGIAGAVC 393

Query: 345 ASLTI-PSGSTRSVTFSLAWDCPEVKFFEK--VYHRRYTKFYGTLGDSAARIAHDAILEH 401
            S  + P G  R + FSLAWD P + F  K  V++RRYT+F+G  GD+A  ++H A+  +
Sbjct: 394 VSSKLRPRGQCR-LEFSLAWDMPRILFGAKGQVHYRRYTRFFGQDGDAAPALSHYALCRY 452

Query: 402 AKWECEIEAWQRPILEDKRFPEWYPITLFNELYYLNAGGTIWTDGSPPMQSLATIRERKF 461
           A+WE  I AWQ P+L+D+  P WY   LFNELY+L  GGT+W +          +RE   
Sbjct: 453 AEWEERISAWQSPVLDDRSLPAWYKSALFNELYFLADGGTVWLE----------VREDSL 502

Query: 462 SLDTPSKIHTPASSDTALGTRL--LENGEENIGQFLYLEGAEYVMYNTYDVHFYSSFALV 519
             +              LG  +  L     + G+F YLEG EY MYNTYDVHFY+SFAL+
Sbjct: 503 PEE--------------LGRNMCHLCPTLRDYGRFGYLEGQEYRMYNTYDVHFYASFALI 548

Query: 520 MLFPKLELSIQRDFAAAVMMHDPGTMKIMSDGKWVARKCLGAVPHDIGL--DDPWFEINS 577
           ML+PKLELS+Q D A A +  D    + +  G     K    +PHDIG   D+PW  +N+
Sbjct: 549 MLWPKLELSLQYDMALATLREDLTRRQYLMSGVMAPVKRRNVIPHDIGDPDDEPWLRVNA 608

Query: 578 YNLFNSSRWKDLNSKFVLQVYRDFVATGDKNFARAVWPSVYIAMAYMEQFDKDGDGMIEN 637
           Y + +++ WKDLN KFVLQVYRD+  TGD+NF + +WP     M    +FDKD DG+IEN
Sbjct: 609 YLIHDTADWKDLNLKFVLQVYRDYYLTGDQNFLKDMWPVCLAVMESEMKFDKDHDGLIEN 668

Query: 638 EGFPDQTYDAWSANGVSAYCGGLWVAALQAASALANDVGDHASASYFWVRYQKAKAVYDS 697
            G+ DQTYD W   G SAYCGGLW+AA+     +A   G       F     + +  Y+ 
Sbjct: 669 GGYADQTYDGWVTTGPSAYCGGLWLAAVAVMVQMAALCGAQDIQDKFSSILHRGQEAYER 728

Query: 698 -LWNGSYFNYDNSDGSSSTSIQADQLAGQWYARACGL----LPIADEAKVKKALTKIYDF 752
            LWNG Y+NYD+S    S S+ +DQ AGQW+ +ACGL      +     V +AL  I++ 
Sbjct: 729 LLWNGRYYNYDSSSQPQSRSVMSDQCAGQWFLKACGLGEGETEVFPTPHVVRALQTIFEL 788

Query: 753 NVLKVKGGMCGAMNGMQPDGRIDMSGLQAREIWPGVTYGLAASMIQEEMVDMAFQTAAGV 812
           NV    GG  GA+NGMQP G  D S +Q+ E+W GV YGLAA+MIQE +    FQTA G 
Sbjct: 789 NVQAFAGGAMGAVNGMQPHGVPDKSSVQSDEVWVGVVYGLAATMIQEGLTWEGFQTAEGC 848

Query: 813 YEVAWSEDGLGYSFQTPESWNNNDEYRSLCYMRPLTIW 850
           Y   W    LG +FQTPE++     +RSL YMRPL+IW
Sbjct: 849 YRTVWER--LGLAFQTPEAYCQQRVFRSLAYMRPLSIW 884


>gi|355567647|gb|EHH23988.1| Non-lysosomal glucosylceramidase [Macaca mulatta]
 gi|355753217|gb|EHH57263.1| Non-lysosomal glucosylceramidase [Macaca fascicularis]
 gi|380789499|gb|AFE66625.1| non-lysosomal glucosylceramidase [Macaca mulatta]
 gi|384943046|gb|AFI35128.1| non-lysosomal glucosylceramidase [Macaca mulatta]
          Length = 927

 Score =  574 bits (1480), Expect = e-161,   Method: Compositional matrix adjust.
 Identities = 331/818 (40%), Positives = 456/818 (55%), Gaps = 64/818 (7%)

Query: 52  LAHIGFRLYRY----SKQEEAKGKVPVFNVFRKHHITSDQGIPLGGIGAGSIGRSYRGEF 107
           + HIG  L RY     ++ + + K P  ++     +    G PLGGIG G+I R +RG+F
Sbjct: 112 IKHIGMGL-RYLQWWYRKTQVEKKTPFIDMINSVPLRQIYGCPLGGIGGGTITRGWRGQF 170

Query: 108 QRFKLFHGICDDAPVLANQFSVFVSRPNGEKFSSVLCPRSPGVPKKNTDSGIESWDWNLK 167
            R++L  G+     V+A+QF+V + R     +  VL    P V        + SW+W L 
Sbjct: 171 CRWQLNPGMYQHRTVIADQFTVCLRREGQTVYQQVLSLERPSV--------LRSWNWGLC 222

Query: 168 GENCTYHALFPRAWTVYDGEPDPELRIVCRQISPFIPHNYKESSFPASVFTFTLSNSGQT 227
           G    YHAL+PRAWTVY   P  ++ + CRQI+P +PH+Y++SS P  VF + + N G  
Sbjct: 223 GYFAFYHALYPRAWTVYQ-LPGQDVTLTCRQITPILPHDYQDSSLPVGVFVWDVENEGDE 281

Query: 228 SADVTLLFTWANSVAGDSGLSGHHFNSKTMTKDG---VHGLTLHHRTANGRPPVTFAVAA 284
           + DV+++F+  N + G     G  +N     + G   V GL LHH T     P T AVAA
Sbjct: 282 ALDVSIMFSMRNGLGGGDDAPGGLWNEPFCLEHGGETVQGLLLHHPTLPN--PYTMAVAA 339

Query: 285 EETADVHVSECPCFLLSGNSKGITAKDMWNEIKKHGSFDHLDNDKTSPSEPGSSIGAAIA 344
             TAD  V+    F         T + +W ++ + G  D     +++P++ G  I  A+ 
Sbjct: 340 RVTADTTVTHITAF-----DPDSTGQQVWQDLLQDGQLDS-PAGQSTPTQKGVGIAGAVC 393

Query: 345 ASLTI-PSGSTRSVTFSLAWDCPEVKFFEK--VYHRRYTKFYGTLGDSAARIAHDAILEH 401
            S  + P G  R + FSLAWD P + F  K  V++RRYT+F+G  GD+A  ++H A+  +
Sbjct: 394 VSSKLRPRGQCR-LEFSLAWDMPRILFGAKGQVHYRRYTRFFGQDGDAAPALSHYALCRY 452

Query: 402 AKWECEIEAWQRPILEDKRFPEWYPITLFNELYYLNAGGTIWTDGSPPMQSLATIRERKF 461
           A+WE  I AWQ P+L+D+  P WY   LFNELY+L  GGT+W +          +RE   
Sbjct: 453 AEWEERISAWQSPVLDDRSLPAWYKSALFNELYFLADGGTVWLE----------VREDSL 502

Query: 462 SLDTPSKIHTPASSDTALGTRL--LENGEENIGQFLYLEGAEYVMYNTYDVHFYSSFALV 519
             +              LG  +  L     + G+F YLEG EY MYNTYDVHFY+SFAL+
Sbjct: 503 PEE--------------LGRNMCHLCPTLRDYGRFGYLEGQEYRMYNTYDVHFYASFALI 548

Query: 520 MLFPKLELSIQRDFAAAVMMHDPGTMKIMSDGKWVARKCLGAVPHDIGL--DDPWFEINS 577
           ML+PKLELS+Q D A A +  D    + +  G     K    +PHDIG   D+PW  +N+
Sbjct: 549 MLWPKLELSLQYDMALATLREDLTRRQYLMSGVMAPVKRRNVIPHDIGDPDDEPWLRVNA 608

Query: 578 YNLFNSSRWKDLNSKFVLQVYRDFVATGDKNFARAVWPSVYIAMAYMEQFDKDGDGMIEN 637
           Y + +++ WKDLN KFVLQVYRD+  TGD+NF + +WP     M    +FDKD DG+IEN
Sbjct: 609 YLIHDTADWKDLNLKFVLQVYRDYYLTGDQNFLKDMWPVCLAVMESEMKFDKDHDGLIEN 668

Query: 638 EGFPDQTYDAWSANGVSAYCGGLWVAALQAASALANDVGDHASASYFWVRYQKAKAVYDS 697
            G+ DQTYD W   G SAYCGGLW+AA+     +A   G       F     + +  Y+ 
Sbjct: 669 GGYADQTYDGWVTTGPSAYCGGLWLAAVAVMVQMAALCGAQDIQDKFSSILHRGQEAYER 728

Query: 698 -LWNGSYFNYDNSDGSSSTSIQADQLAGQWYARACGL----LPIADEAKVKKALTKIYDF 752
            LWNG Y+NYD+S    S S+ +DQ AGQW+ +ACGL      +     V +AL  I++ 
Sbjct: 729 LLWNGRYYNYDSSSQPQSRSVMSDQCAGQWFLKACGLGEGETEVFPTPHVVRALQTIFEL 788

Query: 753 NVLKVKGGMCGAMNGMQPDGRIDMSGLQAREIWPGVTYGLAASMIQEEMVDMAFQTAAGV 812
           NV    GG  GA+NGMQP G  D S +Q+ E+W GV YGLAA+MIQE +    FQTA G 
Sbjct: 789 NVQAFAGGAMGAVNGMQPHGVPDKSSVQSDEVWVGVVYGLAATMIQEGLTWEGFQTAEGC 848

Query: 813 YEVAWSEDGLGYSFQTPESWNNNDEYRSLCYMRPLTIW 850
           Y   W    LG +FQTPE++     +RSL YMRPL+IW
Sbjct: 849 YRTVWER--LGLAFQTPEAYCQQRVFRSLAYMRPLSIW 884


>gi|432920219|ref|XP_004079895.1| PREDICTED: non-lysosomal glucosylceramidase-like [Oryzias latipes]
          Length = 858

 Score =  574 bits (1479), Expect = e-161,   Method: Compositional matrix adjust.
 Identities = 326/815 (40%), Positives = 458/815 (56%), Gaps = 50/815 (6%)

Query: 55  IGFRLYRY-SKQEEAKGKVPVFNVFRKHHITSDQGIPLGGIGAGSIGRSYRGEFQRFKLF 113
           +G R  R+  K+ + + K P  ++F    +    G PLGGIG G+I R +RGEF R++L 
Sbjct: 62  LGIRYLRWWYKKTQVEKKAPFIDMFHALPLRQIYGAPLGGIGGGTITRGWRGEFCRWQLN 121

Query: 114 HGICDDAPVLANQFSVFVSRPNGEKFSSVLCPRSPGVPKKNTDSGIESWDWNLKGENCTY 173
            G+     V+A+QF+V + R     +  VL    P        S ++ W+W   GE   Y
Sbjct: 122 PGMYHYKTVIADQFTVCLRRGGQTVYQQVLSVEHP--------STLQGWNWGYCGEYAFY 173

Query: 174 HALFPRAWTVYDGEPDPELRIVCRQISPFIPHNYKESSFPASVFTFTLSNSGQTSADVTL 233
           HAL+PRAWTVY   P   + + CRQISP IPH+YK+SS P +VF + + N    + +V++
Sbjct: 174 HALYPRAWTVYH-LPGQNVTLTCRQISPVIPHDYKDSSLPVAVFVWDIENKNDYALEVSI 232

Query: 234 LFTWANSVAGDSGLSGHHFNSK-TMTKDG--VHGLTLHHRTANGRPPVTFAVAAEETADV 290
           +FT  N        SG H+N    + K+G  + G+ LHH TA    P T  ++A E  D 
Sbjct: 233 MFTMVNGSGHKDDSSGGHWNEPFHLEKEGKALSGVLLHHCTAVN--PYTLCISAREQPDR 290

Query: 291 HVSECPCFLLSGNSKGITAKDMWNEIKKHGSFDHLDNDKTSPSEPGSSIGAAIAASLTIP 350
            VS    F  SG   G     +W+++   G  D   +  + P++ G  + AA+A   ++P
Sbjct: 291 KVSHQTAFSPSGTCSG-----LWSDLMTDGRLDS-PSGSSPPTQKGEKVAAALAVGCSVP 344

Query: 351 SGSTRSVTFSLAWDCPEVKF--FEKVYHRRYTKFYGTLGDSAARIAHDAILEHAKWECEI 408
           +     V F LAWD P + F   E+ + RRYT+++G+ G+++  ++H A+  + +WE  I
Sbjct: 345 AHGHNDVEFCLAWDMPLITFGSRERKHVRRYTRYFGSKGEASPSLSHYALTHYKEWERRI 404

Query: 409 EAWQRPILEDKRFPEWYPITLFNELYYLNAGGTIWTDGSPPMQSLATIRERKFSLDTPSK 468
           E WQ PIL+D   P WY   LFNELY++  GGT+WT+ +        +R     L     
Sbjct: 405 EEWQSPILQDSSLPSWYKSALFNELYFVADGGTVWTELAEDADVSGGVRSEDGGL----- 459

Query: 469 IHTPASSDTALGTRLLENGEENIGQFLYLEGAEYVMYNTYDVHFYSSFALVMLFPKLELS 528
              PA  D            +  G+F YLEG EY MYNTYDVHFY+SFAL+ML+PKL LS
Sbjct: 460 ---PAQPDVI----------KEYGRFAYLEGQEYRMYNTYDVHFYASFALIMLWPKLALS 506

Query: 529 IQRDFAAAVMMHDPGTMKIMSDGKWVARKCLGAVPHDIGL--DDPWFEINSYNLFNSSRW 586
           +Q D A +V+  D      +  G++   K    VPHDIG   D+PW  +N+Y + +++ W
Sbjct: 507 VQYDIAGSVVQQDLTERLHLMSGRYSPVKTKNVVPHDIGDPDDEPWHRLNAYLIHDTAGW 566

Query: 587 KDLNSKFVLQVYRDFVATGDKNFARAVWPSVYIAMAYMEQFDKDGDGMIENEGFPDQTYD 646
           KDLN KFVLQVYRDF  T D  + + +WP     M    +FD DGDG+IEN G+ DQTYD
Sbjct: 567 KDLNLKFVLQVYRDFHITQDNQYLKDMWPICQAVMESEIKFDLDGDGLIENSGYADQTYD 626

Query: 647 AWSANGVSAYCGGLWVAALQAASALANDVGDHASASYFWVRYQKAKAVYDS-LWNGSYFN 705
            W+  G SAYCGGLW+A+L     +A  V    +  ++     +  A +D  LWNG Y+N
Sbjct: 627 GWTVTGPSAYCGGLWLASLCVMCKMARLVDTERTYQHYKDILDRGSAAFDKLLWNGKYYN 686

Query: 706 YDNSDGSSSTSIQADQLAGQWY--ARACGL--LPIADEAKVKKALTKIYDFNVLKVKGGM 761
           YD+S    S S+ +DQ AG W+  A ACG        + K++  L  I+D NV+   GG 
Sbjct: 687 YDSSGRYHSNSVMSDQCAGHWFLKASACGEEDYQAFPKEKIQSTLKSIFDLNVMAFAGGQ 746

Query: 762 CGAMNGMQPDGRIDMSGLQAREIWPGVTYGLAASMIQEEMVDMAFQTAAGVYEVAWSEDG 821
            GA+NGM+P+G  D S +Q+ E+W GV YGLAA+MI E M +    TA G Y   W    
Sbjct: 747 MGAVNGMRPEGVPDRSSVQSDEVWIGVVYGLAATMIHEGMREEGMHTAEGCYRTVWER-- 804

Query: 822 LGYSFQTPESWNNNDEYRSLCYMRPLTIWAMQWAL 856
           LG +FQTPE++   + YRSL YMRPL+IWAMQ AL
Sbjct: 805 LGMAFQTPEAYCEKNIYRSLAYMRPLSIWAMQLAL 839


>gi|354485725|ref|XP_003505033.1| PREDICTED: non-lysosomal glucosylceramidase-like, partial
           [Cricetulus griseus]
          Length = 898

 Score =  574 bits (1479), Expect = e-161,   Method: Compositional matrix adjust.
 Identities = 330/817 (40%), Positives = 451/817 (55%), Gaps = 65/817 (7%)

Query: 52  LAHIGFRLYRYSK----QEEAKGKVPVFNVFRKHHITSDQGIPLGGIGAGSIGRSYRGEF 107
           + H G  L RY K    + + + K P  ++F    +    G PLGGIG G+I R +RG+F
Sbjct: 86  IKHFGMGL-RYLKWWYRKTQVEKKTPFIDMFNSVPLRQIYGCPLGGIGGGTITRGWRGQF 144

Query: 108 QRFKLFHGICDDAPVLANQFSVFVSRPNGEKFSSVLCPRSPGVPKKNTDSGIESWDWNLK 167
            R++L  G+     V+A+QF+V + R     +  VL    P V        + SW+W L 
Sbjct: 145 CRWQLNPGMYQHQTVIADQFTVCLRRNGRTVYQQVLSLERPSV--------LRSWNWGLC 196

Query: 168 GENCTYHALFPRAWTVYDGEPDPELRIVCRQISPFIPHNYKESSFPASVFTFTLSNSGQT 227
           G    YHAL+PRAWTVY   P   + + CRQI+P +PH+Y++S  P  VF + + N G  
Sbjct: 197 GHFAFYHALYPRAWTVYQ-LPGQSVTLTCRQITPILPHDYQDSCLPVGVFVWDVENQGDE 255

Query: 228 SADVTLLFTWANSVAG-DSGLSGHHFNSKTMTKDG--VHGLTLHHRTANGRPPVTFAVAA 284
           + DV+++F+  N + G D  L G       + +DG  V GL LHH T     P T AVAA
Sbjct: 256 TLDVSIMFSMRNGLGGEDDALGGLWNEPFCLQRDGKTVQGLLLHHPTPPN--PYTMAVAA 313

Query: 285 EETADVHVSECPCFLLSGNSKGITAKDMWNEIKKHGSFDHLDNDKTSPSEPGSSIGAAIA 344
            +T   H++              T + +W  + + G  D     +++PS+ G  +  A+ 
Sbjct: 314 RDTTVTHIT--------AFDPDSTGQQVWKHLLQDGQLDS-PAGQSTPSKKGEGVAGAVC 364

Query: 345 ASLTIPSGSTRSVTFSLAWDCPEVKFFEK--VYHRRYTKFYGTLGDSAARIAHDAILEHA 402
            S  +P      + FSLAWD P + F  K  +++RRYT+F+G+ GD A  ++H A+ ++A
Sbjct: 365 ISSKLPPRGQCCLEFSLAWDMPRIMFGAKGQIHYRRYTRFFGSDGDVAPALSHYALCQYA 424

Query: 403 KWECEIEAWQRPILEDKRFPEWYPITLFNELYYLNAGGTIWTDGSPPMQSLATIRERKFS 462
            WE  I AWQ P+L+D+  P WY   LFNELY+L  GGT+W                   
Sbjct: 425 DWENRISAWQSPVLDDRSLPAWYKSALFNELYFLADGGTVW------------------- 465

Query: 463 LDTPSKIHTPASSDTALGTRL--LENGEENIGQFLYLEGAEYVMYNTYDVHFYSSFALVM 520
           L+ P       S    LG  +  L    ++ G+F YLEG EY MYNTYDVHFY+SFALVM
Sbjct: 466 LEVPED-----SLPEELGENMYQLRPILQDYGRFGYLEGQEYRMYNTYDVHFYASFALVM 520

Query: 521 LFPKLELSIQRDFAAAVMMHDPGTMKIMSDGKWVARKCLGAVPHDIGL--DDPWFEINSY 578
           L+PKLELS+Q D A A    D    + +  G     K    +PHDIG   D+PW  +N+Y
Sbjct: 521 LWPKLELSLQYDMALATFKEDLTRRRYLMSGVVAPVKRRNVIPHDIGDPDDEPWLRVNAY 580

Query: 579 NLFNSSRWKDLNSKFVLQVYRDFVATGDKNFARAVWPSVYIAMAYMEQFDKDGDGMIENE 638
            + +++ WKDLN KFVLQVYRD+  TGD+ F + +WP     M    +FDKD DG+IEN 
Sbjct: 581 LIHDTADWKDLNLKFVLQVYRDYYLTGDQGFLKDMWPVCLAVMESEMKFDKDQDGLIENG 640

Query: 639 GFPDQTYDAWSANGVSAYCGGLWVAALQAASALANDVGDHASASYFWVRYQKAKAVYDS- 697
           G+ DQTYD W A G SAYCGGLW+AA+     +A   G       F     + +  Y+  
Sbjct: 641 GYADQTYDGWVATGPSAYCGGLWLAAVAVMVQMAVLCGAQDVQDKFSSILCRGREAYERL 700

Query: 698 LWNGSYFNYDNSDGSSSTSIQADQLAGQWYARACGL----LPIADEAKVKKALTKIYDFN 753
           LWNG Y+NYD+S    S SI +DQ AGQW+ RACGL      +     V +AL  I++ N
Sbjct: 701 LWNGRYYNYDSSSQPQSRSIMSDQCAGQWFLRACGLGEGDTEVFPTLHVVRALQTIFELN 760

Query: 754 VLKVKGGMCGAMNGMQPDGRIDMSGLQAREIWPGVTYGLAASMIQEEMVDMAFQTAAGVY 813
           V    GG  GA+NGMQP G  D S +Q+ E+W GV YGLAA+MIQE +    F+TA G Y
Sbjct: 761 VQAFAGGAMGAVNGMQPHGVPDRSSVQSDEVWVGVVYGLAATMIQEGLTREGFRTAEGCY 820

Query: 814 EVAWSEDGLGYSFQTPESWNNNDEYRSLCYMRPLTIW 850
              W    LG +FQTPE++     +RSL YMRPL+IW
Sbjct: 821 RTVWER--LGLAFQTPEAYCQQQVFRSLAYMRPLSIW 855


>gi|297684086|ref|XP_002819683.1| PREDICTED: non-lysosomal glucosylceramidase isoform 1 [Pongo
           abelii]
          Length = 927

 Score =  572 bits (1475), Expect = e-160,   Method: Compositional matrix adjust.
 Identities = 331/818 (40%), Positives = 454/818 (55%), Gaps = 64/818 (7%)

Query: 52  LAHIGFRLYRY----SKQEEAKGKVPVFNVFRKHHITSDQGIPLGGIGAGSIGRSYRGEF 107
           + HIG  L RY     ++ + + K P  ++     +    G PLGGIG G+I R +RG+F
Sbjct: 112 IKHIGMGL-RYLQWWYQKTQVEKKTPFIDMINSVPLRQIYGCPLGGIGGGTITRGWRGQF 170

Query: 108 QRFKLFHGICDDAPVLANQFSVFVSRPNGEKFSSVLCPRSPGVPKKNTDSGIESWDWNLK 167
            R++L  G+     V+A+QF+V + R     +  VL    P V        + SW+W L 
Sbjct: 171 CRWQLNPGMYQHRTVIADQFTVCLRREGQTVYQQVLSLERPTV--------LRSWNWGLC 222

Query: 168 GENCTYHALFPRAWTVYDGEPDPELRIVCRQISPFIPHNYKESSFPASVFTFTLSNSGQT 227
           G    YHAL+PRAWTVY   P   + + CRQI+P +PH+Y++SS P  VF + + N G  
Sbjct: 223 GYFAFYHALYPRAWTVYQ-LPGQNVTLTCRQITPILPHDYQDSSLPVGVFVWDVENEGDE 281

Query: 228 SADVTLLFTWANSVAGDSGLSGHHFNSKTMTKDG---VHGLTLHHRTANGRPPVTFAVAA 284
           + DV+++F+  N + G     G  +N     + G   V GL LHH T     P T AVAA
Sbjct: 282 ALDVSIMFSMRNGLGGGDDAPGGLWNEPFCLERGRETVQGLLLHHPTLPN--PYTMAVAA 339

Query: 285 EETADVHVSECPCFLLSGNSKGITAKDMWNEIKKHGSFDHLDNDKTSPSEPGSSIGAAIA 344
             TAD  V+    F         T + +W ++ + G  D     +++P++ G  I  A+ 
Sbjct: 340 RVTADTTVTHITAF-----DPDSTGQQVWQDLLQDGQLDS-PTGQSTPTQKGVGIAGAVC 393

Query: 345 ASLTI-PSGSTRSVTFSLAWDCPEVKFFEK--VYHRRYTKFYGTLGDSAARIAHDAILEH 401
            S  + P G  R + FSLAWD P + F  K  V++RRYT+F+G  GD+A  ++H A+  +
Sbjct: 394 ISSKLRPRGQCR-LEFSLAWDMPRIMFGAKGQVHYRRYTRFFGQDGDAAPALSHYALCRY 452

Query: 402 AKWECEIEAWQRPILEDKRFPEWYPITLFNELYYLNAGGTIWTDGSPPMQSLATIRERKF 461
           A+WE  I AWQ P+L+D+  P WY   LFNELY+L  GGT+W         L  + +   
Sbjct: 453 AEWEERISAWQSPVLDDRSLPAWYKSALFNELYFLADGGTVW---------LEVLED--- 500

Query: 462 SLDTPSKIHTPASSDTALGTRL--LENGEENIGQFLYLEGAEYVMYNTYDVHFYSSFALV 519
                       S    LG  +  L     + G+F YLEG EY MYNTYDVHFY+SFAL+
Sbjct: 501 ------------SLPEELGRNMCHLRPTLRDYGRFGYLEGQEYRMYNTYDVHFYASFALI 548

Query: 520 MLFPKLELSIQRDFAAAVMMHDPGTMKIMSDGKWVARKCLGAVPHDIGL--DDPWFEINS 577
           ML+PKLELS+Q D A A +  D    + +  G     K    +PHDIG   D+PW  +N+
Sbjct: 549 MLWPKLELSLQYDMALATLREDLTRRRYLMSGVMAPVKRRNVIPHDIGDPDDEPWLRVNA 608

Query: 578 YNLFNSSRWKDLNSKFVLQVYRDFVATGDKNFARAVWPSVYIAMAYMEQFDKDGDGMIEN 637
           Y + +++ WKDLN KFVLQVYRD+  TGD+NF + +WP     M    +FDKD DG+IEN
Sbjct: 609 YLIHDTADWKDLNLKFVLQVYRDYYLTGDQNFLKDMWPVCLAVMESEMKFDKDHDGLIEN 668

Query: 638 EGFPDQTYDAWSANGVSAYCGGLWVAALQAASALANDVGDHASASYFWVRYQKAKAVYDS 697
            G+ DQTYD W   G SAYCGGLW+AA+     +A   G       F     + +  Y+ 
Sbjct: 669 GGYADQTYDGWVTTGPSAYCGGLWLAAVAVMVQMAALCGAQDIQDKFSSILSRGQEAYER 728

Query: 698 -LWNGSYFNYDNSDGSSSTSIQADQLAGQWYARACGL----LPIADEAKVKKALTKIYDF 752
            LWNG Y+NYD+S    S S+ +DQ AGQW+ +ACGL      +     V +AL  I++ 
Sbjct: 729 LLWNGRYYNYDSSSRPQSRSVMSDQCAGQWFLKACGLGEGDTEVFPTPHVVRALQTIFEL 788

Query: 753 NVLKVKGGMCGAMNGMQPDGRIDMSGLQAREIWPGVTYGLAASMIQEEMVDMAFQTAAGV 812
           NV    GG  GA+NGMQP G  D S +Q+ E+W GV YGLAA+MIQE +    FQTA G 
Sbjct: 789 NVQAFAGGAMGAVNGMQPHGVPDKSSVQSDEVWVGVVYGLAATMIQEGLTWEGFQTAEGC 848

Query: 813 YEVAWSEDGLGYSFQTPESWNNNDEYRSLCYMRPLTIW 850
           Y   W    LG +FQTPE++     +RSL YMRPL+IW
Sbjct: 849 YRTVWER--LGLAFQTPEAYCQQRVFRSLAYMRPLSIW 884


>gi|395515264|ref|XP_003761826.1| PREDICTED: non-lysosomal glucosylceramidase [Sarcophilus harrisii]
          Length = 954

 Score =  572 bits (1473), Expect = e-160,   Method: Compositional matrix adjust.
 Identities = 330/823 (40%), Positives = 460/823 (55%), Gaps = 55/823 (6%)

Query: 49  FRHLAHIGFRLYRYSKQE-EAKGKVPVFNVFRKHHITSDQGIPLGGIGAGSIGRSYRGEF 107
            +HL  +GFR  ++  Q+ + + K P  ++     +    G PLGGIG G+I R +RG+F
Sbjct: 121 LKHLG-MGFRYLKWWYQKTQVEKKTPFIDMLNCLPLRQIYGCPLGGIGGGTITRGWRGQF 179

Query: 108 QRFKLFHGICDDAPVLANQFSVFVSRPNGEKFSSVLCPRSPGVPKKNTDSGIESWDWNLK 167
            R++L  GI     V+A+QF+V + R     +  VL    P V        + SW+W L 
Sbjct: 180 CRWQLNPGIYQHRTVMADQFTVCLRREGQTVYQQVLSLERPNV--------LRSWNWGLC 231

Query: 168 GENCTYHALFPRAWTVYDGEPDPELRIVCRQISPFIPHNYKESSFPASVFTFTLSNSGQT 227
           G    YHAL+PRAWTVY   P  ++ + C QI+P +PH+Y++SS P  VF + + N+G  
Sbjct: 232 GFFAFYHALYPRAWTVYQ-LPGQQVTLTCLQITPILPHDYQDSSLPVGVFVWNVENTGDE 290

Query: 228 SADVTLLFTWANSVAGDSGLSGHHFNSK-TMTKDG--VHGLTLHHRTANGRPPVTFAVAA 284
             +V+++F+  N + G     G  +N    + +DG  V G+ LHH   N   P T A+AA
Sbjct: 291 DLEVSIMFSMRNGLGGGHDNLGGLWNEPFCLERDGETVQGVLLHH--PNPPNPYTMALAA 348

Query: 285 EETADVHVSECPCFLLSGNSKGITAKDMWNEIKKHGSFDHLDNDKTSPSEPGSSIGAAIA 344
             TA+  V+    F   G     T + +W ++   G  D      + P++    IG A+ 
Sbjct: 349 RRTANTTVTHITAFDPDG-----TGQKVWQDLLLDGQLDS-PLGPSPPTQKRQGIGGAVC 402

Query: 345 ASLTIPSGSTRSVTFSLAWDCPEVKFFEK--VYHRRYTKFYGTLGDSAARIAHDAILEHA 402
           A   +P      + F+L+WD P+V F  K   ++RRYT+F+G  G++A  + H A+ ++ 
Sbjct: 403 AGAHLPRRGRCQLEFALSWDMPKVLFGAKGQAHYRRYTRFFGQEGNAAPALCHYALTQYR 462

Query: 403 KWECEIEAWQRPILEDKRFPEWYPITLFNELYYLNAGGTIWTDGSPPMQSLATIRERKFS 462
            WE +I AWQ P+LED+  P WY   LFNELY+L  GGT+W      ++        +FS
Sbjct: 463 SWEEKIIAWQSPVLEDRTLPAWYKSALFNELYFLADGGTVW------LEVTEETITEEFS 516

Query: 463 LDTPSKIHTPASSDTALGTRLLENGEENIGQFLYLEGAEYVMYNTYDVHFYSSFALVMLF 522
                           +  +L    +E  G+F YLEG EY MYNTYDVHFY+SFAL+ML+
Sbjct: 517 ---------------GMLNQLRPTLQE-YGRFAYLEGQEYHMYNTYDVHFYASFALIMLW 560

Query: 523 PKLELSIQRDFAAAVMMHDPGTMKIMSDGKWVARKCLGAVPHDIGL--DDPWFEINSYNL 580
           PKLELS+Q D A A +  D    K +  G     K    +PHD+G   D+PW  IN+Y +
Sbjct: 561 PKLELSLQYDMAVATLSEDLTRRKYLMSGVIAPVKKKNVIPHDVGDPDDEPWLRINAYLI 620

Query: 581 FNSSRWKDLNSKFVLQVYRDFVATGDKNFARAVWPSVYIAMAYMEQFDKDGDGMIENEGF 640
            +++ WKDLN KFVLQ+YRDF  TG+++F R +WP     M    +FDKD DG+IEN G+
Sbjct: 621 HDTADWKDLNLKFVLQIYRDFYLTGNESFLRDMWPVCQAVMESEMKFDKDQDGLIENGGY 680

Query: 641 PDQTYDAWSANGVSAYCGGLWVAALQAASALANDVGDHASASYFWVRYQKAKAVYDS-LW 699
            DQTYD W   G SAYCGGLW+AA+     +A   G       F     + +  Y+  LW
Sbjct: 681 ADQTYDGWITTGPSAYCGGLWLAAVAVMVQMAAVCGAQDVQEKFLSILSRGREAYERLLW 740

Query: 700 NGSYFNYDNSDGSSSTSIQADQLAGQWYARACGL----LPIADEAKVKKALTKIYDFNVL 755
           NG Y+NYD+S    S  I +DQ AGQW+ RACGL      +   A V +AL  I++ NV 
Sbjct: 741 NGRYYNYDSSSQPQSRIIMSDQCAGQWFLRACGLGKGDTEVFPSAHVIRALQTIFELNVQ 800

Query: 756 KVKGGMCGAMNGMQPDGRIDMSGLQAREIWPGVTYGLAASMIQEEMVDMAFQTAAGVYEV 815
              GG  GA+NGM PDG  D S +Q+ E+W GV YGLAA+MIQE +V   F TA G Y  
Sbjct: 801 GFAGGAMGAVNGMHPDGVPDTSSVQSDEVWVGVVYGLAATMIQEGLVQEGFLTAEGCYRT 860

Query: 816 AWSEDGLGYSFQTPESWNNNDEYRSLCYMRPLTIWAMQWALTK 858
            W    LG +FQTPE++     +RSL YMRPL+IWAMQ AL +
Sbjct: 861 VWER--LGMAFQTPEAYCQRRVFRSLAYMRPLSIWAMQLALQQ 901


>gi|403306662|ref|XP_003943842.1| PREDICTED: non-lysosomal glucosylceramidase isoform 1 [Saimiri
           boliviensis boliviensis]
          Length = 923

 Score =  572 bits (1473), Expect = e-160,   Method: Compositional matrix adjust.
 Identities = 327/815 (40%), Positives = 452/815 (55%), Gaps = 58/815 (7%)

Query: 52  LAHIGFRLYRY----SKQEEAKGKVPVFNVFRKHHITSDQGIPLGGIGAGSIGRSYRGEF 107
           + HIG  L RY     ++ + + K P  ++     +    G PLGGIG G+I R +RG+F
Sbjct: 112 IKHIGMGL-RYLQWWYRKTQVEKKTPFIDMINSVPLRQIYGCPLGGIGGGTITRGWRGQF 170

Query: 108 QRFKLFHGICDDAPVLANQFSVFVSRPNGEKFSSVLCPRSPGVPKKNTDSGIESWDWNLK 167
            R++L  G+     V+A+QF+V + R     +  VL    P V        + SW+W L 
Sbjct: 171 CRWQLNPGMYQHRTVIADQFTVCLRREGQTVYQQVLSLERPNV--------LRSWNWGLC 222

Query: 168 GENCTYHALFPRAWTVYDGEPDPELRIVCRQISPFIPHNYKESSFPASVFTFTLSNSGQT 227
           G    YHAL+PRAWTVY   P   + + CRQI+P +PH+Y++SS P  VF + + N G  
Sbjct: 223 GYFAFYHALYPRAWTVYQ-LPGQNVTLTCRQITPILPHDYQDSSLPVGVFVWDVENEGDE 281

Query: 228 SADVTLLFTWANSVAGDSGLSGHHFNSK-TMTKDG--VHGLTLHHRTANGRPPVTFAVAA 284
           + DV+++F+  N + G     G  +N    + +DG  V GL LHH T     P T A AA
Sbjct: 282 ALDVSIMFSMRNGLGGGDDAPGGLWNEPFHLERDGETVQGLLLHHPTLPN--PYTMAAAA 339

Query: 285 EETADVHVSECPCFLLSGNSKGITAKDMWNEIKKHGSFDHLDNDKTSPSEPGSSIGAAIA 344
             TAD  V+    F   G     T + +W ++ + G  D     +++P++ G  +  A+ 
Sbjct: 340 RVTADTMVTHITAFDPDG-----TGQQVWQDLLQDGQLDS-PAGQSTPTQKGVGVAGAVC 393

Query: 345 ASLTIPSGSTRSVTFSLAWDCPEVKFFEK--VYHRRYTKFYGTLGDSAARIAHDAILEHA 402
            S  +       + FSLAWD P + F  K  V++RRYT+F+G  GD+A  ++H A+  + 
Sbjct: 394 VSSKLRPRGQCCLEFSLAWDMPRIMFGAKGQVHYRRYTRFFGQDGDAAPALSHYALSRYT 453

Query: 403 KWECEIEAWQRPILEDKRFPEWYPITLFNELYYLNAGGTIWTDGSPPMQSLATIRERKFS 462
           +WE  I AWQ P+L+D+  P WY   LFNELY+L  GGT+W +          +RE    
Sbjct: 454 EWEERISAWQSPVLDDRSLPAWYKSALFNELYFLADGGTVWLE----------VREDSL- 502

Query: 463 LDTPSKIHTPASSDTALGTRLLENGEENIGQFLYLEGAEYVMYNTYDVHFYSSFALVMLF 522
                    P   D  +    L     + G+F YLEG EY MYNTYDVHFY+SFAL+ML+
Sbjct: 503 ---------PQELDGNMCH--LRPTLRDYGRFGYLEGQEYRMYNTYDVHFYASFALIMLW 551

Query: 523 PKLELSIQRDFAAAVMMHDPGTMKIMSDGKWVARKCLGAVPHDIGL--DDPWFEINSYNL 580
           PKLELS+Q D A A +  D    + +  G     K    +PHDIG   D+PW  +N+Y +
Sbjct: 552 PKLELSLQYDMALATLREDLTRRRYLMSGVMAPVKRRNVIPHDIGDPDDEPWLRVNAYLI 611

Query: 581 FNSSRWKDLNSKFVLQVYRDFVATGDKNFARAVWPSVYIAMAYMEQFDKDGDGMIENEGF 640
            +++ WKDLN KFVLQVYRD+  TGD+NF + +WP     M    +FDKD DG+IEN G+
Sbjct: 612 HDTADWKDLNLKFVLQVYRDYYLTGDQNFLKDMWPVCLAVMESEMKFDKDHDGLIENGGY 671

Query: 641 PDQTYDAWSANGVSAYCGGLWVAALQAASALANDVGDHASASYFWVRYQKAKAVYDS-LW 699
            DQTYD W   G SAYCGGLW+AA+     +A   G       F     + +  Y+  LW
Sbjct: 672 ADQTYDGWVTTGPSAYCGGLWLAAVAVMVQMAAVCGAQDIQDKFSSILSRGQEAYERLLW 731

Query: 700 NGSYFNYDNSDGSSSTSIQADQLAGQWYARACGL----LPIADEAKVKKALTKIYDFNVL 755
           NG Y+NYD+S    S S+ +DQ AGQW+ +ACGL      +     V +AL  I++ NV 
Sbjct: 732 NGRYYNYDSSSQPQSRSVMSDQCAGQWFLKACGLGEGDTEVFPTPHVVRALQTIFELNVQ 791

Query: 756 KVKGGMCGAMNGMQPDGRIDMSGLQAREIWPGVTYGLAASMIQEEMVDMAFQTAAGVYEV 815
              GG  GA+NGMQP G  D S +Q+ E+W GV YGLAA+MIQE +    FQTA G Y  
Sbjct: 792 AFAGGAMGAVNGMQPHGVPDKSSVQSDEVWVGVVYGLAATMIQEGLTWEGFQTAEGCYRT 851

Query: 816 AWSEDGLGYSFQTPESWNNNDEYRSLCYMRPLTIW 850
            W    LG +FQTPE++     +RSL YMRPL+IW
Sbjct: 852 VWER--LGLAFQTPEAYCQQRVFRSLAYMRPLSIW 884


>gi|410978559|ref|XP_003995657.1| PREDICTED: non-lysosomal glucosylceramidase [Felis catus]
          Length = 917

 Score =  571 bits (1472), Expect = e-160,   Method: Compositional matrix adjust.
 Identities = 334/815 (40%), Positives = 454/815 (55%), Gaps = 55/815 (6%)

Query: 49  FRHLAHIGFR-LYRYSKQEEAKGKVPVFNVFRKHHITSDQGIPLGGIGAGSIGRSYRGEF 107
            +HL  +G R L  + ++ + + K P  ++     +    G PLGGIG G+I R +RG+F
Sbjct: 104 IKHLG-MGLRYLQWWYRKTQVEKKTPFIDLINCVPLRQIYGCPLGGIGGGTITRGWRGQF 162

Query: 108 QRFKLFHGICDDAPVLANQFSVFVSRPNGEKFSSVLCPRSPGVPKKNTDSGIESWDWNLK 167
            R++L  G+     V+A+QF+V + R     +  VL    P V        + SW+W L 
Sbjct: 163 CRWQLNPGMYQHQTVVADQFTVCLRREGTTVYQQVLSVERPSV--------LRSWNWGLC 214

Query: 168 GENCTYHALFPRAWTVYDGEPDPELRIVCRQISPFIPHNYKESSFPASVFTFTLSNSGQT 227
           G    YHAL+PRAWTVY   P   + + CRQI+P +PH+Y++SS P  VF + + N G  
Sbjct: 215 GYFAFYHALYPRAWTVYQ-LPGQNVTLTCRQITPILPHDYQDSSLPVGVFVWDVENEGDE 273

Query: 228 SADVTLLFTWANSVAGDSGLSGHHFNSK-TMTKDG--VHGLTLHHRTANGRPPVTFAVAA 284
           + DV+++F+  N + G     G  +N    + +DG  V GL LHH T     P T AVAA
Sbjct: 274 ALDVSIMFSMRNGLGGGDDAPGGLWNEPFCLERDGETVQGLLLHHPTLPN--PYTMAVAA 331

Query: 285 EETADVHVSECPCFLLSGNSKGITAKDMWNEIKKHGSFDHLDNDKTSPSEPGSSIGAAIA 344
             TAD  V+    F         T + +W ++ + G  D     +++PS+ G  I  A+ 
Sbjct: 332 RLTADTTVTHITAF-----DPDSTGQQVWQDLLQDGQLDS-PAGRSTPSQKGVGIAGAVC 385

Query: 345 ASLTIPSGSTRSVTFSLAWDCPEVKFFEK--VYHRRYTKFYGTLGDSAARIAHDAILEHA 402
            +  +P      + FSLAWD P + F  K  VY+RRYT+F+G  G++A  ++H A+  + 
Sbjct: 386 VTGKLPPRGQCRLEFSLAWDMPRIMFGAKGQVYYRRYTRFFGRDGNAAPTLSHYALCRYT 445

Query: 403 KWECEIEAWQRPILEDKRFPEWYPITLFNELYYLNAGGTIWTDGSPPMQSLATIRERKFS 462
            WE +I AWQ P+L+D+  P WY   LFNELY+L  GGT+W +   P  SL         
Sbjct: 446 DWEEKISAWQSPVLDDRSLPAWYKSALFNELYFLADGGTVWLE--VPEDSL--------- 494

Query: 463 LDTPSKIHTPASSDTALGTRLLENGEENIGQFLYLEGAEYVMYNTYDVHFYSSFALVMLF 522
              P ++     S   L   L E      G+F YLEG EY MYNTYDVHFY+SFALVML+
Sbjct: 495 ---PEEL---VGSMCQLRPILQE-----YGRFGYLEGQEYRMYNTYDVHFYASFALVMLW 543

Query: 523 PKLELSIQRDFAAAVMMHDPGTMKIMSDGKWVARKCLGAVPHDIG--LDDPWFEINSYNL 580
           PKLELS+Q D A A +  D    + +  G     K    +PHDIG   D+PW  +N+Y +
Sbjct: 544 PKLELSLQYDMALATLREDLTRRQYLMSGVMAPVKRRNVIPHDIGDPDDEPWLRVNAYVV 603

Query: 581 FNSSRWKDLNSKFVLQVYRDFVATGDKNFARAVWPSVYIAMAYMEQFDKDGDGMIENEGF 640
            +++ WKDLN KFVLQVYRD+  TGD+ F R +WP     M    +FDKD DG+IEN G+
Sbjct: 604 HDTADWKDLNLKFVLQVYRDYYLTGDQGFLRDMWPVCLAVMESEMKFDKDQDGLIENGGY 663

Query: 641 PDQTYDAWSANGVSAYCGGLWVAALQAASALANDVGDHASASYFWVRYQKAKAVYDS-LW 699
            DQTYD W   G SAYCGGLW+AA+     +A   G       F    ++ +  Y+  LW
Sbjct: 664 ADQTYDGWVTTGPSAYCGGLWLAAVAVMVQMAVLCGAQDVQDKFSSILRRGQEAYERLLW 723

Query: 700 NGSYFNYDNSDGSSSTSIQADQLAGQWYARACGL----LPIADEAKVKKALTKIYDFNVL 755
           NG Y+NYD S    S SI +DQ AGQW+ RA GL      +     V  AL  I++FNV 
Sbjct: 724 NGRYYNYDCSPPPQSCSIMSDQCAGQWFLRASGLGEGDTEVFPTPHVVCALQTIFEFNVR 783

Query: 756 KVKGGMCGAMNGMQPDGRIDMSGLQAREIWPGVTYGLAASMIQEEMVDMAFQTAAGVYEV 815
              GG  GA+NGMQP G  D S +Q+ E+W GV YGLAA+MIQE +    FQTA G Y  
Sbjct: 784 AFAGGAMGAVNGMQPHGVPDRSSVQSDEVWVGVVYGLAATMIQEGLTWEGFQTAEGCYRT 843

Query: 816 AWSEDGLGYSFQTPESWNNNDEYRSLCYMRPLTIW 850
            W    LG +FQTPE++     +RSL YMRPL+IW
Sbjct: 844 VWER--LGLAFQTPEAYCQQRVFRSLAYMRPLSIW 876


>gi|426361725|ref|XP_004048050.1| PREDICTED: non-lysosomal glucosylceramidase isoform 1 [Gorilla
           gorilla gorilla]
          Length = 927

 Score =  571 bits (1472), Expect = e-160,   Method: Compositional matrix adjust.
 Identities = 330/818 (40%), Positives = 455/818 (55%), Gaps = 64/818 (7%)

Query: 52  LAHIGFRLYRY----SKQEEAKGKVPVFNVFRKHHITSDQGIPLGGIGAGSIGRSYRGEF 107
           + HIG  L RY     ++ + + K P  ++     +    G PLGGIG G+I R +RG+F
Sbjct: 112 IKHIGMGL-RYLQWWYRKTQVEKKTPFIDMINSVPLRQIYGCPLGGIGGGTITRGWRGQF 170

Query: 108 QRFKLFHGICDDAPVLANQFSVFVSRPNGEKFSSVLCPRSPGVPKKNTDSGIESWDWNLK 167
            R++L  G+     V+A+QF+V + R     +  VL    P V        + +W+W L 
Sbjct: 171 CRWQLNPGMYQHRTVIADQFTVCLRREGQTVYQQVLSLERPSV--------LRTWNWGLC 222

Query: 168 GENCTYHALFPRAWTVYDGEPDPELRIVCRQISPFIPHNYKESSFPASVFTFTLSNSGQT 227
           G    YHAL+PRAWTVY   P   + + CRQI+P +PH+Y++SS P  VF + + N G  
Sbjct: 223 GYFAFYHALYPRAWTVYQ-LPGQNVTLTCRQITPILPHDYQDSSLPVGVFVWDVENEGDE 281

Query: 228 SADVTLLFTWANSVAGDSGLSGHHFNSK-TMTKDG--VHGLTLHHRTANGRPPVTFAVAA 284
           + DV+++F+  N + G     G  +N    + + G  V GL LHH T     P T AVAA
Sbjct: 282 ALDVSIMFSMRNGLGGGDDAPGGLWNEPFCLERSGETVRGLLLHHPTLPN--PYTMAVAA 339

Query: 285 EETADVHVSECPCFLLSGNSKGITAKDMWNEIKKHGSFDHLDNDKTSPSEPGSSIGAAIA 344
             TAD  V+    F         T + +W ++ + G  D     +++P++ G  I  A+ 
Sbjct: 340 RVTADTTVTHITAF-----DPDSTGQQVWQDLLQDGQLDS-PTGQSTPTQKGVGIAGAVC 393

Query: 345 ASLTI-PSGSTRSVTFSLAWDCPEVKFFEK--VYHRRYTKFYGTLGDSAARIAHDAILEH 401
            S  + P G  R + FSLAWD P + F  K  V++RRYT+F+G  GD+A  ++H A+  +
Sbjct: 394 VSSKLRPRGQCR-LEFSLAWDMPRIMFGAKGQVHYRRYTRFFGQDGDAAPALSHYALCRY 452

Query: 402 AKWECEIEAWQRPILEDKRFPEWYPITLFNELYYLNAGGTIWTDGSPPMQSLATIRERKF 461
           A+WE  I AWQ P+L+D+  P WY   LFNELY+L  GGT+W         L  + +   
Sbjct: 453 AEWEERISAWQSPVLDDRSLPAWYKSALFNELYFLADGGTVW---------LEVLED--- 500

Query: 462 SLDTPSKIHTPASSDTALGTRL--LENGEENIGQFLYLEGAEYVMYNTYDVHFYSSFALV 519
                       S    LG  +  L     + G+F YLEG EY MYNTYDVHFY+SFAL+
Sbjct: 501 ------------SLPEELGRNMCHLRPTLRDYGRFGYLEGQEYRMYNTYDVHFYASFALI 548

Query: 520 MLFPKLELSIQRDFAAAVMMHDPGTMKIMSDGKWVARKCLGAVPHDIGL--DDPWFEINS 577
           ML+PKLELS+Q D A A +  D    + +  G     K    +PHDIG   D+PW  +N+
Sbjct: 549 MLWPKLELSLQYDMALATLREDLTRRRYLMSGVMAPVKRRNVIPHDIGDPDDEPWLRVNA 608

Query: 578 YNLFNSSRWKDLNSKFVLQVYRDFVATGDKNFARAVWPSVYIAMAYMEQFDKDGDGMIEN 637
           Y + +++ WKDLN KFVLQVYRD+  TGD+NF + +WP     M    +FDKD DG+IEN
Sbjct: 609 YLIHDTADWKDLNLKFVLQVYRDYYLTGDQNFLKDMWPVCLAVMESEMKFDKDHDGLIEN 668

Query: 638 EGFPDQTYDAWSANGVSAYCGGLWVAALQAASALANDVGDHASASYFWVRYQKAKAVYDS 697
            G+ DQTYD W   G SAYCGGLW+AA+     +A   G       F     + +  Y+ 
Sbjct: 669 GGYADQTYDGWVTTGPSAYCGGLWLAAVAVMVQMAALCGAQDIQDKFSSILSRGQEAYER 728

Query: 698 -LWNGSYFNYDNSDGSSSTSIQADQLAGQWYARACGL----LPIADEAKVKKALTKIYDF 752
            LWNG Y+NYD+S    S S+ +DQ AGQW+ +ACGL      +     V +AL  I++ 
Sbjct: 729 LLWNGRYYNYDSSSRPQSRSVMSDQCAGQWFLKACGLGEGDTEVFPTQHVVRALQTIFEL 788

Query: 753 NVLKVKGGMCGAMNGMQPDGRIDMSGLQAREIWPGVTYGLAASMIQEEMVDMAFQTAAGV 812
           NV    GG  GA+NGMQP G  D S +Q+ E+W GV YGLAA+MIQE +    FQTA G 
Sbjct: 789 NVQAFAGGAMGAVNGMQPHGVPDKSSVQSDEVWVGVVYGLAATMIQEGLTWEGFQTAEGC 848

Query: 813 YEVAWSEDGLGYSFQTPESWNNNDEYRSLCYMRPLTIW 850
           Y   W    LG +FQTPE++     +RSL YMRPL+IW
Sbjct: 849 YRTVWER--LGLAFQTPEAYCQQRVFRSLAYMRPLSIW 884


>gi|395855828|ref|XP_003800351.1| PREDICTED: LOW QUALITY PROTEIN: non-lysosomal glucosylceramidase
            [Otolemur garnettii]
          Length = 1147

 Score =  570 bits (1468), Expect = e-159,   Method: Compositional matrix adjust.
 Identities = 328/821 (39%), Positives = 456/821 (55%), Gaps = 61/821 (7%)

Query: 49   FRHLAHIGFR-LYRYSKQEEAKGKVPVFNVFRKHHITSDQGIPLGGIGAGSIGRSYRGEF 107
             +HL  +G R L  + ++ + + K P  ++     +    G PLGGIG G+I R +RG+F
Sbjct: 326  IKHLG-MGLRYLQWWYRKTQVEKKAPFIDMINCVPLRQIYGCPLGGIGGGTITRGWRGQF 384

Query: 108  QRFKLFHG------ICDDAPVLANQFSVFVSRPNGEKFSSVLCPRSPGVPKKNTDSGIES 161
             R++L  G      +  D P+   +F+V + R     +  VL    P V        + S
Sbjct: 385  CRWQLNPGMYQHRTVIADQPISPTKFTVCLRRKGQTVYQQVLSLECPSV--------LRS 436

Query: 162  WDWNLKGENCTYHALFPRAWTVYDGEPDPELRIVCRQISPFIPHNYKESSFPASVFTFTL 221
            W+W L G    YHAL+PRAWTVY   P   + + CRQI+P +PH+Y++SS P  VF + +
Sbjct: 437  WNWGLCGYFAFYHALYPRAWTVYQ-LPGQNVTLTCRQITPILPHDYQDSSLPVGVFVWDV 495

Query: 222  SNSGQTSADVTLLFTWANSVAGDSGLSGHHFNSK-TMTKDG--VHGLTLHHRTANGRPPV 278
             N G  + DV+++F+  N + G     G  +N    + +DG  V GL LHH T     P 
Sbjct: 496  ENEGDEALDVSIMFSMRNGLGGGDDAPGGLWNEPFCLERDGETVQGLLLHHPTLPN--PY 553

Query: 279  TFAVAAEETADVHVSECPCFLLSGNSKGITAKDMWNEIKKHGSFDHLDNDKTSPSEPGSS 338
            T AVAA  TAD  V+    F         T + +W ++ + G  D     +++PS+ G  
Sbjct: 554  TMAVAARCTADTTVTHITAF-----DPDSTGQQVWQDLLQDGQLDS-PAGQSAPSQKGVG 607

Query: 339  IGAAIAASLTIPSGSTRSVTFSLAWDCPEVKFFEK--VYHRRYTKFYGTLGDSAARIAHD 396
            +  A+  S  +P      + FSLAWD P++ F  K  V++RRYT+F+G  G++A  ++H 
Sbjct: 608  VAGAVCVSGKLPPQGQCRLEFSLAWDMPKIMFGAKGQVHYRRYTRFFGQDGNAAPALSHY 667

Query: 397  AILEHAKWECEIEAWQRPILEDKRFPEWYPITLFNELYYLNAGGTIWTDGSPPMQSLATI 456
            A+  +A WE  I AWQ P+L+D+  P WY   LFNELY+L  GGT+W +          +
Sbjct: 668  ALCRYADWEERISAWQSPVLDDRSLPAWYKSALFNELYFLADGGTVWLE----------V 717

Query: 457  RERKFSLDTPSKIHTPASSDTALGTRLLENGEENIGQFLYLEGAEYVMYNTYDVHFYSSF 516
            RE   S             +   G   L     + G+F YLEG EY MYNTYDVHFY+SF
Sbjct: 718  REDSIS------------EELGGGMCQLHPMLRDYGRFGYLEGQEYRMYNTYDVHFYASF 765

Query: 517  ALVMLFPKLELSIQRDFAAAVMMHDPGTMKIMSDGKWVARKCLGAVPHDIGL--DDPWFE 574
            AL+ML+PKLELS+Q D A A +  D    + + +G     K    +PHDIG   D+PW  
Sbjct: 766  ALIMLWPKLELSLQYDMALATLREDLTRRRYLMNGVVAPVKRRNVIPHDIGDPDDEPWLR 825

Query: 575  INSYNLFNSSRWKDLNSKFVLQVYRDFVATGDKNFARAVWPSVYIAMAYMEQFDKDGDGM 634
            +N+Y + +++ WKDLN KFVLQVYRD+  TGD+ F + +WP     M    +FDKD DG+
Sbjct: 826  VNAYLIHDTADWKDLNLKFVLQVYRDYYLTGDQGFLKDMWPVCLAVMESEMKFDKDHDGL 885

Query: 635  IENEGFPDQTYDAWSANGVSAYCGGLWVAALQAASALANDVGDHASASYFWVRYQKAKAV 694
            IEN G+ DQTYDAW   G SAYCGGLW+AA+     +A   G       F    ++ +  
Sbjct: 886  IENGGYADQTYDAWVTTGPSAYCGGLWLAAVAVMVQMAALCGAQDLQDKFSSILRQGQEA 945

Query: 695  YDS-LWNGSYFNYDNSDGSSSTSIQADQLAGQWYARACGL----LPIADEAKVKKALTKI 749
            Y+  LWNG Y+NYD+S    S SI +DQ AGQW+ +ACGL      +     V +AL  I
Sbjct: 946  YERLLWNGRYYNYDSSSQPQSRSIMSDQCAGQWFLKACGLGEGDTEVFPTQHVVRALQTI 1005

Query: 750  YDFNVLKVKGGMCGAMNGMQPDGRIDMSGLQAREIWPGVTYGLAASMIQEEMVDMAFQTA 809
            Y+ NV    GG  GA+NGMQP G  D S +Q+ E+W GV YGLAA+MIQE +    F+TA
Sbjct: 1006 YELNVQAFAGGAMGAVNGMQPHGVPDRSSVQSDEVWVGVVYGLAATMIQEGLTWEGFRTA 1065

Query: 810  AGVYEVAWSEDGLGYSFQTPESWNNNDEYRSLCYMRPLTIW 850
             G Y   W    LG +FQTPE++     +RSL YMRPL+IW
Sbjct: 1066 EGCYRTVWER--LGLAFQTPEAYCQQRVFRSLAYMRPLSIW 1104


>gi|301767028|ref|XP_002918945.1| PREDICTED: non-lysosomal glucosylceramidase-like, partial
           [Ailuropoda melanoleuca]
          Length = 928

 Score =  568 bits (1464), Expect = e-159,   Method: Compositional matrix adjust.
 Identities = 334/815 (40%), Positives = 454/815 (55%), Gaps = 55/815 (6%)

Query: 49  FRHLAHIGFR-LYRYSKQEEAKGKVPVFNVFRKHHITSDQGIPLGGIGAGSIGRSYRGEF 107
            +HL  +G R L  + ++ + + K P  ++     +    G PLGGIG G+I R +RG+F
Sbjct: 115 IKHLG-MGLRYLQWWYRKTQVEKKTPFIDLINCVPLRQIYGCPLGGIGGGTITRGWRGQF 173

Query: 108 QRFKLFHGICDDAPVLANQFSVFVSRPNGEKFSSVLCPRSPGVPKKNTDSGIESWDWNLK 167
            R++L  G+     V+A+QF+V + R     +  VL    P V        + SW+W L 
Sbjct: 174 CRWQLNPGMYQHRTVIADQFTVCLRREGKTVYQQVLSVERPSV--------LRSWNWGLC 225

Query: 168 GENCTYHALFPRAWTVYDGEPDPELRIVCRQISPFIPHNYKESSFPASVFTFTLSNSGQT 227
           G    YHAL+PRAWTVY   P   + + CRQI+P +PH+Y++SS P  VF + + N G  
Sbjct: 226 GYFAFYHALYPRAWTVYQ-LPGQNVTLTCRQITPILPHDYQDSSLPVGVFVWDVENEGDE 284

Query: 228 SADVTLLFTWANSVA-GDSGLSGHHFNSKTMTKDG--VHGLTLHHRTANGRPPVTFAVAA 284
           + DV+++F+  N +  GD    G       + +DG  V GL LHH T     P T AVAA
Sbjct: 285 ALDVSIMFSMRNGLGVGDDAPGGLWNEPFCLERDGETVQGLLLHHPTLPN--PYTMAVAA 342

Query: 285 EETADVHVSECPCFLLSGNSKGITAKDMWNEIKKHGSFDHLDNDKTSPSEPGSSIGAAIA 344
             TAD  V+    F         T + +W ++ + G  D     +++PS+ G  I  A+ 
Sbjct: 343 RLTADTTVTHITAF-----DPDSTGQQVWQDLLQDGQLDS-PAGQSTPSQKGVGIAGAVC 396

Query: 345 ASLTIPSGSTRSVTFSLAWDCPEVKFFEK--VYHRRYTKFYGTLGDSAARIAHDAILEHA 402
            +  +P      + FSLAWD P + F  K  VY+RRYT+F+G  G++A  ++H A+  + 
Sbjct: 397 VTGKLPPRGQCRLEFSLAWDMPRIMFGAKGQVYYRRYTRFFGRDGNAAPALSHYALCRYV 456

Query: 403 KWECEIEAWQRPILEDKRFPEWYPITLFNELYYLNAGGTIWTDGSPPMQSLATIRERKFS 462
            WE +I AWQ PIL+D+  P WY   LFNELY+L  GGT+W +   P  SL         
Sbjct: 457 DWEEKISAWQSPILDDRSLPAWYKSALFNELYFLADGGTVWLE--VPEDSL--------- 505

Query: 463 LDTPSKIHTPASSDTALGTRLLENGEENIGQFLYLEGAEYVMYNTYDVHFYSSFALVMLF 522
              P ++     S   L   L E      G+F YLEG EY MYNTYDVHFY+SFALVML+
Sbjct: 506 ---PEEL---VGSMCQLRPILRE-----YGRFGYLEGQEYRMYNTYDVHFYASFALVMLW 554

Query: 523 PKLELSIQRDFAAAVMMHDPGTMKIMSDGKWVARKCLGAVPHDIG--LDDPWFEINSYNL 580
           PKLELS+Q D A A +  D    + +  G     K    +PHDIG   D+PW  +N+Y +
Sbjct: 555 PKLELSLQYDMALATLREDLTRRQYLMSGVTAPVKRRNVIPHDIGDPDDEPWLRVNAYVV 614

Query: 581 FNSSRWKDLNSKFVLQVYRDFVATGDKNFARAVWPSVYIAMAYMEQFDKDGDGMIENEGF 640
            +++ WKDLN KFVLQVYRD+  TGD++F R +WP     M    +FDKD DG+IEN G+
Sbjct: 615 HDTADWKDLNLKFVLQVYRDYYLTGDQSFLRDMWPVCLAVMESEMKFDKDQDGLIENGGY 674

Query: 641 PDQTYDAWSANGVSAYCGGLWVAALQAASALANDVGDHASASYFWVRYQKAKAVYDS-LW 699
            DQTYD W   G SAYCGGLW+AA+     +A   G       F    ++ +  Y+  LW
Sbjct: 675 ADQTYDGWITTGPSAYCGGLWLAAVAVMVQIAVLCGAQDVQDKFSSILRRGREAYERLLW 734

Query: 700 NGSYFNYDNSDGSSSTSIQADQLAGQWYARACGL----LPIADEAKVKKALTKIYDFNVL 755
           NG Y+NYD S    S SI +DQ AGQW+ +A GL      +     V  AL  I++FNV 
Sbjct: 735 NGRYYNYDCSPQPQSCSIMSDQCAGQWFLKASGLGEGDTEVFPTPHVVCALQTIFEFNVQ 794

Query: 756 KVKGGMCGAMNGMQPDGRIDMSGLQAREIWPGVTYGLAASMIQEEMVDMAFQTAAGVYEV 815
              GG  GA+NGMQP G  D S +Q+ E+W GV YGLAA+MIQE +    FQTA G Y  
Sbjct: 795 AFAGGAMGAVNGMQPHGVPDRSSVQSDEVWVGVVYGLAATMIQEGLTWEGFQTAEGCYRT 854

Query: 816 AWSEDGLGYSFQTPESWNNNDEYRSLCYMRPLTIW 850
            W    LG +FQTPE++     +RSL YMRPL+IW
Sbjct: 855 VWER--LGLAFQTPEAYCQQRVFRSLAYMRPLSIW 887


>gi|281339788|gb|EFB15372.1| hypothetical protein PANDA_007478 [Ailuropoda melanoleuca]
          Length = 923

 Score =  567 bits (1462), Expect = e-159,   Method: Compositional matrix adjust.
 Identities = 334/815 (40%), Positives = 454/815 (55%), Gaps = 55/815 (6%)

Query: 49  FRHLAHIGFR-LYRYSKQEEAKGKVPVFNVFRKHHITSDQGIPLGGIGAGSIGRSYRGEF 107
            +HL  +G R L  + ++ + + K P  ++     +    G PLGGIG G+I R +RG+F
Sbjct: 112 IKHLG-MGLRYLQWWYRKTQVEKKTPFIDLINCVPLRQIYGCPLGGIGGGTITRGWRGQF 170

Query: 108 QRFKLFHGICDDAPVLANQFSVFVSRPNGEKFSSVLCPRSPGVPKKNTDSGIESWDWNLK 167
            R++L  G+     V+A+QF+V + R     +  VL    P V        + SW+W L 
Sbjct: 171 CRWQLNPGMYQHRTVIADQFTVCLRREGKTVYQQVLSVERPSV--------LRSWNWGLC 222

Query: 168 GENCTYHALFPRAWTVYDGEPDPELRIVCRQISPFIPHNYKESSFPASVFTFTLSNSGQT 227
           G    YHAL+PRAWTVY   P   + + CRQI+P +PH+Y++SS P  VF + + N G  
Sbjct: 223 GYFAFYHALYPRAWTVYQ-LPGQNVTLTCRQITPILPHDYQDSSLPVGVFVWDVENEGDE 281

Query: 228 SADVTLLFTWANSVA-GDSGLSGHHFNSKTMTKDG--VHGLTLHHRTANGRPPVTFAVAA 284
           + DV+++F+  N +  GD    G       + +DG  V GL LHH T     P T AVAA
Sbjct: 282 ALDVSIMFSMRNGLGVGDDAPGGLWNEPFCLERDGETVQGLLLHHPTLPN--PYTMAVAA 339

Query: 285 EETADVHVSECPCFLLSGNSKGITAKDMWNEIKKHGSFDHLDNDKTSPSEPGSSIGAAIA 344
             TAD  V+    F         T + +W ++ + G  D     +++PS+ G  I  A+ 
Sbjct: 340 RLTADTTVTHITAF-----DPDSTGQQVWQDLLQDGQLDS-PAGQSTPSQKGVGIAGAVC 393

Query: 345 ASLTIPSGSTRSVTFSLAWDCPEVKFFEK--VYHRRYTKFYGTLGDSAARIAHDAILEHA 402
            +  +P      + FSLAWD P + F  K  VY+RRYT+F+G  G++A  ++H A+  + 
Sbjct: 394 VTGKLPPRGQCRLEFSLAWDMPRIMFGAKGQVYYRRYTRFFGRDGNAAPALSHYALCRYV 453

Query: 403 KWECEIEAWQRPILEDKRFPEWYPITLFNELYYLNAGGTIWTDGSPPMQSLATIRERKFS 462
            WE +I AWQ PIL+D+  P WY   LFNELY+L  GGT+W +   P  SL         
Sbjct: 454 DWEEKISAWQSPILDDRSLPAWYKSALFNELYFLADGGTVWLE--VPEDSL--------- 502

Query: 463 LDTPSKIHTPASSDTALGTRLLENGEENIGQFLYLEGAEYVMYNTYDVHFYSSFALVMLF 522
              P ++     S   L   L E      G+F YLEG EY MYNTYDVHFY+SFALVML+
Sbjct: 503 ---PEEL---VGSMCQLRPILRE-----YGRFGYLEGQEYRMYNTYDVHFYASFALVMLW 551

Query: 523 PKLELSIQRDFAAAVMMHDPGTMKIMSDGKWVARKCLGAVPHDIG--LDDPWFEINSYNL 580
           PKLELS+Q D A A +  D    + +  G     K    +PHDIG   D+PW  +N+Y +
Sbjct: 552 PKLELSLQYDMALATLREDLTRRQYLMSGVTAPVKRRNVIPHDIGDPDDEPWLRVNAYVV 611

Query: 581 FNSSRWKDLNSKFVLQVYRDFVATGDKNFARAVWPSVYIAMAYMEQFDKDGDGMIENEGF 640
            +++ WKDLN KFVLQVYRD+  TGD++F R +WP     M    +FDKD DG+IEN G+
Sbjct: 612 HDTADWKDLNLKFVLQVYRDYYLTGDQSFLRDMWPVCLAVMESEMKFDKDQDGLIENGGY 671

Query: 641 PDQTYDAWSANGVSAYCGGLWVAALQAASALANDVGDHASASYFWVRYQKAKAVYDS-LW 699
            DQTYD W   G SAYCGGLW+AA+     +A   G       F    ++ +  Y+  LW
Sbjct: 672 ADQTYDGWITTGPSAYCGGLWLAAVAVMVQIAVLCGAQDVQDKFSSILRRGREAYERLLW 731

Query: 700 NGSYFNYDNSDGSSSTSIQADQLAGQWYARACGL----LPIADEAKVKKALTKIYDFNVL 755
           NG Y+NYD S    S SI +DQ AGQW+ +A GL      +     V  AL  I++FNV 
Sbjct: 732 NGRYYNYDCSPQPQSCSIMSDQCAGQWFLKASGLGEGDTEVFPTPHVVCALQTIFEFNVQ 791

Query: 756 KVKGGMCGAMNGMQPDGRIDMSGLQAREIWPGVTYGLAASMIQEEMVDMAFQTAAGVYEV 815
              GG  GA+NGMQP G  D S +Q+ E+W GV YGLAA+MIQE +    FQTA G Y  
Sbjct: 792 AFAGGAMGAVNGMQPHGVPDRSSVQSDEVWVGVVYGLAATMIQEGLTWEGFQTAEGCYRT 851

Query: 816 AWSEDGLGYSFQTPESWNNNDEYRSLCYMRPLTIW 850
            W    LG +FQTPE++     +RSL YMRPL+IW
Sbjct: 852 VWER--LGLAFQTPEAYCQQRVFRSLAYMRPLSIW 884


>gi|196002317|ref|XP_002111026.1| hypothetical protein TRIADDRAFT_54543 [Trichoplax adhaerens]
 gi|190586977|gb|EDV27030.1| hypothetical protein TRIADDRAFT_54543 [Trichoplax adhaerens]
          Length = 843

 Score =  567 bits (1461), Expect = e-158,   Method: Compositional matrix adjust.
 Identities = 324/833 (38%), Positives = 462/833 (55%), Gaps = 67/833 (8%)

Query: 55  IGFRLYRYSKQEEAKGKVPVFNVFRKHHITSDQGIPLGGIGAGSIGRSYRGEFQRFKLFH 114
           +G R   Y        + P  ++F K  +    G+P+GG+G G+I R + G+F R++L  
Sbjct: 36  MGLRYGMYILHNRFHSREPFIDLFGKRQVDRLYGVPIGGLGCGTIMRGWNGQFCRWQLRP 95

Query: 115 GICDDAPVLANQFSVFVSRPNGE-KFSSVLCPRSPGVPKKNTDSGIESWDWNLKGENCTY 173
           G      V A+QF+V + R NGE  +  VL P  P      + + +  W W   G   TY
Sbjct: 96  GWYTYDIVHADQFTVSI-RKNGEPSYQQVLSPCRP------SSAQLGDWFWGYHGSQATY 148

Query: 174 HALFPRAWTVYDGEPDPELRIVCRQISPFIPHNYKESSFPASVFTFTLSNSGQTSADVTL 233
            AL+PRAWT Y+  P   + + CRQISP IPHNYK+S  P   F + + N      +V++
Sbjct: 149 RALYPRAWTTYN-LPGQNVTLTCRQISPIIPHNYKDSCLPVGCFIWEIDNQSDDEIEVSI 207

Query: 234 LFTWANSVAGDSGLSGHHFN-------SKTMTKDGVHGLTLHHRTANGRPPVTFAVAAEE 286
           +FT+ N    +   +G H N       ++ M    + G+ LHHR +    P TF + A+E
Sbjct: 208 MFTFQNGDGSEHDSAGGHQNETFTRSSNENMNNSKISGVLLHHRAS--LQPCTFTICAKE 265

Query: 287 TAD------VHVSECPCFLLSGNSKGITAKDMWNEIKKHGSFDHLDNDKTSPS---EPGS 337
             +      V VS    F   G+ + I+      ++  HG  ++  N K + S   + G 
Sbjct: 266 CPETGDYPAVSVSNSVSFDPKGSGREIS-----RQLLSHGKLENSSNTKATQSPVSKKGE 320

Query: 338 SIGAAIAASLTIPSGSTRSVTFSLAWDCPEVKFFEK--VYHRRYTKFYGTLGD-SAARIA 394
            IGAA+A +  +       + F+L WD PEV F  K   Y RRYT+F+   G   AA +A
Sbjct: 321 IIGAAVATTAIVKPQGKAKMEFALTWDQPEVYFDAKEIKYRRRYTRFFNPNGSHRAADLA 380

Query: 395 HDAILEHAKWECEIEAWQRPILEDKRFPEWYPITLFNELYYLNAGGTIW--TDGSPPMQS 452
             A+     WE +I+ WQ PILE++ +P+WY   +FNELYY++  G+IW  TDG      
Sbjct: 381 VYALQNVKSWEEKIDQWQSPILENESYPDWYKSAIFNELYYMSDSGSIWAETDGDE---- 436

Query: 453 LATIRERKFSLDTPSKIHTPASSDTALGTRLLENGEENIGQFLYLEGAEYVMYNTYDVHF 512
                      D  + + +        G ++     +  G+F YLEG EY+MYNTYDVHF
Sbjct: 437 -----------DDETNLASCLKEVPGKGKQI----HKEYGRFAYLEGHEYLMYNTYDVHF 481

Query: 513 YSSFALVMLFPKLELSIQRDFAAAVMMHDPGTMKIMSDGKWVARKCLGAVPHDIG--LDD 570
           Y+SFAL  L+P +ELSIQ DFAA++M  DP    IM  G    RK  G VPHDIG   D+
Sbjct: 482 YASFALAQLWPMIELSIQYDFAASIMHEDPELRLIMWTGLRKPRKSYGFVPHDIGEPCDE 541

Query: 571 PWFEINSYNLFNSSRWKDLNSKFVLQVYRDFVATGDKNFARAVWPSVYIAMAYMEQFDKD 630
           PW  IN+Y + +   W DLN+KFVLQ+YRD+ ATG+  F + +WP     M      D D
Sbjct: 542 PWKRINAYCISDVQDWTDLNTKFVLQIYRDYCATGNLEFLKDMWPCAKHVMETAISQDTD 601

Query: 631 GDGMIENEGFPDQTYDAWSANGVSAYCGGLWVAALQAASALANDVG-DHASASYFWVRYQ 689
           GDG+I+N G  DQTYD W   G SAYCGG+++AAL     +A+ +G     A Y  +  +
Sbjct: 602 GDGIIDNNG-ADQTYDVWRMYGASAYCGGIFLAALYIMYKIADLIGCKEDKAKYSKILLR 660

Query: 690 KAKAVYDSLWNGSYFNYDNSDGSSSTSIQADQLAGQWYARACGLL-----PIADEAKVKK 744
              +    LWNG Y+ YD+S G+ S SI ADQ+ G WY +ACGL+      I  +++V K
Sbjct: 661 GRDSFQQKLWNGDYYLYDSSKGNHSDSIMADQMCGHWYLQACGLVNNNEDDIFPKSQVDK 720

Query: 745 ALTKIYDFNVLKVKGGMCGAMNGMQPDGRIDMSGLQAREIWPGVTYGLAASMIQEEMVDM 804
           AL K++D+NV+K   G  GA+NGM+P+G ID S +Q+ E+W GVTY LAA++IQ+ M+  
Sbjct: 721 ALRKVFDYNVMKFMNGTWGAVNGMKPNGEIDTSSVQSEEVWTGVTYALAANLIQQGMIKE 780

Query: 805 AFQTAAGVYEVAWSEDGLGYSFQTPESWNNNDEYRSLCYMRPLTIWAMQWALT 857
            F+ A G+Y+  ++  GLG  FQTPE++   + YR   YMRPL+IWA++WA  
Sbjct: 781 GFKVAEGMYDTTFNVMGLG--FQTPEAYYKINRYRCTGYMRPLSIWAIEWAFN 831


>gi|395740303|ref|XP_003777396.1| PREDICTED: non-lysosomal glucosylceramidase isoform 2 [Pongo
           abelii]
          Length = 933

 Score =  566 bits (1460), Expect = e-158,   Method: Compositional matrix adjust.
 Identities = 331/824 (40%), Positives = 453/824 (54%), Gaps = 70/824 (8%)

Query: 52  LAHIGFRLYRY----SKQEEAKGKVPVFNVFRKHHITSDQGIPLGGIGAGSIGRSYRGEF 107
           + HIG  L RY     ++ + + K P  ++     +    G PLGGIG G+I R +RG+F
Sbjct: 112 IKHIGMGL-RYLQWWYQKTQVEKKTPFIDMINSVPLRQIYGCPLGGIGGGTITRGWRGQF 170

Query: 108 QRFKLFHG------ICDDAPVLANQFSVFVSRPNGEKFSSVLCPRSPGVPKKNTDSGIES 161
            R++L  G      +  D PV   +F+V + R     +  VL    P V        + S
Sbjct: 171 CRWQLNPGMYQHRTVIADQPVCPLKFTVCLRREGQTVYQQVLSLERPTV--------LRS 222

Query: 162 WDWNLKGENCTYHALFPRAWTVYDGEPDPELRIVCRQISPFIPHNYKESSFPASVFTFTL 221
           W+W L G    YHAL+PRAWTVY   P   + + CRQI+P +PH+Y++SS P  VF + +
Sbjct: 223 WNWGLCGYFAFYHALYPRAWTVYQ-LPGQNVTLTCRQITPILPHDYQDSSLPVGVFVWDV 281

Query: 222 SNSGQTSADVTLLFTWANSVAGDSGLSGHHFNSKTMTKDG---VHGLTLHHRTANGRPPV 278
            N G  + DV+++F+  N + G     G  +N     + G   V GL LHH T     P 
Sbjct: 282 ENEGDEALDVSIMFSMRNGLGGGDDAPGGLWNEPFCLERGRETVQGLLLHHPTLPN--PY 339

Query: 279 TFAVAAEETADVHVSECPCFLLSGNSKGITAKDMWNEIKKHGSFDHLDNDKTSPSEPGSS 338
           T AVAA  TAD  V+    F         T + +W ++ + G  D     +++P++ G  
Sbjct: 340 TMAVAARVTADTTVTHITAF-----DPDSTGQQVWQDLLQDGQLDS-PTGQSTPTQKGVG 393

Query: 339 IGAAIAASLTI-PSGSTRSVTFSLAWDCPEVKFFEK--VYHRRYTKFYGTLGDSAARIAH 395
           I  A+  S  + P G  R + FSLAWD P + F  K  V++RRYT+F+G  GD+A  ++H
Sbjct: 394 IAGAVCISSKLRPRGQCR-LEFSLAWDMPRIMFGAKGQVHYRRYTRFFGQDGDAAPALSH 452

Query: 396 DAILEHAKWECEIEAWQRPILEDKRFPEWYPITLFNELYYLNAGGTIWTDGSPPMQSLAT 455
            A+  +A+WE  I AWQ P+L+D+  P WY   LFNELY+L  GGT+W         L  
Sbjct: 453 YALCRYAEWEERISAWQSPVLDDRSLPAWYKSALFNELYFLADGGTVW---------LEV 503

Query: 456 IRERKFSLDTPSKIHTPASSDTALGTRL--LENGEENIGQFLYLEGAEYVMYNTYDVHFY 513
           + +               S    LG  +  L     + G+F YLEG EY MYNTYDVHFY
Sbjct: 504 LED---------------SLPEELGRNMCHLRPTLRDYGRFGYLEGQEYRMYNTYDVHFY 548

Query: 514 SSFALVMLFPKLELSIQRDFAAAVMMHDPGTMKIMSDGKWVARKCLGAVPHDIGL--DDP 571
           +SFAL+ML+PKLELS+Q D A A +  D    + +  G     K    +PHDIG   D+P
Sbjct: 549 ASFALIMLWPKLELSLQYDMALATLREDLTRRRYLMSGVMAPVKRRNVIPHDIGDPDDEP 608

Query: 572 WFEINSYNLFNSSRWKDLNSKFVLQVYRDFVATGDKNFARAVWPSVYIAMAYMEQFDKDG 631
           W  +N+Y + +++ WKDLN KFVLQVYRD+  TGD+NF + +WP     M    +FDKD 
Sbjct: 609 WLRVNAYLIHDTADWKDLNLKFVLQVYRDYYLTGDQNFLKDMWPVCLAVMESEMKFDKDH 668

Query: 632 DGMIENEGFPDQTYDAWSANGVSAYCGGLWVAALQAASALANDVGDHASASYFWVRYQKA 691
           DG+IEN G+ DQTYD W   G SAYCGGLW+AA+     +A   G       F     + 
Sbjct: 669 DGLIENGGYADQTYDGWVTTGPSAYCGGLWLAAVAVMVQMAALCGAQDIQDKFSSILSRG 728

Query: 692 KAVYDS-LWNGSYFNYDNSDGSSSTSIQADQLAGQWYARACGL----LPIADEAKVKKAL 746
           +  Y+  LWNG Y+NYD+S    S S+ +DQ AGQW+ +ACGL      +     V +AL
Sbjct: 729 QEAYERLLWNGRYYNYDSSSRPQSRSVMSDQCAGQWFLKACGLGEGDTEVFPTPHVVRAL 788

Query: 747 TKIYDFNVLKVKGGMCGAMNGMQPDGRIDMSGLQAREIWPGVTYGLAASMIQEEMVDMAF 806
             I++ NV    GG  GA+NGMQP G  D S +Q+ E+W GV YGLAA+MIQE +    F
Sbjct: 789 QTIFELNVQAFAGGAMGAVNGMQPHGVPDKSSVQSDEVWVGVVYGLAATMIQEGLTWEGF 848

Query: 807 QTAAGVYEVAWSEDGLGYSFQTPESWNNNDEYRSLCYMRPLTIW 850
           QTA G Y   W    LG +FQTPE++     +RSL YMRPL+IW
Sbjct: 849 QTAEGCYRTVWER--LGLAFQTPEAYCQQRVFRSLAYMRPLSIW 890


>gi|338720173|ref|XP_001497796.3| PREDICTED: non-lysosomal glucosylceramidase [Equus caballus]
          Length = 983

 Score =  566 bits (1459), Expect = e-158,   Method: Compositional matrix adjust.
 Identities = 335/816 (41%), Positives = 451/816 (55%), Gaps = 57/816 (6%)

Query: 49  FRHLAHIGFRLYRYSKQE-EAKGKVPVFNVFRKHHITSDQGIPLGGIGAGSIGRSYRGEF 107
            +HL  +G R  ++  Q+ + + K P  +      +    G PLGGIG G+I R +RG+F
Sbjct: 168 IKHLG-MGMRYLQWWYQKTQVEKKTPFIDFMNCVPLRQIYGCPLGGIGGGTITRGWRGQF 226

Query: 108 QRFKLFHGICDDAPVLANQFSVFVSRPNGEKFSSVLCPRSPGVPKKNTDSGIESWDWNLK 167
            R++L  G+     V+A+QF+V + R     +  VL    P V        + SW+W L 
Sbjct: 227 CRWQLNPGMYQHRTVIADQFTVCLRREGQTVYQQVLSVERPRV--------LRSWNWGLC 278

Query: 168 GENCTYHALFPRAWTVYDGEPDPELRIVCRQISPFIPHNYKESSFPASVFTFTLSNSGQT 227
           G    YHAL+PRAWTVY   P   + + CRQI+P +PH+Y++SS P  VF + + N G  
Sbjct: 279 GYFAFYHALYPRAWTVYQ-LPGQNVTLTCRQITPILPHDYQDSSLPVGVFVWDVENEGDE 337

Query: 228 SADVTLLFTWANSVAGDSGLSGHHFNSK-TMTKDG--VHGLTLHHRTANGRPPVTFAVAA 284
           + DV+++F+  N + G     G  +N    + +DG    GL LHH T     P T AVAA
Sbjct: 338 ALDVSIMFSMRNGLGGGDDAPGGLWNEPFCLEQDGETTQGLLLHHPTLPN--PYTMAVAA 395

Query: 285 EETADVHVSECPCFLLSGNSKGITAKDMWNEIKKHGSFDHLDNDKTSPSEPGSSI-GAAI 343
             TAD  V+    F         T + +W ++ + G  D      T P++ G  I GA  
Sbjct: 396 RLTADTTVTHITAF-----DPDSTGQQVWQDLLQDGQLDSPAGPST-PTQKGVGIAGAVC 449

Query: 344 AASLTIPSGSTRSVTFSLAWDCPEVKFFEKV--YHRRYTKFYGTLGDSAARIAHDAILEH 401
            A    P G  R + FSLAWD P + F  K   Y+RRYT+F+G  GD+A  ++H A+  +
Sbjct: 450 VAGKLSPRGRCR-LEFSLAWDMPRIMFGAKGQDYYRRYTRFFGPDGDAAPALSHYALCRY 508

Query: 402 AKWECEIEAWQRPILEDKRFPEWYPITLFNELYYLNAGGTIWTDGSPPMQSLATIRERKF 461
           A WE  I AWQ PIL+D+  P WY   LFNELY+L  GGT+W +   P  SL        
Sbjct: 509 ADWEKRISAWQSPILDDRSLPAWYKSALFNELYFLADGGTVWLE--VPEDSL-------- 558

Query: 462 SLDTPSKIHTPASSDTALGTRLLENGEENIGQFLYLEGAEYVMYNTYDVHFYSSFALVML 521
               P ++   A S   L   L E      G+F YLEG EY MYNTYDVHFY+SFAL+ML
Sbjct: 559 ----PEEL---AGSMCQLRPILQE-----YGRFGYLEGQEYRMYNTYDVHFYASFALIML 606

Query: 522 FPKLELSIQRDFAAAVMMHDPGTMKIMSDGKWVARKCLGAVPHDIG--LDDPWFEINSYN 579
           +PKLELS+Q D A A +  D    + +  G     K    +PHDIG   D+PW  +N+Y 
Sbjct: 607 WPKLELSLQYDMALATLGEDLTRRRYLMSGVMAPVKKRNVIPHDIGDPDDEPWLRVNAYL 666

Query: 580 LFNSSRWKDLNSKFVLQVYRDFVATGDKNFARAVWPSVYIAMAYMEQFDKDGDGMIENEG 639
           + +++ WKDLN KFVLQ+YRD+  TGD+ F R +WP     M    +FDKD DG+IEN G
Sbjct: 667 IHDTADWKDLNLKFVLQIYRDYYLTGDQGFLRDMWPVCLAVMESEMKFDKDQDGLIENGG 726

Query: 640 FPDQTYDAWSANGVSAYCGGLWVAALQAASALANDVGDHASASYFWVRYQKAKAVYDS-L 698
           + DQTYD W   G SAYCGGLW+AA+     +A   G       F     + +  Y+  L
Sbjct: 727 YADQTYDGWVTTGPSAYCGGLWLAAVAVMVQMAALCGAQDVQDRFSSILSRGQEAYERLL 786

Query: 699 WNGSYFNYDNSDGSSSTSIQADQLAGQWYARACGL----LPIADEAKVKKALTKIYDFNV 754
           WNG Y+NYD S    S SI +DQ AGQW+ RACGL      +     + +AL  I++ NV
Sbjct: 787 WNGRYYNYDCSSQPQSRSIMSDQCAGQWFLRACGLGEGDTEVFPTRHIVRALQTIFELNV 846

Query: 755 LKVKGGMCGAMNGMQPDGRIDMSGLQAREIWPGVTYGLAASMIQEEMVDMAFQTAAGVYE 814
               GG  GA+NGMQP G  D + +Q+ E+W GV YGLAA+MIQE +    F+TA G Y 
Sbjct: 847 QAFAGGAMGAVNGMQPHGVPDRTSVQSDEVWVGVVYGLAATMIQEGLTWEGFRTAEGCYR 906

Query: 815 VAWSEDGLGYSFQTPESWNNNDEYRSLCYMRPLTIW 850
             W    LG +FQTPE++     +RSL YMRPL+IW
Sbjct: 907 TVWER--LGLAFQTPEAYCQQQVFRSLAYMRPLSIW 940


>gi|403306664|ref|XP_003943843.1| PREDICTED: non-lysosomal glucosylceramidase isoform 2 [Saimiri
           boliviensis boliviensis]
          Length = 929

 Score =  565 bits (1456), Expect = e-158,   Method: Compositional matrix adjust.
 Identities = 326/821 (39%), Positives = 451/821 (54%), Gaps = 64/821 (7%)

Query: 52  LAHIGFRLYRY----SKQEEAKGKVPVFNVFRKHHITSDQGIPLGGIGAGSIGRSYRGEF 107
           + HIG  L RY     ++ + + K P  ++     +    G PLGGIG G+I R +RG+F
Sbjct: 112 IKHIGMGL-RYLQWWYRKTQVEKKTPFIDMINSVPLRQIYGCPLGGIGGGTITRGWRGQF 170

Query: 108 QRFKLFHG------ICDDAPVLANQFSVFVSRPNGEKFSSVLCPRSPGVPKKNTDSGIES 161
            R++L  G      +  D P+   +F+V + R     +  VL    P V        + S
Sbjct: 171 CRWQLNPGMYQHRTVIADQPICPLKFTVCLRREGQTVYQQVLSLERPNV--------LRS 222

Query: 162 WDWNLKGENCTYHALFPRAWTVYDGEPDPELRIVCRQISPFIPHNYKESSFPASVFTFTL 221
           W+W L G    YHAL+PRAWTVY   P   + + CRQI+P +PH+Y++SS P  VF + +
Sbjct: 223 WNWGLCGYFAFYHALYPRAWTVYQ-LPGQNVTLTCRQITPILPHDYQDSSLPVGVFVWDV 281

Query: 222 SNSGQTSADVTLLFTWANSVAGDSGLSGHHFNSK-TMTKDG--VHGLTLHHRTANGRPPV 278
            N G  + DV+++F+  N + G     G  +N    + +DG  V GL LHH T     P 
Sbjct: 282 ENEGDEALDVSIMFSMRNGLGGGDDAPGGLWNEPFHLERDGETVQGLLLHHPTLPN--PY 339

Query: 279 TFAVAAEETADVHVSECPCFLLSGNSKGITAKDMWNEIKKHGSFDHLDNDKTSPSEPGSS 338
           T A AA  TAD  V+    F   G     T + +W ++ + G  D     +++P++ G  
Sbjct: 340 TMAAAARVTADTMVTHITAFDPDG-----TGQQVWQDLLQDGQLDS-PAGQSTPTQKGVG 393

Query: 339 IGAAIAASLTIPSGSTRSVTFSLAWDCPEVKFFEK--VYHRRYTKFYGTLGDSAARIAHD 396
           +  A+  S  +       + FSLAWD P + F  K  V++RRYT+F+G  GD+A  ++H 
Sbjct: 394 VAGAVCVSSKLRPRGQCCLEFSLAWDMPRIMFGAKGQVHYRRYTRFFGQDGDAAPALSHY 453

Query: 397 AILEHAKWECEIEAWQRPILEDKRFPEWYPITLFNELYYLNAGGTIWTDGSPPMQSLATI 456
           A+  + +WE  I AWQ P+L+D+  P WY   LFNELY+L  GGT+W +          +
Sbjct: 454 ALSRYTEWEERISAWQSPVLDDRSLPAWYKSALFNELYFLADGGTVWLE----------V 503

Query: 457 RERKFSLDTPSKIHTPASSDTALGTRLLENGEENIGQFLYLEGAEYVMYNTYDVHFYSSF 516
           RE             P   D  +    L     + G+F YLEG EY MYNTYDVHFY+SF
Sbjct: 504 REDSL----------PQELDGNMCH--LRPTLRDYGRFGYLEGQEYRMYNTYDVHFYASF 551

Query: 517 ALVMLFPKLELSIQRDFAAAVMMHDPGTMKIMSDGKWVARKCLGAVPHDIGL--DDPWFE 574
           AL+ML+PKLELS+Q D A A +  D    + +  G     K    +PHDIG   D+PW  
Sbjct: 552 ALIMLWPKLELSLQYDMALATLREDLTRRRYLMSGVMAPVKRRNVIPHDIGDPDDEPWLR 611

Query: 575 INSYNLFNSSRWKDLNSKFVLQVYRDFVATGDKNFARAVWPSVYIAMAYMEQFDKDGDGM 634
           +N+Y + +++ WKDLN KFVLQVYRD+  TGD+NF + +WP     M    +FDKD DG+
Sbjct: 612 VNAYLIHDTADWKDLNLKFVLQVYRDYYLTGDQNFLKDMWPVCLAVMESEMKFDKDHDGL 671

Query: 635 IENEGFPDQTYDAWSANGVSAYCGGLWVAALQAASALANDVGDHASASYFWVRYQKAKAV 694
           IEN G+ DQTYD W   G SAYCGGLW+AA+     +A   G       F     + +  
Sbjct: 672 IENGGYADQTYDGWVTTGPSAYCGGLWLAAVAVMVQMAAVCGAQDIQDKFSSILSRGQEA 731

Query: 695 YDS-LWNGSYFNYDNSDGSSSTSIQADQLAGQWYARACGL----LPIADEAKVKKALTKI 749
           Y+  LWNG Y+NYD+S    S S+ +DQ AGQW+ +ACGL      +     V +AL  I
Sbjct: 732 YERLLWNGRYYNYDSSSQPQSRSVMSDQCAGQWFLKACGLGEGDTEVFPTPHVVRALQTI 791

Query: 750 YDFNVLKVKGGMCGAMNGMQPDGRIDMSGLQAREIWPGVTYGLAASMIQEEMVDMAFQTA 809
           ++ NV    GG  GA+NGMQP G  D S +Q+ E+W GV YGLAA+MIQE +    FQTA
Sbjct: 792 FELNVQAFAGGAMGAVNGMQPHGVPDKSSVQSDEVWVGVVYGLAATMIQEGLTWEGFQTA 851

Query: 810 AGVYEVAWSEDGLGYSFQTPESWNNNDEYRSLCYMRPLTIW 850
            G Y   W    LG +FQTPE++     +RSL YMRPL+IW
Sbjct: 852 EGCYRTVWER--LGLAFQTPEAYCQQRVFRSLAYMRPLSIW 890


>gi|426361727|ref|XP_004048051.1| PREDICTED: non-lysosomal glucosylceramidase isoform 2 [Gorilla
           gorilla gorilla]
          Length = 933

 Score =  564 bits (1454), Expect = e-158,   Method: Compositional matrix adjust.
 Identities = 329/824 (39%), Positives = 454/824 (55%), Gaps = 70/824 (8%)

Query: 52  LAHIGFRLYRY----SKQEEAKGKVPVFNVFRKHHITSDQGIPLGGIGAGSIGRSYRGEF 107
           + HIG  L RY     ++ + + K P  ++     +    G PLGGIG G+I R +RG+F
Sbjct: 112 IKHIGMGL-RYLQWWYRKTQVEKKTPFIDMINSVPLRQIYGCPLGGIGGGTITRGWRGQF 170

Query: 108 QRFKLFHG------ICDDAPVLANQFSVFVSRPNGEKFSSVLCPRSPGVPKKNTDSGIES 161
            R++L  G      +  D P+   +F+V + R     +  VL    P V        + +
Sbjct: 171 CRWQLNPGMYQHRTVIADQPICPLKFTVCLRREGQTVYQQVLSLERPSV--------LRT 222

Query: 162 WDWNLKGENCTYHALFPRAWTVYDGEPDPELRIVCRQISPFIPHNYKESSFPASVFTFTL 221
           W+W L G    YHAL+PRAWTVY   P   + + CRQI+P +PH+Y++SS P  VF + +
Sbjct: 223 WNWGLCGYFAFYHALYPRAWTVYQ-LPGQNVTLTCRQITPILPHDYQDSSLPVGVFVWDV 281

Query: 222 SNSGQTSADVTLLFTWANSVAGDSGLSGHHFNSK-TMTKDG--VHGLTLHHRTANGRPPV 278
            N G  + DV+++F+  N + G     G  +N    + + G  V GL LHH T     P 
Sbjct: 282 ENEGDEALDVSIMFSMRNGLGGGDDAPGGLWNEPFCLERSGETVRGLLLHHPTLPN--PY 339

Query: 279 TFAVAAEETADVHVSECPCFLLSGNSKGITAKDMWNEIKKHGSFDHLDNDKTSPSEPGSS 338
           T AVAA  TAD  V+    F         T + +W ++ + G  D     +++P++ G  
Sbjct: 340 TMAVAARVTADTTVTHITAF-----DPDSTGQQVWQDLLQDGQLDS-PTGQSTPTQKGVG 393

Query: 339 IGAAIAASLTI-PSGSTRSVTFSLAWDCPEVKFFEK--VYHRRYTKFYGTLGDSAARIAH 395
           I  A+  S  + P G  R + FSLAWD P + F  K  V++RRYT+F+G  GD+A  ++H
Sbjct: 394 IAGAVCVSSKLRPRGQCR-LEFSLAWDMPRIMFGAKGQVHYRRYTRFFGQDGDAAPALSH 452

Query: 396 DAILEHAKWECEIEAWQRPILEDKRFPEWYPITLFNELYYLNAGGTIWTDGSPPMQSLAT 455
            A+  +A+WE  I AWQ P+L+D+  P WY   LFNELY+L  GGT+W         L  
Sbjct: 453 YALCRYAEWEERISAWQSPVLDDRSLPAWYKSALFNELYFLADGGTVW---------LEV 503

Query: 456 IRERKFSLDTPSKIHTPASSDTALGTRL--LENGEENIGQFLYLEGAEYVMYNTYDVHFY 513
           + +               S    LG  +  L     + G+F YLEG EY MYNTYDVHFY
Sbjct: 504 LED---------------SLPEELGRNMCHLRPTLRDYGRFGYLEGQEYRMYNTYDVHFY 548

Query: 514 SSFALVMLFPKLELSIQRDFAAAVMMHDPGTMKIMSDGKWVARKCLGAVPHDIGL--DDP 571
           +SFAL+ML+PKLELS+Q D A A +  D    + +  G     K    +PHDIG   D+P
Sbjct: 549 ASFALIMLWPKLELSLQYDMALATLREDLTRRRYLMSGVMAPVKRRNVIPHDIGDPDDEP 608

Query: 572 WFEINSYNLFNSSRWKDLNSKFVLQVYRDFVATGDKNFARAVWPSVYIAMAYMEQFDKDG 631
           W  +N+Y + +++ WKDLN KFVLQVYRD+  TGD+NF + +WP     M    +FDKD 
Sbjct: 609 WLRVNAYLIHDTADWKDLNLKFVLQVYRDYYLTGDQNFLKDMWPVCLAVMESEMKFDKDH 668

Query: 632 DGMIENEGFPDQTYDAWSANGVSAYCGGLWVAALQAASALANDVGDHASASYFWVRYQKA 691
           DG+IEN G+ DQTYD W   G SAYCGGLW+AA+     +A   G       F     + 
Sbjct: 669 DGLIENGGYADQTYDGWVTTGPSAYCGGLWLAAVAVMVQMAALCGAQDIQDKFSSILSRG 728

Query: 692 KAVYDS-LWNGSYFNYDNSDGSSSTSIQADQLAGQWYARACGL----LPIADEAKVKKAL 746
           +  Y+  LWNG Y+NYD+S    S S+ +DQ AGQW+ +ACGL      +     V +AL
Sbjct: 729 QEAYERLLWNGRYYNYDSSSRPQSRSVMSDQCAGQWFLKACGLGEGDTEVFPTQHVVRAL 788

Query: 747 TKIYDFNVLKVKGGMCGAMNGMQPDGRIDMSGLQAREIWPGVTYGLAASMIQEEMVDMAF 806
             I++ NV    GG  GA+NGMQP G  D S +Q+ E+W GV YGLAA+MIQE +    F
Sbjct: 789 QTIFELNVQAFAGGAMGAVNGMQPHGVPDKSSVQSDEVWVGVVYGLAATMIQEGLTWEGF 848

Query: 807 QTAAGVYEVAWSEDGLGYSFQTPESWNNNDEYRSLCYMRPLTIW 850
           QTA G Y   W    LG +FQTPE++     +RSL YMRPL+IW
Sbjct: 849 QTAEGCYRTVWER--LGLAFQTPEAYCQQRVFRSLAYMRPLSIW 890


>gi|224090236|ref|XP_002190896.1| PREDICTED: non-lysosomal glucosylceramidase [Taeniopygia guttata]
          Length = 826

 Score =  564 bits (1454), Expect = e-158,   Method: Compositional matrix adjust.
 Identities = 325/783 (41%), Positives = 450/783 (57%), Gaps = 49/783 (6%)

Query: 88  GIPLGGIGAGSIGRSYRGEFQRFKLFHGICDDAPVLANQFSVFVSRPNGEKFSSVLCPRS 147
           G PLGGIG G+I R +RGEF R++L  G+     V+ANQF+V +       +  VL    
Sbjct: 65  GCPLGGIGGGTITRGWRGEFCRWQLNPGLYHYGTVIANQFTVCLRCKGQTIYQQVLSMER 124

Query: 148 PGVPKKNTDSGIESWDWNLKGENCTYHALFPRAWTVYDGEPDPELRIVCRQISPFIPHNY 207
           P        S ++ W+W   G    YHAL+PRAW VY+  P  ++ + CRQ+SP IPH+Y
Sbjct: 125 P--------SSLQGWNWGYCGHYAFYHALYPRAWLVYE-LPGQQVVLTCRQVSPVIPHDY 175

Query: 208 KESSFPASVFTFTLSNSGQTSADVTLLFTWANSVAGDSGLSGHHFNSK-TMTKDG--VHG 264
           K+SS P  VF + + N      +V+++F+  N +      SG H+N   T  KDG  V G
Sbjct: 176 KDSSLPVGVFIWEVENRRDEDVEVSIMFSLQNGMGTKEDGSGGHWNEPFTFQKDGERVAG 235

Query: 265 LTLHHRTANGRPPVTFAVAAEETADVHVSECPCFLLSGNSKGITAKDMWNEIKKHGSFDH 324
           + LHH  A    P T A++A E A   V+    F    N  G+  +++W ++ + G  D 
Sbjct: 236 VLLHHCPAVN--PFTLAISAREKAGTGVTHVTAF----NPTGL-GREVWQDLLQDGRLDS 288

Query: 325 LDNDKTSPSEPGSSIGAAIAASLTIPSGSTRSVTFSLAWDCPEVKF--FEKVYHRRYTKF 382
               K+ P+E G    AA+ AS  +P+   +++  +LAWD P V F   EK++ RRYT+F
Sbjct: 289 -PAGKSRPTEKGEVTAAAVCASCRVPAHGHKTLEMALAWDMPCVHFGSKEKLHVRRYTRF 347

Query: 383 YGTLGDSAARIAHDAILEHAKWECEIEAWQRPILEDKRFPEWYPITLFNELYYLNAGGTI 442
           +G+ GD+A  ++H A+  + +WE +IEAWQ PILE+ + P WY   LFNELY++  GGTI
Sbjct: 348 FGSKGDAAPTLSHYALTHYEEWERKIEAWQNPILENSQLPAWYKSALFNELYFMTDGGTI 407

Query: 443 WTDGSPPMQSLATIRERKFSLDTPSKIHTPASSDTALGTRLLENGEENIGQFLYLEGAEY 502
           W +  P                    +  PA +    G   L       G+F YLEG EY
Sbjct: 408 WVEVPPDC--------------CAEDLQGPAGA----GLSHLLPVLREYGRFAYLEGQEY 449

Query: 503 VMYNTYDVHFYSSFALVMLFPKLELSIQRDFAAAVMMHDPGTMKIMSDGKWVARKCLGAV 562
            MYNTYDVHFY+SFAL+ML+PKL++S+Q D A  V+  D    + +  G+    K    V
Sbjct: 450 RMYNTYDVHFYASFALIMLWPKLQISLQYDIAVTVVNEDVQPRQYLMGGQTAQVKMKNVV 509

Query: 563 PHDIG--LDDPWFEINSYNLFNSSRWKDLNSKFVLQVYRDFVATGDKNFARAVWPSVYIA 620
           PHDIG   D+PW  +N+Y + +++ WKDLN KFVLQVYRD+  T D  + + +WP     
Sbjct: 510 PHDIGDPGDEPWQRVNAYLMHDTADWKDLNLKFVLQVYRDYYLTHDSLYLQDMWPVCQAV 569

Query: 621 MAYMEQFDKDGDGMIENEGFPDQTYDAWSANGVSAYCGGLWVAALQAASALANDVGDHAS 680
           M    +FD D DG+IEN GF DQTYDAW      AYCGGLW+AA+     +A  +GD   
Sbjct: 570 MESELKFDTDNDGLIENGGFADQTYDAWVVIRRCAYCGGLWLAAVCMMCKMAEVLGDTEI 629

Query: 681 ASYFWVRYQKAKAVYDS-LWNGSYFNYDNSDGSSSTSIQADQLAGQWYARACGL----LP 735
              +     K K  ++  LWNG Y+NYD+S   +S+SI +DQ AGQW+  ACGL      
Sbjct: 630 RQKYLGILNKGKEAFERMLWNGKYYNYDSSGSDTSSSIMSDQCAGQWFLGACGLDQGEFE 689

Query: 736 IADEAKVKKALTKIYDFNVLKVKGGMCGAMNGMQPDGRIDMSGLQAREIWPGVTYGLAAS 795
           +  ++ +  AL  I++ NVL   GG  GA+NGM+PDG  D S +Q+ E+W GV Y LAA+
Sbjct: 690 VFPKSHILSALRTIFEKNVLSFAGGTMGAVNGMRPDGVPDTSSVQSNEVWVGVVYSLAAT 749

Query: 796 MIQEEMVDMAFQTAAGVYEVAWSEDGLGYSFQTPESWNNNDEYRSLCYMRPLTIWAMQWA 855
           MIQE MV+  F+TA G Y   W    LG +FQTPE++     YRSL YMRPL+IW+MQ A
Sbjct: 750 MIQEGMVEEGFRTAEGCYRTVWER--LGMAFQTPEAYREKKVYRSLAYMRPLSIWSMQLA 807

Query: 856 LTK 858
           L +
Sbjct: 808 LER 810


>gi|50511011|dbj|BAD32491.1| mKIAA1605 protein [Mus musculus]
          Length = 940

 Score =  564 bits (1453), Expect = e-158,   Method: Compositional matrix adjust.
 Identities = 329/816 (40%), Positives = 455/816 (55%), Gaps = 59/816 (7%)

Query: 50  RHLAHIGFRLYRY-SKQEEAKGKVPVFNVFRKHHITSDQGIPLGGIGAGSIGRSYRGEFQ 108
           +HL  +G R  ++  ++   + K P  ++     +    G PLGGIG G+I R +RG+F 
Sbjct: 126 KHLG-MGLRYLKWWYRKTHVEKKTPFIDMLNSLPLRQIYGCPLGGIGGGTITRGWRGQFC 184

Query: 109 RFKLFHGICDDAPVLANQFSVFVSRPNGEKFSSVLCPRSPGVPKKNTDSGIESWDWNLKG 168
           R++L  G+     V+A+QF V + R     +  VL    P V        + SW+W L G
Sbjct: 185 RWQLNPGMYQHQTVIADQFIVCLRRDGRTVYQQVLSLELPNV--------LRSWNWGLCG 236

Query: 169 ENCTYHALFPRAWTVYDGEPDPELRIVCRQISPFIPHNYKESSFPASVFTFTLSNSGQTS 228
               YHAL+PRAWTVY   P   + + CRQ++P +PH+Y++SS P  VF + + N G  +
Sbjct: 237 YFAFYHALYPRAWTVYQ-LPGQNVTLTCRQVTPILPHDYQDSSLPVGVFVWDVENEGDET 295

Query: 229 ADVTLLFTWANSVAGDSGLSGHHFNSKTMTKDG---VHGLTLHHRTANGRPPVTFAVAAE 285
            DV++ F+  N + G+   +G  +N     + G   V GL LHH T     P T AVAA 
Sbjct: 296 LDVSITFSMRNGLGGEDDAAGSLWNEPFRLEQGGTTVQGLLLHHPTPPN--PYTMAVAAR 353

Query: 286 ETADVHVSECPCFLLSGNSKGITAKDMWNEIKKHGSFDHLDNDKTSPSEPGSSIGAAIAA 345
            TAD  V+    F  +G     T + +W ++ + G  D     +++P++ G  I  A+  
Sbjct: 354 CTADTTVTHTTAFDPNG-----TGQQVWQDLLQDGQLDS-PAGQSTPTQKGEGIAGAVCV 407

Query: 346 SLTIPSGSTRSVTFSLAWDCPEVKFFEK--VYHRRYTKFYGTLGDSAARIAHDAILEHAK 403
           S  +   S   + FSLAWD P++ F  K  V++RRYT+F+G+ GD A  ++H A+  +A 
Sbjct: 408 SSKLLPRSRCCLEFSLAWDMPKIMFGAKSQVHYRRYTRFFGSDGDVAPALSHYALCHYAD 467

Query: 404 WECEIEAWQRPILEDKRFPEWYPITLFNELYYLNAGGTIWTDGSPPMQSLATIRERKFSL 463
           WE  I AWQ P+L+D+  P WY   LFNELY+L  GGT+W                   L
Sbjct: 468 WEDRISAWQNPVLDDRTLPAWYKSALFNELYFLADGGTVW-------------------L 508

Query: 464 DTPSKIHTPASSDTALG--TRLLENGEENIGQFLYLEGAEYVMYNTYDVHFYSSFALVML 521
           + P+      S    LG   R L +  ++ G+F YLEG EY MYNTYDVHFY+SFALVML
Sbjct: 509 EVPAD-----SLPEGLGGSMRQLRSTLQDYGRFGYLEGQEYRMYNTYDVHFYASFALVML 563

Query: 522 FPKLELSIQRDFAAAVMMHDPGTMKIMSDGKWVARKCLGAVPHDIGL--DDPWFEINSYN 579
           +PKLELS+Q D A A +  D    + +  G     K    +PHDIG   D+PW  +N+Y 
Sbjct: 564 WPKLELSLQYDMALATLKEDLTRRRYLMSGVVAPVKRRNVIPHDIGDPDDEPWLRVNAYL 623

Query: 580 LFNSSRWKDLNSKFVLQVYRDFVATGDKNFARAVWPSVYIAMAYMEQFDKDGDGMIENEG 639
           + +++ WKDLN KFVLQ+YRD+  TGD+ F   +WP     M    +FDKD DG+IEN G
Sbjct: 624 IHDTADWKDLNLKFVLQIYRDYYLTGDQGFLEDMWPVCLAVMESEMKFDKDQDGLIENGG 683

Query: 640 FPDQTYDAWSANGVSAYCGGLWVAALQAASALANDVGDHASASYFWVRYQKAKAVYDS-L 698
           + DQTYDAW   G SAYCGGLW+AA+     +A   G       F     + +  Y+  L
Sbjct: 684 YADQTYDAWVTTGPSAYCGGLWLAAVAVMVQMAVLCGAQDVQERFASILCRGREAYERLL 743

Query: 699 WNGSYFNYDNSDGSSSTSIQADQLAGQWYARACGL----LPIADEAKVKKALTKIYDFNV 754
           WNG Y+NYD+S    S SI +DQ AGQW+ RACGL      +     V +AL  I++ NV
Sbjct: 744 WNGRYYNYDSSSHPQSRSIMSDQCAGQWFLRACGLGEGDTEVFPTLHVVRALQTIFELNV 803

Query: 755 LKVKGGMCGAMNGMQPDGRIDMSGLQAREIWPGVTYGLAASMIQEEMVDMAFQTAAGVYE 814
               GG  GA+NGM P G  D S +Q+ E+W GV YGLAA+MIQE +    F+TA G Y 
Sbjct: 804 QAFAGGAMGAVNGMHPHGVPDRSSVQSDEVWVGVVYGLAATMIQEGLTWEGFRTAEGCYR 863

Query: 815 VAWSEDGLGYSFQTPESWNNNDEYRSLCYMRPLTIW 850
             W    LG +FQTPE++     +RSL YMRPL+IW
Sbjct: 864 TVWER--LGLAFQTPEAYCQQQVFRSLAYMRPLSIW 897


>gi|240120073|ref|NP_766280.2| non-lysosomal glucosylceramidase [Mus musculus]
 gi|143018443|sp|Q69ZF3.2|GBA2_MOUSE RecName: Full=Non-lysosomal glucosylceramidase; Short=NLGase;
           AltName: Full=Beta-glucocerebrosidase 2;
           Short=Beta-glucosidase 2; AltName:
           Full=Glucosylceramidase 2
 gi|34784135|gb|AAH56935.1| Glucosidase beta 2 [Mus musculus]
          Length = 918

 Score =  563 bits (1452), Expect = e-157,   Method: Compositional matrix adjust.
 Identities = 329/816 (40%), Positives = 455/816 (55%), Gaps = 59/816 (7%)

Query: 50  RHLAHIGFRLYRY-SKQEEAKGKVPVFNVFRKHHITSDQGIPLGGIGAGSIGRSYRGEFQ 108
           +HL  +G R  ++  ++   + K P  ++     +    G PLGGIG G+I R +RG+F 
Sbjct: 104 KHLG-MGLRYLKWWYRKTHVEKKTPFIDMLNSLPLRQIYGCPLGGIGGGTITRGWRGQFC 162

Query: 109 RFKLFHGICDDAPVLANQFSVFVSRPNGEKFSSVLCPRSPGVPKKNTDSGIESWDWNLKG 168
           R++L  G+     V+A+QF V + R     +  VL    P V        + SW+W L G
Sbjct: 163 RWQLNPGMYQHQTVIADQFIVCLRRDGRTVYQQVLSLELPNV--------LRSWNWGLCG 214

Query: 169 ENCTYHALFPRAWTVYDGEPDPELRIVCRQISPFIPHNYKESSFPASVFTFTLSNSGQTS 228
               YHAL+PRAWTVY   P   + + CRQ++P +PH+Y++SS P  VF + + N G  +
Sbjct: 215 YFAFYHALYPRAWTVYQ-LPGQNVTLTCRQVTPILPHDYQDSSLPVGVFVWDVENEGDET 273

Query: 229 ADVTLLFTWANSVAGDSGLSGHHFNSKTMTKDG---VHGLTLHHRTANGRPPVTFAVAAE 285
            DV++ F+  N + G+   +G  +N     + G   V GL LHH T     P T AVAA 
Sbjct: 274 LDVSITFSMRNGLGGEDDAAGSLWNEPFRLEQGGTTVQGLLLHHPTPPN--PYTMAVAAR 331

Query: 286 ETADVHVSECPCFLLSGNSKGITAKDMWNEIKKHGSFDHLDNDKTSPSEPGSSIGAAIAA 345
            TAD  V+    F  +G     T + +W ++ + G  D     +++P++ G  I  A+  
Sbjct: 332 CTADTTVTHTTAFDPNG-----TGQQVWQDLLQDGQLDS-PAGQSTPTQKGEGIAGAVCV 385

Query: 346 SLTIPSGSTRSVTFSLAWDCPEVKFFEK--VYHRRYTKFYGTLGDSAARIAHDAILEHAK 403
           S  +   S   + FSLAWD P++ F  K  V++RRYT+F+G+ GD A  ++H A+  +A 
Sbjct: 386 SSKLLPRSRCCLEFSLAWDMPKIMFGAKSQVHYRRYTRFFGSDGDVAPALSHYALCHYAD 445

Query: 404 WECEIEAWQRPILEDKRFPEWYPITLFNELYYLNAGGTIWTDGSPPMQSLATIRERKFSL 463
           WE  I AWQ P+L+D+  P WY   LFNELY+L  GGT+W                   L
Sbjct: 446 WEDRISAWQNPVLDDRTLPAWYKSALFNELYFLADGGTVW-------------------L 486

Query: 464 DTPSKIHTPASSDTALG--TRLLENGEENIGQFLYLEGAEYVMYNTYDVHFYSSFALVML 521
           + P+      S    LG   R L +  ++ G+F YLEG EY MYNTYDVHFY+SFALVML
Sbjct: 487 EVPAD-----SLPEGLGGSMRQLRSTLQDYGRFGYLEGQEYRMYNTYDVHFYASFALVML 541

Query: 522 FPKLELSIQRDFAAAVMMHDPGTMKIMSDGKWVARKCLGAVPHDIGL--DDPWFEINSYN 579
           +PKLELS+Q D A A +  D    + +  G     K    +PHDIG   D+PW  +N+Y 
Sbjct: 542 WPKLELSLQYDMALATLKEDLTRRRYLMSGVVAPVKRRNVIPHDIGDPDDEPWLRVNAYL 601

Query: 580 LFNSSRWKDLNSKFVLQVYRDFVATGDKNFARAVWPSVYIAMAYMEQFDKDGDGMIENEG 639
           + +++ WKDLN KFVLQ+YRD+  TGD+ F   +WP     M    +FDKD DG+IEN G
Sbjct: 602 IHDTADWKDLNLKFVLQIYRDYYLTGDQGFLEDMWPVCLAVMESEMKFDKDQDGLIENGG 661

Query: 640 FPDQTYDAWSANGVSAYCGGLWVAALQAASALANDVGDHASASYFWVRYQKAKAVYDS-L 698
           + DQTYDAW   G SAYCGGLW+AA+     +A   G       F     + +  Y+  L
Sbjct: 662 YADQTYDAWVTTGPSAYCGGLWLAAVAVMVQMAVLCGAQDVQERFASILCRGREAYERLL 721

Query: 699 WNGSYFNYDNSDGSSSTSIQADQLAGQWYARACGL----LPIADEAKVKKALTKIYDFNV 754
           WNG Y+NYD+S    S SI +DQ AGQW+ RACGL      +     V +AL  I++ NV
Sbjct: 722 WNGRYYNYDSSSHPQSRSIMSDQCAGQWFLRACGLGEGDTEVFPTLHVVRALQTIFELNV 781

Query: 755 LKVKGGMCGAMNGMQPDGRIDMSGLQAREIWPGVTYGLAASMIQEEMVDMAFQTAAGVYE 814
               GG  GA+NGM P G  D S +Q+ E+W GV YGLAA+MIQE +    F+TA G Y 
Sbjct: 782 QAFAGGAMGAVNGMHPHGVPDRSSVQSDEVWVGVVYGLAATMIQEGLTWEGFRTAEGCYR 841

Query: 815 VAWSEDGLGYSFQTPESWNNNDEYRSLCYMRPLTIW 850
             W    LG +FQTPE++     +RSL YMRPL+IW
Sbjct: 842 TVWER--LGLAFQTPEAYCQQQVFRSLAYMRPLSIW 875


>gi|148670514|gb|EDL02461.1| glucosidase beta 2, isoform CRA_a [Mus musculus]
          Length = 918

 Score =  563 bits (1451), Expect = e-157,   Method: Compositional matrix adjust.
 Identities = 329/816 (40%), Positives = 455/816 (55%), Gaps = 59/816 (7%)

Query: 50  RHLAHIGFRLYRY-SKQEEAKGKVPVFNVFRKHHITSDQGIPLGGIGAGSIGRSYRGEFQ 108
           +HL  +G R  ++  ++   + K P  ++     +    G PLGGIG G+I R +RG+F 
Sbjct: 104 KHLG-MGLRYLKWWYRKTHVEKKTPFIDMLNSLPLRQIYGCPLGGIGGGTITRGWRGQFC 162

Query: 109 RFKLFHGICDDAPVLANQFSVFVSRPNGEKFSSVLCPRSPGVPKKNTDSGIESWDWNLKG 168
           R++L  G+     V+A+QF V + R     +  VL    P V        + SW+W L G
Sbjct: 163 RWQLNPGMYQHQTVIADQFIVCLRRDGRTVYQQVLSLELPNV--------LRSWNWGLCG 214

Query: 169 ENCTYHALFPRAWTVYDGEPDPELRIVCRQISPFIPHNYKESSFPASVFTFTLSNSGQTS 228
               YHAL+PRAWTVY   P   + + CRQ++P +PH+Y++SS P  VF + + N G  +
Sbjct: 215 YFAFYHALYPRAWTVYQ-LPGQNVTLTCRQVTPILPHDYQDSSLPVGVFVWDVENEGDET 273

Query: 229 ADVTLLFTWANSVAGDSGLSGHHFNSKTMTKDG---VHGLTLHHRTANGRPPVTFAVAAE 285
            DV++ F+  N + G+   +G  +N     + G   V GL LHH T     P T AVAA 
Sbjct: 274 LDVSITFSMRNGLGGEDDAAGSLWNEPFRLEQGGTTVQGLLLHHPTPPN--PYTMAVAAR 331

Query: 286 ETADVHVSECPCFLLSGNSKGITAKDMWNEIKKHGSFDHLDNDKTSPSEPGSSIGAAIAA 345
            TAD  V+    F  +G     T + +W ++ + G  D     +++P++ G  I  A+  
Sbjct: 332 CTADTTVTHTTAFDPNG-----TGQQVWQDLLQDGQLDS-PAGQSTPTQKGEGIAGAVCV 385

Query: 346 SLTIPSGSTRSVTFSLAWDCPEVKFFEK--VYHRRYTKFYGTLGDSAARIAHDAILEHAK 403
           S  +   S   + FSLAWD P++ F  K  V++RRYT+F+G+ GD A  ++H A+  +A 
Sbjct: 386 SSKLLPRSRCCLEFSLAWDMPKIMFGAKSQVHYRRYTRFFGSDGDVAPALSHYALCHYAD 445

Query: 404 WECEIEAWQRPILEDKRFPEWYPITLFNELYYLNAGGTIWTDGSPPMQSLATIRERKFSL 463
           WE  I AWQ P+L+D+  P WY   LFNELY+L  GGT+W                   L
Sbjct: 446 WEDRISAWQNPVLDDRTLPAWYKSALFNELYFLADGGTVW-------------------L 486

Query: 464 DTPSKIHTPASSDTALG--TRLLENGEENIGQFLYLEGAEYVMYNTYDVHFYSSFALVML 521
           + P+      S    LG   R L +  ++ G+F YLEG EY MYNTYDVHFY+SFALVML
Sbjct: 487 EVPAD-----SLPEGLGGSMRQLRSTLQDYGRFGYLEGQEYRMYNTYDVHFYASFALVML 541

Query: 522 FPKLELSIQRDFAAAVMMHDPGTMKIMSDGKWVARKCLGAVPHDIGL--DDPWFEINSYN 579
           +PKLELS+Q D A A +  D    + +  G     K    +PHDIG   D+PW  +N+Y 
Sbjct: 542 WPKLELSLQYDMALATLKEDLTRRRYLMSGVVAPVKRRNVIPHDIGDPDDEPWLRVNAYL 601

Query: 580 LFNSSRWKDLNSKFVLQVYRDFVATGDKNFARAVWPSVYIAMAYMEQFDKDGDGMIENEG 639
           + +++ WKDLN KFVLQ+YRD+  TGD+ F   +WP     M    +FDKD DG+IEN G
Sbjct: 602 IHDTADWKDLNLKFVLQIYRDYYLTGDQGFLEDMWPVCLAVMESEMKFDKDQDGLIENGG 661

Query: 640 FPDQTYDAWSANGVSAYCGGLWVAALQAASALANDVGDHASASYFWVRYQKAKAVYDS-L 698
           + DQTYDAW   G SAYCGGLW+AA+     +A   G       F     + +  Y+  L
Sbjct: 662 YADQTYDAWVTTGPSAYCGGLWLAAVAVMVQMAVLCGAQDVQERFASILCRGREAYERLL 721

Query: 699 WNGSYFNYDNSDGSSSTSIQADQLAGQWYARACGL----LPIADEAKVKKALTKIYDFNV 754
           WNG Y+NYD+S    S SI +DQ AGQW+ RACGL      +     V +AL  I++ NV
Sbjct: 722 WNGRYYNYDSSSHPQSRSIMSDQCAGQWFLRACGLGEGDTEVFPTLHVVRALQTIFELNV 781

Query: 755 LKVKGGMCGAMNGMQPDGRIDMSGLQAREIWPGVTYGLAASMIQEEMVDMAFQTAAGVYE 814
               GG  GA+NGM P G  D S +Q+ E+W GV YGLAA+MIQE +    F+TA G Y 
Sbjct: 782 QAFAGGAMGAVNGMHPHGVPDRSSVQSDEVWVGVVYGLAATMIQEGLTWEGFRTAEGCYR 841

Query: 815 VAWSEDGLGYSFQTPESWNNNDEYRSLCYMRPLTIW 850
             W    LG +FQTPE++     +RSL YMRPL+IW
Sbjct: 842 TVWER--LGLAFQTPEAYCQQQVFRSLAYMRPLSIW 875


>gi|26354314|dbj|BAC40785.1| unnamed protein product [Mus musculus]
          Length = 918

 Score =  563 bits (1450), Expect = e-157,   Method: Compositional matrix adjust.
 Identities = 329/816 (40%), Positives = 455/816 (55%), Gaps = 59/816 (7%)

Query: 50  RHLAHIGFRLYRY-SKQEEAKGKVPVFNVFRKHHITSDQGIPLGGIGAGSIGRSYRGEFQ 108
           +HL  +G R  ++  ++   + K P  ++     +    G PLGGIG G+I R +RG+F 
Sbjct: 104 KHLG-MGLRYLKWWYRKTHVEKKTPFIDMLNSLPLRQIYGCPLGGIGGGTITRGWRGQFC 162

Query: 109 RFKLFHGICDDAPVLANQFSVFVSRPNGEKFSSVLCPRSPGVPKKNTDSGIESWDWNLKG 168
           R++L  G+     V+A+QF V + R     +  VL    P V        + SW+W L G
Sbjct: 163 RWQLNPGMYQHQTVIADQFIVCLRRDGRTVYQQVLSLELPNV--------LRSWNWGLCG 214

Query: 169 ENCTYHALFPRAWTVYDGEPDPELRIVCRQISPFIPHNYKESSFPASVFTFTLSNSGQTS 228
               YHAL+PRAWTVY   P   + + CRQ++P +PH+Y++SS P  VF + + N G  +
Sbjct: 215 YFAFYHALYPRAWTVYQ-LPGQNVTLTCRQVTPILPHDYQDSSLPVGVFVWDVENEGDET 273

Query: 229 ADVTLLFTWANSVAGDSGLSGHHFNSKTMTKDG---VHGLTLHHRTANGRPPVTFAVAAE 285
            DV++ F+  N + G+   +G  +N     + G   V GL LHH T     P T AVAA 
Sbjct: 274 LDVSITFSMRNGLGGEDDAAGSLWNEPFRLEQGGTTVQGLLLHHPTPPN--PYTMAVAAR 331

Query: 286 ETADVHVSECPCFLLSGNSKGITAKDMWNEIKKHGSFDHLDNDKTSPSEPGSSIGAAIAA 345
            TAD  V+    F  +G     T + +W ++ + G  D     +++P++ G  I  A+  
Sbjct: 332 CTADTTVTHTTAFDPNG-----TGQQVWQDLLQDGQLDS-PAGQSTPTQKGEGIAGAVCI 385

Query: 346 SLTIPSGSTRSVTFSLAWDCPEVKFFEK--VYHRRYTKFYGTLGDSAARIAHDAILEHAK 403
           S  +   S   + FSLAWD P++ F  K  V++RRYT+F+G+ GD A  ++H A+  +A 
Sbjct: 386 SSKLLPRSRCCLEFSLAWDMPKIMFGAKSQVHYRRYTRFFGSDGDVAPALSHYALCHYAD 445

Query: 404 WECEIEAWQRPILEDKRFPEWYPITLFNELYYLNAGGTIWTDGSPPMQSLATIRERKFSL 463
           WE  I AWQ P+L+D+  P WY   LFNELY+L  GGT+W                   L
Sbjct: 446 WEDRISAWQNPVLDDRTLPAWYKSALFNELYFLADGGTVW-------------------L 486

Query: 464 DTPSKIHTPASSDTALG--TRLLENGEENIGQFLYLEGAEYVMYNTYDVHFYSSFALVML 521
           + P+      S    LG   R L +  ++ G+F YLEG EY MYNTYDVHFY+SFALVML
Sbjct: 487 EVPAD-----SLPEGLGGSMRQLRSTLQDYGRFGYLEGQEYRMYNTYDVHFYASFALVML 541

Query: 522 FPKLELSIQRDFAAAVMMHDPGTMKIMSDGKWVARKCLGAVPHDIGL--DDPWFEINSYN 579
           +PKLELS+Q D A A +  D    + +  G     K    +PHDIG   D+PW  +N+Y 
Sbjct: 542 WPKLELSLQYDMALATLKEDLTRRRYLMSGVVAPVKRRNVIPHDIGDPDDEPWLRVNAYL 601

Query: 580 LFNSSRWKDLNSKFVLQVYRDFVATGDKNFARAVWPSVYIAMAYMEQFDKDGDGMIENEG 639
           + +++ WKDLN KFVLQ+YRD+  TGD+ F   +WP     M    +FDKD DG+IEN G
Sbjct: 602 IHDTADWKDLNLKFVLQIYRDYYLTGDQGFLEDMWPVCLAVMESEMKFDKDQDGLIENGG 661

Query: 640 FPDQTYDAWSANGVSAYCGGLWVAALQAASALANDVGDHASASYFWVRYQKAKAVYDS-L 698
           + DQTYDAW   G SAYCGGLW+AA+     +A   G       F     + +  Y+  L
Sbjct: 662 YADQTYDAWVTTGPSAYCGGLWLAAVAVMVQMAVLCGAQDVQERFASILCRGREAYERLL 721

Query: 699 WNGSYFNYDNSDGSSSTSIQADQLAGQWYARACGL----LPIADEAKVKKALTKIYDFNV 754
           WNG Y+NYD+S    S SI +DQ AGQW+ RACGL      +     V +AL  I++ NV
Sbjct: 722 WNGRYYNYDSSSHPQSRSIMSDQCAGQWFLRACGLGEGDTEVFPTLHVVRALQTIFELNV 781

Query: 755 LKVKGGMCGAMNGMQPDGRIDMSGLQAREIWPGVTYGLAASMIQEEMVDMAFQTAAGVYE 814
               GG  GA+NGM P G  D S +Q+ E+W GV YGLAA+MIQE +    F+TA G Y 
Sbjct: 782 QAFAGGAMGAVNGMHPHGVPDRSSVQSDEVWVGVVYGLAATMIQEGLTWEGFRTAEGCYR 841

Query: 815 VAWSEDGLGYSFQTPESWNNNDEYRSLCYMRPLTIW 850
             W    LG +FQTPE++     +RSL YMRPL+IW
Sbjct: 842 TVWER--LGLAFQTPEAYCQQQVFRSLAYMRPLSIW 875


>gi|143018484|sp|Q5M868.2|GBA2_RAT RecName: Full=Non-lysosomal glucosylceramidase; Short=NLGase;
           AltName: Full=Beta-glucocerebrosidase 2;
           Short=Beta-glucosidase 2; AltName:
           Full=Glucosylceramidase 2
          Length = 912

 Score =  561 bits (1446), Expect = e-157,   Method: Compositional matrix adjust.
 Identities = 334/818 (40%), Positives = 457/818 (55%), Gaps = 64/818 (7%)

Query: 52  LAHIGFRLYRYSK----QEEAKGKVPVFNVFRKHHITSDQGIPLGGIGAGSIGRSYRGEF 107
           + H G  L RY K    + + + K P  ++F    +    G PLGGIG G+I R +RG+F
Sbjct: 103 VKHFGMGL-RYLKWWYRKTQVEKKTPFIDMFNSVPLRQIYGCPLGGIGGGTITRGWRGQF 161

Query: 108 QRFKLFHGICDDAPVLANQFSVFVSRPNGEKFSSVLCPRSPGVPKKNTDSGIESWDWNLK 167
            R++L  G+     V+A+QF V + R     +  VL    P V        + SW+W L 
Sbjct: 162 CRWQLNPGMYQHQTVIADQFIVCLRRDGRTVYQQVLSLELPSV--------LRSWNWGLC 213

Query: 168 GENCTYHALFPRAWTVYDGEPDPELRIVCRQISPFIPHNYKESSFPASVFTFTLSNSGQT 227
           G    YHAL+PRAWTVY   P   + + CRQI+P +PH+Y++SS P  VF + + N G  
Sbjct: 214 GYFAFYHALYPRAWTVYQ-LPGQNVTLTCRQITPILPHDYQDSSLPVGVFVWDVENEGDE 272

Query: 228 SADVTLLFTWANSVAGDSGLSGHHFNSK-TMTKDG--VHGLTLHHRTANGRPPVTFAVAA 284
           + DV+++F+  N + G+   +G  +N    + +DG  V GL LHH T     P T AVAA
Sbjct: 273 TLDVSIMFSMRNGLGGEDDAAGGLWNEPFRLEQDGTTVQGLLLHHPTPPN--PYTMAVAA 330

Query: 285 EETADVHVSECPCFLLSGNSKGITAKDMWNEIKKHGSFDHLDNDKTSPSEPGSSI-GAAI 343
             TAD  V+    F         T + +W ++ + G  D     +++P++ G  + GA  
Sbjct: 331 RHTADTTVTYTTAF-----DPDSTGQQVWQDLLQDGQLDS-PAGQSTPTQRGEGVAGAVC 384

Query: 344 AASLTIPSGSTRSVTFSLAWDCPEVKFFEK--VYHRRYTKFYGTLGDSAARIAHDAILEH 401
           A+S  +P G    + FSLAWD P + F  K  V++RRYT+F+G+ GD A  ++H A+ ++
Sbjct: 385 ASSKLLPRGRC-CLEFSLAWDMPRIMFGAKGQVHYRRYTRFFGSDGDVAPALSHYALCQY 443

Query: 402 AKWECEIEAWQRPILEDKRFPEWYPITLFNELYYLNAGGTIWTDGSPPMQSLATIRERKF 461
           A WE  I AWQ P+L+D+  P WY   LFNELY+L  GGT+W                  
Sbjct: 444 AGWENSISAWQNPVLDDRSLPAWYKSALFNELYFLADGGTVW------------------ 485

Query: 462 SLDTPSKIHTPASSDTALGTRL--LENGEENIGQFLYLEGAEYVMYNTYDVHFYSSFALV 519
            L+ P       S    LG  +  L    ++ G+F YLEG EY MYNTYDVHFY+SFALV
Sbjct: 486 -LEVPED-----SLPEELGGSMYQLRPILQDYGRFGYLEGQEYRMYNTYDVHFYASFALV 539

Query: 520 MLFPKLELSIQRDFAAAVMMHDPGTMKIMSDGKWVARKCLGAVPHDIGL--DDPWFEINS 577
           ML+PKLELS+Q D A A    D    + +  G     K    +PHDIG   D+PW  +N+
Sbjct: 540 MLWPKLELSLQYDMALATFKEDLTRRRYLMSGVVAPVKRRNVIPHDIGDPDDEPWLRVNA 599

Query: 578 YNLFNSSRWKDLNSKFVLQVYRDFVATGDKNFARAVWPSVYIAMAYMEQFDKDGDGMIEN 637
           Y + +++ WKDLN KFVLQVYRD+  TGD+ F + +WP     M    +FDKD DG+IEN
Sbjct: 600 YLIHDTADWKDLNLKFVLQVYRDYYLTGDQGFLKDMWPVCLAVMESEMKFDKDQDGLIEN 659

Query: 638 EGFPDQTYDAWSANGVSAYCGGLWVAALQAASALANDVGDHASASYFWVRYQKAKAVYDS 697
            G+ DQTYD W   G SAYCGGLW+AA+     +A   G       F     + +  Y+ 
Sbjct: 660 GGYADQTYDGWVTTGPSAYCGGLWLAAVAVMVQMAVLCGAQDVQDKFSSILCRGREAYER 719

Query: 698 -LWNGSYFNYDNSDGSSSTSIQADQLAGQWYARACGL----LPIADEAKVKKALTKIYDF 752
            LWNG Y+NYD+S    S S+ +DQ AGQW+ RACGL      +     V +AL  I++ 
Sbjct: 720 LLWNGRYYNYDSSSQPQSRSVMSDQCAGQWFLRACGLGEGDTEVFPTLHVVRALKTIFEL 779

Query: 753 NVLKVKGGMCGAMNGMQPDGRIDMSGLQAREIWPGVTYGLAASMIQEEMVDMAFQTAAGV 812
           NV    GG  GA+NGMQP G  D S +Q+ E+W GV YGLAA+MIQE +    F+TA G 
Sbjct: 780 NVQAFAGGAMGAVNGMQPHGVPDRSSVQSDEVWVGVVYGLAATMIQEGLTWEGFRTAEGC 839

Query: 813 YEVAWSEDGLGYSFQTPESWNNNDEYRSLCYMRPLTIW 850
           Y   W    LG +FQTPE++     +RSL YMRPL+IW
Sbjct: 840 YRTVWER--LGLAFQTPEAYCQQRVFRSLAYMRPLSIW 875


>gi|189217893|ref|NP_001013109.2| non-lysosomal glucosylceramidase [Rattus norvegicus]
 gi|149045757|gb|EDL98757.1| glucosidase beta 2 [Rattus norvegicus]
          Length = 904

 Score =  560 bits (1444), Expect = e-157,   Method: Compositional matrix adjust.
 Identities = 334/818 (40%), Positives = 457/818 (55%), Gaps = 64/818 (7%)

Query: 52  LAHIGFRLYRYSK----QEEAKGKVPVFNVFRKHHITSDQGIPLGGIGAGSIGRSYRGEF 107
           + H G  L RY K    + + + K P  ++F    +    G PLGGIG G+I R +RG+F
Sbjct: 103 VKHFGMGL-RYLKWWYRKTQVEKKTPFIDMFNSVPLRQIYGCPLGGIGGGTITRGWRGQF 161

Query: 108 QRFKLFHGICDDAPVLANQFSVFVSRPNGEKFSSVLCPRSPGVPKKNTDSGIESWDWNLK 167
            R++L  G+     V+A+QF V + R     +  VL    P V        + SW+W L 
Sbjct: 162 CRWQLNPGMYQHQTVIADQFIVCLRRDGRTVYQQVLSLELPSV--------LRSWNWGLC 213

Query: 168 GENCTYHALFPRAWTVYDGEPDPELRIVCRQISPFIPHNYKESSFPASVFTFTLSNSGQT 227
           G    YHAL+PRAWTVY   P   + + CRQI+P +PH+Y++SS P  VF + + N G  
Sbjct: 214 GYFAFYHALYPRAWTVYQ-LPGQNVTLTCRQITPILPHDYQDSSLPVGVFVWDVENEGDE 272

Query: 228 SADVTLLFTWANSVAGDSGLSGHHFNSK-TMTKDG--VHGLTLHHRTANGRPPVTFAVAA 284
           + DV+++F+  N + G+   +G  +N    + +DG  V GL LHH T     P T AVAA
Sbjct: 273 TLDVSIMFSMRNGLGGEDDAAGGLWNEPFRLEQDGTTVQGLLLHHPTPPN--PYTMAVAA 330

Query: 285 EETADVHVSECPCFLLSGNSKGITAKDMWNEIKKHGSFDHLDNDKTSPSEPGSSI-GAAI 343
             TAD  V+    F         T + +W ++ + G  D     +++P++ G  + GA  
Sbjct: 331 RHTADTTVTYTTAF-----DPDSTGQQVWQDLLQDGQLDS-PAGQSTPTQRGEGVAGAVC 384

Query: 344 AASLTIPSGSTRSVTFSLAWDCPEVKFFEK--VYHRRYTKFYGTLGDSAARIAHDAILEH 401
           A+S  +P G    + FSLAWD P + F  K  V++RRYT+F+G+ GD A  ++H A+ ++
Sbjct: 385 ASSKLLPRGRC-CLEFSLAWDMPRIMFGAKGQVHYRRYTRFFGSDGDVAPALSHYALCQY 443

Query: 402 AKWECEIEAWQRPILEDKRFPEWYPITLFNELYYLNAGGTIWTDGSPPMQSLATIRERKF 461
           A WE  I AWQ P+L+D+  P WY   LFNELY+L  GGT+W                  
Sbjct: 444 AGWENSISAWQNPVLDDRSLPAWYKSALFNELYFLADGGTVW------------------ 485

Query: 462 SLDTPSKIHTPASSDTALGTRL--LENGEENIGQFLYLEGAEYVMYNTYDVHFYSSFALV 519
            L+ P       S    LG  +  L    ++ G+F YLEG EY MYNTYDVHFY+SFALV
Sbjct: 486 -LEVPED-----SLPEELGGSMYQLRPILQDYGRFGYLEGQEYRMYNTYDVHFYASFALV 539

Query: 520 MLFPKLELSIQRDFAAAVMMHDPGTMKIMSDGKWVARKCLGAVPHDIGL--DDPWFEINS 577
           ML+PKLELS+Q D A A    D    + +  G     K    +PHDIG   D+PW  +N+
Sbjct: 540 MLWPKLELSLQYDMALATFKEDLTRRRYLMSGVVAPVKRRNVIPHDIGDPDDEPWLRVNA 599

Query: 578 YNLFNSSRWKDLNSKFVLQVYRDFVATGDKNFARAVWPSVYIAMAYMEQFDKDGDGMIEN 637
           Y + +++ WKDLN KFVLQVYRD+  TGD+ F + +WP     M    +FDKD DG+IEN
Sbjct: 600 YLIHDTADWKDLNLKFVLQVYRDYYLTGDQGFLKDMWPVCLAVMESEMKFDKDQDGLIEN 659

Query: 638 EGFPDQTYDAWSANGVSAYCGGLWVAALQAASALANDVGDHASASYFWVRYQKAKAVYDS 697
            G+ DQTYD W   G SAYCGGLW+AA+     +A   G       F     + +  Y+ 
Sbjct: 660 GGYADQTYDGWVTTGPSAYCGGLWLAAVAVMVQMAVLCGAQDVQDKFSSILCRGREAYER 719

Query: 698 -LWNGSYFNYDNSDGSSSTSIQADQLAGQWYARACGL----LPIADEAKVKKALTKIYDF 752
            LWNG Y+NYD+S    S S+ +DQ AGQW+ RACGL      +     V +AL  I++ 
Sbjct: 720 LLWNGRYYNYDSSSQPQSRSVMSDQCAGQWFLRACGLGEGDTEVFPTLHVVRALKTIFEL 779

Query: 753 NVLKVKGGMCGAMNGMQPDGRIDMSGLQAREIWPGVTYGLAASMIQEEMVDMAFQTAAGV 812
           NV    GG  GA+NGMQP G  D S +Q+ E+W GV YGLAA+MIQE +    F+TA G 
Sbjct: 780 NVQAFAGGAMGAVNGMQPHGVPDRSSVQSDEVWVGVVYGLAATMIQEGLTWEGFRTAEGC 839

Query: 813 YEVAWSEDGLGYSFQTPESWNNNDEYRSLCYMRPLTIW 850
           Y   W    LG +FQTPE++     +RSL YMRPL+IW
Sbjct: 840 YRTVWER--LGLAFQTPEAYCQQRVFRSLAYMRPLSIW 875


>gi|444729906|gb|ELW70309.1| Non-lysosomal glucosylceramidase [Tupaia chinensis]
          Length = 1001

 Score =  560 bits (1442), Expect = e-156,   Method: Compositional matrix adjust.
 Identities = 342/876 (39%), Positives = 473/876 (53%), Gaps = 93/876 (10%)

Query: 50  RHLAHIGFR-LYRYSKQEEAKGKVPVFNVFRKHHITSDQGIPLGGIGAGSIGRSYRGEFQ 108
           +HL  +G R L  + ++ + + K P  ++     +    G PLGGIG G+I R +RG+F 
Sbjct: 105 KHLG-MGLRYLQWWYRKTQVEKKTPFIDMINCVPLRQIYGCPLGGIGGGTITRGWRGQFC 163

Query: 109 RFKLFHGICDDAPVLANQFSVFVSRPNGEKFSSVLCPRSPGVPKKNTDSGIESWDWNLKG 168
           R++L  G+     V+A+QF+V + R     +  VL    P V        + SW+W L G
Sbjct: 164 RWQLNPGMYQHRTVIADQFTVCLRREGQTVYQQVLSLDHPSV--------LRSWNWGLCG 215

Query: 169 ENCTYHALFPRAWTVYDGEPDPELRIVCRQISPFIPHNYKESSFPASVFTFTLSNSGQTS 228
               YHAL+PRAWTVY   P   + + CRQI+P +PH+Y++SS P  VF + + N G  +
Sbjct: 216 YFAFYHALYPRAWTVYQ-LPGQNVTLTCRQITPILPHDYQDSSLPVGVFVWDVENEGDEA 274

Query: 229 ADVTLLFTWANSVAGDSGLSGHHFNSK-TMTKDG--VHGLTLHHRTANGRPPVTFAVAAE 285
            DV+++F+  N + G+    G  +N    + +DG  V GL LHH T     P T AVAA 
Sbjct: 275 LDVSIMFSMRNGLGGEDDAPGGLWNEPFCLERDGETVQGLLLHHPTLPN--PYTMAVAAR 332

Query: 286 ETADVHVSECPCFLLSGNSKGITAKDMWNEIKKHGSFDHLDNDKTSPSEPGSSIGAAIAA 345
            TA+  V+    F   G     T + +W ++ + G  D     +++PS+ G  I  A+  
Sbjct: 333 LTANTIVTHITAFDPDG-----TGQQVWQDLLQDGQLDS-PAGRSTPSQKGVGIAGAVCI 386

Query: 346 SLTIPSGSTRSVTFSLAWDCPEVKFFEK--VYHRRYTKFYGTLGDSAARIAHDAILEHAK 403
           S  +P      + FSLAWD P++ F  K  V++RRYT+F+   GD+A  ++H A+  +A 
Sbjct: 387 SGKLPPRGQCCLEFSLAWDMPKIVFGAKGQVHYRRYTRFFSPDGDAAPALSHYALCRYAD 446

Query: 404 WECEIEAWQRPILEDKRFPEWYPITLFNELYYLNAGGTIWTD----------GSPPMQSL 453
           WE  I AWQ PIL+D+  P WY   LFNELY+L  GGT+W +          G    Q  
Sbjct: 447 WEERISAWQSPILDDRSLPAWYKSALFNELYFLADGGTVWLEVPEDSLPEELGGGMHQLR 506

Query: 454 ATIRER-KFS----LDTPSKI----HTPASSDTAL----------GTRLLENGEENI--- 491
             +R+  +F     +  P  +      PA   +AL          GT  LE  E+++   
Sbjct: 507 PILRDYGRFGYLEGMLNPCPLCLRRSLPAWYKSALFNELYFLADGGTVWLEVPEDSLPEE 566

Query: 492 ---------------GQFLYLEGA-------------EYVMYNTYDVHFYSSFALVMLFP 523
                          G+F YLEG              EY MYNTYDVHFY+SFALVML+P
Sbjct: 567 LGGGMHQLRPILRDYGRFGYLEGMAAGGLQYGWGWGQEYRMYNTYDVHFYASFALVMLWP 626

Query: 524 KLELSIQRDFAAAVMMHDPGTMKIMSDGKWVARKCLGAVPHDIGL--DDPWFEINSYNLF 581
           KLELS+Q D A A +  D    + +  G     K    +PHDIG   D+PW  +N+Y + 
Sbjct: 627 KLELSLQYDMALATLREDLTRRRYLMSGVIAPVKRRNVIPHDIGDPDDEPWLRVNAYLIH 686

Query: 582 NSSRWKDLNSKFVLQVYRDFVATGDKNFARAVWPSVYIAMAYMEQFDKDGDGMIENEGFP 641
           +++ WKDLN KFVLQVYRD+  TGD+ F + +WP     M    +FDKD DG+IEN G+ 
Sbjct: 687 DTADWKDLNLKFVLQVYRDYYLTGDQGFLKDMWPVCLAVMESEMKFDKDHDGLIENGGYA 746

Query: 642 DQTYDAWSANGVSAYCGGLWVAALQAASALANDVGDHASASYFWVRYQKAKAVYDS-LWN 700
           DQTYD W   G SAYCGGLW+AA+     +A   G       F     + +  Y+  LWN
Sbjct: 747 DQTYDGWVTTGPSAYCGGLWLAAVAVMVQMAALCGAQDVQEKFTSILTRGQEAYERLLWN 806

Query: 701 GSYFNYDNSDGSSSTSIQADQLAGQWYARACGL----LPIADEAKVKKALTKIYDFNVLK 756
           G Y+NYD+S    S SI +DQ AGQW+ +ACGL      +     V +AL  I++FNV  
Sbjct: 807 GRYYNYDSSSQPQSRSIMSDQCAGQWFLKACGLGEGDTEVFPTPHVVRALQTIFEFNVQA 866

Query: 757 VKGGMCGAMNGMQPDGRIDMSGLQAREIWPGVTYGLAASMIQEEMVDMAFQTAAGVYEVA 816
             GG  GA+NGMQP G  D S +Q+ E+W GV YGLAA+MIQE +    F+TA G Y   
Sbjct: 867 FAGGAMGAVNGMQPHGVPDRSSVQSDEVWVGVVYGLAATMIQEGLTWEGFRTAEGCYRTV 926

Query: 817 WSEDGLGYSFQTPESWNNNDEYRSLCYMRPLTIWAM 852
           W    LG +FQTPE++     +RSL YMRPL+IWAM
Sbjct: 927 WER--LGLAFQTPEAYCQQRVFRSLAYMRPLSIWAM 960


>gi|24308251|ref|NP_065995.1| non-lysosomal glucosylceramidase [Homo sapiens]
 gi|143018392|sp|Q9HCG7.2|GBA2_HUMAN RecName: Full=Non-lysosomal glucosylceramidase; Short=NLGase;
           AltName: Full=Beta-glucocerebrosidase 2;
           Short=Beta-glucosidase 2; AltName:
           Full=Glucosylceramidase 2
 gi|15030210|gb|AAH11363.1| Glucosidase, beta (bile acid) 2 [Homo sapiens]
 gi|16215453|emb|CAC83792.1| bile acid beta-glucosidase [Homo sapiens]
 gi|21739906|emb|CAD38976.1| hypothetical protein [Homo sapiens]
 gi|119578752|gb|EAW58348.1| glucosidase, beta (bile acid) 2, isoform CRA_d [Homo sapiens]
 gi|119578753|gb|EAW58349.1| glucosidase, beta (bile acid) 2, isoform CRA_d [Homo sapiens]
          Length = 927

 Score =  554 bits (1428), Expect = e-155,   Method: Compositional matrix adjust.
 Identities = 330/818 (40%), Positives = 453/818 (55%), Gaps = 64/818 (7%)

Query: 52  LAHIGFRLYRY----SKQEEAKGKVPVFNVFRKHHITSDQGIPLGGIGAGSIGRSYRGEF 107
           + HIG  L RY     ++   + K P  ++     +    G PLGGIG G+I R +RG+F
Sbjct: 112 IKHIGMGL-RYLQWWYRKTHVEKKTPFIDMINSVPLRQIYGCPLGGIGGGTITRGWRGQF 170

Query: 108 QRFKLFHGICDDAPVLANQFSVFVSRPNGEKFSSVLCPRSPGVPKKNTDSGIESWDWNLK 167
            R++L  G+     V+A+QF+V + R     +  VL    P V        + SW+W L 
Sbjct: 171 CRWQLNPGMYQHRTVIADQFTVCLRREGQTVYQQVLSLERPSV--------LRSWNWGLC 222

Query: 168 GENCTYHALFPRAWTVYDGEPDPELRIVCRQISPFIPHNYKESSFPASVFTFTLSNSGQT 227
           G    YHAL+PRAWTVY   P   + + CRQI+P +PH+Y++SS P  VF + + N G  
Sbjct: 223 GYFAFYHALYPRAWTVYQ-LPGQNVTLTCRQITPILPHDYQDSSLPVGVFVWDVENEGDE 281

Query: 228 SADVTLLFTWANSVAGDSGLSGHHFNSK-TMTKDG--VHGLTLHHRTANGRPPVTFAVAA 284
           + DV+++F+  N + G     G  +N    + + G  V GL LHH T     P T AVAA
Sbjct: 282 ALDVSIMFSMRNGLGGGDDAPGGLWNEPFCLERSGETVRGLLLHHPTLPN--PYTMAVAA 339

Query: 285 EETADVHVSECPCFLLSGNSKGITAKDMWNEIKKHGSFDHLDNDKTSPSEPGSSIGAAIA 344
             TA   V+    F         T + +W ++ + G  D     +++P++ G  I  A+ 
Sbjct: 340 RVTAATTVTHITAF-----DPDSTGQQVWQDLLQDGQLDS-PTGQSTPTQKGVGIAGAVC 393

Query: 345 ASLTI-PSGSTRSVTFSLAWDCPEVKFFEK--VYHRRYTKFYGTLGDSAARIAHDAILEH 401
            S  + P G  R + FSLAWD P + F  K  V++RRYT+F+G  GD+A  ++H A+  +
Sbjct: 394 VSSKLRPRGQCR-LEFSLAWDMPRIMFGAKGQVHYRRYTRFFGQDGDAAPALSHYALCRY 452

Query: 402 AKWECEIEAWQRPILEDKRFPEWYPITLFNELYYLNAGGTIWTDGSPPMQSLATIRERKF 461
           A+WE  I AWQ P+L+D+  P WY   LFNELY+L  GGT+W         L  + +   
Sbjct: 453 AEWEERISAWQSPVLDDRSLPAWYKSALFNELYFLADGGTVW---------LEVLED--- 500

Query: 462 SLDTPSKIHTPASSDTALGTRL--LENGEENIGQFLYLEGAEYVMYNTYDVHFYSSFALV 519
                       S    LG  +  L     + G+F YLEG EY MYNTYDVHFY+SFAL+
Sbjct: 501 ------------SLPEELGRNMCHLRPTLRDYGRFGYLEGQEYRMYNTYDVHFYASFALI 548

Query: 520 MLFPKLELSIQRDFAAAVMMHDPGTMKIMSDGKWVARKCLGAVPHDIGL--DDPWFEINS 577
           ML+PKLELS+Q D A A +  D    + +  G     K    +PHDIG   D+PW  +N+
Sbjct: 549 MLWPKLELSLQYDMALATLREDLTRRRYLMSGVMAPVKRRNVIPHDIGDPDDEPWLRVNA 608

Query: 578 YNLFNSSRWKDLNSKFVLQVYRDFVATGDKNFARAVWPSVYIAMAYMEQFDKDGDGMIEN 637
           Y + +++ WKDLN KFVLQVYRD+  TGD+NF + +WP     M    +FDKD DG+IEN
Sbjct: 609 YLIHDTADWKDLNLKFVLQVYRDYYLTGDQNFLKDMWPVCLAVMESEMKFDKDHDGLIEN 668

Query: 638 EGFPDQTYDAWSANGVSAYCGGLWVAALQAASALANDVGDHASASYFWVRYQKAKAVYDS 697
            G+ DQTYD W   G SAYCGGLW+AA+     +A   G       F     + +  Y+ 
Sbjct: 669 GGYADQTYDGWVTTGPSAYCGGLWLAAVAVMVQMAALCGAQDIQDKFSSILSRGQEAYER 728

Query: 698 -LWNGSYFNYDNSDGSSSTSIQADQLAGQWYARACGL----LPIADEAKVKKALTKIYDF 752
            LWNG Y+NYD+S    S S+ +DQ AGQW+ +ACGL      +     V +AL  I++ 
Sbjct: 729 LLWNGRYYNYDSSSRPQSRSVMSDQCAGQWFLKACGLGEGDTEVFPTQHVVRALQTIFEL 788

Query: 753 NVLKVKGGMCGAMNGMQPDGRIDMSGLQAREIWPGVTYGLAASMIQEEMVDMAFQTAAGV 812
           NV    GG  GA+NGMQP G  D S +Q+ E+W GV YGLAA+MIQE +    FQTA G 
Sbjct: 789 NVQAFAGGAMGAVNGMQPHGVPDKSSVQSDEVWVGVVYGLAATMIQEGLTWEGFQTAEGC 848

Query: 813 YEVAWSEDGLGYSFQTPESWNNNDEYRSLCYMRPLTIW 850
           Y   W    LG +FQTPE++     +RSL YMRPL+IW
Sbjct: 849 YRTVWER--LGLAFQTPEAYCQQRVFRSLAYMRPLSIW 884


>gi|14042883|dbj|BAB55430.1| unnamed protein product [Homo sapiens]
          Length = 927

 Score =  554 bits (1427), Expect = e-155,   Method: Compositional matrix adjust.
 Identities = 329/818 (40%), Positives = 453/818 (55%), Gaps = 64/818 (7%)

Query: 52  LAHIGFRLYRY----SKQEEAKGKVPVFNVFRKHHITSDQGIPLGGIGAGSIGRSYRGEF 107
           + HIG  L RY     ++   + K P  ++     +    G PLGGIG G+I R +RG+F
Sbjct: 112 IKHIGMGL-RYLQWWYRKTHVEKKTPFIDMINSVPLRQIYGCPLGGIGGGTITRGWRGQF 170

Query: 108 QRFKLFHGICDDAPVLANQFSVFVSRPNGEKFSSVLCPRSPGVPKKNTDSGIESWDWNLK 167
            R++L  G+     V+A+QF+V + R     +  VL    P V        + SW+W L+
Sbjct: 171 CRWQLNPGMYQHRTVIADQFTVCLRREGQTVYQQVLSLERPSV--------LRSWNWGLR 222

Query: 168 GENCTYHALFPRAWTVYDGEPDPELRIVCRQISPFIPHNYKESSFPASVFTFTLSNSGQT 227
           G    YHAL+PRAWTVY   P   + + CRQI+P +PH+Y++SS P  VF + + N G  
Sbjct: 223 GYFAFYHALYPRAWTVYQ-LPGQNVTLTCRQITPILPHDYQDSSLPVGVFVWDVENEGDE 281

Query: 228 SADVTLLFTWANSVAGDSGLSGHHFNSK-TMTKDG--VHGLTLHHRTANGRPPVTFAVAA 284
           + DV+++F+  N + G     G  +N    + + G  V GL LHH T     P T AVAA
Sbjct: 282 ALDVSIMFSMRNGLGGGDDAPGGLWNEPFCLERSGETVRGLLLHHPTLPN--PYTMAVAA 339

Query: 285 EETADVHVSECPCFLLSGNSKGITAKDMWNEIKKHGSFDHLDNDKTSPSEPGSSIGAAIA 344
             TA   V+    F         T + +W ++ + G  D     +++P++ G  I  A+ 
Sbjct: 340 RVTAATTVTHITAF-----DPDSTGQQVWQDLLQDGQLDS-PTGQSTPTQKGVGIAGAVC 393

Query: 345 ASLTI-PSGSTRSVTFSLAWDCPEVKFFEK--VYHRRYTKFYGTLGDSAARIAHDAILEH 401
            S  + P G  R + FSLAWD P + F  K  V++RRYT+F+G  GD+A  ++H A+  +
Sbjct: 394 VSSKLRPRGQCR-LEFSLAWDMPRIMFGAKGQVHYRRYTRFFGQDGDAAPALSHYALCRY 452

Query: 402 AKWECEIEAWQRPILEDKRFPEWYPITLFNELYYLNAGGTIWTDGSPPMQSLATIRERKF 461
           A+WE  I AWQ P+L+D+  P WY   LF ELY+L  GGT+W         L  + +   
Sbjct: 453 AEWEERISAWQSPVLDDRSLPAWYKSALFKELYFLADGGTVW---------LEVLED--- 500

Query: 462 SLDTPSKIHTPASSDTALGTRL--LENGEENIGQFLYLEGAEYVMYNTYDVHFYSSFALV 519
                       S    LG  +  L     + G+F YLEG EY MYNTYDVHFY+SFAL+
Sbjct: 501 ------------SLPEELGRNMCHLRPTLRDYGRFGYLEGQEYRMYNTYDVHFYASFALI 548

Query: 520 MLFPKLELSIQRDFAAAVMMHDPGTMKIMSDGKWVARKCLGAVPHDIGL--DDPWFEINS 577
           ML+PKLELS+Q D A A +  D    + +  G     K    +PHDIG   D+PW  +N+
Sbjct: 549 MLWPKLELSLQYDMALATLREDLTRRRYLMSGVMAPVKRRNVIPHDIGDPDDEPWLRVNA 608

Query: 578 YNLFNSSRWKDLNSKFVLQVYRDFVATGDKNFARAVWPSVYIAMAYMEQFDKDGDGMIEN 637
           Y + +++ WKDLN KFVLQVYRD+  TGD+NF + +WP     M    +FDKD DG+IEN
Sbjct: 609 YLIHDTADWKDLNLKFVLQVYRDYYLTGDQNFLKDMWPVCLAVMESEMKFDKDHDGLIEN 668

Query: 638 EGFPDQTYDAWSANGVSAYCGGLWVAALQAASALANDVGDHASASYFWVRYQKAKAVYDS 697
            G+ DQTYD W   G SAYCGGLW+AA+     +A   G       F     + +  Y+ 
Sbjct: 669 GGYADQTYDGWVTTGPSAYCGGLWLAAVAVMVQMAALCGAQDIQDKFSSILSRGQEAYER 728

Query: 698 -LWNGSYFNYDNSDGSSSTSIQADQLAGQWYARACGL----LPIADEAKVKKALTKIYDF 752
            LWNG Y+NYD+S    S S+ +DQ AGQW+ +ACGL      +     V +AL  I++ 
Sbjct: 729 LLWNGRYYNYDSSSRPQSRSVMSDQCAGQWFLKACGLGEGDTEVFPTQHVVRALQTIFEL 788

Query: 753 NVLKVKGGMCGAMNGMQPDGRIDMSGLQAREIWPGVTYGLAASMIQEEMVDMAFQTAAGV 812
           NV    GG  GA+NGMQP G  D S +Q+ E+W GV YGLAA+MIQE +    FQTA G 
Sbjct: 789 NVQAFAGGAMGAVNGMQPHGVPDKSSVQSDEVWVGVVYGLAATMIQEGLTWEGFQTAEGC 848

Query: 813 YEVAWSEDGLGYSFQTPESWNNNDEYRSLCYMRPLTIW 850
           Y   W    LG +FQTPE++     +RSL YMRPL+IW
Sbjct: 849 YRTVWER--LGLAFQTPEAYCQQRVFRSLAYMRPLSIW 884


>gi|397519481|ref|XP_003829887.1| PREDICTED: non-lysosomal glucosylceramidase isoform 1 [Pan
           paniscus]
          Length = 927

 Score =  553 bits (1424), Expect = e-154,   Method: Compositional matrix adjust.
 Identities = 328/818 (40%), Positives = 453/818 (55%), Gaps = 64/818 (7%)

Query: 52  LAHIGFRLYRY----SKQEEAKGKVPVFNVFRKHHITSDQGIPLGGIGAGSIGRSYRGEF 107
           + HIG  L RY     ++ + + K P  ++     +    G PLGGIG G+I R +RG+F
Sbjct: 112 IKHIGMGL-RYLQWWYRKTQVEKKTPFIDMINSVPLRQIYGCPLGGIGGGTITRGWRGQF 170

Query: 108 QRFKLFHGICDDAPVLANQFSVFVSRPNGEKFSSVLCPRSPGVPKKNTDSGIESWDWNLK 167
            R++L  G+     V+A+QF+V + R     +  VL    P V        + SW+W L 
Sbjct: 171 CRWQLNPGMYQHRTVIADQFTVCLRREGQTVYQQVLSLERPSV--------LRSWNWGLC 222

Query: 168 GENCTYHALFPRAWTVYDGEPDPELRIVCRQISPFIPHNYKESSFPASVFTFTLSNSGQT 227
           G    YHAL+PRAWTVY   P   + + CRQI+P +PH+Y++SS P  VF + + N G  
Sbjct: 223 GYFAFYHALYPRAWTVYQ-LPGQNVTLTCRQITPILPHDYQDSSLPVGVFVWDVENEGDE 281

Query: 228 SADVTLLFTWANSVAGDSGLSGHHFNSK-TMTKDG--VHGLTLHHRTANGRPPVTFAVAA 284
           + DV+++F+  N + G     G  +N    + + G  V GL LHH T     P T AVAA
Sbjct: 282 ALDVSIMFSMRNGLGGGDDAPGGLWNEPFCLERSGETVRGLLLHHPTLPN--PYTMAVAA 339

Query: 285 EETADVHVSECPCFLLSGNSKGITAKDMWNEIKKHGSFDHLDNDKTSPSEPGSSIGAAIA 344
             TA   V+    F         T + +W ++ + G  D     +++P++ G  I  A+ 
Sbjct: 340 RVTAATTVTHITAF-----DPDSTGQQVWQDLLQDGQLDS-PTGQSTPTQKGVGIAGAVC 393

Query: 345 ASLTI-PSGSTRSVTFSLAWDCPEVKFFEK--VYHRRYTKFYGTLGDSAARIAHDAILEH 401
            S  + P G  R + FSLAWD P + F  K  V++RRYT+F+G  GD+A  ++H A+  +
Sbjct: 394 VSSKLRPRGQCR-LEFSLAWDMPRIMFGAKGQVHYRRYTRFFGQDGDAAPALSHYALCRY 452

Query: 402 AKWECEIEAWQRPILEDKRFPEWYPITLFNELYYLNAGGTIWTDGSPPMQSLATIRERKF 461
            +WE  I AWQ P+L+D+  P WY   LFNELY+L  GGT+W         L  + +   
Sbjct: 453 TEWEERISAWQSPVLDDRSLPAWYKSALFNELYFLADGGTVW---------LEVLED--- 500

Query: 462 SLDTPSKIHTPASSDTALGTRL--LENGEENIGQFLYLEGAEYVMYNTYDVHFYSSFALV 519
                       S    LG  +  L     + G+F YLEG EY MYNTYDVHFY+SFAL+
Sbjct: 501 ------------SLPEELGRNMCHLHPTLRDYGRFGYLEGQEYRMYNTYDVHFYASFALI 548

Query: 520 MLFPKLELSIQRDFAAAVMMHDPGTMKIMSDGKWVARKCLGAVPHDIGL--DDPWFEINS 577
           ML+PKLELS+Q D A A +  D    + +  G     K    +PHDIG   D+PW  +N+
Sbjct: 549 MLWPKLELSLQYDMALATLREDLTRRRYLMSGVMAPVKRRNVIPHDIGDPDDEPWLRVNA 608

Query: 578 YNLFNSSRWKDLNSKFVLQVYRDFVATGDKNFARAVWPSVYIAMAYMEQFDKDGDGMIEN 637
           Y + +++ WKDLN KFVLQVYRD+  TGD+NF + +WP     M    +FDKD DG+IEN
Sbjct: 609 YLIHDTADWKDLNLKFVLQVYRDYYLTGDQNFLKDMWPVCLAVMESEMKFDKDHDGLIEN 668

Query: 638 EGFPDQTYDAWSANGVSAYCGGLWVAALQAASALANDVGDHASASYFWVRYQKAKAVYDS 697
            G+ DQTYD W   G SAYCGGLW+AA+     +A   G       F     + +  Y+ 
Sbjct: 669 GGYADQTYDGWVTTGPSAYCGGLWLAAVAVMVQMAALCGAQDIQDKFSSILSRGQEAYER 728

Query: 698 -LWNGSYFNYDNSDGSSSTSIQADQLAGQWYARACGL----LPIADEAKVKKALTKIYDF 752
            LWNG Y+NYD+S    S S+ +DQ AGQW+ +ACGL      +     V +AL  +++ 
Sbjct: 729 LLWNGRYYNYDSSSRPQSRSVMSDQCAGQWFLKACGLGEGDTEVFPTQHVVRALQTVFEL 788

Query: 753 NVLKVKGGMCGAMNGMQPDGRIDMSGLQAREIWPGVTYGLAASMIQEEMVDMAFQTAAGV 812
           NV    GG  GA+NGMQP G  D S +Q+ E+W GV YGLAA+MIQE +    FQTA G 
Sbjct: 789 NVQAFAGGAMGAVNGMQPHGVPDKSSVQSDEVWVGVVYGLAATMIQEGLTWEGFQTAEGC 848

Query: 813 YEVAWSEDGLGYSFQTPESWNNNDEYRSLCYMRPLTIW 850
           Y   W    LG +FQTPE++     +RSL YMRPL+IW
Sbjct: 849 YRTVWER--LGLAFQTPEAYCQQRVFRSLAYMRPLSIW 884


>gi|55633173|ref|XP_520565.1| PREDICTED: non-lysosomal glucosylceramidase isoform 6 [Pan
           troglodytes]
 gi|410227830|gb|JAA11134.1| talin 1 [Pan troglodytes]
          Length = 927

 Score =  553 bits (1424), Expect = e-154,   Method: Compositional matrix adjust.
 Identities = 329/818 (40%), Positives = 452/818 (55%), Gaps = 64/818 (7%)

Query: 52  LAHIGFRLYRY----SKQEEAKGKVPVFNVFRKHHITSDQGIPLGGIGAGSIGRSYRGEF 107
           + HIG  L RY     ++ + + K P  ++     +    G PLGGIG G+I R +RG+F
Sbjct: 112 IKHIGMGL-RYLQWWYRKTQVEKKTPFIDMINSVPLRQIYGCPLGGIGGGTITRGWRGQF 170

Query: 108 QRFKLFHGICDDAPVLANQFSVFVSRPNGEKFSSVLCPRSPGVPKKNTDSGIESWDWNLK 167
            R++L  G+     V+A+QF+V + R     +  VL    P V        + SW+W L 
Sbjct: 171 CRWQLNPGMYQHRTVIADQFTVCLRREGQTVYQQVLSLERPSV--------LRSWNWGLC 222

Query: 168 GENCTYHALFPRAWTVYDGEPDPELRIVCRQISPFIPHNYKESSFPASVFTFTLSNSGQT 227
           G    YHAL+PRAWTVY   P   + + CRQI+P +PH+Y++SS P  VF + + N G  
Sbjct: 223 GYFAFYHALYPRAWTVYQ-LPGQNVTLTCRQITPILPHDYQDSSLPVGVFVWDVENEGDE 281

Query: 228 SADVTLLFTWANSVAGDSGLSGHHFNSK-TMTKDG--VHGLTLHHRTANGRPPVTFAVAA 284
           + DV+++F+  N + G     G  +N    + + G  V GL LHH T     P T AVAA
Sbjct: 282 ALDVSIMFSMRNGLGGGDDAPGGLWNEPFCLERSGETVRGLLLHHPTLPN--PYTMAVAA 339

Query: 285 EETADVHVSECPCFLLSGNSKGITAKDMWNEIKKHGSFDHLDNDKTSPSEPGSSIGAAIA 344
             TA   V+    F         T + +W ++ + G  D     +++P+  G  I  A+ 
Sbjct: 340 RVTAATTVTHITAF-----DPDSTGQQVWQDLLQDGQLDS-PTGQSTPTHKGVGIAGAVC 393

Query: 345 ASLTI-PSGSTRSVTFSLAWDCPEVKFFEK--VYHRRYTKFYGTLGDSAARIAHDAILEH 401
            S  + P G  R + FSLAWD P + F  K  V++RRYT+F+G  GD+A  ++H A+  +
Sbjct: 394 VSSKLRPRGQCR-LEFSLAWDMPRIMFGAKGQVHYRRYTRFFGQDGDAAPALSHYALCRY 452

Query: 402 AKWECEIEAWQRPILEDKRFPEWYPITLFNELYYLNAGGTIWTDGSPPMQSLATIRERKF 461
            +WE  I AWQ P+L+D+  P WY   LFNELY+L  GGT+W         L  + +   
Sbjct: 453 TEWEERISAWQSPVLDDRSLPAWYKSALFNELYFLADGGTVW---------LEVLED--- 500

Query: 462 SLDTPSKIHTPASSDTALGTRL--LENGEENIGQFLYLEGAEYVMYNTYDVHFYSSFALV 519
                       S    LG  +  L     + G+F YLEG EY MYNTYDVHFY+SFAL+
Sbjct: 501 ------------SLPEELGRNMCHLRPTLRDYGRFGYLEGQEYRMYNTYDVHFYASFALI 548

Query: 520 MLFPKLELSIQRDFAAAVMMHDPGTMKIMSDGKWVARKCLGAVPHDIGL--DDPWFEINS 577
           ML+PKLELS+Q D A A +  D    + +  G     K    +PHDIG   D+PW  +N+
Sbjct: 549 MLWPKLELSLQYDMALATLREDLTRRRYLMSGVMAPVKRRNVIPHDIGDPDDEPWLRVNA 608

Query: 578 YNLFNSSRWKDLNSKFVLQVYRDFVATGDKNFARAVWPSVYIAMAYMEQFDKDGDGMIEN 637
           Y + +++ WKDLN KFVLQVYRD+  TGD+NF + +WP     M    +FDKD DG+IEN
Sbjct: 609 YLIHDTADWKDLNLKFVLQVYRDYYLTGDQNFLKDMWPVCLAVMESEMKFDKDHDGLIEN 668

Query: 638 EGFPDQTYDAWSANGVSAYCGGLWVAALQAASALANDVGDHASASYFWVRYQKAKAVYDS 697
            G+ DQTYD W   G SAYCGGLW+AA+     +A   G       F     + +  Y+ 
Sbjct: 669 GGYADQTYDGWVTTGPSAYCGGLWLAAVAVMVQMAALCGAQDIQDKFSSILSRGQEAYER 728

Query: 698 -LWNGSYFNYDNSDGSSSTSIQADQLAGQWYARACGL----LPIADEAKVKKALTKIYDF 752
            LWNG Y+NYD+S    S S+ +DQ AGQW+ +ACGL      +     V +AL  I++ 
Sbjct: 729 LLWNGRYYNYDSSSRPQSRSVMSDQCAGQWFLKACGLGEGDTEVFPTQHVVRALQTIFEL 788

Query: 753 NVLKVKGGMCGAMNGMQPDGRIDMSGLQAREIWPGVTYGLAASMIQEEMVDMAFQTAAGV 812
           NV    GG  GA+NGMQP G  D S +Q+ E+W GV YGLAA+MIQE +    FQTA G 
Sbjct: 789 NVQAFAGGAMGAVNGMQPHGVPDKSSVQSDEVWVGVVYGLAATMIQEGLTWEGFQTAEGC 848

Query: 813 YEVAWSEDGLGYSFQTPESWNNNDEYRSLCYMRPLTIW 850
           Y   W    LG +FQTPE++     +RSL YMRPL+IW
Sbjct: 849 YRTVWER--LGLAFQTPEAYCQQRVFRSLAYMRPLSIW 884


>gi|432110769|gb|ELK34246.1| Non-lysosomal glucosylceramidase [Myotis davidii]
          Length = 868

 Score =  552 bits (1422), Expect = e-154,   Method: Compositional matrix adjust.
 Identities = 325/820 (39%), Positives = 444/820 (54%), Gaps = 79/820 (9%)

Query: 49  FRHLAHIGFRLYRYSKQE-EAKGKVPVFNVFRKHHITSDQGIPLGGIGAGSIGRSYRGEF 107
            +HL  +G R  ++  Q+ + + K P  ++     +    G PLGGIG G+I R +RG+F
Sbjct: 104 IKHLG-MGLRYLQWWYQKTQVEKKKPFIDLINSVPLRQIYGCPLGGIGGGTITRGWRGQF 162

Query: 108 QRFKLFHGICDDAPVLANQFSVFVSRPNGEKFSSVLCPRSPGVPKKNTDSGIESWDWNLK 167
            R++L  G+     V+A+QF+V + R     +  VL    P V        + SW+W L 
Sbjct: 163 CRWQLNPGMYQHRTVIADQFTVCLRREGQTVYQQVLSVERPSV--------LRSWNWGLC 214

Query: 168 GENCTYHALFPRAWTVYDGEPDPELRIVCRQISPFIPHNYKESSFPASVFTFTLSNSGQT 227
           G    YHAL+PRAWTVY   P   + + CRQI+P +PH+Y++SS P  VF + + N G  
Sbjct: 215 GFFAFYHALYPRAWTVYQ-LPGQNVTLTCRQITPVLPHDYQDSSLPVGVFVWDVENEGDE 273

Query: 228 SADVTLLFTWANSVAGDSGLSGHHFNSKTMTKDGVHGLTLHHRTANGRPPVTFAVAAEET 287
           + DV+++F+  N + G    SG  +N             L H T      VT   A +  
Sbjct: 274 ALDVSIMFSMRNGLGGGDDASGGLWNEP---------FCLEHTT------VTHITAFDPD 318

Query: 288 ADVHVSECPCFLLSGNSKGITAKDMWNEIKKHGSFDHLDNDKTSPSEPGSSIGAAIAASL 347
                               T + +W ++ + G  D     +++ S+ G  I  A+  + 
Sbjct: 319 G-------------------TGQQVWQDLLQDGQLDS-PAGRSTLSQKGVGIAGAVCVTS 358

Query: 348 TIPSGSTRSVTFSLAWDCPEVKFFEK--VYHRRYTKFYGTLGDSAARIAHDAILEHAKWE 405
            +P      + FSLAWD P + F  +  VY+RRYT+F+G+ GD+A  ++H A+  +A WE
Sbjct: 359 KLPPRGRCRMEFSLAWDMPRIMFGAQGQVYYRRYTRFFGSDGDAAPALSHYALCRYADWE 418

Query: 406 CEIEAWQRPILEDKRFPEWYPITLFNELYYLNAGGTIWTDGSPPMQSLATIRERKFSLDT 465
             I AWQ P+L+D+  P WY   LFNELY+L  GGT+W +   P  SL            
Sbjct: 419 ERISAWQSPVLDDRSLPAWYKSALFNELYFLADGGTVWLE--VPEDSL------------ 464

Query: 466 PSKIHTPASSDTALGTRLLENGEENIGQFLYLEGAEYVMYNTYDVHFYSSFALVMLFPKL 525
           P ++         L T          G+F YLEG EY MYNTYDVHFY+SFAL+ML+PKL
Sbjct: 465 PEELGGNMCQLRPLLT--------EYGRFGYLEGQEYRMYNTYDVHFYASFALIMLWPKL 516

Query: 526 ELSIQRDFAAAVMMHDPGTMKIMSDGKWVARKCLGAVPHDIG--LDDPWFEINSYNLFNS 583
           ELS+Q D A A +  D    + +  G     K    +PHDIG   D+PW  +N+Y + ++
Sbjct: 517 ELSLQYDMALATLGEDLTWRRYLMSGVMAPVKRRNVIPHDIGDPDDEPWLRVNAYVIHDT 576

Query: 584 SRWKDLNSKFVLQVYRDFVATGDKNFARAVWPSVYIAMAYMEQFDKDGDGMIENEGFPDQ 643
           + WKDLN KFVLQVYRD+  TGD+ F R +WP     M    +FDKD DG+IEN G+ DQ
Sbjct: 577 ADWKDLNLKFVLQVYRDYYLTGDQGFLRDMWPVCLAVMESEMKFDKDQDGLIENGGYADQ 636

Query: 644 TYDAWSANGVSAYCGGLWVAALQAASALANDVGDHASASYFWVRYQKAKAVYDS-LWNGS 702
           TYDAW   G SAYCGGLW+AA+     +A   G       F     + +  Y+  LWNG 
Sbjct: 637 TYDAWVTTGPSAYCGGLWLAAVAVMVQMAALCGTQDIQEKFSSILSRGQQAYERLLWNGR 696

Query: 703 YFNYDNSDGSSSTSIQADQLAGQWYARACGL----LPIADEAKVKKALTKIYDFNVLKVK 758
           Y+NYD S    S SI +DQ AGQW+ RA GL      +     V +AL  I++FNV    
Sbjct: 697 YYNYDCSSRPQSCSIMSDQCAGQWFLRASGLGEGETEVFPTQHVVRALQTIFEFNVQAFA 756

Query: 759 GGMCGAMNGMQPDGRIDMSGLQAREIWPGVTYGLAASMIQEEMVDMAFQTAAGVYEVAWS 818
           GG  GA+NGMQP G  D S +Q+ E+W GV YGLAA+MIQE +    FQTA G Y   W 
Sbjct: 757 GGTMGAVNGMQPQGVPDRSSVQSDEVWVGVVYGLAATMIQEGLTWEGFQTAEGCYRTVWE 816

Query: 819 EDGLGYSFQTPESWNNNDEYRSLCYMRPLTIWAMQWALTK 858
             GLG  FQTPE++     +RSL YMRPL+IWAMQ AL +
Sbjct: 817 RLGLG--FQTPEAYCQKRVFRSLAYMRPLSIWAMQLALQQ 854


>gi|440799031|gb|ELR20092.1| glucosylceramidase [Acanthamoeba castellanii str. Neff]
          Length = 860

 Score =  552 bits (1422), Expect = e-154,   Method: Compositional matrix adjust.
 Identities = 368/909 (40%), Positives = 513/909 (56%), Gaps = 122/909 (13%)

Query: 3   NGVKEEGENAASMPKV--DPGQPPQLT---WQRKLDSN-------VKVPTGFTLSFQEFR 50
           +G +EEGE    +P+   +  Q P  T   WQR+L          +  P   TL     R
Sbjct: 7   SGQQEEGERI-KVPESTRERAQMPTATDALWQRRLSEQPTRQLRPLTAPLLHTL-----R 60

Query: 51  HLAHIGFRLYRYSKQEEAKGKVPVFNVFRKHHITSDQGIPLGGIGAGSIGRSYRGEFQRF 110
           +L  +GFRL+R ++ EE  G+ PV +        S  G+P GG+G G +GRS+RG+F+  
Sbjct: 61  NLP-LGFRLWRQAR-EERGGRDPVLDASGGQ--ASSSGLPAGGMGGGCVGRSWRGDFEPG 116

Query: 111 KLFHGICDDAPVL-ANQFSVFVSRPNGEKFSSVLCPRSPGVPKKNTDSGIESWDWNLKGE 169
            +       APV+ ANQFSVF+  P+GE   + +   +PG+P  +    + +WDW+  GE
Sbjct: 117 LILPNGRGQAPVVPANQFSVFIE-PDGEDSEATVL--NPGIPPTDR---LGAWDWSFVGE 170

Query: 170 NCTYHALFPRAWTVYDGEPDPELRIVCRQISPFIPHNYKESSFPASVFTFTLSNSGQTSA 229
              Y ALFPRAWTVYD EP+P LR+  RQISP I +NYKESS PA VF +TL N  Q   
Sbjct: 171 ESYYSALFPRAWTVYD-EPEPRLRLTSRQISPVIGNNYKESSLPACVFVWTLENLDQEDE 229

Query: 230 -DVTLLFTWANSVAGDSGLSGHHFNSKTMTKDGVHGLTLHH----------RTANGRPPV 278
              +L+FTW N     S   G H N+   T+     + + H          +T     P+
Sbjct: 230 ITASLMFTWQNGTGAASDALGGHANTP-FTRPHPERVVMDHTQWRVDERGGKTHASSEPL 288

Query: 279 TFAVAAEET-ADVHVSECPCFLLSGNSKGITAKDMWNEIKKHGSFDHLDNDKTSPSEPGS 337
           +FA+AA    +D H++   C+  + NS G    ++W + K+ G+ +   +     S+ G 
Sbjct: 289 SFAIAASSPDSDAHITVRSCW--ATNSDG---AEVWKDFKRSGALEENWDALGKASQEGQ 343

Query: 338 SIGAAIAASLTIPSGSTRSVTFSLAWDCPEVKFFE-KVYHRRYTKFYGTLGDSAARIAHD 396
           ++GAAIA  +TIP G  R V F+LAWD P V+F   + Y RRYT+FY   GDSAA++A D
Sbjct: 344 TLGAAIAVRVTIPPGEHRQVEFALAWDHPVVEFGAGRRYTRRYTRFYSQHGDSAAQLACD 403

Query: 397 AILEHAKWECEIEAWQRPILEDKRFPEWYPITLFNELYYLNAGGTIWTDGSPPMQSLATI 456
           A++++ +WE  IE WQ PIL+DKR P                GGT+W       +     
Sbjct: 404 ALVQYPQWERLIEQWQAPILDDKRLP--------------GCGGTVWLQDELSAKEEKPR 449

Query: 457 RERKFSLDTPSKIHTPASSDTALGTRLLENGEENIGQFLYLEGAEYVMYNTYDVHFYSSF 516
           RE++                    T   E  + ++G++  LE  E+ +YN  D  FY+SF
Sbjct: 450 REKR-------------------ETDGQEEEQSDVGRWACLESQEHRLYNASDALFYASF 490

Query: 517 ALVMLFPKLELSIQRDFAAAVMMHDPGTMKIMSDGKWVARKCLGAVPHDIGL--DDPWFE 574
           AL+  +P+L+L++QR+ AAA +  D   + +++ G+    K  GAVP+D+G   DDPW  
Sbjct: 491 ALIANWPQLQLALQREVAAATLREDSRMVDVLATGRQAPNKVRGAVPYDMGTPADDPWVA 550

Query: 575 INSYNLFNSSRWK---------------------DLNSKFVLQVYRDFVATGDKNFARAV 613
           +N+YN+  +S WK                     DLNSKFVL VYRDFVAT DK F   V
Sbjct: 551 LNAYNIHETSSWKGKLGLLPASHLSPSTELTEAADLNSKFVLGVYRDFVATQDKRFLADV 610

Query: 614 WPSVYIAMAYMEQFDKDGDGMIENEGFPDQTYDAWSANGVSAYCGGLWVAALQAASAL-- 671
           WP+V  A+ +++QFD++ DGMIENEG PDQTY +W+        GGLWVAAL AA A+  
Sbjct: 611 WPAVDAALRHLQQFDRNKDGMIENEGLPDQTYHSWT--------GGLWVAALSAACAMAQ 662

Query: 672 ANDVGDHASASYFWVRYQKAKAVY-DSLWNGS-YFNYDNSDGSSSTSIQADQLAGQWYAR 729
           A  + D + A ++   +Q A  VY +SLWN S YFNYD S  +   SIQADQL GQWYAR
Sbjct: 663 ALQLADDSVAQHYRDLHQSAARVYNESLWNPSGYFNYDTSTSTHHDSIQADQLCGQWYAR 722

Query: 730 ACGLLPIADEAKVKKALTKIYDFNVLKVKGGMCGAMNGMQPDGRIDMSGLQAREIWPGVT 789
           ACGL  IA + ++  AL  ++  N  +   G  GA+NGM+PDGR+D + L + E WPG T
Sbjct: 723 ACGLPSIAPDKRIVTALKTVFANNFNE---GKRGAVNGMRPDGRVDYACLHSSEAWPGTT 779

Query: 790 YGLAASMIQEEMVDMAFQTAAGVYEVAWSEDGLGYSFQTPESWNNNDEYRSLCYMRPLTI 849
           + LAA M+ E +VD AF TAAGV    + +  LGY F+TPE+W+ +  YR+L +MRPL I
Sbjct: 780 FALAACMMGEGLVDQAFITAAGVCGSIYED--LGYWFRTPEAWDGSGNYRALGHMRPLAI 837

Query: 850 WAMQWALTK 858
           WAMQWA T+
Sbjct: 838 WAMQWAWTQ 846


>gi|221041110|dbj|BAH12232.1| unnamed protein product [Homo sapiens]
          Length = 933

 Score =  548 bits (1413), Expect = e-153,   Method: Compositional matrix adjust.
 Identities = 329/824 (39%), Positives = 452/824 (54%), Gaps = 70/824 (8%)

Query: 52  LAHIGFRLYRY----SKQEEAKGKVPVFNVFRKHHITSDQGIPLGGIGAGSIGRSYRGEF 107
           + HIG  L RY     ++   + K P  ++     +    G PLGGIG G+I R +RG+F
Sbjct: 112 IKHIGMGL-RYLQWWYRKTHVEKKTPFIDMINSVPLRQIYGCPLGGIGGGTITRGWRGQF 170

Query: 108 QRFKLFHG------ICDDAPVLANQFSVFVSRPNGEKFSSVLCPRSPGVPKKNTDSGIES 161
            R++L  G      +  D P+   +F+V + R     +  VL    P V        + S
Sbjct: 171 CRWQLNPGMYQHRTVIADQPICPLKFTVCLRRERQTVYQQVLSLERPSV--------LRS 222

Query: 162 WDWNLKGENCTYHALFPRAWTVYDGEPDPELRIVCRQISPFIPHNYKESSFPASVFTFTL 221
           W+W L G    YHAL+PRAWTVY   P   + + CRQI+P +PH+Y++SS P  VF + +
Sbjct: 223 WNWGLCGYFAFYHALYPRAWTVYQ-LPGQNVTLTCRQITPILPHDYQDSSLPVGVFVWDV 281

Query: 222 SNSGQTSADVTLLFTWANSVAGDSGLSGHHFNSK-TMTKDG--VHGLTLHHRTANGRPPV 278
            N G  + DV+++F+  N + G     G  +N    + + G  V GL LHH T     P 
Sbjct: 282 ENEGDEALDVSIMFSMRNGLGGGDDAPGGLWNEPFCLERSGETVRGLLLHHPTLPN--PY 339

Query: 279 TFAVAAEETADVHVSECPCFLLSGNSKGITAKDMWNEIKKHGSFDHLDNDKTSPSEPGSS 338
           T AVAA  TA   V+    F         T + +W ++ + G  D     +++P++ G  
Sbjct: 340 TMAVAARVTAATTVTHITAF-----DPDSTGQQVWQDLLQDGQLDS-PTGQSTPTQKGVG 393

Query: 339 IGAAIAASLTI-PSGSTRSVTFSLAWDCPEVKFFEK--VYHRRYTKFYGTLGDSAARIAH 395
           I  A+  S  + P G  R + FSLAWD P + F  K  V++RRYT+F+G  GD+A  ++H
Sbjct: 394 IAGAVCVSSKLRPRGQCR-LEFSLAWDMPRIMFGAKGQVHYRRYTRFFGQDGDAAPALSH 452

Query: 396 DAILEHAKWECEIEAWQRPILEDKRFPEWYPITLFNELYYLNAGGTIWTDGSPPMQSLAT 455
            A+  +A+WE  I AWQ P+L+D+  P WY   LFNELY+L  GGT+W         L  
Sbjct: 453 YALCRYAEWEERISAWQSPVLDDRSLPAWYKSALFNELYFLADGGTVW---------LEV 503

Query: 456 IRERKFSLDTPSKIHTPASSDTALGTRL--LENGEENIGQFLYLEGAEYVMYNTYDVHFY 513
           + +               S    LG  +  L     + G+F YLEG EY MYNTYDVHFY
Sbjct: 504 LED---------------SLPEELGRNMCHLRPTLRDYGRFGYLEGQEYRMYNTYDVHFY 548

Query: 514 SSFALVMLFPKLELSIQRDFAAAVMMHDPGTMKIMSDGKWVARKCLGAVPHDIGL--DDP 571
           +SFAL+ML+PKLELS+Q D A A +  D    + +  G     K    +PHDIG   D+P
Sbjct: 549 ASFALIMLWPKLELSLQYDMALATLREDLTRRRYLMSGVMAPVKRRNVIPHDIGDPDDEP 608

Query: 572 WFEINSYNLFNSSRWKDLNSKFVLQVYRDFVATGDKNFARAVWPSVYIAMAYMEQFDKDG 631
           W  +N+Y + +++ WKDLN KFVLQVYRD+  TGD+NF + +WP     M    +FDKD 
Sbjct: 609 WLRVNAYLIHDTADWKDLNLKFVLQVYRDYYLTGDQNFLKDMWPVCLAVMESEMKFDKDH 668

Query: 632 DGMIENEGFPDQTYDAWSANGVSAYCGGLWVAALQAASALANDVGDHASASYFWVRYQKA 691
           DG+IEN G+ DQTYD W   G SAYCGGLW+AA+     +A   G       F     + 
Sbjct: 669 DGLIENGGYADQTYDGWVTTGPSAYCGGLWLAAVAVMVQMAALCGAQDIQDKFSSILSRG 728

Query: 692 KAVYDS-LWNGSYFNYDNSDGSSSTSIQADQLAGQWYARACGL----LPIADEAKVKKAL 746
           +  Y+  LWNG Y+NYD+S    S S+ +DQ AGQW+ +ACGL      +     V +AL
Sbjct: 729 QEAYERLLWNGRYYNYDSSSRPQSRSVMSDQCAGQWFLKACGLGEGDTEVFPTQHVVRAL 788

Query: 747 TKIYDFNVLKVKGGMCGAMNGMQPDGRIDMSGLQAREIWPGVTYGLAASMIQEEMVDMAF 806
             I++ NV    GG  GA+NGMQP G  D S +Q+ E+W GV YGLAA+MIQE +    F
Sbjct: 789 QTIFELNVQAFAGGAMGAVNGMQPHGVPDKSSVQSDEVWVGVVYGLAATMIQEGLTWEGF 848

Query: 807 QTAAGVYEVAWSEDGLGYSFQTPESWNNNDEYRSLCYMRPLTIW 850
           QTA G Y   W    LG +FQTPE++     +RSL YMRPL+IW
Sbjct: 849 QTAEGCYRTVWER--LGLAFQTPEAYCQQRVFRSLAYMRPLSIW 890


>gi|332831849|ref|XP_001167923.2| PREDICTED: non-lysosomal glucosylceramidase isoform 4 [Pan
           troglodytes]
          Length = 933

 Score =  547 bits (1409), Expect = e-152,   Method: Compositional matrix adjust.
 Identities = 328/824 (39%), Positives = 452/824 (54%), Gaps = 70/824 (8%)

Query: 52  LAHIGFRLYRY----SKQEEAKGKVPVFNVFRKHHITSDQGIPLGGIGAGSIGRSYRGEF 107
           + HIG  L RY     ++ + + K P  ++     +    G PLGGIG G+I R +RG+F
Sbjct: 112 IKHIGMGL-RYLQWWYRKTQVEKKTPFIDMINSVPLRQIYGCPLGGIGGGTITRGWRGQF 170

Query: 108 QRFKLFHG------ICDDAPVLANQFSVFVSRPNGEKFSSVLCPRSPGVPKKNTDSGIES 161
            R++L  G      +  D P+ + +F+V + R     +  VL    P V        + S
Sbjct: 171 CRWQLNPGMYQHRTVIADQPICSLKFTVCLRREGQTVYQQVLSLERPSV--------LRS 222

Query: 162 WDWNLKGENCTYHALFPRAWTVYDGEPDPELRIVCRQISPFIPHNYKESSFPASVFTFTL 221
           W+W L G    YHAL+PRAWTVY   P   + + CRQI+P +PH+Y++SS P  VF + +
Sbjct: 223 WNWGLCGYFAFYHALYPRAWTVYQ-LPGQNVTLTCRQITPILPHDYQDSSLPVGVFVWDV 281

Query: 222 SNSGQTSADVTLLFTWANSVAGDSGLSGHHFNSK-TMTKDG--VHGLTLHHRTANGRPPV 278
            N G  + DV+++F+  N + G     G  +N    + + G  V GL LHH T     P 
Sbjct: 282 ENEGDEALDVSIMFSMRNGLGGGDDAPGGLWNEPFCLERSGETVRGLLLHHPTLPN--PY 339

Query: 279 TFAVAAEETADVHVSECPCFLLSGNSKGITAKDMWNEIKKHGSFDHLDNDKTSPSEPGSS 338
           T AVAA  TA   V+    F         T + +W ++ + G  D     +++P+  G  
Sbjct: 340 TMAVAARVTAATTVTHITAF-----DPDSTGQQVWQDLLQDGQLDS-PTGQSTPTHKGVG 393

Query: 339 IGAAIAASLTI-PSGSTRSVTFSLAWDCPEVKFFEK--VYHRRYTKFYGTLGDSAARIAH 395
           I  A+  S  + P G  R + FSLAWD P + F  K  V++RRYT+F+G  GD+A  ++H
Sbjct: 394 IAGAVCVSSKLRPRGQCR-LEFSLAWDMPRIMFGAKGQVHYRRYTRFFGQDGDAAPALSH 452

Query: 396 DAILEHAKWECEIEAWQRPILEDKRFPEWYPITLFNELYYLNAGGTIWTDGSPPMQSLAT 455
            A+  + +WE  I AWQ P+L+D+  P WY   LFNELY+L  GGT+W         L  
Sbjct: 453 YALCRYTEWEERISAWQSPVLDDRSLPAWYKSALFNELYFLADGGTVW---------LEV 503

Query: 456 IRERKFSLDTPSKIHTPASSDTALGTRL--LENGEENIGQFLYLEGAEYVMYNTYDVHFY 513
           + +               S    LG  +  L     + G+F YLEG EY MYNTYDVHFY
Sbjct: 504 LED---------------SLPEELGRNMCHLRPTLRDYGRFGYLEGQEYRMYNTYDVHFY 548

Query: 514 SSFALVMLFPKLELSIQRDFAAAVMMHDPGTMKIMSDGKWVARKCLGAVPHDIGL--DDP 571
           +SFAL+ML+PKLELS+Q D A A +  D    + +  G     K    +PHDIG   D+P
Sbjct: 549 ASFALIMLWPKLELSLQYDMALATLREDLTRRRYLMSGVMAPVKRRNVIPHDIGDPDDEP 608

Query: 572 WFEINSYNLFNSSRWKDLNSKFVLQVYRDFVATGDKNFARAVWPSVYIAMAYMEQFDKDG 631
           W  +N+Y + +++ WKDLN KFVLQVYRD+  TGD+NF + +WP     M    +FDKD 
Sbjct: 609 WLRVNAYLIHDTADWKDLNLKFVLQVYRDYYLTGDQNFLKDMWPVCLAVMESEMKFDKDH 668

Query: 632 DGMIENEGFPDQTYDAWSANGVSAYCGGLWVAALQAASALANDVGDHASASYFWVRYQKA 691
           DG+IEN G+ DQTYD W   G SAYCGGLW+AA+     +A   G       F     + 
Sbjct: 669 DGLIENGGYADQTYDGWVTTGPSAYCGGLWLAAVAVMVQMAALCGAQDIQDKFSSILSRG 728

Query: 692 KAVYDS-LWNGSYFNYDNSDGSSSTSIQADQLAGQWYARACGL----LPIADEAKVKKAL 746
           +  Y+  LWNG Y+NYD+S    S S+ +DQ AGQW+ +ACGL      +     V +AL
Sbjct: 729 QEAYERLLWNGRYYNYDSSSRPQSRSVMSDQCAGQWFLKACGLGEGDTEVFPTQHVVRAL 788

Query: 747 TKIYDFNVLKVKGGMCGAMNGMQPDGRIDMSGLQAREIWPGVTYGLAASMIQEEMVDMAF 806
             I++ NV    GG  GA+NGMQP G  D S +Q+ E+W GV YGLAA+MIQE +    F
Sbjct: 789 QTIFELNVQAFAGGAMGAVNGMQPHGVPDKSSVQSDEVWVGVVYGLAATMIQEGLTWEGF 848

Query: 807 QTAAGVYEVAWSEDGLGYSFQTPESWNNNDEYRSLCYMRPLTIW 850
           QTA G Y   W    LG +FQTPE++     +RSL YMRPL+IW
Sbjct: 849 QTAEGCYRTVWER--LGLAFQTPEAYCQQRVFRSLAYMRPLSIW 890


>gi|414871105|tpg|DAA49662.1| TPA: hypothetical protein ZEAMMB73_422318 [Zea mays]
          Length = 589

 Score =  546 bits (1407), Expect = e-152,   Method: Compositional matrix adjust.
 Identities = 281/567 (49%), Positives = 378/567 (66%), Gaps = 52/567 (9%)

Query: 312 MWNEIKKHGSFD--HLDNDKTSPSEPGSSIGAAIAASLTIPSGSTRSVTFSLAWDCPEVK 369
           MWN + ++G FD  +     + PS PG  + AA++AS  +      +V F+LAW  P+VK
Sbjct: 1   MWNTMLQNGHFDRENFSAGPSMPSSPGQKLCAAVSASTWVEPHGRCTVVFALAWSSPKVK 60

Query: 370 FFEK-VYHRRYTKFYGTLGDSAARIAHDAILEHAKWECEIEAWQRPILEDKRFPEWYPIT 428
           F +   Y+RRYT+FYGT   SA  + HDA+ ++  WE EIE WQ PIL+D+R PEWY  T
Sbjct: 61  FQKGCTYNRRYTQFYGTSEKSAVNLVHDALTKYKLWEEEIEKWQNPILKDERLPEWYKFT 120

Query: 429 LFNELYYLNAGGTIWTDGSPPM---------------------QSLATIRERKFSLDTP- 466
           LFNELY+L AGGT+WTDG PP                       +  ++++R  +L    
Sbjct: 121 LFNELYFLVAGGTVWTDGQPPAIDEKKSPGFNHQKSSKRGTKDTNQGSVKDRHVNLAVEQ 180

Query: 467 ------------------SKIHTPASSDTALGTR--------LLENGEENIGQFLYLEGA 500
                             + +H     +   G +        + ++G E++G+FLYLEG 
Sbjct: 181 VPHGGYMANGDDHSVSKFAAVHGSEMQEQINGLKSEEPIPYLISKDGPEHVGKFLYLEGV 240

Query: 501 EYVMYNTYDVHFYSSFALVMLFPKLELSIQRDFAAAVMMHDPGTMKIMSDGKWVARKCLG 560
           EY+M+NTYDVHFY+SFAL+ LFPK+ELSIQRDFA AV+  D   +K ++DG    RK  G
Sbjct: 241 EYIMWNTYDVHFYASFALLDLFPKIELSIQRDFANAVLYEDRRKVKFLADGTSGIRKVKG 300

Query: 561 AVPHDIGLDDPWFEINSYNLFNSSRWKDLNSKFVLQVYRDFVATGDKNFARAVWPSVYIA 620
           AVPHD+G  DPW E+N+YN+ ++S+WKDLN KFVLQ+YRDF ATGD  F R VWP+V  A
Sbjct: 301 AVPHDLGTHDPWHEMNAYNIHDTSKWKDLNPKFVLQIYRDFAATGDMQFGRDVWPAVCAA 360

Query: 621 MAYMEQFDKDGDGMIENEGFPDQTYDAWSANGVSAYCGGLWVAALQAASALANDVGDHAS 680
           M YM+QFD+DGDG+IEN+GFPDQTYDAW+ +G+SAYCG LW+AALQAA+ +A+ +GD   
Sbjct: 361 MDYMDQFDRDGDGLIENDGFPDQTYDAWTVHGISAYCGCLWLAALQAAATMAHRLGDRHF 420

Query: 681 ASYFWVRYQKAKAVYDS-LWNGSYFNYDNSDGSSSTSIQADQLAGQWYARACGLLPIADE 739
           A  + +++ KAKAVY++ LWNGSYFNYD+   S+S SIQADQLAGQWYA + GL P+ DE
Sbjct: 421 AEKYKLKFIKAKAVYEAKLWNGSYFNYDSGTSSNSRSIQADQLAGQWYAASSGLPPLFDE 480

Query: 740 AKVKKALTKIYDFNVLKVKGGMCGAMNGMQPDGRIDMSGLQAREIWPGVTYGLAASMIQE 799
            K++ AL KI++FNV+KVKGG  GA+NGM P G++D + +Q+REIW GVTY +AA+M+  
Sbjct: 481 HKIRTALQKIFEFNVMKVKGGRMGAVNGMTPKGKVDETCMQSREIWTGVTYAVAANMLLH 540

Query: 800 EMVDMAFQTAAGVYEVAWSEDGLGYSF 826
            M    F TA G++   WSE+G GY+ 
Sbjct: 541 GMEHQGFTTAEGIFTAGWSEEGYGYAL 567


>gi|413920604|gb|AFW60536.1| hypothetical protein ZEAMMB73_659971 [Zea mays]
          Length = 459

 Score =  541 bits (1393), Expect = e-151,   Method: Compositional matrix adjust.
 Identities = 273/436 (62%), Positives = 334/436 (76%), Gaps = 8/436 (1%)

Query: 17  KVDPGQPPQLTWQRKLDSNVKVPTGFTLSFQEFRHLAHIGFRLYRYSKQEEAKGKVPVFN 76
           KV PGQ P LTW+ KL         F L+++E   LA IG RL R+  +E +KG++ V +
Sbjct: 28  KVYPGQLPVLTWEHKLTHVRHDLPPFRLTWREIMQLAGIGLRLSRHILEETSKGRIAVID 87

Query: 77  VFRKHHITSDQGIPLGGIGAGSIGRSYRGEFQRFKLFHGICDDAPVLANQFSVFVSRPNG 136
             +K    S QG+PLGG    SIGRSY+G+FQR++LF G C+D  VLANQFS F+SR +G
Sbjct: 88  PMKKRAARSGQGVPLGG----SIGRSYKGDFQRWQLFPGSCEDNAVLANQFSAFISRQDG 143

Query: 137 EKFSSVLCPRSPGVPKKNTDSGIESWDWNLKGENCTYHALFPRAWTVYDGEPDPELRIVC 196
           +K+S+VL P  P +PK +  SGI SWDWN+ G+  TYHAL+PRAWTVYDGEPDPEL IVC
Sbjct: 144 KKYSTVLHPGKPDLPKGSDISGIGSWDWNMSGQQSTYHALYPRAWTVYDGEPDPELNIVC 203

Query: 197 RQISPFIPHNYKESSFPASVFTFTLSNSGQTSADVTLLFTWANSVAGDSGLSGHHFNSKT 256
           RQISP IPHNY++SS+P +VFTFT++NSG T+ADVTLLFTWANSV G S L+G+H NS  
Sbjct: 204 RQISPIIPHNYQQSSYPVAVFTFTVTNSGHTAADVTLLFTWANSVGGKSELTGYHSNSSM 263

Query: 257 MTKDGVHGLTLHHRTANGRPPVTFAVAAEETADVHVSECPCFLLSGNSKG--ITAKDMWN 314
           + KDGVHG+ LHHRTA+G+P VTF +AA+E  D+ +SECP F++SG+S     TAKDMWN
Sbjct: 264 IEKDGVHGILLHHRTADGQPHVTFVIAAQEKEDILISECPYFVISGSSASDEFTAKDMWN 323

Query: 315 EIKKHGSFDHLDNDKTSP-SEPGSSIGAAIAASLTIPSGSTRSVTFSLAWDCPEVKFFE- 372
            +K+HGSFDHLD  KTS  S PGSSIGAAIAAS+ +   +T+ ++FSLAW CPEVKF   
Sbjct: 324 SVKEHGSFDHLDPIKTSMCSRPGSSIGAAIAASVKLAPKATQDISFSLAWACPEVKFSSG 383

Query: 373 KVYHRRYTKFYGTLGDSAARIAHDAILEHAKWECEIEAWQRPILEDKRFPEWYPITLFNE 432
           K YHRRYTKFYGT GD+AA +AHDAILEHA WE +IE WQ PIL+D+RFP WYP+TLFNE
Sbjct: 384 KTYHRRYTKFYGTDGDAAAALAHDAILEHASWERQIEEWQDPILQDERFPAWYPVTLFNE 443

Query: 433 LYYLNAGGTIWTDGSP 448
           LYYLNAGGTIWT   P
Sbjct: 444 LYYLNAGGTIWTGMHP 459


>gi|333898016|ref|YP_004471890.1| glucosylceramidase [Thermoanaerobacterium xylanolyticum LX-11]
 gi|333113281|gb|AEF18218.1| Glucosylceramidase [Thermoanaerobacterium xylanolyticum LX-11]
          Length = 806

 Score =  536 bits (1382), Expect = e-149,   Method: Compositional matrix adjust.
 Identities = 303/781 (38%), Positives = 439/781 (56%), Gaps = 65/781 (8%)

Query: 87  QGIPLGGIGAGSIGRSYRGEFQRFKLFHGICDDAPVLANQFSVFVS-RPNGEKFSSVLCP 145
           QG PLGG GAG+IGR+Y G F R+ L  G      V ANQFSVF     N +  + VL  
Sbjct: 71  QGAPLGGFGAGTIGRTYNGGFSRWHLEIGKNKYTTVYANQFSVFQKVEGNKDGVAQVLYA 130

Query: 146 RSPGVPKKNTDSGIESWDWNLKGENCTYHALFPRAWTVYDGEPDPELRIVCRQISPFIPH 205
             P       +  + SW W+   E+  Y+AL+P +W  Y  +  P +++  +Q SP IP+
Sbjct: 131 GEP------ENGYLSSWKWDYPKESGMYYALYPNSWYTYTNKDLP-VQLAVKQFSPIIPY 183

Query: 206 NYKESSFPASVFTFTLSNSGQTSADVTLLFTWANSVA---GDSGLSGHHFNS--KTMTKD 260
           NYKE+S+P +VF +T  N    + DV+++FTW N +        ++  +FN   K  +KD
Sbjct: 184 NYKETSYPVAVFKWTAYNPTNKNVDVSIMFTWQNMIGFFGKQVNVNSGNFNKIIKDKSKD 243

Query: 261 G--VHGLTLHHRTANGRPPVTFAVAAEETADVHVSECPCFLLSGNSKGITAKDMWNEIKK 318
              V  +  +    N      +++  ++   V +S    F+ +G+       D+W+E  K
Sbjct: 244 SEIVAAVMGNISNDNEEWNGEYSIGVKKVPGVDISYKAKFVTTGD-----GSDLWHEFSK 298

Query: 319 HGSFDHLDNDKTSPSEPGSSIGAAIAASLTIPSGSTRSVTFSLAWDCPEVKFFE-KVYHR 377
           +G  D+ D D+T   + G  IG+AIA +  +  G T  V F+L+WD P +KF     +++
Sbjct: 299 NGILDNKD-DETPTKQDG--IGSAIAVNFKLQPGQTIEVPFALSWDLPIMKFGGGDKWYK 355

Query: 378 RYTKFYGTLGDSAARIAHDAILEHAKWECEIEAWQRPILEDKRFPEWYPITLFNELYYLN 437
            YTK++G  G ++  I  +A+  + KWE  I+ WQ+PIL +K  P+WY   LFNELYYL 
Sbjct: 356 MYTKYFGKNGKNSFAILKEALNNYQKWEKMIDDWQKPILSNKSKPDWYKTALFNELYYLA 415

Query: 438 AGGTIWTDGSPPMQSLATIRERKFSLDTPSKIHTPASSDTALGTRLLENGEENIGQFLYL 497
            GGT W +G                                   ++ E  +     F  L
Sbjct: 416 DGGTAWENG-----------------------------------KVGEKDKRTNNMFGLL 440

Query: 498 EGAEYVMYNTYDVHFYSSFALVMLFPKLELSIQRDFAAAVMMHDPGTMKIMSDGKWVARK 557
           E  +Y  Y T DV FY SF LVML+P +E  + R FA  + + D    K+ S+G    +K
Sbjct: 441 ECFDYNYYETLDVRFYGSFPLVMLWPDIEKQVMRQFADTINVQDSSEFKVGSNGAMAVKK 500

Query: 558 CLGAVPHDIGLDD--PWFEINSYNLFNSSRWKDLNSKFVLQVYRDFVATG--DKNFARAV 613
             G +PHD+G     PW +IN+Y+  N + WKDLNSK+VL VYRD+V TG  DK F +  
Sbjct: 501 VQGMIPHDLGSSYALPWIKINAYDWQNPNIWKDLNSKYVLLVYRDYVLTGKTDKEFLKYT 560

Query: 614 WPSVYIAMAYMEQFDKDGDGMIENEGFPDQTYDAWSANGVSAYCGGLWVAALQAASALAN 673
           W SV  A+  +++ DKD DG+ +NEG PDQTYD WS  G SAYCG LW+AAL+AA  +  
Sbjct: 561 WKSVKTALDKLKEMDKDNDGIPDNEGIPDQTYDTWSMKGTSAYCGSLWLAALKAAQEIGK 620

Query: 674 DVGDHASASYFWVRYQKAKAVYDS-LWNGSYFNYDNSDGSSSTSIQADQLAGQWYARACG 732
            + D+ +   +   Y+ A+  ++  LWNG Y+N+D ++     SI ADQLAGQWYA    
Sbjct: 621 VLKDNEAYIKYNEWYKIAQQNFEKELWNGEYYNFD-TESDHKDSIMADQLAGQWYADILR 679

Query: 733 LLPIADEAKVKKALTKIYDFNVLKVKGGMCGAMNGMQPDGRIDMSGLQAREIWPGVTYGL 792
           L  I  +  V+KAL KIY+FNV+K + G  GA+NGM+PDG +D S +QA+E+W GVTY L
Sbjct: 680 LGDILPKDHVQKALKKIYEFNVMKFENGKMGAVNGMRPDGIVDESDIQAQEVWTGVTYAL 739

Query: 793 AASMIQEEMVDMAFQTAAGVYEVAWSEDGLGYSFQTPESWNNNDEYRSLCYMRPLTIWAM 852
           A+ M    M + A+ TA GVY++ + + G GY F+TPE+W  +  YR+  YMRPL+IW+M
Sbjct: 740 ASFMKYRGMTEEAYNTAYGVYKMTYDKSGKGYWFRTPEAWTKDGNYRASMYMRPLSIWSM 799

Query: 853 Q 853
           +
Sbjct: 800 E 800


>gi|194386992|dbj|BAG59862.1| unnamed protein product [Homo sapiens]
          Length = 755

 Score =  535 bits (1377), Expect = e-149,   Method: Compositional matrix adjust.
 Identities = 305/750 (40%), Positives = 416/750 (55%), Gaps = 59/750 (7%)

Query: 116 ICDDAPVLANQFSVFVSRPNGEKFSSVLCPRSPGVPKKNTDSGIESWDWNLKGENCTYHA 175
           +  D P+   +F+V + R     +  VL    P V        + SW+W L G    YHA
Sbjct: 7   VIADQPICPLKFTVCLRREGQTVYQQVLSLERPSV--------LRSWNWGLCGYFAFYHA 58

Query: 176 LFPRAWTVYDGEPDPELRIVCRQISPFIPHNYKESSFPASVFTFTLSNSGQTSADVTLLF 235
           L+PRAWTVY   P   + + CRQI+P +PH+Y++SS P  VF + + N G  + DV+++F
Sbjct: 59  LYPRAWTVYQ-LPGQNVTLTCRQITPILPHDYQDSSLPVGVFVWDVENEGDEALDVSIMF 117

Query: 236 TWANSVAGDSGLSGHHFNSK-TMTKDG--VHGLTLHHRTANGRPPVTFAVAAEETADVHV 292
           +  N + G     G  +N    + + G  V GL LHH T     P T AVAA  TA   V
Sbjct: 118 SMRNGLGGGDDAPGGLWNEPFCLERSGETVRGLLLHHPTLPN--PHTMAVAARVTAATTV 175

Query: 293 SECPCFLLSGNSKGITAKDMWNEIKKHGSFDHLDNDKTSPSEPGSSIGAAIAASLTI-PS 351
           +    F         T + +W ++ + G  D     +++P++ G  I  A+  S  + P 
Sbjct: 176 THITAF-----DPDSTGQQVWQDLLQDGQLDS-PTGQSTPTQKGVGIAGAVCVSSKLRPR 229

Query: 352 GSTRSVTFSLAWDCPEVKFFEK--VYHRRYTKFYGTLGDSAARIAHDAILEHAKWECEIE 409
           G  R + FSLAWD P + F  K  V++RRYT+F+G  GD+A  ++H A+  +A+WE  I 
Sbjct: 230 GQCR-LEFSLAWDMPRIMFGAKGQVHYRRYTRFFGQDGDAAPALSHYALCRYAEWEERIS 288

Query: 410 AWQRPILEDKRFPEWYPITLFNELYYLNAGGTIWTDGSPPMQSLATIRERKFSLDTPSKI 469
           AWQ P+L+D+  P WY   LFNELY+L  GGT+W         L  + +           
Sbjct: 289 AWQSPVLDDRSLPAWYKSALFNELYFLADGGTVW---------LEVLED----------- 328

Query: 470 HTPASSDTALGTRL--LENGEENIGQFLYLEGAEYVMYNTYDVHFYSSFALVMLFPKLEL 527
               S    LG  +  L     + G+F YLEG EY MYNTYDVHFY+SFAL+ML+PKLEL
Sbjct: 329 ----SLPEELGRNMCHLRPTLRDYGRFGYLEGQEYRMYNTYDVHFYASFALIMLWPKLEL 384

Query: 528 SIQRDFAAAVMMHDPGTMKIMSDGKWVARKCLGAVPHDIGL--DDPWFEINSYNLFNSSR 585
           S+Q D A A +  D    + +  G     K    +PHDIG   D+PW  +N+Y + +++ 
Sbjct: 385 SLQYDMALATLREDLTRRRYLMSGVMAPVKRRNVIPHDIGDPDDEPWLRVNAYLIHDTAD 444

Query: 586 WKDLNSKFVLQVYRDFVATGDKNFARAVWPSVYIAMAYMEQFDKDGDGMIENEGFPDQTY 645
           WKDLN KFVLQVYRD+  TGD+NF + +WP     M    +FDKD DG+IEN G+ DQTY
Sbjct: 445 WKDLNLKFVLQVYRDYYLTGDQNFLKDMWPVCLAVMESEMKFDKDHDGLIENGGYADQTY 504

Query: 646 DAWSANGVSAYCGGLWVAALQAASALANDVGDHASASYFWVRYQKAKAVYDS-LWNGSYF 704
           D W   G SAYCGGLW+AA+     +A   G       F     + +  Y+  LWNG Y+
Sbjct: 505 DGWVTTGPSAYCGGLWLAAVAVMVQMAALCGAQDIQDKFSSILSRGQEAYERLLWNGRYY 564

Query: 705 NYDNSDGSSSTSIQADQLAGQWYARACGL----LPIADEAKVKKALTKIYDFNVLKVKGG 760
           NYD+S    S S+ +DQ AGQW+ +ACGL      +     V +AL  I++ NV    GG
Sbjct: 565 NYDSSSRPQSRSVMSDQCAGQWFLKACGLGEGDTEVFPTQHVVRALQTIFELNVQAFAGG 624

Query: 761 MCGAMNGMQPDGRIDMSGLQAREIWPGVTYGLAASMIQEEMVDMAFQTAAGVYEVAWSED 820
             GA+NGMQP G  D S +Q+ E+W GV YGLAA+MIQE +    FQTA G Y   W   
Sbjct: 625 AMGAVNGMQPHGVPDKSSVQSDEVWVGVVYGLAATMIQEGLTWEGFQTAEGCYRTVWER- 683

Query: 821 GLGYSFQTPESWNNNDEYRSLCYMRPLTIW 850
            LG +FQTPE++     +RSL YMRPL+IW
Sbjct: 684 -LGLAFQTPEAYCQQRVFRSLAYMRPLSIW 712


>gi|397519483|ref|XP_003829888.1| PREDICTED: non-lysosomal glucosylceramidase isoform 2 [Pan
           paniscus]
          Length = 755

 Score =  534 bits (1375), Expect = e-148,   Method: Compositional matrix adjust.
 Identities = 303/750 (40%), Positives = 416/750 (55%), Gaps = 59/750 (7%)

Query: 116 ICDDAPVLANQFSVFVSRPNGEKFSSVLCPRSPGVPKKNTDSGIESWDWNLKGENCTYHA 175
           +  D P+ + +F+V + R     +  VL    P V        + SW+W L G    YHA
Sbjct: 7   VIADQPICSLKFTVCLRREGQTVYQQVLSLERPSV--------LRSWNWGLCGYFAFYHA 58

Query: 176 LFPRAWTVYDGEPDPELRIVCRQISPFIPHNYKESSFPASVFTFTLSNSGQTSADVTLLF 235
           L+PRAWTVY   P   + + CRQI+P +PH+Y++SS P  VF + + N G  + DV+++F
Sbjct: 59  LYPRAWTVYQ-LPGQNVTLTCRQITPILPHDYQDSSLPVGVFVWDVENEGDEALDVSIMF 117

Query: 236 TWANSVAGDSGLSGHHFNSK-TMTKDG--VHGLTLHHRTANGRPPVTFAVAAEETADVHV 292
           +  N + G     G  +N    + + G  V GL LHH T     P T AVAA  TA   V
Sbjct: 118 SMRNGLGGGDDAPGGLWNEPFCLERSGETVRGLLLHHPTLPN--PYTMAVAARVTAATTV 175

Query: 293 SECPCFLLSGNSKGITAKDMWNEIKKHGSFDHLDNDKTSPSEPGSSIGAAIAASLTI-PS 351
           +    F         T + +W ++ + G  D     +++P++ G  I  A+  S  + P 
Sbjct: 176 THITAF-----DPDSTGQQVWQDLLQDGQLDS-PTGQSTPTQKGVGIAGAVCVSSKLRPR 229

Query: 352 GSTRSVTFSLAWDCPEVKFFEK--VYHRRYTKFYGTLGDSAARIAHDAILEHAKWECEIE 409
           G  R + FSLAWD P + F  K  V++RRYT+F+G  GD+A  ++H A+  + +WE  I 
Sbjct: 230 GQCR-LEFSLAWDMPRIMFGAKGQVHYRRYTRFFGQDGDAAPALSHYALCRYTEWEERIS 288

Query: 410 AWQRPILEDKRFPEWYPITLFNELYYLNAGGTIWTDGSPPMQSLATIRERKFSLDTPSKI 469
           AWQ P+L+D+  P WY   LFNELY+L  GGT+W         L  + +           
Sbjct: 289 AWQSPVLDDRSLPAWYKSALFNELYFLADGGTVW---------LEVLED----------- 328

Query: 470 HTPASSDTALGTRL--LENGEENIGQFLYLEGAEYVMYNTYDVHFYSSFALVMLFPKLEL 527
               S    LG  +  L     + G+F YLEG EY MYNTYDVHFY+SFAL+ML+PKLEL
Sbjct: 329 ----SLPEELGRNMCHLHPTLRDYGRFGYLEGQEYRMYNTYDVHFYASFALIMLWPKLEL 384

Query: 528 SIQRDFAAAVMMHDPGTMKIMSDGKWVARKCLGAVPHDIGL--DDPWFEINSYNLFNSSR 585
           S+Q D A A +  D    + +  G     K    +PHDIG   D+PW  +N+Y + +++ 
Sbjct: 385 SLQYDMALATLREDLTRRRYLMSGVMAPVKRRNVIPHDIGDPDDEPWLRVNAYLIHDTAD 444

Query: 586 WKDLNSKFVLQVYRDFVATGDKNFARAVWPSVYIAMAYMEQFDKDGDGMIENEGFPDQTY 645
           WKDLN KFVLQVYRD+  TGD+NF + +WP     M    +FDKD DG+IEN G+ DQTY
Sbjct: 445 WKDLNLKFVLQVYRDYYLTGDQNFLKDMWPVCLAVMESEMKFDKDHDGLIENGGYADQTY 504

Query: 646 DAWSANGVSAYCGGLWVAALQAASALANDVGDHASASYFWVRYQKAKAVYDS-LWNGSYF 704
           D W   G SAYCGGLW+AA+     +A   G       F     + +  Y+  LWNG Y+
Sbjct: 505 DGWVTTGPSAYCGGLWLAAVAVMVQMAALCGAQDIQDKFSSILSRGQEAYERLLWNGRYY 564

Query: 705 NYDNSDGSSSTSIQADQLAGQWYARACGL----LPIADEAKVKKALTKIYDFNVLKVKGG 760
           NYD+S    S S+ +DQ AGQW+ +ACGL      +     V +AL  +++ NV    GG
Sbjct: 565 NYDSSSRPQSRSVMSDQCAGQWFLKACGLGEGDTEVFPTQHVVRALQTVFELNVQAFAGG 624

Query: 761 MCGAMNGMQPDGRIDMSGLQAREIWPGVTYGLAASMIQEEMVDMAFQTAAGVYEVAWSED 820
             GA+NGMQP G  D S +Q+ E+W GV YGLAA+MIQE +    FQTA G Y   W   
Sbjct: 625 AMGAVNGMQPHGVPDKSSVQSDEVWVGVVYGLAATMIQEGLTWEGFQTAEGCYRTVWER- 683

Query: 821 GLGYSFQTPESWNNNDEYRSLCYMRPLTIW 850
            LG +FQTPE++     +RSL YMRPL+IW
Sbjct: 684 -LGLAFQTPEAYCQQRVFRSLAYMRPLSIW 712


>gi|242015137|ref|XP_002428230.1| conserved hypothetical protein [Pediculus humanus corporis]
 gi|212512791|gb|EEB15492.1| conserved hypothetical protein [Pediculus humanus corporis]
          Length = 812

 Score =  528 bits (1359), Expect = e-147,   Method: Compositional matrix adjust.
 Identities = 318/822 (38%), Positives = 455/822 (55%), Gaps = 61/822 (7%)

Query: 46  FQEFRHLAHIGFRLYRYSKQEEAKGKVPVFNVFRKHHITSDQGIPLGGIGAGSIGRSYRG 105
             +F  +  +  R   YS     KG+ P+ +  +  +     G+PLGGIG+GSIGR ++G
Sbjct: 33  LNQFWSMIPLIIRYLIYSIYILCKGRRPLMDYMKNLNGKQMYGVPLGGIGSGSIGRGFKG 92

Query: 106 EFQRFKLFHGICDDAPVLANQFSVFVSRPNGEK-FSSVL-CPRSPGVPKKNTDSGIESWD 163
           EF R++L  G  +   +  NQF + V    G   F  VL C R+P        S +++WD
Sbjct: 93  EFCRYQLRPGNYEYETIYGNQFILTVRDGEGNAVFQKVLNCGRAP-------KSCLKNWD 145

Query: 164 WNLKGENCTYHALFPRAWTVYDGEPDPELRIVCRQISPFIPHNYKESSFPASVFTFTLSN 223
           W L GEN  Y  L+PR+WT Y+ E + +LR++C+QISP IP+NYK+S  P S+F + + N
Sbjct: 146 WTLNGENANYVGLYPRSWTEYNIE-EQKLRLICKQISPVIPNNYKDSCIPGSMFIWHIEN 204

Query: 224 SGQTSADVTLLFTWANSVA--GDS--GLSGHHFNSKTMTKDGVHGLTLHHRTANGRPPVT 279
                 DV + FT+ N +   GD   G     F  +  +   + G+ ++H  ++     T
Sbjct: 205 QSNEVKDVNITFTFQNGIGEKGDKKKGCWSQMF-KENCSNGNILGVLINHVISSMN--YT 261

Query: 280 FAVAAEETADVHVSECPCFLLSGNSKGITAKDMWNEIKKHGSFDHLDNDKTSPSEPGSSI 339
           F ++  E A VHVS     L   N  G   KD+WN+IK  G  +  D D +  +  G  +
Sbjct: 262 FGISVCEKAGVHVS----CLKKWNPLG-NGKDIWNKIKT-GKLEKFDFDPSHHTANGEQL 315

Query: 340 GAAIAASLTIPSGSTRSVTFSLAWDCPEVKFFE--KVYHRRYTKFYGTLGDSAARIAHDA 397
            +A+ A   IP  S  ++ F+L WD P++ F E  K Y + YT+F+G    +  +++   
Sbjct: 316 ASAVCAQTLIPGKSVSTLEFNLIWDMPKINFHENKKQYWKYYTRFFGK-EKNGPKLSAYC 374

Query: 398 ILEHAKWECEIEAWQRPILEDKRFPEWYPITLFNELYYLNAGGTIWTDGSPPMQSLATIR 457
              +  W+ EI+ WQ PIL D   P WY   LFNE Y+++ GGT+W D            
Sbjct: 375 FENYKNWDLEIDKWQNPILNDDNLPSWYKSALFNETYFVSDGGTVWLD------------ 422

Query: 458 ERKFSLDTPSKIHTPASSDTALGTRLLENGEENIGQFLYLEGAEYVMYNTYDVHFYSSFA 517
                LD   K    A+                 G+F YLEG EY MYNTYDVHFY+SFA
Sbjct: 423 -----LDEDEKKLLSANDPRC-----------EYGRFAYLEGHEYRMYNTYDVHFYASFA 466

Query: 518 LVMLFPKLELSIQRDFAAAVMMHDPGTMKIMSDGKWVARKCLGAVPHDIG--LDDPWFEI 575
           L ML+P L+  +QRDF   V   D      + +G    RK   +VPHD+G   ++P+ +I
Sbjct: 467 LAMLWPNLQSCVQRDFCDTVPQEDRSRQWHLFNGSIGYRKVKNSVPHDLGDPFEEPFLKI 526

Query: 576 NSYNLFNSSRWKDLNSKFVLQVYRDFVATGDKNFARAVWPSVYIAMAYMEQFDKDGDGMI 635
           NSY + + S W+DL+ KFVLQ YRD+    D N  + +W SV + M    ++DKDGDG+I
Sbjct: 527 NSYPIHDVSEWRDLDLKFVLQSYRDYFLNNDLNQLKYLWKSVCVLMNNALRWDKDGDGLI 586

Query: 636 ENEGFPDQTYDAWSANGVSAYCGGLWVAALQAASALANDVGDHASASYFWVRYQKAKAVY 695
           EN G PDQTYDAW+  G SAYCG LW+AALQ    +A  + D  + + F    + A+  +
Sbjct: 587 ENVGEPDQTYDAWTMEGPSAYCGSLWLAALQCTCKMALILSDVDTYNKFSETLKMAQQSF 646

Query: 696 D-SLWNGSYFNYDNSDGSSSTSIQADQLAGQWYARACGLL-PIADEAKVKKALTKIYDFN 753
           D  LWNG Y+N+D S+    T+I +DQL G WY   CG+   +   AKV  +L  IY+ N
Sbjct: 647 DKKLWNGKYYNFDCSE-KWGTTIMSDQLCGLWYLYCCGIKNEVLPPAKVNSSLRCIYENN 705

Query: 754 VLKVKGGMCGAMNGMQPDGRIDMSGLQAREIWPGVTYGLAASMIQEEMVDMAFQTAAGVY 813
           VLK+  G  G +NGM P+G ID+  LQ+ EIW GV Y LA+ MI E MV+  ++TA G+Y
Sbjct: 706 VLKLGKGNLGCVNGMLPNGEIDIHTLQSEEIWVGVVYALASLMIFEGMVEEGWKTARGIY 765

Query: 814 EVAWSEDGLGYSFQTPESWNNNDEYRSLCYMRPLTIWAMQWA 855
           E  +   G+G  F+TPE+   + +YR++ YMRPL+IW+MQ A
Sbjct: 766 ETVYDRIGMG--FETPEALRESKKYRAIGYMRPLSIWSMQLA 805


>gi|119578751|gb|EAW58347.1| glucosidase, beta (bile acid) 2, isoform CRA_c [Homo sapiens]
 gi|168270622|dbj|BAG10104.1| non-lysosomal glucosylceramidase [synthetic construct]
          Length = 877

 Score =  508 bits (1309), Expect = e-141,   Method: Compositional matrix adjust.
 Identities = 306/767 (39%), Positives = 423/767 (55%), Gaps = 62/767 (8%)

Query: 52  LAHIGFRLYRY----SKQEEAKGKVPVFNVFRKHHITSDQGIPLGGIGAGSIGRSYRGEF 107
           + HIG  L RY     ++   + K P  ++     +    G PLGGIG G+I R +RG+F
Sbjct: 112 IKHIGMGL-RYLQWWYRKTHVEKKTPFIDMINSVPLRQIYGCPLGGIGGGTITRGWRGQF 170

Query: 108 QRFKLFHGICDDAPVLANQFSVFVSRPNGEKFSSVLCPRSPGVPKKNTDSGIESWDWNLK 167
            R++L  G+     V+A+QF+V + R     +  VL    P V        + SW+W L 
Sbjct: 171 CRWQLNPGMYQHRTVIADQFTVCLRREGQTVYQQVLSLERPSV--------LRSWNWGLC 222

Query: 168 GENCTYHALFPRAWTVYDGEPDPELRIVCRQISPFIPHNYKESSFPASVFTFTLSNSGQT 227
           G    YHAL+PRAWTVY   P   + + CRQI+P +PH+Y++SS P  VF + + N G  
Sbjct: 223 GYFAFYHALYPRAWTVYQ-LPGQNVTLTCRQITPILPHDYQDSSLPVGVFVWDVENEGDE 281

Query: 228 SADVTLLFTWANSVAGDSGLSGHHFNSK-TMTKDG--VHGLTLHHRTANGRPPVTFAVAA 284
           + DV+++F+  N + G     G  +N    + + G  V GL LHH T     P T AVAA
Sbjct: 282 ALDVSIMFSMRNGLGGGDDAPGGLWNEPFCLERSGETVRGLLLHHPTLPN--PYTMAVAA 339

Query: 285 EETADVHVSECPCFLLSGNSKGITAKDMWNEIKKHGSFDHLDNDKTSPSEPGSSIGAAIA 344
             TA   V+    F         T + +W ++ + G  D     +++P++ G  I  A+ 
Sbjct: 340 RVTAATTVTHITAF-----DPDSTGQQVWQDLLQDGQLDS-PTGQSTPTQKGVGIAGAVC 393

Query: 345 ASLTI-PSGSTRSVTFSLAWDCPEVKFFEK--VYHRRYTKFYGTLGDSAARIAHDAILEH 401
            S  + P G  R + FSLAWD P + F  K  V++RRYT+F+G  GD+A  ++H A+  +
Sbjct: 394 VSSKLRPRGQCR-LEFSLAWDMPRIMFGAKGQVHYRRYTRFFGQDGDAAPALSHYALCRY 452

Query: 402 AKWECEIEAWQRPILEDKRFPEWYPITLFNELYYLNAGGTIWTDGSPPMQSLATIRERKF 461
           A+WE  I AWQ P+L+D+  P WY   LFNELY+L  GGT+W         L  + +   
Sbjct: 453 AEWEERISAWQSPVLDDRSLPAWYKSALFNELYFLADGGTVW---------LEVLED--- 500

Query: 462 SLDTPSKIHTPASSDTALGTRL--LENGEENIGQFLYLEGAEYVMYNTYDVHFYSSFALV 519
                       S    LG  +  L     + G+F YLEG EY MYNTYDVHFY+SFAL+
Sbjct: 501 ------------SLPEELGRNMCHLRPTLRDYGRFGYLEGQEYRMYNTYDVHFYASFALI 548

Query: 520 MLFPKLELSIQRDFAAAVMMHDPGTMKIMSDGKWVARKCLGAVPHDIGL--DDPWFEINS 577
           ML+PKLELS+Q D A A +  D    + +  G     K    +PHDIG   D+PW  +N+
Sbjct: 549 MLWPKLELSLQYDMALATLREDLTRRRYLMSGVMAPVKRRNVIPHDIGDPDDEPWLRVNA 608

Query: 578 YNLFNSSRWKDLNSKFVLQVYRDFVATGDKNFARAVWPSVYIAMAYMEQFDKDGDGMIEN 637
           Y + +++ WKDLN KFVLQVYRD+  TGD+NF + +WP     M    +FDKD DG+IEN
Sbjct: 609 YLIHDTADWKDLNLKFVLQVYRDYYLTGDQNFLKDMWPVCLAVMESEMKFDKDHDGLIEN 668

Query: 638 EGFPDQTYDAWSANGVSAYCGGLWVAALQAASALANDVGDHASASYFWVRYQKAKAVYDS 697
            G+ DQTYD W   G SAYCGGLW+AA+     +A   G       F     + +  Y+ 
Sbjct: 669 GGYADQTYDGWVTTGPSAYCGGLWLAAVAVMVQMAALCGAQDIQDKFSSILSRGQEAYER 728

Query: 698 -LWNGSYFNYDNSDGSSSTSIQADQLAGQWYARACGL----LPIADEAKVKKALTKIYDF 752
            LWNG Y+NYD+S    S S+ +DQ AGQW+ +ACGL      +     V +AL  I++ 
Sbjct: 729 LLWNGRYYNYDSSSRPQSRSVMSDQCAGQWFLKACGLGEGDTEVFPTQHVVRALQTIFEL 788

Query: 753 NVLKVKGGMCGAMNGMQPDGRIDMSGLQAREIWPGVTYGLAASMIQE 799
           NV    GG  GA+NGMQP G  D S +Q+ E+W GV YGLAA+MIQE
Sbjct: 789 NVQAFAGGAMGAVNGMQPHGVPDKSSVQSDEVWVGVVYGLAATMIQE 835


>gi|156554493|ref|XP_001604835.1| PREDICTED: non-lysosomal glucosylceramidase-like [Nasonia
           vitripennis]
          Length = 826

 Score =  508 bits (1308), Expect = e-141,   Method: Compositional matrix adjust.
 Identities = 304/824 (36%), Positives = 451/824 (54%), Gaps = 60/824 (7%)

Query: 46  FQEFRHLAHIGFRLYRYSKQEEAKGKVPVFNVFRKHHITSDQGIPLGGIGAGSIGRSYRG 105
           F++   +  +  R   Y       G+  + + F   H     G+P+GG+G GSIGR +RG
Sbjct: 33  FKQTIQMMPLAVRYVFYYTNVSRYGRPVLMDYFNMRHGQQIYGVPIGGLGGGSIGRGFRG 92

Query: 106 EFQRFKLFHGICDDAPVLANQFSVFVSRPNGEK-FSSVLCPRSPGVPKKNTDSGIESWDW 164
           EF R+ L  G+     + ANQF + V   NG   ++ VL  ++   PK      + SW W
Sbjct: 93  EFCRYALLPGLYSYHVIPANQFILTVRDVNGNTIYNQVLSTKTK--PKH-----LRSWKW 145

Query: 165 NLKGENCTYHALFPRAWTVYDGEPDPELRIVCRQISPFIPHNYKESSFPASVFTFTLSNS 224
             +G   +Y  L+PR+WT Y+ E + EL++ CRQ+SP IPH+YK+SS P +VF + + N 
Sbjct: 146 GFEGNKASYTGLYPRSWTTYEVE-ELELKLTCRQVSPVIPHDYKDSSLPCAVFVWDVVNR 204

Query: 225 GQTSADVTLLFTWANSVAGDSGLSGHHFNSKTMTKDGVHGLTLHHRTANGRPPVTFAVAA 284
                 + + FT+ +  A      G  + ++    D   G T+ H+   G P  T+A+++
Sbjct: 205 SNRDLRIAITFTFQSGCASKEDAQGDKW-TEFFELDESRG-TMIHQEFRGMP-CTYAISS 261

Query: 285 EETAD-VHVSECPCFLLSGNSKGITAKDMWNEIKKHGSFDHLDNDKTSPSEPGSSIGAAI 343
               D V V+    F   GN        +W  ++  G FD     + S S+    I  A+
Sbjct: 262 RTRPDDVGVTRMIGFDPRGN-----GAVLWQRLQNAGRFDESAEPRIS-SKTRKPIATAV 315

Query: 344 AASLTIPSGSTRSVTFSLAWDCPEVKFF--EKVYHRRYTKFYGTLGDSAA-RIAHDAILE 400
            A   + +GS+  + FSLAW  P+++FF  +K + R YT+F+   G+ A   I   A+ +
Sbjct: 316 CADTLVKAGSSGFLEFSLAWHMPKIQFFFKKKDHLRYYTQFFS--GEKAGPEICRYALSQ 373

Query: 401 HAKWECEIEAWQRPILEDKRFPEWYPITLFNELYYLNAGGTIWTDGSPPMQSLATIRERK 460
           + +WE EIE WQ+ +LED   P+WY   +FNELY+++ GG++W         L+T    +
Sbjct: 374 YERWEDEIERWQKSVLEDPDLPDWYKSAIFNELYFVSDGGSLW---------LSTDESNE 424

Query: 461 FSLDTPSKIHTPASSDTALGTRLLENGEENIGQFLYLEGAEYVMYNTYDVHFYSSFALVM 520
              D P + +                     G+F YLEG EY MYNTYDVHFY++FAL  
Sbjct: 425 LQPDDPRRRY---------------------GRFAYLEGHEYRMYNTYDVHFYAAFALAQ 463

Query: 521 LFPKLELSIQRDFAAAVMMHDPGTMKIMSDGKWVARKCLGAVPHDIGL--DDPWFEINSY 578
           L+P LE  IQ +F  +V   D   ++ + DG    RK  G+VPHDIG   ++P+  IN+Y
Sbjct: 464 LWPNLEACIQYEFRDSVHAEDQQRLQGLYDGSKRHRKVKGSVPHDIGDPGEEPFDLINAY 523

Query: 579 NLFNSSRWKDLNSKFVLQVYRDFVATGDKNFARAVWPSVYIAMAYMEQFDKDGDGMIENE 638
            + + S+W+DLNSKFVL  YR +    +    R  W ++ + + +   FD+D DG+IEN 
Sbjct: 524 PIHDVSQWRDLNSKFVLSCYRIYFFNRNLEQLRDFWSTIKLVLEHSLTFDEDNDGLIENG 583

Query: 639 GFPDQTYDAWSANGVSAYCGGLWVAALQAASALANDVGDHASASYFWVRYQKAKAVY-DS 697
           GFPDQTYD W  +G SAYCGGLW+AAL  A  +A  +G+    + +     + K  + D 
Sbjct: 584 GFPDQTYDCWVMSGPSAYCGGLWIAALHCAVKMAELLGESEDEARYKGILDRGKVAFQDK 643

Query: 698 LWNGSYFNYDNSDGSSSTSIQADQLAGQWYARACGL-LPIADEAKVKKALTKIYDFNVLK 756
           LWNG Y+N+D     S  SI +DQL G W  RACG    +  + +V+ +L  ++  NV+K
Sbjct: 644 LWNGKYYNFDCGKDESKLSIMSDQLCGHWLLRACGFSYEVFPQDRVRSSLETVFQNNVMK 703

Query: 757 VKGGMCGAMNGMQPDGRIDMSGLQAREIWPGVTYGLAASMIQEEMVDMAFQTAAGVYEVA 816
            K G  GA+NG  P G ID S +Q+ E+W GV YGLAA +I E M++ AF+TA GVY   
Sbjct: 704 YKNGQQGAVNGFSPTGSIDYSCIQSEEMWTGVAYGLAALLIHEGMIEEAFRTAEGVYRTV 763

Query: 817 WSEDGLGYSFQTPESWNNNDEYRSLCYMRPLTIWAMQWALTKPK 860
           + + G+G  F+TPE+   +  YR++ YMRPL+IWAMQ A T  K
Sbjct: 764 YEKIGMG--FETPEALYEHKVYRAIGYMRPLSIWAMQHAWTMRK 805


>gi|10047285|dbj|BAB13431.1| KIAA1605 protein [Homo sapiens]
          Length = 922

 Score =  508 bits (1308), Expect = e-141,   Method: Compositional matrix adjust.
 Identities = 306/767 (39%), Positives = 423/767 (55%), Gaps = 62/767 (8%)

Query: 52  LAHIGFRLYRY----SKQEEAKGKVPVFNVFRKHHITSDQGIPLGGIGAGSIGRSYRGEF 107
           + HIG  L RY     ++   + K P  ++     +    G PLGGIG G+I R +RG+F
Sbjct: 157 IKHIGMGL-RYLQWWYRKTHVEKKTPFIDMINSVPLRQIYGCPLGGIGGGTITRGWRGQF 215

Query: 108 QRFKLFHGICDDAPVLANQFSVFVSRPNGEKFSSVLCPRSPGVPKKNTDSGIESWDWNLK 167
            R++L  G+     V+A+QF+V + R     +  VL    P V        + SW+W L 
Sbjct: 216 CRWQLNPGMYQHRTVIADQFTVCLRREGQTVYQQVLSLERPSV--------LRSWNWGLC 267

Query: 168 GENCTYHALFPRAWTVYDGEPDPELRIVCRQISPFIPHNYKESSFPASVFTFTLSNSGQT 227
           G    YHAL+PRAWTVY   P   + + CRQI+P +PH+Y++SS P  VF + + N G  
Sbjct: 268 GYFAFYHALYPRAWTVYQ-LPGQNVTLTCRQITPILPHDYQDSSLPVGVFVWDVENEGDE 326

Query: 228 SADVTLLFTWANSVAGDSGLSGHHFNSK-TMTKDG--VHGLTLHHRTANGRPPVTFAVAA 284
           + DV+++F+  N + G     G  +N    + + G  V GL LHH T     P T AVAA
Sbjct: 327 ALDVSIMFSMRNGLGGGDDAPGGLWNEPFCLERSGETVRGLLLHHPTLPN--PYTMAVAA 384

Query: 285 EETADVHVSECPCFLLSGNSKGITAKDMWNEIKKHGSFDHLDNDKTSPSEPGSSIGAAIA 344
             TA   V+    F         T + +W ++ + G  D     +++P++ G  I  A+ 
Sbjct: 385 RVTAATTVTHITAF-----DPDSTGQQVWQDLLQDGQLDS-PTGQSTPTQKGVGIAGAVC 438

Query: 345 ASLTI-PSGSTRSVTFSLAWDCPEVKFFEK--VYHRRYTKFYGTLGDSAARIAHDAILEH 401
            S  + P G  R + FSLAWD P + F  K  V++RRYT+F+G  GD+A  ++H A+  +
Sbjct: 439 VSSKLRPRGQCR-LEFSLAWDMPRIMFGAKGQVHYRRYTRFFGQDGDAAPALSHYALCRY 497

Query: 402 AKWECEIEAWQRPILEDKRFPEWYPITLFNELYYLNAGGTIWTDGSPPMQSLATIRERKF 461
           A+WE  I AWQ P+L+D+  P WY   LFNELY+L  GGT+W         L  + +   
Sbjct: 498 AEWEERISAWQSPVLDDRSLPAWYKSALFNELYFLADGGTVW---------LEVLED--- 545

Query: 462 SLDTPSKIHTPASSDTALGTRL--LENGEENIGQFLYLEGAEYVMYNTYDVHFYSSFALV 519
                       S    LG  +  L     + G+F YLEG EY MYNTYDVHFY+SFAL+
Sbjct: 546 ------------SLPEELGRNMCHLRPTLRDYGRFGYLEGQEYRMYNTYDVHFYASFALI 593

Query: 520 MLFPKLELSIQRDFAAAVMMHDPGTMKIMSDGKWVARKCLGAVPHDIGL--DDPWFEINS 577
           ML+PKLELS+Q D A A +  D    + +  G     K    +PHDIG   D+PW  +N+
Sbjct: 594 MLWPKLELSLQYDMALATLREDLTRRRYLMSGVMAPVKRRNVIPHDIGDPDDEPWLRVNA 653

Query: 578 YNLFNSSRWKDLNSKFVLQVYRDFVATGDKNFARAVWPSVYIAMAYMEQFDKDGDGMIEN 637
           Y + +++ WKDLN KFVLQVYRD+  TGD+NF + +WP     M    +FDKD DG+IEN
Sbjct: 654 YLIHDTADWKDLNLKFVLQVYRDYYLTGDQNFLKDMWPVCLAVMESEMKFDKDHDGLIEN 713

Query: 638 EGFPDQTYDAWSANGVSAYCGGLWVAALQAASALANDVGDHASASYFWVRYQKAKAVYDS 697
            G+ DQTYD W   G SAYCGGLW+AA+     +A   G       F     + +  Y+ 
Sbjct: 714 GGYADQTYDGWVTTGPSAYCGGLWLAAVAVMVQMAALCGAQDIQDKFSSILSRGQEAYER 773

Query: 698 -LWNGSYFNYDNSDGSSSTSIQADQLAGQWYARACGL----LPIADEAKVKKALTKIYDF 752
            LWNG Y+NYD+S    S S+ +DQ AGQW+ +ACGL      +     V +AL  I++ 
Sbjct: 774 LLWNGRYYNYDSSSRPQSRSVMSDQCAGQWFLKACGLGEGDTEVFPTQHVVRALQTIFEL 833

Query: 753 NVLKVKGGMCGAMNGMQPDGRIDMSGLQAREIWPGVTYGLAASMIQE 799
           NV    GG  GA+NGMQP G  D S +Q+ E+W GV YGLAA+MIQE
Sbjct: 834 NVQAFAGGAMGAVNGMQPHGVPDKSSVQSDEVWVGVVYGLAATMIQE 880


>gi|431902810|gb|ELK09025.1| Non-lysosomal glucosylceramidase [Pteropus alecto]
          Length = 970

 Score =  504 bits (1298), Expect = e-140,   Method: Compositional matrix adjust.
 Identities = 296/763 (38%), Positives = 411/763 (53%), Gaps = 75/763 (9%)

Query: 49  FRHLAHIGFR-LYRYSKQEEAKGKVPVFNVFRKHHITSDQGIPLGGIGAGSIGRSYRGEF 107
            +HL  +G R L  + ++ + + K P  ++     +    G PLGGIG G+I R +RG+F
Sbjct: 104 IKHLG-MGLRYLQWWYRKTQVEKKKPFIDLINSVPLKQIYGCPLGGIGGGTITRGWRGQF 162

Query: 108 QRFKLFHGICDDAPVLANQFSVFVSRPNGEKFSSVLCPRSPGVPKKNTDSGIESWDWNLK 167
            R++L  G+     V+A+QF+V + R     +  VL    P V        + SW+W L 
Sbjct: 163 CRWQLNPGMYQHRTVIADQFTVCLRREGRTVYQQVLSVERPSV--------LRSWNWGLC 214

Query: 168 GENCTYHALFPRAWTVYDGEPDPELRIVCRQISPFIPHNYKESSFPASVFTFTLSNSGQT 227
           G    YHAL+PRAWTVY   P  ++ + CRQI+P +PH+Y++SS P  +F + + N G  
Sbjct: 215 GYFAFYHALYPRAWTVYQ-LPGQKVTLTCRQITPILPHDYQDSSLPVGIFVWDVENEGDE 273

Query: 228 SADVTLLFTWANSVAGDSGLSGHHFNSK-TMTKDGVHGLTLHHRTANGRPPVTFAVAAEE 286
           + DV+++F+  N + G    SG  +N    + +DG                        E
Sbjct: 274 ALDVSIMFSMRNGLGGGDDASGGLWNEPFCLERDG------------------------E 309

Query: 287 TADVHVSECPCFLLSGNSKGITAKDMWNEIKKHGSFDHLDNDKTSPSEPGSSIGAAIAAS 346
           TAD  ++    F         T + +W ++ + G  D     +++PS+ G  I  A+  +
Sbjct: 310 TADTTITHITAF-----DPDSTGQQVWQDLLQDGQLDS-PAGRSTPSQKGVGIAGAVCVT 363

Query: 347 LTIPSGSTRSVTFSLAWDCPEVKFFEK--VYHRRYTKFYGTLGDSAARIAHDAILEHAKW 404
             +P      + FSLAWD P + F  K  VY+RRYT+F+G+ GD+A  ++H A+  +  W
Sbjct: 364 GKLPPRGQCCLEFSLAWDMPRIMFGAKGQVYYRRYTRFFGSDGDAAPALSHYALCRYTDW 423

Query: 405 ECEIEAWQRPILEDKRFPEWYPITLFNELYYLNAGGTIWTDGSPPMQSLATIRERKFSLD 464
           E  I AWQ P+L+D+  P WY   LFNELY+L  GGTIW                   L+
Sbjct: 424 EERISAWQSPVLDDRSLPSWYKSALFNELYFLADGGTIW-------------------LE 464

Query: 465 TPSKIHTPASSDTALGTRL-LENGEENIGQFLYLEGAEYVMYNTYDVHFYSSFALVMLFP 523
            P      A  +  +G    L    +  G+F YLEG EY MYNTYDVHFY+SFAL+ML+P
Sbjct: 465 VPED----ALPEELVGNMCQLRPILQEYGRFGYLEGQEYRMYNTYDVHFYASFALIMLWP 520

Query: 524 KLELSIQRDFAAAVMMHDPGTMKIMSDGKWVARKCLGAVPHDIGL--DDPWFEINSYNLF 581
           KLELS+Q D A A +  D    + +  G     K    +PHDIG   D+PW  +N+Y + 
Sbjct: 521 KLELSLQYDMALATLREDLSWRQYLMSGVVAPVKRRNVIPHDIGDPDDEPWLRVNAYMIH 580

Query: 582 NSSRWKDLNSKFVLQVYRDFVATGDKNFARAVWPSVYIAMAYMEQFDKDGDGMIENEGFP 641
           +++ WKDLN KFVLQVYRD+  TGD+ F R +WP     M    +FDKD DG+IEN G+ 
Sbjct: 581 DTADWKDLNLKFVLQVYRDYYLTGDQGFLRDMWPVCLAVMESEMKFDKDQDGLIENGGYA 640

Query: 642 DQTYDAWSANGVSAYCGGLWVAALQAASALANDVGDHASASYFWVRYQKAKAVYDS-LWN 700
           DQTYDAW   G SAYCGGLW+AA+     +A   G       F     + +  Y+  LWN
Sbjct: 641 DQTYDAWVTTGPSAYCGGLWLAAVAVMVQMAALCGTQDILDKFSSILSRGREAYERLLWN 700

Query: 701 GSYFNYDNSDGSSSTSIQADQLAGQWYARACGL----LPIADEAKVKKALTKIYDFNVLK 756
           G Y+NYD S    S SI +DQ AGQW+ RA GL      +     V  AL  I+ FNV  
Sbjct: 701 GHYYNYDCSSQPQSYSIMSDQCAGQWFLRASGLGEGDTEVFPTPHVVCALQTIFKFNVQA 760

Query: 757 VKGGMCGAMNGMQPDGRIDMSGLQAREIWPGVTYGLAASMIQE 799
             GG  GA+NGMQP G  D S +Q+ E+W GV YGLAA+MIQE
Sbjct: 761 FAGGAMGAVNGMQPHGVPDRSSVQSDEVWVGVVYGLAATMIQE 803



 Score = 47.4 bits (111), Expect = 0.038,   Method: Compositional matrix adjust.
 Identities = 21/41 (51%), Positives = 27/41 (65%), Gaps = 2/41 (4%)

Query: 812 VYEVAWSEDGLGYSFQTPESWNNNDEYRSLCYMRPLTIWAM 852
            Y   W    LG +FQTPE++  +  +RSL YMRPL+IWAM
Sbjct: 890 CYRTVWER--LGLAFQTPEAYCQHQVFRSLAYMRPLSIWAM 928


>gi|413920605|gb|AFW60537.1| hypothetical protein ZEAMMB73_659971, partial [Zea mays]
          Length = 449

 Score =  501 bits (1290), Expect = e-139,   Method: Compositional matrix adjust.
 Identities = 256/421 (60%), Positives = 317/421 (75%), Gaps = 8/421 (1%)

Query: 17  KVDPGQPPQLTWQRKLDSNVKVPTGFTLSFQEFRHLAHIGFRLYRYSKQEEAKGKVPVFN 76
           KV PGQ P LTW+ KL         F L+++E   LA IG RL R+  +E +KG++ V +
Sbjct: 28  KVYPGQLPVLTWEHKLTHVRHDLPPFRLTWREIMQLAGIGLRLSRHILEETSKGRIAVID 87

Query: 77  VFRKHHITSDQGIPLGGIGAGSIGRSYRGEFQRFKLFHGICDDAPVLANQFSVFVSRPNG 136
             +K    S QG+PLGG    SIGRSY+G+FQR++LF G C+D  VLANQFS F+SR +G
Sbjct: 88  PMKKRAARSGQGVPLGG----SIGRSYKGDFQRWQLFPGSCEDNAVLANQFSAFISRQDG 143

Query: 137 EKFSSVLCPRSPGVPKKNTDSGIESWDWNLKGENCTYHALFPRAWTVYDGEPDPELRIVC 196
           +K+S+VL P  P +PK +  SGI SWDWN+ G+  TYHAL+PRAWTVYDGEPDPEL IVC
Sbjct: 144 KKYSTVLHPGKPDLPKGSDISGIGSWDWNMSGQQSTYHALYPRAWTVYDGEPDPELNIVC 203

Query: 197 RQISPFIPHNYKESSFPASVFTFTLSNSGQTSADVTLLFTWANSVAGDSGLSGHHFNSKT 256
           RQISP IPHNY++SS+P +VFTFT++NSG T+ADVTLLFTWANSV G S L+G+H NS  
Sbjct: 204 RQISPIIPHNYQQSSYPVAVFTFTVTNSGHTAADVTLLFTWANSVGGKSELTGYHSNSSM 263

Query: 257 MTKDGVHGLTLHHRTANGRPPVTFAVAAEETADVHVSECPCFLLSGNSKG--ITAKDMWN 314
           + KDGVHG+ LHHRTA+G+P VTF +AA+E  D+ +SECP F++SG+S     TAKDMWN
Sbjct: 264 IEKDGVHGILLHHRTADGQPHVTFVIAAQEKEDILISECPYFVISGSSASDEFTAKDMWN 323

Query: 315 EIKKHGSFDHLDNDKTSP-SEPGSSIGAAIAASLTIPSGSTRSVTFSLAWDCPEVKFFE- 372
            +K+HGSFDHLD  KTS  S PGSSIGAAIAAS+ +   +T+ ++FSLAW CPEVKF   
Sbjct: 324 SVKEHGSFDHLDPIKTSMCSRPGSSIGAAIAASVKLAPKATQDISFSLAWACPEVKFSSG 383

Query: 373 KVYHRRYTKFYGTLGDSAARIAHDAILEHAKWECEIEAWQRPILEDKRFPEWYPITLFNE 432
           K YHRRYTKFYGT GD+AA +AHDAILEHA WE +IE WQ PIL+D+RFP W+  TL   
Sbjct: 384 KTYHRRYTKFYGTDGDAAAALAHDAILEHASWERQIEEWQDPILQDERFPAWFVPTLIIH 443

Query: 433 L 433
           L
Sbjct: 444 L 444


>gi|443709232|gb|ELU03988.1| hypothetical protein CAPTEDRAFT_186839 [Capitella teleta]
          Length = 802

 Score =  493 bits (1269), Expect = e-136,   Method: Compositional matrix adjust.
 Identities = 297/787 (37%), Positives = 425/787 (54%), Gaps = 98/787 (12%)

Query: 88  GIPLGGIGAGSIGRSYRGEFQRFKLFHGICDDAPVLANQFSVFVSRPNGEKFSSVLCPRS 147
           G P+GGIG GSIGR +RGEF RF++  G+     V A+QF + V + N      VL P  
Sbjct: 87  GAPIGGIGCGSIGRGFRGEFCRFQMIPGMYTHKTVEADQFILTVRKSNETIHQCVLSPL- 145

Query: 148 PGVPKKNTDSGIESWDWNLKGENCTYHALFPRAWTVYDGEPDPELRIVCRQISPFIPHNY 207
                K     + SW W    +  TYHAL+PR+WT Y+  P+ +L ++CRQISP  P++Y
Sbjct: 146 -----KLKGKRLSSWRWGFPSQQATYHALYPRSWTRYE-IPECDLTLICRQISPIFPNDY 199

Query: 208 KESSFPASVFTFTLSNSGQTSADVTLLFTWANSVAGDSGLSGHHFNSK-----TMTKDGV 262
           +E+ FP SVF + + N G  + D++L FT+ N        SG  ++       T T D V
Sbjct: 200 QETCFPGSVFVWEVINKGSDAVDISLTFTFKNGQGVKEDRSGGVWSRAFQEGITGTDDCV 259

Query: 263 HGLTLHHRTANGRPPVTFAVAAEETADVHVSECPCFLLSGNSKGITAKDMWNEIKKHGSF 322
            G+ +H +  +   P  + VAA+    ++VSE   F     +   T +++WN++   GS 
Sbjct: 260 SGVEIHQKFND--LPCVYGVAAKHGPGINVSEKSYF-----NPQSTGEEVWNDLLAEGS- 311

Query: 323 DHLDNDKTSPSE--PGSSIGAAIAASLTIPSGSTRSVTFSLAWDCPEVKFFEK--VYHRR 378
             L +   S SE   G  +  A+ A  TI   S  S+TFSL WD P + F  K   YHRR
Sbjct: 312 --LKSTPVSGSETAKGEELACAVCAKTTISPYSHTSLTFSLTWDMPIIHFGAKRNKYHRR 369

Query: 379 YTKFYGTLGDSAARIAHDAILEHAKWECEIEAWQRPILEDKRFPEWYPITLFNELYYLNA 438
           Y +F+G  G +  R+A  A+     WE +I  WQ+P L+  + P WY   LFNE Y+++ 
Sbjct: 370 YARFFGHDGCATPRLAAHALNHFTSWEEKIAEWQQPTLKANKLPAWYKSALFNETYFISD 429

Query: 439 GGTIWTDGSPPMQSLATIRERKFSLDTPSKIHTPASSDTALGTRLLENGEENIGQFLYLE 498
           GG++W D    +     + E  F                           +  G+F YLE
Sbjct: 430 GGSVWVDS---IDVKDAVSEHPFV--------------------------QEFGKFAYLE 460

Query: 499 GAEYVMYNTYDVHFYSSFALVMLFPKLELSIQRDFAAAVMMHDPGTMKIMSDGKWVARKC 558
           G EY MYNT DVH+Y+SFAL+ L+P L+LS+Q D                          
Sbjct: 461 GHEYRMYNTLDVHYYASFALMKLWPNLQLSLQYDID------------------------ 496

Query: 559 LGAVPHDIGLDDPWFEINSYNLFNSSRWKDLNSKFVLQVYRDFVATGDKNFARAVWPSVY 618
                     ++PW  +N+Y +  ++ WKDLN KFVLQVYRD+V T D+ +   ++P   
Sbjct: 497 ----------NEPWSRLNAYIIHPTNEWKDLNPKFVLQVYRDYVFTKDEQYLYEMYPQCK 546

Query: 619 IAMAYMEQFDKDGDGMIENEGFPDQTYDAWSANGVSAYCGGLWVAALQAASALANDVGDH 678
             M +  ++D D DG+I+N GF DQTYDAW+  G SAYCGGLW+AA++A + +A+ +G  
Sbjct: 547 AVMDHSLRWDVDRDGIIDNSGFADQTYDAWTVTGASAYCGGLWLAAVKAFTEMAHRLGVV 606

Query: 679 ASASYFWVRYQKAKAVY-DSLWNG----SYFNYDNSDGSSSTSIQADQLAGQWYARACGL 733
                +    ++ K  + + LWNG     Y+NYD S+     SI ADQ+ GQW+ +ACG+
Sbjct: 607 EDLMKYEEILERGKKNFEEKLWNGWVDCEYYNYDCSESGHHDSIMADQMCGQWFLKACGV 666

Query: 734 L--PIADEAKVKKALTKIYDFNVLKVKGGMCGAMNGMQPDGRIDMSGLQAREIWPGVTYG 791
               +   A VK+AL  ++  NVL   GG  GA+NGM+P+ + D++  Q+ E W GVTY 
Sbjct: 667 ADNAVFPSANVKRALEAVFKHNVLPFDGGRMGAINGMRPNAKKDVTSCQSDEFWTGVTYA 726

Query: 792 LAASMIQEEMVDMAFQTAAGVYEVAWSEDGLGYSFQTPESWNNNDEYRSLCYMRPLTIWA 851
           L A+MIQ  M+D  FQTA G Y   W   GL  +FQTPE++ +N  +RSL YMRPL IWA
Sbjct: 727 LGATMIQVGMIDKGFQTAYGAYHTCWERYGL--AFQTPEAYFDNRRFRSLGYMRPLAIWA 784

Query: 852 MQWALTK 858
           +Q A+ K
Sbjct: 785 IQHAVEK 791


>gi|189239183|ref|XP_966847.2| PREDICTED: similar to bile acid beta-glucosidase, putative
           [Tribolium castaneum]
 gi|270010940|gb|EFA07388.1| hypothetical protein TcasGA2_TC016367 [Tribolium castaneum]
          Length = 818

 Score =  480 bits (1236), Expect = e-132,   Method: Compositional matrix adjust.
 Identities = 306/800 (38%), Positives = 428/800 (53%), Gaps = 62/800 (7%)

Query: 70  GKVPVFNVFRKHHITSDQGIPLGGIGAGSIGRSYRGEFQRFKLFHGICDDAPVLANQFSV 129
           GK  V +           G+P+GGIG G+IGR YRGEF RF+L  G+ +   V ANQF V
Sbjct: 64  GKSIVMDYVHTLRAKQIYGVPIGGIGCGTIGRGYRGEFCRFQLKPGLYEYNTVDANQFIV 123

Query: 130 FVSRPNGEKFSSVLCPRSPGVPKKNTDSGIESWDWNLKGENCTYHALFPRAWTVYDGEPD 189
            +   + E     L    P          ++SW+  + G  C Y  L+PR+WT YD    
Sbjct: 124 TIKDGHNETIFHSLLSTFPT-------KSLKSWESLIDGSECFYTGLYPRSWTEYDLSK- 175

Query: 190 PELRIVCRQISPFIPHNYKESSFPASVFTFTLSNSGQTSADVTLLFTWANSVAGDSGLSG 249
             + + CRQI+P IPHNYK+SS P ++F + + N  +    VT+ FT+ N V   S    
Sbjct: 176 YGVMLTCRQITPIIPHNYKDSSLPCAMFVWDVKNISEEDRTVTIAFTFKNGVGDKSRDRS 235

Query: 250 HHFNSKTMTKDGVHGLTLHHRTANGRPPVTFAVAAEETADVHVSECPCFLLSGNSKGITA 309
              +SK  T     G+ L+H     +   ++++AA+    + +S+C  F    NS GI  
Sbjct: 236 STCSSKAFTLADSEGVILYHTI--DKMQCSYSLAAKTNPQIDISKCLYF--DPNSDGIKP 291

Query: 310 KDMWNEIKKHGSFDHLDNDKTSPSEPGSSIGAAIAASLTIPSGSTRSVTFSLAWDCPEVK 369
              W ++K +GSFD + +DK      G  +   IA   TI +G+T +   +L WD P V+
Sbjct: 292 ---WIQLKNNGSFDKI-SDKNYGQIFGE-MACGIATKATIRAGNTVTSEMALVWDMPSVQ 346

Query: 370 FFEKVYHRRYTKFYGTLGD---SAARIAHDAILEHAKWECEIEAWQRPILEDKRFPEWYP 426
           F +K   ++YT++Y    D   +  RI   A   +  WE  I  WQR IL+D+  P+WY 
Sbjct: 347 FPKK--QKKYTRYYTKHFDGDNTGLRIVRYAFENYQNWERAIYDWQRRILDDENLPDWYK 404

Query: 427 ITLFNELYYLNAGGTIWTDGSPPMQSLATIRERKFSLDTPSKIHTPASSDTALGTRLLEN 486
             LFNE YY++ GG IW         L     +K   + P   H                
Sbjct: 405 SALFNESYYISDGGAIWV-------ILDEQEAKKLPKNDPRLEH---------------- 441

Query: 487 GEENIGQFLYLEGAEYVMYNTYDVHFYSSFALVMLFPKLELSIQRDFAAAVMMHDPGTMK 546
                G+F  LEG EY MYNTYDVHFY+SF+ V+ FP L+  +Q D   A+    P  +K
Sbjct: 442 -----GKFALLEGHEYKMYNTYDVHFYASFSFVLNFPLLQSILQYDMRDAIFTEIPDKVK 496

Query: 547 IMSDGKWVARKCLGAVPHDIGL--DDPWFEINSYNLFNSSRWKDLNSKFVLQVYRDFVAT 604
           ++ DG+   RK    VPHDIG   ++P+  +NSY + + S+W+DLNSKFVLQV+RD   T
Sbjct: 497 MLYDGEVCERKVPNTVPHDIGDPGEEPFILLNSYPIHDVSQWRDLNSKFVLQVFRDAFIT 556

Query: 605 G--DKNFA--RAVWPSVYIAMAYMEQFDKDGDGMIENEGFPDQTYDAWSANGVSAYCGGL 660
           G  D++ A    ++ + Y  M     FD DGDG+IEN G PDQT+D W   G SAYCGGL
Sbjct: 557 GLDDRSIAYLNDMYNACYTVMHKSLDFDVDGDGLIENSGSPDQTFDTWVMTGASAYCGGL 616

Query: 661 WVAALQAASALANDVGDHASASYFWVRYQKAKAVYD-SLWNGSYFNYDNSDGSSSTSIQA 719
           W+AAL A + +A+ +        F     K  A ++  LWNG  +N+D SD     SI A
Sbjct: 617 WLAALFAMTKIADALQKTEDKEKFQELLDKGTAAFERKLWNGKCYNFDCSD-KECRSIMA 675

Query: 720 DQLAGQWYARACGL-LPIADEAKVKKALTKIYDFNVLKVKGGMCGAMNGMQPDGRIDMSG 778
           DQL GQWY R+CG    +  + +VK +L  IY+ NV     G  GA+NG   DG ID   
Sbjct: 676 DQLCGQWYLRSCGFNYEVFPQDRVKTSLKTIYENNVQSFCDGRMGAVNGF-IDGVIDEFT 734

Query: 779 LQAREIWPGVTYGLAASMIQEEMVDMAFQTAAGVYEVAWSEDGLGYSFQTPESWNNNDEY 838
           +Q++E+W GVTY LAASM+QE M   AF TA G+++     +  G SF TPE+      Y
Sbjct: 735 IQSQEVWTGVTYALAASMLQEGMRTEAFNTAGGMFKSM--SERFGLSFDTPEALYAAKYY 792

Query: 839 RSLCYMRPLTIWAMQWALTK 858
           R++ YMRPL+IW+MQ A+ +
Sbjct: 793 RAIGYMRPLSIWSMQLAIDQ 812


>gi|71992715|ref|NP_001021680.1| Protein HPO-13, isoform a [Caenorhabditis elegans]
 gi|18376555|emb|CAD21661.1| Protein HPO-13, isoform a [Caenorhabditis elegans]
          Length = 959

 Score =  480 bits (1236), Expect = e-132,   Method: Compositional matrix adjust.
 Identities = 302/852 (35%), Positives = 451/852 (52%), Gaps = 73/852 (8%)

Query: 37  KVPTGFTLSFQEFRHLAHIGFRLYRYSKQEEAKGKVPVFNVFRKHHITSDQGIPLGGIGA 96
           +VP      +Q +  L  +      + K    K K+ + N F+        G+P GGIG 
Sbjct: 86  RVPFNRPTPYQIYEALPFVRRYFMYWMKHTFDKEKLFI-NTFQPLKHKPYYGVPCGGIGC 144

Query: 97  GSIGRSYRGEFQRFKLFHGICDDAP--VLANQFSVFVSRPNGEKFSSVLCPRSPGVPKKN 154
           G+IGR +RG F +F L  G+ +     V A+QF + V   +   +  VL       P   
Sbjct: 145 GAIGRDFRGGFCKFSLRPGLVEHKVDVVAADQFILSVRENDRCIYQKVLSAADVQRP--- 201

Query: 155 TDSGIESWDWNLKGENCTYHALFPRAWTVYDGEPDPELRIVCRQISPFIPHNYKESSFPA 214
               + +WD+    +N  Y  LFPR+WT +   P+ EL +V RQ+SP +PHNY+++++P 
Sbjct: 202 -SGQLSTWDFKFPKKNVHYRGLFPRSWTTFRV-PELELTVVIRQVSPVLPHNYEDTTYPV 259

Query: 215 SVFTFTLSNSGQTSAD--VTLLFTWANSVAG-----DSGLSGHHFNSK-------TMTKD 260
            +F   + N G  S +  +++ FT+ N         ++  SG  F ++       T  ++
Sbjct: 260 CLFLIDVENGGSASREYEISVAFTFRNGTGNRRWEREAECSGVKFETEEEGLKPTTTAQE 319

Query: 261 GVHGLTLHHRTANGRPPVTFAVAAEETADVHVSECPCFLLSGNSKGITAKDMWNEIKKHG 320
            + G++L+H  ++   P T+ +A         + C  F  S N        +WN +K+ G
Sbjct: 320 KISGVSLYHTISSM--PCTYGLATTHREHTTTTVCERFDPSRNGAA-----LWNHLKQTG 372

Query: 321 SFDHLDNDKTSPSEPGSSIGAAIAASLTIPSGSTRSVTFSLAWDCPEVKF--FEKVYHRR 378
                + +    +     +  A+    T+P  ST++  ++L+WD P+V F    + YHRR
Sbjct: 373 DVPSCEEECLINAR---EMAVAVCNRFTLPPNSTKTHDYALSWDMPKVHFGSVARSYHRR 429

Query: 379 YTKFY--GTLGDSAARIAHDAILEHAKWECEIEAWQRPILEDKRFPEWYPITLFNELYYL 436
           YT+F+     G +A  +   A+    KWE EIEAWQ PIL  K+ P+WY   +FNELY++
Sbjct: 430 YTRFFEASEAGSAADSLCVRALRLKDKWESEIEAWQSPILNHKKLPDWYKSAIFNELYFI 489

Query: 437 NAGGTIWTDGSPPMQSLATIRERKFSLDTPSKIHTPASSDTALGTRLLENGEENIGQFLY 496
             GGT+W                 F  D        A  +T L +   ++  + IG+F Y
Sbjct: 490 TDGGTVW-----------------FEFD-----ENWAEHETHL-SHYTKDKMKEIGRFGY 526

Query: 497 LEGAEYVMYNTYDVHFYSSFALVMLFPKLELSIQRDFAAAVMMHDPGTMKIMSDGKWVAR 556
           LE  EY M NTYDVHFY+SFAL  L+P+LE++IQ +F   V        +   +G W   
Sbjct: 527 LESWEYRMVNTYDVHFYASFALAELWPELEITIQSEFTDQVYHSIEKPTRFHMEGDWANV 586

Query: 557 KCLGAVPHDIG--LDDPWFEINSYNLFNSSRWKDLNSKFVLQVYRDFVATGDK--NFARA 612
           K    VPHD+G   D+PW   N+Y + ++ +WKDLN K+VL  +RD+V   ++  +F   
Sbjct: 587 KTASRVPHDLGNPADEPWIATNAYVMHDTGKWKDLNMKYVLTSWRDYVVLSNEHEDFLFH 646

Query: 613 VWPSV-YIAMAYMEQFDKDGDGMIENEGFPDQTYDAWSANGVSAYCGGLWVAALQAASAL 671
            WP+V  I +  +E +D+DGDGMIEN G  DQTYDAW   GVSAYCG LW+A+L+ A  +
Sbjct: 647 TWPAVRMIMLEALENWDQDGDGMIENFGKADQTYDAWQMEGVSAYCGSLWLASLRVAIEM 706

Query: 672 ANDVGDHASASYFWVRYQKAKAVY-DSLWNGSYFNYDNSDGSSSTSIQADQLAGQWYARA 730
           A  + D  +   F    +KAK V+ D+LW G+YF +     S  T + ADQL G W+ ++
Sbjct: 707 AGLMKDDETQQLFRNTLEKAKKVFIDTLWTGTYFRFCERSRSRET-VMADQLCGYWFLQS 765

Query: 731 CGLLPIADEA----KVKKALTKIYDFNVLKVKGGMCGAMNGMQPDGRIDMSGLQAREIWP 786
                +ADE      V+ AL  IY  NV K   G  GA+NGM+P G +D   +QA E+W 
Sbjct: 766 VS-PELADELLPNHMVRSALDTIYRLNVCKFGNGRMGAVNGMKPSGVVDREYIQADEMWT 824

Query: 787 GVTYGLAASMIQEEMVDMAFQTAAGVYEVAWSEDGLGYSFQTPESWNNNDEYRSLCYMRP 846
           GVTY +A+ +IQ+  V+ AF TA+G Y   + E GL Y  QTPE+   +  YR++ YMRP
Sbjct: 825 GVTYAVASLLIQQGEVEKAFHTASGSYLTCFEETGLQY--QTPEALYESKFYRAIGYMRP 882

Query: 847 LTIWAMQWALTK 858
           L+IWAMQW+L K
Sbjct: 883 LSIWAMQWSLKK 894


>gi|71992731|ref|NP_001021682.1| Protein HPO-13, isoform c [Caenorhabditis elegans]
 gi|15130776|emb|CAC48128.1| Protein HPO-13, isoform c [Caenorhabditis elegans]
          Length = 922

 Score =  480 bits (1236), Expect = e-132,   Method: Compositional matrix adjust.
 Identities = 302/852 (35%), Positives = 450/852 (52%), Gaps = 73/852 (8%)

Query: 37  KVPTGFTLSFQEFRHLAHIGFRLYRYSKQEEAKGKVPVFNVFRKHHITSDQGIPLGGIGA 96
           +VP      +Q +  L  +      + K    K K+   N F+        G+P GGIG 
Sbjct: 49  RVPFNRPTPYQIYEALPFVRRYFMYWMKHTFDKEKL-FINTFQPLKHKPYYGVPCGGIGC 107

Query: 97  GSIGRSYRGEFQRFKLFHGICDDAP--VLANQFSVFVSRPNGEKFSSVLCPRSPGVPKKN 154
           G+IGR +RG F +F L  G+ +     V A+QF + V   +   +  VL       P   
Sbjct: 108 GAIGRDFRGGFCKFSLRPGLVEHKVDVVAADQFILSVRENDRCIYQKVLSAADVQRPSGQ 167

Query: 155 TDSGIESWDWNLKGENCTYHALFPRAWTVYDGEPDPELRIVCRQISPFIPHNYKESSFPA 214
               + +WD+    +N  Y  LFPR+WT +   P+ EL +V RQ+SP +PHNY+++++P 
Sbjct: 168 ----LSTWDFKFPKKNVHYRGLFPRSWTTFRV-PELELTVVIRQVSPVLPHNYEDTTYPV 222

Query: 215 SVFTFTLSNSGQTSAD--VTLLFTWANSVAG-----DSGLSGHHFNSK-------TMTKD 260
            +F   + N G  S +  +++ FT+ N         ++  SG  F ++       T  ++
Sbjct: 223 CLFLIDVENGGSASREYEISVAFTFRNGTGNRRWEREAECSGVKFETEEEGLKPTTTAQE 282

Query: 261 GVHGLTLHHRTANGRPPVTFAVAAEETADVHVSECPCFLLSGNSKGITAKDMWNEIKKHG 320
            + G++L+H  ++   P T+ +A         + C  F  S N        +WN +K+ G
Sbjct: 283 KISGVSLYHTISSM--PCTYGLATTHREHTTTTVCERFDPSRNGAA-----LWNHLKQTG 335

Query: 321 SFDHLDNDKTSPSEPGSSIGAAIAASLTIPSGSTRSVTFSLAWDCPEVKF--FEKVYHRR 378
                + +    +     +  A+    T+P  ST++  ++L+WD P+V F    + YHRR
Sbjct: 336 DVPSCEEECLINAR---EMAVAVCNRFTLPPNSTKTHDYALSWDMPKVHFGSVARSYHRR 392

Query: 379 YTKFY--GTLGDSAARIAHDAILEHAKWECEIEAWQRPILEDKRFPEWYPITLFNELYYL 436
           YT+F+     G +A  +   A+    KWE EIEAWQ PIL  K+ P+WY   +FNELY++
Sbjct: 393 YTRFFEASEAGSAADSLCVRALRLKDKWESEIEAWQSPILNHKKLPDWYKSAIFNELYFI 452

Query: 437 NAGGTIWTDGSPPMQSLATIRERKFSLDTPSKIHTPASSDTALGTRLLENGEENIGQFLY 496
             GGT+W                 F  D        A  +T L +   ++  + IG+F Y
Sbjct: 453 TDGGTVW-----------------FEFD-----ENWAEHETHL-SHYTKDKMKEIGRFGY 489

Query: 497 LEGAEYVMYNTYDVHFYSSFALVMLFPKLELSIQRDFAAAVMMHDPGTMKIMSDGKWVAR 556
           LE  EY M NTYDVHFY+SFAL  L+P+LE++IQ +F   V        +   +G W   
Sbjct: 490 LESWEYRMVNTYDVHFYASFALAELWPELEITIQSEFTDQVYHSIEKPTRFHMEGDWANV 549

Query: 557 KCLGAVPHDIG--LDDPWFEINSYNLFNSSRWKDLNSKFVLQVYRDFVATGDK--NFARA 612
           K    VPHD+G   D+PW   N+Y + ++ +WKDLN K+VL  +RD+V   ++  +F   
Sbjct: 550 KTASRVPHDLGNPADEPWIATNAYVMHDTGKWKDLNMKYVLTSWRDYVVLSNEHEDFLFH 609

Query: 613 VWPSV-YIAMAYMEQFDKDGDGMIENEGFPDQTYDAWSANGVSAYCGGLWVAALQAASAL 671
            WP+V  I +  +E +D+DGDGMIEN G  DQTYDAW   GVSAYCG LW+A+L+ A  +
Sbjct: 610 TWPAVRMIMLEALENWDQDGDGMIENFGKADQTYDAWQMEGVSAYCGSLWLASLRVAIEM 669

Query: 672 ANDVGDHASASYFWVRYQKAKAVY-DSLWNGSYFNYDNSDGSSSTSIQADQLAGQWYARA 730
           A  + D  +   F    +KAK V+ D+LW G+YF +     S  T + ADQL G W+ ++
Sbjct: 670 AGLMKDDETQQLFRNTLEKAKKVFIDTLWTGTYFRFCERSRSRET-VMADQLCGYWFLQS 728

Query: 731 CGLLPIADEA----KVKKALTKIYDFNVLKVKGGMCGAMNGMQPDGRIDMSGLQAREIWP 786
                +ADE      V+ AL  IY  NV K   G  GA+NGM+P G +D   +QA E+W 
Sbjct: 729 VS-PELADELLPNHMVRSALDTIYRLNVCKFGNGRMGAVNGMKPSGVVDREYIQADEMWT 787

Query: 787 GVTYGLAASMIQEEMVDMAFQTAAGVYEVAWSEDGLGYSFQTPESWNNNDEYRSLCYMRP 846
           GVTY +A+ +IQ+  V+ AF TA+G Y   + E GL Y  QTPE+   +  YR++ YMRP
Sbjct: 788 GVTYAVASLLIQQGEVEKAFHTASGSYLTCFEETGLQY--QTPEALYESKFYRAIGYMRP 845

Query: 847 LTIWAMQWALTK 858
           L+IWAMQW+L K
Sbjct: 846 LSIWAMQWSLKK 857


>gi|71992723|ref|NP_001021681.1| Protein HPO-13, isoform b [Caenorhabditis elegans]
 gi|50511739|emb|CAH04764.1| Protein HPO-13, isoform b [Caenorhabditis elegans]
          Length = 930

 Score =  480 bits (1236), Expect = e-132,   Method: Compositional matrix adjust.
 Identities = 302/852 (35%), Positives = 450/852 (52%), Gaps = 73/852 (8%)

Query: 37  KVPTGFTLSFQEFRHLAHIGFRLYRYSKQEEAKGKVPVFNVFRKHHITSDQGIPLGGIGA 96
           +VP      +Q +  L  +      + K    K K+   N F+        G+P GGIG 
Sbjct: 57  RVPFNRPTPYQIYEALPFVRRYFMYWMKHTFDKEKL-FINTFQPLKHKPYYGVPCGGIGC 115

Query: 97  GSIGRSYRGEFQRFKLFHGICDDAP--VLANQFSVFVSRPNGEKFSSVLCPRSPGVPKKN 154
           G+IGR +RG F +F L  G+ +     V A+QF + V   +   +  VL       P   
Sbjct: 116 GAIGRDFRGGFCKFSLRPGLVEHKVDVVAADQFILSVRENDRCIYQKVLSAADVQRPSGQ 175

Query: 155 TDSGIESWDWNLKGENCTYHALFPRAWTVYDGEPDPELRIVCRQISPFIPHNYKESSFPA 214
               + +WD+    +N  Y  LFPR+WT +   P+ EL +V RQ+SP +PHNY+++++P 
Sbjct: 176 ----LSTWDFKFPKKNVHYRGLFPRSWTTFRV-PELELTVVIRQVSPVLPHNYEDTTYPV 230

Query: 215 SVFTFTLSNSGQTSAD--VTLLFTWANSVAG-----DSGLSGHHFNSK-------TMTKD 260
            +F   + N G  S +  +++ FT+ N         ++  SG  F ++       T  ++
Sbjct: 231 CLFLIDVENGGSASREYEISVAFTFRNGTGNRRWEREAECSGVKFETEEEGLKPTTTAQE 290

Query: 261 GVHGLTLHHRTANGRPPVTFAVAAEETADVHVSECPCFLLSGNSKGITAKDMWNEIKKHG 320
            + G++L+H  ++   P T+ +A         + C  F  S N        +WN +K+ G
Sbjct: 291 KISGVSLYHTISSM--PCTYGLATTHREHTTTTVCERFDPSRNGAA-----LWNHLKQTG 343

Query: 321 SFDHLDNDKTSPSEPGSSIGAAIAASLTIPSGSTRSVTFSLAWDCPEVKF--FEKVYHRR 378
                + +    +     +  A+    T+P  ST++  ++L+WD P+V F    + YHRR
Sbjct: 344 DVPSCEEECLINAR---EMAVAVCNRFTLPPNSTKTHDYALSWDMPKVHFGSVARSYHRR 400

Query: 379 YTKFY--GTLGDSAARIAHDAILEHAKWECEIEAWQRPILEDKRFPEWYPITLFNELYYL 436
           YT+F+     G +A  +   A+    KWE EIEAWQ PIL  K+ P+WY   +FNELY++
Sbjct: 401 YTRFFEASEAGSAADSLCVRALRLKDKWESEIEAWQSPILNHKKLPDWYKSAIFNELYFI 460

Query: 437 NAGGTIWTDGSPPMQSLATIRERKFSLDTPSKIHTPASSDTALGTRLLENGEENIGQFLY 496
             GGT+W                 F  D        A  +T L +   ++  + IG+F Y
Sbjct: 461 TDGGTVW-----------------FEFD-----ENWAEHETHL-SHYTKDKMKEIGRFGY 497

Query: 497 LEGAEYVMYNTYDVHFYSSFALVMLFPKLELSIQRDFAAAVMMHDPGTMKIMSDGKWVAR 556
           LE  EY M NTYDVHFY+SFAL  L+P+LE++IQ +F   V        +   +G W   
Sbjct: 498 LESWEYRMVNTYDVHFYASFALAELWPELEITIQSEFTDQVYHSIEKPTRFHMEGDWANV 557

Query: 557 KCLGAVPHDIG--LDDPWFEINSYNLFNSSRWKDLNSKFVLQVYRDFVATGDK--NFARA 612
           K    VPHD+G   D+PW   N+Y + ++ +WKDLN K+VL  +RD+V   ++  +F   
Sbjct: 558 KTASRVPHDLGNPADEPWIATNAYVMHDTGKWKDLNMKYVLTSWRDYVVLSNEHEDFLFH 617

Query: 613 VWPSV-YIAMAYMEQFDKDGDGMIENEGFPDQTYDAWSANGVSAYCGGLWVAALQAASAL 671
            WP+V  I +  +E +D+DGDGMIEN G  DQTYDAW   GVSAYCG LW+A+L+ A  +
Sbjct: 618 TWPAVRMIMLEALENWDQDGDGMIENFGKADQTYDAWQMEGVSAYCGSLWLASLRVAIEM 677

Query: 672 ANDVGDHASASYFWVRYQKAKAVY-DSLWNGSYFNYDNSDGSSSTSIQADQLAGQWYARA 730
           A  + D  +   F    +KAK V+ D+LW G+YF +     S  T + ADQL G W+ ++
Sbjct: 678 AGLMKDDETQQLFRNTLEKAKKVFIDTLWTGTYFRFCERSRSRET-VMADQLCGYWFLQS 736

Query: 731 CGLLPIADEA----KVKKALTKIYDFNVLKVKGGMCGAMNGMQPDGRIDMSGLQAREIWP 786
                +ADE      V+ AL  IY  NV K   G  GA+NGM+P G +D   +QA E+W 
Sbjct: 737 VS-PELADELLPNHMVRSALDTIYRLNVCKFGNGRMGAVNGMKPSGVVDREYIQADEMWT 795

Query: 787 GVTYGLAASMIQEEMVDMAFQTAAGVYEVAWSEDGLGYSFQTPESWNNNDEYRSLCYMRP 846
           GVTY +A+ +IQ+  V+ AF TA+G Y   + E GL Y  QTPE+   +  YR++ YMRP
Sbjct: 796 GVTYAVASLLIQQGEVEKAFHTASGSYLTCFEETGLQY--QTPEALYESKFYRAIGYMRP 853

Query: 847 LTIWAMQWALTK 858
           L+IWAMQW+L K
Sbjct: 854 LSIWAMQWSLKK 865


>gi|10440626|gb|AAG16864.1|AC069145_13 unknown protein [Oryza sativa Japonica Group]
          Length = 444

 Score =  480 bits (1235), Expect = e-132,   Method: Compositional matrix adjust.
 Identities = 237/392 (60%), Positives = 300/392 (76%), Gaps = 1/392 (0%)

Query: 483 LLENGEENIGQFLYLEGAEYVMYNTYDVHFYSSFALVMLFPKLELSIQRDFAAAVMMHDP 542
           L +NG EN+G+FLYLEG EY+M+ TYDVHFY+SFAL+ LFPK+ELSIQRDFA AV+  D 
Sbjct: 18  LSKNGPENVGKFLYLEGVEYIMWCTYDVHFYASFALLDLFPKIELSIQRDFANAVLYEDR 77

Query: 543 GTMKIMSDGKWVARKCLGAVPHDIGLDDPWFEINSYNLFNSSRWKDLNSKFVLQVYRDFV 602
             MK ++DG    RK  GAVPHD+G  DPW E+N+YN+ ++S+WKDLN KFVLQVYRDF 
Sbjct: 78  RRMKFLADGTSGIRKVKGAVPHDLGTHDPWHEMNAYNIHDTSKWKDLNPKFVLQVYRDFA 137

Query: 603 ATGDKNFARAVWPSVYIAMAYMEQFDKDGDGMIENEGFPDQTYDAWSANGVSAYCGGLWV 662
           ATGD +F R VWP+V   M YM QFD+DGDG+IEN+GFPDQTYDAW+ +G+SAYCGGLW+
Sbjct: 138 ATGDMSFGRDVWPAVCAVMDYMNQFDRDGDGLIENDGFPDQTYDAWTVHGISAYCGGLWL 197

Query: 663 AALQAASALANDVGDHASASYFWVRYQKAKAVYDS-LWNGSYFNYDNSDGSSSTSIQADQ 721
           AALQAA+ +A+ +GD   A  + +++ +AKAVY++ LWNGSYFNYD+   S+S SIQADQ
Sbjct: 198 AALQAAATMAHRLGDRPFAEKYKLKFIQAKAVYEAKLWNGSYFNYDSGTSSNSRSIQADQ 257

Query: 722 LAGQWYARACGLLPIADEAKVKKALTKIYDFNVLKVKGGMCGAMNGMQPDGRIDMSGLQA 781
           LAGQWYA + GL P+ DE K++ AL KI++FNV+KVKGG  GA+NGM P+G++D + +Q+
Sbjct: 258 LAGQWYAASSGLPPLFDENKIRSALQKIFEFNVMKVKGGRLGAVNGMTPNGKVDETCMQS 317

Query: 782 REIWPGVTYGLAASMIQEEMVDMAFQTAAGVYEVAWSEDGLGYSFQTPESWNNNDEYRSL 841
           REIW GVTYG+AA+M+   M    F TA G++   WSE+G GY FQTPE W  +  YRSL
Sbjct: 318 REIWTGVTYGVAANMLLHGMEHQGFTTAEGIFIAGWSEEGYGYWFQTPEGWTIDGHYRSL 377

Query: 842 CYMRPLTIWAMQWALTKPKLSRQEIKHEISDR 873
            YMRPL IWAMQWA + PK      K  + DR
Sbjct: 378 IYMRPLAIWAMQWARSPPKAILDAPKVNLMDR 409


>gi|449442371|ref|XP_004138955.1| PREDICTED: non-lysosomal glucosylceramidase-like [Cucumis sativus]
 gi|449520930|ref|XP_004167485.1| PREDICTED: non-lysosomal glucosylceramidase-like [Cucumis sativus]
          Length = 993

 Score =  478 bits (1230), Expect = e-132,   Method: Compositional matrix adjust.
 Identities = 236/395 (59%), Positives = 306/395 (77%), Gaps = 3/395 (0%)

Query: 482 RLLE--NGEENIGQFLYLEGAEYVMYNTYDVHFYSSFALVMLFPKLELSIQRDFAAAVMM 539
           ++LE  N EE++G+FLYLEG EYVM+ TYDVHFY+S+AL+ LFPK+EL+IQRDFA AV+ 
Sbjct: 563 KVLEPGNTEEDVGRFLYLEGVEYVMWCTYDVHFYASYALLELFPKIELNIQRDFAKAVLS 622

Query: 540 HDPGTMKIMSDGKWVARKCLGAVPHDIGLDDPWFEINSYNLFNSSRWKDLNSKFVLQVYR 599
            D   ++ +++GK+  RK  GAVPHD+G  DPW E+N+YN+ ++SRWKDLN+KFVLQVYR
Sbjct: 623 EDGRKVRFLAEGKYGIRKVRGAVPHDLGTHDPWNEMNAYNIHDTSRWKDLNTKFVLQVYR 682

Query: 600 DFVATGDKNFARAVWPSVYIAMAYMEQFDKDGDGMIENEGFPDQTYDAWSANGVSAYCGG 659
           DF AT D +F   VWPSV  A+ YMEQFD+DGDG+IEN+GFPDQTYD W+ +G+SAYCG 
Sbjct: 683 DFAATRDMSFGVDVWPSVRAAIEYMEQFDRDGDGVIENDGFPDQTYDTWTVHGISAYCGC 742

Query: 660 LWVAALQAASALANDVGDHASASYFWVRYQKAKAVYDS-LWNGSYFNYDNSDGSSSTSIQ 718
           LWVAALQAA+A+A+++GD   A     ++ KA+ V ++ LWNGSYFNYD+   S+S SIQ
Sbjct: 743 LWVAALQAAAAMAHELGDQEFAETCKSKFLKARPVLEAELWNGSYFNYDSGSSSNSKSIQ 802

Query: 719 ADQLAGQWYARACGLLPIADEAKVKKALTKIYDFNVLKVKGGMCGAMNGMQPDGRIDMSG 778
           ADQLAGQWY  + GL P+ D+ K+K AL KIYDFNV+KV+GG  GA+NGM P+G+ID + 
Sbjct: 803 ADQLAGQWYTASSGLPPLFDDFKIKSALRKIYDFNVMKVRGGRMGAVNGMHPNGKIDETC 862

Query: 779 LQAREIWPGVTYGLAASMIQEEMVDMAFQTAAGVYEVAWSEDGLGYSFQTPESWNNNDEY 838
           +Q+REIW GVTYG+AA+MI   M + AF+TA G++   WSE+G GY FQTPE+W+ +  Y
Sbjct: 863 MQSREIWTGVTYGVAATMILAGMEEEAFKTAEGIFLAGWSEEGFGYWFQTPEAWSTDGHY 922

Query: 839 RSLCYMRPLTIWAMQWALTKPKLSRQEIKHEISDR 873
           RSL YMRPL+IW MQWAL+ PK      K  + DR
Sbjct: 923 RSLIYMRPLSIWGMQWALSLPKAILDAPKINVMDR 957



 Score =  447 bits (1151), Expect = e-122,   Method: Compositional matrix adjust.
 Identities = 226/449 (50%), Positives = 308/449 (68%), Gaps = 21/449 (4%)

Query: 19  DPGQPPQLTWQRKLDSNVKVPTGFTLSFQEFRHLAHIGFRLYRYSKQEEAKGKVPVFNVF 78
           D   PP+  W+RKL+ +  +   F+++F E   +  +G RL+ Y ++E ++G+    + F
Sbjct: 31  DSASPPEQAWRRKLNGHANLLKEFSVTFVEAIKMVRLGIRLWSYVREEASQGRKAPIDPF 90

Query: 79  RKHHI--TSDQGIPLGGIGAGSIGRSYRGEFQRFKLFHGICDDAPVLANQFSVFVSRPNG 136
            +     ++ QG+PLGG+G+GSI R +RGEF+++++  G C+ +PV+ANQFS+FVSR  G
Sbjct: 91  TRESCKPSASQGVPLGGMGSGSISRGFRGEFRQWQIIPGTCEASPVMANQFSIFVSRDGG 150

Query: 137 -EKFSSVLCP---RSPGVPKKNTDSGIESWDWNLKGENCTYHALFPRAWTVYDGEPDPEL 192
            +K++SVL P      G  +K+ DSGI SW WNL G++ TYHALFPRAWTVYDGEPDPEL
Sbjct: 151 GKKYASVLAPGQHEGLGCCRKDGDSGISSWGWNLDGQHSTYHALFPRAWTVYDGEPDPEL 210

Query: 193 RIVCRQISPFIPHNYKESSFPASVFTFTLSNSGQTSADVTLLFTWANSVAGDSGLSGHHF 252
           ++ CRQISPFIPHNY++SS P +VF +TL N+G+  A V+LLFTWANS+ G+S LSG+H 
Sbjct: 211 KVSCRQISPFIPHNYRDSSLPTAVFVYTLVNTGRERAKVSLLFTWANSIGGNSHLSGNHV 270

Query: 253 NSKTMTKDGVHGLTLHH-----------RTANGRPPVTFAVAAEETADVHVSECPCFLLS 301
           N   + +DGV G+ LHH           RTA G PPVTFA+AA ET +V V+  P F LS
Sbjct: 271 NEPFIDEDGVSGVLLHHKQNSKDLCKCFRTAKGNPPVTFAIAACETQNVSVTVLPSFGLS 330

Query: 302 GNSKGITAKDMWNEIKKHGSF--DHLDNDKTSPSEPGSSIGAAIAASLTIPSGSTRSVTF 359
             S  ITAKDMW+++ + G F  D+  +  + PS PG ++ AA+AAS  +      +V F
Sbjct: 331 EGS-CITAKDMWDKMVQDGQFDRDNFSSGPSMPSSPGETLCAAVAASAWVEPHGKCTVAF 389

Query: 360 SLAWDCPEVKFFEKV-YHRRYTKFYGTLGDSAARIAHDAILEHAKWECEIEAWQRPILED 418
           SL+W  P+VKF +   YHRRYTKFYGT G +A  +AHDA+  + +WE EIE WQRP+L D
Sbjct: 390 SLSWSSPKVKFLKGFSYHRRYTKFYGTSGKAAQTLAHDALTNYKRWEEEIEKWQRPVLMD 449

Query: 419 KRFPEWYPITLFNELYYLNAGGTIWTDGS 447
           +R PEWY  TLFNELY+L AGGT+W D S
Sbjct: 450 ERLPEWYKFTLFNELYFLVAGGTVWIDSS 478


>gi|159896834|ref|YP_001543081.1| hypothetical protein Haur_0301 [Herpetosiphon aurantiacus DSM 785]
 gi|159889873|gb|ABX02953.1| protein of unknown function DUF608 [Herpetosiphon aurantiacus DSM
           785]
          Length = 774

 Score =  477 bits (1228), Expect = e-131,   Method: Compositional matrix adjust.
 Identities = 288/788 (36%), Positives = 427/788 (54%), Gaps = 81/788 (10%)

Query: 87  QGIPLGGIGAGSIGRSYRGEFQRFKLFHGICDDAPVLANQFSVFVSRPNGEKFSSVLCPR 146
            G+P+GG+G+G+IGR++RG++ R+ L  G      V  NQ+SVF    + ++ + VLC  
Sbjct: 43  HGLPIGGMGSGAIGRNFRGDWSRWHLEVGKHVHRSVWPNQWSVFWQTAS-QQAAQVLCTT 101

Query: 147 SPGVPKKNTDSGIESWDWNLKGENCTYHALFPRAWTVYDGEPDPELRIVCRQISPFIPHN 206
            P     +TD  + SW+WN       YHALFPRAW  Y   PD  L +V  Q SP +  N
Sbjct: 102 QP-----DTDE-LSSWNWNYPVGAGNYHALFPRAWFDYQ-HPDWPLELVQEQFSPVLAGN 154

Query: 207 YKESSFPASVFTFTLSNSGQTSADVTLLFTWANSVAGDS-GLSGHHFNSK---------T 256
            KESSFP  VFT+ ++N G  +  + L+ TW ++ A ++ GLS    +S          T
Sbjct: 155 LKESSFPVGVFTWRVTNRGSETVRLGLMLTWEHTRAVEAAGLSLQRQHSAWNDGNTSGVT 214

Query: 257 MTKDGVHGLTLHHRTANGRPPVTFAVAAEETADVHVSECPCFLLSGNSKGITAKDMWNEI 316
           +T+     L+ H    NG    T+A+A +      VS+  C+ ++ +     A  +W + 
Sbjct: 215 LTQTSDQALSSH----NG----TWALAVQAPESASVSQWTCWDVAQD-----AAALWQDF 261

Query: 317 KKHGSF-DHLDNDKTSPSEPGSSIGAAIAASLTIPSGSTRSVTFSLAWDCPEVKFFEKV- 374
              G   D+  + K +  +  ++   AIA +L +  G++  + FSLAWD P V+F ++  
Sbjct: 262 ASDGQLADYPTSQKVAADQRSAT---AIAVTLELAPGASAVIPFSLAWDFPIVEFADQSR 318

Query: 375 YHRRYTKFYGTLGDSAARIAHDAILEHAKWECEIEAWQRPILEDKRFPEWYPITLFNELY 434
           +++RYT+F+GT GD A  +A  ++     W   IEAWQ PIL D + P WY   LFNELY
Sbjct: 319 WYKRYTRFWGTNGDQAQALAVASLTNADAWRTAIEAWQNPILADDQRPFWYKSALFNELY 378

Query: 435 YLNAGGTIWTD---GSPPMQSLATIRERKFSLDTPSKIHTPASSDTALGTRLLENGEENI 491
           YL  GGT+W D   G P                                    E   +++
Sbjct: 379 YLVDGGTLWVDRAVGGP------------------------------------EPAADDV 402

Query: 492 GQFLYLEGAEYVMYNTYDVHFYSSFALVMLFPKLELSIQRDFAAAVMMHDPGTMKIMSDG 551
           G F YLE  +Y  Y T DV FYSS++++ L+P+LE      F+  V   D   + I++  
Sbjct: 403 GLFSYLECYDYPFYGTLDVSFYSSWSILALWPELERGEILAFSKTVNDADDTVVTIVATQ 462

Query: 552 KWVARKCLGAVPHDIGL--DDPWFEINSYNLFNSSRWKDLNSKFVLQVYRDFVATGDKNF 609
               RK  GA+PHD+G   + P  + N+Y+  + + WKDLN K++L++YRD     D+  
Sbjct: 463 VPAIRKAAGALPHDLGAPKEQPLIKTNAYDFQDINNWKDLNLKYILRIYRDVSLWNDQAM 522

Query: 610 ARAVWPSVYIAMAYMEQFDKDGDGMIENEGFPDQTYDAWSANGVSAYCGGLWVAALQAAS 669
             A W ++  A+ Y+ QFD DGDG++++ G  DQTYD W+ +G ++Y   L + AL+AA 
Sbjct: 523 LEATWDTIPTALEYVHQFDSDGDGLLDHSG-ADQTYDTWAMSGAASYSASLLICALEAAI 581

Query: 670 ALANDVGDHASASYFWVRYQKAKAVYDS-LWNGSYFNYDNSDGSSSTSIQADQLAGQWYA 728
            LA  +GDHA A  +      A+  +++ LWNG+YF Y  +D      I ADQL GQWYA
Sbjct: 582 RLAQRMGDHAQADAWSEWLAAARQSFETKLWNGTYFRYHTADTDLREVIMADQLVGQWYA 641

Query: 729 RACGLLPIADEAKVKKALTKIYDFNVLKVKGGMCGAMNGMQPDGRIDMSGLQAREIWPGV 788
            A GL  +A    ++ AL  +Y FNV++   G  GA+NGM PDG +D S  QA E+W G 
Sbjct: 642 GAIGLPAVAPREMIRSALQTVYRFNVMQYANGALGAVNGMHPDGTVDTSSNQASEVWSGT 701

Query: 789 TYGLAASMIQEEMVDMAFQTAAGVYEVAWSEDGLGYSFQTPESWNNNDEYRSLCYMRPLT 848
           TY +AA M+QE +    +QTA G Y   ++E  LG  F+TPE+W     +R+  YMRP +
Sbjct: 702 TYAIAAMMLQEGLDLEGWQTAWGAYNATYNE--LGLWFRTPEAWGIERTFRASMYMRPQS 759

Query: 849 IWAMQWAL 856
           IWA++ AL
Sbjct: 760 IWAIEHAL 767


>gi|341901617|gb|EGT57552.1| hypothetical protein CAEBREN_15139 [Caenorhabditis brenneri]
          Length = 925

 Score =  474 bits (1219), Expect = e-130,   Method: Compositional matrix adjust.
 Identities = 297/846 (35%), Positives = 447/846 (52%), Gaps = 71/846 (8%)

Query: 37  KVPTGFTLSFQEFRHLAHIGFRLYRYSKQEEAKGKVPVFNVFRKHHITSDQGIPLGGIGA 96
           +VP      +Q +  L  +      + K    K K+ + N F+        G+P GGIG 
Sbjct: 48  RVPFNRPTPYQIYEALPFVRRYFMYWMKHTFDKEKLFI-NTFQPLKHKPYYGVPCGGIGC 106

Query: 97  GSIGRSYRGEFQRFKLFHGICDDAP--VLANQFSVFVSRPNGEKFSSVLCPRSPGVPKKN 154
           G+IGR +RG F +F L  G+ +     V A+QF + V   +   +  VL   +  V +  
Sbjct: 107 GAIGRDFRGGFCKFSLRPGLVEHKVDVVAADQFILSVRENDTCIYQKVLS--AADVHR-- 162

Query: 155 TDSGIESWDWNLKGENCTYHALFPRAWTVYDGEPDPELRIVCRQISPFIPHNYKESSFPA 214
             S + SWD+    ++  Y  LFPR+WT +   P+  L +  RQ+SP +PHNY+E+++P 
Sbjct: 163 --SQLSSWDFGFPKKHLHYRGLFPRSWTTFRV-PELGLTVTIRQVSPVLPHNYQETTYPV 219

Query: 215 SVFTFTLSNSGQTSA-DVTLLFTWANSVAG-----DSGLSGHHFNSKTMTKDGVHGLTLH 268
            +F   + N+    A DV++ FT+ N         +S  S   F+S    ++ V G++L+
Sbjct: 220 CLFLVDVENTSNNRAYDVSIAFTFRNGTGNRRWDRESECSAGRFDSIHSDRESVSGVSLY 279

Query: 269 HRTANGRPPVTFAVAAEETADVHVSECPCFLLSGNSKGITAKDMWNEIKKHGSFDHLDND 328
           H  ++   P T+ +A+    D H+S C  F  S +         WN +K+ G     + +
Sbjct: 280 HTISSM--PCTYGLASTHKKDSHLSICERFDPSRSGAAF-----WNHLKQTGDVPEFEEE 332

Query: 329 KTSPSEPGSSIGAAIAASLTIPSGSTRSVTFSLAWDCPEVKF--FEKVYHRRYTKFY--G 384
                +    +  ++     +P  S+++  ++L+WD P+V F    + YHRRYT+F+   
Sbjct: 333 ---CPQNAREMAVSVCNRFHLPPQSSKTHDYALSWDMPKVHFGSVARSYHRRYTRFFEGS 389

Query: 385 TLGDSAARIAHDAILEHAKWECEIEAWQRPILEDKRFPEWYPITLFNELYYLNAGGTIWT 444
             G  A  +   A+L   +WE EIE WQ PIL   + P+WY   +FNELY++  GGT+W 
Sbjct: 390 EAGSVADSLCVRALLARNQWEEEIEKWQSPILTHPKLPDWYKSAIFNELYFMVDGGTVWF 449

Query: 445 DGSPPMQSLATIRERKFSLDTPSKIHTPASSDTALGTRLLENGEENIGQFLYLEGAEYVM 504
           +            E   S+ T  K+                   +NIG+F YLE  EY M
Sbjct: 450 EFDENWAD----HEAHLSVYTKEKM-------------------KNIGRFGYLESWEYRM 486

Query: 505 YNTYDVHFYSSFALVMLFPKLELSIQRDFAAAVMMHDPGTMKIMSDGKWVARKCLGAVPH 564
            NTYDVHFY+S+AL  L+P+LEL++Q +F   V        +   +G W   K    VPH
Sbjct: 487 VNTYDVHFYASYALAELWPQLELTVQSEFTDQVYHSIEKPTRFHMEGDWANVKTASRVPH 546

Query: 565 DIG--LDDPWFEINSYNLFNSSRWKDLNSKFVLQVYRDFVATGD--KNFARAVWPSV-YI 619
           D+G   D+PW   N+Y + ++ +WKDLN K+VL  +RD+V   +  + F    WP++  I
Sbjct: 547 DLGNPADEPWIATNAYVMHDTGKWKDLNMKYVLTSWRDYVVLSEAHQEFLHHTWPAIKMI 606

Query: 620 AMAYMEQFDKDGDGMIENEGFPDQTYDAWSANGVSAYCGGLWVAALQAASALANDVGDHA 679
            +  +E +D++GDGMIEN G  DQTYDAW   GVSAYCG LW+A+L+ +  +A  V D  
Sbjct: 607 MLEALENWDQNGDGMIENFGKADQTYDAWQMEGVSAYCGSLWLASLRVSVEMAAIVKDEI 666

Query: 680 SASYFWVRYQKAKAVY-DSLWNGSYFNYDNSDGSSSTSIQADQLAGQWYARACG------ 732
           + + F    +KAK V+ D LW G+YF +     S  T + ADQL G W+ ++        
Sbjct: 667 TENLFKGTLEKAKKVFIDLLWTGTYFRFCERSRSRET-VMADQLCGYWFLQSVSPEMADD 725

Query: 733 LLPIADEAKVKKALTKIYDFNVLKVKGGMCGAMNGMQPDGRIDMSGLQAREIWPGVTYGL 792
           LLP      V+ AL  IY  NV K   G  GA+NGM+P G +D   +QA E+W GVTY +
Sbjct: 726 LLP---NHMVRSALDTIYRLNVCKFGNGQMGAVNGMKPSGVVDREYIQADEMWTGVTYAV 782

Query: 793 AASMIQEEMVDMAFQTAAGVYEVAWSEDGLGYSFQTPESWNNNDEYRSLCYMRPLTIWAM 852
           A+ +IQ+  V+ AF TA+G Y   + + GL Y  QTPE+   +  YR++ YMRPL+IWAM
Sbjct: 783 ASLLIQQGEVEKAFHTASGSYLTCFEQAGLQY--QTPEALYESKFYRAIGYMRPLSIWAM 840

Query: 853 QWALTK 858
           QW+L K
Sbjct: 841 QWSLKK 846


>gi|324500616|gb|ADY40283.1| Non-lysosomal glucosylceramidase [Ascaris suum]
          Length = 890

 Score =  467 bits (1202), Expect = e-128,   Method: Compositional matrix adjust.
 Identities = 315/826 (38%), Positives = 444/826 (53%), Gaps = 69/826 (8%)

Query: 58  RLYRYSKQEEAKGKVPVFNVFRKHHITSDQGIPLGGIGAGSIGRSYRGEFQRFKLFHGIC 117
           R + Y  +   K +    N F+        G+P GGIG GSIGR +RG F +F L  G+ 
Sbjct: 50  RYFIYWLKHVGKKEKLFINTFQPLKHKPHYGVPCGGIGCGSIGRDFRGGFCKFGLRPGLI 109

Query: 118 DDAP--VLANQFSVFVSRPNGEKFSSV----LCPRSPGVPKKNTDSGIESWDWNLKGENC 171
           +     + A+QF V + R NGE F++V    LC  +P  P  +T+  + +WD++   +  
Sbjct: 110 EHKVDVLRADQFIVTLRRKNGEGFATVYQKVLCA-APTTPATSTE--LNAWDFSFPPDCL 166

Query: 172 TYHALFPRAWTVYDGEPDPELRIVCRQISPFIPHNYKESSFPASVFTFTLSNSGQTSADV 231
           TY  L+PR+WT YD   +  + IV RQISP IP +Y++SS P SVF F + N G     V
Sbjct: 167 TYRGLYPRSWTRYDIR-EYRVSIVMRQISPVIPGDYRDSSLPVSVFVFDVENGGDEDLYV 225

Query: 232 TLLFTWANSV-----AGDSGLSGHHFNSKTMTKDGVHGLTLHHRTANGRPPVTFAVAAEE 286
           ++ FT+ N       + +   +   F SK      V G+ L H  +  R   T+AVAA  
Sbjct: 226 SITFTFRNGTGYHKWSQEGNCTSSLFESK-HAHSRVEGVCLEHTISRMR--CTYAVAANC 282

Query: 287 TADVHVSECPCFLLSGNSKGITAKDMWNEIKKHGSFDHLDNDKTSPSEPGSSIGAAIAAS 346
                VS C  F  S      +  ++W  +   G   HL N       P   +  A+ A 
Sbjct: 283 NQRTTVSRCAAFDPSK-----SGDEIWQSLWSTG---HLQN--VGEVTPTRELAIAVCAQ 332

Query: 347 LTIPSGSTRS--VTFSLAWDCPEVKF--FEKVYHRRYTKFYGTLGDSAARIAHDAILEHA 402
             + + S     + F+LAWD P V+F   ++ Y RRYT+F+G  GD+A  +   ++  + 
Sbjct: 333 TIVKANSKAEAPMEFALAWDMPIVQFGARKRNYRRRYTRFFGDRGDAAPALCAHSLTVYE 392

Query: 403 KWECEIEAWQRPILEDKRFPEWYPITLFNELYYLNAGGTIWTDGSPPMQSLATIRERKFS 462
           +WE  I  WQ P+L  +  P+WY   LFNELY++  GG++W +               F 
Sbjct: 393 QWERRINDWQDPVLNHRNLPDWYKSALFNELYFMTDGGSVWFE---------------FE 437

Query: 463 LDTPSKIHTPASSDTALGTRLLENGEENIGQFLYLEGAEYVMYNTYDVHFYSSFALVMLF 522
            D   +    +    AL         ++ G+F YLE  EY M NTYDVHFY+SFAL  L+
Sbjct: 438 EDWACEEQHLSPYSIAL--------MKDFGRFGYLESWEYRMVNTYDVHFYASFALAQLW 489

Query: 523 PKLELSIQRDFAAAVMMHDPGTMKIMSDGKWVARKCLGAVPHDIG--LDDPWFEINSYNL 580
           P LE  IQ +F   +   D   +    +G    +K    VPHD+G    DPW   N+Y +
Sbjct: 490 PSLEHVIQAEFTDQIEHFDGKPVIYHMEGNRGPKKSRTRVPHDLGNPAVDPWLLTNAYVM 549

Query: 581 FNSSRWKDLNSKFVLQVYRDF--VATGDKNFARAVWPSVY-IAMAYMEQFDKDGDGMIEN 637
            ++ +WKDLN KF+L  +RD+  V   D+ F   VWP+V  +    ++ +D+DGDGMIEN
Sbjct: 550 HDTGKWKDLNLKFILISFRDYTCVLGKDQRFLNLVWPAVKSLINEGLKNWDQDGDGMIEN 609

Query: 638 EGFPDQTYDAWSANGVSAYCGGLWVAALQAASALANDVGDHASASYFWVRYQKAKAVY-D 696
            G  DQTYDAW  +GVSAYCG LW+AAL+AA  +A +VGD A A  +     +A+  Y  
Sbjct: 610 FGAADQTYDAWKMSGVSAYCGCLWLAALRAALVMAREVGDSACAHRYAETLSRAREAYIT 669

Query: 697 SLWNGSYFNYDNSDGSSSTSIQADQLAGQWYARACGLLPIADE----AKVKKALTKIYDF 752
            LWNG Y+N+D    S  T I ADQL G W+ ++     +ADE    A VK AL  IY++
Sbjct: 670 KLWNGEYYNFDERSRSRHT-IMADQLCGYWFLQSIS-PELADELLPAAHVKSALNIIYNY 727

Query: 753 NVLKVKGGMCGAMNGMQPDGRIDMSGLQAREIWPGVTYGLAASMIQEEMVDMAFQTAAGV 812
           NVL+  GG  GA+NGM+ DG +D + +QA E+W GVTY LAA +IQ+  V   F TA G 
Sbjct: 728 NVLRFAGGRLGAVNGMRRDGTVDRNYIQADEMWTGVTYALAAFLIQQGAVQRGFDTAWGC 787

Query: 813 YEVAWSEDGLGYSFQTPESWNNNDEYRSLCYMRPLTIWAMQWALTK 858
           Y+  ++  GL Y  QTPE+      YR++ YMRPL+IWAMQWAL +
Sbjct: 788 YDGCFNRFGLQY--QTPEALYEQRFYRAIGYMRPLSIWAMQWALIR 831


>gi|195115764|ref|XP_002002426.1| GI12858 [Drosophila mojavensis]
 gi|193913001|gb|EDW11868.1| GI12858 [Drosophila mojavensis]
          Length = 877

 Score =  466 bits (1200), Expect = e-128,   Method: Compositional matrix adjust.
 Identities = 297/877 (33%), Positives = 433/877 (49%), Gaps = 163/877 (18%)

Query: 103 YRGEFQRFKLFHGICDDAPVLANQFSVFVSRPNG-EKFSSVLC----------PRSPGVP 151
           Y GEF RF+L  GI +   V ANQF V +    G   F S+L            ++P   
Sbjct: 47  YAGEFCRFQLRPGIYEYNVVHANQFIVTIKDHKGCTIFQSLLSRCSTKSFKAKTKTPSTN 106

Query: 152 KKNTDSG------------------------IESWDWNLKGENCTYHALFPRAWTVYDGE 187
             + D+                         + +W  N++   C+Y  L+PRAWT YD  
Sbjct: 107 NNDIDASTDEAEAQSTKCQLPHCSCSRNRQPLSTWHSNIEDSRCSYTGLYPRAWTEYDLS 166

Query: 188 PDPELRIVCRQISPFIPHNYKESSFPASVFTFTLSNSGQTSADVTLLFTWANSVAGDSGL 247
               +R+VCRQISP IPH+YKESS P +VF +++ N       V++ FT+ N        
Sbjct: 167 -HYGVRLVCRQISPVIPHDYKESSLPCAVFVWSVQNVCDQERKVSITFTFKNGTGNKKQD 225

Query: 248 SGHHFNSKTMTKDGVHGLTLHHRTANGRPPVTFAVAAEETADVHVSECPCFLLSGNSKGI 307
           +  +  ++ + +    G+ +    A+   P ++ +A     ++ ++ CP F  +G     
Sbjct: 226 AEGNPEAQLIAEGNAKGVAIRQTIADM--PCSYNLACRVLPEISITRCPQFDPAG----- 278

Query: 308 TAKDMWNEIKKHGSFDHLDNDKTSPSEPGSSIGAAIAASLTIPSGSTRSVTFSLAWDCPE 367
           + + +W ++K++G    L+   TS +     IG A+ A L +   ++  + F LAWD P+
Sbjct: 279 SGEQLWAQLKEYG---QLNEQPTSETLKTKDIGVAVCAQLALKPQASHELEFVLAWDMPK 335

Query: 368 VKFFEKV--YHRRYTKFYGTLGDSAARIAHDAILEHAKWECEIEAWQRPILEDKRFPEWY 425
           ++F  K+  + R YTK++   G+   RI   A+  +A WE  I+AWQRPIL D+  P+WY
Sbjct: 336 IQFPRKLQTHTRYYTKYFDDSGECGPRICEYALKHYASWERLIDAWQRPILNDEGLPDWY 395

Query: 426 PITLFNELYYLNAGGTIWTDGSPPMQSLATIRERKFSLDTPSKIHTPASSDTALGTRL-L 484
              +FN+LYY++ GGTIW                                D++LG +L  
Sbjct: 396 KCAIFNQLYYISDGGTIWL-----------------------------KCDSSLGNQLAF 426

Query: 485 ENGEENIGQFLYLEGAEYVMYNTYDVHFYSSFALVMLFPKLELSIQRDFAAAVMMHDPGT 544
           ++     G+F YLEG EY MYNTYDVHFY+S AL  L+P L++S+Q DF  A+      T
Sbjct: 427 DDPRLAYGRFGYLEGHEYRMYNTYDVHFYASPALAHLWPNLQVSLQYDFKDAIGAELNDT 486

Query: 545 MKIMSDGKWVARKCLGAVPHDIGLDD--PWFEINSYNLFNSSRWKDLNSKFVLQVYRDFV 602
            K++ DGK + RK    VPHD+G  D  P+  IN YN+ + + WKDLN+KFVLQVYRD+ 
Sbjct: 487 RKMLYDGKVMPRKIKNCVPHDLGDPDEEPFTLINCYNIHDVNDWKDLNTKFVLQVYRDYY 546

Query: 603 ATGDKNFARAVWPSVYIAMAYM------EQFDKDG------------------------- 631
              +   A+A   S + ++ ++      EQ+ +D                          
Sbjct: 547 VLNELAQAQADNASKFSSIEFIDKESLYEQYSQDNNRKSSADGKQQNRKSASMYINETNG 606

Query: 632 -----DGMI---------------------------ENEGFPDQTYDAWSANGVSAYCGG 659
                D M+                           EN   PDQTYDAW  +G SAYC G
Sbjct: 607 KVYLMDAMVYLKSMYGACKAIMERTIEYDKDNDGLIENTKMPDQTYDAWVMDGPSAYCSG 666

Query: 660 LWVAALQAASALANDVGDHASASYFWVRYQK-----AKAVYDSLWNGSYFNYDNSDGSSS 714
           LW+AALQA S +A  +          +RYQ       +++ + LWNGSY+ +D S     
Sbjct: 667 LWLAALQAMSVMATLLDQPNDC----LRYQDILEKGKRSLEEKLWNGSYYRFDLSHNHRD 722

Query: 715 TSIQADQLAGQWYARACGL-LPIADEAKVKKALTKIYDFNVLKVKGGMCGAMNGMQPD-- 771
           T I ADQL G WY ++CG    I  +  V+ AL +IYD NV+    G  GA NG   +  
Sbjct: 723 T-IMADQLCGHWYLKSCGFDYEIYPKENVRTALKRIYDNNVMGFHDGNIGAANGFIANVS 781

Query: 772 -----GRIDMSGLQAREIWPGVTYGLAASMIQEEMVDMAFQTAAGVYEVAWSEDGLGYSF 826
                G +D S +QA E+WPGV Y LAA+MIQE M D AFQTA G+Y+      G+ Y  
Sbjct: 782 EPSKPGHVDNSNIQAEEVWPGVVYALAATMIQEGMFDEAFQTAGGMYKTLSQRIGMNY-- 839

Query: 827 QTPESWNNNDEYRSLCYMRPLTIWAMQWALTKPKLSR 863
           +TPE+      YRS+ YMRPL+IW+MQ A  + +  R
Sbjct: 840 ETPEALYGEKRYRSIGYMRPLSIWSMQVAWERRRALR 876


>gi|45552032|ref|NP_788055.2| CG33090 [Drosophila melanogaster]
 gi|74876618|sp|Q7KT91.1|C3390_DROME RecName: Full=Non-lysosomal glucosylceramidase; Short=NLGase
 gi|45445134|gb|AAO41192.2| CG33090 [Drosophila melanogaster]
 gi|201065481|gb|ACH92150.1| FI02015p [Drosophila melanogaster]
          Length = 948

 Score =  463 bits (1192), Expect = e-127,   Method: Compositional matrix adjust.
 Identities = 289/863 (33%), Positives = 429/863 (49%), Gaps = 149/863 (17%)

Query: 103 YRGEFQRFKLFHGICDDAPVLANQFSVFVSRPNGEKFSSVLCPRSPGVPKKNTDSG---- 158
           Y GEF RF++  GI +   VLANQF V +  P G      L  +     K +   G    
Sbjct: 132 YAGEFCRFQMRPGIYEYNVVLANQFIVTIKDPKGCTIFQSLLSKCSTRDKTSDPDGDPDG 191

Query: 159 -----------------IESWDWNLKGENCTYHALFPRAWTVYDGEPDPELRIVCRQISP 201
                            + +W  N++   C+Y  L+PR+WT YD      +R+ CRQ+SP
Sbjct: 192 ERTKCQLPNCSSRAKQPLSAWHSNIEDTRCSYTGLYPRSWTEYDLS-HYGVRLTCRQVSP 250

Query: 202 FIPHNYKESSFPASVFTFTLSNSGQTSADVTLLFTWANSVAGDSGLSGHHFNSKTMTKDG 261
            IPH Y+ESS P +VF +++ N       V++ FT+ N        +     S+ +++  
Sbjct: 251 VIPHEYRESSLPCAVFVWSVENVCDQERKVSITFTFKNGTGNKKQDAEGGAESQLISEGN 310

Query: 262 VHGLTLHHRTANGRPPVTFAVAAEETADVHVSECPCFLLSGNSKGITAKDMWNEIKKHGS 321
             G+++  + +    P ++ +A     ++ ++ CP F  +GN      + +W ++K+HG 
Sbjct: 311 AKGVSIRQKISEM--PCSYNLACRVLPEISITRCPQFDPAGN-----GEQLWAQLKEHG- 362

Query: 322 FDHLDNDKTSPSEPGSSIGAAIAASLTIPSGSTRSVTFSLAWDCPEVKFFEKV--YHRRY 379
              L    TS +     IG A+   + +   ++  + F LAWD P+++F  K+  + R Y
Sbjct: 363 --QLSEHPTSEALKTKDIGVAVCGQVALKPMASHDLEFVLAWDMPKIQFPRKMQTHTRYY 420

Query: 380 TKFYGTLGDSAARIAHDAILEHAKWECEIEAWQRPILEDKRFPEWYPITLFNELYYLNAG 439
           TK++   GDS  RI   A+ +++ WE  I+AWQRPIL D+  P+WY   +FN+LY+++ G
Sbjct: 421 TKYFDDSGDSGPRICEYALRQYSTWERLIDAWQRPILNDETLPDWYKCAIFNQLYFISDG 480

Query: 440 GTIWTDGSPPMQSLATIRERKFSLDTPSKIHTPASSDTALGTRL-LENGEENIGQFLYLE 498
           GTIW                                D++LG  L  ++     G+F YLE
Sbjct: 481 GTIWL-----------------------------KCDSSLGKELAYDDPRLAYGRFGYLE 511

Query: 499 GAEYVMYNTYDVHFYSSFALVMLFPKLELSIQRDFAAAVMMHDPGTMKIMSDGKWVARKC 558
           G EY MYNTYDVHFY+S AL  L+P L++S+Q DF  A+      T K++ DGK + RK 
Sbjct: 512 GHEYRMYNTYDVHFYASPALAHLWPNLQVSLQYDFKDAIAAELNDTRKMLYDGKVMPRKV 571

Query: 559 LGAVPHDIGLDD--PWFEINSYNLFNSSRWKDLNSKFVLQVYRDFVA------------- 603
              VPHD+G  D  P+  IN YN+ + + WKDLN+KFVLQVYRD+               
Sbjct: 572 KNCVPHDLGDPDEEPFTLINCYNIHDVNDWKDLNTKFVLQVYRDYYVLNELAQAQSDNAS 631

Query: 604 -------------------------TGDKNFARAVWPSVYI-----------AMAYMEQF 627
                                    + D+        S+YI           A+ Y++  
Sbjct: 632 KFSSIEFIDKESLYELYSQDNKRKNSADEKQQNRKSASMYINETNGKVYLMDAIGYLKAM 691

Query: 628 DKDGDGMIE--------NEGFPDQT------YDAWSANGVSAYCGGLWVAALQAASALAN 673
                 ++E        N+G  + T      YD+W  +G SAYC GLW+AALQA SA+A 
Sbjct: 692 YASCKAIMERTIEYDKDNDGLIENTKMPDQTYDSWVMDGPSAYCSGLWLAALQAMSAMAT 751

Query: 674 DVGDHASASYFWVRYQK-----AKAVYDSLWNGSYFNYDNSDGSSSTSIQADQLAGQWYA 728
            + D  +     +RYQ       +++ + LWNGSY+ +D S     T I ADQL G WY 
Sbjct: 752 -ILDQPNDC---LRYQDILEKGKRSLEEKLWNGSYYRFDLSHSHRDT-IMADQLCGHWYL 806

Query: 729 RACGL-LPIADEAKVKKALTKIYDFNVLKVKGGMCGAMNGMQPD-------GRIDMSGLQ 780
           ++CG    I  +  V+ AL +IYD NV+    G  GA NG   +       G +D S +Q
Sbjct: 807 KSCGFDYEIYPKENVRTALKRIYDNNVMGFHEGNIGAANGFIANASEPTKPGHVDNSNIQ 866

Query: 781 AREIWPGVTYGLAASMIQEEMVDMAFQTAAGVYEVAWSEDGLGYSFQTPESWNNNDEYRS 840
           A E+WPGV Y LAA+MIQE M + AFQTA G+Y+       +G +F+TPE+      YRS
Sbjct: 867 AEEVWPGVVYALAATMIQEGMFEEAFQTAGGMYKTL--SQRIGMNFETPEALYGEKRYRS 924

Query: 841 LCYMRPLTIWAMQWALTKPKLSR 863
           + YMRPL+IW+MQ AL + +  R
Sbjct: 925 IGYMRPLSIWSMQVALERRRAQR 947


>gi|94968715|ref|YP_590763.1| hypothetical protein Acid345_1688 [Candidatus Koribacter versatilis
           Ellin345]
 gi|94550765|gb|ABF40689.1| protein of unknown function DUF608 [Candidatus Koribacter
           versatilis Ellin345]
          Length = 811

 Score =  463 bits (1192), Expect = e-127,   Method: Compositional matrix adjust.
 Identities = 284/821 (34%), Positives = 418/821 (50%), Gaps = 91/821 (11%)

Query: 66  EEAKGKVPVFNVFRKHHITSDQGIPLGGIGAGSIGRSYRGEFQRFKLFHGICDDAPVLAN 125
           + A GK P   +    +    QG P+GG GAG+  R+YRG F+R+ +  GI     V AN
Sbjct: 43  QNAGGKKPGLPIIDDGYW---QGAPVGGFGAGTFSRTYRGNFERWHVKAGIHKYQNVPAN 99

Query: 126 QFSVFVSRPNGEKFSSVLCPRSPGVPKKNTDSGIESWDWNLKGENCTYHALFPRAWTVYD 185
           QF+VF         + VL  + P          + SW+W        Y AL+P++W  Y 
Sbjct: 100 QFAVFAQADGDAAVAQVLSTQKP------EHGELSSWNWTYPVGAGDYAALYPKSWFAYK 153

Query: 186 GEPDPELRIVCRQISPFIPHNYKESSFPASVFTFTLSNSGQTSADVTLLFTWANSV---- 241
             P   + +   Q SP IP NYKESS+P +V+ + ++N  Q    V++LF+W N V    
Sbjct: 154 -SPQLPITLTLEQFSPVIPDNYKESSYPVAVYNWYVTNPSQKRVTVSILFSWTNMVGWFR 212

Query: 242 -------AGDSGLSGHHFNSKTMTKDGVHGLTLHHRTANGRPPVT------FAVAAEETA 288
                  A  +    +H+ S+ +    + G+      +    PV+      FA+A+   +
Sbjct: 213 DTTTGFGAALNSQDINHYKSEAIKSSQMQGIVFDRLRSE---PVSDEWDGQFAIASLAGS 269

Query: 289 DVHVSECPCFLLSGNSKGITAKDMWNEIKKHGSFDHLDNDKTSPSEPGSSIGAAIAASLT 348
            V +S    +  SG+       D+W    K G    L N     +  G S+  AIA   T
Sbjct: 270 GVEISYLTTYRPSGD-----GSDVWTPFAKDG---RLPNVAAGLASSGDSLAGAIAVRFT 321

Query: 349 IPSGSTRSVTFSLAWDCPEVKFFE-KVYHRRYTKFYGTLGDSAARIAHDAILEHAKWECE 407
           +  G  R++  +LAWD P ++F   + + R YT+F+   G +A ++A  A+ +   W  +
Sbjct: 322 LAPGEKRTIPMALAWDLPFIQFGSGRKWARHYTRFFDRFGTNAWKLARTALEQGNDWSRQ 381

Query: 408 IEAWQRPILEDKRFPEWYPITLFNELYYLNAGGTIWTDGSPPMQSLATIRERKFSLDTPS 467
           I+AWQ+P + D+  P WY   LFNELY L  GGT+W                   + TP 
Sbjct: 382 IDAWQKPYISDESKPLWYRGMLFNELYDLVDGGTMWAH----------------EVGTPG 425

Query: 468 KIHTPASSDTALGTRLLENGEENIGQFLYLEGAEYVMYNTYDVHFYSSFALVMLFPKLEL 527
             H P++                   F YLE  +Y  Y T DV FY SF L   +P++E 
Sbjct: 426 NPHHPSAVKD--------------DSFSYLECFDYAYYGTLDVRFYGSFPLAKFWPEIEK 471

Query: 528 SIQRDFAAAVMMHDPGTMKIMSDGKW----------VARKCLGAVPHDIG--LDDPWFEI 575
              R++A  +    P  +       W            RK  G+ PHD+G  L+D     
Sbjct: 472 QEMREYADTI----PENIDQKYIWAWKSDNEHKLEPTQRKVAGSAPHDLGSPLEDALILP 527

Query: 576 NSYNLFNSSRWKDLNSKFVLQVYRDFVATG--DKNFARAVWPSVYIAMAYMEQFDKDGDG 633
           N YN  + S WKDLNSK+VL V+RD+V +G  DK F +  + +V  +M ++ QFDK+ DG
Sbjct: 528 NQYNYQSVSFWKDLNSKYVLMVWRDYVMSGSKDKAFLQYNYNAVKQSMQFLRQFDKNNDG 587

Query: 634 MIENEGFPDQTYDAWSANGVSAYCGGLWVAALQAASALANDVGDHASASYFWVRYQKAKA 693
           +IEN+G+PDQTYD WSA G SAY G L++AAL+A   +A  +GD  +A+     +++A+ 
Sbjct: 588 LIENDGYPDQTYDNWSARGESAYSGSLYLAALRATEEMAKLLGDQRTAADTAALFKRAQD 647

Query: 694 VY-DSLWNGSYFNYDNSDGSSSTSIQADQLAGQWYARACGLLPIADEAKVKKALTKIYDF 752
            +   LWNG+YFNYD    S   +I A+QLAGQWYA   GL  +        AL K+YDF
Sbjct: 648 SFVKKLWNGTYFNYD-VGSSYKDAIMAEQLAGQWYASLTGLGDLVPRDMQHSALKKVYDF 706

Query: 753 NVLKVKGGMCGAMNGMQPDGRIDMSGLQAREIWPGVTYGLAASMIQEEMVDMAFQTAAGV 812
           NV+K++ G  GA+NG+   G +     Q  E+W GVT+ +AA+M+Q  + +  F TA GV
Sbjct: 707 NVMKLQNGTMGALNGISASGEVLKDNEQTEEVWTGVTFAVAATMLQNGLREEGFNTAKGV 766

Query: 813 YEVAWSEDGLGYSFQTPESWNNNDEYRSLCYMRPLTIWAMQ 853
           Y V +  D  GY F+TPE+++ N  YR+  YMRP  IW+M+
Sbjct: 767 YNVVY--DQKGYWFRTPEAYDTNGMYRASMYMRPGAIWSME 805


>gi|268569852|ref|XP_002640631.1| Hypothetical protein CBG08749 [Caenorhabditis briggsae]
          Length = 899

 Score =  459 bits (1181), Expect = e-126,   Method: Compositional matrix adjust.
 Identities = 291/846 (34%), Positives = 442/846 (52%), Gaps = 69/846 (8%)

Query: 37  KVPTGFTLSFQEFRHLAHIGFRLYRYSKQEEAKGKVPVFNVFRKHHITSDQGIPLGGIGA 96
           +VP      +Q +  L  +      + K    K K+ + N F+        G+P GGIG 
Sbjct: 42  RVPFNRPTPYQIYEALPFVRRYFMYWMKHTFDKEKLFI-NTFQPLKHKPYYGVPCGGIGC 100

Query: 97  GSIGRSYRGEFQRFKLFHGICDDAP--VLANQFSVFVSRPNGEKFSSVLCPRSPGVPKKN 154
           G+IGR +RG F +F L  G+ +     V A+QF + V   +   +  VL           
Sbjct: 101 GAIGRDFRGGFCKFSLRPGLVEHKVDVVAADQFILSVRENDRCIYQKVLS----AADVHR 156

Query: 155 TDSGIESWDWNLKGENCTYHALFPRAWTVYDGEPDPELRIVCRQISPFIPHNYKESSFPA 214
           +   + +WD+    +N  Y  LFPR+WT +   PD  L +V RQ+SP +P+NY+++++P 
Sbjct: 157 SSGQLSTWDFGFPKKNVHYRGLFPRSWTTFRI-PDLGLTVVIRQVSPVLPNNYEDTTYPV 215

Query: 215 SVFTFTLSNS-GQTSADVTLLFTWANSVAG-----DSGLSGHHFNSKTMTKDGVHGLTLH 268
            +F   + N+      DV++ FT+ N         ++  S   F +   + + V G++L+
Sbjct: 216 CLFLIDVENTTNDREYDVSVAFTFRNGTGNRRWEREAECSAVMFETPEDSAEQVTGVSLY 275

Query: 269 HRTANGRPPVTFAVAAEETADVHVSECPCFLLSGNSKGITAKDMWNEIKKHGSFDHLDND 328
           HR +      T+ +A+  + D  +S C  F  S +        +WN + + G    L+ +
Sbjct: 276 HRISGM--SCTYGLASTHSKDSVLSVCERFDPSKSGGA-----LWNHLMQTGDVPSLEEE 328

Query: 329 KTSPSEPGSSIGAAIAASLTIPSGSTRSVTFSLAWDCPEVKFFE--KVYHRRYTKFY--G 384
               +     +  A++   T+P  S+++  ++L+WD P+V F    + YHRRYT+F+   
Sbjct: 329 CAINAR---EMAVAVSNRFTLPPLSSKTHDYALSWDMPKVHFGSTGRSYHRRYTRFFEAS 385

Query: 385 TLGDSAARIAHDAILEHAKWECEIEAWQRPILEDKRFPEWYPITLFNELYYLNAGGTIWT 444
             G +A  +   A+    KWE EIE WQ PIL   + P+WY   +FNELY++  GGT+W 
Sbjct: 386 EAGSAADSLCVRALRLRHKWEEEIEKWQSPILTHDKLPDWYKSAIFNELYFITDGGTVW- 444

Query: 445 DGSPPMQSLATIRERKFSLDTPSKIHTPASSDTALGTRLLENGEENIGQFLYLEGAEYVM 504
                           F  D     H    S+        +   +NIG+F YLE  EY M
Sbjct: 445 ----------------FEFDENWADHEAHLSNYT------KEKMKNIGRFGYLESWEYRM 482

Query: 505 YNTYDVHFYSSFALVMLFPKLELSIQRDFAAAVMMHDPGTMKIMSDGKWVARKCLGAVPH 564
            NTYDVHFY+S+AL  L+P+LEL++Q +F   V        +   +G W   K    VPH
Sbjct: 483 VNTYDVHFYASYALAELWPELELTVQSEFTDQVYHSIEKPTRFHMEGDWANVKTASRVPH 542

Query: 565 DIG--LDDPWFEINSYNLFNSSRWKDLNSKFVLQVYRDFVATGDKN--FARAVWPSV-YI 619
           D+G   D+PW   N+Y + ++ +WKDLN K+VL  +RD+V   +++  F    WP+V  I
Sbjct: 543 DLGNPADEPWIATNAYVMHDTGKWKDLNMKYVLTSWRDYVVLSEEHQEFLHHTWPAVKMI 602

Query: 620 AMAYMEQFDKDGDGMIENEGFPDQTYDAWSANGVSAYCGGLWVAALQAASALANDVGDHA 679
            +  +  +D++GDGMIEN G  DQTYDAW   GVSAYCG LW+A+L+ +  +A  + D  
Sbjct: 603 MLEALANWDQNGDGMIENFGKADQTYDAWQMEGVSAYCGSLWLASLRVSIEMAGLLEDGE 662

Query: 680 SASYFWVRYQKAKAVY-DSLWNGSYFNYDNSDGSSSTSIQADQLAGQWYARACG------ 732
           +   F     KAK V+ D LW G+YF +     S  T + ADQL G W+ ++        
Sbjct: 663 TKKLFLETLNKAKKVFVDVLWTGTYFRFCERSRSRET-VMADQLCGYWFLQSVSPEMADD 721

Query: 733 LLPIADEAKVKKALTKIYDFNVLKVKGGMCGAMNGMQPDGRIDMSGLQAREIWPGVTYGL 792
           LLP      V+ AL  IY  NV +   G  GA+NGM+P G +D   +QA E+W GVTY +
Sbjct: 722 LLP---SHMVRSALDTIYRLNVCQFGNGQMGAVNGMKPSGVVDREYIQADEMWTGVTYAV 778

Query: 793 AASMIQEEMVDMAFQTAAGVYEVAWSEDGLGYSFQTPESWNNNDEYRSLCYMRPLTIWAM 852
           A+ +IQ+  V+ AF TA+G Y   + + GL Y  QTPE+   +  YR++ YMRPL+IWAM
Sbjct: 779 ASLLIQQGEVEKAFHTASGSYLTCFEQTGLQY--QTPEALYESKFYRAIGYMRPLSIWAM 836

Query: 853 QWALTK 858
           QW+L K
Sbjct: 837 QWSLKK 842


>gi|170036781|ref|XP_001846240.1| bile acid beta-glucosidase [Culex quinquefasciatus]
 gi|167879683|gb|EDS43066.1| bile acid beta-glucosidase [Culex quinquefasciatus]
          Length = 913

 Score =  459 bits (1181), Expect = e-126,   Method: Compositional matrix adjust.
 Identities = 303/865 (35%), Positives = 434/865 (50%), Gaps = 157/865 (18%)

Query: 103 YRGEFQRFKLFHGICDDAPVLANQFSVFVSRPNGEK-FSSVL------------------ 143
           + GEF RF++  G+ +   V ANQF V +   N    F S+L                  
Sbjct: 101 FAGEFCRFQMRPGLYEYNTVHANQFIVTIKDENNVTIFQSLLSSYRTKDDEVHDVCAPNQ 160

Query: 144 ---------CPRSPGVPKKNTDSGIESWDWNLKGENCTYHALFPRAWTVYDGEPDPELRI 194
                    C   P  P       + SWD  +    C+Y AL+PRAW+ YD   +  +++
Sbjct: 161 TTLLHGLMPCETRPKTP-------LASWDSTINSSKCSYTALYPRAWSEYDLS-EHGIKL 212

Query: 195 VCRQISPFIPHNYKESSFPASVFTFTLSNSGQTSADVTLLFTWANSVAGDSGLSGHHFNS 254
           V RQISP IPH+YKESS P +VF +T+ N       VT+ FT+ N        +  +  +
Sbjct: 213 VQRQISPIIPHDYKESSLPVAVFAWTVENVCGKDRHVTITFTFKNGTGNKKQDAEGNAET 272

Query: 255 KTMTKDGVHGLTLHHRTANGRPPVTFAVAAEETADVHVSECPCFLLSGNSKGITAKDMWN 314
                    G+++    A+   P T+ V+ + +++++++ C  F  SGN      + +WN
Sbjct: 273 AAFAHGTAKGVSIKQTIAD--LPCTYCVSCKTSSEINLTRCEKFDPSGN-----GEKLWN 325

Query: 315 EIKKHGSFDHLDNDKTSPSEPGSSIGAAIAASLTIPSGSTRSVTFSLAWDCPEVKF--FE 372
           ++K++G   HL       S     +  A++A + +PS S+  + FSL WD P++ F    
Sbjct: 326 DLKENG---HLTEKSGDESLKTKDVAVAVSAQILVPSESSSELEFSLVWDMPKIHFTKKM 382

Query: 373 KVYHRRYTKFYGTLGDSAARIAHDAILEHAKWECEIEAWQRPILEDKRFPEWYPITLFNE 432
           K Y++ YTK++G  GD+   I+  A+  + KWE  I+ WQRPIL+D   P+WY   +FNE
Sbjct: 383 KEYYKYYTKYFGKNGDAGPVISDYALRNYGKWESLIDEWQRPILDDSDLPDWYKSAIFNE 442

Query: 433 LYYLNAGGTIWTDGSPPMQSLATIRERKFSLDTPSKIHTPASSDTALGTRLLENGEENIG 492
           LY++  GG+IW            + +     D P   +                     G
Sbjct: 443 LYFIADGGSIWL----------AVDDTTLPYDDPRLAY---------------------G 471

Query: 493 QFLYLEGAEYVMYNTYDVHFYSSFALVMLFPKLELSIQRDFAAAVMMHDPGTMKIMSDGK 552
           ++ YLEG EY MYNTYDVHFY+S AL  L+P L++S+Q D+  AV        K + DGK
Sbjct: 472 RYAYLEGHEYRMYNTYDVHFYASHALASLWPNLQVSLQYDYKDAVQREITEGRKHLYDGK 531

Query: 553 WVARKCLGAVPHDIGLDD--PWFEINSYNLFNSSRWKDLNSKFVLQVYRDF--------- 601
            + RK   +VPHD+G  D  P+  IN+Y + + S WKDLN+KF+LQVYRD+         
Sbjct: 532 VIPRKIKSSVPHDLGDPDEEPFELINAYPIHDVSEWKDLNTKFILQVYRDYYTLNHYAQL 591

Query: 602 ------------------------------VATGDKNFARAVWPSVYI-----------A 620
                                         V+  DK   R    S+YI           A
Sbjct: 592 NADNASKFSSIEFIDKESMFEMYVQDNRNKVSPEDKAQNRKS-ASMYINETNGKVYLMDA 650

Query: 621 MAYME--------------QFDKDGDGMIENEGFPDQTYDAWSANGVSAYCGGLWVAALQ 666
           M Y++              ++DKD DG+IEN   PDQTYD W  +G S+YCGGLW+A+L 
Sbjct: 651 MTYLKSMYPACKQVLEKTLEWDKDNDGLIENSKSPDQTYDTWVMDGPSSYCGGLWLASLH 710

Query: 667 AASALANDVGDHASASYFWVRYQKAKAVYDS-LWNGSYFNYDNSDGSSSTSIQADQLAGQ 725
             S +AN +        +    +K KA ++  LWNGSY+ +D S  SS  SI +DQL G 
Sbjct: 711 CMSVMANLLDQSEDCIKYREILEKGKASFEEKLWNGSYYKFD-SQSSSKNSIMSDQLCGH 769

Query: 726 WYARACGL-LPIADEAKVKKALTKIYDFNVLKVKGGMCGAMNGMQP------DGRIDMSG 778
           WY R CG    I  +  V+ AL  IYD NV++  GG  GA+NG  P      DGR D S 
Sbjct: 770 WYLRCCGFDYDIVPKENVRTALKTIYDNNVMRFCGGNMGAVNGYIPSSQPNKDGRADSST 829

Query: 779 LQAREIWPGVTYGLAASMIQEEMVDMAFQTAAGVYEVAWSEDGLGYSFQTPESWNNNDEY 838
           LQA E+W GVTY LA++MI E M   AFQTA G+Y+     + +G SF+TPE+      Y
Sbjct: 830 LQAEEVWTGVTYALASTMIHEGMFTEAFQTAGGLYQTL--SEKIGMSFETPEALYAERHY 887

Query: 839 RSLCYMRPLTIWAMQWALTKPKLSR 863
           R++ YMRPL+IW+MQ A  + K +R
Sbjct: 888 RAIGYMRPLSIWSMQTAWEQKKQNR 912


>gi|357440847|ref|XP_003590701.1| Non-lysosomal glucosylceramidase [Medicago truncatula]
 gi|355479749|gb|AES60952.1| Non-lysosomal glucosylceramidase [Medicago truncatula]
          Length = 992

 Score =  458 bits (1179), Expect = e-126,   Method: Compositional matrix adjust.
 Identities = 235/407 (57%), Positives = 299/407 (73%), Gaps = 6/407 (1%)

Query: 492 GQFLYLEGAEYVMYNTYDVHFYSSFALVMLFPKLELSIQRDFAAAVMMHDPGTMKIMSDG 551
           G+FLYLEG EYVM+ TYDVHFY+SFAL+MLFP++EL+IQRDFA AV+  D   +K +++G
Sbjct: 575 GRFLYLEGVEYVMWCTYDVHFYASFALLMLFPRIELNIQRDFAQAVLCEDGRKVKFLAEG 634

Query: 552 KWVARKCLGAVPHDIGLDDPWFEINSYNLFNSSRWKDLNSKFVLQVYRDFVATGDKNFAR 611
            W  RK  GAVPHD+G  DPW E+N+YN+ ++S+WKDLN KFVLQVYRDF ATGD  F  
Sbjct: 635 NWGTRKVYGAVPHDLGTHDPWHEMNAYNIHDTSKWKDLNPKFVLQVYRDFSATGDLQFGV 694

Query: 612 AVWPSVYIAMAYMEQFDKDGDGMIENEGFPDQTYDAWSANGVSAYCGGLWVAALQAASAL 671
            VWP+V  AM YMEQFD+D DG+IEN+GFPDQTYD W+ +GVSAYCGGLW+AALQAA+A+
Sbjct: 695 DVWPAVRAAMEYMEQFDRDADGLIENDGFPDQTYDTWTVHGVSAYCGGLWLAALQAAAAM 754

Query: 672 ANDVGDHASASYFWVRYQKAKAVYDS-LWNGSYFNYDNSDGSSSTSIQADQLAGQWYARA 730
           A  +GD   A     ++ KAK VY+  LWNGSYFNYD+   S+S SIQADQLAGQWY  +
Sbjct: 755 ALQLGDRDFAETCKRKFLKAKPVYEQKLWNGSYFNYDSGSSSNSKSIQADQLAGQWYTAS 814

Query: 731 CGLLPIADEAKVKKALTKIYDFNVLKVKGGMCGAMNGMQPDGRIDMSGLQAREIWPGVTY 790
            GL  + D+ K+K +L K++DFNV+KVKGG  GA+NGM P+G++D + +Q+REIW GVTY
Sbjct: 815 SGLPSLFDDFKIKSSLRKVFDFNVMKVKGGRMGAVNGMHPNGKVDETCMQSREIWAGVTY 874

Query: 791 GLAASMIQEEMVDMAFQTAAGVYEVAWSEDGLGYSFQTPESWNNNDEYRSLCYMRPLTIW 850
           G+AA+MI   M + AF TA G++   WSE+G GY FQTPE++  +  YRSL YMRPL+IW
Sbjct: 875 GVAATMILAGMEEEAFTTAEGIFLAGWSEEGSGYWFQTPEAFTIDGHYRSLIYMRPLSIW 934

Query: 851 AMQWALTKPKLSRQEIKHEISDRD-----SSYLEQHAAFSKVASLLK 892
            MQ+ALT PK   +  K    DR      S  L +     K+A+  K
Sbjct: 935 GMQYALTMPKAVLEAPKINFMDRIHLSPVSGGLHKETGVKKIATKTK 981



 Score =  439 bits (1129), Expect = e-120,   Method: Compositional matrix adjust.
 Identities = 222/446 (49%), Positives = 302/446 (67%), Gaps = 17/446 (3%)

Query: 18  VDPGQPPQLTWQRKLDSNVKVPTGFTLSFQEFRHLAHIGFRLYRYSKQEEAKGKVPVFNV 77
           +D   PP+  W+R+L+S+  +   F ++F E   +  +G R++ Y ++E + G+    + 
Sbjct: 30  IDNAAPPEQAWRRRLNSHANLLKEFRVTFMEAIKMVRLGIRMWSYVREEASHGRKAPIDP 89

Query: 78  FRKHHI--TSDQGIPLGGIGAGSIGRSYRGEFQRFKLFHGICDDAPVLANQFSVFVSRPN 135
           F +     ++ QG+PLGG+G+GSI R +RGEF+++++  G+C+ +PV+ANQFS+FVSR  
Sbjct: 90  FTRESCKPSASQGVPLGGMGSGSISRGFRGEFRQWQIIPGLCEPSPVMANQFSIFVSREG 149

Query: 136 GEK-FSSVLCP---RSPGVPKKNTDSGIESWDWNLKGENCTYHALFPRAWTVYDGEPDPE 191
           G K F+SVL P      G  +K  D GI SW WNL G++ TYHALFPRAWTVYDGEPDPE
Sbjct: 150 GNKSFASVLAPGQHEGVGACRKADDQGISSWGWNLNGQHSTYHALFPRAWTVYDGEPDPE 209

Query: 192 LRIVCRQISPFIPHNYKESSFPASVFTFTLSNSGQTSADVTLLFTWANSVAGDSGLSGHH 251
           L+I CRQISPFIPHNY+ESS PA+VF +TL N+G+  A V+LLFTWANS+ G+S LSG H
Sbjct: 210 LKISCRQISPFIPHNYRESSLPAAVFVYTLVNTGKERAKVSLLFTWANSIGGNSHLSGDH 269

Query: 252 FNSKTMTKDGVHGLTLHH-------RTANGRPPVTFAVAAEETADVHVSECPCFLLSGNS 304
            N   + +DGV G+ L+H       RTA   PPVTF++AA ET +V VS  PCF LS  S
Sbjct: 270 VNEPFIAEDGVSGVLLYHKQVVEYFRTAKDNPPVTFSIAACETQNVSVSVLPCFGLSDRS 329

Query: 305 KGITAKDMWNEIKKHGSFD--HLDNDKTSPSEPGSSIGAAIAASLTIPSGSTRSVTFSLA 362
             +TAK MW ++ K G FD  +  +  + PS PG ++ AA+AAS  +      +V FSLA
Sbjct: 330 S-VTAKGMWTKMVKDGQFDRENFSSGPSMPSSPGETLCAAVAASAWVEPHGKCTVAFSLA 388

Query: 363 WDCPEVKFFE-KVYHRRYTKFYGTLGDSAARIAHDAILEHAKWECEIEAWQRPILEDKRF 421
           W  P+VKF +   ++RRYTKFYGT   +A  +AHDA+  + +WE EI  WQ PIL+D++ 
Sbjct: 389 WSSPKVKFVKGSTFNRRYTKFYGTSERAAVHLAHDALTHYTRWEEEIAKWQDPILKDEKL 448

Query: 422 PEWYPITLFNELYYLNAGGTIWTDGS 447
           PEWY  TLFNELY+L AGGTIW D +
Sbjct: 449 PEWYKFTLFNELYFLVAGGTIWIDST 474


>gi|357440839|ref|XP_003590697.1| Non-lysosomal glucosylceramidase [Medicago truncatula]
 gi|355479745|gb|AES60948.1| Non-lysosomal glucosylceramidase [Medicago truncatula]
          Length = 1103

 Score =  458 bits (1179), Expect = e-126,   Method: Compositional matrix adjust.
 Identities = 235/407 (57%), Positives = 299/407 (73%), Gaps = 6/407 (1%)

Query: 492  GQFLYLEGAEYVMYNTYDVHFYSSFALVMLFPKLELSIQRDFAAAVMMHDPGTMKIMSDG 551
            G+FLYLEG EYVM+ TYDVHFY+SFAL+MLFP++EL+IQRDFA AV+  D   +K +++G
Sbjct: 686  GRFLYLEGVEYVMWCTYDVHFYASFALLMLFPRIELNIQRDFAQAVLCEDGRKVKFLAEG 745

Query: 552  KWVARKCLGAVPHDIGLDDPWFEINSYNLFNSSRWKDLNSKFVLQVYRDFVATGDKNFAR 611
             W  RK  GAVPHD+G  DPW E+N+YN+ ++S+WKDLN KFVLQVYRDF ATGD  F  
Sbjct: 746  NWGTRKVYGAVPHDLGTHDPWHEMNAYNIHDTSKWKDLNPKFVLQVYRDFSATGDLQFGV 805

Query: 612  AVWPSVYIAMAYMEQFDKDGDGMIENEGFPDQTYDAWSANGVSAYCGGLWVAALQAASAL 671
             VWP+V  AM YMEQFD+D DG+IEN+GFPDQTYD W+ +GVSAYCGGLW+AALQAA+A+
Sbjct: 806  DVWPAVRAAMEYMEQFDRDADGLIENDGFPDQTYDTWTVHGVSAYCGGLWLAALQAAAAM 865

Query: 672  ANDVGDHASASYFWVRYQKAKAVYDS-LWNGSYFNYDNSDGSSSTSIQADQLAGQWYARA 730
            A  +GD   A     ++ KAK VY+  LWNGSYFNYD+   S+S SIQADQLAGQWY  +
Sbjct: 866  ALQLGDRDFAETCKRKFLKAKPVYEQKLWNGSYFNYDSGSSSNSKSIQADQLAGQWYTAS 925

Query: 731  CGLLPIADEAKVKKALTKIYDFNVLKVKGGMCGAMNGMQPDGRIDMSGLQAREIWPGVTY 790
             GL  + D+ K+K +L K++DFNV+KVKGG  GA+NGM P+G++D + +Q+REIW GVTY
Sbjct: 926  SGLPSLFDDFKIKSSLRKVFDFNVMKVKGGRMGAVNGMHPNGKVDETCMQSREIWAGVTY 985

Query: 791  GLAASMIQEEMVDMAFQTAAGVYEVAWSEDGLGYSFQTPESWNNNDEYRSLCYMRPLTIW 850
            G+AA+MI   M + AF TA G++   WSE+G GY FQTPE++  +  YRSL YMRPL+IW
Sbjct: 986  GVAATMILAGMEEEAFTTAEGIFLAGWSEEGSGYWFQTPEAFTIDGHYRSLIYMRPLSIW 1045

Query: 851  AMQWALTKPKLSRQEIKHEISDRD-----SSYLEQHAAFSKVASLLK 892
             MQ+ALT PK   +  K    DR      S  L +     K+A+  K
Sbjct: 1046 GMQYALTMPKAVLEAPKINFMDRIHLSPVSGGLHKETGVKKIATKTK 1092



 Score =  258 bits (660), Expect = 7e-66,   Method: Compositional matrix adjust.
 Identities = 131/311 (42%), Positives = 183/311 (58%), Gaps = 58/311 (18%)

Query: 18  VDPGQPPQLTWQRKLDSNVKVPTGFTLSFQEFRHLAHIGFRLYRYSKQEEAKGKVPVFNV 77
           +D   PP+  W+R+L+S+  +   F ++F E   +  +G R++ Y ++E + G+    + 
Sbjct: 30  IDNAAPPEQAWRRRLNSHANLLKEFRVTFMEAIKMVRLGIRMWSYVREEASHGRKAPIDP 89

Query: 78  FRKHHI--TSDQGIPLGGIGAGSIGRSYRGEFQRFKLFHGICDDAPVLANQFSV------ 129
           F +     ++ QG+PLGG+G+GSI R +RGEF+++++  G+C+ +PV+ANQFSV      
Sbjct: 90  FTRESCKPSASQGVPLGGMGSGSISRGFRGEFRQWQIIPGLCEPSPVMANQFSVSLVFPC 149

Query: 130 ----------------------------------------------FVSRPNGEK-FSSV 142
                                                         FVSR  G K F+SV
Sbjct: 150 TIDSLYTSLICSQIRSRQNHLWVYFPSIYEDYFVSWGNECSLFAEIFVSREGGNKSFASV 209

Query: 143 LCP---RSPGVPKKNTDSGIESWDWNLKGENCTYHALFPRAWTVYDGEPDPELRIVCRQI 199
           L P      G  +K  D GI SW WNL G++ TYHALFPRAWTVYDGEPDPEL+I CRQI
Sbjct: 210 LAPGQHEGVGACRKADDQGISSWGWNLNGQHSTYHALFPRAWTVYDGEPDPELKISCRQI 269

Query: 200 SPFIPHNYKESSFPASVFTFTLSNSGQTSADVTLLFTWANSVAGDSGLSGHHFNSKTMTK 259
           SPFIPHNY+ESS PA+VF +TL N+G+  A V+LLFTWANS+ G+S LSG H N   + +
Sbjct: 270 SPFIPHNYRESSLPAAVFVYTLVNTGKERAKVSLLFTWANSIGGNSHLSGDHVNEPFIAE 329

Query: 260 DGVHGLTLHHR 270
           DGV G+ L+H+
Sbjct: 330 DGVSGVLLYHK 340



 Score =  169 bits (427), Expect = 9e-39,   Method: Compositional matrix adjust.
 Identities = 92/183 (50%), Positives = 121/183 (66%), Gaps = 4/183 (2%)

Query: 268 HHRTANGRPPVTFAVAAEETADVHVSECPCFLLSGNSKGITAKDMWNEIKKHGSFD--HL 325
           + RTA   PPVTF++AA ET +V VS  PCF LS  S  +TAK MW ++ K G FD  + 
Sbjct: 404 YFRTAKDNPPVTFSIAACETQNVSVSVLPCFGLSDRS-SVTAKGMWTKMVKDGQFDRENF 462

Query: 326 DNDKTSPSEPGSSIGAAIAASLTIPSGSTRSVTFSLAWDCPEVKFFE-KVYHRRYTKFYG 384
            +  + PS PG ++ AA+AAS  +      +V FSLAW  P+VKF +   ++RRYTKFYG
Sbjct: 463 SSGPSMPSSPGETLCAAVAASAWVEPHGKCTVAFSLAWSSPKVKFVKGSTFNRRYTKFYG 522

Query: 385 TLGDSAARIAHDAILEHAKWECEIEAWQRPILEDKRFPEWYPITLFNELYYLNAGGTIWT 444
           T   +A  +AHDA+  + +WE EI  WQ PIL+D++ PEWY  TLFNELY+L AGGTIW 
Sbjct: 523 TSERAAVHLAHDALTHYTRWEEEIAKWQDPILKDEKLPEWYKFTLFNELYFLVAGGTIWI 582

Query: 445 DGS 447
           D +
Sbjct: 583 DST 585


>gi|374311163|ref|YP_005057593.1| glucosylceramidase [Granulicella mallensis MP5ACTX8]
 gi|358753173|gb|AEU36563.1| Glucosylceramidase [Granulicella mallensis MP5ACTX8]
          Length = 811

 Score =  458 bits (1178), Expect = e-126,   Method: Compositional matrix adjust.
 Identities = 294/807 (36%), Positives = 404/807 (50%), Gaps = 99/807 (12%)

Query: 87  QGIPLGGIGAGSIGRSYRGEFQRFKLFHGICDDAPVLANQFSVFV-SRPNGEKFSSVLCP 145
           QG+P+GG GAG+  RSYRG+F R+ +  G+    PV ANQF++F  S  +    S  L  
Sbjct: 67  QGVPVGGFGAGTFSRSYRGDFARWHIKAGVHKYEPVYANQFAMFQQSEGDAHGTSQALMN 126

Query: 146 RSPGVPKKNTDSGIESWDWNLKGENCTYHALFPRAWTVYDGEPDPELRIVCRQISPFIPH 205
             P      ++  + SW W+       Y+ LFP+AW  Y  +  P   +   Q SP +P+
Sbjct: 127 GHP------SNGELSSWQWDYPVGAGDYYGLFPKAWFDYKWDKFPA-HVTLEQFSPVLPN 179

Query: 206 NYKESSFPASVFTFTLSNSGQTSADVTLLFTWAN------------SVAGDSGLSGHHFN 253
           NY+ESS+P +V+ +   N    +  V++L +W N              A + G    + +
Sbjct: 180 NYRESSYPVAVYRWHAENPTNKTVVVSVLLSWTNMSGWFRTYTRDFKGAPNQGNHNEYVS 239

Query: 254 SKTMTKDGVHGLTLHHRTANGRP---PVTFAVAAEETADVHVSECPCFLLSGNSKGITAK 310
            K      + G+      A   P      FA+AA E+  V VS    +  SG+ K +   
Sbjct: 240 EKVGDAGTMKGIVFDRNRAGTAPNDWDGQFAIAAMESPGVEVSYQTTYQASGDGKAV--- 296

Query: 311 DMWNEIKKHGSFDHLDNDKTSPSEPGSSIGAAIAASLTIPSGSTRSVTFSLAWDCPEVKF 370
             W+   K G    L ND  S       +  AIA   T+  G  + V   ++WD   ++F
Sbjct: 297 --WSAFSKDG---RLANDGKSWVSDKEKLAGAIALRFTLRPGEKKVVPMVVSWDFSLIQF 351

Query: 371 FE-KVYHRRYTKFYGTLGDSAARIAHDAILEHAKWECEIEAWQRPILEDKRFPEWYPITL 429
            E + + RRYT FYGT G +A +IA D +L   +W  EI+ WQ P + D+  P WY   L
Sbjct: 352 GEGRKWDRRYTDFYGTSGQNAWKIARDGLLHATEWSDEIDKWQAPYITDESKPLWYRGML 411

Query: 430 FNELYYLNAGGTIWTDGSPPMQSLATIRERKFSLDTPSKIHTPASSDTALGTRLLENGEE 489
           FNELY L  GGT W               R+   D  +   TP                 
Sbjct: 412 FNELYALTDGGTFWG--------------RQAGSDAKA---TPT---------------- 438

Query: 490 NIGQFLYLEGAEYVMYNTYDVHFYSSFALVMLFPKLELSIQRDFAAAVMMHDPGTMKIMS 549
               F  LE  +Y  Y T DV FY+S  L+  +P ++  + R+FA  V    P       
Sbjct: 439 ----FALLECFDYAYYGTLDVRFYASLPLLKFWPDIDKQVLREFADTVPKEWP------E 488

Query: 550 DGKWV------------ARKCLGAVPHDIGL--DDPWFEINSYNLFNSSRWKDLNSKFVL 595
            G WV             RK +GAVPHD+G+   DP+  +N     +++ WKDLNSKFVL
Sbjct: 489 QGLWVWKTQQVGEPITHKRKKIGAVPHDLGVPEGDPFVAVNEPGWQDTNDWKDLNSKFVL 548

Query: 596 QVYRDFVATGDKN--FARAVWPSVYIAMAYMEQFDKDGDGMIENEGFPDQTYDAWSANGV 653
            VYRD+V TG K+  F R  WP+V  A+ Y+ QFD  G G+ EN G+PDQTYD W   GV
Sbjct: 549 MVYRDYVLTGRKDTAFLRETWPAVKAAIEYLRQFDHGG-GVPENSGYPDQTYDDWVVRGV 607

Query: 654 SAYCGGLWVAALQAASALANDVGDHASASYFWVRYQKAKAVYDS-LWNGSYFNYDNSDGS 712
           SAY GGLW+AAL+A    A  VGD  + + +   + K +  Y S LWNG YF YD S   
Sbjct: 608 SAYSGGLWLAALRAGEETARVVGDTKTTAEYHALFLKGQKTYISQLWNGEYFRYDTSS-E 666

Query: 713 SSTSIQADQLAGQWYARACGLLPIADEAKVKKALTKIYDFNVLKVKGGMCGAMNGMQPDG 772
           S   IQADQLAGQWYA   GL  I        A  KI+D NV+K   G  GA NGM  DG
Sbjct: 667 SKDDIQADQLAGQWYANLTGLGEIVPHTMQVSAAKKIFDVNVMKFGHGEMGAANGMTADG 726

Query: 773 RIDMSGLQAREIWPGVTYGLAASMIQEEMVDMAFQTAAGVYEVAWSEDGLGYSFQTPESW 832
            I ++  +A+E+W G T G A  ++QE M D A++T  G+Y V + +   GY F+TPE+W
Sbjct: 727 AI-LTNAEAKEVWVGTTLGYAGLLMQEGMNDEAWKTTRGLYHVIYEDK--GYWFRTPEAW 783

Query: 833 NNNDEYRSLCYMRPLTIWAMQWALTKP 859
           +    +R+  YMRP  IWA++  +T P
Sbjct: 784 DITGNFRAGMYMRPTAIWALE--MTSP 808


>gi|169642399|gb|AAI60640.1| LOC100145361 protein [Xenopus (Silurana) tropicalis]
          Length = 571

 Score =  456 bits (1174), Expect = e-125,   Method: Compositional matrix adjust.
 Identities = 243/580 (41%), Positives = 334/580 (57%), Gaps = 35/580 (6%)

Query: 286 ETADVHVSECPCFLLSGNSKGITAKDMWNEIKKHGSFDHLDNDKTSPSEPGSSIGAAIAA 345
           E   V VS C  F  +G       +++W ++ + G  +       S ++ G    AA+AA
Sbjct: 2   ERPGVCVSHCTQFDPTG-----LGQEVWKDLLEDGRLNSAPGPSPSTAK-GKKTAAAVAA 55

Query: 346 SLTIPSGSTRSVTFSLAWDCPEVKFF--EKVYHRRYTKFYGTLGDSAARIAHDAILEHAK 403
             ++P  S  ++ F L+WD P ++F   ++ +HRRYT+F+G  GD+A  + H  +     
Sbjct: 56  GCSVPPRSRETLEFCLSWDMPNIRFGAGQREHHRRYTRFFGCQGDAAPALCHYGLTHFVG 115

Query: 404 WECEIEAWQRPILEDKRFPEWYPITLFNELYYLNAGGTIWTDGSPPMQSLATIRERKFSL 463
           WE +IEAWQ PIL++   P WY   LFNELY++  GGT+W + S        +  R+   
Sbjct: 116 WEQKIEAWQEPILQNGDLPAWYKSALFNELYFMADGGTVWVEVSAADDDDNLM--RRGCK 173

Query: 464 DTPSKIHTPASSDTALGTRLLENGEENIGQFLYLEGAEYVMYNTYDVHFYSSFALVMLFP 523
           D P                 +++  +  G+F YLEG EY MYNTYDVHFY+SFAL+ML+P
Sbjct: 174 DLPG----------------MKDVLQEYGRFAYLEGQEYRMYNTYDVHFYASFALIMLWP 217

Query: 524 KLELSIQRDFAAAVMMHDPGTMKIMSDGKWVARKCLGAVPHDIG--LDDPWFEINSYNLF 581
           +LE+S+Q D AA+V+  DP   K +  G     K    VPHD+G   D+PW ++N+Y + 
Sbjct: 218 QLEISLQYDMAASVLQEDPELRKYLMSGTVAPVKSRSVVPHDVGDPDDEPWQKLNAYLIH 277

Query: 582 NSSRWKDLNSKFVLQVYRDFVATGDKNFARAVWPSVYIAMAYMEQFDKDGDGMIENEGFP 641
           +++ WKDLN KFVLQVYRD+  T D  + R +W      M    +FD+DGDG+IEN GF 
Sbjct: 278 DTANWKDLNLKFVLQVYRDYHITKDTTYLRDMWSVCQTVMETSLKFDEDGDGLIENSGFA 337

Query: 642 DQTYDAWSANGVSAYCGGLWVAALQAASALANDVGDHASASYFWVRYQKAKAVYDS-LWN 700
           DQTYD W   G S+YCGGLW+AA+     +A  +GD A+   F     + KA ++  LWN
Sbjct: 338 DQTYDDWVMTGPSSYCGGLWLAAVCMMCKMAEVLGDGAAQEKFTDILHRGKAAFEKLLWN 397

Query: 701 GSYFNYDNSDGSSSTSIQADQLAGQWYARACGLLPIADEA----KVKKALTKIYDFNVLK 756
           G Y+N+D  D   S S+  DQ AG W+ RACGL     E      V  AL  I++ NV +
Sbjct: 398 GKYYNFDCGDQPYSNSVMVDQCAGNWFLRACGLGAGESEVFPKDHVVSALRTIFELNVKQ 457

Query: 757 VKGGMCGAMNGMQPDGRIDMSGLQAREIWPGVTYGLAASMIQEEMVDMAFQTAAGVYEVA 816
              G  GA+NGM+PDG  D S +Q+ E+W GV YGLAA+MI E +V   F TA G Y   
Sbjct: 458 FADGQMGAVNGMRPDGTTDTSSVQSDEVWIGVVYGLAATMIHEGLVQEGFATAEGCYRTV 517

Query: 817 WSEDGLGYSFQTPESWNNNDEYRSLCYMRPLTIWAMQWAL 856
           W    LG SFQTPE++     +RSL YMRPL+IWAMQ AL
Sbjct: 518 WER--LGMSFQTPEAYCEKQVFRSLAYMRPLSIWAMQLAL 555


>gi|195033785|ref|XP_001988762.1| GH11342 [Drosophila grimshawi]
 gi|193904762|gb|EDW03629.1| GH11342 [Drosophila grimshawi]
          Length = 869

 Score =  455 bits (1171), Expect = e-125,   Method: Compositional matrix adjust.
 Identities = 294/869 (33%), Positives = 425/869 (48%), Gaps = 155/869 (17%)

Query: 103 YRGEFQRFKLFHGICDDAPVLANQFSVFVSRPNG-EKFSSVL--CPRSPGVPKKNTDSG- 158
           Y GEF RF++  GI +   V ANQF V +    G   F S+L  C       K N +   
Sbjct: 47  YAGEFCRFQMRPGIYEYNVVQANQFIVTIKDHKGCTIFQSLLSKCSTKSIKAKPNNNDAE 106

Query: 159 -----------------------IESWDWNLKGENCTYHALFPRAWTVYDGEPDPELRIV 195
                                  + +W  N++   C+Y  L+PR+WT YD      +R++
Sbjct: 107 ADEAEAQKSKCQLPNCSSRSRQPLSAWHSNIEDARCSYTGLYPRSWTEYDLS-HYGVRLI 165

Query: 196 CRQISPFIPHNYKESSFPASVFTFTLSNSGQTSADVTLLFTWANSVAGDSGLSGHHFNSK 255
           CRQISP IPH+YKESS P +VF ++  N       V++ FT+ N           +  S+
Sbjct: 166 CRQISPVIPHDYKESSLPCAVFVWSAQNVCDHERKVSITFTFKNGTGNKKQDGEGNAESQ 225

Query: 256 TMTKDGVHGLTLHHRTANGRPPVTFAVAAEETADVHVSECPCFLLSGNSKGITAKDMWNE 315
            +++    G+ +  R   G  P ++ +A     ++ ++ CP F  +GN      + +W +
Sbjct: 226 LISEGNAKGVAI--RQTIGDMPCSYNLACRVLPEISITRCPQFDPAGN-----GEQLWMQ 278

Query: 316 IKKHGSFDHLDNDKTSPSEPGSSIGAAIAASLTIPSGSTRSVTFSLAWDCPEVKFFEKV- 374
           +K+HG    L    T+ S     IG A+ A L +   ++  + F LAWD P+++F  K+ 
Sbjct: 279 LKEHG---QLSEQPTAESLKTKDIGVAVCAHLALKPKTSHDLEFVLAWDMPKIQFPRKLQ 335

Query: 375 -YHRRYTKFYGTLGDSAARIAHDAILEHAKWECEIEAWQRPILEDKRFPEWYPITLFNEL 433
            + R YTK++   G+S  RI   A+  +A WE  I+AWQRPIL D   P+WY   +FN+L
Sbjct: 336 THTRYYTKYFDDSGESGPRICEYALKHYASWERLIDAWQRPILNDDGLPDWYKCAIFNQL 395

Query: 434 YYLNAGGTIWTDGSPPMQSLATIRERKFSLDTPSKIHTPASSDTALGTRL-LENGEENIG 492
           Y+++ GGT+W                                DT+LG  L  ++     G
Sbjct: 396 YFISDGGTLWL-----------------------------KCDTSLGKPLAFDDPRLAYG 426

Query: 493 QFLYLEGAEYVMYNTYDVHFYSSFALVMLFPKLELSIQRDFAAAVMMHDPGTMKIMSDGK 552
           +F YLEG EY MYNTYDVHFY+S AL  L+P L++S+Q DF  A+      T K++ DGK
Sbjct: 427 RFGYLEGHEYRMYNTYDVHFYASPALAHLWPNLQVSLQYDFKDAIAAELSDTRKMLYDGK 486

Query: 553 WVARKCLGAVPHDIGLDD--PWFEINSYNLFNSSRWKDLNSKFVLQVYRDFVA------- 603
            + RK    VPHD+G  D  P+  IN YN+ + + WKDLN+KFVLQVYRD+         
Sbjct: 487 VMPRKVKNCVPHDLGDPDEEPFTLINCYNIHDVNEWKDLNTKFVLQVYRDYYVLNELAQA 546

Query: 604 -------------------------------TGDKNFARAVWPSVYI-----------AM 621
                                          + D+        S+YI           AM
Sbjct: 547 QADNASKFSSIEFIDKESLYELYTQDNKRKNSADEKQQNRKSASMYINETNGKVYLMDAM 606

Query: 622 AYMEQFDKDGDGMIE--------NEGFPDQT------YDAWSANGVSAYCGGLWVAALQA 667
           +Y++        ++E        N+G  + T      YD+W  +G SAYC GLW+AALQ 
Sbjct: 607 SYLKAMYGACKAIMERTIEYDKDNDGLIENTKMPDQTYDSWVMDGPSAYCAGLWLAALQT 666

Query: 668 ASALANDVGDHASASYFWVRYQK-----AKAVYDSLWNGSYFNYDNSDGSSSTSIQADQL 722
            S +A  +          +RYQ        ++ + LWNGSY+ +D S     T I ADQL
Sbjct: 667 MSVMATLLDQPNDC----LRYQDILERGKNSLEEKLWNGSYYRFDQSHNHRDT-IMADQL 721

Query: 723 AGQWYARACGL-LPIADEAKVKKALTKIYDFNVLKVKGGMCGAMNGM-----QPD--GRI 774
            G WY ++CG    I  +  V+ AL +IYD NV+    G  GA NG      +P   G +
Sbjct: 722 CGHWYLKSCGFDYEIYPKENVRTALKRIYDNNVMGFHEGNIGAANGFIANANEPSKPGHV 781

Query: 775 DMSGLQAREIWPGVTYGLAASMIQEEMVDMAFQTAAGVYEVAWSEDGLGYSFQTPESWNN 834
           D S +QA E+WPGV Y LAA+MIQE M + AFQTA G+Y+       +G +F+TPE+   
Sbjct: 782 DNSNIQAEEVWPGVVYALAATMIQEGMFEEAFQTAGGMYKTI--SQRIGMNFETPEALYG 839

Query: 835 NDEYRSLCYMRPLTIWAMQWALTKPKLSR 863
              YRS+ YMRPL+IW+MQ A  + +  R
Sbjct: 840 EKRYRSIGYMRPLSIWSMQVAWERRRALR 868


>gi|308485740|ref|XP_003105068.1| hypothetical protein CRE_20734 [Caenorhabditis remanei]
 gi|308257013|gb|EFP00966.1| hypothetical protein CRE_20734 [Caenorhabditis remanei]
          Length = 933

 Score =  454 bits (1169), Expect = e-125,   Method: Compositional matrix adjust.
 Identities = 290/865 (33%), Positives = 448/865 (51%), Gaps = 85/865 (9%)

Query: 37  KVPTGFTLSFQEFRHLAHIGFRLYRYSKQEEAKGKVPVFNVFRKHHITSDQGIPLGGIGA 96
           +VP      +Q +  L  +      + K    K K+ + N F+        G+P GGIG 
Sbjct: 48  RVPFNRPTPYQIYEALPFVRRYFMYWMKHTFDKEKLFI-NTFQPLKHKPYYGVPCGGIGC 106

Query: 97  GSIGRSYRGEFQRFKLFHGICDDAP--VLANQFSVFVSRPNGEKFSSVLCPRSPGVPKKN 154
           G+IGR +RG F +F L  G+ +     V A+QF + V   +   +  VL           
Sbjct: 107 GAIGRDFRGGFCKFSLRPGLVEHKVDVVAADQFIISVRENDTCIYQKVLS----AADVHR 162

Query: 155 TDSGIESWDWNLKGENCTYHALFPRAWTVYDGEPDPELRIVCRQISPFIPHNYKESSFPA 214
           + + + +WD+    +N  Y  LFPR+WT +   P+  + +  RQ+SP +PHNY+++++P 
Sbjct: 163 SGNQLSTWDFTFPKKNVNYRGLFPRSWTTFRI-PELGITVTIRQVSPVLPHNYEDTTYPV 221

Query: 215 SVFTFTLSNSGQTSA-DVTLLFTWANSVAG-----DSGLSGHHFNSKTMT-------KDG 261
            +F   + N+  +   +V++ FT+ N         +S  S   F +   +       ++ 
Sbjct: 222 CLFVIDVENTTTSKEYEVSIAFTFRNGTGNRRWERESECSAVKFETPGPSDSESNPGQEK 281

Query: 262 VHGLTLHHRTANGRPPVTFAVAAEETADVHVSECPCFLLSGNSKGITAKDMWNEIKKHGS 321
           V G++L+H  ++   P T+ +A        +S C  F  + N         WN +++ G 
Sbjct: 282 VTGISLYHTISSM--PCTYGLATTHKNGSILSVCERFDPARNGSAF-----WNHLQQTGD 334

Query: 322 ----------FDHLDN--DKTSPSEPGSSIGAAIAASLTIPSGSTRSVTFSLAWDCPEVK 369
                     F H  N   +   +     +  A+     +  G+ ++  ++L+WD P+V 
Sbjct: 335 VPSCVRILHVFFHKINSFQEEECAVNAREMAVAVCNRFRLSPGTKKTHDYALSWDMPKVH 394

Query: 370 F--FEKVYHRRYTKFY--GTLGDSAARIAHDAILEHAKWECEIEAWQRPILEDKRFPEWY 425
           F    + YHRRYT+F+     G +A  +   A+    KWE EIE WQ PIL  ++ P+WY
Sbjct: 395 FGSVARSYHRRYTRFFDAAEAGTAADSLCVRALHLKNKWEEEIEKWQLPILNHEKLPDWY 454

Query: 426 PITLFNELYYLNAGGTIWTDGSPPMQSLATIRERKFSLDTPSKIHTPASSDTALGTRLLE 485
              +FNELY++  GGT+W                 F  D     H    S+        +
Sbjct: 455 KSAIFNELYFITDGGTVW-----------------FEFDEQWAEHEMHLSEYT------K 491

Query: 486 NGEENIGQFLYLEGAEYVMYNTYDVHFYSSFALVMLFPKLELSIQRDFAAAVMMHDPGTM 545
              ++IG+F YLE  EY M NTYDVHFY+S+AL  L+P+LEL++Q +F   V        
Sbjct: 492 EKMKSIGRFGYLESWEYRMVNTYDVHFYASYALAELWPQLELTVQSEFTDQVYHSIEKPT 551

Query: 546 KIMSDGKWVARKCLGAVPHDIG--LDDPWFEINSYNLFNSSRWKDLNSKFVLQVYRDFVA 603
           +   +G W   K    VPHD+G   D+PW   N+Y + ++ +WKDLN K+VL  +RD+V 
Sbjct: 552 RFHMEGDWANVKTASRVPHDLGNPADEPWIATNAYVMHDTGKWKDLNMKYVLTSWRDYVV 611

Query: 604 TGDKN--FARAVWPSV-YIAMAYMEQFDKDGDGMIENEGFPDQTYDAWSANGVSAYCGGL 660
             +++  F    WP+V  I +  +E +D++GDGMIEN G  DQTYDAW   GVSAYCG L
Sbjct: 612 LSEEHQEFLFHTWPAVKMIMLEALENWDQNGDGMIENFGKADQTYDAWQMEGVSAYCGSL 671

Query: 661 WVAALQAASALANDVGDHASASYFWVRYQKAKAVY-DSLWNGSYFNYDNSDGSSSTSIQA 719
           W+A+L+ +  +A  +G+ A+ + F V   KAK V+ D+LW G+YF +     S  T + A
Sbjct: 672 WLASLRVSIEMARLMGEEATENLFRVTLNKAKKVFIDTLWTGTYFRFCERSRSRET-VMA 730

Query: 720 DQLAGQWYARACG------LLPIADEAKVKKALTKIYDFNVLKVKGGMCGAMNGMQPDGR 773
           DQL G W+ ++        LLP      V+ AL  IY  NV +   G  GA+NGM+P+G 
Sbjct: 731 DQLCGYWFLQSVSPEMVDDLLP---NHMVRSALDTIYRLNVCRFGNGQMGAVNGMKPNGV 787

Query: 774 IDMSGLQAREIWPGVTYGLAASMIQEEMVDMAFQTAAGVYEVAWSEDGLGYSFQTPESWN 833
           +D   +QA E+W GVTY +A+ +IQ+  V+ AF TA+G Y   + + GL Y  QTPE+  
Sbjct: 788 VDREYIQADEMWTGVTYAVASLLIQQGEVEKAFHTASGSYLTCFEQTGLQY--QTPEALY 845

Query: 834 NNDEYRSLCYMRPLTIWAMQWALTK 858
            +  YR++ YMRPL+IWAMQW+L K
Sbjct: 846 ESKFYRAIGYMRPLSIWAMQWSLKK 870


>gi|328709489|ref|XP_001943760.2| PREDICTED: non-lysosomal glucosylceramidase-like [Acyrthosiphon
           pisum]
          Length = 776

 Score =  453 bits (1165), Expect = e-124,   Method: Compositional matrix adjust.
 Identities = 299/855 (34%), Positives = 426/855 (49%), Gaps = 111/855 (12%)

Query: 19  DPGQPPQLTWQRKLDSNVKVPT--GFTLSFQEFRHLAHIGFRLYRYSKQEEAKGKVPVFN 76
           D G  P+  W+   D     P    +    ++   L  +  R   Y      KG+ P+ +
Sbjct: 10  DKGGVPEYGWKVDFDHQFPDPRTPSYMPRIKQILQLLPLIIRYVFYMLLCYLKGQKPIMD 69

Query: 77  VFRKHHITSDQGIPLGGIGAGSIGRSYRGEFQRFKLFHGICDDAPVLANQFSVFVSRP-N 135
                      G+PLGGIG G+IGR ++GEF R+++  G+ +   V ANQF V +    N
Sbjct: 70  FLFPIKSKHMYGVPLGGIGGGTIGRGFKGEFCRYQVVPGMYEYESVQANQFIVNIQNSEN 129

Query: 136 GEKFSSVLCPRSPGVPKKNTDSGIESWDWNLKGENCTYHALFPRAWTVYDGEPDPELRIV 195
              +++VL         K  +  + +W W+     C Y  L+PR+WT Y   P+  +R++
Sbjct: 130 ITVYNNVLSCL------KCNNKTLYNWKWDFPSNQCKYTGLYPRSWTEY-LIPEFHIRLI 182

Query: 196 CRQISPFIPHNYKESSFPASVFTFTLSNSGQTSADVTLLFTWANSVAGDSGLSGHHFNSK 255
           C+QISP IPHNYK+S  P  VF +T+ N+      ++L FT+A+     +G+S +H +  
Sbjct: 183 CKQISPIIPHNYKDSCIPGGVFIWTVENNSDQQYSISLTFTFASG----NGISSYHSDEP 238

Query: 256 T-----MTKDG--VHGLTLHHRTANGRPPVTFAVAAEETADVHVSECPCFLLSGNSKGIT 308
           +     + KDG  V G  +H+                        E  C  + G      
Sbjct: 239 SCEPFNLNKDGTEVAGSLIHNC---------------------FKEMKCTYVIGGL---- 273

Query: 309 AKDMWNEIKKHGSFDHLDNDKTSPSEPGSSIGAAIAASLTIPSGSTRSVTFSLAWDCPEV 368
                  IK+  +F  L+ D           G  I+ SL +  G  R +    +      
Sbjct: 274 -------IKEGAAFSQLNFDPKGN-------GMTISKSLHL-FGELRDMQLGPS------ 312

Query: 369 KFFEKVYHRRYTKFYGTLGDSAARIAHDAILEHAKWECEIEAWQRPILEDKRFPEWYPIT 428
                +Y + YTKF+   G +   I H ++    KWE EIE WQ PIL +   P+WY   
Sbjct: 313 ----NIYSKYYTKFFDKNGTAGPEIMHYSMQNFKKWEQEIEKWQNPILNNPLLPDWYKSA 368

Query: 429 LFNELYYLNAGGTIWTDGSPPMQSLATIRERKFSLDTPSKIHTPASSDTALGTRLLENGE 488
           +FNE+Y++  GGTIW            ++E    LD                   + +  
Sbjct: 369 IFNEVYFVADGGTIW---------FLPLKEDYERLD-------------------VTDPR 400

Query: 489 ENIGQFLYLEGAEYVMYNTYDVHFYSSFALVMLFPKLELSIQRDFAAAVMMHDPGTMKIM 548
              G F YLEG EY MYNTYDVHFY+SF L  L+PKL+  IQ  F   +      T K +
Sbjct: 401 YQYGYFGYLEGHEYRMYNTYDVHFYASFTLAKLWPKLQACIQYKFRDTIDNALNITRKHL 460

Query: 549 SDGKWVARKCLGAVPHDIG--LDDPWFEINSYNLFNSSRWKDLNSKFVLQVYRDFVATGD 606
           S G    RK   +VPHD+G   +DP+  +N+Y + N S W+DLN KFVLQ YRD+   GD
Sbjct: 461 SCGTRGLRKYKYSVPHDLGDPEEDPFNLVNAYLVHNVSEWRDLNLKFVLQCYRDYTMFGD 520

Query: 607 KNFARAVWPSVYIAMAYMEQFDKDGDGMIENEGFPDQTYDAWSANGVSAYCGGLWVAALQ 666
             +   ++P +   M     +D DGDGMIEN G  DQT+D W   G+SAYCGGLW+AAL 
Sbjct: 521 NKYLADMYPQIKNVMEKSLSWDTDGDGMIENSGTADQTFDTWIMTGISAYCGGLWLAALY 580

Query: 667 AASALANDVGDHASASYFWVRYQKAKAVYD-SLWNGSYFNYDNSDGSSSTSIQADQLAGQ 725
               ++N +     +  +     KAK  ++ +LWNG YFN+D+S    S SI ADQL G+
Sbjct: 581 CTVQISNILKQDDISEKYSTILNKAKESFNAALWNGGYFNFDSSGQHHSKSIMADQLCGE 640

Query: 726 WYARACG----LLPIADEAKVKKALTKIYDFNVLKVKGGMCGAMNGMQPDGRIDMSGLQA 781
           WY + CG    +LP+    +VKKAL  IY  NVL    G  GA+NGM PDG  D   LQ+
Sbjct: 641 WYLKCCGVNEEVLPV---DRVKKALHTIYKMNVLGFNDGKMGAVNGMMPDGNPDTFSLQS 697

Query: 782 REIWPGVTYGLAASMIQEEMVDMAFQTAAGVYEVAWSEDGLGYSFQTPESWNNNDEYRSL 841
            E+W GVTY L++ MI   + D  F TA G+Y   ++  G+ Y  QTPE+  + + YRS+
Sbjct: 698 EEVWTGVTYALSSLMIAHGLRDEGFNTAKGIYNTVYNNIGMAY--QTPEAIYSKNAYRSV 755

Query: 842 CYMRPLTIWAMQWAL 856
            YMRPL+IW++Q AL
Sbjct: 756 GYMRPLSIWSIQTAL 770


>gi|297741934|emb|CBI33369.3| unnamed protein product [Vitis vinifera]
          Length = 508

 Score =  452 bits (1164), Expect = e-124,   Method: Compositional matrix adjust.
 Identities = 219/440 (49%), Positives = 300/440 (68%), Gaps = 7/440 (1%)

Query: 19  DPGQPPQLTWQRKLDSNVKVPTGFTLSFQEFRHLAHIGFRLYRYSKQEEAKGKVPVFNVF 78
           D   PP+  W+R+L+S+  +   F+++F E   +  +G RL+ Y ++E ++G+    + F
Sbjct: 31  DSAAPPEQAWRRRLNSHANILKEFSVTFTEAIKMIRLGIRLWSYIREEASQGRKAPIDPF 90

Query: 79  RKHHI--TSDQGIPLGGIGAGSIGRSYRGEFQRFKLFHGICDDAPVLANQFSVFVSRPNG 136
            +     ++ QG+PLGG+G+GSI R +RGEF+ +++  G CD +P++ANQFS+F+SR  G
Sbjct: 91  TRETCKPSASQGVPLGGMGSGSISRGFRGEFRHWQIVPGTCDASPIMANQFSIFISREGG 150

Query: 137 -EKFSSVLCPRSPGVPKKNTDSGIESWDWNLKGENCTYHALFPRAWTVYDGEPDPELRIV 195
            +K++SVL P       K+ D GI SW WNL G++ TYHALFPRAWT+YDGEPDPEL++ 
Sbjct: 151 NKKYASVLAPGQHEGLGKSGDQGISSWGWNLSGQHSTYHALFPRAWTIYDGEPDPELKVS 210

Query: 196 CRQISPFIPHNYKESSFPASVFTFTLSNSGQTSADVTLLFTWANSVAGDSGLSGHHFNSK 255
           CRQISPFIPHNY++SS P +VF +TL N+G+  A V+LLFTWANS+ G S LSG H N  
Sbjct: 211 CRQISPFIPHNYRDSSLPTAVFVYTLVNTGKERAKVSLLFTWANSIGGISHLSGDHVNEP 270

Query: 256 TMTKDGVHGLTLHHRTANGRPPVTFAVAAEETADVHVSECPCFLLSGNSKGITAKDMWNE 315
            + +DGV G+ LHH+TA   PPVTFA+AA ET +V V+  P F LS  S  ITAKDMW +
Sbjct: 271 FIGEDGVSGVLLHHKTAKENPPVTFAIAACETQNVSVTVLPSFGLSEGSH-ITAKDMWGK 329

Query: 316 IKKHGSFDHLD--NDKTSPSEPGSSIGAAIAASLTIPSGSTRSVTFSLAWDCPEVKFFE- 372
           + + G FD  +  + ++ PS PG ++ AA++AS  +      +V F+LAW  P+VKF + 
Sbjct: 330 MVQDGQFDRENCYSGRSMPSSPGETLCAAVSASAWVEPHGKCTVAFALAWSSPKVKFLKG 389

Query: 373 KVYHRRYTKFYGTLGDSAARIAHDAILEHAKWECEIEAWQRPILEDKRFPEWYPITLFNE 432
             YHRRYTK+YGT   +A  I HDA+  + +WE EIE WQ PIL D R PEWY  TLFNE
Sbjct: 390 SSYHRRYTKYYGTSERAALNIVHDALTNYKQWEEEIEKWQSPILRDDRLPEWYKFTLFNE 449

Query: 433 LYYLNAGGTIWTDGSPPMQS 452
           LY+L AGGT+W D S P  S
Sbjct: 450 LYFLVAGGTVWIDSSLPATS 469


>gi|392918721|ref|NP_504047.2| Protein R08F11.1 [Caenorhabditis elegans]
 gi|387910741|emb|CCD72306.2| Protein R08F11.1 [Caenorhabditis elegans]
          Length = 830

 Score =  446 bits (1146), Expect = e-122,   Method: Compositional matrix adjust.
 Identities = 282/817 (34%), Positives = 417/817 (51%), Gaps = 84/817 (10%)

Query: 72  VPVFNVFRKHHITSDQGIPLGGIGAGSIGRSYRGEFQRFKLFHGICDDAPVL-ANQFSVF 130
           + +FNVFR    T   G+PLGGIG GSIG  +RG F RF +  GI +       NQF V 
Sbjct: 64  IDIFNVFRHFTYT---GVPLGGIGCGSIGTDFRGGFNRFSIIPGIKEQTETQKCNQFIVT 120

Query: 131 V-SRPNGEK-FSSVL-CPRSPGVPKKNTDSGIESWDWNLKGENCTYHALFPRAWTVYDGE 187
           V S+   E  + S+L C   P        + +  WD  +  E+  Y  LFPRAW  +   
Sbjct: 121 VHSKKTFELIYQSILSCAEFPA-------TVLPKWDTTIPAEDVRYRGLFPRAWQEFR-L 172

Query: 188 PDPELRIVCRQISPFIPHNYKESSFPASVFTFTLSNSGQTSADVTLLFTWANSVAGDSGL 247
               + +V   +SP IP +Y +SS P + F F + N      +V++  ++ N        
Sbjct: 173 GSSGVTVVVEHLSPVIPGDYSDSSLPLANFEFHVFNDSYEEVEVSITMSFRNGTGNRKWN 232

Query: 248 SGHHFNSKTMTKDGVHGLTLHHRTANGRPPVTFAVAAEETADVHVSECPCFLLSGNSKGI 307
             +   S+ + KD +   TL H T  G  PVT+A+  EE     V+ C   L   N  G 
Sbjct: 233 DENLCQSQKIQKDTMVVRTLAH-TVKGM-PVTYAIGTEEKNGSKVTTC---LFDPNGTG- 286

Query: 308 TAKDMWNEIKKHG---SFDHLDNDKTSPSEPGSSIGAAIAASLTIPSGSTRSVTFSLAWD 364
               +W++++ +G   S+DHL      PS+P   +G A+ +S  +P     +  FSL W 
Sbjct: 287 --GRLWSDLEAYGHLSSYDHL------PSKP-KELGIAVCSSFFVPPDGAHNTQFSLTWY 337

Query: 365 CPEVKF--FEKVYHRRYTKFYG--TLGDSAARIAHDAILEHAKWECEIEAWQRPILEDKR 420
            P+V F   E+ Y+RRY +F+      +  A I    +   + W+ +IE WQ P+L D++
Sbjct: 338 MPQVHFGTAERFYNRRYCRFFNGPDADEVTAAICRHGLQNFSTWQKKIEDWQAPVLNDQK 397

Query: 421 FPEWYPITLFNELYYLNAGGTIWTDGSPPMQSLATIRERKFSLDTPSKIHTPASSDTALG 480
            PEWY   +FNELYY+  G T+W +  P  +                       ++ +L 
Sbjct: 398 LPEWYKSAIFNELYYIVDGSTVWFEYDPDWK-----------------------TEESLM 434

Query: 481 TRLLENGEENIGQFLYLEGAEYVMYNTYDVHFYSSFALVMLFPKLELSIQRDFAAAVMMH 540
           +   E   +  G+F Y+E  EY M NTYDVHFYSS+A++  +P++E+S+Q DFA  +   
Sbjct: 435 SEQTEKQFKEYGRFGYMESWEYFMINTYDVHFYSSWAILKNWPQIEMSMQLDFADQMDRV 494

Query: 541 DPGTMKIMSDGKWVARKCLGAVPHDIG-------------LDDPWFEINSYNLFNSSRWK 587
           D      ++DG+ +  K +  +PHD+G             + DPW   N+Y L ++ RWK
Sbjct: 495 DNNIATSLADGEQMTIKSVDRIPHDMGHPTKGCIQEKRLSVADPWIHTNAYILHDTGRWK 554

Query: 588 DLNSKFVLQVYRD--FVATGDKN---FARAVWPSVYIAMAYMEQFDKDGDGMIENEGFPD 642
           DLN KFV+  YRD   +  G +        +     I    +E +DKD DGMIEN+GF D
Sbjct: 555 DLNLKFVISCYRDWKLIELGSEKGQVLEFFLGKCTKIVDGALECWDKDNDGMIENDGFAD 614

Query: 643 QTYDAWSANGVSAYCGGLWVAALQAASALANDVGDHASASYFWVRYQKAKAVY-DSLWNG 701
           QTYD W   G SAYCG LW+AAL +   +    G      ++  + + A   Y   LWNG
Sbjct: 615 QTYDVWKMTGTSAYCGSLWIAALSSYIEMLKQSG--LPTKHYEEKLEMAYDAYIGKLWNG 672

Query: 702 SYFNYDNSDGSSSTSIQADQLAGQWYARACGLLPIADEAKVKKALTKIYDFNVLKVKGGM 761
           ++F +D    +S   + ADQL G W   A        + K+K AL  I+ +NV     G 
Sbjct: 673 TFFKFDELPENSKI-VMADQLCGFWAMTAMDEPVQISKDKMKSALDTIFKYNVQMYNNGR 731

Query: 762 CGAMNGMQPDGRIDMSGLQAREIWPGVTYGLAASMIQEEMVDMAFQTAAGVYEVAWSEDG 821
           CGA+NG     R+D S +Q+ E+W G+TY L+A MI++ M + AF+T+ G++E  W    
Sbjct: 732 CGAVNGYLTSERVDGSSIQSEEVWAGITYALSAMMIEKGMDEQAFKTSEGLFESIWHRFP 791

Query: 822 LGYSFQTPESWNNNDEYRSLCYMRPLTIWAMQWALTK 858
           L Y  QTPE+  ++  YR+L YMRPL+IWA+Q AL +
Sbjct: 792 LQY--QTPEAITSDGMYRALGYMRPLSIWAIQHALDR 826


>gi|320105716|ref|YP_004181306.1| glucosylceramidase [Terriglobus saanensis SP1PR4]
 gi|319924237|gb|ADV81312.1| Glucosylceramidase [Terriglobus saanensis SP1PR4]
          Length = 794

 Score =  442 bits (1138), Expect = e-121,   Method: Compositional matrix adjust.
 Identities = 285/798 (35%), Positives = 403/798 (50%), Gaps = 96/798 (12%)

Query: 87  QGIPLGGIGAGSIGRSYRGEFQRFKLFHGICDDAPVLANQFSVFVSRPNGEK--FSSVLC 144
           QG P+GG GAG+  RSYRG+F R+ +  G+       AN+F++F  +  GE    + VL 
Sbjct: 56  QGAPVGGFGAGTFSRSYRGDFARWHIKAGVHKYDVSYANEFAMF-QQVEGEATGVAKVLL 114

Query: 145 PRSPGVPKKNTDSGIESWDWNLKGENCTYHALFPRAWTVYDGEPDPELRIVCRQISPFIP 204
              P          + SW W+       Y AL+P++W  Y  E  P   +   Q SP IP
Sbjct: 115 NDHP------KQGELASWSWDYPVGAGEYAALYPKSWYDYRWEKFPA-HVTLEQYSPVIP 167

Query: 205 HNYKESSFPASVFTFTLSNSGQTSADVTLLFTWANSVAGDSGLSGHHF-------NSKTM 257
            NY+ESS+P +V+ +   N    +  V++L +W N ++G      H F       N  T 
Sbjct: 168 DNYRESSYPTAVYRWHAENPTDKTVTVSVLLSWTN-MSGWFRTFTHDFKGAPNQGNYDTY 226

Query: 258 TKD-GVHGLTLHHRTANGRPPV---TFAVAAEETADVHVSECPCFLLSGNSKGITAKDMW 313
             + G+ G+      A   P      FA+AA ET  V V+    F  +G+ K +     W
Sbjct: 227 RNEAGMKGIVFDRSRAGVAPNEWDGQFAIAAMETPGVEVTYQTSFQAAGDGKAV-----W 281

Query: 314 NEIKKHGSFDHLDNDKTSPSEPGSSIGAAIAASLTIPSGSTRSVTFSLAWDCPEVKFFE- 372
           +   + G    L ND TS    G  +  AIA   T+  G  + V   +AWD P V+F E 
Sbjct: 282 SHFARDG---RLPNDDTSWVSDGEKLAGAIAVRFTLKPGEKKVVPMVIAWDLPVVQFGEG 338

Query: 373 KVYHRRYTKFYGTLGDSAARIAHDAILEHAKWECEIEAWQRPILEDKRFPEWYPITLFNE 432
           + ++R+YT FYGT G +A +IA + +L   +W   I+AWQ+P + D+  P WY   LFNE
Sbjct: 339 RQWNRKYTDFYGTDGKNAWKIAREGLLHAQEWSDAIDAWQKPTITDESKPLWYRGMLFNE 398

Query: 433 LYYLNAGGTIWTDGSPPMQSLATIRERKFSLDTPSKIHTPASSDTALGTRLLENGEENIG 492
           LY L  GGT W                            P  SD  L             
Sbjct: 399 LYALTDGGTFWG--------------------------RPVGSDKKL-----------PA 421

Query: 493 QFLYLEGAEYVMYNTYDVHFYSSFALVMLFPKLELSIQRDFAAAVMMHDPGTMKIMSDGK 552
            F  LE  +Y  Y T DV FY+S  L+  +P+++  + R+FA  V        +    G 
Sbjct: 422 SFALLECFDYAYYGTLDVRFYASLPLLKFWPEIDKRVLREFAETV------EKEWSEKGL 475

Query: 553 WV------------ARKCLGAVPHDIGL--DDPWFEINSYNLFNSSRWKDLNSKFVLQVY 598
           WV             RK +GAVPHD+G+   DP++ +N     +++ WKDLNSKFVL VY
Sbjct: 476 WVWKTAETGSPVLHKRKKVGAVPHDLGVPEGDPFYVVNEPGWQDTNDWKDLNSKFVLMVY 535

Query: 599 RDFVATG--DKNFARAVWPSVYIAMAYMEQFDKDGDGMIENEGFPDQTYDAWSANGVSAY 656
           RD+V TG  DK F +  WP+V  A+ Y+ QFD  G G+ EN G+PDQTYD+W   GVSAY
Sbjct: 536 RDYVLTGRTDKAFLQETWPAVKAAIQYLRQFDH-GRGVPENSGYPDQTYDSWVVRGVSAY 594

Query: 657 CGGLWVAALQAASALANDVGDHASASYFWVRYQKAKAVY-DSLWNGSYFNYDNSDGSSST 715
            GGLW+ AL+A    A  +GD  + + +   +  A+  Y  +LWNG YF YD +D S+  
Sbjct: 595 SGGLWLGALRAGEETARVLGDWTAQAEYQALFAMAQKTYVATLWNGEYFLYD-TDASNKG 653

Query: 716 SIQADQLAGQWYARACGLLPIADEAKVKKALTKIYDFNVLKVKGGMCGAMNGMQPDGRID 775
            IQADQLAGQWYA   GL  +   A  +  L KIY FNV +   G  GA+NGM  DG + 
Sbjct: 654 VIQADQLAGQWYAHMTGLGDLVPRAMQRSVLKKIYAFNVNQFGDGNLGAVNGMNADGTV- 712

Query: 776 MSGLQAREIWPGVTYGLAASMIQEEMVDMAFQTAAGVYEVAWSEDGLGYSFQTPESWNNN 835
           +  ++ARE+W G T G AA +    + + ++    G++ V +     GY F+TPE+W+  
Sbjct: 713 VDNVEAREVWAGTTLGYAALLKSLGIEEESYHVVHGLFHVIYESK--GYWFRTPEAWDVT 770

Query: 836 DEYRSLCYMRPLTIWAMQ 853
             +R+  YMRP  +W M+
Sbjct: 771 GNFRASMYMRPAGLWGME 788


>gi|341888271|gb|EGT44206.1| hypothetical protein CAEBREN_07747 [Caenorhabditis brenneri]
          Length = 820

 Score =  440 bits (1131), Expect = e-120,   Method: Compositional matrix adjust.
 Identities = 284/803 (35%), Positives = 406/803 (50%), Gaps = 66/803 (8%)

Query: 72  VPVFNVFRKHHITSDQGIPLGGIGAGSIGRSYRGEFQRFKLFHGICDDAPVL-ANQF--S 128
           + +FNVFR    T   G+PLGGIG+GSIG  +RG F RF +  GI +       NQF  S
Sbjct: 64  IDIFNVFRHFTFT---GVPLGGIGSGSIGTDFRGGFNRFSIIPGIKEQTDTQKCNQFIAS 120

Query: 129 VFVSRPNGEKFSSVL-CPRSPGVPKKNTDSGIESWDWNLKGENCTYHALFPRAWTVYDGE 187
           V   +     + S+L C   P        + +  WD  +  E+  Y  LFPRAW  +   
Sbjct: 121 VHSKKTFELIYQSILSCAEFPS-------TVLPKWDSTIPAEDVRYRGLFPRAWQEFR-L 172

Query: 188 PDPELRIVCRQISPFIPHNYKESSFPASVFTFTLSNSGQTSADVTLLFTWANSVAGDSGL 247
               + +V   +SP IP++Y +SS P + F F + N      +V++   + N        
Sbjct: 173 GSSGITVVIEHLSPVIPNDYSDSSLPLANFDFHVFNDSPEEVEVSITMAFRNGTGNRKWN 232

Query: 248 SGHHFNSKTMTKDGVHGLTLHHRTANGRPPVTFAVAAEETADVHVSECPCFLLSGNSKGI 307
             H   S+ + K+     TL H T  G  PVT+A+  EE     V+ C   L   N  G 
Sbjct: 233 DEHLCTSQKVQKETQVVRTLSH-TVKGM-PVTYAIGTEEKNGAKVTTC---LFDPNGTG- 286

Query: 308 TAKDMWNEIKKHGSFDHLDNDKTSPSEPGSSIGAAIAASLTIPSGSTRSVTFSLAWDCPE 367
               +W++++ +G     DN    P E    +G A+ +S  +P   + +  FSL+W  PE
Sbjct: 287 --GRLWSDLEAYGHLSSYDNLPPKPKE----LGIAVCSSFFVPPDGSHTTQFSLSWYMPE 340

Query: 368 VKF--FEKVYHRRYTKFYG--TLGDSAARIAHDAILEHAKWECEIEAWQRPILEDKRFPE 423
           V F   E+ Y+RRY +F+      + AA I    +    KW+  IE WQ PI+ +++ PE
Sbjct: 341 VHFGTAERFYNRRYCRFFNGPDADEVAAAICRHGLQNIEKWQEAIEKWQNPIITNQKLPE 400

Query: 424 WYPITLFNELYYLNAGGTIWTDGSPPMQSLATIRERKFSLDTPSKIHTPASSDTALGTRL 483
           WY   +FNELYY+  G T+W +  P  +                       ++  L +  
Sbjct: 401 WYRSAIFNELYYIVDGSTVWFEYDPDWR-----------------------TEENLISEQ 437

Query: 484 LENGEENIGQFLYLEGAEYVMYNTYDVHFYSSFALVMLFPKLELSIQRDFAAAVMMHDPG 543
            E      G+F Y+E  EY M NTYDVHFYSS+A++  +P++E+S+Q DFA  V   D G
Sbjct: 438 TEKQFREYGRFGYMESWEYFMLNTYDVHFYSSWAILKNWPEIEMSMQLDFADQVDRIDNG 497

Query: 544 TMKIMSDGKWVARKCLGAVPHDIG--LDDPWFEINSYNLFNSSRWKDLNSKFVLQVYRDF 601
           T   ++DG  +  K    +PHD+G  + DPW   N+Y L ++  WKDLN KFV+  YRD+
Sbjct: 498 TATSLADGDQMLIKSYDRIPHDMGHPMADPWLHTNAYILHDTGSWKDLNLKFVISCYRDY 557

Query: 602 VATG------DKNFARAVWPSVYIAMAYMEQFDKDGDGMIENEGFPDQTYDAWSANGVSA 655
                     D+     +     I    +E +DKD DGMIEN+GF DQTYD W   G SA
Sbjct: 558 KMIAKDREDKDQLLEFFLGKCTKIVEDALESWDKDKDGMIENDGFADQTYDVWKMTGTSA 617

Query: 656 YCGGLWVAALQAASALANDVGDHASASYFWVRYQKAKAVYDSLWNGSYFNYDNSDGSSST 715
           YCG LW+AAL     +    G   S  Y         A    LWNG +F +D    +S  
Sbjct: 618 YCGSLWLAALTCYIEMLKQAGS-PSKHYEGKLLNAYDAYTTKLWNGRFFKFDELPDNSKI 676

Query: 716 SIQADQLAGQWYARACGLLPIADEAKVKKALTKIYDFNVLKVKGGMCGAMNGMQPDGRID 775
            + ADQL G W   A        E K+K AL  I+ +NV     G CGA+NG     R+D
Sbjct: 677 -VMADQLCGFWALTAMDEPVQVSEEKMKLALETIFKYNVQMYNDGKCGAVNGFLTSERVD 735

Query: 776 MSGLQAREIWPGVTYGLAASMIQEEMVDMAFQTAAGVYEVAWSEDGLGYSFQTPESWNNN 835
            S +Q+ E+W G+TY L+A MI++ M +MAF+T+ G++E  W+   L Y  QTPE+  ++
Sbjct: 736 GSSIQSEEVWAGITYALSAMMIEKNMDEMAFKTSEGLFESIWNRFPLQY--QTPEAITSD 793

Query: 836 DEYRSLCYMRPLTIWAMQWALTK 858
             YR+L YMRPL+IWA+Q AL +
Sbjct: 794 GMYRALGYMRPLSIWAIQHALDR 816


>gi|268555576|ref|XP_002635777.1| Hypothetical protein CBG10433 [Caenorhabditis briggsae]
          Length = 821

 Score =  437 bits (1124), Expect = e-119,   Method: Compositional matrix adjust.
 Identities = 286/808 (35%), Positives = 413/808 (51%), Gaps = 72/808 (8%)

Query: 72  VPVFNVFRKHHITSDQGIPLGGIGAGSIGRSYRGEFQRFKLFHGICDDAPVL-ANQF--S 128
           + +FNVFR    T   G+PLGGIG+GSIG  +RG F RF +  GI +       NQF  S
Sbjct: 64  IDIFNVFRHFTYT---GVPLGGIGSGSIGTDFRGGFNRFSIIPGIKEQTETQKCNQFIAS 120

Query: 129 VFVSRPNGEKFSSVL-CPRSPGVPKKNTDSGIESWDWNLKGENCTYHALFPRAWTVYDGE 187
           V   +     + S+L C   P        + +  WD  +  E+  Y  LFPRAW  +   
Sbjct: 121 VHSKKTFELIYQSILSCAEFPA-------TVLPKWDSTIPAEDVRYRGLFPRAWQEFRLG 173

Query: 188 PDPELRIVCRQISPFIPHNYKESSFPASVFTFTLSNSGQTSADVTLLFTWANSVAGDSGL 247
               + +V   +SP IP +Y +SS P + F F + N      +V++  ++ N        
Sbjct: 174 TSG-ITVVVEHLSPVIPEDYSDSSLPLANFDFHVFNDSPEEVEVSITMSFRNGTGNKKWN 232

Query: 248 SGHHFNSKTMTKDGVHGLTLHHRTANGRPPVTFAVAAEETADVHVSECPCFLLSGNSKGI 307
                 S  + KD +   TL H T  G  PVT+A+  EE     V+ C   L   N  G 
Sbjct: 233 DESLCQSHKVQKDTMVVRTLSH-TVKGM-PVTYAIGTEEKNGAKVTTC---LFDPNGTG- 286

Query: 308 TAKDMWNEIKKHG---SFDHLDNDKTSPSEPGSSIGAAIAASLTIPSGSTRSVTFSLAWD 364
               +W++++ +G   S+DHL      P +P   +G A+ +S  +P   + +  FSLAW 
Sbjct: 287 --GRLWSDLEAYGHLSSYDHL------PPKP-KELGIAVCSSFFVPPDGSHTTQFSLAWY 337

Query: 365 CPEVKF--FEKVYHRRYTKFYG--TLGDSAARIAHDAILEHAKWECEIEAWQRPILEDKR 420
            P+V F   E+ Y+RRY +F+      +  A I    +     W+  I  WQ PI+ D++
Sbjct: 338 MPQVHFGTAERFYNRRYCRFFNGPDADEVGASICRHGLQNIETWQEAIRKWQDPIINDEK 397

Query: 421 FPEWYPITLFNELYYLNAGGTIWTDGSPPMQSLATIRERKFSLDTPSKIHTPASSDTALG 480
            PEWY   +FNELYY+  G T+W +  P  +                       +D  L 
Sbjct: 398 LPEWYRSAIFNELYYIVDGSTVWFEYDPEWR-----------------------TDEILI 434

Query: 481 TRLLENGEENIGQFLYLEGAEYVMYNTYDVHFYSSFALVMLFPKLELSIQRDFAAAVMMH 540
           +   E   +  G+F Y+E  EY M NTYDVHFYSS+A++  +P++E+S+Q DFA  V   
Sbjct: 435 SEETEKQFKQYGRFGYMESWEYFMINTYDVHFYSSWAILKNWPQIEMSMQMDFADQVDRV 494

Query: 541 DPGTMKIMSDGKWVARKCLGAVPHDIG--LDDPWFEINSYNLFNSSRWKDLNSKFVLQVY 598
           D GT   ++DG  +  K    +PHD+G  + DPW   N+Y L ++  WKDLN KFV+  Y
Sbjct: 495 DYGTATSLADGDTMTIKSYDRIPHDMGHPMADPWIHTNAYILHDTGHWKDLNLKFVISCY 554

Query: 599 RDF--VATGDKNFARA----VWPSVYIAMAYMEQFDKDGDGMIENEGFPDQTYDAWSANG 652
           RD+  +A G+++        +     I    +E +DKD DGMIEN+GF DQTYD W   G
Sbjct: 555 RDWKMIAEGEEDSQEILEFFLGKCTKIVDGALENWDKDKDGMIENDGFADQTYDVWKMTG 614

Query: 653 VSAYCGGLWVAALQAASALANDVGDHASASYFWVRYQKAKAVYDSLWNGSYFNYDNSDGS 712
            SAYCG LW+AAL +   +    G   +  Y        +A    LWNG +F +D    +
Sbjct: 615 TSAYCGSLWIAALTSYIQMLKRAG-IPTKDYEEKLLMAYEAYTTKLWNGKFFKFDELPDN 673

Query: 713 SSTSIQADQLAGQWYARACGLLPIADEAKVKKALTKIYDFNVLKVKGGMCGAMNGMQPDG 772
           S   + ADQL G W   A        E K++ AL  I+ +NV    GG CGA+NG     
Sbjct: 674 SKI-VMADQLCGFWALTAMDEPIQVSEGKIQSALETIFKYNVEMYDGGKCGAVNGFLASE 732

Query: 773 RIDMSGLQAREIWPGVTYGLAASMIQEEMVDMAFQTAAGVYEVAWSEDGLGYSFQTPESW 832
           R+D S +Q+ E+W G+TY L+A MI++ M +MAF+T+ G++   W+   L Y  QTPE+ 
Sbjct: 733 RVDGSSIQSEEVWAGITYSLSAMMIEKGMDEMAFKTSEGLFHSIWNRFPLQY--QTPEAI 790

Query: 833 NNNDEYRSLCYMRPLTIWAMQWALTKPK 860
             +  YR+L YMRPL+IWA+Q AL K K
Sbjct: 791 TADGMYRALGYMRPLSIWAIQHALDKKK 818


>gi|341899054|gb|EGT54989.1| hypothetical protein CAEBREN_31176 [Caenorhabditis brenneri]
          Length = 831

 Score =  436 bits (1122), Expect = e-119,   Method: Compositional matrix adjust.
 Identities = 284/814 (34%), Positives = 407/814 (50%), Gaps = 77/814 (9%)

Query: 72  VPVFNVFRKHHITSDQGIPLGGIGAGSIGRSYRGEFQRFKLFHGICDDAPVL-ANQF--S 128
           + +FNVFR    T   G+PLGGIG+GSIG  +RG F RF +  GI +       NQF  S
Sbjct: 64  IDIFNVFRHFTFT---GVPLGGIGSGSIGTDFRGGFNRFSIIPGIKEQTDTQKCNQFIAS 120

Query: 129 VFVSRPNGEKFSSVL-CPRSPGVPKKNTDSGIESWDWNLKGENCTYHALFPRAWTVYDGE 187
           V   +     + S+L C   P        + +  WD  +  E+  Y  LFPRAW  +   
Sbjct: 121 VHSKKTFELIYQSILSCAEFPS-------TVLPKWDSTIPAEDVRYRGLFPRAWQEFR-L 172

Query: 188 PDPELRIVCRQISPFIPHNYKESSFPASVFTFTLSNSGQTSADVTLLFTWANSVAGDSGL 247
               + +V   +SP IP++Y +SS P + F F + N      +V++   + N        
Sbjct: 173 GSSGITVVIEHLSPVIPNDYSDSSLPLANFDFHVFNDSPEEVEVSITMAFRNGTGNRKWN 232

Query: 248 SGHHFNSKTMTKDGVHGLTLHHRTANGRPPVTFAVAAEETADVHVSECPCFLLSGNSKGI 307
             H   S+ + K+     TL H T  G  PVT+A+  EE     V+ C   L   N  G 
Sbjct: 233 DEHLCTSQKVQKETQVVRTLSH-TVKGM-PVTYAIGTEEKNGAKVTTC---LFDPNGTG- 286

Query: 308 TAKDMWNEIKKHGSFDHLDNDKTSPSEPGSSIGAAIAASLTIPSGSTRSVTFSLAWDCPE 367
               +W++++ +G     DN    P E    +G A+ +S  +P   + +  FSL+W  PE
Sbjct: 287 --GRLWSDLEAYGHLSSYDNLPPKPKE----LGIAVCSSFFVPPDGSHTTQFSLSWYMPE 340

Query: 368 VKF--FEKVYHRRYTKFYG--TLGDSAARIAHDAILEHAKWECEIEAWQRPILEDKRFPE 423
           V F   E+ Y+RRY +F+      + AA I    +    KW+  IE WQ PI+ +++ PE
Sbjct: 341 VHFGTAERFYNRRYCRFFNGPDADEVAAAICRHGLQNIEKWQEAIEKWQNPIINNQKLPE 400

Query: 424 WYPITLFNELYYLNAGGTIWTDGSPPMQSLATIRERKFSLDTPSKIHTPASSDTALGTRL 483
           WY   +FNELYY+  G T+W +  P  +                       ++  L +  
Sbjct: 401 WYRSAIFNELYYIVDGSTVWFEYDPDWR-----------------------TEENLISEQ 437

Query: 484 LENGEENIGQFLYLEGAEYVMYNTYDVHFYSSFALVMLFPKLELSIQRDFAAAVMMHDPG 543
            E      G+F Y+E  EY M NTYDVHFYSS+A++  +P++E+S+Q DFA  V   D G
Sbjct: 438 TEKQFREYGRFGYMESWEYFMLNTYDVHFYSSWAILKNWPEIEMSMQLDFADQVDRIDNG 497

Query: 544 TMKIMSDGKWVARKCLGAVPHDIG-------------LDDPWFEINSYNLFNSSRWKDLN 590
           T   ++DG  +  K    +PHD+G             + DPW   N+Y L ++  WKDLN
Sbjct: 498 TATSLADGDQMLIKSYDRIPHDMGHPTKYCIQEHRLSVADPWLHTNAYILHDTGSWKDLN 557

Query: 591 SKFVLQVYRDFVATG------DKNFARAVWPSVYIAMAYMEQFDKDGDGMIENEGFPDQT 644
            KFV+  YRD+          D+     +     I    +E +DKD DGMIEN+GF DQT
Sbjct: 558 LKFVISCYRDYKMIAKDREDKDQLLEFFLGKCTKIVEDALESWDKDKDGMIENDGFADQT 617

Query: 645 YDAWSANGVSAYCGGLWVAALQAASALANDVGDHASASYFWVRYQKAKAVYDSLWNGSYF 704
           YD W   G SAYCG LW+AAL     +    G   S +Y         A    LWNG +F
Sbjct: 618 YDVWKMTGTSAYCGSLWLAALTCYIEMLKQAGS-PSKNYEGKLLNAYDAYITKLWNGRFF 676

Query: 705 NYDNSDGSSSTSIQADQLAGQWYARACGLLPIADEAKVKKALTKIYDFNVLKVKGGMCGA 764
            +D    +S   + ADQL G W   A        E K+K AL  I+ +NV     G CGA
Sbjct: 677 KFDELPDNSKI-VMADQLCGFWALTAMDEPVQVSEEKMKLALETIFKYNVQMYNDGKCGA 735

Query: 765 MNGMQPDGRIDMSGLQAREIWPGVTYGLAASMIQEEMVDMAFQTAAGVYEVAWSEDGLGY 824
           +NG     R+D S +Q+ E+W G+TY L+A MI++ M +MAF+T+ G++E  W+   L Y
Sbjct: 736 VNGFLTSERVDGSSIQSEEVWAGITYALSAMMIEKNMDEMAFKTSEGLFESIWNRFPLQY 795

Query: 825 SFQTPESWNNNDEYRSLCYMRPLTIWAMQWALTK 858
             QTPE+  ++  YR+L YMRPL+IWA+Q AL +
Sbjct: 796 --QTPEAITSDGMYRALGYMRPLSIWAIQHALDR 827


>gi|326528739|dbj|BAJ97391.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 424

 Score =  436 bits (1122), Expect = e-119,   Method: Compositional matrix adjust.
 Identities = 233/418 (55%), Positives = 297/418 (71%), Gaps = 4/418 (0%)

Query: 10  ENAASMPKVDPGQPPQLTWQRKLDSNVKVPTGFTLSFQEFRHLAHIGFRLYRYSKQEEAK 69
           E   +   ++  QP   TWQRK D   K    F+++  +   +  +  +    +  +  K
Sbjct: 7   EQTKARSHINCAQPATRTWQRKFDDEGKKIELFSMTMNDMISIIPMILKGLMINADQRGK 66

Query: 70  GKVPVFNVFRKHHITSDQGIPLGGIGAGSIGRSYRGEFQRFKLFHGICDDAPVLANQFSV 129
           G+  +++ FRK      +G+PLGG+GAGSIGRSYRG FQ F++F  + ++ P+LANQFS 
Sbjct: 67  GRDILYDPFRKWMDNCYRGLPLGGLGAGSIGRSYRGYFQHFQIFPALYEERPILANQFSA 126

Query: 130 FVSRPNGEKFSSVLCPRSPGVPKKNTDSGIESWDWNLKGENCTYHALFPRAWTVYDGEPD 189
           FVSRPNG+ +S+VL   +    K     GI SWDW LK +NCTYHALFPR+WTVYDGEPD
Sbjct: 127 FVSRPNGKSYSTVLSAPTADAFKGVDKVGIGSWDWKLKEKNCTYHALFPRSWTVYDGEPD 186

Query: 190 PELRIVCRQISPFIPHNYKESSFPASVFTFTLSNSGQTSADVTLLFTWANSVAGDSGLSG 249
           PE+ I CRQISP IPHNYKESSFP +VFTFT+ NSG T+ADVTLLFTW NSV G S L+G
Sbjct: 187 PEINITCRQISPIIPHNYKESSFPVAVFTFTVQNSGSTAADVTLLFTWTNSVGGRSELTG 246

Query: 250 HHFNSKTMTKDGVHGLTLHHRTANGRPPVTFAVAAEETADVHVSECPCFLL-SGNSKGIT 308
           +H NSK + +DGV G+ L HRTA+G PPVTFA+A++ET +V V++CP F + S +S   T
Sbjct: 247 NHTNSKMIERDGVQGVLLRHRTADGHPPVTFAIASQETEEVRVTDCPFFTMGSSDSGDFT 306

Query: 309 AKDMWNEIKKHGSFDHLDNDKT-SPSEPGSSIGAAIAASLTIPSGSTRSVTFSLAWDCPE 367
           AKDMW EIK+HGSF     DK    S+PGSSIGAA+AA+ T+P+G TR V+F+L+W CPE
Sbjct: 307 AKDMWEEIKQHGSFSETRTDKEPRASKPGSSIGAAVAAATTVPAGGTRVVSFALSWSCPE 366

Query: 368 VKFFE-KVYHRRYTKFYGTLGDSAAR-IAHDAILEHAKWECEIEAWQRPILEDKRFPE 423
           VKF + K YHRRYTKF G  GD+AA  +AHDA+LEH  WE +IE WQRPIL+DKR PE
Sbjct: 367 VKFPDGKTYHRRYTKFCGLDGDAAAESLAHDALLEHMDWESKIEEWQRPILQDKRLPE 424


>gi|358254610|dbj|GAA55931.1| non-lysosomal glucosylceramidase [Clonorchis sinensis]
          Length = 1200

 Score =  436 bits (1121), Expect = e-119,   Method: Compositional matrix adjust.
 Identities = 293/871 (33%), Positives = 436/871 (50%), Gaps = 75/871 (8%)

Query: 28  WQRKLDSNVKV---PTGFTLSFQEFRHLAHIGFRL-YR-YSKQEEAKGKVPVFNVFRKHH 82
           W+ + D   +V   P GF     + +    + FR  +R Y ++   + ++P  +  +   
Sbjct: 24  WKARFDRKPEVQCKPFGFP-RLSQIKSFIPMAFRYTFRFYLRKRLIEKRLPFLDPSQHVP 82

Query: 83  ITSDQGIPLGGIGAGSIGRSYRGEFQRFKLFHGICDDAPVLANQFSVFVSRPNGEKFSSV 142
                G+P+GGIG G+IGR +RGEF R  L  G+    P  A+QF + + +     F  V
Sbjct: 83  WKPIYGVPMGGIGCGAIGRGFRGEFVRSSLIPGMYSYDPQPADQFILTIRKGGETLFHQV 142

Query: 143 LCPRSPGVPKKNTDSGIESWDWNLKGENCTYHALFPRAWTVYDGEPDPELRIVCRQISPF 202
           L P++  VP     +G+ SW+W    ++  Y  L+PR+WTVY+      L +VC Q+SP 
Sbjct: 143 LSPQT-SVPA--FANGLRSWNWGFSAQHAHYIGLYPRSWTVYEVA-HFNLILVCHQVSPV 198

Query: 203 IPHNYKESSFPASVFTFTLSNSGQTSADVTLLFTW-ANSVAGDSGLSGHHFNSKTMTKDG 261
           +PH+YK S  P  VF +T+ N G  + +VT+ F+W   S A  S  + ++  S     +G
Sbjct: 199 LPHDYKASCLPTCVFVWTVFNFGSEALEVTISFSWHGPSPAPRSAPNSNNQVSNHKQPNG 258

Query: 262 VHGLTLHHRTANGRPPVTFAVAAEETADVHVSECPCFLLSGNSKGITAKDMWNEIKKHGS 321
           +  + L       + PV   ++A     +H+   P   +S  S G T  + W  +++   
Sbjct: 259 LSSVELTQPLV--KEPVADYLSARIQFAMHLLVGPSGYISYVS-GYTITN-WPTLRRRTP 314

Query: 322 FDHLDNDKTSPSEPGSSIGAAIAASLTIPSGSTRSVTFSLAWDCPEVKFF--EKVYHRRY 379
              +    T    P  S   A A++  +       + F++ W  P V F   + +Y RRY
Sbjct: 315 KLAVMVSATCIVPPARS---ANASAGDLSEAGQSQLEFAVVWHSPLVWFRGNKVIYRRRY 371

Query: 380 TKFY---GTLGDSAARIAHDAILEHAKWECEIEAWQRPILEDKRFPEWYPITLFNELYYL 436
            +++   G LG  A R+   A+     W   IE WQ P+L +  FP+WY   LFNELYYL
Sbjct: 372 ARWFPEPGMLG--AERLLESALSNWKDWVERIEQWQNPVLHNSSFPDWYKSALFNELYYL 429

Query: 437 NAGGTIWTDGSPPMQSLATIRERKFSLDTPSKIHTPASSDTALGTRLLENGEENIGQFLY 496
             GGT+W D  P     A  + + +                    +      + +G F Y
Sbjct: 430 TDGGTVWLD--PNHHGGADFKWQSW--------------------KYRSRVAQEMGLFAY 467

Query: 497 LEGAEYVMYNTYDVHFYSSFALVMLFPKLELSIQRDFAAAVMMHDPGTMKIMSDGKWVAR 556
           LEG EY MYNT DVH+ SS+AL+ L+PKL+L++  D A   +  D   +  +  G++  R
Sbjct: 468 LEGHEYRMYNTLDVHYNSSWALIKLWPKLQLAVLLDCADLAIEEDQTQLYFIHQGRYGIR 527

Query: 557 KCLGAVPHDIG--LDDPWFEINSYNLFNSSRWKDLNSKFVLQVYRDFVATGDKNFARAVW 614
               AVPHD G    +PW + N+Y ++ +  WKDLN KFVLQV+RD+  T D ++   + 
Sbjct: 528 STESAVPHDFGDPEGEPWRDANAYVMYPTKDWKDLNPKFVLQVWRDWKLTQDNDYLLYML 587

Query: 615 PSV-YIAMAYMEQFDKDGDGMIENEGFPDQTYDAWSANGVSAYCGGLWVAALQAASAL-- 671
           P V  I    ++ +D +GDGMIEN GFPDQTYD W+A G+SAY GGLW+A L +A  +  
Sbjct: 588 PIVNRIIQVSLQSWDSNGDGMIENSGFPDQTYDTWNAQGISAYTGGLWLACLYSACEMLE 647

Query: 672 ----AND------VGDHASASYFWVRYQK--------AKAVYD-SLWNGSYFNYDNSDGS 712
               AN       +  ++     W + QK        AK VYD +LW G+Y+ Y N+   
Sbjct: 648 YALSANSPLKAFLIASNSEDEVSWPQVQKELQILLESAKTVYDRALWTGTYYIYQNTPVG 707

Query: 713 SSTSIQADQLAGQWYARACGLLP--IADEAKVKKALTKIYDFNVLKVKGGMCGAMNGMQP 770
           +  +I ADQL+G  + R  G  P  I     V  AL  I   N   ++ G  G +NG  P
Sbjct: 708 NHEAIMADQLSGHGFLRVGGAPPGAILPSEHVIMALKTIQSTNWESIQNGSLGVINGGLP 767

Query: 771 DGRIDMSGLQAREIWPGVTYGLAASMIQEEMVDMAFQTAAGVYEVAWSEDGLGYSFQTPE 830
             + D+S +QA E W GV Y LAA+MI E M    F  A   Y   ++  GL Y  QTPE
Sbjct: 768 GAKRDLSNVQAEEFWVGVNYSLAATMILEGMSQEGFSLAEACYNTIYNRFGLQY--QTPE 825

Query: 831 SWNNNDEYRSLCYMRPLTIWAMQWALTKPKL 861
           ++  +  +R   YMRPL IW++Q AL   +L
Sbjct: 826 AYMIDGRFRCPGYMRPLAIWSIQQALETKQL 856


>gi|355689821|gb|AER98956.1| glucosidase, beta 2 [Mustela putorius furo]
          Length = 550

 Score =  434 bits (1116), Expect = e-118,   Method: Compositional matrix adjust.
 Identities = 233/531 (43%), Positives = 309/531 (58%), Gaps = 35/531 (6%)

Query: 330 TSPSEPGSSIGAAIAASLTIPSGSTRSVTFSLAWDCPEVKFFEK--VYHRRYTKFYGTLG 387
           ++PS+ G  I  A+  +  +P      + FSLAWD P + F  K  VY+RRYT+F+G  G
Sbjct: 1   STPSQKGVGIAGAVCVTGKLPPRGQCRLEFSLAWDMPRIMFGAKGQVYYRRYTRFFGRDG 60

Query: 388 DSAARIAHDAILEHAKWECEIEAWQRPILEDKRFPEWYPITLFNELYYLNAGGTIWTDGS 447
           ++A  ++H A+  +  WE +I AWQ P+L+D+  P WY   LFNELY+L  GGT+W    
Sbjct: 61  NAAPALSHYALCRYIDWEEKISAWQSPVLDDRSLPAWYKSALFNELYFLADGGTVW---- 116

Query: 448 PPMQSLATIRERKFSLDTPSKIHTPASSDTALGTRL-LENGEENIGQFLYLEGAEYVMYN 506
                          L+ P      +  D  +G+   L    +  G+F YLEG EY MYN
Sbjct: 117 ---------------LEVPED----SLPDELVGSMCQLRPILQEYGRFGYLEGQEYRMYN 157

Query: 507 TYDVHFYSSFALVMLFPKLELSIQRDFAAAVMMHDPGTMKIMSDGKWVARKCLGAVPHDI 566
           TYDVHFY+SFAL+ML+PKLELS+Q D A A +  D    + +  G     K    +PHDI
Sbjct: 158 TYDVHFYASFALIMLWPKLELSLQYDMALATLREDLTRRQYLMSGVMAPVKRRNVIPHDI 217

Query: 567 G--LDDPWFEINSYNLFNSSRWKDLNSKFVLQVYRDFVATGDKNFARAVWPSVYIAMAYM 624
           G   D+PW  +N+Y + +++ WKDLN KFVLQVYRD+  TGD++F R +WP     M   
Sbjct: 218 GDPDDEPWLRVNAYVVHDTADWKDLNLKFVLQVYRDYHLTGDQSFLRDMWPVCLAVMESE 277

Query: 625 EQFDKDGDGMIENEGFPDQTYDAWSANGVSAYCGGLWVAALQAASALANDVGDHASASYF 684
            +FDKD DG+IEN G+ DQTYD W   G SAYCGGLW+AA+     +A   G       F
Sbjct: 278 MKFDKDQDGLIENGGYADQTYDGWITTGPSAYCGGLWLAAVAVMVQMAVLCGAKDIQDKF 337

Query: 685 WVRYQKAKAVYDS-LWNGSYFNYDNSDGSSSTSIQADQLAGQWYARACGL----LPIADE 739
               ++ +  Y+  LWNG Y+NYD S    S SI +DQ AGQW+ +A GL      +   
Sbjct: 338 SSILRRGQEAYERLLWNGRYYNYDCSPQPQSCSIMSDQCAGQWFLKASGLGEGDTEVFPT 397

Query: 740 AKVKKALTKIYDFNVLKVKGGMCGAMNGMQPDGRIDMSGLQAREIWPGVTYGLAASMIQE 799
             V  AL  I++FNV    GG  GA+NGMQP G  D S +Q+ E+W GV YGLAA+MIQE
Sbjct: 398 QHVVCALQTIFEFNVRAFAGGAMGAVNGMQPHGVPDRSSVQSDEVWVGVVYGLAATMIQE 457

Query: 800 EMVDMAFQTAAGVYEVAWSEDGLGYSFQTPESWNNNDEYRSLCYMRPLTIW 850
            +    FQTA G Y   W    LG +FQTPE++     +RSL YMRPL+IW
Sbjct: 458 GLTWEGFQTAEGCYRTVWER--LGLAFQTPEAYCQQRVFRSLAYMRPLSIW 506


>gi|308471410|ref|XP_003097936.1| hypothetical protein CRE_12977 [Caenorhabditis remanei]
 gi|308239241|gb|EFO83193.1| hypothetical protein CRE_12977 [Caenorhabditis remanei]
          Length = 845

 Score =  430 bits (1105), Expect = e-117,   Method: Compositional matrix adjust.
 Identities = 284/818 (34%), Positives = 421/818 (51%), Gaps = 71/818 (8%)

Query: 72  VPVFNVFRKHHITSDQGIPLGGIGAGSIGRSYRGEFQRFKLFHGICDDAPVL-ANQF--S 128
           + +FNVFR    T   G+PLGGIG+GSIG  +RG F RF +  GI +       NQF  S
Sbjct: 64  IDIFNVFRHFTYT---GVPLGGIGSGSIGTDFRGGFNRFSIIPGIKEQTETQKCNQFIAS 120

Query: 129 VFVSRPNGEKFSSVL-CPRSPGVPKKNTDSGIESWDWNLKGENCTYHALFPRAWTVYDGE 187
           V   +     + S+L C   P        + +  WD  +  E+  Y  LFPR W  +   
Sbjct: 121 VHSKKTFELIYQSILSCAEFPA-------TVLPKWDTTIPAEDVRYRGLFPRGWQEFR-L 172

Query: 188 PDPELRIVCRQISPFIPHNYKESSFPASVFTFTLSNSGQTSADVTLLFTWANSVAGDSGL 247
               + +V   +S  IP +Y +SS P + F F + N      +V++   + N        
Sbjct: 173 GSSGITVVIESLSSVIPGDYSDSSLPLANFDFHVFNDSPEEVEVSITLAFRNGTGNRKWN 232

Query: 248 SGHHFNSKTMTKDGVHGLTLHHRTANGRPPVTFAVAAEETADVHVSECPCFLLSGNSKGI 307
             H   S+ + +D +   TL H T  G  PVT+A+  EE     V+ C   L   N  G 
Sbjct: 233 DEHLCQSQKVQRDTMVVRTLSH-TVKGM-PVTYAIGTEEKNGAKVTTC---LFDPNGTG- 286

Query: 308 TAKDMWNEIKKHG---SFDHLDNDKTSPSEPGSSIGAAIAASLTIPSGSTRSVTFSLAWD 364
               +W++++ +G   S+DHL      P +P   +G A+ +S  +P   + +  FSL+W 
Sbjct: 287 --GRLWSDLEAYGHLSSYDHL------PPKP-KELGIAVCSSFFVPPDGSHTTQFSLSWY 337

Query: 365 CPEVKF--FEKVYHRRYTKFYG--TLGDSAARIAHDAILEHAKWECEIEAWQRPILEDKR 420
            P+V F   E+ Y+RRY +F+      + AA I    +   +KW+  IE WQ P++ D++
Sbjct: 338 MPQVHFGTAERFYNRRYCRFFNGPDADEVAASICRHGLQNISKWQEAIEKWQTPVINDEK 397

Query: 421 FPEWYPITLFNELYYLNAGGTIWTDGSPPM---QSLATIRERKFSLDTPSKIHTPASSDT 477
            P+WY   +FNELYY+  G T+W +  P     +SL +    K   +     +      +
Sbjct: 398 LPDWYRSAIFNELYYIVDGSTVWFEYDPEWKTDESLISEHTEKHFKEYGRFGYMECEFFS 457

Query: 478 ALGTRLLENGEENIGQFLYLEGAEYVMYNTYDVHFYSSFALVMLFPKLELSIQRDFAAAV 537
            +  R L         F +L   EY M NTYDVHFYSS+A++  +P++E+S+Q DFA  V
Sbjct: 458 LVELRTL---------FFFLTAWEYFMLNTYDVHFYSSWAILKNWPQIEMSMQLDFADQV 508

Query: 538 MMHDPGTMKIMSDGKWVARKCLGAVPHDIG-------------LDDPWFEINSYNLFNSS 584
              D GT   ++DG+ ++ K    +PHD+G             + DPW   N+Y L ++ 
Sbjct: 509 DRIDNGTATSLADGEQMSIKSYDRIPHDMGHPTKYCIQEHRLSVADPWLHTNAYILHDTG 568

Query: 585 RWKDLNSKFVLQVYRDF--VATGDKNFAR-AVWPSVYIAMAYMEQFDKDGDGMIENEGFP 641
            WKDLN KFV+  YRD+  ++ GD+      +     I    +E +DKD DGMIEN+GF 
Sbjct: 569 CWKDLNLKFVISCYRDWKMISNGDQEILEFFIGKCTKIVDGALESWDKDQDGMIENDGFA 628

Query: 642 DQTYDAWSANGVSAYCGGLWVAALQAASALANDVGDHASASYFWVRYQKAKAVYDS-LWN 700
           DQTYD W   G SAYCG LW+AAL     +    G  +   ++  +   A   Y + LWN
Sbjct: 629 DQTYDVWKMTGTSAYCGSLWLAALTCYIQMLKKAG--SPTKFYDEKLLMAYEAYTTKLWN 686

Query: 701 GSYFNYDNSDGSSSTSIQADQLAGQWYARACGLLPIADEAKVKKALTKIYDFNVLKVKGG 760
           G +F +D    +S   + ADQL G W   A        E K+K AL  I+ +NV    GG
Sbjct: 687 GKFFKFDELPDNSKI-VMADQLCGFWALTAMDEPVQVSEEKMKSALETIFKYNVQMYDGG 745

Query: 761 MCGAMNGMQPDGRIDMSGLQAREIWPGVTYGLAASMIQEEMVDMAFQTAAGVYEVAWSED 820
            CGA+NG     R+D S +Q+ E+W G+TY L+A MI++ M +MAF+T+ G+++  W+  
Sbjct: 746 KCGAVNGYLTSERVDGSSIQSEEVWAGITYALSAMMIEKGMDEMAFKTSEGLFDSIWNRF 805

Query: 821 GLGYSFQTPESWNNNDEYRSLCYMRPLTIWAMQWALTK 858
            L   FQTPE+  ++  YR+L YMRPL+IWA+Q AL K
Sbjct: 806 PL--QFQTPEAITSDGMYRALGYMRPLSIWAIQHALEK 841


>gi|443669591|ref|ZP_21134796.1| hypothetical protein C789_5336 [Microcystis aeruginosa DIANCHI905]
 gi|159029778|emb|CAO87856.1| unnamed protein product [Microcystis aeruginosa PCC 7806]
 gi|443330115|gb|ELS44858.1| hypothetical protein C789_5336 [Microcystis aeruginosa DIANCHI905]
          Length = 784

 Score =  424 bits (1091), Expect = e-116,   Method: Compositional matrix adjust.
 Identities = 265/803 (33%), Positives = 400/803 (49%), Gaps = 93/803 (11%)

Query: 87  QGIPLGGIGAGSIGRSYRGEFQRFKLFHGICDDAPVLANQFSVFVSRPNGEKFSSVLCPR 146
            G+PLGG GAG IGRS +G+F  + L  G      + A QFS+F    +G   +  +   
Sbjct: 41  HGMPLGGFGAGCIGRSPKGDFNLWHLDGGEHSFNSLAACQFSIFEQTEDGNAQAYAMATE 100

Query: 147 SPGVPKKNTDSGIESWDWNLKGENCTYHALFPRAWTVYDGEPDPELRIVCRQISPFIPHN 206
           SP       D  +  W W    E  TY AL+PR+W  Y+ +   + +++C Q SP   HN
Sbjct: 101 SP------KDGTLSRWSW-YPAEKGTYSALYPRSW--YEYKDVFQTQLICEQFSPVWAHN 151

Query: 207 YKESSFPASVFTFTLSNSGQTSADVTLLFTWANSVA-------------GDSGLSGHHFN 253
           Y+ESS+P ++F +T+ N       ++++ TW N++               D G   + + 
Sbjct: 152 YQESSYPIALFEWTVHNPTDKPITISIMLTWQNTIGWFTNAIKSPVVKVRDDGSPVYEYQ 211

Query: 254 SKTMTKDGVH----------GLTLHHRTAN-GRPPVTFAVAAEETADVHVSECPCFLLSG 302
           SK    +G +          G  +     N G      A+A+     + V     F L  
Sbjct: 212 SKWGESEGNYNQWIVDNFRVGCLMRRDYENPGEGDGQMAIASITNPSLEV-----FYLGK 266

Query: 303 NSKGITAKDMWNEIKKHGSFDHLDNDKTSPSEPGSSIGAAIAASLTIPSGSTRSVTFSLA 362
            +      ++W+    +GS   ++++  +P+E G     A+A   TI  G TR + F LA
Sbjct: 267 WNPAGDGGEVWDYFAMNGSLPDIEDE--TPAEKGKQTATAMAIRFTIRPGKTRKIPFILA 324

Query: 363 WDCPEVKFFEKV-YHRRYTKFYGTLGDSAARIAHDAILEHAKWECEIEAWQRPILEDKRF 421
           WD P  +F + + Y+RRYT F+G  G +A  +   A+ +  KW   I  WQ+PILE    
Sbjct: 325 WDLPVTEFAQGINYYRRYTDFFGRNGQNAWAMVRTALKQGDKWRENIINWQKPILERIDL 384

Query: 422 PEWYPITLFNELYYLNAGGTIWTDGSPPMQSLATIRERKFSLDTPSKIHTPASSDTALGT 481
            +W+ + LFNELY L  GGT+WT  +                                  
Sbjct: 385 ADWFKMALFNELYLLTDGGTLWTAATE--------------------------------- 411

Query: 482 RLLENGEENIGQFLYLEGAEYVMYNTYDVHFYSSFALVMLFPKLELSIQRDFAAAVMMHD 541
                 ++ IGQF  LE  +Y  Y + DV  Y SF L+ML+P+LE ++   FA A+   D
Sbjct: 412 ------QDPIGQFGVLECIDYRWYESLDVRLYGSFGLLMLWPRLEKAVMEAFARAIPSSD 465

Query: 542 PGTMKIMSDGKWVARKCLGAVPHDIGL--DDPWFEINSYNLFNSSRWKDLNSKFVLQVYR 599
                I  +     RK   A PHD+G   + PW + N  +  + + WKDL S FVL VYR
Sbjct: 466 DTPRIIGYNQASAIRKAKNATPHDLGAPNEHPWQKSNYTSYQDCNLWKDLGSDFVLLVYR 525

Query: 600 DFVATG--DKNFARAVWPSVYIAMAYMEQFDKDGDGMIENEGFPDQTYDAWSANGVSAYC 657
           D++ TG  D++F R  WPS+ + + Y++ FD D DG+ EN G PDQT+D W   G+SAYC
Sbjct: 526 DYLLTGAKDEDFLRECWPSIVLTLQYLKTFDLDNDGIPENSGAPDQTFDDWRLQGISAYC 585

Query: 658 GGLWVAALQAASALANDVGDHASASYFWVRYQKAKAVY-DSLWNGSYFNYDNSDGSSSTS 716
           GGLW+AAL+A   +   +G+  +    W+  Q+++A+Y  +LWNG Y+N D+  GS S  
Sbjct: 586 GGLWIAALEATLKIGEIIGEDTAIFTSWL--QQSRAIYHQTLWNGEYYNLDS--GSGSDV 641

Query: 717 IQADQLAGQWYARACGLLPIADEAKVKKALTKIYDFNVLKVKGGMCGAMNGMQPDGRIDM 776
           +  DQL GQ+YAR   L  + +    + AL+KIY    LK  GG  GA NG++PDG  + 
Sbjct: 642 VMTDQLCGQFYARLLALPDVVENQYTQSALSKIYQACFLKFHGGKFGAANGLKPDGTPEN 701

Query: 777 -SGLQAREIWPGVTYGLAASMIQEEMVDMAFQTAAGVYEVAWSEDGLGYSFQTPESWNNN 835
                  E+W G+ +GL A ++Q  M    +Q    V    + E+GL   F+TPE+  + 
Sbjct: 702 PEATHPLEVWVGINFGLVAFLLQMGMEKQGWQITEAVVRQVY-ENGL--QFRTPEAITSV 758

Query: 836 DEYRSLCYMRPLTIWAMQWALTK 858
             +R+  Y+R + IWA+   LT+
Sbjct: 759 GTFRACHYLRAMAIWAIYGVLTE 781


>gi|425435596|ref|ZP_18816046.1| Similar to tr|P73619|P73619 [Microcystis aeruginosa PCC 9432]
 gi|389679839|emb|CCH91411.1| Similar to tr|P73619|P73619 [Microcystis aeruginosa PCC 9432]
          Length = 784

 Score =  424 bits (1089), Expect = e-115,   Method: Compositional matrix adjust.
 Identities = 264/803 (32%), Positives = 400/803 (49%), Gaps = 93/803 (11%)

Query: 87  QGIPLGGIGAGSIGRSYRGEFQRFKLFHGICDDAPVLANQFSVFVSRPNGEKFSSVLCPR 146
            G+PLGG GAG IGRS +G+F  + L  G      + A QFS+F    +G   +  +   
Sbjct: 41  HGMPLGGFGAGCIGRSPKGDFNLWHLDGGEHSFNSLAACQFSIFEQTEDGNAQAYAMATE 100

Query: 147 SPGVPKKNTDSGIESWDWNLKGENCTYHALFPRAWTVYDGEPDPELRIVCRQISPFIPHN 206
           SP      TD  +  W W    E   Y AL+PR+W  Y+ +   + +++C Q SP   HN
Sbjct: 101 SP------TDGTLSRWSW-YPAEKGNYSALYPRSW--YEYKDVFQTQLICEQFSPVWAHN 151

Query: 207 YKESSFPASVFTFTLSNSGQTSADVTLLFTWANSVA-------------GDSGLSGHHFN 253
           Y+ESS+P ++F +T  N       ++++ TW N++               D G   + + 
Sbjct: 152 YQESSYPIALFEWTAHNPTDKPITISIMLTWQNTIGWFTNAIKSPVVKVRDDGSPVYEYQ 211

Query: 254 SKTMTKDGVH----------GLTLHHRTAN-GRPPVTFAVAAEETADVHVSECPCFLLSG 302
           SK    +G +          G  +     N G      A+A+     + V     F L  
Sbjct: 212 SKWGESEGNYNQWIVDNFRVGCLMRRDYENPGEGDGQMAIASITNPSLEV-----FYLGK 266

Query: 303 NSKGITAKDMWNEIKKHGSFDHLDNDKTSPSEPGSSIGAAIAASLTIPSGSTRSVTFSLA 362
            +      ++W+    +GS   ++++  +P++ G     A+A   TI  G TR + F LA
Sbjct: 267 WNPAGDGGEVWDYFAMNGSLPDIEDE--TPAQKGEQTATAMAIRFTIRPGKTRKIPFILA 324

Query: 363 WDCPEVKFFEKV-YHRRYTKFYGTLGDSAARIAHDAILEHAKWECEIEAWQRPILEDKRF 421
           WD P  +F + + Y+RRYT F+G  G +A  +   A+ +   W   I  WQ+PILE    
Sbjct: 325 WDLPVTEFAQGINYYRRYTDFFGRNGQNAWAMVRTALKQGDTWRENIINWQKPILERSDL 384

Query: 422 PEWYPITLFNELYYLNAGGTIWTDGSPPMQSLATIRERKFSLDTPSKIHTPASSDTALGT 481
            +W+ + LFNELY L  GGT+WT  +                                  
Sbjct: 385 ADWFKMALFNELYLLTDGGTLWTAATE--------------------------------- 411

Query: 482 RLLENGEENIGQFLYLEGAEYVMYNTYDVHFYSSFALVMLFPKLELSIQRDFAAAVMMHD 541
                 ++ IGQF  LE  +Y  Y + DV  Y SF L+ML+P+LE ++   FA A+  HD
Sbjct: 412 ------QDPIGQFGVLECIDYRWYESLDVRLYGSFGLLMLWPRLEKAVMEAFARAIPTHD 465

Query: 542 PGTMKIMSDGKWVARKCLGAVPHDIGL--DDPWFEINSYNLFNSSRWKDLNSKFVLQVYR 599
                I  +     RK   A PHD+G   + PW + N  +  + + WKDL S FVL VYR
Sbjct: 466 DTPRIIGYNQASAIRKAKNATPHDLGAPNEHPWQKSNYTSYQDCNLWKDLGSDFVLLVYR 525

Query: 600 DFVATG--DKNFARAVWPSVYIAMAYMEQFDKDGDGMIENEGFPDQTYDAWSANGVSAYC 657
           D++ TG  D++F R  WPS+ + + Y++ FD D DG+ EN G PDQT+D W   G+SAYC
Sbjct: 526 DYLLTGAKDEDFLRECWPSIVLTLQYLKTFDLDNDGIPENSGAPDQTFDDWRLQGISAYC 585

Query: 658 GGLWVAALQAASALANDVGDHASASYFWVRYQKAKAVY-DSLWNGSYFNYDNSDGSSSTS 716
           GGLW+AAL+A   +   +G+  +    W+  Q+++A+Y  +LWNG Y+N D+  GS S  
Sbjct: 586 GGLWIAALEATLKIGAILGEDTAIFASWL--QQSRAIYHQTLWNGEYYNLDS--GSGSDV 641

Query: 717 IQADQLAGQWYARACGLLPIADEAKVKKALTKIYDFNVLKVKGGMCGAMNGMQPDGRIDM 776
           +  DQL GQ+YAR   L  + +    + AL+KIY+   LK  GG  GA NG++PDG  + 
Sbjct: 642 VMTDQLCGQFYARLLALPDVVENQYTQSALSKIYEACFLKFHGGKFGAANGLKPDGTPEN 701

Query: 777 -SGLQAREIWPGVTYGLAASMIQEEMVDMAFQTAAGVYEVAWSEDGLGYSFQTPESWNNN 835
                  E+W G+ +GL A ++Q  M    +Q    V    + E+GL   F+TPE+  + 
Sbjct: 702 PEATHPLEVWVGINFGLVAFLLQMGMEKQGWQITEAVVRQIY-ENGL--QFRTPEAITSV 758

Query: 836 DEYRSLCYMRPLTIWAMQWALTK 858
             +R+  Y+R + IWA+   LT+
Sbjct: 759 GTFRACHYLRAMAIWAIYGVLTE 781


>gi|224028315|gb|ACN33233.1| unknown [Zea mays]
          Length = 384

 Score =  423 bits (1087), Expect = e-115,   Method: Compositional matrix adjust.
 Identities = 206/343 (60%), Positives = 269/343 (78%), Gaps = 1/343 (0%)

Query: 485 ENGEENIGQFLYLEGAEYVMYNTYDVHFYSSFALVMLFPKLELSIQRDFAAAVMMHDPGT 544
           ++G E++G+FLYLEG EY+M+NTYDVHFY+SFAL+ LFPK+ELSIQRDFA AV+  D   
Sbjct: 20  KDGPEHVGKFLYLEGVEYIMWNTYDVHFYASFALLDLFPKIELSIQRDFANAVLYEDRRK 79

Query: 545 MKIMSDGKWVARKCLGAVPHDIGLDDPWFEINSYNLFNSSRWKDLNSKFVLQVYRDFVAT 604
           +K ++DG    RK  GAVPHD+G  DPW E+N+YN+ ++S+WKDLN KFVLQ+YRDF AT
Sbjct: 80  VKFLADGTSGIRKVKGAVPHDLGTHDPWHEMNAYNIHDTSKWKDLNPKFVLQIYRDFAAT 139

Query: 605 GDKNFARAVWPSVYIAMAYMEQFDKDGDGMIENEGFPDQTYDAWSANGVSAYCGGLWVAA 664
           GD  F R VWP+V  AM YM+QFD+DGDG+IEN+GFPDQTYDAW+ +G+SAYCG LW+AA
Sbjct: 140 GDMQFGRDVWPAVCAAMDYMDQFDRDGDGLIENDGFPDQTYDAWTVHGISAYCGCLWLAA 199

Query: 665 LQAASALANDVGDHASASYFWVRYQKAKAVYDS-LWNGSYFNYDNSDGSSSTSIQADQLA 723
           LQAA+ +A+ +GD   A  + +++ KAKAVY++ LWNGSYFNYD+   S+S SIQADQLA
Sbjct: 200 LQAAATMAHRLGDRHFAEKYKLKFIKAKAVYEAKLWNGSYFNYDSGTSSNSRSIQADQLA 259

Query: 724 GQWYARACGLLPIADEAKVKKALTKIYDFNVLKVKGGMCGAMNGMQPDGRIDMSGLQARE 783
           GQWYA + GL P+ DE K++ AL KI++FNV+KVKGG  GA+NGM P G++D + +Q+RE
Sbjct: 260 GQWYAASSGLPPLFDEHKIRTALQKIFEFNVMKVKGGRMGAVNGMTPKGKVDETCMQSRE 319

Query: 784 IWPGVTYGLAASMIQEEMVDMAFQTAAGVYEVAWSEDGLGYSF 826
           IW GVTY +AA+M+   M    F TA G++   WSE+G GY+ 
Sbjct: 320 IWTGVTYAVAANMLLHGMEHQGFTTAEGIFTAGWSEEGYGYAL 362


>gi|425460273|ref|ZP_18839755.1| Similar to tr|P73619|P73619 [Microcystis aeruginosa PCC 9808]
 gi|389827051|emb|CCI21967.1| Similar to tr|P73619|P73619 [Microcystis aeruginosa PCC 9808]
          Length = 784

 Score =  422 bits (1086), Expect = e-115,   Method: Compositional matrix adjust.
 Identities = 265/803 (33%), Positives = 400/803 (49%), Gaps = 93/803 (11%)

Query: 87  QGIPLGGIGAGSIGRSYRGEFQRFKLFHGICDDAPVLANQFSVFVSRPNGEKFSSVLCPR 146
            G+PLGG GAG IGRS +G+F  + L  G      + A QFS+F    +G   +  +   
Sbjct: 41  HGMPLGGFGAGCIGRSPKGDFNLWHLDGGEHSFNSLAACQFSIFEQTEDGNAQAYAMATE 100

Query: 147 SPGVPKKNTDSGIESWDWNLKGENCTYHALFPRAWTVYDGEPDPELRIVCRQISPFIPHN 206
           SP      TD  +  W W    E  TY AL+PR+W  Y+ +   + +++C Q SP   HN
Sbjct: 101 SP------TDGTLSRWSW-YPAEKGTYSALYPRSW--YEYKDVFQTQLICEQFSPVWAHN 151

Query: 207 YKESSFPASVFTFTLSNSGQTSADVTLLFTWANSVA-------------GDSGLSGHHFN 253
           Y+ESS+P ++F +T  N       ++++ TW N++               D G   + + 
Sbjct: 152 YQESSYPIALFEWTAHNPTDKPITISIMLTWQNTIGWFTNAIKSPVVKVRDDGSPVYEYQ 211

Query: 254 SKTMTKDGVH----------GLTLHHRTAN-GRPPVTFAVAAEETADVHVSECPCFLLSG 302
           SK    +G +          G  +     N G      A+A+     + V     F L  
Sbjct: 212 SKWGESEGNYNQWIVDNFRVGCLMRRDYENPGEGDGQMAIASITNPSLEV-----FYLGK 266

Query: 303 NSKGITAKDMWNEIKKHGSFDHLDNDKTSPSEPGSSIGAAIAASLTIPSGSTRSVTFSLA 362
            +      ++W+    +GS   ++++  +P+E G     A+A   TI  G TR + F LA
Sbjct: 267 WNPAGDGGEVWDYFAMNGSLPDIEDE--TPAEKGEQTATAMAIRFTIRPGKTRKIPFILA 324

Query: 363 WDCPEVKFFEKV-YHRRYTKFYGTLGDSAARIAHDAILEHAKWECEIEAWQRPILEDKRF 421
           WD P  +F + + Y+RRYT F+G  G +A  +   A+ +   W   I  WQ+PILE    
Sbjct: 325 WDLPVTEFAQGINYYRRYTDFFGRNGQNAWAMVRTALKQGDTWRENIINWQKPILERSDL 384

Query: 422 PEWYPITLFNELYYLNAGGTIWTDGSPPMQSLATIRERKFSLDTPSKIHTPASSDTALGT 481
            +W+ + LFNELY L  GGT+WT  +                                  
Sbjct: 385 ADWFKMALFNELYLLTDGGTLWTAATE--------------------------------- 411

Query: 482 RLLENGEENIGQFLYLEGAEYVMYNTYDVHFYSSFALVMLFPKLELSIQRDFAAAVMMHD 541
                 ++ IGQF  LE  +Y  Y + DV  Y SF L+ML+P+LE ++   FA A+   D
Sbjct: 412 ------QDPIGQFGVLECIDYRWYESLDVRLYGSFGLLMLWPRLEKAVMEAFARAIPSSD 465

Query: 542 PGTMKIMSDGKWVARKCLGAVPHDIGL--DDPWFEINSYNLFNSSRWKDLNSKFVLQVYR 599
             T  I  +     RK   A PHD+G   + PW + N  +  + + WKDL S FVL VYR
Sbjct: 466 DTTRIIGYNQASAIRKAKNATPHDLGAPNEHPWQKSNYTSYQDCNLWKDLGSDFVLLVYR 525

Query: 600 DFVATG--DKNFARAVWPSVYIAMAYMEQFDKDGDGMIENEGFPDQTYDAWSANGVSAYC 657
           D++ TG  D++F R  W S+ + + Y++ FD D DG+ EN G PDQT+D W   G+SAYC
Sbjct: 526 DYLLTGAKDEDFLRECWSSIVLTLQYLKTFDLDNDGIPENSGAPDQTFDDWRLQGISAYC 585

Query: 658 GGLWVAALQAASALANDVGDHASASYFWVRYQKAKAVY-DSLWNGSYFNYDNSDGSSSTS 716
           GGLW+AAL+A   +   +G+  +    W+  Q+++A+Y  +LWNG Y+N D+  GS S  
Sbjct: 586 GGLWIAALEATLKIGAILGEDTAIFASWL--QQSRAIYHQTLWNGEYYNLDS--GSGSDV 641

Query: 717 IQADQLAGQWYARACGLLPIADEAKVKKALTKIYDFNVLKVKGGMCGAMNGMQPDGRIDM 776
           +  DQL GQ+YAR   L  + +    + AL+KIY+   LK  GG  GA NG++PDG  + 
Sbjct: 642 VMTDQLCGQFYARLLALPDVVENQYTQSALSKIYEACFLKFHGGKFGAANGLKPDGTPEN 701

Query: 777 -SGLQAREIWPGVTYGLAASMIQEEMVDMAFQTAAGVYEVAWSEDGLGYSFQTPESWNNN 835
                  E+W G+ +GL A ++Q  M    +Q    V    + E+GL   F+TPE+  + 
Sbjct: 702 PEATHPLEVWVGINFGLVAFLLQMGMEKQGWQITEAVVRQIY-ENGL--QFRTPEAITSV 758

Query: 836 DEYRSLCYMRPLTIWAMQWALTK 858
             +R+  Y+R + IWA+   LT+
Sbjct: 759 GTFRACHYLRAMAIWAIYGVLTE 781


>gi|37522029|ref|NP_925406.1| hypothetical protein gll2460 [Gloeobacter violaceus PCC 7421]
 gi|35213028|dbj|BAC90401.1| gll2460 [Gloeobacter violaceus PCC 7421]
          Length = 867

 Score =  422 bits (1086), Expect = e-115,   Method: Compositional matrix adjust.
 Identities = 284/819 (34%), Positives = 404/819 (49%), Gaps = 99/819 (12%)

Query: 87  QGIPLGGIGAGSIGRSYRGEFQRFKLFHG--ICDDAPVLANQFSVFVSRPNGEKFSSVLC 144
            G+PLGG GAG IGRS+RG+F  + L  G       PV   QF++F      +  + VLC
Sbjct: 79  HGMPLGGFGAGCIGRSHRGDFTLWHLDSGDHYFRSQPVC--QFALFEQSEGAKPRAKVLC 136

Query: 145 PRSPGVPKKNTDSGIESWDWNLKGENCTYHALFPRAWTVYDGEPDPELRIVCRQISPFIP 204
              P       D  + +WDW       TY AL+PR+W  Y+G  +  +++  +Q SP IP
Sbjct: 137 TEKPA------DGSLGAWDWTYPKGAGTYAALYPRSWFTYEGFFNSHVQL--KQFSPIIP 188

Query: 205 HNYKESSFPASVFTFTLSNSGQTSADVTLLFTWANSVAGDS------------------- 245
           +NYK++S+P +VF +T  N       +++L +W N V   +                   
Sbjct: 189 NNYKQTSYPVAVFEWTAFNPTDQPVTLSILVSWENMVGWFTNQFKSRAVKVRDDDTPVYD 248

Query: 246 -----GLSGHHFNS-------KTMTKDGVHGLTLHHRTA------NGRPPVT------FA 281
                G S  +FNS       + +T     G T   + A      + RP VT      F 
Sbjct: 249 YEPYWGESTGNFNSAREENAYRGLTMSRASGYTEQQKPAPATAAVSERPFVTAEGDGQFV 308

Query: 282 VAAEETADVHVSECPCFLLSGNSKGITAKDMWNEIKKHGSFDHLDNDKTSPSEPGSSIGA 341
           +A     DV ++    F  +GN       ++W      G    L +    P+  G  I A
Sbjct: 309 IATTREPDVEITYHTRFDPTGN-----GLEVWQPFATTGRLGDLVS--VRPAAKGERIAA 361

Query: 342 AIAASLTIPSGSTRSVTFSLAWDCPEVKFFEKVY-HRRYTKFYGTLGDSAARIAHDAILE 400
           A+A  LT+  G  R V  +LAWD P ++F + V  +R YTK +   G +A +IA +A+  
Sbjct: 362 AMAVRLTLKPGELRVVPMALAWDLPIMEFAQGVRRYRYYTKHFDKSGRNAWKIATEALDN 421

Query: 401 HAKWECEIEAWQRPILEDKRFPEWYPITLFNELYYLNAGGTIWTDGSPPMQSLATIRERK 460
           +A W+ +I+ WQ P+L D   P+WY   LFNELYYL +GGT+W +G              
Sbjct: 422 YADWDRQIQEWQNPVLADPSLPDWYKQALFNELYYLTSGGTLWENGP------------- 468

Query: 461 FSLDTPSKIHTPASSDTALGTRLLENGEENIGQFLYLEGAEYVMYNTYDVHFYSSFALVM 520
                P KI + A          +       G+F  LE  +Y  Y++ DV  Y SFAL++
Sbjct: 469 ----VPEKITSLAD---------IAKAPPGQGRFAVLESIDYRWYDSLDVRLYGSFALLI 515

Query: 521 LFPKLELSIQRDFAAAVMMHDPGTMKIMSDGKWVARKCLGAVPHDIGL--DDPWFEINSY 578
            +P+++ ++ R +A AV   D  T  I        RK   A+PHD+G   +DPW + N  
Sbjct: 516 NWPEIDKAVLRSYADAVPDEDDSTRIIGYTKTVAKRKSRDALPHDLGAPNEDPWIKNNYT 575

Query: 579 NLFNSSRWKDLNSKFVLQVYRDFVATG--DKNFARAVWPSVYIAMAYMEQ-FDKDGDGMI 635
              + + WKDL   FVLQVYRD+V TG  D  F R  WP+   A+  ++  FD+DGDG  
Sbjct: 576 TYQDCNLWKDLPCDFVLQVYRDYVDTGKRDLGFVRYCWPASKAALVRLKTTFDQDGDGFP 635

Query: 636 ENEGFPDQTYDAWSANGVSAYCGGLWVAALQAASALANDVGDHASASYFWVRYQKAKAVY 695
           EN G PD TYDAW   G+SAYCGGLW+AAL AA  +   VGD A+   F   Y +A+ +Y
Sbjct: 636 ENGGPPDCTYDAWPLKGISAYCGGLWLAALLAAVEMGKLVGDAAAVQQFGAWYAQAQPLY 695

Query: 696 D-SLWNGSYFNYDNSDGSSSTSIQADQLAGQWYARACGLLPIADEAKVKKALTKIYDFNV 754
           + +LWNG Y+  D S   +S +I ADQL G++YA+ C L  I  E + + AL  +Y    
Sbjct: 696 EKTLWNGRYYRID-SQSKNSEAIMADQLCGEYYAQVCNLADIVPEPQARSALETVYQTCF 754

Query: 755 LKVKGGMCGAMNGMQPDGRIDMSGLQAREIWPGVTYGLAASMIQEEMVDMAFQTAAGVYE 814
           +K  GG  G  NG   DG          E+W G+ +GLAA MI+  M       A  V  
Sbjct: 755 VKFYGGRFGCANGTNADGSFIGDTEHPSEVWTGINFGLAAFMIRNGMRREGMAIAEAVVA 814

Query: 815 VAWSEDGLGYSFQTPESWNNNDEYRSLCYMRPLTIWAMQ 853
             +   G G  F+TPE+      +R+  Y+RP+ IWAMQ
Sbjct: 815 NVY---GGGLQFRTPEALTPARTFRACMYLRPMAIWAMQ 850


>gi|257060089|ref|YP_003137977.1| glucosylceramidase [Cyanothece sp. PCC 8802]
 gi|256590255|gb|ACV01142.1| Glucosylceramidase [Cyanothece sp. PCC 8802]
          Length = 805

 Score =  422 bits (1085), Expect = e-115,   Method: Compositional matrix adjust.
 Identities = 275/826 (33%), Positives = 403/826 (48%), Gaps = 118/826 (14%)

Query: 86  DQGIPLGGIGAGSIGRSYRGEFQRFKLFHGICDDAPVLANQFSVFVSRPNGEKFSSVLCP 145
           + G+PLGG G+G IGRS  G+F  + L  G      + + QFSVF  +P GE        
Sbjct: 40  NHGMPLGGFGSGCIGRSPSGDFNLWHLDGGEHVFRSIPSCQFSVF-EQPEGEDAQVYALS 98

Query: 146 RSPGVPKKNTDSGIESWDWNLKGENCTYHALFPRAWTVYDGEPDPELRIVCRQISPFIPH 205
             P  P++ T   +  W W    E  TYHAL+PR+W  Y+G    +L  +C Q SP IP+
Sbjct: 99  TQP--PEEGT---LSRWSW-YPPEKGTYHALYPRSWYQYEGVFKSQL--ICEQFSPIIPN 150

Query: 206 NYKESSFPASVFTFTLSNSGQTSADVTLLFTWANSVA-------------GDSGLSGHHF 252
           NY+E+S+P  +F +T  N       ++++ TW N V               D G   + +
Sbjct: 151 NYQETSYPIGIFEWTAHNPTDKPITLSIMVTWQNIVGWFTNAIKSPEITVRDDGSPEYQY 210

Query: 253 NSKTMTKDG---------------VHGLTLHHRTANGRPPVTFAVAAEETADVHVSECPC 297
             +     G               ++ + LH +   G   +  A     + +V       
Sbjct: 211 QPRWGDSTGNFNQWVQDNFRVGFILNRIQLHDQIQEGEGQICIASVTNPSVEVF------ 264

Query: 298 FLLSGNSKGITAKDMWNEIKKHGSFDHLDNDKTSPSEPGSSIGAAIAASLTIPSGSTRSV 357
           +L   N KG    ++W+    +GS    D +  +P+EPG  I  A+A   TI  G T+ +
Sbjct: 265 YLGKWNPKG-DGSEVWDYFAMNGSLP--DTEDETPAEPGEQIAVAMAIRFTIRPGRTKKI 321

Query: 358 TFSLAWDCPEVKFFEKV-YHRRYTKFYGTLGDSAARIAHDAILEHAKWECEIEAWQRPIL 416
            F LAWD P  +F   V Y+RRYT FYG  G +   +   A+     W   +E WQ PIL
Sbjct: 322 PFILAWDFPVTEFAPGVQYYRRYTDFYGRNGKNGWAMVRTALKHSDVWYERVEEWQSPIL 381

Query: 417 EDKRFPEWYPITLFNELYYLNAGGTIWTDGSPPMQSLATIRERKFSLDTPSKIHTPASSD 476
           + +  P+W+ + LFNELY L  GGT+WT  S                             
Sbjct: 382 KREDLPDWFKMALFNELYLLTDGGTLWTAAS----------------------------- 412

Query: 477 TALGTRLLENGEENIGQFLYLEGAEYVMYNTYDVHFYSSFALVMLFPKLELSIQRDFAAA 536
                   EN  + +GQF  LE  +Y  Y + DV  Y SF L+ML+P+LE S+   FA A
Sbjct: 413 --------EN--DPVGQFGVLECLDYRWYESLDVRLYGSFGLMMLWPRLEKSVLEAFARA 462

Query: 537 VMMHDPGTMKIMSDGKWVARKCLGAVPHDIGL--DDPWFEINSYNLFNSSRWKDLNSKFV 594
           +  HD     I  +     RK  GA PHD+G   + PW + N  +  + + WKDL S FV
Sbjct: 463 IPTHDDTPRIIGYNQAKAIRKAKGATPHDLGAPNEHPWQKTNYTSYQDCNLWKDLGSDFV 522

Query: 595 LQVYRDFVATG--DKNFARAVWPSVYIAMAYMEQFDKDGDGMIENEGFPDQTYDAWSANG 652
           LQVYRD++ TG  D +F    WP++   + Y++ FD D DG+ EN G PDQT+D W   G
Sbjct: 523 LQVYRDYLLTGSDDTDFLWECWPAITETLDYLKTFDLDNDGIPENSGAPDQTFDDWKLQG 582

Query: 653 VSAYCGGLWVAALQAASALA-------------------NDVGDHASASYFWVRYQKAKA 693
           +SAYCGGLW+AAL+AA  +A                     +  +      W+  +++++
Sbjct: 583 ISAYCGGLWIAALEAAIKIAEILLKNVPTTEELQSRNNPESIKHYVKNHRDWL--EQSRS 640

Query: 694 VY-DSLWNGSYFNYDNSDGSSSTSIQADQLAGQWYARACGLLPIADEAKVKKALTKIYDF 752
           +Y D+LWNG Y+  D+  GS    + ADQL GQ+YAR   L  + +    + AL K+Y+ 
Sbjct: 641 IYHDTLWNGEYYKLDSQSGSD--VVMADQLCGQFYARLLNLPDVVETQYTESALNKVYEA 698

Query: 753 NVLKVKGGMCGAMNGMQPDGRI-DMSGLQAREIWPGVTYGLAASMIQEEMVDMAFQTAAG 811
             LK + G  GA NGM+PDG   D +    +E+W G+ +GLAA ++Q    D AF+    
Sbjct: 699 CFLKFQDGKYGAANGMKPDGTPEDPNSTHPQEVWTGINFGLAAFLLQMGRKDAAFKLTEA 758

Query: 812 VYEVAWSEDGLGYSFQTPESWNNNDEYRSLCYMRPLTIWAMQWALT 857
           V +  + E+GL   F+TPE+      +R+  Y+R + IW +   LT
Sbjct: 759 VVKQVY-ENGL--QFRTPEAITAVGTFRASHYLRAMAIWGIYGILT 801


>gi|425455967|ref|ZP_18835678.1| Similar to tr|P73619|P73619 [Microcystis aeruginosa PCC 9807]
 gi|389803038|emb|CCI17984.1| Similar to tr|P73619|P73619 [Microcystis aeruginosa PCC 9807]
          Length = 784

 Score =  422 bits (1084), Expect = e-115,   Method: Compositional matrix adjust.
 Identities = 264/803 (32%), Positives = 400/803 (49%), Gaps = 93/803 (11%)

Query: 87  QGIPLGGIGAGSIGRSYRGEFQRFKLFHGICDDAPVLANQFSVFVSRPNGEKFSSVLCPR 146
            G+PLGG GAG IGRS +G+F  + L  G      + A QFS+F    +G   +  +   
Sbjct: 41  HGMPLGGFGAGCIGRSPKGDFNLWHLDGGEHSFNSLAACQFSIFEQTEDGSAQAYAMATE 100

Query: 147 SPGVPKKNTDSGIESWDWNLKGENCTYHALFPRAWTVYDGEPDPELRIVCRQISPFIPHN 206
           SP       D  +  W W    E  TY AL+PR+W  Y+ +   + +++C Q SP   HN
Sbjct: 101 SP------KDGTLSRWSW-YPAEKGTYSALYPRSW--YEYKDVFQTQLICEQFSPVWAHN 151

Query: 207 YKESSFPASVFTFTLSNSGQTSADVTLLFTWANSVA-------------GDSGLSGHHFN 253
           Y+ESS+P ++F +T+ N       ++++ TW N++               D G   + + 
Sbjct: 152 YQESSYPIALFEWTVHNPTDKPITISIMLTWQNTIGWFTNAIKSPVVKVRDDGSPVYEYQ 211

Query: 254 SKTMTKDGVH----------GLTLHHRTAN-GRPPVTFAVAAEETADVHVSECPCFLLSG 302
           SK    +G +          G  +     N G      A+A+     + V     F L  
Sbjct: 212 SKWGESEGNYNQWIVDNFRVGCLMRRDYENPGEGDGQMAIASITNPSLEV-----FYLGK 266

Query: 303 NSKGITAKDMWNEIKKHGSFDHLDNDKTSPSEPGSSIGAAIAASLTIPSGSTRSVTFSLA 362
            +      ++W+    +GS   ++++  +P++ G     A+A   TI  G TR + F LA
Sbjct: 267 WNPAGDGSEVWDYFAMNGSLPDIEDE--TPAQKGEQTATAMAIRFTIRPGKTRKIPFILA 324

Query: 363 WDCPEVKFFEKV-YHRRYTKFYGTLGDSAARIAHDAILEHAKWECEIEAWQRPILEDKRF 421
           WD P  +F + + Y+RRYT F+G  G +A  +   A+     W   I  WQ+PILE    
Sbjct: 325 WDLPVTEFAQGINYYRRYTDFFGRNGQNAWAMVRTALKHGDTWRENIINWQKPILERTDL 384

Query: 422 PEWYPITLFNELYYLNAGGTIWTDGSPPMQSLATIRERKFSLDTPSKIHTPASSDTALGT 481
            +W+ + LFNELY L  GGT+WT  +                                  
Sbjct: 385 ADWFKMALFNELYLLADGGTLWTAATE--------------------------------- 411

Query: 482 RLLENGEENIGQFLYLEGAEYVMYNTYDVHFYSSFALVMLFPKLELSIQRDFAAAVMMHD 541
                 ++ IGQF  LE  +Y  Y + DV  Y SF L+ML+P+LE ++   FA A+   D
Sbjct: 412 ------QDPIGQFGVLECIDYRWYESLDVRLYGSFGLLMLWPRLEKAVMEAFARAIPSSD 465

Query: 542 PGTMKIMSDGKWVARKCLGAVPHDIGL--DDPWFEINSYNLFNSSRWKDLNSKFVLQVYR 599
                I  +     RK   A PHD+G   + PW + N  +  + + WKDL S FVL VYR
Sbjct: 466 DTPRIIGYNQASAIRKAKNATPHDLGAPNEHPWQKSNYTSYQDCNLWKDLGSDFVLLVYR 525

Query: 600 DFVATG--DKNFARAVWPSVYIAMAYMEQFDKDGDGMIENEGFPDQTYDAWSANGVSAYC 657
           D++ TG  D++F R  WPS+ + + Y++ FD D DG+ EN G PDQT+D W   G+SAYC
Sbjct: 526 DYLLTGAKDEDFLRECWPSIVLTLQYLKTFDLDNDGIPENSGAPDQTFDDWRLQGISAYC 585

Query: 658 GGLWVAALQAASALANDVGDHASASYFWVRYQKAKAVY-DSLWNGSYFNYDNSDGSSSTS 716
           GGLW+AAL+A   +   +G+  +    W+  Q+++A+Y D+LWNG Y+N D+  GS S  
Sbjct: 586 GGLWIAALEATLKIGAILGEDTAIFASWL--QQSRAIYHDTLWNGEYYNLDS--GSGSDV 641

Query: 717 IQADQLAGQWYARACGLLPIADEAKVKKALTKIYDFNVLKVKGGMCGAMNGMQPDGRIDM 776
           +  DQL GQ+YAR   L  + +    + AL+KIY+   LK  GG  GA NG++PDG  + 
Sbjct: 642 VMTDQLCGQFYARLLSLPDVVENQYTQSALSKIYEACFLKFHGGKFGAANGLKPDGTPEN 701

Query: 777 -SGLQAREIWPGVTYGLAASMIQEEMVDMAFQTAAGVYEVAWSEDGLGYSFQTPESWNNN 835
                  E+W G+ +GL A ++Q  M    +Q    V    + E+GL   F+TPE+  + 
Sbjct: 702 PEATHPLEVWVGINFGLVAFLLQMGMEKQGWQITEAVVRQIY-ENGL--QFRTPEAITSV 758

Query: 836 DEYRSLCYMRPLTIWAMQWALTK 858
             +R+  Y+R + IWA+   LT+
Sbjct: 759 GTFRACHYLRAMAIWAIYGVLTE 781


>gi|218247014|ref|YP_002372385.1| hypothetical protein PCC8801_2201 [Cyanothece sp. PCC 8801]
 gi|218167492|gb|ACK66229.1| protein of unknown function DUF608 [Cyanothece sp. PCC 8801]
          Length = 805

 Score =  420 bits (1079), Expect = e-114,   Method: Compositional matrix adjust.
 Identities = 274/826 (33%), Positives = 401/826 (48%), Gaps = 118/826 (14%)

Query: 86  DQGIPLGGIGAGSIGRSYRGEFQRFKLFHGICDDAPVLANQFSVFVSRPNGEKFSSVLCP 145
           + G+PLGG G+G IGRS  G+F  + L  G      + + QFSVF  +P GE        
Sbjct: 40  NHGMPLGGFGSGCIGRSPSGDFNLWHLDGGEHVFRSIPSCQFSVF-EQPEGEDAQVYALS 98

Query: 146 RSPGVPKKNTDSGIESWDWNLKGENCTYHALFPRAWTVYDGEPDPELRIVCRQISPFIPH 205
             P  P++ T   +  W W    E  TYHAL+PR+W  Y+G    +L  +C Q SP IP+
Sbjct: 99  TQP--PEEGT---LSRWSW-YPPEKGTYHALYPRSWYQYEGVFKSQL--ICEQFSPIIPN 150

Query: 206 NYKESSFPASVFTFTLSNSGQTSADVTLLFTWANSVA-------------GDSGLSGHHF 252
           NY+E+S+P  +F +T  N       ++++ TW N V               D G   + +
Sbjct: 151 NYQETSYPIGIFEWTAHNPTDKPITLSIMVTWQNIVGWFTNAIKSPEITVRDDGSPEYQY 210

Query: 253 NSKTMTKDG---------------VHGLTLHHRTANGRPPVTFAVAAEETADVHVSECPC 297
             +     G               ++ + LH     G   +  A     + +V       
Sbjct: 211 QPRWGDSTGNFNQWVQDNFRVGFILNRIQLHDEIQEGEGQICIASVTNPSVEVF------ 264

Query: 298 FLLSGNSKGITAKDMWNEIKKHGSFDHLDNDKTSPSEPGSSIGAAIAASLTIPSGSTRSV 357
           +L   N KG    ++W+    +GS    D +  +P+EPG  I  A+A   TI  G T+ +
Sbjct: 265 YLGKWNPKG-DGSEVWDYFAMNGSLP--DTEDETPAEPGEQIAVAMAIRFTIRPGRTKKI 321

Query: 358 TFSLAWDCPEVKFFEKV-YHRRYTKFYGTLGDSAARIAHDAILEHAKWECEIEAWQRPIL 416
            F LAWD P  +F   V Y+RRYT FYG  G +   +   A+     W   +E WQ PIL
Sbjct: 322 PFILAWDFPVTEFAPGVQYYRRYTDFYGRNGKNGWAMVRTALKHSDVWYERVEEWQSPIL 381

Query: 417 EDKRFPEWYPITLFNELYYLNAGGTIWTDGSPPMQSLATIRERKFSLDTPSKIHTPASSD 476
           + +  P+W+ + LFNELY L  GGT+WT  S                             
Sbjct: 382 KREDLPDWFKMALFNELYLLTDGGTLWTAAS----------------------------- 412

Query: 477 TALGTRLLENGEENIGQFLYLEGAEYVMYNTYDVHFYSSFALVMLFPKLELSIQRDFAAA 536
                   EN  + +GQF  LE  +Y  Y + DV  Y SF L+ML+P+LE S+   FA A
Sbjct: 413 --------EN--DPVGQFGVLECLDYRWYESLDVRLYGSFGLMMLWPRLEKSVLEAFARA 462

Query: 537 VMMHDPGTMKIMSDGKWVARKCLGAVPHDIGL--DDPWFEINSYNLFNSSRWKDLNSKFV 594
           +  HD     I  +     RK  GA PHD+G   + PW + N  +  + + WKDL S FV
Sbjct: 463 IPTHDDTPRIIGYNQAKAIRKAKGATPHDLGAPNEHPWQKTNYTSYQDCNLWKDLGSDFV 522

Query: 595 LQVYRDFVATG--DKNFARAVWPSVYIAMAYMEQFDKDGDGMIENEGFPDQTYDAWSANG 652
           LQVYRD++ TG  D +F    WP++   + Y++ FD D DG+ EN G PDQT+D W   G
Sbjct: 523 LQVYRDYLLTGSDDTDFLWECWPAITETLDYLKTFDLDNDGIPENSGAPDQTFDDWKLQG 582

Query: 653 VSAYCGGLWVAALQAASALA-------------------NDVGDHASASYFWVRYQKAKA 693
           +SAYCGGLW+AAL+AA  +A                     +  +      W+  +++++
Sbjct: 583 ISAYCGGLWIAALEAAIKIAEILLKNVPTTEELQSRNNPESIKHYVKNHRDWL--EQSRS 640

Query: 694 VY-DSLWNGSYFNYDNSDGSSSTSIQADQLAGQWYARACGLLPIADEAKVKKALTKIYDF 752
           +Y D+LWNG Y+  D+  GS    + ADQL GQ+YAR      + +    + AL K+Y+ 
Sbjct: 641 IYHDTLWNGEYYKLDSQSGSD--VVMADQLCGQFYARLLNFPDVVETQYTESALNKVYEA 698

Query: 753 NVLKVKGGMCGAMNGMQPDGRI-DMSGLQAREIWPGVTYGLAASMIQEEMVDMAFQTAAG 811
             LK + G  GA NGM+PDG   D +    +E+W G+ +GLAA ++Q    D AF+    
Sbjct: 699 CFLKFQDGKYGAANGMKPDGTPEDPNSTHPQEVWTGINFGLAAFLLQMGRKDAAFKLTEA 758

Query: 812 VYEVAWSEDGLGYSFQTPESWNNNDEYRSLCYMRPLTIWAMQWALT 857
           V +  + E+GL   F+TPE+      +R+  Y+R + IW +   LT
Sbjct: 759 VVKQVY-ENGL--QFRTPEAITAVGTFRASHYLRAMAIWGIYGILT 801


>gi|425450263|ref|ZP_18830094.1| Similar to tr|P73619|P73619 [Microcystis aeruginosa PCC 7941]
 gi|389768962|emb|CCI06046.1| Similar to tr|P73619|P73619 [Microcystis aeruginosa PCC 7941]
          Length = 784

 Score =  420 bits (1079), Expect = e-114,   Method: Compositional matrix adjust.
 Identities = 264/803 (32%), Positives = 398/803 (49%), Gaps = 93/803 (11%)

Query: 87  QGIPLGGIGAGSIGRSYRGEFQRFKLFHGICDDAPVLANQFSVFVSRPNGEKFSSVLCPR 146
            G+PLGG GAG IGRS +G+F  + L  G      + A QFS+F    +G   +  +   
Sbjct: 41  HGMPLGGFGAGCIGRSPKGDFNLWHLDGGEHSFNSLPACQFSIFEQTEDGSAQAYAMATE 100

Query: 147 SPGVPKKNTDSGIESWDWNLKGENCTYHALFPRAWTVYDGEPDPELRIVCRQISPFIPHN 206
           SP      TD  +  W W    E   Y AL+PR+W  Y+ +   + +++C Q SP   HN
Sbjct: 101 SP------TDGTLSRWSW-YPAEKGNYGALYPRSW--YEYKDVFQTQLICEQFSPVWAHN 151

Query: 207 YKESSFPASVFTFTLSNSGQTSADVTLLFTWANSVA-------------GDSGLSGHHFN 253
           Y+ESS+P ++F +T  N       ++++ TW N++               D G   + + 
Sbjct: 152 YQESSYPIALFEWTAHNPTDKPITISIMLTWQNTIGWFTNAIKSPVVKVRDDGSPVYEYQ 211

Query: 254 SKTMTKDGVH----------GLTLHHRTAN-GRPPVTFAVAAEETADVHVSECPCFLLSG 302
           SK    +G +          G  +     N G      A+A+     + V     F L  
Sbjct: 212 SKWGESEGNYNQWIVDNFRVGCLMRRDYENPGEGDGQMAIASITNPSLEV-----FYLGK 266

Query: 303 NSKGITAKDMWNEIKKHGSFDHLDNDKTSPSEPGSSIGAAIAASLTIPSGSTRSVTFSLA 362
            +      ++W+    +GS   +D++  +P++ G     A+A   TI  G TR + F LA
Sbjct: 267 WNPAGDGGEVWDYFAMNGSLPDIDDE--TPAQKGEQTATAMAIRFTIRPGKTRKIPFILA 324

Query: 363 WDCPEVKFFEKV-YHRRYTKFYGTLGDSAARIAHDAILEHAKWECEIEAWQRPILEDKRF 421
           WD P  +F + + Y+RRYT F+G  G +A  +   A+ +   W   I  WQ+PILE    
Sbjct: 325 WDLPVTEFAQGINYYRRYTDFFGRNGQNAWAMVRTALKQGDTWRENIINWQKPILERSDL 384

Query: 422 PEWYPITLFNELYYLNAGGTIWTDGSPPMQSLATIRERKFSLDTPSKIHTPASSDTALGT 481
            +W+ + LFNELY L  GGT+WT  +                                  
Sbjct: 385 ADWFKMALFNELYLLTDGGTLWTAATE--------------------------------- 411

Query: 482 RLLENGEENIGQFLYLEGAEYVMYNTYDVHFYSSFALVMLFPKLELSIQRDFAAAVMMHD 541
                 ++ IGQF  LE  +Y  Y + DV  Y SF L+ML+P+LE ++   FA A+   D
Sbjct: 412 ------QDPIGQFGVLECIDYRWYESLDVRLYGSFGLLMLWPRLEKAVMEAFARAIPSSD 465

Query: 542 PGTMKIMSDGKWVARKCLGAVPHDIGL--DDPWFEINSYNLFNSSRWKDLNSKFVLQVYR 599
                I  +     RK   A PHD+G   + PW + N  +  + + WKDL S FVL VYR
Sbjct: 466 DTPRIIGYNQASAIRKAKNATPHDLGAPNEHPWQKSNYTSYQDCNLWKDLGSDFVLLVYR 525

Query: 600 DFVATG--DKNFARAVWPSVYIAMAYMEQFDKDGDGMIENEGFPDQTYDAWSANGVSAYC 657
           D++ TG  D++F R  WPS+ + + Y++ FD D DG+ EN G PDQT+D W   G+SAYC
Sbjct: 526 DYLLTGAKDEDFLRECWPSIVLTLQYLKTFDLDNDGIPENSGAPDQTFDDWRLQGISAYC 585

Query: 658 GGLWVAALQAASALANDVGDHASASYFWVRYQKAKAVY-DSLWNGSYFNYDNSDGSSSTS 716
           GGLW+AAL+A   +   +G+  +    W+  Q+++A+Y  +LWNG Y+N D+  GS S  
Sbjct: 586 GGLWIAALEATIKIGAILGEDTAIFASWL--QQSRAIYHQTLWNGEYYNLDS--GSGSDV 641

Query: 717 IQADQLAGQWYARACGLLPIADEAKVKKALTKIYDFNVLKVKGGMCGAMNGMQPDGRIDM 776
           +  DQL GQ+YAR   L  + +    + AL+KIY    LK  GG  GA NG++PDG  + 
Sbjct: 642 VMTDQLCGQFYARLLALPDVVENQYTQSALSKIYQACFLKFHGGKFGAANGLKPDGTPEN 701

Query: 777 -SGLQAREIWPGVTYGLAASMIQEEMVDMAFQTAAGVYEVAWSEDGLGYSFQTPESWNNN 835
                  E+W G+ +GL A ++Q  M    +Q    V    + E+GL   F+TPE+  + 
Sbjct: 702 PEATHPLEVWVGINFGLVAFLLQMGMEKQGWQITQAVVRQIY-ENGL--QFRTPEAITSV 758

Query: 836 DEYRSLCYMRPLTIWAMQWALTK 858
             +R+  Y+R + IWA+   LT+
Sbjct: 759 GTFRACHYLRAMAIWAIYGVLTE 781


>gi|422303239|ref|ZP_16390593.1| Similar to tr|P73619|P73619 [Microcystis aeruginosa PCC 9806]
 gi|389791807|emb|CCI12404.1| Similar to tr|P73619|P73619 [Microcystis aeruginosa PCC 9806]
          Length = 784

 Score =  419 bits (1078), Expect = e-114,   Method: Compositional matrix adjust.
 Identities = 264/803 (32%), Positives = 397/803 (49%), Gaps = 93/803 (11%)

Query: 87  QGIPLGGIGAGSIGRSYRGEFQRFKLFHGICDDAPVLANQFSVFVSRPNGEKFSSVLCPR 146
            G+PLGG GAG IGRS +G+F  + L  G      + A QFS+F    +G   +  +   
Sbjct: 41  HGMPLGGFGAGCIGRSPKGDFNLWHLDGGEHSFNSLAACQFSIFEQTEDGSAQAYAMATE 100

Query: 147 SPGVPKKNTDSGIESWDWNLKGENCTYHALFPRAWTVYDGEPDPELRIVCRQISPFIPHN 206
           SP      TD  +  W W    E   Y AL+PR+W  Y+ +   + +++C Q SP   HN
Sbjct: 101 SP------TDGTLSRWSW-YPAEKGNYSALYPRSW--YEYKDVFQTQLICEQFSPVWAHN 151

Query: 207 YKESSFPASVFTFTLSNSGQTSADVTLLFTWANSVA-------------GDSGLSGHHFN 253
           Y+ESS+P ++F +T  N       ++++ TW N++               D G   + + 
Sbjct: 152 YQESSYPIALFEWTAHNPTDKPITISIMLTWQNTIGWFTNAIKSPVVKVRDDGSPVYEYQ 211

Query: 254 SKTMTKDGVH----------GLTLHHRTAN-GRPPVTFAVAAEETADVHVSECPCFLLSG 302
           SK    +G +          G  +     N G      A+A+     + V     F L  
Sbjct: 212 SKWGESEGNYNQWIVDNFRVGCLMGRDYENPGEGDGQMAIASITNPSLEV-----FYLGK 266

Query: 303 NSKGITAKDMWNEIKKHGSFDHLDNDKTSPSEPGSSIGAAIAASLTIPSGSTRSVTFSLA 362
            +      ++W+    +GS   ++++  +P+E G     A+A   TI  G TR + F LA
Sbjct: 267 WNPAGDGGEVWDYFAMNGSLPDIEDE--TPAEKGEQTATAMAIRFTIRPGKTRKIPFILA 324

Query: 363 WDCPEVKFFEKV-YHRRYTKFYGTLGDSAARIAHDAILEHAKWECEIEAWQRPILEDKRF 421
           WD P  +F + + Y+RRYT F+G  G +A  +   A+     W   I  WQ+PILE    
Sbjct: 325 WDLPVTEFAQGINYYRRYTDFFGRNGQNAWAMVRTALKHGDTWRENIINWQKPILERSDL 384

Query: 422 PEWYPITLFNELYYLNAGGTIWTDGSPPMQSLATIRERKFSLDTPSKIHTPASSDTALGT 481
            +W+ + LFNELY L  GGT+WT  +                                  
Sbjct: 385 ADWFKMALFNELYLLTDGGTLWTAATE--------------------------------- 411

Query: 482 RLLENGEENIGQFLYLEGAEYVMYNTYDVHFYSSFALVMLFPKLELSIQRDFAAAVMMHD 541
                 ++ IGQF  LE  +Y  Y + DV  Y SF L+ML+P+LE ++   FA A+   D
Sbjct: 412 ------QDPIGQFGVLECIDYRWYESLDVRLYGSFGLLMLWPRLEKAVMEAFARAIPSSD 465

Query: 542 PGTMKIMSDGKWVARKCLGAVPHDIGL--DDPWFEINSYNLFNSSRWKDLNSKFVLQVYR 599
                I  +     RK   A PHD+G   + PW + N  +  + + WKDL S FVL VYR
Sbjct: 466 DTPRIIGYNQASAIRKAKNATPHDLGAPNEHPWQKSNYTSYQDCNLWKDLGSDFVLLVYR 525

Query: 600 DFVATG--DKNFARAVWPSVYIAMAYMEQFDKDGDGMIENEGFPDQTYDAWSANGVSAYC 657
           D++ TG  D++F R  WPS+ + + Y++ FD D DG+ EN G PDQT+D W   G+SAYC
Sbjct: 526 DYLLTGAKDEDFLRECWPSIVLTLQYLKTFDLDNDGIPENSGAPDQTFDDWRLQGISAYC 585

Query: 658 GGLWVAALQAASALANDVGDHASASYFWVRYQKAKAVY-DSLWNGSYFNYDNSDGSSSTS 716
           GGLW+AAL+A   +   +G+  +    W+  Q+++A+Y  +LWNG Y+N D+  GS S  
Sbjct: 586 GGLWIAALEATIKIGEIIGEDTAIFTSWL--QQSRAIYHQTLWNGEYYNLDS--GSGSDV 641

Query: 717 IQADQLAGQWYARACGLLPIADEAKVKKALTKIYDFNVLKVKGGMCGAMNGMQPDGRIDM 776
           +  DQL GQ+YAR   L  + +    + AL+KIY    LK  GG  GA NG++PDG  + 
Sbjct: 642 VMTDQLCGQFYARLLSLPDVVENQYTQSALSKIYQSCFLKFHGGKFGAANGLKPDGTPEN 701

Query: 777 -SGLQAREIWPGVTYGLAASMIQEEMVDMAFQTAAGVYEVAWSEDGLGYSFQTPESWNNN 835
                  E+W G+ +GL A ++Q  M    +Q    V    + E+GL   F+TPE+  + 
Sbjct: 702 PEATHPLEVWVGINFGLVAFLMQMGMEKQGWQITEAVVRQVY-ENGL--QFRTPEAITSV 758

Query: 836 DEYRSLCYMRPLTIWAMQWALTK 858
             +R+  Y+R + IWA+   LT+
Sbjct: 759 GTFRACHYLRAMAIWAIYGVLTE 781


>gi|440753145|ref|ZP_20932348.1| hypothetical protein O53_1523 [Microcystis aeruginosa TAIHU98]
 gi|440177638|gb|ELP56911.1| hypothetical protein O53_1523 [Microcystis aeruginosa TAIHU98]
          Length = 789

 Score =  418 bits (1075), Expect = e-114,   Method: Compositional matrix adjust.
 Identities = 262/797 (32%), Positives = 395/797 (49%), Gaps = 93/797 (11%)

Query: 87  QGIPLGGIGAGSIGRSYRGEFQRFKLFHGICDDAPVLANQFSVFVSRPNGEKFSSVLCPR 146
            G+PLGG GAG IGRS +G+F  + L  G      + A QFS+F    +G   +  +   
Sbjct: 41  HGMPLGGFGAGCIGRSPKGDFNLWHLDGGEHSFNSLAACQFSIFEQTEDGNAQAYAMATE 100

Query: 147 SPGVPKKNTDSGIESWDWNLKGENCTYHALFPRAWTVYDGEPDPELRIVCRQISPFIPHN 206
           SP      TD  +  W W    E   Y AL+PR+W  Y+ +   + +++C Q SP   HN
Sbjct: 101 SP------TDGTLSRWSW-YPAEKGNYSALYPRSW--YEYKDVFQTQLICEQFSPVWAHN 151

Query: 207 YKESSFPASVFTFTLSNSGQTSADVTLLFTWANSVA-------------GDSGLSGHHFN 253
           Y+ESS+P ++F +T  N       ++++ TW N++               D G   + + 
Sbjct: 152 YQESSYPIALFEWTAHNPTDKPITISIMLTWQNTIGWFTNAIKSPVVKVRDDGSPVYEYQ 211

Query: 254 SKTMTKDGVH----------GLTLHHRTAN-GRPPVTFAVAAEETADVHVSECPCFLLSG 302
           SK    +G +          G  +     N G      A+A+     + V     F L  
Sbjct: 212 SKWGESEGNYNQWIVDNFRVGCLMRRDYENPGEGDGQMAIASITNPSLEV-----FYLGK 266

Query: 303 NSKGITAKDMWNEIKKHGSFDHLDNDKTSPSEPGSSIGAAIAASLTIPSGSTRSVTFSLA 362
            +      ++W+    +GS   ++++  +P+E G     A+A   TI  G TR + F LA
Sbjct: 267 WNPAGDGGEVWDYFAMNGSLPDIEDE--TPAEKGEQTATAMAIRFTIRPGKTRKIPFILA 324

Query: 363 WDCPEVKFFEKV-YHRRYTKFYGTLGDSAARIAHDAILEHAKWECEIEAWQRPILEDKRF 421
           WD P  +F + + Y+RRYT F+G  G +A  +   A+     W   I  WQ+PILE    
Sbjct: 325 WDLPVTEFAQGINYYRRYTDFFGRNGQNAWAMVRTALKHGDTWRENIINWQKPILERSDL 384

Query: 422 PEWYPITLFNELYYLNAGGTIWTDGSPPMQSLATIRERKFSLDTPSKIHTPASSDTALGT 481
            +W+ + LFNELY L  GGT+WT  +                                  
Sbjct: 385 ADWFKMALFNELYLLTDGGTLWTAATE--------------------------------- 411

Query: 482 RLLENGEENIGQFLYLEGAEYVMYNTYDVHFYSSFALVMLFPKLELSIQRDFAAAVMMHD 541
                 ++ IGQF  LE  +Y  Y + DV  Y SF L+ML+P+LE ++   FA A+   D
Sbjct: 412 ------QDAIGQFGVLECIDYRWYESLDVRLYGSFGLLMLWPRLEKAVMEAFARAIPSSD 465

Query: 542 PGTMKIMSDGKWVARKCLGAVPHDIGL--DDPWFEINSYNLFNSSRWKDLNSKFVLQVYR 599
                I  +     RK   A PHD+G   + PW + N  +  + + WKDL S FVL VYR
Sbjct: 466 DTPRIIGYNQASAIRKAKNATPHDLGAPNEHPWQKSNYTSYQDCNLWKDLGSDFVLLVYR 525

Query: 600 DFVATG--DKNFARAVWPSVYIAMAYMEQFDKDGDGMIENEGFPDQTYDAWSANGVSAYC 657
           D++ TG  D++F R  WPS+ + + Y++ FD D DG+ EN G PDQT+D W   G+SAYC
Sbjct: 526 DYLLTGAKDEDFLRECWPSIVLTLQYLKTFDLDNDGIPENSGAPDQTFDDWRLQGISAYC 585

Query: 658 GGLWVAALQAASALANDVGDHASASYFWVRYQKAKAVY-DSLWNGSYFNYDNSDGSSSTS 716
           GGLW+AAL+A   +   +G+  +    W+  Q+++A+Y  +LWNG Y+N D+  GS S  
Sbjct: 586 GGLWIAALEATLKIGAILGEDTAIFASWL--QQSRAIYHQTLWNGEYYNLDS--GSGSDV 641

Query: 717 IQADQLAGQWYARACGLLPIADEAKVKKALTKIYDFNVLKVKGGMCGAMNGMQPDGRIDM 776
           +  DQL GQ+YAR   L  + +    + AL+KIY    LK  GG  GA NG++PDG  + 
Sbjct: 642 VMTDQLCGQFYARLLALPDVVENQYTQSALSKIYQACFLKFHGGKFGAANGLKPDGTPEN 701

Query: 777 -SGLQAREIWPGVTYGLAASMIQEEMVDMAFQTAAGVYEVAWSEDGLGYSFQTPESWNNN 835
                  E+W G+ +GL A ++Q  M    +Q    V +  + E+GL   F+TPE+  + 
Sbjct: 702 PEATHPLEVWVGINFGLVAFLLQMGMEKQGWQITEAVVKQVY-ENGL--QFRTPEAITSV 758

Query: 836 DEYRSLCYMRPLTIWAM 852
             +R+  Y+R + IWA+
Sbjct: 759 GTFRACHYLRAMAIWAI 775


>gi|425471589|ref|ZP_18850441.1| Glycosidase-like protein [Microcystis aeruginosa PCC 9701]
 gi|389882505|emb|CCI37035.1| Glycosidase-like protein [Microcystis aeruginosa PCC 9701]
          Length = 784

 Score =  418 bits (1075), Expect = e-114,   Method: Compositional matrix adjust.
 Identities = 266/803 (33%), Positives = 396/803 (49%), Gaps = 93/803 (11%)

Query: 87  QGIPLGGIGAGSIGRSYRGEFQRFKLFHGICDDAPVLANQFSVFVSRPNGEKFSSVLCPR 146
            G+PLGG GAG IGRS +G+F  + L  G      + A QFS+F    +G   +  +   
Sbjct: 41  HGMPLGGFGAGCIGRSPKGDFNLWHLDGGEHSFNSLAACQFSIFEQTEDGSAQAYAMATE 100

Query: 147 SPGVPKKNTDSGIESWDWNLKGENCTYHALFPRAWTVYDGEPDPELRIVCRQISPFIPHN 206
            P      TD  +  W W    E  TY AL+PR+W  Y+ +   + +++C Q SP   HN
Sbjct: 101 RP------TDGTLSRWSW-YPAEKGTYSALYPRSW--YEYKDVFQTQLICEQFSPVWAHN 151

Query: 207 YKESSFPASVFTFTLSNSGQTSADVTLLFTWANSVA-------------GDSGLSGHHFN 253
           Y+ESS+P ++F +T  N       ++++ TW N++               D G   + + 
Sbjct: 152 YQESSYPIALFEWTAHNPTDKPITISIMLTWQNTIGWFTNAIKSPVVKVRDDGSPVYEYQ 211

Query: 254 SKTMTKDGVH----------GLTLHHRTAN-GRPPVTFAVAAEETADVHVSECPCFLLSG 302
           SK    +G +          G  +     N G      A+A      + V     F L  
Sbjct: 212 SKWGESEGNYNQWIVDNFRVGCLMRRDYENPGEGDGQMAIATITNPSLEV-----FYLGK 266

Query: 303 NSKGITAKDMWNEIKKHGSFDHLDNDKTSPSEPGSSIGAAIAASLTIPSGSTRSVTFSLA 362
            +      ++W+    +GS   ++++  +P+E G     A+A   TI  G TR + F LA
Sbjct: 267 WNPAGDGGEVWDYFAMNGSLPDIEDE--TPAEKGEQTATAMAIRFTIRPGKTRKIPFILA 324

Query: 363 WDCPEVKFFEKV-YHRRYTKFYGTLGDSAARIAHDAILEHAKWECEIEAWQRPILEDKRF 421
           WD P  +F + + Y+RRYT F+G  G +A  +   A+     W   I  WQ+PILE    
Sbjct: 325 WDLPVTEFAQGINYYRRYTDFFGRNGQNAWAMVRTALKHGDTWRENIINWQKPILERSDL 384

Query: 422 PEWYPITLFNELYYLNAGGTIWTDGSPPMQSLATIRERKFSLDTPSKIHTPASSDTALGT 481
            +W+ + LFNELY L  GGT+WT  +          ER                      
Sbjct: 385 ADWFKMALFNELYLLTDGGTLWTAAT----------ER---------------------- 412

Query: 482 RLLENGEENIGQFLYLEGAEYVMYNTYDVHFYSSFALVMLFPKLELSIQRDFAAAVMMHD 541
                  + IGQF  LE  +Y  Y + DV  Y SF L+ML+P+LE ++   FA A+   D
Sbjct: 413 -------DPIGQFGVLECIDYRWYESLDVRLYGSFGLLMLWPRLEKAVMEAFARAIPSSD 465

Query: 542 PGTMKIMSDGKWVARKCLGAVPHDIGL--DDPWFEINSYNLFNSSRWKDLNSKFVLQVYR 599
                I  +     RK   A PHD+G   + PW + N  +  + + WKDL S FVL VYR
Sbjct: 466 DTPRIIGYNQASAIRKAKNATPHDLGAPNEHPWQKSNYTSYQDCNLWKDLGSDFVLLVYR 525

Query: 600 DFVATG--DKNFARAVWPSVYIAMAYMEQFDKDGDGMIENEGFPDQTYDAWSANGVSAYC 657
           D++ TG  D++F R  WPS+ + + Y++ FD D DG+ EN G PDQT+D W   G+SAYC
Sbjct: 526 DYLLTGAKDEDFLRECWPSIVLTLQYLKTFDLDNDGIPENSGAPDQTFDDWRLQGISAYC 585

Query: 658 GGLWVAALQAASALANDVGDHASASYFWVRYQKAKAVY-DSLWNGSYFNYDNSDGSSSTS 716
           GGLW+AAL+A   +   +G+       W+  Q+++A+Y  +LWNG Y+N D+  GS S  
Sbjct: 586 GGLWIAALEATIKIGAILGEDTEIFASWL--QQSRAIYHQTLWNGEYYNLDS--GSGSDV 641

Query: 717 IQADQLAGQWYARACGLLPIADEAKVKKALTKIYDFNVLKVKGGMCGAMNGMQPDGRIDM 776
           +  DQL GQ+YAR   L  + +    + AL KIY    LK  GG  GA NG++PDG  + 
Sbjct: 642 VMTDQLCGQFYARLLALPDVVENQYTQSALGKIYQACFLKFHGGKFGAANGLKPDGTPEN 701

Query: 777 -SGLQAREIWPGVTYGLAASMIQEEMVDMAFQTAAGVYEVAWSEDGLGYSFQTPESWNNN 835
                  E+W G+ +GL A ++Q  M    +Q    V +  + E+GL   F+TPE+  + 
Sbjct: 702 PQATHPLEVWVGINFGLVAFLLQMGMEKQGWQITEAVVKQVY-ENGL--QFRTPEAITSV 758

Query: 836 DEYRSLCYMRPLTIWAMQWALTK 858
             +R+  Y+R + IWA+   LT+
Sbjct: 759 GTFRACHYLRAMAIWAIYGILTE 781


>gi|425442611|ref|ZP_18822852.1| Similar to tr|P73619|P73619 [Microcystis aeruginosa PCC 9717]
 gi|389716273|emb|CCH99457.1| Similar to tr|P73619|P73619 [Microcystis aeruginosa PCC 9717]
          Length = 784

 Score =  417 bits (1073), Expect = e-113,   Method: Compositional matrix adjust.
 Identities = 262/803 (32%), Positives = 398/803 (49%), Gaps = 93/803 (11%)

Query: 87  QGIPLGGIGAGSIGRSYRGEFQRFKLFHGICDDAPVLANQFSVFVSRPNGEKFSSVLCPR 146
            G+PLGG GAG IGRS +G+F  + L  G      + A QFS+F    +G   +  +   
Sbjct: 41  HGMPLGGFGAGCIGRSPKGDFNLWHLDGGEHSFNSLAACQFSIFEQTEDGSAQAYAMATE 100

Query: 147 SPGVPKKNTDSGIESWDWNLKGENCTYHALFPRAWTVYDGEPDPELRIVCRQISPFIPHN 206
           SP       D  +  W W    E  +Y AL+PR+W  Y+ +   + +++C Q SP   HN
Sbjct: 101 SP------KDGTLSRWSW-YPAEKGSYSALYPRSW--YEYKDVFQTQLICEQFSPVWAHN 151

Query: 207 YKESSFPASVFTFTLSNSGQTSADVTLLFTWANSVA-------------GDSGLSGHHFN 253
           Y+ESS+P ++F +T  N       ++++ TW N++               D G   + + 
Sbjct: 152 YQESSYPIALFEWTAHNPTDKPITISIMLTWQNTIGWFTNAIKSPVVKVRDDGSPVYEYQ 211

Query: 254 SKTMTKDGVH----------GLTLHHRTAN-GRPPVTFAVAAEETADVHVSECPCFLLSG 302
           SK    +G +          G  +     N G      A+A+     + V     F L  
Sbjct: 212 SKWGESEGNYNQWIVDNFRVGCLMGRDYENPGEGDGQMAIASITNPSLEV-----FYLGK 266

Query: 303 NSKGITAKDMWNEIKKHGSFDHLDNDKTSPSEPGSSIGAAIAASLTIPSGSTRSVTFSLA 362
            +      ++W+    +GS   ++++  +P+E G     A+A   TI  G TR + F LA
Sbjct: 267 WNPAGDGGEVWDYFAMNGSLPDIEDE--TPAEKGEQTATAMAIRFTIRPGKTRKIPFILA 324

Query: 363 WDCPEVKFFEKV-YHRRYTKFYGTLGDSAARIAHDAILEHAKWECEIEAWQRPILEDKRF 421
           WD P  +F + + Y+RRYT F+G  G +A  +   A+     W   I  WQ+PILE    
Sbjct: 325 WDLPVTEFAQGINYYRRYTDFFGRNGQNAWAMVRTALKHGDTWRENIINWQKPILERSDL 384

Query: 422 PEWYPITLFNELYYLNAGGTIWTDGSPPMQSLATIRERKFSLDTPSKIHTPASSDTALGT 481
            +W+ + LFNELY L  GGT+WT  +                                  
Sbjct: 385 ADWFKMALFNELYLLTDGGTLWTAATE--------------------------------- 411

Query: 482 RLLENGEENIGQFLYLEGAEYVMYNTYDVHFYSSFALVMLFPKLELSIQRDFAAAVMMHD 541
                 ++ IGQF  LE  +Y  Y + DV  Y SF L+ML+P+LE ++   F+ A+   D
Sbjct: 412 ------QDPIGQFGVLECIDYRWYESLDVRLYGSFGLLMLWPRLEKAVMEAFSRAIPSSD 465

Query: 542 PGTMKIMSDGKWVARKCLGAVPHDIGL--DDPWFEINSYNLFNSSRWKDLNSKFVLQVYR 599
                I  +     RK   A PHD+G   + PW + N  +  + + WKDL S FVL VYR
Sbjct: 466 DTPRIIGYNQASAIRKAKNATPHDLGAPNEHPWQKSNYTSYQDCNLWKDLGSDFVLLVYR 525

Query: 600 DFVATG--DKNFARAVWPSVYIAMAYMEQFDKDGDGMIENEGFPDQTYDAWSANGVSAYC 657
           D++ TG  D++F R  WPS+ + + Y++ FD D DG+ EN G PDQT+D W   G+SAYC
Sbjct: 526 DYLLTGAKDEDFLRECWPSIVLTLQYLKTFDLDNDGIPENSGAPDQTFDDWRLQGISAYC 585

Query: 658 GGLWVAALQAASALANDVGDHASASYFWVRYQKAKAVY-DSLWNGSYFNYDNSDGSSSTS 716
           GGLW+AAL+A   +   +G+  +    W+  Q+++A+Y  +LWNG Y+N D+  GS S  
Sbjct: 586 GGLWIAALEATIKIGEIIGEDTAIFTSWL--QQSRAIYHQTLWNGEYYNLDS--GSGSDV 641

Query: 717 IQADQLAGQWYARACGLLPIADEAKVKKALTKIYDFNVLKVKGGMCGAMNGMQPDGRIDM 776
           +  DQL GQ+YAR   L  + +    + AL+KIY    LK  GG  GA NG++PDG  + 
Sbjct: 642 VMTDQLCGQFYARLLSLPDVVENQYTQSALSKIYQACFLKFHGGKFGAANGLKPDGTPEN 701

Query: 777 -SGLQAREIWPGVTYGLAASMIQEEMVDMAFQTAAGVYEVAWSEDGLGYSFQTPESWNNN 835
                  E+W G+ +GL A ++Q  M    +Q    V +  + E+GL   F+TPE+  + 
Sbjct: 702 PEATHPLEVWVGINFGLVAFLLQMGMEKQGWQITEAVVKQVY-ENGL--QFRTPEAITSV 758

Query: 836 DEYRSLCYMRPLTIWAMQWALTK 858
             +R+  Y+R + IWA+   LT+
Sbjct: 759 GTFRACHYLRAMAIWAIYGVLTE 781


>gi|166363582|ref|YP_001655855.1| glycosidase-like protein [Microcystis aeruginosa NIES-843]
 gi|166085955|dbj|BAG00663.1| glycosidase-like protein [Microcystis aeruginosa NIES-843]
          Length = 784

 Score =  417 bits (1072), Expect = e-113,   Method: Compositional matrix adjust.
 Identities = 265/803 (33%), Positives = 398/803 (49%), Gaps = 93/803 (11%)

Query: 87  QGIPLGGIGAGSIGRSYRGEFQRFKLFHGICDDAPVLANQFSVFVSRPNGEKFSSVLCPR 146
            G+PLGG GAG IGRS +G+F  + L  G      + A QFS+F    +G   +  +   
Sbjct: 41  HGMPLGGFGAGCIGRSPKGDFNLWHLDGGEHSFNSLAACQFSIFEQTEDGSAQAYAMATE 100

Query: 147 SPGVPKKNTDSGIESWDWNLKGENCTYHALFPRAWTVYDGEPDPELRIVCRQISPFIPHN 206
           SP      TD  +  W W    E  TY AL+PR+W  Y+ +   + +++C Q SP   HN
Sbjct: 101 SP------TDGTLSRWSW-YPAEKGTYSALYPRSW--YEYKDVFQTQLICEQFSPVWAHN 151

Query: 207 YKESSFPASVFTFTLSNSGQTSADVTLLFTWANSVA-------------GDSGLSGHHFN 253
           Y+ESS+P ++F +T  N       ++++ TW N++               D G   + + 
Sbjct: 152 YQESSYPIALFEWTAHNPTDKPITISIMLTWQNTIGWFTNGIKSPVVKVRDDGSPVYEYQ 211

Query: 254 SKTMTKDGVH----------GLTLHHRTAN-GRPPVTFAVAAEETADVHVSECPCFLLSG 302
           SK    +G +          G  +     N G      A+A      + V     F L  
Sbjct: 212 SKWGESEGNYNQWIVDNFRVGCLMRRDYENPGEGDGQMAIATITNPSLEV-----FYLGK 266

Query: 303 NSKGITAKDMWNEIKKHGSFDHLDNDKTSPSEPGSSIGAAIAASLTIPSGSTRSVTFSLA 362
            +      ++W+    +GS   ++++  +P++ G     A+A   TI  G TR + F LA
Sbjct: 267 WNPAGDGGEVWDYFAMNGSLPDIEDE--TPAQKGEQTATAMAIRFTIRPGKTRKIPFILA 324

Query: 363 WDCPEVKFFEKV-YHRRYTKFYGTLGDSAARIAHDAILEHAKWECEIEAWQRPILEDKRF 421
           WD P  +F + + Y+RRYT F+G  G +A  +   A+     W   I  WQ+PILE    
Sbjct: 325 WDLPVNEFAQGINYYRRYTDFFGRNGQNAWAMVRTALKHGDTWRENIINWQKPILERSDL 384

Query: 422 PEWYPITLFNELYYLNAGGTIWTDGSPPMQSLATIRERKFSLDTPSKIHTPASSDTALGT 481
            +W+ + LFNELY L  GGT+WT  +          ER                      
Sbjct: 385 ADWFKMALFNELYLLTDGGTLWTAAT----------ER---------------------- 412

Query: 482 RLLENGEENIGQFLYLEGAEYVMYNTYDVHFYSSFALVMLFPKLELSIQRDFAAAVMMHD 541
                  + IGQF  LE  +Y  Y + DV  Y SF L+ML+P+LE ++   FA A+   D
Sbjct: 413 -------DPIGQFGVLECIDYRWYESLDVRLYGSFGLLMLWPRLEKAVMEAFARAIPSSD 465

Query: 542 PGTMKIMSDGKWVARKCLGAVPHDIGL--DDPWFEINSYNLFNSSRWKDLNSKFVLQVYR 599
                I  +     RK   A PHD+G   + PW + N  +  + + WKDL S FVL VYR
Sbjct: 466 DTPRIIGYNQASAIRKTKNATPHDLGAPNEHPWQKSNYTSYQDCNLWKDLGSDFVLLVYR 525

Query: 600 DFVATG--DKNFARAVWPSVYIAMAYMEQFDKDGDGMIENEGFPDQTYDAWSANGVSAYC 657
           D++ TG  D++F R  WPS+ + + Y++ FD D DG+ EN G PDQT+D W   G+SAYC
Sbjct: 526 DYLLTGAKDEDFLRECWPSIVLTLQYLKTFDLDKDGIPENSGAPDQTFDDWRLQGISAYC 585

Query: 658 GGLWVAALQAASALANDVGDHASASYFWVRYQKAKAVY-DSLWNGSYFNYDNSDGSSSTS 716
           GGLW+AAL+A   +   +G+       W+  Q+++A+Y  +LWNG Y+N D+  GS S  
Sbjct: 586 GGLWIAALEATLKIGAILGEDTEIFASWL--QQSRAIYHQTLWNGEYYNLDS--GSGSDV 641

Query: 717 IQADQLAGQWYARACGLLPIADEAKVKKALTKIYDFNVLKVKGGMCGAMNGMQPDGRIDM 776
           +  DQL GQ+Y+R   L  + +    + AL+KIY    LK  GG  GA NG++PDG  + 
Sbjct: 642 VMTDQLCGQFYSRLLALPDVVENQYTQSALSKIYQACFLKFHGGKFGAANGLKPDGTPEN 701

Query: 777 -SGLQAREIWPGVTYGLAASMIQEEMVDMAFQTAAGVYEVAWSEDGLGYSFQTPESWNNN 835
                  E+W G+ +GL A ++Q  M    +Q    V +  + E+GL   F+TPE+  + 
Sbjct: 702 PQATHPLEVWVGINFGLVAFLMQMGMEKQGWQITEAVVKQVY-ENGL--QFRTPEAITSV 758

Query: 836 DEYRSLCYMRPLTIWAMQWALTK 858
             +R+  Y+R + IWA+   LT+
Sbjct: 759 GTFRACHYLRAMAIWAIYGVLTE 781


>gi|443325334|ref|ZP_21054034.1| putative bile acid beta-glucosidase [Xenococcus sp. PCC 7305]
 gi|442795063|gb|ELS04450.1| putative bile acid beta-glucosidase [Xenococcus sp. PCC 7305]
          Length = 808

 Score =  417 bits (1071), Expect = e-113,   Method: Compositional matrix adjust.
 Identities = 273/820 (33%), Positives = 397/820 (48%), Gaps = 110/820 (13%)

Query: 87  QGIPLGGIGAGSIGRSYRGEFQRFKLFHGICDDAPVLANQFSVFVSRPNGEKFSSVLCPR 146
            G PLGG GAG IGRS +G+F  + L  G      + A QFSVF S   GE  +  +   
Sbjct: 37  HGAPLGGFGAGCIGRSPKGDFNLWHLDGGEHLLNSIPACQFSVFESVAGGETQAFAMACE 96

Query: 147 SPGVPKK--NTDSGIESWDWNLKGENCTYHALFPRAWTVYDGEPDPELRIVCRQISPFIP 204
             G   K  + +  +  W+W  + E   Y AL+PR+W  Y G    EL   C Q SP   
Sbjct: 97  DDGEIGKWGDREESLSRWNWYPQ-EQGVYAALYPRSWYEYQGVFASEL--TCEQFSPIWK 153

Query: 205 HNYKESSFPASVFTFTLSNSGQTSADVTLLFTWANSVA-------------GDSGLSGHH 251
            NY+E+S+P ++F +T  N       ++++ TW N+V               D G   + 
Sbjct: 154 ENYQETSYPVAIFDWTAHNPTDKPITLSIMLTWQNTVGWFTNAIKSPEVKVRDDGSPEYE 213

Query: 252 FNSKTMTKDG---------------VHGLTLHHRTANGRPPVTFAVAAEETADVHVSECP 296
           +  K     G               ++ +  H     G   +  A  A    +V      
Sbjct: 214 YQPKWGDSTGNYNQWIQDSYRVGCILNRVQPHSEIQEGEGQIAIATVANPAVEV------ 267

Query: 297 CFLLSGNSKGITAKDMWNEIKKHGSFDHLDNDKTSPSEPGSSIGAAIAASLTIPSGSTRS 356
            F LS  +      ++W+     GS    D +  +P++PG  I  AIA  +TIP G T+ 
Sbjct: 268 -FYLSRWNPEGDGSEVWDSFAADGSLP--DKEDETPADPGEQIACAIAVRVTIPPGKTKK 324

Query: 357 VTFSLAWDCPEVKFFEKV-YHRRYTKFYGTLGDSAARIAHDAILEHAKWECEIEAWQRPI 415
           + F LAWD P  +F + + Y+RRYT F+G  G++   I   A+     W  +IEAWQ PI
Sbjct: 325 IPFILAWDFPVTEFAQGINYYRRYTDFFGRTGNNVWSIVKTALKHSDWWREQIEAWQNPI 384

Query: 416 LEDKRFPEWYPITLFNELYYLNAGGTIWTDGSPPMQSLATIRERKFSLDTPSKIHTPASS 475
           LE    P+W+ + LFNELY L  GGT+W+                            A++
Sbjct: 385 LERPDLPDWFKMALFNELYLLTQGGTLWS----------------------------AAT 416

Query: 476 DTALGTRLLENGEENIGQFLYLEGAEYVMYNTYDVHFYSSFALVMLFPKLELSIQRDFAA 535
           ++           + +GQF   E  +Y  Y + DV  Y SF L+ML+ KL+ ++   FA 
Sbjct: 417 ES-----------DPVGQFCVCECMDYRWYESLDVRLYGSFGLMMLWSKLDKAVLEAFAR 465

Query: 536 AVMMHDPGTMKIMSDGKWVARKCLGAVPHDIGL--DDPWFEINSYNLFNSSRWKDLNSKF 593
           A+   D     I  +G    RK  GA PHD+G   + PW   N  +  + ++WKDL S F
Sbjct: 466 AIPKSDDKPRIIGYNGATAIRKAAGATPHDLGAPNEHPWEATNYTSYQDCNQWKDLPSDF 525

Query: 594 VLQVYRDFVATGDKN--FARAVWPSVYIAMAYMEQFDKDGDGMIENEGFPDQTYDAWSAN 651
           VLQVYRD++ T   N  F    W SV   +AY+++FD+D DG+ EN G PDQT+D W   
Sbjct: 526 VLQVYRDYLFTEGDNTEFLWECWDSVVCTLAYLKEFDEDNDGIPENSGAPDQTFDDWRLQ 585

Query: 652 GVSAYCGGLWVAALQAASALANDV-----------------GDHASASYFWVRYQKAKAV 694
           G+S+YCGGLW+AAL+AA  +   +                 G   + S +    Q+++ +
Sbjct: 586 GISSYCGGLWIAALEAAIKIGEILIANPTKNPELQTADFPQGIEKTISDYREWLQRSREL 645

Query: 695 Y-DSLWNGSYFNYDNSDGSSSTSIQADQLAGQWYARACGLLPIADEAKVKKALTKIYDFN 753
           Y D+LWNG Y+  D+   S+S  + ADQL GQ+YAR  GL  + +    K AL+KIYD  
Sbjct: 646 YHDTLWNGEYYRLDSK--SNSEVVMADQLCGQFYARLLGLPDVVENQYTKSALSKIYDAC 703

Query: 754 VLKVKGGMCGAMNGMQPDGR-IDMSGLQAREIWPGVTYGLAASMIQEEMVDMAFQTAAGV 812
            LK   G  GA NG+ PDG  ++       E+W G+ +GLAA M+Q  M D AF+    V
Sbjct: 704 FLKFHDGKFGAANGVLPDGSAVNPDDTHPLEVWTGINFGLAAFMVQMGMKDEAFRLTETV 763

Query: 813 YEVAWSEDGLGYSFQTPESWNNNDEYRSLCYMRPLTIWAM 852
               + E+GL   F+TPE+      +R+  Y+R + IW +
Sbjct: 764 VRQVY-ENGL--QFRTPEAITAVGTFRASHYLRAMAIWGV 800


>gi|425464233|ref|ZP_18843555.1| Glycosidase-like protein [Microcystis aeruginosa PCC 9809]
 gi|389833806|emb|CCI21373.1| Glycosidase-like protein [Microcystis aeruginosa PCC 9809]
          Length = 784

 Score =  417 bits (1071), Expect = e-113,   Method: Compositional matrix adjust.
 Identities = 263/803 (32%), Positives = 397/803 (49%), Gaps = 93/803 (11%)

Query: 87  QGIPLGGIGAGSIGRSYRGEFQRFKLFHGICDDAPVLANQFSVFVSRPNGEKFSSVLCPR 146
            G+PLGG GAG IGRS +G+F  + L  G      + A QFS+F    +G   +  +   
Sbjct: 41  HGMPLGGFGAGCIGRSPKGDFNLWHLDGGEHSFNSLAACQFSIFEQTEDGSAQAYAMATE 100

Query: 147 SPGVPKKNTDSGIESWDWNLKGENCTYHALFPRAWTVYDGEPDPELRIVCRQISPFIPHN 206
           SP      TD  +  W W    E  TY AL+PR+W  Y+ +   + +++C Q SP   HN
Sbjct: 101 SP------TDGTLSRWSW-YPAEKGTYSALYPRSW--YEYKDVFQTQLICEQFSPVWAHN 151

Query: 207 YKESSFPASVFTFTLSNSGQTSADVTLLFTWANSVA-------------GDSGLSGHHFN 253
           Y+ESS+P ++F +T  N       ++++ TW N++               D G   + + 
Sbjct: 152 YQESSYPIALFEWTAHNPTDKPITISIMLTWQNTIGWFTNAIKSPVVKVRDDGSPVYEYQ 211

Query: 254 SKTMTKDGVH----------GLTLHHRTAN-GRPPVTFAVAAEETADVHVSECPCFLLSG 302
           SK    +G +          G  +     N G      A+A+     + V     F L  
Sbjct: 212 SKWGESEGNYNQWIVDNFRVGCLMGRDYENPGEGDGQMAIASITNPSLEV-----FYLGK 266

Query: 303 NSKGITAKDMWNEIKKHGSFDHLDNDKTSPSEPGSSIGAAIAASLTIPSGSTRSVTFSLA 362
            +      ++W+    +GS   ++++  +P+E G     A+A   TI  G TR + F LA
Sbjct: 267 WNPAGDGGEVWDYFAMNGSLPDIEDE--TPAEKGEQTATAMAIRFTIRPGKTRKIPFILA 324

Query: 363 WDCPEVKFFEKV-YHRRYTKFYGTLGDSAARIAHDAILEHAKWECEIEAWQRPILEDKRF 421
           WD P  +F + + Y+RRYT F+G  G +A  +   A+     W   I  WQ+PILE    
Sbjct: 325 WDLPVTEFAQGINYYRRYTDFFGRNGQNAWAMVRTALKHGDTWRENIINWQKPILERTDL 384

Query: 422 PEWYPITLFNELYYLNAGGTIWTDGSPPMQSLATIRERKFSLDTPSKIHTPASSDTALGT 481
             W+ + LFNELY L  GGT+WT  +                                  
Sbjct: 385 AAWFKMALFNELYLLTDGGTLWTAATE--------------------------------- 411

Query: 482 RLLENGEENIGQFLYLEGAEYVMYNTYDVHFYSSFALVMLFPKLELSIQRDFAAAVMMHD 541
                 ++ IGQF  LE  +Y  Y + DV  Y SF L+ML+P+LE ++   FA A+   D
Sbjct: 412 ------QDPIGQFGVLECIDYRWYESLDVRLYGSFGLLMLWPRLEKAVMEAFARAIPSSD 465

Query: 542 PGTMKIMSDGKWVARKCLGAVPHDIGL--DDPWFEINSYNLFNSSRWKDLNSKFVLQVYR 599
                I  +     RK   A PHD+G   + PW + N  +  + + WKDL S FVL VYR
Sbjct: 466 DTPRIIGYNQASAIRKAKNATPHDLGAPNEHPWQKSNYTSYQDCNLWKDLGSDFVLLVYR 525

Query: 600 DFVATG--DKNFARAVWPSVYIAMAYMEQFDKDGDGMIENEGFPDQTYDAWSANGVSAYC 657
           D++ TG  D++F R  WPS+ + + Y++ FD D DG+ EN G PDQT+D W   G+SAYC
Sbjct: 526 DYLLTGAKDEDFLRECWPSIVLTLQYLKTFDLDNDGIPENSGAPDQTFDDWRLQGISAYC 585

Query: 658 GGLWVAALQAASALANDVGDHASASYFWVRYQKAKAVY-DSLWNGSYFNYDNSDGSSSTS 716
           GGLW+AAL+A   +   +G+  +    W+  Q+++ +Y  +LWNG Y+N D+  GS S  
Sbjct: 586 GGLWIAALEATIKIGAILGEDTAIFASWL--QQSRTIYHQTLWNGEYYNLDS--GSGSDV 641

Query: 717 IQADQLAGQWYARACGLLPIADEAKVKKALTKIYDFNVLKVKGGMCGAMNGMQPDGRIDM 776
           +  DQL GQ+Y+R   L  + +    + AL+KIY    LK  GG  GA NG++PDG  + 
Sbjct: 642 VMTDQLCGQFYSRLLALPDVVENQYTQSALSKIYQACFLKFHGGKFGAANGLKPDGTPEN 701

Query: 777 -SGLQAREIWPGVTYGLAASMIQEEMVDMAFQTAAGVYEVAWSEDGLGYSFQTPESWNNN 835
                  E+W G+ +GL A ++Q  M    +Q    V +  + E+GL   F+TPE+  + 
Sbjct: 702 PQATHPLEVWVGINFGLVAFLMQMGMEKQGWQITEAVVKQVY-ENGL--QFRTPEAITSV 758

Query: 836 DEYRSLCYMRPLTIWAMQWALTK 858
             +R+  Y+R + IWA+   LT+
Sbjct: 759 GTFRACHYLRAMAIWAIYGVLTE 781


>gi|434400779|ref|YP_007134783.1| Glucosylceramidase [Stanieria cyanosphaera PCC 7437]
 gi|428271876|gb|AFZ37817.1| Glucosylceramidase [Stanieria cyanosphaera PCC 7437]
          Length = 804

 Score =  416 bits (1069), Expect = e-113,   Method: Compositional matrix adjust.
 Identities = 281/818 (34%), Positives = 400/818 (48%), Gaps = 115/818 (14%)

Query: 87  QGIPLGGIGAGSIGRSYRGEFQRFKLFHGICDDAPVLANQFSVFVSRPNGEKFSSVLCPR 146
            G+PLGG GAG IGRS+RG+F  + L  G      + A QFSVF S    +  +  LC  
Sbjct: 42  HGMPLGGFGAGCIGRSHRGDFNLWHLDGGEHVFRSLPACQFSVFESS-EAKTQAYALCTE 100

Query: 147 SPGVPKKNTDSGIESWDWNLKGENCTYHALFPRAWTVYDGEPDPELRIVCRQISPFIPHN 206
            P       D  +  W W  K E   YHAL+PR+W  Y      +L   C Q SP    N
Sbjct: 101 PP------EDGTLSRWQWYPK-EKGNYHALYPRSWFEYQDVFQSQL--TCEQFSPIWAEN 151

Query: 207 YKESSFPASVFTFTLSNSGQTSADVTLLFTWANSVA-------------GDSGLSGHHFN 253
           Y+ESS+P ++F + + N       ++++ TW N+V               D G   + + 
Sbjct: 152 YQESSYPVAIFDWNVHNPTDKPITLSIMLTWQNTVGWFTNAIKSPTVKVRDDGSPVYEYQ 211

Query: 254 SKTMTKDG---------------VHGLTLHHRTANGRPPVTFAVAAEETADVHVSECPCF 298
            K     G               ++ +  H     G   +  A     T +V       F
Sbjct: 212 PKWGDSTGNFNQWIQDNYRVGCLLNRIQPHAEIQEGEGQIAIATVTNPTLEV-------F 264

Query: 299 LLSGNSKGITAKDMWNEIKKHGSFDHLDNDKTSPSEPGSSIGAAIAASLTIPSGSTRSVT 358
            LS         D+W+     GS    D    +P++PG  I +AIA   T+  G TR + 
Sbjct: 265 YLSRWHPDGDGADVWDYFAADGSLP--DRQDETPADPGEQIASAIAVRFTVNPGKTRQIP 322

Query: 359 FSLAWDCPEVKFFEKV-YHRRYTKFYGTLGDSAARIAHDAILEHAKWECEIEAWQRPILE 417
           F LAWD P  +F + + Y+RRYT F+G  G++A  I   A+     W+  I+AWQ PIL+
Sbjct: 323 FVLAWDLPVTEFAQGIKYYRRYTDFFGRTGNNAWSIVKTALKHSDWWQESIQAWQEPILK 382

Query: 418 DKRFPEWYPITLFNELYYLNAGGTIWTDGSPPMQSLATIRERKFSLDTPSKIHTPASSDT 477
            +  P+W+ + LFNELY L  GGT+W+                            A+++T
Sbjct: 383 RQDLPDWFKMALFNELYLLTQGGTLWS----------------------------AATET 414

Query: 478 ALGTRLLENGEENIGQFLYLEGAEYVMYNTYDVHFYSSFALVMLFPKLELSIQRDFAAAV 537
                      + IGQF  LE  +Y  Y + DV  Y SF L+ML+ KL+ S+   FA A+
Sbjct: 415 -----------DPIGQFGVLECMDYRWYESLDVRLYGSFGLMMLWSKLDKSVLEAFARAI 463

Query: 538 MMHDPGTMKIMSDGKWVARKCLGAVPHDIGL--DDPWFEINSYNLFNSSRWKDLNSKFVL 595
              D     I  +     RK  GA PHD+G   + PW   N  +  + + WKDL S FVL
Sbjct: 464 PKGDDTPRIIGYNRAQAIRKAAGATPHDLGAPNEHPWQATNYTSYQDCNLWKDLPSDFVL 523

Query: 596 QVYRDFVATGDKN--FARAVWPSVYIAMAYMEQFDKDGDGMIENEGFPDQTYDAWSANGV 653
           QVYRD+V TG K+  F    W S+ + +AY+++FD DGDG+ EN G PDQT+D W   G+
Sbjct: 524 QVYRDYVLTGSKDSEFLWECWDSIVVTLAYLKEFDLDGDGIPENSGAPDQTFDDWRLQGI 583

Query: 654 SAYCGGLWVAALQAASALANDVGDHASA-----------------SYFWVRYQKAKAVY- 695
           SAYCGGLW+AAL+AA A+   +  H                    +++ V  Q+A+++Y 
Sbjct: 584 SAYCGGLWIAALEAAIAIGQILMTHPPMNPNLQPPDYPQGIETVINHYRVWLQQARSLYH 643

Query: 696 DSLWNGSYFNYDNSDGSSSTSIQADQLAGQWYARACGLLPIADEAKVKKALTKIYDFNVL 755
           D+LWNG Y+  D+   S+S  + ADQL GQ+YAR  GL  + +E   K AL KIY    L
Sbjct: 644 DTLWNGEYYRLDSQ--SNSDVVMADQLCGQFYARLLGLPDVVEEQYTKSALKKIYQACFL 701

Query: 756 KVKGGMCGAMNGMQPDGR-IDMSGLQAREIWPGVTYGLAASMIQEEMVDMAFQTAAGVYE 814
           K   G  GA NG+ PDG  ++ +     E+W G+ +GLAA M+Q  M   AFQ    V +
Sbjct: 702 KFHNGKYGAANGVLPDGSPVNPNDTHPLEVWTGINFGLAAFMLQMGMKAEAFQLTETVVK 761

Query: 815 VAWSEDGLGYSFQTPESWNNNDEYRSLCYMRPLTIWAM 852
             + E+GL   F+TPE+      +R+  Y+R + IW +
Sbjct: 762 QVY-ENGL--QFRTPEAITAVGTFRASHYLRAMAIWGI 796


>gi|390437958|ref|ZP_10226465.1| Glycosidase-like protein [Microcystis sp. T1-4]
 gi|389838618|emb|CCI30589.1| Glycosidase-like protein [Microcystis sp. T1-4]
          Length = 784

 Score =  414 bits (1065), Expect = e-113,   Method: Compositional matrix adjust.
 Identities = 262/803 (32%), Positives = 396/803 (49%), Gaps = 93/803 (11%)

Query: 87  QGIPLGGIGAGSIGRSYRGEFQRFKLFHGICDDAPVLANQFSVFVSRPNGEKFSSVLCPR 146
            G+PLGG GAG IGRS +G+F  + L  G      + A QFS+F    +G   +  +   
Sbjct: 41  HGMPLGGFGAGCIGRSPKGDFNLWHLDGGEHSFNSLAACQFSIFEQTEDGSAQAYAMATE 100

Query: 147 SPGVPKKNTDSGIESWDWNLKGENCTYHALFPRAWTVYDGEPDPELRIVCRQISPFIPHN 206
           SP      TD  +  W W    E   Y AL+PR+W  Y+ +   + +++C Q SP   HN
Sbjct: 101 SP------TDGTLSRWSW-YPAEKGNYSALYPRSW--YEYKDVFQTQLICEQFSPVWAHN 151

Query: 207 YKESSFPASVFTFTLSNSGQTSADVTLLFTWANSVA-------------GDSGLSGHHFN 253
           Y+ESS+P ++F +T  N       ++++ TW N++               D G   + + 
Sbjct: 152 YQESSYPIALFEWTAHNPTDKPITISIMLTWQNTIGWFTNAIKSPVVKVRDDGSPVYEYQ 211

Query: 254 SKTMTKDGVH----------GLTLHHRTAN-GRPPVTFAVAAEETADVHVSECPCFLLSG 302
           SK    +G +          G  +     N G      A+A+     + V     F L  
Sbjct: 212 SKWGESEGNYNQWIVDNFRVGCLMRRDYENPGEGDGQMAIASITNPSLEV-----FYLGK 266

Query: 303 NSKGITAKDMWNEIKKHGSFDHLDNDKTSPSEPGSSIGAAIAASLTIPSGSTRSVTFSLA 362
            +      ++W+    +GS   ++++  +P++ G     A+A   TI  G TR + F LA
Sbjct: 267 WNPAGDGGEVWDYFAMNGSLPDIEDE--TPAQKGEQTATAMAIRFTIRPGKTRKIPFILA 324

Query: 363 WDCPEVKFFEKV-YHRRYTKFYGTLGDSAARIAHDAILEHAKWECEIEAWQRPILEDKRF 421
           WD P  +F + + Y+RRYT F+G  G +A  +   A+     W   I  WQ+PILE    
Sbjct: 325 WDLPVTEFAQGINYYRRYTDFFGRNGQNAWAMVRTALKHGDTWRENIINWQKPILERSDL 384

Query: 422 PEWYPITLFNELYYLNAGGTIWTDGSPPMQSLATIRERKFSLDTPSKIHTPASSDTALGT 481
            +W+ + LFNELY L  GGT+WT  +                                  
Sbjct: 385 ADWFKMALFNELYLLTDGGTLWTAATE--------------------------------- 411

Query: 482 RLLENGEENIGQFLYLEGAEYVMYNTYDVHFYSSFALVMLFPKLELSIQRDFAAAVMMHD 541
                 ++ IGQF  LE  +Y  Y + DV  Y SF L+ML+P+LE ++   FA A+   D
Sbjct: 412 ------QDPIGQFGVLECIDYRWYESLDVRLYGSFGLLMLWPRLEKAVMEAFARAIPSSD 465

Query: 542 PGTMKIMSDGKWVARKCLGAVPHDIGL--DDPWFEINSYNLFNSSRWKDLNSKFVLQVYR 599
                I  +     RK   A PHD+G   + PW + N  +  + + WKDL S FVL VYR
Sbjct: 466 DTPRIIGYNQASAIRKAKNATPHDLGAPNEHPWQKSNYTSYQDCNLWKDLGSDFVLLVYR 525

Query: 600 DFVATG--DKNFARAVWPSVYIAMAYMEQFDKDGDGMIENEGFPDQTYDAWSANGVSAYC 657
           D++ TG  D++F R  WPS+ + + Y++ FD D DG+ EN G PDQT+D W   G+SAYC
Sbjct: 526 DYLLTGAKDEDFLRECWPSIVLTLQYLKTFDLDKDGIPENSGAPDQTFDDWRLQGISAYC 585

Query: 658 GGLWVAALQAASALANDVGDHASASYFWVRYQKAKAVY-DSLWNGSYFNYDNSDGSSSTS 716
           GGLW+AAL+A   +   +G+  +    W+  Q+++A+Y  +LWNG Y+N D+  GS S  
Sbjct: 586 GGLWIAALEATLKIGAILGEDTAIFAGWL--QQSRAIYHQTLWNGEYYNLDS--GSGSDV 641

Query: 717 IQADQLAGQWYARACGLLPIADEAKVKKALTKIYDFNVLKVKGGMCGAMNGMQPDGRIDM 776
           +  DQL GQ+YAR   L  + +    + AL+KIY    LK   G  GA NG++PDG  + 
Sbjct: 642 VMTDQLCGQFYARLLALPDVVENQYTQSALSKIYQACFLKFHEGKFGAANGLKPDGTPEN 701

Query: 777 -SGLQAREIWPGVTYGLAASMIQEEMVDMAFQTAAGVYEVAWSEDGLGYSFQTPESWNNN 835
                  E+W G+ +GL A ++Q  M    +Q    V    + E+GL   F+TPE+  + 
Sbjct: 702 PQATHPLEVWVGINFGLVAFLMQMGMEKQGWQITEAVVRQVY-ENGL--QFRTPEAITSV 758

Query: 836 DEYRSLCYMRPLTIWAMQWALTK 858
             +R+  Y+R + IWA+   LT+
Sbjct: 759 GTFRACHYLRAMAIWAIYGILTE 781


>gi|172036524|ref|YP_001803025.1| hypothetical protein cce_1609 [Cyanothece sp. ATCC 51142]
 gi|354553307|ref|ZP_08972614.1| Glucosylceramidase [Cyanothece sp. ATCC 51472]
 gi|171697978|gb|ACB50959.1| unknown [Cyanothece sp. ATCC 51142]
 gi|353555137|gb|EHC24526.1| Glucosylceramidase [Cyanothece sp. ATCC 51472]
          Length = 818

 Score =  412 bits (1059), Expect = e-112,   Method: Compositional matrix adjust.
 Identities = 268/818 (32%), Positives = 394/818 (48%), Gaps = 114/818 (13%)

Query: 87  QGIPLGGIGAGSIGRSYRGEFQRFKLFHGICDDAPVLANQFSVFVSRPNGEKFSSVLCPR 146
            G+P+GG G+G IGRS RG+F  + L  G      + + QFS+F    NG   +  L   
Sbjct: 55  HGMPIGGFGSGCIGRSPRGDFNLWHLDGGEHIFKSIPSCQFSIFEQPENGTAQAYALATE 114

Query: 147 SPGVPKKNTDSGIESWDWNLKGENCTYHALFPRAWTVYDGEPDPELRIVCRQISPFIPHN 206
            P       D+ +  W W    E  TY AL+P +W  Y+G    E  I C Q SP IP N
Sbjct: 115 PP------EDNTLSRWAW-YPTEKGTYSALYPCSWYQYEGVFQAE--IYCEQFSPIIPDN 165

Query: 207 YKESSFPASVFTFTLSNSGQTSADVTLLFTWANSVA-------------GDSGLSGHHFN 253
           Y+E+S+P  VF +T+ N       ++++ TW N V               D G   + + 
Sbjct: 166 YQETSYPIGVFEWTVKNPTDKPITLSIMLTWQNIVGWFTNAIKSPEITVRDDGSPEYEYQ 225

Query: 254 SKTMTKDG---------------VHGLTLHHRTANGRPPVTFAVAAEETADVHVSECPCF 298
            +     G               ++ L  H +   G   +  A     + +V       +
Sbjct: 226 PRWGHSTGNFNQWIQDNFRVGFILNRLQPHQQVQEGEGQICIASVTNPSVEVF------Y 279

Query: 299 LLSGNSKGITAKDMWNEIKKHGSFDHLDNDKTSPSEPGSSIGAAIAASLTIPSGSTRSVT 358
           L   N  G    ++W+    +GS    D +  +P+EPG  I AA+A   T+  G  + + 
Sbjct: 280 LGKWNPNG-DGSEVWDHFAMNGSLP--DQEDETPAEPGEQIAAAMAIRFTVKPGRVKKIP 336

Query: 359 FSLAWDCPEVKFFEKV-YHRRYTKFYGTLGDSAARIAHDAILEHAKWECEIEAWQRPILE 417
           F LAWD P  +F + + Y+RRYT F+G  G +   +   A+     W  +IE WQ PIL 
Sbjct: 337 FILAWDLPVTEFAQGIQYYRRYTDFFGRNGQNGWAMVRTALKHSDVWREKIEEWQNPILN 396

Query: 418 DKRFPEWYPITLFNELYYLNAGGTIWTDGSPPMQSLATIRERKFSLDTPSKIHTPASSDT 477
               P+W+ + LFNELY L  GGT+WT                            A+S+T
Sbjct: 397 RDDLPDWFKMALFNELYLLTDGGTLWT----------------------------AASET 428

Query: 478 ALGTRLLENGEENIGQFLYLEGAEYVMYNTYDVHFYSSFALVMLFPKLELSIQRDFAAAV 537
                      + IGQF  LE  +Y  Y + DV  Y SF L+ML+P+LE ++   FA A+
Sbjct: 429 -----------DPIGQFGVLESMDYRWYESLDVRLYGSFGLLMLWPRLEKAVMEAFARAI 477

Query: 538 MMHDPGTMKIMSDGKWVARKCLGAVPHDIGL--DDPWFEINSYNLFNSSRWKDLNSKFVL 595
              D     I  +     RK  GA PHD+G   + PW + N  +  + + WKDL S FVL
Sbjct: 478 PNSDDTVRIIGYNQAEGIRKTKGATPHDLGAPNEHPWEKTNYTSYQDCNLWKDLGSDFVL 537

Query: 596 QVYRDFVATG--DKNFARAVWPSVYIAMAYMEQFDKDGDGMIENEGFPDQTYDAWSANGV 653
           QVYRDF+ TG  D  F    W ++   + Y++ FD D DG+ EN G PDQT+D W   G+
Sbjct: 538 QVYRDFLLTGSDDIEFLWECWEAIPETLNYLKAFDLDNDGIPENSGAPDQTFDDWELRGI 597

Query: 654 SAYCGGLWVAALQAASALANDVGDHASAS-----------------YFWVRYQKAKAVYD 696
           SAYCG LW+AAL+AA  +   + ++A  +                  +    Q+++A+Y 
Sbjct: 598 SAYCGALWIAALEAAIKIGEILLENAPQNPQLEPENYPKSIEKELDKYQQWLQQSRAIYH 657

Query: 697 S-LWNGSYFNYDNSDGSSSTSIQADQLAGQWYARACGLLPIADEAKVKKALTKIYDFNVL 755
           S LWNG Y+  D+  GS    + ADQL+GQ+YAR  GL  + ++     AL K+Y+   L
Sbjct: 658 STLWNGEYYKLDSESGSD--VVMADQLSGQFYARLLGLPDVVEQQYALSALKKVYEACFL 715

Query: 756 KVKGGMCGAMNGMQPDGRI-DMSGLQAREIWPGVTYGLAASMIQEEMVDMAFQTAAGVYE 814
           K + G  GA NGM+PDG   D +    +E+W G+ +GLA+ +IQ  M D A +    V +
Sbjct: 716 KFQNGKYGAANGMKPDGTPEDPNSTHPQEVWTGINFGLASFLIQMGMKDEALKLTEAVVK 775

Query: 815 VAWSEDGLGYSFQTPESWNNNDEYRSLCYMRPLTIWAM 852
             + E+GL   F+TPE+      +R+  Y+R + IW +
Sbjct: 776 QVY-ENGL--QFRTPEAITAVGTFRACHYLRAMAIWGV 810


>gi|157119417|ref|XP_001653371.1| bile acid beta-glucosidase, putative [Aedes aegypti]
 gi|108883154|gb|EAT47379.1| AAEL001478-PA, partial [Aedes aegypti]
          Length = 898

 Score =  411 bits (1057), Expect = e-112,   Method: Compositional matrix adjust.
 Identities = 276/860 (32%), Positives = 427/860 (49%), Gaps = 148/860 (17%)

Query: 103 YRGEFQRFKLFHGICDDAPVLANQFSVFVSRPNG-------------------EKFSSVL 143
           + GEF RF+L  G+ +   V ANQF V +   N                    +++S + 
Sbjct: 87  FAGEFCRFQLKPGLYEYNTVHANQFIVTIKDENNVTIFQSLLSSYRYVCTWCNDRYSDLF 146

Query: 144 CPRSPGVPKK-NTDSGIESWDWNLKGENCTYHALFPRAWTVYDGEPDPELRIVCRQISPF 202
             +   VP +    + + SW+  +    C Y AL+PRAW+ YD   D  +++V RQISP 
Sbjct: 147 ARKLGLVPIEIRPKNPLSSWETYINSSKCNYTALYPRAWSEYDLS-DYGIKLVQRQISPI 205

Query: 203 IPHNYKESSFPASVFTFTLSNSGQTSADVTLLFTWANSVAGDSGLSGHHFNSKTMTKDGV 262
           IPH+YKESS PA+VF +T+ N       VT+ FT+ N        +  +  +   +    
Sbjct: 206 IPHDYKESSLPAAVFVWTVENVCGKDRQVTITFTFKNGTGNKKQDAEGNSETAEFSHGSA 265

Query: 263 HGLTLHHRTANGRPPVTFAVAAEETADVHVSECPCFLLSGNSKGITAKDMWNEIKKHGSF 322
            G+++     + +   T+ ++ + +++++++ C  F  +GN      + +WN++K++G  
Sbjct: 266 KGVSIKQSIVDMQS--TYCISCKTSSEINLTRCKKFDPTGN-----GEKLWNDLKENGQL 318

Query: 323 DHLDNDKTSPSEPGSSIGAAIAASLTIPSGSTRSVTFSLAWDCPEVKF--FEKVYHRRYT 380
             +  ++   S+    +  A++A + + S  +  + FSL WD P+++F    K Y++ YT
Sbjct: 319 TEVSVEENLKSK---DVAVAVSAQILVQSTGSSDLEFSLVWDMPKIRFGKKTKEYYKYYT 375

Query: 381 KFYGTLGDSAARIAHDAILEHAKWECEIEAWQRPILEDKRFPEWYPITLFNELYYLNAGG 440
           K++G  GD+   I+  A+    KWE  I  WQRPIL+D   P+WY   +FNELY++  GG
Sbjct: 376 KYFGKSGDAGPSISDYALQNFGKWETLINEWQRPILDDNDLPDWYKSAIFNELYFIADGG 435

Query: 441 TIWTDGSPPMQSLATIRERKFSLDTPSKIHTPASSDTALGTRLLENGEENIGQFLYLEGA 500
           ++W           T  + +   D P   +                     G+F YLEG 
Sbjct: 436 SVW----------LTTDDAELPYDDPRLAY---------------------GRFAYLEGH 464

Query: 501 EYVMYNTYDVHFYSSFALVMLFPKLELSIQRDFAAAVMMHDPGTMKIMSDGKWVARKC-- 558
           EY MYNTYDVHFY+S AL  L+P L++S+Q D+   +        K + DGK + RK   
Sbjct: 465 EYRMYNTYDVHFYASHALASLWPNLQVSLQYDYKDTINREISEGRKHLYDGKIIPRKIKN 524

Query: 559 ----------------LGAVP-HDIGLDDPWFEINS----------YNL----------- 580
                           + A P HD+     W ++N+          Y L           
Sbjct: 525 SVPHDLGDPDEEPFDLINAYPIHDVS---EWRDLNTKFILQVYRDYYTLNRYAQQNAENA 581

Query: 581 --FNSSRWKDLNSKF--VLQVYRDFVATGDKNFARAVWPSVYI-----------AMAYME 625
             F+S  + D  S +   +Q  R+ V+  +K   R    S+YI           AM Y++
Sbjct: 582 SKFSSIEFIDKESMYDTYVQDNRNKVSPEEKQQNRKS-ASMYINETNGKVYLMDAMTYLK 640

Query: 626 --------------QFDKDGDGMIENEGFPDQTYDAWSANGVSAYCGGLWVAALQAASAL 671
                         ++DKD DG+IEN   PDQTYD W  +G SAYCGGLW+A+L   +A+
Sbjct: 641 SMYPACKQVLEKTLEWDKDNDGIIENSKCPDQTYDTWVMDGPSAYCGGLWLASLHCMTAM 700

Query: 672 ANDVGDHASASYFWVRYQKAKAVY-DSLWNGSYFNYDNSDGSSSTSIQADQLAGQWYARA 730
           AN +  +   + F    +K K  + + LWNG+Y+ +D    S +T I +DQL G WY R 
Sbjct: 701 ANLLDQNDDCNKFREILEKGKTSFEEKLWNGTYYKFDGQSASKNT-IMSDQLCGHWYLRC 759

Query: 731 CGL-LPIADEAKVKKALTKIYDFNVLKVKGGMCGAMNGMQP------DGRIDMSGLQARE 783
           CG    I  +  V+ AL  IYD NV++  GG  GA+NG  P      DGR D S +Q  E
Sbjct: 760 CGFDYDIFPKENVRTALKTIYDNNVMRFCGGNMGAVNGYVPSTQPNKDGRADASTVQGEE 819

Query: 784 IWPGVTYGLAASMIQEEMVDMAFQTAAGVYEVAWSEDGLGYSFQTPESWNNNDEYRSLCY 843
           +W GVTY LA++MI E M   AFQTA G+Y+     + +G +F+TPE+      YR++ Y
Sbjct: 820 VWTGVTYALASTMIHEGMFTEAFQTAGGMYQTL--SEKIGMNFETPEALYAERHYRAIGY 877

Query: 844 MRPLTIWAMQWALTKPKLSR 863
           MRPL+IW++Q A  + K +R
Sbjct: 878 MRPLSIWSLQTAWEQKKQNR 897


>gi|343959578|dbj|BAK63646.1| bile acid beta-glucosidase [Pan troglodytes]
          Length = 514

 Score =  410 bits (1055), Expect = e-111,   Method: Compositional matrix adjust.
 Identities = 219/496 (44%), Positives = 289/496 (58%), Gaps = 37/496 (7%)

Query: 366 PEVKFFEK--VYHRRYTKFYGTLGDSAARIAHDAILEHAKWECEIEAWQRPILEDKRFPE 423
           P + F  K  V++RRYT+F+G  GD+A  ++H A+  + +WE  I AWQ P+L+D+  P 
Sbjct: 2   PRIMFGAKGQVHYRRYTRFFGQDGDAAPALSHYALCRYTEWEERISAWQSPVLDDRSLPA 61

Query: 424 WYPITLFNELYYLNAGGTIWTDGSPPMQSLATIRERKFSLDTPSKIHTPASSDTALGTRL 483
           WY   LFNELY+L  GGT+W         L  + +               S    LG  +
Sbjct: 62  WYKSALFNELYFLADGGTVW---------LEVLED---------------SLPEELGRNM 97

Query: 484 --LENGEENIGQFLYLEGAEYVMYNTYDVHFYSSFALVMLFPKLELSIQRDFAAAVMMHD 541
             L     + G+F YLEG EY MYNTYDVHFY+SFAL+ML+PKLELS+Q D A A +  D
Sbjct: 98  CHLRPTLRDYGRFGYLEGQEYRMYNTYDVHFYASFALIMLWPKLELSLQYDMALATLRED 157

Query: 542 PGTMKIMSDGKWVARKCLGAVPHDIGL--DDPWFEINSYNLFNSSRWKDLNSKFVLQVYR 599
               + +  G     K    +P DIG   D+PW  +N+Y + +++ WKDLN KFVLQVYR
Sbjct: 158 LTRRRYLMSGVMAPVKRRNVIPRDIGDPDDEPWLRVNAYLIHDTADWKDLNLKFVLQVYR 217

Query: 600 DFVATGDKNFARAVWPSVYIAMAYMEQFDKDGDGMIENEGFPDQTYDAWSANGVSAYCGG 659
           D+  TGD+NF + +WP     M    +FDKD DG+IEN G+ DQTYD W   G SAYCGG
Sbjct: 218 DYYLTGDQNFLKDMWPVCLAVMESEMKFDKDHDGLIENGGYADQTYDGWVTTGPSAYCGG 277

Query: 660 LWVAALQAASALANDVGDHASASYFWVRYQKAKAVYDS-LWNGSYFNYDNSDGSSSTSIQ 718
           LW+AA+     +A   G       F     + +  Y+  LWNG Y+NYD+S    S S+ 
Sbjct: 278 LWLAAVAVMVQMAALCGAQDIQDKFSSILSRGQEAYERLLWNGRYYNYDSSSRPQSRSVM 337

Query: 719 ADQLAGQWYARACGL----LPIADEAKVKKALTKIYDFNVLKVKGGMCGAMNGMQPDGRI 774
           +DQ AGQW+ +ACGL      +     V +AL  I++ NV    GG  GA+NGMQP G  
Sbjct: 338 SDQCAGQWFLKACGLGEGDTEVFPTQHVVRALQTIFELNVQAFAGGAMGAVNGMQPHGVP 397

Query: 775 DMSGLQAREIWPGVTYGLAASMIQEEMVDMAFQTAAGVYEVAWSEDGLGYSFQTPESWNN 834
           D S +Q+ E+W GV YGLAA+MIQE +    FQTA G Y   W    LG +FQTPE++  
Sbjct: 398 DKSSVQSDEVWVGVVYGLAATMIQEGLTWEGFQTAEGCYRTVWER--LGLAFQTPEAYCQ 455

Query: 835 NDEYRSLCYMRPLTIW 850
              +RSL YMRPL+IW
Sbjct: 456 QRVFRSLAYMRPLSIW 471


>gi|440680630|ref|YP_007155425.1| Glucosylceramidase [Anabaena cylindrica PCC 7122]
 gi|428677749|gb|AFZ56515.1| Glucosylceramidase [Anabaena cylindrica PCC 7122]
          Length = 802

 Score =  410 bits (1054), Expect = e-111,   Method: Compositional matrix adjust.
 Identities = 275/822 (33%), Positives = 405/822 (49%), Gaps = 119/822 (14%)

Query: 87  QGIPLGGIGAGSIGRSYRGEFQRFKLFHG--ICDDAPVLANQFSVFVSRPNGEKFSSVLC 144
            G+PLGG GAG IGRS RG+F  + +  G  I +  P  A QFS+F S     K +  L 
Sbjct: 40  HGMPLGGFGAGCIGRSSRGDFNLWHIDGGEHIFNTFP--ACQFSIFESTGISTK-AYALS 96

Query: 145 PRSPGVPKKNTDSGIESWDW----NLKGENCTYHALFPRAWTVYDGEPDPELRIVCRQIS 200
            + P       D  +++W W    +   E   YHAL+PR+W VY+     +L   C Q S
Sbjct: 97  SQPP------EDGTLQTWQWYPASSTTEETGKYHALYPRSWFVYENVFQTDL--TCEQFS 148

Query: 201 PFIPHNYKESSFPASVFTFTLSNSGQTSADVTLLFTWANSVA-------------GDSG- 246
           P   +NY+E+S+P +VF +   N       ++++ TW N V               D G 
Sbjct: 149 PIWANNYQETSYPVAVFVWKAHNPTNAPITLSIMLTWENMVGWFTNALKSPQVQIRDDGS 208

Query: 247 --------LSGHHFNSKTMTKDGVHGLTLHHRTANGRP----PVTFAVAAEETADVHVSE 294
                   L     N   + ++  H      R A+G P      ++++A  +   V +  
Sbjct: 209 PVYEYQPRLGASQGNYNQIVENTDHFSCFLGRVASGEPVGEGDGSWSIATPKHPQVEI-- 266

Query: 295 CPCFLLSGNSKGITAKDMWNEIKKHGSFDHLDNDKTSPSEPGSSIGAAIAASLTIPSGST 354
              +    +S G T  D+W      GS    +N   +P++  + +GAAIA   T+  G T
Sbjct: 267 --FYHTRWDSVG-TGADIWQSFATDGSL--ANNQDATPADENTRLGAAIAIRFTLQPGET 321

Query: 355 RSVTFSLAWDCPEVKFFEKV-YHRRYTKFYGTLGDSAARIAHDAILEHAKWECEIEAWQR 413
             + F+LAWD P  +F   + Y+RRYT F+G  G++A +IA   +LE+  W   I  WQ 
Sbjct: 322 LEIPFTLAWDFPVTEFSAGINYYRRYTDFFGRNGENAGKIATTGLLEYQNWRSHILTWQN 381

Query: 414 PILEDKRFPEWYPITLFNELYYLNAGGTIWTDGSPPMQSLATIRERKFSLDTPSKIHTPA 473
           PIL+ +  P+W+ + LFNELY L +GGT+W+                            A
Sbjct: 382 PILDREDLPKWFKMALFNELYDLTSGGTLWS----------------------------A 413

Query: 474 SSDTALGTRLLENGEENIGQFLYLEGAEYVMYNTYDVHFYSSFALVMLFPKLELSIQRDF 533
           +S+            + IGQF  LE  +Y  Y + DV  Y SFAL+MLFP+LE S+ R F
Sbjct: 414 ASEI-----------DPIGQFAVLECLDYRWYESLDVRLYGSFALLMLFPELEKSVIRAF 462

Query: 534 AAAVMMHDPGTMKIM-------SDGKWVARKCLGAVPHDIGLDDP--WFEINSYNLFNSS 584
           A A+  H    ++++       +D     RK  GA PHD+G  +   W + N     + +
Sbjct: 463 ARAIP-HSDDYLRVIGYYYTIGADTTTAVRKVAGATPHDLGAPNEHVWEKTNYTCYQDCN 521

Query: 585 RWKDLNSKFVLQVYRDFVATG--DKNFARAVWPSVYIAMAYMEQFDKDGDGMIENEGFPD 642
            WKDL S FVLQVYRDF+ TG  D  F    W ++   + Y++ FD DGDG+ EN G PD
Sbjct: 522 LWKDLGSDFVLQVYRDFIFTGGQDVEFLADCWDAIVQTLDYLKTFDLDGDGIPENSGAPD 581

Query: 643 QTYDAWSANGVSAYCGGLWVAALQAASALANDVGDHASA----------SYFWVRYQKAK 692
           QT+D W   GVSAYCGGLW+AAL++A A+ + + +   A          S +    Q++ 
Sbjct: 582 QTFDDWRLQGVSAYCGGLWLAALESAIAICDILTNRRGAENAEKIRSQKSVYQSWLQQSL 641

Query: 693 AVY-DSLWNGSYFNYDNSDGSSSTSIQADQLAGQWYARACGLLPIADEAKVKKALTKIYD 751
            +Y + LWNG Y+  D+  GS+   + ADQL GQ+YAR  GL  I        AL  +YD
Sbjct: 642 PIYQEKLWNGQYYRLDSESGSN--VVMADQLCGQFYARLLGLPDIVPNDNALSALQTVYD 699

Query: 752 FNVLKVKGGMCGAMNGMQPDGRIDM-SGLQAREIWPGVTYGLAASMIQEEMVDMAFQTAA 810
              +K   G  GA NG++PDG  +        E+W G+ +GLAA +IQ  M + A +   
Sbjct: 700 ACFVKFCNGQFGAANGVRPDGSPENPQATHPLEVWTGINFGLAAFLIQMGMKNEAMRLTQ 759

Query: 811 GVYEVAWSEDGLGYSFQTPESWNNNDEYRSLCYMRPLTIWAM 852
            V +  +     G  F+TPE+  +   +R+  Y+R + IWA+
Sbjct: 760 AVVQQIYDN---GLQFRTPEAITSAGTFRASTYLRAMAIWAI 798


>gi|428200465|ref|YP_007079054.1| putative bile acid beta-glucosidase [Pleurocapsa sp. PCC 7327]
 gi|427977897|gb|AFY75497.1| putative bile acid beta-glucosidase [Pleurocapsa sp. PCC 7327]
          Length = 808

 Score =  410 bits (1053), Expect = e-111,   Method: Compositional matrix adjust.
 Identities = 281/833 (33%), Positives = 401/833 (48%), Gaps = 127/833 (15%)

Query: 87  QGIPLGGIGAGSIGRSYRGEFQRFKLFHGICDDAPVLANQFSVFVSRPNGEKFSSVLCPR 146
            G+PLGG GAG IGRS RG+F  + L  G      + A QFSVF     GE  +  +C  
Sbjct: 41  HGMPLGGFGAGCIGRSSRGDFNLWHLDGGEHVFRSLPACQFSVFEQPEGGEAQAYAMCTE 100

Query: 147 SPGVPKKNTDSGIESWDWNLKGENCTYHALFPRAWTVYDGEPDPELRIVCRQISPFIPHN 206
            P       D  +  W W  K +  TYHAL+PR+W  Y G    EL  VC Q SP    N
Sbjct: 101 PP------EDGTLSRWLWYPKKKG-TYHALYPRSWYEYKGVFQSEL--VCEQFSPICAGN 151

Query: 207 YKESSFPASVFTFTLSNSGQTSADVTLLFTWANSVA-------------GDSGLSGHHFN 253
           Y+E+S+P ++F +T  N       ++++ TW N+V               D G   + + 
Sbjct: 152 YQETSYPVAIFEWTAHNPTDKPITLSIMLTWQNTVGWFSNAIKSPSIQVRDDGSPEYEYQ 211

Query: 254 SKTMTKDG---------------VHGLTLHH-RTANGRPPVTFAVAAEETADVHVSECPC 297
            +     G               ++ + LH      G   + FA     T +        
Sbjct: 212 PRWGDSTGNFNQWIVDNFRVGCLLNRIRLHEDEIGEGEGQICFASITNPTIEAS------ 265

Query: 298 FLLSGNSKGITAKDMWNEIKKHGSFDHLDNDKTSPSEPGSSIGAAIAASLTIPSGSTRSV 357
           +L   N  G    ++W+    +GS    D +  +P+EPG  I AA+A   T+  G TR +
Sbjct: 266 YLGRWNPNG-DGSEVWDCFAANGSLP--DTEDETPAEPGEQIAAAMAIRFTVRPGKTRKI 322

Query: 358 TFSLAWDCPEVKFFEKV-YHRRYTKFYGTLGDSAARIAHDAILEHAKWECEIEAWQRPIL 416
            F LAWD P  +F E V Y RRYT F+G  G +A  I   A+     W+  I+ WQ PIL
Sbjct: 323 PFILAWDLPITEFAEGVKYCRRYTDFFGRTGTNAWAIVRTALKHGDMWKERIKEWQSPIL 382

Query: 417 EDKRFPEWYPITLFNELYYLNAGGTIWTDGSPPMQSLATIRERKFSLDTPSKIHTPASSD 476
           + +  P+W+ + LFNELY L  GGT+WT  +                             
Sbjct: 383 KREDLPDWFKMALFNELYLLADGGTLWTAATE---------------------------- 414

Query: 477 TALGTRLLENGEENIGQFLYLEGAEYVMYNTYDVHFYSSFALVMLFPKLELSIQRDFAAA 536
                      ++ IGQF  LE  +Y  Y + DV  Y SF L+ML+ +L+ ++   FA A
Sbjct: 415 -----------DDPIGQFGVLECMDYRWYESLDVRLYGSFGLMMLWSRLDKAVLEAFARA 463

Query: 537 VMMHDPGTMKIMSDGKWVARKCLGAVPHDIGL--DDPWFEINSYNLFNSSRWKDLNSKFV 594
           +   D     I  +     RK  GA PHD+G   + PW + N  +  + + WKDL S FV
Sbjct: 464 IPTGDDTPRIIGYNKAPAIRKAKGATPHDLGAPNEHPWEKTNYTSYQDCNLWKDLGSDFV 523

Query: 595 LQVYRDFVATGDKN--FARAVWPSVYIAMAYMEQFDKDGDGMIENEGFPDQTYDAWSANG 652
           LQVYRDFV TG K+  F    W S+   +AY+++FD+DGDG+ EN G PDQT+D W   G
Sbjct: 524 LQVYRDFVLTGAKDVAFLWECWQSIVETLAYLKKFDRDGDGIPENSGAPDQTFDDWRLRG 583

Query: 653 VSAYCGGLWVAALQAA-----------------------SALANDVGDHASASYFWVRYQ 689
           +SAYCGGLW+AAL+AA                        +L N +  + +    W+  Q
Sbjct: 584 ISAYCGGLWIAALEAAIAIGKILIENPPSNPDLELENYPQSLQNTIQTYQN----WL--Q 637

Query: 690 KAKAVY-DSLWNGSYFNYDNSDGSSSTSIQADQLAGQWYARACGLLPIADEAKVKKALTK 748
            A+++Y + LWNG Y++ D+  GS    + ADQL GQ+YAR  GL  + +    + AL K
Sbjct: 638 GARSLYHEKLWNGEYYHLDSESGSD--IVMADQLCGQFYARLLGLPDVVERQYAESALKK 695

Query: 749 IYDFNVLKVKGGMCGAMNGMQPDGRI-DMSGLQAREIWPGVTYGLAASMIQEEMVDMAFQ 807
           +Y+   LK + G  GA NG++PDG   D       E+W G+ +GLAA ++Q  M D AF+
Sbjct: 696 VYEACFLKFQDGKFGAANGVKPDGSPEDPKATHPLEVWTGINFGLAAFLLQMGMKDEAFK 755

Query: 808 TAAGVYEVAWSEDGLGYSFQTPESWNNNDEYRSLCYMRPLTIWAMQWALTKPK 860
            A  V    +     G  F+TPE+      +R+  Y+R + IW +   LT+ K
Sbjct: 756 LAETVVRQVYDN---GLQFRTPEAITAVGTFRASHYLRAMAIWGIYGVLTEFK 805


>gi|126657976|ref|ZP_01729128.1| hypothetical protein CY0110_05157 [Cyanothece sp. CCY0110]
 gi|126620614|gb|EAZ91331.1| hypothetical protein CY0110_05157 [Cyanothece sp. CCY0110]
          Length = 804

 Score =  408 bits (1049), Expect = e-111,   Method: Compositional matrix adjust.
 Identities = 271/826 (32%), Positives = 398/826 (48%), Gaps = 114/826 (13%)

Query: 87  QGIPLGGIGAGSIGRSYRGEFQRFKLFHGICDDAPVLANQFSVFVSRPNGEKFSSVLCPR 146
            G+PLGG GAG IGRS RG+F  + L  G      + + QFS+F    +G   +  L   
Sbjct: 41  HGMPLGGFGAGCIGRSPRGDFNLWHLDGGEHIFKSIPSCQFSIFEQLEDGTAQAYALSTE 100

Query: 147 SPGVPKKNTDSGIESWDWNLKGENCTYHALFPRAWTVYDGEPDPELRIVCRQISPFIPHN 206
            P       D+ +  W W  + E  +Y AL+P +W  Y+G    E  I C Q SP IP+N
Sbjct: 101 PP------EDNTLSRWSWYPQ-EKGSYSALYPCSWYQYEGVFQTE--IYCEQFSPIIPNN 151

Query: 207 YKESSFPASVFTFTLSNSGQTSADVTLLFTWANSVA-------------GDSGLSGHHFN 253
           Y+E+S+P  VF +T+ N       ++++ TW N V               D G   + + 
Sbjct: 152 YQETSYPIGVFEWTIKNPTDKPITLSIMLTWQNIVGWFTNAIKSPEITVRDDGSPEYEYQ 211

Query: 254 SKTMTKDG---------------VHGLTLHHRTANGRPPVTFAVAAEETADVHVSECPCF 298
            +     G               ++ L  H +   G   +  A     + +V       +
Sbjct: 212 PRWGHSTGNFNQWIQDNFRVGFILNRLQPHEQVQEGEGQICIASVTNPSVEVF------Y 265

Query: 299 LLSGNSKGITAKDMWNEIKKHGSFDHLDNDKTSPSEPGSSIGAAIAASLTIPSGSTRSVT 358
           L   N  G    ++W+    +GS    D +  +P+EPG  I AA+A   TI  G  + + 
Sbjct: 266 LGKWNPNG-DGSEVWDYFAMNGSLP--DQEDETPAEPGEQIAAAMAIRFTIKPGRVKKIP 322

Query: 359 FSLAWDCPEVKFFEKVY-HRRYTKFYGTLGDSAARIAHDAILEHAKWECEIEAWQRPILE 417
           F LAWD P  +F + +  +RRYT F+G  G +   +   A+     W  ++E W   IL+
Sbjct: 323 FILAWDLPVTEFAQGIQSYRRYTDFFGRNGQNGWAMVRTALKHSDVWRDQVEEWHNSILK 382

Query: 418 DKRFPEWYPITLFNELYYLNAGGTIWTDGSPPMQSLATIRERKFSLDTPSKIHTPASSDT 477
            K  P W+ + LFNELY L  GGTIWT                            A+S+T
Sbjct: 383 QKDLPNWFKMALFNELYLLTDGGTIWT----------------------------AASET 414

Query: 478 ALGTRLLENGEENIGQFLYLEGAEYVMYNTYDVHFYSSFALVMLFPKLELSIQRDFAAAV 537
                      + IGQF  LE  +Y  Y + DV  Y SF L+ML+P+LE ++   FA A+
Sbjct: 415 -----------DPIGQFGVLECMDYRWYESLDVRLYGSFGLLMLWPRLEKAVIEAFARAI 463

Query: 538 MMHDPGTMKIMSDGKWVARKCLGAVPHDIGL--DDPWFEINSYNLFNSSRWKDLNSKFVL 595
              D     I  +     RK  GA PHD+G   + PW + N  +  + + WKDL S FVL
Sbjct: 464 PSSDDTIRIIGYNQAEGIRKAKGATPHDLGAPNEHPWEKTNYTSYQDCNLWKDLGSDFVL 523

Query: 596 QVYRDFVATG--DKNFARAVWPSVYIAMAYMEQFDKDGDGMIENEGFPDQTYDAWSANGV 653
           QVYRDF+ TG  D  F    W +V  A+ Y+++FD D DG+ EN G PDQT+D W   G+
Sbjct: 524 QVYRDFLLTGADDIEFLWECWEAVPEALNYLKEFDLDNDGIPENSGAPDQTFDDWELRGM 583

Query: 654 SAYCGGLWVAALQAASALANDVGDH-----------------ASASYFWVRYQKAKAVYD 696
           SAYCG LW+AAL+AA  +   + ++                      +    Q++++VY 
Sbjct: 584 SAYCGALWIAALEAAIKIGEILLENRPNNPQLEPKNYPECIEKELDKYQNWLQQSRSVYH 643

Query: 697 S-LWNGSYFNYDNSDGSSSTSIQADQLAGQWYARACGLLPIADEAKVKKALTKIYDFNVL 755
           S LWNG Y+  D+  GS    + ADQL+GQ+YAR  GL  + +      AL K+Y+   L
Sbjct: 644 STLWNGEYYKLDSESGSDV--VMADQLSGQFYARLLGLPDVVENQYALSALKKVYEACFL 701

Query: 756 KVKGGMCGAMNGMQPDGRI-DMSGLQAREIWPGVTYGLAASMIQEEMVDMAFQTAAGVYE 814
           K + G  GA NGM+PDG   D +    +E+W G+ +GLA+ +IQ  M D A +    V +
Sbjct: 702 KFQNGKYGAANGMKPDGTPEDPNSTHPQEVWTGINFGLASFLIQMGMKDEALKLTEAVVK 761

Query: 815 VAWSEDGLGYSFQTPESWNNNDEYRSLCYMRPLTIWAMQWALTKPK 860
             + E+GL   F+TPE+      +R+  Y+R + IW + + L+K K
Sbjct: 762 QVY-ENGL--QFRTPEAITAVGTFRACHYLRAMAIWGIYYQLSKTK 804


>gi|416392732|ref|ZP_11685894.1| hypothetical protein CWATWH0003_2700 [Crocosphaera watsonii WH
           0003]
 gi|357263592|gb|EHJ12579.1| hypothetical protein CWATWH0003_2700 [Crocosphaera watsonii WH
           0003]
          Length = 804

 Score =  406 bits (1044), Expect = e-110,   Method: Compositional matrix adjust.
 Identities = 269/823 (32%), Positives = 398/823 (48%), Gaps = 114/823 (13%)

Query: 87  QGIPLGGIGAGSIGRSYRGEFQRFKLFHGICDDAPVLANQFSVFVSRPNGEKFSSVLCPR 146
            G+PLGG GAG IGRS RG+F  + L  G      + + QFS+F    +G   +  L   
Sbjct: 41  HGMPLGGFGAGCIGRSPRGDFNLWHLDGGEHIFKSIPSCQFSIFEQLEDGTAQAYALATE 100

Query: 147 SPGVPKKNTDSGIESWDWNLKGENCTYHALFPRAWTVYDGEPDPELRIVCRQISPFIPHN 206
            P       D  +  W W    E  TY AL+P +W  Y+G    E  I C Q SP IP N
Sbjct: 101 PP------EDKTLSRWSW-YPTEKGTYSALYPCSWYQYEGVFQAE--IYCEQFSPIIPDN 151

Query: 207 YKESSFPASVFTFTLSNSGQTSADVTLLFTWANSVA-------------GDSGLSGHHFN 253
           Y+E+S+P  VF +T+ N       ++++ TW N V               D G   + + 
Sbjct: 152 YQETSYPIGVFEWTVKNPTDKPITLSIMLTWQNIVGWFTNAIKSPEITVRDDGSPEYEYQ 211

Query: 254 SKTMTKDG---------------VHGLTLHHRTANGRPPVTFAVAAEETADVHVSECPCF 298
            +     G               ++ L  H +   G   +  A     + +V       +
Sbjct: 212 PRWGHSTGNFNQWVQDNFRVGFILNRLQPHEQVQEGEGQICIASVTNPSVEVF------Y 265

Query: 299 LLSGNSKGITAKDMWNEIKKHGSFDHLDNDKTSPSEPGSSIGAAIAASLTIPSGSTRSVT 358
           L   N  G    ++W+     GS    D +  +P+EPG  I AA+A   T+  G  + + 
Sbjct: 266 LGKWNPDG-DGSEVWDYFAMDGSLP--DQEDETPAEPGEQIAAAMAIRFTVKPGRVKKIP 322

Query: 359 FSLAWDCPEVKFFEKV-YHRRYTKFYGTLGDSAARIAHDAILEHAKWECEIEAWQRPILE 417
           F LAWD P  +F   V Y+RRYT F+G  G++   +   A+     W  ++E W   IL+
Sbjct: 323 FILAWDLPVTEFTPGVQYYRRYTDFFGRNGNNGWAMVRTALKHSYVWRDKVEEWHNSILK 382

Query: 418 DKRFPEWYPITLFNELYYLNAGGTIWTDGSPPMQSLATIRERKFSLDTPSKIHTPASSDT 477
           +   P+W  + LFNELY L  GGT+WT                            A+S+T
Sbjct: 383 NDNLPDWLKMALFNELYLLTDGGTLWT----------------------------AASET 414

Query: 478 ALGTRLLENGEENIGQFLYLEGAEYVMYNTYDVHFYSSFALVMLFPKLELSIQRDFAAAV 537
                      + +GQF  LE  +Y  Y + DV  Y SF L+ML+P+LE ++   FA A+
Sbjct: 415 -----------DPVGQFGVLECLDYCWYESLDVRLYGSFGLLMLWPRLEKAVMEAFARAI 463

Query: 538 MMHDPGTMKIMSDGKWVARKCLGAVPHDIGL--DDPWFEINSYNLFNSSRWKDLNSKFVL 595
              D     I  +     RK  GA PHD+G   + PW + N  +  + + WKDL S FVL
Sbjct: 464 PSSDDTIRIIGYNQAEGFRKVKGATPHDLGAPNEHPWEKTNYTSYQDCNLWKDLGSDFVL 523

Query: 596 QVYRDFVATG--DKNFARAVWPSVYIAMAYMEQFDKDGDGMIENEGFPDQTYDAWSANGV 653
           QVYRDF+ TG  D +F    W ++   + Y+++FD D DG+ EN G PDQT+D W  +G+
Sbjct: 524 QVYRDFLLTGSDDIDFLWECWAAIPETLNYLKEFDLDNDGIPENSGAPDQTFDDWKLSGI 583

Query: 654 SAYCGGLWVAALQAASALANDVGDHA-------------SASYFWVRY----QKAKAVYD 696
           SAYCGGLW+AAL+AA  +   + +++             S      +Y    Q+++A+Y 
Sbjct: 584 SAYCGGLWIAALEAAIKIGEILLENSPNNPQLEPENYPESIEKELDKYNNWLQQSRAIYH 643

Query: 697 S-LWNGSYFNYDNSDGSSSTSIQADQLAGQWYARACGLLPIADEAKVKKALTKIYDFNVL 755
           S LWNG Y+N D+  GS    + ADQL GQ+YAR  GL  + +      AL K+Y+   L
Sbjct: 644 STLWNGEYYNLDSKSGSD--IVMADQLCGQFYARLLGLSDVVESEYTLSALNKVYEACFL 701

Query: 756 KVKGGMCGAMNGMQPDGRI-DMSGLQAREIWPGVTYGLAASMIQEEMVDMAFQTAAGVYE 814
           K + G  GA NGM+PDG   D +    +E+W G+ +G+A+ +IQ  M + A +    V +
Sbjct: 702 KFQNGKYGAANGMKPDGTPEDPNSTHPQEVWTGINFGIASFLIQMGMKNEALKLTETVVK 761

Query: 815 VAWSEDGLGYSFQTPESWNNNDEYRSLCYMRPLTIWAMQWALT 857
             + E+GL   F+TPE+      YR+  Y+R + IW + + L+
Sbjct: 762 QVY-ENGL--QFRTPEAITAVGTYRACHYLRAMAIWGVYYQLS 801


>gi|147812670|emb|CAN61859.1| hypothetical protein VITISV_016692 [Vitis vinifera]
          Length = 305

 Score =  406 bits (1043), Expect = e-110,   Method: Compositional matrix adjust.
 Identities = 209/322 (64%), Positives = 247/322 (76%), Gaps = 23/322 (7%)

Query: 1   MENGVKEEGEN----AASMPKVDPGQPPQLTWQRKLDSNVKVPTGFTLSFQEFRHLAHIG 56
           MENG KE+G+      +S  KV+PG+P  LTWQRKL++     T F L  +E +HL    
Sbjct: 1   MENGHKEDGDMEHPVKSSAHKVNPGKPASLTWQRKLNTKANTLTRFNLKLREIKHL---- 56

Query: 57  FRLYRYSKQEEAKGKVPVFNVFRKHHITSDQGIPLGGIGAGSIGRSYRGEFQRFKLFHGI 116
                          + + + F K  +TS  G+PLGGIG GSIGRSYRGEFQR++LF  I
Sbjct: 57  ---------------ISIIDPFSKRLVTSYHGVPLGGIGGGSIGRSYRGEFQRYQLFPRI 101

Query: 117 CDDAPVLANQFSVFVSRPNGEKFSSVLCPRSPGVPKKNTDSGIESWDWNLKGENCTYHAL 176
           C+D+PVLANQFSVFVSRPNG+K S+VLCPR+P V K +  SGI SWDWNL GE+CTYHAL
Sbjct: 102 CEDSPVLANQFSVFVSRPNGKKSSTVLCPRNPEVLKGSASSGIGSWDWNLDGESCTYHAL 161

Query: 177 FPRAWTVYDGEPDPELRIVCRQISPFIPHNYKESSFPASVFTFTLSNSGQTSADVTLLFT 236
           +PRAWTVY+GEPDPE+ I+  QISPFIPHNYKESSFP SVF FTLSNSG+TSAD+TLLFT
Sbjct: 162 YPRAWTVYEGEPDPEISIISSQISPFIPHNYKESSFPVSVFKFTLSNSGKTSADITLLFT 221

Query: 237 WANSVAGDSGLSGHHFNSKTMTKDGVHGLTLHHRTANGRPPVTFAVAAEETADVHVSECP 296
           WANSV G S  SGHH+NSK  TKDGVHG+ LHH+TANG PPVTFA+AAEET DVH+SECP
Sbjct: 222 WANSVGGTSEFSGHHYNSKMKTKDGVHGVLLHHKTANGHPPVTFAIAAEETGDVHISECP 281

Query: 297 CFLLSGNSKGITAKDMWNEIKK 318
           CFL+SGNS G+TAK+MW EIK+
Sbjct: 282 CFLISGNSLGVTAKEMWQEIKQ 303


>gi|218437284|ref|YP_002375613.1| hypothetical protein PCC7424_0277 [Cyanothece sp. PCC 7424]
 gi|218170012|gb|ACK68745.1| protein of unknown function DUF608 [Cyanothece sp. PCC 7424]
          Length = 801

 Score =  406 bits (1043), Expect = e-110,   Method: Compositional matrix adjust.
 Identities = 266/818 (32%), Positives = 389/818 (47%), Gaps = 114/818 (13%)

Query: 87  QGIPLGGIGAGSIGRSYRGEFQRFKLFHGICDDAPVLANQFSVFVSRPNGEKFSSVLCPR 146
            G+PLGG GAG IGRS RG+F  + L  G      +   QFS+F     GE  +  LC +
Sbjct: 41  HGMPLGGFGAGCIGRSPRGDFNLWHLDGGEHTFRNLPPCQFSIFEHPEGGEAQAYALCTQ 100

Query: 147 SPGVPKKNTDSGIESWDWNLKGENCTYHALFPRAWTVYDGEPDPELRIVCRQISPFIPHN 206
            P       D  +  W W  + +  TYHAL+PR+W  Y+G    + +I+C Q SP +  N
Sbjct: 101 PP------EDGTLSRWSWYPQKKG-TYHALYPRSWYEYEGVF--QSQIICEQYSPILADN 151

Query: 207 YKESSFPASVFTFTLSNSGQTSADVTLLFTWANSVA-------------GDSGLSGHHFN 253
           Y+ESS+P ++F +T+ N       ++++ +W N+V               D G   + + 
Sbjct: 152 YQESSYPLAIFEWTIYNPSDRPVTISIMMSWQNTVGWFSNAIKTPEVKVRDDGSPVYEYQ 211

Query: 254 SKTMTKDG---------------VHGLTLHHRTANGRPPVTFAVAAEETADVHVSECPCF 298
            +     G               ++ +  H +   G   V  A       +V  +     
Sbjct: 212 PRWGDSTGNFNQWIVDNYRVGCLLNRIQPHPQVEEGEGQVCIASILNPNLEVFYNT---- 267

Query: 299 LLSGNSKGITAKDMWNEIKKHGSFDHLDNDKTSPSEPGSSIGAAIAASLTIPSGSTRSVT 358
               N +G     +W+    +GS     N+  SP+ PG  I AA+A   TIP G TR + 
Sbjct: 268 --RWNPQG-DGSQIWDYFAMNGSLGDYQNE--SPAAPGEQIAAAMAIRFTIPPGKTRKIP 322

Query: 359 FSLAWDCPEVKFFEKV-YHRRYTKFYGTLGDSAARIAHDAILEHAKWECEIEAWQRPILE 417
           F LAWD P  +F + + Y+RRYT F+G  G +   +   A+  H  W   I  WQ PIL+
Sbjct: 323 FILAWDLPITEFAQGINYYRRYTDFFGRNGQNVWSVVRTALKHHEMWRDRIIEWQAPILK 382

Query: 418 DKRFPEWYPITLFNELYYLNAGGTIWTDGSPPMQSLATIRERKFSLDTPSKIHTPASSDT 477
            K  P+W+ + LFNELY L  GGT+WT  +                              
Sbjct: 383 QKELPDWFKMALFNELYLLADGGTLWTAATE----------------------------- 413

Query: 478 ALGTRLLENGEENIGQFLYLEGAEYVMYNTYDVHFYSSFALVMLFPKLELSIQRDFAAAV 537
                     ++ +GQF  LE  +Y  Y + DV  Y SF L ML+P+L+ ++   FA A+
Sbjct: 414 ----------DDPVGQFAVLECIDYRWYESIDVRLYGSFGLGMLWPRLDKAVMEAFARAI 463

Query: 538 MMHDPGTMKIMSDGKWVARKCLGAVPHDIGL--DDPWFEINSYNLFNSSRWKDLNSKFVL 595
              D     I  +G    RK  GA PHD+G   + PW + N  +  + + WKDL   FVL
Sbjct: 464 PKGDDTPRIIGYNGAKAIRKAKGATPHDLGAPNEHPWEKTNYTSYQDCNLWKDLGCDFVL 523

Query: 596 QVYRDFVATG--DKNFARAVWPSVYIAMAYMEQFDKDGDGMIENEGFPDQTYDAWSANGV 653
            +YR F+ TG  +++F    W SV   +AY+++FD DGDG+ EN G PDQT+D W   G+
Sbjct: 524 LLYRSFLWTGANNQDFLWEHWDSVTQTIAYVKKFDLDGDGIPENSGAPDQTFDDWQLRGI 583

Query: 654 SAYCGGLWVAALQAA------------------SALANDVGDHASASYFWVRYQKAKAVY 695
           SAYCG LW+A L+AA                  S  A +  + A A+Y    +Q     +
Sbjct: 584 SAYCGALWIAGLEAAIKIGETLLQRPPMNPQLQSENAPEKIEIAIATYRQWLHQARALYH 643

Query: 696 DSLWNGSYFNYDNSDGSSSTSIQADQLAGQWYARACGLLPIADEAKVKKALTKIYDFNVL 755
           D+LW G Y+  D+  GS    + +DQL GQ+YAR  GL  + +      AL KIY+   L
Sbjct: 644 DTLWTGEYYRLDSESGSD--VVMSDQLCGQFYARLLGLPDVVEPQYTLSALRKIYESCFL 701

Query: 756 KVKGGMCGAMNGMQPDGRIDM-SGLQAREIWPGVTYGLAASMIQEEMVDMAFQTAAGVYE 814
           +  GG  GA NG++PDG  +        E+W G+ +GLAA MIQ  M          V +
Sbjct: 702 QFYGGKYGAANGVRPDGSPENPDATHPLEVWTGINFGLAAFMIQMGMKQEGLSITEAVVK 761

Query: 815 VAWSEDGLGYSFQTPESWNNNDEYRSLCYMRPLTIWAM 852
             + E+GL   F+TPE+      +R+  Y+R + IWA+
Sbjct: 762 QIY-ENGL--QFRTPEAITAAGTFRASHYLRAMAIWAI 796


>gi|307151416|ref|YP_003886800.1| glucosylceramidase [Cyanothece sp. PCC 7822]
 gi|306981644|gb|ADN13525.1| Glucosylceramidase [Cyanothece sp. PCC 7822]
          Length = 805

 Score =  405 bits (1042), Expect = e-110,   Method: Compositional matrix adjust.
 Identities = 267/820 (32%), Positives = 394/820 (48%), Gaps = 118/820 (14%)

Query: 87  QGIPLGGIGAGSIGRSYRGEFQRFKLFHGICDDAPVLANQFSVFVSRPNGEKFSSVLCPR 146
            G+PLGG GAG IGRS +G+F  + L  G      + A QFS+F     GE  +  L   
Sbjct: 41  HGVPLGGFGAGCIGRSPKGDFNLWHLDGGEHIFRTLAACQFSIFEQPEGGEAQAYALSTE 100

Query: 147 SPGVPKKNTDSGIESWDWNLKGENCTYHALFPRAWTVYDGEPDPELRIVCRQISPFIPHN 206
            P       D  +  W W    E  TYHAL+PR+W  Y+G    + +I+C Q SP    N
Sbjct: 101 PP------EDGTLSRWSW-YPTEKGTYHALYPRSWYEYEGVF--QTKIICEQYSPIWAGN 151

Query: 207 YKESSFPASVFTFTLSNSGQTSADVTLLFTWANSVA-------------GDSGLSGHHFN 253
           Y+E+S+P ++F +T+ N       ++++ +W N+V               D G   + + 
Sbjct: 152 YQETSYPVAIFAWTIHNPTDRPITLSIMMSWQNTVGWFSNAIKTPEVKVRDDGSPVYEYQ 211

Query: 254 SKTMTKDG---------------VHGLTLHHRTANGRPPVTFAVAAEETADVHVSECPCF 298
            +     G               ++ +  H +   G   +  A     + +V  +     
Sbjct: 212 PRWGDSTGNYNQWIVDNFRVGCLLNRIQPHPQVQEGEGQLCIASILNPSVEVFYNT---- 267

Query: 299 LLSGNSKGITAKDMWNEIKKHGSFDHLDNDKTSPSEPGSSIGAAIAASLTIPSGSTRSVT 358
               N  G    ++W+    +G     +N+  S + PG  I AA+A   TIP G TR + 
Sbjct: 268 --RWNPMG-DGSEIWDYFAMNGMLGDYENE--SAAAPGEQIAAAMAIRFTIPPGKTRLIP 322

Query: 359 FSLAWDCPEVKFFEKV-YHRRYTKFYGTLGDSAARIAHDAILEHAKWECEIEAWQRPILE 417
           F LAWD P  +F + + Y+RRYT F+G  G +   +   A+     W   I  WQ PIL+
Sbjct: 323 FILAWDLPITEFSQGINYYRRYTDFFGRNGQNVWSVVRTALKNVDIWRDRIIDWQSPILK 382

Query: 418 DKRFPEWYPITLFNELYYLNAGGTIWTDGSPPMQSLATIRERKFSLDTPSKIHTPASSDT 477
           D+  P+W+ + LFNELY L  GGT+WT  +                              
Sbjct: 383 DQDLPDWFKMALFNELYLLADGGTLWTAATE----------------------------- 413

Query: 478 ALGTRLLENGEENIGQFLYLEGAEYVMYNTYDVHFYSSFALVMLFPKLELSIQRDFAAAV 537
                     ++ +GQF  LE  +Y  Y + DV  Y SFAL ML+P+L+ ++   FA A+
Sbjct: 414 ----------DDPVGQFAVLECIDYRWYESLDVRLYGSFALGMLWPRLDKAVMEAFARAI 463

Query: 538 MMHDPGTMKIMSDGKWVARKCLGAVPHDIGL--DDPWFEINSYNLFNSSRWKDLNSKFVL 595
              D     I  +G    RK  GA PHD+G   + PW + N  +  + + WKDL S FVL
Sbjct: 464 PQADDTPRIIGYNGAKAIRKAKGATPHDLGAPNEHPWEKTNYTSYQDCNLWKDLGSDFVL 523

Query: 596 QVYRDFVATGDKN--FARAVWPSVYIAMAYMEQFDKDGDGMIENEGFPDQTYDAWSANGV 653
            +YR F+ TG  N  +    W SV   +AY++ FD DGDG+ EN G PDQT+D W   G+
Sbjct: 524 LLYRSFLWTGGNNQDYLWEHWHSVTETLAYLKAFDLDGDGIPENSGAPDQTFDDWQLRGI 583

Query: 654 SAYCGGLWVAALQAASAL-------------------ANDVGDHASASYFWVRYQKAKAV 694
           SAYCG LW+AAL+AA  +                   A  + +  +    W+  Q+A+++
Sbjct: 584 SAYCGALWIAALEAAIKMGEILLQRPPMNPQLQPQNAAESIEEAITTYRHWL--QQARSL 641

Query: 695 Y-DSLWNGSYFNYDNSDGSSSTSIQADQLAGQWYARACGLLPIADEAKVKKALTKIYDFN 753
           Y D+LWNG Y+  D+  GS    + +DQL GQ+YAR  GL  + +      AL KIY+  
Sbjct: 642 YHDTLWNGEYYRLDSESGSD--VVMSDQLCGQFYARLLGLPDVVEPQYTTSALRKIYEAC 699

Query: 754 VLKVKGGMCGAMNGMQPDGRIDM-SGLQAREIWPGVTYGLAASMIQEEMVDMAFQTAAGV 812
            LK  GG  GA NG++PDG  +        E+W G+ +GLAA MIQ  M + AF+    V
Sbjct: 700 FLKFHGGKYGAANGVKPDGTPENPDATHPLEVWTGINFGLAAFMIQMGMKEEAFKITEAV 759

Query: 813 YEVAWSEDGLGYSFQTPESWNNNDEYRSLCYMRPLTIWAM 852
            +  + E+GL   F+TPE+      +R+  Y+R + IWA+
Sbjct: 760 VKQVY-ENGL--QFRTPEAITAVGTFRASHYLRAMAIWAV 796


>gi|428308994|ref|YP_007119971.1| bile acid beta-glucosidase [Microcoleus sp. PCC 7113]
 gi|428250606|gb|AFZ16565.1| putative bile acid beta-glucosidase [Microcoleus sp. PCC 7113]
          Length = 819

 Score =  405 bits (1042), Expect = e-110,   Method: Compositional matrix adjust.
 Identities = 276/841 (32%), Positives = 395/841 (46%), Gaps = 133/841 (15%)

Query: 88  GIPLGGIGAGSIGRSYRGEFQRFKLFHGICDDAPVLANQFSVFVSRPNGEKFSSVLCPRS 147
           G+PLGG GAG IGRS RG+F  + L  G      + A QFSVF    N  K +  LC   
Sbjct: 42  GMPLGGFGAGCIGRSPRGDFNLWHLDGGEHSFKSLPACQFSVFEQSENAPKQAYALCTEP 101

Query: 148 PGVPKKNTDSGIESWDW-----NLKGENCTYHALFPRAWTVYDGEPDPELRIVCRQISPF 202
           P       D  + +W W       KG   TY AL+P++W  Y+     EL   C Q SP 
Sbjct: 102 P------QDGTLGTWQWYPTSQQAKGNTGTYQALYPQSWFTYENVFQTEL--TCEQFSPI 153

Query: 203 IPHNYKESSFPASVFTFTLSNSGQTSADVTLLFTWANSVA-------------GDSGLSG 249
               Y+E+S+P +VF +T  N       ++++ TW N V               D G   
Sbjct: 154 WAGCYQETSYPVAVFEWTAHNPTDAPITLSIMLTWQNIVGWFTNAIKSPQVKVRDDGSPV 213

Query: 250 HHFNSKTMTKDGVHG---------------LTLHHRTA-----------NGRPPVTFAVA 283
           + +  +     G +                + LH   A            G   + FA  
Sbjct: 214 YEYQPRWGESTGSYNQWIVDNFRVGCILDRVRLHDSEAVADFRGASEIQEGEGQLCFATV 273

Query: 284 AEETADV--HVSECPCFLLSGNSKGITAKDMWNEIKKHGSFDHLDNDKTSPSEPGSSIGA 341
              + +V  H    P    +GN       ++W      GS   + ++  + + PG  I A
Sbjct: 274 TNPSMEVFYHTRWNP----AGN-----GAEVWQSFAADGSLSDMQDE--TAAAPGEQIAA 322

Query: 342 AIAASLTIPSGSTRSVTFSLAWDCPEVKFFEKV-YHRRYTKFYGTLGDSAARIAHDAILE 400
           AIA   TI  G TR + F LAWD P  +F   V Y+RRYT F+G  G++A  I   A+  
Sbjct: 323 AIAIRFTIRPGRTRKIPFILAWDFPVTEFSPGVNYYRRYTDFFGRTGNNAWSIVRTALKH 382

Query: 401 HAKWECEIEAWQRPILEDKRFPEWYPITLFNELYYLNAGGTIWTDGSPPMQSLATIRERK 460
              W+ +I++WQ PIL+ +  P W+ + LFNELY L  GGT+WT                
Sbjct: 383 ADLWKEKIQSWQEPILKREDLPHWFKMALFNELYLLADGGTLWT---------------- 426

Query: 461 FSLDTPSKIHTPASSDTALGTRLLENGEENIGQFLYLEGAEYVMYNTYDVHFYSSFALVM 520
                       A+ D            + IGQF  LE  +Y  Y + DV  Y SFAL M
Sbjct: 427 ------------AADD-----------RDPIGQFGVLECLDYRWYESLDVRLYGSFALAM 463

Query: 521 LFPKLELSIQRDFAAAVMMHDPGTMKIMSDGKWVARKCLGAVPHDIGL--DDPWFEINSY 578
           L+P+L+ ++   FA A+ + D     I  +     RK   A PHD+G   + PW + N  
Sbjct: 464 LWPRLDKAVLEAFARAIPIGDKTPRIIGYNQAQAIRKAADATPHDLGAPNEHPWEQTNYT 523

Query: 579 NLFNSSRWKDLNSKFVLQVYRDFVATG--DKNFARAVWPSVYIAMAYMEQFDKDGDGMIE 636
           +  + ++WKDL   FVLQVYRDFV TG  D  F    W SV   +AY++ FDKDGDG+ E
Sbjct: 524 SYQDCNQWKDLPCDFVLQVYRDFVFTGANDTEFLWECWSSVVKTLAYLKTFDKDGDGIPE 583

Query: 637 NEGFPDQTYDAWSANGVSAYCGGLWVAALQAASALANDVGDHASASYFWVR--------- 687
           N G PDQT+D W   G+SAYCGGLW+AAL+AA A+   + +H   +   +          
Sbjct: 584 NSGAPDQTFDDWRLQGISAYCGGLWLAALEAAIAMGEILINHYPPTSELIEVPDPESIQE 643

Query: 688 --------YQKAKAVY-DSLWNGSYFNYDNSDGSSSTSIQADQLAGQWYARACGLLPIAD 738
                    ++++ +Y + LWNG Y+  D+  GS    +  DQL GQ+YA+  GL  I  
Sbjct: 644 TLKLYRQWLEQSQPIYQEKLWNGQYYRLDSESGSD--IVMTDQLCGQFYAQLLGLPDIVP 701

Query: 739 EAKVKKALTKIYDFNVLKVKGGMCGAMNGMQPDGRIDM-SGLQAREIWPGVTYGLAASMI 797
              V+ AL  +YD   LK +    GA NG++PDG  +        E+W G+ +GLAA ++
Sbjct: 702 PECVQSALKSVYDSCFLKFQDAQLGAANGVRPDGSPENPDATHPLEVWTGINFGLAAFLV 761

Query: 798 QEEMVDMAFQTAAGVYEVAWSEDGLGYSFQTPESWNNNDEYRSLCYMRPLTIWAMQWALT 857
           Q+ M   A Q    V    + E+GL   F+TPE+      +R+  Y+R + IWA+ + + 
Sbjct: 762 QQGMKQEALQLTEAVVRQVY-ENGL--QFRTPEAITAAGTFRASHYLRAMAIWAVYFMIN 818

Query: 858 K 858
           +
Sbjct: 819 Q 819


>gi|298490121|ref|YP_003720298.1| glucosylceramidase ['Nostoc azollae' 0708]
 gi|298232039|gb|ADI63175.1| Glucosylceramidase ['Nostoc azollae' 0708]
          Length = 806

 Score =  405 bits (1040), Expect = e-110,   Method: Compositional matrix adjust.
 Identities = 270/822 (32%), Positives = 403/822 (49%), Gaps = 116/822 (14%)

Query: 87  QGIPLGGIGAGSIGRSYRGEFQRFKLFHGICDDAPVLANQFSVFVSRPNGEKFSSVLCPR 146
            G+PLGG GAG IGRS RG+F  + +  G        + QFSVF S  +  K  ++    
Sbjct: 40  HGMPLGGFGAGCIGRSSRGDFNLWHIDGGEHLFKSFPSCQFSVFESTGSSSKAYAL---- 95

Query: 147 SPGVPKKNTDSGIESWDW----NLKGENCTYHALFPRAWTVYDGEPDPELRIVCRQISPF 202
           S  +P+   D  ++SW W           TYHAL+PR+W VY  E   + ++ C Q SP 
Sbjct: 96  STEIPE---DGTLQSWQWYPPSTTTQATGTYHALYPRSWFVY--ENVLQTQLTCEQFSPI 150

Query: 203 IPHNYKESSFPASVFTFTLSNSGQTSADVTLLFTWANSVA-------------GDSGLSG 249
             +NY+E+S+P +VF +   N       ++++ TW N V               D G   
Sbjct: 151 WANNYQETSYPVAVFVWKAHNPTNAPITLSIMLTWENIVGWFTNALKSPDLQIRDDGSPV 210

Query: 250 HHFNSKTMTKDGVHGLTLHH---------RTANGRPPV----TFAVAAEETADVHVSECP 296
           + +  +     G +   + +         R   G P      ++ +A  +   + V    
Sbjct: 211 YEYQPRLGESKGNYNHIVENTEFFGCFLGRVPTGLPIQEGDGSWCIATPKNPRMEVFYHT 270

Query: 297 CFLLSGNSKGITAKDMWNEIKKHGSFDHLDNDKTSPSEPGSSIGAAIAASLTIPSGSTRS 356
            F    N  G +  D+W      G   +  N   +P++  + +GAA+A   T+  G T  
Sbjct: 271 HF----NPVG-SGADLWESFSTDGYLPNYLN--ATPADADTRLGAALAVRCTLQPGETLE 323

Query: 357 VTFSLAWDCPEVKFFEKV-YHRRYTKFYGTLGDSAARIAHDAILEHAKWECEIEAWQRPI 415
           + F+LAWD P  +F E + Y+RRYT F+   GD+A  I    + ++  W  +I+ WQ+PI
Sbjct: 324 IPFTLAWDFPVTEFAEGINYYRRYTDFFSRYGDNAFAITSRGLEQYQVWRAQIQNWQKPI 383

Query: 416 LEDKRFPEWYPITLFNELYYLNAGGTIWTDGSPPMQSLATIRERKFSLDTPSKIHTPASS 475
           L+ +  PEW+ + LFNELY L +GGT+W+                            A+S
Sbjct: 384 LDREDLPEWFKMALFNELYDLTSGGTLWS----------------------------AAS 415

Query: 476 DTALGTRLLENGEENIGQFLYLEGAEYVMYNTYDVHFYSSFALVMLFPKLELSIQRDFAA 535
           +            + IGQF  LE  +Y  Y + DV  Y SFAL++LFP+LE S+ R FA 
Sbjct: 416 EI-----------DPIGQFAVLECLDYRWYESLDVRLYGSFALLILFPELEKSVIRAFAR 464

Query: 536 AVMMHDPGT------MKIMSDGKWVARKCLGAVPHDIGLDDPW-FEINSYNLF-NSSRWK 587
           A+   D           I ++ K   RK   A PHD+G  + W +E  +Y  + + + WK
Sbjct: 465 AIPHSDDHLRVIGYYYTIGAETKKAIRKVAKATPHDLGAPNEWVWEKTNYTCYQDCNLWK 524

Query: 588 DLNSKFVLQVYRDFVATG--DKNFARAVWPSVYIAMAYMEQFDKDGDGMIENEGFPDQTY 645
           DL+S FVLQVYRDF+ TG  D  F    W ++   + Y++ FD DGDG+ EN G PDQT+
Sbjct: 525 DLSSDFVLQVYRDFLFTGSHDVQFLVDCWDAIVQTLDYLKTFDLDGDGIPENSGAPDQTF 584

Query: 646 DAWSANGVSAYCGGLWVAALQAASALANDVGDH-------------ASASYFWVRYQKAK 692
           D W  NGVSAYCGGLW+AAL++A A+A+ + +H             +  S +    Q+++
Sbjct: 585 DDWLLNGVSAYCGGLWLAALESAIAIADILTNHRGTENTEKTDKLRSQKSIYQSWLQQSR 644

Query: 693 AVYDS-LWNGSYFNYDNSDGSSSTSIQADQLAGQWYARACGLLPIADEAKVKKALTKIYD 751
            +Y+  LWNG Y+  D+  GS    + ADQL GQ+Y R  GL  I        AL  +Y+
Sbjct: 645 PIYEQKLWNGQYYRLDSESGSD--VVMADQLCGQFYTRLLGLPDIVPSNHAHSALQTVYE 702

Query: 752 FNVLKVKGGMCGAMNGMQPDGRIDM-SGLQAREIWPGVTYGLAASMIQEEMVDMAFQTAA 810
              +K   G  GA NG+ PDG  +        E+W G+ +GLAA ++Q  M + A Q   
Sbjct: 703 ACFVKFYDGKFGAANGVLPDGSPENPKATHPLEVWTGINFGLAAFLVQMGMKNEAMQLTQ 762

Query: 811 GVYEVAWSEDGLGYSFQTPESWNNNDEYRSLCYMRPLTIWAM 852
            V E  +     G  F+TPE+      +R+  Y+RP+ IWA+
Sbjct: 763 AVVEQIYDN---GLQFRTPEAITPVGTFRASTYLRPMAIWAI 801


>gi|284929778|ref|YP_003422300.1| putative bile acid beta-glucosidase [cyanobacterium UCYN-A]
 gi|284810222|gb|ADB95919.1| predicted bile acid beta-glucosidase [cyanobacterium UCYN-A]
          Length = 803

 Score =  404 bits (1039), Expect = e-109,   Method: Compositional matrix adjust.
 Identities = 270/827 (32%), Positives = 404/827 (48%), Gaps = 120/827 (14%)

Query: 87  QGIPLGGIGAGSIGRSYRGEFQRFKLFHGICDDAPVLANQFSVFVSRPNGEKFSSVLCPR 146
            G+P+GG GAG IGRS  G+F  + L  G      + + QFS+F    +G+  +  L  +
Sbjct: 41  HGMPIGGFGAGCIGRSSAGDFNLWHLDGGEHVFKTISSCQFSIFEELKDGKTKAYALSTK 100

Query: 147 SPGVPKKNTDSGIESWDWNLKGENCTYHALFPRAWTVYDGEPDPELRIVCRQISPFIPHN 206
            P       D  +  W+W  K +  TY+AL+PR+W  Y+G    E +IVC Q SP IP N
Sbjct: 101 EP------EDKTLSKWNWYPKNKG-TYYALYPRSWYQYEGIF--EAKIVCEQFSPIIPDN 151

Query: 207 YKESSFPASVFTFTLSNSGQTSADVTLLFTWANSV------------------------A 242
           Y+E+S+P  +F +T+SN       ++++ TW N +                        +
Sbjct: 152 YQETSYPIGIFEWTVSNPTDQPITLSIMLTWQNMIGWFTNSIKTPKIEVRDDGSPEYEYS 211

Query: 243 GDSGLSGHHFNS--KTMTKDG--VHGLTLHHRTANGRPPVTFAVAAEETADVHVSECPCF 298
              G S  +FN   ++ ++ G  +  L ++   + G   + FA A +   D+  S    +
Sbjct: 212 PQLGFSQGNFNKWKQSSSQFGFILDRLRVNDSISEGEGQICFATAHD--PDLEFSYLGKW 269

Query: 299 LLSGNSKGITAKDMWNEIKKHGSF-DHLDNDKTSPSEPGSSIGAAIAASLTIPSGSTRSV 357
             SGN       ++W+     G+  + LD     P E    I AA+A    I  G  + +
Sbjct: 270 NPSGN-----GSEVWDHFSASGTLPNTLDETIAKPDE---QIAAAMAVKFVIKPGEVKKI 321

Query: 358 TFSLAWDCPEVKF-FEKVYHRRYTKFYGTLGDSAARIAHDAILEHAKWECEIEAWQRPIL 416
            F+LAWD P  +F     YHRRYT F+G  G +A  IA   +     W  +I  +Q PIL
Sbjct: 322 PFTLAWDLPITEFSLGNQYHRRYTDFFGINGKNAWMIAETGLKSLNIWREQILEYQLPIL 381

Query: 417 EDKRFPEWYPITLFNELYYLNAGGTIWTDGSPPMQSLATIRERKFSLDTPSKIHTPASSD 476
            DK  P+W+ + LFNELY L  GGT+WT                    T SK +      
Sbjct: 382 NDKDLPDWFKMALFNELYLLTDGGTLWT--------------------TSSKSYP----- 416

Query: 477 TALGTRLLENGEENIGQFLYLEGAEYVMYNTYDVHFYSSFALVMLFPKLELSIQRDFAAA 536
                         +G+F  LE  +Y  Y + DV  Y SF L+ L+P LE ++ + FA +
Sbjct: 417 --------------VGKFGVLECIDYRWYESLDVRLYGSFGLLALWPSLEKAVIKTFAES 462

Query: 537 VMMHDPGTMKIMSDGKWVARKCLGAVPHDIGL--DDPWFEINSYNLFNSSRWKDLNSKFV 594
           +   D     I  D     RK  GAVPHD+G   + PW E N  +  + + WKDL S FV
Sbjct: 463 ISKSDHTLRLIGYDRSLAPRKVKGAVPHDLGAPNEHPWEETNYTSYQDCNLWKDLGSDFV 522

Query: 595 LQVYRDFVATG--DKNFARAVWPSVYIAMAYMEQFDKDGDGMIENEGFPDQTYDAWSANG 652
           LQVYRDF+ T   D NF +  WP++   + Y+++FD D DG+ EN G PDQT+D W  +G
Sbjct: 523 LQVYRDFLLTDSQDINFLQECWPAIVETLDYLKKFDLDDDGIPENSGAPDQTFDDWQLHG 582

Query: 653 VSAYCGGLWVAALQAASALA---------NDVGDHASASYF----------WVRYQKAKA 693
           +S+YCG LW+ AL+AA  +          N   D  ++  F          W++  K++ 
Sbjct: 583 ISSYCGALWITALEAAIGIGKILLKNSENNSHPDLQNSLAFIQENLTKYDDWLK--KSRL 640

Query: 694 VY-DSLWNGSYFNYDNSDGSSSTSIQADQLAGQWYARACGLLPIADEAKVKKALTKIYDF 752
           +Y ++LWNG Y+  D+  GS   +I  DQL GQ+YAR   L  + D +    AL+KIY+ 
Sbjct: 641 IYHETLWNGEYYQLDSKSGSD--TIMTDQLCGQFYARLLNLPDVVDTSCTISALSKIYEA 698

Query: 753 NVLKVKGGMCGAMNGMQPDGRIDM-SGLQAREIWPGVTYGLAASMIQEEMVDMAFQTAAG 811
             L    G  GA NG++ DG  +  +    +E+W G+ +GLAA +IQ  M + A +    
Sbjct: 699 CFLNFYDGQYGAANGVRSDGTPENPNSTHPQEVWVGINFGLAAFLIQMGMKNQALELTKV 758

Query: 812 VYEVAWSEDGLGYSFQTPESWNNNDEYRSLCYMRPLTIWAMQWALTK 858
           V E  ++    G  F+TPE+      +R+  Y+R + IW + + L K
Sbjct: 759 VVEQIYAN---GLQFRTPEAITPVGTFRASHYLRAMAIWGVYYQLKK 802


>gi|67922703|ref|ZP_00516206.1| Protein of unknown function DUF608 [Crocosphaera watsonii WH 8501]
 gi|67855413|gb|EAM50669.1| Protein of unknown function DUF608 [Crocosphaera watsonii WH 8501]
          Length = 804

 Score =  404 bits (1037), Expect = e-109,   Method: Compositional matrix adjust.
 Identities = 268/823 (32%), Positives = 396/823 (48%), Gaps = 114/823 (13%)

Query: 87  QGIPLGGIGAGSIGRSYRGEFQRFKLFHGICDDAPVLANQFSVFVSRPNGEKFSSVLCPR 146
            G+PLGG GAG IGRS RG+F  + L  G      + + QFS+F    +G   +  L   
Sbjct: 41  HGMPLGGFGAGCIGRSPRGDFNLWHLDGGEHIFKSIPSCQFSIFEQLEDGTAQAYALATE 100

Query: 147 SPGVPKKNTDSGIESWDWNLKGENCTYHALFPRAWTVYDGEPDPELRIVCRQISPFIPHN 206
            P       D  +  W W    E  TY AL+P +W  Y+G    E  I C Q SP IP N
Sbjct: 101 PP------EDKTLSRWSW-YPTEKGTYSALYPCSWYQYEGVFQAE--IYCEQFSPIIPDN 151

Query: 207 YKESSFPASVFTFTLSNSGQTSADVTLLFTWANSVA-------------GDSGLSGHHFN 253
           Y+E+S+P  VF +T+ N       ++++ TW N V               D G   + + 
Sbjct: 152 YQETSYPIGVFEWTVKNPTDKPITLSIMLTWQNIVGWFTNAIKSPEITVRDDGSPEYEYQ 211

Query: 254 SKTMTKDG---------------VHGLTLHHRTANGRPPVTFAVAAEETADVHVSECPCF 298
            +     G               ++ L  H +   G   +  A     + +V       F
Sbjct: 212 PRWGHSTGNFNQWVQDNFRVGFILNRLQPHEQVQEGEGQICIASVTNPSVEV-------F 264

Query: 299 LLSGNSKGITAKDMWNEIKKHGSFDHLDNDKTSPSEPGSSIGAAIAASLTIPSGSTRSVT 358
            L   +      ++W+     GS    D +  +P+EPG  I AA+A   T+  G  + + 
Sbjct: 265 NLGKWNPDGDGSEVWDYFAMDGSLP--DQEDETPAEPGEQIAAAMAIRFTVKPGRVKKIP 322

Query: 359 FSLAWDCPEVKFFEKV-YHRRYTKFYGTLGDSAARIAHDAILEHAKWECEIEAWQRPILE 417
           F LAWD P  +F   V Y+RRYT F+G  G++   +   A+     W  ++E W   IL+
Sbjct: 323 FILAWDLPVTEFTPGVQYYRRYTDFFGRNGNNGWAMVRTALKHSDVWRDKVEEWHNSILK 382

Query: 418 DKRFPEWYPITLFNELYYLNAGGTIWTDGSPPMQSLATIRERKFSLDTPSKIHTPASSDT 477
           +   P+W  + LFNELY L  GGT+WT                            A+S+T
Sbjct: 383 NDNLPDWLKMALFNELYLLTDGGTLWT----------------------------AASET 414

Query: 478 ALGTRLLENGEENIGQFLYLEGAEYVMYNTYDVHFYSSFALVMLFPKLELSIQRDFAAAV 537
                      + +GQF  LE  +Y  Y + DV  Y SF L+ML+P LE ++   FA A+
Sbjct: 415 -----------DPVGQFGVLECLDYCWYESLDVRLYGSFGLLMLWPGLEKAVMEAFARAI 463

Query: 538 MMHDPGTMKIMSDGKWVARKCLGAVPHDIGL--DDPWFEINSYNLFNSSRWKDLNSKFVL 595
              D     I  +     RK  GA PHD+G   + PW + N  +  + + WKDL S FVL
Sbjct: 464 PSSDDTIRIIGYNQAEGFRKVKGATPHDLGAPNEHPWEKTNYTSYQDCNLWKDLGSDFVL 523

Query: 596 QVYRDFVATG--DKNFARAVWPSVYIAMAYMEQFDKDGDGMIENEGFPDQTYDAWSANGV 653
           QVYRDF+ TG  D +F    W ++   + Y+++FD D DG+ EN G PDQT+D W  +G+
Sbjct: 524 QVYRDFLLTGSDDIDFLWECWAAIPETLNYLKEFDLDNDGIPENSGAPDQTFDDWKLSGI 583

Query: 654 SAYCGGLWVAALQAASALANDVGDHA-------------SASYFWVRY----QKAKAVYD 696
           SAYCGGLW+AAL+AA  +   + +++             S      +Y    Q+++A+Y 
Sbjct: 584 SAYCGGLWIAALEAAIKIGEILLENSPNNPQLEPENYPESIEKELDKYNNWLQQSRAIYH 643

Query: 697 S-LWNGSYFNYDNSDGSSSTSIQADQLAGQWYARACGLLPIADEAKVKKALTKIYDFNVL 755
           S LWNG Y+N D+  GS    + ADQL GQ+YAR  GL  + +      AL K+Y+   L
Sbjct: 644 STLWNGEYYNLDSKSGSD--IVMADQLCGQFYARLLGLSDVVESEYTLSALNKVYEACFL 701

Query: 756 KVKGGMCGAMNGMQPDGRI-DMSGLQAREIWPGVTYGLAASMIQEEMVDMAFQTAAGVYE 814
           K + G  GA NGM+PDG   D +    +E+W G+ +G+A+ +IQ  M + A +    V +
Sbjct: 702 KFQNGKYGAANGMKPDGTPEDPNSTHPQEVWTGINFGIASFLIQMGMKNEALKLTETVVK 761

Query: 815 VAWSEDGLGYSFQTPESWNNNDEYRSLCYMRPLTIWAMQWALT 857
             + E+GL   F+TPE+      YR+  Y+R + IW + + L+
Sbjct: 762 QVY-ENGL--QFRTPEAITAVGTYRACHYLRAMAIWGVYYQLS 801


>gi|428209901|ref|YP_007094254.1| glucosylceramidase [Chroococcidiopsis thermalis PCC 7203]
 gi|428011822|gb|AFY90385.1| Glucosylceramidase [Chroococcidiopsis thermalis PCC 7203]
          Length = 812

 Score =  403 bits (1036), Expect = e-109,   Method: Compositional matrix adjust.
 Identities = 271/828 (32%), Positives = 399/828 (48%), Gaps = 125/828 (15%)

Query: 87  QGIPLGGIGAGSIGRSYRGEFQRFKLFHGICDDAPVLANQFSVF--VSRPNGEKFSSVLC 144
            G+PLGG GAG IGRS RG+F  +++  G     P+ A QFSVF  VS  +  + +  LC
Sbjct: 41  HGMPLGGFGAGCIGRSSRGDFNVWQVDGGEHIFQPMPACQFSVFESVSERSASRQAYALC 100

Query: 145 PRSPGVPKKNTDSGIESWDWN-LKGEN----CTYHALFPRAWTVYDGEPDPELRIVCRQI 199
             +P       D G+++W W   +GE      TYHAL+PR+W VY+G    +L   C Q 
Sbjct: 101 TEAP-------DDGLQNWQWYPSRGEQHAHTGTYHALYPRSWFVYEGVFQSQL--TCEQF 151

Query: 200 SPFIPHNYKESSFPASVFTFTLSNSGQTSADVTLLFTWANSVA-------------GDSG 246
           SP   +NY+ESS+P  VF +T  N       ++++ +W N                 D G
Sbjct: 152 SPIWANNYQESSYPVGVFVWTAHNPTNAPMTLSIMLSWQNMTGWFTNASKSSEVKIRDDG 211

Query: 247 LSGHHFNSKTMTKDGVHGLTLHHRTANGRPPVTFAVAAEETADVHVSECPCFLLSGNSKG 306
              + + S+     G     +          V   +  + + DV   +    + +    G
Sbjct: 212 SPVYEYKSRWGESQGNFNRLVESEY------VGCVLGRDNSEDVEEGDGQWCIATAKQPG 265

Query: 307 I------------TAKDMWNEIKKHGSFDHLDNDKTSPSEPGSSIGAAIAASLTIPSGST 354
           +            T +D+W    + G+  ++ ++  +P+  G  I  AIA   T+  G T
Sbjct: 266 VEVFHHTRWNPVGTGEDIWTSFAQSGALPNIQDE--TPAIEGEQIAGAIAIRFTLQPGET 323

Query: 355 RSVTFSLAWDCPEVKFFEKV-YHRRYTKFYGTLGDSAARIAHDAILEHAKWECEIEAWQR 413
           R++ F L WD P  +F   V Y+RR+T F+   G +A  IA  A+  +  W+ +I+AWQ+
Sbjct: 324 RTIPFVLTWDFPVTEFAAGVKYYRRHTDFFDRTGKNAWAIARTALQAYQSWQSQIQAWQQ 383

Query: 414 PILEDKRFPEWYPITLFNELYYLNAGGTIWTDGSPPMQSLATIRERKFSLDTPSKIHTPA 473
           PIL+    P W+ + LFNELY L +GGT+W        S AT R+               
Sbjct: 384 PILDKTDLPNWFKMALFNELYDLTSGGTLW--------SAATERDP-------------- 421

Query: 474 SSDTALGTRLLENGEENIGQFLYLEGAEYVMYNTYDVHFYSSFALVMLFPKLELSIQRDF 533
                            IGQF  LE  +Y  Y + DV  Y SFAL++LFP+LE ++ R F
Sbjct: 422 -----------------IGQFAILECIDYRWYESLDVRLYGSFALLILFPELEKAVMRAF 464

Query: 534 AAAVMMHDPGT------MKIMSDGKWVARKCLGAVPHDIGLDDP--WFEINSYNLFNSSR 585
           A A+   D         + I ++     RK  GA PHD+G  +   W + N     + + 
Sbjct: 465 ARAIPASDNHRRLIGYYLTIGAEMPLAPRKVAGATPHDLGAPNEHVWEKTNYTAYQDCNL 524

Query: 586 WKDLNSKFVLQVYRDFVATG--DKNFARAVWPSVYIAMAYMEQFDKDGDGMIENEGFPDQ 643
           WKDL S FVLQVYRD++ TG  D  F  + W ++  A+ Y++ FDKDGDG+ EN G PDQ
Sbjct: 525 WKDLPSDFVLQVYRDYLLTGATDVEFLASCWEAIVQALNYLKTFDKDGDGIPENSGAPDQ 584

Query: 644 TYDAWSANGVSAYCGGLW-----------------VAALQAASALANDVGDHASASYFWV 686
           T+D W   GVSAYCGGLW                 V  L   S     + +       W+
Sbjct: 585 TFDDWKLQGVSAYCGGLWLAALEAAIAIGKILLNNVERLHVTSLHVTSLQETIDTYQSWL 644

Query: 687 RYQKAKAVY-DSLWNGSYFNYDNSDGSSSTSIQADQLAGQWYARACGLLPIADEAKVKKA 745
              +++ +Y + LWNG Y+  D++ GS    + ADQL GQ+YAR   L  I  +     A
Sbjct: 645 --SRSRPIYQEKLWNGQYYRLDSNSGSD--VVMADQLCGQFYARLLKLPDIVPQECALSA 700

Query: 746 LTKIYDFNVLKVKGGMCGAMNGMQPDGRIDM-SGLQAREIWPGVTYGLAASMIQEEMVDM 804
           L  IYD   LK   G  GA NG++P+G  +  +     E+W G+ +GLAA ++Q  M   
Sbjct: 701 LKTIYDACFLKFHQGKFGAANGLKPNGSPENPNATHPLEVWTGINFGLAAFLLQMGMKAE 760

Query: 805 AFQTAAGVYEVAWSEDGLGYSFQTPESWNNNDEYRSLCYMRPLTIWAM 852
           A Q    V    + E+GL   F+TPE+  +   +R+  Y+R + IWA+
Sbjct: 761 ALQLTETVVNQIY-ENGL--QFRTPEAITSVGTFRASYYLRAMAIWAI 805


>gi|225593155|gb|ACN96073.1| pedicted bile acid beta-glucosidase [Fischerella sp. MV11]
          Length = 812

 Score =  403 bits (1035), Expect = e-109,   Method: Compositional matrix adjust.
 Identities = 279/824 (33%), Positives = 403/824 (48%), Gaps = 114/824 (13%)

Query: 87  QGIPLGGIGAGSIGRSYRGEFQRFKLFHGICDDAPVLANQFSVFVSRPNGEKFSSVLCPR 146
            G+PLGG GAG IGRS RG+F  + +  G      +   QFSVF     G + +  LC +
Sbjct: 40  HGMPLGGFGAGCIGRSSRGDFNLWHIDGGEHVFKNIPGCQFSVFERTEAGTQ-AYALCTQ 98

Query: 147 SPGVPKKNTDSGIESWDW-----NLKGENCTYHALFPRAWTVYDGEPDPELRIVCRQISP 201
            P       D  + +W W     N +G   TYHAL+PR+W VY  E   + ++ C Q SP
Sbjct: 99  PP------EDGSLAAWQWYPTIKNEEGITGTYHALYPRSWFVY--EQVYKTQLTCEQFSP 150

Query: 202 FIPHNYKESSFPASVFTFTLSNSGQTSADVTLLFTWANSVA-------------GDSGLS 248
              +NY+E+S+P ++F +   N    S  ++++ TW N V               D G  
Sbjct: 151 LWANNYQETSYPVAIFVWKAHNPTNASIALSIMLTWENMVGWFTNALKSPEVRMRDDGSP 210

Query: 249 GHHFNSKT-MTKDGVHGL---TLHHRTANGRPPVTFAVAAEE----TADVHVSECPCFLL 300
            + +  +   +KD  + L   + H     GR  V   +   E     A     +   F  
Sbjct: 211 VYEYQPRLGESKDNFNQLVENSEHIGCILGRVGVDQPIQEGEGQWCIATRKHPKLEIFYH 270

Query: 301 SGNSKGITAKDMWNEIKKHGSF-DHLDNDKTSPSEPGSSIGAAIAASLTIPSGSTRSVTF 359
           S      T +D+W    + GS  ++LD+   + +  G  I AA+A   T+  G T  + F
Sbjct: 271 SRWQPRGTGEDVWQSFAQDGSLPNYLDH---ASAVEGEQIAAALAVRFTLQPGETIEIPF 327

Query: 360 SLAWDCPEVKFFEKV-YHRRYTKFYGTLGDSAARIAHDAILEHAKWECEIEAWQRPILED 418
            L WD P  +F   + Y+RRYT F+G  G +A  IA  A+ ++  W+ +I+AWQ+PIL+ 
Sbjct: 328 VLTWDFPVTEFAAGINYYRRYTDFFGNDGKNAWAIALIALKQYQAWQAQIQAWQKPILDR 387

Query: 419 KRFPEWYPITLFNELYYLNAGGTIWTDGSPPMQSLATIRERKFSLDTPSKIHTPASSDTA 478
              P+W+ + LFNELY L+AGGT+W        S AT R+                    
Sbjct: 388 ADLPDWFKMALFNELYDLSAGGTLW--------SAATERD-------------------- 419

Query: 479 LGTRLLENGEENIGQFLYLEGAEYVMYNTYDVHFYSSFALVMLFPKLELSIQRDFAAAVM 538
                       IGQF  LE  +Y  Y + DV  Y SFAL+MLFP+LE ++ R FA A+ 
Sbjct: 420 -----------PIGQFAVLECLDYRWYESLDVRLYGSFALLMLFPELEKAVMRAFARAIP 468

Query: 539 MHDPGTMKIM------SDGKWVARKCLGAVPHDIGLDDP--WFEINSYNLFNSSRWKDLN 590
           + D  T  I       ++  +  RK  GA PHD+G  +   W + N  +  + + WKDL 
Sbjct: 469 VSDSTTRVIGYYYTVGAESPFAIRKVAGATPHDLGAPNEHVWEKTNYTSYQDCNLWKDLG 528

Query: 591 SKFVLQVYRDFVATG--DKNFARAVWPSVYIAMAYMEQFDKDGDGMIENEGFPDQTYDAW 648
             FVLQVYRDF+ TG  D  F    W ++   + Y++ FD D DG+ EN G PDQT+D W
Sbjct: 529 CDFVLQVYRDFLFTGADDITFLAECWNAIVQTLDYLKTFDIDNDGIPENSGAPDQTFDDW 588

Query: 649 SANGVSAYCGGLWVAA-----------------LQAASALANDVGDHA-SASYFWVRYQK 690
              GVSAYCGGLW+AA                  + A     D+GD A   S +    Q+
Sbjct: 589 RLQGVSAYCGGLWLAALEAAIAICDVLITNRRDTENAEKFLGDLGDLAVQKSIYETWLQQ 648

Query: 691 AKAVYDS-LWNGSYFNYDNSDGSSSTSIQADQLAGQWYARACGLLPIADEAKVKKALTKI 749
           ++ +Y++ LWNG Y+  D+  GS    + ADQL GQ+YAR  GL  I    +   AL  +
Sbjct: 649 SRPIYEAKLWNGQYYRLDSESGSD--VVMADQLCGQFYARLLGLPDIVKRDRALSALKTV 706

Query: 750 YDFNVLKVKGGMCGAMNGMQPDGRIDM-SGLQAREIWPGVTYGLAASMIQEEMVDMAFQT 808
           Y+   LK   G  GA NG++PDG  +        E+W G+ +GLAA M+Q  M D A + 
Sbjct: 707 YNACFLKFHHGQFGAANGLRPDGSPENPQATHPLEVWTGINFGLAAFMVQMGMKDEAMKL 766

Query: 809 AAGVYEVAWSEDGLGYSFQTPESWNNNDEYRSLCYMRPLTIWAM 852
              V    + E+GL   F+TPE+      +R+  Y+R + IWA+
Sbjct: 767 TEAVVRQVY-ENGL--QFRTPEAITAVGTFRASTYLRAMAIWAI 807


>gi|119493962|ref|ZP_01624522.1| hypothetical protein L8106_04106 [Lyngbya sp. PCC 8106]
 gi|119452283|gb|EAW33479.1| hypothetical protein L8106_04106 [Lyngbya sp. PCC 8106]
          Length = 799

 Score =  403 bits (1035), Expect = e-109,   Method: Compositional matrix adjust.
 Identities = 271/817 (33%), Positives = 388/817 (47%), Gaps = 115/817 (14%)

Query: 87  QGIPLGGIGAGSIGRSYRGEFQRFKLFHGICDDAPVLANQFSVFVSRPNGEKFSSVLCPR 146
            G+PLGG GAG IGRS RG+F  + L  G      + A QFS+F    NG+  +  LC  
Sbjct: 41  HGMPLGGFGAGCIGRSSRGDFNLWHLDGGEHIFRSLPACQFSIF-EEVNGKTQTYALCTE 99

Query: 147 SPGVPKKNTDSGIESWDWNLKGENC----TYHALFPRAWTVYDGEPDPELRIVCRQISPF 202
            P       D  + +W+W     N     TYHAL+PR+W VY+     +L   C Q SP 
Sbjct: 100 PP------EDESLSAWNWYPASSNSQKTGTYHALYPRSWFVYENVFKAQLS--CEQFSPV 151

Query: 203 IPHNYKESSFPASVFTFTLSNSGQTSADVTLLFTWANSVA-------------GDSGLSG 249
             +NY+E+S+P +VF +   N       +++L TW N++               D G   
Sbjct: 152 WANNYQETSYPIAVFEWIAHNPTDEPITLSILLTWQNTIGWFTNALEQPLVQVRDDGSPI 211

Query: 250 HHFNSKTMTKDG---------------VHGLTLHHRTANGRPPVTFAVAAEETADVHVSE 294
           + +  +     G               +  L+       G   +  A       +V+   
Sbjct: 212 YEYQPRWGNSVGNFNHLVEDFHRIGCLMSRLSASEEVREGEGQMAIATITNPAVEVY--- 268

Query: 295 CPCFLLSGNSKGITAKDMWNEIKKHGSFDHLDNDKTSPSEPGSSIGAAIAASLTIPSGST 354
              +  + N+ G T +++WN   + G+   LD +   P+  G  IG A+A   TI  G T
Sbjct: 269 ---YHTAWNTVG-TGEEIWNYFSQDGTL--LDEESEHPATEGRQIGVALAVKFTIRPGKT 322

Query: 355 RSVTFSLAWDCPEVKFFEKV-YHRRYTKFYGTLGDSAARIAHDAILEHAKWECEIEAWQR 413
           R + F L+WD P  +F   V Y+RRYT F+G  G +A  +   A+     W   I+AWQ 
Sbjct: 323 RKIPFLLSWDLPVTEFESGVGYYRRYTDFFGRNGKNAWSMIRTAMKHSDVWLDNIKAWQN 382

Query: 414 PILEDKRFPEWYPITLFNELYYLNAGGTIWTDGSPPMQSLATIRERKFSLDTPSKIHTPA 473
           PIL  +  P W+ + LFNELY L  GGT+W+                             
Sbjct: 383 PILNRQDLPSWFKMALFNELYLLTDGGTLWSAAD-------------------------- 416

Query: 474 SSDTALGTRLLENGEENIGQFLYLEGAEYVMYNTYDVHFYSSFALVMLFPKLELSIQRDF 533
                      EN  +  GQF  LE  +Y  Y + DV  Y SF L++L+P+L+ S+   F
Sbjct: 417 -----------ENNPK--GQFAVLECLDYRWYESLDVRLYGSFGLLLLWPELDKSVLLAF 463

Query: 534 AAAVMMHDPGTMKIMSDGKWVARKCLGAVPHDIGL--DDPWFEINSYNLFNSSRWKDLNS 591
           A ++   D     I  +     RK  GA PHD+G   + PW + N  +  + + WKDL  
Sbjct: 464 ARSISKGDKTPRIIGYNQAEAIRKATGATPHDLGAPNEHPWEKTNYTSYQDCNLWKDLPC 523

Query: 592 KFVLQVYRDFVATG--DKNFARAVWPSVYIAMAYMEQFDKDGDGMIENEGFPDQTYDAWS 649
            FVLQVYRDFV TG  D  F +  W ++   + Y++ FDKD DG+ EN G PDQT+D W 
Sbjct: 524 DFVLQVYRDFVLTGSEDYEFLQDCWLAIVETLDYLKIFDKDEDGIPENSGAPDQTFDDWK 583

Query: 650 ANGVSAYCGGLWVAALQAASALAN-------------DVGDHASASYFWVRYQKAKAVYD 696
             GVSAYCGGLW+AAL+AA  +                + D  S    W+  +KAK +Y 
Sbjct: 584 LQGVSAYCGGLWLAALEAAIEIGKCLQKTEFSDNVSPSISDKISTYETWL--EKAKPIYQ 641

Query: 697 SLWNGSYFNYDNSDGSSSTSIQADQLAGQWYARACGLLPIADEAKVKKALTKIYDFNVLK 756
           +LWNG YF  D+  GS    + ADQL GQ+YA+  GL  I    K   AL  IY      
Sbjct: 642 NLWNGQYFQLDSQSGSD--VVMADQLCGQFYAKLLGLEDIVSPEKTISALQTIYQSCFQN 699

Query: 757 VKGGMCGAMNGMQPDGR-IDMSGLQAREIWPGVTYGLAASMIQEEMVDMAFQTAAGVYEV 815
              G  GA NG++ +G  I+ +     E+W G+ +GLAA +IQ  M +  F+ A  V + 
Sbjct: 700 FYQGQFGAANGVRINGEPINPNDTHPLEVWTGINFGLAAFLIQMGMKEEGFKLAEVVVQQ 759

Query: 816 AWSEDGLGYSFQTPESWNNNDEYRSLCYMRPLTIWAM 852
            + E+GL   F+TPE+      +R+  Y+R + IWA+
Sbjct: 760 IY-ENGL--QFRTPEAITAAGTFRASHYLRAMAIWAI 793


>gi|443320525|ref|ZP_21049620.1| putative bile acid beta-glucosidase [Gloeocapsa sp. PCC 73106]
 gi|442789771|gb|ELR99409.1| putative bile acid beta-glucosidase [Gloeocapsa sp. PCC 73106]
          Length = 806

 Score =  402 bits (1034), Expect = e-109,   Method: Compositional matrix adjust.
 Identities = 267/822 (32%), Positives = 388/822 (47%), Gaps = 114/822 (13%)

Query: 87  QGIPLGGIGAGSIGRSYRGEFQRFKLFHGICDDAPVLANQFSVFVSRPNGEKFSSVLCPR 146
            G+PLGG GAG +GRS RG+F  +    G      + A QFSVF     GE  +  +C +
Sbjct: 41  HGMPLGGFGAGCVGRSPRGDFNLWHFDGGEHIFRSLDACQFSVFEHPEQGEVKAYGMCTK 100

Query: 147 SPGVPKKNTDSGIESWDWNLKGENCTYHALFPRAWTVYDGEPDPELRIVCRQISPFIPHN 206
            P       D  +  W W  + +  TYHAL+PR+W  Y G    EL  VC Q SP    N
Sbjct: 101 PP------EDGTLSRWRWYPEDQG-TYHALYPRSWFEYQGVFATEL--VCTQFSPIWAKN 151

Query: 207 YKESSFPASVFTFTLSNSGQTSADVTLLFTWANSVA-------------GDSGLSGHHF- 252
           Y+ESS+P ++F +T  N       ++++ TW N V               D G   + + 
Sbjct: 152 YQESSYPLAIFEWTAYNPTDQPITLSIMLTWQNVVGWFTNAIKSPEIRIRDDGSPVYQYQ 211

Query: 253 --------NSKTMTKDG------VHGLTLHHRTANGRPPVTFAVAAEETADVHVSECPCF 298
                   N     +DG       + +  H     G      A     + +V       +
Sbjct: 212 PRWGDSTGNYNQWIQDGSRVGCLFNRIQPHEELQEGEGQFCIASILNPSLEVF------Y 265

Query: 299 LLSGNSKGITAKDMWNEIKKHGSFDHLDNDKTSPSEPGSSIGAAIAASLTIPSGSTRSVT 358
               + +G    D+W++  + GS    D    +P+EPG  I  AIA   T+  G TR + 
Sbjct: 266 QTRWHPRG-DGSDVWDDFARDGSLG--DRQDETPAEPGEQIAGAIAVRFTVRPGKTRLIP 322

Query: 359 FSLAWDCPEVKFFEKV-YHRRYTKFYGTLGDSAARIAHDAILEHAKWECEIEAWQRPILE 417
           F L WD P ++F   V Y+RRYT F+G  G +A  +   A+     W+ +I+ WQ P+++
Sbjct: 323 FVLTWDLPILEFAPGVNYYRRYTDFFGRNGQNAWSMVRTALKHGDMWQDKIDQWQSPVVQ 382

Query: 418 DKRFPEWYPITLFNELYYLNAGGTIWTDGSPPMQSLATIRERKFSLDTPSKIHTPASSDT 477
              +P W  + LFNELY L  GGT+WT  +                              
Sbjct: 383 RSDYPTWLKMALFNELYLLTQGGTLWTAAT------------------------------ 412

Query: 478 ALGTRLLENGEENIGQFLYLEGAEYVMYNTYDVHFYSSFALVMLFPKLELSIQRDFAAAV 537
                  EN  + IGQF  LE  +Y  Y + DV  Y SFAL ML+P+L+ +I   FA A+
Sbjct: 413 -------EN--DPIGQFAVLECIDYRWYESLDVRLYGSFALAMLWPRLDKAILEAFARAI 463

Query: 538 MMHDPGTMKIMSDGKWVARKCLGAVPHDIGL--DDPWFEINSYNLFNSSRWKDLNSKFVL 595
              D     I  +     RK  GA PHD+G   + PW + N  +  + + WKDL+S FVL
Sbjct: 464 PQSDESMRVIGYNQAEAQRKIAGATPHDLGAPNEHPWEKTNYTSYQDCNLWKDLSSDFVL 523

Query: 596 QVYRDFVATG--DKNFARAVWPSVYIAMAYMEQFDKDGDGMIENEGFPDQTYDAWSANGV 653
           QVYRDFV TG  D +F    WPS+   + Y++ FD DGDG+ EN G PDQT+D W   G+
Sbjct: 524 QVYRDFVLTGKEDTDFLWECWPSITQTLNYLKSFDLDGDGIPENAGAPDQTFDDWKLEGI 583

Query: 654 SAYCGGLWVAALQAASALANDVGDH-----------------ASASYFWVRYQKAKAVY- 695
           SAYCGGLW+AAL+A+ A+   + D+                 A+          A+++Y 
Sbjct: 584 SAYCGGLWIAALEASIAIGAVLLDNPPMNPQLQPSDYPEGIEATLKTHQSWLTLARSIYH 643

Query: 696 DSLWNGSYFNYDNSDGSSSTSIQADQLAGQWYARACGLLPIADEAKVKKALTKIYDFNVL 755
             LWNG Y++ D   G  S  + ADQLAGQ+YA+  GL  + ++  V+  L  IY    L
Sbjct: 644 QQLWNGQYYSLDTKSG--SVVVMADQLAGQFYAQLLGLPDVVEKEYVQSTLKTIYQSCFL 701

Query: 756 KVKGGMCGAMNGMQPDGR-IDMSGLQAREIWPGVTYGLAASMIQEEMVDMAFQTAAGVYE 814
               G  GA NG++ DG   +       E+W G+ + L A +I   M D   + A  + +
Sbjct: 702 NFHQGKIGAANGVKLDGTGENPQSTHPLEVWTGINFALGAFLIHSGMRDEGLKLAETIVK 761

Query: 815 VAWSEDGLGYSFQTPESWNNNDEYRSLCYMRPLTIWAMQWAL 856
             + +   G  F+TPE+   +  +R+  Y+R + IWA+ + L
Sbjct: 762 QIYDQ---GLQFRTPEALTTSGTFRASHYLRAMAIWAIDYTL 800


>gi|354566897|ref|ZP_08986068.1| Glucosylceramidase [Fischerella sp. JSC-11]
 gi|353544556|gb|EHC14010.1| Glucosylceramidase [Fischerella sp. JSC-11]
          Length = 812

 Score =  402 bits (1033), Expect = e-109,   Method: Compositional matrix adjust.
 Identities = 285/824 (34%), Positives = 405/824 (49%), Gaps = 114/824 (13%)

Query: 87  QGIPLGGIGAGSIGRSYRGEFQRFKLFHGICDDAPVLANQFSVFVSRPNGEKFSSVLCPR 146
            G+PLGG GAG IGRS RG+F  + +  G      +   QFSVF     G + +  LC +
Sbjct: 40  HGMPLGGFGAGCIGRSSRGDFNLWHIDGGEHVFKNIPGCQFSVFERTGAGTQ-AYALCTQ 98

Query: 147 SPGVPKKNTDSGIESWDW-----NLKGENCTYHALFPRAWTVYDGEPDPELRIVCRQISP 201
            P       D  + +W W     N +G   TYHAL+PR+W VY  E   + ++ C Q SP
Sbjct: 99  PP------EDGSLAAWQWYPTIKNEEGITGTYHALYPRSWFVY--EQVYKTQLTCEQFSP 150

Query: 202 FIPHNYKESSFPASVFTFTLSNSGQTSADVTLLFTWANSVA-------------GDSGLS 248
              +NY+E+S+P ++F +   N    S  ++++ TW N V               D G  
Sbjct: 151 IWANNYQETSYPVAIFVWKAHNPTNASIALSIMLTWENMVGWFTNALKSPEVKMRDDGSP 210

Query: 249 GHHFNSKT-MTKDGVHGL---TLHHRTANGRPPVTFAVAAEE----TADVHVSECPCFLL 300
            + +      +KD  + L   + H     GR  V   +   E     A     +   F  
Sbjct: 211 VYEYQPPLGESKDNFNQLVENSEHIGCVLGRVGVDQPIQEGEGQWCIATRKHPKLEIFYH 270

Query: 301 SGNSKGITAKDMWNEIKKHGSF-DHLDNDKTSPSEPGSSIGAAIAASLTIPSGSTRSVTF 359
           S      T +D+W    + GS  ++LD+  TS  E G  I AA+A   T+  G T  + F
Sbjct: 271 SRWQPRGTGEDVWQSFAQDGSLPNYLDH--TSAVE-GEQIAAALAVRFTLQPGETLEIPF 327

Query: 360 SLAWDCPEVKFFEKV-YHRRYTKFYGTLGDSAARIAHDAILEHAKWECEIEAWQRPILED 418
            L WD P  +F   + Y+RRYT F+G  G +A  IA  A+ ++  W+ +I+AWQ+PILE 
Sbjct: 328 VLTWDFPVTEFAAGINYYRRYTDFFGNDGKNAWAIALIALKQYQAWQAQIQAWQKPILER 387

Query: 419 KRFPEWYPITLFNELYYLNAGGTIWTDGSPPMQSLATIRERKFSLDTPSKIHTPASSDTA 478
              P+W+ + LFNELY L+AGGT+W        S AT R+                    
Sbjct: 388 ADLPDWFKMALFNELYDLSAGGTLW--------SAATERD-------------------- 419

Query: 479 LGTRLLENGEENIGQFLYLEGAEYVMYNTYDVHFYSSFALVMLFPKLELSIQRDFAAAVM 538
                       IGQF  LE  +Y  Y + DV  Y SFAL+MLFP+LE ++ R FA A+ 
Sbjct: 420 -----------PIGQFAVLECLDYRWYESLDVRLYGSFALLMLFPELEKAVMRAFARAIP 468

Query: 539 MHDPGTMKIM------SDGKWVARKCLGAVPHDIGLDDP--WFEINSYNLFNSSRWKDLN 590
           + D  T  I       ++  +  RK  GA PHD+G  +   W + N  +  + + WKDL 
Sbjct: 469 VSDQTTRVIGYYYTVGAESPFAIRKVAGATPHDLGAPNEHVWEKTNYTSYQDCNLWKDLG 528

Query: 591 SKFVLQVYRDFVATG--DKNFARAVWPSVYIAMAYMEQFDKDGDGMIENEGFPDQTYDAW 648
             FVLQVYRDF+ TG  D  F    W ++   + Y++ FD D DG+ EN G PDQT+D W
Sbjct: 529 CDFVLQVYRDFLFTGADDITFLAECWNAIVETLDYLKTFDIDNDGIPENSGAPDQTFDDW 588

Query: 649 SANGVSAYCGGLWVAALQA-----------------ASALANDVGDHA-SASYFWVRYQK 690
              GVSAYCGGLW+AAL+A                 A     D+GD A   S +    Q+
Sbjct: 589 RLQGVSAYCGGLWLAALEAAIAICDVLITNRRDTDNAENFLGDLGDLAVQKSIYETWLQQ 648

Query: 691 AKAVYDS-LWNGSYFNYDNSDGSSSTSIQADQLAGQWYARACGLLPIADEAKVKKALTKI 749
           ++ +Y++ LWNG Y+  D+  GS    + ADQL GQ+YAR  GL  I    +   AL  +
Sbjct: 649 SRPIYEAKLWNGQYYRLDSESGSD--VVMADQLCGQFYARLLGLPDIVKRDRALSALKTV 706

Query: 750 YDFNVLKVKGGMCGAMNGMQPDGRIDM-SGLQAREIWPGVTYGLAASMIQEEMVDMAFQT 808
           Y+   LK   G  GA NG++PDG  +        E+W G+ +GLAA M+Q  M D A + 
Sbjct: 707 YNACFLKFHHGQFGAANGLRPDGSPENPQATHPLEVWTGINFGLAAFMVQMGMKDEAMKL 766

Query: 809 AAGVYEVAWSEDGLGYSFQTPESWNNNDEYRSLCYMRPLTIWAM 852
              V    + E+GL   F+TPE+      +R+  Y+R + IWA+
Sbjct: 767 TETVVRQIY-ENGL--QFRTPEAITAVGTFRASTYLRAMAIWAI 807


>gi|282900195|ref|ZP_06308151.1| protein of unknown function DUF608 [Cylindrospermopsis raciborskii
           CS-505]
 gi|281194944|gb|EFA69885.1| protein of unknown function DUF608 [Cylindrospermopsis raciborskii
           CS-505]
          Length = 789

 Score =  401 bits (1030), Expect = e-109,   Method: Compositional matrix adjust.
 Identities = 274/814 (33%), Positives = 402/814 (49%), Gaps = 115/814 (14%)

Query: 87  QGIPLGGIGAGSIGRSYRGEFQRFKLFHGICDDAPVLANQFSVFVSRPNGEKFSSVLCPR 146
            G+PLGG GAG IGRS RG+F  + +  G      V   QFS+F S   G+ ++    P 
Sbjct: 38  HGMPLGGFGAGCIGRSSRGDFNLWHIDGGEHVFKNVPGCQFSIFTS---GQAYALSTQP- 93

Query: 147 SPGVPKKNTDSGIESWDWNLKGENCTYHALFPRAWTVYDGEPDPELRIVCRQISPFIPHN 206
               P+ NT   +++W W    +  TYHAL+PR+W VY  E    L+  C Q SP   HN
Sbjct: 94  ----PEDNT---LQTWQWYPNTDGGTYHALYPRSWFVY--ENVFPLQFTCEQFSPVWAHN 144

Query: 207 YKESSFPASVFTFTLSNSGQTSADVTLLFTWANSVA-------------GDSGLSGHHFN 253
           Y+E+S+P +VF + + N        +++ +W N                 D G   + + 
Sbjct: 145 YRETSYPVAVFLWNIHNPTNQPLTASIMLSWENMTGWFTNALKSPQVKIRDDGSPVYEYQ 204

Query: 254 SKTMTKDG--------------VHGLTLHHRTANGRPPVTFAVAAEETADVHVSECPCFL 299
            +     G              V G    H    G    T+ +A  +   V +     F 
Sbjct: 205 PRWGESQGNYNWLAENDQYLGCVLGRATDHPIQEGDG--TWCIATVKNPQVELFYHCQFN 262

Query: 300 LSGNSKGITAKDMWNEIKKHGSFDHLDNDKTSPSEPGSSIGAAIAASLTIPSGSTRSVTF 359
             GN      +++W +  ++GS  +  ++  +P++  S +GAA+A   T+P G T +V F
Sbjct: 263 PLGN-----GEEIWRDFSQNGSLSNYQDE--TPADVNSRLGAAVAVRFTLPPGETLTVPF 315

Query: 360 SLAWDCPEVKFFEKV-YHRRYTKFYGTLGDSAARIAHDAILEHAKWECEIEAWQRPILED 418
            L+WD P  +F   V Y+RRYT F+GT GD+A +IA  A+ E   W   IE+WQ+PILE 
Sbjct: 316 VLSWDFPVTEFAAGVNYYRRYTDFFGTTGDNAWQIATCALKECYNWRSHIESWQKPILER 375

Query: 419 KRFPEWYPITLFNELYYLNAGGTIWTDGSPPMQSLATIRERKFSLDTPSKIHTPASSDTA 478
           +  P W+ + LFNELY L +GGT+W+ G+                               
Sbjct: 376 EDLPSWFKMALFNELYDLTSGGTLWSAGTE------------------------------ 405

Query: 479 LGTRLLENGEENIGQFLYLEGAEYVMYNTYDVHFYSSFALVMLFPKLELSIQRDFAAAVM 538
                    ++ +GQF  LE  +Y  Y + DV  Y SF L++LFP+LE ++ R FA A+ 
Sbjct: 406 ---------KDPMGQFAVLECLDYRWYESLDVRLYGSFGLLILFPELEKAVIRAFARAIP 456

Query: 539 MHDPGT------MKIMSDGKWVARKCLGAVPHDIGLDDPW-FEINSYNLF-NSSRWKDLN 590
             D  +      + I S      RK  GA PHD+G  + W +E  +Y  + + + WKDL 
Sbjct: 457 QSDDRSRIIGYYLTINSPSPMALRKVAGATPHDLGAPNEWVWEQTNYTSYQDCNLWKDLG 516

Query: 591 SKFVLQVYRDFVATG--DKNFARAVWPSVYIAMAYMEQFDKDGDGMIENEGFPDQTYDAW 648
             FVLQVYRDF+ TG  D  F    W  + + + Y++QFD DGDG+ EN G PDQT+D W
Sbjct: 517 CDFVLQVYRDFLFTGANDIEFLVDCWHGMVLTLDYLKQFDMDGDGIPENSGAPDQTFDDW 576

Query: 649 SANGVSAYCGGLWVAALQAASALAN---------DVGDHASASYFWVRYQKAKAVYDSLW 699
             +GVSAYCG LW+AAL+AA A+ +         +VG+  S    W+  Q        LW
Sbjct: 577 RLSGVSAYCGALWLAALEAAIAICDVLINRPELPNVGEQKSIYEHWLN-QSLPVYQQKLW 635

Query: 700 NGSYFNYDNSDGSSSTSIQADQLAGQWYARACGLLPIADEAKVKKALTKIYDFNVLKVKG 759
           NG ++  D+  GS+   + ADQL+GQ+YAR   L  I  + +   ALT IYD   LK + 
Sbjct: 636 NGKFYRLDSESGSN--VVMADQLSGQFYARLLNLPDIVPKDRALSALTTIYDACFLKFQD 693

Query: 760 GMCGAMNGMQPDGRIDM-SGLQAREIWPGVTYGLAASMIQEEMVDMAFQTAAGVYEVAWS 818
           G  GA NG+  DG  +        E+W G+ +GLAA ++Q  M +   +    V    + 
Sbjct: 694 GKFGAANGVLLDGSPENPQATHPLEVWTGINFGLAAFLLQMGMKEQGLRLTEAVVRQVYE 753

Query: 819 EDGLGYSFQTPESWNNNDEYRSLCYMRPLTIWAM 852
               G  F+TPE+      +R+  Y+RP+ IWA+
Sbjct: 754 H---GLQFRTPEAITAAGTFRASTYLRPMAIWAV 784


>gi|332711224|ref|ZP_08431157.1| putative bile acid beta-glucosidase [Moorea producens 3L]
 gi|332350038|gb|EGJ29645.1| putative bile acid beta-glucosidase [Moorea producens 3L]
          Length = 815

 Score =  400 bits (1028), Expect = e-108,   Method: Compositional matrix adjust.
 Identities = 272/820 (33%), Positives = 383/820 (46%), Gaps = 108/820 (13%)

Query: 87  QGIPLGGIGAGSIGRSYRGEFQRFKLFHGICDDAPVLANQFSVFVSRPNGEKFSSVLCPR 146
            G+PLGG GAG IGRS RG+F  + L  G      + A QFSVF    +    +  LC +
Sbjct: 41  HGMPLGGFGAGCIGRSPRGDFNLWHLDGGEHIFKSLPACQFSVFEQPEHSLAQAYALCTQ 100

Query: 147 SPGVPKKNTDSGIESWDW---------NLKGENCTYHALFPRAWTVYDGEPDPELRIVCR 197
            P       D  + SW+W         N    +  Y+AL+PR+W  Y      EL   C 
Sbjct: 101 PP------EDGKLGSWEWYPTQGKSEENSSEVSGHYYALYPRSWFTYQNVFQTEL--TCE 152

Query: 198 QISPFIPHNYKESSFPASVFTFTLSNSGQTSADVTLLFTWANSVA-------------GD 244
           Q SP     Y+ESS+P +VF +T  N   T   V+++ TW N V               D
Sbjct: 153 QFSPIWAGCYQESSYPVAVFEWTAHNPTDTPITVSIMLTWQNMVGWFTNGIKNPEVRVRD 212

Query: 245 SGLSGHHFNSK--TMTKDGVHGLTLHHRTANGRPPVTFAVAAEE------TADVHVSECP 296
            G   + +  +    T +    +  H+R       V      +E       A V      
Sbjct: 213 DGSPVYEYQPRWGDSTGNLNQWIVDHYRVGFVLDRVRLGDDIKEGEGQLCLATVTNPSME 272

Query: 297 CFLLSGNSKGITAKDMWNEIKKHGSFDHLDNDKTSPSEPGSSIGAAIAASLTIPSGSTRS 356
            F     +      ++W+     GS   +D+   +P+ PG  I  AIA   T+  G TR 
Sbjct: 273 VFYHGRWNPVGDGAEVWDTFATDGSL--VDHQDETPAAPGEQIACAIAIRFTLRPGKTRK 330

Query: 357 VTFSLAWDCPEVKFFEKV-YHRRYTKFYGTLGDSAARIAHDAILEHAKWECEIEAWQRPI 415
           + F LAWD P  +F   + Y+RRYT F+G  G++   +   A+  H  W  +IE+WQ PI
Sbjct: 331 IPFILAWDFPVTEFKLGINYYRRYTDFFGRNGNNGWPMVRTALKHHDLWREQIESWQEPI 390

Query: 416 LEDKRFPEWYPITLFNELYYLNAGGTIWTDGSPPMQSLATIRERKFSLDTPSKIHTPASS 475
           L  +  P W+ + LFNELY L  GGT+WT  +               LD           
Sbjct: 391 LNQEDLPNWFKMALFNELYLLTDGGTLWTAAT--------------ELDP---------- 426

Query: 476 DTALGTRLLENGEENIGQFLYLEGAEYVMYNTYDVHFYSSFALVMLFPKLELSIQRDFAA 535
                          IGQF  LE  +Y  Y + DV  Y SFAL ML+PKL+ ++   FA 
Sbjct: 427 ---------------IGQFGVLECLDYRWYESLDVRLYGSFALAMLWPKLDKAVLEAFAR 471

Query: 536 AVMMHDPGTMKIMSDGKWVARKCLGAVPHDIGL--DDPWFEINSYNLFNSSRWKDLNSKF 593
           A+   D     I  +     RK   A PHD+G   + PW + N  +  + ++WKDL S F
Sbjct: 472 AIPTSDETLRIIGYNQAQAVRKIADATPHDLGAPNEHPWQQTNYTSYQDCNQWKDLASDF 531

Query: 594 VLQVYRDFV--ATGDKNFARAVWPSVYIAMAYMEQFDKDGDGMIENEGFPDQTYDAWSAN 651
           VLQVYRDFV  +  D  F    W S+  A+AY+++FD+DGDG+ EN G PDQT+D W   
Sbjct: 532 VLQVYRDFVFYSATDIEFLWECWSSIVKALAYLKRFDQDGDGIPENYGAPDQTFDDWRLQ 591

Query: 652 GVSAYCGGLWVAALQAASALANDVGDHASASYFWVRYQKAKAVYDS-------------- 697
           GVSAYCGGLW+AAL++A A+   +  +   +  +V   +  ++ D+              
Sbjct: 592 GVSAYCGGLWLAALESAIAIGEILTKNYPPTSPFVEVPELDSIQDTIDCYRQWLAQAMPI 651

Query: 698 ----LWNGSYFNYDNSDGSSSTSIQADQLAGQWYARACGLLPIADEAKVKKALTKIYDFN 753
               LWNG Y+  D+  GS    + ADQL GQ+YA+  GL  I        AL  +YD  
Sbjct: 652 YQEKLWNGQYYQLDSESGSD--VVMADQLCGQFYAQLLGLPDIVPPECTLSALKTVYDAC 709

Query: 754 VLKVKGGMCGAMNGMQPDGRI-DMSGLQAREIWPGVTYGLAASMIQEEMVDMAFQTAAGV 812
            LK   G  GA+NG++PDG   D       E+W G+ +GLAA +IQ+ M D A +    V
Sbjct: 710 FLKFHQGQLGAVNGVRPDGTPEDPDATHPMEVWTGINFGLAAFLIQQGMKDEALRITEAV 769

Query: 813 YEVAWSEDGLGYSFQTPESWNNNDEYRSLCYMRPLTIWAM 852
               +     G  F+TPE+      +R+  Y+R + IWA+
Sbjct: 770 VRQVYDH---GLQFRTPEAITAAGTFRASHYLRAMGIWAV 806


>gi|16330256|ref|NP_440984.1| hypothetical protein sll1775 [Synechocystis sp. PCC 6803]
 gi|383321997|ref|YP_005382850.1| hypothetical protein SYNGTI_1088 [Synechocystis sp. PCC 6803
           substr. GT-I]
 gi|383325166|ref|YP_005386019.1| hypothetical protein SYNPCCP_1087 [Synechocystis sp. PCC 6803
           substr. PCC-P]
 gi|383491050|ref|YP_005408726.1| hypothetical protein SYNPCCN_1087 [Synechocystis sp. PCC 6803
           substr. PCC-N]
 gi|384436317|ref|YP_005651041.1| hypothetical protein SYNGTS_1088 [Synechocystis sp. PCC 6803]
 gi|451814414|ref|YP_007450866.1| hypothetical protein MYO_110980 [Synechocystis sp. PCC 6803]
 gi|1652745|dbj|BAA17664.1| sll1775 [Synechocystis sp. PCC 6803]
 gi|339273349|dbj|BAK49836.1| hypothetical protein SYNGTS_1088 [Synechocystis sp. PCC 6803]
 gi|359271316|dbj|BAL28835.1| hypothetical protein SYNGTI_1088 [Synechocystis sp. PCC 6803
           substr. GT-I]
 gi|359274486|dbj|BAL32004.1| hypothetical protein SYNPCCN_1087 [Synechocystis sp. PCC 6803
           substr. PCC-N]
 gi|359277656|dbj|BAL35173.1| hypothetical protein SYNPCCP_1087 [Synechocystis sp. PCC 6803
           substr. PCC-P]
 gi|407958168|dbj|BAM51408.1| hypothetical protein BEST7613_2477 [Bacillus subtilis BEST7613]
 gi|451780383|gb|AGF51352.1| hypothetical protein MYO_110980 [Synechocystis sp. PCC 6803]
          Length = 817

 Score =  400 bits (1027), Expect = e-108,   Method: Compositional matrix adjust.
 Identities = 275/817 (33%), Positives = 387/817 (47%), Gaps = 114/817 (13%)

Query: 86  DQGIPLGGIGAGSIGRSYRGEFQRFKLFHG--ICDDAPVLANQFSVFVSRPNGEKFSSVL 143
           D G+PLGG GAG IGRS  G+F  + L  G  I  + P  A QFSV+      E  +  +
Sbjct: 40  DHGMPLGGFGAGCIGRSPTGKFNLWHLDGGNHIFKNLP--ACQFSVYEQVAGEEAKAYAM 97

Query: 144 CPRSPGVPKKNTDSGIESWDWNLKGENCTYHALFPRAWTVYDGEPDPELRIVCRQISPFI 203
              +P       D  ++ W W  K E   Y AL+PR+W  Y      EL  +C Q SP  
Sbjct: 98  ATEAP------EDGSLDRWQWYPK-EKGEYSALYPRSWYEYKDVFKSEL--ICEQFSPVW 148

Query: 204 PHNYKESSFPASVFTFTLSNSGQTSADVTLLFTWANSVA-------------GDSGLSGH 250
             NY+ESS+P +VF +T+ N       ++++ TW NSV               D G   +
Sbjct: 149 GGNYQESSYPLAVFDWTVHNPTDKPITLSIMLTWQNSVGWFTNAIKSPTVKLRDDGSPEY 208

Query: 251 HF---------NSKTMTKDGVHGLTLHHRTA------NGRPPVTFAVAAEETADVHVSEC 295
            +         N     +DG     L +R         G   +  A  +  + +V     
Sbjct: 209 EYQPRWGDSTGNFNQWIQDGFRVGCLFNRVQPHEALREGEGQMCIASVSNPSVEVFYHN- 267

Query: 296 PCFLLSGNSKGITAKDMWNEIKKHGSFDHLDNDKTSPSEPGSSIGAAIAASLTIPSGSTR 355
                  N  G  A D+W+    +GS   + ++  + +EPG  I  AI    T+  G T+
Sbjct: 268 -----RWNPSGDGA-DVWDWFAGNGSLPDIQDE--TAAEPGEQIAGAICLRFTVRPGKTK 319

Query: 356 SVTFSLAWDCPEVKFFEKV-YHRRYTKFYGTLGDSAARIAHDAILEHAKWECEIEAWQRP 414
            + F LAWD P  +F + V Y+RRYT FYG  G +   +   A+     W  +IEAWQ P
Sbjct: 320 KIPFLLAWDLPITEFAQGVTYYRRYTDFYGRNGKNVWSMIRTALKHSDTWRKKIEAWQAP 379

Query: 415 ILEDKRFPEWYPITLFNELYYLNAGGTIWTDGSPPMQSLATIRERKFSLDTPSKIHTPAS 474
           ILE+ R P+W  ++LFNELY L  GGT+WT  +                           
Sbjct: 380 ILENPRLPDWLKMSLFNELYLLTQGGTLWTAAT--------------------------- 412

Query: 475 SDTALGTRLLENGEENIGQFLYLEGAEYVMYNTYDVHFYSSFALVMLFPKLELSIQRDFA 534
                     EN  + +GQF   E  +Y  Y + DV  Y SF L+M +PKLE ++   FA
Sbjct: 413 ----------EN--DPVGQFGVCECMDYRWYESLDVRLYGSFGLLMHWPKLEQAVMLAFA 460

Query: 535 AAVMMHDPGTMKIMSDGKWVARKCLGAVPHDIGL--DDPWFEINSYNLFNSSRWKDLNSK 592
            A+   DP    I  +     RK   A PHD+G   + PW   N     + + WKDL S 
Sbjct: 461 RAIPTEDPKERVIGYNLSLAPRKVKNATPHDLGAPNEHPWERSNYTAYQDCNLWKDLGSD 520

Query: 593 FVLQVYRDFV--ATGDKNFARAVWPSVYIAMAYMEQFDKDGDGMIENEGFPDQTYDAWSA 650
           FVL VYRDF+     D  F    WPS+  A+ Y++ FD D DG+ EN G PDQT+D W  
Sbjct: 521 FVLLVYRDFLLLPDADGEFLGECWPSIVAALDYLKGFDLDNDGIPENSGAPDQTFDDWRL 580

Query: 651 NGVSAYCGGLWVAALQAASALANDV---------GDHASASYFWVRYQ----KAKAVY-D 696
            G+SAYCGGLW+AAL+AA  +   +          D         RY+    +A+++Y D
Sbjct: 581 QGISAYCGGLWIAALEAALVMGQYLLENPPQQTDLDAMEIQTAIARYETWLAQARSLYQD 640

Query: 697 SLWNGSYFNYDNSDGSSSTSIQADQLAGQWYARACGLLPIADEAKVKKALTKIYDFNVLK 756
           +LWNG Y+N D+  GS    + ADQL GQ+YAR   L  + +    +KAL K+YD   LK
Sbjct: 641 TLWNGEYYNLDSQSGSK--VVMADQLCGQFYARLLNLPDVVEHHYAEKALAKVYDTCFLK 698

Query: 757 VKGGMCGAMNGMQPDGRI-DMSGLQAREIWPGVTYGLAASMIQEEMVDMAFQTAAGVYEV 815
              G  GA NG+ PDG   + +     E+W G+ +GLAA +IQ  M + A +    V   
Sbjct: 699 FAHGELGAANGLLPDGSPQNPNDTHPLEVWTGINFGLAAFLIQMGMKEEALKMTETVVNQ 758

Query: 816 AWSEDGLGYSFQTPESWNNNDEYRSLCYMRPLTIWAM 852
            ++    G  F+TPE+      +R+  Y+R + IWA+
Sbjct: 759 VYNN---GLQFRTPEAITAVGTFRASHYLRAMAIWAV 792


>gi|425447791|ref|ZP_18827773.1| Similar to tr|P73619|P73619 [Microcystis aeruginosa PCC 9443]
 gi|389731620|emb|CCI04385.1| Similar to tr|P73619|P73619 [Microcystis aeruginosa PCC 9443]
          Length = 784

 Score =  399 bits (1025), Expect = e-108,   Method: Compositional matrix adjust.
 Identities = 261/803 (32%), Positives = 399/803 (49%), Gaps = 93/803 (11%)

Query: 87  QGIPLGGIGAGSIGRSYRGEFQRFKLFHGICDDAPVLANQFSVFVSRPNGEKFSSVLCPR 146
            G+PLGG G G IGRS +G+F  + L  G      + A QFS+F    +G   +  +   
Sbjct: 41  HGMPLGGFGGGCIGRSPKGDFNLWHLDGGEHSFNSLAACQFSIFEQTEDGSAQAYAMATE 100

Query: 147 SPGVPKKNTDSGIESWDWNLKGENCTYHALFPRAWTVYDGEPDPELRIVCRQISPFIPHN 206
           SP        + +  W W    E  TY AL+PR+W  Y+ +   + +++C Q SP   HN
Sbjct: 101 SPKY------ATLSRWLW-YPAEKGTYSALYPRSW--YEYKDVFQTQLICEQFSPVWAHN 151

Query: 207 YKESSFPASVFTFTLSNSGQTSADVTLLFTWANSVA-------------GDSGLSGHHFN 253
           Y+ESS+P ++F +T+ N       ++++ TW N++               D G   + + 
Sbjct: 152 YQESSYPIALFEWTVHNPTDKPITISIMLTWQNTIGWFTNAIKSPVVKVRDDGSPVYEYQ 211

Query: 254 SKTMTKDGVH----------GLTLHHRTAN-GRPPVTFAVAAEETADVHVSECPCFLLSG 302
           SK    +G +          G  +     N G      A+A+     + V     F L  
Sbjct: 212 SKWGESEGNYNQWIVDNFRVGCLMRRDYENPGEGDGQMAIASITNPSLEV-----FYLGK 266

Query: 303 NSKGITAKDMWNEIKKHGSFDHLDNDKTSPSEPGSSIGAAIAASLTIPSGSTRSVTFSLA 362
            +      ++W+    +GS   ++++  +P++ G     A+A   TI  G TR + F LA
Sbjct: 267 WNPAGDGSEVWDYFAMNGSLPDIEDE--TPAQKGEQTATAMAIRFTIRPGKTRKIPFILA 324

Query: 363 WDCPEVKFFEKV-YHRRYTKFYGTLGDSAARIAHDAILEHAKWECEIEAWQRPILEDKRF 421
           WD P  +F + + Y+RRYT F+G  G +A  +   A+     W   I  WQ+PILE    
Sbjct: 325 WDLPVTEFAQGINYYRRYTDFFGRNGQNAWAMVRTALKHGDTWRENIINWQKPILERSDL 384

Query: 422 PEWYPITLFNELYYLNAGGTIWTDGSPPMQSLATIRERKFSLDTPSKIHTPASSDTALGT 481
            +W+ + LFNELY L  GGT+WT  +                                  
Sbjct: 385 ADWFKMALFNELYLLADGGTLWTAATE--------------------------------- 411

Query: 482 RLLENGEENIGQFLYLEGAEYVMYNTYDVHFYSSFALVMLFPKLELSIQRDFAAAVMMHD 541
                 ++ IGQF  LE  +Y  Y + DV  Y SF L+ML+P+LE ++   FA A+   D
Sbjct: 412 ------QDPIGQFGVLECIDYRWYESLDVRLYGSFGLLMLWPRLEKAVMEAFARAIPSSD 465

Query: 542 PGTMKIMSDGKWVARKCLGAVPHDIGL--DDPWFEINSYNLFNSSRWKDLNSKFVLQVYR 599
                I  +     RK   A PHD+G   + PW + N  +  + + WKDL S FVL VYR
Sbjct: 466 DTPRIIGYNQASAIRKAKNATPHDLGAPNEHPWQKSNYTSYQDCNLWKDLGSDFVLLVYR 525

Query: 600 DFVATG--DKNFARAVWPSVYIAMAYMEQFDKDGDGMIENEGFPDQTYDAWSANGVSAYC 657
           D++ TG  D++F R  WPS+ + + Y++ FD D DG+ EN G PDQT+D W   G+SAYC
Sbjct: 526 DYLLTGAKDEDFLRECWPSIVLTLQYLKTFDLDNDGIPENSGAPDQTFDDWRLQGISAYC 585

Query: 658 GGLWVAALQAASALANDVGDHASASYFWVRYQKAKAVY-DSLWNGSYFNYDNSDGSSSTS 716
           GGLW+AAL+A   +   +G+  +    W+  Q+++A+Y  +LWNG Y+N D+  GS S  
Sbjct: 586 GGLWIAALEATLKIGAILGEDTAIFASWL--QQSRAIYHQTLWNGEYYNLDS--GSGSDV 641

Query: 717 IQADQLAGQWYARACGLLPIADEAKVKKALTKIYDFNVLKVKGGMCGAMNGMQPDGRIDM 776
           +  DQL GQ+YAR   L  + +    + AL+KIY+   LK  GG  GA NG++PDG  + 
Sbjct: 642 VMTDQLCGQFYARLLSLPDVVENQYTQSALSKIYEACFLKFHGGKFGAANGLKPDGTPEN 701

Query: 777 -SGLQAREIWPGVTYGLAASMIQEEMVDMAFQTAAGVYEVAWSEDGLGYSFQTPESWNNN 835
                  E+W G+ +GL A ++Q  M    +Q    V +  + E+GL   F+TPE+  + 
Sbjct: 702 PEATHPLEVWVGINFGLVAFLLQMGMEKQGWQITEAVVQQIY-ENGL--QFRTPEAITSV 758

Query: 836 DEYRSLCYMRPLTIWAMQWALTK 858
             +R+  Y+R + IWA+   LT+
Sbjct: 759 GTFRACHYLRAMAIWAIYGVLTE 781


>gi|427716943|ref|YP_007064937.1| glucosylceramidase [Calothrix sp. PCC 7507]
 gi|427349379|gb|AFY32103.1| Glucosylceramidase [Calothrix sp. PCC 7507]
          Length = 800

 Score =  399 bits (1024), Expect = e-108,   Method: Compositional matrix adjust.
 Identities = 273/813 (33%), Positives = 394/813 (48%), Gaps = 107/813 (13%)

Query: 87  QGIPLGGIGAGSIGRSYRGEFQRFKLFHGICDDAPVLANQFSVFVSRPNGEKFSSVLCPR 146
            G+PLGG GAG IGRS RG+F  + +  G      + A QFSVF S     + +  L  +
Sbjct: 40  HGMPLGGFGAGCIGRSSRGDFNLWHIDGGEHTFQNIPACQFSVFESHGTSSQ-AYALSTQ 98

Query: 147 SPGVPKKNTDSGIESWDWNLKGENCTYHALFPRAWTVYDGEPDPELRIVCRQISPFIPHN 206
           +P       D  +  W W       TYHAL+PR+W VY  E   + ++ C Q SP   +N
Sbjct: 99  AP------EDGSLAGWKW-YPASGGTYHALYPRSWFVY--ENVFQAQLTCEQFSPVWANN 149

Query: 207 YKESSFPASVFTFTLSNSGQTSADVTLLFTWANSVA-------------GDSGLSGHHFN 253
           Y+E+S+P ++F +   N       ++++ TW N V               D G   + + 
Sbjct: 150 YQETSYPVAIFLWKAHNPTDAPITLSIMLTWENMVGWFTNALKSPEVRIRDDGSPVYEYQ 209

Query: 254 SKTMTKDGVHGLTLHHRTANGRPPVTFAVAAEETADVHVSECPCFLLSGNSKGI------ 307
            +     G +   + +    G     F    +  A V   +    + +     I      
Sbjct: 210 PRLGESQGNYNQLIENSGQVG----CFLSQVDRDASVAEGDGSWCIATLKHPQIEVFHHT 265

Query: 308 ------TAKDMWNEIKKHGSFDHLDNDKTSPSEPGSSIGAAIAASLTIPSGSTRSVTFSL 361
                 T +D+W      GS  +  +   +P+   + IG AIA   T+  G T  + FSL
Sbjct: 266 RWNPVGTGEDVWQSFAADGSLPNYQD--ATPAIENTRIGGAIAIRFTLQPGQTIEIPFSL 323

Query: 362 AWDCPEVKFFEKV-YHRRYTKFYGTLGDSAARIAHDAILEHAKWECEIEAWQRPILEDKR 420
           AWD P  +F   V Y+RRYT F+G  G++A  IA  A+ E+  W  +I+ WQ+PILE + 
Sbjct: 324 AWDFPVTEFAAGVNYYRRYTDFFGRSGENAWAIASTALQEYQNWRSQIQRWQKPILERED 383

Query: 421 FPEWYPITLFNELYYLNAGGTIWTDGSPPMQSLATIRERKFSLDTPSKIHTPASSDTALG 480
            P+W+ + LFNELY L +GGT+W+  S               LD                
Sbjct: 384 LPQWFKMALFNELYDLTSGGTLWSAAS--------------KLDP--------------- 414

Query: 481 TRLLENGEENIGQFLYLEGAEYVMYNTYDVHFYSSFALVMLFPKLELSIQRDFAAAVMMH 540
                     +GQF  LE  +Y  Y + DV  Y SFAL+MLFP+LE S+ R FA A+   
Sbjct: 415 ----------VGQFAVLECLDYRWYESLDVRLYGSFALLMLFPELEKSVIRAFARAIPHS 464

Query: 541 DPGT------MKIMSDGKWVARKCLGAVPHDIGLDDP--WFEINSYNLFNSSRWKDLNSK 592
           D           I +D     RK  GA PHD+G  +   W + N     + + WKDL S 
Sbjct: 465 DAHQRVIGYYYTIGADTTTAVRKVPGATPHDLGAPNEHVWEKTNYTCYQDCNLWKDLGSD 524

Query: 593 FVLQVYRDFVATG--DKNFARAVWPSVYIAMAYMEQFDKDGDGMIENEGFPDQTYDAWSA 650
           FVLQVYRDF+ TG  D  F    WP++   + Y++ FD+D DG+ EN G PDQT+D W  
Sbjct: 525 FVLQVYRDFLLTGADDVEFLTDCWPAIVETLNYLKTFDQDDDGIPENSGAPDQTFDDWRL 584

Query: 651 NGVSAYCGGLWVAALQAASALAN--------DVGDHA-SASYFWVRYQKAKAVYDS-LWN 700
            GVSAYCGGLW+AAL+AA A++N        D+G+ A   S +     +++ +Y+  LWN
Sbjct: 585 QGVSAYCGGLWLAALEAAIAISNILLNQNLADLGNLALQKSIYETWLAQSRPIYEEKLWN 644

Query: 701 GSYFNYDNSDGSSSTSIQADQLAGQWYARACGLLPIADEAKVKKALTKIYDFNVLKVKGG 760
           G Y+  D+  GS    + ADQL GQ+YAR  GL  I    +   AL  +Y+   LK   G
Sbjct: 645 GQYYRLDSKSGSD--VVMADQLCGQFYARLLGLPDIVASDRALSALQTVYEACFLKFYDG 702

Query: 761 MCGAMNGMQPDGRIDM-SGLQAREIWPGVTYGLAASMIQEEMVDMAFQTAAGVYEVAWSE 819
             GA NG++PDG  +        E+W G+ +GLAA ++Q  M D A +    V +  +  
Sbjct: 703 QFGAANGVRPDGSPENPQATHPLEVWTGINFGLAAFLVQMGMQDEALKLTQAVVKQIYDN 762

Query: 820 DGLGYSFQTPESWNNNDEYRSLCYMRPLTIWAM 852
              G  F+TPE+      +R+  Y+R + IWA+
Sbjct: 763 ---GLQFRTPEAITTVGTFRASTYLRAMAIWAI 792


>gi|428773649|ref|YP_007165437.1| glucosylceramidase [Cyanobacterium stanieri PCC 7202]
 gi|428687928|gb|AFZ47788.1| Glucosylceramidase [Cyanobacterium stanieri PCC 7202]
          Length = 800

 Score =  399 bits (1024), Expect = e-108,   Method: Compositional matrix adjust.
 Identities = 265/822 (32%), Positives = 389/822 (47%), Gaps = 110/822 (13%)

Query: 86  DQGIPLGGIGAGSIGRSYRGEFQRFKLFHGICDDAPVLANQFSVFVSRPNGEKFSSVLCP 145
           + G+PLGG GAG IGR+  G+F  + +  G      + A QF+VF      +     L  
Sbjct: 39  NHGMPLGGFGAGCIGRAPSGDFNLWHIDGGNHTFETLPACQFAVFEHSEGEDAQVYALST 98

Query: 146 RSPGVPKKNTDSGIESWDWNLKGENCTYHALFPRAWTVYDGEPDPELRIVCRQISPFIPH 205
            +P       D  + SW W  +G+  TY AL+PR+W  Y+G    +L  VC Q SP    
Sbjct: 99  NAP------EDGSLSSWSWYPQGKG-TYSALYPRSWFSYEGVFKSQL--VCEQFSPVWAG 149

Query: 206 NYKESSFPASVFTFTLSNSGQTSADVTLLFTWANSVA-------------GDSGLSGHHF 252
           +Y+ESS+P +VF +++ N       ++++ +W N+V               D G   + +
Sbjct: 150 SYQESSYPMAVFEWSIHNPTDKPITMSIMVSWQNTVGWFTNAIKSPTVKVRDDGSPEYEY 209

Query: 253 ---------NSKTMTKDGVHGLTLHHRTANGRPPV----TFAVAAEETADVHVSECPCFL 299
                    N     +DG     L  R      P      +A+A      + V     + 
Sbjct: 210 KPRWGNSTGNLNQWIQDGFRVGCLFDRIRLEDEPTEGEGQWAIATTTNPTLEV----FYH 265

Query: 300 LSGNSKGITAKDMWNEIKKHGSFDHLDNDKTSPSEPGSSIGAAIAASLTIPSGSTRSVTF 359
              N KG    ++W+    +GS     ++  +P+EPG  I  A+A   TI  G T+ + F
Sbjct: 266 TRWNPKG-DGSEIWDRFAMNGSLPDFQDE--TPAEPGEQIAGAMAIRFTIRPGKTKKIPF 322

Query: 360 SLAWDCPEVKFFEKV-YHRRYTKFYGTLGDSAARIAHDAILEHAKWECEIEAWQRPILED 418
            L WD P  +F + V Y RRYT F+G  G +   +    +  H  W+ +++ WQ PIL  
Sbjct: 323 ILGWDFPVTEFAQGVNYFRRYTDFFGRNGRNVWSMIRTGLKHHDMWQNKVKMWQEPILSR 382

Query: 419 KRFPEWYPITLFNELYYLNAGGTIWTDGSPPMQSLATIRERKFSLDTPSKIHTPASSDTA 478
              P+W  + LFNELY L  GGT+WT  +                D P            
Sbjct: 383 SNLPDWLKMALFNELYLLTQGGTLWTAATE---------------DDP------------ 415

Query: 479 LGTRLLENGEENIGQFLYLEGAEYVMYNTYDVHFYSSFALVMLFPKLELSIQRDFAAAVM 538
                       +GQF  LE  +Y  Y + DV  Y SF L+ LFP+LE S+   F+ A+ 
Sbjct: 416 ------------VGQFGVLECIDYRWYESLDVRLYGSFGLLQLFPRLEKSVMEAFSRAIP 463

Query: 539 MHDPGTMKIMSDGKWVARKCLGAVPHDIGL--DDPWFEINSYNLFNSSRWKDLNSKFVLQ 596
             D     I  +G    RK  GA PHD+G   + PW + N  +  + + WKDL S FVL 
Sbjct: 464 TADDTPRIIGYNGASAIRKAKGATPHDLGAPNEHPWIKSNYTSYQDCNLWKDLGSDFVLL 523

Query: 597 VYRDFVATG--DKNFARAVWPSVYIAMAYMEQFDKDGDGMIENEGFPDQTYDAWSANGVS 654
           VYRD++ TG  D++F    W SV   + Y++ FD D DG+ EN G PDQT+D W   G+S
Sbjct: 524 VYRDYLYTGKNDQDFLWECWDSVVETLHYVKGFDLDNDGIPENSGAPDQTFDDWRLQGIS 583

Query: 655 AYCGGLWVAALQAASALANDVGDH-----------------ASASYFWVRYQKAKAVY-D 696
           AYCGGLW+ AL+AA A+   + D+                  + + F     +++A+Y D
Sbjct: 584 AYCGGLWICALEAAIAIGTILLDNPPVNPALEKENAQENIQGAIALFRQWLHQSRAIYHD 643

Query: 697 SLWNGSYFNYDNSDGSSSTSIQADQLAGQWYARACGLLPIADEAKVKKALTKIYDFNVLK 756
           +LWNG Y+N D+  GS    + ADQL GQ+YA+   L  + ++  V   L KIYD   LK
Sbjct: 644 TLWNGEYYNLDSKSGSD--VVMADQLCGQYYAQLLKLPDVVEKEYVTATLNKIYDACYLK 701

Query: 757 VKGGMCGAMNGMQPDGRIDMSG-LQAREIWPGVTYGLAASMIQEEMVDMAFQTAAGVYEV 815
              G  G  NG++PDG  +        EIW G+ YG+ A MI   M +   + A  V   
Sbjct: 702 FHNGTLGIANGVKPDGSPEKENDTHPLEIWTGINYGIVAFMILNGMKEEGLKVAETVVNQ 761

Query: 816 AWSEDGLGYSFQTPESWNNNDEYRSLCYMRPLTIWAMQWALT 857
            +S    G  F+TPE+    + +R+  Y+R + IWA+  AL+
Sbjct: 762 VYSN---GLQFRTPEAITAVNTFRASHYLRAMAIWAIYQALS 800


>gi|282897193|ref|ZP_06305195.1| Protein of unknown function DUF608 [Raphidiopsis brookii D9]
 gi|281197845|gb|EFA72739.1| Protein of unknown function DUF608 [Raphidiopsis brookii D9]
          Length = 798

 Score =  398 bits (1022), Expect = e-108,   Method: Compositional matrix adjust.
 Identities = 274/824 (33%), Positives = 398/824 (48%), Gaps = 126/824 (15%)

Query: 87  QGIPLGGIGAGSIGRSYRGEFQRFKLFHGICDDAPVLANQFSVFVSRPNGEKFSSVLCPR 146
            G+PLGG GAG IGRS RG+F  + +  G      V   QFS+F S   G+ ++    P 
Sbjct: 38  HGMPLGGFGAGCIGRSSRGDFNLWHIDGGEHVFKNVPGCQFSIFTS---GQAYALSTQP- 93

Query: 147 SPGVPKKNTDSGIESWDW---------NLKGENCTYHALFPRAWTVYDGEPDPELRIVCR 197
               P+ NT   +++W W           K +  TYHAL+PR+W VY  E    L+  C 
Sbjct: 94  ----PEDNT---LQTWQWYPNVVHSSTGTKTDGGTYHALYPRSWFVY--ENLFPLQFTCE 144

Query: 198 QISPFIPHNYKESSFPASVFTFTLSNSGQTSADVTLLFTWANSVA-------------GD 244
           Q SP   HNY+E+S+P +VF + + N        +++ +W N                 D
Sbjct: 145 QFSPVWAHNYRETSYPVAVFLWNIHNPTNQPITASIMLSWENMTGWFTNALKSPQVKIRD 204

Query: 245 SGLSGHHFNSKTMTKDG--------------VHGLTLHHRTANGRPPVTFAVAAEETADV 290
            G   + +  +     G              V G   +H    G    T+ +A  +   V
Sbjct: 205 DGSPVYEYQPRWGESQGNYNWLAENDQYLGCVLGRATNHPVQEGDG--TWCIATVKNPQV 262

Query: 291 HVSECPCFLLSGNSKGITAKDMWNEIKKHGSFDHLDNDKTSPSEPGSSIGAAIAASLTIP 350
            +     F   GN      +++W +  ++GS  +  ++  +P++  S +GAA+A   TI 
Sbjct: 263 ELFYHCQFNPVGN-----GEELWRDFSQNGSLSNYQDE--TPADVNSRLGAAVAVRFTIS 315

Query: 351 SGSTRSVTFSLAWDCPEVKFFEKV-YHRRYTKFYGTLGDSAARIAHDAILEHAKWECEIE 409
            G T +V F L+WD P  +F   V Y+RRYT F+GT GD+A +IA  A+ E   W   IE
Sbjct: 316 PGETFTVPFVLSWDFPVTEFAAGVNYYRRYTDFFGTTGDNAWQIATCALKECYNWRSHIE 375

Query: 410 AWQRPILEDKRFPEWYPITLFNELYYLNAGGTIWTDGSPPMQSLATIRERKFSLDTPSKI 469
           +WQ+PILE +  P W+ + LFNELY L +GGT+W+ G+                      
Sbjct: 376 SWQKPILEREDLPSWFKMALFNELYDLTSGGTLWSAGTE--------------------- 414

Query: 470 HTPASSDTALGTRLLENGEENIGQFLYLEGAEYVMYNTYDVHFYSSFALVMLFPKLELSI 529
                             ++ +GQF  LE  +Y  Y + DV  Y SF L++LFP+LE ++
Sbjct: 415 ------------------KDPMGQFAVLECLDYRWYESLDVRLYGSFGLLILFPELEKAV 456

Query: 530 QRDFAAAVMMHDPGT------MKIMSDGKWVARKCLGAVPHDIGLDDPW-FEINSYNLF- 581
            R FA A+   D  +      + I S      RK  GA PHD+G  + W +E  +Y  + 
Sbjct: 457 IRAFARAIPQSDDRSRIIGYYLTINSPSPVALRKVAGATPHDLGAPNEWVWEQTNYTSYQ 516

Query: 582 NSSRWKDLNSKFVLQVYRDFVATG--DKNFARAVWPSVYIAMAYMEQFDKDGDGMIENEG 639
           + + WKDL   FVLQVYRDF+ TG  D  F    W  + + + Y++QFD D DG+ EN G
Sbjct: 517 DCNLWKDLGCDFVLQVYRDFLFTGGNDIEFLVDCWYGIVLTLDYLKQFDIDADGIPENSG 576

Query: 640 FPDQTYDAWSANGVSAYCGGLWVA----------ALQAASALANDVGDHASASYFWVRYQ 689
            PDQT+D W  +GVSAYCG LW+A           L     L+N VG+  S    W+  Q
Sbjct: 577 APDQTFDDWRLSGVSAYCGALWLAALEAAIAICDVLMNHPELSN-VGEQRSIYENWLN-Q 634

Query: 690 KAKAVYDSLWNGSYFNYDNSDGSSSTSIQADQLAGQWYARACGLLPIADEAKVKKALTKI 749
                   LWNG ++  D+  GS+   + ADQL+GQ+YAR   L  I  + +   ALT I
Sbjct: 635 SLPVYQQKLWNGKFYRLDSESGSN--VVMADQLSGQFYARLLNLPDIVPQDRALSALTTI 692

Query: 750 YDFNVLKVKGGMCGAMNGMQPDGRIDM-SGLQAREIWPGVTYGLAASMIQEEMVDMAFQT 808
           YD   LK + G  GA NG+ PDG  +        E+W G+ +GLAA ++Q  M D   + 
Sbjct: 693 YDACFLKFQDGKFGAANGVLPDGSPENPQATHPLEVWTGINFGLAAFLLQMGMKDQGLRL 752

Query: 809 AAGVYEVAWSEDGLGYSFQTPESWNNNDEYRSLCYMRPLTIWAM 852
              V    +     G  F+TPE+      +R+  Y+RP+ IWA+
Sbjct: 753 TQAVVRQIYDH---GLQFRTPEAITAAGTFRASTYLRPMAIWAI 793


>gi|47212590|emb|CAG12815.1| unnamed protein product [Tetraodon nigroviridis]
          Length = 777

 Score =  398 bits (1022), Expect = e-108,   Method: Compositional matrix adjust.
 Identities = 238/613 (38%), Positives = 335/613 (54%), Gaps = 44/613 (7%)

Query: 47  QEFRHLAHIGFRLYRY-SKQEEAKGKVPVFNVFRKHHITSDQGIPLGGIGAGSIGRSYRG 105
           Q  +HL  +G R  R+  K+   + K P  ++F    +    G PLGGIG G+I R +RG
Sbjct: 43  QVLKHLG-LGIRYLRWWYKKTRVEKKAPFIDMFGALPLRQIYGAPLGGIGGGTITRGWRG 101

Query: 106 EFQRFKLFHGICDDAPVLANQFSVFVSRPNGEKFSSVLCPRSPGVPKKNTDSGIESWDWN 165
           EF R++L  G      V+ NQF+V + R     +  VL    P          ++ W+W 
Sbjct: 102 EFCRWQLNPGKYHYKTVIENQFTVCLRRDGQTVYQQVLSVERPPT--------LQGWNWG 153

Query: 166 LKGENCTYHALFPRAWTVYDGEPDPELRIVCRQISPFIPHNYKESSFPASVFTFTLSNSG 225
             GE   YHAL+PRAWTVY   P   + + CRQISP IPH+Y+++S P +VF + + N  
Sbjct: 154 FCGEYAYYHALYPRAWTVYH-LPGQNVTLTCRQISPVIPHDYQDASLPVAVFVWDVENKN 212

Query: 226 QTSADVTLLFTWANSVAGDSGLSGHHFNSK-TMTKDG--VHGLTLHHRTANGRPPVTFAV 282
             + DV+++FT  N        SG H+N    + K+G  V G+ LHH TA    P T  +
Sbjct: 213 DYALDVSIMFTALNGSGHRDDKSGGHWNQPFRLEKEGEAVSGVLLHHCTAVN--PYTLCI 270

Query: 283 AAEETADVHVSECPCFLLSGNSKGITAKDMWNEIKKHGSFDHLDNDKTSPSEPGSSIGAA 342
           AA E  D  VS    F   G   G     +WN++   G  D      + P+  G  + AA
Sbjct: 271 AAREQPDRDVSHQTAFSPKGTCNG-----LWNDLITDGRLDS-PTGSSPPTAKGERVAAA 324

Query: 343 IAASLTIPSGSTRSVTFSLAWDCPEVKFF--EKVYHRRYTKFYGTLGDSAARIAHDAILE 400
           +AA   + + +   + FSLAWD P++ F   E+ + RRYT++YG+ GD++  ++H A+ +
Sbjct: 325 LAAGCAVSAQNRNCLEFSLAWDMPKITFGAGEQQHSRRYTRYYGSKGDASPLLSHHALTQ 384

Query: 401 HAKWECEIEAWQRPILEDKRFPEWYPITLFNELYYLNAGGTIWTDGSPPMQSLATIRERK 460
           +  WE  IE WQ P+L+D   P WY   LFNELY++  GGT+WT+ +        +R   
Sbjct: 385 YRGWEKRIEEWQTPVLQDSSLPAWYKSALFNELYFVADGGTVWTELAEDADVSGGLR--- 441

Query: 461 FSLDTPSKIHTPASSDTALGTRLLENGEENIGQFLYLEGAEYVMYNTYDVHFYSSFALVM 520
                P     PA               +  G+F YLEG EY MYNTYDVH+Y+SFAL+M
Sbjct: 442 -----PEDGGLPAQPAVI----------KEYGRFAYLEGQEYRMYNTYDVHYYASFALIM 486

Query: 521 LFPKLELSIQRDFAAAVMMHDPGTMKIMSDGKWVARKCLGAVPHDIG--LDDPWFEINSY 578
           L+PKL LS+Q D A +V+  DP     + +G+    K  G VPHDIG   D+PW  +N+Y
Sbjct: 487 LWPKLALSLQYDIAGSVVQQDPTERLYLMNGRCSPVKTKGVVPHDIGDPEDEPWQRVNAY 546

Query: 579 NLFNSSRWKDLNSKFVLQVYRDFVATGDKNFARAVWPSVYIAMAYMEQFDKDGDGMIENE 638
            + +++ WKDLN KFVLQVYRDF  T D  + R +WP   +      QFD DGDG+IEN 
Sbjct: 547 LIHDTAGWKDLNLKFVLQVYRDFHLTQDAQYLRDMWPICEMVTESELQFDLDGDGLIENS 606

Query: 639 GFPDQTYDAWSAN 651
           G+ DQTYD W+A+
Sbjct: 607 GYADQTYDGWAAS 619



 Score =  124 bits (312), Expect = 2e-25,   Method: Compositional matrix adjust.
 Identities = 60/116 (51%), Positives = 77/116 (66%), Gaps = 2/116 (1%)

Query: 741 KVKKALTKIYDFNVLKVKGGMCGAMNGMQPDGRIDMSGLQAREIWPGVTYGLAASMIQEE 800
           K++ AL  I+D NV+   GG  GA+NGM+P+G  D S +Q+ E+W GV YGLAA+MI E 
Sbjct: 663 KIQTALKSIFDLNVMSFAGGQMGAVNGMRPEGVPDRSSVQSDEVWIGVVYGLAATMIHEG 722

Query: 801 MVDMAFQTAAGVYEVAWSEDGLGYSFQTPESWNNNDEYRSLCYMRPLTIWAMQWAL 856
           M++   +TA G Y   W    LG +FQTPE++     YRSL YMRPL IWAMQ AL
Sbjct: 723 MLEEGMRTAEGCYRTVWER--LGMAFQTPEAYCEKGIYRSLAYMRPLCIWAMQLAL 776


>gi|428307493|ref|YP_007144318.1| glucosylceramidase [Crinalium epipsammum PCC 9333]
 gi|428249028|gb|AFZ14808.1| Glucosylceramidase [Crinalium epipsammum PCC 9333]
          Length = 811

 Score =  394 bits (1013), Expect = e-106,   Method: Compositional matrix adjust.
 Identities = 274/829 (33%), Positives = 396/829 (47%), Gaps = 126/829 (15%)

Query: 87  QGIPLGGIGAGSIGRSYRGEFQRFKLFHGICDDAPVLANQFSVFVSRPNGEKFSSVLCPR 146
            G+PLGG GAG +GRS RG+F  + +  G      + A QFSVF    N    +  +C  
Sbjct: 41  HGMPLGGFGAGCLGRSSRGDFNLWHIDGGEHTFKSLPACQFSVFEQAGNSAPQAYAMCTE 100

Query: 147 SPGVPKKNTDSGIESWDWNLKG-----ENCTYHALFPRAWTVYDGEPDPELRIVCRQISP 201
            P       D  + +W W  K       + TYHAL+PR+W VYDG    +L   C Q SP
Sbjct: 101 PP------EDGTLSTWQWYPKTGKGTESSGTYHALYPRSWFVYDGVFQAQL--TCEQFSP 152

Query: 202 FIPHNYKESSFPASVFTFTLSNSGQTSADVTLLFTWANSVA-------------GDSGLS 248
               NY+ESS+P  VF +T  N       ++++ TW NSV               D G  
Sbjct: 153 IWADNYQESSYPVGVFEWTAHNPTDQPITLSIMLTWQNSVGWFTNAIKTPQVKVRDDGSP 212

Query: 249 GHHFNSKTMTKDG---------------VHGLTLHHRTANGRPPVTFAVAAEETADVHVS 293
            + +  +     G               +  + +H     G      A     + +V   
Sbjct: 213 VYEYQPRWKDSTGNLNQWIVDHFRAGCLLDRVRIHDEVQEGEGQWAIATITNPSVEV--- 269

Query: 294 ECPCFLLSGNSKGITAKDMWNEIKKHGSFDHLDNDKTSPSEPGSSIGAAIAASLTIPSGS 353
               F  +  +   + +++W+     GS    D +  +P++PG  I AA+    T+  G 
Sbjct: 270 ----FYHTRWNPAGSGEEIWDNFAFDGSLR--DKEDETPADPGEQIAAAMTIRFTVRPGR 323

Query: 354 TRSVTFSLAWDCPEVKFFEKV-YHRRYTKFYGTLGDSAARIAHDAILEHAKWECEIEAWQ 412
           TR + F LAWD P  +F   V Y+RRYT F+G  G++A  I   A+     W+ +I++WQ
Sbjct: 324 TRKIPFVLAWDFPVTEFATGVNYYRRYTDFFGRTGNNAWSIVRTALKHDDTWKEQIQSWQ 383

Query: 413 RPILEDKRFPEWYPITLFNELYYLNAGGTIWTDGSPPMQSLATIRERKFSLDTPSKIHTP 472
              L++   P+W+ + LFNELY L  GGT+WT                            
Sbjct: 384 ELTLKNANLPDWFKMALFNELYILADGGTLWT---------------------------- 415

Query: 473 ASSDTALGTRLLENGEENIGQFLYLEGAEYVMYNTYDVHFYSSFALVMLFPKLELSIQRD 532
           A+S+T           + IGQF  LE  +Y  Y + DV  Y SFAL ML+PKL+ ++   
Sbjct: 416 AASET-----------DGIGQFGVLECLDYRWYESLDVRLYGSFALAMLWPKLDKAVLEA 464

Query: 533 FAAAVMMHDPGTMKIMSDGKWVARKCLGAVPHDIGL--DDPWFEINSYNLFNSSRWKDLN 590
           +A A+   D     I  +     RK   A PHD+G   + PW + N  +  + + WKDL 
Sbjct: 465 YARAIPTSDNTPRIIGYNQASAIRKAANATPHDLGAPNEHPWEKTNYTSYQDCNLWKDLG 524

Query: 591 SKFVLQVYRDFVATG--DKNFARAVWPSVYIAMAYMEQFDKDGDGMIENEGFPDQTYDAW 648
           S FVLQVYRD++ TG  D  F    W ++   +AY+++FD DGDG+ EN G PDQT+D W
Sbjct: 525 SDFVLQVYRDYLLTGATDIEFLSECWSAIVKTLAYLKEFDLDGDGIPENSGAPDQTFDDW 584

Query: 649 SANGVSAYCGGLWVAALQAASALANDVGDH-----------ASASY-FWVRYQKAKAVY- 695
              GVSAYCGGLW+AAL+AA A+   + +H           A A+Y  W+  +++K +Y 
Sbjct: 585 RLQGVSAYCGGLWLAALEAAIAIGEILINHGVELEDGDFESAIATYKSWL--EQSKPLYQ 642

Query: 696 DSLWNGSYFNYDNSDGSSSTSIQADQLAGQWYARACGLLPIADEAKVKKALTKIYDFNVL 755
           + LWNG Y+  D+  GS+   + ADQL GQ+YAR   L  I     V+ AL  +YD   L
Sbjct: 643 EKLWNGQYYQLDSESGSN--VVMADQLCGQFYARLLNLPDIVPNDCVESALKTVYDSCFL 700

Query: 756 KVKGGM-----------CGAMNGMQPDGRIDM-SGLQAREIWPGVTYGLAASMIQEEMVD 803
           K    +            GA NG+  DG  +        E+W G+ +GLAA MIQ  M D
Sbjct: 701 KFNQKLSQHSESLTPTFIGAANGVLADGSPENPQATHPLEVWTGINFGLAAFMIQMGMKD 760

Query: 804 MAFQTAAGVYEVAWSEDGLGYSFQTPESWNNNDEYRSLCYMRPLTIWAM 852
            AFQ    V +  ++    G  F+TPE+      +R+  Y+R + IWA+
Sbjct: 761 EAFQLTEAVVQQIYNN---GLQFRTPEAITAAGTFRASHYLRAMAIWAI 806


>gi|427733995|ref|YP_007053539.1| putative bile acid beta-glucosidase [Rivularia sp. PCC 7116]
 gi|427369036|gb|AFY52992.1| putative bile acid beta-glucosidase [Rivularia sp. PCC 7116]
          Length = 799

 Score =  394 bits (1013), Expect = e-106,   Method: Compositional matrix adjust.
 Identities = 266/820 (32%), Positives = 392/820 (47%), Gaps = 118/820 (14%)

Query: 87  QGIPLGGIGAGSIGRSYRGEFQRFKLFHGICDDAPVLANQFSVFVSRPNGEKFSSVLCPR 146
           QG+PLGG GAG IGRS RG+F  + +  G      + A QFS+F S  N     ++    
Sbjct: 40  QGMPLGGFGAGCIGRSSRGDFNLWHIDGGEHTFGNIPACQFSLFESSSNRNVAYAL---- 95

Query: 147 SPGVPKKNTDSGIESWDW---NLKGENC-TYHALFPRAWTVYDGEPDPELRIVCRQISPF 202
                + + DS + +W+W     KG++  TYHAL+PR+W VY  E   + R+ C Q SP 
Sbjct: 96  ---STEPSNDSTLSAWEWYPSRSKGDSSGTYHALYPRSWFVY--ENVFQSRLSCEQFSPI 150

Query: 203 IPHNYKESSFPASVFTFTLSNSGQTSADVTLLFTWANSVA-------------------- 242
              NY+ESS+P ++F +  SN       ++++ TW N V                     
Sbjct: 151 WADNYQESSYPVAIFLWNASNPTSEPVTLSIMLTWQNMVGWFTNTLKSPEVEMRDDGSPV 210

Query: 243 ----GDSGLSGHHFNSKTMTKDGVHGLTLHHRTANGRP---PVTFAVAAEETADVHVSEC 295
                + G S  +FN + +T     G  L  +     P     ++ +A  +   V +   
Sbjct: 211 YDYQPNLGESKGNFN-RLVTNSEYIGCVLD-KIGTAEPIEGEGSWCIATAKHPKVEI--- 265

Query: 296 PCFLLSGNSKGITAKDMWNEIKKHGSFDHLDNDKTSPSEPGSSIGAAIAASLTIPSGSTR 355
             F  +  +   T +D+W    K GS  ++ ++  +P+     +GAAIA   T+  G + 
Sbjct: 266 --FHHNRWNPTSTGEDIWRSFAKTGSLANISDE--TPAISNEQVGAAIAVRFTLQPGESL 321

Query: 356 SVTFSLAWDCPEVKFFEKV-YHRRYTKFYGTLGDSAARIAHDAILEHAKWECEIEAWQRP 414
            + F++AWD P  +F   V Y RRYT F+G  G +A  IA  A+ E+  W  +++AWQ+P
Sbjct: 322 QIPFAVAWDFPITEFVSGVTYKRRYTDFFGDNGKNAWNIATTALEEYQNWLSQVQAWQQP 381

Query: 415 ILEDKRFPEWYPITLFNELYYLNAGGTIWTDGSPPMQSLATIRERKFSLDTPSKIHTPAS 474
           I+E    P+W+ + LFNELY L AGGT+W+  S                           
Sbjct: 382 IVERPDLPDWFKMALFNELYDLTAGGTLWSAASK-------------------------- 415

Query: 475 SDTALGTRLLENGEENIGQFLYLEGAEYVMYNTYDVHFYSSFALVMLFPKLELSIQRDFA 534
                         + IGQF  LE  +Y  Y + DV  Y SF L+ LFP+LE ++ R FA
Sbjct: 416 -------------RDPIGQFAVLECLDYRWYESLDVRLYGSFGLLNLFPELEKAVMRAFA 462

Query: 535 AAVMMHDPGT------MKIMSDGKWVARKCLGAVPHDIGLDDP--WFEINSYNLFNSSRW 586
            A+   D  T        I ++     RK  GA PHD+G  +   W + N  +  + ++W
Sbjct: 463 RAIPESDEKTRVIGYYYTIGAESPLAVRKAAGATPHDLGAPNEHVWEKTNYTSYQDCNQW 522

Query: 587 KDLNSKFVLQVYRDFVATG--DKNFARAVWPSVYIAMAYMEQFDKDGDGMIENEGFPDQT 644
           KDL   FVLQVYRDF+ TG  D  F    W ++   + Y++ FDKDGDG+ EN G PDQT
Sbjct: 523 KDLGCDFVLQVYRDFIFTGSNDIEFLADCWNAIVETLNYLKAFDKDGDGIPENSGAPDQT 582

Query: 645 YDAWSANGVSAYCGGLW----------VAALQAASALANDVGDHASASYFWVRYQKAKAV 694
           +D W   GVSAYCGGLW             L +      ++    S    W+  +K++ V
Sbjct: 583 FDDWRLLGVSAYCGGLWLAALEAAIAIAEVLSSKQVQVENLAAQKSIYEGWL--EKSRPV 640

Query: 695 Y-DSLWNGSYFNYDNSDGSSSTSIQADQLAGQWYARACGLLPIADEAKVKKALTKIYDFN 753
           Y + LWNG ++  D+  GS    + ADQL GQ+YAR   L  I        AL  +YD  
Sbjct: 641 YQEKLWNGKFYRLDSESGSD--VVMADQLCGQFYARLLKLPDIVPPECTMSALNSVYDAC 698

Query: 754 VLKVKGGMCGAMNGMQPDGRIDM-SGLQAREIWPGVTYGLAASMIQEEMVDMAFQTAAGV 812
            LK   G  GA NG++ DG  +        E+W G+ +G+AA ++Q  M   AF+    V
Sbjct: 699 FLKFNQGKLGAANGLRLDGSPENPDATHPLEVWTGINFGIAAFLVQMGMKSEAFKLTEAV 758

Query: 813 YEVAWSEDGLGYSFQTPESWNNNDEYRSLCYMRPLTIWAM 852
               +     G  F+TPE+      +R+  Y+R + IWA+
Sbjct: 759 VNQIYEH---GLQFRTPEAITATRNFRACVYLRAMAIWAI 795


>gi|170079447|ref|YP_001736085.1| hypothetical protein SYNPCC7002_A2862 [Synechococcus sp. PCC 7002]
 gi|169887116|gb|ACB00830.1| Conserved hypothetical protein (DUF608) [Synechococcus sp. PCC
           7002]
          Length = 803

 Score =  394 bits (1012), Expect = e-106,   Method: Compositional matrix adjust.
 Identities = 277/822 (33%), Positives = 386/822 (46%), Gaps = 114/822 (13%)

Query: 87  QGIPLGGIGAGSIGRSYRGEFQRFKLFHGICDDAPVLANQFSVFVSRPNGEKFSSVLCPR 146
            G+PLGG GAG IGRS RG+F  + L  G        A QFSVF  +P GE   +     
Sbjct: 41  HGMPLGGFGAGCIGRSPRGDFNLWHLDAGEHTFRNAGACQFSVF-EQPEGEAAQAYAM-- 97

Query: 147 SPGVPKKNTDSGIESWDWNLKGENCTYHALFPRAWTVYDGEPDPELRIVCRQISPFIPHN 206
           S   P+  T   ++SW W    +  TY AL+PR+W  Y G    EL   C Q SP I  N
Sbjct: 98  STVAPEDGT---LDSWQW-YPAKKGTYAALYPRSWYEYQGAFAAEL--TCEQFSPIIAQN 151

Query: 207 YKESSFPASVFTFTLSNSGQTSADVTLLFTWANSVA-------------GDSGLSGHHFN 253
           Y+E+S+P +VF +T  N       V+++ TW N V               D G   + + 
Sbjct: 152 YQETSYPVAVFEWTAHNPTDKPITVSIMMTWQNMVGWFTKAQKTPEVVLRDDGSPVYEYQ 211

Query: 254 SKTMTKDG---------------VHGLTLHHRTANGRPPVTFAVAAEETADVHVSECPCF 298
           S+     G               ++ +  H   + G   + FA     T +V        
Sbjct: 212 SRWGDSTGNLNQWVQDRYRVGCLLNRVRPHADLSEGEGQLCFATVTNPTLEVFYHN---- 267

Query: 299 LLSGNSKGITAKDMWNEIKKHGSFDHLDNDKTSPSEPGSSIGAAIAASLTIPSGSTRSVT 358
               N  G   +D+W+    +GS   ++++  +P+ PG  I  AIA   T+  G T+ + 
Sbjct: 268 --RWNPVG-DGRDVWDYFAGNGSLPDIEDE--TPAAPGEQIAGAIAVRFTLRPGKTKKIP 322

Query: 359 FSLAWDCPEVKFFEKV-YHRRYTKFYGTLGDSAARIAHDAILEHAKWECEIEAWQRPILE 417
           F LAWD P ++F   + Y RRYT F+G  G++   +   A+     W+  I AWQ+PIL 
Sbjct: 323 FFLAWDLPVMEFAPDITYFRRYTDFFGRSGNNVWTMIRTAMKHSDLWKERIAAWQQPILG 382

Query: 418 DKRFPEWYPITLFNELYYLNAGGTIWTDGSPPMQSLATIRERKFSLDTPSKIHTPASSDT 477
               P+W  + LFNELY L  GG++WT  +                              
Sbjct: 383 RDDLPDWLKMALFNELYLLTQGGSLWTAAT------------------------------ 412

Query: 478 ALGTRLLENGEENIGQFLYLEGAEYVMYNTYDVHFYSSFALVMLFPKLELSIQRDFAAAV 537
                  EN  + +GQF  LE  +Y  Y + DV  Y SF L ML+P+L+ ++   FA A+
Sbjct: 413 -------EN--DPVGQFGVLECLDYCWYESLDVRMYGSFGLTMLWPRLDKAVLEAFARAI 463

Query: 538 MMHDPGTMKIMSDGKWVARKCLGAVPHDIGL--DDPWFEINSYNLFNSSRWKDLNSKFVL 595
              D     I  +     RK  GA PHD+G   + PW   N     + + WKDL S FVL
Sbjct: 464 PTSDDTQRMIGYNKTMATRKVAGATPHDLGAPNEHPWEATNYTAYQDCNLWKDLGSDFVL 523

Query: 596 QVYRDFVATG--DKNFARAVWPSVYIAMAYMEQFDKDGDGMIENEGFPDQTYDAWSANGV 653
           QVYRDFV TG  D  F  + W SV  A+ Y++ FD DGDG+ EN G PDQT+D W   G+
Sbjct: 524 QVYRDFVMTGAEDTEFLWSCWGSVLTALDYLKTFDLDGDGIPENGGAPDQTFDDWKLKGI 583

Query: 654 SAYCGGLWVAA----------LQAASALANDVGDHASASYF---WVRYQ----KAKAVY- 695
           SAYCGGLW+AA          LQ + +      +  SA+ F      YQ    +A+ +Y 
Sbjct: 584 SAYCGGLWIAALEAAIAMAKILQQSPSHFEAHQNQVSAAEFEQAIATYQTWLDQARPLYQ 643

Query: 696 DSLWNGSYFNYDNSDGSSSTSIQADQLAGQWYARACGLLPIADEAKVKKALTKIYDFNVL 755
           D LWNGSY+  D   GS+S  + ADQL GQ+Y +   L  +    +    L  IY     
Sbjct: 644 DILWNGSYYTLDT--GSNSRVVMADQLCGQYYTQLLSLPDVNPGDRTATTLATIYTACFE 701

Query: 756 KVKGGMCGAMNGMQPDGRIDMSG-LQAREIWPGVTYGLAASMIQEEMVDMAFQTAAGVYE 814
           K  GG  GA NG+ PDG  +        E+W G+ +G+AA MI+  M   A +    V E
Sbjct: 702 KFHGGQFGAANGLNPDGTPEKENDTHPLEVWTGINFGIAALMIRNGMQTEALRMVEAVVE 761

Query: 815 VAWSEDGLGYSFQTPESWNNNDEYRSLCYMRPLTIWAMQWAL 856
             +     G  F+TPE+   N  +R+  Y+R + IWA+  AL
Sbjct: 762 QVYDN---GLQFRTPEAITANGTFRACHYLRAMGIWAIYDAL 800


>gi|220909879|ref|YP_002485190.1| hypothetical protein Cyan7425_4519 [Cyanothece sp. PCC 7425]
 gi|219866490|gb|ACL46829.1| protein of unknown function DUF608 [Cyanothece sp. PCC 7425]
          Length = 814

 Score =  392 bits (1008), Expect = e-106,   Method: Compositional matrix adjust.
 Identities = 277/824 (33%), Positives = 402/824 (48%), Gaps = 118/824 (14%)

Query: 87  QGIPLGGIGAGSIGRSYRGEFQRFKLFHGICDDAPVLANQFSVF--VSRPNGEKFS-SVL 143
            G+PLGG GAG +GRS+RG+F  + +  G      + A QFSVF  V +  G+ +S S +
Sbjct: 42  HGMPLGGFGAGCVGRSHRGDFNLWHIDGGEHVFQSLPACQFSVFEQVGQSPGKAYSLSTV 101

Query: 144 CPRSPGVPKKNTDSGIESWDWNLKGENCTYHALFPRAWTVYDGEPDPELRIVCRQISPFI 203
            P          D  + SW W  +    +YHAL+PR+W VY      +L  VC Q SP +
Sbjct: 102 APE---------DGSLNSWQWYPQ-SGGSYHALYPRSWFVYQNVFQADL--VCEQFSPIL 149

Query: 204 PHNYKESSFPASVFTFTLSNSGQTSADVTLLFTWANSVA-------------GDSGLSGH 250
             NY+E+S+P +VF +T  N       ++L+ TW N V               D G   +
Sbjct: 150 TENYQETSYPVAVFHWTAHNPTDQPLTLSLMLTWQNVVGWFTNADKSPVVKVRDDGSPVY 209

Query: 251 HFNSKTMTKDGVHGLTLHHRTANGRPPVTFAVAAEETADVHVSE---------------C 295
            +  +     G     +     NGR  V F +    + +  ++E                
Sbjct: 210 TYQPRLGESQGNFNRVIDD---NGR--VGFVLDRLRSEEAAIAEGEGQWAVATFASPYTS 264

Query: 296 PCFLLSGNSKGITAKDMWNEIKKHGSFDHLDNDKTSPSEPGSSIGAAIAASLTIPSGSTR 355
             F  +  +      ++W      GS  + D++  +P+  G  I AAIA   T+P G TR
Sbjct: 265 EVFYYTRWNPVGDGSELWQTFAADGSLANCDDE--TPAGVGEQIAAAIAVRFTLPPGQTR 322

Query: 356 SVTFSLAWDCPEVKFFEKV-YHRRYTKFYGTLGDSAARIAHDAILEHAKWECEIEAWQRP 414
            + F LAWD P  +F   V Y RRYT F+G  GD    IA   +  H  W  +I+AWQ+P
Sbjct: 323 QIPFVLAWDFPVTEFAAGVNYFRRYTDFFGRTGDHGWAIARTGLKMHQTWTDQIQAWQQP 382

Query: 415 ILEDKRFPEWYPITLFNELYYLNAGGTIWTDGSPPMQSLATIRERKFSLDTPSKIHTPAS 474
           IL D   P+W+ + L NELY L +GGT+W+             ER               
Sbjct: 383 IL-DSNEPDWFKMALLNELYDLTSGGTLWSAAD----------ER--------------- 416

Query: 475 SDTALGTRLLENGEENIGQFLYLEGAEYVMYNTYDVHFYSSFALVMLFPKLELSIQRDFA 534
                         + IGQF  LE  +Y  Y + DV  Y SFAL++L+P+LE ++   FA
Sbjct: 417 --------------DPIGQFAVLECLDYRWYESLDVRLYGSFALLLLWPRLEKAVMEAFA 462

Query: 535 AAVMMHDPGTMKI---MSDG---KWVARKCLGAVPHDIGL--DDPWFEINSYNLFNSSRW 586
            A+   D  T  I    + G        K   A PHD+G   + PW + N  +  + + W
Sbjct: 463 RAIPTQDDRTRVIGYYYTQGMPSPTALHKIANATPHDLGAPNEHPWEKTNYTSYQDCNLW 522

Query: 587 KDLNSKFVLQVYRDFVATG--DKNFARAVWPSVYIAMAYMEQFDKDGDGMIENEGFPDQT 644
           KDL + FV+ VYRD++ TG  D +F    WP+++  +AY++ FD DGDG+ EN G PDQT
Sbjct: 523 KDLPADFVILVYRDYLLTGGTDYDFLTTCWPAIHATLAYLKTFDLDGDGIPENSGAPDQT 582

Query: 645 YDAWSANGVSAYCGGLWVAALQAASALANDVGDHA--------SASYFWVRYQKAKAVY- 695
           +D W   G+SAYCGGLW+AAL+AA A+   +   A        S + +     +++ +Y 
Sbjct: 583 FDDWRLQGISAYCGGLWIAALEAAIAIIKLLLVQAPDLPHLQESLTTYQTWLAQSRPLYQ 642

Query: 696 DSLWNGSYFNYDNSDGSSSTSIQADQLAGQWYARACGLLPIADEAKVKKALTKIYDFNVL 755
            +LWNG Y+  D+  GS S  + ADQL GQ+YAR  GL  +  E   + AL  +Y    L
Sbjct: 643 QTLWNGQYYRLDS--GSGSDVVMADQLCGQFYARLVGLPDVVPEDCARAALDMVYQTCFL 700

Query: 756 KVKGGMCGAMNGMQPDGRIDM-SGLQAREIWPGVTYGLAASMIQEEMVDMAFQ-TAAGVY 813
           +   G  GA NG++P G  +        E+W G+ +GLAA ++Q      A Q TAA VY
Sbjct: 701 QFHNGTLGAANGLRPGGLAEKPDATHPLEVWTGINFGLAAFLVQMGRKAEAMQLTAAVVY 760

Query: 814 EVAWSEDGLGYSFQTPESWNNNDEYRSLCYMRPLTIWAMQWALT 857
           ++   E GL   F+TPE+      +R+  Y+RP+ IWA+   LT
Sbjct: 761 QI--YEHGL--QFRTPEAITAEQTFRAGMYLRPMAIWAIYHLLT 800


>gi|17230308|ref|NP_486856.1| hypothetical protein alr2816 [Nostoc sp. PCC 7120]
 gi|17131910|dbj|BAB74515.1| alr2816 [Nostoc sp. PCC 7120]
          Length = 804

 Score =  392 bits (1007), Expect = e-106,   Method: Compositional matrix adjust.
 Identities = 273/820 (33%), Positives = 390/820 (47%), Gaps = 114/820 (13%)

Query: 87  QGIPLGGIGAGSIGRSYRGEFQRFKLFHG--ICDDAPVLANQFSVFVSRPNGEKFSSVLC 144
            G+PLGG GAG IGRS RG+F  + +  G  I  + P  A QFSVF    NG    +   
Sbjct: 40  HGMPLGGFGAGCIGRSSRGDFNLWHIDGGEHIFQNFP--ACQFSVF--EANGTSSQAYAL 95

Query: 145 PRSPGVPKKNTDSGIESWDWNLKGENC----TYHALFPRAWTVYDGEPDPELRIVCRQIS 200
              P       D  ++SW W           TYHAL+PR+W VY+     EL   C Q S
Sbjct: 96  STQP-----TDDESLKSWQWYPASSETQTTGTYHALYPRSWFVYENVFQAEL--TCEQFS 148

Query: 201 PFIPHNYKESSFPASVFTFTLSNSGQTSADVTLLFTWANSVA-------------GDSGL 247
           P   +NY+E+S+P +VF +   N       ++++ TW N V               D G 
Sbjct: 149 PIWANNYQETSYPVAVFAWQAHNPTNAPITLSIMLTWQNMVGWFTNALKSPDVRVRDDGS 208

Query: 248 SGHHFNSKTMTKDGVHGLT----LHHRTANGRPPVTFAVAAEE----TADVHVSECPCFL 299
             + +  +     G +        H     GR  +T  +   E     A +  S+   F 
Sbjct: 209 PVYEYQPRLGESGGNYNYLDESPQHLGCFLGRVGMTEPLQEGEGSWCIATLKHSQVEVFY 268

Query: 300 LSGNSKGITAKDMWNEIKKHGSF-DHLDNDKTSPSEPGSSIGAAIAASLTIPSGSTRSVT 358
            +  +   T +++W      GS  +H+D    + +E    +GAAIA   T+  G T  + 
Sbjct: 269 HTRWNPVGTGEEVWQSFAADGSLANHIDTSSVTENE---QLGAAIAVRFTLQPGETLEIP 325

Query: 359 FSLAWDCPEVKFFEKV-YHRRYTKFYGTLGDSAARIAHDAILEHAKWECEIEAWQRPILE 417
           F ++WD P  +F   V Y+RRYT F+G  GD A  IA  A+ ++  W+ +IE+WQ PIL 
Sbjct: 326 FVVSWDLPVTEFAAGVNYYRRYTDFFGKTGDHAWAIATTALEQYKTWQQQIESWQDPILN 385

Query: 418 DKRFPEWYPITLFNELYYLNAGGTIWTDGSPPMQSLATIRERKFSLDTPSKIHTPASSDT 477
               P+W+ + LFNELY L +GGT+W+  +P                             
Sbjct: 386 RPDLPDWFKMALFNELYDLTSGGTLWSAATPC---------------------------- 417

Query: 478 ALGTRLLENGEENIGQFLYLEGAEYVMYNTYDVHFYSSFALVMLFPKLELSIQRDFAAAV 537
                      + IGQF  LE  +Y  Y + DV  Y SF L+ LFP+LE ++ R FA A+
Sbjct: 418 -----------DPIGQFAVLECLDYRWYESLDVRLYGSFGLLQLFPELEKAVMRAFARAI 466

Query: 538 MMHD--PGTM----KIMSDGKWVARKCLGAVPHDIGLDDP--WFEINSYNLFNSSRWKDL 589
              D  P  +     I ++     RK  GA PHD+G  +   W + N  +  + + WKDL
Sbjct: 467 PQGDDTPRVIGYYYTIGAESPIAVRKTPGATPHDLGAPNEHVWEKTNYTSYQDCNLWKDL 526

Query: 590 NSKFVLQVYRDFVATG--DKNFARAVWPSVYIAMAYMEQFDKDGDGMIENEGFPDQTYDA 647
              FVLQVYRDF+ TG  D  F R  W ++   + Y++ FD DGDG+ EN G PDQT+D 
Sbjct: 527 GCDFVLQVYRDFLLTGADDVQFLRDCWDAIVETLDYVKTFDLDGDGIPENSGAPDQTFDD 586

Query: 648 WSANGVSAYCGGLW----VAALQAASALANDVGDHASASYFWVRYQKAK---------AV 694
           W   GVSAYCGGLW     AA+  +  L  +  D A+  +  + YQK+           +
Sbjct: 587 WRLQGVSAYCGGLWLAALAAAIAISDILLQNHQDSATKEH--LLYQKSTYETWLTKSLPI 644

Query: 695 Y-DSLWNGSYFNYDNSDGSSSTSIQADQLAGQWYARACGLLPIADEAKVKKALTKIYDFN 753
           Y + LWNG Y+  D+  GS    + ADQL GQ+YA    L  I    +   AL  +YD  
Sbjct: 645 YQEKLWNGKYYRLDSESGSD--VVMADQLCGQFYANLLELPDIVPSDRAISALQTVYDAC 702

Query: 754 VLKVKGGMCGAMNGMQPDGRIDM-SGLQAREIWPGVTYGLAASMIQEEMVDMAFQTAAGV 812
            LK   G  GA NG++PDG  +        E+W G+ +GLAA ++Q  M D  F+    V
Sbjct: 703 FLKFYDGQFGAANGVRPDGSPENPKATHPLEVWTGINFGLAAFLVQMGMKDEGFRLTQAV 762

Query: 813 YEVAWSEDGLGYSFQTPESWNNNDEYRSLCYMRPLTIWAM 852
            E  ++    G  F+TPE+      +R+  Y+RP+ IWA+
Sbjct: 763 VEQIYNN---GLQFRTPEAITAAGTFRASTYLRPMAIWAI 799


>gi|428223619|ref|YP_007107716.1| glucosylceramidase [Geitlerinema sp. PCC 7407]
 gi|427983520|gb|AFY64664.1| Glucosylceramidase [Geitlerinema sp. PCC 7407]
          Length = 821

 Score =  392 bits (1006), Expect = e-106,   Method: Compositional matrix adjust.
 Identities = 280/831 (33%), Positives = 397/831 (47%), Gaps = 122/831 (14%)

Query: 87  QGIPLGGIGAGSIGRSYRGEFQRFKLFHGICDDAPVLANQFSVFVSRPNGEKFSSVLCPR 146
            G+PLGG GAG +GRS RG+F  + +  G      + A QFSVF    +G   +  LC  
Sbjct: 39  HGMPLGGFGAGCLGRSPRGDFNLWHIDGGEHVFQSMPACQFSVFEQGADGTSQAYALCSE 98

Query: 147 SPGVPKKNTDSGIESWDW----NLKGENCTYHALFPRAWTVYDGEPDPELRIVCRQISPF 202
           +P          +  W W      +    TY AL+PR  + +D E     R+ C Q SP 
Sbjct: 99  APA-------ESLGQWQWYPASMAERSTGTYSALYPR--SRFDYEGVFRARLACEQFSPI 149

Query: 203 IPHNYKESSFPASVFTFTLSNSGQTSADVTLLFTWANSVA-------------GDSGLSG 249
              NY+ESS+P +VF +T  N       +++L TW N V               D G   
Sbjct: 150 WADNYQESSYPVAVFEWTAHNPTDQLLTLSILLTWQNMVGWFTNALKSPEVKVRDDGSPV 209

Query: 250 HHF---------NSKTMTKDGVHGLTLHHRTANGRPPV----TFAVAAEETADVHVSECP 296
           + +         N      +G     L  R      P      +A+A +        E P
Sbjct: 210 YEYQPRLGESEGNFNQWIAEGAWVGCLMDRAGRAEEPQEGEGQWAIATQ-------CEGP 262

Query: 297 CFLLSGNSKGITAKD---MWNEIKKHGSFDHLDNDKTSPSEPGSSIGAAIAASLTIPSGS 353
              +  +++   A D   +W    + G+ +  +    +P++    IG AIA   T+  G 
Sbjct: 263 GTEIFYHTRWNPAGDGGELWRTFAQTGALE--NRIDGTPAQGEEQIGGAIALRFTLAPGE 320

Query: 354 TRSVTFSLAWDCPEVKFFE-KVYHRRYTKFYGTLGDSAARIAHDAILEHAKWECEIEAWQ 412
           TR V F LAWD P  +F      +RRYT F+G  G +   IA  A+ E++ W+  I AWQ
Sbjct: 321 TRRVPFVLAWDFPVTEFAAGTAAYRRYTDFFGRSGRNGWAIARTALEEYSAWQARIAAWQ 380

Query: 413 RPILEDKRFPEWYPITLFNELYYLNAGGTIWTDGSPPMQSLATIRERKFSLDTPSKIHTP 472
           +PIL+    P W+ + LFNELY L +GGT+W+                            
Sbjct: 381 QPILDRADLPNWFKMALFNELYDLTSGGTLWS---------------------------- 412

Query: 473 ASSDTALGTRLLENGEENIGQFLYLEGAEYVMYNTYDVHFYSSFALVMLFPKLELSIQRD 532
           A+SD            + +GQF  LE  +Y  Y + DV  Y SF L+ML+P+LE S+ R 
Sbjct: 413 AASD-----------RDPVGQFAVLECIDYRWYESLDVRLYGSFGLLMLWPELEKSVMRA 461

Query: 533 FAAAVMMHDPGTMKIM------SDGKWVARKCLGAVPHDIGL--DDPWFEINSYNLFNSS 584
           FA A+   D     I       ++     RK L A PHD+G   + PW + N  +  + +
Sbjct: 462 FARAIPTADDRQRVIGYYYTQGTESPLALRKALDATPHDLGAPNEHPWEKTNYTSYQDCN 521

Query: 585 RWKDLNSKFVLQVYRDFVATG--DKNFARAVWPSVYIAMAYMEQFDKDGDGMIENEGFPD 642
            WKDL S FVLQVYRDF  TG  D  F    WP+V  A+AY++ FD+DGDG+ EN G PD
Sbjct: 522 LWKDLGSDFVLQVYRDFRLTGSTDDAFIAECWPAVTRALAYLKAFDQDGDGIPENSGAPD 581

Query: 643 QTYDAWSANGVSAYCGGLWVAALQAASALA-----NDVGDHASASYFWVRYQ-------- 689
           QT+D W   GVSAYCGGLW+AAL+AA AL      + V DH   S   + +         
Sbjct: 582 QTFDDWRLQGVSAYCGGLWIAALEAAIALGERLQQHPVSDHPELSQGAIAHHLSTYRTWL 641

Query: 690 -KAKAVY-DSLWNGSYFNYDNSDGSSSTSIQADQLAGQWYARACGLLPIADEAKVKKALT 747
            +++AVY D+LWNG Y+  D+  GS S  + ADQL GQ+YAR   L  +    +   AL 
Sbjct: 642 AQSRAVYDDALWNGRYYRLDS--GSGSEVVMADQLCGQFYARLLDLPDVVASDRALSALK 699

Query: 748 KIYDFNVLKVKGGMCGAMNGMQPDGRIDM-SGLQAREIWPGVTYGLAASMIQEEMVDMAF 806
            +YD   L+   G  GA NG++PDG  +        E+W G+ +G+AA ++Q  + D  F
Sbjct: 700 TVYDSCFLRFHDGKFGAANGVKPDGSPENPKATHPLEVWTGINFGIAAFLVQMGLRDEGF 759

Query: 807 QTAAGVYEVAWSEDGLGYSFQTPESWNNNDEYRSLCYMRPLTIWAMQWALT 857
           +    V E  +     G  F+TPE+      +R+  Y+RP+ IWA+   LT
Sbjct: 760 RLTQAVVEQIYEH---GLQFRTPEAITAVGTFRACHYLRPMAIWAVYGVLT 807


>gi|434407175|ref|YP_007150060.1| putative bile acid beta-glucosidase [Cylindrospermum stagnale PCC
           7417]
 gi|428261430|gb|AFZ27380.1| putative bile acid beta-glucosidase [Cylindrospermum stagnale PCC
           7417]
          Length = 816

 Score =  391 bits (1005), Expect = e-106,   Method: Compositional matrix adjust.
 Identities = 270/828 (32%), Positives = 390/828 (47%), Gaps = 122/828 (14%)

Query: 87  QGIPLGGIGAGSIGRSYRGEFQRFKLFHGICDDAPVLANQFSVFVSRPNGEKFSSVLCPR 146
            G+PLGG GAG IGRS RG+F  + +  G      + A QFSVF S     + +  L   
Sbjct: 40  HGMPLGGFGAGCIGRSSRGDFNLWHIDGGEHTFKNIPACQFSVFESHGASSQ-AYALSTE 98

Query: 147 SPGVPKKNTDSGIESWDWNLKG------------ENCTYHALFPRAWTVYDGEPDPELRI 194
            P       D  + +W W  +             +  TYHAL+PR+W VY+     +L  
Sbjct: 99  PP------EDGTLSAWQWYPRSAESKAGKMPTPQDTGTYHALYPRSWFVYENVFQAQLS- 151

Query: 195 VCRQISPFIPHNYKESSFPASVFTFTLSNSGQTSADVTLLFTWANSVA------------ 242
            C Q SP   +NY+E+S+P +VF +   N       ++++ TW N V             
Sbjct: 152 -CEQFSPVWANNYQETSYPVAVFNWKAHNPTDAPITLSIMLTWENMVGWFTNALKSPKVQ 210

Query: 243 -GDSGLSGHHFNSKTMTKDGVHGLTLHH---------RTANGRPPV----TFAVAAEETA 288
             D G   + +  +     G +   + +         R A+  P      T+ +A  +  
Sbjct: 211 IRDDGSPVYEYQPRLGKSQGNYHQLIENTQQVGCVLGRVASDAPVQEGDGTWCIATRKHP 270

Query: 289 DVHVSECPCFLLSGNSKGITAKDMWNEIKKHGSFDHLDNDKTSPSEPGSSIGAAIAASLT 348
            + V     F  +  +   T  ++W      GS  +   D+T+ +E  + IGAA+A   T
Sbjct: 271 LIEV-----FHHTRWNPVGTGDEVWQSFAADGSLPNY-VDETAAAE-NTRIGAALAVRFT 323

Query: 349 IPSGSTRSVTFSLAWDCPEVKFFEKV-YHRRYTKFYGTLGDSAARIAHDAILEHAKWECE 407
           +  G    V F LAWD P  +F   V Y+RRYT F+G  G +A  IA  ++ E+  W  +
Sbjct: 324 LQPGECLEVPFVLAWDFPVTEFAAGVNYYRRYTDFFGRSGQNAEAIAFLSLQEYQNWRSQ 383

Query: 408 IEAWQRPILEDKRFPEWYPITLFNELYYLNAGGTIWTDGSPPMQSLATIRERKFSLDTPS 467
           I+ WQ+PIL  +  P W+ + LFNELY L +GGT+W+  S               LD   
Sbjct: 384 IQNWQKPILVREDLPPWFKMALFNELYDLTSGGTLWSAAS--------------ELDP-- 427

Query: 468 KIHTPASSDTALGTRLLENGEENIGQFLYLEGAEYVMYNTYDVHFYSSFALVMLFPKLEL 527
                                   GQF  LE  +Y  Y + DV  Y SFAL+MLFP+LE 
Sbjct: 428 -----------------------FGQFAVLECLDYRWYESLDVRLYGSFALLMLFPELEK 464

Query: 528 SIQRDFAAAVMMHDPGT------MKIMSDGKWVARKCLGAVPHDIGLDDP--WFEINSYN 579
           S+ R FA A+   D           I +D     RK  GA PHD+G  +   W + N   
Sbjct: 465 SVIRAFARAIPQSDDHQRIIGYYYTIGADTTTAVRKIAGATPHDLGAPNEHVWEKTNYTC 524

Query: 580 LFNSSRWKDLNSKFVLQVYRDFVATG--DKNFARAVWPSVYIAMAYMEQFDKDGDGMIEN 637
             + + WKDL S FVLQVYRDF+ TG  D  F    W ++   + Y++ FDKDGDG+ EN
Sbjct: 525 YQDCNLWKDLGSDFVLQVYRDFLFTGADDVEFLADCWDAIVQTLDYLKGFDKDGDGIPEN 584

Query: 638 EGFPDQTYDAWSANGVSAYCGGLWVAALQAASALANDVGDHASASYFWVRYQKAKAVY-- 695
            G PDQT+D W   GVSAYCGGLW+AAL+ A A+ + +  +   +      +  +++Y  
Sbjct: 585 SGAPDQTFDDWRLQGVSAYCGGLWLAALECAIAICDILLTNRQDAKNAKNLETQRSIYGA 644

Query: 696 ----------DSLWNGSYFNYDNSDGSSSTSIQADQLAGQWYARACGLLPIADEAKVKKA 745
                     + LWNG Y+  D+  G+    + ADQL GQ+YAR  GL  I    +   A
Sbjct: 645 WLAQSLTVYEEKLWNGQYYRLDSESGTE--VVMADQLCGQFYARLLGLPDIVKCDRALSA 702

Query: 746 LTKIYDFNVLKVKGGMCGAMNGMQPDGRIDM-SGLQAREIWPGVTYGLAASMIQEEMVDM 804
           L  +YD   LK   G  GA NG++PDG  +  +     E+W G+ +GLAA ++Q  M D 
Sbjct: 703 LQTVYDACFLKFFDGQFGAANGVRPDGVPENPNATHPLEVWTGINFGLAAFLVQMGMQDE 762

Query: 805 AFQTAAGVYEVAWSEDGLGYSFQTPESWNNNDEYRSLCYMRPLTIWAM 852
           A +    V    +     G  F+TPE+      +R+  Y+RP+ IWA+
Sbjct: 763 ALRLTQAVVSQIYDN---GLQFRTPEAITTTGTFRASTYLRPMAIWAI 807


>gi|427725201|ref|YP_007072478.1| glucosylceramidase [Leptolyngbya sp. PCC 7376]
 gi|427356921|gb|AFY39644.1| Glucosylceramidase [Leptolyngbya sp. PCC 7376]
          Length = 807

 Score =  390 bits (1003), Expect = e-105,   Method: Compositional matrix adjust.
 Identities = 267/829 (32%), Positives = 389/829 (46%), Gaps = 120/829 (14%)

Query: 87  QGIPLGGIGAGSIGRSYRGEFQRFKLFHGICDDAPVLANQFSVFVSRPNGEKFSSVLCPR 146
            G+PLGG GAG IGRS RG+F  + L  G        A QF V+     GE  +  +   
Sbjct: 41  HGMPLGGFGAGCIGRSPRGDFNLWHLDAGEHTFRNAGACQFGVYEYVEGGEAQAYAMATE 100

Query: 147 SPGVPKKNTDSGIESWDWNLKGENCTYHALFPRAWTVYDGEPDPELRIVCRQISPFIPHN 206
            P       D  +ESWDW    E  TY AL+PR+W  Y    + EL   C Q SP +  +
Sbjct: 101 PP------DDGTLESWDW-YPTEKGTYSALYPRSWYEYKDVFNVEL--TCEQFSPIVAES 151

Query: 207 YKESSFPASVFTFTLSNSGQTSADVTLLFTWANSVA-------------GDSGLSGHHFN 253
           Y+E+S+P ++F ++  N       V+++ +W N V               D G   + + 
Sbjct: 152 YQETSYPTAIFEWSAHNPTDKPITVSIMMSWQNMVGWFTKATKTPKVELRDDGSPVYEYQ 211

Query: 254 SKTMTKDG---------------VHGLTLHHRTANGRPPVTFAVAAEETADVHVSECPCF 298
           S+     G               +  +  H + + G   + FA     T +V       F
Sbjct: 212 SRWGDSTGNLNQWIQDRYRVGCLLTRVRPHDQISEGEGQICFATVTNPTLEV-------F 264

Query: 299 LLSGNSKGITAKDMWNEIKKHGSFDHLDNDKTSPSEPGSSIGAAIAASLTIPSGSTRSVT 358
             +  +     +D+WN    +GS   ++++  +P+ PG  I  AIA   T+  G T+ + 
Sbjct: 265 YHNRWNPSGDGRDVWNYFAGNGSLPDVEDE--TPASPGEQIAGAIAVRFTVRPGRTKKIP 322

Query: 359 FSLAWDCPEVKFFEKV-YHRRYTKFYGTLGDSAARIAHDAILEHAKWECEIEAWQRPILE 417
           F LAWD P ++F   + Y RRYT F+G  G +   I   A+     W+  IE WQ+P L 
Sbjct: 323 FFLAWDFPVMEFAPNITYFRRYTDFFGRSGHNVWSIIRTAMKHSDLWKERIEEWQKPFLS 382

Query: 418 DKRFPEWYPITLFNELYYLNAGGTIWTDGSPPMQSLATIRERKFSLDTPSKIHTPASSDT 477
               P+W+ ++L NELY L  GG++WT  +                              
Sbjct: 383 RADLPDWFKMSLCNELYLLAQGGSLWTAATE----------------------------- 413

Query: 478 ALGTRLLENGEENIGQFLYLEGAEYVMYNTYDVHFYSSFALVMLFPKLELSIQRDFAAAV 537
                     ++ +GQF  LE  +Y  Y + DV  Y SFAL ML+PKL+ S+   FA A+
Sbjct: 414 ----------DDPVGQFGVLECLDYSWYESLDVRMYGSFALAMLWPKLDKSVLEAFARAI 463

Query: 538 MMHDP-----GTMKIMS-DGKWVARKCLGAVPHDIGL--DDPWFEINSYNLFNSSRWKDL 589
              D      G  K+M  D    ARK  GA PHD+G   + PW   N     + + WKDL
Sbjct: 464 PSSDDTQRAIGYNKLMGYDNFMAARKRAGATPHDLGAPNEHPWEATNYTAYQDCNLWKDL 523

Query: 590 NSKFVLQVYRDFVATGDKN--FARAVWPSVYIAMAYMEQFDKDGDGMIENEGFPDQTYDA 647
            + FVLQVYRDFV TG K+  F  + W  V  A+ Y++ FD D DG+ EN G PDQT+D 
Sbjct: 524 GADFVLQVYRDFVMTGSKDTEFLWSCWEGVVQALDYLKTFDLDCDGIPENGGSPDQTFDD 583

Query: 648 WSANGVSAYCGGLW----------VAALQAASALANDVGDHASASYF---WVRYQ----K 690
           W   G+SAYCGGLW             L+  +++  +  +  S   F     +YQ    +
Sbjct: 584 WKLKGISAYCGGLWIAALEAAIAIANILKNKASIYEEHQNQVSTEEFEAAIAKYQDWLKQ 643

Query: 691 AKAVY-DSLWNGSYFNYDNSDGSSSTSIQADQLAGQWYARACGLLPIADEAKVKKALTKI 749
           ++ +Y D+LWNGSY+  D   GS S  + +DQL G++Y +   L  +  E +    L  I
Sbjct: 644 SRPLYQDTLWNGSYYTLDT--GSDSKVVMSDQLCGEFYTQLLNLPNVNPENRTAITLQTI 701

Query: 750 YDFNVLKVKGGMCGAMNGMQPDGRIDMSG-LQAREIWPGVTYGLAASMIQEEMVDMAFQT 808
           Y+   LK   G  GA NG+ PDG  +        E+W G+ +G+AA MI+  M D A + 
Sbjct: 702 YEACFLKFHDGQFGAANGLNPDGSPEKENDTHPLEVWTGINFGIAAFMIRNGMKDEALKM 761

Query: 809 AAGVYEVAWSEDGLGYSFQTPESWNNNDEYRSLCYMRPLTIWAMQWALT 857
              V    +S    G  F+TPE+   N+ +R+  Y+R + IWA+  ALT
Sbjct: 762 TEAVIHQVYSN---GLQFRTPEAITANNTFRACHYLRAMGIWAIYDALT 807


>gi|414077215|ref|YP_006996533.1| beta-glucosidase [Anabaena sp. 90]
 gi|413970631|gb|AFW94720.1| putative beta-glucosidase [Anabaena sp. 90]
          Length = 795

 Score =  390 bits (1002), Expect = e-105,   Method: Compositional matrix adjust.
 Identities = 274/818 (33%), Positives = 398/818 (48%), Gaps = 117/818 (14%)

Query: 87  QGIPLGGIGAGSIGRSYRGEFQRFKLFHGICDDAPVLANQFSVFVSRPNGEKFSSVLCPR 146
            G+PLGG GAG IGRS RG+F  + +  G      V A QFS+F S   G+ ++    P 
Sbjct: 38  HGMPLGGFGAGCIGRSSRGDFNLWHIDGGEHTFKTVPACQFSIFES---GQAYALSTQP- 93

Query: 147 SPGVPKKNTDSGIESWDWNLKGE---NCTYHALFPRAWTVYDGEPDPELRIVCRQISPFI 203
               P+ +T   ++SW W  + +   + TYHAL+PR+W VY+     +L   C Q SP  
Sbjct: 94  ----PEDHT---LQSWQWYPQSQEQNSGTYHALYPRSWFVYENVLKTQL--TCEQFSPIW 144

Query: 204 PHNYKESSFPASVFTFTLSNSGQTSADVTLLFTWANSVA-------------GDSGLSGH 250
             NY+ESS+P +VF +   N       ++++ TW N V               D G   +
Sbjct: 145 ADNYQESSYPVAVFLWKAHNPTNAPITISIMLTWENMVGWFTNTLKSPEVRVRDDGSPVY 204

Query: 251 HF---------NSKTMTKD-----GVHGLTLHHRTANGRPPVTFAVAAEETADVHVSECP 296
            +         N   + +D      V G    H+T       T+ +A  +   V +    
Sbjct: 205 EYLPRWGESQGNYNCLAEDHQYFGCVLGQVSDHQTVQ-EGDGTWCIATAKNPQVEIFYHS 263

Query: 297 CFLLSGNSKGITAKDMWNEIKKHGSFDHLDNDKTSPSEPGSSIGAAIAASLTIPSGSTRS 356
            F   GN      +D+W     +GS  +  ++  +P++  + +GAAIA   T+  G    
Sbjct: 264 RFNPVGN-----GEDVWQSFSANGSLPNYIDE--TPADENTRLGAAIAVRFTLQPGENLE 316

Query: 357 VTFSLAWDCPEVKFFEKV-YHRRYTKFYGTLGDSAARIAHDAILEHAKWECEIEAWQRPI 415
           V   L+WD P  +F   V Y+RRYT F+G  G++A +IA  A+ E+  W   I+ WQ PI
Sbjct: 317 VPLVLSWDFPVTEFAAGVNYYRRYTDFFGCNGENAWQIATTALKEYQNWRSLIQDWQNPI 376

Query: 416 LEDKRFPEWYPITLFNELYYLNAGGTIWTDGSPPMQSLATIRERKFSLDTPSKIHTPASS 475
           L     P+W+ + LFNELY L +GGT+W+                            A+S
Sbjct: 377 LAKSDLPDWFKMALFNELYDLTSGGTLWS----------------------------AAS 408

Query: 476 DTALGTRLLENGEENIGQFLYLEGAEYVMYNTYDVHFYSSFALVMLFPKLELSIQRDFAA 535
           +            + +GQF  LE  +Y  Y + DV  Y SF L+MLFP+LE S+ R FA 
Sbjct: 409 EI-----------DPVGQFAVLECLDYRWYESLDVRLYGSFGLLMLFPELEKSVIRAFAR 457

Query: 536 AVMMHDPGT------MKIMSDGKWVARKCLGAVPHDIGLDDP--WFEINSYNLFNSSRWK 587
           A+   D  T      + I S+     RK  GA PHD+G  +   W + N  +  + + WK
Sbjct: 458 AIPQSDDRTRIIGYYLTIKSESPNAVRKIAGATPHDLGAPNEHVWEKTNYTSYQDCNLWK 517

Query: 588 DLNSKFVLQVYRDFVATG--DKNFARAVWPSVYIAMAYMEQFDKDGDGMIENEGFPDQTY 645
           DL   FVLQVYRDF+ TG  D  F    W ++   + Y++ FD DGD + EN G PDQT+
Sbjct: 518 DLGCDFVLQVYRDFLFTGANDLQFLADCWDAIVQTLDYVKTFDLDGDSIPENSGAPDQTF 577

Query: 646 DAWSANGVSAYCGGLWVAALQAASAL---------ANDVGDHASASYFWVRYQKAKAVY- 695
           D W   GVSAYCGGLW+AAL+AA A+         A D       S +     ++K +Y 
Sbjct: 578 DDWRLQGVSAYCGGLWLAALEAAIAISKILTNRRGAEDAEVEVQGSIYQGWLSQSKPIYQ 637

Query: 696 DSLWNGSYFNYDNSDGSSSTSIQADQLAGQWYARACGLLPIADEAKVKKALTKIYDFNVL 755
           + LWNG Y+  D+  GS    + ADQL GQ+YAR   L  I    +   ALTK+YD   L
Sbjct: 638 EKLWNGEYYRLDSDSGSD--VVMADQLCGQFYARLLSLPDIVPSDRALSALTKVYDSCFL 695

Query: 756 KVKGGMCGAMNGMQPDGRIDM-SGLQAREIWPGVTYGLAASMIQEEMVDMAFQTAAGVYE 814
           K + G  GA NG+ P+G  +  +     E+W G+ +GLAA ++Q  M + A +    V  
Sbjct: 696 KFQNGEFGAANGVLPNGLPENPNSTHPLEVWTGINFGLAAFLVQMNMKNEAMRLTEAVVR 755

Query: 815 VAWSEDGLGYSFQTPESWNNNDEYRSLCYMRPLTIWAM 852
             +     G  F+TPE+   N  +R+  Y+R + IW +
Sbjct: 756 QIYDN---GLQFRTPEAITANGTFRASTYLRAMAIWGI 790


>gi|186681980|ref|YP_001865176.1| hypothetical protein Npun_R1544 [Nostoc punctiforme PCC 73102]
 gi|186464432|gb|ACC80233.1| protein of unknown function DUF608 [Nostoc punctiforme PCC 73102]
          Length = 801

 Score =  388 bits (996), Expect = e-104,   Method: Compositional matrix adjust.
 Identities = 275/818 (33%), Positives = 396/818 (48%), Gaps = 113/818 (13%)

Query: 87  QGIPLGGIGAGSIGRSYRGEFQRFKLFHGICDDAPVLANQFSVFVSRPNGEKFSSVLCPR 146
            G+PLGG GAG IGRS RG+F  + +  G      V A QFSVF S     + +  L  +
Sbjct: 40  HGMPLGGFGAGCIGRSSRGDFNLWHIDGGEHTFKNVPACQFSVFESNGTSTQ-AYALSTQ 98

Query: 147 SPGVPKKNTDSGIESWDW--NLKGENCT--YHALFPRAWTVYDGEPDPELRIVCRQISPF 202
            P       D  +++W W   L     T  YHAL+PR+W VY  E   + ++ C Q SP 
Sbjct: 99  GP------NDGALKAWQWYPTLPSTEGTGNYHALYPRSWFVY--ENIFQAQLTCEQFSPV 150

Query: 203 IPHNYKESSFPASVFTFTLSNSGQTSADVTLLFTWANSVA-------------GDSGLSG 249
              NY+E+S+P ++F +   N       ++++ TW N V               D G   
Sbjct: 151 WAGNYQETSYPVAIFLWNAHNPTNAPITLSIMLTWQNMVGWFTNALKSPEVRVRDDGSPV 210

Query: 250 HHFNSKTMTKDGVHGLTLHHRTANG------------RPPVTFAVAAEETADVHVSECPC 297
           + +  +     G +   + +    G                T+ +A  +   V +     
Sbjct: 211 YEYQPRWGESQGNYNQIVENPQQFGCLLSRVGSDALQEGDGTWCIATLKHPQVEI----F 266

Query: 298 FLLSGNSKGITAKDMWNEIKKHGSF-DHLDNDKTSPSEPGSSIGAAIAASLTIPSGSTRS 356
           +    N++G T  ++W    K GS  +++D    +P   G  +GAA+A   T+  G T  
Sbjct: 267 YHTRWNAEG-TGDEVWQSFAKDGSLSNYID---ANPVGEGEQLGAAMALRFTLQPGETLE 322

Query: 357 VTFSLAWDCPEVKFFEKV-YHRRYTKFYGTLGDSAARIAHDAILEHAKWECEIEAWQRPI 415
           + F LAWD P  +F   V Y+RRYT F+   G++A  IA  A+ E+  W  +I+ WQ+PI
Sbjct: 323 IPFVLAWDLPITEFAAGVNYYRRYTDFFDRSGNNAWAIASTALQEYQTWRSQIQTWQQPI 382

Query: 416 LEDKRFPEWYPITLFNELYYLNAGGTIWTDGSPPMQSLATIRERKFSLDTPSKIHTPASS 475
           L+ +  P W+ + LFNELY L +GGT+W+  S               LD           
Sbjct: 383 LDREDLPNWFKMALFNELYDLTSGGTLWSAAS--------------ELDP---------- 418

Query: 476 DTALGTRLLENGEENIGQFLYLEGAEYVMYNTYDVHFYSSFALVMLFPKLELSIQRDFAA 535
                          IGQF  LE  +Y  Y + DV  Y SFAL+MLFP+LE S+ R FA 
Sbjct: 419 ---------------IGQFAVLECLDYRWYESLDVRLYGSFALLMLFPELEKSVMRAFAR 463

Query: 536 AVMMHDPGT------MKIMSDGKWVARKCLGAVPHDIGLDDP--WFEINSYNLFNSSRWK 587
           A+   D         M I ++     RK  GA PHD+G  +   W + N  +  + + WK
Sbjct: 464 AIPQGDDTPRIIGYYMTIKAESPIAVRKVAGATPHDLGAPNEHVWEKTNYTSYQDCNLWK 523

Query: 588 DLNSKFVLQVYRDFVATG--DKNFARAVWPSVYIAMAYMEQFDKDGDGMIENEGFPDQTY 645
           DL S FVLQVYRDF+ TG  D  F    W ++   + Y++ FD DGDG+ EN G PDQT+
Sbjct: 524 DLGSDFVLQVYRDFLLTGADDVEFLADCWNAIVQTLDYLKTFDLDGDGIPENSGAPDQTF 583

Query: 646 DAWSANGVSAYCGGLWVAALQAASA-----LANDVGD----HASASYFWVRYQKAKAVY- 695
           D W   GVSAYCGGLW+AAL+AA A     L + +GD        S +    Q++  +Y 
Sbjct: 584 DDWRLQGVSAYCGGLWLAALEAAIAISDLLLTHKLGDLGGLAVQKSIYETWLQQSLPIYQ 643

Query: 696 DSLWNGSYFNYDNSDGSSSTSIQADQLAGQWYARACGLLPIADEAKVKKALTKIYDFNVL 755
           + LWNG Y+  D+  GS    + ADQL GQ+YAR   L  I    +   AL  +YD   L
Sbjct: 644 EKLWNGQYYQLDSQSGSD--VVMADQLCGQFYARLLDLPDIVPSDRALSALKTVYDACFL 701

Query: 756 KVKGGMCGAMNGMQPDGRIDM-SGLQAREIWPGVTYGLAASMIQEEMVDMAFQTAAGVYE 814
           K   G  G  NG++PDG  +        E+W G+ +GLAA ++Q  M D A +    V +
Sbjct: 702 KFCNGEFGTANGVRPDGSPENPKATHPLEVWTGINFGLAAFLVQMGMKDEALRLTQAVVQ 761

Query: 815 VAWSEDGLGYSFQTPESWNNNDEYRSLCYMRPLTIWAM 852
             + E+GL   F+TPE+      +R+  Y+R + IW +
Sbjct: 762 QIY-ENGL--QFRTPEAITAAGTFRASTYLRAMAIWGI 796


>gi|427709805|ref|YP_007052182.1| glucosylceramidase [Nostoc sp. PCC 7107]
 gi|427362310|gb|AFY45032.1| Glucosylceramidase [Nostoc sp. PCC 7107]
          Length = 802

 Score =  387 bits (995), Expect = e-104,   Method: Compositional matrix adjust.
 Identities = 270/819 (32%), Positives = 392/819 (47%), Gaps = 103/819 (12%)

Query: 87  QGIPLGGIGAGSIGRSYRGEFQRFKLFHGICDDAPVLANQFSVFVSRPNGEKFSSVLCPR 146
            G+PLGG GAG IGRS RG+F  + +  G      V A QFSVF S     +  ++    
Sbjct: 40  HGMPLGGFGAGCIGRSSRGDFNLWHIDGGEHIFQNVPACQFSVFESYGTSSQAYAL---- 95

Query: 147 SPGVPKKNTDSGIESWDW----NLKGENCTYHALFPRAWTVYDGEPDPELRIVCRQISPF 202
                +   D  + +W W    +      TYHAL+PR+W VY+     +L   C Q SP 
Sbjct: 96  ---STQSLEDGSLNAWQWYPASSESTSTGTYHALYPRSWFVYENVFQSQL--TCEQFSPI 150

Query: 203 IPHNYKESSFPASVFTFTLSNSGQTSADVTLLFTWANSVA-------------GDSGLSG 249
             +NY+E+S+P +VF +   N       ++++ TW N V               D G   
Sbjct: 151 WANNYQETSYPVAVFLWKAHNPSNAPITLSIMLTWENMVGWFTNALKSPEVKIRDDGSPV 210

Query: 250 HHFNSKTMTKDGVHGLTLHHRTANG--------RPPVTFAVAAEETADVHVSECPCFLLS 301
           + +  +     G +   + +    G          PV     +   A V   +   F  S
Sbjct: 211 YEYQPRWGESQGNYNQVVENSEYFGCFLGRVGITAPVQEGDGSWCIATVKHPQVEIFHHS 270

Query: 302 GNSKGITAKDMWNEIKKHGSF-DHLDNDKTSPSEPGSSIGAAIAASLTIPSGSTRSVTFS 360
                 T  ++W      GS  ++LD   T+P      +G+AIA    +  G T  + F 
Sbjct: 271 KWHPEGTGDEVWQSFAADGSLPNYLD---TNPVGENQRLGSAIAVRFNVQPGETLEIPFV 327

Query: 361 LAWDCPEVKFFEKV-YHRRYTKFYGTLGDSAARIAHDAILEHAKWECEIEAWQRPILEDK 419
           LAWDCP  +F   V Y+RRYT F+G  G+ A  IA  A+  +  W  +I++WQ PIL  +
Sbjct: 328 LAWDCPVTEFAAGVNYYRRYTDFFGKDGNQAWEIATTALGNYQTWRSQIQSWQAPILNRE 387

Query: 420 RFPEWYPITLFNELYYLNAGGTIWTDGSPPMQSLATIRERKFSLDTPSKIHTPASSDTAL 479
             P W+ + LFNELY L +GGT+W+                            A+S+   
Sbjct: 388 DLPAWFKMALFNELYDLTSGGTLWS----------------------------AASEI-- 417

Query: 480 GTRLLENGEENIGQFLYLEGAEYVMYNTYDVHFYSSFALVMLFPKLELSIQRDFAAAVMM 539
                    + IGQF  LE  +Y  Y + DV  Y SF L++LFP+LE S+ R FA  +  
Sbjct: 418 ---------DPIGQFAVLECLDYRWYESLDVRLYGSFGLLLLFPELEKSVMRAFARGIPQ 468

Query: 540 HDPGT------MKIMSDGKWVARKCLGAVPHDIGLDDP--WFEINSYNLFNSSRWKDLNS 591
            D           I +D    ARK  GA PHD+G  +   W + N     + + WKDL+ 
Sbjct: 469 SDDHQRIIGYYYTIGADTTTAARKVAGATPHDLGAPNEHVWEKTNYTCYQDCNLWKDLSC 528

Query: 592 KFVLQVYRDFVATG--DKNFARAVWPSVYIAMAYMEQFDKDGDGMIENEGFPDQTYDAWS 649
            FVLQVYRD++ TG  D  F    W +V   + Y++ FDKDGDG+ EN G PDQT+D W 
Sbjct: 529 DFVLQVYRDYLLTGADDVEFLADCWDAVVQTLDYLKTFDKDGDGIPENSGAPDQTFDDWR 588

Query: 650 ANGVSAYCGGLWVAALQAASALAND-VGDH-------ASASYFWVRYQKAKAVY-DSLWN 700
             GVSAYCGGLW+AAL+AA A+++  + +H          S + V   +++ +Y + LWN
Sbjct: 589 LLGVSAYCGGLWLAALEAAIAISDILITNHKVVESAEKQKSIYEVWLAQSRPIYQEKLWN 648

Query: 701 GSYFNYDNSDGSSSTSIQADQLAGQWYARACGLLPIADEAKVKKALTKIYDFNVLKVKGG 760
           G Y+  D+  GS+   + ADQL GQ+YAR   L  I    +   AL  +YD   LK + G
Sbjct: 649 GQYYRLDSESGSA--VVMADQLCGQFYARLLELPDIVPSDRALSALKTVYDACFLKFQNG 706

Query: 761 MCGAMNGMQPDGRIDM-SGLQAREIWPGVTYGLAASMIQEEMVDMAFQTAAGVYEVAWSE 819
             GA NG+ PDG  +        E+W G+ +GLAA ++Q  M D A +    V E  ++ 
Sbjct: 707 EFGAANGVLPDGSPENPKATHPLEVWTGINFGLAAFLVQMGMQDEALRLTQAVVEQVYNN 766

Query: 820 DGLGYSFQTPESWNNNDEYRSLCYMRPLTIWAMQWALTK 858
              G  F+TPE+      +R+  Y+R + IW +   ++K
Sbjct: 767 ---GLQFRTPEAITATGTFRASTYLRAMAIWGIYLVMSK 802


>gi|256082537|ref|XP_002577511.1| bile acid beta-glucosidase-related [Schistosoma mansoni]
          Length = 892

 Score =  386 bits (991), Expect = e-104,   Method: Compositional matrix adjust.
 Identities = 283/852 (33%), Positives = 420/852 (49%), Gaps = 87/852 (10%)

Query: 49  FRHLAHIGFRLYRYSKQEEAKGKVPVFNVFRKHHITSDQGIPLGGIGAGSIGRSYRGEFQ 108
           F  L  +  + + Y K+   + ++P  +           G+P+GGIG+GSIGR YRGEF 
Sbjct: 63  FSKLRAVFSKRFFYIKKRFIQHRLPFIDAVSHVPWRPIYGVPIGGIGSGSIGRGYRGEFC 122

Query: 109 RFKLFHGIC--DDAPVLANQFSVFVSRPNGEKFSSVLCP--RSPGVPKKNTDSGIESWDW 164
           R  L  G+   D  PV  +QF V V +     +  VL P  + P   KK     ++ W W
Sbjct: 123 RSSLIPGMYSYDVQPV--DQFIVTVRKNGVTVYHQVLSPLTKPPSNAKK-----LKKWKW 175

Query: 165 NLKGENCTYHALFPRAWTVYDGEPDPELRIVCRQISPFIPHNYKESSFPASVFTFTLSNS 224
               E   Y  L+PR+WTVY+  P+ +L +V +QISP IPH+Y+ +  P +VF + + + 
Sbjct: 176 GFDPEKGHYLGLYPRSWTVYE-IPELQLVLVYQQISPVIPHDYQTTCLPVAVFYWKVISW 234

Query: 225 GQTSADVTLLFTWANSVAGDSGLSGHHFNSKTMTKDGVHGLTLHHRTA-NGRPPVTFAVA 283
            + +  VT+ F+W          S     S+       +    H  TA N  P  +    
Sbjct: 235 NKENLKVTITFSWHGPNHRKRTKSSDFSKSEDGNNSSENTTNTHVSTAENAFPTCSQRST 294

Query: 284 AEETADVHVSECPCFLLSGNSKGITAKDMWNEIKKHGSFDHLDNDKTSPSEPGSSIGAAI 343
           A  T D          L  N +G   K    ++   G          S ++    +  A+
Sbjct: 295 AFSTGD---------FLKLNCQGKKLKTF-TDLSWLGWLVGYTVPLDSKAKKSPKLAIAV 344

Query: 344 AASLTIP--------SGSTRSVTFSLAWDCPEVKFF--EKVYHRRYTKFYGTLGDSAARI 393
           +A+  +P        + S   + F++ W  P VKF   + VY RRY +++     S A++
Sbjct: 345 SATTNVPRCNITDNLNPSQSELEFAITWHSPIVKFRTGDVVYTRRYVRWFPVDRISGAKL 404

Query: 394 --AHDAILEHAKWECEIEAWQRPILEDKRFPEWYPITLFNELYYLNAGGTIWTDGSPPMQ 451
             AH AI    +W  +IE WQ PIL ++  P WY   LFNELYYL+ GGT+W D   P+Q
Sbjct: 405 LLAH-AIDNWRQWVQKIEDWQNPILSNQSLPSWYKSALFNELYYLSDGGTVWLD---PIQ 460

Query: 452 SLATIRERKFSLDTPSKIHTPASSDTALGTRLLENGEENIGQFLYLEGAEYVMYNTYDVH 511
              T   +  S D  +++                     IG F YLEG EY MYNTYDVH
Sbjct: 461 VDLTEDMKVDSWDHRARL------------------SREIGLFGYLEGHEYRMYNTYDVH 502

Query: 512 FYSSFALVMLFPKLELSIQRDFAAAVMMHDPGTMKIMSDGKWVARKCLGAVPHDIG--LD 569
            Y+S+AL+ L+PK++L++  D A   +  D  ++  + +GK + R    AV HD G   D
Sbjct: 503 HYASWALIKLWPKIQLALNYDCADLTIAEDSTSVYYIGNGKALFRSSECAVVHDFGDPED 562

Query: 570 DPWFEINSYNLFNSSRWKDLNSKFVLQVYRDFVATGDKNFARAVWPSVY-IAMAYMEQFD 628
           +PW   N+Y +F +  WKDLNSKF+LQV+RD+  T D  +   + P V  I    +  +D
Sbjct: 563 EPWRCTNAYIMFPTDTWKDLNSKFILQVWRDWRITQDHQYLLYMLPIVLRILRKSLVAWD 622

Query: 629 KDGDGMIENEGFPDQTYDAWSANGVSAYCGGLWVAALQAA---------------SALAN 673
            D DG+IEN GFPDQTYD W+A G++AY GG+W++ L A                  + N
Sbjct: 623 SDDDGLIENSGFPDQTYDVWTAKGLTAYTGGMWLSCLYATFDMLSWCLKSDSPVYDQIVN 682

Query: 674 DVGDHASASYFWVR------YQKAKAVYDS-LWNGSYFNYDNSDGSSSTSIQADQLAGQW 726
           +  D    S+  ++      + KA+  Y++ LW GSY+ +          I ADQL+G W
Sbjct: 683 NTDD-TQRSWSEIKDEVQTLFTKARDSYNAKLWTGSYYAFQTYCTPRREVIMADQLSGYW 741

Query: 727 YARACGLLP--IADEAKVKKALTKIYDFNVLKVKGGMCGAMNGMQPDGRIDMSGLQAREI 784
           ++R  G+ P  I  +  V K L  I + N   +K G  GA+NG  P  + D+S +QA E 
Sbjct: 742 FSRITGVPPNLILPKNHVVKTLQTISNCNWHGIKNGTIGAINGCLPVCKPDLSSVQAEEF 801

Query: 785 WPGVTYGLAASMIQEEMVDMAFQTAAGVYEVAWSEDGLGYSFQTPESWNNNDEYRSLCYM 844
           W  V Y L++ MI E M+D         Y+  ++  GL Y  QTPE++ ++  +R   YM
Sbjct: 802 WVAVNYSLSSLMIAEGMIDEGLALGEKCYDTIYNLYGLQY--QTPEAYMSDGRFRCPGYM 859

Query: 845 RPLTIWAMQWAL 856
           R L IW++Q AL
Sbjct: 860 RALAIWSIQQAL 871


>gi|428317101|ref|YP_007114983.1| Glucosylceramidase [Oscillatoria nigro-viridis PCC 7112]
 gi|428240781|gb|AFZ06567.1| Glucosylceramidase [Oscillatoria nigro-viridis PCC 7112]
          Length = 809

 Score =  385 bits (989), Expect = e-104,   Method: Compositional matrix adjust.
 Identities = 280/829 (33%), Positives = 395/829 (47%), Gaps = 115/829 (13%)

Query: 87  QGIPLGGIGAGSIGRSYRGEFQRFKLFHGICDDAPVLANQFSVFVSRPNGEKFSSVLCPR 146
            G+PLGG GAG IGRS RG+F  + +  G      + A QFSVF     G+K +  L   
Sbjct: 41  HGMPLGGFGAGCIGRSSRGDFNLWHIDGGEHIFRSLPACQFSVF-EETQGKKQAFALSTE 99

Query: 147 SPGVPKKNTDSGIESWDW-------------NLKGENCTYHALFPRAWTVYDGEPDPELR 193
            P      TD  + SW W               +  + TYHAL+PR+W VY+     +L 
Sbjct: 100 PP------TDGSLSSWKWYPSEGRRKKEEGRREESTSGTYHALYPRSWFVYENVFQAQLS 153

Query: 194 IVCRQISPFIPHNYKESSFPASVFTFTLSNSGQTSADVTLLFTWANSVA----------- 242
             C Q SP +P+NY+ESS+P ++F +   N       ++++ TW N V            
Sbjct: 154 --CEQFSPILPNNYQESSYPVAMFEWVAHNPTDAPITLSIMLTWENIVGWFTNANKSPKV 211

Query: 243 --GDSGLSGHHFNSK--------TMTKDGVH--GLTLHHRTANGRPPV---TFAVAAEET 287
              D G   + +  +         +  +  H  G  +   TA+  P      +A+A    
Sbjct: 212 KVRDDGSPFYDYQPRWGESAGNLNLLVEDFHRIGCVMTRLTADDEPQEGDGQWAIATITN 271

Query: 288 ADVHVSECPCFLLSGNSKGITAKDMWNEIKKHGSFDHLDNDKTSPSEPGSSIGAAIAASL 347
             V V     +  +GN       D+W      G    LD+   + +  G  +  AIA   
Sbjct: 272 PAVEVFYQTRWNPAGN-----GDDIWRNFSAEGFL--LDDRDETAAAAGEQLACAIAVRF 324

Query: 348 TIPSGSTRSVTFSLAWDCPEVKFFEKV-YHRRYTKFYGTLGDSAARIAHDAILEHAKWEC 406
           TI  G TR + F +AWD P  +F   V Y+RRYT F+G  G +A  I   A+     W  
Sbjct: 325 TIRPGKTRKIPFIIAWDLPVTEFASGVWYYRRYTDFFGRNGKNAWPIIRTAMKHSDTWRE 384

Query: 407 EIEAWQRPILEDKRFPEWYPITLFNELYYLNAGGTIWTDGSPPMQSLATIRERKFSLDTP 466
            IEAWQ+PILE +   + + + LFNELY L  GGT+W        S A  R+ K      
Sbjct: 385 NIEAWQKPILERQDLSDSFKMALFNELYALTDGGTLW--------SAADERDPK------ 430

Query: 467 SKIHTPASSDTALGTRLLENGEENIGQFLYLEGAEYVMYNTYDVHFYSSFALVMLFPKLE 526
                                    GQF  LE  +Y  Y + DV  Y SFAL+ML+P LE
Sbjct: 431 -------------------------GQFAVLECIDYRWYESLDVRLYGSFALLMLWPDLE 465

Query: 527 LSIQRDFAAAVMMHDPGTMKIMSDGKWVARKCLGAVPHDIGL--DDPWFEINSYNLFNSS 584
            S+   FA A+   D     I  +     RK +GA PHD+G   + PW + N  +  + +
Sbjct: 466 KSVVLAFARAISAGDDTPRIIGWNQASAIRKAVGATPHDLGAPNEHPWEKTNYTSYQDCN 525

Query: 585 RWKDLNSKFVLQVYRDFVATG--DKNFARAVWPSVYIAMAYMEQFDKDGDGMIENEGFPD 642
            WKDL   FVLQVYRDF+ TG  D  F +  WP++  A+AY++ FD+DGDG+ EN G PD
Sbjct: 526 LWKDLPCDFVLQVYRDFLLTGGTDCEFLQECWPAIVEALAYLKTFDEDGDGIPENSGAPD 585

Query: 643 QTYDAWSANGVSAYCGGLWVAALQAASALANDVGDHAS----------ASYFWVRYQKAK 692
           QT+D W   G+SAYCGGLW+AAL+AA A+   + D A           A Y     Q   
Sbjct: 586 QTFDDWQMRGISAYCGGLWLAALEAAIAIGEILEDAAPEIIPNPQAKIALYHSWLLQARP 645

Query: 693 AVYDSLWNGSYFNYDNSDGSSSTSIQADQLAGQWYARACGLLPIADEAKVKKALTKIYDF 752
           +  + LWNG Y+  D+   S+S  + ADQL GQ+YA+  GL  I        AL  +Y+ 
Sbjct: 646 SYQEKLWNGQYYRLDSE--SNSEVVMADQLCGQFYAKLLGLPDIVPPECAVSALETVYES 703

Query: 753 NVLKVKGGMCGAMNGMQPDGRIDM-SGLQAREIWPGVTYGLAASMIQEEMVDMAFQTAAG 811
             LK   G  GA NG+  +G  +  +     E+W G+ +GLAA ++Q  M + AF+    
Sbjct: 704 CFLKFNDGEFGAANGVMLNGSPENPNATHPLEVWTGINFGLAAFLVQMGMEEKAFKLTDA 763

Query: 812 VYEVAWSEDGLGYSFQTPESWNNNDEYRSLCYMRPLTIWAMQWALTKPK 860
           V +  + E+GL   F+TPE+      +R+  Y+R + IWA+   LT  K
Sbjct: 764 VVKQIY-ENGL--QFRTPEAITAGGTFRASHYLRAMAIWAIYGVLTNFK 809


>gi|119510228|ref|ZP_01629365.1| hypothetical protein N9414_13977 [Nodularia spumigena CCY9414]
 gi|119465077|gb|EAW45977.1| hypothetical protein N9414_13977 [Nodularia spumigena CCY9414]
          Length = 814

 Score =  384 bits (987), Expect = e-104,   Method: Compositional matrix adjust.
 Identities = 273/814 (33%), Positives = 390/814 (47%), Gaps = 105/814 (12%)

Query: 87  QGIPLGGIGAGSIGRSYRGEFQRFKLFHGICDDAPVLANQFSVFVSRPNGEKFSSVLCPR 146
            G+PLGG GAG IGRS RG+F  + +  G      + A QFSVF S     + +  L   
Sbjct: 40  HGMPLGGFGAGCIGRSSRGDFNLWHIDGGEHTFQNIPACQFSVFESDGTSAQ-AYALATE 98

Query: 147 SPGVPKKNTDSGIESWDW----NLKGENCTYHALFPRAWTVYDGEPDPELRIVCRQISPF 202
           +P       +  + +W W           TYHAL+PR+W VY  E   + ++ C Q SP 
Sbjct: 99  AP------ENGSLSAWKWYPASTATQSTGTYHALYPRSWFVY--ENVLQAQLTCEQFSPI 150

Query: 203 IPHNYKESSFPASVFTFTLSNSGQTSADVTLLFTWANSVA-------------GDSGLSG 249
              NY+ESS+P +VF +   N       ++++ TW N V               D G   
Sbjct: 151 WAENYQESSYPVAVFNWKAHNPTNAPITLSIMLTWQNMVGWFTNALKSPEVRIRDDGSPV 210

Query: 250 HHFNSKTM-TKDGVHGLTLHHRTANGRPPVTFAVAAEETADVHVSEC------PCFLLSG 302
           + +  +   ++D  + +T    T N     +  V  +  A+   S C      P F +  
Sbjct: 211 YEYQPRLGDSQDNYNCIT--ENTENFSCFCSRVVDDDSVAEGDGSWCIATIKHPQFKVFY 268

Query: 303 NS---KGITAKDMWNEIKKHGSFDHLDNDKTSPSEPGSSIGAAIAASLTIPSGSTRSVTF 359
           N+      T  D+W      GS  +  +   +P+   + IG AIA   T+  G T ++ F
Sbjct: 269 NTCWNPAGTGADVWETFAADGSLTNYQD--ATPALENTQIGVAIALRFTLQPGETLAIPF 326

Query: 360 SLAWDCPEVKFFEKV-YHRRYTKFYGTLGDSAARIAHDAILEHAKWECEIEAWQRPILED 418
            L+WD P  +F   + Y+RRYT F+G  GD A  IA  A+ ++  W   IE WQ+PI++ 
Sbjct: 327 VLSWDFPVTEFAAGINYYRRYTDFFGRGGDHAWAIASTALTQYQTWYKNIENWQQPIIDR 386

Query: 419 KRFPEWYPITLFNELYYLNAGGTIWTDGSPPMQSLATIRERKFSLDTPSKIHTPASSDTA 478
           +  P+W+ + LFNELY L +GGT+W+  S               LD              
Sbjct: 387 EDLPDWFKMALFNELYDLTSGGTLWSAAS--------------ELDP------------- 419

Query: 479 LGTRLLENGEENIGQFLYLEGAEYVMYNTYDVHFYSSFALVMLFPKLELSIQRDFAAAVM 538
                       IGQF  LE  +Y  Y + DV  Y SF L+ LFP+LE S+ R FA A+ 
Sbjct: 420 ------------IGQFAVLECLDYRWYESLDVRLYGSFGLLHLFPELEKSVIRAFARAIP 467

Query: 539 MHDPGT------MKIMSDGKWVARKCLGAVPHDIGLDDP--WFEINSYNLFNSSRWKDLN 590
             D           I +D     RK  GA PHD+G  +   W + N     + + WKDL 
Sbjct: 468 HSDNHQRVIGYYYTIGADTTTAVRKVAGATPHDLGAPNEHVWEKTNYTCYQDCNLWKDLG 527

Query: 591 SKFVLQVYRDFVATG--DKNFARAVWPSVYIAMAYMEQFDKDGDGMIENEGFPDQTYDAW 648
           S FVLQVYRDF+ TG  D  F    W  +   + Y++ FD DGDG+ EN G PDQT+D W
Sbjct: 528 SDFVLQVYRDFLLTGADDVQFLADCWAGIVQTLDYLKSFDLDGDGIPENSGAPDQTFDDW 587

Query: 649 SANGVSAYCGGLWVAALQAASA-----LANDVGDHAS---ASYFWVRYQKAKAVY-DSLW 699
              GVSAYCGGLW+AAL+AA A     L N  G   +      + V   +++ +Y + LW
Sbjct: 588 RLQGVSAYCGGLWLAALEAAIAISDILLTNHRGAEDTEELGQRYRVWLAQSRTIYQEKLW 647

Query: 700 NGSYFNYDNSDGSSSTSIQADQLAGQWYARACGLLPIADEAKVKKALTKIYDFNVLKVKG 759
           NG YF  D+  GS    + ADQL GQ+YAR  GL  I    +   AL  +Y    +K   
Sbjct: 648 NGQYFRLDSESGSD--VVMADQLCGQFYARLLGLPDIVPSDRALSALQTVYHACFVKFCN 705

Query: 760 GMCGAMNGMQPDGRIDM-SGLQAREIWPGVTYGLAASMIQEEMVDMAFQTAAGVYEVAWS 818
           G  GA NG++PDG  +  +     E+W G+ +GLAA ++Q  M D A +    V +  + 
Sbjct: 706 GEFGAANGVRPDGSAENPNATHPLEVWTGINFGLAAFLVQMGMQDEALKLTGAVVQQIYH 765

Query: 819 EDGLGYSFQTPESWNNNDEYRSLCYMRPLTIWAM 852
               G  F+TPE+   +  +R+  Y+R + IWA+
Sbjct: 766 N---GLQFRTPEAITASGTFRASTYLRAMAIWAI 796


>gi|353232147|emb|CCD79502.1| bile acid beta-glucosidase-related [Schistosoma mansoni]
          Length = 900

 Score =  384 bits (987), Expect = e-104,   Method: Compositional matrix adjust.
 Identities = 284/852 (33%), Positives = 422/852 (49%), Gaps = 79/852 (9%)

Query: 49  FRHLAHIGFRLYRYSKQEEAKGKVPVFNVFRKHHITSDQGIPLGGIGAGSIGRSYRGEFQ 108
           F  L  +  + + Y K+   + ++P  +           G+P+GGIG+GSIGR YRGEF 
Sbjct: 63  FSKLRAVFSKRFFYIKKRFIQHRLPFIDAVSHVPWRPIYGVPIGGIGSGSIGRGYRGEFC 122

Query: 109 RFKLFHGIC--DDAPVLANQFSVFVSRPNGEKFSSVLCP--RSPGVPKKNTDSGIESWDW 164
           R  L  G+   D  PV  +QF V V +     +  VL P  + P   KK     ++ W W
Sbjct: 123 RSSLIPGMYSYDVQPV--DQFIVTVRKNGVTVYHQVLSPLTKPPSNAKK-----LKKWKW 175

Query: 165 NLKGENCTYHALFPRAWTVYDGEPDPELRIVCRQISPFIPHNYKESSFPASVFTFTLSNS 224
               E   Y  L+PR+WTVY+  P+ +L +V +QISP IPH+Y+ +  P +VF + + + 
Sbjct: 176 GFDPEKGHYLGLYPRSWTVYE-IPELQLVLVYQQISPVIPHDYQTTCLPVAVFYWKVISW 234

Query: 225 GQTSADVTLLFTWANSVAGDSGLSGHHFNSKTMTKDGVHGLTLHHRTA-NGRPPVTFAVA 283
            + +  VT+ F+W          S     S+       +    H  TA N  P  +    
Sbjct: 235 NKENLKVTITFSWHGPNHRKRTKSSDFSKSEDGNNSSENTTNTHVSTAENAFPTCSQRST 294

Query: 284 AEETADVHVSECPCFLLSGNSKGITAKDMWNEIKKHGSFDHLDNDKTSPSEPGSSIGAAI 343
           A  T D          L  N +G   K    ++   G          S ++    +  A+
Sbjct: 295 AFSTGD---------FLKLNCQGKKLKTF-TDLSWLGWLVGYTVPLDSKAKKSPKLAIAV 344

Query: 344 AASLTIP--------SGSTRSVTFSLAWDCPEVKFF--EKVYHRRYTKFYGTLGDSAARI 393
           +A+  +P        + S   + F++ W  P VKF   + VY RRY +++     S A++
Sbjct: 345 SATTNVPRCNITDNLNPSQSELEFAITWHSPIVKFRTGDVVYTRRYVRWFPVDRISGAKL 404

Query: 394 --AHDAILEHAKWECEIEAWQRPILEDKRFPEWYPITLFNELYYLNAGGTIWTDGSPPMQ 451
             AH AI    +W  +IE WQ PIL ++  P WY   LFNELYYL+ GGT+W D   P+Q
Sbjct: 405 LLAH-AIDNWRQWVQKIEDWQNPILSNQSLPSWYKSALFNELYYLSDGGTVWLD---PIQ 460

Query: 452 SLATIRERKFSLDTPSKIHTPASSDTALGTRLLENGEENIGQFLYLEGAEYVMYNTYDVH 511
            +   +    +L    K+ +          RL       IG F YLEG EY MYNTYDVH
Sbjct: 461 -VDCFKSDLLNLTEDMKVDSWDHR-----ARL----SREIGLFGYLEGHEYRMYNTYDVH 510

Query: 512 FYSSFALVMLFPKLELSIQRDFAAAVMMHDPGTMKIMSDGKWVARKCLGAVPHDIG--LD 569
            Y+S+AL+ L+PK++L++  D A   +  D  ++  + +GK + R    AV HD G   D
Sbjct: 511 HYASWALIKLWPKIQLALNYDCADLTIAEDSTSVYYIGNGKALFRSSECAVVHDFGDPED 570

Query: 570 DPWFEINSYNLFNSSRWKDLNSKFVLQVYRDFVATGDKNFARAVWPSVY-IAMAYMEQFD 628
           +PW   N+Y +F +  WKDLNSKF+LQV+RD+  T D  +   + P V  I    +  +D
Sbjct: 571 EPWRCTNAYIMFPTDTWKDLNSKFILQVWRDWRITQDHQYLLYMLPIVLRILRKSLVAWD 630

Query: 629 KDGDGMIENEGFPDQTYDAWSANGVSAYCGGLWVAALQAA---------------SALAN 673
            D DG+IEN GFPDQTYD W+A G++AY GG+W++ L A                  + N
Sbjct: 631 SDDDGLIENSGFPDQTYDVWTAKGLTAYTGGMWLSCLYATFDMLSWCLKSDSPVYDQIVN 690

Query: 674 DVGDHASASYFWVR------YQKAKAVYDS-LWNGSYFNYDNSDGSSSTSIQADQLAGQW 726
           +  D    S+  ++      + KA+  Y++ LW GSY+ +          I ADQL+G W
Sbjct: 691 NTDD-TQRSWSEIKDEVQTLFTKARDSYNAKLWTGSYYAFQTYCTPRREVIMADQLSGYW 749

Query: 727 YARACGLLP--IADEAKVKKALTKIYDFNVLKVKGGMCGAMNGMQPDGRIDMSGLQAREI 784
           ++R  G+ P  I  +  V K L  I + N   +K G  GA+NG  P  + D+S +QA E 
Sbjct: 750 FSRITGVPPNLILPKNHVVKTLQTISNCNWHGIKNGTIGAINGCLPVCKPDLSSVQAEEF 809

Query: 785 WPGVTYGLAASMIQEEMVDMAFQTAAGVYEVAWSEDGLGYSFQTPESWNNNDEYRSLCYM 844
           W  V Y L++ MI E M+D         Y+  ++  GL Y  QTPE++ ++  +R   YM
Sbjct: 810 WVAVNYSLSSLMIAEGMIDEGLALGEKCYDTIYNLYGLQY--QTPEAYMSDGRFRCPGYM 867

Query: 845 RPLTIWAMQWAL 856
           R L IW++Q AL
Sbjct: 868 RALAIWSIQQAL 879


>gi|428770169|ref|YP_007161959.1| glucosylceramidase [Cyanobacterium aponinum PCC 10605]
 gi|428684448|gb|AFZ53915.1| Glucosylceramidase [Cyanobacterium aponinum PCC 10605]
          Length = 802

 Score =  384 bits (986), Expect = e-103,   Method: Compositional matrix adjust.
 Identities = 269/824 (32%), Positives = 400/824 (48%), Gaps = 110/824 (13%)

Query: 85  SDQGIPLGGIGAGSIGRSYRGEFQRFKLFHGICDDAPVLANQFSVFVSRPNGEKFSSVLC 144
           ++ G+PLGG G G +GR   GEF  + +  G      + A QFSVF            + 
Sbjct: 38  ANHGMPLGGFGGGCVGRGVNGEFNLWHIDGGNHIYQSLPACQFSVFEQVEGEPPQVYAMS 97

Query: 145 PRSPGVPKKNTDSGIESWDWNLKGENCTYHALFPRAWTVYDGEPDPELRIVCRQISPFIP 204
             SP      TD  + SW W  +G+  TY AL+PR+W  Y G    + R++C Q SP   
Sbjct: 98  TNSP------TDGSLSSWAWYPQGKG-TYSALYPRSWFDYQGVF--KSRLICEQFSPIWA 148

Query: 205 HNYKESSFPASVFTFTLSNSGQTSADVTLLFTWANSVA-------------GDSGLSGHH 251
            +Y+ESS+P +VF +++ N       ++++FTW NSV               D G   + 
Sbjct: 149 GSYQESSYPLAVFEWSVHNPTDKEITLSIMFTWQNSVGWFTNAIKSPTVKVRDDGSPEYE 208

Query: 252 FNSK---------TMTKDGVHGLTLHHRTANGRPPV----TFAVAAEETADVHVSECPCF 298
           +  K            +D      L  R      P      +A+++     + V     +
Sbjct: 209 YRPKWGDSTGNLNQWIQDNFRVGCLFDRIRMEEQPQEGEGQWAISSITNPSLEV----FY 264

Query: 299 LLSGNSKGITAKDMWNEIKKHGSFDHLDNDKTSPSEPGSSIGAAIAASLTIPSGSTRSVT 358
               N KG  A D+W+    +GS    D    +P+EPG  I  A+A   T+  G T+ + 
Sbjct: 265 HSRWNPKGDGA-DIWDYFAMNGSLP--DYQDETPAEPGEQIAGAMAIRFTVKPGKTKKIP 321

Query: 359 FSLAWDCPEVKFFEKV-YHRRYTKFYGTLGDSAARIAHDAILEHAKWECEIEAWQRPILE 417
           F LAWD P  +F + + Y+RRYT F+G  G +A  I   A+  H  W+  I+ WQ+PIL+
Sbjct: 322 FILAWDFPVTEFAQGITYYRRYTDFFGRNGRNAWAIVRTALKHHDMWQNRIKEWQKPILD 381

Query: 418 DKRFPEWYPITLFNELYYLNAGGTIWTDGSPPMQSLATIRERKFSLDTPSKIHTPASSDT 477
               PEW  + LFNELY L+ GG++WT  +                              
Sbjct: 382 RGDLPEWLKMALFNELYLLSEGGSLWTAAT------------------------------ 411

Query: 478 ALGTRLLENGEENIGQFLYLEGAEYVMYNTYDVHFYSSFALVMLFPKLELSIQRDFAAAV 537
                  EN  + +GQF  LE  +Y  Y + DV  Y SF L+M++P+L+ SI   F+ A+
Sbjct: 412 -------EN--DPVGQFGVLECIDYRWYESLDVRLYGSFPLIMMWPRLDKSIMEAFSRAI 462

Query: 538 MMHDPGTMKIMSDGKWVARKCLGAVPHDIGL--DDPWFEINSYNLFNSSRWKDLNSKFVL 595
              D     I  +     RK  GA PHD+G   + PW + N  +  + + WKDL   FVL
Sbjct: 463 PTEDKTPRIIGYNQTQAIRKAKGATPHDLGAPNEHPWEKSNYTSYQDCNLWKDLGCDFVL 522

Query: 596 QVYRDFVATG--DKNFARAVWPSVYIAMAYMEQFDKDGDGMIENEGFPDQTYDAWSANGV 653
            VYRD++ TG  D++F    W SV   + Y++ FD DGDG+ EN G PDQT+D W   G+
Sbjct: 523 LVYRDYLFTGGLDQDFLWECWDSVVETLHYVKTFDLDGDGIPENSGAPDQTFDDWRLQGI 582

Query: 654 SAYCGGLWVAALQAASALANDV-----------------GDHASASYFWVRYQKAKAVY- 695
           SAYCGGLW+A L+AA A+ N +                 G   + + F     +A+++Y 
Sbjct: 583 SAYCGGLWIAGLEAAIAIGNTLIANPPMNPSLQPDDYPQGIKDAIAVFQQWLTQARSLYH 642

Query: 696 DSLWNGSYFNYDNSDGSSSTSIQADQLAGQWYARACGLLPIADEAKVKKALTKIYDFNVL 755
           ++LWNG Y+  D+  GS    + ADQL GQ+YA+  GL  + +   VK  L K+YD   L
Sbjct: 643 ETLWNGEYYRLDSESGSD--VVMADQLCGQFYAQLLGLPDVVESKYVKSTLRKVYDACFL 700

Query: 756 KVKGGMCGAMNGMQPDGR-IDMSGLQAREIWPGVTYGLAASMIQEEMVDMAFQTAAGVYE 814
           K   G  G  NG++PDG+ +  +     E+W G+ YG+ A MI   M     + A  V +
Sbjct: 701 KFHDGKFGIANGVKPDGKPVKENDTHPLEVWTGINYGIVAFMILNGMKQEGLRVAETVVK 760

Query: 815 VAWSEDGLGYSFQTPESWNNNDEYRSLCYMRPLTIWAMQWALTK 858
             + E+GL   F+TPE+      +R+  Y+R + +WA+ ++L K
Sbjct: 761 QVY-ENGL--QFRTPEAITAVGTFRASHYLRAMAVWAIYYSLEK 801


>gi|327288418|ref|XP_003228923.1| PREDICTED: non-lysosomal glucosylceramidase-like, partial [Anolis
           carolinensis]
          Length = 667

 Score =  384 bits (985), Expect = e-103,   Method: Compositional matrix adjust.
 Identities = 205/435 (47%), Positives = 258/435 (59%), Gaps = 28/435 (6%)

Query: 372 EKVYHRRYTKFYGTLGDSAARIAHDAILEHAKWECEIEAWQRPILEDKRFPEWYPITLFN 431
           EKV    YT+F+G  GD+   +AH  +  + +WE  IEAWQRPILED   P WY   LFN
Sbjct: 221 EKVTMLWYTRFFGREGDACPALAHYMLTHYPEWEKRIEAWQRPILEDSDLPAWYKSALFN 280

Query: 432 ELYYLNAGGTIWTDGSPPMQSLATIRERKFSLDTPSKIHTPASSDTALGTRLLENGEENI 491
           ELY+L+ GGT+W +  P               D PS            G   L    +  
Sbjct: 281 ELYFLSDGGTLWVEFQPE--------------DAPSLA-------AGQGLSGLLPVLKEY 319

Query: 492 GQFLYLEGAEYVMYNTYDVHFYSSFALVMLFPKLELSIQRDFAAAVMMHDPGTMKIMSDG 551
           G+F YLEG EY MYNTYDVHFY+SFAL ML+PKLELS+Q D A AV+  DP     + +G
Sbjct: 320 GRFAYLEGQEYRMYNTYDVHFYASFALAMLWPKLELSLQYDMAVAVLSEDPRPRVYLMNG 379

Query: 552 KWVARKCLGAVPHDIGLDD--PWFEINSYNLFNSSRWKDLNSKFVLQVYRDFVATGDKNF 609
           +    K    VPHDIG  D  PW  +N+Y + +++ WKDLN KFVLQVYRDF  T D  +
Sbjct: 380 QTAQVKLRNVVPHDIGEPDEEPWQRVNAYLIHDTADWKDLNLKFVLQVYRDFFLTEDCTY 439

Query: 610 ARAVWPSVYIAMAYMEQFDKDGDGMIENEGFPDQTYDAWSANGVSAYCGGLWVAALQAAS 669
            R +WP     M    +FD DGDG+IEN GF DQTYDAW   G SAYCGGLW+AA+    
Sbjct: 440 LRDMWPVCQAVMESELKFDTDGDGLIENSGFADQTYDAWVVTGASAYCGGLWLAAVCMMC 499

Query: 670 ALANDVGDHASASYFWVRYQKAKAVYDS-LWNGSYFNYDNSDGSSSTSIQADQLAGQWYA 728
            +A  +G+ A+   +        A ++  LWNG Y+NYD+  G SS SI +DQLAGQW+ 
Sbjct: 500 RMAEILGEGAALQRYSAILSSGAAAFERLLWNGRYYNYDSGGGPSSNSIMSDQLAGQWFL 559

Query: 729 RACGL----LPIADEAKVKKALTKIYDFNVLKVKGGMCGAMNGMQPDGRIDMSGLQAREI 784
           RACGL      +  +  V+ AL  IY+ NVL    G  GA+NGM+PDG  D S +Q+ E+
Sbjct: 560 RACGLGEGKYQVFPQEHVQSALRTIYEMNVLGFSEGAMGAVNGMRPDGVPDTSSVQSDEV 619

Query: 785 WPGVTYGLAASMIQE 799
           W GV Y LAA+MIQE
Sbjct: 620 WVGVVYSLAATMIQE 634



 Score =  132 bits (332), Expect = 9e-28,   Method: Compositional matrix adjust.
 Identities = 80/231 (34%), Positives = 122/231 (52%), Gaps = 19/231 (8%)

Query: 60  YRYSKQEEAKGKVPVFNVFRKHHITSDQGIPLGGIGAGSIGRSYRGEFQRFKLFHGICDD 119
           YR ++ E+   K P  ++F    +    G PLGG G G+I R ++G+F R++L  G+   
Sbjct: 6   YRKTRLEK---KQPFIDLFNSQPLRQIYGCPLGGFGGGTITRGWKGDFCRWQLNPGLYHY 62

Query: 120 APVLANQFSVFVSRPNGEKFSSVLCPRSPGVPKKNTDSGIESWDWNLKGENCTYHALFPR 179
             V+A+QF+V + R     +  VL    P        S ++ W+W+  G    YHAL+PR
Sbjct: 63  KRVVADQFTVCLRRKGQTAYQQVLSVEQP--------SCLQGWNWSFCGHYAFYHALYPR 114

Query: 180 AWTVYDGEPDPELRIVCRQISPFIPHNYKESSFPASVFTFTLSNSGQTSADVTLLFTWAN 239
           AW+VY   P  +L + CRQ++P IPH+YKE+S P  VF + + N      +V+++FT+ N
Sbjct: 115 AWSVYQ-LPGQDLVLTCRQVTPIIPHDYKETSLPVGVFVWEVENLRDEEVEVSIMFTFQN 173

Query: 240 SVAGDSGLSGHHFNSKTMTKDGVH---GLTLHH-RTANGRPPVTFAVAAEE 286
                    G H+N     ++G H   G+ LHH    NG    T A+AA E
Sbjct: 174 GTETKEDRRGGHWNEPFALREGGHCVRGVLLHHCLPVNG---YTLAIAARE 221


>gi|334116939|ref|ZP_08491031.1| Glucosylceramidase [Microcoleus vaginatus FGP-2]
 gi|333461759|gb|EGK90364.1| Glucosylceramidase [Microcoleus vaginatus FGP-2]
          Length = 818

 Score =  383 bits (984), Expect = e-103,   Method: Compositional matrix adjust.
 Identities = 283/837 (33%), Positives = 396/837 (47%), Gaps = 123/837 (14%)

Query: 87  QGIPLGGIGAGSIGRSYRGEFQRFKLFHGICDDAPVLANQFSVFVSRPNGEKFSSVLCPR 146
            G+PLGG GAG IGRS RG+F  + +  G      + A QFSVF     G+K +  L   
Sbjct: 42  HGMPLGGFGAGCIGRSSRGDFNLWHIDGGEHIFRSLPACQFSVF-EETQGKKQAFALSTE 100

Query: 147 SPGVPKKNTDSGIESWDW---------------------NLKGENCTYHALFPRAWTVYD 185
            P      TD  + SW W                          + +YHAL+PR+W VY+
Sbjct: 101 PP------TDGSLSSWKWYPCEGRRKKEEGRREEEERTGEESNSSGSYHALYPRSWFVYE 154

Query: 186 GEPDPELRIVCRQISPFIPHNYKESSFPASVFTFTLSNSGQTSADVTLLFTWANSVA--- 242
                +L   C Q SP +P+NY+ESS+P ++F +   N       ++++ TW N V    
Sbjct: 155 NVFHAQLS--CEQFSPILPNNYQESSYPVAMFEWVAHNPTDAPITLSIMLTWENIVGWFT 212

Query: 243 ----------GDSGLSGHHFNSK--------TMTKDGVH--GLTLHHRTANGRPPV---T 279
                      D G   + +  +         +  +  H  G  +   TA   P      
Sbjct: 213 NANKSPKVMVRDDGSPFYDYQPRWGESAGNLNLLVEDFHRIGCVMTRLTAEDEPQEGDGQ 272

Query: 280 FAVAAEETADVHVSECPCFLLSGNSKGITAKDMWNEIKKHGSFDHLDNDKTSPSEPGSSI 339
           +A+A      V V     +  +GN      +D+W      G F   DND+T+ +  G  +
Sbjct: 273 WAIATITNPAVEVFYQTRWNPAGN-----GEDIWRNFSAEG-FLLDDNDETAAAA-GEQL 325

Query: 340 GAAIAASLTIPSGSTRSVTFSLAWDCPEVKFFEKV-YHRRYTKFYGTLGDSAARIAHDAI 398
             AIA   T+  G TR + F +AWD P  +F   V Y+RRYT F+G  G +A  I   A+
Sbjct: 326 ACAIAVRFTLRPGKTRKIPFIIAWDLPVTEFASGVWYYRRYTDFFGRNGKNAWPIIRTAM 385

Query: 399 LEHAKWECEIEAWQRPILEDKRFPEWYPITLFNELYYLNAGGTIWTDGSPPMQSLATIRE 458
                W   IEAWQ+PILE K   + + + LFNELY L  GGT+W        S A  R+
Sbjct: 386 KHSDTWRENIEAWQKPILERKDLSDSFKMALFNELYALTDGGTLW--------SAADERD 437

Query: 459 RKFSLDTPSKIHTPASSDTALGTRLLENGEENIGQFLYLEGAEYVMYNTYDVHFYSSFAL 518
            K                               GQF  LE  +Y  Y + DV  Y SFAL
Sbjct: 438 PK-------------------------------GQFAVLECIDYRWYESLDVRLYGSFAL 466

Query: 519 VMLFPKLELSIQRDFAAAVMMHDPGTMKIMSDGKWVARKCLGAVPHDIGL--DDPWFEIN 576
           +ML+P LE S+   FA A+   D     I  +     RK +GA PHD+G   + PW + N
Sbjct: 467 LMLWPDLEKSVVLAFARAISAGDDTPRIIGWNQASAIRKAVGATPHDLGAPNEHPWEKTN 526

Query: 577 SYNLFNSSRWKDLNSKFVLQVYRDFVATG--DKNFARAVWPSVYIAMAYMEQFDKDGDGM 634
             +  + + WKDL   FVLQVYRDF+ TG  D  F +  WP++  A+AY++ FD+DGDG+
Sbjct: 527 YTSYQDCNLWKDLPCDFVLQVYRDFLLTGGTDSEFLQECWPAIVEALAYLKTFDEDGDGI 586

Query: 635 IENEGFPDQTYDAWSANGVSAYCGGLWVAALQAASALANDVGDHAS----------ASYF 684
            EN G PDQT+D W   G+SAYCGGLW+AAL+AA A+   + D A           A Y 
Sbjct: 587 PENSGAPDQTFDDWQMRGISAYCGGLWLAALEAAIAIGEILEDAAPQIIPNPQAKIALYH 646

Query: 685 WVRYQKAKAVYDSLWNGSYFNYDNSDGSSSTSIQADQLAGQWYARACGLLPIADEAKVKK 744
               Q   +  + LWNG Y+  D+   S+S  + ADQL GQ+YA+  GL  I        
Sbjct: 647 SWLLQARPSYQEKLWNGQYYRLDSE--SNSEVVMADQLCGQFYAKLLGLPDIVPPECAVS 704

Query: 745 ALTKIYDFNVLKVKGGMCGAMNGMQPDGRIDM-SGLQAREIWPGVTYGLAASMIQEEMVD 803
           AL  +Y+   LK   G  GA NG+  +G  +  +     E+W G+ +GLAA ++Q  M D
Sbjct: 705 ALETVYESCFLKFNEGEFGAANGVMLNGSPENPNATHPLEVWTGINFGLAAFLVQMGMED 764

Query: 804 MAFQTAAGVYEVAWSEDGLGYSFQTPESWNNNDEYRSLCYMRPLTIWAMQWALTKPK 860
            AF+    V    + E+GL   F+TPE+      +R+  Y+R + IWA+   LT  K
Sbjct: 765 KAFKLTDAVVNQIY-ENGL--QFRTPEAITAGGTFRASHYLRAMAIWAIYGVLTNFK 818


>gi|339252284|ref|XP_003371365.1| AP-1 complex subunit gamma-1 [Trichinella spiralis]
 gi|316968412|gb|EFV52690.1| AP-1 complex subunit gamma-1 [Trichinella spiralis]
          Length = 1615

 Score =  383 bits (983), Expect = e-103,   Method: Compositional matrix adjust.
 Identities = 262/807 (32%), Positives = 383/807 (47%), Gaps = 119/807 (14%)

Query: 69   KGKVPVFNVFRKHHITSDQGIPLGGIGAGSIGRSYRGEFQRFKLFHGIC--DDAPVLANQ 126
            +G     + FR   +    G+P G IG+GSIGR   G F +F L  GI   D + V A+ 
Sbjct: 899  RGTSMTIDSFRPLQLKPIYGVPCGTIGSGSIGRHILGGFCKFSLVPGIVEHDVSSVPADN 958

Query: 127  FSVFVSRPNGEKFSSVLCPRSPGVPKKNTDSGIESWDWNLKGENCTYHALFPRAWTVYDG 186
            F V V       +  VL        K      + SW +N   E  +Y   +P +WT Y  
Sbjct: 959  FIVTVRDAGVTIYQQVL-----SAVKVENKKFLSSWLFNFPAECMSYLGRYPMSWTTYKL 1013

Query: 187  EPDPELRIVCRQISPFIPHNYKESSFPASVFTFTLSNSGQTSADVT--LLFTWANSVAGD 244
             P   L ++C Q SP IPHNYK+S  P ++F +T+ +    S+D+T  + FT+ N     
Sbjct: 1014 -PCHNLILICHQYSPVIPHNYKDSCLPLTLFKWTVMSEESLSSDLTVSITFTFRNGTGRP 1072

Query: 245  SGLSGHHFNSKTMTK---DGVH--GLTLHHRTANGRPPVTFAVAAEETADVHVSECPCFL 299
                 H   + +  +     VH  G++LH +T +G     + VAA  T  + V+      
Sbjct: 1073 RVDDQHQMRANSFLERMSSSVHCVGVSLH-QTIDGMN-CDYCVAALNTKSISVTHS---- 1126

Query: 300  LSGNSKGITAKDMWNEIKKHGSFDHLDNDKTSPSEPGSSIGAAIAASLTIPSGSTRSVTF 359
             + N +G    ++W  + + G   +   D+    E    +  A++    I S     + F
Sbjct: 1127 -TFNPEG-CGSEIWKSLYETGRLSNESTDEFVVVE-DRPLAIAVSCCTNIRSPCQEGIVF 1183

Query: 360  SLAWDCPEVKFFEK--VYHRRYTKFYGTLGDSAARIAHDAILEHAKWECEIEAWQRPILE 417
            SL W  PE+ F  K   Y RRY+ ++    D+A R++  A+     +  +I+ WQ PIL+
Sbjct: 1184 SLVWHMPEIYFGNKRRTYLRRYSDWFDESSDAAKRLSIYAMENREIFLSKIDEWQEPILK 1243

Query: 418  DKRFPEWYPITLFNELYYLNAGGTIWTDGSPPMQSLATIRERKFSLDTPSKIHTPASSDT 477
            +   P+WY   +FNELYYL  GG++W              ++++  D P           
Sbjct: 1244 NDNLPKWYKSAIFNELYYLTDGGSVWVK-----------YDQRWRDDEPHLTW------- 1285

Query: 478  ALGTRLLENGEENIGQFLYLEGAEYVMYNTYDVHFYSSFALVMLFPKLELSIQRDFAAAV 537
                                   EY MYNTYDVHFY+SFAL  LFP++E SIQ +FA   
Sbjct: 1286 -----------------------EYRMYNTYDVHFYASFALAKLFPQIEHSIQAEFADQF 1322

Query: 538  MMHDPGTMKIMSDGKWVARKCLGAVPHDIGLDDPWFEINSYNLFNSSRWKDLNSKFVLQV 597
                   +  +S G +V               +PW E+N+Y   ++S WKDLN KFVL  
Sbjct: 1323 YNVTQTNVYHISPGTYVF--------------EPWLEVNAYAFHDTSEWKDLNLKFVLTS 1368

Query: 598  YRDFVATG-DKNFARAVWPSVY-IAMAYMEQFDKDGDGMIENEGFPDQTYDAWSANGVSA 655
            YRD++     ++F R VWP+V  I    ++ +D DGD ++EN+G  DQTYDAW  +    
Sbjct: 1369 YRDYLMLNKSESFLRHVWPAVKEIIETALQNWDHDGDKLVENDGTCDQTYDAWCMH---- 1424

Query: 656  YCGGLWVAALQAASALANDVGDHASASYFWVRYQKAKAVYDSLW--NGSYFNYDNSDGSS 713
                 W   ++ A  L                 QK   + +  W   G+Y+ +D    +S
Sbjct: 1425 -----WTLLVELAKRL---------------EMQKKSLISEQCWMMRGTYYQFDQQSCNS 1464

Query: 714  STSIQADQLAGQWYARACGL----LPIADEAKVKKALTKIYDFNVLKVKGGMCGAMNGMQ 769
            ST + ADQL G  Y + CGL    LP+ +   VKK L  IY+ NV     G  GA+NGM 
Sbjct: 1465 ST-VMADQLCGYAYLKICGLQTDILPLDN---VKKVLETIYNLNVCSFGNGTLGAVNGML 1520

Query: 770  PDGRIDMSGLQAREIWPGVTYGLAASMIQEEMVDMAFQTAAGVYEVAWSEDGLGYSFQTP 829
              G  D S LQA E W GVTY L+A MI E  V+  F TA+G+Y+  +  +  G  +QTP
Sbjct: 1521 YSGEKDTSSLQADEAWTGVTYFLSAHMISEGFVEQGFSTASGIYKSCF--ESFGMHYQTP 1578

Query: 830  ESWNNNDEYRSLCYMRPLTIWAMQWAL 856
            E+      +R++ YMRPL+IWA+QW L
Sbjct: 1579 EALYEKKWFRAVGYMRPLSIWAIQWYL 1605


>gi|254410084|ref|ZP_05023864.1| conserved hypothetical protein [Coleofasciculus chthonoplastes PCC
           7420]
 gi|196183120|gb|EDX78104.1| conserved hypothetical protein [Coleofasciculus chthonoplastes PCC
           7420]
          Length = 814

 Score =  383 bits (983), Expect = e-103,   Method: Compositional matrix adjust.
 Identities = 267/834 (32%), Positives = 384/834 (46%), Gaps = 123/834 (14%)

Query: 87  QGIPLGGIGAGSIGRSYRGEFQRFKLFHGICDDAPVLANQFSVFVSRPNGEKFSSVLCPR 146
            G+PLGG GAG +GRS RG+F  + L  G      + A QFSVF    N +  +  LC  
Sbjct: 41  HGMPLGGFGAGCLGRSPRGDFNLWHLDGGEHTFKRLPACQFSVFEQPENQDAQAYALCTE 100

Query: 147 SPGVPKKNTDSGIESWDWNLKGENC-----------TYHALFPRAWTVYDGEPDPELRIV 195
            P       D  +  W W     N            TYHAL+PR+W VY+     EL   
Sbjct: 101 PP------EDGSLGKWQWYPTSPNVETRHGASLRTGTYHALYPRSWFVYENVFQTEL--T 152

Query: 196 CRQISPFIPHNYKESSFPASVFTFTLSNSGQTSADVTLLFTWANSVA------------- 242
           C+Q SP     Y+ESS+P +VF +T  N       ++++ TW N V              
Sbjct: 153 CQQFSPIWADCYQESSYPVAVFEWTAHNPTDQPITLSIMLTWQNMVGWFTNAIQHPQVQV 212

Query: 243 GDSGLSGHHFNSKTMTKDGVHG---------------LTLHHRTANGRPPVTFAVAAEET 287
            D G   + +  +     G +                + L      G   + FA     +
Sbjct: 213 RDDGSPVYEYQPRWGNSTGNYNQWIVDNFRVGCLLDRVRLRDEIEEGEGQLCFATVTNPS 272

Query: 288 ADVHVSECPCFLLSGNSKGITAKDMWNEIKKHGSFDHLDNDKTSPSEPGSSIGAAIAASL 347
            +V       +    N  G    ++WN     GS   + ++  +P+EPG  I  AIA  +
Sbjct: 273 QEVF------YHCRWNPTG-DGSEVWNTFASDGSLSDVQDE--TPAEPGEQIAGAIAVRV 323

Query: 348 TIPSGSTRSVTFSLAWDCPEVKFFEKV-YHRRYTKFYGTLGDSAARIAHDAILEHAKWEC 406
            +  G TR + F LAWD P  +F   V Y+RRYT F+G  G++A  +   A+     W+ 
Sbjct: 324 RLRPGRTRKIPFILAWDFPVTEFASGVKYYRRYTDFFGRTGNNAWSMVRTALKHSDLWQE 383

Query: 407 EIEAWQRPILEDKRFPEWYPITLFNELYYLNAGGTIWTDGSPPMQSLATIRERKFSLDTP 466
            I +WQ PIL+    P W+ + LFNELY L+AGGTIW+                      
Sbjct: 384 RILSWQNPILQRHDLPNWFKMALFNELYLLSAGGTIWSSA-------------------- 423

Query: 467 SKIHTPASSDTALGTRLLENGEENIGQFLYLEGAEYVMYNTYDVHFYSSFALVMLFPKLE 526
                              N  + +G+F  LE  +Y  Y + DV  Y SFAL ML+ +L+
Sbjct: 424 -------------------NERDPVGKFGVLESFDYRWYESLDVRLYGSFALAMLWSRLD 464

Query: 527 LSIQRDFAAAVMMHDPGTMKIMSDGKWVARKCLGAVPHDIGL--DDPWFEINSYNLFNSS 584
            ++   FA A+   D     I  +     RK   A PHD+G   + PW + N  +  + +
Sbjct: 465 KAVLEAFARAIPTSDDTPRIIGYNQATAIRKAADATPHDLGAANEHPWEKTNYTSYQDCN 524

Query: 585 RWKDLNSKFVLQVYRDFVATGDKN--FARAVWPSVYIAMAYMEQFDKDGDGMIENEGFPD 642
           +WKDL   FVLQVYR FV TG K+  F    W ++  A+AY++ FD DGDG+ EN G PD
Sbjct: 525 QWKDLPCDFVLQVYRTFVFTGSKDIEFLAECWSAIVRALAYLKSFDHDGDGIPENSGAPD 584

Query: 643 QTYDAWSANGVSAYCGGLWVA----------------ALQAASALANDVGDHASASYFWV 686
           QT+D W   GVSAYCGGLW+A                 L +A  L +      + + +  
Sbjct: 585 QTFDDWQLQGVSAYCGGLWLAALEAASAIAQILLDNYTLVSAINLPDPESIQHTLNTYHK 644

Query: 687 RYQKAKAVY-DSLWNGSYFNYDNSDGSSSTSIQADQLAGQWYARACGLLPIADEAKVKKA 745
             Q+A+ +Y + LWNG Y+  D+   S S  + ADQL GQ+YA+  GL  I      + A
Sbjct: 645 WLQQAQPIYQEKLWNGQYYRLDSD--SDSDVVMADQLCGQFYAQLLGLPDIVPPECTQSA 702

Query: 746 LTKIYDFNVLKVKGGMCGAMNGMQPDGRIDM-SGLQAREIWPGVTYGLAASMIQEEMVDM 804
           L  +Y+   LK      GA NG++PDG  +        E+W G+ +GLAA +IQ  M D 
Sbjct: 703 LKTVYESCFLKFHNAQFGAANGVKPDGSPEHPDATHPLEVWTGINFGLAAFLIQLGMKDE 762

Query: 805 AFQTAAGVYEVAWSEDGLGYSFQTPESWNNNDEYRSLCYMRPLTIWAMQWALTK 858
           A +    V    + E+GL   F+TPE+      +R+  Y+R + IWA+   L++
Sbjct: 763 ALKLTESVVRQVY-ENGL--QFRTPEAITPVGTFRAGHYLRAMAIWAVYLTLSQ 813


>gi|300866990|ref|ZP_07111661.1| conserved hypothetical protein [Oscillatoria sp. PCC 6506]
 gi|300335025|emb|CBN56827.1| conserved hypothetical protein [Oscillatoria sp. PCC 6506]
          Length = 863

 Score =  382 bits (980), Expect = e-103,   Method: Compositional matrix adjust.
 Identities = 282/851 (33%), Positives = 397/851 (46%), Gaps = 139/851 (16%)

Query: 87  QGIPLGGIGAGSIGRSYRGEFQRFKLFHGICDDAPVLANQFSVFVSRPNGEKFSSVLCPR 146
            G+PLGG GAG IGRS+RG+F  + +  G      + A QFSVF    +G+K +  L   
Sbjct: 66  HGMPLGGFGAGCIGRSHRGDFNLWHIDGGEHVFRSLPACQFSVF-EEIDGKKQAYALSTE 124

Query: 147 SPGVPKKNTDSGIESWDW----------NLKGE---------NCTYHALFPRAWTVYDGE 187
            P      TD  + +W W           LK E           +YHAL+PR+W VY+  
Sbjct: 125 PP------TDGSLSAWKWYPSELKIKNEKLKSEEQNSDSSLSTGSYHALYPRSWFVYENV 178

Query: 188 PDPELRIVCRQISPFIPHNYKESSFPASVFTFTLSNSGQTSADVTLLFTWANSVA----- 242
              +L   C Q SP    NY+ESS+P ++F +   N       ++++ TW N+V      
Sbjct: 179 FQAQLS--CEQFSPIWAGNYQESSYPVAIFEWIAHNPTDRPITLSIMLTWQNTVGWFTNA 236

Query: 243 --------GDSGLSGHHFNSKTMTKDGVHGLTLH--HRTANGRPPVTFAVAAEET----- 287
                    D G   + +NS+     G     +   HR       +T   A+EE      
Sbjct: 237 NKSPEIKVRDDGSPVYEYNSRWGESGGNFNRLVEDFHRIGC---VMTRLSASEEVKEGEG 293

Query: 288 ----ADVHVSECPCFLLSGNSKGITAKDMWNEIKKHGSFDHLDNDKTSPSEPGSSIGAAI 343
               A V  +    F  +  +   + +D+W    + G    LD+   +P+  G  +  AI
Sbjct: 294 QWAIATVANAAVEVFYQTRWNPAGSGEDIWRPFSQEGFL--LDDPDETPAAGGEQLACAI 351

Query: 344 AASLTIPSGSTRSVTFSLAWDCPEVKFFEKV-YHRRYTKFYGTLGDSAARIAHDAILEHA 402
           A   TI  G TR + F LAWD P  +F  +V Y+RRYT F+G  G +   I   A+    
Sbjct: 352 AVRFTIRPGKTRKIPFILAWDFPITEFAPEVWYYRRYTDFFGRNGKNVWSIIRTAMKHSD 411

Query: 403 KWECEIEAWQRPILEDKRFPEWYPITLFNELYYLNAGGTIWTDGSPPMQSLATIRERKFS 462
            W   I+AWQ+PIL  +  P+   + +FNELY L  GGT+W        S A  R+ K  
Sbjct: 412 TWRENIQAWQQPILAREDLPDIIKMAMFNELYALTDGGTLW--------SAADERDPK-- 461

Query: 463 LDTPSKIHTPASSDTALGTRLLENGEENIGQFLYLEGAEYVMYNTYDVHFYSSFALVMLF 522
                                        GQF  LE  +Y  Y + DV  Y SFAL+ML+
Sbjct: 462 -----------------------------GQFAVLECIDYRWYESLDVRLYGSFALLMLW 492

Query: 523 PKLELSIQRDFAAAVMMHDPGTMKIMSDGKWVARKCLGAVPHDIGL--DDPWFEINSYNL 580
           P+L+ ++   FA A+   D     I  +     RK  GA PHD+G   + PW + N  + 
Sbjct: 493 PELDKAVLLAFARAIFTGDDTPRVIGWNQASAVRKIAGATPHDLGAPNEHPWEKTNYTSY 552

Query: 581 FNSSRWKDLNSKFVLQVYRDFVATG--DKNFARAVWPSVYIAMAYMEQFDKDGDGMIENE 638
            + + WKDL   FVLQVYRDF+ TG  D  F +  WP++  A+AY++ FD DGDG+ EN 
Sbjct: 553 QDCNLWKDLPCDFVLQVYRDFLLTGSTDYEFLQECWPAIVEALAYLKTFDLDGDGIPENS 612

Query: 639 GFPDQTYDAWSANGVSAYCGGLWVAALQAA-------------------SALANDVGDHA 679
           G PDQT+D W   GVSAYCGGLW+AAL+AA                   S + N   D  
Sbjct: 613 GAPDQTFDDWQMRGVSAYCGGLWLAALEAAIAIGSLLIKHQSSVEELNQSPITNFNKDFK 672

Query: 680 SASYFWV-RYQ-----------KAKAVYDS-LWNGSYFNYDNSDGSSSTSIQADQLAGQW 726
             S F +  YQ           +AK +Y   LWNG Y+  D+  GS    + ADQL GQ+
Sbjct: 673 DVSLFPITNYQLPITNYQNWLEQAKPIYQQRLWNGQYYRLDSESGSD--VVMADQLCGQF 730

Query: 727 YARACGLLPIADEAKVKKALTKIYDFNVLKVKGGMCGAMNGMQPDGRIDMSG-LQAREIW 785
           YAR  GL  I  E     AL  +Y+   +K   G  GA NG+  DG  +  G     E+W
Sbjct: 731 YARLLGLPDIVPEECAISALETVYESCFIKFNQGEFGAANGVMLDGSPEKPGATHPLEVW 790

Query: 786 PGVTYGLAASMIQEEMVDMAFQTAAGVYEVAWSEDGLGYSFQTPESWNNNDEYRSLCYMR 845
            G+ +GLAA M+Q  M + A +    V +  ++    G  F+TPE+      +R+  Y+R
Sbjct: 791 TGINFGLAAFMMQMGMKEKALELTEVVVKQIYAN---GLQFRTPEAITATGTFRASHYLR 847

Query: 846 PLTIWAMQWAL 856
            + IWA+ + L
Sbjct: 848 AMAIWAIYYEL 858


>gi|209526014|ref|ZP_03274547.1| protein of unknown function DUF608 [Arthrospira maxima CS-328]
 gi|376002240|ref|ZP_09780079.1| conserved hypothetical protein [Arthrospira sp. PCC 8005]
 gi|423067988|ref|ZP_17056778.1| hypothetical protein SPLC1_S630380 [Arthrospira platensis C1]
 gi|209493540|gb|EDZ93862.1| protein of unknown function DUF608 [Arthrospira maxima CS-328]
 gi|375329367|emb|CCE15832.1| conserved hypothetical protein [Arthrospira sp. PCC 8005]
 gi|406710526|gb|EKD05737.1| hypothetical protein SPLC1_S630380 [Arthrospira platensis C1]
          Length = 799

 Score =  380 bits (977), Expect = e-102,   Method: Compositional matrix adjust.
 Identities = 270/826 (32%), Positives = 390/826 (47%), Gaps = 120/826 (14%)

Query: 87  QGIPLGGIGAGSIGRSYRGEFQRFKLFHGICDDAPVLANQFSVFVSRPNGEKFSSVLCPR 146
            G+PLGG GAG IGRS+RG+F  + L  G     P+   QFSVF    +G++ +  L  +
Sbjct: 41  HGMPLGGFGAGCIGRSHRGDFNLWHLDGGEHIFRPLPGCQFSVF-EEIDGKRQAYALSTQ 99

Query: 147 SPGVPKKNTDSGIESWDW----NLKGENCTYHALFPRAWTVYDGEPDPELRIVCRQISPF 202
            P       D  + +W W    +   +  TY+AL+PR+W VY  E   + ++ C Q SP 
Sbjct: 100 PP------QDGSLSAWSWYPTSDQHTQTGTYYALYPRSWFVY--ENVFQTQLTCEQFSPV 151

Query: 203 IPHNYKESSFPASVFTFTLSNSGQTSADVTLLFTWANSVA-------------------- 242
              NY+E+S+P +VF +   N       +++L TW N++                     
Sbjct: 152 WGGNYQETSYPIAVFEWIAHNPTDKLIILSILLTWENTIGWFTNSLENPTVKVRDDGSPV 211

Query: 243 -------GDSG------LSGHHFNSKTMTKDGVHGLTLHHRTANGRPPVTFAVAAEETAD 289
                  GDS       +   H     MT+     L ++ +   G   +  A  +    +
Sbjct: 212 YEYQPRWGDSQDNLNRFVEDFHRVGCVMTR-----LNINDQPREGEGQIAIATISNPAVE 266

Query: 290 VHVSECPCFLLSGNSKGITAKDMWNEIKKHGSFDHLDNDKTSPSEPGSSIGAAIAASLTI 349
           V+           N  G    D+W    K GS   + +D+  P+  G  IGAA+A    +
Sbjct: 267 VYYHN------RWNPNG-NGYDIWEYFSKDGSLIDIQDDR--PAVEGERIGAALAVRCIL 317

Query: 350 PSGSTRSVTFSLAWDCPEVKFFEKV-YHRRYTKFYGTLGDSAARIAHDAILEHAKWECEI 408
             G  R + F +AWD P  +F   V Y+R YT F+G  G +A  +   A+     W   I
Sbjct: 318 RPGKIRKIPFFIAWDLPVCEFEAGVSYYRHYTDFFGRNGRNAWSMIRTAMKHSDTWRENI 377

Query: 409 EAWQRPILEDKRFPEWYPITLFNELYYLNAGGTIWTDGSPPMQSLATIRERKFSLDTPSK 468
           EAWQ PIL  +  P W+ + LFNELY L  GG++WT         AT  ++ +       
Sbjct: 378 EAWQNPILNRQDLPPWFKMALFNELYLLTDGGSLWT---------ATDEDQPW------- 421

Query: 469 IHTPASSDTALGTRLLENGEENIGQFLYLEGAEYVMYNTYDVHFYSSFALVMLFPKLELS 528
                                  G+F  LE  +Y  Y + DV  Y SF L+M +P+LE S
Sbjct: 422 -----------------------GRFAVLECLDYRWYESLDVRLYGSFGLLMSWPELEKS 458

Query: 529 IQRDFAAAVMMHDPGTMKIMSDGKWVARKCLGAVPHDIGL--DDPWFEINSYNLFNSSRW 586
           + R FA A+   D     I  +     RK  GA PHD+G   + PW + N  +  + + W
Sbjct: 459 VIRTFAKAIATADHTPRIIGYNQASAIRKAAGATPHDLGAPNEHPWEKTNYTSYQDCNLW 518

Query: 587 KDLNSKFVLQVYRDFVATG--DKNFARAVWPSVYIAMAYMEQFDKDGDGMIENEGFPDQT 644
           KDL   FVLQVYRD++ TG  D  F    WP++   + Y++ FD+D DG+ EN G PDQT
Sbjct: 519 KDLPCDFVLQVYRDYLLTGADDIQFLVECWPAIVETLDYLKTFDRDQDGIPENGGAPDQT 578

Query: 645 YDAWSANGVSAYCGGLWVAALQAASALANDVGDH--------ASASYFWVRY-QKAKAVY 695
           +D W   G+SAYCGGLW+AAL+AA  +AN +  H          A   W  + + AK +Y
Sbjct: 579 FDDWRLVGISAYCGGLWLAALEAAIEIANILLSHNRDLTPDTTKAIATWTNWLETAKPLY 638

Query: 696 D-SLWNGSYFNYDNSDGSSSTSIQADQLAGQWYARACGLLPIADEAKVKKALTKIYDFNV 754
           D +LWNGSY+N D+  GS    + ADQL GQ+YA   GL  I    + + AL  IY    
Sbjct: 639 DQTLWNGSYYNLDSQSGSD--VVMADQLCGQFYAALLGLPDIVPPHRAQVALQTIYQACF 696

Query: 755 LKVKGGMCGAMNGMQPDGR-IDMSGLQAREIWPGVTYGLAASMIQEEMVDMAFQTAAGVY 813
                G  GA NG++  G  ++ +     E+W G+ +GL A  IQ  M + AFQ A  V 
Sbjct: 697 QNFHNGKFGAANGVRTTGEPMNPNDTHPLEVWTGINFGLGAFFIQMGMKEEAFQLAEAVI 756

Query: 814 EVAWSEDGLGYSFQTPESWNNNDEYRSLCYMRPLTIWAMQWALTKP 859
              +     G  F+TPE+      +R+  Y+R + IWAM + +  P
Sbjct: 757 TQVYQN---GLQFRTPEAITAAGTFRASHYLRAMAIWAMYYQIQHP 799


>gi|409991122|ref|ZP_11274412.1| hypothetical protein APPUASWS_08895 [Arthrospira platensis str.
           Paraca]
 gi|291568810|dbj|BAI91082.1| hypothetical protein [Arthrospira platensis NIES-39]
 gi|409938009|gb|EKN79383.1| hypothetical protein APPUASWS_08895 [Arthrospira platensis str.
           Paraca]
          Length = 799

 Score =  380 bits (975), Expect = e-102,   Method: Compositional matrix adjust.
 Identities = 275/826 (33%), Positives = 395/826 (47%), Gaps = 120/826 (14%)

Query: 87  QGIPLGGIGAGSIGRSYRGEFQRFKLFHGICDDAPVLANQFSVFVSRPNGEKFSSVLCPR 146
            G+PLGG GAG IGRS+RG+F  + L  G     P+   QFSVF    +G++ +  L  +
Sbjct: 41  HGMPLGGFGAGCIGRSHRGDFNLWHLDGGEHIFRPLPGCQFSVF-EEIDGKRQAYALSTQ 99

Query: 147 SPGVPKKNTDSGIESWDW----NLKGENCTYHALFPRAWTVYDGEPDPELRIVCRQISPF 202
            P       D  + +W+W    +   +  TYHAL+PR+W VY  E   + ++ C Q SP 
Sbjct: 100 PP------EDGSLSTWNWYPSSDQHTQTGTYHALYPRSWFVY--ENVFQTQLTCEQFSPV 151

Query: 203 IPHNYKESSFPASVFTFTLSNSGQTSADVTLLFTWANSVA-------------------- 242
              NY+E+S+P +VF +   N       +++L TW N++                     
Sbjct: 152 WGGNYQETSYPVAVFEWIAHNPTDKLIILSILLTWENTIGWFTNRLDTPAVKVRDDGSPV 211

Query: 243 -------GDSGLSGHHFNSK------TMTKDGVHGLTLHHRTANGRPPVTFAVAAEETAD 289
                  GDS  + +HF          MT+     L ++ +   G   +  A       +
Sbjct: 212 YEYQPRWGDSQDNCNHFVEDFHRVGCVMTR-----LNINDQLGEGDGQMAIATITNPVVE 266

Query: 290 VHVSECPCFLLSGNSKGITAKDMWNEIKKHGSFDHLDNDKTSPSEPGSSIGAAIAASLTI 349
           V+      +  SGN       D+W    + GS   + +D   P+  G  IGAA+A    +
Sbjct: 267 VYCHNR--WRPSGN-----GYDIWEYFSQDGSLIDIHDD--GPALEGERIGAALAVRCIL 317

Query: 350 PSGSTRSVTFSLAWDCPEVKFFEKV-YHRRYTKFYGTLGDSAARIAHDAILEHAKWECEI 408
             G TR + F +AWD P  +F   V Y+R YT F+G  G +A  +   A+     W   I
Sbjct: 318 RPGKTRKIPFFIAWDLPVTEFEAGVTYYRHYTDFFGRNGRNAWSMIRTAMKHSDTWRENI 377

Query: 409 EAWQRPILEDKRFPEWYPITLFNELYYLNAGGTIWTDGSPPMQSLATIRERKFSLDTPSK 468
           EAWQ PIL  +  P W+ + LFNELY L  GG++WT                 ++D    
Sbjct: 378 EAWQNPILNRQDLPPWFKMALFNELYLLTDGGSLWT-----------------AIDE--- 417

Query: 469 IHTPASSDTALGTRLLENGEENIGQFLYLEGAEYVMYNTYDVHFYSSFALVMLFPKLELS 528
                              +E  G+F  LE  +Y  Y + DV  Y SF L+M +P+LE S
Sbjct: 418 -------------------DEPWGRFAVLECLDYRWYESLDVRLYGSFGLLMWWPELEKS 458

Query: 529 IQRDFAAAVMMHDPGTMKIMSDGKWVARKCLGAVPHDIGL--DDPWFEINSYNLFNSSRW 586
           + R FA A+   D     I  +     RK  GA PHD+G   + PW + N  +  + + W
Sbjct: 459 VMRAFAKAIATDDDTPRIIGYNQASAIRKAAGATPHDLGAPNEHPWEKTNYTSYQDCNLW 518

Query: 587 KDLNSKFVLQVYRDFVATG--DKNFARAVWPSVYIAMAYMEQFDKDGDGMIENEGFPDQT 644
           KDL   FVLQVYRD++ TG  D  F    WP++   + Y++ FD+D D + EN G PDQT
Sbjct: 519 KDLPCDFVLQVYRDYLLTGADDIQFLVECWPAIVQTLDYLKTFDRDRDCIPENGGAPDQT 578

Query: 645 YDAWSANGVSAYCGGLWVAALQAASALANDVGDHAS--------ASYFWVRY-QKAKAVY 695
           +D W   G+SAYCGGLW+AAL+AA A+AN +  H S        A   W  + + AK +Y
Sbjct: 579 FDDWRMMGISAYCGGLWLAALEAAIAIANILLSHHSDITPDTTKAIATWTNWLETAKPLY 638

Query: 696 D-SLWNGSYFNYDNSDGSSSTSIQADQLAGQWYARACGLLPIADEAKVKKALTKIYDFNV 754
           D +LWNGSY+N D+  GS    + ADQL GQ+YA   GL  I    + + AL  IY    
Sbjct: 639 DQTLWNGSYYNLDSQSGSD--VVMADQLCGQFYAALLGLPDIVPHHRARVALQTIYQACF 696

Query: 755 LKVKGGMCGAMNGMQPDGR-IDMSGLQAREIWPGVTYGLAASMIQEEMVDMAFQTAAGVY 813
                G  GA NG++  G  I+       E+W G+ +GL A +IQ  M + AFQ A  V 
Sbjct: 697 QNFHNGKFGAANGVRTTGEPINPQDTHPLEVWTGINFGLGAFLIQMGMKEEAFQLAEAVI 756

Query: 814 EVAWSEDGLGYSFQTPESWNNNDEYRSLCYMRPLTIWAMQWALTKP 859
              +     G  F+TPE+      +R+  Y+R + IWAM + +  P
Sbjct: 757 TQVYQN---GLQFRTPEAITAAGTFRASHYLRAMAIWAMYYQIQHP 799


>gi|22297737|ref|NP_680984.1| hypothetical protein tlr0193 [Thermosynechococcus elongatus BP-1]
 gi|22293914|dbj|BAC07746.1| tlr0193 [Thermosynechococcus elongatus BP-1]
          Length = 806

 Score =  379 bits (974), Expect = e-102,   Method: Compositional matrix adjust.
 Identities = 279/799 (34%), Positives = 393/799 (49%), Gaps = 93/799 (11%)

Query: 88  GIPLGGIGAGSIGRSYRGEFQRFKLFHGICDDAPVLANQFSVFVSRPNGEKFSSVLCPRS 147
           G+PLGG GAG IGRS  G+F  + +  G      + A QFS+F      + ++    P+ 
Sbjct: 57  GMPLGGFGAGCIGRSSAGDFNLWHVDGGEHIFGTLPACQFSLFEQGEQTQAYALGSAPK- 115

Query: 148 PGVPKKNTDSGIESWDWNLKGENCTYHALFPRAWTVYDGEPDPELRIVCRQISPFIPHNY 207
                   D  + SW W   G+  TY   +PR+W VY+G      +I C Q SP +PHNY
Sbjct: 116 --------DGRLSSWQWYPAGKG-TYAVRYPRSWFVYEGVF--RAQITCEQFSPILPHNY 164

Query: 208 KESSFPASVFTFTLSNSGQTSADVTLLFTWANSVA-------------GDSGLSGHHFNS 254
           +E+S+P +VF +T SN    S  ++L+ +W N+V               D G   + +  
Sbjct: 165 QETSYPVAVFLWTFSNPTDQSLTLSLMLSWQNTVGWFCNTTPSSAIAIRDDGSPVYTYTP 224

Query: 255 KTMTKDGVHGLTLHHRTANG-------RPPVTFAVAAEETADVHVSECPCFLLSGNSKGI 307
           +    DG     +   +  G        P        E  A +       F  S      
Sbjct: 225 RWGQSDGNFNELIQTESFQGWRLRRMPHPNPPQEGDGEWAALIPTGLGEFFGCSRWQPEG 284

Query: 308 TAKDMWNEIKKHGSFDHLDNDKTSPSEPGSSIGAAIAASLTIPSGSTRSVTFSLAWDCPE 367
               +W      GS   + ND T P+  G  + AA A   ++  G  + +   LAWD P 
Sbjct: 285 DGAHLWQSFSVDGSLPFV-NDPT-PAAAGEQVAAAFALRFSLAPGERKQIPVVLAWDFPV 342

Query: 368 VKFFEKV-YHRRYTKFYGTLGDSAARIAHDAILEHAKWECEIEAWQRPILEDKRFPEWYP 426
            +F + V Y+RRYT F    G +A  +A  A+  +A W+ +I  WQ PIL    +P+W+ 
Sbjct: 343 TEFGKGVIYYRRYTDFCDRHGTNAVTLAAQALAAYATWQEQIRTWQAPILSHPDWPDWFK 402

Query: 427 ITLFNELYYLNAGGTIWTDGSPPMQSLATIRERKFSLDTPSKIHTPASSDTALGTRLLEN 486
           + L NELY L++GG++W+                            A+SD          
Sbjct: 403 MALCNELYVLSSGGSLWS----------------------------AASD---------- 424

Query: 487 GEENIGQFLYLEGAEYVMYNTYDVHFYSSFALVMLFPKLELSIQRDFAAAVMMHDPGTMK 546
             + +GQF  LE  +Y  Y + DV  Y SFAL+ L+P+LE S+ R FA A+   DP T++
Sbjct: 425 -RDPVGQFAVLECLDYRWYESLDVRLYGSFALLQLWPELEKSVMRAFARAIPTADP-TLR 482

Query: 547 IMS--------DGKWVARKCLGAVPHDIGL--DDPWFEINSYNLFNSSRWKDLNSKFVLQ 596
           I+               RK   AVPHD+G   + PW + N     + + WKDL S FVL 
Sbjct: 483 IIGYFYRGDPETAYKAPRKLANAVPHDLGAPNEHPWEKTNYTAYQDCNLWKDLASDFVLL 542

Query: 597 VYRDFVATG--DKNFARAVWPSVYIAMAYMEQFDKDGDGMIENEGFPDQTYDAWSANGVS 654
           VYRDF+ TG  D NFAR  WP+V  A+ +++QFD+DGDG+ EN G PDQTYD W   GVS
Sbjct: 543 VYRDFLFTGGTDLNFARECWPAVVAALDHLKQFDQDGDGLPENGGAPDQTYDDWKLQGVS 602

Query: 655 AYCGGLWVAALQAASALANDVGDHASASYFWVRYQKAKAVYDSLWNGSYFNYDNSDGSSS 714
           AYCGGLW+AAL+AA AL   +       Y     Q     +  LWNG Y+  D   GS S
Sbjct: 603 AYCGGLWLAALEAAIALGTLLQQPQVEIYRQWLSQARPRYHQLLWNGEYYRLDT--GSGS 660

Query: 715 TSIQADQLAGQWYARACGLLPIADEAKVKKALTKIYDFNVLKVKGGMCGAMNGMQPDGRI 774
             I ADQL GQ+YA+  GL+ I       +AL KIYD   LK   G  GA NG+ P+G+ 
Sbjct: 661 DVIMADQLCGQFYAQLLGLVDIVPPDCCDRALRKIYDTCFLKFHNGQFGAANGLLPNGQP 720

Query: 775 DMS-GLQAREIWPGVTYGLAASMIQEEMVDMAFQTAAGVYEVAWSEDGLGYSFQTPESWN 833
           +        E+W G+ +GLAA + Q  M+D A++ A  V    + E+GL   F+TPE+  
Sbjct: 721 ENPHATHPLEVWTGINFGLAAFLWQRGMIDEAWRLAEVVVRQIY-ENGL--QFRTPEAIT 777

Query: 834 NNDEYRSLCYMRPLTIWAM 852
            N  +R+  Y+RP+ IWA+
Sbjct: 778 ANGTFRACMYLRPMAIWAL 796


>gi|443311911|ref|ZP_21041533.1| putative bile acid beta-glucosidase [Synechocystis sp. PCC 7509]
 gi|442777986|gb|ELR88257.1| putative bile acid beta-glucosidase [Synechocystis sp. PCC 7509]
          Length = 804

 Score =  378 bits (971), Expect = e-102,   Method: Compositional matrix adjust.
 Identities = 258/816 (31%), Positives = 398/816 (48%), Gaps = 110/816 (13%)

Query: 87  QGIPLGGIGAGSIGRSYRGEFQRFKLFHGICDDAPVLANQFSVFVSRPNGEKFSSVLCPR 146
            G+PLGG GAG IGRS +G+F  + +  G      + A QFSVF  + N    +  LC  
Sbjct: 41  HGMPLGGFGAGCIGRSSKGDFNLWHIDGGEHTFKSIPACQFSVFEQQSN---LAYALCTE 97

Query: 147 SPGVPKKNTDSGIESWDWNLKGENCTYHALFPRAWTVYDGEPDPELRIVCRQISPFIPHN 206
           +P       D  +++W W  + +  TYHAL+PR+W VY+     +L   C Q SP +P N
Sbjct: 98  AP------EDGSLKAWQWYPQNKG-TYHALYPRSWFVYENVFLSQL--TCEQFSPILPDN 148

Query: 207 YKESSFPASVFTFTLSNSGQTSADVTLLFTWANSVA-------------GDSGLSGHHFN 253
           Y+E+S+P ++F +   N       ++++ TW N V               D G   + + 
Sbjct: 149 YRETSYPVAIFVWKAHNPTDQPLTLSIMLTWQNMVGWFTNSIKSPQVRVRDDGSPVYEYE 208

Query: 254 SKTMTKDGVHGLTLHHRTANGRPPVTFAVAAEETA-------DVHVSECPCFLLSGNSK- 305
            +     G +   +   TA+    +   V +++ A        +  S  P + +S +++ 
Sbjct: 209 PRLGQSAGNYNKLV--ETADSLGCILARVGSDDVALEGIGQWVIATSINPKYQISHHTRW 266

Query: 306 --GITAKDMWNEIKKHGSFDHLDNDKTSPSEPGSSIGAAIAASLTIPSGSTRSVTFSLAW 363
               T  D+W      GS  +  ++  +P   G  + +AIA   T+  G +  + F LAW
Sbjct: 267 NPTGTGADVWQSFSTDGSLPNYIDE--TPIADGEQLASAIAIKFTLEPGESLEIPFVLAW 324

Query: 364 DCPEVKFFEKV-YHRRYTKFYGTLGDSAARIAHDAILEHAKWECEIEAWQRPILEDKRFP 422
           D P  +F   V Y+RRYT F+G  G +A  IA +A+  +  W+ +++AWQ+PI++    P
Sbjct: 325 DFPITEFAPGVNYYRRYTDFFGRNGSNAEAIAFNALQHYQTWQQQVQAWQQPIIDRTDLP 384

Query: 423 EWYPITLFNELYYLNAGGTIWTDGSPPMQSLATIRERKFSLDTPSKIHTPASSDTALGTR 482
           +W+ + LFNELY L +GGT+W+             ER                       
Sbjct: 385 DWFKMALFNELYDLTSGGTLWSAAD----------ER----------------------- 411

Query: 483 LLENGEENIGQFLYLEGAEYVMYNTYDVHFYSSFALVMLFPKLELSIQRDFAAAVMMHDP 542
                 + +GQF  LE  +Y  Y + DV  Y SF L+ML+ KL+ SI R +A A+   D 
Sbjct: 412 ------DPVGQFAVLECLDYRWYESLDVRLYGSFGLLMLWSKLDKSILRAYARAISTSDR 465

Query: 543 GTMKIMSDGKWVARKCLGAVPHDIGL--DDPWFEINSYNLFNSSRWKDLNSKFVLQVYRD 600
              +I  +     RK   A PHD+G   + PW + N  +  + + WKDL S FVLQVYRD
Sbjct: 466 TLREIGYNQATGLRKVANATPHDLGAPNEHPWEKTNYTSYQDCNLWKDLPSDFVLQVYRD 525

Query: 601 FVATG--DKNFARAVWPSVYIAMAYMEQFDKDGDGMIENEGFPDQTYDAWSANGVSAYCG 658
           FV TG  D       W ++   + Y++ FD D DG+ EN G PDQT+D W   GVSAYCG
Sbjct: 526 FVLTGSTDWELLWDCWGAIVQTLNYLKTFDLDNDGIPENSGAPDQTFDDWRLQGVSAYCG 585

Query: 659 GLWVAALQAASALA--------------------NDVGDHASASYFWVRYQKAKAVY-DS 697
           GLW+A L++A A+                     N      +   +    ++++ VY + 
Sbjct: 586 GLWLAGLESAIAIGKTLISYPQEHPLGEMLTSPLNQPPIEETIGIYQTWLEQSRPVYQEK 645

Query: 698 LWNGSYFNYDNSDGSSSTSIQADQLAGQWYARACGLLPIADEAKVKKALTKIYDFNVLKV 757
           LWNG Y+  D+  GS    + ADQL GQ+YAR  G+  I +  +   +L  +Y+   LK 
Sbjct: 646 LWNGQYYKLDSESGSD--VVMADQLCGQFYARLLGVPDIVESDRTVSSLKTVYEACFLKF 703

Query: 758 KGGMCGAMNGMQPDGRIDM-SGLQAREIWPGVTYGLAASMIQEEMVDMAFQTAAGVYEVA 816
             G  GA NG++ DG  +        E+W G+ +GLAA ++Q +M + AF+    V E  
Sbjct: 704 HNGAFGAANGLKIDGSPENPDATHPLEVWTGINFGLAAFLLQMDMKEEAFKLTQTVVEQI 763

Query: 817 WSEDGLGYSFQTPESWNNNDEYRSLCYMRPLTIWAM 852
           ++    G  F+TPE+      +R+  Y+RP+ IWA+
Sbjct: 764 YNN---GLQFRTPEAITAVGTFRACHYLRPMAIWAV 796


>gi|434391596|ref|YP_007126543.1| Glucosylceramidase [Gloeocapsa sp. PCC 7428]
 gi|428263437|gb|AFZ29383.1| Glucosylceramidase [Gloeocapsa sp. PCC 7428]
          Length = 809

 Score =  377 bits (969), Expect = e-101,   Method: Compositional matrix adjust.
 Identities = 272/829 (32%), Positives = 401/829 (48%), Gaps = 118/829 (14%)

Query: 87  QGIPLGGIGAGSIGRSYRGEFQRFKLFHGICDDAPVLANQFSVFVSRPNGEKFSSVLCPR 146
            G+PLGG GAG IGR+ RG+F  + +  G      + A QFS+F  + +  + +  LC  
Sbjct: 40  HGMPLGGFGAGCIGRASRGDFNLWHIDGGEHTFKSLPACQFSIFEQQASTSQ-AFALCTL 98

Query: 147 SPGVPKKNTDSGIESWDWNLK-GENCTYHALFPRAWTVYDGEPDPELRIVCRQISPFIPH 205
            P       D  + +W W  +  EN  YHAL+PR+W VY+      L   C Q SP +P 
Sbjct: 99  PPA------DGTLTTWQWYPQHCENGVYHALYPRSWFVYENVFQAHLS--CEQFSPILPE 150

Query: 206 NYKESSFPASVFTFTLSNSGQTSADVTLLFTWANSVA-------------GDSGLSGHHF 252
           NY+E+S+P +VF +   N       ++++ TW N+V               D G   + +
Sbjct: 151 NYQETSYPVAVFVWRAHNPTNAPLTLSIMLTWQNTVGWFQNAIKNPEIRMRDDGSPVYEY 210

Query: 253 N----------SKTMTKDGVHGLTLHHRTANGRPPV-TFAVAAEETADVHVSECPCFLLS 301
                      ++ + KD + G  L    ++ +  +  +A+A  +  ++ V     +  S
Sbjct: 211 EPCLGNNLGNFNRLIAKDNLCGCVLERVASDLQDGIGQWAIATPKQPNIEVFYHTRWNPS 270

Query: 302 GNSKGITAKDMWNEIKKHGSFDHLDNDKTSPSEPGSSIGAAIAASLTIPSGSTRSVTFSL 361
           G+       ++W+   + GS  + + DKTS SE    + AAIA   T+  G TR + F++
Sbjct: 271 GD-----GAEIWHSFARDGSLPNTE-DKTSASE-QEQLAAAIAVRFTLQPGETRELPFAI 323

Query: 362 AWDCPEVKFFEKV-YHRRYTKFYGTLGDSAARIAHDAILEHAKWECEIEAWQRPILEDKR 420
           AWD P  +F   V Y RRYT F+G  GD+A  IA  A+  +  W+ +I+AWQ+PIL+   
Sbjct: 324 AWDFPITEFAPGVKYFRRYTDFFGRGGDNAVAIAQTALQNYQSWQQQIQAWQQPILDRDD 383

Query: 421 FPEWYPITLFNELYYLNAGGTIWTDGSPPMQSLATIRERKFSLDTPSKIHTPASSDTALG 480
            P  + + LFNELY L +GGT+W+                            A+ D    
Sbjct: 384 LPSVFKMALFNELYDLTSGGTLWS----------------------------AADD---- 411

Query: 481 TRLLENGEENIGQFLYLEGAEYVMYNTYDVHFYSSFALVMLFPKLELSIQRDFAAAVMMH 540
                   + +GQF  LE  +Y  Y + DV  Y SFAL+ML+ KLE S+ R FA A+   
Sbjct: 412 -------RDPVGQFAVLECFDYRWYESLDVRLYGSFALLMLWSKLEKSVIRAFARAIGQR 464

Query: 541 DPGTMKIMS------DGKWVARKCLGAVPHDIGL--DDPWFEINSYNLFNSSRWKDLNSK 592
           D  T  I        +     RK  GA PHD+G   + PW   N  +  + + WKDL+  
Sbjct: 465 DDRTRVIGYYYTQGLESPTALRKVAGATPHDLGAPNEHPWEATNYTSYQDCNLWKDLSCD 524

Query: 593 FVLQVYRDFVATG--DKNFARAVWPSVYIAMAYMEQFDKDGDGMIENEGFPDQTYDAWSA 650
           FVLQVYRDFV TG  D       W +V   + Y++ FD D DG+ EN G PDQT+D W  
Sbjct: 525 FVLQVYRDFVLTGGTDWELLWDCWSAVVQTLTYLKTFDLDEDGIPENSGAPDQTFDDWRL 584

Query: 651 NGVSAYCGGLW----VAALQAASALANDVGDHASAS----------------YFWVRYQK 690
            GVSAYCGGLW     AA+     L +   DH ++                  F     +
Sbjct: 585 QGVSAYCGGLWLAALEAAIAIGKTLLSYPDDHPASKILASAPDYPPIPETLDVFQSWLMR 644

Query: 691 AKAVY-DSLWNGSYFNYDNSDGSSSTSIQADQLAGQWYARACGLLPIADEAKVKKALTKI 749
           ++ +Y + LWNG Y+  D+  GS    + ADQL GQ+YAR  GL  I       +AL  +
Sbjct: 645 SRPIYQEKLWNGQYYRLDSKSGSD--VVMADQLCGQFYARLLGLPDIVPPECATQALKTV 702

Query: 750 YDFNVLKVKGGMCGAMNGMQPDGRIDM-SGLQAREIWPGVTYGLAASMIQEEMVDMAFQT 808
           YD   +K   G  GA NG++PDG  +  +     E+W G+ +GLAA ++Q  M   A++ 
Sbjct: 703 YDACFVKFHNGQFGAANGLKPDGLPENPNATHPLEVWTGINFGLAAFLMQMGMKSEAWRI 762

Query: 809 AAGVYEVAWSEDGLGYSFQTPESWNNNDEYRSLCYMRPLTIWAMQWALT 857
              V +  +     G  F+TPE+      +R+  Y+RP+ IWA+   LT
Sbjct: 763 TQAVVQQVYDN---GLQFRTPEAITAKGTFRACHYLRPMAIWAIYRVLT 808


>gi|427727368|ref|YP_007073605.1| putative bile acid beta-glucosidase [Nostoc sp. PCC 7524]
 gi|427363287|gb|AFY46008.1| putative bile acid beta-glucosidase [Nostoc sp. PCC 7524]
          Length = 803

 Score =  377 bits (969), Expect = e-101,   Method: Compositional matrix adjust.
 Identities = 271/818 (33%), Positives = 399/818 (48%), Gaps = 102/818 (12%)

Query: 87  QGIPLGGIGAGSIGRSYRGEFQRFKLFHGICDDAPVLANQFSVFVSRPNGEKFSSVLCPR 146
            G+PLGG GAG IGRS RG+F  + +  G      + A QFSVF S     + +  L   
Sbjct: 40  HGMPLGGFGAGCIGRSSRGDFNLWHIDGGEHLFQNIPACQFSVFESHGTSSQ-AYALATT 98

Query: 147 SPGVPKKNTDSGIESWDW----NLKGENCTYHALFPRAWTVYDGEPDPELRIVCRQISPF 202
           +P       D  +++W W        E   YHAL+PR+W VY  E   + ++ C Q SP 
Sbjct: 99  AP------EDGSLQAWQWYPASTATQETGAYHALYPRSWFVY--ENVLQAQLTCEQFSPV 150

Query: 203 IPHNYKESSFPASVFTFTLSNSGQTSADVTLLFTWANSVA-------------GDSGLSG 249
             HNY+E+S+P +VF +   N       ++++ TW N                 D G   
Sbjct: 151 YAHNYQETSYPVAVFVWKAHNPTNAPITLSIMLTWQNMTGWFTNALKSPEVKVRDDGSPV 210

Query: 250 HHFNSKTMTKDGVHGLTLHHRTANG----RPPVTFAVAAEETADVHVSE----CPCFLLS 301
           + +  +     G +   +    + G    R  +T  +   E +   V++       F  +
Sbjct: 211 YEYQPRLGESQGNYNYLVESPQSLGCFLGRVGITEPLQEGEGSWCIVTQKHPQVEVFHHT 270

Query: 302 GNSKGITAKDMWNEIKKHGSF-DHLDNDKTSPSEPGSSIGAAIAASLTIPSGSTRSVTFS 360
             +      ++W      GS  D++D   T+P+     IG AIA   T+  G    + F 
Sbjct: 271 RWNAAGKGDEIWQSFAADGSLPDYID---TTPAGENEQIGTAIAVRFTLQPGENLEIPFV 327

Query: 361 LAWDCPEVKFFEKV-YHRRYTKFYGTLGDSAARIAHDAILEHAKWECEIEAWQRPILEDK 419
           LAWD P  +F   V YHRRYT F+GT G++A  IA  A+ ++  W+ +I+AWQ+PI+E +
Sbjct: 328 LAWDFPVTEFAAGVNYHRRYTDFFGTNGNNAWAIATLALEQYQTWQQQIQAWQQPIIERE 387

Query: 420 RFPEWYPITLFNELYYLNAGGTIWTDGSPPMQSLATIRERKFSLDTPSKIHTPASSDTAL 479
             P W  + LFNELY L +GGT+W        S AT R+                     
Sbjct: 388 DLPNWLKMALFNELYDLTSGGTLW--------SAATERDP-------------------- 419

Query: 480 GTRLLENGEENIGQFLYLEGAEYVMYNTYDVHFYSSFALVMLFPKLELSIQRDFAAAVMM 539
                      IGQF  LE  +Y  Y + DV  Y SF LV+LFP+LE S+ R FA A+  
Sbjct: 420 -----------IGQFAVLECLDYRWYESLDVRLYGSFGLVLLFPELEKSVMRAFARAIPQ 468

Query: 540 HD--PGTM----KIMSDGKWVARKCLGAVPHDIGLDDP--WFEINSYNLFNSSRWKDLNS 591
            D  P  +     I ++     RK +GA PHD+G  +   W + N  +  + + WKDL  
Sbjct: 469 GDDTPRVIGYYYTIGAESPVAVRKAVGATPHDLGAPNEHVWQKTNYTSYQDCNLWKDLGC 528

Query: 592 KFVLQVYRDFVATG--DKNFARAVWPSVYIAMAYMEQFDKDGDGMIENEGFPDQTYDAWS 649
            FVLQVYRD++ TG  D  F +  W ++   + Y++ FD DGDG+ EN G PDQT+D W 
Sbjct: 529 DFVLQVYRDYLFTGANDVEFLQDCWDAIVQTLDYVKTFDLDGDGIPENSGAPDQTFDDWR 588

Query: 650 ANGVSAYCGGLWVAALQAASALANDVGDH-------ASASYFWVRYQKAKAVY-DSLWNG 701
             GVSAYCGGLW+AAL+AA A++  +  H          S +    +K+  +Y + LWNG
Sbjct: 589 LQGVSAYCGGLWMAALEAAIAISEILSQHQPDSALLKQKSIYATWLEKSLPIYQEKLWNG 648

Query: 702 SYFNYDNSDGSSSTSIQADQLAGQWYARACGLLPIADEAKVKKALTKIYDFNVLKVKGGM 761
            Y+  D++ GS    + ADQL GQ+YA    L  I    +   AL  +YD   LK   G 
Sbjct: 649 QYYRLDSNSGSD--VVMADQLCGQFYAALLNLPDIVPGDRAISALRTVYDACFLKFYNGQ 706

Query: 762 CGAMNGMQPDGRIDM-SGLQAREIWPGVTYGLAASMIQEEMVDMAFQTAAGVYEVAWSED 820
            GA NG++PDG  +        E+W G+ +GLAA ++Q  M D A +    V +  +S  
Sbjct: 707 FGAANGVRPDGSPENPQATHPLEVWTGINFGLAAFLVQMGMKDEALRLTEAVVQQIYSN- 765

Query: 821 GLGYSFQTPESWNNNDEYRSLCYMRPLTIWAMQWALTK 858
             G  F+TPE+   +  +R+  Y+R + IW +   +++
Sbjct: 766 --GLQFRTPEAITPSGTFRASTYLRAMAIWGVYLVMSE 801


>gi|428210868|ref|YP_007084012.1| putative bile acid beta-glucosidase [Oscillatoria acuminata PCC
           6304]
 gi|427999249|gb|AFY80092.1| putative bile acid beta-glucosidase [Oscillatoria acuminata PCC
           6304]
          Length = 820

 Score =  377 bits (968), Expect = e-101,   Method: Compositional matrix adjust.
 Identities = 270/831 (32%), Positives = 380/831 (45%), Gaps = 127/831 (15%)

Query: 87  QGIPLGGIGAGSIGRSYRGEFQRFKLFHGICDDAPVLANQFSVFVSRPNGEKFSSVLCPR 146
            G+PLGG GAG IGRS RG+F  + L  G      + A QFS+F    N ++ +  LC  
Sbjct: 41  HGMPLGGFGAGCIGRSPRGDFNLWHLDGGEHTFQSLPACQFSIFEETQN-QRHAYALCTE 99

Query: 147 SPGVPKKNTDSGIESWDW----NLKGENCTYHALFPRAWTVYDGEPDPELRIVCRQISPF 202
            P      +D  + SW W      +    TYHAL+PR+W VY      +L   C Q SP 
Sbjct: 100 PP------SDGSLNSWQWYPASTEENPTGTYHALYPRSWFVYQNVLKAQL--TCEQFSPI 151

Query: 203 IPHNYKESSFPASVFTFTLSNSGQTSADVTLLFTWANSVA-------------GDSGLSG 249
           +P  Y+E S+P ++F +T  N       +++L TW N V               D G   
Sbjct: 152 LPGKYQECSYPLAIFEWTAYNPTDEPITLSILLTWENVVGWFTNAIKSPTVKVRDDGSPV 211

Query: 250 HHFNSKTMTKDG---------------VHGLTLHHRTANGRPPVTFAVAAEETADVHVSE 294
           + + SK     G               + G  L      G   +  A A     D   + 
Sbjct: 212 YEYQSKWGQSQGNSNRYIEDFHRIGCIMAGGHLGEVPQEGEGELAIATATNPFLDTFKNT 271

Query: 295 CPCFLLSGNSKGITAKDMWNEIKKHGSFDHLDNDKTSPSEPGSSIGAAIAASLTIPSGST 354
                   N  G + K++W +    GS   LD    +P+  G  IG A+A   TI  G T
Sbjct: 272 ------RWNPTG-SGKEVWEQFSHDGSL--LDCTDETPAAKGERIGVALAVRFTIRPGKT 322

Query: 355 RSVTFSLAWDCPEVKFFEKVY-HRRYTKFYGTLGDSAARIAHDAILEHAKWECEIEAWQR 413
           R + F LAWD P  +F   V+ +RRYT F+G  G ++  +   A+     W   I  WQ+
Sbjct: 323 RKIPFILAWDFPVTEFAAGVWEYRRYTDFFGRHGKNSWSMIRTALKHSEMWRENIITWQQ 382

Query: 414 PILEDKRFPEWYPITLFNELYYLNAGGTIWTDGSPPMQSLATIRERKFSLDTPSKIHTPA 473
            IL+ +   +   + LFNELY L  GGT+WT                             
Sbjct: 383 SILDREDLSDSLKMALFNELYILTDGGTLWT----------------------------- 413

Query: 474 SSDTALGTRLLENGEENIGQFLYLEGAEYVMYNTYDVHFYSSFALVMLFPKLELSIQRDF 533
                      ++ EE  G+F  LE  +Y  Y + DV  Y SFAL+ML+P+LE S+ R F
Sbjct: 414 ----------AQDAEEPRGKFAVLECIDYRWYESLDVRLYGSFALLMLWPELEKSVMRGF 463

Query: 534 AAAVMMHDPGTMKIMSDGKWVARKCLGAVPHDIGL--DDPWFEINSYNLFNSSRWKDLNS 591
           A A+   D     I  +     RK   A PHD+G   + PW + N  +  + + WKDL  
Sbjct: 464 ARAIPTGDDTPRIIGYNQAAAIRKVTDATPHDLGAPNEHPWVKTNYTSYQDCNLWKDLPC 523

Query: 592 KFVLQVYRDFVATG--DKNFARAVWPSVYIAMAYMEQFDKDGDGMIENEGFPDQTYDAWS 649
            F L VYRDF+ TG  D  F    WPS+  A+AY++ FD DGDG+ EN G PDQT+D W 
Sbjct: 524 DFALLVYRDFLLTGGDDYEFLWDCWPSMVQAIAYLKTFDTDGDGIPENGGAPDQTFDDWP 583

Query: 650 ANGVSAYCGGLWVAAL--------------------------QAASALANDVGDHASASY 683
             GVSAYCGGLW+AAL                          Q  S L       A+ S 
Sbjct: 584 MRGVSAYCGGLWIAALQAAIAIGEVILNNPPHSGVFTDTIYEQKRSQLLEPATIQATLST 643

Query: 684 FWVRYQKAKAVYD-SLWNGSYFNYDNSDGSSSTSIQADQLAGQWYARACGLLPIADEAKV 742
           +    ++++ +Y   LWNG+Y+  D+  GS    + ADQL GQ+YAR  GL  I      
Sbjct: 644 YQDWLEQSRPLYQHKLWNGTYYRLDSESGSEV--VMADQLCGQFYARLLGLADIVPIDCA 701

Query: 743 KKALTKIYDFNVLKVKGGMCGAMNGMQPDGR-IDMSGLQAREIWPGVTYGLAASMIQEEM 801
           + AL  +Y+   LK   G  GA NG+  DG  ++       E+W G+ +GLAA ++Q  M
Sbjct: 702 ESALKTVYESCFLKFHNGEFGAANGVLIDGSPVNPDATHPLEVWTGINFGLAAFLLQMGM 761

Query: 802 VDMAFQTAAGVYEVAWSEDGLGYSFQTPESWNNNDEYRSLCYMRPLTIWAM 852
            +  F+    V +  + E+GL   F+TPE+      +R+  Y+R + IWA+
Sbjct: 762 KEEGFKLTETVIQQVY-ENGL--QFRTPEAITAVGTFRASHYLRAMAIWAV 809


>gi|113477064|ref|YP_723125.1| hypothetical protein Tery_3568 [Trichodesmium erythraeum IMS101]
 gi|110168112|gb|ABG52652.1| protein of unknown function DUF608 [Trichodesmium erythraeum
           IMS101]
          Length = 816

 Score =  375 bits (964), Expect = e-101,   Method: Compositional matrix adjust.
 Identities = 277/827 (33%), Positives = 394/827 (47%), Gaps = 120/827 (14%)

Query: 87  QGIPLGGIGAGSIGRSYRGEFQRFKLFHG--ICDDAPVLANQFSVFVSRPNGEKFSSVLC 144
            G+PLGG GAG+ GRS RG+F  + L  G  I  + P  A QFSVF     G+K +  L 
Sbjct: 41  HGMPLGGFGAGATGRSPRGDFNLWHLDGGEHIYQNLP--ACQFSVF-EETKGQKQAYALS 97

Query: 145 PRSPGVPKKNTDSGIESWDWNLKGE----NCTYHALFPRAWTVYDGEPDPELRIVCRQIS 200
              P      TD  + +W W  + +      TYHAL+PR+W VY+     +L   C Q S
Sbjct: 98  TELP------TDGSLSAWQWYPREKAGLKTGTYHALYPRSWFVYENVFTAQLS--CEQFS 149

Query: 201 PFIPHNYKESSFPASVFTFTLSNSGQTSADVTLLFTWANSVA-------------GDSGL 247
           P +  NY+E+S+P ++F +T  N    +  ++++ +W N+V               D G 
Sbjct: 150 PILAGNYQETSYPIAIFEWTAHNPTDEAITLSIMLSWQNTVGWFTNSVKTPEVRVRDDGS 209

Query: 248 SGHHFNSKTMTKDGVHGLTLHHRTANGRPPVTFAVAAEETADVHVSECPCFLLSG----- 302
             + +  +         L +      G      ++A E            F+ +G     
Sbjct: 210 PVYEYKPRWGESTDNFNLLVEDFHRIGCTMTKLSIADEPAEGEGQMAIATFINAGMEVFY 269

Query: 303 ----NSKGITAKDMWNEIKKHGSFDHLDNDKTSPSEPGSSIGAAIAASLTIPSGSTRSVT 358
               N  G T +D+W+     GS   +D +   P+  G  IG AI+   TI  G  R + 
Sbjct: 270 HTRWNPTG-TGEDIWHYFALDGSL--IDEENELPATEGEQIGVAISVRFTIRPGKNRKIP 326

Query: 359 FSLAWDCPEVKFFEKV-YHRRYTKFYGTLGDSAARIAHDAILEHAKWECEIEAWQRPILE 417
           F L WD P   F   V Y+RRYT FYG  G +A  +   A+  +  W   IEAWQ PIL+
Sbjct: 327 FFLVWDLPVTDFGNGVSYYRRYTDFYGRNGKNAWSMIRTAMKHYQTWRENIEAWQNPILQ 386

Query: 418 DKRFPEWYPITLFNELYYLNAGGTIWTDGSPPMQSLATIRERKFSLDTPSKIHTPASSDT 477
            +  P W  + L NELY L +GGTIW                             A+SD+
Sbjct: 387 REDLPNWLKMALLNELYDLTSGGTIWA----------------------------AASDS 418

Query: 478 ALGTRLLENGEENIGQFLYLEGAEYVMYNTYDVHFYSSFALVMLFPKLELSIQRDFAAAV 537
           A             GQF  LE  +Y  Y + DV  Y SF L++L+P+LE S+   FA AV
Sbjct: 419 A-----------PYGQFAVLECLDYRWYESLDVRLYGSFGLLILWPELEKSVLVAFARAV 467

Query: 538 MMHDPGTMKIMSDGKWVA-RKCLGAVPHDIGL--DDPWFEINSYNLFNSSRWKDLNSKFV 594
              D  T++I+   +  A RK  GA PHD+G   + PW   N  +  + ++WKDL+S FV
Sbjct: 468 STAD-DTLRIIGYNQVSAVRKVAGATPHDLGAPNEHPWEMTNYTSYQDCNQWKDLSSDFV 526

Query: 595 LQVYRDFVATG--DKNFARAVWPSVYIAMAYMEQFDKDGDGMIENEGFPDQTYDAWSANG 652
           LQVYRDF+ TG  D  F    W ++   +AY++ FDKD DG+ ENEG PDQT+D W   G
Sbjct: 527 LQVYRDFLLTGADDYEFLWQSWSAITETLAYLKGFDKDNDGIPENEGAPDQTFDDWQLRG 586

Query: 653 VSAYCGGLWVAALQAASALANDVGDHASASYFW---------------------VRYQ-- 689
           VSAYCGGLW+AAL+AA A+   + +H     ++                       YQ  
Sbjct: 587 VSAYCGGLWLAALEAAIAIGKVLIEHPREIPYYPPKGFYSEVDKNSVDAINNQVYLYQGW 646

Query: 690 --KAKAVY-DSLWNGSYFNYDNSDGSSSTSIQADQLAGQWYARACGLLPIADEAKVKKAL 746
             K   +Y + LWNG Y+  D+   S+S  + ADQL+GQ+YA+   L  I        AL
Sbjct: 647 LKKGLPIYQEKLWNGEYYRLDSE--SNSEVVMADQLSGQFYAKLLNLEDIVPAECALSAL 704

Query: 747 TKIYDFNVLKVKGGMCGAMNGMQPDGRIDM-SGLQAREIWPGVTYGLAASMIQEEMVDMA 805
             +Y+        G  GA NG+ PDG  +  +     E+W G+ +GLAA M+Q  M   A
Sbjct: 705 KTVYNSCFKNFHNGKFGAANGVLPDGSPENPNATHPLEVWTGINFGLAAFMVQIGMKKEA 764

Query: 806 FQTAAGVYEVAWSEDGLGYSFQTPESWNNNDEYRSLCYMRPLTIWAM 852
            +    V    + E+GL   F+TPE+      +R+  Y+R + IWA+
Sbjct: 765 LEITEVVVGQIY-ENGL--QFRTPEAITVMGTFRASHYLRAMAIWAI 808


>gi|434389621|ref|YP_007100232.1| putative bile acid beta-glucosidase [Chamaesiphon minutus PCC 6605]
 gi|428020611|gb|AFY96705.1| putative bile acid beta-glucosidase [Chamaesiphon minutus PCC 6605]
          Length = 818

 Score =  375 bits (962), Expect = e-101,   Method: Compositional matrix adjust.
 Identities = 260/833 (31%), Positives = 392/833 (47%), Gaps = 128/833 (15%)

Query: 87  QGIPLGGIGAGSIGRSYRGEFQRFKLFHGICDDAPVLANQFSVFVSRPNGEKFSSVLCPR 146
            G+PLGG GAG IGRS+RGEF  + +  G      + A QFSVF    NG+  +  +  +
Sbjct: 41  HGMPLGGFGAGCIGRSHRGEFNLWHIDGGEHVFKNLSACQFSVFEQLANGDSQAYAMSTQ 100

Query: 147 SPGVPKKNTDSGIESWDW----NLKGENC-TYHALFPRAWTVYDGEPDPELRIVCRQISP 201
           +PG      D  +  W W       G++  +Y AL+PR+W  Y  E   + ++ C Q SP
Sbjct: 101 APG------DGSLNRWQWYPQTTPTGQSTGSYQALYPRSWYSY--ENVFQAQLTCEQFSP 152

Query: 202 FIPHNYKESSFPASVFTFTLSNSGQTSADVTLLFTWANSVA-------------GDSGLS 248
               NY+E+S+P +VF +T  N       ++++ TW N+V               D G  
Sbjct: 153 IWADNYQETSYPIAVFEWTAHNPTDKPITLSIMLTWENTVGWFANTSPSSKVEIRDDGSP 212

Query: 249 GHHFNSKTMTKDGVHG------------LTLHHRT---ANGRPPVTFAVAAEETADVHVS 293
            + +  +    +G               L   H++   + G    + A  A  + +V   
Sbjct: 213 VYDYQPRIGESEGNFNQWIVDNYRVGCLLDKAHKSDIPSEGEGQWSIASIANPSLEV--- 269

Query: 294 ECPCFLLSGNSKGITAKDMWNEIKKHGSFDHLDNDKTSPSEPGSSIGAAIAASLTIPSGS 353
               F  S  +      ++W+     GS   +D++  S ++    I  AI    T+  G 
Sbjct: 270 ----FYHSRWNPIGNGGEIWDSFAADGSLPDIDDE--SEAKQRERIATAITIRFTVKPGQ 323

Query: 354 TRSVTFSLAWDCPEVKFFEKV-YHRRYTKFYGTLGDSAARIAHDAILEHAKWECEIEAWQ 412
           T+ + F LAWD P  +F + + Y+RRYT F+G  G+++  +   A+     W+ +I  WQ
Sbjct: 324 TKKIPFILAWDFPVTEFGKGINYYRRYTDFFGRSGENSWAMVRTALKHSDTWQEQIINWQ 383

Query: 413 RPILEDKRFPEWYPITLFNELYYLNAGGTIWTDGSPPMQSLATIRERKFSLDTPSKIHTP 472
           +PIL+    P+W+ + LFNELY L  GGTIW+                            
Sbjct: 384 QPILDRSDLPDWFKMALFNELYLLTDGGTIWS---------------------------- 415

Query: 473 ASSDTALGTRLLENGEENIGQFLYLEGAEYVMYNTYDVHFYSSFALVMLFPKLELSIQRD 532
           A+ D            + +GQF  LE  +Y  Y + DV  Y SF++++L+PKLE SI   
Sbjct: 416 AADD-----------RDPVGQFGVLECLDYRWYESLDVRLYGSFSVLLLWPKLEKSILIA 464

Query: 533 FAAAVMMHDPGT-----------------MKIMSDGKWVARKCLGAVPHDIGL--DDPWF 573
           FA A+   D                      +  D     RK L A PHD+G   + PW 
Sbjct: 465 FARAIDTEDTNKRLIGYYVQTAFGYEDRPQNVDRDAMMAPRKTLHATPHDLGAPNEHPWE 524

Query: 574 EINSYNLFNSSRWKDLNSKFVLQVYRDFVATGDKN--FARAVWPSVYIAMAYMEQFDKDG 631
           + N     + + WKDL S FVLQVYR ++ TG K+  F    W ++   +AY++QFD D 
Sbjct: 525 KTNYTCYQDCNLWKDLGSDFVLQVYRAYIFTGKKDLEFLAECWTAIVDTLAYLKQFDLDN 584

Query: 632 DGMIENEGFPDQTYDAWSANGVSAYCGGLWVAALQAASALANDVGDH------ASASYFW 685
           DG+IEN G PDQT+D W   G+SAYCGGLW+AAL+AA ++ N +            +   
Sbjct: 585 DGIIENSGAPDQTFDDWRLQGISAYCGGLWLAALEAAISIGNILTQQYHGELTPKPAEVI 644

Query: 686 VRYQ----KAKAVY-DSLWNGSYFNYDNSDGSSSTSIQADQLAGQWYARACGLLPIADEA 740
            +YQ      K +Y   LWNGSY+  D   GS S  + ADQL GQ+Y +   L  I    
Sbjct: 645 PQYQAWLDSGKPIYQQQLWNGSYYRIDT--GSGSEVVMADQLCGQFYTQLLELDDIVPPD 702

Query: 741 KVKKALTKIYDFNVLKVKGGMCGAMNGMQPDGR-IDMSGLQAREIWPGVTYGLAASMIQE 799
             K AL  +Y    L  +GG  GA NG++ DG   +       E+W G+ +G+AA ++Q 
Sbjct: 703 CAKTALQTVYQSCFLNFQGGKFGAANGVKIDGSPSNPKDTHPLEVWTGINFGIAAFLVQM 762

Query: 800 EMVDMAFQTAAGVYEVAWSEDGLGYSFQTPESWNNNDEYRSLCYMRPLTIWAM 852
            M + A + A  + E  +S    G  F+TPE+  +   +R+  Y+R + IW +
Sbjct: 763 GMQEEALKLAETIVEQVYSN---GLQFRTPEAITSAGTFRASHYLRAMAIWGI 812


>gi|428299644|ref|YP_007137950.1| glucosylceramidase [Calothrix sp. PCC 6303]
 gi|428236188|gb|AFZ01978.1| Glucosylceramidase [Calothrix sp. PCC 6303]
          Length = 804

 Score =  374 bits (961), Expect = e-101,   Method: Compositional matrix adjust.
 Identities = 263/823 (31%), Positives = 381/823 (46%), Gaps = 117/823 (14%)

Query: 87  QGIPLGGIGAGSIGRSYRGEFQRFKLFHGICDDAPVLANQFSVFVSRPNGEKFSSVLCPR 146
            G+PLGG GAG IGRS+ G+F  + +  G      + A QFS+F S     +  ++    
Sbjct: 40  HGMPLGGFGAGCIGRSHGGDFNFWHIDGGEHVFKNIPACQFSLFESVGESSQAYAL---- 95

Query: 147 SPGVPKKNTDSGIESWDW----NLKGENCTYHALFPRAWTVYDGEPDPELRIVCRQISPF 202
                    D  ++SW +    + +    TYHAL+PR+W  Y G    +  I C+Q SP 
Sbjct: 96  ---ATAAPADGTLQSWKYYPQSSPEQNTGTYHALYPRSWFEYGGVY--QANITCQQFSPI 150

Query: 203 IPHNYKESSFPASVFTFTLSNSGQTSADVTLLFTWANSVA-------------GDSGLSG 249
              NY+ESS+P +VF + L N       ++++ TW N V               D G   
Sbjct: 151 WADNYQESSYPVAVFLWNLHNPTNAPITLSIMLTWENMVGWFTNTSKSPEVIVRDDGSPV 210

Query: 250 HHFNSKTMTKDGVHGLTLHHRTANGRPPVTFAVAAEETADVHVSECP---CFLLSG---- 302
           + +  +     G      +    NG    T  V     +   V E     CFL S     
Sbjct: 211 YEYQPRLGQSQG----NFNEIVDNGN---TIGVVLSGISGEVVQEGDGQWCFLTSKQAGV 263

Query: 303 --------NSKGITAKDMWNEIKKHGSFDHLDNDKTSPSEPGSSIGAAIAASLTIPSGST 354
                   N  G   +++W      GS  +  +   +  E    +  A+A  LT+  G T
Sbjct: 264 EIFSHSRWNPTG-NGEEIWQSFAADGSLSNYQDATPATDE---RLAGALAVRLTLQPGET 319

Query: 355 RSVTFSLAWDCPEVKFFEKV-YHRRYTKFYGTLGDSAARIAHDAILEHAKWECEIEAWQR 413
             V F L WD P  +F   V Y+RRYT F+G+ G +A +IA  A+ ++  W  +I+ WQ+
Sbjct: 320 LEVPFVLTWDFPVTEFAAGVNYNRRYTDFFGSNGKNALQIAETALEQYRDWLEKIQTWQK 379

Query: 414 PILEDKRFPEWYPITLFNELYYLNAGGTIWTDGSPPMQSLATIRERKFSLDTPSKIHTPA 473
           PIL     P+W  + LFNELY L AGGT+W+  SP                         
Sbjct: 380 PILNRHDLPDWMKMALFNELYDLTAGGTLWSAASP------------------------- 414

Query: 474 SSDTALGTRLLENGEENIGQFLYLEGAEYVMYNTYDVHFYSSFALVMLFPKLELSIQRDF 533
                         ++ +GQF  LE  +Y  Y + DV  Y SF L+MLFP+LE ++ R F
Sbjct: 415 --------------KDPVGQFAVLECLDYRWYESLDVRLYGSFGLLMLFPELEKAVMRAF 460

Query: 534 AAAVMMHDPGT------MKIMSDGKWVARKCLGAVPHDIGLDDP--WFEINSYNLFNSSR 585
           A A+   D  T        I ++     RK  GA PHD+G  +   W + N  +  + + 
Sbjct: 461 ARAIPASDEKTRVIGYYYTIGAESPLAIRKAAGATPHDLGAPNEHVWEKTNYTSYQDCNL 520

Query: 586 WKDLNSKFVLQVYRDFVATGDKN--FARAVWPSVYIAMAYMEQFDKDGDGMIENEGFPDQ 643
           WKDL S FVLQVYRDFV TG K+  F    W ++ + + Y++ FDKD DG+ EN G PDQ
Sbjct: 521 WKDLGSDFVLQVYRDFVFTGSKDIEFLAECWDAIALTLEYLKAFDKDHDGIPENSGAPDQ 580

Query: 644 TYDAWSANGVSAYCGGLW----VAALQAASALANDVGDHASASYFWVRYQ----KAKAVY 695
           T+D W   GVSAYCGGLW     AA+     L  + GD    +   + Y     K++ +Y
Sbjct: 581 TFDDWKLQGVSAYCGGLWLAALEAAIAICDILLENHGDQEKLTAQKIIYTDWLAKSRPIY 640

Query: 696 DS-LWNGSYFNYDNSDGSSSTSIQADQLAGQWYARACGLLPIADEAKVKKALTKIYDFNV 754
              LWNG Y+  D+  GS    + ADQL GQ+YAR   L  I        A+  +Y+   
Sbjct: 641 QQKLWNGQYYRLDSESGSD--VVMADQLCGQFYARLLKLADIVPHDCAISAVRTVYEACF 698

Query: 755 LKVKGGMCGAMNGMQPDGRIDM-SGLQAREIWPGVTYGLAASMIQEEMVDMAFQTAAGVY 813
           +    G  GA NG++PDG  +        E+W G+ +GLAA ++Q  M D A++    V 
Sbjct: 699 INFNDGKFGAANGLRPDGSPENPKATHPLEVWTGINFGLAAFLMQMGMQDEAWKLTEVVV 758

Query: 814 EVAWSEDGLGYSFQTPESWNNNDEYRSLCYMRPLTIWAMQWAL 856
           +  +     G  F+TPE+      +R+  Y+R + IW +   L
Sbjct: 759 QQIYDN---GLQFRTPEAITATGTFRACTYLRAMAIWGIYLVL 798


>gi|427712531|ref|YP_007061155.1| putative bile acid beta-glucosidase [Synechococcus sp. PCC 6312]
 gi|427376660|gb|AFY60612.1| putative bile acid beta-glucosidase [Synechococcus sp. PCC 6312]
          Length = 799

 Score =  374 bits (961), Expect = e-100,   Method: Compositional matrix adjust.
 Identities = 261/817 (31%), Positives = 382/817 (46%), Gaps = 117/817 (14%)

Query: 87  QGIPLGGIGAGSIGRSYRGEFQRFKLFHGICDDAPVLANQFSVFVSRPNGEKFSSVLCPR 146
            G+PLGG GAG IGRS RG+F  + L  G      +   QFSVF  +      +  LC  
Sbjct: 38  HGMPLGGFGAGCIGRSSRGDFNLWHLDGGEHIFQTMPGCQFSVF-EQAGDHSQAYALCTE 96

Query: 147 SPGVPKKNTDSGIESWDWNLKGENCTYHALFPRAWTVYDGEPDPELRIVCRQISPFIPHN 206
            P      +D  + +W+W    +  TY AL+PR+W  Y G    +  + C Q SP IP N
Sbjct: 97  PP------SDGTLGAWNWYPTAKG-TYGALYPRSWFQYQGVF--QAGLTCEQFSPIIPDN 147

Query: 207 YKESSFPASVFTFTLSNSGQTSADVTLLFTWANSVAGDS--------------------- 245
           Y+E+S+P +VF +   N       ++++ +W N V   S                     
Sbjct: 148 YQETSYPVAVFVWAAHNPTDQPITLSIMLSWQNLVGWFSNATPASEIRIRDDGSPVYDYT 207

Query: 246 ---GLSGHHFNSKTMTKDGVHGLTLHHRTAN-----GRPPVTFAVAAEETADVHVSECPC 297
              G S  ++NS   ++     L    R  +     G     F+V   +  ++       
Sbjct: 208 PRLGQSQGNYNSFHQSQTQASCLLQRQRDPHTAIKEGEGEWAFSVPVRDDLEIF------ 261

Query: 298 FLLSGNSKGITAKDMWNEIKKHGSFDHLDNDKTSPSEPGSSIGAAIAASLTIPSGSTRSV 357
           +    N  G    ++W+   + GS  +L+N++  P+     IGA +A   T+  G ++ +
Sbjct: 262 YHTRWNPVG-DGGELWSSFSRDGSLSNLENNQ--PASDIEQIGATLAVRFTLAPGESKEI 318

Query: 358 TFSLAWDCPEVKFFEKV-YHRRYTKFYGTLGDSAARIAHDAILEHAKWECEIEAWQRPIL 416
            F LAWD P  +F   V Y RRYT F+   G +A  IA  A+  + KW  +I AWQ+PIL
Sbjct: 319 PFGLAWDLPVTEFAPNVNYFRRYTDFWNRSGTNAETIAQVALENYQKWSEQIGAWQQPIL 378

Query: 417 EDKRFPEWYPITLFNELYYLNAGGTIWTDGSPPMQSLATIRERKFSLDTPSKIHTPASSD 476
           E   +P+ + + L NELY L +GG++WT  S               LD            
Sbjct: 379 EHPDWPDDFKMALINELYDLTSGGSLWTAAS--------------ELDP----------- 413

Query: 477 TALGTRLLENGEENIGQFLYLEGAEYVMYNTYDVHFYSSFALVMLFPKLELSIQRDFAAA 536
                         +GQ+  LE  +Y  Y + DV  Y SFAL+ L+P+LE  + R FA A
Sbjct: 414 --------------LGQYAVLECLDYRWYESLDVRLYGSFALLYLWPELEKGVMRAFARA 459

Query: 537 VMMHDPGTMKIM--------SDGKWVARKCLGAVPHDIGL--DDPWFEINSYNLFNSSRW 586
           +   DP  ++I+               RK   A PHD+G   + PW   N  +  + + W
Sbjct: 460 IPTEDP-KLRIIGYYYQGDPETAHRAPRKIAHATPHDLGAPNEHPWQASNYTSYQDCNLW 518

Query: 587 KDLNSKFVLQVYRDFVATG--DKNFARAVWPSVYIAMAYMEQFDKDGDGMIENEGFPDQT 644
           KDL S FVL VYRD++ TG  D  F    WP++  ++ Y++ FD DGDG+ EN G PDQT
Sbjct: 519 KDLASDFVLLVYRDYLLTGATDLEFLTDCWPAIVSSLDYLKTFDTDGDGLPENSGAPDQT 578

Query: 645 YDAWSANGVSAYCGGLW-------VAALQAASALANDVGDHASASYFWVRYQKAKAVYDS 697
           YD W   GVSAYCGGLW       +A  Q      N+          W+   +++  Y +
Sbjct: 579 YDDWQLKGVSAYCGGLWIAALEAALAIAQILEKQGNNTKLEQEKWQAWL--SQSRPTYQA 636

Query: 698 -LWNGSYFNYDNSDGSSSTSIQADQLAGQWYARACGLLPIADEAKVKKALTKIYDFNVLK 756
            LWNG ++  D+   S+S  + ADQL GQ+YAR   L  +      + AL  +YD   LK
Sbjct: 637 KLWNGQFYRLDSD--SNSQVVMADQLCGQYYARLLNLPDVVPIECAESALKTVYDVCFLK 694

Query: 757 VKGGMCGAMNGMQPDGRIDM-SGLQAREIWPGVTYGLAASMIQEEMVDMAFQTAAGVYEV 815
              G  GA NG+ PDG  +  +     E+W G+ YGLAA + Q  + D A++    V   
Sbjct: 695 FHEGQFGAANGLLPDGSPEKPNATHPLEVWIGINYGLAAFLYQMGLEDGAWRLTQTVIRQ 754

Query: 816 AWSEDGLGYSFQTPESWNNNDEYRSLCYMRPLTIWAM 852
            +     G  F+TPE+      +R+  Y+RP+ IWAM
Sbjct: 755 IYDN---GLQFRTPEAITAVGTFRACMYLRPMAIWAM 788


>gi|366162568|ref|ZP_09462323.1| hypothetical protein AcelC_02756 [Acetivibrio cellulolyticus CD2]
          Length = 842

 Score =  371 bits (953), Expect = e-99,   Method: Compositional matrix adjust.
 Identities = 258/777 (33%), Positives = 388/777 (49%), Gaps = 103/777 (13%)

Query: 88  GIPLGGIGAGSIGRSYRGEFQRFKLFHGICDDAPVLANQFSVFVSRPNGEKFSSVLCPRS 147
           G P+GG GAG+I   + G F + +L  G  + +      F ++  +P+G+  S       
Sbjct: 70  GAPIGGFGAGTITWKFDGSFTKGRLDIGSNELSSDADCAFYMY-QKPSGQNIS------- 121

Query: 148 PGVPKKNTDSGIESWDWNLKGENCTYHALFPRAWTVYDGEPDPELRIVCRQISPFIPHNY 207
                K  D+G      ++     TY++LFP++W  Y G  +   +    Q SP IP NY
Sbjct: 122 ----MKKLDAG------SIGAGQATYYSLFPKSWVDYSGS-NFYCKAKVTQFSPIIPGNY 170

Query: 208 KESSFPASVFTFTLSNSGQTSADVTLLFTWANSVAGDSGLSGHHFNSK--TMTKDGVHGL 265
           + +S+P  V+ + ++N G +S DV ++ +W N   G         NSK   + ++G    
Sbjct: 171 ETTSYPVGVYKWEITNPGSSSCDVAIMLSWKNEFGGSFAEGQTSGNSKGIILRRNGTSNA 230

Query: 266 TLHHRTANGRPPVTFAVAAEETADVHVSECPCFLLSGNSKGITAKDMWNEIKKHGSFDHL 325
           T+ ++         F++AAE+   V V+        G++ G +       +      D L
Sbjct: 231 TMENQGE-------FSLAAEQKDGVKVTY-------GSASGTST------LSSDFGTDGL 270

Query: 326 DNDKTSPSEPGSSIGAAIAASLTIPSGSTRSVTFSLAWDCPEVKFFE-KVYHRRYTKFYG 384
             D       GS+   A+A   T+  G T +V  +L+WD P  +      ++R+YT+FYG
Sbjct: 271 LGDTV-----GSNNIGAVAFKATLEPGQTITVPVALSWDIPISQAGSGSKWYRKYTRFYG 325

Query: 385 TLGDSAARIAHDAILEHAKWECEIEAWQRPILEDKRFPEWYPITLFNELYYLNAGGTIWT 444
             G ++ +IA DAI  ++ WE  I++WQ  +L D ++P+W   T+FNELYY   GGT W 
Sbjct: 326 RSGLNSWKIAKDAIDNYSSWESSIDSWQNSVLNDTKYPQWLKTTMFNELYYYFTGGTYWE 385

Query: 445 DGSPPMQSLATIRERKFSLDTPSKIHTPASSDTALGTRLLENGEENIGQFLYLEGAEYVM 504
            G+   Q                                 +N +E++  F +LE  +Y  
Sbjct: 386 AGAASGQK--------------------------------DNPDEDM--FSHLECYKYDF 411

Query: 505 YNTYDVHFYSSFALVMLFPKLELSIQRDFAAAVMMHDPGTMKIMSDGKWVARKCLGAVPH 564
           Y T DV FY S+ L++++P ++    R FA +V  +   T K            +G   H
Sbjct: 412 YGTSDVRFYGSWPLILMWPDIDKQAVRQFADSV--YHTRTDK---------PAAIGTTAH 460

Query: 565 DIGLDDPWFEI-NSYNLFNSSRWKDLNSKFVLQVYRDFVATG--DKNFARAVWPSVYIAM 621
           DIG  +  FE  N+Y   +S+ WKDLNSK VL VYRD+  TG  D +F +  W  V  AM
Sbjct: 461 DIGNANNVFETWNAYVYRDSTNWKDLNSKLVLMVYRDWALTGKTDMDFLKYCWIPVQKAM 520

Query: 622 AYMEQFDKDGDGMIENEGFPDQTYDAWSANGVSAYCGGLWVAALQAASALANDVGDHASA 681
             ++  D DGD +  + G  DQTYD     G +AYCGGL+VAA  AA  +A  +GD   A
Sbjct: 521 DKVKSQDSDGDYLPNSSGI-DQTYDDMQLQGNTAYCGGLFVAACNAAKEIATAMGDSTKA 579

Query: 682 SYFWVRYQKAKAVYDS-LWNGSYFNYDNSDGSSSTS-IQADQLAGQWYARACGLLPIADE 739
           + +   Y+K+K  Y + LWNGSYF  D   GS  TS I +DQL G WYA+ACGL  I  +
Sbjct: 580 TQYQSWYEKSKENYIAKLWNGSYFKIDT--GSYDTSRIMSDQLCGHWYAKACGLEGIVPD 637

Query: 740 AKVKKALTKIYDFNVLKVKGGMCGAMNGMQPDGRIDMSGLQAREIWPGVTYGLAASMIQE 799
              KK+ +K+Y+FN  K   G+ G +N M+ DG +DMS  Q  E W G +YG+ A MIQE
Sbjct: 638 EYAKKSYSKVYEFNNKKFDNGLHGFVNIMKADGNVDMSHAQTAEAWVGTSYGVIAGMIQE 697

Query: 800 EMVDMAFQTAAGVYEVAWSEDGLGYSFQTPESWNNN-DEYRSLCYMRPLTIWAMQWA 855
            +   A Q  + + +  W  + +   F+TPE+W     E R+  YMR   IWA++ A
Sbjct: 698 GLETEASQVGSNLADTVWKTNDMW--FRTPEAWRQGVSEVRAPYYMRANCIWAVKHA 752


>gi|290990718|ref|XP_002677983.1| predicted protein [Naegleria gruberi]
 gi|284091593|gb|EFC45239.1| predicted protein [Naegleria gruberi]
          Length = 937

 Score =  370 bits (950), Expect = 2e-99,   Method: Compositional matrix adjust.
 Identities = 263/812 (32%), Positives = 395/812 (48%), Gaps = 117/812 (14%)

Query: 115 GICDDAPVLANQFSVFVSRPNGEKFSSVL---CPRSPGVPKKNTDSGIE-SW----DWNL 166
           GI  D  +  NQFS+ V R NGE  S VL   C    G   K+    ++ +W    D   
Sbjct: 178 GIYSDTVLPVNQFSLRVER-NGEIESFVLNPICEFKDGTMDKDLIKKLKHNWMFINDERR 236

Query: 167 KGENCTY----HALFPRAWTVYDGEPDPELRIVCRQISPFIPHNYKESSFPASVFTFTLS 222
           +     +    ++LFP++WTVY    D +  ++C + SP   HNY+ESSFP   F +T+ 
Sbjct: 237 RKNKMQFSQQVYSLFPKSWTVYQT-SDEKFTVICERYSPVWAHNYQESSFPTCNFEWTII 295

Query: 223 NSGQTSADVTLLFTWANSVAGDSGLSGHHFNSKTMTK--DGVHGLTLHHRTANG---RPP 277
           N+    A+V+LLFT    +  +      +  S       D V  LT+     +    +  
Sbjct: 296 NNTDEEANVSLLFT----IQSEEKFRKKNEQSIIARNETDSVKCLTIETTVTDSFSHKDT 351

Query: 278 VTFAVAAEETADVHVSECPCFLLSGNSKGIT-AKDMWNEIKKHGSF------DHLDNDKT 330
           + F + A  +          FL   N    +   D+W E   +G F      +       
Sbjct: 352 IQFCIGANTSLG-----SITFLDEFNGTNESHLHDLWQEFNNNGEFSTEFCENFAKRHDW 406

Query: 331 SPSEPGSSIGAAIAASLTIPSGSTRSVTFSLAWDCPEVKF---FEKVYHRRYTKFYGTLG 387
           + S+       AI + + IPS S++ V F+L W+ P V+F    E  Y R YT+F     
Sbjct: 407 TKSQSKHLSCHAICSKVQIPSNSSKQVCFNLNWNAPFVRFGPTMEFRYLRHYTQFCTDAI 466

Query: 388 DSAARIAHDAILEHAK--------WECEIEAWQRPILED--KRFPEWYPITLFNELYYLN 437
            S  +    A  E  K        W+ EI  W + +++   K  P      LFNELYY+ 
Sbjct: 467 SSITQHKDTACFEMCKRSLDSSKQWKDEISQWHQGVIDQVGKDHPTAVA-ALFNELYYIV 525

Query: 438 AGGTIWTDGSPPMQSLATIRERKFSLDTPSKIHTPASSDTALGTRLLENG--EENIGQFL 495
            GGT+WT+G                                    L ENG  +++   F 
Sbjct: 526 DGGTLWTNGG----------------------------------ELDENGNSQDSKDYFF 551

Query: 496 YLEGAEYVMYNTYDVHFYSSFALVMLFPKLELSIQRDFAAAVMMHDPGTMKIMSDGKWVA 555
           YLEG EY+M NTYDVHFY+S A++M +P+++LSIQRD   AV   +   +          
Sbjct: 552 YLEGHEYLMCNTYDVHFYASHAMIMNWPEIQLSIQRDIIRAVREENNQEVTFYLSNMKNR 611

Query: 556 RKCLGAVPHDIGL--DDPWFEINSYNLFNSSRWKDLNSKFVLQVYRDFVATGDKNFARAV 613
           RK  G+VPHD+G   + PW  +N+YN  + ++WKDLNS+F+L ++RDF+ T ++ F    
Sbjct: 612 RKVKGSVPHDVGTPYERPWKLVNAYNFQDVNKWKDLNSQFILTIFRDFMITKNREFLEEA 671

Query: 614 WPSVYIAMAY-MEQFDKDGDGMIENEGFPDQTYDAWSANGVSAYCGGLWVAALQAASALA 672
           +PS+ +A+ Y +  FDKDGDG+IENE FPD TYDAW   GVSAY G LW+A+L+A   +A
Sbjct: 672 FPSIILALNYSLTNFDKDGDGLIENESFPDTTYDAWKVTGVSAYSGMLWIASLKAILEMA 731

Query: 673 NDVGDHASASYFWVRYQKAKAVYDS-LWN--GSYFNYDNSDGSSSTSIQADQLAGQW--- 726
            ++ D  +        +     ++S LW+    Y++YD S      SI +D L G +   
Sbjct: 732 KELDDENTRQKVEPLIESGLKTFESKLWDETNQYYHYDASTEPQHDSIMSDHLHGTFMLL 791

Query: 727 ----YARACG-------LLPIADEAKVKKALTKIYDFNVLKVK--GGMCGAMNGMQPDGR 773
               Y +  G       L+P   E +V K+L KI + N +K +    + G +NGM+P   
Sbjct: 792 LIGNYVKNSGRCEDSHYLIPFNME-RVSKSLNKILENNFVKYQQITKLGGCVNGMRPTQE 850

Query: 774 IDMSGLQAREIWPGVTYGLAASMI----QEEMVDMAFQTAAGVYEVAWSEDGLGYSFQTP 829
           +D + LQ+RE+W G +Y +AA  I    +EE +D+       V++  WS +   + FQTP
Sbjct: 851 VDKTSLQSREMWTGTSYVVAALTILLGRREEGIDL----MKSVFDKCWSSNS-AFWFQTP 905

Query: 830 ESWNNNDEYRSLCYMRPLTIWAMQWALTKPKL 861
           E+     EYR+L Y RPL+IW+   A+   K+
Sbjct: 906 EAVTETGEYRALGYCRPLSIWSCLSAMMNKKI 937


>gi|443318654|ref|ZP_21047901.1| putative bile acid beta-glucosidase [Leptolyngbya sp. PCC 6406]
 gi|442781756|gb|ELR91849.1| putative bile acid beta-glucosidase [Leptolyngbya sp. PCC 6406]
          Length = 857

 Score =  369 bits (946), Expect = 6e-99,   Method: Compositional matrix adjust.
 Identities = 273/864 (31%), Positives = 393/864 (45%), Gaps = 158/864 (18%)

Query: 87  QGIPLGGIGAGSIGRSYRGEFQRFKLFHGICDDAPVLANQFSVFVSRPNGEKFSSVLCPR 146
            G PLGG GAG IGRS  G+F  + L  G        A QFSVF  +   ++ +  L   
Sbjct: 39  HGAPLGGFGAGCIGRSPHGDFNLWHLDGGEHTFQTFPACQFSVF-EQWGDQRQAYALSTE 97

Query: 147 SPGVPKKNTDSGIESWDW---NLKGEN--------------------CTYHALFPRAWTV 183
           +P      TD  +  W W    L+  N                     TYHAL+PR+W V
Sbjct: 98  AP------TDGTLGQWQWYPATLQNSNQNSKFKIQNSELPASPPLSTGTYHALYPRSWYV 151

Query: 184 YDGEPDPELRIVCRQISPFIPHNYKESSFPASVFTFTLSNSGQTSADVTLLFTWANSVA- 242
           Y  E   + ++ C Q SP    NY+E+S+P +VF +T  N       +++L +W N V  
Sbjct: 152 Y--ENVFQAQLTCEQFSPIWARNYQEASYPVAVFLWTAHNPTDQPITLSILLSWQNMVGW 209

Query: 243 ------------GDSGLSGHHF---------NSKTMTKDGVH-GLTLHHRTANGRPPV-- 278
                        D G   + +         N+  + + G   G  ++   + G  P   
Sbjct: 210 FTNTAKSPQVLQRDDGSPYYDYVSAIGQSAGNANRLVQTGAAVGYVMNGAISPGITPTEG 269

Query: 279 --TFAVAAEETADVHVSECPCFLLSGNSKGITAKDMWNEIKKHGSFDHLDNDKTSPSEPG 336
              +A+A      V V     +    N  G    D+WN     GS  +  +D+  P+  G
Sbjct: 270 EGQWAIATLTQPGVEV----FYHTRWNPVG-DGGDLWNGFAIDGSLVNSGDDR--PAAAG 322

Query: 337 SSIGAAIAASLTIPSGSTRSVTFSLAWDCPEVKFFEKVY-HRRYTKFYGTLGDSAARIAH 395
             IG AIA   T+  G TR +  +L WD P  ++   V  +RRYT F+G  G +A  IA 
Sbjct: 323 EQIGGAIAVRFTLQPGETRQMPMTLVWDLPVTEYAAGVADYRRYTDFFGREGQNAWVIAQ 382

Query: 396 DAILEHAKWECEIEAWQRPILEDKRFPEWYPITLFNELYYLNAGGTIWTDGSPPMQSLAT 455
            A+  +  W  +I  WQ+PIL+   FP+W+ + LFNELY L +GGT+WT           
Sbjct: 383 TALANYTTWRQQIINWQQPILDQPDFPDWFKMALFNELYDLTSGGTLWT----------- 431

Query: 456 IRERKFSLDTPSKIHTPASSDTALGTRLLENGEENIGQFLYLEGAEYVMYNTYDVHFYSS 515
                            A+++T           + +GQF  LE  +Y  Y + DV  Y  
Sbjct: 432 -----------------AATET-----------DPVGQFGVLECVDYRWYESLDVRLYGG 463

Query: 516 FALVMLFPKLELSIQRDFAAAVMMHD--PGTM----KIMSDGKWVARKCLGAVPHDIGL- 568
           FA ++L+P+LE ++ R FA A+   D  P  +     I     +  RK  GA PHD+G  
Sbjct: 464 FATLILWPELEKAVIRAFARAIPTADDHPRIIGYYYTIGETDHYAPRKLPGATPHDLGAP 523

Query: 569 -DDPWFEINSYNLFNSSRWKDLNSKFVLQVYRDFVATG--DKNFARAVWPSVYIAMAYME 625
            + P+ + N  +  + ++WKDL S FV+QVYR +  TG  D +F    WP+V   + Y++
Sbjct: 524 NEHPFVKTNYTSYQDCNQWKDLPSDFVIQVYRAYKMTGATDGDFLADCWPAVTETLKYLK 583

Query: 626 QFDKDGDGMIENEGFPDQTYDAWSANGVSAYCGGLWVAALQAASALANDVGDHASA---- 681
           +FD D DG+ EN G PDQT+D W   G+SAYCGGLW+AAL++A A+A+ +     A    
Sbjct: 584 RFDTDSDGIPENGGAPDQTFDDWQLKGLSAYCGGLWMAALESAIAIADTLTAAGRAPGNT 643

Query: 682 -----SYFWVRYQKAKAVYDSLWNGSYFNYDNSDGSSSTSIQADQLAGQWYARACGLLPI 736
                 Y     Q  K  +  LWNG Y+  +   GS S+ + ADQL GQ+ AR   L  +
Sbjct: 644 PILVTQYRRWLEQSRKVYHKRLWNGRYYRLET--GSDSSVVMADQLCGQFCARLMELPDL 701

Query: 737 ADEAKVKKALTKIYDFNVLKVK---------------------------GGMCGAMNGMQ 769
            +E  V+ AL  IYD   LK                             G   GA NG+ 
Sbjct: 702 VEEKYVQSALAAIYDACFLKFNNYAATQARPQEQKFIGSQTGTFRSAQPGIPIGAANGVL 761

Query: 770 PDGRI-DMSGLQAREIWPGVTYGLAASMIQEEMVDMAFQTAAGVYEVAWSEDGLGYSFQT 828
           PDG   D  G    E+W G+ +GLA    Q   VD AF     V +  ++    G  F+T
Sbjct: 762 PDGSPEDPDGTHQLEVWTGINFGLATFFAQMGQVDKAFAITEAVVQQIYTH---GLQFRT 818

Query: 829 PESWNNNDEYRSLCYMRPLTIWAM 852
           PE+      +R+  Y+RPL IW +
Sbjct: 819 PEAITALGTFRACHYLRPLAIWGL 842


>gi|75907319|ref|YP_321615.1| hypothetical protein Ava_1096 [Anabaena variabilis ATCC 29413]
 gi|75701044|gb|ABA20720.1| Protein of unknown function DUF608 [Anabaena variabilis ATCC 29413]
          Length = 804

 Score =  368 bits (944), Expect = 1e-98,   Method: Compositional matrix adjust.
 Identities = 270/818 (33%), Positives = 389/818 (47%), Gaps = 110/818 (13%)

Query: 87  QGIPLGGIGAGSIGRSYRGEFQRFKLFHG--ICDDAPVLANQFSVFVSRPNGEKFSSVLC 144
            G+PLGG GAG IGRS RG+F  + +  G  I  + P  A QFSVF S  NG    +   
Sbjct: 40  HGMPLGGFGAGCIGRSSRGDFNLWHIDGGEHIFQNFP--ACQFSVFES--NGTSSQAYAL 95

Query: 145 PRSPGVPKKNTDSGIESWDW----NLKGENCTYHALFPRAWTVYDGEPDPELRIVCRQIS 200
              P       D  ++SW W           TYHAL+PR+W VY+     EL   C Q S
Sbjct: 96  STQP-----TDDGSLKSWQWYPASTATQTTGTYHALYPRSWFVYENVFQAEL--TCEQFS 148

Query: 201 PFIPHNYKESSFPASVFTFTLSNSGQTSADVTLLFTWANSVA-------------GDSGL 247
           P   +NY+E+S+P +VF +   N       ++++ TW N V               D G 
Sbjct: 149 PIWANNYQETSYPVAVFVWQAHNPTNAPITLSIMLTWQNMVGWFTNALKSPDVRVRDDGS 208

Query: 248 SGHHFNSKTMTKDGVHGLT----LHHRTANGRPPVTFAVAAEETADVHVS----ECPCFL 299
             + +  +     G +        H     GR  +   +   E +   V+    +   F 
Sbjct: 209 PVYEYQPRLGESGGNYNYLDESPQHLGCFLGRVGMAEPLQEGEGSWCIVTRKHPQVEIFH 268

Query: 300 LSGNSKGITAKDMWNEIKKHGSF-DHLDNDKTSPSEPGSSIGAAIAASLTIPSGSTRSVT 358
            +  +   T +++W      GS  +++D   TSP      +GAAIA   T+  G T  + 
Sbjct: 269 HTRWNPVGTGEEVWQSFAADGSLANYID---TSPVSENEQLGAAIAVRFTLQPGETLEIP 325

Query: 359 FSLAWDCPEVKFFEKV-YHRRYTKFYGTLGDSAARIAHDAILEHAKWECEIEAWQRPILE 417
           F ++WD P  +F   V Y+RRYT F+G  G+ A  IA  A+ ++  W+ +I+AWQ PIL 
Sbjct: 326 FVVSWDLPVTEFAAGVNYYRRYTDFFGKSGNHAWAIATIALEQYQTWQQQIQAWQDPILN 385

Query: 418 DKRFPEWYPITLFNELYYLNAGGTIWTDGSPPMQSLATIRERKFSLDTPSKIHTPASSDT 477
               P+W+ + LFNELY L +GGT+W+  +P                             
Sbjct: 386 RDDLPDWFKMALFNELYDLTSGGTLWSAATP----------------------------- 416

Query: 478 ALGTRLLENGEENIGQFLYLEGAEYVMYNTYDVHFYSSFALVMLFPKLELSIQRDFAAAV 537
                      + IGQF  LE  +Y  Y + DV  Y SF L+ LFP+LE ++ R FA A+
Sbjct: 417 ----------SDPIGQFAVLECLDYRWYESLDVRLYGSFGLLQLFPELEKAVMRAFARAI 466

Query: 538 MMHD--PGTM----KIMSDGKWVARKCLGAVPHDIGLDDP--WFEINSYNLFNSSRWKDL 589
              D  P  +     I ++     RK  GA PHD+G  +   W + N  +  + + WKDL
Sbjct: 467 PQGDDTPRVIGYYYTIGAESPIAVRKTPGATPHDLGAPNEHVWEKTNYTSYQDCNLWKDL 526

Query: 590 NSKFVLQVYRDFVATG--DKNFARAVWPSVYIAMAYMEQFDKDGDGMIENEGFPDQTYDA 647
              FVLQVYRDF+ TG  D  F R  W ++   + Y++ FD DGDG+ EN G PDQT+D 
Sbjct: 527 GCDFVLQVYRDFLLTGADDVQFLRDCWDAIVETLDYVKTFDLDGDGIPENSGAPDQTFDD 586

Query: 648 WSANGVSAYCGGLWVAALQAASALANDVGDHASASYFWVRYQKAKAVYDS---------- 697
           W   GVSAYCGGLW+AAL AA A+++ +  +   S    +    K+ Y++          
Sbjct: 587 WRLQGVSAYCGGLWMAALAAAIAISDILLQNHQDSETKEKLLLQKSTYETWLTKSLPIYQ 646

Query: 698 --LWNGSYFNYDNSDGSSSTSIQADQLAGQWYARACGLLPIADEAKVKKALTKIYDFNVL 755
             LWNG Y+  D+  GS    + ADQL GQ+YA    L  I    +   AL  +YD   L
Sbjct: 647 EKLWNGKYYRLDSESGSD--VVMADQLCGQFYANLLELPDIVPSDRAISALQTVYDACFL 704

Query: 756 KVKGGMCGAMNGMQPDGRIDM-SGLQAREIWPGVTYGLAASMIQEEMVDMAFQTAAGVYE 814
           K   G  GA NG++PDG  +        E+W G+ +GLAA ++Q  M D  F+    V  
Sbjct: 705 KFYDGQFGAANGVRPDGSPENPKATHPLEVWTGINFGLAAFLVQMGMKDEGFRLTQAVVA 764

Query: 815 VAWSEDGLGYSFQTPESWNNNDEYRSLCYMRPLTIWAM 852
             ++    G  F+TPE+      +R+  Y+R + IWA+
Sbjct: 765 QIYNN---GLQFRTPEAITAAGTFRASTYLRAMAIWAI 799


>gi|81299663|ref|YP_399871.1| hypothetical protein Synpcc7942_0854 [Synechococcus elongatus PCC
           7942]
 gi|81168544|gb|ABB56884.1| conserved hypothetical protein [Synechococcus elongatus PCC 7942]
          Length = 798

 Score =  367 bits (942), Expect = 2e-98,   Method: Compositional matrix adjust.
 Identities = 261/824 (31%), Positives = 385/824 (46%), Gaps = 122/824 (14%)

Query: 87  QGIPLGGIGAGSIGRSYRGEFQRFKLFHGICDDAPVLANQFSVFVSRPNGEKFSSVLCPR 146
            G PLGG GAG  GRS RG+   + L  G      + A QF+V+ S       S+     
Sbjct: 36  HGAPLGGFGAGCWGRSPRGDVTLWHLDGGEHWYGSIPACQFAVYESGTGAYALST----- 90

Query: 147 SPGVPKKNTDSGIESWDW----NLKGENCTYHALFPRAWTVYDGEPDPELRIVCRQISPF 202
                +  +D  + SW+W      +     Y AL+PR+   Y  +   E  I CRQ SP 
Sbjct: 91  -----EAPSDGSLSSWNWYPASTAERSTGEYSALYPRSQFSY--QQVFEAEIHCRQFSPI 143

Query: 203 IPHNYKESSFPASVFTFTLSNSGQTSADVTLLFTWAN-------------SVAGDSGLSG 249
           +PH+Y+ +S+P ++F + L N       +++L +W N              V  D G   
Sbjct: 144 LPHDYQATSYPTAIFRWQLHNPSDRPLTISILLSWENLCGWFTNTNKAPEVVYRDDGSPV 203

Query: 250 HHFN----------SKTMTKDGVHGLTLHHRTANGRPPV---TFAVAAEETADVHVSECP 296
           + +           ++ +  +G  GL L  +T +  P      +A+A  E   + +  C 
Sbjct: 204 YDYVPALGQSVGNLNQRIAGEGWQGLLLD-QTRSQDPEEGDGQWAIAIAEAEGLEIFRCD 262

Query: 297 CFLLSGNSKGITAKDMWNEIKKHGSFDHLDNDKTSPSEPGSSIGAAIAASLTIPSGSTRS 356
            +  +G+       ++W      GS    D   + P+  G  +  AIA  LT+  G +R 
Sbjct: 263 RWDPTGD-----GSELWQSFALDGSIP--DRQDSQPAAAGERLAGAIAVRLTLQPGESRE 315

Query: 357 VTFSLAWDCPEVKFFEKV-YHRRYTKFYGTLGDSAARIAHDAILEHAKWECEIEAWQRPI 415
           + FS+AWD P  +F   V   RRYT F+G  G +AA IA   +  + +WE  I AWQ+PI
Sbjct: 316 IPFSIAWDLPVTEFAAGVKAFRRYTDFFGRDGRNAAAIAATGLKHYDEWEQAIAAWQQPI 375

Query: 416 LEDKRFPEWYPITLFNELYYLNAGGTIWTDGSPPMQSLATIRERKFSLDTPSKIHTPASS 475
           L+     + + + LFNELY L +GG++WT  S                            
Sbjct: 376 LDRDDLSDTFKLGLFNELYDLCSGGSLWTAASE--------------------------- 408

Query: 476 DTALGTRLLENGEENIGQFLYLEGAEYVMYNTYDVHFYSSFALVMLFPKLELSIQRDFAA 535
                        + +GQF  LE  +Y  Y + DV  Y +F L+ML+P+LE ++ R FA 
Sbjct: 409 ------------ADPVGQFAVLECLDYAWYESLDVRLYGAFGLLMLWPELEKAVMRAFAR 456

Query: 536 AVMMHDPGT------MKIMSDGKWVARKCLGAVPHDIGL--DDPWFEINSYNLFNSSRWK 587
           A+   D  T        I  +     RK   A PHD+G   + PW + N     + + WK
Sbjct: 457 AIPQADDRTRVIGYWFTIGQESPLAKRKLANATPHDLGAPNESPWLQTNYTGYQDCNLWK 516

Query: 588 DLNSKFVLQVYRDFVATG--DKNFARAVWPSVYIAMAYMEQFDKDGDGMIENEGFPDQTY 645
           DL   FVLQV+RD+  TG  D+ F    WP+   A+ Y++ FD DGDG+ EN G PDQT+
Sbjct: 517 DLGCDFVLQVWRDYQLTGSSDRGFLSDCWPAAVAALRYLKDFDLDGDGIPENSGAPDQTF 576

Query: 646 DAWSANGVSAYCGGLWVAALQAA---------SALANDVGDHASASYFWVRYQKAKAVY- 695
           D W   G+SAYCGGLW+AAL+AA         SA   D  D  S    W+   +A+++Y 
Sbjct: 577 DDWRLQGISAYCGGLWIAALEAALAIADVLELSAEDRDRQDFQS----WL--AQARSLYH 630

Query: 696 DSLWNGSYFNYDNSDGSSSTSIQADQLAGQWYARACGLLPIADEAKVKKALTKIYDFNVL 755
           D+LWNG Y+  D+  GS S  + ADQL G +Y+R   L P+A     +     +Y+    
Sbjct: 631 DTLWNGRYYQLDS--GSGSQVVMADQLCGDFYSRLLQLPPVAPLEAAQSTADMVYEACFQ 688

Query: 756 KVKGGMCGAMNGMQPDGR-IDMSGLQAREIWPGVTYGLAASMIQEEMVDMAFQTAAGVYE 814
           K   G  G  NG+ PDG  +D  G    E+W G+ +G+AA  +     D  F+    V  
Sbjct: 689 KFHSGQFGLANGLLPDGSPVDPKGTHPLEVWTGINFGIAAYWLLLGHRDRCFEVTETVIR 748

Query: 815 VAWSEDGLGYSFQTPESWNNNDEYRSLCYMRPLTIWAMQWALTK 858
             +     G  F+TPE+   N  +R+  Y+RP+ IWA+   LT 
Sbjct: 749 QIYDN---GLQFRTPEAITANATFRASHYLRPMAIWAVYGVLTN 789


>gi|148670516|gb|EDL02463.1| glucosidase beta 2, isoform CRA_c [Mus musculus]
          Length = 467

 Score =  366 bits (940), Expect = 3e-98,   Method: Compositional matrix adjust.
 Identities = 198/441 (44%), Positives = 254/441 (57%), Gaps = 35/441 (7%)

Query: 419 KRFPEWYPITLFNELYYLNAGGTIWTDGSPPMQSLATIRERKFSLDTPSKIHTPASSDTA 478
           +  P WY   LFNELY+L  GGT+W                   L+ P+      S    
Sbjct: 10  RTLPAWYKSALFNELYFLADGGTVW-------------------LEVPAD-----SLPEG 45

Query: 479 LG--TRLLENGEENIGQFLYLEGAEYVMYNTYDVHFYSSFALVMLFPKLELSIQRDFAAA 536
           LG   R L +  ++ G+F YLEG EY MYNTYDVHFY+SFALVML+PKLELS+Q D A A
Sbjct: 46  LGGSMRQLRSTLQDYGRFGYLEGQEYRMYNTYDVHFYASFALVMLWPKLELSLQYDMALA 105

Query: 537 VMMHDPGTMKIMSDGKWVARKCLGAVPHDIGL--DDPWFEINSYNLFNSSRWKDLNSKFV 594
            +  D    + +  G     K    +PHDIG   D+PW  +N+Y + +++ WKDLN KFV
Sbjct: 106 TLKEDLTRRRYLMSGVVAPVKRRNVIPHDIGDPDDEPWLRVNAYLIHDTADWKDLNLKFV 165

Query: 595 LQVYRDFVATGDKNFARAVWPSVYIAMAYMEQFDKDGDGMIENEGFPDQTYDAWSANGVS 654
           LQ+YRD+  TGD+ F   +WP     M    +FDKD DG+IEN G+ DQTYDAW   G S
Sbjct: 166 LQIYRDYYLTGDQGFLEDMWPVCLAVMESEMKFDKDQDGLIENGGYADQTYDAWVTTGPS 225

Query: 655 AYCGGLWVAALQAASALANDVGDHASASYFWVRYQKAKAVYDS-LWNGSYFNYDNSDGSS 713
           AYCGGLW+AA+     +A   G       F     + +  Y+  LWNG Y+NYD+S    
Sbjct: 226 AYCGGLWLAAVAVMVQMAVLCGAQDVQERFASILCRGREAYERLLWNGRYYNYDSSSHPQ 285

Query: 714 STSIQADQLAGQWYARACGL----LPIADEAKVKKALTKIYDFNVLKVKGGMCGAMNGMQ 769
           S SI +DQ AGQW+ RACGL      +     V +AL  I++ NV    GG  GA+NGM 
Sbjct: 286 SRSIMSDQCAGQWFLRACGLGEGDTEVFPTLHVVRALQTIFELNVQAFAGGAMGAVNGMH 345

Query: 770 PDGRIDMSGLQAREIWPGVTYGLAASMIQEEMVDMAFQTAAGVYEVAWSEDGLGYSFQTP 829
           P G  D S +Q+ E+W GV YGLAA+MIQE +    F+TA G Y   W    LG +FQTP
Sbjct: 346 PHGVPDRSSVQSDEVWVGVVYGLAATMIQEGLTWEGFRTAEGCYRTVWER--LGLAFQTP 403

Query: 830 ESWNNNDEYRSLCYMRPLTIW 850
           E++     +RSL YMRPL+IW
Sbjct: 404 EAYCQQQVFRSLAYMRPLSIW 424


>gi|56750695|ref|YP_171396.1| hypothetical protein syc0686_c [Synechococcus elongatus PCC 6301]
 gi|56685654|dbj|BAD78876.1| hypothetical protein [Synechococcus elongatus PCC 6301]
          Length = 798

 Score =  365 bits (936), Expect = 8e-98,   Method: Compositional matrix adjust.
 Identities = 260/824 (31%), Positives = 384/824 (46%), Gaps = 122/824 (14%)

Query: 87  QGIPLGGIGAGSIGRSYRGEFQRFKLFHGICDDAPVLANQFSVFVSRPNGEKFSSVLCPR 146
            G PLGG GAG  GRS RG+   + L  G      + A QF+V+ S       S+     
Sbjct: 36  HGAPLGGFGAGCWGRSPRGDVTLWHLDGGEHWYGSIPACQFAVYESGTGAYALST----- 90

Query: 147 SPGVPKKNTDSGIESWDW----NLKGENCTYHALFPRAWTVYDGEPDPELRIVCRQISPF 202
                +  +D  + SW+W      +     Y AL+PR+   Y  +   E  I CRQ SP 
Sbjct: 91  -----EAPSDGSLSSWNWYPASTAERSTGEYSALYPRSQFSY--QQVFEAEIHCRQFSPI 143

Query: 203 IPHNYKESSFPASVFTFTLSNSGQTSADVTLLFTWAN-------------SVAGDSGLSG 249
           +PH+Y+ +S+P ++F + L N       +++L +W N              V  D G   
Sbjct: 144 LPHDYQATSYPTAIFRWQLHNPSDRPLTISILLSWENLCGWFTNTNKAPEVVYRDDGSPV 203

Query: 250 HHFN----------SKTMTKDGVHGLTLHHRTANGRPPV---TFAVAAEETADVHVSECP 296
           + +           ++ +  +G  GL L  +T +  P      +A+A  E   + +  C 
Sbjct: 204 YDYVPALGQSVGNLNQRIAGEGWQGLLLD-QTRSQDPEEGDGQWAIAIAEAEGLEIFRCD 262

Query: 297 CFLLSGNSKGITAKDMWNEIKKHGSFDHLDNDKTSPSEPGSSIGAAIAASLTIPSGSTRS 356
            +  +G+       ++W      GS    D   + P+  G  +  AIA  LT+  G +R 
Sbjct: 263 RWDPTGD-----GSELWQSFALDGSIP--DRQDSQPAAAGERLAGAIAVRLTLQPGESRE 315

Query: 357 VTFSLAWDCPEVKFFEKV-YHRRYTKFYGTLGDSAARIAHDAILEHAKWECEIEAWQRPI 415
           + FS+AWD P  +F   V   RRYT F+G  G +AA IA   +  + +WE  I AWQ+PI
Sbjct: 316 IPFSIAWDLPVTEFAAGVKAFRRYTDFFGRDGRNAAAIAATGLKHYDEWEQAIAAWQQPI 375

Query: 416 LEDKRFPEWYPITLFNELYYLNAGGTIWTDGSPPMQSLATIRERKFSLDTPSKIHTPASS 475
           L+     + + + LFNELY L +GG++WT  S                            
Sbjct: 376 LDRDDLSDTFKLGLFNELYDLCSGGSLWTAASE--------------------------- 408

Query: 476 DTALGTRLLENGEENIGQFLYLEGAEYVMYNTYDVHFYSSFALVMLFPKLELSIQRDFAA 535
                        + +GQF  LE  +Y  Y + DV  Y +F L+ML+P+LE ++ R FA 
Sbjct: 409 ------------ADPVGQFAVLECLDYAWYESLDVRLYGAFGLLMLWPELEKAVMRAFAR 456

Query: 536 AVMMHDPGT------MKIMSDGKWVARKCLGAVPHDIGL--DDPWFEINSYNLFNSSRWK 587
           A+   D  T        I  +     RK   A PHD+G   + PW + N     + + WK
Sbjct: 457 AIPQADDRTRVIGYWFTIGQESPLAKRKLANATPHDLGAPNESPWLQTNYTGYQDCNLWK 516

Query: 588 DLNSKFVLQVYRDFVATG--DKNFARAVWPSVYIAMAYMEQFDKDGDGMIENEGFPDQTY 645
           DL   FVLQV+RD+  TG  D+ F    WP+   A+ Y++ FD DGDG+ EN G PDQT+
Sbjct: 517 DLGCDFVLQVWRDYQLTGSSDRGFLSDCWPAAVAALRYLKDFDLDGDGIPENSGAPDQTF 576

Query: 646 DAWSANGVSAYCGGLWVAALQAA---------SALANDVGDHASASYFWVRYQKAKAVY- 695
           D W   G+SAYCGGLW+AAL+AA         SA   D  D  S    W+   +A+++Y 
Sbjct: 577 DDWRLQGISAYCGGLWIAALEAALAIADVLELSAEDRDRQDFQS----WL--AQARSLYH 630

Query: 696 DSLWNGSYFNYDNSDGSSSTSIQADQLAGQWYARACGLLPIADEAKVKKALTKIYDFNVL 755
           D+LWNG Y+  D+  GS S  + ADQL G +Y+R   L P+A     +     +Y+    
Sbjct: 631 DTLWNGRYYQLDS--GSGSQVVMADQLCGDFYSRLLQLPPVAPLEAAQSTADMVYEACFQ 688

Query: 756 KVKGGMCGAMNGMQPDGR-IDMSGLQAREIWPGVTYGLAASMIQEEMVDMAFQTAAGVYE 814
           K   G  G  NG+ PDG  +   G    E+W G+ +G+AA  +     D  F+    V  
Sbjct: 689 KFHSGQFGLANGLLPDGSPVGPKGTHPLEVWTGINFGIAAYWLLLGHRDRCFEVTETVIR 748

Query: 815 VAWSEDGLGYSFQTPESWNNNDEYRSLCYMRPLTIWAMQWALTK 858
             +     G  F+TPE+   N  +R+  Y+RP+ IWA+   LT 
Sbjct: 749 QIYDN---GLQFRTPEAITANATFRASHYLRPMAIWAVYGVLTN 789


>gi|427419169|ref|ZP_18909352.1| putative bile acid beta-glucosidase [Leptolyngbya sp. PCC 7375]
 gi|425761882|gb|EKV02735.1| putative bile acid beta-glucosidase [Leptolyngbya sp. PCC 7375]
          Length = 835

 Score =  360 bits (925), Expect = 2e-96,   Method: Compositional matrix adjust.
 Identities = 271/861 (31%), Positives = 395/861 (45%), Gaps = 159/861 (18%)

Query: 86  DQGIPLGGIGAGSIGRSYRGEFQRFKLFHGICDDAPVLANQFSVFVSRPNGEKF------ 139
           + G+PLGG+GAG IGR+  G F  + L  G        A QF+V+       +       
Sbjct: 36  EHGMPLGGLGAGCIGRAPNGTFNLWHLDGGEHTFEHFPACQFAVYEEAAGASQTYAMATA 95

Query: 140 --------------SSVLCPRSPGVPKKNTDSGIESWDWNLKGENCTYHALFPRAWTVYD 185
                          SV     PG P  +T S I+           TY+AL+PR+W  Y 
Sbjct: 96  GDGLERWQWYPAENRSVGTAHPPGQPGNDTTSTIK---------GGTYNALYPRSWYSYA 146

Query: 186 GEPDPELRIVCRQISPFIPHNYKESSFPASVFTFTLSNSGQTSADVTLLFTWANSVAGDS 245
                 L   C Q SP  P NY+ESS+P +VF +T  N       +++L +W N V    
Sbjct: 147 DVFKANL--TCEQFSPIWPDNYQESSYPVAVFVWTAHNPTDQPITLSILLSWQNMV---- 200

Query: 246 GLSGHHFNSKT-----MTKDGV----HGLTLHHRTANGRPPVTFAVAAEETADVHVSECP 296
              G   NS++       +DG     +G  L     N    V+   A     D   ++ P
Sbjct: 201 ---GWFTNSESSPEIQQREDGSPFYDYGPPLRQSAGNFNQLVSDERAIGMVMDGAWTDSP 257

Query: 297 -------CFLLSGNSKGIT----------AKDMWNEIKKHGSFDHLDNDKTSPSEPGSSI 339
                  C  +   ++GIT            D+W     HG+  ++ +  T+ +     +
Sbjct: 258 GEGDGQFCIAVPA-TEGITYHTRWNPHGDGGDLWESFAGHGTLSNVAD--TTSAVKDEQV 314

Query: 340 GAAIAASLTIPSGSTRSVTFSLAWDCPEVKFFE-KVYHRRYTKFYGTLGDSAARIAHDAI 398
           G AIA   T+  G T+ + F L WD P  +F   K+ +RRYT F    G +A ++A +A+
Sbjct: 315 GGAIAIKFTLAPGETQELPFVLTWDLPVTEFATGKLAYRRYTDFCDCSGRNAWQLAQEAL 374

Query: 399 LEHAKWECEIEAWQRPILEDKRFPEWYPITLFNELYYLNAGGTIWTDGSPPMQSLATIRE 458
             +  W   I  WQ+PIL+    P+W+ + LFNELY + +GGT+W+              
Sbjct: 375 ANYPTWRQNIIDWQQPILDRPDLPDWFKMALFNELYDMASGGTLWS-------------- 420

Query: 459 RKFSLDTPSKIHTPASSDTALGTRLLENGEENIGQFLYLEGAEYVMYNTYDVHFYSSFAL 518
                         A+SD            + +GQF  LE  +Y  Y + DV  Y  F+ 
Sbjct: 421 --------------AASDI-----------DPVGQFGVLECIDYRWYESLDVRLYGGFST 455

Query: 519 VMLFPKLELSIQRDFAAAVMMHDP-----GTMKIMSDGKWVA-RKCLGAVPHDIGL--DD 570
           +ML+P+LE +I R FA A+   DP     G    + + + +A RK  GA PHD+G   ++
Sbjct: 456 LMLWPELEKAIIRAFARAIPTEDPKKRIIGYYYTVGESEHLAPRKLKGATPHDLGAPNEN 515

Query: 571 PWFEINSYNLFNSSRWKDLNSKFVLQVYRDFVATG--DKNFARAVWPSVYIAMAYMEQFD 628
           PW   N  +  + ++WKDL S FVLQVYR FV TG  D  F    WP+V   + Y++ FD
Sbjct: 516 PWIATNYTSYQDCNQWKDLPSDFVLQVYRAFVLTGGDDLEFLAECWPAVTQTLQYLKLFD 575

Query: 629 KDGDGMIENEGFPDQTYDAWSANGVSAYCGGLWVAALQAASALANDVGDH--ASASYFWV 686
            DGDG+ EN G PDQT+D W   G+SAYCGGLW+AAL+AA  + + +  H   S S+  V
Sbjct: 576 LDGDGVPENGGAPDQTFDDWQLKGLSAYCGGLWLAALEAAIKMGDLLQQHRKISGSFAIV 635

Query: 687 RYQ------KAKAVY-DSLWNGSYFNYDNSDGSSSTSIQADQLAGQWYARACGLLPIADE 739
           + Q      + K VY + LW GS++  D+   S+S  + ADQL GQ+ AR   L  I D+
Sbjct: 636 KSQYQRWIKQGKIVYQNKLWTGSFYRLDSE--SNSDVVMADQLCGQFCARLMDLPDIVDQ 693

Query: 740 AKVKKALTKIYD-----FNVLKVK----------------------GGMCGAMNGMQPDG 772
             +  AL  IYD     FN    K                      G   GA NG++PDG
Sbjct: 694 EFIDIALDTIYDACFVKFNEYAAKHSTPQNQKFEGSQAGFFSAAQLGMAIGAANGVKPDG 753

Query: 773 RIDM-SGLQAREIWPGVTYGLAASMIQEEMVDMAFQTAAGVYEVAWSEDGLGYSFQTPES 831
             +        E+W G+ +G+AA + Q    D A      V +  ++    G  F+TPE+
Sbjct: 754 SPENPDDTHQLEVWTGINFGVAAFLAQMGKRDQAMAITEAVVQQIYN---YGLQFRTPEA 810

Query: 832 WNNNDEYRSLCYMRPLTIWAM 852
                 +R+  Y+RP+ IW +
Sbjct: 811 ITAMGTFRACHYLRPMAIWGL 831


>gi|162452686|ref|YP_001615053.1| hypothetical protein sce4410 [Sorangium cellulosum So ce56]
 gi|161163268|emb|CAN94573.1| hypothetical protein sce4410 [Sorangium cellulosum So ce56]
          Length = 782

 Score =  358 bits (920), Expect = 6e-96,   Method: Compositional matrix adjust.
 Identities = 260/775 (33%), Positives = 374/775 (48%), Gaps = 101/775 (13%)

Query: 86  DQGIPLGGIGAGSIGRSYRGEFQRFKLFHGICDDAPVLANQFSVFVSRPNGEKFSSVLCP 145
           + G P+GG GAG+I   + G F + +L  G  D     +  F ++     G+  +  L  
Sbjct: 95  EDGAPIGGFGAGTITWRFDGRFYKGRLDIGSNDQNVDNSAGFFMYQKAEGGDAQTKRLDK 154

Query: 146 RSPGVPKKNTDSGIESWDWNLKGENCTYHALFPRAWTVYDGEPDPELRIVCRQISPFIPH 205
              G                 +G+  TY +LFPR+W  Y G   P  +    Q SP IP 
Sbjct: 155 DGLG-----------------QGQ-ATYSSLFPRSWVDYHGAAFP-CKAKVEQYSPIIPG 195

Query: 206 NYKESSFPASVFTFTLSNSGQTSADVTLLFTWANSVAGDSGLSGHHFNSKTMTKDGVH-G 264
           +YK++S+P  V+ + ++N      DV ++ TW N          H  N  T+   G   G
Sbjct: 196 DYKKTSYPVGVYRWEITNPTGAPCDVAVMLTWKND---------HGGNGATVQTSGDDVG 246

Query: 265 LTLHHRTANGRPPVTFAVAAEETADVHVSECPCFLLSGNSKGITAKDMWNEIKKHGSFDH 324
           L L   T  G  P +     +    +     P  ++S  S    A  + +E+   G+  +
Sbjct: 247 LVL---TRGGGAPSS---EGQGEFTLASRNAPGVVVSYQSADSVAA-LQSELANDGALAN 299

Query: 325 LDNDKTSPSEPGSSIGAAIAASLTIPSGSTRSVTFSLAWDCPEVKFFE-KVYHRRYTKFY 383
                   +  G S+GA IA   T+  G +  V   LAWD P  +      ++R YT+ +
Sbjct: 300 --------TTGGHSLGA-IAFKATVEPGKSVIVPIVLAWDIPVTQAGSGDRWYRAYTRHF 350

Query: 384 GTLGDSAARIAHDAILEHAKWECEIEAWQRPILEDKRFPEWYPITLFNELYYLNAGGTIW 443
           G  G+++  IA +A+ EH  W   I+ WQ  +L+D ++PEW    LFNELYY    GT W
Sbjct: 351 GRTGNASWTIAAEALAEHEPWLWAIQDWQASVLDDPKYPEWLKGALFNELYYYAIAGTYW 410

Query: 444 TDGSPPMQSLATIRERKFSLDTPSKIHTPASSDTALGTRLLENGEENIGQFLYLEGAEYV 503
             G+   Q            D P                     +E++  F  LE   Y 
Sbjct: 411 EAGAASGQP-----------DDP---------------------DEDM--FTSLESYIYP 436

Query: 504 MYNTYDVHFYSSFALVMLFPKLELSIQRDFAAAVMMHDPGTMKIMSDGKWVARKCLGAVP 563
            Y T DV FY S+AL  L+P ++    R F  +V           SD      + +G   
Sbjct: 437 FYGTSDVRFYGSWALFSLWPDIDKQEVRQFCDSVTT-------TRSD----RPRAIGTTA 485

Query: 564 HDIGLDDPWF-EINSYNLFNSSRWKDLNSKFVLQVYRDFVATG--DKNFARAVWPSVYIA 620
           HD G ++  F + N+Y   +S+ WKDLNSK VL VYRD+  TG  DK F    WP+V +A
Sbjct: 486 HDFGDNESVFRKWNAYTYRDSTNWKDLNSKLVLMVYRDWALTGKTDKAFLDYCWPAVKMA 545

Query: 621 MAYMEQFDKDGDGMIENEGFPDQTYDAWSANGVSAYCGGLWVAALQAASALANDVGDHAS 680
           M  ++  D DGDG+ E+ G  DQTYD    +G +AYCGGL++AA +AA  LA  VGD + 
Sbjct: 546 MQKVKGQDGDGDGLPESNGI-DQTYDDMDLHGNTAYCGGLFIAASEAAKELATAVGDASL 604

Query: 681 ASYFWVRYQKAKAVYDS-LWNGSYFNYDNSDGSSSTS-IQADQLAGQWYARACGLLPIAD 738
           AS +    +++K  ++S LW GSY+  D   GS  T+ I +DQLAGQWYARA GL PI D
Sbjct: 605 ASTYQTWVEQSKGGFESKLWTGSYYKIDT--GSRDTNRIMSDQLAGQWYARALGLPPIVD 662

Query: 739 EAKVKKALTKIYDFNVLKVKGGMCGAMNGMQPDGRIDMSGLQAREIWPGVTYGLAASMIQ 798
            +    A TKIY+ N  +  GG  G +N M  +G +D +  Q RE W G ++G+ A MIQ
Sbjct: 663 PSHAASAFTKIYENNFKRFDGGTRGVVNVMTANGSVDGTSNQTRECWVGTSWGVVAGMIQ 722

Query: 799 EEMVDMAFQTAAGVYEVAWSEDGLGYSFQTPESWNNNDEYRSLCYMRPLTIWAMQ 853
           E +   A +  A + +  W  DGL   F+TPE+W  N   R+  YMR  T+WA +
Sbjct: 723 EGLAAQAGEIGASLVDTIWKTDGL--WFRTPEAWEGNGSIRAPYYMRATTLWAAK 775


>gi|443710865|gb|ELU04898.1| hypothetical protein CAPTEDRAFT_216398 [Capitella teleta]
          Length = 449

 Score =  355 bits (910), Expect = 9e-95,   Method: Compositional matrix adjust.
 Identities = 192/491 (39%), Positives = 272/491 (55%), Gaps = 72/491 (14%)

Query: 375 YHRRYTKFYGTLGDSAARIAHDAILEHAKWECEIEAWQRPILEDKRFPEWYPITLFNELY 434
           YHRRY +F+G  G +  R+A  A+     WE +I  WQ+P L+  + P WY   LFNE Y
Sbjct: 13  YHRRYARFFGHDGCATPRLAAHALNHFTSWEEKIAEWQQPTLKANKLPAWYKSALFNETY 72

Query: 435 YLNAGGTIWTDGSPPMQSLATIRERKFSLDTPSKIHTPASSDTALGTRLLENGEENIGQF 494
           +++ GG++W D    +     + E  F                           +  G+F
Sbjct: 73  FISDGGSVWVDS---IDVKDAVSEHPFV--------------------------QEFGKF 103

Query: 495 LYLEGAEYVMYNTYDVHFYSSFALVMLFPKLELSIQRDFAAAVMMHDPGTMKIMSDGKWV 554
            YLEG EY MYNT DVH+Y+SFAL+ L+P L+LS+Q D                      
Sbjct: 104 AYLEGHEYRMYNTLDVHYYASFALMKLWPNLQLSLQYDID-------------------- 143

Query: 555 ARKCLGAVPHDIGLDDPWFEINSYNLFNSSRWKDLNSKFVLQVYRDFVATGDKNFARAVW 614
                         ++PW  +N+Y +  ++ WKDLN KFVLQVYRD+V T D+ +   ++
Sbjct: 144 --------------NEPWSRLNAYIIHPTNEWKDLNPKFVLQVYRDYVFTKDEQYLYEMY 189

Query: 615 PSVYIAMAYMEQFDKDGDGMIENEGFPDQTYDAWSANGVSAYCGGLWVAALQAASALAND 674
           P     M +  ++D D DG+I+N GF DQTYDAW+  G SAYCGGLW+AA++A + +A+ 
Sbjct: 190 PQCKAVMDHSLRWDVDRDGIIDNSGFADQTYDAWTVTGASAYCGGLWLAAVKAFTEMAHR 249

Query: 675 VGDHASASYFWVRYQKAKAVY-DSLWNG----SYFNYDNSDGSSSTSIQADQLAGQWYAR 729
           +G       +    ++ K  + + LWNG     Y+NYD S+     SI ADQ+ GQW+ +
Sbjct: 250 LGVVEDLMKYEEILERGKKNFEEKLWNGWVDCEYYNYDCSESGHHDSIMADQMCGQWFLK 309

Query: 730 ACGLL--PIADEAKVKKALTKIYDFNVLKVKGGMCGAMNGMQPDGRIDMSGLQAREIWPG 787
           ACG+    +   A VK+AL  ++  NVL   GG  GA+NGM+P+ + D++  Q+ E W G
Sbjct: 310 ACGVADNAVFPSANVKRALEAVFKHNVLPFDGGRMGAINGMRPNAKKDVTSCQSDEFWTG 369

Query: 788 VTYGLAASMIQEEMVDMAFQTAAGVYEVAWSEDGLGYSFQTPESWNNNDEYRSLCYMRPL 847
           VTY L A+MIQ  M+D  FQTA G Y   W   GL  +FQTPE++ +N  +RSL YMRPL
Sbjct: 370 VTYALGATMIQVGMIDKGFQTAYGAYHTCWERYGL--AFQTPEAYFDNRRFRSLGYMRPL 427

Query: 848 TIWAMQWALTK 858
            IWA+Q A+ K
Sbjct: 428 AIWAIQHAVEK 438


>gi|411117353|ref|ZP_11389840.1| putative bile acid beta-glucosidase [Oscillatoriales cyanobacterium
           JSC-12]
 gi|410713456|gb|EKQ70957.1| putative bile acid beta-glucosidase [Oscillatoriales cyanobacterium
           JSC-12]
          Length = 907

 Score =  352 bits (903), Expect = 5e-94,   Method: Compositional matrix adjust.
 Identities = 244/774 (31%), Positives = 362/774 (46%), Gaps = 143/774 (18%)

Query: 87  QGIPLGGIGAGSIGRSYRGEFQRFKLFHGICDDAPVLANQFSVFVSRPNGEKFSSVLCPR 146
            G+PLGG GAG IGRS++G+F  + +  G     P+ A QFS+F  +   +  +  LC  
Sbjct: 41  HGMPLGGFGAGCIGRSHQGDFNLWHIDGGEHVFQPMPACQFSIF-EQVEQDAQAYALCTE 99

Query: 147 SPGVPKKNTDSGIESWDWNLKGENC--------------------TYHALFPRAWTVYDG 186
           +P      TD  + +W W    E+                     TYHAL+PR+W  Y  
Sbjct: 100 AP------TDGSLAAWSWYPTVEDADELGETSLDQARSLLSPHTGTYHALYPRSWFTYQN 153

Query: 187 EPDPELRIVCRQISPFIPHNYKESSFPASVFTFTLSNSGQTSADVTLLFTWANSVAG--- 243
               +L   C Q SP +PHNY+E+S+P ++F +T+ N       ++L+ TW N +AG   
Sbjct: 154 VFLADL--TCEQFSPILPHNYQETSYPVAIFEWTVHNPTNQPLTLSLMLTWQN-MAGWFT 210

Query: 244 -----------DSGLSGHHF---------NSKTMTKDGVHGLTL-----HHRTANGRPPV 278
                      D G   + +         N   + K+G H   +     +     G    
Sbjct: 211 NASKSPDVQMRDDGSPVYDYQPRLYESAGNWNQIVKEGQHVAIVMAGARNQELEEGDGQW 270

Query: 279 TFAVAAEETADVHVSECPCFLLSGNSKGITAKDMWNEIKKHGSFDHLDNDKTSPSEPGSS 338
             AV     A+V       +    N  G  A D+W    ++GS  ++ N+  +P+     
Sbjct: 271 AIAVKLPTQAEVF------YHTHWNPVGDGA-DIWQSFSQNGSLTNVSNN--TPAAKDEQ 321

Query: 339 IGAAIAASLTIPSGSTRSVTFSLAWDCPEVKFFEKV-YHRRYTKFYGTLGDSAARIAHDA 397
           IG AIA    +  G T  + F L+WD P ++F E V Y +RYT F+   G +A  IA  A
Sbjct: 322 IGGAIAVRFMLQPGETWQIPFCLSWDFPVIEFAEGVTYFKRYTDFFDRSGQNAWTIAQTA 381

Query: 398 ILEHAKWECEIEAWQRPILEDKRFPEWYPITLFNELYYLNAGGTIWTDGSPPMQSLATIR 457
           +  +  W+ +I  WQ+PILE    P+W+ + LFNELY L  GGT+WT  +          
Sbjct: 382 LNHYKTWQEQIHTWQQPILERNDLPDWFKMALFNELYNLTDGGTLWTAAT---------- 431

Query: 458 ERKFSLDTPSKIHTPASSDTALGTRLLENGEENIGQFLYLEGAEYVMYNTYDVHFYSSFA 517
           ER                             + +GQF  LE  +Y  Y + DV  Y SFA
Sbjct: 432 ER-----------------------------DPVGQFAVLECIDYRWYESLDVRLYGSFA 462

Query: 518 LVMLFPKLELSIQRDFAAAVMMHDPGT------MKIMSDGKWVARKCLGAVPHDIGLDDP 571
           L+MLFP+LE ++ R FA A+   D  T        I ++     RK  GA PHD+G  + 
Sbjct: 463 LLMLFPELEKAVIRAFARAIPASDDRTRIIGYYYTIGAESPQAVRKVAGATPHDLGAPNE 522

Query: 572 --WFEINSYNLFNSSRWKDLNSKFVLQVYRDFVATGDKN--FARAVWPSVYIAMAYMEQF 627
             W + N  +  + + WKDL   FVLQVYRDF+ TG K+  F +  W ++   + Y++ F
Sbjct: 523 HVWQKTNYTSYQDCNLWKDLPCDFVLQVYRDFLYTGSKDIAFLQECWSAIAQTLHYLKTF 582

Query: 628 DKDGDGMIENEGFPDQTYDAWSANGVSAYCGGLW-----------------VAALQAASA 670
           D D DG+ EN G PDQT+D W  +G+SAYCGGLW                    ++  + 
Sbjct: 583 DYDFDGIPENSGAPDQTFDDWKLDGISAYCGGLWIAALEAALAIGRILAEYTTNMRQLAT 642

Query: 671 LANDVGDHASASYFWVRYQKAKAVY-DSLWNGSYFNYDNSDGSSSTSIQADQLAGQWYAR 729
           L+ND+G+      F V  ++A+++Y D+LWNG Y+  D++  S S  + ADQL GQ+YAR
Sbjct: 643 LSNDIGN------FQVWLEQARSIYHDTLWNGQYYRLDSN--SCSEVVMADQLCGQFYAR 694

Query: 730 ACGLLPIADEAKVKKALTKIYDFNVLKVKGGMCGAMNGMQPDGRIDMSGLQARE 783
              L  + D    + AL  IYD   +K    +      +Q    +  + +  RE
Sbjct: 695 LLKLPDVVDLNCAQTALKTIYDTCFIKFNNYLKSGALKLQQFTTVQEARIAERE 748



 Score = 58.5 bits (140), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 31/91 (34%), Positives = 51/91 (56%), Gaps = 4/91 (4%)

Query: 763 GAMNGMQPDGRIDM-SGLQAREIWPGVTYGLAASMIQEEMVDMAFQTAAGVYEVAWSEDG 821
           GA NG++PDG  +  +     E+W G+ +GLAA ++Q  M D A +    V +  ++   
Sbjct: 811 GAANGVKPDGSPENPNSTHPLEVWTGINFGLAAFLVQMGMRDEAMELTQAVVQQIYTN-- 868

Query: 822 LGYSFQTPESWNNNDEYRSLCYMRPLTIWAM 852
            G  F+TPE+   +  +R+  Y+R L IWA+
Sbjct: 869 -GLQFRTPEAITPSGTFRACHYLRALAIWAI 898


>gi|88809081|ref|ZP_01124590.1| hypothetical protein WH7805_05296 [Synechococcus sp. WH 7805]
 gi|88787023|gb|EAR18181.1| hypothetical protein WH7805_05296 [Synechococcus sp. WH 7805]
          Length = 849

 Score =  348 bits (893), Expect = 7e-93,   Method: Compositional matrix adjust.
 Identities = 254/823 (30%), Positives = 402/823 (48%), Gaps = 112/823 (13%)

Query: 87  QGIPLGGIGAGSIGRSYRGEFQRFKLFHGICDDAPVLANQFSVFVSRPNGEKFSSVLCPR 146
            G+PLGG GAG IGRS RG+F  + L  G      +   QFS+F +  + ++  ++    
Sbjct: 70  HGMPLGGFGAGCIGRSSRGDFNLWHLDGGEHWYGTIPDCQFSLFENNGSSKRAHALALKP 129

Query: 147 SPGVPKKNTDSGIESWDW--NLKGENCT--YHALFPRAWTVYDGEPDPELRIVCRQISPF 202
                + N+   + +W+W     GE  T  Y A +P +WT Y+G  D E++  C   SP 
Sbjct: 130 EADASRSNSGEPLAAWNWYPASTGEQSTGTYAARYPLSWTHYEGVFDAEVQ--CEAFSPI 187

Query: 203 IPHNYKESSFPASVFTFTLSNSGQTSADVTLLFTWANSVA-------------GDSGLSG 249
           +P +Y+ +S+P +VF +TL N      D++LL +W N+V               D G   
Sbjct: 188 LPGDYQRTSYPVAVFRWTLRNPTDQPLDLSLLLSWRNTVGWFFNTDPSAEVHFRDDGSPE 247

Query: 250 HHF----------NSKTMTKDGVHGLTLHHRTAN--GRPPVTFAVAAEETAD-VHVSECP 296
           H++           ++ + ++G+ G+ L  + +   G     + +A  +T D V V  C 
Sbjct: 248 HNYAPAIGHGEGQRNRWLDQEGIRGIVLEGQRSEPIGEGQGQWCLAVPDTLDGVEVLRCS 307

Query: 297 CFLLSGNSKGITAKDMWNEIKKHGSFDHLDNDKTSPSEPGSSIGAAIAASLTIPSGSTRS 356
            +  +G+      +++W      G   + +ND+ S    G    AAIA + T+  G +R 
Sbjct: 308 RWDPTGD-----GQELWQPFAAEGRIPNSNNDRAS--RAGEHASAAIALTFTLAPGESRE 360

Query: 357 VTFSLAWDCPEVKFFEKVYH-RRYTKFYGTLGDSAARIAHDAILEHAKWECEIEAWQRPI 415
           +  +++WD P   F   V   RRYT F+G  G +AA +A +A+ +   W  +I+AWQ P+
Sbjct: 361 IPIAISWDLPVTAFASGVRDLRRYTDFFGADGRNAAAMATEALRDWRHWRDQIDAWQAPV 420

Query: 416 LEDKRFPEWYPITLFNELYYLNAGGTIWTDGSPPMQSLATIRERKFSLDTPSKIHTPASS 475
           L  K  PE   + LFNELY L +GG++WT  S                            
Sbjct: 421 LARKELPEALRMALFNELYDLASGGSLWTAAS---------------------------- 452

Query: 476 DTALGTRLLENGEENIGQFLYLEGAEYVMYNTYDVHFYSSFALVMLFPKLELSIQRDFAA 535
                      G++  G+F  LE  +Y  Y + DV  Y S  L+ L+P+L+ ++ R FA 
Sbjct: 453 -----------GKDPHGRFGVLECLDYAWYESLDVRLYGSLGLLQLWPELDKAVLRSFAR 501

Query: 536 AVMMHDPGTMKI-----MSDGKWVA-RKCLGAVPHDIGL-DDPWFEINSYNLF-NSSRWK 587
           A+   D     I        G+  A RK  GA PHD+G  ++  F+  +Y  + + + WK
Sbjct: 502 AIPAADATQRPIGWYFTQGRGRVEADRKVEGATPHDLGAPNEVPFDATNYTAYQDCNLWK 561

Query: 588 DLNSKFVLQVYRDF--VATG-DKNFARAVWPSVYIAMAYMEQFDKDGDGMIENEGFPDQT 644
           DL S FVLQV+R F    TG D  F    WP+   A+ Y+++FD + DG+ +N G PDQT
Sbjct: 562 DLASDFVLQVWRTFKLAPTGEDLTFLADCWPAAVQALHYLKRFDVNNDGLPDNGGAPDQT 621

Query: 645 YDAWSANGVSAYCGGLWVAAL--------QAASALANDVGDHASASYFWVRYQKAKAVYD 696
           +D W   GVSAYCG LW+AAL        +    L  D G        W+  ++++A +D
Sbjct: 622 FDDWPLKGVSAYCGALWIAALEAALAMAQRLQQELGLDTGSEQHTFSNWL--EQSRANFD 679

Query: 697 S-LWNGSYFNYDNSDGSSSTSIQADQLAGQWYARACGLLPIADEAKVKKALTKIYDFNVL 755
           + LWNG Y++ D   G+    + ADQL G +YAR  GL P+  +A  +  L  + +    
Sbjct: 680 ALLWNGEYYDIDAESGTP--VVMADQLCGDFYARLLGLEPVVSDANSRSTLKAVKESCFE 737

Query: 756 KVKGGMCGAMNGMQPDGR-IDMSGLQAREIWPGVTYGLAA--SMIQEEMVDMAFQTAAGV 812
           K +GG  G  NG++ DG  +D +G    E+W G+ +G+A+   ++ E     A  +A  V
Sbjct: 738 KFQGGSLGVANGLRRDGTPLDPNGTHPLEVWTGINFGIASYYQLMGEGQTAEAICSAV-V 796

Query: 813 YEVAWSEDGLGYSFQTPESWNNNDEYRSLCYMRPLTIWAMQWA 855
           ++V       G  F+TPE+    + +R+  Y+R + IW + WA
Sbjct: 797 HQVY----SGGLQFRTPEAITAVNTFRACHYLRAMAIWGL-WA 834


>gi|254423918|ref|ZP_05037636.1| conserved hypothetical protein [Synechococcus sp. PCC 7335]
 gi|196191407|gb|EDX86371.1| conserved hypothetical protein [Synechococcus sp. PCC 7335]
          Length = 828

 Score =  347 bits (891), Expect = 1e-92,   Method: Compositional matrix adjust.
 Identities = 264/855 (30%), Positives = 388/855 (45%), Gaps = 161/855 (18%)

Query: 86  DQGIPLGGIGAGSIGRSYRGEFQRFKLFHG--ICDDAPVLANQFSVFVSRPNGEKFSSVL 143
           + G PLGG GAG IGRS  G F  + L  G  + D  P    QFSVF    +GE  +  +
Sbjct: 40  EHGAPLGGFGAGCIGRSPHGTFNLWHLDGGEHVFDSFP--GCQFSVFEQSADGETKAYAM 97

Query: 144 CPRSPGVPKKNTDSGIESWDWNLKGENC----TYHALFPRAWTVYDGEPDPELRIVCRQI 199
               P       ++ + SW W   G+      TYHAL+PR+W +Y+     +L   C Q+
Sbjct: 98  SSDKP-------ETELASWQWYPAGKTTENTGTYHALYPRSWYLYENIFKADL--TCEQL 148

Query: 200 SPFIPHNYKESSFPASVFTFTLSNSGQTSADVTLLFTWANSVAGDSGLSGHHFNSKTMTK 259
           SP   +NY ESS+P + F +T  N       ++++ +W N V       G   NS+T  +
Sbjct: 149 SPIWANNYIESSYPTATFIWTAHNHTDQPLTLSIMLSWENMV-------GWFTNSETSPE 201

Query: 260 DGVHGLTLHHRTANGRPPVTFA-VAAEETADVHVSECP-----CFLLSGNSKGITAK--- 310
                  +  R  +G P   +  +  E   + +   C        +L GN  G  A+   
Sbjct: 202 -------IQQR-EDGSPFYDYVPLLRESKGNYNTLVCEEGVARGVVLDGNWTGSPAEGQG 253

Query: 311 --------------------------DMWNEIKKHGSFDHLDNDKTSPSEPGSSIGAAIA 344
                                     ++WN+    G   +L++   S    G  IG AIA
Sbjct: 254 QLAIAAHHLEAEVSYHTRWHPDGDGSELWNDFSTTGRLSNLEDVDQSG---GERIGGAIA 310

Query: 345 ASLTIPSGSTRSVTFSLAWDCPEVKFFE-KVYHRRYTKFYGTLGDSAARIAHDAILEHAK 403
              T+  G T+ + F+L+WD P  +F + K  +RRYT F+   G +A  IA  A+  +A 
Sbjct: 311 LRFTLAPGETKQIPFTLSWDLPVTEFAKGKRAYRRYTDFFDRSGRNAWVIASTALNNYAI 370

Query: 404 WECEIEAWQRPILEDKRFPEWYPITLFNELYYLNAGGTIWTDGSPPMQSLATIRERKFSL 463
           W+  I  WQ+PIL+    P W+ + LFNELY L +GGT+W+  S                
Sbjct: 371 WQQNIINWQQPILDRTDLPNWFKMALFNELYDLASGGTLWSAASK--------------- 415

Query: 464 DTPSKIHTPASSDTALGTRLLENGEENIGQFLYLEGAEYVMYNTYDVHFYSSFALVMLFP 523
                                    + +GQF  LE  +Y  Y + DV  Y  +AL++L+P
Sbjct: 416 ------------------------SDPVGQFGVLECIDYRWYESLDVRLYGGYALLLLWP 451

Query: 524 KLELSIQRDFAAAVMMHDP-----GTMKIMSDGKWVA-RKCLGAVPHDIGL--DDPWFEI 575
           +LE SI R FA A+   +      G    + +   +A RK  GA PHD+G   ++PW   
Sbjct: 452 ELEKSILRAFARAIPTANAKERIIGYYYTVGEADHMAPRKLKGATPHDLGAPNENPWMAT 511

Query: 576 NSYNLFNSSRWKDLNSKFVLQVYRDFVATG--DKNFARAVWPSVYIAMAYMEQFDKDGDG 633
           N  +  + ++WKDL+S FVLQVYR F +TG  D +F    WP++   + Y + FD DGDG
Sbjct: 512 NYTSYQDCNQWKDLSSDFVLQVYRAFESTGSVDMDFLLDCWPAIVETLGYTKLFDTDGDG 571

Query: 634 MIENEGFPDQTYDAWSANGVSAYCGGLWVAALQAASALANDVGDHASA--------SYFW 685
           +IEN G PDQT+D W   G+SAYCGGLW+AAL AA  +   + +            + + 
Sbjct: 572 LIENSGAPDQTFDDWKLQGISAYCGGLWIAALSAAIKIGEILQESKRVEGTIVILLAQYR 631

Query: 686 VRYQKAKAVYD-SLWNGSYFNYDNSDGSSSTSIQADQLAGQWYARACGLLPIADEAKVKK 744
             +Q  +  Y   LWNG Y+  D   GS S  + ADQL GQ+ A   GL  + D   V+ 
Sbjct: 632 RWWQHGRTAYQKQLWNGEYYRLDT--GSGSDVVMADQLCGQFCASTMGLPDVVDSEFVEP 689

Query: 745 ALTKIYD-----FNVLKVK---------------------GGMCGAMNGMQPDGR-IDMS 777
            L  IY+     FN    +                     G   G  NG++PDG   +  
Sbjct: 690 TLRAIYEACFVRFNQYTAQLGPQNQKFEGAQTGYFSASELGVKVGCANGVRPDGSPQNPD 749

Query: 778 GLQAREIWPGVTYGLAASMIQEEMVDMAFQTAAGVYEVAWSEDGLGYSFQTPESWNNNDE 837
                E+W G+ +GLAA +  E  +  A +    V    +     G  F+TPE+      
Sbjct: 750 DTHQLEVWTGINFGLAAFLAHEGKLHEAMEITEAVVRQVYEH---GLQFRTPEAITAVGT 806

Query: 838 YRSLCYMRPLTIWAM 852
           +R+  Y+RP+ IWA+
Sbjct: 807 FRACHYLRPMAIWAV 821


>gi|254431466|ref|ZP_05045169.1| conserved hypothetical protein [Cyanobium sp. PCC 7001]
 gi|197625919|gb|EDY38478.1| conserved hypothetical protein [Cyanobium sp. PCC 7001]
          Length = 837

 Score =  346 bits (887), Expect = 4e-92,   Method: Compositional matrix adjust.
 Identities = 259/825 (31%), Positives = 387/825 (46%), Gaps = 110/825 (13%)

Query: 86  DQGIPLGGIGAGSIGRSYRGEFQRFKLFHGICDDAPVLANQFSVFVSRPNGEKFSSVLCP 145
           + G+PLGG GAG IGR+  G F  + L  G      +   QF++F    NG +  ++   
Sbjct: 51  NHGMPLGGFGAGCIGRAPDGSFNLWNLDGGEHWFGTLPDCQFALFERDGNGSRAHALAVA 110

Query: 146 RSPGVPKKNTDSGIESWDW----NLKGENCTYHALFPRAWTVYDGEPDPELRIVCRQISP 201
                 + +    + +W W           TY A +P +WT + G      R+ C   SP
Sbjct: 111 PEADASRPDAGPPLSAWSWYPASTADTSTGTYAARYPLSWTSFSGVF--SARVGCEAFSP 168

Query: 202 FIPHNYKESSFPASVFTFTLSNSGQTSADVTLLFTWANSVA-------------GDSGLS 248
            +P +Y+ SS+P +VFT++LSN  +   +++LL +W N+                D G  
Sbjct: 169 ILPGDYERSSYPVAVFTWSLSNPTRRPLELSLLLSWRNTTGWFTNTDPAAAVHFRDDGSP 228

Query: 249 GHHFNSKTMTKDG----------VHGLTLH----HRTANGRPPVTFAVAAEETA---DVH 291
            H++       +G          + G+ L        A G+     A  A   A    + 
Sbjct: 229 EHNYVPAIGRSEGQRNRWVDDGPLKGVLLEGPISDPIAEGQGQWCIATDAGLEARHPGLR 288

Query: 292 VSECPCFLLSGNSKGITAKDMWNEIKKHGSFDHLDNDKTSPSEPGSSIGAAIAASLTIPS 351
           +  C  +  +G+       ++W    + GS    +ND+ S +   +S  AA+A  L +  
Sbjct: 289 IHRCSRWNPAGD-----GAELWESFARDGSIPDSNNDRHSGANDPAS--AALAVQLRLEP 341

Query: 352 GSTRSVTFSLAWDCPEVKFFEKVYH-RRYTKFYGTLGDSAARIAHDAILEHAKWECEIEA 410
           G++  +   ++WD P   F       RRYT  +G  G +AA IA +A+     W  +I A
Sbjct: 342 GASIDIPLVISWDLPVTAFATGSRALRRYTDHFGADGTNAAAIAAEALGSWPSWREQIAA 401

Query: 411 WQRPILEDKRFPEWYPITLFNELYYLNAGGTIWTDGSPPMQSLATIRERKFSLDTPSKIH 470
           WQ+P+LE    PE   + LFNELY L +GGT+WT  SP                      
Sbjct: 402 WQQPVLERTDLPEPLRMALFNELYDLASGGTLWTAASP---------------------- 439

Query: 471 TPASSDTALGTRLLENGEENIGQFLYLEGAEYVMYNTYDVHFYSSFALVMLFPKLELSIQ 530
                            E+ +G+F  LE  +Y  Y + DV  Y SFAL+ L+P+L+ ++ 
Sbjct: 440 -----------------EDPVGRFGVLECLDYAWYESLDVRLYGSFALLQLWPELDKAVL 482

Query: 531 RDFAAAVMMHDPGTMKI---MSDGKW---VARKCLGAVPHDIGL--DDPWFEINSYNLFN 582
           R FA A+   DP    I    + GK      RK  GA PHD+G   + PW   N     +
Sbjct: 483 RSFARAIPAADPTPRPIGWYFTQGKGRVEAPRKVAGATPHDLGAPNERPWDATNYTAYQD 542

Query: 583 SSRWKDLNSKFVLQVYRDF--VATG-DKNFARAVWPSVYIAMAYMEQFDKDGDGMIENEG 639
            + WKDL S FVLQV+R F    TG D  F    WP+   A+ Y++ FD + DG+ +N G
Sbjct: 543 CNLWKDLASDFVLQVWRTFRLAPTGEDLRFLADCWPAAVEALRYLKGFDANSDGLPDNGG 602

Query: 640 FPDQTYDAWSANGVSAYCGGLWVAALQAASALAN----DVGDHASASY--FWVRYQKAKA 693
            PDQT+D W   GVSAYCG LW+AAL+AA A+      ++G   S+    F    ++++A
Sbjct: 603 APDQTFDDWPLKGVSAYCGALWIAALEAALAMGQRLQLELGLDTSSQQRDFSSWLEQSRA 662

Query: 694 VYDS-LWNGSYFNYDNSDGSSSTSIQADQLAGQWYARACGLLPIADEAKVKKALTKIYDF 752
            +D  LWNG Y+N D   G+    + ADQL G +YAR   L P+  E + + AL  I + 
Sbjct: 663 NFDRLLWNGEYYNIDADSGTP--VVMADQLCGDFYARLLELPPVVAEERARSALRAIREA 720

Query: 753 NVLKVKGGMCGAMNGMQPDGR-IDMSGLQAREIWPGVTYGLAASMIQEEMVDMAFQ-TAA 810
                +GG  G  NG++ DG  +D  G    E+W G+ +GLAA        D A   T A
Sbjct: 721 CFEGFQGGSLGVANGLRRDGTPLDPDGTHPLEVWTGINFGLAAYYRLMGETDTALAITGA 780

Query: 811 GVYEVAWSEDGLGYSFQTPESWNNNDEYRSLCYMRPLTIWAMQWA 855
            V++V     G G  F+TPE+    + +R+  Y+R + IWA+ WA
Sbjct: 781 VVHQVY----GGGLQFRTPEAITAVNTFRACHYLRAMAIWAL-WA 820


>gi|318042148|ref|ZP_07974104.1| hypothetical protein SCB01_10580 [Synechococcus sp. CB0101]
          Length = 836

 Score =  345 bits (885), Expect = 7e-92,   Method: Compositional matrix adjust.
 Identities = 254/834 (30%), Positives = 392/834 (47%), Gaps = 133/834 (15%)

Query: 86  DQGIPLGGIGAGSIGRSYRGEFQRFKL-----FHGICDDAPVLANQFSVFVSRPNGEKFS 140
           + G+PLGG GAG IGR+  G F  + L     + G+  D      QF++F S  NG++  
Sbjct: 58  NHGMPLGGFGAGCIGRAPDGSFNLWHLDGGEHWFGVLPDC-----QFALFES--NGQQSR 110

Query: 141 SVLCPRSPGVPKKNTDSG--IESWDW----NLKGENCTYHALFPRAWTVYDGEPDPELRI 194
           +      P       D+G  + SW W      +    TY A +P +W+ Y G  D E  +
Sbjct: 111 AHALAVQPERDASRPDAGAPLPSWSWYPASTAERTTGTYAARYPLSWSHYQGVFDAE--V 168

Query: 195 VCRQISPFIPHNYKESSFPASVFTFTLSNSGQTSADVTLLFTWANSVA------------ 242
            C+  SP +P NY+++S+P +VF +TL N  +   D++LL +W N+              
Sbjct: 169 SCQAFSPILPGNYQQTSYPLAVFVWTLHNPTRKPLDLSLLLSWRNTSGWFTNTDSSAEVH 228

Query: 243 -GDSGLSGHHFNSKTMTKDG----------VHGLTLHHRTANGRPPVT-----FAVAAEE 286
             D G   H++     T +G          + G+ L    +N   PV      + +A  E
Sbjct: 229 FRDDGSPEHNYAPAIGTTEGQRNRWVDDGPLKGVVLERDVSN---PVAEGEGQWCIATGE 285

Query: 287 TADVHVSECPCFLLSGNSKGITAKDMWNEIKKHGSFDHLDNDK-TSPSEPGSSIGAAIAA 345
              V +  C  +   G+       ++W+    +GS    +ND+ +S S+P S   AA+A 
Sbjct: 286 QPGVRIQRCSRWNPHGD-----GSELWSSFSANGSIPDSNNDRRSSQSDPAS---AALAV 337

Query: 346 SLTIPSGSTRSVTFSLAWDCPEVKFFEKVYH-RRYTKFYGTLGDSAARIAHDAILEHAKW 404
              +  G +  +   ++WD P   F       RRYT F+G+ G +A  IA + + +  +W
Sbjct: 338 QCRLEPGQSIEIPVVISWDLPVTAFATGTSALRRYTDFFGSSGGNATAIAAEGLRDWRRW 397

Query: 405 ECEIEAWQRPILEDKRFPEWYPITLFNELYYLNAGGTIWTDGSPPMQSLATIRERKFSLD 464
             +IEAWQ+P+LE    PE   + LFNELY L +GG++W+  S                 
Sbjct: 398 REQIEAWQQPVLERSDLPEPLRMALFNELYDLCSGGSLWSAAS----------------- 440

Query: 465 TPSKIHTPASSDTALGTRLLENGEENIGQFLYLEGAEYVMYNTYDVHFYSSFALVMLFPK 524
                                  E+ +G+F  LE  +Y  Y + DV  Y S AL+ L+P+
Sbjct: 441 ----------------------REDPVGRFGVLECLDYAWYESLDVRLYGSLALLQLWPE 478

Query: 525 LELSIQRDFAAAVMMHDPGTMKI---MSDGKW---VARKCLGAVPHDIGL--DDPWFEIN 576
           L+ ++ R FA A+   DP    I    + GK     ARK  GA PHD+G   + PW   N
Sbjct: 479 LDKAVLRSFARAIPAADPSPRPIGWYFTQGKGRVEAARKVKGATPHDLGAPNERPWDATN 538

Query: 577 SYNLFNSSRWKDLNSKFVLQVYRDFVATG---DKNFARAVWPSVYIAMAYMEQFDKDGDG 633
                + + WKDL S +VLQV+R F       D NF    WP+   A+ Y++ FD + DG
Sbjct: 539 YTAYQDCNLWKDLASDYVLQVWRTFKLAPNGEDINFLAECWPAAVEALHYLKGFDANNDG 598

Query: 634 MIENEGFPDQTYDAWSANGVSAYCGGLWVAA----------LQAASALANDVGDHASASY 683
           + +N G PDQT+D W   GVSAYCG LW+AA          LQ ++ L      H  +S+
Sbjct: 599 LPDNGGAPDQTFDDWPLKGVSAYCGALWIAALEAALAIAQQLQLSTGLDTASEQHTFSSW 658

Query: 684 FWVRYQKAKAVYDS-LWNGSYFNYDNSDGSSSTSIQADQLAGQWYARACGLLPIADEAKV 742
                ++++A +D  LWNG Y++ D   G+    + ADQL G +YAR  GL  +  EA  
Sbjct: 659 L----EQSRANFDHLLWNGEYYDIDAESGTP--VVMADQLCGDFYARLLGLPAVVSEANS 712

Query: 743 KKALTKIYDFNVLKVKGGMCGAMNGMQPDGR-IDMSGLQAREIWPGVTYGLAASMIQEEM 801
           +  L  + +    + +GG  G  NG++ DG  +D +G    E+W G+ +G+A+       
Sbjct: 713 RSTLKAVKEACFEQFEGGKLGVANGLRRDGTPLDPNGTHPLEVWTGINFGIASYYRLMGD 772

Query: 802 VDMAFQTAAGVYEVAWSEDGLGYSFQTPESWNNNDEYRSLCYMRPLTIWAMQWA 855
            D A    A V    ++    G  F+TPE+    + +R+  Y+R + IW + WA
Sbjct: 773 TDTALAITAAVVNQVYAG---GLQFRTPEAITAVNTFRACHYLRAMAIWGV-WA 822


>gi|326934726|ref|XP_003213436.1| PREDICTED: non-lysosomal glucosylceramidase-like, partial
           [Meleagris gallopavo]
          Length = 639

 Score =  343 bits (880), Expect = 3e-91,   Method: Compositional matrix adjust.
 Identities = 188/394 (47%), Positives = 247/394 (62%), Gaps = 18/394 (4%)

Query: 472 PASSDTALGTRLLENG--EENIGQFLYLEGAEYVMYNTYDVHFYSSFALVMLFPKLELSI 529
           PA S   +   LL++G  E  IGQ       EY MYNTYDVHFY+SFALVML+PKL++S+
Sbjct: 235 PAGSGRDVWQDLLQDGKLESPIGQ-------EYRMYNTYDVHFYASFALVMLWPKLQISL 287

Query: 530 QRDFAAAVMMHDPGTMKIMSDGKWVARKCLGAVPHDIGL--DDPWFEINSYNLFNSSRWK 587
           Q D A  V+  D    + +  GK    K    VPHDIG   D+PW  +N+Y + +++ WK
Sbjct: 288 QYDIAVTVLNEDMQPRQYLVCGKTAQVKLKNVVPHDIGDPDDEPWQRVNAYLMHDTANWK 347

Query: 588 DLNSKFVLQVYRDFVATGDKNFARAVWPSVYIAMAYMEQFDKDGDGMIENEGFPDQTYDA 647
           DLN KFVLQVYRD+  T D  + + +WP     M    +FD D DG+IEN G  DQTYDA
Sbjct: 348 DLNLKFVLQVYRDYYLTHDALYLQDMWPVCQAVMESELKFDTDNDGLIENGGTADQTYDA 407

Query: 648 WSANGVSAYCGGLWVAALQAASALANDVGDHASASYFWVRYQKAKAVYDS-LWNGSYFNY 706
           W  NG SAYCGGLW+AA++    +A  +GD  +   +    QK K  ++  LWNG Y+NY
Sbjct: 408 WVVNGASAYCGGLWLAAVRMMCEMAEVLGDAETQQKYGAILQKGKESFERLLWNGKYYNY 467

Query: 707 DNSDGSSSTSIQADQLAGQWYARACGL----LPIADEAKVKKALTKIYDFNVLKVKGGMC 762
           D+S  S+S+SI +DQ AGQW+  ACGL    L +  ++ +  AL  I++ NV+   GG  
Sbjct: 468 DSSGSSTSSSIMSDQCAGQWFLGACGLDQKELEVFPKSHIVSALKTIFEKNVMSFAGGTM 527

Query: 763 GAMNGMQPDGRIDMSGLQAREIWPGVTYGLAASMIQEEMVDMAFQTAAGVYEVAWSEDGL 822
           GA+NGM+PDG  D S +Q+ E+W GV Y LAA+MIQE +V   F TA G Y   W  + L
Sbjct: 528 GAVNGMRPDGVPDTSSVQSSEVWVGVVYALAATMIQEGLVQEGFHTAEGCYRTVW--ENL 585

Query: 823 GYSFQTPESWNNNDEYRSLCYMRPLTIWAMQWAL 856
           G +FQTPE++     YRSL YMRPL+IW+MQ AL
Sbjct: 586 GMAFQTPEAYCEKKVYRSLAYMRPLSIWSMQLAL 619



 Score =  136 bits (343), Expect = 5e-29,   Method: Compositional matrix adjust.
 Identities = 118/401 (29%), Positives = 179/401 (44%), Gaps = 41/401 (10%)

Query: 88  GIPLGGIGAGSIGRSYRGEFQRFKLFHGICDDAPVLANQFSVFVSRPNGEKFSSVLCPRS 147
           G PLGGIG G+I R +RGEF R++L  G      V+ +QF+V + R     +  VL    
Sbjct: 31  GCPLGGIGGGTITRGWRGEFCRWQLNPGKYHYETVITDQFTVCLRRKGQTVYQQVLSVEK 90

Query: 148 PGVPKKNTDSGIESWDWNLKGENCTYHALFPRAWTVYDGEPDPELRIVCRQISPFIPHNY 207
           P        S ++ W+W   G    YHAL+PRAW VY+  P   + + CRQISP IPH+Y
Sbjct: 91  P--------SALQGWNWGYCGRYAFYHALYPRAWMVYE-LPGQNVVLTCRQISPVIPHDY 141

Query: 208 KESSFPASVFTFTLSNSGQTSADVTLLFTWANSVAGDSGLSGHHFNSKTMTKDG---VHG 264
           K+SS P  VF + + N+ +   DV+++FT  N        SG H+N     +DG   V G
Sbjct: 142 KDSSLPVGVFIWEVENNSEEPVDVSIMFTLQNGTGTKGDGSGGHWNEPFALQDGGERVTG 201

Query: 265 LTLHHRTANGRPPVTFAVAAEETADVHVSECPCFLLSGNSKGITAKDMWNEIKKHGSFDH 324
           + LHH T     P TFAVAA   A   ++    F  +G     + +D+W ++ + G  + 
Sbjct: 202 VLLHHCTPIN--PFTFAVAAXXQAGTVITHLTAFDPAG-----SGRDVWQDLLQDGKLES 254

Query: 325 -LDNDKTSPSEPGSSIGAAIAASLTIPSGSTRSVTFSLAWDCPEVKFFEKVYHRRYTKFY 383
            +  +    +       A+ A  +  P      +  SL +D       E +  R+Y    
Sbjct: 255 PIGQEYRMYNTYDVHFYASFALVMLWP-----KLQISLQYDIAVTVLNEDMQPRQY---- 305

Query: 384 GTLGDSAARIAHDAILEHAKWECEIEAWQRP---ILEDKRFPEWYPITLFNEL-----YY 435
             +    A++    ++ H   + + E WQR    ++ D     W  + L   L     YY
Sbjct: 306 -LVCGKTAQVKLKNVVPHDIGDPDDEPWQRVNAYLMHDT--ANWKDLNLKFVLQVYRDYY 362

Query: 436 LNAGGTIWTDGSPPMQSLATIRERKFSLDTPSKIHTPASSD 476
           L        D  P  Q++    E KF  D    I    ++D
Sbjct: 363 LTHDALYLQDMWPVCQAVME-SELKFDTDNDGLIENGGTAD 402


>gi|87125468|ref|ZP_01081313.1| hypothetical protein RS9917_01806 [Synechococcus sp. RS9917]
 gi|86166768|gb|EAQ68030.1| hypothetical protein RS9917_01806 [Synechococcus sp. RS9917]
          Length = 832

 Score =  342 bits (876), Expect = 8e-91,   Method: Compositional matrix adjust.
 Identities = 251/824 (30%), Positives = 378/824 (45%), Gaps = 113/824 (13%)

Query: 86  DQGIPLGGIGAGSIGRSYRGEFQRFKLFHGICDDAPVLANQFSVFVSRPNGEKFSSVLCP 145
           + G+PLGG GAG  GR+  G F  + L  G      +   QF++F    NG    +    
Sbjct: 54  NHGMPLGGFGAGCFGRAPDGSFNLWHLDGGEHWFGALPDCQFALFER--NGSSTRAHALA 111

Query: 146 RSPGVPKKNTDSG--IESWDW----NLKGENCTYHALFPRAWTVYDGEPDPELRIVCRQI 199
             P       D G  + +WDW      +    TY A +P +WT YDG  D E+R  C   
Sbjct: 112 VKPEADASRPDGGEPLAAWDWYPASTPERSTGTYAARYPLSWTHYDGVYDAEVR--CEAF 169

Query: 200 SPFIPHNYKESSFPASVFTFTLSNSGQTSADVTLLFTWANSVA-------------GDSG 246
           SP +P +Y+ +S+P +VF +TL N      D++LL +W N+                D G
Sbjct: 170 SPILPGDYQRTSYPVAVFVWTLRNPTPKPLDLSLLLSWRNTSGWFTNTDASAEVHFRDDG 229

Query: 247 LSGHHFNSKTMTKDG----------VHGLTLHHRTANGRPPVT-----FAVAAEETADVH 291
              H++     T DG          + G+ L    +N   P+      + +A  E   V 
Sbjct: 230 SPEHNYAPAIGTTDGQRNRWIDDGNLKGVVLEGNVSN---PIAEGEGQWCIATAEQPGVT 286

Query: 292 VSECPCFLLSGNSKGITAKDMWNEIKKHGSFDHLDNDKTSPSEPGSSIGAAIAASLTIPS 351
           +  C  +   G+       ++W+  +  GS    +ND+ S       + AA+A    +  
Sbjct: 287 IQRCSRWNPHGD-----GNELWSSFRADGSIPDSNNDRRSGHN--DPLSAALAVQCQLAP 339

Query: 352 GSTRSVTFSLAWDCPEVKFFEKVYH-RRYTKFYGTLGDSAARIAHDAILEHAKWECEIEA 410
           G +  +   ++WD P   F       RRYT F+G  G  AA IA +A+ +   W  +IEA
Sbjct: 340 GQSLEIPVVISWDLPVTGFASGSQALRRYTDFFGAEGHQAAAIAAEALRDWRDWRQQIEA 399

Query: 411 WQRPILEDKRFPEWYPITLFNELYYLNAGGTIWTDGSPPMQSLATIRERKFSLDTPSKIH 470
           WQ+P+LE    PE   + LFNELY L +GG++W                  S  TP   H
Sbjct: 400 WQQPVLERSDLPEPVRMALFNELYDLCSGGSLW------------------SAATPMDPH 441

Query: 471 TPASSDTALGTRLLENGEENIGQFLYLEGAEYVMYNTYDVHFYSSFALVMLFPKLELSIQ 530
                                G+F  LE  +Y  Y + DV  Y S AL+ L+P+L+ ++ 
Sbjct: 442 ---------------------GRFGVLECIDYAWYESLDVRLYGSLALLQLWPELDKAVL 480

Query: 531 RDFAAAVMMHDPGTMKI---MSDGKW---VARKCLGAVPHDIGL--DDPWFEINSYNLFN 582
           R FA A+   D     I    + GK      RK  GA PHD+G   ++PW   N     +
Sbjct: 481 RSFARAIPAADATQRPIGWYFTQGKGRVEADRKVKGATPHDLGAPNENPWDATNYTAYQD 540

Query: 583 SSRWKDLNSKFVLQVYRDFVATG---DKNFARAVWPSVYIAMAYMEQFDKDGDGMIENEG 639
            + WKDL S +VLQV+R F  +    D NF    WP+   A+ Y++ FD + DG+ +N G
Sbjct: 541 CNLWKDLASDYVLQVWRTFKLSPSGEDINFLAECWPAAVEALTYLKGFDTNNDGLPDNGG 600

Query: 640 FPDQTYDAWSANGVSAYCGGLWVAALQAASA------LANDVGDHASASYFWVRYQKAKA 693
            PDQT+D W   GVSAYCG LW+AAL+AA A      LA  +   A    F    ++++A
Sbjct: 601 APDQTFDDWPLKGVSAYCGALWIAALEAALAMGQQLQLATGLDTAAEQHTFSTWLEQSRA 660

Query: 694 VYDS-LWNGSYFNYDNSDGSSSTSIQADQLAGQWYARACGLLPIADEAKVKKALTKIYDF 752
            +D+ LWNG Y++ D   G+    + ADQL G +YAR   L  +  +A  +  L  + + 
Sbjct: 661 NFDTLLWNGEYYDIDAESGTP--VVMADQLCGDFYARLLKLPAVVSDANCRSTLKAVKES 718

Query: 753 NVLKVKGGMCGAMNGMQPDGR-IDMSGLQAREIWPGVTYGLAASMIQEEMVDMAFQTAAG 811
                +GG  G  NG++ DG  +D  G    E+W G+ +G+A+     +    A    + 
Sbjct: 719 CFEAFQGGSLGVANGLRRDGTPLDPQGTHPLEVWTGINFGIASYYRLMDDTSTALAITSA 778

Query: 812 VYEVAWSEDGLGYSFQTPESWNNNDEYRSLCYMRPLTIWAMQWA 855
           V    ++    G  F+TPE+    + +R+  Y+R + IW + WA
Sbjct: 779 VVNQVYAG---GLQFRTPEAITAVNTFRACHYLRAMAIWGL-WA 818


>gi|324504438|gb|ADY41918.1| Non-lysosomal glucosylceramidase [Ascaris suum]
          Length = 534

 Score =  340 bits (872), Expect = 2e-90,   Method: Compositional matrix adjust.
 Identities = 208/556 (37%), Positives = 288/556 (51%), Gaps = 48/556 (8%)

Query: 329 KTSPSEPGSSIGAAIAASLTIPSGSTRSVTFSLAWDCPEVKFFEKVYHRRYTK--FYGTL 386
           K   +E    +G AI A       S+  V F LAWD P VKF       R     F+   
Sbjct: 7   KKERTEGEVDLGVAICAEFPAKCKSSAEVEFVLAWDMPIVKFGAGRRQYRRRYARFFPDA 66

Query: 387 GDSAARIAHDAILEHAKWECEIEAWQRPILEDKRFPEWYPITLFNELYYLNAGGTIWTDG 446
                ++   A++   +WE +I+AWQ+ IL D   P+WY   LFNE Y+L  GGT W + 
Sbjct: 67  SKRVEQMCSRALMSRIEWERKIDAWQQRILSDDSLPDWYKSALFNESYFLTDGGTCWFEY 126

Query: 447 SPPMQSLATIRERKFSLDTPSKIHTPASSDTALGTRLLENGEENIGQFLYLEGAEYVMYN 506
               +S     ER+ S ++                       +  G+F YLE  EY M N
Sbjct: 127 DDEWRST----ERQMSNESAKYF-------------------KEFGRFAYLEAWEYYMLN 163

Query: 507 TYDVHFYSSFALVMLFPKLELSIQRDFAAAVMMHDPGTMKIMSDGKWVARKCLGAVPHDI 566
           TYDVHFYSSFAL+  +P +EL+IQ DFA  V+         +++    A K LG +PHD+
Sbjct: 164 TYDVHFYSSFALLENWPLIELAIQLDFADQVLSSCDRKSVNINESTRTAVKRLGRLPHDL 223

Query: 567 G--LDDPWFEINSYNLFNSSRWKDLNSKFVLQVYRDF-----VATGDKN-----FARAVW 614
           G  +D+PW  +N+Y L ++  WKDLN KFVL  YRD+     +   D N       R  +
Sbjct: 224 GNPMDEPWLHLNAYALSDTCEWKDLNLKFVLTCYRDYEKIVKIYFNDDNEMKGCLLRRFY 283

Query: 615 PSVYIAMAYMEQFDKDGDGMIENEGFPDQTYDAWSANGVSAYCGGLWVAALQAASALAND 674
                 +A  + +D DGD +IEN G PDQTYD WS +G SAYCGGLW+ AL+    +A  
Sbjct: 284 DLSSGIIADAKAWDVDGDDLIENAGQPDQTYDVWSMHGSSAYCGGLWLCALECVRRMALT 343

Query: 675 VGDHASASYFWVRYQKAKAVYD-SLWNGSYFNYDNSDGSSSTSIQADQLAGQWYARACGL 733
           +G+   A  F  +   A+  Y+  LWNG YF++D    +   SI ADQL G W+   C  
Sbjct: 344 LGEVVDAQKFANKLNNARKAYERKLWNGKYFDFDEH-STDHKSIMADQLCGFWF--MCIT 400

Query: 734 LPIADEAKVKK-----ALTKIYDFNVLKVKGGMCGAMNGMQPDGRIDMSGLQAREIWPGV 788
               D+  + +     +L  I+++NV K   G  G +N M P G +D +G+Q+ E+W GV
Sbjct: 401 DGKVDDVIITRQQICASLKTIFEYNVEKFANGQLGPVNAMMPSGVVDSTGIQSEEVWGGV 460

Query: 789 TYGLAASMIQEEMVDMAFQTAAGVYEVAWSEDGLGYSFQTPESWNNNDEYRSLCYMRPLT 848
            Y LA+  +  E  + AF+TA G Y   W   GL Y  Q+PE+ N +  YR++ YMRPL 
Sbjct: 461 AYALASFHLLVEENESAFKTAEGWYRSCWERYGLQY--QSPEAINESSYYRAIGYMRPLA 518

Query: 849 IWAMQWALTKPKLSRQ 864
           IWAMQ AL   +  RQ
Sbjct: 519 IWAMQSALDALRNKRQ 534


>gi|345289089|gb|AEN81036.1| AT1G33700-like protein, partial [Capsella rubella]
 gi|345289091|gb|AEN81037.1| AT1G33700-like protein, partial [Capsella rubella]
 gi|345289093|gb|AEN81038.1| AT1G33700-like protein, partial [Capsella rubella]
 gi|345289095|gb|AEN81039.1| AT1G33700-like protein, partial [Capsella rubella]
 gi|345289097|gb|AEN81040.1| AT1G33700-like protein, partial [Capsella rubella]
 gi|345289101|gb|AEN81042.1| AT1G33700-like protein, partial [Capsella rubella]
 gi|345289103|gb|AEN81043.1| AT1G33700-like protein, partial [Capsella rubella]
          Length = 191

 Score =  339 bits (870), Expect = 4e-90,   Method: Compositional matrix adjust.
 Identities = 155/190 (81%), Positives = 175/190 (92%)

Query: 489 ENIGQFLYLEGAEYVMYNTYDVHFYSSFALVMLFPKLELSIQRDFAAAVMMHDPGTMKIM 548
           ENIGQFLYLEG +Y+MYNTYDVHFYSSFAL+MLFPKLELSIQRDFAAAV+MHD    ++M
Sbjct: 2   ENIGQFLYLEGTQYLMYNTYDVHFYSSFALLMLFPKLELSIQRDFAAAVLMHDSSRKQVM 61

Query: 549 SDGKWVARKCLGAVPHDIGLDDPWFEINSYNLFNSSRWKDLNSKFVLQVYRDFVATGDKN 608
           S G++V RK LGAVPHDIGL+DPWFE+N+YNLFN+ RWKDLNSKFVLQVYRD VATGD N
Sbjct: 62  SSGEFVTRKVLGAVPHDIGLNDPWFEVNAYNLFNTDRWKDLNSKFVLQVYRDVVATGDLN 121

Query: 609 FARAVWPSVYIAMAYMEQFDKDGDGMIENEGFPDQTYDAWSANGVSAYCGGLWVAALQAA 668
           FA+AVWPSVY A+AY++QFDKDGDGMIENEGFPDQTYDAWS +GVSAYCGGLWVAALQA 
Sbjct: 122 FAKAVWPSVYTAIAYLDQFDKDGDGMIENEGFPDQTYDAWSCSGVSAYCGGLWVAALQAG 181

Query: 669 SALANDVGDH 678
           SALA ++GD+
Sbjct: 182 SALAREIGDN 191


>gi|148240031|ref|YP_001225418.1| hypothetical protein SynWH7803_1695 [Synechococcus sp. WH 7803]
 gi|147848570|emb|CAK24121.1| Conserved hypothetical protein [Synechococcus sp. WH 7803]
          Length = 833

 Score =  339 bits (869), Expect = 4e-90,   Method: Compositional matrix adjust.
 Identities = 251/821 (30%), Positives = 397/821 (48%), Gaps = 108/821 (13%)

Query: 87  QGIPLGGIGAGSIGRSYRGEFQRFKLFHGICDDAPVLANQFSVFVSRPNGEKFSSVLCPR 146
            G+PLGG GAG IGRS RG+F  + L  G      +   QF++F S  + ++  ++    
Sbjct: 55  HGMPLGGFGAGCIGRSSRGDFNLWHLDGGEHWYGTIPDCQFALFESNGSSQRAHALAVKP 114

Query: 147 SPGVPKKNTDSGIESWDW----NLKGENCTYHALFPRAWTVYDGEPDPELRIVCRQISPF 202
                + N+   + +WDW      +    TY A +P +WT Y+G  D E  + C   SP 
Sbjct: 115 EADASRPNSAEPLAAWDWYPASTPERSTGTYAARYPLSWTHYEGVYDAE--VSCEAFSPI 172

Query: 203 IPHNYKESSFPASVFTFTLSNSGQTSADVTLLFTWANSVA-------------GDSGLSG 249
           +P +Y+ +S+P +VF +TL N      D++LL +W N+V               D G   
Sbjct: 173 LPGDYQRTSYPVAVFRWTLRNPSDQPLDLSLLLSWRNTVGWFFNTDPSAEVHFRDDGSPE 232

Query: 250 HHF----------NSKTMTKDGVHGLTLHHRTAN--GRPPVTFAVAAEETAD-VHVSECP 296
           H++           ++ + ++GV G+ L  + +   G     + +A  +  + V V  C 
Sbjct: 233 HNYAPAIGQGEGQRNRWLDQEGVRGIVLEGQRSEPIGEGQGQWCLAVPDALEGVEVLRCS 292

Query: 297 CFLLSGNSKGITAKDMWNEIKKHGSFDHLDNDKTSPSEPGSSIGAAIAASLTIPSGSTRS 356
            +  +G+      +++W      G   + +ND++S    G    AAIA   T+  G +R 
Sbjct: 293 RWDPTGD-----GQELWQSFAAEGRIPNSNNDRSS--RAGEQASAAIALKFTLAPGESRE 345

Query: 357 VTFSLAWDCPEVKFFEKVYH-RRYTKFYGTLGDSAARIAHDAILEHAKWECEIEAWQRPI 415
           +  +++WD P   F   V   RRYT F+G  G +AA +A +A+ +  +W  +I+AWQ P+
Sbjct: 346 IPIAISWDLPVTAFATGVRDLRRYTDFFGADGCNAAALAAEALRDWRQWRDQIDAWQAPV 405

Query: 416 LEDKRFPEWYPITLFNELYYLNAGGTIWTDGSPPMQSLATIRERKFSLDTPSKIHTPASS 475
           L  +  PE   + LFNELY L +GG++WT  S                            
Sbjct: 406 LARQELPEALRMALFNELYDLASGGSLWTAAS---------------------------- 437

Query: 476 DTALGTRLLENGEENIGQFLYLEGAEYVMYNTYDVHFYSSFALVMLFPKLELSIQRDFAA 535
                      G++  G+F  LE  +Y  Y + DV  Y SFAL+ L+P+L+ ++ R FA 
Sbjct: 438 -----------GKDPHGRFGVLECLDYAWYESLDVRLYGSFALLQLWPELDKAVLRSFAR 486

Query: 536 AVMMHDPGTMKI-----MSDGKWVA-RKCLGAVPHDIGL-DDPWFEINSYNLF-NSSRWK 587
           A+   D     I        G+  A RK  GA PHD+G  ++  F+  +Y  + + + WK
Sbjct: 487 AIPAADATQRPIGWYFTQGRGRVEADRKVKGATPHDLGAPNEVPFDATNYTAYQDCNLWK 546

Query: 588 DLNSKFVLQVYRDF--VATG-DKNFARAVWPSVYIAMAYMEQFDKDGDGMIENEGFPDQT 644
           DL S FVLQV+R F    TG D  F    WP+   A+ Y+++FD + DG+ +N G PDQT
Sbjct: 547 DLASDFVLQVWRTFKLAPTGEDLKFLADCWPAAVQALHYLKRFDVNNDGLPDNGGAPDQT 606

Query: 645 YDAWSANGVSAYCGGLWVAAL--------QAASALANDVGDHASASYFWVRYQKAKAVYD 696
           +D W   GVSAYCG LW+AAL        +    L  D G        W+  +++++ +D
Sbjct: 607 FDDWPLKGVSAYCGALWIAALEAALAMAQRLQQELGLDTGSDQHTFSAWL--EQSRSNFD 664

Query: 697 S-LWNGSYFNYDNSDGSSSTSIQADQLAGQWYARACGLLPIADEAKVKKALTKIYDFNVL 755
             LWNG Y++ D   G+    + ADQL G +YAR  GL P+  EA  +  L  + +    
Sbjct: 665 RLLWNGEYYDIDAESGTP--VVMADQLCGDFYARLLGLEPVVSEANSRSTLKAVKESCFE 722

Query: 756 KVKGGMCGAMNGMQPDGR-IDMSGLQAREIWPGVTYGLAASMIQEEMVDMAFQTAAGVYE 814
           K +GG  G  NG++ DG  +D +G    E+W G+ +G+A+          A    + V +
Sbjct: 723 KFQGGSLGVANGLRRDGTPLDPNGTHPLEVWTGINFGIASYYRLMGEGQTAEAICSAVVD 782

Query: 815 VAWSEDGLGYSFQTPESWNNNDEYRSLCYMRPLTIWAMQWA 855
             +S    G  F+TPE+    + +R+  Y+R + IW + WA
Sbjct: 783 QVYSG---GLQFRTPEAITAVNTFRACHYLRAMAIWGL-WA 819


>gi|345289099|gb|AEN81041.1| AT1G33700-like protein, partial [Capsella rubella]
          Length = 191

 Score =  338 bits (866), Expect = 1e-89,   Method: Compositional matrix adjust.
 Identities = 154/190 (81%), Positives = 175/190 (92%)

Query: 489 ENIGQFLYLEGAEYVMYNTYDVHFYSSFALVMLFPKLELSIQRDFAAAVMMHDPGTMKIM 548
           ENIGQFLYLEG +Y+MYNTYDVHFYSSFAL+MLFPKLELSIQRDFAAAV+MHD    ++M
Sbjct: 2   ENIGQFLYLEGTQYLMYNTYDVHFYSSFALLMLFPKLELSIQRDFAAAVLMHDSSRKQVM 61

Query: 549 SDGKWVARKCLGAVPHDIGLDDPWFEINSYNLFNSSRWKDLNSKFVLQVYRDFVATGDKN 608
           S G++V RK LGAVPHDIGL+DPWFE+N+YNLFN+ RWKDLNSKFVLQVYRD VATGD N
Sbjct: 62  SSGEFVTRKVLGAVPHDIGLNDPWFEVNAYNLFNTDRWKDLNSKFVLQVYRDVVATGDLN 121

Query: 609 FARAVWPSVYIAMAYMEQFDKDGDGMIENEGFPDQTYDAWSANGVSAYCGGLWVAALQAA 668
           FA+AVWPSVY A+AY++QFDKDGDGMIEN+GFPDQTYDAWS +GVSAYCGGLWVAALQA 
Sbjct: 122 FAKAVWPSVYTAIAYLDQFDKDGDGMIENDGFPDQTYDAWSCSGVSAYCGGLWVAALQAG 181

Query: 669 SALANDVGDH 678
           SALA ++GD+
Sbjct: 182 SALAREIGDN 191


>gi|317969095|ref|ZP_07970485.1| hypothetical protein SCB02_06140 [Synechococcus sp. CB0205]
          Length = 837

 Score =  336 bits (861), Expect = 4e-89,   Method: Compositional matrix adjust.
 Identities = 249/825 (30%), Positives = 384/825 (46%), Gaps = 115/825 (13%)

Query: 86  DQGIPLGGIGAGSIGRSYRGEFQRFKL-----FHGICDDAPVLANQFSVFVSRPNGEKFS 140
           + G+PLGG GAG IGR+  G F  + L     + G+  D      QFS+F S    ++  
Sbjct: 59  NHGMPLGGFGAGCIGRAPDGSFNLWHLDGGEHWFGVLPDC-----QFSLFESNGTEQRAH 113

Query: 141 SVLCPRSPGVPKKNTDSGIESWDW----NLKGENCTYHALFPRAWTVYDGEPDPELRIVC 196
           ++    +    +      + SW W    + + +  TY A +P +W+ Y G  + E  + C
Sbjct: 114 ALAVKPNRDDSRPEAADPLSSWSWYPASSEQRQTGTYAARYPLSWSHYQGVFEAE--VSC 171

Query: 197 RQISPFIPHNYKESSFPASVFTFTLSNSGQTSADVTLLFTWANSVA-------------G 243
              SP +P NY+++S+P + F +TL N  + + D++L+ +W N+                
Sbjct: 172 EAFSPILPGNYQQTSYPLATFVWTLRNPTRQALDLSLMLSWRNTCGWFTNTDSSAEVHFR 231

Query: 244 DSGLSGHHF-----NSKTMTKDGVHGLTLHHRTANG--RPPVT-----FAVAAEETADVH 291
           D G   H++      S+    D +   +L      G    PV      + +A +E A V 
Sbjct: 232 DDGSPEHNYAPAIGKSEGHRNDWIDDGSLKGVVLGGDVSSPVAEGEGQWCIATQEQAGVS 291

Query: 292 VSECPCFLLSGNSKGITAKDMWNEIKKHGSFDHLDNDKTS-PSEPGSSIGAAIAASLTIP 350
           +  C  +   G+       D+W    + GS    +N++ S  +EP   + AA+A    + 
Sbjct: 292 IQRCSRWNPQGD-----GSDLWGSFSRDGSIPESNNNRRSGAAEP---VSAALAVRCRLE 343

Query: 351 SGSTRSVTFSLAWDCPEVKFFEKVYH-RRYTKFYGTLGDSAARIAHDAILEHAKWECEIE 409
            G +  +   ++WD P   F       RRYT F+G  G +AA IA + + +   W  +IE
Sbjct: 344 PGQSIEIPVVISWDLPVTSFATGTSALRRYTDFFGASGKNAAAIAAEGLRDWRSWRQQIE 403

Query: 410 AWQRPILEDKRFPEWYPITLFNELYYLNAGGTIWTDGSPPMQSLATIRERKFSLDTPSKI 469
           AWQ+P+LE    PE   + LFNELY L +GG++W+  S                      
Sbjct: 404 AWQQPVLERSDLPEPLRMALFNELYDLCSGGSLWSAASR--------------------- 442

Query: 470 HTPASSDTALGTRLLENGEENIGQFLYLEGAEYVMYNTYDVHFYSSFALVMLFPKLELSI 529
                             E+ +G+F  LE  +Y  Y + DV  Y S AL+ L+P+L+ ++
Sbjct: 443 ------------------EDPVGRFGVLECLDYAWYESLDVRLYGSLALLQLWPELDKAV 484

Query: 530 QRDFAAAVMMHDPGTMKI---MSDGKW---VARKCLGAVPHDIGL--DDPWFEINSYNLF 581
            R FA A+   D     I    + GK     ARK  GA PHD+G   + PW   N     
Sbjct: 485 LRSFARAIPAADATPRPIGWYFTQGKGRVEAARKVKGATPHDLGAPNEKPWDATNYTAYQ 544

Query: 582 NSSRWKDLNSKFVLQVYRDFVATG---DKNFARAVWPSVYIAMAYMEQFDKDGDGMIENE 638
           + + WKDL S +VLQV+R F       D  F    WP+   A+ Y++ FD + DG+ +N 
Sbjct: 545 DCNLWKDLASDYVLQVWRTFKLAPNGEDLRFLADCWPAAVEALRYLKTFDINNDGLPDNG 604

Query: 639 GFPDQTYDAWSANGVSAYCGGLWVAALQAASA------LANDVGDHASASYFWVRYQKAK 692
           G PDQT+D W   GVSAYCG LW+AAL AA A      L+  +   A    F     +A+
Sbjct: 605 GAPDQTFDDWPLKGVSAYCGALWIAALDAALAIAQQLQLSTGLDTSAEQQEFSAWLAQAR 664

Query: 693 AVYDS-LWNGSYFNYDNSDGSSSTSIQADQLAGQWYARACGLLPIADEAKVKKALTKIYD 751
             +D  LWNG Y++ D   G  +  + ADQL G +YAR  GL P+  EA  +  L  + D
Sbjct: 665 GNFDRLLWNGEYYDIDAESG--TPVVMADQLCGDFYARLLGLEPVVSEANSRSTLNAVKD 722

Query: 752 FNVLKVKGGMCGAMNGMQPDGR-IDMSGLQAREIWPGVTYGLAASMIQEEMVDMAFQTAA 810
               K +GG  G  NG++ DG  +D +G    E+W G+ +G+A+        + A    +
Sbjct: 723 SCFEKFEGGSLGVANGLRRDGTPLDPNGTHPLEVWTGINFGIASYFRLMGETETALAITS 782

Query: 811 GVYEVAWSEDGLGYSFQTPESWNNNDEYRSLCYMRPLTIWAMQWA 855
            V    +S    G  F+TPE+    + +R+  Y+R + IW + WA
Sbjct: 783 AVVNQVYSG---GLQFRTPEAITAVNTFRACHYLRAMAIWGV-WA 823


>gi|116073260|ref|ZP_01470522.1| hypothetical protein RS9916_32457 [Synechococcus sp. RS9916]
 gi|116068565|gb|EAU74317.1| hypothetical protein RS9916_32457 [Synechococcus sp. RS9916]
          Length = 832

 Score =  333 bits (855), Expect = 2e-88,   Method: Compositional matrix adjust.
 Identities = 258/829 (31%), Positives = 393/829 (47%), Gaps = 123/829 (14%)

Query: 86  DQGIPLGGIGAGSIGRSYRGEFQRFKL-----FHGICDDAPVLANQFSVFVSRPNGEKFS 140
           + G+PLGG GAG IGR+  G    + L     + G+  D      QF++F S  +G++  
Sbjct: 54  NHGMPLGGFGAGCIGRAPDGNINLWHLDGGEHWFGVLPDC-----QFALFESNGSGKRAH 108

Query: 141 SVLCPRSPGVPKKNTDSGIESWDW----NLKGENCTYHALFPRAWTVYDGEPDPELRIVC 196
           ++    S    +  +   + +WDW      +    TY A +P +WT Y+G  D E+R  C
Sbjct: 109 ALAVKPSADASRPESGEPLPAWDWYPASTAERSTGTYAARYPLSWTSYEGVYDAEVR--C 166

Query: 197 RQISPFIPHNYKESSFPASVFTFTLSNSGQTSADVTLLFTWANSVA-------------G 243
              SP +P +Y+ +S+P +VF +TL N      D++LL +W N+                
Sbjct: 167 EAFSPILPGDYQRTSYPVAVFVWTLHNPTDQPLDLSLLLSWRNTTGWFTNTDASAEVHFR 226

Query: 244 DSGLSGHHFN---SKTMTK------DG-VHGLTLHHRTANGRPPVT-----FAVAAEETA 288
           D G   H++     KT+ +      DG + G+ L    +N   PV      + +A  E  
Sbjct: 227 DDGSPEHNYAPAIGKTVGQRNRWVDDGNLKGVVLEGNVSN---PVAEGEGQWCIATTEQP 283

Query: 289 DVHVSECPCFLLSGNSKGITAKDMWNEIKKHGSFDHLDNDKTSPSEPGSSIGAAIAASLT 348
            V +  C  +  SG+       ++WN     GS    +ND+TS +     + AA+A    
Sbjct: 284 GVSIQRCSRWNPSGD-----GSELWNSFSADGSIPESNNDRTSGAN--DPLSAALAVQCQ 336

Query: 349 IPSGSTRSVTFSLAWDCPEVKFFEKVYH-RRYTKFYGTLGDSAARIAHDAILEHAKWECE 407
           +  G +  +   ++WD P   F       RRYT F+GT GD AA IA +A+ +  +W  +
Sbjct: 337 LAPGQSLEIPVMISWDLPVTAFATGSQALRRYTDFFGTGGDQAAAIAAEALRDWKQWREQ 396

Query: 408 IEAWQRPILEDKRFPEWYPITLFNELYYLNAGGTIWTDGSPPMQSLATIRERKFSLDTPS 467
           I AWQ+P+L+    PE   + LFNELY L +GG++W+  SP                   
Sbjct: 397 IGAWQQPVLQRSELPEPLRMALFNELYDLCSGGSLWSAASP------------------- 437

Query: 468 KIHTPASSDTALGTRLLENGEENIGQFLYLEGAEYVMYNTYDVHFYSSFALVMLFPKLEL 527
                               E+  G+F  LE  +Y  Y + DV  Y S AL+ L+P+L+ 
Sbjct: 438 --------------------EDPHGRFGVLECLDYAWYESLDVRLYGSLALLQLWPELDK 477

Query: 528 SIQRDFAAAVMMHDPGTMKI---MSDGKW---VARKCLGAVPHDIGLDD--PWFEINSYN 579
           ++ R FA A+   D     I    + GK      RK  GA PHD+G  +  PW   N   
Sbjct: 478 AVLRSFARAIPAADATQRPIGWYFTQGKGRVEADRKVKGATPHDLGAPNELPWDATNYTA 537

Query: 580 LFNSSRWKDLNSKFVLQVYRDFVATG---DKNFARAVWPSVYIAMAYMEQFDKDGDGMIE 636
             + + WKDL S +VLQV+R F  +    D  F    WPS   A+ Y++ FD + DG+ +
Sbjct: 538 YQDCNLWKDLGSDYVLQVWRTFKLSPSGEDVRFLADCWPSAVEALRYLKTFDVNDDGLPD 597

Query: 637 NEGFPDQTYDAWSANGVSAYCGGLWVAALQAASA------LANDVGDHASASYFWVRYQK 690
           N G PDQT+D W   GVSAYCG LW+AAL+AA A      L   +   A    F    ++
Sbjct: 598 NGGAPDQTFDDWPLKGVSAYCGALWIAALEAALAIAQTLQLKTGLDTSAEQREFSGWLEQ 657

Query: 691 AKAVYDS-LWNGSYFNYDNSDGSSSTSIQADQLAGQWYARACGLLPIADEAKVKKALTKI 749
           ++  +D  LWNG Y++ D   G+    + ADQL G +YAR  GL P+  EA  +  L  +
Sbjct: 658 SRGNFDRLLWNGEYYDIDAESGTP--VVMADQLCGDFYARLLGLEPVVSEANSRSTLKAV 715

Query: 750 YDFNVLKVKGGMCGAMNGMQPDGR-IDMSGLQAREIWPGVTYGLAA--SMIQEEMVDMAF 806
            +    K  GG  G  NG++ DG  +D +G    E+W G+ +G+A+   ++ E     A 
Sbjct: 716 REACFEKFDGGKLGVANGLRRDGTPLDPNGTHPLEVWTGINFGIASYYRLMGEGQTAEAI 775

Query: 807 QTAAGVYEVAWSEDGLGYSFQTPESWNNNDEYRSLCYMRPLTIWAMQWA 855
            +A  V +V       G  F+TPE+    + +R+  Y+R + IW + WA
Sbjct: 776 CSAV-VNQVY----AGGLQFRTPEAITAVNTFRACHYLRAMAIWGL-WA 818


>gi|113953483|ref|YP_730032.1| hypothetical protein sync_0818 [Synechococcus sp. CC9311]
 gi|113880834|gb|ABI45792.1| conserved hypothetical protein [Synechococcus sp. CC9311]
          Length = 833

 Score =  331 bits (848), Expect = 1e-87,   Method: Compositional matrix adjust.
 Identities = 259/828 (31%), Positives = 391/828 (47%), Gaps = 122/828 (14%)

Query: 87  QGIPLGGIGAGSIGRSYRGEFQRFKLFHGICDDAPVLANQFSVFVSRPNGEKFSSVLCPR 146
            G+PLGG GAG IGRS RG+F  + L  G      +   QF+++  +  G++        
Sbjct: 55  HGMPLGGFGAGCIGRSSRGDFNLWHLDGGEHWYGSIPDCQFALWEKQ--GDQVRVHALAT 112

Query: 147 SPGVPKKNTDSG--IESWDW----NLKGENCTYHALFPRAWTVYDGEPDPELRIVCRQIS 200
            P       +SG  + SW W      +    TY A +P +W  Y G    E  + C   S
Sbjct: 113 EPARDDSRPESGKPLSSWQWYPASTEENSTGTYAARYPLSWNHYKGVFRAE--VSCEAFS 170

Query: 201 PFIPHNYKESSFPASVFTFTLSNSGQTSADVTLLFTWANSVA-------------GDSGL 247
           P +P +Y+ SS+P +VF +TL+N      +V+LL +W N+V               D G 
Sbjct: 171 PILPGDYQRSSYPVAVFRWTLTNPTNKPLEVSLLLSWRNTVGWFTNTDASAEVHFRDDGS 230

Query: 248 SGHHF----------NSKTMTKDGVHGLTLHHRTANGRPPVT-----FAVAAEETAD-VH 291
             H++          +++ +  DG+ G+ L  + +    PV      + +A  +T + V 
Sbjct: 231 PEHNYAPAIGDGEGQSNRWIDGDGLSGVLLDGKRST---PVAEGEGQWCLALPDTLEGVE 287

Query: 292 VSECPCFLLSGNSKGITAKDMWNEIKKHGSFDHLDNDKTSPSEPGSSIGAAIAASLTIPS 351
           +  C  +  S +       ++W      G     +ND+ S    G    AAIA   T+  
Sbjct: 288 LMRCSRWDPSSD-----GAELWQPFAADGVIPDSNNDRAS--RKGEHASAAIAVKFTLAP 340

Query: 352 GSTRSVTFSLAWDCPEVKFFEKVYH-RRYTKFYGTLGDSAARIAHDAILEHAKWECEIEA 410
           G TR +  +++WD P   F   V   RRY  FYG  G  AA IA +A+ +   W  +IEA
Sbjct: 341 GETREIPVAISWDLPVTSFATGVRDLRRYCDFYGVDGCHAAAIASEALRDWRSWHQQIEA 400

Query: 411 WQRPILEDKRFPEWYPITLFNELYYLNAGGTIWTDGSPPMQSLATIRERKFSLDTPSKIH 470
           WQ P+L  K  PE   + LFNELY L +GG++WT  +                       
Sbjct: 401 WQEPVLARKELPEELRMALFNELYDLASGGSLWTAAT----------------------- 437

Query: 471 TPASSDTALGTRLLENGEENIGQFLYLEGAEYVMYNTYDVHFYSSFALVMLFPKLELSIQ 530
                            ++  G+F  LE  +Y  Y + DV  Y SFAL+ L+P+L+ ++ 
Sbjct: 438 ----------------SKDPYGRFGVLECLDYAWYESLDVRLYGSFALLQLWPELDKAVI 481

Query: 531 RDFAAAVMMHDPGTMKI-----MSDGKWVA-RKCLGAVPHDIGL-DDPWFEINSYNLF-N 582
           R FA A+   D     I        G+  A RK  GA PHD+G  ++  F+  +Y  + +
Sbjct: 482 RSFARAIPAADATQRPIGWYFTQGRGRVEADRKVKGATPHDLGAPNEVPFDATNYTAYQD 541

Query: 583 SSRWKDLNSKFVLQVYRDF--VATG-DKNFARAVWPSVYIAMAYMEQFDKDGDGMIENEG 639
            + WKDL S +VLQV+R F    TG D +F    WP+   A+ Y++QFD + DG+ +N G
Sbjct: 542 CNLWKDLASDYVLQVWRTFKLAPTGEDLSFLAECWPAAVQALHYLKQFDVNDDGLPDNGG 601

Query: 640 FPDQTYDAWSANGVSAYCGGLWVAALQAASA--------LANDVGDHASASYFWVRYQKA 691
            PDQT+D W   GVSAYCG LW+AAL+AA A        L  D GD       W+  +++
Sbjct: 602 APDQTFDDWPLKGVSAYCGALWIAALEAALAMAQRLQLELGLDTGDEQHTFSGWL--EQS 659

Query: 692 KAVYDS-LWNGSYFNYDNSDGSSSTSIQADQLAGQWYARACGLLPIADEAKVKKALTKIY 750
           +A +D  LWNG Y++ D   G+    + ADQL G +YAR  GL P+  EA  +  L  + 
Sbjct: 660 RANFDKLLWNGEYYDIDAESGTP--VVMADQLCGDFYARLLGLPPVVSEANSRSTLKAVK 717

Query: 751 DFNVLKVKGGMCGAMNGMQPDGR-IDMSGLQAREIWPGVTYGLAA--SMIQEEMVDMAFQ 807
           +       GG  G  NG++ DG  +D +G    E+W G+ +G+A+   ++ E     A  
Sbjct: 718 EACFDNFAGGSLGVANGLRRDGTPLDPNGTHPLEVWTGINFGIASYYRLMGEGKTAEAIC 777

Query: 808 TAAGVYEVAWSEDGLGYSFQTPESWNNNDEYRSLCYMRPLTIWAMQWA 855
           +A      A      G  F+TPE+    + +R+  Y+R + IW + WA
Sbjct: 778 SAVVTQVYA-----GGLQFRTPEAITAVNTFRACHYLRAMAIWGL-WA 819


>gi|449275451|gb|EMC84314.1| Non-lysosomal glucosylceramidase [Columba livia]
          Length = 692

 Score =  331 bits (848), Expect = 1e-87,   Method: Compositional matrix adjust.
 Identities = 172/365 (47%), Positives = 233/365 (63%), Gaps = 7/365 (1%)

Query: 497 LEGAEYVMYNTYDVHFYSSFALVMLFPKLELSIQRDFAAAVMMHDPGTMKIMSDGKWVAR 556
           L G EY MYNTYDVHFY+SFAL+ML+PKL++S+Q D A AV+  D    + +  G+    
Sbjct: 314 LPGQEYRMYNTYDVHFYASFALIMLWPKLQISLQYDIAVAVVNEDVQPRQYLMGGQTAPV 373

Query: 557 KCLGAVPHDIG--LDDPWFEINSYNLFNSSRWKDLNSKFVLQVYRDFVATGDKNFARAVW 614
           K    VPHDIG   D+PW  +N+Y + +++ WKDLN KFVLQVYRD+  T D  + R +W
Sbjct: 374 KTKNVVPHDIGDPADEPWQRVNAYLIHDTANWKDLNLKFVLQVYRDYYLTHDSLYLRDMW 433

Query: 615 PSVYIAMAYMEQFDKDGDGMIENEGFPDQTYDAWSANGVSAYCGGLWVAALQAASALAND 674
           P     M    +FD D DG+IEN GF DQTYDAW  NG SAYCGGLW+AA+     +A  
Sbjct: 434 PVCQAVMESELKFDMDNDGLIENGGFADQTYDAWVVNGASAYCGGLWLAAVCMMCKMAEV 493

Query: 675 VGDHASASYFWVRYQKAKAVYDS-LWNGSYFNYDNSDGSSSTSIQADQLAGQWYARACGL 733
           +GD      +    +K K  ++  LWNG Y+NYD+S  ++S+SI +DQ AG  + +    
Sbjct: 494 LGDAEIQQKYMDILRKGKETFERLLWNGKYYNYDSSGSNTSSSIMSDQCAG--HPKPPLS 551

Query: 734 LPIADEAKVKKALTKIYDFNVLKVKGGMCGAMNGMQPDGRIDMSGLQAREIWPGVTYGLA 793
           + +  ++ V  AL  I++ NV+   GG  GA+NGM+P G  D S +Q+ E+W GV Y LA
Sbjct: 552 MQVFPKSHVVSALKTIFEKNVMGFAGGTMGAVNGMRPSGVPDTSSVQSNEVWVGVVYALA 611

Query: 794 ASMIQEEMVDMAFQTAAGVYEVAWSEDGLGYSFQTPESWNNNDEYRSLCYMRPLTIWAMQ 853
           A+MIQE +V+  F+TA G Y   W +  LG +FQTPE++     YRSL YMRPL+IW+MQ
Sbjct: 612 ATMIQEGLVEEGFRTAEGCYRTVWEQ--LGMAFQTPEAYREKKVYRSLAYMRPLSIWSMQ 669

Query: 854 WALTK 858
            AL +
Sbjct: 670 LALER 674



 Score =  116 bits (291), Expect = 5e-23,   Method: Compositional matrix adjust.
 Identities = 71/210 (33%), Positives = 104/210 (49%), Gaps = 19/210 (9%)

Query: 126 QFSVFVSRPNGEKFSSVLCPRSPGVPKKNTDSGIESWDWNLKGENCTYHALFPRAWTVYD 185
           QF+V +       +  VL    P        S ++ W+W   G    YHAL+PRAW VY+
Sbjct: 6   QFTVCLRCKGQTVYQQVLSVERP--------STLQGWNWGYCGHYAFYHALYPRAWMVYE 57

Query: 186 GEPDPELRIVCRQISPFIPHNYKESSFPASVFTFTLSNSGQTSADVTLLFTWANSVAGDS 245
             P   + + CRQISP IPH+YK+SS P  VF + + N      DV+++F+  N      
Sbjct: 58  -LPGQNVVLTCRQISPVIPHDYKDSSLPVGVFIWEVENGRDEDVDVSIMFSLQNGTGTKE 116

Query: 246 GLSGHHFNSK-TMTKDG--VHGLTLHHRTANGRPPVTFAVAAEETADVHVSECPCFLLSG 302
             SG H+N   T  K+G  V G+ LHH T     P TFA++A E A V V+    F  +G
Sbjct: 117 DRSGGHWNEPFTFEKEGERVAGVLLHHCTHVN--PFTFAISAREKAGVVVTHLTAFNPAG 174

Query: 303 NSKGITAKDMWNEIKKHGSFDHLDNDKTSP 332
                + +++W ++ + G  D  D +   P
Sbjct: 175 -----SGREVWQDLLQDGQLDSSDGESLHP 199


>gi|87301122|ref|ZP_01083963.1| hypothetical protein WH5701_14591 [Synechococcus sp. WH 5701]
 gi|87284090|gb|EAQ76043.1| hypothetical protein WH5701_14591 [Synechococcus sp. WH 5701]
          Length = 830

 Score =  330 bits (846), Expect = 2e-87,   Method: Compositional matrix adjust.
 Identities = 258/823 (31%), Positives = 390/823 (47%), Gaps = 114/823 (13%)

Query: 86  DQGIPLGGIGAGSIGRSYRGEFQRFKL-----FHGICDDAPVLANQFSVFVSRPNGEKFS 140
           + G PLGG GAG IGR   G F  + L     + G+  D      QF++F       +  
Sbjct: 54  NHGAPLGGFGAGCIGRGPDGSFNLWHLDGGEHWFGVLPDC-----QFALFEHDGTASRAR 108

Query: 141 SV-LCPRSPGVPKKNTDSGIESWDWNLKGENCTYHALFPRAWTVYDGEPDPELRIVCRQI 199
           ++ + P       + ++  +  W W L+GE  TY A +P  W  +D + +    + C   
Sbjct: 109 ALAMAPERDDSRPEASERPLGGWQW-LEGEAGTYAARYPLHW--FDYQDNFHTEVSCEAF 165

Query: 200 SPFIPHNYKESSFPASVFTFTLSNSGQTSADVTLLFTWANSVA-------------GDSG 246
           SP +P +Y+ +S+P +VF + L+NS +   D++LL +W N+V               D G
Sbjct: 166 SPILPGDYQRTSYPVAVFRWRLANSTKKPLDLSLLLSWRNTVGWFTNTDPSAAVTFRDDG 225

Query: 247 LSGHHF----------NSKTMTKDGVHGLTLH----HRTANGRPPVTFAVAAEETADVHV 292
              H++           ++ +   G+ G+ L        A G      AV  ++   V V
Sbjct: 226 SPEHNYLPAIGAGVGQRNRLIDAPGLKGVLLEGAPSQPIAEGEGQWCLAVP-DQLEGVEV 284

Query: 293 SECPCFLLSGNSKGITAKDMWNEIKKHGSFDHLDNDKTSPSEPGSSIGAAIAASLTIPSG 352
             C  +  SG+       ++W    + GS    +ND+ S     +S  AAIA  L +  G
Sbjct: 285 MRCSRWDPSGD-----GAEIWEPFARDGSIPDSNNDRRSTGSEQAS--AAIALRLRLEPG 337

Query: 353 STRSVTFSLAWDCPEVKFFEKVYH-RRYTKFYGTLGDSAARIAHDAILEHAKWECEIEAW 411
           +   +   ++WD P   F       RRYT F+G  G SA  IA +A+ +  +W   IEAW
Sbjct: 338 AEIELPVVISWDLPVTAFATGTSALRRYTDFFGAAGTSAVAIAAEALRDWRQWRDAIEAW 397

Query: 412 QRPILEDKRFPEWYPITLFNELYYLNAGGTIWTDGSPPMQSLATIRERKFSLDTPSKIHT 471
           Q+P+LE    PE   + L NELY L +GG++WT                           
Sbjct: 398 QQPVLERSDLPEPLRMALLNELYDLTSGGSLWT--------------------------- 430

Query: 472 PASSDTALGTRLLENGEENIGQFLYLEGAEYVMYNTYDVHFYSSFALVMLFPKLELSIQR 531
            A+S T           + +G+F  LE  +Y  Y + DV  Y SFAL+ L+P+L+ ++ R
Sbjct: 431 -AASST-----------DPVGRFGVLECLDYAWYESLDVRLYGSFALLQLWPELDKAVLR 478

Query: 532 DFAAAVMMHDPGTMKI---MSDGKW---VARKCLGAVPHDIGL-DDPWFEINSYNLF-NS 583
            FA A+   D     I    + G+      RK  GA PHD+G  ++  F+  +Y  + + 
Sbjct: 479 SFARAIPAADATPRPIGWYFTQGRGRVEAPRKVAGATPHDLGAPNERPFDATNYTAYQDC 538

Query: 584 SRWKDLNSKFVLQVYRDF--VATG-DKNFARAVWPSVYIAMAYMEQFDKDGDGMIENEGF 640
           + WKDL S FVLQV+R F    TG D  F    WP+V  A+ Y++QFD + DG+ +N G 
Sbjct: 539 NLWKDLASDFVLQVWRTFSLAPTGEDLRFLADCWPAVVEALRYLKQFDVNDDGLPDNGGA 598

Query: 641 PDQTYDAWSANGVSAYCGGLWVAALQAASA----LANDVGDHASASY--FWVRYQKAKAV 694
           PDQT+D W   GVSAYCG LW+AAL+AA A    L  D+G    A    F    + ++A 
Sbjct: 599 PDQTFDDWPLQGVSAYCGALWIAALEAALAMAQRLQLDLGLDTGAEQREFGSWLEPSRAN 658

Query: 695 YDS-LWNGSYFNYDNSDGSSSTSIQADQLAGQWYARACGLLPIADEAKVKKALTKIYDFN 753
           +D  LWNG Y+  D   GS +  + ADQL G +YAR  GL  +  + +   +L  I +  
Sbjct: 659 FDRLLWNGEYYRIDA--GSGTPVVMADQLCGDFYARLLGLPQVVSDERALSSLKAIREAC 716

Query: 754 VLKVKGGMCGAMNGMQPDGR-IDMSGLQAREIWPGVTYGLAASMIQEEMVDMAFQTAAGV 812
             + +GG  G  NG++ DG  +D  G    E+W G+ +GLAA        D A    + V
Sbjct: 717 FERFEGGRLGVANGLRRDGTPLDPKGTHPLEVWTGINFGLAAYYRLMGQTDTALAITSAV 776

Query: 813 YEVAWSEDGLGYSFQTPESWNNNDEYRSLCYMRPLTIWAMQWA 855
               ++    G  F+TPE+      +R+  Y+R + IWA+ WA
Sbjct: 777 VGQVYAG---GLQFRTPEAITAVGTFRACHYLRAMAIWAL-WA 815


>gi|158299437|ref|XP_319575.4| AGAP008830-PA [Anopheles gambiae str. PEST]
 gi|157013519|gb|EAA14848.4| AGAP008830-PA [Anopheles gambiae str. PEST]
          Length = 913

 Score =  329 bits (844), Expect = 4e-87,   Method: Compositional matrix adjust.
 Identities = 192/582 (32%), Positives = 304/582 (52%), Gaps = 66/582 (11%)

Query: 45  SFQEFRHLAHIGFRLYRYSKQEEAKGKVPVFNVFRKHHITSDQGIPLGGIGAGSIGRSYR 104
           SF++   +  +  R   Y  +   +G+  + + +   +     G PLGGIGAG+IGR + 
Sbjct: 35  SFRQILSMVPLALRYIPYYWKVAREGRQVLMDYWYTENGKQIYGAPLGGIGAGTIGRGFA 94

Query: 105 GEFQRFKLFHGICDDAPVLANQFSVFVSRPNGEK-FSSVLCPRSPGVPKKNT-------- 155
           GEF R++L  G+ +   V ANQF V +    G   F S+L      V K ++        
Sbjct: 95  GEFCRYQLKPGLYEYNTVHANQFIVTIKDETGATIFHSLLSTYRSYVQKIDSHRLAKAFV 154

Query: 156 ------------DSGIESWDWNLKGENCTYHALFPRAWTVYDGEPDPELRIVCRQISPFI 203
                        + + SW+  L    C+Y AL+PRAW+ YD   +  +++V RQISP I
Sbjct: 155 LRVWAGFISYRPKTPLASWESGLDASRCSYTALYPRAWSEYD-LSEHGVKLVQRQISPII 213

Query: 204 PHNYKESSFPASVFTFTLSNSGQTSADVTLLFTWANSVAGDSGLSGHHFNSKTMTKDGVH 263
           PH+YKESS P +VF +T+ N       VTL FT+ N        +     +   T+    
Sbjct: 214 PHDYKESSLPCAVFVWTVENVCDKDRQVTLTFTFKNGTGTKKQDAEGGSETSAFTQGNAR 273

Query: 264 GLTLHHRTANGRPPVTFAVAAEETADVHVSECPCFLLSGNSKGITAKDMWNEIKKHGSFD 323
           G+++    A    P T+ V+   +++++++ C  F  +GN      + +WN++K++G   
Sbjct: 274 GVSIRQTIA--EMPCTYCVSCRSSSEINLTRCERFDPTGN-----GEKLWNDLKENGHLT 326

Query: 324 HLDNDKTSPSEPGSSIGAAIAASLTIPSGSTRSVTFSLAWDCPEVKFFE--KVYHRRYTK 381
              ND+T  S+    +  A++  + +  G+T  + F+L WD P+V F +  K YHR YTK
Sbjct: 327 EKSNDETLKSK---DVAVAVSGQILVQPGTTSQLEFALVWDMPKVHFQKRGKEYHRYYTK 383

Query: 382 FYGTLGDSAARIAHDAILEHAKWECEIEAWQRPILEDKRFPEWYPITLFNELYYLNAGGT 441
           ++G  GD+  +I+  A+  + KWE  I+ WQRPILED   P+WY   +FNELY++  GG+
Sbjct: 384 YFGKSGDAGPQISDYALNNYGKWERLIDEWQRPILEDADLPDWYKSAIFNELYFIADGGS 443

Query: 442 IWTDGSPPMQSLATIRERKFSLDTPSKIHTPASSDTALGTRLLENGEENIGQFLYLEGAE 501
           +W                 F++D  + +               ++     G++ YLEG E
Sbjct: 444 VW-----------------FTMDDQTDLP-------------FDDPRLAYGRYSYLEGHE 473

Query: 502 YVMYNTYDVHFYSSFALVMLFPKLELSIQRDFAAAVMMHDPGTMKIMSDGKWVARKCLGA 561
           Y MY TYDVHFY+S AL  L+P L++SIQ D+  ++        K + DGK + RK   +
Sbjct: 474 YRMYTTYDVHFYASHALASLWPNLQVSIQYDYKDSIEREITEGRKHLYDGKVIPRKIKNS 533

Query: 562 VPHDIG--LDDPWFEINSYNLFNSSRWKDLNSKFVLQVYRDF 601
           VPHD+G   ++P+  IN+Y + + S W+DLN KF+LQVYRD+
Sbjct: 534 VPHDLGDPAEEPFDLINAYPIHDVSEWRDLNLKFILQVYRDY 575



 Score =  213 bits (543), Expect = 3e-52,   Method: Compositional matrix adjust.
 Identities = 111/263 (42%), Positives = 165/263 (62%), Gaps = 11/263 (4%)

Query: 609 FARAVWPSVYIAMAYMEQFDKDGDGMIENEGFPDQTYDAWSANGVSAYCGGLWVAALQAA 668
           + +A++P+  + + +  ++DKDGDG+IEN   PDQTYD W  +G SAYCGGLW+A+L   
Sbjct: 653 YLKAMYPACRVVLEHSLEWDKDGDGLIENSKAPDQTYDTWVMDGPSAYCGGLWLASLHCM 712

Query: 669 SALANDVGDHASASYFWVRYQKAKAVY-DSLWNGSYFNYDNSDGSSSTSIQADQLAGQWY 727
           SA+A+ +  +  +  +     K +A + + LWNG+Y+ +D +  +S  SI +DQL G WY
Sbjct: 713 SAMASLLDQNEDSDRYKAVLDKGRASFEEKLWNGTYYRFD-AQSASKNSIMSDQLCGHWY 771

Query: 728 ARACGL-LPIADEAKVKKALTKIYDFNVLKVKGGMCGAMNGMQP------DGRIDMSGLQ 780
            R+CG    +  +  V+ A+  IY+ NV++  GG  GA+NG  P      DGR D   +Q
Sbjct: 772 LRSCGFDYDVFPKENVRLAMRTIYENNVMRFCGGQLGAVNGYVPSGQPNKDGRPDTVSIQ 831

Query: 781 AREIWPGVTYGLAASMIQEEMVDMAFQTAAGVYEVAWSEDGLGYSFQTPESWNNNDEYRS 840
             E+W GVTY LA++MI E M + AF+TA G+Y+ A SE  +G +F+TPE+      YR+
Sbjct: 832 GEEVWTGVTYALASTMIHEGMFEEAFKTAGGLYQ-ALSER-IGMNFETPEAVYAERHYRA 889

Query: 841 LCYMRPLTIWAMQWALTKPKLSR 863
           + YMRPL+IW+MQ A    KL R
Sbjct: 890 IGYMRPLSIWSMQTAWEMKKLRR 912


>gi|352096901|ref|ZP_08957615.1| Glucosylceramidase [Synechococcus sp. WH 8016]
 gi|351675485|gb|EHA58646.1| Glucosylceramidase [Synechococcus sp. WH 8016]
          Length = 833

 Score =  322 bits (825), Expect = 6e-85,   Method: Compositional matrix adjust.
 Identities = 257/823 (31%), Positives = 394/823 (47%), Gaps = 112/823 (13%)

Query: 87  QGIPLGGIGAGSIGRSYRGEFQRFKLFHGICDDAPVLANQFSVFVSRPNGEKFSSVLCPR 146
            G+PLGG GAG IGRS RG+F  + L  G      +   QF+++  + +  +  ++    
Sbjct: 55  HGMPLGGFGAGCIGRSSRGDFNLWHLDGGEHWYGSIPDCQFALWEKQDDQVRVHALATEP 114

Query: 147 SPGVPKKNTDSGIESWDW----NLKGENCTYHALFPRAWTVYDGEPDPELRIVCRQISPF 202
           +    +  +   + SW W      +    TY A +P +W  Y+G    E  + C   SP 
Sbjct: 115 TRDDSRPESGKPLSSWQWYPASTEESSTGTYAARYPLSWNHYEGVFRAE--VSCEAFSPI 172

Query: 203 IPHNYKESSFPASVFTFTLSNSGQTSADVTLLFTWANSVA-------------GDSGLSG 249
           +P +Y+ SS+P +VF +TL+N      +V+LL +W N+V               D G   
Sbjct: 173 LPGDYQRSSYPVAVFRWTLTNPTTKPLEVSLLLSWRNTVGWFTNTDSSAEVHFRDDGSPE 232

Query: 250 HHF----------NSKTMTKDGVHGLTLHHR--TANGRPPVTFAVAAEETAD-VHVSECP 296
           H++          +++ +  DG+ G+ L  +  T        + +A  +  D V +  C 
Sbjct: 233 HNYAPAIGVGEGQSNRWIDGDGLCGVLLDGKRSTPLAEGEGQWCLALPDRLDGVELMRCS 292

Query: 297 CFLLSGNSKGITAKDMWNEIKKHGSFDHLDNDKTSPSEPGSSIGAAIAASLTIPSGSTRS 356
            +  SG+       ++W      G     +ND+ S    G    AAIA  LT+  G TR 
Sbjct: 293 RWDPSGD-----GAELWQPFAADGLIPDSNNDRAS--RAGEHASAAIAVKLTLAPGETRE 345

Query: 357 VTFSLAWDCPEVKFFEKVYH-RRYTKFYGTLGDSAARIAHDAILEHAKWECEIEAWQRPI 415
           +  +++WD P   F   V   RRY+ FYG  G  AA IA +A+ +   W  +IEAWQ P+
Sbjct: 346 IPVAISWDLPVTSFATGVRDLRRYSDFYGADGCHAAAIAAEALRDWRSWHEQIEAWQAPV 405

Query: 416 LEDKRFPEWYPITLFNELYYLNAGGTIWTDGSPPMQSLATIRERKFSLDTPSKIHTPASS 475
           L  K  PE   + LFNELY L +GG++WT         AT ++                 
Sbjct: 406 LARKELPEELRMALFNELYDLASGGSLWT--------AATTKD----------------- 440

Query: 476 DTALGTRLLENGEENIGQFLYLEGAEYVMYNTYDVHFYSSFALVMLFPKLELSIQRDFAA 535
                           G+F  LE  +Y  Y + DV  Y SFAL+ L+P+L+ ++ R FA 
Sbjct: 441 --------------PYGRFGVLECLDYAWYESLDVRLYGSFALLQLWPELDKAVIRSFAR 486

Query: 536 AVMMHDPGTMKI-----MSDGKWVA-RKCLGAVPHDIGL-DDPWFEINSYNLF-NSSRWK 587
           A+   D     I        G+  A RK  GA PHD+G  ++  F+  +Y  + + + WK
Sbjct: 487 AIPAADATQRPIGWYFTQGRGRVEADRKVKGATPHDLGAPNEVPFDATNYTAYQDCNLWK 546

Query: 588 DLNSKFVLQVYRDF--VATG-DKNFARAVWPSVYIAMAYMEQFDKDGDGMIENEGFPDQT 644
           DL S +VLQV+R F    TG D +F    WP+   A+ Y++QFD + DG+ +N G PDQT
Sbjct: 547 DLASDYVLQVWRTFKLAPTGEDLSFLAECWPAAVQALHYLKQFDVNDDGLPDNGGAPDQT 606

Query: 645 YDAWSANGVSAYCGGLWVAALQAASA--------LANDVGDHASASYFWVRYQKAKAVYD 696
           +D W   GVSAYCG LW+AAL+AA A        L  D GD       W+  ++++A +D
Sbjct: 607 FDDWPLKGVSAYCGALWIAALEAALAMAQRLQLELGLDTGDEQHTFSGWL--EQSRANFD 664

Query: 697 S-LWNGSYFNYDNSDGSSSTSIQADQLAGQWYARACGLLPIADEAKVKKALTKIYDFNVL 755
           + LWNG Y++ D   G+    + ADQL G +YAR   L P+  EA  +  L  + +    
Sbjct: 665 TLLWNGEYYDIDAESGTP--VVMADQLCGDFYARLLELPPVVSEANSRSTLKAVKEACFD 722

Query: 756 KVKGGMCGAMNGMQPDGR-IDMSGLQAREIWPGVTYGLAA--SMIQEEMVDMAFQTAAGV 812
              GG+ G  NG++ DG  +D +G    E+W G+ +G+A+   ++ E     A  +A   
Sbjct: 723 NFAGGLLGVANGLRRDGTPLDPNGTHPLEVWTGINFGIASYYRLMGEGPTAEAICSAVVT 782

Query: 813 YEVAWSEDGLGYSFQTPESWNNNDEYRSLCYMRPLTIWAMQWA 855
              A      G  F+TPE+    + +R+  Y+R + IW + WA
Sbjct: 783 QVYA-----GGLQFRTPEAITAVNTFRACHYLRAMAIWGL-WA 819


>gi|260434763|ref|ZP_05788733.1| conserved hypothetical protein [Synechococcus sp. WH 8109]
 gi|260412637|gb|EEX05933.1| conserved hypothetical protein [Synechococcus sp. WH 8109]
          Length = 832

 Score =  321 bits (822), Expect = 1e-84,   Method: Compositional matrix adjust.
 Identities = 241/829 (29%), Positives = 385/829 (46%), Gaps = 123/829 (14%)

Query: 86  DQGIPLGGIGAGSIGRSYRGEFQRFKL-----FHGICDDAPVLANQFSVFVSRPNGEKFS 140
           + G+PLGG GAG  GR+  G    + L     + G+  D      QF++F     G++  
Sbjct: 54  NHGMPLGGFGAGCFGRAPDGNINLWHLDGGEHWFGVLPDC-----QFALFEGNGRGKRAH 108

Query: 141 SVLCPRSPGVPKKNTDSGIESWDW----NLKGENCTYHALFPRAWTVYDGEPDPELRIVC 196
           ++         ++     +++W+W           TY A +P +WT Y+G  D E+R  C
Sbjct: 109 ALAVQPDKDASRQEAGQPLKAWEWYPASTPDRSTGTYAARYPFSWTSYEGVYDAEVR--C 166

Query: 197 RQISPFIPHNYKESSFPASVFTFTLSNSGQTSADVTLLFTWANSVA-------------G 243
              SP +P +Y+ +S+P +VF +TL N      D++LL  W N+                
Sbjct: 167 EAFSPILPGDYQRTSYPVAVFIWTLRNPTDQPLDLSLLLCWRNTTGWFTNTDASAEVHFR 226

Query: 244 DSGLSGHHF----------NSKTMTKDGVHGLTLHHRTANGRPPVT-----FAVAAEETA 288
           D G   H++           ++ +    + G+ L    +N   PV      + +AA E  
Sbjct: 227 DDGSPEHNYAPAIGSTAGQRNRCIDYGSLKGVVLEGNASN---PVAEGEGQWCIAAAEQP 283

Query: 289 DVHVSECPCFLLSGNSKGITAKDMWNEIKKHGSFDHLDNDKTSPSEPGSSIGAAIAASLT 348
            V +  C  +  SG+      +++W+     GS    +ND+ S S+    + AA+A    
Sbjct: 284 GVTIQRCSRWNPSGD-----GRELWDSFSADGSIPESNNDRRSGSD--DPLSAALAVQCQ 336

Query: 349 IPSGSTRSVTFSLAWDCPEVKFFEKVYH-RRYTKFYGTLGDSAARIAHDAILEHAKWECE 407
           +  G +  +   ++WD P   F       RRYT F+    + A  IA +++ +   W  +
Sbjct: 337 LAPGQSIEIPLVISWDLPVTGFATGSQALRRYTDFFAADANQAVAIAAESLRDWRSWRQK 396

Query: 408 IEAWQRPILEDKRFPEWYPITLFNELYYLNAGGTIWTDGSPPMQSLATIRERKFSLDTPS 467
           I+AWQ P+L+ +  PE   + LFNELY L +GG++W+  SP                   
Sbjct: 397 IDAWQEPVLQRQDLPEPLRMALFNELYDLCSGGSLWSAASP------------------- 437

Query: 468 KIHTPASSDTALGTRLLENGEENIGQFLYLEGAEYVMYNTYDVHFYSSFALVMLFPKLEL 527
                               E+  G+F  LE  +Y  Y + DV  Y S AL+ L+P+L+ 
Sbjct: 438 --------------------EDPYGRFGVLECLDYAWYESLDVRLYGSLALLQLWPELDK 477

Query: 528 SIQRDFAAAVMMHDPGTMKI---MSDGKW---VARKCLGAVPHDIGLDD--PWFEINSYN 579
           ++ R FA A+   D     I    + GK      RK  GA PHD+G  +  PW   N   
Sbjct: 478 AVLRSFARAIPAADATQRPIGWYFTQGKGRVEADRKVKGATPHDLGAPNEIPWDATNYTA 537

Query: 580 LFNSSRWKDLNSKFVLQVYRDF--VATG-DKNFARAVWPSVYIAMAYMEQFDKDGDGMIE 636
             + + WKDL S FVLQV+R F    +G D  F    WP+   A+ +++ FD + DG+ +
Sbjct: 538 YQDCNLWKDLGSDFVLQVWRSFKLAPSGEDIRFLADCWPAAVEALRFLKTFDVNNDGLPD 597

Query: 637 NEGFPDQTYDAWSANGVSAYCGGLWVAALQAASA------LANDVGDHASASYFWVRYQK 690
           N G PDQT+D W   GVSAYCG LW+AAL+AA A      L+  +   A    F    ++
Sbjct: 598 NGGAPDQTFDDWPLKGVSAYCGALWIAALEAALAIAQTLQLSTGLDTSAEQREFSGWLEQ 657

Query: 691 AKAVYDS-LWNGSYFNYDNSDGSSSTSIQADQLAGQWYARACGLLPIADEAKVKKALTKI 749
           ++  +D  LWNG Y++ D   G+    + ADQL G +YAR  GL P+  ++  +  L  +
Sbjct: 658 SRGNFDKLLWNGEYYDIDAESGTP--VVMADQLCGDFYARLLGLPPVVSDSNSRSTLKVV 715

Query: 750 YDFNVLKVKGGMCGAMNGMQPDGR-IDMSGLQAREIWPGVTYGLAA--SMIQEEMVDMAF 806
            +       GG  G  NG++ DG  +D +G    E+W G+ +G+A+   ++ ++    A 
Sbjct: 716 KEACFEAFDGGSLGVANGLRRDGTPLDPNGTHPLEVWTGINFGIASYYRLMGDKQTAQAI 775

Query: 807 QTAAGVYEVAWSEDGLGYSFQTPESWNNNDEYRSLCYMRPLTIWAMQWA 855
            +A  V E  +S    G  F+TPE+    + +R+  Y+R + IW   WA
Sbjct: 776 CSA--VVEQVYSG---GLQFRTPEAITAVNTFRACHYLRAMAIWGF-WA 818


>gi|78212458|ref|YP_381237.1| hypothetical protein Syncc9605_0920 [Synechococcus sp. CC9605]
 gi|78196917|gb|ABB34682.1| conserved hypothetical protein [Synechococcus sp. CC9605]
          Length = 832

 Score =  320 bits (819), Expect = 3e-84,   Method: Compositional matrix adjust.
 Identities = 240/824 (29%), Positives = 373/824 (45%), Gaps = 113/824 (13%)

Query: 86  DQGIPLGGIGAGSIGRSYRGEFQRFKL-----FHGICDDAPVLANQFSVFVSRPNGEKFS 140
           + G+PLGG GAG  GR+  G    + L     + G+  D      QF++F     G++  
Sbjct: 54  NHGMPLGGFGAGCFGRAPDGNINLWHLDGGEHWFGVLPDC-----QFALFEGNGRGKRAH 108

Query: 141 SVLCPRSPGVPKKNTDSGIESWDW----NLKGENCTYHALFPRAWTVYDGEPDPELRIVC 196
           ++         +      ++ W+W           TY A +P +WT Y+G  D E+R  C
Sbjct: 109 ALAVQPEQDASRPEAGQPLKVWEWYPASTPDRSTGTYAARYPLSWTSYEGVYDAEVR--C 166

Query: 197 RQISPFIPHNYKESSFPASVFTFTLSNSGQTSADVTLLFTWANSVA-------------G 243
              SP +P +Y+ +S+P  VF +TL N      D++LL +W N+                
Sbjct: 167 EAFSPILPGDYQRTSYPVVVFVWTLRNPTDQPLDLSLLLSWRNTTGWFTNTDASAEVHFR 226

Query: 244 DSGLSGHHFNSKTMTKDGVHGLTLHHRTANG-------RPPVT-----FAVAAEETADVH 291
           D G   H++     +  G     +   +  G         PV      + +A  E   V 
Sbjct: 227 DDGSPEHNYAPAIGSTAGQRNCCIDDGSLKGVLLEGNVSDPVAEGEGQWCIATAEQPGVT 286

Query: 292 VSECPCFLLSGNSKGITAKDMWNEIKKHGSFDHLDNDKTSPSEPGSSIGAAIAASLTIPS 351
           +  C  +  SG+      +++W+     GS    +ND+ S S+    + AA+A    +  
Sbjct: 287 IQRCSRWNPSGD-----GRELWDSFSADGSIPESNNDRRSGSD--DPLSAALAVQCQLAP 339

Query: 352 GSTRSVTFSLAWDCPEVKFFEKVYH-RRYTKFYGTLGDSAARIAHDAILEHAKWECEIEA 410
           G +  +   ++WD P   F       RRYT F+    + A  IA +A+ +  +W  +IEA
Sbjct: 340 GQSIEIPLVISWDLPVTAFATGSQALRRYTDFFAADANQAVAIAAEALRDWRRWRQQIEA 399

Query: 411 WQRPILEDKRFPEWYPITLFNELYYLNAGGTIWTDGSPPMQSLATIRERKFSLDTPSKIH 470
           WQ+P+L+ +  P+   + LFNELY L +GG++W+  SP                      
Sbjct: 400 WQQPVLQRQDLPKPLRMALFNELYDLCSGGSLWSAASP---------------------- 437

Query: 471 TPASSDTALGTRLLENGEENIGQFLYLEGAEYVMYNTYDVHFYSSFALVMLFPKLELSIQ 530
                            E+  G+F  LE  +Y  Y + DV  Y S AL+ L+P+L+ ++ 
Sbjct: 438 -----------------EDPYGRFGVLECLDYAWYESLDVRLYGSLALLQLWPELDKAVL 480

Query: 531 RDFAAAVMMHDPGTMKI---MSDGKW---VARKCLGAVPHDIGLDD--PWFEINSYNLFN 582
           R FA A+   D     I    + GK      RK  GA PHD+G  +  PW   N     +
Sbjct: 481 RSFARAIPAADATQRPIGWYFTQGKGRVEADRKVKGATPHDLGAPNEIPWDATNYTAYQD 540

Query: 583 SSRWKDLNSKFVLQVYRDF--VATG-DKNFARAVWPSVYIAMAYMEQFDKDGDGMIENEG 639
            + WKDL S FVLQV+R F    +G D  F    WP+   A+ Y++ FD + DG+ +N G
Sbjct: 541 CNLWKDLGSDFVLQVWRTFKLAPSGEDIRFLADCWPAAVEALRYLKTFDVNNDGLPDNGG 600

Query: 640 FPDQTYDAWSANGVSAYCGGLWVAALQAASA------LANDVGDHASASYFWVRYQKAKA 693
            PDQT+D W   GVSAYCG LW+AAL+AA A      L+  +   A    F    ++++ 
Sbjct: 601 APDQTFDDWPLKGVSAYCGALWIAALEAALAIAQTLQLSTGLDTSAEQREFSGWLEQSRG 660

Query: 694 VYDS-LWNGSYFNYDNSDGSSSTSIQADQLAGQWYARACGLLPIADEAKVKKALTKIYDF 752
            +D  LWNG Y++ D   G+    + ADQL G +YAR   L P+  +A     L  + + 
Sbjct: 661 NFDKLLWNGEYYDIDAESGTP--VVMADQLCGDFYARLLALPPVVSDANSLSTLKAVKEA 718

Query: 753 NVLKVKGGMCGAMNGMQPDGR-IDMSGLQAREIWPGVTYGLAASMIQEEMVDMAFQTAAG 811
                 GG  G  NG++ DG  +D +     E+W G+ +G+A+          A    + 
Sbjct: 719 CFEAFDGGSLGVANGLRRDGTPLDPNATHPLEVWTGINFGIASYYRLMGDKQTAQSICSA 778

Query: 812 VYEVAWSEDGLGYSFQTPESWNNNDEYRSLCYMRPLTIWAMQWA 855
           V E  +S    G  F+TPE+    + YR+  Y+R + IW + WA
Sbjct: 779 VVEQVYSS---GLQFRTPEAITAVNTYRACHYLRAMAIWGL-WA 818


>gi|78185056|ref|YP_377491.1| hypothetical protein Syncc9902_1489 [Synechococcus sp. CC9902]
 gi|78169350|gb|ABB26447.1| conserved hypothetical protein [Synechococcus sp. CC9902]
          Length = 823

 Score =  318 bits (815), Expect = 9e-84,   Method: Compositional matrix adjust.
 Identities = 249/823 (30%), Positives = 388/823 (47%), Gaps = 110/823 (13%)

Query: 86  DQGIPLGGIGAGSIGRSYRGEFQRFKLFHGICDDAPVLANQFSVFVSRPNGEKFSSVLCP 145
           + G+PLGG+GAG  GR+  G    + L  G      +   QFS++  + +  +  ++   
Sbjct: 44  NHGMPLGGLGAGCFGRAPDGNTNLWHLDGGEHWFGTLPDCQFSLWEKQGDDVRAHALAVA 103

Query: 146 RSPGVPKKNTDSGIESWDWNLKGEN--CT--YHALFPRAWTVYDGEPDPELR--IVCRQI 199
                 + +  + + +W W     N  CT  Y A +P +WT YDG      R  ++C   
Sbjct: 104 PEGDASRPDAAASLPAWSWYPASTNKRCTGTYAARYPLSWTHYDGV----FRSGVLCEAF 159

Query: 200 SPFIPHNYKESSFPASVFTFTLSNSGQTSADVTLLFTWANSVA-------------GDSG 246
           SP +P +Y+ SS+P +VF + L+N      +++L+ +W N+V               D G
Sbjct: 160 SPILPGDYQRSSYPVAVFCWQLANPTDQPLELSLMVSWRNTVGWFTNTDASAEVHFRDDG 219

Query: 247 LSGHHFN----------SKTMTKDGVHGLTLHHRTAN----GRPPVTFAVAAEETADVHV 292
              H+++          ++  ++ G+ G+ +  R++     G      AV  ++ A V V
Sbjct: 220 SPEHNYSPAIGRGAGQRNRAHSESGLTGVLMEGRSSEPIQEGEGQWCLAVP-DDLAGVEV 278

Query: 293 SECPCFLLSGNSKGITAKDMWNEIKKHGSFDHLDNDKTSPSEPGSSIGAAIAASLTIPSG 352
             C  +  SG+       D+W      G     DND+ S    G    AA+     +  G
Sbjct: 279 LRCSRWDPSGD-----GSDIWTPFSAEGRIPESDNDRES--RAGEQASAAMVVKFILAPG 331

Query: 353 STRSVTFSLAWDCPEVKFFEKVY-HRRYTKFYGTLGDSAARIAHDAILEHAKWECEIEAW 411
            +R +   ++WD P   F       RRYT F+G+ G +AA IA +A+ + + W   IEAW
Sbjct: 332 ESREIPVVISWDLPITAFASGCADRRRYTDFFGSDGRNAAAIAAEALRDWSDWRGRIEAW 391

Query: 412 QRPILEDKRFPEWYPITLFNELYYLNAGGTIWTDGSPPMQSLATIRERKFSLDTPSKIHT 471
           Q+P+LE K  PE   + LFNELY L +GG++WT  +                        
Sbjct: 392 QQPVLERKALPEPVRMALFNELYDLCSGGSLWTAAT------------------------ 427

Query: 472 PASSDTALGTRLLENGEENIGQFLYLEGAEYVMYNTYDVHFYSSFALVMLFPKLELSIQR 531
                           ++  G+F  LE  +Y  Y + DV  Y S AL+ L+P+L+ ++ R
Sbjct: 428 ---------------AQDPYGRFGVLECLDYAWYESLDVRLYGSLALLQLWPELDKAVLR 472

Query: 532 DFAAAVMMHDPGTMKI-----MSDGKWVA-RKCLGAVPHDIGLDDPW-FEINSYNLF-NS 583
            F+ A+  HD     I        G+  A RK  GA PHD+G  +   F+  +Y  + + 
Sbjct: 473 SFSRAIPAHDATQRPIGWYFTQGRGRVEADRKVAGATPHDLGAPNELPFDATNYTAYQDC 532

Query: 584 SRWKDLNSKFVLQVYRDF--VATG-DKNFARAVWPSVYIAMAYMEQFDKDGDGMIENEGF 640
           + WKDL S FVLQV+R F    TG D NF    WPS   A+ Y++ FD + DG+ +N G 
Sbjct: 533 NLWKDLASDFVLQVWRTFQLAPTGEDLNFLADCWPSAVRALDYLKGFDVNDDGLPDNGGA 592

Query: 641 PDQTYDAWSANGVSAYCGGLWVAALQAASA----LANDVGDHASASY--FWVRYQKAKAV 694
           PDQT+D W   GVSAYCG LW+AAL+AA A    L  D+G   +     F    +++++ 
Sbjct: 593 PDQTFDDWPLKGVSAYCGALWIAALEAALAMAQRLQLDLGLDTTVEQRRFGQWLEQSRSN 652

Query: 695 YDS-LWNGSYFNYDNSDGSSSTSIQADQLAGQWYARACGLLPIADEAKVKKALTKIYDFN 753
           +D  LWNG Y+  D   G+    + ADQL G +YAR  GL  +  +A     L  + D  
Sbjct: 653 FDRLLWNGEYYEIDAESGTP--VVMADQLCGDFYARLLGLPSVVSDANALSTLNVVRDAC 710

Query: 754 VLKVKGGMCGAMNGMQPDGR-IDMSGLQAREIWPGVTYGLAASMIQEEMVDMAFQTAAGV 812
             + +GG  G  NG++ DG  +D +G    E+W G+ +G+A+        D A      V
Sbjct: 711 FERFEGGTLGVANGLRRDGTPLDPNGTHPLEVWTGINFGIASYFRLMGQSDTALAITGAV 770

Query: 813 YEVAWSEDGLGYSFQTPESWNNNDEYRSLCYMRPLTIWAMQWA 855
               +S    G  F+TPE+    + +R+  Y+R + IW + WA
Sbjct: 771 VNQVYSG---GLQFRTPEAITAVNTFRACHYLRAMAIWGL-WA 809


>gi|115633999|ref|XP_780575.2| PREDICTED: non-lysosomal glucosylceramidase-like, partial
           [Strongylocentrotus purpuratus]
          Length = 597

 Score =  314 bits (804), Expect = 2e-82,   Method: Compositional matrix adjust.
 Identities = 183/492 (37%), Positives = 253/492 (51%), Gaps = 42/492 (8%)

Query: 70  GKVPVFNVFRKHHITSDQGIPLGGIGAGSIGRSYRGEFQRFKLFHGICDDAPVLANQFSV 129
           GK P  +V          G+P+GGIG G+I R ++G+F R+ L  G      V  NQF+V
Sbjct: 136 GKRPFIDVLHPMKFKPVYGVPVGGIGCGAIDRGWKGDFCRWSLSPGRYTYDNVELNQFTV 195

Query: 130 FVSRPNGEKFSSVLCPRSPGVPKKNTDSGIESWDWNLKGENCTYHALFPRAWTVYDGEPD 189
            V R     +  VL P  P     N D   ++W+W   G   +YH L+PRAWT Y   P 
Sbjct: 196 CVRRGGKTVYQQVLSPNKP----HNKDLA-QAWEWQFPGNRASYHGLYPRAWTKY-LLPG 249

Query: 190 PELRIVCRQISPFIPHNYKESSFPASVFTFTLSNSGQTSADVTLLFTWANSVAGDSGLSG 249
             + +VCRQISP  PHNY+E+S P  VF + + N G  +ADV+++FTW N     +   G
Sbjct: 250 QNITLVCRQISPVFPHNYQETSLPVGVFVWDVYNHGTAAADVSIMFTWKNGTGTSADKKG 309

Query: 250 HHFNSKTMTKDGVHGLTLHHRTANGRP--PVTFAVAAEETADVHVSECPCFLLSGNSKGI 307
             +N+   +     G+  H    +G P  P T  +AA +  DV V+ C  F  S      
Sbjct: 310 GRWNTG-FSSASTSGVMFH----DGHPTQPCTMCIAAADGDDVTVTTCASFNPSSQ---- 360

Query: 308 TAKDMWNEIKKHGSFDHLDNDKTSPSEPGSSIGAAIAASLTIPSGSTRSVTFSLAWDCPE 367
             K++WN++K+ G  +      T  +E G ++  A+ +S  I       + F+L+WD P+
Sbjct: 361 -LKELWNDLKEDGQLN-TTKGPTERTEKGETLAGAVVSSRHIEPNQQGLLEFALSWDMPK 418

Query: 368 VKFFEKVYHRRYTKFYGTLGDSAARIAHDAILEHAKWECEIEAWQRPILEDKRFPEWYPI 427
           V+F    Y RRYT+++G  GDSA  +   A+ E+  WE  I  WQ PILE+ R P WY  
Sbjct: 419 VQFDSCSYFRRYTRWFGRDGDSALALCSHALEEYGDWEKNINEWQNPILENDRLPAWYKS 478

Query: 428 TLFNELYYLNAGGTIWTDGSPPMQSLATIRERKFSLDTPSKIHTPASSDTALGTRLLENG 487
            LFNELY++  GG IW D                  +  +  HT ASS            
Sbjct: 479 ALFNELYFMTDGGGIWVDVDDEYP------------NGENNAHTRASSLV---------- 516

Query: 488 EENIGQFLYLEGAEYVMYNTYDVHFYSSFALVMLFPKLELSIQRDFAAAVMMHDPGTMKI 547
               G+F YLEG EY MYNTYDVHFY+SFAL+ML+PKLELS+Q DF   V   D  + + 
Sbjct: 517 -REYGRFGYLEGHEYRMYNTYDVHFYASFALIMLWPKLELSMQYDFGEYVDHKDEDSFQD 575

Query: 548 MSDGKWVARKCL 559
           +S G    R+ L
Sbjct: 576 LSSGLRSIRRVL 587


>gi|390344694|ref|XP_001200450.2| PREDICTED: uncharacterized protein LOC764213 [Strongylocentrotus
           purpuratus]
          Length = 1283

 Score =  313 bits (803), Expect = 2e-82,   Method: Compositional matrix adjust.
 Identities = 183/492 (37%), Positives = 253/492 (51%), Gaps = 42/492 (8%)

Query: 70  GKVPVFNVFRKHHITSDQGIPLGGIGAGSIGRSYRGEFQRFKLFHGICDDAPVLANQFSV 129
           GK P  +V          G+P+GGIG G+I R ++G+F R+ L  G      V  NQF+V
Sbjct: 136 GKRPFIDVLHPMKFKPVYGVPVGGIGCGAIDRGWKGDFCRWSLSPGRYTYDNVELNQFTV 195

Query: 130 FVSRPNGEKFSSVLCPRSPGVPKKNTDSGIESWDWNLKGENCTYHALFPRAWTVYDGEPD 189
            V R     +  VL P  P     N D   ++W+W   G   +YH L+PRAWT Y   P 
Sbjct: 196 CVRRGGKTVYQQVLSPNKP----HNKDLA-QAWEWQFPGNRASYHGLYPRAWTKY-LLPG 249

Query: 190 PELRIVCRQISPFIPHNYKESSFPASVFTFTLSNSGQTSADVTLLFTWANSVAGDSGLSG 249
             + +VCRQISP  PHNY+E+S P  VF + + N G  +ADV+++FTW N     +   G
Sbjct: 250 QNITLVCRQISPVFPHNYQETSLPVGVFVWDVYNHGTAAADVSIMFTWKNGTGTSADKKG 309

Query: 250 HHFNSKTMTKDGVHGLTLHHRTANGRP--PVTFAVAAEETADVHVSECPCFLLSGNSKGI 307
             +N+   +     G+  H    +G P  P T  +AA +  DV V+ C  F  S      
Sbjct: 310 GRWNTG-FSSASTSGVMFH----DGHPTQPCTMCIAAADGDDVTVTTCASFNPSSQ---- 360

Query: 308 TAKDMWNEIKKHGSFDHLDNDKTSPSEPGSSIGAAIAASLTIPSGSTRSVTFSLAWDCPE 367
             K++WN++K+ G  +      T  +E G ++  A+ +S  I       + F+L+WD P+
Sbjct: 361 -LKELWNDLKEDGQLNTTKG-PTERTEKGETLAGAVVSSRHIEPNQQGLLEFALSWDMPK 418

Query: 368 VKFFEKVYHRRYTKFYGTLGDSAARIAHDAILEHAKWECEIEAWQRPILEDKRFPEWYPI 427
           V+F    Y RRYT+++G  GDSA  +   A+ E+  WE  I  WQ PILE+ R P WY  
Sbjct: 419 VQFDSCSYFRRYTRWFGRDGDSALALCSHALEEYGDWEKNINEWQNPILENDRLPAWYKS 478

Query: 428 TLFNELYYLNAGGTIWTDGSPPMQSLATIRERKFSLDTPSKIHTPASSDTALGTRLLENG 487
            LFNELY++  GG IW D                  +  +  HT ASS            
Sbjct: 479 ALFNELYFMTDGGGIWVDVDDEYP------------NGENNAHTRASSLV---------- 516

Query: 488 EENIGQFLYLEGAEYVMYNTYDVHFYSSFALVMLFPKLELSIQRDFAAAVMMHDPGTMKI 547
               G+F YLEG EY MYNTYDVHFY+SFAL+ML+PKLELS+Q DF   V   D  + + 
Sbjct: 517 -REYGRFGYLEGHEYRMYNTYDVHFYASFALIMLWPKLELSMQYDFGEYVDHKDEDSFQD 575

Query: 548 MSDGKWVARKCL 559
           +S G    R+ L
Sbjct: 576 LSSGLRSIRRVL 587


>gi|195436933|ref|XP_002066400.1| GK18271 [Drosophila willistoni]
 gi|194162485|gb|EDW77386.1| GK18271 [Drosophila willistoni]
          Length = 955

 Score =  313 bits (803), Expect = 2e-82,   Method: Compositional matrix adjust.
 Identities = 184/552 (33%), Positives = 286/552 (51%), Gaps = 68/552 (12%)

Query: 103 YRGEFQRFKLFHGICDDAPVLANQFSVFVSRPNG-EKFSSVLCPRSPGVPKKNTDSG--- 158
           Y GEF RF++  GI +   VLANQF V +    G   F S+L   S  +   +T++    
Sbjct: 137 YAGEFCRFQMRPGIYEYNTVLANQFIVTIKDQKGCTIFQSLLSKCSSIINNNDTEADEDE 196

Query: 159 -------------------IESWDWNLKGENCTYHALFPRAWTVYDGEPDPELRIVCRQI 199
                              + +W  N+    C+Y  L+PR+WT YD      +R++CRQI
Sbjct: 197 KSRTKCQLPNCSSSRSRQPLSTWHSNIDDSRCSYTGLYPRSWTEYDLS-HYGVRLICRQI 255

Query: 200 SPFIPHNYKESSFPASVFTFTLSNSGQTSADVTLLFTWANSVAGDSGLSGHHFNSKTMTK 259
           SP IPH+YKESS P +VF ++  N       V++ FT+ N        +     S+ + +
Sbjct: 256 SPVIPHDYKESSLPCAVFVWSAENVCDQERKVSITFTFKNGTGNKKQDAEGGAESQLINE 315

Query: 260 DGVHGLTLHHRTANGRPPVTFAVAAEETADVHVSECPCFLLSGNSKGITAKDMWNEIKKH 319
               G+ +  + A+   P ++ +A     D+ ++ CP F  +GN +      +W ++K+H
Sbjct: 316 GHAKGVAIRQKIADM--PCSYNLACRVLPDISITRCPQFDPAGNGEA-----LWAQLKEH 368

Query: 320 GSFDHLDNDKTSPSEPGSSIGAAIAASLTIPSGSTRSVTFSLAWDCPEVKFFEKV--YHR 377
           G    L ++ T  +     IG A+ A + +   ++  + F LAWD P+++F  K+  + R
Sbjct: 369 G---QLSDEPTQETLKTKDIGVAVCAHIALKPQASHELEFVLAWDMPKIQFPRKLKTHTR 425

Query: 378 RYTKFYGTLGDSAARIAHDAILEHAKWECEIEAWQRPILEDKRFPEWYPITLFNELYYLN 437
            YTK++   G++  RI   A+  +  WE  I+AWQRPIL D+  PEWY   +FN+LY+++
Sbjct: 426 YYTKYFDDSGEAGPRICEYALKHYQSWERLIDAWQRPILNDEGLPEWYKCAIFNQLYFIS 485

Query: 438 AGGTIWTDGSPPMQSLATIRERKFSLDTPSKIHTPASSDTALGTRL-LENGEENIGQFLY 496
            GGTIW                                D++LG  L  ++     G+F Y
Sbjct: 486 DGGTIWL-----------------------------KCDSSLGKELAYDDPRLAYGRFGY 516

Query: 497 LEGAEYVMYNTYDVHFYSSFALVMLFPKLELSIQRDFAAAVMMHDPGTMKIMSDGKWVAR 556
           LEG EY MYNTYDVHFY+S AL  L+P L++S+Q DF  A+      T K++ DGK + R
Sbjct: 517 LEGHEYRMYNTYDVHFYASPALAHLWPNLQVSLQFDFKDAIAAELNDTRKMLYDGKIMPR 576

Query: 557 KCLGAVPHDIGLDD--PWFEINSYNLFNSSRWKDLNSKFVLQVYRDFVATGDKNFARAVW 614
           K    VPHD+G  D  P+  IN YN+ + + WKDLN+KFVLQVYRD+    +   A+A  
Sbjct: 577 KVKNCVPHDLGDPDEEPFTLINCYNIHDVNEWKDLNTKFVLQVYRDYYVLNELAQAQADN 636

Query: 615 PSVYIAMAYMEQ 626
            S + ++ ++++
Sbjct: 637 ASKFSSIEFIDK 648



 Score =  207 bits (528), Expect = 2e-50,   Method: Compositional matrix adjust.
 Identities = 113/264 (42%), Positives = 157/264 (59%), Gaps = 12/264 (4%)

Query: 609 FARAVWPSVYIAMAYMEQFDKDGDGMIENEGFPDQTYDAWSANGVSAYCGGLWVAALQAA 668
           + ++++ +    M    ++DKD DG+IEN   PDQTYD+W  +G SAYC GLW+A+LQA 
Sbjct: 694 YLKSMYAACKAIMERTIEYDKDNDGLIENTKMPDQTYDSWVMDGPSAYCSGLWLASLQAM 753

Query: 669 SALANDVGDHASASYFWVRYQKAK-AVYDSLWNGSYFNYDNSDGSSSTSIQADQLAGQWY 727
           SA+A  +        +    +K K ++ + LWNGSY+ +D S     T I ADQL G WY
Sbjct: 754 SAMATILDQPNDCLRYQDILEKGKHSLEEKLWNGSYYRFDLSHSHRDT-IMADQLCGHWY 812

Query: 728 ARACGL-LPIADEAKVKKALTKIYDFNVLKVKGGMCGAMNGM-----QPD--GRIDMSGL 779
            ++CG    I  +  V+ AL +IYD NV+    G  GA NG      +P   G +D S +
Sbjct: 813 LKSCGFDYEIYPKENVRTALKRIYDNNVMGFHDGNIGAANGFIANVSEPSKAGHVDNSNI 872

Query: 780 QAREIWPGVTYGLAASMIQEEMVDMAFQTAAGVYEVAWSEDGLGYSFQTPESWNNNDEYR 839
           QA E+WPGV Y LAA+MIQE M D AFQTA G+Y+       +G +F+TPE+      YR
Sbjct: 873 QAEEVWPGVVYALAATMIQEGMFDEAFQTAGGMYKTI--SQRIGMNFETPEALYGEKRYR 930

Query: 840 SLCYMRPLTIWAMQWALTKPKLSR 863
           S+ YMRPL+IW+MQ A  + +  R
Sbjct: 931 SIGYMRPLSIWSMQVAWERRRALR 954


>gi|116072378|ref|ZP_01469645.1| hypothetical protein BL107_11341 [Synechococcus sp. BL107]
 gi|116064900|gb|EAU70659.1| hypothetical protein BL107_11341 [Synechococcus sp. BL107]
          Length = 820

 Score =  312 bits (799), Expect = 6e-82,   Method: Compositional matrix adjust.
 Identities = 248/823 (30%), Positives = 386/823 (46%), Gaps = 110/823 (13%)

Query: 86  DQGIPLGGIGAGSIGRSYRGEFQRFKLFHGICDDAPVLANQFSVFVSRPNGEKFSSVLCP 145
           + G+PLGG GAG  GR+  G    + L  G      +   QFS++  + +  +  ++   
Sbjct: 41  NHGMPLGGFGAGCFGRAPDGNTNLWHLDGGEHWFGTLPDCQFSLWEKQGDDLRAHALAVS 100

Query: 146 RSPGVPKKNTDSGIESWDW-NLKGENC---TYHALFPRAWTVYDGEPDPELR--IVCRQI 199
                 + +  + + +W W     + C   TY A +P +WT YDG      R  ++C   
Sbjct: 101 PEGDASRPDAAASLPAWSWYPASTDQCSTGTYAARYPLSWTHYDGV----FRSGVLCEAF 156

Query: 200 SPFIPHNYKESSFPASVFTFTLSNSGQTSADVTLLFTWANSVA-------------GDSG 246
           SP +P +Y+ SS+P +VF + L+N      +++L+ +W N+V               D G
Sbjct: 157 SPILPGDYQRSSYPVAVFRWQLANPTDQPLELSLMVSWRNTVGWFTNTDAAAEVHFRDDG 216

Query: 247 LSGHHFN----------SKTMTKDGVHGLTLHHRT----ANGRPPVTFAVAAEETADVHV 292
              H+++          ++ + + GV G+ L  R+    A G      AV  ++   V V
Sbjct: 217 SPEHNYSPAIGRGEGQRNRAINEPGVTGVLLEGRSSEPIAEGEGQWCLAVP-DDLDGVDV 275

Query: 293 SECPCFLLSGNSKGITAKDMWNEIKKHGSFDHLDNDKTSPSEPGSSIGAAIAASLTIPSG 352
             C  +   G+       D+W      G     DND+ S    G    AA+    T+  G
Sbjct: 276 LRCSRWDPCGD-----GSDIWTPFAAEGRIPESDNDRES--RAGEQASAAMVVKFTLAPG 328

Query: 353 STRSVTFSLAWDCPEVKFFEKVY-HRRYTKFYGTLGDSAARIAHDAILEHAKWECEIEAW 411
            +R +   ++WD P   F       RRYT F+G+ G +AA IA +A+ + + W   IEAW
Sbjct: 329 ESREIPVVISWDLPITAFASGCADRRRYTDFFGSDGRNAAAIAAEALRDWSDWRGRIEAW 388

Query: 412 QRPILEDKRFPEWYPITLFNELYYLNAGGTIWTDGSPPMQSLATIRERKFSLDTPSKIHT 471
           Q+P+LE K  PE   + LFNELY L +GG++WT  +                        
Sbjct: 389 QQPVLERKALPEPVRMALFNELYDLCSGGSLWTAAT------------------------ 424

Query: 472 PASSDTALGTRLLENGEENIGQFLYLEGAEYVMYNTYDVHFYSSFALVMLFPKLELSIQR 531
                           ++  G+F  LE  +Y  Y + DV  Y SFAL+ L+P+L+ ++ R
Sbjct: 425 ---------------AKDPYGRFGVLECLDYAWYESLDVRLYGSFALLQLWPELDKAVLR 469

Query: 532 DFAAAVMMHDPGTMKI-----MSDGKWVA-RKCLGAVPHDIGL-DDPWFEINSYNLF-NS 583
            F+ A+   D     I        G+  A RK  GA PHD+G  ++  F+  +Y  + + 
Sbjct: 470 SFSRAIPARDATQRPIGWYFTQGRGRVEADRKVAGATPHDLGAPNEVPFDATNYTAYQDC 529

Query: 584 SRWKDLNSKFVLQVYRDF--VATG-DKNFARAVWPSVYIAMAYMEQFDKDGDGMIENEGF 640
           + WKDL S FVLQV+R F    TG D NF    WPS   A+ Y++ FD + DG+ +N G 
Sbjct: 530 NLWKDLASDFVLQVWRTFQLAPTGEDLNFLAECWPSAVQALDYLKGFDVNDDGLPDNGGA 589

Query: 641 PDQTYDAWSANGVSAYCGGLWVAALQAASA----LANDVGDHASASY--FWVRYQKAKAV 694
           PDQT+D W   GVSAYCG LW+AAL+AA A    L  D+G   +     F    +++++ 
Sbjct: 590 PDQTFDDWPLKGVSAYCGALWIAALEAALAMAQRLQLDLGLDTTVEQRRFGQWLEQSRSN 649

Query: 695 YDS-LWNGSYFNYDNSDGSSSTSIQADQLAGQWYARACGLLPIADEAKVKKALTKIYDFN 753
           +D  LWNG Y+  D   G+    + ADQL G +YAR  GL  +  +A     L  + +  
Sbjct: 650 FDRLLWNGEYYEIDAESGTP--VVMADQLCGDFYARLLGLPSVVSDANALSTLNVVREAC 707

Query: 754 VLKVKGGMCGAMNGMQPDGR-IDMSGLQAREIWPGVTYGLAASMIQEEMVDMAFQTAAGV 812
             + + G  G  NG++ DG  +D +G    E+W G+ +G+A+        D A      V
Sbjct: 708 FERFEAGTLGVANGLRRDGTPLDPNGTHPLEVWTGINFGIASYFRLMGQSDTALAITGAV 767

Query: 813 YEVAWSEDGLGYSFQTPESWNNNDEYRSLCYMRPLTIWAMQWA 855
               +S    G  F+TPE+    + +R+  Y+R + IW + WA
Sbjct: 768 VNQVYSG---GLQFRTPEAITAVNTFRACHYLRAMAIWGL-WA 806


>gi|195579186|ref|XP_002079443.1| GD23959 [Drosophila simulans]
 gi|194191452|gb|EDX05028.1| GD23959 [Drosophila simulans]
          Length = 947

 Score =  312 bits (799), Expect = 6e-82,   Method: Compositional matrix adjust.
 Identities = 182/549 (33%), Positives = 286/549 (52%), Gaps = 64/549 (11%)

Query: 103 YRGEFQRFKLFHGICDDAPVLANQFSVFVSRPNG-EKFSSVLCPRSPGVPKKNTDSG--- 158
           Y GEF RF++  GI +   V ANQF V +  P G   F S+L   S      + D+    
Sbjct: 131 YAGEFCRFQMRPGIYEYNVVQANQFIVTIKDPKGCTIFQSLLSKCSTRDKTSDPDADPDG 190

Query: 159 -----------------IESWDWNLKGENCTYHALFPRAWTVYDGEPDPELRIVCRQISP 201
                            + +W  N++   C+Y  L+PR+WT YD      +R+ CRQ+SP
Sbjct: 191 ERTKCQLPNCSSRAKQPLSAWHSNIEDSRCSYTGLYPRSWTEYDLS-HYGVRLTCRQVSP 249

Query: 202 FIPHNYKESSFPASVFTFTLSNSGQTSADVTLLFTWANSVAGDSGLSGHHFNSKTMTKDG 261
            IPH Y+ESS P +VF +++ N       V++ FT+ N        +     S+ +++  
Sbjct: 250 VIPHEYRESSQPCAVFVWSVENVCDQERKVSITFTFKNGTGNKKQDAEGGAESQLISEGN 309

Query: 262 VHGLTLHHRTANGRPPVTFAVAAEETADVHVSECPCFLLSGNSKGITAKDMWNEIKKHGS 321
             G+++  + A+   P ++ +A     ++ ++ CP F  +GN      + +W ++K+HG 
Sbjct: 310 AKGVSIRQKIADM--PCSYNLACRVLPEISITRCPQFDPAGN-----GEQLWAQLKEHG- 361

Query: 322 FDHLDNDKTSPSEPGSSIGAAIAASLTIPSGSTRSVTFSLAWDCPEVKFFEKV--YHRRY 379
              L    TS +     IG A+ A + +   ++  + F LAWD P+++F  K+  + R Y
Sbjct: 362 --QLSEQPTSEALKTKDIGVAVCAQVALKPMASHDLEFVLAWDMPKIQFPRKMQTHTRYY 419

Query: 380 TKFYGTLGDSAARIAHDAILEHAKWECEIEAWQRPILEDKRFPEWYPITLFNELYYLNAG 439
           TK++   GDS  RI   A+ +++ WE  I+AWQRPIL D+  P+WY   +FN+LY+++ G
Sbjct: 420 TKYFDDSGDSGPRICEYALRQYSTWERLIDAWQRPILNDETLPDWYKCAIFNQLYFISDG 479

Query: 440 GTIWTDGSPPMQSLATIRERKFSLDTPSKIHTPASSDTALGTRLLENGEENIGQFLYLEG 499
           GTIW      +        ++ S D P   +                     G+F YLEG
Sbjct: 480 GTIWLKCESSLG-------KELSYDDPRLAY---------------------GRFGYLEG 511

Query: 500 AEYVMYNTYDVHFYSSFALVMLFPKLELSIQRDFAAAVMMHDPGTMKIMSDGKWVARKCL 559
            EY MYNTYDVHFY+S AL  L+P L++S+Q DF  A+      T K++ DGK + RK  
Sbjct: 512 HEYRMYNTYDVHFYASPALAHLWPNLQVSLQYDFKDAIAAELNDTRKMLYDGKVMPRKVK 571

Query: 560 GAVPHDIGLDD--PWFEINSYNLFNSSRWKDLNSKFVLQVYRDFVATGDKNFARAVWPSV 617
             VPHD+G  D  P+  IN YN+ + + WKDLN+KFVLQVYRD+    +   A+A   S 
Sbjct: 572 NCVPHDLGDPDEEPFTLINCYNIHDVNDWKDLNTKFVLQVYRDYYVLNELAQAQADNASK 631

Query: 618 YIAMAYMEQ 626
           + ++ ++++
Sbjct: 632 FSSIEFIDK 640



 Score =  207 bits (526), Expect = 3e-50,   Method: Compositional matrix adjust.
 Identities = 115/268 (42%), Positives = 159/268 (59%), Gaps = 20/268 (7%)

Query: 609 FARAVWPSVYIAMAYMEQFDKDGDGMIENEGFPDQTYDAWSANGVSAYCGGLWVAALQAA 668
           + +A++ S    +    ++DKD DG+IEN   PDQTYD+W  +G SAYC GLW+AALQA 
Sbjct: 686 YLKAMYASCKAILERTIEYDKDNDGLIENTKMPDQTYDSWVMDGPSAYCSGLWLAALQAM 745

Query: 669 SALANDVGDHASASYFWVRYQK-----AKAVYDSLWNGSYFNYDNSDGSSSTSIQADQLA 723
           SA+A  + D  +     +RYQ       +++ + LWNGSY+ +D S     T I ADQL 
Sbjct: 746 SAMAT-ILDQPNDC---LRYQDILEKGKRSLEEKLWNGSYYRFDLSHSHRDT-IMADQLC 800

Query: 724 GQWYARACGL-LPIADEAKVKKALTKIYDFNVLKVKGGMCGAMNGMQPD-------GRID 775
           G WY ++CG    I  +  V+ AL +IYD NV+    G  GA NG   +       G +D
Sbjct: 801 GHWYLKSCGFDYEIYPKENVRTALKRIYDNNVMGFHEGNIGAANGFIANAGEPTKPGHVD 860

Query: 776 MSGLQAREIWPGVTYGLAASMIQEEMVDMAFQTAAGVYEVAWSEDGLGYSFQTPESWNNN 835
            S +QA E+WPGV Y LAA+MIQE M + AFQTA G+Y+       +G +F+TPE+    
Sbjct: 861 NSNIQAEEVWPGVVYALAATMIQEGMFEEAFQTAGGMYKTL--SQRIGMNFETPEALYGE 918

Query: 836 DEYRSLCYMRPLTIWAMQWALTKPKLSR 863
             YRS+ YMRPL+I +MQ AL + +  R
Sbjct: 919 KRYRSIGYMRPLSILSMQVALERRRAQR 946


>gi|195338407|ref|XP_002035816.1| GM15538 [Drosophila sechellia]
 gi|194129696|gb|EDW51739.1| GM15538 [Drosophila sechellia]
          Length = 947

 Score =  311 bits (798), Expect = 8e-82,   Method: Compositional matrix adjust.
 Identities = 182/549 (33%), Positives = 286/549 (52%), Gaps = 64/549 (11%)

Query: 103 YRGEFQRFKLFHGICDDAPVLANQFSVFVSRPNG-EKFSSVLCPRSPGVPKKNTDSG--- 158
           Y GEF RF++  GI +   V ANQF V +  P G   F S+L   S      + D+    
Sbjct: 131 YAGEFCRFQMRPGIYEYNVVQANQFIVTIKDPKGCTIFQSLLSKCSTRDKTSDPDADPDG 190

Query: 159 -----------------IESWDWNLKGENCTYHALFPRAWTVYDGEPDPELRIVCRQISP 201
                            + +W  N++   C+Y  L+PR+WT YD      +R+ CRQ+SP
Sbjct: 191 ERTKCQLPNCSSRAKQPLSAWHSNIEDSRCSYTGLYPRSWTEYDLS-HYGVRLTCRQVSP 249

Query: 202 FIPHNYKESSFPASVFTFTLSNSGQTSADVTLLFTWANSVAGDSGLSGHHFNSKTMTKDG 261
            IPH Y+ESS P +VF +++ N       V++ FT+ N        +     S+ +++  
Sbjct: 250 VIPHEYRESSQPCAVFVWSVENVCDQERKVSITFTFKNGTGNKKQDAEGGAESQLISEGN 309

Query: 262 VHGLTLHHRTANGRPPVTFAVAAEETADVHVSECPCFLLSGNSKGITAKDMWNEIKKHGS 321
             G+++  + A+   P ++ +A     ++ ++ CP F  +GN      + +W ++K+HG 
Sbjct: 310 AKGVSIRQKIADM--PCSYNLACRVLPEISITRCPQFDPAGN-----GEQLWAQLKEHG- 361

Query: 322 FDHLDNDKTSPSEPGSSIGAAIAASLTIPSGSTRSVTFSLAWDCPEVKFFEKV--YHRRY 379
              L    TS +     IG A+ A + +   ++  + F LAWD P+++F  K+  + R Y
Sbjct: 362 --QLSEQPTSEALKTKDIGVAVCAQVALKPMASHDLEFVLAWDMPKIQFPRKMQTHTRYY 419

Query: 380 TKFYGTLGDSAARIAHDAILEHAKWECEIEAWQRPILEDKRFPEWYPITLFNELYYLNAG 439
           TK++   GDS  RI   A+ +++ WE  I+AWQRPIL D+  P+WY   +FN+LY+++ G
Sbjct: 420 TKYFDDSGDSGPRICEYALRQYSTWERLIDAWQRPILNDETLPDWYKCAIFNQLYFISDG 479

Query: 440 GTIWTDGSPPMQSLATIRERKFSLDTPSKIHTPASSDTALGTRLLENGEENIGQFLYLEG 499
           GTIW      +        ++ S D P   +                     G+F YLEG
Sbjct: 480 GTIWLKCESSLG-------KELSYDDPRLAY---------------------GRFGYLEG 511

Query: 500 AEYVMYNTYDVHFYSSFALVMLFPKLELSIQRDFAAAVMMHDPGTMKIMSDGKWVARKCL 559
            EY MYNTYDVHFY+S AL  L+P L++S+Q DF  A+      T K++ DGK + RK  
Sbjct: 512 HEYRMYNTYDVHFYASPALAHLWPNLQVSLQYDFKDAIAAELNDTRKMLYDGKVMPRKVK 571

Query: 560 GAVPHDIGLDD--PWFEINSYNLFNSSRWKDLNSKFVLQVYRDFVATGDKNFARAVWPSV 617
             VPHD+G  D  P+  IN YN+ + + WKDLN+KFVLQVYRD+    +   A+A   S 
Sbjct: 572 NCVPHDLGDPDEEPFTLINCYNIHDVNDWKDLNTKFVLQVYRDYYVLNELAQAQADNASK 631

Query: 618 YIAMAYMEQ 626
           + ++ ++++
Sbjct: 632 FSSIEFIDK 640



 Score =  211 bits (537), Expect = 2e-51,   Method: Compositional matrix adjust.
 Identities = 116/268 (43%), Positives = 160/268 (59%), Gaps = 20/268 (7%)

Query: 609 FARAVWPSVYIAMAYMEQFDKDGDGMIENEGFPDQTYDAWSANGVSAYCGGLWVAALQAA 668
           + +A++ S    +    ++DKD DG+IEN   PDQTYD+W  +G SAYC GLW+AALQA 
Sbjct: 686 YLKAMYASCKAILERTIEYDKDNDGLIENTKMPDQTYDSWVMDGPSAYCSGLWLAALQAM 745

Query: 669 SALANDVGDHASASYFWVRYQK-----AKAVYDSLWNGSYFNYDNSDGSSSTSIQADQLA 723
           SA+A  + D  +     +RYQ       +++ + LWNGSY+ +D S     T I ADQL 
Sbjct: 746 SAMAT-ILDQPNDC---LRYQDILEKGKRSLEEKLWNGSYYRFDLSHSHRDT-IMADQLC 800

Query: 724 GQWYARACGL-LPIADEAKVKKALTKIYDFNVLKVKGGMCGAMNGMQPD-------GRID 775
           G WY ++CG    I  +  V+ AL +IYD NV+    G  GA NG   +       G +D
Sbjct: 801 GHWYLKSCGFDYEIYPKENVRTALKRIYDNNVMGFHEGNIGAANGFIANAGEPTKPGHVD 860

Query: 776 MSGLQAREIWPGVTYGLAASMIQEEMVDMAFQTAAGVYEVAWSEDGLGYSFQTPESWNNN 835
            S +QA E+WPGV Y LAA+MIQE M + AFQTA G+Y+       +G +F+TPE+    
Sbjct: 861 NSNIQAEEVWPGVVYALAATMIQEGMFEEAFQTAGGMYKTL--SQRIGMNFETPEALYGE 918

Query: 836 DEYRSLCYMRPLTIWAMQWALTKPKLSR 863
             YRS+ YMRPL+IW+MQ AL + +  R
Sbjct: 919 KRYRSIGYMRPLSIWSMQVALERRRAQR 946


>gi|33866127|ref|NP_897686.1| hypothetical protein SYNW1593 [Synechococcus sp. WH 8102]
 gi|33639102|emb|CAE08108.1| conserved hypothetical protein [Synechococcus sp. WH 8102]
          Length = 833

 Score =  310 bits (794), Expect = 2e-81,   Method: Compositional matrix adjust.
 Identities = 249/827 (30%), Positives = 390/827 (47%), Gaps = 118/827 (14%)

Query: 86  DQGIPLGGIGAGSIGRSYRGEFQRFKLFHGICDDAPVLANQFSVFVSRPNGEKFSSVLCP 145
           D G+PLGG GAG +GRS  G F  + L  G      +   QF+++  +  GE   +    
Sbjct: 54  DHGMPLGGFGAGCLGRSPSGAFNLWNLDGGEHWFGSIPDCQFALWEQQ--GEDVRAHALA 111

Query: 146 RSPGVPKKNTDSG--IESWDW---NLKGENC-TYHALFPRAWTVYDGEPDPELRIVCRQI 199
            +P     +   G  + +W W   + +G +  TY A +P +WT +  E      ++C   
Sbjct: 112 TAPDRDDTDPSGGTPLSAWQWYPASTQGRSTGTYAARYPLSWTHH--EDVYRAGVLCEAF 169

Query: 200 SPFIPHNYKESSFPASVFTFTLSNSGQTSADVTLLFTWANSVA-------------GDSG 246
           SP +P +Y+ +S+P +VF +  SN      +++L+  W N+V               D G
Sbjct: 170 SPILPGDYQRTSYPVAVFRWQFSNPTDQPLELSLMLCWRNTVGWFTNTDAAAEVHFRDDG 229

Query: 247 LSGHHF----------NSKTMTKDGVHGLTLHHRTANGRPPVTFA--------VAAEETA 288
              H++           ++ + + G  G+ L      G+P +  A           ++  
Sbjct: 230 SPEHNYAPAIGRGEGQRNRQVDQPGFSGVLLE-----GQPSMPLAEGEGQWCLALPDDLD 284

Query: 289 DVHVSECPCFLLSGNSKGITAKDMWNEIKKHGSFDHLDNDKTSPSEPGSSIGAAIAASLT 348
            VHV  C  +  SG+       ++W      G     +ND+ S S  G    AAIA   T
Sbjct: 285 GVHVMRCSRWDPSGD-----GSELWTPFAADGRIPDSNNDRASRS--GEQASAAIAVKFT 337

Query: 349 IPSGSTRSVTFSLAWDCPEVKFFE-KVYHRRYTKFYGTLGDSAARIAHDAILEHAKWECE 407
           +  G++R +   ++WD P   F       RRYT F+G+ G +AA IA +A+ +  +W  +
Sbjct: 338 LAPGASREIPLVISWDLPVTAFASGSSALRRYTDFFGSSGSNAAAIAAEALRDWRQWREQ 397

Query: 408 IEAWQRPILEDKRFPEWYPITLFNELYYLNAGGTIWTDGSPPMQSLATIRERKFSLDTPS 467
           I+AWQ+P+L   + PE   + LFNELY L +GG++WT  SP                   
Sbjct: 398 IDAWQQPVLARMQLPEPLRMALFNELYDLCSGGSLWTAASP------------------- 438

Query: 468 KIHTPASSDTALGTRLLENGEENIGQFLYLEGAEYVMYNTYDVHFYSSFALVMLFPKLEL 527
                                +  G+F  LE  +Y  Y + DV  Y S AL+ L+P+L+ 
Sbjct: 439 --------------------ADPHGRFGVLECLDYAWYESLDVRLYGSLALLQLWPELDK 478

Query: 528 SIQRDFAAAVMMHDPGTMKI-----MSDGKWVA-RKCLGAVPHDIGL-DDPWFEINSYNL 580
           ++ R FA A+   D     I        G+  A RK  GA PHD+G  ++  F+  +Y  
Sbjct: 479 AVLRSFARAIPAADGSQRPIGWYFTQGRGRVEADRKVAGATPHDLGAPNERPFDATNYTA 538

Query: 581 F-NSSRWKDLNSKFVLQVYRDF--VATG-DKNFARAVWPSVYIAMAYMEQFDKDGDGMIE 636
           + + + WKDL S ++LQV+R F    TG D  F    WPS   A+ Y++ FD + DG+ +
Sbjct: 539 YQDCNLWKDLASDYMLQVWRTFRLAPTGEDLRFLAECWPSAVQALDYLKGFDVNNDGLPD 598

Query: 637 NEGFPDQTYDAWSANGVSAYCGGLWVAALQAASA----LANDVGDHASASY--FWVRYQK 690
           N G PDQT+D W   GVSAYCG LW+AAL+AA A    L  D+G   +A    F    ++
Sbjct: 599 NGGAPDQTFDDWPLQGVSAYCGALWIAALEAALAMAQRLQLDLGLDTAAEQRRFGAWLEQ 658

Query: 691 AKAVYDS-LWNGSYFNYDNSDGSSSTSIQADQLAGQWYARACGLLPIADEAKVKKALTKI 749
           ++A +D  LWNG Y+  D   G+    + ADQL G +YAR  GL P+  +A+ +  L  +
Sbjct: 659 SRANFDDLLWNGEYYRIDAESGTP--VVMADQLCGDFYARLLGLPPVVSDARARSTLQAV 716

Query: 750 YDFNVLKVKGGMCGAMNGMQPDGR-IDMSGLQAREIWPGVTYGLAASMIQEEMVDMAFQT 808
            +    + + G  G  NG++ DG  +D +G    E+W G+ +GLA+          A   
Sbjct: 717 KEACFERFESGRLGVANGLRRDGSPLDPNGTHPLEVWTGINFGLASYYRLMGESKTALAI 776

Query: 809 AAGVYEVAWSEDGLGYSFQTPESWNNNDEYRSLCYMRPLTIWAMQWA 855
            + V    +S    G  F+TPE+    + +R+  Y+R + IW + WA
Sbjct: 777 CSAVVGQVYSG---GLQFRTPEAITAVNTFRACHYLRAMAIWGL-WA 819


>gi|195474003|ref|XP_002089281.1| GE19029 [Drosophila yakuba]
 gi|194175382|gb|EDW88993.1| GE19029 [Drosophila yakuba]
          Length = 947

 Score =  310 bits (794), Expect = 2e-81,   Method: Compositional matrix adjust.
 Identities = 180/550 (32%), Positives = 286/550 (52%), Gaps = 66/550 (12%)

Query: 103 YRGEFQRFKLFHGICDDAPVLANQFSVFVSRPNG-EKFSSVLCPRSPGVPKKNTDSG--- 158
           Y GEF RF++  GI +   V ANQF V +  P G   F S+L   S      + D+    
Sbjct: 131 YAGEFCRFQMRPGIYEYNVVQANQFIVTIKDPKGCTIFQSLLSKCSTRDKSSDPDADPDG 190

Query: 159 -----------------IESWDWNLKGENCTYHALFPRAWTVYDGEPDPELRIVCRQISP 201
                            + +W  N++   C+Y  L+PR+WT YD      +R+ CRQ+SP
Sbjct: 191 ERTKCQLPNCSSRAKQPLSAWHSNIEDSRCSYTGLYPRSWTEYDLS-HYGVRLTCRQVSP 249

Query: 202 FIPHNYKESSFPASVFTFTLSNSGQTSADVTLLFTWANSVAGDSGLSGHHFNSKTMTKDG 261
            IPH Y++SS P +VF +++ N       V++ F++ N        +     S+ +++  
Sbjct: 250 VIPHEYRDSSLPCAVFVWSVENVCDQERKVSITFSFKNGTGNKKQDAEGGAESQLISEGN 309

Query: 262 VHGLTLHHRTANGRPPVTFAVAAEETADVHVSECPCFLLSGNSKGITAKDMWNEIKKHGS 321
             G+++  + A    P ++ +A     ++ ++ CP F  +GN      + +W ++K+HG 
Sbjct: 310 AKGVSIRQKIAEM--PCSYNLACRVLPEISITRCPQFDPAGN-----GEQLWAQLKEHG- 361

Query: 322 FDHLDNDKTSPSEPGSSIGAAIAASLTIPSGSTRSVTFSLAWDCPEVKFFEKV--YHRRY 379
              L    TS +     IG A+ A + +   ++  + F LAWD P+++F  K+  + R Y
Sbjct: 362 --QLSEQPTSEALKTKDIGVAVCAQVALKPMASHDLEFVLAWDMPKIQFPRKMQTHTRYY 419

Query: 380 TKFYGTLGDSAARIAHDAILEHAKWECEIEAWQRPILEDKRFPEWYPITLFNELYYLNAG 439
           TK++   GDS  +I   A+ +++ WE  I+AWQRPIL D+  P+WY   +FN+LY+++ G
Sbjct: 420 TKYFDASGDSGPKICEYALRQYSTWERLIDAWQRPILNDESLPDWYKCAIFNQLYFISDG 479

Query: 440 GTIWTDGSPPMQSLATIRERKFSLDTPSKIHTPASSDTALGTRL-LENGEENIGQFLYLE 498
           GTIW                                D++LG  L  ++     G+F YLE
Sbjct: 480 GTIWL-----------------------------KCDSSLGKELTYDDPRLAYGRFGYLE 510

Query: 499 GAEYVMYNTYDVHFYSSFALVMLFPKLELSIQRDFAAAVMMHDPGTMKIMSDGKWVARKC 558
           G EY MYNTYDVHFY+S AL  L+P L++S+Q DF  A+      T K++ DGK + RK 
Sbjct: 511 GHEYRMYNTYDVHFYASPALAHLWPNLQVSLQYDFKDAIAAELSDTRKMLYDGKVMPRKV 570

Query: 559 LGAVPHDIGLDD--PWFEINSYNLFNSSRWKDLNSKFVLQVYRDFVATGDKNFARAVWPS 616
              VPHD+G  D  P+  IN YN+ + + WKDLN+KFVLQVYRD+    +   A+A   S
Sbjct: 571 KNCVPHDLGDPDEEPFTLINCYNIHDVNDWKDLNTKFVLQVYRDYYVLNELAQAQADNAS 630

Query: 617 VYIAMAYMEQ 626
            + ++ ++++
Sbjct: 631 KFSSIEFIDK 640



 Score =  210 bits (535), Expect = 2e-51,   Method: Compositional matrix adjust.
 Identities = 116/268 (43%), Positives = 159/268 (59%), Gaps = 20/268 (7%)

Query: 609 FARAVWPSVYIAMAYMEQFDKDGDGMIENEGFPDQTYDAWSANGVSAYCGGLWVAALQAA 668
           + +A++      M    ++DKD DG+IEN   PDQTYD+W  +G SAYC GLW+AALQA 
Sbjct: 686 YLKAMYAPCKAIMERTIEYDKDNDGLIENTKMPDQTYDSWVMDGPSAYCSGLWLAALQAM 745

Query: 669 SALANDVGDHASASYFWVRYQK-----AKAVYDSLWNGSYFNYDNSDGSSSTSIQADQLA 723
           SA+A  + D  +     +RYQ       +++ + LWNGSY+ +D S     T I ADQL 
Sbjct: 746 SAMAT-ILDQPNDC---LRYQDILEKGKRSLEEKLWNGSYYRFDLSHSHRDT-IMADQLC 800

Query: 724 GQWYARACGL-LPIADEAKVKKALTKIYDFNVLKVKGGMCGAMNGMQPD-------GRID 775
           G WY ++CG    I  +  V+ AL +IYD NV+    G  GA NG   +       G +D
Sbjct: 801 GHWYLKSCGFDYEIYPKENVRTALKRIYDNNVMGFHEGNIGAANGFIANAGEPTKPGHVD 860

Query: 776 MSGLQAREIWPGVTYGLAASMIQEEMVDMAFQTAAGVYEVAWSEDGLGYSFQTPESWNNN 835
            S +QA E+WPGV Y LAA+MIQE M + AFQTA G+Y+       +G +F+TPE+    
Sbjct: 861 NSNIQAEEVWPGVVYALAATMIQEGMFEEAFQTAGGMYKTL--SQRIGMNFETPEALYGE 918

Query: 836 DEYRSLCYMRPLTIWAMQWALTKPKLSR 863
             YRS+ YMRPL+IW+MQ AL + +  R
Sbjct: 919 KRYRSIGYMRPLSIWSMQVALERRRAQR 946


>gi|195397814|ref|XP_002057523.1| GJ18180 [Drosophila virilis]
 gi|194141177|gb|EDW57596.1| GJ18180 [Drosophila virilis]
          Length = 869

 Score =  310 bits (793), Expect = 3e-81,   Method: Compositional matrix adjust.
 Identities = 186/556 (33%), Positives = 284/556 (51%), Gaps = 72/556 (12%)

Query: 103 YRGEFQRFKLFHGICDDAPVLANQFSVFVSRPNG-EKFSSVL--CPRSPGVPKKNTDSG- 158
           Y GEF RF++  GI +   V ANQF V +    G   F S+L  C       K N +   
Sbjct: 47  YAGEFCRFQMRPGIYEYNVVHANQFIVTIKDHKGCTIFQSLLSRCSTKSFKAKTNNNDAE 106

Query: 159 -----------------------IESWDWNLKGENCTYHALFPRAWTVYDGEPDPELRIV 195
                                  + +W  N++   C+Y  L+PRAWT YD      +R++
Sbjct: 107 ADEADAQKTKCQLPNCSSRPRQPLSAWHSNIEDSRCSYTGLYPRAWTEYDLS-HYGVRLI 165

Query: 196 CRQISPFIPHNYKESSFPASVFTFTLSNSGQTSADVTLLFTWANSVAGDSGLSGHHFNSK 255
           CRQISP IPH+YKESS P +VF +++ N       V++ FT+ N        +  +  S+
Sbjct: 166 CRQISPVIPHDYKESSLPCAVFVWSVQNVCDQERKVSITFTFKNGTGNKKQDADGNAESQ 225

Query: 256 TMTKDGVHGLTLHHRTANGRPPVTFAVAAEETADVHVSECPCFLLSGNSKGITAKDMWNE 315
            +++    G+ +    A+   P ++ +A     ++ ++ CP F  +GN      + +W +
Sbjct: 226 LISEGNAKGVAIRQNIADM--PCSYNLACRVLPEISITRCPQFDPAGN-----GEQLWAQ 278

Query: 316 IKKHGSFDHLDNDKTSPSEPGSSIGAAIAASLTIPSGSTRSVTFSLAWDCPEVKFFEKV- 374
           +K+HG    L    T+ +     IG A+ A L +   +   + F LAWD P+++F  K+ 
Sbjct: 279 LKEHG---QLSEQPTAETLRSKDIGVAVCAQLALKPQAYHELEFVLAWDMPKIQFPRKLQ 335

Query: 375 -YHRRYTKFYGTLGDSAARIAHDAILEHAKWECEIEAWQRPILEDKRFPEWYPITLFNEL 433
            + R YTK++   G+S  RI   A+  +A WE  I+AWQRPIL D   P+WY   +FN+L
Sbjct: 336 THTRYYTKYFDDSGESGPRICEYALKHYASWERLIDAWQRPILNDDGLPDWYKCAIFNQL 395

Query: 434 YYLNAGGTIWTDGSPPMQSLATIRERKFSLDTPSKIHTPASSDTALGTRL-LENGEENIG 492
           Y+++ GGTIW                                D++LG  L  ++     G
Sbjct: 396 YFISDGGTIWL-----------------------------KCDSSLGQSLAFDDPRLAYG 426

Query: 493 QFLYLEGAEYVMYNTYDVHFYSSFALVMLFPKLELSIQRDFAAAVMMHDPGTMKIMSDGK 552
           +F YLEG EY MYNTYDVHFY+S AL  L+P L++S+Q DF  A+      T K++ DGK
Sbjct: 427 RFGYLEGHEYRMYNTYDVHFYASPALAHLWPNLQVSLQYDFKDAIAAELNDTRKMLYDGK 486

Query: 553 WVARKCLGAVPHDIGLDD--PWFEINSYNLFNSSRWKDLNSKFVLQVYRDFVATGDKNFA 610
            + RK    VPHD+G  D  P+  IN YN+ + + WKDLN+KFVLQVYRD+    +   A
Sbjct: 487 VMPRKVKNCVPHDLGDPDEEPFTLINCYNIHDVNDWKDLNTKFVLQVYRDYYVLNELAQA 546

Query: 611 RAVWPSVYIAMAYMEQ 626
           +A   S + ++ ++++
Sbjct: 547 QADNASKFSSIEFIDK 562



 Score =  202 bits (514), Expect = 7e-49,   Method: Compositional matrix adjust.
 Identities = 111/264 (42%), Positives = 153/264 (57%), Gaps = 12/264 (4%)

Query: 609 FARAVWPSVYIAMAYMEQFDKDGDGMIENEGFPDQTYDAWSANGVSAYCGGLWVAALQAA 668
           + +A++ +    M    ++DKD DG+IEN   PDQTYD+W  +G SAYC GLW+AALQ  
Sbjct: 608 YLKAMYGACKAIMERTIEYDKDNDGLIENTKMPDQTYDSWVMDGPSAYCAGLWLAALQTM 667

Query: 669 SALANDVGDHASASYFWVRYQKAK-AVYDSLWNGSYFNYDNSDGSSSTSIQADQLAGQWY 727
           S +A  +        +    +K K ++ + LWNGSY+ +D S     T I ADQL G WY
Sbjct: 668 SVMATLLDQPNDCLRYQDILEKGKHSLEEKLWNGSYYRFDLSHNHRDT-IMADQLCGHWY 726

Query: 728 ARACGL-LPIADEAKVKKALTKIYDFNVLKVKGGMCGAMNGMQPD-------GRIDMSGL 779
            ++CG    I  +  V+ AL +IYD NV+    G  GA NG   +       G +D S +
Sbjct: 727 LKSCGFDYEIYPKENVRTALKRIYDNNVMGFHEGNIGAANGFIANASEPSKPGHVDNSNI 786

Query: 780 QAREIWPGVTYGLAASMIQEEMVDMAFQTAAGVYEVAWSEDGLGYSFQTPESWNNNDEYR 839
           QA E+WPGV Y LAA+MIQE M + AFQTA G+Y+      G+ Y  +TPE+      YR
Sbjct: 787 QAEEVWPGVVYALAATMIQEGMFEEAFQTAGGMYKTISQRIGMNY--ETPEALYGEKRYR 844

Query: 840 SLCYMRPLTIWAMQWALTKPKLSR 863
           S+ YMRPL+IW+MQ A  + +  R
Sbjct: 845 SIGYMRPLSIWSMQVAWERRRALR 868


>gi|194860149|ref|XP_001969520.1| GG23916 [Drosophila erecta]
 gi|190661387|gb|EDV58579.1| GG23916 [Drosophila erecta]
          Length = 947

 Score =  309 bits (792), Expect = 4e-81,   Method: Compositional matrix adjust.
 Identities = 180/550 (32%), Positives = 287/550 (52%), Gaps = 66/550 (12%)

Query: 103 YRGEFQRFKLFHGICDDAPVLANQFSVFVSRPNG-EKFSSVLCPRSPGVPKKNTDSG--- 158
           Y GEF RF++  GI +   V ANQF V +  P G   F S+L   S      + D+    
Sbjct: 131 YAGEFCRFQMRPGIYEYNVVQANQFIVTIKDPKGCTIFQSLLSKCSTRDKTSDPDADPDG 190

Query: 159 -----------------IESWDWNLKGENCTYHALFPRAWTVYDGEPDPELRIVCRQISP 201
                            + +W  N++   C+Y  L+PR+WT YD      +R++CRQ+SP
Sbjct: 191 ERTKCQLPNCSSRAKQPLSAWHSNIEDSRCSYTGLYPRSWTEYDLS-HYGVRLICRQVSP 249

Query: 202 FIPHNYKESSFPASVFTFTLSNSGQTSADVTLLFTWANSVAGDSGLSGHHFNSKTMTKDG 261
            IPH Y+ESS P +VF +++ N       V++ F++ N        +     S+ +++  
Sbjct: 250 VIPHEYRESSLPCAVFVWSVENVCDQERKVSITFSFKNGTGNKKQDAEGGAESQLISEGN 309

Query: 262 VHGLTLHHRTANGRPPVTFAVAAEETADVHVSECPCFLLSGNSKGITAKDMWNEIKKHGS 321
             G+++  + A    P ++ +A     ++ ++ CP F  +GN      + +W ++K+HG 
Sbjct: 310 AKGVSIRQKIAEM--PCSYNLACRVLPEISITRCPQFDPAGN-----GEPLWAQLKEHG- 361

Query: 322 FDHLDNDKTSPSEPGSSIGAAIAASLTIPSGSTRSVTFSLAWDCPEVKFFEKV--YHRRY 379
              L    TS +     IG A+ A + +   ++  + F LAWD P+++F  K+  + R Y
Sbjct: 362 --QLSEQPTSEALKTKDIGVAVCAQVALKPMASHDLEFVLAWDMPKIQFPRKMQTHTRYY 419

Query: 380 TKFYGTLGDSAARIAHDAILEHAKWECEIEAWQRPILEDKRFPEWYPITLFNELYYLNAG 439
           TK++   GD+  +I   A+ +++ WE  I+AWQRPIL D+  P+WY   +FN+LY+++ G
Sbjct: 420 TKYFDDSGDAGPKICEYALRQYSTWERLIDAWQRPILNDESLPDWYKCAIFNQLYFISDG 479

Query: 440 GTIWTDGSPPMQSLATIRERKFSLDTPSKIHTPASSDTALGTRL-LENGEENIGQFLYLE 498
           GTIW                                D++LG  L  ++     G+F YLE
Sbjct: 480 GTIWL-----------------------------KCDSSLGKELAYDDPRLAYGRFGYLE 510

Query: 499 GAEYVMYNTYDVHFYSSFALVMLFPKLELSIQRDFAAAVMMHDPGTMKIMSDGKWVARKC 558
           G EY MYNTYDVHFY+S AL  L+P L++S+Q DF  A+      T K++ DGK + RK 
Sbjct: 511 GHEYRMYNTYDVHFYASPALAHLWPNLQVSLQYDFKDAIAAELGDTRKMLYDGKVMPRKV 570

Query: 559 LGAVPHDIGLDD--PWFEINSYNLFNSSRWKDLNSKFVLQVYRDFVATGDKNFARAVWPS 616
              VPHD+G  D  P+  IN YN+ + + WKDLN+KFVLQVYRD+    +   A+A   S
Sbjct: 571 KNCVPHDLGDPDEEPFTLINCYNIHDVNDWKDLNTKFVLQVYRDYYVLNELAQAQADNAS 630

Query: 617 VYIAMAYMEQ 626
            + ++ ++++
Sbjct: 631 KFSSIEFIDK 640



 Score =  210 bits (534), Expect = 3e-51,   Method: Compositional matrix adjust.
 Identities = 115/264 (43%), Positives = 154/264 (58%), Gaps = 12/264 (4%)

Query: 609 FARAVWPSVYIAMAYMEQFDKDGDGMIENEGFPDQTYDAWSANGVSAYCGGLWVAALQAA 668
           + +A++      M    ++DKD DG+IEN   PDQTYD+W  +G SAYC GLW+AALQA 
Sbjct: 686 YLKAMYAPCKAIMERTIEYDKDNDGLIENTKMPDQTYDSWVMDGPSAYCSGLWLAALQAM 745

Query: 669 SALANDVGDHASASYFWVRYQKAKAVYD-SLWNGSYFNYDNSDGSSSTSIQADQLAGQWY 727
           SA+A  +        +    +K K   D  LWNGSY+ +D S     T I ADQL G WY
Sbjct: 746 SAMATILDQPNDCLRYQDILEKGKRSLDEKLWNGSYYRFDLSHSHRDT-IMADQLCGHWY 804

Query: 728 ARACGL-LPIADEAKVKKALTKIYDFNVLKVKGGMCGAMNGMQPD-------GRIDMSGL 779
            ++CG    I  +  V+ AL +IYD NV+    G  GA NG   +       G +D S +
Sbjct: 805 LKSCGFDYEIYPKENVRTALKRIYDNNVMGFHEGNIGAANGFIANAGEPTKPGHVDNSNI 864

Query: 780 QAREIWPGVTYGLAASMIQEEMVDMAFQTAAGVYEVAWSEDGLGYSFQTPESWNNNDEYR 839
           QA E+WPGV Y LAA+MIQE M + AFQTA G+Y+       +G +F+TPE+      YR
Sbjct: 865 QAEEVWPGVVYALAATMIQEGMFEEAFQTAGGMYKTL--SQRIGMNFETPEALYGEKRYR 922

Query: 840 SLCYMRPLTIWAMQWALTKPKLSR 863
           S+ YMRPL+IW+MQ AL + +  R
Sbjct: 923 SIGYMRPLSIWSMQVALERRRAQR 946


>gi|194760205|ref|XP_001962332.1| GF15415 [Drosophila ananassae]
 gi|190616029|gb|EDV31553.1| GF15415 [Drosophila ananassae]
          Length = 894

 Score =  308 bits (790), Expect = 7e-81,   Method: Compositional matrix adjust.
 Identities = 183/552 (33%), Positives = 283/552 (51%), Gaps = 67/552 (12%)

Query: 103 YRGEFQRFKLFHGICDDAPVLANQFSVFVSRPNG-EKFSSVLCPRS---------PGVPK 152
           Y GEF RF++  GI +   V ANQF V +  P G   F S+L   S         P  P 
Sbjct: 75  YAGEFCRFQMRPGIYEYNVVQANQFIVTIKDPKGCTIFQSLLSKCSTRGSSGGGNPADPD 134

Query: 153 KN--------------TDSGIESWDWNLKGENCTYHALFPRAWTVYDGEPDPELRIVCRQ 198
            +              T   + +W  N++   C Y  L+PR+WT YD      +R+VCRQ
Sbjct: 135 ADGEKTKCQLPNCSSRTKQPLSAWHSNIEDSRCQYTGLYPRSWTEYDLS-HYGVRLVCRQ 193

Query: 199 ISPFIPHNYKESSFPASVFTFTLSNSGQTSADVTLLFTWANSVAGDSGLSGHHFNSKTMT 258
           +SP IPH Y+ESS P +VF + + N       V++ FT+ N        +     S+ ++
Sbjct: 194 VSPVIPHEYRESSLPCAVFVWDVENVCDQERKVSITFTFKNGTGNKKQDAEGGAESQLIS 253

Query: 259 KDGVHGLTLHHRTANGRPPVTFAVAAEETADVHVSECPCFLLSGNSKGITAKDMWNEIKK 318
           +    G+ +  + A    P ++ +A     ++ +S CP F  +GN      + +W ++K+
Sbjct: 254 EGNAKGVAIRQKIAEM--PCSYNLACRVRPEISISRCPQFDPAGN-----GEQLWAQLKE 306

Query: 319 HGSFDHLDNDKTSPSEPGSSIGAAIAASLTIPSGSTRSVTFSLAWDCPEVKFFEKV--YH 376
           HG    L+   T  +     IG A+ A + +   ++  + F LAWD P+++F  K+  + 
Sbjct: 307 HG---QLNEQPTPETLKTKDIGVAVCAQVALKPKASHKLEFVLAWDMPKIQFPRKLQTHT 363

Query: 377 RRYTKFYGTLGDSAARIAHDAILEHAKWECEIEAWQRPILEDKRFPEWYPITLFNELYYL 436
           R YTK++   G+S  +I   A+ ++  WE  I+AWQRPIL D+  P+WY   +FN+LY++
Sbjct: 364 RYYTKYFDDSGESGPKICEYALKQYPSWERLIDAWQRPILNDETLPDWYKCAIFNQLYFI 423

Query: 437 NAGGTIWTDGSPPMQSLATIRERKFSLDTPSKIHTPASSDTALGTRLLENGEENIGQFLY 496
           + GGTIW           +   ++ S D P   +                     G+F Y
Sbjct: 424 SDGGTIWL-------KCDSFLGKELSYDDPRLAY---------------------GRFGY 455

Query: 497 LEGAEYVMYNTYDVHFYSSFALVMLFPKLELSIQRDFAAAVMMHDPGTMKIMSDGKWVAR 556
           LEG EY MYNTYDVHFY+S AL  L+P L++S+Q DF  A+      T K++ DGK + R
Sbjct: 456 LEGHEYRMYNTYDVHFYASPALAHLWPNLQVSLQYDFKDAIAADLTDTRKMLYDGKVMPR 515

Query: 557 KCLGAVPHDIGLDD--PWFEINSYNLFNSSRWKDLNSKFVLQVYRDFVATGDKNFARAVW 614
           K    VPHD+G  D  P+  IN YN+ + + WKDLN+KFVLQVYRD+    +   A+A  
Sbjct: 516 KVKNCVPHDLGDPDEEPFTLINCYNIHDVNDWKDLNTKFVLQVYRDYYVLNELAQAQADN 575

Query: 615 PSVYIAMAYMEQ 626
            S + ++ ++++
Sbjct: 576 ASKFSSIEFIDK 587



 Score =  213 bits (541), Expect = 5e-52,   Method: Compositional matrix adjust.
 Identities = 114/264 (43%), Positives = 158/264 (59%), Gaps = 12/264 (4%)

Query: 609 FARAVWPSVYIAMAYMEQFDKDGDGMIENEGFPDQTYDAWSANGVSAYCGGLWVAALQAA 668
           + +A++ S    M    ++DKD DG+IEN   PDQTYD+W  +G S+YC GLW+AALQA 
Sbjct: 633 YLKAMYASCKAIMERTIEYDKDNDGLIENTKMPDQTYDSWVMDGPSSYCSGLWLAALQAM 692

Query: 669 SALANDVGDHASASYFWVRYQKAK-AVYDSLWNGSYFNYDNSDGSSSTSIQADQLAGQWY 727
           SA+A  +        +    +K K ++ + LWNGSY+ +D S G   T I ADQL G WY
Sbjct: 693 SAMATILDQPNDCMRYQDILEKGKRSLEEKLWNGSYYRFDLSHGHRDT-IMADQLCGHWY 751

Query: 728 ARACGL-LPIADEAKVKKALTKIYDFNVLKVKGGMCGAMNGMQPD-------GRIDMSGL 779
            ++CG    I  +  V+ AL +IYD NV+    G  GA NG   +       G +D S +
Sbjct: 752 LKSCGFDYEIYPKENVRTALKRIYDNNVMGFHDGNIGAANGFIANASDPSKPGHVDNSNI 811

Query: 780 QAREIWPGVTYGLAASMIQEEMVDMAFQTAAGVYEVAWSEDGLGYSFQTPESWNNNDEYR 839
           Q+ E+WPGV Y LAA+MIQE M + AFQTA G+Y+       +G +F+TPE+      YR
Sbjct: 812 QSEEVWPGVVYALAATMIQEGMFEEAFQTAGGMYKTI--SQRIGMNFETPEALYGEKRYR 869

Query: 840 SLCYMRPLTIWAMQWALTKPKLSR 863
           S+ YMRPL+IW+MQ AL + +  R
Sbjct: 870 SIGYMRPLSIWSMQVALERRRAQR 893


>gi|301628717|ref|XP_002943495.1| PREDICTED: non-lysosomal glucosylceramidase [Xenopus (Silurana)
           tropicalis]
          Length = 692

 Score =  302 bits (774), Expect = 6e-79,   Method: Compositional matrix adjust.
 Identities = 175/443 (39%), Positives = 228/443 (51%), Gaps = 72/443 (16%)

Query: 419 KRFPEWYPITLFNELYYLNAGGTIWTDGSPPMQSLATIRERKFSLDTPSKIHTPASSDTA 478
           K  P WY   LFNELY++  GGT+W + S        +R  +   D P            
Sbjct: 301 KDLPAWYKSALFNELYFMADGGTVWVEVSAADDDDNLMR--RGCKDLPG----------- 347

Query: 479 LGTRLLENGEENIGQFLYLEGAEYVMYNTYDVHFYSSFALVMLFPKLELSIQRDFAAAVM 538
                +++  +  G+F YLEG EY MYNTYDVHFY+SFAL+ML+P+LE+S+Q D AA+V+
Sbjct: 348 -----MKDVLQEYGRFAYLEGQEYRMYNTYDVHFYASFALIMLWPQLEISLQYDMAASVL 402

Query: 539 MHDPGTMKIMSDGKWVARKCLGAVPHDIGLDDPWFEINSYNLFNSSRWKDLNSKFVLQVY 598
             DP   K +  G                       + S + + S           LQ  
Sbjct: 403 QEDPELRKYLMSGTVAP-------------------VKSRSYYYS------GPSLALQT- 436

Query: 599 RDFVATGDKNFARAVWPSVYIAMAYMEQFDKDGDGMIENEGFPDQTYDAWSANGVSAYCG 658
                                 M    +FD+DGDG+IEN GF DQTYD W   G S+YCG
Sbjct: 437 ---------------------VMETSLKFDEDGDGLIENSGFADQTYDDWVMTGPSSYCG 475

Query: 659 GLWVAALQAASALANDVGDHASASYFWVRYQKAKAVYDSL-WNGSYFNYDNSDGSSSTSI 717
           GLW+AA+     +A  +GD A+   F     + KA ++ L WNG Y+N+D  D   S S+
Sbjct: 476 GLWLAAVCMMCKMAEVLGDGAAQEKFTDILHRGKAAFEKLLWNGKYYNFDCGDQPYSNSV 535

Query: 718 QADQLAGQWYARACGLLPIADEA----KVKKALTKIYDFNVLKVKGGMCGAMNGMQPDGR 773
             DQ AG W+ RACGL     E      V  AL  I++ NV +   G  GA+NGM+PDG 
Sbjct: 536 MVDQCAGNWFLRACGLGAGESEVFPKDHVVSALRTIFELNVKQFADGQMGAVNGMRPDGT 595

Query: 774 IDMSGLQAREIWPGVTYGLAASMIQEEMVDMAFQTAAGVYEVAWSEDGLGYSFQTPESWN 833
            D S +Q+ E+W GV YGLAA+MI E +V   F TA G Y   W    LG SFQTPE++ 
Sbjct: 596 TDTSSVQSDEVWIGVVYGLAATMIHEGLVQEGFATAEGCYRTVWER--LGMSFQTPEAYC 653

Query: 834 NNDEYRSLCYMRPLTIWAMQWAL 856
               +RSL YMRPL+IWAMQ AL
Sbjct: 654 EKQVFRSLAYMRPLSIWAMQLAL 676


>gi|390344669|ref|XP_001200431.2| PREDICTED: non-lysosomal glucosylceramidase-like, partial
           [Strongylocentrotus purpuratus]
          Length = 347

 Score =  302 bits (773), Expect = 7e-79,   Method: Compositional matrix adjust.
 Identities = 156/325 (48%), Positives = 202/325 (62%), Gaps = 9/325 (2%)

Query: 541 DPGTMKIMSDGKWVARKCLGAVPHDIG--LDDPWFEINSYNLFNSSRWKDLNSKFVLQVY 598
           D  + + +S G    RK  G VPHD G   +DPW ++N Y   +++ WKDLN K VL VY
Sbjct: 13  DEDSFQDLSSGLRSIRKSAGCVPHDSGNPEEDPWLKLNCYWFHDTAEWKDLNLKMVLMVY 72

Query: 599 RDFVATGDKNFARAVWPSVYIAMAYMEQFDKDGDGMIENEGFPDQTYDAWSANGVSAYCG 658
           RD+ AT D+ F   +WP     MA     DKDGDG+I++ G  DQTYD W+A G SAYCG
Sbjct: 73  RDYFATKDREFLDFMWPKCMTVMAKANSQDKDGDGLIDHFGTADQTYDVWTAKGCSAYCG 132

Query: 659 GLWVAALQAASALANDVGDHASASYFWVRYQKAKAVYD-SLWNGSYFNYDNSDGSSSTSI 717
           GL++AAL+    +A+ +GD  +   +    Q+ KA Y+  LWNG YFNYD+SD S   SI
Sbjct: 133 GLYLAALKCMCEMADLLGDDKAKRQYSEILQRGKATYEKKLWNGRYFNYDSSDQSYHDSI 192

Query: 718 QADQLAGQWYARACGLLPIADEA----KVKKALTKIYDFNVLKVKGGMCGAMNGMQPDGR 773
            ADQ  G WY RAC L+P  D+      V+ AL  I+D NV+ VK G  GAMNG++P G+
Sbjct: 193 MADQTCGHWYLRACDLVPERDQVFQVDHVRSALRTIFDMNVMGVKEGNFGAMNGIRPSGK 252

Query: 774 IDMSGLQAREIWPGVTYGLAASMIQEEMVDMAFQTAAGVYEVAWSEDGLGYSFQTPESWN 833
            D + LQ  E+W G TYGLA +MIQE M +  F+TA G Y   + + GL Y  Q PE++ 
Sbjct: 253 PDHTSLQGEEVWVGTTYGLAGNMIQEGMWEEGFRTAKGCYTTCYEQAGLAY--QVPEAYM 310

Query: 834 NNDEYRSLCYMRPLTIWAMQWALTK 858
           +   YRSL YMRPL IWAMQWA+ K
Sbjct: 311 SKKIYRSLGYMRPLAIWAMQWAVEK 335


>gi|427702545|ref|YP_007045767.1| bile acid beta-glucosidase [Cyanobium gracile PCC 6307]
 gi|427345713|gb|AFY28426.1| putative bile acid beta-glucosidase [Cyanobium gracile PCC 6307]
          Length = 835

 Score =  300 bits (769), Expect = 2e-78,   Method: Compositional matrix adjust.
 Identities = 247/831 (29%), Positives = 383/831 (46%), Gaps = 122/831 (14%)

Query: 86  DQGIPLGGIGAGSIGRSYRGEFQRFKL-----FHGICDDAPVLANQFSVFVSRPNGEKFS 140
           + G+PLGG GAG +GR   G F  + L     + G+  D      QF++F    +G +  
Sbjct: 51  NHGMPLGGFGAGCLGRGLDGAFNLWHLDGGEHWFGVLPDC-----QFALFEQDGDGRRAH 105

Query: 141 SVLCPRSPGVPKKNTDSGIESWDW----NLKGENCTYHALFPRAWTVYDGEPDPELRIVC 196
           ++         +  +   + SW+W      +  + T    +P +   Y      ++R  C
Sbjct: 106 ALATAPQADDSRPGSGPPLASWNWYPAATPEQPSGTVAVRYPISSHHYAAVFSADVR--C 163

Query: 197 RQISPFIPHNYKESSFPASVFTFTLSNSGQTSADVTLLFTWANSVA-------------G 243
              SP +P +Y+ +S+P +VF +T SN      +++LL +W N+V               
Sbjct: 164 EAFSPILPGDYRRTSYPVAVFAWTFSNPTNQPLELSLLLSWRNTVGWFTNTDPAAEVHFR 223

Query: 244 DSGLSGHHF----------NSKTMTKDGVHGLTLH-HRT---ANGRPPVTFAVAAEETAD 289
           D G   H++           ++ +   G+ GL L   RT   A G      AV  E   +
Sbjct: 224 DDGSPEHNYVPAIGRGKGQRNRWVDGAGLKGLLLEGERTQPLAEGEGQWCLAVPDEAALN 283

Query: 290 ---VHVSECPCFLLSGNSKGITAKDMWNEIKKHGSFDHLDNDKTSPSEPGSSIGAAIAAS 346
              V V  C  +  SG+       ++W    + GS    DND+ S  +    + AA+A  
Sbjct: 284 HPGVEVFRCSRWDPSGD-----GAELWEPFSRDGSIPDSDNDRRSTGD--DPLSAALAVR 336

Query: 347 LTIPSGSTRSVTFSLAWDCPEVKFFEKVYH-RRYTKFYGTLGDSAARIAHDAILEHAKWE 405
             +  G++  +   ++WD P   F       RRYT F+   GDSAA IA +A+ +   W+
Sbjct: 337 FRLEPGASLEIPVVISWDLPVTAFATGTRALRRYTDFFSAEGDSAAAIAGEALADWRDWQ 396

Query: 406 CEIEAWQRPILEDKRFPEWYPITLFNELYYLNAGGTIWTDGSPPMQSLATIRERKFSLDT 465
             IEAWQ P+++     E   + L NELY L +GG++WT  SP                 
Sbjct: 397 AAIEAWQAPVVQRHDLAEPLRMALLNELYDLASGGSLWTAASP----------------- 439

Query: 466 PSKIHTPASSDTALGTRLLENGEENIGQFLYLEGAEYVMYNTYDVHFYSSFALVMLFPKL 525
                                  + +G+F  LE  +Y  Y + DV  Y SFAL+ L+P+L
Sbjct: 440 ----------------------GDPVGRFGVLECLDYAWYESLDVRLYGSFALLQLWPEL 477

Query: 526 ELSIQRDFAAAVMMHDPGTMKI---MSDGKW---VARKCLGAVPHDIGL-DDPWFEINSY 578
           + ++ R FA A+   DP    I    + G+     ARK  GA PHD+G  ++  F+  +Y
Sbjct: 478 DKAVLRSFARAIPAADPTPRPIGWYFTQGRGRVEAARKVAGATPHDLGAPNERPFDATNY 537

Query: 579 NLF-NSSRWKDLNSKFVLQVYRDFV---ATGDKNFARAVWPSVYIAMAYMEQFDKDGDGM 634
             + + + WKDL S +VLQV+R +    +  D  F    WP+   A+ Y++QFD + DG+
Sbjct: 538 TAYQDCNLWKDLASDYVLQVWRTYSLAPSGPDIRFLADCWPAAVTALDYLKQFDVNDDGL 597

Query: 635 IENEGFPDQTYDAWSANGVSAYCGGLWVAALQAASA--------LANDVGDHASASYFWV 686
            +N G PDQT+D W   GVSAYCG LW+AAL+AA A        L  D  +       W+
Sbjct: 598 PDNGGAPDQTFDDWPLQGVSAYCGALWIAALEAALAIGQRLQLELGLDTAEEQRRFGGWL 657

Query: 687 RYQKAKAVYDS-LWNGSYFNYDNSDGSSSTSIQADQLAGQWYARACGLLPIADEAKVKKA 745
             +++++ +D+ LWNG Y+  D   GS +  + ADQL G +YAR   L P+  + +   +
Sbjct: 658 --EQSRSHFDTLLWNGEYYAIDA--GSGTPVVMADQLCGDFYARLLALPPVVADERALSS 713

Query: 746 LTKIYDFNVLKVKGGMCGAMNGMQPDGR-IDMSGLQAREIWPGVTYGLAASMIQEEMVDM 804
           L  I        +GG  G  NG++ DG  +D  G    E+W G+ +GLAA          
Sbjct: 714 LKAIRQACFESFEGGRLGVANGLRRDGTPLDPEGTHPLEVWTGINFGLAAYYRLMGQSST 773

Query: 805 AFQTAAGVYEVAWSEDGLGYSFQTPESWNNNDEYRSLCYMRPLTIWAMQWA 855
           AF     V    + E GL   F+TPE+      +R+  Y+R + IWA+ WA
Sbjct: 774 AFAITGAVVRQVY-EGGL--QFRTPEAITAAGTFRACHYLRAMAIWAL-WA 820


>gi|198473190|ref|XP_002133205.1| GA28784 [Drosophila pseudoobscura pseudoobscura]
 gi|198139343|gb|EDY70607.1| GA28784 [Drosophila pseudoobscura pseudoobscura]
          Length = 865

 Score =  299 bits (765), Expect = 5e-78,   Method: Compositional matrix adjust.
 Identities = 174/551 (31%), Positives = 279/551 (50%), Gaps = 67/551 (12%)

Query: 103 YRGEFQRFKLFHGICDDAPVLANQFSVFVSRPNGEKFSSVLCPRSPGVPKKNTDSG---- 158
           + GEF RF++  GI +   V ANQF V +    G      L  +   + + N  S     
Sbjct: 48  FAGEFCRFQMRPGIYEYNVVQANQFIVTIKDQKGCTIFQSLLSKCSTMSRGNNGSDPDAD 107

Query: 159 ------------------IESWDWNLKGENCTYHALFPRAWTVYDGEPDPELRIVCRQIS 200
                             + +W  N++   C+Y  L+PR+WT YD      +R+VCRQ+S
Sbjct: 108 GEKTKCQLPNCSSRPKQPLSAWHSNIEDARCSYTGLYPRSWTEYDLS-HYGVRLVCRQVS 166

Query: 201 PFIPHNYKESSFPASVFTFTLSNSGQTSADVTLLFTWANSVAGDSGLSGHHFNSKTMTKD 260
           P IPH Y++SS P +VF +++ N       V++ F++ N        +     S+ ++  
Sbjct: 167 PVIPHEYRDSSLPCAVFVWSVENVCDQERIVSITFSFKNGTGNKKQDAEGGAESQLISAG 226

Query: 261 GVHGLTLHHRTANGRPPVTFAVAAEETADVHVSECPCFLLSGNSKGITAKDMWNEIKKHG 320
              G+ +  + A+   P ++ +A     ++ ++ CP F  +G     + + +W ++K+HG
Sbjct: 227 NAKGVAIRQKIADM--PCSYNLACRVLPEISITRCPQFDPAG-----SGEQLWAQLKEHG 279

Query: 321 SFDHLDNDKTSPSEPGSSIGAAIAASLTIPSGSTRSVTFSLAWDCPEVKFFEKV--YHRR 378
               L    T  +     IG A+ A L +   ++  + F LAWD P ++F  K+  + R 
Sbjct: 280 ---QLSEQPTGETLKTKDIGVAVCAQLALKPQASHDLEFVLAWDMPRIQFPRKMQTHMRY 336

Query: 379 YTKFYGTLGDSAARIAHDAILEHAKWECEIEAWQRPILEDKRFPEWYPITLFNELYYLNA 438
           YTK++   G+S  +I   A+ ++  WE  I+AWQRPIL D   P+WY   +FN+LY+++ 
Sbjct: 337 YTKYFDDSGESGPKICEYALKQYPSWERLIDAWQRPILSDDTLPDWYKCAIFNQLYFISD 396

Query: 439 GGTIWTDGSPPMQSLATIRERKFSLDTPSKIHTPASSDTALGTRL-LENGEENIGQFLYL 497
           GGTIW                                D++LG  L  ++     G+F YL
Sbjct: 397 GGTIWL-----------------------------KCDSSLGKELAYDDPRLAYGRFGYL 427

Query: 498 EGAEYVMYNTYDVHFYSSFALVMLFPKLELSIQRDFAAAVMMHDPGTMKIMSDGKWVARK 557
           EG EY MYNTYDVHFY+S AL  L+P L++S+Q DF  A+      T K++ DGK + RK
Sbjct: 428 EGHEYRMYNTYDVHFYASPALAHLWPNLQVSLQYDFKDAIAAELSDTRKMLYDGKVMPRK 487

Query: 558 CLGAVPHDIGLDD--PWFEINSYNLFNSSRWKDLNSKFVLQVYRDFVATGDKNFARAVWP 615
               VPHD+G  D  P+  IN YN+ + + WKDLN+KFVLQVYRD+    +   ++A   
Sbjct: 488 VKNCVPHDLGDPDEEPFTLINCYNIHDVNDWKDLNTKFVLQVYRDYYVLNELAQSQADNA 547

Query: 616 SVYIAMAYMEQ 626
           S + ++ ++++
Sbjct: 548 SKFSSIEFIDK 558



 Score =  210 bits (535), Expect = 3e-51,   Method: Compositional matrix adjust.
 Identities = 116/264 (43%), Positives = 158/264 (59%), Gaps = 12/264 (4%)

Query: 609 FARAVWPSVYIAMAYMEQFDKDGDGMIENEGFPDQTYDAWSANGVSAYCGGLWVAALQAA 668
           + +A++ +    M    ++DKD DG+IEN   PDQTYD+W  +G SAYC GLW+AALQA 
Sbjct: 604 YLKAMYGACKAIMERTIEYDKDNDGLIENTKMPDQTYDSWVMDGPSAYCSGLWLAALQAM 663

Query: 669 SALANDVGDHASASYFWVRYQKAK-AVYDSLWNGSYFNYDNSDGSSSTSIQADQLAGQWY 727
           SA+A  +        F    +K K ++ + LWNGSY+ +D S  S   SI ADQL G WY
Sbjct: 664 SAMATILDQPNDCLRFQDILEKGKRSLEEKLWNGSYYRFDLSP-SHRDSIMADQLCGHWY 722

Query: 728 ARACGL-LPIADEAKVKKALTKIYDFNVLKVKGGMCGAMNGMQPD-------GRIDMSGL 779
            ++CG    I  +  V+ AL  IYD NV+    G  GA NG   +       G +D S +
Sbjct: 723 LKSCGFDYEIYPKENVRTALKTIYDNNVMGFHEGNIGAANGFIANAADPSKPGHVDNSNI 782

Query: 780 QAREIWPGVTYGLAASMIQEEMVDMAFQTAAGVYEVAWSEDGLGYSFQTPESWNNNDEYR 839
           QA E+WPGV Y LAA+MIQE M + AFQTA G+Y+     + +G +F+TPE+      YR
Sbjct: 783 QAEEVWPGVVYALAATMIQEGMFEEAFQTAGGMYKTL--SERIGMNFETPEALYGEKRYR 840

Query: 840 SLCYMRPLTIWAMQWALTKPKLSR 863
           S+ YMRPL+IW+MQ AL + +  R
Sbjct: 841 SIGYMRPLSIWSMQVALERRRALR 864


>gi|195164866|ref|XP_002023267.1| GL21266 [Drosophila persimilis]
 gi|194105352|gb|EDW27395.1| GL21266 [Drosophila persimilis]
          Length = 867

 Score =  299 bits (765), Expect = 5e-78,   Method: Compositional matrix adjust.
 Identities = 174/551 (31%), Positives = 279/551 (50%), Gaps = 67/551 (12%)

Query: 103 YRGEFQRFKLFHGICDDAPVLANQFSVFVSRPNGEKFSSVLCPRSPGVPKKNTDSG---- 158
           + GEF RF++  GI +   V ANQF V +    G      L  +   + + N  S     
Sbjct: 50  FAGEFCRFQMRPGIYEYNVVQANQFIVTIKDQKGCTIFQSLLSKCSTMSRGNNGSDPDAD 109

Query: 159 ------------------IESWDWNLKGENCTYHALFPRAWTVYDGEPDPELRIVCRQIS 200
                             + +W  N++   C+Y  L+PR+WT YD      +R+VCRQ+S
Sbjct: 110 GEKTKCQLPNCSSRPKQPLSAWHSNIEDARCSYTGLYPRSWTEYDLS-HYGVRLVCRQVS 168

Query: 201 PFIPHNYKESSFPASVFTFTLSNSGQTSADVTLLFTWANSVAGDSGLSGHHFNSKTMTKD 260
           P IPH Y++SS P +VF +++ N       V++ F++ N        +     S+ ++  
Sbjct: 169 PVIPHEYRDSSLPCAVFVWSVENVCDQERIVSITFSFKNGTGNKKQDAEGGAESQLISAG 228

Query: 261 GVHGLTLHHRTANGRPPVTFAVAAEETADVHVSECPCFLLSGNSKGITAKDMWNEIKKHG 320
              G+ +  + A+   P ++ +A     ++ ++ CP F  +G     + + +W ++K+HG
Sbjct: 229 NAKGVAIRQKIADM--PCSYNLACRVLPEISITRCPQFDPAG-----SGEQLWAQLKEHG 281

Query: 321 SFDHLDNDKTSPSEPGSSIGAAIAASLTIPSGSTRSVTFSLAWDCPEVKFFEKV--YHRR 378
               L    T  +     IG A+ A L +   ++  + F LAWD P ++F  K+  + R 
Sbjct: 282 ---QLSEQPTGETLKTKDIGVAVCAQLALKPQASHDLEFVLAWDMPRIQFPRKMQTHMRY 338

Query: 379 YTKFYGTLGDSAARIAHDAILEHAKWECEIEAWQRPILEDKRFPEWYPITLFNELYYLNA 438
           YTK++   G+S  +I   A+ ++  WE  I+AWQRPIL D   P+WY   +FN+LY+++ 
Sbjct: 339 YTKYFDDSGESGPKICEYALKQYPSWERLIDAWQRPILSDDTLPDWYKCAIFNQLYFISD 398

Query: 439 GGTIWTDGSPPMQSLATIRERKFSLDTPSKIHTPASSDTALGTRL-LENGEENIGQFLYL 497
           GGTIW                                D++LG  L  ++     G+F YL
Sbjct: 399 GGTIWL-----------------------------KCDSSLGKELAYDDPRLAYGRFGYL 429

Query: 498 EGAEYVMYNTYDVHFYSSFALVMLFPKLELSIQRDFAAAVMMHDPGTMKIMSDGKWVARK 557
           EG EY MYNTYDVHFY+S AL  L+P L++S+Q DF  A+      T K++ DGK + RK
Sbjct: 430 EGHEYRMYNTYDVHFYASPALAHLWPNLQVSLQYDFKDAIAAELSDTRKMLYDGKVMPRK 489

Query: 558 CLGAVPHDIGLDD--PWFEINSYNLFNSSRWKDLNSKFVLQVYRDFVATGDKNFARAVWP 615
               VPHD+G  D  P+  IN YN+ + + WKDLN+KFVLQVYRD+    +   ++A   
Sbjct: 490 VKNCVPHDLGDPDEEPFTLINCYNIHDVNDWKDLNTKFVLQVYRDYYVLNELAQSQADNA 549

Query: 616 SVYIAMAYMEQ 626
           S + ++ ++++
Sbjct: 550 SKFSSIEFIDK 560



 Score =  210 bits (535), Expect = 3e-51,   Method: Compositional matrix adjust.
 Identities = 116/264 (43%), Positives = 158/264 (59%), Gaps = 12/264 (4%)

Query: 609 FARAVWPSVYIAMAYMEQFDKDGDGMIENEGFPDQTYDAWSANGVSAYCGGLWVAALQAA 668
           + +A++ +    M    ++DKD DG+IEN   PDQTYD+W  +G SAYC GLW+AALQA 
Sbjct: 606 YLKAMYGACKAIMERTIEYDKDNDGLIENTKMPDQTYDSWVMDGPSAYCSGLWLAALQAM 665

Query: 669 SALANDVGDHASASYFWVRYQKAK-AVYDSLWNGSYFNYDNSDGSSSTSIQADQLAGQWY 727
           SA+A  +        F    +K K ++ + LWNGSY+ +D S  S   SI ADQL G WY
Sbjct: 666 SAMATILDQPNDCLRFQDILEKGKRSLEEKLWNGSYYRFDLSP-SHRDSIMADQLCGHWY 724

Query: 728 ARACGL-LPIADEAKVKKALTKIYDFNVLKVKGGMCGAMNGMQPD-------GRIDMSGL 779
            ++CG    I  +  V+ AL  IYD NV+    G  GA NG   +       G +D S +
Sbjct: 725 LKSCGFDYEIYPKENVRTALKTIYDNNVMGFHEGNIGAANGFIANAADPSKPGHVDNSNI 784

Query: 780 QAREIWPGVTYGLAASMIQEEMVDMAFQTAAGVYEVAWSEDGLGYSFQTPESWNNNDEYR 839
           QA E+WPGV Y LAA+MIQE M + AFQTA G+Y+     + +G +F+TPE+      YR
Sbjct: 785 QAEEVWPGVVYALAATMIQEGMFEEAFQTAGGMYKTL--SERIGMNFETPEALYGEKRYR 842

Query: 840 SLCYMRPLTIWAMQWALTKPKLSR 863
           S+ YMRPL+IW+MQ AL + +  R
Sbjct: 843 SIGYMRPLSIWSMQVALERRRALR 866


>gi|256082539|ref|XP_002577512.1| bile acid beta-glucosidase-related [Schistosoma mansoni]
          Length = 717

 Score =  297 bits (760), Expect = 2e-77,   Method: Compositional matrix adjust.
 Identities = 217/640 (33%), Positives = 319/640 (49%), Gaps = 60/640 (9%)

Query: 49  FRHLAHIGFRLYRYSKQEEAKGKVPVFNVFRKHHITSDQGIPLGGIGAGSIGRSYRGEFQ 108
           F  L  +  + + Y K+   + ++P  +           G+P+GGIG+GSIGR YRGEF 
Sbjct: 63  FSKLRAVFSKRFFYIKKRFIQHRLPFIDAVSHVPWRPIYGVPIGGIGSGSIGRGYRGEFC 122

Query: 109 RFKLFHGIC--DDAPVLANQFSVFVSRPNGEKFSSVLCP--RSPGVPKKNTDSGIESWDW 164
           R  L  G+   D  PV  +QF V V +     +  VL P  + P   KK     ++ W W
Sbjct: 123 RSSLIPGMYSYDVQPV--DQFIVTVRKNGVTVYHQVLSPLTKPPSNAKK-----LKKWKW 175

Query: 165 NLKGENCTYHALFPRAWTVYDGEPDPELRIVCRQISPFIPHNYKESSFPASVFTFTLSNS 224
               E   Y  L+PR+WTVY+  P+ +L +V +QISP IPH+Y+ +  P +VF + + + 
Sbjct: 176 GFDPEKGHYLGLYPRSWTVYEI-PELQLVLVYQQISPVIPHDYQTTCLPVAVFYWKVISW 234

Query: 225 GQTSADVTLLFTWANSVAGDSGLSGHHFNSKTMTKDGVHGLTLHHRTA-NGRPPVTFAVA 283
            + +  VT+ F+W          S     S+       +    H  TA N  P  +    
Sbjct: 235 NKENLKVTITFSWHGPNHRKRTKSSDFSKSEDGNNSSENTTNTHVSTAENAFPTCSQRST 294

Query: 284 AEETADVHVSECPCFLLSGNSKGITAKDMWNEIKKHGSFDHLDNDKTSPSEPGSSIGAAI 343
           A  T D          L  N +G   K    ++   G          S ++    +  A+
Sbjct: 295 AFSTGD---------FLKLNCQGKKLKTF-TDLSWLGWLVGYTVPLDSKAKKSPKLAIAV 344

Query: 344 AASLTIP--------SGSTRSVTFSLAWDCPEVKFF--EKVYHRRYTKFYGTLGDSAARI 393
           +A+  +P        + S   + F++ W  P VKF   + VY RRY +++     S A++
Sbjct: 345 SATTNVPRCNITDNLNPSQSELEFAITWHSPIVKFRTGDVVYTRRYVRWFPVDRISGAKL 404

Query: 394 --AHDAILEHAKWECEIEAWQRPILEDKRFPEWYPITLFNELYYLNAGGTIWTDGSPPMQ 451
             AH AI    +W  +IE WQ PIL ++  P WY   LFNELYYL+ GGT+W D   P+Q
Sbjct: 405 LLAH-AIDNWRQWVQKIEDWQNPILSNQSLPSWYKSALFNELYYLSDGGTVWLD---PIQ 460

Query: 452 SLATIRERKFSLDTPSKIHTPASSDTALGTRLLENGEENIGQFLYLEGAEYVMYNTYDVH 511
              T   +  S D  +++                     IG F YLEG EY MYNTYDVH
Sbjct: 461 VDLTEDMKVDSWDHRARL------------------SREIGLFGYLEGHEYRMYNTYDVH 502

Query: 512 FYSSFALVMLFPKLELSIQRDFAAAVMMHDPGTMKIMSDGKWVARKCLGAVPHDIG--LD 569
            Y+S+AL+ L+PK++L++  D A   +  D  ++  + +GK + R    AV HD G   D
Sbjct: 503 HYASWALIKLWPKIQLALNYDCADLTIAEDSTSVYYIGNGKALFRSSECAVVHDFGDPED 562

Query: 570 DPWFEINSYNLFNSSRWKDLNSKFVLQVYRDFVATGDKNFARAVWPSVY-IAMAYMEQFD 628
           +PW   N+Y +F +  WKDLNSKF+LQV+RD+  T D  +   + P V  I    +  +D
Sbjct: 563 EPWRCTNAYIMFPTDTWKDLNSKFILQVWRDWRITQDHQYLLYMLPIVLRILRKSLVAWD 622

Query: 629 KDGDGMIENEGFPDQTYDAWSANGVSAYCGGLWVAALQAA 668
            D DG+IEN GFPDQTYD W+A G++AY GG+W++ L A 
Sbjct: 623 SDDDGLIENSGFPDQTYDVWTAKGLTAYTGGMWLSCLYAT 662


>gi|353232146|emb|CCD79501.1| bile acid beta-glucosidase-related [Schistosoma mansoni]
          Length = 725

 Score =  295 bits (755), Expect = 9e-77,   Method: Compositional matrix adjust.
 Identities = 218/640 (34%), Positives = 321/640 (50%), Gaps = 52/640 (8%)

Query: 49  FRHLAHIGFRLYRYSKQEEAKGKVPVFNVFRKHHITSDQGIPLGGIGAGSIGRSYRGEFQ 108
           F  L  +  + + Y K+   + ++P  +           G+P+GGIG+GSIGR YRGEF 
Sbjct: 63  FSKLRAVFSKRFFYIKKRFIQHRLPFIDAVSHVPWRPIYGVPIGGIGSGSIGRGYRGEFC 122

Query: 109 RFKLFHGIC--DDAPVLANQFSVFVSRPNGEKFSSVLCP--RSPGVPKKNTDSGIESWDW 164
           R  L  G+   D  PV  +QF V V +     +  VL P  + P   KK     ++ W W
Sbjct: 123 RSSLIPGMYSYDVQPV--DQFIVTVRKNGVTVYHQVLSPLTKPPSNAKK-----LKKWKW 175

Query: 165 NLKGENCTYHALFPRAWTVYDGEPDPELRIVCRQISPFIPHNYKESSFPASVFTFTLSNS 224
               E   Y  L+PR+WTVY+  P+ +L +V +QISP IPH+Y+ +  P +VF + + + 
Sbjct: 176 GFDPEKGHYLGLYPRSWTVYEI-PELQLVLVYQQISPVIPHDYQTTCLPVAVFYWKVISW 234

Query: 225 GQTSADVTLLFTWANSVAGDSGLSGHHFNSKTMTKDGVHGLTLHHRTA-NGRPPVTFAVA 283
            + +  VT+ F+W          S     S+       +    H  TA N  P  +    
Sbjct: 235 NKENLKVTITFSWHGPNHRKRTKSSDFSKSEDGNNSSENTTNTHVSTAENAFPTCSQRST 294

Query: 284 AEETADVHVSECPCFLLSGNSKGITAKDMWNEIKKHGSFDHLDNDKTSPSEPGSSIGAAI 343
           A  T D          L  N +G   K    ++   G          S ++    +  A+
Sbjct: 295 AFSTGD---------FLKLNCQGKKLKTF-TDLSWLGWLVGYTVPLDSKAKKSPKLAIAV 344

Query: 344 AASLTIP--------SGSTRSVTFSLAWDCPEVKFF--EKVYHRRYTKFYGTLGDSAARI 393
           +A+  +P        + S   + F++ W  P VKF   + VY RRY +++     S A++
Sbjct: 345 SATTNVPRCNITDNLNPSQSELEFAITWHSPIVKFRTGDVVYTRRYVRWFPVDRISGAKL 404

Query: 394 --AHDAILEHAKWECEIEAWQRPILEDKRFPEWYPITLFNELYYLNAGGTIWTDGSPPMQ 451
             AH AI    +W  +IE WQ PIL ++  P WY   LFNELYYL+ GGT+W D   P+Q
Sbjct: 405 LLAH-AIDNWRQWVQKIEDWQNPILSNQSLPSWYKSALFNELYYLSDGGTVWLD---PIQ 460

Query: 452 SLATIRERKFSLDTPSKIHTPASSDTALGTRLLENGEENIGQFLYLEGAEYVMYNTYDVH 511
            +   +    +L    K+ +          RL       IG F YLEG EY MYNTYDVH
Sbjct: 461 -VDCFKSDLLNLTEDMKVDSWDHR-----ARL----SREIGLFGYLEGHEYRMYNTYDVH 510

Query: 512 FYSSFALVMLFPKLELSIQRDFAAAVMMHDPGTMKIMSDGKWVARKCLGAVPHDIG--LD 569
            Y+S+AL+ L+PK++L++  D A   +  D  ++  + +GK + R    AV HD G   D
Sbjct: 511 HYASWALIKLWPKIQLALNYDCADLTIAEDSTSVYYIGNGKALFRSSECAVVHDFGDPED 570

Query: 570 DPWFEINSYNLFNSSRWKDLNSKFVLQVYRDFVATGDKNFARAVWPSVY-IAMAYMEQFD 628
           +PW   N+Y +F +  WKDLNSKF+LQV+RD+  T D  +   + P V  I    +  +D
Sbjct: 571 EPWRCTNAYIMFPTDTWKDLNSKFILQVWRDWRITQDHQYLLYMLPIVLRILRKSLVAWD 630

Query: 629 KDGDGMIENEGFPDQTYDAWSANGVSAYCGGLWVAALQAA 668
            D DG+IEN GFPDQTYD W+A G++AY GG+W++ L A 
Sbjct: 631 SDDDGLIENSGFPDQTYDVWTAKGLTAYTGGMWLSCLYAT 670


>gi|357630375|gb|EHJ78536.1| hypothetical protein KGM_16417 [Danaus plexippus]
          Length = 912

 Score =  295 bits (754), Expect = 1e-76,   Method: Compositional matrix adjust.
 Identities = 188/522 (36%), Positives = 263/522 (50%), Gaps = 69/522 (13%)

Query: 103 YRGEFQRFKLFHGICDDAPVLANQFSVFVSRPNGEK-FSSVLCPRSPGVPKKNTDSGIES 161
           ++GEF RF+L+ G+ +   V   QF V +     E  F SVL   S   PKK     + S
Sbjct: 51  FKGEFCRFQLYPGLYEYVTVPECQFIVNIRDAKKETIFQSVLSTYSK--PKK----VLPS 104

Query: 162 WDWNLKGENCTYHALFPRAWTVYDGEPDPELRIVCRQISPFIPHNYKESSFPASVFTFTL 221
           W+WN+KG +C Y AL+PRAWT YD      ++++CRQISP IPHNYK+SS P +VF F+ 
Sbjct: 105 WEWNIKGADCEYTALYPRAWTTYD-LTKYGIKLICRQISPVIPHNYKDSSLPCAVFVFSA 163

Query: 222 SNSGQTSADVTLLFTWANSVAGDSGLSGHHFNSKTMTKDGVHGLTLHHRTANGRPPVTFA 281
            N      DV++ FTW   +       G   + +  + +   G+TL  + A+   P TF 
Sbjct: 164 KNISNEQRDVSITFTWTECLGESKKKGGCKLDLERYSAENTCGVTLEQKIADT--PCTFT 221

Query: 282 VAAEETADVHVSECPCFLLSGNSKGITAKDMWNEIKKHGSF-------------DHLDND 328
           +A  +  +  + E  C   S  + G     +W  +KKHG               D  D D
Sbjct: 222 IAKNKPDNETIHEGYCLWNSTKTSGY----VWECLKKHGRLEPDPSQTPPPPKVDKTDKD 277

Query: 329 KTSPSEPGSSI--GAAIAASLTI---PSGSTRSVTFSLAWDCPEVKFFE--KVYHRRYTK 381
                E    I    +IA S TI   P GS  S  F L WD P +K+ +  KV+ R YTK
Sbjct: 278 NKEDKEKAKKIYKNDSIALSSTISLDPRGSG-STEFCLVWDMPVIKYKKDSKVHKRYYTK 336

Query: 382 FYGTLGDSAARIAHDAILEHAKWECEIEAWQRPILEDKRFPEWYPITLFNELYYLNAGGT 441
           ++G+ G + A IA  A+  + +WE ++  WQ PIL +   P+W    L NELY++  GGT
Sbjct: 337 YFGSDGIAGASIASYALRSYRRWEKQLAEWQDPILNNSCIPDWLKSALMNELYFVADGGT 396

Query: 442 IWTDGSPPMQSLATIRERKFSLDTPSKIHTPASSDTALGTRLLENGEENIGQFLYLEGAE 501
           IW D                + D P         DT        +     G F YLEG E
Sbjct: 397 IWFD---------------VAEDYP---------DT--------DPRHEFGVFGYLEGHE 424

Query: 502 YVMYNTYDVHFYSSFALVMLFPKLELSIQRDFAAAVMMHDPGTMKIMSDGKWVARKCLGA 561
           Y MYNTYDVHFY+SFAL  L+P L++ +Q  F  ++ +        + DGK    K   +
Sbjct: 425 YRMYNTYDVHFYASFALAQLWPNLQVVLQYMFRESISVEINKLRPSLYDGKSCKFKTKDS 484

Query: 562 VPHDIGLDD--PWFEINSYNLFNSSRWKDLNSKFVLQVYRDF 601
           +PHD+G  D  P+  IN+YN+ + S W+DLN KF+LQV RD+
Sbjct: 485 IPHDLGEPDGVPFSHINAYNIHDVSEWRDLNLKFILQVMRDY 526



 Score =  216 bits (550), Expect = 4e-53,   Method: Compositional matrix adjust.
 Identities = 121/263 (46%), Positives = 159/263 (60%), Gaps = 9/263 (3%)

Query: 602 VATGDKNFARAVWPSVYIAMAYMEQFDKDGDGMIENEGFPDQTYDAWSANGVSAYCGGLW 661
           V    K +   ++PS    +     +D+DGDG+IEN GFPDQTYDAW   G SAYCGGLW
Sbjct: 635 VVWNSKQYLSDMYPSCVTLLRRGLDWDRDGDGLIENGGFPDQTYDAWVMTGPSAYCGGLW 694

Query: 662 VAALQAASALANDVGDHASASYFWVRYQKAKAVYD-SLWNGSYFNYDNSDGSSSTSIQAD 720
           VA++ A  A+A  +G       F    +KA+  Y+  LWNGSY+ +D    +S   + AD
Sbjct: 695 VASVSAVHAMAKILGFTDDEKEFSTLLEKARDSYERKLWNGSYYKFDTKPCNSEV-VMAD 753

Query: 721 QLAGQWYARACGLL-PIADEAKVKKALTKIYDFNVLKVKGGMCGAMNGMQ--PDGRIDMS 777
           QLAGQW+ RA G   P+  EA VKKAL  IY+ NV +   G  GA+NG    P   ID +
Sbjct: 754 QLAGQWFLRASGWTEPVFPEANVKKALHTIYENNVQRFLNGRMGAVNGFVRGPRPGIDTT 813

Query: 778 GLQAREIWPGVTYGLAASMIQEEMVDMAFQTAAGVYEVAWSEDGLGYSFQTPESWNNNDE 837
            +Q+ E+W GVTYGLAA MI E M + AF TA G+Y        +G +F+TPE+   N  
Sbjct: 814 AIQSEEVWTGVTYGLAALMIYEGMHEQAFSTAGGLYNTLMK---MGLAFETPEALYENGN 870

Query: 838 YRSLCYMRPLTIWAMQWA-LTKP 859
           +RS+ YMRPL+IW+M  A +TKP
Sbjct: 871 HRSVAYMRPLSIWSMYHAIITKP 893


>gi|124023625|ref|YP_001017932.1| bile acid beta-glucosidase [Prochlorococcus marinus str. MIT 9303]
 gi|123963911|gb|ABM78667.1| Predicted bile acid beta-glucosidase [Prochlorococcus marinus str.
           MIT 9303]
          Length = 837

 Score =  293 bits (751), Expect = 2e-76,   Method: Compositional matrix adjust.
 Identities = 244/820 (29%), Positives = 387/820 (47%), Gaps = 106/820 (12%)

Query: 87  QGIPLGGIGAGSIGRSYRGEFQRFKLFHGICDDAPVLANQFSVFVSRPNGEKFSSVLCPR 146
            G+PLGG GAG IGRS RG+F  + L  G      +   QF++F  + +  +  ++    
Sbjct: 55  HGMPLGGFGAGCIGRSPRGDFNLWNLDAGEHWQGSIPDCQFALFERQGDQVRAHALATAP 114

Query: 147 SPGVPKKNTDSGIESWDW----NLKGENCTYHALFPRAWTVYDGEPDPELRIVCRQISPF 202
                +   D  + SW W      +    +Y A +P +WT Y G    EL  VC   SP 
Sbjct: 115 QTDDSQPELDKPLSSWSWYPASTEQRTTGSYSARYPLSWTHYKGVFAAEL--VCEAFSPI 172

Query: 203 IPHNYKESSFPASVFTFTLSNSGQTSADVTLLFTWANSVA-------------GDSGLSG 249
           +P +Y+ +S+P +VF + L N    S +++LL +W N+                D G   
Sbjct: 173 LPGDYRRTSYPLAVFRWQLQNPTSQSLELSLLLSWRNTCGWFTNTDPSASVHFRDDGSPE 232

Query: 250 HHF----------NSKTMTKDGVHGLTLHHRTAN----GRPPVTFAVAAEETADVHVSEC 295
           H +           ++ + + G+ G+ +    A+    G+     AV  +    V V  C
Sbjct: 233 HSYVPAIGRGEGQCNRWIDQPGLIGVLMEGERADPLAEGQGQWCLAVP-DHLPGVEVMRC 291

Query: 296 PCFLLSGNSKGITAKDMWNEIKKHGSFDHLDNDKTSPSEPGSSIGAAIAASLTIPSGSTR 355
             +  SG+       ++W+     G+    +++ T  S  G    AA+A  +T+  G + 
Sbjct: 292 SRWDPSGD-----GSELWSSFASEGTIP--NSNDTHKSLAGEQTSAALAVKVTLAPGESL 344

Query: 356 SVTFSLAWDCPEVKFFEKVYH-RRYTKFYGTLGDSAARIAHDAILEHAKWECEIEAWQRP 414
            +   ++WD P   F   V   RRYT F+G+ G +AA +A +A+ +   W  +I+AWQ P
Sbjct: 345 EIPVVISWDLPVTAFATGVRDLRRYTDFHGSDGQAAAALAAEALRDWPDWREQIDAWQAP 404

Query: 415 ILEDKRFPEWYPITLFNELYYLNAGGTIWTDGSPPMQSLATIRERKFSLDTPSKIHTPAS 474
           +L  +  PE   + LFNELY L +GG++WT   P                          
Sbjct: 405 VLAREDLPERLRMALFNELYDLASGGSLWTAARP-------------------------- 438

Query: 475 SDTALGTRLLENGEENIGQFLYLEGAEYVMYNTYDVHFYSSFALVMLFPKLELSIQRDFA 534
                         + +G+F  LE  +Y  Y + DV  Y S AL+ L+P+L+ ++ R FA
Sbjct: 439 -------------GDPVGRFGVLECFDYAWYESLDVRLYGSLALLQLWPELDKAVLRSFA 485

Query: 535 AAVMMHDPGTMKI---MSDGKW---VARKCLGAVPHDIGL-DDPWFEINSYNLF-NSSRW 586
            A+   D     I    + G+      RK   A PHD+G  ++  F+  +Y  + + + W
Sbjct: 486 RAIPAADATPRPIGWYFTQGRGRVEAPRKRAAATPHDLGAPNESPFDATNYTAYQDCNLW 545

Query: 587 KDLNSKFVLQVYRDFVATG---DKNFARAVWPSVYIAMAYMEQFDKDGDGMIENEGFPDQ 643
           KDL S +VLQV+R F+ +    D +F    WP+   A++Y+++FD + DG+ +N G PDQ
Sbjct: 546 KDLASDYVLQVWRTFLLSPNGEDLSFLAECWPAAVQALSYLKRFDVNHDGLPDNGGAPDQ 605

Query: 644 TYDAWSANGVSAYCGGLWVAALQAASA----LANDVGDHASAS--YFWVRYQKAKAVYDS 697
           T+D W   GVSAYCG LW+AAL+AA A    L  D+G + +     F    ++++A +D 
Sbjct: 606 TFDDWPLQGVSAYCGALWIAALEAALAMAQRLQLDLGLNTAEEQHQFSGWLEQSRANFDR 665

Query: 698 -LWNGSYFNYDNSDGSSSTSIQADQLAGQWYARACGLLPIADEAKVKKALTKIYDFNVLK 756
            LWNG Y+  D   G+    + ADQL G +YAR  GL  +  + + + +L  + +     
Sbjct: 666 LLWNGEYYKIDAESGTP--VVMADQLCGDFYARLLGLPSVVADERSRSSLNAVKEACFEG 723

Query: 757 VKGGMCGAMNGMQPDGR-IDMSGLQAREIWPGVTYGLAASMIQEEMVDMAFQTAAGVYEV 815
            +GG  G  NG+  DG  +D  G    E+W G+ +GLAA      ++  A    A    V
Sbjct: 724 FEGGRLGVANGLCRDGMPLDPKGTHPLEVWTGINFGLAAYY---RLMGDATTATAICSAV 780

Query: 816 AWSEDGLGYSFQTPESWNNNDEYRSLCYMRPLTIWAMQWA 855
                G G  F+TPE+      YR+  Y+R + IWA+ WA
Sbjct: 781 VNQVYGGGLQFRTPEAITAVKTYRACHYLRAMAIWAL-WA 819


>gi|148670515|gb|EDL02462.1| glucosidase beta 2, isoform CRA_b [Mus musculus]
          Length = 377

 Score =  293 bits (750), Expect = 3e-76,   Method: Compositional matrix adjust.
 Identities = 150/310 (48%), Positives = 190/310 (61%), Gaps = 7/310 (2%)

Query: 499 GAEYVMYNTYDVHFYSSFALVMLFPKLELSIQRDFAAAVMMHDPGTMKIMSDGKWVARKC 558
           G EY MYNTYDVHFY+SFALVML+PKLELS+Q D A A +  D    + +  G     K 
Sbjct: 19  GQEYRMYNTYDVHFYASFALVMLWPKLELSLQYDMALATLKEDLTRRRYLMSGVVAPVKR 78

Query: 559 LGAVPHDIGL--DDPWFEINSYNLFNSSRWKDLNSKFVLQVYRDFVATGDKNFARAVWPS 616
              +PHDIG   D+PW  +N+Y + +++ WKDLN KFVLQ+YRD+  TGD+ F   +WP 
Sbjct: 79  RNVIPHDIGDPDDEPWLRVNAYLIHDTADWKDLNLKFVLQIYRDYYLTGDQGFLEDMWPV 138

Query: 617 VYIAMAYMEQFDKDGDGMIENEGFPDQTYDAWSANGVSAYCGGLWVAALQAASALANDVG 676
               M    +FDKD DG+IEN G+ DQTYDAW   G SAYCGGLW+AA+     +A   G
Sbjct: 139 CLAVMESEMKFDKDQDGLIENGGYADQTYDAWVTTGPSAYCGGLWLAAVAVMVQMAVLCG 198

Query: 677 DHASASYFWVRYQKAKAVYDS-LWNGSYFNYDNSDGSSSTSIQADQLAGQWYARACGL-- 733
                  F     + +  Y+  LWNG Y+NYD+S    S SI +DQ AGQW+ RACGL  
Sbjct: 199 AQDVQERFASILCRGREAYERLLWNGRYYNYDSSSHPQSRSIMSDQCAGQWFLRACGLGE 258

Query: 734 --LPIADEAKVKKALTKIYDFNVLKVKGGMCGAMNGMQPDGRIDMSGLQAREIWPGVTYG 791
               +     V +AL  I++ NV    GG  GA+NGM P G  D S +Q+ E+W GV YG
Sbjct: 259 GDTEVFPTLHVVRALQTIFELNVQAFAGGAMGAVNGMHPHGVPDRSSVQSDEVWVGVVYG 318

Query: 792 LAASMIQEEM 801
           LAA+MIQE M
Sbjct: 319 LAATMIQEVM 328


>gi|33862646|ref|NP_894206.1| hypothetical protein PMT0373 [Prochlorococcus marinus str. MIT
           9313]
 gi|33634562|emb|CAE20548.1| conserved hypothetical protein [Prochlorococcus marinus str. MIT
           9313]
          Length = 837

 Score =  292 bits (748), Expect = 5e-76,   Method: Compositional matrix adjust.
 Identities = 239/824 (29%), Positives = 382/824 (46%), Gaps = 114/824 (13%)

Query: 87  QGIPLGGIGAGSIGRSYRGEFQRFKLFHGICDDAPVLANQFSVFVSRPNGEKFSSVLCPR 146
            G+PLGG GAG IGRS RG+F  + L  G      +   QF++F  + +  +  ++    
Sbjct: 55  HGMPLGGFGAGCIGRSPRGDFNLWNLDGGEHWQGSIPDCQFALFERQGDQVRAHALATAP 114

Query: 147 SPGVPKKNTDSGIESWDW----NLKGENCTYHALFPRAWTVYDGEPDPELRIVCRQISPF 202
                +   D  + SW W      + +  +Y A +P +WT Y G    EL  VC   SP 
Sbjct: 115 QTDDSQPELDKPLSSWSWYPASTEQRKTGSYSARYPLSWTHYKGVFAAEL--VCEAFSPI 172

Query: 203 IPHNYKESSFPASVFTFTLSNSGQTSADVTLLFTWANSVA-------------GDSGLSG 249
           +P +Y+ +S+P +VF + L N    S +++LL +W N+                D G   
Sbjct: 173 LPGDYRRTSYPLAVFRWQLQNPTSQSLELSLLLSWRNTCGWFTNTDPSASVHFRDDGSPE 232

Query: 250 HHF----------NSKTMTKDGVHGLTLHHRTAN----GRPPVTFAVAAEETADVHVSEC 295
           H +           ++ + + G+ G+ +    A+    G+     AV  +    V V  C
Sbjct: 233 HSYVPAIGRGEGQCNRWIDQPGLIGVLMEGERADPLAEGQGQWCLAVP-DHLPGVEVMRC 291

Query: 296 PCFLLSGNSKGITAKDMWNEIKKHGSFDHLDNDKTSPSEPGSSIGAAIAASLTIPSGSTR 355
             +  SG+       ++W+     G+    +++    S  G    AA+A  +T+  G + 
Sbjct: 292 SRWDPSGD-----GSELWSSFASEGTIP--NSNTAHKSLAGEQTSAALAVKVTLAPGESL 344

Query: 356 SVTFSLAWDCPEVKFFEKVYH-RRYTKFYGTLGDSAARIAHDAILEHAKWECEIEAWQRP 414
            +   ++WD P   F   V   RRYT F+G+ G +AA +A +A+ + + W  +I+AWQ P
Sbjct: 345 EIPVVISWDLPVTAFATGVRDLRRYTDFHGSDGQAAAALAAEALSDWSDWREQIDAWQAP 404

Query: 415 ILEDKRFPEWYPITLFNELYYLNAGGTIWTDGSPPMQSLATIRERKFSLDTPSKIHTPAS 474
           +L  +  PE   + LFNELY L +GG++WT   P                          
Sbjct: 405 VLAREDLPERLRMALFNELYDLASGGSLWTAARP-------------------------- 438

Query: 475 SDTALGTRLLENGEENIGQFLYLEGAEYVMYNTYDVHFYSSFALVMLFPKLELSIQRDFA 534
                         + +G+F  LE  +Y  Y + DV  Y S AL+ L+P+L+ ++ R FA
Sbjct: 439 -------------GDPVGRFGVLECFDYAWYESLDVRLYGSLALLQLWPELDKAVLRSFA 485

Query: 535 AAVMMHDPGTMKI---MSDGKW---VARKCLGAVPHDIGL-DDPWFEINSYNLF-NSSRW 586
            A+   D     I    + G+      RK   A PHD+G  ++  F+  +Y  + + + W
Sbjct: 486 RAIPAADATPRPIGWYFTQGRGRVEAPRKRAAATPHDLGAPNESPFDATNYTAYQDCNLW 545

Query: 587 KDLNSKFVLQVYRDFVATG---DKNFARAVWPSVYIAMAYMEQFDKDGDGMIENEGFPDQ 643
           KDL S +VLQV+R F+ +    D +F    WP+V  A++Y+++FD + DG+ +N G PDQ
Sbjct: 546 KDLASDYVLQVWRTFLLSPNGEDLSFLAECWPAVVQALSYLKRFDVNHDGLPDNGGAPDQ 605

Query: 644 TYDAWSANGVSAYCGGLWVAA----------LQAASALANDVGDHASASYFWVRYQKAKA 693
           T+D W   GVSAYCG LW+AA          LQ    L      H  + +     ++++ 
Sbjct: 606 TFDDWPLQGVSAYCGALWIAALEAALAMAQRLQLDLGLNTAEEQHQFSGWL----EQSRV 661

Query: 694 VYDS-LWNGSYFNYDNSDGSSSTSIQADQLAGQWYARACGLLPIADEAKVKKALTKIYDF 752
            +D  LWNG Y+  D   G+    + ADQL G +YAR  GL  +  + + + +L  + + 
Sbjct: 662 NFDRLLWNGEYYKIDAESGTP--VVMADQLCGDFYARLLGLPSVVADERSRSSLNAVKEA 719

Query: 753 NVLKVKGGMCGAMNGMQPDGR-IDMSGLQAREIWPGVTYGLAASMIQEEMVDMAFQTAAG 811
                +GG  G  NG+  DG  +D  G    E+W G+ +GLAA      ++  A    A 
Sbjct: 720 CFEGFEGGRLGVANGLCRDGMPLDPKGTHPLEVWTGINFGLAAYY---RLMGDATTATAI 776

Query: 812 VYEVAWSEDGLGYSFQTPESWNNNDEYRSLCYMRPLTIWAMQWA 855
              V     G G  F+TPE+      YR+  Y+R + IWA+ WA
Sbjct: 777 CSAVVNQVYGGGLQFRTPEAITAVKTYRACHYLRAMAIWAL-WA 819


>gi|255073149|ref|XP_002500249.1| hypothetical protein MICPUN_55788 [Micromonas sp. RCC299]
 gi|226515511|gb|ACO61507.1| hypothetical protein MICPUN_55788 [Micromonas sp. RCC299]
          Length = 954

 Score =  288 bits (736), Expect = 1e-74,   Method: Compositional matrix adjust.
 Identities = 257/881 (29%), Positives = 387/881 (43%), Gaps = 122/881 (13%)

Query: 88  GIPLGGIGAGSIGRSYRGEFQRFKLFHGICDDAPVLANQFSVFVSRPNGEKFSSVLCPRS 147
           G P+GG+GAG++GR+YRG+F R+ +  G     P      ++F +R +G        P  
Sbjct: 63  GAPIGGMGAGTMGRTYRGDFARWHMRVGGHTHRPA---AHTLFAARVDGTATVLSALPSH 119

Query: 148 P---GVPKKN-TDSGIESWDWNLKGENCTYHALFPRAWTVYDGE---PDPELRIVCRQIS 200
               GVP    +D           G   TYHAL+PR+W  YD     P   +R+   Q S
Sbjct: 120 AVMEGVPGGGGSDRTHSRRVLPPDGTGGTYHALYPRSWYEYDPSALAPGGRVRLSQTQFS 179

Query: 201 PFIPHNYKESSFPASVFTFTLSNSGQTSADVTLLFTWANSVA-------------GDSGL 247
           P IP  Y+++S P  VF F  SN G   ADV L F++ N +A             G   L
Sbjct: 180 PVIPGEYRDASMPVGVFRFLASNEGDVPADVALAFSFENVLATTDVAPVDVGPPRGAWRL 239

Query: 248 SGHHFNSKTMTKDGVHGLTLHHRTAN----GRPPVT-----FAVAAEETADVHVSECPCF 298
           + H     T  + G     +H  T +     R P        AVA     ++        
Sbjct: 240 APHELTHSTFERSGGRVAGVHMHTDDVVYGTRSPSEPWHGGIAVAVGGNGELGGLNVTTL 299

Query: 299 LL--SGNSKGITAKDMWNEIKKHGSFDH------LDNDKTSPSEPGSSIGAAIAASLTIP 350
            L  +G++  I +K  W   ++ G F         ++D   P+ P S+   A++A+ T+ 
Sbjct: 300 PLYDAGDADSIESK-FWRPFERAGGFGRDESAAANEHDGGLPAGPLSAC--AVSAAFTLA 356

Query: 351 SGSTRSVTFSLAWDCPEVKFFEKV-YHRRYTKFYGTL--------GDSAARIAHDAILEH 401
            G  R+VTF+ AWD P   F   V + +RYT+++G+         G +A  IA +A+L  
Sbjct: 357 PGERRNVTFAAAWDLPASSFPRGVRWAKRYTRYHGSFIHRRDVPRGAAAPDIATEALLRA 416

Query: 402 AKWECEIEAWQRPILEDK----------------------------RFPEWYPITLFNEL 433
            +WE +I  WQR  ++                              R P W    +FNEL
Sbjct: 417 DEWEGDIARWQRGFVQSAAAKDANLDDPAGGRGRNYPGRVRDRPTLRRPAWLVGAMFNEL 476

Query: 434 YYLNAGGTIWTDGSPPMQSLATIRERKFSLDTPSKIHTPASSDTALGTRLLENGEE---N 490
           YYL  GGT+W            + E + S   P         +      +  NG E    
Sbjct: 477 YYLVDGGTLWGRPLTVAGLEGDVEESEASAAAPGGWRG-EDGEGGDAQDVAGNGGEWGGT 535

Query: 491 IGQFLYLEGAEYVMYNTYDVHFYSSFALVMLFPKLELSIQRDFAAAVMMHDPGTMKIMSD 550
           +G+F  LE  +  +YN  DVHFY S+ L ML+P L+L++  D AA V   D     +  +
Sbjct: 536 LGRFGLLESFDAPVYNALDVHFYGSWPLAMLWPGLDLAVLADLAAGVDSEDDEERALSWN 595

Query: 551 GKWVAR-------KCLGAVPHDIG--LDDPWFEI-NSYNLFNSSRWKDLNSKFVLQVYRD 600
            +   R       K  G+ P D+G   D P     N+++L + +RWKDL  K +L + R 
Sbjct: 596 ARDAGRAPEMRKRKVRGSAPRDLGAPFDAPLTSSPNAFDLTDVNRWKDLAPKLMLTLARA 655

Query: 601 FVATGD-----------KNFARAVWPSVYIAMA-YMEQFDKDGDGMIE----NEGFPDQT 644
               G            ++  R ++   Y ++A  + +FD +GDG+IE    +E  PD +
Sbjct: 656 HALRGGFRGPNDPGGIPRDVLRRMFRPCYESLAAQLRRFDGNGDGLIEHDDDDEAGPDHS 715

Query: 645 YDAWSANG-VSAYCGGLWVAALQAASALANDVGD-HASASYFWVRYQKAKAVYDSLWNG- 701
           + AW A G  SAY GGLW+AAL+A + LA D+ + HA  S        A A   +LW G 
Sbjct: 716 FVAWRARGDTSAYAGGLWLAALRAGAGLARDLNETHARDSLEQTMRAAAVAHDAALWLGD 775

Query: 702 -----SYFNYDNSDGSSSTSIQADQLAGQWYARACGLLPIADEAKVKKALTKIYDFNVLK 756
                 Y+ +D S         A Q+ G+W     G   +    KV+ AL  +   NV  
Sbjct: 776 DDGGRGYYRFDASGTEGGDVSSAAQVMGEWALAVIGARGVLPARKVRAALASVARLNVAA 835

Query: 757 VKGGMCGAMNGMQPDGRIDMSGLQAREIWPGVTYGLAASMIQEEMVDMAFQTAAGVYEVA 816
             G            G +D     + E+WPGV+Y +A+ M+   M D A++ A GV    
Sbjct: 836 AGGAGGAVNGAAA-SGSVDAVSAHSNEVWPGVSYAVASHMLLAGMDDEAWELARGVAAGT 894

Query: 817 WSEDGLGYSFQTPESWNNNDEYRSLCYMRPLTIWAMQWALT 857
           +     G +F+TPE+W+    +RS    R  ++WA++ ALT
Sbjct: 895 YER---GLAFRTPEAWDARGGFRSAMSQRAGSVWAIEHALT 932


>gi|312382731|gb|EFR28085.1| hypothetical protein AND_04399 [Anopheles darlingi]
          Length = 777

 Score =  285 bits (730), Expect = 6e-74,   Method: Compositional matrix adjust.
 Identities = 154/445 (34%), Positives = 240/445 (53%), Gaps = 46/445 (10%)

Query: 161 SWDWNLKGENCTYHALFPRAWTVYDGEPDPELRIVCRQISPFIPHNYKESSFPASVFTFT 220
           SW+  L    C+Y AL+PRAW+ YD   +  +R+V RQISP IPH+YKESS P +VF +T
Sbjct: 37  SWESALDASRCSYTALYPRAWSEYDLS-EYGVRLVQRQISPIIPHDYKESSMPCAVFVWT 95

Query: 221 LSNSGQTSADVTLLFTWANSVAGDSGLSGHHFNSKTMTKDGVHGLTLHHRTANGRPPVTF 280
           + N       VT+ FT+ N        +     +   T+    G+++    A+     T+
Sbjct: 96  VENVCDRDRQVTITFTFKNGTGTKKQDAEGGSETGAFTQGSARGVSIRQTIAD--MACTY 153

Query: 281 AVAAEETADVHVSECPCFLLSGNSKGITAKDMWNEIKKHGSFDHLDNDKTSPSEPGSSIG 340
            ++   +++++++ C  F  SG     T + +WN++K+HG   HL       +     + 
Sbjct: 154 CLSCRSSSEINLTRCERFDPSG-----TGEKLWNDLKEHG---HLTEQSVDETVKTKDVA 205

Query: 341 AAIAASLTIPSGSTRSVTFSLAWDCPEVKFFE--KVYHRRYTKFYGTLGDSAARIAHDAI 398
            A++  + +  G+T  + F+L WD P+V F +  K YHR YTK++G  GD+   ++  A+
Sbjct: 206 VAVSGQILVQPGTTSELQFALVWDMPKVHFQKRGKEYHRYYTKYFGKTGDAGPTMSDYAL 265

Query: 399 LEHAKWECEIEAWQRPILEDKRFPEWYPITLFNELYYLNAGGTIWTDGSPPMQSLATIRE 458
             + KWE  I+ WQRPIL+D   P+WY   +FNELY++  GG++W           T+ +
Sbjct: 266 TNYGKWERLIDEWQRPILDDADLPDWYKSAIFNELYFIADGGSVWL----------TMED 315

Query: 459 RKFSLDTPSKIHTPASSDTALGTRLLENGEENIGQFLYLEGAEYVMYNTYDVHFYSSFAL 518
                D P   +                     G++ YLEG EY MY TYDVHFY+S AL
Sbjct: 316 SDLPYDDPRLAY---------------------GRYSYLEGHEYRMYTTYDVHFYASHAL 354

Query: 519 VMLFPKLELSIQRDFAAAVMMHDPGTMKIMSDGKWVARKCLGAVPHDIG--LDDPWFEIN 576
             L+P L++SIQ D+  ++        K + DGK + RK   +VPHD+G   ++P+  IN
Sbjct: 355 ASLWPNLQVSIQYDYKDSIGREISEGRKHLYDGKVIPRKIQNSVPHDLGDPAEEPFDLIN 414

Query: 577 SYNLFNSSRWKDLNSKFVLQVYRDF 601
           +Y + + S W+DLN KF+LQVYRD+
Sbjct: 415 AYPIHDVSEWRDLNIKFILQVYRDY 439



 Score =  208 bits (530), Expect = 1e-50,   Method: Compositional matrix adjust.
 Identities = 108/263 (41%), Positives = 161/263 (61%), Gaps = 11/263 (4%)

Query: 609 FARAVWPSVYIAMAYMEQFDKDGDGMIENEGFPDQTYDAWSANGVSAYCGGLWVAALQAA 668
           + +A++P+  + +    ++DKD DG+IEN   PDQTYD+W  +G SAYCGGLW+A+L   
Sbjct: 517 YLKAMYPACKLVLERSREWDKDDDGLIENSRSPDQTYDSWVMDGPSAYCGGLWLASLHCM 576

Query: 669 SALANDVGDHASASYFWVRYQKAKAVYDS-LWNGSYFNYDNSDGSSSTSIQADQLAGQWY 727
           + +A+ +     A+ +     K +  ++  LWNG+Y+ +D +  +S  SI +DQL G WY
Sbjct: 577 ATMASLLDQTEDAANYQAILDKGRRSFEEKLWNGTYYKFD-AQSASKNSIMSDQLCGHWY 635

Query: 728 ARACGL-LPIADEAKVKKALTKIYDFNVLKVKGGMCGAMNGMQP------DGRIDMSGLQ 780
            RACG    +  +  V+ A+  IY+ NV++  GG  GA+NG  P      +GR D+S +Q
Sbjct: 636 LRACGFDYDVFPKENVRLAMRTIYENNVMRFCGGNLGAVNGYVPNAQPNKEGRPDVSNIQ 695

Query: 781 AREIWPGVTYGLAASMIQEEMVDMAFQTAAGVYEVAWSEDGLGYSFQTPESWNNNDEYRS 840
             E+W GVTY LAA+MI E M + AFQTA G+Y      + +G +F+TPE+      YR+
Sbjct: 696 GEEVWTGVTYALAATMIHEGMFEEAFQTAGGLYRSL--SEKIGMNFETPEAVYAERHYRA 753

Query: 841 LCYMRPLTIWAMQWALTKPKLSR 863
           + YMRPL+IW+MQ A    KL R
Sbjct: 754 IGYMRPLSIWSMQTAWELRKLRR 776


>gi|345293667|gb|AEN83325.1| AT5G49900-like protein, partial [Capsella rubella]
 gi|345293669|gb|AEN83326.1| AT5G49900-like protein, partial [Capsella rubella]
 gi|345293671|gb|AEN83327.1| AT5G49900-like protein, partial [Capsella rubella]
 gi|345293673|gb|AEN83328.1| AT5G49900-like protein, partial [Capsella rubella]
 gi|345293675|gb|AEN83329.1| AT5G49900-like protein, partial [Capsella rubella]
          Length = 156

 Score =  279 bits (714), Expect = 5e-72,   Method: Compositional matrix adjust.
 Identities = 127/156 (81%), Positives = 141/156 (90%)

Query: 502 YVMYNTYDVHFYSSFALVMLFPKLELSIQRDFAAAVMMHDPGTMKIMSDGKWVARKCLGA 561
           Y M+NTYDVHFY+SFALVMLFPKLELSIQRDFAAAVM+HDP  +K +S+G+WV RK LGA
Sbjct: 1   YRMWNTYDVHFYASFALVMLFPKLELSIQRDFAAAVMLHDPTKVKTLSEGQWVQRKVLGA 60

Query: 562 VPHDIGLDDPWFEINSYNLFNSSRWKDLNSKFVLQVYRDFVATGDKNFARAVWPSVYIAM 621
           VPHD+G++DPWFE+N YNL N+ RWKDLN KFVLQVYRD VATGDK FA AVWPSVY+AM
Sbjct: 61  VPHDLGINDPWFEVNGYNLHNTDRWKDLNPKFVLQVYRDVVATGDKKFAVAVWPSVYVAM 120

Query: 622 AYMEQFDKDGDGMIENEGFPDQTYDAWSANGVSAYC 657
           AYM QFDKDGDGMIENEGFPDQTYD WSA+GVSAYC
Sbjct: 121 AYMAQFDKDGDGMIENEGFPDQTYDTWSASGVSAYC 156


>gi|345293661|gb|AEN83322.1| AT5G49900-like protein, partial [Capsella grandiflora]
 gi|345293663|gb|AEN83323.1| AT5G49900-like protein, partial [Capsella grandiflora]
 gi|345293665|gb|AEN83324.1| AT5G49900-like protein, partial [Capsella grandiflora]
          Length = 156

 Score =  277 bits (708), Expect = 2e-71,   Method: Compositional matrix adjust.
 Identities = 126/156 (80%), Positives = 141/156 (90%)

Query: 502 YVMYNTYDVHFYSSFALVMLFPKLELSIQRDFAAAVMMHDPGTMKIMSDGKWVARKCLGA 561
           Y M+NTYDVHFY+SFALVMLFPKLELSIQRDFAAAVM+HDP  +K +S+G+WV RK LGA
Sbjct: 1   YRMWNTYDVHFYASFALVMLFPKLELSIQRDFAAAVMLHDPTKVKTLSEGQWVQRKVLGA 60

Query: 562 VPHDIGLDDPWFEINSYNLFNSSRWKDLNSKFVLQVYRDFVATGDKNFARAVWPSVYIAM 621
           VPHD+G++DPWFE+N Y+L N+ RWKDLN KFVLQVYRD VATGDK FA AVWPSVY+AM
Sbjct: 61  VPHDLGINDPWFEVNGYSLHNTDRWKDLNPKFVLQVYRDVVATGDKKFAVAVWPSVYVAM 120

Query: 622 AYMEQFDKDGDGMIENEGFPDQTYDAWSANGVSAYC 657
           AYM QFDKDGDGMIENEGFPDQTYD WSA+GVSAYC
Sbjct: 121 AYMAQFDKDGDGMIENEGFPDQTYDTWSASGVSAYC 156


>gi|449674846|ref|XP_002159451.2| PREDICTED: non-lysosomal glucosylceramidase-like, partial [Hydra
           magnipapillata]
          Length = 751

 Score =  276 bits (707), Expect = 3e-71,   Method: Compositional matrix adjust.
 Identities = 167/446 (37%), Positives = 240/446 (53%), Gaps = 36/446 (8%)

Query: 262 VHGLTLHHRTANGRPPVTFAVAAEETADVHVSECPCFLLSGNSKGITAKDMWNEIKKHGS 321
           V G+ + H     R   T A+AA +     +                A ++W ++   G 
Sbjct: 336 VRGVLMKHNALKKR--FTLAIAALQVIHNTIDVVQVSTKVAFDAQSPAAELWEDLNDDGV 393

Query: 322 FDHLDNDKTSPSEPGSSIGAAIAASLTIPSGSTRSVTFSLAWDCPEVKFFEKV---YHRR 378
            D+   D +  S PG  + AA+AA  T+ +G+ + + F L+WD P V F       Y RR
Sbjct: 394 LDNKRGD-SCYSYPGQLLAAAVAAKTTVHAGAKQDLKFVLSWDQPFVTFGSDSGLSYCRR 452

Query: 379 YTKFYGTLGDSAARIAHDAILEHAKWECEIEAWQRPILEDKRFPEWYPITLFNELYYLNA 438
           YT+F+G+ G+++  +   A+     WE  IE WQ PIL D   P W+   LFNELY+++ 
Sbjct: 453 YTRFFGSNGNASPSLCCYALQNIDFWEDNIERWQNPILNDSELPAWFKSALFNELYFISD 512

Query: 439 GGTIWTDGSPPMQSLATIRERKFSLDTPSKIHTPASSDTALGTRLLENGEENIGQFLYLE 498
           GGT+W D          I+ +K S       H    SD            +  G+F YLE
Sbjct: 513 GGTVWLD----------IKRKKDS-------HGNFISDLV----------KLYGRFAYLE 545

Query: 499 GAEYVMYNTYDVHFYSSFALVMLFPKLELSIQRDFAAAVMMHDPGTMKIMSDGKWVARKC 558
           G EY MYNTYDVHFY+S+AL +L+P L++S+Q D   +V+  D     IM +G    RK 
Sbjct: 546 GHEYRMYNTYDVHFYASWALALLWPNLQISMQYDICNSVLNVDDEKRPIMMEGGIAPRKV 605

Query: 559 LGAVPHDIGL--DDPWFEINSYNLFNSSRWKDLNSKFVLQVYRDFVATGDKNFARAVWPS 616
           +G VPHD+G   D+P+  +N+Y + ++S+WKDLN  FVLQVYRDF  T +K F   +WP+
Sbjct: 606 IGCVPHDLGDPDDEPFVRVNAYCIHDTSQWKDLNLHFVLQVYRDFYITKNKQFLMDMWPA 665

Query: 617 VYIAMAYMEQFDKDGDGMIENEGFPDQTYDAWSANGVSAYCGGLWVAALQAASALANDVG 676
           +  AMA+    D DGDG+IEN G  D T+D W   G SAY GGLW++AL+    +A+ +G
Sbjct: 666 MKAAMAHSLAQDIDGDGIIENSGIADSTFDTWVVTGPSAYTGGLWLSALRCIIEIAHILG 725

Query: 677 DHASASYFWVRYQKAKAVYD-SLWNG 701
              S S +    ++ K  Y+  LWNG
Sbjct: 726 LSQSISKYKSILERGKKSYEKKLWNG 751



 Score =  146 bits (369), Expect = 5e-32,   Method: Compositional matrix adjust.
 Identities = 77/214 (35%), Positives = 118/214 (55%), Gaps = 3/214 (1%)

Query: 61  RYSKQEEAKGKVPVFNVFRKHHITSDQGIPLGGIGAGSIGRSYRGEFQRFKLFHGICDDA 120
           +++++ + KGK+P  + +         G+PLGGIGAGSIGR ++G+F R++L  GI    
Sbjct: 2   KFTRKVKKKGKLPFIDPYAVSACKQIYGVPLGGIGAGSIGRGWKGDFNRWQLRPGIYTYD 61

Query: 121 PVLANQFSVFVSRPNGEKFSSVLCPRSPGVPKKNTDSGIESWDWNLKGENCTYHALFPRA 180
            V  NQF+V + +     +  VL P  P V  KN +S ++SW W+  G    YHAL+PRA
Sbjct: 62  TVDVNQFTVCIRKCGKTIYQQVLAPYYP-VAIKNVES-LQSWTWSFDGGRGYYHALYPRA 119

Query: 181 WTVYDGEPDPELRIVCRQISPFIPHNYKESSFPASVFTFTLSNSGQTSADVTLLFTWANS 240
           WTVY    +  +++ CRQ+SP  P NYK++S P  VF +T+ N G    DV+++ T+ N 
Sbjct: 120 WTVYH-ITEHNIKLTCRQVSPIFPDNYKDTSLPVGVFVWTIENIGMHDLDVSIMLTFQNG 178

Query: 241 VAGDSGLSGHHFNSKTMTKDGVHGLTLHHRTANG 274
              +    G H N +   K     + L    +N 
Sbjct: 179 DGTNEDCVGGHNNEEFFCKGVYSEIRLQKDESNA 212


>gi|7470943|pir||T31071 conserved hypothetical protein - Anabaena sp. (fragment)
 gi|2828799|gb|AAC38131.1|AAC38131 unknown [Nostoc sp. PCC 7120]
          Length = 439

 Score =  266 bits (679), Expect = 6e-68,   Method: Compositional matrix adjust.
 Identities = 172/476 (36%), Positives = 243/476 (51%), Gaps = 67/476 (14%)

Query: 400 EHAKWECEIEAWQRPILEDKRFPEWYPITLFNELYYLNAGGTIWTDGSPPMQSLATIRER 459
           ++  W+ +IE+WQ PIL     P+W+ + LFNELY L +GGT+W+  +P           
Sbjct: 3   QYKTWQQQIESWQDPILNRPDLPDWFKMALFNELYDLTSGGTLWSAATPC---------- 52

Query: 460 KFSLDTPSKIHTPASSDTALGTRLLENGEENIGQFLYLEGAEYVMYNTYDVHFYSSFALV 519
                                        + IGQF  LE  +Y  Y + DV  Y SF L+
Sbjct: 53  -----------------------------DPIGQFAVLECLDYRWYESLDVRLYGSFGLL 83

Query: 520 MLFPKLELSIQRDFAAAVMMHD--PGTM----KIMSDGKWVARKCLGAVPHDIGLDDP-- 571
            LFP+LE ++ R FA A+   D  P  +     I ++     RK  GA PHD+G  +   
Sbjct: 84  QLFPELEKAVMRAFARAIPQGDDTPRVIGYYYTIGAESPIAVRKTPGATPHDLGAPNEHV 143

Query: 572 WFEINSYNLFNSSRWKDLNSKFVLQVYRDFVATG--DKNFARAVWPSVYIAMAYMEQFDK 629
           W + N  +  + + WKDL   FVLQVYRDF+ TG  D  F R  W ++   + Y++ FD 
Sbjct: 144 WEKTNYTSYQDCNLWKDLGCDFVLQVYRDFLLTGADDVQFLRDCWDAIVETLDYVKTFDL 203

Query: 630 DGDGMIENEGFPDQTYDAWSANGVSAYCGGLWVAALQAASALANDV--GDHASASYFWVR 687
           DGDG+ EN G PDQT+D W   GVSAYCGGLW+AAL AA A+++ +      SA+   + 
Sbjct: 204 DGDGIPENSGAPDQTFDDWRLQGVSAYCGGLWLAALAAAIAISDILLQNHQDSATKGHLL 263

Query: 688 YQKAK---------AVY-DSLWNGSYFNYDNSDGSSSTSIQADQLAGQWYARACGLLPIA 737
           YQK+           +Y + LWNG Y+  D+  GS    + ADQL GQ+YA    L  I 
Sbjct: 264 YQKSTYETWLTKSLPIYQEKLWNGKYYRLDSESGSD--VVMADQLCGQFYANLLELPDIV 321

Query: 738 DEAKVKKALTKIYDFNVLKVKGGMCGAMNGMQPDGRIDM-SGLQAREIWPGVTYGLAASM 796
              +   AL  +YD   LK   G  GA NG++PDG  +        E+W G+ +GLAA +
Sbjct: 322 PSDRAISALQTVYDACFLKFYDGQFGAANGVRPDGSPENPKATHPLEVWTGINFGLAAFL 381

Query: 797 IQEEMVDMAFQTAAGVYEVAWSEDGLGYSFQTPESWNNNDEYRSLCYMRPLTIWAM 852
           +Q  M D  F+    V E  ++    G  F+TPE+      +R+  Y+RP+ IWA+
Sbjct: 382 VQMGMKDEGFRLTQAVVEQIYNN---GLQFRTPEAITAAGTFRASTYLRPMAIWAI 434


>gi|12005896|gb|AAG44660.1|AF258662_1 AD035 [Homo sapiens]
          Length = 367

 Score =  245 bits (626), Expect = 8e-62,   Method: Compositional matrix adjust.
 Identities = 153/392 (39%), Positives = 213/392 (54%), Gaps = 43/392 (10%)

Query: 234 LFTWANSVAGDSGLSGHHFNSK-TMTKDG--VHGLTLHHRTANGRPPVTFAVAAEETADV 290
           +F+  N + G     G  +N    + + G  V GL LHH T     P T AVAA  TA  
Sbjct: 1   MFSMRNGLGGGDDAPGGLWNEPFCLERSGETVRGLLLHHPTLPN--PYTMAVAARVTAAT 58

Query: 291 HVSECPCFLLSGNSKGITAKDMWNEIKKHGSFDHLDNDKTSPSEPGSSIGAAIAASLTI- 349
            V+    F         T + +W ++ + G  D     +++P++ G  I  A+  S  + 
Sbjct: 59  TVTHITAF-----DPDSTGQQVWQDLLQDGQLDS-PTGQSTPTQKGVGIAGAVCVSSKLR 112

Query: 350 PSGSTRSVTFSLAWDCPEVKFFEK--VYHRRYTKFYGTLGDSAARIAHDAILEHAKWECE 407
           P G  R + FSLAWD P + F  K  V++RRYT+F+G  GD+A  ++H A+  +A+WE  
Sbjct: 113 PRGQCR-LEFSLAWDMPRIMFGAKGQVHYRRYTRFFGQDGDAAPALSHYALCRYAEWEER 171

Query: 408 IEAWQRPILEDKRFPEWYPITLFNELYYLNAGGTIWTDGSPPMQSLATIRERKFSLDTPS 467
           I AWQ P+L+D+  P WY   LFNELY+L  GGT+W         L  + +         
Sbjct: 172 ISAWQSPVLDDRSLPAWYKSALFNELYFLADGGTVW---------LEVLED--------- 213

Query: 468 KIHTPASSDTALGTRL--LENGEENIGQFLYLEGAEYVMYNTYDVHFYSSFALVMLFPKL 525
                 S    LG  +  L     + G+F YLEG EY MYNTYDVHFY+SFAL+ML+PKL
Sbjct: 214 ------SLPEELGRNMCHLRPTLRDYGRFGYLEGQEYRMYNTYDVHFYASFALIMLWPKL 267

Query: 526 ELSIQRDFAAAVMMHDPGTMKIMSDGKWVARKCLGAVPHDIG--LDDPWFEINSYNLFNS 583
           ELS+Q D A A +  D    + +  G     K    +PHDIG   D+PW  +N+Y + ++
Sbjct: 268 ELSLQYDMALATLREDLTRRRYLMSGVMAPVKRRNVIPHDIGDPDDEPWLRVNAYLIHDT 327

Query: 584 SRWKDLNSKFVLQVYRDFVATGDKNFARAVWP 615
           + WKDLN KFVLQVYRD+  TGD+NF + +WP
Sbjct: 328 ADWKDLNLKFVLQVYRDYYLTGDQNFLKDMWP 359


>gi|288901049|gb|ADC68237.1| hypothetical protein [Dendrobium nobile]
          Length = 207

 Score =  234 bits (597), Expect = 2e-58,   Method: Compositional matrix adjust.
 Identities = 120/201 (59%), Positives = 157/201 (78%), Gaps = 1/201 (0%)

Query: 606 DKNFARAVWPSVYIAMAYMEQFDKDGDGMIENEGFPDQTYDAWSANGVSAYCGGLWVAAL 665
           D  F R VWP+V  AM YM QFD+DGDG+IEN+GFPDQTYDAW+ +G+SAYCG LW+A+L
Sbjct: 6   DMAFGREVWPAVCAAMDYMNQFDRDGDGLIENDGFPDQTYDAWTVHGISAYCGCLWLASL 65

Query: 666 QAASALANDVGDHASASYFWVRYQKAKAVYDS-LWNGSYFNYDNSDGSSSTSIQADQLAG 724
           QAA+A+A+ +GDHA A    +++  AK  ++S LWNGSYFNYD+ + S+S SIQADQLAG
Sbjct: 66  QAAAAMAHRLGDHAYAEKCTIKFLTAKHAFESKLWNGSYFNYDSGNSSNSKSIQADQLAG 125

Query: 725 QWYARACGLLPIADEAKVKKALTKIYDFNVLKVKGGMCGAMNGMQPDGRIDMSGLQAREI 784
           QWYA + GL P+ DE K++  L KI+DFNV+KV+GG  GA+NGM P+G++D S +Q+REI
Sbjct: 126 QWYAASSGLPPLFDENKIRSTLQKIFDFNVMKVRGGRIGAVNGMHPNGKVDESCMQSREI 185

Query: 785 WPGVTYGLAASMIQEEMVDMA 805
           W GVTY L+A+MI   M + A
Sbjct: 186 WTGVTYALSATMILAGMEEQA 206


>gi|85857488|gb|ABC86280.1| RE08574p [Drosophila melanogaster]
          Length = 544

 Score =  227 bits (579), Expect = 2e-56,   Method: Compositional matrix adjust.
 Identities = 133/434 (30%), Positives = 211/434 (48%), Gaps = 64/434 (14%)

Query: 103 YRGEFQRFKLFHGICDDAPVLANQFSVFVSRPNGEKFSSVLCPRSPGVPKKNTDSG---- 158
           Y GEF RF++  GI +   VLANQF V +  P G      L  +     K +   G    
Sbjct: 132 YAGEFCRFQMRPGIYEYNVVLANQFIVTIKDPKGCTIFQSLLSKCSTRDKTSDPDGDPDG 191

Query: 159 -----------------IESWDWNLKGENCTYHALFPRAWTVYDGEPDPELRIVCRQISP 201
                            + +W  N++   C+Y  L+PR+WT YD      +R+ CRQ+SP
Sbjct: 192 ERTKCQLPNCSSRAKQPLSAWHSNIEDTRCSYTGLYPRSWTEYDLS-HYGVRLTCRQVSP 250

Query: 202 FIPHNYKESSFPASVFTFTLSNSGQTSADVTLLFTWANSVAGDSGLSGHHFNSKTMTKDG 261
            IPH Y+ESS P +VF +++ N       V++ FT+ N        +     S+ +++  
Sbjct: 251 VIPHEYRESSLPCAVFVWSVENVCDQERKVSITFTFKNGTGNKKQDAEGGAESQLISEGN 310

Query: 262 VHGLTLHHRTANGRPPVTFAVAAEETADVHVSECPCFLLSGNSKGITAKDMWNEIKKHGS 321
             G+++  + +    P ++ +A     ++ ++ CP F  +GN      + +W ++K+HG 
Sbjct: 311 AKGVSIRQKISEM--PCSYNLACRVLPEISITRCPQFDPAGN-----GEQLWAQLKEHG- 362

Query: 322 FDHLDNDKTSPSEPGSSIGAAIAASLTIPSGSTRSVTFSLAWDCPEVKFFEKV--YHRRY 379
              L    TS +     IG A+   + +   ++  + F LAWD P+++F  K+  + R Y
Sbjct: 363 --QLSEHPTSEALKTKDIGVAVCGQVALKPMASHDLEFVLAWDMPKIQFPRKMQTHTRYY 420

Query: 380 TKFYGTLGDSAARIAHDAILEHAKWECEIEAWQRPILEDKRFPEWYPITLFNELYYLNAG 439
           TK++   GDS  RI   A+ +++ WE  I+AWQRPIL D+  P+WY   +FN+LY+++ G
Sbjct: 421 TKYFDDSGDSGPRICEYALRQYSTWERLIDAWQRPILNDETLPDWYKCAIFNQLYFISDG 480

Query: 440 GTIWTDGSPPMQSLATIRERKFSLDTPSKIHTPASSDTALGTRL-LENGEENIGQFLYLE 498
           GTIW                                D++LG  L  ++     G+F YLE
Sbjct: 481 GTIWL-----------------------------KCDSSLGKELAYDDPRLAYGRFGYLE 511

Query: 499 GAEYVMYNTYDVHF 512
           G EY MYNTYDVHF
Sbjct: 512 GHEYRMYNTYDVHF 525


>gi|147788373|emb|CAN61188.1| hypothetical protein VITISV_019327 [Vitis vinifera]
          Length = 550

 Score =  221 bits (564), Expect = 1e-54,   Method: Compositional matrix adjust.
 Identities = 100/217 (46%), Positives = 150/217 (69%), Gaps = 3/217 (1%)

Query: 19  DPGQPPQLTWQRKLDSNVKVPTGFTLSFQEFRHLAHIGFRLYRYSKQEEAKGKVPVFNVF 78
           D   PP+  W+R+L+S+  +   F+++F E   +  +G RL+ Y ++E ++G+    + F
Sbjct: 31  DSAAPPEQAWRRRLNSHANILKEFSVTFTEAIKMIRLGIRLWSYIREEASQGRKAPIDPF 90

Query: 79  RKHHI--TSDQGIPLGGIGAGSIGRSYRGEFQRFKLFHGICDDAPVLANQFSVFVSRPNG 136
            +     ++ QG+PLGG+G+GSI R +RGEF+ +++  G CD +P++ANQFS+F+SR  G
Sbjct: 91  TRETCKPSASQGVPLGGMGSGSISRGFRGEFRHWQIVPGTCDASPIMANQFSIFISREGG 150

Query: 137 -EKFSSVLCPRSPGVPKKNTDSGIESWDWNLKGENCTYHALFPRAWTVYDGEPDPELRIV 195
            +K++SVL P       K+ D GI SW WNL G++ TYHALFPRAWT+YDGEPDPEL++ 
Sbjct: 151 NKKYASVLAPGQHEGLGKSGDQGISSWGWNLSGQHSTYHALFPRAWTIYDGEPDPELKVS 210

Query: 196 CRQISPFIPHNYKESSFPASVFTFTLSNSGQTSADVT 232
           CRQISPFIPHNY++SS P +VF +T   + + S+  T
Sbjct: 211 CRQISPFIPHNYRDSSLPTAVFVYTPLTTFEVSSKFT 247


>gi|444729905|gb|ELW70308.1| Non-lysosomal glucosylceramidase [Tupaia chinensis]
          Length = 279

 Score =  216 bits (549), Expect = 5e-53,   Method: Compositional matrix adjust.
 Identities = 115/240 (47%), Positives = 145/240 (60%), Gaps = 10/240 (4%)

Query: 621 MAYME---QFDKDGDGMIENEGFPDQTYDAWSANGVSAYCGGLWVAALQAASALANDVGD 677
           MA ME   +FDKD DG+IEN G+ DQTYD W   G SAYCGGLW+AA+     +A   G 
Sbjct: 1   MAVMESEMKFDKDHDGLIENGGYADQTYDGWVTTGPSAYCGGLWLAAVAVMVQMAALCGA 60

Query: 678 HASASYFWVRYQKAKAVYDSL-WNGSYFNYDNSDGSSSTSIQADQLAGQWYARACGL--- 733
                 F     + +  Y+ L WNG Y+NYD+S    S SI +DQ AGQW+ +ACGL   
Sbjct: 61  QDVQEKFTSILTRGQEAYERLLWNGRYYNYDSSSQPQSRSIMSDQCAGQWFLKACGLGEG 120

Query: 734 -LPIADEAKVKKALTKIYDFNVLKVKGGMCGAMNGMQPDGRIDMSGLQAREIWPGVTYGL 792
              +     V +AL  I++FNV    GG  GA+NGMQP G  D S +Q+ E+W GV YGL
Sbjct: 121 DTEVFPTPHVVRALQTIFEFNVQAFAGGAMGAVNGMQPHGVPDRSSVQSDEVWVGVVYGL 180

Query: 793 AASMIQEEMVDMAFQTAAGVYEVAWSEDGLGYSFQTPESWNNNDEYRSLCYMRPLTIWAM 852
           AA+MIQE +    F+TA G Y   W    LG +FQTPE++     +RSL YMRPL+IWAM
Sbjct: 181 AATMIQEGLTWEGFRTAEGCYRTVWER--LGLAFQTPEAYCQQRVFRSLAYMRPLSIWAM 238


>gi|288558766|gb|ADC53514.1| MIP16949p [Drosophila melanogaster]
          Length = 432

 Score =  213 bits (541), Expect = 5e-52,   Method: Compositional matrix adjust.
 Identities = 117/268 (43%), Positives = 160/268 (59%), Gaps = 20/268 (7%)

Query: 609 FARAVWPSVYIAMAYMEQFDKDGDGMIENEGFPDQTYDAWSANGVSAYCGGLWVAALQAA 668
           + +A++ S    M    ++DKD DG+IEN   PDQTYD+W  +G SAYC GLW+AALQA 
Sbjct: 171 YLKAMYASCKAIMERTIEYDKDNDGLIENTKMPDQTYDSWVMDGPSAYCSGLWLAALQAM 230

Query: 669 SALANDVGDHASASYFWVRYQK-----AKAVYDSLWNGSYFNYDNSDGSSSTSIQADQLA 723
           SA+A  + D  +     +RYQ       +++ + LWNGSY+ +D S     T I ADQL 
Sbjct: 231 SAMAT-ILDQPNDC---LRYQDILEKGKRSLEEKLWNGSYYRFDLSHSHRDT-IMADQLC 285

Query: 724 GQWYARACGL-LPIADEAKVKKALTKIYDFNVLKVKGGMCGAMNGMQPD-------GRID 775
           G WY ++CG    I  +  V+ AL +IYD NV+    G  GA NG   +       G +D
Sbjct: 286 GHWYLKSCGFDYEIYPKENVRTALKRIYDNNVMGFHEGNIGAANGFIANASEPTKPGHVD 345

Query: 776 MSGLQAREIWPGVTYGLAASMIQEEMVDMAFQTAAGVYEVAWSEDGLGYSFQTPESWNNN 835
            S +QA E+WPGV Y LAA+MIQE M + AFQTA G+Y+       +G +F+TPE+    
Sbjct: 346 NSNIQAEEVWPGVVYALAATMIQEGMFEEAFQTAGGMYKTL--SQRIGMNFETPEALYGE 403

Query: 836 DEYRSLCYMRPLTIWAMQWALTKPKLSR 863
             YRS+ YMRPL+IW+MQ AL + +  R
Sbjct: 404 KRYRSIGYMRPLSIWSMQVALERRRAQR 431



 Score =  107 bits (266), Expect = 4e-20,   Method: Compositional matrix adjust.
 Identities = 55/125 (44%), Positives = 81/125 (64%), Gaps = 2/125 (1%)

Query: 504 MYNTYDVHFYSSFALVMLFPKLELSIQRDFAAAVMMHDPGTMKIMSDGKWVARKCLGAVP 563
           MYNTYDVHFY+S AL  L+P L++S+Q DF  A+      T K++ DGK + RK    VP
Sbjct: 1   MYNTYDVHFYASPALAHLWPNLQVSLQYDFKDAIAAELNDTRKMLYDGKVMPRKVKNCVP 60

Query: 564 HDIGLDD--PWFEINSYNLFNSSRWKDLNSKFVLQVYRDFVATGDKNFARAVWPSVYIAM 621
           HD+G  D  P+  IN YN+ + + WKDLN+KFVLQVYRD+    +   A++   S + ++
Sbjct: 61  HDLGDPDEEPFTLINCYNIHDVNDWKDLNTKFVLQVYRDYYVLNELAQAQSDNASKFSSI 120

Query: 622 AYMEQ 626
            ++++
Sbjct: 121 EFIDK 125


>gi|345291087|gb|AEN82035.1| AT3G24180-like protein, partial [Capsella rubella]
 gi|345291089|gb|AEN82036.1| AT3G24180-like protein, partial [Capsella rubella]
 gi|345291091|gb|AEN82037.1| AT3G24180-like protein, partial [Capsella rubella]
 gi|345291093|gb|AEN82038.1| AT3G24180-like protein, partial [Capsella rubella]
 gi|345291095|gb|AEN82039.1| AT3G24180-like protein, partial [Capsella rubella]
 gi|345291097|gb|AEN82040.1| AT3G24180-like protein, partial [Capsella rubella]
 gi|345291099|gb|AEN82041.1| AT3G24180-like protein, partial [Capsella rubella]
 gi|345291101|gb|AEN82042.1| AT3G24180-like protein, partial [Capsella rubella]
          Length = 191

 Score =  213 bits (541), Expect = 6e-52,   Method: Compositional matrix adjust.
 Identities = 113/190 (59%), Positives = 142/190 (74%), Gaps = 1/190 (0%)

Query: 609 FARAVWPSVYIAMAYMEQFDKDGDGMIENEGFPDQTYDAWSANGVSAYCGGLWVAALQAA 668
           F   VWP+V  AM YMEQFD+D D +IEN+GFPDQTYD W+ +GVSAYCG LW+AALQAA
Sbjct: 1   FGIDVWPAVRAAMEYMEQFDRDNDDLIENDGFPDQTYDTWTVHGVSAYCGCLWLAALQAA 60

Query: 669 SALANDVGDHASASYFWVRYQKAKAVYDS-LWNGSYFNYDNSDGSSSTSIQADQLAGQWY 727
           +A+A  +GD   A     ++  AKA  +  LWNGSYFNYD+   S+S SIQ DQLAGQWY
Sbjct: 61  AAMALQIGDKFFAELCKNKFLNAKAALEKKLWNGSYFNYDSGASSNSKSIQTDQLAGQWY 120

Query: 728 ARACGLLPIADEAKVKKALTKIYDFNVLKVKGGMCGAMNGMQPDGRIDMSGLQAREIWPG 787
           A + GL PI +E+K+K  + KI+DFNV+K KGG  GA+NGM PDG++D + +Q+REIW G
Sbjct: 121 AASSGLPPIFEESKIKSTMQKIFDFNVMKTKGGKMGAVNGMHPDGKVDDTCMQSREIWTG 180

Query: 788 VTYGLAASMI 797
           VTY  AA+MI
Sbjct: 181 VTYAAAATMI 190


>gi|295829670|gb|ADG38504.1| AT3G24180-like protein [Capsella grandiflora]
 gi|295829672|gb|ADG38505.1| AT3G24180-like protein [Capsella grandiflora]
 gi|295829674|gb|ADG38506.1| AT3G24180-like protein [Capsella grandiflora]
 gi|295829676|gb|ADG38507.1| AT3G24180-like protein [Capsella grandiflora]
          Length = 190

 Score =  212 bits (540), Expect = 7e-52,   Method: Compositional matrix adjust.
 Identities = 113/190 (59%), Positives = 142/190 (74%), Gaps = 1/190 (0%)

Query: 609 FARAVWPSVYIAMAYMEQFDKDGDGMIENEGFPDQTYDAWSANGVSAYCGGLWVAALQAA 668
           F   VWP+V  AM YMEQFD+D D +IEN+GFPDQTYD W+ +GVSAYCG LW+AALQAA
Sbjct: 1   FGIDVWPAVRAAMEYMEQFDRDNDDLIENDGFPDQTYDTWTVHGVSAYCGCLWLAALQAA 60

Query: 669 SALANDVGDHASASYFWVRYQKAKAVYDS-LWNGSYFNYDNSDGSSSTSIQADQLAGQWY 727
           +A+A  +GD   A     ++  AKA  +  LWNGSYFNYD+   S+S SIQ DQLAGQWY
Sbjct: 61  AAMALQIGDKFFAELCKNKFLNAKAALEKKLWNGSYFNYDSGASSNSKSIQTDQLAGQWY 120

Query: 728 ARACGLLPIADEAKVKKALTKIYDFNVLKVKGGMCGAMNGMQPDGRIDMSGLQAREIWPG 787
           A + GL PI +E+K+K  + KI+DFNV+K KGG  GA+NGM PDG++D + +Q+REIW G
Sbjct: 121 AASSGLPPIFEESKIKSTMQKIFDFNVMKTKGGKMGAVNGMHPDGKVDDTCMQSREIWTG 180

Query: 788 VTYGLAASMI 797
           VTY  AA+MI
Sbjct: 181 VTYAAAATMI 190


>gi|295829678|gb|ADG38508.1| AT3G24180-like protein [Neslia paniculata]
          Length = 190

 Score =  212 bits (539), Expect = 8e-52,   Method: Compositional matrix adjust.
 Identities = 112/190 (58%), Positives = 142/190 (74%), Gaps = 1/190 (0%)

Query: 609 FARAVWPSVYIAMAYMEQFDKDGDGMIENEGFPDQTYDAWSANGVSAYCGGLWVAALQAA 668
           F   VWP+V  AM YMEQFD+D D +IEN+GFPDQTYD W+ +GVSAYCG LW+AALQAA
Sbjct: 1   FGIDVWPAVRAAMEYMEQFDRDNDDLIENDGFPDQTYDTWTVHGVSAYCGCLWLAALQAA 60

Query: 669 SALANDVGDHASASYFWVRYQKAKAVYDS-LWNGSYFNYDNSDGSSSTSIQADQLAGQWY 727
           +A+A  +GD   A     ++  AKA  +  LWNGSYFNYD+   S+S SIQ DQLAGQWY
Sbjct: 61  AAMALQIGDKFFAELCKNKFLNAKAALEKKLWNGSYFNYDSGASSNSKSIQTDQLAGQWY 120

Query: 728 ARACGLLPIADEAKVKKALTKIYDFNVLKVKGGMCGAMNGMQPDGRIDMSGLQAREIWPG 787
           A + GL PI +E+K++  + KI+DFNV+K KGG  GA+NGM PDG++D + +Q+REIW G
Sbjct: 121 AASSGLPPIFEESKIRSTMQKIFDFNVMKTKGGRMGAVNGMHPDGKVDETCMQSREIWTG 180

Query: 788 VTYGLAASMI 797
           VTY  AA+MI
Sbjct: 181 VTYAAAATMI 190


>gi|188570621|gb|ACD64346.1| hypothetical protein [Helianthus petiolaris]
 gi|188570623|gb|ACD64347.1| hypothetical protein [Helianthus petiolaris]
          Length = 210

 Score =  211 bits (538), Expect = 1e-51,   Method: Compositional matrix adjust.
 Identities = 118/210 (56%), Positives = 158/210 (75%), Gaps = 1/210 (0%)

Query: 651 NGVSAYCGGLWVAALQAASALANDVGDHASASYFWVRYQKAKAVYDS-LWNGSYFNYDNS 709
           +G+SAYCG LW+AALQAA+A+A ++ D  +A  +  ++ KAK+ ++S LWNGSYFNYD+ 
Sbjct: 1   HGISAYCGCLWLAALQAAAAMAVELNDKFAAQKYKKKFLKAKSSFESKLWNGSYFNYDSG 60

Query: 710 DGSSSTSIQADQLAGQWYARACGLLPIADEAKVKKALTKIYDFNVLKVKGGMCGAMNGMQ 769
             S+S SIQADQLAGQWY  + GL  +  + K++ +L KI+DFNV+KV+GG  GA+NGM 
Sbjct: 61  SSSNSKSIQADQLAGQWYTASAGLPNLFGDDKIQSSLQKIFDFNVMKVRGGRMGAVNGMH 120

Query: 770 PDGRIDMSGLQAREIWPGVTYGLAASMIQEEMVDMAFQTAAGVYEVAWSEDGLGYSFQTP 829
           P+G++D + +Q+RE+W GVTYG+AA+MI  EM D AF TA G++   WSE+G GY+FQTP
Sbjct: 121 PNGKVDETCMQSREVWTGVTYGVAATMIHAEMEDNAFTTAEGIFIAGWSEEGFGYAFQTP 180

Query: 830 ESWNNNDEYRSLCYMRPLTIWAMQWALTKP 859
           E W  +  YRSL YMRPL IW MQ AL KP
Sbjct: 181 EGWTMDGSYRSLVYMRPLAIWGMQQALEKP 210


>gi|188570649|gb|ACD64360.1| hypothetical protein [Helianthus petiolaris]
          Length = 210

 Score =  211 bits (537), Expect = 1e-51,   Method: Compositional matrix adjust.
 Identities = 118/210 (56%), Positives = 158/210 (75%), Gaps = 1/210 (0%)

Query: 651 NGVSAYCGGLWVAALQAASALANDVGDHASASYFWVRYQKAKAVYDS-LWNGSYFNYDNS 709
           +G+SAYCG LW+AALQAA+A+A ++ D  +A  +  ++ KAK+ ++S LWNGSYFNYD+ 
Sbjct: 1   HGISAYCGCLWLAALQAAAAMAVELNDKFAAQKYKKKFLKAKSSFESKLWNGSYFNYDSG 60

Query: 710 DGSSSTSIQADQLAGQWYARACGLLPIADEAKVKKALTKIYDFNVLKVKGGMCGAMNGMQ 769
             S+S SIQADQLAGQWY  + GL  + D+ K++ +L KI+DFNV+KV+GG  GA+NGM 
Sbjct: 61  SSSNSKSIQADQLAGQWYTASAGLPNLFDDDKIQSSLQKIFDFNVMKVRGGRMGAVNGMH 120

Query: 770 PDGRIDMSGLQAREIWPGVTYGLAASMIQEEMVDMAFQTAAGVYEVAWSEDGLGYSFQTP 829
           P+G++D + +Q+RE+W GVTYG+AA+MI   M D AF TA G++   WSE+G GY+FQTP
Sbjct: 121 PNGKVDETCMQSREVWTGVTYGVAATMIHAGMEDNAFTTAEGIFIAGWSEEGFGYAFQTP 180

Query: 830 ESWNNNDEYRSLCYMRPLTIWAMQWALTKP 859
           E W  +  YRSL YMRPL IW MQ AL KP
Sbjct: 181 EGWTMDGSYRSLVYMRPLAIWGMQQALEKP 210


>gi|292384326|gb|ADE21416.1| unknown [Helianthus neglectus]
          Length = 210

 Score =  210 bits (534), Expect = 3e-51,   Method: Compositional matrix adjust.
 Identities = 118/210 (56%), Positives = 157/210 (74%), Gaps = 1/210 (0%)

Query: 651 NGVSAYCGGLWVAALQAASALANDVGDHASASYFWVRYQKAKAVYDS-LWNGSYFNYDNS 709
           +G+SAYCG LW+AALQAA+A+A ++ D  +A  +  ++ KAK+ ++S LWNGSYFNYD+ 
Sbjct: 1   HGISAYCGCLWLAALQAAAAMAVELNDKFAAQKYKKKFLKAKSSFESKLWNGSYFNYDSG 60

Query: 710 DGSSSTSIQADQLAGQWYARACGLLPIADEAKVKKALTKIYDFNVLKVKGGMCGAMNGMQ 769
             S+S SIQADQLAGQWY  + GL  +  E K++ +L KI+DFNV+KV+GG  GA+NGM 
Sbjct: 61  SSSNSKSIQADQLAGQWYTASAGLPNLFGEDKIQSSLQKIFDFNVMKVRGGRMGAVNGMH 120

Query: 770 PDGRIDMSGLQAREIWPGVTYGLAASMIQEEMVDMAFQTAAGVYEVAWSEDGLGYSFQTP 829
           P+G++D + +Q+RE+W GVTYG+AA+MI   M D AF TA G++   WSE+G GY+FQTP
Sbjct: 121 PNGKVDETCMQSREVWTGVTYGVAATMIHAGMEDNAFTTAEGIFIAGWSEEGFGYAFQTP 180

Query: 830 ESWNNNDEYRSLCYMRPLTIWAMQWALTKP 859
           E W  +  YRSL YMRPL IW MQ AL KP
Sbjct: 181 EGWTMDGSYRSLVYMRPLAIWGMQQALEKP 210


>gi|188570631|gb|ACD64351.1| hypothetical protein [Helianthus petiolaris]
          Length = 210

 Score =  209 bits (531), Expect = 8e-51,   Method: Compositional matrix adjust.
 Identities = 117/210 (55%), Positives = 157/210 (74%), Gaps = 1/210 (0%)

Query: 651 NGVSAYCGGLWVAALQAASALANDVGDHASASYFWVRYQKAKAVYDS-LWNGSYFNYDNS 709
           +G+SAYCG LW+AALQAA+A+A ++ D  +A  +  ++ KAK+ ++S LWNGSYFNYD+ 
Sbjct: 1   HGISAYCGCLWLAALQAAAAMAVELNDKLAAQKYKKKFLKAKSSFESKLWNGSYFNYDSG 60

Query: 710 DGSSSTSIQADQLAGQWYARACGLLPIADEAKVKKALTKIYDFNVLKVKGGMCGAMNGMQ 769
             S+S SIQADQLAGQWY  + GL  +  + K++ +L KI+DFNV+KV+GG  GA+NGM 
Sbjct: 61  SSSNSKSIQADQLAGQWYTASAGLPNLFGDDKIQSSLQKIFDFNVMKVRGGRMGAVNGMH 120

Query: 770 PDGRIDMSGLQAREIWPGVTYGLAASMIQEEMVDMAFQTAAGVYEVAWSEDGLGYSFQTP 829
           P+G++D + +Q+RE+W GVTYG+AA+MI   M D AF TA G++   WSE+G GY+FQTP
Sbjct: 121 PNGKVDETCMQSREVWTGVTYGVAATMIHAGMEDNAFTTAEGIFIAGWSEEGFGYAFQTP 180

Query: 830 ESWNNNDEYRSLCYMRPLTIWAMQWALTKP 859
           E W  +  YRSL YMRPL IW MQ AL KP
Sbjct: 181 EGWTMDGSYRSLVYMRPLAIWGMQQALEKP 210


>gi|188570553|gb|ACD64312.1| hypothetical protein [Helianthus annuus]
          Length = 210

 Score =  208 bits (530), Expect = 1e-50,   Method: Compositional matrix adjust.
 Identities = 117/210 (55%), Positives = 157/210 (74%), Gaps = 1/210 (0%)

Query: 651 NGVSAYCGGLWVAALQAASALANDVGDHASASYFWVRYQKAKAVYDS-LWNGSYFNYDNS 709
           +G+SAYCG LW+AALQAA+A+A ++ D  +A  +  ++ KAK+ ++S LWNGSYFNYD+ 
Sbjct: 1   HGISAYCGCLWLAALQAATAMAVELNDKFAAQKYKKKFLKAKSSFESKLWNGSYFNYDSG 60

Query: 710 DGSSSTSIQADQLAGQWYARACGLLPIADEAKVKKALTKIYDFNVLKVKGGMCGAMNGMQ 769
             S+S SIQADQLAGQWY  + GL  +  + K++ +L KI+DFNV+KV+GG  GA+NGM 
Sbjct: 61  SSSNSKSIQADQLAGQWYTASAGLPNLFGDDKIQSSLQKIFDFNVMKVRGGRMGAVNGMH 120

Query: 770 PDGRIDMSGLQAREIWPGVTYGLAASMIQEEMVDMAFQTAAGVYEVAWSEDGLGYSFQTP 829
           P+G++D + +Q+RE+W GVTYG+AA+MI   M D AF TA G++   WSE+G GY+FQTP
Sbjct: 121 PNGKVDETCMQSREVWTGVTYGVAATMIHAGMEDNAFTTAEGIFIAGWSEEGFGYAFQTP 180

Query: 830 ESWNNNDEYRSLCYMRPLTIWAMQWALTKP 859
           E W  +  YRSL YMRPL IW MQ AL KP
Sbjct: 181 EGWTMDGSYRSLVYMRPLAIWGMQQALEKP 210


>gi|188570503|gb|ACD64287.1| hypothetical protein [Helianthus annuus]
 gi|188570505|gb|ACD64288.1| hypothetical protein [Helianthus annuus]
 gi|188570507|gb|ACD64289.1| hypothetical protein [Helianthus annuus]
 gi|188570509|gb|ACD64290.1| hypothetical protein [Helianthus annuus]
 gi|188570513|gb|ACD64292.1| hypothetical protein [Helianthus annuus]
 gi|188570515|gb|ACD64293.1| hypothetical protein [Helianthus annuus]
 gi|188570517|gb|ACD64294.1| hypothetical protein [Helianthus annuus]
 gi|188570519|gb|ACD64295.1| hypothetical protein [Helianthus annuus]
 gi|188570521|gb|ACD64296.1| hypothetical protein [Helianthus annuus]
 gi|188570523|gb|ACD64297.1| hypothetical protein [Helianthus annuus]
 gi|188570525|gb|ACD64298.1| hypothetical protein [Helianthus annuus]
 gi|188570527|gb|ACD64299.1| hypothetical protein [Helianthus annuus]
 gi|188570529|gb|ACD64300.1| hypothetical protein [Helianthus annuus]
 gi|188570533|gb|ACD64302.1| hypothetical protein [Helianthus annuus]
 gi|188570535|gb|ACD64303.1| hypothetical protein [Helianthus annuus]
 gi|188570537|gb|ACD64304.1| hypothetical protein [Helianthus annuus]
 gi|188570539|gb|ACD64305.1| hypothetical protein [Helianthus annuus]
 gi|188570541|gb|ACD64306.1| hypothetical protein [Helianthus annuus]
 gi|188570543|gb|ACD64307.1| hypothetical protein [Helianthus annuus]
 gi|188570545|gb|ACD64308.1| hypothetical protein [Helianthus annuus]
 gi|188570547|gb|ACD64309.1| hypothetical protein [Helianthus annuus]
 gi|188570549|gb|ACD64310.1| hypothetical protein [Helianthus annuus]
 gi|188570551|gb|ACD64311.1| hypothetical protein [Helianthus annuus]
 gi|188570555|gb|ACD64313.1| hypothetical protein [Helianthus annuus]
 gi|188570557|gb|ACD64314.1| hypothetical protein [Helianthus annuus]
 gi|188570559|gb|ACD64315.1| hypothetical protein [Helianthus annuus]
 gi|188570561|gb|ACD64316.1| hypothetical protein [Helianthus annuus]
 gi|188570563|gb|ACD64317.1| hypothetical protein [Helianthus annuus]
 gi|188570565|gb|ACD64318.1| hypothetical protein [Helianthus annuus]
 gi|188570567|gb|ACD64319.1| hypothetical protein [Helianthus annuus]
 gi|188570569|gb|ACD64320.1| hypothetical protein [Helianthus annuus]
 gi|188570571|gb|ACD64321.1| hypothetical protein [Helianthus annuus]
 gi|188570573|gb|ACD64322.1| hypothetical protein [Helianthus annuus]
 gi|188570575|gb|ACD64323.1| hypothetical protein [Helianthus annuus]
 gi|188570577|gb|ACD64324.1| hypothetical protein [Helianthus annuus]
 gi|188570579|gb|ACD64325.1| hypothetical protein [Helianthus annuus]
 gi|188570581|gb|ACD64326.1| hypothetical protein [Helianthus annuus]
 gi|188570583|gb|ACD64327.1| hypothetical protein [Helianthus annuus]
 gi|188570585|gb|ACD64328.1| hypothetical protein [Helianthus annuus]
 gi|188570587|gb|ACD64329.1| hypothetical protein [Helianthus annuus]
 gi|188570589|gb|ACD64330.1| hypothetical protein [Helianthus annuus]
 gi|188570591|gb|ACD64331.1| hypothetical protein [Helianthus annuus]
 gi|188570593|gb|ACD64332.1| hypothetical protein [Helianthus annuus]
 gi|188570595|gb|ACD64333.1| hypothetical protein [Helianthus annuus]
 gi|188570597|gb|ACD64334.1| hypothetical protein [Helianthus annuus]
 gi|188570599|gb|ACD64335.1| hypothetical protein [Helianthus annuus]
 gi|188570601|gb|ACD64336.1| hypothetical protein [Helianthus annuus]
 gi|188570603|gb|ACD64337.1| hypothetical protein [Helianthus petiolaris]
 gi|188570605|gb|ACD64338.1| hypothetical protein [Helianthus petiolaris]
 gi|188570607|gb|ACD64339.1| hypothetical protein [Helianthus petiolaris]
 gi|188570609|gb|ACD64340.1| hypothetical protein [Helianthus petiolaris]
 gi|188570611|gb|ACD64341.1| hypothetical protein [Helianthus petiolaris]
 gi|188570613|gb|ACD64342.1| hypothetical protein [Helianthus petiolaris]
 gi|188570615|gb|ACD64343.1| hypothetical protein [Helianthus petiolaris]
 gi|188570617|gb|ACD64344.1| hypothetical protein [Helianthus petiolaris]
 gi|188570619|gb|ACD64345.1| hypothetical protein [Helianthus petiolaris]
 gi|188570625|gb|ACD64348.1| hypothetical protein [Helianthus petiolaris]
 gi|188570627|gb|ACD64349.1| hypothetical protein [Helianthus petiolaris]
 gi|188570629|gb|ACD64350.1| hypothetical protein [Helianthus petiolaris]
 gi|188570633|gb|ACD64352.1| hypothetical protein [Helianthus petiolaris]
 gi|188570637|gb|ACD64354.1| hypothetical protein [Helianthus petiolaris]
 gi|188570641|gb|ACD64356.1| hypothetical protein [Helianthus petiolaris]
 gi|188570643|gb|ACD64357.1| hypothetical protein [Helianthus petiolaris]
 gi|188570645|gb|ACD64358.1| hypothetical protein [Helianthus petiolaris]
 gi|188570651|gb|ACD64361.1| hypothetical protein [Helianthus petiolaris]
 gi|188570653|gb|ACD64362.1| hypothetical protein [Helianthus petiolaris]
 gi|188570655|gb|ACD64363.1| hypothetical protein [Helianthus petiolaris]
 gi|188570657|gb|ACD64364.1| hypothetical protein [Helianthus petiolaris]
 gi|188570659|gb|ACD64365.1| hypothetical protein [Helianthus petiolaris]
 gi|188570661|gb|ACD64366.1| hypothetical protein [Helianthus petiolaris]
 gi|188570663|gb|ACD64367.1| hypothetical protein [Helianthus petiolaris]
 gi|188570665|gb|ACD64368.1| hypothetical protein [Helianthus petiolaris]
 gi|292384298|gb|ADE21402.1| unknown [Helianthus neglectus]
 gi|292384300|gb|ADE21403.1| unknown [Helianthus neglectus]
 gi|292384302|gb|ADE21404.1| unknown [Helianthus neglectus]
 gi|292384304|gb|ADE21405.1| unknown [Helianthus neglectus]
 gi|292384312|gb|ADE21409.1| unknown [Helianthus neglectus]
 gi|292384318|gb|ADE21412.1| unknown [Helianthus neglectus]
 gi|292384320|gb|ADE21413.1| unknown [Helianthus neglectus]
 gi|292384328|gb|ADE21417.1| unknown [Helianthus neglectus]
          Length = 210

 Score =  208 bits (530), Expect = 1e-50,   Method: Compositional matrix adjust.
 Identities = 117/210 (55%), Positives = 157/210 (74%), Gaps = 1/210 (0%)

Query: 651 NGVSAYCGGLWVAALQAASALANDVGDHASASYFWVRYQKAKAVYDS-LWNGSYFNYDNS 709
           +G+SAYCG LW+AALQAA+A+A ++ D  +A  +  ++ KAK+ ++S LWNGSYFNYD+ 
Sbjct: 1   HGISAYCGCLWLAALQAAAAMAVELNDKFAAQKYKKKFLKAKSSFESKLWNGSYFNYDSG 60

Query: 710 DGSSSTSIQADQLAGQWYARACGLLPIADEAKVKKALTKIYDFNVLKVKGGMCGAMNGMQ 769
             S+S SIQADQLAGQWY  + GL  +  + K++ +L KI+DFNV+KV+GG  GA+NGM 
Sbjct: 61  SSSNSKSIQADQLAGQWYTASAGLPNLFGDDKIQSSLQKIFDFNVMKVRGGRMGAVNGMH 120

Query: 770 PDGRIDMSGLQAREIWPGVTYGLAASMIQEEMVDMAFQTAAGVYEVAWSEDGLGYSFQTP 829
           P+G++D + +Q+RE+W GVTYG+AA+MI   M D AF TA G++   WSE+G GY+FQTP
Sbjct: 121 PNGKVDETCMQSREVWTGVTYGVAATMIHAGMEDNAFTTAEGIFIAGWSEEGFGYAFQTP 180

Query: 830 ESWNNNDEYRSLCYMRPLTIWAMQWALTKP 859
           E W  +  YRSL YMRPL IW MQ AL KP
Sbjct: 181 EGWTMDGSYRSLVYMRPLAIWGMQQALEKP 210


>gi|188570671|gb|ACD64371.1| hypothetical protein [Bahiopsis reticulata]
 gi|188570673|gb|ACD64372.1| hypothetical protein [Bahiopsis reticulata]
          Length = 210

 Score =  208 bits (529), Expect = 1e-50,   Method: Compositional matrix adjust.
 Identities = 117/210 (55%), Positives = 155/210 (73%), Gaps = 1/210 (0%)

Query: 651 NGVSAYCGGLWVAALQAASALANDVGDHASASYFWVRYQKAKAVYDS-LWNGSYFNYDNS 709
           +G+SAYCG LW+AALQAA+A+A ++ D  SA  +  ++ KAK  ++S LWNGSYFNYD+ 
Sbjct: 1   HGISAYCGCLWLAALQAAAAMAVELNDKFSAQKYKKKFMKAKTSFESKLWNGSYFNYDSG 60

Query: 710 DGSSSTSIQADQLAGQWYARACGLLPIADEAKVKKALTKIYDFNVLKVKGGMCGAMNGMQ 769
             S+S SIQADQLAGQWY  + GL  +  + K++ +L KI+DFNV+KV+GG  GA+NGM 
Sbjct: 61  SSSNSKSIQADQLAGQWYTASAGLPNLFGDDKIQSSLQKIFDFNVMKVRGGRMGAVNGMH 120

Query: 770 PDGRIDMSGLQAREIWPGVTYGLAASMIQEEMVDMAFQTAAGVYEVAWSEDGLGYSFQTP 829
           P+G++D + +Q+RE+W GVTYG+AA+MI   M D AF TA G++   WSE+G GY+FQTP
Sbjct: 121 PNGKVDETCMQSREVWTGVTYGVAATMIHAGMEDSAFTTAEGIFTAGWSEEGFGYAFQTP 180

Query: 830 ESWNNNDEYRSLCYMRPLTIWAMQWALTKP 859
           E W  +  YRSL YMRPL IW MQ AL  P
Sbjct: 181 EGWTMDGSYRSLVYMRPLAIWGMQQALEAP 210


>gi|188570647|gb|ACD64359.1| hypothetical protein [Helianthus petiolaris]
          Length = 210

 Score =  207 bits (527), Expect = 2e-50,   Method: Compositional matrix adjust.
 Identities = 117/210 (55%), Positives = 156/210 (74%), Gaps = 1/210 (0%)

Query: 651 NGVSAYCGGLWVAALQAASALANDVGDHASASYFWVRYQKAKAVYDS-LWNGSYFNYDNS 709
           +G+SAYCG LW+AALQAA+A+A ++ D  +A  +  ++ KAK+ ++S LWNGSYFNYD+ 
Sbjct: 1   HGISAYCGCLWLAALQAAAAMAVELNDKFAAQKYKKKFLKAKSSFESKLWNGSYFNYDSG 60

Query: 710 DGSSSTSIQADQLAGQWYARACGLLPIADEAKVKKALTKIYDFNVLKVKGGMCGAMNGMQ 769
             S+S SIQADQLAGQWY  + GL  +  + K+  +L KI+DFNV+KV+GG  GA+NGM 
Sbjct: 61  SSSNSKSIQADQLAGQWYTASAGLPNLFGDDKIPSSLQKIFDFNVMKVRGGRMGAVNGMH 120

Query: 770 PDGRIDMSGLQAREIWPGVTYGLAASMIQEEMVDMAFQTAAGVYEVAWSEDGLGYSFQTP 829
           P+G++D + +Q+RE+W GVTYG+AA+MI   M D AF TA G++   WSE+G GY+FQTP
Sbjct: 121 PNGKVDETCMQSREVWTGVTYGVAATMIHAGMEDNAFTTAEGIFIAGWSEEGFGYAFQTP 180

Query: 830 ESWNNNDEYRSLCYMRPLTIWAMQWALTKP 859
           E W  +  YRSL YMRPL IW MQ AL KP
Sbjct: 181 EGWTMDGSYRSLVYMRPLAIWGMQQALEKP 210


>gi|16183333|gb|AAL13692.1| GH26380p [Drosophila melanogaster]
          Length = 391

 Score =  207 bits (526), Expect = 3e-50,   Method: Compositional matrix adjust.
 Identities = 137/374 (36%), Positives = 187/374 (50%), Gaps = 83/374 (22%)

Query: 504 MYNTYDVHFYSSFALVMLFPKLELSIQRDFAAAVMMHDPGTMKIMSDGKWVARKCLGAVP 563
           MYNTYDVHFY+S AL  L+P L++S+Q DF  A+      T K++ DGK + RK    VP
Sbjct: 1   MYNTYDVHFYASPALAHLWPNLQVSLQYDFKDAIAAELNDTRKMLYDGKVMPRKVKNCVP 60

Query: 564 HDIGLDD--PWFEINSYNLFNSSRWKDLNSKFVLQVYRDFVA------------------ 603
           HD+G  D  P+  IN YN+ + + WKDLN+KFVLQVYRD+                    
Sbjct: 61  HDLGDPDEEPFTLINCYNIHDVNDWKDLNTKFVLQVYRDYYVLNELAQAQSDNASKFSSI 120

Query: 604 --------------------TGDKNFARAVWPSVYI-----------AMAYMEQFDKDGD 632
                               + D+        S+YI           A+ Y++       
Sbjct: 121 EFIDKESLYELYSQDNKRKNSADEKQQNRKSASMYINETNGKVYLMDAIGYLKAMYASCK 180

Query: 633 GMIE--------NEGFPDQT------YDAWSANGVSAYCGGLWVAALQAASALANDVGDH 678
            ++E        N+G  + T      YD+W  +G SAYC GLW+AALQA SA+A  + D 
Sbjct: 181 AIMERTIEYDKDNDGLIENTKMPDQTYDSWVMDGPSAYCSGLWLAALQAMSAMAT-ILDQ 239

Query: 679 ASASYFWVRYQK-----AKAVYDSLWNGSYFNYDNSDGSSSTSIQADQLAGQWYARACGL 733
            +     +RYQ       +++ + LWNGSY+ +D S     T I ADQL G WY ++CG 
Sbjct: 240 PNDC---LRYQDILEKGKRSLEEKLWNGSYYRFDLSHSHRDT-IMADQLCGHWYLKSCGF 295

Query: 734 -LPIADEAKVKKALTKIYDFNVLKVKGGMCGAMNGMQPD-------GRIDMSGLQAREIW 785
              I  +  V+ AL +IYD NV+    G  GA NG   +       G +D S +QA E+W
Sbjct: 296 DYEIYPKENVRTALKRIYDNNVMGFHEGNIGAANGFIANASEPTKPGHVDNSNIQAEEVW 355

Query: 786 PGVTYGLAASMIQE 799
           PGV Y LAA+MIQE
Sbjct: 356 PGVVYALAATMIQE 369


>gi|188570511|gb|ACD64291.1| hypothetical protein [Helianthus annuus]
          Length = 210

 Score =  206 bits (525), Expect = 3e-50,   Method: Compositional matrix adjust.
 Identities = 116/210 (55%), Positives = 157/210 (74%), Gaps = 1/210 (0%)

Query: 651 NGVSAYCGGLWVAALQAASALANDVGDHASASYFWVRYQKAKAVYDS-LWNGSYFNYDNS 709
           +G+SAYCG LW+AALQAA+A+A ++ D  +A  +  ++ KAK+ ++S LWNGSYFNYD+ 
Sbjct: 1   HGISAYCGCLWLAALQAAAAMAVELNDKFAAQKYKKKFLKAKSSFESKLWNGSYFNYDSG 60

Query: 710 DGSSSTSIQADQLAGQWYARACGLLPIADEAKVKKALTKIYDFNVLKVKGGMCGAMNGMQ 769
             S+S SIQADQLAGQWY  + GL  +  + K++ +L KI+DFNV+KV+GG  GA+NGM 
Sbjct: 61  SSSNSKSIQADQLAGQWYTASAGLPNLFGDDKIQSSLQKIFDFNVMKVRGGRMGAVNGMH 120

Query: 770 PDGRIDMSGLQAREIWPGVTYGLAASMIQEEMVDMAFQTAAGVYEVAWSEDGLGYSFQTP 829
           P+G++D + +Q+RE+W GVTYG+AA+MI   M D AF TA G++   WSE+G GY+FQTP
Sbjct: 121 PNGKVDETCMQSREVWTGVTYGVAATMIHAGMEDNAFTTAEGIFIAGWSEEGFGYAFQTP 180

Query: 830 ESWNNNDEYRSLCYMRPLTIWAMQWALTKP 859
           E W  +  YRSL YMRPL IW +Q AL KP
Sbjct: 181 EGWTMDGSYRSLVYMRPLAIWGIQQALEKP 210


>gi|188570531|gb|ACD64301.1| hypothetical protein [Helianthus annuus]
          Length = 210

 Score =  206 bits (525), Expect = 3e-50,   Method: Compositional matrix adjust.
 Identities = 116/210 (55%), Positives = 157/210 (74%), Gaps = 1/210 (0%)

Query: 651 NGVSAYCGGLWVAALQAASALANDVGDHASASYFWVRYQKAKAVYDS-LWNGSYFNYDNS 709
           +G+SAYCG LW+AALQAA+A+A ++ D  +A  +  ++ KAK+ ++S LWNGSYFNYD+ 
Sbjct: 1   HGISAYCGCLWLAALQAAAAMAVELNDKFAAQKYKKKFLKAKSSFESKLWNGSYFNYDSG 60

Query: 710 DGSSSTSIQADQLAGQWYARACGLLPIADEAKVKKALTKIYDFNVLKVKGGMCGAMNGMQ 769
             S+S SIQADQLAGQWY  + GL  +  + K++ +L KI++FNV+KV+GG  GA+NGM 
Sbjct: 61  SSSNSKSIQADQLAGQWYTASAGLPNLFGDDKIQSSLQKIFNFNVMKVRGGRMGAVNGMH 120

Query: 770 PDGRIDMSGLQAREIWPGVTYGLAASMIQEEMVDMAFQTAAGVYEVAWSEDGLGYSFQTP 829
           P+G++D + +Q+RE+W GVTYG+AA+MI   M D AF TA G++   WSE+G GY+FQTP
Sbjct: 121 PNGKVDETCMQSREVWTGVTYGVAATMIHAGMEDNAFTTAEGIFIAGWSEEGFGYAFQTP 180

Query: 830 ESWNNNDEYRSLCYMRPLTIWAMQWALTKP 859
           E W  +  YRSL YMRPL IW MQ AL KP
Sbjct: 181 EGWTMDGSYRSLVYMRPLAIWGMQQALEKP 210


>gi|188570635|gb|ACD64353.1| hypothetical protein [Helianthus petiolaris]
 gi|188570639|gb|ACD64355.1| hypothetical protein [Helianthus petiolaris]
          Length = 211

 Score =  206 bits (524), Expect = 5e-50,   Method: Compositional matrix adjust.
 Identities = 118/211 (55%), Positives = 158/211 (74%), Gaps = 2/211 (0%)

Query: 651 NGVSAYCGGLWVAALQAASALANDVGDH-ASASYFWVRYQKAKAVYDS-LWNGSYFNYDN 708
           +G+SAYCG LW+AALQAA+A+A ++ D  A+A  +  ++ KAK+ ++S LWNGSYFNYD+
Sbjct: 1   HGISAYCGCLWLAALQAAAAMAVELNDKFAAAQKYKKKFLKAKSSFESKLWNGSYFNYDS 60

Query: 709 SDGSSSTSIQADQLAGQWYARACGLLPIADEAKVKKALTKIYDFNVLKVKGGMCGAMNGM 768
              S+S SIQADQLAGQWY  + GL  +  + K++ +L KI+DFNV+KV+GG  GA+NGM
Sbjct: 61  GSSSNSKSIQADQLAGQWYTASAGLPNLFGDDKIQSSLQKIFDFNVMKVRGGRMGAVNGM 120

Query: 769 QPDGRIDMSGLQAREIWPGVTYGLAASMIQEEMVDMAFQTAAGVYEVAWSEDGLGYSFQT 828
            P+G++D + +Q+RE+W GVTYG+AA+MI   M D AF TA G++   WSE+G GY+FQT
Sbjct: 121 HPNGKVDETCMQSREVWTGVTYGVAATMIHAGMEDNAFTTAEGIFIAGWSEEGFGYAFQT 180

Query: 829 PESWNNNDEYRSLCYMRPLTIWAMQWALTKP 859
           PE W  +  YRSL YMRPL IW MQ AL KP
Sbjct: 181 PEGWTMDGSYRSLVYMRPLAIWGMQQALEKP 211


>gi|292384310|gb|ADE21408.1| unknown [Helianthus neglectus]
          Length = 210

 Score =  206 bits (524), Expect = 5e-50,   Method: Compositional matrix adjust.
 Identities = 117/210 (55%), Positives = 157/210 (74%), Gaps = 1/210 (0%)

Query: 651 NGVSAYCGGLWVAALQAASALANDVGDHASASYFWVRYQKAKAVYDS-LWNGSYFNYDNS 709
           +G+SAYCG LW+AALQAA+A+A ++ D  +A  +  ++ KAK+ ++S LWNGSYFNYD+ 
Sbjct: 1   HGISAYCGCLWLAALQAAAAMAVELNDKFAAQKYKKKFFKAKSSFESKLWNGSYFNYDSG 60

Query: 710 DGSSSTSIQADQLAGQWYARACGLLPIADEAKVKKALTKIYDFNVLKVKGGMCGAMNGMQ 769
             S+S SIQADQLAGQWY  + GL  +  + K++ +L KI+DFNV+KV+GG  GA+NGM 
Sbjct: 61  SSSNSKSIQADQLAGQWYTASAGLPNLFGDDKIQSSLQKIFDFNVMKVRGGRMGAVNGMH 120

Query: 770 PDGRIDMSGLQAREIWPGVTYGLAASMIQEEMVDMAFQTAAGVYEVAWSEDGLGYSFQTP 829
           P+G++D + +Q+RE+W GVTYG+AA+MI   M D AF TA G++   WSE+G GY+FQTP
Sbjct: 121 PNGKVDETCMQSREVWTGVTYGVAATMIHAGMEDNAFTTAEGIFIAGWSEEGFGYAFQTP 180

Query: 830 ESWNNNDEYRSLCYMRPLTIWAMQWALTKP 859
           E W  +  YRSL YMRPL IW MQ AL KP
Sbjct: 181 EGWTMDGSYRSLVYMRPLAIWGMQQALEKP 210


>gi|292384324|gb|ADE21415.1| unknown [Helianthus neglectus]
          Length = 210

 Score =  205 bits (521), Expect = 1e-49,   Method: Compositional matrix adjust.
 Identities = 116/210 (55%), Positives = 156/210 (74%), Gaps = 1/210 (0%)

Query: 651 NGVSAYCGGLWVAALQAASALANDVGDHASASYFWVRYQKAKAVYDS-LWNGSYFNYDNS 709
           +G+SA CG LW+AALQAA+A+A ++ D  +A  +  ++ KAK+ ++S LWNGSYFNYD+ 
Sbjct: 1   HGISACCGCLWLAALQAAAAMAVELNDKFAAQKYKKKFLKAKSSFESKLWNGSYFNYDSG 60

Query: 710 DGSSSTSIQADQLAGQWYARACGLLPIADEAKVKKALTKIYDFNVLKVKGGMCGAMNGMQ 769
             S+S SIQADQLAGQWY  + GL  +  + K++ +L KI+DFNV+KV+GG  GA+NGM 
Sbjct: 61  SSSNSKSIQADQLAGQWYTASAGLPNLFGDDKIQSSLQKIFDFNVMKVRGGRMGAVNGMH 120

Query: 770 PDGRIDMSGLQAREIWPGVTYGLAASMIQEEMVDMAFQTAAGVYEVAWSEDGLGYSFQTP 829
           P+G++D + +Q+RE+W GVTYG+AA+MI   M D AF TA G++   WSE+G GY+FQTP
Sbjct: 121 PNGKVDETCMQSREVWTGVTYGVAATMIHAGMEDNAFTTAEGIFIAGWSEEGFGYAFQTP 180

Query: 830 ESWNNNDEYRSLCYMRPLTIWAMQWALTKP 859
           E W  +  YRSL YMRPL IW MQ AL KP
Sbjct: 181 EGWTMDGSYRSLVYMRPLAIWGMQQALEKP 210


>gi|292384322|gb|ADE21414.1| unknown [Helianthus neglectus]
          Length = 207

 Score =  204 bits (520), Expect = 1e-49,   Method: Compositional matrix adjust.
 Identities = 116/207 (56%), Positives = 154/207 (74%), Gaps = 1/207 (0%)

Query: 654 SAYCGGLWVAALQAASALANDVGDHASASYFWVRYQKAKAVYDS-LWNGSYFNYDNSDGS 712
           SAYCG LW+AALQAA+A+A ++ D  +A  +  ++ KAK+ ++S LWNGSYFNYD+   S
Sbjct: 1   SAYCGCLWLAALQAAAAMAVELNDKFAAQKYKKKFLKAKSSFESKLWNGSYFNYDSGSSS 60

Query: 713 SSTSIQADQLAGQWYARACGLLPIADEAKVKKALTKIYDFNVLKVKGGMCGAMNGMQPDG 772
           +S SIQADQLAGQWY  + GL  +  + K++ +L KI+DFNV+KV+GG  GA+NGM P+G
Sbjct: 61  NSKSIQADQLAGQWYTASAGLPNLFGDDKIQSSLQKIFDFNVMKVRGGRMGAVNGMHPNG 120

Query: 773 RIDMSGLQAREIWPGVTYGLAASMIQEEMVDMAFQTAAGVYEVAWSEDGLGYSFQTPESW 832
           ++D + +Q+RE+W GVTYG+AA+MI   M D AF TA G++   WSE+G GY+FQTPE W
Sbjct: 121 KVDETCMQSREVWTGVTYGVAATMIHAGMEDNAFTTAEGIFIAGWSEEGFGYAFQTPEGW 180

Query: 833 NNNDEYRSLCYMRPLTIWAMQWALTKP 859
             +  YRSL YMRPL IW MQ AL KP
Sbjct: 181 TMDGSYRSLVYMRPLAIWGMQQALEKP 207


>gi|188570667|gb|ACD64369.1| hypothetical protein [Bahiopsis lanata]
 gi|188570669|gb|ACD64370.1| hypothetical protein [Bahiopsis lanata]
          Length = 208

 Score =  202 bits (515), Expect = 5e-49,   Method: Compositional matrix adjust.
 Identities = 116/208 (55%), Positives = 152/208 (73%), Gaps = 1/208 (0%)

Query: 653 VSAYCGGLWVAALQAASALANDVGDHASASYFWVRYQKAKAVYDS-LWNGSYFNYDNSDG 711
           +SAYCG LW+AALQAA+A+A  + D  SA  +  ++ KAK  ++S LWNGSYFNYD+   
Sbjct: 1   ISAYCGCLWLAALQAAAAMAVALNDKFSAQKYKKKFMKAKTSFESKLWNGSYFNYDSGSS 60

Query: 712 SSSTSIQADQLAGQWYARACGLLPIADEAKVKKALTKIYDFNVLKVKGGMCGAMNGMQPD 771
           S+S SIQADQLAGQWY  + GL  +  + K++ +L KI+DFNV+KV+GG  GA+NGM P+
Sbjct: 61  SNSKSIQADQLAGQWYTASAGLPNLFGDDKIQSSLHKIFDFNVMKVRGGRMGAVNGMHPN 120

Query: 772 GRIDMSGLQAREIWPGVTYGLAASMIQEEMVDMAFQTAAGVYEVAWSEDGLGYSFQTPES 831
           G++D + +Q+RE+W GVTYG+AA+MI   M D AF TA G++   WSE+G GY+FQTPE 
Sbjct: 121 GKVDETCMQSREVWTGVTYGVAATMIHAGMEDKAFTTAEGIFTAGWSEEGFGYAFQTPEG 180

Query: 832 WNNNDEYRSLCYMRPLTIWAMQWALTKP 859
           W  +  YRSL YMRPL IW MQ AL  P
Sbjct: 181 WTMDGCYRSLVYMRPLAIWGMQQALEAP 208


>gi|292384308|gb|ADE21407.1| unknown [Helianthus neglectus]
          Length = 205

 Score =  202 bits (515), Expect = 5e-49,   Method: Compositional matrix adjust.
 Identities = 115/204 (56%), Positives = 153/204 (75%), Gaps = 1/204 (0%)

Query: 654 SAYCGGLWVAALQAASALANDVGDHASASYFWVRYQKAKAVYDS-LWNGSYFNYDNSDGS 712
           SAYCG LW+AALQAA+A+A ++ D  +A  +  ++ KAK+ ++S LWNGSYFNYD+   S
Sbjct: 1   SAYCGCLWLAALQAAAAMAVELNDKFAAQKYKKKFMKAKSSFESKLWNGSYFNYDSGSSS 60

Query: 713 SSTSIQADQLAGQWYARACGLLPIADEAKVKKALTKIYDFNVLKVKGGMCGAMNGMQPDG 772
           +S SIQADQLAGQWY  + GL  + D+ K++ +L KI+DFNV+KV+GG  GA+NGM P+G
Sbjct: 61  NSKSIQADQLAGQWYTASAGLPNLFDDDKIQSSLQKIFDFNVMKVRGGRMGAVNGMHPNG 120

Query: 773 RIDMSGLQAREIWPGVTYGLAASMIQEEMVDMAFQTAAGVYEVAWSEDGLGYSFQTPESW 832
           ++D + +Q+RE+W GVTYG+AA+MI   M D AF TA G++   WSE+G GY+FQTPE W
Sbjct: 121 KVDETCMQSREVWTGVTYGVAATMIHAGMEDNAFTTAEGIFIAGWSEEGFGYAFQTPEGW 180

Query: 833 NNNDEYRSLCYMRPLTIWAMQWAL 856
             +  YRSL YMRPL IW MQ AL
Sbjct: 181 TMDGSYRSLVYMRPLAIWGMQQAL 204


>gi|292384306|gb|ADE21406.1| unknown [Helianthus neglectus]
          Length = 205

 Score =  200 bits (509), Expect = 3e-48,   Method: Compositional matrix adjust.
 Identities = 114/204 (55%), Positives = 152/204 (74%), Gaps = 1/204 (0%)

Query: 654 SAYCGGLWVAALQAASALANDVGDHASASYFWVRYQKAKAVYDS-LWNGSYFNYDNSDGS 712
           SAYCG LW+AALQAA+A+A ++ D  +A  +  ++ KAK+ ++S LWNGSYFNYD+   S
Sbjct: 1   SAYCGCLWLAALQAAAAMAVELNDKFTAQKYKKKFLKAKSSFESKLWNGSYFNYDSGSSS 60

Query: 713 SSTSIQADQLAGQWYARACGLLPIADEAKVKKALTKIYDFNVLKVKGGMCGAMNGMQPDG 772
           +S SIQADQLAGQWY  + GL  +  + K++ +L KI+DFNV+KV+GG  GA+NGM P+G
Sbjct: 61  NSKSIQADQLAGQWYTASAGLPNLFGDDKIQSSLQKIFDFNVMKVRGGRMGAVNGMHPNG 120

Query: 773 RIDMSGLQAREIWPGVTYGLAASMIQEEMVDMAFQTAAGVYEVAWSEDGLGYSFQTPESW 832
           ++D + +Q+RE+W GVTYG+AA+MI   M D AF TA G++   WSE+G GY+FQTPE W
Sbjct: 121 KVDETCMQSREVWTGVTYGVAATMIHAGMEDNAFTTAEGIFIAGWSEEGFGYAFQTPEGW 180

Query: 833 NNNDEYRSLCYMRPLTIWAMQWAL 856
             +  YRSL YMRPL IW MQ AL
Sbjct: 181 TMDGSYRSLVYMRPLAIWGMQQAL 204


>gi|292384314|gb|ADE21410.1| unknown [Helianthus neglectus]
          Length = 205

 Score =  200 bits (508), Expect = 3e-48,   Method: Compositional matrix adjust.
 Identities = 114/204 (55%), Positives = 152/204 (74%), Gaps = 1/204 (0%)

Query: 654 SAYCGGLWVAALQAASALANDVGDHASASYFWVRYQKAKAVYDS-LWNGSYFNYDNSDGS 712
           SAYCG LW+AALQAA+A+A ++ D  +A  +  ++ KAK+ ++S LWNGSYFNYD+   S
Sbjct: 1   SAYCGCLWLAALQAAAAMAVELNDKFAAQKYKKKFLKAKSSFESKLWNGSYFNYDSGSSS 60

Query: 713 SSTSIQADQLAGQWYARACGLLPIADEAKVKKALTKIYDFNVLKVKGGMCGAMNGMQPDG 772
           +S SIQADQLAGQWY  + GL  +  + K++ +L KI+DFNV+KV+GG  GA+NGM P+G
Sbjct: 61  NSKSIQADQLAGQWYTASAGLPNLFGDDKIQSSLQKIFDFNVMKVRGGRMGAVNGMHPNG 120

Query: 773 RIDMSGLQAREIWPGVTYGLAASMIQEEMVDMAFQTAAGVYEVAWSEDGLGYSFQTPESW 832
           ++D + +Q+RE+W GVTYG+AA+MI   M D AF TA G++   WSE+G GY+FQTPE W
Sbjct: 121 KVDETCMQSREVWTGVTYGVAATMIHAGMEDNAFTTAEGIFIAGWSEEGFGYAFQTPEGW 180

Query: 833 NNNDEYRSLCYMRPLTIWAMQWAL 856
             +  YRSL YMRPL IW MQ AL
Sbjct: 181 TMDGSYRSLVYMRPLAIWGMQQAL 204


>gi|292384316|gb|ADE21411.1| unknown [Helianthus neglectus]
          Length = 205

 Score =  198 bits (503), Expect = 1e-47,   Method: Compositional matrix adjust.
 Identities = 93/159 (58%), Positives = 119/159 (74%)

Query: 698 LWNGSYFNYDNSDGSSSTSIQADQLAGQWYARACGLLPIADEAKVKKALTKIYDFNVLKV 757
           LWNGSYFNYD+   S+S SIQADQLAGQWY  + GL  +  + K++ +L KI+DFNV+KV
Sbjct: 46  LWNGSYFNYDSGSSSNSKSIQADQLAGQWYTASAGLPNLFGDDKIQSSLQKIFDFNVMKV 105

Query: 758 KGGMCGAMNGMQPDGRIDMSGLQAREIWPGVTYGLAASMIQEEMVDMAFQTAAGVYEVAW 817
           +GG  GA+NGM P+G++D + +Q+RE+W GVTYG+AA+MI   M D AF TA G++   W
Sbjct: 106 RGGRMGAVNGMHPNGKVDETCMQSREVWTGVTYGVAATMIHAGMEDNAFTTAEGIFIAGW 165

Query: 818 SEDGLGYSFQTPESWNNNDEYRSLCYMRPLTIWAMQWAL 856
           SE+G GY+FQTPE W  +  YRSL YMRPL IW MQ AL
Sbjct: 166 SEEGFGYAFQTPEGWTMDGSYRSLVYMRPLAIWGMQQAL 204


>gi|295828610|gb|ADG37974.1| AT1G33700-like protein [Capsella grandiflora]
          Length = 105

 Score =  195 bits (496), Expect = 9e-47,   Method: Compositional matrix adjust.
 Identities = 89/105 (84%), Positives = 99/105 (94%)

Query: 570 DPWFEINSYNLFNSSRWKDLNSKFVLQVYRDFVATGDKNFARAVWPSVYIAMAYMEQFDK 629
           DPWFE+N+YNLFN++RWKDLNSKFVLQVYRD VATGD NFA+AVWPSVY A+AY++QFDK
Sbjct: 1   DPWFEVNAYNLFNTNRWKDLNSKFVLQVYRDVVATGDLNFAKAVWPSVYTAIAYLDQFDK 60

Query: 630 DGDGMIENEGFPDQTYDAWSANGVSAYCGGLWVAALQAASALAND 674
           DGDGMIENEGFPDQTYDAWS +GVSAYCGGLWVAALQA SALA +
Sbjct: 61  DGDGMIENEGFPDQTYDAWSCSGVSAYCGGLWVAALQAGSALARE 105


>gi|295828604|gb|ADG37971.1| AT1G33700-like protein [Capsella grandiflora]
 gi|295828608|gb|ADG37973.1| AT1G33700-like protein [Capsella grandiflora]
          Length = 105

 Score =  195 bits (496), Expect = 9e-47,   Method: Compositional matrix adjust.
 Identities = 89/105 (84%), Positives = 98/105 (93%)

Query: 570 DPWFEINSYNLFNSSRWKDLNSKFVLQVYRDFVATGDKNFARAVWPSVYIAMAYMEQFDK 629
           DPWFE+N+YNLFN+ RWKDLNSKFVLQVYRD VATGD NFA+AVWPSVY A+AY++QFDK
Sbjct: 1   DPWFEVNAYNLFNTBRWKDLNSKFVLQVYRDVVATGDLNFAKAVWPSVYTAIAYLDQFDK 60

Query: 630 DGDGMIENEGFPDQTYDAWSANGVSAYCGGLWVAALQAASALAND 674
           DGDGMIENEGFPDQTYDAWS +GVSAYCGGLWVAALQA SALA +
Sbjct: 61  DGDGMIENEGFPDQTYDAWSCSGVSAYCGGLWVAALQAGSALARE 105


>gi|295828602|gb|ADG37970.1| AT1G33700-like protein [Capsella grandiflora]
 gi|295828606|gb|ADG37972.1| AT1G33700-like protein [Capsella grandiflora]
 gi|295828612|gb|ADG37975.1| AT1G33700-like protein [Capsella grandiflora]
 gi|295828614|gb|ADG37976.1| AT1G33700-like protein [Neslia paniculata]
          Length = 105

 Score =  195 bits (495), Expect = 1e-46,   Method: Compositional matrix adjust.
 Identities = 89/105 (84%), Positives = 98/105 (93%)

Query: 570 DPWFEINSYNLFNSSRWKDLNSKFVLQVYRDFVATGDKNFARAVWPSVYIAMAYMEQFDK 629
           DPWFE+N+YNLFN+ RWKDLNSKFVLQVYRD VATGD NFA+AVWPSVY A+AY++QFDK
Sbjct: 1   DPWFEVNAYNLFNTDRWKDLNSKFVLQVYRDVVATGDLNFAKAVWPSVYTAIAYLDQFDK 60

Query: 630 DGDGMIENEGFPDQTYDAWSANGVSAYCGGLWVAALQAASALAND 674
           DGDGMIENEGFPDQTYDAWS +GVSAYCGGLWVAALQA SALA +
Sbjct: 61  DGDGMIENEGFPDQTYDAWSCSGVSAYCGGLWVAALQAGSALARE 105


>gi|324503709|gb|ADY41606.1| Non-lysosomal glucosylceramidase [Ascaris suum]
          Length = 261

 Score =  183 bits (465), Expect = 3e-43,   Method: Compositional matrix adjust.
 Identities = 103/250 (41%), Positives = 143/250 (57%), Gaps = 11/250 (4%)

Query: 621 MAYMEQFDKDGDGMIENEGFPDQTYDAWSANGVSAYCGGLWVAALQAASALANDVGDHAS 680
           +A  + +D DGD +IEN G PDQTYD WS +G SAYCGGLW+ AL+    +A  +G+   
Sbjct: 17  IADAKAWDVDGDDLIENAGQPDQTYDVWSMHGSSAYCGGLWLCALECVRRMALTLGEVVD 76

Query: 681 ASYFWVRYQKAKAVYD-SLWNGSYFNYDNSDGSSSTSIQADQLAGQWYARACGLLPIADE 739
           A  F  +   A+  Y+  LWNG YF++D    +   SI ADQL G W+   C      D+
Sbjct: 77  AQKFANKLNNARKAYERKLWNGKYFDFDEH-STDHKSIMADQLCGFWF--MCITDGKVDD 133

Query: 740 AKVKK-----ALTKIYDFNVLKVKGGMCGAMNGMQPDGRIDMSGLQAREIWPGVTYGLAA 794
             + +     +L  I+++NV K   G  G +N M P G +D +G+Q+ E+W GV Y LA+
Sbjct: 134 VIITRQQICASLKTIFEYNVEKFANGQLGPVNAMMPSGVVDSTGIQSEEVWGGVAYALAS 193

Query: 795 SMIQEEMVDMAFQTAAGVYEVAWSEDGLGYSFQTPESWNNNDEYRSLCYMRPLTIWAMQW 854
             +  E  + AF+TA G Y   W   GL Y  Q+PE+ N +  YR++ YMRPL IWAMQ 
Sbjct: 194 FHLLVEENESAFKTAEGWYRSCWERYGLQY--QSPEAINESSYYRAIGYMRPLAIWAMQS 251

Query: 855 ALTKPKLSRQ 864
           AL   +  RQ
Sbjct: 252 ALDALRNKRQ 261


>gi|344251774|gb|EGW07878.1| Non-lysosomal glucosylceramidase [Cricetulus griseus]
          Length = 226

 Score =  157 bits (398), Expect = 2e-35,   Method: Compositional matrix adjust.
 Identities = 84/183 (45%), Positives = 109/183 (59%), Gaps = 10/183 (5%)

Query: 672 ANDVGDHASASYFWVRYQKAKAVYDSLWNGSYFNYDNSDGSSSTSIQADQLAGQWYARAC 731
           A DV D  S+    +  +  +A    LWNG Y+NYD+S    S SI +DQ AGQW+ RAC
Sbjct: 7   AQDVQDKFSS----ILCRGREAYERLLWNGRYYNYDSSSQPQSRSIMSDQCAGQWFLRAC 62

Query: 732 GL----LPIADEAKVKKALTKIYDFNVLKVKGGMCGAMNGMQPDGRIDMSGLQAREIWPG 787
           GL      +     V +AL  I++ NV    GG  GA+NGMQP G  D S +Q+ E+W G
Sbjct: 63  GLGEGDTEVFPTLHVVRALQTIFELNVQAFAGGAMGAVNGMQPHGVPDRSSVQSDEVWVG 122

Query: 788 VTYGLAASMIQEEMVDMAFQTAAGVYEVAWSEDGLGYSFQTPESWNNNDEYRSLCYMRPL 847
           V YGLAA+MIQE +    F+TA G Y   W  + LG +FQTPE++     +RSL YMRPL
Sbjct: 123 VVYGLAATMIQEGLTREGFRTAEGCYRTVW--ERLGLAFQTPEAYCQQQVFRSLAYMRPL 180

Query: 848 TIW 850
           +IW
Sbjct: 181 SIW 183


>gi|303285310|ref|XP_003061945.1| hypothetical protein MICPUCDRAFT_48400 [Micromonas pusilla CCMP1545]
 gi|226456356|gb|EEH53657.1| hypothetical protein MICPUCDRAFT_48400 [Micromonas pusilla CCMP1545]
          Length = 1073

 Score =  157 bits (398), Expect = 2e-35,   Method: Compositional matrix adjust.
 Identities = 175/671 (26%), Positives = 262/671 (39%), Gaps = 161/671 (23%)

Query: 333  SEPGSSIGAAIAASLTIPSGSTRSVTFSLAWDCPEVKFFE-KVYHRRYTKFYGTLGDSAA 391
            + PG++  AA++A  ++  G  RS+ F++AWD P   F     + +RY       G  AA
Sbjct: 378  ASPGAA--AAVSAKFSLKPGERRSIAFAIAWDLPVATFPRGPAFLKRYAWRRRRRGGGAA 435

Query: 392  RIAHDAILEHAKWECEIEAWQRPILEDK---------RFPEWYPITLFNELYYLNAGGTI 442
             +A  A+   A WE  + AWQ P ++           R P W+   LFNELYYL  GGT+
Sbjct: 436  DVAMSALRRAASWERRVRAWQAPYVDSAANADSKSPVRRPGWFVTALFNELYYLVDGGTL 495

Query: 443  WTDGSP-PMQSLATIRERKFSLDTPSKIHTPASSDTA----LGTRLLENGEE-------- 489
            W  G P  +  L          D         S+D A    L  R    GE         
Sbjct: 496  W--GRPLSVGGLDGDEFDDDEDDEVEDALDGDSADDAEVRLLPIRPRSRGERRSLRRLGL 553

Query: 490  -----------------------NIGQFLYLEGAEYVMYNTYDVHFYSSFALVMLFPKLE 526
                                    +G+F      +   YN  D +FY S+ L  L+P L+
Sbjct: 554  LGGEDGDGGDARDVIGNGGAWGGTLGRFGSASDVDRATYNALDEYFYGSWPLATLWPGLD 613

Query: 527  LSIQRDFAAAVMMHDPGTMKIMSDGKWVA------------------------RKCLGAV 562
            L++  D A AV   D    +++    W A                        RK  G  
Sbjct: 614  LAVIGDLAHAVDAADDTERRVL----WRALGSAKKSAGGTGATTTTIASQTKRRKIRGVA 669

Query: 563  PHDIGL--DDPWFEI-NSYNLFNSSRWKDLNSKFVLQVYRDFVATG-------------- 605
            P D+G   D P     N+ +  +S+   DL  K +L V R     G              
Sbjct: 670  PRDLGTSRDAPLTSAPNALSDVDSNALLDLAPKLMLLVARAHALRGGFRGPEAMASFGGG 729

Query: 606  -DKNFARAVWPSVYIAMA-YMEQFDKDGDGMIENEGFPDQT---YDAWSANG-VSAYCGG 659
             +++  R V+   Y A+A  +   D +GDG+ E+      T   YD W  +G  S+Y GG
Sbjct: 730  LERDALRRVFEPAYRALATQLRARDLNGDGLPEHGDADAPTLSGYDRWVVSGNASSYGGG 789

Query: 660  LWVAALQAASALANDVGD-HASASYFWVRYQKAKAVYDSLWNG----------------- 701
            LW+AAL+A + +A D+ +  A  S        AK+  ++LW G                 
Sbjct: 790  LWLAALRAGAGIARDLDETDARTSLEETLRIAAKSFDEALWRGGGGGGGGGVRKSQSRSR 849

Query: 702  -------SYFNYDNSDGSSSTSIQADQLAGQW---YARACGLLPIADEAKVKKALTKIYD 751
                    Y+  D S   +   + A Q+ G+W     RA G+L   D  KV+ AL+  Y+
Sbjct: 850  RGTHALEGYYAADASGTDAGDVVLAGQVMGEWALGMIRAPGVL---DPRKVRAALSTTYE 906

Query: 752  FNVLKVKGGM----------------CGAMNGMQPDGRIDMSGL----------QAREIW 785
             NV                        GA+NG + D   D + L          QARE  
Sbjct: 907  RNVKAFGRARGGSGSGSDDEPSTYVPSGAVNGARVDDATDAAALGVGAGDGIPAQARESR 966

Query: 786  PGVTYGLAASMIQEEMVDMAFQTAAGVYEVAWSEDGLGYSFQTPESWNNNDEYRSLCYMR 845
             G +Y LA+ +I     D  +  A G Y V + +   G++F+TPE ++    +R     R
Sbjct: 967  VGQSYALASHLILAGFSDEGWDVARGAYRVTYED---GFAFRTPEIFDAERRFRGAISGR 1023

Query: 846  PLTIWAMQWAL 856
               +WA++ +L
Sbjct: 1024 AGAVWAIERSL 1034



 Score = 57.4 bits (137), Expect = 4e-05,   Method: Compositional matrix adjust.
 Identities = 50/159 (31%), Positives = 75/159 (47%), Gaps = 25/159 (15%)

Query: 101 RSYRGEFQRFKLFHGICDDAPVLANQFSVFVSRPNGEKFSSVLCPRSP------GVPKKN 154
           RSY G+F R+ L  G      V A+ F+    R NG   ++VL   SP      GVP   
Sbjct: 90  RSYLGDFSRWHLKVG-SHTHRVAAHTFA--ACRVNG--VATVLSA-SPAHASLEGVPSSE 143

Query: 155 TDSGIESWDWNL------KGENCTYHALFPRAWTVYDGEPD---PELRIVCRQI--SPFI 203
            D                 G    Y AL+PRAW  Y+  PD   P+ R+V  Q+  SP +
Sbjct: 144 DDDDSPPPRTRRRRTLPEDGIGGNYTALYPRAW--YEYAPDALHPDARVVVSQVQFSPVL 201

Query: 204 PHNYKESSFPASVFTFTLSNSGQTSADVTLLFTWANSVA 242
           P +Y+ S+ P  V  F   N G+ +A+V+++ ++ + +A
Sbjct: 202 PGDYRASAVPTGVLRFVARNDGRATANVSIMLSFESVLA 240


>gi|148242847|ref|YP_001228004.1| hypothetical protein SynRCC307_1748 [Synechococcus sp. RCC307]
 gi|147851157|emb|CAK28651.1| Conserved hypothetical protein [Synechococcus sp. RCC307]
          Length = 606

 Score =  148 bits (374), Expect = 1e-32,   Method: Compositional matrix adjust.
 Identities = 129/487 (26%), Positives = 200/487 (41%), Gaps = 81/487 (16%)

Query: 87  QGIPLGGIGAGSIGRSYRGEFQRFKLFHGICDDAPVLANQFSVFVSRPNGEKFSSVLCPR 146
            G PLGG GAG IGRS +G+F  + +  G      +   QF++F    +  +  ++    
Sbjct: 54  HGAPLGGFGAGCIGRSSKGDFNLWNIDGGEHWFGSLPDCQFALFEDNGSDTRAHALAVAP 113

Query: 147 SPGVPKKNTDSGIESWDW----NLKGENCTYHALFPRAWTVYDGEPDPELRIVCRQISPF 202
                K   ++ + SW W      +    +Y A +P  W  Y G    E    C   SP 
Sbjct: 114 EADSSKPEGEAPLSSWSWYPASTAEKPTGSYSARYPLNWFHYSGVFGCE--ASCESFSPI 171

Query: 203 IPHNYKESSFPASVFTFTLSNSGQTSADVTLLFTWANSVA-------------GDSGLSG 249
           IP NY++++ P +VF +   N  +    ++LL +W N+                D G   
Sbjct: 172 IPGNYRDTALPVAVFRWRFRNPSRKPLKLSLLLSWRNTTGWFCNTDPSAAVHFRDDGSPE 231

Query: 250 HHF----------NSKTMTKDGVHGLTLH----HRTANGRPPVTFAVAAEETADVHVSEC 295
           H++           ++ +   G+ G+ +        A G+     AV  EE   V +  C
Sbjct: 232 HNYLPAIGATQGQRNRAINAPGLRGVVMEGPMSEPIAEGQGQWCIAVP-EELEGVELMRC 290

Query: 296 PCFLLSGNSKGITAKDMWNEIKKHGSFDHLDNDKTSPSEPGSSIGAAIAASLTIPSGSTR 355
             +   G+       ++W      GS    +ND+ S +     + AA+A    +  G+  
Sbjct: 291 SRWNPHGD-----GAEIWQRFAADGSVPESNNDRRSGNN--DPVSAAMALRFELAPGAEL 343

Query: 356 SVTFSLAWDCPEVKFFEKVYH-RRYTKFYGTLGDSAARIAHDAILEHAKWECEIEAWQRP 414
            +   ++WD P   F       RRYT F+G  GD AA IA DA+L   +W  +IE WQ P
Sbjct: 344 ELPVVISWDLPVTAFATGSQCLRRYTDFFGPSGDQAAAIAADALLRWPRWRQQIEEWQAP 403

Query: 415 ILEDKRFPEWYPITLFNELYYLNAGGTIWTDGSPPMQSLATIRERKFSLDTPSKIHTPAS 474
           +L     PE   + L NELY L +GGT+W+  S                           
Sbjct: 404 VLLRDDLPEELRMALCNELYDLASGGTLWSAAS--------------------------- 436

Query: 475 SDTALGTRLLENGEENIGQFLYLEGAEYVMYNTYDVHFYSSFALVMLFPKLELSIQRDFA 534
                         + +G+F  LE  +Y  Y + DV  Y SFAL+ L+P L+ S+ R FA
Sbjct: 437 ------------ASDPVGRFGVLECLDYAWYESLDVRLYGSFALLQLWPALDRSVLRSFA 484

Query: 535 AAVMMHD 541
            A+   D
Sbjct: 485 RAIPAAD 491


>gi|449691194|ref|XP_004212588.1| PREDICTED: non-lysosomal glucosylceramidase-like, partial [Hydra
           magnipapillata]
          Length = 169

 Score =  142 bits (359), Expect = 7e-31,   Method: Compositional matrix adjust.
 Identities = 73/162 (45%), Positives = 98/162 (60%), Gaps = 7/162 (4%)

Query: 702 SYFNYDNSDGSSSTSIQADQLAGQWYARACGLLPIADE-----AKVKKALTKIYDFNVLK 756
           +Y+NYD S+      I A Q AGQWY  AC L    D+      +V   L  I+  NV++
Sbjct: 1   TYYNYDGSENDYHNCIMAGQCAGQWYLHACDLGQTEDDLVFRPEQVHSTLKVIFQNNVMR 60

Query: 757 VKGGMCGAMNGMQPDGRIDMSGLQAREIWPGVTYGLAASMIQEEMVDMAFQTAAGVYEVA 816
            + G  GA+NGM+ DG +D + LQA E+W GVTY LAA+MIQ  MV   F+TA G+Y   
Sbjct: 61  FEDGFMGAVNGMRADGTVDKNSLQAEEVWTGVTYALAATMIQHGMVREGFRTAEGIYRTV 120

Query: 817 WSEDGLGYSFQTPESWNNNDEYRSLCYMRPLTIWAMQWALTK 858
           + + GLG  FQTPE+   N  +RS  YMRPL+IW++  A +K
Sbjct: 121 YEKWGLG--FQTPEAITANKTFRSRGYMRPLSIWSIYHAYSK 160


>gi|226444526|gb|ACO57837.1| unknown [Helianthus annuus]
 gi|226444528|gb|ACO57838.1| unknown [Helianthus annuus]
 gi|226444560|gb|ACO57854.1| unknown [Helianthus petiolaris]
 gi|226444568|gb|ACO57858.1| unknown [Helianthus petiolaris]
          Length = 103

 Score =  135 bits (339), Expect = 1e-28,   Method: Compositional matrix adjust.
 Identities = 60/103 (58%), Positives = 76/103 (73%)

Query: 756 KVKGGMCGAMNGMQPDGRIDMSGLQAREIWPGVTYGLAASMIQEEMVDMAFQTAAGVYEV 815
           KV+GG  GA+NGM P+G++D + +Q+RE+W GVTYG+AA+MI   M D AF TA G++  
Sbjct: 1   KVRGGRMGAVNGMHPNGKVDETCMQSREVWTGVTYGVAATMIHAGMEDKAFTTAEGIFIA 60

Query: 816 AWSEDGLGYSFQTPESWNNNDEYRSLCYMRPLTIWAMQWALTK 858
            WSE+G GY+FQTPE W  +  YRSL YMRPL IW MQ AL K
Sbjct: 61  GWSEEGFGYAFQTPEGWTMDGSYRSLVYMRPLAIWGMQQALEK 103


>gi|86144926|ref|ZP_01063258.1| hypothetical protein MED222_10953 [Vibrio sp. MED222]
 gi|85837825|gb|EAQ55937.1| hypothetical protein MED222_10953 [Vibrio sp. MED222]
          Length = 1024

 Score =  135 bits (339), Expect = 2e-28,   Method: Compositional matrix adjust.
 Identities = 150/637 (23%), Positives = 258/637 (40%), Gaps = 109/637 (17%)

Query: 153 KNTDSGIESWDWNLKGENCTYHALFPRAWTVYDGEPDPELRIVCRQISPFIPHNYKESSF 212
           +N D+G      ++ G +  Y AL+P A   Y    D  ++I  + +SP +  + +  S 
Sbjct: 200 ENNDTG------SVNGRDIEYQALYPLAEYKYSSFDD--IKIKRKVVSPIVKEDKRLCSL 251

Query: 213 PASVFTFTLSNSGQTSADVTLLFTWANSVAG--DSGLSGHHFNSKTMTKDGVHGLTLHHR 270
           P     F L+N+   +  +TL     N +      G  G   ++ T++++ +      H+
Sbjct: 252 PMHWNHFALTNNSSQTRVITLAQPLKNLIGSTYQKGRDGIQDSACTLSQNPI---AQQHQ 308

Query: 271 TANGRPPVTFAVAAEETADVHVSECPCFLLSGNSKGITAKDMWNEIKKHGSFD------- 323
           T N                          L G S   T   + ++       +       
Sbjct: 309 TVN--------------------------LKGESHSFTGVQLTSQSPYQSDIEGEVVFGV 342

Query: 324 HLDNDKTSPSEPGSSIGAAIAASLTIPSGSTRSVTFSLAWDCPEVKFFEKVYHRR----- 378
             DN  T   +   S+   + +S T          F+L       +F   +Y  R     
Sbjct: 343 QADNQLTESGKVSISVKPTLYSSKT-----AEQTEFALKTGRTNAEFQTGIYTGREALSA 397

Query: 379 --YTKFYGTLGDSAARIAHDAILEHAKWECEIEAWQRPILEDKRFPEWYPITLFNELYYL 436
               +     G+S   +    ++ H+K    +  W      DK + ++YP T        
Sbjct: 398 LVVVQVELEAGESV-DLRFTQVMAHSK--VMLNGWHS----DKAYTQFYPQT-------- 442

Query: 437 NAGGTIWTDGSPPMQSLA---TIRERKFSLDTPSKIHTPASS---DTALGTRLLENGEEN 490
                +  D  P ++++      ++  F     SKI  P S+    T     L    E  
Sbjct: 443 QPALPMLEDMLPKLETIEQQIVAQQTAFLEQAQSKISQPESALRYATMAMNSLSFLAEST 502

Query: 491 I----GQFLYLEGAEYVMYNTYDVHFYSSFALVMLFPKLELSIQRDFAAAVMMHD----- 541
           +     +FL  E  +Y  +N+ DV+FY SF+L+ L P+L+  + ++F+ A++  D     
Sbjct: 503 VWDKEDKFLVKECVDYPFFNSLDVYFYGSFSLLYLLPELDGCVMKEFSKAILAEDFTERR 562

Query: 542 ------PGTMKIMSDGKWVARKCLGAVPHDIGLDDPW-FEINSYNLFNSSRWKDLNSKFV 594
                     +++ D     R   GAV HD+G   P+  + ++Y+  N   WKDL  K++
Sbjct: 563 YWEYEATPNAELIDDKYQGVRAIRGAVIHDLG--SPFDIQPDAYSWHNVKEWKDLAPKYI 620

Query: 595 LQVYRDFVATGDKNFARAVWPSVYIAMAYMEQFDKDGDGMIENEGFPDQTYDAWSANGVS 654
           L VYR +  T D +  +  W +V  ++ ++     +GD +    G  D T+D  +++G+S
Sbjct: 621 LMVYRHYQNTQDISVVKECWQAVTESIDFLSNLIAEGDDLPLTRG-TDDTFDNLASHGIS 679

Query: 655 AYCGGLWVAALQAASALANDVGDHASASYFWVRYQKAKA-VYDSLWNGS--YFNYDNSDG 711
            YC  LWVA LQAAS LA  +G+   AS +  R +KA A V  SLW+    Y+++     
Sbjct: 680 IYCASLWVAGLQAASELAQLMGESELASGYLTRSKKALATVEQSLWDDKEGYYHF----- 734

Query: 712 SSSTSIQADQLAGQWYA--RACGLLPIADEAKVKKAL 746
              T +QA  L G+ Y      GL+   D    K  L
Sbjct: 735 -FVTPVQAKHLTGEGYQALETLGLILTGDAIADKNTL 770


>gi|226444514|gb|ACO57831.1| unknown [Helianthus annuus]
 gi|226444516|gb|ACO57832.1| unknown [Helianthus annuus]
 gi|226444518|gb|ACO57833.1| unknown [Helianthus annuus]
 gi|226444520|gb|ACO57834.1| unknown [Helianthus annuus]
 gi|226444522|gb|ACO57835.1| unknown [Helianthus annuus]
 gi|226444530|gb|ACO57839.1| unknown [Helianthus annuus]
 gi|226444532|gb|ACO57840.1| unknown [Helianthus annuus]
 gi|226444534|gb|ACO57841.1| unknown [Helianthus annuus]
 gi|226444536|gb|ACO57842.1| unknown [Helianthus annuus]
 gi|226444538|gb|ACO57843.1| unknown [Helianthus annuus]
 gi|226444540|gb|ACO57844.1| unknown [Helianthus annuus]
 gi|226444542|gb|ACO57845.1| unknown [Helianthus annuus]
 gi|226444544|gb|ACO57846.1| unknown [Helianthus annuus]
 gi|226444546|gb|ACO57847.1| unknown [Helianthus petiolaris]
 gi|226444548|gb|ACO57848.1| unknown [Helianthus petiolaris]
 gi|226444550|gb|ACO57849.1| unknown [Helianthus petiolaris]
 gi|226444552|gb|ACO57850.1| unknown [Helianthus petiolaris]
 gi|226444554|gb|ACO57851.1| unknown [Helianthus petiolaris]
 gi|226444556|gb|ACO57852.1| unknown [Helianthus petiolaris]
 gi|226444558|gb|ACO57853.1| unknown [Helianthus petiolaris]
 gi|226444562|gb|ACO57855.1| unknown [Helianthus petiolaris]
 gi|226444564|gb|ACO57856.1| unknown [Helianthus petiolaris]
 gi|226444566|gb|ACO57857.1| unknown [Helianthus petiolaris]
 gi|226444570|gb|ACO57859.1| unknown [Helianthus argophyllus]
 gi|226444572|gb|ACO57860.1| unknown [Helianthus argophyllus]
 gi|226444576|gb|ACO57862.1| unknown [Helianthus argophyllus]
 gi|226444578|gb|ACO57863.1| unknown [Helianthus argophyllus]
 gi|226444580|gb|ACO57864.1| unknown [Helianthus argophyllus]
 gi|226444582|gb|ACO57865.1| unknown [Helianthus argophyllus]
 gi|226444584|gb|ACO57866.1| unknown [Helianthus argophyllus]
 gi|226444586|gb|ACO57867.1| unknown [Helianthus argophyllus]
 gi|226444588|gb|ACO57868.1| unknown [Helianthus argophyllus]
 gi|226444590|gb|ACO57869.1| unknown [Helianthus argophyllus]
 gi|226444592|gb|ACO57870.1| unknown [Helianthus argophyllus]
          Length = 103

 Score =  134 bits (338), Expect = 2e-28,   Method: Compositional matrix adjust.
 Identities = 60/103 (58%), Positives = 76/103 (73%)

Query: 756 KVKGGMCGAMNGMQPDGRIDMSGLQAREIWPGVTYGLAASMIQEEMVDMAFQTAAGVYEV 815
           KV+GG  GA+NGM P+G++D + +Q+RE+W GVTYG+AA+MI   M D AF TA G++  
Sbjct: 1   KVRGGRMGAVNGMHPNGKVDETCMQSREVWTGVTYGVAATMIHAGMEDNAFTTAEGIFIA 60

Query: 816 AWSEDGLGYSFQTPESWNNNDEYRSLCYMRPLTIWAMQWALTK 858
            WSE+G GY+FQTPE W  +  YRSL YMRPL IW MQ AL K
Sbjct: 61  GWSEEGFGYAFQTPEGWTMDGSYRSLVYMRPLAIWGMQQALEK 103


>gi|226444524|gb|ACO57836.1| unknown [Helianthus annuus]
          Length = 103

 Score =  133 bits (334), Expect = 6e-28,   Method: Compositional matrix adjust.
 Identities = 59/101 (58%), Positives = 75/101 (74%)

Query: 756 KVKGGMCGAMNGMQPDGRIDMSGLQAREIWPGVTYGLAASMIQEEMVDMAFQTAAGVYEV 815
           KV+GG  GA+NGM P+G++D + +Q+RE+W GVTYG+AA+MI   M D AF TA G++  
Sbjct: 1   KVRGGRMGAVNGMHPNGKVDETCMQSREVWTGVTYGVAATMIHAGMEDNAFTTAEGIFIA 60

Query: 816 AWSEDGLGYSFQTPESWNNNDEYRSLCYMRPLTIWAMQWAL 856
            WSE+G GY+FQTPE W  +  YRSL YMRPL IW MQ AL
Sbjct: 61  GWSEEGFGYAFQTPEGWTMDGSYRSLVYMRPLAIWGMQQAL 101


>gi|226444574|gb|ACO57861.1| unknown [Helianthus argophyllus]
          Length = 103

 Score =  132 bits (331), Expect = 1e-27,   Method: Compositional matrix adjust.
 Identities = 59/103 (57%), Positives = 75/103 (72%)

Query: 756 KVKGGMCGAMNGMQPDGRIDMSGLQAREIWPGVTYGLAASMIQEEMVDMAFQTAAGVYEV 815
           KV+GG  GA+NGM P+G++D + +Q+RE+W GVTYG+AA+MI   M D AF TA G++  
Sbjct: 1   KVRGGRMGAVNGMHPNGKVDETCMQSREVWTGVTYGVAATMIHAGMEDNAFTTAEGIFIA 60

Query: 816 AWSEDGLGYSFQTPESWNNNDEYRSLCYMRPLTIWAMQWALTK 858
            WSE+G GY+FQTPE W  +  YRSL YMRPL IW  Q AL K
Sbjct: 61  GWSEEGFGYAFQTPEGWTMDGSYRSLVYMRPLAIWGKQQALEK 103


>gi|47196687|emb|CAF89164.1| unnamed protein product [Tetraodon nigroviridis]
          Length = 239

 Score =  131 bits (329), Expect = 2e-27,   Method: Compositional matrix adjust.
 Identities = 69/158 (43%), Positives = 89/158 (56%), Gaps = 24/158 (15%)

Query: 496 YLEGAEYVMYNTYDVHFYSSFALVMLFPKLELSIQRDFAAAVMMHDPGTMKIMSDGKWVA 555
           +  G EY MYNTYDVH+Y+SFAL+ML+PKL LS+Q D A +V+  DP     + +G+   
Sbjct: 1   FFPGQEYRMYNTYDVHYYASFALIMLWPKLALSLQYDIAGSVVQQDPTERLYLMNGRCSP 60

Query: 556 RKCLGAVPHDIG--LDDPWFEINSYNLFNSSRWKDLNSKFVLQVYRDFVATGDKNFARAV 613
            K  G VPHDIG   D+PW                       +VYRDF  T D  + R +
Sbjct: 61  VKTKGVVPHDIGDPEDEPW----------------------QRVYRDFHLTQDAQYLRDM 98

Query: 614 WPSVYIAMAYMEQFDKDGDGMIENEGFPDQTYDAWSAN 651
           WP   +      QFD DGDG+IEN G+ DQTYD W+A+
Sbjct: 99  WPICEMVTESELQFDLDGDGLIENSGYADQTYDGWAAS 136



 Score = 70.5 bits (171), Expect = 4e-09,   Method: Compositional matrix adjust.
 Identities = 31/59 (52%), Positives = 42/59 (71%)

Query: 741 KVKKALTKIYDFNVLKVKGGMCGAMNGMQPDGRIDMSGLQAREIWPGVTYGLAASMIQE 799
           K++ AL  I+D NV+   GG  GA+NGM+P+G  D S +Q+ E+W GV YGLAA+MI E
Sbjct: 180 KIQTALKSIFDLNVMSFAGGQMGAVNGMRPEGVPDRSSVQSDEVWIGVVYGLAATMIHE 238


>gi|261417086|ref|YP_003250769.1| glucosylceramidase [Fibrobacter succinogenes subsp. succinogenes
           S85]
 gi|385791890|ref|YP_005823013.1| hypothetical protein FSU_3272 [Fibrobacter succinogenes subsp.
           succinogenes S85]
 gi|261373542|gb|ACX76287.1| Glucosylceramidase [Fibrobacter succinogenes subsp. succinogenes
           S85]
 gi|302325406|gb|ADL24607.1| conserved domain protein [Fibrobacter succinogenes subsp.
           succinogenes S85]
          Length = 1047

 Score =  130 bits (327), Expect = 3e-27,   Method: Compositional matrix adjust.
 Identities = 147/587 (25%), Positives = 243/587 (41%), Gaps = 120/587 (20%)

Query: 173 YHALFPRAWTVYDGEPDPELRIVCRQISPFIPHNYKESSFPASVFTFTLSNSGQTSADVT 232
           Y AL+P + TVY+G+    + I   Q S   P + + SS P +   FTL N+ + + +VT
Sbjct: 227 YTALYPVSETVYEGKG---VAITKTQSSYVTPGDERLSSLPVNATVFTLENNTKETREVT 283

Query: 233 LLFTWANSVAG-----------DSGL---------SGHHFNSKTMTKDGVHGLTLHHRTA 272
           ++    +S+ G           DS            G  F+ +      V G+  ++  A
Sbjct: 284 IV-QIQDSITGYMAKKDRQGVQDSSFVLVPAARFPKGVQFDKELEDGRSVRGIEFYNEKA 342

Query: 273 ------NGRPPVTFAVAAEETADVHVSECPCF------------LLSGNSKGITAKDMWN 314
                 NG   V+  VA  +  +++VS  P F            L SG   G   K++++
Sbjct: 343 LAESDFNGCMGVS--VAWNKKDNLNVSVKPMFYQDDAKSVLKAALQSGRVAGSWVKNVYS 400

Query: 315 EIKKHGSFDHLDNDKTSPSEPGSSIGAAIAASLTIPSGSTRSVTFSLAWDCPEVKFFEKV 374
                                  +I  AIA +  +      S  F++  D PE+K  +  
Sbjct: 401 --------------------GRETIAGAIAVTAVLKPKQKVSFQFNMVLDFPEIKLNKLT 440

Query: 375 YHRRYTKFYGTLGDSAARIAHDAILEHAKWECEIEAWQRPILEDKRFPEWYPITLFNELY 434
             ++YT FY         +  +A+     ++  ++A++  +          P     +LY
Sbjct: 441 SAKKYTAFYPEAYGRVVALLTEALAADKTFDDRLKAFENLV----------PKKPVAKLY 490

Query: 435 YLNAGGTIWTDGSPPMQSLATIRERKFSLDTPSKIHTPASSDTALGTRLLENGEENIGQF 494
              A            +SLA        ++T S +      D                +F
Sbjct: 491 KTAAK-------QAEFKSLA--------INTLSFLAEATVWDKE-------------DRF 522

Query: 495 LYLEGAEYVMYNTYDVHFYSSFALVMLFPKLELSIQRDFAAAVMMHDPGTMK-------- 546
           L  E A+Y  +N+ DV+FY SF+L+ L P+L+  + + F  A++  +    +        
Sbjct: 523 LVRECADYPFFNSLDVYFYGSFSLLALMPRLDGVVMKRFGDAILAVNNNRRRHHEFVNLP 582

Query: 547 --IMSDGKWVA-RKCLGAVPHDIGLDDPW-FEINSYNLFNSSRWKDLNSKFVLQVYRDFV 602
              + D K    R   GAV HD+G   P+  E ++Y+  N   WKDL  K+VL VYR + 
Sbjct: 583 YADLPDPKLEGPRAVRGAVIHDLG--SPFDAEPDAYDWHNVKEWKDLAPKYVLMVYRHYH 640

Query: 603 ATGDKNFARAVWPSVYIAMAYMEQFDKDGDGMIENEGFPDQTYDAWSANGVSAYCGGLWV 662
            T D         +VY AM Y+E+    G+      G  D T+D   + G+S YCG LW+
Sbjct: 641 KTKDMQCLADCKEAVYAAMEYLEKMVNPGENFPLTHG-TDDTFDNLCSYGISVYCGSLWI 699

Query: 663 AALQAASALANDVGDHASASYFWVRYQKA-KAVYDSLW--NGSYFNY 706
           A L+AA+ +A  +GD+  A+ +  + + A K   +SLW  N  YF +
Sbjct: 700 AGLRAAAKIAELLGDNEQAAKWNEKSEAANKEFTESLWDENEGYFRF 746



 Score = 50.1 bits (118), Expect = 0.005,   Method: Compositional matrix adjust.
 Identities = 38/138 (27%), Positives = 61/138 (44%), Gaps = 18/138 (13%)

Query: 719 ADQLAGQWYARACGLLPIADEAKVKKALTKIYDFNVLKVKGGMCGAMNGMQPDGR-IDMS 777
           AD +    Y R   L PI D AK K  L +I++ N  K    + GA N ++ DG  +D  
Sbjct: 838 ADTMLADTYLRLLDLEPITDSAKAKSNLLRIFNTN-YKANSPLIGAANLVRKDGSPLDEF 896

Query: 778 GLQAREIWPGVTYGLAASM----IQEEMVDMAFQTAAGVYEVAWSEDGLGYSFQTPESWN 833
             QA ++W G+ Y +  +M    +++E   +A      +Y+ A         F  PE +N
Sbjct: 897 NFQAHDVWIGIQYSIMTAMMFHGLEKEASVLADSMIGNLYDEA------RVPFAAPEGFN 950

Query: 834 NNDEYRSLCYMRPLTIWA 851
            +      C + P  + A
Sbjct: 951 GS------CRLHPEALVA 962


>gi|159042199|ref|YP_001541451.1| hypothetical protein Cmaq_1638 [Caldivirga maquilingensis IC-167]
 gi|157921034|gb|ABW02461.1| protein of unknown function DUF608 [Caldivirga maquilingensis
           IC-167]
          Length = 661

 Score =  130 bits (326), Expect = 4e-27,   Method: Compositional matrix adjust.
 Identities = 93/298 (31%), Positives = 140/298 (46%), Gaps = 32/298 (10%)

Query: 515 SFALVMLFPKLELSIQRDFAAAVMMHDPGTMKIMSDGKWVARKCLGAVPHDIG---LDDP 571
           SFAL+ LFP  +L+++ D      +H+                  G VPHD+G   ++DP
Sbjct: 339 SFALLDLFP--DLTVKMDELLGFYIHN------------------GEVPHDLGEESIEDP 378

Query: 572 WFEINSYNLFNSSRWKDLNSKFVLQVYRDFVATGDKNFARAVWPSVYIAMAYMEQFDKDG 631
                 Y       W DL S ++L +YRD++ TG+    R     +   + ++   D DG
Sbjct: 379 -----IYGASYLYPWNDLGSTWILMIYRDYLLTGNVEVLRRNIDKMREVIDWLISRDYDG 433

Query: 632 DGMIENEGFPDQTYDAWSANGVSAYCGGLWVAALQAASALANDVGDHASASYFWVRYQKA 691
           D + ++ G  D +YD  +  G S+Y   L++ +LQA    A  +G   S  Y      K 
Sbjct: 434 DCIPDSRGGFDNSYDGTNMYGASSYIASLFLCSLQAFIKSAEVLGVRLSDRYESC-LSKG 492

Query: 692 KAVYDSLWNGSYFNYDNSDGSSSTSIQADQLAGQWYARACGLLPIADEAKVKKALTKIYD 751
           +   +SLWNG YF    S G+S+ S    QL GQ++     L P+ DE K+K AL  IY+
Sbjct: 493 RETLNSLWNGRYFMAWKSSGNSNESCMNSQLLGQFWCDFLKLPPVVDEDKIKVALRSIYE 552

Query: 752 FNVLKVKGGMCGAMNGMQPDGRIDMSGLQAREIWPGVTYGLAASMIQEEMVDMAFQTA 809
            N    K       N ++P G ID S  Q R  WP V++ + A M+   MV+   + A
Sbjct: 553 LN---HKSSPHCLPNSVKPSGEIDTSSGQMRSCWPRVSFVVTAHMVLRGMVNEGLEIA 607


>gi|148976285|ref|ZP_01813009.1| hypothetical protein VSWAT3_18783 [Vibrionales bacterium SWAT-3]
 gi|145964379|gb|EDK29634.1| hypothetical protein VSWAT3_18783 [Vibrionales bacterium SWAT-3]
          Length = 982

 Score =  129 bits (324), Expect = 8e-27,   Method: Compositional matrix adjust.
 Identities = 80/250 (32%), Positives = 132/250 (52%), Gaps = 24/250 (9%)

Query: 493 QFLYLEGAEYVMYNTYDVHFYSSFALVMLFPKLELSIQRDFAAAVMMHD----------- 541
           +FL  E  +Y  +N+ DV+FY SF+L+ L P+L+  + ++F+ A++  D           
Sbjct: 467 KFLVKECVDYPFFNSLDVYFYGSFSLLYLLPELDGCVMKEFSKAILAEDFTQRRYWEYEA 526

Query: 542 PGTMKIMSDGKWVARKCLGAVPHDIGLDDPW-FEINSYNLFNSSRWKDLNSKFVLQVYRD 600
               +++ +     R   GAV HD+G   P+  + ++Y+  N   WKDL  K++L VYR 
Sbjct: 527 TPNAELIDEKYQGVRAIRGAVIHDLG--SPFDIQPDAYSWHNVKEWKDLAPKYILMVYRH 584

Query: 601 FVATGDKNFARAVWPSVYIAMAYMEQFDKDGDGMIENEGFPDQTYDAWSANGVSAYCGGL 660
           +  T D +  +  W +V  ++ ++     DGD +    G  D T+D  +++G+S YC  L
Sbjct: 585 YQNTQDMSVVKECWQAVTESIDFLSNLIADGDDLPLTRG-TDDTFDNLASHGISIYCASL 643

Query: 661 WVAALQAASALANDVGDHASASYFWVRYQKAKA-VYDSLWNGS--YFNYDNSDGSSSTSI 717
           WVA LQAAS LA  +G+H   + +  R +KA A V  SLW+    Y+++        T +
Sbjct: 644 WVAGLQAASELAKLMGEHDLGAGYLTRSKKALATVEQSLWDEKEGYYHF------FVTPV 697

Query: 718 QADQLAGQWY 727
           QA  L G+ Y
Sbjct: 698 QAKHLTGEGY 707


>gi|332796021|ref|YP_004457521.1| hypothetical protein Ahos_0332 [Acidianus hospitalis W1]
 gi|332693756|gb|AEE93223.1| conserved hypothetical protein [Acidianus hospitalis W1]
          Length = 652

 Score =  129 bits (323), Expect = 1e-26,   Method: Compositional matrix adjust.
 Identities = 97/349 (27%), Positives = 166/349 (47%), Gaps = 43/349 (12%)

Query: 515 SFALVMLFPKLELSIQRDFAAAVMMHDPGTMKIMSDGKWVARKCLGAVPHDIGLDDPWFE 574
           SFAL+  FP+L   +   F   ++                     G VPHD+G +    E
Sbjct: 334 SFALLEYFPELVKKMDEHFGNFIID--------------------GEVPHDLGEES--IE 371

Query: 575 INSYNLFNSSRWKDLNSKFVLQVYRDFVATGDKNFARAVWPSVYIAMAYMEQFDKDGDGM 634
              Y       W DL   ++L +YRD+  T D +F +  +  +   + ++ + D+D DG+
Sbjct: 372 NPIYGASYPYSWNDLGPTWILMIYRDYKFTNDLSFLKRNYNKMKEVIDWLIKKDEDNDGI 431

Query: 635 IENEGFPDQTYDAWSANGVSAYCGGLWVAALQAASALANDVGDHASASYFWVRYQ----K 690
            +++G  D +YD     G S+Y G L++ AL+A       +      SY + +Y+    K
Sbjct: 432 PDSKGGFDNSYDGTYMYGTSSYIGSLFLCALKAF------IESSKILSYDYSKYEEILNK 485

Query: 691 AKAVYDSLWNGSYF-NYDNSDGSSSTSIQADQLAGQWYARACGLLPIADEAKVKKALTKI 749
           AK+  +SLWNG YF N+   D  + + + + QL G+++    GL  + DE K+K AL  I
Sbjct: 486 AKSSLESLWNGRYFINWKYKDQKNESCLNS-QLLGEFWCNLLGLGNVIDEDKIKTALKYI 544

Query: 750 YDFNVLKVKGGMCGAMNGMQPDGRIDMSGLQAREIWPGVTYGLAASMIQEEMVDMAFQTA 809
           Y+ N    K      +N + PDG ID S  Q +  WP +++ +A+ MI + M++   + A
Sbjct: 545 YEHN---GKASKYCLVNSVNPDGSIDESTDQMKSCWPRISFAIASHMIMKGMINEGIEIA 601

Query: 810 AGVYEVAWSEDGLGYSFQTPESWN--NNDEYRSLCYMRPLTIWAMQWAL 856
              +E   S     Y +  P   N  N + +    Y+  L+I+ +++A+
Sbjct: 602 RKEWETISSR----YPYNQPSKINAFNGEHFGLPYYIGSLSIYLVKYAI 646


>gi|156333782|ref|XP_001619412.1| hypothetical protein NEMVEDRAFT_v1g224207 [Nematostella vectensis]
 gi|156202556|gb|EDO27312.1| predicted protein [Nematostella vectensis]
          Length = 173

 Score =  128 bits (322), Expect = 1e-26,   Method: Compositional matrix adjust.
 Identities = 62/108 (57%), Positives = 78/108 (72%), Gaps = 2/108 (1%)

Query: 757 VKGGMCGAMNGMQPDGRIDMSGLQAREIWPGVTYGLAASMIQEEMVDMAFQTAAGVYEVA 816
            K G  GA+NG++PDG++D S LQA E+W GVTY +AASMIQE +VD  F+TA+G+Y   
Sbjct: 49  TKEGTMGAVNGIRPDGQLDTSSLQAEEVWTGVTYAVAASMIQEGLVDEGFKTASGIYNTC 108

Query: 817 WSEDGLGYSFQTPESWNNNDEYRSLCYMRPLTIWAMQWALTKPKLSRQ 864
           +    LG +FQTPE+   N  YRSL YMRPL+IWAMQWAL K K  R+
Sbjct: 109 FER--LGMNFQTPEAIVANGNYRSLAYMRPLSIWAMQWALEKRKNKRE 154



 Score = 61.2 bits (147), Expect = 3e-06,   Method: Compositional matrix adjust.
 Identities = 24/45 (53%), Positives = 35/45 (77%)

Query: 571 PWFEINSYNLFNSSRWKDLNSKFVLQVYRDFVATGDKNFARAVWP 615
           PW  +N+Y++ ++S+WKDLN KFVLQVYRD+V T D  + + +WP
Sbjct: 3   PWDHVNAYHIHDTSKWKDLNLKFVLQVYRDYVFTNDVYYLQDMWP 47


>gi|417950561|ref|ZP_12593680.1| hypothetical protein VISP3789_13285 [Vibrio splendidus ATCC 33789]
 gi|342806239|gb|EGU41472.1| hypothetical protein VISP3789_13285 [Vibrio splendidus ATCC 33789]
          Length = 1046

 Score =  128 bits (321), Expect = 2e-26,   Method: Compositional matrix adjust.
 Identities = 87/274 (31%), Positives = 142/274 (51%), Gaps = 25/274 (9%)

Query: 493 QFLYLEGAEYVMYNTYDVHFYSSFALVMLFPKLELSIQRDFAAAVMMHDPGTMKI----- 547
           +FL  E  +Y  +N+ DV+FY SF+L+ L P+L+  + ++F+ A++  D    +      
Sbjct: 531 KFLVKECVDYPFFNSLDVYFYGSFSLLYLLPELDGCVMKEFSKAILAEDFTQRRYWEYEA 590

Query: 548 -----MSDGKWV-ARKCLGAVPHDIGLDDPW-FEINSYNLFNSSRWKDLNSKFVLQVYRD 600
                + D K+   R   GAV HD+G   P+  + ++Y+  N   WKDL  K++L VYR 
Sbjct: 591 TPNAELIDEKYQGVRAIRGAVIHDLG--SPFDIQPDAYSWHNVKEWKDLAPKYILMVYRH 648

Query: 601 FVATGDKNFARAVWPSVYIAMAYMEQFDKDGDGMIENEGFPDQTYDAWSANGVSAYCGGL 660
           +  T D +  +  W +V  ++ ++     +GD +    G  D T+D  +++G+S YC  L
Sbjct: 649 YQKTQDISVVKECWQAVTESIDFLSNLIAEGDDLPLTRG-TDDTFDNLASHGISIYCASL 707

Query: 661 WVAALQAASALANDVGDHASASYFWVRYQKAKA-VYDSLWNGS--YFNYDNSDGSSSTSI 717
           WVA LQAAS LA  +G+H   + +  R +KA A V  SLW+    Y+++        T +
Sbjct: 708 WVAGLQAASELAKLMGEHDLGAGYLTRSKKALATVEQSLWDEKEGYYHF------FVTPV 761

Query: 718 QADQLAGQWYARACGL-LPIADEAKVKKALTKIY 750
           QA  L G+ Y     L L +  +A V K +   Y
Sbjct: 762 QAKHLTGEGYQALETLGLVLTGDAIVDKNMLNDY 795


>gi|407069309|ref|ZP_11100147.1| hypothetical protein VcycZ_07137 [Vibrio cyclitrophicus ZF14]
          Length = 1024

 Score =  127 bits (320), Expect = 2e-26,   Method: Compositional matrix adjust.
 Identities = 145/613 (23%), Positives = 248/613 (40%), Gaps = 119/613 (19%)

Query: 165 NLKGENCTYHALFPRAWTVYDGEPDPELRIVCRQISPFIPHNYKESSFPASVFTFTLSNS 224
           ++ G +  Y AL+P A   Y    D  + I  + +SP +  N +  S P     F L+N+
Sbjct: 206 SVNGRDIEYQALYPLAEYKYSSFDD--INIKRKVVSPIVKENKRLCSLPMHWNHFELTNN 263

Query: 225 GQTSADVTLLFTWANSVAG--DSGLSGHHFNSKTMTKDGVHGLTLHHRTAN--------- 273
              +  +TL     N +      G  G   ++ T++++ +      H+T N         
Sbjct: 264 SSQTRVITLAQPLQNLIGSTYQKGRDGIQDSACTLSQNPI---AQQHQTVNLKGESHSFT 320

Query: 274 -----GRPP--------VTFAVAAE----ETADVHVSECPCFLLSGNSKGITAKDMWNEI 316
                 + P        V F V A+    E+  V VS  P F  S  +K    +      
Sbjct: 321 GVQLSSQSPYQSDIEGEVVFGVQADNRLTESGKVSVSVKPTFYTSKTTK--QTEFALKTG 378

Query: 317 KKHGSFDHLDNDKTSPSEPGSSIGAAIAASLTIPSGSTRSVTFSLAWDCPEVKFFEKVYH 376
           + +  F      +T       ++ A +   + + +G +  + F+      +V        
Sbjct: 379 RTNTEF------QTGIYTGREALSALVVVQVELEAGESVDLRFAQVMAHSKVMLNGWHSE 432

Query: 377 RRYTKFYGTLGDSAARIAHDAILEHAKWECEIEAWQRPILEDKRF----PE---WYPITL 429
           + YT+FY      A  +  D + E    E +I   Q   LE  +     P+    Y    
Sbjct: 433 KAYTQFYPQ-AKPALLMLEDVLPELEAIEQQIVKQQTAFLEQAQSKISQPDSALRYATMA 491

Query: 430 FNELYYLNAGGTIWTDGSPPMQSLATIRERKFSLDTPSKIHTPASSDTALGTRLLENGEE 489
            N L +L A  T+W             +E KF                            
Sbjct: 492 MNSLSFL-AESTVWD------------KEDKF---------------------------- 510

Query: 490 NIGQFLYLEGAEYVMYNTYDVHFYSSFALVMLFPKLELSIQRDFAAAVMMHD-------- 541
                L  E  +Y  +N+ DV+FY SF+L+ L P+L+  + ++F+ A++  D        
Sbjct: 511 -----LVKECVDYPFFNSLDVYFYGSFSLLYLLPELDGCVMKEFSKAILAEDFTQRRYWE 565

Query: 542 ---PGTMKIMSDGKWVARKCLGAVPHDIGLDDPW-FEINSYNLFNSSRWKDLNSKFVLQV 597
                  +++ +     R   GAV HD+G   P+  + ++Y+  N   WKDL  K++L V
Sbjct: 566 YEATPNAELIDEKYQGVRAIRGAVIHDLG--SPFDIQPDAYSWHNVKEWKDLAPKYILMV 623

Query: 598 YRDFVATGDKNFARAVWPSVYIAMAYMEQFDKDGDGMIENEGFPDQTYDAWSANGVSAYC 657
           YR +  T D +  +  W +V  ++ ++     +GD +    G  D T+D  +++G+S YC
Sbjct: 624 YRHYQNTQDLSVVKECWQAVTESIEFLSNLVAEGDDLPLTRG-TDDTFDNLASHGISIYC 682

Query: 658 GGLWVAALQAASALANDVGDHASASYFWVRYQKAKA-VYDSLWNGS--YFNYDNSDGSSS 714
             LWVA LQAA+ LA  + ++  A+ +  R +KA A V  SLW+    Y+++        
Sbjct: 683 ASLWVAGLQAANELALLMNENELAADYLARSKKALATVEQSLWDDKEGYYHF------FV 736

Query: 715 TSIQADQLAGQWY 727
           T +QA  L GQ Y
Sbjct: 737 TPVQAKHLTGQGY 749


>gi|422908697|ref|ZP_16943374.1| hypothetical protein VCHE09_0171 [Vibrio cholerae HE-09]
 gi|341639159|gb|EGS63785.1| hypothetical protein VCHE09_0171 [Vibrio cholerae HE-09]
          Length = 981

 Score =  127 bits (320), Expect = 2e-26,   Method: Compositional matrix adjust.
 Identities = 164/663 (24%), Positives = 264/663 (39%), Gaps = 124/663 (18%)

Query: 114 HGICDDAPVLANQFSVFVSRPNGEKFSSVLCPRSPGVPKKNTDSGIESWDWNLKGENCTY 173
             I +D   L  Q  V +   NG   ++    R+  +   N  S +ES D      +  Y
Sbjct: 118 QAITNDPSSLETQLLVLIDFFNGLLVNN--SARALSL-TANNGSSVESID----PSDIEY 170

Query: 174 HALFPRAWTVYDGEPDPELRIVCRQ-ISPFIPHNYKESSFPASVFTFTLSNSGQTSADVT 232
            AL+P A   Y G    E   + RQ ISP + ++    S P     F +SN    +  VT
Sbjct: 171 RALYPIAEYEYQG---LESVALTRQVISPIVKNDPMLCSLPMHWNHFEMSNRSDKTQLVT 227

Query: 233 LLFTWANSVAG-----------------DSGLSGHHFNSKTMTKDGVHGLTLHHRTANGR 275
           L+    N +                    + +S  H   + +T  G  G   H  +    
Sbjct: 228 LVQPLRNLIGSTYKKAREGVQDSCCHLVQNAISQQH---QPVTISGA-GFEFHGVSMTSN 283

Query: 276 PPVTFAVAAEETADVHVSECPCFLLSGNSKGITA------------KDMWNEIKKHGSFD 323
            P    ++ E    VH          G ++ +T             + +   +K   +  
Sbjct: 284 SPYDGDISGEVLYGVHTQ-----FEGGAAEQVTVTVKPQVYCSKADQQVERALKTGRTNA 338

Query: 324 HLDNDKTSPSEPGSSIGAAIAASLTIPSGSTRSVTFSLAWDCPEVKF----FEKVYHRRY 379
           H D    S  EP   + A +   +T+  G T ++ F+   D  ++       EK Y R  
Sbjct: 339 HFDRGIYSGREP---LSALVCVQVTLKPGETIALRFAQVMDHSKILLDGWESEKAYAR-- 393

Query: 380 TKFYGTLGDSAARIAHDAILEHAKWECEIEAWQRPILEDKRFPEWYPITLFNELYYLNAG 439
              Y      A  I +  +   ++ E  I + QR  L+                   NA 
Sbjct: 394 ---YFDAQSRAQSILNQTLPRLSEIEQNIISQQREFLD-------------------NAS 431

Query: 440 GTIWTDGSPPMQSLATIRERKFSLDTPSKIHTPASSDTALGTRLLENGEENIGQFLYLEG 499
            T + D    ++  AT+     S    S +                  +EN+   L  E 
Sbjct: 432 QT-FVDNDSALR-FATMAMNSLSFLAESTVW----------------DKENV--LLVKEC 471

Query: 500 AEYVMYNTYDVHFYSSFALVMLFPKLELSIQRDFAAAVMMHDPGTMKI----------MS 549
            +Y  +N+ DV+FY SF+L+ L P+L+  + + FA A++  D    +           + 
Sbjct: 472 VDYPFFNSLDVYFYGSFSLLYLLPELDGCVMKHFAKAILNSDEKLRRYWEYEDKPYAELV 531

Query: 550 DGKWV-ARKCLGAVPHDIGLDDPW-FEINSYNLFNSSRWKDLNSKFVLQVYRDFVATGDK 607
           D K+  +R   GAV HD+G   P+  + ++Y+  N   WKDL  KF+L VYR +  TGD+
Sbjct: 532 DEKYQGSRAERGAVIHDLG--SPFDIDPDAYSWHNVKEWKDLAPKFILMVYRHWQKTGDR 589

Query: 608 NFARAVWPSVYIAMAYMEQFDKDGDGMIENEGFPDQTYDAWSANGVSAYCGGLWVAALQA 667
               + W SV  ++ Y+    +DGD +    G  D T+D  +++GVS YCG LWVA L+A
Sbjct: 590 EVVESCWSSVKESIDYLINLIEDGDTLPLTRG-TDDTFDNLASHGVSIYCGSLWVAGLRA 648

Query: 668 ASALANDVGDHASASYFWVRYQKA-KAVYDSLWN--GSYFNYDNSDGSSSTSIQADQLAG 724
           AS LA  VG+   A  +  R + A + +   LW+    Y+++        T +QA  L G
Sbjct: 649 ASELALLVGEEELAQGYTQRSKDALETLERGLWDEQKGYYHF------FITPVQAKHLTG 702

Query: 725 QWY 727
             +
Sbjct: 703 NGF 705


>gi|15921102|ref|NP_376771.1| hypothetical protein ST0868 [Sulfolobus tokodaii str. 7]
 gi|15621887|dbj|BAB65880.1| putative glycosidase [Sulfolobus tokodaii str. 7]
          Length = 683

 Score =  127 bits (319), Expect = 3e-26,   Method: Compositional matrix adjust.
 Identities = 176/733 (24%), Positives = 284/733 (38%), Gaps = 128/733 (17%)

Query: 86  DQGIPLGGIGAGSIGRSYRGEFQRFKLFHGICDDAPVLANQFSVFVSRPNGEKFSSVLCP 145
           + G+PLGGIGAG I  S +G      + +        +   F VF+   + E F      
Sbjct: 10  EAGVPLGGIGAGKIEISNKGRMINLTIANNWSFPIKEMLG-FHVFILPNDSEPFF----- 63

Query: 146 RSPGVPKKNTDSGIESWDWNLKGENCTYHALFPRAWTVYDGEPDPELRIVCRQISPFIPH 205
                      S +   D N       Y   +P A  +  G  +  ++      S  IP 
Sbjct: 64  ---------LQSDLIFLDLNKLAVPLEYEGKYPFA--MIKGVKN-SVKAELEAFSALIPE 111

Query: 206 NYKESSFPASVFTFTLSNSGQTSADVTLLFTWANSVAGDSGLSGHHFNSKTMTKDGVHGL 265
           N  +SS PA   +  +S S      +++      ++ G S +  ++   K       +G+
Sbjct: 112 NLYDSSLPAVGISVKVSGSKSGLIAISM-----GNITGISKIGRYNEGLK-------NGI 159

Query: 266 TLHHRTANGRPPVTF-AVAAEETADVHVSECPCFLLSGNSKGITAKDMWNEIKKHGSFDH 324
            + +  AN   P     V   E+    V +    +  G  K +    +   I+    +  
Sbjct: 160 IMKNSKANDLDPYNGETVLVTESPKAIVKQYNFHVNRGLEKTLNLHRL---IENEKPWMD 216

Query: 325 LDNDKTSPSEPGSSIGAAIAASLTIPSG-------STRSVTFSLAWDCPEVKFFEK--VY 375
           L + K   SE G S G     S  +P+G           V F  +W      FF K  VY
Sbjct: 217 LMSGKIPESEDGESTG-----SYYLPAGMIIDEYEKNEEVKFVFSW------FFNKPWVY 265

Query: 376 HRRYTKFYGTLGDSAARIAHDAILEHAKWECEIEAWQRPILEDKRFPEWYPITLFNELYY 435
           +  Y  +Y     SA  +A   +    + + +   W   ++ D   P+W    + N  Y 
Sbjct: 266 YP-YKHYYSNFFSSAKEVASYFLENFDRLKEKTRRWHEDLI-DPSLPDWLSDAIINSTYI 323

Query: 436 LNAGGTIWTDGSPPMQSLATIRERKFSLDTPSKIHTPASSDTALGTRLLENGEENIGQFL 495
           L++  + W D            + +F +             T +GT L       IG   
Sbjct: 324 LSS--STWLD-----------EKGRFGI----------MEGTQVGTML-----STIGGVC 355

Query: 496 YLEGAEYVMYNTYDVHFYSSFALVMLFPKLELSIQRDFAAAVMMHDPGTMKIMSDGKWVA 555
           Y  G               S  +V++FP LE S    F    M  D              
Sbjct: 356 YETG---------------SLPVVLMFPMLEKSTIEQFILN-MRED-------------- 385

Query: 556 RKCLGAVPHDIGLDDPWFEINSYNLFNSSRWKDLNSKFVLQVYRDFVATGDKNFARAVWP 615
               G +PHD+G     F+  S       +WKD N+ FVL VYR ++ T DK F ++V+P
Sbjct: 386 ----GYIPHDLGTYS--FDAPSDGTTAPPKWKDTNTTFVLMVYRYYLRTKDKEFLKSVYP 439

Query: 616 SVYIAMAYMEQFDKDGDGMIENEGFPDQTYDAWSANGVSAYCGGLWVAALQAASALANDV 675
            V  AM+++   DKDGDG+ E +G  DQ +D     GV +Y   +++AAL+A   +A  V
Sbjct: 440 YVKKAMSWIISKDKDGDGLPEVDGSTDQGFDCVPIEGVCSYISTVYIAALEAMIKIAEIV 499

Query: 676 GDHASASYFWVRYQKAKAVYDSLWNGSYFNYDNSDGSSSTSIQADQLAGQWYARACGLLP 735
           GD  S     +   K+  +   L++G  F       +   ++ + Q+ GQW+A    L  
Sbjct: 500 GDSTSYYSSLLSKAKSSLM--KLFDGKKFIPWTGKPNHHKAVFSAQIFGQWWAYLLDLDV 557

Query: 736 IADEAKVKKALTKIYDFNVLKVKGGMCGAMNGMQPDGRIDMSGLQAREIWPGVTYGLAA- 794
           +AD + +  AL +IY  N    K   C      + +   D+   Q    WP + + L+A 
Sbjct: 558 VADRSAILSALDEIYRVNGHASK--YCTPNMAKEGESLDDLDS-QLTSSWPRLVFSLSAL 614

Query: 795 --SMIQEEMVDMA 805
             S+ + E +D+A
Sbjct: 615 GYSLGKREWLDIA 627


>gi|84386772|ref|ZP_00989797.1| hypothetical protein V12B01_19101 [Vibrio splendidus 12B01]
 gi|84378300|gb|EAP95158.1| hypothetical protein V12B01_19101 [Vibrio splendidus 12B01]
          Length = 908

 Score =  126 bits (317), Expect = 5e-26,   Method: Compositional matrix adjust.
 Identities = 80/250 (32%), Positives = 130/250 (52%), Gaps = 24/250 (9%)

Query: 493 QFLYLEGAEYVMYNTYDVHFYSSFALVMLFPKLELSIQRDFAAAVMMHD----------- 541
           +FL  E  +Y  +N+ DV+FY SF L+ L P+L+  + ++F+ A++  D           
Sbjct: 393 KFLVKECVDYPFFNSLDVYFYGSFTLLYLLPELDGCVMKEFSKAILAEDFTERRYWEYEA 452

Query: 542 PGTMKIMSDGKWVARKCLGAVPHDIGLDDPW-FEINSYNLFNSSRWKDLNSKFVLQVYRD 600
               +++ D     R   GAV HD+G   P+  + ++Y+  N   WKDL  K++L VYR 
Sbjct: 453 TPNAELIDDKYQGVRAIRGAVIHDLG--SPFDIQPDAYSWHNVKEWKDLAPKYILMVYRH 510

Query: 601 FVATGDKNFARAVWPSVYIAMAYMEQFDKDGDGMIENEGFPDQTYDAWSANGVSAYCGGL 660
           +  T D +  +  W +V  ++ ++     +GD +    G  D T+D  +++G+S YC  L
Sbjct: 511 YQNTQDISVVKECWQAVTESIDFLSNLIAEGDDLPLTRG-TDDTFDNLASHGISIYCASL 569

Query: 661 WVAALQAASALANDVGDHASASYFWVRYQKAKA-VYDSLWNGS--YFNYDNSDGSSSTSI 717
           WVA LQAAS LA  +G+   A+ +  R +KA A V  SLW+    Y+++        T +
Sbjct: 570 WVAGLQAASELAQLMGESERANGYLTRSKKALATVEQSLWDDKEGYYHF------FVTPV 623

Query: 718 QADQLAGQWY 727
           QA  L G  Y
Sbjct: 624 QAKHLTGAGY 633


>gi|218677125|ref|YP_002395944.1| hypothetical protein VS_II1384 [Vibrio splendidus LGP32]
 gi|218325393|emb|CAV27494.1| hypothetical protein VS_II1384 [Vibrio splendidus LGP32]
          Length = 1069

 Score =  126 bits (317), Expect = 6e-26,   Method: Compositional matrix adjust.
 Identities = 84/274 (30%), Positives = 139/274 (50%), Gaps = 25/274 (9%)

Query: 493 QFLYLEGAEYVMYNTYDVHFYSSFALVMLFPKLELSIQRDFAAAVMMHD----------- 541
           +FL  E  +Y  +N+ DV+FY SF+L+ L P+L+  + ++F+ A++  D           
Sbjct: 554 KFLVKECVDYPFFNSLDVYFYGSFSLLYLLPELDGCVMKEFSKAILAEDFTQRRYWEYEA 613

Query: 542 PGTMKIMSDGKWVARKCLGAVPHDIGLDDPW-FEINSYNLFNSSRWKDLNSKFVLQVYRD 600
               +++ D     R   GAV HD+G   P+  + ++Y+  N   WKDL  K++L VYR 
Sbjct: 614 TPNAELIDDKYQGVRAIRGAVIHDLG--SPFDIQPDAYSWHNVKEWKDLAPKYILMVYRH 671

Query: 601 FVATGDKNFARAVWPSVYIAMAYMEQFDKDGDGMIENEGFPDQTYDAWSANGVSAYCGGL 660
           +  T D +  +  W +V  ++ ++     +GD +    G  D T+D  +++G+S YC  L
Sbjct: 672 YQNTQDISVVKECWQAVTESIDFLSNLIAEGDDLPLTRG-TDDTFDNLASHGISIYCASL 730

Query: 661 WVAALQAASALANDVGDHASASYFWVRYQKAKA-VYDSLWNGS--YFNYDNSDGSSSTSI 717
           WVA LQAAS LA  + +   A+ +  R +KA A V  SLW+    Y+++        T +
Sbjct: 731 WVAGLQAASELAKLMNESELAAGYLTRSKKALATVEQSLWDDKEGYYHF------FVTPV 784

Query: 718 QADQLAGQWYARACGL-LPIADEAKVKKALTKIY 750
           QA  L G+ Y     L L +  +A   K +   Y
Sbjct: 785 QAKHLTGEGYQALETLGLSLTGDAIADKTMLNTY 818


>gi|159041841|ref|YP_001541093.1| hypothetical protein Cmaq_1276 [Caldivirga maquilingensis IC-167]
 gi|157920676|gb|ABW02103.1| protein of unknown function DUF608 [Caldivirga maquilingensis
           IC-167]
          Length = 688

 Score =  125 bits (314), Expect = 1e-25,   Method: Compositional matrix adjust.
 Identities = 164/683 (24%), Positives = 260/683 (38%), Gaps = 134/683 (19%)

Query: 88  GIPLGGIGAGSIGRSYRGEFQRFKLFHGICDDAPVLANQFSVFVSRPNGEKFSSVLCPRS 147
           GIPLGGIG+G +  S  G     +  +        L   F +F+ +P+      + C   
Sbjct: 13  GIPLGGIGSGGVEVSNDGRLINARFANNWAYPIRDLRG-FHIFI-KPHDASGFFMHC--- 67

Query: 148 PGVPKKNTDSGIESWDWNLKGENCTYHALFPRAW--TVYDGEPDPELRIVCRQISPFIPH 205
                +    G+E      +G    +   +P AW     +G     + +     SP IP 
Sbjct: 68  -----RVNVLGLEG-----RGALIGFEGRWPFAWLRAFRNG-----VNVEVEAFSPIIPG 112

Query: 206 NYKESSFPASVFTFTLSNSGQTSADVTLLFTWANSVAGDSGLSGHHFNSKTMTKDGVHGL 265
           N K+S+ P   FT  +  S   +A         N +    G      N          G+
Sbjct: 113 NLKDSTLPVIGFTIRVKGSDALAAVSVPNVVGTNPI----GRINRSING---------GV 159

Query: 266 TLHHRTANGRPPVTFAVAAEETADVHVSECPCFLLSGNSKGITAKDMWNEIKKHGSFDH- 324
              +  A    P    +A        ++E P F ++  +     +         G+F++ 
Sbjct: 160 LFTNNKAPDNDPAKGNIAL-------ITEEPRFTITQYNINSKPEHALKARTWKGAFENP 212

Query: 325 ---LDNDK---TSPSEPGSSIG----AAIAASLTIPSGSTRSVTFSLAWDCPEVKFFEKV 374
              L  DK    +  EP    G     A   +L +P+G    V F+L+W      FF   
Sbjct: 213 EPWLTIDKGGVPTGEEPHEVTGLWDDPAGLIALNVPNGG--EVRFTLSW------FFNGR 264

Query: 375 YHR-RYTKFYGTLGDSAARIAHDAILEHAKWECEIEAWQRPILEDKRFPEWYPITLFNEL 433
           +H   Y  +Y      ++ +A   + E  +       WQ  ++ D   P+W    + N  
Sbjct: 265 WHLYNYGHYYENFFKDSSEVARYVLDEFDRLRTSTLDWQNSLI-DPALPDWLRDAVVNST 323

Query: 434 YYLNAGGTIWTDGSPPMQSLATIRERKFSLDTPSKIHTPASSDTALGTRLLENGEENIGQ 493
           Y L               S    R+ +FS+                    LE G E    
Sbjct: 324 YILTT-------------STWLTRDGRFSI--------------------LE-GVEVCPC 349

Query: 494 FLYLEGAEYVMYNTYDVHFYSSFALVMLFPKLELSIQRDFAAAVMMHDPGTMKIMSDGKW 553
              L GA Y            S  +V++FP+LE S+ R F  A+                
Sbjct: 350 HGTLAGACYET---------GSLPVVLMFPELEKSLLRQFTEAM---------------- 384

Query: 554 VARKCLGAVPHDIGLDDPWFEINSYNLFNSS--RWKDLNSKFVLQVYRDFVATGDKNFAR 611
              +  G +PH +G+    + ++      ++  RWKDLNS ++L V+R F  + D  F +
Sbjct: 385 ---RSDGYIPHSLGI----YSLDHIEDGTTAPPRWKDLNSTYILLVHRYFKRSNDVEFIK 437

Query: 612 AVWPSVYIAMAYMEQFDKDGDGMIENEGFPDQTYDAWSANGVSAYCGGLWVAALQAASAL 671
            ++P +  A  ++   DKDGDG+ E  G  D  +DA S  G  +Y   LW+AAL     L
Sbjct: 438 EIYPKLIKAFEWVLVQDKDGDGVPELSGDGDTGFDAMSVKGFDSYTTSLWIAALMVMGEL 497

Query: 672 ANDVGDHASASYFWVRYQKAKAVYDSLWNGSYFN-YDNSDGSSSTSIQADQLAGQWYARA 730
           A  +GD A+ S       KA+  Y+  W G  F  +D  D   ++ +   Q+ G+W++  
Sbjct: 498 AKLMGDQATLSKVESTLLKARDSYNRRWLGDRFKAWDEPDMGKASFLA--QIWGEWWSLM 555

Query: 731 CGLLPIADEAKVKKALTKIYDFN 753
            GL  I DE KVK A+  I   N
Sbjct: 556 LGLGHITDEDKVKAAMGTIIRVN 578


>gi|119578750|gb|EAW58346.1| glucosidase, beta (bile acid) 2, isoform CRA_b [Homo sapiens]
          Length = 370

 Score =  122 bits (307), Expect = 8e-25,   Method: Compositional matrix adjust.
 Identities = 81/227 (35%), Positives = 120/227 (52%), Gaps = 17/227 (7%)

Query: 52  LAHIGFRLYRY----SKQEEAKGKVPVFNVFRKHHITSDQGIPLGGIGAGSIGRSYRGEF 107
           + HIG  L RY     ++   + K P  ++     +    G PLGGIG G+I R +RG+F
Sbjct: 112 IKHIGMGL-RYLQWWYRKTHVEKKTPFIDMINSVPLRQIYGCPLGGIGGGTITRGWRGQF 170

Query: 108 QRFKLFHGICDDAPVLANQFSVFVSRPNGEKFSSVLCPRSPGVPKKNTDSGIESWDWNLK 167
            R++L  G+     V+A+QF+V + R     +  VL    P V        + SW+W L 
Sbjct: 171 CRWQLNPGMYQHRTVIADQFTVCLRREGQTVYQQVLSLERPSV--------LRSWNWGLC 222

Query: 168 GENCTYHALFPRAWTVYDGEPDPELRIVCRQISPFIPHNYKESSFPASVFTFTLSNSGQT 227
           G    YHAL+PRAWTVY   P   + + CRQI+P +PH+Y++SS P  VF + + N G  
Sbjct: 223 GYFAFYHALYPRAWTVYQ-LPGQNVTLTCRQITPILPHDYQDSSLPVGVFVWDVENEGDE 281

Query: 228 SADVTLLFTWANSVAGDSGLSGHHFNSK-TMTKDG--VHGLTLHHRT 271
           + DV+++F+  N + G     G  +N    + + G  V GL LHH T
Sbjct: 282 ALDVSIMFSMRNGLGGGDDAPGGLWNEPFCLERSGETVRGLLLHHPT 328


>gi|15922862|ref|NP_378531.1| hypothetical protein ST2526 [Sulfolobus tokodaii str. 7]
 gi|15623653|dbj|BAB67640.1| putative glycosidase [Sulfolobus tokodaii str. 7]
          Length = 646

 Score =  121 bits (303), Expect = 2e-24,   Method: Compositional matrix adjust.
 Identities = 86/306 (28%), Positives = 140/306 (45%), Gaps = 25/306 (8%)

Query: 560 GAVPHDIGLDDPWFEINSYNLFNSSRWKDLNSKFVLQVYRDFVATGDKNFARAVWPSVYI 619
           G VPHD+G +    E   Y       W DL S +VL +YRD+  T D  F +  +  +  
Sbjct: 354 GEVPHDLGEES--IEAPIYGASYGYPWTDLGSTWVLMIYRDYKFTNDLAFLKRNYRKMKE 411

Query: 620 AMAYMEQFDKDGDGMIENEGFPDQTYDAWSANGVSAYCGGLWVAALQAASALANDVGDHA 679
            + ++   DKD D + +++G  D +YD     G S+Y G +++ AL+A  + +  +G   
Sbjct: 412 VIDWLISLDKDKDCIPDSKGGFDNSYDGTYMYGASSYVGSMFLCALRAFISASKILGMEY 471

Query: 680 SASYFWVRYQKAKAVYDSLWNGSYFNYDNSDGSSSTSIQADQLAGQWYARACGLLPIADE 739
           S     +R  +    ++SLWNG YF    SD     S  + Q+ GQ++    GL PI DE
Sbjct: 472 SIYEDCLR--RGIMTFNSLWNGKYFIAWKSDNRKKESCMSSQILGQFWCDILGLEPIIDE 529

Query: 740 AKVKKALTKIYDFNVLKVKGGMCGAMNGMQPDGRIDMSGLQAREIWPGVTYGLAASMIQE 799
            K+ +AL  IY+ N    K      +N + PDG ID    Q R  W  V + ++A MI  
Sbjct: 530 DKIVQALRSIYELN---GKASKFCLVNSVNPDGSIDTETDQMRSCWSRVAFAVSAHMIIR 586

Query: 800 EMVDMAFQTAAGVYEVAWSEDGLGYSFQTPESWNNNDEYRSLC--------YMRPLTIWA 851
            + +   + A   +E          + ++   WN +     +         Y+   + W 
Sbjct: 587 GLKNEGIEIANREWE----------TIKSLGKWNQSSRIDGMTGNKVGLPYYIGSASTWF 636

Query: 852 MQWALT 857
           +++ALT
Sbjct: 637 IKFALT 642


>gi|15899397|ref|NP_344002.1| hypothetical protein SSO2674 [Sulfolobus solfataricus P2]
 gi|384433011|ref|YP_005642369.1| hypothetical protein [Sulfolobus solfataricus 98/2]
 gi|13815992|gb|AAK42792.1| Conserved hypothetical protein [Sulfolobus solfataricus P2]
 gi|261601165|gb|ACX90768.1| protein of unknown function DUF608 [Sulfolobus solfataricus 98/2]
          Length = 648

 Score =  119 bits (299), Expect = 7e-24,   Method: Compositional matrix adjust.
 Identities = 79/273 (28%), Positives = 131/273 (47%), Gaps = 12/273 (4%)

Query: 545 MKIMSDGKWVARKCLGAVPHDIG---LDDPWFEINSYNLFNSSRWKDLNSKFVLQVYRDF 601
           + I +D  ++     G  PHD+G   ++DP +   SY  +    W DL    VL +YRD+
Sbjct: 341 LVISADNYFMNYVNYGETPHDVGEESIEDPIYGA-SYPYW----WTDLGPTLVLMLYRDY 395

Query: 602 VATGDKNFARAVWPSVYIAMAYMEQFDKDGDGMIENEGFPDQTYDAWSANGVSAYCGGLW 661
           V T ++      +  +   + ++ + D D D + +++G  D +YD     G S+Y   ++
Sbjct: 396 VFTSNREILEKNYNKIKEIIDWLIRKDMDNDCIPDSKGGYDNSYDGTHMYGASSYIASMF 455

Query: 662 VAALQAASALANDVGDHASASYFWVRYQKAKAVYDSLWNGSYFNYDNSDGSSSTSIQADQ 721
           ++AL A   ++ ++ D      ++   +  K  ++SLWNG YF     +   +TS    Q
Sbjct: 456 LSALTAFIKMS-EILDVKIDDKYYRFLECGKKTFNSLWNGKYFILWKKNDEENTSCLNSQ 514

Query: 722 LAGQWYARACGLLPIADEAKVKKALTKIYDFNVLKVKGGMCGAMNGMQPDGRIDMSGLQA 781
           L GQ++    GL PI D  K+  AL  IY+ N    K       N ++ DG +D S  Q 
Sbjct: 515 LLGQFWCDILGLPPITDHDKINTALRSIYELN---FKASKYCLTNAVREDGSVDSSTAQL 571

Query: 782 REIWPGVTYGLAASMIQEEMVDMAFQTAAGVYE 814
           R  WP V++ +AA MI   MV    + A   +E
Sbjct: 572 RSCWPRVSFAVAAHMILRGMVKEGIEVAKREWE 604


>gi|284173198|ref|ZP_06387167.1| hypothetical protein Ssol98_00875 [Sulfolobus solfataricus 98/2]
          Length = 647

 Score =  119 bits (298), Expect = 7e-24,   Method: Compositional matrix adjust.
 Identities = 79/273 (28%), Positives = 131/273 (47%), Gaps = 12/273 (4%)

Query: 545 MKIMSDGKWVARKCLGAVPHDIG---LDDPWFEINSYNLFNSSRWKDLNSKFVLQVYRDF 601
           + I +D  ++     G  PHD+G   ++DP +   SY  +    W DL    VL +YRD+
Sbjct: 340 LVISADNYFMNYVNYGETPHDVGEESIEDPIYGA-SYPYW----WTDLGPTLVLMLYRDY 394

Query: 602 VATGDKNFARAVWPSVYIAMAYMEQFDKDGDGMIENEGFPDQTYDAWSANGVSAYCGGLW 661
           V T ++      +  +   + ++ + D D D + +++G  D +YD     G S+Y   ++
Sbjct: 395 VFTSNREILEKNYNKIKEIIDWLIRKDMDNDCIPDSKGGYDNSYDGTHMYGASSYIASMF 454

Query: 662 VAALQAASALANDVGDHASASYFWVRYQKAKAVYDSLWNGSYFNYDNSDGSSSTSIQADQ 721
           ++AL A   ++ ++ D      ++   +  K  ++SLWNG YF     +   +TS    Q
Sbjct: 455 LSALTAFIKMS-EILDVKIDDKYYRFLECGKKTFNSLWNGKYFILWKKNDEENTSCLNSQ 513

Query: 722 LAGQWYARACGLLPIADEAKVKKALTKIYDFNVLKVKGGMCGAMNGMQPDGRIDMSGLQA 781
           L GQ++    GL PI D  K+  AL  IY+ N    K       N ++ DG +D S  Q 
Sbjct: 514 LLGQFWCDILGLPPITDHDKINTALRSIYELN---FKASKYCLTNAVREDGSVDSSTAQL 570

Query: 782 REIWPGVTYGLAASMIQEEMVDMAFQTAAGVYE 814
           R  WP V++ +AA MI   MV    + A   +E
Sbjct: 571 RSCWPRVSFAVAAHMILRGMVKEGIEVAKREWE 603


>gi|374921957|gb|AFA26156.1| non-lysosomal glucosylceramidase-like protein, partial [Lolium
           perenne]
          Length = 109

 Score =  117 bits (293), Expect = 3e-23,   Method: Compositional matrix adjust.
 Identities = 70/108 (64%), Positives = 84/108 (77%), Gaps = 1/108 (0%)

Query: 620 AMAYMEQFDKDGDGMIENEGFPDQTYDAWSANGVSAYCGGLWVAALQAASALANDVGDHA 679
           AM YMEQFD+D D MIEN+GFPDQTYDAW+  GVSAYCG LW+A+LQAA+A+A  +G   
Sbjct: 2   AMEYMEQFDRDDDSMIENDGFPDQTYDAWTVLGVSAYCGCLWLASLQAAAAMARSLGHAD 61

Query: 680 SASYFWVRYQKAKAVYDS-LWNGSYFNYDNSDGSSSTSIQADQLAGQW 726
            A    V++ KAK V+++ LWNGSYFNYD+    SS SIQADQLAGQW
Sbjct: 62  YAERCMVKFAKAKHVFEAKLWNGSYFNYDSGTSYSSRSIQADQLAGQW 109


>gi|385777223|ref|YP_005649791.1| hypothetical protein [Sulfolobus islandicus REY15A]
 gi|323475970|gb|ADX86576.1| conserved hypothetical protein [Sulfolobus islandicus REY15A]
          Length = 661

 Score =  117 bits (292), Expect = 3e-23,   Method: Compositional matrix adjust.
 Identities = 81/273 (29%), Positives = 130/273 (47%), Gaps = 12/273 (4%)

Query: 545 MKIMSDGKWVARKCLGAVPHDIG---LDDPWFEINSYNLFNSSRWKDLNSKFVLQVYRDF 601
           + I +D   V     G  PHDIG   ++DP +   SY  +    W DL    VL +YRD+
Sbjct: 354 LVISADNHLVNYINNGEAPHDIGEESIEDPIYGA-SYPYW----WTDLGPTLVLMLYRDY 408

Query: 602 VATGDKNFARAVWPSVYIAMAYMEQFDKDGDGMIENEGFPDQTYDAWSANGVSAYCGGLW 661
           V T D+      +  +   + ++ + D D D + +++G  D +YD     G S+Y   ++
Sbjct: 409 VFTSDRGILEKNYNKIKEIIDWLIRKDMDNDCIPDSKGGYDNSYDGTHMYGASSYVASMF 468

Query: 662 VAALQAASALANDVGDHASASYFWVRYQKAKAVYDSLWNGSYFNYDNSDGSSSTSIQADQ 721
           ++AL A   ++ ++ D    + ++   +  K  ++SLWNG YF     +   + S    Q
Sbjct: 469 LSALAAFIKIS-EILDVKIDTKYYRFLECGKKTFNSLWNGKYFVLWKKNDEENRSCLNSQ 527

Query: 722 LAGQWYARACGLLPIADEAKVKKALTKIYDFNVLKVKGGMCGAMNGMQPDGRIDMSGLQA 781
           L GQ++    GL PI D  K+  AL  IY+ N    K  +    N ++ D  ID S  Q 
Sbjct: 528 LLGQFWCDILGLPPITDNDKINTALRSIYELNFRASKYCLT---NAVKEDRDIDTSTGQL 584

Query: 782 REIWPGVTYGLAASMIQEEMVDMAFQTAAGVYE 814
           R  WP V++ +AA MI   MV    + A   +E
Sbjct: 585 RSCWPRVSFAVAAHMILRGMVKEGMEVAKREWE 617


>gi|227828867|ref|YP_002830647.1| hypothetical protein M1425_2640 [Sulfolobus islandicus M.14.25]
 gi|227460663|gb|ACP39349.1| protein of unknown function DUF608 [Sulfolobus islandicus M.14.25]
          Length = 661

 Score =  117 bits (292), Expect = 4e-23,   Method: Compositional matrix adjust.
 Identities = 81/275 (29%), Positives = 130/275 (47%), Gaps = 12/275 (4%)

Query: 545 MKIMSDGKWVARKCLGAVPHDIG---LDDPWFEINSYNLFNSSRWKDLNSKFVLQVYRDF 601
           + I +D   V     G  PHDIG   ++DP +   SY  +    W DL    VL +YRD+
Sbjct: 354 LVISADNHLVNYINNGEAPHDIGEESIEDPIYGA-SYPYW----WTDLGPTLVLMLYRDY 408

Query: 602 VATGDKNFARAVWPSVYIAMAYMEQFDKDGDGMIENEGFPDQTYDAWSANGVSAYCGGLW 661
           V T D+      +  +   + ++ + D D D + +++G  D +YD     G S+Y   ++
Sbjct: 409 VFTSDRGILEKNYNKIKEIIDWLIRKDMDNDCIPDSKGGYDNSYDGTHMYGASSYVASMF 468

Query: 662 VAALQAASALANDVGDHASASYFWVRYQKAKAVYDSLWNGSYFNYDNSDGSSSTSIQADQ 721
           ++AL A   ++ ++ D    + ++   +  K  ++SLWNG YF     +   + S    Q
Sbjct: 469 LSALAAFIKIS-EILDVKIDTKYYRFLECGKKTFNSLWNGKYFVLWKKNDEENRSCLNSQ 527

Query: 722 LAGQWYARACGLLPIADEAKVKKALTKIYDFNVLKVKGGMCGAMNGMQPDGRIDMSGLQA 781
           L GQ++    GL PI D  K+  AL  IY+ N    K  +    N ++ D  ID S  Q 
Sbjct: 528 LLGQFWCDILGLPPITDNDKINTALRSIYELNFRASKYCLT---NAVKEDRDIDTSTGQL 584

Query: 782 REIWPGVTYGLAASMIQEEMVDMAFQTAAGVYEVA 816
           R  WP V++ +AA MI   MV    + A   +E  
Sbjct: 585 RSCWPRVSFAVAAHMILRGMVKEGMEVAKREWETV 619


>gi|385774575|ref|YP_005647144.1| hypothetical protein [Sulfolobus islandicus HVE10/4]
 gi|323478691|gb|ADX83929.1| conserved hypothetical protein [Sulfolobus islandicus HVE10/4]
          Length = 661

 Score =  117 bits (292), Expect = 4e-23,   Method: Compositional matrix adjust.
 Identities = 78/258 (30%), Positives = 125/258 (48%), Gaps = 12/258 (4%)

Query: 560 GAVPHDIG---LDDPWFEINSYNLFNSSRWKDLNSKFVLQVYRDFVATGDKNFARAVWPS 616
           G  PHDIG   ++DP +   SY  +    W DL    VL +YRD+V T D+      +  
Sbjct: 369 GEAPHDIGEESIEDPIYGA-SYPYW----WTDLGPTLVLMLYRDYVFTSDRGILEKNYNK 423

Query: 617 VYIAMAYMEQFDKDGDGMIENEGFPDQTYDAWSANGVSAYCGGLWVAALQAASALANDVG 676
           +   + ++ + D D D + +++G  D +YD     G S+Y   ++++AL A   ++ ++ 
Sbjct: 424 IKEIIDWLIRKDMDNDCIPDSKGGYDNSYDGTHMYGASSYVASMFLSALAAFIKIS-EIL 482

Query: 677 DHASASYFWVRYQKAKAVYDSLWNGSYFNYDNSDGSSSTSIQADQLAGQWYARACGLLPI 736
           D    + ++   +  K  ++SLWNG YF     +   + S    QL GQ++    GL PI
Sbjct: 483 DVKIDTKYYRFLECGKKTFNSLWNGKYFVLWKKNDEENRSCLNSQLLGQFWCDILGLPPI 542

Query: 737 ADEAKVKKALTKIYDFNVLKVKGGMCGAMNGMQPDGRIDMSGLQAREIWPGVTYGLAASM 796
            D  K+  AL  IY+ N    K  +    N ++ D  ID S  Q R  WP V++ +AA M
Sbjct: 543 TDNDKINTALRSIYELNFRASKYCLT---NAVKEDRDIDTSTGQLRSCWPRVSFAVAAHM 599

Query: 797 IQEEMVDMAFQTAAGVYE 814
           I   MV    + A   +E
Sbjct: 600 ILRGMVKEGMEVAKREWE 617


>gi|229586074|ref|YP_002844576.1| hypothetical protein M1627_2694 [Sulfolobus islandicus M.16.27]
 gi|228021124|gb|ACP56531.1| protein of unknown function DUF608 [Sulfolobus islandicus M.16.27]
          Length = 661

 Score =  117 bits (292), Expect = 4e-23,   Method: Compositional matrix adjust.
 Identities = 81/273 (29%), Positives = 130/273 (47%), Gaps = 12/273 (4%)

Query: 545 MKIMSDGKWVARKCLGAVPHDIG---LDDPWFEINSYNLFNSSRWKDLNSKFVLQVYRDF 601
           + I +D   V     G  PHDIG   ++DP +   SY  +    W DL    VL +YRD+
Sbjct: 354 LVISADNHLVNYINNGEAPHDIGEESIEDPIYGA-SYPYW----WTDLGPTLVLMLYRDY 408

Query: 602 VATGDKNFARAVWPSVYIAMAYMEQFDKDGDGMIENEGFPDQTYDAWSANGVSAYCGGLW 661
           V T D+      +  +   + ++ + D D D + +++G  D +YD     G S+Y   ++
Sbjct: 409 VFTSDRGILEKNYNKIKEIIDWLIRKDMDNDCIPDSKGGYDNSYDGTHMYGASSYVASMF 468

Query: 662 VAALQAASALANDVGDHASASYFWVRYQKAKAVYDSLWNGSYFNYDNSDGSSSTSIQADQ 721
           ++AL A   ++ ++ D    + ++   +  K  ++SLWNG YF     +   + S    Q
Sbjct: 469 LSALAAFIKIS-EILDVKIDTKYYRFLECGKKTFNSLWNGKYFVLWKKNDEENRSCLNSQ 527

Query: 722 LAGQWYARACGLLPIADEAKVKKALTKIYDFNVLKVKGGMCGAMNGMQPDGRIDMSGLQA 781
           L GQ++    GL PI D  K+  AL  IY+ N    K  +    N ++ D  ID S  Q 
Sbjct: 528 LLGQFWCDILGLPPITDNDKINTALRSIYELNFRASKYCLT---NAVKEDRDIDTSTGQL 584

Query: 782 REIWPGVTYGLAASMIQEEMVDMAFQTAAGVYE 814
           R  WP V++ +AA MI   MV    + A   +E
Sbjct: 585 RSCWPRVSFAVAAHMILRGMVKEGMEVAKREWE 617


>gi|227831601|ref|YP_002833381.1| hypothetical protein LS215_2801 [Sulfolobus islandicus L.S.2.15]
 gi|229580553|ref|YP_002838953.1| hypothetical protein YG5714_2812 [Sulfolobus islandicus Y.G.57.14]
 gi|238621059|ref|YP_002915885.1| hypothetical protein M164_2625 [Sulfolobus islandicus M.16.4]
 gi|284999153|ref|YP_003420921.1| hypothetical protein [Sulfolobus islandicus L.D.8.5]
 gi|227458049|gb|ACP36736.1| protein of unknown function DUF608 [Sulfolobus islandicus L.S.2.15]
 gi|228011269|gb|ACP47031.1| protein of unknown function DUF608 [Sulfolobus islandicus
           Y.G.57.14]
 gi|238382129|gb|ACR43217.1| protein of unknown function DUF608 [Sulfolobus islandicus M.16.4]
 gi|284447049|gb|ADB88551.1| protein of unknown function DUF608 [Sulfolobus islandicus L.D.8.5]
          Length = 661

 Score =  117 bits (292), Expect = 4e-23,   Method: Compositional matrix adjust.
 Identities = 81/273 (29%), Positives = 130/273 (47%), Gaps = 12/273 (4%)

Query: 545 MKIMSDGKWVARKCLGAVPHDIG---LDDPWFEINSYNLFNSSRWKDLNSKFVLQVYRDF 601
           + I +D   V     G  PHDIG   ++DP +   SY  +    W DL    VL +YRD+
Sbjct: 354 LVISADNHLVNYINNGEAPHDIGEESIEDPIYGA-SYPYW----WTDLGPTLVLMLYRDY 408

Query: 602 VATGDKNFARAVWPSVYIAMAYMEQFDKDGDGMIENEGFPDQTYDAWSANGVSAYCGGLW 661
           V T D+      +  +   + ++ + D D D + +++G  D +YD     G S+Y   ++
Sbjct: 409 VFTSDRGILEKNYNKIKEIIDWLIRKDMDNDCIPDSKGGYDNSYDGTHMYGASSYVASMF 468

Query: 662 VAALQAASALANDVGDHASASYFWVRYQKAKAVYDSLWNGSYFNYDNSDGSSSTSIQADQ 721
           ++AL A   ++ ++ D    + ++   +  K  ++SLWNG YF     +   + S    Q
Sbjct: 469 LSALAAFIKIS-EILDVKIDTKYYRFLECGKKTFNSLWNGKYFVLWKKNDEENRSCLNSQ 527

Query: 722 LAGQWYARACGLLPIADEAKVKKALTKIYDFNVLKVKGGMCGAMNGMQPDGRIDMSGLQA 781
           L GQ++    GL PI D  K+  AL  IY+ N    K  +    N ++ D  ID S  Q 
Sbjct: 528 LLGQFWCDILGLPPITDNDKINTALRSIYELNFRASKYCLT---NAVKEDRDIDTSTGQL 584

Query: 782 REIWPGVTYGLAASMIQEEMVDMAFQTAAGVYE 814
           R  WP V++ +AA MI   MV    + A   +E
Sbjct: 585 RSCWPRVSFAVAAHMILRGMVKEGMEVAKREWE 617


>gi|229583411|ref|YP_002841810.1| hypothetical protein YN1551_3001 [Sulfolobus islandicus Y.N.15.51]
 gi|228014127|gb|ACP49888.1| protein of unknown function DUF608 [Sulfolobus islandicus
           Y.N.15.51]
          Length = 661

 Score =  116 bits (290), Expect = 6e-23,   Method: Compositional matrix adjust.
 Identities = 80/273 (29%), Positives = 130/273 (47%), Gaps = 12/273 (4%)

Query: 545 MKIMSDGKWVARKCLGAVPHDIG---LDDPWFEINSYNLFNSSRWKDLNSKFVLQVYRDF 601
           + I +D   V     G  PHDIG   ++DP +   SY  +    W DL    VL +YRD+
Sbjct: 354 LVISADNHLVNYINNGEAPHDIGEESIEDPIYGA-SYPYW----WTDLGPTLVLMLYRDY 408

Query: 602 VATGDKNFARAVWPSVYIAMAYMEQFDKDGDGMIENEGFPDQTYDAWSANGVSAYCGGLW 661
           + T D+      +  +   + ++ + D D D + +++G  D +YD     G S+Y   ++
Sbjct: 409 IFTSDRGILEKNYNKIKEIIDWLIRKDMDNDCIPDSKGGYDNSYDGTHMYGASSYVASMF 468

Query: 662 VAALQAASALANDVGDHASASYFWVRYQKAKAVYDSLWNGSYFNYDNSDGSSSTSIQADQ 721
           ++AL A   ++ ++ D    + ++   +  K  ++SLWNG YF     +   + S    Q
Sbjct: 469 LSALAAFIKIS-EILDIKIDTKYYRFLECGKKTFNSLWNGKYFVLWKKNDEENRSCLNSQ 527

Query: 722 LAGQWYARACGLLPIADEAKVKKALTKIYDFNVLKVKGGMCGAMNGMQPDGRIDMSGLQA 781
           L GQ++    GL PI D  K+  AL  IY+ N    K  +    N ++ D  ID S  Q 
Sbjct: 528 LLGQFWCDILGLPPITDNDKINTALRSIYELNFRASKYCLT---NAVKEDRDIDTSTGQL 584

Query: 782 REIWPGVTYGLAASMIQEEMVDMAFQTAAGVYE 814
           R  WP V++ +AA MI   MV    + A   +E
Sbjct: 585 RSCWPRVSFAVAAHMILRGMVKEGMEVAKREWE 617


>gi|327310049|ref|YP_004336946.1| hypothetical protein TUZN_0133 [Thermoproteus uzoniensis 768-20]
 gi|326946528|gb|AEA11634.1| hypothetical protein TUZN_0133 [Thermoproteus uzoniensis 768-20]
          Length = 660

 Score =  114 bits (285), Expect = 3e-22,   Method: Compositional matrix adjust.
 Identities = 88/300 (29%), Positives = 131/300 (43%), Gaps = 32/300 (10%)

Query: 514 SSFALVMLFPKLELSIQRDFAAAVMMHDPGTMKIMSDGKWVARKCLGAVPHDIG---LDD 570
           + FAL+ LFP L +     FA    M+  G                G +PHD G   ++D
Sbjct: 334 AGFALLELFPDLVVRADEVFA----MYRRG----------------GEIPHDYGEESIED 373

Query: 571 PWFEINSYNLFNSSRWKDLNSKFVLQVYRDFVATGDKNFARAVWPSVYIAMAYMEQFDKD 630
           P      Y     + WKDL S +VL +YRD+  TGD    R    ++  A+ ++   D+D
Sbjct: 374 P-----IYGATYLTPWKDLASTWVLMIYRDYYYTGDVEILRRNIDAMKDAVDWLMSLDRD 428

Query: 631 GDGMIENEGFPDQTYDAWSANGVSAYCGGLWVAALQAASALANDVGDHASASYFWVRYQK 690
           GD + ++ G  D +YD  +  G  AY   L++ AL A    A  +G      Y     ++
Sbjct: 429 GDCVPDSRGRNDNSYDGSNMYGRVAYVASLFLCALTAYIKAAERLGLKPEPKYLEC-LER 487

Query: 691 AKAVYDSLWNGSYFNYDNSDGSSSTSIQADQLAGQWYARACGLLPIADEAKVKKALTKIY 750
            K   D LWNG Y+           +  + QL GQ++     L PIA + KV  AL  IY
Sbjct: 488 GKRSLDELWNGRYYVAWADGPRRKDACMSSQLLGQFWCDMLDLPPIAPDDKVAAALRSIY 547

Query: 751 DFNVLKVKGGMCGAMNGMQPDGRIDMSGLQAREIWPGVTYGLAASMIQEEMVDMAFQTAA 810
           +   L ++       N + PDG+ D    Q R  W  V + +AA M+   M     + A+
Sbjct: 548 E---LGMRASKYCIPNSVTPDGKPDEETPQLRSCWTRVNFAVAAHMLLRGMEREGMEVAS 604


>gi|148242848|ref|YP_001228005.1| hypothetical protein SynRCC307_1749 [Synechococcus sp. RCC307]
 gi|147851158|emb|CAK28652.1| Conserved hypothetical protein [Synechococcus sp. RCC307]
          Length = 220

 Score =  112 bits (280), Expect = 9e-22,   Method: Composition-based stats.
 Identities = 72/211 (34%), Positives = 109/211 (51%), Gaps = 14/211 (6%)

Query: 653 VSAYCGGLWVAALQAASA----LANDVGDHASASY--FWVRYQKAKAVYDSL-WNGSYFN 705
           +SAYCG LW+AAL+AA A    LA ++G   +A    F    Q+++  +D L WNG ++ 
Sbjct: 1   MSAYCGALWIAALEAALAMGQTLALELGVDTAAEQRDFSQWLQQSRGNFDQLLWNGEFYK 60

Query: 706 YDNSDGSSSTSIQADQLAGQWYARACGLLPIADEAKVKKALTKIYDFNVLKVKGGMCGAM 765
            D   G+    + ADQL G +YAR  GL P+  EA  + +L  I +       GG  G  
Sbjct: 61  IDAESGTPV--VMADQLCGDFYARLLGLPPVVSEANARSSLQAIKEACFEGFAGGRLGVA 118

Query: 766 NGMQPDGR-IDMSGLQAREIWPGVTYGLAASMIQEEMVDMAFQTAAGVYEVAWSEDGLGY 824
           NG++ DG  +D +G    E+W G+ +GLAA        + A      V    ++    G 
Sbjct: 119 NGLRRDGTPLDPNGTHPLEVWTGINFGLAAYYRLMGETNTAVAITQAVVNQVYAG---GL 175

Query: 825 SFQTPESWNNNDEYRSLCYMRPLTIWAMQWA 855
            F+TPE+    + +R+  Y+R + IWA+ WA
Sbjct: 176 QFRTPEALTGQNTFRACHYLRAMAIWAL-WA 205


>gi|284997264|ref|YP_003419031.1| hypothetical protein [Sulfolobus islandicus L.D.8.5]
 gi|284445159|gb|ADB86661.1| protein of unknown function DUF608 [Sulfolobus islandicus L.D.8.5]
          Length = 740

 Score =  112 bits (279), Expect = 1e-21,   Method: Compositional matrix adjust.
 Identities = 73/239 (30%), Positives = 117/239 (48%), Gaps = 14/239 (5%)

Query: 560 GAVPHDIG---LDDPWFEINSYNLFNSSRWKDLNSKFVLQVYRDFVATGDKNFARAVWPS 616
           G VPHD+G   LD P     S       +WKD N  F+L VYR +  T D +F + ++P 
Sbjct: 450 GYVPHDLGIHSLDHPVDGTTS-----PPKWKDTNPTFILLVYRYYKFTNDLDFLKEMYPK 504

Query: 617 VYIAMAYMEQFDKDGDGMIENEGFPDQTYDAWSANGVSAYCGGLWVAALQAASALANDVG 676
           +  AM +    DKD DG+ E EG  D  +DA S  G+ +Y   +++A++ A    A  + 
Sbjct: 505 MLKAMKWELTQDKDKDGVPELEGQGDTGFDAMSVKGIDSYTTSIYIASIIALKETAKILK 564

Query: 677 DHASASYFWVRYQKAKAVYDSLWNGSYFNYDNSDGSSSTS-IQADQLAGQWYARACGLLP 735
           D+ + +   +  +K++ ++  L+NG YF  D   G      I   QL G+W++   GL P
Sbjct: 565 DNDTLNEMTILLEKSRKIFSRLFNGKYF--DPWIGEPEIKGIFLGQLVGEWWSEILGLEP 622

Query: 736 IADEAKVKKALTKIYDFNVLKVKGGMCGAMNGMQPDGRIDMSGLQAREIWPGVTYGLAA 794
           I +E K+  AL  +   N        C   N +  DG+I     Q+   WP + + +++
Sbjct: 623 IIEEEKISSALEAMLSINA--NSSNFCTP-NIVSEDGKIIDISPQSYSSWPRLVFAISS 678



 Score = 48.9 bits (115), Expect = 0.014,   Method: Compositional matrix adjust.
 Identities = 41/140 (29%), Positives = 63/140 (45%), Gaps = 19/140 (13%)

Query: 88  GIPLGGIGAGSIGRSYRGEFQRFKLFHGICDDAPVLANQFSVFVSRPNGEKFSSVLCPRS 147
           G+PLGGIGAG I  + +G+     +F+      P +   F + +   +G  F   L    
Sbjct: 74  GVPLGGIGAGKIEINNKGKVVNVTIFNNKGFPIPNIRG-FHILIKPDDGNPF--FLEKDL 130

Query: 148 PGVPKKNTDSGIESWDWNLKGENCTYHALFPRAWTVYDGEPDPELRIVCRQISPFIPHNY 207
           P +  K+ +S I             Y  L+P A+ +  G  D ++R      SP IP N 
Sbjct: 131 PFISLKHLESDI-----------LIYEGLYPFAYLI--GIKD-KIRAKLTAFSPLIPRNI 176

Query: 208 KESSFPASVFTFTL--SNSG 225
            +S+ PA  F+  +  SNSG
Sbjct: 177 NDSTLPAVGFSLKIEGSNSG 196


>gi|149196161|ref|ZP_01873217.1| hypothetical protein LNTAR_23484 [Lentisphaera araneosa HTCC2155]
 gi|149141008|gb|EDM29405.1| hypothetical protein LNTAR_23484 [Lentisphaera araneosa HTCC2155]
          Length = 1052

 Score =  111 bits (278), Expect = 2e-21,   Method: Compositional matrix adjust.
 Identities = 123/543 (22%), Positives = 220/543 (40%), Gaps = 93/543 (17%)

Query: 169 ENCTYHALFPRAWTVYDGEPDPELRIVCRQISPFIPHNYKESSFPASVFTFTLSNSGQTS 228
           E      L+P A   Y+ +   ++RI     SP + ++ K  S P ++    L+N   TS
Sbjct: 219 EKIQNRLLYPYAELKYEDDAH-DVRIEKLHFSPIVRNDEKSCSAPLNLTEIKLTN---TS 274

Query: 229 ADVTLL-FTW---------------ANSVAGDSGLSGHHFNSKTM----TKDGV-HGLTL 267
           A V +L   W               AN  AG        +    +    T++G+  G+TL
Sbjct: 275 AKVKILTLAWDQENLSGFSVVKKRPANQDAGFLLQKTVRYQENNLCDIETENGLFQGITL 334

Query: 268 H----HRTANGRPPVTFAVAAEETADVHVSECPCFLLSGNSKGITAKDMWNEIKKHGSFD 323
                H + + +  +TF +  +   D+ VS    +        +       E  + G  +
Sbjct: 335 GNKKGHESGDIKGELTFGILTQNDPDICVSRRSSYYSVERDSTVC------EALQGGRVN 388

Query: 324 HLDNDKTSPSEPGSSIGAAIAASLTIPSGSTRSVTFSLAWDCPEVKFFEKVYHRRYTKFY 383
           ++  DK         +   +   +T+  G+ +++ F    D PE+   E    ++YT F+
Sbjct: 389 NVFFDKIYTGR--EQLSGIVTVQVTLQPGAVKNIPFVQILDYPEISLGEYQSQKKYTSFF 446

Query: 384 GTLGDSAARIAHDAILEHAKWECEIEAWQRPILEDKRFPEWYPITLFNELYYLNAGGTIW 443
                     A +   +  K+ C+     R  ++  +          NEL+ L     ++
Sbjct: 447 P---------AENRTEDLVKFGCDNFYSTREQIDADQ----------NELHELLLNSDVY 487

Query: 444 TDGSPPMQSLATIRERKFSLDTPSKIHTPASSDTALGTRLLENGEENIGQFLYLEGAEYV 503
              +     L T+ +   S    + +                       +FL  E ++Y 
Sbjct: 488 KGKAKSAAELTTMAQNTLSFTADATVWDIDD------------------KFLVRECSDYP 529

Query: 504 MYNTYDVHFYSSFALVMLFPKLELSIQRDFAAAVMMHDPGTMKI-----MSDGKW----- 553
            +N+ DV+FY SF+++ L P+++ +  R F  A+M  +    +      + + K      
Sbjct: 530 FFNSLDVYFYGSFSMMWLLPQVDTNTMRCFRDAIMAENDELRRFYVYLELPNAKLPHPKY 589

Query: 554 ---VARKCLGAVPHDIGLD-DPWFEINSYNLFNSSRWKDLNSKFVLQVYRDFVATGDKNF 609
               ARK  GAV HD+G   DP    ++YN  N + WKDL  K++L + R++  T D   
Sbjct: 590 EGPRARK--GAVIHDLGSPFDP--RPDAYNWHNVAEWKDLAPKYILMLLRNYHFTKDVYL 645

Query: 610 ARAVWPSVYIAMAYMEQFDKDGDGMIENEGFPDQTYDAWSANGVSAYCGGLWVAALQAAS 669
               W SV  ++ Y++    +G  +    G  D T+D  S+ G++ YCG LWVA L+AA 
Sbjct: 646 LEECWDSVQASLQYLKDMILEGHSIPLTNG-TDDTFDNLSSFGITLYCGSLWVAGLKAAG 704

Query: 670 ALA 672
            +A
Sbjct: 705 EIA 707


>gi|327309977|ref|YP_004336874.1| hypothetical protein TUZN_0057 [Thermoproteus uzoniensis 768-20]
 gi|326946456|gb|AEA11562.1| hypothetical protein TUZN_0057 [Thermoproteus uzoniensis 768-20]
          Length = 694

 Score =  110 bits (276), Expect = 3e-21,   Method: Compositional matrix adjust.
 Identities = 183/726 (25%), Positives = 271/726 (37%), Gaps = 134/726 (18%)

Query: 88  GIPLGGIGAGSIGRSYRGEFQRFKLFHGICDDAPVLANQFSVFVSRPNGEKF-----SSV 142
           G+PLGGIGAG +    +G        +        L   F VFV   + +K       SV
Sbjct: 13  GVPLGGIGAGKVELDNKGRLVNATFANNWSRPIREL-RGFHVFVYAEDLDKHLFAQDVSV 71

Query: 143 LCPRSPGVPKKNTDSGIESWDWNLKGENCTYHALFPRAWTVYDGEPDPELRIVCRQISPF 202
           L      +P   +   +E+           Y   +P  W    G     L +     S  
Sbjct: 72  LGT----LPTLGSAGFVEA----------EYEGRWPFVWL--RGRRG-GLEVEVEGFSAV 114

Query: 203 IPHNYKESSFPASVFTFTLSNSGQTSADVTLL-FTWANSVAGDSGLSGHHF---NSKTMT 258
           +P + K+SS PA+     +  S +  A V++     AN+V   +      F   N++   
Sbjct: 115 VPGDLKDSSLPAAGLAVRVRGSRRGIAAVSIPNVVGANNVGRINEPVPGGFKARNARAHE 174

Query: 259 KDGVHG-LTLHHRTANGRPPVTFAVAAEETADVHVSECP--CFLLSGNSKGITAKDMWNE 315
            D  +G L L  +      PV  AV  +   DV  SE P  C L      G   +  W  
Sbjct: 175 YDPQNGELALLAKD-----PV--AVIPQFNLDVPRSEAPKVCAL------GPEDERPWLY 221

Query: 316 IKKHGSFDHLDNDKTSPSEPGSSIGAAIAASLTIPSGSTRSVTFSLAWDCPEVKFFEKVY 375
           +     F         P    +S G    A +   S       F L+W      +F+K +
Sbjct: 222 LMGKADF--------GPRVAEASGGGEQPAGVVAASYEGGEARFVLSW------YFDKPW 267

Query: 376 HRR-YTKFYGTLGDSAARIAHDAILEHAKWECEIEAWQRPILEDKRFPEWYPITLFNELY 434
               Y  +Y      AA +    + E+ +       WQ  ++ D   PEW    + N  Y
Sbjct: 268 ALYPYGHYYENFFKDAADVGRYFLDEYDRLRRRTAEWQEGLV-DPALPEWLRDAVINSAY 326

Query: 435 YLNAGGTIWTDGSPPMQSLATIRERKFSLDTPSKIHTPASSDTALGTRLLENGEENIGQF 494
            L+AG  +  DG             +FS      I+         GT         +G  
Sbjct: 327 ILSAGTWLTKDG-------------RFS------IYEAPEVAPMHGT---------VGGL 358

Query: 495 LYLEGAEYVMYNTYDVHFYSSFALVMLFPKLELSIQRDFAAAVMMHDPGTMKIMSDGKWV 554
            Y  G               S  +V++FP+LE    R F   +  H       M D    
Sbjct: 359 CYETG---------------SLPVVLMFPQLE----RSFLEQLARH-------MRDD--- 389

Query: 555 ARKCLGAVPHDIG---LDDPWFEINSYNLFNSSRWKDLNSKFVLQVYRDFVATGDKNFAR 611
                G VPHD+G   LD P     S       RWKD N  F+L VYR +  TGD +F +
Sbjct: 390 -----GYVPHDLGTHSLDAP-----SDGTTAPPRWKDTNPTFILLVYRYYKRTGDIDFVK 439

Query: 612 AVWPSVYIAMAYMEQFDKDGDGMIENEGFPDQTYDAWSANGVSAYCGGLWVAALQAASAL 671
           +V+  +  A  +M   D+DGDG+ E EG  D  +D     G SAY   LW+AAL A   L
Sbjct: 440 SVYGRLVAAYRWMRGQDRDGDGLPELEGSGDSGFDCAKIRGPSAYTSSLWIAALVALRDL 499

Query: 672 ANDVGDHASASYFWVRYQKAKAVYDSLWNGSYFNYDNSDGSSSTSIQADQLAGQWYARAC 731
           A  VGD  +A       ++A+  ++  ++G   +Y   DG+   ++ A    G W+    
Sbjct: 500 AEAVGDSKTAEEAGSDLERARNSFERSFDG--VSYKAWDGAPEGAVFAALAVGDWWLYLA 557

Query: 732 GLLP-IADEAKVKKALTKIYDFNVLKVKGGMCGAMNGM-QPDGRI-DMSGLQAREIWPGV 788
           G     A   ++  AL+ +   N          A  G  + DG I +  G Q    WP +
Sbjct: 558 GSREGAARRDRLASALSAVLRVNGSASANCAPNASTGRPEADGEISERVGRQLTASWPRL 617

Query: 789 TYGLAA 794
            + +AA
Sbjct: 618 IFSVAA 623


>gi|194359640|gb|ACF57667.1| bile acid beta-glucosidase [Ctenopharyngodon idella]
          Length = 89

 Score =  110 bits (276), Expect = 3e-21,   Method: Composition-based stats.
 Identities = 46/89 (51%), Positives = 62/89 (69%)

Query: 571 PWFEINSYNLFNSSRWKDLNSKFVLQVYRDFVATGDKNFARAVWPSVYIAMAYMEQFDKD 630
           PW  +N+Y +++++ WKDLN KFVLQV+RD+  T D+ + + +WP     M    +FDKD
Sbjct: 1   PWVRVNAYLIYDTADWKDLNLKFVLQVFRDYHLTQDEQYLKDMWPICQTVMETELKFDKD 60

Query: 631 GDGMIENEGFPDQTYDAWSANGVSAYCGG 659
           GDG+IEN G+ DQTYD W   G SAYCGG
Sbjct: 61  GDGLIENSGYADQTYDGWKVTGPSAYCGG 89


>gi|355689818|gb|AER98955.1| glucosidase, beta 2 [Mustela putorius furo]
          Length = 157

 Score =  109 bits (273), Expect = 6e-21,   Method: Composition-based stats.
 Identities = 61/166 (36%), Positives = 89/166 (53%), Gaps = 12/166 (7%)

Query: 109 RFKLFHGICDDAPVLANQFSVFVSRPNGEKFSSVLCPRSPGVPKKNTDSGIESWDWNLKG 168
           R++L  G+     V+A+QF+V + R     +  VL    P V        + SW+W L G
Sbjct: 1   RWQLNPGMYQHRTVIADQFTVCLRREGKTVYQQVLSVERPSV--------LRSWNWGLCG 52

Query: 169 ENCTYHALFPRAWTVYDGEPDPELRIVCRQISPFIPHNYKESSFPASVFTFTLSNSGQTS 228
               YHAL+PRAWTVY   P   + + CRQI+P +PH+Y++SS P  VF + + N G  +
Sbjct: 53  YFAFYHALYPRAWTVYQ-LPGQNVTLTCRQITPILPHDYQDSSLPVGVFVWDVENEGDEA 111

Query: 229 ADVTLLFTWANSVA-GDSGLSGHHFNSKTMTKDG--VHGLTLHHRT 271
            DV+++F+  N +  GD    G       + +DG  V GL LHH T
Sbjct: 112 LDVSIMFSMRNGLGVGDDAPGGLWNEPFCLERDGETVQGLLLHHPT 157


>gi|302348409|ref|YP_003816047.1| trehalase [Acidilobus saccharovorans 345-15]
 gi|302328821|gb|ADL19016.1| Putative trehalase [Acidilobus saccharovorans 345-15]
          Length = 928

 Score =  106 bits (264), Expect = 6e-20,   Method: Compositional matrix adjust.
 Identities = 177/827 (21%), Positives = 313/827 (37%), Gaps = 127/827 (15%)

Query: 74  VFNVFRKHHITSDQGIPLGGIGAGSIGRSYRGEFQRFKLFH-----GICDDAPVLANQFS 128
           V ++   ++ TS  G+P+GG+G GS      G F+ + +F+     G       +     
Sbjct: 2   VLHLRISYYDTSLAGVPMGGLGTGSFEIGPDGRFREWLIFNNRPWAGYGQPQWFMTPDDL 61

Query: 129 VFVSR--PNGEK--FSSVLCPRSPGVPKKNTDSGIESW--------DWNLKGENCTYHAL 176
           VF  R  P G++     +      G  +  T  G   W         W    E       
Sbjct: 62  VFFLRIEPEGQEPILRGLFTGLWYGSAEDYTYRGCGPWVVAEPYHLPWAKGVEGVEMDVR 121

Query: 177 FPRAWTVYDGEPDPEL--RIVCRQISPFIPHNYKESSFPASVFTFTLSNSGQTSADVTLL 234
           +PR    Y      EL   +     SP +P + K SS PA +  F ++N G +   V+L+
Sbjct: 122 YPRVTLSYRDSLLDELGVEVSLDAYSPLLPGDLKASSTPAVILRFRVTNRGTSRVKVSLM 181

Query: 235 FTWANSVAGDSGLSGHHFNSKTMTKDGVHGLTLHHRTANGRPPVTFAVAAEETADVHVSE 294
               N        +G   +S+         + +     + R P+     A  +     S 
Sbjct: 182 AVARNLARS----AGGRPSSEVREGPDWTAMIMSACCVDERHPMARGALAVASLGQQSSG 237

Query: 295 CPCFLLSGNSKGITAKDMWNEIKKHGSFDHLDNDKTSPSEPGSSI---GAAIAASLTIPS 351
               + S N   +        +   G     D   T PS   S +    A     + +  
Sbjct: 238 AVGSVDSSNRPQLVNLLRRLLVDFRG-----DGTLTGPSSMTSDLDVFSAVATRPIELTP 292

Query: 352 GSTRSVTFSLAWDCP-EVKFFEKVYHRRYTKFYGTLGDSAARIAHDAILEHAKWECEIEA 410
           GS+  + +++AW  P  +    +     Y  F+G +    + I  +     A+ E    A
Sbjct: 293 GSSEELYWAIAWHFPNHIDASGQRVGHYYENFFGNVEQVISYIFENKHELTARLEAFSSA 352

Query: 411 WQRPILEDKRFPEWY------PITLFNELYYLNAGG--TIWTDGSPPMQSLATIRERKFS 462
                + D  + +W        +T   +L +L   G   +W +G P    L T+    ++
Sbjct: 353 -----MYDASYDDWVIDLVTAQLTSIPKLSWLTRDGHFGLW-EGGPGCCGLNTVDVVLWA 406

Query: 463 LDTPSKIHTPASSDTALGTRLLENGEENIGQFLYLEGAEYVMYNTYDVHFYSSFALVMLF 522
                 ++            L++   +++ + + L   ++  Y  + + F+ + +L    
Sbjct: 407 FAGVINMYP----------ELVKVAVKDVVRHI-LRPDKHYYYELFALGFHENMSLYREM 455

Query: 523 PKLELSIQRDFAAAVMMHDPGTMKIMSDGKWVARKCLGAVPHDIGLDDPWFEINSYNLFN 582
            + + SIQ D A         +  I    +   +   G VPH            S+NL +
Sbjct: 456 LRKDPSIQNDPAKF-------SEAIAEVVRRTGKDPTGRVPHSFR--------ASFNLID 500

Query: 583 SSRWKDLNSKFVLQVYRDFVATGDKNFARAVWPSVYIAM-AYMEQFDKDGDGMIEN---- 637
           S    DL  +F+L     + ATGD+ + R++W  V   +     Q D  G G+IE+    
Sbjct: 501 SYDRNDLMPEFLLIALLPYYATGDREYLRSLWNDVLTVIDGTRRQHDSRGTGLIEHYMPS 560

Query: 638 --EGF----------------------------PD------QTYDAWSANGVSAYCGGLW 661
             EG                             P+       T+D +S  GV+++ G LW
Sbjct: 561 DYEGLSRVAQDAAEKGLVPGFYGGNVVKLLFSGPEFVMNSVNTFDTFSLLGVASFTGDLW 620

Query: 662 VAALQAASALANDVGDHASASYFWVRYQKAKAVYDSLWNGSYFN--YDNSDGSSSTSIQA 719
            A+L+AA+  A+  G   +     +  +  +++   LWNGSYF+  YD   G    +I +
Sbjct: 621 AASLRAAAEAASREGLARAEELRQLSLRAYESMVKLLWNGSYFDDWYDPESGLRDRAILS 680

Query: 720 DQLAGQWYARACGLLPIADEAKVKKALTKIYDFNVLKVKGGMCGAMNGM----------Q 769
            QL G+WY +  GL    D  KV+ AL ++Y  N  + +G + G   G           +
Sbjct: 681 AQLTGEWYLQVLGLGDGVDSEKVRSALREVYRANFRRWEGLLNGTYPGTPRPSMVGDVEE 740

Query: 770 PDGR--IDMSGLQAREIWPGVTYGLAASMIQEEMVDMAFQTAAGVYE 814
           P+G   ++    QA   W GV +G+A+ M+ E MV+ A +    +++
Sbjct: 741 PNGTKILNRVSSQADTPWTGVEFGVASQMLYEGMVEEAMEVLRSIHD 787


>gi|320109368|ref|YP_004184958.1| hypothetical protein AciPR4_4219 [Terriglobus saanensis SP1PR4]
 gi|319927889|gb|ADV84964.1| hypothetical protein AciPR4_4219 [Terriglobus saanensis SP1PR4]
          Length = 898

 Score =  105 bits (263), Expect = 8e-20,   Method: Compositional matrix adjust.
 Identities = 171/762 (22%), Positives = 283/762 (37%), Gaps = 147/762 (19%)

Query: 76  NVFRKHHITSDQGIPLGGIGAGSIGRSYRGEFQRFKLFHGICDDAPVLANQFSVFVSRP- 134
           N    H      G+PLGGIG G++  +  G F R  + +   D     A         P 
Sbjct: 56  NYLALHDAPGPSGVPLGGIGVGAVDLAPDGHFTRIAVNNWATDSGVARAR-----AENPE 110

Query: 135 -NGEKFSSVLCPRSPG-------VPKKNTDSGIESWDWNLKGENCTYHALFPRAWTVYDG 186
            + E F +V    S G          + T  G+ ++       + TY  LFP A   +D 
Sbjct: 111 WDAEAFLAVWEKDSSGNVAAVRLQRDRRTAYGMGAYG------HSTYRGLFPTARIAFD- 163

Query: 187 EPD---PELRIVCRQISPFIPHNYKESSFPASVFTFTLSNSGQTSADVTLLFTWAN---- 239
           +P    P   +     S   P N K+SS P      TL+NS     + ++  +W +    
Sbjct: 164 DPSANSPHSLVSVFAYSGLAPQNVKDSSLPGFWIEVTLANSDSKPVEASVALSWPDLVGR 223

Query: 240 ---SVAGDSGLSGHH---------------FNSKTMTKDGVHGLTLHHRTANGRPPVTF- 280
               VA  +  +G+                   + +  D +HGL             TF 
Sbjct: 224 GIYDVAPGAEKTGYDRAFDIEKLVAVPSPATQVEALQMDSLHGLRQFSTATPSIRQATFQ 283

Query: 281 ------AVAAEETADVHVSECPCFLLSGNSKGITAKDMWNEIKKHGSFDHLDNDKTSPSE 334
                 A+ AE      VS  P +     SK  T  D W   ++ G+F       T+ S+
Sbjct: 284 NYINAVAILAEVPEGGSVSTNPAW-----SK--TDADSWKSFREDGAFPS-RTGTTALSQ 335

Query: 335 PGSSIGAAIAA-SLTIPSGSTRSVTFSLAWDCP-----EVKFFEKVYHRRYTKFYGTLGD 388
            G   GA++ A    I  G  ++  F + W  P      V+F    Y R +  ++     
Sbjct: 336 SGKLSGASVVAIKAHIAGGGKQTFRFLITWSIPTLPHASVQFGTSDYGRYFQNYFPDF-- 393

Query: 389 SAARIAHDAILEHAKWECEIEAWQRPILEDKRFPEWYPITLFNELYYLNAGGTIWTDGSP 448
               + H    E A+   E  +WQ+PIL D   P+W    L      +N+G TI+T+   
Sbjct: 394 --QSLIHYEFSERARILRETLSWQKPIL-DSTLPDWLKFKL------INSGYTIYTN--- 441

Query: 449 PMQSLATIRERKFSLDTPSKIHTPASSDTALGTRLLENGEENIGQFLYLEGAEYVMYNTY 508
                                           T L ++G+     F  +EG    +  T 
Sbjct: 442 --------------------------------TMLNKSGD-----FSVMEGGMGGLAGTM 464

Query: 509 DVHFYSSFALVMLFPKLELSIQRDFAAAVMMHDPGTMKIMSDGKWVARKCLGAVPHDIGL 568
           D    +       F +L+ +  + F A     D G +    DG +            +GL
Sbjct: 465 DQRLVAHPFYQKFFTQLDRAELQRF-ADTQDTDGGILHF--DGSYF-----------VGL 510

Query: 569 DDPWFEINSYNLFNSSRWKDLNSKFVLQVYRDFVATGDKNFARAVWPSVYIAMAYMEQFD 628
            D     + +      +  D    +V+Q+ +D+  TGD  F +     +    AYM+   
Sbjct: 511 ADD----HGHTPVPHEKMIDNTGAWVIQLAKDYQQTGDDTFIKKNADHIRRGFAYMKAQI 566

Query: 629 KDGDGMIENEGFPDQTYDAWSANGVSAYCGGLWVAALQAASALANDVGDHASASYFWVRY 688
           +D D  I   G   QTYD +    +S Y G +++AAL+A   L + +GD   +     ++
Sbjct: 567 RD-DSFIPVGG---QTYDDFPHPEISVYTGTVYLAALRAGVVLGDALGDTQMSHEAETQF 622

Query: 689 QKAKA-VYDSLWNGSYFNYDNSDGSS---STSIQADQLAGQWYARACGLLPIADEAKVKK 744
           ++ +A +  +LWNG +F Y    G +      + + QLAGQ+ +R  G   +    + K 
Sbjct: 623 KQTQAGLIHALWNGRFFAYGTDIGGAHRRDDRLFSGQLAGQFLSRYAGWGDVLPHDQAKS 682

Query: 745 ALTKIYDFNVLKVKGGMCGAMNGMQPDGRIDMSGLQAREIWP 786
           A+ +    +VL         +  ++    +DM G ++   WP
Sbjct: 683 AIYEQLHTSVLGTPDFYAPKVWDLEMQRGVDMPGSRS---WP 721


>gi|158316208|ref|YP_001508716.1| hypothetical protein Franean1_4431 [Frankia sp. EAN1pec]
 gi|158111613|gb|ABW13810.1| protein of unknown function DUF608 [Frankia sp. EAN1pec]
          Length = 934

 Score =  105 bits (261), Expect = 1e-19,   Method: Compositional matrix adjust.
 Identities = 169/800 (21%), Positives = 296/800 (37%), Gaps = 117/800 (14%)

Query: 141 SVLCPRSPGVPKKNTDSGIESWDWNLKGENCTYHALFPRAWTVYDGEPDPELRIVCRQIS 200
           ++  P +   P +     + +W+  LK  + +Y  LFP  +  Y G    + ++  +  S
Sbjct: 163 TLAVPHTNTAPDRTFGDVLPAWN-TLKPGDGSYAVLFPFGYMTYSGF---QSKVSTKIWS 218

Query: 201 PFIPHNYKESSFPASVFTFTLSNSGQTSADVTLLFTWANSVAGDSG--LSGHHFNSKTMT 258
           P + +  + +S P + F   ++N       ++++ T+ N+ A  +G   +G +      +
Sbjct: 219 PIVANEDERTSMPVAFFDMLMNNPTAKPIKISVMLTFPNAPAFATGSVRTGLYSRFDRDS 278

Query: 259 KDGVHGLTLHHRTANGRPPVT---FAVAAEETADVHVSECPCFLLSGNSKGITAKDMWNE 315
             G+ G+TL   +    P      + +AA   A   ++ C  +  SG+   I A   ++ 
Sbjct: 279 ASGIGGVTLGSDSPENTPDTVTSEWTIAAHPFAGQTLTYCTSWDGSGDGSDIYAP--FSA 336

Query: 316 IKKHGSFDHLDNDKTSPSEPGSSIGAAIAASLTIPSGSTRSVTFSLAWDCPEVKF----- 370
               G   + D D+       S+   A+A +LT+    T++V F+L+WD P+V +     
Sbjct: 337 AGADGKLPNGDIDQ-------SASAGAVAVALTLEPDQTQTVRFALSWDFPQVYYDGEDA 389

Query: 371 -FEKVYHRRYTKFYGTLGDSAARIAHD----------AILEHAKWE---CEIEAWQRPIL 416
               V+ RRYT F G           D          A  E A+++     +E+W +PI 
Sbjct: 390 TTRAVWMRRYTAFLGGKTSRTNDYVQDSYPFRQGFTIARKELARYDDSLAAVESWWKPIA 449

Query: 417 EDKRFPEWYPITLFNELYYLNAGGTIWTDGSPPMQSLATIRERKFSLDTPSKIHTPASSD 476
           E+ + P W      NELY++   G+ W  G                         P S +
Sbjct: 450 ENPQVPPWLRKASLNELYHMIFNGSFWESGL-------------------VSNTMPMSVE 490

Query: 477 TALGTRLLENGEENIGQFLYLEGAEYVMYNTYDVHFYSSFALVMLFPKLELSIQRDFAAA 536
                RL     E    F    G      N  D+          LF  LEL   R     
Sbjct: 491 EGTSPRLGSAIPETHIYFHADGGDGGAQTNEVDMDSSGYLVFAKLFRSLELGRVRPLLQM 550

Query: 537 VMMHDPGTMKIM-----SDGKWVARKCLGAVPHDIGLDDPWFEINSY--------NLFNS 583
           V  +  G  +++     S G ++ +    A   ++    P    N          +LF  
Sbjct: 551 VRQNPLGIGRVIQQTFRSSGPYITQT---ASFQNLPFSKPPTAGNPPAPPTRDLGDLFAD 607

Query: 584 SR---WKDLNSKFVLQVYRDFVATGDKNFARAVWPSVYIAMAYMEQFDKDGDGMIENEGF 640
                ++D   K + + Y       D +     +  +  A+ Y + F   G  +  +   
Sbjct: 608 EAGDPFRDCPHKLIYRTYALIKFYDDDDLLEYGYAPMLKALTYSQFFRPTGSHLPADPAS 667

Query: 641 --PDQTYDAWSANGVSAYCGGLWVAALQAASALANDVG----DHASASY---FWVRYQKA 691
             P  T D    NG   Y  GL++ +LQ  S L         D A+             A
Sbjct: 668 NNPPNTMDQAVVNGHGIYNCGLYLLSLQILSTLTPQAARLGVDEATPEIQKELDEELAAA 727

Query: 692 KAVYDSL-WNGSYFNYDNSDGSSST------------------SIQADQLAGQWYARACG 732
           K  ++ + WN +   Y   DG+                     +I  +  AGQ  A   G
Sbjct: 728 KEEFERIFWNPATGRYRYCDGTGGIGDRTGTIRGRFKPVPPPDAIWLESFAGQLVAMELG 787

Query: 733 LLPIADEAKVKKALTKIYDFNV--LKVKGGMCGAMNGMQPDGRIDMSGLQAREI---WPG 787
           L  + D    +  L    D  V     +G + G    ++PD  I  S L+  EI    PG
Sbjct: 788 LPDVVDLDHARTHLKNTLDSFVRFRDPEGNLMGGPIILKPDFSIYPSSLRTTEINEVIPG 847

Query: 788 VTYGLAASMIQ-------EEMVDMAFQTAAGVYEVAWSEDGLGYSFQTPESWNNNDEY-- 838
           + +  AA   +       +++ + A +   G     +  +  GY+F TPESW  +D +  
Sbjct: 848 IAFLAAAGAFRIGAKVKDKDITEKALKLGEGCALQIYDIESNGYAFATPESWFVDDHHIS 907

Query: 839 RSLCYMRPLTIWAMQWALTK 858
           R   Y R  ++W++  A+++
Sbjct: 908 RFPGYTRTRSVWSLYDAVSE 927


>gi|224535965|ref|ZP_03676504.1| hypothetical protein BACCELL_00829 [Bacteroides cellulosilyticus
           DSM 14838]
 gi|224522420|gb|EEF91525.1| hypothetical protein BACCELL_00829 [Bacteroides cellulosilyticus
           DSM 14838]
          Length = 899

 Score =  104 bits (260), Expect = 2e-19,   Method: Compositional matrix adjust.
 Identities = 157/697 (22%), Positives = 261/697 (37%), Gaps = 152/697 (21%)

Query: 88  GIPLGGIGAGSIGRSYRGEFQRFKLFHGICDDAPVLANQFSVFV--SRPNGEKFSSVLCP 145
           G+PLGGIG G+I  +  G+F R  + +      P+  ++ S F   +R   EK  +V   
Sbjct: 64  GVPLGGIGVGNINFAPSGKFTRIGMNN---IHTPIKRSEHSFFSLWTRKGNEK-EAVRLV 119

Query: 146 RSPGVPKKNTDSGIESWDWNLKG-ENCTYHALFPRAWTVYDGEPDPELRI--VCRQISPF 202
           R   V             + +KG E+  Y  LFP A   Y    D  L++  V R  S  
Sbjct: 120 RDNHVL------------YGMKGVEHTQYKGLFPTAELSY---ADNNLKVTPVIRAYSGL 164

Query: 203 IPHNYKESSFPASVFTFTLSNSGQTSADVTLLFTWANSVA-GDSGLSGHHFNSKTMTKDG 261
           +PHN K+SS P   F   L +  Q   +  L F+W + +   +   S   F++  +  +G
Sbjct: 165 VPHNVKDSSLPVVWFEVDLIS--QEDMEAALAFSWEDFIGLFNDPKSLEGFDNGQLLSEG 222

Query: 262 ---------------------------VHGLTLHHRTANGRPPVTF-------AVAAEET 287
                                      + GL  +   +     +TF        +A EE 
Sbjct: 223 RANINNGENWPLREKAKTYVEPYQMGSLKGLIQYAADSLQPRKLTFQNYVNQVVIAVEE- 281

Query: 288 ADVHVSECPCFLLSGNSKGITAKDMWNEIKKHGSFDHLDNDKTSPSEPGSSIGAAIAASL 347
            + +VS  P +    NS+       W++ + +G F           +  +S  +A+A   
Sbjct: 282 -EKNVSYLPTY--RSNSEA------WDQFRNNGEFTSSLTKNVLTEQSQTSSASALAVKT 332

Query: 348 TIPSGSTRSVTFSLAWDCPEVKFFEKVY--------HRRYTKFYGTLGDSAARIAHDAIL 399
            + +G  +++ F LAW  PE++                 Y K+Y    +    +   A+ 
Sbjct: 333 QLKAGQKKTIRFMLAWYAPELQIDAAALPIGSYWPCGADYNKYYHNYFNCMNSMVSYAVS 392

Query: 400 EHAKWECEIEAWQRPILEDKRFPEWYPITLFNELYYLNAGGTIWTDGSPPMQSLATIRER 459
             A+   +   WQ P+LE    P+WY   L      +N+G  I+T+              
Sbjct: 393 NRARIARQTTEWQIPVLESS-LPDWYKFKL------INSGYVIYTN-------------- 431

Query: 460 KFSLDTPSKIHTPASSDTALGTRLLENGEENIGQFLYLEGAEYVMYNTYDVHFYSSFALV 519
                                  +L  G    G  +  EGA      T D    S     
Sbjct: 432 ----------------------MVLTKG----GDVMVNEGAMGGFAGTMDQRLSSHPFYQ 465

Query: 520 MLFPKLELSIQRDFAAAVMMHDPGTMKIMSDGKWVARKCLGAVPHDIGLDDPWFEINSYN 579
             F +L+ S    FA A+   DP    +   G +     +G V   +  +  W       
Sbjct: 466 KFFTQLDRSEMDIFADAM---DPEGYILHFIGHYYV--GMGTVGGRVPTEKGWM------ 514

Query: 580 LFNSSRWKDLNSKFVLQVYRDFVATGDKNFARAVWPSVYIAMAYMEQFDKDGDGMIENEG 639
           L N+S W       ++Q+ +D+  TGD  + +A    +  AM ++      G  +     
Sbjct: 515 LDNASGW-------IIQLVKDYEQTGDTEYLKAHLTGLKRAMKFLYSRMPQGSTI----P 563

Query: 640 FPDQTYDAWSANGVSAYCGGLWVAALQAASALANDVGDHASASYFWVRYQKA-KAVYDSL 698
               TYD ++   + +Y  G+W+  L+A  A+   +GD +       ++  + K   + L
Sbjct: 564 VGPTTYDDFTHPPLYSYYAGVWLTTLKAYEAIGKAIGDESIVKQAQQQFATSQKEALEKL 623

Query: 699 WNGSYFNYD-NSDGSS--STSIQADQLAGQWYARACG 732
           WNG +F Y    DGS      +   QLAGQ+ +R CG
Sbjct: 624 WNGRFFAYGCEPDGSKRLDNVLFTGQLAGQFLSRYCG 660


>gi|423226023|ref|ZP_17212489.1| hypothetical protein HMPREF1062_04675 [Bacteroides cellulosilyticus
           CL02T12C19]
 gi|392630880|gb|EIY24861.1| hypothetical protein HMPREF1062_04675 [Bacteroides cellulosilyticus
           CL02T12C19]
          Length = 899

 Score =  103 bits (258), Expect = 3e-19,   Method: Compositional matrix adjust.
 Identities = 156/697 (22%), Positives = 260/697 (37%), Gaps = 152/697 (21%)

Query: 88  GIPLGGIGAGSIGRSYRGEFQRFKLFHGICDDAPVLANQFSVFV--SRPNGEKFSSVLCP 145
           G+PLGGIG G+I  +  G+F R  + +      P+  ++ S F   +R   EK  +V   
Sbjct: 64  GVPLGGIGVGNINFAPSGKFTRIGMNN---IHTPIKRSEHSFFSLWTRKGNEK-EAVRLV 119

Query: 146 RSPGVPKKNTDSGIESWDWNLKG-ENCTYHALFPRAWTVYDGEPDPELRI--VCRQISPF 202
           R   V             + +KG E+  Y  LFP A   Y    D  L++  V R  S  
Sbjct: 120 RDNHVL------------YGMKGVEHTQYKGLFPTAELSY---ADNNLKVTPVIRAYSGL 164

Query: 203 IPHNYKESSFPASVFTFTLSNSGQTSADVTLLFTWANSVA-GDSGLSGHHFNSKTMTKDG 261
           +PHN K+SS P  V  F +    Q   +  L F+W + +   +   S   F++  +  +G
Sbjct: 165 VPHNVKDSSLP--VVWFEVDLIAQEDMEAALAFSWEDFIGLFNDPKSLEGFDNGQLLSEG 222

Query: 262 ---------------------------VHGLTLHHRTANGRPPVTF-------AVAAEET 287
                                      + GL  +   +     +TF        +A EE 
Sbjct: 223 RANINNGENWPLREKAKTYVEPYQMGSLKGLIQYAADSLQPRKLTFQNYVNQVVIAVEE- 281

Query: 288 ADVHVSECPCFLLSGNSKGITAKDMWNEIKKHGSFDHLDNDKTSPSEPGSSIGAAIAASL 347
            + +VS  P +    NS+       W + + +G F           +  +S  + +A   
Sbjct: 282 -EKNVSYLPAY--RSNSEA------WEQFRNNGEFTSSLTKNVLTEQSQTSSASVLAVKT 332

Query: 348 TIPSGSTRSVTFSLAWDCPEVKFFEKVY--------HRRYTKFYGTLGDSAARIAHDAIL 399
            + +G  +++ F LAW  PE++                 Y K+Y    ++   +   A+ 
Sbjct: 333 QLKAGQKKTIRFMLAWYAPELQIDAAALPIGSYWPCGADYNKYYHNYFNNMNSMVSYAVS 392

Query: 400 EHAKWECEIEAWQRPILEDKRFPEWYPITLFNELYYLNAGGTIWTDGSPPMQSLATIRER 459
             A+   +   WQ P+LE    P+WY   L      +N+G  I+T+              
Sbjct: 393 NRARIARQTTEWQIPVLESS-LPDWYKFKL------INSGYVIYTN-------------- 431

Query: 460 KFSLDTPSKIHTPASSDTALGTRLLENGEENIGQFLYLEGAEYVMYNTYDVHFYSSFALV 519
                                  +L  G    G  +  EGA      T D    S     
Sbjct: 432 ----------------------MVLTKG----GDVMVNEGAMGGFAGTMDQRLSSHPFYQ 465

Query: 520 MLFPKLELSIQRDFAAAVMMHDPGTMKIMSDGKWVARKCLGAVPHDIGLDDPWFEINSYN 579
             F +L+ S    FA A+   DP    +   G +     +G V   +  +  W       
Sbjct: 466 KFFTQLDRSEMDIFADAM---DPEGYILHFIGHYYV--GMGTVGGRVPTEKGWM------ 514

Query: 580 LFNSSRWKDLNSKFVLQVYRDFVATGDKNFARAVWPSVYIAMAYMEQFDKDGDGMIENEG 639
           L N+S W       ++Q+ +D+  TGD  + +A    +  AM ++      G  +     
Sbjct: 515 LDNASGW-------IIQLVKDYEQTGDTEYLKAHLTGLKRAMKFLYSRMPQGSTI----P 563

Query: 640 FPDQTYDAWSANGVSAYCGGLWVAALQAASALANDVGDHASASYFWVRYQKA-KAVYDSL 698
               TYD ++   + +Y  G+W+  L+A  A+   +GD +       ++  + K   + L
Sbjct: 564 VGPTTYDDFTHPPLYSYYAGVWLTTLKAYEAIGKAIGDESIVKQAQQQFATSQKEALEKL 623

Query: 699 WNGSYFNYD-NSDGSS--STSIQADQLAGQWYARACG 732
           WNG +F Y    DGS      +   QLAGQ+ +R CG
Sbjct: 624 WNGRFFAYGCEPDGSKRLDNVLFTGQLAGQFLSRYCG 660


>gi|385775519|ref|YP_005648087.1| hypothetical protein [Sulfolobus islandicus REY15A]
 gi|323474267|gb|ADX84873.1| conserved hypothetical protein [Sulfolobus islandicus REY15A]
          Length = 661

 Score =  100 bits (250), Expect = 3e-18,   Method: Compositional matrix adjust.
 Identities = 81/297 (27%), Positives = 129/297 (43%), Gaps = 35/297 (11%)

Query: 513 YSSFALVMLFPKLELSIQRDFAAAVMMHDPGTMKIMSDGKWVARKCLGAVPHDIG---LD 569
           + S  ++++FP+LE S  +   + V                   +  G VPHD+G   LD
Sbjct: 350 FGSLPVILMFPELEKSFLKLLISYV-------------------RNDGYVPHDLGFHSLD 390

Query: 570 DPWFEINSYNLFNSSRWKDLNSKFVLQVYRDFVATGDKNFARAVWPSVYIAMAYMEQFDK 629
            P     S       +WKD+N   VL VYR F  T D +F + V+P++   M +  +  +
Sbjct: 391 SPIDGTTS-----PPKWKDMNPSLVLLVYRYFKFTNDIDFLKEVYPTIVKVMDW--ELRQ 443

Query: 630 DGDGMIENEGFPDQTYDAWSANGVSAYCGGLWVAALQAASALANDVGDHASASYFWVRYQ 689
             DG+   EG  D  +DA    G  +Y   L++A+L A   +A  VGD     +   +  
Sbjct: 444 CRDGLPFMEGEMDNAFDATIIKGHDSYTSSLFIASLIAMREIAKLVGDSNYVGFINEKLS 503

Query: 690 KAKAVYDSLWNGSYFNYDNSDGSSSTSIQADQLAGQWYARACGLLPIADEAKVKKALTKI 749
            A+  +  ++NG YF     DG    S  A QL G+W+     L  IADE  +K AL  I
Sbjct: 504 VAREAFRKMFNGKYFK--AWDGVDKASFLA-QLYGEWFTTLLELENIADENMIKSALESI 560

Query: 750 YDFNVLKVKGGMCGAMNGMQPDGRIDMSGLQAREIWPGVTYGLAASMIQEEMVDMAF 806
              N            N +  +G+I    +Q    WP + + +     ++ + D++F
Sbjct: 561 IRLN---GNASPYCVPNLVDENGKIVNLSVQTYSSWPRLVFAICWLAYKKGVGDLSF 614


>gi|238619389|ref|YP_002914214.1| hypothetical protein M164_0933 [Sulfolobus islandicus M.16.4]
 gi|238380458|gb|ACR41546.1| protein of unknown function DUF608 [Sulfolobus islandicus M.16.4]
          Length = 661

 Score =  100 bits (248), Expect = 5e-18,   Method: Compositional matrix adjust.
 Identities = 80/297 (26%), Positives = 128/297 (43%), Gaps = 35/297 (11%)

Query: 513 YSSFALVMLFPKLELSIQRDFAAAVMMHDPGTMKIMSDGKWVARKCLGAVPHDIG---LD 569
           + S  ++++FP+LE S  +   + V                   +  G VPHD+G   LD
Sbjct: 350 FGSLPVILMFPELEKSFLKLLISYV-------------------RNDGYVPHDLGFHSLD 390

Query: 570 DPWFEINSYNLFNSSRWKDLNSKFVLQVYRDFVATGDKNFARAVWPSVYIAMAYMEQFDK 629
            P     S       +WKD+N   VL VYR F  T D +F + V+P++   M +  +  +
Sbjct: 391 SPIDGTTS-----PPKWKDMNPSLVLLVYRYFKFTNDIDFLKEVYPTIVKVMNW--ELRQ 443

Query: 630 DGDGMIENEGFPDQTYDAWSANGVSAYCGGLWVAALQAASALANDVGDHASASYFWVRYQ 689
             DG+   EG  D  +DA    G  +Y   L++A+L A   +A  VGD     +   +  
Sbjct: 444 CRDGLPFMEGEMDNAFDATIIKGHDSYTSSLFIASLIAMREIAKLVGDSNYVGFINEKLS 503

Query: 690 KAKAVYDSLWNGSYFNYDNSDGSSSTSIQADQLAGQWYARACGLLPIADEAKVKKALTKI 749
            A+  +  ++NG YF     DG    S  A QL G+W+     L  I DE  +K AL  I
Sbjct: 504 VAREAFRKMFNGKYFK--AWDGVDKASFLA-QLYGEWFTTLLELENIVDENMIKSALESI 560

Query: 750 YDFNVLKVKGGMCGAMNGMQPDGRIDMSGLQAREIWPGVTYGLAASMIQEEMVDMAF 806
              N            N +  +G+I    +Q    WP + + +     ++ + D++F
Sbjct: 561 IRLN---GNASPYCVPNLVDENGKIVNLSVQTYSSWPRLVFAICWLAYKKGVGDLSF 614


>gi|229584458|ref|YP_002842959.1| hypothetical protein M1627_0991 [Sulfolobus islandicus M.16.27]
 gi|228019507|gb|ACP54914.1| protein of unknown function DUF608 [Sulfolobus islandicus M.16.27]
          Length = 661

 Score = 99.8 bits (247), Expect = 6e-18,   Method: Compositional matrix adjust.
 Identities = 80/297 (26%), Positives = 128/297 (43%), Gaps = 35/297 (11%)

Query: 513 YSSFALVMLFPKLELSIQRDFAAAVMMHDPGTMKIMSDGKWVARKCLGAVPHDIG---LD 569
           + S  ++++FP+LE S  +   + V                   +  G VPHD+G   LD
Sbjct: 350 FGSLPVILMFPELEKSFLKLLISYV-------------------RNDGYVPHDLGFHSLD 390

Query: 570 DPWFEINSYNLFNSSRWKDLNSKFVLQVYRDFVATGDKNFARAVWPSVYIAMAYMEQFDK 629
            P     S       +WKD+N   VL VYR F  T D +F + V+P++   M +  +  +
Sbjct: 391 SPIDGTTS-----PPKWKDMNPSLVLLVYRYFKFTNDIDFLKEVYPTIVKVMDW--ELRQ 443

Query: 630 DGDGMIENEGFPDQTYDAWSANGVSAYCGGLWVAALQAASALANDVGDHASASYFWVRYQ 689
             DG+   EG  D  +DA    G  +Y   L++A+L A   +A  VGD     +   +  
Sbjct: 444 CRDGLPFMEGEMDNAFDATIIKGHDSYTSSLFIASLIAMREIAKLVGDSNYVGFINEKLS 503

Query: 690 KAKAVYDSLWNGSYFNYDNSDGSSSTSIQADQLAGQWYARACGLLPIADEAKVKKALTKI 749
            A+  +  ++NG YF     DG    S  A QL G+W+     L  I DE  +K AL  I
Sbjct: 504 VAREAFRKMFNGKYFK--AWDGVDKASFLA-QLYGEWFTTLLELENIVDENMIKSALESI 560

Query: 750 YDFNVLKVKGGMCGAMNGMQPDGRIDMSGLQAREIWPGVTYGLAASMIQEEMVDMAF 806
              N            N +  +G+I    +Q    WP + + +     ++ + D++F
Sbjct: 561 IRLN---GNASPYCVPNLVDENGKIVNLSVQTYSSWPRLVFAICWLAYKKGVGDLSF 614


>gi|227827237|ref|YP_002829016.1| hypothetical protein M1425_0924 [Sulfolobus islandicus M.14.25]
 gi|227459032|gb|ACP37718.1| protein of unknown function DUF608 [Sulfolobus islandicus M.14.25]
          Length = 661

 Score = 99.8 bits (247), Expect = 6e-18,   Method: Compositional matrix adjust.
 Identities = 80/297 (26%), Positives = 128/297 (43%), Gaps = 35/297 (11%)

Query: 513 YSSFALVMLFPKLELSIQRDFAAAVMMHDPGTMKIMSDGKWVARKCLGAVPHDIG---LD 569
           + S  ++++FP+LE S  +   + V                   +  G VPHD+G   LD
Sbjct: 350 FGSLPVILMFPELEKSFLKLLISYV-------------------RNDGYVPHDLGFHSLD 390

Query: 570 DPWFEINSYNLFNSSRWKDLNSKFVLQVYRDFVATGDKNFARAVWPSVYIAMAYMEQFDK 629
            P     S       +WKD+N   VL VYR F  T D +F + V+P++   M +  +  +
Sbjct: 391 SPIDGTTS-----PPKWKDMNPSLVLLVYRYFKFTNDIDFLKEVYPTIVKVMDW--ELRQ 443

Query: 630 DGDGMIENEGFPDQTYDAWSANGVSAYCGGLWVAALQAASALANDVGDHASASYFWVRYQ 689
             DG+   EG  D  +DA    G  +Y   L++A+L A   +A  VGD     +   +  
Sbjct: 444 CRDGLPFMEGEMDNAFDATIIKGHDSYTSSLFIASLIAMREIAKLVGDSNYVGFINEKLS 503

Query: 690 KAKAVYDSLWNGSYFNYDNSDGSSSTSIQADQLAGQWYARACGLLPIADEAKVKKALTKI 749
            A+  +  ++NG YF     DG    S  A QL G+W+     L  I DE  +K AL  I
Sbjct: 504 VAREAFRKMFNGKYFK--AWDGVDKASFLA-QLYGEWFTTLLELENIVDENMIKSALESI 560

Query: 750 YDFNVLKVKGGMCGAMNGMQPDGRIDMSGLQAREIWPGVTYGLAASMIQEEMVDMAF 806
              N            N +  +G+I    +Q    WP + + +     ++ + D++F
Sbjct: 561 IRLN---GNASPYCVPNLVDENGKIVNLSVQTYSSWPRLVFAICWLAYKKGVGDLSF 614


>gi|385772401|ref|YP_005644967.1| hypothetical protein [Sulfolobus islandicus HVE10/4]
 gi|323476515|gb|ADX81753.1| conserved hypothetical protein [Sulfolobus islandicus HVE10/4]
          Length = 661

 Score = 99.4 bits (246), Expect = 9e-18,   Method: Compositional matrix adjust.
 Identities = 80/297 (26%), Positives = 129/297 (43%), Gaps = 35/297 (11%)

Query: 513 YSSFALVMLFPKLELSIQRDFAAAVMMHDPGTMKIMSDGKWVARKCLGAVPHDIG---LD 569
           + S  ++++FP+LE S  +   + V                   +  G VPHD+G   LD
Sbjct: 350 FGSLPVILMFPELEKSFLKLLISYV-------------------RNDGYVPHDLGFHSLD 390

Query: 570 DPWFEINSYNLFNSSRWKDLNSKFVLQVYRDFVATGDKNFARAVWPSVYIAMAYMEQFDK 629
            P     S       +WKD+N   VL VYR F  T D +F + V+P++   M +  +  +
Sbjct: 391 SPIDGTTS-----PPKWKDMNPSLVLLVYRYFKFTNDIDFLKEVYPTIVKVMDW--ELRQ 443

Query: 630 DGDGMIENEGFPDQTYDAWSANGVSAYCGGLWVAALQAASALANDVGDHASASYFWVRYQ 689
             DG+   EG  D  +DA    G  +Y   L++A+L A   +A  VGD     +   +  
Sbjct: 444 CRDGLPFMEGEMDNAFDATIIKGHDSYTSSLFIASLIAMREIAKLVGDSNYVGFINEKLS 503

Query: 690 KAKAVYDSLWNGSYFNYDNSDGSSSTSIQADQLAGQWYARACGLLPIADEAKVKKALTKI 749
            A+  +  ++NG YF     DG    S  A QL G+W+     L  IA+E  +K AL  I
Sbjct: 504 VAREAFRKMFNGKYFK--AWDGVDKASFLA-QLYGEWFTTLLELENIANENMIKSALESI 560

Query: 750 YDFNVLKVKGGMCGAMNGMQPDGRIDMSGLQAREIWPGVTYGLAASMIQEEMVDMAF 806
              N            N +  +G+I    +Q    WP + + +     ++ + D++F
Sbjct: 561 IRLN---GNASPYCVPNLVDENGKIVNLSVQTYSSWPRLVFAICWLAYKKGVGDLSF 614


>gi|15898194|ref|NP_342799.1| hypothetical protein SSO1353 [Sulfolobus solfataricus P2]
 gi|13814563|gb|AAK41589.1| Hypothetical protein SSO1353 [Sulfolobus solfataricus P2]
          Length = 663

 Score = 96.3 bits (238), Expect = 6e-17,   Method: Compositional matrix adjust.
 Identities = 80/300 (26%), Positives = 133/300 (44%), Gaps = 41/300 (13%)

Query: 513 YSSFALVMLFPKLELSIQRDFAAAVMMHDPGTMKIMSDGKWVARKCLGAVPHDIG---LD 569
           + S  ++++FP+LE    + F   ++ H      I  DG          VPHD+G   LD
Sbjct: 352 FGSLPVILMFPELE----KSFLKLLIRH------IREDG---------YVPHDLGYHSLD 392

Query: 570 DPWFEINSYNLFNSSRWKDLNSKFVLQVYRDFVATGDKNFARAVWPSVYIAMAYMEQFDK 629
            P     S       RWKD+N   +L VYR F  T D  F + V+P +   M +  +  K
Sbjct: 393 SPIDGTTS-----PPRWKDMNPSLILLVYRYFKFTNDIEFLKEVYPILVKVMDWELRQCK 447

Query: 630 DGDGMIENEGFPDQTYDAWSANGVSAYCGGLWVAALQAASALANDVGDHASASYFWVRYQ 689
                +E E   D  +DA    G  +Y   L++ +L A   +A  VGD     +   +  
Sbjct: 448 GNLPFMEGE--MDNAFDATIIKGHDSYTSSLFIGSLIAMREIAKLVGDSNYVDFISEKLS 505

Query: 690 KAKAVYDSLWNGSYFN-YDNSDGSSSTSIQADQLAGQWYARACGLLPIADEAKVKKALTK 748
            A+  +  ++NG YF  +D+ D +S  +    QL G+W+    GL  I +E  +KKAL  
Sbjct: 506 SAREAFRRMFNGRYFKAWDSVDNASFLA----QLYGEWFTTLVGLEDIVEEDIIKKALES 561

Query: 749 IYDFNVLKVKGGMC--GAMNGMQPDGRIDMSGLQAREIWPGVTYGLAASMIQEEMVDMAF 806
           I     +++ G        N +  +G+I    +Q    WP + + +     ++ + D++F
Sbjct: 562 I-----IRLNGNASPHCVPNLVDDNGKIVGLSVQTYSSWPRMVFAICWLAYKKGVGDLSF 616


>gi|15898746|ref|NP_343351.1| hypothetical protein SSO1948 [Sulfolobus solfataricus P2]
 gi|229578524|ref|YP_002836922.1| hypothetical protein YG5714_0710 [Sulfolobus islandicus Y.G.57.14]
 gi|229582969|ref|YP_002841368.1| hypothetical protein YN1551_2495 [Sulfolobus islandicus Y.N.15.51]
 gi|284173504|ref|ZP_06387473.1| hypothetical protein Ssol98_02460 [Sulfolobus solfataricus 98/2]
 gi|384435081|ref|YP_005644439.1| hypothetical protein [Sulfolobus solfataricus 98/2]
 gi|13815223|gb|AAK42141.1| Hypothetical protein SSO1948 [Sulfolobus solfataricus P2]
 gi|228009238|gb|ACP45000.1| protein of unknown function DUF608 [Sulfolobus islandicus
           Y.G.57.14]
 gi|228013685|gb|ACP49446.1| protein of unknown function DUF608 [Sulfolobus islandicus
           Y.N.15.51]
 gi|261603235|gb|ACX92838.1| protein of unknown function DUF608 [Sulfolobus solfataricus 98/2]
          Length = 661

 Score = 96.3 bits (238), Expect = 8e-17,   Method: Compositional matrix adjust.
 Identities = 72/250 (28%), Positives = 112/250 (44%), Gaps = 16/250 (6%)

Query: 560 GAVPHDIG---LDDPWFEINSYNLFNSSRWKDLNSKFVLQVYRDFVATGDKNFARAVWPS 616
           G VPHD+G   LD P     S       +WKD+N   +L VYR F  T D +F + V+P+
Sbjct: 378 GYVPHDLGFHSLDSPIDGTTS-----PPKWKDMNPSLILLVYRYFKFTNDIDFLKEVYPT 432

Query: 617 VYIAMAYMEQFDKDGDGMIENEGFPDQTYDAWSANGVSAYCGGLWVAALQAASALANDVG 676
           +   M +  +  +  DG+   EG  D  +DA    G  +Y   L++A+L A   +A  VG
Sbjct: 433 IVKVMDW--ELRQCRDGLPFMEGEMDNAFDATIIKGHDSYTSSLFIASLIAMREIAKLVG 490

Query: 677 DHASASYFWVRYQKAKAVYDSLWNGSYFNYDNSDGSSSTSIQADQLAGQWYARACGLLPI 736
           D     +   +   A+  +  ++NG YF     DG    S  A QL G+W+     L  I
Sbjct: 491 DSNYVGFINEKLNVAREAFRKMFNGKYFK--AWDGVDKASFLA-QLYGEWFTTLLELENI 547

Query: 737 ADEAKVKKALTKIYDFNVLKVKGGMCGAMNGMQPDGRIDMSGLQAREIWPGVTYGLAASM 796
            DE  +K AL  I   N            N +  +G+I    +Q    WP + + +    
Sbjct: 548 VDENMIKSALESIIRLN---GNASPYCVPNLVDENGKIVNLSVQTYSSWPRLVFAICWLA 604

Query: 797 IQEEMVDMAF 806
            ++ + D++F
Sbjct: 605 YKKGVGDLSF 614


>gi|227829616|ref|YP_002831395.1| hypothetical protein LS215_0672 [Sulfolobus islandicus L.S.2.15]
 gi|227456063|gb|ACP34750.1| protein of unknown function DUF608 [Sulfolobus islandicus L.S.2.15]
          Length = 661

 Score = 96.3 bits (238), Expect = 8e-17,   Method: Compositional matrix adjust.
 Identities = 72/250 (28%), Positives = 112/250 (44%), Gaps = 16/250 (6%)

Query: 560 GAVPHDIG---LDDPWFEINSYNLFNSSRWKDLNSKFVLQVYRDFVATGDKNFARAVWPS 616
           G VPHD+G   LD P     S       +WKD+N   +L VYR F  T D +F + V+P+
Sbjct: 378 GYVPHDLGFHSLDSPIDGTTS-----PPKWKDMNPSLILLVYRYFKFTNDIDFLKEVYPT 432

Query: 617 VYIAMAYMEQFDKDGDGMIENEGFPDQTYDAWSANGVSAYCGGLWVAALQAASALANDVG 676
           +   M +  +  +  DG+   EG  D  +DA    G  +Y   L++A+L A   +A  VG
Sbjct: 433 IVKVMDW--ELRQCRDGLPFMEGEMDNAFDATIIKGHDSYTSSLFIASLIAMREIAKLVG 490

Query: 677 DHASASYFWVRYQKAKAVYDSLWNGSYFNYDNSDGSSSTSIQADQLAGQWYARACGLLPI 736
           D     +   +   A+  +  ++NG YF     DG    S  A QL G+W+     L  I
Sbjct: 491 DSNYVGFINEKLNVAREAFRKMFNGKYFK--AWDGVDKASFLA-QLYGEWFTTLLELENI 547

Query: 737 ADEAKVKKALTKIYDFNVLKVKGGMCGAMNGMQPDGRIDMSGLQAREIWPGVTYGLAASM 796
            DE  +K AL  I   N            N +  +G+I    +Q    WP + + +    
Sbjct: 548 VDENMIKSALESIIRLN---GNASPYCVPNLVDENGKIVNLSVQTYSSWPRLVFAICWLA 604

Query: 797 IQEEMVDMAF 806
            ++ + D++F
Sbjct: 605 YKKGVGDLSF 614


>gi|15922946|ref|NP_378615.1| hypothetical protein ST2609 [Sulfolobus tokodaii str. 7]
 gi|15623737|dbj|BAB67724.1| putative glycosidase [Sulfolobus tokodaii str. 7]
          Length = 661

 Score = 95.1 bits (235), Expect = 2e-16,   Method: Compositional matrix adjust.
 Identities = 71/245 (28%), Positives = 117/245 (47%), Gaps = 26/245 (10%)

Query: 560 GAVPHDIG---LDDPWFEINSYNLFNSSRWKDLNSKFVLQVYRDFVATGDKNFARAVWPS 616
           G VPHD+G   LD P     +       +WKDLN  ++L +YR +  TGD  F ++V+  
Sbjct: 364 GYVPHDLGYCSLDSP-----TDGTTAPPKWKDLNPTYILLIYRYYKLTGDIEFLKSVYDK 418

Query: 617 VYIAMAYMEQFDKDGDGMIENEGFPDQTYDAWSANGVSAYCGGLWVAALQAASALANDVG 676
           V  A  +  +F + G      EG  D  +D     G+++Y   L++A+L A   ++   G
Sbjct: 419 VKKAFEWELKFSRYG-----LEGKMDSAFDVTPIKGINSYTLSLYIASLFAMREISKTAG 473

Query: 677 DHASASYFWVRYQKAKAVYDSLWNGSYFNYDNSDGSSSTSIQADQLAGQWYARACGLLPI 736
           D+ +      + ++AK  ++ ++NG YF     +G       A Q+ G+W+    GL PI
Sbjct: 474 DNLNLDE---QIKEAKEAFEKMFNGKYF--IAWEGMEDAVFLA-QVFGEWWTTLLGLEPI 527

Query: 737 ADEAKVKKALTKIYDFNVLKVKGGMCG--AMNGMQPDGRIDMSGLQAREIWPGVTYGLAA 794
           ADE K+K AL  I     +KV G        N ++ DG++     Q    WP + + +  
Sbjct: 528 ADEEKIKSALRWI-----IKVNGNASKYCTPNLVKEDGKVVSLSPQTYSSWPRLVFAICW 582

Query: 795 SMIQE 799
             I++
Sbjct: 583 LSIEK 587


>gi|383764296|ref|YP_005443278.1| hypothetical protein CLDAP_33410 [Caldilinea aerophila DSM 14535 =
           NBRC 104270]
 gi|381384564|dbj|BAM01381.1| hypothetical protein CLDAP_33410 [Caldilinea aerophila DSM 14535 =
           NBRC 104270]
          Length = 905

 Score = 92.4 bits (228), Expect = 1e-15,   Method: Compositional matrix adjust.
 Identities = 140/614 (22%), Positives = 231/614 (37%), Gaps = 126/614 (20%)

Query: 194 IVCRQISPFIPHNYKESSFPASVFTFTLSNSGQTSADVTLLFTWANSV-AGDSGLSGHH- 251
           +  R  SPFIP N ++SS PA+ F +T++NS + S D T++    N + A +  +  HH 
Sbjct: 162 VKLRAFSPFIPLNDRDSSLPAAFFEWTITNSTEESIDYTVVGVLCNPLPANNLNVVRHHP 221

Query: 252 -FNSKTMTKDGVHGLTLHHRTANGRPPVTFAVAAEETADVHVSECPCFLLSGNSKGITAK 310
              +  MT DGV   T  +        +T A  A    D  +S    +        +   
Sbjct: 222 WGCALHMTSDGVQPDTPSYGD------LTLATNANLVPDAAMSWQQYWFRGAWFDSLEV- 274

Query: 311 DMWNEIKKHGSFDHLDNDKTSPSEPGSSIGAAIAASLTIPSGSTRSVTFSLAWDCPEVKF 370
             W ++   G F    N        G      +A   ++  G +++V F + W+ P  + 
Sbjct: 275 -YWKDLISGGRF---QNRVYPLQRAGEHNEGVLAVHFSLQPGESKTVNFVITWNFPICEN 330

Query: 371 FEKVYHRRYTKFYGTLGDSAARIAHDAILEHAKWECEIEAWQRPILEDKRFPEWYPITLF 430
           + K           T  +  A +  D+    A  +  +  W+R + E  RF E     LF
Sbjct: 331 YWKPEQVLPQGISPTWKNYYATVWKDS---QASAQYALSEWERLLDETHRFKE----ALF 383

Query: 431 NELYYLNAGGTIWTDGSPPMQSLATIRERKFSLDTPSKIHTPASSDTALGTRLLENGEEN 490
                           S P   L  +      L +P+ +       T  G    E    +
Sbjct: 384 A--------------SSLPPAVLDAVSANMAVLKSPTVLRL--EDGTLYG---WEGCHPD 424

Query: 491 IGQFLYLEGAEYVMYNTYDVHFYSSFALVMLFPKLELSIQR-DFAAAVMMHDPGTMKI-- 547
            G     EG+   ++N           L  LFPKLE S++  DFA  +        +I  
Sbjct: 425 SG---CCEGSCTHVWNYQQT-------LPFLFPKLERSMREADFAYNLRPDGGMAFRIQL 474

Query: 548 -MSDGKWVARKCLGAVPHDIGLDDPWFEINSYNLFNSSRWKDLNSKFVLQVYRDFVATGD 606
            +  G+W  R C                             D     VL+ YRD+   GD
Sbjct: 475 PLGAGRWDFRPC----------------------------ADGQFGGVLKTYRDWKICGD 506

Query: 607 KNFARAVWPSVYIAMAY------MEQFDKDGDGMIENEGFPDQTYDAWSANGVSAYCGGL 660
             + R +WPSV  ++ +      ++++D +  G++   G    T D     G +++   +
Sbjct: 507 NEWLRRLWPSVKKSLEFAWHPNNIDRWDPEKTGVL--WGRQHHTLDM-ELFGPNSWLTSM 563

Query: 661 WVAALQAASALANDVGDHASASYFWVRYQKAKAVYDS-LWNGSYF--------------- 704
           ++AAL+A + +A  +GD A+A  +   + + K   D+ L+NG Y+               
Sbjct: 564 YLAALKAGAEMAEYLGDEAAAIEYREIFARGKKWADANLFNGEYYIQRINLHDRSIVEAF 623

Query: 705 ---------------NYDNSDGSSSTSI----QADQLAGQWYARACGLLPIADEAKVKKA 745
                           +D+  G     I      DQ+ GQW+A   GL  I D  +V+ A
Sbjct: 624 ADDESVLIGGSTLEAYWDDEHGEIKYQIGEGSSIDQVLGQWHASLYGLGEIFDPTQVRSA 683

Query: 746 LTKIYDFNVLKVKG 759
           L  IY +N + V G
Sbjct: 684 LAAIYKYNFIPVMG 697


>gi|332796035|ref|YP_004457535.1| bile acid beta-glucosidase [Acidianus hospitalis W1]
 gi|332693770|gb|AEE93237.1| bile acid beta-glucosidase [Acidianus hospitalis W1]
          Length = 840

 Score = 91.3 bits (225), Expect = 2e-15,   Method: Compositional matrix adjust.
 Identities = 92/389 (23%), Positives = 159/389 (40%), Gaps = 74/389 (19%)

Query: 493 QFLYLEGAEYVMYNTYDVHFYSSFALVMLFPKLELSIQRDFAAAVMMHDPGTMKIMS--- 549
           Q+ Y +G  +   NT DV  YS   ++ LFP+L   I   F   ++  +     + S   
Sbjct: 343 QYPYTDGPLHTALNTIDVLTYSLPTILNLFPELAEKIILQFKDKLIEENTPEHVVYSLSF 402

Query: 550 -------------------DGKWVARKCLGAVPHDIGLDD----PWFEINSYNLFNSSRW 586
                              DG+ +    +  +  + G D     P F  ++  +    R 
Sbjct: 403 EENREKFLEKLSKDPSLPTDGEKLY-STINEIVKETGKDPKGRVPHFFTDNLRVDEYHR- 460

Query: 587 KDLNSKFVLQVYRDFVATGDKNFARAVWPSV------------YIAMAY----------- 623
            DLN +F+L  Y     TGD NF + ++P +            Y  + Y           
Sbjct: 461 VDLNPEFILMTYLIAKTTGDLNFLKEIFPKMKEALESTMKTQTYDGLIYHTLPAGLEWLR 520

Query: 624 -------MEQFDKDGDGMIENEGFP--DQTYDAWSANGVSAYCGGLWVAALQAASALAND 674
                  + + D +   ++ +   P   QT+D WS  G++++   LW++++QA +   ++
Sbjct: 521 YVNNKLNLPRGDNNSASILGHNLIPLSMQTFDDWSMIGITSFTSILWISSIQAVNDACSN 580

Query: 675 VGDHASASYFWVRYQKAKAVYDSLWNGSYFN--YDNSDGSSSTSIQADQLAGQWYARACG 732
           +  + S  Y        K + D LWNG YF+  YD        +  A Q+ G WY+   G
Sbjct: 581 LKINCSYDY----ESLVKKLIDYLWNGEYFDLWYDPKSKMRDKACNASQILGHWYSTLLG 636

Query: 733 LLPIADEAKVKKALTKIYDFNVLKVKGGMCGAM-NGMQPDGRIDMSGL------QAREIW 785
            L   D++ VK  L  I  +N+ + +G + GA  NG +P  R   + L      Q    W
Sbjct: 637 -LRFLDDSLVKTTLKSIVKYNLKEEEGLLNGAYPNGYRPLKRNYQNQLNLPATTQIDTPW 695

Query: 786 PGVTYGLAASMIQEEMVDMAFQTAAGVYE 814
            GV + +A+ +I E++ D   +    +YE
Sbjct: 696 SGVEFYVASHLIYEKLRDEGEKILRNIYE 724



 Score = 48.1 bits (113), Expect = 0.021,   Method: Compositional matrix adjust.
 Identities = 36/141 (25%), Positives = 64/141 (45%), Gaps = 22/141 (15%)

Query: 88  GIPLGGIGAGSIGRSYRGEFQRFKLFH--GICDDAPVLANQFSVFVSRPNGEKFSSVLCP 145
           GIPLGGIG GS+     G    + +F+  G  +   +   +F+ ++   +     S    
Sbjct: 12  GIPLGGIGTGSVEIRADGRLYEWTIFNNGGYAERQDL---RFTYYLDEQD-----SFFAV 63

Query: 146 RSPGVPKKNTDSGIESWDWNLKGENCTYHALFPRAWTVYDGEPD-------PELRIVCRQ 198
           R  G+ +      ++++D+N  G       L P     ++GEP         ++ +  + 
Sbjct: 64  RQKGIVRV-----LQAFDYNFGGSPYILPWLRPVERVEFNGEPPIAYLEFVDKISVNMKA 118

Query: 199 ISPFIPHNYKESSFPASVFTF 219
            SPFIP + K SS P+++FT 
Sbjct: 119 FSPFIPTDLKNSSLPSAIFTL 139


>gi|402589212|gb|EJW83144.1| hypothetical protein WUBG_05948, partial [Wuchereria bancrofti]
          Length = 174

 Score = 89.4 bits (220), Expect = 9e-15,   Method: Composition-based stats.
 Identities = 56/159 (35%), Positives = 79/159 (49%), Gaps = 20/159 (12%)

Query: 89  IPLGGIGAGSIGRSYRGEFQRFKLFHGICDD--APVLANQFSVFVSRPNG---EKFSSVL 143
           +P G IG+GSIGR +RG F +F L  GI +     V ANQF+     PN      +  VL
Sbjct: 1   VPCGTIGSGSIGRDFRGGFCKFGLRPGITEQKIGVVKANQFN-----PNNLWETVYQKVL 55

Query: 144 CPRSPGVPKKNTDSG---IESWDWNLKGENCTYHALFPRAWTVYDGEPDPELRIVCRQIS 200
           C RS      +  SG   + SWD++   +   Y  L+PR WT Y   P+    +  +Q+S
Sbjct: 56  CARS------SVSSGRGELTSWDFSFPPDKLIYRGLYPRFWTYYS-IPEFNFILCIQQVS 108

Query: 201 PFIPHNYKESSFPASVFTFTLSNSGQTSADVTLLFTWAN 239
           P IP+NY+ES  P ++F     N         + FT+ N
Sbjct: 109 PVIPNNYEESLLPVTLFIIDAENQSSVDLQAAVAFTFRN 147


>gi|150009713|ref|YP_001304456.1| hypothetical protein BDI_3128 [Parabacteroides distasonis ATCC
           8503]
 gi|149938137|gb|ABR44834.1| conserved hypothetical protein [Parabacteroides distasonis ATCC
           8503]
          Length = 901

 Score = 87.4 bits (215), Expect = 3e-14,   Method: Compositional matrix adjust.
 Identities = 93/406 (22%), Positives = 157/406 (38%), Gaps = 67/406 (16%)

Query: 75  FNVFRKHHITSDQGIPLGGIGAGSIGRSYRGEFQRFKLFHGICDDAPVLANQFSVFVSRP 134
           F+  +  +   + G+PLGGIG G++  +  G F R  L +      P+  +  S F    
Sbjct: 47  FDYLKLENHPGNSGVPLGGIGVGNVEFAPDGRFARIGLNN---IHLPIRKSTASFFSLWY 103

Query: 135 NGEKFSSVLCPRSPGVPKKNTDSGIESWDWNLKGENCTYHALFPRAWTVYDGEPDPELRI 194
            G K ++V+      +P+             L  E+  Y  LFPR    + G   P +  
Sbjct: 104 KGAKETAVVRLVRDSLPQYGL----------LGVEDIYYTGLFPRVEMDF-GNSIPGINA 152

Query: 195 VCRQISPFIPHNYKESSFPASVFTFTLSNSGQTSADVTLLFTWANSVAGD-------SGL 247
             R  SP IPH+ K SS P   F   L +  Q   +V++ F+W + +           G+
Sbjct: 153 TIRAFSPLIPHDVKNSSLPVVFFDVDLVS--QEGGEVSIAFSWEDFIGRGIKEPDDIKGM 210

Query: 248 SGHHFN---------------------SKTMTKDGVHGLT------LHHRTANGRPPVT- 279
            G  F                      S++ +   + G+       L  + AN +  V  
Sbjct: 211 DGQLFGQERNSLCNGEAWPERPLEQTYSESWSCGKMQGIRQFASGPLKPKRANFQNYVNE 270

Query: 280 FAVAAEETADVHVSECPCFLLSGNSKGITAKDMWNEIKKHGSFDHLDNDKTSPSEPGSSI 339
            AV AE   DV VS  P F +          + W+  K+ G F+ +  D +  S PG  +
Sbjct: 271 VAVLAESREDVEVSVLPSFNIK------ETHEAWDLFKQEGRFEDISKDVSLLSIPGKQM 324

Query: 340 -GAAIAASLTIPSGSTRSVTFSLAWDCPEVKFFEK--------VYHRRYTKFYGTLGDSA 390
            G+A+A    + +   ++V F L W  PE+    +        V    Y +++    +  
Sbjct: 325 GGSAVALKTKLKAKEKKTVRFMLVWFYPEMMIDRENDPLEYYWVGGSDYGRYFHNYFNRL 384

Query: 391 ARIAHDAILEHAKWECEIEAWQRPILEDKRFPEWYPITLFNELYYL 436
            ++         ++  +   WQ P+L +   P+WY   L N  Y +
Sbjct: 385 EQLVDYGYGNRERFREKTTEWQLPVL-NSTMPDWYKFKLINSGYVI 429



 Score = 55.5 bits (132), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 47/203 (23%), Positives = 87/203 (42%), Gaps = 11/203 (5%)

Query: 588 DLNSKFVLQVYRDFVATGDKNFARAVWPSVYIAMAYMEQFDKDGDGMIENEGFPDQTYDA 647
           D    +++Q+ +D+  TGD ++       VY  M +++    +G     N      TYD 
Sbjct: 517 DNTGGWIIQLVKDYEQTGDFHYLSRYAGRVYNGMDFLKSLMPEG----LNIPIGGTTYDD 572

Query: 648 WSANGVSAYCGGLWVAALQAASALANDVGDHASASYFWVRYQKA-KAVYDSLWNGSYFNY 706
           ++   V +Y   +++A L+AA  +A  + D    + +  ++ K  K +   LWNG +F+Y
Sbjct: 573 FTHPPVYSYGASIYLATLRAAQVVAQAMKDTVRENEYKEQFDKTQKDMIRLLWNGRFFSY 632

Query: 707 D---NSDGSSSTSIQADQLAGQWYARACGLLPIADEAKVKKALTKIYDFNVLKVKGGMCG 763
               +  G     +   QL GQ+ +R CG   +      K +L   +  ++ K       
Sbjct: 633 GCELDGSGRKDNLLFTGQLGGQFVSRYCGWGDVIPMEMTKASLVSQFKISLSKTPDYYAN 692

Query: 764 AMNGMQPDGRIDMSGLQAREIWP 786
            +  +  +  ID  G Q    WP
Sbjct: 693 KVWDITLNRGIDNRGSQC---WP 712


>gi|284174574|ref|ZP_06388543.1| hypothetical protein Ssol98_07941 [Sulfolobus solfataricus 98/2]
 gi|356934758|gb|AET42959.1| hypothetical protein [Sulfolobus solfataricus 98/2]
          Length = 803

 Score = 86.3 bits (212), Expect = 7e-14,   Method: Compositional matrix adjust.
 Identities = 99/383 (25%), Positives = 151/383 (39%), Gaps = 69/383 (18%)

Query: 491 IGQFLYLEGAEYVMYNTYDVHFYSSFALVMLFPKLELSIQRDFAAAVMMHDP-------- 542
           +G++ Y  G E    NT DV  Y+   +++LFP+L  +I RD +   +  D         
Sbjct: 332 VGKYPYTGGPENTALNTIDVLLYALPGVMLLFPELAKNIVRDLSNRALKEDTPEYVLFSL 391

Query: 543 ----GTMK----------IMSDGKWV---ARKCLGAVPHDIGLDDPWFEINSYNLFNSSR 585
                 MK          I +D K +    +K +     D     P +  +S  +    R
Sbjct: 392 AFPENLMKYKEEVKKDPTISTDLKKLYDTIKKIIKETGKDPKGRMPHYIRHSLTVDTYER 451

Query: 586 WKDLNSKFVLQVYRDFVATGDKNFARAVW------------------------PSVYIAM 621
             D+N +FVL  Y     TGD+ F  +V+                        PS    M
Sbjct: 452 -VDINPEFVLLYYLIAKYTGDREFLTSVYEVARNAIESIMRTQTLDGLPYLTLPSGIEWM 510

Query: 622 AYMEQFDKDGDGM----IENEGFPDQTYDAWSANGVSAYCGGLWVAALQA---ASALAND 674
            ++    K  D              QT D WS  G S Y   LW+AAL+A   AS L N+
Sbjct: 511 RHVNNMLKVSDAHKILGYHTLALSMQTLDDWSWLGFSPYVSFLWIAALEALNEASKLLNN 570

Query: 675 VGDHASASYFWVRYQKAKAVYDSLWNGSYFN--YDNSDGSSSTSIQADQLAGQWYARACG 732
           + ++          +  + V   LWNG Y+   YD        S+ A Q+ G WY +  G
Sbjct: 571 LHNYEVK-------ELTEKVNKYLWNGEYYINWYDPISNLRDDSLNASQMTGDWYVQLLG 623

Query: 733 LLPIADEAKVKKALTKIYDFNVLKVKGGMCGAMNG-MQPDGRIDMSGLQAREIWPGVTYG 791
           L    D  K K   + I  +N  + +G   G+ N  + P G + +S +Q++  W G+ Y 
Sbjct: 624 LPEFLDHEKRKSIFSSIIKYNYTEEEGVKNGSSNKEITPLG-VKLS-VQSKTPWSGIEYY 681

Query: 792 LAASMIQEEMVDMAFQTAAGVYE 814
           LA+ M      + A +    VYE
Sbjct: 682 LASHMFYNGFDEYAKKILRNVYE 704


>gi|15899746|ref|NP_344351.1| hypothetical protein SSO3039 [Sulfolobus solfataricus P2]
 gi|13816440|gb|AAK43141.1| Hypothetical protein SSO3039 [Sulfolobus solfataricus P2]
          Length = 803

 Score = 86.3 bits (212), Expect = 7e-14,   Method: Compositional matrix adjust.
 Identities = 99/383 (25%), Positives = 151/383 (39%), Gaps = 69/383 (18%)

Query: 491 IGQFLYLEGAEYVMYNTYDVHFYSSFALVMLFPKLELSIQRDFAAAVMMHDP-------- 542
           +G++ Y  G E    NT DV  Y+   +++LFP+L  +I RD +   +  D         
Sbjct: 332 VGKYPYTGGPENTALNTIDVLLYALPGVMLLFPELAKNIVRDLSNRALKEDTPEYVLFSL 391

Query: 543 ----GTMK----------IMSDGKWV---ARKCLGAVPHDIGLDDPWFEINSYNLFNSSR 585
                 MK          I +D K +    +K +     D     P +  +S  +    R
Sbjct: 392 AFPENLMKYKEEVKKDPTISTDLKKLYDTIKKIIKETGKDPKGRMPHYIRHSLTVDTYER 451

Query: 586 WKDLNSKFVLQVYRDFVATGDKNFARAVW------------------------PSVYIAM 621
             D+N +FVL  Y     TGD+ F  +V+                        PS    M
Sbjct: 452 -VDINPEFVLLYYLIAKYTGDREFLTSVYEVARNAIESIMRTQTLDGLPYLTLPSGIEWM 510

Query: 622 AYMEQFDKDGDGM----IENEGFPDQTYDAWSANGVSAYCGGLWVAALQA---ASALAND 674
            ++    K  D              QT D WS  G S Y   LW+AAL+A   AS L N+
Sbjct: 511 RHVNNMLKVSDAHKILGYHTLALSMQTLDDWSWLGFSPYVSFLWIAALEALNEASKLLNN 570

Query: 675 VGDHASASYFWVRYQKAKAVYDSLWNGSYFN--YDNSDGSSSTSIQADQLAGQWYARACG 732
           + ++          +  + V   LWNG Y+   YD        S+ A Q+ G WY +  G
Sbjct: 571 LHNYEVK-------ELTEKVNKYLWNGEYYINWYDPISNLRDDSLNASQMTGDWYVQLLG 623

Query: 733 LLPIADEAKVKKALTKIYDFNVLKVKGGMCGAMNG-MQPDGRIDMSGLQAREIWPGVTYG 791
           L    D  K K   + I  +N  + +G   G+ N  + P G + +S +Q++  W G+ Y 
Sbjct: 624 LPEFLDHEKRKSIFSSIIKYNYTEEEGVKNGSSNKEITPLG-VKLS-VQSKTPWSGIEYY 681

Query: 792 LAASMIQEEMVDMAFQTAAGVYE 814
           LA+ M      + A +    VYE
Sbjct: 682 LASHMFYNGFDEYAKKILRNVYE 704


>gi|332796016|ref|YP_004457516.1| hypothetical protein Ahos_0327 [Acidianus hospitalis W1]
 gi|332693751|gb|AEE93218.1| conserved hypothetical protein [Acidianus hospitalis W1]
          Length = 662

 Score = 85.9 bits (211), Expect = 1e-13,   Method: Compositional matrix adjust.
 Identities = 83/309 (26%), Positives = 130/309 (42%), Gaps = 50/309 (16%)

Query: 492 GQFLYLEGAEY--VMYNTYDVHFYSSFALVMLFPKLELSIQRDFAAAVMMHDPGTMKIMS 549
           G+F  LE  E   V+     + +  +  +++LFP LE    ++ A  V+  D        
Sbjct: 332 GRFGVLEAPEVFPVVGTIAGLCYEGALPVLLLFPCLEKEFIKELAN-VIRED-------- 382

Query: 550 DGKWVARKCLGAVPHDIGLDDPW-FEINSYNLFNSSRWKDLNSKFVLQVYRDFVATGDKN 608
                     G VPHD+G+   W F+          RWKDLN  F+L VYR F  TGD +
Sbjct: 383 ----------GYVPHDLGI---WSFDSPIEGTTAPPRWKDLNPTFILLVYRYFKFTGDVD 429

Query: 609 FARAVWPSVYIAMAYMEQFDKDGDGMIENEGFPDQTYDAWSANGVSAYCGGLWVA---AL 665
           F R  +P +  A  +M          IE EG  D  +D     G +     L++A   AL
Sbjct: 430 FLRETYPKMLKAYEWMLTRS------IEAEGSGDTAFDVLPIKGKNPMLLTLFIASALAL 483

Query: 666 QAASALANDVGDHASASYFWVRYQKAKAVYDSLWNGSYF-NYDNSDGSSSTSIQADQLAG 724
           +    + N+  +    S       K + + +SL+NG YF  ++  +G     I   QL G
Sbjct: 484 RETKKVLNEKDESTDLS-------KLREMLNSLYNGKYFIAWEGQEG-----IFMAQLLG 531

Query: 725 QWYARACGLLPIADEAKVKKALTKIYDFNVLKVKGGMCGAMNGMQPDGRIDMSGLQAREI 784
           +W+    GL  + DE K+  AL  + + N    K       N ++ +G +     QA   
Sbjct: 532 EWWTELLGLENVTDEEKISSALRYMLEVN---GKASEYCTPNLVKENGEVVKISPQAYSS 588

Query: 785 WPGVTYGLA 793
           WP + + + 
Sbjct: 589 WPRLVFAMG 597


>gi|327311335|ref|YP_004338232.1| hypothetical protein TUZN_1447 [Thermoproteus uzoniensis 768-20]
 gi|326947814|gb|AEA12920.1| hypothetical protein TUZN_1447 [Thermoproteus uzoniensis 768-20]
          Length = 892

 Score = 85.9 bits (211), Expect = 1e-13,   Method: Compositional matrix adjust.
 Identities = 198/872 (22%), Positives = 308/872 (35%), Gaps = 204/872 (23%)

Query: 88  GIPLGGIGAGSIGRSYRGEFQRFKLFHG---ICDDA-----PVLANQFSVFVSRPNGEKF 139
           G+ LGGIGAGS+     G    + +F+       DA     P+    F + V    GE  
Sbjct: 12  GVALGGIGAGSVEIRADGLLHDWLIFNNGPWTTIDAHRKLYPLDEKGFLLAVRVDGGE-- 69

Query: 140 SSVLCPRSPGVPKKNTDSGIESWDWNLKGENCTYHALFPRAWTVYDGEP---------DP 190
                   P V +      ++S  + L G+      L P     Y GEP         D 
Sbjct: 70  -------GPLVRQ------LQSAGYILGGDPYKAPWLKPVKGVDYRGEPPLALLKYVDDL 116

Query: 191 ELRIVCRQISPFIPHNYKESSFPASVFTFTLSNSGQTSADVTLLFTWANSVAGDSGLSGH 250
            + I    +SPFIP + K S+ PA +      N      DV+L   WA        +   
Sbjct: 117 PVEIEAEFLSPFIPGDLKNSALPAFIAVVKFRNRSDKPLDVSL---WA--------MLKS 165

Query: 251 HFNSKTMTKDGVHGLTLHHRTANGRPPV---TFAVAAEETADVH--VSECPCFLLSGNSK 305
            F SK + K     + L      G  P+   + A+A     +VH  V   P +   G  +
Sbjct: 166 PF-SKAVAKAKGDTVVLQDGGEPGDSPLRDGSMALAVR-AGEVHAAVVRAPGYDEYGFQR 223

Query: 306 GITAKDM---WNEIKKHGSFDHLDNDKTSPSEPGSSIGAAIAASLTIPSGSTRSVTFSLA 362
              + +M   W + ++ G        +   +  G  + A + A L +     +SV   LA
Sbjct: 224 YDDSLEMLQAWIDFRREGRV------RGPAARSGDGLWAVVTAPLKLAPSEEKSVVLVLA 277

Query: 363 WDCPEV--KFFEKVYHRRYTKFYGTLGDSAARIAHDAILEHAKWECEIEAWQRPILEDKR 420
           W  P    +F E++ H  Y  F+   G  A  +  +    + K     + +   + + K 
Sbjct: 278 WFFPNHVDQFGERLGHY-YENFFSDAGSVARYVLDNFEYLYGK----TKKFHDALYDVKG 332

Query: 421 FPEWYPITLFNELYYLNAGGTIWTDGSPPMQSLATIRERKFSLDTPSKIHTPASSDTALG 480
              W    + ++L  L     +  DG             +F+L              ALG
Sbjct: 333 VESWIADLVASQLSTLVKISWLTKDG-------------RFALWE------------ALG 367

Query: 481 TRLLENGEENIGQFLYLEGAEYVMYNTYDVHFYSSFALVMLFPKLELSIQRDFAAAVMMH 540
            +             Y  G E   +NT DV  Y++ ALV LFP+L +      +A  +  
Sbjct: 368 DK-------------YYGGPERNAFNTTDVIAYATPALVSLFPELAVKYLVQHSAFALRR 414

Query: 541 DPGTMKI----MSDGKWVARKCLGAVPHDIGLD-----------------DPWFEI---- 575
                 I    + + +    K L   P+ + LD                 DP   I    
Sbjct: 415 GTPEWVIYALAIPENRREFEKALEREPY-LALDWQRLVEAVSRIVERTGKDPAGRIAHFL 473

Query: 576 -NSYNLFNSSRWKDLNSKFVLQVYRDFVATGDKNFARAVW-------------------- 614
             S    +     DL  K+VL  Y     TG+    R +W                    
Sbjct: 474 NKSIKGIDGYHMIDLMPKYVLMAYATAKWTGNAELLRNLWDVLDGAVDSVAKAQSVEGLP 533

Query: 615 ----PSVYIAMAYMEQFDKDGDGMIENE------------GFPDQTYDAWSANGVSAYCG 658
               P+    M  ++   K+    I               GF  QT+D W+  GVS+Y  
Sbjct: 534 YHTTPAGIEWMRAVQAIFKESASQISTAVVQLLSQRAVLMGF--QTFDTWAFYGVSSYVL 591

Query: 659 GLWVAALQAASALANDVGDHASASYFWVRYQKAKAVYDSLWNGSYFN--YDNSDGSSSTS 716
            LW AAL+A    A   G   S   +    ++A    + LWNG YF+  +D   G    +
Sbjct: 592 FLWAAALKAMVEGAKLTGR--SPEKYAELLRRAFEGLERLWNGEYFDLWWDPVTGERDRA 649

Query: 717 IQADQLAGQWYARACGLLPIADEAKVKKALTKIYDFNVLKVKGGMCGAMNGMQPDGRID- 775
             A QL GQ  A    L  ++D+ +V  AL  +  +N+   +    G +NGM PD R   
Sbjct: 650 SMAAQLFGQLLAHVADLGYLSDKQRVISALRAVAKYNLAPDE----GLINGMYPDRRRPS 705

Query: 776 ---------------MSGLQAREIWPGVTYGLAASMIQEEMVDMAFQTAAGVYEVAWSED 820
                          +   Q    W GV Y +A  M  E +V+        ++E      
Sbjct: 706 FVGPTLYENFTRGPYLPTWQMDTPWTGVEYAVAGHMFYEGLVEEGTAVLKALHE------ 759

Query: 821 GLGYSFQTPESWNNNDEYRSLCYMRPLTIWAM 852
                ++    + N+ E+ +  YMRPL+ WA+
Sbjct: 760 ----RYERGGHYWNHIEWGT-HYMRPLSAWAV 786


>gi|76156398|gb|AAX27603.2| SJCHGC03819 protein [Schistosoma japonicum]
          Length = 192

 Score = 85.5 bits (210), Expect = 1e-13,   Method: Composition-based stats.
 Identities = 49/117 (41%), Positives = 69/117 (58%), Gaps = 8/117 (6%)

Query: 103 YRGEFQRFKLFHGI-CDD-APVLANQFSVFVSRPNGEKFSSVLCPRSPGVPKKNTDSGIE 160
           +RGEF R  L  GI C D  PV  +QF V V + N   ++ VL P +   P  N   G+ 
Sbjct: 74  FRGEFCRSSLIPGIYCYDIQPV--DQFIVTVRKNNVIIYNQVLSPLTK--PPSN-GKGLR 128

Query: 161 SWDWNLKGENCTYHALFPRAWTVYDGEPDPELRIVCRQISPFIPHNYKESSFPASVF 217
            W W  + EN  Y  L+PR+WTVY+  P+ +L +VC+QISP IPH+Y+ +  P ++F
Sbjct: 129 KWIWGFRPENGYYIGLYPRSWTVYE-IPELQLTLVCQQISPVIPHDYQVTCLPVAIF 184


>gi|160934323|ref|ZP_02081710.1| hypothetical protein CLOLEP_03194 [Clostridium leptum DSM 753]
 gi|156866996|gb|EDO60368.1| hypothetical protein CLOLEP_03194 [Clostridium leptum DSM 753]
          Length = 806

 Score = 82.8 bits (203), Expect = 8e-13,   Method: Compositional matrix adjust.
 Identities = 164/786 (20%), Positives = 281/786 (35%), Gaps = 161/786 (20%)

Query: 91  LGGIGAGSIGRSYRGEFQRFKLFHGICDDAPVLANQFSVFVSRPNGEKFSSVLCPRSPGV 150
           LGGIG G++    RG +  +++F             F+++    N +    +L  R    
Sbjct: 22  LGGIGTGNVSVGTRGNYCDWEIFGQPGKGIHFPYTFFALWAKSKNTQPIVRILESRIQ-- 79

Query: 151 PKKNTDSGIESWDWNLKG----ENCTYHALFPRAWTVYDGEPDPELRIVCRQISPFIPHN 206
           P      G  S D  L G    E  ++ A +P      +    P L +     +PFIP N
Sbjct: 80  PPYTCSHGFISAD--LAGLPRFERSSFSAAYPFVNVALEDSKLP-LSVELEAFTPFIPLN 136

Query: 207 YKESSFPASVFTFTLSNSGQTSADVTLLFTWANSVAGDSGLSGHHFNSKTMTKDGVHGLT 266
            K+S  P +V  + + N+   + + T+    A+SVA   G +G+      + K  V    
Sbjct: 137 AKDSGIPGAVIRYKVKNNSGEALEATV----ASSVANAVGFNGYDLFDNLLVKRPVKNQY 192

Query: 267 LHHRTANGRPPVTFAVAAEETADVHVSECPCFLLSGNSKGITAK-------------DMW 313
           L      G         + E A+   +     +++G    +T K             D W
Sbjct: 193 LDQDQMKG-----LLYTSPELAEDDFTNGSIAVMTGEDS-VTVKEEWLSGAWWDGIHDFW 246

Query: 314 NEIKKHGSFD---HLDNDKTSPSEPGSSIGAAIAASLTIPSGSTRSVTFSLAWDCPEVKF 370
           ++  + G  +        ++  ++       ++    TIP+       F L W       
Sbjct: 247 DDFSEDGRLEFHSEFQGKESKFNDLSDLRIGSLGVCKTIPAYGEAVFEFILTW------- 299

Query: 371 FEKVYHRRYTKFYGTLGDSAARIAHDAILEHAKWECEIEAWQRPILEDKRFPEWYPITLF 430
               Y     K++G  G       +   +EH                             
Sbjct: 300 ----YFPNRPKYWG--GHVCPENRNKKEIEHN---------------------------- 325

Query: 431 NELYYLNAGGTIWTDGSPPMQSLATIRERKFSLDTPSKIHTPASSDTALGTRLLENGEEN 490
              YY +     W  G    ++L        SL+  S+  T A  ++ L   ++E    N
Sbjct: 326 ---YYADFQKNAWDYGVYLYKNLE-------SLEKASRSFTKALYESTLPDYVIEALANN 375

Query: 491 I-------------GQFLYLEGA----EYVMYNTYDVHFYSSFALVMLFPKLELSIQRDF 533
           I             G FL  EG+         N   V  Y+  A   LFP+LE S++R  
Sbjct: 376 ITVLRSPTCFRISDGTFLSWEGSFPQHGSCEGNCTHVWNYAQTA-AFLFPELEQSMRRT- 433

Query: 534 AAAVMMHDPG-----TMKIMSDGKWVARKCLGAVPHDIGLDDPWFEINSYNLFNSSRWKD 588
              +   D G     T+++  D KW   + + AV   +G                     
Sbjct: 434 EFLLETDDTGRMAFRTLQVFGDKKW---EMIPAVDGQMGT-------------------- 470

Query: 589 LNSKFVLQVYRDFVATGDKNFARAVWPSVYIAMAY-MEQFDKDGDGMIENEGFPDQTYDA 647
                +L++YRD+         +  W    +A+ +    +D D D +++++     TYD 
Sbjct: 471 -----ILRLYRDWKLGAGDALLKETWEKAALALDFAFTYWDSDKDCVLDDQ--QHNTYDI 523

Query: 648 WSANGVSAYCGGLWVAALQAASALANDVGDHASASYFWVRYQKAKAVYDS-LWNGSYFNY 706
               G ++    L++AAL+A   +A  +GD A A  +   ++K     DS LW G Y+  
Sbjct: 524 -EFYGPNSLSNSLFLAALKAGKEMAAYLGDTAHAEKYAEAFRKGSEKMDSLLWGGDYYIQ 582

Query: 707 DNSDGSS-----STSIQADQLAGQWYARACGLLPIADEAKVKKALTKIYDFNVLK----- 756
              D ++      T   ADQ+ GQ  A    L  +  E  +KKA+  ++ +N  +     
Sbjct: 583 VIDDINAYKYQYGTGCLADQVFGQMLAHLNHLGYVLPEEHIKKAIHSVFQYNFKETMEDH 642

Query: 757 VKGGMCGAMNGMQ--------PDGRIDMSGLQAREIWPGVTYGLAASMIQEEMVDMAFQT 808
           V      A+N  +          GR  +  + + E+W G+ Y +AA +I E  V+     
Sbjct: 643 VNVQRTYALNQEKGLVLCSWPQGGRPKLPFVYSDEVWSGIEYQVAAHLIFEGFVEEGLAV 702

Query: 809 AAGVYE 814
              V E
Sbjct: 703 VKAVRE 708


>gi|238620760|ref|YP_002915586.1| hypothetical protein M164_2319 [Sulfolobus islandicus M.16.4]
 gi|238381830|gb|ACR42918.1| protein of unknown function DUF608 [Sulfolobus islandicus M.16.4]
          Length = 818

 Score = 82.4 bits (202), Expect = 1e-12,   Method: Compositional matrix adjust.
 Identities = 102/441 (23%), Positives = 174/441 (39%), Gaps = 100/441 (22%)

Query: 491 IGQFLYLEGAEYVMYNTYDVHFYSSFALVMLFPKLELSIQRDFAAAVM------------ 538
           +G++ Y +G E    NT DV  Y+   +++LFP L  +I +D +   +            
Sbjct: 347 VGKYPYTDGPENTALNTIDVLLYALPGVMLLFPDLAKNIVKDLSNRALKEDTPEYVIFSL 406

Query: 539 -------------MHDP----------GTMKIMSDGKWVARKCLGAVPHDIGLDDPWFEI 575
                        M DP          GT+K +++     +   G +PH I        +
Sbjct: 407 AFPENLMKYKEEIMKDPTISTDLKKLYGTIKRIANE--TGKDPKGRMPHYIRYS---LTV 461

Query: 576 NSYNLFNSSRWKDLNSKFVLQVYRDFVATGDKNFARAVWPSV-----------------Y 618
           ++Y         D+N +FVL  Y     TGD+   ++V+                    Y
Sbjct: 462 DTYERI------DINPEFVLLYYLIAKYTGDRELLKSVYEVARNAIESIMRTQTVDGLPY 515

Query: 619 IAMAYMEQFDKDGDGMIENE-----------GFPDQTYDAWSANGVSAYCGGLWVAALQA 667
           + +    ++ ++ + M+                  QT D WS  G S Y   LW++AL+A
Sbjct: 516 LTLPSGIEWIRNVNSMLRANDAHKILGYHSLALSMQTLDDWSWLGFSPYVSFLWISALEA 575

Query: 668 ---ASALANDVGDHASASYFWVRYQKAKAVYDSLWNGSYFN--YDNSDGSSSTSIQADQL 722
              AS L N+  ++          +  + V   LWNG Y+   YD        S  A Q+
Sbjct: 576 LNEASKLLNNPQNYEVK-------ELIEKVNKYLWNGEYYMDWYDPISNLRDDSSNASQI 628

Query: 723 AGQWYARACGLLPIADEAKVKKALTKIYDFNVLKVKGGMCGAMN-GMQPDGRIDMSGLQA 781
            G WY +   L    D  + K   + I  +N    +G   G+ N  + P G + +S +Q+
Sbjct: 629 TGDWYVQMLDLPEFLDYERRKSVFSSIMKYNYSAEEGVRNGSSNDDITPLG-VKLS-IQS 686

Query: 782 REIWPGVTYGLAASMIQEEMVDMAFQTAAGVYEVAWSEDGLGYSFQTPESWNNNDEYRSL 841
           +  W GV Y LA+ M      + A +    VYE           ++   ++ N+ E+ + 
Sbjct: 687 KAPWSGVEYYLASHMFSSGFDEYAKKILRNVYE----------RYEIAGNFWNHIEWGA- 735

Query: 842 CYMRPLTIWAMQWALTKPKLS 862
            YMRPL   +M +A+   K++
Sbjct: 736 RYMRPLVALSMIYAIEGMKVN 756


>gi|385774246|ref|YP_005646813.1| bile acid beta-glucosidase [Sulfolobus islandicus HVE10/4]
 gi|323478361|gb|ADX83599.1| bile acid beta-glucosidase [Sulfolobus islandicus HVE10/4]
          Length = 818

 Score = 82.4 bits (202), Expect = 1e-12,   Method: Compositional matrix adjust.
 Identities = 105/441 (23%), Positives = 172/441 (39%), Gaps = 100/441 (22%)

Query: 491 IGQFLYLEGAEYVMYNTYDVHFYSSFALVMLFPKLELSIQRDFAAAVM------------ 538
           +G++ Y +G E    NT DV  Y+   +++LFP L  +I +D +   +            
Sbjct: 347 VGKYPYTDGPENTALNTIDVLLYALPGVLLLFPDLAKNIVKDLSNRALKEDTPEYVIFSL 406

Query: 539 -------------MHDP----------GTMKIMSDGKWVARKCLGAVPHDIGLDDPWFEI 575
                        M DP          GT+K +++     +   G +PH I        +
Sbjct: 407 AFPENLMKYKEEIMKDPTISTDLKKLYGTIKRIANE--TGKDPKGRMPHYIRYS---LTV 461

Query: 576 NSYNLFNSSRWKDLNSKFVLQVYRDFVATGDKNFARAVW--------------------- 614
           ++Y         D+N +FVL  Y     TGD+   ++V+                     
Sbjct: 462 DTYERI------DINPEFVLLYYLIAKYTGDRELLKSVYEVARNAIESIMRTQTMDGLPY 515

Query: 615 ---PSVYIAMAYMEQFDKDGDGM----IENEGFPDQTYDAWSANGVSAYCGGLWVAALQA 667
              PS    + Y+    +  D        +     QT D WS  G S Y   LW++AL+A
Sbjct: 516 LTLPSGIEWIRYVNSMLRANDAHKILGYHSLALSMQTLDDWSWLGFSPYVSFLWISALEA 575

Query: 668 ---ASALANDVGDHASASYFWVRYQKAKAVYDSLWNGSYFN--YDNSDGSSSTSIQADQL 722
              AS L N+  ++          +  + V   LWNG Y+   YD        S  A Q+
Sbjct: 576 LNEASKLLNNPQNYEVK-------ELIEKVNKYLWNGEYYMDWYDPISNLRDDSSNASQI 628

Query: 723 AGQWYARACGLLPIADEAKVKKALTKIYDFNVLKVKGGMCGAMN-GMQPDGRIDMSGLQA 781
            G WY +   L    D  + K   + I  +N    +G   G+ N  + P G + +S +Q+
Sbjct: 629 TGDWYVQMLDLPEFLDYERRKSVFSSIMKYNYSGEEGVRNGSSNDDITPLG-VKLS-IQS 686

Query: 782 REIWPGVTYGLAASMIQEEMVDMAFQTAAGVYEVAWSEDGLGYSFQTPESWNNNDEYRSL 841
           +  W GV Y LA+ M      + A +    VYE           ++   S+ N+ E+ + 
Sbjct: 687 KAPWSGVEYYLASHMFYSGFDEYAKKILRNVYE----------RYEIAGSFWNHIEWGA- 735

Query: 842 CYMRPLTIWAMQWALTKPKLS 862
            YMRPL   +M +A+   K++
Sbjct: 736 RYMRPLVALSMIYAIEGMKVN 756


>gi|385776902|ref|YP_005649470.1| bile acid beta-glucosidase [Sulfolobus islandicus REY15A]
 gi|323475650|gb|ADX86256.1| bile acid beta-glucosidase [Sulfolobus islandicus REY15A]
          Length = 818

 Score = 82.0 bits (201), Expect = 1e-12,   Method: Compositional matrix adjust.
 Identities = 105/441 (23%), Positives = 172/441 (39%), Gaps = 100/441 (22%)

Query: 491 IGQFLYLEGAEYVMYNTYDVHFYSSFALVMLFPKLELSIQRDFAAAVM------------ 538
           +G++ Y +G E    NT DV  Y+   +++LFP L  +I +D +   +            
Sbjct: 347 VGKYPYTDGPENTALNTIDVLLYALPGVMLLFPDLAKNIVKDLSNRALKEDTPEYVIFSL 406

Query: 539 -------------MHDP----------GTMKIMSDGKWVARKCLGAVPHDIGLDDPWFEI 575
                        M DP          GT+K +++     +   G +PH I        +
Sbjct: 407 AFPENLMKYKEEIMKDPTISTDLKKLYGTIKRIANE--TGKDPKGRMPHYIRYS---LTV 461

Query: 576 NSYNLFNSSRWKDLNSKFVLQVYRDFVATGDKNFARAVW--------------------- 614
           ++Y         D+N +FVL  Y     TGD+   ++V+                     
Sbjct: 462 DTYERI------DINPEFVLLYYLIAKYTGDRELLKSVYEVARNAIESIMRTQTMDGLPY 515

Query: 615 ---PSVYIAMAYMEQFDKDGDGM----IENEGFPDQTYDAWSANGVSAYCGGLWVAALQA 667
              PS    + Y+    +  D        +     QT D WS  G S Y   LW++AL+A
Sbjct: 516 LTLPSGIEWIRYVNSMLRANDAHKILGYHSLALSMQTLDDWSWLGFSPYVSFLWISALEA 575

Query: 668 ---ASALANDVGDHASASYFWVRYQKAKAVYDSLWNGSYFN--YDNSDGSSSTSIQADQL 722
              AS L N+  ++          +  + V   LWNG Y+   YD        S  A Q+
Sbjct: 576 LNEASKLLNNPQNYEVK-------ELIEKVNKYLWNGEYYMDWYDPISNLRDDSSNASQI 628

Query: 723 AGQWYARACGLLPIADEAKVKKALTKIYDFNVLKVKGGMCGAMN-GMQPDGRIDMSGLQA 781
            G WY +   L    D  + K   + I  +N    +G   G+ N  + P G + +S +Q+
Sbjct: 629 TGDWYVQMLDLPEFLDYERRKSVFSSIMKYNYSGEEGVRNGSSNDDITPLG-VKLS-IQS 686

Query: 782 REIWPGVTYGLAASMIQEEMVDMAFQTAAGVYEVAWSEDGLGYSFQTPESWNNNDEYRSL 841
           +  W GV Y LA+ M      + A +    VYE           ++   S+ N+ E+ + 
Sbjct: 687 KAPWSGVEYYLASHMFYSGFDEYAKKILRNVYE----------RYEIAGSFWNHIEWGA- 735

Query: 842 CYMRPLTIWAMQWALTKPKLS 862
            YMRPL   +M +A+   K++
Sbjct: 736 RYMRPLVALSMIYAIEGMKVN 756


>gi|229585796|ref|YP_002844298.1| hypothetical protein M1627_2396 [Sulfolobus islandicus M.16.27]
 gi|228020846|gb|ACP56253.1| protein of unknown function DUF608 [Sulfolobus islandicus M.16.27]
          Length = 818

 Score = 81.6 bits (200), Expect = 2e-12,   Method: Compositional matrix adjust.
 Identities = 105/441 (23%), Positives = 172/441 (39%), Gaps = 100/441 (22%)

Query: 491 IGQFLYLEGAEYVMYNTYDVHFYSSFALVMLFPKLELSIQRDFAAAVM------------ 538
           +G++ Y +G E    NT DV  Y+   +++LFP L  +I +D +   +            
Sbjct: 347 VGKYPYTDGPENTALNTIDVLLYALPGVMLLFPDLAKNIVKDLSNRALKEDTPEYVIFSL 406

Query: 539 -------------MHDP----------GTMKIMSDGKWVARKCLGAVPHDIGLDDPWFEI 575
                        M DP          GT+K +++     +   G +PH I        +
Sbjct: 407 AFPENLMKYKEEIMKDPTISTDLKKLYGTIKRIANE--TGKDPKGRMPHYIRYS---LTV 461

Query: 576 NSYNLFNSSRWKDLNSKFVLQVYRDFVATGDKNFARAVW--------------------- 614
           ++Y         D+N +FVL  Y     TGD+   ++V+                     
Sbjct: 462 DTYERI------DINPEFVLLYYLIAKYTGDRELLKSVYEVARNAIESIMRTQTMDGLPY 515

Query: 615 ---PSVYIAMAYMEQFDKDGDGM----IENEGFPDQTYDAWSANGVSAYCGGLWVAALQA 667
              PS    + Y+    +  D        +     QT D WS  G S Y   LW++AL+A
Sbjct: 516 LTLPSGIEWIRYVNSMLRANDAHKILGYHSLALSMQTLDDWSWLGFSPYVSFLWISALEA 575

Query: 668 ---ASALANDVGDHASASYFWVRYQKAKAVYDSLWNGSYFN--YDNSDGSSSTSIQADQL 722
              AS L N+  ++          +  + V   LWNG Y+   YD        S  A Q+
Sbjct: 576 LNEASKLLNNPQNYEVK-------ELIEKVNKYLWNGEYYMDWYDPISNLRDDSSNASQI 628

Query: 723 AGQWYARACGLLPIADEAKVKKALTKIYDFNVLKVKGGMCGAMN-GMQPDGRIDMSGLQA 781
            G WY +   L    D  + K   + I  +N    +G   G+ N  + P G I +S +Q+
Sbjct: 629 TGDWYVQMLDLPEFLDYERRKSVFSSIMKYNYSGEEGVRNGSSNDDITPLG-IKLS-IQS 686

Query: 782 REIWPGVTYGLAASMIQEEMVDMAFQTAAGVYEVAWSEDGLGYSFQTPESWNNNDEYRSL 841
           +  W GV Y LA+ M      + A +    VYE           ++   ++ N+ E+ + 
Sbjct: 687 KAPWSGVEYYLASHMFSSGFDEYAKKILRNVYE----------RYEIAGNFWNHIEWGA- 735

Query: 842 CYMRPLTIWAMQWALTKPKLS 862
            YMRPL   +M +A+   K++
Sbjct: 736 RYMRPLVALSMIYAIEGMKVN 756


>gi|297741933|emb|CBI33368.3| unnamed protein product [Vitis vinifera]
          Length = 108

 Score = 81.6 bits (200), Expect = 2e-12,   Method: Compositional matrix adjust.
 Identities = 34/60 (56%), Positives = 42/60 (70%)

Query: 801 MVDMAFQTAAGVYEVAWSEDGLGYSFQTPESWNNNDEYRSLCYMRPLTIWAMQWALTKPK 860
           M + AF TA G++   WSE+G GY FQTPE W  +  +RSL YMRPL IW MQWAL+ P+
Sbjct: 1   MEEQAFTTAEGIFTAGWSEEGYGYWFQTPEGWTIDGHFRSLIYMRPLAIWGMQWALSMPR 60


>gi|227828566|ref|YP_002830346.1| hypothetical protein M1425_2317 [Sulfolobus islandicus M.14.25]
 gi|227460362|gb|ACP39048.1| protein of unknown function DUF608 [Sulfolobus islandicus M.14.25]
          Length = 818

 Score = 81.3 bits (199), Expect = 2e-12,   Method: Compositional matrix adjust.
 Identities = 105/441 (23%), Positives = 172/441 (39%), Gaps = 100/441 (22%)

Query: 491 IGQFLYLEGAEYVMYNTYDVHFYSSFALVMLFPKLELSIQRDFAAAVM------------ 538
           +G++ Y +G E    NT DV  Y+   +++LFP L  +I +D +   +            
Sbjct: 347 VGKYPYTDGPENTALNTIDVLLYALPGVMLLFPDLAKNIVKDLSNRALKEDTPEYVIFSL 406

Query: 539 -------------MHDP----------GTMKIMSDGKWVARKCLGAVPHDIGLDDPWFEI 575
                        M DP          GT+K +++     +   G +PH I        +
Sbjct: 407 AFPENLMKYKEEIMKDPTISTDLKKLYGTIKRIANE--TGKDPKGRMPHYIRYS---LTV 461

Query: 576 NSYNLFNSSRWKDLNSKFVLQVYRDFVATGDKNFARAVW--------------------- 614
           ++Y         D+N +FVL  Y     TGD+   ++V+                     
Sbjct: 462 DTYERI------DINPEFVLLYYLIAKYTGDRELLKSVYEVARNAIESIMRTQTMDGLPY 515

Query: 615 ---PSVYIAMAYMEQFDKDGDGM----IENEGFPDQTYDAWSANGVSAYCGGLWVAALQA 667
              PS    + Y+    +  D        +     QT D WS  G S Y   LW++AL+A
Sbjct: 516 LTLPSGIEWIRYVNSMLRANDAHKILGYHSLALSMQTLDDWSWLGFSPYVSFLWISALEA 575

Query: 668 ---ASALANDVGDHASASYFWVRYQKAKAVYDSLWNGSYFN--YDNSDGSSSTSIQADQL 722
              AS L N+  ++          +  + V   LWNG Y+   YD        S  A Q+
Sbjct: 576 LNEASKLLNNPQNYEVK-------ELIEKVDKYLWNGEYYMDWYDPISNLRDDSSNASQI 628

Query: 723 AGQWYARACGLLPIADEAKVKKALTKIYDFNVLKVKGGMCGAMN-GMQPDGRIDMSGLQA 781
            G WY +   L    D  + K   + I  +N    +G   G+ N  + P G I +S +Q+
Sbjct: 629 TGDWYVQMLDLPEFLDYERRKSVFSSIMKYNYSGEEGVRNGSSNDDITPLG-IKLS-IQS 686

Query: 782 REIWPGVTYGLAASMIQEEMVDMAFQTAAGVYEVAWSEDGLGYSFQTPESWNNNDEYRSL 841
           +  W GV Y LA+ M      + A +    VYE           ++   ++ N+ E+ + 
Sbjct: 687 KAPWSGVEYYLASHMFSSGFDEYAKKILRNVYE----------RYEIAGNFWNHIEWGA- 735

Query: 842 CYMRPLTIWAMQWALTKPKLS 862
            YMRPL   +M +A+   K++
Sbjct: 736 RYMRPLVALSMIYAIEGMKVN 756


>gi|10440618|gb|AAG16856.1|AC069145_5 hypothetical protein [Oryza sativa Japonica Group]
          Length = 143

 Score = 80.9 bits (198), Expect = 3e-12,   Method: Compositional matrix adjust.
 Identities = 42/126 (33%), Positives = 69/126 (54%), Gaps = 21/126 (16%)

Query: 9   GENAASMPKVDPGQPPQLTWQRKLDSNVKVPTGFTLSFQEFRHLAHIGFRLYRYSKQEEA 68
           G  A  +   D G PP+  W+R+L+S+  +   F+++F E   +                
Sbjct: 21  GRTALQLLDFDGGSPPEQAWRRRLNSHANLLKEFSVTFMEAMRM---------------- 64

Query: 69  KGKVPVFNVFRKHHI--TSDQGIPLGGIGAGSIGRSYRGEFQRFKLFHGICDDAPVLANQ 126
             K P+ + F K     ++ QG+PLGG+G+GSI R +RGEF+ + +  G+C+ +PV+ NQ
Sbjct: 65  --KAPI-DPFTKEKCKPSASQGVPLGGMGSGSISRGFRGEFKNWHIIPGLCETSPVMENQ 121

Query: 127 FSVFVS 132
           FSV +S
Sbjct: 122 FSVMLS 127


>gi|374815490|ref|ZP_09719227.1| hypothetical protein TpriZ_16630 [Treponema primitia ZAS-1]
          Length = 814

 Score = 80.1 bits (196), Expect = 6e-12,   Method: Compositional matrix adjust.
 Identities = 79/325 (24%), Positives = 133/325 (40%), Gaps = 55/325 (16%)

Query: 513 YSSFALVMLFPKLELSIQR-DFAAAVMMHDPGTMKI--MSDGKWVARKCLGAVPHDIGLD 569
           Y + A+  LFP+LE S++R DF      +D G M+   M +    +   L AV   +G  
Sbjct: 421 YYAQAMAYLFPELEQSMRRTDFLTET--NDEGEMQFRAMRELNGASWGFLPAVDGQMGT- 477

Query: 570 DPWFEINSYNLFNSSRWKDLNSKFVLQVYRDFVATGDKNFARAVWPSVYIAMAY-MEQFD 628
                                   ++++YRD++ +GD  F + +WP   +A+ Y +  +D
Sbjct: 478 ------------------------IVRLYRDWLISGDDAFLKELWPKAKLALDYGIRTWD 513

Query: 629 KDGDGMIENEGFPDQTYDAWSANGVSAYCGGLWVAALQAASALANDVGDHASASYFWVRY 688
            DGD +++     D   + +  + +   C   ++AAL+AA  +A  + D  + + +   Y
Sbjct: 514 TDGDFVLDGMKHVDYDVEFYGIDPLGNLC---YLAALKAAVKIAEYLKDEGAHNRYQTIY 570

Query: 689 QKAKAVYDS-LWN--GSYF-----NYDNSDGSSSTSIQADQLAGQWYARACGLLPIADEA 740
             A    D  +W+    YF     + D       T   ADQL GQ+YA   GL P+ D  
Sbjct: 571 DTASVQADKFMWDSKAGYFIQVLKDVDEFKYQHGTGCLADQLIGQFYAHLTGLGPLVDAG 630

Query: 741 KVKKALTKIYDFNVLK-------------VKGGMCGAMNGMQPDGRIDMSGLQAREIWPG 787
            +K+A   I+ +N +              V       M      GR       + E W  
Sbjct: 631 HIKQAAQSIFKYNFISDFTDHPNMQRGYAVNDDKGLLMTTWPLGGRPRYPFFYSEEAWSR 690

Query: 788 VTYGLAASMIQEEMVDMAFQTAAGV 812
             Y +A+++I E  +D     A  V
Sbjct: 691 TEYHVASTLIYEGFLDEGLTIAKAV 715



 Score = 56.6 bits (135), Expect = 6e-05,   Method: Compositional matrix adjust.
 Identities = 72/298 (24%), Positives = 123/298 (41%), Gaps = 40/298 (13%)

Query: 91  LGGIGAGSIGRSYRGEFQRFKLFHGICDDAPVLANQFSVFVSRPNGEKFSSVLCPRSPGV 150
           LGGIG GSI    RG+   F+LF+     +  L   ++ F     GE  +      S   
Sbjct: 30  LGGIGTGSISLGSRGQLTDFELFN---HPSKGLRFPYTFFAVHAQGEGVNVTKVLESRLT 86

Query: 151 PKKNTDSGIESWDWN-----LKGENCTYHALFPRAWTVYDGEPDPELRIVCRQISPFIPH 205
           P     +G+ + + +     L+ E       FP A   +  +  P ++      +PFIP 
Sbjct: 87  PPYGAANGLPAQELSGLPRFLESE---MQVQFPFASIQFMDDRLP-VKAELEAFNPFIPL 142

Query: 206 NYKESSFPASVFTFTLSNSGQTSADVTLLFTWANSVAGDSGL--------SGHHFNSKTM 257
           N  +S  PA++  + + N+      +++  +  N+  G +G+        +G  FN    
Sbjct: 143 NANDSGIPAAILRYRIKNTSAAPLQISVAGSLYNA-GGFTGIEPFGNFINTGRPFNVYQK 201

Query: 258 TKDGVHGLTLHHRTANGRPP--VTFAVAAEETADVHVSECPCFLLSGNSKGITAKDMWNE 315
            ++ + G+ L +   +G PP  + +   A  T+D  VS    + +     GI   + W++
Sbjct: 202 EQE-IRGIFLDN---DGLPPASIEYGNLALMTSDESVSYKAEWQMGQRIDGI--HNFWDD 255

Query: 316 IKKHGSFDHLDNDKTSPSEPGS-------SIGAAIAASLTIPSGSTRSVTFSLAWDCP 366
             + G    LDND       G         IG ++A S T+  G     TF+LAW  P
Sbjct: 256 FSEDGK---LDNDAPKMGVDGKLDKKILIKIG-SLAISRTLAPGEEYCYTFALAWYFP 309


>gi|15922864|ref|NP_378533.1| hypothetical protein ST2529 [Sulfolobus tokodaii str. 7]
 gi|15623655|dbj|BAB67642.1| putative glycosidase [Sulfolobus tokodaii str. 7]
          Length = 836

 Score = 78.2 bits (191), Expect = 2e-11,   Method: Compositional matrix adjust.
 Identities = 104/440 (23%), Positives = 173/440 (39%), Gaps = 102/440 (23%)

Query: 488 EENIGQFLYLEGAEYVMYNTYDVHFYSSFALVMLFPKLELSIQRDFAAAVMMHDPGTMKI 547
           E  +G+  Y +G      NT DV  Y  + L +L+P+L  ++    + + + +D      
Sbjct: 333 ERKVGEHPYTDGPVNTALNTIDVSTYFIYTLTVLYPQLAKNLLLTSSRSALSYDNPLYIF 392

Query: 548 MS------DGKWVAR---------------KCLGAVPHDIGLDDPWFEINSYNL----FN 582
            S        K+V R               + +  +  + G  DP   I  Y      F+
Sbjct: 393 YSLAMPENRSKYVERVIKDPSIPSSIEKLLQTVKEIAKETG-KDPKGRIAHYIYRDLKFD 451

Query: 583 SSRWKDLNSKFVLQVYRDFVATGDKNFARAVWPSVYIAMAYM------------------ 624
                DLN +FVL        TGD  F ++++P    AM  +                  
Sbjct: 452 EYGRNDLNPEFVLMWGLVSKYTGDIEFMKSLYPVAKEAMESVLRTHSYEGLIYSRLPSGF 511

Query: 625 ----EQFDKDGDGMIENEGF-------------PDQTYDAWSANGVSAYCGGLWVAALQA 667
               + F    D  I +  F                T+D W+  G++++   L ++AL+ 
Sbjct: 512 EWNRQVFSYFKDVNIYDNLFLVVSLLGIDSFHMSVNTFDDWTTIGINSFVSLLGISALKI 571

Query: 668 ASALANDVGDHASASYFWVRYQKAKAVYDS-LWNGSYFN--YDNSDGSSSTSIQADQLAG 724
            + L    GD  +        + A ++Y+S LWNG YF+  YD   G    + QA QL G
Sbjct: 572 LNELG---GDKYNV-------ENALSLYESMLWNGEYFDLWYDPISGYRDKTCQASQLLG 621

Query: 725 QWYARACGLLPIADEAKVKKALTKIYDFNVLKVKGGMCGAMNGMQPDG------------ 772
           ++Y    G   + D  K +K L  I  +N+ + +G     +NG  PDG            
Sbjct: 622 EFYLNLLGY-SLLDREKTRKTLLSIVKYNLKEEEG----VINGAYPDGYRPLMREYENPL 676

Query: 773 RIDMSGLQAREIWPGVTYGLAASMIQEEMVDMAFQTAAGVYEVAWSEDGLGYSFQTPESW 832
           +I  + +     W GV + LA+ +I E+M+D A +    VY+  +S  G   +F     W
Sbjct: 677 KIKEASIHQDTPWSGVEFYLASHLIYEKMIDEAKKVLKEVYD-RYSIAG---NFWNHWEW 732

Query: 833 NNNDEYRSLCYMRPLTIWAM 852
            ++       Y RPL+ W +
Sbjct: 733 GSH-------YSRPLSSWLI 745


>gi|227831316|ref|YP_002833096.1| hypothetical protein LS215_2489 [Sulfolobus islandicus L.S.2.15]
 gi|229580221|ref|YP_002838621.1| hypothetical protein YG5714_2457 [Sulfolobus islandicus Y.G.57.14]
 gi|227457764|gb|ACP36451.1| protein of unknown function DUF608 [Sulfolobus islandicus L.S.2.15]
 gi|228010937|gb|ACP46699.1| protein of unknown function DUF608 [Sulfolobus islandicus
           Y.G.57.14]
          Length = 818

 Score = 77.8 bits (190), Expect = 3e-11,   Method: Compositional matrix adjust.
 Identities = 103/430 (23%), Positives = 169/430 (39%), Gaps = 78/430 (18%)

Query: 491 IGQFLYLEGAEYVMYNTYDVHFYSSFALVMLFPKLELSIQRDFAAAVMMHDP-------- 542
           +G++ Y +G E    NT DV  Y+   +++LFP L  +I +D +   +  D         
Sbjct: 347 VGKYPYTDGPENTALNTIDVLLYALPGVMLLFPDLAKNIVKDLSNRALKEDTPEYVIFSL 406

Query: 543 ----GTMK----IMSDGKWVA--RKCLGAVPHDIGL--DDPWFEINSYNLF----NSSRW 586
                 MK    IM D       +K  G +   +     DP   +  Y  +    ++   
Sbjct: 407 AFPENLMKYKEEIMKDPTISTDLKKLYGTIKRIVNETGKDPKGRMPHYIRYSLTVDTYER 466

Query: 587 KDLNSKFVLQVYRDFVATGDKNFARAVW------------------------PSVYIAMA 622
            D+N +FVL  Y     TGD+   ++V+                        PS    + 
Sbjct: 467 IDINPEFVLLYYLIAKYTGDRELLKSVYEVARNAIESIMRTQTMDGLPYLTLPSGIEWIR 526

Query: 623 YMEQFDKDGDGM----IENEGFPDQTYDAWSANGVSAYCGGLWVAALQA---ASALANDV 675
           Y+    +  D        +     QT D WS  G S Y   LW++AL+A   AS L ++ 
Sbjct: 527 YVNSMLRANDAHKILGYHSLALSMQTLDDWSWLGFSPYVSFLWISALEALNEASKLLSNP 586

Query: 676 GDHASASYFWVRYQKAKAVYDSLWNGSYFN--YDNSDGSSSTSIQADQLAGQWYARACGL 733
            ++          +  + V   LWNG Y+   YD        S  A Q+ G WY +   L
Sbjct: 587 QNYEVK-------ELIEKVNKYLWNGEYYMDWYDPISNLRDDSSNASQITGDWYVQMLDL 639

Query: 734 LPIADEAKVKKALTKIYDFNVLKVKGGMCGAMN-GMQPDGRIDMSGLQAREIWPGVTYGL 792
               D  + K   + I  +N    +G   G+ N  + P G + +S +Q++  W GV Y L
Sbjct: 640 PEFLDYERRKSVFSSIMKYNYSGEEGVRNGSSNDDITPLG-VKLS-IQSKAPWSGVEYYL 697

Query: 793 AASMIQEEMVDMAFQTAAGVYEVAWSEDGLGYSFQTPESWNNNDEYRSLCYMRPLTIWAM 852
           A  M      + A +    VYE           ++   ++ N+ E+ +  YMRPL   +M
Sbjct: 698 ALHMFYSGFDEYAKKILRNVYE----------RYEIAGNFWNHIEWGA-RYMRPLVALSM 746

Query: 853 QWALTKPKLS 862
            +A+   K++
Sbjct: 747 IYAIEGMKVN 756


>gi|269925332|ref|YP_003321955.1| hypothetical protein Tter_0211 [Thermobaculum terrenum ATCC
           BAA-798]
 gi|269788992|gb|ACZ41133.1| protein of unknown function DUF608 [Thermobaculum terrenum ATCC
           BAA-798]
          Length = 811

 Score = 77.4 bits (189), Expect = 3e-11,   Method: Compositional matrix adjust.
 Identities = 66/232 (28%), Positives = 106/232 (45%), Gaps = 22/232 (9%)

Query: 594 VLQVYRDFVATGDKNFARAVWPSVYIAMAYMEQF-DKDGDGMIENEGFPDQTYDAWSANG 652
           V++VYR++  +GD  F + +WP V  A+ Y   + DKD D ++E  G    TYD     G
Sbjct: 471 VIRVYREWKFSGDDAFLKELWPKVKSAVNYSSLYWDKDQDFILE--GRQHNTYDI-EFYG 527

Query: 653 VSAYCGGLWVAALQAASALANDVGDHASASYFWVRYQKAKAVYDSLWNGSYFNYDNSDGS 712
            +   G L++ AL+AA  +A  +G  +++SY       +K +   LWNG Y+     D +
Sbjct: 528 PNPLTGILFLGALRAAEEMAKYLGSESASSYAQAFEASSKKLDQLLWNGEYYIQKLDDPN 587

Query: 713 S-----STSIQADQLAGQWYARACGLLPIADEAKVKKALTKIYDFNV------------L 755
                 ++   +DQL GQ  A   GL  +  +  + +AL  I+ +N             +
Sbjct: 588 EHRYQHASGCLSDQLFGQTLASLTGLGYLLPKEHISRALESIFAYNFKPNFWNHTNTQRV 647

Query: 756 KVKGGMCGAMNGMQPDG-RIDMSGLQAREIWPGVTYGLAASMIQEEMVDMAF 806
              G   G +    P G R       + E+W G  Y +A  MI E ++D A 
Sbjct: 648 YALGDDAGLVMCTWPFGDRPSFPFPYSDEVWSGTEYQVATLMIYEGLLDEAL 699



 Score = 65.9 bits (159), Expect = 1e-07,   Method: Compositional matrix adjust.
 Identities = 74/319 (23%), Positives = 127/319 (39%), Gaps = 36/319 (11%)

Query: 88  GIPLGGIGAGSIGRSYRGEFQRFKLFHGICDDAPVLANQFSVFVSRPNGEKFSSVLCPRS 147
             PLGGIG G++    RGE + +++F+        L N +    +RP GE   + +    
Sbjct: 25  AFPLGGIGTGNVSLGARGELRDWEIFNRPAKGFS-LPNTYFALWARPEGESPVTKVLQSL 83

Query: 148 PGVPKKNTDSGIESWDWNLKGENCT-YHALFPRAWTVYDGEPDPELRIVCRQISPFIPHN 206
           P  P   +     +    L     T +   +P AW VY  + D  +++     +PF+P N
Sbjct: 84  PPPPHTQSHGYHPTSGLGLPHFTSTLFRGEYPIAW-VYFEDSDLPIKVTLESFTPFVPLN 142

Query: 207 YKESSFPASVFTFTLSNSGQTSADVTLLFTWANSV--------AGDSGLSGHHFNSKTMT 258
            ++S  P +   + + N      DV +  +  N V         G S   G   N+    
Sbjct: 143 AEDSGIPGAYLIYRVENISPKRIDVAIAASITNPVGEVRRDRFGGFSVERGG--NANDFR 200

Query: 259 KDG-VHGLTLHHRTANGRP--PVTFAVAAEETADVHVSECPCFLLSGNSKGITAKDMWNE 315
           + G + GL L    +N  P   + +   +  T + HV+    +L S  +   + ++ W +
Sbjct: 201 RQGRILGLFLR---SNSYPLYDLRYGDLSLVTMNSHVTFKRVWLPSRWACD-SVEEFWRD 256

Query: 316 IKKHGSFDHLDNDKTSPSEPGSSIGAAIAASLTIPSGSTRSVTFSLAW------------ 363
               G  + L  +   PSE G +   +IAA  TI  G  +   F L+W            
Sbjct: 257 FSTDGKLEDLGIE--DPSEEGQTYTGSIAAMETIAPGEVKDFVFILSWFFPNRIKSWDSD 314

Query: 364 --DCPEVKFFEKVYHRRYT 380
             D  +V+     Y RR++
Sbjct: 315 AEDLEDVEIVRNHYSRRFS 333


>gi|229581136|ref|YP_002839535.1| hypothetical protein YN1551_0454 [Sulfolobus islandicus Y.N.15.51]
 gi|228011852|gb|ACP47613.1| protein of unknown function DUF608 [Sulfolobus islandicus
           Y.N.15.51]
          Length = 816

 Score = 77.4 bits (189), Expect = 3e-11,   Method: Compositional matrix adjust.
 Identities = 103/430 (23%), Positives = 169/430 (39%), Gaps = 78/430 (18%)

Query: 491 IGQFLYLEGAEYVMYNTYDVHFYSSFALVMLFPKLELSIQRDFAAAVMMHDP-------- 542
           +G++ Y +G E    NT DV  Y+   +++LFP L  +I +D +   +  D         
Sbjct: 345 VGKYPYTDGPENTALNTIDVLLYALPGVMLLFPDLAKNIVKDLSNRALKEDTPEYVIFSL 404

Query: 543 ----GTMK----IMSDGKWVA--RKCLGAVPHDIGL--DDPWFEINSYNLF----NSSRW 586
                 MK    IM D       +K  G +   +     DP   +  Y  +    ++   
Sbjct: 405 AFPENLMKYKEEIMKDPTISTDLKKLYGTIKRIVNETGKDPKGRMPHYIRYSLTVDTYER 464

Query: 587 KDLNSKFVLQVYRDFVATGDKNFARAVW------------------------PSVYIAMA 622
            D+N +FVL  Y     TGD+   ++V+                        PS    + 
Sbjct: 465 IDINPEFVLLYYLIAKYTGDRELLKSVYEVARNAIESIMRTQTMDGLPYLTLPSGIEWIR 524

Query: 623 YMEQFDKDGDGM----IENEGFPDQTYDAWSANGVSAYCGGLWVAALQA---ASALANDV 675
           Y+    +  D        +     QT D WS  G S Y   LW++AL+A   AS L ++ 
Sbjct: 525 YVNSMLRANDAHKILGYHSLALSMQTLDDWSWLGFSPYVSFLWISALEALNEASKLLSNP 584

Query: 676 GDHASASYFWVRYQKAKAVYDSLWNGSYFN--YDNSDGSSSTSIQADQLAGQWYARACGL 733
            ++          +  + V   LWNG Y+   YD        S  A Q+ G WY +   L
Sbjct: 585 QNYEVK-------ELIEKVNKYLWNGEYYMDWYDPISNLRDDSSNASQITGDWYVQMLDL 637

Query: 734 LPIADEAKVKKALTKIYDFNVLKVKGGMCGAMN-GMQPDGRIDMSGLQAREIWPGVTYGL 792
               D  + K   + I  +N    +G   G+ N  + P G + +S +Q++  W GV Y L
Sbjct: 638 PEFLDYERRKSVFSSIMKYNYSGEEGVRNGSSNDDITPLG-VKLS-IQSKAPWSGVEYYL 695

Query: 793 AASMIQEEMVDMAFQTAAGVYEVAWSEDGLGYSFQTPESWNNNDEYRSLCYMRPLTIWAM 852
           A  M      + A +    VYE           ++   ++ N+ E+ +  YMRPL   +M
Sbjct: 696 ALHMFYSGFDEYAKKILRNVYE----------RYEIAGNFWNHIEWGA-RYMRPLVALSM 744

Query: 853 QWALTKPKLS 862
            +A+   K++
Sbjct: 745 IYAIEGMKVN 754


>gi|284998829|ref|YP_003420597.1| hypothetical protein [Sulfolobus islandicus L.D.8.5]
 gi|284446725|gb|ADB88227.1| hypothetical protein LD85_2606 [Sulfolobus islandicus L.D.8.5]
          Length = 803

 Score = 77.4 bits (189), Expect = 4e-11,   Method: Compositional matrix adjust.
 Identities = 103/430 (23%), Positives = 169/430 (39%), Gaps = 78/430 (18%)

Query: 491 IGQFLYLEGAEYVMYNTYDVHFYSSFALVMLFPKLELSIQRDFAAAVMMHDP-------- 542
           +G++ Y +G E    NT DV  Y+   +++LFP L  +I +D +   +  D         
Sbjct: 332 VGKYPYTDGPENTALNTIDVLLYALPGVMLLFPDLAKNIVKDLSNRALKEDTPEYVIFSL 391

Query: 543 ----GTMK----IMSDGKWVA--RKCLGAVPHDIGL--DDPWFEINSYNLF----NSSRW 586
                 MK    IM D       +K  G +   +     DP   +  Y  +    ++   
Sbjct: 392 AFPENLMKYKEEIMKDPTISTDLKKLYGTIKRIVNETGKDPKGRMPHYIRYSLTVDTYER 451

Query: 587 KDLNSKFVLQVYRDFVATGDKNFARAVW------------------------PSVYIAMA 622
            D+N +FVL  Y     TGD+   ++V+                        PS    + 
Sbjct: 452 IDINPEFVLLYYLIAKYTGDRELLKSVYEVARNAIESIMRTQTMDGLPYLTLPSGIEWIR 511

Query: 623 YMEQFDKDGDGM----IENEGFPDQTYDAWSANGVSAYCGGLWVAALQA---ASALANDV 675
           Y+    +  D        +     QT D WS  G S Y   LW++AL+A   AS L ++ 
Sbjct: 512 YVNSMLRANDAHKILGYHSLALSMQTLDDWSWLGFSPYVSFLWISALEALNEASKLLSNP 571

Query: 676 GDHASASYFWVRYQKAKAVYDSLWNGSYFN--YDNSDGSSSTSIQADQLAGQWYARACGL 733
            ++          +  + V   LWNG Y+   YD        S  A Q+ G WY +   L
Sbjct: 572 QNYEVK-------ELIEKVNKYLWNGEYYMDWYDPISNLRDDSSNASQITGDWYVQMLDL 624

Query: 734 LPIADEAKVKKALTKIYDFNVLKVKGGMCGAMN-GMQPDGRIDMSGLQAREIWPGVTYGL 792
               D  + K   + I  +N    +G   G+ N  + P G + +S +Q++  W GV Y L
Sbjct: 625 PEFLDYERRKSVFSSIMKYNYSGEEGVRNGSSNDDITPLG-VKLS-IQSKAPWSGVEYYL 682

Query: 793 AASMIQEEMVDMAFQTAAGVYEVAWSEDGLGYSFQTPESWNNNDEYRSLCYMRPLTIWAM 852
           A  M      + A +    VYE           ++   ++ N+ E+ +  YMRPL   +M
Sbjct: 683 ALHMFYSGFDEYAKKILRNVYE----------RYEIAGNFWNHIEWGA-RYMRPLVALSM 731

Query: 853 QWALTKPKLS 862
            +A+   K++
Sbjct: 732 IYAIEGMKVN 741


>gi|284045431|ref|YP_003395771.1| hypothetical protein Cwoe_3979 [Conexibacter woesei DSM 14684]
 gi|283949652|gb|ADB52396.1| protein of unknown function DUF608 [Conexibacter woesei DSM 14684]
          Length = 870

 Score = 77.4 bits (189), Expect = 4e-11,   Method: Compositional matrix adjust.
 Identities = 69/248 (27%), Positives = 111/248 (44%), Gaps = 41/248 (16%)

Query: 594 VLQVYRDFVATGDKNFARAVWPSVYIAMAYMEQ-FDKDGDGMIENEGFPDQTYDAWSANG 652
           VL+ YR+    G + +    W ++   M ++++ ++  GDG++  E     TYD  + +G
Sbjct: 509 VLKTYREARQGGGEAWLADRWDALVRLMDHVDRTWNSAGDGVLRGE--QPCTYDI-ALHG 565

Query: 653 VSAYCGGLWVAALQAASALANDVGDHASASYFWVRYQKAKAVYDSLWNGSYFN------- 705
            +   GGLW+AAL++   +A  +G   +A Y  VR+++A+  Y+ L+NG Y+        
Sbjct: 566 PNLLIGGLWLAALRSMEEIALRLGVAGAAGYR-VRFEQARGGYEKLFNGEYYAQPVTGEP 624

Query: 706 YDNSDGSSSTSIQADQLAGQWYARACGLLPIADEAKVKKALTKIYDFNVLKVKGGMCGAM 765
           +D  DG  S     DQL GQW+A    L  + D  +V+ AL  I   N   ++ G    +
Sbjct: 625 HDFGDGCLS-----DQLLGQWWAHQLELGHLLDPERVRSALRAIVAHN---LREGFGARV 676

Query: 766 NGMQP---------------------DGRIDMSGLQAREIWPGVTYGLAASMIQEEMVDM 804
              QP                      GR D+      E+W GV Y +AA  I E + D 
Sbjct: 677 ADEQPPGHRVFADGEDSGLVVCSWPRGGRPDVPLRYCDEVWSGVEYAVAAHCIDEGLEDE 736

Query: 805 AFQTAAGV 812
                  V
Sbjct: 737 GLALVEAV 744


>gi|336426603|ref|ZP_08606612.1| hypothetical protein HMPREF0994_02618 [Lachnospiraceae bacterium
           3_1_57FAA_CT1]
 gi|336010619|gb|EGN40601.1| hypothetical protein HMPREF0994_02618 [Lachnospiraceae bacterium
           3_1_57FAA_CT1]
          Length = 850

 Score = 77.0 bits (188), Expect = 5e-11,   Method: Compositional matrix adjust.
 Identities = 178/787 (22%), Positives = 284/787 (36%), Gaps = 171/787 (21%)

Query: 89  IPLGGIGAGSIGRSYRGEFQRFKLFHGICDDAPVLA--NQFSVFVSRPNGEK-------F 139
            P+GG+G+G IG +  G    ++++     + P +   N +S F  +   E         
Sbjct: 16  FPVGGLGSGCIGIAGNGCLIDWQVY-----NRPAMGTRNGYSHFAIKAQDETGLLDARVL 70

Query: 140 SSVLCPRSPGVPKKNTDSGIESWDWNLKGENCTYHALFPRAWTVYDGEPDPELRIVCRQI 199
            S L     G   + T  G+  +          +   +P A   +  E  P  ++  R  
Sbjct: 71  VSDLEKPYMGQVDEKTMLGMPHF------RELVFEGRYPAAKLSFLDERFPG-KVTLRYY 123

Query: 200 SPFIPHNYKESSFPASVFTFTLSNSGQTSADVTLLFTWANSVAGDSGLSGHHFNSKTMTK 259
           +PFIP N K+SS P       + N+       T      N   G +  S H+F  K    
Sbjct: 124 NPFIPLNDKDSSIPGLFADIIVENTSDHPIRYTAAGVITNPNKG-TNTSNHYFEDK---- 178

Query: 260 DGVHGLT-----LHHRTANGRPPVTFAVAAEETADVHVSECPCFLLSGNSKGITAKDMWN 314
            G+ G+      +  R A G    T A  +EE       +C  +   GN +    +  W 
Sbjct: 179 -GIRGIRFATDGMDERDA-GYSDFTLATDSEE------GDCQEYWFRGNWRD-HVETYWR 229

Query: 315 EIKKHGSFDHLDNDKTSPSEPGSSIGAAIAASLTIPSGSTRSVTFSLAWDCPEVKFFEKV 374
           E     S   L N +     P       + A   I  G  +   + L W+ P        
Sbjct: 230 EF---ASEQPLKNRRYETGLPDYKDTGMLCAVQDIGPGEKKEFHYVLTWNVPNC------ 280

Query: 375 YHRRYTKFYGTLGDSAARIAHDAILEHAKWECEIEAWQRPILEDKRFPEWYPITLFNEL- 433
                  ++  L  S + I +D         C+   W+            Y   LF +  
Sbjct: 281 -----NNYWSGLDISTSNIHYDV--------CDRTPWKN-----------YYAVLFADSG 316

Query: 434 ----YYLNAGGTIWTDGSPPMQSLATIRERKFSLDTPSKIHTPASSDTALGTRLLENGE- 488
               Y L     +W +    M ++ +    + +LD  +       S T L    LE+G  
Sbjct: 317 ASASYGLENRARLWEETKSFMDAVYSTTIPEEALDAVTANLAILRSPTCL---RLEDGSF 373

Query: 489 -------ENIGQFLYLEGAEYVMYNTYDVHFYSSFALVMLFPKLELSIQRDFAAAVMMHD 541
                  +++G     EG    ++N        ++AL  LFP L  SI RD        D
Sbjct: 374 YGWEGCFDDVGS---CEGTCTHVWNY-------AYALPFLFPALARSI-RDLEFTYSQQD 422

Query: 542 PGTMKIMSDGKWVARKCLGAVPHDIGLDDPWFEINSYNLFNSSRWKDLNSKFVLQVYRDF 601
            G ++         R  LG+                   F+S     L S  V+  YRD+
Sbjct: 423 DGGVEFR------LRLPLGSTG---------------KRFHSCVDGQLGS--VMLTYRDY 459

Query: 602 VATGDKNFARAVWPSVYIAMAYM------EQFDKDGDGMIENEGFPDQTYDAWSANGVSA 655
             +GD+ +    W SV  A+AY       + +D+   G+++  G    T D     G +A
Sbjct: 460 KISGDREWLEGKWESVKKAIAYAWSEENPDAWDRKKQGVLD--GRQHHTLDM-ELFGPNA 516

Query: 656 YCGGLWVAALQAASALANDVGDHASASYFWVRYQKAKAVYD-SLWNGSYF--NYDNSDGS 712
           +  G +V AL+AA+ +A  +GD  + + +       KA  +  L+NG YF    D  D S
Sbjct: 517 WLQGFYVGALEAAAEMAAILGDGEAEAEYRALAAGGKAYLNKELFNGEYFYQKIDVCDRS 576

Query: 713 SSTSIQA----------------------DQLAGQWYARACGLLPIADEAKVKKALTKIY 750
           +     A                      DQ+  QW+   CGL  I ++ KV+KAL  I+
Sbjct: 577 AVDRFNASKEYWNEEKGQIKYQIAEGCGIDQVLAQWHGNLCGLPEIFEKEKVRKALASIF 636

Query: 751 DFNVLKVKGGM------------CGAMNGMQPDG--RIDMSGLQAREIWPGVTYGLAASM 796
            +N     G +             GA+    P+G  +  +    A+E   G+ Y  AA M
Sbjct: 637 RYNYKPALGDIYNPWRIFAIDEESGAIMCEWPEGTDKPAIPLTYAQECMDGMQYQAAAHM 696

Query: 797 IQEEMVD 803
           IQE M +
Sbjct: 697 IQEGMTE 703


>gi|253761387|ref|XP_002489099.1| hypothetical protein SORBIDRAFT_0073s002040 [Sorghum bicolor]
 gi|241947398|gb|EES20543.1| hypothetical protein SORBIDRAFT_0073s002040 [Sorghum bicolor]
          Length = 175

 Score = 76.3 bits (186), Expect = 7e-11,   Method: Compositional matrix adjust.
 Identities = 55/172 (31%), Positives = 73/172 (42%), Gaps = 63/172 (36%)

Query: 19  DPGQPPQLTWQRKLDSNVKVPTGFTLSFQEFRHLAHIGFRLYRYSKQEEAKGKVPVFNVF 78
           D G PP+  W+RKL S+      F ++F+E   +  +G RL+ Y ++E            
Sbjct: 31  DDGSPPEHAWRRKLSSHANRLKEFNVTFREAIRMMKLGLRLWSYIREE------------ 78

Query: 79  RKHHITSDQGIPLGGIGAGSIGRSYRGEFQRFKLFHGICDDAPVLANQFSVFVSRPNGEK 138
                              S GR                  AP+  + F+   +RP+   
Sbjct: 79  ------------------ASHGRK-----------------API--DPFTREGNRPSA-- 99

Query: 139 FSSVLCPRSPGVP----KKNTDSGIESWDWNLKGENCTYHALFPRAWTVYDG 186
                   S GVP     K+ D GI SWDW LKG+  TYHALFPRAWTVYDG
Sbjct: 100 --------SQGVPLGGMGKSIDDGISSWDWKLKGDRSTYHALFPRAWTVYDG 143


>gi|297725625|ref|NP_001175176.1| Os07g0444000 [Oryza sativa Japonica Group]
 gi|255677726|dbj|BAH93904.1| Os07g0444000 [Oryza sativa Japonica Group]
          Length = 87

 Score = 72.0 bits (175), Expect = 1e-09,   Method: Compositional matrix adjust.
 Identities = 32/77 (41%), Positives = 53/77 (68%), Gaps = 2/77 (2%)

Query: 55  IGFRLYRYSKQEEAKGKVPVFNVFRKH--HITSDQGIPLGGIGAGSIGRSYRGEFQRFKL 112
           +G RL+ Y ++E + G+    + F +     ++ QG+PLGG+G+GSI R +RGEF+ +++
Sbjct: 3   LGLRLWSYIREEASHGRKAPIDPFTRESDKPSASQGVPLGGMGSGSISRGFRGEFKHWQI 62

Query: 113 FHGICDDAPVLANQFSV 129
             G C+ +PV+ANQFSV
Sbjct: 63  IPGSCEMSPVMANQFSV 79


>gi|160932020|ref|ZP_02079412.1| hypothetical protein CLOLEP_00853 [Clostridium leptum DSM 753]
 gi|156869062|gb|EDO62434.1| hypothetical protein CLOLEP_00853 [Clostridium leptum DSM 753]
          Length = 822

 Score = 71.2 bits (173), Expect = 2e-09,   Method: Compositional matrix adjust.
 Identities = 173/795 (21%), Positives = 309/795 (38%), Gaps = 138/795 (17%)

Query: 91  LGGIGAGSIGRSYRGEFQRFKLFHGICDDAPVLANQFSVFVSRPNGEKFSSVLCPR-SPG 149
           LGGIG G++    RGE + +++F+       +    F++      GE  S +L  + +P 
Sbjct: 32  LGGIGTGNVSVGARGELRSWQIFNEPGQFNFLPYTFFALRAEVEGGEAVSKILESKLNP- 90

Query: 150 VPKKNTDSGIESWDWNL-KGENCTYHALFPRAWTVYDGEPDPELRIVCRQISPFIPHNYK 208
            P  + +  +      L + E  T  A +P      + +  P +++     +PFIP N  
Sbjct: 91  -PFTHPEGTLRCELGGLPRFEKSTMSAEYPFVRVKLEDKQVP-VQVEMEAFTPFIPLNAD 148

Query: 209 ESSFPASVFTFTLSNSGQTSADVTLLFTWANSVAGDS-------GLSGHHFNSKTMTKDG 261
           +S  P +   + + N       V++  + AN+V  +         L+G+  N +   ++G
Sbjct: 149 DSGIPGAYLNYRVKNPTNQPVTVSIAGSLANAVGFEKYDIWNHMKLNGNPVN-EYQERNG 207

Query: 262 VHGLTLH------HRTANGRPPVTFAVAAEETADVHVSECPCFLLSGNSKGITAKDMWNE 315
           + GL            ANG    T  +A   T++  V+  P +     + G  A+D W++
Sbjct: 208 LKGLFFRGENVPEKHIANG----TMCLA---TSEDDVTAKPLWFQGEWTDG--AQDFWDD 258

Query: 316 IKKHGSFDHLDNDKTSPSEPGSSIGAAIAASLTIPSGSTRSVTFSLAWDCPEVKFFEKVY 375
               GS D +D+ K    + G +I      S    S S  S+         E + F  + 
Sbjct: 259 FISDGSLD-VDSGKV---QEGCAIAKIYDFSYLHFSDSIGSICIKKVLKPGEERVFPFLL 314

Query: 376 HRRYTKFYGTLGDSAARIAHDAILEHAKWECEIEAWQRPILEDKRFPEWYPITLFNELY- 434
              +       GD+          +H K    ++ +   I+ +      Y  TLF + + 
Sbjct: 315 TWNFPNRVRDWGDTD---------KHVK---AVQEYHYEIVGN------YYSTLFQDAWT 356

Query: 435 ---YLNAGGTIWT-DGSPPMQSLATIRERKFSLDTPSKIHTPASSDTALGTRLLENGEEN 490
              Y+N    I   D     Q+  +     + L+  +   T   S T   T   ENG+  
Sbjct: 357 VADYMNQKKEILEGDSRKFSQAFFSSTLPGYVLEAVADNITILRSPTCFRT---ENGD-- 411

Query: 491 IGQFLYLEGAEYVM---YNTYDVHFYSSFALVMLFPKLELSIQRDFAAAVMMHDPGTMKI 547
              F   EG E  +     T    +  +  +  LFP+LE S++R             ++ 
Sbjct: 412 ---FFGWEGVENTVGCGAGTCTHVWNYAQTVAFLFPELEQSMRR-------------IEF 455

Query: 548 MSDGKWVARKCLGAVPHDIGLDDPWFEINSYNLFNSSRWKDLNSK-----FVLQVYRDFV 602
           + + +                 D +    +Y  F    W+ L S       V+++YR++ 
Sbjct: 456 LQETQ----------------ADGYMPFRTYRPFGLPAWEMLPSADGQLGAVVRLYREWR 499

Query: 603 ATGDKNFARAVWPSVYIAMAY-MEQFDKDGDGMIENEGFPDQ----TYDAWSANGVSAYC 657
            +GD       W  V  +M Y +  +D DGD +      PD     TYD     G+++  
Sbjct: 500 LSGDNRLIEDCWDGVVRSMEYAIRTWDTDGDFV------PDSAQHVTYDT-ELYGMTSMV 552

Query: 658 GGLWVAALQAASALANDVGDHASASYFWVRYQK-AKAVYDSLWNGSYF-----NYDNSDG 711
             L+ AAL AA+ +A  +GD   A  +    +K AK + +  WNG Y+     + D    
Sbjct: 553 STLFFAALIAAAEMAEYMGDQERARRYRDGAEKGAKLMDEKCWNGEYYIQLIDDVDKYRY 612

Query: 712 SSSTSIQADQLAGQWYARACGLLPIADEAKVKKALTKIYDFNVLKVKGGMCGAMNGMQPD 771
                  +DQL GQ+ A+  GL  +  +  VKKA   I+ +N ++              +
Sbjct: 613 QYGKGCLSDQLLGQFLAQEAGLGYVLPKNHVKKAAESIFRYNFIERASDFGHVQRAYIFN 672

Query: 772 GRIDMS------GLQAR-------EIWPGVTYGLAASMIQEEMVDMAFQTAAGVYEVA-- 816
             + ++      G + R       E+W G+ Y +AA +I+E++++        V +    
Sbjct: 673 DEMGLTPCTWPLGERPRFPFIYFGEVWTGIEYEVAALLIREDLLEEGLTIVKAVRDRQDG 732

Query: 817 -----WSEDGLGYSF 826
                WSE+  GY +
Sbjct: 733 YRRNPWSENESGYYY 747


>gi|410638454|ref|ZP_11349015.1| hypothetical protein GLIP_3609 [Glaciecola lipolytica E3]
 gi|410141863|dbj|GAC16220.1| hypothetical protein GLIP_3609 [Glaciecola lipolytica E3]
          Length = 1124

 Score = 71.2 bits (173), Expect = 3e-09,   Method: Compositional matrix adjust.
 Identities = 85/352 (24%), Positives = 148/352 (42%), Gaps = 43/352 (12%)

Query: 594  VLQVYRDFVATGDKNFARAVWPSVYIAMAY-MEQFDKDGDGMIENEGFPDQTYDAWSANG 652
            +L  YR+   + + ++ ++ WP V  A+ + + ++D +  G + N      T D  +  G
Sbjct: 726  ILNTYREHQCSANDSWLKSQWPRVKKAVEWGINEWDPNRIGFMSNT--QHNTLDG-NITG 782

Query: 653  VSAYCGGLWVAALQAASALANDVGDHASAS-YFWVRYQKAKAVYDSLWNGSYFNYDNSDG 711
             S++ G L++ +L+AA+ +A+ VG+  +A  Y  +R    K     LWNG Y  Y   +G
Sbjct: 783  CSSWIGSLYLTSLEAAARMADIVGEPQTAKEYRAIRESGKKLQNKRLWNGEY--YIQEEG 840

Query: 712  SSS-----TSIQADQLAGQWYARACGLLPIADEAKVKKALTKIYDFNV--------LKVK 758
                         DQL G+W+A    +       + ++A++ +  +N         LK +
Sbjct: 841  KERFQDYLDGCHIDQLLGEWWADQLNIDRNYPRERAQQAMSALLKYNFRADFYGQSLKPR 900

Query: 759  GGMCGAMNGMQ----PDGRIDMSGLQ-AREIWPGVTYGLAASMIQEEMVD---MAFQTAA 810
                    GM+    P G+  + G++   E+  G  YG A +M+Q  M+D   M  +  A
Sbjct: 901  QYCEIEDGGMKMITWPRGQQPIPGMKYGDEVMTGFEYGAAVTMMQNGMIDEGLMLLKVVA 960

Query: 811  GVYEVAWSEDGLGYSFQTPESWNNNDEYRSLC---YMRPLTIWAMQWALTKPKLSRQEIK 867
              Y+     +G+      P  ++ N      C   Y R L++W+   AL       Q   
Sbjct: 961  DRYDGRLRTEGVSNVVNGPWGYSGNPYGDDECGKYYGRSLSVWSALTAL-------QGFI 1013

Query: 868  HE-----ISDRDSSYLEQHAAFSKVASLLKLPKEEASKGFLKVVYDFTIGRV 914
            ++     I  R     E HA+F  VA    L ++  S   L  V     G V
Sbjct: 1014 YDGPAGLIGFRPILQPENHASFFSVAQGYGLFEQTQSNNALTAVIAMREGSV 1065



 Score = 60.1 bits (144), Expect = 5e-06,   Method: Compositional matrix adjust.
 Identities = 47/178 (26%), Positives = 76/178 (42%), Gaps = 17/178 (9%)

Query: 194 IVCRQISPFIPHNYKESSFPASVFTFTLSNSGQTSADVTLLFTWANSVAGDSGLSGHH-F 252
           +     +PFIP + + SS P +++  T+ N+       ++L +  N++    G  G    
Sbjct: 409 VALEVFNPFIPMDLQNSSIPCAIYQVTVKNTSTDEVTTSVLSSQKNALGYTEGEKGEFGS 468

Query: 253 NSKTMTKDGVHGLTLHHRTANGRPPVTFAVAAEETADVHVSECPCFLLSGNSKGITAKDM 312
           N  T+ K G    TL H T  G  P    V   + AD           +G +   +  + 
Sbjct: 469 NQNTLLKKG--NTTLLHMTRAGSHPKADMVLLTQAADA----------TGTTNWTSPAEF 516

Query: 313 WNEIKKHGSFDHLDNDKTSP-SEPGSSIGAAIAASLTIPSGSTRSVTFSLAWDCPEVK 369
             E   +G     +N   SP S  G ++  A++A LT+  G ++SVTF L W   E K
Sbjct: 517 KREFTDNGGLTGPNN---SPISRAGQTVNGALSAPLTLAPGESKSVTFVLTWYFTETK 571


>gi|326436882|gb|EGD82452.1| hypothetical protein PTSG_11966 [Salpingoeca sp. ATCC 50818]
          Length = 1720

 Score = 70.9 bits (172), Expect = 3e-09,   Method: Compositional matrix adjust.
 Identities = 85/337 (25%), Positives = 145/337 (43%), Gaps = 58/337 (17%)

Query: 90   PLGGIGAGSIGRSYRGEFQRFKLFHGICDDAPVLANQFSVFVSRPNGEKFSSVLCPRSPG 149
            PLG  G G I  +  G  +++++ + IC DA V  + F+++ +   G   + VL      
Sbjct: 886  PLGPCGGGCIALAGDGGLRQWQILNRICHDAHVPDSFFALYTNDGKGGVDARVL-QLLKD 944

Query: 150  VPKKNTDSGIESWDWNLKGENCTYHALFPRAWTVYDGEPDPELRIVCRQISPFIPHNYKE 209
            +P     S IE     ++G+    + +    +T  +      +  +C    P IP + K 
Sbjct: 945  LPHL---SAIE-----IEGK----YPVMDVDYTSLNNGIKASMECMC----PCIPLDSKN 988

Query: 210  SSFPASVFTFTLSNSGQTSADVTLLFTWANSVAGDSGLS----------GHHFNSKTMTK 259
            S  P   F FTL+N+   + DV+LL +  N V  D   +          G + N+K MT 
Sbjct: 989  SGLPVIFFNFTLTNTTSDAVDVSLLMSQHNLVGWDGETAIENETEFAGYGGNVNAK-MTM 1047

Query: 260  DGVHGLTLHHRTANGRPPVT--FAVAAEETADVHVSECPCFLLSGNSKGITA-------K 310
            DGV  + +         P     A+AA               L+G+S  ++        K
Sbjct: 1048 DGVTAVDMTSTQFRDTDPQAGHIAIAA---------------LTGDSSEVSTMLQYDAHK 1092

Query: 311  DMWNEIKKHGSFDHLDNDKTSPSEPGSSIGAAIAASLTIPSGSTRSVTFSLAWDCPE--V 368
             +W+   K G        +++PS  G +   A+  S T+ +G+T ++TF LAW  P   V
Sbjct: 1093 TLWSHFAKTGKLP--GQGQSTPSPSGKTWNGAVCVSRTLAAGATTTITFVLAWHFPNRYV 1150

Query: 369  KFFEKVYHRRYTKFYGTLGDSAARI--AHDAILEHAK 403
             + +K ++   TK    LG++ A    +   +LE+AK
Sbjct: 1151 NWSQKGFYIDDTKSQYWLGNAYANFWPSITEVLEYAK 1187


>gi|423347418|ref|ZP_17325105.1| hypothetical protein HMPREF1060_02777 [Parabacteroides merdae
           CL03T12C32]
 gi|409217877|gb|EKN10851.1| hypothetical protein HMPREF1060_02777 [Parabacteroides merdae
           CL03T12C32]
          Length = 837

 Score = 69.7 bits (169), Expect = 8e-09,   Method: Compositional matrix adjust.
 Identities = 76/320 (23%), Positives = 128/320 (40%), Gaps = 49/320 (15%)

Query: 594 VLQVYRDFVATGDKNFARAVWPSVYIAMAYM---EQFDKDGDGMIENEGFPDQTYDAWSA 650
           V+++YRD+  +GD  F +  W  V   +AY    + +D + DG++E  G    T D  + 
Sbjct: 476 VMKIYRDWQLSGDDEFLQKNWGQVKKVLAYAWTDKGWDGNQDGVME--GLQHNTMDV-NY 532

Query: 651 NGVSAYCGGLWVAALQAASALANDVGDHASASYFWVRYQKAKAVYDS-LWNGSYFNYDNS 709
            G +   G  ++ AL+AA  +A  + D   A      +++     D+ L+NG Y+ +  +
Sbjct: 533 FGPNPQMGFWYMGALKAAEKMALAMKDKTFAKKCNTLFRQGSTWMDANLFNGEYYEHKIT 592

Query: 710 DGSSSTSIQ------------------ADQLAGQWYARACGLLPIADEAKVKKALTKIYD 751
           D  +   +                    DQL GQ+ A  CGL  + D+  ++  L  I  
Sbjct: 593 DPETFEYLDMRNPDVKVPPFQLGKGCLVDQLVGQYMAHICGLGYLGDKEHIRTTLGSIMK 652

Query: 752 FNVLK------------VKGGMCGAMNGMQPDGRIDMSGLQAREIWPGVTYGLAASMIQE 799
           +N +K            V G   G +    P GR+++      E+  G  Y  A  MI E
Sbjct: 653 YNYVKDFSRHFNNMRSYVMGDESGLLMASWPKGRLEVPFPYFAEVMTGFEYCAAVGMIYE 712

Query: 800 EMVDMAFQTAAGVYEVAWSEDGLGYS-FQTPESWNNNDEYRSLCYMRPLTIWAMQWALTK 858
            M   A      + +     DG   + F   E  ++        Y R +  WA   AL+ 
Sbjct: 713 GMEKEALTCIRAIRD---RHDGAKRNPFSEAECGHH--------YARSMASWAAVIALSD 761

Query: 859 PKLSRQEIKHEISDRDSSYL 878
            + S  + +   +DR   Y 
Sbjct: 762 FQYSGVDRRMHFTDRPGCYF 781



 Score = 59.3 bits (142), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 68/322 (21%), Positives = 128/322 (39%), Gaps = 43/322 (13%)

Query: 73  PVFNVFRKHHITSDQGIPLGGIGAGSIGRSYRGEFQRFKLFH---------GICDDAPVL 123
           PV   + + H+     +PLGGIG G++    RGE + +++ +            ++AP  
Sbjct: 18  PVLKQYDREHLYQ-VALPLGGIGTGTVSLGGRGELRDWEIMNVPGKKYSTVTTGNNAPF- 75

Query: 124 ANQFSVFVSRPNGEKFSSVLCPRSPGVPKKNTDSGIESWDWNLKG----ENCTYHALFPR 179
              F+++     GE  +++L    P  P++      E    N  G     + T+ A +P 
Sbjct: 76  ---FAIYAKPQTGEAMTTLLA--GPLYPQEYLH--YEGRPVNHHGMPRFADATFEAAYPF 128

Query: 180 AWTVYDGEPDPELRIVCRQISPFIPHNYKESSFPASVFTFTLSNSGQTSADVTLLFTWAN 239
              V+  +    +++  +  +P +P +   S  P +V  + ++N+     +V +  +  N
Sbjct: 129 G-QVHLSDRQLPVKVTVKGFNPLVPGDADASGMPIAVLAYEVTNTTDRPLEVAVCGSMRN 187

Query: 240 SVA-----------GDSGLSGHHFNSKTM-TKDGVHGLTLHHRTANGRPPVTFAVAAEET 287
            +            GD   +G   N      K+G+ GL L+    +   P    VA    
Sbjct: 188 FIGKDGSKFRTDWKGDYIPTGAKDNKNQFRKKNGLQGLYLYSDGVDKNDPAYGTVALTTQ 247

Query: 288 ADVHVSECPCFLLSGNSKGITAKDMWNEIKKHGSFDHLDNDKTSPSEPGSSIGAAIAASL 347
           A   V+          + GI   + W++    G     + D+    +P +S+    A   
Sbjct: 248 AVEGVTYRTSSRADDWNNGIL--NFWDDFSADGEL--TERDRQEDEDPMASL----AVKK 299

Query: 348 TIPSGSTRSVTFSLAWDCPEVK 369
           T+  GST + TF L W+ P  K
Sbjct: 300 TVGPGSTETFTFYLTWNFPNRK 321


>gi|154490957|ref|ZP_02030898.1| hypothetical protein PARMER_00874 [Parabacteroides merdae ATCC
           43184]
 gi|423724888|ref|ZP_17699030.1| hypothetical protein HMPREF1078_02927 [Parabacteroides merdae
           CL09T00C40]
 gi|154088705|gb|EDN87749.1| hypothetical protein PARMER_00874 [Parabacteroides merdae ATCC
           43184]
 gi|409236060|gb|EKN28869.1| hypothetical protein HMPREF1078_02927 [Parabacteroides merdae
           CL09T00C40]
          Length = 837

 Score = 69.3 bits (168), Expect = 9e-09,   Method: Compositional matrix adjust.
 Identities = 76/320 (23%), Positives = 128/320 (40%), Gaps = 49/320 (15%)

Query: 594 VLQVYRDFVATGDKNFARAVWPSVYIAMAYM---EQFDKDGDGMIENEGFPDQTYDAWSA 650
           V+++YRD+  +GD  F +  W  V   +AY    + +D + DG++E  G    T D  + 
Sbjct: 476 VMKIYRDWQLSGDDEFLQKNWGQVKKVLAYAWTDKGWDGNQDGVME--GLQHNTMDV-NY 532

Query: 651 NGVSAYCGGLWVAALQAASALANDVGDHASASYFWVRYQKAKAVYDS-LWNGSYFNYDNS 709
            G +   G  ++ AL+AA  +A  + D   A      +++     D+ L+NG Y+ +  +
Sbjct: 533 FGPNPQMGFWYMGALKAAEKMALAMKDKTFAKKCNTLFRQGSTWMDANLFNGEYYEHKIT 592

Query: 710 DGSSSTSIQ------------------ADQLAGQWYARACGLLPIADEAKVKKALTKIYD 751
           D  +   +                    DQL GQ+ A  CGL  + D+  ++  L  I  
Sbjct: 593 DPETFEYLDMRNPDVKVPPFQLGKGCLVDQLVGQYMAHICGLGYLGDKEHIRTTLGSIMK 652

Query: 752 FNVLK------------VKGGMCGAMNGMQPDGRIDMSGLQAREIWPGVTYGLAASMIQE 799
           +N +K            V G   G +    P GR+++      E+  G  Y  A  MI E
Sbjct: 653 YNYVKDFSRHFNNMRSYVMGDESGLLMASWPKGRLEVPFPYFAEVMTGFEYCAAVGMIYE 712

Query: 800 EMVDMAFQTAAGVYEVAWSEDGLGYS-FQTPESWNNNDEYRSLCYMRPLTIWAMQWALTK 858
            M   A      + +     DG   + F   E  ++        Y R +  WA   AL+ 
Sbjct: 713 GMEKEALTCIRAIRD---RHDGAKRNPFSEAECGHH--------YARSMASWAAVIALSD 761

Query: 859 PKLSRQEIKHEISDRDSSYL 878
            + S  + +   +DR   Y 
Sbjct: 762 FQYSGVDRRMHFTDRPGCYF 781



 Score = 59.7 bits (143), Expect = 8e-06,   Method: Compositional matrix adjust.
 Identities = 68/322 (21%), Positives = 128/322 (39%), Gaps = 43/322 (13%)

Query: 73  PVFNVFRKHHITSDQGIPLGGIGAGSIGRSYRGEFQRFKLFH---------GICDDAPVL 123
           PV   + + H+     +PLGGIG G++    RGE + +++ +            ++AP  
Sbjct: 18  PVLKQYDREHLYQ-VALPLGGIGTGTVSLGGRGELRDWEIMNVPGKKYSTVTTGNNAPF- 75

Query: 124 ANQFSVFVSRPNGEKFSSVLCPRSPGVPKKNTDSGIESWDWNLKG----ENCTYHALFPR 179
              F+++     GE  +++L    P  P++      E    N  G     + T+ A +P 
Sbjct: 76  ---FAIYAKPQTGEAMTTLLA--GPLYPQEYLH--YEGRPVNHHGMPRFADATFEAAYPF 128

Query: 180 AWTVYDGEPDPELRIVCRQISPFIPHNYKESSFPASVFTFTLSNSGQTSADVTLLFTWAN 239
              V+  +    +++  +  +P +P +   S  P +V  + ++N+     +V +  +  N
Sbjct: 129 G-QVHLSDRQLPVKVTVKGFNPLVPGDADASGMPIAVLAYEVTNTTDRPLEVAVCGSMRN 187

Query: 240 SVA-----------GDSGLSGHHFNSKTM-TKDGVHGLTLHHRTANGRPPVTFAVAAEET 287
            +            GD   +G   N      K+G+ GL L+    +   P    VA    
Sbjct: 188 FIGKDGSKFRTDWKGDYIPTGAKDNKNQFRKKNGLQGLYLYSDGVDKNDPAYGTVALTTQ 247

Query: 288 ADVHVSECPCFLLSGNSKGITAKDMWNEIKKHGSFDHLDNDKTSPSEPGSSIGAAIAASL 347
           A   V+          + GI   + W++    G     + D+    +P +S+    A   
Sbjct: 248 AVEGVTYRTSSRADNWNNGIL--NFWDDFSADGEL--TERDRQEDEDPMASL----AVKK 299

Query: 348 TIPSGSTRSVTFSLAWDCPEVK 369
           T+  GST + TF L W+ P  K
Sbjct: 300 TVGPGSTETFTFYLTWNFPNRK 321


>gi|160931643|ref|ZP_02079037.1| hypothetical protein CLOLEP_00474 [Clostridium leptum DSM 753]
 gi|156869288|gb|EDO62660.1| hypothetical protein CLOLEP_00474 [Clostridium leptum DSM 753]
          Length = 804

 Score = 68.9 bits (167), Expect = 1e-08,   Method: Compositional matrix adjust.
 Identities = 63/241 (26%), Positives = 108/241 (44%), Gaps = 26/241 (10%)

Query: 594 VLQVYRDFVATGDKNFARAVWPSVYIAMAYMEQF-DKDGDGMIENEGFPDQTYDAWSANG 652
           +L+VYR++  +GD +F + +W  V  A+ +   + D + D ++  +G    TYD     G
Sbjct: 477 ILRVYREWKISGDDDFLKKIWNQVVSALEFSASYWDSNQDCVL--DGQQHNTYDI-EFYG 533

Query: 653 VSAYCGGLWVAALQAASALANDVGDHASASYFWVRYQKA--KAVYDSLWNGSYF------ 704
           V++    ++ AAL+A + +A  +G+H   S  W   ++A  K + + L+NG Y+      
Sbjct: 534 VNSLGNSIYYAALKAGAEMAEYLGEH-ERSQKWRSMEQAGCKRMDEMLFNGEYYRQVTDG 592

Query: 705 NYDNSDGSSSTSIQADQLAGQWYARACGLLPIADEAKVKKALTKIYDFNVLKVKGGMCGA 764
           + D           +DQL GQ  A   GL  +  E  VK A+  IY +N  +  G     
Sbjct: 593 DIDEYKYQYGEGCLSDQLLGQTLAHLYGLGHLFPEDHVKSAVFAIYKYNFKERMGSHKSL 652

Query: 765 MNGM----QP---------DGRIDMSGLQAREIWPGVTYGLAASMIQEEMVDMAFQTAAG 811
             G     +P          GR     + + E+W G+ Y +AA +I E  +    +    
Sbjct: 653 QRGYAYQDEPGLLLCSWPSGGRPKQPFVYSDEVWTGIEYQVAAGLIYEGFLQEGLEIVRA 712

Query: 812 V 812
           V
Sbjct: 713 V 713



 Score = 54.7 bits (130), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 80/334 (23%), Positives = 136/334 (40%), Gaps = 43/334 (12%)

Query: 91  LGGIGAGSIGRSYRGEFQRFKLFHGICDDAPVLANQFSVFVSRPNGEKFSSVLCPRSPGV 150
           LGGIG G+I  + RGE   F++F+       +  + F+++ S P+GE  + VL  +  G+
Sbjct: 24  LGGIGTGNISLNARGELCDFEIFNRQSKGLKLPYSFFALWYSLPSGESDALVLEAKPQGL 83

Query: 151 ---PKKNTDS---GIESWDWNLKGENCTYHAL-FPRAWTVYDGEPDPELRIVCRQISPFI 203
              P     S   G+  +D      +C   A  F       DG P     +  +  +PFI
Sbjct: 84  CDAPMGVAPSLVPGLPRFD-----SSCFSSAYPFANIQLKKDGLP---FEVTLQAYTPFI 135

Query: 204 PHNYKESSFPASVFTFTLSNSGQTSADVTLLFTWANSVAGDS----GLSGHHFN----SK 255
           P +   S  P   F + L N         +  T  N+   DS    GL+  +      ++
Sbjct: 136 PLDDVNSGIPGIRFVYRLKNLLDQPVKAAVSATMPNACGFDSFSTDGLNKLYLKGRPENR 195

Query: 256 TMTKDGVHGLTLHHRTANGRPPVTFAVAAEETADVH-VSECPCFLLSGNSKGITAKDMWN 314
            + K+G+ G+   + T   +  + F   A  T +    S  P +L  G   G  A++ W 
Sbjct: 196 VVKKNGLQGIC--YSTDLEKEQLGFGTMALLTDEQEGFSAKPRWLNEGWWDG--AEEYWQ 251

Query: 315 EIKKHGSFDHLDN-----DKTSPSEPGSSIGAAIAASLTIPSGSTRSVTFSLAWDCP-EV 368
           ++ + G    + +      + +PS   S+IG +IA    +  G ++   F L W  P   
Sbjct: 252 DLMEDGCLGEVVSGGKPVSRINPSGEESAIG-SIAVQKVLQPGESKDFVFYLTWHFPNRY 310

Query: 369 KFFEKVYHRR--------YTKFYGTLGDSAARIA 394
            ++   +  R        Y  +Y TL D A  +A
Sbjct: 311 GWWPDGHDARKPIPCADIYQNYYSTLWDDALAVA 344


>gi|116624646|ref|YP_826802.1| hypothetical protein Acid_5570 [Candidatus Solibacter usitatus
           Ellin6076]
 gi|116227808|gb|ABJ86517.1| protein of unknown function DUF608 [Candidatus Solibacter usitatus
           Ellin6076]
          Length = 844

 Score = 67.8 bits (164), Expect = 2e-08,   Method: Compositional matrix adjust.
 Identities = 51/187 (27%), Positives = 82/187 (43%), Gaps = 30/187 (16%)

Query: 594 VLQVYRDFVATGDKNFARAVWPSVYIAMAYM---EQFDKDGDGMIENEGFPDQTYDAWSA 650
           ++  Y D+  +GD  + R +WP V  A+ +      +D D DG++E  G    TYD    
Sbjct: 480 IMHAYLDWSLSGDDAWLREMWPRVKKAIEFAWIPGGWDGDRDGVLE--GAQHNTYDV-EF 536

Query: 651 NGVSAYCGGLWVAALQAASALANDVGDHASASYFWVRYQKAKAVYDS-LWNGSYF----- 704
            G +  CG  ++ AL+A   +A  VGD ASA+ +   ++  +   D+ L+NG Y+     
Sbjct: 537 YGPNPMCGIYYLGALRAGEEMARAVGDTASAAEYRRLFESGRQWIDAKLFNGEYYIQQIR 596

Query: 705 ------------------NYDNSDGSSSTSIQADQLAGQWYARACGLLPIADEAKVKKAL 746
                             + +N           DQL GQ+ A   GL P+     ++K L
Sbjct: 597 GVAKDKVAKAIIGTMGSDDTENPQYQVGAGCLVDQLVGQYMAEVAGLGPLVSPENIRKTL 656

Query: 747 TKIYDFN 753
             IY +N
Sbjct: 657 QSIYRYN 663



 Score = 52.4 bits (124), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 70/299 (23%), Positives = 119/299 (39%), Gaps = 28/299 (9%)

Query: 76  NVFRKHHITSDQGIPLGGIGAGSIGRSYRGEFQRFKLFHGICDDAPVLANQF-SVFVSRP 134
            VF    +T+    PLGG+ AGSI    RG+ + +++F+   D     +  F +++V   
Sbjct: 34  RVFSGRKLTA-IAFPLGGVCAGSISLGGRGQLRDWEIFN-RSDKGNAPSYAFPAIWVQSG 91

Query: 135 NGEKFSSVLCPRSPGVPKKNTDSGIESWDWNLKGENCTYHALFPRAWTVYDGEPDPELRI 194
             +  + VL  R     +     G  S    ++    T+   FP A   +     P +++
Sbjct: 92  ARKPVARVLESRIQPPYEGQNGLGTRSAPGLVRLAGATFTGEFPLARIDFTDAALP-VKV 150

Query: 195 VCRQISPFIPHNYKESSFPASVFTFTLSNSGQTSADVTLLFTWANSVAGDSGLSGHHFNS 254
                +PFIPH   +S  P +V  + +SN G  +A V++ ++  +S  G +  +    N 
Sbjct: 151 SLEAFTPFIPHEPDDSGLPVAVLRYRVSNPGAAAAKVSIAWS-IDSPVGRTQAADTRTN- 208

Query: 255 KTMTKDGVHGLTLHHRTANGRPPV------TFAVAAEETADVHVSECPCFLLSGNSKGIT 308
           +     G+ GL +     N   P       +FA+AA  TA   V+           +G  
Sbjct: 209 EYREAGGLAGLMM----TNAELPAADALKGSFALAAMGTAGARVTHL---------RGWE 255

Query: 309 AKDMWNEIKKHGSFDHLDND-KTSPSEPGSSIGAAIAASLTIPSGSTRSVTFSLAWDCP 366
               WN    +  +D    D +  P     +   A+    TI +G      F LAW  P
Sbjct: 256 QGRWWNSPMLY--WDDFSADGELGPEAAKMNAVGALCVQRTIAAGGHADFPFLLAWHFP 312


>gi|269839491|ref|YP_003324183.1| hypothetical protein Tter_2468 [Thermobaculum terrenum ATCC
           BAA-798]
 gi|269791221|gb|ACZ43361.1| protein of unknown function DUF608 [Thermobaculum terrenum ATCC
           BAA-798]
          Length = 807

 Score = 67.0 bits (162), Expect = 4e-08,   Method: Compositional matrix adjust.
 Identities = 72/242 (29%), Positives = 107/242 (44%), Gaps = 30/242 (12%)

Query: 594 VLQVYRDFVATGDKNFARAVWPSVYIAMAYME-QFDKDGDGMIENEGFPDQTYDAWSANG 652
           +L+V R++  +GD  + R VWP V  A+AY   Q+D DGDG+++       TYD     G
Sbjct: 462 ILRVLREWRLSGDMEWLRRVWPGVKSALAYASAQWDGDGDGVLDGR--QHNTYDV-EFYG 518

Query: 653 VSAYCGGLWVAALQAASALANDVGDHASASYFWVRYQKAKAVYDS-LWNGSYF----NYD 707
            +  CG  ++AAL+A   LA  +G+   A      +++     D  LWNG Y+    + D
Sbjct: 519 PNPLCGLYYLAALRAVEELAEVLGEANLARRCREVFERGSHRLDELLWNGEYYEQLGDED 578

Query: 708 NSDGSSSTSIQADQLAGQWYARACGLLPIADEAKVKKALTKIYDFN-------------- 753
                      ADQL GQ  A   G+  +  E  V++AL  I+ +N              
Sbjct: 579 AWPYQHGKGCLADQLLGQLQAHVLGMGYLVPEEHVRQALRSIWRYNFRADLSEHVNCQRT 638

Query: 754 -VLKVKGG--MCGAMNGMQPDGRIDMSGLQAREIWPGVTYGLAASMIQEEMVDMAFQTAA 810
             L  + G  MC    G QP      S     E++ G  Y +AA MI E +++   Q   
Sbjct: 639 FALNDEAGLLMCTWPRGGQPTFPFPYSD----EVFTGSEYQVAAHMIYEGLLEEGLQIVG 694

Query: 811 GV 812
            V
Sbjct: 695 AV 696



 Score = 58.2 bits (139), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 81/385 (21%), Positives = 142/385 (36%), Gaps = 70/385 (18%)

Query: 89  IPLGGIGAGSIGRSYRGEFQRFKLFHGICDDAPVLANQFSVFVSRPNGEKFSSVLCPRSP 148
            PLGGIG G++    RG+ + +++ +       +    F+++    +G   + VL    P
Sbjct: 17  FPLGGIGTGNVSLGARGDLRDWEICNRPAKGNRLPNTFFALWTRTASGRCVTRVL--EGP 74

Query: 149 GVPKKNTDSGIE-SWDWNL-KGENCTYHALFPRAWTVYDGEPDPELRIVCRQISPFIPHN 206
             P  +   G   S    L +  +  +   +P AW V   +P   +R+     +P IP  
Sbjct: 75  IQPPHDGSHGYHPSTGAGLPRCRHSRFRGEYPLAW-VELADPALPVRVSLEAYTPLIPLC 133

Query: 207 YKESSFPASVFTFTLSNSGQTSADVTLLFTWANSVAGDS-----GLSGHHFNSKTMTKDG 261
            ++S  P ++ ++T+ N+      +TL+ +  N V G +        G   N     +  
Sbjct: 134 PEDSGIPCAILSYTVENTSDEVVALTLVGSLTNPVGGVTFDRFGNYRGAGGNQNEYRESE 193

Query: 262 VH-GLTLHHRT-------------ANGRPPVTFAVAAEETADVHVSECPCFLLSGNSKGI 307
           V+ GL L   T             A   P +T+  A                L G +   
Sbjct: 194 VYRGLWLRSTTLPSDDLRYGDMTLATDHPVITYKRA---------------WLRG-AWWD 237

Query: 308 TAKDMWNEIKKHGSFDHLDNDKTSPSEPGSSIGAAIAASLTIPSGSTRSVTFSLAWDCPE 367
             ++ W ++ + G  + L  +  SPSEPG +   ++     +  G T +  F L W  P 
Sbjct: 238 YLREFWQDLSEDGRLEDLGYE--SPSEPGKTDTCSLGLRDELAPGETHTYRFVLTWYFPN 295

Query: 368 V---------KFFEKVYHRRYTKFYGTLG---------DSAARIAHDAILEHAKWECEIE 409
                     +     Y  R+   +   G         +   ++ HDA+ E    +C +E
Sbjct: 296 RPHSWREEAGEIVRNHYASRFASSWDVAGYVFTHLARLEQGTKLFHDALFESTLPDCVLE 355

Query: 410 AWQRPI----------LEDKRFPEW 424
           A    I          LED RF  W
Sbjct: 356 ALSANIVPLRSNTCFWLEDGRFLAW 380


>gi|284997210|ref|YP_003418977.1| hypothetical protein [Sulfolobus islandicus L.D.8.5]
 gi|284445105|gb|ADB86607.1| hypothetical protein LD85_0902 [Sulfolobus islandicus L.D.8.5]
          Length = 498

 Score = 66.6 bits (161), Expect = 6e-08,   Method: Compositional matrix adjust.
 Identities = 48/167 (28%), Positives = 76/167 (45%), Gaps = 29/167 (17%)

Query: 513 YSSFALVMLFPKLELSIQRDFAAAVMMHDPGTMKIMSDGKWVARKCLGAVPHDIG---LD 569
           + S  ++++FP+LE S  +   + V                   +  G VPHD+G   LD
Sbjct: 350 FGSLPVILMFPELEKSFLKLLISYV-------------------RSDGYVPHDLGFHSLD 390

Query: 570 DPWFEINSYNLFNSSRWKDLNSKFVLQVYRDFVATGDKNFARAVWPSVYIAMAYMEQFDK 629
            P     S       +WKD+N   +L VYR F  T D +F + V+P++   M +  +  +
Sbjct: 391 SPIDGTTS-----PPKWKDMNPSLILLVYRYFKFTNDIDFLKEVYPTIVKVMDW--ELRQ 443

Query: 630 DGDGMIENEGFPDQTYDAWSANGVSAYCGGLWVAALQAASALANDVG 676
             DG+   EG  D  +DA    G  +Y   L++A+L A   +A  VG
Sbjct: 444 CRDGLPFMEGEMDNAFDATIIKGHDSYTSSLFIASLIAMREIAKLVG 490


>gi|423217835|ref|ZP_17204331.1| hypothetical protein HMPREF1061_01104 [Bacteroides caccae
           CL03T12C61]
 gi|392627338|gb|EIY21373.1| hypothetical protein HMPREF1061_01104 [Bacteroides caccae
           CL03T12C61]
          Length = 851

 Score = 65.1 bits (157), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 72/322 (22%), Positives = 132/322 (40%), Gaps = 53/322 (16%)

Query: 594 VLQVYRDFVATGDKNFARAVWPSVYIAMAYM---EQFDKDGDGMIENEGFPDQTYDAWSA 650
           ++++YR++  +GD +F +  W  V   ++Y    + +D + DG++E  G    T D  + 
Sbjct: 488 IMKIYREWQLSGDNDFLKNNWEQVKKVLSYAWIEKGWDGNQDGVME--GSQHNTMDV-NY 544

Query: 651 NGVSAYCGGLWVAALQAASALANDVGDHASASYFWVRYQKAKAVYD-SLWNGSYFNYDNS 709
            G +   G  ++ AL+AA  ++  + D   A      ++K     D +L+NG Y+ +  +
Sbjct: 545 FGPNPQMGFWYMGALKAAEKMSIAMKDKNFAKKCRTLFEKGSEWMDENLFNGEYYEHKIT 604

Query: 710 DGSS------------------STSIQADQLAGQWYARACGLLPIADEAKVKKALTKIYD 751
           D  +                        DQL GQ+ A  CGL  + D+  ++  +  I  
Sbjct: 605 DPKTFEFLDMNDPDVKIPGFQLGQGCLVDQLVGQYMAHLCGLGYLGDKKNIQTTMKSIMK 664

Query: 752 FNVLK------------VKGGMCGAMNGMQPDGRIDMSGLQAREIWPGVTYGLAASMIQE 799
           +N ++            V G   G +    P GR+++      E+  G  Y  A  M+ E
Sbjct: 665 YNFVEDFSRHFNNMRSYVMGDEAGLLMASWPKGRLEVPFPYFSEVMTGFEYCAAVGMLYE 724

Query: 800 EMVDMAFQTAAGVYEVAWSEDGLGYSFQTPESWNNNDEYRSLC---YMRPLTIWAMQWAL 856
            M + A      +       DG   + + P S        S C   Y R +  W+   AL
Sbjct: 725 GMEEDALTCINAIRR---RHDG---AKRNPFS-------ESECGHHYARSMASWSAIIAL 771

Query: 857 TKPKLSRQEIKHEISDRDSSYL 878
           ++ + S  +   +I+DR  +Y 
Sbjct: 772 SEFQYSGVDKSMKITDRPGNYF 793



 Score = 46.6 bits (109), Expect = 0.056,   Method: Compositional matrix adjust.
 Identities = 67/338 (19%), Positives = 132/338 (39%), Gaps = 39/338 (11%)

Query: 65  QEEAKGKVPVFNVFRKHHITSDQGIPLGGIGAGSIGRSYRGEFQRFKLFHGICDDAPVLA 124
           QE    + PV   + K H+     +PLGGIG G++    RGE + +++ +       V A
Sbjct: 22  QEWKPQQWPVLKKYDKEHLYQ-IALPLGGIGTGTVSLGGRGELRDWEIMN-------VPA 73

Query: 125 NQFSVFVSRPNGEKFSSVLCPRSPGVPKKNTDSGIESWDW-NLKGENCTYHAL------- 176
            ++S   +  N   F+    P++           + S ++ + +G    +H L       
Sbjct: 74  KKYSTVTTGNNAPFFAIYAKPQNQKATTTLLAGPLYSHEYLHYEGRPVNHHGLPRFEQAS 133

Query: 177 FPRAW---TVYDGEPDPELRIVCRQISPFIPHNYKESSFPASVFTFTLSNSGQTSADVTL 233
           F  A+    V   + +  +++  +  +PFIP +   S  P +V ++ ++N+     +V++
Sbjct: 134 FEAAYPFGQVSLSDSNLPVKVKVKGFNPFIPGDADASGLPVAVLSYEVTNTTDQPMNVSI 193

Query: 234 LFTWANSVA-----------GDSGLSGHHFN-SKTMTKDGVHGLTLHHRTANGRPPVTFA 281
             +  N +            GD   +G   N +K +  + + G+ L+    +        
Sbjct: 194 CGSIRNFIGKDGSKFRTDWKGDYIPTGAKNNKNKYVESNNIKGIYLYSDGVDKDDSAWGT 253

Query: 282 VAAEETADVHVSECPCFLLSGNSKGITAKDMWNEIKKHGSFDHLDNDKTSPSEPGSSIGA 341
           VA    A   V+          +  I   + W++    G     +  K    +P     A
Sbjct: 254 VALTTQAVNGVTYRTSSKADNWNNSIL--NFWDDFSADGML--TERSKQEDEDP----MA 305

Query: 342 AIAASLTIPSGSTRSVTFSLAWDCPEVKFFEKVYHRRY 379
           +++ + TI   ST + TF + W  P  K +  V    Y
Sbjct: 306 SLSVNKTIAPQSTETFTFYITWSFPNRKAWSSVVVGNY 343


>gi|153807893|ref|ZP_01960561.1| hypothetical protein BACCAC_02179 [Bacteroides caccae ATCC 43185]
 gi|149129502|gb|EDM20716.1| hypothetical protein BACCAC_02179 [Bacteroides caccae ATCC 43185]
          Length = 851

 Score = 65.1 bits (157), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 72/322 (22%), Positives = 132/322 (40%), Gaps = 53/322 (16%)

Query: 594 VLQVYRDFVATGDKNFARAVWPSVYIAMAYM---EQFDKDGDGMIENEGFPDQTYDAWSA 650
           ++++YR++  +GD +F +  W  V   ++Y    + +D + DG++E  G    T D  + 
Sbjct: 488 IMKIYREWQLSGDNDFLKNNWEQVKKVLSYAWIEKGWDGNQDGVME--GSQHNTMDV-NY 544

Query: 651 NGVSAYCGGLWVAALQAASALANDVGDHASASYFWVRYQKAKAVYD-SLWNGSYFNYDNS 709
            G +   G  ++ AL+AA  ++  + D   A      ++K     D +L+NG Y+ +  +
Sbjct: 545 FGPNPQMGFWYMGALKAAEKMSIAMKDKNFAKKCRTLFEKGSEWMDENLFNGEYYEHKIT 604

Query: 710 DGSS------------------STSIQADQLAGQWYARACGLLPIADEAKVKKALTKIYD 751
           D  +                        DQL GQ+ A  CGL  + D+  ++  +  I  
Sbjct: 605 DPKTFEFLDMNDPDVKIPGFQLGQGCLVDQLVGQYMAHLCGLGYLGDKKNIQTTMKSIMK 664

Query: 752 FNVLK------------VKGGMCGAMNGMQPDGRIDMSGLQAREIWPGVTYGLAASMIQE 799
           +N ++            V G   G +    P GR+++      E+  G  Y  A  M+ E
Sbjct: 665 YNFVEDFSRHFNNMRSYVMGDEAGLLMASWPKGRLEVPFPYFSEVMTGFEYCAAVGMLYE 724

Query: 800 EMVDMAFQTAAGVYEVAWSEDGLGYSFQTPESWNNNDEYRSLC---YMRPLTIWAMQWAL 856
            M + A      +       DG   + + P S        S C   Y R +  W+   AL
Sbjct: 725 GMEEDALTCINAIRR---RHDG---AKRNPFS-------ESECGHHYARSMASWSAIIAL 771

Query: 857 TKPKLSRQEIKHEISDRDSSYL 878
           ++ + S  +   +I+DR  +Y 
Sbjct: 772 SEFQYSGVDKSMKITDRPGNYF 793



 Score = 46.6 bits (109), Expect = 0.059,   Method: Compositional matrix adjust.
 Identities = 67/338 (19%), Positives = 132/338 (39%), Gaps = 39/338 (11%)

Query: 65  QEEAKGKVPVFNVFRKHHITSDQGIPLGGIGAGSIGRSYRGEFQRFKLFHGICDDAPVLA 124
           QE    + PV   + K H+     +PLGGIG G++    RGE + +++ +       V A
Sbjct: 22  QEWKPQQWPVLKKYDKEHLYQ-IALPLGGIGTGTVSLGGRGELRDWEIMN-------VPA 73

Query: 125 NQFSVFVSRPNGEKFSSVLCPRSPGVPKKNTDSGIESWDW-NLKGENCTYHAL------- 176
            ++S   +  N   F+    P++           + S ++ + +G    +H L       
Sbjct: 74  KKYSTVTTGNNAPFFAIYAKPQNQKATTTLLAGPLYSHEYLHYEGRPVNHHGLPRFEQAS 133

Query: 177 FPRAW---TVYDGEPDPELRIVCRQISPFIPHNYKESSFPASVFTFTLSNSGQTSADVTL 233
           F  A+    V   + +  +++  +  +PFIP +   S  P +V ++ ++N+     +V++
Sbjct: 134 FEAAYPFGQVSLSDSNLPVKVKVKGFNPFIPGDADASGLPVAVLSYEVTNTTDQPMNVSI 193

Query: 234 LFTWANSVA-----------GDSGLSGHHFN-SKTMTKDGVHGLTLHHRTANGRPPVTFA 281
             +  N +            GD   +G   N +K +  + + G+ L+    +        
Sbjct: 194 CGSIRNFIGKDGSKFRTDWKGDYIPTGAKNNKNKYVESNNIKGIYLYSDGVDKDDSAWGT 253

Query: 282 VAAEETADVHVSECPCFLLSGNSKGITAKDMWNEIKKHGSFDHLDNDKTSPSEPGSSIGA 341
           VA    A   V+          +  I   + W++    G     +  K    +P     A
Sbjct: 254 VALTTQAVNGVTYRTSSKADNWNNSIL--NFWDDFSADGML--TERSKQEDEDP----MA 305

Query: 342 AIAASLTIPSGSTRSVTFSLAWDCPEVKFFEKVYHRRY 379
           +++ + TI   ST + TF + W  P  K +  V    Y
Sbjct: 306 SLSVNKTIAPQSTETFTFYITWSFPNRKAWSSVVVGNY 343


>gi|348173841|ref|ZP_08880735.1| hypothetical protein SspiN1_25464 [Saccharopolyspora spinosa NRRL
           18395]
          Length = 995

 Score = 65.1 bits (157), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 122/509 (23%), Positives = 193/509 (37%), Gaps = 117/509 (22%)

Query: 341 AAIAASLTIPSGSTRSVTFSLAW-----DCPEVKFFE--KVYHRRYTKFYGTLGDSAARI 393
           A + A +T+  G  R + F LAW     D   + F E  +   R Y   +G+  D  A +
Sbjct: 427 AVVRAGMTLAPGEQRMIRFVLAWFFPVPDREGLSFLEHSRELLRHYAARFGSAQDVVAHL 486

Query: 394 AHDAILEHAKWECEIEAWQRPILEDKRFPEWYPITLFNELYYLNAGGTIWTDGSPPMQSL 453
           A     E  +      AWQR          WY                   D + P   L
Sbjct: 487 AR----EQERLIDGTRAWQR---------TWY------------------EDSTLPHWLL 515

Query: 454 ATIRERKFSLDTPSKIHTPASSDTALGTRLLENGEENIGQFLYLEGAEYVMYNTYDVHFY 513
             +                A++ T L T       +  G+F   EGA Y    T    ++
Sbjct: 516 ERV----------------ATNTTILSTSTCYRFAD--GRFYGWEGA-YCCAGTCTHVWH 556

Query: 514 SSFALVMLFPKLELSIQRDFAAAVMMHDPGTMKIMSDGKWVARKCLGAVPHDIGLDDPWF 573
            + AL  LFP +E   +                         R  LG     IG      
Sbjct: 557 YAHALARLFPTVERDTRE------------------------RVDLG-----IGFHAATG 587

Query: 574 EINSYNLFNSSRWKDLNSKFVLQVYRDFVATGDKNFARAVWPSVYIAMAYMEQFDKDGDG 633
           +++     + S   D  +  +L++YR+   + D  + + VWP    A+ Y+   D + DG
Sbjct: 588 QLSMRGEADRSPAVDGQAGTILRIYREHQMSPDSQWLQRVWPRTRQAVEYLIASDAEPDG 647

Query: 634 MIENEGFPDQTYDAWSANGVSAYCGGLWVAALQAASALANDVGDHASASYFW-VRYQKAK 692
           ++  +G    T DA +  G +++  GL++AAL+A +A+A +VGD A A     +     +
Sbjct: 648 IL--DGAQPNTQDA-TWFGRNSWLSGLYLAALRAGAAMAGEVGDDAFARRCTELATSGTE 704

Query: 693 AVYDSLWNGSYFNYD---NSDGSSSTS--IQADQLAGQWYARACGLLPIADEAKVKKALT 747
            +   L+NG YF ++      GS +T+    ADQL G  +A   GL  +      + AL 
Sbjct: 705 VIVRDLFNGEYFVHELDPAHPGSVNTNRGCFADQLLGPSWATQLGLPRVLPPEPTRSALR 764

Query: 748 KIYDFNVL----------KVKGG------------MCGAMNGMQPDGRIDMSGLQAREIW 785
            I+  N +           ++GG            MC   NG   +   + S     E W
Sbjct: 765 SIWRHNFVPRPMEYRQNSPIEGGRIFYDADVPALVMCTWPNGGGDEAGDNWSVSYFNEAW 824

Query: 786 PGVTYGLAASMIQEEMVDMAFQTAAGVYE 814
            G+ Y +AA +I E MVD     A  V++
Sbjct: 825 HGIEYQVAAHLIAEGMVDEGLAVARSVHD 853



 Score = 50.8 bits (120), Expect = 0.004,   Method: Compositional matrix adjust.
 Identities = 51/187 (27%), Positives = 77/187 (41%), Gaps = 26/187 (13%)

Query: 88  GIPLGGIGAGSIGRSYRGEFQRFKLFHGICDDAPVLANQFSVFVSRPNGEKFSSVLCPRS 147
           G+P+GGIG G +     G     +L+    D+    AN   +  +RP        + P S
Sbjct: 28  GLPIGGIGCGQLHLGGDG-----RLWLWDVDNRTAPANINDLHFTRP--------MLPSS 74

Query: 148 P---GVPKKNTDSGIESWDW-NLKGENCTYHALFPRAWTVYDGEPDPELRIVCRQISPFI 203
           P   G   + TD       W + +G      A  P A TV   +P   +++     SPFI
Sbjct: 75  PVEQGFAVRVTDGDDVRTRWLDARGFPDATFAGRPPAATVDYADPQEPVQVRLDACSPFI 134

Query: 204 PHNYKESSFPASVFTFTLSNSGQTSADVTLLFTWANSVAGDSGLSGHH-----FNSKTMT 258
           P    +SS+PA    +T +N+G  + DV +    AN V     L+  H       S+   
Sbjct: 135 PTEIDDSSYPAVFLDYTATNTGPATVDVAIAGFLANPVC----LTSRHTRPLRLRSQEFA 190

Query: 259 KDGVHGL 265
            DG  G+
Sbjct: 191 LDGAAGV 197


>gi|160890984|ref|ZP_02071987.1| hypothetical protein BACUNI_03431 [Bacteroides uniformis ATCC 8492]
 gi|156859205|gb|EDO52636.1| hypothetical protein BACUNI_03431 [Bacteroides uniformis ATCC 8492]
          Length = 858

 Score = 64.7 bits (156), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 83/368 (22%), Positives = 137/368 (37%), Gaps = 75/368 (20%)

Query: 594 VLQVYRDFVATGDKNFARAVWPSVYIAMAY-MEQFDKDGDGMIENEGFPDQTYDAWSANG 652
           +++VYR++   GD N+ + ++P+V  +M Y +  +D    G +E         + W ANG
Sbjct: 498 IMKVYREWRIMGDINWLKMMYPAVKKSMDYCIRTWDPREVGAVEEPHHNTYDIEFWGANG 557

Query: 653 VSAYCGGLWVAALQAASALANDVGDHASASYFWVRYQ----KAKAVYDS-LWNGSYF--- 704
           +       +V ALQA   +   VGD  S      RY+    K+KA  +S L++G YF   
Sbjct: 558 MIT---SFYVGALQAICLMGKTVGDDIS------RYEELMNKSKAYLESKLFDGEYFIQN 608

Query: 705 ----NYDNSDGSSSTSI-------------------------QADQLAGQWYARACGLLP 735
                 D  D   + S                           +D + G W    CG+  
Sbjct: 609 IQWTGLDAPDPVDAQSFVTHYTPEALKILQEEGPKYQYGKGCLSDGILGSWMTLVCGMPE 668

Query: 736 IADEAKVKKALTKIYDFNVLK-------------VKGGMCGAMNGMQPD-GRIDMSGLQA 781
           + D  KVK  L  ++ +N  K               G   G +    P  G++ +  + +
Sbjct: 669 VVDRLKVKSHLISVHKYNFKKSLSNHVNPQRSTFALGEDGGLLLCTWPKGGKLKLPFVYS 728

Query: 782 REIWPGVTYGLAASMIQEEMVDMAFQTAAGVYEVAWSEDGLGYSFQTPESWNNNDEYRSL 841
            E+W G+ Y +AA ++ E  V+   +       V    D      + P     N+     
Sbjct: 729 NEVWTGIEYQVAAHLMFEGEVEKGLEI------VRTCRDRYNGRVRNP----FNEYECGA 778

Query: 842 CYMRPLTIWAMQWALTKPKLSRQE----IKHEISDRDSSYLEQHAAFSKVASLLKLPKEE 897
            Y R +  +AM   LT  +    +    I   I D  +S+L     F  V      P  +
Sbjct: 779 WYARAMASYAMLEGLTGIRYDAVDKILYIDSRIGDDFTSFLSTETGFGNVGLKEGKPFID 838

Query: 898 ASKGFLKV 905
              G + V
Sbjct: 839 VKYGVIDV 846


>gi|270294306|ref|ZP_06200508.1| conserved hypothetical protein [Bacteroides sp. D20]
 gi|270275773|gb|EFA21633.1| conserved hypothetical protein [Bacteroides sp. D20]
          Length = 851

 Score = 64.7 bits (156), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 84/369 (22%), Positives = 138/369 (37%), Gaps = 77/369 (20%)

Query: 594 VLQVYRDFVATGDKNFARAVWPSVYIAMAY-MEQFDKDGDGMIENEGFPDQTYDAWSANG 652
           +++VYR++   GD N+ + ++P+V  +M Y +  +D    G +E         + W ANG
Sbjct: 491 IMKVYREWRIMGDINWLKMMYPAVKKSMDYCIRTWDPREVGAVEEPHHNTYDIEFWGANG 550

Query: 653 VSAYCGGLWVAALQAASALANDVGDHASASYFWVRYQ----KAKAVYDS-LWNGSYF--- 704
           +       +V ALQA   +   VGD  S      RY+    K+KA  +S L++G YF   
Sbjct: 551 MIT---SFYVGALQAICLMGKTVGDDIS------RYEELMNKSKAYLESKLFDGEYFIQN 601

Query: 705 ----NYDNSDGSSSTSI-------------------------QADQLAGQWYARACGLLP 735
                 D  D   + S                           +D + G W    CG+  
Sbjct: 602 IQWTGLDAPDPVDAQSFVTHYTPEALKILQEEGPKYQYGKGCLSDGILGSWMTLVCGMPE 661

Query: 736 IADEAKVKKALTKIYDFNVLK-------------VKGGMCGAMNGMQPD-GRIDMSGLQA 781
           + D  KVK  L  ++ +N  K               G   G +    P  G++ +  + +
Sbjct: 662 VVDRLKVKSHLISVHKYNFKKSLSNHVNPQRSTFALGEDGGLLLCTWPKGGKLKLPFVYS 721

Query: 782 REIWPGVTYGLAASMIQEEMVDMAFQTAAGVYEVAWSEDGLGYSFQTPESWNNNDEYRSL 841
            E+W G+ Y +AA ++ E  V+   +       V    D      + P      +EY   
Sbjct: 722 NEVWTGIEYQVAAHLMFEGEVEKGLEI------VRTCRDRYNGRVRNP-----FNEYECG 770

Query: 842 C-YMRPLTIWAMQWALTKPKLSRQE----IKHEISDRDSSYLEQHAAFSKVASLLKLPKE 896
             Y R +  +AM   LT  +    +    I   I D  +S+L     F  V      P  
Sbjct: 771 AWYARAMASYAMLEGLTGIRYDAVDKILYIDSRIGDDFTSFLSTETGFGNVGLKEGKPFI 830

Query: 897 EASKGFLKV 905
           +   G + V
Sbjct: 831 DVKYGVIDV 839


>gi|317480931|ref|ZP_07940011.1| hypothetical protein HMPREF1007_03130 [Bacteroides sp. 4_1_36]
 gi|316902824|gb|EFV24698.1| hypothetical protein HMPREF1007_03130 [Bacteroides sp. 4_1_36]
          Length = 851

 Score = 64.7 bits (156), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 83/368 (22%), Positives = 137/368 (37%), Gaps = 75/368 (20%)

Query: 594 VLQVYRDFVATGDKNFARAVWPSVYIAMAY-MEQFDKDGDGMIENEGFPDQTYDAWSANG 652
           +++VYR++   GD N+ + ++P+V  +M Y +  +D    G +E         + W ANG
Sbjct: 491 IMKVYREWRIMGDINWLKMMYPAVKKSMDYCIRTWDPREVGAVEEPHHNTYDIEFWGANG 550

Query: 653 VSAYCGGLWVAALQAASALANDVGDHASASYFWVRYQ----KAKAVYDS-LWNGSYF--- 704
           +       +V ALQA   +   VGD  S      RY+    K+KA  +S L++G YF   
Sbjct: 551 MIT---SFYVGALQAICLMGKTVGDDIS------RYEELMNKSKAYLESKLFDGEYFIQN 601

Query: 705 ----NYDNSDGSSSTSI-------------------------QADQLAGQWYARACGLLP 735
                 D  D   + S                           +D + G W    CG+  
Sbjct: 602 IQWTGLDAPDPVDAQSFVTHYTPEALKILQEEGPKYQYGKGCLSDGILGSWMTLVCGMPE 661

Query: 736 IADEAKVKKALTKIYDFNVLK-------------VKGGMCGAMNGMQPD-GRIDMSGLQA 781
           + D  KVK  L  ++ +N  K               G   G +    P  G++ +  + +
Sbjct: 662 VVDRLKVKSHLISVHKYNFKKSLSNHVNPQRSTFALGEDGGLLLCTWPKGGKLKLPFVYS 721

Query: 782 REIWPGVTYGLAASMIQEEMVDMAFQTAAGVYEVAWSEDGLGYSFQTPESWNNNDEYRSL 841
            E+W G+ Y +AA ++ E  V+   +       V    D      + P     N+     
Sbjct: 722 NEVWTGIEYQVAAHLMFEGEVEKGLEI------VRTCRDRYNGRVRNP----FNEYECGA 771

Query: 842 CYMRPLTIWAMQWALTKPKLSRQE----IKHEISDRDSSYLEQHAAFSKVASLLKLPKEE 897
            Y R +  +AM   LT  +    +    I   I D  +S+L     F  V      P  +
Sbjct: 772 WYARAMASYAMLEGLTGIRYDAVDKILYIDSRIGDDFTSFLSTETGFGNVGLKEGKPFID 831

Query: 898 ASKGFLKV 905
              G + V
Sbjct: 832 VKYGVIDV 839


>gi|332981217|ref|YP_004462658.1| hypothetical protein Mahau_0633 [Mahella australiensis 50-1 BON]
 gi|332698895|gb|AEE95836.1| protein of unknown function DUF608 [Mahella australiensis 50-1 BON]
          Length = 880

 Score = 64.3 bits (155), Expect = 3e-07,   Method: Compositional matrix adjust.
 Identities = 172/815 (21%), Positives = 305/815 (37%), Gaps = 180/815 (22%)

Query: 89  IPLGGIGAGSIGRSYRGEFQRFKLFHGICDDAPVLANQFSVFVSRPNGEKFSSVLC---- 144
            PLGGIG G IG +  G    +++ +     +    + F+V V + N    + +L     
Sbjct: 18  FPLGGIGTGCIGLAGNGRLIDWEIQNRPNKGSYNGYSHFAVKVEKENEVIDTRILNGDLH 77

Query: 145 -PRSPGV-----------PKKNTDSGIESW-DWNLKGENCTYHALFPRAWTVYDGEPDPE 191
            P S              P +   SG+  + D   KGE       FP A   +  +  P 
Sbjct: 78  PPYSGNYAASHFNSFGFGPPRAYLSGVPHFRDVEFKGE-------FPIAELSFIDKTFPG 130

Query: 192 LRIVCRQISPFIPHNYKESSFPASVFTFTLSNSGQTSADVTLLFTWANSVAGDSGLSGHH 251
             +     +PFIP + K+SS PA+ F  +++N+   +   T++    N   G        
Sbjct: 131 -NVKMTAFNPFIPLHDKDSSLPAAFFEISITNTTNETLHYTVVSCMGNVNGG-------- 181

Query: 252 FNSKTMTKDGVHGLTLHHRTANGRPP--VTFA--VAAEETADVHVSECPCFLLSGNSKGI 307
             ++   K    G+ +   T  G  P  V +     A +  DV   E          +G 
Sbjct: 182 -YNEYAKKGPAQGIKM---TCIGLTPDDVRYGDLTVATDMDDVSYQEYW-------YRGS 230

Query: 308 TAKDM---WNEIKKHGSF---DHLDNDKTSPSEPGSSIG-AAIAASLTIPSGSTRSVTFS 360
              D+   W+++ K G F    + D   ++ +  G+++  + +A   ++ +G T++V F 
Sbjct: 231 WFDDLEIYWSDLNKLGKFINRSYKDKGYSNGTGSGNAVDMSTLATHFSLDAGQTKNVRFI 290

Query: 361 LAWDCPEVKFFEKVYHRRYTKFYGTLGDSAARIAHDAILEHAKWECEIEAWQRPILEDKR 420
           L W  P    +                +S          +  +W+     +   + ED  
Sbjct: 291 LTWYYPNCANYWSQ-------------NSQQSCCGCQTSQKGQWK----NYYASLFEDSF 333

Query: 421 FPEWYPITLFNELYYLNAGGTIWTDGSPPMQSLATIRERKFSLDTPSKIHTPASSD-TAL 479
               Y +  ++ LY                +     +   FS   PS I    S++ + L
Sbjct: 334 ACAQYCLKNWDRLY----------------EQTLCFKNALFSSTLPSAIIDAVSANLSIL 377

Query: 480 GTRLLENGEENIGQFLYLEGA--EYVMYNTYDVHFYS-SFALVMLFPKLELSIQRDFAAA 536
            T      E+  G F   EG   +         H ++ ++AL  LFPKLE S+ R+    
Sbjct: 378 KTPTCLRLED--GSFYGFEGCHCDSGCCEGSCTHVWNYAYALPFLFPKLERSM-RELDYK 434

Query: 537 VMMHDPGTMKIMSDGKWVARKCLGAVPHDIGLDDPWFEINSYNLFNSSRWKDLNSKF--V 594
             M + G M               +    + L  P           SS     + +F  V
Sbjct: 435 YNMREDGGM---------------SFRLQLPLGSP----------RSSFRPCADGQFGGV 469

Query: 595 LQVYRDFVATGDKNFARAVWPSVYIAMAYM------EQFDKDGDGMIENEGFPDQTYDAW 648
           ++ YR++  +GD  + +++WP V  ++ Y       +++D +  G++   G    T D  
Sbjct: 470 IKTYREWKISGDDEWLKSLWPRVKKSIEYAWVDTNEDRWDPEQKGVLT--GRQHHTLDM- 526

Query: 649 SANGVSAYCGGLWVAALQAASALANDVGDHASASYFWVRYQKAKAVYDS-LWNGSYFNY- 706
              G +++  G ++AAL+A + +A+ VGD  +A  +   +++ KA  DS L+NG Y+ + 
Sbjct: 527 ELFGPNSWLTGFYLAALKAGAEMASYVGDEQAAQRYTDIFRRGKAWADSNLFNGEYYYHL 586

Query: 707 -DNSDGS--------------SSTSIQA-------------------DQLAGQWYARACG 732
            D  D S                ++I+A                   DQ+  QW+A   G
Sbjct: 587 IDLQDKSILERFDTDKTGTLIGDSTIKAYWDEERGQIKYQVGEGCIIDQVLAQWHANLIG 646

Query: 733 LLPIADEAKVKKALTKIYDFNVLKVK-----------GGMCGAMNGMQPDGRID--MSGL 779
           L  I D    +KA+  +Y  N  K++               G +    P GR    +S  
Sbjct: 647 LGEIFDPGNTRKAVYAVYKNNFKKMREVFNPCRVFSLNDESGMIICSYPAGRKKPAISVP 706

Query: 780 QAREIWPGVTYGLAASMIQEEMVDMAFQTAAGVYE 814
            A+E   G  Y  A  +IQ+  +    +    + E
Sbjct: 707 YAQETMNGFEYQAAEHLIQKGFIREGEEVVRAIRE 741


>gi|283835266|ref|ZP_06355007.1| conserved hypothetical protein [Citrobacter youngae ATCC 29220]
 gi|291068424|gb|EFE06533.1| conserved hypothetical protein [Citrobacter youngae ATCC 29220]
          Length = 896

 Score = 63.9 bits (154), Expect = 4e-07,   Method: Compositional matrix adjust.
 Identities = 66/268 (24%), Positives = 110/268 (41%), Gaps = 65/268 (24%)

Query: 594 VLQVYRDFVATGDKNFARAVWPSVYIAMAYM------EQFDKDGDGMIENEGFPDQTYDA 647
           VL +YRD++  GD+ +    W +V   + +       +Q+D +  G++   G    T D 
Sbjct: 470 VLAIYRDWLLCGDRAWLEEKWSAVRDMLRFAWNPDNPDQWDPEQRGVLT--GRQHHTLDM 527

Query: 648 WSANGVSAYCGGLWVAALQAASALANDVGDHASASYFW--VRYQKAKAVYDSLWNGSYF- 704
               G + +  GL++AAL+A  A+AN +G H   +  W  +  +    + + L+NG YF 
Sbjct: 528 -ELFGPNPWLTGLYIAALEAGCAIANHLG-HNDDAMLWQDIARRGRVTLRERLFNGEYFQ 585

Query: 705 ---NYDNSDGSS---------------------------------STSIQADQLAGQWYA 728
              +  N D  +                                  +    DQL GQW+A
Sbjct: 586 QVIDLANRDTLAPWRTSAESADYGLSGDVWQQYWCEENHEVKYQFGSGCHIDQLMGQWFA 645

Query: 729 RACGLLPIADEAKVKKALTKIYDFNVLKVKGGM--CG--AMNGMQ-------PDGRIDMS 777
             CGL P+ D  + ++AL  ++ +N +  +     C    +N  Q       P G  DM 
Sbjct: 646 GLCGLPPVFDPGQTQRALRALWQYNFVSARHHTNPCRLYGLNDEQGVSICSWPTG--DMP 703

Query: 778 GLQ---AREIWPGVTYGLAASMIQEEMV 802
            +    A+E+  G  Y LAA M+   M 
Sbjct: 704 AIPVPYAQEMMTGFEYQLAALMVASGMA 731



 Score = 50.8 bits (120), Expect = 0.004,   Method: Compositional matrix adjust.
 Identities = 73/318 (22%), Positives = 119/318 (37%), Gaps = 48/318 (15%)

Query: 89  IPLGGIGAGSIGRSYRGEFQRFKLF--------HGICDDAPVLANQFSVFVSRPNGEKFS 140
            PLGGIG G I  +  G     ++F        HG    A       ++  +R     + 
Sbjct: 14  FPLGGIGTGCISLAGNGHLTDVEIFNHPDKGYHHGYTHFAVKAERNGALVDARILQGDWQ 73

Query: 141 SVLCPR---SPGVPKKNTDSGIESWDWNLKGENCTYHALFPR-AWTVYDGE--------P 188
              C +      + K++T  G+        G +    A  P  A + +DGE         
Sbjct: 74  GNACGQWQIHDRLRKRHTGYGV--------GPDAETLAGMPHFAHSEFDGEFPMAQMSFS 125

Query: 189 DPELRIVCRQIS--PFIPHNYKESSFPASVFTFTLSNSGQTSADVTLLFTWANSVAGDSG 246
           DPE     R ++  PFIP +   SS P + F   + N+   + D TL  +  N +A DS 
Sbjct: 126 DPEFPGNIRMLAFNPFIPCDEDNSSLPLAWFELEVENTTDAALDYTLCLSLCNGLAPDST 185

Query: 247 LSGHHFNSKTMTKDGVHGLTLHHRTANGRPPVTFAVAAEETADVHVSECPCFLLSG---N 303
            + H        ++  H +TL  R      P    +A   T D        +   G   +
Sbjct: 186 HNRHEI------REDRHMMTLTQRHYPADDPRYAELAT--TTDAQDVSWQTYWYRGDWFD 237

Query: 304 SKGITAKDMWNEIKKHGSFDHLDNDKTSPSEPGSSIGAAIAASLTIPSGSTRSVTFSLAW 363
           S  +  +D   + +       L   + + S+PG +  A++AA L + +G    V F   W
Sbjct: 238 SLTMFWRDFCGQPR-------LPEREYTQSQPGFADHASLAAHLHLAAGERGRVRFWTGW 290

Query: 364 DCPEVKFFEKVYHRRYTK 381
            CP    +   +  R T+
Sbjct: 291 YCPVTYNYWTDWRDRATQ 308


>gi|402589209|gb|EJW83141.1| hypothetical protein WUBG_05945 [Wuchereria bancrofti]
          Length = 90

 Score = 63.9 bits (154), Expect = 4e-07,   Method: Composition-based stats.
 Identities = 35/76 (46%), Positives = 45/76 (59%), Gaps = 5/76 (6%)

Query: 719 ADQLAGQW--YARACGLLPIADEAKVKKALTKIYDFNVLKVKGGMCGAMNGMQPDGRIDM 776
           ADQ    W   +RA  LLP++    +K AL +IY  N+L   GG  GA+NGM+ DG +  
Sbjct: 8   ADQTYDAWKLLSRARALLPLS---HIKSALNQIYSSNILNFAGGRLGAVNGMRKDGTVGR 64

Query: 777 SGLQAREIWPGVTYGL 792
             LQA E+W GVTY L
Sbjct: 65  RHLQADEMWVGVTYAL 80


>gi|374373788|ref|ZP_09631448.1| hypothetical protein NiasoDRAFT_2604 [Niabella soli DSM 19437]
 gi|373234761|gb|EHP54554.1| hypothetical protein NiasoDRAFT_2604 [Niabella soli DSM 19437]
          Length = 897

 Score = 62.8 bits (151), Expect = 9e-07,   Method: Compositional matrix adjust.
 Identities = 66/252 (26%), Positives = 107/252 (42%), Gaps = 36/252 (14%)

Query: 594 VLQVYRDFVATGDKNFARAVWPSVYIAMAYMEQFDKDGDGMIENEGFP-DQTYDA-WSAN 651
           VL  YR+   + D  F +  W  +  A  ++   D++ DGM   E  P + T DA W  +
Sbjct: 504 VLGCYREHQMSADDGFLKRNWDKIKRATQFIINQDRNKDGM---EDTPLENTLDAIW--D 558

Query: 652 GVSAYCGGLWVAALQAASALANDVGDHASASYFWVRYQK-AKAVYDSLWNGSYFNY--DN 708
           G  A+  GL +AA++A  A+A ++ D A A+      QK ++ + D L+NG YF +  D 
Sbjct: 559 GEIAWLVGLCIAAVKAGQAMAEEMNDTAFAAVCKDYVQKGSRHMSDQLFNGEYFIHRPDP 618

Query: 709 SDG----SSSTSIQADQLAGQWYARACGLLPIADEAKVKKALTKIYDFNVLKVKGGMCGA 764
             G     S  +   DQ+ GQ +A   GL  + D+ +   AL  ++ +N +K  G     
Sbjct: 619 VKGRKGLGSYNTCHIDQVLGQSWAFQVGLGRVIDQQQTLSALKALWKYNFIKDMGIYAET 678

Query: 765 MNGMQPDGRIDMSGLQA----------------------REIWPGVTYGLAASMIQEEMV 802
               +P      SGL                         E   G  + +A+ M+ E M 
Sbjct: 679 HTDGRPYALYGESGLVMNTNPKNEPKPFGDGAAWQIGYFNECMTGFEHQVASHMMAEGMT 738

Query: 803 DMAFQTAAGVYE 814
           D A      +++
Sbjct: 739 DEALTITKAIHD 750



 Score = 46.2 bits (108), Expect = 0.088,   Method: Compositional matrix adjust.
 Identities = 47/164 (28%), Positives = 67/164 (40%), Gaps = 23/164 (14%)

Query: 88  GIPLGGIGAGSIGRSYRGEFQRFKLFH----GICD-DAP---VLANQFSVFVSRPNGEKF 139
           G+P+GGI AG +  S  G    + +F+    GI   D P    +  +     SR  G   
Sbjct: 69  GMPVGGINAGGMYLSGDGRLWLWDVFNLNQEGINPVDIPWKTSVQGELRKVRSRDGGSYI 128

Query: 140 S----SVLCPRSPGVPKKNTDSGIESW------DWNLKGENCTYHALFPRAWTVYDGEPD 189
           +      L P   G   K    G E        DW    +   + A +P A   Y  +  
Sbjct: 129 APPKAKELRPLDQGFALKLEYEGKEQTRQLREEDW----DEVRFEATYPVATITYSSKDI 184

Query: 190 PELRIVCRQISPFIPHNYKESSFPASVFTFTLSNSGQTSADVTL 233
           P ++ V +  SPFIP +   S  P SVFT T+ N+G+     TL
Sbjct: 185 P-VQAVLQAYSPFIPLDTDRSGLPVSVFTITIKNAGKHPVKATL 227


>gi|89068658|ref|ZP_01156050.1| hypothetical protein OG2516_16434 [Oceanicola granulosus HTCC2516]
 gi|89045845|gb|EAR51906.1| hypothetical protein OG2516_16434 [Oceanicola granulosus HTCC2516]
          Length = 876

 Score = 62.4 bits (150), Expect = 1e-06,   Method: Compositional matrix adjust.
 Identities = 74/308 (24%), Positives = 122/308 (39%), Gaps = 50/308 (16%)

Query: 89  IPLGGIGAGSIGRSYRGEFQRFKLFHGICDDAPVLANQFSVFVSRP-------------- 134
            PLGGIGAG++G    G  Q +++F+     A    N FS F  R               
Sbjct: 19  FPLGGIGAGTVGIGADGRLQDWEIFN---RPAKGSVNGFSHFAVRAERAGEVVDTRVLHG 75

Query: 135 --NGEKFSSVLCPRSPGV---PKKNTDSGIESWDWNLKGENCTYHALFPRAWTVYDGEPD 189
              G +  S L P   G    P++   +G  S+       + +    +P A   ++ +P 
Sbjct: 76  PFTGNRTGSFLAPEFNGYGFGPRRQQMTGFPSF------RSSSLAGPYPAADLTFE-DPA 128

Query: 190 PELRIVCRQISPFIPHNYKESSFPASVFTFTLSNSGQTSADVTLLFTWANSVAGDSGLSG 249
               +  R +SPF+P   ++SS P ++F  T +N G T  D TL    A       G + 
Sbjct: 129 FPGAVDMRALSPFVPLEDRDSSMPVAMFEVTFNNPGDTPLDYTLFGCLAFDFVDLEGNA- 187

Query: 250 HHFNSKTMTKDGVHGLTLHHRTANGRPPVTFAVAAEETADVHVSECPCFLLSGNSKGITA 309
              +  T  +  + G T           ++ A   ++T+  H       L  GN    T 
Sbjct: 188 -EVDIATGDRTILTGRTTLGEDHPDHRQLSLATDHDDTSHQH------HLYRGNWFD-TM 239

Query: 310 KDMWNEIKKHGSFDHLDNDKTSPSEPGSSIG--------AAIAASLTIPSGSTRSVTFSL 361
           +  W +I + G       D+  PS   S           + +AA +T+P+G +R+V  +L
Sbjct: 240 EVYWADIARGGRL----RDRKYPSAKSSVANNQADAIEHSLLAAHVTVPAGESRTVRMAL 295

Query: 362 AWDCPEVK 369
           +W  P  +
Sbjct: 296 SWYVPNFR 303



 Score = 55.5 bits (132), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 51/212 (24%), Positives = 94/212 (44%), Gaps = 46/212 (21%)

Query: 594 VLQVYRDFVATGDKNFARAVWPSVYIAMAYM------EQFDKDGDGMIENEGFPDQTYDA 647
           V++ YRD+  +GD  + R++WPS+  A+ +       +++D D  G++   G    T D 
Sbjct: 474 VMKAYRDWKLSGDDAWLRSLWPSIKSAIEFAWHPDNPDRWDPDQTGVL--SGRQHHTLDM 531

Query: 648 WSANGVSAYCGGLWVAALQAASALANDVGDHASASYFWVRYQKAKA-VYDSLWNGSYF-- 704
               G +++  G ++ AL A + +A  VG+  +A+ +   ++K KA V + L+NG +F  
Sbjct: 532 -ELFGPNSWLAGFYLGALSAGAEMAEFVGEADTAAQYRALFEKGKAWVGEELFNGEWFAQ 590

Query: 705 NYDNSD----------GSSSTSIQA------------------------DQLAGQWYARA 730
             D +D          G  ++ I                          DQ+  +W+AR 
Sbjct: 591 KIDIADRDTLAPFQEPGGDASHISGSIFEQYWSEEHGELKYQIGAGCAIDQVLAEWHARL 650

Query: 731 CGLLPIADEAKVKKALTKIYDFNVLKVKGGMC 762
            GL  + D      A+  IY+ N ++  G + 
Sbjct: 651 YGLDHLYDPNDFATAVRAIYEHNFVERLGDLA 682


>gi|224025536|ref|ZP_03643902.1| hypothetical protein BACCOPRO_02276 [Bacteroides coprophilus DSM
           18228]
 gi|224018772|gb|EEF76770.1| hypothetical protein BACCOPRO_02276 [Bacteroides coprophilus DSM
           18228]
          Length = 846

 Score = 62.0 bits (149), Expect = 1e-06,   Method: Compositional matrix adjust.
 Identities = 59/253 (23%), Positives = 102/253 (40%), Gaps = 37/253 (14%)

Query: 594 VLQVYRDFVATGDKNFARAVWPSVYIAMAYM---EQFDKDGDGMIENEGFPDQTYDAWSA 650
           +++ YRD+  +GD  F +  W  V   ++Y      +D + DG++E  G    T D  + 
Sbjct: 484 IMKFYRDWQLSGDSRFLKENWAQVKKVLSYAWTEHGWDDNQDGVME--GSQHNTMDV-NY 540

Query: 651 NGVSAYCGGLWVAALQAASALANDVGDHASASYFWVRYQKAKAVYDS-LWNGSYFNYDNS 709
            G +   G  ++ AL+AA  +A+ + D A        ++      D+ L+NG Y+ +  +
Sbjct: 541 FGPNPQMGFWYLGALKAAEKMAHAMKDRAFEKKCRALFENGSEWLDNNLFNGEYYEHKIT 600

Query: 710 DGSS------------------STSIQADQLAGQWYARACGLLPIADEAKVKKALTKIYD 751
           D  +                        DQL GQ+ A  CGL  +A +  ++  L  I  
Sbjct: 601 DPETFEFLDMNDPDTKIPDFQLGKGCLVDQLVGQYMAHICGLGYLAKKEHIRTTLKSIMR 660

Query: 752 FNVLK------------VKGGMCGAMNGMQPDGRIDMSGLQAREIWPGVTYGLAASMIQE 799
           +N  +            V G   G +    P GR+++      E+  G  Y  A  MI E
Sbjct: 661 YNFKEDFSRHFNNMRSYVLGDEAGLLMASWPKGRLEVPFPYFAEVMTGFEYCAATGMIYE 720

Query: 800 EMVDMAFQTAAGV 812
            M + A +    +
Sbjct: 721 GMEEDALKCIRAI 733



 Score = 55.5 bits (132), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 69/334 (20%), Positives = 129/334 (38%), Gaps = 51/334 (15%)

Query: 65  QEEAKGKVPVFNVFRKHHITSDQGIPLGGIGAGSIGRSYRGEFQRFKLFH---------G 115
           QE   G  PV   + + H+     +PLGGIG G++    RGE + +++ +          
Sbjct: 18  QEWKAGDWPVLRHYDQQHLYQ-IALPLGGIGTGTVSLGGRGELRDWEIMNVPGKKYSTVT 76

Query: 116 ICDDAPVLANQFSVFVSRPNGEKFSSVLCPRSPGVPKKNTDSGIESWDWNLKG----ENC 171
             ++AP     F+++     GE  +++L     G          E    N  G    +N 
Sbjct: 77  TGNNAPF----FAIYTKPEKGEAQTTLLA----GALYDQEYLHYEGRPVNHHGLPRFKNS 128

Query: 172 TYHALFPRAWTVYDGEPDPELRIVCRQISPFIPHNYKESSFPASVFTFTLSNSGQTSADV 231
           ++ A +P    V+  +    + +  +  +P +P + + S  P +V  + ++N+ QT  DV
Sbjct: 129 SFDAAYPFG-QVHLSDKGIPVEVTVKGFNPLVPGDAEASGIPIAVLCYEVTNTSQTPVDV 187

Query: 232 TLLFTWANSVAGDSGLSGHHFNSK---TMTKDGVHGLTLHHRTANGRPPVTFAVAAEETA 288
           ++  +  N +  D    G  F +         G       +++A G   + F     E  
Sbjct: 188 SICGSIRNFIGKD----GSKFVTDWKGDYIPVGCKDNRNIYKSAQGLQGIYFCSEGVEKT 243

Query: 289 DVHVSECPCFLLSGNSKGITAKD-------------MWNEIKKHGSFDHLDNDKTSPSEP 335
           D         L++   +G+T +               W++    G    +   K   ++P
Sbjct: 244 DPAYGNMT--LVTNAPEGVTYRTSSKADDWSNGILGFWDDFSADGELTEM--SKQYDADP 299

Query: 336 GSSIGAAIAASLTIPSGSTRSVTFSLAWDCPEVK 369
                A++A    +  G TRS TF L W+ P  K
Sbjct: 300 ----MASLAVKKKLAPGETRSFTFFLTWNFPNRK 329


>gi|340616465|ref|YP_004734918.1| hypothetical protein zobellia_454 [Zobellia galactanivorans]
 gi|339731262|emb|CAZ94527.1| Conserved hypothetical protein [Zobellia galactanivorans]
          Length = 923

 Score = 61.6 bits (148), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 154/715 (21%), Positives = 259/715 (36%), Gaps = 196/715 (27%)

Query: 177 FPRAWTVYDGEPDP-ELRIVCRQISPFIPHNYKESSFPASVFTFTLSNSGQTSADVTLL- 234
           +P A   Y  + D   + I     SPF+P + + S+ P +V  + ++N+ +   DV+++ 
Sbjct: 165 YPIAQVNYRKKSDRLPIEISSEVFSPFVPLSVRSSANPVTVVKYKITNTSKDKIDVSIIG 224

Query: 235 ------------FTWANSVAGDSGLSGHHFNSKTMTKDGVHGLTLHHRTANGRPPVTFAV 282
                           N+     G+SG  F  + M+ D V  L  H +         F++
Sbjct: 225 ELKNIAFPQPSKVKQVNTALRSKGVSGISFEMQPMSNDSV--LAKHPQLGG------FSL 276

Query: 283 AA-EETADVHVSECPCFLLSGNSKGITAKDMWNEIKKHGSFDHLDNDKTSPSEPGSSIGA 341
           +   + AD+ V+                              +  + K   S     I  
Sbjct: 277 SVLNQKADLLVNA-----------------------------NEKDSKKKRSSSEDPISG 307

Query: 342 AIAASLTIPSGSTRSVTFSLAWDCPEVKFFEKVYHRRYTKFYGTLGDSAARIAHDAILEH 401
           ++ +++++ SG T+ +TF ++W  P   ++E    +RY                DA+ E 
Sbjct: 308 SVGSTVSLRSGQTKEITFLVSWYFP--NYYEN--GKRY---------------KDAMAEA 348

Query: 402 AKWECEI-EAWQRPILEDKRFPEWYPITLFNELYYLNAGGTIWTDGSPPMQSLATIRERK 460
             W   I   W     +  +    Y  + FN LY   A   ++ D         T  +  
Sbjct: 349 PGWVGHIYNNWYANSFDVTK----YVASNFNTLY---ADTKLFRD---------TYNDNT 392

Query: 461 FSLDTPSKIHTPASSDTALGTRLLENGE----ENIGQFLYLEGAEYVMYNTYDVHFYSSF 516
                 ++I  P S+  A    + +NG     E IG   + +G    +YN     F +  
Sbjct: 393 LPYWLANRITMPVSTLAAGNIAIWKNGRMYGYEGIG---FCQGTCGHVYN-----FVA-- 442

Query: 517 ALVMLFPKLELSIQ--RDFAAAVMM--HDPGTMKIMSDGKWVARKCLGAVPHDIGLDDPW 572
           A+  LFP+LE S++  +DF +      + P + +I   G  V +     VPH        
Sbjct: 443 AISKLFPELERSVRLLQDFNSTEKYSGYSP-SGRINFRGYGVNKP---EVPH-------- 490

Query: 573 FEINSYNLFNSSRWKDLNSKFVLQVYRDFVATGDKNFARAVWPSVYIAMAYMEQFDKDG- 631
                      S   D  S +VL+ YR+ + + +  F   VW  V +A+ Y     KDG 
Sbjct: 491 -----------SYASDAQSGYVLKAYREHLNSPNNEFLDTVWEKVKMAIGY--HIFKDGA 537

Query: 632 ------DGMIENEGFPDQTYDAWSANGVSAYCGGLWVAALQAASALANDVGDHASASYFW 685
                 +G++E E    QT+      G + Y   L++AAL+AA  +A   G+   AS + 
Sbjct: 538 EIGLEPNGVLEGE----QTFWDPMWYGPNPYNNTLYLAALRAAEEMAKVKGEPELASRYH 593

Query: 686 VRYQKA-KAVYDSLWNGSY-----------------------------------FNYDNS 709
             ++   K + + +WNG +                                   FN   S
Sbjct: 594 KLFESGQKFMNEEMWNGEFYIHLYPVGLNGNVGIKNGFKLPSEIDDNAQSYIKAFNEGRS 653

Query: 710 DGSSSTSIQADQLAGQWYARACGLLPIADEAKVKKA------------LTKIYDFN---- 753
                T+  A QL GQ +A   GL  I    K + A            +  IYDF     
Sbjct: 654 GYFPGTACDAQQLFGQNWANQLGLGYILSPEKSEIAAKSIFKYNWTPDIATIYDFEKPNA 713

Query: 754 -VLKVKGGMCGAMNGMQPDGRIDMSGLQAR-EIWPGVTYGLAASMIQEEMVDMAF 806
            VL  KG       G   +          +  +W G+ Y  A  MI E +++ A 
Sbjct: 714 RVLAAKGEAAMINGGWTKEKPKSFENTHDKTNVWTGLEYEAACDMINEGLLEEAI 768


>gi|312086024|ref|XP_003144914.1| hypothetical protein LOAG_09338 [Loa loa]
 gi|307759923|gb|EFO19157.1| hypothetical protein LOAG_09338, partial [Loa loa]
          Length = 84

 Score = 60.8 bits (146), Expect = 3e-06,   Method: Composition-based stats.
 Identities = 29/76 (38%), Positives = 42/76 (55%)

Query: 632 DGMIENEGFPDQTYDAWSANGVSAYCGGLWVAALQAASALANDVGDHASASYFWVRYQKA 691
           DGMIEN G  DQTYD    NGVS YCG LW+  L A  + A+++ D  S + +     +A
Sbjct: 9   DGMIENSGTADQTYDTLKMNGVSTYCGSLWLTLLHAEVSFASEIDDIRSGNLYTGFLLRA 68

Query: 692 KAVYDSLWNGSYFNYD 707
             +++S +   Y   +
Sbjct: 69  GDIFESEYYNIYLQME 84


>gi|410096859|ref|ZP_11291844.1| hypothetical protein HMPREF1076_01022 [Parabacteroides goldsteinii
           CL02T12C30]
 gi|409225476|gb|EKN18395.1| hypothetical protein HMPREF1076_01022 [Parabacteroides goldsteinii
           CL02T12C30]
          Length = 847

 Score = 60.1 bits (144), Expect = 5e-06,   Method: Compositional matrix adjust.
 Identities = 68/320 (21%), Positives = 130/320 (40%), Gaps = 49/320 (15%)

Query: 594 VLQVYRDFVATGDKNFARAVWPSVYIAMAYM---EQFDKDGDGMIENEGFPDQTYDAWSA 650
           ++++YR++  +GD +F +  W  +   ++Y    + +D + DG++E  G    T D  + 
Sbjct: 486 IMKIYREWQLSGDNDFLKNNWGQIKKVLSYAWTEKGWDGNQDGVME--GSQHNTMDV-NY 542

Query: 651 NGVSAYCGGLWVAALQAASALANDVGDHASASYFWVRYQKAKAVYD-SLWNGSYFNYDNS 709
            G +   G  ++ AL+AA  +A  + D   A      +++     D +L+NG Y+ +  +
Sbjct: 543 FGPNPQMGFWYMGALKAAEKMAIAMKDKGFAQKCRTLFEQGSNWMDKNLFNGEYYEHKIT 602

Query: 710 DGSSSTSIQ------------------ADQLAGQWYARACGLLPIADEAKVKKALTKIYD 751
           D  +   +                    DQL GQ+ +  CGL  + ++  ++  +  I  
Sbjct: 603 DPETFEYLDMNNPNVKIPSFQLGPGCLVDQLVGQYMSHLCGLGYLGNKDHIQTTMNSIMK 662

Query: 752 FNVLK------------VKGGMCGAMNGMQPDGRIDMSGLQAREIWPGVTYGLAASMIQE 799
           +N +             V G   G +    P GR+++      E+  G  Y  A  MI E
Sbjct: 663 YNYVSDFSRHFNNMRSYVMGNESGLLMASWPKGRLEVPFPYFAEVMTGFEYCAAVGMIYE 722

Query: 800 EMVDMAFQTAAGVYEVAWSEDGLGYS-FQTPESWNNNDEYRSLCYMRPLTIWAMQWALTK 858
            M   A      + +     DG   + F  PE  ++        Y R +  W+   AL+ 
Sbjct: 723 GMEKEALTCIRSIRD---RHDGAKRNPFSEPECGHH--------YARSMASWSAIIALSD 771

Query: 859 PKLSRQEIKHEISDRDSSYL 878
            + S  +   +I+ R  +Y 
Sbjct: 772 FQYSGIDKSMKITARPGNYF 791



 Score = 54.3 bits (129), Expect = 3e-04,   Method: Compositional matrix adjust.
 Identities = 68/346 (19%), Positives = 135/346 (39%), Gaps = 44/346 (12%)

Query: 63  SKQEEAKGKVPVFNVFRKHHITSDQGIPLGGIGAGSIGRSYRGEFQRFKLFHGICDDAPV 122
           + QE    + PV   + + H+     +PLGGIG G++    RGE + +++ +       V
Sbjct: 18  ASQEWKPQEWPVLKNYDQEHLYQ-IALPLGGIGTGTVSLGGRGELRDWEIMN-------V 69

Query: 123 LANQFSVFVSRPNGEKFSSVLCPRSPGVPKKNTDSGIESWDW-NLKGENCTYHALFPRAW 181
            A ++S   +  N   F+    P++           +   ++ + +G    +H L   A 
Sbjct: 70  PAKKYSTVTTGNNAPFFAIYAKPQNQEATTTLLAGPLYPQEYLHYEGRPVNHHGLPRFAQ 129

Query: 182 TVYDG----------EPDPELRIVCRQISPFIPHNYKESSFPASVFTFTLSNSGQTSADV 231
             +D           + D  +++  +  +P IP + + S  P +V ++ ++N      ++
Sbjct: 130 ASFDAAYPFGQVHLSDKDLPVKVTIKGFNPLIPGDAEASGLPVAVLSYEVTNITSQPMEI 189

Query: 232 TLLFTWANSVA-----------GDSGLSGHHFN-SKTMTKDGVHGLTLHHRTANGRPPVT 279
            +  +  N +            GD   +G   N +K +   G+ G+ L+    +   P  
Sbjct: 190 AICGSMRNFIGKDGSKFRTDWKGDYIPTGVKDNKNKYVENKGIKGIYLYSDGVDKNDPAW 249

Query: 280 FAVAAEETADVHVSECPCFLLSGNSKGITAKDMWNEIKKHGSFDHLDNDKTSPSEPGSSI 339
             VA    A   VS          + GI   + W++    G     + +K    +P    
Sbjct: 250 GTVALTTQATSGVSYRTSSKSDNWNNGIL--NFWDDFSADGML--TERNKQEDEDP---- 301

Query: 340 GAAIAASLTIPSGSTRSVTFSLAWDCPEVKFFEKV-----YHRRYT 380
            A+++   T+   ST + TF + W+ P  K +        Y R+YT
Sbjct: 302 MASLSVKKTVKPQSTETFTFYITWNFPNRKAWSSTVVGNYYSRQYT 347


>gi|402583425|gb|EJW77369.1| hypothetical protein WUBG_11725, partial [Wuchereria bancrofti]
          Length = 109

 Score = 59.7 bits (143), Expect = 8e-06,   Method: Composition-based stats.
 Identities = 29/60 (48%), Positives = 38/60 (63%), Gaps = 3/60 (5%)

Query: 733 LLPIADEAKVKKALTKIYDFNVLKVKGGMCGAMNGMQPDGRIDMSGLQAREIWPGVTYGL 792
           LLP++    +K AL +IY  N+L   GG  GA+NGM+ DG +    LQA E+W GVTY L
Sbjct: 1   LLPLS---HIKSALNQIYSSNILNFAGGRLGAVNGMRKDGTVGRRHLQADEMWVGVTYAL 57


>gi|333382634|ref|ZP_08474302.1| hypothetical protein HMPREF9455_02468 [Dysgonomonas gadei ATCC
           BAA-286]
 gi|332828576|gb|EGK01276.1| hypothetical protein HMPREF9455_02468 [Dysgonomonas gadei ATCC
           BAA-286]
          Length = 859

 Score = 58.9 bits (141), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 58/270 (21%), Positives = 111/270 (41%), Gaps = 69/270 (25%)

Query: 594 VLQVYRDFVATGDKNFARAVWPSVYIAMAY-MEQFDKDGDGMIENEGFPDQTYDAWSANG 652
           ++++YR++  +GD N+ + ++P V  +M Y ++ +D    G IE         + W A+G
Sbjct: 495 IMKIYREWRISGDNNWLKEIYPFVKASMDYCIKTWDSREVGAIEEPHHNTYDIEFWGADG 554

Query: 653 VSAYCGGLWVAALQAASALANDVGDHASASYFWVRYQ----KAKAVYDS-LWNGSYF--- 704
           +       +  ALQA   +   +G+  S      RY+    K+K   ++ L++G YF   
Sbjct: 555 MIT---SFYTGALQAMCLMGEYLGEDIS------RYEALMDKSKIYMETKLFDGEYFIQN 605

Query: 705 -NYDNSDGSSSTSIQA----------------------------DQLAGQWYARACGLLP 735
             +   +    T +QA                            D + G W +  CG+  
Sbjct: 606 IQWKGLNADDPTKVQAFNSGYSDEALKILEAEGPKYQYGKGCLSDGILGSWMSLVCGMPE 665

Query: 736 IADEAKVKKALTKIYDFNV----------------LKVKGG--MCGAMNGMQPDGRIDMS 777
           I D+ KV   L  ++ +N+                L  +GG  +C    G    G++ + 
Sbjct: 666 IIDKTKVTSHLNAVHKYNLKHDLREHANPQRPTFALGNEGGLLLCSWPKG----GKLQLP 721

Query: 778 GLQAREIWPGVTYGLAASMIQEEMVDMAFQ 807
            + + E+W G+ Y +AA ++ E  V+   +
Sbjct: 722 FVYSDEVWTGIEYQVAAHLMFEGEVEKGLE 751


>gi|326798039|ref|YP_004315858.1| hypothetical protein [Sphingobacterium sp. 21]
 gi|326548803|gb|ADZ77188.1| hypothetical protein Sph21_0609 [Sphingobacterium sp. 21]
          Length = 877

 Score = 58.2 bits (139), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 54/265 (20%), Positives = 104/265 (39%), Gaps = 59/265 (22%)

Query: 594 VLQVYRDFVATGDKNFARAVWPSVYIAMAY-MEQFDKDGDGMIENEGFPDQTYDAWSANG 652
           +++VYRD+  +GD  + + ++P V  ++ Y ++ +D +G G++E         + W  NG
Sbjct: 517 IMKVYRDWRISGDDEWMKKLYPMVQKSLDYCIQTWDPNGKGILEEPHHNTYDIEFWGPNG 576

Query: 653 VSAYCGGLWVAALQAASALANDVGDHASASYFWVRYQKAKA-VYDSLWNGSYFNY----- 706
           +   C   ++ AL+A   +        +   +   Y   +A + D L+NG YF       
Sbjct: 577 M---CTSFYLGALKALIEMGRY--QRKNMKRYETLYDSGRAYMEDQLYNGEYFIQKIQWT 631

Query: 707 -----DNSDGSS----------------------STSIQADQLAGQWYARACGLLPIADE 739
                D +   S                       T   +D + G W A  CGL    D+
Sbjct: 632 GLKAPDPTKAQSFHTQYTPEAIEILKKEGPKYQYGTGCLSDGVIGAWMAAMCGLENPLDQ 691

Query: 740 AKVKKALTKIYDFNVLKVKGGMCGAMNGMQPD-----------------GRIDMSGLQAR 782
            KV   L  ++ +N   +K  +   +N  +P                  G + +  + + 
Sbjct: 692 EKVNSHLRAVHQYN---LKNDLTDHINPQRPTYAMGHEGGLLLCTWPKGGMLSLPFVYSN 748

Query: 783 EIWPGVTYGLAASMIQEEMVDMAFQ 807
           E+W G+ Y +A+ ++Q    D   +
Sbjct: 749 EVWTGIEYQVASHLMQIGETDKGLE 773



 Score = 45.1 bits (105), Expect = 0.21,   Method: Compositional matrix adjust.
 Identities = 73/329 (22%), Positives = 116/329 (35%), Gaps = 65/329 (19%)

Query: 61  RYSKQEEAKGKVPVFNVFRKHHITSDQGIPLGGIGAGSIGRSYRGEFQRFKLFHGICDDA 120
           +++K+E  +    VFN     H       P+GGIGAG       G      + H      
Sbjct: 46  QFTKEEHDR----VFNGPYVGHHLDKIAFPIGGIGAGMFCLEGSGAISHVSIRH-----K 96

Query: 121 PVLANQFSVFVSRPNGEKFSSVLCPRSPGVPKKNTDSGIESWDWNLKG------------ 168
           P L ++ ++F +     + +S      P VP           DW L G            
Sbjct: 97  PELFHEPTIFGAVAIKGRANSAKVLEGP-VP-----------DWKLFGYKDYGGGSGGTT 144

Query: 169 ------ENCTYHALFPRAWTVYDGEPDPELRIVCRQISPFIPHNYKESSFPASVFTFTLS 222
                 E   + A FP A      +  P L++     SPF+P +   SS P    T+   
Sbjct: 145 YGLPHFEQAEFLARFPFAEIQLRDDKFP-LQVRLEAWSPFVPSDADNSSLPVGALTYRFI 203

Query: 223 NSGQTSADVTLLFTWANSVAGDSGLSGHHFNSKTMTKDGVHGLTLHHRTANGRPPVT--F 280
           N+ Q + +    F   N V    G        +   K   +G  L      G+  +   F
Sbjct: 204 NTSQENLEAVFSFNSRNFVVQPEG--------EGSIKKIQNGFLLSQAKTTGKNNLATDF 255

Query: 281 AVAAEETADVHVSECPCFLLSGNSKGITAKDMWNEIKKHG--SFDHLDNDKTSPSEPGSS 338
           AV A+   +V V    C+   G    +T   +WN IK     + D +D D         +
Sbjct: 256 AVFADTGDEVIVDY--CWFRGGWFDPLTM--VWNTIKDANVKAVDPIDKD---------A 302

Query: 339 IGAAIAASLTIPSGSTRSVTFSLAWDCPE 367
            GA++     + +   + +   +AW  PE
Sbjct: 303 PGASLFIPFQLGANEEKEIRILMAWYVPE 331


>gi|405382777|ref|ZP_11036555.1| putative bile acid beta-glucosidase [Rhizobium sp. CF142]
 gi|397320811|gb|EJJ25241.1| putative bile acid beta-glucosidase [Rhizobium sp. CF142]
          Length = 863

 Score = 58.2 bits (139), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 69/276 (25%), Positives = 112/276 (40%), Gaps = 61/276 (22%)

Query: 594 VLQVYRDFVATGDKNFARAVWPSVYIAMAYM------EQFDKDGDGMIENEGFPDQTYDA 647
           +++ YRD+   GD ++ R  WP+V  A+ Y       +++D D  G++   G   QT D 
Sbjct: 475 IIKTYRDWKLCGDTDWLRRYWPNVKRAIEYAWSSENPDRWDPDQTGIL--SGRQHQTLDM 532

Query: 648 WSANGVSAYCGGLWVAALQAASALANDVGDHASASYFWVRYQKAKAVY--DSLWNGSYF- 704
               G +++   ++VAAL   S +A  +GD    S    R  KA A Y    L+NG +F 
Sbjct: 533 -ELFGPNSWLASMYVAALLGVSEMAAVLGD-TELSEKTARMGKAGAAYINSELFNGRWFI 590

Query: 705 -NYDNSD---------GSSSTSIQ------------------------ADQLAGQWYARA 730
              D SD         G ++  +                         +DQ+ GQW+A  
Sbjct: 591 QKVDLSDKGVLIPFDVGRAAGVLADGFMETYWSEEFQELKYQMGEGCISDQILGQWHAEV 650

Query: 731 CGLLPIADEAKVKKALTKIYDFN---VLKVKGGMC---------GAMNGMQPDG-RIDM- 776
            G+    D  KV+ AL  ++  N    L+     C         G +    P+G R  M 
Sbjct: 651 AGIGAFLDADKVRTALKSVHRNNFRPTLEDHFNPCRNYAYEDEAGLLIATYPEGIRQPMV 710

Query: 777 SGLQAREIWPGVTYGLAASMIQEEMVDMAFQTAAGV 812
           +   A E+W G+ Y  A+ +I   +++       G 
Sbjct: 711 AAPYAEEVWTGIEYMAASHLIMHGLIEEGMDIVCGA 746



 Score = 47.8 bits (112), Expect = 0.030,   Method: Compositional matrix adjust.
 Identities = 55/209 (26%), Positives = 87/209 (41%), Gaps = 39/209 (18%)

Query: 173 YHALFPRAWTVYDGEPDP-ELRIVCRQISPFIPHNYKESSFPASVFTFTLSNSGQTSADV 231
           ++  FP A  V+  +  P ++R+    +SPFIPHN ++SS P ++F F + N    +AD 
Sbjct: 119 FYGRFPIADLVFKDQRFPGDIRLTA--LSPFIPHNDRDSSMPVAMFEFEIVND---TAD- 172

Query: 232 TLLFTWANSVAGDSGLSGHHFNSKTMTKDGVHGLTLHHRT-------ANGRPPVTFAVAA 284
            L +T A ++      SG H    T  ++G  G++  H T       A  R  +T A  A
Sbjct: 173 ELTYTLAGTLGNYGSNSGIH----TFRQEG--GISSLHLTSSDTTLPATQRGDLTIATDA 226

Query: 285 EET--ADVHVSECPCFLLSGNSKGITAKDM---WNEIKKHGSF--DHLDNDKTSPSEPGS 337
           ++    D H             +G    D+   W E  + G     H D  +TS      
Sbjct: 227 DDVDHTDHHY------------RGQWFDDLSVYWKEFARPGRLPKRHYDEPRTSRHMSLQ 274

Query: 338 SIGAAIAASLTIPSGSTRSVTFSLAWDCP 366
                + A   +  G  R V F + W+ P
Sbjct: 275 PEHGTLGARFAVAPGERRKVRFVITWNFP 303


>gi|156383620|ref|XP_001632931.1| predicted protein [Nematostella vectensis]
 gi|156219994|gb|EDO40868.1| predicted protein [Nematostella vectensis]
          Length = 1090

 Score = 57.4 bits (137), Expect = 4e-05,   Method: Compositional matrix adjust.
 Identities = 48/164 (29%), Positives = 77/164 (46%), Gaps = 12/164 (7%)

Query: 594 VLQVYRDFVATGDKNFARAVWPSV-YIAMAYMEQFDKDGDGMIENEGFPDQTYDAWSANG 652
           VL+ YR+      K +   +WP V  I   +M   D +G G+I   G    TYD  +  G
Sbjct: 640 VLKTYREVRQGAPKTWFNKMWPQVKRIMKRWMTDLD-NGTGVIP--GPQPNTYDC-TLYG 695

Query: 653 VSAYCGGLWVAALQAASALANDVGDHASASYFWVRYQKAKAVYD-SLWNGSYFNYDNSDG 711
           V+ Y  G ++A L+AA  +A    +   A+ +  R++   A  D +L+NGS++     D 
Sbjct: 696 VNVYISGYYLAGLRAAEKMAKIQNEPELAATYHQRFELGSAKIDKTLFNGSWY-IQEVDS 754

Query: 712 SSSTSIQADQ-----LAGQWYARACGLLPIADEAKVKKALTKIY 750
               ++  D      L GQW+A A GL  I  +  ++  L  I+
Sbjct: 755 EHPVNVVGDATWVDCLVGQWWAHALGLGYILSQKNIQSTLQNIF 798


>gi|332665583|ref|YP_004448371.1| hypothetical protein [Haliscomenobacter hydrossis DSM 1100]
 gi|332334397|gb|AEE51498.1| protein of unknown function DUF608 [Haliscomenobacter hydrossis DSM
           1100]
          Length = 857

 Score = 57.0 bits (136), Expect = 5e-05,   Method: Compositional matrix adjust.
 Identities = 58/253 (22%), Positives = 104/253 (41%), Gaps = 37/253 (14%)

Query: 594 VLQVYRDFVATGDKNFARAVWPSVYIAMAYM---EQFDKDGDGMIENEGFPDQTYDAWSA 650
           V++ YR+++ +GD  + +  W  V  AM+Y      +D + DG+ E +       D +  
Sbjct: 491 VMRFYREWILSGDNEWLKKHWARVKAAMSYAWVPGGWDANQDGVQEGKQHNTMDVDYYGP 550

Query: 651 NGVSAYCGGLWVAALQAASALANDVGDHASASYF-WVRYQKAKAVYDSLWNGSYFNYDNS 709
           N    +    +  AL+A++A+A  + D   A     V  + +K V ++L+NG Y+ +  +
Sbjct: 551 NPQMQFW---YFGALKASAAMARAMNDPVFAKQCEQVLAKGSKWVDENLFNGEYYEHKIT 607

Query: 710 D---------GSSSTSIQ---------ADQLAGQWYARACGLLPIADEAKVKKALTKIYD 751
           D          + ST+I           DQ+ GQ+ A  CGL  +A +  ++ AL  +  
Sbjct: 608 DPKTFEFLDRSNPSTAIPNFQLGEGCLLDQVVGQYMAHVCGLGYLAKKENIQTALQTVMK 667

Query: 752 FNVLK------------VKGGMCGAMNGMQPDGRIDMSGLQAREIWPGVTYGLAASMIQE 799
           +N +             V     G +    P GR+ +      E   G  Y  A  M+ E
Sbjct: 668 YNFVPRFDNVFNNMRSYVMDKESGLIMASWPKGRLKVPFPYFAESMSGFEYAAAIGMLYE 727

Query: 800 EMVDMAFQTAAGV 812
              +   Q    +
Sbjct: 728 GQTEAGLQCIQSI 740



 Score = 51.6 bits (122), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 69/343 (20%), Positives = 132/343 (38%), Gaps = 47/343 (13%)

Query: 51  HLAHIGFRLYRYSKQEEAKGKVPVFNVFRKHHITSDQGIPLGGIGAGSIGRSYRGEFQRF 110
           H +   F    ++   +A+   PV   + + H+  +  +PLGGIG G++    RGE + +
Sbjct: 2   HQSFYLFAFLLFATAIQAQTSWPVLKHYDQDHLL-NIALPLGGIGTGTVSLGGRGELRDW 60

Query: 111 KL-------FHGICD--DAPVLANQFSVFVSRPNGEKFSS--VLCPRSPGVPKKNTDSGI 159
           ++       F  I D   AP     F+V+V +P G+   S  +L P  P   +      +
Sbjct: 61  EIMNKPGKKFSTITDGNQAPF----FAVYV-KPIGKTAQSKALLGPFHPSEYQHYEGRPV 115

Query: 160 ESWDWNLKGENCTYHALFPRAWTVYDGEPDPELRIVCRQISPFIPHNYKESSFPASVFTF 219
                  +    ++ A +P        +  P + +  +  +P IP +   SS P ++  +
Sbjct: 116 NHHGMP-RFSVASFDAAYPFGQVNLSDDVLP-ISVKIKGFNPLIPGDVAASSIPLAILKY 173

Query: 220 TLSNSGQTSADVTLLFTWANSVAGDSG------------LSGHHFNSKTMTKDGVHGLTL 267
            ++N+     +V +  T  N +  D              L   +  ++      + G+ +
Sbjct: 174 EVTNNTDAPLEVAICGTMRNFIGIDGSKQSKNWKGDLVYLGAKNNQNEYRESPDLKGIFM 233

Query: 268 HHRTANGRPP----VTFAVAAEETADVHVSECPCFLLSGNSKGITAKDMWNEIKKHGSFD 323
             +  + + P    +  + A  E      S  P      N    +  D W++    G   
Sbjct: 234 SSQGVDNQDPAWGTIALSTAERELVSFRRSSTP------NDWENSILDFWDDFSADGVLS 287

Query: 324 HLDNDKTSPSEPGSSIGAAIAASLTIPSGSTRSVTFSLAWDCP 366
             D DK     P     A++A   TIP+ S+ + TF ++W  P
Sbjct: 288 --DKDKLVDDNP----MASLAVKKTIPARSSLTFTFYISWHFP 324


>gi|429739681|ref|ZP_19273428.1| hypothetical protein HMPREF9151_01886 [Prevotella saccharolytica
           F0055]
 gi|429156127|gb|EKX98766.1| hypothetical protein HMPREF9151_01886 [Prevotella saccharolytica
           F0055]
          Length = 989

 Score = 56.6 bits (135), Expect = 6e-05,   Method: Compositional matrix adjust.
 Identities = 54/266 (20%), Positives = 105/266 (39%), Gaps = 60/266 (22%)

Query: 594 VLQVYRDFVATGDKNFARAVWPSVYIAMAY-MEQFDKDGDGMIENEGFPDQTYDAWSANG 652
           +++ YRD+  +GD  + + ++P++  ++ Y +  +D    G IE         + W A+G
Sbjct: 633 IMKAYRDWRISGDTEWLKKLYPAIKASLDYCIRTWDPRQKGCIEEPHHNTYDIEFWGADG 692

Query: 653 VSAYCGGLWVAALQAASALANDVGDHASASYFWVRYQKAKAVYDSLWNGSYF-------- 704
           +   C   +V AL+A   ++ D     S  Y  +  +  + +   L+NG Y+        
Sbjct: 693 M---CTSFYVGALEAFIRMS-DALHRPSTPYRKLLKKSVELMNTQLFNGEYYIQKTQWTG 748

Query: 705 --------------NYDNSDG-----------SSSTSIQADQLAGQWYARACGLLPIADE 739
                         NY N +               T   +D + G W A  CGL    ++
Sbjct: 749 LRAKSPTEVLSFGSNYSNPEALQLLQAEGPKYQYGTGCLSDGVIGMWMASVCGLPEPLNQ 808

Query: 740 AKVKKALTKIYDFNV----------------LKVKGG--MCGAMNGMQPDGRIDMSGLQA 781
           A+    L  +Y +NV                L  +GG  +C    G +P     +  + +
Sbjct: 809 AQTLSHLNAVYKYNVREDLYRHYNTQRPSYALGHEGGLLLCTWPKGKKP----TLPFVYS 864

Query: 782 REIWPGVTYGLAASMIQEEMVDMAFQ 807
            E+W G+ Y +A+ ++ +  +D   +
Sbjct: 865 NEVWTGIEYQVASHLLMKGEIDKGLK 890


>gi|282877348|ref|ZP_06286172.1| conserved hypothetical protein [Prevotella buccalis ATCC 35310]
 gi|281300533|gb|EFA92878.1| conserved hypothetical protein [Prevotella buccalis ATCC 35310]
          Length = 997

 Score = 56.6 bits (135), Expect = 7e-05,   Method: Compositional matrix adjust.
 Identities = 51/262 (19%), Positives = 104/262 (39%), Gaps = 52/262 (19%)

Query: 594 VLQVYRDFVATGDKNFARAVWPSVYIAMAY-MEQFDKDGDGMIENEGFPDQTYDAWSANG 652
           +++VYRD+  +G+  + ++++P+V  ++ Y +  +D    G +          + W A+G
Sbjct: 638 IMKVYRDWRISGNDEWLKSLYPAVKASLDYCIRTWDPHEKGSLTEPHHNTYDIEFWGADG 697

Query: 653 VSAYCGGLWVAALQAASALANDVGDHASASYFWVRYQKAKAVYDSLWNGSYF-------- 704
           +   C   +  AL+A   ++  +   A   Y  +  +    +   L+NG Y+        
Sbjct: 698 M---CTSFYTGALEAFVQMSEHLNKQA-GKYKKLLKRSISCMDKDLFNGEYYFQKTQWTG 753

Query: 705 --------------NYDNSDGSS-----------STSIQADQLAGQWYARACGLLPIADE 739
                         NY +++               T   +D + G W A  CGL    ++
Sbjct: 754 LEAKSPTEIPSFGSNYRHAEAQELLKTEGPKYQYDTGCLSDGILGMWMASVCGLPEPLNQ 813

Query: 740 AKVKKALTKIYDFNVLK-------------VKGGMCGAMNGMQPDG-RIDMSGLQAREIW 785
             VK  L  +Y +N+ K               GG  G +    P G +  +  + + E+W
Sbjct: 814 THVKSHLNAVYQYNLRKNMYNHYNTQRPSYAMGGDGGLLLCTWPKGAKPTLPFVYSNEVW 873

Query: 786 PGVTYGLAASMIQEEMVDMAFQ 807
            G+ Y +A+ +I +  V+   +
Sbjct: 874 TGIEYQVASHLIMKGEVEKGLE 895


>gi|284041275|ref|YP_003391205.1| hypothetical protein Slin_6449 [Spirosoma linguale DSM 74]
 gi|283820568|gb|ADB42406.1| protein of unknown function DUF608 [Spirosoma linguale DSM 74]
          Length = 857

 Score = 56.6 bits (135), Expect = 7e-05,   Method: Compositional matrix adjust.
 Identities = 65/256 (25%), Positives = 102/256 (39%), Gaps = 34/256 (13%)

Query: 594 VLQVYRDFVATGDKNFARAVWPSVYIAMAYM---EQFDKDGDGMIENEGFPDQTYDAWSA 650
           +++ YRD+  +GD +F R  WP V  A+A+      +D + DG++E  G    T D    
Sbjct: 484 IMKFYRDWQLSGDTDFLRRYWPQVKQALAFAWVPGGWDANRDGVME--GAQHNTMDV-EY 540

Query: 651 NGVSAYCGGLWVAALQAASALANDVGDHASASYFWVRYQKAKAVYDS-LWNGSYFNYD-- 707
            G +      ++ AL+AASA+A  + D + A      +QK     D+ L+NG Y+     
Sbjct: 541 FGPNPQMQLWYLGALKAASAMATAMNDRSFAQTCDQIFQKGSTWTDANLFNGDYYEQQVI 600

Query: 708 ---------------NSDGSSSTSIQ------ADQLAGQWYARACGLLPIADEAKVKKAL 746
                          N    +  + Q       DQL GQ+ A  CGL  + +   +K  L
Sbjct: 601 SPAGHLIAKGTFSGFNKVEQNDPAYQLAKGCLVDQLVGQFMAHVCGLGYLVNPDHIKTTL 660

Query: 747 TKIYDFNVLKVKGGMCGAMNGMQPDGRIDMSGLQAREIWPGVTYGLAASMIQEEMVDMAF 806
             I  +N    +  M    N M+     D + L     WP     +      E M    +
Sbjct: 661 KSILKYN---YRASMTDHFNNMRSYALGDEASLLMAS-WPKGRPKVPFPYFSEVMTGFEY 716

Query: 807 QTAAGVYEVAWSEDGL 822
             A G+     +E GL
Sbjct: 717 TAAIGMLYENQTEPGL 732


>gi|255013072|ref|ZP_05285198.1| hypothetical protein B2_04148 [Bacteroides sp. 2_1_7]
 gi|410102535|ref|ZP_11297461.1| hypothetical protein HMPREF0999_01233 [Parabacteroides sp. D25]
 gi|409238607|gb|EKN31398.1| hypothetical protein HMPREF0999_01233 [Parabacteroides sp. D25]
          Length = 620

 Score = 55.1 bits (131), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 41/145 (28%), Positives = 66/145 (45%), Gaps = 8/145 (5%)

Query: 588 DLNSKFVLQVYRDFVATGDKNFARAVWPSVYIAMAYMEQFDKDGDGMIENEGFPDQTYDA 647
           D+N+ F+L++ R +    +K      WP++  AM ++   D D  G+     F     D 
Sbjct: 346 DINAFFILRIARFYRYYHNKQDLINYWPAMKNAMNWLISRDTDKIGLPAQVSFWGDWKDV 405

Query: 648 WSANG--VSAYCGGLWVAALQAASALANDVGDHASASYFWVRYQKAKAVYDS------LW 699
               G   S +   ++++AL+    LA + GD A A+Y+   YQK     +       LW
Sbjct: 406 KGIEGRKYSPFTCLIYLSALKEMVYLAEECGDQAGANYYQSAYQKGYKTMNKDVREGGLW 465

Query: 700 NGSYFNYDNSDGSSSTSIQADQLAG 724
           NG+Y+     DGS +  I  DQ  G
Sbjct: 466 NGNYYCQIWKDGSVNDRILQDQTIG 490


>gi|340616467|ref|YP_004734920.1| hypothetical protein zobellia_456 [Zobellia galactanivorans]
 gi|339731264|emb|CAZ94529.1| Conserved hypothetical protein [Zobellia galactanivorans]
          Length = 822

 Score = 54.7 bits (130), Expect = 3e-04,   Method: Compositional matrix adjust.
 Identities = 82/368 (22%), Positives = 132/368 (35%), Gaps = 75/368 (20%)

Query: 89  IPLGGIGAGSIGRSYRGEFQRFKLFH---GICDDAPVLANQFSVFVSRPNGEKFSSVLCP 145
           +PLGG GAGS+     G F  + +F+      +D   L + F     + +G+K  +    
Sbjct: 94  VPLGGFGAGSVELRADGGFMDWDIFNNSPATGNDKIQLNDAFMGLYIQADGKKAFATTLR 153

Query: 146 RSPGVPKKNTDSGIESWDWNLKGENCTYHALFPRAWTVYDGEPDPELRIVCRQISPFIPH 205
             P  PKK               +   Y   FP +         P L+      S +   
Sbjct: 154 THP--PKKLPAV-----------QQLEYSGAFPVSRLNIQDSNIP-LKTKLYAYSEYKIR 199

Query: 206 NYKESSFPASVFTFTLSNSGQTSADVTLLFTWANSVAG----DSGLSGHHFNSKTMTKDG 261
           N  +S+ P ++F+  L N          LF   N + G    D GL         +TK G
Sbjct: 200 NSIQSATPCALFSLELENVTDDEYQAAFLFNLPNHIQGLFNVDQGL--------VLTKKG 251

Query: 262 VH----GLTLHHRTANGRPPVTFAVAAEETADVHVSECPCFLLSGNSKGITAKDMWNEIK 317
                  +TL    ANG   VT++VA +                          +W+E  
Sbjct: 252 KEPSSGDMTL---AANGADEVTYSVADD-----------------------LNKLWDEFS 285

Query: 318 KHGSFDHLDNDKTSPSEPGSSIGAAIAASLTIPSGSTRSVTFSLAWDCPEVKFFEKVYHR 377
            +G+F    +D        S    AI+A +++  G  R++T ++ W  PE     ++   
Sbjct: 286 DNGNFQEDIHD--------SGQYGAISAKVSLAPGEKRTITIAMGWYFPERPISVEIVGN 337

Query: 378 RYTKFYGTLGDSAARIAHDAILEHAK-WECEIEAWQRPILEDKRFPEWYPITLFNELYYL 436
            YT  +     S+  +   A+    + W+  IE W      D   P+W    + N    +
Sbjct: 338 YYTNLF----RSSTEVTEKALSRLPETWKGIIE-WNETCF-DNSLPDWLQDAMVNSTATI 391

Query: 437 NAGGTIWT 444
              G IWT
Sbjct: 392 VKTG-IWT 398



 Score = 47.8 bits (112), Expect = 0.033,   Method: Compositional matrix adjust.
 Identities = 43/166 (25%), Positives = 73/166 (43%), Gaps = 14/166 (8%)

Query: 588 DLNSKFVLQVYRDFVATGDKNFARAVWPSVYIAMAYMEQFDKDGDGMIENEGFPDQ---T 644
           D ++ F+L VY   +  GD+ F   +WP V  A  +  Q ++         G PD+   T
Sbjct: 476 DCSTSFILSVYASHIWDGDEEFVNEMWPHVKRAAQW--QINRS-----RKFGLPDRLAAT 528

Query: 645 YD--AWSANGVSAYCGGLWVAALQAASALANDVGDHASASYFWVRYQKAKAVYDS-LWNG 701
           YD   +    + +Y   + +AA++AA+ LA +       S +      A+    +  W G
Sbjct: 529 YDLSGFGKKDLVSYNAFMHLAAMKAATELAKEREKEDLVSLYKKNIAAAQNTLKAHFWTG 588

Query: 702 SYF-NYDNSDGSSSTSIQADQLAGQWYARACGLLPIADEAKVKKAL 746
           +YF N+ N D   +  I  D   GQ ++   GL  + +   +K  L
Sbjct: 589 TYFRNWWNKDEPKNDDIHIDTQFGQVWSYLLGLGDVIEPELMKSHL 634


>gi|229820204|ref|YP_002881730.1| hypothetical protein Bcav_1712 [Beutenbergia cavernae DSM 12333]
 gi|229566117|gb|ACQ79968.1| protein of unknown function DUF608 [Beutenbergia cavernae DSM
           12333]
          Length = 865

 Score = 54.7 bits (130), Expect = 3e-04,   Method: Compositional matrix adjust.
 Identities = 45/157 (28%), Positives = 68/157 (43%), Gaps = 16/157 (10%)

Query: 79  RKHHITSDQGIPLGGIGAGSIGRSYRGEFQRFKLFHGICDDAPVLANQFSVFVSRPNGEK 138
           R  HI     +PLGGIG G++     G  ++++L +       +  + F+V VS+     
Sbjct: 11  RGRHIA----MPLGGIGTGNVAIGADGGLRQWQLHNIGNHHGDLPGSLFAVRVSQWE-PP 65

Query: 139 FSSVLCPRSPGVPKKNT------DSGIESWDWNLKGEN----CTYHALFPRAWTVYDGEP 188
              V   ++P  P  NT      D  +  W   L   +     T  A +P A   YD   
Sbjct: 66  LDEVRVLQAPAAPAGNTRTPMVDDDHVPDWQRELAAAHGVTATTISATYPVARVAYDLGE 125

Query: 189 DPELRIVCRQISPFIPHNYKESSFPASVFTFTLSNSG 225
            P L I     +P +P +   SS P ++FTFTL N+G
Sbjct: 126 VP-LAIELEATTPLVPLDEAASSLPVAMFTFTLRNAG 161



 Score = 54.3 bits (129), Expect = 3e-04,   Method: Compositional matrix adjust.
 Identities = 79/330 (23%), Positives = 130/330 (39%), Gaps = 68/330 (20%)

Query: 517 ALVMLFPKLELSIQRDFAAAVMMHDPGTMKIMSDGKWVARKCLGAVPHDIGLDDPWFEIN 576
           AL  LFP LE S++      + + +  T               GAVPH + L  P +   
Sbjct: 444 ALARLFPGLEASMRATELDVMQVPERHTHA-------------GAVPHRVLL--PRYVPQ 488

Query: 577 SYNL-FNSSRWKDLNSKF--VLQVYRDFV--ATGDKNFARAVWPSVYIAMAYME-QFDKD 630
            +++         L+     +L+ YR+    A G  + A A WP + + + ++  ++D  
Sbjct: 489 MWDVPIGGPEEPALDGMLGVLLKTYREVRSGAIGLGHLA-ASWPRLMLLLDHVRGRWDPS 547

Query: 631 GDGMIENEGFPDQTYDAWSANGVSAYCGGLWVAALQA--------------ASALANDVG 676
           G G++   G    T+D     GV+ + G LW+AAL+A              A AL     
Sbjct: 548 GSGVLR--GVQPSTHDI-DLRGVNTFMGTLWLAALRACEEMALRLAADPHHADALGGATL 604

Query: 677 DHASASYFWVRYQKAKAVYDSL-WNGSYF----------NYDNSDGSSSTSIQADQLAGQ 725
               A+     +++  A YD + + G Y+           +   DG  S     DQL GQ
Sbjct: 605 GRELAAELHALFERGSAAYDEICFTGEYYRQVLEPDEPTQFQWGDGCLS-----DQLIGQ 659

Query: 726 WYARACGLLPIADEAKVKKALTKIYDFNVLK------------VKGGMCGAMNGMQP-DG 772
           W+A    L  +   A V+ AL  +   N+ +              G   G +    P  G
Sbjct: 660 WWAHLLDLGHVLPAAHVRSALRAVVGHNLRRGFDGWEHTQRVYADGDDVGLLMCTWPHGG 719

Query: 773 RIDMSGLQAREIWPGVTYGLAASMIQEEMV 802
           R +++     E+W GV Y +AA  + E + 
Sbjct: 720 RPEVATRYCDEVWSGVEYEVAAHCLLEGLT 749


>gi|156376779|ref|XP_001630536.1| predicted protein [Nematostella vectensis]
 gi|156217559|gb|EDO38473.1| predicted protein [Nematostella vectensis]
          Length = 666

 Score = 53.9 bits (128), Expect = 4e-04,   Method: Compositional matrix adjust.
 Identities = 49/177 (27%), Positives = 83/177 (46%), Gaps = 19/177 (10%)

Query: 588 DLNSKFVLQVYRDFVATGDKNFARAVWPSVYIAMAYMEQFDKDGDGMIENEGFPDQTYDA 647
           D ++ FV++VY  F  TGD    R +WP+V  A+ ++      G G+      P +    
Sbjct: 277 DSSALFVVEVYHYFKWTGDSALLRRLWPNVKRAVTWLIGESTKGTGL------PFRKVST 330

Query: 648 WSANGVSAYC-----GGLWVAALQAASALANDVGD---HASASYFWVRYQKAKAVYDSLW 699
           +   G+  Y       G+++ AL+AA  LA  + D   +A+ ++  +R  +   V  +LW
Sbjct: 331 YDLVGIDKYDHVMYEAGVFLLALRAAQHLAVHLKDKPFNATTTHALIRATQLVDV--TLW 388

Query: 700 NGS---YFNYDNSDGSSSTSIQADQLAGQWYARACGLLPIADEAKVKKALTKIYDFN 753
           N     Y  + +S+  +S  I +D L GQ +A   GL  +  E K+K  L     +N
Sbjct: 389 NEERQFYRAWWDSEYGASDWIMSDSLYGQVWANTLGLGKLLPEEKMKVHLQNEIKYN 445


>gi|312373134|gb|EFR20945.1| hypothetical protein AND_18252 [Anopheles darlingi]
          Length = 734

 Score = 52.4 bits (124), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 28/57 (49%), Positives = 36/57 (63%), Gaps = 1/57 (1%)

Query: 88  GIPLGGIGAGSIGRSYRGEFQRFKLFHGICDDAPVLANQFSVFVSRPNGEK-FSSVL 143
           G PLGGIGAG+IGR + GEF R++L  G+ +   V ANQF V +    G   F S+L
Sbjct: 674 GAPLGGIGAGTIGRGFAGEFCRYQLRPGLYEYNTVHANQFIVTIKDETGATIFQSLL 730


>gi|167524276|ref|XP_001746474.1| hypothetical protein [Monosiga brevicollis MX1]
 gi|163775236|gb|EDQ88861.1| predicted protein [Monosiga brevicollis MX1]
          Length = 775

 Score = 52.4 bits (124), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 54/221 (24%), Positives = 93/221 (42%), Gaps = 30/221 (13%)

Query: 618 YIAMAYME---QFDKDGDGMIENEGFPDQTYDAWSANGVSAYCGGLWVAALQAASALAND 674
           ++A A ME   +  + GDG++   G    TYD  +  G + + G L++ AL  A+ +A  
Sbjct: 468 HLAAATMELLARLYRPGDGVVR--GAQPSTYDV-ALYGANTFIGSLYLCALCCAAEMATR 524

Query: 675 VGDHASASYFWVRYQKAKAVYD--SLWNGSYFNY----DNSDGSSSTSIQADQLAGQWYA 728
           +G    A+ +  RY+ + A YD     NG ++      D++    S     D L GQW+A
Sbjct: 525 LGQSELAATYTARYKLSVANYDKACFTNGKWYTQVVDPDHATNVVSDGTFVDALLGQWWA 584

Query: 729 RACGLLPIADEAKVKKALTKIYDFNVL---------------KVKGGM--CGAMNGMQPD 771
              G+  +  +A V   ++ +++ N +                   GM  C      QP 
Sbjct: 585 HLLGIGDMLPQAHVASNVSSVFEANHVDEFDVDRQAPRKFCDNRDAGMYICTWGGAAQPP 644

Query: 772 GRIDMSGLQAREIWPGVTYGLAASMIQEEMVDMAFQTAAGV 812
               +    +  +W G+ Y  A   +Q   VD+A Q  + V
Sbjct: 645 PSNALL-YTSEAMWSGLAYPFAGLALQTGQVDVALQVLSDV 684


>gi|410867532|ref|YP_006982143.1| hypothetical protein PACID_30230 [Propionibacterium acidipropionici
           ATCC 4875]
 gi|410824173|gb|AFV90788.1| hypothetical protein PACID_30230 [Propionibacterium acidipropionici
           ATCC 4875]
          Length = 845

 Score = 52.0 bits (123), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 59/229 (25%), Positives = 96/229 (41%), Gaps = 25/229 (10%)

Query: 594 VLQVYRDFV--ATGDKNFARAVWPSVYIAMAYME-QFDKDGDGMIENEGFPDQTYDAWSA 650
           VL+ YR+    A G   + R  W ++   M ++  ++D +  G++  E     T+D  S 
Sbjct: 497 VLKTYREVRRGAVG-AGWLRDRWNNLCRLMDHVAARWDPECSGLLTGE--QPSTHDI-SL 552

Query: 651 NGVSAYCGGLWVAALQAASALANDVGDHASASYFWVRYQKAKAVYDS-LWNGSYF----- 704
            G + + G LW+AAL+A+  +A  +G    A  +  R+  + A YD+ L+ G Y+     
Sbjct: 553 YGANMFMGSLWLAALRASEEMALILGATDRAMGWRERFVLSSAAYDTALFRGGYYVQAAD 612

Query: 705 NYDNSDGSSSTSIQADQLAGQWYARACGLLPIADEAKVKKALTKIYDFNV---------- 754
                D        ADQL GQW+A    L  +     V+ AL  +   N+          
Sbjct: 613 ETHGDDDEFGDGCLADQLIGQWWAHLLDLGHLLPAEHVRSALQAVVAHNLNPADETRHSQ 672

Query: 755 --LKVKGGMCGAMNGMQPDGRIDMSGLQAREIWPGVTYGLAASMIQEEM 801
               V+G     M      GR   +     E+W G  Y +AA  + E +
Sbjct: 673 RAYAVRGEQGLVMCSWPRGGRPARATRYCDEVWTGCEYEVAALCLAEGL 721


>gi|156383694|ref|XP_001632968.1| predicted protein [Nematostella vectensis]
 gi|156220031|gb|EDO40905.1| predicted protein [Nematostella vectensis]
          Length = 1641

 Score = 50.8 bits (120), Expect = 0.003,   Method: Compositional matrix adjust.
 Identities = 55/221 (24%), Positives = 92/221 (41%), Gaps = 15/221 (6%)

Query: 553 WVARKCLGAVPHDIGLD-DPWFEINSYNLFNSS-RWKDLNSKFVLQVYRDFVATGDKNFA 610
           W   +  G +P  +G     W + N ++         D+N  F+ QVY  ++ TG+K   
Sbjct: 639 WAEYRRDGGIPEALGHQYGCWRKTNRFDKAGGRLNMADVNPAFLSQVYHLYMWTGNKKLL 698

Query: 611 RAVWPSVYIAMAYMEQFDKDGDGMIENEGFPDQTYD--AWSANGVSAYCGGLWVAALQAA 668
             +WP++      M  +  +G  +         TYD   +    VS Y   + +  L  A
Sbjct: 699 DDLWPAIKDVTHAMVTWTTNGTSLPLRM---TNTYDILQFQRYDVSMYNSVMHLLGLTVA 755

Query: 669 SALANDVGDHASASYFWVRYQKAKAVYDS-LWN---GSYFNYDNSDGSSSTSIQADQLAG 724
            A+     +        +R +KA +  D  LW+   G Y  + +    S++++ AD L G
Sbjct: 756 RAIGQVRKERKFVKDLTLRIRKATSAIDRLLWDEKRGYYRAWWDHAQPSTSALMADSLYG 815

Query: 725 QWYARACGLLPIADEAKVKKALTKIYDFNV----LKVKGGM 761
           Q +A   GL  + DE K+   L K  + N     LKV  G+
Sbjct: 816 QVWAYTLGLGNLLDENKMASHLRKEAELNDTPFGLKVMSGL 856


>gi|361067543|gb|AEW08083.1| Pinus taeda anonymous locus 0_18489_02 genomic sequence
 gi|383162873|gb|AFG64132.1| Pinus taeda anonymous locus 0_18489_02 genomic sequence
 gi|383162874|gb|AFG64133.1| Pinus taeda anonymous locus 0_18489_02 genomic sequence
 gi|383162875|gb|AFG64134.1| Pinus taeda anonymous locus 0_18489_02 genomic sequence
 gi|383162876|gb|AFG64135.1| Pinus taeda anonymous locus 0_18489_02 genomic sequence
 gi|383162877|gb|AFG64136.1| Pinus taeda anonymous locus 0_18489_02 genomic sequence
 gi|383162878|gb|AFG64137.1| Pinus taeda anonymous locus 0_18489_02 genomic sequence
 gi|383162879|gb|AFG64138.1| Pinus taeda anonymous locus 0_18489_02 genomic sequence
 gi|383162880|gb|AFG64139.1| Pinus taeda anonymous locus 0_18489_02 genomic sequence
 gi|383162881|gb|AFG64140.1| Pinus taeda anonymous locus 0_18489_02 genomic sequence
 gi|383162882|gb|AFG64141.1| Pinus taeda anonymous locus 0_18489_02 genomic sequence
          Length = 79

 Score = 50.8 bits (120), Expect = 0.004,   Method: Composition-based stats.
 Identities = 26/55 (47%), Positives = 32/55 (58%), Gaps = 1/55 (1%)

Query: 838 YRSLCYMRPLTIWAMQWALTKPKLSRQEIKHEISDRDSSYLEQHAAFSKVASLLK 892
           YRSL YMRPL IWAMQWAL  PK   +  K  + +R+    + HA  + VA   K
Sbjct: 3   YRSLIYMRPLAIWAMQWALNPPKAVLEAPKINLMEREHPS-QFHAGLAAVAEAFK 56


>gi|220925399|ref|YP_002500701.1| amylo-alpha-16-glucosidase [Methylobacterium nodulans ORS 2060]
 gi|219950006|gb|ACL60398.1| Amylo-alpha-16-glucosidase [Methylobacterium nodulans ORS 2060]
          Length = 765

 Score = 50.4 bits (119), Expect = 0.005,   Method: Compositional matrix adjust.
 Identities = 38/130 (29%), Positives = 62/130 (47%), Gaps = 18/130 (13%)

Query: 588 DLNSKFVLQVYRDFVATGDKNFARAVWPSVYIAMAYMEQF-DKDGDGMIE---------- 636
           D    FV+   + F  TGDK+  +A+WP++  A+ +++ + D+DGDG +E          
Sbjct: 413 DATPLFVMLAGQYFEQTGDKDTIQAIWPNILAALEWIDTYGDRDGDGFVEYARETDKGLA 472

Query: 637 NEGFPDQ-----TYDAWSANGVSAYC--GGLWVAALQAASALANDVGDHASASYFWVRYQ 689
           N+G+ D        D   A G  A C   G   AA +AAS LA  +G+ A A       +
Sbjct: 473 NQGWKDSHDSIFHADGRMAQGPIALCEVQGYVYAAKRAASKLAATLGNAALAERLKDAAE 532

Query: 690 KAKAVYDSLW 699
           + +  +D  +
Sbjct: 533 RLRVAFDEAF 542


>gi|338212680|ref|YP_004656735.1| hypothetical protein [Runella slithyformis DSM 19594]
 gi|336306501|gb|AEI49603.1| hypothetical protein Runsl_3226 [Runella slithyformis DSM 19594]
          Length = 855

 Score = 50.1 bits (118), Expect = 0.006,   Method: Compositional matrix adjust.
 Identities = 77/378 (20%), Positives = 145/378 (38%), Gaps = 47/378 (12%)

Query: 71  KVPVFNVFRKHHITSDQGIPLGGIGAGSIGRSYRGEFQRFKLFH----GICDDAP-VLAN 125
           K P+   + + H+T    +P+GGIG G++    RG+ Q +++ +    G     P  +A 
Sbjct: 23  KWPMLKHYDQQHLTRI-ALPIGGIGTGTVSLGGRGQLQDWEIMNRPGKGFSTTLPGNMAP 81

Query: 126 QFSVFVSRPNGEKFSSVLCPRSPGVPKKNTDSGIESWDWNLKG----ENCTYHALFPRAW 181
            F+++V +P  +  +  L     G  + +    +E    +  G       ++ A +P   
Sbjct: 82  FFAIYVKKPTEKAQAKALI----GPLETHEYQHMEGRPVDHHGLPRFAKASFDAAYPFGQ 137

Query: 182 TVYDGEPDPELRIVCRQISPFIPHNYKESSFPASVFTFTLSNSGQTSADVTLLFTWANSV 241
                E  P + +  +  +P IP + + S  P +  T+ ++N+      V++  +  N V
Sbjct: 138 VNLSDEKLP-ISVQIKAFNPLIPGDAERSGIPMAALTYEVTNTSNQPLTVSVAGSMRNFV 196

Query: 242 A-----------GDSGLSGHHFNSKTMTKDGV-HGLTLHHRTANGRPPV--TFAVAAEET 287
                       G+   SG   N     +     G+ ++    +   P   T A++    
Sbjct: 197 GKDGSKGSKSWKGEVNPSGAKQNKNQFRQGKTGQGIFMYSDGVDKADPAWGTIALSTNAV 256

Query: 288 ADV--HVSECPCFLLSGNSKGITAKDMWNEIKKHGSFDHLDNDKTSPSEPGSSIGAAIAA 345
            DV    S  P      N+      D W++    G       +KT+ ++      A++A 
Sbjct: 257 GDVSYRTSSVP------NAWENALLDFWDDFTDDGKL----TEKTTVADEDPM--ASLAV 304

Query: 346 SLTIPSGSTRSVTFSLAWDCPEVKFFEKVYHRRYTKFYGTLGDSAARIAHDAILEHAKWE 405
             TI    T S TF + W  P    + K    R   +Y T    A  +A  A+ +  + E
Sbjct: 305 KKTIQPKQTESFTFFITWHFPNRFAWSK---ERVGNYYTTRYADAWDVAEKAVPQLPQLE 361

Query: 406 CEIEAWQRPILEDKRFPE 423
            E   +    L+   +PE
Sbjct: 362 NETLLFVNSFLKSS-YPE 378



 Score = 49.3 bits (116), Expect = 0.009,   Method: Compositional matrix adjust.
 Identities = 73/328 (22%), Positives = 123/328 (37%), Gaps = 60/328 (18%)

Query: 594 VLQVYRDFVATGDKNFARAVWPSVYIAMAYM---EQFDKDGDGMIENEGFPDQTYDAWSA 650
           +++ YRD+  +G+  F +  WP +  A+A+    + +D + DG+++  G    T D    
Sbjct: 484 IMKFYRDWQLSGNTAFLQKNWPKIKRALAFAWVPDGWDGNLDGVMD--GAQHNTMDV-EY 540

Query: 651 NGVSAYCGGLWVAALQAASALANDVGDHASASYFWVRYQKAKAVYDS-LWNGSYF----- 704
            G +      ++ AL+AA  +A  + D A A      +Q      D  L+NG Y+     
Sbjct: 541 FGPNPQMQIWYLGALKAAEKMAEALHDAAFAQQCKTLFQSGSTWADQQLFNGEYYEQQVM 600

Query: 705 -----NYDNSDGSSSTSIQAD-------------QLAGQWYARACGLLPIADEAKVKKAL 746
                 +     S   S Q D             QL GQ+ A  CGL  +     V+ +L
Sbjct: 601 SPVGRKFAKGTVSGWNSTQGDFPPFQLAKGCLVDQLVGQFMAHVCGLGYLVKPQNVQTSL 660

Query: 747 TKIYDFNVLKVKGGMCGAMNGMQ---------------PDGRIDMSGLQAREIWPGVTYG 791
             I  +N    +  +    N M+               P GR  +      E+  G  Y 
Sbjct: 661 KSILKYN---YRATLTEHFNNMRSYALADEPALLMASWPKGRPTVPFPYFAEVMTGFEYT 717

Query: 792 LAASMIQEEMVDMAFQTAAGVYEVAWSEDGLGYS-FQTPESWNNNDEYRSLCYMRPLTIW 850
            A  M+ E M D   +    +  + +  DG   S F   E  ++        Y R +  W
Sbjct: 718 AAIGMLYENMTDDGLKC---IQNIRYRYDGAKRSPFDEAECGHH--------YARAMISW 766

Query: 851 AMQWALTKPKLSRQEIKHEISDRDSSYL 878
           A   AL+  + S  E     +D+  ++ 
Sbjct: 767 ASTLALSGFQYSGVEKTLHFNDQPGTFF 794


>gi|428168440|gb|EKX37385.1| hypothetical protein GUITHDRAFT_145081 [Guillardia theta CCMP2712]
          Length = 887

 Score = 49.3 bits (116), Expect = 0.011,   Method: Compositional matrix adjust.
 Identities = 58/226 (25%), Positives = 87/226 (38%), Gaps = 62/226 (27%)

Query: 510 VHFYSSFALVMLFPKLELSIQRDFAAAVMMHDPGTMKIMSDGKWVARKCLGAVPHDIGLD 569
           +H Y S  L +LFP+L                              R  L   P    +D
Sbjct: 528 LHLYRSIPLALLFPQL-----------------------------VRNILDTGP----ID 554

Query: 570 DPWFEINSYNLFNSSRWKDLNSKFVLQVYRDFVATGDKNFARAVWPSVYIAMAYMEQFDK 629
           +P F   S  + +S      N  FVL V   F++TGD+           + + ++E    
Sbjct: 555 EPKFPCGSRMMGDS------NPGFVLSVLNLFMSTGDRR----------VLLDFLEPV-- 596

Query: 630 DGDGMIENEGFP---DQTYD--AWSANGVSAYCGGLWVAALQAASALANDVGDHASASYF 684
               M    G P     TYD   +    VSAY   L++A L AA  +A  V D +   Y 
Sbjct: 597 ----MSSRFGLPRHLQNTYDWFHFHLKDVSAYNAVLYLAQLAAAERIAALVQDDSFQHYC 652

Query: 685 WVRYQKAK-AVYDSLWNGSYFN-YDNSDGSSSTSIQADQLAGQWYA 728
            +   KA+  V + LWNG +   + N +G    ++  + L GQ +A
Sbjct: 653 QLHLSKAQEKVKELLWNGRFLRAWWNEEGELVEALHTETLYGQLWA 698


>gi|406663141|ref|ZP_11071211.1| putative bile acid beta-glucosidase [Cecembia lonarensis LW9]
 gi|405552803|gb|EKB48138.1| putative bile acid beta-glucosidase [Cecembia lonarensis LW9]
          Length = 937

 Score = 48.1 bits (113), Expect = 0.019,   Method: Compositional matrix adjust.
 Identities = 48/178 (26%), Positives = 84/178 (47%), Gaps = 14/178 (7%)

Query: 588 DLNSKFVLQVYRDFVATGDKNFARAVWPSVYIAMAYMEQFDKD----GDGMIENEGFPDQ 643
           D  +  VL+ YRD+  + D  F +  W ++ +A+ Y+   DK+     +GMI  E     
Sbjct: 534 DGQAGVVLRAYRDYQMSEDDTFLKNNWDNIKLALKYLIDLDKEYGETANGMIYGE--QHN 591

Query: 644 TYDA-WSANGVSAYCGGLWVAALQAASALANDVGDHASASYFWVRYQKAKAVYDSLWNGS 702
           T DA W   G       L++AAL AA  +A   GD +S   +    +  +   +SL+NG 
Sbjct: 592 TLDAEWF--GYIPAITSLYLAALAAAVEMAKATGDISSEKEYLSILESGRKNIESLFNGE 649

Query: 703 YF-NYDNSDGSSSTSIQA----DQLAGQWYARACGLLPIADEAKVKKALTKIYDFNVL 755
           YF   ++ D   +  I      DQ+ GQ +A    L  + ++  ++ +L  ++ +N +
Sbjct: 650 YFIQEEDPDHLDAIGIGKGCYIDQVFGQGWAFQLNLGRLYNKKMIQSSLDSLWKYNFV 707


>gi|270295078|ref|ZP_06201279.1| conserved hypothetical protein [Bacteroides sp. D20]
 gi|270274325|gb|EFA20186.1| conserved hypothetical protein [Bacteroides sp. D20]
          Length = 970

 Score = 46.2 bits (108), Expect = 0.078,   Method: Compositional matrix adjust.
 Identities = 49/260 (18%), Positives = 103/260 (39%), Gaps = 57/260 (21%)

Query: 594 VLQVYRDFVATGDKNFARAVWPSVYIAMAY-MEQFDKDGDGMIENEGFPDQTYDAWSANG 652
           +++VYR++  +GD  + + ++P+V  ++ Y +  +D    G +E         + W  +G
Sbjct: 612 IMKVYREWRISGDTQWMKDLFPAVKKSLDYCIRTWDPLHKGYLEEPHHNTYDIEFWGPDG 671

Query: 653 VSAYCGGLWVAALQAASALANDVGDHASASYFWVRYQKAKAVYDSLWNGSYF-------- 704
           +   C   ++ AL A   +  ++       Y  +  +  K +  +L++G YF        
Sbjct: 672 M---CTSFYLGALTAFIEMGKEL-KQPVKEYTALLSKGKKYMETALFDGEYFIQKIQWEG 727

Query: 705 -------NYDNSDGSSS-----------------TSIQADQLAGQWYARACGLLPIADEA 740
                  +  +  GS S                 T   +D + G W A  CGL  + D  
Sbjct: 728 LQAPNPVDVMSFGGSYSDEALKLLKEEGPKYQYGTGCLSDGILGMWMASVCGLDEVLDNE 787

Query: 741 KVKKALTKIYDFNVLKVKGGMCGAMNGMQP-----------------DGRIDMSGLQARE 783
           KV+  L  ++ +N   +K  +    N  +P                  G + +  + + E
Sbjct: 788 KVRSHLVAVHKYN---LKHDLVDHFNPQRPVYACGKDGGLLLCTWPKGGMLSLPFVYSNE 844

Query: 784 IWPGVTYGLAASMIQEEMVD 803
           +W G+ Y +A+ ++ +  V+
Sbjct: 845 VWTGIEYQVASHLMMKGEVE 864


>gi|160887666|ref|ZP_02068669.1| hypothetical protein BACUNI_00066 [Bacteroides uniformis ATCC 8492]
 gi|156862844|gb|EDO56275.1| hypothetical protein BACUNI_00066 [Bacteroides uniformis ATCC 8492]
          Length = 992

 Score = 46.2 bits (108), Expect = 0.078,   Method: Compositional matrix adjust.
 Identities = 49/260 (18%), Positives = 103/260 (39%), Gaps = 57/260 (21%)

Query: 594 VLQVYRDFVATGDKNFARAVWPSVYIAMAY-MEQFDKDGDGMIENEGFPDQTYDAWSANG 652
           +++VYR++  +GD  + + ++P+V  ++ Y +  +D    G +E         + W  +G
Sbjct: 634 IMKVYREWRISGDTQWMKDLFPAVKKSLDYCIRTWDPLHKGYLEEPHHNTYDIEFWGPDG 693

Query: 653 VSAYCGGLWVAALQAASALANDVGDHASASYFWVRYQKAKAVYDSLWNGSYF-------- 704
           +   C   ++ AL A   +  ++       Y  +  +  K +  +L++G YF        
Sbjct: 694 M---CTSFYLGALTAFIEMGKEL-KQPVKEYTALLSKGKKYMETALFDGEYFIQKIQWEG 749

Query: 705 -------NYDNSDGSSS-----------------TSIQADQLAGQWYARACGLLPIADEA 740
                  +  +  GS S                 T   +D + G W A  CGL  + D  
Sbjct: 750 LQAPNPVDVMSFGGSYSEEALKLLKEEGPKYQYGTGCLSDGILGMWMASVCGLDEVLDNE 809

Query: 741 KVKKALTKIYDFNVLKVKGGMCGAMNGMQP-----------------DGRIDMSGLQARE 783
           KV+  L  ++ +N   +K  +    N  +P                  G + +  + + E
Sbjct: 810 KVRSHLVAVHKYN---LKHDLIDHFNPQRPVYACGKDGGLLLCTWPKGGMLSLPFVYSNE 866

Query: 784 IWPGVTYGLAASMIQEEMVD 803
           +W G+ Y +A+ ++ +  V+
Sbjct: 867 VWTGIEYQVASHLMMKGEVE 886


>gi|424029315|ref|ZP_17768855.1| hypothetical protein VCHENC01_3267 [Vibrio cholerae HENC-01]
 gi|408887281|gb|EKM25904.1| hypothetical protein VCHENC01_3267 [Vibrio cholerae HENC-01]
          Length = 924

 Score = 46.2 bits (108), Expect = 0.084,   Method: Compositional matrix adjust.
 Identities = 44/170 (25%), Positives = 70/170 (41%), Gaps = 19/170 (11%)

Query: 578 YNLFNSSRWKDLNSKFVLQVYRDFVATGDKNFARAVWPSVYIAMAYMEQFDKDGDGMIEN 637
           YN  + + W       + +++     +GD  FA+ + P +   +  +E+   D DG++ +
Sbjct: 480 YNTTDGTPW------MIREIFEYINYSGDTQFAQQMLPIIERFIEGVEKHYLDDDGLMRH 533

Query: 638 EGFPDQTYDA-------WSANGVSAY-CGGLWVAALQAASALANDVGDHASASYFWVRYQ 689
              PD   DA       WS  G  A     LW  +LQ A+ LA   GDHA A +      
Sbjct: 534 RD-PDTWMDAKINGQIPWSPRGPKANDIEALWFESLQVANQLAKWNGDHALAEHCQTLAN 592

Query: 690 KAKAVY-DSLWNGSYF---NYDNSDGSSSTSIQADQLAGQWYARACGLLP 735
           K KA + D  W+       ++   D     SI+ +QL          L+P
Sbjct: 593 KVKASFIDKFWDSESMCLADHLKQDDVKDISIRPNQLMTLTIPTNQALIP 642


>gi|431796995|ref|YP_007223899.1| glycogen debranching protein [Echinicola vietnamensis DSM 17526]
 gi|430787760|gb|AGA77889.1| glycogen debranching enzyme [Echinicola vietnamensis DSM 17526]
          Length = 1213

 Score = 46.2 bits (108), Expect = 0.087,   Method: Compositional matrix adjust.
 Identities = 42/141 (29%), Positives = 68/141 (48%), Gaps = 16/141 (11%)

Query: 588 DLNSKFVLQVYRDFVATGDKNFARAVWPSVYIAMAYMEQ-FDKDGDGMIENEGFPDQTYD 646
           D+N  F  Q+ R F  TGDK + +AVWPS+   +A+ ++ FD DGDG+       +    
Sbjct: 419 DMNQVFFDQLLRHFDWTGDKAYLQAVWPSIERHLAWEKRNFDPDGDGLY------NAYAS 472

Query: 647 AWSANGVSAYCGGLWVA------ALQAASALANDVGDHASASYFWVRYQKAKAVYDSLW- 699
            W+++ +    GG+  A      A + A+ LA  +G+     Y        KA+ ++LW 
Sbjct: 473 IWASDALQYNSGGVAHASAYNYFANKKAAELAAFIGEDGK-QYAAEAEAILKAMNETLWL 531

Query: 700 -NGSYFNYDNSDGSSSTSIQA 719
            +G Y  Y +  G  +   QA
Sbjct: 532 PSGWYAEYKDFLGPQNLHPQA 552


>gi|317478064|ref|ZP_07937244.1| hypothetical protein HMPREF1007_00360 [Bacteroides sp. 4_1_36]
 gi|316905767|gb|EFV27541.1| hypothetical protein HMPREF1007_00360 [Bacteroides sp. 4_1_36]
          Length = 970

 Score = 46.2 bits (108), Expect = 0.090,   Method: Compositional matrix adjust.
 Identities = 49/257 (19%), Positives = 103/257 (40%), Gaps = 51/257 (19%)

Query: 594 VLQVYRDFVATGDKNFARAVWPSVYIAMAY-MEQFDKDGDGMIENEGFPDQTYDAWSANG 652
           +++VYR++  +GD  + + ++P+V  ++ Y +  +D    G +E         + W  +G
Sbjct: 612 IMKVYREWRISGDTQWMKDLFPAVKKSLDYCIRTWDPLHKGYLEEPHHNTYDIEFWGPDG 671

Query: 653 VSAYCGGLWVAALQAASALANDVGDHASASYFWVRYQKAKAVYDSLWNGSYF-------- 704
           +   C   ++ AL A   +  ++       Y  +  +  K +  +L++G YF        
Sbjct: 672 M---CTSFYLGALTAFIEMGKEL-KQPVKEYTALLSKGKKYMETALFDGEYFIQKIQWEG 727

Query: 705 -------NYDNSDGSSS-----------------TSIQADQLAGQWYARACGLLPIADEA 740
                  +  +  GS S                 T   +D + G W A  CGL  + D  
Sbjct: 728 LQAPNPVDVMSFGGSYSEEALKLLKEEGPKYQYGTGCLSDGILGMWMASVCGLDEVLDNE 787

Query: 741 KVKKALTKIYDFNVL---------KVKGGMCGAMNGM-----QPDGRIDMSGLQAREIWP 786
           KV+  L  ++ +N+          +     CG   G+        G + +  + + E+W 
Sbjct: 788 KVRSHLVAVHKYNLKHDLIDHFNPQRPVYACGKDGGLLLCTWPKGGMLSLPFVYSNEVWT 847

Query: 787 GVTYGLAASMIQEEMVD 803
           G+ Y +A+ ++ +  V+
Sbjct: 848 GIEYQVASHLMMKGEVE 864


>gi|268317317|ref|YP_003291036.1| amylo-alpha-16-glucosidase [Rhodothermus marinus DSM 4252]
 gi|262334851|gb|ACY48648.1| Amylo-alpha-16-glucosidase [Rhodothermus marinus DSM 4252]
          Length = 868

 Score = 45.8 bits (107), Expect = 0.10,   Method: Compositional matrix adjust.
 Identities = 38/147 (25%), Positives = 65/147 (44%), Gaps = 19/147 (12%)

Query: 560 GAVPHDI----GLDDPWFEINSYNLFNSSRWKDLNSKFVLQVYRDFVATGDKNFARAVWP 615
           G +PH+I     L D WFE   Y   ++    D    F++     + A+GD ++ R  W 
Sbjct: 363 GKIPHEISQSAALID-WFEDYPYPWASA----DATPLFIIAHADYWQASGDLDYIREHWD 417

Query: 616 SVYIAMAYMEQFDKDGDGMIENEGFPDQTYDAWSANGV------SAYCGGLWVAALQAAS 669
           ++  A  +    D DG+ ++EN G        W   G+        Y  G+W+AAL+   
Sbjct: 418 ALVRAYRFTAGTDTDGNDLVENTG----VGHGWVEGGLLYPPHEELYQQGVWLAALEGME 473

Query: 670 ALANDVGDHASASYFWVRYQKAKAVYD 696
           A+A  +G+   A+    R  + +A  +
Sbjct: 474 AMATALGETELAAEVRQRAVRVRAAIE 500


>gi|345303570|ref|YP_004825472.1| Amylo-alpha-16-glucosidase [Rhodothermus marinus SG0.5JP17-172]
 gi|345112803|gb|AEN73635.1| Amylo-alpha-16-glucosidase [Rhodothermus marinus SG0.5JP17-172]
          Length = 868

 Score = 45.8 bits (107), Expect = 0.12,   Method: Compositional matrix adjust.
 Identities = 38/147 (25%), Positives = 65/147 (44%), Gaps = 19/147 (12%)

Query: 560 GAVPHDI----GLDDPWFEINSYNLFNSSRWKDLNSKFVLQVYRDFVATGDKNFARAVWP 615
           G +PH+I     L D WFE   Y   ++    D    F++     + A+GD ++ R  W 
Sbjct: 363 GKIPHEISQSAALID-WFEDYPYPWASA----DATPLFIIAHADYWQASGDLDYIREHWD 417

Query: 616 SVYIAMAYMEQFDKDGDGMIENEGFPDQTYDAWSANGV------SAYCGGLWVAALQAAS 669
           ++  A  +    D DG+ ++EN G        W   G+        Y  G+W+AAL+   
Sbjct: 418 ALVRAYRFTAGTDTDGNDLVENTG----VGHGWVEGGLLYPPHEELYQQGVWLAALEGME 473

Query: 670 ALANDVGDHASASYFWVRYQKAKAVYD 696
           A+A  +G+   A+    R  + +A  +
Sbjct: 474 AMATALGETELAAEVRQRAVRVRAAIE 500


>gi|170743420|ref|YP_001772075.1| amylo-alpha-16-glucosidase [Methylobacterium sp. 4-46]
 gi|168197694|gb|ACA19641.1| Amylo-alpha-16-glucosidase [Methylobacterium sp. 4-46]
          Length = 764

 Score = 45.4 bits (106), Expect = 0.13,   Method: Compositional matrix adjust.
 Identities = 36/127 (28%), Positives = 59/127 (46%), Gaps = 18/127 (14%)

Query: 588 DLNSKFVLQVYRDFVATGDKNFARAVWPSVYIAMAYMEQF-DKDGDGMIE---------- 636
           D    FV+   + F  TGD+   +A+WP++  A+ +++ + D+DGDG +E          
Sbjct: 412 DATPLFVMLAGQYFEQTGDRATVQAIWPNILAALEWIDTYGDRDGDGFVEYARETDKGLA 471

Query: 637 NEGFPDQ-----TYDAWSANGVSAYC--GGLWVAALQAASALANDVGDHASASYFWVRYQ 689
           N+G+ D        D   A G  A C   G   AA +AAS LA  +G  A +       +
Sbjct: 472 NQGWKDSHDSIFHADGRMAQGPIALCEVQGYVYAAKRAASKLAGVLGHVALSERLKDAAE 531

Query: 690 KAKAVYD 696
           + +  +D
Sbjct: 532 RLRTAFD 538


>gi|154246650|ref|YP_001417608.1| amylo-alpha-16-glucosidase [Xanthobacter autotrophicus Py2]
 gi|154160735|gb|ABS67951.1| Amylo-alpha-16-glucosidase [Xanthobacter autotrophicus Py2]
          Length = 733

 Score = 45.4 bits (106), Expect = 0.14,   Method: Compositional matrix adjust.
 Identities = 36/124 (29%), Positives = 61/124 (49%), Gaps = 30/124 (24%)

Query: 604 TGDKNFARAVWPSVYIAMAYMEQF-DKDGDGMIE----------NEGFPDQTYDAWSANG 652
           TGD + AR +WP+V  A+ ++  + D+DGDG +E          N+G+ D     + A+G
Sbjct: 397 TGDLDTARLLWPAVAAALGWITDYGDRDGDGFVEYGRRSGDGLVNQGWKDSHDSVFHADG 456

Query: 653 -----------VSAYCGGLWVAALQAASALANDVGDHASASYFWVRYQKAKAVYDSLWNG 701
                      V AY  G W    QAA+++A  +G    A  F    +KA  + D+ ++ 
Sbjct: 457 RLARGPIALVEVQAYVFGAW----QAAASIARRLGQAPDAMVF---ERKADLLRDA-FDA 508

Query: 702 SYFN 705
           ++F+
Sbjct: 509 AFFD 512


>gi|449681370|ref|XP_002168270.2| PREDICTED: uncharacterized protein LOC100212022 [Hydra
           magnipapillata]
          Length = 455

 Score = 45.4 bits (106), Expect = 0.16,   Method: Compositional matrix adjust.
 Identities = 68/268 (25%), Positives = 104/268 (38%), Gaps = 35/268 (13%)

Query: 594 VLQVYRDFVATGDKNFARAVW-PSVYIAMAYMEQFDKDGDGMIENEGFPDQTYDAWSANG 652
           VL+ YR+      K++   VW P   +   +++  D D +GMI   G    TYD  +  G
Sbjct: 67  VLKTYREIRQGASKSWFNTVWNPIKLLINRWLDVMDTDKNGMIY--GPQPNTYDV-ATYG 123

Query: 653 VSAYCGGLWVAALQAASALANDVGDHASASYFWVRYQKAKAVYD--SLWNGSYFNYDNSD 710
            S   G L++ AL+A   +A    D   A     R+       D     NG ++ +    
Sbjct: 124 YSTLIGLLYLCALRATEEMAKLQNDMNFAKVCRDRFNLGTVNLDKACFTNGKWYTHVMDP 183

Query: 711 G------SSSTSIQADQLAGQWYARACGLLPIADEAKVKKALTKIYDFNVL-------KV 757
                   S T I    + GQW+A    L  +  +  +K  L   Y  N +       + 
Sbjct: 184 AHPFNVLGSCTFIGG--MYGQWWAHILNLGYLLPQEHIKSHLKYTYTRNFVPSFDPKTQS 241

Query: 758 KGGMC-----GAMNGMQPDGRIDMSGLQ-AREI-WPGVTYGLAASMIQE----EMVDMAF 806
               C     G + G+  DG    + L  + EI W GV Y  AA +I+E    E +D+  
Sbjct: 242 PRAFCDQRDSGWVIGVWDDGTKPANPLYYSSEIAWSGVIYPFAAMLIEEGLTKEGLDVLE 301

Query: 807 QTAA---GVYEVAWSEDGLGYSFQTPES 831
           +T     G     W+E   G  +  P S
Sbjct: 302 KTRYFYDGTRRSPWNEIECGDHYARPLS 329


>gi|300770983|ref|ZP_07080860.1| conserved hypothetical protein [Sphingobacterium spiritivorum ATCC
           33861]
 gi|300762256|gb|EFK59075.1| conserved hypothetical protein [Sphingobacterium spiritivorum ATCC
           33861]
          Length = 1205

 Score = 45.1 bits (105), Expect = 0.17,   Method: Compositional matrix adjust.
 Identities = 26/78 (33%), Positives = 43/78 (55%), Gaps = 15/78 (19%)

Query: 588 DLNSKFVLQVYRDFVATGDKNFARAVWPSVYIAMAYMEQ-FDKDGDGMIENEGFPDQTYD 646
           D+N  F  Q++  F  TGD+ F + VWP++   +A+ ++ FD DGDG+          YD
Sbjct: 418 DMNLVFFDQLFTHFQYTGDRTFLKEVWPAIKRHLAWEKRNFDMDGDGL----------YD 467

Query: 647 A----WSANGVSAYCGGL 660
           A    W+++G+    GG+
Sbjct: 468 AYCAIWASDGLQYSGGGV 485


>gi|424041235|ref|ZP_17779213.1| hypothetical protein VCHENC02_5228 [Vibrio cholerae HENC-02]
 gi|408890954|gb|EKM28915.1| hypothetical protein VCHENC02_5228 [Vibrio cholerae HENC-02]
          Length = 924

 Score = 45.1 bits (105), Expect = 0.17,   Method: Compositional matrix adjust.
 Identities = 44/170 (25%), Positives = 69/170 (40%), Gaps = 19/170 (11%)

Query: 578 YNLFNSSRWKDLNSKFVLQVYRDFVATGDKNFARAVWPSVYIAMAYMEQFDKDGDGMIEN 637
           YN  + + W       + +++     +GD  FA+ + P +   +  +E+   D DG++ +
Sbjct: 480 YNTTDGTPW------MIREIFEYINYSGDTQFAQQMLPIIERFIEGVEKHYLDDDGLMRH 533

Query: 638 EGFPDQTYDA-------WSANGVSAY-CGGLWVAALQAASALANDVGDHASASYFWVRYQ 689
              PD   DA       WS  G  A     LW  +LQ A  LA   GDHA A +      
Sbjct: 534 RD-PDTWMDAKINGQIPWSPRGPKANDIEALWFESLQVAIQLAKWNGDHALAEHCQTLAN 592

Query: 690 KAKAVY-DSLWNGSYF---NYDNSDGSSSTSIQADQLAGQWYARACGLLP 735
           K KA + D  W+       ++   D     SI+ +QL          L+P
Sbjct: 593 KVKASFIDKFWDSESMCLADHLKQDDVKDISIRPNQLMTLTIPTNQALIP 642


>gi|227537591|ref|ZP_03967640.1| conserved hypothetical protein [Sphingobacterium spiritivorum ATCC
           33300]
 gi|227242593|gb|EEI92608.1| conserved hypothetical protein [Sphingobacterium spiritivorum ATCC
           33300]
          Length = 1205

 Score = 45.1 bits (105), Expect = 0.19,   Method: Compositional matrix adjust.
 Identities = 26/78 (33%), Positives = 43/78 (55%), Gaps = 15/78 (19%)

Query: 588 DLNSKFVLQVYRDFVATGDKNFARAVWPSVYIAMAYMEQ-FDKDGDGMIENEGFPDQTYD 646
           D+N  F  Q++  F  TGD+ F + VWP++   +A+ ++ FD DGDG+          YD
Sbjct: 418 DMNLVFFDQLFTHFQYTGDRTFLKEVWPAIKRHLAWEKRNFDMDGDGL----------YD 467

Query: 647 A----WSANGVSAYCGGL 660
           A    W+++G+    GG+
Sbjct: 468 AYCAIWASDGLQYSGGGV 485


>gi|159040682|ref|YP_001539934.1| glycoside hydrolase 15-like protein [Caldivirga maquilingensis
           IC-167]
 gi|157919517|gb|ABW00944.1| glycoside hydrolase 15-related [Caldivirga maquilingensis IC-167]
          Length = 604

 Score = 45.1 bits (105), Expect = 0.20,   Method: Compositional matrix adjust.
 Identities = 36/120 (30%), Positives = 49/120 (40%), Gaps = 11/120 (9%)

Query: 588 DLNSKFVLQVYRDFVATGDKNFARAVWPSVYIAMAYMEQFDKDGDGMIENEGFPDQTYDA 647
           D  +  VL  Y  F  T D+ F +  W  V  A  Y+          + ++G    T   
Sbjct: 346 DETASVVLATYVHFKLTLDRQFLKNAWVMVRKAAEYLA-------ANVSDDGLTTPTVGP 398

Query: 648 WSAN-GVSAYCGGLWVAALQAASALANDVGDHASASYFWVRYQKA--KAVYDSLWNGSYF 704
           W  + GV  Y      AAL +AS LA ++GD   A   W RY         +  WNGS+F
Sbjct: 399 WEEHLGVHTYTNASVYAALSSASYLAGEIGDRNRAVE-WGRYASVIRSTTLNQAWNGSFF 457


>gi|398838157|ref|ZP_10595439.1| glycogen debranching enzyme [Pseudomonas sp. GM102]
 gi|398116719|gb|EJM06477.1| glycogen debranching enzyme [Pseudomonas sp. GM102]
          Length = 1066

 Score = 44.7 bits (104), Expect = 0.22,   Method: Compositional matrix adjust.
 Identities = 34/132 (25%), Positives = 57/132 (43%), Gaps = 26/132 (19%)

Query: 588 DLNSKFVLQVYRDFVATGDKNFARAVWPSVYIAMAYMEQF-DKDGDGMIE---------- 636
           D    FV+        T D    R +WP++  A++++E++ D+DGDG +E          
Sbjct: 714 DSTPLFVMLAAAYLERTNDLGTLRQLWPNIEAALSWIEEYGDRDGDGFVEYGRQSAEGLI 773

Query: 637 NEGFPDQTYDAWSANG-----------VSAYCGGLWVAALQAASALANDVGDHASASYFW 685
           N+G+ D     + ANG           V AY  G W      A+ +A  +GD   A    
Sbjct: 774 NQGWKDSHDSVFHANGQLAVGPIAIVEVQAYVYGAWT----GAAKIARRLGDPDRAQRLK 829

Query: 686 VRYQKAKAVYDS 697
            + Q+ +  +D+
Sbjct: 830 YKAQRLREEFDN 841


>gi|326797904|ref|YP_004315723.1| hypothetical protein [Sphingobacterium sp. 21]
 gi|326548668|gb|ADZ77053.1| hypothetical protein Sph21_0471 [Sphingobacterium sp. 21]
          Length = 1132

 Score = 44.3 bits (103), Expect = 0.35,   Method: Compositional matrix adjust.
 Identities = 31/101 (30%), Positives = 50/101 (49%), Gaps = 21/101 (20%)

Query: 588 DLNSKFVLQVYRDFVATGDKNFARAVWPSVYIAMAYMEQ-FDKDGDGMIENEGFPDQTYD 646
           D+N  F  Q+Y  F  TGD  F +A WP++   +A+ ++ FD+DGDG+          YD
Sbjct: 416 DMNLVFFDQLYNHFQYTGDTAFIKASWPAIKRHLAWEKRNFDQDGDGL----------YD 465

Query: 647 A----WSANGVSAYCGGLW------VAALQAASALANDVGD 677
           A    W+++ +    GG+           + A+ LA  VG+
Sbjct: 466 AYCAIWASDALQYSGGGVTHTSAYNYRGFEMAAKLATIVGE 506


>gi|170749555|ref|YP_001755815.1| amylo-alpha-16-glucosidase [Methylobacterium radiotolerans JCM
           2831]
 gi|170656077|gb|ACB25132.1| Amylo-alpha-16-glucosidase [Methylobacterium radiotolerans JCM
           2831]
          Length = 781

 Score = 43.9 bits (102), Expect = 0.40,   Method: Compositional matrix adjust.
 Identities = 36/133 (27%), Positives = 57/133 (42%), Gaps = 19/133 (14%)

Query: 593 FVLQVYRDFVATGDKNFARAVWPSVYIAMAYMEQF-DKDGDGMI----------ENEGFP 641
           FV+  +  +  TGD    RA+WP++ +A+ +M+++ D+DGDG +          EN+G+ 
Sbjct: 434 FVMLAWEYYAVTGDLETVRAIWPALELALEWMDRYGDRDGDGFVEYARDTDKGLENQGWK 493

Query: 642 DQ-----TYDAWSANGVSAYC--GGLWVAALQAASALANDVGDHASASYFWVRYQKAKAV 694
           D        D   A G  A C   G   AA    + LA  +G    A          +  
Sbjct: 494 DSHDSIFHADGHLAKGPIALCEVQGYVYAAKNGMAKLAALIGHDPMAERLTQEAASLRER 553

Query: 695 YD-SLWNGSYFNY 706
           +D + WN     Y
Sbjct: 554 FDAAFWNEEIGTY 566


>gi|373851062|ref|ZP_09593863.1| transcriptional regulator, GntR family [Opitutaceae bacterium TAV5]
 gi|372477227|gb|EHP37236.1| transcriptional regulator, GntR family [Opitutaceae bacterium TAV5]
          Length = 938

 Score = 43.9 bits (102), Expect = 0.46,   Method: Compositional matrix adjust.
 Identities = 29/114 (25%), Positives = 57/114 (50%), Gaps = 8/114 (7%)

Query: 604 TGDKNFARAVWPSVYIAMAYMEQFDKDGDGMIENEG---FPDQTYDA----WSANGVSAY 656
           TGD +     WP++ +A+ +   F++  DG++       + D   DA      +  V A 
Sbjct: 602 TGDDSLLAHWWPNLLLAIRWFSAFERPDDGLLARVPHWIYIDMGRDAAGRGMGSGEVLAT 661

Query: 657 CGGLWVAALQAASALANDVGDHASASYFWVRYQK-AKAVYDSLWNGSYFNYDNS 709
               +++AL+A +  AN +GD AS ++F  R ++ A ++ +  WN +   Y ++
Sbjct: 662 LNLQYLSALRAIAGYANRLGDQASHAWFASRAERLASSLREQFWNETEHGYADA 715


>gi|393766406|ref|ZP_10354962.1| amylo-alpha-16-glucosidase [Methylobacterium sp. GXF4]
 gi|392728187|gb|EIZ85496.1| amylo-alpha-16-glucosidase [Methylobacterium sp. GXF4]
          Length = 780

 Score = 43.5 bits (101), Expect = 0.50,   Method: Compositional matrix adjust.
 Identities = 18/45 (40%), Positives = 31/45 (68%), Gaps = 1/45 (2%)

Query: 593 FVLQVYRDFVATGDKNFARAVWPSVYIAMAYMEQF-DKDGDGMIE 636
           FV+  +  +  TGD    RA+WPS+ +A+ +M+++ D+DGDG +E
Sbjct: 433 FVMLAWEYYAVTGDLETIRAIWPSLELALEWMDRYGDRDGDGFVE 477


>gi|391229074|ref|ZP_10265280.1| transcriptional regulator [Opitutaceae bacterium TAV1]
 gi|391218735|gb|EIP97155.1| transcriptional regulator [Opitutaceae bacterium TAV1]
          Length = 939

 Score = 43.5 bits (101), Expect = 0.54,   Method: Compositional matrix adjust.
 Identities = 29/114 (25%), Positives = 57/114 (50%), Gaps = 8/114 (7%)

Query: 604 TGDKNFARAVWPSVYIAMAYMEQFDKDGDGMIENEG---FPDQTYDA----WSANGVSAY 656
           TGD +     WP++ +A+ +   F++  DG++       + D   DA      +  V A 
Sbjct: 603 TGDDSLLAHWWPNLLLAIRWFSAFERPDDGLLARVPHWIYIDMGRDAAGRGMGSGEVLAT 662

Query: 657 CGGLWVAALQAASALANDVGDHASASYFWVRYQK-AKAVYDSLWNGSYFNYDNS 709
               +++AL+A +  AN +GD AS ++F  R ++ A ++ +  WN +   Y ++
Sbjct: 663 LNLQYLSALRAVAGYANRLGDQASHAWFSGRAERLATSLREQFWNETEHGYADA 716


>gi|75676297|ref|YP_318718.1| amylo-alpha-1,6-glucosidase [Nitrobacter winogradskyi Nb-255]
 gi|74421167|gb|ABA05366.1| amylo-alpha-1,6-glucosidase [Nitrobacter winogradskyi Nb-255]
          Length = 741

 Score = 43.5 bits (101), Expect = 0.57,   Method: Compositional matrix adjust.
 Identities = 29/90 (32%), Positives = 42/90 (46%), Gaps = 22/90 (24%)

Query: 604 TGDKNFARAVWPSVYIAMAYMEQF-DKDGDGMIE----------NEGFPDQTYDAWSANG 652
           TGD    R +WP +  A+ ++EQ+ D+DGDG +E          N+G+ D     + A+G
Sbjct: 405 TGDVVTIRRLWPHIEAALTWIEQYGDRDGDGFVEYHRQTSEGLINQGWKDSYDSVFHADG 464

Query: 653 -----------VSAYCGGLWVAALQAASAL 671
                      V AY  G W AA + A  L
Sbjct: 465 TFAKGPIALVEVQAYVYGAWCAAAKIAEQL 494


>gi|384263211|ref|YP_005418399.1| amylo-alpha-16-glucosidase [Rhodospirillum photometricum DSM 122]
 gi|378404313|emb|CCG09429.1| Amylo-alpha-16-glucosidase [Rhodospirillum photometricum DSM 122]
          Length = 728

 Score = 42.7 bits (99), Expect = 0.84,   Method: Compositional matrix adjust.
 Identities = 28/74 (37%), Positives = 41/74 (55%), Gaps = 12/74 (16%)

Query: 604 TGDKNFARAVWPSVYIAMAYMEQF-DKDGDGMIE----------NEGFPDQTYDAWSANG 652
           TGD  FARA+WP+V  A+A+++   D+DGDG +E          N+G+ D     + A+G
Sbjct: 413 TGDGAFARALWPAVDAALAWIDHHGDRDGDGFVEYGRRCDDGLVNQGWKDSDDSVFHADG 472

Query: 653 VSAYCGGLWVAALQ 666
             A  G L V  +Q
Sbjct: 473 RLAQ-GPLAVCEVQ 485


>gi|403511592|ref|YP_006643230.1| bacterial alpha-L-rhamnosidase family protein [Nocardiopsis alba
           ATCC BAA-2165]
 gi|402801909|gb|AFR09319.1| bacterial alpha-L-rhamnosidase family protein [Nocardiopsis alba
           ATCC BAA-2165]
          Length = 712

 Score = 42.7 bits (99), Expect = 0.87,   Method: Compositional matrix adjust.
 Identities = 36/112 (32%), Positives = 53/112 (47%), Gaps = 18/112 (16%)

Query: 604 TGDKNFARAVWPSVYIAMAYMEQFDKDGDGMIENEGFPDQTYDA---------WS----A 650
           TGD  FARA+ P    A++++E    D DG+   E   D+  D          WS    A
Sbjct: 346 TGDTGFARAMLPVARRAISFLESR-TDADGLFREE--EDRRTDPEAPLAMVANWSPLDLA 402

Query: 651 NGVSAYCGGLWVAALQAASALANDVGDHASASY-FWVRYQK-AKAVYDSLWN 700
           +GV A    +   ALQ  +AL  DV D   A++    R ++  +A+ D LW+
Sbjct: 403 SGVDALTNAVLYDALQGLAALERDVADDVDAAHRLEERAERLRRALVDRLWD 454


>gi|334345035|ref|YP_004553587.1| amylo-alpha-16-glucosidase [Sphingobium chlorophenolicum L-1]
 gi|334101657|gb|AEG49081.1| Amylo-alpha-16-glucosidase [Sphingobium chlorophenolicum L-1]
          Length = 744

 Score = 42.7 bits (99), Expect = 1.00,   Method: Compositional matrix adjust.
 Identities = 34/118 (28%), Positives = 53/118 (44%), Gaps = 26/118 (22%)

Query: 604 TGDKNFARAVWPSVYIAMAYM-EQFDKDGDGMIE----------NEGFPDQTYDAWSANG 652
           TGD  F  ++ P V  A+A++ E  D+DGDG +E          N+G+ D     + A+G
Sbjct: 408 TGDSAFIASILPHVRAALAWIDEHGDRDGDGFVEYGRLTDQGLVNQGWKDSHDSVFHADG 467

Query: 653 -----------VSAYCGGLWVAALQAASALANDVGDHASASYFWVRYQKAKAVYDSLW 699
                      V AY  G W    +AA A+   +GD   A +F  R    +  +D+ +
Sbjct: 468 SIARGPIALAEVQAYVYGAW----RAAEAIFVTLGDPDRALHFRARADDLRERFDAAF 521


>gi|429221453|ref|YP_007173779.1| glycogen debranching protein [Deinococcus peraridilitoris DSM
           19664]
 gi|429132316|gb|AFZ69330.1| glycogen debranching enzyme [Deinococcus peraridilitoris DSM 19664]
          Length = 430

 Score = 42.4 bits (98), Expect = 1.2,   Method: Compositional matrix adjust.
 Identities = 28/94 (29%), Positives = 44/94 (46%), Gaps = 1/94 (1%)

Query: 588 DLNSKFVLQVYRDFVATGDKNFARAVWPSVYIAMAYMEQFDKDGDGMIENEGFPDQTYDA 647
           D N  FVL  Y     T D  F RA WP +  A+ ++   D +  G++E     D   D 
Sbjct: 99  DANMWFVLGHYACHRRTNDTAFLRAAWPRIQAALLWLRYQDMNDCGLLEAPEAADWA-DL 157

Query: 648 WSANGVSAYCGGLWVAALQAASALANDVGDHASA 681
           ++    + Y   L+V  L+AA  L+  +G+  +A
Sbjct: 158 YATRYNTLYANALYVGTLEAAGYLSAALGEDGAA 191


>gi|449672634|ref|XP_002169551.2| PREDICTED: uncharacterized protein LOC100201428 [Hydra
           magnipapillata]
          Length = 502

 Score = 42.4 bits (98), Expect = 1.3,   Method: Compositional matrix adjust.
 Identities = 44/165 (26%), Positives = 75/165 (45%), Gaps = 9/165 (5%)

Query: 588 DLNSKFVLQVYRDFVATGDKNFARAVWPSVYIAMAYMEQFDKDGDGMIENEGFPDQTYDA 647
           D+N  F++Q    +  + D  F   ++P    AM ++ Q    G G+   +  PD TYD 
Sbjct: 107 DVNVGFIVQALLMYKWSNDTVFFDEIYPFTVRAMNWLMQDATKGTGLPYRK--PD-TYDL 163

Query: 648 WSANGVS--AYCGGLWVAALQAASALANDVGDHASASYFWVRYQKA-KAVYDSLWN---G 701
           +        AY    ++ AL A  ++A+   D+ +         +A K +   +WN   G
Sbjct: 164 FDLEKYDHCAYNSISYLLALHAMKSMASMQKDNNTIKNVNAALARATKQLEVEMWNEKKG 223

Query: 702 SYFNYDNSDGSSSTSIQADQLAGQWYARACGLLPIADEAKVKKAL 746
            Y  + + +  S T + +D L GQ +A   GL  + D+ K+KK L
Sbjct: 224 FYHAWFDDEWGSPTWLMSDVLYGQVWAYTLGLGDLLDKDKMKKHL 268


>gi|217978466|ref|YP_002362613.1| amylo-alpha-16-glucosidase [Methylocella silvestris BL2]
 gi|217503842|gb|ACK51251.1| Amylo-alpha-16-glucosidase [Methylocella silvestris BL2]
          Length = 729

 Score = 42.4 bits (98), Expect = 1.4,   Method: Compositional matrix adjust.
 Identities = 35/115 (30%), Positives = 51/115 (44%), Gaps = 18/115 (15%)

Query: 588 DLNSKFVLQVYRDFVATGDKNFARAVWPSVYIAMAYMEQF-DKDGDGMIE---------- 636
           D    FV+     F  TGD      +WP++  A+ ++++F D+DGDG +E          
Sbjct: 377 DATPLFVMLAGMYFERTGDSETIAKIWPNILAALRWIDEFGDRDGDGFVEYLRETENGLA 436

Query: 637 NEGFPDQ-----TYDAWSANGVSAYC--GGLWVAALQAASALANDVGDHASASYF 684
           N+G+ D        D   A G  A C   G   AA  AA  LA  +GD  + + F
Sbjct: 437 NQGWKDSHDSIFHADGSLAEGPIALCEVQGYVYAAKSAAGRLARLMGDDDAGAKF 491


>gi|150376166|ref|YP_001312762.1| amylo-alpha-16-glucosidase [Sinorhizobium medicae WSM419]
 gi|150030713|gb|ABR62829.1| Amylo-alpha-16-glucosidase [Sinorhizobium medicae WSM419]
          Length = 734

 Score = 42.0 bits (97), Expect = 1.4,   Method: Compositional matrix adjust.
 Identities = 34/140 (24%), Positives = 63/140 (45%), Gaps = 32/140 (22%)

Query: 604 TGDKNFARAVWPSVYIAMAYMEQF-DKDGDGMIE----------NEGFPDQTYDAWSANG 652
           TGD +  R +WP++  A+ ++++F D+DGDG +E          N+ + D     + A+G
Sbjct: 398 TGDTDTVRGLWPNIVAALDWIDRFGDRDGDGFVEYGSRTAKGLVNQCWKDSHDSIFHADG 457

Query: 653 -----------VSAYCGGLWVAALQAASALANDVGDHASASYFWVRYQKAKAVYDSLWNG 701
                      V AY  G W    +AA+ L+  +G    A     R +  +  +D+ +  
Sbjct: 458 RLAKGPVATAEVQAYIFGAW----RAAARLSRKLGHAEDALRLEQRAEDLRIRFDAAF-- 511

Query: 702 SYFNYDNSDGSSSTSIQADQ 721
               +D   G+ S ++  D+
Sbjct: 512 ----FDEELGTYSLALDGDK 527


>gi|408404211|ref|YP_006862194.1| glycogen debranching enzyme [Candidatus Nitrososphaera gargensis
           Ga9.2]
 gi|408364807|gb|AFU58537.1| putative glycogen debranching enzyme [Candidatus Nitrososphaera
           gargensis Ga9.2]
          Length = 863

 Score = 42.0 bits (97), Expect = 1.5,   Method: Compositional matrix adjust.
 Identities = 70/312 (22%), Positives = 126/312 (40%), Gaps = 49/312 (15%)

Query: 575 INSYNLFNSSRWKDLNSKFVLQ-VYRDFV-ATGDKNFARAVWPSVYIAMAYMEQFDKDGD 632
           I+     +++ +   +S F+     R++V  TGD  + +  W S+   + Y    D DGD
Sbjct: 498 ISGKAFLHTTTYGSADSTFLFPWAIREYVLGTGDIGYLKKRWDSIVDLVNYGFLKDVDGD 557

Query: 633 GMIEN------EGFPDQTYDAW--------SANGVSAYCGGLWVAALQAASALANDVGDH 678
           G+IE+      E  P Q    W        SAN + A    L+  +L+  S LA   GD 
Sbjct: 558 GLIEHGFTGTAEKLPIQD-STWMDHIDRRKSANDIQA----LFYESLRIGSELAAMAGDS 612

Query: 679 ASASYFWVRYQKAKAVYD-SLWN---GSYFNYDNSDGSSSTSIQADQLAGQWYARACGLL 734
            + + +  + ++ +   D   WN   G Y++    DGS   SI+ + L          LL
Sbjct: 613 ENEARWSSKAKELQETIDREYWNTSAGFYYDTIRKDGSKDPSIRPNAL--------VLLL 664

Query: 735 P--IADEAKVKKALTKIYDFNVLKVKGGMCGAMNGMQPDGRIDMSGLQAREIWPGVTYGL 792
              + + ++    L++I + ++        G       D +   +      +WP VT   
Sbjct: 665 SDVVRERSRADSVLSRIEEQDMTTA----WGVRTLSSKDQKYQPTLYHDGAVWPLVTGWA 720

Query: 793 AASMIQEEMVDMAFQTAAGVYEVAWSEDGL---GYSFQTPESWNNNDEYRSLCYMRPLTI 849
           AAS I+    + A +    + E    E+G+    Y    PE +N+       C ++  ++
Sbjct: 721 AASEIKFGRHEQALRYMGSMAERILHENGMFAETYRGDRPEPFNS-------CILQAWSV 773

Query: 850 WAMQWALTKPKL 861
               +AL +  L
Sbjct: 774 GMYAYALREMML 785


>gi|170060875|ref|XP_001865995.1| conserved hypothetical protein [Culex quinquefasciatus]
 gi|167879232|gb|EDS42615.1| conserved hypothetical protein [Culex quinquefasciatus]
          Length = 183

 Score = 42.0 bits (97), Expect = 1.5,   Method: Compositional matrix adjust.
 Identities = 30/99 (30%), Positives = 42/99 (42%), Gaps = 29/99 (29%)

Query: 575 INSYNLFNSSRWKDLNSKFVLQVYRDFVATGDKNFARAVWPSVYI--------------- 619
           + +Y + +   WKDLNSKF+LQVYRD+       +A       +I               
Sbjct: 54  VRAYPIHDVYEWKDLNSKFILQVYRDYYTQLKAEYAGKFSSIEFINKESFRNRMSRTERD 113

Query: 620 -------------AMAYMEQFDKDGDGMIENEGFPDQTY 645
                        A  Y  ++D D DG+IEN+  PD TY
Sbjct: 114 VLLEENVSGLQAGAGEYPLEWDMDNDGLIENKP-PDLTY 151


>gi|418404986|ref|ZP_12978419.1| amylo-alpha-16-glucosidase [Sinorhizobium meliloti CCNWSX0020]
 gi|359501068|gb|EHK73697.1| amylo-alpha-16-glucosidase [Sinorhizobium meliloti CCNWSX0020]
          Length = 734

 Score = 42.0 bits (97), Expect = 1.6,   Method: Compositional matrix adjust.
 Identities = 35/140 (25%), Positives = 64/140 (45%), Gaps = 32/140 (22%)

Query: 604 TGDKNFARAVWPSVYIAMAYMEQF-DKDGDGMIE----------NEGFPDQTYDAWSANG 652
           TGD +  R +WP+V  A+ ++++F D+DGDG +E          N+ + D     + A G
Sbjct: 398 TGDLDTVRNLWPNVVAALDWIDRFGDRDGDGFVEYGSRTAKGLVNQCWKDSHDSIFHAGG 457

Query: 653 -----------VSAYCGGLWVAALQAASALANDVGDHASASYFWVRYQKAKAVYDSLWNG 701
                      V AY  G W    QAA+ L+  +G    A     R ++ +  +D+ +  
Sbjct: 458 NLASGPIATAEVQAYVFGAW----QAAARLSRKLGHVEDALRLERRAEELRIRFDTAF-- 511

Query: 702 SYFNYDNSDGSSSTSIQADQ 721
               +D+  G+ S ++  ++
Sbjct: 512 ----FDDELGTYSLALDGEK 527


>gi|16263864|ref|NP_436656.1| hypothetical protein SM_b20116 [Sinorhizobium meliloti 1021]
 gi|334319999|ref|YP_004556628.1| amylo-alpha-16-glucosidase [Sinorhizobium meliloti AK83]
 gi|384539727|ref|YP_005723811.1| hypothetical protein SM11_pD1478 [Sinorhizobium meliloti SM11]
 gi|407724161|ref|YP_006843822.1| hypothetical protein BN406_06540 [Sinorhizobium meliloti Rm41]
 gi|15139988|emb|CAC48516.1| CONSERVED HYPOTHETICAL PROTEIN [Sinorhizobium meliloti 1021]
 gi|334097738|gb|AEG55748.1| Amylo-alpha-16-glucosidase [Sinorhizobium meliloti AK83]
 gi|336038380|gb|AEH84310.1| conserved hypothetical protein [Sinorhizobium meliloti SM11]
 gi|407324221|emb|CCM72822.1| hypothetical protein BN406_06540 [Sinorhizobium meliloti Rm41]
          Length = 734

 Score = 42.0 bits (97), Expect = 1.6,   Method: Compositional matrix adjust.
 Identities = 35/140 (25%), Positives = 64/140 (45%), Gaps = 32/140 (22%)

Query: 604 TGDKNFARAVWPSVYIAMAYMEQF-DKDGDGMIE----------NEGFPDQTYDAWSANG 652
           TGD +  R +WP+V  A+ ++++F D+DGDG +E          N+ + D     + A G
Sbjct: 398 TGDLDTVRNLWPNVVAALDWIDRFGDRDGDGFVEYGSRTAKGLVNQCWKDSHDSIFHAGG 457

Query: 653 -----------VSAYCGGLWVAALQAASALANDVGDHASASYFWVRYQKAKAVYDSLWNG 701
                      V AY  G W    QAA+ L+  +G    A     R ++ +  +D+ +  
Sbjct: 458 NLASGPIATAEVQAYVFGAW----QAAARLSRKLGHVEDALRLERRAEELRIRFDTAF-- 511

Query: 702 SYFNYDNSDGSSSTSIQADQ 721
               +D+  G+ S ++  ++
Sbjct: 512 ----FDDELGTYSLALDGEK 527


>gi|374372806|ref|ZP_09630467.1| hypothetical protein NiasoDRAFT_3452 [Niabella soli DSM 19437]
 gi|373234882|gb|EHP54674.1| hypothetical protein NiasoDRAFT_3452 [Niabella soli DSM 19437]
          Length = 872

 Score = 41.6 bits (96), Expect = 1.9,   Method: Compositional matrix adjust.
 Identities = 58/310 (18%), Positives = 112/310 (36%), Gaps = 31/310 (10%)

Query: 64  KQEEAKGKVPVFNVFRKHHITSDQGIPLGGIGAGSIGRSYRGEFQRFKLFHGICDDAPVL 123
           K+E  +      + +R  H+ S    P+GG+GAG       G      +     ++ P +
Sbjct: 44  KEETPRTSRAYNSAYRDEHL-SRVAFPIGGLGAGMFCLEGSGAVSHMSI-----NNHPEI 97

Query: 124 ANQFSVF--VSRPNGEKFSSVL---CPRSPGVPKKNTDSGIESWDWNL-KGENCTYHALF 177
             + ++F  ++    +  + VL    P      ++ T +G    ++ L + +N ++ A F
Sbjct: 98  FFEPNIFAAIAIKGAQPMAKVLEGPVPEWKIFGQRGTGNGASGTNYGLPRFDNASFLARF 157

Query: 178 PRAWTVYDGEPDPELRIVCRQISPFIPHNYKESSFPASVFTFTLSNSGQTSADVTLLFTW 237
           P   T+   + +  + +     SPFIP +   S  P     +T +N    + +    +  
Sbjct: 158 PFG-TIELKDNEMPMDVAINGWSPFIPGDEDNSGIPVGALEYTFANRSSKTVEAVFSYNA 216

Query: 238 ANSVAGDSGLSGHHFNSKTMTKDGVHGLTLHHRTANGRPPVTFAVAAEETADVHVSECPC 297
            N VA            K   +   +G  L       +P      A   TA+       C
Sbjct: 217 KNFVA---------RGEKAGIRKLENGFILSQGQGKDKPFTQTDFAIYTTANEQPVVDYC 267

Query: 298 FLLSGNSKGITAKDMWNEIKKHGSFDHLDNDKTSPSEPGSSIGAAIAASLTIPSGSTRSV 357
           +   G    +T   +WN I              +P     + GA++     +  G+ + +
Sbjct: 268 WFRGGWFDPVTM--IWNTIAD-------GKVSNTPPVAADAPGASLFVPFKLAPGAKKII 318

Query: 358 TFSLAWDCPE 367
              +AW  PE
Sbjct: 319 RVMMAWYVPE 328


>gi|433611718|ref|YP_007195179.1| Glycogen debranching enzyme [Sinorhizobium meliloti GR4]
 gi|429556660|gb|AGA11580.1| Glycogen debranching enzyme [Sinorhizobium meliloti GR4]
          Length = 734

 Score = 41.6 bits (96), Expect = 2.0,   Method: Compositional matrix adjust.
 Identities = 35/140 (25%), Positives = 64/140 (45%), Gaps = 32/140 (22%)

Query: 604 TGDKNFARAVWPSVYIAMAYMEQF-DKDGDGMIE----------NEGFPDQTYDAWSANG 652
           TGD +  R +WP+V  A+ ++++F D+DGDG +E          N+ + D     + A G
Sbjct: 398 TGDLDTVRNLWPNVVAALDWIDRFGDRDGDGFVEYGSRTAKGLVNQCWKDSHDSIFHAGG 457

Query: 653 -----------VSAYCGGLWVAALQAASALANDVGDHASASYFWVRYQKAKAVYDSLWNG 701
                      V AY  G W    QAA+ L+  +G    A     R ++ +  +D+ +  
Sbjct: 458 NLASGPIATAEVQAYVFGAW----QAAARLSRRLGHVEDALRLERRAEELRIRFDTAF-- 511

Query: 702 SYFNYDNSDGSSSTSIQADQ 721
               +D+  G+ S ++  ++
Sbjct: 512 ----FDDELGTYSLALDGEK 527


>gi|373951743|ref|ZP_09611703.1| alpha-L-rhamnosidase [Mucilaginibacter paludis DSM 18603]
 gi|373888343|gb|EHQ24240.1| alpha-L-rhamnosidase [Mucilaginibacter paludis DSM 18603]
          Length = 1252

 Score = 41.2 bits (95), Expect = 2.6,   Method: Compositional matrix adjust.
 Identities = 39/144 (27%), Positives = 68/144 (47%), Gaps = 35/144 (24%)

Query: 588 DLNSKFVLQVYRDFVATGDKNFARAVWPSVYIAMAYMEQ-FDKDGDGMIENEGFPDQTYD 646
           D+N  F+ ++   F  TGD ++ + +WP +   +A+ ++ FD DGDG+          YD
Sbjct: 442 DMNLVFIDELLNHFNWTGDLDYVKQMWPVIKRHLAWEKRNFDADGDGL----------YD 491

Query: 647 A----WSANGVSAYCGGLWVA------ALQAASALANDVGDHASASYFWVRYQKA----- 691
           A    W+++ +    GG+  +      A +AA+ LA  +G+ A+       YQK      
Sbjct: 492 AYCAIWASDALQYSGGGVTHSSAYNYRANKAAAQLAKLIGEDAAP------YQKEAEHIL 545

Query: 692 KAVYDSLW---NGSYFNYDNSDGS 712
            A+  +LW    G Y  Y +  G+
Sbjct: 546 NAINKTLWMPQTGQYAEYQDLLGN 569


>gi|398384505|ref|ZP_10542535.1| glycogen debranching enzyme [Sphingobium sp. AP49]
 gi|397722664|gb|EJK83200.1| glycogen debranching enzyme [Sphingobium sp. AP49]
          Length = 738

 Score = 41.2 bits (95), Expect = 2.7,   Method: Compositional matrix adjust.
 Identities = 34/118 (28%), Positives = 52/118 (44%), Gaps = 26/118 (22%)

Query: 604 TGDKNFARAVWPSVYIAMAYMEQF-DKDGDGMIE----------NEGFPDQTYDAWSANG 652
           TGD  F  A+ P V  A+A+++   D+DGDG +E          N+G+ D     + A+G
Sbjct: 402 TGDAAFLAAILPHVEAALAWIDAHGDRDGDGFVEYGRLTDQGLQNQGWKDSHDSIFHADG 461

Query: 653 -----------VSAYCGGLWVAALQAASALANDVGDHASASYFWVRYQKAKAVYDSLW 699
                      V AY  G W    QAA A+   VG    A+ +  R    +  +D+ +
Sbjct: 462 SIARGPIALAEVQAYVYGAW----QAAGAIFAAVGQEDKATGYRARADDLRTRFDAAF 515


>gi|294012573|ref|YP_003546033.1| amylo-alpha-1,6-glucosidase [Sphingobium japonicum UT26S]
 gi|292675903|dbj|BAI97421.1| amylo-alpha-1,6-glucosidase [Sphingobium japonicum UT26S]
          Length = 749

 Score = 41.2 bits (95), Expect = 2.8,   Method: Compositional matrix adjust.
 Identities = 32/118 (27%), Positives = 53/118 (44%), Gaps = 26/118 (22%)

Query: 604 TGDKNFARAVWPSVYIAMAYM-EQFDKDGDGMIE----------NEGFPDQTYDAWSANG 652
           TGD  F  ++ P V  A++++ E  D+DGDG +E          N+G+ D     + A+G
Sbjct: 413 TGDSGFIASILPHVEAALSWIDEHGDRDGDGFVEYGRLTDQGLVNQGWKDSHDSVFHADG 472

Query: 653 -----------VSAYCGGLWVAALQAASALANDVGDHASASYFWVRYQKAKAVYDSLW 699
                      V AY  G W    +AA A+   +GD   A  + +R    +  +D+ +
Sbjct: 473 GIARGPIALAEVQAYVYGAW----RAAEAIFTTLGDPDKARTYRMRADDLRERFDAAF 526


>gi|333978445|ref|YP_004516390.1| glycoside hydrolase 15-like protein [Desulfotomaculum kuznetsovii
           DSM 6115]
 gi|333821926|gb|AEG14589.1| glycoside hydrolase 15-related protein [Desulfotomaculum
           kuznetsovii DSM 6115]
          Length = 671

 Score = 41.2 bits (95), Expect = 3.0,   Method: Compositional matrix adjust.
 Identities = 28/106 (26%), Positives = 45/106 (42%), Gaps = 11/106 (10%)

Query: 598 YRDFVATGDKNFARAVWPSVYIAMAYMEQFDKDGDGMIENEGFPDQTYDAWSANGV-SAY 656
           +  F  T D+ F + +WPS ++  AY+ +        +   G P+   D W  N + S Y
Sbjct: 381 HHHFKLTRDREFLQKIWPSTFLGAAYLVEH-------LSPNGLPEPGLDLWEDNFIQSTY 433

Query: 657 CGGLWVAALQAASALANDVGDHASASYFWVRYQKA--KAVYDSLWN 700
                   L+ A+ALA  +G    A   W    +   K++   LWN
Sbjct: 434 AAAAIYGGLRGAAALAGSIGAPEKAKK-WSEAAETVRKSILTHLWN 478


>gi|384533984|ref|YP_005716648.1| amylo-alpha-16-glucosidase [Sinorhizobium meliloti BL225C]
 gi|333816160|gb|AEG08827.1| Amylo-alpha-16-glucosidase [Sinorhizobium meliloti BL225C]
          Length = 734

 Score = 41.2 bits (95), Expect = 3.0,   Method: Compositional matrix adjust.
 Identities = 35/136 (25%), Positives = 62/136 (45%), Gaps = 32/136 (23%)

Query: 604 TGDKNFARAVWPSVYIAMAYMEQF-DKDGDGMIE----------NEGFPDQTYDAWSANG 652
           TGD +  R +WP+V  A+ ++++F D+DGDG +E          N+ + D     + A G
Sbjct: 398 TGDLDTVRNLWPNVVAALDWIDRFGDRDGDGFVEYGSRTAKGLVNQCWKDSHDSIFHAGG 457

Query: 653 -----------VSAYCGGLWVAALQAASALANDVGDHASASYFWVRYQKAKAVYDSLWNG 701
                      V AY  G W    QAA+ L+  +G    A     R ++ +  +D+ +  
Sbjct: 458 NLASGPIATAEVQAYVFGAW----QAAARLSRKLGHVEDALRLERRAEELRIRFDTAF-- 511

Query: 702 SYFNYDNSDGSSSTSI 717
               +D+  G+ S ++
Sbjct: 512 ----FDDELGTYSLAL 523


>gi|330507058|ref|YP_004383486.1| glycogen debranching protein [Methanosaeta concilii GP6]
 gi|328927866|gb|AEB67668.1| glycogen debranching enzyme, archaeal type, putative [Methanosaeta
           concilii GP6]
          Length = 621

 Score = 40.8 bits (94), Expect = 3.5,   Method: Compositional matrix adjust.
 Identities = 42/163 (25%), Positives = 67/163 (41%), Gaps = 37/163 (22%)

Query: 532 DFAAAVMMHDPGTMKIMSDGKWVARKCL----------GAVPHDIGLDDPWFEINSYNLF 581
           D+    M+  PG +  +S G++ A + +          G +P+D+G         SYN  
Sbjct: 288 DWGRDAMIALPGIL--LSTGRYEAARLVLSSFAGAMKDGVLPNDLG-------ARSYNTV 338

Query: 582 NSSRWKDLNSKFVLQVYRDFVATGDKNFARAVWPSVYIAMAYM----EQFDKDGDGMIEN 637
           ++S W      FV    R F ATGD +F   +WP +   +       + F  D DG+I N
Sbjct: 339 DASLW------FVQATSRYFDATGDLDFLAELWPKLLDVVERYSRPGDDFGADEDGLI-N 391

Query: 638 EG----FPDQTYDAWSANGVSAYC---GGLWVAALQAASALAN 673
            G    + D   D       +  C     LW +AL+   +L+ 
Sbjct: 392 SGPALTWMDARVDGRPVTSRTGKCCEINALWYSALRRTESLSR 434


>gi|390166485|ref|ZP_10218748.1| amylo-alpha-1,6-glucosidase [Sphingobium indicum B90A]
 gi|389590882|gb|EIM68867.1| amylo-alpha-1,6-glucosidase [Sphingobium indicum B90A]
          Length = 749

 Score = 40.8 bits (94), Expect = 3.6,   Method: Compositional matrix adjust.
 Identities = 32/118 (27%), Positives = 52/118 (44%), Gaps = 26/118 (22%)

Query: 604 TGDKNFARAVWPSVYIAMAYM-EQFDKDGDGMIE----------NEGFPDQTYDAWSANG 652
           TGD  F  ++ P V  A++++ E  D+DGDG +E          N+G+ D     + A+G
Sbjct: 413 TGDSGFIASILPHVEAALSWIDEHGDRDGDGFVEYGRLTDQGLVNQGWKDSHDSVFHADG 472

Query: 653 -----------VSAYCGGLWVAALQAASALANDVGDHASASYFWVRYQKAKAVYDSLW 699
                      V AY  G W    +AA A+   +GD   A  +  R    +  +D+ +
Sbjct: 473 GIARGPIALAEVQAYVYGAW----RAAEAIFTTLGDPDKARAYRTRADDLRERFDAAF 526


>gi|323495701|ref|ZP_08100771.1| glycogen debranching enzyme [Vibrio sinaloensis DSM 21326]
 gi|323319168|gb|EGA72109.1| glycogen debranching enzyme [Vibrio sinaloensis DSM 21326]
          Length = 924

 Score = 40.4 bits (93), Expect = 4.2,   Method: Compositional matrix adjust.
 Identities = 34/105 (32%), Positives = 48/105 (45%), Gaps = 10/105 (9%)

Query: 604 TGDKNFARAVWPSVYIAMAYMEQFDKDGDGMIENEGFPDQTYDA-------WSANGVSAY 656
           +GD +FA+ ++P V   +A +E+   D DG+I++   PD   DA       WS  G  A 
Sbjct: 498 SGDIDFAKRIYPVVQRFIAGVERNYLDQDGLIKHR-HPDTWMDAKIDGQIPWSPRGPKAN 556

Query: 657 -CGGLWVAALQAASALANDVGDHASASYFWVRYQKAKAVY-DSLW 699
               LW  AL  A  LA    D   A +     Q+AK  + D  W
Sbjct: 557 DIQALWFDALHVAQTLAQLNQDDVYADHVVSLAQQAKQSFIDKFW 601


>gi|257052439|ref|YP_003130272.1| glycosyltransferase 36 [Halorhabdus utahensis DSM 12940]
 gi|256691202|gb|ACV11539.1| glycosyltransferase 36 [Halorhabdus utahensis DSM 12940]
          Length = 815

 Score = 40.4 bits (93), Expect = 4.4,   Method: Compositional matrix adjust.
 Identities = 34/124 (27%), Positives = 56/124 (45%), Gaps = 17/124 (13%)

Query: 637 NEGFPDQTYDAWSANGVSAYCGGLWVAALQAASALA--NDVGDHASASYFWVRYQKAKAV 694
           +E F    +D       S + GG +V ALQ  + LA   D+   ++  Y     + A+AV
Sbjct: 497 DESFQTAEHDVEEERAESVFIGGQFVYALQELAELAEQTDILPESAEEYQGYADEMAEAV 556

Query: 695 YDSLWNGSYF-----NYDNSDGSSSTSIQADQLAGQWYARACGLLPIA----DEAKVKKA 745
            D+ W+G +F     +Y +  GSS          GQ +  + G+  +A    DE KV+ A
Sbjct: 557 ADAGWDGEWFRRAYDHYSDPIGSSEND------EGQIFVESNGMCGMAGIGRDEGKVQDA 610

Query: 746 LTKI 749
           +  +
Sbjct: 611 MDSV 614


>gi|240138186|ref|YP_002962658.1| hypothetical protein MexAM1_META1p1519 [Methylobacterium extorquens
           AM1]
 gi|418061390|ref|ZP_12699252.1| Amylo-alpha-16-glucosidase [Methylobacterium extorquens DSM 13060]
 gi|240008155|gb|ACS39381.1| conserved hypothetical protein, putative glycosyltransferase
           [Methylobacterium extorquens AM1]
 gi|373565054|gb|EHP91121.1| Amylo-alpha-16-glucosidase [Methylobacterium extorquens DSM 13060]
          Length = 778

 Score = 40.4 bits (93), Expect = 4.8,   Method: Compositional matrix adjust.
 Identities = 31/103 (30%), Positives = 51/103 (49%), Gaps = 18/103 (17%)

Query: 593 FVLQVYRDFVATGDKNFARAVWPSVYIAMAYMEQF-DKDGDGMIE----------NEGFP 641
           FV+   +    TGD +  RA+WP + +A+ +++ + D+DGDG +E          N+G+ 
Sbjct: 431 FVMLAAQYHEVTGDLDTIRAIWPQLKLALEWIDTYGDRDGDGFVEYARETEQGLANQGWK 490

Query: 642 DQ-----TYDAWSANGVSAYC--GGLWVAALQAASALANDVGD 677
           D        D   A G  A C   G   AA + A+ LA+ +G+
Sbjct: 491 DSHDSIFHADGAMAKGPIALCEVQGYVYAAKRGAARLASMLGE 533


>gi|218529882|ref|YP_002420698.1| amylo-alpha-16-glucosidase [Methylobacterium extorquens CM4]
 gi|218522185|gb|ACK82770.1| Amylo-alpha-16-glucosidase [Methylobacterium extorquens CM4]
          Length = 778

 Score = 40.4 bits (93), Expect = 4.8,   Method: Compositional matrix adjust.
 Identities = 31/103 (30%), Positives = 51/103 (49%), Gaps = 18/103 (17%)

Query: 593 FVLQVYRDFVATGDKNFARAVWPSVYIAMAYMEQF-DKDGDGMIE----------NEGFP 641
           FV+   +    TGD +  RA+WP + +A+ +++ + D+DGDG +E          N+G+ 
Sbjct: 431 FVMLAAQYHEVTGDLDTIRAIWPQLKLALEWIDTYGDRDGDGFVEYARETEQGLANQGWK 490

Query: 642 DQ-----TYDAWSANGVSAYC--GGLWVAALQAASALANDVGD 677
           D        D   A G  A C   G   AA + A+ LA+ +G+
Sbjct: 491 DSHDSIFHADGAMAKGPIALCEVQGYVYAAKRGAARLASMLGE 533


>gi|254560746|ref|YP_003067841.1| hypothetical protein METDI2293 [Methylobacterium extorquens DM4]
 gi|254268024|emb|CAX23895.1| conserved hypothetical protein, putative glycosyltransferase
           [Methylobacterium extorquens DM4]
          Length = 778

 Score = 40.4 bits (93), Expect = 5.1,   Method: Compositional matrix adjust.
 Identities = 31/103 (30%), Positives = 51/103 (49%), Gaps = 18/103 (17%)

Query: 593 FVLQVYRDFVATGDKNFARAVWPSVYIAMAYMEQF-DKDGDGMIE----------NEGFP 641
           FV+   +    TGD +  RA+WP + +A+ +++ + D+DGDG +E          N+G+ 
Sbjct: 431 FVMLAAQYHEVTGDLDTIRAIWPQLKLALEWIDTYGDRDGDGFVEYARETEQGLANQGWK 490

Query: 642 DQ-----TYDAWSANGVSAYC--GGLWVAALQAASALANDVGD 677
           D        D   A G  A C   G   AA + A+ LA+ +G+
Sbjct: 491 DSHDSIFHADGAMAKGPIALCEVQGYVYAAKRGAARLASMLGE 533


>gi|403382326|ref|ZP_10924383.1| alpha-L-rhamnosidase [Paenibacillus sp. JC66]
          Length = 1003

 Score = 40.0 bits (92), Expect = 5.5,   Method: Compositional matrix adjust.
 Identities = 33/116 (28%), Positives = 54/116 (46%), Gaps = 5/116 (4%)

Query: 604 TGDKNFARAVWPSV-YIAMAYMEQFDKDGDGMIENEGFPDQTYDAWSANGVSAYCGGLWV 662
           TGD  +AR++WP V Y    Y+E+ D+ G   ++   F D        +GV ++     V
Sbjct: 692 TGDTGYARSIWPHVRYTLEHYLERLDERGLLAMKAWNFLDWAPMDQPRSGVVSHQNMFLV 751

Query: 663 AALQAASALANDVGDHASASYFWVRYQKAK-AVYDSLWN---GSYFNYDNSDGSSS 714
            AL++A+ LA   G       F    ++ K A+   LW+    +Y +  + DG  S
Sbjct: 752 KALRSAALLAELAGQAEEGDGFLDHAERLKQAIGQHLWSEERKAYLDCIHEDGRRS 807


>gi|337288158|ref|YP_004627630.1| alpha-L-rhamnosidase [Thermodesulfobacterium sp. OPB45]
 gi|334901896|gb|AEH22702.1| alpha-L-rhamnosidase [Thermodesulfobacterium geofontis OPF15]
          Length = 707

 Score = 40.0 bits (92), Expect = 6.0,   Method: Compositional matrix adjust.
 Identities = 29/112 (25%), Positives = 52/112 (46%), Gaps = 17/112 (15%)

Query: 588 DLNSKFVLQVYRDFVATGDKNFARAVWPSVYIAMAYMEQF-DKDGDGMIE---------N 637
           D    FV+   + F  TGD +F +++W ++ +A+++++ + D DGDG +E         N
Sbjct: 358 DATPLFVILASKYFERTGDIDFIKSIWKNIVLAISWIDNYGDVDGDGFVEYFPSKKGLTN 417

Query: 638 EGFPDQTYDAWSANGVSAY-------CGGLWVAALQAASALANDVGDHASAS 682
           +G+ D     +  +G             G    A + AS LA  +G+   AS
Sbjct: 418 KGWKDSEDSIFYEDGTLVKPPVALVEVQGYVYKAKREASKLARILGEKELAS 469


>gi|261368895|ref|ZP_05981778.1| cellulose degradation product phosphorylase [Subdoligranulum
           variabile DSM 15176]
 gi|282568993|gb|EFB74528.1| glycosyltransferase family 36 [Subdoligranulum variabile DSM 15176]
          Length = 803

 Score = 40.0 bits (92), Expect = 6.3,   Method: Compositional matrix adjust.
 Identities = 46/174 (26%), Positives = 67/174 (38%), Gaps = 25/174 (14%)

Query: 597 VYRDFVATGDKNFARAVWP-------SVYIAMAYMEQFDKDGDGMIENEGFPDQTYDAWS 649
           VY+    TGDK F   V P       +VY  +    QF  D  G     G P   Y  W+
Sbjct: 438 VYKYVAETGDKAFLDEVIPFANRDEGTVYEHLKRAVQFSLDHLG---PHGLPAGLYADWN 494

Query: 650 ------ANGVSAYCGGLWVAALQAASALANDVGDHASASYFWVRY-QKAKAVYDSLWNGS 702
                 ANG S +    +  AL+     A D GD A+  +        A+ +    W+G 
Sbjct: 495 DCLRLGANGESTFVALQFYYALEILRRFAADRGDEAACRHLEATMADTARKIQSLCWDGD 554

Query: 703 YFNYDNSDGSSSTSIQADQLAGQW-----YARACGLLPIADEAKVKKALTKIYD 751
            F    ++        AD  A  W     +A   GL   AD+ +  +A+ ++YD
Sbjct: 555 RFIRGFTEEGQRIGAAADPEANLWLNPQSWAVISGL---ADKDQADRAMAQVYD 605


>gi|441502511|ref|ZP_20984521.1| Glycogen debranching enzyme [Photobacterium sp. AK15]
 gi|441429689|gb|ELR67141.1| Glycogen debranching enzyme [Photobacterium sp. AK15]
          Length = 935

 Score = 40.0 bits (92), Expect = 6.7,   Method: Compositional matrix adjust.
 Identities = 41/157 (26%), Positives = 70/157 (44%), Gaps = 19/157 (12%)

Query: 578 YNLFNSSRWKDLNSKFVLQVYRDFVATGDKNFARAVWPSVYIAMAYMEQFDKDGDGMIEN 637
           YN  + + W       V +V+     +GDK++AR+++P V   +  +E+   D  G++ +
Sbjct: 489 YNTTDGTPW------MVREVWDYLRYSGDKDYARSIYPVVKTYIDGIEKNYLDQHGLMTH 542

Query: 638 EGFPDQTYDA-------WSANGVSAY-CGGLWVAALQAASALANDVGDHASASYFWVRYQ 689
              PD   DA       WSA G  A     LW  +L+ A  LA    D     ++    +
Sbjct: 543 RD-PDTWMDAKLEGKLPWSARGNRANDIQALWYTSLKVAVELAKLNQDGEGEQHYLAMAE 601

Query: 690 KAKAVYDSL-WNGS---YFNYDNSDGSSSTSIQADQL 722
           K +A +  L W+G      +  ++D S    I+ +QL
Sbjct: 602 KVQASFQQLFWDGELSLLADRLHADDSRDLQIRPNQL 638


>gi|373456251|ref|ZP_09548018.1| Amylo-alpha-16-glucosidase [Caldithrix abyssi DSM 13497]
 gi|371717915|gb|EHO39686.1| Amylo-alpha-16-glucosidase [Caldithrix abyssi DSM 13497]
          Length = 1123

 Score = 39.7 bits (91), Expect = 7.6,   Method: Compositional matrix adjust.
 Identities = 24/76 (31%), Positives = 37/76 (48%), Gaps = 4/76 (5%)

Query: 603 ATGDKNFARAVWPSVYIAMAYMEQFDKDGDGMIENEGFPDQTYDA---WSANGVSAYCGG 659
           A+GD  F +  W  +  A+ ++   D DGD +IEN+       +    W AN  + Y  G
Sbjct: 666 ASGDLTFIQQSWSHIKRAIDFLYSTDTDGDMLIENKNVGHGWVEGGKLWGANP-TFYLAG 724

Query: 660 LWVAALQAASALANDV 675
           LW   L+ A+ LA  +
Sbjct: 725 LWAQTLKDAAYLAKKL 740


>gi|163851052|ref|YP_001639095.1| amylo-alpha-16-glucosidase [Methylobacterium extorquens PA1]
 gi|163662657|gb|ABY30024.1| Amylo-alpha-16-glucosidase [Methylobacterium extorquens PA1]
          Length = 778

 Score = 39.7 bits (91), Expect = 7.6,   Method: Compositional matrix adjust.
 Identities = 31/103 (30%), Positives = 50/103 (48%), Gaps = 18/103 (17%)

Query: 593 FVLQVYRDFVATGDKNFARAVWPSVYIAMAYMEQF-DKDGDGMIE----------NEGFP 641
           FV+   +    TGD    RA+WP + +A+ +++ + D+DGDG +E          N+G+ 
Sbjct: 431 FVMLAAQYHEVTGDLETIRAIWPQLKLALEWIDTYGDRDGDGFVEYARETEQGLANQGWK 490

Query: 642 DQ-----TYDAWSANGVSAYC--GGLWVAALQAASALANDVGD 677
           D        D   A G  A C   G   AA + A+ LA+ +G+
Sbjct: 491 DSHDSIFHADGAMAKGPIALCEVQGYVYAAKRGAARLASMLGE 533


>gi|40850651|gb|AAR96047.1| alpha-L-rhamnosidase B [Thermomicrobia bacterium PRI-1686]
          Length = 954

 Score = 39.7 bits (91), Expect = 8.2,   Method: Compositional matrix adjust.
 Identities = 36/143 (25%), Positives = 58/143 (40%), Gaps = 2/143 (1%)

Query: 604 TGDKNFARAVWPSV-YIAMAYMEQFDKDGDGMIENEGFPDQTYDAWSANGVSAYCGGLWV 662
           TGD  F + +WP + Y    Y++  + DG   I      D        +GV  +    +V
Sbjct: 647 TGDLAFVQDIWPDIQYTLDHYLQHINDDGLLEISAWNLLDWAPIDQPNSGVVTHQNCFFV 706

Query: 663 AALQAASALANDVGDHASASYFWVRYQKAKAVYDSLWNGSYFNYDNSDGSSSTSIQADQL 722
            AL+ A  L    GD  +  Y     + A A+   LW+  +  Y +S  + ST      +
Sbjct: 707 RALKDADELGQSAGDETAGRYAERARELAAAINTHLWSDEHKAYIDSIHADSTRSSVISM 766

Query: 723 AGQWYARACGLLPIADEAKVKKA 745
             Q  A   G+    D A+V ++
Sbjct: 767 QTQVVALLTGVAE-GDRAEVVRS 788


>gi|373457216|ref|ZP_09548983.1| Amylo-alpha-16-glucosidase [Caldithrix abyssi DSM 13497]
 gi|371718880|gb|EHO40651.1| Amylo-alpha-16-glucosidase [Caldithrix abyssi DSM 13497]
          Length = 976

 Score = 39.3 bits (90), Expect = 9.3,   Method: Compositional matrix adjust.
 Identities = 43/195 (22%), Positives = 89/195 (45%), Gaps = 30/195 (15%)

Query: 557 KCLGAVPHDIGLDDPWFEINSYNLFNSSRWKDLNSKFVLQVYRDFVATGDKNFARAVWPS 616
           + LG +P+ I  ++       YN  + + W      F+  +Y  F+ TGD++F + ++P 
Sbjct: 316 RELGRIPNRITNNE-----TIYNTADGTWW------FIRAIYEYFLYTGDQSFLQEMFPY 364

Query: 617 VYIAM--AYMEQFD-------KDGDGMIENEGFPDQTYDAWSANGVSAY-CGGLWVAALQ 666
           V  A+  A  ++ D       +D +  ++  G    +   WS  G  A     LW  ALQ
Sbjct: 365 VRRALEGAIGKRVDELGFLTHRDAETWMDAVG----SEGPWSPRGNRAVEIQALWYTALQ 420

Query: 667 AASALANDV-GDHASASYFWVRYQKAKAVYDS-LWN---GSYFNYDNSDGSSSTSIQADQ 721
             + +A ++ G+ +    + +  +K K  ++   WN    + +++ N+DG++   I+ +Q
Sbjct: 421 IGAQMAKNIEGEDSQGQKWLLLSRKVKDHFNRYYWNETRQALYDHLNADGTADQQIRPNQ 480

Query: 722 LAGQWYARACGLLPI 736
           +         G+ P+
Sbjct: 481 MFAVTVPDLPGIEPL 495


>gi|94500338|ref|ZP_01306871.1| Predicted peptidoglycan hydrolase, FlgJ family protein
           [Oceanobacter sp. RED65]
 gi|94427637|gb|EAT12614.1| Predicted peptidoglycan hydrolase, FlgJ family protein
           [Oceanobacter sp. RED65]
          Length = 251

 Score = 39.3 bits (90), Expect = 9.7,   Method: Composition-based stats.
 Identities = 37/170 (21%), Positives = 72/170 (42%), Gaps = 18/170 (10%)

Query: 341 AAIAASLTIPSGSTRSVTFSLAWDCPEVKFFEKVYHRRYTKFYGTLGDSAARIAHDAILE 400
           ++  A    P+ + +   F +    P+ + +E V  ++   F   L     ++A++ ILE
Sbjct: 17  SSFTAGYLFPTNTEKKYKFEVE-TMPDFRQYENVQKKKQDFFNFLL--PRVKVANENILE 73

Query: 401 HAKWECEIEAWQRPILEDKRFPEWYPITLFNELYYLNAGGTIWTDGSPPMQSLATIRERK 460
             +W  E++       +++R  E        E Y +        D     + L T+ +R+
Sbjct: 74  ERRWLLELDLSNLSDDQEQRLNE------LAEKYEV--------DDYENSEQLFTVLKRR 119

Query: 461 FSLDTPSKIHTPASSDTALGT-RLLENGEENIGQFLYLEGAEYVMYNTYD 509
             +  PS +   A++++A GT R    G    GQ+ Y+EG   +  N  D
Sbjct: 120 IDVLPPSMVLAQAANESAWGTSRFARKGNNLFGQWCYVEGCGLIPKNRND 169


  Database: nr
    Posted date:  Mar 3, 2013 10:45 PM
  Number of letters in database: 999,999,864
  Number of sequences in database:  2,912,245
  
  Database: /local_scratch/syshi//blastdatabase/nr.01
    Posted date:  Mar 3, 2013 10:52 PM
  Number of letters in database: 999,999,666
  Number of sequences in database:  2,912,720
  
  Database: /local_scratch/syshi//blastdatabase/nr.02
    Posted date:  Mar 3, 2013 10:58 PM
  Number of letters in database: 999,999,938
  Number of sequences in database:  3,014,250
  
  Database: /local_scratch/syshi//blastdatabase/nr.03
    Posted date:  Mar 3, 2013 11:03 PM
  Number of letters in database: 999,999,780
  Number of sequences in database:  2,805,020
  
  Database: /local_scratch/syshi//blastdatabase/nr.04
    Posted date:  Mar 3, 2013 11:08 PM
  Number of letters in database: 999,999,551
  Number of sequences in database:  2,816,253
  
  Database: /local_scratch/syshi//blastdatabase/nr.05
    Posted date:  Mar 3, 2013 11:13 PM
  Number of letters in database: 999,999,897
  Number of sequences in database:  2,981,387
  
  Database: /local_scratch/syshi//blastdatabase/nr.06
    Posted date:  Mar 3, 2013 11:18 PM
  Number of letters in database: 999,999,649
  Number of sequences in database:  2,911,476
  
  Database: /local_scratch/syshi//blastdatabase/nr.07
    Posted date:  Mar 3, 2013 11:24 PM
  Number of letters in database: 999,999,452
  Number of sequences in database:  2,920,260
  
  Database: /local_scratch/syshi//blastdatabase/nr.08
    Posted date:  Mar 3, 2013 11:25 PM
  Number of letters in database: 64,230,274
  Number of sequences in database:  189,558
  
Lambda     K      H
   0.318    0.134    0.419 

Lambda     K      H
   0.267   0.0410    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 15,894,215,217
Number of Sequences: 23463169
Number of extensions: 700295105
Number of successful extensions: 1444822
Number of sequences better than 100.0: 545
Number of HSP's better than 100.0 without gapping: 406
Number of HSP's successfully gapped in prelim test: 139
Number of HSP's that attempted gapping in prelim test: 1440787
Number of HSP's gapped (non-prelim): 1253
length of query: 920
length of database: 8,064,228,071
effective HSP length: 152
effective length of query: 768
effective length of database: 8,792,793,679
effective search space: 6752865545472
effective search space used: 6752865545472
T: 11
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)
S2: 82 (36.2 bits)