BLASTP 2.2.26 [Sep-21-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= 038924
(920 letters)
Database: nr
23,463,169 sequences; 8,064,228,071 total letters
Searching..................................................done
>gi|224146269|ref|XP_002325943.1| predicted protein [Populus trichocarpa]
gi|222862818|gb|EEF00325.1| predicted protein [Populus trichocarpa]
Length = 891
Score = 1375 bits (3559), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 641/891 (71%), Positives = 745/891 (83%), Gaps = 34/891 (3%)
Query: 55 IGFRLYRYSKQEEAKGKVPVFNVFRKHHITSDQGIPLGGIGAGSIGRSYRGEFQRFKLFH 114
+G RL+R+ KQE AK + +F+ +KH +TSD GIPLGGIGAGSIGRSYRGEFQ F+LF
Sbjct: 4 MGLRLWRHIKQEAAKERATIFDFSKKHILTSDHGIPLGGIGAGSIGRSYRGEFQHFRLFP 63
Query: 115 GICDDAPVLANQFSVFVSRPNGEKFSSVLCPRSPGVPKKNTDSGIESWDWNLKGENCTYH 174
GIC++ PVLANQFS FVSRP+GE FSSVLC R+P +PK++T SGIESWDWNL G+ CTYH
Sbjct: 64 GICEEGPVLANQFSAFVSRPSGETFSSVLCSRTPDIPKESTGSGIESWDWNLNGQKCTYH 123
Query: 175 ALFPRAWTVYDGEPDPELRIVCRQISPFIPHNYKESSFPASVFTFTLSNSGQTSADVTLL 234
ALFPRAWT+YDG PDPEL IV RQISPFIPHNYKESSFP SVFTFTLSN G+TSADVTL+
Sbjct: 124 ALFPRAWTIYDGAPDPELTIVSRQISPFIPHNYKESSFPVSVFTFTLSNHGKTSADVTLM 183
Query: 235 FTWANSVAGDSGLSGHHFNSKTMTKDGVHGLTLHHRTANGRPPVTFAVAAEETADVHVSE 294
FTWANSV G SGLSGHHFNSK MTKDGVH +TLHH+ N +PPVTFA+AA+ETADVHVSE
Sbjct: 184 FTWANSVGGVSGLSGHHFNSKMMTKDGVHAVTLHHKMTNRQPPVTFAIAAQETADVHVSE 243
Query: 295 CPCFLLSGNSK--GITAKDMWNEIKKHGSFDHLDNDKTSPSEPGSSIGAAIAASLTIPSG 352
CPCFL+SG+S+ +TAKDMW+ I+K+G+FD + +KTSPSEPGSSIGAAIAAS+T+PSG
Sbjct: 244 CPCFLISGDSQDVSVTAKDMWDVIRKNGTFDQIGCNKTSPSEPGSSIGAAIAASVTVPSG 303
Query: 353 STRSVTFSLAWDCPEVKFFEKVYHRRYTKFYGTLGDSAARIAHDAILEHAKWECEIEAWQ 412
S R+VTFSL+WD PE++F E+ YHRRYTKFYGTLGD+AA IA DAILEHA WE +IEAWQ
Sbjct: 304 SIRTVTFSLSWDIPEIRFSERSYHRRYTKFYGTLGDAAANIARDAILEHANWESQIEAWQ 363
Query: 413 RPILEDKRFPEWYPITLFNELYYLNAGGTIWTDGSPPMQSLATIRERKFSLDTPS----- 467
RPILEDKR PEWYPITLFNELYYLNAGGTIWTD SPPMQ+L +RER+FSL+ S
Sbjct: 364 RPILEDKRVPEWYPITLFNELYYLNAGGTIWTDESPPMQNLTAVRERRFSLERSSSGYKN 423
Query: 468 -----------------------KIHTPASSDTALGTRLLENGEENIGQFLYLEGAEYVM 504
++H P SS++ G LL+NGEENIGQ LYLEG EY+M
Sbjct: 424 GNGIAYKNDTAIEILERMASTYEQLHNPVSSNSVFGANLLQNGEENIGQLLYLEGTEYLM 483
Query: 505 YNTYDVHFYSSFALVMLFPKLELSIQRDFAAAVMMHDPGTMKIMSDGKWVARKCLGAVPH 564
+NTYDVHFY+SFALVMLFPKLEL++QRDFAAAV+MHDP M+IMSDGKWV RK LGAVPH
Sbjct: 484 WNTYDVHFYASFALVMLFPKLELNLQRDFAAAVLMHDPSRMQIMSDGKWVPRKVLGAVPH 543
Query: 565 DIGLDDPWFEINSYNLFNSSRWKDLNSKFVLQVYRDFVATGDKNFARAVWPSVYIAMAYM 624
D+GL+DPWFEIN+YNLF+++RWKDLN KFVLQVYRD VATGDK+FARAVWPSVY+AMAYM
Sbjct: 544 DVGLNDPWFEINAYNLFSTARWKDLNCKFVLQVYRDVVATGDKDFARAVWPSVYVAMAYM 603
Query: 625 EQFDKDGDGMIENEGFPDQTYDAWSANGVSAYCGGLWVAALQAASALANDVGDHASASYF 684
+QFDKDGDGMIENEG PDQTYD WS GVSAYCGGLWVAALQA SA+A++VGD SA+YF
Sbjct: 604 DQFDKDGDGMIENEGIPDQTYDTWSVTGVSAYCGGLWVAALQATSAMAHEVGDDGSANYF 663
Query: 685 WVRYQKAKAVYDSLWNGSYFNYDNSDGSSSTSIQADQLAGQWYARACGLLPIADEAKVKK 744
W++YQKAK VY LWNGSYFNYD+ S + SIQADQLAGQWYARACGL P+ D K +K
Sbjct: 664 WIKYQKAKTVYGKLWNGSYFNYDS---SGNKSIQADQLAGQWYARACGLSPVVDGDKARK 720
Query: 745 ALTKIYDFNVLKVKGGMCGAMNGMQPDGRIDMSGLQAREIWPGVTYGLAASMIQEEMVDM 804
AL KIY+FNVLKVKGG GA+NGM PDGR+D++ +Q +EIWPGVTY L+ASMIQE + +M
Sbjct: 721 ALEKIYEFNVLKVKGGTRGAVNGMLPDGRVDLTAMQTKEIWPGVTYALSASMIQEGLEEM 780
Query: 805 AFQTAAGVYEVAWSEDGLGYSFQTPESWNNNDEYRSLCYMRPLTIWAMQWALTKPKLSRQ 864
AFQTA G+Y AWSE+GLGYSFQ PESW+ ND+YRSLCYMRPL IWAMQWAL+KPK ++
Sbjct: 781 AFQTAVGIYNAAWSEEGLGYSFQIPESWDMNDQYRSLCYMRPLAIWAMQWALSKPKDFKE 840
Query: 865 EIKHEISDRDSSYLEQHAAFSKVASLLKLPKEEASKGFLKVVYDFTIGRVL 915
E+ HE D SYL+QHA FSKVA LL+LP+EEA K F + +Y+FT R+L
Sbjct: 841 EMHHE-GIEDESYLKQHAGFSKVAHLLRLPEEEAPKSFFQAIYEFTCKRML 890
>gi|255569126|ref|XP_002525532.1| conserved hypothetical protein [Ricinus communis]
gi|223535211|gb|EEF36890.1| conserved hypothetical protein [Ricinus communis]
Length = 948
Score = 1359 bits (3517), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 666/948 (70%), Positives = 770/948 (81%), Gaps = 37/948 (3%)
Query: 1 MENGVKEEGENAA----SMPKVDPGQPPQLTWQRKLDSNVKVPTGFTLSFQEFRHLAHIG 56
MEN +KEEGE + S K++PG PP L+W+R L+SN VP GFTLSF+E H+ IG
Sbjct: 2 MENRIKEEGEKDSYLDSSTQKINPGVPPSLSWERPLNSNGNVPLGFTLSFREILHMLPIG 61
Query: 57 FRLYRYSKQEEAKGKVPVFNVFRKHHITSDQGIPLGGIGAGSIGRSYRGEFQRFKLFHGI 116
RL+RYSK+E KG+VP+F+ +KH IT D +PLGGIGAGSIGRSY+GEFQ FKL
Sbjct: 62 LRLWRYSKEEPTKGRVPIFDFSKKHVITGDHAVPLGGIGAGSIGRSYKGEFQCFKLLPLA 121
Query: 117 CDDAPVLANQFSVFVSRPNGEKFSSVLCPRSPGVPKKNTDSGIESWDWNLKGENCTYHAL 176
C++ P+LANQFSVFVSRPNG+KFSSVLC R P +P + SGIESWDWNL GENCTYHAL
Sbjct: 122 CEEGPILANQFSVFVSRPNGKKFSSVLCSRRPELPTEIKGSGIESWDWNLNGENCTYHAL 181
Query: 177 FPRAWTVYDGEPDPELRIVCRQISPFIPHNYKESSFPASVFTFTLSNSGQTSADVTLLFT 236
FPRAWT Y G+PDPEL+IV +QISPFIPHNYKESSFP SVFTFTLSN G+TSADVTLLFT
Sbjct: 182 FPRAWTTYKGQPDPELKIVSQQISPFIPHNYKESSFPVSVFTFTLSNFGRTSADVTLLFT 241
Query: 237 WANSVAGDSGLSGHHFNSKTMTKDGVHGLTLHHRTANGRPPVTFAVAAEETADVHVSECP 296
WANS+ G SG S HHFNS M KDGVH +TLHH+T +G+PP+TFA+AA+ET DVHVSECP
Sbjct: 242 WANSIGGVSGSSRHHFNSSIMKKDGVHTVTLHHKTGDGQPPLTFAIAAQETPDVHVSECP 301
Query: 297 CFLLSGNSKGITAKDMWNEIKKHGSFDHLDNDKTSPSEPGSSIGAAIAASLTIPSGSTRS 356
CFL+SGNS+G+TAKDMW+EIKKHG+FDHL +K SPSE GS IGAAIAA+LTIP + R+
Sbjct: 302 CFLISGNSQGVTAKDMWDEIKKHGTFDHLSYNKISPSEGGSCIGAAIAATLTIPPDTIRT 361
Query: 357 VTFSLAWDCPEVKFFEKVYHRRYTKFYGTLGDSAARIAHDAILEHAKWECEIEAWQRPIL 416
TFSLAWDCPEV+F + YHRRYTKFYGTLGD+AA IAHDAILEH WE +IEAWQRPIL
Sbjct: 362 ATFSLAWDCPEVRFSGRTYHRRYTKFYGTLGDAAADIAHDAILEHGHWESQIEAWQRPIL 421
Query: 417 EDKRFPEWYPITLFNELYYLNAGGTIWTDGSPPMQSLATIRERKFSLDTP---------- 466
EDKR P+WYPITLFNELYYLNAGGT+WTDGSPPMQS A I+ K +LD
Sbjct: 422 EDKRLPKWYPITLFNELYYLNAGGTVWTDGSPPMQSFAAIKGGKLTLDKSRSEFENARPV 481
Query: 467 ------------------SKIHTPASSDTALGTRLLENGEENIGQFLYLEGAEYVMYNTY 508
K+H P +S+ A GT LL+N EENIGQFLYLEG+EY+M+NTY
Sbjct: 482 AHRNDTAVEILNRMASIYEKMHNPVTSNAAFGTYLLQNDEENIGQFLYLEGSEYLMWNTY 541
Query: 509 DVHFYSSFALVMLFPKLELSIQRDFAAAVMMHDPGTMKIMSDGKWVARKCLGAVPHDIGL 568
DVHFYSSFAL+MLFPKLELSIQRDFAAAVMMHDP M+IMSDG+ V RK LGAVPHDIGL
Sbjct: 542 DVHFYSSFALLMLFPKLELSIQRDFAAAVMMHDPSRMQIMSDGRRVPRKVLGAVPHDIGL 601
Query: 569 DDPWFEINSYNLFNSSRWKDLNSKFVLQVYRDFVATGDKNFARAVWPSVYIAMAYMEQFD 628
+DPWFE+N+YNL +++RWKDLN KFVLQ+YRD VATGDK+FA AVWPSVY+AMAYM+QFD
Sbjct: 602 NDPWFEVNAYNLISTARWKDLNPKFVLQIYRDVVATGDKSFALAVWPSVYVAMAYMDQFD 661
Query: 629 KDGDGMIENEGFPDQTYDAWSANGVSAYCGGLWVAALQAASALANDVGDHASASYFWVRY 688
KDGDGMIENEGFPDQTYD WS GVSAYCGGLWVAALQAASALA +VGD SAS+FWV+Y
Sbjct: 662 KDGDGMIENEGFPDQTYDTWSVTGVSAYCGGLWVAALQAASALACEVGDSESASFFWVKY 721
Query: 689 QKAKAVYDSLWNGSYFNYDNSDGSSSTSIQADQLAGQWYARACGLLPIADEAKVKKALTK 748
QKAKAVY +LWNGSYFNYD+S S SI ADQLAGQWYARACGL I DE KV+ AL K
Sbjct: 722 QKAKAVYSTLWNGSYFNYDSSGNS---SIHADQLAGQWYARACGLSSIVDEEKVRSALEK 778
Query: 749 IYDFNVLKVKGGMCGAMNGMQPDGRIDMSGLQAREIWPGVTYGLAASMIQEEMVDMAFQT 808
IY FNVLKVK G GA+NGM PDG++DMS +Q+REIWPGVTY L+ASMIQE M +MAFQT
Sbjct: 779 IYKFNVLKVKEGKRGAVNGMLPDGKVDMSVMQSREIWPGVTYALSASMIQEGMAEMAFQT 838
Query: 809 AAGVYEVAWSEDGLGYSFQTPESWNNNDEYRSLCYMRPLTIWAMQWALTKPKLSRQEIK- 867
A+G+YE AWS++GLGYSFQ PE WN +D+YRSLCYMRPL IWAMQWAL+KPK+ ++E+K
Sbjct: 839 ASGIYEAAWSQEGLGYSFQIPEGWNTDDQYRSLCYMRPLAIWAMQWALSKPKVFKEEMKL 898
Query: 868 HEISDRDSSYLEQHAAFSKVASLLKLPKEEAS-KGFLKVVYDFTIGRV 914
++ D + Q+A FSKVA LLKLP +E S K FL+ Y+FT R+
Sbjct: 899 LSLAADDRLHPSQNAGFSKVAHLLKLPDDEGSNKSFLQSAYEFTCRRL 946
>gi|225460125|ref|XP_002275782.1| PREDICTED: non-lysosomal glucosylceramidase-like [Vitis vinifera]
Length = 969
Score = 1328 bits (3437), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 657/954 (68%), Positives = 770/954 (80%), Gaps = 35/954 (3%)
Query: 1 MENGVKEEGEN----AASMPKVDPGQPPQLTWQRKLDSNVKVPTGFTLSFQEFRHLAHIG 56
MENG KE+G+ +S KV+PG+P LTWQRKL++ T F L +E +HLA +G
Sbjct: 17 MENGHKEDGDMEHPVKSSAHKVNPGKPASLTWQRKLNTKANTLTRFNLKLREIKHLAPLG 76
Query: 57 FRLYRYSKQEEAKGKVPVFNVFRKHHITSDQGIPLGGIGAGSIGRSYRGEFQRFKLFHGI 116
RL+ + E AKG++ + + F K +TS G+PLGGIG GSIGRSYRGEFQR++LF I
Sbjct: 77 VRLWHHVNAEAAKGRISIIDPFSKRLVTSYHGVPLGGIGGGSIGRSYRGEFQRYQLFPRI 136
Query: 117 CDDAPVLANQFSVFVSRPNGEKFSSVLCPRSPGVPKKNTDSGIESWDWNLKGENCTYHAL 176
C+D+PVLANQFSVFVSRPNG+K S+VLCPR+P V K + SGI SWDWNL GE+CTYHAL
Sbjct: 137 CEDSPVLANQFSVFVSRPNGKKSSTVLCPRNPEVLKGSASSGIGSWDWNLDGESCTYHAL 196
Query: 177 FPRAWTVYDGEPDPELRIVCRQISPFIPHNYKESSFPASVFTFTLSNSGQTSADVTLLFT 236
+PRAWTVY+GEPDPE+ I+ QISPFIPHNYKESSFP SVF FTLSNSG+TSAD+TLLFT
Sbjct: 197 YPRAWTVYEGEPDPEISIISSQISPFIPHNYKESSFPVSVFKFTLSNSGKTSADITLLFT 256
Query: 237 WANSVAGDSGLSGHHFNSKTMTKDGVHGLTLHHRTANGRPPVTFAVAAEETADVHVSECP 296
WANSV G S SGHH+NSK TKDGVHG+ LHH+TANG PPVTFA+AAEET DVH+SECP
Sbjct: 257 WANSVGGTSEFSGHHYNSKMKTKDGVHGVLLHHKTANGHPPVTFAIAAEETGDVHISECP 316
Query: 297 CFLLSGNSKGITAKDMWNEIKKHGSFDHLDNDKTS-PSEPGSSIGAAIAASLTIPSGSTR 355
CFL+SGNS G+TAK+MW EIK+HGSFDHLD D +S SEPGSSIGAA+AASLT+P + R
Sbjct: 317 CFLISGNSLGVTAKEMWQEIKQHGSFDHLDFDGSSMRSEPGSSIGAAVAASLTLPPDTVR 376
Query: 356 SVTFSLAWDCPEVKFFE-KVYHRRYTKFYGTLGDSAARIAHDAILEHAKWECEIEAWQRP 414
+VTFSLAW CPEV+F K YHRRYT+FYGT D+A IAHDAILEHA W EIEAWQ P
Sbjct: 377 TVTFSLAWACPEVRFTSGKTYHRRYTRFYGTHVDAAEEIAHDAILEHANWVSEIEAWQGP 436
Query: 415 ILEDKRFPEWYPITLFNELYYLNAGGTIWTDGSPPMQSLATIRERKFSLDTP-------- 466
ILED+R PEWY ITLFNELY+LNAGGTIWTDG PPMQSLATI + KFSLD
Sbjct: 437 ILEDRRLPEWYRITLFNELYFLNAGGTIWTDGLPPMQSLATIEQIKFSLDRSISDPKNTT 496
Query: 467 --------------------SKIHTPASSDTALGTRLLENGEENIGQFLYLEGAEYVMYN 506
+IH P +S++A GT LL++GEEN+GQFLYLEG EY M+N
Sbjct: 497 DIVHQNDSTVEILGRMTSMLEQIHNPTTSNSAFGTYLLQSGEENVGQFLYLEGIEYHMWN 556
Query: 507 TYDVHFYSSFALVMLFPKLELSIQRDFAAAVMMHDPGTMKIMSDGKWVARKCLGAVPHDI 566
TYDVHFYSSFA++MLFP+LELSIQRDFAAAVM+HDP MKIMSDGKWV RK LGAVPHDI
Sbjct: 557 TYDVHFYSSFAIIMLFPQLELSIQRDFAAAVMVHDPSRMKIMSDGKWVPRKVLGAVPHDI 616
Query: 567 GLDDPWFEINSYNLFNSSRWKDLNSKFVLQVYRDFVATGDKNFARAVWPSVYIAMAYMEQ 626
G+ DPWFE+N+YNL+++ RWKDLNSKFVLQVYRD VATGDKNFARAVWP+VYIA+A+++Q
Sbjct: 617 GISDPWFELNAYNLYDTDRWKDLNSKFVLQVYRDMVATGDKNFARAVWPAVYIAIAFLDQ 676
Query: 627 FDKDGDGMIENEGFPDQTYDAWSANGVSAYCGGLWVAALQAASALANDVGDHASASYFWV 686
FDKDGDGMIEN+GFPDQTYDAWS GVSAYCGGLWVAALQAASA+A +VGD +A YFW
Sbjct: 677 FDKDGDGMIENDGFPDQTYDAWSVTGVSAYCGGLWVAALQAASAMAREVGDSMTADYFWF 736
Query: 687 RYQKAKAVYDSLWNGSYFNYDNSDGSSSTSIQADQLAGQWYARACGLLPIADEAKVKKAL 746
++QKAKAVYD LWNGSYFNYDNS GSSS+SIQADQLAGQWYARACGL PI D+ K + AL
Sbjct: 737 KFQKAKAVYDKLWNGSYFNYDNSGGSSSSSIQADQLAGQWYARACGLQPIVDDEKARSAL 796
Query: 747 TKIYDFNVLKVKGGMCGAMNGMQPDGRIDMSGLQAREIWPGVTYGLAASMIQEEMVDMAF 806
K+Y+FNVLKVK G CGA+NGM PDGR+DMS +Q+REIW GVTY +AA+MI E MV+ AF
Sbjct: 797 EKVYNFNVLKVKEGKCGAVNGMLPDGRVDMSAMQSREIWAGVTYSVAANMIHEGMVETAF 856
Query: 807 QTAAGVYEVAWSEDGLGYSFQTPESWNNNDEYRSLCYMRPLTIWAMQWALTKPKLSRQEI 866
TA+G+Y+ AWS++GLGYSFQTPE+WN ++EYRSLCYMRPL IWAMQWAL+KP+L ++
Sbjct: 857 NTASGIYDAAWSQEGLGYSFQTPEAWNTDEEYRSLCYMRPLAIWAMQWALSKPELHNHDM 916
Query: 867 KHEISDRDSSYLEQHAAFSKVASLLKLPKEEASKGFLKVVYDFTIGRVLCHNLN 920
KHE + + E H F KVA LLKLP+EEASK FL++ +D T R+ LN
Sbjct: 917 KHE-EGKGTLNFEHHVGFEKVAHLLKLPEEEASKSFLQLFFDLTCRRLYTLRLN 969
>gi|297741007|emb|CBI31319.3| unnamed protein product [Vitis vinifera]
Length = 953
Score = 1328 bits (3437), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 657/954 (68%), Positives = 770/954 (80%), Gaps = 35/954 (3%)
Query: 1 MENGVKEEGEN----AASMPKVDPGQPPQLTWQRKLDSNVKVPTGFTLSFQEFRHLAHIG 56
MENG KE+G+ +S KV+PG+P LTWQRKL++ T F L +E +HLA +G
Sbjct: 1 MENGHKEDGDMEHPVKSSAHKVNPGKPASLTWQRKLNTKANTLTRFNLKLREIKHLAPLG 60
Query: 57 FRLYRYSKQEEAKGKVPVFNVFRKHHITSDQGIPLGGIGAGSIGRSYRGEFQRFKLFHGI 116
RL+ + E AKG++ + + F K +TS G+PLGGIG GSIGRSYRGEFQR++LF I
Sbjct: 61 VRLWHHVNAEAAKGRISIIDPFSKRLVTSYHGVPLGGIGGGSIGRSYRGEFQRYQLFPRI 120
Query: 117 CDDAPVLANQFSVFVSRPNGEKFSSVLCPRSPGVPKKNTDSGIESWDWNLKGENCTYHAL 176
C+D+PVLANQFSVFVSRPNG+K S+VLCPR+P V K + SGI SWDWNL GE+CTYHAL
Sbjct: 121 CEDSPVLANQFSVFVSRPNGKKSSTVLCPRNPEVLKGSASSGIGSWDWNLDGESCTYHAL 180
Query: 177 FPRAWTVYDGEPDPELRIVCRQISPFIPHNYKESSFPASVFTFTLSNSGQTSADVTLLFT 236
+PRAWTVY+GEPDPE+ I+ QISPFIPHNYKESSFP SVF FTLSNSG+TSAD+TLLFT
Sbjct: 181 YPRAWTVYEGEPDPEISIISSQISPFIPHNYKESSFPVSVFKFTLSNSGKTSADITLLFT 240
Query: 237 WANSVAGDSGLSGHHFNSKTMTKDGVHGLTLHHRTANGRPPVTFAVAAEETADVHVSECP 296
WANSV G S SGHH+NSK TKDGVHG+ LHH+TANG PPVTFA+AAEET DVH+SECP
Sbjct: 241 WANSVGGTSEFSGHHYNSKMKTKDGVHGVLLHHKTANGHPPVTFAIAAEETGDVHISECP 300
Query: 297 CFLLSGNSKGITAKDMWNEIKKHGSFDHLDNDKTS-PSEPGSSIGAAIAASLTIPSGSTR 355
CFL+SGNS G+TAK+MW EIK+HGSFDHLD D +S SEPGSSIGAA+AASLT+P + R
Sbjct: 301 CFLISGNSLGVTAKEMWQEIKQHGSFDHLDFDGSSMRSEPGSSIGAAVAASLTLPPDTVR 360
Query: 356 SVTFSLAWDCPEVKFFE-KVYHRRYTKFYGTLGDSAARIAHDAILEHAKWECEIEAWQRP 414
+VTFSLAW CPEV+F K YHRRYT+FYGT D+A IAHDAILEHA W EIEAWQ P
Sbjct: 361 TVTFSLAWACPEVRFTSGKTYHRRYTRFYGTHVDAAEEIAHDAILEHANWVSEIEAWQGP 420
Query: 415 ILEDKRFPEWYPITLFNELYYLNAGGTIWTDGSPPMQSLATIRERKFSLDTP-------- 466
ILED+R PEWY ITLFNELY+LNAGGTIWTDG PPMQSLATI + KFSLD
Sbjct: 421 ILEDRRLPEWYRITLFNELYFLNAGGTIWTDGLPPMQSLATIEQIKFSLDRSISDPKNTT 480
Query: 467 --------------------SKIHTPASSDTALGTRLLENGEENIGQFLYLEGAEYVMYN 506
+IH P +S++A GT LL++GEEN+GQFLYLEG EY M+N
Sbjct: 481 DIVHQNDSTVEILGRMTSMLEQIHNPTTSNSAFGTYLLQSGEENVGQFLYLEGIEYHMWN 540
Query: 507 TYDVHFYSSFALVMLFPKLELSIQRDFAAAVMMHDPGTMKIMSDGKWVARKCLGAVPHDI 566
TYDVHFYSSFA++MLFP+LELSIQRDFAAAVM+HDP MKIMSDGKWV RK LGAVPHDI
Sbjct: 541 TYDVHFYSSFAIIMLFPQLELSIQRDFAAAVMVHDPSRMKIMSDGKWVPRKVLGAVPHDI 600
Query: 567 GLDDPWFEINSYNLFNSSRWKDLNSKFVLQVYRDFVATGDKNFARAVWPSVYIAMAYMEQ 626
G+ DPWFE+N+YNL+++ RWKDLNSKFVLQVYRD VATGDKNFARAVWP+VYIA+A+++Q
Sbjct: 601 GISDPWFELNAYNLYDTDRWKDLNSKFVLQVYRDMVATGDKNFARAVWPAVYIAIAFLDQ 660
Query: 627 FDKDGDGMIENEGFPDQTYDAWSANGVSAYCGGLWVAALQAASALANDVGDHASASYFWV 686
FDKDGDGMIEN+GFPDQTYDAWS GVSAYCGGLWVAALQAASA+A +VGD +A YFW
Sbjct: 661 FDKDGDGMIENDGFPDQTYDAWSVTGVSAYCGGLWVAALQAASAMAREVGDSMTADYFWF 720
Query: 687 RYQKAKAVYDSLWNGSYFNYDNSDGSSSTSIQADQLAGQWYARACGLLPIADEAKVKKAL 746
++QKAKAVYD LWNGSYFNYDNS GSSS+SIQADQLAGQWYARACGL PI D+ K + AL
Sbjct: 721 KFQKAKAVYDKLWNGSYFNYDNSGGSSSSSIQADQLAGQWYARACGLQPIVDDEKARSAL 780
Query: 747 TKIYDFNVLKVKGGMCGAMNGMQPDGRIDMSGLQAREIWPGVTYGLAASMIQEEMVDMAF 806
K+Y+FNVLKVK G CGA+NGM PDGR+DMS +Q+REIW GVTY +AA+MI E MV+ AF
Sbjct: 781 EKVYNFNVLKVKEGKCGAVNGMLPDGRVDMSAMQSREIWAGVTYSVAANMIHEGMVETAF 840
Query: 807 QTAAGVYEVAWSEDGLGYSFQTPESWNNNDEYRSLCYMRPLTIWAMQWALTKPKLSRQEI 866
TA+G+Y+ AWS++GLGYSFQTPE+WN ++EYRSLCYMRPL IWAMQWAL+KP+L ++
Sbjct: 841 NTASGIYDAAWSQEGLGYSFQTPEAWNTDEEYRSLCYMRPLAIWAMQWALSKPELHNHDM 900
Query: 867 KHEISDRDSSYLEQHAAFSKVASLLKLPKEEASKGFLKVVYDFTIGRVLCHNLN 920
KHE + + E H F KVA LLKLP+EEASK FL++ +D T R+ LN
Sbjct: 901 KHE-EGKGTLNFEHHVGFEKVAHLLKLPEEEASKSFLQLFFDLTCRRLYTLRLN 953
>gi|297739499|emb|CBI29681.3| unnamed protein product [Vitis vinifera]
Length = 949
Score = 1316 bits (3406), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 640/944 (67%), Positives = 759/944 (80%), Gaps = 32/944 (3%)
Query: 1 MENGVKEEGENAASMPKVDPGQPPQLTWQRKLDSNVKVPTGFTLSFQEFRHLAHIGFRLY 60
+E G +E ++ KVDPG+P LTWQRKL+S+ P F ++ +E HLA IGFRL+
Sbjct: 6 LEEGEREPSNSSIEEIKVDPGKPGSLTWQRKLNSDGNAPVEFKINLRETFHLAPIGFRLW 65
Query: 61 RYSKQEEAKGKVPVFNVFRKHHITSDQGIPLGGIGAGSIGRSYRGEFQRFKLFHGICDDA 120
R+ ++E AKG+ + + F K +I+S QG+PLGGIGAGSIGRSY+GEFQRF+LF ++
Sbjct: 66 RHVREETAKGRGAMIDPFVKRYISSSQGVPLGGIGAGSIGRSYKGEFQRFQLFPITSENE 125
Query: 121 PVLANQFSVFVSRPNGEKFSSVLCPRSPGVPKKNTDSGIESWDWNLKGENCTYHALFPRA 180
PVL NQFSVFVSRPNGEK+S+VLC +SP K+ SGI SWDWNL G TY AL+PRA
Sbjct: 126 PVLENQFSVFVSRPNGEKYSTVLCRQSPEALKECPPSGIGSWDWNLNGNKSTYLALYPRA 185
Query: 181 WTVYDGEPDPELRIVCRQISPFIPHNYKESSFPASVFTFTLSNSGQTSADVTLLFTWANS 240
WTVYDGEPDP L+IVCRQISP IPHNYKESSFP +VFTFTL NSG+T+AD+TLLFTWANS
Sbjct: 186 WTVYDGEPDPALKIVCRQISPIIPHNYKESSFPVAVFTFTLFNSGKTAADITLLFTWANS 245
Query: 241 VAGDSGLSGHHFNSKTMTKDGVHGLTLHHRTANGRPPVTFAVAAEETADVHVSECPCFLL 300
V G SGLSG H NSK M KDGV G+ LHH+TANGRPPVT+A+AA+E VH+SECPCF +
Sbjct: 246 VGGVSGLSGQHLNSKFMMKDGVRGVLLHHKTANGRPPVTYAIAAQEMDGVHISECPCFFI 305
Query: 301 SGNSKGITAKDMWNEIKKHGSFDHLDNDKTS-PSEPGSSIGAAIAASLTIPSGSTRSVTF 359
SG++ GITAKDMWNEIK+HGSFD L++ +TS PSE GSS+GAA+AASLTIPS S ++VTF
Sbjct: 306 SGDTPGITAKDMWNEIKEHGSFDRLNSAETSMPSELGSSVGAAVAASLTIPSDSEQTVTF 365
Query: 360 SLAWDCPEVKFF-EKVYHRRYTKFYGTLGDSAARIAHDAILEHAKWECEIEAWQRPILED 418
SLAWDCPE+ F E+ Y+RRYTKFYGT GD+AA+IAHDAIL+H WE +IEAWQ+P+LED
Sbjct: 366 SLAWDCPEINFSKERTYYRRYTKFYGTRGDAAAKIAHDAILDHGHWESQIEAWQKPVLED 425
Query: 419 KRFPEWYPITLFNELYYLNAGGTIWTDGSPPMQSLATIRERKFSLDTP------------ 466
KRFPEWYPITLFNELYYLN+GGT+WTDGSPP+ S +I ERKFSLD
Sbjct: 426 KRFPEWYPITLFNELYYLNSGGTVWTDGSPPVHSFTSIIERKFSLDRSRSDLKNTVSISH 485
Query: 467 ----------------SKIHTPASSDTALGTRLLENGEENIGQFLYLEGAEYVMYNTYDV 510
++HTP +S++A G LL+ GEENIGQFLYLEG EY M+NT DV
Sbjct: 486 HNDTAVDILERMTSVLEQVHTPVTSNSAFGPNLLQQGEENIGQFLYLEGVEYFMWNTNDV 545
Query: 511 HFYSSFALVMLFPKLELSIQRDFAAAVMMHDPGTMKIMSDGKWVARKCLGAVPHDIGLDD 570
HFYSSFAL+MLFPKLELSIQRDFAA+VMMHDP MK++ +GKWV+RK LGAVPHD+G D
Sbjct: 546 HFYSSFALIMLFPKLELSIQRDFAASVMMHDPSKMKLLCNGKWVSRKVLGAVPHDVGNYD 605
Query: 571 PWFEINSYNLFNSSRWKDLNSKFVLQVYRDFVATGDKNFARAVWPSVYIAMAYMEQFDKD 630
PWFE+N YNL+N+ RWKDLN KFVLQVYRD VATGDKNFA+AVWPSVY+A+AYM QFDKD
Sbjct: 606 PWFEVNGYNLYNTDRWKDLNPKFVLQVYRDVVATGDKNFAKAVWPSVYVALAYMNQFDKD 665
Query: 631 GDGMIENEGFPDQTYDAWSANGVSAYCGGLWVAALQAASALANDVGDHASASYFWVRYQK 690
GDGMIENEGFPDQTYD WS +GVSAY GGLWVAALQAASALA VGD S YFW ++QK
Sbjct: 666 GDGMIENEGFPDQTYDTWSVSGVSAYSGGLWVAALQAASALARVVGDKGSEDYFWFKFQK 725
Query: 691 AKAVYDSLWNGSYFNYDNSDGSSSTSIQADQLAGQWYARACGLLPIADEAKVKKALTKIY 750
AK VY LWNGSYFNYD+SDGSSS+SIQADQLAGQWYARACGL PI DE K K AL K+Y
Sbjct: 726 AKGVYQKLWNGSYFNYDDSDGSSSSSIQADQLAGQWYARACGLSPIVDEDKAKSALEKVY 785
Query: 751 DFNVLKVKGGMCGAMNGMQPDGRIDMSGLQAREIWPGVTYGLAASMIQEEMVDMAFQTAA 810
+NVLKV GG GA+NGM PDG++D + +Q+REIW GVTYG+AA+MI E +VDMAFQTA+
Sbjct: 786 HYNVLKVGGGKRGAVNGMLPDGKVDTTTMQSREIWSGVTYGVAATMIHEGLVDMAFQTAS 845
Query: 811 GVYEVAWSEDGLGYSFQTPESWNNNDEYRSLCYMRPLTIWAMQWALTKPKLSRQEIKHEI 870
GVYE AWS++GLGYSFQTPESWN +D+YRSLCYMRPL IWAMQWA ++PKL + E E+
Sbjct: 846 GVYEAAWSQEGLGYSFQTPESWNTDDQYRSLCYMRPLAIWAMQWAFSQPKLHKYEANPEM 905
Query: 871 SDRDSSYLEQHAAFSKVASLLKLPKEEASKGFLKVVYDFTIGRV 914
+ + S + QHA FS+VA LLKLP E+ S+ L+V+YD+T R+
Sbjct: 906 N--EDSLVLQHAGFSRVARLLKLPDEDVSRSALQVIYDYTCKRM 947
>gi|359486233|ref|XP_002264575.2| PREDICTED: non-lysosomal glucosylceramidase [Vitis vinifera]
Length = 960
Score = 1311 bits (3393), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 637/928 (68%), Positives = 752/928 (81%), Gaps = 32/928 (3%)
Query: 17 KVDPGQPPQLTWQRKLDSNVKVPTGFTLSFQEFRHLAHIGFRLYRYSKQEEAKGKVPVFN 76
KVDPG+P LTWQRKL+S+ P F ++ +E HLA IGFRL+R+ ++E AKG+ + +
Sbjct: 33 KVDPGKPGSLTWQRKLNSDGNAPVEFKINLRETFHLAPIGFRLWRHVREETAKGRGAMID 92
Query: 77 VFRKHHITSDQGIPLGGIGAGSIGRSYRGEFQRFKLFHGICDDAPVLANQFSVFVSRPNG 136
F K +I+S QG+PLGGIGAGSIGRSY+GEFQRF+LF ++ PVL NQFSVFVSRPNG
Sbjct: 93 PFVKRYISSSQGVPLGGIGAGSIGRSYKGEFQRFQLFPITSENEPVLENQFSVFVSRPNG 152
Query: 137 EKFSSVLCPRSPGVPKKNTDSGIESWDWNLKGENCTYHALFPRAWTVYDGEPDPELRIVC 196
EK+S+VLC +SP K+ SGI SWDWNL G TY AL+PRAWTVYDGEPDP L+IVC
Sbjct: 153 EKYSTVLCRQSPEALKECPPSGIGSWDWNLNGNKSTYLALYPRAWTVYDGEPDPALKIVC 212
Query: 197 RQISPFIPHNYKESSFPASVFTFTLSNSGQTSADVTLLFTWANSVAGDSGLSGHHFNSKT 256
RQISP IPHNYKESSFP +VFTFTL NSG+T+AD+TLLFTWANSV G SGLSG H NSK
Sbjct: 213 RQISPIIPHNYKESSFPVAVFTFTLFNSGKTAADITLLFTWANSVGGVSGLSGQHLNSKF 272
Query: 257 MTKDGVHGLTLHHRTANGRPPVTFAVAAEETADVHVSECPCFLLSGNSKGITAKDMWNEI 316
M KDGV G+ LHH+TANGRPPVT+A+AA+E VH+SECPCF +SG++ GITAKDMWNEI
Sbjct: 273 MMKDGVRGVLLHHKTANGRPPVTYAIAAQEMDGVHISECPCFFISGDTPGITAKDMWNEI 332
Query: 317 KKHGSFDHLDNDKTS-PSEPGSSIGAAIAASLTIPSGSTRSVTFSLAWDCPEVKFF-EKV 374
K+HGSFD L++ +TS PSE GSS+GAA+AASLTIPS S ++VTFSLAWDCPE+ F E+
Sbjct: 333 KEHGSFDRLNSAETSMPSELGSSVGAAVAASLTIPSDSEQTVTFSLAWDCPEINFSKERT 392
Query: 375 YHRRYTKFYGTLGDSAARIAHDAILEHAKWECEIEAWQRPILEDKRFPEWYPITLFNELY 434
Y+RRYTKFYGT GD+AA+IAHDAIL+H WE +IEAWQ+P+LEDKRFPEWYPITLFNELY
Sbjct: 393 YYRRYTKFYGTRGDAAAKIAHDAILDHGHWESQIEAWQKPVLEDKRFPEWYPITLFNELY 452
Query: 435 YLNAGGTIWTDGSPPMQSLATIRERKFSLDTP---------------------------- 466
YLN+GGT+WTDGSPP+ S +I ERKFSLD
Sbjct: 453 YLNSGGTVWTDGSPPVHSFTSIIERKFSLDRSRSDLKNTVSISHHNDTAVDILERMTSVL 512
Query: 467 SKIHTPASSDTALGTRLLENGEENIGQFLYLEGAEYVMYNTYDVHFYSSFALVMLFPKLE 526
++HTP +S++A G LL+ GEENIGQFLYLEG EY M+NT DVHFYSSFAL+MLFPKLE
Sbjct: 513 EQVHTPVTSNSAFGPNLLQQGEENIGQFLYLEGVEYFMWNTNDVHFYSSFALIMLFPKLE 572
Query: 527 LSIQRDFAAAVMMHDPGTMKIMSDGKWVARKCLGAVPHDIGLDDPWFEINSYNLFNSSRW 586
LSIQRDFAA+VMMHDP MK++ +GKWV+RK LGAVPHD+G DPWFE+N YNL+N+ RW
Sbjct: 573 LSIQRDFAASVMMHDPSKMKLLCNGKWVSRKVLGAVPHDVGNYDPWFEVNGYNLYNTDRW 632
Query: 587 KDLNSKFVLQVYRDFVATGDKNFARAVWPSVYIAMAYMEQFDKDGDGMIENEGFPDQTYD 646
KDLN KFVLQVYRD VATGDKNFA+AVWPSVY+A+AYM QFDKDGDGMIENEGFPDQTYD
Sbjct: 633 KDLNPKFVLQVYRDVVATGDKNFAKAVWPSVYVALAYMNQFDKDGDGMIENEGFPDQTYD 692
Query: 647 AWSANGVSAYCGGLWVAALQAASALANDVGDHASASYFWVRYQKAKAVYDSLWNGSYFNY 706
WS +GVSAY GGLWVAALQAASALA VGD S YFW ++QKAK VY LWNGSYFNY
Sbjct: 693 TWSVSGVSAYSGGLWVAALQAASALARVVGDKGSEDYFWFKFQKAKGVYQKLWNGSYFNY 752
Query: 707 DNSDGSSSTSIQADQLAGQWYARACGLLPIADEAKVKKALTKIYDFNVLKVKGGMCGAMN 766
D+SDGSSS+SIQADQLAGQWYARACGL PI DE K K AL K+Y +NVLKV GG GA+N
Sbjct: 753 DDSDGSSSSSIQADQLAGQWYARACGLSPIVDEDKAKSALEKVYHYNVLKVGGGKRGAVN 812
Query: 767 GMQPDGRIDMSGLQAREIWPGVTYGLAASMIQEEMVDMAFQTAAGVYEVAWSEDGLGYSF 826
GM PDG++D + +Q+REIW GVTYG+AA+MI E +VDMAFQTA+GVYE AWS++GLGYSF
Sbjct: 813 GMLPDGKVDTTTMQSREIWSGVTYGVAATMIHEGLVDMAFQTASGVYEAAWSQEGLGYSF 872
Query: 827 QTPESWNNNDEYRSLCYMRPLTIWAMQWALTKPKLSRQEIKHEISDRDSSYLEQHAAFSK 886
QTPESWN +D+YRSLCYMRPL IWAMQWA ++PKL + E E++ + S + QHA FS+
Sbjct: 873 QTPESWNTDDQYRSLCYMRPLAIWAMQWAFSQPKLHKYEANPEMN--EDSLVLQHAGFSR 930
Query: 887 VASLLKLPKEEASKGFLKVVYDFTIGRV 914
VA LLKLP E+ S+ L+V+YD+T R+
Sbjct: 931 VARLLKLPDEDVSRSALQVIYDYTCKRM 958
>gi|255575898|ref|XP_002528846.1| conserved hypothetical protein [Ricinus communis]
gi|223531697|gb|EEF33520.1| conserved hypothetical protein [Ricinus communis]
Length = 952
Score = 1298 bits (3359), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 643/943 (68%), Positives = 760/943 (80%), Gaps = 33/943 (3%)
Query: 3 NGVKEEGENAASM-PKVDPGQPPQLTWQRKLDSNVKVPTGFTLSFQEFRHLAHIGFRLYR 61
NG +E+ E + S+ KVDPG P LTWQRKL+S + F LSFQE LA +G RL+R
Sbjct: 9 NGCQEDREPSDSLLEKVDPGSPASLTWQRKLNSEDIALSQFNLSFQEKFQLAPVGIRLWR 68
Query: 62 YSKQEEAKGKVPVFNVFRKHHITSDQGIPLGGIGAGSIGRSYRGEFQRFKLFHGICDDAP 121
++E AKG+V + N F K ITS GIPLGGIG+GSIGRSY+GEFQR++LF IC++ P
Sbjct: 69 LIREETAKGRVSIINPFLKRFITSCHGIPLGGIGSGSIGRSYKGEFQRWQLFPRICEEKP 128
Query: 122 VLANQFSVFVSRPNGEKFSSVLCPRSPGVPKKNTDSGIESWDWNLKGENCTYHALFPRAW 181
VLANQFSVFVSR +GEK+SSVLCPR+P V + SGI SWDWNLKG+N TYHAL+PRAW
Sbjct: 129 VLANQFSVFVSRSSGEKYSSVLCPRNPEVLMEPAISGIGSWDWNLKGDNSTYHALYPRAW 188
Query: 182 TVYDGEPDPELRIVCRQISPFIPHNYKESSFPASVFTFTLSNSGQTSADVTLLFTWANSV 241
T+YDGEPDPELRIVCRQISP IPHNYKESS+P SVFTFTL NSG+T+ADV+LLFTW NSV
Sbjct: 189 TIYDGEPDPELRIVCRQISPIIPHNYKESSYPVSVFTFTLYNSGKTTADVSLLFTWTNSV 248
Query: 242 AGDSGLSGHHFNSKTMTKDGVHGLTLHHRTANGRPPVTFAVAAEETADVHVSECPCFLLS 301
G+S SG HFNS TM +DGVH + LHH+TA G PPVTFA+AA+ET DVHVS+CP F++S
Sbjct: 249 GGNSEYSGQHFNSTTMMEDGVHAVLLHHKTAEGFPPVTFAIAAQETNDVHVSKCPRFVIS 308
Query: 302 GNSKGITAKDMWNEIKKHGSFDHLDNDKTS-PSEPGSSIGAAIAASLTIPSGSTRSVTFS 360
GN +GITAKDMW+E+K+HGSFD+L + TS PSEPGSSIGAAIAAS+TIP + RSVTFS
Sbjct: 309 GNCQGITAKDMWHEVKEHGSFDNLKSAGTSGPSEPGSSIGAAIAASVTIPPDAIRSVTFS 368
Query: 361 LAWDCPEVKFF-EKVYHRRYTKFYGTLGDSAARIAHDAILEHAKWECEIEAWQRPILEDK 419
L+WDCPEV F + YHRRYTKFY T GD+AARIAHDAILEH WE +I AWQRPILEDK
Sbjct: 369 LSWDCPEVYFMGGRTYHRRYTKFYSTHGDAAARIAHDAILEHGLWESQIVAWQRPILEDK 428
Query: 420 RFPEWYPITLFNELYYLNAGGTIWTDGSPPMQSLATIRERKFSLDTP------------- 466
R PEWYPITLFNELYYLN+GGTIWTDGSPP +L +IR KFSLDT
Sbjct: 429 RLPEWYPITLFNELYYLNSGGTIWTDGSPPYHNLVSIRGSKFSLDTSGAGLKSIIDVTHE 488
Query: 467 ---------------SKIHTPASSDTALGTRLLENGEENIGQFLYLEGAEYVMYNTYDVH 511
+IH +S++A GT LL+ GEENIGQFLYLEG EY M+NTYDVH
Sbjct: 489 NDTAVNILGRMTSALEQIHAHVASNSAFGTNLLQEGEENIGQFLYLEGIEYHMWNTYDVH 548
Query: 512 FYSSFALVMLFPKLELSIQRDFAAAVMMHDPGTMKIMSDGKWVARKCLGAVPHDIGLDDP 571
FYSSFALVMLFPKLELS+QRDFAAAVMMHDP M+++ DG+WV RK LGAVPHDIG++DP
Sbjct: 549 FYSSFALVMLFPKLELSVQRDFAAAVMMHDPSKMQLLHDGQWVCRKVLGAVPHDIGINDP 608
Query: 572 WFEINSYNLFNSSRWKDLNSKFVLQVYRDFVATGDKNFARAVWPSVYIAMAYMEQFDKDG 631
W+E+N+Y+L+N+ RWKDLN KFVLQVYRD VATGDK FA AVWPSVYIAMAYM+QFD+DG
Sbjct: 609 WYEVNAYSLYNTDRWKDLNPKFVLQVYRDVVATGDKKFAEAVWPSVYIAMAYMDQFDRDG 668
Query: 632 DGMIENEGFPDQTYDAWSANGVSAYCGGLWVAALQAASALANDVGDHASASYFWVRYQKA 691
DGMIEN+GFPDQTYD WS +GVSAY GGLWVAALQAASALA +VGD S YFW R+QKA
Sbjct: 669 DGMIENDGFPDQTYDTWSVSGVSAYSGGLWVAALQAASALAREVGDKGSEDYFWARFQKA 728
Query: 692 KAVYDSLWNGSYFNYDNSDGSSSTSIQADQLAGQWYARACGLLPIADEAKVKKALTKIYD 751
K VYD LWNGSYFNYDNS G +S+SIQADQLAGQWYARACGL PI D+ K + AL K+Y+
Sbjct: 729 KDVYDKLWNGSYFNYDNSGGRNSSSIQADQLAGQWYARACGLFPIVDKDKARSALEKVYN 788
Query: 752 FNVLKVKGGMCGAMNGMQPDGRIDMSGLQAREIWPGVTYGLAASMIQEEMVDMAFQTAAG 811
+NVLKVK G GA+NGM PDG++D+S +Q+REIW GVTY LAA+MIQE+M+DMAF TA+G
Sbjct: 789 YNVLKVKDGKRGAINGMLPDGKVDLSSMQSREIWSGVTYALAATMIQEDMLDMAFHTASG 848
Query: 812 VYEVAWSEDGLGYSFQTPESWNNNDEYRSLCYMRPLTIWAMQWALTKPKLSRQEIKHEIS 871
+YE AWSE GLGYSFQTPE+WNN D+YRSLCYMRPL IWAMQWAL++PKL ++E++ E++
Sbjct: 849 IYEAAWSERGLGYSFQTPEAWNNVDQYRSLCYMRPLAIWAMQWALSRPKLEKEEMEMEVN 908
Query: 872 DRDSSYLEQHAAFSKVASLLKLPKEEASKGFLKVVYDFTIGRV 914
+ L HA F+KVA L+LP+ E S G L+ ++++T ++
Sbjct: 909 --EDYLLPHHAGFTKVARFLRLPEGEESLGLLQSLFNYTCKKL 949
>gi|449447581|ref|XP_004141546.1| PREDICTED: non-lysosomal glucosylceramidase-like [Cucumis sativus]
Length = 944
Score = 1269 bits (3285), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 613/941 (65%), Positives = 731/941 (77%), Gaps = 34/941 (3%)
Query: 7 EEGENAASMPKVDPGQPPQLTWQRKLDSNVKVPTGFTLSFQEFRHLAHIGFRLYRYSKQE 66
E+G N AS +VDP +PP LTW+RKLD K P F+ + + H+ G+RL+R K+E
Sbjct: 2 EKGGNGASSTEVDPSKPPSLTWKRKLDFTGKSPESFSFTLTDAWHMGMTGYRLWRNGKEE 61
Query: 67 EAKGKVPVFNVFRKHHITSDQGIPLGGIGAGSIGRSYRGEFQRFKLFHGICDDAPVLANQ 126
AKG++P++ F IT G+ LGGIGAGSIGRSYRGEFQRF++F+G C+D PVLANQ
Sbjct: 62 IAKGRIPIYEFFSDVPITCYHGVSLGGIGAGSIGRSYRGEFQRFQMFYGPCEDEPVLANQ 121
Query: 127 FSVFVSRPNGEKFSSVLCPRSPGVPKKNTDSGIESWDWNLKGENCTYHALFPRAWTVYDG 186
FSVFVSRPNG KFSSVLC P K +GIESWDWNL GEN TYHALFPR+WTVYDG
Sbjct: 122 FSVFVSRPNGNKFSSVLCSAKPQKSKDGKQTGIESWDWNLSGENSTYHALFPRSWTVYDG 181
Query: 187 EPDPELRIVCRQISPFIPHNYKESSFPASVFTFTLSNSGQTSADVTLLFTWANSVAGDSG 246
EPDP+L+IVCRQ+SP IPHNYKESSFP SVFTF LSN GQTSA VTLLFTWANSV G SG
Sbjct: 182 EPDPDLKIVCRQMSPIIPHNYKESSFPVSVFTFKLSNEGQTSAQVTLLFTWANSVGGKSG 241
Query: 247 LSGHHFNSKTMTKDGVHGLTLHHRTANGRPPVTFAVAAEETADVHVSECPCFLLSGNSKG 306
+GHHFNSK +DG G+ LHH++ANGRP VT+ +AAE T DVHVS CPCF++SG+S+G
Sbjct: 242 FTGHHFNSKMGAEDGAQGVLLHHKSANGRPTVTYGIAAEATDDVHVSLCPCFVISGDSEG 301
Query: 307 ITAKDMWNEIKKHGSFDHLDN-DKTSPSEPGSSIGAAIAASLTIPSGSTRSVTFSLAWDC 365
I+AKDMW EIK HGSFD+L + S+PG SIGAA+AA+LTIP S R+VTFSLAWDC
Sbjct: 302 ISAKDMWQEIKNHGSFDNLGSVGANEGSKPGCSIGAAVAATLTIPPTSARTVTFSLAWDC 361
Query: 366 PEVKFFEKVYHRRYTKFYGTLGDSAARIAHDAILEHAKWECEIEAWQRPILEDKRFPEWY 425
PEVKF K YHR+Y+KFYGTLGD+A IA DAI +H KWE EIEAWQRPI+EDKR P+WY
Sbjct: 362 PEVKFDGKTYHRQYSKFYGTLGDAAEIIARDAISKHGKWEAEIEAWQRPIIEDKRLPDWY 421
Query: 426 PITLFNELYYLNAGGTIWTDGSPPMQSLATIRERKFSLD---------TPSKIHT----- 471
P+TLFNELY+LN+GGTIWTDG PP+Q+L+TI +K+ L+ P+ H
Sbjct: 422 PVTLFNELYFLNSGGTIWTDGLPPLQNLSTISHKKYFLERSKSELNGGAPNGDHRKDVAV 481
Query: 472 ----------------PASSDTALGTRLLENGEENIGQFLYLEGAEYVMYNTYDVHFYSS 515
S+ ALGTRLL GEEN+G L +EG++Y+M+NTYDVHFYSS
Sbjct: 482 DILERMSQILDQTHGGAGPSNAALGTRLLHPGEENVGNLLLVEGSQYLMWNTYDVHFYSS 541
Query: 516 FALVMLFPKLELSIQRDFAAAVMMHDPGTMKIMSDGKWVARKCLGAVPHDIGLDDPWFEI 575
FAL+MLFPKLELSIQRDFAAAV+MHDP K M DG WV RK LGAVPHDIG +DPW E+
Sbjct: 542 FALIMLFPKLELSIQRDFAAAVLMHDPRKAKTMCDGNWVPRKVLGAVPHDIGFNDPWLEV 601
Query: 576 NSYNLFNSSRWKDLNSKFVLQVYRDFVATGDKNFARAVWPSVYIAMAYMEQFDKDGDGMI 635
N+YNL N SRWKDL SKFVLQVYRD VATGDKNFA++VWPSVY+A+A+MEQFDKD DGMI
Sbjct: 602 NAYNLLNVSRWKDLGSKFVLQVYRDVVATGDKNFAKSVWPSVYVALAFMEQFDKDKDGMI 661
Query: 636 ENEGFPDQTYDAWSANGVSAYCGGLWVAALQAASALANDVGDHASASYFWVRYQKAKAVY 695
ENEGFPDQTYD W+ GVSAYCGGLWVAALQAASALA++V D A+A YFW++YQKA++VY
Sbjct: 662 ENEGFPDQTYDTWTVKGVSAYCGGLWVAALQAASALASEVDDEAAAHYFWIKYQKARSVY 721
Query: 696 DSLWNGSYFNYDNSDGSSSTSIQADQLAGQWYARACGLLPIADEAKVKKALTKIYDFNVL 755
++LWNGSYFNYDNS G S+SIQADQLAGQWYARACGL PIADE K++ AL KIY+FNV+
Sbjct: 722 ETLWNGSYFNYDNSKGPWSSSIQADQLAGQWYARACGLCPIADEEKIRVALEKIYNFNVM 781
Query: 756 KVKGGMCGAMNGMQPDGRIDMSGLQAREIWPGVTYGLAASMIQEEMVDMAFQTAAGVYEV 815
KVKGG GA+NGM PDG +D S LQ +EIW GVTY +AA+MIQE MV+ FQTA G+Y+
Sbjct: 782 KVKGGTRGAVNGMFPDGSVDKSILQPKEIWAGVTYSVAATMIQEGMVETGFQTAMGIYQA 841
Query: 816 AWSEDGLGYSFQTPESWNNNDEYRSLCYMRPLTIWAMQWA-LTKPKLSRQEIKHEISDRD 874
AW++DGLGYSFQTPE+W+ +D +RS+ YMRPL IWAMQWA ++ K ++ K ++
Sbjct: 842 AWAQDGLGYSFQTPEAWDVDDRFRSIGYMRPLAIWAMQWAMMSDSKPTKVPTKAFSEMQE 901
Query: 875 SSYLEQHAAFSKVASLLKLPKEE--ASKGFLKVVYDFTIGR 913
S++ QHAAF KVASLLKLP + A + ++ YDF R
Sbjct: 902 SAFATQHAAFLKVASLLKLPSNDDTARRSLVEAAYDFICKR 942
>gi|297795757|ref|XP_002865763.1| hypothetical protein ARALYDRAFT_331392 [Arabidopsis lyrata subsp.
lyrata]
gi|297311598|gb|EFH42022.1| hypothetical protein ARALYDRAFT_331392 [Arabidopsis lyrata subsp.
lyrata]
Length = 956
Score = 1268 bits (3280), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 599/934 (64%), Positives = 726/934 (77%), Gaps = 35/934 (3%)
Query: 17 KVDPGQPPQLTWQRKLDSNVKVPTGFTLSFQEFRHLAHIGFRLYRYSKQEEAKGKVPVFN 76
KVDP P LTWQRK+DS+ K P F L+ +E LA +G RL+ ++E AKG++ +
Sbjct: 22 KVDPAVPASLTWQRKIDSDAKPPREFNLTAKEIFQLAPVGIRLWFLCREEAAKGRLAFID 81
Query: 77 VFRKHHITSDQGIPLGGIGAGSIGRSYRGEFQRFKLFHGICDDAPVLANQFSVFVSRPNG 136
F +H +TS G+PLGGIGAGSIGRS++GEFQR++LF C+ PVLANQFS FVSR NG
Sbjct: 82 PFSRHSVTSSHGVPLGGIGAGSIGRSFQGEFQRWQLFPPKCEYEPVLANQFSAFVSRANG 141
Query: 137 EKFSSVLCPRSPGVPKKNTDSGIESWDWNLKGENCTYHALFPRAWTVYDGEPDPELRIVC 196
+K+S+VLCPR+P + K++++SGI SWDWNLKG+ TYHAL+PR+WT+Y+GEPDPELRI+C
Sbjct: 142 KKYSTVLCPRNPKMGKQDSESGIGSWDWNLKGDKSTYHALYPRSWTMYEGEPDPELRIIC 201
Query: 197 RQISPFIPHNYKESSFPASVFTFTLSNSGQTSADVTLLFTWANSVAGDSGLSGHHFNSKT 256
RQ+SPFIPHNYKESSFP SVFTFTL N G T+A+VTLLFTWANSV GDS SG H+NSK
Sbjct: 202 RQVSPFIPHNYKESSFPVSVFTFTLHNLGNTTANVTLLFTWANSVGGDSEFSGGHYNSKI 261
Query: 257 MTKDGVHGLTLHHRTANGRPPVTFAVAAEETADVHVSECPCFLLSGNSKGITAKDMWNEI 316
M DGV G+ LHH+TANG P +++A++A+ T V VS CP F++SG GITAKDMW +
Sbjct: 262 MMNDGVQGVLLHHKTANGLPSLSYAISAQATDGVSVSACPFFIVSGKQNGITAKDMWQAV 321
Query: 317 KKHGSFDHLDNDKTS-PSEPGSSIGAAIAASLTIPSGSTRSVTFSLAWDCPEVKFFE-KV 374
K++GSFDH + S SE GSSIGAA+AASLT+ G +R VTFSLAWDCPEV+F K+
Sbjct: 322 KENGSFDHFKASEASMQSENGSSIGAAVAASLTVLPGESRIVTFSLAWDCPEVQFPSGKI 381
Query: 375 YHRRYTKFYGTLGDSAARIAHDAILEHAKWECEIEAWQRPILEDKRFPEWYPITLFNELY 434
Y RRYTKFYG GD+AA+IA+DAIL H++WE IE WQRPILEDKR P WYP+TLFNELY
Sbjct: 382 YSRRYTKFYGNHGDAAAQIAYDAILGHSQWESWIEDWQRPILEDKRLPAWYPVTLFNELY 441
Query: 435 YLNAGGTIWTDGSPPMQSLATIRERKFSLD----------------------------TP 466
YLN+GGT+WTDGS P+ SLA +RE+KFSLD T
Sbjct: 442 YLNSGGTLWTDGSSPVHSLAGVREKKFSLDKSQLGLNNDIEVPHQNDTAISVLGKMASTL 501
Query: 467 SKIHTPASSDTALGTRLLENGEENIGQFLYLEGAEYVMYNTYDVHFYSSFALVMLFPKLE 526
++H +S++A GT+LLE GEENIG FLYLEG EY M+NTYDVHFY+SFALVMLFPKLE
Sbjct: 502 EQLHASTASNSAFGTKLLEEGEENIGHFLYLEGIEYRMWNTYDVHFYASFALVMLFPKLE 561
Query: 527 LSIQRDFAAAVMMHDPGTMKIMSDGKWVARKCLGAVPHDIGLDDPWFEINSYNLFNSSRW 586
LSIQRDFAAAVM+HDP +K +S+G+WV RK LGAVPHD+G++DPWFE+N Y L N+ RW
Sbjct: 562 LSIQRDFAAAVMLHDPTKVKTLSEGQWVQRKVLGAVPHDLGINDPWFEVNGYTLHNTDRW 621
Query: 587 KDLNSKFVLQVYRDFVATGDKNFARAVWPSVYIAMAYMEQFDKDGDGMIENEGFPDQTYD 646
KDLN KFVLQVYRD VATGDK FA AVWPSVY+AMAYM QFDKDGDGMIENEGFPDQTYD
Sbjct: 622 KDLNPKFVLQVYRDVVATGDKKFASAVWPSVYVAMAYMAQFDKDGDGMIENEGFPDQTYD 681
Query: 647 AWSANGVSAYCGGLWVAALQAASALANDVGDHASASYFWVRYQKAKAVYD-SLWNGSYFN 705
WSA+GVSAYCGGLWVAALQAAS LA VGD S YFW ++QKAK VY+ LWNGSYFN
Sbjct: 682 TWSASGVSAYCGGLWVAALQAASVLARVVGDKNSQDYFWSKFQKAKVVYEKKLWNGSYFN 741
Query: 706 YDNSDGSSSTSIQADQLAGQWYARACGLLPIADEAKVKKALTKIYDFNVLKVKGGMCGAM 765
YDNS S+SIQADQLAGQWYARA GLLPI DE K + AL K+Y+FNV+K+K G GA+
Sbjct: 742 YDNSGSQYSSSIQADQLAGQWYARASGLLPIVDEDKARTALEKVYNFNVMKIKDGKRGAV 801
Query: 766 NGMQPDGRIDMSGLQAREIWPGVTYGLAASMIQEEMVDMAFQTAAGVYEVAWSEDGLGYS 825
NGM P+G++D + +Q+REIW GVTY L+A+MIQE +V+MAFQTA+GVYE AWSE GLGYS
Sbjct: 802 NGMHPNGKVDTASMQSREIWSGVTYALSATMIQEGLVEMAFQTASGVYEAAWSEAGLGYS 861
Query: 826 FQTPESWNNNDEYRSLCYMRPLTIWAMQWALTKPKLSRQEI----KHEISDRDSSYLEQH 881
FQTPE+WN NDEYRSL YMRPL IWAMQWALTK +Q++ + E +S ++
Sbjct: 862 FQTPEAWNTNDEYRSLTYMRPLAIWAMQWALTKTSQKQQQLGLEPEQEPEVETNSLMKHD 921
Query: 882 AAFSKVASLLKLPKEEASKGFLKVVYDFTIGRVL 915
FS+V+ LL LP E ++K L+ ++D+T R++
Sbjct: 922 IGFSRVSRLLNLPNEASAKSTLQTLFDYTCRRLM 955
>gi|224129460|ref|XP_002328722.1| predicted protein [Populus trichocarpa]
gi|222839020|gb|EEE77371.1| predicted protein [Populus trichocarpa]
Length = 922
Score = 1263 bits (3267), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 620/922 (67%), Positives = 738/922 (80%), Gaps = 26/922 (2%)
Query: 17 KVDPGQPPQLTWQRKLDSNVKVPTGFTLSFQEFRHLAHIGFRLYRYSKQEEAKGKVPVFN 76
KVDPG+P LTWQR+LD++ V + FTL++QE +A IG RL+RY K+ K K +
Sbjct: 1 KVDPGKPAPLTWQRRLDTSETVLSQFTLTWQEILRMAPIGIRLWRYVKENAKKKKGIFID 60
Query: 77 VFRKHHITSDQGIPLGGIGAGSIGRSYRGEFQRFKLFHGICDDAPVLANQFSVFVSRPNG 136
F K ++TS GIP+GG+G+GSIGRSYRGEFQR++LF + ++ PVLANQFS+FVSR NG
Sbjct: 61 PFAKRNVTSSHGIPVGGVGSGSIGRSYRGEFQRWQLFPRV-EEKPVLANQFSIFVSRSNG 119
Query: 137 EKFSSVLCPRSPGVPKKNTDSGIESWDWNLKGENCTYHALFPRAWTVYDGEPDPELRIVC 196
+K+ SVLC RSP ++ SGIESW+WNLKG+N TYHAL+PRAWTVY+GEPDPELR+VC
Sbjct: 120 KKYCSVLCSRSPDKLEEPAGSGIESWEWNLKGDNSTYHALYPRAWTVYEGEPDPELRVVC 179
Query: 197 RQISPFIPHNYKESSFPASVFTFTLSNSGQTSADVTLLFTWANSVAGDSGLSGHHFNSKT 256
RQISP IPHNYKESSFP SVFTFTL NSG+T+ADVTLLFTWANSV G S SG H NS
Sbjct: 180 RQISPVIPHNYKESSFPVSVFTFTLYNSGETAADVTLLFTWANSVGGVSEFSGQHLNSTK 239
Query: 257 MTKDGVHGLTLHHRTANGRPPVTFAVAAEETADVHVSECPCFLLSGNSKGITAKDMWNEI 316
M DGVH + LHH+TAN PP+TFA+AA+ET VHVS+CP F++SGNS+G+TAK+MWNE+
Sbjct: 240 MMDDGVHCVLLHHKTANELPPLTFAIAAQETPGVHVSKCPSFVISGNSQGLTAKEMWNEV 299
Query: 317 KKHGSFDHLDND-KTSPSEPGSSIGAAIAASLTIPSGSTRSVTFSLAWDCPEVKFFE-KV 374
K+HGSFD+L++ K PSEPGSSIGAAIAA+ T+P S +VTFSLAWDCPEV F +
Sbjct: 300 KEHGSFDNLNSSGKPVPSEPGSSIGAAIAATSTVPPDSVCTVTFSLAWDCPEVIFASGRT 359
Query: 375 YHRRYTKFYGTLGDSAARIAHDAILEHAKWECEIEAWQRPILEDKRFPEWYPITLFNELY 434
YHRRYTKFYGT GD+AA IAHDAIL H W+ +IEAWQRPILEDKR PEWYP+TLFNELY
Sbjct: 360 YHRRYTKFYGTHGDAAANIAHDAILGHGHWDSQIEAWQRPILEDKRLPEWYPVTLFNELY 419
Query: 435 YLNAGGTIWTDGSPPMQSLATIRERKFSLDTP----------------------SKIHTP 472
YLN+GGTIWTDGS P+ SLAT+ +KFSLD +IHTP
Sbjct: 420 YLNSGGTIWTDGSSPLHSLATVGGKKFSLDRTGSDLGHQGDTSVDILGRMTSVLEQIHTP 479
Query: 473 ASSDTALGTRLLENGEENIGQFLYLEGAEYVMYNTYDVHFYSSFALVMLFPKLELSIQRD 532
++++ALGT LL+ GEEN+GQFLYLEG EY M+NTYDVHFY+SFAL+MLFPKL+LSIQRD
Sbjct: 480 LATNSALGTNLLQEGEENVGQFLYLEGIEYPMWNTYDVHFYASFALIMLFPKLQLSIQRD 539
Query: 533 FAAAVMMHDPGTMKIMSDGKWVARKCLGAVPHDIGLDDPWFEINSYNLFNSSRWKDLNSK 592
FAAAVMMHDP M ++ DG+ V RK LGAVPHDIG+DDPWFE+N+YNL N+ RWKDLN K
Sbjct: 540 FAAAVMMHDPSKMHLLLDGQCVTRKVLGAVPHDIGIDDPWFEVNAYNLHNTDRWKDLNPK 599
Query: 593 FVLQVYRDFVATGDKNFARAVWPSVYIAMAYMEQFDKDGDGMIENEGFPDQTYDAWSANG 652
FVLQVYRD VATGDK FA+A WPSVY+AMAYM+QFDKDGDGMIEN+GFPDQTYD WS +G
Sbjct: 600 FVLQVYRDVVATGDKKFAQAAWPSVYVAMAYMDQFDKDGDGMIENDGFPDQTYDTWSVSG 659
Query: 653 VSAYCGGLWVAALQAASALANDVGDHASASYFWVRYQKAKAVYDSLWNGSYFNYDNSDGS 712
VSAYCGGLWVAALQAASALA +VGD SA YFW R+QKAK VYD LWNGSYFNYD+S+G
Sbjct: 660 VSAYCGGLWVAALQAASALAWEVGDKDSAEYFWFRFQKAKVVYDKLWNGSYFNYDDSNGR 719
Query: 713 SSTSIQADQLAGQWYARACGLLPIADEAKVKKALTKIYDFNVLKVKGGMCGAMNGMQPDG 772
+S SIQADQLAGQWYARACGLLPI DE K + AL KIY++N LKV G GA+NGM PDG
Sbjct: 720 NSPSIQADQLAGQWYARACGLLPIVDEDKARSALEKIYNYNFLKVHDGKRGAVNGMLPDG 779
Query: 773 RIDMSGLQAREIWPGVTYGLAASMIQEEMVDMAFQTAAGVYEVAWSEDGLGYSFQTPESW 832
+DMS +Q+REIW GVTY +AA+M+QE ++DMAF TA+GVYE AW+E GLGYSFQTPE W
Sbjct: 780 TVDMSDMQSREIWSGVTYAVAATMMQEGLMDMAFHTASGVYEAAWAEQGLGYSFQTPEGW 839
Query: 833 NNNDEYRSLCYMRPLTIWAMQWALTKPKLSRQEIKHEISDRDSSYLEQ-HAAFSKVASLL 891
N N +YRSL YMRPL IWAMQW L+ PKL +QE+ ++ DS Q HA F+KVA L
Sbjct: 840 NTNGQYRSLGYMRPLAIWAMQWTLSSPKLHKQEMNFQVKLEDSLLGHQHHAGFAKVARFL 899
Query: 892 KLPKEEASKGFLKVVYDFTIGR 913
KLP+EE+S +L+ ++D+ +
Sbjct: 900 KLPEEESSVSYLQALFDYACKK 921
>gi|224120058|ref|XP_002331126.1| predicted protein [Populus trichocarpa]
gi|222872854|gb|EEF09985.1| predicted protein [Populus trichocarpa]
Length = 948
Score = 1262 bits (3266), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 627/938 (66%), Positives = 742/938 (79%), Gaps = 29/938 (3%)
Query: 3 NGVKEEGENAASMP--KVDPGQPPQLTWQRKLDSNVKVPTGFTLSFQEFRHLAHIGFRLY 60
NG EE ++ P KVD G+P LTWQRKLD + V + FTLS QE +A IG RL+
Sbjct: 9 NGFDEEQGEPSNYPVDKVDAGKPAPLTWQRKLDGDETVLSQFTLSLQEKLLMAPIGIRLW 68
Query: 61 RYSKQEEAKGKVPVF-NVFRKHHITSDQGIPLGGIGAGSIGRSYRGEFQRFKLFHGICDD 119
R+ ++E A+ + + + F K ++TS GIP+GGIG+GSIGRSY+GEFQR++LF IC++
Sbjct: 69 RHIREENARKRGGFYMDPFAKRNVTSCLGIPVGGIGSGSIGRSYKGEFQRWQLFPRICEE 128
Query: 120 APVLANQFSVFVSRPNGEKFSSVLCPRSPGVPKKNTDSGIESWDWNLKGENCTYHALFPR 179
PVLANQFS+FVSR NG+K+SSVLC RSP V ++ SGIESWDWNLKG N TYHAL+PR
Sbjct: 129 KPVLANQFSIFVSRSNGKKYSSVLCSRSPDVLEEAAGSGIESWDWNLKGNNSTYHALYPR 188
Query: 180 AWTVYDGEPDPELRIVCRQISPFIPHNYKESSFPASVFTFTLSNSGQTSADVTLLFTWAN 239
AWTVY+GEPDPELR+VCRQISP IPHNYKESSFPASVFTF L NSG+TSADVTLLFTWAN
Sbjct: 189 AWTVYEGEPDPELRVVCRQISPIIPHNYKESSFPASVFTFKLYNSGKTSADVTLLFTWAN 248
Query: 240 SVAGDSGLSGHHFNSKTMTKDGVHGLTLHHRTANGRPPVTFAVAAEETADVHVSECPCFL 299
SV G S SG H NS M +DGVH L+H+TANG P V+FA+AA+ET VHVS+CPCF+
Sbjct: 249 SVGGVSEFSGQHLNSTKMMEDGVHCALLNHKTANGLPSVSFAIAAQETPVVHVSKCPCFV 308
Query: 300 LSGNSKGITAKDMWNEIKKHGSFDHLDN-DKTSPSEPGSSIGAAIAASLTIPSGSTRSVT 358
+SGNS+G+TAK+MWNE+K+HGSFD+L++ K PSEPGSSIGAA+AA+ T+P +VT
Sbjct: 309 ISGNSQGVTAKEMWNEVKEHGSFDNLNSPGKLVPSEPGSSIGAAVAATSTVPPDGVCTVT 368
Query: 359 FSLAWDCPEVKFFE-KVYHRRYTKFYGTLGDSAARIAHDAILEHAKWECEIEAWQRPILE 417
FSLAWDCP+VKF + YHRRYTKFYGT GD+AA IAHDAILEH WE EIEAWQRPILE
Sbjct: 369 FSLAWDCPDVKFGSGRTYHRRYTKFYGTHGDAAANIAHDAILEHGLWESEIEAWQRPILE 428
Query: 418 DKRFPEWYPITLFNELYYLNAGGTIWTDGSPPMQSLATIRERKFSLDTP----------- 466
DKR PEWY TLFNELYYLN+GGT+WTDGSPP +SLATI KFSLD
Sbjct: 429 DKRLPEWYAGTLFNELYYLNSGGTVWTDGSPPFRSLATIEGSKFSLDRAGSNLGHQGDTA 488
Query: 467 -----------SKIHTPASSDTALGTRLLENGEENIGQFLYLEGAEYVMYNTYDVHFYSS 515
+IHTP ++++A GT LL+ GEENIGQFLYLEG EY M+NTYDVHFY+S
Sbjct: 489 VDILGRMTSALEEIHTPLTTNSAFGTNLLQEGEENIGQFLYLEGIEYHMWNTYDVHFYAS 548
Query: 516 FALVMLFPKLELSIQRDFAAAVMMHDPGTMKIMSDGKWVARKCLGAVPHDIGLDDPWFEI 575
FA +MLFPKL+LSIQRDFAAAVMMHDP M ++ DG+ V RK +GAVPHDIG+ DPWFE+
Sbjct: 549 FAFIMLFPKLQLSIQRDFAAAVMMHDPSMMPLLHDGQRVPRKVIGAVPHDIGIHDPWFEV 608
Query: 576 NSYNLFNSSRWKDLNSKFVLQVYRDFVATGDKNFARAVWPSVYIAMAYMEQFDKDGDGMI 635
N+YNL N+ RWKDLN KFVLQVYRD +ATGDK FARAVWPSVY+AMAYM+QFD+DGDGMI
Sbjct: 609 NAYNLHNTDRWKDLNPKFVLQVYRDVIATGDKKFARAVWPSVYVAMAYMDQFDRDGDGMI 668
Query: 636 ENEGFPDQTYDAWSANGVSAYCGGLWVAALQAASALANDVGDHASASYFWVRYQKAKAVY 695
EN+GFPDQTYD WS +GVSAYCGGLWVAALQAASALA +VGD SA YFW R+QKAK VY
Sbjct: 669 ENDGFPDQTYDTWSLSGVSAYCGGLWVAALQAASALAWEVGDKDSAEYFWCRFQKAKIVY 728
Query: 696 DSLWNGSYFNYDNSDGSSSTSIQADQLAGQWYARACGLLPIADEAKVKKALTKIYDFNVL 755
LWNGSYFNYDNS G +S+SIQADQLAGQWYARACGL PI DE K + AL KIY++NVL
Sbjct: 729 GKLWNGSYFNYDNSGGINSSSIQADQLAGQWYARACGLSPIVDEDKARCALEKIYNYNVL 788
Query: 756 KVKGGMCGAMNGMQPDGRIDMSGLQAREIWPGVTYGLAASMIQEEMVDMAFQTAAGVYEV 815
KV+ G GA+NGM PDG +D+S LQ+REIW GVTY +AA+MIQE + DMAF TA+GVYE
Sbjct: 789 KVQDGKRGAVNGMLPDGTVDLSCLQSREIWSGVTYAVAATMIQEGLTDMAFHTASGVYEA 848
Query: 816 AWSEDGLGYSFQTPESWNNNDEYRSLCYMRPLTIWAMQWALTKPKLSRQEIKHEISDRDS 875
W+E GLGYSFQTPE WN D+YRSLCYMRPL IWAMQWAL++P+L RQE+K + +
Sbjct: 849 VWAEQGLGYSFQTPEGWNTTDQYRSLCYMRPLAIWAMQWALSRPELHRQEMK--LQAEED 906
Query: 876 SYLEQHAAFSKVASLLKLPKEEASKGFLKVVYDFTIGR 913
S HA F+KVA LKLP E+SK + ++++ +
Sbjct: 907 SVPVHHAGFAKVARFLKLPHAESSKSHFQSLFEYATTK 944
>gi|356520149|ref|XP_003528727.1| PREDICTED: LOW QUALITY PROTEIN: non-lysosomal
glucosylceramidase-like [Glycine max]
Length = 944
Score = 1261 bits (3264), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 625/923 (67%), Positives = 737/923 (79%), Gaps = 30/923 (3%)
Query: 18 VDPGQPPQLTWQRKLDSNVKVPTGFTLSFQEFRHLAHIGFRLYRYSKQEEAKGKVPVFNV 77
V+PG+P LTWQRKL++N + +LS +E HLA IG+RL+R+ ++E AKG+ + +
Sbjct: 24 VEPGKPAGLTWQRKLNNNGNASSQISLSLKEIIHLAPIGYRLWRHCREEAAKGRGGMIDP 83
Query: 78 FRKHHITSDQGIPLGGIGAGSIGRSYRGEFQRFKLFHGICDDAPVLANQFSVFVSRPNGE 137
F K H+T G+PLGGIGAGSIGRS+RGEFQR++LF IC++ PVLANQFSVFVSRP+GE
Sbjct: 84 FAKRHVTFCHGVPLGGIGAGSIGRSFRGEFQRWQLFPVICEEKPVLANQFSVFVSRPSGE 143
Query: 138 KFSSVLCPRSPGVPKKNTDSGIESWDWNLKGENCTYHALFPRAWTVYDGEPDPELRIVCR 197
K+SSVLCPR P + K+N SGIESWDWN+ G + TYHAL+PRAWTVY+ EPDP LRIVCR
Sbjct: 144 KYSSVLCPRKPVIIKQNPVSGIESWDWNINGNSSTYHALYPRAWTVYE-EPDPALRIVCR 202
Query: 198 QISPFIPHNYKESSFPASVFTFTLSNSGQTSADVTLLFTWANSVAGDSGLSGHHFNSKTM 257
QISP IPHNYKESSFP +VFTFTL N G T+ADVTLLFTW NSV G S +G HFNSKT
Sbjct: 203 QISPVIPHNYKESSFPVTVFTFTLKNLGNTTADVTLLFTWTNSVGGISEFTGDHFNSKTT 262
Query: 258 TKDGVHGLTLHHRTANGRPPVTFAVAAEETADVHVSECPCFLLSGNSKGITAKDMWNEIK 317
DGVH + LHH+TAN R PVTFA+AAEET DVH+SECP F++SG I+AK+MWNE+K
Sbjct: 263 RNDGVHAVLLHHKTANERSPVTFAIAAEETEDVHISECPVFVISGAYSSISAKNMWNEVK 322
Query: 318 KHGSFDHLDNDKTS-PSEPGSSIGAAIAASLTIPSGSTRSVTFSLAWDCPEVKFFE-KVY 375
+HGSFDHL+ +TS PSEPGSSIGAAIAA++TIPS + R VTFSLAWDCPEVKF E K Y
Sbjct: 323 QHGSFDHLNFAETSAPSEPGSSIGAAIAATVTIPSNAQRIVTFSLAWDCPEVKFPEGKTY 382
Query: 376 HRRYTKFYGTLGDSAARIAHDAILEHAKWECEIEAWQRPILEDKRFPEWYPITLFNELYY 435
+RRYTKFYGT GD+AA IAHDAI+EH +WE +I+ WQRPILEDKRFPEWYP TL NELYY
Sbjct: 383 YRRYTKFYGTNGDAAADIAHDAIMEHCQWEAQIDDWQRPILEDKRFPEWYPTTLLNELYY 442
Query: 436 LNAGGTIWTDGSPPMQSLATIRERKFSLD----------------------TPSKIHTPA 473
LN+GGTIWTDGS P+ L ERKFSLD + H+P
Sbjct: 443 LNSGGTIWTDGSLPVHGLVNTGERKFSLDGFNNNLSHKNDIAINILEMFNSVVEQTHSPP 502
Query: 474 SSDTALGTRLLENGEENIGQFLYLEGAEYVMYNTYDVHFYSSFALVMLFPKLELSIQRDF 533
+S +A G LL+ GEENIGQFLYLEG EY M+NTYDVHFYSSF+LVMLFPKLELSIQRDF
Sbjct: 503 ASKSAYGVNLLQEGEENIGQFLYLEGIEYQMWNTYDVHFYSSFSLVMLFPKLELSIQRDF 562
Query: 534 AAAVMMHDPGTMKIMSDGKWVARKCLGAVPHDIGLDDPWFEINSYNLFNSSRWKDLNSKF 593
AAAV+MHDP MK++ DG+W RK LGAVPHDIGL+DPWFE+N YNL+N+ RWKDLN KF
Sbjct: 563 AAAVLMHDPSKMKLLYDGQWAPRKVLGAVPHDIGLNDPWFEVNGYNLYNTDRWKDLNPKF 622
Query: 594 VLQVYRDFVATGDKNFARAVWPSVYIAMAYMEQFDKDGDGMIENEGFPDQTYDAWSANGV 653
VLQ+YRD VATGDK FA+AVWP+VYIA+AYM+QFDKDGDGMIENEGFPDQTYD WS +GV
Sbjct: 623 VLQIYRDVVATGDKKFAQAVWPAVYIAIAYMDQFDKDGDGMIENEGFPDQTYDTWSVSGV 682
Query: 654 SAYCGGLWVAALQAASALANDVGDHASASYFWVRYQKAKAVYDSLWNGSYFNYDNSDGSS 713
SAY GGLWVAALQAASALA +VGD S YFW+++QKAK+VY++LWNGSYFNYD+S GSS
Sbjct: 683 SAYSGGLWVAALQAASALAREVGDKGSEDYFWLKFQKAKSVYENLWNGSYFNYDSSGGSS 742
Query: 714 STSIQADQLAGQWYARACGLLPIADEAKVKKALTKIYDFNVLKVKGGMCGAMNGMQPDGR 773
S SIQADQLAGQWYARACGLLPI +E K + AL +Y+ NV+KV G GA+NGM PDG+
Sbjct: 743 SLSIQADQLAGQWYARACGLLPIVEEKKSRSALQIVYNHNVMKVHXGRRGAVNGMLPDGK 802
Query: 774 IDMSGLQAREIWPGVTYGLAASMIQEEMVDMAFQTAAGVYEVAWSEDGLGYSFQTPESWN 833
+DMS +Q+REIW GVTY LAA+MIQE M+DMAFQTA GVYE AWS +GLGYSFQTPE+W
Sbjct: 803 VDMSTMQSREIWSGVTYALAATMIQENMIDMAFQTAGGVYETAWSNNGLGYSFQTPEAWT 862
Query: 834 NNDEYRSLCYMRPLTIWAMQWALTKPKLSRQEIKHEISDRD--SSYLEQHAAFSKVASLL 891
DEYRSLCYMRPL IWAMQW L++ K + E K ++ ++D S Y H FSKVA LL
Sbjct: 863 TKDEYRSLCYMRPLAIWAMQWELSRAKHIQHESKSDMKEKDMLSRY---HDGFSKVARLL 919
Query: 892 KLPKEEASKGFLKVVYDFTIGRV 914
K+ +E S+ +V+YDFT R+
Sbjct: 920 KVKEETDSRSLFQVIYDFTCKRM 942
>gi|30695777|ref|NP_199801.2| Beta-glucosidase, GBA2 type family protein [Arabidopsis thaliana]
gi|27311753|gb|AAO00842.1| putative protein [Arabidopsis thaliana]
gi|34365725|gb|AAQ65174.1| At5g49900 [Arabidopsis thaliana]
gi|332008486|gb|AED95869.1| Beta-glucosidase, GBA2 type family protein [Arabidopsis thaliana]
Length = 957
Score = 1252 bits (3240), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 607/955 (63%), Positives = 736/955 (77%), Gaps = 40/955 (4%)
Query: 1 MENGVKEEGENA----ASMPKVDPGQPPQLTWQRKLDSNVKVPTGFTLSFQEFRHLAHIG 56
E + + GE+ +S KVDP P LTWQRK+DS+VK P F LS +E LA +G
Sbjct: 2 FEEKIMDIGEDVKPLNSSDTKVDPAVPASLTWQRKIDSDVKAPREFNLSVKEIFQLAPVG 61
Query: 57 FRLYRYSKQEEAKGKVPVFNVFRKHHITSDQGIPLGGIGAGSIGRSYRGEFQRFKLFHGI 116
RL+ ++E AKG++ + F KH +TS G+PLGGIGAGSIGRS++GEFQR++LF
Sbjct: 62 IRLWFLCREEAAKGRLAFIDPFSKHSVTSSHGVPLGGIGAGSIGRSFKGEFQRWQLFPPK 121
Query: 117 CDDAPVLANQFSVFVSRPNGEKFSSVLCPRSPGVPKKNTDSGIESWDWNLKGENCTYHAL 176
C+D PVLANQFS FVSR NG+K+SSVLCPR+P + K++++SGI SWDWNLKG+ TYHAL
Sbjct: 122 CEDEPVLANQFSAFVSRANGKKYSSVLCPRNPKLDKQDSESGIGSWDWNLKGDKSTYHAL 181
Query: 177 FPRAWTVYDGEPDPELRIVCRQISPFIPHNYKESSFPASVFTFTLSNSGQTSADVTLLFT 236
+PR+WT+Y+GEPDPELRIVCRQ+SPFIPHNYKESSFP SVFTFTL N G T+ADVTLLFT
Sbjct: 182 YPRSWTMYEGEPDPELRIVCRQVSPFIPHNYKESSFPVSVFTFTLHNLGNTTADVTLLFT 241
Query: 237 WANSVAGDSGLSGHHFNSKTMTKDGVHGLTLHHRTANGRPPVTFAVAAEETADVHVSECP 296
WANSV GDS SG H+NSK DGV G+ LHH+TANG P +++A++A+ T V VS CP
Sbjct: 242 WANSVGGDSEFSGGHYNSKITMNDGVQGVLLHHKTANGLPSLSYAISAQATDGVSVSACP 301
Query: 297 CFLLSGNSKGITAKDMWNEIKKHGSFDHLDNDKTS-PSEPGSSIGAAIAASLTIPSGSTR 355
F++SG GITAKDMW +K++GSFDHL + S S+ GSSIGAA+AAS+T+ G +R
Sbjct: 302 FFIVSGKQDGITAKDMWQAVKENGSFDHLKASEASMQSDHGSSIGAAVAASVTVLPGESR 361
Query: 356 SVTFSLAWDCPEVKFFE-KVYHRRYTKFYGTLGDSAARIAHDAILEHAKWECEIEAWQRP 414
VTFSLAWDCPEV+F K+Y RRYTKFYG GD+AA+IAHDAIL H++WE IE WQRP
Sbjct: 362 IVTFSLAWDCPEVQFPSGKIYSRRYTKFYGNNGDAAAQIAHDAILGHSQWESWIEDWQRP 421
Query: 415 ILEDKRFPEWYPITLFNELYYLNAGGTIWTDGSPPMQSLATIRERKFSLD---------- 464
ILEDKR P WYP+TLFNELYYLN+GGT+WTDGS P+ SLA +RE+KFSLD
Sbjct: 422 ILEDKRLPAWYPVTLFNELYYLNSGGTLWTDGSSPVHSLAGVREKKFSLDKSQLGLKNDI 481
Query: 465 ------------------TPSKIHTPASSDTALGTRLLENGEENIGQFLYLEGAEYVMYN 506
T ++H +S++A GT+LLE GEENIG FLYLEG EY M+N
Sbjct: 482 DVPHQNDTAVSVLEKMASTLEELHASTTSNSAFGTKLLEEGEENIGHFLYLEGIEYRMWN 541
Query: 507 TYDVHFYSSFALVMLFPKLELSIQRDFAAAVMMHDPGTMKIMSDGKWVARKCLGAVPHDI 566
TYDVHFY+SFALVMLFPKLELSIQRDFAAAVM+HDP +K +S+G+WV RK LGAVPHD+
Sbjct: 542 TYDVHFYASFALVMLFPKLELSIQRDFAAAVMLHDPTKVKTLSEGQWVQRKVLGAVPHDL 601
Query: 567 GLDDPWFEINSYNLFNSSRWKDLNSKFVLQVYRDFVATGDKNFARAVWPSVYIAMAYMEQ 626
G++DPWFE+N Y L N+ RWKDLN KFVLQVYRD VATGDK FA AVWPSVY+AMAYM Q
Sbjct: 602 GINDPWFEVNGYTLHNTDRWKDLNPKFVLQVYRDVVATGDKKFASAVWPSVYVAMAYMAQ 661
Query: 627 FDKDGDGMIENEGFPDQTYDAWSANGVSAYCGGLWVAALQAASALANDVGDHASASYFWV 686
FDKDGDGMIENEGFPDQTYD WSA+GVSAYCGGLWVAALQAASALA VGD S YFW
Sbjct: 662 FDKDGDGMIENEGFPDQTYDTWSASGVSAYCGGLWVAALQAASALARVVGDKNSQDYFWS 721
Query: 687 RYQKAKAVYD-SLWNGSYFNYDNSDGSSSTSIQADQLAGQWYARACGLLPIADEAKVKKA 745
++QKAK VY+ LWNGSYFNYDNS S++IQADQLAGQWYARA GLLPI DE K + A
Sbjct: 722 KFQKAKVVYEKKLWNGSYFNYDNSGSQYSSTIQADQLAGQWYARASGLLPIVDEDKARTA 781
Query: 746 LTKIYDFNVLKVKGGMCGAMNGMQPDGRIDMSGLQAREIWPGVTYGLAASMIQEEMVDMA 805
L K+Y++NV+K+K G GA+NGM P+G++D + +Q+REIW GVTY L+A+MIQE +V+MA
Sbjct: 782 LEKVYNYNVMKIKDGKRGAVNGMHPNGKVDTASMQSREIWSGVTYALSATMIQEGLVEMA 841
Query: 806 FQTAAGVYEVAWSEDGLGYSFQTPESWNNNDEYRSLCYMRPLTIWAMQWALTKPKLSRQE 865
FQTA+G+YE AWSE GLGYSFQTPESWN DEYRSL YMRPL IWAMQWALTK +++
Sbjct: 842 FQTASGIYEAAWSETGLGYSFQTPESWNTVDEYRSLTYMRPLAIWAMQWALTKTSQKQEQ 901
Query: 866 I-----KHEISDRDSSYLEQHAAFSKVASLLKLPKEEASKGFLKVVYDFTIGRVL 915
+ + E SS ++ FS+V+ LL LP E ++K L+ ++D+T R++
Sbjct: 902 LGLEPEQQEPELEPSSSMKHDIGFSRVSRLLSLPNEASAKSTLQTLFDYTCRRMM 956
>gi|110742221|dbj|BAE99037.1| hypothetical protein [Arabidopsis thaliana]
Length = 957
Score = 1251 bits (3236), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 606/955 (63%), Positives = 736/955 (77%), Gaps = 40/955 (4%)
Query: 1 MENGVKEEGENA----ASMPKVDPGQPPQLTWQRKLDSNVKVPTGFTLSFQEFRHLAHIG 56
E + + GE+ +S KVDP P LTWQRK+DS+VK P F LS +E LA +G
Sbjct: 2 FEEKIMDIGEDVKPLNSSDTKVDPAVPASLTWQRKIDSDVKAPREFNLSVKEIFQLAPVG 61
Query: 57 FRLYRYSKQEEAKGKVPVFNVFRKHHITSDQGIPLGGIGAGSIGRSYRGEFQRFKLFHGI 116
RL+ ++E AKG++ + F KH +TS G+PLGGIGAGSIGRS++GEFQR++LF
Sbjct: 62 IRLWFLCREEAAKGRLAFIDPFSKHSVTSSHGVPLGGIGAGSIGRSFKGEFQRWQLFPPK 121
Query: 117 CDDAPVLANQFSVFVSRPNGEKFSSVLCPRSPGVPKKNTDSGIESWDWNLKGENCTYHAL 176
C+D PVLANQFS FVSR NG+K+SSVLCPR+P + K++++SGI SWDWNLKG+ TYHAL
Sbjct: 122 CEDEPVLANQFSAFVSRANGKKYSSVLCPRNPKLDKQDSESGIGSWDWNLKGDKSTYHAL 181
Query: 177 FPRAWTVYDGEPDPELRIVCRQISPFIPHNYKESSFPASVFTFTLSNSGQTSADVTLLFT 236
+PR+WT+Y+GEPDPELRIVCRQ+SPFIPHNYKESSFP SVFTFTL N G T+ADVTLLFT
Sbjct: 182 YPRSWTMYEGEPDPELRIVCRQVSPFIPHNYKESSFPVSVFTFTLHNLGNTTADVTLLFT 241
Query: 237 WANSVAGDSGLSGHHFNSKTMTKDGVHGLTLHHRTANGRPPVTFAVAAEETADVHVSECP 296
WANSV GDS SG H+NSK DGV G+ LHH+TANG P +++A++A+ T V VS CP
Sbjct: 242 WANSVGGDSEFSGGHYNSKITMNDGVQGVLLHHKTANGLPSLSYAISAQATDGVSVSACP 301
Query: 297 CFLLSGNSKGITAKDMWNEIKKHGSFDHLDNDKTS-PSEPGSSIGAAIAASLTIPSGSTR 355
F++SG GITAKDMW +K++GSFDHL + S S+ GSSIGAA+AAS+T+ G +R
Sbjct: 302 FFIVSGKQDGITAKDMWQAVKENGSFDHLKASEASMQSDHGSSIGAAVAASVTVLPGESR 361
Query: 356 SVTFSLAWDCPEVKFFE-KVYHRRYTKFYGTLGDSAARIAHDAILEHAKWECEIEAWQRP 414
VTFSLAWDCPEV+F K+Y RRYTKFYG GD+AA+IAHDAIL H++WE IE WQRP
Sbjct: 362 IVTFSLAWDCPEVQFPSGKIYSRRYTKFYGNNGDAAAQIAHDAILGHSQWESWIEDWQRP 421
Query: 415 ILEDKRFPEWYPITLFNELYYLNAGGTIWTDGSPPMQSLATIRERKFSLD---------- 464
ILEDKR P WYP+TLFNELYYLN+GGT+WTDGS P+ SLA +RE+KFSLD
Sbjct: 422 ILEDKRLPAWYPVTLFNELYYLNSGGTLWTDGSSPVHSLAGVREKKFSLDKSQLGLKNDI 481
Query: 465 ------------------TPSKIHTPASSDTALGTRLLENGEENIGQFLYLEGAEYVMYN 506
T ++H +S++A GT+LLE GEENIG FLYLEG EY M+N
Sbjct: 482 DVPHQNDTAVSVLEKMASTLEELHASTTSNSAFGTKLLEEGEENIGHFLYLEGIEYRMWN 541
Query: 507 TYDVHFYSSFALVMLFPKLELSIQRDFAAAVMMHDPGTMKIMSDGKWVARKCLGAVPHDI 566
TYDVHFY+SFALVMLFPKLELSIQRDFAAAVM+HDP +K +S+G+WV RK LGAVPHD+
Sbjct: 542 TYDVHFYASFALVMLFPKLELSIQRDFAAAVMLHDPTKVKTLSEGQWVQRKVLGAVPHDL 601
Query: 567 GLDDPWFEINSYNLFNSSRWKDLNSKFVLQVYRDFVATGDKNFARAVWPSVYIAMAYMEQ 626
G++DPWFE+N Y L N+ RWKDLN KFVLQVYRD VATGDK FA AVWPSVY+AMAYM Q
Sbjct: 602 GINDPWFEVNGYTLHNTDRWKDLNPKFVLQVYRDVVATGDKKFASAVWPSVYVAMAYMAQ 661
Query: 627 FDKDGDGMIENEGFPDQTYDAWSANGVSAYCGGLWVAALQAASALANDVGDHASASYFWV 686
FDKDGDGMIENEGFPDQTYD WSA+GVSAYCGGLWVAALQAASALA VGD S YFW
Sbjct: 662 FDKDGDGMIENEGFPDQTYDTWSASGVSAYCGGLWVAALQAASALARVVGDKNSQDYFWS 721
Query: 687 RYQKAKAVYD-SLWNGSYFNYDNSDGSSSTSIQADQLAGQWYARACGLLPIADEAKVKKA 745
++QKAK VY+ LWNGSYFNYDNS S++IQADQLAGQWYARA GLLPI DE K + A
Sbjct: 722 KFQKAKVVYEKKLWNGSYFNYDNSGSQYSSTIQADQLAGQWYARASGLLPIVDEDKARTA 781
Query: 746 LTKIYDFNVLKVKGGMCGAMNGMQPDGRIDMSGLQAREIWPGVTYGLAASMIQEEMVDMA 805
L K+Y++NV+K+K G GA+NGM P+G++D + +Q+REIW GVTY L+A+MIQE +V+MA
Sbjct: 782 LEKVYNYNVMKIKDGKRGAVNGMHPNGKVDTASMQSREIWSGVTYALSATMIQEGLVEMA 841
Query: 806 FQTAAGVYEVAWSEDGLGYSFQTPESWNNNDEYRSLCYMRPLTIWAMQWALTKPKLSRQE 865
FQTA+G+YE AWS+ GLGYSFQTPESWN DEYRSL YMRPL IWAMQWALTK +++
Sbjct: 842 FQTASGIYEAAWSKTGLGYSFQTPESWNTVDEYRSLTYMRPLAIWAMQWALTKTSQKQEQ 901
Query: 866 I-----KHEISDRDSSYLEQHAAFSKVASLLKLPKEEASKGFLKVVYDFTIGRVL 915
+ + E SS ++ FS+V+ LL LP E ++K L+ ++D+T R++
Sbjct: 902 LGLEPEQQEPELEPSSSMKHDIGFSRVSRLLSLPNEASAKSTLQTLFDYTCRRMM 956
>gi|222423179|dbj|BAH19567.1| AT5G49900 [Arabidopsis thaliana]
Length = 957
Score = 1248 bits (3229), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 606/955 (63%), Positives = 734/955 (76%), Gaps = 40/955 (4%)
Query: 1 MENGVKEEGENA----ASMPKVDPGQPPQLTWQRKLDSNVKVPTGFTLSFQEFRHLAHIG 56
E + + GE+ +S KVDP P LTWQRK+DS+VK P F LS +E LA +G
Sbjct: 2 FEEKIMDIGEDVKPLNSSDTKVDPAVPASLTWQRKIDSDVKAPREFNLSVKEIFQLAPVG 61
Query: 57 FRLYRYSKQEEAKGKVPVFNVFRKHHITSDQGIPLGGIGAGSIGRSYRGEFQRFKLFHGI 116
RL+ ++E AKG++ + F KH +TS G+PLGGIGAGSIGRS++GEFQR++LF
Sbjct: 62 IRLWFLCREEAAKGRLAFIDPFSKHSVTSSHGVPLGGIGAGSIGRSFKGEFQRWQLFPPK 121
Query: 117 CDDAPVLANQFSVFVSRPNGEKFSSVLCPRSPGVPKKNTDSGIESWDWNLKGENCTYHAL 176
C+D PVLANQFS FVSR NG+K+SSVLCPR+P + K++++SGI SWDWNLKG+ TYHAL
Sbjct: 122 CEDEPVLANQFSAFVSRANGKKYSSVLCPRNPKLDKQDSESGIGSWDWNLKGDKSTYHAL 181
Query: 177 FPRAWTVYDGEPDPELRIVCRQISPFIPHNYKESSFPASVFTFTLSNSGQTSADVTLLFT 236
+PR+WT+Y+GEPDPELRIVCRQ+SPFIPHNYKESSFP SVFTFTL N G T+ADVTLLFT
Sbjct: 182 YPRSWTMYEGEPDPELRIVCRQVSPFIPHNYKESSFPVSVFTFTLHNLGNTTADVTLLFT 241
Query: 237 WANSVAGDSGLSGHHFNSKTMTKDGVHGLTLHHRTANGRPPVTFAVAAEETADVHVSECP 296
WANSV GDS SG H+NSK DGV G+ LHH+TANG P +++A++A+ T V VS CP
Sbjct: 242 WANSVGGDSEFSGGHYNSKITMNDGVQGVLLHHKTANGLPSLSYAISAQATDGVSVSACP 301
Query: 297 CFLLSGNSKGITAKDMWNEIKKHGSFDHLDNDKTS-PSEPGSSIGAAIAASLTIPSGSTR 355
F++SG GITAKDMW +K++GSFDHL + S S+ GSSIGAA+AAS+T+ G +R
Sbjct: 302 FFIVSGKQDGITAKDMWQAVKENGSFDHLKASEASMQSDHGSSIGAAVAASVTVLPGESR 361
Query: 356 SVTFSLAWDCPEVKFFE-KVYHRRYTKFYGTLGDSAARIAHDAILEHAKWECEIEAWQRP 414
VTFSLAWDCPEV+F K+Y RRYTKFYG GD+AA+IAHDAIL H++W IE WQRP
Sbjct: 362 IVTFSLAWDCPEVQFPSGKIYSRRYTKFYGNNGDAAAQIAHDAILGHSQWGSWIEDWQRP 421
Query: 415 ILEDKRFPEWYPITLFNELYYLNAGGTIWTDGSPPMQSLATIRERKFSLD---------- 464
ILEDKR P WYP+TLFNELYYLN+GGT+WTDGS P+ SLA +RE+KFSLD
Sbjct: 422 ILEDKRLPAWYPVTLFNELYYLNSGGTLWTDGSSPVHSLAGVREKKFSLDKSQLGLKNDI 481
Query: 465 ------------------TPSKIHTPASSDTALGTRLLENGEENIGQFLYLEGAEYVMYN 506
T +H +S++A GT+LLE GEENIG FLYLEG EY M+N
Sbjct: 482 DVPHQNDTAVSVLEKMASTLEGLHASTTSNSAFGTKLLEEGEENIGHFLYLEGIEYRMWN 541
Query: 507 TYDVHFYSSFALVMLFPKLELSIQRDFAAAVMMHDPGTMKIMSDGKWVARKCLGAVPHDI 566
TYDVHFY+SFALVMLFPKLELSIQRDFAAAVM+HDP +K +S+G+WV RK LGAVPHD+
Sbjct: 542 TYDVHFYASFALVMLFPKLELSIQRDFAAAVMLHDPTKVKTLSEGQWVQRKVLGAVPHDL 601
Query: 567 GLDDPWFEINSYNLFNSSRWKDLNSKFVLQVYRDFVATGDKNFARAVWPSVYIAMAYMEQ 626
G++DPWFE+N Y L N+ RWKDLN KFVLQVYRD VATGDK FA AVWPSVY+AMAYM Q
Sbjct: 602 GINDPWFEVNGYTLHNTDRWKDLNPKFVLQVYRDVVATGDKKFASAVWPSVYVAMAYMAQ 661
Query: 627 FDKDGDGMIENEGFPDQTYDAWSANGVSAYCGGLWVAALQAASALANDVGDHASASYFWV 686
FDKDGDGMIENEGFPDQTYD WSA+GVSAYCGGLWVAALQAASALA VGD S YFW
Sbjct: 662 FDKDGDGMIENEGFPDQTYDTWSASGVSAYCGGLWVAALQAASALARVVGDKNSQDYFWS 721
Query: 687 RYQKAKAVYD-SLWNGSYFNYDNSDGSSSTSIQADQLAGQWYARACGLLPIADEAKVKKA 745
++QKAK VY+ LWNGSYFNYDNS S++IQADQLAGQWYARA GLLPI DE K + A
Sbjct: 722 KFQKAKVVYEKKLWNGSYFNYDNSGSQYSSTIQADQLAGQWYARASGLLPIVDEDKARTA 781
Query: 746 LTKIYDFNVLKVKGGMCGAMNGMQPDGRIDMSGLQAREIWPGVTYGLAASMIQEEMVDMA 805
L K+Y++NV+K+K G GA+NGM P+G++D + +Q+REIW GVTY L+A+MIQE +V+MA
Sbjct: 782 LEKVYNYNVMKIKDGKRGAVNGMHPNGKVDTASMQSREIWSGVTYALSATMIQEGLVEMA 841
Query: 806 FQTAAGVYEVAWSEDGLGYSFQTPESWNNNDEYRSLCYMRPLTIWAMQWALTKPKLSRQE 865
FQTA+G+YE AWSE GLGYSFQTPESWN DEYRSL YMRPL IWAMQWALTK +++
Sbjct: 842 FQTASGIYEAAWSETGLGYSFQTPESWNTVDEYRSLTYMRPLAIWAMQWALTKTSQKQEQ 901
Query: 866 I-----KHEISDRDSSYLEQHAAFSKVASLLKLPKEEASKGFLKVVYDFTIGRVL 915
+ + E SS ++ FS+V+ LL LP E ++K L+ ++D+T R++
Sbjct: 902 LGLEPEQQEPELEPSSSMKHDIGFSRVSRLLSLPNEASAKSTLQTLFDYTCRRMM 956
>gi|297851790|ref|XP_002893776.1| hypothetical protein ARALYDRAFT_473529 [Arabidopsis lyrata subsp.
lyrata]
gi|297339618|gb|EFH70035.1| hypothetical protein ARALYDRAFT_473529 [Arabidopsis lyrata subsp.
lyrata]
Length = 947
Score = 1230 bits (3182), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 585/935 (62%), Positives = 713/935 (76%), Gaps = 39/935 (4%)
Query: 18 VDPGQPPQLTWQRKLDSNVKVPTGFTLSFQEFRHLAHIGFRLYRYSKQEEAKGKVPVFNV 77
+D PP +W+RKL+S K P+ F LS ++ HL +G+RL+R++K E AKG+ +F++
Sbjct: 13 IDDKLPP-FSWERKLNSQAKTPSEFKLSKRDHLHLFPLGYRLWRHTKDEAAKGRASIFDI 71
Query: 78 FRKHHITSDQGIPLGGIGAGSIGRSYRGEFQRFKLFHGICDDAPVLANQFSVFVSRPNGE 137
FRKHHIT D G+PLGGIGAGSIGRSY+GEFQ+FKLF IC++AP+L NQFS FVSRP G
Sbjct: 72 FRKHHITGDHGVPLGGIGAGSIGRSYKGEFQQFKLFPKICEEAPILTNQFSAFVSRPGGV 131
Query: 138 KFSSVLCPRSPGVPKKN---------TDSGIESWDWNLKGENCTYHALFPRAWTVYDGEP 188
K S+VLCP P V K N + GIESWDWN+ GE TYHAL+PR+WTVYDGEP
Sbjct: 132 KHSTVLCPSKPQVIKDNGGYLCKGQVPNIGIESWDWNMTGEKSTYHALYPRSWTVYDGEP 191
Query: 189 DPELRIVCRQISPFIPHNYKESSFPASVFTFTLSNSGQTSADVTLLFTWANSVAGDSGLS 248
DPELRIV RQ+SPFIPHNY+ESS P SVF FT++N+G A VTLLFTW NSV G SGL+
Sbjct: 192 DPELRIVSRQVSPFIPHNYEESSLPVSVFNFTVTNTGAEQAIVTLLFTWENSVGGASGLT 251
Query: 249 GHHFNSKTMTKDGVHGLTLHHRTANGRPPVTFAVAAEETADVHVSECPCFLLSGNSKG-I 307
G HFNS KDGVH + L H+TANG PPV++A+AA+ET DV VS CPCF++SG + I
Sbjct: 252 GQHFNSTMKAKDGVHAVALQHKTANGHPPVSYAIAAKETEDVRVSSCPCFIVSGTTPNKI 311
Query: 308 TAKDMWNEIKKHGSFDHLDNDKTSPSEPGSSIGAAIAASLTIPSGSTRSVTFSLAWDCPE 367
TA DMW+EIKK+ SFD L ++ SPS+PG+SIGAAIAA + +P G R+VTFSL+WDCPE
Sbjct: 312 TAGDMWDEIKKNASFDKLTSNACSPSKPGTSIGAAIAAKVKVPPGCDRTVTFSLSWDCPE 371
Query: 368 VKFFEKVYHRRYTKFYGTLGDSAARIAHDAILEHAKWECEIEAWQRPILEDKRFPEWYPI 427
+F EK YHRRYT+FYG+LG++A +AHDA+L ++WE +IE WQ PIL D PEWY I
Sbjct: 372 ARFDEKTYHRRYTRFYGSLGNAAVAMAHDALLNFSEWETQIEEWQAPILADTTLPEWYRI 431
Query: 428 TLFNELYYLNAGGTIWTDGSPPMQSLATIRERKFSLDTP--------------------- 466
TLFNELYY N+GGT+WTDG PP QSL +I RK SL
Sbjct: 432 TLFNELYYFNSGGTMWTDGLPPKQSLDSIGRRKISLSISTTDNTDPDQNDIALDILGRID 491
Query: 467 ---SKIHTPASSDTALGTRLLENGEENIGQFLYLEGAEYVMYNTYDVHFYSSFALVMLFP 523
S+IH P SS+ ALGT +++N ENIGQFLYLEG +Y+MYNTYDVHFYSSFAL+MLFP
Sbjct: 492 AVCSQIHAPLSSNAALGTTMIQNTSENIGQFLYLEGIQYLMYNTYDVHFYSSFALLMLFP 551
Query: 524 KLELSIQRDFAAAVMMHDPGTMKIMSDGKWVARKCLGAVPHDIGLDDPWFEINSYNLFNS 583
KLELSIQRDFAAAV+MHD ++MS G++V RK LGAVPHDIGL+DPWFE+N+YNLFN+
Sbjct: 552 KLELSIQRDFAAAVLMHDSSKKQVMSSGEFVTRKALGAVPHDIGLNDPWFEVNAYNLFNT 611
Query: 584 SRWKDLNSKFVLQVYRDFVATGDKNFARAVWPSVYIAMAYMEQFDKDGDGMIENEGFPDQ 643
RWKDLNSKFVLQVYRD VATGD NFA+AVWPSVY A+AY++QFDKDGDGMIENEGFPDQ
Sbjct: 612 DRWKDLNSKFVLQVYRDVVATGDLNFAKAVWPSVYTAIAYLDQFDKDGDGMIENEGFPDQ 671
Query: 644 TYDAWSANGVSAYCGGLWVAALQAASALANDVGDHASASYFWVRYQKAKAVYDSLWNGSY 703
TYDAWS +GVSAYCGGLWVAALQA SALA +VGD+ +A YF +Y+KA++VY+ LWNGSY
Sbjct: 672 TYDAWSCSGVSAYCGGLWVAALQAGSALAREVGDNGAAVYFNAKYEKARSVYEKLWNGSY 731
Query: 704 FNYDNSDGSSSTSIQADQLAGQWYARACGLLPIADEAKVKKALTKIYDFNVLKVKGGMCG 763
FNYDNS SS+SI ADQLAGQWYARACGL PIA E +KKAL +YDFNV++V+ G G
Sbjct: 732 FNYDNSRSGSSSSILADQLAGQWYARACGLKPIAKEEWIKKALETVYDFNVMRVRDGTRG 791
Query: 764 AMNGMQPDGRIDMSGLQAREIWPGVTYGLAASMIQEEMVDMAFQTAAGVYEVAWSEDGLG 823
A+NGM PDGR+D S + +RE+W G TY +AA MIQE + D F+TA+G+YE AWS+ GLG
Sbjct: 792 AVNGMLPDGRVDTSTMVSREVWAGTTYSVAACMIQEGLADKGFRTASGIYEAAWSDRGLG 851
Query: 824 YSFQTPESWNNNDEYRSLCYMRPLTIWAMQWALTKPKLSRQE---IKHEISDRDSSYLEQ 880
+FQTPE+W NDE+RSLCYMRPL IW +QWA T P+ +R++ +K + D S +Q
Sbjct: 852 CAFQTPEAWTTNDEFRSLCYMRPLAIWGIQWAHTMPQPNREQEVSLKQQEEDATSVLFQQ 911
Query: 881 HAAFSKVASLLKLPKEEASKGFLKVVYDFTIGRVL 915
HA F KVA LK K + + L+ Y+ T RV+
Sbjct: 912 HAGFIKVAHYLKNTKGKDHRNRLQTAYE-TFLRVI 945
>gi|242070593|ref|XP_002450573.1| hypothetical protein SORBIDRAFT_05g007160 [Sorghum bicolor]
gi|241936416|gb|EES09561.1| hypothetical protein SORBIDRAFT_05g007160 [Sorghum bicolor]
Length = 951
Score = 1214 bits (3140), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 594/932 (63%), Positives = 713/932 (76%), Gaps = 46/932 (4%)
Query: 17 KVDPGQPPQLTWQRKLDSNVKVPTGFTLSFQEFRHLAHIGFRLYRYSKQEEAKGKVPVFN 76
KV GQ P LTW+ KL F L+++E LA IG RL R+ +E +KG++ V +
Sbjct: 28 KVYSGQLPVLTWEHKLTRGRHDLPPFRLTWRETMQLAGIGLRLSRHILEETSKGRIAVID 87
Query: 77 VFRKHHITSDQGIPLGGIGAGSIGRSYRGEFQRFKLFHGICDDAPVLANQFSVFVSRPNG 136
+K S QG+PLGG SIGRSY+G+FQR++LF G C+D PVLANQFS F+SR +G
Sbjct: 88 PMKKRSARSGQGVPLGG----SIGRSYKGDFQRWQLFPGTCEDKPVLANQFSAFISRQDG 143
Query: 137 EKFSSVLCPRSPGVPKKNTDSGIESWDWNLKGENCTYHALFPRAWTVYDGEPDPELRIVC 196
K+S+VL P P +PK + SGI SWDWN+ GE TYHAL+PRAWTVYDGEPDPEL IVC
Sbjct: 144 RKYSTVLHPGKPDLPKGSDISGIGSWDWNMSGEQSTYHALYPRAWTVYDGEPDPELNIVC 203
Query: 197 RQISPFIPHNYKESSFPASVFTFTLSNSGQTSADVTLLFTWANSVAGDSGLSGHHFNSKT 256
RQISP IPHNY++SS+P +VFTFT++NSG T+ADVTLLFTWANSV G S L+G+H NS
Sbjct: 204 RQISPIIPHNYQQSSYPVAVFTFTVTNSGHTAADVTLLFTWANSVGGKSELTGYHSNSSM 263
Query: 257 MTKDGVHGLTLHHRTANGRPPVTFAVAAEETADVHVSECPCFLLSGNSKG--ITAKDMWN 314
+ KDGVHG+ LHHRTA+G+PPVTF +AA+E D+ +S+CP F++SG+S TAKDMWN
Sbjct: 264 IEKDGVHGILLHHRTADGQPPVTFVIAAQEKEDILISQCPYFVISGSSASDEFTAKDMWN 323
Query: 315 EIKKHGSFDHLDNDKTSP-SEPGSSIGAAIAASLTIPSGSTRSVTFSLAWDCPEVKFFE- 372
+K+HGSFDHLD KTS S PGSSIGAAIAAS+ + +T+ ++FSLAW CPEVKF
Sbjct: 324 SVKEHGSFDHLDPIKTSMCSRPGSSIGAAIAASVKLAPKATKDISFSLAWACPEVKFSSG 383
Query: 373 KVYHRRYTKFYGTLGDSAARIAHDAILEHAKWECEIEAWQRPILEDKRFPEWYPITLFNE 432
K YHRRYTKFYGT GD+AA +AHDAILEHA WE +I+ WQ PIL+D+RFP WYP+TLFNE
Sbjct: 384 KTYHRRYTKFYGTDGDAAATLAHDAILEHASWERQIDEWQDPILQDERFPAWYPVTLFNE 443
Query: 433 LYYLNAGGTIWTDGSPPMQSLATIRERKFSLDTPS------------------------- 467
LYYLNAGGTIWTDG PP+QSL I +KFSLD +
Sbjct: 444 LYYLNAGGTIWTDGLPPIQSLTAIGWKKFSLDMLNGESDDFNEMVRQNNTASDILHRMAS 503
Query: 468 ---KIHTPASSDTALGTRLLENGEENIGQFLYLEGAEYVMYNTYDVHFYSSFALVMLFPK 524
++H +S++A+GT LL+ GEENIGQFLYLEG EY M+NTYDVHFY+SF+L+MLFPK
Sbjct: 504 VFERMHASIASNSAIGTTLLQ-GEENIGQFLYLEGIEYYMWNTYDVHFYASFSLIMLFPK 562
Query: 525 LELSIQRDFAAAVMMHDPGTMKIMSDGKWVARKCLGAVPHDIGLDDPWFEINSYNLFNSS 584
L+LS+QRDFAAAVMMHDP ++I+ DGKW ARK LGAVPHD+GL DPWF++N+Y L+N+
Sbjct: 563 LQLSVQRDFAAAVMMHDPEKLRILHDGKWAARKVLGAVPHDLGLYDPWFKVNAYTLYNTD 622
Query: 585 RWKDLNSKFVLQVYRDFVATGDKNFARAVWPSVYIAMAYMEQFDKDGDGMIENEGFPDQT 644
RWKDLN KFVLQVYRD VATGDK+FARAVWPSVYIAMAYMEQFDKD DGMIENE FPDQT
Sbjct: 623 RWKDLNPKFVLQVYRDVVATGDKSFARAVWPSVYIAMAYMEQFDKDKDGMIENEDFPDQT 682
Query: 645 YDAWSANGVSAYCGGLWVAALQAASALANDVGDHASASYFWVRYQKAKAVYDSLWNGSYF 704
YD WS G+SAYCGGLWVAALQAASALA +VGD AS FW +Y+KAK+VY LWNGSYF
Sbjct: 683 YDVWSMAGISAYCGGLWVAALQAASALAREVGDKASEKLFWDKYEKAKSVYSKLWNGSYF 742
Query: 705 NYDNSDGSSSTSIQADQLAGQWYARACGLLPIADEAKVKKALTKIYDFNVLKVKGGMCGA 764
+YD+ D STSIQADQLAGQWYA+ACGL PI D+ K + AL KI+ FNV+K K G GA
Sbjct: 743 SYDDGDNKVSTSIQADQLAGQWYAKACGLFPIVDKDKAQSALEKIFSFNVMKFKDGTRGA 802
Query: 765 MNGMQPDGRIDMSGLQAREIWPGVTYGLAASMIQEEMVDMAFQTAAGVYEVAWSEDGLGY 824
MNGM PDG +DMS +Q+REIWPGVTY LAASMIQE MV+ F+TA G+Y AWS +GLGY
Sbjct: 803 MNGMWPDGTVDMSAMQSREIWPGVTYALAASMIQEGMVEQGFKTAEGIYHAAWSPEGLGY 862
Query: 825 SFQTPESWNNNDEYRSLCYMRPLTIWAMQWALTKPKLSRQEIKHEISDRD---SSYLEQH 881
SFQTPE+WNN+DEYRSLCYMRPL IWAMQWAL+ PKL H D S+ +
Sbjct: 863 SFQTPEAWNNDDEYRSLCYMRPLAIWAMQWALSNPKL------HNTPQADIPQDSFPKNQ 916
Query: 882 AAFSKVASLLKLPKEEASKGFLKVVYDFTIGR 913
A++++ LL+LP+EE SK FL+VVY+ R
Sbjct: 917 FAYARITKLLQLPEEETSKSFLRVVYEIIRNR 948
>gi|357157041|ref|XP_003577664.1| PREDICTED: non-lysosomal glucosylceramidase-like [Brachypodium
distachyon]
Length = 951
Score = 1206 bits (3120), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 587/928 (63%), Positives = 721/928 (77%), Gaps = 38/928 (4%)
Query: 18 VDPGQPPQLTWQRKLDSNVK--VPTGFTLSFQEFRHLAHIGFRLYRYSKQEEAKGKVPVF 75
VDPGQ P+LTW+ KL S+V+ +P+ F L ++E +A +GFRL ++ +E +KG+ +
Sbjct: 27 VDPGQLPELTWEHKL-SHVRYDLPS-FGLKWREAVKMAGLGFRLGQHIVEETSKGRTAII 84
Query: 76 NVFRKHHITSDQGIPLGGIGAGSIGRSYRGEFQRFKLFHGICDDAPVLANQFSVFVSRPN 135
+ +K S QG+PLGGIGAGSIGRSY+GEFQR++LF G C+D PVLANQFS F+SR +
Sbjct: 85 DPMKKRTAKSGQGVPLGGIGAGSIGRSYKGEFQRWQLFPGACEDKPVLANQFSAFISRQD 144
Query: 136 GEKFSSVLCPRSPGVPKKNTDSGIESWDWNLKGENCTYHALFPRAWTVYDGEPDPELRIV 195
G K+S+VL P P +PK SGI SWDWNL G+ TYHAL+PRAWTVYDGEPDP+L+IV
Sbjct: 145 GRKYSTVLHPGKPDLPKGTNISGIGSWDWNLSGQKSTYHALYPRAWTVYDGEPDPDLKIV 204
Query: 196 CRQISPFIPHNYKESSFPASVFTFTLSNSGQTSADVTLLFTWANSVAGDSGLSGHHFNSK 255
CRQ+SP IPHNY++SS+PA+VFTFT++NSG T+ DVTLLFTWANSV G S L+G+H NS
Sbjct: 205 CRQVSPIIPHNYQQSSYPAAVFTFTVANSGNTAVDVTLLFTWANSVGGKSELTGYHCNSS 264
Query: 256 TMTKDGVHGLTLHHRTANGRPPVTFAVAAEETADVHVSECPCFLLSGNSKGITAKDMWNE 315
+ KDGVHG+ LHHRTA+G+PPVTFA+AA+E DVH+SECP F++SG+S TAKDMWN
Sbjct: 265 MLEKDGVHGILLHHRTADGQPPVTFAIAAQEKEDVHISECPYFVMSGSSDEFTAKDMWNS 324
Query: 316 IKKHGSFDHLDNDKTSP-SEPGSSIGAAIAASLTIPSGSTRSVTFSLAWDCPEVKFFE-K 373
+K+HGSFD LD + S S PG+SIGAAIAAS+ + ST+ V+FSLAW CPEVKF K
Sbjct: 325 VKEHGSFDLLDPIEASICSRPGTSIGAAIAASVKLAPQSTKDVSFSLAWACPEVKFSSGK 384
Query: 374 VYHRRYTKFYGTLGDSAARIAHDAILEHAKWECEIEAWQRPILEDKRFPEWYPITLFNEL 433
YHRRYTKFYGT D+AA +AHDAI++H+ WE +IE WQ PIL+DKRFP WYP+TLFNEL
Sbjct: 385 TYHRRYTKFYGTDVDAAASLAHDAIVDHSSWEKQIEEWQNPILQDKRFPAWYPVTLFNEL 444
Query: 434 YYLNAGGTIWTDGSPPMQSLATIRERKFSLDTPS-------------------------- 467
YYLNAGG+IWTDG PP+QSL I +KFSLD +
Sbjct: 445 YYLNAGGSIWTDGLPPIQSLTAIGGKKFSLDMSNGETDDVDEMIPHNNTATDILQQMASI 504
Query: 468 --KIHTPASSDTALGTRLLENGEENIGQFLYLEGAEYVMYNTYDVHFYSSFALVMLFPKL 525
+IH +S++A+GT LL+ GEENIGQFLYLEG EY M+NTYDVHFY+SFAL+MLFPKL
Sbjct: 505 LERIHASLASNSAIGTTLLQ-GEENIGQFLYLEGIEYYMWNTYDVHFYASFALIMLFPKL 563
Query: 526 ELSIQRDFAAAVMMHDPGTMKIMSDGKWVARKCLGAVPHDIGLDDPWFEINSYNLFNSSR 585
+LSIQRDFAAAVMMHDP +K++ DGK RK LGAVPHD+GL DPWF++N+Y L N+ R
Sbjct: 564 QLSIQRDFAAAVMMHDPEKLKLLHDGKLAPRKVLGAVPHDLGLYDPWFKVNAYTLHNTDR 623
Query: 586 WKDLNSKFVLQVYRDFVATGDKNFARAVWPSVYIAMAYMEQFDKDGDGMIENEGFPDQTY 645
WKDLN KFVLQVYRD VATG+K+FARAVWPSVY+AMAYMEQFDKD DGMIENE FPDQTY
Sbjct: 624 WKDLNPKFVLQVYRDVVATGNKSFARAVWPSVYMAMAYMEQFDKDKDGMIENEDFPDQTY 683
Query: 646 DAWSANGVSAYCGGLWVAALQAASALANDVGDHASASYFWVRYQKAKAVYDSLWNGSYFN 705
D WS GVSAYCGGLWVAALQAASALA +VGD AS FW +Y+KAK+VY LWNGSYFN
Sbjct: 684 DVWSMAGVSAYCGGLWVAALQAASALAREVGDKASEELFWNKYEKAKSVYGKLWNGSYFN 743
Query: 706 YDNSDGSSSTSIQADQLAGQWYARACGLLPIADEAKVKKALTKIYDFNVLKVKGGMCGAM 765
YD++ STSI ADQLAGQWYA++CGL I D+ K + AL KIY FNV+K K G GAM
Sbjct: 744 YDDAGTKVSTSIHADQLAGQWYAKSCGLSSIVDKDKSQSALEKIYSFNVMKFKDGKRGAM 803
Query: 766 NGMQPDGRIDMSGLQAREIWPGVTYGLAASMIQEEMVDMAFQTAAGVYEVAWSEDGLGYS 825
NGM PDG +DMS +Q+REIWPGVTY LAASMIQE MV+ F+TA G+Y AWS +GLGY+
Sbjct: 804 NGMWPDGTVDMSTMQSREIWPGVTYALAASMIQEGMVEEGFKTAEGIYHAAWSPEGLGYA 863
Query: 826 FQTPESWNNNDEYRSLCYMRPLTIWAMQWALTKPKLSRQEIKHEISDRDSSYLEQHAAFS 885
FQTPE+W N+D YRSLCYMRPL IW++QWAL+ PKL ++ + D S+ + +++
Sbjct: 864 FQTPEAWTNDDGYRSLCYMRPLAIWSIQWALSSPKLHKEPQRDLAQD---SFPKNQFSYA 920
Query: 886 KVASLLKLPKEEASKGFLKVVYDFTIGR 913
++A LL+LP++E+SK FL+V+Y+ R
Sbjct: 921 RIAKLLQLPEDESSKSFLQVIYEIVRNR 948
>gi|449463809|ref|XP_004149624.1| PREDICTED: non-lysosomal glucosylceramidase-like [Cucumis sativus]
gi|449494604|ref|XP_004159595.1| PREDICTED: non-lysosomal glucosylceramidase-like [Cucumis sativus]
Length = 951
Score = 1202 bits (3111), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 607/946 (64%), Positives = 726/946 (76%), Gaps = 36/946 (3%)
Query: 1 MENGVKEEGENA----ASMPKVDPGQPPQLTWQRKLDSNVKVPTGFTLSFQEFRHLAHIG 56
+ NG+ E+ E+ +DPGQ P LTWQRKL+ F L +E HLA +G
Sbjct: 7 LGNGLVEQDEDINKSLTEKTGIDPGQTPSLTWQRKLNREGTSLWQFRLHLKEIIHLAPVG 66
Query: 57 FRLYRYSKQEEAKGKVPVFNVFRKHHITSDQGIPLGGIGAGSIGRSYRGEFQRFKLFHGI 116
+RL+RY ++E KG+ + N F + IT IPLGG+G+GSIGRSYRGEFQR++LF
Sbjct: 67 YRLFRYIREESVKGRGAMINPFIRRRITDTHAIPLGGLGSGSIGRSYRGEFQRWQLFPRK 126
Query: 117 CDDAPVLANQFSVFVSRPNGEKFSSVLCPRSPGVPKKNTDSGIESWDWNLKGENCTYHAL 176
C+D P+LANQFSVFVSRPN EK+S+VLC +SP ++ SGI SWDWNLKG + TYHAL
Sbjct: 127 CEDKPILANQFSVFVSRPNAEKYSTVLCAQSPETQREVESSGIGSWDWNLKGHSSTYHAL 186
Query: 177 FPRAWTVYDGEPDPELRIVCRQISPFIPHNYKESSFPASVFTFTLSNSGQTSADVTLLFT 236
+PRAWT+YDGEPDPELRIVCRQISP IPHNYKESS+P SVFTFTL NSG+T ADV+LLFT
Sbjct: 187 YPRAWTIYDGEPDPELRIVCRQISPVIPHNYKESSYPVSVFTFTLHNSGKTDADVSLLFT 246
Query: 237 WANSVAGDSGLSGHHFNSKTMTKDGVHGLTLHHRTANGRPPVTFAVAAEETADVHVSECP 296
WANSV G S SG+H NS+T KDGVH + LHH+TA+G PPVT+A+AA+E VHVS+CP
Sbjct: 247 WANSVGGLSEYSGNHINSRTKKKDGVHTVLLHHKTASGFPPVTYAIAAQEGNGVHVSDCP 306
Query: 297 CFLLSGNSKGITAKDMWNEIKKHGSFDHLD-NDKTSPSEPGSSIGAAIAASLTIPSGSTR 355
CF++SGNS+GI+AKDMW EIK+HGSFD L D + PSE GSSIGAA++AS+T+ S S R
Sbjct: 307 CFVISGNSQGISAKDMWLEIKEHGSFDRLKFADMSMPSEVGSSIGAAVSASVTVSSDSVR 366
Query: 356 SVTFSLAWDCPEVKFFE-KVYHRRYTKFYGTLGDSAARIAHDAILEHAKWECEIEAWQRP 414
+VTFSL+WDCPEV F K YHRRYTKFYG LGD+AA IA DAILEH WE +I+AWQRP
Sbjct: 367 TVTFSLSWDCPEVNFCRGKTYHRRYTKFYGNLGDAAADIARDAILEHHHWESQIDAWQRP 426
Query: 415 ILEDKRFPEWYPITLFNELYYLNAGGTIWTDGSPPMQSLATIRERKFSLDTP-------- 466
+LEDKRFP+WYP+TLFNELYYLNAGGTIWTDGS P+QSL +I ER F LD
Sbjct: 427 VLEDKRFPKWYPVTLFNELYYLNAGGTIWTDGSLPIQSLVSIGERGFCLDQCRSGDSVIY 486
Query: 467 -------------------SKIHTPASSDTALGTRLLENGEENIGQFLYLEGAEYVMYNT 507
++ +S++A G LL+ GEEN+GQFLYLEG EY M+NT
Sbjct: 487 ESHENDTANDILGRMTSRLDELRDSVASNSAFGVNLLQKGEENVGQFLYLEGVEYTMWNT 546
Query: 508 YDVHFYSSFALVMLFPKLELSIQRDFAAAVMMHDPGTMKIMSDGKWVARKCLGAVPHDIG 567
YDVHFYSSFA++MLFPKLELSIQRDFAAAVMMHDP M+++ +GKW AR LGAVPHDIG
Sbjct: 547 YDVHFYSSFAIIMLFPKLELSIQRDFAAAVMMHDPSKMQLLDNGKWEARNVLGAVPHDIG 606
Query: 568 LDDPWFEINSYNLFNSSRWKDLNSKFVLQVYRDFVATGDKNFARAVWPSVYIAMAYMEQF 627
++DPWFE+N YNL+N+ RWKDLN KFVLQ+YRD VATGD FA+AVWP VY+A+AYM+QF
Sbjct: 607 VNDPWFEVNGYNLYNTDRWKDLNPKFVLQIYRDVVATGDMKFAKAVWPPVYLAIAYMDQF 666
Query: 628 DKDGDGMIENEGFPDQTYDAWSANGVSAYCGGLWVAALQAASALANDVGDHASASYFWVR 687
D+DGDGMIEN+GFPDQTYD WS +GVSAY GGLWVAALQAASALA + + YFW +
Sbjct: 667 DRDGDGMIENDGFPDQTYDTWSVSGVSAYSGGLWVAALQAASALARVADEKDAEHYFWFK 726
Query: 688 YQKAKAVYDSLWNGSYFNYDNSDGSSSTSIQADQLAGQWYARACGLLPIADEAKVKKALT 747
+QKAK Y+ LWNGSYFNYD+S GSSS+SIQADQLAGQWYARA GLLPI DE K K AL
Sbjct: 727 FQKAKRAYEKLWNGSYFNYDSSGGSSSSSIQADQLAGQWYARASGLLPIVDEDKAKSALA 786
Query: 748 KIYDFNVLKVKGGMCGAMNGMQPDGRIDMSGLQAREIWPGVTYGLAASMIQEEMVDMAFQ 807
K+Y++NV+KVK G GA+NGM PDG ID S +Q+REIW GVTY +AASMI E+M DMAF+
Sbjct: 787 KVYNYNVMKVKDGKRGAVNGMLPDGTIDFSSMQSREIWSGVTYAVAASMIHEDMTDMAFR 846
Query: 808 TAAGVYEVAWSEDGLGYSFQTPESWNNNDEYRSLCYMRPLTIWAMQWALTKPKLSRQEIK 867
TA G++E AWSEDGLGY+FQTPE+W D YRSLCYMRPL IWAMQWA ++ +E+K
Sbjct: 847 TAEGIHEAAWSEDGLGYNFQTPEAWTTTDRYRSLCYMRPLAIWAMQWAFSEKVSVIEELK 906
Query: 868 HEISDRDSSYLEQHAAFSKVASLLKLPKEEASKGFLKVVYDFTIGR 913
SD S + HA FSKVA LKLP++ S L+ VYD+T+ R
Sbjct: 907 ELDSDAISRH---HARFSKVARYLKLPEDGTSSSVLQTVYDYTLKR 949
>gi|30692911|ref|NP_174631.2| Beta-glucosidase, GBA2 type protein [Arabidopsis thaliana]
gi|79319136|ref|NP_001031135.1| Beta-glucosidase, GBA2 type protein [Arabidopsis thaliana]
gi|332193493|gb|AEE31614.1| Beta-glucosidase, GBA2 type protein [Arabidopsis thaliana]
gi|332193494|gb|AEE31615.1| Beta-glucosidase, GBA2 type protein [Arabidopsis thaliana]
Length = 947
Score = 1201 bits (3106), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 575/928 (61%), Positives = 707/928 (76%), Gaps = 38/928 (4%)
Query: 18 VDPGQPPQLTWQRKLDSNVKVPTGFTLSFQEFRHLAHIGFRLYRYSKQEEAKGKVPVFNV 77
+D PP +W+RKL+S K P+ F LS ++ HL +G+RL+R++K E AKG+ +F++
Sbjct: 13 IDDKLPP-FSWERKLNSQAKTPSEFKLSKRDHMHLFPLGYRLWRHTKDEAAKGRASIFDI 71
Query: 78 FRKHHITSDQGIPLGGIGAGSIGRSYRGEFQRFKLFHGICDDAPVLANQFSVFVSRPNGE 137
FRKHHIT D G+PLGGIG+GSIGRSY+GEFQ+FKLF IC++AP+L NQFS FVSRP G
Sbjct: 72 FRKHHITGDHGVPLGGIGSGSIGRSYKGEFQQFKLFPKICEEAPILTNQFSAFVSRPGGV 131
Query: 138 KFSSVLCPRSPGVPKKN---------TDSGIESWDWNLKGENCTYHALFPRAWTVYDGEP 188
K S+VLCP P V K N + GIESWDWN+ GE TYHAL+PR+WTVYDGEP
Sbjct: 132 KHSTVLCPSKPQVVKDNGGYLCQGQAPNMGIESWDWNMTGEKSTYHALYPRSWTVYDGEP 191
Query: 189 DPELRIVCRQISPFIPHNYKESSFPASVFTFTLSNSGQTSADVTLLFTWANSVAGDSGLS 248
DPELRIV RQ+SPFIPHNY+ESS P SVF FT++N+G A VTLLFTW NSV G SGL+
Sbjct: 192 DPELRIVSRQVSPFIPHNYEESSLPVSVFNFTVTNTGAEPAIVTLLFTWENSVGGASGLT 251
Query: 249 GHHFNSKTMTKDGVHGLTLHHRTANGRPPVTFAVAAEETADVHVSECPCFLLSGNSKG-I 307
G HFNS KDGVH + L H+TANG PPV++A+AA+ET DV VS CPCF++SG + I
Sbjct: 252 GQHFNSTMKAKDGVHAVALQHKTANGHPPVSYAIAAKETEDVRVSSCPCFIVSGTTPNQI 311
Query: 308 TAKDMWNEIKKHGSFDHLDNDKTSPSEPGSSIGAAIAASLTIPSGSTRSVTFSLAWDCPE 367
TA DMW+EIKK+ SFD L ++ SPS+PG+SIGAAIAA + +P G R+VTFSL+WDCPE
Sbjct: 312 TAGDMWDEIKKNASFDKLTSNACSPSKPGTSIGAAIAAKVKVPPGCDRTVTFSLSWDCPE 371
Query: 368 VKFFEKVYHRRYTKFYGTLGDSAARIAHDAILEHAKWECEIEAWQRPILEDKRFPEWYPI 427
+F EK YHRRYT+FYG+LG++A +AHDA+L +WE +IE WQ P+L D PEWY +
Sbjct: 372 ARFDEKTYHRRYTRFYGSLGNAAVAMAHDALLNFPEWETQIEEWQAPVLADTTLPEWYRV 431
Query: 428 TLFNELYYLNAGGTIWTDGSPPMQSLATIRERKFSLDTP--------------------- 466
TLFNELYY N+GGT+WTDG PP QSL +I RK SL
Sbjct: 432 TLFNELYYFNSGGTMWTDGLPPKQSLDSIGRRKISLGLSTIDKNDQDQNNVALDILGRID 491
Query: 467 ---SKIHTPASSDTALGTRLLENGEENIGQFLYLEGAEYVMYNTYDVHFYSSFALVMLFP 523
S+IH P +S+ ALG +++N ENIGQFLYLEG +Y+MYNTYDVHFYSSFAL+MLFP
Sbjct: 492 AVCSQIHAPLTSNAALGNTMIQNTTENIGQFLYLEGVQYLMYNTYDVHFYSSFALLMLFP 551
Query: 524 KLELSIQRDFAAAVMMHDPGTMKIMSDGKWVARKCLGAVPHDIGLDDPWFEINSYNLFNS 583
K+ELSIQRDFAAAV+MHD ++MS G++V RK LGAVPHDIGL+DPWFE+N+YNLFN+
Sbjct: 552 KIELSIQRDFAAAVLMHDSSKKQVMSSGEFVTRKVLGAVPHDIGLNDPWFELNAYNLFNT 611
Query: 584 SRWKDLNSKFVLQVYRDFVATGDKNFARAVWPSVYIAMAYMEQFDKDGDGMIENEGFPDQ 643
RWKDLNSKFVLQVYRD VATGD NFA+AVWPSVY A+AY++QFDKDGDGMIENEGFPDQ
Sbjct: 612 DRWKDLNSKFVLQVYRDVVATGDLNFAKAVWPSVYTAIAYLDQFDKDGDGMIENEGFPDQ 671
Query: 644 TYDAWSANGVSAYCGGLWVAALQAASALANDVGDHASASYFWVRYQKAKAVYDSLWNGSY 703
TYDAWS +GVSAYCGGLWVAALQA SALA ++GD+ +A YF +Y+KA++VY+ LWNGSY
Sbjct: 672 TYDAWSCSGVSAYCGGLWVAALQAGSALAREIGDNGAAVYFNAKYEKARSVYEKLWNGSY 731
Query: 704 FNYDNSDGSSSTSIQADQLAGQWYARACGLLPIADEAKVKKALTKIYDFNVLKVKGGMCG 763
FNYDNS SS+SI ADQ+AGQWYARACGL PIA E +KKAL +YDFNV++V+ G G
Sbjct: 732 FNYDNSRSGSSSSILADQMAGQWYARACGLKPIAKEEWIKKALETVYDFNVMRVRDGTRG 791
Query: 764 AMNGMQPDGRIDMSGLQAREIWPGVTYGLAASMIQEEMVDMAFQTAAGVYEVAWSEDGLG 823
A+NGM PDGR+D S + +RE+W G TY +AA MIQE + D F+TA+G+YE AWS+ GLG
Sbjct: 792 AVNGMLPDGRVDTSTMVSREVWAGTTYSVAACMIQEGLADKGFRTASGIYEAAWSDRGLG 851
Query: 824 YSFQTPESWNNNDEYRSLCYMRPLTIWAMQWALTKPKLSRQE---IKHEISDRDSSYLEQ 880
+FQTPE+W NDEYRSLCYMRPL IW +QWA T PK +R++ ++ + D S +Q
Sbjct: 852 CAFQTPEAWTTNDEYRSLCYMRPLAIWGIQWAHTMPKPNREQEVSLRPQEEDATSVLFQQ 911
Query: 881 HAAFSKVASLLKLPKEEASKGFLKVVYD 908
HA F KVA LK K + + L+ Y+
Sbjct: 912 HAGFIKVAHYLKNTKGKDHRNRLQTAYE 939
>gi|115484889|ref|NP_001067588.1| Os11g0242100 [Oryza sativa Japonica Group]
gi|77549531|gb|ABA92328.1| expressed protein [Oryza sativa Japonica Group]
gi|113644810|dbj|BAF27951.1| Os11g0242100 [Oryza sativa Japonica Group]
gi|215704397|dbj|BAG93831.1| unnamed protein product [Oryza sativa Japonica Group]
gi|222615769|gb|EEE51901.1| hypothetical protein OsJ_33494 [Oryza sativa Japonica Group]
Length = 950
Score = 1201 bits (3106), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 595/955 (62%), Positives = 741/955 (77%), Gaps = 51/955 (5%)
Query: 1 MENGVKEEGENAA-SMPK-------VDPGQPPQLTWQRKLDSNV--KVPTGFTLSFQEFR 50
+ENGV E+ + + + P+ VDPG P+LTW+ KL SN+ +P+ F L+++E
Sbjct: 2 VENGVLEQPKGVSRNRPRAQSNDHPVDPGYLPELTWEHKL-SNIGYDLPS-FRLTWRETF 59
Query: 51 HLAHIGFRLYRYSKQEEAKGKVPVFNVFRKHHITSDQGIPLGGIGAGSIGRSYRGEFQRF 110
LA +G RL R+ +E +KG+ V + +K S QG+PLGGIG+GSIGRSY+GEFQR+
Sbjct: 60 QLAGLGLRLGRHILEETSKGRAAVIDPMKKRIAKSGQGVPLGGIGSGSIGRSYKGEFQRW 119
Query: 111 KLFHGICDDAPVLANQFSVFVSRPNGEKFSSVLCPRSPGVPKKNTDSGIESWDWNLKGEN 170
+LF G C++ PVLANQFS F+SR +G +SSVL P P +PK + SGI SWDWN+ G+N
Sbjct: 120 QLFPGTCEERPVLANQFSAFISRKDGRNYSSVLHPGKPDLPKGSNISGIGSWDWNMSGQN 179
Query: 171 CTYHALFPRAWTVYDGEPDPELRIVCRQISPFIPHNYKESSFPASVFTFTLSNSGQTSAD 230
TYHAL+PR+WT+Y+GEPDP++ IVCRQISP IPHNY++SS+P SVFTFT++NSG T+AD
Sbjct: 180 STYHALYPRSWTIYNGEPDPDVNIVCRQISPIIPHNYQQSSYPVSVFTFTVTNSGNTAAD 239
Query: 231 VTLLFTWANSVAGDSGLSGHHFNSKTMTKDGVHGLTLHHRTANGRPPVTFAVAAEETADV 290
VTLLFTWANSV G S L+G+H NS + KDGVHG+ LHHRTANG+PPVTFA+AA+E D+
Sbjct: 240 VTLLFTWANSVGGKSELTGYHSNSPMIEKDGVHGILLHHRTANGQPPVTFAIAAQEKEDI 299
Query: 291 HVSECPCFLLSGNSKGITAKDMWNEIKKHGSFDHLDNDKTSP-SEPGSSIGAAIAASLTI 349
H+SECP F++SG+S +AKDMWN +K++GSFD+LD KTS S+PG SIGAAIAAS+ +
Sbjct: 300 HISECPYFIISGSSDAFSAKDMWNYVKENGSFDNLDLTKTSMCSKPGLSIGAAIAASVKL 359
Query: 350 PSGSTRSVTFSLAWDCPEVKFFE-KVYHRRYTKFYGTLGDSAARIAHDAILEHAKWECEI 408
P +T++V+F+LAW CPEVKF K YHRRYTKF+GT D+AA +AHDAILEH WE +I
Sbjct: 360 PPQTTQNVSFALAWACPEVKFSSGKTYHRRYTKFHGTDNDAAASLAHDAILEHNSWERQI 419
Query: 409 EAWQRPILEDKRFPEWYPITLFNELYYLNAGGTIWTDGSPPMQSLATIRERKFSLDTPS- 467
E WQ PIL+D+RFP+WYP+TLFNELYYLNAGGTIWTDG PP+QSL I E+KFSLD +
Sbjct: 420 EEWQNPILQDERFPDWYPVTLFNELYYLNAGGTIWTDGLPPIQSLTGIGEKKFSLDMQNG 479
Query: 468 ---------------------------KIHTPASSDTALGTRLLENGEENIGQFLYLEGA 500
+IH S++A+GT LL+ GEENIGQFLYLEG
Sbjct: 480 DADDANGIIPRNNTASDILNQMASVLERIHASMESNSAIGTTLLQ-GEENIGQFLYLEGI 538
Query: 501 EYVMYNTYDVHFYSSFALVMLFPKLELSIQRDFAAAVMMHDPGTMKIMSDGKWVARKCLG 560
EY M+NTYDVHFY+SF+L+MLFPKL+LSIQRDFAAAV+MHDP ++++ DGKWVARK LG
Sbjct: 539 EYYMWNTYDVHFYASFSLIMLFPKLQLSIQRDFAAAVLMHDPEKLRMLHDGKWVARKVLG 598
Query: 561 AVPHDIGLDDPWFEINSYNLFNSSRWKDLNSKFVLQVYRDFVATGDKNFARAVWPSVYIA 620
AVPHD+GL DPWF++N+Y L+N+ RWKDLN KFVLQVYRD VATGDK+FARAVWPSVY+A
Sbjct: 599 AVPHDLGLYDPWFKVNAYTLYNTDRWKDLNPKFVLQVYRDVVATGDKSFARAVWPSVYMA 658
Query: 621 MAYMEQFDKDGDGMIENEGFPDQTYDAWSANGVSAYCGGLWVAALQAASALANDVGDHAS 680
MAYMEQFD+D DGMIENE FPDQTYD WS G+SAYCGGLWVAALQAASALA++VGD AS
Sbjct: 659 MAYMEQFDRDKDGMIENEDFPDQTYDVWSMAGISAYCGGLWVAALQAASALAHEVGDKAS 718
Query: 681 ASYFWVRYQKAKAVYDSLWNGSYFNYDNSDGSSSTSIQADQLAGQWYARACGLLPIADEA 740
FW +Y+KAK+VY LWNGSYFNYD+ D S SI ADQLAGQWYA+ACGL PI D+
Sbjct: 719 EKLFWDKYEKAKSVYGKLWNGSYFNYDDGDNIMSASIHADQLAGQWYAKACGLFPIVDKD 778
Query: 741 KVKKALTKIYDFNVLKVKGGMCGAMNGMQPDGRIDMSGLQAREIWPGVTYGLAASMIQEE 800
K + AL KIY FNV+K K G GAMNGM PDG +DMS +Q+REIWPGVTY LAA+MIQE
Sbjct: 779 KAESALEKIYSFNVMKFKDGKRGAMNGMWPDGTVDMSAMQSREIWPGVTYALAATMIQEG 838
Query: 801 MVDMAFQTAAGVYEVAWSEDGLGYSFQTPESWNNNDEYRSLCYMRPLTIWAMQWALTKPK 860
MV+ F+TA G+Y AWS +GLGYSFQTPE+WNN+DEYRSLCYMRPL IWA+QWAL+ PK
Sbjct: 839 MVEKGFKTAEGIYHAAWSPEGLGYSFQTPEAWNNDDEYRSLCYMRPLAIWAIQWALSNPK 898
Query: 861 LSRQ--EIKHEISDRDSSYLEQHAAFSKVASLLKLPKEEASKGFLKVVYDFTIGR 913
L +Q +I + S+ + +++++A LL LP++E+ K FL+V+Y+ R
Sbjct: 899 LHKQTADIPQD------SFPKNQFSYARIAKLLHLPEDESPKSFLRVIYEIVRNR 947
>gi|218185509|gb|EEC67936.1| hypothetical protein OsI_35656 [Oryza sativa Indica Group]
Length = 950
Score = 1201 bits (3106), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 595/955 (62%), Positives = 741/955 (77%), Gaps = 51/955 (5%)
Query: 1 MENGVKEEGENAA-SMPK-------VDPGQPPQLTWQRKLDSNV--KVPTGFTLSFQEFR 50
+ENGV E+ + + + P+ VDPG P+LTW+ KL SN+ +P+ F L+++E
Sbjct: 2 VENGVLEQPKGVSRNRPRAQSNDHPVDPGYLPELTWEHKL-SNIGYDLPS-FRLTWRETF 59
Query: 51 HLAHIGFRLYRYSKQEEAKGKVPVFNVFRKHHITSDQGIPLGGIGAGSIGRSYRGEFQRF 110
LA +G RL R+ +E +KG+ V + +K S QG+PLGGIG+GSIGRSY+GEFQR+
Sbjct: 60 QLAGLGLRLGRHILEETSKGRAAVIDPMKKRIAKSGQGVPLGGIGSGSIGRSYKGEFQRW 119
Query: 111 KLFHGICDDAPVLANQFSVFVSRPNGEKFSSVLCPRSPGVPKKNTDSGIESWDWNLKGEN 170
+LF G C++ PVLANQFS F+SR +G +SSVL P P +PK + SGI SWDWN+ G+N
Sbjct: 120 QLFPGTCEERPVLANQFSAFISRKDGRNYSSVLHPGKPDLPKGSNISGIGSWDWNMSGQN 179
Query: 171 CTYHALFPRAWTVYDGEPDPELRIVCRQISPFIPHNYKESSFPASVFTFTLSNSGQTSAD 230
TYHAL+PR+WT+Y+GEPDP++ IVCRQISP IPHNY++SS+P SVFTFT++NSG T+AD
Sbjct: 180 STYHALYPRSWTIYNGEPDPDVNIVCRQISPIIPHNYQQSSYPVSVFTFTVTNSGNTAAD 239
Query: 231 VTLLFTWANSVAGDSGLSGHHFNSKTMTKDGVHGLTLHHRTANGRPPVTFAVAAEETADV 290
VTLLFTWANSV G S L+G+H NS + KDGVHG+ LHHRTANG+PPVTFA+AA+E DV
Sbjct: 240 VTLLFTWANSVGGKSELTGYHSNSPMIEKDGVHGILLHHRTANGQPPVTFAIAAQEKEDV 299
Query: 291 HVSECPCFLLSGNSKGITAKDMWNEIKKHGSFDHLDNDKTSP-SEPGSSIGAAIAASLTI 349
H+SECP F++SG+S +AKDMWN +K++GSFD+LD KTS S+PG SIGAAIAAS+ +
Sbjct: 300 HISECPYFIISGSSDAFSAKDMWNYVKENGSFDNLDLTKTSMCSKPGLSIGAAIAASVKL 359
Query: 350 PSGSTRSVTFSLAWDCPEVKFFE-KVYHRRYTKFYGTLGDSAARIAHDAILEHAKWECEI 408
P +T++V+F+LAW CPEVKF K YHRRYTKF+GT D+AA +AHDAILEH WE +I
Sbjct: 360 PPQTTQNVSFALAWACPEVKFSSGKTYHRRYTKFHGTDNDAAASLAHDAILEHNSWERQI 419
Query: 409 EAWQRPILEDKRFPEWYPITLFNELYYLNAGGTIWTDGSPPMQSLATIRERKFSLDTPS- 467
E WQ PIL+D+RFP+WYP+TLFNELYYLNAGGTIWTDG PP+QSL I E+KFSLD +
Sbjct: 420 EEWQNPILQDERFPDWYPVTLFNELYYLNAGGTIWTDGLPPIQSLTGIGEKKFSLDMQNG 479
Query: 468 ---------------------------KIHTPASSDTALGTRLLENGEENIGQFLYLEGA 500
+IH S++A+GT LL+ GEENIGQFLYLEG
Sbjct: 480 DADDANGIIPRNNTASDILNQMASVLERIHASMESNSAIGTTLLQ-GEENIGQFLYLEGI 538
Query: 501 EYVMYNTYDVHFYSSFALVMLFPKLELSIQRDFAAAVMMHDPGTMKIMSDGKWVARKCLG 560
EY M+NTYDVHFY+SF+L+MLFPKL+LSIQRDFAAAV+MHDP ++++ DGKWVARK LG
Sbjct: 539 EYYMWNTYDVHFYASFSLIMLFPKLQLSIQRDFAAAVLMHDPEKLRMLHDGKWVARKVLG 598
Query: 561 AVPHDIGLDDPWFEINSYNLFNSSRWKDLNSKFVLQVYRDFVATGDKNFARAVWPSVYIA 620
A+PHD+GL DPWF++N+Y L+N+ RWKDLN KFVLQVYRD VATGDK+FARAVWPSVY+A
Sbjct: 599 AIPHDLGLYDPWFKVNAYTLYNTDRWKDLNPKFVLQVYRDVVATGDKSFARAVWPSVYMA 658
Query: 621 MAYMEQFDKDGDGMIENEGFPDQTYDAWSANGVSAYCGGLWVAALQAASALANDVGDHAS 680
MAYMEQFD+D DGMIENE FPDQTYD WS G+SAYCGGLWVAALQAASALA++VGD AS
Sbjct: 659 MAYMEQFDRDKDGMIENEDFPDQTYDVWSMAGISAYCGGLWVAALQAASALAHEVGDKAS 718
Query: 681 ASYFWVRYQKAKAVYDSLWNGSYFNYDNSDGSSSTSIQADQLAGQWYARACGLLPIADEA 740
FW +Y+KAK+VY LWNGSYFNYD+ D S SI ADQLAGQWYA+ACGL PI D+
Sbjct: 719 EKLFWDKYEKAKSVYGKLWNGSYFNYDDGDNIMSASIHADQLAGQWYAKACGLFPIVDKD 778
Query: 741 KVKKALTKIYDFNVLKVKGGMCGAMNGMQPDGRIDMSGLQAREIWPGVTYGLAASMIQEE 800
K + AL KIY FNV+K K G GAMNGM PDG +DMS +Q+REIWPGVTY LAA+MIQE
Sbjct: 779 KAESALEKIYSFNVMKFKDGKRGAMNGMWPDGTVDMSAMQSREIWPGVTYALAATMIQEG 838
Query: 801 MVDMAFQTAAGVYEVAWSEDGLGYSFQTPESWNNNDEYRSLCYMRPLTIWAMQWALTKPK 860
MV+ F+TA G+Y AWS +GLGYSFQTPE+WNN+DEYRSLCYMRPL IWA+QWAL+ PK
Sbjct: 839 MVEKGFKTAEGIYHAAWSPEGLGYSFQTPEAWNNDDEYRSLCYMRPLAIWAIQWALSNPK 898
Query: 861 LSRQ--EIKHEISDRDSSYLEQHAAFSKVASLLKLPKEEASKGFLKVVYDFTIGR 913
L +Q +I + S+ + +++++A LL LP++E+ K FL+V+Y+ R
Sbjct: 899 LHKQTADIPQD------SFPKNQFSYARIAKLLHLPEDESPKSFLRVIYEIVRNR 947
>gi|28393605|gb|AAO42222.1| unknown protein [Arabidopsis thaliana]
Length = 947
Score = 1197 bits (3098), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 574/928 (61%), Positives = 706/928 (76%), Gaps = 38/928 (4%)
Query: 18 VDPGQPPQLTWQRKLDSNVKVPTGFTLSFQEFRHLAHIGFRLYRYSKQEEAKGKVPVFNV 77
+D PP +W+RKL+S K P+ F LS ++ HL +G+RL+R++K E AKG+ +F++
Sbjct: 13 IDDKLPP-FSWERKLNSQAKTPSEFKLSKRDHMHLFPLGYRLWRHTKDEAAKGRASIFDI 71
Query: 78 FRKHHITSDQGIPLGGIGAGSIGRSYRGEFQRFKLFHGICDDAPVLANQFSVFVSRPNGE 137
FRKHHIT D G+PLGGIG+GSIGRSY+GEFQ+FKLF IC++AP+L NQFS FVSRP G
Sbjct: 72 FRKHHITGDHGVPLGGIGSGSIGRSYKGEFQQFKLFPKICEEAPILTNQFSAFVSRPGGV 131
Query: 138 KFSSVLCPRSPGVPKKN---------TDSGIESWDWNLKGENCTYHALFPRAWTVYDGEP 188
K S+VLCP P V K N + GIESWDWN+ GE TYHAL+PR+WTVYDGEP
Sbjct: 132 KHSTVLCPSKPQVVKDNGGYLCQGQAPNMGIESWDWNMTGEKSTYHALYPRSWTVYDGEP 191
Query: 189 DPELRIVCRQISPFIPHNYKESSFPASVFTFTLSNSGQTSADVTLLFTWANSVAGDSGLS 248
DPELRIV RQ+SPFIPHNY+ESS P SVF FT++N+G A VTLLFTW NSV G SGL+
Sbjct: 192 DPELRIVSRQVSPFIPHNYEESSLPVSVFNFTVTNTGAEPAIVTLLFTWENSVGGASGLT 251
Query: 249 GHHFNSKTMTKDGVHGLTLHHRTANGRPPVTFAVAAEETADVHVSECPCFLLSGNSKG-I 307
G HFNS KDGVH + L H+TANG PPV++A+AA+ET DV VS CPCF++SG + I
Sbjct: 252 GQHFNSTMKAKDGVHAVALQHKTANGHPPVSYAIAAKETEDVRVSSCPCFIVSGTTPNQI 311
Query: 308 TAKDMWNEIKKHGSFDHLDNDKTSPSEPGSSIGAAIAASLTIPSGSTRSVTFSLAWDCPE 367
TA DMW+EIKK+ SFD L ++ SPS+PG+SIGAAIAA + +P G R+VTFSL+WDCPE
Sbjct: 312 TAGDMWDEIKKNASFDKLTSNACSPSKPGTSIGAAIAAKVKVPPGCDRTVTFSLSWDCPE 371
Query: 368 VKFFEKVYHRRYTKFYGTLGDSAARIAHDAILEHAKWECEIEAWQRPILEDKRFPEWYPI 427
+F EK YHRRYT+FYG+LG++A +AHDA+L +WE +IE WQ P+L D PEWY +
Sbjct: 372 ARFDEKTYHRRYTRFYGSLGNAAVAMAHDALLNFPEWETQIEEWQAPVLADTTLPEWYRV 431
Query: 428 TLFNELYYLNAGGTIWTDGSPPMQSLATIRERKFSLDTP--------------------- 466
TLFNELYY N+GGT+WTDG PP QSL +I RK SL
Sbjct: 432 TLFNELYYFNSGGTMWTDGLPPKQSLDSIGRRKISLGLSTIDKNDQDQNNVALDILGRID 491
Query: 467 ---SKIHTPASSDTALGTRLLENGEENIGQFLYLEGAEYVMYNTYDVHFYSSFALVMLFP 523
S+IH P +S+ ALG +++N ENIGQFLYLEG +Y+MYNTY VHFYSSFAL+MLFP
Sbjct: 492 AVCSQIHAPLTSNAALGNTMIQNTTENIGQFLYLEGVQYLMYNTYGVHFYSSFALLMLFP 551
Query: 524 KLELSIQRDFAAAVMMHDPGTMKIMSDGKWVARKCLGAVPHDIGLDDPWFEINSYNLFNS 583
K+ELSIQRDFAAAV+MHD ++MS G++V RK LGAVPHDIGL+DPWFE+N+YNLFN+
Sbjct: 552 KIELSIQRDFAAAVLMHDSSKKQVMSSGEFVTRKVLGAVPHDIGLNDPWFELNAYNLFNT 611
Query: 584 SRWKDLNSKFVLQVYRDFVATGDKNFARAVWPSVYIAMAYMEQFDKDGDGMIENEGFPDQ 643
RWKDLNSKFVLQVYRD VATGD NFA+AVWPSVY A+AY++QFDKDGDGMIENEGFPDQ
Sbjct: 612 DRWKDLNSKFVLQVYRDVVATGDLNFAKAVWPSVYTAIAYLDQFDKDGDGMIENEGFPDQ 671
Query: 644 TYDAWSANGVSAYCGGLWVAALQAASALANDVGDHASASYFWVRYQKAKAVYDSLWNGSY 703
TYDAWS +GVSAYCGGLWVAALQA SALA ++GD+ +A YF +Y+KA++VY+ LWNGSY
Sbjct: 672 TYDAWSCSGVSAYCGGLWVAALQAGSALAREIGDNGAAVYFNAKYEKARSVYEKLWNGSY 731
Query: 704 FNYDNSDGSSSTSIQADQLAGQWYARACGLLPIADEAKVKKALTKIYDFNVLKVKGGMCG 763
FNYDNS SS+SI ADQ+AGQWYARACGL PIA E +KKAL +YDFNV++V+ G G
Sbjct: 732 FNYDNSRSGSSSSILADQMAGQWYARACGLKPIAKEEWIKKALETVYDFNVMRVRDGTRG 791
Query: 764 AMNGMQPDGRIDMSGLQAREIWPGVTYGLAASMIQEEMVDMAFQTAAGVYEVAWSEDGLG 823
A+NGM PDGR+D S + +RE+W G TY +AA MIQE + D F+TA+G+YE AWS+ GLG
Sbjct: 792 AVNGMLPDGRVDTSTMVSREVWAGTTYSVAACMIQEGLADKGFRTASGIYEAAWSDRGLG 851
Query: 824 YSFQTPESWNNNDEYRSLCYMRPLTIWAMQWALTKPKLSRQE---IKHEISDRDSSYLEQ 880
+FQTPE+W NDEYRSLCYMRPL IW +QWA T PK +R++ ++ + D S +Q
Sbjct: 852 CAFQTPEAWTTNDEYRSLCYMRPLAIWGIQWAHTMPKPNREQEVSLRPQEEDATSVLFQQ 911
Query: 881 HAAFSKVASLLKLPKEEASKGFLKVVYD 908
HA F KVA LK K + + L+ Y+
Sbjct: 912 HAGFIKVAHYLKNTKGKDHRNRLQTAYE 939
>gi|413920603|gb|AFW60535.1| hypothetical protein ZEAMMB73_659971 [Zea mays]
Length = 951
Score = 1190 bits (3079), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 594/932 (63%), Positives = 714/932 (76%), Gaps = 46/932 (4%)
Query: 17 KVDPGQPPQLTWQRKLDSNVKVPTGFTLSFQEFRHLAHIGFRLYRYSKQEEAKGKVPVFN 76
KV PGQ P LTW+ KL F L+++E LA IG RL R+ +E +KG++ V +
Sbjct: 28 KVYPGQLPVLTWEHKLTHVRHDLPPFRLTWREIMQLAGIGLRLSRHILEETSKGRIAVID 87
Query: 77 VFRKHHITSDQGIPLGGIGAGSIGRSYRGEFQRFKLFHGICDDAPVLANQFSVFVSRPNG 136
+K S QG+PLGG SIGRSY+G+FQR++LF G C+D VLANQFS F+SR +G
Sbjct: 88 PMKKRAARSGQGVPLGG----SIGRSYKGDFQRWQLFPGSCEDNAVLANQFSAFISRQDG 143
Query: 137 EKFSSVLCPRSPGVPKKNTDSGIESWDWNLKGENCTYHALFPRAWTVYDGEPDPELRIVC 196
+K+S+VL P P +PK + SGI SWDWN+ G+ TYHAL+PRAWTVYDGEPDPEL IVC
Sbjct: 144 KKYSTVLHPGKPDLPKGSDISGIGSWDWNMSGQQSTYHALYPRAWTVYDGEPDPELNIVC 203
Query: 197 RQISPFIPHNYKESSFPASVFTFTLSNSGQTSADVTLLFTWANSVAGDSGLSGHHFNSKT 256
RQISP IPHNY++SS+P +VFTFT++NSG T+ADVTLLFTWANSV G S L+G+H NS
Sbjct: 204 RQISPIIPHNYQQSSYPVAVFTFTVTNSGHTAADVTLLFTWANSVGGKSELTGYHSNSSM 263
Query: 257 MTKDGVHGLTLHHRTANGRPPVTFAVAAEETADVHVSECPCFLLSGNSKG--ITAKDMWN 314
+ KDGVHG+ LHHRTA+G+P VTF +AA+E D+ +SECP F++SG+S TAKDMWN
Sbjct: 264 IEKDGVHGILLHHRTADGQPHVTFVIAAQEKEDILISECPYFVISGSSASDEFTAKDMWN 323
Query: 315 EIKKHGSFDHLDNDKTSP-SEPGSSIGAAIAASLTIPSGSTRSVTFSLAWDCPEVKFFE- 372
+K+HGSFDHLD KTS S PGSSIGAAIAAS+ + +T+ ++FSLAW CPEVKF
Sbjct: 324 SVKEHGSFDHLDPIKTSMCSRPGSSIGAAIAASVKLAPKATQDISFSLAWACPEVKFSSG 383
Query: 373 KVYHRRYTKFYGTLGDSAARIAHDAILEHAKWECEIEAWQRPILEDKRFPEWYPITLFNE 432
K YHRRYTKFYGT GD+AA +AHDAILEHA WE +IE WQ PIL+D+RFP WYP+TLFNE
Sbjct: 384 KTYHRRYTKFYGTDGDAAAALAHDAILEHASWERQIEEWQDPILQDERFPAWYPVTLFNE 443
Query: 433 LYYLNAGGTIWTDGSPPMQSLATIRERKFSLDTPS------------------------- 467
LYYLNAGGTIWTDG PP+QSL + +KFSLD +
Sbjct: 444 LYYLNAGGTIWTDGLPPIQSLTAVGWKKFSLDMLNGESDDFNEMVRQNNTASDILHRMAS 503
Query: 468 ---KIHTPASSDTALGTRLLENGEENIGQFLYLEGAEYVMYNTYDVHFYSSFALVMLFPK 524
++H +S++A+GT LL+ GEENIGQFLYLEG EY M+NTYDVHFY+SF+L+MLFPK
Sbjct: 504 VFERMHASIASNSAIGTTLLQ-GEENIGQFLYLEGIEYYMWNTYDVHFYASFSLIMLFPK 562
Query: 525 LELSIQRDFAAAVMMHDPGTMKIMSDGKWVARKCLGAVPHDIGLDDPWFEINSYNLFNSS 584
L+LS+QRDFAAAVMMHDP ++I+ DGKW ARK LGAVPHD+GL DPWF++N+Y L+N+
Sbjct: 563 LQLSVQRDFAAAVMMHDPEKLRILHDGKWAARKVLGAVPHDLGLYDPWFKVNAYTLYNTD 622
Query: 585 RWKDLNSKFVLQVYRDFVATGDKNFARAVWPSVYIAMAYMEQFDKDGDGMIENEGFPDQT 644
RWKDLN KFVLQVYRD VATGDK+FARAVWPSVYIAMAYMEQFDKD DGMIENE FPDQT
Sbjct: 623 RWKDLNPKFVLQVYRDVVATGDKSFARAVWPSVYIAMAYMEQFDKDKDGMIENEDFPDQT 682
Query: 645 YDAWSANGVSAYCGGLWVAALQAASALANDVGDHASASYFWVRYQKAKAVYDSLWNGSYF 704
YD WS G+SAYCGGLWVAALQAASALA +VGD AS FW +Y+KAK VY LWNGSYF
Sbjct: 683 YDVWSMAGISAYCGGLWVAALQAASALAREVGDKASEKLFWDKYEKAKFVYSKLWNGSYF 742
Query: 705 NYDNSDGSSSTSIQADQLAGQWYARACGLLPIADEAKVKKALTKIYDFNVLKVKGGMCGA 764
+YD+ D STSIQADQLAGQWYA+ACGL PI D+ K + AL K+Y FNV+K KGG GA
Sbjct: 743 SYDDGDNKMSTSIQADQLAGQWYAKACGLFPIVDKDKAQSALEKVYSFNVMKFKGGTRGA 802
Query: 765 MNGMQPDGRIDMSGLQAREIWPGVTYGLAASMIQEEMVDMAFQTAAGVYEVAWSEDGLGY 824
MNGM PDG +DMS +Q+REIWPGVTY LAASMIQE MV+ F+TA GVY AWS GLGY
Sbjct: 803 MNGMWPDGTVDMSAMQSREIWPGVTYALAASMIQEGMVEQGFKTAEGVYHAAWSPGGLGY 862
Query: 825 SFQTPESWNNNDEYRSLCYMRPLTIWAMQWALTKPKLSRQEIKHEISDRD---SSYLEQH 881
SFQTPE+WNN+DEYRSLCYMRPL IWAMQWAL+KPKL H D S+ +
Sbjct: 863 SFQTPEAWNNDDEYRSLCYMRPLAIWAMQWALSKPKL------HNTPQADIPQDSFPKNQ 916
Query: 882 AAFSKVASLLKLPKEEASKGFLKVVYDFTIGR 913
A++++ LL+LP++E+SK F++VVY+ R
Sbjct: 917 FAYARITKLLQLPEDESSKSFVRVVYEIIRNR 948
>gi|20453181|gb|AAM19831.1| AT4g10060/T5L19_190 [Arabidopsis thaliana]
gi|24111431|gb|AAN46866.1| At4g10060/T5L19_190 [Arabidopsis thaliana]
Length = 922
Score = 1146 bits (2965), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 567/930 (60%), Positives = 697/930 (74%), Gaps = 32/930 (3%)
Query: 1 MENGVKEEGENAASMPKVDPGQPPQLTWQRKLDSNVKVPTGFTLSFQEFRHLAHIGFRLY 60
ME E E M V+ + PQ+TWQRKL+S VK P+ F +S ++ HL +G+RL+
Sbjct: 1 MEKNGHTESELQTQM--VENERLPQVTWQRKLNSKVKNPSEFKMSIRDVLHLFPLGYRLW 58
Query: 61 RYSKQEEAKGKVPVFNVFRKHHITSDQGIPLGGIGAGSIGRSYRGEFQRFKLFHGICDDA 120
RY+K+E KG+ ++++F+K H+ D G+PLGGIG GSIGRSY+GEFQ+FKLF IC++A
Sbjct: 59 RYTKEEAKKGRFSMYDIFKKRHVRGDHGVPLGGIGGGSIGRSYKGEFQQFKLFPKICEEA 118
Query: 121 PVLANQFSVFVSRPNGEKFSSVLCPRSP-GVPKKNTDSGIESWDWNLKGENCTYHALFPR 179
P+L NQFSVFVSRP G +S+VLCP P V K D GIESWDWN++G+ TYHAL+PR
Sbjct: 119 PILTNQFSVFVSRPGGLSYSTVLCPTKPKSVKGKTEDLGIESWDWNMEGDKSTYHALYPR 178
Query: 180 AWTVYDGEPDPELRIVCRQISPFIPHNYKESSFPASVFTFTLSNSGQTSADVTLLFTWAN 239
+WTVY+ EPDPELRIV RQ+SPFIPHNYKESS P SVF FT+SN G+ A VTLLFTW N
Sbjct: 179 SWTVYN-EPDPELRIVSRQVSPFIPHNYKESSLPVSVFEFTMSNLGKEEATVTLLFTWEN 237
Query: 240 SVAGDSGLSGHHFNSKTMTKDGVHGLTLHHRTANGRPPVTFAVAAEETADVHVSECPCFL 299
SV G SGL+G HFNS M +DGVH + LHH+T NG PPVT+A+AA+ET DVHVSECPCFL
Sbjct: 238 SVGGASGLTGEHFNSTIMERDGVHAIVLHHKTGNGHPPVTYAIAAQETEDVHVSECPCFL 297
Query: 300 LSGNS-KGITAKDMWNEIKKHGSFDHLDNDKTSPSEPGSSIGAAIAASLTIPSGSTRSVT 358
+SG+S K ITAK+MW+EIKK+ SFD L+++ SPS PG+SIGAAIAA + +P G R+VT
Sbjct: 298 VSGHSPKEITAKEMWDEIKKNKSFDELNSEPGSPSRPGTSIGAAIAAKVKVPPGCDRTVT 357
Query: 359 FSLAWDCPEVKFFEKVYHRRYTKFYGTLGDSAARIAHDAILEHAKWECEIEAWQRPILED 418
FSL+WDCPEV+F EK YHRRYTKFYG LGD+A ++A DA+L + WE +IEAWQ PIL D
Sbjct: 358 FSLSWDCPEVRFNEKTYHRRYTKFYGNLGDAAVKMARDALLNYVDWESQIEAWQSPILSD 417
Query: 419 KRFPEWYPITLFNELYYLNAGGTIWTDGSPPMQSL-----ATIRERKFSLDTPSKI---- 469
P+WY +TLFNELYY N+GGTIWTDG PP +S+ + +D KI
Sbjct: 418 TTLPDWYRVTLFNELYYFNSGGTIWTDGLPPKESIERSKVTNTEQNDIVIDLFQKINAVC 477
Query: 470 ---HTPASSDTALGTRLLENGEENIGQFLYLEGAEYVMYNTYDVHFYSSFALVMLFPKLE 526
++P SS N EENIGQF+YLEG EY+MYNTYDVHFYSSFAL+ LFPKL
Sbjct: 478 EQIYSPQSS----------NSEENIGQFIYLEGIEYLMYNTYDVHFYSSFALLSLFPKLA 527
Query: 527 LSIQRDFAAAVMMHDPGTMKIMSDGKWVARKCLGAVPHDIGLDDPWFEINSYNLFNSSRW 586
LSIQRDFAA V++ DP KIMS G+WV RK LG+VPHDIGL+DPW E+N YN FN+ RW
Sbjct: 528 LSIQRDFAATVLIQDPTKKKIMSSGEWVTRKLLGSVPHDIGLNDPWLELNEYNFFNTDRW 587
Query: 587 KDLNSKFVLQVYRDFVATGDKNFARAVWPSVYIAMAYMEQFDKDGDGMIENEGFPDQTYD 646
KDLN+KFVLQVYRD VAT D++FA+AVWPSVY A+AY++QFDKD DGMIENEGFPDQTYD
Sbjct: 588 KDLNAKFVLQVYRDVVATNDQSFAKAVWPSVYTAVAYLDQFDKDEDGMIENEGFPDQTYD 647
Query: 647 AWSANGVSAYCGGLWVAALQAASALANDVGDHASASYFWVRYQKAKAVYDSLWNGSYFNY 706
AWS GVSAYCGGLWVAALQAASA A+ VG++A A YF +Y+KAK VY+ LWNGSYFNY
Sbjct: 648 AWSVTGVSAYCGGLWVAALQAASAFASIVGENAVAIYFNAKYEKAKIVYEKLWNGSYFNY 707
Query: 707 DNSDGSSSTSIQADQLAGQWYARACGLLPIADEAKVKKALTKIYDFNVLKVKGGMCGAMN 766
D+S SS+SI ADQLAGQWYARACGL PI E +KKAL IY+FNV+KVKGG GA+N
Sbjct: 708 DDSGSGSSSSILADQLAGQWYARACGLKPITKEEWIKKALETIYEFNVMKVKGGTRGAVN 767
Query: 767 GMQPDGRIDMSGLQAREIWPGVTYGLAASMIQEEMVDMAFQTAAGVYEVAWSEDGLGYSF 826
GM +G++D + L ++E+W G TY +AA MIQE + FQTA+G+YE WS+ GL SF
Sbjct: 768 GMSTEGKVDTNSLVSKEVWAGTTYSVAACMIQEGQREKGFQTASGIYEAIWSDRGLSCSF 827
Query: 827 QTPESWNNNDEYRSLCYMRPLTIWAMQWALTKPKLSRQEIKHEIS----DRDSSYLEQHA 882
QTPE+WN NDEYRSLCYMRPL IWA+QWALT+ + +E + ++ + + L QH
Sbjct: 828 QTPEAWNMNDEYRSLCYMRPLAIWAIQWALTRTQSFGEEKQKLVAGDEEEESNLLLRQHK 887
Query: 883 AFSKVASLLKL-PKEEASKGFLKVVYDFTI 911
F VA +K+ P + L+ Y+ +
Sbjct: 888 GFKDVARFVKIVPTSNVHRSRLQHTYETVL 917
>gi|240255774|ref|NP_192744.6| Beta-glucosidase, GBA2 type family protein [Arabidopsis thaliana]
gi|332657434|gb|AEE82834.1| Beta-glucosidase, GBA2 type family protein [Arabidopsis thaliana]
Length = 922
Score = 1144 bits (2960), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 567/930 (60%), Positives = 696/930 (74%), Gaps = 32/930 (3%)
Query: 1 MENGVKEEGENAASMPKVDPGQPPQLTWQRKLDSNVKVPTGFTLSFQEFRHLAHIGFRLY 60
ME E E M V + PQ+TWQRKL+S VK P+ F +S ++ HL +G+RL+
Sbjct: 1 MEKNGHTESELQTQM--VGNERLPQVTWQRKLNSKVKNPSEFKMSIRDVLHLFPLGYRLW 58
Query: 61 RYSKQEEAKGKVPVFNVFRKHHITSDQGIPLGGIGAGSIGRSYRGEFQRFKLFHGICDDA 120
RY+K+E KG+ ++++F+K H+ D G+PLGGIG GSIGRSY+GEFQ+FKLF IC++A
Sbjct: 59 RYTKEEAKKGRFSMYDIFKKRHVRGDHGVPLGGIGGGSIGRSYKGEFQQFKLFPKICEEA 118
Query: 121 PVLANQFSVFVSRPNGEKFSSVLCPRSP-GVPKKNTDSGIESWDWNLKGENCTYHALFPR 179
P+L NQFSVFVSRP G +S+VLCP P V K D GIESWDWN++G+ TYHAL+PR
Sbjct: 119 PILTNQFSVFVSRPGGLSYSTVLCPTKPKSVKGKTEDLGIESWDWNMEGDKSTYHALYPR 178
Query: 180 AWTVYDGEPDPELRIVCRQISPFIPHNYKESSFPASVFTFTLSNSGQTSADVTLLFTWAN 239
+WTVY+ EPDPELRIV RQ+SPFIPHNYKESS P SVF FT+SN G+ A VTLLFTW N
Sbjct: 179 SWTVYN-EPDPELRIVSRQVSPFIPHNYKESSLPVSVFEFTMSNLGKEEATVTLLFTWEN 237
Query: 240 SVAGDSGLSGHHFNSKTMTKDGVHGLTLHHRTANGRPPVTFAVAAEETADVHVSECPCFL 299
SV G SGL+G HFNS M +DGVH + LHH+T NG PPVT+A+AA+ET DVHVSECPCFL
Sbjct: 238 SVGGASGLTGEHFNSTIMERDGVHAIVLHHKTGNGHPPVTYAIAAQETEDVHVSECPCFL 297
Query: 300 LSGNS-KGITAKDMWNEIKKHGSFDHLDNDKTSPSEPGSSIGAAIAASLTIPSGSTRSVT 358
+SG+S K ITAK+MW+EIKK+ SFD L+++ SPS PG+SIGAAIAA + +P G R+VT
Sbjct: 298 VSGHSPKEITAKEMWDEIKKNKSFDELNSEPGSPSRPGTSIGAAIAAKVKVPPGCDRTVT 357
Query: 359 FSLAWDCPEVKFFEKVYHRRYTKFYGTLGDSAARIAHDAILEHAKWECEIEAWQRPILED 418
FSL+WDCPEV+F EK YHRRYTKFYG LGD+A ++A DA+L + WE +IEAWQ PIL D
Sbjct: 358 FSLSWDCPEVRFNEKTYHRRYTKFYGNLGDAAVKMARDALLNYVDWESQIEAWQSPILSD 417
Query: 419 KRFPEWYPITLFNELYYLNAGGTIWTDGSPPMQSL-----ATIRERKFSLDTPSKI---- 469
P+WY +TLFNELYY N+GGTIWTDG PP +S+ + +D KI
Sbjct: 418 TTLPDWYRVTLFNELYYFNSGGTIWTDGLPPKESIERSKVTNTEQNDIVIDLFQKINAVC 477
Query: 470 ---HTPASSDTALGTRLLENGEENIGQFLYLEGAEYVMYNTYDVHFYSSFALVMLFPKLE 526
++P SS N EENIGQF+YLEG EY+MYNTYDVHFYSSFAL+ LFPKL
Sbjct: 478 EQIYSPQSS----------NSEENIGQFIYLEGIEYLMYNTYDVHFYSSFALLSLFPKLA 527
Query: 527 LSIQRDFAAAVMMHDPGTMKIMSDGKWVARKCLGAVPHDIGLDDPWFEINSYNLFNSSRW 586
LSIQRDFAA V++ DP KIMS G+WV RK LG+VPHDIGL+DPW E+N YN FN+ RW
Sbjct: 528 LSIQRDFAATVLIQDPTKKKIMSSGEWVTRKLLGSVPHDIGLNDPWLELNEYNFFNTDRW 587
Query: 587 KDLNSKFVLQVYRDFVATGDKNFARAVWPSVYIAMAYMEQFDKDGDGMIENEGFPDQTYD 646
KDLN+KFVLQVYRD VAT D++FA+AVWPSVY A+AY++QFDKD DGMIENEGFPDQTYD
Sbjct: 588 KDLNAKFVLQVYRDVVATNDQSFAKAVWPSVYTAVAYLDQFDKDEDGMIENEGFPDQTYD 647
Query: 647 AWSANGVSAYCGGLWVAALQAASALANDVGDHASASYFWVRYQKAKAVYDSLWNGSYFNY 706
AWS GVSAYCGGLWVAALQAASA A+ VG++A A YF +Y+KAK VY+ LWNGSYFNY
Sbjct: 648 AWSVTGVSAYCGGLWVAALQAASAFASIVGENAVAIYFNAKYEKAKIVYEKLWNGSYFNY 707
Query: 707 DNSDGSSSTSIQADQLAGQWYARACGLLPIADEAKVKKALTKIYDFNVLKVKGGMCGAMN 766
D+S SS+SI ADQLAGQWYARACGL PI E +KKAL IY+FNV+KVKGG GA+N
Sbjct: 708 DDSGSGSSSSILADQLAGQWYARACGLKPITKEEWIKKALETIYEFNVMKVKGGTRGAVN 767
Query: 767 GMQPDGRIDMSGLQAREIWPGVTYGLAASMIQEEMVDMAFQTAAGVYEVAWSEDGLGYSF 826
GM +G++D + L ++E+W G TY +AA MIQE + FQTA+G+YE WS+ GL SF
Sbjct: 768 GMSTEGKVDTNSLVSKEVWAGTTYSVAACMIQEGQREKGFQTASGIYEAIWSDRGLSCSF 827
Query: 827 QTPESWNNNDEYRSLCYMRPLTIWAMQWALTKPKLSRQEIKHEIS----DRDSSYLEQHA 882
QTPE+WN NDEYRSLCYMRPL IWA+QWALT+ + +E + ++ + + L QH
Sbjct: 828 QTPEAWNMNDEYRSLCYMRPLAIWAIQWALTRTQSFGEEKQKLVAGDEEEESNLLLRQHK 887
Query: 883 AFSKVASLLKL-PKEEASKGFLKVVYDFTI 911
F VA +K+ P + L+ Y+ +
Sbjct: 888 GFKDVARFVKIVPTSNVHRSRLQHTYETVL 917
>gi|297809209|ref|XP_002872488.1| AT4g10060/T5L19_190 [Arabidopsis lyrata subsp. lyrata]
gi|297318325|gb|EFH48747.1| AT4g10060/T5L19_190 [Arabidopsis lyrata subsp. lyrata]
Length = 921
Score = 1127 bits (2915), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 557/908 (61%), Positives = 687/908 (75%), Gaps = 39/908 (4%)
Query: 24 PQLTWQRKLDSNVKVPTGFTLSFQEFRHLAHIGFRLYRYSKQEEAKGKVPVFNVFRKHHI 83
PQ+TWQRKL+S VK P+ F +S ++ HL IG+RL+ ++K+E KG+ ++++F+K H+
Sbjct: 22 PQVTWQRKLNSKVKNPSEFKMSIRDVLHLFPIGYRLWSHTKEEAKKGRASMYDIFKKRHV 81
Query: 84 TSDQGIPLGGIGAGSIGRSYRGEFQRFKLFHGICDDAPVLANQFSVFVSRPNGEKFSSVL 143
T D G+PLGGIG GSIGRSY+GEFQ+FKLF +C++AP+L NQFSVFV R G +SSVL
Sbjct: 82 TGDHGVPLGGIGGGSIGRSYKGEFQQFKLFPKVCEEAPILTNQFSVFVMRSGGVNYSSVL 141
Query: 144 CPRSP-GVPKKNTDSGIESWDWNLKGENCTYHALFPRAWTVYDGEPDPELRIVCRQISPF 202
CP P V K + GIESWDWN+KG YHAL+PR+WT+Y+ EPDPELRIV RQISPF
Sbjct: 142 CPTKPKSVRGKTEELGIESWDWNMKGYKSMYHALYPRSWTIYN-EPDPELRIVSRQISPF 200
Query: 203 IPHNYKESSFPASVFTFTLSNSGQTSADVTLLFTWANSVAGDSGLSGHHFNSKTMTKDGV 262
IPHNYKESS P SVFTFT++N G+ A VTLLFTW NSV G SGL+G HFNS M +DGV
Sbjct: 201 IPHNYKESSLPVSVFTFTVTNLGKEEATVTLLFTWENSVGGASGLTGEHFNSTMMERDGV 260
Query: 263 HGLTLHHRTANGRPPVTFAVAAEETADVHVSECPCFLLSGNS-KGITAKDMWNEIKKHGS 321
H + LHH+T+NG PPVT+A+AA+ET DVHVSEC CFL+SG+S K ITAK+MW+EIKK+ S
Sbjct: 261 HAIALHHKTSNGHPPVTYAIAAQETEDVHVSECSCFLVSGHSPKDITAKEMWDEIKKNKS 320
Query: 322 FDHLDNDKTSPSEPGSSIGAAIAASLTIPSGSTRSVTFSLAWDCPEVKFFEKVYHRRYTK 381
FD L+++ SPS PG+SIGAAIAA++ +P G R+V FSL+WDCPEV+F EK YHRRYTK
Sbjct: 321 FDGLNSEPGSPSRPGTSIGAAIAANVKVPPGCDRTVMFSLSWDCPEVRFNEKTYHRRYTK 380
Query: 382 FYGTLGDSAARIAHDAILEHAKWECEIEAWQRPILEDKRFPEWYPITLFNELYYLNAGGT 441
FYG LGD+A ++A DA+L + WE +IEAWQ PIL D P+WY +TLFNELYY N+GGT
Sbjct: 381 FYGNLGDAAVKMARDALLNYVDWESQIEAWQSPILSDTALPDWYRVTLFNELYYFNSGGT 440
Query: 442 IWTDGSPPMQSLATIRERKFSLDTP----------------SKIHTPASSDTALGTRLLE 485
IWTDG PP + + ER ++T ++IH+P S
Sbjct: 441 IWTDGLPPKERI----ERSKIINTEQNNIVTELFKKINVICAQIHSPQSL---------- 486
Query: 486 NGEENIGQFLYLEGAEYVMYNTYDVHFYSSFALVMLFPKLELSIQRDFAAAVMMHDPGTM 545
N EE+IGQF+YLEG EY+MYNTYDVHFYSSFAL+ LFPKLELSIQRDFAA V+ DP
Sbjct: 487 NAEEHIGQFIYLEGIEYLMYNTYDVHFYSSFALLSLFPKLELSIQRDFAATVLFQDPTKK 546
Query: 546 KIMSDGKWVARKCLGAVPHDIGLDDPWFEINSYNLFNSSRWKDLNSKFVLQVYRDFVATG 605
+IMS G+WV RK LG+VPHDIGL+DPW E+N YN FN+ RWKDLN+KFVLQVYRD V+TG
Sbjct: 547 QIMSSGEWVPRKLLGSVPHDIGLNDPWLELNEYNFFNTDRWKDLNAKFVLQVYRDVVSTG 606
Query: 606 DKNFARAVWPSVYIAMAYMEQFDKDGDGMIENEGFPDQTYDAWSANGVSAYCGGLWVAAL 665
D++FA+AVWPSVY A+AY++QFDKD D MIENEGFPDQTYDAWS GVSAYCGGLWVAAL
Sbjct: 607 DQSFAKAVWPSVYTAVAYLDQFDKDEDEMIENEGFPDQTYDAWSVKGVSAYCGGLWVAAL 666
Query: 666 QAASALANDVGDHASASYFWVRYQKAKAVYDSLWNGSYFNYDNSDGSSSTSIQADQLAGQ 725
QAASA A+ VG++ A YF +Y+KAK+VY+ LWNGSYFNYD+S SS+SI ADQLAGQ
Sbjct: 667 QAASAFASIVGENGVAIYFNTKYEKAKSVYEKLWNGSYFNYDDSGSGSSSSILADQLAGQ 726
Query: 726 WYARACGLLPIADEAKVKKALTKIYDFNVLKVKGGMCGAMNGMQPDGRIDMSGLQAREIW 785
WYARACGL PI E +KKAL IY+FNV+KVKGG GA+NGM PDG++D + L ++E+W
Sbjct: 727 WYARACGLKPITKEEWIKKALETIYEFNVMKVKGGTRGAVNGMSPDGQVDTNSLVSKEVW 786
Query: 786 PGVTYGLAASMIQEEMVDMAFQTAAGVYEVAWSEDGLGYSFQTPESWNNNDEYRSLCYMR 845
G TY +AA MIQE + FQTA+G+YE WS+ GL SFQTPE+WN NDEYRSLCYMR
Sbjct: 787 AGTTYSVAACMIQEGQRERGFQTASGIYEAVWSDRGLSCSFQTPEAWNMNDEYRSLCYMR 846
Query: 846 PLTIWAMQWALTKPKLSRQE----IKHEISDRDSSYLEQHAAFSKVASLLKL-PKEEASK 900
PL IW++QWALT+ + +E +K E + D L QH F VA +K+ P +
Sbjct: 847 PLAIWSIQWALTRTQSFGEEKEKLVKGEEEESD-LLLRQHKGFKDVARFVKIVPTRNEHR 905
Query: 901 GFLKVVYD 908
L+ Y+
Sbjct: 906 SRLQHTYE 913
>gi|8777421|dbj|BAA97011.1| unnamed protein product [Arabidopsis thaliana]
Length = 928
Score = 1092 bits (2824), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 553/964 (57%), Positives = 683/964 (70%), Gaps = 87/964 (9%)
Query: 1 MENGVKEEGENA----ASMPKVDPGQPPQLTWQRKLDSNVKVPTGFTLSFQEFRHLAHIG 56
E + + GE+ +S KVDP P LTWQRK+DS+VK P F LS +E LA +G
Sbjct: 2 FEEKIMDIGEDVKPLNSSDTKVDPAVPASLTWQRKIDSDVKAPREFNLSVKEIFQLAPVG 61
Query: 57 FRLYRYSKQEEAKGKVPVFNVFRKHHITSDQGIPLGGIGAGSIGRSYRGEFQRFKLFHGI 116
RL+ ++E AKG++ + F KH +TS G+PLGGIGAGSIGRS++GEFQR++LF
Sbjct: 62 IRLWFLCREEAAKGRLAFIDPFSKHSVTSSHGVPLGGIGAGSIGRSFKGEFQRWQLFPPK 121
Query: 117 CDDAPVLANQFSVFVSRPNGEKFSSVLCPRSPGVPKKNTDSGIESWDWNLKGENCTYHAL 176
C+D PVLANQFS FVSR NG+K+SSVLCPR+P + K++++SGI SWDWNLKG+ TYHAL
Sbjct: 122 CEDEPVLANQFSAFVSRANGKKYSSVLCPRNPKLDKQDSESGIGSWDWNLKGDKSTYHAL 181
Query: 177 FPRAWTVYDGEPDPELRIVCRQISPFIPHNYKESSFPASVFTFTLSNSGQTSADVTLLFT 236
+PR+WT+Y+G + A ++FTL N Q+ + F
Sbjct: 182 YPRSWTMYEG------------------------NIYAFEYSFTLVNLIQSLGLFVVKF- 216
Query: 237 WANSVAGDSGLSGHHFNSKTMTKDGVHGLTLHHRTANGRPPVTFAVAAEETADVHVSECP 296
H T K L +TANG P +++A++A+ T V VS CP
Sbjct: 217 -------------HPLYLTTTKKAAFQCLFSLSQTANGLPSLSYAISAQATDGVSVSACP 263
Query: 297 CFLLSGNSKGITAKDMWNEIKKHGSFDHLDNDKTS-PSEPGSSIGAAIAASLTIPSGSTR 355
F++SG GITAKDMW +K++GSFDHL + S S+ GSSIGAA+AAS+T+ G +R
Sbjct: 264 FFIVSGKQDGITAKDMWQAVKENGSFDHLKASEASMQSDHGSSIGAAVAASVTVLPGESR 323
Query: 356 SVTFSLAWDCPEVKFFE-KVYHR---------RYTKFYGTLGDSAARIAHDAILEHAKWE 405
VTFSLAWDCPEV+F K+Y R RYTKFYG GD+AA+IAHDAIL H++WE
Sbjct: 324 IVTFSLAWDCPEVQFPSGKIYSRQPIKQNLFRRYTKFYGNNGDAAAQIAHDAILGHSQWE 383
Query: 406 CEIEAWQRPILEDKRFPEWYPITLFNELYYLNAGGTIWTDGSPPMQSLATIRERKFSLD- 464
IE WQRPILEDKR P WYP+TLFNELYYLN+GGT+WTDGS P+ SLA +RE+KFSLD
Sbjct: 384 SWIEDWQRPILEDKRLPAWYPVTLFNELYYLNSGGTLWTDGSSPVHSLAGVREKKFSLDK 443
Query: 465 ---------------------------TPSKIHTPASSDTALGTRLLENGEENIGQFLYL 497
T ++H +S++A GT+LLE GEENIG FLYL
Sbjct: 444 SQLGLKNDIDVPHQNDTAVSVLEKMASTLEELHASTTSNSAFGTKLLEEGEENIGHFLYL 503
Query: 498 EGAEYVMYNTYDVHFYSSFALVMLFPKLELSIQRDFAAAVMMHDPGTMKIMSDGKWVARK 557
EG EY M+NTYDVHFY+SFALVMLFPKLELSIQRDFAAAVM+HDP +K +S+G+WV RK
Sbjct: 504 EGIEYRMWNTYDVHFYASFALVMLFPKLELSIQRDFAAAVMLHDPTKVKTLSEGQWVQRK 563
Query: 558 CLGAVPHDIGLDDPWFEINSYNLFNSSRWKDLNSKFVLQVYRDFVATGDKNFARAVWPSV 617
LGAVPHD+G++DPWFE+N Y L N+ RWKDLN KFVLQVYRD VATGDK FA AVWPSV
Sbjct: 564 VLGAVPHDLGINDPWFEVNGYTLHNTDRWKDLNPKFVLQVYRDVVATGDKKFASAVWPSV 623
Query: 618 YIAMAYMEQFDKDGDGMIENEGFPDQTYDAWSANGVSAYCGGLWVAALQAASALANDVGD 677
Y+AMAYM QFDKDGDGMIENEGFPDQTYD WSA+GVSAYCGGLWVAALQAASALA VGD
Sbjct: 624 YVAMAYMAQFDKDGDGMIENEGFPDQTYDTWSASGVSAYCGGLWVAALQAASALARVVGD 683
Query: 678 HASASYFWVRYQKAKAVYD-SLWNGSYFNYDNSDGSSSTSIQADQLAGQWYARACGLLPI 736
S YFW ++QKAK VY+ LWNGSYFNYDNS S++IQADQLAGQWYARA GLLPI
Sbjct: 684 KNSQDYFWSKFQKAKVVYEKKLWNGSYFNYDNSGSQYSSTIQADQLAGQWYARASGLLPI 743
Query: 737 ADEAKVKKALTKIYDFNVLKVKGGMCGAMNGMQPDGRIDMSGLQAREIWPGVTYGLAASM 796
DE K + AL K+Y++NV+K+K G GA+NGM P+G++D + +Q+REIW GVTY L+A+M
Sbjct: 744 VDEDKARTALEKVYNYNVMKIKDGKRGAVNGMHPNGKVDTASMQSREIWSGVTYALSATM 803
Query: 797 IQEEMVDMAFQTAAGVYEVAWSEDGLGYSFQTPESWNNNDEYRSLCYMRPLTIWAMQWAL 856
IQE +V+MAFQTA+G+YE AWSE GLGYSFQTPESWN DEYRSL YMRPL IWAMQWAL
Sbjct: 804 IQEGLVEMAFQTASGIYEAAWSETGLGYSFQTPESWNTVDEYRSLTYMRPLAIWAMQWAL 863
Query: 857 TKPKLSRQEI-----KHEISDRDSSYLEQHAAFSKVASLLKLPKEEASKGFLKVVYDFTI 911
TK ++++ + E SS ++ FS+V+ LL LP E ++K L+ ++D+T
Sbjct: 864 TKTSQKQEQLGLEPEQQEPELEPSSSMKHDIGFSRVSRLLSLPNEASAKSTLQTLFDYTC 923
Query: 912 GRVL 915
R++
Sbjct: 924 RRMM 927
>gi|326489639|dbj|BAK01800.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 797
Score = 1063 bits (2748), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 518/788 (65%), Positives = 618/788 (78%), Gaps = 35/788 (4%)
Query: 157 SGIESWDWNLKGENCTYHALFPRAWTVYDGEPDPELRIVCRQISPFIPHNYKESSFPASV 216
SGI SWDWNL G TYHAL+PRAWTVYDGEPDP+L+IVCRQISP IPHNY++SS+PA+V
Sbjct: 11 SGIGSWDWNLNGGKSTYHALYPRAWTVYDGEPDPDLKIVCRQISPIIPHNYQQSSYPAAV 70
Query: 217 FTFTLSNSGQTSADVTLLFTWANSVAGDSGLSGHHFNSKTMTKDGVHGLTLHHRTANGRP 276
FTFT++NSG T+ADVTLLFTWANSV G S L+G+H NS KDGVHG+ LHHRTA+GRP
Sbjct: 71 FTFTVTNSGNTAADVTLLFTWANSVGGKSELTGNHSNSSMTEKDGVHGILLHHRTADGRP 130
Query: 277 PVTFAVAAEETADVHVSECPCFLLSGNSKGITAKDMWNEIKKHGSFDHLDNDKTS-PSEP 335
PV FA+AA+E D+H+SECP F++SG+S TAKDMWN +K+HGSFD LD K S S P
Sbjct: 131 PVAFAIAAQEKEDIHISECPYFVMSGSSDEFTAKDMWNSVKEHGSFDLLDPVKESTSSRP 190
Query: 336 GSSIGAAIAASLTIPSGSTRSVTFSLAWDCPEVKFFE-KVYHRRYTKFYGTLGDSAARIA 394
G+SIGAAIAAS+ + +T++V+FSLAW PEVKF K YHRRYTKFYGT D+AA +A
Sbjct: 191 GTSIGAAIAASIKLAPQATQNVSFSLAWASPEVKFCSGKTYHRRYTKFYGTDVDAAASLA 250
Query: 395 HDAILEHAKWECEIEAWQRPILEDKRFPEWYPITLFNELYYLNAGGTIWTDGSPPMQSLA 454
HDAIL+H+ WE +IE WQ PIL+DKRFP WYP+TLFNELYYLNAGGTIWTDG PP+QSL
Sbjct: 251 HDAILDHSSWETQIENWQHPILQDKRFPAWYPVTLFNELYYLNAGGTIWTDGLPPIQSLT 310
Query: 455 TIRERKFSLDTPS----------------------------KIHTPASSDTALGTRLLEN 486
I +KFSLD + +IH +S +A+GT LL+
Sbjct: 311 AIGGKKFSLDMSNEETDDDSQMNPQTNTATDILHQMASVLERIHASLASSSAIGTTLLQ- 369
Query: 487 GEENIGQFLYLEGAEYVMYNTYDVHFYSSFALVMLFPKLELSIQRDFAAAVMMHDPGTMK 546
GEENIGQFLYLEG EY M+NTYDVHFYSSF+L+MLFPKL+LS+QRDFAAAVM+HDP +K
Sbjct: 370 GEENIGQFLYLEGIEYYMWNTYDVHFYSSFSLIMLFPKLQLSVQRDFAAAVMIHDPEKLK 429
Query: 547 IMSDGKWVARKCLGAVPHDIGLDDPWFEINSYNLFNSSRWKDLNSKFVLQVYRDFVATGD 606
++ DGK ARK LGAVPHD+GL DPW ++N+Y L N+ RWKDLN KFVLQVYRD VATGD
Sbjct: 430 LLHDGKLAARKVLGAVPHDLGLYDPWIKVNAYTLHNTDRWKDLNPKFVLQVYRDVVATGD 489
Query: 607 KNFARAVWPSVYIAMAYMEQFDKDGDGMIENEGFPDQTYDAWSANGVSAYCGGLWVAALQ 666
K+FARAVWPSVY+AMAYMEQFDKD DGMIENE FPDQTYD WS GVSAYCGGLWVAALQ
Sbjct: 490 KSFARAVWPSVYMAMAYMEQFDKDKDGMIENEDFPDQTYDVWSMAGVSAYCGGLWVAALQ 549
Query: 667 AASALANDVGDHASASYFWVRYQKAKAVYD-SLWNGSYFNYDNSDGSSSTSIQADQLAGQ 725
AASALA++VGD AS FW +Y+KAK+VYD LWNGSYFNYD++ +STSI ADQLAGQ
Sbjct: 550 AASALAHEVGDKASEKLFWNKYEKAKSVYDKKLWNGSYFNYDDAGTKASTSIHADQLAGQ 609
Query: 726 WYARACGLLPIADEAKVKKALTKIYDFNVLKVKGGMCGAMNGMQPDGRIDMSGLQAREIW 785
WYA+ACGL I D+ K + AL KIY FNV+K K G GA+NGM PDG +DMS +Q+REIW
Sbjct: 610 WYAKACGLSSIVDKDKSQSALEKIYAFNVMKFKDGNRGAINGMWPDGTLDMSTMQSREIW 669
Query: 786 PGVTYGLAASMIQEEMVDMAFQTAAGVYEVAWSEDGLGYSFQTPESWNNNDEYRSLCYMR 845
PGVTY LAASMIQE MVD F+TA G+Y AWS +GLGY+FQTPESWNN+DEYRSLCYMR
Sbjct: 670 PGVTYALAASMIQEGMVDEGFKTAEGIYNAAWSTEGLGYAFQTPESWNNDDEYRSLCYMR 729
Query: 846 PLTIWAMQWALTKPKLSRQEIKHEISDRDSSYLEQHAAFSKVASLLKLPKEEASKGFLKV 905
PL IW++QWAL+ PKL + D S+ + +++++A LL+LP++E+SK +V
Sbjct: 730 PLAIWSIQWALSNPKLHKAPQTDITQD---SFPKNQFSYARIAKLLQLPEDESSKSLPRV 786
Query: 906 VYDFTIGR 913
+Y+ R
Sbjct: 787 IYEIVRNR 794
>gi|62733283|gb|AAX95400.1| At5g49900 [Oryza sativa Japonica Group]
Length = 931
Score = 1037 bits (2681), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 535/921 (58%), Positives = 661/921 (71%), Gaps = 84/921 (9%)
Query: 65 QEEAKGKVPVFNVFRKHHITSDQGIPLGGIGAGSIGRSYRGEFQRFKLFHGICDDAPVLA 124
Q++ + V + +K S QG+PLGGIG+GSIGRSY+GEFQR++LF G C++ PVLA
Sbjct: 20 QKDVFIQAAVIDPMKKRIAKSGQGVPLGGIGSGSIGRSYKGEFQRWQLFPGTCEERPVLA 79
Query: 125 NQFS------------------VFVSRPNGEKFSSVLCPRSPGVPKKNTDSGIESWDWNL 166
NQFS F+SR +G +SSVL P P +PK + SGI SWDWN+
Sbjct: 80 NQFSDRYLATVNVLTVSRDYTQAFISRKDGRNYSSVLHPGKPDLPKGSNISGIGSWDWNM 139
Query: 167 KGENCTYHALFPRAWTVYDGEPDPELRIVCRQISPFIPHN-----YKESSFPASVFTFTL 221
G+N TYHAL+PR+WT+Y+G V Q + + N E ++ T
Sbjct: 140 SGQNSTYHALYPRSWTIYNGIAVTNNNRVYFQATTILFSNVGFCLLGEPDPDVNIVT--- 196
Query: 222 SNSGQTSADVTLLFTWANSVAGDSGLSGHHFNSKTMTKDGVHGLTLHHR----------- 270
NSG T+ADVTLLFTWA V + + KDGVHG+ LHHR
Sbjct: 197 -NSGNTAADVTLLFTWA-FVINEHLFYTFNILMNDREKDGVHGILLHHRLSPKQISICLN 254
Query: 271 ------TANGRPPVTFAVAAEETADVHVSECPCFLLSGNSKGITAKDMWNEIKKHGSFDH 324
TANG+PPVTFA+AA+E D+H+SECP F++SG+S +AKDMWN +K++GSFD+
Sbjct: 255 DMYESRTANGQPPVTFAIAAQEKEDIHISECPYFIISGSSDAFSAKDMWNYVKENGSFDN 314
Query: 325 LDNDKTSP-SEPGSSIGAAIAASLTIPSGSTRSVTFSLAWDCPEVKFFE-KVYHRRYTKF 382
LD KTS S+PG SIGAAIAAS+ +P +T++V+F+LAW CPEVKF K YHRRYTKF
Sbjct: 315 LDLTKTSMCSKPGLSIGAAIAASVKLPPQTTQNVSFALAWACPEVKFSSGKTYHRRYTKF 374
Query: 383 YGTLGDSAARIAHDAILEHAKWECEIEAWQRPILEDKRFPEWYPITLFNELYYLNAGGTI 442
+GT D+AA +AHDAILEH WE +IE WQ PIL+D+RFP+WYP+TLFNELYYLNAGGTI
Sbjct: 375 HGTDNDAAASLAHDAILEHNSWERQIEEWQNPILQDERFPDWYPVTLFNELYYLNAGGTI 434
Query: 443 WTDGSPPMQSLATIRERKFSLDTPS----------------------------KIHTPAS 474
WTDG PP+QSL I E+KFSLD + +IH
Sbjct: 435 WTDGLPPIQSLTGIGEKKFSLDMQNGDADDANGIIPRNNTASDILNQMASVLERIHASME 494
Query: 475 SDTALGTRLLENGEENIGQFLYLEGAEYVMYNTYDVHFYSSFALVMLFPKLELSIQRDFA 534
S++A+GT LL+ GEENIGQFLYLEG EY M+NTYDVHFY+SF+L+MLFPKL+LSIQRDFA
Sbjct: 495 SNSAIGTTLLQ-GEENIGQFLYLEGIEYYMWNTYDVHFYASFSLIMLFPKLQLSIQRDFA 553
Query: 535 AAVMMHDPGTMKIMSDGKWVARKCLGAVPHDIGLDDPWFEINSYNLFNSSRWKDLNSKFV 594
AAV+MHDP ++++ DGKWVARK LGAVPHD+GL DPWF++N+Y L+N+ RWKDLN KFV
Sbjct: 554 AAVLMHDPEKLRMLHDGKWVARKVLGAVPHDLGLYDPWFKVNAYTLYNTDRWKDLNPKFV 613
Query: 595 LQVYRDFVATGDKNFARAVWPSVYIAMAYMEQFDKDGDGMIENEGFPDQTYDAWSANGVS 654
LQVYRD VATGDK+FARAVWPSVY+AMAYMEQFD+D DGMIENE FPDQTYD WS G+S
Sbjct: 614 LQVYRDVVATGDKSFARAVWPSVYMAMAYMEQFDRDKDGMIENEDFPDQTYDVWSMAGIS 673
Query: 655 AYCGGLWVAALQAASALANDVGDHASASYFWVRYQKAKAVYDSLWNGSYFNYDNSDGSSS 714
AYCGGLWVAALQAASALA++VGD AS FW +Y+KAK+VY LWNGSYFNYD+ D S
Sbjct: 674 AYCGGLWVAALQAASALAHEVGDKASEKLFWDKYEKAKSVYGKLWNGSYFNYDDGDNIMS 733
Query: 715 TSIQADQLAGQWYARACGLLPIADEAKVKKALTKIYDFNVLKVKGGMCGAMNGMQPDGRI 774
SI ADQLAGQWYA+ACGL PI D+ K + AL KIY FNV+K K G GAMNGM PDG +
Sbjct: 734 ASIHADQLAGQWYAKACGLFPIVDKDKAESALEKIYSFNVMKFKDGKRGAMNGMWPDGTV 793
Query: 775 DMSGLQAREIWPGVTYGLAASMIQEEMVDMAFQTAAGVYEVAWSEDGLGYSFQTPESWNN 834
DMS +Q+REIWPGVTY LAA+MIQE MV+ F+TA G+Y AWS +GLGYSFQTPE+WNN
Sbjct: 794 DMSAMQSREIWPGVTYALAATMIQEGMVEKGFKTAEGIYHAAWSPEGLGYSFQTPEAWNN 853
Query: 835 NDEYRSLCYMRPLTIWAMQWALTKPKLSRQ--EIKHEISDRDSSYLEQHAAFSKVASLLK 892
+DEYRSLCYMRPL IWA+QWAL+ PKL +Q +I + S+ + +++++A LL
Sbjct: 854 DDEYRSLCYMRPLAIWAIQWALSNPKLHKQTADIPQD------SFPKNQFSYARIAKLLH 907
Query: 893 LPKEEASKGFLKVVYDFTIGR 913
LP++E+ K FL+V+Y+ R
Sbjct: 908 LPEDESPKSFLRVIYEIVRNR 928
>gi|115474457|ref|NP_001060825.1| Os08g0111200 [Oryza sativa Japonica Group]
gi|42408344|dbj|BAD09496.1| putative Bile acid beta-glucosidase [Oryza sativa Japonica Group]
gi|42408391|dbj|BAD09542.1| putative Bile acid beta-glucosidase [Oryza sativa Japonica Group]
gi|113622794|dbj|BAF22739.1| Os08g0111200 [Oryza sativa Japonica Group]
gi|215701490|dbj|BAG92914.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 928
Score = 1023 bits (2644), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 520/929 (55%), Positives = 663/929 (71%), Gaps = 42/929 (4%)
Query: 8 EGENAASMPKVDPGQPPQLTWQRKLDSNVKVPTGFTLSFQEFRHLAHIGFRLYRYSKQEE 67
E E ++ +D +PP ++W+R D K F+++ + + + ++ + ++
Sbjct: 5 ELEETKAVLHMDCEKPPAISWERTFDDEGKKVAMFSMTLNDMMAIVPLMIKMLGLNLKDN 64
Query: 68 AKGKVPVFNVFRKHHITSDQGIPLGGIGAGSIGRSYRGEFQRFKLFHGICDDAPVLANQF 127
AKG V++ +K +G+PLGGIGAGSIGRSYRG FQ+F++F I ++ P+LANQF
Sbjct: 65 AKGLASVYDPLKKWMDNCYRGVPLGGIGAGSIGRSYRGYFQQFQIFPSIYEEKPILANQF 124
Query: 128 SVFVSRPNGEKFSSVLCPRSPGVPKKNTDSGIESWDWNLKGENCTYHALFPRAWTVYDGE 187
S F+SRP+G+++S+VL + V K +GI SWDW LK +NCTYH LFPR+WTVY+GE
Sbjct: 125 SAFISRPDGKRYSTVLSAPNADVLKGIDKAGIGSWDWKLKEKNCTYHGLFPRSWTVYNGE 184
Query: 188 PDPELRIVCRQISPFIPHNYKESSFPASVFTFTLSNSGQTSADVTLLFTWANSVAGDSGL 247
PDPE++I CRQISPFIPHNYKESSFP +VFTFTL NSG T ADVTLLFTWANSV G S L
Sbjct: 185 PDPEIKITCRQISPFIPHNYKESSFPVAVFTFTLHNSGSTPADVTLLFTWANSVGGKSEL 244
Query: 248 SGHHFNSKTMTKDGVHGLTLHHRTANGRPPVTFAVAAEETADVHVSECPCFLLS-GNSKG 306
+G+H NS+ MT TA+GRPPVTFA+A+ ET V V+ CP F + NS
Sbjct: 245 TGNHKNSR-MT------------TADGRPPVTFAIASRETDGVRVTGCPRFTMGPSNSGD 291
Query: 307 ITAKDMWNEIKKHGSFDHLDN---DKTSPSEPGSSIGAAIAASLTIPSGSTRSVTFSLAW 363
+TAKDMW++I K+GSF N T S PGSSIGAA+AA+ T+ +G R+V+F+L+W
Sbjct: 292 VTAKDMWDQINKNGSFVGDGNAAAAATGASRPGSSIGAAVAATTTVAAGGARAVSFALSW 351
Query: 364 DCPEVKF-FEKVYHRRYTKFYGTLGDSAA-RIAHDAILEHAKWECEIEAWQRPILEDKRF 421
CPEVKF + YHRRYTKF+GT D+AA R+AHDA+LEH KWE +IE WQRPIL+DK
Sbjct: 352 SCPEVKFPAGRTYHRRYTKFHGTDRDAAAERLAHDALLEHMKWESQIEEWQRPILQDKSL 411
Query: 422 PEWYPITLFNELYYLNAGGTIWTDGSPPMQSLATIRERKFSLDTPSKIHTPASS-DTALG 480
PEWYPITLFNELYYLNAGGTIWTDG PP + + F+LDT S + S+ D L
Sbjct: 412 PEWYPITLFNELYYLNAGGTIWTDGQPPKNTSLSSATEPFNLDTFSTVANGGSAVDGILS 471
Query: 481 T--------------------RLLENGEENIGQFLYLEGAEYVMYNTYDVHFYSSFALVM 520
T LL +GEEN+GQ LYLEG EY M+NTYDVHFY+SFAL+
Sbjct: 472 TVAVAAARSNTAAAAAAAMGTALLRDGEENVGQVLYLEGMEYNMWNTYDVHFYASFALLS 531
Query: 521 LFPKLELSIQRDFAAAVMMHDPGTMKIMSDGKWVARKCLGAVPHDIGLDDPWFEINSYNL 580
LFP+LEL++QRDF V++HDP + + DG VARK LGAVPHD+GL+DPWFE+N+Y L
Sbjct: 532 LFPELELNLQRDFVRGVLLHDPCLRRTL-DGATVARKVLGAVPHDMGLNDPWFEVNAYML 590
Query: 581 FNSSRWKDLNSKFVLQVYRDFVATGDKNFARAVWPSVYIAMAYMEQFDKDGDGMIENEGF 640
+ RWKDLN KFVLQVYRD VATG+ FA A WP+VY+AMAYM+QFD+DGDGM+ENEG
Sbjct: 591 HDPVRWKDLNPKFVLQVYRDVVATGNAGFAEAAWPAVYLAMAYMDQFDRDGDGMVENEGR 650
Query: 641 PDQTYDAWSANGVSAYCGGLWVAALQAASALANDVGDHASASYFWVRYQKAKAVY-DSLW 699
PDQTYD WS +GVSAY GGLWVAALQAA+A+A VGD A+ +YF RY +A+ VY D LW
Sbjct: 651 PDQTYDLWSVSGVSAYTGGLWVAALQAAAAMAGIVGDGAAEAYFRGRYHRARRVYTDELW 710
Query: 700 NGSYFNYDNSDGSSSTSIQADQLAGQWYARACGLLPIADEAKVKKALTKIYDFNVLKVKG 759
NG YFNYDNS G++S+SIQADQLAGQWYARACGL PI D K ++AL + D+NV++VKG
Sbjct: 711 NGGYFNYDNSGGATSSSIQADQLAGQWYARACGLEPIVDGDKARRALATVLDYNVMRVKG 770
Query: 760 GMCGAMNGMQPDGRIDMSGLQAREIWPGVTYGLAASMIQEEMVDMAFQTAAGVYEVAWSE 819
G GA+NGM+PDG +D S Q++E+WPGVTY +AA+MI E M + AF+TA G+++ W +
Sbjct: 771 GAIGAVNGMRPDGAVDASSTQSKEVWPGVTYAVAAAMIHEGMPEAAFKTAKGIHDAGWGK 830
Query: 820 DGLGYSFQTPESWNNNDEYRSLCYMRPLTIWAMQWALTKPKLSRQEIKHEISDRDSSYLE 879
G GY+FQTPESW + YR+L YMRPL +WAMQWAL+ P L ++ ++
Sbjct: 831 HGFGYAFQTPESWTADGGYRALHYMRPLGVWAMQWALSPPVLHKEHRVAAVAASPEDAAL 890
Query: 880 QHAAFSKVASLLKLPKEEASKGFLKVVYD 908
F KVAS+L+LP+E+ KG L+ +YD
Sbjct: 891 GQEKFDKVASMLRLPEEQQHKGILRALYD 919
>gi|242080271|ref|XP_002444904.1| hypothetical protein SORBIDRAFT_07g001140 [Sorghum bicolor]
gi|241941254|gb|EES14399.1| hypothetical protein SORBIDRAFT_07g001140 [Sorghum bicolor]
Length = 929
Score = 1016 bits (2627), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 523/946 (55%), Positives = 654/946 (69%), Gaps = 59/946 (6%)
Query: 7 EEGENAASMPKVDPGQPPQLTWQRKLDSNVKVPTGFTLSFQEFRHLAHIGFRLYRYSKQE 66
EE + A + +D QPP TWQR+ D K ++L+ + +
Sbjct: 4 EELQQAKVLSSIDCSQPPARTWQRRFDDEGKKVAMYSLTMNDLTAI-------------- 49
Query: 67 EAKGKVPVFNVFRKHHITSDQGIPLGGIGAGSIGRSYRGEFQRFKLFHGICDDAPVLANQ 126
V V++ RK +G+PLGGIGAGSIGRSYRG FQ+F++F ++ P+LANQ
Sbjct: 50 -----VGVYDPLRKWMDNCYRGVPLGGIGAGSIGRSYRGYFQQFQIFPATNEEKPILANQ 104
Query: 127 FSVFVSRPNGEKFSSVLCPRSPGVPKKNTDSGIESWDWNLKGENCTYHALFPRAWTVYDG 186
FS FVSRPNG K+S+VL + + K+ + I SWDW LK E CTYHALFPR+WTVYDG
Sbjct: 105 FSAFVSRPNGTKYSTVLSAPTADLLKEVDKASIGSWDWKLKEEKCTYHALFPRSWTVYDG 164
Query: 187 EPDPELRIVCRQISPFIPHNYKESSFPASVFTFTLSNSGQTSADVTLLFTWANSVAGDSG 246
EPDPE++I RQISPFIPHNY+ESSFPA+VFTFT+ NSG T+ADVTLLFTWANSV G S
Sbjct: 165 EPDPEIKITSRQISPFIPHNYRESSFPAAVFTFTVHNSGSTAADVTLLFTWANSVGGKSE 224
Query: 247 LSGHHFNSKTMTKDGVHGLTLHHRTANGRPPVTFAVAAEETADVHVSECPCFLLS-GNSK 305
L+G+H NSK +DGVHG+ LHHRTA G PPVTFA+A++ET V VS CP F + S+
Sbjct: 225 LTGNHVNSKMTEQDGVHGVLLHHRTAGGHPPVTFAIASQETDGVRVSVCPSFTMGPSGSR 284
Query: 306 GITAKDMWNEIKKH--GSFDHLDNDKTSPSEPGSSIGAAIAASLTIPSGSTRSVTFSLAW 363
TA DMW+EIKKH N + S+ GSS+GAA+AAS T+P+G TR V+FSLAW
Sbjct: 285 DFTAADMWDEIKKHGAFGHAGAGNAARAASKAGSSLGAAVAASTTVPAGETRVVSFSLAW 344
Query: 364 DCPEVKF-FEKVYHRRYTKFYGTLGDSAA-RIAHDAILEHAKWECEIEAWQRPILEDKRF 421
CP++KF YHRRYTKFYG D+AA ++AHDA+LEH WE +IE WQRPIL DKR
Sbjct: 345 ACPDIKFPAGSTYHRRYTKFYGVDADAAAEQLAHDALLEHMNWESQIEEWQRPILNDKRL 404
Query: 422 PEWYPITLFNELYYLNAGGTIWTDGSPPMQ---SLATIRERKFSLDT------------- 465
PEWYP+ LFNELYYLNAGGTIWTDG PP + + + FS+DT
Sbjct: 405 PEWYPVALFNELYYLNAGGTIWTDGQPPKKAGFASSLTGTEPFSIDTLLAEDGGTGGSAV 464
Query: 466 -------PSKIHTPASSDTALGTRLLENGEENIGQFLYLEGAEYVMYNTYDVHFYSSFAL 518
S S+ A G LL +GE+N+GQFLYLEG EY MYNTYDVHFY+SFAL
Sbjct: 465 DGVVRAVASAKELSHSAAAAFGAALLRDGEDNVGQFLYLEGMEYNMYNTYDVHFYASFAL 524
Query: 519 VMLFPKLELSIQRDFAAAVMMHDPGTMKIMSDGKWVARKCLGAVPHDIGLDDPWFEINSY 578
+ LFP LELS+QRDFA AV++HDP + DG+ V RK LG VPHD+GL+DPW E+N Y
Sbjct: 525 LSLFPSLELSLQRDFARAVLLHDPRRRRTF-DGRTVPRKVLGTVPHDVGLNDPWVEMNEY 583
Query: 579 NLFNSSRWKDLNSKFVLQVYRDFVATGDKNFARAVWPSVYIAMAYMEQFDKDGDGMIENE 638
L + + WKDLN KFVLQVYRD VATGD FA A WP+VYIAMAYM QFD+DGDGM+ENE
Sbjct: 584 MLHDPALWKDLNPKFVLQVYRDAVATGDAAFAEAAWPAVYIAMAYMHQFDRDGDGMVENE 643
Query: 639 GFPDQTYDAWSANGVSAYCGGLWVAALQAASALANDVGDHASASYFWVRYQKAKAVYDS- 697
G PDQTYD WS +GVSAY GGLWVAAL+AA+ +A VGD + +YF R +A VYD+
Sbjct: 644 GIPDQTYDIWSVSGVSAYTGGLWVAALEAAAGMARVVGDRHAEAYFRGRRDRAARVYDAE 703
Query: 698 LWNGSYFNYDNSDGSS-STSIQADQLAGQWYARACGLLPIADEAKVKKALTKIYDFNVLK 756
LWNG+YF YDNS G + S SI ADQ+AGQWYARACG+ P+ +E K + AL + D+NV++
Sbjct: 704 LWNGTYFRYDNSGGGACSESIMADQMAGQWYARACGMEPVVEEGKARSALATVLDYNVMR 763
Query: 757 VKGGMCGAMNGMQPDGRIDMSGLQAREIWPGVTYGLAASMIQEEMVDMAFQTAAGVYEVA 816
VKGG GA+NGM+PDG +D S Q++E+WPGVTY +AA+M+ E M + AF+TA G ++
Sbjct: 764 VKGGAVGAVNGMRPDGAVDASSGQSKEVWPGVTYAVAAAMVHEGMHEAAFRTAKGAHDAG 823
Query: 817 WSEDGLGYSFQTPESWNNNDEYRSLCYMRPLTIWAMQWALTKPKLSR-------QEIKHE 869
W +DG GY+FQTPE+W + YRSL YMRPL IWAMQWAL+ P+L + + ++ E
Sbjct: 824 WGKDGFGYAFQTPEAWTEDGGYRSLHYMRPLGIWAMQWALSPPELHKDLRAPGARSVEEE 883
Query: 870 ISDRDSSYLEQHAAFSKVASLLKLPKEEASKGFLKVVYDFTIGRVL 915
S D++ A F KVA++L+LP+E+ KG+L +Y VL
Sbjct: 884 SSPADAAL--GQAQFEKVANMLRLPEEKQPKGYLWAIYKVIRQMVL 927
>gi|357144426|ref|XP_003573288.1| PREDICTED: non-lysosomal glucosylceramidase-like isoform 1
[Brachypodium distachyon]
Length = 932
Score = 1005 bits (2599), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 524/924 (56%), Positives = 661/924 (71%), Gaps = 27/924 (2%)
Query: 5 VKEEGENAASMPKVDPGQPPQLTWQRKLDSNVKVPTGFTLSFQEFRHLAHIGFRLYRYSK 64
+K E A S+ +D QP TWQRK D K F+++ + + + F+ +
Sbjct: 4 LKLEQTKAHSL--IDCAQPATRTWQRKFDDEGKKIEMFSMTMNDMILIVPMIFKGLVRNA 61
Query: 65 QEEAKGKVPVFNVFRKHHITSDQGIPLGGIGAGSIGRSYRGEFQRFKLFHGICDDAPVLA 124
+E KG+ +++ F+K +G+PLGG+GAGSIGRSYRG FQ F++F I + P+LA
Sbjct: 62 EERGKGRDVIYDPFKKWMDNCYRGLPLGGLGAGSIGRSYRGYFQHFQIFPAIYEQKPILA 121
Query: 125 NQFSVFVSRPNGEKFSSVLCPRSPGVPKKNTDSGIESWDWNLKGENCTYHALFPRAWTVY 184
NQFS F+SRPNG+ +S+VL + V K +GI SWDW LK +NCTYHALFPR+WTVY
Sbjct: 122 NQFSAFISRPNGKSYSTVLSAPTADVLKGVDKAGIGSWDWKLKEKNCTYHALFPRSWTVY 181
Query: 185 DGEPDPELRIVCRQISPFIPHNYKESSFPASVFTFTLSNSGQTSADVTLLFTWANSVAGD 244
DGEPDP ++I CRQISPFIPHNYKESSFP VFTFT+ N+G T ADVTLLFTWANSV G
Sbjct: 182 DGEPDPAIKITCRQISPFIPHNYKESSFPVVVFTFTVHNTGSTPADVTLLFTWANSVGGR 241
Query: 245 SGLSGHHFNSKTMTKDGVHGLTLHHRTAN-GRPPVTFAVAAEETADVHVSECPCFLLSGN 303
S L+G+H NSK M +DGVHG+ LHHRTAN G PPVTFA+A++ET DV V++CP F + +
Sbjct: 242 SELTGNHTNSKMMERDGVHGVLLHHRTANDGHPPVTFAIASQETEDVRVTDCPSFTMGSS 301
Query: 304 SKG-ITAKDMWNEIKKHGSFDHLDNDKTSPSEPGSSIGAAIAASLTIPSGSTRSVTFSLA 362
S G TAKDMW EIKKHGSF D + + S+PGSSIGAA+AA+ T+P+G TR V+F+L+
Sbjct: 302 SSGDFTAKDMWQEIKKHGSFITTDKEPRA-SKPGSSIGAAVAAATTVPAGGTRVVSFALS 360
Query: 363 WDCPEVKF-FEKVYHRRYTKFYGTLGDSAAR-IAHDAILEHAKWECEIEAWQRPILEDKR 420
W CPEVKF + YHRRYT F+G D+AA +AHDA+LEH WE +IE WQRPIL+DKR
Sbjct: 361 WSCPEVKFPAGRTYHRRYTTFHGLDRDAAAESLAHDALLEHMDWESKIEEWQRPILQDKR 420
Query: 421 FPEWYPITLFNELYYLNAGGTIWTDGSPPMQSLAT--IRERKFSLDTP-------SKIHT 471
P+WYP+ LFNELYYLNAGGTIWTDG PP S+++ + FSL T I +
Sbjct: 421 LPDWYPVALFNELYYLNAGGTIWTDGLPPKMSVSSSGVGTEPFSLATTYLPGTAVDGILS 480
Query: 472 PASSDT-------ALGTRLLENGEENIGQFLYLEGAEYVMYNTYDVHFYSSFALVMLFPK 524
+S T A G LL +GEEN+GQFLYLE EY MYNTYDVHFY+SFAL+ LFP+
Sbjct: 481 AVASATEDQHSAAAFGATLLGDGEENVGQFLYLEATEYYMYNTYDVHFYASFALLSLFPE 540
Query: 525 LELSIQRDFAAAVMMHDPGTMKIMSDGKWVARKCLGAVPHDIGLDDPWFEINSYNLFNSS 584
LELS+QRDFA AV++HDP M + DG V RK LGAVPHDIGL+DPWFE+N+Y + + +
Sbjct: 541 LELSLQRDFARAVLIHDPRPM-VTLDGATVLRKVLGAVPHDIGLNDPWFELNAYMIHDPA 599
Query: 585 RWKDLNSKFVLQVYRDFVATGDKNFARAVWPSVYIAMAYMEQFDKDGDGMIENEGFPDQT 644
RWKDLN KFVLQVYR V TG+ FA+A WP+VY+AMAYM+Q+D+DGDGM+ENEG PDQT
Sbjct: 600 RWKDLNPKFVLQVYRAVVVTGNAAFAKAAWPAVYLAMAYMDQYDRDGDGMVENEG-PDQT 658
Query: 645 YDAWSANGVSAYCGGLWVAALQAASALANDVGDHASASYFWVRYQKAKAVYDS-LWNGSY 703
YD WS GVSAY GG+WV ALQA +A+A VGD + YF RY +AK VYD+ LWNG+Y
Sbjct: 659 YDLWSVTGVSAYTGGIWVTALQATAAMARIVGDGDAECYFHGRYLRAKQVYDAELWNGTY 718
Query: 704 FNYDNSDGSSSTSIQADQLAGQWYARACGLLPIADEAKVKKALTKIYDFNVLKVKGGMCG 763
F YDNS G +S SI ADQLAGQWYA CGL P+ +E K + AL + D+NV++VKGG G
Sbjct: 719 FRYDNSGGETSESIMADQLAGQWYAHVCGLEPVVEEDKARSALATVLDYNVMRVKGGAVG 778
Query: 764 AMNGMQPDGRIDMSGLQAREIWPGVTYGLAASMIQEEMVDMAFQTAAGVYEVAWSEDGLG 823
A+NGM+PDG IDMS Q++EIW G TY +AA+M+ E M + AF+TA G ++ +W + G G
Sbjct: 779 AVNGMRPDGGIDMSSTQSKEIWSGTTYAVAAAMVHEGMPEGAFRTAKGAHDASWGKAGFG 838
Query: 824 YSFQTPESWNNNDEYRSLCYMRPLTIWAMQWALTKPKLSRQEIKHEISDRDSSYLEQHAA 883
Y+FQTPE+W YR L YMRPL++WAMQWAL+ P+L + +S +
Sbjct: 839 YAFQTPEAWTAEGGYRGLHYMRPLSVWAMQWALSPPELHKDLGLASVSPPGDA-ARGREK 897
Query: 884 FSKVASLLKLPKEEASKGFLKVVY 907
F KVA +L+LP++ KGF +Y
Sbjct: 898 FDKVAGMLRLPEKVQHKGFFLALY 921
>gi|222639789|gb|EEE67921.1| hypothetical protein OsJ_25789 [Oryza sativa Japonica Group]
Length = 919
Score = 1004 bits (2596), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 511/929 (55%), Positives = 653/929 (70%), Gaps = 51/929 (5%)
Query: 8 EGENAASMPKVDPGQPPQLTWQRKLDSNVKVPTGFTLSFQEFRHLAHIGFRLYRYSKQEE 67
E E ++ +D +PP ++W+R D K F+++ + + + ++ + ++
Sbjct: 5 ELEETKAVLHMDCEKPPAISWERTFDDEGKKVAMFSMTLNDMMAIVPLMIKMLGLNLKDN 64
Query: 68 AKGKVPVFNVFRKHHITSDQGIPLGGIGAGSIGRSYRGEFQRFKLFHGICDDAPVLANQF 127
AKG V++ +K +G+PLGGIGAGSIGRSYRG FQ+F++F I ++ P+LANQF
Sbjct: 65 AKGLASVYDPLKKWMDNCYRGVPLGGIGAGSIGRSYRGYFQQFQIFPSIYEEKPILANQF 124
Query: 128 SVFVSRPNGEKFSSVLCPRSPGVPKKNTDSGIESWDWNLKGENCTYHALFPRAWTVYDGE 187
S F+SRP+G+++S+VL + V K +GI SWDW LK +NCTYH LFPR+WTVY+GE
Sbjct: 125 SAFISRPDGKRYSTVLSAPNADVLKGIDKAGIGSWDWKLKEKNCTYHGLFPRSWTVYNGE 184
Query: 188 PDPELRIVCRQISPFIPHNYKESSFPASVFTFTLSNSGQTSADVTLLFTWANSVAGDSGL 247
PDPE++I CRQISPFIPHNYKESSFP +VFTFTL NSG T ADVTLLFTWANSV G S L
Sbjct: 185 PDPEIKITCRQISPFIPHNYKESSFPVAVFTFTLHNSGSTPADVTLLFTWANSVGGKSEL 244
Query: 248 SGHHFNSKTMTKDGVHGLTLHHRTANGRPPVTFAVAAEETADVHVSECPCFLLS-GNSKG 306
+G+H NS+ +DGVHG+ LHHRTA+GRPPVTFA+A+ ET V V+ CP F + NS
Sbjct: 245 TGNHKNSRMTARDGVHGVLLHHRTADGRPPVTFAIASRETDGVRVTGCPRFTMGPSNSGD 304
Query: 307 ITAKDMWNEIKKHGSFDHLDN---DKTSPSEPGSSIGAAIAASLTIPSGSTRSVTFSLAW 363
+TAKDMW++I K+GSF N T S PGSSIGAA+AA+ T+ +G R+V+F+L+W
Sbjct: 305 VTAKDMWDQINKNGSFVGDGNAAAAATGASRPGSSIGAAVAATTTVAAGGARAVSFALSW 364
Query: 364 DCPEVKF-FEKVYHRRYTKFYGTLGDSAA-RIAHDAILEHAKWECEIEAWQRPILEDKRF 421
CPEVKF + YHRRYTKF+GT D+AA R+AHDA+LEH KWE +IE WQRPIL+DK
Sbjct: 365 SCPEVKFPAGRTYHRRYTKFHGTDRDAAAERLAHDALLEHMKWESQIEEWQRPILQDKSL 424
Query: 422 PEWYPITLFNELYYLNAGGTIWTDGSPPMQSLATIRERKFSLDTPSKIHTPASS-DTALG 480
PEWYPITLFNELYYLNAGGTIWTDG PP + + F+LDT S + S+ D L
Sbjct: 425 PEWYPITLFNELYYLNAGGTIWTDGQPPKNTSLSSATEPFNLDTFSTVANGGSAVDGILS 484
Query: 481 T--------------------RLLENGEENIGQFLYLEGAEYVMYNTYDVHFYSSFALVM 520
T LL +GEEN+GQ LYLEG EY M+NTYDVHFY+SFAL+
Sbjct: 485 TVAVAAARSNTAAAAAAAMGTALLRDGEENVGQVLYLEGMEYNMWNTYDVHFYASFALLS 544
Query: 521 LFPKLELSIQRDFAAAVMMHDPGTMKIMSDGKWVARKCLGAVPHDIGLDDPWFEINSYNL 580
LFP+LEL++QRDF V++HDP + + DG VARK LGAVPHD+GL+DPWFE+N+Y L
Sbjct: 545 LFPELELNLQRDFVRGVLLHDPCLRRTL-DGATVARKVLGAVPHDMGLNDPWFEVNAYML 603
Query: 581 FNSSRWKDLNSKFVLQVYRDFVATGDKNFARAVWPSVYIAMAYMEQFDKDGDGMIENEGF 640
+ RWKDLN KFVLQVYRD VATG+ FA A WP+VY+AMAYM+QFD+DGDGM+ENEG
Sbjct: 604 HDPVRWKDLNPKFVLQVYRDVVATGNAGFAEAAWPAVYLAMAYMDQFDRDGDGMVENEGR 663
Query: 641 PDQTYDAWSANGVSAYCGGLWVAALQAASALANDVGDHASASYFWVRYQKAKAVY-DSLW 699
PDQTYD WS +GVSAY GGLWVAALQAA+A+A VGD A+ +YF RY +A+ VY D LW
Sbjct: 664 PDQTYDLWSVSGVSAYTGGLWVAALQAAAAMAGIVGDGAAEAYFRGRYHRARRVYTDELW 723
Query: 700 NGSYFNYDNSDGSSSTSIQADQLAGQWYARACGLLPIADEAKVKKALTKIYDFNVLKVKG 759
NG YFNYDNS G++S+SIQADQLAGQWYARACGL PI D K ++AL + D+NV++
Sbjct: 724 NGGYFNYDNSGGATSSSIQADQLAGQWYARACGLEPIVDGDKARRALATVLDYNVMRT-- 781
Query: 760 GMCGAMNGMQPDGRIDMSGLQAREIWPGVTYGLAASMIQEEMVDMAFQTAAGVYEVAWSE 819
++E+ PGVTY +AA+MI E M + AF+TA G+++ W +
Sbjct: 782 --------------------HSKEVCPGVTYAVAAAMIHEGMPEAAFKTAKGIHDAGWGK 821
Query: 820 DGLGYSFQTPESWNNNDEYRSLCYMRPLTIWAMQWALTKPKLSRQEIKHEISDRDSSYLE 879
G GY+FQTPESW + YR+L YMRPL +WAMQWAL+ P L ++ ++
Sbjct: 822 HGFGYAFQTPESWTADGGYRALHYMRPLGVWAMQWALSPPVLHKEHRVAAVAASPEDAAL 881
Query: 880 QHAAFSKVASLLKLPKEEASKGFLKVVYD 908
F KVAS+L+LP+E+ KG L+ +YD
Sbjct: 882 GQEKFDKVASMLRLPEEQQHKGILRALYD 910
>gi|357144429|ref|XP_003573289.1| PREDICTED: non-lysosomal glucosylceramidase-like isoform 2
[Brachypodium distachyon]
Length = 919
Score = 975 bits (2520), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 512/923 (55%), Positives = 650/923 (70%), Gaps = 38/923 (4%)
Query: 5 VKEEGENAASMPKVDPGQPPQLTWQRKLDSNVKVPTGFTLSFQEFRHLAHIGFRLYRYSK 64
+K E A S+ +D QP TWQRK D K F+++ + + + F+ +
Sbjct: 4 LKLEQTKAHSL--IDCAQPATRTWQRKFDDEGKKIEMFSMTMNDMILIVPMIFKGLVRNA 61
Query: 65 QEEAKGKVPVFNVFRKHHITSDQGIPLGGIGAGSIGRSYRGEFQRFKLFHGICDDAPVLA 124
+E KG+ +++ F+K +G+PLGG+GAGSIGRSYRG FQ F++F I + P+LA
Sbjct: 62 EERGKGRDVIYDPFKKWMDNCYRGLPLGGLGAGSIGRSYRGYFQHFQIFPAIYEQKPILA 121
Query: 125 NQFSVFVSRPNGEKFSSVLCPRSPGVPKKNTDSGIESWDWNLKGENCTYHALFPRAWTVY 184
NQFS F+SRPNG+ +S+VL + V K +GI SWDW LK +NCTYHALFPR+WTVY
Sbjct: 122 NQFSAFISRPNGKSYSTVLSAPTADVLKGVDKAGIGSWDWKLKEKNCTYHALFPRSWTVY 181
Query: 185 DGEPDPELRIVCRQISPFIPHNYKESSFPASVFTFTLSNSGQTSADVTLLFTWANSVAGD 244
DGEPDP ++I CRQISPFIPHNYKESSFP VFTFT+ N+G T ADVTLLFTWANSV G
Sbjct: 182 DGEPDPAIKITCRQISPFIPHNYKESSFPVVVFTFTVHNTGSTPADVTLLFTWANSVGGR 241
Query: 245 SGLSGHHFNSKTMTKDGVHGLTLHHRTANGRPPVTFAVAAEETADVHVSECPCFLLSGNS 304
S L+G+H NSK +T + +G PPVTFA+A++ET DV V++CP F + +S
Sbjct: 242 SELTGNHTNSKMITAN------------DGHPPVTFAIASQETEDVRVTDCPSFTMGSSS 289
Query: 305 KG-ITAKDMWNEIKKHGSFDHLDNDKTSPSEPGSSIGAAIAASLTIPSGSTRSVTFSLAW 363
G TAKDMW EIKKHGSF D + + S+PGSSIGAA+AA+ T+P+G TR V+F+L+W
Sbjct: 290 SGDFTAKDMWQEIKKHGSFITTDKEPRA-SKPGSSIGAAVAAATTVPAGGTRVVSFALSW 348
Query: 364 DCPEVKF-FEKVYHRRYTKFYGTLGDSAAR-IAHDAILEHAKWECEIEAWQRPILEDKRF 421
CPEVKF + YHRRYT F+G D+AA +AHDA+LEH WE +IE WQRPIL+DKR
Sbjct: 349 SCPEVKFPAGRTYHRRYTTFHGLDRDAAAESLAHDALLEHMDWESKIEEWQRPILQDKRL 408
Query: 422 PEWYPITLFNELYYLNAGGTIWTDGSPPMQSLAT--IRERKFSLDTP-------SKIHTP 472
P+WYP+ LFNELYYLNAGGTIWTDG PP S+++ + FSL T I +
Sbjct: 409 PDWYPVALFNELYYLNAGGTIWTDGLPPKMSVSSSGVGTEPFSLATTYLPGTAVDGILSA 468
Query: 473 ASSDT-------ALGTRLLENGEENIGQFLYLEGAEYVMYNTYDVHFYSSFALVMLFPKL 525
+S T A G LL +GEEN+GQFLYLE EY MYNTYDVHFY+SFAL+ LFP+L
Sbjct: 469 VASATEDQHSAAAFGATLLGDGEENVGQFLYLEATEYYMYNTYDVHFYASFALLSLFPEL 528
Query: 526 ELSIQRDFAAAVMMHDPGTMKIMSDGKWVARKCLGAVPHDIGLDDPWFEINSYNLFNSSR 585
ELS+QRDFA AV++HDP M + DG V RK LGAVPHDIGL+DPWFE+N+Y + + +R
Sbjct: 529 ELSLQRDFARAVLIHDPRPM-VTLDGATVLRKVLGAVPHDIGLNDPWFELNAYMIHDPAR 587
Query: 586 WKDLNSKFVLQVYRDFVATGDKNFARAVWPSVYIAMAYMEQFDKDGDGMIENEGFPDQTY 645
WKDLN KFVLQVYR V TG+ FA+A WP+VY+AMAYM+Q+D+DGDGM+ENEG PDQTY
Sbjct: 588 WKDLNPKFVLQVYRAVVVTGNAAFAKAAWPAVYLAMAYMDQYDRDGDGMVENEG-PDQTY 646
Query: 646 DAWSANGVSAYCGGLWVAALQAASALANDVGDHASASYFWVRYQKAKAVYDS-LWNGSYF 704
D WS GVSAY GG+WV ALQA +A+A VGD + YF RY +AK VYD+ LWNG+YF
Sbjct: 647 DLWSVTGVSAYTGGIWVTALQATAAMARIVGDGDAECYFHGRYLRAKQVYDAELWNGTYF 706
Query: 705 NYDNSDGSSSTSIQADQLAGQWYARACGLLPIADEAKVKKALTKIYDFNVLKVKGGMCGA 764
YDNS G +S SI ADQLAGQWYA CGL P+ +E K + AL + D+NV++VKGG GA
Sbjct: 707 RYDNSGGETSESIMADQLAGQWYAHVCGLEPVVEEDKARSALATVLDYNVMRVKGGAVGA 766
Query: 765 MNGMQPDGRIDMSGLQAREIWPGVTYGLAASMIQEEMVDMAFQTAAGVYEVAWSEDGLGY 824
+NGM+PDG IDMS Q++EIW G TY +AA+M+ E M + AF+TA G ++ +W + G GY
Sbjct: 767 VNGMRPDGGIDMSSTQSKEIWSGTTYAVAAAMVHEGMPEGAFRTAKGAHDASWGKAGFGY 826
Query: 825 SFQTPESWNNNDEYRSLCYMRPLTIWAMQWALTKPKLSRQEIKHEISDRDSSYLEQHAAF 884
+FQTPE+W YR L YMRPL++WAMQWAL+ P+L + +S + F
Sbjct: 827 AFQTPEAWTAEGGYRGLHYMRPLSVWAMQWALSPPELHKDLGLASVSPPGDA-ARGREKF 885
Query: 885 SKVASLLKLPKEEASKGFLKVVY 907
KVA +L+LP++ KGF +Y
Sbjct: 886 DKVAGMLRLPEKVQHKGFFLALY 908
>gi|357139475|ref|XP_003571307.1| PREDICTED: non-lysosomal glucosylceramidase-like [Brachypodium
distachyon]
Length = 946
Score = 946 bits (2444), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 512/935 (54%), Positives = 643/935 (68%), Gaps = 36/935 (3%)
Query: 8 EGENAASMPKVDPGQPPQLTWQRKLDSNVKVPTGFTLSFQEFRHLAHIGFRLYRYSKQEE 67
E E + +D QPP+ TWQRK D K F+ + + +A + ++ R +E
Sbjct: 5 ELEQTKAHSHIDCAQPPERTWQRKFDDEGKEIAMFSFTMNDIMTVAPLIVKMMRILVEER 64
Query: 68 AKGKVPVFNVFRKHHITSDQGIPLGGIGAGSIGRSYRGEFQRFKLFHGICDDAPVLANQF 127
AKGK V++ F+K +G+PLG +GAGSIGRSYRG FQ F+LF GI + P+LANQF
Sbjct: 65 AKGKASVYDPFKKWMSNCYRGVPLGTLGAGSIGRSYRGYFQHFQLFPGIYEQKPILANQF 124
Query: 128 SVFVSRPNGEKFSSVLCPRSPGVPKKNTDSGIESWDWNLKGENCTYHALFPRAWTVYDGE 187
S FVSRP G+ +S+VL V K + I SWDWNLK +NC+YH LFPR+WTVYDGE
Sbjct: 125 SAFVSRPGGKSYSTVLSAPPAEVLKGIDKASIGSWDWNLKEKNCSYHGLFPRSWTVYDGE 184
Query: 188 PDPELRIVCRQISPFIPHNYKESSFPASVFTFTLSNSGQTSADVTLLFTWANSVAGDSGL 247
PDPE++I CRQISPFIP NYKESSFP +VFTFT+ NSG T ADVTLLFTWANSV G S L
Sbjct: 185 PDPEIKITCRQISPFIPRNYKESSFPVAVFTFTVQNSGSTPADVTLLFTWANSVGGKSEL 244
Query: 248 SGHHFNSKTMTKDGVHGLTLHHRTANGRPPVTFAVAAEETADVHVSECPCFLLSGNSKG- 306
+G+H NS +DGVHG+ L H TA G PPVTFA+A+++T DV V+ CP F + +SK
Sbjct: 245 TGNHSNSTMRARDGVHGVLLRHSTAEGHPPVTFAIASQDTGDVRVTCCPSFSMGPSSKSG 304
Query: 307 -ITAKDMWNEIKKHGSFDHLDNDKTSPSEPGSSIGAAIAASLTIPSGSTRSVTFSLAWDC 365
TAK+MW EIKKHGSF PS PGSS+GAA+AA+ T+ +G TR V+F+L+W C
Sbjct: 305 EPTAKEMWEEIKKHGSFGDAAAGTGRPSRPGSSVGAAVAAATTVTAGCTREVSFALSWSC 364
Query: 366 PEVKF-FEKVYHRRYTKFYGTLGDSAA-RIAHDAILEHAKWECEIEAWQRPILEDKRFPE 423
P VKF + YHRR+TKF G D+AA ++AH A+LEH +WE +IEAWQRP+L+DK P+
Sbjct: 365 PVVKFPAGRTYHRRHTKFVGLDRDAAAEQLAHHALLEHMEWERQIEAWQRPVLQDKSLPD 424
Query: 424 WYPITLFNELYYLNAGGTIWTDGSP----PMQSLATIRERKFSLDTPSKIHTPASSDTAL 479
WYP+ LFNELYYLNAGGTIWTDG P + S ++ FSL SS TA
Sbjct: 425 WYPVALFNELYYLNAGGTIWTDGMPSKKTSLVSSSSGTMEPFSLAAFHPDPNATSSSTAA 484
Query: 480 GTRLL------------ENGEENIGQFLYLEGAEYVMYNTYDVHFYSSFALVMLFPKLEL 527
LL E+ +G+FLYLEG EY MYNTYDVHFY+SFAL+ LFP LEL
Sbjct: 485 DDILLAMARAEEHLLPAAEDEKGVGKFLYLEGMEYHMYNTYDVHFYASFALLSLFPDLEL 544
Query: 528 SIQRDFAAAVMMHDPGTMKIMSDGKWVARKCLGAVPHDIGLDDPWFEINSYNLFNSSRWK 587
S+QRDFAAAV+ HDP M + DGK+V RK LGAVPHDIGL+DPW E+N+Y + + SRWK
Sbjct: 545 SLQRDFAAAVLRHDPRLMYTL-DGKFVPRKVLGAVPHDIGLNDPWHELNAYMIHDPSRWK 603
Query: 588 DLNSKFVLQVYRDFVATGDKNFARAVWPSVYIAMAYMEQFDKDGDGMIENEGFPDQTYDA 647
DLN KFVLQVYRD ATGD +FAR+ WPSVY+A+AYM+QFD+D DGM+ENEG PDQTYD
Sbjct: 604 DLNPKFVLQVYRDVAATGDLDFARSAWPSVYVALAYMDQFDRDRDGMVENEGRPDQTYDL 663
Query: 648 WSANGVSAYCGGLWVAALQAASALANDVGDHASASYFWVRYQKAKAVYDS-LWNGSYFNY 706
WS +GVSAY GGLWVAAL+AA+A+A VGD + + F RY +A VYDS LW G YF Y
Sbjct: 664 WSVSGVSAYTGGLWVAALRAATAMAALVGDLPAEAVFLERYNRANKVYDSELWTGDYFRY 723
Query: 707 DNSDGSSSTSIQADQLAGQWYARACGLLPIADEAKVKKALTKIYDFNVLKVKGGMCGAMN 766
DNS G +S S+ ADQLAGQWYARACGL P+ K ++AL + + NV++V+GG GA+N
Sbjct: 724 DNSGGGNSESVMADQLAGQWYARACGLEPVVGRDKARRALAAVLEHNVMQVQGGGVGAVN 783
Query: 767 GMQ-PD--GRIDMSGLQAREIWPGVTYGLAASMIQEEMVDMAFQTAAGVYEVAWSEDGLG 823
G + P+ G +D S Q++E+W G TY +AA+MI E M + F A G Y W EDG G
Sbjct: 784 GARLPEHGGGVDESSTQSKEVWTGTTYAVAAAMIGEGMREEGFTAAKGAYGAGWGEDGYG 843
Query: 824 YSFQTPESWNNNDE--YRSLCYMRPLTIWAMQWALTK-----PKLSRQEIKHEISDRDSS 876
Y+FQ PESW + YRSL YMRPL +WAMQWAL+ P+L R K + ++ +
Sbjct: 844 YAFQMPESWTADGAGGYRSLHYMRPLAVWAMQWALSPPTPVLPELERMAAKSKATEEEVE 903
Query: 877 YLEQHAAFSKVASLLKLP--KEEASKGFLKVVYDF 909
++ F KVASLL+LP +++ KG+L+ Y
Sbjct: 904 LAKEK--FEKVASLLRLPEEEKKKEKGYLRAFYQI 936
>gi|326517703|dbj|BAK03770.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 821
Score = 942 bits (2434), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 478/812 (58%), Positives = 595/812 (73%), Gaps = 39/812 (4%)
Query: 130 FVSRPNGEKFSSVLCPRSPGVPKKNTDSGIESWDWNLKGENCTYHALFPRAWTVYDGEPD 189
FVS P G+ +S+VL + + K +GI SWDW LK ENC YH LFPR+WTVYDGEPD
Sbjct: 2 FVSHPGGKSYSTVLSAPTADLLKGIDKAGIGSWDWKLKEENCNYHGLFPRSWTVYDGEPD 61
Query: 190 PELRIVCRQISPFIPHNYKESSFPASVFTFTLSNSGQTSADVTLLFTWANSVAGDSGLSG 249
+++I CRQISPFIPHNYKESSFP +VFTFT+ NSG T ADVTLLFTWANSV G S L+G
Sbjct: 62 TKIKITCRQISPFIPHNYKESSFPVTVFTFTVQNSGSTPADVTLLFTWANSVGGRSELTG 121
Query: 250 HHFNSKTMTKDGVHGLTLHHRTANGRPPVTFAVAAEETADVHVSECPCFLLSGNSKG--- 306
+H NS+ +DGVHG+ L HRTA+G PPVTFA+A+EET+DV V+ CP F + +S G
Sbjct: 122 NHTNSRIKARDGVHGVLLRHRTADGNPPVTFAIASEETSDVRVTCCPTFAMGPSSPGAEQ 181
Query: 307 ITAKDMWNEIKKHGSF-DHLDNDKTSPSEPGSSIGAAIAASLTIPSGSTRSVTFSLAWDC 365
TAKDMW+E+K GSF T+ S PGSSIGAA+A++ +P+G TR V+F+L+W C
Sbjct: 182 FTAKDMWDEVKNRGSFGGDAGGAPTASSRPGSSIGAAVASTTKVPAGDTRVVSFALSWSC 241
Query: 366 PEVKFFE-KVYHRRYTKFYGTLGDSAA-RIAHDAILEHAKWECEIEAWQRPILEDKRFPE 423
PEVKF + YHRRYTKF G D+AA ++ HDA+LEH KWE IE WQ PIL+D+R PE
Sbjct: 242 PEVKFPSGRTYHRRYTKFLGLDRDAAAEQLVHDALLEHMKWESLIEEWQTPILQDRRLPE 301
Query: 424 WYPITLFNELYYLNAGGTIWTDGSPPMQ-SLATIR----ERKFSLD-------------- 464
WYP+ LFNELYYLNAGGTIWTDG PP + S A+ + FSLD
Sbjct: 302 WYPVALFNELYYLNAGGTIWTDGMPPKKTSFASSKYGSTAESFSLDGFRAGDPAVDGILR 361
Query: 465 ---TPSKIHTPASSDTALGTRLLENGEENIGQFLYLEGAEYVMYNTYDVHFYSSFALVML 521
T + P+S A GT LL +GEEN+GQFLYLEG EY M+NTYDVHFY+SF+L+ L
Sbjct: 362 AMATAEERLEPSS---AFGTALLGDGEENVGQFLYLEGMEYHMWNTYDVHFYASFSLLSL 418
Query: 522 FPKLELSIQRDFAAAVMMHDPGTMKIMSDGKWVARKCLGAVPHDIGLDDPWFEINSYNLF 581
FP++ELS+QRDFA AV++HDP M+ + DG V RK LGAVPHDIGL DPWFE+N+Y +
Sbjct: 419 FPEIELSLQRDFARAVLLHDPRPMRTL-DGVDVPRKVLGAVPHDIGLADPWFELNAYMIH 477
Query: 582 NSSRWKDLNSKFVLQVYRDFVATGDKNFARAVWPSVYIAMAYMEQFDKDGDGMIENEGFP 641
+ SRWKDLN KFVLQVYRD ATG+ FA A WP+VY+AMAYM+QFD+DGDGM+ENEG P
Sbjct: 478 DPSRWKDLNPKFVLQVYRDVAATGNVTFATAAWPAVYLAMAYMDQFDRDGDGMVENEGRP 537
Query: 642 DQTYDAWSANGVSAYCGGLWVAALQAASALANDVGDHASASYFWVRYQKAKAVYDS-LWN 700
DQTYD WS +GVSAY GGLW+AALQAA+A+A VGD + YF RY++A+ VYD LWN
Sbjct: 538 DQTYDLWSVSGVSAYTGGLWLAALQAAAAMARIVGDRGAEGYFLERYKRAQRVYDGELWN 597
Query: 701 GSYFNYDNSDGSSSTSIQADQLAGQWYARACGLLPIADEAKVKKALTKIYDFNVLKVKGG 760
GSYF+YDNS ++S SI ADQLAGQWYARACGL PI +E K + AL + D+NV++V+GG
Sbjct: 598 GSYFDYDNSGCATSKSIMADQLAGQWYARACGLEPIVEEEKARSALGTVLDYNVMRVQGG 657
Query: 761 MCGAMNGMQPDGRIDMSGLQAREIWPGVTYGLAASMIQEEMVDMAFQTAAGVYEVAWSED 820
GA+NGM+PDG +D S LQ++E+W GVTYG+AA+M+ E M + AF+TA G ++ W D
Sbjct: 658 AVGAVNGMRPDGAVDASSLQSKEVWVGVTYGVAAAMLHEGMTEAAFRTAKGAHDAGWGRD 717
Query: 821 GLGYSFQTPESWNNN--DEYRSLCYMRPLTIWAMQWALTKPKLSR--QEIKHEISDRDSS 876
G GY+FQTPE+W ++ YRSL YMRPL+IWAMQWAL+ P+L R + + +S S
Sbjct: 718 GFGYAFQTPEAWTSDAGGGYRSLHYMRPLSIWAMQWALSPPELHRDLRVVPGSVSAVASP 777
Query: 877 YLEQHA--AFSKVASLLKLPKEEASKGFLKVV 906
A F KVAS+LKLP+E KG+L+ +
Sbjct: 778 AEVDLAREKFEKVASMLKLPEEVQHKGYLRAI 809
>gi|302807226|ref|XP_002985326.1| hypothetical protein SELMODRAFT_121820 [Selaginella moellendorffii]
gi|300147154|gb|EFJ13820.1| hypothetical protein SELMODRAFT_121820 [Selaginella moellendorffii]
Length = 983
Score = 929 bits (2401), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 486/935 (51%), Positives = 637/935 (68%), Gaps = 62/935 (6%)
Query: 19 DPGQPPQLTWQRKLDSNVKVPTGFTLSFQE-FRHLAHIGFRLYRYSKQEEAKGKVPVFNV 77
DPG PP W+RKL+ + T F ++F E + + +G RL+++ + E ++G+ P +
Sbjct: 27 DPGAPPAHAWRRKLNCHANFLTEFRVTFTEALKLVLGLGLRLWKFIQAERSQGRTPPIDP 86
Query: 78 FRKHHITSD-QGIPLGGIGAGSIGRSYRGEFQRFKLFHGICDDAPVLANQFSVFVSRP-- 134
F++ S G+PLGG+G GSIGR +RGEF+R++L + +A VLANQFSVFV+R
Sbjct: 87 FKRGTKPSACHGVPLGGMGGGSIGRGFRGEFRRWQLLPSVTSEAAVLANQFSVFVTRDKD 146
Query: 135 NGEK--FSSVLCPRSP--GVPKKNTDSGIESWDWNLKGENCTYHALFPRAWTVYDGEPDP 190
+GEK +SVL PR V KK I +WDWNL G+N TYH LFPRAWT+YDGEPDP
Sbjct: 147 HGEKKKIASVLSPRPSELDVGKKAQSPEICAWDWNLDGQNSTYHGLFPRAWTIYDGEPDP 206
Query: 191 ELRIVCRQISPFIPHNYKESSFPASVFTFTLSNSGQTSADVTLLFTWANSVAGDSGLSGH 250
+L+I CRQ+SPFIPH+Y+ESS P VF++ + N+G+ SA V+LLFTWANSV GD+ LSG
Sbjct: 207 DLKISCRQVSPFIPHDYRESSLPVCVFSYVIVNTGKESASVSLLFTWANSVGGDTHLSGG 266
Query: 251 HFNSKTMTKDGVHGLTLHHRTANGRPPVTFAVAAEETADVHVSECPCFLLSGNSKGITAK 310
H N + DGV G+ LHHRTA G+P V FA+AA + ADV VS CP FL+SG +TAK
Sbjct: 267 HSNKPFLEDDGVAGVVLHHRTAKGQPSVEFAIAARQNADVAVSVCPSFLVSGKGSTMTAK 326
Query: 311 DMWNEIKKHGSFD-HLDNDKTSPSEPGSSIGAAIAASLTIPSGSTRSVTFSLAWDCPEVK 369
+MWN++++ G FD + + PS SS+GAA++AS+TI + V F+L+WD PEVK
Sbjct: 327 EMWNQMREKGCFDTDTVCEPSLPSLVNSSLGAAVSASVTIAPHGKKVVDFALSWDIPEVK 386
Query: 370 FFE-KVYHRRYTKFYGTLGDSAARIAHDAILEHAKWECEIEAWQRPILEDKRFPEWYPIT 428
F + + YHRRYT+FYG+ GD+A R+ HDA+L + KWE EIE WQ P+L+D PEWY T
Sbjct: 387 FPKGRSYHRRYTQFYGSYGDAAPRLVHDALLNYPKWEAEIEKWQEPVLKDANLPEWYKFT 446
Query: 429 LFNELYYLNAGGTIW---------------------------------------TDGSPP 449
LFNELYYL AGGT+W T G+
Sbjct: 447 LFNELYYLVAGGTVWTAFVTPSFRSNGFTSRTNSESSITGAVEELVEMIPLNGGTSGNGG 506
Query: 450 MQSLATIRERKFSLDTPSKIHTP--------ASSDTALGTRLLENGEENIGQFLYLEGAE 501
+ ++ + E + + + A D GT LLE+GEEN+GQFLYLE E
Sbjct: 507 VDAILSETEEEIAEKLVKAMEAAILDKQINSAPRDFPYGTYLLEDGEENVGQFLYLESVE 566
Query: 502 YVMYNTYDVHFYSSFALVMLFPKLELSIQRDFAAAVMMHDPGTMKIMSDGKWVARKCLGA 561
YVM+ TYDVHFY+SFAL+ LFPKLELSIQRD+AAA + H+P +K ++ GKW +K LGA
Sbjct: 567 YVMWCTYDVHFYASFALISLFPKLELSIQRDYAAATLSHNPELVKFLAGGKWGIKKVLGA 626
Query: 562 VPHDIGLDDPWFEINSYNLFNSSRWKDLNSKFVLQVYRDFVATGDKNFARAVWPSVYIAM 621
VPHD+G+ DPW E+N+YN+ ++SRWKDLNSKFVLQ+YRD VAT D NFAR WP+VY AM
Sbjct: 627 VPHDLGIHDPWVEVNAYNIHDTSRWKDLNSKFVLQIYRDVVATEDLNFARVTWPAVYTAM 686
Query: 622 AYMEQFDKDGDGMIENEGFPDQTYDAWSANGVSAYCGGLWVAALQAASALANDVGDHASA 681
AYM++FD+D DGMIEN+GFPDQTYD W+ +G+SAYCGGLW+AALQAA+A+A V + +A
Sbjct: 687 AYMDRFDRDRDGMIENDGFPDQTYDTWTVHGISAYCGGLWLAALQAAAAMAEKVNEPNAA 746
Query: 682 SYFWVRYQKAKAVYD-SLWNGSYFNYDNSDGSSSTSIQADQLAGQWYARACGLLPIADEA 740
+YF +++ A+ VYD LWNGSYFNYD+ S+S SIQADQ+AGQWYA A GL P+ D+
Sbjct: 747 NYFLRKFRDARKVYDKKLWNGSYFNYDSGSSSNSNSIQADQMAGQWYAWASGLPPLFDDY 806
Query: 741 KVKKALTKIYDFNVLKVKGGMCGAMNGMQPDGRIDMSGLQAREIWPGVTYGLAASMIQEE 800
K + AL +IYD+NV+KVKGG GA+NGM P+GR+D + +Q+REIW GVTY AA+MI E
Sbjct: 807 KARSALQRIYDYNVMKVKGGKLGAVNGMHPNGRVDETAMQSREIWSGVTYAAAAAMIHEG 866
Query: 801 MVDMAFQTAAGVYEVAWSEDGLGYSFQTPESWNNNDEYRSLCYMRPLTIWAMQWALTKPK 860
M++ AF TA G+Y WS+ G GY FQTPE+W + +R+L YMRPL IWAMQWAL P
Sbjct: 867 MLEQAFTTAEGIYNAGWSDSGYGYWFQTPEAWTTDGHFRALTYMRPLAIWAMQWALYPPN 926
Query: 861 L----SRQEIKHEISDRDSSYLEQHAAFSKVASLL 891
R K E + +S +A F+KV+ L
Sbjct: 927 CVAAGPRIPSKEESAGGSTSSSYLYAGFAKVSEAL 961
>gi|168037390|ref|XP_001771187.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162677567|gb|EDQ64036.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 981
Score = 925 bits (2390), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 494/934 (52%), Positives = 645/934 (69%), Gaps = 52/934 (5%)
Query: 19 DPGQPPQLTWQRKLDSNVKVPTGFTLSFQEFRHLAHIGFRLYRYSKQEEAKGKVPVFNVF 78
DPG P L W+RKL+S+ T F ++F E + +G R++ Y K E A+G+ P + F
Sbjct: 54 DPGAPSALAWRRKLNSHADFLTEFRVTFMEAVRMITLGMRMWFYVKSERAQGRAPPIDPF 113
Query: 79 -RKHHITSDQGIPLGGIGAGSIGRSYRGEFQRFKLFHGICDDAPVLANQFSVFVSRPNG- 136
R+ ++ G+P GG+G GSIGR +RG+F+R++L G+C++APVLA+QFSVFV R G
Sbjct: 114 NRETKPSACHGVPCGGMGGGSIGRGFRGDFRRWQLIPGVCEEAPVLADQFSVFVKRDKGN 173
Query: 137 ---EKFSSVLCPRSPG-VPKKNTDSGIESWDWNLKGENCTYHALFPRAWTVYDGEPDPEL 192
+K +SVL P P + D+ + SWDWNL G+N TYHALFPRAWT+Y+GEPDP+L
Sbjct: 174 GVVKKDASVLYPGRPQELSDTKDDTSVSSWDWNLDGQNSTYHALFPRAWTIYEGEPDPDL 233
Query: 193 RIVCRQISPFIPHNYKESSFPASVFTFTLSNSGQTSADVTLLFTWANSVAGDSGLSGHHF 252
+I CRQ+SPFIPHNY+ESSFP VF++ + N+G+ +ADVTLLFTWANS+ G+S +G HF
Sbjct: 234 KISCRQVSPFIPHNYEESSFPCCVFSYVVMNTGKQAADVTLLFTWANSIGGNSATTGGHF 293
Query: 253 NSKTMTKDGVHGLTLHHRTANGRPPVTFAVAAEETADVHVSECPCFLLSGNSKGITAKDM 312
N T A G PVTFA+AA+E DV ++ CPCF L G +TAKDM
Sbjct: 294 NEPFETG------------AKGGRPVTFAIAAKEMRDVTLTTCPCFSLGGKEVEVTAKDM 341
Query: 313 WNEIKKHGSFD--HLDNDKTSPSEPGSSIGAAIAASLTIPSGSTRSVTFSLAWDCPEVKF 370
W E+K++GSFD + + ++P+ GS+IGAAIAA + +P G R V FSLAWD PEVKF
Sbjct: 342 WTEVKENGSFDKENWNFKPSAPTRQGSAIGAAIAARVEVPPGEKRDVVFSLAWDSPEVKF 401
Query: 371 FE-KVYHRRYTKFYGTLGDSAARIAHDAILEHAKWECEIEAWQRPILEDKRFPEWYPITL 429
+ K YHRRYT FYGT GD+A+++ DA+ ++ KWE IE WQ+P+L+D++ PEWY +TL
Sbjct: 402 LKGKSYHRRYTSFYGTAGDAASKLTRDALRDYRKWEIAIEDWQQPVLKDEKLPEWYRVTL 461
Query: 430 FNELYYLNAGGTIWTDGSPPMQSLATIRERKFSLDTPSKIHT-----------------P 472
FNELYYL AGGTIWT S T ER S +T ++ T
Sbjct: 462 FNELYYLVAGGTIWTAELELAASQET--ERLQSGETADQMQTETLEHLILKQMQTAMLGS 519
Query: 473 ASSDTALGTRLLENGEENIGQFLYLEGAEYVMYNTYDVHFYSSFALVMLFPKLELSIQRD 532
+ + A G LL+ GEEN+GQFLYLEG EY+M+NTYDVHFY+SFAL+ LFPKLEL+IQRD
Sbjct: 520 SEAGIATGPSLLQEGEENVGQFLYLEGIEYIMWNTYDVHFYASFALLALFPKLELAIQRD 579
Query: 533 FAAAVMMHDPGTMKIMSDGKWVARKCLGAVPHDIGLDDPWFEINSYNLFNSSRWKDLNSK 592
AAA + +P +K M+DG RK GAVPHD+G DPW E+N+YN+ ++S WKDLN K
Sbjct: 580 VAAATLSQNPEKVKYMADGGVGIRKVFGAVPHDLGQHDPWVEVNAYNIHDTSHWKDLNPK 639
Query: 593 FVLQVYRDFVATGDKNFARAVWPSVYIAMAYMEQFDKDGDGMIENEGFPDQTYDAWSANG 652
FVLQVYRD VATGD+ FA+AVWP+VY AMAYM+QFD+D DG+IEN+GFPDQTYD WS +G
Sbjct: 640 FVLQVYRDVVATGDRQFAKAVWPAVYAAMAYMDQFDRDRDGLIENDGFPDQTYDTWSVHG 699
Query: 653 VSAYCGGLWVAALQAASALANDVGDHASASYFWVRYQKAKAVYD-SLWNGSYFNYDNSDG 711
VSAYCGGLW+AALQAA+A+A+ V D +A+YF ++ +A+ VY+ LWNG YFNYD+
Sbjct: 700 VSAYCGGLWIAALQAAAAMADLVDDKDAATYFKGKFNQARDVYERKLWNGEYFNYDSGTS 759
Query: 712 SSSTSIQADQLAGQWYARACGLLPIADEAKVKKALTKIYDFNVLKVKGGMCGAMNGMQPD 771
S+S S+QADQLAGQWYA A GL P+ D+ K + AL K++DFNV+KVKGG GA NGM P+
Sbjct: 760 SNSNSVQADQLAGQWYAWASGLPPLFDDYKARSALQKVFDFNVMKVKGGKWGAANGMHPN 819
Query: 772 GRIDMSGLQAREIWPGVTYGLAASMIQEEMVDMAFQTAAGVYEVAWSEDGLGYSFQTPES 831
G++D + +Q+REIW GVTY +A+MI E MV+ AF A GV+ WS+ LGY FQTPE+
Sbjct: 820 GKVDETCMQSREIWTGVTYAASAAMIHEGMVEQAFTAAQGVFLAGWSD--LGYWFQTPEA 877
Query: 832 WNNNDEYRSLCYMRPLTIWAMQWALTKPKLSRQEIKHEISDRDSSYLEQHAAFSKVA--- 888
W + +RSL YMRPL IWAMQWAL PK + + DR S+ L H FS +A
Sbjct: 878 WTIDGYFRSLAYMRPLAIWAMQWALYPPKSIAEAPRVPSMDRGSA-LVTHDRFSSLANAL 936
Query: 889 ---SLLKLPKEEASKGFLKVVYDFTIGRVLCHNL 919
SL +L K GF V++ +G V+ +
Sbjct: 937 RSTSLPQLRKTHDPAGF--VLHIILLGGVVLSTI 968
>gi|413941636|gb|AFW74285.1| hypothetical protein ZEAMMB73_466373 [Zea mays]
Length = 966
Score = 923 bits (2385), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 506/983 (51%), Positives = 643/983 (65%), Gaps = 96/983 (9%)
Query: 7 EEGENAASMPKVD-PGQPP-QLTWQRKLDSNVKVPTGFTLSFQEFRHLAHIGFRLYRYSK 64
+E E A +P D P QPP TWQR+ D K F+++ + L
Sbjct: 4 QESEQAKVLPGTDCPQQPPASRTWQRRFDDEGKNVAMFSMTMSDLVEL------------ 51
Query: 65 QEEAKGKVPVFNVFRKHHITSDQGIPLGGIGAGSIGRSYRGEFQRFKLFHGICDDAPVLA 124
V V+++ RK +G+P+GGIG+GSIGRSYRG FQ+F++F ++ PVLA
Sbjct: 52 -------VAVYDLLRKWMDNCYRGVPVGGIGSGSIGRSYRGYFQQFQIFPVTNEEKPVLA 104
Query: 125 NQFSVFVSRPNGEKFSSVLCPRSPGVPKKNTDSGIESWDWNLKGENCTYHALFPRAWTVY 184
NQFS FVSRPNG K+S+VL + + K +GI SWDW LK + CTYHALFPR+WTVY
Sbjct: 105 NQFSAFVSRPNGNKYSTVLSAPTADLLKGVGKAGIGSWDWRLKEDRCTYHALFPRSWTVY 164
Query: 185 DGEPDPELRIVCRQISPFIPHNYKESSFPASVFTFT-------LSNSGQTSADVTLLFTW 237
DGEPDPE++I CRQISPFIPHNY+ESSFPA+VFTFT L +G+ ++T
Sbjct: 165 DGEPDPEIKITCRQISPFIPHNYRESSFPAAVFTFTVQSVLSVLFKAGEQLWELTCRCDA 224
Query: 238 A-------------NSVAGDSGLSGHHFNSKTMTKDGVHGLTLHH-------RTANGRPP 277
A +S+ G SGL + V+ L RTA G+PP
Sbjct: 225 ALHLGRSTTASVAFSSITGGSGLIFRGARCYRPCRWLVYTPVLETDADRRQCRTAGGQPP 284
Query: 278 VTFAVAAEETADVHVSECPCFLLSG----NSKGITAKDMWNEIKKHGSFDHLDNDKTSPS 333
VTFA+A+ ET V VS CP F ++ +S TA MW+EIKKHG+F H + +
Sbjct: 285 VTFAIASRETDGVRVSVCPSFTIASGGPSSSGEPTAAAMWDEIKKHGAFGHAGAGNGNAA 344
Query: 334 EPGSS-----IGAAIAASLTIPSGSTRSVTFSLAWDCPEVKF-FEKVYHRRYTKFYGTLG 387
+S +GAA+AAS +P+G+TR V+FSLAW CPEVKF Y+RRYT+FYGT
Sbjct: 345 RAAASKAGSSVGAAVAASTAVPAGATRVVSFSLAWACPEVKFPAGSTYYRRYTRFYGTDA 404
Query: 388 DSAAR-IAHDAILEHAKWECEIEAWQRPILEDKRFPEWYPITLFNELYYLNAGGTIWT-- 444
D+AA +AHDA+LEH WE +IE WQ+PIL D+R PEWYP+ LFNELYYLNAGGTIWT
Sbjct: 405 DAAAEHLAHDALLEHMNWESQIEEWQKPILHDQRLPEWYPVALFNELYYLNAGGTIWTVE 464
Query: 445 ----DGSPPMQS---LATIRERKFSLDTPSKIHTPAS-----------------SDTALG 480
DG P ++ + FS+DT + S S A G
Sbjct: 465 EMLTDGQAPKRAGFASSLTATEPFSIDTLLPVAAGGSAVDGVVRAVASATERSHSAAAFG 524
Query: 481 TRLLENGEENIGQFLYLEGAEYVMYNTYDVHFYSSFALVMLFPKLELSIQRDFAAAVMMH 540
LL GE+N+GQFLYLEG EY MYNTYDVHFY+SFAL+ LFP+LELS+QRDFA AV++H
Sbjct: 525 AALLRGGEDNVGQFLYLEGMEYNMYNTYDVHFYASFALLSLFPRLELSLQRDFARAVLLH 584
Query: 541 DPGTMKIMSDGKWVARKCLGAVPHDIGLDDPWFEINSYNLFNSSRWKDLNSKFVLQVYRD 600
DP + + DG+ VARK LGAVPHD+GL+DPW E+N+Y L +++RWKDLN KFVLQVYRD
Sbjct: 585 DPRPRRTL-DGRTVARKVLGAVPHDVGLNDPWVELNAYMLHDAARWKDLNPKFVLQVYRD 643
Query: 601 FVATGDKNFARAVWPSVYIAMAYMEQFDKDGDGMIENEGFPDQTYDAWSANGVSAYCGGL 660
VATGD FA AVWP+VY+AMAYM+QFD+DGDGM+ENEG PDQTYD WS +GVSAY GG+
Sbjct: 644 AVATGDAAFAEAVWPAVYVAMAYMDQFDRDGDGMVENEGVPDQTYDMWSVSGVSAYTGGV 703
Query: 661 WVAALQAASALANDVGDHASASYFWVRYQKAKAVYDS-LWNGSYFNYDNS-DGSSSTSIQ 718
WVAALQAA+ +A VGD + +YF R +A VYD+ LW G+YF YDNS G +S SI
Sbjct: 704 WVAALQAAAGMARVVGDRPAEAYFRARRDRAARVYDAELWTGAYFRYDNSGGGGASESIM 763
Query: 719 ADQLAGQWYARACGLLPIADEAKVKKALTKIYDFNVLKVKGGMCGAMNGMQPDGRIDMSG 778
ADQLAGQWYARACGL P+ ++AK + AL + D+NV++VKGG GA+NGM+PDG +D S
Sbjct: 764 ADQLAGQWYARACGLEPVVEDAKARSALATVLDYNVMRVKGGAIGAVNGMRPDGAVDASS 823
Query: 779 LQAREIWPGVTYGLAASMIQEEMVDMAFQTAAGVYEVAWSEDGLGYSFQTPESWNNNDEY 838
Q++E+WPGVTY +AA+M+ E M + AF+TA G ++ AW DG GY+FQTPE+W + Y
Sbjct: 824 TQSKEVWPGVTYAVAAAMVHEGMTEAAFRTAKGAHDAAWGNDGFGYAFQTPEAWTEDGGY 883
Query: 839 RSLCYMRPLTIWAMQWALTKPKLSRQ------EIKHEISDRDSSYLEQHAAFSKVASLLK 892
RSL YMRPL IWAMQWAL+ P+L R E S D++ A F KVAS+L+
Sbjct: 884 RSLHYMRPLGIWAMQWALSPPELHRDLRTPEARSSEEPSPADAAL--GQAQFDKVASMLR 941
Query: 893 LPKEEASKGFLKVVYDFTIGRVL 915
LP+E+ KG+L +Y+ VL
Sbjct: 942 LPEEQQPKGYLWAIYNLVRQIVL 964
>gi|357146604|ref|XP_003574050.1| PREDICTED: non-lysosomal glucosylceramidase-like [Brachypodium
distachyon]
Length = 962
Score = 922 bits (2382), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 473/923 (51%), Positives = 629/923 (68%), Gaps = 44/923 (4%)
Query: 9 GENAASMPKVDPGQPPQLTWQRKLDSNVKVPTGFTLSFQEFRHLAHIGFRLYRYSKQEEA 68
G +A + +D PP+ W+R+L+S+ + F+++F E + +G RL+ Y ++E +
Sbjct: 21 GRSALQLLDLDGAAPPEQAWRRRLNSHANILKEFSVTFMEAMKMMSLGVRLWSYVREEAS 80
Query: 69 KGKVPVFNVFRKHHI--TSDQGIPLGGIGAGSIGRSYRGEFQRFKLFHGICDDAPVLANQ 126
G+ + F + ++ QG+PLGG+G+GSI R +RGEF+ + + G+C+++PV+ NQ
Sbjct: 81 HGRKAPIDPFTRESCKPSASQGVPLGGMGSGSISRGFRGEFKNWHIIPGLCENSPVMENQ 140
Query: 127 FSVFVSRPNG-EKFSSVLCPRSPGVPKKNTDSGIESWDWNLKGENCTYHALFPRAWTVYD 185
FS+FVSR G +K SSVL P KK +DSGI SWDWNL G++ TYHALFPRAWTVYD
Sbjct: 141 FSIFVSRDGGNKKCSSVLAPGHHDGLKKYSDSGISSWDWNLSGQHSTYHALFPRAWTVYD 200
Query: 186 GEPDPELRIVCRQISPFIPHNYKESSFPASVFTFTLSNSGQTSADVTLLFTWANSVAGDS 245
GEPDP+L+I CRQISPFIPH+YKESS P SVF +TL N+G+ A V+LL TWANS+ G S
Sbjct: 201 GEPDPDLKISCRQISPFIPHDYKESSLPTSVFVYTLVNTGRDRAKVSLLMTWANSIGGFS 260
Query: 246 GLSGHHFNSKTMTKDGVHGLTLHHRTANGRPPVTFAVAAEETADVHVSECPCFLLSGNSK 305
SG HFN + +DGV G+ LHH+TA PPVTFA+AA ET +V+V+ P F LSG +
Sbjct: 261 HHSGGHFNEPFIGEDGVSGVLLHHKTAKDNPPVTFAIAACETQNVNVTVLPVFGLSGENH 320
Query: 306 GITAKDMWNEIKKHGSF--DHLDNDKTSPSEPGSSIGAAIAASLTIPSGSTRSVTFSLAW 363
++AKDMW+ +KK G F ++ + + PS PG ++ AA+ AS + +V F+L+W
Sbjct: 321 -VSAKDMWDIMKKDGHFNLENFNAGCSMPSSPGETLCAAVTASTWVEPHGRCTVAFALSW 379
Query: 364 DCPEVKFFEK-VYHRRYTKFYGTLGDSAARIAHDAILEHAKWECEIEAWQRPILEDKRFP 422
P+VKF + Y+RRYT+FYGT S+ + HDA+ ++ WE EIE WQ PIL D+R P
Sbjct: 380 SSPKVKFQKGCTYNRRYTEFYGTSERSSINLVHDALTKYRLWEEEIEKWQNPILRDERLP 439
Query: 423 EWYPITLFNELYYLNAGGTIWTDGSPPM-------------QSLATIRERKF-----SLD 464
EWY TLFNELY+L AGGT+WTDG PP S I++ K +L
Sbjct: 440 EWYKFTLFNELYFLVAGGTVWTDGQPPAIDEKTNPASNQQKHSKKPIKDTKSESVKDNLP 499
Query: 465 TPSKIHT----------PASSDTALGTRLLE--------NGEENIGQFLYLEGAEYVMYN 506
P+ P + G R+ E +G EN+G+FLYLEG EY+M+N
Sbjct: 500 RPTAEQVFNGDDLTNGGPQMPEQTNGLRVQEPVPCIHSKDGPENVGKFLYLEGVEYIMWN 559
Query: 507 TYDVHFYSSFALVMLFPKLELSIQRDFAAAVMMHDPGTMKIMSDGKWVARKCLGAVPHDI 566
TYDVHFY+SFAL+ LFPK+ELSIQRDFA AV+ D +K ++DG RK GAVPHD+
Sbjct: 560 TYDVHFYASFALLDLFPKIELSIQRDFADAVLYEDRRRVKFLADGTSGIRKVKGAVPHDL 619
Query: 567 GLDDPWFEINSYNLFNSSRWKDLNSKFVLQVYRDFVATGDKNFARAVWPSVYIAMAYMEQ 626
G DPW E+N+YN+ ++S+WKDLN KFVLQVYRDF ATGD F R VWP+V AM YM+Q
Sbjct: 620 GTHDPWHEMNAYNIHDTSKWKDLNPKFVLQVYRDFAATGDMTFGRDVWPAVCAAMDYMDQ 679
Query: 627 FDKDGDGMIENEGFPDQTYDAWSANGVSAYCGGLWVAALQAASALANDVGDHASASYFWV 686
FD+DGDG+IEN+GFPDQTYDAW+ +G+SAYCG LW+AALQAA+ +A+ +GD A + +
Sbjct: 680 FDRDGDGLIENDGFPDQTYDAWTVHGISAYCGCLWLAALQAAATMAHRLGDRPYAEKYKL 739
Query: 687 RYQKAKAVYDS-LWNGSYFNYDNSDGSSSTSIQADQLAGQWYARACGLLPIADEAKVKKA 745
++ KAKAVY++ LWNGSYFNYD+ S+S SIQADQLAGQWYA + GL PI DE K++ A
Sbjct: 740 KFIKAKAVYEAKLWNGSYFNYDSGTSSNSRSIQADQLAGQWYAASSGLPPIFDEHKIRSA 799
Query: 746 LTKIYDFNVLKVKGGMCGAMNGMQPDGRIDMSGLQAREIWPGVTYGLAASMIQEEMVDMA 805
L KI++FNV+KVKGG GA+NGM P G++D + +Q+REIW GVTYG+AA+M+ M
Sbjct: 800 LQKIFEFNVMKVKGGRMGAVNGMTPKGKVDETCMQSREIWTGVTYGVAANMLLHGMEHQG 859
Query: 806 FQTAAGVYEVAWSEDGLGYSFQTPESWNNNDEYRSLCYMRPLTIWAMQWALTKPKLSRQE 865
F TA G++ WSE+G GY FQTPE W + YRSL YMRPL IWAMQWAL+ PK +
Sbjct: 860 FITAEGIFLAGWSEEGYGYWFQTPEGWTTDGHYRSLIYMRPLAIWAMQWALSPPKAILEA 919
Query: 866 IKHEISDRDSSYLEQHAAFSKVA 888
K + DR + A S+++
Sbjct: 920 PKVNLMDRIHVSPQAARAISEIS 942
>gi|168014916|ref|XP_001759997.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162688747|gb|EDQ75122.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 970
Score = 919 bits (2375), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 486/933 (52%), Positives = 634/933 (67%), Gaps = 55/933 (5%)
Query: 19 DPGQPPQLTWQRKLDSNVKVPTGFTLSFQEFRHLAHIGFRLYRYSKQEEAKGKVPVFNVF 78
DPG PP W+RKL+S+ +V F ++F E + +G R++ Y + E+++G++P + F
Sbjct: 46 DPGAPPHQAWRRKLNSHAEVLKEFRVTFMEGFKMLRLGMRMWTYVQSEKSEGRLPPIDPF 105
Query: 79 RKHHITSD-QGIPLGGIGAGSIGRSYRGEFQRFKLFHGICDDAPVLANQFSVFVSRPNGE 137
R+ S G+P GG+G GSIGR +RGEF+R+++ G+CD+APVLANQFSV VS GE
Sbjct: 106 RRETKPSACHGVPCGGMGGGSIGRGFRGEFRRWQIVPGVCDEAPVLANQFSVHVSLSGGE 165
Query: 138 K-FSSVLCPRSPGVPK--KNTDSGIESWDWNLKGENCTYHALFPRAWTVYDGEPDPELRI 194
K SSVL P P K KN ++GI SWDW L G+N TYHALFPRAWT+Y+GEPDP L+I
Sbjct: 166 KAVSSVLYPGRPRELKGLKNDENGISSWDWKLDGQNTTYHALFPRAWTIYEGEPDPALKI 225
Query: 195 VCRQISPFIPHNYKESSFPASVFTFTLSNSGQTSADVTLLFTWANSVAGDSGLSGHHFNS 254
CRQISPFIPHNYKESS+PA VF++ N+G+ ADV LLFTWANS+ +G +G H N
Sbjct: 226 SCRQISPFIPHNYKESSYPACVFSYVAVNTGKEVADVNLLFTWANSIGSTTGSTGGHINE 285
Query: 255 KTMTKDGVHGLTLHHRTANGRPPVTFAVAAEETADVHVSECPCFLLSGNSKGITAKDMWN 314
T + G P+TFA+AA+++ DV +S CPCF +SG + ITAKD+W
Sbjct: 286 TFTT------------SVKGGQPLTFAIAAKQSPDVSLSACPCFQVSGEGQSITAKDLWG 333
Query: 315 EIKKHGSF--DHLDNDKTS-PSEPGSSIGAAIAASLTIPSGSTRSVTFSLAWDCPEVKFF 371
+K+HG D + TS PS+PGS IGAAIAA + IP R++ F LAWD P VKF
Sbjct: 334 RMKEHGRLEVDDWNTINTSLPSKPGSVIGAAIAAHVVIPPNEKRTIDFVLAWDSPTVKFL 393
Query: 372 E-KVYHRRYTKFYGTLGDSAARIAHDAILEHAKWECEIEAWQRPILEDKRFPEWYPITLF 430
+ + Y RRYT FY G++A RIAH+A+ + WE EIE WQRPILED++ PEWY +TLF
Sbjct: 394 KGRSYLRRYTTFYDADGNAAPRIAHNALCGYRLWEEEIEKWQRPILEDEKLPEWYRVTLF 453
Query: 431 NELYYLNAGGTIWT--------DGSP------------PMQS-----LATIRERKFSLDT 465
NELYYL AGGT+WT GSP P + LA +R D
Sbjct: 454 NELYYLVAGGTVWTASSANTSAPGSPLHGDLSLGGKVVPFSTRRDIDLAALRRDCKDGDV 513
Query: 466 PSKIH----TPASSDTALGTRLLENGEENIGQFLYLEGAEYVMYNTYDVHFYSSFALVML 521
+ + + +S G LL+ EEN+GQFLYLEG EY+M+NTYDVHFY+SFAL+ L
Sbjct: 514 VAALEGSNKSSFNSRLPSGLALLQAEEENVGQFLYLEGIEYIMWNTYDVHFYASFALLSL 573
Query: 522 FPKLELSIQRDFAAAVMMHDPGTMKIMSDGKWVARKCLGAVPHDIGLDDPWFEINSYNLF 581
FPKLEL+IQRDFAA + HDP K M+ GK +K G+VPHD+G DPW E+N+YN+
Sbjct: 574 FPKLELAIQRDFAAGCLAHDPRRTKFMAGGKKGIKKVFGSVPHDLGQHDPWAEVNAYNIH 633
Query: 582 NSSRWKDLNSKFVLQVYRDFVATGDKNFARAVWPSVYIAMAYMEQFDKDGDGMIENEGFP 641
++SRWKDLN+KFVLQVYRD VATGDK FA+AVWP+VY AMAY++QFD+D DG+IEN+GFP
Sbjct: 634 DTSRWKDLNAKFVLQVYRDVVATGDKVFAKAVWPAVYAAMAYLDQFDRDRDGLIENDGFP 693
Query: 642 DQTYDAWSANGVSAYCGGLWVAALQAASALANDVGDHASASYFWVRYQKAKAVY-DSLWN 700
DQTYD W+ +GVSAYCGGLW+AALQAA A+A + + +A YF ++ +A+ Y LWN
Sbjct: 694 DQTYDTWTVHGVSAYCGGLWIAALQAAEAMAVLLEEKNAARYFQSKFIQAREAYLKKLWN 753
Query: 701 GSYFNYDNSDGSSSTSIQADQLAGQWYARACGLLPIADEAKVKKALTKIYDFNVLKVKGG 760
GSYF+YD+ + S+S SIQADQ+AGQWYA A GL P+ D+AK + AL KI+DFNV+KVK G
Sbjct: 754 GSYFSYDSGNSSNSKSIQADQMAGQWYAWASGLPPLFDDAKARSALQKIFDFNVMKVKNG 813
Query: 761 MCGAMNGMQPDGRIDMSGLQAREIWPGVTYGLAASMIQEEMVDMAFQTAAGVYEVAWSED 820
GA+NGM P+G++D + +Q+RE+W GVTY LAA+MI E M+D AF+ A GV+ WS+
Sbjct: 814 HLGAVNGMHPNGKVDETCMQSREVWTGVTYALAAAMIHEGMLDQAFRAAEGVFIAGWSD- 872
Query: 821 GLGYSFQTPESWNNNDEYRSLCYMRPLTIWAMQWALTKPKLSRQEIKHEISDRDSSYLEQ 880
LGY FQTPE+W + +RSL YMRPL IWAMQWA+ P+ + DR S +
Sbjct: 873 -LGYWFQTPEAWTMDGYFRSLAYMRPLAIWAMQWAIDPPETIANAPRIPTVDRLPSECDT 931
Query: 881 HAAFSKVASLLKLPKEEASKGFLKVVYDFTIGR 913
+ FS + S L+ P + + Y+F G+
Sbjct: 932 -SGFSSMQSTLRNPLPQLRP--FRRAYNFLRGK 961
>gi|218184729|gb|EEC67156.1| hypothetical protein OsI_34005 [Oryza sativa Indica Group]
Length = 974
Score = 917 bits (2369), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 470/920 (51%), Positives = 622/920 (67%), Gaps = 56/920 (6%)
Query: 9 GENAASMPKVDPGQPPQLTWQRKLDSNVKVPTGFTLSFQEFRHLAHIGFRLYRYSKQEEA 68
G A + D G PP+ W+R+L+S+ + F+++F E + +G RL+ Y ++E +
Sbjct: 21 GRTALQLLDFDGGSPPEQAWRRRLNSHANLLKEFSVTFMEAMRMMSLGLRLWSYVREEAS 80
Query: 69 KGKVPVFNVFRKHHI--TSDQGIPLGGIGAGSIGRSYRGEFQRFKLFHGICDDAPVLANQ 126
G+ + F K ++ QG+PLGG+G+GSI R +RGEF+ + + G+C+ +PV+ NQ
Sbjct: 81 HGRKAPIDPFTKEKCKPSASQGVPLGGMGSGSISRGFRGEFKNWHIIPGLCETSPVMENQ 140
Query: 127 FSVFVSRPNG-EKFSSVLCPRSPGVPKKNTDSGIESWDWNLKGENCTYHALFPRAWTVYD 185
FS+FVSR G +K+SSVL P KK DSGI SWDWNL G++ TYHALFPRAWTVYD
Sbjct: 141 FSIFVSRDGGNKKYSSVLSPGHHEGLKKCNDSGISSWDWNLSGQHSTYHALFPRAWTVYD 200
Query: 186 GEPDPELRIVCRQISPFIPHNYKESSFPASVFTFTLSNSGQTSADVTLLFTWANSVAGDS 245
GEPDP+L+I CRQISPFIPH+YK+SS P SVF +TL N+G+ A V+LL TWANS+ G S
Sbjct: 201 GEPDPDLKISCRQISPFIPHDYKDSSLPTSVFVYTLVNTGKDRAKVSLLMTWANSIGGFS 260
Query: 246 GLSGHHFNSKTMTKDGVHGLTLHHRTANGRPPVTFAVAAEETADVHVSECPCFLLSGNSK 305
SG HFN + +DGV G+ LHH+TA PPVTFA+AA ET +V+V+ P F LSG
Sbjct: 261 HHSGGHFNEPFIAEDGVSGVLLHHKTAKDNPPVTFAIAACETQNVNVTVLPVFGLSGEGH 320
Query: 306 GITAKDMWNEIKKHGSFD--HLDNDKTSPSEPGSSIGAAIAASLTIPSGSTRSVTFSLAW 363
+AK MW+ +K++G FD + + + PS G ++ AA++AS + +V F LAW
Sbjct: 321 D-SAKQMWDRMKQNGHFDRENFEAGTSMPSSSGETLCAAVSASTWVEPHGRCTVVFGLAW 379
Query: 364 DCPEVKFFEK-VYHRRYTKFYGTLGDSAARIAHDAILEHAKWECEIEAWQRPILEDKRFP 422
P++KF + Y+RRYT+FYGT SA + HDA+ ++ WE EIE WQ PIL+++R P
Sbjct: 380 SSPKIKFQKGCTYNRRYTEFYGTSERSAVNLVHDALTKYRIWEEEIEKWQNPILKNERLP 439
Query: 423 EWYPITLFNELYYLNAGGTIWTDGSPPM-------------QSLATIRERKF-------- 461
EWY TLFNELY+L AGGT+WTDG PP+ S R+ K
Sbjct: 440 EWYKFTLFNELYFLVAGGTVWTDGQPPVIDEKPSPGSNQQKSSKRGTRDTKQESVKDNHV 499
Query: 462 -------------------SLDTPSKIHTP--ASSDTALGTR------LLENGEENIGQF 494
S+ + +H P A + LG++ L +NG EN+G+F
Sbjct: 500 KLTAEQVTNGGDLANGEEQSVSKYAAVHGPQMAKATNGLGSQEPIPYLLSKNGPENVGKF 559
Query: 495 LYLEGAEYVMYNTYDVHFYSSFALVMLFPKLELSIQRDFAAAVMMHDPGTMKIMSDGKWV 554
LYLEG EY+M+ TYDVHFY+SFAL+ LFPK+ELSIQRDFA AV+ D MK ++DG
Sbjct: 560 LYLEGVEYIMWCTYDVHFYASFALLDLFPKIELSIQRDFANAVLYEDRRRMKFLADGTSG 619
Query: 555 ARKCLGAVPHDIGLDDPWFEINSYNLFNSSRWKDLNSKFVLQVYRDFVATGDKNFARAVW 614
RK GAVPHD+G DPW E+N+YN+ ++S+WKDLN KFVLQVYRDF ATGD +F R VW
Sbjct: 620 IRKVKGAVPHDLGTHDPWHEMNAYNIHDTSKWKDLNPKFVLQVYRDFAATGDMSFGRDVW 679
Query: 615 PSVYIAMAYMEQFDKDGDGMIENEGFPDQTYDAWSANGVSAYCGGLWVAALQAASALAND 674
P+V AM YM QFD+DGDG+IEN+GFPDQTYDAW+ +G+SAYCGGLW+AALQAA+ +A+
Sbjct: 680 PAVCAAMDYMNQFDRDGDGLIENDGFPDQTYDAWTVHGISAYCGGLWLAALQAAATMAHR 739
Query: 675 VGDHASASYFWVRYQKAKAVYDS-LWNGSYFNYDNSDGSSSTSIQADQLAGQWYARACGL 733
+GD A + +++ +AKAVY++ LWNGSYFNYD+ S+S SIQADQLAGQWYA + GL
Sbjct: 740 LGDRPFAEKYKLKFIQAKAVYEAKLWNGSYFNYDSGTSSNSRSIQADQLAGQWYAASSGL 799
Query: 734 LPIADEAKVKKALTKIYDFNVLKVKGGMCGAMNGMQPDGRIDMSGLQAREIWPGVTYGLA 793
P+ DE K++ AL KI++FNV+KVKGG GA+NGM P+G++D + +Q+REIW GVTYG+A
Sbjct: 800 PPLFDENKIRSALQKIFEFNVMKVKGGRLGAVNGMTPNGKVDETCMQSREIWTGVTYGVA 859
Query: 794 ASMIQEEMVDMAFQTAAGVYEVAWSEDGLGYSFQTPESWNNNDEYRSLCYMRPLTIWAMQ 853
A+M+ M F TA G++ WSE+G GY FQTPE W + YRSL YMRPL IWAMQ
Sbjct: 860 ANMLLHGMEHQGFTTAEGIFIAGWSEEGYGYWFQTPEGWTIDGHYRSLIYMRPLAIWAMQ 919
Query: 854 WALTKPKLSRQEIKHEISDR 873
WA + PK K + DR
Sbjct: 920 WARSPPKAILDAPKVNLMDR 939
>gi|326516802|dbj|BAJ96393.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 970
Score = 916 bits (2367), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 468/916 (51%), Positives = 624/916 (68%), Gaps = 52/916 (5%)
Query: 9 GENAASMPKVDPGQPPQLTWQRKLDSNVKVPTGFTLSFQEFRHLAHIGFRLYRYSKQEEA 68
G A + +D G PP+ W+R+L+S+ + F+++F E + +G RL+ Y ++E +
Sbjct: 21 GRTALQLLDLDGGSPPEQAWRRRLNSHANILKEFSVTFMEAMKMMTLGVRLWSYVREEAS 80
Query: 69 KGKVPVFNVFRKHHI--TSDQGIPLGGIGAGSIGRSYRGEFQRFKLFHGICDDAPVLANQ 126
G+ + F + ++ QG+PLGG+G+GSI R +RGEF+ + + G+C+++PV+ NQ
Sbjct: 81 HGRKAPIDPFTRERCKPSASQGVPLGGMGSGSISRGFRGEFKNWHIIPGLCENSPVMENQ 140
Query: 127 FSVFVSRPNG-EKFSSVLCPRSPGVPKKNTDSGIESWDWNLKGENCTYHALFPRAWTVYD 185
FS+FVSR +G +K+SSVL P KK DSGI SWDWNL G++ TYHALFPRAWT+YD
Sbjct: 141 FSIFVSRDSGNKKYSSVLAPGHHEGLKKCNDSGISSWDWNLSGQHSTYHALFPRAWTIYD 200
Query: 186 GEPDPELRIVCRQISPFIPHNYKESSFPASVFTFTLSNSGQTSADVTLLFTWANSVAGDS 245
GEPDP+L+I CRQISPFIPH+YK+SS P SVF +TL N+G+ A V+LL TWANS+ G S
Sbjct: 201 GEPDPDLKISCRQISPFIPHDYKDSSLPTSVFVYTLVNTGRDRAKVSLLMTWANSIGGFS 260
Query: 246 GLSGHHFNSKTMTKDGVHGLTLHHRTANGRPPVTFAVAAEETADVHVSECPCFLLSGNSK 305
SG HFN + DGV G+ LHH+TA PPVTF++AA ET +V V+ P F LSG +
Sbjct: 261 HHSGGHFNEPFIGDDGVSGVLLHHKTAKDNPPVTFSIAACETQNVSVTVLPVFGLSGENH 320
Query: 306 GITAKDMWNEIKKHGSF--DHLDNDKTSPSEPGSSIGAAIAASLTIPSGSTRSVTFSLAW 363
++AK+MW+ + K G F ++ + + PS G ++ AA++AS + +V F+LAW
Sbjct: 321 -VSAKEMWDTMSKDGHFSRENFNAGCSMPSSSGETLCAAVSASTWVEPHGRCTVAFALAW 379
Query: 364 DCPEVKFFEK-VYHRRYTKFYGTLGDSAA-RIAHDAILEHAKWECEIEAWQRPILEDKRF 421
P+VKF + Y+RRYT+FYGT S++ + HDA+ ++ WE EIE WQ PIL+D++
Sbjct: 380 SSPKVKFQKGCTYNRRYTEFYGTSERSSSINLVHDALTKYRLWEEEIEKWQDPILKDEKL 439
Query: 422 PEWYPITLFNELYYLNAGGTIWTDGSPPMQSLAT-IRERKFS------------------ 462
PEWY TLFNELY+L AGGT+WTDG PP S A+ + K+S
Sbjct: 440 PEWYKFTLFNELYFLVAGGTVWTDGQPPAISEASPAYQHKYSKKGAKSESVKDNHVKPAA 499
Query: 463 ----------------LDTPSKIHTPASSDTALGTRLLE--------NGEENIGQFLYLE 498
+ T +H + G L E +G EN+G+FLYLE
Sbjct: 500 EQVSDGDDLPNGEERSVSTYVAVHGSQMPEQTSGLGLQEPIPYLLSKDGPENVGKFLYLE 559
Query: 499 GAEYVMYNTYDVHFYSSFALVMLFPKLELSIQRDFAAAVMMHDPGTMKIMSDGKWVARKC 558
G EY+M+NTYDVHFY+SFAL+ LFPK+ELSIQRDFA AV+ D +K ++DG RK
Sbjct: 560 GVEYIMWNTYDVHFYASFALLDLFPKIELSIQRDFADAVLYEDRRRVKFLADGTSGIRKV 619
Query: 559 LGAVPHDIGLDDPWFEINSYNLFNSSRWKDLNSKFVLQVYRDFVATGDKNFARAVWPSVY 618
GAVPHD+G DPW E+N+YN+ ++S+WKDLN KFVLQVYRDF ATGD F R VWP+V
Sbjct: 620 KGAVPHDLGTHDPWHEMNAYNIHDTSKWKDLNPKFVLQVYRDFAATGDMTFGRDVWPAVS 679
Query: 619 IAMAYMEQFDKDGDGMIENEGFPDQTYDAWSANGVSAYCGGLWVAALQAASALANDVGDH 678
AM YM+QFD+DGDG+IEN+GFPDQTYDAW+ +G+SAYCGGLW+AALQAA+ +A+ +GD
Sbjct: 680 AAMDYMDQFDRDGDGLIENDGFPDQTYDAWTVHGISAYCGGLWLAALQAAATMAHRLGDR 739
Query: 679 ASASYFWVRYQKAKAVYDS-LWNGSYFNYDNSDGSSSTSIQADQLAGQWYARACGLLPIA 737
A + +++ KAKAVY++ LWNGSYFNYD+ S+S SIQADQLAGQWYA + GL PI
Sbjct: 740 PYAEKYKLKFMKAKAVYEAKLWNGSYFNYDSGTSSNSRSIQADQLAGQWYAASSGLPPIF 799
Query: 738 DEAKVKKALTKIYDFNVLKVKGGMCGAMNGMQPDGRIDMSGLQAREIWPGVTYGLAASMI 797
DE K++ AL KI++FNV+KVKGG GA+NGM P G++D + +Q+REIW GVTYG+AA+M+
Sbjct: 800 DEHKIRSALQKIFEFNVMKVKGGRMGAVNGMTPKGKVDETCMQSREIWTGVTYGVAANML 859
Query: 798 QEEMVDMAFQTAAGVYEVAWSEDGLGYSFQTPESWNNNDEYRSLCYMRPLTIWAMQWALT 857
M F TA G++ WSEDG GY FQTPE W + YRSL YMRPL IWAMQWAL+
Sbjct: 860 LHGMEHQGFITAEGIFLAGWSEDGYGYWFQTPEGWTTDGHYRSLVYMRPLAIWAMQWALS 919
Query: 858 KPKLSRQEIKHEISDR 873
PK + K + DR
Sbjct: 920 PPKAILEAPKVNLMDR 935
>gi|115482480|ref|NP_001064833.1| Os10g0473400 [Oryza sativa Japonica Group]
gi|110289241|gb|AAP54244.2| expressed protein [Oryza sativa Japonica Group]
gi|113639442|dbj|BAF26747.1| Os10g0473400 [Oryza sativa Japonica Group]
gi|215695411|dbj|BAG90602.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 974
Score = 914 bits (2362), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 468/920 (50%), Positives = 621/920 (67%), Gaps = 56/920 (6%)
Query: 9 GENAASMPKVDPGQPPQLTWQRKLDSNVKVPTGFTLSFQEFRHLAHIGFRLYRYSKQEEA 68
G A + D G PP+ W+R+L+S+ + F+++F E + +G RL+ Y ++E +
Sbjct: 21 GRTALQLLDFDGGSPPEQAWRRRLNSHANLLKEFSVTFMEAMRMMSLGLRLWSYVREEAS 80
Query: 69 KGKVPVFNVFRKHHI--TSDQGIPLGGIGAGSIGRSYRGEFQRFKLFHGICDDAPVLANQ 126
G+ + F K ++ QG+PLGG+G+GSI R +RGEF+ + + G+C+ +PV+ NQ
Sbjct: 81 HGRKAPIDPFTKEKCKPSASQGVPLGGMGSGSISRGFRGEFKNWHIIPGLCETSPVMENQ 140
Query: 127 FSVFVSRPNG-EKFSSVLCPRSPGVPKKNTDSGIESWDWNLKGENCTYHALFPRAWTVYD 185
FS+FVSR G +K+SSVL P KK DSGI SWDWNL G++ TYHALFPRAWTVYD
Sbjct: 141 FSIFVSRDGGNKKYSSVLSPGHHEGLKKCNDSGISSWDWNLSGQHSTYHALFPRAWTVYD 200
Query: 186 GEPDPELRIVCRQISPFIPHNYKESSFPASVFTFTLSNSGQTSADVTLLFTWANSVAGDS 245
GEPDP+L+I CRQISPFIPH+YK+SS P SVF +TL N+G+ A V+LL TWANS+ G S
Sbjct: 201 GEPDPDLKISCRQISPFIPHDYKDSSLPTSVFVYTLVNTGKDRAKVSLLMTWANSIGGFS 260
Query: 246 GLSGHHFNSKTMTKDGVHGLTLHHRTANGRPPVTFAVAAEETADVHVSECPCFLLSGNSK 305
SG HFN + +DGV G+ LHH+TA PPVTFA+AA ET +V+V+ P F LSG
Sbjct: 261 HHSGGHFNEPFIAEDGVSGVLLHHKTAKDNPPVTFAIAACETQNVNVTVLPVFGLSGEGH 320
Query: 306 GITAKDMWNEIKKHGSFD--HLDNDKTSPSEPGSSIGAAIAASLTIPSGSTRSVTFSLAW 363
+AK MW+ +K++G FD + + + PS G ++ AA++AS + +V F LAW
Sbjct: 321 D-SAKQMWDRMKQNGHFDRENFEAGTSMPSSSGETLCAAVSASTWVEPHGRCTVVFGLAW 379
Query: 364 DCPEVKFFEK-VYHRRYTKFYGTLGDSAARIAHDAILEHAKWECEIEAWQRPILEDKRFP 422
P++KF + Y+RRYT+FYGT SA + HDA+ ++ WE EIE WQ PIL++++ P
Sbjct: 380 SSPKIKFQKGCTYNRRYTEFYGTSERSAVNLVHDALTKYRIWEEEIEKWQNPILKNEKLP 439
Query: 423 EWYPITLFNELYYLNAGGTIWTDGSPPM-------------QSLATIRERKF-------- 461
EWY TLFNELY+L AGGT+WTDG PP+ S R+ K
Sbjct: 440 EWYKFTLFNELYFLVAGGTVWTDGQPPVIDEKPSPGSNQQKSSKRGTRDTKQESVKDNHV 499
Query: 462 -------------------SLDTPSKIHTP--ASSDTALGTR------LLENGEENIGQF 494
S+ + +H P A + LG++ L +NG EN+G+F
Sbjct: 500 KLTAEQVTNGGDLANGEEQSVSKYAAVHGPQMAKATNGLGSQEPIPYLLSKNGPENVGKF 559
Query: 495 LYLEGAEYVMYNTYDVHFYSSFALVMLFPKLELSIQRDFAAAVMMHDPGTMKIMSDGKWV 554
LYLEG EY+M+ TYDVHFY+SFAL+ LFPK+ELSIQRDFA AV+ D MK ++DG
Sbjct: 560 LYLEGVEYIMWCTYDVHFYASFALLDLFPKIELSIQRDFANAVLYEDRRRMKFLADGTSG 619
Query: 555 ARKCLGAVPHDIGLDDPWFEINSYNLFNSSRWKDLNSKFVLQVYRDFVATGDKNFARAVW 614
RK GAVPHD+G DPW E+N+YN+ ++S+WKDLN KFVLQVYRDF ATGD +F R VW
Sbjct: 620 IRKVKGAVPHDLGTHDPWHEMNAYNIHDTSKWKDLNPKFVLQVYRDFAATGDMSFGRDVW 679
Query: 615 PSVYIAMAYMEQFDKDGDGMIENEGFPDQTYDAWSANGVSAYCGGLWVAALQAASALAND 674
P+V M YM QFD+DGDG+IEN+GFPDQTYDAW+ +G+SAYCGGLW+AALQAA+ +A+
Sbjct: 680 PAVCAVMDYMNQFDRDGDGLIENDGFPDQTYDAWTVHGISAYCGGLWLAALQAAATMAHR 739
Query: 675 VGDHASASYFWVRYQKAKAVYDS-LWNGSYFNYDNSDGSSSTSIQADQLAGQWYARACGL 733
+GD A + +++ +AKAVY++ LWNGSYFNYD+ S+S SIQADQLAGQWYA + GL
Sbjct: 740 LGDRPFAEKYKLKFIQAKAVYEAKLWNGSYFNYDSGTSSNSRSIQADQLAGQWYAASSGL 799
Query: 734 LPIADEAKVKKALTKIYDFNVLKVKGGMCGAMNGMQPDGRIDMSGLQAREIWPGVTYGLA 793
P+ DE K++ AL KI++FNV+KVKGG GA+NGM P+G++D + +Q+REIW GVTYG+A
Sbjct: 800 PPLFDENKIRSALQKIFEFNVMKVKGGRLGAVNGMTPNGKVDETCMQSREIWTGVTYGVA 859
Query: 794 ASMIQEEMVDMAFQTAAGVYEVAWSEDGLGYSFQTPESWNNNDEYRSLCYMRPLTIWAMQ 853
A+M+ M F TA G++ WSE+G GY FQTPE W + YRSL YMRPL IWAMQ
Sbjct: 860 ANMLLHGMEHQGFTTAEGIFIAGWSEEGYGYWFQTPEGWTIDGHYRSLIYMRPLAIWAMQ 919
Query: 854 WALTKPKLSRQEIKHEISDR 873
WA + PK K + DR
Sbjct: 920 WARSPPKAILDAPKVNLMDR 939
>gi|302773445|ref|XP_002970140.1| hypothetical protein SELMODRAFT_92385 [Selaginella moellendorffii]
gi|300162651|gb|EFJ29264.1| hypothetical protein SELMODRAFT_92385 [Selaginella moellendorffii]
Length = 983
Score = 909 bits (2349), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 481/941 (51%), Positives = 630/941 (66%), Gaps = 72/941 (7%)
Query: 19 DPGQPPQLTWQRKLDSNVKVPTGFTLSFQE-FRHLAHIGFRLYRYSKQEEAKGKVPVFNV 77
DPG PP W+RKL+ + T F ++F E + + +G RL+++ + E ++G+ P +
Sbjct: 27 DPGAPPAHAWRRKLNCHANFLTEFRVTFTEALKLVLGLGLRLWKFIQAERSQGRTPPIDP 86
Query: 78 FRKHHITSD-QGIPLGGIGAGSIGRSYRGEFQRFKLFHGICDDAPVLANQFSVFVSRP-- 134
F++ S G+PLGG+G GSIGR +RGEF+R++L + +A VLANQFSVFV+R
Sbjct: 87 FKRGTKPSACHGVPLGGMGGGSIGRGFRGEFRRWQLLPSVTSEAAVLANQFSVFVTRDKD 146
Query: 135 NGEK--FSSVLCPRSP--GVPKKNTDSGIESWDWNLKGENCTYHALFPRAWTVYDGEPDP 190
+GEK +SVL PR V KK I +WDWNL G+N TYH LFPRAWT+YDGEPDP
Sbjct: 147 HGEKKKIASVLSPRPSELDVGKKAQSPEICAWDWNLDGQNSTYHGLFPRAWTIYDGEPDP 206
Query: 191 ELRIVCRQISPFIPHNYKESSFPASVFTFTLSNSGQTSADVTLLFTWANSVAGDSGLSGH 250
+L+I CRQ+SPFIPH+Y+ESS P VF++ + N+G+ SA V+LLFTWANSV GDS LSG
Sbjct: 207 DLKISCRQVSPFIPHDYRESSLPVCVFSYVIVNTGKESASVSLLFTWANSVGGDSHLSGG 266
Query: 251 HFNSKTMTKDGVHGLTLHHRTANGRPPVTFAVAAEETADVHVSECPCFLLSGNSKGITAK 310
H N + DGV G+ LHHRTA G+P V FA+AA + ADV VS CP FL+SG +TAK
Sbjct: 267 HSNKPFLEDDGVAGVVLHHRTAKGQPSVEFAIAARQNADVAVSVCPSFLVSGKGSTMTAK 326
Query: 311 DMWNEIKKHGSFD-HLDNDKTSPSEPGSSIGAAIAASLTIPSGSTRSVTFSLAWDCPEVK 369
+MWN++++ G FD + + PS SS+GAA++AS+TI + + F+L+WD PEVK
Sbjct: 327 EMWNQMREKGCFDTDTVCEPSLPSLVNSSLGAAVSASVTIAPHGKKVIDFALSWDIPEVK 386
Query: 370 FFE-KVYHRRYTKFYGTLGDSAARIAHDAILEHAKWECEIEAWQRPILEDKRFPEWYPIT 428
F + + YHRRYT+FYG+ GD+A R+ HDA+L + KWE EIE WQ P+L+D PEWY T
Sbjct: 387 FPKGRSYHRRYTQFYGSYGDAAPRLVHDALLNYPKWEAEIEKWQEPVLKDANLPEWYRFT 446
Query: 429 LFNELYYLNAGGTIWTDGSPPMQSLATIRERKFSLDTPSKIHTPASSDTALGT------- 481
LFNELYYL AGGT+WT P + R F+ T S+ + + +
Sbjct: 447 LFNELYYLVAGGTVWTAFVTP-----SFRSNGFTSRTNSESSITGAVEELVEMIPLNGGT 501
Query: 482 -------RLLENGEENIGQ--------------------------------------FLY 496
+L EE I + FLY
Sbjct: 502 TGNGGVDAILSETEEEIAEKLVKAMEAAILDKQINSAPRDFPYGTYLLEDGEENVGQFLY 561
Query: 497 LEGAEYVMYNTYDVHFYSSFALVMLFPKLELSIQRDFAAAVMMHDPGTMKIMSDGKWVAR 556
LE EYVM+ TYDVHFY+SFAL+ LFPKLELSIQRD+AAA + H+P +K ++ GKW +
Sbjct: 562 LESVEYVMWCTYDVHFYASFALISLFPKLELSIQRDYAAATLSHNPELVKFLAGGKWGIK 621
Query: 557 KCLGAVPHDIGLDDPWFEINSYNLFNSSRWKDLNSKFVLQVYRDFVATGDKNFARAVWPS 616
K LGAVPHD+G+ DPW E+N+YN+ ++SRWKDLNSKFVLQ+YRD VAT D NFAR WP+
Sbjct: 622 KVLGAVPHDLGIHDPWVEVNAYNIHDTSRWKDLNSKFVLQIYRDVVATEDLNFARVTWPA 681
Query: 617 VYIAMAYMEQFDKDGDGMIENEGFPDQTYDAWSANGVSAYCGGLWVAALQAASALANDVG 676
VY AMAYM++FD+D DGMIEN+GFPDQTYD W+ +G+SAYCGGLW+AALQAA+A+A V
Sbjct: 682 VYTAMAYMDRFDRDRDGMIENDGFPDQTYDTWTVHGISAYCGGLWLAALQAAAAMAEKVN 741
Query: 677 DHASASYFWVRYQKAKAVYD-SLWNGSYFNYDNSDGSSSTSIQADQLAGQWYARACGLLP 735
+ +A+YF +++ A+ VYD LWNGSYFNYD+ S+S SIQADQ+AGQWYA A GL P
Sbjct: 742 EPNAANYFLRKFRDARKVYDKKLWNGSYFNYDSGSSSNSNSIQADQMAGQWYAWASGLPP 801
Query: 736 IADEAKVKKALTKIYDFNVLKVKGGMCGAMNGMQPDGRIDMSGLQAREIWPGVTYGLAAS 795
+ D+ K + AL +IYD+NV+KVKGG GA+NGM P+GR+D + +Q+REIW GVTY AA+
Sbjct: 802 LFDDYKARSALQRIYDYNVMKVKGGKLGAVNGMHPNGRVDETAMQSREIWSGVTYAAAAA 861
Query: 796 MIQEEMVDMAFQTAAGVYEVAWSEDGLGYSFQTPESWNNNDEYRSLCYMRPLTIWAMQWA 855
MI E M++ AF TA G+Y WS+ G GY FQTPE+W + +R+L YMRPL IWAMQWA
Sbjct: 862 MIHEGMLEQAFTTAEGIYNAGWSDSGYGYWFQTPEAWTTDGHFRALTYMRPLAIWAMQWA 921
Query: 856 LTKPKL----SRQEIKHEISDRDSSYLEQHAAFSKVASLLK 892
L P R K E + +S +A F+KV+ LK
Sbjct: 922 LYPPNCVAAGPRIPSKEESAGGSTSSSYLYAGFAKVSEALK 962
>gi|357116998|ref|XP_003560263.1| PREDICTED: non-lysosomal glucosylceramidase-like [Brachypodium
distachyon]
Length = 932
Score = 907 bits (2345), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 461/869 (53%), Positives = 599/869 (68%), Gaps = 21/869 (2%)
Query: 19 DPGQPPQLTWQRKLDSNVKVPTGFTLSFQEFRHLAHIGFRLYRYSKQEEAKGKVPVFNVF 78
D G PP+ W+RKL S+ F ++F+E + +G RL+ Y ++E + G+ + F
Sbjct: 37 DDGSPPEHAWRRKLSSHANRLKEFNVTFREAFKMMKLGLRLWSYIREEASYGRKAPIDPF 96
Query: 79 RKHHI--TSDQGIPLGGIGAGSIGRSYRGEFQRFKLFHGICDDAPVLANQFSVFVSRPNG 136
+ ++ QG+PLGG+G GSI R +RGEF+ +++ G C+ PV+ NQFS+F+ R
Sbjct: 97 TRESTKPSASQGVPLGGMGTGSISRGFRGEFKHWQITPGSCEMLPVMENQFSIFIIR-GS 155
Query: 137 EKFSSVLCPRSPGVPKKNTDSGIESWDWNLKGENCTYHALFPRAWTVYDGEPDPELRIVC 196
+K+SSVL P KK+++ GI SWDW L+G+ TYHALFPRAWTVYDGEPDPEL++ C
Sbjct: 156 KKYSSVLAPGQHDGLKKSSNDGISSWDWKLRGDRSTYHALFPRAWTVYDGEPDPELKVSC 215
Query: 197 RQISPFIPHNYKESSFPASVFTFTLSNSGQTSADVTLLFTWANSVAGDSGLSGHHFNSKT 256
RQISPFIPHNY+ESS P SVF +TL N+G+ A V+L+ TWANS+ G S SG H N
Sbjct: 216 RQISPFIPHNYQESSLPTSVFVYTLVNTGKERAKVSLVMTWANSIGGLSDHSGGHVNEPF 275
Query: 257 MTKDGVHGLTLHHRTANGRPPVTFAVAAEETADVHVSECPCFLLSGNSKGITAKDMWNEI 316
+ ++GV G+ LHH+TAN PPVTFAVAA E +V V+ P F LSG S +TA++MW I
Sbjct: 276 IGENGVSGVLLHHKTANNNPPVTFAVAACENQNVDVTVLPVFGLSGESS-VTAREMWGTI 334
Query: 317 KKHGSFDHLDNDKTSPSEP---GSSIGAAIAASLTIPSGSTRSVTFSLAWDCPEVKFFE- 372
+ G FD DN K PS P G ++ AA+ AS + + +V F+LAW PEVKF +
Sbjct: 335 VQDGCFDR-DNFKAGPSMPSSLGETVCAAVCASTWVEAHGRCTVVFALAWSSPEVKFKKG 393
Query: 373 KVYHRRYTKFYGTLGDSAARIAHDAILEHAKWECEIEAWQRPILEDKRFPEWYPITLFNE 432
YHRRYTKFYGT SA + DA++ + WE EI+ WQ PIL D+R PEWY ITLFNE
Sbjct: 394 STYHRRYTKFYGTSPRSAINLVQDALMSYKHWEEEIDKWQTPILRDERLPEWYKITLFNE 453
Query: 433 LYYLNAGGTIWTDGSPPMQSLATIRERKFSLDTPSKIHTPASSDTALGTRLLEN------ 486
LY+L AGGT+W D +SL ++K + P P T L +
Sbjct: 454 LYFLVAGGTVWIDS----ESLMVDADKKVNSSLPEDSDLPLHDSNCNSTVPLIDPMPLDF 509
Query: 487 -GEENIGQFLYLEGAEYVMYNTYDVHFYSSFALVMLFPKLELSIQRDFAAAVMMHDPGTM 545
+EN+G+FLYLEG EY M+ TYDVHFY+SFAL+ LFPK+ELSIQRDFA AV+ D +
Sbjct: 510 DDKENVGKFLYLEGVEYFMWCTYDVHFYASFALLNLFPKIELSIQRDFARAVLREDRSRV 569
Query: 546 KIMSDGKWVARKCLGAVPHDIGLDDPWFEINSYNLFNSSRWKDLNSKFVLQVYRDFVATG 605
+ ++DG W RK +GAVPHD+G DPW E+N+YN+ ++SRWKDLN KFVLQVYRDF ATG
Sbjct: 570 RFLADGTWGTRKVIGAVPHDLGAHDPWHELNAYNIHDTSRWKDLNPKFVLQVYRDFAATG 629
Query: 606 DKNFARAVWPSVYIAMAYMEQFDKDGDGMIENEGFPDQTYDAWSANGVSAYCGGLWVAAL 665
D +F + VWP+V AM YMEQFD+DGDGMIEN+GFPDQTYDAW+ GVSAYCG LW+A+L
Sbjct: 630 DMSFGKDVWPAVCTAMEYMEQFDRDGDGMIENDGFPDQTYDAWTVLGVSAYCGCLWLASL 689
Query: 666 QAASALANDVGDHASASYFWVRYQKAKAVYDS-LWNGSYFNYDNSDGSSSTSIQADQLAG 724
QAA+A+A +G + A V++ KAK V+++ LWNGSYFNYD+ SS SIQADQLAG
Sbjct: 690 QAAAAMARGLGHNDYAERCMVKFAKAKHVFETKLWNGSYFNYDSGASYSSRSIQADQLAG 749
Query: 725 QWYARACGLLPIADEAKVKKALTKIYDFNVLKVKGGMCGAMNGMQPDGRIDMSGLQAREI 784
QWYA + GL P+ DE ++K L KI+D+NV++VKGG GA+NGM P+G++D + +Q+REI
Sbjct: 750 QWYAASSGLPPLFDEGRIKCTLQKIFDYNVMRVKGGRMGAVNGMYPNGKVDETCMQSREI 809
Query: 785 WPGVTYGLAASMIQEEMVDMAFQTAAGVYEVAWSEDGLGYSFQTPESWNNNDEYRSLCYM 844
W GVTY LAA+M+ M D AF TA G++ WSE+G GY FQTPE+W + YRSL YM
Sbjct: 810 WTGVTYSLAATMLLHGMEDQAFTTAEGIFLAGWSEEGYGYWFQTPEAWTIDGHYRSLIYM 869
Query: 845 RPLTIWAMQWALTKPKLSRQEIKHEISDR 873
RPL IWAMQ+AL+ PK + K DR
Sbjct: 870 RPLAIWAMQYALSPPKAILEAPKVNTMDR 898
>gi|16905165|gb|AAL31035.1|AC078948_19 unknown protein [Oryza sativa Japonica Group]
Length = 967
Score = 897 bits (2317), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 463/920 (50%), Positives = 615/920 (66%), Gaps = 63/920 (6%)
Query: 9 GENAASMPKVDPGQPPQLTWQRKLDSNVKVPTGFTLSFQEFRHLAHIGFRLYRYSKQEEA 68
G A + D G PP+ W+R+L+S+ + F+++F E + +G RL+ Y ++E +
Sbjct: 21 GRTALQLLDFDGGSPPEQAWRRRLNSHANLLKEFSVTFMEAMRMMSLGLRLWSYVREEAS 80
Query: 69 KGKVPVFNVFRKHHI--TSDQGIPLGGIGAGSIGRSYRGEFQRFKLFHGICDDAPVLANQ 126
G+ + F K ++ QG+PLGG+G+GSI R +RGEF+ + + G+C+ +P+
Sbjct: 81 HGRKAPIDPFTKEKCKPSASQGVPLGGMGSGSISRGFRGEFKNWHIIPGLCETSPI---- 136
Query: 127 FSVFVSRPNG-EKFSSVLCPRSPGVPKKNTDSGIESWDWNLKGENCTYHALFPRAWTVYD 185
FVSR G +K+SSVL P KK DSGI SWDWNL G++ TYHALFPRAWTVYD
Sbjct: 137 ---FVSRDGGNKKYSSVLSPGHHEGLKKCNDSGISSWDWNLSGQHSTYHALFPRAWTVYD 193
Query: 186 GEPDPELRIVCRQISPFIPHNYKESSFPASVFTFTLSNSGQTSADVTLLFTWANSVAGDS 245
GEPDP+L+I CRQISPFIPH+YK+SS P SVF +TL N+G+ A V+LL TWANS+ G S
Sbjct: 194 GEPDPDLKISCRQISPFIPHDYKDSSLPTSVFVYTLVNTGKDRAKVSLLMTWANSIGGFS 253
Query: 246 GLSGHHFNSKTMTKDGVHGLTLHHRTANGRPPVTFAVAAEETADVHVSECPCFLLSGNSK 305
SG HFN + +DGV G+ LHH+TA PPVTFA+AA ET +V+V+ P F LSG
Sbjct: 254 HHSGGHFNEPFIAEDGVSGVLLHHKTAKDNPPVTFAIAACETQNVNVTVLPVFGLSGEGH 313
Query: 306 GITAKDMWNEIKKHGSFD--HLDNDKTSPSEPGSSIGAAIAASLTIPSGSTRSVTFSLAW 363
+AK MW+ +K++G FD + + + PS G ++ AA++AS + +V F LAW
Sbjct: 314 D-SAKQMWDRMKQNGHFDRENFEAGTSMPSSSGETLCAAVSASTWVEPHGRCTVVFGLAW 372
Query: 364 DCPEVKFFEK-VYHRRYTKFYGTLGDSAARIAHDAILEHAKWECEIEAWQRPILEDKRFP 422
P++KF + Y+RRYT+FYGT SA + HDA+ ++ WE EIE WQ PIL++++ P
Sbjct: 373 SSPKIKFQKGCTYNRRYTEFYGTSERSAVNLVHDALTKYRIWEEEIEKWQNPILKNEKLP 432
Query: 423 EWYPITLFNELYYLNAGGTIWTDGSPPM-------------QSLATIRERKF-------- 461
EWY TLFNELY+L AGGT+WTDG PP+ S R+ K
Sbjct: 433 EWYKFTLFNELYFLVAGGTVWTDGQPPVIDEKPSPGSNQQKSSKRGTRDTKQESVKDNHV 492
Query: 462 -------------------SLDTPSKIHTP--ASSDTALGTR------LLENGEENIGQF 494
S+ + +H P A + LG++ L +NG EN+G+F
Sbjct: 493 KLTAEQVTNGGDLANGEEQSVSKYAAVHGPQMAKATNGLGSQEPIPYLLSKNGPENVGKF 552
Query: 495 LYLEGAEYVMYNTYDVHFYSSFALVMLFPKLELSIQRDFAAAVMMHDPGTMKIMSDGKWV 554
LYLEG EY+M+ TYDVHFY+SFAL+ LFPK+ELSIQRDFA AV+ D MK ++DG
Sbjct: 553 LYLEGVEYIMWCTYDVHFYASFALLDLFPKIELSIQRDFANAVLYEDRRRMKFLADGTSG 612
Query: 555 ARKCLGAVPHDIGLDDPWFEINSYNLFNSSRWKDLNSKFVLQVYRDFVATGDKNFARAVW 614
RK GAVPHD+G DPW E+N+YN+ ++S+WKDLN KFVLQVYRDF ATGD +F R VW
Sbjct: 613 IRKVKGAVPHDLGTHDPWHEMNAYNIHDTSKWKDLNPKFVLQVYRDFAATGDMSFGRDVW 672
Query: 615 PSVYIAMAYMEQFDKDGDGMIENEGFPDQTYDAWSANGVSAYCGGLWVAALQAASALAND 674
P+V M YM QFD+DGDG+IEN+GFPDQTYDAW+ +G+SAYCGGLW+AALQAA+ +A+
Sbjct: 673 PAVCAVMDYMNQFDRDGDGLIENDGFPDQTYDAWTVHGISAYCGGLWLAALQAAATMAHR 732
Query: 675 VGDHASASYFWVRYQKAKAVYDS-LWNGSYFNYDNSDGSSSTSIQADQLAGQWYARACGL 733
+GD A + +++ +AKAVY++ LWNGSYFNYD+ S+S SIQADQLAGQWYA + GL
Sbjct: 733 LGDRPFAEKYKLKFIQAKAVYEAKLWNGSYFNYDSGTSSNSRSIQADQLAGQWYAASSGL 792
Query: 734 LPIADEAKVKKALTKIYDFNVLKVKGGMCGAMNGMQPDGRIDMSGLQAREIWPGVTYGLA 793
P+ DE K++ AL KI++FNV+KVKGG GA+NGM P+G++D + +Q+REIW GVTYG+A
Sbjct: 793 PPLFDENKIRSALQKIFEFNVMKVKGGRLGAVNGMTPNGKVDETCMQSREIWTGVTYGVA 852
Query: 794 ASMIQEEMVDMAFQTAAGVYEVAWSEDGLGYSFQTPESWNNNDEYRSLCYMRPLTIWAMQ 853
A+M+ M F TA G++ WSE+G GY FQTPE W + YRSL YMRPL IWAMQ
Sbjct: 853 ANMLLHGMEHQGFTTAEGIFIAGWSEEGYGYWFQTPEGWTIDGHYRSLIYMRPLAIWAMQ 912
Query: 854 WALTKPKLSRQEIKHEISDR 873
WA + PK K + DR
Sbjct: 913 WARSPPKAILDAPKVNLMDR 932
>gi|297831192|ref|XP_002883478.1| hypothetical protein ARALYDRAFT_479911 [Arabidopsis lyrata subsp.
lyrata]
gi|297329318|gb|EFH59737.1| hypothetical protein ARALYDRAFT_479911 [Arabidopsis lyrata subsp.
lyrata]
Length = 950
Score = 893 bits (2307), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 446/889 (50%), Positives = 611/889 (68%), Gaps = 36/889 (4%)
Query: 19 DPGQPPQLTWQRKLDSNVKVPTGFTLSFQEFRHLAHIGFRLYRYSKQEEAKGKVPVFNVF 78
D PP W+R+L+ + + FT++F+E + +G RL+ Y ++E + G+ + F
Sbjct: 31 DCAAPPPHAWRRRLNCHANILKEFTITFREAIKMVRLGIRLWSYVREEASHGRKAPIDPF 90
Query: 79 RKHHI--TSDQGIPLGGIGAGSIGRSYRGEFQRFKLFHGICDDAPVLANQFSVFVSRPNG 136
K + ++ QG+PLGG+G+GSI R +RGEF+++++ G CD +P+++NQFS+F+SR G
Sbjct: 91 TKENCKPSASQGVPLGGMGSGSISRGFRGEFKQWQITPGTCDPSPMMSNQFSIFISRDGG 150
Query: 137 -EKFSSVLCPRSPGVPKKNTDSGIESWDWNLKGENCTYHALFPRAWTVYDGEPDPELRIV 195
+K++SVL P G K+ D G+ SW WNL G++ TYHALFPRAWT+YDGEPDPEL+I
Sbjct: 151 HKKYASVLSPGQHGSLGKSRDKGLSSWGWNLNGQHSTYHALFPRAWTIYDGEPDPELKIS 210
Query: 196 CRQISPFIPHNYKESSFPASVFTFTLSNSGQTSADVTLLFTWANSVAGDSGLSGHHFNSK 255
CRQISPFIP+NY++SS PA+VF +TL N+G+ A V+LLFTWANS+ G S +SG H N
Sbjct: 211 CRQISPFIPNNYRDSSLPAAVFVYTLVNTGKERAKVSLLFTWANSMGGTSHMSGGHVNEP 270
Query: 256 TMTKDGVHGLTLHHRTANGRPPVTFAVAAEETADVHVSECPCFLLSGNSKGITAKDMWNE 315
+ +DGV G+ LHH+T G PPVTFA+AA ET +V+V+ PCF LS +S TAKDMW+
Sbjct: 271 FIGEDGVSGVLLHHKTGKGNPPVTFAIAASETQNVNVTVLPCFGLSEDSS-FTAKDMWDT 329
Query: 316 IKKHGSFD--HLDNDKTSPSEPGSSIGAAIAASLTIPSGSTRSVTFSLAWDCPEVKFFE- 372
++K G FD + ++ ++PS G +I AA++AS + + +V+F+L+W P+VKF +
Sbjct: 330 VEKDGKFDQENFNSGPSTPSLAGDTICAAVSASAWVEAHGKCTVSFALSWSSPKVKFSKG 389
Query: 373 KVYHRRYTKFYGTLGDSAARIAHDAILEHAKWECEIEAWQRPILEDKRFPEWYPITLFNE 432
Y RRYTKFYGT +A + HDA+ + +WE +IEAWQ PIL D+R PEWY TLFNE
Sbjct: 390 STYDRRYTKFYGTSPRAALDLVHDALTNYKRWEEDIEAWQNPILRDERLPEWYKFTLFNE 449
Query: 433 LYYLNAGGTIWTDGSP------------------PMQSLATIRERKFSLDTPSKIHTPAS 474
LY+L AGGT+W D S M I +++ L + + ++
Sbjct: 450 LYFLVAGGTVWIDSSSLNANGNSQHQQSGLGNSDGMAGGLDINDQRNDLANGNSVGVKSN 509
Query: 475 SDTA---------LGTRLLENGEENIGQFLYLEGAEYVMYNTYDVHFYSSFALVMLFPKL 525
+ + + TR +++G++ +G+FLYLEG EYVM+ TYDVHFY+S+AL+MLFPK+
Sbjct: 510 DEVSAIHNRNGLFVDTRHVDDGDD-VGRFLYLEGVEYVMWCTYDVHFYASYALLMLFPKI 568
Query: 526 ELSIQRDFAAAVMMHDPGTMKIMSDGKWVARKCLGAVPHDIGLDDPWFEINSYNLFNSSR 585
EL+IQRDFA AV+ D +K +++G RK GAVPHD+G+ DPW E+N+YN+ ++S+
Sbjct: 569 ELNIQRDFAKAVLSEDGRKVKFLAEGNVGIRKVRGAVPHDLGMHDPWNEMNAYNIHDTSK 628
Query: 586 WKDLNSKFVLQVYRDFVATGDKNFARAVWPSVYIAMAYMEQFDKDGDGMIENEGFPDQTY 645
WKDLN KFVLQVYRDF ATGD F VWP+V AM YMEQFD+D D +IEN+GFPDQTY
Sbjct: 629 WKDLNPKFVLQVYRDFAATGDYQFGIDVWPAVRAAMEYMEQFDRDNDDLIENDGFPDQTY 688
Query: 646 DAWSANGVSAYCGGLWVAALQAASALANDVGDHASASYFWVRYQKAKAVYDS-LWNGSYF 704
D W+ +GVSAYCG LW+AALQAA+A+A +GD A ++ AKA ++ LWNGSYF
Sbjct: 689 DTWTVHGVSAYCGCLWLAALQAAAAMALQIGDKFFAELCKNKFLNAKAALETKLWNGSYF 748
Query: 705 NYDNSDGSSSTSIQADQLAGQWYARACGLLPIADEAKVKKALTKIYDFNVLKVKGGMCGA 764
NYD+ S+S SIQ DQLAGQWYA + GL PI +E+K++ + KI+DFNV+K KGG GA
Sbjct: 749 NYDSGASSNSKSIQTDQLAGQWYAASSGLPPIFEESKIRSTMQKIFDFNVMKTKGGKMGA 808
Query: 765 MNGMQPDGRIDMSGLQAREIWPGVTYGLAASMIQEEMVDMAFQTAAGVYEVAWSEDGLGY 824
+NGM PDG++D + +Q+REIW GVTY AA+MI M + F TA G++ WSE+G GY
Sbjct: 809 VNGMHPDGKVDDTCMQSREIWTGVTYAAAATMILSGMEEQGFTTAEGIFTAGWSEEGFGY 868
Query: 825 SFQTPESWNNNDEYRSLCYMRPLTIWAMQWALTKPKLSRQEIKHEISDR 873
FQTPE W + YRSL YMRPL IW MQWAL+ PK + + DR
Sbjct: 869 WFQTPEGWTMDGHYRSLIYMRPLAIWGMQWALSLPKAILDAPQINMMDR 917
>gi|255554158|ref|XP_002518119.1| conserved hypothetical protein [Ricinus communis]
gi|223542715|gb|EEF44252.1| conserved hypothetical protein [Ricinus communis]
Length = 968
Score = 892 bits (2304), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 458/904 (50%), Positives = 607/904 (67%), Gaps = 50/904 (5%)
Query: 19 DPGQPPQLTWQRKLDSNVKVPTGFTLSFQEFRHLAHIGFRLYRYSKQEEAKGKVPVFNVF 78
D PP+ W+R+L+S+ + F+++F E + +G RL+ Y ++E + G+ + F
Sbjct: 31 DSAAPPKHAWRRRLNSHANILKEFSVTFTEAIKMVRLGIRLWSYVREEASHGRKAPIDPF 90
Query: 79 RKHHI--TSDQGIPLGGIGAGSIGRSYRGEFQRFKLFHGICDDAPVLANQFSVFVSRPNG 136
+ ++ QG+PLGG+G+GSI R +RGEF+++++ IC+ +PV+ANQFS+F+SR G
Sbjct: 91 TRESCKPSASQGVPLGGMGSGSISRGFRGEFRQWQIVPSICEVSPVMANQFSIFISRDGG 150
Query: 137 -EKFSSVLCPRSPGVPKKNTDSGIESWDWNLKGENCTYHALFPRAWTVYDGEPDPELRIV 195
+K++SVL P K+ D GI SW WNL G++ TYHALFPRAWT+YDGEPDPEL+I
Sbjct: 151 TKKYASVLAPGQHEGLGKDGDQGISSWGWNLSGQHSTYHALFPRAWTIYDGEPDPELKIS 210
Query: 196 CRQISPFIPHNYKESSFPASVFTFTLSNSGQTSADVTLLFTWANSVAGDSGLSGHHFNSK 255
CRQISPFIPHNY++SS P +VF +TL NSG+ A V+LLFTWANS+ G S LSG H N
Sbjct: 211 CRQISPFIPHNYRDSSLPTAVFVYTLVNSGKERAKVSLLFTWANSIGGVSHLSGDHVNEP 270
Query: 256 TMTKDGVHGLTLHHRTANGRPPVTFAVAAEETADVHVSECPCFLLSGNSKGITAKDMWNE 315
+ +DGV G+ LHH+TA G PPVTFA+AA ET +V V+ P F LS S ITAKDMW++
Sbjct: 271 FIGEDGVSGVLLHHKTAKGNPPVTFAIAACETQNVSVTVLPSFGLSEESH-ITAKDMWSK 329
Query: 316 IKKHGSFD--HLDNDKTSPSEPGSSIGAAIAASLTIPSGSTRSVTFSLAWDCPEVKFFE- 372
+ + G FD + D T PS PG ++ AA++AS + +V F+L+W P++KF +
Sbjct: 330 MVQDGQFDRENFDCGPTMPSSPGETLCAAVSASAWVEPHGKCTVAFALSWSSPKIKFSKG 389
Query: 373 KVYHRRYTKFYGTLGDSAARIAHDAILEHAKWECEIEAWQRPILEDKRFPEWYPITLFNE 432
YHRRYTKFYGT +A + HDA+ + WE EIE WQ PIL+D+R PEWY TLFNE
Sbjct: 390 STYHRRYTKFYGTSERAAQNLVHDALKNYKWWEEEIEKWQNPILKDERLPEWYKFTLFNE 449
Query: 433 LYYLNAGGTIWTDGS--------------------PPMQSLATIRERKFSLDT------- 465
LY+L AGGT+W D S P + I +++ T
Sbjct: 450 LYFLVAGGTVWIDSSLLTEDMRETMNVDVIEVQVSRPKGAEKQIATNGYNVATIGLEEKD 509
Query: 466 -------PSKIHTPASSDTALGTRLL--------ENGEENIGQFLYLEGAEYVMYNTYDV 510
PSK P S + L +N +++G+FLYLEG EY+M+ TYDV
Sbjct: 510 GASNGNYPSKDELPVSHENGHLNHSLKLSPLMEWQNNSDDVGRFLYLEGVEYIMWCTYDV 569
Query: 511 HFYSSFALVMLFPKLELSIQRDFAAAVMMHDPGTMKIMSDGKWVARKCLGAVPHDIGLDD 570
HFY+SFAL+ LFPK+EL+IQRDFA AV+ D +K +++G RK GAVPHD+G D
Sbjct: 570 HFYASFALLELFPKIELNIQRDFAKAVLSEDGRKVKFLAEGNVGIRKVRGAVPHDLGTHD 629
Query: 571 PWFEINSYNLFNSSRWKDLNSKFVLQVYRDFVATGDKNFARAVWPSVYIAMAYMEQFDKD 630
PW E+N+YN+ ++S+WKDLN KFVLQVYRDF AT D +F VWP+V AM YMEQFD+D
Sbjct: 630 PWNEMNAYNIHDTSKWKDLNPKFVLQVYRDFAATQDMSFGVDVWPAVRSAMEYMEQFDRD 689
Query: 631 GDGMIENEGFPDQTYDAWSANGVSAYCGGLWVAALQAASALANDVGDHASASYFWVRYQK 690
GD +IEN+GFPDQTYDAW+ +GVSAYCG LW+AAL+AA+A+A VGD A ++ K
Sbjct: 690 GDALIENDGFPDQTYDAWTVHGVSAYCGCLWLAALEAAAAMALQVGDKYFAELCRSKFVK 749
Query: 691 AKAVYDS-LWNGSYFNYDNSDGSSSTSIQADQLAGQWYARACGLLPIADEAKVKKALTKI 749
AK+ +++ LWNGSYFNYD+ S+S SIQADQLAGQWY + GL P+ D++K+K L KI
Sbjct: 750 AKSAFEAKLWNGSYFNYDSGSSSNSKSIQADQLAGQWYVASSGLPPLFDDSKIKSTLQKI 809
Query: 750 YDFNVLKVKGGMCGAMNGMQPDGRIDMSGLQAREIWPGVTYGLAASMIQEEMVDMAFQTA 809
YDFNV+KV+GG GA+NGM P+G++D + +Q+REIW GVTY +AA+MI M D AF A
Sbjct: 810 YDFNVMKVRGGRMGAVNGMHPNGKVDETCMQSREIWTGVTYAVAATMILAGMEDKAFAAA 869
Query: 810 AGVYEVAWSEDGLGYSFQTPESWNNNDEYRSLCYMRPLTIWAMQWALTKPKLSRQEIKHE 869
G++ WSEDG GY FQTPE W + +RSL YMRPL IW MQWAL+ PK + K
Sbjct: 870 EGIFLAGWSEDGYGYWFQTPEGWTTDGHFRSLIYMRPLAIWGMQWALSLPKAILEAPKIN 929
Query: 870 ISDR 873
I DR
Sbjct: 930 IMDR 933
>gi|9665098|gb|AAF97289.1|AC010164_11 Hypothetical protein [Arabidopsis thaliana]
Length = 790
Score = 890 bits (2299), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 452/823 (54%), Positives = 559/823 (67%), Gaps = 108/823 (13%)
Query: 130 FVSRPNGEKFSSVLCPRSPGVPKKN---------TDSGIESWDWNLKGENCTYHALFPRA 180
FVSRP G K S+VLCP P V K N + GIESWDWN+ GE TYHAL+PR+
Sbjct: 24 FVSRPGGVKHSTVLCPSKPQVVKDNGGYLCQGQAPNMGIESWDWNMTGEKSTYHALYPRS 83
Query: 181 WTVYDG-------EPDPELRIVCRQISPFIPHNYKESSFPASVFTFTLSNSGQTSADVTL 233
WTVYDG R++ ++ +I + ++N+G A VTL
Sbjct: 84 WTVYDGIIFDLNVSNHVYYRLIYCKVLMYI------KVMKMFLLDEQVTNTGAEPAIVTL 137
Query: 234 LFTWANSVAGDSGLSGHHFNSKTMTKDGVHGLTLHHRTANGRPPVTFAVAAEETADVHVS 293
LFTW NSV G SGL+G HFNS TM +TANG PPV++A+AA+ET DV VS
Sbjct: 138 LFTWENSVGGASGLTGQHFNS-TM------------KTANGHPPVSYAIAAKETEDVRVS 184
Query: 294 ECPCFLLSGNSKG-ITAKDMWNEIKKHGSFDHLDNDKTSPSEPGSSIGAAIAASLTIPSG 352
CPCF++SG + ITA DMW+EIKK+ SFD L ++ SPS+PG+SIGAAIAA + +P G
Sbjct: 185 SCPCFIVSGTTPNQITAGDMWDEIKKNASFDKLTSNACSPSKPGTSIGAAIAAKVKVPPG 244
Query: 353 STRSVTFSLAWDCPEVKFFEKVYHRRYTKFYGTLGDSAARIAHDAILEHAKWECEIEAWQ 412
R+VTFSL+WDCPE +F EK YHRRYT+FYG+LG++A +AHDA+L
Sbjct: 245 CDRTVTFSLSWDCPEARFDEKTYHRRYTRFYGSLGNAAVAMAHDALL------------- 291
Query: 413 RPILEDKRFPEWYPITLFNELYYLNAGGTIWTDGSPPMQSLATIRERKFSLD-------- 464
+G PP QSL +I RK SL
Sbjct: 292 --------------------------------NGLPPKQSLDSIGRRKISLGLSTIDKND 319
Query: 465 ----------------TPSKIHTPASSDTALGTRLLENGEENIGQFLYLEGAEYVMYNTY 508
S+IH P +S+ ALG +++N ENIGQFLYLEG +Y+MYNTY
Sbjct: 320 QDQNNVALDILGRIDAVCSQIHAPLTSNAALGNTMIQNTTENIGQFLYLEGVQYLMYNTY 379
Query: 509 DVHFYSSFALVMLFPKLELSIQRDFAAAVMMHDPGTMKIMSDGKWVARKCLGAVPHDIGL 568
DVHFYSSFAL+MLFPK+ELSIQRDFAAAV+MHD ++MS G++V RK LGAVPHDIGL
Sbjct: 380 DVHFYSSFALLMLFPKIELSIQRDFAAAVLMHDSSKKQVMSSGEFVTRKVLGAVPHDIGL 439
Query: 569 DDPWFEINSYNLFNSSRWKDLNSKFVLQVYRDFVATGDKNFARAVWPSVYIAMAYMEQFD 628
+DPWFE+N+YNLFN+ RWKDLNSKFVLQVYRD VATGD NFA+AVWPSVY A+AY++QFD
Sbjct: 440 NDPWFELNAYNLFNTDRWKDLNSKFVLQVYRDVVATGDLNFAKAVWPSVYTAIAYLDQFD 499
Query: 629 KDGDGMIENEGFPDQTYDAWSANGVSAYCGGLWVAALQAASALANDVGDHASASYFWVRY 688
KDGDGMIENEGFPDQTYDAWS +GVSAYCGGLWVAALQA SALA ++GD+ +A YF +Y
Sbjct: 500 KDGDGMIENEGFPDQTYDAWSCSGVSAYCGGLWVAALQAGSALAREIGDNGAAVYFNAKY 559
Query: 689 QKAKAVYDSLWNGSYFNYDNSDGSSSTSIQADQLAGQWYARACGLLPIADEAKVKKALTK 748
+KA++VY+ LWNGSYFNYDNS SS+SI ADQ+AGQWYARACGL PIA E +KKAL
Sbjct: 560 EKARSVYEKLWNGSYFNYDNSRSGSSSSILADQMAGQWYARACGLKPIAKEEWIKKALET 619
Query: 749 IYDFNVLKVKGGMCGAMNGMQPDGRIDMSGLQAREIWPGVTYGLAASMIQEEMVDMAFQT 808
+YDFNV++V+ G GA+NGM PDGR+D S + +RE+W G TY +AA MIQE + D F+T
Sbjct: 620 VYDFNVMRVRDGTRGAVNGMLPDGRVDTSTMVSREVWAGTTYSVAACMIQEGLADKGFRT 679
Query: 809 AAGVYEVAWSEDGLGYSFQTPESWNNNDEYRSLCYMRPLTIWAMQWALTKPKLSRQE--- 865
A+G+YE AWS+ GLG +FQTPE+W NDEYRSLCYMRPL IW +QWA T PK +R++
Sbjct: 680 ASGIYEAAWSDRGLGCAFQTPEAWTTNDEYRSLCYMRPLAIWGIQWAHTMPKPNREQEVS 739
Query: 866 IKHEISDRDSSYLEQHAAFSKVASLLKLPKEEASKGFLKVVYD 908
++ + D S +QHA F KVA LK K + + L+ Y+
Sbjct: 740 LRPQEEDATSVLFQQHAGFIKVAHYLKNTKGKDHRNRLQTAYE 782
>gi|356576421|ref|XP_003556330.1| PREDICTED: non-lysosomal glucosylceramidase-like [Glycine max]
Length = 950
Score = 884 bits (2284), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 454/885 (51%), Positives = 609/885 (68%), Gaps = 31/885 (3%)
Query: 19 DPGQPPQLTWQRKLDSNVKVPTGFTLSFQEFRHLAHIGFRLYRYSKQEEAKGKVPVFNVF 78
D PP+ W+R+L+S+ + F ++F E + +G R++ Y ++E + G+ + F
Sbjct: 31 DSSAPPEQAWRRRLNSHANLLKEFRVTFMEAIKMVRLGIRIWSYVREEASHGRKAPIDPF 90
Query: 79 RKHHI--TSDQGIPLGGIGAGSIGRSYRGEFQRFKLFHGICDDAPVLANQFSVFVSRPNG 136
+ ++ QG+PLGG+G+GSI R +RGEF+++++ +C+ +PV+ANQFS+F+SR G
Sbjct: 91 TRESCKPSASQGVPLGGMGSGSISRGFRGEFRQWQIIPSLCEASPVMANQFSIFISREGG 150
Query: 137 EK-FSSVLCPRSPGVPKKNTDSGIESWDWNLKGENCTYHALFPRAWTVYDGEPDPELRIV 195
K F+SVL P K D GI SW WNL G++ TYHALFPRAWTVYDGEPDPEL+I
Sbjct: 151 NKNFASVLAPGQHEGLGKPDDQGISSWGWNLSGQHSTYHALFPRAWTVYDGEPDPELKIS 210
Query: 196 CRQISPFIPHNYKESSFPASVFTFTLSNSGQTSADVTLLFTWANSVAGDSGLSGHHFNSK 255
CRQISPF+PHNY+ESS PA+VF +TL N+G+ A V+LLFTWANS+ G S LSG H N
Sbjct: 211 CRQISPFVPHNYRESSLPAAVFVYTLVNTGKERAKVSLLFTWANSIGGSSHLSGDHVNEP 270
Query: 256 TMTKDGVHGLTLHHRTANGRPPVTFAVAAEETADVHVSECPCFLLSGNSKGITAKDMWNE 315
+DGV G+ L+H+TA G PPVTFA+AA ET +V+VS P F LS S TAK MW++
Sbjct: 271 FKAEDGVSGVLLYHKTAKGNPPVTFAIAACETQNVNVSVLPSFGLSEGSS-TTAKGMWSK 329
Query: 316 IKKHGSFD--HLDNDKTSPSEPGSSIGAAIAASLTIPSGSTRSVTFSLAWDCPEVKFFE- 372
+ K G FD + ++ + PS PG ++ AA+AAS+ + +V FSLAW P+VKF +
Sbjct: 330 MVKDGQFDQENFNSGPSMPSSPGETLCAAVAASMWVEPHGKCTVAFSLAWSSPKVKFVKG 389
Query: 373 KVYHRRYTKFYGTLGDSAARIAHDAILEHAKWECEIEAWQRPILEDKRFPEWYPITLFNE 432
++RRYTKFYGT +AA +AHDA+ + +WE EIE WQ PIL+D+ PEWY TLFNE
Sbjct: 390 STFNRRYTKFYGTSEKAAADLAHDALTHYNRWEEEIEKWQNPILKDETLPEWYKFTLFNE 449
Query: 433 LYYLNAGGTIWTDGSPPMQSLATIRERKFSLDTPSKIHTP----------------ASSD 476
LY+L AGGTIW D ++ ++R L+ T ++ D
Sbjct: 450 LYFLVAGGTIWIDSPLLSSNMRNDQDRVRELENTVVKETEDKMSDRKRTVVERIMDSTCD 509
Query: 477 TALGT-------RLLENGEENIGQFLYLEGAEYVMYNTYDVHFYSSFALVMLFPKLELSI 529
+A+ T +L + + ++G+FLYLEG EY+M+ TYDVHFY+SFAL+ LFPK+EL+I
Sbjct: 510 SAVITGHDPADEKLSGDDDADVGRFLYLEGVEYIMWCTYDVHFYASFALLELFPKIELNI 569
Query: 530 QRDFAAAVMMHDPGTMKIMSDGKWVARKCLGAVPHDIGLDDPWFEINSYNLFNSSRWKDL 589
QRDFA AV+ D +K +++G W RK GAVPHD+G DPW E+N+YN+ ++S+WKDL
Sbjct: 570 QRDFARAVLCEDGRKVKFLAEGNWGIRKVYGAVPHDLGTHDPWHEMNAYNIHDTSKWKDL 629
Query: 590 NSKFVLQVYRDFVATGDKNFARAVWPSVYIAMAYMEQFDKDGDGMIENEGFPDQTYDAWS 649
N KFVLQVYRDF TGD F VWP+V AM YMEQFD+DGDG+IEN+GFPDQTYD W+
Sbjct: 630 NPKFVLQVYRDFATTGDLEFGVDVWPAVRAAMEYMEQFDRDGDGLIENDGFPDQTYDTWT 689
Query: 650 ANGVSAYCGGLWVAALQAASALANDVGDHASASYFWVRYQKAKAVY-DSLWNGSYFNYDN 708
+GVS YCG LW+AALQAA+A+A D+GD A ++ KAK + + LWNG+YFNYD+
Sbjct: 690 VHGVSTYCGCLWLAALQAAAAMALDLGDREFAEKCKRKFLKAKPAFEEKLWNGTYFNYDS 749
Query: 709 SDGSSSTSIQADQLAGQWYARACGLLPIADEAKVKKALTKIYDFNVLKVKGGMCGAMNGM 768
+S SIQADQLAGQWY + GL P+ +++K+K AL K+YDFNV+KVKGG GA+NGM
Sbjct: 750 GSSGNSKSIQADQLAGQWYTASSGLPPLFEDSKIKSALRKVYDFNVMKVKGGRMGAVNGM 809
Query: 769 QPDGRIDMSGLQAREIWPGVTYGLAASMIQEEMVDMAFQTAAGVYEVAWSEDGLGYSFQT 828
P+G++D + +Q+RE+W GVTYGLAA+MI M + AF TA G++ WSEDG GY FQT
Sbjct: 810 HPNGKVDETCMQSREVWTGVTYGLAATMILAGMEEEAFATAEGIFLAGWSEDGYGYWFQT 869
Query: 829 PESWNNNDEYRSLCYMRPLTIWAMQWALTKPKLSRQEIKHEISDR 873
PE+W + YRSL YMRPL IW MQ+A+ +PK + K I DR
Sbjct: 870 PEAWTMDGHYRSLMYMRPLAIWGMQYAINRPKAILEAPKINIMDR 914
>gi|22331303|ref|NP_189060.2| Beta-glucosidase, GBA2 type family protein [Arabidopsis thaliana]
gi|79313355|ref|NP_001030757.1| Beta-glucosidase, GBA2 type family protein [Arabidopsis thaliana]
gi|17529232|gb|AAL38843.1| unknown protein [Arabidopsis thaliana]
gi|110742225|dbj|BAE99039.1| hypothetical protein [Arabidopsis thaliana]
gi|332643347|gb|AEE76868.1| Beta-glucosidase, GBA2 type family protein [Arabidopsis thaliana]
gi|332643348|gb|AEE76869.1| Beta-glucosidase, GBA2 type family protein [Arabidopsis thaliana]
Length = 950
Score = 880 bits (2274), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 444/889 (49%), Positives = 610/889 (68%), Gaps = 36/889 (4%)
Query: 19 DPGQPPQLTWQRKLDSNVKVPTGFTLSFQEFRHLAHIGFRLYRYSKQEEAKGKVPVFNVF 78
D PP W+R+L+ + + FT++F+E + +G RL+ Y ++E + G+ + F
Sbjct: 31 DSAAPPPHAWRRRLNCHANILKEFTITFREAIKMVRLGIRLWSYVREEASHGRKAPIDPF 90
Query: 79 RKHHI--TSDQGIPLGGIGAGSIGRSYRGEFQRFKLFHGICDDAPVLANQFSVFVSRPNG 136
K + ++ QG+PLGG+G+GSI R +RGEF+++++ G CD +P+++NQFS+F+SR G
Sbjct: 91 TKENCKPSASQGVPLGGMGSGSISRGFRGEFKQWQITPGTCDPSPMMSNQFSIFISRDGG 150
Query: 137 -EKFSSVLCPRSPGVPKKNTDSGIESWDWNLKGENCTYHALFPRAWTVYDGEPDPELRIV 195
+K++SVL P G K+ D G+ SW WNL G++ TYHALFPRAWT+YDGEPDPEL+I
Sbjct: 151 HKKYASVLSPGQHGSLGKSRDKGLSSWGWNLNGQHSTYHALFPRAWTIYDGEPDPELKIS 210
Query: 196 CRQISPFIPHNYKESSFPASVFTFTLSNSGQTSADVTLLFTWANSVAGDSGLSGHHFNSK 255
CRQISPFIP+NY++SS PA+VF +TL N+G+ A V+LLFTWANS+ G S +SG H N
Sbjct: 211 CRQISPFIPNNYRDSSLPAAVFVYTLVNTGKERAKVSLLFTWANSMGGTSHMSGGHVNEP 270
Query: 256 TMTKDGVHGLTLHHRTANGRPPVTFAVAAEETADVHVSECPCFLLSGNSKGITAKDMWNE 315
+ +DGV G+ LHH+T G PPVTFA+AA ET +V+V+ PCF LS +S TAKDMW+
Sbjct: 271 FIGEDGVSGVLLHHKTGKGNPPVTFAIAASETQNVNVTVLPCFGLSEDSS-FTAKDMWDT 329
Query: 316 IKKHGSFD--HLDNDKTSPSEPGSSIGAAIAASLTIPSGSTRSVTFSLAWDCPEVKFFE- 372
+++ G FD + ++ ++PS G +I AA++AS + + +V+F+L+W P+VKF +
Sbjct: 330 MEQDGKFDQENFNSGPSTPSLAGDTICAAVSASAWVEAHGKCTVSFALSWSSPKVKFSKG 389
Query: 373 KVYHRRYTKFYGTLGDSAARIAHDAILEHAKWECEIEAWQRPILEDKRFPEWYPITLFNE 432
Y RRYTKFYGT +A + HDA+ + +WE +IEAWQ PIL D+R PEWY TLFNE
Sbjct: 390 STYDRRYTKFYGTSPRAALDLVHDALTNYKRWEEDIEAWQNPILRDERLPEWYKFTLFNE 449
Query: 433 LYYLNAGGTIWTDGSP---------PMQSLAT---------IRERKFSLDTPSKIHTPAS 474
LY+L AGGT+W D S L I +++ L + + ++
Sbjct: 450 LYFLVAGGTVWIDSSSLNANGNSQHQQSGLGNSDGKVGGLDINDQRNDLGNGNSVGVKSN 509
Query: 475 SDTA---------LGTRLLENGEENIGQFLYLEGAEYVMYNTYDVHFYSSFALVMLFPKL 525
+ + + T +++G++ +G+FLYLEG EYVM+ TYDVHFY+S+AL+MLFPK+
Sbjct: 510 DEVSAIHNRNGLFVDTPHVDDGDD-VGRFLYLEGVEYVMWCTYDVHFYASYALLMLFPKI 568
Query: 526 ELSIQRDFAAAVMMHDPGTMKIMSDGKWVARKCLGAVPHDIGLDDPWFEINSYNLFNSSR 585
EL+IQRDFA AV+ D +K +++G RK GAVPHD+G+ DPW E+N+YN+ ++SR
Sbjct: 569 ELNIQRDFAKAVLSEDGRKVKFLAEGNVGIRKVRGAVPHDLGMHDPWNEMNAYNIHDTSR 628
Query: 586 WKDLNSKFVLQVYRDFVATGDKNFARAVWPSVYIAMAYMEQFDKDGDGMIENEGFPDQTY 645
WKDLN KFVLQVYRDF ATGD F VWP+V AM YMEQFD+D D +IEN+GFPDQTY
Sbjct: 629 WKDLNPKFVLQVYRDFAATGDYQFGIDVWPAVRAAMEYMEQFDRDNDDLIENDGFPDQTY 688
Query: 646 DAWSANGVSAYCGGLWVAALQAASALANDVGDHASASYFWVRYQKAKAVYDS-LWNGSYF 704
D W+ +GVSAYCG LW+AALQAA+A+A +GD A ++ AKA ++ LWNGSYF
Sbjct: 689 DTWTVHGVSAYCGCLWLAALQAAAAMALQIGDKFFAELCKNKFLNAKAALETKLWNGSYF 748
Query: 705 NYDNSDGSSSTSIQADQLAGQWYARACGLLPIADEAKVKKALTKIYDFNVLKVKGGMCGA 764
NYD+ S+S SIQ DQLAGQWYA + GL P+ +E+K++ + KI+DFNV+K KGG GA
Sbjct: 749 NYDSGSSSNSKSIQTDQLAGQWYAASSGLPPLFEESKIRSTMQKIFDFNVMKTKGGKMGA 808
Query: 765 MNGMQPDGRIDMSGLQAREIWPGVTYGLAASMIQEEMVDMAFQTAAGVYEVAWSEDGLGY 824
+NGM PDG++D + +Q+REIW GVTY AA+MI M + F TA G++ WSE+G GY
Sbjct: 809 VNGMHPDGKVDDTCMQSREIWTGVTYAAAATMILSGMEEQGFTTAEGIFTAGWSEEGFGY 868
Query: 825 SFQTPESWNNNDEYRSLCYMRPLTIWAMQWALTKPKLSRQEIKHEISDR 873
FQTPE W + YRSL YMRPL IW MQWAL+ PK + + DR
Sbjct: 869 WFQTPEGWTMDGHYRSLIYMRPLAIWGMQWALSLPKAILDAPQINMMDR 917
>gi|225433442|ref|XP_002285674.1| PREDICTED: non-lysosomal glucosylceramidase-like [Vitis vinifera]
Length = 978
Score = 878 bits (2268), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 451/901 (50%), Positives = 600/901 (66%), Gaps = 60/901 (6%)
Query: 19 DPGQPPQLTWQRKLDSNVKVPTGFTLSFQEFRHLAHIGFRLYRYSKQEEAKGKVPVFNVF 78
D PP+ W+R+L+S+ + F+++F E + +G RL+ Y ++E ++G+ + F
Sbjct: 31 DSAAPPEQAWRRRLNSHANILKEFSVTFTEAIKMIRLGIRLWSYIREEASQGRKAPIDPF 90
Query: 79 RKHHI--TSDQGIPLGGIGAGSIGRSYRGEFQRFKLFHGICDDAPVLANQFSVFVSRPNG 136
+ ++ QG+PLGG+G+GSI R +RGEF+ +++ G CD +P++ANQFS+F+SR G
Sbjct: 91 TRETCKPSASQGVPLGGMGSGSISRGFRGEFRHWQIVPGTCDASPIMANQFSIFISREGG 150
Query: 137 -EKFSSVLCPRSPGVPKKNTDSGIESWDWNLKGENCTYHALFPRAWTVYDGEPDPELRIV 195
+K++SVL P K+ D GI SW WNL G++ TYHALFPRAWT+YDGEPDPEL++
Sbjct: 151 NKKYASVLAPGQHEGLGKSGDQGISSWGWNLSGQHSTYHALFPRAWTIYDGEPDPELKVS 210
Query: 196 CRQISPFIPHNYKESSFPASVFTFTLSNSGQTSADVTLLFTWANSVAGDSGLSGHHFNSK 255
CRQISPFIPHNY++SS P +VF +TL N+G+ A V+LLFTWANS+ G S LSG H N
Sbjct: 211 CRQISPFIPHNYRDSSLPTAVFVYTLVNTGKERAKVSLLFTWANSIGGISHLSGDHVNEP 270
Query: 256 TMTKDGVHGLTLHHRTANGRPPVTFAVAAEETADVHVSECPCFLLSGNSKGITAKDMWNE 315
+ +DGV G+ LHH+TA PPVTFA+AA ET +V V+ P F LS S ITAKDMW +
Sbjct: 271 FIGEDGVSGVLLHHKTAKENPPVTFAIAACETQNVSVTVLPSFGLSEGSH-ITAKDMWGK 329
Query: 316 IKKHGSFDHLD--NDKTSPSEPGSSIGAAIAASLTIPSGSTRSVTFSLAWDCPEVKFFE- 372
+ + G FD + + ++ PS PG ++ AA++AS + +V F+LAW P+VKF +
Sbjct: 330 MVQDGQFDRENCYSGRSMPSSPGETLCAAVSASAWVEPHGKCTVAFALAWSSPKVKFLKG 389
Query: 373 KVYHRRYTKFYGTLGDSAARIAHDAILEHAKWECEIEAWQRPILEDKRFPEWYPITLFNE 432
YHRRYTK+YGT +A I HDA+ + +WE EIE WQ PIL D R PEWY TLFNE
Sbjct: 390 SSYHRRYTKYYGTSERAALNIVHDALTNYKQWEEEIEKWQSPILRDDRLPEWYKFTLFNE 449
Query: 433 LYYLNAGGTIWTDGSPPMQSLATIRERKFSLDTPSKIHTPASSDT--------------- 477
LY+L AGGT+W D S P S + +++ + T A ++
Sbjct: 450 LYFLVAGGTVWIDSSLPATSSKNSLHQSAAVENTNVNVTVAKGNSRRGAAVENSVTDGYD 509
Query: 478 ALGTRLLENGEENI-------------------------------------GQFLYLEGA 500
A+ + LE EE I G+FLYLEG
Sbjct: 510 AISRKGLEYDEEEIHTRNTCEEKPVIPQESNSHHSIHKDTLKDPQDETDDVGRFLYLEGV 569
Query: 501 EYVMYNTYDVHFYSSFALVMLFPKLELSIQRDFAAAVMMHDPGTMKIMSDGKWVARKCLG 560
EY+M+ TYDVHFY+SFAL+ LFPK+ELSIQR+FA AV+ D +K +++G W RK G
Sbjct: 570 EYIMWCTYDVHFYASFALLELFPKIELSIQREFAKAVLSEDGRRVKFLAEGNWGIRKVRG 629
Query: 561 AVPHDIGLDDPWFEINSYNLFNSSRWKDLNSKFVLQVYRDFVATGDKNFARAVWPSVYIA 620
AVPHD+G DPW E+N+YN+ ++S+WKDLN KFVLQVYRDF AT D +F VWP+V A
Sbjct: 630 AVPHDLGTHDPWHEMNAYNIHDTSQWKDLNPKFVLQVYRDFAATRDFSFGADVWPAVRAA 689
Query: 621 MAYMEQFDKDGDGMIENEGFPDQTYDAWSANGVSAYCGGLWVAALQAASALANDVGDHAS 680
M YMEQFD+D DG+IEN+GFPDQTYD W+ +G+SAYCG LW+AALQAA+A+A +GD
Sbjct: 690 MEYMEQFDRDSDGLIENDGFPDQTYDTWTVHGISAYCGCLWLAALQAAAAMALQLGDKPF 749
Query: 681 ASYFWVRYQKAKAVY-DSLWNGSYFNYDNSDGSSSTSIQADQLAGQWYARACGLLPIADE 739
A ++ KAK V+ + LWNGSYFNYD+ S+S SIQADQLAGQWY + GL + D+
Sbjct: 750 AEKCKSKFFKAKLVFEEKLWNGSYFNYDSGSSSNSKSIQADQLAGQWYTASSGLPSLFDD 809
Query: 740 AKVKKALTKIYDFNVLKVKGGMCGAMNGMQPDGRIDMSGLQAREIWPGVTYGLAASMIQE 799
K+K +L KIYDFNV+KVKGG GA+NGM P+G++D S +Q+REIW GVTYG+AA+MI
Sbjct: 810 YKIKSSLHKIYDFNVMKVKGGKMGAVNGMHPNGKVDESCMQSREIWTGVTYGVAATMILS 869
Query: 800 EMVDMAFQTAAGVYEVAWSEDGLGYSFQTPESWNNNDEYRSLCYMRPLTIWAMQWALTKP 859
M + AF TA G++ WSE+G GY FQTPE W + +RSL YMRPL IW MQWAL+ P
Sbjct: 870 GMEEQAFTTAEGIFTAGWSEEGYGYWFQTPEGWTIDGHFRSLIYMRPLAIWGMQWALSMP 929
Query: 860 K 860
+
Sbjct: 930 R 930
>gi|356535470|ref|XP_003536268.1| PREDICTED: non-lysosomal glucosylceramidase-like [Glycine max]
Length = 949
Score = 871 bits (2251), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 448/885 (50%), Positives = 609/885 (68%), Gaps = 32/885 (3%)
Query: 19 DPGQPPQLTWQRKLDSNVKVPTGFTLSFQEFRHLAHIGFRLYRYSKQEEAKGKVPVFNVF 78
D PP+ W+R+L+S+ + F ++F E + +G R++ Y ++E + G+ + F
Sbjct: 31 DSSAPPEQAWRRRLNSHANLLKEFRVTFTEAIKMVRLGIRIWSYVREEASHGRKAPIDPF 90
Query: 79 RKHHI--TSDQGIPLGGIGAGSIGRSYRGEFQRFKLFHGICDDAPVLANQFSVFVSRPNG 136
+ ++ QG+PLGG+G+GSI R +RGEF+++++ +C+ +PV++NQFS+F+SR G
Sbjct: 91 TRESCKPSASQGVPLGGMGSGSISRGFRGEFRQWQIIPSLCEASPVMSNQFSIFISREGG 150
Query: 137 EK-FSSVLCPRSPGVPKKNTDSGIESWDWNLKGENCTYHALFPRAWTVYDGEPDPELRIV 195
+K F+SVL P K D GI SW WNL G++ TYHALFPRAWTVYDGEPDPEL+I
Sbjct: 151 KKKFASVLAPGQHEGLGKPDDQGISSWGWNLSGQHSTYHALFPRAWTVYDGEPDPELKIS 210
Query: 196 CRQISPFIPHNYKESSFPASVFTFTLSNSGQTSADVTLLFTWANSVAGDSGLSGHHFNSK 255
CRQISPFIPHNY+ESS PA+VF +TL N+G+ A V+LLFTWANS+ G S SG H N
Sbjct: 211 CRQISPFIPHNYRESSLPAAVFVYTLVNTGKERAKVSLLFTWANSIGGSSHSSGDHVNEP 270
Query: 256 TMTKDGVHGLTLHHRTANGRPPVTFAVAAEETADVHVSECPCFLLSGNSKGITAKDMWNE 315
+DGV G+ L+H+TA G PPVTFA+AA ET +V+VS P F LS S +TAK MW++
Sbjct: 271 FKAEDGVSGVLLYHKTAKGNPPVTFAIAACETQNVNVSVLPSFGLSEESS-MTAKHMWSK 329
Query: 316 IKKHGSFD--HLDNDKTSPSEPGSSIGAAIAASLTIPSGSTRSVTFSLAWDCPEVKFFE- 372
+ K G FD + ++ + PS PG ++ AA+AAS + +V FSLAW P+VKF +
Sbjct: 330 MVKDGQFDQENFNSGPSMPSSPGETLCAAVAASTWVEPHGKCTVAFSLAWSSPKVKFVKG 389
Query: 373 KVYHRRYTKFYGTLGDSAARIAHDAILEHAKWECEIEAWQRPILEDKRFPEWYPITLFNE 432
++RRYTKFYGT +AA +AHDA+ + +WE EIE WQ P+L+D+ PEWY TLFNE
Sbjct: 390 STFNRRYTKFYGTSEKAAADLAHDALTHYNRWEEEIEKWQNPVLKDEALPEWYKFTLFNE 449
Query: 433 LYYLNAGGTIWTDGSPPMQS-------------LATIRERKFSLDTPSKIHTPASSDTAL 479
LY+L AGGTIW D SP + S A ++E + + + +++D+
Sbjct: 450 LYFLVAGGTIWID-SPVLSSNMRNDQDRVRELESAVVKETEDKMSDRKRTVVESTTDSTY 508
Query: 480 GTRL----------LENGEENIGQFLYLEGAEYVMYNTYDVHFYSSFALVMLFPKLELSI 529
+ + L ++++G+FLYLEG EY+M+ TYDVHFY+SFAL+ LFP++EL+I
Sbjct: 509 DSAVITGHDRADEKLYEDDDDVGRFLYLEGVEYIMWCTYDVHFYASFALLELFPRIELNI 568
Query: 530 QRDFAAAVMMHDPGTMKIMSDGKWVARKCLGAVPHDIGLDDPWFEINSYNLFNSSRWKDL 589
QRDFA AV+ D +K +++G W RK GAVPHD+G DPW E+N+YN+ ++S+WKDL
Sbjct: 569 QRDFARAVLCEDGRKVKFLAEGNWGIRKVYGAVPHDLGTHDPWHEMNAYNIHDTSKWKDL 628
Query: 590 NSKFVLQVYRDFVATGDKNFARAVWPSVYIAMAYMEQFDKDGDGMIENEGFPDQTYDAWS 649
N KFVLQVYRDF TGD F VWP+V AM YMEQFD+DGDG+IEN+GFPDQTYD W+
Sbjct: 629 NPKFVLQVYRDFATTGDLQFGVDVWPAVRAAMEYMEQFDRDGDGLIENDGFPDQTYDTWT 688
Query: 650 ANGVSAYCGGLWVAALQAASALANDVGDHASASYFWVRYQKAKAVY-DSLWNGSYFNYDN 708
+GVS YCG LW+AALQAA+ +A ++GD A ++ KAK + + LWNG+YFNYD+
Sbjct: 689 VHGVSTYCGCLWLAALQAAAVMALELGDREFAETCKRKFLKAKPAFEEKLWNGTYFNYDS 748
Query: 709 SDGSSSTSIQADQLAGQWYARACGLLPIADEAKVKKALTKIYDFNVLKVKGGMCGAMNGM 768
+S SIQADQLAGQWY + GL + +++K+K AL K+YDFNV+KVKGG GA+NGM
Sbjct: 749 GSSGNSKSIQADQLAGQWYTASSGLPSLFEDSKIKSALRKVYDFNVMKVKGGRMGAVNGM 808
Query: 769 QPDGRIDMSGLQAREIWPGVTYGLAASMIQEEMVDMAFQTAAGVYEVAWSEDGLGYSFQT 828
P+G++D + +Q+RE+W GVTYGLAA+MI M + AF TA G++ WSEDG GY FQT
Sbjct: 809 HPNGKVDETCMQSREVWTGVTYGLAATMIHAGMEEEAFTTAEGIFLAGWSEDGYGYWFQT 868
Query: 829 PESWNNNDEYRSLCYMRPLTIWAMQWALTKPKLSRQEIKHEISDR 873
PE+W + YRSL YMRPL IW MQ+A+ +PK + K I DR
Sbjct: 869 PEAWTMDGHYRSLMYMRPLAIWGMQYAINRPKAILEAPKINIMDR 913
>gi|224072655|ref|XP_002303825.1| predicted protein [Populus trichocarpa]
gi|222841257|gb|EEE78804.1| predicted protein [Populus trichocarpa]
Length = 966
Score = 860 bits (2221), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 445/934 (47%), Positives = 601/934 (64%), Gaps = 67/934 (7%)
Query: 10 ENAASMPKVDPGQPPQLTWQRKLDSNVKVPTGFTLSFQEFRHLAHIGFRLYRYSKQEEAK 69
N + D PP+ W+R+L+S+ + F+++F+E + +G RL+ Y ++E +
Sbjct: 22 RNTLQLFDFDSAAPPEQAWRRRLNSHANILKEFSVTFKEAIQMVRLGIRLWSYVREEASH 81
Query: 70 GKVPVFNVFRKHHI--TSDQGIPLGGIGAGSIGRSYRGEFQRFKLFHGICDDAPVLANQF 127
G+ + F + ++ QG+PLGG+G+GSI R +RGEF+++++ GIC+ +P+
Sbjct: 82 GRKAPIDPFTRESCKPSASQGVPLGGMGSGSISRGFRGEFRQWQIVPGICESSPI----- 136
Query: 128 SVFVSRPNGEK-FSSVLCPRSPGVPKKNTDSGIESWDWNLKGENCTYHALFPRAWTVYDG 186
F+SR G K ++SVL P K D GI SW WNL G++ TYHALFPRAWTVYDG
Sbjct: 137 --FISRDGGNKNYASVLAPGQHEGIGKAGDQGISSWGWNLSGQHSTYHALFPRAWTVYDG 194
Query: 187 EPDPELRIVCRQISPFIPHNYKESSFPASVFTFTLSNSGQTSADVTLLFTWANSVAGDSG 246
EPDPEL+I CRQISPFIPHNY++SS P +VF +TL N+G+ A V+LLFTWANS+ G S
Sbjct: 195 EPDPELKISCRQISPFIPHNYRDSSLPTAVFVYTLVNTGKERAKVSLLFTWANSIGGISH 254
Query: 247 LSGHHFNSKTMTKDGVHGLTLHHRTANGRPPVTFAVAAEETADVHVSECPCFLLSGNSKG 306
LSG H N + +DGV G+ LHH+ N PPVTFA+AA ET +V V+ P F LS S
Sbjct: 255 LSGDHVNEPFIGEDGVSGVLLHHKQGN--PPVTFAIAACETQNVSVTVLPSFGLSEGS-C 311
Query: 307 ITAKDMWNEIKKHGSFD--HLDNDKTSPSEPGSSIGAAIAASLTIPSGSTRSVTFSLAWD 364
TAK MW + + G FD + + + PS PG ++ AA++AS + +V F+LAW
Sbjct: 312 TTAKAMWGTMVQDGHFDRGNFNWGPSMPSSPGETLCAAVSASAWVEPHGKCTVAFALAWS 371
Query: 365 CPEVKFFE-KVYHRRYTKFYGTLGDSAARIAHDAILEHAKWECEIEAWQRPILEDKRFPE 423
P++KF + YHRRYTKFYGT +A + HDA+ + +WE EIE WQ PIL+D++ PE
Sbjct: 372 SPKIKFLKGSSYHRRYTKFYGTSERAAQNLVHDALTNYKQWEEEIEKWQDPILKDEKLPE 431
Query: 424 WYPITLFNELYYLNAGGTIWTDGSPPMQSLATIRERKFSLDTPS-KIHTP---------- 472
WY TLFNELY+L AGGT+W D S R ++T K+ P
Sbjct: 432 WYKFTLFNELYFLVAGGTVWIDSSLSSADTRNGHHRSREVETTGIKVTEPQVNCNGGPDH 491
Query: 473 -------------ASSDTALGTRLLENGE------------------------ENIGQFL 495
++ A T+ + E +++G+FL
Sbjct: 492 TTTNDHNTTSSEQKENNKAFHTKCICKDESAVSRERGNLDHTLDPFTFLDPLSDDVGRFL 551
Query: 496 YLEGAEYVMYNTYDVHFYSSFALVMLFPKLELSIQRDFAAAVMMHDPGTMKIMSDGKWVA 555
YLEG EY+M+ TYDVHFY+SFAL+ LFPK+EL+IQRDFA AV+ D ++ ++DG
Sbjct: 552 YLEGVEYIMWCTYDVHFYASFALLALFPKIELNIQRDFAKAVLSEDGRKVRFLADGSVGI 611
Query: 556 RKCLGAVPHDIGLDDPWFEINSYNLFNSSRWKDLNSKFVLQVYRDFVATGDKNFARAVWP 615
RK GAVPHD+G DPW E+N+YN+ ++S+WKDLN KFVLQVYRDF ATGD +F VWP
Sbjct: 612 RKARGAVPHDLGTHDPWNEMNAYNIHDTSKWKDLNPKFVLQVYRDFAATGDMSFGVDVWP 671
Query: 616 SVYIAMAYMEQFDKDGDGMIENEGFPDQTYDAWSANGVSAYCGGLWVAALQAASALANDV 675
+V AM YMEQFD+D DG++EN+GFPDQTYDAW+ +GVSAYCG LW+A+LQAA+A+A +
Sbjct: 672 AVRTAMEYMEQFDRDDDGLVENDGFPDQTYDAWTVHGVSAYCGCLWLASLQAAAAMAMQL 731
Query: 676 GDHASASYFWVRYQKAKAVYDS-LWNGSYFNYDNSDGSSSTSIQADQLAGQWYARACGLL 734
GD A ++ KAK+ ++S LWNGSYFNYD+ ++S SIQADQLAG+WY + GL
Sbjct: 732 GDKYFAELCKSKFAKAKSAFESKLWNGSYFNYDSGSSNNSKSIQADQLAGEWYMASSGLP 791
Query: 735 PIADEAKVKKALTKIYDFNVLKVKGGMCGAMNGMQPDGRIDMSGLQAREIWPGVTYGLAA 794
+ D+ K++ AL KIYDFNV+KV+GG GA+NGM P+G++D + +Q+REIW GVTY +AA
Sbjct: 792 SLFDDVKIRSALNKIYDFNVMKVRGGKMGAVNGMHPNGKVDETCMQSREIWSGVTYAVAA 851
Query: 795 SMIQEEMVDMAFQTAAGVYEVAWSEDGLGYSFQTPESWNNNDEYRSLCYMRPLTIWAMQW 854
+MI M D AF TA G++ WSE+G GY FQTPE+W + +RSL YMRPL IW MQW
Sbjct: 852 TMILSGMEDKAFTTAEGIFTAGWSEEGYGYWFQTPEAWTIDGHFRSLIYMRPLAIWGMQW 911
Query: 855 ALTKPKLSRQEIKHEISDRDSSYLEQHAAFSKVA 888
AL+ PK K I +R S L FS +
Sbjct: 912 ALSLPKAILDAPKINIMER--SLLSPSTRFSLIG 943
>gi|11994237|dbj|BAB01359.1| unnamed protein product [Arabidopsis thaliana]
Length = 937
Score = 857 bits (2213), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 438/889 (49%), Positives = 600/889 (67%), Gaps = 49/889 (5%)
Query: 19 DPGQPPQLTWQRKLDSNVKVPTGFTLSFQEFRHLAHIGFRLYRYSKQEEAKGKVPVFNVF 78
D PP W+R+L+ + + FT++F+E + +G RL+ Y ++E + G+ + F
Sbjct: 31 DSAAPPPHAWRRRLNCHANILKEFTITFREAIKMVRLGIRLWSYVREEASHGRKAPIDPF 90
Query: 79 RKHHI--TSDQGIPLGGIGAGSIGRSYRGEFQRFKLFHGICDDAPVLANQFSVFVSRPNG 136
K + ++ QG+PLGG+G+GSI R +RGEF+++++ G CD +P+++NQFS+F+SR G
Sbjct: 91 TKENCKPSASQGVPLGGMGSGSISRGFRGEFKQWQITPGTCDPSPMMSNQFSIFISRDGG 150
Query: 137 -EKFSSVLCPRSPGVPKKNTDSGIESWDWNLKGENCTYHALFPRAWTVYDGEPDPELRIV 195
+K++SVL P G K+ D G+ SW WNL G++ TYHALFPRAWT+YDGEPDPEL+I
Sbjct: 151 HKKYASVLSPGQHGSLGKSRDKGLSSWGWNLNGQHSTYHALFPRAWTIYDGEPDPELKIS 210
Query: 196 CRQISPFIPHNYKESSFPASVFTFTLSNSGQTSADVTLLFTWANSVAGDSGLSGHHFNSK 255
CRQISPFIP+NY++SS PA+VF +TL N+G+ A V+LLFTWANS+ G S +SG H N
Sbjct: 211 CRQISPFIPNNYRDSSLPAAVFVYTLVNTGKERAKVSLLFTWANSMGGTSHMSGGHVNEP 270
Query: 256 TMTKDGVHGLTLHHRTANGRPPVTFAVAAEETADVHVSECPCFLLSGNSKGITAKDMWNE 315
MT G PPVTFA+AA ET +V+V+ PCF LS +S TAKDMW+
Sbjct: 271 FMT-------------GKGNPPVTFAIAASETQNVNVTVLPCFGLSEDSS-FTAKDMWDT 316
Query: 316 IKKHGSFD--HLDNDKTSPSEPGSSIGAAIAASLTIPSGSTRSVTFSLAWDCPEVKFFE- 372
+++ G FD + ++ ++PS G +I AA++AS + + +V+F+L+W P+VKF +
Sbjct: 317 MEQDGKFDQENFNSGPSTPSLAGDTICAAVSASAWVEAHGKCTVSFALSWSSPKVKFSKG 376
Query: 373 KVYHRRYTKFYGTLGDSAARIAHDAILEHAKWECEIEAWQRPILEDKRFPEWYPITLFNE 432
Y RRYTKFYGT +A + HDA+ + +WE +IEAWQ PIL D+R PEWY TLFNE
Sbjct: 377 STYDRRYTKFYGTSPRAALDLVHDALTNYKRWEEDIEAWQNPILRDERLPEWYKFTLFNE 436
Query: 433 LYYLNAGGTIWTDGSP---------PMQSLAT---------IRERKFSLDTPSKIHTPAS 474
LY+L AGGT+W D S L I +++ L + + ++
Sbjct: 437 LYFLVAGGTVWIDSSSLNANGNSQHQQSGLGNSDGKVGGLDINDQRNDLGNGNSVGVKSN 496
Query: 475 SDTA---------LGTRLLENGEENIGQFLYLEGAEYVMYNTYDVHFYSSFALVMLFPKL 525
+ + + T +++G++ +G+FLYLEG EYVM+ TYDVHFY+S+AL+MLFPK+
Sbjct: 497 DEVSAIHNRNGLFVDTPHVDDGDD-VGRFLYLEGVEYVMWCTYDVHFYASYALLMLFPKI 555
Query: 526 ELSIQRDFAAAVMMHDPGTMKIMSDGKWVARKCLGAVPHDIGLDDPWFEINSYNLFNSSR 585
EL+IQRDFA AV+ D +K +++G RK GAVPHD+G+ DPW E+N+YN+ ++SR
Sbjct: 556 ELNIQRDFAKAVLSEDGRKVKFLAEGNVGIRKVRGAVPHDLGMHDPWNEMNAYNIHDTSR 615
Query: 586 WKDLNSKFVLQVYRDFVATGDKNFARAVWPSVYIAMAYMEQFDKDGDGMIENEGFPDQTY 645
WKDLN KFVLQVYRDF ATGD F VWP+V AM YMEQFD+D D +IEN+GFPDQTY
Sbjct: 616 WKDLNPKFVLQVYRDFAATGDYQFGIDVWPAVRAAMEYMEQFDRDNDDLIENDGFPDQTY 675
Query: 646 DAWSANGVSAYCGGLWVAALQAASALANDVGDHASASYFWVRYQKAKAVYDS-LWNGSYF 704
D W+ +GVSAYCG LW+AALQAA+A+A +GD A ++ AKA ++ LWNGSYF
Sbjct: 676 DTWTVHGVSAYCGCLWLAALQAAAAMALQIGDKFFAELCKNKFLNAKAALETKLWNGSYF 735
Query: 705 NYDNSDGSSSTSIQADQLAGQWYARACGLLPIADEAKVKKALTKIYDFNVLKVKGGMCGA 764
NYD+ S+S SIQ DQLAGQWYA + GL P+ +E+K++ + KI+DFNV+K KGG GA
Sbjct: 736 NYDSGSSSNSKSIQTDQLAGQWYAASSGLPPLFEESKIRSTMQKIFDFNVMKTKGGKMGA 795
Query: 765 MNGMQPDGRIDMSGLQAREIWPGVTYGLAASMIQEEMVDMAFQTAAGVYEVAWSEDGLGY 824
+NGM PDG++D + +Q+REIW GVTY AA+MI M + F TA G++ WSE+G GY
Sbjct: 796 VNGMHPDGKVDDTCMQSREIWTGVTYAAAATMILSGMEEQGFTTAEGIFTAGWSEEGFGY 855
Query: 825 SFQTPESWNNNDEYRSLCYMRPLTIWAMQWALTKPKLSRQEIKHEISDR 873
FQTPE W + YRSL YMRPL IW MQWAL+ PK + + DR
Sbjct: 856 WFQTPEGWTMDGHYRSLIYMRPLAIWGMQWALSLPKAILDAPQINMMDR 904
>gi|222612993|gb|EEE51125.1| hypothetical protein OsJ_31868 [Oryza sativa Japonica Group]
Length = 935
Score = 847 bits (2187), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 452/920 (49%), Positives = 596/920 (64%), Gaps = 95/920 (10%)
Query: 9 GENAASMPKVDPGQPPQLTWQRKLDSNVKVPTGFTLSFQEFRHLAHIGFRLYRYSKQEEA 68
G A + D G PP+ W+R+L+S+ + F+++F E +
Sbjct: 21 GRTALQLLDFDGGSPPEQAWRRRLNSHANLLKEFSVTFMEAMRM---------------- 64
Query: 69 KGKVPVFNVFRKHHI--TSDQGIPLGGIGAGSIGRSYRGEFQRFKLFHGICDDAPVLANQ 126
K P+ + F K ++ QG+PLGG+G+GSI R +RGEF+ + + G+C+ +P+
Sbjct: 65 --KAPI-DPFTKEKCKPSASQGVPLGGMGSGSISRGFRGEFKNWHIIPGLCETSPI---- 117
Query: 127 FSVFVSRPNG-EKFSSVLCPRSPGVPKKNTDSGIESWDWNLKGENCTYHALFPRAWTVYD 185
FVSR G +K+SSVL P KK DSGI SWDWNL G++ TYHALFPRAWTVYD
Sbjct: 118 ---FVSRDGGNKKYSSVLSPGHHEGLKKCNDSGISSWDWNLSGQHSTYHALFPRAWTVYD 174
Query: 186 GEPDPELRIVCRQISPFIPHNYKESSFPASVFTFTLSNSGQTSADVTLLFTWANSVAGDS 245
GEPDP+L+I CRQISPFIPH+YK+SS P SVF +TL N+G+ A V+LL TWANS+ G S
Sbjct: 175 GEPDPDLKISCRQISPFIPHDYKDSSLPTSVFVYTLVNTGKDRAKVSLLMTWANSIGGFS 234
Query: 246 GLSGHHFNSKTMTKDGVHGLTLHHRTANGRPPVTFAVAAEETADVHVSECPCFLLSGNSK 305
SG HFN +T A PPVTFA+AA ET +V+V+ P F LSG
Sbjct: 235 HHSGGHFNEPFIT-------------AKDNPPVTFAIAACETQNVNVTVLPVFGLSGEGH 281
Query: 306 GITAKDMWNEIKKHGSFD--HLDNDKTSPSEPGSSIGAAIAASLTIPSGSTRSVTFSLAW 363
+AK MW+ +K++G FD + + + PS G ++ AA++AS + +V F LAW
Sbjct: 282 D-SAKQMWDRMKQNGHFDRENFEAGTSMPSSSGETLCAAVSASTWVEPHGRCTVVFGLAW 340
Query: 364 DCPEVKFFEK-VYHRRYTKFYGTLGDSAARIAHDAILEHAKWECEIEAWQRPILEDKRFP 422
P++KF + Y+RRYT+FYGT SA + HDA+ ++ WE EIE WQ PIL++++ P
Sbjct: 341 SSPKIKFQKGCTYNRRYTEFYGTSERSAVNLVHDALTKYRIWEEEIEKWQNPILKNEKLP 400
Query: 423 EWYPITLFNELYYLNAGGTIWTDGSPPM-------------QSLATIRERKF-------- 461
EWY TLFNELY+L AGGT+WTDG PP+ S R+ K
Sbjct: 401 EWYKFTLFNELYFLVAGGTVWTDGQPPVIDEKPSPGSNQQKSSKRGTRDTKQESVKDNHV 460
Query: 462 -------------------SLDTPSKIHTP--ASSDTALGTR------LLENGEENIGQF 494
S+ + +H P A + LG++ L +NG EN+G+F
Sbjct: 461 KLTAEQVTNGGDLANGEEQSVSKYAAVHGPQMAKATNGLGSQEPIPYLLSKNGPENVGKF 520
Query: 495 LYLEGAEYVMYNTYDVHFYSSFALVMLFPKLELSIQRDFAAAVMMHDPGTMKIMSDGKWV 554
LYLEG EY+M+ TYDVHFY+SFAL+ LFPK+ELSIQRDFA AV+ D MK ++DG
Sbjct: 521 LYLEGVEYIMWCTYDVHFYASFALLDLFPKIELSIQRDFANAVLYEDRRRMKFLADGTSG 580
Query: 555 ARKCLGAVPHDIGLDDPWFEINSYNLFNSSRWKDLNSKFVLQVYRDFVATGDKNFARAVW 614
RK GAVPHD+G DPW E+N+YN+ ++S+WKDLN KFVLQVYRDF ATGD +F R VW
Sbjct: 581 IRKVKGAVPHDLGTHDPWHEMNAYNIHDTSKWKDLNPKFVLQVYRDFAATGDMSFGRDVW 640
Query: 615 PSVYIAMAYMEQFDKDGDGMIENEGFPDQTYDAWSANGVSAYCGGLWVAALQAASALAND 674
P+V M YM QFD+DGDG+IEN+GFPDQTYDAW+ +G+SAYCGGLW+AALQAA+ +A+
Sbjct: 641 PAVCAVMDYMNQFDRDGDGLIENDGFPDQTYDAWTVHGISAYCGGLWLAALQAAATMAHR 700
Query: 675 VGDHASASYFWVRYQKAKAVYDS-LWNGSYFNYDNSDGSSSTSIQADQLAGQWYARACGL 733
+GD A + +++ +AKAVY++ LWNGSYFNYD+ S+S SIQADQLAGQWYA + GL
Sbjct: 701 LGDRPFAEKYKLKFIQAKAVYEAKLWNGSYFNYDSGTSSNSRSIQADQLAGQWYAASSGL 760
Query: 734 LPIADEAKVKKALTKIYDFNVLKVKGGMCGAMNGMQPDGRIDMSGLQAREIWPGVTYGLA 793
P+ DE K++ AL KI++FNV+KVKGG GA+NGM P+G++D + +Q+REIW GVTYG+A
Sbjct: 761 PPLFDENKIRSALQKIFEFNVMKVKGGRLGAVNGMTPNGKVDETCMQSREIWTGVTYGVA 820
Query: 794 ASMIQEEMVDMAFQTAAGVYEVAWSEDGLGYSFQTPESWNNNDEYRSLCYMRPLTIWAMQ 853
A+M+ M F TA G++ WSE+G GY FQTPE W + YRSL YMRPL IWAMQ
Sbjct: 821 ANMLLHGMEHQGFTTAEGIFIAGWSEEGYGYWFQTPEGWTIDGHYRSLIYMRPLAIWAMQ 880
Query: 854 WALTKPKLSRQEIKHEISDR 873
WA + PK K + DR
Sbjct: 881 WARSPPKAILDAPKVNLMDR 900
>gi|302762052|ref|XP_002964448.1| hypothetical protein SELMODRAFT_166634 [Selaginella moellendorffii]
gi|300168177|gb|EFJ34781.1| hypothetical protein SELMODRAFT_166634 [Selaginella moellendorffii]
Length = 916
Score = 816 bits (2109), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 463/929 (49%), Positives = 613/929 (65%), Gaps = 58/929 (6%)
Query: 24 PQLTWQRKLDSNVKVPTGFTLSFQEFRHLAHIGFRLYRYSKQEEAKGKVPVFNVFRKHHI 83
P L W+R+LD+ V T F + +E + +G RL+++ + E +G+ + F
Sbjct: 4 PALAWKRRLDTPVVELTEFKPTLREILEMLPMGVRLWKFCRSEHEQGRRAPIDPFYPQDT 63
Query: 84 --TSDQGIPLGGIGAGSIGRSYRGEFQRFKLFHGICDDAPVLANQFSVFVSRPNG----E 137
+S QG+PLGGIG GSIGR +RG+F R++L +C + VLA+QFSVF++R + +
Sbjct: 64 PASSCQGVPLGGIGGGSIGRGFRGDFSRWQLLPTVCQTSAVLADQFSVFITRHSKDGTTQ 123
Query: 138 KFSSVLCPRSPGVPKKNTDSGIESWDWNLKGENCTYHALFPRAWTVYDGEPDPELRIVCR 197
K SSVL P P K T+SGI SW+WNL G TY+ LFPRAWT YDGEPDP+++I CR
Sbjct: 124 KTSSVLFPGEPVFESKETESGISSWNWNLTGVKSTYYGLFPRAWTTYDGEPDPQMKITCR 183
Query: 198 QISPFIPHNYKESSFPASVFTFTLSNSGQTSADVTLLFTWANSVAGDSGLSGHHFNSKTM 257
Q+SPF+PHNYKESSFP+ VF + L N+G A TLLFTWANS+AG S SG H NS M
Sbjct: 184 QVSPFVPHNYKESSFPSCVFIYELVNTGDEDATATLLFTWANSIAGGSETSGGHENSPFM 243
Query: 258 TKDGVHGLTLHHRTANGRPPVTFAVAAEETADVHVSECPCFLLSGNSKG---ITAKDMWN 314
++ VTFA+AA ET DV S CPCFL++G+ +G TAKDMW+
Sbjct: 244 KEEH---------------SVTFAIAARETDDVSTSVCPCFLVAGSPEGNSYQTAKDMWS 288
Query: 315 EIKKHGSFDHLDNDKT-SPSEPGSSIGAAIAASLTIPSGSTRSVTFSLAWDCPEVKFFE- 372
EI + G F+ ++ + S S+P + +GAA+AAS+ +P+ T+S+TF LAWD P+ F
Sbjct: 289 EITEKGCFNAKNSTMSASLSQPETVVGAAVAASVVVPAHGTKSITFGLAWDSPKATFLGG 348
Query: 373 KVYHRRYTKFYGTLGDSAARIAHDAILEHAKWECEIEAWQRPILEDKRFPEWYPITLFNE 432
Y+RRYTKFYG G++A+++ HDAIL + +WE EI AWQ PIL D PEWY TLFNE
Sbjct: 349 SSYYRRYTKFYGKSGNAASKLVHDAILNYKRWESEIHAWQEPILNDHSLPEWYRYTLFNE 408
Query: 433 LYYLNAGGTIWTDGSPPMQSLATIRERKFSLDTP-----SKIHTPASSD---TALGTRLL 484
LYYL AGGTIWT + + + FS DT ++ P S+ AL +
Sbjct: 409 LYYLVAGGTIWTGMNMKRIYTTLLCDLSFS-DTCRQACLAQKLNPVKSNGVSEALLDAIN 467
Query: 485 ENGE-------------ENIGQFLYLEGAEYVMYNTYDVHFYSSFALVMLFPKLELSIQR 531
N E +N+GQFLYLEG EY +NTYDVHFY+S+AL+ LFP+LEL +QR
Sbjct: 468 ANSEAQSFWSIGIPAVEDNVGQFLYLEGVEYHFWNTYDVHFYASYALLALFPRLELQLQR 527
Query: 532 DFAAAVMMHDPGTMKIMSDGKWVARKCLGAVPHDIGLDDPWFEINSYNLFNSSRWKDLNS 591
DFAAA + HDP M ++ G R G+VPHD+G DPW +NSYNL ++S+WKDLN
Sbjct: 528 DFAAATLSHDPEKMFYLAYGNTGIRAVFGSVPHDLGTHDPWNTLNSYNLHDTSKWKDLNC 587
Query: 592 KFVLQVYRDFVATGDKNFARAVWPSVYIAMAYMEQFDKDGDGMIENEGFPDQTYDAWSAN 651
KFV+QV RD ATGD FARAVWP+ Y AMA +QFD+DGDGMIEN+G PDQTYD W+ +
Sbjct: 588 KFVVQVNRDAAATGDLEFARAVWPAAYAAMAVTDQFDRDGDGMIENDGVPDQTYDLWTVD 647
Query: 652 GVSAYCGGLWVAALQAASALANDVGDHASASYFWVRYQKAKAVYD-SLWNGSYFNYDNSD 710
GVS+YCGGLWVAALQAASALA+ VGD S+ F ++ +AKAVY+ SLWNGSYF+YD
Sbjct: 648 GVSSYCGGLWVAALQAASALADLVGDKPSSRMFHGKFLRAKAVYEKSLWNGSYFHYDK-- 705
Query: 711 GSSSTSIQADQLAGQWYARACGLLPIADEAKVKKALTKIYDFNVLKVKGGMCGAMNGMQP 770
+STS+Q+DQLAGQWY+ A GL I E + AL ++ NV++ KGG GA+NGM P
Sbjct: 706 --NSTSVQSDQLAGQWYSLASGLPGIVREDQATSALGTVFATNVMRYKGGNEGAVNGMLP 763
Query: 771 DGRIDMSGLQAREIWPGVTYGLAASMIQEEMVDMAFQTAAGVYEVAWSEDGLGYSFQTPE 830
G D + LQ+RE+W G TY +AA+MIQ+ M + F+TA GV+ WS+ G GY+FQTPE
Sbjct: 764 SGGEDRTSLQSREVWAGTTYAVAAAMIQQGMREEGFRTAKGVFLNGWSDQGHGYAFQTPE 823
Query: 831 SWNNNDEYRSLCYMRPLTIWAMQWALTKP-KLSRQEIKHEISDRDSSYLEQ-HAAFSKVA 888
+W+N+ +YRSL YMRPL+IWAMQWAL P +L ++ E+ D + +E + +F+K+A
Sbjct: 824 AWDNDGKYRSLAYMRPLSIWAMQWALDPPEQLKQRSCSVEVGDITAQEVEACNKSFAKLA 883
Query: 889 SLLKLPKEEASKGFLKV---VYDFTIGRV 914
S LK +K + V ++D+ RV
Sbjct: 884 SQLKELSPPEAKAYSSVWIWMFDYIRSRV 912
>gi|302787158|ref|XP_002975349.1| hypothetical protein SELMODRAFT_174907 [Selaginella moellendorffii]
gi|300156923|gb|EFJ23550.1| hypothetical protein SELMODRAFT_174907 [Selaginella moellendorffii]
Length = 916
Score = 816 bits (2108), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 460/928 (49%), Positives = 613/928 (66%), Gaps = 56/928 (6%)
Query: 24 PQLTWQRKLDSNVKVPTGFTLSFQEFRHLAHIGFRLYRYSKQEEAKGKVPVFNVFRKHHI 83
P L W+R+LD+ V F +F+E + +G RL+++ + E +G+ + F
Sbjct: 4 PALAWKRRLDTPVMELKEFKPTFREILEMLPMGVRLWKFCRSEHEQGRRAPIDPFYPQDT 63
Query: 84 --TSDQGIPLGGIGAGSIGRSYRGEFQRFKLFHGICDDAPVLANQFSVFVSRPNG----E 137
+S QG+PLGGIG GSIGR +RG+F R++L +C + VLA+QFSVF++R + +
Sbjct: 64 PASSCQGVPLGGIGGGSIGRGFRGDFSRWQLLPTVCQTSAVLADQFSVFITRHSKDGTTQ 123
Query: 138 KFSSVLCPRSPGVPKKNTDSGIESWDWNLKGENCTYHALFPRAWTVYDGEPDPELRIVCR 197
K SSVL P P + T+SGI SW+WNL G TY+ LFPRAWT YDGEPDP+++I CR
Sbjct: 124 KTSSVLFPGEPVFESEETESGISSWNWNLTGVKSTYYGLFPRAWTTYDGEPDPQMKITCR 183
Query: 198 QISPFIPHNYKESSFPASVFTFTLSNSGQTSADVTLLFTWANSVAGDSGLSGHHFNSKTM 257
Q+SPF+PHNYKESSFP+ VF + L N+G A TLLFTWANS+AG S SG H NS M
Sbjct: 184 QVSPFLPHNYKESSFPSCVFIYELVNTGDEDATATLLFTWANSIAGGSETSGGHENSPFM 243
Query: 258 TKDGVHGLTLHHRTANGRPPVTFAVAAEETADVHVSECPCFLLSGNSKG---ITAKDMWN 314
++ VTFA+AA ET DV S CPCFL++G +G TAKDMW+
Sbjct: 244 KEEH---------------SVTFAIAARETDDVSTSVCPCFLVAGAPEGNSYQTAKDMWS 288
Query: 315 EIKKHGSFDHLDNDKT-SPSEPGSSIGAAIAASLTIPSGSTRSVTFSLAWDCPEVKFFE- 372
EI + G F+ ++ + S S+P + +GAA+AAS+ +P+ T+S+TF LAWD P+ F
Sbjct: 289 EIIEKGCFNAKNSTMSASLSQPETVVGAAVAASVVVPAHGTKSITFGLAWDSPKATFLGG 348
Query: 373 KVYHRRYTKFYGTLGDSAARIAHDAILEHAKWECEIEAWQRPILEDKRFPEWYPITLFNE 432
Y+RRYTKFYG G++A+++ HDAIL + +WE EI AWQ PIL D PEWY TLFNE
Sbjct: 349 SSYYRRYTKFYGKSGNAASKLVHDAILNYKRWESEIHAWQEPILNDHSLPEWYRYTLFNE 408
Query: 433 LYYLNAGGTIWTDGSPPMQSLATIRERKFSLDTP----SKIHTPASSD---TALGTRLLE 485
LYYL AGGTIWT + + + FS ++ P S+ AL +
Sbjct: 409 LYYLVAGGTIWTGMNMKRIYTTLLCDLSFSDSCRQACLAQKLNPVKSNGVSEALLDAINA 468
Query: 486 NGE-------------ENIGQFLYLEGAEYVMYNTYDVHFYSSFALVMLFPKLELSIQRD 532
N E +N+GQFLYLEG EY +NTYDVHFY+S+AL+ LFP+LEL +QRD
Sbjct: 469 NSEAQSFWSFDIPAVEDNVGQFLYLEGVEYHFWNTYDVHFYASYALLALFPRLELQLQRD 528
Query: 533 FAAAVMMHDPGTMKIMSDGKWVARKCLGAVPHDIGLDDPWFEINSYNLFNSSRWKDLNSK 592
FAAA + HDP M ++ G R G+VPHD+G DPW +NSYNL+++S+WKDLN K
Sbjct: 529 FAAATLSHDPEKMFYLAYGNTGIRAVFGSVPHDLGTHDPWNTLNSYNLYDTSKWKDLNCK 588
Query: 593 FVLQVYRDFVATGDKNFARAVWPSVYIAMAYMEQFDKDGDGMIENEGFPDQTYDAWSANG 652
FV+QV RD ATGD +FARAVWP+ Y AMA+ +QFD+DGDGMIEN+G PDQTYD W+ +G
Sbjct: 589 FVVQVNRDAAATGDLDFARAVWPAAYAAMAFTDQFDRDGDGMIENDGVPDQTYDLWTVDG 648
Query: 653 VSAYCGGLWVAALQAASALANDVGDHASASYFWVRYQKAKAVYD-SLWNGSYFNYDNSDG 711
VS+YCGGLWVAALQAASALA+ VGD S+ F ++ +AKAVY+ SLWNGSYF+YD
Sbjct: 649 VSSYCGGLWVAALQAASALADLVGDKPSSRMFHDKFLRAKAVYEKSLWNGSYFHYDK--- 705
Query: 712 SSSTSIQADQLAGQWYARACGLLPIADEAKVKKALTKIYDFNVLKVKGGMCGAMNGMQPD 771
+STS+Q+DQLAGQWY+ A GL I E + AL ++ NV++ KGG GA+NGM P
Sbjct: 706 -NSTSVQSDQLAGQWYSLASGLPGIVREDQATSALGTVFATNVMRYKGGNEGAVNGMLPS 764
Query: 772 GRIDMSGLQAREIWPGVTYGLAASMIQEEMVDMAFQTAAGVYEVAWSEDGLGYSFQTPES 831
G D + LQ+RE+W G TY +AA+MIQ+ M + F+TA GV+ WS+ G GY+FQTPE+
Sbjct: 765 GGEDRTSLQSREVWVGTTYAVAAAMIQQGMREEGFRTAKGVFLNGWSDQGHGYAFQTPEA 824
Query: 832 WNNNDEYRSLCYMRPLTIWAMQWALTKP-KLSRQEIKHEISDRDSSYLEQ-HAAFSKVAS 889
W+N+ +YRSL YMRPL+IWAMQWAL P +L ++ E+ D + +E + +F+K+AS
Sbjct: 825 WDNDGKYRSLAYMRPLSIWAMQWALDPPEQLKQRSCNVEVGDITAQEVEACNKSFAKLAS 884
Query: 890 LLKLPKEEASKGFLKV---VYDFTIGRV 914
LK +K + V ++D+ RV
Sbjct: 885 QLKELSPPETKAYSSVWIWMFDYIRSRV 912
>gi|218199525|gb|EEC81952.1| hypothetical protein OsI_25831 [Oryza sativa Indica Group]
Length = 850
Score = 785 bits (2027), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 403/742 (54%), Positives = 515/742 (69%), Gaps = 29/742 (3%)
Query: 147 SPGVP----KKNTDSGIESWDWNLKGENCTYHALFPRAWTVYDGEPDPELRIVCRQISPF 202
S GVP K+ D GI SWDW L+G+ TYHALFPRAWTVYDGEPDPEL++ CRQISPF
Sbjct: 92 SQGVPLGGMGKSNDDGISSWDWKLRGDRSTYHALFPRAWTVYDGEPDPELKVSCRQISPF 151
Query: 203 IPHNYKESSFPASVFTFTLSNSGQTSADVTLLFTWANSVAGDSGLSGHHFNSKTMTKDGV 262
IPHNY+ESS P SVF +T+ N+G+ A V+L+ TWANS+ G S SG H N MT
Sbjct: 152 IPHNYQESSLPTSVFVYTIVNTGKERAKVSLVMTWANSIGGLSHHSGGHVNEPFMT---- 207
Query: 263 HGLTLHHRTANGRPPVTFAVAAEETADVHVSECPCFLLSGNSKGITAKDMWNEIKKHGSF 322
AN PPVTFAVAA E +V+V+ P F LSG S +TA++MW + + G F
Sbjct: 208 ---------ANNNPPVTFAVAACENQNVNVTVLPAFGLSGESS-VTAREMWGTLSQDGCF 257
Query: 323 DHLDNDKTSPSEP---GSSIGAAIAASLTIPSGSTRSVTFSLAWDCPEVKFFE-KVYHRR 378
D DN PS P G ++ AA+ AS + +V FSLAW P+VKF + Y+RR
Sbjct: 258 DR-DNFSAGPSMPSSLGDTVCAAVCASTWVEPHGRCTVVFSLAWSSPKVKFKKGNAYYRR 316
Query: 379 YTKFYGTLGDSAARIAHDAILEHAKWECEIEAWQRPILEDKRFPEWYPITLFNELYYLNA 438
YTKFYGT SA + D+++++ WE EI+ WQRPIL D+RFPEWY +TLFNELY+L +
Sbjct: 317 YTKFYGTSPRSAINLVQDSLMKYKHWEEEIDKWQRPILHDERFPEWYKVTLFNELYFLVS 376
Query: 439 GGTIWTDGSPPMQSLATIRERKFSLDTPSKIHTPASSDTALG----TRLLENGEENIGQF 494
GGT+W D + M + + S D +H +S ++A+ + + +G EN+G+F
Sbjct: 377 GGTVWIDSASLMADADEMMNSRLSEDNDLPLHH-SSRNSAVPLIGFSPHIIDGRENVGKF 435
Query: 495 LYLEGAEYVMYNTYDVHFYSSFALVMLFPKLELSIQRDFAAAVMMHDPGTMKIMSDGKWV 554
LYLEG EY M+ TYDVHFY+SFAL+ LFPK+ELSIQRDFA AV+ D ++ ++DG W
Sbjct: 436 LYLEGIEYFMWCTYDVHFYASFALLELFPKIELSIQRDFATAVLREDKSRVRFLADGTWG 495
Query: 555 ARKCLGAVPHDIGLDDPWFEINSYNLFNSSRWKDLNSKFVLQVYRDFVATGDKNFARAVW 614
RK +GAV HD+G DPW E+N+YN+ ++SRWKDLN KFVLQVYRDFVATGD +F + VW
Sbjct: 496 TRKVIGAVAHDLGAHDPWHELNAYNIHDTSRWKDLNPKFVLQVYRDFVATGDMSFGKDVW 555
Query: 615 PSVYIAMAYMEQFDKDGDGMIENEGFPDQTYDAWSANGVSAYCGGLWVAALQAASALAND 674
P+V AM YMEQFD D DGMIEN+GFPDQTYDAW+ GVSAYCG LW+AALQAA+ALA
Sbjct: 556 PAVCTAMEYMEQFDHDDDGMIENDGFPDQTYDAWTVRGVSAYCGCLWLAALQAAAALART 615
Query: 675 VGDHASASYFWVRYQKAKAVYDS-LWNGSYFNYDNSDGSSSTSIQADQLAGQWYARACGL 733
+G H A +++ KAK V+++ LWNGSYFNYD+ +S SI ADQLAGQWY + GL
Sbjct: 616 LGHHDYAERCMLKFAKAKPVFEAKLWNGSYFNYDSGTSYNSRSIHADQLAGQWYTASSGL 675
Query: 734 LPIADEAKVKKALTKIYDFNVLKVKGGMCGAMNGMQPDGRIDMSGLQAREIWPGVTYGLA 793
P+ DE ++K L KI+D+NV++VKGG GA+NGM P+G++D + +Q+REIW GVTY LA
Sbjct: 676 PPLFDEGRIKCTLQKIFDYNVMRVKGGRMGAVNGMHPNGKVDETCMQSREIWTGVTYSLA 735
Query: 794 ASMIQEEMVDMAFQTAAGVYEVAWSEDGLGYSFQTPESWNNNDEYRSLCYMRPLTIWAMQ 853
A+M+ M D AF TA G++ WSE+G GY FQTPE+W + YRSL YMRPL IWAMQ
Sbjct: 736 ATMLLHGMDDQAFTTAEGIFTAGWSEEGYGYWFQTPEAWTIDGHYRSLIYMRPLAIWAMQ 795
Query: 854 WALTKPKLSRQEIKHEISDRDS 875
+AL+ P+ + K DR S
Sbjct: 796 YALSPPRTILEAPKVNTMDRTS 817
Score = 57.4 bits (137), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 27/86 (31%), Positives = 47/86 (54%), Gaps = 2/86 (2%)
Query: 12 AASMPKVDPGQPPQLTWQRKLDSNVKVPTGFTLSFQEFRHLAHIGFRLYRYSKQEEAKGK 71
A + D G PP+ W+RKL S+ F ++F+E + +G RL+ Y ++E + G+
Sbjct: 16 AGDLLDFDDGSPPEYAWRRKLSSHANRLKEFNVTFREAIKMMKLGLRLWSYIQEEASHGR 75
Query: 72 VPVFNVFRKH--HITSDQGIPLGGIG 95
+ F + ++ QG+PLGG+G
Sbjct: 76 KAPIDPFTRESDKPSASQGVPLGGMG 101
>gi|4539009|emb|CAB39630.1| putative protein [Arabidopsis thaliana]
gi|7267702|emb|CAB78129.1| putative protein [Arabidopsis thaliana]
Length = 750
Score = 772 bits (1993), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 405/729 (55%), Positives = 494/729 (67%), Gaps = 90/729 (12%)
Query: 213 PASVFTFTLSNSGQTSADVTLLFTWANSVAGDSGLSGHHFNSKTM-TKDGVHGLTLHH-- 269
P F++SN G+ A VTLLFTW NSV G SGL+G HFNS + K G+ ++
Sbjct: 71 PILTNQFSMSNLGKEEATVTLLFTWENSVGGASGLTGEHFNSTIIHLKRGLVPVSGRFIA 130
Query: 270 ------------RTANGRPPVTFAVAAEETADVHVSECPCFLLSGNS-KGITAKDMWNEI 316
RT NG PPVT+A+AA+ET DVHVSECPCFL+SG+S K ITAK+MW+EI
Sbjct: 131 SIYCNCDEMKTCRTGNGHPPVTYAIAAQETEDVHVSECPCFLVSGHSPKEITAKEMWDEI 190
Query: 317 KKHGSFDHLDNDKTSPSEPGSSIGAAIAASLTIPSGSTRSVTFSLAWDCPEVKFFEKVYH 376
KK+ SFD L+++ SPS PG+SIGAAIAA + +P G R+VTFSL+WDCPEV+F EK YH
Sbjct: 191 KKNKSFDELNSEPGSPSRPGTSIGAAIAAKVKVPPGCDRTVTFSLSWDCPEVRFNEKTYH 250
Query: 377 RRYTKFYGTLGDSAARIAHDAILEHAKWECEIEAWQRPILEDKRFPEWYPITLFNELYYL 436
R Y +TLFNELYY
Sbjct: 251 R-----------------------------------------------YRVTLFNELYYF 263
Query: 437 NAGGTIWTDGSPPMQSL-----ATIRERKFSLDTPSKI-------HTPASSDTALGTRLL 484
N+GGTIWTDG PP +S+ + +D KI ++P SS
Sbjct: 264 NSGGTIWTDGLPPKESIERSKVTNTEQNDIVIDLFQKINAVCEQIYSPQSS--------- 314
Query: 485 ENGEENIGQFLYLEGAEYVMYNTYDVHFYSSFALVMLFPKLELSIQRDFAAAVMMHDPGT 544
N EENIGQF+YLEG EY+MYNTYDVHFYSSFAL+ LFPKL LSIQRDFAA V++ DP
Sbjct: 315 -NSEENIGQFIYLEGIEYLMYNTYDVHFYSSFALLSLFPKLALSIQRDFAATVLIQDPTK 373
Query: 545 MKIMSDGKWVARKCLGAVPHDIGLDDPWFEINSYNLFNSSRWKDLNSKFVLQVYRDFVAT 604
KIMS G+WV RK LG+VPHDIGL+DPW E+N YN FN+ RWKDLN+KFVLQVYRD VAT
Sbjct: 374 KKIMSSGEWVTRKLLGSVPHDIGLNDPWLELNEYNFFNTDRWKDLNAKFVLQVYRDVVAT 433
Query: 605 GDKNFARAVWPSVYIAMAYMEQFDKDGDGMIENEGFPDQTYDAWSANGVSAYCGGLWVAA 664
D++FA+AVWPSVY A+AY++QFDKD DGMIENEGFPDQTYDAWS GVSAYCGGLWVAA
Sbjct: 434 NDQSFAKAVWPSVYTAVAYLDQFDKDEDGMIENEGFPDQTYDAWSVTGVSAYCGGLWVAA 493
Query: 665 LQAASALANDVGDHASASYFWVRYQKAKAVYDSLWNGSYFNYDNSDGSSSTSIQADQLAG 724
LQAASA A+ VG++A A YF +Y+KAK VY+ LWNGSYFNYD+S SS+SI ADQLAG
Sbjct: 494 LQAASAFASIVGENAVAIYFNAKYEKAKIVYEKLWNGSYFNYDDSGSGSSSSILADQLAG 553
Query: 725 QWYARACGLLPIADEAKVKKALTKIYDFNVLKVKGGMCGAMNGMQPDGRIDMSGLQAREI 784
QWYARACGL PI E +KKAL IY+FNV+KVKGG GA+NGM +G++D + L ++E+
Sbjct: 554 QWYARACGLKPITKEEWIKKALETIYEFNVMKVKGGTRGAVNGMSTEGKVDTNSLVSKEV 613
Query: 785 WPGVTYGLAASMIQEEMVDMAFQTAAGVYEVAWSEDGLGYSFQTPESWNNNDEYRSLCYM 844
W G TY +AA MIQE + FQTA+G+YE WS+ GL SFQTPE+WN NDEYRSLCYM
Sbjct: 614 WAGTTYSVAACMIQEGQREKGFQTASGIYEAIWSDRGLSCSFQTPEAWNMNDEYRSLCYM 673
Query: 845 RPLTIWAMQWALTKPKLSRQEIKHEIS----DRDSSYLEQHAAFSKVASLLKL-PKEEAS 899
RPL IWA+QWALT+ + +E + ++ + + L QH F VA +K+ P
Sbjct: 674 RPLAIWAIQWALTRTQSFGEEKQKLVAGDEEEESNLLLRQHKGFKDVARFVKIVPTSNVH 733
Query: 900 KGFLKVVYD 908
+ L+ Y+
Sbjct: 734 RSRLQHTYE 742
Score = 80.9 bits (198), Expect = 3e-12, Method: Compositional matrix adjust.
Identities = 35/68 (51%), Positives = 48/68 (70%), Gaps = 12/68 (17%)
Query: 74 VFNVFRKHHITSDQGIPLGGIG------------AGSIGRSYRGEFQRFKLFHGICDDAP 121
++++F+K H+ D G+PLGGIG GSIGRSY+GEFQ+FKLF IC++AP
Sbjct: 12 MYDIFKKRHVRGDHGVPLGGIGNVVNFTQTRIFSGGSIGRSYKGEFQQFKLFPKICEEAP 71
Query: 122 VLANQFSV 129
+L NQFS+
Sbjct: 72 ILTNQFSM 79
>gi|222636955|gb|EEE67087.1| hypothetical protein OsJ_24071 [Oryza sativa Japonica Group]
Length = 838
Score = 765 bits (1976), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 399/742 (53%), Positives = 507/742 (68%), Gaps = 41/742 (5%)
Query: 147 SPGVP----KKNTDSGIESWDWNLKGENCTYHALFPRAWTVYDGEPDPELRIVCRQISPF 202
S GVP K+ D GI SWDW L+G+ TYHALFPRAWTVYDGEPDPEL++ CRQISPF
Sbjct: 92 SQGVPLGGMGKSNDDGISSWDWKLRGDRSTYHALFPRAWTVYDGEPDPELKVSCRQISPF 151
Query: 203 IPHNYKESSFPASVFTFTLSNSGQTSADVTLLFTWANSVAGDSGLSGHHFNSKTMTKDGV 262
IPHNY+ESS P SVF +T+ N+G+ A V+L+ TWANS+ G S SG H N MT
Sbjct: 152 IPHNYQESSLPTSVFVYTIVNTGKERAKVSLVMTWANSIGGLSHHSGGHVNEPFMT---- 207
Query: 263 HGLTLHHRTANGRPPVTFAVAAEETADVHVSECPCFLLSGNSKGITAKDMWNEIKKHGSF 322
AN PPVTFAVAA E +V+V+ P F LSG S +TA++MW + + G F
Sbjct: 208 ---------ANNNPPVTFAVAACENQNVNVTVLPAFGLSGESS-VTAREMWGTLSQDGCF 257
Query: 323 DHLDNDKTSPSEP---GSSIGAAIAASLTIPSGSTRSVTFSLAWDCPEVKFFE-KVYHRR 378
D DN PS P G ++ AA+ AS + +V FSLAW P+VKF + Y+RR
Sbjct: 258 DR-DNFSAGPSMPSSLGDTVCAAVCASTWVEPHGRCTVVFSLAWSSPKVKFKKGNAYYRR 316
Query: 379 YTKFYGTLGDSAARIAHDAILEHAKWECEIEAWQRPILEDKRFPEWYPITLFNELYYLNA 438
YTKFYGT + I++ WE EI+ WQRPIL D+RFPEWY +TLFNELY+L A
Sbjct: 317 YTKFYGT----SPSISN--------WEEEIDKWQRPILHDERFPEWYKVTLFNELYFLVA 364
Query: 439 GGTIWTDGSPPMQSLATIRERKFSLDTPSKIHTPASSDTALG----TRLLENGEENIGQF 494
GGT+W D + M + + S D +H +S ++A+ + + + EN+G+F
Sbjct: 365 GGTVWIDSASLMADADEMMNSRLSEDNDLPLHH-SSRNSAVPLIGFSPHIIDDRENVGKF 423
Query: 495 LYLEGAEYVMYNTYDVHFYSSFALVMLFPKLELSIQRDFAAAVMMHDPGTMKIMSDGKWV 554
LYLEG EY M+ TYDVHFY+SFAL+ LFPK+ELSIQRDFA AV+ D ++ ++DG W
Sbjct: 424 LYLEGIEYFMWCTYDVHFYASFALLELFPKIELSIQRDFATAVLREDKSRVRFLADGTWG 483
Query: 555 ARKCLGAVPHDIGLDDPWFEINSYNLFNSSRWKDLNSKFVLQVYRDFVATGDKNFARAVW 614
RK +GAV HD+G DPW E+N+YN+ ++SRWKDLN KFVLQVYRDF ATGD +F + VW
Sbjct: 484 TRKVIGAVAHDLGAHDPWHELNAYNIHDTSRWKDLNPKFVLQVYRDFAATGDMSFGKDVW 543
Query: 615 PSVYIAMAYMEQFDKDGDGMIENEGFPDQTYDAWSANGVSAYCGGLWVAALQAASALAND 674
P+V AM YMEQFD D DGMIEN+GFPDQTYDAW+ GVSAYCG LW+AALQ A+ALA
Sbjct: 544 PAVCTAMEYMEQFDHDDDGMIENDGFPDQTYDAWTVRGVSAYCGCLWLAALQTAAALART 603
Query: 675 VGDHASASYFWVRYQKAKAVYDS-LWNGSYFNYDNSDGSSSTSIQADQLAGQWYARACGL 733
+G H A +++ KAK V+++ LWNGSYFNYD+ +S SI ADQLAGQWY + GL
Sbjct: 604 LGHHDYAERCMLKFAKAKPVFEAKLWNGSYFNYDSGTSYNSRSIHADQLAGQWYTASSGL 663
Query: 734 LPIADEAKVKKALTKIYDFNVLKVKGGMCGAMNGMQPDGRIDMSGLQAREIWPGVTYGLA 793
P+ DE ++K L KI+D+NV++VKGG GA+NGM P+G++D + +Q+REIW GVTY LA
Sbjct: 664 PPLFDEGRIKCTLQKIFDYNVMRVKGGRMGAVNGMHPNGKVDETCMQSREIWTGVTYSLA 723
Query: 794 ASMIQEEMVDMAFQTAAGVYEVAWSEDGLGYSFQTPESWNNNDEYRSLCYMRPLTIWAMQ 853
A+M+ M D AF TA G++ WSE+G GY FQTPE+W + YRSL YMRPL IWAMQ
Sbjct: 724 ATMLLHGMDDQAFTTAEGIFTAGWSEEGYGYWFQTPEAWTIDGHYRSLIYMRPLAIWAMQ 783
Query: 854 WALTKPKLSRQEIKHEISDRDS 875
+AL+ P+ + K DR S
Sbjct: 784 YALSPPRTILEAPKVNTMDRTS 805
Score = 57.8 bits (138), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 27/86 (31%), Positives = 47/86 (54%), Gaps = 2/86 (2%)
Query: 12 AASMPKVDPGQPPQLTWQRKLDSNVKVPTGFTLSFQEFRHLAHIGFRLYRYSKQEEAKGK 71
A + D G PP+ W+RKL S+ F ++F+E + +G RL+ Y ++E + G+
Sbjct: 16 AGDLLDFDDGSPPEYAWRRKLSSHANRLKEFNVTFREAIKMMKLGLRLWSYIREEASHGR 75
Query: 72 VPVFNVFRKH--HITSDQGIPLGGIG 95
+ F + ++ QG+PLGG+G
Sbjct: 76 KAPIDPFTRESDKPSASQGVPLGGMG 101
>gi|326507888|dbj|BAJ86687.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 676
Score = 686 bits (1770), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 350/600 (58%), Positives = 437/600 (72%), Gaps = 30/600 (5%)
Query: 252 FNSKTMTKDGVHGLTLHHRTANGRPPVTFAVAAEETADVHVSECPCFLLSGNSKG---IT 308
F +DGVHG+ LHHRTA+G PPVTFA+A++ET DV V+ CP F + ++ G
Sbjct: 5 FLGACRARDGVHGVLLHHRTADGHPPVTFAIASQETDDVRVTCCPSFAMGPSAPGGEQFM 64
Query: 309 AKDMWNEIKKHGSFDHLDNDKTSPSEPGSSIGAAIAASLTIPSGSTRSVTFSLAWDCPEV 368
AKDMW E K GS + ++ S PGSSIGAA+AA+ T+P+G TR+V+F+L+W CPEV
Sbjct: 65 AKDMWEEAKNRGSVG-VAPGASASSRPGSSIGAAVAAATTVPAGGTRAVSFALSWSCPEV 123
Query: 369 KF-FEKVYHRRYTKFYGTLGDSAA-RIAHDAILEHAKWECEIEAWQRPILEDKRFPEWYP 426
KF + YHRRYTKF G D+AA R+AHDA+LEH KWE IE WQRP+L DKR P WYP
Sbjct: 124 KFPAGRTYHRRYTKFLGLDRDAAAERLAHDALLEHMKWESLIEEWQRPVLHDKRLPGWYP 183
Query: 427 ITLFNELYYLNAGGTIWTDGSPPMQS------LATIRERKFSLDT---PSKIHTP----- 472
+ LFNELYYLNAGGTIWTDG PP ++ + FSLD S H P
Sbjct: 184 VALFNELYYLNAGGTIWTDGLPPKKTSFASSKYGSTTMESFSLDGFHFHSGDHAPDGILR 243
Query: 473 --------ASSDTALGTRLLENGEENIGQFLYLEGAEYVMYNTYDVHFYSSFALVMLFPK 524
+ +A GT LL +GEE++G+FLYLEG EY ++NTYDVHFY+SFAL+ LFP+
Sbjct: 244 AMATAEERTEASSAFGTALLGDGEESVGRFLYLEGMEYHLWNTYDVHFYASFALLSLFPE 303
Query: 525 LELSIQRDFAAAVMMHDPGTMKIMSDGKWVARKCLGAVPHDIGLDDPWFEINSYNLFNSS 584
+EL +QRDFA AV+ HDP M+ + DG V RK LGAVPHD+GL DPWF++N+Y + + +
Sbjct: 304 IELGLQRDFARAVLHHDPRPMRTL-DGATVPRKVLGAVPHDVGLGDPWFQLNAYMIHDPA 362
Query: 585 RWKDLNSKFVLQVYRDFVATGDKNFARAVWPSVYIAMAYMEQFDKDGDGMIENEGFPDQT 644
RWKDLN+KFVLQVYRD ATGD FA AVWP+VY+AMAYM+QFD+DGDGM+ENEG PDQT
Sbjct: 363 RWKDLNTKFVLQVYRDGAATGDAAFATAVWPAVYLAMAYMDQFDRDGDGMVENEGRPDQT 422
Query: 645 YDAWSANGVSAYCGGLWVAALQAASALANDVGDHASASYFWVRYQKAKAVYDS-LWNGSY 703
YD WS +GVSAY GGLWVAALQAA+ +A VGD S YF RY+KA+ VYD LWNG+Y
Sbjct: 423 YDFWSVSGVSAYTGGLWVAALQAAAVMARVVGDRGSEGYFVERYEKARRVYDGELWNGAY 482
Query: 704 FNYDNSDGSSSTSIQADQLAGQWYARACGLLPIADEAKVKKALTKIYDFNVLKVKGGMCG 763
F+YDNS G++S SI ADQLAGQWYARACGL PI +E K + AL + D+NV++V+GG G
Sbjct: 483 FDYDNSGGTNSKSIMADQLAGQWYARACGLEPIVEEEKARSALGTVLDYNVMRVQGGAIG 542
Query: 764 AMNGMQPDGRIDMSGLQAREIWPGVTYGLAASMIQEEMVDMAFQTAAGVYEVAWSEDGLG 823
A+NGM+PDG +D S +Q+RE+WPGVTYG+AA+M E M + AF+TA G ++ W DG G
Sbjct: 543 AVNGMRPDGTVDTSSIQSREVWPGVTYGVAAAMAHEGMPEAAFRTAKGAHDAGWGRDGFG 602
>gi|330844075|ref|XP_003293963.1| hypothetical protein DICPUDRAFT_158887 [Dictyostelium purpureum]
gi|325075658|gb|EGC29519.1| hypothetical protein DICPUDRAFT_158887 [Dictyostelium purpureum]
Length = 1205
Score = 655 bits (1689), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 354/886 (39%), Positives = 517/886 (58%), Gaps = 80/886 (9%)
Query: 24 PQLTWQRKLDSNVKVPTGFTLSFQEFRHLAHIGFRLYRYSKQEEAKGKVPVFNVFRKHHI 83
PQ +W R+LD N F +S + L IG+R+++Y K+E + G+VP+ + F
Sbjct: 324 PQCSWARRLDHNFPDHKHFQVSMWQGLKLVGIGYRMWKYVKKENSSGRVPMMDPFNLPKP 383
Query: 84 TSDQGIPLGGIGAGSIGRSYRGEFQRFKLFHGICDDAPVLANQFSVFVS----------- 132
G+P+GGIG+GSI R +RG+F R+ + G+ + V ++FSV+++
Sbjct: 384 GPIMGVPIGGIGSGSINRGWRGDFVRWNMNSGLVTNETVDVDKFSVYINFDDSNSASSST 443
Query: 133 --------RPNGEKF--SSVLCPRS-----PGVPKKNTDSGIESWDWNLKGENCTYHALF 177
PNG F SS + PG PK N + ++ W+W LKGEN Y LF
Sbjct: 444 GSTPSGTFTPNGPNFDPSSTKSKQKATVLYPGKPKNNLN--LDVWNWGLKGENSCYFGLF 501
Query: 178 PRAWTVYDGEPDPELRIVCRQISPFIPHNYKESSFPASVFTFTLSNSGQ-TSADVTLLFT 236
PRAWTVY+ EP P++++VC+QISP IPHNY+ESS+P V+ + + N+ + +SADV+L+ T
Sbjct: 502 PRAWTVYE-EPHPDIKLVCKQISPVIPHNYQESSYPVGVYVWKIENNNKNSSADVSLMLT 560
Query: 237 WANSVAGDSGLSGHHFNSKTMTKD----GVHGLTLHHRTANG----------RPPVTFAV 282
W NS+ S G H+N D + G+TL HR P+ +++
Sbjct: 561 WQNSIGTKSDQDGGHYNKYFTLGDEPNKKIKGVTLIHRKEQRCAYNSSNIVYHDPMEYSI 620
Query: 283 AAEETADVHVSECPCFLLSGNSKGITAKDMWNEIKKHGSFDHLDNDKTSPSEPGSSIGAA 342
A ++ DV+++ F + + A ++W K G +++ + + PS P IGAA
Sbjct: 621 AVKDDPDVNITFNERFETTSR---LDAANLWYTFNKKGELENIQDSR--PSAPKKPIGAA 675
Query: 343 IAASLTIPSGSTRSVTFSLAWDCPEVKF-FEKVYHRRYTKFYGTLGDSAARIAHDAILEH 401
+AA + +P+G ++++ F +AWD P +F Y++RYTKFYG G ++ RIA+DA+ +
Sbjct: 676 VAAKVKVPAGGSKTIVFCIAWDQPITRFQLGSAYYKRYTKFYGNKGGNSQRIAYDALQNY 735
Query: 402 AKWECEIEAWQRPILEDKRFPEWYPITLFNELYYLNAGGTIWTDGSPPMQSLATIRERKF 461
W+ +I WQ PI++D P +Y + LFNELYYL GGT+WT GSP Q A ++
Sbjct: 736 KSWDNQIAQWQNPIIQDAELPSFYKMALFNELYYLVDGGTVWTHGSPQDQLPA-----RY 790
Query: 462 SLDT---PSKIHTPASSDTALGTRLLENGEENIGQFLYLEGAEYVMYNTYDVHFYSSFAL 518
L T P I P +IG+F YLE EY+MYNTYDVHFYSS+AL
Sbjct: 791 PLKTQLKPEDISDP----------------NHIGRFAYLESLEYLMYNTYDVHFYSSYAL 834
Query: 519 VMLFPKLELSIQRDFAAAVMMHDPGTMKIMSDGKWVARKCLGAVPHDIGL--DDPWFEIN 576
ML+P LE+S+Q D A A M+ T + + G+ + RK VPHD+G +DPW +N
Sbjct: 835 AMLWPLLEISLQYDIAEATMLDYGITWEGIHSGQQIPRKKRCTVPHDLGNPGEDPWKRVN 894
Query: 577 SYNLFNSSRWKDLNSKFVLQVYRDFVATGDKNFARAVWPSVY-IAMAYMEQFDKDGDGMI 635
SYN+ + SRWKDL SKFVLQVYRD++ DKNF V+ V + +E FD D DG++
Sbjct: 895 SYNIQDISRWKDLPSKFVLQVYRDYLVVEDKNFLLQVYNVVEEVIQRTLESFDTDHDGVV 954
Query: 636 ENEGFPDQTYDAWSANGVSAYCGGLWVAALQAASALANDVGDHASASYFWVRYQKAKAVY 695
+NEGFPDQTYD W A G SAY GGLW+A+L+ AS +A +G S + ++K A Y
Sbjct: 955 DNEGFPDQTYDVWPAVGCSAYSGGLWLASLKVASEMAKILGFKEDESIYNAIFEKGSASY 1014
Query: 696 -DSLWNGSYFNYDNSDGSSSTSIQADQLAGQWYARACGLLPIADEAKVKKALTKIYDFNV 754
LWNG YF YD S+ + SI +D LAG WY +CGL + +L+ I ++NV
Sbjct: 1015 TKKLWNGYYFKYDCSNSVHADSIMSDMLAGHWYLLSCGLPSYMTFDQALSSLSIINEYNV 1074
Query: 755 LKVKGGMCGAMNGMQPDGRIDMSGLQAREIWPGVTYGLAASMIQEEMVDMAFQTAAGVYE 814
CGA+NGM+P+G +D + LQ+ E+W G ++ LAA+MIQ M A++ G+
Sbjct: 1075 NSYGKERCGAVNGMRPEGVVDNTCLQSSEVWIGTSFSLAATMIQHHMDKDAWELVKGIVN 1134
Query: 815 VAWSEDGLGYSFQTPESWNNNDEYRSLCYMRPLTIWAMQWALTKPK 860
+++ G+ +QTPE+W++N YR+ YMRPL+IW++QWAL K K
Sbjct: 1135 SSYNR--WGFQYQTPEAWDSNGCYRAGAYMRPLSIWSIQWALLKKK 1178
>gi|281212396|gb|EFA86556.1| hypothetical protein PPL_00357 [Polysphondylium pallidum PN500]
Length = 4775
Score = 649 bits (1675), Expect = 0.0, Method: Composition-based stats.
Identities = 347/853 (40%), Positives = 506/853 (59%), Gaps = 69/853 (8%)
Query: 28 WQRKLDSNVKVPTGFTLSFQEFRHLAHIGFRLYRYSKQEEAKGKVPVFNVFRKHHITSDQ 87
W ++LD F +S + L +G+R+++Y K+E + G+VP+ + F
Sbjct: 339 WTKRLDHQFPEHKAFQVSIGQGFKLMGLGYRMWKYVKRENSAGRVPIMDPFNIPKPGPVM 398
Query: 88 GIPLGGIGAGSIGRSYRGEFQRFKLFHGICDDAPVLANQFSVFV------SRPNGEKFSS 141
G+P+GGIG+GSI R +RG+F R+ L +G+ + V AN+FSV++ S+P K ++
Sbjct: 399 GVPIGGIGSGSITRGWRGDFVRWNLKNGMVNSEVVDANKFSVYIKMEEQQSQPLSTKRAT 458
Query: 142 VLCPRSPGVPKKNTDSGIESWDWNLKGENCTYHALFPRAWTVYDGEPDPELRIVCRQISP 201
VLCP G PK N ++ W+W+LKG+ +Y FPRAWTVY+ EP ++R+VC+Q+SP
Sbjct: 459 VLCP---GKPKSNF--ALDVWNWSLKGDRSSYFGQFPRAWTVYE-EPHQDVRLVCKQVSP 512
Query: 202 FIPHNYKESSFPASVFTFTLSNSGQTSADVTLLFTWANSVAGDSGLSGHHFNSKTMTKD- 260
IP+NYKESS+P VF + + N+ T A+V+L+FTW NS +G H N + D
Sbjct: 513 VIPNNYKESSYPCGVFVWKIENTASTDAEVSLMFTWQNSDGTAVDQAGGHHNKRFKYTDE 572
Query: 261 ---GVHGLTLHHRTANG------------RPPVTFAVAAEETADVHVSECPCFLLSGNSK 305
++G+TL T N + P+ ++ + ADV S F +
Sbjct: 573 QGRSINGVTL---TTNRDLKSSVDPKKVYQDPLELSIGVRDDADVQFSFVSRF---ETTN 626
Query: 306 GITAKDMWNEIKKHGSFDHLDNDKTSPSEPGSSIGAAIAASLTIPSGSTRSVTFSLAWDC 365
+ A ++W K G D +++ + PS P IGAAIAA + + +G+TR V FS+AWD
Sbjct: 627 RLEAANLWYSFNKSGVLD--NSEDSRPSAPKKPIGAAIAAKVLVKAGTTREVAFSVAWDT 684
Query: 366 PEVKFFEKV-YHRRYTKFYGTLGDSAARIAHDAILEHAKWECEIEAWQRPILEDKRFPEW 424
P +F Y+RRYTKF+GT G+++ RIA DA+ + KWE EI WQ PIL D P +
Sbjct: 685 PVCRFNSGSGYYRRYTKFFGTAGNNSQRIACDAVYNYRKWEEEIVRWQHPILSDPSLPTF 744
Query: 425 YPITLFNELYYLNAGGTIWTDGSP---PMQSLATIRERKFSLDTPSKIHTPASSDTALGT 481
Y +FNELYY GGT+WT G+P P Q + ++ L+ P+ I
Sbjct: 745 YKQAIFNELYYFVDGGTVWTHGAPDDQPNQKKIVSKFQQEDLNDPNYI------------ 792
Query: 482 RLLENGEENIGQFLYLEGAEYVMYNTYDVHFYSSFALVMLFPKLELSIQRDFAAAVMMHD 541
G+F YLE +Y+MYNTYDVHFY+SFAL ML+P+LELS+Q DFA A ++
Sbjct: 793 ----------GRFAYLESVDYLMYNTYDVHFYASFALAMLWPRLELSLQTDFADATLLDY 842
Query: 542 PGT-MKIMSDGKWVARKCLGAVPHDIGL--DDPWFEINSYNLFNSSRWKDLNSKFVLQVY 598
T ++ + GK + RK GAVPHD+G +DPW +N+Y++ + SRWKDL SKFVLQ+Y
Sbjct: 843 SDTQVECIQTGKNMPRKVRGAVPHDLGNPGEDPWKRVNAYHIQDVSRWKDLPSKFVLQIY 902
Query: 599 RDFVATG-DKNFARAVWPSVYIAMAYMEQFDKDGDGMIENEGFPDQTYDAWSANGVSAYC 657
RD++ G D+ F +W V + ++D D DG+I+NEG PDQTYDAWSA G SAY
Sbjct: 903 RDYLINGNDRTFLLQMWGVVEEVIRRAFEYDIDIDGVIDNEGVPDQTYDAWSALGCSAYS 962
Query: 658 GGLWVAALQAASALANDVGDHASASYFWVRYQKAKAVYDS-LWNGSYFNYDNSDGSSSTS 716
GGLW+AA++ AS +A +G + + ++K K Y + LWNG YFNYD+S S
Sbjct: 963 GGLWLAAVKVASEMARILGLKEDETVYKKIFEKGKKSYSTKLWNGHYFNYDSSKNPHFDS 1022
Query: 717 IQADQLAGQWYARACGLLPIADEAKVKKALTKIYDFNVLKVKGGMCGAMNGMQPDGRIDM 776
I +DQLAG WY ACGL + +L+ I ++NV G CGA+NGM+P+G D
Sbjct: 1023 IMSDQLAGHWYLLACGLPSYITLDQALSSLSIINEYNVKSYSNGSCGAVNGMRPEGGPDT 1082
Query: 777 SGLQAREIWPGVTYGLAASMIQEEMVDMAFQTAAGVYEVAWSEDGLGYSFQTPESWNNND 836
+ LQ+ E+W G +YGLA++M+ M + A++ G+ + +++ G+ +QTPE+W+ N
Sbjct: 1083 TSLQSCEVWIGTSYGLASTMLLHFMDNEAWELIKGLVDSTYNK--WGFQYQTPEAWDQNG 1140
Query: 837 EYRSLCYMRPLTI 849
YR+ YMRPL I
Sbjct: 1141 LYRAGTYMRPLAI 1153
>gi|147812668|emb|CAN61857.1| hypothetical protein VITISV_016690 [Vitis vinifera]
Length = 521
Score = 639 bits (1649), Expect = e-180, Method: Compositional matrix adjust.
Identities = 320/535 (59%), Positives = 383/535 (71%), Gaps = 63/535 (11%)
Query: 425 YPITLFNELYYLNAGGTIWTDGSPPMQSLATIRERKFSLDTP------------------ 466
Y ITLFNELY+LNAGGTIWTDG PPMQSLATI + KFSLD
Sbjct: 11 YRITLFNELYFLNAGGTIWTDGLPPMQSLATIEQIKFSLDRSISDPKNTTDIVHQNDSTV 70
Query: 467 ----------SKIHTPASSDTALGTRLLENGEENIGQFLYLEGAEYVMYNTYDVHFYSSF 516
+IH P +S++A GT LL++GEEN+GQFLYLEG EY M+NTYDVHFYSSF
Sbjct: 71 EILGRMTSMLEQIHNPTTSNSAFGTYLLQSGEENVGQFLYLEGIEYHMWNTYDVHFYSSF 130
Query: 517 ALVMLFPKLELSIQRDFAAAVMMHDPGTMKIMSDGKWVARKCLGAVPHDIGLDDPWFEIN 576
A++MLFP+LELSIQRDFAAAVM+HDP MKIMSDGKWV RK LGAVPHDIG+ DPWFE+N
Sbjct: 131 AIIMLFPQLELSIQRDFAAAVMVHDPSRMKIMSDGKWVPRKVLGAVPHDIGISDPWFELN 190
Query: 577 SYNLFNSSRWKDLNSKFVLQVYRDFVATGDKNFARAVWPSVYIAMAYMEQFDKDGDGMIE 636
+YNL+++ RWKDLNSKFVLQVYRD VATGDKNFARAVWP+VYIA+A+++QFDKDGDGMIE
Sbjct: 191 AYNLYDTDRWKDLNSKFVLQVYRDMVATGDKNFARAVWPAVYIAIAFLDQFDKDGDGMIE 250
Query: 637 NEGFPDQTYDAWSANGVSAYCGGLWVAALQAASALANDVGDHASASYFWVRYQKAKAVYD 696
N+G P LQAASA+A +VGD +A YFW ++QKAKAVYD
Sbjct: 251 NDGLP-----------------------LQAASAMAREVGDSMTADYFWFKFQKAKAVYD 287
Query: 697 S------LWNGSYFNYDNSDGS-SSTSIQAD-QLAGQW---YARACGLLPIADEAKVKKA 745
+ G G+ S T I Q W YARACGL PI D+ K + A
Sbjct: 288 KDQLAAVIGEGRLLXLFRFLGNPSHTGIDGLFQDHRDWTTRYARACGLQPIVDDEKARSA 347
Query: 746 LTKIYDFNVLKVKGGMCGAMNGMQPDGRIDMSGLQAREIWPGVTYGLAASMIQEEMVDMA 805
L K+Y+FNVLKVK G CGA+NGM PDGR+DMS +Q+REIW GVTY +AA+MI E MV+ A
Sbjct: 348 LEKVYNFNVLKVKEGKCGAVNGMLPDGRVDMSAMQSREIWAGVTYSVAANMIHEGMVETA 407
Query: 806 FQTAAGVYEVAWSEDGLGYSFQTPESWNNNDEYRSLCYMRPLTIWAMQWALTKPKLSRQE 865
F TA+G+Y+ AWS++GLGYSFQTPE+WN ++EYRSLCYMRPL IWAMQWAL+KP+L +
Sbjct: 408 FNTASGIYDAAWSQEGLGYSFQTPEAWNTDEEYRSLCYMRPLAIWAMQWALSKPELHNHD 467
Query: 866 IKHEISDRDSSYLEQHAAFSKVASLLKLPKEEASKGFLKVVYDFTIGRVLCHNLN 920
+KHE + + E H F KVA LLKLP+EEASK FL++ +D T R+ LN
Sbjct: 468 MKHE-EGKGTLNFEHHVGFEKVAHLLKLPEEEASKSFLQLFFDLTCRRLYTLRLN 521
>gi|449526389|ref|XP_004170196.1| PREDICTED: non-lysosomal glucosylceramidase-like, partial [Cucumis
sativus]
Length = 440
Score = 637 bits (1644), Expect = e-180, Method: Compositional matrix adjust.
Identities = 297/439 (67%), Positives = 349/439 (79%), Gaps = 1/439 (0%)
Query: 7 EEGENAASMPKVDPGQPPQLTWQRKLDSNVKVPTGFTLSFQEFRHLAHIGFRLYRYSKQE 66
E+G N AS +VDP +PP LTW+RKLD K P F+ + + H+ G+RL+R K+E
Sbjct: 2 EKGGNGASSTEVDPSKPPSLTWKRKLDFTGKSPESFSFTLTDAWHMGMTGYRLWRNGKEE 61
Query: 67 EAKGKVPVFNVFRKHHITSDQGIPLGGIGAGSIGRSYRGEFQRFKLFHGICDDAPVLANQ 126
AKG++P++ F IT G+ LGGIGAGSIGRSYRGEFQRF++F+G C+D PVLANQ
Sbjct: 62 IAKGRIPIYEFFSDVPITCYHGVSLGGIGAGSIGRSYRGEFQRFQMFYGPCEDEPVLANQ 121
Query: 127 FSVFVSRPNGEKFSSVLCPRSPGVPKKNTDSGIESWDWNLKGENCTYHALFPRAWTVYDG 186
FSVFVSRPNG KFSSVLC P K +GIESWDWNL GEN TYHALFPR+WTVYDG
Sbjct: 122 FSVFVSRPNGNKFSSVLCSAKPQKSKDGKQTGIESWDWNLSGENSTYHALFPRSWTVYDG 181
Query: 187 EPDPELRIVCRQISPFIPHNYKESSFPASVFTFTLSNSGQTSADVTLLFTWANSVAGDSG 246
EPDP+L+IVCRQ+SP IPHNYKESSFP SVFTF LSN GQTSA VTLLFTWANSV G SG
Sbjct: 182 EPDPDLKIVCRQMSPIIPHNYKESSFPVSVFTFKLSNEGQTSAQVTLLFTWANSVGGKSG 241
Query: 247 LSGHHFNSKTMTKDGVHGLTLHHRTANGRPPVTFAVAAEETADVHVSECPCFLLSGNSKG 306
+GHHFNSK +DG G+ LHH++ANGRP VT+ +AAE T DVHVS CPCF++SG+S+G
Sbjct: 242 FTGHHFNSKMGAEDGAQGVLLHHKSANGRPTVTYGIAAEATDDVHVSLCPCFVISGDSEG 301
Query: 307 ITAKDMWNEIKKHGSFDHLDN-DKTSPSEPGSSIGAAIAASLTIPSGSTRSVTFSLAWDC 365
I+AKDMW EIK HGSFD+L + S+PG SIGAA+AA+LTIP S R+VTFSLAWDC
Sbjct: 302 ISAKDMWQEIKNHGSFDNLGSVGANEGSKPGCSIGAAVAATLTIPPTSARTVTFSLAWDC 361
Query: 366 PEVKFFEKVYHRRYTKFYGTLGDSAARIAHDAILEHAKWECEIEAWQRPILEDKRFPEWY 425
PEVKF K YHR+Y+KFYGTLGD+A IA DAI +H KWE EIEAWQRPI+EDKR P+WY
Sbjct: 362 PEVKFDGKTYHRQYSKFYGTLGDAAEIIARDAISKHGKWEAEIEAWQRPIIEDKRLPDWY 421
Query: 426 PITLFNELYYLNAGGTIWT 444
P+TLFNELY+LN+GGTIWT
Sbjct: 422 PVTLFNELYFLNSGGTIWT 440
>gi|449482327|ref|XP_004156248.1| PREDICTED: non-lysosomal glucosylceramidase-like, partial [Cucumis
sativus]
Length = 508
Score = 634 bits (1635), Expect = e-179, Method: Compositional matrix adjust.
Identities = 316/503 (62%), Positives = 383/503 (76%), Gaps = 33/503 (6%)
Query: 444 TDGSPPMQSLATIRERKFSLD---------TPSKIHT---------------------PA 473
+DG PP+Q+L+TI +K+ L+ P+ H
Sbjct: 4 SDGLPPLQNLSTISHKKYFLERSKSELNGGAPNGDHRKDVAVDILERMSQILDQTHGGAG 63
Query: 474 SSDTALGTRLLENGEENIGQFLYLEGAEYVMYNTYDVHFYSSFALVMLFPKLELSIQRDF 533
S+ ALGTRLL GEEN+G L +EG++Y+M+NTYDVHFYSSFAL+MLFPKLELSIQRDF
Sbjct: 64 PSNAALGTRLLHPGEENVGNLLLVEGSQYLMWNTYDVHFYSSFALIMLFPKLELSIQRDF 123
Query: 534 AAAVMMHDPGTMKIMSDGKWVARKCLGAVPHDIGLDDPWFEINSYNLFNSSRWKDLNSKF 593
AAAV+MHDP K M DG WV RK LGAVPHDIG +DPW E+N+YNL N SRWKDL SKF
Sbjct: 124 AAAVLMHDPRKAKTMCDGNWVPRKVLGAVPHDIGFNDPWLEVNAYNLLNVSRWKDLGSKF 183
Query: 594 VLQVYRDFVATGDKNFARAVWPSVYIAMAYMEQFDKDGDGMIENEGFPDQTYDAWSANGV 653
VLQVYRD VATGDKNFA++VWPSVY+A+A+MEQFDKD DGMIENEGFPDQTYD W+ GV
Sbjct: 184 VLQVYRDVVATGDKNFAKSVWPSVYVALAFMEQFDKDKDGMIENEGFPDQTYDTWTVKGV 243
Query: 654 SAYCGGLWVAALQAASALANDVGDHASASYFWVRYQKAKAVYDSLWNGSYFNYDNSDGSS 713
SAYCGGLWVAALQAASALA++V D A+A YFW++YQKA++VY++LWNGSYFNYDNS G
Sbjct: 244 SAYCGGLWVAALQAASALASEVDDEAAAHYFWIKYQKARSVYETLWNGSYFNYDNSKGPW 303
Query: 714 STSIQADQLAGQWYARACGLLPIADEAKVKKALTKIYDFNVLKVKGGMCGAMNGMQPDGR 773
S+SIQADQLAGQWYARACGL PIADE K++ AL KIY+FNV+KVKGG GA+NGM PDG
Sbjct: 304 SSSIQADQLAGQWYARACGLCPIADEEKIRVALEKIYNFNVMKVKGGTRGAVNGMFPDGS 363
Query: 774 IDMSGLQAREIWPGVTYGLAASMIQEEMVDMAFQTAAGVYEVAWSEDGLGYSFQTPESWN 833
+D S LQ +EIW GVTY +AA+MIQE MV+ FQTA G+Y+ AW++DGLGYSFQTPE+W+
Sbjct: 364 VDKSILQPKEIWAGVTYSVAATMIQEGMVETGFQTAMGIYQAAWAQDGLGYSFQTPEAWD 423
Query: 834 NNDEYRSLCYMRPLTIWAMQWA-LTKPKLSRQEIKHEISDRDSSYLEQHAAFSKVASLLK 892
+D +RS+ YMRPL IWAMQWA ++ K ++ K ++S++ QHAAF KVASLLK
Sbjct: 424 VDDRFRSIGYMRPLAIWAMQWAMMSDSKPTKVPTKAFSEMQESAFATQHAAFLKVASLLK 483
Query: 893 LPKEE--ASKGFLKVVYDFTIGR 913
LP + A + ++ YDF R
Sbjct: 484 LPSNDDTARRSLVEAAYDFICKR 506
>gi|328869576|gb|EGG17953.1| hypothetical protein DFA_06619 [Dictyostelium fasciculatum]
Length = 1200
Score = 618 bits (1593), Expect = e-174, Method: Compositional matrix adjust.
Identities = 346/908 (38%), Positives = 509/908 (56%), Gaps = 81/908 (8%)
Query: 2 ENGVKEEGENAASMPK---VDPGQPPQLTWQRKLDSNVKVPTGFTLSFQEFRHLAHIGFR 58
EN EE + + M K + PP W R L++ F +SF + L +G+R
Sbjct: 309 ENSTFEEEDIQSEMEKWQRLGEDIPPH-CWTRALNNQFPDHKKFQVSFGQGIKLMGLGYR 367
Query: 59 LYRYSKQEEAKGKVPVFNVFRKHHITSDQGIPLGGIGAGSIGRSYRGEFQRFKLFHGICD 118
+++Y K+E G++P+ + F + G+P+GGIG GSI R +RG+F R+ L +G+
Sbjct: 368 MWKYVKREMNAGRLPIMDPFNMPNPGPIMGVPIGGIGCGSITRGWRGDFVRWNLSNGVVT 427
Query: 119 DAPVLANQFSVFVSRPN---------------GEKFSSVLCPRSPGVPKKNTDSGIESWD 163
+ V AN FSVF+ N K ++VLC G P+ N + ++ W+
Sbjct: 428 NKVVDANNFSVFIKMNNQGSTSSSSSSSSSSATNKLATVLCQ---GKPRNNQN--LDVWN 482
Query: 164 WNLKGENCTYHALFPRAWTVYDGEPDPELRIVCRQISPFIPHNYKESSFPASVFTFTLSN 223
W LKG+ +Y +FPRAWTVY+ EP P++++VC+Q+SP I HNY+ESS+P +VF + + N
Sbjct: 483 WALKGDKSSYFGMFPRAWTVYE-EPHPDIKLVCKQVSPVIAHNYQESSYPCAVFVWKIDN 541
Query: 224 SGQTSADVTLLFTWANSVAGDSG--LSGHHFNSKTMTKDGVHGLTLH-HRTANGRPPVTF 280
S SAD++L+FTW N DS L GH+ S ++ G+TL+ +R+A+ +
Sbjct: 542 SAPVSADISLMFTWQNGDGTDSTDQLGGHYNRSFSLDNGKFRGITLNTNRSASQNSSMEL 601
Query: 281 AVAAEETADVHVSECPCFLLSGNSKGITAKDMWNEIKKHGSFDHLDNDKTSPSEPGSSIG 340
++ A D S + + A ++W K+G D+ +D T P++ IG
Sbjct: 602 SIGA--LCDEPGSNFSYVSRFETTNRLEAANLWYSFNKNGVLDN--SDDTRPNQAKKPIG 657
Query: 341 AAIAASLTIPSGSTRSVTFSLAWDCPEVKFF--EKVYHRRYTKFYGTLGDSAARIAHDAI 398
AAIA + + + S+R + F++AWD P F + +Y RRYTKF+GT G+++ IAH A
Sbjct: 658 AAIAYKVRVEAQSSRQIVFAVAWDSPYCTFNSGKSIYPRRYTKFFGTSGNNSQSIAHYAS 717
Query: 399 LEHAKWECEIEAWQRPILEDKRFPEWYPITLFNELYYLNAGGTIWTDGSPPMQSLATIRE 458
L + WE +I +WQ PIL+D + P +Y LFNELY+L GG+IWT +
Sbjct: 718 LNYQNWEQQINSWQNPILQDPQLPSFYKKALFNELYFLVDGGSIWTINQNNQNNNQNNII 777
Query: 459 RKFSLDTPSKIHTPASSDTALGTRLLENGEENIGQFLYLEGAEYVMYNTYDVHFYSSFAL 518
+ + + I+ ++ IGQF YLE EY+MYNTYDVHFY+SFAL
Sbjct: 778 DNNNNNNNNYIY-----------------DKEIGQFAYLESQEYLMYNTYDVHFYASFAL 820
Query: 519 VMLFPKLELSIQRDFAAAVMMHDPG-TMKIMSDGKWVARKCLGAVPHDIGL--DDPWFEI 575
L+P LE S+Q A A M G T++ + GK + RK G VPHDIG +DPW +
Sbjct: 821 ATLWPSLEFSLQSGIADATMEDYGGETVECIHSGKQIPRKLRGTVPHDIGNPGEDPWKRV 880
Query: 576 NSYNLFNSSRWKDLNSKFVLQVYRDFVAT------------------------GDKNFAR 611
N+YN+ + SRWKDL KF+LQVYRD++ T GD+ F
Sbjct: 881 NAYNIQDISRWKDLPCKFILQVYRDYLLTSVDSSGQKSNATSSSFGQEYIPLDGDRGFLL 940
Query: 612 AVWPSVYIAMAYMEQFDKDGDGMIENEGFPDQTYDAWSANGVSAYCGGLWVAALQAASAL 671
+W +V + +FD D DG+I+NEGFPDQTYD WSA+G SAY GGLW+AA++A SA+
Sbjct: 941 QMWGTVEEVIRKAFEFDTDDDGVIDNEGFPDQTYDTWSASGCSAYTGGLWLAAIKATSAM 1000
Query: 672 ANDVGDHASASYFWVRYQKAKAVYDS-LWNGSYFNYDNSDGSSSTSIQADQLAGQWYARA 730
A +G + ++K K Y+ LWNG YFNYD+S S SI ADQLAG WY +
Sbjct: 1001 ARILGLRDDEEVYSKLFEKGKKSYNKKLWNGHYFNYDSSKQSHYNSIMADQLAGHWYLTS 1060
Query: 731 CGLLPIADEAKVKKALTKIYDFNVLKVKGGMCGAMNGMQPDGRIDMSGLQAREIWPGVTY 790
CGL + L+ I ++N+ G CGA+NG+ P +D + LQ+ E+W G +Y
Sbjct: 1061 CGLSSYMTLDQALSTLSIINEYNIKSYSNGSCGAVNGISPLAIVDQTCLQSSEVWIGTSY 1120
Query: 791 GLAASMIQEEMVDMAFQTAAGVYEVAWSEDGLGYSFQTPESWNNNDEYRSLCYMRPLTIW 850
LA++ + M A+ G+ V S + G+ +QTPE+W+ N +R+ YMRPL IW
Sbjct: 1121 SLASTFLLHYMDKEAWSLIKGL--VNSSYEKWGFQYQTPEAWDMNGSFRASTYMRPLAIW 1178
Query: 851 AMQWALTK 858
++QWAL++
Sbjct: 1179 SVQWALSR 1186
>gi|125821356|ref|XP_687652.2| PREDICTED: non-lysosomal glucosylceramidase [Danio rerio]
Length = 851
Score = 616 bits (1588), Expect = e-173, Method: Compositional matrix adjust.
Identities = 338/822 (41%), Positives = 474/822 (57%), Gaps = 64/822 (7%)
Query: 55 IGFRLYRY-SKQEEAKGKVPVFNVFRKHHITSDQGIPLGGIGAGSIGRSYRGEFQRFKLF 113
+G R ++ ++ + + K P ++FR + G+PLGGIG GSI R +RGEF R++L
Sbjct: 61 LGVRYLKWWYRKTQVEKKAPFIDMFRAVPLRQIYGVPLGGIGGGSITRGWRGEFCRWQLN 120
Query: 114 HGICDDAPVLANQFSVFVSRPNGEKFSSVLCPRSPGVPKKNTDSGIESWDWNLKGENCTY 173
G+ V+ANQF+V + R + VL P S ++ W+W GE+ Y
Sbjct: 121 PGMYHYKTVIANQFTVCLRRDGQTVYQQVLSTERP--------STLQGWNWGYCGEHAFY 172
Query: 174 HALFPRAWTVYDGEPDPELRIVCRQISPFIPHNYKESSFPASVFTFTLSNSGQTSADVTL 233
H LFPRAWTVY+ P + + CRQ+SP IPH+YK+SS P +VF + + N + D+++
Sbjct: 173 HGLFPRAWTVYN-LPGQNVTLTCRQVSPVIPHDYKDSSLPVAVFVWDIENKNDYALDISI 231
Query: 234 LFTWANSVAGDSGLSGHHFNSK-TMTKDG--VHGLTLHHRTANGRPPVTFAVAAEETADV 290
+FT N SG H+N + K+G V G+ LHH+T P T +AA + +
Sbjct: 232 MFTMVNGSGQKDDKSGGHWNEPFHLEKEGESVSGVLLHHQTPAN--PYTLCIAARQKSGQ 289
Query: 291 HVSECPCFLLSGNSKGITAKDMWNEIKKHGSFDHLDNDKTSPSEPGSSIGAAIAASLTIP 350
+S F G T +W ++ G D N + P+E G + AA+ S ++
Sbjct: 290 EISHQTAFSPKG-----TCSAVWCDLMTDGRLDSPTN-SSPPTEKGEEVAAALVVSCSVS 343
Query: 351 SGSTRSVTFSLAWDCPEVKF--FEKVYHRRYTKFYGTLGDSAARIAHDAILEHAKWECEI 408
+ S SV FSLAWD P++ F E+ Y RRYT++YGT GD+A I+H A+ ++KWE I
Sbjct: 344 ANSRNSVDFSLAWDMPKITFGSKERTYVRRYTRYYGTKGDAAPSISHYALTHYSKWEESI 403
Query: 409 EAWQRPILEDKRFPEWYPITLFNELYYLNAGGTIWTDGSPPMQSLATIRERKFSLDTPSK 468
E WQRPIL+D+ P WY LFNELY++ GGTIW
Sbjct: 404 EEWQRPILQDRSLPAWYKSALFNELYFVVDGGTIW------------------------- 438
Query: 469 IHTPASSDTALGTRLLENG-------EENIGQFLYLEGAEYVMYNTYDVHFYSSFALVML 521
+ P +D + G R + G ++ G+F YLEG EY MYNTYDVHFY+SFAL+ML
Sbjct: 439 VELPEDADISGGLRPEDGGLPAQPEVVKDFGRFAYLEGQEYRMYNTYDVHFYASFALIML 498
Query: 522 FPKLELSIQRDFAAAVMMHDPGTMKIMSDGKWVARKCLGAVPHDIGL--DDPWFEINSYN 579
+PKL LS+Q D A +V+ HDP + +G++ K G VPHDIG D+PW +N+Y
Sbjct: 499 WPKLALSLQYDIAGSVVQHDPMERLNLMNGRYSPVKTRGVVPHDIGDPDDEPWVRVNAYL 558
Query: 580 LFNSSRWKDLNSKFVLQVYRDFVATGDKNFARAVWPSVYIAMAYMEQFDKDGDGMIENEG 639
+ +++ WKDLN KFVLQVYRD+ T D+ + + +WP M +FDKDGDG+IEN G
Sbjct: 559 IHDTADWKDLNLKFVLQVYRDYHLTQDQQYLKDMWPVCQTVMENELKFDKDGDGLIENSG 618
Query: 640 FPDQTYDAWSANGVSAYCGGLWVAALQAASALANDVGDHASASYFWVRYQKAKAVYDS-L 698
+ DQTYD W G SAYCGGLW+A++ +A + + + ++ A +D L
Sbjct: 619 YADQTYDGWKVTGPSAYCGGLWLASVCMMCKMARVLNCESVYQRYRDILERGSAAFDKLL 678
Query: 699 WNGSYFNYDNSDGSSSTSIQADQLAGQWYARACGLLPIADEA----KVKKALTKIYDFNV 754
WNG Y+NYD+S S S S+ +DQ AG W+ RA GL +A K+ AL ++D NV
Sbjct: 679 WNGKYYNYDSSGRSLSNSVMSDQCAGHWFLRASGLGDDEYQAFPKEKICSALKSVFDLNV 738
Query: 755 LKVKGGMCGAMNGMQPDGRIDMSGLQAREIWPGVTYGLAASMIQEEMVDMAFQTAAGVYE 814
+ GG GA+NGM+P+G D S +Q+ E+W GV YGLAA+MI E MV+ +TA G Y
Sbjct: 739 MSFAGGQMGAVNGMRPEGVPDRSSVQSDEVWVGVVYGLAATMIHEGMVEEGLRTAEGCYR 798
Query: 815 VAWSEDGLGYSFQTPESWNNNDEYRSLCYMRPLTIWAMQWAL 856
W +G +FQTPE++ YRSL YMRPL+IWAMQ AL
Sbjct: 799 AVWER--MGMAFQTPEAYCEKGIYRSLAYMRPLSIWAMQLAL 838
>gi|66801399|ref|XP_629625.1| hypothetical protein DDB_G0292446 [Dictyostelium discoideum AX4]
gi|60463006|gb|EAL61202.1| hypothetical protein DDB_G0292446 [Dictyostelium discoideum AX4]
Length = 1302
Score = 615 bits (1586), Expect = e-173, Method: Compositional matrix adjust.
Identities = 346/917 (37%), Positives = 510/917 (55%), Gaps = 108/917 (11%)
Query: 24 PQLTWQRKLDSNVKVPTGFTLSFQEFRHLAHIGFRLYRYSKQEEAKGKVPVFNVFRKHHI 83
PQ W R+LD+N F +S + L +G+R+++Y K+E + G+VP+ + F
Sbjct: 369 PQYAWARRLDNNYPEHKHFQVSMWQGLKLVGMGYRMWKYVKKETSSGRVPIMDPFNLPKP 428
Query: 84 TSDQGIPLGGIGAGSIGRSYRGEFQRFKLFHGICDDAPVLANQFSVFVSRP--------- 134
G+P+GGIG+GSI R ++G+F R+ L +G+ + V ++FSV+++
Sbjct: 429 GPIMGVPIGGIGSGSINRGWKGDFVRWNLNNGLVSNEIVDVDKFSVYINFDNSNTSSSSS 488
Query: 135 -----------------------------------NGEKFSSVLCPRSPGVPKKNTDSGI 159
N K PG PK N + +
Sbjct: 489 SSSSNSSTPPPQFSNNNNNSYNNSYNNNINNNGFDNNLKSKQKAVVLYPGKPKSNLN--L 546
Query: 160 ESWDWNLKGENCTYHALFPRAWTVYDGEPDPELRIVCRQISPFIPHNYKESSFPASVFTF 219
+ W+W LKG+N Y LFPRAWTVY+ EP P++R+ C+Q+SP IPHNY+ESS+P V+ +
Sbjct: 547 DVWNWGLKGDNSCYFGLFPRAWTVYE-EPHPDIRLTCKQVSPVIPHNYQESSYPCGVYVW 605
Query: 220 TLSNSGQ-TSADVTLLFTWANSVAGDSGLSGHHFNS-------------------KTMTK 259
+ N+ Q +SADV+L+ TW NS+ S G HFN K
Sbjct: 606 KIENNNQYSSADVSLMLTWQNSIGTQSDQDGGHFNKYFTFGDNDQQSPSSSSSSSSGNNK 665
Query: 260 DGVHGLTLHHRTA------NGRP-----PVTFAVAAEETADVHVSECPCFLLSGNSKGIT 308
+ G+T+ H+ NG P+ +++A + DV +S C F + +
Sbjct: 666 KNIKGITMIHKKEQKCTLPNGNTKVFNDPMEYSIAVMDEPDVEISFCERFETTSR---LD 722
Query: 309 AKDMWNEIKKHGSFDHLDNDKTSPSEPGSSIGAAIAASLTIPSGSTRSVTFSLAWDCPEV 368
A ++W K G + +N + PS P IGAA+AA + + +GS++++ F ++WD P
Sbjct: 723 AANLWYTFNKKGELE--NNKDSRPSAPKKPIGAAVAAKVKVAAGSSKTIVFCISWDAPIT 780
Query: 369 KF-FEKVYHRRYTKFYGTLGDSAARIAHDAILEHAKWECEIEAWQRPILEDKRFPEWYPI 427
+F Y++RYTKFYG G ++ RIA+DAI W+ +I WQ+PIL D P +Y +
Sbjct: 781 RFQMGSAYYKRYTKFYGNQGGNSQRIAYDAINNWKHWDNQIIQWQQPILSDPELPTFYKM 840
Query: 428 TLFNELYYLNAGGTIWTDGSPPMQSLATIRERKFSLDTPSKIHTPASSDTALGTRLLENG 487
LFNELYYL GGT+WT GSP + + R T +++ ++DT
Sbjct: 841 ALFNELYYLVDGGTVWTHGSP----IDPVPSR---YPTKTQLKQEDTTDT---------- 883
Query: 488 EENIGQFLYLEGAEYVMYNTYDVHFYSSFALVMLFPKLELSIQRDFAAAVMMHDPGTMKI 547
+IG+F YLE EY+MYNTYDV FY+SFAL ML+P LE+S+Q D A A + +
Sbjct: 884 -NHIGRFAYLESLEYLMYNTYDVMFYASFALTMLWPLLEISLQYDIADATQLDYGINWEG 942
Query: 548 MSDGKWVARKCLGAVPHDIGL--DDPWFEINSYNLFNSSRWKDLNSKFVLQVYRDFVATG 605
+ G + RK G VPHD+G +DPW +NSY + + SRWKDL SKFVLQVYRD++
Sbjct: 943 IHSGHLMPRKKQGTVPHDLGNPGEDPWKRVNSYCIQDVSRWKDLPSKFVLQVYRDYLVME 1002
Query: 606 DKNFARAVWPSVY-IAMAYMEQFDKDGDGMIENEGFPDQTYDAWSANGVSAYCGGLWVAA 664
DKNF + V + +E FD D DG+++NEGFPDQTYD W A G SAY GGLW+AA
Sbjct: 1003 DKNFLLQCYNVVEGVIQRTLEHFDFDHDGVVDNEGFPDQTYDVWPATGCSAYSGGLWLAA 1062
Query: 665 LQAASALANDVGDHASASYFWVRYQK-AKAVYDSLWNGSYFNYDNSDGSSSTSIQADQLA 723
L+ A+ +A +G + S + ++K +K+ LWNG YFNYD+S S SI +D LA
Sbjct: 1063 LKVAAEIAKILGFNEDESIYNAIFEKGSKSFTKKLWNGHYFNYDSSKSVHSDSIMSDMLA 1122
Query: 724 GQWYARACGLLPIADEAKVKKALTKIYDFNVLKVKGGMCGAMNGMQPDGRIDMSGLQARE 783
G WY +CGL + +L+ I ++NV G+CGA+NGM+P+G +D + LQ+ E
Sbjct: 1123 GHWYLLSCGLKSYMTFDQALSSLSIINEYNVKSYSKGVCGAVNGMRPEGTVDTTCLQSSE 1182
Query: 784 IWPGVTYGLAASMIQEEMVDMAFQTAAGVYEVAWSEDGLGYSFQTPESWNNNDEYRSLCY 843
+W G ++ LAA+MI M A+ G+ ++ + G+ +QTPE+W+ N +R+ Y
Sbjct: 1183 VWIGTSFSLAATMILHHMDSDAWDLVKGIVNSSYQK--WGFQYQTPEAWDQNGCFRAASY 1240
Query: 844 MRPLTIWAMQWALTKPK 860
MRPL+IW++QWAL K K
Sbjct: 1241 MRPLSIWSIQWALQKRK 1257
>gi|363744064|ref|XP_003642968.1| PREDICTED: non-lysosomal glucosylceramidase-like [Gallus gallus]
Length = 853
Score = 603 bits (1554), Expect = e-169, Method: Compositional matrix adjust.
Identities = 334/820 (40%), Positives = 469/820 (57%), Gaps = 52/820 (6%)
Query: 52 LAHIGFRLYRYS---KQEEAKGKVPVFNVFRKHHITSDQGIPLGGIGAGSIGRSYRGEFQ 108
L H+G L + K+ + K ++ + G PLGGIG+G+I R +RGEF
Sbjct: 51 LRHVGLALRYFKWWYKKTRIEKKSAFIDLLCAVPLQQIYGCPLGGIGSGTITRGWRGEFC 110
Query: 109 RFKLFHGICDDAPVLANQFSVFVSRPNGEKFSSVLCPRSPGVPKKNTDSGIESWDWNLKG 168
R++L G V+A+QF+V + R + VL P S ++ W+W G
Sbjct: 111 RWQLNPGKYHYETVIADQFTVCLRRKGQTVYQQVLSVEKP--------SALQGWNWGYCG 162
Query: 169 ENCTYHALFPRAWTVYDGEPDPELRIVCRQISPFIPHNYKESSFPASVFTFTLSNSGQTS 228
YHAL+PRAW VY+ P + + CRQISP IPH+YK+SS P VF + + N+ +
Sbjct: 163 RYAFYHALYPRAWMVYE-LPGQNVVLTCRQISPVIPHDYKDSSLPVGVFIWEVENNSEEP 221
Query: 229 ADVTLLFTWANSVAGDSGLSGHHFNSKTMTKDG---VHGLTLHHRTANGRPPVTFAVAAE 285
DV+++FT N SG H+N +DG V G+ LHH T P TFAVAA
Sbjct: 222 VDVSIMFTLQNGTGTKGDRSGGHWNEPFALQDGGERVAGVLLHHCTPVN--PFTFAVAAR 279
Query: 286 ETADVHVSECPCFLLSGNSKGITAKDMWNEIKKHGSFDHLDNDKTSPSEPGSSIGAAIAA 345
E A ++ F +G + +D+W ++ + G + K+ ++ G AA+ A
Sbjct: 280 EKAGTVITHLTAFDPAG-----SGRDVWQDLLQDGKLES-PTGKSKQTQNGEVTAAAVCA 333
Query: 346 SLTIPSGSTRSVTFSLAWDCPEVKF--FEKVYHRRYTKFYGTLGDSAARIAHDAILEHAK 403
S T+P+ ++ +L WD P V F E+++ RRYT+F+G+ GD+A ++H A+ + +
Sbjct: 334 SCTVPAQGHGTLELALVWDMPHVHFGSKERLHLRRYTRFFGSNGDAAPALSHYALTHYKE 393
Query: 404 WECEIEAWQRPILEDKRFPEWYPITLFNELYYLNAGGTIWTDGSPPMQSLATIRERKFSL 463
WE +IEAWQ+PILED + P WY LFNELY++ GGTIW D P
Sbjct: 394 WERKIEAWQKPILEDSQLPSWYKSALFNELYFMTDGGTIWLDLPPDC------------- 440
Query: 464 DTPSKIHTPASSDTALGTRLLENGEENIGQFLYLEGAEYVMYNTYDVHFYSSFALVMLFP 523
P + P ++ + +L G+F YLEG EY MYNTYDVHFY+SFALVML+P
Sbjct: 441 -LPQDLQGPGAAKLSHLLPVLRE----YGRFAYLEGQEYRMYNTYDVHFYASFALVMLWP 495
Query: 524 KLELSIQRDFAAAVMMHDPGTMKIMSDGKWVARKCLGAVPHDIGL--DDPWFEINSYNLF 581
KL++S+Q D A V+ D + + GK K VPHDIG D+PW +N+Y +
Sbjct: 496 KLQISLQYDIAVTVLNEDVQPRQYLVCGKTAQVKVKNVVPHDIGDPDDEPWQRVNAYLMH 555
Query: 582 NSSRWKDLNSKFVLQVYRDFVATGDKNFARAVWPSVYIAMAYMEQFDKDGDGMIENEGFP 641
+++ WKDLN KFVLQVYRD+ T D + + +WP M +FD D DG+IEN G
Sbjct: 556 DTANWKDLNLKFVLQVYRDYYLTHDALYLQDMWPVCQAVMESELKFDTDNDGLIENGGIA 615
Query: 642 DQTYDAWSANGVSAYCGGLWVAALQAASALANDVGDHASASYFWVRYQKAKAVYDS-LWN 700
DQTYDAW +G SAYCGGLW+AA++ +A +GD + + QK K ++ LWN
Sbjct: 616 DQTYDAWVVDGASAYCGGLWLAAVRMMCEMAEVLGDTETRQKYDAILQKGKESFERLLWN 675
Query: 701 GSYFNYDNSDGSSSTSIQADQLAGQWYARACGL----LPIADEAKVKKALTKIYDFNVLK 756
G Y+NYD+S S+S+SI +DQ AGQW+ ACGL + + ++ + AL I++ NV+
Sbjct: 676 GKYYNYDSSGSSTSSSIMSDQCAGQWFLGACGLDQKEVEVFPKSHIVSALKTIFEKNVMS 735
Query: 757 VKGGMCGAMNGMQPDGRIDMSGLQAREIWPGVTYGLAASMIQEEMVDMAFQTAAGVYEVA 816
GG GA+NGM+PDG D S +Q+ E+W GV Y LAA+MIQE +V F TA G Y
Sbjct: 736 FAGGKMGAVNGMRPDGVPDTSSVQSSEVWVGVVYALAATMIQEGLVQEGFHTAEGCYRTV 795
Query: 817 WSEDGLGYSFQTPESWNNNDEYRSLCYMRPLTIWAMQWAL 856
W LG +FQTPE++ YRSL YMRPL+IW+MQ AL
Sbjct: 796 WEH--LGMAFQTPEAYCEKKVYRSLAYMRPLSIWSMQLAL 833
>gi|156356077|ref|XP_001623757.1| predicted protein [Nematostella vectensis]
gi|156210486|gb|EDO31657.1| predicted protein [Nematostella vectensis]
Length = 783
Score = 600 bits (1547), Expect = e-168, Method: Compositional matrix adjust.
Identities = 338/829 (40%), Positives = 481/829 (58%), Gaps = 74/829 (8%)
Query: 61 RYSKQEEAKGKVPVFNVFRKHHITSDQGIPLGGIGAGSIGRSYRGEFQRFKLFHGICDDA 120
+Y+ +E+ G+VP + + G+PLGGIG G+IGR +RGEF R++L GI
Sbjct: 2 KYATKEKKNGRVPYIDPLAANPCRQVYGVPLGGIGCGTIGRGWRGEFNRWQLTPGIYSYN 61
Query: 121 PVLANQFSVFVSRPNGEKFSSVLCPRSP-GVPKKNTDSGIESWDWNLKGENCTYHALFPR 179
V ANQF V V + + +VL P P GV ++ W+W G N YHAL+PR
Sbjct: 62 YVEANQFVVCVRKKGRTTYQAVLSPNRPNGVL---CGRSLQGWNWGFSGSNAVYHALYPR 118
Query: 180 AWTVYDGEPDPELRIVCRQISPFIPHNYKESSFPASVFTFTLSNSGQTSADVTLLFTWAN 239
AWT Y+ P ++ +VCRQ+SP PH+YK++S P +VF +++ N+ +V+++F++ N
Sbjct: 119 AWTRYE-LPGQDIILVCRQVSPVFPHDYKDTSLPVAVFIWSIENNNNEEVEVSIMFSFEN 177
Query: 240 SVAGDSGLSGHHFN-----SKTMTKDGVHGLTLHHRTANGRPPVTFAVAAEETADVHVSE 294
G L+ H+N + + T V G+ LHH+ + + P T A+AA+ +H E
Sbjct: 178 G-DGTQDLTAGHYNESADTTGSNTTCDVTGVLLHHK--HHKLPYTLAIAAKA---LHKKE 231
Query: 295 CPCFLLSGNSKG-----ITAKDMWNEIKKHGSFDHLDNDKTSPSEPGSSIGAAIAASLTI 349
L++ +K + +WN++ G N T PS P S + AA
Sbjct: 232 ----LVTVTTKTWFDSRTPCRRVWNDLMDDGKL----NSSTEPSPPSRSDQSLCAAVAAT 283
Query: 350 PSGSTRS---VTFSLAWDCPEVKF--FEKVYHRRYTKFYGTLGDSAARIAHDAILEHAKW 404
+ + RS + F+LAWD P + F +K ++R YT+++G G++ + A++ + W
Sbjct: 284 TTVAARSRGELEFALAWDMPVIYFGNSKKRHYRYYTRYFGHQGNAGPALCSHALMSYPDW 343
Query: 405 ECEIEAWQRPILEDKRFPEWYPITLFNELYYLNAGGTIWTDGSPPMQSLATIRERKFSLD 464
E +IEAWQ+PIL+D+ P WY LFNELY++ GGT+W D
Sbjct: 344 ETKIEAWQKPILQDESLPNWYKSALFNELYFVADGGTVWLD------------------- 384
Query: 465 TPSKIHTPASSDTALGTRLLENGEENIGQFLYLEGAEYVMYNTYDVHFYSSFALVMLFPK 524
+ S T+ GT + IG+F YLEG EY MYNTYDVHFY+SFAL ML+PK
Sbjct: 385 ------VRSESGTSKGTADIV---RKIGRFAYLEGHEYRMYNTYDVHFYASFALAMLWPK 435
Query: 525 LELSIQRDFAAAVMMHDPGTMKIMSDGKWVARKCLGAVPHDIG--LDDPWFEINSYNLFN 582
L+LS+Q D A AV + D + M +G + +K G +PHDIG + PW +N+Y++ +
Sbjct: 436 LQLSLQYDMAHAVNVSDNQVVMTMMNGHYCRKKVPGCIPHDIGDPSEAPWDHVNAYHIHD 495
Query: 583 SSRWKDLNSKFVLQVYRDFVATGDKNFARAVWPSVYIAMAYMEQFDKDGDGMIENEGFPD 642
+S+WKDLN KFVLQVYRD+V T D + + +WP M +D DGDG+IEN G D
Sbjct: 496 TSKWKDLNLKFVLQVYRDYVFTNDVYYLQDMWPITKTVMTKSMTYDSDGDGLIENSGLAD 555
Query: 643 QTYDAWSANGVSAYCGGLWVAALQAASALAN--DVGDHASASYFWVRYQKAKAVYDSLWN 700
QT+DAW G SAYCGGLW+AAL+ + +A D D Y + + KA LWN
Sbjct: 556 QTFDAWPVTGPSAYCGGLWLAALRVMAEIATILDFPDE-RGKYEKILARGKKAYERLLWN 614
Query: 701 GSYFNYDNSDGSSSTSIQADQLAGQWYARACGLLPIADE-----AKVKKALTKIYDFNVL 755
G Y+NYD+S SI ADQL+GQWY AC L +++ V AL +++FNV+
Sbjct: 615 GKYYNYDSSTSKYHNSIMADQLSGQWYLHACDLAQTSNDRVFPSENVISALRTVFNFNVM 674
Query: 756 KVKGGMCGAMNGMQPDGRIDMSGLQAREIWPGVTYGLAASMIQEEMVDMAFQTAAGVYEV 815
K + G GA+NG++PDG++D S LQA E+W GVTY +AASMIQE +VD F+TA+G+Y
Sbjct: 675 KFQEGTMGAVNGIRPDGQLDTSSLQAEEVWTGVTYAVAASMIQEGLVDEGFKTASGIYNT 734
Query: 816 AWSEDGLGYSFQTPESWNNNDEYRSLCYMRPLTIWAMQWALTKPKLSRQ 864
+ LG +FQTPE+ N YRSL YMRPL+IWAMQWAL K K R+
Sbjct: 735 CFER--LGMNFQTPEAIVANGNYRSLAYMRPLSIWAMQWALEKRKNKRE 781
>gi|226499590|ref|NP_001146319.1| uncharacterized protein LOC100279895 [Zea mays]
gi|219886613|gb|ACL53681.1| unknown [Zea mays]
Length = 649
Score = 597 bits (1540), Expect = e-168, Method: Compositional matrix adjust.
Identities = 306/614 (49%), Positives = 407/614 (66%), Gaps = 52/614 (8%)
Query: 312 MWNEIKKHGSFD--HLDNDKTSPSEPGSSIGAAIAASLTIPSGSTRSVTFSLAWDCPEVK 369
MWN + ++G FD + + PS PG + AA++AS + +V F+LAW P+VK
Sbjct: 1 MWNTMLQNGHFDRENFSAGPSMPSSPGQKLCAAVSASTWVEPHGRCTVVFALAWSSPKVK 60
Query: 370 FFEK-VYHRRYTKFYGTLGDSAARIAHDAILEHAKWECEIEAWQRPILEDKRFPEWYPIT 428
F + Y+RRYT+FYGT SA + HDA+ ++ WE EIE WQ PIL+D+R PEWY T
Sbjct: 61 FQKGCTYNRRYTQFYGTSEKSAVNLVHDALTKYKLWEEEIEKWQNPILKDERLPEWYKFT 120
Query: 429 LFNELYYLNAGGTIWTDGSPPM---------------------QSLATIRERKFSLDTPS 467
LFNELY+L AGGT+WTDG PP + ++++R +L
Sbjct: 121 LFNELYFLVAGGTVWTDGQPPAIDEKKSPGFNHQKSSKRGTKDTNQGSVKDRHVNLAVEQ 180
Query: 468 KIH---------------------------TPASSDTALGTRLLENGEENIGQFLYLEGA 500
H S+ + + ++G E++G+FLYLEG
Sbjct: 181 VPHGGYMANGDDHSVSKFAAVHGSEMQEQINGLKSEEPIPYLISKDGPEHVGKFLYLEGV 240
Query: 501 EYVMYNTYDVHFYSSFALVMLFPKLELSIQRDFAAAVMMHDPGTMKIMSDGKWVARKCLG 560
EY+M+NTYDVHFY+SFAL+ LFPK+ELSIQRDFA AV+ D +K ++DG RK G
Sbjct: 241 EYIMWNTYDVHFYASFALLDLFPKIELSIQRDFANAVLYEDRRKVKFLADGTSGIRKVKG 300
Query: 561 AVPHDIGLDDPWFEINSYNLFNSSRWKDLNSKFVLQVYRDFVATGDKNFARAVWPSVYIA 620
AVPHD+G DPW E+N+YN+ ++S+WKDLN KFVLQ+YRDF ATGD F R VWP+V A
Sbjct: 301 AVPHDLGTHDPWHEMNAYNIHDTSKWKDLNPKFVLQIYRDFAATGDMQFGRGVWPAVCAA 360
Query: 621 MAYMEQFDKDGDGMIENEGFPDQTYDAWSANGVSAYCGGLWVAALQAASALANDVGDHAS 680
M YM+QFD+DGDG+IEN+GFPDQTYDAW+ +G+SAYCG LW+AALQAA+ +A+ +GD
Sbjct: 361 MDYMDQFDRDGDGLIENDGFPDQTYDAWTVHGISAYCGCLWLAALQAAATMAHRLGDRHF 420
Query: 681 ASYFWVRYQKAKAVYDS-LWNGSYFNYDNSDGSSSTSIQADQLAGQWYARACGLLPIADE 739
A + +++ KAKAVY++ LWNGSYFNYD+ S+S SIQADQLAGQWYA + GL P+ DE
Sbjct: 421 AEKYKLKFIKAKAVYEAKLWNGSYFNYDSGTSSNSRSIQADQLAGQWYAASSGLPPLFDE 480
Query: 740 AKVKKALTKIYDFNVLKVKGGMCGAMNGMQPDGRIDMSGLQAREIWPGVTYGLAASMIQE 799
K++ AL KI++FNV+KVKGG GA+NGM P G++D + +Q+REIW GVTY +AA+M+
Sbjct: 481 HKIRTALQKIFEFNVMKVKGGRMGAVNGMTPKGKVDETCMQSREIWTGVTYAVAANMLLH 540
Query: 800 EMVDMAFQTAAGVYEVAWSEDGLGYSFQTPESWNNNDEYRSLCYMRPLTIWAMQWALTKP 859
M F TA G++ WSE+G GY FQTPE W + YRSL YMRPL IWA+Q+A++ P
Sbjct: 541 GMEHQGFTTAEGIFTAGWSEEGYGYWFQTPEGWTTDGHYRSLVYMRPLAIWAIQYAVSPP 600
Query: 860 KLSRQEIKHEISDR 873
K + K + DR
Sbjct: 601 KAILEAPKVNLMDR 614
>gi|414871104|tpg|DAA49661.1| TPA: hypothetical protein ZEAMMB73_422318 [Zea mays]
Length = 649
Score = 597 bits (1538), Expect = e-167, Method: Compositional matrix adjust.
Identities = 306/614 (49%), Positives = 407/614 (66%), Gaps = 52/614 (8%)
Query: 312 MWNEIKKHGSFD--HLDNDKTSPSEPGSSIGAAIAASLTIPSGSTRSVTFSLAWDCPEVK 369
MWN + ++G FD + + PS PG + AA++AS + +V F+LAW P+VK
Sbjct: 1 MWNTMLQNGHFDRENFSAGPSMPSSPGQKLCAAVSASTWVEPHGRCTVVFALAWSSPKVK 60
Query: 370 FFEK-VYHRRYTKFYGTLGDSAARIAHDAILEHAKWECEIEAWQRPILEDKRFPEWYPIT 428
F + Y+RRYT+FYGT SA + HDA+ ++ WE EIE WQ PIL+D+R PEWY T
Sbjct: 61 FQKGCTYNRRYTQFYGTSEKSAVNLVHDALTKYKLWEEEIEKWQNPILKDERLPEWYKFT 120
Query: 429 LFNELYYLNAGGTIWTDGSPPM---------------------QSLATIRERKFSLDTPS 467
LFNELY+L AGGT+WTDG PP + ++++R +L
Sbjct: 121 LFNELYFLVAGGTVWTDGQPPAIDEKKSPGFNHQKSSKRGTKDTNQGSVKDRHVNLAVEQ 180
Query: 468 KIH---------------------------TPASSDTALGTRLLENGEENIGQFLYLEGA 500
H S+ + + ++G E++G+FLYLEG
Sbjct: 181 VPHGGYMANGDDHSVSKFAAVHGSEMQEQINGLKSEEPIPYLISKDGPEHVGKFLYLEGV 240
Query: 501 EYVMYNTYDVHFYSSFALVMLFPKLELSIQRDFAAAVMMHDPGTMKIMSDGKWVARKCLG 560
EY+M+NTYDVHFY+SFAL+ LFPK+ELSIQRDFA AV+ D +K ++DG RK G
Sbjct: 241 EYIMWNTYDVHFYASFALLDLFPKIELSIQRDFANAVLYEDRRKVKFLADGTSGIRKVKG 300
Query: 561 AVPHDIGLDDPWFEINSYNLFNSSRWKDLNSKFVLQVYRDFVATGDKNFARAVWPSVYIA 620
AVPHD+G DPW E+N+YN+ ++S+WKDLN KFVLQ+YRDF ATGD F R VWP+V A
Sbjct: 301 AVPHDLGTHDPWHEMNAYNIHDTSKWKDLNPKFVLQIYRDFAATGDMQFGRDVWPAVCAA 360
Query: 621 MAYMEQFDKDGDGMIENEGFPDQTYDAWSANGVSAYCGGLWVAALQAASALANDVGDHAS 680
M YM+QFD+DGDG+IEN+GFPDQTYDAW+ +G+SAYCG LW+AALQAA+ +A+ +GD
Sbjct: 361 MDYMDQFDRDGDGLIENDGFPDQTYDAWTVHGISAYCGCLWLAALQAAATMAHRLGDRHF 420
Query: 681 ASYFWVRYQKAKAVYDS-LWNGSYFNYDNSDGSSSTSIQADQLAGQWYARACGLLPIADE 739
A + +++ KAKAVY++ LWNGSYFNYD+ S+S SIQADQLAGQWYA + GL P+ DE
Sbjct: 421 AEKYKLKFIKAKAVYEAKLWNGSYFNYDSGTSSNSRSIQADQLAGQWYAASSGLPPLFDE 480
Query: 740 AKVKKALTKIYDFNVLKVKGGMCGAMNGMQPDGRIDMSGLQAREIWPGVTYGLAASMIQE 799
K++ AL KI++FNV+KVKGG GA+NGM P G++D + +Q+REIW GVTY +AA+M+
Sbjct: 481 HKIRTALQKIFEFNVMKVKGGRMGAVNGMTPKGKVDETCMQSREIWTGVTYAVAANMLLH 540
Query: 800 EMVDMAFQTAAGVYEVAWSEDGLGYSFQTPESWNNNDEYRSLCYMRPLTIWAMQWALTKP 859
M F TA G++ WSE+G GY FQTPE W + YRSL YMRPL IWA+Q+A++ P
Sbjct: 541 GMEHQGFTTAEGIFTAGWSEEGYGYWFQTPEGWTTDGHYRSLVYMRPLAIWAIQYAVSPP 600
Query: 860 KLSRQEIKHEISDR 873
K + K + DR
Sbjct: 601 KAILEAPKVNLMDR 614
>gi|414589150|tpg|DAA39721.1| TPA: hypothetical protein ZEAMMB73_266944 [Zea mays]
Length = 583
Score = 593 bits (1529), Expect = e-166, Method: Compositional matrix adjust.
Identities = 307/567 (54%), Positives = 394/567 (69%), Gaps = 23/567 (4%)
Query: 312 MWNEIKKHGSFDHLDNDKTSPSEP---GSSIGAAIAASLTIPSGSTRSVTFSLAWDCPEV 368
MW + + GSFD DN T PS P G ++ AA++AS + +V F+LAW P+V
Sbjct: 1 MWGTMVQDGSFDQ-DNLNTGPSMPSLLGDTVCAAVSASTWVEPHGRCAVVFALAWSSPQV 59
Query: 369 KFFE-KVYHRRYTKFYGTLGDSAARIAHDAILEHAKWECEIEAWQRPILEDKRFPEWYPI 427
KF + Y+RRYTKFYGT SA + DA++++ WE EI+ WQ P+L+D+R PEWY I
Sbjct: 60 KFKKGSTYYRRYTKFYGTSPRSAVNLVEDALMKYKHWEEEIDKWQTPVLQDERLPEWYKI 119
Query: 428 TLFNELYYLNAGGTIWTDGSPPMQSLATIRERKFSLDTPSKIHTPASSDTALGTRLLENG 487
TLFNELY+L AGGT+W D SL + + K + + + H +
Sbjct: 120 TLFNELYFLVAGGTVWIDSG----SLVSDADNKSNPEESNLSHEI-------------DD 162
Query: 488 EENIGQFLYLEGAEYVMYNTYDVHFYSSFALVMLFPKLELSIQRDFAAAVMMHDPGTMKI 547
+EN+G+FLYLEG EY M+ TYDVHFY+SFAL+ LFPK+ELSIQRDFA AV+ D ++
Sbjct: 163 KENVGKFLYLEGIEYFMWCTYDVHFYASFALLDLFPKIELSIQRDFARAVLREDNRRVRF 222
Query: 548 MSDGKWVARKCLGAVPHDIGLDDPWFEINSYNLFNSSRWKDLNSKFVLQVYRDFVATGDK 607
++DG W RK +GAV HD+G DPW E+N+YN+ ++SRWKDLN KFVLQ+YRDF ATGDK
Sbjct: 223 LADGTWGIRKVIGAVAHDLGAHDPWHELNAYNIHDTSRWKDLNPKFVLQIYRDFAATGDK 282
Query: 608 NFARAVWPSVYIAMAYMEQFDKDGDGMIENEGFPDQTYDAWSANGVSAYCGGLWVAALQA 667
+F + VWP+V AM YMEQFD DGDGMIEN+GFPDQTYDAW+ GVSAYCG LW+AALQA
Sbjct: 283 SFGKDVWPAVCTAMEYMEQFDHDGDGMIENDGFPDQTYDAWTVQGVSAYCGCLWLAALQA 342
Query: 668 ASALANDVGDHASASYFWVRYQKAKAVYDS-LWNGSYFNYDNSDGSSSTSIQADQLAGQW 726
A+ALA +G A R+ KAK V+++ LWNGS+FNYD+ SS SIQADQLAGQW
Sbjct: 343 AAALAGSLGHDDYAERCMTRFVKAKTVFEARLWNGSHFNYDSGTSYSSRSIQADQLAGQW 402
Query: 727 YARACGLLPIADEAKVKKALTKIYDFNVLKVKGGMCGAMNGMQPDGRIDMSGLQAREIWP 786
Y + GL P+ DE ++K L KI+D+NV++VKGG GA+NGM P+G++D + +Q+REIW
Sbjct: 403 YTASSGLPPLFDEDRIKCTLQKIFDYNVMRVKGGRMGAVNGMHPNGKVDETCMQSREIWT 462
Query: 787 GVTYGLAASMIQEEMVDMAFQTAAGVYEVAWSEDGLGYSFQTPESWNNNDEYRSLCYMRP 846
GVTY LAA+M+ M AF TA G+Y WSE+G GY FQTPE+W + YRSL YMRP
Sbjct: 463 GVTYSLAATMLLHGMEHQAFTTAEGIYTAGWSEEGYGYWFQTPEAWTIDGHYRSLIYMRP 522
Query: 847 LTIWAMQWALTKPKLSRQEIKHEISDR 873
L IWAMQWAL+ P+ + K DR
Sbjct: 523 LAIWAMQWALSPPRSILEAPKVNTMDR 549
>gi|147788372|emb|CAN61187.1| hypothetical protein VITISV_019326 [Vitis vinifera]
Length = 900
Score = 592 bits (1526), Expect = e-166, Method: Compositional matrix adjust.
Identities = 330/697 (47%), Positives = 426/697 (61%), Gaps = 93/697 (13%)
Query: 256 TMTKDGVHGLTLHHRTANGRPPVTFAVAAEETADVHVSECPCFLLSGNSKGITAKDMWNE 315
T+ + G L +RTA PPVTFA+AA ET +V V+ P F LS S ITAKDMW +
Sbjct: 157 TLDRLQKRGWVLANRTAKENPPVTFAIAACETQNVSVTVLPSFGLSEGSH-ITAKDMWGK 215
Query: 316 IKKHGSFDHLD--NDKTSPSEPGSSIGAAIAASLTIPSGSTRSVTFSLAWDCPEVKFFE- 372
+ + G FD + + ++ PS PG ++ AA++AS + +V F+LAW P+VKF +
Sbjct: 216 MVQDGQFDRENCYSGRSMPSSPGETLCAAVSASAWVEPHGKCTVAFALAWSSPKVKFLKG 275
Query: 373 KVYHRRYTKFYGTLGDSAARIAHDAILEHAKWECEIEAWQRPILEDKRFPEWYPITLFNE 432
YHRRYTK+YGT +A I HDA+ + +WE EIE WQ PIL D R PEWY TLFNE
Sbjct: 276 SSYHRRYTKYYGTSERAALNIVHDALTNYKQWEEEIEKWQSPILRDDRLPEWYKFTLFNE 335
Query: 433 LYYLNAGGTIWTDGSPPMQSLATIRERKFSLDTPSKIHTPASSDTALGTRL--------- 483
LY+L AGGT+W D S P S + +++ + T A ++ G +
Sbjct: 336 LYFLVAGGTVWIDSSLPATSSKNSLHQSAAVENTNVNVTVAKGNSRRGAAVENSVTDGYD 395
Query: 484 ------LENGEENI-------------------------------------GQFLYLEGA 500
LE EE I G+FLYLEG
Sbjct: 396 TTSRKGLEYDEEEIHTRNTCEEKPVIPQESNSHHSIHKDTLKDPQDETDDVGRFLYLEGV 455
Query: 501 EYVMYNTYDVHFYSSFALVMLFPKLELSIQRDFAAAVMMHDPGTMKIMSDGKWVARKCLG 560
EY+M+ TYDVHFY+SFAL+ LFPK+ELSIQR+FA AV+ D +K +++G W RK G
Sbjct: 456 EYIMWCTYDVHFYASFALLELFPKIELSIQREFAKAVLSEDGRRVKFLAEGNWGIRKVRG 515
Query: 561 AVPHDIGLDDPWFEINSYNLFNSSRWKDLNSKFVLQVYRDFVATGDKNFARAVWPSVYIA 620
AVPHD+G DPW E+N+YN+ ++S+WKDLN KFVLQVYRDF AT D +F VWP+V A
Sbjct: 516 AVPHDLGTHDPWHEMNAYNIHDTSQWKDLNPKFVLQVYRDFAATRDFSFGADVWPAVRAA 575
Query: 621 MAYMEQFDKDGDGMIENEGFPDQTYDAWSANGVSAYCGGLWVAALQAASALANDVGDHAS 680
M YMEQFD+D DG+IEN+GFPDQTYD W+ +G+SAYCG LW+AALQAA+A+A +GD
Sbjct: 576 MEYMEQFDRDSDGLIENDGFPDQTYDTWTVHGISAYCGCLWLAALQAAAAMALQLGDKPF 635
Query: 681 ASYFWVRYQKAKAVY-DSLWNGSYFNYDNSDGSSSTSIQADQLAGQWYARACGLLPIADE 739
A ++ KAK V+ + LWNGSYFNYD+ S+S SIQADQLAGQWY + GL + D+
Sbjct: 636 AEKCKSKFFKAKLVFEEKLWNGSYFNYDSGSSSNSKSIQADQLAGQWYTASSGLPSLFDD 695
Query: 740 AKVKKALTKIYDFNVLKVKGGMCGAMNGMQPDGRIDMSGLQAREIWPGVTYGLAASMIQE 799
K+K +L KIYDFNV+KVKGG GA+NGM P+G++D S +Q+REIW GVTYG+AA+MI
Sbjct: 696 CKIKSSLHKIYDFNVMKVKGGKMGAVNGMHPNGKVDESCMQSREIWTGVTYGVAATMILS 755
Query: 800 EMVDMAFQTAAGVYEVAWSEDGLG------------------------------------ 823
M + AF TA G++ WSE+G G
Sbjct: 756 GMEEQAFTTAEGIFTAGWSEEGYGTLDFIDDMHCVVQILNCFVHSGLCVACVILLYCTCR 815
Query: 824 YSFQTPESWNNNDEYRSLCYMRPLTIWAMQWALTKPK 860
Y FQTPE W + +RSL YMRPL IW MQWAL+ P+
Sbjct: 816 YWFQTPEGWTIDGHFRSLIYMRPLAIWGMQWALSMPR 852
>gi|253761389|ref|XP_002489100.1| hypothetical protein SORBIDRAFT_0073s002050 [Sorghum bicolor]
gi|241947399|gb|EES20544.1| hypothetical protein SORBIDRAFT_0073s002050 [Sorghum bicolor]
Length = 603
Score = 591 bits (1524), Expect = e-166, Method: Compositional matrix adjust.
Identities = 309/574 (53%), Positives = 395/574 (68%), Gaps = 17/574 (2%)
Query: 312 MWNEIKKHGSFDHLDNDKTSPSEP---GSSIGAAIAASLTIPSGSTRSVTFSLAWDCPEV 368
MW + + GSFD DN PS P G ++ AA++AS + +V F+LAW P+V
Sbjct: 1 MWGTMVQDGSFDR-DNFSAGPSMPCLLGDTVCAAVSASTWVEPHGRCTVAFALAWSSPKV 59
Query: 369 KFFE-KVYHRRYTKFYGTLGDSAARIAHDAILEHAKWECEIEAWQRPILEDKRFPEWYPI 427
KF + Y+RRYTKFYGT SA + DA++++ WE I+ WQ PIL D+R PEWY I
Sbjct: 60 KFKKGSTYYRRYTKFYGTSPRSAVNLVEDALMKYKHWEEAIDKWQTPILRDERLPEWYKI 119
Query: 428 TLFNELYYLNAGGTIWTDGSPPMQSLATIRERKFSLDTPSKIHTPASSDTALGTRLLE-- 485
TLFNELY+L AGGT+W D +SL + + K + P + P T T L
Sbjct: 120 TLFNELYFLVAGGTVWIDS----ESLVSDADNKLNPSPPEDSNFPFHDSTCNSTVPLIGF 175
Query: 486 -----NGEENIGQFLYLEGAEYVMYNTYDVHFYSSFALVMLFPKLELSIQRDFAAAVMMH 540
+ +EN+G+FLYLEG EY M+ TYDVHFY+SFAL+ LFPK+ELSIQRDFA AV+
Sbjct: 176 DPHEIDDKENVGKFLYLEGIEYFMWCTYDVHFYASFALLDLFPKIELSIQRDFARAVLRE 235
Query: 541 DPGTMKIMSDGKWVARKCLGAVPHDIGLDDPWFEINSYNLFNSSRWKDLNSKFVLQVYRD 600
D ++ ++DG W RK +GAV HD+G DPW E+N+YN+ ++SRWKDLN KFVLQ+YRD
Sbjct: 236 DNRRVRFLADGTWGIRKVIGAVAHDLGAHDPWHELNAYNIHDTSRWKDLNPKFVLQIYRD 295
Query: 601 FVATGDKNFARAVWPSVYIAMAYMEQFDKDGDGMIENEGFPDQTYDAWSANGVSAYCGGL 660
F ATGDK+F + VWP+V AM YMEQFD DGD MIEN+GFPDQTYDAW+ GVSAYCG L
Sbjct: 296 FAATGDKSFGKDVWPAVCTAMEYMEQFDHDGDDMIENDGFPDQTYDAWTVQGVSAYCGCL 355
Query: 661 WVAALQAASALANDVGDHASASYFWVRYQKAKAVYDS-LWNGSYFNYDNSDGSSSTSIQA 719
W+AALQAA+ALA +G A R+ KAK+V+++ LWNGSYFNYD+ +S SIQA
Sbjct: 356 WLAALQAAAALARSLGHDDYAERCMTRFAKAKSVFEARLWNGSYFNYDSGTSYNSRSIQA 415
Query: 720 DQLAGQWYARACGLLPIADEAKVKKALTKIYDFNVLKVKGGMCGAMNGMQPDGRIDMSGL 779
DQLAGQWY + GL P+ DE ++K L KI+D+NV++VKGG GA+NGM P+G++D + +
Sbjct: 416 DQLAGQWYTASSGLPPLFDEDRIKCTLQKIFDYNVMRVKGGRMGAVNGMHPNGKVDDTCM 475
Query: 780 QAREIWPGVTYGLAASMIQEEMVDMAFQTAAGVYEVAWSEDGLGYSFQTPESWNNNDEYR 839
Q+REIW GVTY LAA+M+ M AF TA G+Y WSE+G GY FQTPE+W + YR
Sbjct: 476 QSREIWTGVTYSLAATMLLHGMEHQAFTTAEGIYIAGWSEEGYGYWFQTPEAWTIDGHYR 535
Query: 840 SLCYMRPLTIWAMQWALTKPKLSRQEIKHEISDR 873
SL YMRPL IWAMQWAL+ P+ + K DR
Sbjct: 536 SLIYMRPLAIWAMQWALSPPRSILEAPKVNTMDR 569
>gi|405977906|gb|EKC42333.1| Non-lysosomal glucosylceramidase [Crassostrea gigas]
Length = 1464
Score = 590 bits (1520), Expect = e-165, Method: Compositional matrix adjust.
Identities = 328/863 (38%), Positives = 483/863 (55%), Gaps = 55/863 (6%)
Query: 10 ENAASMPKVDPGQPPQLTWQRKL--DSNVKVPTGFTLSFQEFRHLAHIGFRLYRYSKQEE 67
+N+ S PK P W+ L + +K F + +G R + +
Sbjct: 635 DNSISTPKTRIEGIPDFGWRVNLNYECGIKCKPFTQPRFSQVIGFIGLGIRYAKEWWKRR 694
Query: 68 AKGKVPVFNVFRKHHITSDQGIPLGGIGAGSIGRSYRGEFQRFKLFHGICDDAPVLANQF 127
+G+ + + + G P+GGIG G++GR YRGEF RF++ GI D + ANQF
Sbjct: 695 RQGRKMFIDHMDQLFHKAIYGAPIGGIGCGTMGRGYRGEFARFQMVPGIYDHTVIQANQF 754
Query: 128 SVFVSRPNGEK-FSSVLCPRSPGVPKKNTDSGIESWDWNLKGENCTYHALFPRAWTVYDG 186
+ R NG+ + VL ++ + + SW+W E YHAL+PR+WTVY
Sbjct: 755 -ILCIRKNGQTVYQKVL------TGQRESSKSLRSWEWQAPREGDVYHALYPRSWTVYHL 807
Query: 187 EPDPELRIVCRQISPFIPHNYKESSFPASVFTFTLSNSGQTSADVTLLFTWANSVA---- 242
+ + LR++CRQ+SP PH+YK++SFP + F +T N+G DV+++FT+ N
Sbjct: 808 D-EFNLRLICRQVSPIFPHDYKDTSFPMATFIWTAENNGTEPLDVSIMFTFKNGRGVKDD 866
Query: 243 GDSGLSGHHFNSKTMTKDGVHGLTLHHRTANGRPPVTFAVAAEETADVHVSECPCFLLSG 302
G+ G H F+ + + V G++++H + + T+ + DV V+ F G
Sbjct: 867 GNGGCHNHVFDENSEGRQ-VSGVSINHNIQDMK--CTYCIGGVHKDDVSVTCKEYFDPRG 923
Query: 303 NSKGITAKDMWNEIKKHGSFDHLDNDKTSPSEPGSSIGAAIAASLTIPSGSTRSVTFSLA 362
T +++W ++K+ G + + KT+ + G AA+ +P + + F+L
Sbjct: 924 -----TGQEVWEDLKEDGQLN-TEGGKTTCTGKGEESAAAVCVKANVPPKQCKDLEFTLT 977
Query: 363 WDCPEVKFF--EKVYHRRYTKFYGTLGDSAARIAHDAILEHAKWECEIEAWQRPILEDKR 420
WD P + F E+ Y RRY++++G G++ R++ A ++ WE +IE WQ+P+L++K
Sbjct: 978 WDMPIIHFKAKERFYARRYSRWFGLDGEAGPRMSSYAAANYSTWEQKIEEWQKPVLQNKN 1037
Query: 421 FPEWYPITLFNELYYLNAGGTIWTDGSPPMQSLATIRERKFSLDTPSKIHTPASSDTALG 480
P WY LFNELYY++ GGT+W D P++ R P +
Sbjct: 1038 LPAWYKSALFNELYYVSDGGTVWID---PVEGQGGKLVRVTDPKDPPIV----------- 1083
Query: 481 TRLLENGEENIGQFLYLEGAEYVMYNTYDVHFYSSFALVMLFPKLELSIQRDFAAAVMMH 540
+ +F YLEG EY MYNTYDVH Y+SF+L+ML+PKL++S+Q D A AV
Sbjct: 1084 --------QEFSRFGYLEGHEYRMYNTYDVHHYASFSLIMLWPKLQISLQYDIATAVESE 1135
Query: 541 DPGTMKIMSDGKWVARKCLGAVPHDIG--LDDPWFEINSYNLFNSSRWKDLNSKFVLQVY 598
D +K ++ G+ RK L AVPHD+G D+PW +N Y + + WKDLN KFVLQ Y
Sbjct: 1136 DSTRIKFLTSGEVGIRKRLHAVPHDMGDPEDEPWNRVNCYVIHPTCNWKDLNMKFVLQTY 1195
Query: 599 RDFVATGDKNFARAVWPSVYIAMAYMEQFDKDGDGMIENEGFPDQTYDAWSANGVSAYCG 658
RD+ AT D + R ++P M +++D D DG+I+N GF DQT+DAW+ G SAYCG
Sbjct: 1196 RDYSATKDDAYLRHMYPVAKYVMETAKKWDVDDDGIIDNGGFADQTFDAWTMTGASAYCG 1255
Query: 659 GLWVAALQAASALANDVGDHASASYFWVRYQKAKAVY-DSLWNGSYFNYDNSDGSSSTSI 717
G+W+AAL+ +A + ++ K K Y + LWNGSY+NYD S G SI
Sbjct: 1256 GMWLAALRMMIEMATILKKEDDVLHYKEILDKGKVSYQNKLWNGSYYNYDCSSGGHHDSI 1315
Query: 718 QADQLAGQWYARACGLL--PIADEAKVKKALTKIYDFNVLKVKGGMCGAMNGMQPDGRID 775
ADQLAG W+ +A GL I +VK +L KI++ NV+ +GG GA+NG +PDG D
Sbjct: 1316 MADQLAGHWFLKASGLQDDDIFPPDRVKSSLKKIFENNVMMFEGGNMGAINGTRPDGSKD 1375
Query: 776 MSGLQAREIWPGVTYGLAASMIQEEMVDMAFQTAAGVYEVAWSEDGLGYSFQTPESWNNN 835
+S Q+ E W GVTYGLAA+MIQE ++D AFQTA G Y V W GL +FQTPE++ +
Sbjct: 1376 ISSCQSEEFWVGVTYGLAANMIQEGLLDQAFQTAWGAYHVCWEWYGL--AFQTPEAYMTD 1433
Query: 836 DEYRSLCYMRPLTIWAMQWALTK 858
+ YRSL YMRPL IW+MQWAL K
Sbjct: 1434 NIYRSLGYMRPLAIWSMQWALEK 1456
>gi|344271640|ref|XP_003407645.1| PREDICTED: non-lysosomal glucosylceramidase [Loxodonta africana]
Length = 967
Score = 588 bits (1516), Expect = e-165, Method: Compositional matrix adjust.
Identities = 335/820 (40%), Positives = 451/820 (55%), Gaps = 54/820 (6%)
Query: 44 LSFQEFRHLAHIGFRLYRYSKQE-EAKGKVPVFNVFRKHHITSDQGIPLGGIGAGSIGRS 102
+S L +GFR +++ Q+ + K P ++ + G PLGGIG G+I R
Sbjct: 150 ISLSNMIKLVGVGFRYFQWWYQKTRVEKKTPFIDMLNSVPLRQIYGCPLGGIGGGTITRG 209
Query: 103 YRGEFQRFKLFHGICDDAPVLANQFSVFVSRPNGEKFSSVLCPRSPGVPKKNTDSGIESW 162
+RG+F R++L G+ V+A+QF+V + R + VL P V + SW
Sbjct: 210 WRGQFCRWQLNPGMYQHQTVIADQFTVCLRREGQTVYQQVLSLERPSV--------LRSW 261
Query: 163 DWNLKGENCTYHALFPRAWTVYDGEPDPELRIVCRQISPFIPHNYKESSFPASVFTFTLS 222
+W L G YHAL+PRAWTVY P + + CRQI+P +PH+Y++SS P VF + +
Sbjct: 262 NWGLCGFFAFYHALYPRAWTVYQ-LPGQNVTLTCRQITPILPHDYQDSSLPVGVFVWDVE 320
Query: 223 NSGQTSADVTLLFTWANSVAGDSGLSGHHFNSK-TMTKDG--VHGLTLHHRTANGRPPVT 279
N G + DV+++FT N + G G +N + +DG V GL LHH T P T
Sbjct: 321 NEGDEALDVSIMFTMRNGLGGRDDAPGGLWNEPFCLERDGETVQGLLLHHPTPPN--PYT 378
Query: 280 FAVAAEETADVHVSECPCFLLSGNSKGITAKDMWNEIKKHGSFDHLDNDKTSPSEPGSSI 339
A+AA TAD V+ F T + +W ++ + G D +++PS+ G I
Sbjct: 379 MAMAARLTADTTVTHITAF-----DPDSTGQQVWQDLLQDGQLDS-PAGRSTPSQKGVGI 432
Query: 340 GAAIAASLTIPSGSTRSVTFSLAWDCPEVKFFEK--VYHRRYTKFYGTLGDSAARIAHDA 397
A+ + +P + FSLAWD P + F K VY+RRYT+F+G G +A ++H A
Sbjct: 433 AGAVCVAAKLPPRGRCCLEFSLAWDMPRIVFGAKGQVYYRRYTRFFGQDGSAAPALSHYA 492
Query: 398 ILEHAKWECEIEAWQRPILEDKRFPEWYPITLFNELYYLNAGGTIWTDGSPPMQSLATIR 457
+ +A WE I AWQ PILED+ P WY LFNELY+L GGT+W
Sbjct: 493 LCRYADWEERISAWQSPILEDRSLPAWYKSALFNELYFLADGGTVW-------------- 538
Query: 458 ERKFSLDTPSKIHTPASSDTALGTRLLENGEENIGQFLYLEGAEYVMYNTYDVHFYSSFA 517
L+ P T + G R L G+F YLEG EY MYNTYDVHFY+SFA
Sbjct: 539 -----LEVPEDALT---EEPEGGMRQLRPMLREYGRFGYLEGQEYRMYNTYDVHFYASFA 590
Query: 518 LVMLFPKLELSIQRDFAAAVMMHDPGTMKIMSDGKWVARKCLGAVPHDIGL--DDPWFEI 575
L+ML+PKLELS+Q D A A + D + + G K +PHDIG D+PW +
Sbjct: 591 LIMLWPKLELSLQYDMALATLKEDLTPRRYLMSGLTAPVKRRNVIPHDIGDPDDEPWLRV 650
Query: 576 NSYNLFNSSRWKDLNSKFVLQVYRDFVATGDKNFARAVWPSVYIAMAYMEQFDKDGDGMI 635
N+Y + +++ WKDLN KFVLQVYRD+ TGD+ F R +WP M +FDKD DG+I
Sbjct: 651 NAYVIHDTADWKDLNLKFVLQVYRDYYLTGDQGFLRDMWPVCLAVMESEMEFDKDHDGLI 710
Query: 636 ENEGFPDQTYDAWSANGVSAYCGGLWVAALQAASALANDVGDHASASYFWVRYQKAKAVY 695
EN G+ DQTYD W G SAYCGGLW+AA+ +A F + K Y
Sbjct: 711 ENGGYADQTYDGWVTTGPSAYCGGLWLAAVAVMVQMATLCEAQDIQEKFSSILSRGKEAY 770
Query: 696 DS-LWNGSYFNYDNSDGSSSTSIQADQLAGQWYARACGL----LPIADEAKVKKALTKIY 750
+ LWNG Y+NYD+S S SI +DQ AGQW+ RACGL + V +AL I+
Sbjct: 771 ERLLWNGRYYNYDSSSQPQSRSIMSDQCAGQWFLRACGLGEGDTEVFPTPHVVRALQTIF 830
Query: 751 DFNVLKVKGGMCGAMNGMQPDGRIDMSGLQAREIWPGVTYGLAASMIQEEMVDMAFQTAA 810
+FNV GG GA+NGMQP G D S +Q+ E+W GV YGLAA+MIQE + F TA
Sbjct: 831 EFNVQAFAGGAMGAVNGMQPHGVPDTSSVQSDEVWVGVVYGLAATMIQEGLTSKGFWTAE 890
Query: 811 GVYEVAWSEDGLGYSFQTPESWNNNDEYRSLCYMRPLTIW 850
G Y W LG +FQTPE++ +RSL YMRPL+IW
Sbjct: 891 GCYRTVWER--LGLAFQTPEAYCQQRVFRSLAYMRPLSIW 928
>gi|334333160|ref|XP_003341682.1| PREDICTED: LOW QUALITY PROTEIN: non-lysosomal
glucosylceramidase-like [Monodelphis domestica]
Length = 950
Score = 587 bits (1514), Expect = e-165, Method: Compositional matrix adjust.
Identities = 332/827 (40%), Positives = 463/827 (55%), Gaps = 60/827 (7%)
Query: 48 EFRHLAHIGFRLYRYSKQE-EAKGKVPVFNVFRKHHITSDQGIPLGGIGAGSIGRSYRGE 106
+ +HL +GFR ++ Q+ + + K+P ++ + G PLGGIG G+I R +RG+
Sbjct: 115 KMKHLG-MGFRYLKWWYQKTQVEKKIPFIDMLNFLPLRQIYGCPLGGIGGGTITRGWRGQ 173
Query: 107 FQRFKLFHGICDDAPVLANQFSVFVSRPNGEKFSSVLCPRSPGVPKKNTDSGIESWDWNL 166
F R++L GI V+A+QF+V + R + VL P V + SW+W L
Sbjct: 174 FCRWQLNPGIYQHQTVIADQFTVCLRRGGQTVYQQVLSLERPSV--------LRSWNWGL 225
Query: 167 KGENCTYHALFPRAWTVYDGEPDPELRIVCRQISPFIPHNYKESSFPASVFTFTLSNSGQ 226
G YHAL+PRAWTVY P ++ + CRQI+P +PH+Y++SS P VF + + N+
Sbjct: 226 CGFFAFYHALYPRAWTVYQ-LPGQQVTLTCRQITPILPHDYQDSSLPVGVFVWDVENNSD 284
Query: 227 TSADVTLLFTWANSVAGDSGLSGHHFNSK-TMTKDG--VHGLTLHHRTANGRPPVTFAVA 283
D++++F+ N + G G +N + DG V GL LHH N P T A+A
Sbjct: 285 EDLDISIMFSMRNGLGGGHDSPGGLWNEPFCLEHDGETVQGLLLHH--PNPPNPYTMALA 342
Query: 284 AEETADVHVSECPCFLLSGNSKGITAKDMWNEIKKHGSFDHLDNDKTSPSEPGSSIGAAI 343
A TA+ V+ F T + +W ++ D + P++ G IG A+
Sbjct: 343 ARLTANTTVTHITAF-----DPXCTGQKVWQDLLPRXQLDSPLAEWGPPTQKGQGIGGAV 397
Query: 344 AASLTIPSGSTRSVTFSLAWDCPEVKFFEK--VYHRRYTKFYGTLGDSAARIAHDAILEH 401
A +P + F+LAWD P++ F + ++RRYT+F+G G++A + H A+ ++
Sbjct: 398 CAGTHLPRRGRCQLEFALAWDMPKILFGARGQAHYRRYTRFFGQEGNAAPALCHYALSQY 457
Query: 402 AKWECEIEAWQRPILEDKRFPEWYPITLFNELYYLNAGGTIWTDGSPPMQSLATIRERKF 461
WE IEAWQ P+LED+ P WY LFNELY+L GGT+W
Sbjct: 458 RSWEERIEAWQNPVLEDRSLPAWYKSALFNELYFLADGGTMW------------------ 499
Query: 462 SLDTPSKIHTPASSDTA-LGTRL--LENGEENIGQFLYLEGAEYVMYNTYDVHFYSSFAL 518
+ PA + T G L L+ + G+F YLEG EY MYNTYDVHFY+SFAL
Sbjct: 500 -------VEVPAETTTEEFGGILNQLQPTLQEYGRFAYLEGQEYRMYNTYDVHFYASFAL 552
Query: 519 VMLFPKLELSIQRDFAAAVMMHDPGTMKIMSDGKWVARKCLGAVPHDIGL--DDPWFEIN 576
+ML+PKLELS+Q D A A + D K + G K +PHD+G D+PW IN
Sbjct: 553 IMLWPKLELSLQYDMAVATLSEDLTRRKYLMSGVMAPVKKKNVIPHDVGDPDDEPWLRIN 612
Query: 577 SYNLFNSSRWKDLNSKFVLQVYRDFVATGDKNFARAVWPSVYIAMAYMEQFDKDGDGMIE 636
+Y + +++ WKDLN KFVLQ+YRDF TG+++F R +WP M +FDKD DG+IE
Sbjct: 613 AYLIHDTADWKDLNLKFVLQIYRDFYLTGNESFLRDMWPVCQAVMESEMKFDKDQDGLIE 672
Query: 637 NEGFPDQTYDAWSANGVSAYCGGLWVAALQAASALANDVGDHASASYFWVRYQKAKAVYD 696
N G+PDQTYD W G SAYCGGLW+AA+ +A G F + + Y+
Sbjct: 673 NGGYPDQTYDGWITTGPSAYCGGLWLAAVAVMVQMATVCGTQDIQEKFLSILSRGREAYE 732
Query: 697 S-LWNGSYFNYDNSDGSSSTSIQADQLAGQWYARACGL----LPIADEAKVKKALTKIYD 751
LWNG Y+NYD+S S I +DQ AGQW+ RACGL + V +AL I++
Sbjct: 733 RLLWNGRYYNYDSSTQPQSRIIMSDQCAGQWFLRACGLGEGDTEVFPTPHVIRALQTIFE 792
Query: 752 FNVLKVKGGMCGAMNGMQPDGRIDMSGLQAREIWPGVTYGLAASMIQEEMVDMAFQTAAG 811
NV GG GA+NGMQPDG D S +Q+ E+W GV YGLAA+MIQE ++ F+TA G
Sbjct: 793 VNVQGFAGGAMGAVNGMQPDGIPDTSSVQSDEVWVGVVYGLAATMIQEGLIQEGFRTAEG 852
Query: 812 VYEVAWSEDGLGYSFQTPESWNNNDEYRSLCYMRPLTIWAMQWALTK 858
Y W LG +FQTPE++ + +RSL YMRPL+IWAMQ AL +
Sbjct: 853 CYRTVWER--LGMAFQTPEAYCQHRVFRSLAYMRPLSIWAMQLALQQ 897
>gi|426220232|ref|XP_004004320.1| PREDICTED: non-lysosomal glucosylceramidase [Ovis aries]
Length = 918
Score = 586 bits (1511), Expect = e-164, Method: Compositional matrix adjust.
Identities = 339/817 (41%), Positives = 461/817 (56%), Gaps = 55/817 (6%)
Query: 49 FRHLAHIGFR-LYRYSKQEEAKGKVPVFNVFRKHHITSDQGIPLGGIGAGSIGRSYRGEF 107
+HL +G R L + ++ + + K P ++ + G PLGGIG G+I R +RG+F
Sbjct: 102 IKHLG-MGLRYLQWWYRKTQVEKKKPFIDLINSVPLRQIYGCPLGGIGGGTITRGWRGQF 160
Query: 108 QRFKLFHGICDDAPVLANQFSVFVSRPNGEKFSSVLCPRSPGVPKKNTDSGIESWDWNLK 167
R++L G+ V+ANQF+V + R + VL P V + SW+W L
Sbjct: 161 CRWQLNPGMYQHQTVIANQFTVCLRRRGQTVYQQVLSVERPSV--------LRSWNWGLC 212
Query: 168 GENCTYHALFPRAWTVYDGEPDPELRIVCRQISPFIPHNYKESSFPASVFTFTLSNSGQT 227
G YHAL+PRAWTVY P + + CRQI+P +PH+Y++SS P VF + + N G
Sbjct: 213 GYFAFYHALYPRAWTVYQ-LPGQNVTLTCRQITPILPHDYQDSSLPVGVFVWDVENEGDE 271
Query: 228 SADVTLLFTWANSVAGDSGLSGHHFNSK-TMTKDG--VHGLTLHHRTANGRPPVTFAVAA 284
+ +V+++F+ N + G+ G +N + +DG V GL LHH T P T AVAA
Sbjct: 272 ALEVSIMFSMRNGLGGEDDAPGGLWNEPFCLERDGETVQGLLLHHPTPPN--PYTMAVAA 329
Query: 285 EETADVHVSECPCFLLSGNSKGITAKDMWNEIKKHGSFDHLDNDKTSPSEPGSSIGAAIA 344
+AD V+ F +S G + +W ++ G D L K+ PS+ G I A+
Sbjct: 330 RLSADTTVTHLTAF--DPDSAG---QQVWQDLLLDGQLDSLAG-KSLPSQKGVGIAGAVC 383
Query: 345 ASLTIPSGSTRSVTFSLAWDCPEVKFFEK--VYHRRYTKFYGTLGDSAARIAHDAILEHA 402
S +P + + FSLAWD P + F K VY+RRYT+F+G GD+A ++H A+ +A
Sbjct: 384 VSGKLPPRGRQRLEFSLAWDMPRIMFGAKGRVYYRRYTRFFGPDGDAAPALSHYALSHYA 443
Query: 403 KWECEIEAWQRPILEDKRFPEWYPITLFNELYYLNAGGTIWTDGSPPMQSLATIRERKFS 462
WE +I AWQ P+LED+ P WY LFNELY+L GGT+W
Sbjct: 444 DWEEKISAWQNPVLEDRSLPAWYKSALFNELYFLADGGTVW------------------- 484
Query: 463 LDTPSKIHTPASSDTALGTRLLENGEENIGQFLYLEGAEYVMYNTYDVHFYSSFALVMLF 522
L+ P P + ++G L + G+F YLEG EY MYNTYDVHFY+SFALVML+
Sbjct: 485 LEVPED-SLPEALVGSMGQ--LRPLLQEYGRFAYLEGQEYRMYNTYDVHFYASFALVMLW 541
Query: 523 PKLELSIQRDFAAAVMMHDPGTMKIMSDGKWVARKCLGAVPHDIGL--DDPWFEINSYNL 580
PKLELS+Q D A A + D + + G K +PHD+G D+PW +N+Y +
Sbjct: 542 PKLELSLQYDMALATLREDLTQRRFLMSGVTAPVKRRNVIPHDVGDPDDEPWLRVNAYEI 601
Query: 581 FNSSRWKDLNSKFVLQVYRDFVATGDKNFARAVWPSVYIAMAYMEQFDKDGDGMIENEGF 640
++ WKDLN KFVLQVYRD+ TGD+ F R +WP M +FDKD DG+IEN G+
Sbjct: 602 HDTGDWKDLNLKFVLQVYRDYYLTGDQCFLRDMWPVCLAVMESEMKFDKDHDGLIENGGY 661
Query: 641 PDQTYDAWSANGVSAYCGGLWVAALQAASALANDVGDHASASYFWVRYQKAKAVYDS-LW 699
DQTYD W G SAYCGGLW+AA+ +A G + + F + + YD LW
Sbjct: 662 ADQTYDGWVTTGPSAYCGGLWLAAVAVMVQMAALCGAQEAQNKFSSILSRGQEAYDRLLW 721
Query: 700 NGSYFNYDNSDGSSSTSIQADQLAGQWYARACGLLPIADEA----KVKKALTKIYDFNVL 755
NG Y+NYD S S SI +DQ AGQW+ RA GL E +V +AL I++FNV
Sbjct: 722 NGRYYNYDCSSQPQSYSIMSDQCAGQWFLRASGLGEGDSEVFPTQQVVRALQTIFEFNVQ 781
Query: 756 KVKGGMCGAMNGMQPDGRIDMSGLQAREIWPGVTYGLAASMIQEEMVDMAFQTAAGVYEV 815
GG GA+NGMQP G D S +Q+ E+W GV YGLAA+MIQE + F+TA G Y
Sbjct: 782 AFAGGAMGAVNGMQPRGVPDTSSVQSDEVWVGVVYGLAATMIQEGLTWEGFRTAEGCYRT 841
Query: 816 AWSEDGLGYSFQTPESWNNNDEYRSLCYMRPLTIWAM 852
W LG +FQTPE++ +RSL YMRPL+IWAM
Sbjct: 842 VWER--LGLAFQTPEAYCQRRVFRSLAYMRPLSIWAM 876
>gi|410925034|ref|XP_003975986.1| PREDICTED: non-lysosomal glucosylceramidase-like [Takifugu
rubripes]
Length = 827
Score = 586 bits (1510), Expect = e-164, Method: Compositional matrix adjust.
Identities = 334/824 (40%), Positives = 462/824 (56%), Gaps = 53/824 (6%)
Query: 47 QEFRHLAHIGFR-LYRYSKQEEAKGKVPVFNVFRKHHITSDQGIPLGGIGAGSIGRSYRG 105
Q + HL +G R LY + K+ + K P ++F + G PLGGIG G+I R +RG
Sbjct: 34 QVWEHLG-LGIRYLYWWYKKTRVEKKAPFIDIFGAVPLRQIYGAPLGGIGGGTITRGWRG 92
Query: 106 EFQRFKLFHGICDDAPVLANQFSVFVSRPNGEKFSSVLCPRSPGVPKKNTDSGIESWDWN 165
EF R++L G+ V+ NQF V + R + VL P ++ W+W
Sbjct: 93 EFCRWQLKPGMYHYKTVIENQFMVCLRRNGQTVYQQVLSVERPPT--------LQGWNWG 144
Query: 166 LKGENCTYHALFPRAWTVYDGEPDPELRIVCRQISPFIPHNYKESSFPASVFTFTLSNSG 225
GE YHAL+PRAW VY P + + CRQISP IPH+Y+++S P +VF + + N
Sbjct: 145 FCGEYAYYHALYPRAWNVYH-LPGQNVTLTCRQISPIIPHDYQDASLPVAVFVWDIENKN 203
Query: 226 QTSADVTLLFTWANSVAGDSGLSGHHFNSK-TMTKDG--VHGLTLHHRTANGRPPVTFAV 282
+ DV+++FT N SG H+N + K+G V G+ LHH TA P T +
Sbjct: 204 DYALDVSIMFTLLNGSGHADDKSGGHWNEPFRLEKNGEAVSGVLLHHCTAVN--PYTLCI 261
Query: 283 AAEETADVHVSECPCFLLSGNSKGITAKDMWNEIKKHGSFDHLDNDKTSPSEPGSSIGAA 342
AA E V+ F G G +W+++ G D + P+ G + AA
Sbjct: 262 AAREQPKREVTHQTAFSPKGTCHG-----LWSDLITDGRLDS-PTGSSPPTAKGERVAAA 315
Query: 343 IAASLTIPSGSTRSVTFSLAWDCPEVKFF--EKVYHRRYTKFYGTLGDSAARIAHDAILE 400
+AA ++ + S S+ F LAWD P++ F EK + RRYT++YG+ GD+ ++H A+
Sbjct: 316 LAAGCSVATQSRGSLEFCLAWDMPKITFGSGEKEHSRRYTRYYGSKGDACPLLSHHALTH 375
Query: 401 HAKWECEIEAWQRPILEDKRFPEWYPITLFNELYYLNAGGTIWTDGSPPMQSLATIRERK 460
+ WE IE WQ PIL+D P WY LFNELY++ GGT+WT+ + +R +
Sbjct: 376 YRGWEKSIEEWQGPILQDSSLPTWYKSALFNELYFVADGGTVWTELAEDADVSGGLRSEE 435
Query: 461 FSLDT-PSKIHTPASSDTALGTRLLENGEENIGQFLYLEGAEYVMYNTYDVHFYSSFALV 519
L PS I ++ G+F YLEG EY MYNTYDVH+Y+SFAL+
Sbjct: 436 GGLPAQPSVI-------------------KDYGRFAYLEGQEYRMYNTYDVHYYASFALI 476
Query: 520 MLFPKLELSIQRDFAAAVMMHDPGTMKIMSDGKWVARKCLGAVPHDIG--LDDPWFEINS 577
ML+PKL LS+Q D A +V+ DP + +G+ K G VPHDIG D+PW +N+
Sbjct: 477 MLWPKLALSLQYDIAGSVVQQDPTERLYLMNGRCSPVKTKGVVPHDIGDPEDEPWQRVNA 536
Query: 578 YNLFNSSRWKDLNSKFVLQVYRDFVATGDKNFARAVWPSVYIAMAYMEQFDKDGDGMIEN 637
Y + N++ WKDLN KFVLQVYRDF T D + + +WP M +FD DGDG+IEN
Sbjct: 537 YLIHNTADWKDLNLKFVLQVYRDFHLTQDSQYLQDMWPICQTVMESALKFDLDGDGLIEN 596
Query: 638 EGFPDQTYDAWSANGVSAYCGGLWVAALQAASALANDVGDHASASYFWVRYQKAKAVYDS 697
GF DQTYD W+ G SAYCGGLW+A+L +A + + Y+ + YD
Sbjct: 597 SGFADQTYDGWTVTGPSAYCGGLWLASLVVMCKMARLLENEMKYQYYRDILDRGSVAYDK 656
Query: 698 -LWNGSYFNYDNSDGSSSTSIQADQLAGQWYARACGLLPIADEA----KVKKALTKIYDF 752
LWNG Y+NYD+S S S+ +DQ AG W+ RA GL +A K+++AL ++D
Sbjct: 657 VLWNGKYYNYDSSGKDHSNSVMSDQCAGHWFLRASGLGDGEYQAFPKEKIQRALKTVFDL 716
Query: 753 NVLKVKGGMCGAMNGMQPDGRIDMSGLQAREIWPGVTYGLAASMIQEEMVDMAFQTAAGV 812
NV+ GG GA+NGM+P+G D S +Q+ E+W GV YGLAA+MI E M++ +TA G
Sbjct: 717 NVMSFAGGQMGAVNGMRPEGVPDRSSVQSDEVWIGVVYGLAATMIHEGMLEEGMRTAEGC 776
Query: 813 YEVAWSEDGLGYSFQTPESWNNNDEYRSLCYMRPLTIWAMQWAL 856
Y W LG +FQTPE++ YRSL YMRPL+IWAMQ AL
Sbjct: 777 YRTVWER--LGMAFQTPEAYCEKGIYRSLAYMRPLSIWAMQLAL 818
>gi|291241331|ref|XP_002740565.1| PREDICTED: hypothetical protein [Saccoglossus kowalevskii]
Length = 792
Score = 584 bits (1506), Expect = e-164, Method: Compositional matrix adjust.
Identities = 327/852 (38%), Positives = 472/852 (55%), Gaps = 91/852 (10%)
Query: 20 PGQPPQLTWQRKLD---SNVKVPTGFTLSFQEFRHLAHIGFRLYRYSKQEEAKGKVPVFN 76
P P+ W+ KLD + + P +L ++ L + FR Y + KG+ P +
Sbjct: 14 PSTVPEFGWRIKLDHVFAEKRAPLR-SLRLKQIPELLPMFFRYAWYCRGVTKKGRRPYID 72
Query: 77 VFRKHHITSDQGIPLGGIGAGSIGRSYRGEFQRFKLFHGICDDAPVLANQFSVFVSRPNG 136
T G+PLGGIG G I R ++G+F R+ L G+ V +QF+V + R
Sbjct: 73 QIHPLQSTQSYGVPLGGIGCGCINRGWKGDFCRWSLIPGMYTHDTVQTDQFTVCIRRGGR 132
Query: 137 EKFSSVLCPRSPGVPKKNTDSGIESWDWNLKGENCTYHALFPRAWTVYDGEPDPELRIVC 196
+ VL P TD + +W+W TYHAL+PRAWTVY P + ++C
Sbjct: 133 TTYQQVLSTSRP------TDQNLNTWNWGFNPSCATYHALYPRAWTVYH-LPGQNVTLLC 185
Query: 197 RQISPFIPHNYKESSFPASVFTFTLSNSGQTSADVTLLFTWANSVAGDSGLSGHHFNSK- 255
RQ+SP +PH+YK++S P +VF + + N G DVT++F+W N S SG +N
Sbjct: 186 RQVSPILPHDYKDTSLPVAVFVWEVYNHGNEPIDVTIMFSWQNGTGQKSDKSGGRWNKPF 245
Query: 256 TMTKDGVHGLTLHHRTANGRPPV--TFAVAAEETADVHVSECPCFLLSGNSKGITAKDMW 313
TM + V G+ LH NG P + T A++A DV VS F SG+ ++W
Sbjct: 246 TMNEGLVDGVLLH----NGHPTISCTMAISAVCKDDVSVSHKTAFDPSGD-----GLELW 296
Query: 314 NEIKKHGSFDHLDNDKTSPSEPGSSIGAAIAASLTIPSGSTRSVTFSLAWDCPEVKFF-- 371
++ G D + + + PG AA+AA + + S+ + F LAWD P V+F
Sbjct: 297 QDLISDGELDS-EPGASFRTVPGELTAAAVAAKCHVAASSSCTAEFCLAWDMPRVQFMAK 355
Query: 372 EKVYHRRYTKFYGTLGDSAARIAHDAILEHAKWECEIEAWQRPILEDKRFPEWYPITLFN 431
EK+++RRYT+++GT GD+ ++ A+ ++ +WE IE WQ PIL+++ P WY LFN
Sbjct: 356 EKIHNRRYTRWFGTDGDAGPSLSAYALTQYREWEHRIERWQNPILQNESLPSWYKSALFN 415
Query: 432 ELYYLNAGGTIWTDGSPPMQSLATIRERKFSLDTPSKIHTPASSDTALGTRLLENGEENI 491
ELY+++ GGTIW + + L I++ +++ + +
Sbjct: 416 ELYFISDGGTIWIEHNKEHNGL---------------INSGDINNSVI---------KQY 451
Query: 492 GQFLYLEGAEYVMYNTYDVHFYSSFALVMLFPKLELSIQRDFAAAVMMHDPGTMKIMSDG 551
G+F YLEG EY M+NTYDVHFY+SFAL+ML+P+L+LS+Q D
Sbjct: 452 GKFAYLEGHEYRMFNTYDVHFYASFALIMLWPQLQLSLQYDID----------------- 494
Query: 552 KWVARKCLGAVPHDIGLDDPWFEINSYNLFNSSRWKDLNSKFVLQVYRDFVATGDKNFAR 611
D+PW +N+Y +++++ WKDLN KFVLQVYRD+ T D + +
Sbjct: 495 -----------------DEPWVRVNAYLMYDTADWKDLNLKFVLQVYRDYYVTKDFIYLQ 537
Query: 612 AVWPSVYIAMAYMEQFDKDGDGMIENEGFPDQTYDAWSANGVSAYCGGLWVAALQAASAL 671
+WP V + M +D DGDG+I+N G+ DQTYDAWS G SAYCGGLW+AA++A +
Sbjct: 538 DMWPKVKVVMQTSLAWDTDGDGIIDNSGYADQTYDAWSVTGASAYCGGLWLAAVRAMMEI 597
Query: 672 ANDVGDHASASYFWVRYQKAKAVYD-SLWNGSYFNYDNSDGSSSTSIQADQLAGQWYARA 730
A + + + + K+ Y+ LWNG Y+NYD+S S SI DQ AGQWY RA
Sbjct: 598 AKILDEEDQFKKYSDVLIQGKSSYEKKLWNGKYYNYDSSKQKHSNSIMCDQAAGQWYLRA 657
Query: 731 CGLLPIADEAKVKK----ALTKIYDFNVLKVKGGMCGAMNGMQPDGRIDMSGLQAREIWP 786
CGL+ DE K AL IY NV+ + G GA+NGM+PDG D + Q+ E+W
Sbjct: 658 CGLVQENDEVFPKDHILCALRAIYKMNVMGFEKGTMGAVNGMRPDGSPDTTSFQSEEVWT 717
Query: 787 GVTYGLAASMIQEEMVDMAFQTAAGVYEVAWSEDGLGYSFQTPESWNNNDEYRSLCYMRP 846
GVTY LAA+MIQE +++ AF TA+GVY+ + GLG FQTPE++ N+ YRSL YMRP
Sbjct: 718 GVTYALAANMIQEGLLEEAFTTASGVYKTCYERCGLG--FQTPEAYFANNCYRSLGYMRP 775
Query: 847 LTIWAMQWALTK 858
L IWAMQ+A+ +
Sbjct: 776 LAIWAMQYAVER 787
>gi|440901933|gb|ELR52793.1| Non-lysosomal glucosylceramidase, partial [Bos grunniens mutus]
Length = 927
Score = 583 bits (1503), Expect = e-163, Method: Compositional matrix adjust.
Identities = 338/817 (41%), Positives = 458/817 (56%), Gaps = 55/817 (6%)
Query: 49 FRHLAHIGFR-LYRYSKQEEAKGKVPVFNVFRKHHITSDQGIPLGGIGAGSIGRSYRGEF 107
+HL +G R L + ++ + + K P ++ + G PLGGIG G+I R +RG+F
Sbjct: 111 IKHLG-MGLRYLQWWYRKTQVEKKKPFIDLINSVPLRQIYGCPLGGIGGGTITRGWRGQF 169
Query: 108 QRFKLFHGICDDAPVLANQFSVFVSRPNGEKFSSVLCPRSPGVPKKNTDSGIESWDWNLK 167
R++L G+ V+ANQF+V + R + VL P V + SW+W L
Sbjct: 170 CRWQLNPGMYQHQTVIANQFTVCLRRRGQTVYQQVLSVERPSV--------LRSWNWGLC 221
Query: 168 GENCTYHALFPRAWTVYDGEPDPELRIVCRQISPFIPHNYKESSFPASVFTFTLSNSGQT 227
G YHAL+PRAWTVY P + + CRQI+P +PH+Y++SS P VF + + N G
Sbjct: 222 GYFAFYHALYPRAWTVYQ-LPGQNVTLTCRQITPILPHDYQDSSLPVGVFVWDVENEGDE 280
Query: 228 SADVTLLFTWANSVAGDSGLSGHHFNSK-TMTKDG--VHGLTLHHRTANGRPPVTFAVAA 284
+ +V+++F+ N + G+ G +N + DG V GL LHH T P T AVAA
Sbjct: 281 ALEVSIMFSMRNGLGGEDDAPGGLWNEPFCLEHDGETVQGLLLHHPTPPN--PYTMAVAA 338
Query: 285 EETADVHVSECPCFLLSGNSKGITAKDMWNEIKKHGSFDHLDNDKTSPSEPGSSIGAAIA 344
+AD V+ F +S G + +W ++ G D L K+ PS+ G I A+
Sbjct: 339 RLSADTTVTHLTAF--DPDSAG---QQVWQDLLLDGQLDSLAG-KSLPSQKGVGIAGAVC 392
Query: 345 ASLTIPSGSTRSVTFSLAWDCPEVKFFEK--VYHRRYTKFYGTLGDSAARIAHDAILEHA 402
S +P + FSLAWD P + F K VY+RRYT+F+G GD+A ++H A+ +A
Sbjct: 393 VSGKLPPRGRHRLEFSLAWDMPRIMFGAKGQVYYRRYTRFFGPDGDAAPALSHYALRHYA 452
Query: 403 KWECEIEAWQRPILEDKRFPEWYPITLFNELYYLNAGGTIWTDGSPPMQSLATIRERKFS 462
WE +I AWQ P+LED+ P WY LFNELY+L GGT+W
Sbjct: 453 DWEEKISAWQSPVLEDRSLPAWYKSALFNELYFLADGGTVW------------------- 493
Query: 463 LDTPSKIHTPASSDTALGTRLLENGEENIGQFLYLEGAEYVMYNTYDVHFYSSFALVMLF 522
L+ P P ++G L + G+F YLEG EY MYNTYDVHFY+SFAL+ML+
Sbjct: 494 LEVPED-SLPEELVGSMGQ--LRPLLQEYGRFAYLEGQEYRMYNTYDVHFYASFALIMLW 550
Query: 523 PKLELSIQRDFAAAVMMHDPGTMKIMSDGKWVARKCLGAVPHDIG--LDDPWFEINSYNL 580
PKLELS+Q D A A + D + + G K +PHD+G D+PW +N+Y +
Sbjct: 551 PKLELSLQYDMALATLREDLTQRRFLMSGVTAPVKRRNVIPHDVGDPDDEPWLRVNAYEI 610
Query: 581 FNSSRWKDLNSKFVLQVYRDFVATGDKNFARAVWPSVYIAMAYMEQFDKDGDGMIENEGF 640
++ WKDLN KFVLQVYRD+ TGD+ F R +WP M +FDKD DG+IEN G+
Sbjct: 611 HDTGDWKDLNLKFVLQVYRDYYLTGDQCFLRDMWPVCLAVMESEMKFDKDHDGLIENGGY 670
Query: 641 PDQTYDAWSANGVSAYCGGLWVAALQAASALANDVGDHASASYFWVRYQKAKAVYDS-LW 699
DQTYD W G SAYCGGLW+AA+ +A G + + F + + YD LW
Sbjct: 671 ADQTYDGWVTTGPSAYCGGLWLAAVAVMVQMAALCGAQEAQNKFSSILSRGQEAYDRLLW 730
Query: 700 NGSYFNYDNSDGSSSTSIQADQLAGQWYARACGLLPIADEA----KVKKALTKIYDFNVL 755
NG Y+NYD S S SI +DQ AGQW+ RA GL E +V +AL I++FNV
Sbjct: 731 NGRYYNYDCSSQPQSCSIMSDQCAGQWFLRASGLGEGDSEVFPTQQVVRALQTIFEFNVQ 790
Query: 756 KVKGGMCGAMNGMQPDGRIDMSGLQAREIWPGVTYGLAASMIQEEMVDMAFQTAAGVYEV 815
GG GA+NGMQP G D S +Q+ E+W GV YGLAA+MIQE + F+TA G Y
Sbjct: 791 AFAGGAMGAVNGMQPRGVPDTSSVQSDEVWVGVVYGLAATMIQEGLTWEGFRTAEGCYRT 850
Query: 816 AWSEDGLGYSFQTPESWNNNDEYRSLCYMRPLTIWAM 852
W LG +FQTPE++ +RSL YMRPL+IWAM
Sbjct: 851 VWER--LGLAFQTPEAYCQRRVFRSLAYMRPLSIWAM 885
>gi|194669510|ref|XP_001790122.1| PREDICTED: non-lysosomal glucosylceramidase [Bos taurus]
gi|297477943|ref|XP_002689726.1| PREDICTED: non-lysosomal glucosylceramidase [Bos taurus]
gi|296484715|tpg|DAA26830.1| TPA: hypothetical protein BOS_8918 [Bos taurus]
Length = 918
Score = 583 bits (1503), Expect = e-163, Method: Compositional matrix adjust.
Identities = 338/817 (41%), Positives = 458/817 (56%), Gaps = 55/817 (6%)
Query: 49 FRHLAHIGFR-LYRYSKQEEAKGKVPVFNVFRKHHITSDQGIPLGGIGAGSIGRSYRGEF 107
+HL +G R L + ++ + + K P ++ + G PLGGIG G+I R +RG+F
Sbjct: 102 IKHLG-MGLRYLQWWYRKTQVEKKKPFIDLINSVPLRQIYGCPLGGIGGGTITRGWRGQF 160
Query: 108 QRFKLFHGICDDAPVLANQFSVFVSRPNGEKFSSVLCPRSPGVPKKNTDSGIESWDWNLK 167
R++L G+ V+ANQF+V + R + VL P V + SW+W L
Sbjct: 161 CRWQLNPGMYQHQTVIANQFTVCLRRRGQTVYQQVLSVERPSV--------LCSWNWGLC 212
Query: 168 GENCTYHALFPRAWTVYDGEPDPELRIVCRQISPFIPHNYKESSFPASVFTFTLSNSGQT 227
G YHAL+PRAWTVY P + + CRQI+P +PH+Y++SS P VF + + N G
Sbjct: 213 GYFAFYHALYPRAWTVYQ-LPGQNVTLTCRQITPILPHDYQDSSLPVGVFVWDVENEGDE 271
Query: 228 SADVTLLFTWANSVAGDSGLSGHHFNSK-TMTKDG--VHGLTLHHRTANGRPPVTFAVAA 284
+ +V+++F+ N + G+ G +N + DG V GL LHH T P T AVAA
Sbjct: 272 ALEVSIMFSMRNGLGGEDDAPGGLWNEPFCLEHDGETVQGLLLHHPTPPN--PYTMAVAA 329
Query: 285 EETADVHVSECPCFLLSGNSKGITAKDMWNEIKKHGSFDHLDNDKTSPSEPGSSIGAAIA 344
+AD V+ F +S G + +W ++ G D L K+ PS+ G I A+
Sbjct: 330 RLSADTTVTHLTAF--DPDSAG---QQVWQDLLLDGQLDSLAG-KSLPSQKGVGIAGAVC 383
Query: 345 ASLTIPSGSTRSVTFSLAWDCPEVKFFEK--VYHRRYTKFYGTLGDSAARIAHDAILEHA 402
S +P + FSLAWD P + F K VY+RRYT+F+G GD+A ++H A+ +A
Sbjct: 384 VSGKLPPRGRHRLEFSLAWDMPRIMFGAKGQVYYRRYTRFFGPDGDAAPALSHYALRHYA 443
Query: 403 KWECEIEAWQRPILEDKRFPEWYPITLFNELYYLNAGGTIWTDGSPPMQSLATIRERKFS 462
WE +I AWQ P+LED+ P WY LFNELY+L GGT+W
Sbjct: 444 DWEEKISAWQSPVLEDRSLPAWYKSALFNELYFLADGGTVW------------------- 484
Query: 463 LDTPSKIHTPASSDTALGTRLLENGEENIGQFLYLEGAEYVMYNTYDVHFYSSFALVMLF 522
L+ P P ++G L + G+F YLEG EY MYNTYDVHFY+SFAL+ML+
Sbjct: 485 LEVPED-SLPEELVGSMGQ--LRPLLQEYGRFAYLEGQEYRMYNTYDVHFYASFALIMLW 541
Query: 523 PKLELSIQRDFAAAVMMHDPGTMKIMSDGKWVARKCLGAVPHDIG--LDDPWFEINSYNL 580
PKLELS+Q D A A + D + + G K +PHD+G D+PW +N+Y +
Sbjct: 542 PKLELSLQYDMALATLREDLTQRRFLMSGVTAPVKRRNVIPHDVGDPDDEPWLRVNAYEI 601
Query: 581 FNSSRWKDLNSKFVLQVYRDFVATGDKNFARAVWPSVYIAMAYMEQFDKDGDGMIENEGF 640
++ WKDLN KFVLQVYRD+ TGD+ F R +WP M +FDKD DG+IEN G+
Sbjct: 602 HDTGDWKDLNLKFVLQVYRDYYLTGDQCFLRDMWPVCLAVMESEMKFDKDHDGLIENGGY 661
Query: 641 PDQTYDAWSANGVSAYCGGLWVAALQAASALANDVGDHASASYFWVRYQKAKAVYDS-LW 699
DQTYD W G SAYCGGLW+AA+ +A G + + F + + YD LW
Sbjct: 662 ADQTYDGWVTTGPSAYCGGLWLAAVAVMVQMAALCGAQEAQNKFSSILSRGQEAYDRLLW 721
Query: 700 NGSYFNYDNSDGSSSTSIQADQLAGQWYARACGLLPIADEA----KVKKALTKIYDFNVL 755
NG Y+NYD S S SI +DQ AGQW+ RA GL E +V +AL I++FNV
Sbjct: 722 NGRYYNYDCSSQPQSCSIMSDQCAGQWFLRASGLGEGDSEVFPTQQVVRALQTIFEFNVQ 781
Query: 756 KVKGGMCGAMNGMQPDGRIDMSGLQAREIWPGVTYGLAASMIQEEMVDMAFQTAAGVYEV 815
GG GA+NGMQP G D S +Q+ E+W GV YGLAA+MIQE + F+TA G Y
Sbjct: 782 AFAGGAMGAVNGMQPCGVPDTSSVQSDEVWVGVVYGLAATMIQEGLTREGFRTAEGCYRT 841
Query: 816 AWSEDGLGYSFQTPESWNNNDEYRSLCYMRPLTIWAM 852
W LG +FQTPE++ +RSL YMRPL+IWAM
Sbjct: 842 VWER--LGLAFQTPEAYCQRRVFRSLAYMRPLSIWAM 876
>gi|351707038|gb|EHB09957.1| Non-lysosomal glucosylceramidase [Heterocephalus glaber]
Length = 922
Score = 582 bits (1501), Expect = e-163, Method: Compositional matrix adjust.
Identities = 339/830 (40%), Positives = 464/830 (55%), Gaps = 64/830 (7%)
Query: 50 RHLAHIGFRLYRY-SKQEEAKGKVPVFNVFRKHHITSDQGIPLGGIGAGSIGRSYRGEFQ 108
+HL +G R ++ ++ + + K P ++ + G PLGGIG G+I R +RG+F
Sbjct: 105 KHLG-MGIRYLKWWYRKTQVEKKKPFIDMINCVPLRQIYGCPLGGIGGGTITRGWRGQFC 163
Query: 109 RFKLFHGICDDAPVLANQFSVFVSRPNGEKFSSVLCPRSPGVPKKNTDSGIESWDWNLKG 168
R++L GI V+A+QF+V + R + VL P V + SW+W L G
Sbjct: 164 RWQLNPGIYQHQTVIADQFTVCLRREGRTVYQQVLSMERPSV--------LHSWNWGLCG 215
Query: 169 ENCTYHALFPRAWTVYDGEPDPELRIVCRQISPFIPHNYKESSFPASVFTFTLSNSGQTS 228
YHAL+PRAWTVY P + + CRQI+P +PH+Y++SS P VF + + N G +
Sbjct: 216 YFAFYHALYPRAWTVYQ-LPGQNITLTCRQITPILPHDYEDSSLPVGVFVWDVENEGDEA 274
Query: 229 ADVTLLFTWANSVAGDSGLSGHHFNSK-TMTKDG--VHGLTLHHRTANGRPPVTFAVAAE 285
DV+++F+ N + G+ G +N + +DG V GL LHH T P T AVAA
Sbjct: 275 VDVSIMFSMRNGLGGEDDAQGGLWNEPFCLEQDGETVQGLLLHHPTLPN--PYTMAVAAR 332
Query: 286 ETADVHVSECPCFLLSGNSKGITAKDMWNEIKKHGSFDHL--DNDKTSPSEPGSSIGAAI 343
T D V+ F T +W ++ + G D D +++PS+ G+ I A+
Sbjct: 333 LTEDSTVTHITAF-----DPDSTGHQVWQDLLQDGQLDSPAGDGGRSNPSQKGAGIAGAL 387
Query: 344 AASLTIPSGSTRSVTFSLAWDCPEVKFFEK--VYHRRYTKFYGTLGDSAARIAHDAILEH 401
S +P S+ FSLAWD P + F K V++RRYT+F+G GD+A ++H A+ +
Sbjct: 388 CVSGKLPPRGRCSLEFSLAWDMPRIMFGAKGQVHYRRYTRFFGPDGDAAPALSHYALCHY 447
Query: 402 AKWECEIEAWQRPILEDKRFPEWYPITLFNELYYLNAGGTIWTDGSPPMQSLATIRERKF 461
A WE I AWQ P+L+D+ P WY LFNELY+L GGT+W
Sbjct: 448 ADWEDRISAWQSPVLDDRSLPAWYKSALFNELYFLADGGTVW------------------ 489
Query: 462 SLDTPSKIHTPASSDTALGTRLLENGEENIGQFLYLEGAEYVMYNTYDVHFYSSFALVML 521
L+ P P ++G L G+F YLEG EY MYNTYDVHFY+SFALVML
Sbjct: 490 -LEVPED-SLPEELVGSMGQ--LRPILREYGRFGYLEGQEYRMYNTYDVHFYASFALVML 545
Query: 522 FPKLELSIQRDFAAAVMMHDPGTMKIMSDGKWVARKCLGAVPHDIG--LDDPWFEINSYN 579
+PK+ELS+Q D A A + D + + G K +PHDIG D+PW +N+Y
Sbjct: 546 WPKIELSLQYDMALATLREDLTRRRYLMSGVMGPVKRRNVIPHDIGDPDDEPWLRVNAYL 605
Query: 580 LFNSSRWKDLNSKFVLQVYRDFVATGDKNFARAVWPSVYIAMAYMEQFDKDGDGMIENEG 639
+ +++ WKDLN KFVLQVYRD+ TGD+ F + +WP M +FDKD DG+IEN G
Sbjct: 606 IHDTADWKDLNLKFVLQVYRDYYLTGDQGFLKDMWPVCLAVMESEMKFDKDHDGLIENGG 665
Query: 640 FPDQTYDAWSANGVSAYCGGLWVAAL-----QAASALANDVGDHASASYFWVRYQKAKAV 694
+ DQTYD W G SAYCGGLW+AA+ A A DV D S+ + + +A
Sbjct: 666 YADQTYDGWVTTGPSAYCGGLWLAAVAVTVQMAVLCAAQDVQDKFSS----ILSRGQEAY 721
Query: 695 YDSLWNGSYFNYDNSDGSSSTSIQADQLAGQWYARACGL----LPIADEAKVKKALTKIY 750
LWNG Y+NYD+S S S+ +DQ AGQW+ +ACGL + V +AL I+
Sbjct: 722 ERLLWNGRYYNYDSSSQPQSRSVMSDQCAGQWFLKACGLGEGDTEVFPTTHVVRALQTIF 781
Query: 751 DFNVLKVKGGMCGAMNGMQPDGRIDMSGLQAREIWPGVTYGLAASMIQEEMVDMAFQTAA 810
+ NV GG GA+NGMQP G D S +Q+ E+W GV YGLAA+MIQE + FQTA
Sbjct: 782 ELNVQSFAGGAMGAVNGMQPHGIPDRSSVQSDEVWVGVVYGLAATMIQEGLTWEGFQTAE 841
Query: 811 GVYEVAWSEDGLGYSFQTPESWNNNDEYRSLCYMRPLTIWAMQWALTKPK 860
G Y W LG +FQTPE++ +RSL YMRPL+IWAMQ AL + K
Sbjct: 842 GCYRTVWER--LGLAFQTPEAYCQQQVFRSLAYMRPLSIWAMQLALQQQK 889
>gi|260813884|ref|XP_002601646.1| hypothetical protein BRAFLDRAFT_124315 [Branchiostoma floridae]
gi|229286945|gb|EEN57658.1| hypothetical protein BRAFLDRAFT_124315 [Branchiostoma floridae]
Length = 871
Score = 581 bits (1498), Expect = e-163, Method: Compositional matrix adjust.
Identities = 322/841 (38%), Positives = 461/841 (54%), Gaps = 100/841 (11%)
Query: 46 FQEFRHLAHIGFRLYRYSKQEEAKGKVPVFNVFRKHHITSDQGIPLGGIGAGSIGRSYRG 105
F++ + L + +R + + +++ K P + + G+PLGG+G GSI R ++G
Sbjct: 104 FKQLKDLYGVFYRYFFWWMKKKTGHKRPFIDALNPVQLQQIYGVPLGGLGCGSICRGWKG 163
Query: 106 EFQRFKLFHGICDDAPVLANQFSVFVSRPNGEKFSSVLCPRSPGVPKKNTDSGIESWDWN 165
+F R+ L GI +ANQF+V V R + VL R P S + W+W+
Sbjct: 164 DFCRWALLPGIYSYDIKMANQFTVCVRRRGQTTYQQVLSTRHP--------SCLTEWNWS 215
Query: 166 LKGENCTYHALFPRAWTVYDGEPDPELRIVCRQISPFIPHNYKESSFPASVFTFTLSNSG 225
+ YHAL+PRAWTVY+ P + + CRQ+SP +PH+YK+SS PA+VF +T N G
Sbjct: 216 YSPSHAYYHALYPRAWTVYE-LPGQNITLTCRQVSPVLPHDYKDSSIPAAVFVWTAENHG 274
Query: 226 QTSADVTLLFTWANSVAGDSGLSGHHFNSKTMTK----DGVHGLTLHHRTANGRPPVTFA 281
+ D++L+FT+ NS+ D+ LSG H+N D V G+ LHH+ + P T A
Sbjct: 275 KEEVDISLMFTFQNSIGADTDLSGGHWNEPFSVNEEGVDPVSGILLHHQ--HKTMPCTIA 332
Query: 282 VAAEETADVHVSECPCFLLSGNSKGITAKDMWNEIKKHGSFDHLDNDKTSPSEPGSSIGA 341
+ + V+ F +G + K++W ++ G + T+ + G ++
Sbjct: 333 LGIPHKEGIRVTGSTSFNPAG-----SGKEIWRDLMYDGELGSPEA-PTNKTVKGEALAG 386
Query: 342 AIAASLTIPSGSTRSVTFSLAWDCPEVKFFE--KVYHRRYTKFYGTLGDSAARIAHDAIL 399
A+ AS +PSG + F L+WD P +KF + Y R YT ++G G +A ++A A+
Sbjct: 387 AVCASFKLPSGGKQDTDFVLSWDMPVIKFVGSCRTYKRWYTGWFGDDGTAAPKLAAYALS 446
Query: 400 EHAKWECEIEAWQRPILEDKRFPEWYPITLFNELYYLNAGGTIW-----TDGSPPMQSLA 454
+ +WE IE WQRPIL+++ +PEWY LFNELY+++ GGTIW DG
Sbjct: 447 HYPEWESTIEEWQRPILDNQSYPEWYRSALFNELYFMSDGGTIWLVAEDNDGHVENNLPP 506
Query: 455 TIRERKFSLDTPSKIHTPASSDTALGTRLLENGEENIGQFLYLEGAEYVMYNTYDVHFYS 514
+RE+ TP+ I G+F YLEG EY MYNTYDVH Y+
Sbjct: 507 WLREK-----TPAHIM-------------------EYGRFGYLEGHEYRMYNTYDVHHYA 542
Query: 515 SFALVMLFPKLELSIQRDFAAAVMMHDPGTMKIMSDGKWVARKCLGAVPHDIGLDDPWFE 574
SFAL ML+P+LE SIQ D+ D+P+
Sbjct: 543 SFALAMLWPQLEYSIQYDYD----------------------------------DEPFIH 568
Query: 575 INSYNLFNSSRWKDLNSKFVLQVYRDFVATGDKNFARAVWPSVYIAMAYMEQFDKDGDGM 634
+N+Y + ++S W+DLN KFVLQ++RD+ DK F +WP M +D D DGM
Sbjct: 569 VNAYVMHDTSDWRDLNLKFVLQIFRDYFVGRDKGFLAFMWPRAQAVMTKALTWDTDQDGM 628
Query: 635 IENEGFPDQTYDAWSANGVSAYCGGLWVAALQAASALANDVGDHASASYFWVRYQKAKAV 694
I+N G DQTYDAW G SAYCGGLW+AAL+ +AN +G ++ Q+AK
Sbjct: 629 IDNSGAADQTYDAWIVTGASAYCGGLWLAALRCMVEIANILGKEEEEQHYLAVMQRAKTA 688
Query: 695 Y-DSLWNGSYFNYDNSDGSSSTSIQADQLAGQWYARACGLLPIADEA-----------KV 742
Y + LWNG Y+NYD S S S SI A +G WY RACGL+P +E V
Sbjct: 689 YEEKLWNGEYYNYDCSGRSYSNSIMAGAGSGHWYLRACGLVPEDNEVMDSDREVLPLDHV 748
Query: 743 KKALTKIYDFNVLKVKGGMCGAMNGMQPDGRIDMSGLQAREIWPGVTYGLAASMIQEEMV 802
+L K+YDFNV++ G GA+NGM+P+G+ D++ +Q+ E+W G+TY LAASMIQE MV
Sbjct: 749 ASSLRKVYDFNVMQFHQGTMGAVNGMRPNGKKDLTSMQSEEVWTGITYALAASMIQEGMV 808
Query: 803 DMAFQTAAGVYEVAWSEDGLGYSFQTPESWNNNDEYRSLCYMRPLTIWAMQWALTKPKLS 862
+ FQTA GVY + + + G++FQTPE++ D +RSL YMRPL IWAMQWAL +
Sbjct: 809 EEGFQTAYGVYNMCYLQ--CGFAFQTPEAYLERDFFRSLGYMRPLAIWAMQWALESRHRA 866
Query: 863 R 863
R
Sbjct: 867 R 867
>gi|348520215|ref|XP_003447624.1| PREDICTED: non-lysosomal glucosylceramidase-like [Oreochromis
niloticus]
Length = 856
Score = 579 bits (1492), Expect = e-162, Method: Compositional matrix adjust.
Identities = 331/819 (40%), Positives = 458/819 (55%), Gaps = 52/819 (6%)
Query: 53 AHIGF---RLYRYSKQEEAKGKVPVFNVFRKHHITSDQGIPLGGIGAGSIGRSYRGEFQR 109
HIG L + K+ + + K P ++F + G PLGGIG G+I R +RGEF R
Sbjct: 58 GHIGLGVRYLKFWYKKTQVEKKAPFIDMFGAQPLRQIYGAPLGGIGGGTITRGWRGEFCR 117
Query: 110 FKLFHGICDDAPVLANQFSVFVSRPNGEKFSSVLCPRSPGVPKKNTDSGIESWDWNLKGE 169
++L G+ V+ NQF+V + R + VL P ++ W+W GE
Sbjct: 118 WQLNPGMYHYKTVIPNQFTVCLRREGKTVYQQVLSVERPPT--------LQGWNWGYCGE 169
Query: 170 NCTYHALFPRAWTVYDGEPDPELRIVCRQISPFIPHNYKESSFPASVFTFTLSNSGQTSA 229
YHAL+PRAWTVYD P + + CRQISP IPH+Y++SS P +V + + N +
Sbjct: 170 YAFYHALYPRAWTVYD-LPGQNVTLTCRQISPVIPHDYQDSSLPVAVLVWDIENKNEYDL 228
Query: 230 DVTLLFTWANSVAGDSGLSGHHFNSK-TMTKDG--VHGLTLHHRTANGRPPVTFAVAAEE 286
DV+++FT N G H+N + K+G V G+ LHH TA P T +AA E
Sbjct: 229 DVSIMFTMVNGSGQKDDKCGGHWNEPFHLEKEGEAVSGVLLHHCTAVN--PYTLCIAARE 286
Query: 287 TADVHVSECPCFLLSGNSKGITAKDMWNEIKKHGSFDHLDNDKTSPSEPGSSIGAAIAAS 346
+D +S F G +WN++ G D + P+ G + AA+A
Sbjct: 287 QSDREISHQTAFSPKG-----PCFSLWNDLIDDGRLDS-PTGSSPPTVKGEKVAAAVAVG 340
Query: 347 LTIPSGSTRSVTFSLAWDCPEVKFF--EKVYHRRYTKFYGTLGDSAARIAHDAILEHAKW 404
++P+ S+ F LAWD P++ F EK + RRYT+++GT GD++ ++H A+ + +W
Sbjct: 341 CSVPAQGHNSLEFCLAWDMPKMTFGSQEKEHTRRYTRYFGTKGDASPSLSHYALTHYKQW 400
Query: 405 ECEIEAWQRPILEDKRFPEWYPITLFNELYYLNAGGTIWTDGSPPMQSLATIRERKFSLD 464
E IE WQRPIL+D P WY LFNELY++ GGT+WT+ M A I S D
Sbjct: 401 ERSIEEWQRPILQDSSLPSWYKSALFNELYFVADGGTVWTE----MTEDADISGGICSED 456
Query: 465 TPSKIHTPASSDTALGTRLLENGEENIGQFLYLEGAEYVMYNTYDVHFYSSFALVMLFPK 524
PA + G+F YLEG EY MYNTYDVHFYSSFAL+ML+PK
Sbjct: 457 G----GLPAQPAVI----------KEYGRFAYLEGQEYRMYNTYDVHFYSSFALIMLWPK 502
Query: 525 LELSIQRDFAAAVMMHDPGTMKIMSDGKWVARKCLGAVPHDIGL--DDPWFEINSYNLFN 582
L LS+Q D A +V+ D + G++ K VPHDIG D+PW +N+Y + +
Sbjct: 503 LALSVQYDIAGSVVQQDLTERLYLMSGQYSPVKTKNVVPHDIGDPDDEPWMRVNAYLIHD 562
Query: 583 SSRWKDLNSKFVLQVYRDFVATGDKNFARAVWPSVYIAMAYMEQFDKDGDGMIENEGFPD 642
++ WKDLN KFVLQVYRDF T D+ + + +WP M +FD DGDG+IEN G+ D
Sbjct: 563 TAGWKDLNLKFVLQVYRDFHLTQDRQYLQDMWPICQAVMESELKFDLDGDGLIENSGYAD 622
Query: 643 QTYDAWSANGVSAYCGGLWVAALQAASALANDVGDHASASYFWVRYQKAKAVYDS-LWNG 701
QTYD W G SAYCGG+WVA+L +A + S ++ + A ++ LWNG
Sbjct: 623 QTYDGWIVTGPSAYCGGMWVASLCVMCKMARLLDSEESYRHYKDLLDRGSAAFEKLLWNG 682
Query: 702 SYFNYDNSDGSSSTSIQADQLAGQWYARACGL----LPIADEAKVKKALTKIYDFNVLKV 757
Y+NYD+S S S+ +DQ AG W+ RA GL + K++ AL ++D NV+
Sbjct: 683 KYYNYDSSGRELSNSVMSDQCAGHWFLRASGLGDGEFQAFPKEKIQSALKSVFDLNVMSF 742
Query: 758 KGGMCGAMNGMQPDGRIDMSGLQAREIWPGVTYGLAASMIQEEMVDMAFQTAAGVYEVAW 817
GG GA+NGM+P+G D S +Q+ E+W GV YGLAA+MI E M + +TA G Y W
Sbjct: 743 AGGQMGAVNGMRPEGVPDHSSVQSDEVWIGVVYGLAATMIHEGMQEEGMRTAEGCYRTVW 802
Query: 818 SEDGLGYSFQTPESWNNNDEYRSLCYMRPLTIWAMQWAL 856
+ LG +FQTPE++ + YRSL YMRPL+IWAMQ AL
Sbjct: 803 EK--LGMAFQTPEAYCEKNIYRSLAYMRPLSIWAMQLAL 839
>gi|73971791|ref|XP_531991.2| PREDICTED: non-lysosomal glucosylceramidase [Canis lupus familiaris]
Length = 1136
Score = 579 bits (1492), Expect = e-162, Method: Compositional matrix adjust.
Identities = 337/817 (41%), Positives = 458/817 (56%), Gaps = 55/817 (6%)
Query: 49 FRHLAHIGFR-LYRYSKQEEAKGKVPVFNVFRKHHITSDQGIPLGGIGAGSIGRSYRGEF 107
+HL +G R L + ++ + + K P ++ + G PLGGIG G+I R +RG+F
Sbjct: 323 IKHLG-MGLRYLQWWYRKTQVEKKTPFIDLINCVPLRQIYGCPLGGIGGGTITRGWRGQF 381
Query: 108 QRFKLFHGICDDAPVLANQFSVFVSRPNGEKFSSVLCPRSPGVPKKNTDSGIESWDWNLK 167
R++L G+ V+A+QF+V + R + VL P V + SW+W L
Sbjct: 382 CRWQLNPGMYQHRTVIADQFTVCLRREGKTVYQQVLSVERPSV--------LRSWNWGLC 433
Query: 168 GENCTYHALFPRAWTVYDGEPDPELRIVCRQISPFIPHNYKESSFPASVFTFTLSNSGQT 227
G YHAL+PRAWTVY P + + CRQI+P +PH+Y++SS P VF + + N G
Sbjct: 434 GYFAFYHALYPRAWTVYQ-LPGQNVTLTCRQITPILPHDYQDSSLPVGVFVWDVENEGDE 492
Query: 228 SADVTLLFTWANSVAG-DSGLSGHHFNSKTMTKDG--VHGLTLHHRTANGRPPVTFAVAA 284
+ DV+++F+ N + G D + G + +DG V GL LHH P T AVAA
Sbjct: 493 ALDVSIMFSMRNGLGGGDDAIGGLWNEPFCLERDGETVQGLLLHHPALPN--PYTMAVAA 550
Query: 285 EETADVHVSECPCFLLSGNSKGITAKDMWNEIKKHGSFDHLDNDKTSPSEPGSSIGAAIA 344
TAD V+ F T + +W ++ + G D +++PS+ G+ I A+
Sbjct: 551 RLTADTTVTHITAF-----DPDSTGQQVWQDLLQDGQLDS-PAGRSTPSQKGAGIAGAVC 604
Query: 345 ASLTIPSGSTRSVTFSLAWDCPEVKFFEK--VYHRRYTKFYGTLGDSAARIAHDAILEHA 402
+ +P + FSLAWD P + F K VY+RRYT+F+G G++A ++H A+ ++
Sbjct: 605 VTGKLPPRGQCRLEFSLAWDMPRIMFGAKGQVYYRRYTRFFGRDGNAAPALSHYALCQYV 664
Query: 403 KWECEIEAWQRPILEDKRFPEWYPITLFNELYYLNAGGTIWTDGSPPMQSLATIRERKFS 462
WE +I AWQ P+L+D+ P WY LFNELY+L GGT+W + P SL
Sbjct: 665 DWEEKISAWQSPVLDDRSLPAWYKSALFNELYFLADGGTVWLE--VPEDSL--------- 713
Query: 463 LDTPSKIHTPASSDTALGTRLLENGEENIGQFLYLEGAEYVMYNTYDVHFYSSFALVMLF 522
P K+ S L L E G+F YLEG EY MYNTYDVHFY+SFALVML+
Sbjct: 714 ---PEKL---VESMCQLRPILRE-----YGRFGYLEGQEYRMYNTYDVHFYASFALVMLW 762
Query: 523 PKLELSIQRDFAAAVMMHDPGTMKIMSDGKWVARKCLGAVPHDIGL--DDPWFEINSYNL 580
PKLELS+Q D A A + D + + G K +PHDIG D+PW +N+Y +
Sbjct: 763 PKLELSLQYDMAVATLREDLTRRQYLMSGVMAPVKRRNVIPHDIGDPDDEPWLRVNAYVV 822
Query: 581 FNSSRWKDLNSKFVLQVYRDFVATGDKNFARAVWPSVYIAMAYMEQFDKDGDGMIENEGF 640
+++ WKDLN KFVLQVYRD+ TGD+ F R +WP M +FDKD DG+IEN G+
Sbjct: 823 HDTADWKDLNLKFVLQVYRDYYLTGDQGFLRDMWPVCLAVMESEMKFDKDQDGLIENGGY 882
Query: 641 PDQTYDAWSANGVSAYCGGLWVAALQAASALANDVGDHASASYFWVRYQKAKAVYDS-LW 699
DQTYD W G SAYCGGLW+AA+ +A G H F ++ + Y+ LW
Sbjct: 883 ADQTYDGWVTTGPSAYCGGLWLAAVAVMVQMAILCGAHDVQDKFSSILRRGQEAYERLLW 942
Query: 700 NGSYFNYDNSDGSSSTSIQADQLAGQWYARACGL----LPIADEAKVKKALTKIYDFNVL 755
NG Y+NYD S S SI +DQ AGQW+ RA GL + V AL I++FNV
Sbjct: 943 NGRYYNYDCSPQPQSCSIMSDQCAGQWFLRASGLGEGDTEVFPTRHVVCALQTIFEFNVR 1002
Query: 756 KVKGGMCGAMNGMQPDGRIDMSGLQAREIWPGVTYGLAASMIQEEMVDMAFQTAAGVYEV 815
GG GA+NGMQP G D S +Q+ E+W GV YGLAA+MIQE + FQTA G Y
Sbjct: 1003 AFAGGAMGAVNGMQPHGVPDRSSVQSDEVWVGVVYGLAATMIQEGLTWEGFQTAEGCYRT 1062
Query: 816 AWSEDGLGYSFQTPESWNNNDEYRSLCYMRPLTIWAM 852
W LG +FQTPE++ +RSL YMRPL+IWAM
Sbjct: 1063 VWER--LGLAFQTPEAYCQQRVFRSLAYMRPLSIWAM 1097
>gi|385648282|ref|NP_001245314.1| glucosidase, beta (bile acid) 2 [Sus scrofa]
Length = 915
Score = 576 bits (1485), Expect = e-161, Method: Compositional matrix adjust.
Identities = 333/817 (40%), Positives = 448/817 (54%), Gaps = 62/817 (7%)
Query: 52 LAHIGFRL------YRYSKQEEAKGKVPVFNVFRKHHITSDQGIPLGGIGAGSIGRSYRG 105
+ H G L YR ++ E+ K P ++ + G PLGGIG G+I R +RG
Sbjct: 104 IKHFGMGLRYLQWWYRKTRVEK---KTPFIDLINCVPLRQIYGCPLGGIGGGTITRGWRG 160
Query: 106 EFQRFKLFHGICDDAPVLANQFSVFVSRPNGEKFSSVLCPRSPGVPKKNTDSGIESWDWN 165
+F R++L G+ V+A+QF+V + R + VL P V + SW+W
Sbjct: 161 QFCRWQLNPGMYRHQTVIADQFTVCLRRKGQTVYQQVLSVEHPSV--------LRSWNWG 212
Query: 166 LKGENCTYHALFPRAWTVYDGEPDPELRIVCRQISPFIPHNYKESSFPASVFTFTLSNSG 225
L G YHAL+PRAWTVY P + + CRQI+P +PH+Y++SS P VF + N G
Sbjct: 213 LCGSFAFYHALYPRAWTVYQ-LPGQSVTLTCRQITPILPHDYQDSSLPVGVFVWDAENEG 271
Query: 226 QTSADVTLLFTWANSVAGDSGLSGHHFNSK-TMTKDG--VHGLTLHHRTANGRPPVTFAV 282
+ DV+++F+ N + GD G +N + +DG + GL LHH T P T A+
Sbjct: 272 DETLDVSIMFSMRNGLGGDEDAPGGLWNEPFCLERDGETIQGLLLHHPTLPN--PYTMAM 329
Query: 283 AAEETADVHVSECPCFLLSGNSKGITAKDMWNEIKKHGSFDHLDNDKTSPSEPGSSIGAA 342
AA TAD V+ F G T + +W ++ G D ++ PS+ G I A
Sbjct: 330 AARLTADTTVTHITAFDPDG-----TGQQVWQDLLXDGQLDS-PAGQSPPSQKGVGIAGA 383
Query: 343 IAASLTIPSGSTRSVTFSLAWDCPEVKFFEK--VYHRRYTKFYGTLGDSAARIAHDAILE 400
+ + +P G R + FSLAWD P + F K VYHRRYT+F+G GD+A ++H A+
Sbjct: 384 VCVTTRLPPGGRRRLEFSLAWDMPRIMFGAKGQVYHRRYTRFFGPDGDAAPALSHHALSR 443
Query: 401 HAKWECEIEAWQRPILEDKRFPEWYPITLFNELYYLNAGGTIWTDGSPPMQSLATIRERK 460
+A WE I AWQ P+L+D+ P WY LFNELY+L GGT+W + P SL
Sbjct: 444 YADWEERISAWQSPVLDDRSLPAWYKSALFNELYFLADGGTVWLE--VPEDSLP------ 495
Query: 461 FSLDTPSKIHTPASSDTALGTRLLENGEENIGQFLYLEGAEYVMYNTYDVHFYSSFALVM 520
D L + G+F YLEG EY MYNTYDVHFY+SFAL+M
Sbjct: 496 --------------EDLVGSMCQLRPILQEYGRFGYLEGQEYRMYNTYDVHFYASFALIM 541
Query: 521 LFPKLELSIQRDFAAAVMMHDPGTMKIMSDGKWVARKCLGAVPHDIGL--DDPWFEINSY 578
L+PKLELS+Q D A A + D + + G K +PHDIG D+PW +N+Y
Sbjct: 542 LWPKLELSLQYDMALATLREDLTERRYLMSGITAPVKRRNVIPHDIGDPDDEPWLRVNAY 601
Query: 579 NLFNSSRWKDLNSKFVLQVYRDFVATGDKNFARAVWPSVYIAMAYMEQFDKDGDGMIENE 638
+ +++ WKDLN KFVLQVYRD+ TGD+ F R +WP M +FDKD DG+IEN
Sbjct: 602 VVHDTADWKDLNLKFVLQVYRDYYLTGDQVFLRDMWPVCLAVMESEMKFDKDQDGLIENG 661
Query: 639 GFPDQTYDAWSANGVSAYCGGLWVAALQAASALANDVGDHASASYFWVRYQKAKAVYDS- 697
G+ DQTYD W G SAYCGGLW+AA+ +A G F + + Y+
Sbjct: 662 GYADQTYDGWVTTGPSAYCGGLWLAAVAVMVQMAALCGTREVQDQFSSILSRGREAYERL 721
Query: 698 LWNGSYFNYDNSDGSSSTSIQADQLAGQWYARACGL----LPIADEAKVKKALTKIYDFN 753
LWNG Y+NYD S + SI +DQ AGQW+ RA GL + V +AL I++FN
Sbjct: 722 LWNGRYYNYDCSSQPQAYSIMSDQCAGQWFLRASGLGEGDTEVFPTPHVVQALQTIFEFN 781
Query: 754 VLKVKGGMCGAMNGMQPDGRIDMSGLQAREIWPGVTYGLAASMIQEEMVDMAFQTAAGVY 813
V GG GA+NGMQP G D S +Q+ E+W GV Y LAA+MIQE + F+TA G Y
Sbjct: 782 VQAFAGGAMGAVNGMQPHGVPDRSSVQSDEVWVGVVYALAATMIQEGLTWEGFRTAEGCY 841
Query: 814 EVAWSEDGLGYSFQTPESWNNNDEYRSLCYMRPLTIW 850
W LG +FQTPE++ +RSL YMRPL+IW
Sbjct: 842 RTVWER--LGLAFQTPEAYCQQRVFRSLAYMRPLSIW 876
>gi|332228461|ref|XP_003263408.1| PREDICTED: LOW QUALITY PROTEIN: non-lysosomal glucosylceramidase
[Nomascus leucogenys]
Length = 919
Score = 576 bits (1484), Expect = e-161, Method: Compositional matrix adjust.
Identities = 331/818 (40%), Positives = 454/818 (55%), Gaps = 64/818 (7%)
Query: 52 LAHIGFRLYRY----SKQEEAKGKVPVFNVFRKHHITSDQGIPLGGIGAGSIGRSYRGEF 107
+ HIG L RY ++ + + K P ++ + G PLGGIG G+I R +RG+F
Sbjct: 104 IKHIGMGL-RYLQWWYRKTQVEKKTPFIDMINSVPLRQIYGCPLGGIGGGTITRGWRGQF 162
Query: 108 QRFKLFHGICDDAPVLANQFSVFVSRPNGEKFSSVLCPRSPGVPKKNTDSGIESWDWNLK 167
R++L G+ V+A+QF+V + R + VL P V + SW+W L
Sbjct: 163 CRWQLNPGMYQHRTVIADQFTVCLRREGQTVYQQVLSLERPSV--------LRSWNWGLC 214
Query: 168 GENCTYHALFPRAWTVYDGEPDPELRIVCRQISPFIPHNYKESSFPASVFTFTLSNSGQT 227
G C HAL+PR WTVY P PE + CRQI+P +PH+Y++SS P VF + + N G
Sbjct: 215 GYFCFLHALYPRXWTVYQ-LPGPECSLTCRQITPILPHDYQDSSLPVGVFVWDVENEGDE 273
Query: 228 SADVTLLFTWANSVAGDSGLSGHHFNSKTMTKDG---VHGLTLHHRTANGRPPVTFAVAA 284
+ DV+++F+ N + G G +N + G V GL LHH T P T AVAA
Sbjct: 274 ALDVSIMFSMRNGLGGGDDAPGGLWNEPFCLERGGETVQGLLLHHPTLPN--PYTMAVAA 331
Query: 285 EETADVHVSECPCFLLSGNSKGITAKDMWNEIKKHGSFDHLDNDKTSPSEPGSSIGAAIA 344
TAD V+ F T + +W ++ + G D +++P++ G I A+
Sbjct: 332 RVTADTTVTHITAF-----DPDSTGQQVWQDLLQDGQLDS-PTGQSTPTQKGVGIAGAVC 385
Query: 345 ASLTI-PSGSTRSVTFSLAWDCPEVKFFEK--VYHRRYTKFYGTLGDSAARIAHDAILEH 401
S + P G R + FSLAWD P + F K V++RRYT+F+G GD+A ++H A+ ++
Sbjct: 386 VSSKLQPRGQCR-LEFSLAWDMPRIMFGAKGQVHYRRYTRFFGQDGDAAPTLSHYALCQY 444
Query: 402 AKWECEIEAWQRPILEDKRFPEWYPITLFNELYYLNAGGTIWTDGSPPMQSLATIRERKF 461
+WE I AWQ P+L+D+ P WY LFNELY+L GGT+W L + +
Sbjct: 445 PEWEERISAWQSPVLDDRSLPAWYKSALFNELYFLADGGTVW---------LEVLED--- 492
Query: 462 SLDTPSKIHTPASSDTALGTRL--LENGEENIGQFLYLEGAEYVMYNTYDVHFYSSFALV 519
S LG + L + G+F YLEG EY MYNTYDVHFY+SFAL+
Sbjct: 493 ------------SLPEELGRNMCHLRPTLRDYGRFGYLEGQEYRMYNTYDVHFYASFALI 540
Query: 520 MLFPKLELSIQRDFAAAVMMHDPGTMKIMSDGKWVARKCLGAVPHDIGL--DDPWFEINS 577
ML+PKLELS+Q D A A + D + + G K +PHDIG D+PW +N+
Sbjct: 541 MLWPKLELSLQYDMALATLREDLTRRRYLMSGVMAPVKRRNVIPHDIGDPDDEPWLRVNA 600
Query: 578 YNLFNSSRWKDLNSKFVLQVYRDFVATGDKNFARAVWPSVYIAMAYMEQFDKDGDGMIEN 637
Y + +++ WKDLN KFVLQVYRD+ TGD+NF + +WP M +FDKD DG+IEN
Sbjct: 601 YLIHDTADWKDLNLKFVLQVYRDYYLTGDQNFLKDMWPVCLAVMESEMKFDKDHDGLIEN 660
Query: 638 EGFPDQTYDAWSANGVSAYCGGLWVAALQAASALANDVGDHASASYFWVRYQKAKAVYDS 697
G+ DQTYD W G SAYCGGLW+AA+ +A G F + + Y+
Sbjct: 661 GGYADQTYDGWVTTGPSAYCGGLWLAAVAVMVQMAALCGAQDIQDKFSSILSRGQEAYER 720
Query: 698 -LWNGSYFNYDNSDGSSSTSIQADQLAGQWYARACGL----LPIADEAKVKKALTKIYDF 752
LWNG Y+NYD+S S S+ +DQ AGQW+ +ACGL + V +AL I++
Sbjct: 721 LLWNGRYYNYDSSSRPQSRSVMSDQCAGQWFLKACGLGEGDTEVFPTPHVVRALQTIFEL 780
Query: 753 NVLKVKGGMCGAMNGMQPDGRIDMSGLQAREIWPGVTYGLAASMIQEEMVDMAFQTAAGV 812
NV GG GA+NGMQP G D S +Q+ E+W GV YGLAA+MIQE + FQTA G
Sbjct: 781 NVQAFAGGAMGAVNGMQPHGVPDKSSVQSDEVWVGVVYGLAATMIQEGLTWEGFQTAEGC 840
Query: 813 YEVAWSEDGLGYSFQTPESWNNNDEYRSLCYMRPLTIW 850
Y W LG +FQTPE++ +RSL YMRPL+IW
Sbjct: 841 YRTVWER--LGLAFQTPEAYCQQRVFRSLAYMRPLSIW 876
>gi|390458158|ref|XP_002743170.2| PREDICTED: non-lysosomal glucosylceramidase [Callithrix jacchus]
Length = 1065
Score = 575 bits (1483), Expect = e-161, Method: Compositional matrix adjust.
Identities = 341/842 (40%), Positives = 470/842 (55%), Gaps = 63/842 (7%)
Query: 52 LAHIGFRLYRY----SKQEEAKGKVPVFNVFRKHHITSDQGIPLGGIGAGSIGRSYRGEF 107
+ HIG L RY ++ + + K P ++ + G PLGGIG G+I R +RG+F
Sbjct: 254 IKHIGMGL-RYLQWWYRKTQVEKKTPFIDMINSVPLRQIYGCPLGGIGGGTITRGWRGQF 312
Query: 108 QRFKLFHGICDDAPVLANQFSVFVSRPNGEKFSSVLCPRSPGVPKKNTDSGIESWDWNLK 167
R++L G+ V+A+QF+V + R + VL P V + SW+W L
Sbjct: 313 CRWQLNPGMYRHQTVIADQFTVCLRREGQTVYQQVLSLERPSV--------LRSWNWGLC 364
Query: 168 GENCTYHALFPRAWTVYDGEPDPELRIVCRQISPFIPHNYKESSFPASVFTFTLSNSGQT 227
G YHAL+PRAWTVY P + + CRQI+P +PH+Y++SS P VF + + N G
Sbjct: 365 GYFAFYHALYPRAWTVYQ-LPGQNVTLTCRQITPILPHDYQDSSLPVGVFVWDVENEGDE 423
Query: 228 SADVTLLFTWANSVAGDSGLSGHHFNSK-TMTKDG--VHGLTLHHRTANGRPPVTFAVAA 284
DV+++F+ N + G G +N + +DG V GL LHH T P T AVAA
Sbjct: 424 DLDVSIMFSMRNGLGGGDDAPGGLWNEPFHLERDGKTVQGLLLHHPTLPN--PYTMAVAA 481
Query: 285 EETADVHVSECPCFLLSGNSKGITAKDMWNEIKKHGSFDHLDNDKTSPSEPGSSIGAAIA 344
TAD V+ F G T + +W ++ + G D +++P++ G + A+
Sbjct: 482 RVTADTTVTHITAFDPDG-----TGQQVWQDLLQDGQLDS-PAGQSTPTQKGVGVAGAVC 535
Query: 345 ASLTI-PSGSTRSVTFSLAWDCPEVKFFEK--VYHRRYTKFYGTLGDSAARIAHDAILEH 401
S + P G R + FSLAWD P + F K V++RRYT+F+G GD+A ++H A+ ++
Sbjct: 536 VSSKLRPRGQCR-LEFSLAWDMPRIMFGAKGQVHYRRYTRFFGQDGDAAPALSHYALCQY 594
Query: 402 AKWECEIEAWQRPILEDKRFPEWYPITLFNELYYLNAGGTIWTDGSPPMQSLATIRERKF 461
+WE I AWQ P+L+D+ P WY LFNELY+L GGT+W + +RE
Sbjct: 595 TEWEERISAWQNPVLDDRLLPAWYKSALFNELYFLADGGTVWLE----------VREDSL 644
Query: 462 SLDTPSKIHTPASSDTALGTRLLENGEENIGQFLYLEGAEYVMYNTYDVHFYSSFALVML 521
P D + L + G+F YLEG EY MYNTYDVHFY+SFAL+ML
Sbjct: 645 ----------PQELDGNMCH--LRPTLRDYGRFGYLEGQEYRMYNTYDVHFYASFALIML 692
Query: 522 FPKLELSIQRDFAAAVMMHDPGTMKIMSDGKWVARKCLGAVPHDIGL--DDPWFEINSYN 579
+PKLELS+Q D A A + D + + G K +PHDIG D+PW +N+Y
Sbjct: 693 WPKLELSLQYDMALATLREDLTRRRYLMSGVMAPVKRRNVIPHDIGDPDDEPWLRVNAYL 752
Query: 580 LFNSSRWKDLNSKFVLQVYRDFVATGDKNFARAVWPSVYIAMAYMEQFDKDGDGMIENEG 639
+ +++ WKDLN KFVLQVYRD+ TGD+NF + +WP M +FDKD DG+IEN G
Sbjct: 753 IHDTADWKDLNLKFVLQVYRDYYLTGDQNFLKDMWPVCLAVMESEMKFDKDHDGLIENGG 812
Query: 640 FPDQTYDAWSANGVSAYCGGLWVAALQAASALANDVGDHASASYFWVRYQKAKAVYDS-L 698
+ DQTYD W G SAYCGGLW+AA+ +A G F + + Y+ L
Sbjct: 813 YADQTYDGWVTTGPSAYCGGLWLAAVAVMVQMAAVCGAQDIQDKFSSILSRGQEAYERLL 872
Query: 699 WNGSYFNYDNSDGSSSTSIQADQLAGQWYARACGL----LPIADEAKVKKALTKIYDFNV 754
WNG Y+NYD+S S S+ +DQ AGQW+ +ACGL + V +AL I++ NV
Sbjct: 873 WNGRYYNYDSSSQPQSRSVMSDQCAGQWFLKACGLGEGDTEVFPTPHVVRALQTIFELNV 932
Query: 755 LKVKGGMCGAMNGMQPDGRIDMSGLQAREIWPGVTYGLAASMIQEEMVDMAFQTAAGVYE 814
GG GA+NGMQP G D S +Q+ E+W GV YGLAA+MIQE + FQTA G Y
Sbjct: 933 QAFAGGAMGAVNGMQPHGVPDKSSVQSDEVWVGVVYGLAATMIQEGLTWEGFQTAEGCYR 992
Query: 815 VAWSEDGLGYSFQTPESWNNNDEYRSLCYMRPLTIWAMQWALTK---PKLSRQEIKHEIS 871
W LG +FQTPE++ +RSL YMRPL+IWAMQ AL + K SR ++K +
Sbjct: 993 TVWER--LGLAFQTPEAYCQQRVFRSLAYMRPLSIWAMQLALQQQQHKKASRPKVKQDTG 1050
Query: 872 DR 873
R
Sbjct: 1051 LR 1052
>gi|198424051|ref|XP_002127036.1| PREDICTED: similar to Non-lysosomal glucosylceramidase (NLGase)
(Glucosylceramidase 2) (Beta-glucocerebrosidase 2)
(Beta-glucosidase 2) [Ciona intestinalis]
Length = 861
Score = 575 bits (1481), Expect = e-161, Method: Compositional matrix adjust.
Identities = 327/853 (38%), Positives = 468/853 (54%), Gaps = 49/853 (5%)
Query: 24 PQLTWQRKLDSNVKV-PTGFTLS-FQEFRHLAHIGFRLYRYSKQEEAKGKVPVFNVFRKH 81
P+ W+ L K PT ++ + + + R +Y Q K K P + F +H
Sbjct: 41 PEHGWKIALSHKFKTKPTPWSFPPLTQIPSVLGLAVRYLQYYNQLRRKNKKPCMDFFNQH 100
Query: 82 HITSDQGIPLGGIGAGSIGRSYRGEFQRFKLFHGICDDAPVLANQFSVFVSRPNGEKFSS 141
+ G+PLGGIG+G+I R ++G+F +++L GI LA+QF V + R N
Sbjct: 101 PLRPIYGVPLGGIGSGTINRGWQGDFCKWQLKPGIYTFDCPLADQFIVSIRRHNTVVCQQ 160
Query: 142 VLCPRSPGVPKKNTDSGIESWDWNLKGENCTYHALFPRAWTVYDGEPDPELRIVCRQISP 201
VL + P + + +W+W G Y+ L+PRAWTVYD + + + CRQISP
Sbjct: 161 VLSTKGP------SGEALRTWNWKYCGSYAYYNGLYPRAWTVYDLQGQ-NIVLTCRQISP 213
Query: 202 FIPHNYKESSFPASVFTFTLSNSGQTSADVTLLFTWANSVAGD------SGLSGHHFNSK 255
P++YK++S PA+VF + + N G +VT++ + N SGL F S
Sbjct: 214 VFPNDYKDTSLPAAVFVWDVENRGNEEVEVTIVMSLENGTGNPEQQKKVSGLWNEPF-SN 272
Query: 256 TMTKDGVHGLTLHHRTANGRPPVTFAVAAEETADVHVSECPCFLLSGNSKGITAKDMWNE 315
T + G+ +H+ T P T VAA E V V+ C F + N ++WN
Sbjct: 273 TDHNNKAKGVVMHNTTYRDMP-YTLTVAAAEKPGVDVTHCTSFNPTHN-----GAEVWNS 326
Query: 316 IKKHGSFDHLDNDKTSPSEPGSSIGAAIAASLTI-PSGSTRSVTFSLAWDCPEVKFFE-- 372
+ G K+S + G A+AAS + P S+ F+L WD P++ F +
Sbjct: 327 LLNTGRLQSKPG-KSSLTATGEHTACAVAASCKVKPHSMAGSMDFALTWDMPQIHFGDAG 385
Query: 373 KVYHRRYTKFYGTLGDSA-ARIAHDAILEHAKWECEIEAWQRPILEDKRFPEWYPITLFN 431
K Y RRYT+++ GD+A A IA A+ + WE +IEAWQRPIL++K+ P+WY +FN
Sbjct: 386 KHYKRRYTRWFN--GDNAGATIASYALDNYQTWEDKIEAWQRPILDNKQLPDWYKSAIFN 443
Query: 432 ELYYLNAGGTIWTDGSPPMQSLATIRERKFSLDTPSKIHTPASSDTALGTRLLENGEENI 491
ELYY+ GGT+W + + A + P + D+ + +
Sbjct: 444 ELYYITDGGTVWLEVAEEENGTANGK-------VPETLQNTTKVDSC-------DFLKEY 489
Query: 492 GQFLYLEGAEYVMYNTYDVHFYSSFALVMLFPKLELSIQRDFAAAVMMHDPGTMKIMSDG 551
G+F YLEG EY MYNTYDVHFY+S AL L+PKLELS+Q D A +++ +P K + DG
Sbjct: 490 GKFAYLEGQEYKMYNTYDVHFYASIALAYLWPKLELSVQYDIATSILHSNPQPHKYLMDG 549
Query: 552 KWVARKCLGAVPHDIGL--DDPWFEINSYNLFNSSRWKDLNSKFVLQVYRDFVATGDKNF 609
K VPHD+G D+PW +N+Y + +++ W+DLN KFVLQ YRD+ T D +F
Sbjct: 550 VTAPVKTPNVVPHDVGCPEDEPWLRVNTYFVHDTADWRDLNPKFVLQAYRDYYITKDIDF 609
Query: 610 ARAVWPSVYIAMAYMEQFDKDGDGMIENEGFPDQTYDAWSANGVSAYCGGLWVAALQAAS 669
+A+WP I M + DKD DG+IEN G DQT+D W G SAYCGGLW+AAL+
Sbjct: 610 LKAMWPICKIVMEQSMRHDKDNDGLIENSGAADQTFDGWCVTGPSAYCGGLWLAALRCME 669
Query: 670 ALANDVGDHASASYFW-VRYQKAKAVYDSLWNGSYFNYDNSDG-SSSTSIQADQLAGQWY 727
A+ + + F + + KA D LWNG YFNYD S+ TS+ ADQ AGQW+
Sbjct: 670 EAADILHEKDDQVKFHDILVRATKAYQDRLWNGKYFNYDCSNNPQHRTSVMADQCAGQWF 729
Query: 728 ARACGLLPIADEAKVKKALTKIYDFNVLKVKGGMCGAMNGMQPDGRIDMSGLQAREIWPG 787
A G+ + + +V +L I+ NV + GG GA+NGM+P G +D S +Q+ E+W G
Sbjct: 730 LLASGICDVLPDDQVMSSLKTIFSLNVEQYDGGRHGAVNGMRPHGAVDTSSVQSDEVWTG 789
Query: 788 VTYGLAASMIQEEMVDMAFQTAAGVYEVAWSEDGLGYSFQTPESWNNNDEYRSLCYMRPL 847
VTY LAA+MI + M+ F+TA+G+Y W G+ +FQTPE+ N YRSL YMRPL
Sbjct: 790 VTYALAATMIHKGMLKEGFRTASGIYRSCWQRYGM--AFQTPEAMRKNHTYRSLAYMRPL 847
Query: 848 TIWAMQWALTKPK 860
+IW MQ+AL PK
Sbjct: 848 SIWGMQYALENPK 860
>gi|402897018|ref|XP_003911574.1| PREDICTED: non-lysosomal glucosylceramidase [Papio anubis]
Length = 927
Score = 574 bits (1480), Expect = e-161, Method: Compositional matrix adjust.
Identities = 342/859 (39%), Positives = 473/859 (55%), Gaps = 67/859 (7%)
Query: 52 LAHIGFRLYRY----SKQEEAKGKVPVFNVFRKHHITSDQGIPLGGIGAGSIGRSYRGEF 107
+ HIG L RY ++ + + K P ++ + G PLGGIG G+I R +RG+F
Sbjct: 112 IKHIGMGL-RYLQWWYRKTQVEKKTPFIDMINSVPLRQIYGCPLGGIGGGTITRGWRGQF 170
Query: 108 QRFKLFHGICDDAPVLANQFSVFVSRPNGEKFSSVLCPRSPGVPKKNTDSGIESWDWNLK 167
R++L G+ V+A+QF+V + R + VL P V + SW+W L
Sbjct: 171 CRWQLNPGMYQHRTVIADQFTVCLRREGQTVYQQVLSLERPSV--------LRSWNWGLC 222
Query: 168 GENCTYHALFPRAWTVYDGEPDPELRIVCRQISPFIPHNYKESSFPASVFTFTLSNSGQT 227
G YHAL+PRAWTVY P + + CRQI+P +PH+Y++SS P VF + + N G
Sbjct: 223 GYFAFYHALYPRAWTVYQ-LPGQNVTLTCRQITPILPHDYQDSSLPVGVFVWDVENEGDE 281
Query: 228 SADVTLLFTWANSVAGDSGLSGHHFNSKTMTKDG---VHGLTLHHRTANGRPPVTFAVAA 284
+ DV+++F+ N + G G +N + G V GL LHH T P T AVAA
Sbjct: 282 ALDVSIMFSMRNGLGGGDDAPGGLWNEPFCLEHGGETVQGLLLHHPTLPN--PYTMAVAA 339
Query: 285 EETADVHVSECPCFLLSGNSKGITAKDMWNEIKKHGSFDHLDNDKTSPSEPGSSIGAAIA 344
TAD V+ F T + +W ++ + G D +++P++ G I A+
Sbjct: 340 RVTADTTVTHITAF-----DPDSTGQQVWQDLLQDGQLDS-PAGQSTPTQKGVGIAGAVC 393
Query: 345 ASLTI-PSGSTRSVTFSLAWDCPEVKFFEK--VYHRRYTKFYGTLGDSAARIAHDAILEH 401
S + P G R + FSLAWD P + F K V++RRYT+F+G GD+A ++H A+ ++
Sbjct: 394 VSSKLRPRGQCR-LEFSLAWDMPRILFGAKGQVHYRRYTRFFGQDGDAAPALSHYALCQY 452
Query: 402 AKWECEIEAWQRPILEDKRFPEWYPITLFNELYYLNAGGTIWTDGSPPMQSLATIRERKF 461
A+WE I AWQ P+L+D+ P WY LFNELY+L GGT+W + +RE
Sbjct: 453 AEWEERISAWQSPVLDDRSLPAWYKSALFNELYFLADGGTVWLE----------VREDSL 502
Query: 462 SLDTPSKIHTPASSDTALGTRL--LENGEENIGQFLYLEGAEYVMYNTYDVHFYSSFALV 519
+ LG + L + G+F YLEG EY MYNTYDVHFY+SFAL+
Sbjct: 503 PEE--------------LGRNMCHLCPTLRDYGRFGYLEGQEYRMYNTYDVHFYASFALI 548
Query: 520 MLFPKLELSIQRDFAAAVMMHDPGTMKIMSDGKWVARKCLGAVPHDIGL--DDPWFEINS 577
ML+PKLELS+Q D A A + D + + G K +PHDIG D+PW +N+
Sbjct: 549 MLWPKLELSLQYDMALATLREDLTRRRYLMSGVMAPVKRRNVIPHDIGDPDDEPWLRVNA 608
Query: 578 YNLFNSSRWKDLNSKFVLQVYRDFVATGDKNFARAVWPSVYIAMAYMEQFDKDGDGMIEN 637
Y + +++ WKDLN KFVLQVYRD+ TGD+NF + +WP M +FDKD DG+IEN
Sbjct: 609 YLIHDTADWKDLNLKFVLQVYRDYYLTGDQNFLKDMWPVCLAVMESEMKFDKDHDGLIEN 668
Query: 638 EGFPDQTYDAWSANGVSAYCGGLWVAALQAASALANDVGDHASASYFWVRYQKAKAVYDS 697
G+ DQTYD W G S YCGGLW+AA+ +A G F + + Y+
Sbjct: 669 GGYADQTYDGWVTTGPSVYCGGLWLAAVAVMVQMAALCGAQDIQDKFSSILHRGQEAYER 728
Query: 698 -LWNGSYFNYDNSDGSSSTSIQADQLAGQWYARACGL----LPIADEAKVKKALTKIYDF 752
LWNG Y+NYD+S S S+ +DQ AGQW+ +ACGL + V +AL I++
Sbjct: 729 LLWNGRYYNYDSSSQPQSRSVMSDQCAGQWFLKACGLGEGETEVFPTPHVVRALQTIFEL 788
Query: 753 NVLKVKGGMCGAMNGMQPDGRIDMSGLQAREIWPGVTYGLAASMIQEEMVDMAFQTAAGV 812
NV GG GA+NGMQP G D S +Q+ E+W GV YGLAA+MIQE + FQTA G
Sbjct: 789 NVQAFAGGAMGAVNGMQPHGVPDKSSVQSDEVWVGVVYGLAATMIQEGLTWEGFQTAEGC 848
Query: 813 YEVAWSEDGLGYSFQTPESWNNNDEYRSLCYMRPLTIWAMQWALTK---PKLSRQEIKHE 869
Y W LG +FQTPE++ +RSL YMRPL+IWAMQ AL + K SR +++
Sbjct: 849 YRTVWER--LGLAFQTPEAYCQQRVFRSLAYMRPLSIWAMQLALQQQQHKKASRPKVEQG 906
Query: 870 ISDRDSSYLEQHAAFSKVA 888
R E A + V+
Sbjct: 907 TGLRTGPMFEPKEAMANVS 925
>gi|109111050|ref|XP_001084705.1| PREDICTED: non-lysosomal glucosylceramidase [Macaca mulatta]
Length = 927
Score = 574 bits (1480), Expect = e-161, Method: Compositional matrix adjust.
Identities = 331/818 (40%), Positives = 456/818 (55%), Gaps = 64/818 (7%)
Query: 52 LAHIGFRLYRY----SKQEEAKGKVPVFNVFRKHHITSDQGIPLGGIGAGSIGRSYRGEF 107
+ HIG L RY ++ + + K P ++ + G PLGGIG G+I R +RG+F
Sbjct: 112 IKHIGMGL-RYLQWWYRKTQVEKKTPFIDMINSVPLRQIYGCPLGGIGGGTITRGWRGQF 170
Query: 108 QRFKLFHGICDDAPVLANQFSVFVSRPNGEKFSSVLCPRSPGVPKKNTDSGIESWDWNLK 167
R++L G+ V+A+QF+V + R + VL P V + SW+W L
Sbjct: 171 CRWQLNPGMYQHQTVIADQFTVCLRREGQTVYQQVLSLERPSV--------LRSWNWGLC 222
Query: 168 GENCTYHALFPRAWTVYDGEPDPELRIVCRQISPFIPHNYKESSFPASVFTFTLSNSGQT 227
G YHAL+PRAWTVY P ++ + CRQI+P +PH+Y++SS P VF + + N G
Sbjct: 223 GYFAFYHALYPRAWTVYQ-LPGQDVTLTCRQITPILPHDYQDSSLPVGVFVWDVENEGDE 281
Query: 228 SADVTLLFTWANSVAGDSGLSGHHFNSKTMTKDG---VHGLTLHHRTANGRPPVTFAVAA 284
+ DV+++F+ N + G G +N + G V GL LHH T P T AVAA
Sbjct: 282 ALDVSIMFSMRNGLGGGDDAPGGLWNEPFCLEHGGETVQGLLLHHPTLPN--PYTMAVAA 339
Query: 285 EETADVHVSECPCFLLSGNSKGITAKDMWNEIKKHGSFDHLDNDKTSPSEPGSSIGAAIA 344
TAD V+ F T + +W ++ + G D +++P++ G I A+
Sbjct: 340 RVTADTTVTHITAF-----DPDSTGQQVWQDLLQDGQLDS-PAGQSTPTQKGVGIAGAVC 393
Query: 345 ASLTI-PSGSTRSVTFSLAWDCPEVKFFEK--VYHRRYTKFYGTLGDSAARIAHDAILEH 401
S + P G R + FSLAWD P + F K V++RRYT+F+G GD+A ++H A+ +
Sbjct: 394 VSSKLRPRGQCR-LEFSLAWDMPRILFGAKGQVHYRRYTRFFGQDGDAAPALSHYALCRY 452
Query: 402 AKWECEIEAWQRPILEDKRFPEWYPITLFNELYYLNAGGTIWTDGSPPMQSLATIRERKF 461
A+WE I AWQ P+L+D+ P WY LFNELY+L GGT+W + +RE
Sbjct: 453 AEWEERISAWQSPVLDDRSLPAWYKSALFNELYFLADGGTVWLE----------VREDSL 502
Query: 462 SLDTPSKIHTPASSDTALGTRL--LENGEENIGQFLYLEGAEYVMYNTYDVHFYSSFALV 519
+ LG + L + G+F YLEG EY MYNTYDVHFY+SFAL+
Sbjct: 503 PEE--------------LGRNMCHLCPTLRDYGRFGYLEGQEYRMYNTYDVHFYASFALI 548
Query: 520 MLFPKLELSIQRDFAAAVMMHDPGTMKIMSDGKWVARKCLGAVPHDIGL--DDPWFEINS 577
ML+PKLELS+Q D A A + D + + G K +PHDIG D+PW +N+
Sbjct: 549 MLWPKLELSLQYDMALATLREDLTRRQYLMSGVMAPVKRRNVIPHDIGDPDDEPWLRVNA 608
Query: 578 YNLFNSSRWKDLNSKFVLQVYRDFVATGDKNFARAVWPSVYIAMAYMEQFDKDGDGMIEN 637
Y + +++ WKDLN KFVLQVYRD+ TGD+NF + +WP M +FDKD DG+IEN
Sbjct: 609 YLIHDTADWKDLNLKFVLQVYRDYYLTGDQNFLKDMWPVCLAVMESEMKFDKDHDGLIEN 668
Query: 638 EGFPDQTYDAWSANGVSAYCGGLWVAALQAASALANDVGDHASASYFWVRYQKAKAVYDS 697
G+ DQTYD W G SAYCGGLW+AA+ +A G F + + Y+
Sbjct: 669 GGYADQTYDGWVTTGPSAYCGGLWLAAVAVMVQMAALCGAQDIQDKFSSILHRGQEAYER 728
Query: 698 -LWNGSYFNYDNSDGSSSTSIQADQLAGQWYARACGL----LPIADEAKVKKALTKIYDF 752
LWNG Y+NYD+S S S+ +DQ AGQW+ +ACGL + V +AL I++
Sbjct: 729 LLWNGRYYNYDSSSQPQSRSVMSDQCAGQWFLKACGLGEGETEVFPTPHVVRALQTIFEL 788
Query: 753 NVLKVKGGMCGAMNGMQPDGRIDMSGLQAREIWPGVTYGLAASMIQEEMVDMAFQTAAGV 812
NV GG GA+NGMQP G D S +Q+ E+W GV YGLAA+MIQE + FQTA G
Sbjct: 789 NVQAFAGGAMGAVNGMQPHGVPDKSSVQSDEVWVGVVYGLAATMIQEGLTWEGFQTAEGC 848
Query: 813 YEVAWSEDGLGYSFQTPESWNNNDEYRSLCYMRPLTIW 850
Y W LG +FQTPE++ +RSL YMRPL+IW
Sbjct: 849 YRTVWER--LGLAFQTPEAYCQQRVFRSLAYMRPLSIW 884
>gi|355567647|gb|EHH23988.1| Non-lysosomal glucosylceramidase [Macaca mulatta]
gi|355753217|gb|EHH57263.1| Non-lysosomal glucosylceramidase [Macaca fascicularis]
gi|380789499|gb|AFE66625.1| non-lysosomal glucosylceramidase [Macaca mulatta]
gi|384943046|gb|AFI35128.1| non-lysosomal glucosylceramidase [Macaca mulatta]
Length = 927
Score = 574 bits (1480), Expect = e-161, Method: Compositional matrix adjust.
Identities = 331/818 (40%), Positives = 456/818 (55%), Gaps = 64/818 (7%)
Query: 52 LAHIGFRLYRY----SKQEEAKGKVPVFNVFRKHHITSDQGIPLGGIGAGSIGRSYRGEF 107
+ HIG L RY ++ + + K P ++ + G PLGGIG G+I R +RG+F
Sbjct: 112 IKHIGMGL-RYLQWWYRKTQVEKKTPFIDMINSVPLRQIYGCPLGGIGGGTITRGWRGQF 170
Query: 108 QRFKLFHGICDDAPVLANQFSVFVSRPNGEKFSSVLCPRSPGVPKKNTDSGIESWDWNLK 167
R++L G+ V+A+QF+V + R + VL P V + SW+W L
Sbjct: 171 CRWQLNPGMYQHRTVIADQFTVCLRREGQTVYQQVLSLERPSV--------LRSWNWGLC 222
Query: 168 GENCTYHALFPRAWTVYDGEPDPELRIVCRQISPFIPHNYKESSFPASVFTFTLSNSGQT 227
G YHAL+PRAWTVY P ++ + CRQI+P +PH+Y++SS P VF + + N G
Sbjct: 223 GYFAFYHALYPRAWTVYQ-LPGQDVTLTCRQITPILPHDYQDSSLPVGVFVWDVENEGDE 281
Query: 228 SADVTLLFTWANSVAGDSGLSGHHFNSKTMTKDG---VHGLTLHHRTANGRPPVTFAVAA 284
+ DV+++F+ N + G G +N + G V GL LHH T P T AVAA
Sbjct: 282 ALDVSIMFSMRNGLGGGDDAPGGLWNEPFCLEHGGETVQGLLLHHPTLPN--PYTMAVAA 339
Query: 285 EETADVHVSECPCFLLSGNSKGITAKDMWNEIKKHGSFDHLDNDKTSPSEPGSSIGAAIA 344
TAD V+ F T + +W ++ + G D +++P++ G I A+
Sbjct: 340 RVTADTTVTHITAF-----DPDSTGQQVWQDLLQDGQLDS-PAGQSTPTQKGVGIAGAVC 393
Query: 345 ASLTI-PSGSTRSVTFSLAWDCPEVKFFEK--VYHRRYTKFYGTLGDSAARIAHDAILEH 401
S + P G R + FSLAWD P + F K V++RRYT+F+G GD+A ++H A+ +
Sbjct: 394 VSSKLRPRGQCR-LEFSLAWDMPRILFGAKGQVHYRRYTRFFGQDGDAAPALSHYALCRY 452
Query: 402 AKWECEIEAWQRPILEDKRFPEWYPITLFNELYYLNAGGTIWTDGSPPMQSLATIRERKF 461
A+WE I AWQ P+L+D+ P WY LFNELY+L GGT+W + +RE
Sbjct: 453 AEWEERISAWQSPVLDDRSLPAWYKSALFNELYFLADGGTVWLE----------VREDSL 502
Query: 462 SLDTPSKIHTPASSDTALGTRL--LENGEENIGQFLYLEGAEYVMYNTYDVHFYSSFALV 519
+ LG + L + G+F YLEG EY MYNTYDVHFY+SFAL+
Sbjct: 503 PEE--------------LGRNMCHLCPTLRDYGRFGYLEGQEYRMYNTYDVHFYASFALI 548
Query: 520 MLFPKLELSIQRDFAAAVMMHDPGTMKIMSDGKWVARKCLGAVPHDIGL--DDPWFEINS 577
ML+PKLELS+Q D A A + D + + G K +PHDIG D+PW +N+
Sbjct: 549 MLWPKLELSLQYDMALATLREDLTRRQYLMSGVMAPVKRRNVIPHDIGDPDDEPWLRVNA 608
Query: 578 YNLFNSSRWKDLNSKFVLQVYRDFVATGDKNFARAVWPSVYIAMAYMEQFDKDGDGMIEN 637
Y + +++ WKDLN KFVLQVYRD+ TGD+NF + +WP M +FDKD DG+IEN
Sbjct: 609 YLIHDTADWKDLNLKFVLQVYRDYYLTGDQNFLKDMWPVCLAVMESEMKFDKDHDGLIEN 668
Query: 638 EGFPDQTYDAWSANGVSAYCGGLWVAALQAASALANDVGDHASASYFWVRYQKAKAVYDS 697
G+ DQTYD W G SAYCGGLW+AA+ +A G F + + Y+
Sbjct: 669 GGYADQTYDGWVTTGPSAYCGGLWLAAVAVMVQMAALCGAQDIQDKFSSILHRGQEAYER 728
Query: 698 -LWNGSYFNYDNSDGSSSTSIQADQLAGQWYARACGL----LPIADEAKVKKALTKIYDF 752
LWNG Y+NYD+S S S+ +DQ AGQW+ +ACGL + V +AL I++
Sbjct: 729 LLWNGRYYNYDSSSQPQSRSVMSDQCAGQWFLKACGLGEGETEVFPTPHVVRALQTIFEL 788
Query: 753 NVLKVKGGMCGAMNGMQPDGRIDMSGLQAREIWPGVTYGLAASMIQEEMVDMAFQTAAGV 812
NV GG GA+NGMQP G D S +Q+ E+W GV YGLAA+MIQE + FQTA G
Sbjct: 789 NVQAFAGGAMGAVNGMQPHGVPDKSSVQSDEVWVGVVYGLAATMIQEGLTWEGFQTAEGC 848
Query: 813 YEVAWSEDGLGYSFQTPESWNNNDEYRSLCYMRPLTIW 850
Y W LG +FQTPE++ +RSL YMRPL+IW
Sbjct: 849 YRTVWER--LGLAFQTPEAYCQQRVFRSLAYMRPLSIW 884
>gi|432920219|ref|XP_004079895.1| PREDICTED: non-lysosomal glucosylceramidase-like [Oryzias latipes]
Length = 858
Score = 574 bits (1479), Expect = e-161, Method: Compositional matrix adjust.
Identities = 326/815 (40%), Positives = 458/815 (56%), Gaps = 50/815 (6%)
Query: 55 IGFRLYRY-SKQEEAKGKVPVFNVFRKHHITSDQGIPLGGIGAGSIGRSYRGEFQRFKLF 113
+G R R+ K+ + + K P ++F + G PLGGIG G+I R +RGEF R++L
Sbjct: 62 LGIRYLRWWYKKTQVEKKAPFIDMFHALPLRQIYGAPLGGIGGGTITRGWRGEFCRWQLN 121
Query: 114 HGICDDAPVLANQFSVFVSRPNGEKFSSVLCPRSPGVPKKNTDSGIESWDWNLKGENCTY 173
G+ V+A+QF+V + R + VL P S ++ W+W GE Y
Sbjct: 122 PGMYHYKTVIADQFTVCLRRGGQTVYQQVLSVEHP--------STLQGWNWGYCGEYAFY 173
Query: 174 HALFPRAWTVYDGEPDPELRIVCRQISPFIPHNYKESSFPASVFTFTLSNSGQTSADVTL 233
HAL+PRAWTVY P + + CRQISP IPH+YK+SS P +VF + + N + +V++
Sbjct: 174 HALYPRAWTVYH-LPGQNVTLTCRQISPVIPHDYKDSSLPVAVFVWDIENKNDYALEVSI 232
Query: 234 LFTWANSVAGDSGLSGHHFNSK-TMTKDG--VHGLTLHHRTANGRPPVTFAVAAEETADV 290
+FT N SG H+N + K+G + G+ LHH TA P T ++A E D
Sbjct: 233 MFTMVNGSGHKDDSSGGHWNEPFHLEKEGKALSGVLLHHCTAVN--PYTLCISAREQPDR 290
Query: 291 HVSECPCFLLSGNSKGITAKDMWNEIKKHGSFDHLDNDKTSPSEPGSSIGAAIAASLTIP 350
VS F SG G +W+++ G D + + P++ G + AA+A ++P
Sbjct: 291 KVSHQTAFSPSGTCSG-----LWSDLMTDGRLDS-PSGSSPPTQKGEKVAAALAVGCSVP 344
Query: 351 SGSTRSVTFSLAWDCPEVKF--FEKVYHRRYTKFYGTLGDSAARIAHDAILEHAKWECEI 408
+ V F LAWD P + F E+ + RRYT+++G+ G+++ ++H A+ + +WE I
Sbjct: 345 AHGHNDVEFCLAWDMPLITFGSRERKHVRRYTRYFGSKGEASPSLSHYALTHYKEWERRI 404
Query: 409 EAWQRPILEDKRFPEWYPITLFNELYYLNAGGTIWTDGSPPMQSLATIRERKFSLDTPSK 468
E WQ PIL+D P WY LFNELY++ GGT+WT+ + +R L
Sbjct: 405 EEWQSPILQDSSLPSWYKSALFNELYFVADGGTVWTELAEDADVSGGVRSEDGGL----- 459
Query: 469 IHTPASSDTALGTRLLENGEENIGQFLYLEGAEYVMYNTYDVHFYSSFALVMLFPKLELS 528
PA D + G+F YLEG EY MYNTYDVHFY+SFAL+ML+PKL LS
Sbjct: 460 ---PAQPDVI----------KEYGRFAYLEGQEYRMYNTYDVHFYASFALIMLWPKLALS 506
Query: 529 IQRDFAAAVMMHDPGTMKIMSDGKWVARKCLGAVPHDIGL--DDPWFEINSYNLFNSSRW 586
+Q D A +V+ D + G++ K VPHDIG D+PW +N+Y + +++ W
Sbjct: 507 VQYDIAGSVVQQDLTERLHLMSGRYSPVKTKNVVPHDIGDPDDEPWHRLNAYLIHDTAGW 566
Query: 587 KDLNSKFVLQVYRDFVATGDKNFARAVWPSVYIAMAYMEQFDKDGDGMIENEGFPDQTYD 646
KDLN KFVLQVYRDF T D + + +WP M +FD DGDG+IEN G+ DQTYD
Sbjct: 567 KDLNLKFVLQVYRDFHITQDNQYLKDMWPICQAVMESEIKFDLDGDGLIENSGYADQTYD 626
Query: 647 AWSANGVSAYCGGLWVAALQAASALANDVGDHASASYFWVRYQKAKAVYDS-LWNGSYFN 705
W+ G SAYCGGLW+A+L +A V + ++ + A +D LWNG Y+N
Sbjct: 627 GWTVTGPSAYCGGLWLASLCVMCKMARLVDTERTYQHYKDILDRGSAAFDKLLWNGKYYN 686
Query: 706 YDNSDGSSSTSIQADQLAGQWY--ARACGL--LPIADEAKVKKALTKIYDFNVLKVKGGM 761
YD+S S S+ +DQ AG W+ A ACG + K++ L I+D NV+ GG
Sbjct: 687 YDSSGRYHSNSVMSDQCAGHWFLKASACGEEDYQAFPKEKIQSTLKSIFDLNVMAFAGGQ 746
Query: 762 CGAMNGMQPDGRIDMSGLQAREIWPGVTYGLAASMIQEEMVDMAFQTAAGVYEVAWSEDG 821
GA+NGM+P+G D S +Q+ E+W GV YGLAA+MI E M + TA G Y W
Sbjct: 747 MGAVNGMRPEGVPDRSSVQSDEVWIGVVYGLAATMIHEGMREEGMHTAEGCYRTVWER-- 804
Query: 822 LGYSFQTPESWNNNDEYRSLCYMRPLTIWAMQWAL 856
LG +FQTPE++ + YRSL YMRPL+IWAMQ AL
Sbjct: 805 LGMAFQTPEAYCEKNIYRSLAYMRPLSIWAMQLAL 839
>gi|354485725|ref|XP_003505033.1| PREDICTED: non-lysosomal glucosylceramidase-like, partial
[Cricetulus griseus]
Length = 898
Score = 574 bits (1479), Expect = e-161, Method: Compositional matrix adjust.
Identities = 330/817 (40%), Positives = 451/817 (55%), Gaps = 65/817 (7%)
Query: 52 LAHIGFRLYRYSK----QEEAKGKVPVFNVFRKHHITSDQGIPLGGIGAGSIGRSYRGEF 107
+ H G L RY K + + + K P ++F + G PLGGIG G+I R +RG+F
Sbjct: 86 IKHFGMGL-RYLKWWYRKTQVEKKTPFIDMFNSVPLRQIYGCPLGGIGGGTITRGWRGQF 144
Query: 108 QRFKLFHGICDDAPVLANQFSVFVSRPNGEKFSSVLCPRSPGVPKKNTDSGIESWDWNLK 167
R++L G+ V+A+QF+V + R + VL P V + SW+W L
Sbjct: 145 CRWQLNPGMYQHQTVIADQFTVCLRRNGRTVYQQVLSLERPSV--------LRSWNWGLC 196
Query: 168 GENCTYHALFPRAWTVYDGEPDPELRIVCRQISPFIPHNYKESSFPASVFTFTLSNSGQT 227
G YHAL+PRAWTVY P + + CRQI+P +PH+Y++S P VF + + N G
Sbjct: 197 GHFAFYHALYPRAWTVYQ-LPGQSVTLTCRQITPILPHDYQDSCLPVGVFVWDVENQGDE 255
Query: 228 SADVTLLFTWANSVAG-DSGLSGHHFNSKTMTKDG--VHGLTLHHRTANGRPPVTFAVAA 284
+ DV+++F+ N + G D L G + +DG V GL LHH T P T AVAA
Sbjct: 256 TLDVSIMFSMRNGLGGEDDALGGLWNEPFCLQRDGKTVQGLLLHHPTPPN--PYTMAVAA 313
Query: 285 EETADVHVSECPCFLLSGNSKGITAKDMWNEIKKHGSFDHLDNDKTSPSEPGSSIGAAIA 344
+T H++ T + +W + + G D +++PS+ G + A+
Sbjct: 314 RDTTVTHIT--------AFDPDSTGQQVWKHLLQDGQLDS-PAGQSTPSKKGEGVAGAVC 364
Query: 345 ASLTIPSGSTRSVTFSLAWDCPEVKFFEK--VYHRRYTKFYGTLGDSAARIAHDAILEHA 402
S +P + FSLAWD P + F K +++RRYT+F+G+ GD A ++H A+ ++A
Sbjct: 365 ISSKLPPRGQCCLEFSLAWDMPRIMFGAKGQIHYRRYTRFFGSDGDVAPALSHYALCQYA 424
Query: 403 KWECEIEAWQRPILEDKRFPEWYPITLFNELYYLNAGGTIWTDGSPPMQSLATIRERKFS 462
WE I AWQ P+L+D+ P WY LFNELY+L GGT+W
Sbjct: 425 DWENRISAWQSPVLDDRSLPAWYKSALFNELYFLADGGTVW------------------- 465
Query: 463 LDTPSKIHTPASSDTALGTRL--LENGEENIGQFLYLEGAEYVMYNTYDVHFYSSFALVM 520
L+ P S LG + L ++ G+F YLEG EY MYNTYDVHFY+SFALVM
Sbjct: 466 LEVPED-----SLPEELGENMYQLRPILQDYGRFGYLEGQEYRMYNTYDVHFYASFALVM 520
Query: 521 LFPKLELSIQRDFAAAVMMHDPGTMKIMSDGKWVARKCLGAVPHDIGL--DDPWFEINSY 578
L+PKLELS+Q D A A D + + G K +PHDIG D+PW +N+Y
Sbjct: 521 LWPKLELSLQYDMALATFKEDLTRRRYLMSGVVAPVKRRNVIPHDIGDPDDEPWLRVNAY 580
Query: 579 NLFNSSRWKDLNSKFVLQVYRDFVATGDKNFARAVWPSVYIAMAYMEQFDKDGDGMIENE 638
+ +++ WKDLN KFVLQVYRD+ TGD+ F + +WP M +FDKD DG+IEN
Sbjct: 581 LIHDTADWKDLNLKFVLQVYRDYYLTGDQGFLKDMWPVCLAVMESEMKFDKDQDGLIENG 640
Query: 639 GFPDQTYDAWSANGVSAYCGGLWVAALQAASALANDVGDHASASYFWVRYQKAKAVYDS- 697
G+ DQTYD W A G SAYCGGLW+AA+ +A G F + + Y+
Sbjct: 641 GYADQTYDGWVATGPSAYCGGLWLAAVAVMVQMAVLCGAQDVQDKFSSILCRGREAYERL 700
Query: 698 LWNGSYFNYDNSDGSSSTSIQADQLAGQWYARACGL----LPIADEAKVKKALTKIYDFN 753
LWNG Y+NYD+S S SI +DQ AGQW+ RACGL + V +AL I++ N
Sbjct: 701 LWNGRYYNYDSSSQPQSRSIMSDQCAGQWFLRACGLGEGDTEVFPTLHVVRALQTIFELN 760
Query: 754 VLKVKGGMCGAMNGMQPDGRIDMSGLQAREIWPGVTYGLAASMIQEEMVDMAFQTAAGVY 813
V GG GA+NGMQP G D S +Q+ E+W GV YGLAA+MIQE + F+TA G Y
Sbjct: 761 VQAFAGGAMGAVNGMQPHGVPDRSSVQSDEVWVGVVYGLAATMIQEGLTREGFRTAEGCY 820
Query: 814 EVAWSEDGLGYSFQTPESWNNNDEYRSLCYMRPLTIW 850
W LG +FQTPE++ +RSL YMRPL+IW
Sbjct: 821 RTVWER--LGLAFQTPEAYCQQQVFRSLAYMRPLSIW 855
>gi|297684086|ref|XP_002819683.1| PREDICTED: non-lysosomal glucosylceramidase isoform 1 [Pongo
abelii]
Length = 927
Score = 572 bits (1475), Expect = e-160, Method: Compositional matrix adjust.
Identities = 331/818 (40%), Positives = 454/818 (55%), Gaps = 64/818 (7%)
Query: 52 LAHIGFRLYRY----SKQEEAKGKVPVFNVFRKHHITSDQGIPLGGIGAGSIGRSYRGEF 107
+ HIG L RY ++ + + K P ++ + G PLGGIG G+I R +RG+F
Sbjct: 112 IKHIGMGL-RYLQWWYQKTQVEKKTPFIDMINSVPLRQIYGCPLGGIGGGTITRGWRGQF 170
Query: 108 QRFKLFHGICDDAPVLANQFSVFVSRPNGEKFSSVLCPRSPGVPKKNTDSGIESWDWNLK 167
R++L G+ V+A+QF+V + R + VL P V + SW+W L
Sbjct: 171 CRWQLNPGMYQHRTVIADQFTVCLRREGQTVYQQVLSLERPTV--------LRSWNWGLC 222
Query: 168 GENCTYHALFPRAWTVYDGEPDPELRIVCRQISPFIPHNYKESSFPASVFTFTLSNSGQT 227
G YHAL+PRAWTVY P + + CRQI+P +PH+Y++SS P VF + + N G
Sbjct: 223 GYFAFYHALYPRAWTVYQ-LPGQNVTLTCRQITPILPHDYQDSSLPVGVFVWDVENEGDE 281
Query: 228 SADVTLLFTWANSVAGDSGLSGHHFNSKTMTKDG---VHGLTLHHRTANGRPPVTFAVAA 284
+ DV+++F+ N + G G +N + G V GL LHH T P T AVAA
Sbjct: 282 ALDVSIMFSMRNGLGGGDDAPGGLWNEPFCLERGRETVQGLLLHHPTLPN--PYTMAVAA 339
Query: 285 EETADVHVSECPCFLLSGNSKGITAKDMWNEIKKHGSFDHLDNDKTSPSEPGSSIGAAIA 344
TAD V+ F T + +W ++ + G D +++P++ G I A+
Sbjct: 340 RVTADTTVTHITAF-----DPDSTGQQVWQDLLQDGQLDS-PTGQSTPTQKGVGIAGAVC 393
Query: 345 ASLTI-PSGSTRSVTFSLAWDCPEVKFFEK--VYHRRYTKFYGTLGDSAARIAHDAILEH 401
S + P G R + FSLAWD P + F K V++RRYT+F+G GD+A ++H A+ +
Sbjct: 394 ISSKLRPRGQCR-LEFSLAWDMPRIMFGAKGQVHYRRYTRFFGQDGDAAPALSHYALCRY 452
Query: 402 AKWECEIEAWQRPILEDKRFPEWYPITLFNELYYLNAGGTIWTDGSPPMQSLATIRERKF 461
A+WE I AWQ P+L+D+ P WY LFNELY+L GGT+W L + +
Sbjct: 453 AEWEERISAWQSPVLDDRSLPAWYKSALFNELYFLADGGTVW---------LEVLED--- 500
Query: 462 SLDTPSKIHTPASSDTALGTRL--LENGEENIGQFLYLEGAEYVMYNTYDVHFYSSFALV 519
S LG + L + G+F YLEG EY MYNTYDVHFY+SFAL+
Sbjct: 501 ------------SLPEELGRNMCHLRPTLRDYGRFGYLEGQEYRMYNTYDVHFYASFALI 548
Query: 520 MLFPKLELSIQRDFAAAVMMHDPGTMKIMSDGKWVARKCLGAVPHDIGL--DDPWFEINS 577
ML+PKLELS+Q D A A + D + + G K +PHDIG D+PW +N+
Sbjct: 549 MLWPKLELSLQYDMALATLREDLTRRRYLMSGVMAPVKRRNVIPHDIGDPDDEPWLRVNA 608
Query: 578 YNLFNSSRWKDLNSKFVLQVYRDFVATGDKNFARAVWPSVYIAMAYMEQFDKDGDGMIEN 637
Y + +++ WKDLN KFVLQVYRD+ TGD+NF + +WP M +FDKD DG+IEN
Sbjct: 609 YLIHDTADWKDLNLKFVLQVYRDYYLTGDQNFLKDMWPVCLAVMESEMKFDKDHDGLIEN 668
Query: 638 EGFPDQTYDAWSANGVSAYCGGLWVAALQAASALANDVGDHASASYFWVRYQKAKAVYDS 697
G+ DQTYD W G SAYCGGLW+AA+ +A G F + + Y+
Sbjct: 669 GGYADQTYDGWVTTGPSAYCGGLWLAAVAVMVQMAALCGAQDIQDKFSSILSRGQEAYER 728
Query: 698 -LWNGSYFNYDNSDGSSSTSIQADQLAGQWYARACGL----LPIADEAKVKKALTKIYDF 752
LWNG Y+NYD+S S S+ +DQ AGQW+ +ACGL + V +AL I++
Sbjct: 729 LLWNGRYYNYDSSSRPQSRSVMSDQCAGQWFLKACGLGEGDTEVFPTPHVVRALQTIFEL 788
Query: 753 NVLKVKGGMCGAMNGMQPDGRIDMSGLQAREIWPGVTYGLAASMIQEEMVDMAFQTAAGV 812
NV GG GA+NGMQP G D S +Q+ E+W GV YGLAA+MIQE + FQTA G
Sbjct: 789 NVQAFAGGAMGAVNGMQPHGVPDKSSVQSDEVWVGVVYGLAATMIQEGLTWEGFQTAEGC 848
Query: 813 YEVAWSEDGLGYSFQTPESWNNNDEYRSLCYMRPLTIW 850
Y W LG +FQTPE++ +RSL YMRPL+IW
Sbjct: 849 YRTVWER--LGLAFQTPEAYCQQRVFRSLAYMRPLSIW 884
>gi|395515264|ref|XP_003761826.1| PREDICTED: non-lysosomal glucosylceramidase [Sarcophilus harrisii]
Length = 954
Score = 572 bits (1473), Expect = e-160, Method: Compositional matrix adjust.
Identities = 330/823 (40%), Positives = 460/823 (55%), Gaps = 55/823 (6%)
Query: 49 FRHLAHIGFRLYRYSKQE-EAKGKVPVFNVFRKHHITSDQGIPLGGIGAGSIGRSYRGEF 107
+HL +GFR ++ Q+ + + K P ++ + G PLGGIG G+I R +RG+F
Sbjct: 121 LKHLG-MGFRYLKWWYQKTQVEKKTPFIDMLNCLPLRQIYGCPLGGIGGGTITRGWRGQF 179
Query: 108 QRFKLFHGICDDAPVLANQFSVFVSRPNGEKFSSVLCPRSPGVPKKNTDSGIESWDWNLK 167
R++L GI V+A+QF+V + R + VL P V + SW+W L
Sbjct: 180 CRWQLNPGIYQHRTVMADQFTVCLRREGQTVYQQVLSLERPNV--------LRSWNWGLC 231
Query: 168 GENCTYHALFPRAWTVYDGEPDPELRIVCRQISPFIPHNYKESSFPASVFTFTLSNSGQT 227
G YHAL+PRAWTVY P ++ + C QI+P +PH+Y++SS P VF + + N+G
Sbjct: 232 GFFAFYHALYPRAWTVYQ-LPGQQVTLTCLQITPILPHDYQDSSLPVGVFVWNVENTGDE 290
Query: 228 SADVTLLFTWANSVAGDSGLSGHHFNSK-TMTKDG--VHGLTLHHRTANGRPPVTFAVAA 284
+V+++F+ N + G G +N + +DG V G+ LHH N P T A+AA
Sbjct: 291 DLEVSIMFSMRNGLGGGHDNLGGLWNEPFCLERDGETVQGVLLHH--PNPPNPYTMALAA 348
Query: 285 EETADVHVSECPCFLLSGNSKGITAKDMWNEIKKHGSFDHLDNDKTSPSEPGSSIGAAIA 344
TA+ V+ F G T + +W ++ G D + P++ IG A+
Sbjct: 349 RRTANTTVTHITAFDPDG-----TGQKVWQDLLLDGQLDS-PLGPSPPTQKRQGIGGAVC 402
Query: 345 ASLTIPSGSTRSVTFSLAWDCPEVKFFEK--VYHRRYTKFYGTLGDSAARIAHDAILEHA 402
A +P + F+L+WD P+V F K ++RRYT+F+G G++A + H A+ ++
Sbjct: 403 AGAHLPRRGRCQLEFALSWDMPKVLFGAKGQAHYRRYTRFFGQEGNAAPALCHYALTQYR 462
Query: 403 KWECEIEAWQRPILEDKRFPEWYPITLFNELYYLNAGGTIWTDGSPPMQSLATIRERKFS 462
WE +I AWQ P+LED+ P WY LFNELY+L GGT+W ++ +FS
Sbjct: 463 SWEEKIIAWQSPVLEDRTLPAWYKSALFNELYFLADGGTVW------LEVTEETITEEFS 516
Query: 463 LDTPSKIHTPASSDTALGTRLLENGEENIGQFLYLEGAEYVMYNTYDVHFYSSFALVMLF 522
+ +L +E G+F YLEG EY MYNTYDVHFY+SFAL+ML+
Sbjct: 517 ---------------GMLNQLRPTLQE-YGRFAYLEGQEYHMYNTYDVHFYASFALIMLW 560
Query: 523 PKLELSIQRDFAAAVMMHDPGTMKIMSDGKWVARKCLGAVPHDIGL--DDPWFEINSYNL 580
PKLELS+Q D A A + D K + G K +PHD+G D+PW IN+Y +
Sbjct: 561 PKLELSLQYDMAVATLSEDLTRRKYLMSGVIAPVKKKNVIPHDVGDPDDEPWLRINAYLI 620
Query: 581 FNSSRWKDLNSKFVLQVYRDFVATGDKNFARAVWPSVYIAMAYMEQFDKDGDGMIENEGF 640
+++ WKDLN KFVLQ+YRDF TG+++F R +WP M +FDKD DG+IEN G+
Sbjct: 621 HDTADWKDLNLKFVLQIYRDFYLTGNESFLRDMWPVCQAVMESEMKFDKDQDGLIENGGY 680
Query: 641 PDQTYDAWSANGVSAYCGGLWVAALQAASALANDVGDHASASYFWVRYQKAKAVYDS-LW 699
DQTYD W G SAYCGGLW+AA+ +A G F + + Y+ LW
Sbjct: 681 ADQTYDGWITTGPSAYCGGLWLAAVAVMVQMAAVCGAQDVQEKFLSILSRGREAYERLLW 740
Query: 700 NGSYFNYDNSDGSSSTSIQADQLAGQWYARACGL----LPIADEAKVKKALTKIYDFNVL 755
NG Y+NYD+S S I +DQ AGQW+ RACGL + A V +AL I++ NV
Sbjct: 741 NGRYYNYDSSSQPQSRIIMSDQCAGQWFLRACGLGKGDTEVFPSAHVIRALQTIFELNVQ 800
Query: 756 KVKGGMCGAMNGMQPDGRIDMSGLQAREIWPGVTYGLAASMIQEEMVDMAFQTAAGVYEV 815
GG GA+NGM PDG D S +Q+ E+W GV YGLAA+MIQE +V F TA G Y
Sbjct: 801 GFAGGAMGAVNGMHPDGVPDTSSVQSDEVWVGVVYGLAATMIQEGLVQEGFLTAEGCYRT 860
Query: 816 AWSEDGLGYSFQTPESWNNNDEYRSLCYMRPLTIWAMQWALTK 858
W LG +FQTPE++ +RSL YMRPL+IWAMQ AL +
Sbjct: 861 VWER--LGMAFQTPEAYCQRRVFRSLAYMRPLSIWAMQLALQQ 901
>gi|403306662|ref|XP_003943842.1| PREDICTED: non-lysosomal glucosylceramidase isoform 1 [Saimiri
boliviensis boliviensis]
Length = 923
Score = 572 bits (1473), Expect = e-160, Method: Compositional matrix adjust.
Identities = 327/815 (40%), Positives = 452/815 (55%), Gaps = 58/815 (7%)
Query: 52 LAHIGFRLYRY----SKQEEAKGKVPVFNVFRKHHITSDQGIPLGGIGAGSIGRSYRGEF 107
+ HIG L RY ++ + + K P ++ + G PLGGIG G+I R +RG+F
Sbjct: 112 IKHIGMGL-RYLQWWYRKTQVEKKTPFIDMINSVPLRQIYGCPLGGIGGGTITRGWRGQF 170
Query: 108 QRFKLFHGICDDAPVLANQFSVFVSRPNGEKFSSVLCPRSPGVPKKNTDSGIESWDWNLK 167
R++L G+ V+A+QF+V + R + VL P V + SW+W L
Sbjct: 171 CRWQLNPGMYQHRTVIADQFTVCLRREGQTVYQQVLSLERPNV--------LRSWNWGLC 222
Query: 168 GENCTYHALFPRAWTVYDGEPDPELRIVCRQISPFIPHNYKESSFPASVFTFTLSNSGQT 227
G YHAL+PRAWTVY P + + CRQI+P +PH+Y++SS P VF + + N G
Sbjct: 223 GYFAFYHALYPRAWTVYQ-LPGQNVTLTCRQITPILPHDYQDSSLPVGVFVWDVENEGDE 281
Query: 228 SADVTLLFTWANSVAGDSGLSGHHFNSK-TMTKDG--VHGLTLHHRTANGRPPVTFAVAA 284
+ DV+++F+ N + G G +N + +DG V GL LHH T P T A AA
Sbjct: 282 ALDVSIMFSMRNGLGGGDDAPGGLWNEPFHLERDGETVQGLLLHHPTLPN--PYTMAAAA 339
Query: 285 EETADVHVSECPCFLLSGNSKGITAKDMWNEIKKHGSFDHLDNDKTSPSEPGSSIGAAIA 344
TAD V+ F G T + +W ++ + G D +++P++ G + A+
Sbjct: 340 RVTADTMVTHITAFDPDG-----TGQQVWQDLLQDGQLDS-PAGQSTPTQKGVGVAGAVC 393
Query: 345 ASLTIPSGSTRSVTFSLAWDCPEVKFFEK--VYHRRYTKFYGTLGDSAARIAHDAILEHA 402
S + + FSLAWD P + F K V++RRYT+F+G GD+A ++H A+ +
Sbjct: 394 VSSKLRPRGQCCLEFSLAWDMPRIMFGAKGQVHYRRYTRFFGQDGDAAPALSHYALSRYT 453
Query: 403 KWECEIEAWQRPILEDKRFPEWYPITLFNELYYLNAGGTIWTDGSPPMQSLATIRERKFS 462
+WE I AWQ P+L+D+ P WY LFNELY+L GGT+W + +RE
Sbjct: 454 EWEERISAWQSPVLDDRSLPAWYKSALFNELYFLADGGTVWLE----------VREDSL- 502
Query: 463 LDTPSKIHTPASSDTALGTRLLENGEENIGQFLYLEGAEYVMYNTYDVHFYSSFALVMLF 522
P D + L + G+F YLEG EY MYNTYDVHFY+SFAL+ML+
Sbjct: 503 ---------PQELDGNMCH--LRPTLRDYGRFGYLEGQEYRMYNTYDVHFYASFALIMLW 551
Query: 523 PKLELSIQRDFAAAVMMHDPGTMKIMSDGKWVARKCLGAVPHDIGL--DDPWFEINSYNL 580
PKLELS+Q D A A + D + + G K +PHDIG D+PW +N+Y +
Sbjct: 552 PKLELSLQYDMALATLREDLTRRRYLMSGVMAPVKRRNVIPHDIGDPDDEPWLRVNAYLI 611
Query: 581 FNSSRWKDLNSKFVLQVYRDFVATGDKNFARAVWPSVYIAMAYMEQFDKDGDGMIENEGF 640
+++ WKDLN KFVLQVYRD+ TGD+NF + +WP M +FDKD DG+IEN G+
Sbjct: 612 HDTADWKDLNLKFVLQVYRDYYLTGDQNFLKDMWPVCLAVMESEMKFDKDHDGLIENGGY 671
Query: 641 PDQTYDAWSANGVSAYCGGLWVAALQAASALANDVGDHASASYFWVRYQKAKAVYDS-LW 699
DQTYD W G SAYCGGLW+AA+ +A G F + + Y+ LW
Sbjct: 672 ADQTYDGWVTTGPSAYCGGLWLAAVAVMVQMAAVCGAQDIQDKFSSILSRGQEAYERLLW 731
Query: 700 NGSYFNYDNSDGSSSTSIQADQLAGQWYARACGL----LPIADEAKVKKALTKIYDFNVL 755
NG Y+NYD+S S S+ +DQ AGQW+ +ACGL + V +AL I++ NV
Sbjct: 732 NGRYYNYDSSSQPQSRSVMSDQCAGQWFLKACGLGEGDTEVFPTPHVVRALQTIFELNVQ 791
Query: 756 KVKGGMCGAMNGMQPDGRIDMSGLQAREIWPGVTYGLAASMIQEEMVDMAFQTAAGVYEV 815
GG GA+NGMQP G D S +Q+ E+W GV YGLAA+MIQE + FQTA G Y
Sbjct: 792 AFAGGAMGAVNGMQPHGVPDKSSVQSDEVWVGVVYGLAATMIQEGLTWEGFQTAEGCYRT 851
Query: 816 AWSEDGLGYSFQTPESWNNNDEYRSLCYMRPLTIW 850
W LG +FQTPE++ +RSL YMRPL+IW
Sbjct: 852 VWER--LGLAFQTPEAYCQQRVFRSLAYMRPLSIW 884
>gi|410978559|ref|XP_003995657.1| PREDICTED: non-lysosomal glucosylceramidase [Felis catus]
Length = 917
Score = 571 bits (1472), Expect = e-160, Method: Compositional matrix adjust.
Identities = 334/815 (40%), Positives = 454/815 (55%), Gaps = 55/815 (6%)
Query: 49 FRHLAHIGFR-LYRYSKQEEAKGKVPVFNVFRKHHITSDQGIPLGGIGAGSIGRSYRGEF 107
+HL +G R L + ++ + + K P ++ + G PLGGIG G+I R +RG+F
Sbjct: 104 IKHLG-MGLRYLQWWYRKTQVEKKTPFIDLINCVPLRQIYGCPLGGIGGGTITRGWRGQF 162
Query: 108 QRFKLFHGICDDAPVLANQFSVFVSRPNGEKFSSVLCPRSPGVPKKNTDSGIESWDWNLK 167
R++L G+ V+A+QF+V + R + VL P V + SW+W L
Sbjct: 163 CRWQLNPGMYQHQTVVADQFTVCLRREGTTVYQQVLSVERPSV--------LRSWNWGLC 214
Query: 168 GENCTYHALFPRAWTVYDGEPDPELRIVCRQISPFIPHNYKESSFPASVFTFTLSNSGQT 227
G YHAL+PRAWTVY P + + CRQI+P +PH+Y++SS P VF + + N G
Sbjct: 215 GYFAFYHALYPRAWTVYQ-LPGQNVTLTCRQITPILPHDYQDSSLPVGVFVWDVENEGDE 273
Query: 228 SADVTLLFTWANSVAGDSGLSGHHFNSK-TMTKDG--VHGLTLHHRTANGRPPVTFAVAA 284
+ DV+++F+ N + G G +N + +DG V GL LHH T P T AVAA
Sbjct: 274 ALDVSIMFSMRNGLGGGDDAPGGLWNEPFCLERDGETVQGLLLHHPTLPN--PYTMAVAA 331
Query: 285 EETADVHVSECPCFLLSGNSKGITAKDMWNEIKKHGSFDHLDNDKTSPSEPGSSIGAAIA 344
TAD V+ F T + +W ++ + G D +++PS+ G I A+
Sbjct: 332 RLTADTTVTHITAF-----DPDSTGQQVWQDLLQDGQLDS-PAGRSTPSQKGVGIAGAVC 385
Query: 345 ASLTIPSGSTRSVTFSLAWDCPEVKFFEK--VYHRRYTKFYGTLGDSAARIAHDAILEHA 402
+ +P + FSLAWD P + F K VY+RRYT+F+G G++A ++H A+ +
Sbjct: 386 VTGKLPPRGQCRLEFSLAWDMPRIMFGAKGQVYYRRYTRFFGRDGNAAPTLSHYALCRYT 445
Query: 403 KWECEIEAWQRPILEDKRFPEWYPITLFNELYYLNAGGTIWTDGSPPMQSLATIRERKFS 462
WE +I AWQ P+L+D+ P WY LFNELY+L GGT+W + P SL
Sbjct: 446 DWEEKISAWQSPVLDDRSLPAWYKSALFNELYFLADGGTVWLE--VPEDSL--------- 494
Query: 463 LDTPSKIHTPASSDTALGTRLLENGEENIGQFLYLEGAEYVMYNTYDVHFYSSFALVMLF 522
P ++ S L L E G+F YLEG EY MYNTYDVHFY+SFALVML+
Sbjct: 495 ---PEEL---VGSMCQLRPILQE-----YGRFGYLEGQEYRMYNTYDVHFYASFALVMLW 543
Query: 523 PKLELSIQRDFAAAVMMHDPGTMKIMSDGKWVARKCLGAVPHDIG--LDDPWFEINSYNL 580
PKLELS+Q D A A + D + + G K +PHDIG D+PW +N+Y +
Sbjct: 544 PKLELSLQYDMALATLREDLTRRQYLMSGVMAPVKRRNVIPHDIGDPDDEPWLRVNAYVV 603
Query: 581 FNSSRWKDLNSKFVLQVYRDFVATGDKNFARAVWPSVYIAMAYMEQFDKDGDGMIENEGF 640
+++ WKDLN KFVLQVYRD+ TGD+ F R +WP M +FDKD DG+IEN G+
Sbjct: 604 HDTADWKDLNLKFVLQVYRDYYLTGDQGFLRDMWPVCLAVMESEMKFDKDQDGLIENGGY 663
Query: 641 PDQTYDAWSANGVSAYCGGLWVAALQAASALANDVGDHASASYFWVRYQKAKAVYDS-LW 699
DQTYD W G SAYCGGLW+AA+ +A G F ++ + Y+ LW
Sbjct: 664 ADQTYDGWVTTGPSAYCGGLWLAAVAVMVQMAVLCGAQDVQDKFSSILRRGQEAYERLLW 723
Query: 700 NGSYFNYDNSDGSSSTSIQADQLAGQWYARACGL----LPIADEAKVKKALTKIYDFNVL 755
NG Y+NYD S S SI +DQ AGQW+ RA GL + V AL I++FNV
Sbjct: 724 NGRYYNYDCSPPPQSCSIMSDQCAGQWFLRASGLGEGDTEVFPTPHVVCALQTIFEFNVR 783
Query: 756 KVKGGMCGAMNGMQPDGRIDMSGLQAREIWPGVTYGLAASMIQEEMVDMAFQTAAGVYEV 815
GG GA+NGMQP G D S +Q+ E+W GV YGLAA+MIQE + FQTA G Y
Sbjct: 784 AFAGGAMGAVNGMQPHGVPDRSSVQSDEVWVGVVYGLAATMIQEGLTWEGFQTAEGCYRT 843
Query: 816 AWSEDGLGYSFQTPESWNNNDEYRSLCYMRPLTIW 850
W LG +FQTPE++ +RSL YMRPL+IW
Sbjct: 844 VWER--LGLAFQTPEAYCQQRVFRSLAYMRPLSIW 876
>gi|426361725|ref|XP_004048050.1| PREDICTED: non-lysosomal glucosylceramidase isoform 1 [Gorilla
gorilla gorilla]
Length = 927
Score = 571 bits (1472), Expect = e-160, Method: Compositional matrix adjust.
Identities = 330/818 (40%), Positives = 455/818 (55%), Gaps = 64/818 (7%)
Query: 52 LAHIGFRLYRY----SKQEEAKGKVPVFNVFRKHHITSDQGIPLGGIGAGSIGRSYRGEF 107
+ HIG L RY ++ + + K P ++ + G PLGGIG G+I R +RG+F
Sbjct: 112 IKHIGMGL-RYLQWWYRKTQVEKKTPFIDMINSVPLRQIYGCPLGGIGGGTITRGWRGQF 170
Query: 108 QRFKLFHGICDDAPVLANQFSVFVSRPNGEKFSSVLCPRSPGVPKKNTDSGIESWDWNLK 167
R++L G+ V+A+QF+V + R + VL P V + +W+W L
Sbjct: 171 CRWQLNPGMYQHRTVIADQFTVCLRREGQTVYQQVLSLERPSV--------LRTWNWGLC 222
Query: 168 GENCTYHALFPRAWTVYDGEPDPELRIVCRQISPFIPHNYKESSFPASVFTFTLSNSGQT 227
G YHAL+PRAWTVY P + + CRQI+P +PH+Y++SS P VF + + N G
Sbjct: 223 GYFAFYHALYPRAWTVYQ-LPGQNVTLTCRQITPILPHDYQDSSLPVGVFVWDVENEGDE 281
Query: 228 SADVTLLFTWANSVAGDSGLSGHHFNSK-TMTKDG--VHGLTLHHRTANGRPPVTFAVAA 284
+ DV+++F+ N + G G +N + + G V GL LHH T P T AVAA
Sbjct: 282 ALDVSIMFSMRNGLGGGDDAPGGLWNEPFCLERSGETVRGLLLHHPTLPN--PYTMAVAA 339
Query: 285 EETADVHVSECPCFLLSGNSKGITAKDMWNEIKKHGSFDHLDNDKTSPSEPGSSIGAAIA 344
TAD V+ F T + +W ++ + G D +++P++ G I A+
Sbjct: 340 RVTADTTVTHITAF-----DPDSTGQQVWQDLLQDGQLDS-PTGQSTPTQKGVGIAGAVC 393
Query: 345 ASLTI-PSGSTRSVTFSLAWDCPEVKFFEK--VYHRRYTKFYGTLGDSAARIAHDAILEH 401
S + P G R + FSLAWD P + F K V++RRYT+F+G GD+A ++H A+ +
Sbjct: 394 VSSKLRPRGQCR-LEFSLAWDMPRIMFGAKGQVHYRRYTRFFGQDGDAAPALSHYALCRY 452
Query: 402 AKWECEIEAWQRPILEDKRFPEWYPITLFNELYYLNAGGTIWTDGSPPMQSLATIRERKF 461
A+WE I AWQ P+L+D+ P WY LFNELY+L GGT+W L + +
Sbjct: 453 AEWEERISAWQSPVLDDRSLPAWYKSALFNELYFLADGGTVW---------LEVLED--- 500
Query: 462 SLDTPSKIHTPASSDTALGTRL--LENGEENIGQFLYLEGAEYVMYNTYDVHFYSSFALV 519
S LG + L + G+F YLEG EY MYNTYDVHFY+SFAL+
Sbjct: 501 ------------SLPEELGRNMCHLRPTLRDYGRFGYLEGQEYRMYNTYDVHFYASFALI 548
Query: 520 MLFPKLELSIQRDFAAAVMMHDPGTMKIMSDGKWVARKCLGAVPHDIGL--DDPWFEINS 577
ML+PKLELS+Q D A A + D + + G K +PHDIG D+PW +N+
Sbjct: 549 MLWPKLELSLQYDMALATLREDLTRRRYLMSGVMAPVKRRNVIPHDIGDPDDEPWLRVNA 608
Query: 578 YNLFNSSRWKDLNSKFVLQVYRDFVATGDKNFARAVWPSVYIAMAYMEQFDKDGDGMIEN 637
Y + +++ WKDLN KFVLQVYRD+ TGD+NF + +WP M +FDKD DG+IEN
Sbjct: 609 YLIHDTADWKDLNLKFVLQVYRDYYLTGDQNFLKDMWPVCLAVMESEMKFDKDHDGLIEN 668
Query: 638 EGFPDQTYDAWSANGVSAYCGGLWVAALQAASALANDVGDHASASYFWVRYQKAKAVYDS 697
G+ DQTYD W G SAYCGGLW+AA+ +A G F + + Y+
Sbjct: 669 GGYADQTYDGWVTTGPSAYCGGLWLAAVAVMVQMAALCGAQDIQDKFSSILSRGQEAYER 728
Query: 698 -LWNGSYFNYDNSDGSSSTSIQADQLAGQWYARACGL----LPIADEAKVKKALTKIYDF 752
LWNG Y+NYD+S S S+ +DQ AGQW+ +ACGL + V +AL I++
Sbjct: 729 LLWNGRYYNYDSSSRPQSRSVMSDQCAGQWFLKACGLGEGDTEVFPTQHVVRALQTIFEL 788
Query: 753 NVLKVKGGMCGAMNGMQPDGRIDMSGLQAREIWPGVTYGLAASMIQEEMVDMAFQTAAGV 812
NV GG GA+NGMQP G D S +Q+ E+W GV YGLAA+MIQE + FQTA G
Sbjct: 789 NVQAFAGGAMGAVNGMQPHGVPDKSSVQSDEVWVGVVYGLAATMIQEGLTWEGFQTAEGC 848
Query: 813 YEVAWSEDGLGYSFQTPESWNNNDEYRSLCYMRPLTIW 850
Y W LG +FQTPE++ +RSL YMRPL+IW
Sbjct: 849 YRTVWER--LGLAFQTPEAYCQQRVFRSLAYMRPLSIW 884
>gi|395855828|ref|XP_003800351.1| PREDICTED: LOW QUALITY PROTEIN: non-lysosomal glucosylceramidase
[Otolemur garnettii]
Length = 1147
Score = 570 bits (1468), Expect = e-159, Method: Compositional matrix adjust.
Identities = 328/821 (39%), Positives = 456/821 (55%), Gaps = 61/821 (7%)
Query: 49 FRHLAHIGFR-LYRYSKQEEAKGKVPVFNVFRKHHITSDQGIPLGGIGAGSIGRSYRGEF 107
+HL +G R L + ++ + + K P ++ + G PLGGIG G+I R +RG+F
Sbjct: 326 IKHLG-MGLRYLQWWYRKTQVEKKAPFIDMINCVPLRQIYGCPLGGIGGGTITRGWRGQF 384
Query: 108 QRFKLFHG------ICDDAPVLANQFSVFVSRPNGEKFSSVLCPRSPGVPKKNTDSGIES 161
R++L G + D P+ +F+V + R + VL P V + S
Sbjct: 385 CRWQLNPGMYQHRTVIADQPISPTKFTVCLRRKGQTVYQQVLSLECPSV--------LRS 436
Query: 162 WDWNLKGENCTYHALFPRAWTVYDGEPDPELRIVCRQISPFIPHNYKESSFPASVFTFTL 221
W+W L G YHAL+PRAWTVY P + + CRQI+P +PH+Y++SS P VF + +
Sbjct: 437 WNWGLCGYFAFYHALYPRAWTVYQ-LPGQNVTLTCRQITPILPHDYQDSSLPVGVFVWDV 495
Query: 222 SNSGQTSADVTLLFTWANSVAGDSGLSGHHFNSK-TMTKDG--VHGLTLHHRTANGRPPV 278
N G + DV+++F+ N + G G +N + +DG V GL LHH T P
Sbjct: 496 ENEGDEALDVSIMFSMRNGLGGGDDAPGGLWNEPFCLERDGETVQGLLLHHPTLPN--PY 553
Query: 279 TFAVAAEETADVHVSECPCFLLSGNSKGITAKDMWNEIKKHGSFDHLDNDKTSPSEPGSS 338
T AVAA TAD V+ F T + +W ++ + G D +++PS+ G
Sbjct: 554 TMAVAARCTADTTVTHITAF-----DPDSTGQQVWQDLLQDGQLDS-PAGQSAPSQKGVG 607
Query: 339 IGAAIAASLTIPSGSTRSVTFSLAWDCPEVKFFEK--VYHRRYTKFYGTLGDSAARIAHD 396
+ A+ S +P + FSLAWD P++ F K V++RRYT+F+G G++A ++H
Sbjct: 608 VAGAVCVSGKLPPQGQCRLEFSLAWDMPKIMFGAKGQVHYRRYTRFFGQDGNAAPALSHY 667
Query: 397 AILEHAKWECEIEAWQRPILEDKRFPEWYPITLFNELYYLNAGGTIWTDGSPPMQSLATI 456
A+ +A WE I AWQ P+L+D+ P WY LFNELY+L GGT+W + +
Sbjct: 668 ALCRYADWEERISAWQSPVLDDRSLPAWYKSALFNELYFLADGGTVWLE----------V 717
Query: 457 RERKFSLDTPSKIHTPASSDTALGTRLLENGEENIGQFLYLEGAEYVMYNTYDVHFYSSF 516
RE S + G L + G+F YLEG EY MYNTYDVHFY+SF
Sbjct: 718 REDSIS------------EELGGGMCQLHPMLRDYGRFGYLEGQEYRMYNTYDVHFYASF 765
Query: 517 ALVMLFPKLELSIQRDFAAAVMMHDPGTMKIMSDGKWVARKCLGAVPHDIGL--DDPWFE 574
AL+ML+PKLELS+Q D A A + D + + +G K +PHDIG D+PW
Sbjct: 766 ALIMLWPKLELSLQYDMALATLREDLTRRRYLMNGVVAPVKRRNVIPHDIGDPDDEPWLR 825
Query: 575 INSYNLFNSSRWKDLNSKFVLQVYRDFVATGDKNFARAVWPSVYIAMAYMEQFDKDGDGM 634
+N+Y + +++ WKDLN KFVLQVYRD+ TGD+ F + +WP M +FDKD DG+
Sbjct: 826 VNAYLIHDTADWKDLNLKFVLQVYRDYYLTGDQGFLKDMWPVCLAVMESEMKFDKDHDGL 885
Query: 635 IENEGFPDQTYDAWSANGVSAYCGGLWVAALQAASALANDVGDHASASYFWVRYQKAKAV 694
IEN G+ DQTYDAW G SAYCGGLW+AA+ +A G F ++ +
Sbjct: 886 IENGGYADQTYDAWVTTGPSAYCGGLWLAAVAVMVQMAALCGAQDLQDKFSSILRQGQEA 945
Query: 695 YDS-LWNGSYFNYDNSDGSSSTSIQADQLAGQWYARACGL----LPIADEAKVKKALTKI 749
Y+ LWNG Y+NYD+S S SI +DQ AGQW+ +ACGL + V +AL I
Sbjct: 946 YERLLWNGRYYNYDSSSQPQSRSIMSDQCAGQWFLKACGLGEGDTEVFPTQHVVRALQTI 1005
Query: 750 YDFNVLKVKGGMCGAMNGMQPDGRIDMSGLQAREIWPGVTYGLAASMIQEEMVDMAFQTA 809
Y+ NV GG GA+NGMQP G D S +Q+ E+W GV YGLAA+MIQE + F+TA
Sbjct: 1006 YELNVQAFAGGAMGAVNGMQPHGVPDRSSVQSDEVWVGVVYGLAATMIQEGLTWEGFRTA 1065
Query: 810 AGVYEVAWSEDGLGYSFQTPESWNNNDEYRSLCYMRPLTIW 850
G Y W LG +FQTPE++ +RSL YMRPL+IW
Sbjct: 1066 EGCYRTVWER--LGLAFQTPEAYCQQRVFRSLAYMRPLSIW 1104
>gi|301767028|ref|XP_002918945.1| PREDICTED: non-lysosomal glucosylceramidase-like, partial
[Ailuropoda melanoleuca]
Length = 928
Score = 568 bits (1464), Expect = e-159, Method: Compositional matrix adjust.
Identities = 334/815 (40%), Positives = 454/815 (55%), Gaps = 55/815 (6%)
Query: 49 FRHLAHIGFR-LYRYSKQEEAKGKVPVFNVFRKHHITSDQGIPLGGIGAGSIGRSYRGEF 107
+HL +G R L + ++ + + K P ++ + G PLGGIG G+I R +RG+F
Sbjct: 115 IKHLG-MGLRYLQWWYRKTQVEKKTPFIDLINCVPLRQIYGCPLGGIGGGTITRGWRGQF 173
Query: 108 QRFKLFHGICDDAPVLANQFSVFVSRPNGEKFSSVLCPRSPGVPKKNTDSGIESWDWNLK 167
R++L G+ V+A+QF+V + R + VL P V + SW+W L
Sbjct: 174 CRWQLNPGMYQHRTVIADQFTVCLRREGKTVYQQVLSVERPSV--------LRSWNWGLC 225
Query: 168 GENCTYHALFPRAWTVYDGEPDPELRIVCRQISPFIPHNYKESSFPASVFTFTLSNSGQT 227
G YHAL+PRAWTVY P + + CRQI+P +PH+Y++SS P VF + + N G
Sbjct: 226 GYFAFYHALYPRAWTVYQ-LPGQNVTLTCRQITPILPHDYQDSSLPVGVFVWDVENEGDE 284
Query: 228 SADVTLLFTWANSVA-GDSGLSGHHFNSKTMTKDG--VHGLTLHHRTANGRPPVTFAVAA 284
+ DV+++F+ N + GD G + +DG V GL LHH T P T AVAA
Sbjct: 285 ALDVSIMFSMRNGLGVGDDAPGGLWNEPFCLERDGETVQGLLLHHPTLPN--PYTMAVAA 342
Query: 285 EETADVHVSECPCFLLSGNSKGITAKDMWNEIKKHGSFDHLDNDKTSPSEPGSSIGAAIA 344
TAD V+ F T + +W ++ + G D +++PS+ G I A+
Sbjct: 343 RLTADTTVTHITAF-----DPDSTGQQVWQDLLQDGQLDS-PAGQSTPSQKGVGIAGAVC 396
Query: 345 ASLTIPSGSTRSVTFSLAWDCPEVKFFEK--VYHRRYTKFYGTLGDSAARIAHDAILEHA 402
+ +P + FSLAWD P + F K VY+RRYT+F+G G++A ++H A+ +
Sbjct: 397 VTGKLPPRGQCRLEFSLAWDMPRIMFGAKGQVYYRRYTRFFGRDGNAAPALSHYALCRYV 456
Query: 403 KWECEIEAWQRPILEDKRFPEWYPITLFNELYYLNAGGTIWTDGSPPMQSLATIRERKFS 462
WE +I AWQ PIL+D+ P WY LFNELY+L GGT+W + P SL
Sbjct: 457 DWEEKISAWQSPILDDRSLPAWYKSALFNELYFLADGGTVWLE--VPEDSL--------- 505
Query: 463 LDTPSKIHTPASSDTALGTRLLENGEENIGQFLYLEGAEYVMYNTYDVHFYSSFALVMLF 522
P ++ S L L E G+F YLEG EY MYNTYDVHFY+SFALVML+
Sbjct: 506 ---PEEL---VGSMCQLRPILRE-----YGRFGYLEGQEYRMYNTYDVHFYASFALVMLW 554
Query: 523 PKLELSIQRDFAAAVMMHDPGTMKIMSDGKWVARKCLGAVPHDIG--LDDPWFEINSYNL 580
PKLELS+Q D A A + D + + G K +PHDIG D+PW +N+Y +
Sbjct: 555 PKLELSLQYDMALATLREDLTRRQYLMSGVTAPVKRRNVIPHDIGDPDDEPWLRVNAYVV 614
Query: 581 FNSSRWKDLNSKFVLQVYRDFVATGDKNFARAVWPSVYIAMAYMEQFDKDGDGMIENEGF 640
+++ WKDLN KFVLQVYRD+ TGD++F R +WP M +FDKD DG+IEN G+
Sbjct: 615 HDTADWKDLNLKFVLQVYRDYYLTGDQSFLRDMWPVCLAVMESEMKFDKDQDGLIENGGY 674
Query: 641 PDQTYDAWSANGVSAYCGGLWVAALQAASALANDVGDHASASYFWVRYQKAKAVYDS-LW 699
DQTYD W G SAYCGGLW+AA+ +A G F ++ + Y+ LW
Sbjct: 675 ADQTYDGWITTGPSAYCGGLWLAAVAVMVQIAVLCGAQDVQDKFSSILRRGREAYERLLW 734
Query: 700 NGSYFNYDNSDGSSSTSIQADQLAGQWYARACGL----LPIADEAKVKKALTKIYDFNVL 755
NG Y+NYD S S SI +DQ AGQW+ +A GL + V AL I++FNV
Sbjct: 735 NGRYYNYDCSPQPQSCSIMSDQCAGQWFLKASGLGEGDTEVFPTPHVVCALQTIFEFNVQ 794
Query: 756 KVKGGMCGAMNGMQPDGRIDMSGLQAREIWPGVTYGLAASMIQEEMVDMAFQTAAGVYEV 815
GG GA+NGMQP G D S +Q+ E+W GV YGLAA+MIQE + FQTA G Y
Sbjct: 795 AFAGGAMGAVNGMQPHGVPDRSSVQSDEVWVGVVYGLAATMIQEGLTWEGFQTAEGCYRT 854
Query: 816 AWSEDGLGYSFQTPESWNNNDEYRSLCYMRPLTIW 850
W LG +FQTPE++ +RSL YMRPL+IW
Sbjct: 855 VWER--LGLAFQTPEAYCQQRVFRSLAYMRPLSIW 887
>gi|281339788|gb|EFB15372.1| hypothetical protein PANDA_007478 [Ailuropoda melanoleuca]
Length = 923
Score = 567 bits (1462), Expect = e-159, Method: Compositional matrix adjust.
Identities = 334/815 (40%), Positives = 454/815 (55%), Gaps = 55/815 (6%)
Query: 49 FRHLAHIGFR-LYRYSKQEEAKGKVPVFNVFRKHHITSDQGIPLGGIGAGSIGRSYRGEF 107
+HL +G R L + ++ + + K P ++ + G PLGGIG G+I R +RG+F
Sbjct: 112 IKHLG-MGLRYLQWWYRKTQVEKKTPFIDLINCVPLRQIYGCPLGGIGGGTITRGWRGQF 170
Query: 108 QRFKLFHGICDDAPVLANQFSVFVSRPNGEKFSSVLCPRSPGVPKKNTDSGIESWDWNLK 167
R++L G+ V+A+QF+V + R + VL P V + SW+W L
Sbjct: 171 CRWQLNPGMYQHRTVIADQFTVCLRREGKTVYQQVLSVERPSV--------LRSWNWGLC 222
Query: 168 GENCTYHALFPRAWTVYDGEPDPELRIVCRQISPFIPHNYKESSFPASVFTFTLSNSGQT 227
G YHAL+PRAWTVY P + + CRQI+P +PH+Y++SS P VF + + N G
Sbjct: 223 GYFAFYHALYPRAWTVYQ-LPGQNVTLTCRQITPILPHDYQDSSLPVGVFVWDVENEGDE 281
Query: 228 SADVTLLFTWANSVA-GDSGLSGHHFNSKTMTKDG--VHGLTLHHRTANGRPPVTFAVAA 284
+ DV+++F+ N + GD G + +DG V GL LHH T P T AVAA
Sbjct: 282 ALDVSIMFSMRNGLGVGDDAPGGLWNEPFCLERDGETVQGLLLHHPTLPN--PYTMAVAA 339
Query: 285 EETADVHVSECPCFLLSGNSKGITAKDMWNEIKKHGSFDHLDNDKTSPSEPGSSIGAAIA 344
TAD V+ F T + +W ++ + G D +++PS+ G I A+
Sbjct: 340 RLTADTTVTHITAF-----DPDSTGQQVWQDLLQDGQLDS-PAGQSTPSQKGVGIAGAVC 393
Query: 345 ASLTIPSGSTRSVTFSLAWDCPEVKFFEK--VYHRRYTKFYGTLGDSAARIAHDAILEHA 402
+ +P + FSLAWD P + F K VY+RRYT+F+G G++A ++H A+ +
Sbjct: 394 VTGKLPPRGQCRLEFSLAWDMPRIMFGAKGQVYYRRYTRFFGRDGNAAPALSHYALCRYV 453
Query: 403 KWECEIEAWQRPILEDKRFPEWYPITLFNELYYLNAGGTIWTDGSPPMQSLATIRERKFS 462
WE +I AWQ PIL+D+ P WY LFNELY+L GGT+W + P SL
Sbjct: 454 DWEEKISAWQSPILDDRSLPAWYKSALFNELYFLADGGTVWLE--VPEDSL--------- 502
Query: 463 LDTPSKIHTPASSDTALGTRLLENGEENIGQFLYLEGAEYVMYNTYDVHFYSSFALVMLF 522
P ++ S L L E G+F YLEG EY MYNTYDVHFY+SFALVML+
Sbjct: 503 ---PEEL---VGSMCQLRPILRE-----YGRFGYLEGQEYRMYNTYDVHFYASFALVMLW 551
Query: 523 PKLELSIQRDFAAAVMMHDPGTMKIMSDGKWVARKCLGAVPHDIG--LDDPWFEINSYNL 580
PKLELS+Q D A A + D + + G K +PHDIG D+PW +N+Y +
Sbjct: 552 PKLELSLQYDMALATLREDLTRRQYLMSGVTAPVKRRNVIPHDIGDPDDEPWLRVNAYVV 611
Query: 581 FNSSRWKDLNSKFVLQVYRDFVATGDKNFARAVWPSVYIAMAYMEQFDKDGDGMIENEGF 640
+++ WKDLN KFVLQVYRD+ TGD++F R +WP M +FDKD DG+IEN G+
Sbjct: 612 HDTADWKDLNLKFVLQVYRDYYLTGDQSFLRDMWPVCLAVMESEMKFDKDQDGLIENGGY 671
Query: 641 PDQTYDAWSANGVSAYCGGLWVAALQAASALANDVGDHASASYFWVRYQKAKAVYDS-LW 699
DQTYD W G SAYCGGLW+AA+ +A G F ++ + Y+ LW
Sbjct: 672 ADQTYDGWITTGPSAYCGGLWLAAVAVMVQIAVLCGAQDVQDKFSSILRRGREAYERLLW 731
Query: 700 NGSYFNYDNSDGSSSTSIQADQLAGQWYARACGL----LPIADEAKVKKALTKIYDFNVL 755
NG Y+NYD S S SI +DQ AGQW+ +A GL + V AL I++FNV
Sbjct: 732 NGRYYNYDCSPQPQSCSIMSDQCAGQWFLKASGLGEGDTEVFPTPHVVCALQTIFEFNVQ 791
Query: 756 KVKGGMCGAMNGMQPDGRIDMSGLQAREIWPGVTYGLAASMIQEEMVDMAFQTAAGVYEV 815
GG GA+NGMQP G D S +Q+ E+W GV YGLAA+MIQE + FQTA G Y
Sbjct: 792 AFAGGAMGAVNGMQPHGVPDRSSVQSDEVWVGVVYGLAATMIQEGLTWEGFQTAEGCYRT 851
Query: 816 AWSEDGLGYSFQTPESWNNNDEYRSLCYMRPLTIW 850
W LG +FQTPE++ +RSL YMRPL+IW
Sbjct: 852 VWER--LGLAFQTPEAYCQQRVFRSLAYMRPLSIW 884
>gi|196002317|ref|XP_002111026.1| hypothetical protein TRIADDRAFT_54543 [Trichoplax adhaerens]
gi|190586977|gb|EDV27030.1| hypothetical protein TRIADDRAFT_54543 [Trichoplax adhaerens]
Length = 843
Score = 567 bits (1461), Expect = e-158, Method: Compositional matrix adjust.
Identities = 324/833 (38%), Positives = 462/833 (55%), Gaps = 67/833 (8%)
Query: 55 IGFRLYRYSKQEEAKGKVPVFNVFRKHHITSDQGIPLGGIGAGSIGRSYRGEFQRFKLFH 114
+G R Y + P ++F K + G+P+GG+G G+I R + G+F R++L
Sbjct: 36 MGLRYGMYILHNRFHSREPFIDLFGKRQVDRLYGVPIGGLGCGTIMRGWNGQFCRWQLRP 95
Query: 115 GICDDAPVLANQFSVFVSRPNGE-KFSSVLCPRSPGVPKKNTDSGIESWDWNLKGENCTY 173
G V A+QF+V + R NGE + VL P P + + + W W G TY
Sbjct: 96 GWYTYDIVHADQFTVSI-RKNGEPSYQQVLSPCRP------SSAQLGDWFWGYHGSQATY 148
Query: 174 HALFPRAWTVYDGEPDPELRIVCRQISPFIPHNYKESSFPASVFTFTLSNSGQTSADVTL 233
AL+PRAWT Y+ P + + CRQISP IPHNYK+S P F + + N +V++
Sbjct: 149 RALYPRAWTTYN-LPGQNVTLTCRQISPIIPHNYKDSCLPVGCFIWEIDNQSDDEIEVSI 207
Query: 234 LFTWANSVAGDSGLSGHHFN-------SKTMTKDGVHGLTLHHRTANGRPPVTFAVAAEE 286
+FT+ N + +G H N ++ M + G+ LHHR + P TF + A+E
Sbjct: 208 MFTFQNGDGSEHDSAGGHQNETFTRSSNENMNNSKISGVLLHHRAS--LQPCTFTICAKE 265
Query: 287 TAD------VHVSECPCFLLSGNSKGITAKDMWNEIKKHGSFDHLDNDKTSPS---EPGS 337
+ V VS F G+ + I+ ++ HG ++ N K + S + G
Sbjct: 266 CPETGDYPAVSVSNSVSFDPKGSGREIS-----RQLLSHGKLENSSNTKATQSPVSKKGE 320
Query: 338 SIGAAIAASLTIPSGSTRSVTFSLAWDCPEVKFFEK--VYHRRYTKFYGTLGD-SAARIA 394
IGAA+A + + + F+L WD PEV F K Y RRYT+F+ G AA +A
Sbjct: 321 IIGAAVATTAIVKPQGKAKMEFALTWDQPEVYFDAKEIKYRRRYTRFFNPNGSHRAADLA 380
Query: 395 HDAILEHAKWECEIEAWQRPILEDKRFPEWYPITLFNELYYLNAGGTIW--TDGSPPMQS 452
A+ WE +I+ WQ PILE++ +P+WY +FNELYY++ G+IW TDG
Sbjct: 381 VYALQNVKSWEEKIDQWQSPILENESYPDWYKSAIFNELYYMSDSGSIWAETDGDE---- 436
Query: 453 LATIRERKFSLDTPSKIHTPASSDTALGTRLLENGEENIGQFLYLEGAEYVMYNTYDVHF 512
D + + + G ++ + G+F YLEG EY+MYNTYDVHF
Sbjct: 437 -----------DDETNLASCLKEVPGKGKQI----HKEYGRFAYLEGHEYLMYNTYDVHF 481
Query: 513 YSSFALVMLFPKLELSIQRDFAAAVMMHDPGTMKIMSDGKWVARKCLGAVPHDIG--LDD 570
Y+SFAL L+P +ELSIQ DFAA++M DP IM G RK G VPHDIG D+
Sbjct: 482 YASFALAQLWPMIELSIQYDFAASIMHEDPELRLIMWTGLRKPRKSYGFVPHDIGEPCDE 541
Query: 571 PWFEINSYNLFNSSRWKDLNSKFVLQVYRDFVATGDKNFARAVWPSVYIAMAYMEQFDKD 630
PW IN+Y + + W DLN+KFVLQ+YRD+ ATG+ F + +WP M D D
Sbjct: 542 PWKRINAYCISDVQDWTDLNTKFVLQIYRDYCATGNLEFLKDMWPCAKHVMETAISQDTD 601
Query: 631 GDGMIENEGFPDQTYDAWSANGVSAYCGGLWVAALQAASALANDVG-DHASASYFWVRYQ 689
GDG+I+N G DQTYD W G SAYCGG+++AAL +A+ +G A Y + +
Sbjct: 602 GDGIIDNNG-ADQTYDVWRMYGASAYCGGIFLAALYIMYKIADLIGCKEDKAKYSKILLR 660
Query: 690 KAKAVYDSLWNGSYFNYDNSDGSSSTSIQADQLAGQWYARACGLL-----PIADEAKVKK 744
+ LWNG Y+ YD+S G+ S SI ADQ+ G WY +ACGL+ I +++V K
Sbjct: 661 GRDSFQQKLWNGDYYLYDSSKGNHSDSIMADQMCGHWYLQACGLVNNNEDDIFPKSQVDK 720
Query: 745 ALTKIYDFNVLKVKGGMCGAMNGMQPDGRIDMSGLQAREIWPGVTYGLAASMIQEEMVDM 804
AL K++D+NV+K G GA+NGM+P+G ID S +Q+ E+W GVTY LAA++IQ+ M+
Sbjct: 721 ALRKVFDYNVMKFMNGTWGAVNGMKPNGEIDTSSVQSEEVWTGVTYALAANLIQQGMIKE 780
Query: 805 AFQTAAGVYEVAWSEDGLGYSFQTPESWNNNDEYRSLCYMRPLTIWAMQWALT 857
F+ A G+Y+ ++ GLG FQTPE++ + YR YMRPL+IWA++WA
Sbjct: 781 GFKVAEGMYDTTFNVMGLG--FQTPEAYYKINRYRCTGYMRPLSIWAIEWAFN 831
>gi|395740303|ref|XP_003777396.1| PREDICTED: non-lysosomal glucosylceramidase isoform 2 [Pongo
abelii]
Length = 933
Score = 566 bits (1460), Expect = e-158, Method: Compositional matrix adjust.
Identities = 331/824 (40%), Positives = 453/824 (54%), Gaps = 70/824 (8%)
Query: 52 LAHIGFRLYRY----SKQEEAKGKVPVFNVFRKHHITSDQGIPLGGIGAGSIGRSYRGEF 107
+ HIG L RY ++ + + K P ++ + G PLGGIG G+I R +RG+F
Sbjct: 112 IKHIGMGL-RYLQWWYQKTQVEKKTPFIDMINSVPLRQIYGCPLGGIGGGTITRGWRGQF 170
Query: 108 QRFKLFHG------ICDDAPVLANQFSVFVSRPNGEKFSSVLCPRSPGVPKKNTDSGIES 161
R++L G + D PV +F+V + R + VL P V + S
Sbjct: 171 CRWQLNPGMYQHRTVIADQPVCPLKFTVCLRREGQTVYQQVLSLERPTV--------LRS 222
Query: 162 WDWNLKGENCTYHALFPRAWTVYDGEPDPELRIVCRQISPFIPHNYKESSFPASVFTFTL 221
W+W L G YHAL+PRAWTVY P + + CRQI+P +PH+Y++SS P VF + +
Sbjct: 223 WNWGLCGYFAFYHALYPRAWTVYQ-LPGQNVTLTCRQITPILPHDYQDSSLPVGVFVWDV 281
Query: 222 SNSGQTSADVTLLFTWANSVAGDSGLSGHHFNSKTMTKDG---VHGLTLHHRTANGRPPV 278
N G + DV+++F+ N + G G +N + G V GL LHH T P
Sbjct: 282 ENEGDEALDVSIMFSMRNGLGGGDDAPGGLWNEPFCLERGRETVQGLLLHHPTLPN--PY 339
Query: 279 TFAVAAEETADVHVSECPCFLLSGNSKGITAKDMWNEIKKHGSFDHLDNDKTSPSEPGSS 338
T AVAA TAD V+ F T + +W ++ + G D +++P++ G
Sbjct: 340 TMAVAARVTADTTVTHITAF-----DPDSTGQQVWQDLLQDGQLDS-PTGQSTPTQKGVG 393
Query: 339 IGAAIAASLTI-PSGSTRSVTFSLAWDCPEVKFFEK--VYHRRYTKFYGTLGDSAARIAH 395
I A+ S + P G R + FSLAWD P + F K V++RRYT+F+G GD+A ++H
Sbjct: 394 IAGAVCISSKLRPRGQCR-LEFSLAWDMPRIMFGAKGQVHYRRYTRFFGQDGDAAPALSH 452
Query: 396 DAILEHAKWECEIEAWQRPILEDKRFPEWYPITLFNELYYLNAGGTIWTDGSPPMQSLAT 455
A+ +A+WE I AWQ P+L+D+ P WY LFNELY+L GGT+W L
Sbjct: 453 YALCRYAEWEERISAWQSPVLDDRSLPAWYKSALFNELYFLADGGTVW---------LEV 503
Query: 456 IRERKFSLDTPSKIHTPASSDTALGTRL--LENGEENIGQFLYLEGAEYVMYNTYDVHFY 513
+ + S LG + L + G+F YLEG EY MYNTYDVHFY
Sbjct: 504 LED---------------SLPEELGRNMCHLRPTLRDYGRFGYLEGQEYRMYNTYDVHFY 548
Query: 514 SSFALVMLFPKLELSIQRDFAAAVMMHDPGTMKIMSDGKWVARKCLGAVPHDIGL--DDP 571
+SFAL+ML+PKLELS+Q D A A + D + + G K +PHDIG D+P
Sbjct: 549 ASFALIMLWPKLELSLQYDMALATLREDLTRRRYLMSGVMAPVKRRNVIPHDIGDPDDEP 608
Query: 572 WFEINSYNLFNSSRWKDLNSKFVLQVYRDFVATGDKNFARAVWPSVYIAMAYMEQFDKDG 631
W +N+Y + +++ WKDLN KFVLQVYRD+ TGD+NF + +WP M +FDKD
Sbjct: 609 WLRVNAYLIHDTADWKDLNLKFVLQVYRDYYLTGDQNFLKDMWPVCLAVMESEMKFDKDH 668
Query: 632 DGMIENEGFPDQTYDAWSANGVSAYCGGLWVAALQAASALANDVGDHASASYFWVRYQKA 691
DG+IEN G+ DQTYD W G SAYCGGLW+AA+ +A G F +
Sbjct: 669 DGLIENGGYADQTYDGWVTTGPSAYCGGLWLAAVAVMVQMAALCGAQDIQDKFSSILSRG 728
Query: 692 KAVYDS-LWNGSYFNYDNSDGSSSTSIQADQLAGQWYARACGL----LPIADEAKVKKAL 746
+ Y+ LWNG Y+NYD+S S S+ +DQ AGQW+ +ACGL + V +AL
Sbjct: 729 QEAYERLLWNGRYYNYDSSSRPQSRSVMSDQCAGQWFLKACGLGEGDTEVFPTPHVVRAL 788
Query: 747 TKIYDFNVLKVKGGMCGAMNGMQPDGRIDMSGLQAREIWPGVTYGLAASMIQEEMVDMAF 806
I++ NV GG GA+NGMQP G D S +Q+ E+W GV YGLAA+MIQE + F
Sbjct: 789 QTIFELNVQAFAGGAMGAVNGMQPHGVPDKSSVQSDEVWVGVVYGLAATMIQEGLTWEGF 848
Query: 807 QTAAGVYEVAWSEDGLGYSFQTPESWNNNDEYRSLCYMRPLTIW 850
QTA G Y W LG +FQTPE++ +RSL YMRPL+IW
Sbjct: 849 QTAEGCYRTVWER--LGLAFQTPEAYCQQRVFRSLAYMRPLSIW 890
>gi|338720173|ref|XP_001497796.3| PREDICTED: non-lysosomal glucosylceramidase [Equus caballus]
Length = 983
Score = 566 bits (1459), Expect = e-158, Method: Compositional matrix adjust.
Identities = 335/816 (41%), Positives = 451/816 (55%), Gaps = 57/816 (6%)
Query: 49 FRHLAHIGFRLYRYSKQE-EAKGKVPVFNVFRKHHITSDQGIPLGGIGAGSIGRSYRGEF 107
+HL +G R ++ Q+ + + K P + + G PLGGIG G+I R +RG+F
Sbjct: 168 IKHLG-MGMRYLQWWYQKTQVEKKTPFIDFMNCVPLRQIYGCPLGGIGGGTITRGWRGQF 226
Query: 108 QRFKLFHGICDDAPVLANQFSVFVSRPNGEKFSSVLCPRSPGVPKKNTDSGIESWDWNLK 167
R++L G+ V+A+QF+V + R + VL P V + SW+W L
Sbjct: 227 CRWQLNPGMYQHRTVIADQFTVCLRREGQTVYQQVLSVERPRV--------LRSWNWGLC 278
Query: 168 GENCTYHALFPRAWTVYDGEPDPELRIVCRQISPFIPHNYKESSFPASVFTFTLSNSGQT 227
G YHAL+PRAWTVY P + + CRQI+P +PH+Y++SS P VF + + N G
Sbjct: 279 GYFAFYHALYPRAWTVYQ-LPGQNVTLTCRQITPILPHDYQDSSLPVGVFVWDVENEGDE 337
Query: 228 SADVTLLFTWANSVAGDSGLSGHHFNSK-TMTKDG--VHGLTLHHRTANGRPPVTFAVAA 284
+ DV+++F+ N + G G +N + +DG GL LHH T P T AVAA
Sbjct: 338 ALDVSIMFSMRNGLGGGDDAPGGLWNEPFCLEQDGETTQGLLLHHPTLPN--PYTMAVAA 395
Query: 285 EETADVHVSECPCFLLSGNSKGITAKDMWNEIKKHGSFDHLDNDKTSPSEPGSSI-GAAI 343
TAD V+ F T + +W ++ + G D T P++ G I GA
Sbjct: 396 RLTADTTVTHITAF-----DPDSTGQQVWQDLLQDGQLDSPAGPST-PTQKGVGIAGAVC 449
Query: 344 AASLTIPSGSTRSVTFSLAWDCPEVKFFEKV--YHRRYTKFYGTLGDSAARIAHDAILEH 401
A P G R + FSLAWD P + F K Y+RRYT+F+G GD+A ++H A+ +
Sbjct: 450 VAGKLSPRGRCR-LEFSLAWDMPRIMFGAKGQDYYRRYTRFFGPDGDAAPALSHYALCRY 508
Query: 402 AKWECEIEAWQRPILEDKRFPEWYPITLFNELYYLNAGGTIWTDGSPPMQSLATIRERKF 461
A WE I AWQ PIL+D+ P WY LFNELY+L GGT+W + P SL
Sbjct: 509 ADWEKRISAWQSPILDDRSLPAWYKSALFNELYFLADGGTVWLE--VPEDSL-------- 558
Query: 462 SLDTPSKIHTPASSDTALGTRLLENGEENIGQFLYLEGAEYVMYNTYDVHFYSSFALVML 521
P ++ A S L L E G+F YLEG EY MYNTYDVHFY+SFAL+ML
Sbjct: 559 ----PEEL---AGSMCQLRPILQE-----YGRFGYLEGQEYRMYNTYDVHFYASFALIML 606
Query: 522 FPKLELSIQRDFAAAVMMHDPGTMKIMSDGKWVARKCLGAVPHDIG--LDDPWFEINSYN 579
+PKLELS+Q D A A + D + + G K +PHDIG D+PW +N+Y
Sbjct: 607 WPKLELSLQYDMALATLGEDLTRRRYLMSGVMAPVKKRNVIPHDIGDPDDEPWLRVNAYL 666
Query: 580 LFNSSRWKDLNSKFVLQVYRDFVATGDKNFARAVWPSVYIAMAYMEQFDKDGDGMIENEG 639
+ +++ WKDLN KFVLQ+YRD+ TGD+ F R +WP M +FDKD DG+IEN G
Sbjct: 667 IHDTADWKDLNLKFVLQIYRDYYLTGDQGFLRDMWPVCLAVMESEMKFDKDQDGLIENGG 726
Query: 640 FPDQTYDAWSANGVSAYCGGLWVAALQAASALANDVGDHASASYFWVRYQKAKAVYDS-L 698
+ DQTYD W G SAYCGGLW+AA+ +A G F + + Y+ L
Sbjct: 727 YADQTYDGWVTTGPSAYCGGLWLAAVAVMVQMAALCGAQDVQDRFSSILSRGQEAYERLL 786
Query: 699 WNGSYFNYDNSDGSSSTSIQADQLAGQWYARACGL----LPIADEAKVKKALTKIYDFNV 754
WNG Y+NYD S S SI +DQ AGQW+ RACGL + + +AL I++ NV
Sbjct: 787 WNGRYYNYDCSSQPQSRSIMSDQCAGQWFLRACGLGEGDTEVFPTRHIVRALQTIFELNV 846
Query: 755 LKVKGGMCGAMNGMQPDGRIDMSGLQAREIWPGVTYGLAASMIQEEMVDMAFQTAAGVYE 814
GG GA+NGMQP G D + +Q+ E+W GV YGLAA+MIQE + F+TA G Y
Sbjct: 847 QAFAGGAMGAVNGMQPHGVPDRTSVQSDEVWVGVVYGLAATMIQEGLTWEGFRTAEGCYR 906
Query: 815 VAWSEDGLGYSFQTPESWNNNDEYRSLCYMRPLTIW 850
W LG +FQTPE++ +RSL YMRPL+IW
Sbjct: 907 TVWER--LGLAFQTPEAYCQQQVFRSLAYMRPLSIW 940
>gi|403306664|ref|XP_003943843.1| PREDICTED: non-lysosomal glucosylceramidase isoform 2 [Saimiri
boliviensis boliviensis]
Length = 929
Score = 565 bits (1456), Expect = e-158, Method: Compositional matrix adjust.
Identities = 326/821 (39%), Positives = 451/821 (54%), Gaps = 64/821 (7%)
Query: 52 LAHIGFRLYRY----SKQEEAKGKVPVFNVFRKHHITSDQGIPLGGIGAGSIGRSYRGEF 107
+ HIG L RY ++ + + K P ++ + G PLGGIG G+I R +RG+F
Sbjct: 112 IKHIGMGL-RYLQWWYRKTQVEKKTPFIDMINSVPLRQIYGCPLGGIGGGTITRGWRGQF 170
Query: 108 QRFKLFHG------ICDDAPVLANQFSVFVSRPNGEKFSSVLCPRSPGVPKKNTDSGIES 161
R++L G + D P+ +F+V + R + VL P V + S
Sbjct: 171 CRWQLNPGMYQHRTVIADQPICPLKFTVCLRREGQTVYQQVLSLERPNV--------LRS 222
Query: 162 WDWNLKGENCTYHALFPRAWTVYDGEPDPELRIVCRQISPFIPHNYKESSFPASVFTFTL 221
W+W L G YHAL+PRAWTVY P + + CRQI+P +PH+Y++SS P VF + +
Sbjct: 223 WNWGLCGYFAFYHALYPRAWTVYQ-LPGQNVTLTCRQITPILPHDYQDSSLPVGVFVWDV 281
Query: 222 SNSGQTSADVTLLFTWANSVAGDSGLSGHHFNSK-TMTKDG--VHGLTLHHRTANGRPPV 278
N G + DV+++F+ N + G G +N + +DG V GL LHH T P
Sbjct: 282 ENEGDEALDVSIMFSMRNGLGGGDDAPGGLWNEPFHLERDGETVQGLLLHHPTLPN--PY 339
Query: 279 TFAVAAEETADVHVSECPCFLLSGNSKGITAKDMWNEIKKHGSFDHLDNDKTSPSEPGSS 338
T A AA TAD V+ F G T + +W ++ + G D +++P++ G
Sbjct: 340 TMAAAARVTADTMVTHITAFDPDG-----TGQQVWQDLLQDGQLDS-PAGQSTPTQKGVG 393
Query: 339 IGAAIAASLTIPSGSTRSVTFSLAWDCPEVKFFEK--VYHRRYTKFYGTLGDSAARIAHD 396
+ A+ S + + FSLAWD P + F K V++RRYT+F+G GD+A ++H
Sbjct: 394 VAGAVCVSSKLRPRGQCCLEFSLAWDMPRIMFGAKGQVHYRRYTRFFGQDGDAAPALSHY 453
Query: 397 AILEHAKWECEIEAWQRPILEDKRFPEWYPITLFNELYYLNAGGTIWTDGSPPMQSLATI 456
A+ + +WE I AWQ P+L+D+ P WY LFNELY+L GGT+W + +
Sbjct: 454 ALSRYTEWEERISAWQSPVLDDRSLPAWYKSALFNELYFLADGGTVWLE----------V 503
Query: 457 RERKFSLDTPSKIHTPASSDTALGTRLLENGEENIGQFLYLEGAEYVMYNTYDVHFYSSF 516
RE P D + L + G+F YLEG EY MYNTYDVHFY+SF
Sbjct: 504 REDSL----------PQELDGNMCH--LRPTLRDYGRFGYLEGQEYRMYNTYDVHFYASF 551
Query: 517 ALVMLFPKLELSIQRDFAAAVMMHDPGTMKIMSDGKWVARKCLGAVPHDIGL--DDPWFE 574
AL+ML+PKLELS+Q D A A + D + + G K +PHDIG D+PW
Sbjct: 552 ALIMLWPKLELSLQYDMALATLREDLTRRRYLMSGVMAPVKRRNVIPHDIGDPDDEPWLR 611
Query: 575 INSYNLFNSSRWKDLNSKFVLQVYRDFVATGDKNFARAVWPSVYIAMAYMEQFDKDGDGM 634
+N+Y + +++ WKDLN KFVLQVYRD+ TGD+NF + +WP M +FDKD DG+
Sbjct: 612 VNAYLIHDTADWKDLNLKFVLQVYRDYYLTGDQNFLKDMWPVCLAVMESEMKFDKDHDGL 671
Query: 635 IENEGFPDQTYDAWSANGVSAYCGGLWVAALQAASALANDVGDHASASYFWVRYQKAKAV 694
IEN G+ DQTYD W G SAYCGGLW+AA+ +A G F + +
Sbjct: 672 IENGGYADQTYDGWVTTGPSAYCGGLWLAAVAVMVQMAAVCGAQDIQDKFSSILSRGQEA 731
Query: 695 YDS-LWNGSYFNYDNSDGSSSTSIQADQLAGQWYARACGL----LPIADEAKVKKALTKI 749
Y+ LWNG Y+NYD+S S S+ +DQ AGQW+ +ACGL + V +AL I
Sbjct: 732 YERLLWNGRYYNYDSSSQPQSRSVMSDQCAGQWFLKACGLGEGDTEVFPTPHVVRALQTI 791
Query: 750 YDFNVLKVKGGMCGAMNGMQPDGRIDMSGLQAREIWPGVTYGLAASMIQEEMVDMAFQTA 809
++ NV GG GA+NGMQP G D S +Q+ E+W GV YGLAA+MIQE + FQTA
Sbjct: 792 FELNVQAFAGGAMGAVNGMQPHGVPDKSSVQSDEVWVGVVYGLAATMIQEGLTWEGFQTA 851
Query: 810 AGVYEVAWSEDGLGYSFQTPESWNNNDEYRSLCYMRPLTIW 850
G Y W LG +FQTPE++ +RSL YMRPL+IW
Sbjct: 852 EGCYRTVWER--LGLAFQTPEAYCQQRVFRSLAYMRPLSIW 890
>gi|426361727|ref|XP_004048051.1| PREDICTED: non-lysosomal glucosylceramidase isoform 2 [Gorilla
gorilla gorilla]
Length = 933
Score = 564 bits (1454), Expect = e-158, Method: Compositional matrix adjust.
Identities = 329/824 (39%), Positives = 454/824 (55%), Gaps = 70/824 (8%)
Query: 52 LAHIGFRLYRY----SKQEEAKGKVPVFNVFRKHHITSDQGIPLGGIGAGSIGRSYRGEF 107
+ HIG L RY ++ + + K P ++ + G PLGGIG G+I R +RG+F
Sbjct: 112 IKHIGMGL-RYLQWWYRKTQVEKKTPFIDMINSVPLRQIYGCPLGGIGGGTITRGWRGQF 170
Query: 108 QRFKLFHG------ICDDAPVLANQFSVFVSRPNGEKFSSVLCPRSPGVPKKNTDSGIES 161
R++L G + D P+ +F+V + R + VL P V + +
Sbjct: 171 CRWQLNPGMYQHRTVIADQPICPLKFTVCLRREGQTVYQQVLSLERPSV--------LRT 222
Query: 162 WDWNLKGENCTYHALFPRAWTVYDGEPDPELRIVCRQISPFIPHNYKESSFPASVFTFTL 221
W+W L G YHAL+PRAWTVY P + + CRQI+P +PH+Y++SS P VF + +
Sbjct: 223 WNWGLCGYFAFYHALYPRAWTVYQ-LPGQNVTLTCRQITPILPHDYQDSSLPVGVFVWDV 281
Query: 222 SNSGQTSADVTLLFTWANSVAGDSGLSGHHFNSK-TMTKDG--VHGLTLHHRTANGRPPV 278
N G + DV+++F+ N + G G +N + + G V GL LHH T P
Sbjct: 282 ENEGDEALDVSIMFSMRNGLGGGDDAPGGLWNEPFCLERSGETVRGLLLHHPTLPN--PY 339
Query: 279 TFAVAAEETADVHVSECPCFLLSGNSKGITAKDMWNEIKKHGSFDHLDNDKTSPSEPGSS 338
T AVAA TAD V+ F T + +W ++ + G D +++P++ G
Sbjct: 340 TMAVAARVTADTTVTHITAF-----DPDSTGQQVWQDLLQDGQLDS-PTGQSTPTQKGVG 393
Query: 339 IGAAIAASLTI-PSGSTRSVTFSLAWDCPEVKFFEK--VYHRRYTKFYGTLGDSAARIAH 395
I A+ S + P G R + FSLAWD P + F K V++RRYT+F+G GD+A ++H
Sbjct: 394 IAGAVCVSSKLRPRGQCR-LEFSLAWDMPRIMFGAKGQVHYRRYTRFFGQDGDAAPALSH 452
Query: 396 DAILEHAKWECEIEAWQRPILEDKRFPEWYPITLFNELYYLNAGGTIWTDGSPPMQSLAT 455
A+ +A+WE I AWQ P+L+D+ P WY LFNELY+L GGT+W L
Sbjct: 453 YALCRYAEWEERISAWQSPVLDDRSLPAWYKSALFNELYFLADGGTVW---------LEV 503
Query: 456 IRERKFSLDTPSKIHTPASSDTALGTRL--LENGEENIGQFLYLEGAEYVMYNTYDVHFY 513
+ + S LG + L + G+F YLEG EY MYNTYDVHFY
Sbjct: 504 LED---------------SLPEELGRNMCHLRPTLRDYGRFGYLEGQEYRMYNTYDVHFY 548
Query: 514 SSFALVMLFPKLELSIQRDFAAAVMMHDPGTMKIMSDGKWVARKCLGAVPHDIGL--DDP 571
+SFAL+ML+PKLELS+Q D A A + D + + G K +PHDIG D+P
Sbjct: 549 ASFALIMLWPKLELSLQYDMALATLREDLTRRRYLMSGVMAPVKRRNVIPHDIGDPDDEP 608
Query: 572 WFEINSYNLFNSSRWKDLNSKFVLQVYRDFVATGDKNFARAVWPSVYIAMAYMEQFDKDG 631
W +N+Y + +++ WKDLN KFVLQVYRD+ TGD+NF + +WP M +FDKD
Sbjct: 609 WLRVNAYLIHDTADWKDLNLKFVLQVYRDYYLTGDQNFLKDMWPVCLAVMESEMKFDKDH 668
Query: 632 DGMIENEGFPDQTYDAWSANGVSAYCGGLWVAALQAASALANDVGDHASASYFWVRYQKA 691
DG+IEN G+ DQTYD W G SAYCGGLW+AA+ +A G F +
Sbjct: 669 DGLIENGGYADQTYDGWVTTGPSAYCGGLWLAAVAVMVQMAALCGAQDIQDKFSSILSRG 728
Query: 692 KAVYDS-LWNGSYFNYDNSDGSSSTSIQADQLAGQWYARACGL----LPIADEAKVKKAL 746
+ Y+ LWNG Y+NYD+S S S+ +DQ AGQW+ +ACGL + V +AL
Sbjct: 729 QEAYERLLWNGRYYNYDSSSRPQSRSVMSDQCAGQWFLKACGLGEGDTEVFPTQHVVRAL 788
Query: 747 TKIYDFNVLKVKGGMCGAMNGMQPDGRIDMSGLQAREIWPGVTYGLAASMIQEEMVDMAF 806
I++ NV GG GA+NGMQP G D S +Q+ E+W GV YGLAA+MIQE + F
Sbjct: 789 QTIFELNVQAFAGGAMGAVNGMQPHGVPDKSSVQSDEVWVGVVYGLAATMIQEGLTWEGF 848
Query: 807 QTAAGVYEVAWSEDGLGYSFQTPESWNNNDEYRSLCYMRPLTIW 850
QTA G Y W LG +FQTPE++ +RSL YMRPL+IW
Sbjct: 849 QTAEGCYRTVWER--LGLAFQTPEAYCQQRVFRSLAYMRPLSIW 890
>gi|224090236|ref|XP_002190896.1| PREDICTED: non-lysosomal glucosylceramidase [Taeniopygia guttata]
Length = 826
Score = 564 bits (1454), Expect = e-158, Method: Compositional matrix adjust.
Identities = 325/783 (41%), Positives = 450/783 (57%), Gaps = 49/783 (6%)
Query: 88 GIPLGGIGAGSIGRSYRGEFQRFKLFHGICDDAPVLANQFSVFVSRPNGEKFSSVLCPRS 147
G PLGGIG G+I R +RGEF R++L G+ V+ANQF+V + + VL
Sbjct: 65 GCPLGGIGGGTITRGWRGEFCRWQLNPGLYHYGTVIANQFTVCLRCKGQTIYQQVLSMER 124
Query: 148 PGVPKKNTDSGIESWDWNLKGENCTYHALFPRAWTVYDGEPDPELRIVCRQISPFIPHNY 207
P S ++ W+W G YHAL+PRAW VY+ P ++ + CRQ+SP IPH+Y
Sbjct: 125 P--------SSLQGWNWGYCGHYAFYHALYPRAWLVYE-LPGQQVVLTCRQVSPVIPHDY 175
Query: 208 KESSFPASVFTFTLSNSGQTSADVTLLFTWANSVAGDSGLSGHHFNSK-TMTKDG--VHG 264
K+SS P VF + + N +V+++F+ N + SG H+N T KDG V G
Sbjct: 176 KDSSLPVGVFIWEVENRRDEDVEVSIMFSLQNGMGTKEDGSGGHWNEPFTFQKDGERVAG 235
Query: 265 LTLHHRTANGRPPVTFAVAAEETADVHVSECPCFLLSGNSKGITAKDMWNEIKKHGSFDH 324
+ LHH A P T A++A E A V+ F N G+ +++W ++ + G D
Sbjct: 236 VLLHHCPAVN--PFTLAISAREKAGTGVTHVTAF----NPTGL-GREVWQDLLQDGRLDS 288
Query: 325 LDNDKTSPSEPGSSIGAAIAASLTIPSGSTRSVTFSLAWDCPEVKF--FEKVYHRRYTKF 382
K+ P+E G AA+ AS +P+ +++ +LAWD P V F EK++ RRYT+F
Sbjct: 289 -PAGKSRPTEKGEVTAAAVCASCRVPAHGHKTLEMALAWDMPCVHFGSKEKLHVRRYTRF 347
Query: 383 YGTLGDSAARIAHDAILEHAKWECEIEAWQRPILEDKRFPEWYPITLFNELYYLNAGGTI 442
+G+ GD+A ++H A+ + +WE +IEAWQ PILE+ + P WY LFNELY++ GGTI
Sbjct: 348 FGSKGDAAPTLSHYALTHYEEWERKIEAWQNPILENSQLPAWYKSALFNELYFMTDGGTI 407
Query: 443 WTDGSPPMQSLATIRERKFSLDTPSKIHTPASSDTALGTRLLENGEENIGQFLYLEGAEY 502
W + P + PA + G L G+F YLEG EY
Sbjct: 408 WVEVPPDC--------------CAEDLQGPAGA----GLSHLLPVLREYGRFAYLEGQEY 449
Query: 503 VMYNTYDVHFYSSFALVMLFPKLELSIQRDFAAAVMMHDPGTMKIMSDGKWVARKCLGAV 562
MYNTYDVHFY+SFAL+ML+PKL++S+Q D A V+ D + + G+ K V
Sbjct: 450 RMYNTYDVHFYASFALIMLWPKLQISLQYDIAVTVVNEDVQPRQYLMGGQTAQVKMKNVV 509
Query: 563 PHDIG--LDDPWFEINSYNLFNSSRWKDLNSKFVLQVYRDFVATGDKNFARAVWPSVYIA 620
PHDIG D+PW +N+Y + +++ WKDLN KFVLQVYRD+ T D + + +WP
Sbjct: 510 PHDIGDPGDEPWQRVNAYLMHDTADWKDLNLKFVLQVYRDYYLTHDSLYLQDMWPVCQAV 569
Query: 621 MAYMEQFDKDGDGMIENEGFPDQTYDAWSANGVSAYCGGLWVAALQAASALANDVGDHAS 680
M +FD D DG+IEN GF DQTYDAW AYCGGLW+AA+ +A +GD
Sbjct: 570 MESELKFDTDNDGLIENGGFADQTYDAWVVIRRCAYCGGLWLAAVCMMCKMAEVLGDTEI 629
Query: 681 ASYFWVRYQKAKAVYDS-LWNGSYFNYDNSDGSSSTSIQADQLAGQWYARACGL----LP 735
+ K K ++ LWNG Y+NYD+S +S+SI +DQ AGQW+ ACGL
Sbjct: 630 RQKYLGILNKGKEAFERMLWNGKYYNYDSSGSDTSSSIMSDQCAGQWFLGACGLDQGEFE 689
Query: 736 IADEAKVKKALTKIYDFNVLKVKGGMCGAMNGMQPDGRIDMSGLQAREIWPGVTYGLAAS 795
+ ++ + AL I++ NVL GG GA+NGM+PDG D S +Q+ E+W GV Y LAA+
Sbjct: 690 VFPKSHILSALRTIFEKNVLSFAGGTMGAVNGMRPDGVPDTSSVQSNEVWVGVVYSLAAT 749
Query: 796 MIQEEMVDMAFQTAAGVYEVAWSEDGLGYSFQTPESWNNNDEYRSLCYMRPLTIWAMQWA 855
MIQE MV+ F+TA G Y W LG +FQTPE++ YRSL YMRPL+IW+MQ A
Sbjct: 750 MIQEGMVEEGFRTAEGCYRTVWER--LGMAFQTPEAYREKKVYRSLAYMRPLSIWSMQLA 807
Query: 856 LTK 858
L +
Sbjct: 808 LER 810
>gi|50511011|dbj|BAD32491.1| mKIAA1605 protein [Mus musculus]
Length = 940
Score = 564 bits (1453), Expect = e-158, Method: Compositional matrix adjust.
Identities = 329/816 (40%), Positives = 455/816 (55%), Gaps = 59/816 (7%)
Query: 50 RHLAHIGFRLYRY-SKQEEAKGKVPVFNVFRKHHITSDQGIPLGGIGAGSIGRSYRGEFQ 108
+HL +G R ++ ++ + K P ++ + G PLGGIG G+I R +RG+F
Sbjct: 126 KHLG-MGLRYLKWWYRKTHVEKKTPFIDMLNSLPLRQIYGCPLGGIGGGTITRGWRGQFC 184
Query: 109 RFKLFHGICDDAPVLANQFSVFVSRPNGEKFSSVLCPRSPGVPKKNTDSGIESWDWNLKG 168
R++L G+ V+A+QF V + R + VL P V + SW+W L G
Sbjct: 185 RWQLNPGMYQHQTVIADQFIVCLRRDGRTVYQQVLSLELPNV--------LRSWNWGLCG 236
Query: 169 ENCTYHALFPRAWTVYDGEPDPELRIVCRQISPFIPHNYKESSFPASVFTFTLSNSGQTS 228
YHAL+PRAWTVY P + + CRQ++P +PH+Y++SS P VF + + N G +
Sbjct: 237 YFAFYHALYPRAWTVYQ-LPGQNVTLTCRQVTPILPHDYQDSSLPVGVFVWDVENEGDET 295
Query: 229 ADVTLLFTWANSVAGDSGLSGHHFNSKTMTKDG---VHGLTLHHRTANGRPPVTFAVAAE 285
DV++ F+ N + G+ +G +N + G V GL LHH T P T AVAA
Sbjct: 296 LDVSITFSMRNGLGGEDDAAGSLWNEPFRLEQGGTTVQGLLLHHPTPPN--PYTMAVAAR 353
Query: 286 ETADVHVSECPCFLLSGNSKGITAKDMWNEIKKHGSFDHLDNDKTSPSEPGSSIGAAIAA 345
TAD V+ F +G T + +W ++ + G D +++P++ G I A+
Sbjct: 354 CTADTTVTHTTAFDPNG-----TGQQVWQDLLQDGQLDS-PAGQSTPTQKGEGIAGAVCV 407
Query: 346 SLTIPSGSTRSVTFSLAWDCPEVKFFEK--VYHRRYTKFYGTLGDSAARIAHDAILEHAK 403
S + S + FSLAWD P++ F K V++RRYT+F+G+ GD A ++H A+ +A
Sbjct: 408 SSKLLPRSRCCLEFSLAWDMPKIMFGAKSQVHYRRYTRFFGSDGDVAPALSHYALCHYAD 467
Query: 404 WECEIEAWQRPILEDKRFPEWYPITLFNELYYLNAGGTIWTDGSPPMQSLATIRERKFSL 463
WE I AWQ P+L+D+ P WY LFNELY+L GGT+W L
Sbjct: 468 WEDRISAWQNPVLDDRTLPAWYKSALFNELYFLADGGTVW-------------------L 508
Query: 464 DTPSKIHTPASSDTALG--TRLLENGEENIGQFLYLEGAEYVMYNTYDVHFYSSFALVML 521
+ P+ S LG R L + ++ G+F YLEG EY MYNTYDVHFY+SFALVML
Sbjct: 509 EVPAD-----SLPEGLGGSMRQLRSTLQDYGRFGYLEGQEYRMYNTYDVHFYASFALVML 563
Query: 522 FPKLELSIQRDFAAAVMMHDPGTMKIMSDGKWVARKCLGAVPHDIGL--DDPWFEINSYN 579
+PKLELS+Q D A A + D + + G K +PHDIG D+PW +N+Y
Sbjct: 564 WPKLELSLQYDMALATLKEDLTRRRYLMSGVVAPVKRRNVIPHDIGDPDDEPWLRVNAYL 623
Query: 580 LFNSSRWKDLNSKFVLQVYRDFVATGDKNFARAVWPSVYIAMAYMEQFDKDGDGMIENEG 639
+ +++ WKDLN KFVLQ+YRD+ TGD+ F +WP M +FDKD DG+IEN G
Sbjct: 624 IHDTADWKDLNLKFVLQIYRDYYLTGDQGFLEDMWPVCLAVMESEMKFDKDQDGLIENGG 683
Query: 640 FPDQTYDAWSANGVSAYCGGLWVAALQAASALANDVGDHASASYFWVRYQKAKAVYDS-L 698
+ DQTYDAW G SAYCGGLW+AA+ +A G F + + Y+ L
Sbjct: 684 YADQTYDAWVTTGPSAYCGGLWLAAVAVMVQMAVLCGAQDVQERFASILCRGREAYERLL 743
Query: 699 WNGSYFNYDNSDGSSSTSIQADQLAGQWYARACGL----LPIADEAKVKKALTKIYDFNV 754
WNG Y+NYD+S S SI +DQ AGQW+ RACGL + V +AL I++ NV
Sbjct: 744 WNGRYYNYDSSSHPQSRSIMSDQCAGQWFLRACGLGEGDTEVFPTLHVVRALQTIFELNV 803
Query: 755 LKVKGGMCGAMNGMQPDGRIDMSGLQAREIWPGVTYGLAASMIQEEMVDMAFQTAAGVYE 814
GG GA+NGM P G D S +Q+ E+W GV YGLAA+MIQE + F+TA G Y
Sbjct: 804 QAFAGGAMGAVNGMHPHGVPDRSSVQSDEVWVGVVYGLAATMIQEGLTWEGFRTAEGCYR 863
Query: 815 VAWSEDGLGYSFQTPESWNNNDEYRSLCYMRPLTIW 850
W LG +FQTPE++ +RSL YMRPL+IW
Sbjct: 864 TVWER--LGLAFQTPEAYCQQQVFRSLAYMRPLSIW 897
>gi|240120073|ref|NP_766280.2| non-lysosomal glucosylceramidase [Mus musculus]
gi|143018443|sp|Q69ZF3.2|GBA2_MOUSE RecName: Full=Non-lysosomal glucosylceramidase; Short=NLGase;
AltName: Full=Beta-glucocerebrosidase 2;
Short=Beta-glucosidase 2; AltName:
Full=Glucosylceramidase 2
gi|34784135|gb|AAH56935.1| Glucosidase beta 2 [Mus musculus]
Length = 918
Score = 563 bits (1452), Expect = e-157, Method: Compositional matrix adjust.
Identities = 329/816 (40%), Positives = 455/816 (55%), Gaps = 59/816 (7%)
Query: 50 RHLAHIGFRLYRY-SKQEEAKGKVPVFNVFRKHHITSDQGIPLGGIGAGSIGRSYRGEFQ 108
+HL +G R ++ ++ + K P ++ + G PLGGIG G+I R +RG+F
Sbjct: 104 KHLG-MGLRYLKWWYRKTHVEKKTPFIDMLNSLPLRQIYGCPLGGIGGGTITRGWRGQFC 162
Query: 109 RFKLFHGICDDAPVLANQFSVFVSRPNGEKFSSVLCPRSPGVPKKNTDSGIESWDWNLKG 168
R++L G+ V+A+QF V + R + VL P V + SW+W L G
Sbjct: 163 RWQLNPGMYQHQTVIADQFIVCLRRDGRTVYQQVLSLELPNV--------LRSWNWGLCG 214
Query: 169 ENCTYHALFPRAWTVYDGEPDPELRIVCRQISPFIPHNYKESSFPASVFTFTLSNSGQTS 228
YHAL+PRAWTVY P + + CRQ++P +PH+Y++SS P VF + + N G +
Sbjct: 215 YFAFYHALYPRAWTVYQ-LPGQNVTLTCRQVTPILPHDYQDSSLPVGVFVWDVENEGDET 273
Query: 229 ADVTLLFTWANSVAGDSGLSGHHFNSKTMTKDG---VHGLTLHHRTANGRPPVTFAVAAE 285
DV++ F+ N + G+ +G +N + G V GL LHH T P T AVAA
Sbjct: 274 LDVSITFSMRNGLGGEDDAAGSLWNEPFRLEQGGTTVQGLLLHHPTPPN--PYTMAVAAR 331
Query: 286 ETADVHVSECPCFLLSGNSKGITAKDMWNEIKKHGSFDHLDNDKTSPSEPGSSIGAAIAA 345
TAD V+ F +G T + +W ++ + G D +++P++ G I A+
Sbjct: 332 CTADTTVTHTTAFDPNG-----TGQQVWQDLLQDGQLDS-PAGQSTPTQKGEGIAGAVCV 385
Query: 346 SLTIPSGSTRSVTFSLAWDCPEVKFFEK--VYHRRYTKFYGTLGDSAARIAHDAILEHAK 403
S + S + FSLAWD P++ F K V++RRYT+F+G+ GD A ++H A+ +A
Sbjct: 386 SSKLLPRSRCCLEFSLAWDMPKIMFGAKSQVHYRRYTRFFGSDGDVAPALSHYALCHYAD 445
Query: 404 WECEIEAWQRPILEDKRFPEWYPITLFNELYYLNAGGTIWTDGSPPMQSLATIRERKFSL 463
WE I AWQ P+L+D+ P WY LFNELY+L GGT+W L
Sbjct: 446 WEDRISAWQNPVLDDRTLPAWYKSALFNELYFLADGGTVW-------------------L 486
Query: 464 DTPSKIHTPASSDTALG--TRLLENGEENIGQFLYLEGAEYVMYNTYDVHFYSSFALVML 521
+ P+ S LG R L + ++ G+F YLEG EY MYNTYDVHFY+SFALVML
Sbjct: 487 EVPAD-----SLPEGLGGSMRQLRSTLQDYGRFGYLEGQEYRMYNTYDVHFYASFALVML 541
Query: 522 FPKLELSIQRDFAAAVMMHDPGTMKIMSDGKWVARKCLGAVPHDIGL--DDPWFEINSYN 579
+PKLELS+Q D A A + D + + G K +PHDIG D+PW +N+Y
Sbjct: 542 WPKLELSLQYDMALATLKEDLTRRRYLMSGVVAPVKRRNVIPHDIGDPDDEPWLRVNAYL 601
Query: 580 LFNSSRWKDLNSKFVLQVYRDFVATGDKNFARAVWPSVYIAMAYMEQFDKDGDGMIENEG 639
+ +++ WKDLN KFVLQ+YRD+ TGD+ F +WP M +FDKD DG+IEN G
Sbjct: 602 IHDTADWKDLNLKFVLQIYRDYYLTGDQGFLEDMWPVCLAVMESEMKFDKDQDGLIENGG 661
Query: 640 FPDQTYDAWSANGVSAYCGGLWVAALQAASALANDVGDHASASYFWVRYQKAKAVYDS-L 698
+ DQTYDAW G SAYCGGLW+AA+ +A G F + + Y+ L
Sbjct: 662 YADQTYDAWVTTGPSAYCGGLWLAAVAVMVQMAVLCGAQDVQERFASILCRGREAYERLL 721
Query: 699 WNGSYFNYDNSDGSSSTSIQADQLAGQWYARACGL----LPIADEAKVKKALTKIYDFNV 754
WNG Y+NYD+S S SI +DQ AGQW+ RACGL + V +AL I++ NV
Sbjct: 722 WNGRYYNYDSSSHPQSRSIMSDQCAGQWFLRACGLGEGDTEVFPTLHVVRALQTIFELNV 781
Query: 755 LKVKGGMCGAMNGMQPDGRIDMSGLQAREIWPGVTYGLAASMIQEEMVDMAFQTAAGVYE 814
GG GA+NGM P G D S +Q+ E+W GV YGLAA+MIQE + F+TA G Y
Sbjct: 782 QAFAGGAMGAVNGMHPHGVPDRSSVQSDEVWVGVVYGLAATMIQEGLTWEGFRTAEGCYR 841
Query: 815 VAWSEDGLGYSFQTPESWNNNDEYRSLCYMRPLTIW 850
W LG +FQTPE++ +RSL YMRPL+IW
Sbjct: 842 TVWER--LGLAFQTPEAYCQQQVFRSLAYMRPLSIW 875
>gi|148670514|gb|EDL02461.1| glucosidase beta 2, isoform CRA_a [Mus musculus]
Length = 918
Score = 563 bits (1451), Expect = e-157, Method: Compositional matrix adjust.
Identities = 329/816 (40%), Positives = 455/816 (55%), Gaps = 59/816 (7%)
Query: 50 RHLAHIGFRLYRY-SKQEEAKGKVPVFNVFRKHHITSDQGIPLGGIGAGSIGRSYRGEFQ 108
+HL +G R ++ ++ + K P ++ + G PLGGIG G+I R +RG+F
Sbjct: 104 KHLG-MGLRYLKWWYRKTHVEKKTPFIDMLNSLPLRQIYGCPLGGIGGGTITRGWRGQFC 162
Query: 109 RFKLFHGICDDAPVLANQFSVFVSRPNGEKFSSVLCPRSPGVPKKNTDSGIESWDWNLKG 168
R++L G+ V+A+QF V + R + VL P V + SW+W L G
Sbjct: 163 RWQLNPGMYQHQTVIADQFIVCLRRDGRTVYQQVLSLELPNV--------LRSWNWGLCG 214
Query: 169 ENCTYHALFPRAWTVYDGEPDPELRIVCRQISPFIPHNYKESSFPASVFTFTLSNSGQTS 228
YHAL+PRAWTVY P + + CRQ++P +PH+Y++SS P VF + + N G +
Sbjct: 215 YFAFYHALYPRAWTVYQ-LPGQNVTLTCRQVTPILPHDYQDSSLPVGVFVWDVENEGDET 273
Query: 229 ADVTLLFTWANSVAGDSGLSGHHFNSKTMTKDG---VHGLTLHHRTANGRPPVTFAVAAE 285
DV++ F+ N + G+ +G +N + G V GL LHH T P T AVAA
Sbjct: 274 LDVSITFSMRNGLGGEDDAAGSLWNEPFRLEQGGTTVQGLLLHHPTPPN--PYTMAVAAR 331
Query: 286 ETADVHVSECPCFLLSGNSKGITAKDMWNEIKKHGSFDHLDNDKTSPSEPGSSIGAAIAA 345
TAD V+ F +G T + +W ++ + G D +++P++ G I A+
Sbjct: 332 CTADTTVTHTTAFDPNG-----TGQQVWQDLLQDGQLDS-PAGQSTPTQKGEGIAGAVCV 385
Query: 346 SLTIPSGSTRSVTFSLAWDCPEVKFFEK--VYHRRYTKFYGTLGDSAARIAHDAILEHAK 403
S + S + FSLAWD P++ F K V++RRYT+F+G+ GD A ++H A+ +A
Sbjct: 386 SSKLLPRSRCCLEFSLAWDMPKIMFGAKSQVHYRRYTRFFGSDGDVAPALSHYALCHYAD 445
Query: 404 WECEIEAWQRPILEDKRFPEWYPITLFNELYYLNAGGTIWTDGSPPMQSLATIRERKFSL 463
WE I AWQ P+L+D+ P WY LFNELY+L GGT+W L
Sbjct: 446 WEDRISAWQNPVLDDRTLPAWYKSALFNELYFLADGGTVW-------------------L 486
Query: 464 DTPSKIHTPASSDTALG--TRLLENGEENIGQFLYLEGAEYVMYNTYDVHFYSSFALVML 521
+ P+ S LG R L + ++ G+F YLEG EY MYNTYDVHFY+SFALVML
Sbjct: 487 EVPAD-----SLPEGLGGSMRQLRSTLQDYGRFGYLEGQEYRMYNTYDVHFYASFALVML 541
Query: 522 FPKLELSIQRDFAAAVMMHDPGTMKIMSDGKWVARKCLGAVPHDIGL--DDPWFEINSYN 579
+PKLELS+Q D A A + D + + G K +PHDIG D+PW +N+Y
Sbjct: 542 WPKLELSLQYDMALATLKEDLTRRRYLMSGVVAPVKRRNVIPHDIGDPDDEPWLRVNAYL 601
Query: 580 LFNSSRWKDLNSKFVLQVYRDFVATGDKNFARAVWPSVYIAMAYMEQFDKDGDGMIENEG 639
+ +++ WKDLN KFVLQ+YRD+ TGD+ F +WP M +FDKD DG+IEN G
Sbjct: 602 IHDTADWKDLNLKFVLQIYRDYYLTGDQGFLEDMWPVCLAVMESEMKFDKDQDGLIENGG 661
Query: 640 FPDQTYDAWSANGVSAYCGGLWVAALQAASALANDVGDHASASYFWVRYQKAKAVYDS-L 698
+ DQTYDAW G SAYCGGLW+AA+ +A G F + + Y+ L
Sbjct: 662 YADQTYDAWVTTGPSAYCGGLWLAAVAVMVQMAVLCGAQDVQERFASILCRGREAYERLL 721
Query: 699 WNGSYFNYDNSDGSSSTSIQADQLAGQWYARACGL----LPIADEAKVKKALTKIYDFNV 754
WNG Y+NYD+S S SI +DQ AGQW+ RACGL + V +AL I++ NV
Sbjct: 722 WNGRYYNYDSSSHPQSRSIMSDQCAGQWFLRACGLGEGDTEVFPTLHVVRALQTIFELNV 781
Query: 755 LKVKGGMCGAMNGMQPDGRIDMSGLQAREIWPGVTYGLAASMIQEEMVDMAFQTAAGVYE 814
GG GA+NGM P G D S +Q+ E+W GV YGLAA+MIQE + F+TA G Y
Sbjct: 782 QAFAGGAMGAVNGMHPHGVPDRSSVQSDEVWVGVVYGLAATMIQEGLTWEGFRTAEGCYR 841
Query: 815 VAWSEDGLGYSFQTPESWNNNDEYRSLCYMRPLTIW 850
W LG +FQTPE++ +RSL YMRPL+IW
Sbjct: 842 TVWER--LGLAFQTPEAYCQQQVFRSLAYMRPLSIW 875
>gi|26354314|dbj|BAC40785.1| unnamed protein product [Mus musculus]
Length = 918
Score = 563 bits (1450), Expect = e-157, Method: Compositional matrix adjust.
Identities = 329/816 (40%), Positives = 455/816 (55%), Gaps = 59/816 (7%)
Query: 50 RHLAHIGFRLYRY-SKQEEAKGKVPVFNVFRKHHITSDQGIPLGGIGAGSIGRSYRGEFQ 108
+HL +G R ++ ++ + K P ++ + G PLGGIG G+I R +RG+F
Sbjct: 104 KHLG-MGLRYLKWWYRKTHVEKKTPFIDMLNSLPLRQIYGCPLGGIGGGTITRGWRGQFC 162
Query: 109 RFKLFHGICDDAPVLANQFSVFVSRPNGEKFSSVLCPRSPGVPKKNTDSGIESWDWNLKG 168
R++L G+ V+A+QF V + R + VL P V + SW+W L G
Sbjct: 163 RWQLNPGMYQHQTVIADQFIVCLRRDGRTVYQQVLSLELPNV--------LRSWNWGLCG 214
Query: 169 ENCTYHALFPRAWTVYDGEPDPELRIVCRQISPFIPHNYKESSFPASVFTFTLSNSGQTS 228
YHAL+PRAWTVY P + + CRQ++P +PH+Y++SS P VF + + N G +
Sbjct: 215 YFAFYHALYPRAWTVYQ-LPGQNVTLTCRQVTPILPHDYQDSSLPVGVFVWDVENEGDET 273
Query: 229 ADVTLLFTWANSVAGDSGLSGHHFNSKTMTKDG---VHGLTLHHRTANGRPPVTFAVAAE 285
DV++ F+ N + G+ +G +N + G V GL LHH T P T AVAA
Sbjct: 274 LDVSITFSMRNGLGGEDDAAGSLWNEPFRLEQGGTTVQGLLLHHPTPPN--PYTMAVAAR 331
Query: 286 ETADVHVSECPCFLLSGNSKGITAKDMWNEIKKHGSFDHLDNDKTSPSEPGSSIGAAIAA 345
TAD V+ F +G T + +W ++ + G D +++P++ G I A+
Sbjct: 332 CTADTTVTHTTAFDPNG-----TGQQVWQDLLQDGQLDS-PAGQSTPTQKGEGIAGAVCI 385
Query: 346 SLTIPSGSTRSVTFSLAWDCPEVKFFEK--VYHRRYTKFYGTLGDSAARIAHDAILEHAK 403
S + S + FSLAWD P++ F K V++RRYT+F+G+ GD A ++H A+ +A
Sbjct: 386 SSKLLPRSRCCLEFSLAWDMPKIMFGAKSQVHYRRYTRFFGSDGDVAPALSHYALCHYAD 445
Query: 404 WECEIEAWQRPILEDKRFPEWYPITLFNELYYLNAGGTIWTDGSPPMQSLATIRERKFSL 463
WE I AWQ P+L+D+ P WY LFNELY+L GGT+W L
Sbjct: 446 WEDRISAWQNPVLDDRTLPAWYKSALFNELYFLADGGTVW-------------------L 486
Query: 464 DTPSKIHTPASSDTALG--TRLLENGEENIGQFLYLEGAEYVMYNTYDVHFYSSFALVML 521
+ P+ S LG R L + ++ G+F YLEG EY MYNTYDVHFY+SFALVML
Sbjct: 487 EVPAD-----SLPEGLGGSMRQLRSTLQDYGRFGYLEGQEYRMYNTYDVHFYASFALVML 541
Query: 522 FPKLELSIQRDFAAAVMMHDPGTMKIMSDGKWVARKCLGAVPHDIGL--DDPWFEINSYN 579
+PKLELS+Q D A A + D + + G K +PHDIG D+PW +N+Y
Sbjct: 542 WPKLELSLQYDMALATLKEDLTRRRYLMSGVVAPVKRRNVIPHDIGDPDDEPWLRVNAYL 601
Query: 580 LFNSSRWKDLNSKFVLQVYRDFVATGDKNFARAVWPSVYIAMAYMEQFDKDGDGMIENEG 639
+ +++ WKDLN KFVLQ+YRD+ TGD+ F +WP M +FDKD DG+IEN G
Sbjct: 602 IHDTADWKDLNLKFVLQIYRDYYLTGDQGFLEDMWPVCLAVMESEMKFDKDQDGLIENGG 661
Query: 640 FPDQTYDAWSANGVSAYCGGLWVAALQAASALANDVGDHASASYFWVRYQKAKAVYDS-L 698
+ DQTYDAW G SAYCGGLW+AA+ +A G F + + Y+ L
Sbjct: 662 YADQTYDAWVTTGPSAYCGGLWLAAVAVMVQMAVLCGAQDVQERFASILCRGREAYERLL 721
Query: 699 WNGSYFNYDNSDGSSSTSIQADQLAGQWYARACGL----LPIADEAKVKKALTKIYDFNV 754
WNG Y+NYD+S S SI +DQ AGQW+ RACGL + V +AL I++ NV
Sbjct: 722 WNGRYYNYDSSSHPQSRSIMSDQCAGQWFLRACGLGEGDTEVFPTLHVVRALQTIFELNV 781
Query: 755 LKVKGGMCGAMNGMQPDGRIDMSGLQAREIWPGVTYGLAASMIQEEMVDMAFQTAAGVYE 814
GG GA+NGM P G D S +Q+ E+W GV YGLAA+MIQE + F+TA G Y
Sbjct: 782 QAFAGGAMGAVNGMHPHGVPDRSSVQSDEVWVGVVYGLAATMIQEGLTWEGFRTAEGCYR 841
Query: 815 VAWSEDGLGYSFQTPESWNNNDEYRSLCYMRPLTIW 850
W LG +FQTPE++ +RSL YMRPL+IW
Sbjct: 842 TVWER--LGLAFQTPEAYCQQQVFRSLAYMRPLSIW 875
>gi|143018484|sp|Q5M868.2|GBA2_RAT RecName: Full=Non-lysosomal glucosylceramidase; Short=NLGase;
AltName: Full=Beta-glucocerebrosidase 2;
Short=Beta-glucosidase 2; AltName:
Full=Glucosylceramidase 2
Length = 912
Score = 561 bits (1446), Expect = e-157, Method: Compositional matrix adjust.
Identities = 334/818 (40%), Positives = 457/818 (55%), Gaps = 64/818 (7%)
Query: 52 LAHIGFRLYRYSK----QEEAKGKVPVFNVFRKHHITSDQGIPLGGIGAGSIGRSYRGEF 107
+ H G L RY K + + + K P ++F + G PLGGIG G+I R +RG+F
Sbjct: 103 VKHFGMGL-RYLKWWYRKTQVEKKTPFIDMFNSVPLRQIYGCPLGGIGGGTITRGWRGQF 161
Query: 108 QRFKLFHGICDDAPVLANQFSVFVSRPNGEKFSSVLCPRSPGVPKKNTDSGIESWDWNLK 167
R++L G+ V+A+QF V + R + VL P V + SW+W L
Sbjct: 162 CRWQLNPGMYQHQTVIADQFIVCLRRDGRTVYQQVLSLELPSV--------LRSWNWGLC 213
Query: 168 GENCTYHALFPRAWTVYDGEPDPELRIVCRQISPFIPHNYKESSFPASVFTFTLSNSGQT 227
G YHAL+PRAWTVY P + + CRQI+P +PH+Y++SS P VF + + N G
Sbjct: 214 GYFAFYHALYPRAWTVYQ-LPGQNVTLTCRQITPILPHDYQDSSLPVGVFVWDVENEGDE 272
Query: 228 SADVTLLFTWANSVAGDSGLSGHHFNSK-TMTKDG--VHGLTLHHRTANGRPPVTFAVAA 284
+ DV+++F+ N + G+ +G +N + +DG V GL LHH T P T AVAA
Sbjct: 273 TLDVSIMFSMRNGLGGEDDAAGGLWNEPFRLEQDGTTVQGLLLHHPTPPN--PYTMAVAA 330
Query: 285 EETADVHVSECPCFLLSGNSKGITAKDMWNEIKKHGSFDHLDNDKTSPSEPGSSI-GAAI 343
TAD V+ F T + +W ++ + G D +++P++ G + GA
Sbjct: 331 RHTADTTVTYTTAF-----DPDSTGQQVWQDLLQDGQLDS-PAGQSTPTQRGEGVAGAVC 384
Query: 344 AASLTIPSGSTRSVTFSLAWDCPEVKFFEK--VYHRRYTKFYGTLGDSAARIAHDAILEH 401
A+S +P G + FSLAWD P + F K V++RRYT+F+G+ GD A ++H A+ ++
Sbjct: 385 ASSKLLPRGRC-CLEFSLAWDMPRIMFGAKGQVHYRRYTRFFGSDGDVAPALSHYALCQY 443
Query: 402 AKWECEIEAWQRPILEDKRFPEWYPITLFNELYYLNAGGTIWTDGSPPMQSLATIRERKF 461
A WE I AWQ P+L+D+ P WY LFNELY+L GGT+W
Sbjct: 444 AGWENSISAWQNPVLDDRSLPAWYKSALFNELYFLADGGTVW------------------ 485
Query: 462 SLDTPSKIHTPASSDTALGTRL--LENGEENIGQFLYLEGAEYVMYNTYDVHFYSSFALV 519
L+ P S LG + L ++ G+F YLEG EY MYNTYDVHFY+SFALV
Sbjct: 486 -LEVPED-----SLPEELGGSMYQLRPILQDYGRFGYLEGQEYRMYNTYDVHFYASFALV 539
Query: 520 MLFPKLELSIQRDFAAAVMMHDPGTMKIMSDGKWVARKCLGAVPHDIGL--DDPWFEINS 577
ML+PKLELS+Q D A A D + + G K +PHDIG D+PW +N+
Sbjct: 540 MLWPKLELSLQYDMALATFKEDLTRRRYLMSGVVAPVKRRNVIPHDIGDPDDEPWLRVNA 599
Query: 578 YNLFNSSRWKDLNSKFVLQVYRDFVATGDKNFARAVWPSVYIAMAYMEQFDKDGDGMIEN 637
Y + +++ WKDLN KFVLQVYRD+ TGD+ F + +WP M +FDKD DG+IEN
Sbjct: 600 YLIHDTADWKDLNLKFVLQVYRDYYLTGDQGFLKDMWPVCLAVMESEMKFDKDQDGLIEN 659
Query: 638 EGFPDQTYDAWSANGVSAYCGGLWVAALQAASALANDVGDHASASYFWVRYQKAKAVYDS 697
G+ DQTYD W G SAYCGGLW+AA+ +A G F + + Y+
Sbjct: 660 GGYADQTYDGWVTTGPSAYCGGLWLAAVAVMVQMAVLCGAQDVQDKFSSILCRGREAYER 719
Query: 698 -LWNGSYFNYDNSDGSSSTSIQADQLAGQWYARACGL----LPIADEAKVKKALTKIYDF 752
LWNG Y+NYD+S S S+ +DQ AGQW+ RACGL + V +AL I++
Sbjct: 720 LLWNGRYYNYDSSSQPQSRSVMSDQCAGQWFLRACGLGEGDTEVFPTLHVVRALKTIFEL 779
Query: 753 NVLKVKGGMCGAMNGMQPDGRIDMSGLQAREIWPGVTYGLAASMIQEEMVDMAFQTAAGV 812
NV GG GA+NGMQP G D S +Q+ E+W GV YGLAA+MIQE + F+TA G
Sbjct: 780 NVQAFAGGAMGAVNGMQPHGVPDRSSVQSDEVWVGVVYGLAATMIQEGLTWEGFRTAEGC 839
Query: 813 YEVAWSEDGLGYSFQTPESWNNNDEYRSLCYMRPLTIW 850
Y W LG +FQTPE++ +RSL YMRPL+IW
Sbjct: 840 YRTVWER--LGLAFQTPEAYCQQRVFRSLAYMRPLSIW 875
>gi|189217893|ref|NP_001013109.2| non-lysosomal glucosylceramidase [Rattus norvegicus]
gi|149045757|gb|EDL98757.1| glucosidase beta 2 [Rattus norvegicus]
Length = 904
Score = 560 bits (1444), Expect = e-157, Method: Compositional matrix adjust.
Identities = 334/818 (40%), Positives = 457/818 (55%), Gaps = 64/818 (7%)
Query: 52 LAHIGFRLYRYSK----QEEAKGKVPVFNVFRKHHITSDQGIPLGGIGAGSIGRSYRGEF 107
+ H G L RY K + + + K P ++F + G PLGGIG G+I R +RG+F
Sbjct: 103 VKHFGMGL-RYLKWWYRKTQVEKKTPFIDMFNSVPLRQIYGCPLGGIGGGTITRGWRGQF 161
Query: 108 QRFKLFHGICDDAPVLANQFSVFVSRPNGEKFSSVLCPRSPGVPKKNTDSGIESWDWNLK 167
R++L G+ V+A+QF V + R + VL P V + SW+W L
Sbjct: 162 CRWQLNPGMYQHQTVIADQFIVCLRRDGRTVYQQVLSLELPSV--------LRSWNWGLC 213
Query: 168 GENCTYHALFPRAWTVYDGEPDPELRIVCRQISPFIPHNYKESSFPASVFTFTLSNSGQT 227
G YHAL+PRAWTVY P + + CRQI+P +PH+Y++SS P VF + + N G
Sbjct: 214 GYFAFYHALYPRAWTVYQ-LPGQNVTLTCRQITPILPHDYQDSSLPVGVFVWDVENEGDE 272
Query: 228 SADVTLLFTWANSVAGDSGLSGHHFNSK-TMTKDG--VHGLTLHHRTANGRPPVTFAVAA 284
+ DV+++F+ N + G+ +G +N + +DG V GL LHH T P T AVAA
Sbjct: 273 TLDVSIMFSMRNGLGGEDDAAGGLWNEPFRLEQDGTTVQGLLLHHPTPPN--PYTMAVAA 330
Query: 285 EETADVHVSECPCFLLSGNSKGITAKDMWNEIKKHGSFDHLDNDKTSPSEPGSSI-GAAI 343
TAD V+ F T + +W ++ + G D +++P++ G + GA
Sbjct: 331 RHTADTTVTYTTAF-----DPDSTGQQVWQDLLQDGQLDS-PAGQSTPTQRGEGVAGAVC 384
Query: 344 AASLTIPSGSTRSVTFSLAWDCPEVKFFEK--VYHRRYTKFYGTLGDSAARIAHDAILEH 401
A+S +P G + FSLAWD P + F K V++RRYT+F+G+ GD A ++H A+ ++
Sbjct: 385 ASSKLLPRGRC-CLEFSLAWDMPRIMFGAKGQVHYRRYTRFFGSDGDVAPALSHYALCQY 443
Query: 402 AKWECEIEAWQRPILEDKRFPEWYPITLFNELYYLNAGGTIWTDGSPPMQSLATIRERKF 461
A WE I AWQ P+L+D+ P WY LFNELY+L GGT+W
Sbjct: 444 AGWENSISAWQNPVLDDRSLPAWYKSALFNELYFLADGGTVW------------------ 485
Query: 462 SLDTPSKIHTPASSDTALGTRL--LENGEENIGQFLYLEGAEYVMYNTYDVHFYSSFALV 519
L+ P S LG + L ++ G+F YLEG EY MYNTYDVHFY+SFALV
Sbjct: 486 -LEVPED-----SLPEELGGSMYQLRPILQDYGRFGYLEGQEYRMYNTYDVHFYASFALV 539
Query: 520 MLFPKLELSIQRDFAAAVMMHDPGTMKIMSDGKWVARKCLGAVPHDIGL--DDPWFEINS 577
ML+PKLELS+Q D A A D + + G K +PHDIG D+PW +N+
Sbjct: 540 MLWPKLELSLQYDMALATFKEDLTRRRYLMSGVVAPVKRRNVIPHDIGDPDDEPWLRVNA 599
Query: 578 YNLFNSSRWKDLNSKFVLQVYRDFVATGDKNFARAVWPSVYIAMAYMEQFDKDGDGMIEN 637
Y + +++ WKDLN KFVLQVYRD+ TGD+ F + +WP M +FDKD DG+IEN
Sbjct: 600 YLIHDTADWKDLNLKFVLQVYRDYYLTGDQGFLKDMWPVCLAVMESEMKFDKDQDGLIEN 659
Query: 638 EGFPDQTYDAWSANGVSAYCGGLWVAALQAASALANDVGDHASASYFWVRYQKAKAVYDS 697
G+ DQTYD W G SAYCGGLW+AA+ +A G F + + Y+
Sbjct: 660 GGYADQTYDGWVTTGPSAYCGGLWLAAVAVMVQMAVLCGAQDVQDKFSSILCRGREAYER 719
Query: 698 -LWNGSYFNYDNSDGSSSTSIQADQLAGQWYARACGL----LPIADEAKVKKALTKIYDF 752
LWNG Y+NYD+S S S+ +DQ AGQW+ RACGL + V +AL I++
Sbjct: 720 LLWNGRYYNYDSSSQPQSRSVMSDQCAGQWFLRACGLGEGDTEVFPTLHVVRALKTIFEL 779
Query: 753 NVLKVKGGMCGAMNGMQPDGRIDMSGLQAREIWPGVTYGLAASMIQEEMVDMAFQTAAGV 812
NV GG GA+NGMQP G D S +Q+ E+W GV YGLAA+MIQE + F+TA G
Sbjct: 780 NVQAFAGGAMGAVNGMQPHGVPDRSSVQSDEVWVGVVYGLAATMIQEGLTWEGFRTAEGC 839
Query: 813 YEVAWSEDGLGYSFQTPESWNNNDEYRSLCYMRPLTIW 850
Y W LG +FQTPE++ +RSL YMRPL+IW
Sbjct: 840 YRTVWER--LGLAFQTPEAYCQQRVFRSLAYMRPLSIW 875
>gi|444729906|gb|ELW70309.1| Non-lysosomal glucosylceramidase [Tupaia chinensis]
Length = 1001
Score = 560 bits (1442), Expect = e-156, Method: Compositional matrix adjust.
Identities = 342/876 (39%), Positives = 473/876 (53%), Gaps = 93/876 (10%)
Query: 50 RHLAHIGFR-LYRYSKQEEAKGKVPVFNVFRKHHITSDQGIPLGGIGAGSIGRSYRGEFQ 108
+HL +G R L + ++ + + K P ++ + G PLGGIG G+I R +RG+F
Sbjct: 105 KHLG-MGLRYLQWWYRKTQVEKKTPFIDMINCVPLRQIYGCPLGGIGGGTITRGWRGQFC 163
Query: 109 RFKLFHGICDDAPVLANQFSVFVSRPNGEKFSSVLCPRSPGVPKKNTDSGIESWDWNLKG 168
R++L G+ V+A+QF+V + R + VL P V + SW+W L G
Sbjct: 164 RWQLNPGMYQHRTVIADQFTVCLRREGQTVYQQVLSLDHPSV--------LRSWNWGLCG 215
Query: 169 ENCTYHALFPRAWTVYDGEPDPELRIVCRQISPFIPHNYKESSFPASVFTFTLSNSGQTS 228
YHAL+PRAWTVY P + + CRQI+P +PH+Y++SS P VF + + N G +
Sbjct: 216 YFAFYHALYPRAWTVYQ-LPGQNVTLTCRQITPILPHDYQDSSLPVGVFVWDVENEGDEA 274
Query: 229 ADVTLLFTWANSVAGDSGLSGHHFNSK-TMTKDG--VHGLTLHHRTANGRPPVTFAVAAE 285
DV+++F+ N + G+ G +N + +DG V GL LHH T P T AVAA
Sbjct: 275 LDVSIMFSMRNGLGGEDDAPGGLWNEPFCLERDGETVQGLLLHHPTLPN--PYTMAVAAR 332
Query: 286 ETADVHVSECPCFLLSGNSKGITAKDMWNEIKKHGSFDHLDNDKTSPSEPGSSIGAAIAA 345
TA+ V+ F G T + +W ++ + G D +++PS+ G I A+
Sbjct: 333 LTANTIVTHITAFDPDG-----TGQQVWQDLLQDGQLDS-PAGRSTPSQKGVGIAGAVCI 386
Query: 346 SLTIPSGSTRSVTFSLAWDCPEVKFFEK--VYHRRYTKFYGTLGDSAARIAHDAILEHAK 403
S +P + FSLAWD P++ F K V++RRYT+F+ GD+A ++H A+ +A
Sbjct: 387 SGKLPPRGQCCLEFSLAWDMPKIVFGAKGQVHYRRYTRFFSPDGDAAPALSHYALCRYAD 446
Query: 404 WECEIEAWQRPILEDKRFPEWYPITLFNELYYLNAGGTIWTD----------GSPPMQSL 453
WE I AWQ PIL+D+ P WY LFNELY+L GGT+W + G Q
Sbjct: 447 WEERISAWQSPILDDRSLPAWYKSALFNELYFLADGGTVWLEVPEDSLPEELGGGMHQLR 506
Query: 454 ATIRER-KFS----LDTPSKI----HTPASSDTAL----------GTRLLENGEENI--- 491
+R+ +F + P + PA +AL GT LE E+++
Sbjct: 507 PILRDYGRFGYLEGMLNPCPLCLRRSLPAWYKSALFNELYFLADGGTVWLEVPEDSLPEE 566
Query: 492 ---------------GQFLYLEGA-------------EYVMYNTYDVHFYSSFALVMLFP 523
G+F YLEG EY MYNTYDVHFY+SFALVML+P
Sbjct: 567 LGGGMHQLRPILRDYGRFGYLEGMAAGGLQYGWGWGQEYRMYNTYDVHFYASFALVMLWP 626
Query: 524 KLELSIQRDFAAAVMMHDPGTMKIMSDGKWVARKCLGAVPHDIGL--DDPWFEINSYNLF 581
KLELS+Q D A A + D + + G K +PHDIG D+PW +N+Y +
Sbjct: 627 KLELSLQYDMALATLREDLTRRRYLMSGVIAPVKRRNVIPHDIGDPDDEPWLRVNAYLIH 686
Query: 582 NSSRWKDLNSKFVLQVYRDFVATGDKNFARAVWPSVYIAMAYMEQFDKDGDGMIENEGFP 641
+++ WKDLN KFVLQVYRD+ TGD+ F + +WP M +FDKD DG+IEN G+
Sbjct: 687 DTADWKDLNLKFVLQVYRDYYLTGDQGFLKDMWPVCLAVMESEMKFDKDHDGLIENGGYA 746
Query: 642 DQTYDAWSANGVSAYCGGLWVAALQAASALANDVGDHASASYFWVRYQKAKAVYDS-LWN 700
DQTYD W G SAYCGGLW+AA+ +A G F + + Y+ LWN
Sbjct: 747 DQTYDGWVTTGPSAYCGGLWLAAVAVMVQMAALCGAQDVQEKFTSILTRGQEAYERLLWN 806
Query: 701 GSYFNYDNSDGSSSTSIQADQLAGQWYARACGL----LPIADEAKVKKALTKIYDFNVLK 756
G Y+NYD+S S SI +DQ AGQW+ +ACGL + V +AL I++FNV
Sbjct: 807 GRYYNYDSSSQPQSRSIMSDQCAGQWFLKACGLGEGDTEVFPTPHVVRALQTIFEFNVQA 866
Query: 757 VKGGMCGAMNGMQPDGRIDMSGLQAREIWPGVTYGLAASMIQEEMVDMAFQTAAGVYEVA 816
GG GA+NGMQP G D S +Q+ E+W GV YGLAA+MIQE + F+TA G Y
Sbjct: 867 FAGGAMGAVNGMQPHGVPDRSSVQSDEVWVGVVYGLAATMIQEGLTWEGFRTAEGCYRTV 926
Query: 817 WSEDGLGYSFQTPESWNNNDEYRSLCYMRPLTIWAM 852
W LG +FQTPE++ +RSL YMRPL+IWAM
Sbjct: 927 WER--LGLAFQTPEAYCQQRVFRSLAYMRPLSIWAM 960
>gi|24308251|ref|NP_065995.1| non-lysosomal glucosylceramidase [Homo sapiens]
gi|143018392|sp|Q9HCG7.2|GBA2_HUMAN RecName: Full=Non-lysosomal glucosylceramidase; Short=NLGase;
AltName: Full=Beta-glucocerebrosidase 2;
Short=Beta-glucosidase 2; AltName:
Full=Glucosylceramidase 2
gi|15030210|gb|AAH11363.1| Glucosidase, beta (bile acid) 2 [Homo sapiens]
gi|16215453|emb|CAC83792.1| bile acid beta-glucosidase [Homo sapiens]
gi|21739906|emb|CAD38976.1| hypothetical protein [Homo sapiens]
gi|119578752|gb|EAW58348.1| glucosidase, beta (bile acid) 2, isoform CRA_d [Homo sapiens]
gi|119578753|gb|EAW58349.1| glucosidase, beta (bile acid) 2, isoform CRA_d [Homo sapiens]
Length = 927
Score = 554 bits (1428), Expect = e-155, Method: Compositional matrix adjust.
Identities = 330/818 (40%), Positives = 453/818 (55%), Gaps = 64/818 (7%)
Query: 52 LAHIGFRLYRY----SKQEEAKGKVPVFNVFRKHHITSDQGIPLGGIGAGSIGRSYRGEF 107
+ HIG L RY ++ + K P ++ + G PLGGIG G+I R +RG+F
Sbjct: 112 IKHIGMGL-RYLQWWYRKTHVEKKTPFIDMINSVPLRQIYGCPLGGIGGGTITRGWRGQF 170
Query: 108 QRFKLFHGICDDAPVLANQFSVFVSRPNGEKFSSVLCPRSPGVPKKNTDSGIESWDWNLK 167
R++L G+ V+A+QF+V + R + VL P V + SW+W L
Sbjct: 171 CRWQLNPGMYQHRTVIADQFTVCLRREGQTVYQQVLSLERPSV--------LRSWNWGLC 222
Query: 168 GENCTYHALFPRAWTVYDGEPDPELRIVCRQISPFIPHNYKESSFPASVFTFTLSNSGQT 227
G YHAL+PRAWTVY P + + CRQI+P +PH+Y++SS P VF + + N G
Sbjct: 223 GYFAFYHALYPRAWTVYQ-LPGQNVTLTCRQITPILPHDYQDSSLPVGVFVWDVENEGDE 281
Query: 228 SADVTLLFTWANSVAGDSGLSGHHFNSK-TMTKDG--VHGLTLHHRTANGRPPVTFAVAA 284
+ DV+++F+ N + G G +N + + G V GL LHH T P T AVAA
Sbjct: 282 ALDVSIMFSMRNGLGGGDDAPGGLWNEPFCLERSGETVRGLLLHHPTLPN--PYTMAVAA 339
Query: 285 EETADVHVSECPCFLLSGNSKGITAKDMWNEIKKHGSFDHLDNDKTSPSEPGSSIGAAIA 344
TA V+ F T + +W ++ + G D +++P++ G I A+
Sbjct: 340 RVTAATTVTHITAF-----DPDSTGQQVWQDLLQDGQLDS-PTGQSTPTQKGVGIAGAVC 393
Query: 345 ASLTI-PSGSTRSVTFSLAWDCPEVKFFEK--VYHRRYTKFYGTLGDSAARIAHDAILEH 401
S + P G R + FSLAWD P + F K V++RRYT+F+G GD+A ++H A+ +
Sbjct: 394 VSSKLRPRGQCR-LEFSLAWDMPRIMFGAKGQVHYRRYTRFFGQDGDAAPALSHYALCRY 452
Query: 402 AKWECEIEAWQRPILEDKRFPEWYPITLFNELYYLNAGGTIWTDGSPPMQSLATIRERKF 461
A+WE I AWQ P+L+D+ P WY LFNELY+L GGT+W L + +
Sbjct: 453 AEWEERISAWQSPVLDDRSLPAWYKSALFNELYFLADGGTVW---------LEVLED--- 500
Query: 462 SLDTPSKIHTPASSDTALGTRL--LENGEENIGQFLYLEGAEYVMYNTYDVHFYSSFALV 519
S LG + L + G+F YLEG EY MYNTYDVHFY+SFAL+
Sbjct: 501 ------------SLPEELGRNMCHLRPTLRDYGRFGYLEGQEYRMYNTYDVHFYASFALI 548
Query: 520 MLFPKLELSIQRDFAAAVMMHDPGTMKIMSDGKWVARKCLGAVPHDIGL--DDPWFEINS 577
ML+PKLELS+Q D A A + D + + G K +PHDIG D+PW +N+
Sbjct: 549 MLWPKLELSLQYDMALATLREDLTRRRYLMSGVMAPVKRRNVIPHDIGDPDDEPWLRVNA 608
Query: 578 YNLFNSSRWKDLNSKFVLQVYRDFVATGDKNFARAVWPSVYIAMAYMEQFDKDGDGMIEN 637
Y + +++ WKDLN KFVLQVYRD+ TGD+NF + +WP M +FDKD DG+IEN
Sbjct: 609 YLIHDTADWKDLNLKFVLQVYRDYYLTGDQNFLKDMWPVCLAVMESEMKFDKDHDGLIEN 668
Query: 638 EGFPDQTYDAWSANGVSAYCGGLWVAALQAASALANDVGDHASASYFWVRYQKAKAVYDS 697
G+ DQTYD W G SAYCGGLW+AA+ +A G F + + Y+
Sbjct: 669 GGYADQTYDGWVTTGPSAYCGGLWLAAVAVMVQMAALCGAQDIQDKFSSILSRGQEAYER 728
Query: 698 -LWNGSYFNYDNSDGSSSTSIQADQLAGQWYARACGL----LPIADEAKVKKALTKIYDF 752
LWNG Y+NYD+S S S+ +DQ AGQW+ +ACGL + V +AL I++
Sbjct: 729 LLWNGRYYNYDSSSRPQSRSVMSDQCAGQWFLKACGLGEGDTEVFPTQHVVRALQTIFEL 788
Query: 753 NVLKVKGGMCGAMNGMQPDGRIDMSGLQAREIWPGVTYGLAASMIQEEMVDMAFQTAAGV 812
NV GG GA+NGMQP G D S +Q+ E+W GV YGLAA+MIQE + FQTA G
Sbjct: 789 NVQAFAGGAMGAVNGMQPHGVPDKSSVQSDEVWVGVVYGLAATMIQEGLTWEGFQTAEGC 848
Query: 813 YEVAWSEDGLGYSFQTPESWNNNDEYRSLCYMRPLTIW 850
Y W LG +FQTPE++ +RSL YMRPL+IW
Sbjct: 849 YRTVWER--LGLAFQTPEAYCQQRVFRSLAYMRPLSIW 884
>gi|14042883|dbj|BAB55430.1| unnamed protein product [Homo sapiens]
Length = 927
Score = 554 bits (1427), Expect = e-155, Method: Compositional matrix adjust.
Identities = 329/818 (40%), Positives = 453/818 (55%), Gaps = 64/818 (7%)
Query: 52 LAHIGFRLYRY----SKQEEAKGKVPVFNVFRKHHITSDQGIPLGGIGAGSIGRSYRGEF 107
+ HIG L RY ++ + K P ++ + G PLGGIG G+I R +RG+F
Sbjct: 112 IKHIGMGL-RYLQWWYRKTHVEKKTPFIDMINSVPLRQIYGCPLGGIGGGTITRGWRGQF 170
Query: 108 QRFKLFHGICDDAPVLANQFSVFVSRPNGEKFSSVLCPRSPGVPKKNTDSGIESWDWNLK 167
R++L G+ V+A+QF+V + R + VL P V + SW+W L+
Sbjct: 171 CRWQLNPGMYQHRTVIADQFTVCLRREGQTVYQQVLSLERPSV--------LRSWNWGLR 222
Query: 168 GENCTYHALFPRAWTVYDGEPDPELRIVCRQISPFIPHNYKESSFPASVFTFTLSNSGQT 227
G YHAL+PRAWTVY P + + CRQI+P +PH+Y++SS P VF + + N G
Sbjct: 223 GYFAFYHALYPRAWTVYQ-LPGQNVTLTCRQITPILPHDYQDSSLPVGVFVWDVENEGDE 281
Query: 228 SADVTLLFTWANSVAGDSGLSGHHFNSK-TMTKDG--VHGLTLHHRTANGRPPVTFAVAA 284
+ DV+++F+ N + G G +N + + G V GL LHH T P T AVAA
Sbjct: 282 ALDVSIMFSMRNGLGGGDDAPGGLWNEPFCLERSGETVRGLLLHHPTLPN--PYTMAVAA 339
Query: 285 EETADVHVSECPCFLLSGNSKGITAKDMWNEIKKHGSFDHLDNDKTSPSEPGSSIGAAIA 344
TA V+ F T + +W ++ + G D +++P++ G I A+
Sbjct: 340 RVTAATTVTHITAF-----DPDSTGQQVWQDLLQDGQLDS-PTGQSTPTQKGVGIAGAVC 393
Query: 345 ASLTI-PSGSTRSVTFSLAWDCPEVKFFEK--VYHRRYTKFYGTLGDSAARIAHDAILEH 401
S + P G R + FSLAWD P + F K V++RRYT+F+G GD+A ++H A+ +
Sbjct: 394 VSSKLRPRGQCR-LEFSLAWDMPRIMFGAKGQVHYRRYTRFFGQDGDAAPALSHYALCRY 452
Query: 402 AKWECEIEAWQRPILEDKRFPEWYPITLFNELYYLNAGGTIWTDGSPPMQSLATIRERKF 461
A+WE I AWQ P+L+D+ P WY LF ELY+L GGT+W L + +
Sbjct: 453 AEWEERISAWQSPVLDDRSLPAWYKSALFKELYFLADGGTVW---------LEVLED--- 500
Query: 462 SLDTPSKIHTPASSDTALGTRL--LENGEENIGQFLYLEGAEYVMYNTYDVHFYSSFALV 519
S LG + L + G+F YLEG EY MYNTYDVHFY+SFAL+
Sbjct: 501 ------------SLPEELGRNMCHLRPTLRDYGRFGYLEGQEYRMYNTYDVHFYASFALI 548
Query: 520 MLFPKLELSIQRDFAAAVMMHDPGTMKIMSDGKWVARKCLGAVPHDIGL--DDPWFEINS 577
ML+PKLELS+Q D A A + D + + G K +PHDIG D+PW +N+
Sbjct: 549 MLWPKLELSLQYDMALATLREDLTRRRYLMSGVMAPVKRRNVIPHDIGDPDDEPWLRVNA 608
Query: 578 YNLFNSSRWKDLNSKFVLQVYRDFVATGDKNFARAVWPSVYIAMAYMEQFDKDGDGMIEN 637
Y + +++ WKDLN KFVLQVYRD+ TGD+NF + +WP M +FDKD DG+IEN
Sbjct: 609 YLIHDTADWKDLNLKFVLQVYRDYYLTGDQNFLKDMWPVCLAVMESEMKFDKDHDGLIEN 668
Query: 638 EGFPDQTYDAWSANGVSAYCGGLWVAALQAASALANDVGDHASASYFWVRYQKAKAVYDS 697
G+ DQTYD W G SAYCGGLW+AA+ +A G F + + Y+
Sbjct: 669 GGYADQTYDGWVTTGPSAYCGGLWLAAVAVMVQMAALCGAQDIQDKFSSILSRGQEAYER 728
Query: 698 -LWNGSYFNYDNSDGSSSTSIQADQLAGQWYARACGL----LPIADEAKVKKALTKIYDF 752
LWNG Y+NYD+S S S+ +DQ AGQW+ +ACGL + V +AL I++
Sbjct: 729 LLWNGRYYNYDSSSRPQSRSVMSDQCAGQWFLKACGLGEGDTEVFPTQHVVRALQTIFEL 788
Query: 753 NVLKVKGGMCGAMNGMQPDGRIDMSGLQAREIWPGVTYGLAASMIQEEMVDMAFQTAAGV 812
NV GG GA+NGMQP G D S +Q+ E+W GV YGLAA+MIQE + FQTA G
Sbjct: 789 NVQAFAGGAMGAVNGMQPHGVPDKSSVQSDEVWVGVVYGLAATMIQEGLTWEGFQTAEGC 848
Query: 813 YEVAWSEDGLGYSFQTPESWNNNDEYRSLCYMRPLTIW 850
Y W LG +FQTPE++ +RSL YMRPL+IW
Sbjct: 849 YRTVWER--LGLAFQTPEAYCQQRVFRSLAYMRPLSIW 884
>gi|397519481|ref|XP_003829887.1| PREDICTED: non-lysosomal glucosylceramidase isoform 1 [Pan
paniscus]
Length = 927
Score = 553 bits (1424), Expect = e-154, Method: Compositional matrix adjust.
Identities = 328/818 (40%), Positives = 453/818 (55%), Gaps = 64/818 (7%)
Query: 52 LAHIGFRLYRY----SKQEEAKGKVPVFNVFRKHHITSDQGIPLGGIGAGSIGRSYRGEF 107
+ HIG L RY ++ + + K P ++ + G PLGGIG G+I R +RG+F
Sbjct: 112 IKHIGMGL-RYLQWWYRKTQVEKKTPFIDMINSVPLRQIYGCPLGGIGGGTITRGWRGQF 170
Query: 108 QRFKLFHGICDDAPVLANQFSVFVSRPNGEKFSSVLCPRSPGVPKKNTDSGIESWDWNLK 167
R++L G+ V+A+QF+V + R + VL P V + SW+W L
Sbjct: 171 CRWQLNPGMYQHRTVIADQFTVCLRREGQTVYQQVLSLERPSV--------LRSWNWGLC 222
Query: 168 GENCTYHALFPRAWTVYDGEPDPELRIVCRQISPFIPHNYKESSFPASVFTFTLSNSGQT 227
G YHAL+PRAWTVY P + + CRQI+P +PH+Y++SS P VF + + N G
Sbjct: 223 GYFAFYHALYPRAWTVYQ-LPGQNVTLTCRQITPILPHDYQDSSLPVGVFVWDVENEGDE 281
Query: 228 SADVTLLFTWANSVAGDSGLSGHHFNSK-TMTKDG--VHGLTLHHRTANGRPPVTFAVAA 284
+ DV+++F+ N + G G +N + + G V GL LHH T P T AVAA
Sbjct: 282 ALDVSIMFSMRNGLGGGDDAPGGLWNEPFCLERSGETVRGLLLHHPTLPN--PYTMAVAA 339
Query: 285 EETADVHVSECPCFLLSGNSKGITAKDMWNEIKKHGSFDHLDNDKTSPSEPGSSIGAAIA 344
TA V+ F T + +W ++ + G D +++P++ G I A+
Sbjct: 340 RVTAATTVTHITAF-----DPDSTGQQVWQDLLQDGQLDS-PTGQSTPTQKGVGIAGAVC 393
Query: 345 ASLTI-PSGSTRSVTFSLAWDCPEVKFFEK--VYHRRYTKFYGTLGDSAARIAHDAILEH 401
S + P G R + FSLAWD P + F K V++RRYT+F+G GD+A ++H A+ +
Sbjct: 394 VSSKLRPRGQCR-LEFSLAWDMPRIMFGAKGQVHYRRYTRFFGQDGDAAPALSHYALCRY 452
Query: 402 AKWECEIEAWQRPILEDKRFPEWYPITLFNELYYLNAGGTIWTDGSPPMQSLATIRERKF 461
+WE I AWQ P+L+D+ P WY LFNELY+L GGT+W L + +
Sbjct: 453 TEWEERISAWQSPVLDDRSLPAWYKSALFNELYFLADGGTVW---------LEVLED--- 500
Query: 462 SLDTPSKIHTPASSDTALGTRL--LENGEENIGQFLYLEGAEYVMYNTYDVHFYSSFALV 519
S LG + L + G+F YLEG EY MYNTYDVHFY+SFAL+
Sbjct: 501 ------------SLPEELGRNMCHLHPTLRDYGRFGYLEGQEYRMYNTYDVHFYASFALI 548
Query: 520 MLFPKLELSIQRDFAAAVMMHDPGTMKIMSDGKWVARKCLGAVPHDIGL--DDPWFEINS 577
ML+PKLELS+Q D A A + D + + G K +PHDIG D+PW +N+
Sbjct: 549 MLWPKLELSLQYDMALATLREDLTRRRYLMSGVMAPVKRRNVIPHDIGDPDDEPWLRVNA 608
Query: 578 YNLFNSSRWKDLNSKFVLQVYRDFVATGDKNFARAVWPSVYIAMAYMEQFDKDGDGMIEN 637
Y + +++ WKDLN KFVLQVYRD+ TGD+NF + +WP M +FDKD DG+IEN
Sbjct: 609 YLIHDTADWKDLNLKFVLQVYRDYYLTGDQNFLKDMWPVCLAVMESEMKFDKDHDGLIEN 668
Query: 638 EGFPDQTYDAWSANGVSAYCGGLWVAALQAASALANDVGDHASASYFWVRYQKAKAVYDS 697
G+ DQTYD W G SAYCGGLW+AA+ +A G F + + Y+
Sbjct: 669 GGYADQTYDGWVTTGPSAYCGGLWLAAVAVMVQMAALCGAQDIQDKFSSILSRGQEAYER 728
Query: 698 -LWNGSYFNYDNSDGSSSTSIQADQLAGQWYARACGL----LPIADEAKVKKALTKIYDF 752
LWNG Y+NYD+S S S+ +DQ AGQW+ +ACGL + V +AL +++
Sbjct: 729 LLWNGRYYNYDSSSRPQSRSVMSDQCAGQWFLKACGLGEGDTEVFPTQHVVRALQTVFEL 788
Query: 753 NVLKVKGGMCGAMNGMQPDGRIDMSGLQAREIWPGVTYGLAASMIQEEMVDMAFQTAAGV 812
NV GG GA+NGMQP G D S +Q+ E+W GV YGLAA+MIQE + FQTA G
Sbjct: 789 NVQAFAGGAMGAVNGMQPHGVPDKSSVQSDEVWVGVVYGLAATMIQEGLTWEGFQTAEGC 848
Query: 813 YEVAWSEDGLGYSFQTPESWNNNDEYRSLCYMRPLTIW 850
Y W LG +FQTPE++ +RSL YMRPL+IW
Sbjct: 849 YRTVWER--LGLAFQTPEAYCQQRVFRSLAYMRPLSIW 884
>gi|55633173|ref|XP_520565.1| PREDICTED: non-lysosomal glucosylceramidase isoform 6 [Pan
troglodytes]
gi|410227830|gb|JAA11134.1| talin 1 [Pan troglodytes]
Length = 927
Score = 553 bits (1424), Expect = e-154, Method: Compositional matrix adjust.
Identities = 329/818 (40%), Positives = 452/818 (55%), Gaps = 64/818 (7%)
Query: 52 LAHIGFRLYRY----SKQEEAKGKVPVFNVFRKHHITSDQGIPLGGIGAGSIGRSYRGEF 107
+ HIG L RY ++ + + K P ++ + G PLGGIG G+I R +RG+F
Sbjct: 112 IKHIGMGL-RYLQWWYRKTQVEKKTPFIDMINSVPLRQIYGCPLGGIGGGTITRGWRGQF 170
Query: 108 QRFKLFHGICDDAPVLANQFSVFVSRPNGEKFSSVLCPRSPGVPKKNTDSGIESWDWNLK 167
R++L G+ V+A+QF+V + R + VL P V + SW+W L
Sbjct: 171 CRWQLNPGMYQHRTVIADQFTVCLRREGQTVYQQVLSLERPSV--------LRSWNWGLC 222
Query: 168 GENCTYHALFPRAWTVYDGEPDPELRIVCRQISPFIPHNYKESSFPASVFTFTLSNSGQT 227
G YHAL+PRAWTVY P + + CRQI+P +PH+Y++SS P VF + + N G
Sbjct: 223 GYFAFYHALYPRAWTVYQ-LPGQNVTLTCRQITPILPHDYQDSSLPVGVFVWDVENEGDE 281
Query: 228 SADVTLLFTWANSVAGDSGLSGHHFNSK-TMTKDG--VHGLTLHHRTANGRPPVTFAVAA 284
+ DV+++F+ N + G G +N + + G V GL LHH T P T AVAA
Sbjct: 282 ALDVSIMFSMRNGLGGGDDAPGGLWNEPFCLERSGETVRGLLLHHPTLPN--PYTMAVAA 339
Query: 285 EETADVHVSECPCFLLSGNSKGITAKDMWNEIKKHGSFDHLDNDKTSPSEPGSSIGAAIA 344
TA V+ F T + +W ++ + G D +++P+ G I A+
Sbjct: 340 RVTAATTVTHITAF-----DPDSTGQQVWQDLLQDGQLDS-PTGQSTPTHKGVGIAGAVC 393
Query: 345 ASLTI-PSGSTRSVTFSLAWDCPEVKFFEK--VYHRRYTKFYGTLGDSAARIAHDAILEH 401
S + P G R + FSLAWD P + F K V++RRYT+F+G GD+A ++H A+ +
Sbjct: 394 VSSKLRPRGQCR-LEFSLAWDMPRIMFGAKGQVHYRRYTRFFGQDGDAAPALSHYALCRY 452
Query: 402 AKWECEIEAWQRPILEDKRFPEWYPITLFNELYYLNAGGTIWTDGSPPMQSLATIRERKF 461
+WE I AWQ P+L+D+ P WY LFNELY+L GGT+W L + +
Sbjct: 453 TEWEERISAWQSPVLDDRSLPAWYKSALFNELYFLADGGTVW---------LEVLED--- 500
Query: 462 SLDTPSKIHTPASSDTALGTRL--LENGEENIGQFLYLEGAEYVMYNTYDVHFYSSFALV 519
S LG + L + G+F YLEG EY MYNTYDVHFY+SFAL+
Sbjct: 501 ------------SLPEELGRNMCHLRPTLRDYGRFGYLEGQEYRMYNTYDVHFYASFALI 548
Query: 520 MLFPKLELSIQRDFAAAVMMHDPGTMKIMSDGKWVARKCLGAVPHDIGL--DDPWFEINS 577
ML+PKLELS+Q D A A + D + + G K +PHDIG D+PW +N+
Sbjct: 549 MLWPKLELSLQYDMALATLREDLTRRRYLMSGVMAPVKRRNVIPHDIGDPDDEPWLRVNA 608
Query: 578 YNLFNSSRWKDLNSKFVLQVYRDFVATGDKNFARAVWPSVYIAMAYMEQFDKDGDGMIEN 637
Y + +++ WKDLN KFVLQVYRD+ TGD+NF + +WP M +FDKD DG+IEN
Sbjct: 609 YLIHDTADWKDLNLKFVLQVYRDYYLTGDQNFLKDMWPVCLAVMESEMKFDKDHDGLIEN 668
Query: 638 EGFPDQTYDAWSANGVSAYCGGLWVAALQAASALANDVGDHASASYFWVRYQKAKAVYDS 697
G+ DQTYD W G SAYCGGLW+AA+ +A G F + + Y+
Sbjct: 669 GGYADQTYDGWVTTGPSAYCGGLWLAAVAVMVQMAALCGAQDIQDKFSSILSRGQEAYER 728
Query: 698 -LWNGSYFNYDNSDGSSSTSIQADQLAGQWYARACGL----LPIADEAKVKKALTKIYDF 752
LWNG Y+NYD+S S S+ +DQ AGQW+ +ACGL + V +AL I++
Sbjct: 729 LLWNGRYYNYDSSSRPQSRSVMSDQCAGQWFLKACGLGEGDTEVFPTQHVVRALQTIFEL 788
Query: 753 NVLKVKGGMCGAMNGMQPDGRIDMSGLQAREIWPGVTYGLAASMIQEEMVDMAFQTAAGV 812
NV GG GA+NGMQP G D S +Q+ E+W GV YGLAA+MIQE + FQTA G
Sbjct: 789 NVQAFAGGAMGAVNGMQPHGVPDKSSVQSDEVWVGVVYGLAATMIQEGLTWEGFQTAEGC 848
Query: 813 YEVAWSEDGLGYSFQTPESWNNNDEYRSLCYMRPLTIW 850
Y W LG +FQTPE++ +RSL YMRPL+IW
Sbjct: 849 YRTVWER--LGLAFQTPEAYCQQRVFRSLAYMRPLSIW 884
>gi|432110769|gb|ELK34246.1| Non-lysosomal glucosylceramidase [Myotis davidii]
Length = 868
Score = 552 bits (1422), Expect = e-154, Method: Compositional matrix adjust.
Identities = 325/820 (39%), Positives = 444/820 (54%), Gaps = 79/820 (9%)
Query: 49 FRHLAHIGFRLYRYSKQE-EAKGKVPVFNVFRKHHITSDQGIPLGGIGAGSIGRSYRGEF 107
+HL +G R ++ Q+ + + K P ++ + G PLGGIG G+I R +RG+F
Sbjct: 104 IKHLG-MGLRYLQWWYQKTQVEKKKPFIDLINSVPLRQIYGCPLGGIGGGTITRGWRGQF 162
Query: 108 QRFKLFHGICDDAPVLANQFSVFVSRPNGEKFSSVLCPRSPGVPKKNTDSGIESWDWNLK 167
R++L G+ V+A+QF+V + R + VL P V + SW+W L
Sbjct: 163 CRWQLNPGMYQHRTVIADQFTVCLRREGQTVYQQVLSVERPSV--------LRSWNWGLC 214
Query: 168 GENCTYHALFPRAWTVYDGEPDPELRIVCRQISPFIPHNYKESSFPASVFTFTLSNSGQT 227
G YHAL+PRAWTVY P + + CRQI+P +PH+Y++SS P VF + + N G
Sbjct: 215 GFFAFYHALYPRAWTVYQ-LPGQNVTLTCRQITPVLPHDYQDSSLPVGVFVWDVENEGDE 273
Query: 228 SADVTLLFTWANSVAGDSGLSGHHFNSKTMTKDGVHGLTLHHRTANGRPPVTFAVAAEET 287
+ DV+++F+ N + G SG +N L H T VT A +
Sbjct: 274 ALDVSIMFSMRNGLGGGDDASGGLWNEP---------FCLEHTT------VTHITAFDPD 318
Query: 288 ADVHVSECPCFLLSGNSKGITAKDMWNEIKKHGSFDHLDNDKTSPSEPGSSIGAAIAASL 347
T + +W ++ + G D +++ S+ G I A+ +
Sbjct: 319 G-------------------TGQQVWQDLLQDGQLDS-PAGRSTLSQKGVGIAGAVCVTS 358
Query: 348 TIPSGSTRSVTFSLAWDCPEVKFFEK--VYHRRYTKFYGTLGDSAARIAHDAILEHAKWE 405
+P + FSLAWD P + F + VY+RRYT+F+G+ GD+A ++H A+ +A WE
Sbjct: 359 KLPPRGRCRMEFSLAWDMPRIMFGAQGQVYYRRYTRFFGSDGDAAPALSHYALCRYADWE 418
Query: 406 CEIEAWQRPILEDKRFPEWYPITLFNELYYLNAGGTIWTDGSPPMQSLATIRERKFSLDT 465
I AWQ P+L+D+ P WY LFNELY+L GGT+W + P SL
Sbjct: 419 ERISAWQSPVLDDRSLPAWYKSALFNELYFLADGGTVWLE--VPEDSL------------ 464
Query: 466 PSKIHTPASSDTALGTRLLENGEENIGQFLYLEGAEYVMYNTYDVHFYSSFALVMLFPKL 525
P ++ L T G+F YLEG EY MYNTYDVHFY+SFAL+ML+PKL
Sbjct: 465 PEELGGNMCQLRPLLT--------EYGRFGYLEGQEYRMYNTYDVHFYASFALIMLWPKL 516
Query: 526 ELSIQRDFAAAVMMHDPGTMKIMSDGKWVARKCLGAVPHDIG--LDDPWFEINSYNLFNS 583
ELS+Q D A A + D + + G K +PHDIG D+PW +N+Y + ++
Sbjct: 517 ELSLQYDMALATLGEDLTWRRYLMSGVMAPVKRRNVIPHDIGDPDDEPWLRVNAYVIHDT 576
Query: 584 SRWKDLNSKFVLQVYRDFVATGDKNFARAVWPSVYIAMAYMEQFDKDGDGMIENEGFPDQ 643
+ WKDLN KFVLQVYRD+ TGD+ F R +WP M +FDKD DG+IEN G+ DQ
Sbjct: 577 ADWKDLNLKFVLQVYRDYYLTGDQGFLRDMWPVCLAVMESEMKFDKDQDGLIENGGYADQ 636
Query: 644 TYDAWSANGVSAYCGGLWVAALQAASALANDVGDHASASYFWVRYQKAKAVYDS-LWNGS 702
TYDAW G SAYCGGLW+AA+ +A G F + + Y+ LWNG
Sbjct: 637 TYDAWVTTGPSAYCGGLWLAAVAVMVQMAALCGTQDIQEKFSSILSRGQQAYERLLWNGR 696
Query: 703 YFNYDNSDGSSSTSIQADQLAGQWYARACGL----LPIADEAKVKKALTKIYDFNVLKVK 758
Y+NYD S S SI +DQ AGQW+ RA GL + V +AL I++FNV
Sbjct: 697 YYNYDCSSRPQSCSIMSDQCAGQWFLRASGLGEGETEVFPTQHVVRALQTIFEFNVQAFA 756
Query: 759 GGMCGAMNGMQPDGRIDMSGLQAREIWPGVTYGLAASMIQEEMVDMAFQTAAGVYEVAWS 818
GG GA+NGMQP G D S +Q+ E+W GV YGLAA+MIQE + FQTA G Y W
Sbjct: 757 GGTMGAVNGMQPQGVPDRSSVQSDEVWVGVVYGLAATMIQEGLTWEGFQTAEGCYRTVWE 816
Query: 819 EDGLGYSFQTPESWNNNDEYRSLCYMRPLTIWAMQWALTK 858
GLG FQTPE++ +RSL YMRPL+IWAMQ AL +
Sbjct: 817 RLGLG--FQTPEAYCQKRVFRSLAYMRPLSIWAMQLALQQ 854
>gi|440799031|gb|ELR20092.1| glucosylceramidase [Acanthamoeba castellanii str. Neff]
Length = 860
Score = 552 bits (1422), Expect = e-154, Method: Compositional matrix adjust.
Identities = 368/909 (40%), Positives = 513/909 (56%), Gaps = 122/909 (13%)
Query: 3 NGVKEEGENAASMPKV--DPGQPPQLT---WQRKLDSN-------VKVPTGFTLSFQEFR 50
+G +EEGE +P+ + Q P T WQR+L + P TL R
Sbjct: 7 SGQQEEGERI-KVPESTRERAQMPTATDALWQRRLSEQPTRQLRPLTAPLLHTL-----R 60
Query: 51 HLAHIGFRLYRYSKQEEAKGKVPVFNVFRKHHITSDQGIPLGGIGAGSIGRSYRGEFQRF 110
+L +GFRL+R ++ EE G+ PV + S G+P GG+G G +GRS+RG+F+
Sbjct: 61 NLP-LGFRLWRQAR-EERGGRDPVLDASGGQ--ASSSGLPAGGMGGGCVGRSWRGDFEPG 116
Query: 111 KLFHGICDDAPVL-ANQFSVFVSRPNGEKFSSVLCPRSPGVPKKNTDSGIESWDWNLKGE 169
+ APV+ ANQFSVF+ P+GE + + +PG+P + + +WDW+ GE
Sbjct: 117 LILPNGRGQAPVVPANQFSVFIE-PDGEDSEATVL--NPGIPPTDR---LGAWDWSFVGE 170
Query: 170 NCTYHALFPRAWTVYDGEPDPELRIVCRQISPFIPHNYKESSFPASVFTFTLSNSGQTSA 229
Y ALFPRAWTVYD EP+P LR+ RQISP I +NYKESS PA VF +TL N Q
Sbjct: 171 ESYYSALFPRAWTVYD-EPEPRLRLTSRQISPVIGNNYKESSLPACVFVWTLENLDQEDE 229
Query: 230 -DVTLLFTWANSVAGDSGLSGHHFNSKTMTKDGVHGLTLHH----------RTANGRPPV 278
+L+FTW N S G H N+ T+ + + H +T P+
Sbjct: 230 ITASLMFTWQNGTGAASDALGGHANTP-FTRPHPERVVMDHTQWRVDERGGKTHASSEPL 288
Query: 279 TFAVAAEET-ADVHVSECPCFLLSGNSKGITAKDMWNEIKKHGSFDHLDNDKTSPSEPGS 337
+FA+AA +D H++ C+ + NS G ++W + K+ G+ + + S+ G
Sbjct: 289 SFAIAASSPDSDAHITVRSCW--ATNSDG---AEVWKDFKRSGALEENWDALGKASQEGQ 343
Query: 338 SIGAAIAASLTIPSGSTRSVTFSLAWDCPEVKFFE-KVYHRRYTKFYGTLGDSAARIAHD 396
++GAAIA +TIP G R V F+LAWD P V+F + Y RRYT+FY GDSAA++A D
Sbjct: 344 TLGAAIAVRVTIPPGEHRQVEFALAWDHPVVEFGAGRRYTRRYTRFYSQHGDSAAQLACD 403
Query: 397 AILEHAKWECEIEAWQRPILEDKRFPEWYPITLFNELYYLNAGGTIWTDGSPPMQSLATI 456
A++++ +WE IE WQ PIL+DKR P GGT+W +
Sbjct: 404 ALVQYPQWERLIEQWQAPILDDKRLP--------------GCGGTVWLQDELSAKEEKPR 449
Query: 457 RERKFSLDTPSKIHTPASSDTALGTRLLENGEENIGQFLYLEGAEYVMYNTYDVHFYSSF 516
RE++ T E + ++G++ LE E+ +YN D FY+SF
Sbjct: 450 REKR-------------------ETDGQEEEQSDVGRWACLESQEHRLYNASDALFYASF 490
Query: 517 ALVMLFPKLELSIQRDFAAAVMMHDPGTMKIMSDGKWVARKCLGAVPHDIGL--DDPWFE 574
AL+ +P+L+L++QR+ AAA + D + +++ G+ K GAVP+D+G DDPW
Sbjct: 491 ALIANWPQLQLALQREVAAATLREDSRMVDVLATGRQAPNKVRGAVPYDMGTPADDPWVA 550
Query: 575 INSYNLFNSSRWK---------------------DLNSKFVLQVYRDFVATGDKNFARAV 613
+N+YN+ +S WK DLNSKFVL VYRDFVAT DK F V
Sbjct: 551 LNAYNIHETSSWKGKLGLLPASHLSPSTELTEAADLNSKFVLGVYRDFVATQDKRFLADV 610
Query: 614 WPSVYIAMAYMEQFDKDGDGMIENEGFPDQTYDAWSANGVSAYCGGLWVAALQAASAL-- 671
WP+V A+ +++QFD++ DGMIENEG PDQTY +W+ GGLWVAAL AA A+
Sbjct: 611 WPAVDAALRHLQQFDRNKDGMIENEGLPDQTYHSWT--------GGLWVAALSAACAMAQ 662
Query: 672 ANDVGDHASASYFWVRYQKAKAVY-DSLWNGS-YFNYDNSDGSSSTSIQADQLAGQWYAR 729
A + D + A ++ +Q A VY +SLWN S YFNYD S + SIQADQL GQWYAR
Sbjct: 663 ALQLADDSVAQHYRDLHQSAARVYNESLWNPSGYFNYDTSTSTHHDSIQADQLCGQWYAR 722
Query: 730 ACGLLPIADEAKVKKALTKIYDFNVLKVKGGMCGAMNGMQPDGRIDMSGLQAREIWPGVT 789
ACGL IA + ++ AL ++ N + G GA+NGM+PDGR+D + L + E WPG T
Sbjct: 723 ACGLPSIAPDKRIVTALKTVFANNFNE---GKRGAVNGMRPDGRVDYACLHSSEAWPGTT 779
Query: 790 YGLAASMIQEEMVDMAFQTAAGVYEVAWSEDGLGYSFQTPESWNNNDEYRSLCYMRPLTI 849
+ LAA M+ E +VD AF TAAGV + + LGY F+TPE+W+ + YR+L +MRPL I
Sbjct: 780 FALAACMMGEGLVDQAFITAAGVCGSIYED--LGYWFRTPEAWDGSGNYRALGHMRPLAI 837
Query: 850 WAMQWALTK 858
WAMQWA T+
Sbjct: 838 WAMQWAWTQ 846
>gi|221041110|dbj|BAH12232.1| unnamed protein product [Homo sapiens]
Length = 933
Score = 548 bits (1413), Expect = e-153, Method: Compositional matrix adjust.
Identities = 329/824 (39%), Positives = 452/824 (54%), Gaps = 70/824 (8%)
Query: 52 LAHIGFRLYRY----SKQEEAKGKVPVFNVFRKHHITSDQGIPLGGIGAGSIGRSYRGEF 107
+ HIG L RY ++ + K P ++ + G PLGGIG G+I R +RG+F
Sbjct: 112 IKHIGMGL-RYLQWWYRKTHVEKKTPFIDMINSVPLRQIYGCPLGGIGGGTITRGWRGQF 170
Query: 108 QRFKLFHG------ICDDAPVLANQFSVFVSRPNGEKFSSVLCPRSPGVPKKNTDSGIES 161
R++L G + D P+ +F+V + R + VL P V + S
Sbjct: 171 CRWQLNPGMYQHRTVIADQPICPLKFTVCLRRERQTVYQQVLSLERPSV--------LRS 222
Query: 162 WDWNLKGENCTYHALFPRAWTVYDGEPDPELRIVCRQISPFIPHNYKESSFPASVFTFTL 221
W+W L G YHAL+PRAWTVY P + + CRQI+P +PH+Y++SS P VF + +
Sbjct: 223 WNWGLCGYFAFYHALYPRAWTVYQ-LPGQNVTLTCRQITPILPHDYQDSSLPVGVFVWDV 281
Query: 222 SNSGQTSADVTLLFTWANSVAGDSGLSGHHFNSK-TMTKDG--VHGLTLHHRTANGRPPV 278
N G + DV+++F+ N + G G +N + + G V GL LHH T P
Sbjct: 282 ENEGDEALDVSIMFSMRNGLGGGDDAPGGLWNEPFCLERSGETVRGLLLHHPTLPN--PY 339
Query: 279 TFAVAAEETADVHVSECPCFLLSGNSKGITAKDMWNEIKKHGSFDHLDNDKTSPSEPGSS 338
T AVAA TA V+ F T + +W ++ + G D +++P++ G
Sbjct: 340 TMAVAARVTAATTVTHITAF-----DPDSTGQQVWQDLLQDGQLDS-PTGQSTPTQKGVG 393
Query: 339 IGAAIAASLTI-PSGSTRSVTFSLAWDCPEVKFFEK--VYHRRYTKFYGTLGDSAARIAH 395
I A+ S + P G R + FSLAWD P + F K V++RRYT+F+G GD+A ++H
Sbjct: 394 IAGAVCVSSKLRPRGQCR-LEFSLAWDMPRIMFGAKGQVHYRRYTRFFGQDGDAAPALSH 452
Query: 396 DAILEHAKWECEIEAWQRPILEDKRFPEWYPITLFNELYYLNAGGTIWTDGSPPMQSLAT 455
A+ +A+WE I AWQ P+L+D+ P WY LFNELY+L GGT+W L
Sbjct: 453 YALCRYAEWEERISAWQSPVLDDRSLPAWYKSALFNELYFLADGGTVW---------LEV 503
Query: 456 IRERKFSLDTPSKIHTPASSDTALGTRL--LENGEENIGQFLYLEGAEYVMYNTYDVHFY 513
+ + S LG + L + G+F YLEG EY MYNTYDVHFY
Sbjct: 504 LED---------------SLPEELGRNMCHLRPTLRDYGRFGYLEGQEYRMYNTYDVHFY 548
Query: 514 SSFALVMLFPKLELSIQRDFAAAVMMHDPGTMKIMSDGKWVARKCLGAVPHDIGL--DDP 571
+SFAL+ML+PKLELS+Q D A A + D + + G K +PHDIG D+P
Sbjct: 549 ASFALIMLWPKLELSLQYDMALATLREDLTRRRYLMSGVMAPVKRRNVIPHDIGDPDDEP 608
Query: 572 WFEINSYNLFNSSRWKDLNSKFVLQVYRDFVATGDKNFARAVWPSVYIAMAYMEQFDKDG 631
W +N+Y + +++ WKDLN KFVLQVYRD+ TGD+NF + +WP M +FDKD
Sbjct: 609 WLRVNAYLIHDTADWKDLNLKFVLQVYRDYYLTGDQNFLKDMWPVCLAVMESEMKFDKDH 668
Query: 632 DGMIENEGFPDQTYDAWSANGVSAYCGGLWVAALQAASALANDVGDHASASYFWVRYQKA 691
DG+IEN G+ DQTYD W G SAYCGGLW+AA+ +A G F +
Sbjct: 669 DGLIENGGYADQTYDGWVTTGPSAYCGGLWLAAVAVMVQMAALCGAQDIQDKFSSILSRG 728
Query: 692 KAVYDS-LWNGSYFNYDNSDGSSSTSIQADQLAGQWYARACGL----LPIADEAKVKKAL 746
+ Y+ LWNG Y+NYD+S S S+ +DQ AGQW+ +ACGL + V +AL
Sbjct: 729 QEAYERLLWNGRYYNYDSSSRPQSRSVMSDQCAGQWFLKACGLGEGDTEVFPTQHVVRAL 788
Query: 747 TKIYDFNVLKVKGGMCGAMNGMQPDGRIDMSGLQAREIWPGVTYGLAASMIQEEMVDMAF 806
I++ NV GG GA+NGMQP G D S +Q+ E+W GV YGLAA+MIQE + F
Sbjct: 789 QTIFELNVQAFAGGAMGAVNGMQPHGVPDKSSVQSDEVWVGVVYGLAATMIQEGLTWEGF 848
Query: 807 QTAAGVYEVAWSEDGLGYSFQTPESWNNNDEYRSLCYMRPLTIW 850
QTA G Y W LG +FQTPE++ +RSL YMRPL+IW
Sbjct: 849 QTAEGCYRTVWER--LGLAFQTPEAYCQQRVFRSLAYMRPLSIW 890
>gi|332831849|ref|XP_001167923.2| PREDICTED: non-lysosomal glucosylceramidase isoform 4 [Pan
troglodytes]
Length = 933
Score = 547 bits (1409), Expect = e-152, Method: Compositional matrix adjust.
Identities = 328/824 (39%), Positives = 452/824 (54%), Gaps = 70/824 (8%)
Query: 52 LAHIGFRLYRY----SKQEEAKGKVPVFNVFRKHHITSDQGIPLGGIGAGSIGRSYRGEF 107
+ HIG L RY ++ + + K P ++ + G PLGGIG G+I R +RG+F
Sbjct: 112 IKHIGMGL-RYLQWWYRKTQVEKKTPFIDMINSVPLRQIYGCPLGGIGGGTITRGWRGQF 170
Query: 108 QRFKLFHG------ICDDAPVLANQFSVFVSRPNGEKFSSVLCPRSPGVPKKNTDSGIES 161
R++L G + D P+ + +F+V + R + VL P V + S
Sbjct: 171 CRWQLNPGMYQHRTVIADQPICSLKFTVCLRREGQTVYQQVLSLERPSV--------LRS 222
Query: 162 WDWNLKGENCTYHALFPRAWTVYDGEPDPELRIVCRQISPFIPHNYKESSFPASVFTFTL 221
W+W L G YHAL+PRAWTVY P + + CRQI+P +PH+Y++SS P VF + +
Sbjct: 223 WNWGLCGYFAFYHALYPRAWTVYQ-LPGQNVTLTCRQITPILPHDYQDSSLPVGVFVWDV 281
Query: 222 SNSGQTSADVTLLFTWANSVAGDSGLSGHHFNSK-TMTKDG--VHGLTLHHRTANGRPPV 278
N G + DV+++F+ N + G G +N + + G V GL LHH T P
Sbjct: 282 ENEGDEALDVSIMFSMRNGLGGGDDAPGGLWNEPFCLERSGETVRGLLLHHPTLPN--PY 339
Query: 279 TFAVAAEETADVHVSECPCFLLSGNSKGITAKDMWNEIKKHGSFDHLDNDKTSPSEPGSS 338
T AVAA TA V+ F T + +W ++ + G D +++P+ G
Sbjct: 340 TMAVAARVTAATTVTHITAF-----DPDSTGQQVWQDLLQDGQLDS-PTGQSTPTHKGVG 393
Query: 339 IGAAIAASLTI-PSGSTRSVTFSLAWDCPEVKFFEK--VYHRRYTKFYGTLGDSAARIAH 395
I A+ S + P G R + FSLAWD P + F K V++RRYT+F+G GD+A ++H
Sbjct: 394 IAGAVCVSSKLRPRGQCR-LEFSLAWDMPRIMFGAKGQVHYRRYTRFFGQDGDAAPALSH 452
Query: 396 DAILEHAKWECEIEAWQRPILEDKRFPEWYPITLFNELYYLNAGGTIWTDGSPPMQSLAT 455
A+ + +WE I AWQ P+L+D+ P WY LFNELY+L GGT+W L
Sbjct: 453 YALCRYTEWEERISAWQSPVLDDRSLPAWYKSALFNELYFLADGGTVW---------LEV 503
Query: 456 IRERKFSLDTPSKIHTPASSDTALGTRL--LENGEENIGQFLYLEGAEYVMYNTYDVHFY 513
+ + S LG + L + G+F YLEG EY MYNTYDVHFY
Sbjct: 504 LED---------------SLPEELGRNMCHLRPTLRDYGRFGYLEGQEYRMYNTYDVHFY 548
Query: 514 SSFALVMLFPKLELSIQRDFAAAVMMHDPGTMKIMSDGKWVARKCLGAVPHDIGL--DDP 571
+SFAL+ML+PKLELS+Q D A A + D + + G K +PHDIG D+P
Sbjct: 549 ASFALIMLWPKLELSLQYDMALATLREDLTRRRYLMSGVMAPVKRRNVIPHDIGDPDDEP 608
Query: 572 WFEINSYNLFNSSRWKDLNSKFVLQVYRDFVATGDKNFARAVWPSVYIAMAYMEQFDKDG 631
W +N+Y + +++ WKDLN KFVLQVYRD+ TGD+NF + +WP M +FDKD
Sbjct: 609 WLRVNAYLIHDTADWKDLNLKFVLQVYRDYYLTGDQNFLKDMWPVCLAVMESEMKFDKDH 668
Query: 632 DGMIENEGFPDQTYDAWSANGVSAYCGGLWVAALQAASALANDVGDHASASYFWVRYQKA 691
DG+IEN G+ DQTYD W G SAYCGGLW+AA+ +A G F +
Sbjct: 669 DGLIENGGYADQTYDGWVTTGPSAYCGGLWLAAVAVMVQMAALCGAQDIQDKFSSILSRG 728
Query: 692 KAVYDS-LWNGSYFNYDNSDGSSSTSIQADQLAGQWYARACGL----LPIADEAKVKKAL 746
+ Y+ LWNG Y+NYD+S S S+ +DQ AGQW+ +ACGL + V +AL
Sbjct: 729 QEAYERLLWNGRYYNYDSSSRPQSRSVMSDQCAGQWFLKACGLGEGDTEVFPTQHVVRAL 788
Query: 747 TKIYDFNVLKVKGGMCGAMNGMQPDGRIDMSGLQAREIWPGVTYGLAASMIQEEMVDMAF 806
I++ NV GG GA+NGMQP G D S +Q+ E+W GV YGLAA+MIQE + F
Sbjct: 789 QTIFELNVQAFAGGAMGAVNGMQPHGVPDKSSVQSDEVWVGVVYGLAATMIQEGLTWEGF 848
Query: 807 QTAAGVYEVAWSEDGLGYSFQTPESWNNNDEYRSLCYMRPLTIW 850
QTA G Y W LG +FQTPE++ +RSL YMRPL+IW
Sbjct: 849 QTAEGCYRTVWER--LGLAFQTPEAYCQQRVFRSLAYMRPLSIW 890
>gi|414871105|tpg|DAA49662.1| TPA: hypothetical protein ZEAMMB73_422318 [Zea mays]
Length = 589
Score = 546 bits (1407), Expect = e-152, Method: Compositional matrix adjust.
Identities = 281/567 (49%), Positives = 378/567 (66%), Gaps = 52/567 (9%)
Query: 312 MWNEIKKHGSFD--HLDNDKTSPSEPGSSIGAAIAASLTIPSGSTRSVTFSLAWDCPEVK 369
MWN + ++G FD + + PS PG + AA++AS + +V F+LAW P+VK
Sbjct: 1 MWNTMLQNGHFDRENFSAGPSMPSSPGQKLCAAVSASTWVEPHGRCTVVFALAWSSPKVK 60
Query: 370 FFEK-VYHRRYTKFYGTLGDSAARIAHDAILEHAKWECEIEAWQRPILEDKRFPEWYPIT 428
F + Y+RRYT+FYGT SA + HDA+ ++ WE EIE WQ PIL+D+R PEWY T
Sbjct: 61 FQKGCTYNRRYTQFYGTSEKSAVNLVHDALTKYKLWEEEIEKWQNPILKDERLPEWYKFT 120
Query: 429 LFNELYYLNAGGTIWTDGSPPM---------------------QSLATIRERKFSLDTP- 466
LFNELY+L AGGT+WTDG PP + ++++R +L
Sbjct: 121 LFNELYFLVAGGTVWTDGQPPAIDEKKSPGFNHQKSSKRGTKDTNQGSVKDRHVNLAVEQ 180
Query: 467 ------------------SKIHTPASSDTALGTR--------LLENGEENIGQFLYLEGA 500
+ +H + G + + ++G E++G+FLYLEG
Sbjct: 181 VPHGGYMANGDDHSVSKFAAVHGSEMQEQINGLKSEEPIPYLISKDGPEHVGKFLYLEGV 240
Query: 501 EYVMYNTYDVHFYSSFALVMLFPKLELSIQRDFAAAVMMHDPGTMKIMSDGKWVARKCLG 560
EY+M+NTYDVHFY+SFAL+ LFPK+ELSIQRDFA AV+ D +K ++DG RK G
Sbjct: 241 EYIMWNTYDVHFYASFALLDLFPKIELSIQRDFANAVLYEDRRKVKFLADGTSGIRKVKG 300
Query: 561 AVPHDIGLDDPWFEINSYNLFNSSRWKDLNSKFVLQVYRDFVATGDKNFARAVWPSVYIA 620
AVPHD+G DPW E+N+YN+ ++S+WKDLN KFVLQ+YRDF ATGD F R VWP+V A
Sbjct: 301 AVPHDLGTHDPWHEMNAYNIHDTSKWKDLNPKFVLQIYRDFAATGDMQFGRDVWPAVCAA 360
Query: 621 MAYMEQFDKDGDGMIENEGFPDQTYDAWSANGVSAYCGGLWVAALQAASALANDVGDHAS 680
M YM+QFD+DGDG+IEN+GFPDQTYDAW+ +G+SAYCG LW+AALQAA+ +A+ +GD
Sbjct: 361 MDYMDQFDRDGDGLIENDGFPDQTYDAWTVHGISAYCGCLWLAALQAAATMAHRLGDRHF 420
Query: 681 ASYFWVRYQKAKAVYDS-LWNGSYFNYDNSDGSSSTSIQADQLAGQWYARACGLLPIADE 739
A + +++ KAKAVY++ LWNGSYFNYD+ S+S SIQADQLAGQWYA + GL P+ DE
Sbjct: 421 AEKYKLKFIKAKAVYEAKLWNGSYFNYDSGTSSNSRSIQADQLAGQWYAASSGLPPLFDE 480
Query: 740 AKVKKALTKIYDFNVLKVKGGMCGAMNGMQPDGRIDMSGLQAREIWPGVTYGLAASMIQE 799
K++ AL KI++FNV+KVKGG GA+NGM P G++D + +Q+REIW GVTY +AA+M+
Sbjct: 481 HKIRTALQKIFEFNVMKVKGGRMGAVNGMTPKGKVDETCMQSREIWTGVTYAVAANMLLH 540
Query: 800 EMVDMAFQTAAGVYEVAWSEDGLGYSF 826
M F TA G++ WSE+G GY+
Sbjct: 541 GMEHQGFTTAEGIFTAGWSEEGYGYAL 567
>gi|413920604|gb|AFW60536.1| hypothetical protein ZEAMMB73_659971 [Zea mays]
Length = 459
Score = 541 bits (1393), Expect = e-151, Method: Compositional matrix adjust.
Identities = 273/436 (62%), Positives = 334/436 (76%), Gaps = 8/436 (1%)
Query: 17 KVDPGQPPQLTWQRKLDSNVKVPTGFTLSFQEFRHLAHIGFRLYRYSKQEEAKGKVPVFN 76
KV PGQ P LTW+ KL F L+++E LA IG RL R+ +E +KG++ V +
Sbjct: 28 KVYPGQLPVLTWEHKLTHVRHDLPPFRLTWREIMQLAGIGLRLSRHILEETSKGRIAVID 87
Query: 77 VFRKHHITSDQGIPLGGIGAGSIGRSYRGEFQRFKLFHGICDDAPVLANQFSVFVSRPNG 136
+K S QG+PLGG SIGRSY+G+FQR++LF G C+D VLANQFS F+SR +G
Sbjct: 88 PMKKRAARSGQGVPLGG----SIGRSYKGDFQRWQLFPGSCEDNAVLANQFSAFISRQDG 143
Query: 137 EKFSSVLCPRSPGVPKKNTDSGIESWDWNLKGENCTYHALFPRAWTVYDGEPDPELRIVC 196
+K+S+VL P P +PK + SGI SWDWN+ G+ TYHAL+PRAWTVYDGEPDPEL IVC
Sbjct: 144 KKYSTVLHPGKPDLPKGSDISGIGSWDWNMSGQQSTYHALYPRAWTVYDGEPDPELNIVC 203
Query: 197 RQISPFIPHNYKESSFPASVFTFTLSNSGQTSADVTLLFTWANSVAGDSGLSGHHFNSKT 256
RQISP IPHNY++SS+P +VFTFT++NSG T+ADVTLLFTWANSV G S L+G+H NS
Sbjct: 204 RQISPIIPHNYQQSSYPVAVFTFTVTNSGHTAADVTLLFTWANSVGGKSELTGYHSNSSM 263
Query: 257 MTKDGVHGLTLHHRTANGRPPVTFAVAAEETADVHVSECPCFLLSGNSKG--ITAKDMWN 314
+ KDGVHG+ LHHRTA+G+P VTF +AA+E D+ +SECP F++SG+S TAKDMWN
Sbjct: 264 IEKDGVHGILLHHRTADGQPHVTFVIAAQEKEDILISECPYFVISGSSASDEFTAKDMWN 323
Query: 315 EIKKHGSFDHLDNDKTSP-SEPGSSIGAAIAASLTIPSGSTRSVTFSLAWDCPEVKFFE- 372
+K+HGSFDHLD KTS S PGSSIGAAIAAS+ + +T+ ++FSLAW CPEVKF
Sbjct: 324 SVKEHGSFDHLDPIKTSMCSRPGSSIGAAIAASVKLAPKATQDISFSLAWACPEVKFSSG 383
Query: 373 KVYHRRYTKFYGTLGDSAARIAHDAILEHAKWECEIEAWQRPILEDKRFPEWYPITLFNE 432
K YHRRYTKFYGT GD+AA +AHDAILEHA WE +IE WQ PIL+D+RFP WYP+TLFNE
Sbjct: 384 KTYHRRYTKFYGTDGDAAAALAHDAILEHASWERQIEEWQDPILQDERFPAWYPVTLFNE 443
Query: 433 LYYLNAGGTIWTDGSP 448
LYYLNAGGTIWT P
Sbjct: 444 LYYLNAGGTIWTGMHP 459
>gi|333898016|ref|YP_004471890.1| glucosylceramidase [Thermoanaerobacterium xylanolyticum LX-11]
gi|333113281|gb|AEF18218.1| Glucosylceramidase [Thermoanaerobacterium xylanolyticum LX-11]
Length = 806
Score = 536 bits (1382), Expect = e-149, Method: Compositional matrix adjust.
Identities = 303/781 (38%), Positives = 439/781 (56%), Gaps = 65/781 (8%)
Query: 87 QGIPLGGIGAGSIGRSYRGEFQRFKLFHGICDDAPVLANQFSVFVS-RPNGEKFSSVLCP 145
QG PLGG GAG+IGR+Y G F R+ L G V ANQFSVF N + + VL
Sbjct: 71 QGAPLGGFGAGTIGRTYNGGFSRWHLEIGKNKYTTVYANQFSVFQKVEGNKDGVAQVLYA 130
Query: 146 RSPGVPKKNTDSGIESWDWNLKGENCTYHALFPRAWTVYDGEPDPELRIVCRQISPFIPH 205
P + + SW W+ E+ Y+AL+P +W Y + P +++ +Q SP IP+
Sbjct: 131 GEP------ENGYLSSWKWDYPKESGMYYALYPNSWYTYTNKDLP-VQLAVKQFSPIIPY 183
Query: 206 NYKESSFPASVFTFTLSNSGQTSADVTLLFTWANSVA---GDSGLSGHHFNS--KTMTKD 260
NYKE+S+P +VF +T N + DV+++FTW N + ++ +FN K +KD
Sbjct: 184 NYKETSYPVAVFKWTAYNPTNKNVDVSIMFTWQNMIGFFGKQVNVNSGNFNKIIKDKSKD 243
Query: 261 G--VHGLTLHHRTANGRPPVTFAVAAEETADVHVSECPCFLLSGNSKGITAKDMWNEIKK 318
V + + N +++ ++ V +S F+ +G+ D+W+E K
Sbjct: 244 SEIVAAVMGNISNDNEEWNGEYSIGVKKVPGVDISYKAKFVTTGD-----GSDLWHEFSK 298
Query: 319 HGSFDHLDNDKTSPSEPGSSIGAAIAASLTIPSGSTRSVTFSLAWDCPEVKFFE-KVYHR 377
+G D+ D D+T + G IG+AIA + + G T V F+L+WD P +KF +++
Sbjct: 299 NGILDNKD-DETPTKQDG--IGSAIAVNFKLQPGQTIEVPFALSWDLPIMKFGGGDKWYK 355
Query: 378 RYTKFYGTLGDSAARIAHDAILEHAKWECEIEAWQRPILEDKRFPEWYPITLFNELYYLN 437
YTK++G G ++ I +A+ + KWE I+ WQ+PIL +K P+WY LFNELYYL
Sbjct: 356 MYTKYFGKNGKNSFAILKEALNNYQKWEKMIDDWQKPILSNKSKPDWYKTALFNELYYLA 415
Query: 438 AGGTIWTDGSPPMQSLATIRERKFSLDTPSKIHTPASSDTALGTRLLENGEENIGQFLYL 497
GGT W +G ++ E + F L
Sbjct: 416 DGGTAWENG-----------------------------------KVGEKDKRTNNMFGLL 440
Query: 498 EGAEYVMYNTYDVHFYSSFALVMLFPKLELSIQRDFAAAVMMHDPGTMKIMSDGKWVARK 557
E +Y Y T DV FY SF LVML+P +E + R FA + + D K+ S+G +K
Sbjct: 441 ECFDYNYYETLDVRFYGSFPLVMLWPDIEKQVMRQFADTINVQDSSEFKVGSNGAMAVKK 500
Query: 558 CLGAVPHDIGLDD--PWFEINSYNLFNSSRWKDLNSKFVLQVYRDFVATG--DKNFARAV 613
G +PHD+G PW +IN+Y+ N + WKDLNSK+VL VYRD+V TG DK F +
Sbjct: 501 VQGMIPHDLGSSYALPWIKINAYDWQNPNIWKDLNSKYVLLVYRDYVLTGKTDKEFLKYT 560
Query: 614 WPSVYIAMAYMEQFDKDGDGMIENEGFPDQTYDAWSANGVSAYCGGLWVAALQAASALAN 673
W SV A+ +++ DKD DG+ +NEG PDQTYD WS G SAYCG LW+AAL+AA +
Sbjct: 561 WKSVKTALDKLKEMDKDNDGIPDNEGIPDQTYDTWSMKGTSAYCGSLWLAALKAAQEIGK 620
Query: 674 DVGDHASASYFWVRYQKAKAVYDS-LWNGSYFNYDNSDGSSSTSIQADQLAGQWYARACG 732
+ D+ + + Y+ A+ ++ LWNG Y+N+D ++ SI ADQLAGQWYA
Sbjct: 621 VLKDNEAYIKYNEWYKIAQQNFEKELWNGEYYNFD-TESDHKDSIMADQLAGQWYADILR 679
Query: 733 LLPIADEAKVKKALTKIYDFNVLKVKGGMCGAMNGMQPDGRIDMSGLQAREIWPGVTYGL 792
L I + V+KAL KIY+FNV+K + G GA+NGM+PDG +D S +QA+E+W GVTY L
Sbjct: 680 LGDILPKDHVQKALKKIYEFNVMKFENGKMGAVNGMRPDGIVDESDIQAQEVWTGVTYAL 739
Query: 793 AASMIQEEMVDMAFQTAAGVYEVAWSEDGLGYSFQTPESWNNNDEYRSLCYMRPLTIWAM 852
A+ M M + A+ TA GVY++ + + G GY F+TPE+W + YR+ YMRPL+IW+M
Sbjct: 740 ASFMKYRGMTEEAYNTAYGVYKMTYDKSGKGYWFRTPEAWTKDGNYRASMYMRPLSIWSM 799
Query: 853 Q 853
+
Sbjct: 800 E 800
>gi|194386992|dbj|BAG59862.1| unnamed protein product [Homo sapiens]
Length = 755
Score = 535 bits (1377), Expect = e-149, Method: Compositional matrix adjust.
Identities = 305/750 (40%), Positives = 416/750 (55%), Gaps = 59/750 (7%)
Query: 116 ICDDAPVLANQFSVFVSRPNGEKFSSVLCPRSPGVPKKNTDSGIESWDWNLKGENCTYHA 175
+ D P+ +F+V + R + VL P V + SW+W L G YHA
Sbjct: 7 VIADQPICPLKFTVCLRREGQTVYQQVLSLERPSV--------LRSWNWGLCGYFAFYHA 58
Query: 176 LFPRAWTVYDGEPDPELRIVCRQISPFIPHNYKESSFPASVFTFTLSNSGQTSADVTLLF 235
L+PRAWTVY P + + CRQI+P +PH+Y++SS P VF + + N G + DV+++F
Sbjct: 59 LYPRAWTVYQ-LPGQNVTLTCRQITPILPHDYQDSSLPVGVFVWDVENEGDEALDVSIMF 117
Query: 236 TWANSVAGDSGLSGHHFNSK-TMTKDG--VHGLTLHHRTANGRPPVTFAVAAEETADVHV 292
+ N + G G +N + + G V GL LHH T P T AVAA TA V
Sbjct: 118 SMRNGLGGGDDAPGGLWNEPFCLERSGETVRGLLLHHPTLPN--PHTMAVAARVTAATTV 175
Query: 293 SECPCFLLSGNSKGITAKDMWNEIKKHGSFDHLDNDKTSPSEPGSSIGAAIAASLTI-PS 351
+ F T + +W ++ + G D +++P++ G I A+ S + P
Sbjct: 176 THITAF-----DPDSTGQQVWQDLLQDGQLDS-PTGQSTPTQKGVGIAGAVCVSSKLRPR 229
Query: 352 GSTRSVTFSLAWDCPEVKFFEK--VYHRRYTKFYGTLGDSAARIAHDAILEHAKWECEIE 409
G R + FSLAWD P + F K V++RRYT+F+G GD+A ++H A+ +A+WE I
Sbjct: 230 GQCR-LEFSLAWDMPRIMFGAKGQVHYRRYTRFFGQDGDAAPALSHYALCRYAEWEERIS 288
Query: 410 AWQRPILEDKRFPEWYPITLFNELYYLNAGGTIWTDGSPPMQSLATIRERKFSLDTPSKI 469
AWQ P+L+D+ P WY LFNELY+L GGT+W L + +
Sbjct: 289 AWQSPVLDDRSLPAWYKSALFNELYFLADGGTVW---------LEVLED----------- 328
Query: 470 HTPASSDTALGTRL--LENGEENIGQFLYLEGAEYVMYNTYDVHFYSSFALVMLFPKLEL 527
S LG + L + G+F YLEG EY MYNTYDVHFY+SFAL+ML+PKLEL
Sbjct: 329 ----SLPEELGRNMCHLRPTLRDYGRFGYLEGQEYRMYNTYDVHFYASFALIMLWPKLEL 384
Query: 528 SIQRDFAAAVMMHDPGTMKIMSDGKWVARKCLGAVPHDIGL--DDPWFEINSYNLFNSSR 585
S+Q D A A + D + + G K +PHDIG D+PW +N+Y + +++
Sbjct: 385 SLQYDMALATLREDLTRRRYLMSGVMAPVKRRNVIPHDIGDPDDEPWLRVNAYLIHDTAD 444
Query: 586 WKDLNSKFVLQVYRDFVATGDKNFARAVWPSVYIAMAYMEQFDKDGDGMIENEGFPDQTY 645
WKDLN KFVLQVYRD+ TGD+NF + +WP M +FDKD DG+IEN G+ DQTY
Sbjct: 445 WKDLNLKFVLQVYRDYYLTGDQNFLKDMWPVCLAVMESEMKFDKDHDGLIENGGYADQTY 504
Query: 646 DAWSANGVSAYCGGLWVAALQAASALANDVGDHASASYFWVRYQKAKAVYDS-LWNGSYF 704
D W G SAYCGGLW+AA+ +A G F + + Y+ LWNG Y+
Sbjct: 505 DGWVTTGPSAYCGGLWLAAVAVMVQMAALCGAQDIQDKFSSILSRGQEAYERLLWNGRYY 564
Query: 705 NYDNSDGSSSTSIQADQLAGQWYARACGL----LPIADEAKVKKALTKIYDFNVLKVKGG 760
NYD+S S S+ +DQ AGQW+ +ACGL + V +AL I++ NV GG
Sbjct: 565 NYDSSSRPQSRSVMSDQCAGQWFLKACGLGEGDTEVFPTQHVVRALQTIFELNVQAFAGG 624
Query: 761 MCGAMNGMQPDGRIDMSGLQAREIWPGVTYGLAASMIQEEMVDMAFQTAAGVYEVAWSED 820
GA+NGMQP G D S +Q+ E+W GV YGLAA+MIQE + FQTA G Y W
Sbjct: 625 AMGAVNGMQPHGVPDKSSVQSDEVWVGVVYGLAATMIQEGLTWEGFQTAEGCYRTVWER- 683
Query: 821 GLGYSFQTPESWNNNDEYRSLCYMRPLTIW 850
LG +FQTPE++ +RSL YMRPL+IW
Sbjct: 684 -LGLAFQTPEAYCQQRVFRSLAYMRPLSIW 712
>gi|397519483|ref|XP_003829888.1| PREDICTED: non-lysosomal glucosylceramidase isoform 2 [Pan
paniscus]
Length = 755
Score = 534 bits (1375), Expect = e-148, Method: Compositional matrix adjust.
Identities = 303/750 (40%), Positives = 416/750 (55%), Gaps = 59/750 (7%)
Query: 116 ICDDAPVLANQFSVFVSRPNGEKFSSVLCPRSPGVPKKNTDSGIESWDWNLKGENCTYHA 175
+ D P+ + +F+V + R + VL P V + SW+W L G YHA
Sbjct: 7 VIADQPICSLKFTVCLRREGQTVYQQVLSLERPSV--------LRSWNWGLCGYFAFYHA 58
Query: 176 LFPRAWTVYDGEPDPELRIVCRQISPFIPHNYKESSFPASVFTFTLSNSGQTSADVTLLF 235
L+PRAWTVY P + + CRQI+P +PH+Y++SS P VF + + N G + DV+++F
Sbjct: 59 LYPRAWTVYQ-LPGQNVTLTCRQITPILPHDYQDSSLPVGVFVWDVENEGDEALDVSIMF 117
Query: 236 TWANSVAGDSGLSGHHFNSK-TMTKDG--VHGLTLHHRTANGRPPVTFAVAAEETADVHV 292
+ N + G G +N + + G V GL LHH T P T AVAA TA V
Sbjct: 118 SMRNGLGGGDDAPGGLWNEPFCLERSGETVRGLLLHHPTLPN--PYTMAVAARVTAATTV 175
Query: 293 SECPCFLLSGNSKGITAKDMWNEIKKHGSFDHLDNDKTSPSEPGSSIGAAIAASLTI-PS 351
+ F T + +W ++ + G D +++P++ G I A+ S + P
Sbjct: 176 THITAF-----DPDSTGQQVWQDLLQDGQLDS-PTGQSTPTQKGVGIAGAVCVSSKLRPR 229
Query: 352 GSTRSVTFSLAWDCPEVKFFEK--VYHRRYTKFYGTLGDSAARIAHDAILEHAKWECEIE 409
G R + FSLAWD P + F K V++RRYT+F+G GD+A ++H A+ + +WE I
Sbjct: 230 GQCR-LEFSLAWDMPRIMFGAKGQVHYRRYTRFFGQDGDAAPALSHYALCRYTEWEERIS 288
Query: 410 AWQRPILEDKRFPEWYPITLFNELYYLNAGGTIWTDGSPPMQSLATIRERKFSLDTPSKI 469
AWQ P+L+D+ P WY LFNELY+L GGT+W L + +
Sbjct: 289 AWQSPVLDDRSLPAWYKSALFNELYFLADGGTVW---------LEVLED----------- 328
Query: 470 HTPASSDTALGTRL--LENGEENIGQFLYLEGAEYVMYNTYDVHFYSSFALVMLFPKLEL 527
S LG + L + G+F YLEG EY MYNTYDVHFY+SFAL+ML+PKLEL
Sbjct: 329 ----SLPEELGRNMCHLHPTLRDYGRFGYLEGQEYRMYNTYDVHFYASFALIMLWPKLEL 384
Query: 528 SIQRDFAAAVMMHDPGTMKIMSDGKWVARKCLGAVPHDIGL--DDPWFEINSYNLFNSSR 585
S+Q D A A + D + + G K +PHDIG D+PW +N+Y + +++
Sbjct: 385 SLQYDMALATLREDLTRRRYLMSGVMAPVKRRNVIPHDIGDPDDEPWLRVNAYLIHDTAD 444
Query: 586 WKDLNSKFVLQVYRDFVATGDKNFARAVWPSVYIAMAYMEQFDKDGDGMIENEGFPDQTY 645
WKDLN KFVLQVYRD+ TGD+NF + +WP M +FDKD DG+IEN G+ DQTY
Sbjct: 445 WKDLNLKFVLQVYRDYYLTGDQNFLKDMWPVCLAVMESEMKFDKDHDGLIENGGYADQTY 504
Query: 646 DAWSANGVSAYCGGLWVAALQAASALANDVGDHASASYFWVRYQKAKAVYDS-LWNGSYF 704
D W G SAYCGGLW+AA+ +A G F + + Y+ LWNG Y+
Sbjct: 505 DGWVTTGPSAYCGGLWLAAVAVMVQMAALCGAQDIQDKFSSILSRGQEAYERLLWNGRYY 564
Query: 705 NYDNSDGSSSTSIQADQLAGQWYARACGL----LPIADEAKVKKALTKIYDFNVLKVKGG 760
NYD+S S S+ +DQ AGQW+ +ACGL + V +AL +++ NV GG
Sbjct: 565 NYDSSSRPQSRSVMSDQCAGQWFLKACGLGEGDTEVFPTQHVVRALQTVFELNVQAFAGG 624
Query: 761 MCGAMNGMQPDGRIDMSGLQAREIWPGVTYGLAASMIQEEMVDMAFQTAAGVYEVAWSED 820
GA+NGMQP G D S +Q+ E+W GV YGLAA+MIQE + FQTA G Y W
Sbjct: 625 AMGAVNGMQPHGVPDKSSVQSDEVWVGVVYGLAATMIQEGLTWEGFQTAEGCYRTVWER- 683
Query: 821 GLGYSFQTPESWNNNDEYRSLCYMRPLTIW 850
LG +FQTPE++ +RSL YMRPL+IW
Sbjct: 684 -LGLAFQTPEAYCQQRVFRSLAYMRPLSIW 712
>gi|242015137|ref|XP_002428230.1| conserved hypothetical protein [Pediculus humanus corporis]
gi|212512791|gb|EEB15492.1| conserved hypothetical protein [Pediculus humanus corporis]
Length = 812
Score = 528 bits (1359), Expect = e-147, Method: Compositional matrix adjust.
Identities = 318/822 (38%), Positives = 455/822 (55%), Gaps = 61/822 (7%)
Query: 46 FQEFRHLAHIGFRLYRYSKQEEAKGKVPVFNVFRKHHITSDQGIPLGGIGAGSIGRSYRG 105
+F + + R YS KG+ P+ + + + G+PLGGIG+GSIGR ++G
Sbjct: 33 LNQFWSMIPLIIRYLIYSIYILCKGRRPLMDYMKNLNGKQMYGVPLGGIGSGSIGRGFKG 92
Query: 106 EFQRFKLFHGICDDAPVLANQFSVFVSRPNGEK-FSSVL-CPRSPGVPKKNTDSGIESWD 163
EF R++L G + + NQF + V G F VL C R+P S +++WD
Sbjct: 93 EFCRYQLRPGNYEYETIYGNQFILTVRDGEGNAVFQKVLNCGRAP-------KSCLKNWD 145
Query: 164 WNLKGENCTYHALFPRAWTVYDGEPDPELRIVCRQISPFIPHNYKESSFPASVFTFTLSN 223
W L GEN Y L+PR+WT Y+ E + +LR++C+QISP IP+NYK+S P S+F + + N
Sbjct: 146 WTLNGENANYVGLYPRSWTEYNIE-EQKLRLICKQISPVIPNNYKDSCIPGSMFIWHIEN 204
Query: 224 SGQTSADVTLLFTWANSVA--GDS--GLSGHHFNSKTMTKDGVHGLTLHHRTANGRPPVT 279
DV + FT+ N + GD G F + + + G+ ++H ++ T
Sbjct: 205 QSNEVKDVNITFTFQNGIGEKGDKKKGCWSQMF-KENCSNGNILGVLINHVISSMN--YT 261
Query: 280 FAVAAEETADVHVSECPCFLLSGNSKGITAKDMWNEIKKHGSFDHLDNDKTSPSEPGSSI 339
F ++ E A VHVS L N G KD+WN+IK G + D D + + G +
Sbjct: 262 FGISVCEKAGVHVS----CLKKWNPLG-NGKDIWNKIKT-GKLEKFDFDPSHHTANGEQL 315
Query: 340 GAAIAASLTIPSGSTRSVTFSLAWDCPEVKFFE--KVYHRRYTKFYGTLGDSAARIAHDA 397
+A+ A IP S ++ F+L WD P++ F E K Y + YT+F+G + +++
Sbjct: 316 ASAVCAQTLIPGKSVSTLEFNLIWDMPKINFHENKKQYWKYYTRFFGK-EKNGPKLSAYC 374
Query: 398 ILEHAKWECEIEAWQRPILEDKRFPEWYPITLFNELYYLNAGGTIWTDGSPPMQSLATIR 457
+ W+ EI+ WQ PIL D P WY LFNE Y+++ GGT+W D
Sbjct: 375 FENYKNWDLEIDKWQNPILNDDNLPSWYKSALFNETYFVSDGGTVWLD------------ 422
Query: 458 ERKFSLDTPSKIHTPASSDTALGTRLLENGEENIGQFLYLEGAEYVMYNTYDVHFYSSFA 517
LD K A+ G+F YLEG EY MYNTYDVHFY+SFA
Sbjct: 423 -----LDEDEKKLLSANDPRC-----------EYGRFAYLEGHEYRMYNTYDVHFYASFA 466
Query: 518 LVMLFPKLELSIQRDFAAAVMMHDPGTMKIMSDGKWVARKCLGAVPHDIG--LDDPWFEI 575
L ML+P L+ +QRDF V D + +G RK +VPHD+G ++P+ +I
Sbjct: 467 LAMLWPNLQSCVQRDFCDTVPQEDRSRQWHLFNGSIGYRKVKNSVPHDLGDPFEEPFLKI 526
Query: 576 NSYNLFNSSRWKDLNSKFVLQVYRDFVATGDKNFARAVWPSVYIAMAYMEQFDKDGDGMI 635
NSY + + S W+DL+ KFVLQ YRD+ D N + +W SV + M ++DKDGDG+I
Sbjct: 527 NSYPIHDVSEWRDLDLKFVLQSYRDYFLNNDLNQLKYLWKSVCVLMNNALRWDKDGDGLI 586
Query: 636 ENEGFPDQTYDAWSANGVSAYCGGLWVAALQAASALANDVGDHASASYFWVRYQKAKAVY 695
EN G PDQTYDAW+ G SAYCG LW+AALQ +A + D + + F + A+ +
Sbjct: 587 ENVGEPDQTYDAWTMEGPSAYCGSLWLAALQCTCKMALILSDVDTYNKFSETLKMAQQSF 646
Query: 696 D-SLWNGSYFNYDNSDGSSSTSIQADQLAGQWYARACGLL-PIADEAKVKKALTKIYDFN 753
D LWNG Y+N+D S+ T+I +DQL G WY CG+ + AKV +L IY+ N
Sbjct: 647 DKKLWNGKYYNFDCSE-KWGTTIMSDQLCGLWYLYCCGIKNEVLPPAKVNSSLRCIYENN 705
Query: 754 VLKVKGGMCGAMNGMQPDGRIDMSGLQAREIWPGVTYGLAASMIQEEMVDMAFQTAAGVY 813
VLK+ G G +NGM P+G ID+ LQ+ EIW GV Y LA+ MI E MV+ ++TA G+Y
Sbjct: 706 VLKLGKGNLGCVNGMLPNGEIDIHTLQSEEIWVGVVYALASLMIFEGMVEEGWKTARGIY 765
Query: 814 EVAWSEDGLGYSFQTPESWNNNDEYRSLCYMRPLTIWAMQWA 855
E + G+G F+TPE+ + +YR++ YMRPL+IW+MQ A
Sbjct: 766 ETVYDRIGMG--FETPEALRESKKYRAIGYMRPLSIWSMQLA 805
>gi|119578751|gb|EAW58347.1| glucosidase, beta (bile acid) 2, isoform CRA_c [Homo sapiens]
gi|168270622|dbj|BAG10104.1| non-lysosomal glucosylceramidase [synthetic construct]
Length = 877
Score = 508 bits (1309), Expect = e-141, Method: Compositional matrix adjust.
Identities = 306/767 (39%), Positives = 423/767 (55%), Gaps = 62/767 (8%)
Query: 52 LAHIGFRLYRY----SKQEEAKGKVPVFNVFRKHHITSDQGIPLGGIGAGSIGRSYRGEF 107
+ HIG L RY ++ + K P ++ + G PLGGIG G+I R +RG+F
Sbjct: 112 IKHIGMGL-RYLQWWYRKTHVEKKTPFIDMINSVPLRQIYGCPLGGIGGGTITRGWRGQF 170
Query: 108 QRFKLFHGICDDAPVLANQFSVFVSRPNGEKFSSVLCPRSPGVPKKNTDSGIESWDWNLK 167
R++L G+ V+A+QF+V + R + VL P V + SW+W L
Sbjct: 171 CRWQLNPGMYQHRTVIADQFTVCLRREGQTVYQQVLSLERPSV--------LRSWNWGLC 222
Query: 168 GENCTYHALFPRAWTVYDGEPDPELRIVCRQISPFIPHNYKESSFPASVFTFTLSNSGQT 227
G YHAL+PRAWTVY P + + CRQI+P +PH+Y++SS P VF + + N G
Sbjct: 223 GYFAFYHALYPRAWTVYQ-LPGQNVTLTCRQITPILPHDYQDSSLPVGVFVWDVENEGDE 281
Query: 228 SADVTLLFTWANSVAGDSGLSGHHFNSK-TMTKDG--VHGLTLHHRTANGRPPVTFAVAA 284
+ DV+++F+ N + G G +N + + G V GL LHH T P T AVAA
Sbjct: 282 ALDVSIMFSMRNGLGGGDDAPGGLWNEPFCLERSGETVRGLLLHHPTLPN--PYTMAVAA 339
Query: 285 EETADVHVSECPCFLLSGNSKGITAKDMWNEIKKHGSFDHLDNDKTSPSEPGSSIGAAIA 344
TA V+ F T + +W ++ + G D +++P++ G I A+
Sbjct: 340 RVTAATTVTHITAF-----DPDSTGQQVWQDLLQDGQLDS-PTGQSTPTQKGVGIAGAVC 393
Query: 345 ASLTI-PSGSTRSVTFSLAWDCPEVKFFEK--VYHRRYTKFYGTLGDSAARIAHDAILEH 401
S + P G R + FSLAWD P + F K V++RRYT+F+G GD+A ++H A+ +
Sbjct: 394 VSSKLRPRGQCR-LEFSLAWDMPRIMFGAKGQVHYRRYTRFFGQDGDAAPALSHYALCRY 452
Query: 402 AKWECEIEAWQRPILEDKRFPEWYPITLFNELYYLNAGGTIWTDGSPPMQSLATIRERKF 461
A+WE I AWQ P+L+D+ P WY LFNELY+L GGT+W L + +
Sbjct: 453 AEWEERISAWQSPVLDDRSLPAWYKSALFNELYFLADGGTVW---------LEVLED--- 500
Query: 462 SLDTPSKIHTPASSDTALGTRL--LENGEENIGQFLYLEGAEYVMYNTYDVHFYSSFALV 519
S LG + L + G+F YLEG EY MYNTYDVHFY+SFAL+
Sbjct: 501 ------------SLPEELGRNMCHLRPTLRDYGRFGYLEGQEYRMYNTYDVHFYASFALI 548
Query: 520 MLFPKLELSIQRDFAAAVMMHDPGTMKIMSDGKWVARKCLGAVPHDIGL--DDPWFEINS 577
ML+PKLELS+Q D A A + D + + G K +PHDIG D+PW +N+
Sbjct: 549 MLWPKLELSLQYDMALATLREDLTRRRYLMSGVMAPVKRRNVIPHDIGDPDDEPWLRVNA 608
Query: 578 YNLFNSSRWKDLNSKFVLQVYRDFVATGDKNFARAVWPSVYIAMAYMEQFDKDGDGMIEN 637
Y + +++ WKDLN KFVLQVYRD+ TGD+NF + +WP M +FDKD DG+IEN
Sbjct: 609 YLIHDTADWKDLNLKFVLQVYRDYYLTGDQNFLKDMWPVCLAVMESEMKFDKDHDGLIEN 668
Query: 638 EGFPDQTYDAWSANGVSAYCGGLWVAALQAASALANDVGDHASASYFWVRYQKAKAVYDS 697
G+ DQTYD W G SAYCGGLW+AA+ +A G F + + Y+
Sbjct: 669 GGYADQTYDGWVTTGPSAYCGGLWLAAVAVMVQMAALCGAQDIQDKFSSILSRGQEAYER 728
Query: 698 -LWNGSYFNYDNSDGSSSTSIQADQLAGQWYARACGL----LPIADEAKVKKALTKIYDF 752
LWNG Y+NYD+S S S+ +DQ AGQW+ +ACGL + V +AL I++
Sbjct: 729 LLWNGRYYNYDSSSRPQSRSVMSDQCAGQWFLKACGLGEGDTEVFPTQHVVRALQTIFEL 788
Query: 753 NVLKVKGGMCGAMNGMQPDGRIDMSGLQAREIWPGVTYGLAASMIQE 799
NV GG GA+NGMQP G D S +Q+ E+W GV YGLAA+MIQE
Sbjct: 789 NVQAFAGGAMGAVNGMQPHGVPDKSSVQSDEVWVGVVYGLAATMIQE 835
>gi|156554493|ref|XP_001604835.1| PREDICTED: non-lysosomal glucosylceramidase-like [Nasonia
vitripennis]
Length = 826
Score = 508 bits (1308), Expect = e-141, Method: Compositional matrix adjust.
Identities = 304/824 (36%), Positives = 451/824 (54%), Gaps = 60/824 (7%)
Query: 46 FQEFRHLAHIGFRLYRYSKQEEAKGKVPVFNVFRKHHITSDQGIPLGGIGAGSIGRSYRG 105
F++ + + R Y G+ + + F H G+P+GG+G GSIGR +RG
Sbjct: 33 FKQTIQMMPLAVRYVFYYTNVSRYGRPVLMDYFNMRHGQQIYGVPIGGLGGGSIGRGFRG 92
Query: 106 EFQRFKLFHGICDDAPVLANQFSVFVSRPNGEK-FSSVLCPRSPGVPKKNTDSGIESWDW 164
EF R+ L G+ + ANQF + V NG ++ VL ++ PK + SW W
Sbjct: 93 EFCRYALLPGLYSYHVIPANQFILTVRDVNGNTIYNQVLSTKTK--PKH-----LRSWKW 145
Query: 165 NLKGENCTYHALFPRAWTVYDGEPDPELRIVCRQISPFIPHNYKESSFPASVFTFTLSNS 224
+G +Y L+PR+WT Y+ E + EL++ CRQ+SP IPH+YK+SS P +VF + + N
Sbjct: 146 GFEGNKASYTGLYPRSWTTYEVE-ELELKLTCRQVSPVIPHDYKDSSLPCAVFVWDVVNR 204
Query: 225 GQTSADVTLLFTWANSVAGDSGLSGHHFNSKTMTKDGVHGLTLHHRTANGRPPVTFAVAA 284
+ + FT+ + A G + ++ D G T+ H+ G P T+A+++
Sbjct: 205 SNRDLRIAITFTFQSGCASKEDAQGDKW-TEFFELDESRG-TMIHQEFRGMP-CTYAISS 261
Query: 285 EETAD-VHVSECPCFLLSGNSKGITAKDMWNEIKKHGSFDHLDNDKTSPSEPGSSIGAAI 343
D V V+ F GN +W ++ G FD + S S+ I A+
Sbjct: 262 RTRPDDVGVTRMIGFDPRGN-----GAVLWQRLQNAGRFDESAEPRIS-SKTRKPIATAV 315
Query: 344 AASLTIPSGSTRSVTFSLAWDCPEVKFF--EKVYHRRYTKFYGTLGDSAA-RIAHDAILE 400
A + +GS+ + FSLAW P+++FF +K + R YT+F+ G+ A I A+ +
Sbjct: 316 CADTLVKAGSSGFLEFSLAWHMPKIQFFFKKKDHLRYYTQFFS--GEKAGPEICRYALSQ 373
Query: 401 HAKWECEIEAWQRPILEDKRFPEWYPITLFNELYYLNAGGTIWTDGSPPMQSLATIRERK 460
+ +WE EIE WQ+ +LED P+WY +FNELY+++ GG++W L+T +
Sbjct: 374 YERWEDEIERWQKSVLEDPDLPDWYKSAIFNELYFVSDGGSLW---------LSTDESNE 424
Query: 461 FSLDTPSKIHTPASSDTALGTRLLENGEENIGQFLYLEGAEYVMYNTYDVHFYSSFALVM 520
D P + + G+F YLEG EY MYNTYDVHFY++FAL
Sbjct: 425 LQPDDPRRRY---------------------GRFAYLEGHEYRMYNTYDVHFYAAFALAQ 463
Query: 521 LFPKLELSIQRDFAAAVMMHDPGTMKIMSDGKWVARKCLGAVPHDIGL--DDPWFEINSY 578
L+P LE IQ +F +V D ++ + DG RK G+VPHDIG ++P+ IN+Y
Sbjct: 464 LWPNLEACIQYEFRDSVHAEDQQRLQGLYDGSKRHRKVKGSVPHDIGDPGEEPFDLINAY 523
Query: 579 NLFNSSRWKDLNSKFVLQVYRDFVATGDKNFARAVWPSVYIAMAYMEQFDKDGDGMIENE 638
+ + S+W+DLNSKFVL YR + + R W ++ + + + FD+D DG+IEN
Sbjct: 524 PIHDVSQWRDLNSKFVLSCYRIYFFNRNLEQLRDFWSTIKLVLEHSLTFDEDNDGLIENG 583
Query: 639 GFPDQTYDAWSANGVSAYCGGLWVAALQAASALANDVGDHASASYFWVRYQKAKAVY-DS 697
GFPDQTYD W +G SAYCGGLW+AAL A +A +G+ + + + K + D
Sbjct: 584 GFPDQTYDCWVMSGPSAYCGGLWIAALHCAVKMAELLGESEDEARYKGILDRGKVAFQDK 643
Query: 698 LWNGSYFNYDNSDGSSSTSIQADQLAGQWYARACGL-LPIADEAKVKKALTKIYDFNVLK 756
LWNG Y+N+D S SI +DQL G W RACG + + +V+ +L ++ NV+K
Sbjct: 644 LWNGKYYNFDCGKDESKLSIMSDQLCGHWLLRACGFSYEVFPQDRVRSSLETVFQNNVMK 703
Query: 757 VKGGMCGAMNGMQPDGRIDMSGLQAREIWPGVTYGLAASMIQEEMVDMAFQTAAGVYEVA 816
K G GA+NG P G ID S +Q+ E+W GV YGLAA +I E M++ AF+TA GVY
Sbjct: 704 YKNGQQGAVNGFSPTGSIDYSCIQSEEMWTGVAYGLAALLIHEGMIEEAFRTAEGVYRTV 763
Query: 817 WSEDGLGYSFQTPESWNNNDEYRSLCYMRPLTIWAMQWALTKPK 860
+ + G+G F+TPE+ + YR++ YMRPL+IWAMQ A T K
Sbjct: 764 YEKIGMG--FETPEALYEHKVYRAIGYMRPLSIWAMQHAWTMRK 805
>gi|10047285|dbj|BAB13431.1| KIAA1605 protein [Homo sapiens]
Length = 922
Score = 508 bits (1308), Expect = e-141, Method: Compositional matrix adjust.
Identities = 306/767 (39%), Positives = 423/767 (55%), Gaps = 62/767 (8%)
Query: 52 LAHIGFRLYRY----SKQEEAKGKVPVFNVFRKHHITSDQGIPLGGIGAGSIGRSYRGEF 107
+ HIG L RY ++ + K P ++ + G PLGGIG G+I R +RG+F
Sbjct: 157 IKHIGMGL-RYLQWWYRKTHVEKKTPFIDMINSVPLRQIYGCPLGGIGGGTITRGWRGQF 215
Query: 108 QRFKLFHGICDDAPVLANQFSVFVSRPNGEKFSSVLCPRSPGVPKKNTDSGIESWDWNLK 167
R++L G+ V+A+QF+V + R + VL P V + SW+W L
Sbjct: 216 CRWQLNPGMYQHRTVIADQFTVCLRREGQTVYQQVLSLERPSV--------LRSWNWGLC 267
Query: 168 GENCTYHALFPRAWTVYDGEPDPELRIVCRQISPFIPHNYKESSFPASVFTFTLSNSGQT 227
G YHAL+PRAWTVY P + + CRQI+P +PH+Y++SS P VF + + N G
Sbjct: 268 GYFAFYHALYPRAWTVYQ-LPGQNVTLTCRQITPILPHDYQDSSLPVGVFVWDVENEGDE 326
Query: 228 SADVTLLFTWANSVAGDSGLSGHHFNSK-TMTKDG--VHGLTLHHRTANGRPPVTFAVAA 284
+ DV+++F+ N + G G +N + + G V GL LHH T P T AVAA
Sbjct: 327 ALDVSIMFSMRNGLGGGDDAPGGLWNEPFCLERSGETVRGLLLHHPTLPN--PYTMAVAA 384
Query: 285 EETADVHVSECPCFLLSGNSKGITAKDMWNEIKKHGSFDHLDNDKTSPSEPGSSIGAAIA 344
TA V+ F T + +W ++ + G D +++P++ G I A+
Sbjct: 385 RVTAATTVTHITAF-----DPDSTGQQVWQDLLQDGQLDS-PTGQSTPTQKGVGIAGAVC 438
Query: 345 ASLTI-PSGSTRSVTFSLAWDCPEVKFFEK--VYHRRYTKFYGTLGDSAARIAHDAILEH 401
S + P G R + FSLAWD P + F K V++RRYT+F+G GD+A ++H A+ +
Sbjct: 439 VSSKLRPRGQCR-LEFSLAWDMPRIMFGAKGQVHYRRYTRFFGQDGDAAPALSHYALCRY 497
Query: 402 AKWECEIEAWQRPILEDKRFPEWYPITLFNELYYLNAGGTIWTDGSPPMQSLATIRERKF 461
A+WE I AWQ P+L+D+ P WY LFNELY+L GGT+W L + +
Sbjct: 498 AEWEERISAWQSPVLDDRSLPAWYKSALFNELYFLADGGTVW---------LEVLED--- 545
Query: 462 SLDTPSKIHTPASSDTALGTRL--LENGEENIGQFLYLEGAEYVMYNTYDVHFYSSFALV 519
S LG + L + G+F YLEG EY MYNTYDVHFY+SFAL+
Sbjct: 546 ------------SLPEELGRNMCHLRPTLRDYGRFGYLEGQEYRMYNTYDVHFYASFALI 593
Query: 520 MLFPKLELSIQRDFAAAVMMHDPGTMKIMSDGKWVARKCLGAVPHDIGL--DDPWFEINS 577
ML+PKLELS+Q D A A + D + + G K +PHDIG D+PW +N+
Sbjct: 594 MLWPKLELSLQYDMALATLREDLTRRRYLMSGVMAPVKRRNVIPHDIGDPDDEPWLRVNA 653
Query: 578 YNLFNSSRWKDLNSKFVLQVYRDFVATGDKNFARAVWPSVYIAMAYMEQFDKDGDGMIEN 637
Y + +++ WKDLN KFVLQVYRD+ TGD+NF + +WP M +FDKD DG+IEN
Sbjct: 654 YLIHDTADWKDLNLKFVLQVYRDYYLTGDQNFLKDMWPVCLAVMESEMKFDKDHDGLIEN 713
Query: 638 EGFPDQTYDAWSANGVSAYCGGLWVAALQAASALANDVGDHASASYFWVRYQKAKAVYDS 697
G+ DQTYD W G SAYCGGLW+AA+ +A G F + + Y+
Sbjct: 714 GGYADQTYDGWVTTGPSAYCGGLWLAAVAVMVQMAALCGAQDIQDKFSSILSRGQEAYER 773
Query: 698 -LWNGSYFNYDNSDGSSSTSIQADQLAGQWYARACGL----LPIADEAKVKKALTKIYDF 752
LWNG Y+NYD+S S S+ +DQ AGQW+ +ACGL + V +AL I++
Sbjct: 774 LLWNGRYYNYDSSSRPQSRSVMSDQCAGQWFLKACGLGEGDTEVFPTQHVVRALQTIFEL 833
Query: 753 NVLKVKGGMCGAMNGMQPDGRIDMSGLQAREIWPGVTYGLAASMIQE 799
NV GG GA+NGMQP G D S +Q+ E+W GV YGLAA+MIQE
Sbjct: 834 NVQAFAGGAMGAVNGMQPHGVPDKSSVQSDEVWVGVVYGLAATMIQE 880
>gi|431902810|gb|ELK09025.1| Non-lysosomal glucosylceramidase [Pteropus alecto]
Length = 970
Score = 504 bits (1298), Expect = e-140, Method: Compositional matrix adjust.
Identities = 296/763 (38%), Positives = 411/763 (53%), Gaps = 75/763 (9%)
Query: 49 FRHLAHIGFR-LYRYSKQEEAKGKVPVFNVFRKHHITSDQGIPLGGIGAGSIGRSYRGEF 107
+HL +G R L + ++ + + K P ++ + G PLGGIG G+I R +RG+F
Sbjct: 104 IKHLG-MGLRYLQWWYRKTQVEKKKPFIDLINSVPLKQIYGCPLGGIGGGTITRGWRGQF 162
Query: 108 QRFKLFHGICDDAPVLANQFSVFVSRPNGEKFSSVLCPRSPGVPKKNTDSGIESWDWNLK 167
R++L G+ V+A+QF+V + R + VL P V + SW+W L
Sbjct: 163 CRWQLNPGMYQHRTVIADQFTVCLRREGRTVYQQVLSVERPSV--------LRSWNWGLC 214
Query: 168 GENCTYHALFPRAWTVYDGEPDPELRIVCRQISPFIPHNYKESSFPASVFTFTLSNSGQT 227
G YHAL+PRAWTVY P ++ + CRQI+P +PH+Y++SS P +F + + N G
Sbjct: 215 GYFAFYHALYPRAWTVYQ-LPGQKVTLTCRQITPILPHDYQDSSLPVGIFVWDVENEGDE 273
Query: 228 SADVTLLFTWANSVAGDSGLSGHHFNSK-TMTKDGVHGLTLHHRTANGRPPVTFAVAAEE 286
+ DV+++F+ N + G SG +N + +DG E
Sbjct: 274 ALDVSIMFSMRNGLGGGDDASGGLWNEPFCLERDG------------------------E 309
Query: 287 TADVHVSECPCFLLSGNSKGITAKDMWNEIKKHGSFDHLDNDKTSPSEPGSSIGAAIAAS 346
TAD ++ F T + +W ++ + G D +++PS+ G I A+ +
Sbjct: 310 TADTTITHITAF-----DPDSTGQQVWQDLLQDGQLDS-PAGRSTPSQKGVGIAGAVCVT 363
Query: 347 LTIPSGSTRSVTFSLAWDCPEVKFFEK--VYHRRYTKFYGTLGDSAARIAHDAILEHAKW 404
+P + FSLAWD P + F K VY+RRYT+F+G+ GD+A ++H A+ + W
Sbjct: 364 GKLPPRGQCCLEFSLAWDMPRIMFGAKGQVYYRRYTRFFGSDGDAAPALSHYALCRYTDW 423
Query: 405 ECEIEAWQRPILEDKRFPEWYPITLFNELYYLNAGGTIWTDGSPPMQSLATIRERKFSLD 464
E I AWQ P+L+D+ P WY LFNELY+L GGTIW L+
Sbjct: 424 EERISAWQSPVLDDRSLPSWYKSALFNELYFLADGGTIW-------------------LE 464
Query: 465 TPSKIHTPASSDTALGTRL-LENGEENIGQFLYLEGAEYVMYNTYDVHFYSSFALVMLFP 523
P A + +G L + G+F YLEG EY MYNTYDVHFY+SFAL+ML+P
Sbjct: 465 VPED----ALPEELVGNMCQLRPILQEYGRFGYLEGQEYRMYNTYDVHFYASFALIMLWP 520
Query: 524 KLELSIQRDFAAAVMMHDPGTMKIMSDGKWVARKCLGAVPHDIGL--DDPWFEINSYNLF 581
KLELS+Q D A A + D + + G K +PHDIG D+PW +N+Y +
Sbjct: 521 KLELSLQYDMALATLREDLSWRQYLMSGVVAPVKRRNVIPHDIGDPDDEPWLRVNAYMIH 580
Query: 582 NSSRWKDLNSKFVLQVYRDFVATGDKNFARAVWPSVYIAMAYMEQFDKDGDGMIENEGFP 641
+++ WKDLN KFVLQVYRD+ TGD+ F R +WP M +FDKD DG+IEN G+
Sbjct: 581 DTADWKDLNLKFVLQVYRDYYLTGDQGFLRDMWPVCLAVMESEMKFDKDQDGLIENGGYA 640
Query: 642 DQTYDAWSANGVSAYCGGLWVAALQAASALANDVGDHASASYFWVRYQKAKAVYDS-LWN 700
DQTYDAW G SAYCGGLW+AA+ +A G F + + Y+ LWN
Sbjct: 641 DQTYDAWVTTGPSAYCGGLWLAAVAVMVQMAALCGTQDILDKFSSILSRGREAYERLLWN 700
Query: 701 GSYFNYDNSDGSSSTSIQADQLAGQWYARACGL----LPIADEAKVKKALTKIYDFNVLK 756
G Y+NYD S S SI +DQ AGQW+ RA GL + V AL I+ FNV
Sbjct: 701 GHYYNYDCSSQPQSYSIMSDQCAGQWFLRASGLGEGDTEVFPTPHVVCALQTIFKFNVQA 760
Query: 757 VKGGMCGAMNGMQPDGRIDMSGLQAREIWPGVTYGLAASMIQE 799
GG GA+NGMQP G D S +Q+ E+W GV YGLAA+MIQE
Sbjct: 761 FAGGAMGAVNGMQPHGVPDRSSVQSDEVWVGVVYGLAATMIQE 803
Score = 47.4 bits (111), Expect = 0.038, Method: Compositional matrix adjust.
Identities = 21/41 (51%), Positives = 27/41 (65%), Gaps = 2/41 (4%)
Query: 812 VYEVAWSEDGLGYSFQTPESWNNNDEYRSLCYMRPLTIWAM 852
Y W LG +FQTPE++ + +RSL YMRPL+IWAM
Sbjct: 890 CYRTVWER--LGLAFQTPEAYCQHQVFRSLAYMRPLSIWAM 928
>gi|413920605|gb|AFW60537.1| hypothetical protein ZEAMMB73_659971, partial [Zea mays]
Length = 449
Score = 501 bits (1290), Expect = e-139, Method: Compositional matrix adjust.
Identities = 256/421 (60%), Positives = 317/421 (75%), Gaps = 8/421 (1%)
Query: 17 KVDPGQPPQLTWQRKLDSNVKVPTGFTLSFQEFRHLAHIGFRLYRYSKQEEAKGKVPVFN 76
KV PGQ P LTW+ KL F L+++E LA IG RL R+ +E +KG++ V +
Sbjct: 28 KVYPGQLPVLTWEHKLTHVRHDLPPFRLTWREIMQLAGIGLRLSRHILEETSKGRIAVID 87
Query: 77 VFRKHHITSDQGIPLGGIGAGSIGRSYRGEFQRFKLFHGICDDAPVLANQFSVFVSRPNG 136
+K S QG+PLGG SIGRSY+G+FQR++LF G C+D VLANQFS F+SR +G
Sbjct: 88 PMKKRAARSGQGVPLGG----SIGRSYKGDFQRWQLFPGSCEDNAVLANQFSAFISRQDG 143
Query: 137 EKFSSVLCPRSPGVPKKNTDSGIESWDWNLKGENCTYHALFPRAWTVYDGEPDPELRIVC 196
+K+S+VL P P +PK + SGI SWDWN+ G+ TYHAL+PRAWTVYDGEPDPEL IVC
Sbjct: 144 KKYSTVLHPGKPDLPKGSDISGIGSWDWNMSGQQSTYHALYPRAWTVYDGEPDPELNIVC 203
Query: 197 RQISPFIPHNYKESSFPASVFTFTLSNSGQTSADVTLLFTWANSVAGDSGLSGHHFNSKT 256
RQISP IPHNY++SS+P +VFTFT++NSG T+ADVTLLFTWANSV G S L+G+H NS
Sbjct: 204 RQISPIIPHNYQQSSYPVAVFTFTVTNSGHTAADVTLLFTWANSVGGKSELTGYHSNSSM 263
Query: 257 MTKDGVHGLTLHHRTANGRPPVTFAVAAEETADVHVSECPCFLLSGNSKG--ITAKDMWN 314
+ KDGVHG+ LHHRTA+G+P VTF +AA+E D+ +SECP F++SG+S TAKDMWN
Sbjct: 264 IEKDGVHGILLHHRTADGQPHVTFVIAAQEKEDILISECPYFVISGSSASDEFTAKDMWN 323
Query: 315 EIKKHGSFDHLDNDKTSP-SEPGSSIGAAIAASLTIPSGSTRSVTFSLAWDCPEVKFFE- 372
+K+HGSFDHLD KTS S PGSSIGAAIAAS+ + +T+ ++FSLAW CPEVKF
Sbjct: 324 SVKEHGSFDHLDPIKTSMCSRPGSSIGAAIAASVKLAPKATQDISFSLAWACPEVKFSSG 383
Query: 373 KVYHRRYTKFYGTLGDSAARIAHDAILEHAKWECEIEAWQRPILEDKRFPEWYPITLFNE 432
K YHRRYTKFYGT GD+AA +AHDAILEHA WE +IE WQ PIL+D+RFP W+ TL
Sbjct: 384 KTYHRRYTKFYGTDGDAAAALAHDAILEHASWERQIEEWQDPILQDERFPAWFVPTLIIH 443
Query: 433 L 433
L
Sbjct: 444 L 444
>gi|443709232|gb|ELU03988.1| hypothetical protein CAPTEDRAFT_186839 [Capitella teleta]
Length = 802
Score = 493 bits (1269), Expect = e-136, Method: Compositional matrix adjust.
Identities = 297/787 (37%), Positives = 425/787 (54%), Gaps = 98/787 (12%)
Query: 88 GIPLGGIGAGSIGRSYRGEFQRFKLFHGICDDAPVLANQFSVFVSRPNGEKFSSVLCPRS 147
G P+GGIG GSIGR +RGEF RF++ G+ V A+QF + V + N VL P
Sbjct: 87 GAPIGGIGCGSIGRGFRGEFCRFQMIPGMYTHKTVEADQFILTVRKSNETIHQCVLSPL- 145
Query: 148 PGVPKKNTDSGIESWDWNLKGENCTYHALFPRAWTVYDGEPDPELRIVCRQISPFIPHNY 207
K + SW W + TYHAL+PR+WT Y+ P+ +L ++CRQISP P++Y
Sbjct: 146 -----KLKGKRLSSWRWGFPSQQATYHALYPRSWTRYE-IPECDLTLICRQISPIFPNDY 199
Query: 208 KESSFPASVFTFTLSNSGQTSADVTLLFTWANSVAGDSGLSGHHFNSK-----TMTKDGV 262
+E+ FP SVF + + N G + D++L FT+ N SG ++ T T D V
Sbjct: 200 QETCFPGSVFVWEVINKGSDAVDISLTFTFKNGQGVKEDRSGGVWSRAFQEGITGTDDCV 259
Query: 263 HGLTLHHRTANGRPPVTFAVAAEETADVHVSECPCFLLSGNSKGITAKDMWNEIKKHGSF 322
G+ +H + + P + VAA+ ++VSE F + T +++WN++ GS
Sbjct: 260 SGVEIHQKFND--LPCVYGVAAKHGPGINVSEKSYF-----NPQSTGEEVWNDLLAEGS- 311
Query: 323 DHLDNDKTSPSE--PGSSIGAAIAASLTIPSGSTRSVTFSLAWDCPEVKFFEK--VYHRR 378
L + S SE G + A+ A TI S S+TFSL WD P + F K YHRR
Sbjct: 312 --LKSTPVSGSETAKGEELACAVCAKTTISPYSHTSLTFSLTWDMPIIHFGAKRNKYHRR 369
Query: 379 YTKFYGTLGDSAARIAHDAILEHAKWECEIEAWQRPILEDKRFPEWYPITLFNELYYLNA 438
Y +F+G G + R+A A+ WE +I WQ+P L+ + P WY LFNE Y+++
Sbjct: 370 YARFFGHDGCATPRLAAHALNHFTSWEEKIAEWQQPTLKANKLPAWYKSALFNETYFISD 429
Query: 439 GGTIWTDGSPPMQSLATIRERKFSLDTPSKIHTPASSDTALGTRLLENGEENIGQFLYLE 498
GG++W D + + E F + G+F YLE
Sbjct: 430 GGSVWVDS---IDVKDAVSEHPFV--------------------------QEFGKFAYLE 460
Query: 499 GAEYVMYNTYDVHFYSSFALVMLFPKLELSIQRDFAAAVMMHDPGTMKIMSDGKWVARKC 558
G EY MYNT DVH+Y+SFAL+ L+P L+LS+Q D
Sbjct: 461 GHEYRMYNTLDVHYYASFALMKLWPNLQLSLQYDID------------------------ 496
Query: 559 LGAVPHDIGLDDPWFEINSYNLFNSSRWKDLNSKFVLQVYRDFVATGDKNFARAVWPSVY 618
++PW +N+Y + ++ WKDLN KFVLQVYRD+V T D+ + ++P
Sbjct: 497 ----------NEPWSRLNAYIIHPTNEWKDLNPKFVLQVYRDYVFTKDEQYLYEMYPQCK 546
Query: 619 IAMAYMEQFDKDGDGMIENEGFPDQTYDAWSANGVSAYCGGLWVAALQAASALANDVGDH 678
M + ++D D DG+I+N GF DQTYDAW+ G SAYCGGLW+AA++A + +A+ +G
Sbjct: 547 AVMDHSLRWDVDRDGIIDNSGFADQTYDAWTVTGASAYCGGLWLAAVKAFTEMAHRLGVV 606
Query: 679 ASASYFWVRYQKAKAVY-DSLWNG----SYFNYDNSDGSSSTSIQADQLAGQWYARACGL 733
+ ++ K + + LWNG Y+NYD S+ SI ADQ+ GQW+ +ACG+
Sbjct: 607 EDLMKYEEILERGKKNFEEKLWNGWVDCEYYNYDCSESGHHDSIMADQMCGQWFLKACGV 666
Query: 734 L--PIADEAKVKKALTKIYDFNVLKVKGGMCGAMNGMQPDGRIDMSGLQAREIWPGVTYG 791
+ A VK+AL ++ NVL GG GA+NGM+P+ + D++ Q+ E W GVTY
Sbjct: 667 ADNAVFPSANVKRALEAVFKHNVLPFDGGRMGAINGMRPNAKKDVTSCQSDEFWTGVTYA 726
Query: 792 LAASMIQEEMVDMAFQTAAGVYEVAWSEDGLGYSFQTPESWNNNDEYRSLCYMRPLTIWA 851
L A+MIQ M+D FQTA G Y W GL +FQTPE++ +N +RSL YMRPL IWA
Sbjct: 727 LGATMIQVGMIDKGFQTAYGAYHTCWERYGL--AFQTPEAYFDNRRFRSLGYMRPLAIWA 784
Query: 852 MQWALTK 858
+Q A+ K
Sbjct: 785 IQHAVEK 791
>gi|189239183|ref|XP_966847.2| PREDICTED: similar to bile acid beta-glucosidase, putative
[Tribolium castaneum]
gi|270010940|gb|EFA07388.1| hypothetical protein TcasGA2_TC016367 [Tribolium castaneum]
Length = 818
Score = 480 bits (1236), Expect = e-132, Method: Compositional matrix adjust.
Identities = 306/800 (38%), Positives = 428/800 (53%), Gaps = 62/800 (7%)
Query: 70 GKVPVFNVFRKHHITSDQGIPLGGIGAGSIGRSYRGEFQRFKLFHGICDDAPVLANQFSV 129
GK V + G+P+GGIG G+IGR YRGEF RF+L G+ + V ANQF V
Sbjct: 64 GKSIVMDYVHTLRAKQIYGVPIGGIGCGTIGRGYRGEFCRFQLKPGLYEYNTVDANQFIV 123
Query: 130 FVSRPNGEKFSSVLCPRSPGVPKKNTDSGIESWDWNLKGENCTYHALFPRAWTVYDGEPD 189
+ + E L P ++SW+ + G C Y L+PR+WT YD
Sbjct: 124 TIKDGHNETIFHSLLSTFPT-------KSLKSWESLIDGSECFYTGLYPRSWTEYDLSK- 175
Query: 190 PELRIVCRQISPFIPHNYKESSFPASVFTFTLSNSGQTSADVTLLFTWANSVAGDSGLSG 249
+ + CRQI+P IPHNYK+SS P ++F + + N + VT+ FT+ N V S
Sbjct: 176 YGVMLTCRQITPIIPHNYKDSSLPCAMFVWDVKNISEEDRTVTIAFTFKNGVGDKSRDRS 235
Query: 250 HHFNSKTMTKDGVHGLTLHHRTANGRPPVTFAVAAEETADVHVSECPCFLLSGNSKGITA 309
+SK T G+ L+H + ++++AA+ + +S+C F NS GI
Sbjct: 236 STCSSKAFTLADSEGVILYHTI--DKMQCSYSLAAKTNPQIDISKCLYF--DPNSDGIKP 291
Query: 310 KDMWNEIKKHGSFDHLDNDKTSPSEPGSSIGAAIAASLTIPSGSTRSVTFSLAWDCPEVK 369
W ++K +GSFD + +DK G + IA TI +G+T + +L WD P V+
Sbjct: 292 ---WIQLKNNGSFDKI-SDKNYGQIFGE-MACGIATKATIRAGNTVTSEMALVWDMPSVQ 346
Query: 370 FFEKVYHRRYTKFYGTLGD---SAARIAHDAILEHAKWECEIEAWQRPILEDKRFPEWYP 426
F +K ++YT++Y D + RI A + WE I WQR IL+D+ P+WY
Sbjct: 347 FPKK--QKKYTRYYTKHFDGDNTGLRIVRYAFENYQNWERAIYDWQRRILDDENLPDWYK 404
Query: 427 ITLFNELYYLNAGGTIWTDGSPPMQSLATIRERKFSLDTPSKIHTPASSDTALGTRLLEN 486
LFNE YY++ GG IW L +K + P H
Sbjct: 405 SALFNESYYISDGGAIWV-------ILDEQEAKKLPKNDPRLEH---------------- 441
Query: 487 GEENIGQFLYLEGAEYVMYNTYDVHFYSSFALVMLFPKLELSIQRDFAAAVMMHDPGTMK 546
G+F LEG EY MYNTYDVHFY+SF+ V+ FP L+ +Q D A+ P +K
Sbjct: 442 -----GKFALLEGHEYKMYNTYDVHFYASFSFVLNFPLLQSILQYDMRDAIFTEIPDKVK 496
Query: 547 IMSDGKWVARKCLGAVPHDIGL--DDPWFEINSYNLFNSSRWKDLNSKFVLQVYRDFVAT 604
++ DG+ RK VPHDIG ++P+ +NSY + + S+W+DLNSKFVLQV+RD T
Sbjct: 497 MLYDGEVCERKVPNTVPHDIGDPGEEPFILLNSYPIHDVSQWRDLNSKFVLQVFRDAFIT 556
Query: 605 G--DKNFA--RAVWPSVYIAMAYMEQFDKDGDGMIENEGFPDQTYDAWSANGVSAYCGGL 660
G D++ A ++ + Y M FD DGDG+IEN G PDQT+D W G SAYCGGL
Sbjct: 557 GLDDRSIAYLNDMYNACYTVMHKSLDFDVDGDGLIENSGSPDQTFDTWVMTGASAYCGGL 616
Query: 661 WVAALQAASALANDVGDHASASYFWVRYQKAKAVYD-SLWNGSYFNYDNSDGSSSTSIQA 719
W+AAL A + +A+ + F K A ++ LWNG +N+D SD SI A
Sbjct: 617 WLAALFAMTKIADALQKTEDKEKFQELLDKGTAAFERKLWNGKCYNFDCSD-KECRSIMA 675
Query: 720 DQLAGQWYARACGL-LPIADEAKVKKALTKIYDFNVLKVKGGMCGAMNGMQPDGRIDMSG 778
DQL GQWY R+CG + + +VK +L IY+ NV G GA+NG DG ID
Sbjct: 676 DQLCGQWYLRSCGFNYEVFPQDRVKTSLKTIYENNVQSFCDGRMGAVNGF-IDGVIDEFT 734
Query: 779 LQAREIWPGVTYGLAASMIQEEMVDMAFQTAAGVYEVAWSEDGLGYSFQTPESWNNNDEY 838
+Q++E+W GVTY LAASM+QE M AF TA G+++ + G SF TPE+ Y
Sbjct: 735 IQSQEVWTGVTYALAASMLQEGMRTEAFNTAGGMFKSM--SERFGLSFDTPEALYAAKYY 792
Query: 839 RSLCYMRPLTIWAMQWALTK 858
R++ YMRPL+IW+MQ A+ +
Sbjct: 793 RAIGYMRPLSIWSMQLAIDQ 812
>gi|71992715|ref|NP_001021680.1| Protein HPO-13, isoform a [Caenorhabditis elegans]
gi|18376555|emb|CAD21661.1| Protein HPO-13, isoform a [Caenorhabditis elegans]
Length = 959
Score = 480 bits (1236), Expect = e-132, Method: Compositional matrix adjust.
Identities = 302/852 (35%), Positives = 451/852 (52%), Gaps = 73/852 (8%)
Query: 37 KVPTGFTLSFQEFRHLAHIGFRLYRYSKQEEAKGKVPVFNVFRKHHITSDQGIPLGGIGA 96
+VP +Q + L + + K K K+ + N F+ G+P GGIG
Sbjct: 86 RVPFNRPTPYQIYEALPFVRRYFMYWMKHTFDKEKLFI-NTFQPLKHKPYYGVPCGGIGC 144
Query: 97 GSIGRSYRGEFQRFKLFHGICDDAP--VLANQFSVFVSRPNGEKFSSVLCPRSPGVPKKN 154
G+IGR +RG F +F L G+ + V A+QF + V + + VL P
Sbjct: 145 GAIGRDFRGGFCKFSLRPGLVEHKVDVVAADQFILSVRENDRCIYQKVLSAADVQRP--- 201
Query: 155 TDSGIESWDWNLKGENCTYHALFPRAWTVYDGEPDPELRIVCRQISPFIPHNYKESSFPA 214
+ +WD+ +N Y LFPR+WT + P+ EL +V RQ+SP +PHNY+++++P
Sbjct: 202 -SGQLSTWDFKFPKKNVHYRGLFPRSWTTFRV-PELELTVVIRQVSPVLPHNYEDTTYPV 259
Query: 215 SVFTFTLSNSGQTSAD--VTLLFTWANSVAG-----DSGLSGHHFNSK-------TMTKD 260
+F + N G S + +++ FT+ N ++ SG F ++ T ++
Sbjct: 260 CLFLIDVENGGSASREYEISVAFTFRNGTGNRRWEREAECSGVKFETEEEGLKPTTTAQE 319
Query: 261 GVHGLTLHHRTANGRPPVTFAVAAEETADVHVSECPCFLLSGNSKGITAKDMWNEIKKHG 320
+ G++L+H ++ P T+ +A + C F S N +WN +K+ G
Sbjct: 320 KISGVSLYHTISSM--PCTYGLATTHREHTTTTVCERFDPSRNGAA-----LWNHLKQTG 372
Query: 321 SFDHLDNDKTSPSEPGSSIGAAIAASLTIPSGSTRSVTFSLAWDCPEVKF--FEKVYHRR 378
+ + + + A+ T+P ST++ ++L+WD P+V F + YHRR
Sbjct: 373 DVPSCEEECLINAR---EMAVAVCNRFTLPPNSTKTHDYALSWDMPKVHFGSVARSYHRR 429
Query: 379 YTKFY--GTLGDSAARIAHDAILEHAKWECEIEAWQRPILEDKRFPEWYPITLFNELYYL 436
YT+F+ G +A + A+ KWE EIEAWQ PIL K+ P+WY +FNELY++
Sbjct: 430 YTRFFEASEAGSAADSLCVRALRLKDKWESEIEAWQSPILNHKKLPDWYKSAIFNELYFI 489
Query: 437 NAGGTIWTDGSPPMQSLATIRERKFSLDTPSKIHTPASSDTALGTRLLENGEENIGQFLY 496
GGT+W F D A +T L + ++ + IG+F Y
Sbjct: 490 TDGGTVW-----------------FEFD-----ENWAEHETHL-SHYTKDKMKEIGRFGY 526
Query: 497 LEGAEYVMYNTYDVHFYSSFALVMLFPKLELSIQRDFAAAVMMHDPGTMKIMSDGKWVAR 556
LE EY M NTYDVHFY+SFAL L+P+LE++IQ +F V + +G W
Sbjct: 527 LESWEYRMVNTYDVHFYASFALAELWPELEITIQSEFTDQVYHSIEKPTRFHMEGDWANV 586
Query: 557 KCLGAVPHDIG--LDDPWFEINSYNLFNSSRWKDLNSKFVLQVYRDFVATGDK--NFARA 612
K VPHD+G D+PW N+Y + ++ +WKDLN K+VL +RD+V ++ +F
Sbjct: 587 KTASRVPHDLGNPADEPWIATNAYVMHDTGKWKDLNMKYVLTSWRDYVVLSNEHEDFLFH 646
Query: 613 VWPSV-YIAMAYMEQFDKDGDGMIENEGFPDQTYDAWSANGVSAYCGGLWVAALQAASAL 671
WP+V I + +E +D+DGDGMIEN G DQTYDAW GVSAYCG LW+A+L+ A +
Sbjct: 647 TWPAVRMIMLEALENWDQDGDGMIENFGKADQTYDAWQMEGVSAYCGSLWLASLRVAIEM 706
Query: 672 ANDVGDHASASYFWVRYQKAKAVY-DSLWNGSYFNYDNSDGSSSTSIQADQLAGQWYARA 730
A + D + F +KAK V+ D+LW G+YF + S T + ADQL G W+ ++
Sbjct: 707 AGLMKDDETQQLFRNTLEKAKKVFIDTLWTGTYFRFCERSRSRET-VMADQLCGYWFLQS 765
Query: 731 CGLLPIADEA----KVKKALTKIYDFNVLKVKGGMCGAMNGMQPDGRIDMSGLQAREIWP 786
+ADE V+ AL IY NV K G GA+NGM+P G +D +QA E+W
Sbjct: 766 VS-PELADELLPNHMVRSALDTIYRLNVCKFGNGRMGAVNGMKPSGVVDREYIQADEMWT 824
Query: 787 GVTYGLAASMIQEEMVDMAFQTAAGVYEVAWSEDGLGYSFQTPESWNNNDEYRSLCYMRP 846
GVTY +A+ +IQ+ V+ AF TA+G Y + E GL Y QTPE+ + YR++ YMRP
Sbjct: 825 GVTYAVASLLIQQGEVEKAFHTASGSYLTCFEETGLQY--QTPEALYESKFYRAIGYMRP 882
Query: 847 LTIWAMQWALTK 858
L+IWAMQW+L K
Sbjct: 883 LSIWAMQWSLKK 894
>gi|71992731|ref|NP_001021682.1| Protein HPO-13, isoform c [Caenorhabditis elegans]
gi|15130776|emb|CAC48128.1| Protein HPO-13, isoform c [Caenorhabditis elegans]
Length = 922
Score = 480 bits (1236), Expect = e-132, Method: Compositional matrix adjust.
Identities = 302/852 (35%), Positives = 450/852 (52%), Gaps = 73/852 (8%)
Query: 37 KVPTGFTLSFQEFRHLAHIGFRLYRYSKQEEAKGKVPVFNVFRKHHITSDQGIPLGGIGA 96
+VP +Q + L + + K K K+ N F+ G+P GGIG
Sbjct: 49 RVPFNRPTPYQIYEALPFVRRYFMYWMKHTFDKEKL-FINTFQPLKHKPYYGVPCGGIGC 107
Query: 97 GSIGRSYRGEFQRFKLFHGICDDAP--VLANQFSVFVSRPNGEKFSSVLCPRSPGVPKKN 154
G+IGR +RG F +F L G+ + V A+QF + V + + VL P
Sbjct: 108 GAIGRDFRGGFCKFSLRPGLVEHKVDVVAADQFILSVRENDRCIYQKVLSAADVQRPSGQ 167
Query: 155 TDSGIESWDWNLKGENCTYHALFPRAWTVYDGEPDPELRIVCRQISPFIPHNYKESSFPA 214
+ +WD+ +N Y LFPR+WT + P+ EL +V RQ+SP +PHNY+++++P
Sbjct: 168 ----LSTWDFKFPKKNVHYRGLFPRSWTTFRV-PELELTVVIRQVSPVLPHNYEDTTYPV 222
Query: 215 SVFTFTLSNSGQTSAD--VTLLFTWANSVAG-----DSGLSGHHFNSK-------TMTKD 260
+F + N G S + +++ FT+ N ++ SG F ++ T ++
Sbjct: 223 CLFLIDVENGGSASREYEISVAFTFRNGTGNRRWEREAECSGVKFETEEEGLKPTTTAQE 282
Query: 261 GVHGLTLHHRTANGRPPVTFAVAAEETADVHVSECPCFLLSGNSKGITAKDMWNEIKKHG 320
+ G++L+H ++ P T+ +A + C F S N +WN +K+ G
Sbjct: 283 KISGVSLYHTISSM--PCTYGLATTHREHTTTTVCERFDPSRNGAA-----LWNHLKQTG 335
Query: 321 SFDHLDNDKTSPSEPGSSIGAAIAASLTIPSGSTRSVTFSLAWDCPEVKF--FEKVYHRR 378
+ + + + A+ T+P ST++ ++L+WD P+V F + YHRR
Sbjct: 336 DVPSCEEECLINAR---EMAVAVCNRFTLPPNSTKTHDYALSWDMPKVHFGSVARSYHRR 392
Query: 379 YTKFY--GTLGDSAARIAHDAILEHAKWECEIEAWQRPILEDKRFPEWYPITLFNELYYL 436
YT+F+ G +A + A+ KWE EIEAWQ PIL K+ P+WY +FNELY++
Sbjct: 393 YTRFFEASEAGSAADSLCVRALRLKDKWESEIEAWQSPILNHKKLPDWYKSAIFNELYFI 452
Query: 437 NAGGTIWTDGSPPMQSLATIRERKFSLDTPSKIHTPASSDTALGTRLLENGEENIGQFLY 496
GGT+W F D A +T L + ++ + IG+F Y
Sbjct: 453 TDGGTVW-----------------FEFD-----ENWAEHETHL-SHYTKDKMKEIGRFGY 489
Query: 497 LEGAEYVMYNTYDVHFYSSFALVMLFPKLELSIQRDFAAAVMMHDPGTMKIMSDGKWVAR 556
LE EY M NTYDVHFY+SFAL L+P+LE++IQ +F V + +G W
Sbjct: 490 LESWEYRMVNTYDVHFYASFALAELWPELEITIQSEFTDQVYHSIEKPTRFHMEGDWANV 549
Query: 557 KCLGAVPHDIG--LDDPWFEINSYNLFNSSRWKDLNSKFVLQVYRDFVATGDK--NFARA 612
K VPHD+G D+PW N+Y + ++ +WKDLN K+VL +RD+V ++ +F
Sbjct: 550 KTASRVPHDLGNPADEPWIATNAYVMHDTGKWKDLNMKYVLTSWRDYVVLSNEHEDFLFH 609
Query: 613 VWPSV-YIAMAYMEQFDKDGDGMIENEGFPDQTYDAWSANGVSAYCGGLWVAALQAASAL 671
WP+V I + +E +D+DGDGMIEN G DQTYDAW GVSAYCG LW+A+L+ A +
Sbjct: 610 TWPAVRMIMLEALENWDQDGDGMIENFGKADQTYDAWQMEGVSAYCGSLWLASLRVAIEM 669
Query: 672 ANDVGDHASASYFWVRYQKAKAVY-DSLWNGSYFNYDNSDGSSSTSIQADQLAGQWYARA 730
A + D + F +KAK V+ D+LW G+YF + S T + ADQL G W+ ++
Sbjct: 670 AGLMKDDETQQLFRNTLEKAKKVFIDTLWTGTYFRFCERSRSRET-VMADQLCGYWFLQS 728
Query: 731 CGLLPIADEA----KVKKALTKIYDFNVLKVKGGMCGAMNGMQPDGRIDMSGLQAREIWP 786
+ADE V+ AL IY NV K G GA+NGM+P G +D +QA E+W
Sbjct: 729 VS-PELADELLPNHMVRSALDTIYRLNVCKFGNGRMGAVNGMKPSGVVDREYIQADEMWT 787
Query: 787 GVTYGLAASMIQEEMVDMAFQTAAGVYEVAWSEDGLGYSFQTPESWNNNDEYRSLCYMRP 846
GVTY +A+ +IQ+ V+ AF TA+G Y + E GL Y QTPE+ + YR++ YMRP
Sbjct: 788 GVTYAVASLLIQQGEVEKAFHTASGSYLTCFEETGLQY--QTPEALYESKFYRAIGYMRP 845
Query: 847 LTIWAMQWALTK 858
L+IWAMQW+L K
Sbjct: 846 LSIWAMQWSLKK 857
>gi|71992723|ref|NP_001021681.1| Protein HPO-13, isoform b [Caenorhabditis elegans]
gi|50511739|emb|CAH04764.1| Protein HPO-13, isoform b [Caenorhabditis elegans]
Length = 930
Score = 480 bits (1236), Expect = e-132, Method: Compositional matrix adjust.
Identities = 302/852 (35%), Positives = 450/852 (52%), Gaps = 73/852 (8%)
Query: 37 KVPTGFTLSFQEFRHLAHIGFRLYRYSKQEEAKGKVPVFNVFRKHHITSDQGIPLGGIGA 96
+VP +Q + L + + K K K+ N F+ G+P GGIG
Sbjct: 57 RVPFNRPTPYQIYEALPFVRRYFMYWMKHTFDKEKL-FINTFQPLKHKPYYGVPCGGIGC 115
Query: 97 GSIGRSYRGEFQRFKLFHGICDDAP--VLANQFSVFVSRPNGEKFSSVLCPRSPGVPKKN 154
G+IGR +RG F +F L G+ + V A+QF + V + + VL P
Sbjct: 116 GAIGRDFRGGFCKFSLRPGLVEHKVDVVAADQFILSVRENDRCIYQKVLSAADVQRPSGQ 175
Query: 155 TDSGIESWDWNLKGENCTYHALFPRAWTVYDGEPDPELRIVCRQISPFIPHNYKESSFPA 214
+ +WD+ +N Y LFPR+WT + P+ EL +V RQ+SP +PHNY+++++P
Sbjct: 176 ----LSTWDFKFPKKNVHYRGLFPRSWTTFRV-PELELTVVIRQVSPVLPHNYEDTTYPV 230
Query: 215 SVFTFTLSNSGQTSAD--VTLLFTWANSVAG-----DSGLSGHHFNSK-------TMTKD 260
+F + N G S + +++ FT+ N ++ SG F ++ T ++
Sbjct: 231 CLFLIDVENGGSASREYEISVAFTFRNGTGNRRWEREAECSGVKFETEEEGLKPTTTAQE 290
Query: 261 GVHGLTLHHRTANGRPPVTFAVAAEETADVHVSECPCFLLSGNSKGITAKDMWNEIKKHG 320
+ G++L+H ++ P T+ +A + C F S N +WN +K+ G
Sbjct: 291 KISGVSLYHTISSM--PCTYGLATTHREHTTTTVCERFDPSRNGAA-----LWNHLKQTG 343
Query: 321 SFDHLDNDKTSPSEPGSSIGAAIAASLTIPSGSTRSVTFSLAWDCPEVKF--FEKVYHRR 378
+ + + + A+ T+P ST++ ++L+WD P+V F + YHRR
Sbjct: 344 DVPSCEEECLINAR---EMAVAVCNRFTLPPNSTKTHDYALSWDMPKVHFGSVARSYHRR 400
Query: 379 YTKFY--GTLGDSAARIAHDAILEHAKWECEIEAWQRPILEDKRFPEWYPITLFNELYYL 436
YT+F+ G +A + A+ KWE EIEAWQ PIL K+ P+WY +FNELY++
Sbjct: 401 YTRFFEASEAGSAADSLCVRALRLKDKWESEIEAWQSPILNHKKLPDWYKSAIFNELYFI 460
Query: 437 NAGGTIWTDGSPPMQSLATIRERKFSLDTPSKIHTPASSDTALGTRLLENGEENIGQFLY 496
GGT+W F D A +T L + ++ + IG+F Y
Sbjct: 461 TDGGTVW-----------------FEFD-----ENWAEHETHL-SHYTKDKMKEIGRFGY 497
Query: 497 LEGAEYVMYNTYDVHFYSSFALVMLFPKLELSIQRDFAAAVMMHDPGTMKIMSDGKWVAR 556
LE EY M NTYDVHFY+SFAL L+P+LE++IQ +F V + +G W
Sbjct: 498 LESWEYRMVNTYDVHFYASFALAELWPELEITIQSEFTDQVYHSIEKPTRFHMEGDWANV 557
Query: 557 KCLGAVPHDIG--LDDPWFEINSYNLFNSSRWKDLNSKFVLQVYRDFVATGDK--NFARA 612
K VPHD+G D+PW N+Y + ++ +WKDLN K+VL +RD+V ++ +F
Sbjct: 558 KTASRVPHDLGNPADEPWIATNAYVMHDTGKWKDLNMKYVLTSWRDYVVLSNEHEDFLFH 617
Query: 613 VWPSV-YIAMAYMEQFDKDGDGMIENEGFPDQTYDAWSANGVSAYCGGLWVAALQAASAL 671
WP+V I + +E +D+DGDGMIEN G DQTYDAW GVSAYCG LW+A+L+ A +
Sbjct: 618 TWPAVRMIMLEALENWDQDGDGMIENFGKADQTYDAWQMEGVSAYCGSLWLASLRVAIEM 677
Query: 672 ANDVGDHASASYFWVRYQKAKAVY-DSLWNGSYFNYDNSDGSSSTSIQADQLAGQWYARA 730
A + D + F +KAK V+ D+LW G+YF + S T + ADQL G W+ ++
Sbjct: 678 AGLMKDDETQQLFRNTLEKAKKVFIDTLWTGTYFRFCERSRSRET-VMADQLCGYWFLQS 736
Query: 731 CGLLPIADEA----KVKKALTKIYDFNVLKVKGGMCGAMNGMQPDGRIDMSGLQAREIWP 786
+ADE V+ AL IY NV K G GA+NGM+P G +D +QA E+W
Sbjct: 737 VS-PELADELLPNHMVRSALDTIYRLNVCKFGNGRMGAVNGMKPSGVVDREYIQADEMWT 795
Query: 787 GVTYGLAASMIQEEMVDMAFQTAAGVYEVAWSEDGLGYSFQTPESWNNNDEYRSLCYMRP 846
GVTY +A+ +IQ+ V+ AF TA+G Y + E GL Y QTPE+ + YR++ YMRP
Sbjct: 796 GVTYAVASLLIQQGEVEKAFHTASGSYLTCFEETGLQY--QTPEALYESKFYRAIGYMRP 853
Query: 847 LTIWAMQWALTK 858
L+IWAMQW+L K
Sbjct: 854 LSIWAMQWSLKK 865
>gi|10440626|gb|AAG16864.1|AC069145_13 unknown protein [Oryza sativa Japonica Group]
Length = 444
Score = 480 bits (1235), Expect = e-132, Method: Compositional matrix adjust.
Identities = 237/392 (60%), Positives = 300/392 (76%), Gaps = 1/392 (0%)
Query: 483 LLENGEENIGQFLYLEGAEYVMYNTYDVHFYSSFALVMLFPKLELSIQRDFAAAVMMHDP 542
L +NG EN+G+FLYLEG EY+M+ TYDVHFY+SFAL+ LFPK+ELSIQRDFA AV+ D
Sbjct: 18 LSKNGPENVGKFLYLEGVEYIMWCTYDVHFYASFALLDLFPKIELSIQRDFANAVLYEDR 77
Query: 543 GTMKIMSDGKWVARKCLGAVPHDIGLDDPWFEINSYNLFNSSRWKDLNSKFVLQVYRDFV 602
MK ++DG RK GAVPHD+G DPW E+N+YN+ ++S+WKDLN KFVLQVYRDF
Sbjct: 78 RRMKFLADGTSGIRKVKGAVPHDLGTHDPWHEMNAYNIHDTSKWKDLNPKFVLQVYRDFA 137
Query: 603 ATGDKNFARAVWPSVYIAMAYMEQFDKDGDGMIENEGFPDQTYDAWSANGVSAYCGGLWV 662
ATGD +F R VWP+V M YM QFD+DGDG+IEN+GFPDQTYDAW+ +G+SAYCGGLW+
Sbjct: 138 ATGDMSFGRDVWPAVCAVMDYMNQFDRDGDGLIENDGFPDQTYDAWTVHGISAYCGGLWL 197
Query: 663 AALQAASALANDVGDHASASYFWVRYQKAKAVYDS-LWNGSYFNYDNSDGSSSTSIQADQ 721
AALQAA+ +A+ +GD A + +++ +AKAVY++ LWNGSYFNYD+ S+S SIQADQ
Sbjct: 198 AALQAAATMAHRLGDRPFAEKYKLKFIQAKAVYEAKLWNGSYFNYDSGTSSNSRSIQADQ 257
Query: 722 LAGQWYARACGLLPIADEAKVKKALTKIYDFNVLKVKGGMCGAMNGMQPDGRIDMSGLQA 781
LAGQWYA + GL P+ DE K++ AL KI++FNV+KVKGG GA+NGM P+G++D + +Q+
Sbjct: 258 LAGQWYAASSGLPPLFDENKIRSALQKIFEFNVMKVKGGRLGAVNGMTPNGKVDETCMQS 317
Query: 782 REIWPGVTYGLAASMIQEEMVDMAFQTAAGVYEVAWSEDGLGYSFQTPESWNNNDEYRSL 841
REIW GVTYG+AA+M+ M F TA G++ WSE+G GY FQTPE W + YRSL
Sbjct: 318 REIWTGVTYGVAANMLLHGMEHQGFTTAEGIFIAGWSEEGYGYWFQTPEGWTIDGHYRSL 377
Query: 842 CYMRPLTIWAMQWALTKPKLSRQEIKHEISDR 873
YMRPL IWAMQWA + PK K + DR
Sbjct: 378 IYMRPLAIWAMQWARSPPKAILDAPKVNLMDR 409
>gi|449442371|ref|XP_004138955.1| PREDICTED: non-lysosomal glucosylceramidase-like [Cucumis sativus]
gi|449520930|ref|XP_004167485.1| PREDICTED: non-lysosomal glucosylceramidase-like [Cucumis sativus]
Length = 993
Score = 478 bits (1230), Expect = e-132, Method: Compositional matrix adjust.
Identities = 236/395 (59%), Positives = 306/395 (77%), Gaps = 3/395 (0%)
Query: 482 RLLE--NGEENIGQFLYLEGAEYVMYNTYDVHFYSSFALVMLFPKLELSIQRDFAAAVMM 539
++LE N EE++G+FLYLEG EYVM+ TYDVHFY+S+AL+ LFPK+EL+IQRDFA AV+
Sbjct: 563 KVLEPGNTEEDVGRFLYLEGVEYVMWCTYDVHFYASYALLELFPKIELNIQRDFAKAVLS 622
Query: 540 HDPGTMKIMSDGKWVARKCLGAVPHDIGLDDPWFEINSYNLFNSSRWKDLNSKFVLQVYR 599
D ++ +++GK+ RK GAVPHD+G DPW E+N+YN+ ++SRWKDLN+KFVLQVYR
Sbjct: 623 EDGRKVRFLAEGKYGIRKVRGAVPHDLGTHDPWNEMNAYNIHDTSRWKDLNTKFVLQVYR 682
Query: 600 DFVATGDKNFARAVWPSVYIAMAYMEQFDKDGDGMIENEGFPDQTYDAWSANGVSAYCGG 659
DF AT D +F VWPSV A+ YMEQFD+DGDG+IEN+GFPDQTYD W+ +G+SAYCG
Sbjct: 683 DFAATRDMSFGVDVWPSVRAAIEYMEQFDRDGDGVIENDGFPDQTYDTWTVHGISAYCGC 742
Query: 660 LWVAALQAASALANDVGDHASASYFWVRYQKAKAVYDS-LWNGSYFNYDNSDGSSSTSIQ 718
LWVAALQAA+A+A+++GD A ++ KA+ V ++ LWNGSYFNYD+ S+S SIQ
Sbjct: 743 LWVAALQAAAAMAHELGDQEFAETCKSKFLKARPVLEAELWNGSYFNYDSGSSSNSKSIQ 802
Query: 719 ADQLAGQWYARACGLLPIADEAKVKKALTKIYDFNVLKVKGGMCGAMNGMQPDGRIDMSG 778
ADQLAGQWY + GL P+ D+ K+K AL KIYDFNV+KV+GG GA+NGM P+G+ID +
Sbjct: 803 ADQLAGQWYTASSGLPPLFDDFKIKSALRKIYDFNVMKVRGGRMGAVNGMHPNGKIDETC 862
Query: 779 LQAREIWPGVTYGLAASMIQEEMVDMAFQTAAGVYEVAWSEDGLGYSFQTPESWNNNDEY 838
+Q+REIW GVTYG+AA+MI M + AF+TA G++ WSE+G GY FQTPE+W+ + Y
Sbjct: 863 MQSREIWTGVTYGVAATMILAGMEEEAFKTAEGIFLAGWSEEGFGYWFQTPEAWSTDGHY 922
Query: 839 RSLCYMRPLTIWAMQWALTKPKLSRQEIKHEISDR 873
RSL YMRPL+IW MQWAL+ PK K + DR
Sbjct: 923 RSLIYMRPLSIWGMQWALSLPKAILDAPKINVMDR 957
Score = 447 bits (1151), Expect = e-122, Method: Compositional matrix adjust.
Identities = 226/449 (50%), Positives = 308/449 (68%), Gaps = 21/449 (4%)
Query: 19 DPGQPPQLTWQRKLDSNVKVPTGFTLSFQEFRHLAHIGFRLYRYSKQEEAKGKVPVFNVF 78
D PP+ W+RKL+ + + F+++F E + +G RL+ Y ++E ++G+ + F
Sbjct: 31 DSASPPEQAWRRKLNGHANLLKEFSVTFVEAIKMVRLGIRLWSYVREEASQGRKAPIDPF 90
Query: 79 RKHHI--TSDQGIPLGGIGAGSIGRSYRGEFQRFKLFHGICDDAPVLANQFSVFVSRPNG 136
+ ++ QG+PLGG+G+GSI R +RGEF+++++ G C+ +PV+ANQFS+FVSR G
Sbjct: 91 TRESCKPSASQGVPLGGMGSGSISRGFRGEFRQWQIIPGTCEASPVMANQFSIFVSRDGG 150
Query: 137 -EKFSSVLCP---RSPGVPKKNTDSGIESWDWNLKGENCTYHALFPRAWTVYDGEPDPEL 192
+K++SVL P G +K+ DSGI SW WNL G++ TYHALFPRAWTVYDGEPDPEL
Sbjct: 151 GKKYASVLAPGQHEGLGCCRKDGDSGISSWGWNLDGQHSTYHALFPRAWTVYDGEPDPEL 210
Query: 193 RIVCRQISPFIPHNYKESSFPASVFTFTLSNSGQTSADVTLLFTWANSVAGDSGLSGHHF 252
++ CRQISPFIPHNY++SS P +VF +TL N+G+ A V+LLFTWANS+ G+S LSG+H
Sbjct: 211 KVSCRQISPFIPHNYRDSSLPTAVFVYTLVNTGRERAKVSLLFTWANSIGGNSHLSGNHV 270
Query: 253 NSKTMTKDGVHGLTLHH-----------RTANGRPPVTFAVAAEETADVHVSECPCFLLS 301
N + +DGV G+ LHH RTA G PPVTFA+AA ET +V V+ P F LS
Sbjct: 271 NEPFIDEDGVSGVLLHHKQNSKDLCKCFRTAKGNPPVTFAIAACETQNVSVTVLPSFGLS 330
Query: 302 GNSKGITAKDMWNEIKKHGSF--DHLDNDKTSPSEPGSSIGAAIAASLTIPSGSTRSVTF 359
S ITAKDMW+++ + G F D+ + + PS PG ++ AA+AAS + +V F
Sbjct: 331 EGS-CITAKDMWDKMVQDGQFDRDNFSSGPSMPSSPGETLCAAVAASAWVEPHGKCTVAF 389
Query: 360 SLAWDCPEVKFFEKV-YHRRYTKFYGTLGDSAARIAHDAILEHAKWECEIEAWQRPILED 418
SL+W P+VKF + YHRRYTKFYGT G +A +AHDA+ + +WE EIE WQRP+L D
Sbjct: 390 SLSWSSPKVKFLKGFSYHRRYTKFYGTSGKAAQTLAHDALTNYKRWEEEIEKWQRPVLMD 449
Query: 419 KRFPEWYPITLFNELYYLNAGGTIWTDGS 447
+R PEWY TLFNELY+L AGGT+W D S
Sbjct: 450 ERLPEWYKFTLFNELYFLVAGGTVWIDSS 478
>gi|159896834|ref|YP_001543081.1| hypothetical protein Haur_0301 [Herpetosiphon aurantiacus DSM 785]
gi|159889873|gb|ABX02953.1| protein of unknown function DUF608 [Herpetosiphon aurantiacus DSM
785]
Length = 774
Score = 477 bits (1228), Expect = e-131, Method: Compositional matrix adjust.
Identities = 288/788 (36%), Positives = 427/788 (54%), Gaps = 81/788 (10%)
Query: 87 QGIPLGGIGAGSIGRSYRGEFQRFKLFHGICDDAPVLANQFSVFVSRPNGEKFSSVLCPR 146
G+P+GG+G+G+IGR++RG++ R+ L G V NQ+SVF + ++ + VLC
Sbjct: 43 HGLPIGGMGSGAIGRNFRGDWSRWHLEVGKHVHRSVWPNQWSVFWQTAS-QQAAQVLCTT 101
Query: 147 SPGVPKKNTDSGIESWDWNLKGENCTYHALFPRAWTVYDGEPDPELRIVCRQISPFIPHN 206
P +TD + SW+WN YHALFPRAW Y PD L +V Q SP + N
Sbjct: 102 QP-----DTDE-LSSWNWNYPVGAGNYHALFPRAWFDYQ-HPDWPLELVQEQFSPVLAGN 154
Query: 207 YKESSFPASVFTFTLSNSGQTSADVTLLFTWANSVAGDS-GLSGHHFNSK---------T 256
KESSFP VFT+ ++N G + + L+ TW ++ A ++ GLS +S T
Sbjct: 155 LKESSFPVGVFTWRVTNRGSETVRLGLMLTWEHTRAVEAAGLSLQRQHSAWNDGNTSGVT 214
Query: 257 MTKDGVHGLTLHHRTANGRPPVTFAVAAEETADVHVSECPCFLLSGNSKGITAKDMWNEI 316
+T+ L+ H NG T+A+A + VS+ C+ ++ + A +W +
Sbjct: 215 LTQTSDQALSSH----NG----TWALAVQAPESASVSQWTCWDVAQD-----AAALWQDF 261
Query: 317 KKHGSF-DHLDNDKTSPSEPGSSIGAAIAASLTIPSGSTRSVTFSLAWDCPEVKFFEKV- 374
G D+ + K + + ++ AIA +L + G++ + FSLAWD P V+F ++
Sbjct: 262 ASDGQLADYPTSQKVAADQRSAT---AIAVTLELAPGASAVIPFSLAWDFPIVEFADQSR 318
Query: 375 YHRRYTKFYGTLGDSAARIAHDAILEHAKWECEIEAWQRPILEDKRFPEWYPITLFNELY 434
+++RYT+F+GT GD A +A ++ W IEAWQ PIL D + P WY LFNELY
Sbjct: 319 WYKRYTRFWGTNGDQAQALAVASLTNADAWRTAIEAWQNPILADDQRPFWYKSALFNELY 378
Query: 435 YLNAGGTIWTD---GSPPMQSLATIRERKFSLDTPSKIHTPASSDTALGTRLLENGEENI 491
YL GGT+W D G P E +++
Sbjct: 379 YLVDGGTLWVDRAVGGP------------------------------------EPAADDV 402
Query: 492 GQFLYLEGAEYVMYNTYDVHFYSSFALVMLFPKLELSIQRDFAAAVMMHDPGTMKIMSDG 551
G F YLE +Y Y T DV FYSS++++ L+P+LE F+ V D + I++
Sbjct: 403 GLFSYLECYDYPFYGTLDVSFYSSWSILALWPELERGEILAFSKTVNDADDTVVTIVATQ 462
Query: 552 KWVARKCLGAVPHDIGL--DDPWFEINSYNLFNSSRWKDLNSKFVLQVYRDFVATGDKNF 609
RK GA+PHD+G + P + N+Y+ + + WKDLN K++L++YRD D+
Sbjct: 463 VPAIRKAAGALPHDLGAPKEQPLIKTNAYDFQDINNWKDLNLKYILRIYRDVSLWNDQAM 522
Query: 610 ARAVWPSVYIAMAYMEQFDKDGDGMIENEGFPDQTYDAWSANGVSAYCGGLWVAALQAAS 669
A W ++ A+ Y+ QFD DGDG++++ G DQTYD W+ +G ++Y L + AL+AA
Sbjct: 523 LEATWDTIPTALEYVHQFDSDGDGLLDHSG-ADQTYDTWAMSGAASYSASLLICALEAAI 581
Query: 670 ALANDVGDHASASYFWVRYQKAKAVYDS-LWNGSYFNYDNSDGSSSTSIQADQLAGQWYA 728
LA +GDHA A + A+ +++ LWNG+YF Y +D I ADQL GQWYA
Sbjct: 582 RLAQRMGDHAQADAWSEWLAAARQSFETKLWNGTYFRYHTADTDLREVIMADQLVGQWYA 641
Query: 729 RACGLLPIADEAKVKKALTKIYDFNVLKVKGGMCGAMNGMQPDGRIDMSGLQAREIWPGV 788
A GL +A ++ AL +Y FNV++ G GA+NGM PDG +D S QA E+W G
Sbjct: 642 GAIGLPAVAPREMIRSALQTVYRFNVMQYANGALGAVNGMHPDGTVDTSSNQASEVWSGT 701
Query: 789 TYGLAASMIQEEMVDMAFQTAAGVYEVAWSEDGLGYSFQTPESWNNNDEYRSLCYMRPLT 848
TY +AA M+QE + +QTA G Y ++E LG F+TPE+W +R+ YMRP +
Sbjct: 702 TYAIAAMMLQEGLDLEGWQTAWGAYNATYNE--LGLWFRTPEAWGIERTFRASMYMRPQS 759
Query: 849 IWAMQWAL 856
IWA++ AL
Sbjct: 760 IWAIEHAL 767
>gi|341901617|gb|EGT57552.1| hypothetical protein CAEBREN_15139 [Caenorhabditis brenneri]
Length = 925
Score = 474 bits (1219), Expect = e-130, Method: Compositional matrix adjust.
Identities = 297/846 (35%), Positives = 447/846 (52%), Gaps = 71/846 (8%)
Query: 37 KVPTGFTLSFQEFRHLAHIGFRLYRYSKQEEAKGKVPVFNVFRKHHITSDQGIPLGGIGA 96
+VP +Q + L + + K K K+ + N F+ G+P GGIG
Sbjct: 48 RVPFNRPTPYQIYEALPFVRRYFMYWMKHTFDKEKLFI-NTFQPLKHKPYYGVPCGGIGC 106
Query: 97 GSIGRSYRGEFQRFKLFHGICDDAP--VLANQFSVFVSRPNGEKFSSVLCPRSPGVPKKN 154
G+IGR +RG F +F L G+ + V A+QF + V + + VL + V +
Sbjct: 107 GAIGRDFRGGFCKFSLRPGLVEHKVDVVAADQFILSVRENDTCIYQKVLS--AADVHR-- 162
Query: 155 TDSGIESWDWNLKGENCTYHALFPRAWTVYDGEPDPELRIVCRQISPFIPHNYKESSFPA 214
S + SWD+ ++ Y LFPR+WT + P+ L + RQ+SP +PHNY+E+++P
Sbjct: 163 --SQLSSWDFGFPKKHLHYRGLFPRSWTTFRV-PELGLTVTIRQVSPVLPHNYQETTYPV 219
Query: 215 SVFTFTLSNSGQTSA-DVTLLFTWANSVAG-----DSGLSGHHFNSKTMTKDGVHGLTLH 268
+F + N+ A DV++ FT+ N +S S F+S ++ V G++L+
Sbjct: 220 CLFLVDVENTSNNRAYDVSIAFTFRNGTGNRRWDRESECSAGRFDSIHSDRESVSGVSLY 279
Query: 269 HRTANGRPPVTFAVAAEETADVHVSECPCFLLSGNSKGITAKDMWNEIKKHGSFDHLDND 328
H ++ P T+ +A+ D H+S C F S + WN +K+ G + +
Sbjct: 280 HTISSM--PCTYGLASTHKKDSHLSICERFDPSRSGAAF-----WNHLKQTGDVPEFEEE 332
Query: 329 KTSPSEPGSSIGAAIAASLTIPSGSTRSVTFSLAWDCPEVKF--FEKVYHRRYTKFY--G 384
+ + ++ +P S+++ ++L+WD P+V F + YHRRYT+F+
Sbjct: 333 ---CPQNAREMAVSVCNRFHLPPQSSKTHDYALSWDMPKVHFGSVARSYHRRYTRFFEGS 389
Query: 385 TLGDSAARIAHDAILEHAKWECEIEAWQRPILEDKRFPEWYPITLFNELYYLNAGGTIWT 444
G A + A+L +WE EIE WQ PIL + P+WY +FNELY++ GGT+W
Sbjct: 390 EAGSVADSLCVRALLARNQWEEEIEKWQSPILTHPKLPDWYKSAIFNELYFMVDGGTVWF 449
Query: 445 DGSPPMQSLATIRERKFSLDTPSKIHTPASSDTALGTRLLENGEENIGQFLYLEGAEYVM 504
+ E S+ T K+ +NIG+F YLE EY M
Sbjct: 450 EFDENWAD----HEAHLSVYTKEKM-------------------KNIGRFGYLESWEYRM 486
Query: 505 YNTYDVHFYSSFALVMLFPKLELSIQRDFAAAVMMHDPGTMKIMSDGKWVARKCLGAVPH 564
NTYDVHFY+S+AL L+P+LEL++Q +F V + +G W K VPH
Sbjct: 487 VNTYDVHFYASYALAELWPQLELTVQSEFTDQVYHSIEKPTRFHMEGDWANVKTASRVPH 546
Query: 565 DIG--LDDPWFEINSYNLFNSSRWKDLNSKFVLQVYRDFVATGD--KNFARAVWPSV-YI 619
D+G D+PW N+Y + ++ +WKDLN K+VL +RD+V + + F WP++ I
Sbjct: 547 DLGNPADEPWIATNAYVMHDTGKWKDLNMKYVLTSWRDYVVLSEAHQEFLHHTWPAIKMI 606
Query: 620 AMAYMEQFDKDGDGMIENEGFPDQTYDAWSANGVSAYCGGLWVAALQAASALANDVGDHA 679
+ +E +D++GDGMIEN G DQTYDAW GVSAYCG LW+A+L+ + +A V D
Sbjct: 607 MLEALENWDQNGDGMIENFGKADQTYDAWQMEGVSAYCGSLWLASLRVSVEMAAIVKDEI 666
Query: 680 SASYFWVRYQKAKAVY-DSLWNGSYFNYDNSDGSSSTSIQADQLAGQWYARACG------ 732
+ + F +KAK V+ D LW G+YF + S T + ADQL G W+ ++
Sbjct: 667 TENLFKGTLEKAKKVFIDLLWTGTYFRFCERSRSRET-VMADQLCGYWFLQSVSPEMADD 725
Query: 733 LLPIADEAKVKKALTKIYDFNVLKVKGGMCGAMNGMQPDGRIDMSGLQAREIWPGVTYGL 792
LLP V+ AL IY NV K G GA+NGM+P G +D +QA E+W GVTY +
Sbjct: 726 LLP---NHMVRSALDTIYRLNVCKFGNGQMGAVNGMKPSGVVDREYIQADEMWTGVTYAV 782
Query: 793 AASMIQEEMVDMAFQTAAGVYEVAWSEDGLGYSFQTPESWNNNDEYRSLCYMRPLTIWAM 852
A+ +IQ+ V+ AF TA+G Y + + GL Y QTPE+ + YR++ YMRPL+IWAM
Sbjct: 783 ASLLIQQGEVEKAFHTASGSYLTCFEQAGLQY--QTPEALYESKFYRAIGYMRPLSIWAM 840
Query: 853 QWALTK 858
QW+L K
Sbjct: 841 QWSLKK 846
>gi|324500616|gb|ADY40283.1| Non-lysosomal glucosylceramidase [Ascaris suum]
Length = 890
Score = 467 bits (1202), Expect = e-128, Method: Compositional matrix adjust.
Identities = 315/826 (38%), Positives = 444/826 (53%), Gaps = 69/826 (8%)
Query: 58 RLYRYSKQEEAKGKVPVFNVFRKHHITSDQGIPLGGIGAGSIGRSYRGEFQRFKLFHGIC 117
R + Y + K + N F+ G+P GGIG GSIGR +RG F +F L G+
Sbjct: 50 RYFIYWLKHVGKKEKLFINTFQPLKHKPHYGVPCGGIGCGSIGRDFRGGFCKFGLRPGLI 109
Query: 118 DDAP--VLANQFSVFVSRPNGEKFSSV----LCPRSPGVPKKNTDSGIESWDWNLKGENC 171
+ + A+QF V + R NGE F++V LC +P P +T+ + +WD++ +
Sbjct: 110 EHKVDVLRADQFIVTLRRKNGEGFATVYQKVLCA-APTTPATSTE--LNAWDFSFPPDCL 166
Query: 172 TYHALFPRAWTVYDGEPDPELRIVCRQISPFIPHNYKESSFPASVFTFTLSNSGQTSADV 231
TY L+PR+WT YD + + IV RQISP IP +Y++SS P SVF F + N G V
Sbjct: 167 TYRGLYPRSWTRYDIR-EYRVSIVMRQISPVIPGDYRDSSLPVSVFVFDVENGGDEDLYV 225
Query: 232 TLLFTWANSV-----AGDSGLSGHHFNSKTMTKDGVHGLTLHHRTANGRPPVTFAVAAEE 286
++ FT+ N + + + F SK V G+ L H + R T+AVAA
Sbjct: 226 SITFTFRNGTGYHKWSQEGNCTSSLFESK-HAHSRVEGVCLEHTISRMR--CTYAVAANC 282
Query: 287 TADVHVSECPCFLLSGNSKGITAKDMWNEIKKHGSFDHLDNDKTSPSEPGSSIGAAIAAS 346
VS C F S + ++W + G HL N P + A+ A
Sbjct: 283 NQRTTVSRCAAFDPSK-----SGDEIWQSLWSTG---HLQN--VGEVTPTRELAIAVCAQ 332
Query: 347 LTIPSGSTRS--VTFSLAWDCPEVKF--FEKVYHRRYTKFYGTLGDSAARIAHDAILEHA 402
+ + S + F+LAWD P V+F ++ Y RRYT+F+G GD+A + ++ +
Sbjct: 333 TIVKANSKAEAPMEFALAWDMPIVQFGARKRNYRRRYTRFFGDRGDAAPALCAHSLTVYE 392
Query: 403 KWECEIEAWQRPILEDKRFPEWYPITLFNELYYLNAGGTIWTDGSPPMQSLATIRERKFS 462
+WE I WQ P+L + P+WY LFNELY++ GG++W + F
Sbjct: 393 QWERRINDWQDPVLNHRNLPDWYKSALFNELYFMTDGGSVWFE---------------FE 437
Query: 463 LDTPSKIHTPASSDTALGTRLLENGEENIGQFLYLEGAEYVMYNTYDVHFYSSFALVMLF 522
D + + AL ++ G+F YLE EY M NTYDVHFY+SFAL L+
Sbjct: 438 EDWACEEQHLSPYSIAL--------MKDFGRFGYLESWEYRMVNTYDVHFYASFALAQLW 489
Query: 523 PKLELSIQRDFAAAVMMHDPGTMKIMSDGKWVARKCLGAVPHDIG--LDDPWFEINSYNL 580
P LE IQ +F + D + +G +K VPHD+G DPW N+Y +
Sbjct: 490 PSLEHVIQAEFTDQIEHFDGKPVIYHMEGNRGPKKSRTRVPHDLGNPAVDPWLLTNAYVM 549
Query: 581 FNSSRWKDLNSKFVLQVYRDF--VATGDKNFARAVWPSVY-IAMAYMEQFDKDGDGMIEN 637
++ +WKDLN KF+L +RD+ V D+ F VWP+V + ++ +D+DGDGMIEN
Sbjct: 550 HDTGKWKDLNLKFILISFRDYTCVLGKDQRFLNLVWPAVKSLINEGLKNWDQDGDGMIEN 609
Query: 638 EGFPDQTYDAWSANGVSAYCGGLWVAALQAASALANDVGDHASASYFWVRYQKAKAVY-D 696
G DQTYDAW +GVSAYCG LW+AAL+AA +A +VGD A A + +A+ Y
Sbjct: 610 FGAADQTYDAWKMSGVSAYCGCLWLAALRAALVMAREVGDSACAHRYAETLSRAREAYIT 669
Query: 697 SLWNGSYFNYDNSDGSSSTSIQADQLAGQWYARACGLLPIADE----AKVKKALTKIYDF 752
LWNG Y+N+D S T I ADQL G W+ ++ +ADE A VK AL IY++
Sbjct: 670 KLWNGEYYNFDERSRSRHT-IMADQLCGYWFLQSIS-PELADELLPAAHVKSALNIIYNY 727
Query: 753 NVLKVKGGMCGAMNGMQPDGRIDMSGLQAREIWPGVTYGLAASMIQEEMVDMAFQTAAGV 812
NVL+ GG GA+NGM+ DG +D + +QA E+W GVTY LAA +IQ+ V F TA G
Sbjct: 728 NVLRFAGGRLGAVNGMRRDGTVDRNYIQADEMWTGVTYALAAFLIQQGAVQRGFDTAWGC 787
Query: 813 YEVAWSEDGLGYSFQTPESWNNNDEYRSLCYMRPLTIWAMQWALTK 858
Y+ ++ GL Y QTPE+ YR++ YMRPL+IWAMQWAL +
Sbjct: 788 YDGCFNRFGLQY--QTPEALYEQRFYRAIGYMRPLSIWAMQWALIR 831
>gi|195115764|ref|XP_002002426.1| GI12858 [Drosophila mojavensis]
gi|193913001|gb|EDW11868.1| GI12858 [Drosophila mojavensis]
Length = 877
Score = 466 bits (1200), Expect = e-128, Method: Compositional matrix adjust.
Identities = 297/877 (33%), Positives = 433/877 (49%), Gaps = 163/877 (18%)
Query: 103 YRGEFQRFKLFHGICDDAPVLANQFSVFVSRPNG-EKFSSVLC----------PRSPGVP 151
Y GEF RF+L GI + V ANQF V + G F S+L ++P
Sbjct: 47 YAGEFCRFQLRPGIYEYNVVHANQFIVTIKDHKGCTIFQSLLSRCSTKSFKAKTKTPSTN 106
Query: 152 KKNTDSG------------------------IESWDWNLKGENCTYHALFPRAWTVYDGE 187
+ D+ + +W N++ C+Y L+PRAWT YD
Sbjct: 107 NNDIDASTDEAEAQSTKCQLPHCSCSRNRQPLSTWHSNIEDSRCSYTGLYPRAWTEYDLS 166
Query: 188 PDPELRIVCRQISPFIPHNYKESSFPASVFTFTLSNSGQTSADVTLLFTWANSVAGDSGL 247
+R+VCRQISP IPH+YKESS P +VF +++ N V++ FT+ N
Sbjct: 167 -HYGVRLVCRQISPVIPHDYKESSLPCAVFVWSVQNVCDQERKVSITFTFKNGTGNKKQD 225
Query: 248 SGHHFNSKTMTKDGVHGLTLHHRTANGRPPVTFAVAAEETADVHVSECPCFLLSGNSKGI 307
+ + ++ + + G+ + A+ P ++ +A ++ ++ CP F +G
Sbjct: 226 AEGNPEAQLIAEGNAKGVAIRQTIADM--PCSYNLACRVLPEISITRCPQFDPAG----- 278
Query: 308 TAKDMWNEIKKHGSFDHLDNDKTSPSEPGSSIGAAIAASLTIPSGSTRSVTFSLAWDCPE 367
+ + +W ++K++G L+ TS + IG A+ A L + ++ + F LAWD P+
Sbjct: 279 SGEQLWAQLKEYG---QLNEQPTSETLKTKDIGVAVCAQLALKPQASHELEFVLAWDMPK 335
Query: 368 VKFFEKV--YHRRYTKFYGTLGDSAARIAHDAILEHAKWECEIEAWQRPILEDKRFPEWY 425
++F K+ + R YTK++ G+ RI A+ +A WE I+AWQRPIL D+ P+WY
Sbjct: 336 IQFPRKLQTHTRYYTKYFDDSGECGPRICEYALKHYASWERLIDAWQRPILNDEGLPDWY 395
Query: 426 PITLFNELYYLNAGGTIWTDGSPPMQSLATIRERKFSLDTPSKIHTPASSDTALGTRL-L 484
+FN+LYY++ GGTIW D++LG +L
Sbjct: 396 KCAIFNQLYYISDGGTIWL-----------------------------KCDSSLGNQLAF 426
Query: 485 ENGEENIGQFLYLEGAEYVMYNTYDVHFYSSFALVMLFPKLELSIQRDFAAAVMMHDPGT 544
++ G+F YLEG EY MYNTYDVHFY+S AL L+P L++S+Q DF A+ T
Sbjct: 427 DDPRLAYGRFGYLEGHEYRMYNTYDVHFYASPALAHLWPNLQVSLQYDFKDAIGAELNDT 486
Query: 545 MKIMSDGKWVARKCLGAVPHDIGLDD--PWFEINSYNLFNSSRWKDLNSKFVLQVYRDFV 602
K++ DGK + RK VPHD+G D P+ IN YN+ + + WKDLN+KFVLQVYRD+
Sbjct: 487 RKMLYDGKVMPRKIKNCVPHDLGDPDEEPFTLINCYNIHDVNDWKDLNTKFVLQVYRDYY 546
Query: 603 ATGDKNFARAVWPSVYIAMAYM------EQFDKDG------------------------- 631
+ A+A S + ++ ++ EQ+ +D
Sbjct: 547 VLNELAQAQADNASKFSSIEFIDKESLYEQYSQDNNRKSSADGKQQNRKSASMYINETNG 606
Query: 632 -----DGMI---------------------------ENEGFPDQTYDAWSANGVSAYCGG 659
D M+ EN PDQTYDAW +G SAYC G
Sbjct: 607 KVYLMDAMVYLKSMYGACKAIMERTIEYDKDNDGLIENTKMPDQTYDAWVMDGPSAYCSG 666
Query: 660 LWVAALQAASALANDVGDHASASYFWVRYQK-----AKAVYDSLWNGSYFNYDNSDGSSS 714
LW+AALQA S +A + +RYQ +++ + LWNGSY+ +D S
Sbjct: 667 LWLAALQAMSVMATLLDQPNDC----LRYQDILEKGKRSLEEKLWNGSYYRFDLSHNHRD 722
Query: 715 TSIQADQLAGQWYARACGL-LPIADEAKVKKALTKIYDFNVLKVKGGMCGAMNGMQPD-- 771
T I ADQL G WY ++CG I + V+ AL +IYD NV+ G GA NG +
Sbjct: 723 T-IMADQLCGHWYLKSCGFDYEIYPKENVRTALKRIYDNNVMGFHDGNIGAANGFIANVS 781
Query: 772 -----GRIDMSGLQAREIWPGVTYGLAASMIQEEMVDMAFQTAAGVYEVAWSEDGLGYSF 826
G +D S +QA E+WPGV Y LAA+MIQE M D AFQTA G+Y+ G+ Y
Sbjct: 782 EPSKPGHVDNSNIQAEEVWPGVVYALAATMIQEGMFDEAFQTAGGMYKTLSQRIGMNY-- 839
Query: 827 QTPESWNNNDEYRSLCYMRPLTIWAMQWALTKPKLSR 863
+TPE+ YRS+ YMRPL+IW+MQ A + + R
Sbjct: 840 ETPEALYGEKRYRSIGYMRPLSIWSMQVAWERRRALR 876
>gi|45552032|ref|NP_788055.2| CG33090 [Drosophila melanogaster]
gi|74876618|sp|Q7KT91.1|C3390_DROME RecName: Full=Non-lysosomal glucosylceramidase; Short=NLGase
gi|45445134|gb|AAO41192.2| CG33090 [Drosophila melanogaster]
gi|201065481|gb|ACH92150.1| FI02015p [Drosophila melanogaster]
Length = 948
Score = 463 bits (1192), Expect = e-127, Method: Compositional matrix adjust.
Identities = 289/863 (33%), Positives = 429/863 (49%), Gaps = 149/863 (17%)
Query: 103 YRGEFQRFKLFHGICDDAPVLANQFSVFVSRPNGEKFSSVLCPRSPGVPKKNTDSG---- 158
Y GEF RF++ GI + VLANQF V + P G L + K + G
Sbjct: 132 YAGEFCRFQMRPGIYEYNVVLANQFIVTIKDPKGCTIFQSLLSKCSTRDKTSDPDGDPDG 191
Query: 159 -----------------IESWDWNLKGENCTYHALFPRAWTVYDGEPDPELRIVCRQISP 201
+ +W N++ C+Y L+PR+WT YD +R+ CRQ+SP
Sbjct: 192 ERTKCQLPNCSSRAKQPLSAWHSNIEDTRCSYTGLYPRSWTEYDLS-HYGVRLTCRQVSP 250
Query: 202 FIPHNYKESSFPASVFTFTLSNSGQTSADVTLLFTWANSVAGDSGLSGHHFNSKTMTKDG 261
IPH Y+ESS P +VF +++ N V++ FT+ N + S+ +++
Sbjct: 251 VIPHEYRESSLPCAVFVWSVENVCDQERKVSITFTFKNGTGNKKQDAEGGAESQLISEGN 310
Query: 262 VHGLTLHHRTANGRPPVTFAVAAEETADVHVSECPCFLLSGNSKGITAKDMWNEIKKHGS 321
G+++ + + P ++ +A ++ ++ CP F +GN + +W ++K+HG
Sbjct: 311 AKGVSIRQKISEM--PCSYNLACRVLPEISITRCPQFDPAGN-----GEQLWAQLKEHG- 362
Query: 322 FDHLDNDKTSPSEPGSSIGAAIAASLTIPSGSTRSVTFSLAWDCPEVKFFEKV--YHRRY 379
L TS + IG A+ + + ++ + F LAWD P+++F K+ + R Y
Sbjct: 363 --QLSEHPTSEALKTKDIGVAVCGQVALKPMASHDLEFVLAWDMPKIQFPRKMQTHTRYY 420
Query: 380 TKFYGTLGDSAARIAHDAILEHAKWECEIEAWQRPILEDKRFPEWYPITLFNELYYLNAG 439
TK++ GDS RI A+ +++ WE I+AWQRPIL D+ P+WY +FN+LY+++ G
Sbjct: 421 TKYFDDSGDSGPRICEYALRQYSTWERLIDAWQRPILNDETLPDWYKCAIFNQLYFISDG 480
Query: 440 GTIWTDGSPPMQSLATIRERKFSLDTPSKIHTPASSDTALGTRL-LENGEENIGQFLYLE 498
GTIW D++LG L ++ G+F YLE
Sbjct: 481 GTIWL-----------------------------KCDSSLGKELAYDDPRLAYGRFGYLE 511
Query: 499 GAEYVMYNTYDVHFYSSFALVMLFPKLELSIQRDFAAAVMMHDPGTMKIMSDGKWVARKC 558
G EY MYNTYDVHFY+S AL L+P L++S+Q DF A+ T K++ DGK + RK
Sbjct: 512 GHEYRMYNTYDVHFYASPALAHLWPNLQVSLQYDFKDAIAAELNDTRKMLYDGKVMPRKV 571
Query: 559 LGAVPHDIGLDD--PWFEINSYNLFNSSRWKDLNSKFVLQVYRDFVA------------- 603
VPHD+G D P+ IN YN+ + + WKDLN+KFVLQVYRD+
Sbjct: 572 KNCVPHDLGDPDEEPFTLINCYNIHDVNDWKDLNTKFVLQVYRDYYVLNELAQAQSDNAS 631
Query: 604 -------------------------TGDKNFARAVWPSVYI-----------AMAYMEQF 627
+ D+ S+YI A+ Y++
Sbjct: 632 KFSSIEFIDKESLYELYSQDNKRKNSADEKQQNRKSASMYINETNGKVYLMDAIGYLKAM 691
Query: 628 DKDGDGMIE--------NEGFPDQT------YDAWSANGVSAYCGGLWVAALQAASALAN 673
++E N+G + T YD+W +G SAYC GLW+AALQA SA+A
Sbjct: 692 YASCKAIMERTIEYDKDNDGLIENTKMPDQTYDSWVMDGPSAYCSGLWLAALQAMSAMAT 751
Query: 674 DVGDHASASYFWVRYQK-----AKAVYDSLWNGSYFNYDNSDGSSSTSIQADQLAGQWYA 728
+ D + +RYQ +++ + LWNGSY+ +D S T I ADQL G WY
Sbjct: 752 -ILDQPNDC---LRYQDILEKGKRSLEEKLWNGSYYRFDLSHSHRDT-IMADQLCGHWYL 806
Query: 729 RACGL-LPIADEAKVKKALTKIYDFNVLKVKGGMCGAMNGMQPD-------GRIDMSGLQ 780
++CG I + V+ AL +IYD NV+ G GA NG + G +D S +Q
Sbjct: 807 KSCGFDYEIYPKENVRTALKRIYDNNVMGFHEGNIGAANGFIANASEPTKPGHVDNSNIQ 866
Query: 781 AREIWPGVTYGLAASMIQEEMVDMAFQTAAGVYEVAWSEDGLGYSFQTPESWNNNDEYRS 840
A E+WPGV Y LAA+MIQE M + AFQTA G+Y+ +G +F+TPE+ YRS
Sbjct: 867 AEEVWPGVVYALAATMIQEGMFEEAFQTAGGMYKTL--SQRIGMNFETPEALYGEKRYRS 924
Query: 841 LCYMRPLTIWAMQWALTKPKLSR 863
+ YMRPL+IW+MQ AL + + R
Sbjct: 925 IGYMRPLSIWSMQVALERRRAQR 947
>gi|94968715|ref|YP_590763.1| hypothetical protein Acid345_1688 [Candidatus Koribacter versatilis
Ellin345]
gi|94550765|gb|ABF40689.1| protein of unknown function DUF608 [Candidatus Koribacter
versatilis Ellin345]
Length = 811
Score = 463 bits (1192), Expect = e-127, Method: Compositional matrix adjust.
Identities = 284/821 (34%), Positives = 418/821 (50%), Gaps = 91/821 (11%)
Query: 66 EEAKGKVPVFNVFRKHHITSDQGIPLGGIGAGSIGRSYRGEFQRFKLFHGICDDAPVLAN 125
+ A GK P + + QG P+GG GAG+ R+YRG F+R+ + GI V AN
Sbjct: 43 QNAGGKKPGLPIIDDGYW---QGAPVGGFGAGTFSRTYRGNFERWHVKAGIHKYQNVPAN 99
Query: 126 QFSVFVSRPNGEKFSSVLCPRSPGVPKKNTDSGIESWDWNLKGENCTYHALFPRAWTVYD 185
QF+VF + VL + P + SW+W Y AL+P++W Y
Sbjct: 100 QFAVFAQADGDAAVAQVLSTQKP------EHGELSSWNWTYPVGAGDYAALYPKSWFAYK 153
Query: 186 GEPDPELRIVCRQISPFIPHNYKESSFPASVFTFTLSNSGQTSADVTLLFTWANSV---- 241
P + + Q SP IP NYKESS+P +V+ + ++N Q V++LF+W N V
Sbjct: 154 -SPQLPITLTLEQFSPVIPDNYKESSYPVAVYNWYVTNPSQKRVTVSILFSWTNMVGWFR 212
Query: 242 -------AGDSGLSGHHFNSKTMTKDGVHGLTLHHRTANGRPPVT------FAVAAEETA 288
A + +H+ S+ + + G+ + PV+ FA+A+ +
Sbjct: 213 DTTTGFGAALNSQDINHYKSEAIKSSQMQGIVFDRLRSE---PVSDEWDGQFAIASLAGS 269
Query: 289 DVHVSECPCFLLSGNSKGITAKDMWNEIKKHGSFDHLDNDKTSPSEPGSSIGAAIAASLT 348
V +S + SG+ D+W K G L N + G S+ AIA T
Sbjct: 270 GVEISYLTTYRPSGD-----GSDVWTPFAKDG---RLPNVAAGLASSGDSLAGAIAVRFT 321
Query: 349 IPSGSTRSVTFSLAWDCPEVKFFE-KVYHRRYTKFYGTLGDSAARIAHDAILEHAKWECE 407
+ G R++ +LAWD P ++F + + R YT+F+ G +A ++A A+ + W +
Sbjct: 322 LAPGEKRTIPMALAWDLPFIQFGSGRKWARHYTRFFDRFGTNAWKLARTALEQGNDWSRQ 381
Query: 408 IEAWQRPILEDKRFPEWYPITLFNELYYLNAGGTIWTDGSPPMQSLATIRERKFSLDTPS 467
I+AWQ+P + D+ P WY LFNELY L GGT+W + TP
Sbjct: 382 IDAWQKPYISDESKPLWYRGMLFNELYDLVDGGTMWAH----------------EVGTPG 425
Query: 468 KIHTPASSDTALGTRLLENGEENIGQFLYLEGAEYVMYNTYDVHFYSSFALVMLFPKLEL 527
H P++ F YLE +Y Y T DV FY SF L +P++E
Sbjct: 426 NPHHPSAVKD--------------DSFSYLECFDYAYYGTLDVRFYGSFPLAKFWPEIEK 471
Query: 528 SIQRDFAAAVMMHDPGTMKIMSDGKW----------VARKCLGAVPHDIG--LDDPWFEI 575
R++A + P + W RK G+ PHD+G L+D
Sbjct: 472 QEMREYADTI----PENIDQKYIWAWKSDNEHKLEPTQRKVAGSAPHDLGSPLEDALILP 527
Query: 576 NSYNLFNSSRWKDLNSKFVLQVYRDFVATG--DKNFARAVWPSVYIAMAYMEQFDKDGDG 633
N YN + S WKDLNSK+VL V+RD+V +G DK F + + +V +M ++ QFDK+ DG
Sbjct: 528 NQYNYQSVSFWKDLNSKYVLMVWRDYVMSGSKDKAFLQYNYNAVKQSMQFLRQFDKNNDG 587
Query: 634 MIENEGFPDQTYDAWSANGVSAYCGGLWVAALQAASALANDVGDHASASYFWVRYQKAKA 693
+IEN+G+PDQTYD WSA G SAY G L++AAL+A +A +GD +A+ +++A+
Sbjct: 588 LIENDGYPDQTYDNWSARGESAYSGSLYLAALRATEEMAKLLGDQRTAADTAALFKRAQD 647
Query: 694 VY-DSLWNGSYFNYDNSDGSSSTSIQADQLAGQWYARACGLLPIADEAKVKKALTKIYDF 752
+ LWNG+YFNYD S +I A+QLAGQWYA GL + AL K+YDF
Sbjct: 648 SFVKKLWNGTYFNYD-VGSSYKDAIMAEQLAGQWYASLTGLGDLVPRDMQHSALKKVYDF 706
Query: 753 NVLKVKGGMCGAMNGMQPDGRIDMSGLQAREIWPGVTYGLAASMIQEEMVDMAFQTAAGV 812
NV+K++ G GA+NG+ G + Q E+W GVT+ +AA+M+Q + + F TA GV
Sbjct: 707 NVMKLQNGTMGALNGISASGEVLKDNEQTEEVWTGVTFAVAATMLQNGLREEGFNTAKGV 766
Query: 813 YEVAWSEDGLGYSFQTPESWNNNDEYRSLCYMRPLTIWAMQ 853
Y V + D GY F+TPE+++ N YR+ YMRP IW+M+
Sbjct: 767 YNVVY--DQKGYWFRTPEAYDTNGMYRASMYMRPGAIWSME 805
>gi|268569852|ref|XP_002640631.1| Hypothetical protein CBG08749 [Caenorhabditis briggsae]
Length = 899
Score = 459 bits (1181), Expect = e-126, Method: Compositional matrix adjust.
Identities = 291/846 (34%), Positives = 442/846 (52%), Gaps = 69/846 (8%)
Query: 37 KVPTGFTLSFQEFRHLAHIGFRLYRYSKQEEAKGKVPVFNVFRKHHITSDQGIPLGGIGA 96
+VP +Q + L + + K K K+ + N F+ G+P GGIG
Sbjct: 42 RVPFNRPTPYQIYEALPFVRRYFMYWMKHTFDKEKLFI-NTFQPLKHKPYYGVPCGGIGC 100
Query: 97 GSIGRSYRGEFQRFKLFHGICDDAP--VLANQFSVFVSRPNGEKFSSVLCPRSPGVPKKN 154
G+IGR +RG F +F L G+ + V A+QF + V + + VL
Sbjct: 101 GAIGRDFRGGFCKFSLRPGLVEHKVDVVAADQFILSVRENDRCIYQKVLS----AADVHR 156
Query: 155 TDSGIESWDWNLKGENCTYHALFPRAWTVYDGEPDPELRIVCRQISPFIPHNYKESSFPA 214
+ + +WD+ +N Y LFPR+WT + PD L +V RQ+SP +P+NY+++++P
Sbjct: 157 SSGQLSTWDFGFPKKNVHYRGLFPRSWTTFRI-PDLGLTVVIRQVSPVLPNNYEDTTYPV 215
Query: 215 SVFTFTLSNS-GQTSADVTLLFTWANSVAG-----DSGLSGHHFNSKTMTKDGVHGLTLH 268
+F + N+ DV++ FT+ N ++ S F + + + V G++L+
Sbjct: 216 CLFLIDVENTTNDREYDVSVAFTFRNGTGNRRWEREAECSAVMFETPEDSAEQVTGVSLY 275
Query: 269 HRTANGRPPVTFAVAAEETADVHVSECPCFLLSGNSKGITAKDMWNEIKKHGSFDHLDND 328
HR + T+ +A+ + D +S C F S + +WN + + G L+ +
Sbjct: 276 HRISGM--SCTYGLASTHSKDSVLSVCERFDPSKSGGA-----LWNHLMQTGDVPSLEEE 328
Query: 329 KTSPSEPGSSIGAAIAASLTIPSGSTRSVTFSLAWDCPEVKFFE--KVYHRRYTKFY--G 384
+ + A++ T+P S+++ ++L+WD P+V F + YHRRYT+F+
Sbjct: 329 CAINAR---EMAVAVSNRFTLPPLSSKTHDYALSWDMPKVHFGSTGRSYHRRYTRFFEAS 385
Query: 385 TLGDSAARIAHDAILEHAKWECEIEAWQRPILEDKRFPEWYPITLFNELYYLNAGGTIWT 444
G +A + A+ KWE EIE WQ PIL + P+WY +FNELY++ GGT+W
Sbjct: 386 EAGSAADSLCVRALRLRHKWEEEIEKWQSPILTHDKLPDWYKSAIFNELYFITDGGTVW- 444
Query: 445 DGSPPMQSLATIRERKFSLDTPSKIHTPASSDTALGTRLLENGEENIGQFLYLEGAEYVM 504
F D H S+ + +NIG+F YLE EY M
Sbjct: 445 ----------------FEFDENWADHEAHLSNYT------KEKMKNIGRFGYLESWEYRM 482
Query: 505 YNTYDVHFYSSFALVMLFPKLELSIQRDFAAAVMMHDPGTMKIMSDGKWVARKCLGAVPH 564
NTYDVHFY+S+AL L+P+LEL++Q +F V + +G W K VPH
Sbjct: 483 VNTYDVHFYASYALAELWPELELTVQSEFTDQVYHSIEKPTRFHMEGDWANVKTASRVPH 542
Query: 565 DIG--LDDPWFEINSYNLFNSSRWKDLNSKFVLQVYRDFVATGDKN--FARAVWPSV-YI 619
D+G D+PW N+Y + ++ +WKDLN K+VL +RD+V +++ F WP+V I
Sbjct: 543 DLGNPADEPWIATNAYVMHDTGKWKDLNMKYVLTSWRDYVVLSEEHQEFLHHTWPAVKMI 602
Query: 620 AMAYMEQFDKDGDGMIENEGFPDQTYDAWSANGVSAYCGGLWVAALQAASALANDVGDHA 679
+ + +D++GDGMIEN G DQTYDAW GVSAYCG LW+A+L+ + +A + D
Sbjct: 603 MLEALANWDQNGDGMIENFGKADQTYDAWQMEGVSAYCGSLWLASLRVSIEMAGLLEDGE 662
Query: 680 SASYFWVRYQKAKAVY-DSLWNGSYFNYDNSDGSSSTSIQADQLAGQWYARACG------ 732
+ F KAK V+ D LW G+YF + S T + ADQL G W+ ++
Sbjct: 663 TKKLFLETLNKAKKVFVDVLWTGTYFRFCERSRSRET-VMADQLCGYWFLQSVSPEMADD 721
Query: 733 LLPIADEAKVKKALTKIYDFNVLKVKGGMCGAMNGMQPDGRIDMSGLQAREIWPGVTYGL 792
LLP V+ AL IY NV + G GA+NGM+P G +D +QA E+W GVTY +
Sbjct: 722 LLP---SHMVRSALDTIYRLNVCQFGNGQMGAVNGMKPSGVVDREYIQADEMWTGVTYAV 778
Query: 793 AASMIQEEMVDMAFQTAAGVYEVAWSEDGLGYSFQTPESWNNNDEYRSLCYMRPLTIWAM 852
A+ +IQ+ V+ AF TA+G Y + + GL Y QTPE+ + YR++ YMRPL+IWAM
Sbjct: 779 ASLLIQQGEVEKAFHTASGSYLTCFEQTGLQY--QTPEALYESKFYRAIGYMRPLSIWAM 836
Query: 853 QWALTK 858
QW+L K
Sbjct: 837 QWSLKK 842
>gi|170036781|ref|XP_001846240.1| bile acid beta-glucosidase [Culex quinquefasciatus]
gi|167879683|gb|EDS43066.1| bile acid beta-glucosidase [Culex quinquefasciatus]
Length = 913
Score = 459 bits (1181), Expect = e-126, Method: Compositional matrix adjust.
Identities = 303/865 (35%), Positives = 434/865 (50%), Gaps = 157/865 (18%)
Query: 103 YRGEFQRFKLFHGICDDAPVLANQFSVFVSRPNGEK-FSSVL------------------ 143
+ GEF RF++ G+ + V ANQF V + N F S+L
Sbjct: 101 FAGEFCRFQMRPGLYEYNTVHANQFIVTIKDENNVTIFQSLLSSYRTKDDEVHDVCAPNQ 160
Query: 144 ---------CPRSPGVPKKNTDSGIESWDWNLKGENCTYHALFPRAWTVYDGEPDPELRI 194
C P P + SWD + C+Y AL+PRAW+ YD + +++
Sbjct: 161 TTLLHGLMPCETRPKTP-------LASWDSTINSSKCSYTALYPRAWSEYDLS-EHGIKL 212
Query: 195 VCRQISPFIPHNYKESSFPASVFTFTLSNSGQTSADVTLLFTWANSVAGDSGLSGHHFNS 254
V RQISP IPH+YKESS P +VF +T+ N VT+ FT+ N + + +
Sbjct: 213 VQRQISPIIPHDYKESSLPVAVFAWTVENVCGKDRHVTITFTFKNGTGNKKQDAEGNAET 272
Query: 255 KTMTKDGVHGLTLHHRTANGRPPVTFAVAAEETADVHVSECPCFLLSGNSKGITAKDMWN 314
G+++ A+ P T+ V+ + +++++++ C F SGN + +WN
Sbjct: 273 AAFAHGTAKGVSIKQTIAD--LPCTYCVSCKTSSEINLTRCEKFDPSGN-----GEKLWN 325
Query: 315 EIKKHGSFDHLDNDKTSPSEPGSSIGAAIAASLTIPSGSTRSVTFSLAWDCPEVKF--FE 372
++K++G HL S + A++A + +PS S+ + FSL WD P++ F
Sbjct: 326 DLKENG---HLTEKSGDESLKTKDVAVAVSAQILVPSESSSELEFSLVWDMPKIHFTKKM 382
Query: 373 KVYHRRYTKFYGTLGDSAARIAHDAILEHAKWECEIEAWQRPILEDKRFPEWYPITLFNE 432
K Y++ YTK++G GD+ I+ A+ + KWE I+ WQRPIL+D P+WY +FNE
Sbjct: 383 KEYYKYYTKYFGKNGDAGPVISDYALRNYGKWESLIDEWQRPILDDSDLPDWYKSAIFNE 442
Query: 433 LYYLNAGGTIWTDGSPPMQSLATIRERKFSLDTPSKIHTPASSDTALGTRLLENGEENIG 492
LY++ GG+IW + + D P + G
Sbjct: 443 LYFIADGGSIWL----------AVDDTTLPYDDPRLAY---------------------G 471
Query: 493 QFLYLEGAEYVMYNTYDVHFYSSFALVMLFPKLELSIQRDFAAAVMMHDPGTMKIMSDGK 552
++ YLEG EY MYNTYDVHFY+S AL L+P L++S+Q D+ AV K + DGK
Sbjct: 472 RYAYLEGHEYRMYNTYDVHFYASHALASLWPNLQVSLQYDYKDAVQREITEGRKHLYDGK 531
Query: 553 WVARKCLGAVPHDIGLDD--PWFEINSYNLFNSSRWKDLNSKFVLQVYRDF--------- 601
+ RK +VPHD+G D P+ IN+Y + + S WKDLN+KF+LQVYRD+
Sbjct: 532 VIPRKIKSSVPHDLGDPDEEPFELINAYPIHDVSEWKDLNTKFILQVYRDYYTLNHYAQL 591
Query: 602 ------------------------------VATGDKNFARAVWPSVYI-----------A 620
V+ DK R S+YI A
Sbjct: 592 NADNASKFSSIEFIDKESMFEMYVQDNRNKVSPEDKAQNRKS-ASMYINETNGKVYLMDA 650
Query: 621 MAYME--------------QFDKDGDGMIENEGFPDQTYDAWSANGVSAYCGGLWVAALQ 666
M Y++ ++DKD DG+IEN PDQTYD W +G S+YCGGLW+A+L
Sbjct: 651 MTYLKSMYPACKQVLEKTLEWDKDNDGLIENSKSPDQTYDTWVMDGPSSYCGGLWLASLH 710
Query: 667 AASALANDVGDHASASYFWVRYQKAKAVYDS-LWNGSYFNYDNSDGSSSTSIQADQLAGQ 725
S +AN + + +K KA ++ LWNGSY+ +D S SS SI +DQL G
Sbjct: 711 CMSVMANLLDQSEDCIKYREILEKGKASFEEKLWNGSYYKFD-SQSSSKNSIMSDQLCGH 769
Query: 726 WYARACGL-LPIADEAKVKKALTKIYDFNVLKVKGGMCGAMNGMQP------DGRIDMSG 778
WY R CG I + V+ AL IYD NV++ GG GA+NG P DGR D S
Sbjct: 770 WYLRCCGFDYDIVPKENVRTALKTIYDNNVMRFCGGNMGAVNGYIPSSQPNKDGRADSST 829
Query: 779 LQAREIWPGVTYGLAASMIQEEMVDMAFQTAAGVYEVAWSEDGLGYSFQTPESWNNNDEY 838
LQA E+W GVTY LA++MI E M AFQTA G+Y+ + +G SF+TPE+ Y
Sbjct: 830 LQAEEVWTGVTYALASTMIHEGMFTEAFQTAGGLYQTL--SEKIGMSFETPEALYAERHY 887
Query: 839 RSLCYMRPLTIWAMQWALTKPKLSR 863
R++ YMRPL+IW+MQ A + K +R
Sbjct: 888 RAIGYMRPLSIWSMQTAWEQKKQNR 912
>gi|357440847|ref|XP_003590701.1| Non-lysosomal glucosylceramidase [Medicago truncatula]
gi|355479749|gb|AES60952.1| Non-lysosomal glucosylceramidase [Medicago truncatula]
Length = 992
Score = 458 bits (1179), Expect = e-126, Method: Compositional matrix adjust.
Identities = 235/407 (57%), Positives = 299/407 (73%), Gaps = 6/407 (1%)
Query: 492 GQFLYLEGAEYVMYNTYDVHFYSSFALVMLFPKLELSIQRDFAAAVMMHDPGTMKIMSDG 551
G+FLYLEG EYVM+ TYDVHFY+SFAL+MLFP++EL+IQRDFA AV+ D +K +++G
Sbjct: 575 GRFLYLEGVEYVMWCTYDVHFYASFALLMLFPRIELNIQRDFAQAVLCEDGRKVKFLAEG 634
Query: 552 KWVARKCLGAVPHDIGLDDPWFEINSYNLFNSSRWKDLNSKFVLQVYRDFVATGDKNFAR 611
W RK GAVPHD+G DPW E+N+YN+ ++S+WKDLN KFVLQVYRDF ATGD F
Sbjct: 635 NWGTRKVYGAVPHDLGTHDPWHEMNAYNIHDTSKWKDLNPKFVLQVYRDFSATGDLQFGV 694
Query: 612 AVWPSVYIAMAYMEQFDKDGDGMIENEGFPDQTYDAWSANGVSAYCGGLWVAALQAASAL 671
VWP+V AM YMEQFD+D DG+IEN+GFPDQTYD W+ +GVSAYCGGLW+AALQAA+A+
Sbjct: 695 DVWPAVRAAMEYMEQFDRDADGLIENDGFPDQTYDTWTVHGVSAYCGGLWLAALQAAAAM 754
Query: 672 ANDVGDHASASYFWVRYQKAKAVYDS-LWNGSYFNYDNSDGSSSTSIQADQLAGQWYARA 730
A +GD A ++ KAK VY+ LWNGSYFNYD+ S+S SIQADQLAGQWY +
Sbjct: 755 ALQLGDRDFAETCKRKFLKAKPVYEQKLWNGSYFNYDSGSSSNSKSIQADQLAGQWYTAS 814
Query: 731 CGLLPIADEAKVKKALTKIYDFNVLKVKGGMCGAMNGMQPDGRIDMSGLQAREIWPGVTY 790
GL + D+ K+K +L K++DFNV+KVKGG GA+NGM P+G++D + +Q+REIW GVTY
Sbjct: 815 SGLPSLFDDFKIKSSLRKVFDFNVMKVKGGRMGAVNGMHPNGKVDETCMQSREIWAGVTY 874
Query: 791 GLAASMIQEEMVDMAFQTAAGVYEVAWSEDGLGYSFQTPESWNNNDEYRSLCYMRPLTIW 850
G+AA+MI M + AF TA G++ WSE+G GY FQTPE++ + YRSL YMRPL+IW
Sbjct: 875 GVAATMILAGMEEEAFTTAEGIFLAGWSEEGSGYWFQTPEAFTIDGHYRSLIYMRPLSIW 934
Query: 851 AMQWALTKPKLSRQEIKHEISDRD-----SSYLEQHAAFSKVASLLK 892
MQ+ALT PK + K DR S L + K+A+ K
Sbjct: 935 GMQYALTMPKAVLEAPKINFMDRIHLSPVSGGLHKETGVKKIATKTK 981
Score = 439 bits (1129), Expect = e-120, Method: Compositional matrix adjust.
Identities = 222/446 (49%), Positives = 302/446 (67%), Gaps = 17/446 (3%)
Query: 18 VDPGQPPQLTWQRKLDSNVKVPTGFTLSFQEFRHLAHIGFRLYRYSKQEEAKGKVPVFNV 77
+D PP+ W+R+L+S+ + F ++F E + +G R++ Y ++E + G+ +
Sbjct: 30 IDNAAPPEQAWRRRLNSHANLLKEFRVTFMEAIKMVRLGIRMWSYVREEASHGRKAPIDP 89
Query: 78 FRKHHI--TSDQGIPLGGIGAGSIGRSYRGEFQRFKLFHGICDDAPVLANQFSVFVSRPN 135
F + ++ QG+PLGG+G+GSI R +RGEF+++++ G+C+ +PV+ANQFS+FVSR
Sbjct: 90 FTRESCKPSASQGVPLGGMGSGSISRGFRGEFRQWQIIPGLCEPSPVMANQFSIFVSREG 149
Query: 136 GEK-FSSVLCP---RSPGVPKKNTDSGIESWDWNLKGENCTYHALFPRAWTVYDGEPDPE 191
G K F+SVL P G +K D GI SW WNL G++ TYHALFPRAWTVYDGEPDPE
Sbjct: 150 GNKSFASVLAPGQHEGVGACRKADDQGISSWGWNLNGQHSTYHALFPRAWTVYDGEPDPE 209
Query: 192 LRIVCRQISPFIPHNYKESSFPASVFTFTLSNSGQTSADVTLLFTWANSVAGDSGLSGHH 251
L+I CRQISPFIPHNY+ESS PA+VF +TL N+G+ A V+LLFTWANS+ G+S LSG H
Sbjct: 210 LKISCRQISPFIPHNYRESSLPAAVFVYTLVNTGKERAKVSLLFTWANSIGGNSHLSGDH 269
Query: 252 FNSKTMTKDGVHGLTLHH-------RTANGRPPVTFAVAAEETADVHVSECPCFLLSGNS 304
N + +DGV G+ L+H RTA PPVTF++AA ET +V VS PCF LS S
Sbjct: 270 VNEPFIAEDGVSGVLLYHKQVVEYFRTAKDNPPVTFSIAACETQNVSVSVLPCFGLSDRS 329
Query: 305 KGITAKDMWNEIKKHGSFD--HLDNDKTSPSEPGSSIGAAIAASLTIPSGSTRSVTFSLA 362
+TAK MW ++ K G FD + + + PS PG ++ AA+AAS + +V FSLA
Sbjct: 330 S-VTAKGMWTKMVKDGQFDRENFSSGPSMPSSPGETLCAAVAASAWVEPHGKCTVAFSLA 388
Query: 363 WDCPEVKFFE-KVYHRRYTKFYGTLGDSAARIAHDAILEHAKWECEIEAWQRPILEDKRF 421
W P+VKF + ++RRYTKFYGT +A +AHDA+ + +WE EI WQ PIL+D++
Sbjct: 389 WSSPKVKFVKGSTFNRRYTKFYGTSERAAVHLAHDALTHYTRWEEEIAKWQDPILKDEKL 448
Query: 422 PEWYPITLFNELYYLNAGGTIWTDGS 447
PEWY TLFNELY+L AGGTIW D +
Sbjct: 449 PEWYKFTLFNELYFLVAGGTIWIDST 474
>gi|357440839|ref|XP_003590697.1| Non-lysosomal glucosylceramidase [Medicago truncatula]
gi|355479745|gb|AES60948.1| Non-lysosomal glucosylceramidase [Medicago truncatula]
Length = 1103
Score = 458 bits (1179), Expect = e-126, Method: Compositional matrix adjust.
Identities = 235/407 (57%), Positives = 299/407 (73%), Gaps = 6/407 (1%)
Query: 492 GQFLYLEGAEYVMYNTYDVHFYSSFALVMLFPKLELSIQRDFAAAVMMHDPGTMKIMSDG 551
G+FLYLEG EYVM+ TYDVHFY+SFAL+MLFP++EL+IQRDFA AV+ D +K +++G
Sbjct: 686 GRFLYLEGVEYVMWCTYDVHFYASFALLMLFPRIELNIQRDFAQAVLCEDGRKVKFLAEG 745
Query: 552 KWVARKCLGAVPHDIGLDDPWFEINSYNLFNSSRWKDLNSKFVLQVYRDFVATGDKNFAR 611
W RK GAVPHD+G DPW E+N+YN+ ++S+WKDLN KFVLQVYRDF ATGD F
Sbjct: 746 NWGTRKVYGAVPHDLGTHDPWHEMNAYNIHDTSKWKDLNPKFVLQVYRDFSATGDLQFGV 805
Query: 612 AVWPSVYIAMAYMEQFDKDGDGMIENEGFPDQTYDAWSANGVSAYCGGLWVAALQAASAL 671
VWP+V AM YMEQFD+D DG+IEN+GFPDQTYD W+ +GVSAYCGGLW+AALQAA+A+
Sbjct: 806 DVWPAVRAAMEYMEQFDRDADGLIENDGFPDQTYDTWTVHGVSAYCGGLWLAALQAAAAM 865
Query: 672 ANDVGDHASASYFWVRYQKAKAVYDS-LWNGSYFNYDNSDGSSSTSIQADQLAGQWYARA 730
A +GD A ++ KAK VY+ LWNGSYFNYD+ S+S SIQADQLAGQWY +
Sbjct: 866 ALQLGDRDFAETCKRKFLKAKPVYEQKLWNGSYFNYDSGSSSNSKSIQADQLAGQWYTAS 925
Query: 731 CGLLPIADEAKVKKALTKIYDFNVLKVKGGMCGAMNGMQPDGRIDMSGLQAREIWPGVTY 790
GL + D+ K+K +L K++DFNV+KVKGG GA+NGM P+G++D + +Q+REIW GVTY
Sbjct: 926 SGLPSLFDDFKIKSSLRKVFDFNVMKVKGGRMGAVNGMHPNGKVDETCMQSREIWAGVTY 985
Query: 791 GLAASMIQEEMVDMAFQTAAGVYEVAWSEDGLGYSFQTPESWNNNDEYRSLCYMRPLTIW 850
G+AA+MI M + AF TA G++ WSE+G GY FQTPE++ + YRSL YMRPL+IW
Sbjct: 986 GVAATMILAGMEEEAFTTAEGIFLAGWSEEGSGYWFQTPEAFTIDGHYRSLIYMRPLSIW 1045
Query: 851 AMQWALTKPKLSRQEIKHEISDRD-----SSYLEQHAAFSKVASLLK 892
MQ+ALT PK + K DR S L + K+A+ K
Sbjct: 1046 GMQYALTMPKAVLEAPKINFMDRIHLSPVSGGLHKETGVKKIATKTK 1092
Score = 258 bits (660), Expect = 7e-66, Method: Compositional matrix adjust.
Identities = 131/311 (42%), Positives = 183/311 (58%), Gaps = 58/311 (18%)
Query: 18 VDPGQPPQLTWQRKLDSNVKVPTGFTLSFQEFRHLAHIGFRLYRYSKQEEAKGKVPVFNV 77
+D PP+ W+R+L+S+ + F ++F E + +G R++ Y ++E + G+ +
Sbjct: 30 IDNAAPPEQAWRRRLNSHANLLKEFRVTFMEAIKMVRLGIRMWSYVREEASHGRKAPIDP 89
Query: 78 FRKHHI--TSDQGIPLGGIGAGSIGRSYRGEFQRFKLFHGICDDAPVLANQFSV------ 129
F + ++ QG+PLGG+G+GSI R +RGEF+++++ G+C+ +PV+ANQFSV
Sbjct: 90 FTRESCKPSASQGVPLGGMGSGSISRGFRGEFRQWQIIPGLCEPSPVMANQFSVSLVFPC 149
Query: 130 ----------------------------------------------FVSRPNGEK-FSSV 142
FVSR G K F+SV
Sbjct: 150 TIDSLYTSLICSQIRSRQNHLWVYFPSIYEDYFVSWGNECSLFAEIFVSREGGNKSFASV 209
Query: 143 LCP---RSPGVPKKNTDSGIESWDWNLKGENCTYHALFPRAWTVYDGEPDPELRIVCRQI 199
L P G +K D GI SW WNL G++ TYHALFPRAWTVYDGEPDPEL+I CRQI
Sbjct: 210 LAPGQHEGVGACRKADDQGISSWGWNLNGQHSTYHALFPRAWTVYDGEPDPELKISCRQI 269
Query: 200 SPFIPHNYKESSFPASVFTFTLSNSGQTSADVTLLFTWANSVAGDSGLSGHHFNSKTMTK 259
SPFIPHNY+ESS PA+VF +TL N+G+ A V+LLFTWANS+ G+S LSG H N + +
Sbjct: 270 SPFIPHNYRESSLPAAVFVYTLVNTGKERAKVSLLFTWANSIGGNSHLSGDHVNEPFIAE 329
Query: 260 DGVHGLTLHHR 270
DGV G+ L+H+
Sbjct: 330 DGVSGVLLYHK 340
Score = 169 bits (427), Expect = 9e-39, Method: Compositional matrix adjust.
Identities = 92/183 (50%), Positives = 121/183 (66%), Gaps = 4/183 (2%)
Query: 268 HHRTANGRPPVTFAVAAEETADVHVSECPCFLLSGNSKGITAKDMWNEIKKHGSFD--HL 325
+ RTA PPVTF++AA ET +V VS PCF LS S +TAK MW ++ K G FD +
Sbjct: 404 YFRTAKDNPPVTFSIAACETQNVSVSVLPCFGLSDRS-SVTAKGMWTKMVKDGQFDRENF 462
Query: 326 DNDKTSPSEPGSSIGAAIAASLTIPSGSTRSVTFSLAWDCPEVKFFE-KVYHRRYTKFYG 384
+ + PS PG ++ AA+AAS + +V FSLAW P+VKF + ++RRYTKFYG
Sbjct: 463 SSGPSMPSSPGETLCAAVAASAWVEPHGKCTVAFSLAWSSPKVKFVKGSTFNRRYTKFYG 522
Query: 385 TLGDSAARIAHDAILEHAKWECEIEAWQRPILEDKRFPEWYPITLFNELYYLNAGGTIWT 444
T +A +AHDA+ + +WE EI WQ PIL+D++ PEWY TLFNELY+L AGGTIW
Sbjct: 523 TSERAAVHLAHDALTHYTRWEEEIAKWQDPILKDEKLPEWYKFTLFNELYFLVAGGTIWI 582
Query: 445 DGS 447
D +
Sbjct: 583 DST 585
>gi|374311163|ref|YP_005057593.1| glucosylceramidase [Granulicella mallensis MP5ACTX8]
gi|358753173|gb|AEU36563.1| Glucosylceramidase [Granulicella mallensis MP5ACTX8]
Length = 811
Score = 458 bits (1178), Expect = e-126, Method: Compositional matrix adjust.
Identities = 294/807 (36%), Positives = 404/807 (50%), Gaps = 99/807 (12%)
Query: 87 QGIPLGGIGAGSIGRSYRGEFQRFKLFHGICDDAPVLANQFSVFV-SRPNGEKFSSVLCP 145
QG+P+GG GAG+ RSYRG+F R+ + G+ PV ANQF++F S + S L
Sbjct: 67 QGVPVGGFGAGTFSRSYRGDFARWHIKAGVHKYEPVYANQFAMFQQSEGDAHGTSQALMN 126
Query: 146 RSPGVPKKNTDSGIESWDWNLKGENCTYHALFPRAWTVYDGEPDPELRIVCRQISPFIPH 205
P ++ + SW W+ Y+ LFP+AW Y + P + Q SP +P+
Sbjct: 127 GHP------SNGELSSWQWDYPVGAGDYYGLFPKAWFDYKWDKFPA-HVTLEQFSPVLPN 179
Query: 206 NYKESSFPASVFTFTLSNSGQTSADVTLLFTWAN------------SVAGDSGLSGHHFN 253
NY+ESS+P +V+ + N + V++L +W N A + G + +
Sbjct: 180 NYRESSYPVAVYRWHAENPTNKTVVVSVLLSWTNMSGWFRTYTRDFKGAPNQGNHNEYVS 239
Query: 254 SKTMTKDGVHGLTLHHRTANGRP---PVTFAVAAEETADVHVSECPCFLLSGNSKGITAK 310
K + G+ A P FA+AA E+ V VS + SG+ K +
Sbjct: 240 EKVGDAGTMKGIVFDRNRAGTAPNDWDGQFAIAAMESPGVEVSYQTTYQASGDGKAV--- 296
Query: 311 DMWNEIKKHGSFDHLDNDKTSPSEPGSSIGAAIAASLTIPSGSTRSVTFSLAWDCPEVKF 370
W+ K G L ND S + AIA T+ G + V ++WD ++F
Sbjct: 297 --WSAFSKDG---RLANDGKSWVSDKEKLAGAIALRFTLRPGEKKVVPMVVSWDFSLIQF 351
Query: 371 FE-KVYHRRYTKFYGTLGDSAARIAHDAILEHAKWECEIEAWQRPILEDKRFPEWYPITL 429
E + + RRYT FYGT G +A +IA D +L +W EI+ WQ P + D+ P WY L
Sbjct: 352 GEGRKWDRRYTDFYGTSGQNAWKIARDGLLHATEWSDEIDKWQAPYITDESKPLWYRGML 411
Query: 430 FNELYYLNAGGTIWTDGSPPMQSLATIRERKFSLDTPSKIHTPASSDTALGTRLLENGEE 489
FNELY L GGT W R+ D + TP
Sbjct: 412 FNELYALTDGGTFWG--------------RQAGSDAKA---TPT---------------- 438
Query: 490 NIGQFLYLEGAEYVMYNTYDVHFYSSFALVMLFPKLELSIQRDFAAAVMMHDPGTMKIMS 549
F LE +Y Y T DV FY+S L+ +P ++ + R+FA V P
Sbjct: 439 ----FALLECFDYAYYGTLDVRFYASLPLLKFWPDIDKQVLREFADTVPKEWP------E 488
Query: 550 DGKWV------------ARKCLGAVPHDIGL--DDPWFEINSYNLFNSSRWKDLNSKFVL 595
G WV RK +GAVPHD+G+ DP+ +N +++ WKDLNSKFVL
Sbjct: 489 QGLWVWKTQQVGEPITHKRKKIGAVPHDLGVPEGDPFVAVNEPGWQDTNDWKDLNSKFVL 548
Query: 596 QVYRDFVATGDKN--FARAVWPSVYIAMAYMEQFDKDGDGMIENEGFPDQTYDAWSANGV 653
VYRD+V TG K+ F R WP+V A+ Y+ QFD G G+ EN G+PDQTYD W GV
Sbjct: 549 MVYRDYVLTGRKDTAFLRETWPAVKAAIEYLRQFDHGG-GVPENSGYPDQTYDDWVVRGV 607
Query: 654 SAYCGGLWVAALQAASALANDVGDHASASYFWVRYQKAKAVYDS-LWNGSYFNYDNSDGS 712
SAY GGLW+AAL+A A VGD + + + + K + Y S LWNG YF YD S
Sbjct: 608 SAYSGGLWLAALRAGEETARVVGDTKTTAEYHALFLKGQKTYISQLWNGEYFRYDTSS-E 666
Query: 713 SSTSIQADQLAGQWYARACGLLPIADEAKVKKALTKIYDFNVLKVKGGMCGAMNGMQPDG 772
S IQADQLAGQWYA GL I A KI+D NV+K G GA NGM DG
Sbjct: 667 SKDDIQADQLAGQWYANLTGLGEIVPHTMQVSAAKKIFDVNVMKFGHGEMGAANGMTADG 726
Query: 773 RIDMSGLQAREIWPGVTYGLAASMIQEEMVDMAFQTAAGVYEVAWSEDGLGYSFQTPESW 832
I ++ +A+E+W G T G A ++QE M D A++T G+Y V + + GY F+TPE+W
Sbjct: 727 AI-LTNAEAKEVWVGTTLGYAGLLMQEGMNDEAWKTTRGLYHVIYEDK--GYWFRTPEAW 783
Query: 833 NNNDEYRSLCYMRPLTIWAMQWALTKP 859
+ +R+ YMRP IWA++ +T P
Sbjct: 784 DITGNFRAGMYMRPTAIWALE--MTSP 808
>gi|169642399|gb|AAI60640.1| LOC100145361 protein [Xenopus (Silurana) tropicalis]
Length = 571
Score = 456 bits (1174), Expect = e-125, Method: Compositional matrix adjust.
Identities = 243/580 (41%), Positives = 334/580 (57%), Gaps = 35/580 (6%)
Query: 286 ETADVHVSECPCFLLSGNSKGITAKDMWNEIKKHGSFDHLDNDKTSPSEPGSSIGAAIAA 345
E V VS C F +G +++W ++ + G + S ++ G AA+AA
Sbjct: 2 ERPGVCVSHCTQFDPTG-----LGQEVWKDLLEDGRLNSAPGPSPSTAK-GKKTAAAVAA 55
Query: 346 SLTIPSGSTRSVTFSLAWDCPEVKFF--EKVYHRRYTKFYGTLGDSAARIAHDAILEHAK 403
++P S ++ F L+WD P ++F ++ +HRRYT+F+G GD+A + H +
Sbjct: 56 GCSVPPRSRETLEFCLSWDMPNIRFGAGQREHHRRYTRFFGCQGDAAPALCHYGLTHFVG 115
Query: 404 WECEIEAWQRPILEDKRFPEWYPITLFNELYYLNAGGTIWTDGSPPMQSLATIRERKFSL 463
WE +IEAWQ PIL++ P WY LFNELY++ GGT+W + S + R+
Sbjct: 116 WEQKIEAWQEPILQNGDLPAWYKSALFNELYFMADGGTVWVEVSAADDDDNLM--RRGCK 173
Query: 464 DTPSKIHTPASSDTALGTRLLENGEENIGQFLYLEGAEYVMYNTYDVHFYSSFALVMLFP 523
D P +++ + G+F YLEG EY MYNTYDVHFY+SFAL+ML+P
Sbjct: 174 DLPG----------------MKDVLQEYGRFAYLEGQEYRMYNTYDVHFYASFALIMLWP 217
Query: 524 KLELSIQRDFAAAVMMHDPGTMKIMSDGKWVARKCLGAVPHDIG--LDDPWFEINSYNLF 581
+LE+S+Q D AA+V+ DP K + G K VPHD+G D+PW ++N+Y +
Sbjct: 218 QLEISLQYDMAASVLQEDPELRKYLMSGTVAPVKSRSVVPHDVGDPDDEPWQKLNAYLIH 277
Query: 582 NSSRWKDLNSKFVLQVYRDFVATGDKNFARAVWPSVYIAMAYMEQFDKDGDGMIENEGFP 641
+++ WKDLN KFVLQVYRD+ T D + R +W M +FD+DGDG+IEN GF
Sbjct: 278 DTANWKDLNLKFVLQVYRDYHITKDTTYLRDMWSVCQTVMETSLKFDEDGDGLIENSGFA 337
Query: 642 DQTYDAWSANGVSAYCGGLWVAALQAASALANDVGDHASASYFWVRYQKAKAVYDS-LWN 700
DQTYD W G S+YCGGLW+AA+ +A +GD A+ F + KA ++ LWN
Sbjct: 338 DQTYDDWVMTGPSSYCGGLWLAAVCMMCKMAEVLGDGAAQEKFTDILHRGKAAFEKLLWN 397
Query: 701 GSYFNYDNSDGSSSTSIQADQLAGQWYARACGLLPIADEA----KVKKALTKIYDFNVLK 756
G Y+N+D D S S+ DQ AG W+ RACGL E V AL I++ NV +
Sbjct: 398 GKYYNFDCGDQPYSNSVMVDQCAGNWFLRACGLGAGESEVFPKDHVVSALRTIFELNVKQ 457
Query: 757 VKGGMCGAMNGMQPDGRIDMSGLQAREIWPGVTYGLAASMIQEEMVDMAFQTAAGVYEVA 816
G GA+NGM+PDG D S +Q+ E+W GV YGLAA+MI E +V F TA G Y
Sbjct: 458 FADGQMGAVNGMRPDGTTDTSSVQSDEVWIGVVYGLAATMIHEGLVQEGFATAEGCYRTV 517
Query: 817 WSEDGLGYSFQTPESWNNNDEYRSLCYMRPLTIWAMQWAL 856
W LG SFQTPE++ +RSL YMRPL+IWAMQ AL
Sbjct: 518 WER--LGMSFQTPEAYCEKQVFRSLAYMRPLSIWAMQLAL 555
>gi|195033785|ref|XP_001988762.1| GH11342 [Drosophila grimshawi]
gi|193904762|gb|EDW03629.1| GH11342 [Drosophila grimshawi]
Length = 869
Score = 455 bits (1171), Expect = e-125, Method: Compositional matrix adjust.
Identities = 294/869 (33%), Positives = 425/869 (48%), Gaps = 155/869 (17%)
Query: 103 YRGEFQRFKLFHGICDDAPVLANQFSVFVSRPNG-EKFSSVL--CPRSPGVPKKNTDSG- 158
Y GEF RF++ GI + V ANQF V + G F S+L C K N +
Sbjct: 47 YAGEFCRFQMRPGIYEYNVVQANQFIVTIKDHKGCTIFQSLLSKCSTKSIKAKPNNNDAE 106
Query: 159 -----------------------IESWDWNLKGENCTYHALFPRAWTVYDGEPDPELRIV 195
+ +W N++ C+Y L+PR+WT YD +R++
Sbjct: 107 ADEAEAQKSKCQLPNCSSRSRQPLSAWHSNIEDARCSYTGLYPRSWTEYDLS-HYGVRLI 165
Query: 196 CRQISPFIPHNYKESSFPASVFTFTLSNSGQTSADVTLLFTWANSVAGDSGLSGHHFNSK 255
CRQISP IPH+YKESS P +VF ++ N V++ FT+ N + S+
Sbjct: 166 CRQISPVIPHDYKESSLPCAVFVWSAQNVCDHERKVSITFTFKNGTGNKKQDGEGNAESQ 225
Query: 256 TMTKDGVHGLTLHHRTANGRPPVTFAVAAEETADVHVSECPCFLLSGNSKGITAKDMWNE 315
+++ G+ + R G P ++ +A ++ ++ CP F +GN + +W +
Sbjct: 226 LISEGNAKGVAI--RQTIGDMPCSYNLACRVLPEISITRCPQFDPAGN-----GEQLWMQ 278
Query: 316 IKKHGSFDHLDNDKTSPSEPGSSIGAAIAASLTIPSGSTRSVTFSLAWDCPEVKFFEKV- 374
+K+HG L T+ S IG A+ A L + ++ + F LAWD P+++F K+
Sbjct: 279 LKEHG---QLSEQPTAESLKTKDIGVAVCAHLALKPKTSHDLEFVLAWDMPKIQFPRKLQ 335
Query: 375 -YHRRYTKFYGTLGDSAARIAHDAILEHAKWECEIEAWQRPILEDKRFPEWYPITLFNEL 433
+ R YTK++ G+S RI A+ +A WE I+AWQRPIL D P+WY +FN+L
Sbjct: 336 THTRYYTKYFDDSGESGPRICEYALKHYASWERLIDAWQRPILNDDGLPDWYKCAIFNQL 395
Query: 434 YYLNAGGTIWTDGSPPMQSLATIRERKFSLDTPSKIHTPASSDTALGTRL-LENGEENIG 492
Y+++ GGT+W DT+LG L ++ G
Sbjct: 396 YFISDGGTLWL-----------------------------KCDTSLGKPLAFDDPRLAYG 426
Query: 493 QFLYLEGAEYVMYNTYDVHFYSSFALVMLFPKLELSIQRDFAAAVMMHDPGTMKIMSDGK 552
+F YLEG EY MYNTYDVHFY+S AL L+P L++S+Q DF A+ T K++ DGK
Sbjct: 427 RFGYLEGHEYRMYNTYDVHFYASPALAHLWPNLQVSLQYDFKDAIAAELSDTRKMLYDGK 486
Query: 553 WVARKCLGAVPHDIGLDD--PWFEINSYNLFNSSRWKDLNSKFVLQVYRDFVA------- 603
+ RK VPHD+G D P+ IN YN+ + + WKDLN+KFVLQVYRD+
Sbjct: 487 VMPRKVKNCVPHDLGDPDEEPFTLINCYNIHDVNEWKDLNTKFVLQVYRDYYVLNELAQA 546
Query: 604 -------------------------------TGDKNFARAVWPSVYI-----------AM 621
+ D+ S+YI AM
Sbjct: 547 QADNASKFSSIEFIDKESLYELYTQDNKRKNSADEKQQNRKSASMYINETNGKVYLMDAM 606
Query: 622 AYMEQFDKDGDGMIE--------NEGFPDQT------YDAWSANGVSAYCGGLWVAALQA 667
+Y++ ++E N+G + T YD+W +G SAYC GLW+AALQ
Sbjct: 607 SYLKAMYGACKAIMERTIEYDKDNDGLIENTKMPDQTYDSWVMDGPSAYCAGLWLAALQT 666
Query: 668 ASALANDVGDHASASYFWVRYQK-----AKAVYDSLWNGSYFNYDNSDGSSSTSIQADQL 722
S +A + +RYQ ++ + LWNGSY+ +D S T I ADQL
Sbjct: 667 MSVMATLLDQPNDC----LRYQDILERGKNSLEEKLWNGSYYRFDQSHNHRDT-IMADQL 721
Query: 723 AGQWYARACGL-LPIADEAKVKKALTKIYDFNVLKVKGGMCGAMNGM-----QPD--GRI 774
G WY ++CG I + V+ AL +IYD NV+ G GA NG +P G +
Sbjct: 722 CGHWYLKSCGFDYEIYPKENVRTALKRIYDNNVMGFHEGNIGAANGFIANANEPSKPGHV 781
Query: 775 DMSGLQAREIWPGVTYGLAASMIQEEMVDMAFQTAAGVYEVAWSEDGLGYSFQTPESWNN 834
D S +QA E+WPGV Y LAA+MIQE M + AFQTA G+Y+ +G +F+TPE+
Sbjct: 782 DNSNIQAEEVWPGVVYALAATMIQEGMFEEAFQTAGGMYKTI--SQRIGMNFETPEALYG 839
Query: 835 NDEYRSLCYMRPLTIWAMQWALTKPKLSR 863
YRS+ YMRPL+IW+MQ A + + R
Sbjct: 840 EKRYRSIGYMRPLSIWSMQVAWERRRALR 868
>gi|308485740|ref|XP_003105068.1| hypothetical protein CRE_20734 [Caenorhabditis remanei]
gi|308257013|gb|EFP00966.1| hypothetical protein CRE_20734 [Caenorhabditis remanei]
Length = 933
Score = 454 bits (1169), Expect = e-125, Method: Compositional matrix adjust.
Identities = 290/865 (33%), Positives = 448/865 (51%), Gaps = 85/865 (9%)
Query: 37 KVPTGFTLSFQEFRHLAHIGFRLYRYSKQEEAKGKVPVFNVFRKHHITSDQGIPLGGIGA 96
+VP +Q + L + + K K K+ + N F+ G+P GGIG
Sbjct: 48 RVPFNRPTPYQIYEALPFVRRYFMYWMKHTFDKEKLFI-NTFQPLKHKPYYGVPCGGIGC 106
Query: 97 GSIGRSYRGEFQRFKLFHGICDDAP--VLANQFSVFVSRPNGEKFSSVLCPRSPGVPKKN 154
G+IGR +RG F +F L G+ + V A+QF + V + + VL
Sbjct: 107 GAIGRDFRGGFCKFSLRPGLVEHKVDVVAADQFIISVRENDTCIYQKVLS----AADVHR 162
Query: 155 TDSGIESWDWNLKGENCTYHALFPRAWTVYDGEPDPELRIVCRQISPFIPHNYKESSFPA 214
+ + + +WD+ +N Y LFPR+WT + P+ + + RQ+SP +PHNY+++++P
Sbjct: 163 SGNQLSTWDFTFPKKNVNYRGLFPRSWTTFRI-PELGITVTIRQVSPVLPHNYEDTTYPV 221
Query: 215 SVFTFTLSNSGQTSA-DVTLLFTWANSVAG-----DSGLSGHHFNSKTMT-------KDG 261
+F + N+ + +V++ FT+ N +S S F + + ++
Sbjct: 222 CLFVIDVENTTTSKEYEVSIAFTFRNGTGNRRWERESECSAVKFETPGPSDSESNPGQEK 281
Query: 262 VHGLTLHHRTANGRPPVTFAVAAEETADVHVSECPCFLLSGNSKGITAKDMWNEIKKHGS 321
V G++L+H ++ P T+ +A +S C F + N WN +++ G
Sbjct: 282 VTGISLYHTISSM--PCTYGLATTHKNGSILSVCERFDPARNGSAF-----WNHLQQTGD 334
Query: 322 ----------FDHLDN--DKTSPSEPGSSIGAAIAASLTIPSGSTRSVTFSLAWDCPEVK 369
F H N + + + A+ + G+ ++ ++L+WD P+V
Sbjct: 335 VPSCVRILHVFFHKINSFQEEECAVNAREMAVAVCNRFRLSPGTKKTHDYALSWDMPKVH 394
Query: 370 F--FEKVYHRRYTKFY--GTLGDSAARIAHDAILEHAKWECEIEAWQRPILEDKRFPEWY 425
F + YHRRYT+F+ G +A + A+ KWE EIE WQ PIL ++ P+WY
Sbjct: 395 FGSVARSYHRRYTRFFDAAEAGTAADSLCVRALHLKNKWEEEIEKWQLPILNHEKLPDWY 454
Query: 426 PITLFNELYYLNAGGTIWTDGSPPMQSLATIRERKFSLDTPSKIHTPASSDTALGTRLLE 485
+FNELY++ GGT+W F D H S+ +
Sbjct: 455 KSAIFNELYFITDGGTVW-----------------FEFDEQWAEHEMHLSEYT------K 491
Query: 486 NGEENIGQFLYLEGAEYVMYNTYDVHFYSSFALVMLFPKLELSIQRDFAAAVMMHDPGTM 545
++IG+F YLE EY M NTYDVHFY+S+AL L+P+LEL++Q +F V
Sbjct: 492 EKMKSIGRFGYLESWEYRMVNTYDVHFYASYALAELWPQLELTVQSEFTDQVYHSIEKPT 551
Query: 546 KIMSDGKWVARKCLGAVPHDIG--LDDPWFEINSYNLFNSSRWKDLNSKFVLQVYRDFVA 603
+ +G W K VPHD+G D+PW N+Y + ++ +WKDLN K+VL +RD+V
Sbjct: 552 RFHMEGDWANVKTASRVPHDLGNPADEPWIATNAYVMHDTGKWKDLNMKYVLTSWRDYVV 611
Query: 604 TGDKN--FARAVWPSV-YIAMAYMEQFDKDGDGMIENEGFPDQTYDAWSANGVSAYCGGL 660
+++ F WP+V I + +E +D++GDGMIEN G DQTYDAW GVSAYCG L
Sbjct: 612 LSEEHQEFLFHTWPAVKMIMLEALENWDQNGDGMIENFGKADQTYDAWQMEGVSAYCGSL 671
Query: 661 WVAALQAASALANDVGDHASASYFWVRYQKAKAVY-DSLWNGSYFNYDNSDGSSSTSIQA 719
W+A+L+ + +A +G+ A+ + F V KAK V+ D+LW G+YF + S T + A
Sbjct: 672 WLASLRVSIEMARLMGEEATENLFRVTLNKAKKVFIDTLWTGTYFRFCERSRSRET-VMA 730
Query: 720 DQLAGQWYARACG------LLPIADEAKVKKALTKIYDFNVLKVKGGMCGAMNGMQPDGR 773
DQL G W+ ++ LLP V+ AL IY NV + G GA+NGM+P+G
Sbjct: 731 DQLCGYWFLQSVSPEMVDDLLP---NHMVRSALDTIYRLNVCRFGNGQMGAVNGMKPNGV 787
Query: 774 IDMSGLQAREIWPGVTYGLAASMIQEEMVDMAFQTAAGVYEVAWSEDGLGYSFQTPESWN 833
+D +QA E+W GVTY +A+ +IQ+ V+ AF TA+G Y + + GL Y QTPE+
Sbjct: 788 VDREYIQADEMWTGVTYAVASLLIQQGEVEKAFHTASGSYLTCFEQTGLQY--QTPEALY 845
Query: 834 NNDEYRSLCYMRPLTIWAMQWALTK 858
+ YR++ YMRPL+IWAMQW+L K
Sbjct: 846 ESKFYRAIGYMRPLSIWAMQWSLKK 870
>gi|328709489|ref|XP_001943760.2| PREDICTED: non-lysosomal glucosylceramidase-like [Acyrthosiphon
pisum]
Length = 776
Score = 453 bits (1165), Expect = e-124, Method: Compositional matrix adjust.
Identities = 299/855 (34%), Positives = 426/855 (49%), Gaps = 111/855 (12%)
Query: 19 DPGQPPQLTWQRKLDSNVKVPT--GFTLSFQEFRHLAHIGFRLYRYSKQEEAKGKVPVFN 76
D G P+ W+ D P + ++ L + R Y KG+ P+ +
Sbjct: 10 DKGGVPEYGWKVDFDHQFPDPRTPSYMPRIKQILQLLPLIIRYVFYMLLCYLKGQKPIMD 69
Query: 77 VFRKHHITSDQGIPLGGIGAGSIGRSYRGEFQRFKLFHGICDDAPVLANQFSVFVSRP-N 135
G+PLGGIG G+IGR ++GEF R+++ G+ + V ANQF V + N
Sbjct: 70 FLFPIKSKHMYGVPLGGIGGGTIGRGFKGEFCRYQVVPGMYEYESVQANQFIVNIQNSEN 129
Query: 136 GEKFSSVLCPRSPGVPKKNTDSGIESWDWNLKGENCTYHALFPRAWTVYDGEPDPELRIV 195
+++VL K + + +W W+ C Y L+PR+WT Y P+ +R++
Sbjct: 130 ITVYNNVLSCL------KCNNKTLYNWKWDFPSNQCKYTGLYPRSWTEY-LIPEFHIRLI 182
Query: 196 CRQISPFIPHNYKESSFPASVFTFTLSNSGQTSADVTLLFTWANSVAGDSGLSGHHFNSK 255
C+QISP IPHNYK+S P VF +T+ N+ ++L FT+A+ +G+S +H +
Sbjct: 183 CKQISPIIPHNYKDSCIPGGVFIWTVENNSDQQYSISLTFTFASG----NGISSYHSDEP 238
Query: 256 T-----MTKDG--VHGLTLHHRTANGRPPVTFAVAAEETADVHVSECPCFLLSGNSKGIT 308
+ + KDG V G +H+ E C + G
Sbjct: 239 SCEPFNLNKDGTEVAGSLIHNC---------------------FKEMKCTYVIGGL---- 273
Query: 309 AKDMWNEIKKHGSFDHLDNDKTSPSEPGSSIGAAIAASLTIPSGSTRSVTFSLAWDCPEV 368
IK+ +F L+ D G I+ SL + G R + +
Sbjct: 274 -------IKEGAAFSQLNFDPKGN-------GMTISKSLHL-FGELRDMQLGPS------ 312
Query: 369 KFFEKVYHRRYTKFYGTLGDSAARIAHDAILEHAKWECEIEAWQRPILEDKRFPEWYPIT 428
+Y + YTKF+ G + I H ++ KWE EIE WQ PIL + P+WY
Sbjct: 313 ----NIYSKYYTKFFDKNGTAGPEIMHYSMQNFKKWEQEIEKWQNPILNNPLLPDWYKSA 368
Query: 429 LFNELYYLNAGGTIWTDGSPPMQSLATIRERKFSLDTPSKIHTPASSDTALGTRLLENGE 488
+FNE+Y++ GGTIW ++E LD + +
Sbjct: 369 IFNEVYFVADGGTIW---------FLPLKEDYERLD-------------------VTDPR 400
Query: 489 ENIGQFLYLEGAEYVMYNTYDVHFYSSFALVMLFPKLELSIQRDFAAAVMMHDPGTMKIM 548
G F YLEG EY MYNTYDVHFY+SF L L+PKL+ IQ F + T K +
Sbjct: 401 YQYGYFGYLEGHEYRMYNTYDVHFYASFTLAKLWPKLQACIQYKFRDTIDNALNITRKHL 460
Query: 549 SDGKWVARKCLGAVPHDIG--LDDPWFEINSYNLFNSSRWKDLNSKFVLQVYRDFVATGD 606
S G RK +VPHD+G +DP+ +N+Y + N S W+DLN KFVLQ YRD+ GD
Sbjct: 461 SCGTRGLRKYKYSVPHDLGDPEEDPFNLVNAYLVHNVSEWRDLNLKFVLQCYRDYTMFGD 520
Query: 607 KNFARAVWPSVYIAMAYMEQFDKDGDGMIENEGFPDQTYDAWSANGVSAYCGGLWVAALQ 666
+ ++P + M +D DGDGMIEN G DQT+D W G+SAYCGGLW+AAL
Sbjct: 521 NKYLADMYPQIKNVMEKSLSWDTDGDGMIENSGTADQTFDTWIMTGISAYCGGLWLAALY 580
Query: 667 AASALANDVGDHASASYFWVRYQKAKAVYD-SLWNGSYFNYDNSDGSSSTSIQADQLAGQ 725
++N + + + KAK ++ +LWNG YFN+D+S S SI ADQL G+
Sbjct: 581 CTVQISNILKQDDISEKYSTILNKAKESFNAALWNGGYFNFDSSGQHHSKSIMADQLCGE 640
Query: 726 WYARACG----LLPIADEAKVKKALTKIYDFNVLKVKGGMCGAMNGMQPDGRIDMSGLQA 781
WY + CG +LP+ +VKKAL IY NVL G GA+NGM PDG D LQ+
Sbjct: 641 WYLKCCGVNEEVLPV---DRVKKALHTIYKMNVLGFNDGKMGAVNGMMPDGNPDTFSLQS 697
Query: 782 REIWPGVTYGLAASMIQEEMVDMAFQTAAGVYEVAWSEDGLGYSFQTPESWNNNDEYRSL 841
E+W GVTY L++ MI + D F TA G+Y ++ G+ Y QTPE+ + + YRS+
Sbjct: 698 EEVWTGVTYALSSLMIAHGLRDEGFNTAKGIYNTVYNNIGMAY--QTPEAIYSKNAYRSV 755
Query: 842 CYMRPLTIWAMQWAL 856
YMRPL+IW++Q AL
Sbjct: 756 GYMRPLSIWSIQTAL 770
>gi|297741934|emb|CBI33369.3| unnamed protein product [Vitis vinifera]
Length = 508
Score = 452 bits (1164), Expect = e-124, Method: Compositional matrix adjust.
Identities = 219/440 (49%), Positives = 300/440 (68%), Gaps = 7/440 (1%)
Query: 19 DPGQPPQLTWQRKLDSNVKVPTGFTLSFQEFRHLAHIGFRLYRYSKQEEAKGKVPVFNVF 78
D PP+ W+R+L+S+ + F+++F E + +G RL+ Y ++E ++G+ + F
Sbjct: 31 DSAAPPEQAWRRRLNSHANILKEFSVTFTEAIKMIRLGIRLWSYIREEASQGRKAPIDPF 90
Query: 79 RKHHI--TSDQGIPLGGIGAGSIGRSYRGEFQRFKLFHGICDDAPVLANQFSVFVSRPNG 136
+ ++ QG+PLGG+G+GSI R +RGEF+ +++ G CD +P++ANQFS+F+SR G
Sbjct: 91 TRETCKPSASQGVPLGGMGSGSISRGFRGEFRHWQIVPGTCDASPIMANQFSIFISREGG 150
Query: 137 -EKFSSVLCPRSPGVPKKNTDSGIESWDWNLKGENCTYHALFPRAWTVYDGEPDPELRIV 195
+K++SVL P K+ D GI SW WNL G++ TYHALFPRAWT+YDGEPDPEL++
Sbjct: 151 NKKYASVLAPGQHEGLGKSGDQGISSWGWNLSGQHSTYHALFPRAWTIYDGEPDPELKVS 210
Query: 196 CRQISPFIPHNYKESSFPASVFTFTLSNSGQTSADVTLLFTWANSVAGDSGLSGHHFNSK 255
CRQISPFIPHNY++SS P +VF +TL N+G+ A V+LLFTWANS+ G S LSG H N
Sbjct: 211 CRQISPFIPHNYRDSSLPTAVFVYTLVNTGKERAKVSLLFTWANSIGGISHLSGDHVNEP 270
Query: 256 TMTKDGVHGLTLHHRTANGRPPVTFAVAAEETADVHVSECPCFLLSGNSKGITAKDMWNE 315
+ +DGV G+ LHH+TA PPVTFA+AA ET +V V+ P F LS S ITAKDMW +
Sbjct: 271 FIGEDGVSGVLLHHKTAKENPPVTFAIAACETQNVSVTVLPSFGLSEGSH-ITAKDMWGK 329
Query: 316 IKKHGSFDHLD--NDKTSPSEPGSSIGAAIAASLTIPSGSTRSVTFSLAWDCPEVKFFE- 372
+ + G FD + + ++ PS PG ++ AA++AS + +V F+LAW P+VKF +
Sbjct: 330 MVQDGQFDRENCYSGRSMPSSPGETLCAAVSASAWVEPHGKCTVAFALAWSSPKVKFLKG 389
Query: 373 KVYHRRYTKFYGTLGDSAARIAHDAILEHAKWECEIEAWQRPILEDKRFPEWYPITLFNE 432
YHRRYTK+YGT +A I HDA+ + +WE EIE WQ PIL D R PEWY TLFNE
Sbjct: 390 SSYHRRYTKYYGTSERAALNIVHDALTNYKQWEEEIEKWQSPILRDDRLPEWYKFTLFNE 449
Query: 433 LYYLNAGGTIWTDGSPPMQS 452
LY+L AGGT+W D S P S
Sbjct: 450 LYFLVAGGTVWIDSSLPATS 469
>gi|392918721|ref|NP_504047.2| Protein R08F11.1 [Caenorhabditis elegans]
gi|387910741|emb|CCD72306.2| Protein R08F11.1 [Caenorhabditis elegans]
Length = 830
Score = 446 bits (1146), Expect = e-122, Method: Compositional matrix adjust.
Identities = 282/817 (34%), Positives = 417/817 (51%), Gaps = 84/817 (10%)
Query: 72 VPVFNVFRKHHITSDQGIPLGGIGAGSIGRSYRGEFQRFKLFHGICDDAPVL-ANQFSVF 130
+ +FNVFR T G+PLGGIG GSIG +RG F RF + GI + NQF V
Sbjct: 64 IDIFNVFRHFTYT---GVPLGGIGCGSIGTDFRGGFNRFSIIPGIKEQTETQKCNQFIVT 120
Query: 131 V-SRPNGEK-FSSVL-CPRSPGVPKKNTDSGIESWDWNLKGENCTYHALFPRAWTVYDGE 187
V S+ E + S+L C P + + WD + E+ Y LFPRAW +
Sbjct: 121 VHSKKTFELIYQSILSCAEFPA-------TVLPKWDTTIPAEDVRYRGLFPRAWQEFR-L 172
Query: 188 PDPELRIVCRQISPFIPHNYKESSFPASVFTFTLSNSGQTSADVTLLFTWANSVAGDSGL 247
+ +V +SP IP +Y +SS P + F F + N +V++ ++ N
Sbjct: 173 GSSGVTVVVEHLSPVIPGDYSDSSLPLANFEFHVFNDSYEEVEVSITMSFRNGTGNRKWN 232
Query: 248 SGHHFNSKTMTKDGVHGLTLHHRTANGRPPVTFAVAAEETADVHVSECPCFLLSGNSKGI 307
+ S+ + KD + TL H T G PVT+A+ EE V+ C L N G
Sbjct: 233 DENLCQSQKIQKDTMVVRTLAH-TVKGM-PVTYAIGTEEKNGSKVTTC---LFDPNGTG- 286
Query: 308 TAKDMWNEIKKHG---SFDHLDNDKTSPSEPGSSIGAAIAASLTIPSGSTRSVTFSLAWD 364
+W++++ +G S+DHL PS+P +G A+ +S +P + FSL W
Sbjct: 287 --GRLWSDLEAYGHLSSYDHL------PSKP-KELGIAVCSSFFVPPDGAHNTQFSLTWY 337
Query: 365 CPEVKF--FEKVYHRRYTKFYG--TLGDSAARIAHDAILEHAKWECEIEAWQRPILEDKR 420
P+V F E+ Y+RRY +F+ + A I + + W+ +IE WQ P+L D++
Sbjct: 338 MPQVHFGTAERFYNRRYCRFFNGPDADEVTAAICRHGLQNFSTWQKKIEDWQAPVLNDQK 397
Query: 421 FPEWYPITLFNELYYLNAGGTIWTDGSPPMQSLATIRERKFSLDTPSKIHTPASSDTALG 480
PEWY +FNELYY+ G T+W + P + ++ +L
Sbjct: 398 LPEWYKSAIFNELYYIVDGSTVWFEYDPDWK-----------------------TEESLM 434
Query: 481 TRLLENGEENIGQFLYLEGAEYVMYNTYDVHFYSSFALVMLFPKLELSIQRDFAAAVMMH 540
+ E + G+F Y+E EY M NTYDVHFYSS+A++ +P++E+S+Q DFA +
Sbjct: 435 SEQTEKQFKEYGRFGYMESWEYFMINTYDVHFYSSWAILKNWPQIEMSMQLDFADQMDRV 494
Query: 541 DPGTMKIMSDGKWVARKCLGAVPHDIG-------------LDDPWFEINSYNLFNSSRWK 587
D ++DG+ + K + +PHD+G + DPW N+Y L ++ RWK
Sbjct: 495 DNNIATSLADGEQMTIKSVDRIPHDMGHPTKGCIQEKRLSVADPWIHTNAYILHDTGRWK 554
Query: 588 DLNSKFVLQVYRD--FVATGDKN---FARAVWPSVYIAMAYMEQFDKDGDGMIENEGFPD 642
DLN KFV+ YRD + G + + I +E +DKD DGMIEN+GF D
Sbjct: 555 DLNLKFVISCYRDWKLIELGSEKGQVLEFFLGKCTKIVDGALECWDKDNDGMIENDGFAD 614
Query: 643 QTYDAWSANGVSAYCGGLWVAALQAASALANDVGDHASASYFWVRYQKAKAVY-DSLWNG 701
QTYD W G SAYCG LW+AAL + + G ++ + + A Y LWNG
Sbjct: 615 QTYDVWKMTGTSAYCGSLWIAALSSYIEMLKQSG--LPTKHYEEKLEMAYDAYIGKLWNG 672
Query: 702 SYFNYDNSDGSSSTSIQADQLAGQWYARACGLLPIADEAKVKKALTKIYDFNVLKVKGGM 761
++F +D +S + ADQL G W A + K+K AL I+ +NV G
Sbjct: 673 TFFKFDELPENSKI-VMADQLCGFWAMTAMDEPVQISKDKMKSALDTIFKYNVQMYNNGR 731
Query: 762 CGAMNGMQPDGRIDMSGLQAREIWPGVTYGLAASMIQEEMVDMAFQTAAGVYEVAWSEDG 821
CGA+NG R+D S +Q+ E+W G+TY L+A MI++ M + AF+T+ G++E W
Sbjct: 732 CGAVNGYLTSERVDGSSIQSEEVWAGITYALSAMMIEKGMDEQAFKTSEGLFESIWHRFP 791
Query: 822 LGYSFQTPESWNNNDEYRSLCYMRPLTIWAMQWALTK 858
L Y QTPE+ ++ YR+L YMRPL+IWA+Q AL +
Sbjct: 792 LQY--QTPEAITSDGMYRALGYMRPLSIWAIQHALDR 826
>gi|320105716|ref|YP_004181306.1| glucosylceramidase [Terriglobus saanensis SP1PR4]
gi|319924237|gb|ADV81312.1| Glucosylceramidase [Terriglobus saanensis SP1PR4]
Length = 794
Score = 442 bits (1138), Expect = e-121, Method: Compositional matrix adjust.
Identities = 285/798 (35%), Positives = 403/798 (50%), Gaps = 96/798 (12%)
Query: 87 QGIPLGGIGAGSIGRSYRGEFQRFKLFHGICDDAPVLANQFSVFVSRPNGEK--FSSVLC 144
QG P+GG GAG+ RSYRG+F R+ + G+ AN+F++F + GE + VL
Sbjct: 56 QGAPVGGFGAGTFSRSYRGDFARWHIKAGVHKYDVSYANEFAMF-QQVEGEATGVAKVLL 114
Query: 145 PRSPGVPKKNTDSGIESWDWNLKGENCTYHALFPRAWTVYDGEPDPELRIVCRQISPFIP 204
P + SW W+ Y AL+P++W Y E P + Q SP IP
Sbjct: 115 NDHP------KQGELASWSWDYPVGAGEYAALYPKSWYDYRWEKFPA-HVTLEQYSPVIP 167
Query: 205 HNYKESSFPASVFTFTLSNSGQTSADVTLLFTWANSVAGDSGLSGHHF-------NSKTM 257
NY+ESS+P +V+ + N + V++L +W N ++G H F N T
Sbjct: 168 DNYRESSYPTAVYRWHAENPTDKTVTVSVLLSWTN-MSGWFRTFTHDFKGAPNQGNYDTY 226
Query: 258 TKD-GVHGLTLHHRTANGRPPV---TFAVAAEETADVHVSECPCFLLSGNSKGITAKDMW 313
+ G+ G+ A P FA+AA ET V V+ F +G+ K + W
Sbjct: 227 RNEAGMKGIVFDRSRAGVAPNEWDGQFAIAAMETPGVEVTYQTSFQAAGDGKAV-----W 281
Query: 314 NEIKKHGSFDHLDNDKTSPSEPGSSIGAAIAASLTIPSGSTRSVTFSLAWDCPEVKFFE- 372
+ + G L ND TS G + AIA T+ G + V +AWD P V+F E
Sbjct: 282 SHFARDG---RLPNDDTSWVSDGEKLAGAIAVRFTLKPGEKKVVPMVIAWDLPVVQFGEG 338
Query: 373 KVYHRRYTKFYGTLGDSAARIAHDAILEHAKWECEIEAWQRPILEDKRFPEWYPITLFNE 432
+ ++R+YT FYGT G +A +IA + +L +W I+AWQ+P + D+ P WY LFNE
Sbjct: 339 RQWNRKYTDFYGTDGKNAWKIAREGLLHAQEWSDAIDAWQKPTITDESKPLWYRGMLFNE 398
Query: 433 LYYLNAGGTIWTDGSPPMQSLATIRERKFSLDTPSKIHTPASSDTALGTRLLENGEENIG 492
LY L GGT W P SD L
Sbjct: 399 LYALTDGGTFWG--------------------------RPVGSDKKL-----------PA 421
Query: 493 QFLYLEGAEYVMYNTYDVHFYSSFALVMLFPKLELSIQRDFAAAVMMHDPGTMKIMSDGK 552
F LE +Y Y T DV FY+S L+ +P+++ + R+FA V + G
Sbjct: 422 SFALLECFDYAYYGTLDVRFYASLPLLKFWPEIDKRVLREFAETV------EKEWSEKGL 475
Query: 553 WV------------ARKCLGAVPHDIGL--DDPWFEINSYNLFNSSRWKDLNSKFVLQVY 598
WV RK +GAVPHD+G+ DP++ +N +++ WKDLNSKFVL VY
Sbjct: 476 WVWKTAETGSPVLHKRKKVGAVPHDLGVPEGDPFYVVNEPGWQDTNDWKDLNSKFVLMVY 535
Query: 599 RDFVATG--DKNFARAVWPSVYIAMAYMEQFDKDGDGMIENEGFPDQTYDAWSANGVSAY 656
RD+V TG DK F + WP+V A+ Y+ QFD G G+ EN G+PDQTYD+W GVSAY
Sbjct: 536 RDYVLTGRTDKAFLQETWPAVKAAIQYLRQFDH-GRGVPENSGYPDQTYDSWVVRGVSAY 594
Query: 657 CGGLWVAALQAASALANDVGDHASASYFWVRYQKAKAVY-DSLWNGSYFNYDNSDGSSST 715
GGLW+ AL+A A +GD + + + + A+ Y +LWNG YF YD +D S+
Sbjct: 595 SGGLWLGALRAGEETARVLGDWTAQAEYQALFAMAQKTYVATLWNGEYFLYD-TDASNKG 653
Query: 716 SIQADQLAGQWYARACGLLPIADEAKVKKALTKIYDFNVLKVKGGMCGAMNGMQPDGRID 775
IQADQLAGQWYA GL + A + L KIY FNV + G GA+NGM DG +
Sbjct: 654 VIQADQLAGQWYAHMTGLGDLVPRAMQRSVLKKIYAFNVNQFGDGNLGAVNGMNADGTV- 712
Query: 776 MSGLQAREIWPGVTYGLAASMIQEEMVDMAFQTAAGVYEVAWSEDGLGYSFQTPESWNNN 835
+ ++ARE+W G T G AA + + + ++ G++ V + GY F+TPE+W+
Sbjct: 713 VDNVEAREVWAGTTLGYAALLKSLGIEEESYHVVHGLFHVIYESK--GYWFRTPEAWDVT 770
Query: 836 DEYRSLCYMRPLTIWAMQ 853
+R+ YMRP +W M+
Sbjct: 771 GNFRASMYMRPAGLWGME 788
>gi|341888271|gb|EGT44206.1| hypothetical protein CAEBREN_07747 [Caenorhabditis brenneri]
Length = 820
Score = 440 bits (1131), Expect = e-120, Method: Compositional matrix adjust.
Identities = 284/803 (35%), Positives = 406/803 (50%), Gaps = 66/803 (8%)
Query: 72 VPVFNVFRKHHITSDQGIPLGGIGAGSIGRSYRGEFQRFKLFHGICDDAPVL-ANQF--S 128
+ +FNVFR T G+PLGGIG+GSIG +RG F RF + GI + NQF S
Sbjct: 64 IDIFNVFRHFTFT---GVPLGGIGSGSIGTDFRGGFNRFSIIPGIKEQTDTQKCNQFIAS 120
Query: 129 VFVSRPNGEKFSSVL-CPRSPGVPKKNTDSGIESWDWNLKGENCTYHALFPRAWTVYDGE 187
V + + S+L C P + + WD + E+ Y LFPRAW +
Sbjct: 121 VHSKKTFELIYQSILSCAEFPS-------TVLPKWDSTIPAEDVRYRGLFPRAWQEFR-L 172
Query: 188 PDPELRIVCRQISPFIPHNYKESSFPASVFTFTLSNSGQTSADVTLLFTWANSVAGDSGL 247
+ +V +SP IP++Y +SS P + F F + N +V++ + N
Sbjct: 173 GSSGITVVIEHLSPVIPNDYSDSSLPLANFDFHVFNDSPEEVEVSITMAFRNGTGNRKWN 232
Query: 248 SGHHFNSKTMTKDGVHGLTLHHRTANGRPPVTFAVAAEETADVHVSECPCFLLSGNSKGI 307
H S+ + K+ TL H T G PVT+A+ EE V+ C L N G
Sbjct: 233 DEHLCTSQKVQKETQVVRTLSH-TVKGM-PVTYAIGTEEKNGAKVTTC---LFDPNGTG- 286
Query: 308 TAKDMWNEIKKHGSFDHLDNDKTSPSEPGSSIGAAIAASLTIPSGSTRSVTFSLAWDCPE 367
+W++++ +G DN P E +G A+ +S +P + + FSL+W PE
Sbjct: 287 --GRLWSDLEAYGHLSSYDNLPPKPKE----LGIAVCSSFFVPPDGSHTTQFSLSWYMPE 340
Query: 368 VKF--FEKVYHRRYTKFYG--TLGDSAARIAHDAILEHAKWECEIEAWQRPILEDKRFPE 423
V F E+ Y+RRY +F+ + AA I + KW+ IE WQ PI+ +++ PE
Sbjct: 341 VHFGTAERFYNRRYCRFFNGPDADEVAAAICRHGLQNIEKWQEAIEKWQNPIITNQKLPE 400
Query: 424 WYPITLFNELYYLNAGGTIWTDGSPPMQSLATIRERKFSLDTPSKIHTPASSDTALGTRL 483
WY +FNELYY+ G T+W + P + ++ L +
Sbjct: 401 WYRSAIFNELYYIVDGSTVWFEYDPDWR-----------------------TEENLISEQ 437
Query: 484 LENGEENIGQFLYLEGAEYVMYNTYDVHFYSSFALVMLFPKLELSIQRDFAAAVMMHDPG 543
E G+F Y+E EY M NTYDVHFYSS+A++ +P++E+S+Q DFA V D G
Sbjct: 438 TEKQFREYGRFGYMESWEYFMLNTYDVHFYSSWAILKNWPEIEMSMQLDFADQVDRIDNG 497
Query: 544 TMKIMSDGKWVARKCLGAVPHDIG--LDDPWFEINSYNLFNSSRWKDLNSKFVLQVYRDF 601
T ++DG + K +PHD+G + DPW N+Y L ++ WKDLN KFV+ YRD+
Sbjct: 498 TATSLADGDQMLIKSYDRIPHDMGHPMADPWLHTNAYILHDTGSWKDLNLKFVISCYRDY 557
Query: 602 VATG------DKNFARAVWPSVYIAMAYMEQFDKDGDGMIENEGFPDQTYDAWSANGVSA 655
D+ + I +E +DKD DGMIEN+GF DQTYD W G SA
Sbjct: 558 KMIAKDREDKDQLLEFFLGKCTKIVEDALESWDKDKDGMIENDGFADQTYDVWKMTGTSA 617
Query: 656 YCGGLWVAALQAASALANDVGDHASASYFWVRYQKAKAVYDSLWNGSYFNYDNSDGSSST 715
YCG LW+AAL + G S Y A LWNG +F +D +S
Sbjct: 618 YCGSLWLAALTCYIEMLKQAGS-PSKHYEGKLLNAYDAYTTKLWNGRFFKFDELPDNSKI 676
Query: 716 SIQADQLAGQWYARACGLLPIADEAKVKKALTKIYDFNVLKVKGGMCGAMNGMQPDGRID 775
+ ADQL G W A E K+K AL I+ +NV G CGA+NG R+D
Sbjct: 677 -VMADQLCGFWALTAMDEPVQVSEEKMKLALETIFKYNVQMYNDGKCGAVNGFLTSERVD 735
Query: 776 MSGLQAREIWPGVTYGLAASMIQEEMVDMAFQTAAGVYEVAWSEDGLGYSFQTPESWNNN 835
S +Q+ E+W G+TY L+A MI++ M +MAF+T+ G++E W+ L Y QTPE+ ++
Sbjct: 736 GSSIQSEEVWAGITYALSAMMIEKNMDEMAFKTSEGLFESIWNRFPLQY--QTPEAITSD 793
Query: 836 DEYRSLCYMRPLTIWAMQWALTK 858
YR+L YMRPL+IWA+Q AL +
Sbjct: 794 GMYRALGYMRPLSIWAIQHALDR 816
>gi|268555576|ref|XP_002635777.1| Hypothetical protein CBG10433 [Caenorhabditis briggsae]
Length = 821
Score = 437 bits (1124), Expect = e-119, Method: Compositional matrix adjust.
Identities = 286/808 (35%), Positives = 413/808 (51%), Gaps = 72/808 (8%)
Query: 72 VPVFNVFRKHHITSDQGIPLGGIGAGSIGRSYRGEFQRFKLFHGICDDAPVL-ANQF--S 128
+ +FNVFR T G+PLGGIG+GSIG +RG F RF + GI + NQF S
Sbjct: 64 IDIFNVFRHFTYT---GVPLGGIGSGSIGTDFRGGFNRFSIIPGIKEQTETQKCNQFIAS 120
Query: 129 VFVSRPNGEKFSSVL-CPRSPGVPKKNTDSGIESWDWNLKGENCTYHALFPRAWTVYDGE 187
V + + S+L C P + + WD + E+ Y LFPRAW +
Sbjct: 121 VHSKKTFELIYQSILSCAEFPA-------TVLPKWDSTIPAEDVRYRGLFPRAWQEFRLG 173
Query: 188 PDPELRIVCRQISPFIPHNYKESSFPASVFTFTLSNSGQTSADVTLLFTWANSVAGDSGL 247
+ +V +SP IP +Y +SS P + F F + N +V++ ++ N
Sbjct: 174 TSG-ITVVVEHLSPVIPEDYSDSSLPLANFDFHVFNDSPEEVEVSITMSFRNGTGNKKWN 232
Query: 248 SGHHFNSKTMTKDGVHGLTLHHRTANGRPPVTFAVAAEETADVHVSECPCFLLSGNSKGI 307
S + KD + TL H T G PVT+A+ EE V+ C L N G
Sbjct: 233 DESLCQSHKVQKDTMVVRTLSH-TVKGM-PVTYAIGTEEKNGAKVTTC---LFDPNGTG- 286
Query: 308 TAKDMWNEIKKHG---SFDHLDNDKTSPSEPGSSIGAAIAASLTIPSGSTRSVTFSLAWD 364
+W++++ +G S+DHL P +P +G A+ +S +P + + FSLAW
Sbjct: 287 --GRLWSDLEAYGHLSSYDHL------PPKP-KELGIAVCSSFFVPPDGSHTTQFSLAWY 337
Query: 365 CPEVKF--FEKVYHRRYTKFYG--TLGDSAARIAHDAILEHAKWECEIEAWQRPILEDKR 420
P+V F E+ Y+RRY +F+ + A I + W+ I WQ PI+ D++
Sbjct: 338 MPQVHFGTAERFYNRRYCRFFNGPDADEVGASICRHGLQNIETWQEAIRKWQDPIINDEK 397
Query: 421 FPEWYPITLFNELYYLNAGGTIWTDGSPPMQSLATIRERKFSLDTPSKIHTPASSDTALG 480
PEWY +FNELYY+ G T+W + P + +D L
Sbjct: 398 LPEWYRSAIFNELYYIVDGSTVWFEYDPEWR-----------------------TDEILI 434
Query: 481 TRLLENGEENIGQFLYLEGAEYVMYNTYDVHFYSSFALVMLFPKLELSIQRDFAAAVMMH 540
+ E + G+F Y+E EY M NTYDVHFYSS+A++ +P++E+S+Q DFA V
Sbjct: 435 SEETEKQFKQYGRFGYMESWEYFMINTYDVHFYSSWAILKNWPQIEMSMQMDFADQVDRV 494
Query: 541 DPGTMKIMSDGKWVARKCLGAVPHDIG--LDDPWFEINSYNLFNSSRWKDLNSKFVLQVY 598
D GT ++DG + K +PHD+G + DPW N+Y L ++ WKDLN KFV+ Y
Sbjct: 495 DYGTATSLADGDTMTIKSYDRIPHDMGHPMADPWIHTNAYILHDTGHWKDLNLKFVISCY 554
Query: 599 RDF--VATGDKNFARA----VWPSVYIAMAYMEQFDKDGDGMIENEGFPDQTYDAWSANG 652
RD+ +A G+++ + I +E +DKD DGMIEN+GF DQTYD W G
Sbjct: 555 RDWKMIAEGEEDSQEILEFFLGKCTKIVDGALENWDKDKDGMIENDGFADQTYDVWKMTG 614
Query: 653 VSAYCGGLWVAALQAASALANDVGDHASASYFWVRYQKAKAVYDSLWNGSYFNYDNSDGS 712
SAYCG LW+AAL + + G + Y +A LWNG +F +D +
Sbjct: 615 TSAYCGSLWIAALTSYIQMLKRAG-IPTKDYEEKLLMAYEAYTTKLWNGKFFKFDELPDN 673
Query: 713 SSTSIQADQLAGQWYARACGLLPIADEAKVKKALTKIYDFNVLKVKGGMCGAMNGMQPDG 772
S + ADQL G W A E K++ AL I+ +NV GG CGA+NG
Sbjct: 674 SKI-VMADQLCGFWALTAMDEPIQVSEGKIQSALETIFKYNVEMYDGGKCGAVNGFLASE 732
Query: 773 RIDMSGLQAREIWPGVTYGLAASMIQEEMVDMAFQTAAGVYEVAWSEDGLGYSFQTPESW 832
R+D S +Q+ E+W G+TY L+A MI++ M +MAF+T+ G++ W+ L Y QTPE+
Sbjct: 733 RVDGSSIQSEEVWAGITYSLSAMMIEKGMDEMAFKTSEGLFHSIWNRFPLQY--QTPEAI 790
Query: 833 NNNDEYRSLCYMRPLTIWAMQWALTKPK 860
+ YR+L YMRPL+IWA+Q AL K K
Sbjct: 791 TADGMYRALGYMRPLSIWAIQHALDKKK 818
>gi|341899054|gb|EGT54989.1| hypothetical protein CAEBREN_31176 [Caenorhabditis brenneri]
Length = 831
Score = 436 bits (1122), Expect = e-119, Method: Compositional matrix adjust.
Identities = 284/814 (34%), Positives = 407/814 (50%), Gaps = 77/814 (9%)
Query: 72 VPVFNVFRKHHITSDQGIPLGGIGAGSIGRSYRGEFQRFKLFHGICDDAPVL-ANQF--S 128
+ +FNVFR T G+PLGGIG+GSIG +RG F RF + GI + NQF S
Sbjct: 64 IDIFNVFRHFTFT---GVPLGGIGSGSIGTDFRGGFNRFSIIPGIKEQTDTQKCNQFIAS 120
Query: 129 VFVSRPNGEKFSSVL-CPRSPGVPKKNTDSGIESWDWNLKGENCTYHALFPRAWTVYDGE 187
V + + S+L C P + + WD + E+ Y LFPRAW +
Sbjct: 121 VHSKKTFELIYQSILSCAEFPS-------TVLPKWDSTIPAEDVRYRGLFPRAWQEFR-L 172
Query: 188 PDPELRIVCRQISPFIPHNYKESSFPASVFTFTLSNSGQTSADVTLLFTWANSVAGDSGL 247
+ +V +SP IP++Y +SS P + F F + N +V++ + N
Sbjct: 173 GSSGITVVIEHLSPVIPNDYSDSSLPLANFDFHVFNDSPEEVEVSITMAFRNGTGNRKWN 232
Query: 248 SGHHFNSKTMTKDGVHGLTLHHRTANGRPPVTFAVAAEETADVHVSECPCFLLSGNSKGI 307
H S+ + K+ TL H T G PVT+A+ EE V+ C L N G
Sbjct: 233 DEHLCTSQKVQKETQVVRTLSH-TVKGM-PVTYAIGTEEKNGAKVTTC---LFDPNGTG- 286
Query: 308 TAKDMWNEIKKHGSFDHLDNDKTSPSEPGSSIGAAIAASLTIPSGSTRSVTFSLAWDCPE 367
+W++++ +G DN P E +G A+ +S +P + + FSL+W PE
Sbjct: 287 --GRLWSDLEAYGHLSSYDNLPPKPKE----LGIAVCSSFFVPPDGSHTTQFSLSWYMPE 340
Query: 368 VKF--FEKVYHRRYTKFYG--TLGDSAARIAHDAILEHAKWECEIEAWQRPILEDKRFPE 423
V F E+ Y+RRY +F+ + AA I + KW+ IE WQ PI+ +++ PE
Sbjct: 341 VHFGTAERFYNRRYCRFFNGPDADEVAAAICRHGLQNIEKWQEAIEKWQNPIINNQKLPE 400
Query: 424 WYPITLFNELYYLNAGGTIWTDGSPPMQSLATIRERKFSLDTPSKIHTPASSDTALGTRL 483
WY +FNELYY+ G T+W + P + ++ L +
Sbjct: 401 WYRSAIFNELYYIVDGSTVWFEYDPDWR-----------------------TEENLISEQ 437
Query: 484 LENGEENIGQFLYLEGAEYVMYNTYDVHFYSSFALVMLFPKLELSIQRDFAAAVMMHDPG 543
E G+F Y+E EY M NTYDVHFYSS+A++ +P++E+S+Q DFA V D G
Sbjct: 438 TEKQFREYGRFGYMESWEYFMLNTYDVHFYSSWAILKNWPEIEMSMQLDFADQVDRIDNG 497
Query: 544 TMKIMSDGKWVARKCLGAVPHDIG-------------LDDPWFEINSYNLFNSSRWKDLN 590
T ++DG + K +PHD+G + DPW N+Y L ++ WKDLN
Sbjct: 498 TATSLADGDQMLIKSYDRIPHDMGHPTKYCIQEHRLSVADPWLHTNAYILHDTGSWKDLN 557
Query: 591 SKFVLQVYRDFVATG------DKNFARAVWPSVYIAMAYMEQFDKDGDGMIENEGFPDQT 644
KFV+ YRD+ D+ + I +E +DKD DGMIEN+GF DQT
Sbjct: 558 LKFVISCYRDYKMIAKDREDKDQLLEFFLGKCTKIVEDALESWDKDKDGMIENDGFADQT 617
Query: 645 YDAWSANGVSAYCGGLWVAALQAASALANDVGDHASASYFWVRYQKAKAVYDSLWNGSYF 704
YD W G SAYCG LW+AAL + G S +Y A LWNG +F
Sbjct: 618 YDVWKMTGTSAYCGSLWLAALTCYIEMLKQAGS-PSKNYEGKLLNAYDAYITKLWNGRFF 676
Query: 705 NYDNSDGSSSTSIQADQLAGQWYARACGLLPIADEAKVKKALTKIYDFNVLKVKGGMCGA 764
+D +S + ADQL G W A E K+K AL I+ +NV G CGA
Sbjct: 677 KFDELPDNSKI-VMADQLCGFWALTAMDEPVQVSEEKMKLALETIFKYNVQMYNDGKCGA 735
Query: 765 MNGMQPDGRIDMSGLQAREIWPGVTYGLAASMIQEEMVDMAFQTAAGVYEVAWSEDGLGY 824
+NG R+D S +Q+ E+W G+TY L+A MI++ M +MAF+T+ G++E W+ L Y
Sbjct: 736 VNGFLTSERVDGSSIQSEEVWAGITYALSAMMIEKNMDEMAFKTSEGLFESIWNRFPLQY 795
Query: 825 SFQTPESWNNNDEYRSLCYMRPLTIWAMQWALTK 858
QTPE+ ++ YR+L YMRPL+IWA+Q AL +
Sbjct: 796 --QTPEAITSDGMYRALGYMRPLSIWAIQHALDR 827
>gi|326528739|dbj|BAJ97391.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 424
Score = 436 bits (1122), Expect = e-119, Method: Compositional matrix adjust.
Identities = 233/418 (55%), Positives = 297/418 (71%), Gaps = 4/418 (0%)
Query: 10 ENAASMPKVDPGQPPQLTWQRKLDSNVKVPTGFTLSFQEFRHLAHIGFRLYRYSKQEEAK 69
E + ++ QP TWQRK D K F+++ + + + + + + K
Sbjct: 7 EQTKARSHINCAQPATRTWQRKFDDEGKKIELFSMTMNDMISIIPMILKGLMINADQRGK 66
Query: 70 GKVPVFNVFRKHHITSDQGIPLGGIGAGSIGRSYRGEFQRFKLFHGICDDAPVLANQFSV 129
G+ +++ FRK +G+PLGG+GAGSIGRSYRG FQ F++F + ++ P+LANQFS
Sbjct: 67 GRDILYDPFRKWMDNCYRGLPLGGLGAGSIGRSYRGYFQHFQIFPALYEERPILANQFSA 126
Query: 130 FVSRPNGEKFSSVLCPRSPGVPKKNTDSGIESWDWNLKGENCTYHALFPRAWTVYDGEPD 189
FVSRPNG+ +S+VL + K GI SWDW LK +NCTYHALFPR+WTVYDGEPD
Sbjct: 127 FVSRPNGKSYSTVLSAPTADAFKGVDKVGIGSWDWKLKEKNCTYHALFPRSWTVYDGEPD 186
Query: 190 PELRIVCRQISPFIPHNYKESSFPASVFTFTLSNSGQTSADVTLLFTWANSVAGDSGLSG 249
PE+ I CRQISP IPHNYKESSFP +VFTFT+ NSG T+ADVTLLFTW NSV G S L+G
Sbjct: 187 PEINITCRQISPIIPHNYKESSFPVAVFTFTVQNSGSTAADVTLLFTWTNSVGGRSELTG 246
Query: 250 HHFNSKTMTKDGVHGLTLHHRTANGRPPVTFAVAAEETADVHVSECPCFLL-SGNSKGIT 308
+H NSK + +DGV G+ L HRTA+G PPVTFA+A++ET +V V++CP F + S +S T
Sbjct: 247 NHTNSKMIERDGVQGVLLRHRTADGHPPVTFAIASQETEEVRVTDCPFFTMGSSDSGDFT 306
Query: 309 AKDMWNEIKKHGSFDHLDNDKT-SPSEPGSSIGAAIAASLTIPSGSTRSVTFSLAWDCPE 367
AKDMW EIK+HGSF DK S+PGSSIGAA+AA+ T+P+G TR V+F+L+W CPE
Sbjct: 307 AKDMWEEIKQHGSFSETRTDKEPRASKPGSSIGAAVAAATTVPAGGTRVVSFALSWSCPE 366
Query: 368 VKFFE-KVYHRRYTKFYGTLGDSAAR-IAHDAILEHAKWECEIEAWQRPILEDKRFPE 423
VKF + K YHRRYTKF G GD+AA +AHDA+LEH WE +IE WQRPIL+DKR PE
Sbjct: 367 VKFPDGKTYHRRYTKFCGLDGDAAAESLAHDALLEHMDWESKIEEWQRPILQDKRLPE 424
>gi|358254610|dbj|GAA55931.1| non-lysosomal glucosylceramidase [Clonorchis sinensis]
Length = 1200
Score = 436 bits (1121), Expect = e-119, Method: Compositional matrix adjust.
Identities = 293/871 (33%), Positives = 436/871 (50%), Gaps = 75/871 (8%)
Query: 28 WQRKLDSNVKV---PTGFTLSFQEFRHLAHIGFRL-YR-YSKQEEAKGKVPVFNVFRKHH 82
W+ + D +V P GF + + + FR +R Y ++ + ++P + +
Sbjct: 24 WKARFDRKPEVQCKPFGFP-RLSQIKSFIPMAFRYTFRFYLRKRLIEKRLPFLDPSQHVP 82
Query: 83 ITSDQGIPLGGIGAGSIGRSYRGEFQRFKLFHGICDDAPVLANQFSVFVSRPNGEKFSSV 142
G+P+GGIG G+IGR +RGEF R L G+ P A+QF + + + F V
Sbjct: 83 WKPIYGVPMGGIGCGAIGRGFRGEFVRSSLIPGMYSYDPQPADQFILTIRKGGETLFHQV 142
Query: 143 LCPRSPGVPKKNTDSGIESWDWNLKGENCTYHALFPRAWTVYDGEPDPELRIVCRQISPF 202
L P++ VP +G+ SW+W ++ Y L+PR+WTVY+ L +VC Q+SP
Sbjct: 143 LSPQT-SVPA--FANGLRSWNWGFSAQHAHYIGLYPRSWTVYEVA-HFNLILVCHQVSPV 198
Query: 203 IPHNYKESSFPASVFTFTLSNSGQTSADVTLLFTW-ANSVAGDSGLSGHHFNSKTMTKDG 261
+PH+YK S P VF +T+ N G + +VT+ F+W S A S + ++ S +G
Sbjct: 199 LPHDYKASCLPTCVFVWTVFNFGSEALEVTISFSWHGPSPAPRSAPNSNNQVSNHKQPNG 258
Query: 262 VHGLTLHHRTANGRPPVTFAVAAEETADVHVSECPCFLLSGNSKGITAKDMWNEIKKHGS 321
+ + L + PV ++A +H+ P +S S G T + W +++
Sbjct: 259 LSSVELTQPLV--KEPVADYLSARIQFAMHLLVGPSGYISYVS-GYTITN-WPTLRRRTP 314
Query: 322 FDHLDNDKTSPSEPGSSIGAAIAASLTIPSGSTRSVTFSLAWDCPEVKFF--EKVYHRRY 379
+ T P S A A++ + + F++ W P V F + +Y RRY
Sbjct: 315 KLAVMVSATCIVPPARS---ANASAGDLSEAGQSQLEFAVVWHSPLVWFRGNKVIYRRRY 371
Query: 380 TKFY---GTLGDSAARIAHDAILEHAKWECEIEAWQRPILEDKRFPEWYPITLFNELYYL 436
+++ G LG A R+ A+ W IE WQ P+L + FP+WY LFNELYYL
Sbjct: 372 ARWFPEPGMLG--AERLLESALSNWKDWVERIEQWQNPVLHNSSFPDWYKSALFNELYYL 429
Query: 437 NAGGTIWTDGSPPMQSLATIRERKFSLDTPSKIHTPASSDTALGTRLLENGEENIGQFLY 496
GGT+W D P A + + + + + +G F Y
Sbjct: 430 TDGGTVWLD--PNHHGGADFKWQSW--------------------KYRSRVAQEMGLFAY 467
Query: 497 LEGAEYVMYNTYDVHFYSSFALVMLFPKLELSIQRDFAAAVMMHDPGTMKIMSDGKWVAR 556
LEG EY MYNT DVH+ SS+AL+ L+PKL+L++ D A + D + + G++ R
Sbjct: 468 LEGHEYRMYNTLDVHYNSSWALIKLWPKLQLAVLLDCADLAIEEDQTQLYFIHQGRYGIR 527
Query: 557 KCLGAVPHDIG--LDDPWFEINSYNLFNSSRWKDLNSKFVLQVYRDFVATGDKNFARAVW 614
AVPHD G +PW + N+Y ++ + WKDLN KFVLQV+RD+ T D ++ +
Sbjct: 528 STESAVPHDFGDPEGEPWRDANAYVMYPTKDWKDLNPKFVLQVWRDWKLTQDNDYLLYML 587
Query: 615 PSV-YIAMAYMEQFDKDGDGMIENEGFPDQTYDAWSANGVSAYCGGLWVAALQAASAL-- 671
P V I ++ +D +GDGMIEN GFPDQTYD W+A G+SAY GGLW+A L +A +
Sbjct: 588 PIVNRIIQVSLQSWDSNGDGMIENSGFPDQTYDTWNAQGISAYTGGLWLACLYSACEMLE 647
Query: 672 ----AND------VGDHASASYFWVRYQK--------AKAVYD-SLWNGSYFNYDNSDGS 712
AN + ++ W + QK AK VYD +LW G+Y+ Y N+
Sbjct: 648 YALSANSPLKAFLIASNSEDEVSWPQVQKELQILLESAKTVYDRALWTGTYYIYQNTPVG 707
Query: 713 SSTSIQADQLAGQWYARACGLLP--IADEAKVKKALTKIYDFNVLKVKGGMCGAMNGMQP 770
+ +I ADQL+G + R G P I V AL I N ++ G G +NG P
Sbjct: 708 NHEAIMADQLSGHGFLRVGGAPPGAILPSEHVIMALKTIQSTNWESIQNGSLGVINGGLP 767
Query: 771 DGRIDMSGLQAREIWPGVTYGLAASMIQEEMVDMAFQTAAGVYEVAWSEDGLGYSFQTPE 830
+ D+S +QA E W GV Y LAA+MI E M F A Y ++ GL Y QTPE
Sbjct: 768 GAKRDLSNVQAEEFWVGVNYSLAATMILEGMSQEGFSLAEACYNTIYNRFGLQY--QTPE 825
Query: 831 SWNNNDEYRSLCYMRPLTIWAMQWALTKPKL 861
++ + +R YMRPL IW++Q AL +L
Sbjct: 826 AYMIDGRFRCPGYMRPLAIWSIQQALETKQL 856
>gi|355689821|gb|AER98956.1| glucosidase, beta 2 [Mustela putorius furo]
Length = 550
Score = 434 bits (1116), Expect = e-118, Method: Compositional matrix adjust.
Identities = 233/531 (43%), Positives = 309/531 (58%), Gaps = 35/531 (6%)
Query: 330 TSPSEPGSSIGAAIAASLTIPSGSTRSVTFSLAWDCPEVKFFEK--VYHRRYTKFYGTLG 387
++PS+ G I A+ + +P + FSLAWD P + F K VY+RRYT+F+G G
Sbjct: 1 STPSQKGVGIAGAVCVTGKLPPRGQCRLEFSLAWDMPRIMFGAKGQVYYRRYTRFFGRDG 60
Query: 388 DSAARIAHDAILEHAKWECEIEAWQRPILEDKRFPEWYPITLFNELYYLNAGGTIWTDGS 447
++A ++H A+ + WE +I AWQ P+L+D+ P WY LFNELY+L GGT+W
Sbjct: 61 NAAPALSHYALCRYIDWEEKISAWQSPVLDDRSLPAWYKSALFNELYFLADGGTVW---- 116
Query: 448 PPMQSLATIRERKFSLDTPSKIHTPASSDTALGTRL-LENGEENIGQFLYLEGAEYVMYN 506
L+ P + D +G+ L + G+F YLEG EY MYN
Sbjct: 117 ---------------LEVPED----SLPDELVGSMCQLRPILQEYGRFGYLEGQEYRMYN 157
Query: 507 TYDVHFYSSFALVMLFPKLELSIQRDFAAAVMMHDPGTMKIMSDGKWVARKCLGAVPHDI 566
TYDVHFY+SFAL+ML+PKLELS+Q D A A + D + + G K +PHDI
Sbjct: 158 TYDVHFYASFALIMLWPKLELSLQYDMALATLREDLTRRQYLMSGVMAPVKRRNVIPHDI 217
Query: 567 G--LDDPWFEINSYNLFNSSRWKDLNSKFVLQVYRDFVATGDKNFARAVWPSVYIAMAYM 624
G D+PW +N+Y + +++ WKDLN KFVLQVYRD+ TGD++F R +WP M
Sbjct: 218 GDPDDEPWLRVNAYVVHDTADWKDLNLKFVLQVYRDYHLTGDQSFLRDMWPVCLAVMESE 277
Query: 625 EQFDKDGDGMIENEGFPDQTYDAWSANGVSAYCGGLWVAALQAASALANDVGDHASASYF 684
+FDKD DG+IEN G+ DQTYD W G SAYCGGLW+AA+ +A G F
Sbjct: 278 MKFDKDQDGLIENGGYADQTYDGWITTGPSAYCGGLWLAAVAVMVQMAVLCGAKDIQDKF 337
Query: 685 WVRYQKAKAVYDS-LWNGSYFNYDNSDGSSSTSIQADQLAGQWYARACGL----LPIADE 739
++ + Y+ LWNG Y+NYD S S SI +DQ AGQW+ +A GL +
Sbjct: 338 SSILRRGQEAYERLLWNGRYYNYDCSPQPQSCSIMSDQCAGQWFLKASGLGEGDTEVFPT 397
Query: 740 AKVKKALTKIYDFNVLKVKGGMCGAMNGMQPDGRIDMSGLQAREIWPGVTYGLAASMIQE 799
V AL I++FNV GG GA+NGMQP G D S +Q+ E+W GV YGLAA+MIQE
Sbjct: 398 QHVVCALQTIFEFNVRAFAGGAMGAVNGMQPHGVPDRSSVQSDEVWVGVVYGLAATMIQE 457
Query: 800 EMVDMAFQTAAGVYEVAWSEDGLGYSFQTPESWNNNDEYRSLCYMRPLTIW 850
+ FQTA G Y W LG +FQTPE++ +RSL YMRPL+IW
Sbjct: 458 GLTWEGFQTAEGCYRTVWER--LGLAFQTPEAYCQQRVFRSLAYMRPLSIW 506
>gi|308471410|ref|XP_003097936.1| hypothetical protein CRE_12977 [Caenorhabditis remanei]
gi|308239241|gb|EFO83193.1| hypothetical protein CRE_12977 [Caenorhabditis remanei]
Length = 845
Score = 430 bits (1105), Expect = e-117, Method: Compositional matrix adjust.
Identities = 284/818 (34%), Positives = 421/818 (51%), Gaps = 71/818 (8%)
Query: 72 VPVFNVFRKHHITSDQGIPLGGIGAGSIGRSYRGEFQRFKLFHGICDDAPVL-ANQF--S 128
+ +FNVFR T G+PLGGIG+GSIG +RG F RF + GI + NQF S
Sbjct: 64 IDIFNVFRHFTYT---GVPLGGIGSGSIGTDFRGGFNRFSIIPGIKEQTETQKCNQFIAS 120
Query: 129 VFVSRPNGEKFSSVL-CPRSPGVPKKNTDSGIESWDWNLKGENCTYHALFPRAWTVYDGE 187
V + + S+L C P + + WD + E+ Y LFPR W +
Sbjct: 121 VHSKKTFELIYQSILSCAEFPA-------TVLPKWDTTIPAEDVRYRGLFPRGWQEFR-L 172
Query: 188 PDPELRIVCRQISPFIPHNYKESSFPASVFTFTLSNSGQTSADVTLLFTWANSVAGDSGL 247
+ +V +S IP +Y +SS P + F F + N +V++ + N
Sbjct: 173 GSSGITVVIESLSSVIPGDYSDSSLPLANFDFHVFNDSPEEVEVSITLAFRNGTGNRKWN 232
Query: 248 SGHHFNSKTMTKDGVHGLTLHHRTANGRPPVTFAVAAEETADVHVSECPCFLLSGNSKGI 307
H S+ + +D + TL H T G PVT+A+ EE V+ C L N G
Sbjct: 233 DEHLCQSQKVQRDTMVVRTLSH-TVKGM-PVTYAIGTEEKNGAKVTTC---LFDPNGTG- 286
Query: 308 TAKDMWNEIKKHG---SFDHLDNDKTSPSEPGSSIGAAIAASLTIPSGSTRSVTFSLAWD 364
+W++++ +G S+DHL P +P +G A+ +S +P + + FSL+W
Sbjct: 287 --GRLWSDLEAYGHLSSYDHL------PPKP-KELGIAVCSSFFVPPDGSHTTQFSLSWY 337
Query: 365 CPEVKF--FEKVYHRRYTKFYG--TLGDSAARIAHDAILEHAKWECEIEAWQRPILEDKR 420
P+V F E+ Y+RRY +F+ + AA I + +KW+ IE WQ P++ D++
Sbjct: 338 MPQVHFGTAERFYNRRYCRFFNGPDADEVAASICRHGLQNISKWQEAIEKWQTPVINDEK 397
Query: 421 FPEWYPITLFNELYYLNAGGTIWTDGSPPM---QSLATIRERKFSLDTPSKIHTPASSDT 477
P+WY +FNELYY+ G T+W + P +SL + K + + +
Sbjct: 398 LPDWYRSAIFNELYYIVDGSTVWFEYDPEWKTDESLISEHTEKHFKEYGRFGYMECEFFS 457
Query: 478 ALGTRLLENGEENIGQFLYLEGAEYVMYNTYDVHFYSSFALVMLFPKLELSIQRDFAAAV 537
+ R L F +L EY M NTYDVHFYSS+A++ +P++E+S+Q DFA V
Sbjct: 458 LVELRTL---------FFFLTAWEYFMLNTYDVHFYSSWAILKNWPQIEMSMQLDFADQV 508
Query: 538 MMHDPGTMKIMSDGKWVARKCLGAVPHDIG-------------LDDPWFEINSYNLFNSS 584
D GT ++DG+ ++ K +PHD+G + DPW N+Y L ++
Sbjct: 509 DRIDNGTATSLADGEQMSIKSYDRIPHDMGHPTKYCIQEHRLSVADPWLHTNAYILHDTG 568
Query: 585 RWKDLNSKFVLQVYRDF--VATGDKNFAR-AVWPSVYIAMAYMEQFDKDGDGMIENEGFP 641
WKDLN KFV+ YRD+ ++ GD+ + I +E +DKD DGMIEN+GF
Sbjct: 569 CWKDLNLKFVISCYRDWKMISNGDQEILEFFIGKCTKIVDGALESWDKDQDGMIENDGFA 628
Query: 642 DQTYDAWSANGVSAYCGGLWVAALQAASALANDVGDHASASYFWVRYQKAKAVYDS-LWN 700
DQTYD W G SAYCG LW+AAL + G + ++ + A Y + LWN
Sbjct: 629 DQTYDVWKMTGTSAYCGSLWLAALTCYIQMLKKAG--SPTKFYDEKLLMAYEAYTTKLWN 686
Query: 701 GSYFNYDNSDGSSSTSIQADQLAGQWYARACGLLPIADEAKVKKALTKIYDFNVLKVKGG 760
G +F +D +S + ADQL G W A E K+K AL I+ +NV GG
Sbjct: 687 GKFFKFDELPDNSKI-VMADQLCGFWALTAMDEPVQVSEEKMKSALETIFKYNVQMYDGG 745
Query: 761 MCGAMNGMQPDGRIDMSGLQAREIWPGVTYGLAASMIQEEMVDMAFQTAAGVYEVAWSED 820
CGA+NG R+D S +Q+ E+W G+TY L+A MI++ M +MAF+T+ G+++ W+
Sbjct: 746 KCGAVNGYLTSERVDGSSIQSEEVWAGITYALSAMMIEKGMDEMAFKTSEGLFDSIWNRF 805
Query: 821 GLGYSFQTPESWNNNDEYRSLCYMRPLTIWAMQWALTK 858
L FQTPE+ ++ YR+L YMRPL+IWA+Q AL K
Sbjct: 806 PL--QFQTPEAITSDGMYRALGYMRPLSIWAIQHALEK 841
>gi|443669591|ref|ZP_21134796.1| hypothetical protein C789_5336 [Microcystis aeruginosa DIANCHI905]
gi|159029778|emb|CAO87856.1| unnamed protein product [Microcystis aeruginosa PCC 7806]
gi|443330115|gb|ELS44858.1| hypothetical protein C789_5336 [Microcystis aeruginosa DIANCHI905]
Length = 784
Score = 424 bits (1091), Expect = e-116, Method: Compositional matrix adjust.
Identities = 265/803 (33%), Positives = 400/803 (49%), Gaps = 93/803 (11%)
Query: 87 QGIPLGGIGAGSIGRSYRGEFQRFKLFHGICDDAPVLANQFSVFVSRPNGEKFSSVLCPR 146
G+PLGG GAG IGRS +G+F + L G + A QFS+F +G + +
Sbjct: 41 HGMPLGGFGAGCIGRSPKGDFNLWHLDGGEHSFNSLAACQFSIFEQTEDGNAQAYAMATE 100
Query: 147 SPGVPKKNTDSGIESWDWNLKGENCTYHALFPRAWTVYDGEPDPELRIVCRQISPFIPHN 206
SP D + W W E TY AL+PR+W Y+ + + +++C Q SP HN
Sbjct: 101 SP------KDGTLSRWSW-YPAEKGTYSALYPRSW--YEYKDVFQTQLICEQFSPVWAHN 151
Query: 207 YKESSFPASVFTFTLSNSGQTSADVTLLFTWANSVA-------------GDSGLSGHHFN 253
Y+ESS+P ++F +T+ N ++++ TW N++ D G + +
Sbjct: 152 YQESSYPIALFEWTVHNPTDKPITISIMLTWQNTIGWFTNAIKSPVVKVRDDGSPVYEYQ 211
Query: 254 SKTMTKDGVH----------GLTLHHRTAN-GRPPVTFAVAAEETADVHVSECPCFLLSG 302
SK +G + G + N G A+A+ + V F L
Sbjct: 212 SKWGESEGNYNQWIVDNFRVGCLMRRDYENPGEGDGQMAIASITNPSLEV-----FYLGK 266
Query: 303 NSKGITAKDMWNEIKKHGSFDHLDNDKTSPSEPGSSIGAAIAASLTIPSGSTRSVTFSLA 362
+ ++W+ +GS ++++ +P+E G A+A TI G TR + F LA
Sbjct: 267 WNPAGDGGEVWDYFAMNGSLPDIEDE--TPAEKGKQTATAMAIRFTIRPGKTRKIPFILA 324
Query: 363 WDCPEVKFFEKV-YHRRYTKFYGTLGDSAARIAHDAILEHAKWECEIEAWQRPILEDKRF 421
WD P +F + + Y+RRYT F+G G +A + A+ + KW I WQ+PILE
Sbjct: 325 WDLPVTEFAQGINYYRRYTDFFGRNGQNAWAMVRTALKQGDKWRENIINWQKPILERIDL 384
Query: 422 PEWYPITLFNELYYLNAGGTIWTDGSPPMQSLATIRERKFSLDTPSKIHTPASSDTALGT 481
+W+ + LFNELY L GGT+WT +
Sbjct: 385 ADWFKMALFNELYLLTDGGTLWTAATE--------------------------------- 411
Query: 482 RLLENGEENIGQFLYLEGAEYVMYNTYDVHFYSSFALVMLFPKLELSIQRDFAAAVMMHD 541
++ IGQF LE +Y Y + DV Y SF L+ML+P+LE ++ FA A+ D
Sbjct: 412 ------QDPIGQFGVLECIDYRWYESLDVRLYGSFGLLMLWPRLEKAVMEAFARAIPSSD 465
Query: 542 PGTMKIMSDGKWVARKCLGAVPHDIGL--DDPWFEINSYNLFNSSRWKDLNSKFVLQVYR 599
I + RK A PHD+G + PW + N + + + WKDL S FVL VYR
Sbjct: 466 DTPRIIGYNQASAIRKAKNATPHDLGAPNEHPWQKSNYTSYQDCNLWKDLGSDFVLLVYR 525
Query: 600 DFVATG--DKNFARAVWPSVYIAMAYMEQFDKDGDGMIENEGFPDQTYDAWSANGVSAYC 657
D++ TG D++F R WPS+ + + Y++ FD D DG+ EN G PDQT+D W G+SAYC
Sbjct: 526 DYLLTGAKDEDFLRECWPSIVLTLQYLKTFDLDNDGIPENSGAPDQTFDDWRLQGISAYC 585
Query: 658 GGLWVAALQAASALANDVGDHASASYFWVRYQKAKAVY-DSLWNGSYFNYDNSDGSSSTS 716
GGLW+AAL+A + +G+ + W+ Q+++A+Y +LWNG Y+N D+ GS S
Sbjct: 586 GGLWIAALEATLKIGEIIGEDTAIFTSWL--QQSRAIYHQTLWNGEYYNLDS--GSGSDV 641
Query: 717 IQADQLAGQWYARACGLLPIADEAKVKKALTKIYDFNVLKVKGGMCGAMNGMQPDGRIDM 776
+ DQL GQ+YAR L + + + AL+KIY LK GG GA NG++PDG +
Sbjct: 642 VMTDQLCGQFYARLLALPDVVENQYTQSALSKIYQACFLKFHGGKFGAANGLKPDGTPEN 701
Query: 777 -SGLQAREIWPGVTYGLAASMIQEEMVDMAFQTAAGVYEVAWSEDGLGYSFQTPESWNNN 835
E+W G+ +GL A ++Q M +Q V + E+GL F+TPE+ +
Sbjct: 702 PEATHPLEVWVGINFGLVAFLLQMGMEKQGWQITEAVVRQVY-ENGL--QFRTPEAITSV 758
Query: 836 DEYRSLCYMRPLTIWAMQWALTK 858
+R+ Y+R + IWA+ LT+
Sbjct: 759 GTFRACHYLRAMAIWAIYGVLTE 781
>gi|425435596|ref|ZP_18816046.1| Similar to tr|P73619|P73619 [Microcystis aeruginosa PCC 9432]
gi|389679839|emb|CCH91411.1| Similar to tr|P73619|P73619 [Microcystis aeruginosa PCC 9432]
Length = 784
Score = 424 bits (1089), Expect = e-115, Method: Compositional matrix adjust.
Identities = 264/803 (32%), Positives = 400/803 (49%), Gaps = 93/803 (11%)
Query: 87 QGIPLGGIGAGSIGRSYRGEFQRFKLFHGICDDAPVLANQFSVFVSRPNGEKFSSVLCPR 146
G+PLGG GAG IGRS +G+F + L G + A QFS+F +G + +
Sbjct: 41 HGMPLGGFGAGCIGRSPKGDFNLWHLDGGEHSFNSLAACQFSIFEQTEDGNAQAYAMATE 100
Query: 147 SPGVPKKNTDSGIESWDWNLKGENCTYHALFPRAWTVYDGEPDPELRIVCRQISPFIPHN 206
SP TD + W W E Y AL+PR+W Y+ + + +++C Q SP HN
Sbjct: 101 SP------TDGTLSRWSW-YPAEKGNYSALYPRSW--YEYKDVFQTQLICEQFSPVWAHN 151
Query: 207 YKESSFPASVFTFTLSNSGQTSADVTLLFTWANSVA-------------GDSGLSGHHFN 253
Y+ESS+P ++F +T N ++++ TW N++ D G + +
Sbjct: 152 YQESSYPIALFEWTAHNPTDKPITISIMLTWQNTIGWFTNAIKSPVVKVRDDGSPVYEYQ 211
Query: 254 SKTMTKDGVH----------GLTLHHRTAN-GRPPVTFAVAAEETADVHVSECPCFLLSG 302
SK +G + G + N G A+A+ + V F L
Sbjct: 212 SKWGESEGNYNQWIVDNFRVGCLMRRDYENPGEGDGQMAIASITNPSLEV-----FYLGK 266
Query: 303 NSKGITAKDMWNEIKKHGSFDHLDNDKTSPSEPGSSIGAAIAASLTIPSGSTRSVTFSLA 362
+ ++W+ +GS ++++ +P++ G A+A TI G TR + F LA
Sbjct: 267 WNPAGDGGEVWDYFAMNGSLPDIEDE--TPAQKGEQTATAMAIRFTIRPGKTRKIPFILA 324
Query: 363 WDCPEVKFFEKV-YHRRYTKFYGTLGDSAARIAHDAILEHAKWECEIEAWQRPILEDKRF 421
WD P +F + + Y+RRYT F+G G +A + A+ + W I WQ+PILE
Sbjct: 325 WDLPVTEFAQGINYYRRYTDFFGRNGQNAWAMVRTALKQGDTWRENIINWQKPILERSDL 384
Query: 422 PEWYPITLFNELYYLNAGGTIWTDGSPPMQSLATIRERKFSLDTPSKIHTPASSDTALGT 481
+W+ + LFNELY L GGT+WT +
Sbjct: 385 ADWFKMALFNELYLLTDGGTLWTAATE--------------------------------- 411
Query: 482 RLLENGEENIGQFLYLEGAEYVMYNTYDVHFYSSFALVMLFPKLELSIQRDFAAAVMMHD 541
++ IGQF LE +Y Y + DV Y SF L+ML+P+LE ++ FA A+ HD
Sbjct: 412 ------QDPIGQFGVLECIDYRWYESLDVRLYGSFGLLMLWPRLEKAVMEAFARAIPTHD 465
Query: 542 PGTMKIMSDGKWVARKCLGAVPHDIGL--DDPWFEINSYNLFNSSRWKDLNSKFVLQVYR 599
I + RK A PHD+G + PW + N + + + WKDL S FVL VYR
Sbjct: 466 DTPRIIGYNQASAIRKAKNATPHDLGAPNEHPWQKSNYTSYQDCNLWKDLGSDFVLLVYR 525
Query: 600 DFVATG--DKNFARAVWPSVYIAMAYMEQFDKDGDGMIENEGFPDQTYDAWSANGVSAYC 657
D++ TG D++F R WPS+ + + Y++ FD D DG+ EN G PDQT+D W G+SAYC
Sbjct: 526 DYLLTGAKDEDFLRECWPSIVLTLQYLKTFDLDNDGIPENSGAPDQTFDDWRLQGISAYC 585
Query: 658 GGLWVAALQAASALANDVGDHASASYFWVRYQKAKAVY-DSLWNGSYFNYDNSDGSSSTS 716
GGLW+AAL+A + +G+ + W+ Q+++A+Y +LWNG Y+N D+ GS S
Sbjct: 586 GGLWIAALEATLKIGAILGEDTAIFASWL--QQSRAIYHQTLWNGEYYNLDS--GSGSDV 641
Query: 717 IQADQLAGQWYARACGLLPIADEAKVKKALTKIYDFNVLKVKGGMCGAMNGMQPDGRIDM 776
+ DQL GQ+YAR L + + + AL+KIY+ LK GG GA NG++PDG +
Sbjct: 642 VMTDQLCGQFYARLLALPDVVENQYTQSALSKIYEACFLKFHGGKFGAANGLKPDGTPEN 701
Query: 777 -SGLQAREIWPGVTYGLAASMIQEEMVDMAFQTAAGVYEVAWSEDGLGYSFQTPESWNNN 835
E+W G+ +GL A ++Q M +Q V + E+GL F+TPE+ +
Sbjct: 702 PEATHPLEVWVGINFGLVAFLLQMGMEKQGWQITEAVVRQIY-ENGL--QFRTPEAITSV 758
Query: 836 DEYRSLCYMRPLTIWAMQWALTK 858
+R+ Y+R + IWA+ LT+
Sbjct: 759 GTFRACHYLRAMAIWAIYGVLTE 781
>gi|224028315|gb|ACN33233.1| unknown [Zea mays]
Length = 384
Score = 423 bits (1087), Expect = e-115, Method: Compositional matrix adjust.
Identities = 206/343 (60%), Positives = 269/343 (78%), Gaps = 1/343 (0%)
Query: 485 ENGEENIGQFLYLEGAEYVMYNTYDVHFYSSFALVMLFPKLELSIQRDFAAAVMMHDPGT 544
++G E++G+FLYLEG EY+M+NTYDVHFY+SFAL+ LFPK+ELSIQRDFA AV+ D
Sbjct: 20 KDGPEHVGKFLYLEGVEYIMWNTYDVHFYASFALLDLFPKIELSIQRDFANAVLYEDRRK 79
Query: 545 MKIMSDGKWVARKCLGAVPHDIGLDDPWFEINSYNLFNSSRWKDLNSKFVLQVYRDFVAT 604
+K ++DG RK GAVPHD+G DPW E+N+YN+ ++S+WKDLN KFVLQ+YRDF AT
Sbjct: 80 VKFLADGTSGIRKVKGAVPHDLGTHDPWHEMNAYNIHDTSKWKDLNPKFVLQIYRDFAAT 139
Query: 605 GDKNFARAVWPSVYIAMAYMEQFDKDGDGMIENEGFPDQTYDAWSANGVSAYCGGLWVAA 664
GD F R VWP+V AM YM+QFD+DGDG+IEN+GFPDQTYDAW+ +G+SAYCG LW+AA
Sbjct: 140 GDMQFGRDVWPAVCAAMDYMDQFDRDGDGLIENDGFPDQTYDAWTVHGISAYCGCLWLAA 199
Query: 665 LQAASALANDVGDHASASYFWVRYQKAKAVYDS-LWNGSYFNYDNSDGSSSTSIQADQLA 723
LQAA+ +A+ +GD A + +++ KAKAVY++ LWNGSYFNYD+ S+S SIQADQLA
Sbjct: 200 LQAAATMAHRLGDRHFAEKYKLKFIKAKAVYEAKLWNGSYFNYDSGTSSNSRSIQADQLA 259
Query: 724 GQWYARACGLLPIADEAKVKKALTKIYDFNVLKVKGGMCGAMNGMQPDGRIDMSGLQARE 783
GQWYA + GL P+ DE K++ AL KI++FNV+KVKGG GA+NGM P G++D + +Q+RE
Sbjct: 260 GQWYAASSGLPPLFDEHKIRTALQKIFEFNVMKVKGGRMGAVNGMTPKGKVDETCMQSRE 319
Query: 784 IWPGVTYGLAASMIQEEMVDMAFQTAAGVYEVAWSEDGLGYSF 826
IW GVTY +AA+M+ M F TA G++ WSE+G GY+
Sbjct: 320 IWTGVTYAVAANMLLHGMEHQGFTTAEGIFTAGWSEEGYGYAL 362
>gi|425460273|ref|ZP_18839755.1| Similar to tr|P73619|P73619 [Microcystis aeruginosa PCC 9808]
gi|389827051|emb|CCI21967.1| Similar to tr|P73619|P73619 [Microcystis aeruginosa PCC 9808]
Length = 784
Score = 422 bits (1086), Expect = e-115, Method: Compositional matrix adjust.
Identities = 265/803 (33%), Positives = 400/803 (49%), Gaps = 93/803 (11%)
Query: 87 QGIPLGGIGAGSIGRSYRGEFQRFKLFHGICDDAPVLANQFSVFVSRPNGEKFSSVLCPR 146
G+PLGG GAG IGRS +G+F + L G + A QFS+F +G + +
Sbjct: 41 HGMPLGGFGAGCIGRSPKGDFNLWHLDGGEHSFNSLAACQFSIFEQTEDGNAQAYAMATE 100
Query: 147 SPGVPKKNTDSGIESWDWNLKGENCTYHALFPRAWTVYDGEPDPELRIVCRQISPFIPHN 206
SP TD + W W E TY AL+PR+W Y+ + + +++C Q SP HN
Sbjct: 101 SP------TDGTLSRWSW-YPAEKGTYSALYPRSW--YEYKDVFQTQLICEQFSPVWAHN 151
Query: 207 YKESSFPASVFTFTLSNSGQTSADVTLLFTWANSVA-------------GDSGLSGHHFN 253
Y+ESS+P ++F +T N ++++ TW N++ D G + +
Sbjct: 152 YQESSYPIALFEWTAHNPTDKPITISIMLTWQNTIGWFTNAIKSPVVKVRDDGSPVYEYQ 211
Query: 254 SKTMTKDGVH----------GLTLHHRTAN-GRPPVTFAVAAEETADVHVSECPCFLLSG 302
SK +G + G + N G A+A+ + V F L
Sbjct: 212 SKWGESEGNYNQWIVDNFRVGCLMRRDYENPGEGDGQMAIASITNPSLEV-----FYLGK 266
Query: 303 NSKGITAKDMWNEIKKHGSFDHLDNDKTSPSEPGSSIGAAIAASLTIPSGSTRSVTFSLA 362
+ ++W+ +GS ++++ +P+E G A+A TI G TR + F LA
Sbjct: 267 WNPAGDGGEVWDYFAMNGSLPDIEDE--TPAEKGEQTATAMAIRFTIRPGKTRKIPFILA 324
Query: 363 WDCPEVKFFEKV-YHRRYTKFYGTLGDSAARIAHDAILEHAKWECEIEAWQRPILEDKRF 421
WD P +F + + Y+RRYT F+G G +A + A+ + W I WQ+PILE
Sbjct: 325 WDLPVTEFAQGINYYRRYTDFFGRNGQNAWAMVRTALKQGDTWRENIINWQKPILERSDL 384
Query: 422 PEWYPITLFNELYYLNAGGTIWTDGSPPMQSLATIRERKFSLDTPSKIHTPASSDTALGT 481
+W+ + LFNELY L GGT+WT +
Sbjct: 385 ADWFKMALFNELYLLTDGGTLWTAATE--------------------------------- 411
Query: 482 RLLENGEENIGQFLYLEGAEYVMYNTYDVHFYSSFALVMLFPKLELSIQRDFAAAVMMHD 541
++ IGQF LE +Y Y + DV Y SF L+ML+P+LE ++ FA A+ D
Sbjct: 412 ------QDPIGQFGVLECIDYRWYESLDVRLYGSFGLLMLWPRLEKAVMEAFARAIPSSD 465
Query: 542 PGTMKIMSDGKWVARKCLGAVPHDIGL--DDPWFEINSYNLFNSSRWKDLNSKFVLQVYR 599
T I + RK A PHD+G + PW + N + + + WKDL S FVL VYR
Sbjct: 466 DTTRIIGYNQASAIRKAKNATPHDLGAPNEHPWQKSNYTSYQDCNLWKDLGSDFVLLVYR 525
Query: 600 DFVATG--DKNFARAVWPSVYIAMAYMEQFDKDGDGMIENEGFPDQTYDAWSANGVSAYC 657
D++ TG D++F R W S+ + + Y++ FD D DG+ EN G PDQT+D W G+SAYC
Sbjct: 526 DYLLTGAKDEDFLRECWSSIVLTLQYLKTFDLDNDGIPENSGAPDQTFDDWRLQGISAYC 585
Query: 658 GGLWVAALQAASALANDVGDHASASYFWVRYQKAKAVY-DSLWNGSYFNYDNSDGSSSTS 716
GGLW+AAL+A + +G+ + W+ Q+++A+Y +LWNG Y+N D+ GS S
Sbjct: 586 GGLWIAALEATLKIGAILGEDTAIFASWL--QQSRAIYHQTLWNGEYYNLDS--GSGSDV 641
Query: 717 IQADQLAGQWYARACGLLPIADEAKVKKALTKIYDFNVLKVKGGMCGAMNGMQPDGRIDM 776
+ DQL GQ+YAR L + + + AL+KIY+ LK GG GA NG++PDG +
Sbjct: 642 VMTDQLCGQFYARLLALPDVVENQYTQSALSKIYEACFLKFHGGKFGAANGLKPDGTPEN 701
Query: 777 -SGLQAREIWPGVTYGLAASMIQEEMVDMAFQTAAGVYEVAWSEDGLGYSFQTPESWNNN 835
E+W G+ +GL A ++Q M +Q V + E+GL F+TPE+ +
Sbjct: 702 PEATHPLEVWVGINFGLVAFLLQMGMEKQGWQITEAVVRQIY-ENGL--QFRTPEAITSV 758
Query: 836 DEYRSLCYMRPLTIWAMQWALTK 858
+R+ Y+R + IWA+ LT+
Sbjct: 759 GTFRACHYLRAMAIWAIYGVLTE 781
>gi|37522029|ref|NP_925406.1| hypothetical protein gll2460 [Gloeobacter violaceus PCC 7421]
gi|35213028|dbj|BAC90401.1| gll2460 [Gloeobacter violaceus PCC 7421]
Length = 867
Score = 422 bits (1086), Expect = e-115, Method: Compositional matrix adjust.
Identities = 284/819 (34%), Positives = 404/819 (49%), Gaps = 99/819 (12%)
Query: 87 QGIPLGGIGAGSIGRSYRGEFQRFKLFHG--ICDDAPVLANQFSVFVSRPNGEKFSSVLC 144
G+PLGG GAG IGRS+RG+F + L G PV QF++F + + VLC
Sbjct: 79 HGMPLGGFGAGCIGRSHRGDFTLWHLDSGDHYFRSQPVC--QFALFEQSEGAKPRAKVLC 136
Query: 145 PRSPGVPKKNTDSGIESWDWNLKGENCTYHALFPRAWTVYDGEPDPELRIVCRQISPFIP 204
P D + +WDW TY AL+PR+W Y+G + +++ +Q SP IP
Sbjct: 137 TEKPA------DGSLGAWDWTYPKGAGTYAALYPRSWFTYEGFFNSHVQL--KQFSPIIP 188
Query: 205 HNYKESSFPASVFTFTLSNSGQTSADVTLLFTWANSVAGDS------------------- 245
+NYK++S+P +VF +T N +++L +W N V +
Sbjct: 189 NNYKQTSYPVAVFEWTAFNPTDQPVTLSILVSWENMVGWFTNQFKSRAVKVRDDDTPVYD 248
Query: 246 -----GLSGHHFNS-------KTMTKDGVHGLTLHHRTA------NGRPPVT------FA 281
G S +FNS + +T G T + A + RP VT F
Sbjct: 249 YEPYWGESTGNFNSAREENAYRGLTMSRASGYTEQQKPAPATAAVSERPFVTAEGDGQFV 308
Query: 282 VAAEETADVHVSECPCFLLSGNSKGITAKDMWNEIKKHGSFDHLDNDKTSPSEPGSSIGA 341
+A DV ++ F +GN ++W G L + P+ G I A
Sbjct: 309 IATTREPDVEITYHTRFDPTGN-----GLEVWQPFATTGRLGDLVS--VRPAAKGERIAA 361
Query: 342 AIAASLTIPSGSTRSVTFSLAWDCPEVKFFEKVY-HRRYTKFYGTLGDSAARIAHDAILE 400
A+A LT+ G R V +LAWD P ++F + V +R YTK + G +A +IA +A+
Sbjct: 362 AMAVRLTLKPGELRVVPMALAWDLPIMEFAQGVRRYRYYTKHFDKSGRNAWKIATEALDN 421
Query: 401 HAKWECEIEAWQRPILEDKRFPEWYPITLFNELYYLNAGGTIWTDGSPPMQSLATIRERK 460
+A W+ +I+ WQ P+L D P+WY LFNELYYL +GGT+W +G
Sbjct: 422 YADWDRQIQEWQNPVLADPSLPDWYKQALFNELYYLTSGGTLWENGP------------- 468
Query: 461 FSLDTPSKIHTPASSDTALGTRLLENGEENIGQFLYLEGAEYVMYNTYDVHFYSSFALVM 520
P KI + A + G+F LE +Y Y++ DV Y SFAL++
Sbjct: 469 ----VPEKITSLAD---------IAKAPPGQGRFAVLESIDYRWYDSLDVRLYGSFALLI 515
Query: 521 LFPKLELSIQRDFAAAVMMHDPGTMKIMSDGKWVARKCLGAVPHDIGL--DDPWFEINSY 578
+P+++ ++ R +A AV D T I RK A+PHD+G +DPW + N
Sbjct: 516 NWPEIDKAVLRSYADAVPDEDDSTRIIGYTKTVAKRKSRDALPHDLGAPNEDPWIKNNYT 575
Query: 579 NLFNSSRWKDLNSKFVLQVYRDFVATG--DKNFARAVWPSVYIAMAYMEQ-FDKDGDGMI 635
+ + WKDL FVLQVYRD+V TG D F R WP+ A+ ++ FD+DGDG
Sbjct: 576 TYQDCNLWKDLPCDFVLQVYRDYVDTGKRDLGFVRYCWPASKAALVRLKTTFDQDGDGFP 635
Query: 636 ENEGFPDQTYDAWSANGVSAYCGGLWVAALQAASALANDVGDHASASYFWVRYQKAKAVY 695
EN G PD TYDAW G+SAYCGGLW+AAL AA + VGD A+ F Y +A+ +Y
Sbjct: 636 ENGGPPDCTYDAWPLKGISAYCGGLWLAALLAAVEMGKLVGDAAAVQQFGAWYAQAQPLY 695
Query: 696 D-SLWNGSYFNYDNSDGSSSTSIQADQLAGQWYARACGLLPIADEAKVKKALTKIYDFNV 754
+ +LWNG Y+ D S +S +I ADQL G++YA+ C L I E + + AL +Y
Sbjct: 696 EKTLWNGRYYRID-SQSKNSEAIMADQLCGEYYAQVCNLADIVPEPQARSALETVYQTCF 754
Query: 755 LKVKGGMCGAMNGMQPDGRIDMSGLQAREIWPGVTYGLAASMIQEEMVDMAFQTAAGVYE 814
+K GG G NG DG E+W G+ +GLAA MI+ M A V
Sbjct: 755 VKFYGGRFGCANGTNADGSFIGDTEHPSEVWTGINFGLAAFMIRNGMRREGMAIAEAVVA 814
Query: 815 VAWSEDGLGYSFQTPESWNNNDEYRSLCYMRPLTIWAMQ 853
+ G G F+TPE+ +R+ Y+RP+ IWAMQ
Sbjct: 815 NVY---GGGLQFRTPEALTPARTFRACMYLRPMAIWAMQ 850
>gi|257060089|ref|YP_003137977.1| glucosylceramidase [Cyanothece sp. PCC 8802]
gi|256590255|gb|ACV01142.1| Glucosylceramidase [Cyanothece sp. PCC 8802]
Length = 805
Score = 422 bits (1085), Expect = e-115, Method: Compositional matrix adjust.
Identities = 275/826 (33%), Positives = 403/826 (48%), Gaps = 118/826 (14%)
Query: 86 DQGIPLGGIGAGSIGRSYRGEFQRFKLFHGICDDAPVLANQFSVFVSRPNGEKFSSVLCP 145
+ G+PLGG G+G IGRS G+F + L G + + QFSVF +P GE
Sbjct: 40 NHGMPLGGFGSGCIGRSPSGDFNLWHLDGGEHVFRSIPSCQFSVF-EQPEGEDAQVYALS 98
Query: 146 RSPGVPKKNTDSGIESWDWNLKGENCTYHALFPRAWTVYDGEPDPELRIVCRQISPFIPH 205
P P++ T + W W E TYHAL+PR+W Y+G +L +C Q SP IP+
Sbjct: 99 TQP--PEEGT---LSRWSW-YPPEKGTYHALYPRSWYQYEGVFKSQL--ICEQFSPIIPN 150
Query: 206 NYKESSFPASVFTFTLSNSGQTSADVTLLFTWANSVA-------------GDSGLSGHHF 252
NY+E+S+P +F +T N ++++ TW N V D G + +
Sbjct: 151 NYQETSYPIGIFEWTAHNPTDKPITLSIMVTWQNIVGWFTNAIKSPEITVRDDGSPEYQY 210
Query: 253 NSKTMTKDG---------------VHGLTLHHRTANGRPPVTFAVAAEETADVHVSECPC 297
+ G ++ + LH + G + A + +V
Sbjct: 211 QPRWGDSTGNFNQWVQDNFRVGFILNRIQLHDQIQEGEGQICIASVTNPSVEVF------ 264
Query: 298 FLLSGNSKGITAKDMWNEIKKHGSFDHLDNDKTSPSEPGSSIGAAIAASLTIPSGSTRSV 357
+L N KG ++W+ +GS D + +P+EPG I A+A TI G T+ +
Sbjct: 265 YLGKWNPKG-DGSEVWDYFAMNGSLP--DTEDETPAEPGEQIAVAMAIRFTIRPGRTKKI 321
Query: 358 TFSLAWDCPEVKFFEKV-YHRRYTKFYGTLGDSAARIAHDAILEHAKWECEIEAWQRPIL 416
F LAWD P +F V Y+RRYT FYG G + + A+ W +E WQ PIL
Sbjct: 322 PFILAWDFPVTEFAPGVQYYRRYTDFYGRNGKNGWAMVRTALKHSDVWYERVEEWQSPIL 381
Query: 417 EDKRFPEWYPITLFNELYYLNAGGTIWTDGSPPMQSLATIRERKFSLDTPSKIHTPASSD 476
+ + P+W+ + LFNELY L GGT+WT S
Sbjct: 382 KREDLPDWFKMALFNELYLLTDGGTLWTAAS----------------------------- 412
Query: 477 TALGTRLLENGEENIGQFLYLEGAEYVMYNTYDVHFYSSFALVMLFPKLELSIQRDFAAA 536
EN + +GQF LE +Y Y + DV Y SF L+ML+P+LE S+ FA A
Sbjct: 413 --------EN--DPVGQFGVLECLDYRWYESLDVRLYGSFGLMMLWPRLEKSVLEAFARA 462
Query: 537 VMMHDPGTMKIMSDGKWVARKCLGAVPHDIGL--DDPWFEINSYNLFNSSRWKDLNSKFV 594
+ HD I + RK GA PHD+G + PW + N + + + WKDL S FV
Sbjct: 463 IPTHDDTPRIIGYNQAKAIRKAKGATPHDLGAPNEHPWQKTNYTSYQDCNLWKDLGSDFV 522
Query: 595 LQVYRDFVATG--DKNFARAVWPSVYIAMAYMEQFDKDGDGMIENEGFPDQTYDAWSANG 652
LQVYRD++ TG D +F WP++ + Y++ FD D DG+ EN G PDQT+D W G
Sbjct: 523 LQVYRDYLLTGSDDTDFLWECWPAITETLDYLKTFDLDNDGIPENSGAPDQTFDDWKLQG 582
Query: 653 VSAYCGGLWVAALQAASALA-------------------NDVGDHASASYFWVRYQKAKA 693
+SAYCGGLW+AAL+AA +A + + W+ +++++
Sbjct: 583 ISAYCGGLWIAALEAAIKIAEILLKNVPTTEELQSRNNPESIKHYVKNHRDWL--EQSRS 640
Query: 694 VY-DSLWNGSYFNYDNSDGSSSTSIQADQLAGQWYARACGLLPIADEAKVKKALTKIYDF 752
+Y D+LWNG Y+ D+ GS + ADQL GQ+YAR L + + + AL K+Y+
Sbjct: 641 IYHDTLWNGEYYKLDSQSGSD--VVMADQLCGQFYARLLNLPDVVETQYTESALNKVYEA 698
Query: 753 NVLKVKGGMCGAMNGMQPDGRI-DMSGLQAREIWPGVTYGLAASMIQEEMVDMAFQTAAG 811
LK + G GA NGM+PDG D + +E+W G+ +GLAA ++Q D AF+
Sbjct: 699 CFLKFQDGKYGAANGMKPDGTPEDPNSTHPQEVWTGINFGLAAFLLQMGRKDAAFKLTEA 758
Query: 812 VYEVAWSEDGLGYSFQTPESWNNNDEYRSLCYMRPLTIWAMQWALT 857
V + + E+GL F+TPE+ +R+ Y+R + IW + LT
Sbjct: 759 VVKQVY-ENGL--QFRTPEAITAVGTFRASHYLRAMAIWGIYGILT 801
>gi|425455967|ref|ZP_18835678.1| Similar to tr|P73619|P73619 [Microcystis aeruginosa PCC 9807]
gi|389803038|emb|CCI17984.1| Similar to tr|P73619|P73619 [Microcystis aeruginosa PCC 9807]
Length = 784
Score = 422 bits (1084), Expect = e-115, Method: Compositional matrix adjust.
Identities = 264/803 (32%), Positives = 400/803 (49%), Gaps = 93/803 (11%)
Query: 87 QGIPLGGIGAGSIGRSYRGEFQRFKLFHGICDDAPVLANQFSVFVSRPNGEKFSSVLCPR 146
G+PLGG GAG IGRS +G+F + L G + A QFS+F +G + +
Sbjct: 41 HGMPLGGFGAGCIGRSPKGDFNLWHLDGGEHSFNSLAACQFSIFEQTEDGSAQAYAMATE 100
Query: 147 SPGVPKKNTDSGIESWDWNLKGENCTYHALFPRAWTVYDGEPDPELRIVCRQISPFIPHN 206
SP D + W W E TY AL+PR+W Y+ + + +++C Q SP HN
Sbjct: 101 SP------KDGTLSRWSW-YPAEKGTYSALYPRSW--YEYKDVFQTQLICEQFSPVWAHN 151
Query: 207 YKESSFPASVFTFTLSNSGQTSADVTLLFTWANSVA-------------GDSGLSGHHFN 253
Y+ESS+P ++F +T+ N ++++ TW N++ D G + +
Sbjct: 152 YQESSYPIALFEWTVHNPTDKPITISIMLTWQNTIGWFTNAIKSPVVKVRDDGSPVYEYQ 211
Query: 254 SKTMTKDGVH----------GLTLHHRTAN-GRPPVTFAVAAEETADVHVSECPCFLLSG 302
SK +G + G + N G A+A+ + V F L
Sbjct: 212 SKWGESEGNYNQWIVDNFRVGCLMRRDYENPGEGDGQMAIASITNPSLEV-----FYLGK 266
Query: 303 NSKGITAKDMWNEIKKHGSFDHLDNDKTSPSEPGSSIGAAIAASLTIPSGSTRSVTFSLA 362
+ ++W+ +GS ++++ +P++ G A+A TI G TR + F LA
Sbjct: 267 WNPAGDGSEVWDYFAMNGSLPDIEDE--TPAQKGEQTATAMAIRFTIRPGKTRKIPFILA 324
Query: 363 WDCPEVKFFEKV-YHRRYTKFYGTLGDSAARIAHDAILEHAKWECEIEAWQRPILEDKRF 421
WD P +F + + Y+RRYT F+G G +A + A+ W I WQ+PILE
Sbjct: 325 WDLPVTEFAQGINYYRRYTDFFGRNGQNAWAMVRTALKHGDTWRENIINWQKPILERTDL 384
Query: 422 PEWYPITLFNELYYLNAGGTIWTDGSPPMQSLATIRERKFSLDTPSKIHTPASSDTALGT 481
+W+ + LFNELY L GGT+WT +
Sbjct: 385 ADWFKMALFNELYLLADGGTLWTAATE--------------------------------- 411
Query: 482 RLLENGEENIGQFLYLEGAEYVMYNTYDVHFYSSFALVMLFPKLELSIQRDFAAAVMMHD 541
++ IGQF LE +Y Y + DV Y SF L+ML+P+LE ++ FA A+ D
Sbjct: 412 ------QDPIGQFGVLECIDYRWYESLDVRLYGSFGLLMLWPRLEKAVMEAFARAIPSSD 465
Query: 542 PGTMKIMSDGKWVARKCLGAVPHDIGL--DDPWFEINSYNLFNSSRWKDLNSKFVLQVYR 599
I + RK A PHD+G + PW + N + + + WKDL S FVL VYR
Sbjct: 466 DTPRIIGYNQASAIRKAKNATPHDLGAPNEHPWQKSNYTSYQDCNLWKDLGSDFVLLVYR 525
Query: 600 DFVATG--DKNFARAVWPSVYIAMAYMEQFDKDGDGMIENEGFPDQTYDAWSANGVSAYC 657
D++ TG D++F R WPS+ + + Y++ FD D DG+ EN G PDQT+D W G+SAYC
Sbjct: 526 DYLLTGAKDEDFLRECWPSIVLTLQYLKTFDLDNDGIPENSGAPDQTFDDWRLQGISAYC 585
Query: 658 GGLWVAALQAASALANDVGDHASASYFWVRYQKAKAVY-DSLWNGSYFNYDNSDGSSSTS 716
GGLW+AAL+A + +G+ + W+ Q+++A+Y D+LWNG Y+N D+ GS S
Sbjct: 586 GGLWIAALEATLKIGAILGEDTAIFASWL--QQSRAIYHDTLWNGEYYNLDS--GSGSDV 641
Query: 717 IQADQLAGQWYARACGLLPIADEAKVKKALTKIYDFNVLKVKGGMCGAMNGMQPDGRIDM 776
+ DQL GQ+YAR L + + + AL+KIY+ LK GG GA NG++PDG +
Sbjct: 642 VMTDQLCGQFYARLLSLPDVVENQYTQSALSKIYEACFLKFHGGKFGAANGLKPDGTPEN 701
Query: 777 -SGLQAREIWPGVTYGLAASMIQEEMVDMAFQTAAGVYEVAWSEDGLGYSFQTPESWNNN 835
E+W G+ +GL A ++Q M +Q V + E+GL F+TPE+ +
Sbjct: 702 PEATHPLEVWVGINFGLVAFLLQMGMEKQGWQITEAVVRQIY-ENGL--QFRTPEAITSV 758
Query: 836 DEYRSLCYMRPLTIWAMQWALTK 858
+R+ Y+R + IWA+ LT+
Sbjct: 759 GTFRACHYLRAMAIWAIYGVLTE 781
>gi|218247014|ref|YP_002372385.1| hypothetical protein PCC8801_2201 [Cyanothece sp. PCC 8801]
gi|218167492|gb|ACK66229.1| protein of unknown function DUF608 [Cyanothece sp. PCC 8801]
Length = 805
Score = 420 bits (1079), Expect = e-114, Method: Compositional matrix adjust.
Identities = 274/826 (33%), Positives = 401/826 (48%), Gaps = 118/826 (14%)
Query: 86 DQGIPLGGIGAGSIGRSYRGEFQRFKLFHGICDDAPVLANQFSVFVSRPNGEKFSSVLCP 145
+ G+PLGG G+G IGRS G+F + L G + + QFSVF +P GE
Sbjct: 40 NHGMPLGGFGSGCIGRSPSGDFNLWHLDGGEHVFRSIPSCQFSVF-EQPEGEDAQVYALS 98
Query: 146 RSPGVPKKNTDSGIESWDWNLKGENCTYHALFPRAWTVYDGEPDPELRIVCRQISPFIPH 205
P P++ T + W W E TYHAL+PR+W Y+G +L +C Q SP IP+
Sbjct: 99 TQP--PEEGT---LSRWSW-YPPEKGTYHALYPRSWYQYEGVFKSQL--ICEQFSPIIPN 150
Query: 206 NYKESSFPASVFTFTLSNSGQTSADVTLLFTWANSVA-------------GDSGLSGHHF 252
NY+E+S+P +F +T N ++++ TW N V D G + +
Sbjct: 151 NYQETSYPIGIFEWTAHNPTDKPITLSIMVTWQNIVGWFTNAIKSPEITVRDDGSPEYQY 210
Query: 253 NSKTMTKDG---------------VHGLTLHHRTANGRPPVTFAVAAEETADVHVSECPC 297
+ G ++ + LH G + A + +V
Sbjct: 211 QPRWGDSTGNFNQWVQDNFRVGFILNRIQLHDEIQEGEGQICIASVTNPSVEVF------ 264
Query: 298 FLLSGNSKGITAKDMWNEIKKHGSFDHLDNDKTSPSEPGSSIGAAIAASLTIPSGSTRSV 357
+L N KG ++W+ +GS D + +P+EPG I A+A TI G T+ +
Sbjct: 265 YLGKWNPKG-DGSEVWDYFAMNGSLP--DTEDETPAEPGEQIAVAMAIRFTIRPGRTKKI 321
Query: 358 TFSLAWDCPEVKFFEKV-YHRRYTKFYGTLGDSAARIAHDAILEHAKWECEIEAWQRPIL 416
F LAWD P +F V Y+RRYT FYG G + + A+ W +E WQ PIL
Sbjct: 322 PFILAWDFPVTEFAPGVQYYRRYTDFYGRNGKNGWAMVRTALKHSDVWYERVEEWQSPIL 381
Query: 417 EDKRFPEWYPITLFNELYYLNAGGTIWTDGSPPMQSLATIRERKFSLDTPSKIHTPASSD 476
+ + P+W+ + LFNELY L GGT+WT S
Sbjct: 382 KREDLPDWFKMALFNELYLLTDGGTLWTAAS----------------------------- 412
Query: 477 TALGTRLLENGEENIGQFLYLEGAEYVMYNTYDVHFYSSFALVMLFPKLELSIQRDFAAA 536
EN + +GQF LE +Y Y + DV Y SF L+ML+P+LE S+ FA A
Sbjct: 413 --------EN--DPVGQFGVLECLDYRWYESLDVRLYGSFGLMMLWPRLEKSVLEAFARA 462
Query: 537 VMMHDPGTMKIMSDGKWVARKCLGAVPHDIGL--DDPWFEINSYNLFNSSRWKDLNSKFV 594
+ HD I + RK GA PHD+G + PW + N + + + WKDL S FV
Sbjct: 463 IPTHDDTPRIIGYNQAKAIRKAKGATPHDLGAPNEHPWQKTNYTSYQDCNLWKDLGSDFV 522
Query: 595 LQVYRDFVATG--DKNFARAVWPSVYIAMAYMEQFDKDGDGMIENEGFPDQTYDAWSANG 652
LQVYRD++ TG D +F WP++ + Y++ FD D DG+ EN G PDQT+D W G
Sbjct: 523 LQVYRDYLLTGSDDTDFLWECWPAITETLDYLKTFDLDNDGIPENSGAPDQTFDDWKLQG 582
Query: 653 VSAYCGGLWVAALQAASALA-------------------NDVGDHASASYFWVRYQKAKA 693
+SAYCGGLW+AAL+AA +A + + W+ +++++
Sbjct: 583 ISAYCGGLWIAALEAAIKIAEILLKNVPTTEELQSRNNPESIKHYVKNHRDWL--EQSRS 640
Query: 694 VY-DSLWNGSYFNYDNSDGSSSTSIQADQLAGQWYARACGLLPIADEAKVKKALTKIYDF 752
+Y D+LWNG Y+ D+ GS + ADQL GQ+YAR + + + AL K+Y+
Sbjct: 641 IYHDTLWNGEYYKLDSQSGSD--VVMADQLCGQFYARLLNFPDVVETQYTESALNKVYEA 698
Query: 753 NVLKVKGGMCGAMNGMQPDGRI-DMSGLQAREIWPGVTYGLAASMIQEEMVDMAFQTAAG 811
LK + G GA NGM+PDG D + +E+W G+ +GLAA ++Q D AF+
Sbjct: 699 CFLKFQDGKYGAANGMKPDGTPEDPNSTHPQEVWTGINFGLAAFLLQMGRKDAAFKLTEA 758
Query: 812 VYEVAWSEDGLGYSFQTPESWNNNDEYRSLCYMRPLTIWAMQWALT 857
V + + E+GL F+TPE+ +R+ Y+R + IW + LT
Sbjct: 759 VVKQVY-ENGL--QFRTPEAITAVGTFRASHYLRAMAIWGIYGILT 801
>gi|425450263|ref|ZP_18830094.1| Similar to tr|P73619|P73619 [Microcystis aeruginosa PCC 7941]
gi|389768962|emb|CCI06046.1| Similar to tr|P73619|P73619 [Microcystis aeruginosa PCC 7941]
Length = 784
Score = 420 bits (1079), Expect = e-114, Method: Compositional matrix adjust.
Identities = 264/803 (32%), Positives = 398/803 (49%), Gaps = 93/803 (11%)
Query: 87 QGIPLGGIGAGSIGRSYRGEFQRFKLFHGICDDAPVLANQFSVFVSRPNGEKFSSVLCPR 146
G+PLGG GAG IGRS +G+F + L G + A QFS+F +G + +
Sbjct: 41 HGMPLGGFGAGCIGRSPKGDFNLWHLDGGEHSFNSLPACQFSIFEQTEDGSAQAYAMATE 100
Query: 147 SPGVPKKNTDSGIESWDWNLKGENCTYHALFPRAWTVYDGEPDPELRIVCRQISPFIPHN 206
SP TD + W W E Y AL+PR+W Y+ + + +++C Q SP HN
Sbjct: 101 SP------TDGTLSRWSW-YPAEKGNYGALYPRSW--YEYKDVFQTQLICEQFSPVWAHN 151
Query: 207 YKESSFPASVFTFTLSNSGQTSADVTLLFTWANSVA-------------GDSGLSGHHFN 253
Y+ESS+P ++F +T N ++++ TW N++ D G + +
Sbjct: 152 YQESSYPIALFEWTAHNPTDKPITISIMLTWQNTIGWFTNAIKSPVVKVRDDGSPVYEYQ 211
Query: 254 SKTMTKDGVH----------GLTLHHRTAN-GRPPVTFAVAAEETADVHVSECPCFLLSG 302
SK +G + G + N G A+A+ + V F L
Sbjct: 212 SKWGESEGNYNQWIVDNFRVGCLMRRDYENPGEGDGQMAIASITNPSLEV-----FYLGK 266
Query: 303 NSKGITAKDMWNEIKKHGSFDHLDNDKTSPSEPGSSIGAAIAASLTIPSGSTRSVTFSLA 362
+ ++W+ +GS +D++ +P++ G A+A TI G TR + F LA
Sbjct: 267 WNPAGDGGEVWDYFAMNGSLPDIDDE--TPAQKGEQTATAMAIRFTIRPGKTRKIPFILA 324
Query: 363 WDCPEVKFFEKV-YHRRYTKFYGTLGDSAARIAHDAILEHAKWECEIEAWQRPILEDKRF 421
WD P +F + + Y+RRYT F+G G +A + A+ + W I WQ+PILE
Sbjct: 325 WDLPVTEFAQGINYYRRYTDFFGRNGQNAWAMVRTALKQGDTWRENIINWQKPILERSDL 384
Query: 422 PEWYPITLFNELYYLNAGGTIWTDGSPPMQSLATIRERKFSLDTPSKIHTPASSDTALGT 481
+W+ + LFNELY L GGT+WT +
Sbjct: 385 ADWFKMALFNELYLLTDGGTLWTAATE--------------------------------- 411
Query: 482 RLLENGEENIGQFLYLEGAEYVMYNTYDVHFYSSFALVMLFPKLELSIQRDFAAAVMMHD 541
++ IGQF LE +Y Y + DV Y SF L+ML+P+LE ++ FA A+ D
Sbjct: 412 ------QDPIGQFGVLECIDYRWYESLDVRLYGSFGLLMLWPRLEKAVMEAFARAIPSSD 465
Query: 542 PGTMKIMSDGKWVARKCLGAVPHDIGL--DDPWFEINSYNLFNSSRWKDLNSKFVLQVYR 599
I + RK A PHD+G + PW + N + + + WKDL S FVL VYR
Sbjct: 466 DTPRIIGYNQASAIRKAKNATPHDLGAPNEHPWQKSNYTSYQDCNLWKDLGSDFVLLVYR 525
Query: 600 DFVATG--DKNFARAVWPSVYIAMAYMEQFDKDGDGMIENEGFPDQTYDAWSANGVSAYC 657
D++ TG D++F R WPS+ + + Y++ FD D DG+ EN G PDQT+D W G+SAYC
Sbjct: 526 DYLLTGAKDEDFLRECWPSIVLTLQYLKTFDLDNDGIPENSGAPDQTFDDWRLQGISAYC 585
Query: 658 GGLWVAALQAASALANDVGDHASASYFWVRYQKAKAVY-DSLWNGSYFNYDNSDGSSSTS 716
GGLW+AAL+A + +G+ + W+ Q+++A+Y +LWNG Y+N D+ GS S
Sbjct: 586 GGLWIAALEATIKIGAILGEDTAIFASWL--QQSRAIYHQTLWNGEYYNLDS--GSGSDV 641
Query: 717 IQADQLAGQWYARACGLLPIADEAKVKKALTKIYDFNVLKVKGGMCGAMNGMQPDGRIDM 776
+ DQL GQ+YAR L + + + AL+KIY LK GG GA NG++PDG +
Sbjct: 642 VMTDQLCGQFYARLLALPDVVENQYTQSALSKIYQACFLKFHGGKFGAANGLKPDGTPEN 701
Query: 777 -SGLQAREIWPGVTYGLAASMIQEEMVDMAFQTAAGVYEVAWSEDGLGYSFQTPESWNNN 835
E+W G+ +GL A ++Q M +Q V + E+GL F+TPE+ +
Sbjct: 702 PEATHPLEVWVGINFGLVAFLLQMGMEKQGWQITQAVVRQIY-ENGL--QFRTPEAITSV 758
Query: 836 DEYRSLCYMRPLTIWAMQWALTK 858
+R+ Y+R + IWA+ LT+
Sbjct: 759 GTFRACHYLRAMAIWAIYGVLTE 781
>gi|422303239|ref|ZP_16390593.1| Similar to tr|P73619|P73619 [Microcystis aeruginosa PCC 9806]
gi|389791807|emb|CCI12404.1| Similar to tr|P73619|P73619 [Microcystis aeruginosa PCC 9806]
Length = 784
Score = 419 bits (1078), Expect = e-114, Method: Compositional matrix adjust.
Identities = 264/803 (32%), Positives = 397/803 (49%), Gaps = 93/803 (11%)
Query: 87 QGIPLGGIGAGSIGRSYRGEFQRFKLFHGICDDAPVLANQFSVFVSRPNGEKFSSVLCPR 146
G+PLGG GAG IGRS +G+F + L G + A QFS+F +G + +
Sbjct: 41 HGMPLGGFGAGCIGRSPKGDFNLWHLDGGEHSFNSLAACQFSIFEQTEDGSAQAYAMATE 100
Query: 147 SPGVPKKNTDSGIESWDWNLKGENCTYHALFPRAWTVYDGEPDPELRIVCRQISPFIPHN 206
SP TD + W W E Y AL+PR+W Y+ + + +++C Q SP HN
Sbjct: 101 SP------TDGTLSRWSW-YPAEKGNYSALYPRSW--YEYKDVFQTQLICEQFSPVWAHN 151
Query: 207 YKESSFPASVFTFTLSNSGQTSADVTLLFTWANSVA-------------GDSGLSGHHFN 253
Y+ESS+P ++F +T N ++++ TW N++ D G + +
Sbjct: 152 YQESSYPIALFEWTAHNPTDKPITISIMLTWQNTIGWFTNAIKSPVVKVRDDGSPVYEYQ 211
Query: 254 SKTMTKDGVH----------GLTLHHRTAN-GRPPVTFAVAAEETADVHVSECPCFLLSG 302
SK +G + G + N G A+A+ + V F L
Sbjct: 212 SKWGESEGNYNQWIVDNFRVGCLMGRDYENPGEGDGQMAIASITNPSLEV-----FYLGK 266
Query: 303 NSKGITAKDMWNEIKKHGSFDHLDNDKTSPSEPGSSIGAAIAASLTIPSGSTRSVTFSLA 362
+ ++W+ +GS ++++ +P+E G A+A TI G TR + F LA
Sbjct: 267 WNPAGDGGEVWDYFAMNGSLPDIEDE--TPAEKGEQTATAMAIRFTIRPGKTRKIPFILA 324
Query: 363 WDCPEVKFFEKV-YHRRYTKFYGTLGDSAARIAHDAILEHAKWECEIEAWQRPILEDKRF 421
WD P +F + + Y+RRYT F+G G +A + A+ W I WQ+PILE
Sbjct: 325 WDLPVTEFAQGINYYRRYTDFFGRNGQNAWAMVRTALKHGDTWRENIINWQKPILERSDL 384
Query: 422 PEWYPITLFNELYYLNAGGTIWTDGSPPMQSLATIRERKFSLDTPSKIHTPASSDTALGT 481
+W+ + LFNELY L GGT+WT +
Sbjct: 385 ADWFKMALFNELYLLTDGGTLWTAATE--------------------------------- 411
Query: 482 RLLENGEENIGQFLYLEGAEYVMYNTYDVHFYSSFALVMLFPKLELSIQRDFAAAVMMHD 541
++ IGQF LE +Y Y + DV Y SF L+ML+P+LE ++ FA A+ D
Sbjct: 412 ------QDPIGQFGVLECIDYRWYESLDVRLYGSFGLLMLWPRLEKAVMEAFARAIPSSD 465
Query: 542 PGTMKIMSDGKWVARKCLGAVPHDIGL--DDPWFEINSYNLFNSSRWKDLNSKFVLQVYR 599
I + RK A PHD+G + PW + N + + + WKDL S FVL VYR
Sbjct: 466 DTPRIIGYNQASAIRKAKNATPHDLGAPNEHPWQKSNYTSYQDCNLWKDLGSDFVLLVYR 525
Query: 600 DFVATG--DKNFARAVWPSVYIAMAYMEQFDKDGDGMIENEGFPDQTYDAWSANGVSAYC 657
D++ TG D++F R WPS+ + + Y++ FD D DG+ EN G PDQT+D W G+SAYC
Sbjct: 526 DYLLTGAKDEDFLRECWPSIVLTLQYLKTFDLDNDGIPENSGAPDQTFDDWRLQGISAYC 585
Query: 658 GGLWVAALQAASALANDVGDHASASYFWVRYQKAKAVY-DSLWNGSYFNYDNSDGSSSTS 716
GGLW+AAL+A + +G+ + W+ Q+++A+Y +LWNG Y+N D+ GS S
Sbjct: 586 GGLWIAALEATIKIGEIIGEDTAIFTSWL--QQSRAIYHQTLWNGEYYNLDS--GSGSDV 641
Query: 717 IQADQLAGQWYARACGLLPIADEAKVKKALTKIYDFNVLKVKGGMCGAMNGMQPDGRIDM 776
+ DQL GQ+YAR L + + + AL+KIY LK GG GA NG++PDG +
Sbjct: 642 VMTDQLCGQFYARLLSLPDVVENQYTQSALSKIYQSCFLKFHGGKFGAANGLKPDGTPEN 701
Query: 777 -SGLQAREIWPGVTYGLAASMIQEEMVDMAFQTAAGVYEVAWSEDGLGYSFQTPESWNNN 835
E+W G+ +GL A ++Q M +Q V + E+GL F+TPE+ +
Sbjct: 702 PEATHPLEVWVGINFGLVAFLMQMGMEKQGWQITEAVVRQVY-ENGL--QFRTPEAITSV 758
Query: 836 DEYRSLCYMRPLTIWAMQWALTK 858
+R+ Y+R + IWA+ LT+
Sbjct: 759 GTFRACHYLRAMAIWAIYGVLTE 781
>gi|440753145|ref|ZP_20932348.1| hypothetical protein O53_1523 [Microcystis aeruginosa TAIHU98]
gi|440177638|gb|ELP56911.1| hypothetical protein O53_1523 [Microcystis aeruginosa TAIHU98]
Length = 789
Score = 418 bits (1075), Expect = e-114, Method: Compositional matrix adjust.
Identities = 262/797 (32%), Positives = 395/797 (49%), Gaps = 93/797 (11%)
Query: 87 QGIPLGGIGAGSIGRSYRGEFQRFKLFHGICDDAPVLANQFSVFVSRPNGEKFSSVLCPR 146
G+PLGG GAG IGRS +G+F + L G + A QFS+F +G + +
Sbjct: 41 HGMPLGGFGAGCIGRSPKGDFNLWHLDGGEHSFNSLAACQFSIFEQTEDGNAQAYAMATE 100
Query: 147 SPGVPKKNTDSGIESWDWNLKGENCTYHALFPRAWTVYDGEPDPELRIVCRQISPFIPHN 206
SP TD + W W E Y AL+PR+W Y+ + + +++C Q SP HN
Sbjct: 101 SP------TDGTLSRWSW-YPAEKGNYSALYPRSW--YEYKDVFQTQLICEQFSPVWAHN 151
Query: 207 YKESSFPASVFTFTLSNSGQTSADVTLLFTWANSVA-------------GDSGLSGHHFN 253
Y+ESS+P ++F +T N ++++ TW N++ D G + +
Sbjct: 152 YQESSYPIALFEWTAHNPTDKPITISIMLTWQNTIGWFTNAIKSPVVKVRDDGSPVYEYQ 211
Query: 254 SKTMTKDGVH----------GLTLHHRTAN-GRPPVTFAVAAEETADVHVSECPCFLLSG 302
SK +G + G + N G A+A+ + V F L
Sbjct: 212 SKWGESEGNYNQWIVDNFRVGCLMRRDYENPGEGDGQMAIASITNPSLEV-----FYLGK 266
Query: 303 NSKGITAKDMWNEIKKHGSFDHLDNDKTSPSEPGSSIGAAIAASLTIPSGSTRSVTFSLA 362
+ ++W+ +GS ++++ +P+E G A+A TI G TR + F LA
Sbjct: 267 WNPAGDGGEVWDYFAMNGSLPDIEDE--TPAEKGEQTATAMAIRFTIRPGKTRKIPFILA 324
Query: 363 WDCPEVKFFEKV-YHRRYTKFYGTLGDSAARIAHDAILEHAKWECEIEAWQRPILEDKRF 421
WD P +F + + Y+RRYT F+G G +A + A+ W I WQ+PILE
Sbjct: 325 WDLPVTEFAQGINYYRRYTDFFGRNGQNAWAMVRTALKHGDTWRENIINWQKPILERSDL 384
Query: 422 PEWYPITLFNELYYLNAGGTIWTDGSPPMQSLATIRERKFSLDTPSKIHTPASSDTALGT 481
+W+ + LFNELY L GGT+WT +
Sbjct: 385 ADWFKMALFNELYLLTDGGTLWTAATE--------------------------------- 411
Query: 482 RLLENGEENIGQFLYLEGAEYVMYNTYDVHFYSSFALVMLFPKLELSIQRDFAAAVMMHD 541
++ IGQF LE +Y Y + DV Y SF L+ML+P+LE ++ FA A+ D
Sbjct: 412 ------QDAIGQFGVLECIDYRWYESLDVRLYGSFGLLMLWPRLEKAVMEAFARAIPSSD 465
Query: 542 PGTMKIMSDGKWVARKCLGAVPHDIGL--DDPWFEINSYNLFNSSRWKDLNSKFVLQVYR 599
I + RK A PHD+G + PW + N + + + WKDL S FVL VYR
Sbjct: 466 DTPRIIGYNQASAIRKAKNATPHDLGAPNEHPWQKSNYTSYQDCNLWKDLGSDFVLLVYR 525
Query: 600 DFVATG--DKNFARAVWPSVYIAMAYMEQFDKDGDGMIENEGFPDQTYDAWSANGVSAYC 657
D++ TG D++F R WPS+ + + Y++ FD D DG+ EN G PDQT+D W G+SAYC
Sbjct: 526 DYLLTGAKDEDFLRECWPSIVLTLQYLKTFDLDNDGIPENSGAPDQTFDDWRLQGISAYC 585
Query: 658 GGLWVAALQAASALANDVGDHASASYFWVRYQKAKAVY-DSLWNGSYFNYDNSDGSSSTS 716
GGLW+AAL+A + +G+ + W+ Q+++A+Y +LWNG Y+N D+ GS S
Sbjct: 586 GGLWIAALEATLKIGAILGEDTAIFASWL--QQSRAIYHQTLWNGEYYNLDS--GSGSDV 641
Query: 717 IQADQLAGQWYARACGLLPIADEAKVKKALTKIYDFNVLKVKGGMCGAMNGMQPDGRIDM 776
+ DQL GQ+YAR L + + + AL+KIY LK GG GA NG++PDG +
Sbjct: 642 VMTDQLCGQFYARLLALPDVVENQYTQSALSKIYQACFLKFHGGKFGAANGLKPDGTPEN 701
Query: 777 -SGLQAREIWPGVTYGLAASMIQEEMVDMAFQTAAGVYEVAWSEDGLGYSFQTPESWNNN 835
E+W G+ +GL A ++Q M +Q V + + E+GL F+TPE+ +
Sbjct: 702 PEATHPLEVWVGINFGLVAFLLQMGMEKQGWQITEAVVKQVY-ENGL--QFRTPEAITSV 758
Query: 836 DEYRSLCYMRPLTIWAM 852
+R+ Y+R + IWA+
Sbjct: 759 GTFRACHYLRAMAIWAI 775
>gi|425471589|ref|ZP_18850441.1| Glycosidase-like protein [Microcystis aeruginosa PCC 9701]
gi|389882505|emb|CCI37035.1| Glycosidase-like protein [Microcystis aeruginosa PCC 9701]
Length = 784
Score = 418 bits (1075), Expect = e-114, Method: Compositional matrix adjust.
Identities = 266/803 (33%), Positives = 396/803 (49%), Gaps = 93/803 (11%)
Query: 87 QGIPLGGIGAGSIGRSYRGEFQRFKLFHGICDDAPVLANQFSVFVSRPNGEKFSSVLCPR 146
G+PLGG GAG IGRS +G+F + L G + A QFS+F +G + +
Sbjct: 41 HGMPLGGFGAGCIGRSPKGDFNLWHLDGGEHSFNSLAACQFSIFEQTEDGSAQAYAMATE 100
Query: 147 SPGVPKKNTDSGIESWDWNLKGENCTYHALFPRAWTVYDGEPDPELRIVCRQISPFIPHN 206
P TD + W W E TY AL+PR+W Y+ + + +++C Q SP HN
Sbjct: 101 RP------TDGTLSRWSW-YPAEKGTYSALYPRSW--YEYKDVFQTQLICEQFSPVWAHN 151
Query: 207 YKESSFPASVFTFTLSNSGQTSADVTLLFTWANSVA-------------GDSGLSGHHFN 253
Y+ESS+P ++F +T N ++++ TW N++ D G + +
Sbjct: 152 YQESSYPIALFEWTAHNPTDKPITISIMLTWQNTIGWFTNAIKSPVVKVRDDGSPVYEYQ 211
Query: 254 SKTMTKDGVH----------GLTLHHRTAN-GRPPVTFAVAAEETADVHVSECPCFLLSG 302
SK +G + G + N G A+A + V F L
Sbjct: 212 SKWGESEGNYNQWIVDNFRVGCLMRRDYENPGEGDGQMAIATITNPSLEV-----FYLGK 266
Query: 303 NSKGITAKDMWNEIKKHGSFDHLDNDKTSPSEPGSSIGAAIAASLTIPSGSTRSVTFSLA 362
+ ++W+ +GS ++++ +P+E G A+A TI G TR + F LA
Sbjct: 267 WNPAGDGGEVWDYFAMNGSLPDIEDE--TPAEKGEQTATAMAIRFTIRPGKTRKIPFILA 324
Query: 363 WDCPEVKFFEKV-YHRRYTKFYGTLGDSAARIAHDAILEHAKWECEIEAWQRPILEDKRF 421
WD P +F + + Y+RRYT F+G G +A + A+ W I WQ+PILE
Sbjct: 325 WDLPVTEFAQGINYYRRYTDFFGRNGQNAWAMVRTALKHGDTWRENIINWQKPILERSDL 384
Query: 422 PEWYPITLFNELYYLNAGGTIWTDGSPPMQSLATIRERKFSLDTPSKIHTPASSDTALGT 481
+W+ + LFNELY L GGT+WT + ER
Sbjct: 385 ADWFKMALFNELYLLTDGGTLWTAAT----------ER---------------------- 412
Query: 482 RLLENGEENIGQFLYLEGAEYVMYNTYDVHFYSSFALVMLFPKLELSIQRDFAAAVMMHD 541
+ IGQF LE +Y Y + DV Y SF L+ML+P+LE ++ FA A+ D
Sbjct: 413 -------DPIGQFGVLECIDYRWYESLDVRLYGSFGLLMLWPRLEKAVMEAFARAIPSSD 465
Query: 542 PGTMKIMSDGKWVARKCLGAVPHDIGL--DDPWFEINSYNLFNSSRWKDLNSKFVLQVYR 599
I + RK A PHD+G + PW + N + + + WKDL S FVL VYR
Sbjct: 466 DTPRIIGYNQASAIRKAKNATPHDLGAPNEHPWQKSNYTSYQDCNLWKDLGSDFVLLVYR 525
Query: 600 DFVATG--DKNFARAVWPSVYIAMAYMEQFDKDGDGMIENEGFPDQTYDAWSANGVSAYC 657
D++ TG D++F R WPS+ + + Y++ FD D DG+ EN G PDQT+D W G+SAYC
Sbjct: 526 DYLLTGAKDEDFLRECWPSIVLTLQYLKTFDLDNDGIPENSGAPDQTFDDWRLQGISAYC 585
Query: 658 GGLWVAALQAASALANDVGDHASASYFWVRYQKAKAVY-DSLWNGSYFNYDNSDGSSSTS 716
GGLW+AAL+A + +G+ W+ Q+++A+Y +LWNG Y+N D+ GS S
Sbjct: 586 GGLWIAALEATIKIGAILGEDTEIFASWL--QQSRAIYHQTLWNGEYYNLDS--GSGSDV 641
Query: 717 IQADQLAGQWYARACGLLPIADEAKVKKALTKIYDFNVLKVKGGMCGAMNGMQPDGRIDM 776
+ DQL GQ+YAR L + + + AL KIY LK GG GA NG++PDG +
Sbjct: 642 VMTDQLCGQFYARLLALPDVVENQYTQSALGKIYQACFLKFHGGKFGAANGLKPDGTPEN 701
Query: 777 -SGLQAREIWPGVTYGLAASMIQEEMVDMAFQTAAGVYEVAWSEDGLGYSFQTPESWNNN 835
E+W G+ +GL A ++Q M +Q V + + E+GL F+TPE+ +
Sbjct: 702 PQATHPLEVWVGINFGLVAFLLQMGMEKQGWQITEAVVKQVY-ENGL--QFRTPEAITSV 758
Query: 836 DEYRSLCYMRPLTIWAMQWALTK 858
+R+ Y+R + IWA+ LT+
Sbjct: 759 GTFRACHYLRAMAIWAIYGILTE 781
>gi|425442611|ref|ZP_18822852.1| Similar to tr|P73619|P73619 [Microcystis aeruginosa PCC 9717]
gi|389716273|emb|CCH99457.1| Similar to tr|P73619|P73619 [Microcystis aeruginosa PCC 9717]
Length = 784
Score = 417 bits (1073), Expect = e-113, Method: Compositional matrix adjust.
Identities = 262/803 (32%), Positives = 398/803 (49%), Gaps = 93/803 (11%)
Query: 87 QGIPLGGIGAGSIGRSYRGEFQRFKLFHGICDDAPVLANQFSVFVSRPNGEKFSSVLCPR 146
G+PLGG GAG IGRS +G+F + L G + A QFS+F +G + +
Sbjct: 41 HGMPLGGFGAGCIGRSPKGDFNLWHLDGGEHSFNSLAACQFSIFEQTEDGSAQAYAMATE 100
Query: 147 SPGVPKKNTDSGIESWDWNLKGENCTYHALFPRAWTVYDGEPDPELRIVCRQISPFIPHN 206
SP D + W W E +Y AL+PR+W Y+ + + +++C Q SP HN
Sbjct: 101 SP------KDGTLSRWSW-YPAEKGSYSALYPRSW--YEYKDVFQTQLICEQFSPVWAHN 151
Query: 207 YKESSFPASVFTFTLSNSGQTSADVTLLFTWANSVA-------------GDSGLSGHHFN 253
Y+ESS+P ++F +T N ++++ TW N++ D G + +
Sbjct: 152 YQESSYPIALFEWTAHNPTDKPITISIMLTWQNTIGWFTNAIKSPVVKVRDDGSPVYEYQ 211
Query: 254 SKTMTKDGVH----------GLTLHHRTAN-GRPPVTFAVAAEETADVHVSECPCFLLSG 302
SK +G + G + N G A+A+ + V F L
Sbjct: 212 SKWGESEGNYNQWIVDNFRVGCLMGRDYENPGEGDGQMAIASITNPSLEV-----FYLGK 266
Query: 303 NSKGITAKDMWNEIKKHGSFDHLDNDKTSPSEPGSSIGAAIAASLTIPSGSTRSVTFSLA 362
+ ++W+ +GS ++++ +P+E G A+A TI G TR + F LA
Sbjct: 267 WNPAGDGGEVWDYFAMNGSLPDIEDE--TPAEKGEQTATAMAIRFTIRPGKTRKIPFILA 324
Query: 363 WDCPEVKFFEKV-YHRRYTKFYGTLGDSAARIAHDAILEHAKWECEIEAWQRPILEDKRF 421
WD P +F + + Y+RRYT F+G G +A + A+ W I WQ+PILE
Sbjct: 325 WDLPVTEFAQGINYYRRYTDFFGRNGQNAWAMVRTALKHGDTWRENIINWQKPILERSDL 384
Query: 422 PEWYPITLFNELYYLNAGGTIWTDGSPPMQSLATIRERKFSLDTPSKIHTPASSDTALGT 481
+W+ + LFNELY L GGT+WT +
Sbjct: 385 ADWFKMALFNELYLLTDGGTLWTAATE--------------------------------- 411
Query: 482 RLLENGEENIGQFLYLEGAEYVMYNTYDVHFYSSFALVMLFPKLELSIQRDFAAAVMMHD 541
++ IGQF LE +Y Y + DV Y SF L+ML+P+LE ++ F+ A+ D
Sbjct: 412 ------QDPIGQFGVLECIDYRWYESLDVRLYGSFGLLMLWPRLEKAVMEAFSRAIPSSD 465
Query: 542 PGTMKIMSDGKWVARKCLGAVPHDIGL--DDPWFEINSYNLFNSSRWKDLNSKFVLQVYR 599
I + RK A PHD+G + PW + N + + + WKDL S FVL VYR
Sbjct: 466 DTPRIIGYNQASAIRKAKNATPHDLGAPNEHPWQKSNYTSYQDCNLWKDLGSDFVLLVYR 525
Query: 600 DFVATG--DKNFARAVWPSVYIAMAYMEQFDKDGDGMIENEGFPDQTYDAWSANGVSAYC 657
D++ TG D++F R WPS+ + + Y++ FD D DG+ EN G PDQT+D W G+SAYC
Sbjct: 526 DYLLTGAKDEDFLRECWPSIVLTLQYLKTFDLDNDGIPENSGAPDQTFDDWRLQGISAYC 585
Query: 658 GGLWVAALQAASALANDVGDHASASYFWVRYQKAKAVY-DSLWNGSYFNYDNSDGSSSTS 716
GGLW+AAL+A + +G+ + W+ Q+++A+Y +LWNG Y+N D+ GS S
Sbjct: 586 GGLWIAALEATIKIGEIIGEDTAIFTSWL--QQSRAIYHQTLWNGEYYNLDS--GSGSDV 641
Query: 717 IQADQLAGQWYARACGLLPIADEAKVKKALTKIYDFNVLKVKGGMCGAMNGMQPDGRIDM 776
+ DQL GQ+YAR L + + + AL+KIY LK GG GA NG++PDG +
Sbjct: 642 VMTDQLCGQFYARLLSLPDVVENQYTQSALSKIYQACFLKFHGGKFGAANGLKPDGTPEN 701
Query: 777 -SGLQAREIWPGVTYGLAASMIQEEMVDMAFQTAAGVYEVAWSEDGLGYSFQTPESWNNN 835
E+W G+ +GL A ++Q M +Q V + + E+GL F+TPE+ +
Sbjct: 702 PEATHPLEVWVGINFGLVAFLLQMGMEKQGWQITEAVVKQVY-ENGL--QFRTPEAITSV 758
Query: 836 DEYRSLCYMRPLTIWAMQWALTK 858
+R+ Y+R + IWA+ LT+
Sbjct: 759 GTFRACHYLRAMAIWAIYGVLTE 781
>gi|166363582|ref|YP_001655855.1| glycosidase-like protein [Microcystis aeruginosa NIES-843]
gi|166085955|dbj|BAG00663.1| glycosidase-like protein [Microcystis aeruginosa NIES-843]
Length = 784
Score = 417 bits (1072), Expect = e-113, Method: Compositional matrix adjust.
Identities = 265/803 (33%), Positives = 398/803 (49%), Gaps = 93/803 (11%)
Query: 87 QGIPLGGIGAGSIGRSYRGEFQRFKLFHGICDDAPVLANQFSVFVSRPNGEKFSSVLCPR 146
G+PLGG GAG IGRS +G+F + L G + A QFS+F +G + +
Sbjct: 41 HGMPLGGFGAGCIGRSPKGDFNLWHLDGGEHSFNSLAACQFSIFEQTEDGSAQAYAMATE 100
Query: 147 SPGVPKKNTDSGIESWDWNLKGENCTYHALFPRAWTVYDGEPDPELRIVCRQISPFIPHN 206
SP TD + W W E TY AL+PR+W Y+ + + +++C Q SP HN
Sbjct: 101 SP------TDGTLSRWSW-YPAEKGTYSALYPRSW--YEYKDVFQTQLICEQFSPVWAHN 151
Query: 207 YKESSFPASVFTFTLSNSGQTSADVTLLFTWANSVA-------------GDSGLSGHHFN 253
Y+ESS+P ++F +T N ++++ TW N++ D G + +
Sbjct: 152 YQESSYPIALFEWTAHNPTDKPITISIMLTWQNTIGWFTNGIKSPVVKVRDDGSPVYEYQ 211
Query: 254 SKTMTKDGVH----------GLTLHHRTAN-GRPPVTFAVAAEETADVHVSECPCFLLSG 302
SK +G + G + N G A+A + V F L
Sbjct: 212 SKWGESEGNYNQWIVDNFRVGCLMRRDYENPGEGDGQMAIATITNPSLEV-----FYLGK 266
Query: 303 NSKGITAKDMWNEIKKHGSFDHLDNDKTSPSEPGSSIGAAIAASLTIPSGSTRSVTFSLA 362
+ ++W+ +GS ++++ +P++ G A+A TI G TR + F LA
Sbjct: 267 WNPAGDGGEVWDYFAMNGSLPDIEDE--TPAQKGEQTATAMAIRFTIRPGKTRKIPFILA 324
Query: 363 WDCPEVKFFEKV-YHRRYTKFYGTLGDSAARIAHDAILEHAKWECEIEAWQRPILEDKRF 421
WD P +F + + Y+RRYT F+G G +A + A+ W I WQ+PILE
Sbjct: 325 WDLPVNEFAQGINYYRRYTDFFGRNGQNAWAMVRTALKHGDTWRENIINWQKPILERSDL 384
Query: 422 PEWYPITLFNELYYLNAGGTIWTDGSPPMQSLATIRERKFSLDTPSKIHTPASSDTALGT 481
+W+ + LFNELY L GGT+WT + ER
Sbjct: 385 ADWFKMALFNELYLLTDGGTLWTAAT----------ER---------------------- 412
Query: 482 RLLENGEENIGQFLYLEGAEYVMYNTYDVHFYSSFALVMLFPKLELSIQRDFAAAVMMHD 541
+ IGQF LE +Y Y + DV Y SF L+ML+P+LE ++ FA A+ D
Sbjct: 413 -------DPIGQFGVLECIDYRWYESLDVRLYGSFGLLMLWPRLEKAVMEAFARAIPSSD 465
Query: 542 PGTMKIMSDGKWVARKCLGAVPHDIGL--DDPWFEINSYNLFNSSRWKDLNSKFVLQVYR 599
I + RK A PHD+G + PW + N + + + WKDL S FVL VYR
Sbjct: 466 DTPRIIGYNQASAIRKTKNATPHDLGAPNEHPWQKSNYTSYQDCNLWKDLGSDFVLLVYR 525
Query: 600 DFVATG--DKNFARAVWPSVYIAMAYMEQFDKDGDGMIENEGFPDQTYDAWSANGVSAYC 657
D++ TG D++F R WPS+ + + Y++ FD D DG+ EN G PDQT+D W G+SAYC
Sbjct: 526 DYLLTGAKDEDFLRECWPSIVLTLQYLKTFDLDKDGIPENSGAPDQTFDDWRLQGISAYC 585
Query: 658 GGLWVAALQAASALANDVGDHASASYFWVRYQKAKAVY-DSLWNGSYFNYDNSDGSSSTS 716
GGLW+AAL+A + +G+ W+ Q+++A+Y +LWNG Y+N D+ GS S
Sbjct: 586 GGLWIAALEATLKIGAILGEDTEIFASWL--QQSRAIYHQTLWNGEYYNLDS--GSGSDV 641
Query: 717 IQADQLAGQWYARACGLLPIADEAKVKKALTKIYDFNVLKVKGGMCGAMNGMQPDGRIDM 776
+ DQL GQ+Y+R L + + + AL+KIY LK GG GA NG++PDG +
Sbjct: 642 VMTDQLCGQFYSRLLALPDVVENQYTQSALSKIYQACFLKFHGGKFGAANGLKPDGTPEN 701
Query: 777 -SGLQAREIWPGVTYGLAASMIQEEMVDMAFQTAAGVYEVAWSEDGLGYSFQTPESWNNN 835
E+W G+ +GL A ++Q M +Q V + + E+GL F+TPE+ +
Sbjct: 702 PQATHPLEVWVGINFGLVAFLMQMGMEKQGWQITEAVVKQVY-ENGL--QFRTPEAITSV 758
Query: 836 DEYRSLCYMRPLTIWAMQWALTK 858
+R+ Y+R + IWA+ LT+
Sbjct: 759 GTFRACHYLRAMAIWAIYGVLTE 781
>gi|443325334|ref|ZP_21054034.1| putative bile acid beta-glucosidase [Xenococcus sp. PCC 7305]
gi|442795063|gb|ELS04450.1| putative bile acid beta-glucosidase [Xenococcus sp. PCC 7305]
Length = 808
Score = 417 bits (1071), Expect = e-113, Method: Compositional matrix adjust.
Identities = 273/820 (33%), Positives = 397/820 (48%), Gaps = 110/820 (13%)
Query: 87 QGIPLGGIGAGSIGRSYRGEFQRFKLFHGICDDAPVLANQFSVFVSRPNGEKFSSVLCPR 146
G PLGG GAG IGRS +G+F + L G + A QFSVF S GE + +
Sbjct: 37 HGAPLGGFGAGCIGRSPKGDFNLWHLDGGEHLLNSIPACQFSVFESVAGGETQAFAMACE 96
Query: 147 SPGVPKK--NTDSGIESWDWNLKGENCTYHALFPRAWTVYDGEPDPELRIVCRQISPFIP 204
G K + + + W+W + E Y AL+PR+W Y G EL C Q SP
Sbjct: 97 DDGEIGKWGDREESLSRWNWYPQ-EQGVYAALYPRSWYEYQGVFASEL--TCEQFSPIWK 153
Query: 205 HNYKESSFPASVFTFTLSNSGQTSADVTLLFTWANSVA-------------GDSGLSGHH 251
NY+E+S+P ++F +T N ++++ TW N+V D G +
Sbjct: 154 ENYQETSYPVAIFDWTAHNPTDKPITLSIMLTWQNTVGWFTNAIKSPEVKVRDDGSPEYE 213
Query: 252 FNSKTMTKDG---------------VHGLTLHHRTANGRPPVTFAVAAEETADVHVSECP 296
+ K G ++ + H G + A A +V
Sbjct: 214 YQPKWGDSTGNYNQWIQDSYRVGCILNRVQPHSEIQEGEGQIAIATVANPAVEV------ 267
Query: 297 CFLLSGNSKGITAKDMWNEIKKHGSFDHLDNDKTSPSEPGSSIGAAIAASLTIPSGSTRS 356
F LS + ++W+ GS D + +P++PG I AIA +TIP G T+
Sbjct: 268 -FYLSRWNPEGDGSEVWDSFAADGSLP--DKEDETPADPGEQIACAIAVRVTIPPGKTKK 324
Query: 357 VTFSLAWDCPEVKFFEKV-YHRRYTKFYGTLGDSAARIAHDAILEHAKWECEIEAWQRPI 415
+ F LAWD P +F + + Y+RRYT F+G G++ I A+ W +IEAWQ PI
Sbjct: 325 IPFILAWDFPVTEFAQGINYYRRYTDFFGRTGNNVWSIVKTALKHSDWWREQIEAWQNPI 384
Query: 416 LEDKRFPEWYPITLFNELYYLNAGGTIWTDGSPPMQSLATIRERKFSLDTPSKIHTPASS 475
LE P+W+ + LFNELY L GGT+W+ A++
Sbjct: 385 LERPDLPDWFKMALFNELYLLTQGGTLWS----------------------------AAT 416
Query: 476 DTALGTRLLENGEENIGQFLYLEGAEYVMYNTYDVHFYSSFALVMLFPKLELSIQRDFAA 535
++ + +GQF E +Y Y + DV Y SF L+ML+ KL+ ++ FA
Sbjct: 417 ES-----------DPVGQFCVCECMDYRWYESLDVRLYGSFGLMMLWSKLDKAVLEAFAR 465
Query: 536 AVMMHDPGTMKIMSDGKWVARKCLGAVPHDIGL--DDPWFEINSYNLFNSSRWKDLNSKF 593
A+ D I +G RK GA PHD+G + PW N + + ++WKDL S F
Sbjct: 466 AIPKSDDKPRIIGYNGATAIRKAAGATPHDLGAPNEHPWEATNYTSYQDCNQWKDLPSDF 525
Query: 594 VLQVYRDFVATGDKN--FARAVWPSVYIAMAYMEQFDKDGDGMIENEGFPDQTYDAWSAN 651
VLQVYRD++ T N F W SV +AY+++FD+D DG+ EN G PDQT+D W
Sbjct: 526 VLQVYRDYLFTEGDNTEFLWECWDSVVCTLAYLKEFDEDNDGIPENSGAPDQTFDDWRLQ 585
Query: 652 GVSAYCGGLWVAALQAASALANDV-----------------GDHASASYFWVRYQKAKAV 694
G+S+YCGGLW+AAL+AA + + G + S + Q+++ +
Sbjct: 586 GISSYCGGLWIAALEAAIKIGEILIANPTKNPELQTADFPQGIEKTISDYREWLQRSREL 645
Query: 695 Y-DSLWNGSYFNYDNSDGSSSTSIQADQLAGQWYARACGLLPIADEAKVKKALTKIYDFN 753
Y D+LWNG Y+ D+ S+S + ADQL GQ+YAR GL + + K AL+KIYD
Sbjct: 646 YHDTLWNGEYYRLDSK--SNSEVVMADQLCGQFYARLLGLPDVVENQYTKSALSKIYDAC 703
Query: 754 VLKVKGGMCGAMNGMQPDGR-IDMSGLQAREIWPGVTYGLAASMIQEEMVDMAFQTAAGV 812
LK G GA NG+ PDG ++ E+W G+ +GLAA M+Q M D AF+ V
Sbjct: 704 FLKFHDGKFGAANGVLPDGSAVNPDDTHPLEVWTGINFGLAAFMVQMGMKDEAFRLTETV 763
Query: 813 YEVAWSEDGLGYSFQTPESWNNNDEYRSLCYMRPLTIWAM 852
+ E+GL F+TPE+ +R+ Y+R + IW +
Sbjct: 764 VRQVY-ENGL--QFRTPEAITAVGTFRASHYLRAMAIWGV 800
>gi|425464233|ref|ZP_18843555.1| Glycosidase-like protein [Microcystis aeruginosa PCC 9809]
gi|389833806|emb|CCI21373.1| Glycosidase-like protein [Microcystis aeruginosa PCC 9809]
Length = 784
Score = 417 bits (1071), Expect = e-113, Method: Compositional matrix adjust.
Identities = 263/803 (32%), Positives = 397/803 (49%), Gaps = 93/803 (11%)
Query: 87 QGIPLGGIGAGSIGRSYRGEFQRFKLFHGICDDAPVLANQFSVFVSRPNGEKFSSVLCPR 146
G+PLGG GAG IGRS +G+F + L G + A QFS+F +G + +
Sbjct: 41 HGMPLGGFGAGCIGRSPKGDFNLWHLDGGEHSFNSLAACQFSIFEQTEDGSAQAYAMATE 100
Query: 147 SPGVPKKNTDSGIESWDWNLKGENCTYHALFPRAWTVYDGEPDPELRIVCRQISPFIPHN 206
SP TD + W W E TY AL+PR+W Y+ + + +++C Q SP HN
Sbjct: 101 SP------TDGTLSRWSW-YPAEKGTYSALYPRSW--YEYKDVFQTQLICEQFSPVWAHN 151
Query: 207 YKESSFPASVFTFTLSNSGQTSADVTLLFTWANSVA-------------GDSGLSGHHFN 253
Y+ESS+P ++F +T N ++++ TW N++ D G + +
Sbjct: 152 YQESSYPIALFEWTAHNPTDKPITISIMLTWQNTIGWFTNAIKSPVVKVRDDGSPVYEYQ 211
Query: 254 SKTMTKDGVH----------GLTLHHRTAN-GRPPVTFAVAAEETADVHVSECPCFLLSG 302
SK +G + G + N G A+A+ + V F L
Sbjct: 212 SKWGESEGNYNQWIVDNFRVGCLMGRDYENPGEGDGQMAIASITNPSLEV-----FYLGK 266
Query: 303 NSKGITAKDMWNEIKKHGSFDHLDNDKTSPSEPGSSIGAAIAASLTIPSGSTRSVTFSLA 362
+ ++W+ +GS ++++ +P+E G A+A TI G TR + F LA
Sbjct: 267 WNPAGDGGEVWDYFAMNGSLPDIEDE--TPAEKGEQTATAMAIRFTIRPGKTRKIPFILA 324
Query: 363 WDCPEVKFFEKV-YHRRYTKFYGTLGDSAARIAHDAILEHAKWECEIEAWQRPILEDKRF 421
WD P +F + + Y+RRYT F+G G +A + A+ W I WQ+PILE
Sbjct: 325 WDLPVTEFAQGINYYRRYTDFFGRNGQNAWAMVRTALKHGDTWRENIINWQKPILERTDL 384
Query: 422 PEWYPITLFNELYYLNAGGTIWTDGSPPMQSLATIRERKFSLDTPSKIHTPASSDTALGT 481
W+ + LFNELY L GGT+WT +
Sbjct: 385 AAWFKMALFNELYLLTDGGTLWTAATE--------------------------------- 411
Query: 482 RLLENGEENIGQFLYLEGAEYVMYNTYDVHFYSSFALVMLFPKLELSIQRDFAAAVMMHD 541
++ IGQF LE +Y Y + DV Y SF L+ML+P+LE ++ FA A+ D
Sbjct: 412 ------QDPIGQFGVLECIDYRWYESLDVRLYGSFGLLMLWPRLEKAVMEAFARAIPSSD 465
Query: 542 PGTMKIMSDGKWVARKCLGAVPHDIGL--DDPWFEINSYNLFNSSRWKDLNSKFVLQVYR 599
I + RK A PHD+G + PW + N + + + WKDL S FVL VYR
Sbjct: 466 DTPRIIGYNQASAIRKAKNATPHDLGAPNEHPWQKSNYTSYQDCNLWKDLGSDFVLLVYR 525
Query: 600 DFVATG--DKNFARAVWPSVYIAMAYMEQFDKDGDGMIENEGFPDQTYDAWSANGVSAYC 657
D++ TG D++F R WPS+ + + Y++ FD D DG+ EN G PDQT+D W G+SAYC
Sbjct: 526 DYLLTGAKDEDFLRECWPSIVLTLQYLKTFDLDNDGIPENSGAPDQTFDDWRLQGISAYC 585
Query: 658 GGLWVAALQAASALANDVGDHASASYFWVRYQKAKAVY-DSLWNGSYFNYDNSDGSSSTS 716
GGLW+AAL+A + +G+ + W+ Q+++ +Y +LWNG Y+N D+ GS S
Sbjct: 586 GGLWIAALEATIKIGAILGEDTAIFASWL--QQSRTIYHQTLWNGEYYNLDS--GSGSDV 641
Query: 717 IQADQLAGQWYARACGLLPIADEAKVKKALTKIYDFNVLKVKGGMCGAMNGMQPDGRIDM 776
+ DQL GQ+Y+R L + + + AL+KIY LK GG GA NG++PDG +
Sbjct: 642 VMTDQLCGQFYSRLLALPDVVENQYTQSALSKIYQACFLKFHGGKFGAANGLKPDGTPEN 701
Query: 777 -SGLQAREIWPGVTYGLAASMIQEEMVDMAFQTAAGVYEVAWSEDGLGYSFQTPESWNNN 835
E+W G+ +GL A ++Q M +Q V + + E+GL F+TPE+ +
Sbjct: 702 PQATHPLEVWVGINFGLVAFLMQMGMEKQGWQITEAVVKQVY-ENGL--QFRTPEAITSV 758
Query: 836 DEYRSLCYMRPLTIWAMQWALTK 858
+R+ Y+R + IWA+ LT+
Sbjct: 759 GTFRACHYLRAMAIWAIYGVLTE 781
>gi|434400779|ref|YP_007134783.1| Glucosylceramidase [Stanieria cyanosphaera PCC 7437]
gi|428271876|gb|AFZ37817.1| Glucosylceramidase [Stanieria cyanosphaera PCC 7437]
Length = 804
Score = 416 bits (1069), Expect = e-113, Method: Compositional matrix adjust.
Identities = 281/818 (34%), Positives = 400/818 (48%), Gaps = 115/818 (14%)
Query: 87 QGIPLGGIGAGSIGRSYRGEFQRFKLFHGICDDAPVLANQFSVFVSRPNGEKFSSVLCPR 146
G+PLGG GAG IGRS+RG+F + L G + A QFSVF S + + LC
Sbjct: 42 HGMPLGGFGAGCIGRSHRGDFNLWHLDGGEHVFRSLPACQFSVFESS-EAKTQAYALCTE 100
Query: 147 SPGVPKKNTDSGIESWDWNLKGENCTYHALFPRAWTVYDGEPDPELRIVCRQISPFIPHN 206
P D + W W K E YHAL+PR+W Y +L C Q SP N
Sbjct: 101 PP------EDGTLSRWQWYPK-EKGNYHALYPRSWFEYQDVFQSQL--TCEQFSPIWAEN 151
Query: 207 YKESSFPASVFTFTLSNSGQTSADVTLLFTWANSVA-------------GDSGLSGHHFN 253
Y+ESS+P ++F + + N ++++ TW N+V D G + +
Sbjct: 152 YQESSYPVAIFDWNVHNPTDKPITLSIMLTWQNTVGWFTNAIKSPTVKVRDDGSPVYEYQ 211
Query: 254 SKTMTKDG---------------VHGLTLHHRTANGRPPVTFAVAAEETADVHVSECPCF 298
K G ++ + H G + A T +V F
Sbjct: 212 PKWGDSTGNFNQWIQDNYRVGCLLNRIQPHAEIQEGEGQIAIATVTNPTLEV-------F 264
Query: 299 LLSGNSKGITAKDMWNEIKKHGSFDHLDNDKTSPSEPGSSIGAAIAASLTIPSGSTRSVT 358
LS D+W+ GS D +P++PG I +AIA T+ G TR +
Sbjct: 265 YLSRWHPDGDGADVWDYFAADGSLP--DRQDETPADPGEQIASAIAVRFTVNPGKTRQIP 322
Query: 359 FSLAWDCPEVKFFEKV-YHRRYTKFYGTLGDSAARIAHDAILEHAKWECEIEAWQRPILE 417
F LAWD P +F + + Y+RRYT F+G G++A I A+ W+ I+AWQ PIL+
Sbjct: 323 FVLAWDLPVTEFAQGIKYYRRYTDFFGRTGNNAWSIVKTALKHSDWWQESIQAWQEPILK 382
Query: 418 DKRFPEWYPITLFNELYYLNAGGTIWTDGSPPMQSLATIRERKFSLDTPSKIHTPASSDT 477
+ P+W+ + LFNELY L GGT+W+ A+++T
Sbjct: 383 RQDLPDWFKMALFNELYLLTQGGTLWS----------------------------AATET 414
Query: 478 ALGTRLLENGEENIGQFLYLEGAEYVMYNTYDVHFYSSFALVMLFPKLELSIQRDFAAAV 537
+ IGQF LE +Y Y + DV Y SF L+ML+ KL+ S+ FA A+
Sbjct: 415 -----------DPIGQFGVLECMDYRWYESLDVRLYGSFGLMMLWSKLDKSVLEAFARAI 463
Query: 538 MMHDPGTMKIMSDGKWVARKCLGAVPHDIGL--DDPWFEINSYNLFNSSRWKDLNSKFVL 595
D I + RK GA PHD+G + PW N + + + WKDL S FVL
Sbjct: 464 PKGDDTPRIIGYNRAQAIRKAAGATPHDLGAPNEHPWQATNYTSYQDCNLWKDLPSDFVL 523
Query: 596 QVYRDFVATGDKN--FARAVWPSVYIAMAYMEQFDKDGDGMIENEGFPDQTYDAWSANGV 653
QVYRD+V TG K+ F W S+ + +AY+++FD DGDG+ EN G PDQT+D W G+
Sbjct: 524 QVYRDYVLTGSKDSEFLWECWDSIVVTLAYLKEFDLDGDGIPENSGAPDQTFDDWRLQGI 583
Query: 654 SAYCGGLWVAALQAASALANDVGDHASA-----------------SYFWVRYQKAKAVY- 695
SAYCGGLW+AAL+AA A+ + H +++ V Q+A+++Y
Sbjct: 584 SAYCGGLWIAALEAAIAIGQILMTHPPMNPNLQPPDYPQGIETVINHYRVWLQQARSLYH 643
Query: 696 DSLWNGSYFNYDNSDGSSSTSIQADQLAGQWYARACGLLPIADEAKVKKALTKIYDFNVL 755
D+LWNG Y+ D+ S+S + ADQL GQ+YAR GL + +E K AL KIY L
Sbjct: 644 DTLWNGEYYRLDSQ--SNSDVVMADQLCGQFYARLLGLPDVVEEQYTKSALKKIYQACFL 701
Query: 756 KVKGGMCGAMNGMQPDGR-IDMSGLQAREIWPGVTYGLAASMIQEEMVDMAFQTAAGVYE 814
K G GA NG+ PDG ++ + E+W G+ +GLAA M+Q M AFQ V +
Sbjct: 702 KFHNGKYGAANGVLPDGSPVNPNDTHPLEVWTGINFGLAAFMLQMGMKAEAFQLTETVVK 761
Query: 815 VAWSEDGLGYSFQTPESWNNNDEYRSLCYMRPLTIWAM 852
+ E+GL F+TPE+ +R+ Y+R + IW +
Sbjct: 762 QVY-ENGL--QFRTPEAITAVGTFRASHYLRAMAIWGI 796
>gi|390437958|ref|ZP_10226465.1| Glycosidase-like protein [Microcystis sp. T1-4]
gi|389838618|emb|CCI30589.1| Glycosidase-like protein [Microcystis sp. T1-4]
Length = 784
Score = 414 bits (1065), Expect = e-113, Method: Compositional matrix adjust.
Identities = 262/803 (32%), Positives = 396/803 (49%), Gaps = 93/803 (11%)
Query: 87 QGIPLGGIGAGSIGRSYRGEFQRFKLFHGICDDAPVLANQFSVFVSRPNGEKFSSVLCPR 146
G+PLGG GAG IGRS +G+F + L G + A QFS+F +G + +
Sbjct: 41 HGMPLGGFGAGCIGRSPKGDFNLWHLDGGEHSFNSLAACQFSIFEQTEDGSAQAYAMATE 100
Query: 147 SPGVPKKNTDSGIESWDWNLKGENCTYHALFPRAWTVYDGEPDPELRIVCRQISPFIPHN 206
SP TD + W W E Y AL+PR+W Y+ + + +++C Q SP HN
Sbjct: 101 SP------TDGTLSRWSW-YPAEKGNYSALYPRSW--YEYKDVFQTQLICEQFSPVWAHN 151
Query: 207 YKESSFPASVFTFTLSNSGQTSADVTLLFTWANSVA-------------GDSGLSGHHFN 253
Y+ESS+P ++F +T N ++++ TW N++ D G + +
Sbjct: 152 YQESSYPIALFEWTAHNPTDKPITISIMLTWQNTIGWFTNAIKSPVVKVRDDGSPVYEYQ 211
Query: 254 SKTMTKDGVH----------GLTLHHRTAN-GRPPVTFAVAAEETADVHVSECPCFLLSG 302
SK +G + G + N G A+A+ + V F L
Sbjct: 212 SKWGESEGNYNQWIVDNFRVGCLMRRDYENPGEGDGQMAIASITNPSLEV-----FYLGK 266
Query: 303 NSKGITAKDMWNEIKKHGSFDHLDNDKTSPSEPGSSIGAAIAASLTIPSGSTRSVTFSLA 362
+ ++W+ +GS ++++ +P++ G A+A TI G TR + F LA
Sbjct: 267 WNPAGDGGEVWDYFAMNGSLPDIEDE--TPAQKGEQTATAMAIRFTIRPGKTRKIPFILA 324
Query: 363 WDCPEVKFFEKV-YHRRYTKFYGTLGDSAARIAHDAILEHAKWECEIEAWQRPILEDKRF 421
WD P +F + + Y+RRYT F+G G +A + A+ W I WQ+PILE
Sbjct: 325 WDLPVTEFAQGINYYRRYTDFFGRNGQNAWAMVRTALKHGDTWRENIINWQKPILERSDL 384
Query: 422 PEWYPITLFNELYYLNAGGTIWTDGSPPMQSLATIRERKFSLDTPSKIHTPASSDTALGT 481
+W+ + LFNELY L GGT+WT +
Sbjct: 385 ADWFKMALFNELYLLTDGGTLWTAATE--------------------------------- 411
Query: 482 RLLENGEENIGQFLYLEGAEYVMYNTYDVHFYSSFALVMLFPKLELSIQRDFAAAVMMHD 541
++ IGQF LE +Y Y + DV Y SF L+ML+P+LE ++ FA A+ D
Sbjct: 412 ------QDPIGQFGVLECIDYRWYESLDVRLYGSFGLLMLWPRLEKAVMEAFARAIPSSD 465
Query: 542 PGTMKIMSDGKWVARKCLGAVPHDIGL--DDPWFEINSYNLFNSSRWKDLNSKFVLQVYR 599
I + RK A PHD+G + PW + N + + + WKDL S FVL VYR
Sbjct: 466 DTPRIIGYNQASAIRKAKNATPHDLGAPNEHPWQKSNYTSYQDCNLWKDLGSDFVLLVYR 525
Query: 600 DFVATG--DKNFARAVWPSVYIAMAYMEQFDKDGDGMIENEGFPDQTYDAWSANGVSAYC 657
D++ TG D++F R WPS+ + + Y++ FD D DG+ EN G PDQT+D W G+SAYC
Sbjct: 526 DYLLTGAKDEDFLRECWPSIVLTLQYLKTFDLDKDGIPENSGAPDQTFDDWRLQGISAYC 585
Query: 658 GGLWVAALQAASALANDVGDHASASYFWVRYQKAKAVY-DSLWNGSYFNYDNSDGSSSTS 716
GGLW+AAL+A + +G+ + W+ Q+++A+Y +LWNG Y+N D+ GS S
Sbjct: 586 GGLWIAALEATLKIGAILGEDTAIFAGWL--QQSRAIYHQTLWNGEYYNLDS--GSGSDV 641
Query: 717 IQADQLAGQWYARACGLLPIADEAKVKKALTKIYDFNVLKVKGGMCGAMNGMQPDGRIDM 776
+ DQL GQ+YAR L + + + AL+KIY LK G GA NG++PDG +
Sbjct: 642 VMTDQLCGQFYARLLALPDVVENQYTQSALSKIYQACFLKFHEGKFGAANGLKPDGTPEN 701
Query: 777 -SGLQAREIWPGVTYGLAASMIQEEMVDMAFQTAAGVYEVAWSEDGLGYSFQTPESWNNN 835
E+W G+ +GL A ++Q M +Q V + E+GL F+TPE+ +
Sbjct: 702 PQATHPLEVWVGINFGLVAFLMQMGMEKQGWQITEAVVRQVY-ENGL--QFRTPEAITSV 758
Query: 836 DEYRSLCYMRPLTIWAMQWALTK 858
+R+ Y+R + IWA+ LT+
Sbjct: 759 GTFRACHYLRAMAIWAIYGILTE 781
>gi|172036524|ref|YP_001803025.1| hypothetical protein cce_1609 [Cyanothece sp. ATCC 51142]
gi|354553307|ref|ZP_08972614.1| Glucosylceramidase [Cyanothece sp. ATCC 51472]
gi|171697978|gb|ACB50959.1| unknown [Cyanothece sp. ATCC 51142]
gi|353555137|gb|EHC24526.1| Glucosylceramidase [Cyanothece sp. ATCC 51472]
Length = 818
Score = 412 bits (1059), Expect = e-112, Method: Compositional matrix adjust.
Identities = 268/818 (32%), Positives = 394/818 (48%), Gaps = 114/818 (13%)
Query: 87 QGIPLGGIGAGSIGRSYRGEFQRFKLFHGICDDAPVLANQFSVFVSRPNGEKFSSVLCPR 146
G+P+GG G+G IGRS RG+F + L G + + QFS+F NG + L
Sbjct: 55 HGMPIGGFGSGCIGRSPRGDFNLWHLDGGEHIFKSIPSCQFSIFEQPENGTAQAYALATE 114
Query: 147 SPGVPKKNTDSGIESWDWNLKGENCTYHALFPRAWTVYDGEPDPELRIVCRQISPFIPHN 206
P D+ + W W E TY AL+P +W Y+G E I C Q SP IP N
Sbjct: 115 PP------EDNTLSRWAW-YPTEKGTYSALYPCSWYQYEGVFQAE--IYCEQFSPIIPDN 165
Query: 207 YKESSFPASVFTFTLSNSGQTSADVTLLFTWANSVA-------------GDSGLSGHHFN 253
Y+E+S+P VF +T+ N ++++ TW N V D G + +
Sbjct: 166 YQETSYPIGVFEWTVKNPTDKPITLSIMLTWQNIVGWFTNAIKSPEITVRDDGSPEYEYQ 225
Query: 254 SKTMTKDG---------------VHGLTLHHRTANGRPPVTFAVAAEETADVHVSECPCF 298
+ G ++ L H + G + A + +V +
Sbjct: 226 PRWGHSTGNFNQWIQDNFRVGFILNRLQPHQQVQEGEGQICIASVTNPSVEVF------Y 279
Query: 299 LLSGNSKGITAKDMWNEIKKHGSFDHLDNDKTSPSEPGSSIGAAIAASLTIPSGSTRSVT 358
L N G ++W+ +GS D + +P+EPG I AA+A T+ G + +
Sbjct: 280 LGKWNPNG-DGSEVWDHFAMNGSLP--DQEDETPAEPGEQIAAAMAIRFTVKPGRVKKIP 336
Query: 359 FSLAWDCPEVKFFEKV-YHRRYTKFYGTLGDSAARIAHDAILEHAKWECEIEAWQRPILE 417
F LAWD P +F + + Y+RRYT F+G G + + A+ W +IE WQ PIL
Sbjct: 337 FILAWDLPVTEFAQGIQYYRRYTDFFGRNGQNGWAMVRTALKHSDVWREKIEEWQNPILN 396
Query: 418 DKRFPEWYPITLFNELYYLNAGGTIWTDGSPPMQSLATIRERKFSLDTPSKIHTPASSDT 477
P+W+ + LFNELY L GGT+WT A+S+T
Sbjct: 397 RDDLPDWFKMALFNELYLLTDGGTLWT----------------------------AASET 428
Query: 478 ALGTRLLENGEENIGQFLYLEGAEYVMYNTYDVHFYSSFALVMLFPKLELSIQRDFAAAV 537
+ IGQF LE +Y Y + DV Y SF L+ML+P+LE ++ FA A+
Sbjct: 429 -----------DPIGQFGVLESMDYRWYESLDVRLYGSFGLLMLWPRLEKAVMEAFARAI 477
Query: 538 MMHDPGTMKIMSDGKWVARKCLGAVPHDIGL--DDPWFEINSYNLFNSSRWKDLNSKFVL 595
D I + RK GA PHD+G + PW + N + + + WKDL S FVL
Sbjct: 478 PNSDDTVRIIGYNQAEGIRKTKGATPHDLGAPNEHPWEKTNYTSYQDCNLWKDLGSDFVL 537
Query: 596 QVYRDFVATG--DKNFARAVWPSVYIAMAYMEQFDKDGDGMIENEGFPDQTYDAWSANGV 653
QVYRDF+ TG D F W ++ + Y++ FD D DG+ EN G PDQT+D W G+
Sbjct: 538 QVYRDFLLTGSDDIEFLWECWEAIPETLNYLKAFDLDNDGIPENSGAPDQTFDDWELRGI 597
Query: 654 SAYCGGLWVAALQAASALANDVGDHASAS-----------------YFWVRYQKAKAVYD 696
SAYCG LW+AAL+AA + + ++A + + Q+++A+Y
Sbjct: 598 SAYCGALWIAALEAAIKIGEILLENAPQNPQLEPENYPKSIEKELDKYQQWLQQSRAIYH 657
Query: 697 S-LWNGSYFNYDNSDGSSSTSIQADQLAGQWYARACGLLPIADEAKVKKALTKIYDFNVL 755
S LWNG Y+ D+ GS + ADQL+GQ+YAR GL + ++ AL K+Y+ L
Sbjct: 658 STLWNGEYYKLDSESGSD--VVMADQLSGQFYARLLGLPDVVEQQYALSALKKVYEACFL 715
Query: 756 KVKGGMCGAMNGMQPDGRI-DMSGLQAREIWPGVTYGLAASMIQEEMVDMAFQTAAGVYE 814
K + G GA NGM+PDG D + +E+W G+ +GLA+ +IQ M D A + V +
Sbjct: 716 KFQNGKYGAANGMKPDGTPEDPNSTHPQEVWTGINFGLASFLIQMGMKDEALKLTEAVVK 775
Query: 815 VAWSEDGLGYSFQTPESWNNNDEYRSLCYMRPLTIWAM 852
+ E+GL F+TPE+ +R+ Y+R + IW +
Sbjct: 776 QVY-ENGL--QFRTPEAITAVGTFRACHYLRAMAIWGV 810
>gi|157119417|ref|XP_001653371.1| bile acid beta-glucosidase, putative [Aedes aegypti]
gi|108883154|gb|EAT47379.1| AAEL001478-PA, partial [Aedes aegypti]
Length = 898
Score = 411 bits (1057), Expect = e-112, Method: Compositional matrix adjust.
Identities = 276/860 (32%), Positives = 427/860 (49%), Gaps = 148/860 (17%)
Query: 103 YRGEFQRFKLFHGICDDAPVLANQFSVFVSRPNG-------------------EKFSSVL 143
+ GEF RF+L G+ + V ANQF V + N +++S +
Sbjct: 87 FAGEFCRFQLKPGLYEYNTVHANQFIVTIKDENNVTIFQSLLSSYRYVCTWCNDRYSDLF 146
Query: 144 CPRSPGVPKK-NTDSGIESWDWNLKGENCTYHALFPRAWTVYDGEPDPELRIVCRQISPF 202
+ VP + + + SW+ + C Y AL+PRAW+ YD D +++V RQISP
Sbjct: 147 ARKLGLVPIEIRPKNPLSSWETYINSSKCNYTALYPRAWSEYDLS-DYGIKLVQRQISPI 205
Query: 203 IPHNYKESSFPASVFTFTLSNSGQTSADVTLLFTWANSVAGDSGLSGHHFNSKTMTKDGV 262
IPH+YKESS PA+VF +T+ N VT+ FT+ N + + + +
Sbjct: 206 IPHDYKESSLPAAVFVWTVENVCGKDRQVTITFTFKNGTGNKKQDAEGNSETAEFSHGSA 265
Query: 263 HGLTLHHRTANGRPPVTFAVAAEETADVHVSECPCFLLSGNSKGITAKDMWNEIKKHGSF 322
G+++ + + T+ ++ + +++++++ C F +GN + +WN++K++G
Sbjct: 266 KGVSIKQSIVDMQS--TYCISCKTSSEINLTRCKKFDPTGN-----GEKLWNDLKENGQL 318
Query: 323 DHLDNDKTSPSEPGSSIGAAIAASLTIPSGSTRSVTFSLAWDCPEVKF--FEKVYHRRYT 380
+ ++ S+ + A++A + + S + + FSL WD P+++F K Y++ YT
Sbjct: 319 TEVSVEENLKSK---DVAVAVSAQILVQSTGSSDLEFSLVWDMPKIRFGKKTKEYYKYYT 375
Query: 381 KFYGTLGDSAARIAHDAILEHAKWECEIEAWQRPILEDKRFPEWYPITLFNELYYLNAGG 440
K++G GD+ I+ A+ KWE I WQRPIL+D P+WY +FNELY++ GG
Sbjct: 376 KYFGKSGDAGPSISDYALQNFGKWETLINEWQRPILDDNDLPDWYKSAIFNELYFIADGG 435
Query: 441 TIWTDGSPPMQSLATIRERKFSLDTPSKIHTPASSDTALGTRLLENGEENIGQFLYLEGA 500
++W T + + D P + G+F YLEG
Sbjct: 436 SVW----------LTTDDAELPYDDPRLAY---------------------GRFAYLEGH 464
Query: 501 EYVMYNTYDVHFYSSFALVMLFPKLELSIQRDFAAAVMMHDPGTMKIMSDGKWVARKC-- 558
EY MYNTYDVHFY+S AL L+P L++S+Q D+ + K + DGK + RK
Sbjct: 465 EYRMYNTYDVHFYASHALASLWPNLQVSLQYDYKDTINREISEGRKHLYDGKIIPRKIKN 524
Query: 559 ----------------LGAVP-HDIGLDDPWFEINS----------YNL----------- 580
+ A P HD+ W ++N+ Y L
Sbjct: 525 SVPHDLGDPDEEPFDLINAYPIHDVS---EWRDLNTKFILQVYRDYYTLNRYAQQNAENA 581
Query: 581 --FNSSRWKDLNSKF--VLQVYRDFVATGDKNFARAVWPSVYI-----------AMAYME 625
F+S + D S + +Q R+ V+ +K R S+YI AM Y++
Sbjct: 582 SKFSSIEFIDKESMYDTYVQDNRNKVSPEEKQQNRKS-ASMYINETNGKVYLMDAMTYLK 640
Query: 626 --------------QFDKDGDGMIENEGFPDQTYDAWSANGVSAYCGGLWVAALQAASAL 671
++DKD DG+IEN PDQTYD W +G SAYCGGLW+A+L +A+
Sbjct: 641 SMYPACKQVLEKTLEWDKDNDGIIENSKCPDQTYDTWVMDGPSAYCGGLWLASLHCMTAM 700
Query: 672 ANDVGDHASASYFWVRYQKAKAVY-DSLWNGSYFNYDNSDGSSSTSIQADQLAGQWYARA 730
AN + + + F +K K + + LWNG+Y+ +D S +T I +DQL G WY R
Sbjct: 701 ANLLDQNDDCNKFREILEKGKTSFEEKLWNGTYYKFDGQSASKNT-IMSDQLCGHWYLRC 759
Query: 731 CGL-LPIADEAKVKKALTKIYDFNVLKVKGGMCGAMNGMQP------DGRIDMSGLQARE 783
CG I + V+ AL IYD NV++ GG GA+NG P DGR D S +Q E
Sbjct: 760 CGFDYDIFPKENVRTALKTIYDNNVMRFCGGNMGAVNGYVPSTQPNKDGRADASTVQGEE 819
Query: 784 IWPGVTYGLAASMIQEEMVDMAFQTAAGVYEVAWSEDGLGYSFQTPESWNNNDEYRSLCY 843
+W GVTY LA++MI E M AFQTA G+Y+ + +G +F+TPE+ YR++ Y
Sbjct: 820 VWTGVTYALASTMIHEGMFTEAFQTAGGMYQTL--SEKIGMNFETPEALYAERHYRAIGY 877
Query: 844 MRPLTIWAMQWALTKPKLSR 863
MRPL+IW++Q A + K +R
Sbjct: 878 MRPLSIWSLQTAWEQKKQNR 897
>gi|343959578|dbj|BAK63646.1| bile acid beta-glucosidase [Pan troglodytes]
Length = 514
Score = 410 bits (1055), Expect = e-111, Method: Compositional matrix adjust.
Identities = 219/496 (44%), Positives = 289/496 (58%), Gaps = 37/496 (7%)
Query: 366 PEVKFFEK--VYHRRYTKFYGTLGDSAARIAHDAILEHAKWECEIEAWQRPILEDKRFPE 423
P + F K V++RRYT+F+G GD+A ++H A+ + +WE I AWQ P+L+D+ P
Sbjct: 2 PRIMFGAKGQVHYRRYTRFFGQDGDAAPALSHYALCRYTEWEERISAWQSPVLDDRSLPA 61
Query: 424 WYPITLFNELYYLNAGGTIWTDGSPPMQSLATIRERKFSLDTPSKIHTPASSDTALGTRL 483
WY LFNELY+L GGT+W L + + S LG +
Sbjct: 62 WYKSALFNELYFLADGGTVW---------LEVLED---------------SLPEELGRNM 97
Query: 484 --LENGEENIGQFLYLEGAEYVMYNTYDVHFYSSFALVMLFPKLELSIQRDFAAAVMMHD 541
L + G+F YLEG EY MYNTYDVHFY+SFAL+ML+PKLELS+Q D A A + D
Sbjct: 98 CHLRPTLRDYGRFGYLEGQEYRMYNTYDVHFYASFALIMLWPKLELSLQYDMALATLRED 157
Query: 542 PGTMKIMSDGKWVARKCLGAVPHDIGL--DDPWFEINSYNLFNSSRWKDLNSKFVLQVYR 599
+ + G K +P DIG D+PW +N+Y + +++ WKDLN KFVLQVYR
Sbjct: 158 LTRRRYLMSGVMAPVKRRNVIPRDIGDPDDEPWLRVNAYLIHDTADWKDLNLKFVLQVYR 217
Query: 600 DFVATGDKNFARAVWPSVYIAMAYMEQFDKDGDGMIENEGFPDQTYDAWSANGVSAYCGG 659
D+ TGD+NF + +WP M +FDKD DG+IEN G+ DQTYD W G SAYCGG
Sbjct: 218 DYYLTGDQNFLKDMWPVCLAVMESEMKFDKDHDGLIENGGYADQTYDGWVTTGPSAYCGG 277
Query: 660 LWVAALQAASALANDVGDHASASYFWVRYQKAKAVYDS-LWNGSYFNYDNSDGSSSTSIQ 718
LW+AA+ +A G F + + Y+ LWNG Y+NYD+S S S+
Sbjct: 278 LWLAAVAVMVQMAALCGAQDIQDKFSSILSRGQEAYERLLWNGRYYNYDSSSRPQSRSVM 337
Query: 719 ADQLAGQWYARACGL----LPIADEAKVKKALTKIYDFNVLKVKGGMCGAMNGMQPDGRI 774
+DQ AGQW+ +ACGL + V +AL I++ NV GG GA+NGMQP G
Sbjct: 338 SDQCAGQWFLKACGLGEGDTEVFPTQHVVRALQTIFELNVQAFAGGAMGAVNGMQPHGVP 397
Query: 775 DMSGLQAREIWPGVTYGLAASMIQEEMVDMAFQTAAGVYEVAWSEDGLGYSFQTPESWNN 834
D S +Q+ E+W GV YGLAA+MIQE + FQTA G Y W LG +FQTPE++
Sbjct: 398 DKSSVQSDEVWVGVVYGLAATMIQEGLTWEGFQTAEGCYRTVWER--LGLAFQTPEAYCQ 455
Query: 835 NDEYRSLCYMRPLTIW 850
+RSL YMRPL+IW
Sbjct: 456 QRVFRSLAYMRPLSIW 471
>gi|440680630|ref|YP_007155425.1| Glucosylceramidase [Anabaena cylindrica PCC 7122]
gi|428677749|gb|AFZ56515.1| Glucosylceramidase [Anabaena cylindrica PCC 7122]
Length = 802
Score = 410 bits (1054), Expect = e-111, Method: Compositional matrix adjust.
Identities = 275/822 (33%), Positives = 405/822 (49%), Gaps = 119/822 (14%)
Query: 87 QGIPLGGIGAGSIGRSYRGEFQRFKLFHG--ICDDAPVLANQFSVFVSRPNGEKFSSVLC 144
G+PLGG GAG IGRS RG+F + + G I + P A QFS+F S K + L
Sbjct: 40 HGMPLGGFGAGCIGRSSRGDFNLWHIDGGEHIFNTFP--ACQFSIFESTGISTK-AYALS 96
Query: 145 PRSPGVPKKNTDSGIESWDW----NLKGENCTYHALFPRAWTVYDGEPDPELRIVCRQIS 200
+ P D +++W W + E YHAL+PR+W VY+ +L C Q S
Sbjct: 97 SQPP------EDGTLQTWQWYPASSTTEETGKYHALYPRSWFVYENVFQTDL--TCEQFS 148
Query: 201 PFIPHNYKESSFPASVFTFTLSNSGQTSADVTLLFTWANSVA-------------GDSG- 246
P +NY+E+S+P +VF + N ++++ TW N V D G
Sbjct: 149 PIWANNYQETSYPVAVFVWKAHNPTNAPITLSIMLTWENMVGWFTNALKSPQVQIRDDGS 208
Query: 247 --------LSGHHFNSKTMTKDGVHGLTLHHRTANGRP----PVTFAVAAEETADVHVSE 294
L N + ++ H R A+G P ++++A + V +
Sbjct: 209 PVYEYQPRLGASQGNYNQIVENTDHFSCFLGRVASGEPVGEGDGSWSIATPKHPQVEI-- 266
Query: 295 CPCFLLSGNSKGITAKDMWNEIKKHGSFDHLDNDKTSPSEPGSSIGAAIAASLTIPSGST 354
+ +S G T D+W GS +N +P++ + +GAAIA T+ G T
Sbjct: 267 --FYHTRWDSVG-TGADIWQSFATDGSL--ANNQDATPADENTRLGAAIAIRFTLQPGET 321
Query: 355 RSVTFSLAWDCPEVKFFEKV-YHRRYTKFYGTLGDSAARIAHDAILEHAKWECEIEAWQR 413
+ F+LAWD P +F + Y+RRYT F+G G++A +IA +LE+ W I WQ
Sbjct: 322 LEIPFTLAWDFPVTEFSAGINYYRRYTDFFGRNGENAGKIATTGLLEYQNWRSHILTWQN 381
Query: 414 PILEDKRFPEWYPITLFNELYYLNAGGTIWTDGSPPMQSLATIRERKFSLDTPSKIHTPA 473
PIL+ + P+W+ + LFNELY L +GGT+W+ A
Sbjct: 382 PILDREDLPKWFKMALFNELYDLTSGGTLWS----------------------------A 413
Query: 474 SSDTALGTRLLENGEENIGQFLYLEGAEYVMYNTYDVHFYSSFALVMLFPKLELSIQRDF 533
+S+ + IGQF LE +Y Y + DV Y SFAL+MLFP+LE S+ R F
Sbjct: 414 ASEI-----------DPIGQFAVLECLDYRWYESLDVRLYGSFALLMLFPELEKSVIRAF 462
Query: 534 AAAVMMHDPGTMKIM-------SDGKWVARKCLGAVPHDIGLDDP--WFEINSYNLFNSS 584
A A+ H ++++ +D RK GA PHD+G + W + N + +
Sbjct: 463 ARAIP-HSDDYLRVIGYYYTIGADTTTAVRKVAGATPHDLGAPNEHVWEKTNYTCYQDCN 521
Query: 585 RWKDLNSKFVLQVYRDFVATG--DKNFARAVWPSVYIAMAYMEQFDKDGDGMIENEGFPD 642
WKDL S FVLQVYRDF+ TG D F W ++ + Y++ FD DGDG+ EN G PD
Sbjct: 522 LWKDLGSDFVLQVYRDFIFTGGQDVEFLADCWDAIVQTLDYLKTFDLDGDGIPENSGAPD 581
Query: 643 QTYDAWSANGVSAYCGGLWVAALQAASALANDVGDHASA----------SYFWVRYQKAK 692
QT+D W GVSAYCGGLW+AAL++A A+ + + + A S + Q++
Sbjct: 582 QTFDDWRLQGVSAYCGGLWLAALESAIAICDILTNRRGAENAEKIRSQKSVYQSWLQQSL 641
Query: 693 AVY-DSLWNGSYFNYDNSDGSSSTSIQADQLAGQWYARACGLLPIADEAKVKKALTKIYD 751
+Y + LWNG Y+ D+ GS+ + ADQL GQ+YAR GL I AL +YD
Sbjct: 642 PIYQEKLWNGQYYRLDSESGSN--VVMADQLCGQFYARLLGLPDIVPNDNALSALQTVYD 699
Query: 752 FNVLKVKGGMCGAMNGMQPDGRIDM-SGLQAREIWPGVTYGLAASMIQEEMVDMAFQTAA 810
+K G GA NG++PDG + E+W G+ +GLAA +IQ M + A +
Sbjct: 700 ACFVKFCNGQFGAANGVRPDGSPENPQATHPLEVWTGINFGLAAFLIQMGMKNEAMRLTQ 759
Query: 811 GVYEVAWSEDGLGYSFQTPESWNNNDEYRSLCYMRPLTIWAM 852
V + + G F+TPE+ + +R+ Y+R + IWA+
Sbjct: 760 AVVQQIYDN---GLQFRTPEAITSAGTFRASTYLRAMAIWAI 798
>gi|428200465|ref|YP_007079054.1| putative bile acid beta-glucosidase [Pleurocapsa sp. PCC 7327]
gi|427977897|gb|AFY75497.1| putative bile acid beta-glucosidase [Pleurocapsa sp. PCC 7327]
Length = 808
Score = 410 bits (1053), Expect = e-111, Method: Compositional matrix adjust.
Identities = 281/833 (33%), Positives = 401/833 (48%), Gaps = 127/833 (15%)
Query: 87 QGIPLGGIGAGSIGRSYRGEFQRFKLFHGICDDAPVLANQFSVFVSRPNGEKFSSVLCPR 146
G+PLGG GAG IGRS RG+F + L G + A QFSVF GE + +C
Sbjct: 41 HGMPLGGFGAGCIGRSSRGDFNLWHLDGGEHVFRSLPACQFSVFEQPEGGEAQAYAMCTE 100
Query: 147 SPGVPKKNTDSGIESWDWNLKGENCTYHALFPRAWTVYDGEPDPELRIVCRQISPFIPHN 206
P D + W W K + TYHAL+PR+W Y G EL VC Q SP N
Sbjct: 101 PP------EDGTLSRWLWYPKKKG-TYHALYPRSWYEYKGVFQSEL--VCEQFSPICAGN 151
Query: 207 YKESSFPASVFTFTLSNSGQTSADVTLLFTWANSVA-------------GDSGLSGHHFN 253
Y+E+S+P ++F +T N ++++ TW N+V D G + +
Sbjct: 152 YQETSYPVAIFEWTAHNPTDKPITLSIMLTWQNTVGWFSNAIKSPSIQVRDDGSPEYEYQ 211
Query: 254 SKTMTKDG---------------VHGLTLHH-RTANGRPPVTFAVAAEETADVHVSECPC 297
+ G ++ + LH G + FA T +
Sbjct: 212 PRWGDSTGNFNQWIVDNFRVGCLLNRIRLHEDEIGEGEGQICFASITNPTIEAS------ 265
Query: 298 FLLSGNSKGITAKDMWNEIKKHGSFDHLDNDKTSPSEPGSSIGAAIAASLTIPSGSTRSV 357
+L N G ++W+ +GS D + +P+EPG I AA+A T+ G TR +
Sbjct: 266 YLGRWNPNG-DGSEVWDCFAANGSLP--DTEDETPAEPGEQIAAAMAIRFTVRPGKTRKI 322
Query: 358 TFSLAWDCPEVKFFEKV-YHRRYTKFYGTLGDSAARIAHDAILEHAKWECEIEAWQRPIL 416
F LAWD P +F E V Y RRYT F+G G +A I A+ W+ I+ WQ PIL
Sbjct: 323 PFILAWDLPITEFAEGVKYCRRYTDFFGRTGTNAWAIVRTALKHGDMWKERIKEWQSPIL 382
Query: 417 EDKRFPEWYPITLFNELYYLNAGGTIWTDGSPPMQSLATIRERKFSLDTPSKIHTPASSD 476
+ + P+W+ + LFNELY L GGT+WT +
Sbjct: 383 KREDLPDWFKMALFNELYLLADGGTLWTAATE---------------------------- 414
Query: 477 TALGTRLLENGEENIGQFLYLEGAEYVMYNTYDVHFYSSFALVMLFPKLELSIQRDFAAA 536
++ IGQF LE +Y Y + DV Y SF L+ML+ +L+ ++ FA A
Sbjct: 415 -----------DDPIGQFGVLECMDYRWYESLDVRLYGSFGLMMLWSRLDKAVLEAFARA 463
Query: 537 VMMHDPGTMKIMSDGKWVARKCLGAVPHDIGL--DDPWFEINSYNLFNSSRWKDLNSKFV 594
+ D I + RK GA PHD+G + PW + N + + + WKDL S FV
Sbjct: 464 IPTGDDTPRIIGYNKAPAIRKAKGATPHDLGAPNEHPWEKTNYTSYQDCNLWKDLGSDFV 523
Query: 595 LQVYRDFVATGDKN--FARAVWPSVYIAMAYMEQFDKDGDGMIENEGFPDQTYDAWSANG 652
LQVYRDFV TG K+ F W S+ +AY+++FD+DGDG+ EN G PDQT+D W G
Sbjct: 524 LQVYRDFVLTGAKDVAFLWECWQSIVETLAYLKKFDRDGDGIPENSGAPDQTFDDWRLRG 583
Query: 653 VSAYCGGLWVAALQAA-----------------------SALANDVGDHASASYFWVRYQ 689
+SAYCGGLW+AAL+AA +L N + + + W+ Q
Sbjct: 584 ISAYCGGLWIAALEAAIAIGKILIENPPSNPDLELENYPQSLQNTIQTYQN----WL--Q 637
Query: 690 KAKAVY-DSLWNGSYFNYDNSDGSSSTSIQADQLAGQWYARACGLLPIADEAKVKKALTK 748
A+++Y + LWNG Y++ D+ GS + ADQL GQ+YAR GL + + + AL K
Sbjct: 638 GARSLYHEKLWNGEYYHLDSESGSD--IVMADQLCGQFYARLLGLPDVVERQYAESALKK 695
Query: 749 IYDFNVLKVKGGMCGAMNGMQPDGRI-DMSGLQAREIWPGVTYGLAASMIQEEMVDMAFQ 807
+Y+ LK + G GA NG++PDG D E+W G+ +GLAA ++Q M D AF+
Sbjct: 696 VYEACFLKFQDGKFGAANGVKPDGSPEDPKATHPLEVWTGINFGLAAFLLQMGMKDEAFK 755
Query: 808 TAAGVYEVAWSEDGLGYSFQTPESWNNNDEYRSLCYMRPLTIWAMQWALTKPK 860
A V + G F+TPE+ +R+ Y+R + IW + LT+ K
Sbjct: 756 LAETVVRQVYDN---GLQFRTPEAITAVGTFRASHYLRAMAIWGIYGVLTEFK 805
>gi|126657976|ref|ZP_01729128.1| hypothetical protein CY0110_05157 [Cyanothece sp. CCY0110]
gi|126620614|gb|EAZ91331.1| hypothetical protein CY0110_05157 [Cyanothece sp. CCY0110]
Length = 804
Score = 408 bits (1049), Expect = e-111, Method: Compositional matrix adjust.
Identities = 271/826 (32%), Positives = 398/826 (48%), Gaps = 114/826 (13%)
Query: 87 QGIPLGGIGAGSIGRSYRGEFQRFKLFHGICDDAPVLANQFSVFVSRPNGEKFSSVLCPR 146
G+PLGG GAG IGRS RG+F + L G + + QFS+F +G + L
Sbjct: 41 HGMPLGGFGAGCIGRSPRGDFNLWHLDGGEHIFKSIPSCQFSIFEQLEDGTAQAYALSTE 100
Query: 147 SPGVPKKNTDSGIESWDWNLKGENCTYHALFPRAWTVYDGEPDPELRIVCRQISPFIPHN 206
P D+ + W W + E +Y AL+P +W Y+G E I C Q SP IP+N
Sbjct: 101 PP------EDNTLSRWSWYPQ-EKGSYSALYPCSWYQYEGVFQTE--IYCEQFSPIIPNN 151
Query: 207 YKESSFPASVFTFTLSNSGQTSADVTLLFTWANSVA-------------GDSGLSGHHFN 253
Y+E+S+P VF +T+ N ++++ TW N V D G + +
Sbjct: 152 YQETSYPIGVFEWTIKNPTDKPITLSIMLTWQNIVGWFTNAIKSPEITVRDDGSPEYEYQ 211
Query: 254 SKTMTKDG---------------VHGLTLHHRTANGRPPVTFAVAAEETADVHVSECPCF 298
+ G ++ L H + G + A + +V +
Sbjct: 212 PRWGHSTGNFNQWIQDNFRVGFILNRLQPHEQVQEGEGQICIASVTNPSVEVF------Y 265
Query: 299 LLSGNSKGITAKDMWNEIKKHGSFDHLDNDKTSPSEPGSSIGAAIAASLTIPSGSTRSVT 358
L N G ++W+ +GS D + +P+EPG I AA+A TI G + +
Sbjct: 266 LGKWNPNG-DGSEVWDYFAMNGSLP--DQEDETPAEPGEQIAAAMAIRFTIKPGRVKKIP 322
Query: 359 FSLAWDCPEVKFFEKVY-HRRYTKFYGTLGDSAARIAHDAILEHAKWECEIEAWQRPILE 417
F LAWD P +F + + +RRYT F+G G + + A+ W ++E W IL+
Sbjct: 323 FILAWDLPVTEFAQGIQSYRRYTDFFGRNGQNGWAMVRTALKHSDVWRDQVEEWHNSILK 382
Query: 418 DKRFPEWYPITLFNELYYLNAGGTIWTDGSPPMQSLATIRERKFSLDTPSKIHTPASSDT 477
K P W+ + LFNELY L GGTIWT A+S+T
Sbjct: 383 QKDLPNWFKMALFNELYLLTDGGTIWT----------------------------AASET 414
Query: 478 ALGTRLLENGEENIGQFLYLEGAEYVMYNTYDVHFYSSFALVMLFPKLELSIQRDFAAAV 537
+ IGQF LE +Y Y + DV Y SF L+ML+P+LE ++ FA A+
Sbjct: 415 -----------DPIGQFGVLECMDYRWYESLDVRLYGSFGLLMLWPRLEKAVIEAFARAI 463
Query: 538 MMHDPGTMKIMSDGKWVARKCLGAVPHDIGL--DDPWFEINSYNLFNSSRWKDLNSKFVL 595
D I + RK GA PHD+G + PW + N + + + WKDL S FVL
Sbjct: 464 PSSDDTIRIIGYNQAEGIRKAKGATPHDLGAPNEHPWEKTNYTSYQDCNLWKDLGSDFVL 523
Query: 596 QVYRDFVATG--DKNFARAVWPSVYIAMAYMEQFDKDGDGMIENEGFPDQTYDAWSANGV 653
QVYRDF+ TG D F W +V A+ Y+++FD D DG+ EN G PDQT+D W G+
Sbjct: 524 QVYRDFLLTGADDIEFLWECWEAVPEALNYLKEFDLDNDGIPENSGAPDQTFDDWELRGM 583
Query: 654 SAYCGGLWVAALQAASALANDVGDH-----------------ASASYFWVRYQKAKAVYD 696
SAYCG LW+AAL+AA + + ++ + Q++++VY
Sbjct: 584 SAYCGALWIAALEAAIKIGEILLENRPNNPQLEPKNYPECIEKELDKYQNWLQQSRSVYH 643
Query: 697 S-LWNGSYFNYDNSDGSSSTSIQADQLAGQWYARACGLLPIADEAKVKKALTKIYDFNVL 755
S LWNG Y+ D+ GS + ADQL+GQ+YAR GL + + AL K+Y+ L
Sbjct: 644 STLWNGEYYKLDSESGSDV--VMADQLSGQFYARLLGLPDVVENQYALSALKKVYEACFL 701
Query: 756 KVKGGMCGAMNGMQPDGRI-DMSGLQAREIWPGVTYGLAASMIQEEMVDMAFQTAAGVYE 814
K + G GA NGM+PDG D + +E+W G+ +GLA+ +IQ M D A + V +
Sbjct: 702 KFQNGKYGAANGMKPDGTPEDPNSTHPQEVWTGINFGLASFLIQMGMKDEALKLTEAVVK 761
Query: 815 VAWSEDGLGYSFQTPESWNNNDEYRSLCYMRPLTIWAMQWALTKPK 860
+ E+GL F+TPE+ +R+ Y+R + IW + + L+K K
Sbjct: 762 QVY-ENGL--QFRTPEAITAVGTFRACHYLRAMAIWGIYYQLSKTK 804
>gi|416392732|ref|ZP_11685894.1| hypothetical protein CWATWH0003_2700 [Crocosphaera watsonii WH
0003]
gi|357263592|gb|EHJ12579.1| hypothetical protein CWATWH0003_2700 [Crocosphaera watsonii WH
0003]
Length = 804
Score = 406 bits (1044), Expect = e-110, Method: Compositional matrix adjust.
Identities = 269/823 (32%), Positives = 398/823 (48%), Gaps = 114/823 (13%)
Query: 87 QGIPLGGIGAGSIGRSYRGEFQRFKLFHGICDDAPVLANQFSVFVSRPNGEKFSSVLCPR 146
G+PLGG GAG IGRS RG+F + L G + + QFS+F +G + L
Sbjct: 41 HGMPLGGFGAGCIGRSPRGDFNLWHLDGGEHIFKSIPSCQFSIFEQLEDGTAQAYALATE 100
Query: 147 SPGVPKKNTDSGIESWDWNLKGENCTYHALFPRAWTVYDGEPDPELRIVCRQISPFIPHN 206
P D + W W E TY AL+P +W Y+G E I C Q SP IP N
Sbjct: 101 PP------EDKTLSRWSW-YPTEKGTYSALYPCSWYQYEGVFQAE--IYCEQFSPIIPDN 151
Query: 207 YKESSFPASVFTFTLSNSGQTSADVTLLFTWANSVA-------------GDSGLSGHHFN 253
Y+E+S+P VF +T+ N ++++ TW N V D G + +
Sbjct: 152 YQETSYPIGVFEWTVKNPTDKPITLSIMLTWQNIVGWFTNAIKSPEITVRDDGSPEYEYQ 211
Query: 254 SKTMTKDG---------------VHGLTLHHRTANGRPPVTFAVAAEETADVHVSECPCF 298
+ G ++ L H + G + A + +V +
Sbjct: 212 PRWGHSTGNFNQWVQDNFRVGFILNRLQPHEQVQEGEGQICIASVTNPSVEVF------Y 265
Query: 299 LLSGNSKGITAKDMWNEIKKHGSFDHLDNDKTSPSEPGSSIGAAIAASLTIPSGSTRSVT 358
L N G ++W+ GS D + +P+EPG I AA+A T+ G + +
Sbjct: 266 LGKWNPDG-DGSEVWDYFAMDGSLP--DQEDETPAEPGEQIAAAMAIRFTVKPGRVKKIP 322
Query: 359 FSLAWDCPEVKFFEKV-YHRRYTKFYGTLGDSAARIAHDAILEHAKWECEIEAWQRPILE 417
F LAWD P +F V Y+RRYT F+G G++ + A+ W ++E W IL+
Sbjct: 323 FILAWDLPVTEFTPGVQYYRRYTDFFGRNGNNGWAMVRTALKHSYVWRDKVEEWHNSILK 382
Query: 418 DKRFPEWYPITLFNELYYLNAGGTIWTDGSPPMQSLATIRERKFSLDTPSKIHTPASSDT 477
+ P+W + LFNELY L GGT+WT A+S+T
Sbjct: 383 NDNLPDWLKMALFNELYLLTDGGTLWT----------------------------AASET 414
Query: 478 ALGTRLLENGEENIGQFLYLEGAEYVMYNTYDVHFYSSFALVMLFPKLELSIQRDFAAAV 537
+ +GQF LE +Y Y + DV Y SF L+ML+P+LE ++ FA A+
Sbjct: 415 -----------DPVGQFGVLECLDYCWYESLDVRLYGSFGLLMLWPRLEKAVMEAFARAI 463
Query: 538 MMHDPGTMKIMSDGKWVARKCLGAVPHDIGL--DDPWFEINSYNLFNSSRWKDLNSKFVL 595
D I + RK GA PHD+G + PW + N + + + WKDL S FVL
Sbjct: 464 PSSDDTIRIIGYNQAEGFRKVKGATPHDLGAPNEHPWEKTNYTSYQDCNLWKDLGSDFVL 523
Query: 596 QVYRDFVATG--DKNFARAVWPSVYIAMAYMEQFDKDGDGMIENEGFPDQTYDAWSANGV 653
QVYRDF+ TG D +F W ++ + Y+++FD D DG+ EN G PDQT+D W +G+
Sbjct: 524 QVYRDFLLTGSDDIDFLWECWAAIPETLNYLKEFDLDNDGIPENSGAPDQTFDDWKLSGI 583
Query: 654 SAYCGGLWVAALQAASALANDVGDHA-------------SASYFWVRY----QKAKAVYD 696
SAYCGGLW+AAL+AA + + +++ S +Y Q+++A+Y
Sbjct: 584 SAYCGGLWIAALEAAIKIGEILLENSPNNPQLEPENYPESIEKELDKYNNWLQQSRAIYH 643
Query: 697 S-LWNGSYFNYDNSDGSSSTSIQADQLAGQWYARACGLLPIADEAKVKKALTKIYDFNVL 755
S LWNG Y+N D+ GS + ADQL GQ+YAR GL + + AL K+Y+ L
Sbjct: 644 STLWNGEYYNLDSKSGSD--IVMADQLCGQFYARLLGLSDVVESEYTLSALNKVYEACFL 701
Query: 756 KVKGGMCGAMNGMQPDGRI-DMSGLQAREIWPGVTYGLAASMIQEEMVDMAFQTAAGVYE 814
K + G GA NGM+PDG D + +E+W G+ +G+A+ +IQ M + A + V +
Sbjct: 702 KFQNGKYGAANGMKPDGTPEDPNSTHPQEVWTGINFGIASFLIQMGMKNEALKLTETVVK 761
Query: 815 VAWSEDGLGYSFQTPESWNNNDEYRSLCYMRPLTIWAMQWALT 857
+ E+GL F+TPE+ YR+ Y+R + IW + + L+
Sbjct: 762 QVY-ENGL--QFRTPEAITAVGTYRACHYLRAMAIWGVYYQLS 801
>gi|147812670|emb|CAN61859.1| hypothetical protein VITISV_016692 [Vitis vinifera]
Length = 305
Score = 406 bits (1043), Expect = e-110, Method: Compositional matrix adjust.
Identities = 209/322 (64%), Positives = 247/322 (76%), Gaps = 23/322 (7%)
Query: 1 MENGVKEEGEN----AASMPKVDPGQPPQLTWQRKLDSNVKVPTGFTLSFQEFRHLAHIG 56
MENG KE+G+ +S KV+PG+P LTWQRKL++ T F L +E +HL
Sbjct: 1 MENGHKEDGDMEHPVKSSAHKVNPGKPASLTWQRKLNTKANTLTRFNLKLREIKHL---- 56
Query: 57 FRLYRYSKQEEAKGKVPVFNVFRKHHITSDQGIPLGGIGAGSIGRSYRGEFQRFKLFHGI 116
+ + + F K +TS G+PLGGIG GSIGRSYRGEFQR++LF I
Sbjct: 57 ---------------ISIIDPFSKRLVTSYHGVPLGGIGGGSIGRSYRGEFQRYQLFPRI 101
Query: 117 CDDAPVLANQFSVFVSRPNGEKFSSVLCPRSPGVPKKNTDSGIESWDWNLKGENCTYHAL 176
C+D+PVLANQFSVFVSRPNG+K S+VLCPR+P V K + SGI SWDWNL GE+CTYHAL
Sbjct: 102 CEDSPVLANQFSVFVSRPNGKKSSTVLCPRNPEVLKGSASSGIGSWDWNLDGESCTYHAL 161
Query: 177 FPRAWTVYDGEPDPELRIVCRQISPFIPHNYKESSFPASVFTFTLSNSGQTSADVTLLFT 236
+PRAWTVY+GEPDPE+ I+ QISPFIPHNYKESSFP SVF FTLSNSG+TSAD+TLLFT
Sbjct: 162 YPRAWTVYEGEPDPEISIISSQISPFIPHNYKESSFPVSVFKFTLSNSGKTSADITLLFT 221
Query: 237 WANSVAGDSGLSGHHFNSKTMTKDGVHGLTLHHRTANGRPPVTFAVAAEETADVHVSECP 296
WANSV G S SGHH+NSK TKDGVHG+ LHH+TANG PPVTFA+AAEET DVH+SECP
Sbjct: 222 WANSVGGTSEFSGHHYNSKMKTKDGVHGVLLHHKTANGHPPVTFAIAAEETGDVHISECP 281
Query: 297 CFLLSGNSKGITAKDMWNEIKK 318
CFL+SGNS G+TAK+MW EIK+
Sbjct: 282 CFLISGNSLGVTAKEMWQEIKQ 303
>gi|218437284|ref|YP_002375613.1| hypothetical protein PCC7424_0277 [Cyanothece sp. PCC 7424]
gi|218170012|gb|ACK68745.1| protein of unknown function DUF608 [Cyanothece sp. PCC 7424]
Length = 801
Score = 406 bits (1043), Expect = e-110, Method: Compositional matrix adjust.
Identities = 266/818 (32%), Positives = 389/818 (47%), Gaps = 114/818 (13%)
Query: 87 QGIPLGGIGAGSIGRSYRGEFQRFKLFHGICDDAPVLANQFSVFVSRPNGEKFSSVLCPR 146
G+PLGG GAG IGRS RG+F + L G + QFS+F GE + LC +
Sbjct: 41 HGMPLGGFGAGCIGRSPRGDFNLWHLDGGEHTFRNLPPCQFSIFEHPEGGEAQAYALCTQ 100
Query: 147 SPGVPKKNTDSGIESWDWNLKGENCTYHALFPRAWTVYDGEPDPELRIVCRQISPFIPHN 206
P D + W W + + TYHAL+PR+W Y+G + +I+C Q SP + N
Sbjct: 101 PP------EDGTLSRWSWYPQKKG-TYHALYPRSWYEYEGVF--QSQIICEQYSPILADN 151
Query: 207 YKESSFPASVFTFTLSNSGQTSADVTLLFTWANSVA-------------GDSGLSGHHFN 253
Y+ESS+P ++F +T+ N ++++ +W N+V D G + +
Sbjct: 152 YQESSYPLAIFEWTIYNPSDRPVTISIMMSWQNTVGWFSNAIKTPEVKVRDDGSPVYEYQ 211
Query: 254 SKTMTKDG---------------VHGLTLHHRTANGRPPVTFAVAAEETADVHVSECPCF 298
+ G ++ + H + G V A +V +
Sbjct: 212 PRWGDSTGNFNQWIVDNYRVGCLLNRIQPHPQVEEGEGQVCIASILNPNLEVFYNT---- 267
Query: 299 LLSGNSKGITAKDMWNEIKKHGSFDHLDNDKTSPSEPGSSIGAAIAASLTIPSGSTRSVT 358
N +G +W+ +GS N+ SP+ PG I AA+A TIP G TR +
Sbjct: 268 --RWNPQG-DGSQIWDYFAMNGSLGDYQNE--SPAAPGEQIAAAMAIRFTIPPGKTRKIP 322
Query: 359 FSLAWDCPEVKFFEKV-YHRRYTKFYGTLGDSAARIAHDAILEHAKWECEIEAWQRPILE 417
F LAWD P +F + + Y+RRYT F+G G + + A+ H W I WQ PIL+
Sbjct: 323 FILAWDLPITEFAQGINYYRRYTDFFGRNGQNVWSVVRTALKHHEMWRDRIIEWQAPILK 382
Query: 418 DKRFPEWYPITLFNELYYLNAGGTIWTDGSPPMQSLATIRERKFSLDTPSKIHTPASSDT 477
K P+W+ + LFNELY L GGT+WT +
Sbjct: 383 QKELPDWFKMALFNELYLLADGGTLWTAATE----------------------------- 413
Query: 478 ALGTRLLENGEENIGQFLYLEGAEYVMYNTYDVHFYSSFALVMLFPKLELSIQRDFAAAV 537
++ +GQF LE +Y Y + DV Y SF L ML+P+L+ ++ FA A+
Sbjct: 414 ----------DDPVGQFAVLECIDYRWYESIDVRLYGSFGLGMLWPRLDKAVMEAFARAI 463
Query: 538 MMHDPGTMKIMSDGKWVARKCLGAVPHDIGL--DDPWFEINSYNLFNSSRWKDLNSKFVL 595
D I +G RK GA PHD+G + PW + N + + + WKDL FVL
Sbjct: 464 PKGDDTPRIIGYNGAKAIRKAKGATPHDLGAPNEHPWEKTNYTSYQDCNLWKDLGCDFVL 523
Query: 596 QVYRDFVATG--DKNFARAVWPSVYIAMAYMEQFDKDGDGMIENEGFPDQTYDAWSANGV 653
+YR F+ TG +++F W SV +AY+++FD DGDG+ EN G PDQT+D W G+
Sbjct: 524 LLYRSFLWTGANNQDFLWEHWDSVTQTIAYVKKFDLDGDGIPENSGAPDQTFDDWQLRGI 583
Query: 654 SAYCGGLWVAALQAA------------------SALANDVGDHASASYFWVRYQKAKAVY 695
SAYCG LW+A L+AA S A + + A A+Y +Q +
Sbjct: 584 SAYCGALWIAGLEAAIKIGETLLQRPPMNPQLQSENAPEKIEIAIATYRQWLHQARALYH 643
Query: 696 DSLWNGSYFNYDNSDGSSSTSIQADQLAGQWYARACGLLPIADEAKVKKALTKIYDFNVL 755
D+LW G Y+ D+ GS + +DQL GQ+YAR GL + + AL KIY+ L
Sbjct: 644 DTLWTGEYYRLDSESGSD--VVMSDQLCGQFYARLLGLPDVVEPQYTLSALRKIYESCFL 701
Query: 756 KVKGGMCGAMNGMQPDGRIDM-SGLQAREIWPGVTYGLAASMIQEEMVDMAFQTAAGVYE 814
+ GG GA NG++PDG + E+W G+ +GLAA MIQ M V +
Sbjct: 702 QFYGGKYGAANGVRPDGSPENPDATHPLEVWTGINFGLAAFMIQMGMKQEGLSITEAVVK 761
Query: 815 VAWSEDGLGYSFQTPESWNNNDEYRSLCYMRPLTIWAM 852
+ E+GL F+TPE+ +R+ Y+R + IWA+
Sbjct: 762 QIY-ENGL--QFRTPEAITAAGTFRASHYLRAMAIWAI 796
>gi|307151416|ref|YP_003886800.1| glucosylceramidase [Cyanothece sp. PCC 7822]
gi|306981644|gb|ADN13525.1| Glucosylceramidase [Cyanothece sp. PCC 7822]
Length = 805
Score = 405 bits (1042), Expect = e-110, Method: Compositional matrix adjust.
Identities = 267/820 (32%), Positives = 394/820 (48%), Gaps = 118/820 (14%)
Query: 87 QGIPLGGIGAGSIGRSYRGEFQRFKLFHGICDDAPVLANQFSVFVSRPNGEKFSSVLCPR 146
G+PLGG GAG IGRS +G+F + L G + A QFS+F GE + L
Sbjct: 41 HGVPLGGFGAGCIGRSPKGDFNLWHLDGGEHIFRTLAACQFSIFEQPEGGEAQAYALSTE 100
Query: 147 SPGVPKKNTDSGIESWDWNLKGENCTYHALFPRAWTVYDGEPDPELRIVCRQISPFIPHN 206
P D + W W E TYHAL+PR+W Y+G + +I+C Q SP N
Sbjct: 101 PP------EDGTLSRWSW-YPTEKGTYHALYPRSWYEYEGVF--QTKIICEQYSPIWAGN 151
Query: 207 YKESSFPASVFTFTLSNSGQTSADVTLLFTWANSVA-------------GDSGLSGHHFN 253
Y+E+S+P ++F +T+ N ++++ +W N+V D G + +
Sbjct: 152 YQETSYPVAIFAWTIHNPTDRPITLSIMMSWQNTVGWFSNAIKTPEVKVRDDGSPVYEYQ 211
Query: 254 SKTMTKDG---------------VHGLTLHHRTANGRPPVTFAVAAEETADVHVSECPCF 298
+ G ++ + H + G + A + +V +
Sbjct: 212 PRWGDSTGNYNQWIVDNFRVGCLLNRIQPHPQVQEGEGQLCIASILNPSVEVFYNT---- 267
Query: 299 LLSGNSKGITAKDMWNEIKKHGSFDHLDNDKTSPSEPGSSIGAAIAASLTIPSGSTRSVT 358
N G ++W+ +G +N+ S + PG I AA+A TIP G TR +
Sbjct: 268 --RWNPMG-DGSEIWDYFAMNGMLGDYENE--SAAAPGEQIAAAMAIRFTIPPGKTRLIP 322
Query: 359 FSLAWDCPEVKFFEKV-YHRRYTKFYGTLGDSAARIAHDAILEHAKWECEIEAWQRPILE 417
F LAWD P +F + + Y+RRYT F+G G + + A+ W I WQ PIL+
Sbjct: 323 FILAWDLPITEFSQGINYYRRYTDFFGRNGQNVWSVVRTALKNVDIWRDRIIDWQSPILK 382
Query: 418 DKRFPEWYPITLFNELYYLNAGGTIWTDGSPPMQSLATIRERKFSLDTPSKIHTPASSDT 477
D+ P+W+ + LFNELY L GGT+WT +
Sbjct: 383 DQDLPDWFKMALFNELYLLADGGTLWTAATE----------------------------- 413
Query: 478 ALGTRLLENGEENIGQFLYLEGAEYVMYNTYDVHFYSSFALVMLFPKLELSIQRDFAAAV 537
++ +GQF LE +Y Y + DV Y SFAL ML+P+L+ ++ FA A+
Sbjct: 414 ----------DDPVGQFAVLECIDYRWYESLDVRLYGSFALGMLWPRLDKAVMEAFARAI 463
Query: 538 MMHDPGTMKIMSDGKWVARKCLGAVPHDIGL--DDPWFEINSYNLFNSSRWKDLNSKFVL 595
D I +G RK GA PHD+G + PW + N + + + WKDL S FVL
Sbjct: 464 PQADDTPRIIGYNGAKAIRKAKGATPHDLGAPNEHPWEKTNYTSYQDCNLWKDLGSDFVL 523
Query: 596 QVYRDFVATGDKN--FARAVWPSVYIAMAYMEQFDKDGDGMIENEGFPDQTYDAWSANGV 653
+YR F+ TG N + W SV +AY++ FD DGDG+ EN G PDQT+D W G+
Sbjct: 524 LLYRSFLWTGGNNQDYLWEHWHSVTETLAYLKAFDLDGDGIPENSGAPDQTFDDWQLRGI 583
Query: 654 SAYCGGLWVAALQAASAL-------------------ANDVGDHASASYFWVRYQKAKAV 694
SAYCG LW+AAL+AA + A + + + W+ Q+A+++
Sbjct: 584 SAYCGALWIAALEAAIKMGEILLQRPPMNPQLQPQNAAESIEEAITTYRHWL--QQARSL 641
Query: 695 Y-DSLWNGSYFNYDNSDGSSSTSIQADQLAGQWYARACGLLPIADEAKVKKALTKIYDFN 753
Y D+LWNG Y+ D+ GS + +DQL GQ+YAR GL + + AL KIY+
Sbjct: 642 YHDTLWNGEYYRLDSESGSD--VVMSDQLCGQFYARLLGLPDVVEPQYTTSALRKIYEAC 699
Query: 754 VLKVKGGMCGAMNGMQPDGRIDM-SGLQAREIWPGVTYGLAASMIQEEMVDMAFQTAAGV 812
LK GG GA NG++PDG + E+W G+ +GLAA MIQ M + AF+ V
Sbjct: 700 FLKFHGGKYGAANGVKPDGTPENPDATHPLEVWTGINFGLAAFMIQMGMKEEAFKITEAV 759
Query: 813 YEVAWSEDGLGYSFQTPESWNNNDEYRSLCYMRPLTIWAM 852
+ + E+GL F+TPE+ +R+ Y+R + IWA+
Sbjct: 760 VKQVY-ENGL--QFRTPEAITAVGTFRASHYLRAMAIWAV 796
>gi|428308994|ref|YP_007119971.1| bile acid beta-glucosidase [Microcoleus sp. PCC 7113]
gi|428250606|gb|AFZ16565.1| putative bile acid beta-glucosidase [Microcoleus sp. PCC 7113]
Length = 819
Score = 405 bits (1042), Expect = e-110, Method: Compositional matrix adjust.
Identities = 276/841 (32%), Positives = 395/841 (46%), Gaps = 133/841 (15%)
Query: 88 GIPLGGIGAGSIGRSYRGEFQRFKLFHGICDDAPVLANQFSVFVSRPNGEKFSSVLCPRS 147
G+PLGG GAG IGRS RG+F + L G + A QFSVF N K + LC
Sbjct: 42 GMPLGGFGAGCIGRSPRGDFNLWHLDGGEHSFKSLPACQFSVFEQSENAPKQAYALCTEP 101
Query: 148 PGVPKKNTDSGIESWDW-----NLKGENCTYHALFPRAWTVYDGEPDPELRIVCRQISPF 202
P D + +W W KG TY AL+P++W Y+ EL C Q SP
Sbjct: 102 P------QDGTLGTWQWYPTSQQAKGNTGTYQALYPQSWFTYENVFQTEL--TCEQFSPI 153
Query: 203 IPHNYKESSFPASVFTFTLSNSGQTSADVTLLFTWANSVA-------------GDSGLSG 249
Y+E+S+P +VF +T N ++++ TW N V D G
Sbjct: 154 WAGCYQETSYPVAVFEWTAHNPTDAPITLSIMLTWQNIVGWFTNAIKSPQVKVRDDGSPV 213
Query: 250 HHFNSKTMTKDGVHG---------------LTLHHRTA-----------NGRPPVTFAVA 283
+ + + G + + LH A G + FA
Sbjct: 214 YEYQPRWGESTGSYNQWIVDNFRVGCILDRVRLHDSEAVADFRGASEIQEGEGQLCFATV 273
Query: 284 AEETADV--HVSECPCFLLSGNSKGITAKDMWNEIKKHGSFDHLDNDKTSPSEPGSSIGA 341
+ +V H P +GN ++W GS + ++ + + PG I A
Sbjct: 274 TNPSMEVFYHTRWNP----AGN-----GAEVWQSFAADGSLSDMQDE--TAAAPGEQIAA 322
Query: 342 AIAASLTIPSGSTRSVTFSLAWDCPEVKFFEKV-YHRRYTKFYGTLGDSAARIAHDAILE 400
AIA TI G TR + F LAWD P +F V Y+RRYT F+G G++A I A+
Sbjct: 323 AIAIRFTIRPGRTRKIPFILAWDFPVTEFSPGVNYYRRYTDFFGRTGNNAWSIVRTALKH 382
Query: 401 HAKWECEIEAWQRPILEDKRFPEWYPITLFNELYYLNAGGTIWTDGSPPMQSLATIRERK 460
W+ +I++WQ PIL+ + P W+ + LFNELY L GGT+WT
Sbjct: 383 ADLWKEKIQSWQEPILKREDLPHWFKMALFNELYLLADGGTLWT---------------- 426
Query: 461 FSLDTPSKIHTPASSDTALGTRLLENGEENIGQFLYLEGAEYVMYNTYDVHFYSSFALVM 520
A+ D + IGQF LE +Y Y + DV Y SFAL M
Sbjct: 427 ------------AADD-----------RDPIGQFGVLECLDYRWYESLDVRLYGSFALAM 463
Query: 521 LFPKLELSIQRDFAAAVMMHDPGTMKIMSDGKWVARKCLGAVPHDIGL--DDPWFEINSY 578
L+P+L+ ++ FA A+ + D I + RK A PHD+G + PW + N
Sbjct: 464 LWPRLDKAVLEAFARAIPIGDKTPRIIGYNQAQAIRKAADATPHDLGAPNEHPWEQTNYT 523
Query: 579 NLFNSSRWKDLNSKFVLQVYRDFVATG--DKNFARAVWPSVYIAMAYMEQFDKDGDGMIE 636
+ + ++WKDL FVLQVYRDFV TG D F W SV +AY++ FDKDGDG+ E
Sbjct: 524 SYQDCNQWKDLPCDFVLQVYRDFVFTGANDTEFLWECWSSVVKTLAYLKTFDKDGDGIPE 583
Query: 637 NEGFPDQTYDAWSANGVSAYCGGLWVAALQAASALANDVGDHASASYFWVR--------- 687
N G PDQT+D W G+SAYCGGLW+AAL+AA A+ + +H + +
Sbjct: 584 NSGAPDQTFDDWRLQGISAYCGGLWLAALEAAIAMGEILINHYPPTSELIEVPDPESIQE 643
Query: 688 --------YQKAKAVY-DSLWNGSYFNYDNSDGSSSTSIQADQLAGQWYARACGLLPIAD 738
++++ +Y + LWNG Y+ D+ GS + DQL GQ+YA+ GL I
Sbjct: 644 TLKLYRQWLEQSQPIYQEKLWNGQYYRLDSESGSD--IVMTDQLCGQFYAQLLGLPDIVP 701
Query: 739 EAKVKKALTKIYDFNVLKVKGGMCGAMNGMQPDGRIDM-SGLQAREIWPGVTYGLAASMI 797
V+ AL +YD LK + GA NG++PDG + E+W G+ +GLAA ++
Sbjct: 702 PECVQSALKSVYDSCFLKFQDAQLGAANGVRPDGSPENPDATHPLEVWTGINFGLAAFLV 761
Query: 798 QEEMVDMAFQTAAGVYEVAWSEDGLGYSFQTPESWNNNDEYRSLCYMRPLTIWAMQWALT 857
Q+ M A Q V + E+GL F+TPE+ +R+ Y+R + IWA+ + +
Sbjct: 762 QQGMKQEALQLTEAVVRQVY-ENGL--QFRTPEAITAAGTFRASHYLRAMAIWAVYFMIN 818
Query: 858 K 858
+
Sbjct: 819 Q 819
>gi|298490121|ref|YP_003720298.1| glucosylceramidase ['Nostoc azollae' 0708]
gi|298232039|gb|ADI63175.1| Glucosylceramidase ['Nostoc azollae' 0708]
Length = 806
Score = 405 bits (1040), Expect = e-110, Method: Compositional matrix adjust.
Identities = 270/822 (32%), Positives = 403/822 (49%), Gaps = 116/822 (14%)
Query: 87 QGIPLGGIGAGSIGRSYRGEFQRFKLFHGICDDAPVLANQFSVFVSRPNGEKFSSVLCPR 146
G+PLGG GAG IGRS RG+F + + G + QFSVF S + K ++
Sbjct: 40 HGMPLGGFGAGCIGRSSRGDFNLWHIDGGEHLFKSFPSCQFSVFESTGSSSKAYAL---- 95
Query: 147 SPGVPKKNTDSGIESWDW----NLKGENCTYHALFPRAWTVYDGEPDPELRIVCRQISPF 202
S +P+ D ++SW W TYHAL+PR+W VY E + ++ C Q SP
Sbjct: 96 STEIPE---DGTLQSWQWYPPSTTTQATGTYHALYPRSWFVY--ENVLQTQLTCEQFSPI 150
Query: 203 IPHNYKESSFPASVFTFTLSNSGQTSADVTLLFTWANSVA-------------GDSGLSG 249
+NY+E+S+P +VF + N ++++ TW N V D G
Sbjct: 151 WANNYQETSYPVAVFVWKAHNPTNAPITLSIMLTWENIVGWFTNALKSPDLQIRDDGSPV 210
Query: 250 HHFNSKTMTKDGVHGLTLHH---------RTANGRPPV----TFAVAAEETADVHVSECP 296
+ + + G + + + R G P ++ +A + + V
Sbjct: 211 YEYQPRLGESKGNYNHIVENTEFFGCFLGRVPTGLPIQEGDGSWCIATPKNPRMEVFYHT 270
Query: 297 CFLLSGNSKGITAKDMWNEIKKHGSFDHLDNDKTSPSEPGSSIGAAIAASLTIPSGSTRS 356
F N G + D+W G + N +P++ + +GAA+A T+ G T
Sbjct: 271 HF----NPVG-SGADLWESFSTDGYLPNYLN--ATPADADTRLGAALAVRCTLQPGETLE 323
Query: 357 VTFSLAWDCPEVKFFEKV-YHRRYTKFYGTLGDSAARIAHDAILEHAKWECEIEAWQRPI 415
+ F+LAWD P +F E + Y+RRYT F+ GD+A I + ++ W +I+ WQ+PI
Sbjct: 324 IPFTLAWDFPVTEFAEGINYYRRYTDFFSRYGDNAFAITSRGLEQYQVWRAQIQNWQKPI 383
Query: 416 LEDKRFPEWYPITLFNELYYLNAGGTIWTDGSPPMQSLATIRERKFSLDTPSKIHTPASS 475
L+ + PEW+ + LFNELY L +GGT+W+ A+S
Sbjct: 384 LDREDLPEWFKMALFNELYDLTSGGTLWS----------------------------AAS 415
Query: 476 DTALGTRLLENGEENIGQFLYLEGAEYVMYNTYDVHFYSSFALVMLFPKLELSIQRDFAA 535
+ + IGQF LE +Y Y + DV Y SFAL++LFP+LE S+ R FA
Sbjct: 416 EI-----------DPIGQFAVLECLDYRWYESLDVRLYGSFALLILFPELEKSVIRAFAR 464
Query: 536 AVMMHDPGT------MKIMSDGKWVARKCLGAVPHDIGLDDPW-FEINSYNLF-NSSRWK 587
A+ D I ++ K RK A PHD+G + W +E +Y + + + WK
Sbjct: 465 AIPHSDDHLRVIGYYYTIGAETKKAIRKVAKATPHDLGAPNEWVWEKTNYTCYQDCNLWK 524
Query: 588 DLNSKFVLQVYRDFVATG--DKNFARAVWPSVYIAMAYMEQFDKDGDGMIENEGFPDQTY 645
DL+S FVLQVYRDF+ TG D F W ++ + Y++ FD DGDG+ EN G PDQT+
Sbjct: 525 DLSSDFVLQVYRDFLFTGSHDVQFLVDCWDAIVQTLDYLKTFDLDGDGIPENSGAPDQTF 584
Query: 646 DAWSANGVSAYCGGLWVAALQAASALANDVGDH-------------ASASYFWVRYQKAK 692
D W NGVSAYCGGLW+AAL++A A+A+ + +H + S + Q+++
Sbjct: 585 DDWLLNGVSAYCGGLWLAALESAIAIADILTNHRGTENTEKTDKLRSQKSIYQSWLQQSR 644
Query: 693 AVYDS-LWNGSYFNYDNSDGSSSTSIQADQLAGQWYARACGLLPIADEAKVKKALTKIYD 751
+Y+ LWNG Y+ D+ GS + ADQL GQ+Y R GL I AL +Y+
Sbjct: 645 PIYEQKLWNGQYYRLDSESGSD--VVMADQLCGQFYTRLLGLPDIVPSNHAHSALQTVYE 702
Query: 752 FNVLKVKGGMCGAMNGMQPDGRIDM-SGLQAREIWPGVTYGLAASMIQEEMVDMAFQTAA 810
+K G GA NG+ PDG + E+W G+ +GLAA ++Q M + A Q
Sbjct: 703 ACFVKFYDGKFGAANGVLPDGSPENPKATHPLEVWTGINFGLAAFLVQMGMKNEAMQLTQ 762
Query: 811 GVYEVAWSEDGLGYSFQTPESWNNNDEYRSLCYMRPLTIWAM 852
V E + G F+TPE+ +R+ Y+RP+ IWA+
Sbjct: 763 AVVEQIYDN---GLQFRTPEAITPVGTFRASTYLRPMAIWAI 801
>gi|284929778|ref|YP_003422300.1| putative bile acid beta-glucosidase [cyanobacterium UCYN-A]
gi|284810222|gb|ADB95919.1| predicted bile acid beta-glucosidase [cyanobacterium UCYN-A]
Length = 803
Score = 404 bits (1039), Expect = e-109, Method: Compositional matrix adjust.
Identities = 270/827 (32%), Positives = 404/827 (48%), Gaps = 120/827 (14%)
Query: 87 QGIPLGGIGAGSIGRSYRGEFQRFKLFHGICDDAPVLANQFSVFVSRPNGEKFSSVLCPR 146
G+P+GG GAG IGRS G+F + L G + + QFS+F +G+ + L +
Sbjct: 41 HGMPIGGFGAGCIGRSSAGDFNLWHLDGGEHVFKTISSCQFSIFEELKDGKTKAYALSTK 100
Query: 147 SPGVPKKNTDSGIESWDWNLKGENCTYHALFPRAWTVYDGEPDPELRIVCRQISPFIPHN 206
P D + W+W K + TY+AL+PR+W Y+G E +IVC Q SP IP N
Sbjct: 101 EP------EDKTLSKWNWYPKNKG-TYYALYPRSWYQYEGIF--EAKIVCEQFSPIIPDN 151
Query: 207 YKESSFPASVFTFTLSNSGQTSADVTLLFTWANSV------------------------A 242
Y+E+S+P +F +T+SN ++++ TW N + +
Sbjct: 152 YQETSYPIGIFEWTVSNPTDQPITLSIMLTWQNMIGWFTNSIKTPKIEVRDDGSPEYEYS 211
Query: 243 GDSGLSGHHFNS--KTMTKDG--VHGLTLHHRTANGRPPVTFAVAAEETADVHVSECPCF 298
G S +FN ++ ++ G + L ++ + G + FA A + D+ S +
Sbjct: 212 PQLGFSQGNFNKWKQSSSQFGFILDRLRVNDSISEGEGQICFATAHD--PDLEFSYLGKW 269
Query: 299 LLSGNSKGITAKDMWNEIKKHGSF-DHLDNDKTSPSEPGSSIGAAIAASLTIPSGSTRSV 357
SGN ++W+ G+ + LD P E I AA+A I G + +
Sbjct: 270 NPSGN-----GSEVWDHFSASGTLPNTLDETIAKPDE---QIAAAMAVKFVIKPGEVKKI 321
Query: 358 TFSLAWDCPEVKF-FEKVYHRRYTKFYGTLGDSAARIAHDAILEHAKWECEIEAWQRPIL 416
F+LAWD P +F YHRRYT F+G G +A IA + W +I +Q PIL
Sbjct: 322 PFTLAWDLPITEFSLGNQYHRRYTDFFGINGKNAWMIAETGLKSLNIWREQILEYQLPIL 381
Query: 417 EDKRFPEWYPITLFNELYYLNAGGTIWTDGSPPMQSLATIRERKFSLDTPSKIHTPASSD 476
DK P+W+ + LFNELY L GGT+WT T SK +
Sbjct: 382 NDKDLPDWFKMALFNELYLLTDGGTLWT--------------------TSSKSYP----- 416
Query: 477 TALGTRLLENGEENIGQFLYLEGAEYVMYNTYDVHFYSSFALVMLFPKLELSIQRDFAAA 536
+G+F LE +Y Y + DV Y SF L+ L+P LE ++ + FA +
Sbjct: 417 --------------VGKFGVLECIDYRWYESLDVRLYGSFGLLALWPSLEKAVIKTFAES 462
Query: 537 VMMHDPGTMKIMSDGKWVARKCLGAVPHDIGL--DDPWFEINSYNLFNSSRWKDLNSKFV 594
+ D I D RK GAVPHD+G + PW E N + + + WKDL S FV
Sbjct: 463 ISKSDHTLRLIGYDRSLAPRKVKGAVPHDLGAPNEHPWEETNYTSYQDCNLWKDLGSDFV 522
Query: 595 LQVYRDFVATG--DKNFARAVWPSVYIAMAYMEQFDKDGDGMIENEGFPDQTYDAWSANG 652
LQVYRDF+ T D NF + WP++ + Y+++FD D DG+ EN G PDQT+D W +G
Sbjct: 523 LQVYRDFLLTDSQDINFLQECWPAIVETLDYLKKFDLDDDGIPENSGAPDQTFDDWQLHG 582
Query: 653 VSAYCGGLWVAALQAASALA---------NDVGDHASASYF----------WVRYQKAKA 693
+S+YCG LW+ AL+AA + N D ++ F W++ K++
Sbjct: 583 ISSYCGALWITALEAAIGIGKILLKNSENNSHPDLQNSLAFIQENLTKYDDWLK--KSRL 640
Query: 694 VY-DSLWNGSYFNYDNSDGSSSTSIQADQLAGQWYARACGLLPIADEAKVKKALTKIYDF 752
+Y ++LWNG Y+ D+ GS +I DQL GQ+YAR L + D + AL+KIY+
Sbjct: 641 IYHETLWNGEYYQLDSKSGSD--TIMTDQLCGQFYARLLNLPDVVDTSCTISALSKIYEA 698
Query: 753 NVLKVKGGMCGAMNGMQPDGRIDM-SGLQAREIWPGVTYGLAASMIQEEMVDMAFQTAAG 811
L G GA NG++ DG + + +E+W G+ +GLAA +IQ M + A +
Sbjct: 699 CFLNFYDGQYGAANGVRSDGTPENPNSTHPQEVWVGINFGLAAFLIQMGMKNQALELTKV 758
Query: 812 VYEVAWSEDGLGYSFQTPESWNNNDEYRSLCYMRPLTIWAMQWALTK 858
V E ++ G F+TPE+ +R+ Y+R + IW + + L K
Sbjct: 759 VVEQIYAN---GLQFRTPEAITPVGTFRASHYLRAMAIWGVYYQLKK 802
>gi|67922703|ref|ZP_00516206.1| Protein of unknown function DUF608 [Crocosphaera watsonii WH 8501]
gi|67855413|gb|EAM50669.1| Protein of unknown function DUF608 [Crocosphaera watsonii WH 8501]
Length = 804
Score = 404 bits (1037), Expect = e-109, Method: Compositional matrix adjust.
Identities = 268/823 (32%), Positives = 396/823 (48%), Gaps = 114/823 (13%)
Query: 87 QGIPLGGIGAGSIGRSYRGEFQRFKLFHGICDDAPVLANQFSVFVSRPNGEKFSSVLCPR 146
G+PLGG GAG IGRS RG+F + L G + + QFS+F +G + L
Sbjct: 41 HGMPLGGFGAGCIGRSPRGDFNLWHLDGGEHIFKSIPSCQFSIFEQLEDGTAQAYALATE 100
Query: 147 SPGVPKKNTDSGIESWDWNLKGENCTYHALFPRAWTVYDGEPDPELRIVCRQISPFIPHN 206
P D + W W E TY AL+P +W Y+G E I C Q SP IP N
Sbjct: 101 PP------EDKTLSRWSW-YPTEKGTYSALYPCSWYQYEGVFQAE--IYCEQFSPIIPDN 151
Query: 207 YKESSFPASVFTFTLSNSGQTSADVTLLFTWANSVA-------------GDSGLSGHHFN 253
Y+E+S+P VF +T+ N ++++ TW N V D G + +
Sbjct: 152 YQETSYPIGVFEWTVKNPTDKPITLSIMLTWQNIVGWFTNAIKSPEITVRDDGSPEYEYQ 211
Query: 254 SKTMTKDG---------------VHGLTLHHRTANGRPPVTFAVAAEETADVHVSECPCF 298
+ G ++ L H + G + A + +V F
Sbjct: 212 PRWGHSTGNFNQWVQDNFRVGFILNRLQPHEQVQEGEGQICIASVTNPSVEV-------F 264
Query: 299 LLSGNSKGITAKDMWNEIKKHGSFDHLDNDKTSPSEPGSSIGAAIAASLTIPSGSTRSVT 358
L + ++W+ GS D + +P+EPG I AA+A T+ G + +
Sbjct: 265 NLGKWNPDGDGSEVWDYFAMDGSLP--DQEDETPAEPGEQIAAAMAIRFTVKPGRVKKIP 322
Query: 359 FSLAWDCPEVKFFEKV-YHRRYTKFYGTLGDSAARIAHDAILEHAKWECEIEAWQRPILE 417
F LAWD P +F V Y+RRYT F+G G++ + A+ W ++E W IL+
Sbjct: 323 FILAWDLPVTEFTPGVQYYRRYTDFFGRNGNNGWAMVRTALKHSDVWRDKVEEWHNSILK 382
Query: 418 DKRFPEWYPITLFNELYYLNAGGTIWTDGSPPMQSLATIRERKFSLDTPSKIHTPASSDT 477
+ P+W + LFNELY L GGT+WT A+S+T
Sbjct: 383 NDNLPDWLKMALFNELYLLTDGGTLWT----------------------------AASET 414
Query: 478 ALGTRLLENGEENIGQFLYLEGAEYVMYNTYDVHFYSSFALVMLFPKLELSIQRDFAAAV 537
+ +GQF LE +Y Y + DV Y SF L+ML+P LE ++ FA A+
Sbjct: 415 -----------DPVGQFGVLECLDYCWYESLDVRLYGSFGLLMLWPGLEKAVMEAFARAI 463
Query: 538 MMHDPGTMKIMSDGKWVARKCLGAVPHDIGL--DDPWFEINSYNLFNSSRWKDLNSKFVL 595
D I + RK GA PHD+G + PW + N + + + WKDL S FVL
Sbjct: 464 PSSDDTIRIIGYNQAEGFRKVKGATPHDLGAPNEHPWEKTNYTSYQDCNLWKDLGSDFVL 523
Query: 596 QVYRDFVATG--DKNFARAVWPSVYIAMAYMEQFDKDGDGMIENEGFPDQTYDAWSANGV 653
QVYRDF+ TG D +F W ++ + Y+++FD D DG+ EN G PDQT+D W +G+
Sbjct: 524 QVYRDFLLTGSDDIDFLWECWAAIPETLNYLKEFDLDNDGIPENSGAPDQTFDDWKLSGI 583
Query: 654 SAYCGGLWVAALQAASALANDVGDHA-------------SASYFWVRY----QKAKAVYD 696
SAYCGGLW+AAL+AA + + +++ S +Y Q+++A+Y
Sbjct: 584 SAYCGGLWIAALEAAIKIGEILLENSPNNPQLEPENYPESIEKELDKYNNWLQQSRAIYH 643
Query: 697 S-LWNGSYFNYDNSDGSSSTSIQADQLAGQWYARACGLLPIADEAKVKKALTKIYDFNVL 755
S LWNG Y+N D+ GS + ADQL GQ+YAR GL + + AL K+Y+ L
Sbjct: 644 STLWNGEYYNLDSKSGSD--IVMADQLCGQFYARLLGLSDVVESEYTLSALNKVYEACFL 701
Query: 756 KVKGGMCGAMNGMQPDGRI-DMSGLQAREIWPGVTYGLAASMIQEEMVDMAFQTAAGVYE 814
K + G GA NGM+PDG D + +E+W G+ +G+A+ +IQ M + A + V +
Sbjct: 702 KFQNGKYGAANGMKPDGTPEDPNSTHPQEVWTGINFGIASFLIQMGMKNEALKLTETVVK 761
Query: 815 VAWSEDGLGYSFQTPESWNNNDEYRSLCYMRPLTIWAMQWALT 857
+ E+GL F+TPE+ YR+ Y+R + IW + + L+
Sbjct: 762 QVY-ENGL--QFRTPEAITAVGTYRACHYLRAMAIWGVYYQLS 801
>gi|428209901|ref|YP_007094254.1| glucosylceramidase [Chroococcidiopsis thermalis PCC 7203]
gi|428011822|gb|AFY90385.1| Glucosylceramidase [Chroococcidiopsis thermalis PCC 7203]
Length = 812
Score = 403 bits (1036), Expect = e-109, Method: Compositional matrix adjust.
Identities = 271/828 (32%), Positives = 399/828 (48%), Gaps = 125/828 (15%)
Query: 87 QGIPLGGIGAGSIGRSYRGEFQRFKLFHGICDDAPVLANQFSVF--VSRPNGEKFSSVLC 144
G+PLGG GAG IGRS RG+F +++ G P+ A QFSVF VS + + + LC
Sbjct: 41 HGMPLGGFGAGCIGRSSRGDFNVWQVDGGEHIFQPMPACQFSVFESVSERSASRQAYALC 100
Query: 145 PRSPGVPKKNTDSGIESWDWN-LKGEN----CTYHALFPRAWTVYDGEPDPELRIVCRQI 199
+P D G+++W W +GE TYHAL+PR+W VY+G +L C Q
Sbjct: 101 TEAP-------DDGLQNWQWYPSRGEQHAHTGTYHALYPRSWFVYEGVFQSQL--TCEQF 151
Query: 200 SPFIPHNYKESSFPASVFTFTLSNSGQTSADVTLLFTWANSVA-------------GDSG 246
SP +NY+ESS+P VF +T N ++++ +W N D G
Sbjct: 152 SPIWANNYQESSYPVGVFVWTAHNPTNAPMTLSIMLSWQNMTGWFTNASKSSEVKIRDDG 211
Query: 247 LSGHHFNSKTMTKDGVHGLTLHHRTANGRPPVTFAVAAEETADVHVSECPCFLLSGNSKG 306
+ + S+ G + V + + + DV + + + G
Sbjct: 212 SPVYEYKSRWGESQGNFNRLVESEY------VGCVLGRDNSEDVEEGDGQWCIATAKQPG 265
Query: 307 I------------TAKDMWNEIKKHGSFDHLDNDKTSPSEPGSSIGAAIAASLTIPSGST 354
+ T +D+W + G+ ++ ++ +P+ G I AIA T+ G T
Sbjct: 266 VEVFHHTRWNPVGTGEDIWTSFAQSGALPNIQDE--TPAIEGEQIAGAIAIRFTLQPGET 323
Query: 355 RSVTFSLAWDCPEVKFFEKV-YHRRYTKFYGTLGDSAARIAHDAILEHAKWECEIEAWQR 413
R++ F L WD P +F V Y+RR+T F+ G +A IA A+ + W+ +I+AWQ+
Sbjct: 324 RTIPFVLTWDFPVTEFAAGVKYYRRHTDFFDRTGKNAWAIARTALQAYQSWQSQIQAWQQ 383
Query: 414 PILEDKRFPEWYPITLFNELYYLNAGGTIWTDGSPPMQSLATIRERKFSLDTPSKIHTPA 473
PIL+ P W+ + LFNELY L +GGT+W S AT R+
Sbjct: 384 PILDKTDLPNWFKMALFNELYDLTSGGTLW--------SAATERDP-------------- 421
Query: 474 SSDTALGTRLLENGEENIGQFLYLEGAEYVMYNTYDVHFYSSFALVMLFPKLELSIQRDF 533
IGQF LE +Y Y + DV Y SFAL++LFP+LE ++ R F
Sbjct: 422 -----------------IGQFAILECIDYRWYESLDVRLYGSFALLILFPELEKAVMRAF 464
Query: 534 AAAVMMHDPGT------MKIMSDGKWVARKCLGAVPHDIGLDDP--WFEINSYNLFNSSR 585
A A+ D + I ++ RK GA PHD+G + W + N + +
Sbjct: 465 ARAIPASDNHRRLIGYYLTIGAEMPLAPRKVAGATPHDLGAPNEHVWEKTNYTAYQDCNL 524
Query: 586 WKDLNSKFVLQVYRDFVATG--DKNFARAVWPSVYIAMAYMEQFDKDGDGMIENEGFPDQ 643
WKDL S FVLQVYRD++ TG D F + W ++ A+ Y++ FDKDGDG+ EN G PDQ
Sbjct: 525 WKDLPSDFVLQVYRDYLLTGATDVEFLASCWEAIVQALNYLKTFDKDGDGIPENSGAPDQ 584
Query: 644 TYDAWSANGVSAYCGGLW-----------------VAALQAASALANDVGDHASASYFWV 686
T+D W GVSAYCGGLW V L S + + W+
Sbjct: 585 TFDDWKLQGVSAYCGGLWLAALEAAIAIGKILLNNVERLHVTSLHVTSLQETIDTYQSWL 644
Query: 687 RYQKAKAVY-DSLWNGSYFNYDNSDGSSSTSIQADQLAGQWYARACGLLPIADEAKVKKA 745
+++ +Y + LWNG Y+ D++ GS + ADQL GQ+YAR L I + A
Sbjct: 645 --SRSRPIYQEKLWNGQYYRLDSNSGSD--VVMADQLCGQFYARLLKLPDIVPQECALSA 700
Query: 746 LTKIYDFNVLKVKGGMCGAMNGMQPDGRIDM-SGLQAREIWPGVTYGLAASMIQEEMVDM 804
L IYD LK G GA NG++P+G + + E+W G+ +GLAA ++Q M
Sbjct: 701 LKTIYDACFLKFHQGKFGAANGLKPNGSPENPNATHPLEVWTGINFGLAAFLLQMGMKAE 760
Query: 805 AFQTAAGVYEVAWSEDGLGYSFQTPESWNNNDEYRSLCYMRPLTIWAM 852
A Q V + E+GL F+TPE+ + +R+ Y+R + IWA+
Sbjct: 761 ALQLTETVVNQIY-ENGL--QFRTPEAITSVGTFRASYYLRAMAIWAI 805
>gi|225593155|gb|ACN96073.1| pedicted bile acid beta-glucosidase [Fischerella sp. MV11]
Length = 812
Score = 403 bits (1035), Expect = e-109, Method: Compositional matrix adjust.
Identities = 279/824 (33%), Positives = 403/824 (48%), Gaps = 114/824 (13%)
Query: 87 QGIPLGGIGAGSIGRSYRGEFQRFKLFHGICDDAPVLANQFSVFVSRPNGEKFSSVLCPR 146
G+PLGG GAG IGRS RG+F + + G + QFSVF G + + LC +
Sbjct: 40 HGMPLGGFGAGCIGRSSRGDFNLWHIDGGEHVFKNIPGCQFSVFERTEAGTQ-AYALCTQ 98
Query: 147 SPGVPKKNTDSGIESWDW-----NLKGENCTYHALFPRAWTVYDGEPDPELRIVCRQISP 201
P D + +W W N +G TYHAL+PR+W VY E + ++ C Q SP
Sbjct: 99 PP------EDGSLAAWQWYPTIKNEEGITGTYHALYPRSWFVY--EQVYKTQLTCEQFSP 150
Query: 202 FIPHNYKESSFPASVFTFTLSNSGQTSADVTLLFTWANSVA-------------GDSGLS 248
+NY+E+S+P ++F + N S ++++ TW N V D G
Sbjct: 151 LWANNYQETSYPVAIFVWKAHNPTNASIALSIMLTWENMVGWFTNALKSPEVRMRDDGSP 210
Query: 249 GHHFNSKT-MTKDGVHGL---TLHHRTANGRPPVTFAVAAEE----TADVHVSECPCFLL 300
+ + + +KD + L + H GR V + E A + F
Sbjct: 211 VYEYQPRLGESKDNFNQLVENSEHIGCILGRVGVDQPIQEGEGQWCIATRKHPKLEIFYH 270
Query: 301 SGNSKGITAKDMWNEIKKHGSF-DHLDNDKTSPSEPGSSIGAAIAASLTIPSGSTRSVTF 359
S T +D+W + GS ++LD+ + + G I AA+A T+ G T + F
Sbjct: 271 SRWQPRGTGEDVWQSFAQDGSLPNYLDH---ASAVEGEQIAAALAVRFTLQPGETIEIPF 327
Query: 360 SLAWDCPEVKFFEKV-YHRRYTKFYGTLGDSAARIAHDAILEHAKWECEIEAWQRPILED 418
L WD P +F + Y+RRYT F+G G +A IA A+ ++ W+ +I+AWQ+PIL+
Sbjct: 328 VLTWDFPVTEFAAGINYYRRYTDFFGNDGKNAWAIALIALKQYQAWQAQIQAWQKPILDR 387
Query: 419 KRFPEWYPITLFNELYYLNAGGTIWTDGSPPMQSLATIRERKFSLDTPSKIHTPASSDTA 478
P+W+ + LFNELY L+AGGT+W S AT R+
Sbjct: 388 ADLPDWFKMALFNELYDLSAGGTLW--------SAATERD-------------------- 419
Query: 479 LGTRLLENGEENIGQFLYLEGAEYVMYNTYDVHFYSSFALVMLFPKLELSIQRDFAAAVM 538
IGQF LE +Y Y + DV Y SFAL+MLFP+LE ++ R FA A+
Sbjct: 420 -----------PIGQFAVLECLDYRWYESLDVRLYGSFALLMLFPELEKAVMRAFARAIP 468
Query: 539 MHDPGTMKIM------SDGKWVARKCLGAVPHDIGLDDP--WFEINSYNLFNSSRWKDLN 590
+ D T I ++ + RK GA PHD+G + W + N + + + WKDL
Sbjct: 469 VSDSTTRVIGYYYTVGAESPFAIRKVAGATPHDLGAPNEHVWEKTNYTSYQDCNLWKDLG 528
Query: 591 SKFVLQVYRDFVATG--DKNFARAVWPSVYIAMAYMEQFDKDGDGMIENEGFPDQTYDAW 648
FVLQVYRDF+ TG D F W ++ + Y++ FD D DG+ EN G PDQT+D W
Sbjct: 529 CDFVLQVYRDFLFTGADDITFLAECWNAIVQTLDYLKTFDIDNDGIPENSGAPDQTFDDW 588
Query: 649 SANGVSAYCGGLWVAA-----------------LQAASALANDVGDHA-SASYFWVRYQK 690
GVSAYCGGLW+AA + A D+GD A S + Q+
Sbjct: 589 RLQGVSAYCGGLWLAALEAAIAICDVLITNRRDTENAEKFLGDLGDLAVQKSIYETWLQQ 648
Query: 691 AKAVYDS-LWNGSYFNYDNSDGSSSTSIQADQLAGQWYARACGLLPIADEAKVKKALTKI 749
++ +Y++ LWNG Y+ D+ GS + ADQL GQ+YAR GL I + AL +
Sbjct: 649 SRPIYEAKLWNGQYYRLDSESGSD--VVMADQLCGQFYARLLGLPDIVKRDRALSALKTV 706
Query: 750 YDFNVLKVKGGMCGAMNGMQPDGRIDM-SGLQAREIWPGVTYGLAASMIQEEMVDMAFQT 808
Y+ LK G GA NG++PDG + E+W G+ +GLAA M+Q M D A +
Sbjct: 707 YNACFLKFHHGQFGAANGLRPDGSPENPQATHPLEVWTGINFGLAAFMVQMGMKDEAMKL 766
Query: 809 AAGVYEVAWSEDGLGYSFQTPESWNNNDEYRSLCYMRPLTIWAM 852
V + E+GL F+TPE+ +R+ Y+R + IWA+
Sbjct: 767 TEAVVRQVY-ENGL--QFRTPEAITAVGTFRASTYLRAMAIWAI 807
>gi|119493962|ref|ZP_01624522.1| hypothetical protein L8106_04106 [Lyngbya sp. PCC 8106]
gi|119452283|gb|EAW33479.1| hypothetical protein L8106_04106 [Lyngbya sp. PCC 8106]
Length = 799
Score = 403 bits (1035), Expect = e-109, Method: Compositional matrix adjust.
Identities = 271/817 (33%), Positives = 388/817 (47%), Gaps = 115/817 (14%)
Query: 87 QGIPLGGIGAGSIGRSYRGEFQRFKLFHGICDDAPVLANQFSVFVSRPNGEKFSSVLCPR 146
G+PLGG GAG IGRS RG+F + L G + A QFS+F NG+ + LC
Sbjct: 41 HGMPLGGFGAGCIGRSSRGDFNLWHLDGGEHIFRSLPACQFSIF-EEVNGKTQTYALCTE 99
Query: 147 SPGVPKKNTDSGIESWDWNLKGENC----TYHALFPRAWTVYDGEPDPELRIVCRQISPF 202
P D + +W+W N TYHAL+PR+W VY+ +L C Q SP
Sbjct: 100 PP------EDESLSAWNWYPASSNSQKTGTYHALYPRSWFVYENVFKAQLS--CEQFSPV 151
Query: 203 IPHNYKESSFPASVFTFTLSNSGQTSADVTLLFTWANSVA-------------GDSGLSG 249
+NY+E+S+P +VF + N +++L TW N++ D G
Sbjct: 152 WANNYQETSYPIAVFEWIAHNPTDEPITLSILLTWQNTIGWFTNALEQPLVQVRDDGSPI 211
Query: 250 HHFNSKTMTKDG---------------VHGLTLHHRTANGRPPVTFAVAAEETADVHVSE 294
+ + + G + L+ G + A +V+
Sbjct: 212 YEYQPRWGNSVGNFNHLVEDFHRIGCLMSRLSASEEVREGEGQMAIATITNPAVEVY--- 268
Query: 295 CPCFLLSGNSKGITAKDMWNEIKKHGSFDHLDNDKTSPSEPGSSIGAAIAASLTIPSGST 354
+ + N+ G T +++WN + G+ LD + P+ G IG A+A TI G T
Sbjct: 269 ---YHTAWNTVG-TGEEIWNYFSQDGTL--LDEESEHPATEGRQIGVALAVKFTIRPGKT 322
Query: 355 RSVTFSLAWDCPEVKFFEKV-YHRRYTKFYGTLGDSAARIAHDAILEHAKWECEIEAWQR 413
R + F L+WD P +F V Y+RRYT F+G G +A + A+ W I+AWQ
Sbjct: 323 RKIPFLLSWDLPVTEFESGVGYYRRYTDFFGRNGKNAWSMIRTAMKHSDVWLDNIKAWQN 382
Query: 414 PILEDKRFPEWYPITLFNELYYLNAGGTIWTDGSPPMQSLATIRERKFSLDTPSKIHTPA 473
PIL + P W+ + LFNELY L GGT+W+
Sbjct: 383 PILNRQDLPSWFKMALFNELYLLTDGGTLWSAAD-------------------------- 416
Query: 474 SSDTALGTRLLENGEENIGQFLYLEGAEYVMYNTYDVHFYSSFALVMLFPKLELSIQRDF 533
EN + GQF LE +Y Y + DV Y SF L++L+P+L+ S+ F
Sbjct: 417 -----------ENNPK--GQFAVLECLDYRWYESLDVRLYGSFGLLLLWPELDKSVLLAF 463
Query: 534 AAAVMMHDPGTMKIMSDGKWVARKCLGAVPHDIGL--DDPWFEINSYNLFNSSRWKDLNS 591
A ++ D I + RK GA PHD+G + PW + N + + + WKDL
Sbjct: 464 ARSISKGDKTPRIIGYNQAEAIRKATGATPHDLGAPNEHPWEKTNYTSYQDCNLWKDLPC 523
Query: 592 KFVLQVYRDFVATG--DKNFARAVWPSVYIAMAYMEQFDKDGDGMIENEGFPDQTYDAWS 649
FVLQVYRDFV TG D F + W ++ + Y++ FDKD DG+ EN G PDQT+D W
Sbjct: 524 DFVLQVYRDFVLTGSEDYEFLQDCWLAIVETLDYLKIFDKDEDGIPENSGAPDQTFDDWK 583
Query: 650 ANGVSAYCGGLWVAALQAASALAN-------------DVGDHASASYFWVRYQKAKAVYD 696
GVSAYCGGLW+AAL+AA + + D S W+ +KAK +Y
Sbjct: 584 LQGVSAYCGGLWLAALEAAIEIGKCLQKTEFSDNVSPSISDKISTYETWL--EKAKPIYQ 641
Query: 697 SLWNGSYFNYDNSDGSSSTSIQADQLAGQWYARACGLLPIADEAKVKKALTKIYDFNVLK 756
+LWNG YF D+ GS + ADQL GQ+YA+ GL I K AL IY
Sbjct: 642 NLWNGQYFQLDSQSGSD--VVMADQLCGQFYAKLLGLEDIVSPEKTISALQTIYQSCFQN 699
Query: 757 VKGGMCGAMNGMQPDGR-IDMSGLQAREIWPGVTYGLAASMIQEEMVDMAFQTAAGVYEV 815
G GA NG++ +G I+ + E+W G+ +GLAA +IQ M + F+ A V +
Sbjct: 700 FYQGQFGAANGVRINGEPINPNDTHPLEVWTGINFGLAAFLIQMGMKEEGFKLAEVVVQQ 759
Query: 816 AWSEDGLGYSFQTPESWNNNDEYRSLCYMRPLTIWAM 852
+ E+GL F+TPE+ +R+ Y+R + IWA+
Sbjct: 760 IY-ENGL--QFRTPEAITAAGTFRASHYLRAMAIWAI 793
>gi|443320525|ref|ZP_21049620.1| putative bile acid beta-glucosidase [Gloeocapsa sp. PCC 73106]
gi|442789771|gb|ELR99409.1| putative bile acid beta-glucosidase [Gloeocapsa sp. PCC 73106]
Length = 806
Score = 402 bits (1034), Expect = e-109, Method: Compositional matrix adjust.
Identities = 267/822 (32%), Positives = 388/822 (47%), Gaps = 114/822 (13%)
Query: 87 QGIPLGGIGAGSIGRSYRGEFQRFKLFHGICDDAPVLANQFSVFVSRPNGEKFSSVLCPR 146
G+PLGG GAG +GRS RG+F + G + A QFSVF GE + +C +
Sbjct: 41 HGMPLGGFGAGCVGRSPRGDFNLWHFDGGEHIFRSLDACQFSVFEHPEQGEVKAYGMCTK 100
Query: 147 SPGVPKKNTDSGIESWDWNLKGENCTYHALFPRAWTVYDGEPDPELRIVCRQISPFIPHN 206
P D + W W + + TYHAL+PR+W Y G EL VC Q SP N
Sbjct: 101 PP------EDGTLSRWRWYPEDQG-TYHALYPRSWFEYQGVFATEL--VCTQFSPIWAKN 151
Query: 207 YKESSFPASVFTFTLSNSGQTSADVTLLFTWANSVA-------------GDSGLSGHHF- 252
Y+ESS+P ++F +T N ++++ TW N V D G + +
Sbjct: 152 YQESSYPLAIFEWTAYNPTDQPITLSIMLTWQNVVGWFTNAIKSPEIRIRDDGSPVYQYQ 211
Query: 253 --------NSKTMTKDG------VHGLTLHHRTANGRPPVTFAVAAEETADVHVSECPCF 298
N +DG + + H G A + +V +
Sbjct: 212 PRWGDSTGNYNQWIQDGSRVGCLFNRIQPHEELQEGEGQFCIASILNPSLEVF------Y 265
Query: 299 LLSGNSKGITAKDMWNEIKKHGSFDHLDNDKTSPSEPGSSIGAAIAASLTIPSGSTRSVT 358
+ +G D+W++ + GS D +P+EPG I AIA T+ G TR +
Sbjct: 266 QTRWHPRG-DGSDVWDDFARDGSLG--DRQDETPAEPGEQIAGAIAVRFTVRPGKTRLIP 322
Query: 359 FSLAWDCPEVKFFEKV-YHRRYTKFYGTLGDSAARIAHDAILEHAKWECEIEAWQRPILE 417
F L WD P ++F V Y+RRYT F+G G +A + A+ W+ +I+ WQ P+++
Sbjct: 323 FVLTWDLPILEFAPGVNYYRRYTDFFGRNGQNAWSMVRTALKHGDMWQDKIDQWQSPVVQ 382
Query: 418 DKRFPEWYPITLFNELYYLNAGGTIWTDGSPPMQSLATIRERKFSLDTPSKIHTPASSDT 477
+P W + LFNELY L GGT+WT +
Sbjct: 383 RSDYPTWLKMALFNELYLLTQGGTLWTAAT------------------------------ 412
Query: 478 ALGTRLLENGEENIGQFLYLEGAEYVMYNTYDVHFYSSFALVMLFPKLELSIQRDFAAAV 537
EN + IGQF LE +Y Y + DV Y SFAL ML+P+L+ +I FA A+
Sbjct: 413 -------EN--DPIGQFAVLECIDYRWYESLDVRLYGSFALAMLWPRLDKAILEAFARAI 463
Query: 538 MMHDPGTMKIMSDGKWVARKCLGAVPHDIGL--DDPWFEINSYNLFNSSRWKDLNSKFVL 595
D I + RK GA PHD+G + PW + N + + + WKDL+S FVL
Sbjct: 464 PQSDESMRVIGYNQAEAQRKIAGATPHDLGAPNEHPWEKTNYTSYQDCNLWKDLSSDFVL 523
Query: 596 QVYRDFVATG--DKNFARAVWPSVYIAMAYMEQFDKDGDGMIENEGFPDQTYDAWSANGV 653
QVYRDFV TG D +F WPS+ + Y++ FD DGDG+ EN G PDQT+D W G+
Sbjct: 524 QVYRDFVLTGKEDTDFLWECWPSITQTLNYLKSFDLDGDGIPENAGAPDQTFDDWKLEGI 583
Query: 654 SAYCGGLWVAALQAASALANDVGDH-----------------ASASYFWVRYQKAKAVY- 695
SAYCGGLW+AAL+A+ A+ + D+ A+ A+++Y
Sbjct: 584 SAYCGGLWIAALEASIAIGAVLLDNPPMNPQLQPSDYPEGIEATLKTHQSWLTLARSIYH 643
Query: 696 DSLWNGSYFNYDNSDGSSSTSIQADQLAGQWYARACGLLPIADEAKVKKALTKIYDFNVL 755
LWNG Y++ D G S + ADQLAGQ+YA+ GL + ++ V+ L IY L
Sbjct: 644 QQLWNGQYYSLDTKSG--SVVVMADQLAGQFYAQLLGLPDVVEKEYVQSTLKTIYQSCFL 701
Query: 756 KVKGGMCGAMNGMQPDGR-IDMSGLQAREIWPGVTYGLAASMIQEEMVDMAFQTAAGVYE 814
G GA NG++ DG + E+W G+ + L A +I M D + A + +
Sbjct: 702 NFHQGKIGAANGVKLDGTGENPQSTHPLEVWTGINFALGAFLIHSGMRDEGLKLAETIVK 761
Query: 815 VAWSEDGLGYSFQTPESWNNNDEYRSLCYMRPLTIWAMQWAL 856
+ + G F+TPE+ + +R+ Y+R + IWA+ + L
Sbjct: 762 QIYDQ---GLQFRTPEALTTSGTFRASHYLRAMAIWAIDYTL 800
>gi|354566897|ref|ZP_08986068.1| Glucosylceramidase [Fischerella sp. JSC-11]
gi|353544556|gb|EHC14010.1| Glucosylceramidase [Fischerella sp. JSC-11]
Length = 812
Score = 402 bits (1033), Expect = e-109, Method: Compositional matrix adjust.
Identities = 285/824 (34%), Positives = 405/824 (49%), Gaps = 114/824 (13%)
Query: 87 QGIPLGGIGAGSIGRSYRGEFQRFKLFHGICDDAPVLANQFSVFVSRPNGEKFSSVLCPR 146
G+PLGG GAG IGRS RG+F + + G + QFSVF G + + LC +
Sbjct: 40 HGMPLGGFGAGCIGRSSRGDFNLWHIDGGEHVFKNIPGCQFSVFERTGAGTQ-AYALCTQ 98
Query: 147 SPGVPKKNTDSGIESWDW-----NLKGENCTYHALFPRAWTVYDGEPDPELRIVCRQISP 201
P D + +W W N +G TYHAL+PR+W VY E + ++ C Q SP
Sbjct: 99 PP------EDGSLAAWQWYPTIKNEEGITGTYHALYPRSWFVY--EQVYKTQLTCEQFSP 150
Query: 202 FIPHNYKESSFPASVFTFTLSNSGQTSADVTLLFTWANSVA-------------GDSGLS 248
+NY+E+S+P ++F + N S ++++ TW N V D G
Sbjct: 151 IWANNYQETSYPVAIFVWKAHNPTNASIALSIMLTWENMVGWFTNALKSPEVKMRDDGSP 210
Query: 249 GHHFNSKT-MTKDGVHGL---TLHHRTANGRPPVTFAVAAEE----TADVHVSECPCFLL 300
+ + +KD + L + H GR V + E A + F
Sbjct: 211 VYEYQPPLGESKDNFNQLVENSEHIGCVLGRVGVDQPIQEGEGQWCIATRKHPKLEIFYH 270
Query: 301 SGNSKGITAKDMWNEIKKHGSF-DHLDNDKTSPSEPGSSIGAAIAASLTIPSGSTRSVTF 359
S T +D+W + GS ++LD+ TS E G I AA+A T+ G T + F
Sbjct: 271 SRWQPRGTGEDVWQSFAQDGSLPNYLDH--TSAVE-GEQIAAALAVRFTLQPGETLEIPF 327
Query: 360 SLAWDCPEVKFFEKV-YHRRYTKFYGTLGDSAARIAHDAILEHAKWECEIEAWQRPILED 418
L WD P +F + Y+RRYT F+G G +A IA A+ ++ W+ +I+AWQ+PILE
Sbjct: 328 VLTWDFPVTEFAAGINYYRRYTDFFGNDGKNAWAIALIALKQYQAWQAQIQAWQKPILER 387
Query: 419 KRFPEWYPITLFNELYYLNAGGTIWTDGSPPMQSLATIRERKFSLDTPSKIHTPASSDTA 478
P+W+ + LFNELY L+AGGT+W S AT R+
Sbjct: 388 ADLPDWFKMALFNELYDLSAGGTLW--------SAATERD-------------------- 419
Query: 479 LGTRLLENGEENIGQFLYLEGAEYVMYNTYDVHFYSSFALVMLFPKLELSIQRDFAAAVM 538
IGQF LE +Y Y + DV Y SFAL+MLFP+LE ++ R FA A+
Sbjct: 420 -----------PIGQFAVLECLDYRWYESLDVRLYGSFALLMLFPELEKAVMRAFARAIP 468
Query: 539 MHDPGTMKIM------SDGKWVARKCLGAVPHDIGLDDP--WFEINSYNLFNSSRWKDLN 590
+ D T I ++ + RK GA PHD+G + W + N + + + WKDL
Sbjct: 469 VSDQTTRVIGYYYTVGAESPFAIRKVAGATPHDLGAPNEHVWEKTNYTSYQDCNLWKDLG 528
Query: 591 SKFVLQVYRDFVATG--DKNFARAVWPSVYIAMAYMEQFDKDGDGMIENEGFPDQTYDAW 648
FVLQVYRDF+ TG D F W ++ + Y++ FD D DG+ EN G PDQT+D W
Sbjct: 529 CDFVLQVYRDFLFTGADDITFLAECWNAIVETLDYLKTFDIDNDGIPENSGAPDQTFDDW 588
Query: 649 SANGVSAYCGGLWVAALQA-----------------ASALANDVGDHA-SASYFWVRYQK 690
GVSAYCGGLW+AAL+A A D+GD A S + Q+
Sbjct: 589 RLQGVSAYCGGLWLAALEAAIAICDVLITNRRDTDNAENFLGDLGDLAVQKSIYETWLQQ 648
Query: 691 AKAVYDS-LWNGSYFNYDNSDGSSSTSIQADQLAGQWYARACGLLPIADEAKVKKALTKI 749
++ +Y++ LWNG Y+ D+ GS + ADQL GQ+YAR GL I + AL +
Sbjct: 649 SRPIYEAKLWNGQYYRLDSESGSD--VVMADQLCGQFYARLLGLPDIVKRDRALSALKTV 706
Query: 750 YDFNVLKVKGGMCGAMNGMQPDGRIDM-SGLQAREIWPGVTYGLAASMIQEEMVDMAFQT 808
Y+ LK G GA NG++PDG + E+W G+ +GLAA M+Q M D A +
Sbjct: 707 YNACFLKFHHGQFGAANGLRPDGSPENPQATHPLEVWTGINFGLAAFMVQMGMKDEAMKL 766
Query: 809 AAGVYEVAWSEDGLGYSFQTPESWNNNDEYRSLCYMRPLTIWAM 852
V + E+GL F+TPE+ +R+ Y+R + IWA+
Sbjct: 767 TETVVRQIY-ENGL--QFRTPEAITAVGTFRASTYLRAMAIWAI 807
>gi|282900195|ref|ZP_06308151.1| protein of unknown function DUF608 [Cylindrospermopsis raciborskii
CS-505]
gi|281194944|gb|EFA69885.1| protein of unknown function DUF608 [Cylindrospermopsis raciborskii
CS-505]
Length = 789
Score = 401 bits (1030), Expect = e-109, Method: Compositional matrix adjust.
Identities = 274/814 (33%), Positives = 402/814 (49%), Gaps = 115/814 (14%)
Query: 87 QGIPLGGIGAGSIGRSYRGEFQRFKLFHGICDDAPVLANQFSVFVSRPNGEKFSSVLCPR 146
G+PLGG GAG IGRS RG+F + + G V QFS+F S G+ ++ P
Sbjct: 38 HGMPLGGFGAGCIGRSSRGDFNLWHIDGGEHVFKNVPGCQFSIFTS---GQAYALSTQP- 93
Query: 147 SPGVPKKNTDSGIESWDWNLKGENCTYHALFPRAWTVYDGEPDPELRIVCRQISPFIPHN 206
P+ NT +++W W + TYHAL+PR+W VY E L+ C Q SP HN
Sbjct: 94 ----PEDNT---LQTWQWYPNTDGGTYHALYPRSWFVY--ENVFPLQFTCEQFSPVWAHN 144
Query: 207 YKESSFPASVFTFTLSNSGQTSADVTLLFTWANSVA-------------GDSGLSGHHFN 253
Y+E+S+P +VF + + N +++ +W N D G + +
Sbjct: 145 YRETSYPVAVFLWNIHNPTNQPLTASIMLSWENMTGWFTNALKSPQVKIRDDGSPVYEYQ 204
Query: 254 SKTMTKDG--------------VHGLTLHHRTANGRPPVTFAVAAEETADVHVSECPCFL 299
+ G V G H G T+ +A + V + F
Sbjct: 205 PRWGESQGNYNWLAENDQYLGCVLGRATDHPIQEGDG--TWCIATVKNPQVELFYHCQFN 262
Query: 300 LSGNSKGITAKDMWNEIKKHGSFDHLDNDKTSPSEPGSSIGAAIAASLTIPSGSTRSVTF 359
GN +++W + ++GS + ++ +P++ S +GAA+A T+P G T +V F
Sbjct: 263 PLGN-----GEEIWRDFSQNGSLSNYQDE--TPADVNSRLGAAVAVRFTLPPGETLTVPF 315
Query: 360 SLAWDCPEVKFFEKV-YHRRYTKFYGTLGDSAARIAHDAILEHAKWECEIEAWQRPILED 418
L+WD P +F V Y+RRYT F+GT GD+A +IA A+ E W IE+WQ+PILE
Sbjct: 316 VLSWDFPVTEFAAGVNYYRRYTDFFGTTGDNAWQIATCALKECYNWRSHIESWQKPILER 375
Query: 419 KRFPEWYPITLFNELYYLNAGGTIWTDGSPPMQSLATIRERKFSLDTPSKIHTPASSDTA 478
+ P W+ + LFNELY L +GGT+W+ G+
Sbjct: 376 EDLPSWFKMALFNELYDLTSGGTLWSAGTE------------------------------ 405
Query: 479 LGTRLLENGEENIGQFLYLEGAEYVMYNTYDVHFYSSFALVMLFPKLELSIQRDFAAAVM 538
++ +GQF LE +Y Y + DV Y SF L++LFP+LE ++ R FA A+
Sbjct: 406 ---------KDPMGQFAVLECLDYRWYESLDVRLYGSFGLLILFPELEKAVIRAFARAIP 456
Query: 539 MHDPGT------MKIMSDGKWVARKCLGAVPHDIGLDDPW-FEINSYNLF-NSSRWKDLN 590
D + + I S RK GA PHD+G + W +E +Y + + + WKDL
Sbjct: 457 QSDDRSRIIGYYLTINSPSPMALRKVAGATPHDLGAPNEWVWEQTNYTSYQDCNLWKDLG 516
Query: 591 SKFVLQVYRDFVATG--DKNFARAVWPSVYIAMAYMEQFDKDGDGMIENEGFPDQTYDAW 648
FVLQVYRDF+ TG D F W + + + Y++QFD DGDG+ EN G PDQT+D W
Sbjct: 517 CDFVLQVYRDFLFTGANDIEFLVDCWHGMVLTLDYLKQFDMDGDGIPENSGAPDQTFDDW 576
Query: 649 SANGVSAYCGGLWVAALQAASALAN---------DVGDHASASYFWVRYQKAKAVYDSLW 699
+GVSAYCG LW+AAL+AA A+ + +VG+ S W+ Q LW
Sbjct: 577 RLSGVSAYCGALWLAALEAAIAICDVLINRPELPNVGEQKSIYEHWLN-QSLPVYQQKLW 635
Query: 700 NGSYFNYDNSDGSSSTSIQADQLAGQWYARACGLLPIADEAKVKKALTKIYDFNVLKVKG 759
NG ++ D+ GS+ + ADQL+GQ+YAR L I + + ALT IYD LK +
Sbjct: 636 NGKFYRLDSESGSN--VVMADQLSGQFYARLLNLPDIVPKDRALSALTTIYDACFLKFQD 693
Query: 760 GMCGAMNGMQPDGRIDM-SGLQAREIWPGVTYGLAASMIQEEMVDMAFQTAAGVYEVAWS 818
G GA NG+ DG + E+W G+ +GLAA ++Q M + + V +
Sbjct: 694 GKFGAANGVLLDGSPENPQATHPLEVWTGINFGLAAFLLQMGMKEQGLRLTEAVVRQVYE 753
Query: 819 EDGLGYSFQTPESWNNNDEYRSLCYMRPLTIWAM 852
G F+TPE+ +R+ Y+RP+ IWA+
Sbjct: 754 H---GLQFRTPEAITAAGTFRASTYLRPMAIWAV 784
>gi|332711224|ref|ZP_08431157.1| putative bile acid beta-glucosidase [Moorea producens 3L]
gi|332350038|gb|EGJ29645.1| putative bile acid beta-glucosidase [Moorea producens 3L]
Length = 815
Score = 400 bits (1028), Expect = e-108, Method: Compositional matrix adjust.
Identities = 272/820 (33%), Positives = 383/820 (46%), Gaps = 108/820 (13%)
Query: 87 QGIPLGGIGAGSIGRSYRGEFQRFKLFHGICDDAPVLANQFSVFVSRPNGEKFSSVLCPR 146
G+PLGG GAG IGRS RG+F + L G + A QFSVF + + LC +
Sbjct: 41 HGMPLGGFGAGCIGRSPRGDFNLWHLDGGEHIFKSLPACQFSVFEQPEHSLAQAYALCTQ 100
Query: 147 SPGVPKKNTDSGIESWDW---------NLKGENCTYHALFPRAWTVYDGEPDPELRIVCR 197
P D + SW+W N + Y+AL+PR+W Y EL C
Sbjct: 101 PP------EDGKLGSWEWYPTQGKSEENSSEVSGHYYALYPRSWFTYQNVFQTEL--TCE 152
Query: 198 QISPFIPHNYKESSFPASVFTFTLSNSGQTSADVTLLFTWANSVA-------------GD 244
Q SP Y+ESS+P +VF +T N T V+++ TW N V D
Sbjct: 153 QFSPIWAGCYQESSYPVAVFEWTAHNPTDTPITVSIMLTWQNMVGWFTNGIKNPEVRVRD 212
Query: 245 SGLSGHHFNSK--TMTKDGVHGLTLHHRTANGRPPVTFAVAAEE------TADVHVSECP 296
G + + + T + + H+R V +E A V
Sbjct: 213 DGSPVYEYQPRWGDSTGNLNQWIVDHYRVGFVLDRVRLGDDIKEGEGQLCLATVTNPSME 272
Query: 297 CFLLSGNSKGITAKDMWNEIKKHGSFDHLDNDKTSPSEPGSSIGAAIAASLTIPSGSTRS 356
F + ++W+ GS +D+ +P+ PG I AIA T+ G TR
Sbjct: 273 VFYHGRWNPVGDGAEVWDTFATDGSL--VDHQDETPAAPGEQIACAIAIRFTLRPGKTRK 330
Query: 357 VTFSLAWDCPEVKFFEKV-YHRRYTKFYGTLGDSAARIAHDAILEHAKWECEIEAWQRPI 415
+ F LAWD P +F + Y+RRYT F+G G++ + A+ H W +IE+WQ PI
Sbjct: 331 IPFILAWDFPVTEFKLGINYYRRYTDFFGRNGNNGWPMVRTALKHHDLWREQIESWQEPI 390
Query: 416 LEDKRFPEWYPITLFNELYYLNAGGTIWTDGSPPMQSLATIRERKFSLDTPSKIHTPASS 475
L + P W+ + LFNELY L GGT+WT + LD
Sbjct: 391 LNQEDLPNWFKMALFNELYLLTDGGTLWTAAT--------------ELDP---------- 426
Query: 476 DTALGTRLLENGEENIGQFLYLEGAEYVMYNTYDVHFYSSFALVMLFPKLELSIQRDFAA 535
IGQF LE +Y Y + DV Y SFAL ML+PKL+ ++ FA
Sbjct: 427 ---------------IGQFGVLECLDYRWYESLDVRLYGSFALAMLWPKLDKAVLEAFAR 471
Query: 536 AVMMHDPGTMKIMSDGKWVARKCLGAVPHDIGL--DDPWFEINSYNLFNSSRWKDLNSKF 593
A+ D I + RK A PHD+G + PW + N + + ++WKDL S F
Sbjct: 472 AIPTSDETLRIIGYNQAQAVRKIADATPHDLGAPNEHPWQQTNYTSYQDCNQWKDLASDF 531
Query: 594 VLQVYRDFV--ATGDKNFARAVWPSVYIAMAYMEQFDKDGDGMIENEGFPDQTYDAWSAN 651
VLQVYRDFV + D F W S+ A+AY+++FD+DGDG+ EN G PDQT+D W
Sbjct: 532 VLQVYRDFVFYSATDIEFLWECWSSIVKALAYLKRFDQDGDGIPENYGAPDQTFDDWRLQ 591
Query: 652 GVSAYCGGLWVAALQAASALANDVGDHASASYFWVRYQKAKAVYDS-------------- 697
GVSAYCGGLW+AAL++A A+ + + + +V + ++ D+
Sbjct: 592 GVSAYCGGLWLAALESAIAIGEILTKNYPPTSPFVEVPELDSIQDTIDCYRQWLAQAMPI 651
Query: 698 ----LWNGSYFNYDNSDGSSSTSIQADQLAGQWYARACGLLPIADEAKVKKALTKIYDFN 753
LWNG Y+ D+ GS + ADQL GQ+YA+ GL I AL +YD
Sbjct: 652 YQEKLWNGQYYQLDSESGSD--VVMADQLCGQFYAQLLGLPDIVPPECTLSALKTVYDAC 709
Query: 754 VLKVKGGMCGAMNGMQPDGRI-DMSGLQAREIWPGVTYGLAASMIQEEMVDMAFQTAAGV 812
LK G GA+NG++PDG D E+W G+ +GLAA +IQ+ M D A + V
Sbjct: 710 FLKFHQGQLGAVNGVRPDGTPEDPDATHPMEVWTGINFGLAAFLIQQGMKDEALRITEAV 769
Query: 813 YEVAWSEDGLGYSFQTPESWNNNDEYRSLCYMRPLTIWAM 852
+ G F+TPE+ +R+ Y+R + IWA+
Sbjct: 770 VRQVYDH---GLQFRTPEAITAAGTFRASHYLRAMGIWAV 806
>gi|16330256|ref|NP_440984.1| hypothetical protein sll1775 [Synechocystis sp. PCC 6803]
gi|383321997|ref|YP_005382850.1| hypothetical protein SYNGTI_1088 [Synechocystis sp. PCC 6803
substr. GT-I]
gi|383325166|ref|YP_005386019.1| hypothetical protein SYNPCCP_1087 [Synechocystis sp. PCC 6803
substr. PCC-P]
gi|383491050|ref|YP_005408726.1| hypothetical protein SYNPCCN_1087 [Synechocystis sp. PCC 6803
substr. PCC-N]
gi|384436317|ref|YP_005651041.1| hypothetical protein SYNGTS_1088 [Synechocystis sp. PCC 6803]
gi|451814414|ref|YP_007450866.1| hypothetical protein MYO_110980 [Synechocystis sp. PCC 6803]
gi|1652745|dbj|BAA17664.1| sll1775 [Synechocystis sp. PCC 6803]
gi|339273349|dbj|BAK49836.1| hypothetical protein SYNGTS_1088 [Synechocystis sp. PCC 6803]
gi|359271316|dbj|BAL28835.1| hypothetical protein SYNGTI_1088 [Synechocystis sp. PCC 6803
substr. GT-I]
gi|359274486|dbj|BAL32004.1| hypothetical protein SYNPCCN_1087 [Synechocystis sp. PCC 6803
substr. PCC-N]
gi|359277656|dbj|BAL35173.1| hypothetical protein SYNPCCP_1087 [Synechocystis sp. PCC 6803
substr. PCC-P]
gi|407958168|dbj|BAM51408.1| hypothetical protein BEST7613_2477 [Bacillus subtilis BEST7613]
gi|451780383|gb|AGF51352.1| hypothetical protein MYO_110980 [Synechocystis sp. PCC 6803]
Length = 817
Score = 400 bits (1027), Expect = e-108, Method: Compositional matrix adjust.
Identities = 275/817 (33%), Positives = 387/817 (47%), Gaps = 114/817 (13%)
Query: 86 DQGIPLGGIGAGSIGRSYRGEFQRFKLFHG--ICDDAPVLANQFSVFVSRPNGEKFSSVL 143
D G+PLGG GAG IGRS G+F + L G I + P A QFSV+ E + +
Sbjct: 40 DHGMPLGGFGAGCIGRSPTGKFNLWHLDGGNHIFKNLP--ACQFSVYEQVAGEEAKAYAM 97
Query: 144 CPRSPGVPKKNTDSGIESWDWNLKGENCTYHALFPRAWTVYDGEPDPELRIVCRQISPFI 203
+P D ++ W W K E Y AL+PR+W Y EL +C Q SP
Sbjct: 98 ATEAP------EDGSLDRWQWYPK-EKGEYSALYPRSWYEYKDVFKSEL--ICEQFSPVW 148
Query: 204 PHNYKESSFPASVFTFTLSNSGQTSADVTLLFTWANSVA-------------GDSGLSGH 250
NY+ESS+P +VF +T+ N ++++ TW NSV D G +
Sbjct: 149 GGNYQESSYPLAVFDWTVHNPTDKPITLSIMLTWQNSVGWFTNAIKSPTVKLRDDGSPEY 208
Query: 251 HF---------NSKTMTKDGVHGLTLHHRTA------NGRPPVTFAVAAEETADVHVSEC 295
+ N +DG L +R G + A + + +V
Sbjct: 209 EYQPRWGDSTGNFNQWIQDGFRVGCLFNRVQPHEALREGEGQMCIASVSNPSVEVFYHN- 267
Query: 296 PCFLLSGNSKGITAKDMWNEIKKHGSFDHLDNDKTSPSEPGSSIGAAIAASLTIPSGSTR 355
N G A D+W+ +GS + ++ + +EPG I AI T+ G T+
Sbjct: 268 -----RWNPSGDGA-DVWDWFAGNGSLPDIQDE--TAAEPGEQIAGAICLRFTVRPGKTK 319
Query: 356 SVTFSLAWDCPEVKFFEKV-YHRRYTKFYGTLGDSAARIAHDAILEHAKWECEIEAWQRP 414
+ F LAWD P +F + V Y+RRYT FYG G + + A+ W +IEAWQ P
Sbjct: 320 KIPFLLAWDLPITEFAQGVTYYRRYTDFYGRNGKNVWSMIRTALKHSDTWRKKIEAWQAP 379
Query: 415 ILEDKRFPEWYPITLFNELYYLNAGGTIWTDGSPPMQSLATIRERKFSLDTPSKIHTPAS 474
ILE+ R P+W ++LFNELY L GGT+WT +
Sbjct: 380 ILENPRLPDWLKMSLFNELYLLTQGGTLWTAAT--------------------------- 412
Query: 475 SDTALGTRLLENGEENIGQFLYLEGAEYVMYNTYDVHFYSSFALVMLFPKLELSIQRDFA 534
EN + +GQF E +Y Y + DV Y SF L+M +PKLE ++ FA
Sbjct: 413 ----------EN--DPVGQFGVCECMDYRWYESLDVRLYGSFGLLMHWPKLEQAVMLAFA 460
Query: 535 AAVMMHDPGTMKIMSDGKWVARKCLGAVPHDIGL--DDPWFEINSYNLFNSSRWKDLNSK 592
A+ DP I + RK A PHD+G + PW N + + WKDL S
Sbjct: 461 RAIPTEDPKERVIGYNLSLAPRKVKNATPHDLGAPNEHPWERSNYTAYQDCNLWKDLGSD 520
Query: 593 FVLQVYRDFV--ATGDKNFARAVWPSVYIAMAYMEQFDKDGDGMIENEGFPDQTYDAWSA 650
FVL VYRDF+ D F WPS+ A+ Y++ FD D DG+ EN G PDQT+D W
Sbjct: 521 FVLLVYRDFLLLPDADGEFLGECWPSIVAALDYLKGFDLDNDGIPENSGAPDQTFDDWRL 580
Query: 651 NGVSAYCGGLWVAALQAASALANDV---------GDHASASYFWVRYQ----KAKAVY-D 696
G+SAYCGGLW+AAL+AA + + D RY+ +A+++Y D
Sbjct: 581 QGISAYCGGLWIAALEAALVMGQYLLENPPQQTDLDAMEIQTAIARYETWLAQARSLYQD 640
Query: 697 SLWNGSYFNYDNSDGSSSTSIQADQLAGQWYARACGLLPIADEAKVKKALTKIYDFNVLK 756
+LWNG Y+N D+ GS + ADQL GQ+YAR L + + +KAL K+YD LK
Sbjct: 641 TLWNGEYYNLDSQSGSK--VVMADQLCGQFYARLLNLPDVVEHHYAEKALAKVYDTCFLK 698
Query: 757 VKGGMCGAMNGMQPDGRI-DMSGLQAREIWPGVTYGLAASMIQEEMVDMAFQTAAGVYEV 815
G GA NG+ PDG + + E+W G+ +GLAA +IQ M + A + V
Sbjct: 699 FAHGELGAANGLLPDGSPQNPNDTHPLEVWTGINFGLAAFLIQMGMKEEALKMTETVVNQ 758
Query: 816 AWSEDGLGYSFQTPESWNNNDEYRSLCYMRPLTIWAM 852
++ G F+TPE+ +R+ Y+R + IWA+
Sbjct: 759 VYNN---GLQFRTPEAITAVGTFRASHYLRAMAIWAV 792
>gi|425447791|ref|ZP_18827773.1| Similar to tr|P73619|P73619 [Microcystis aeruginosa PCC 9443]
gi|389731620|emb|CCI04385.1| Similar to tr|P73619|P73619 [Microcystis aeruginosa PCC 9443]
Length = 784
Score = 399 bits (1025), Expect = e-108, Method: Compositional matrix adjust.
Identities = 261/803 (32%), Positives = 399/803 (49%), Gaps = 93/803 (11%)
Query: 87 QGIPLGGIGAGSIGRSYRGEFQRFKLFHGICDDAPVLANQFSVFVSRPNGEKFSSVLCPR 146
G+PLGG G G IGRS +G+F + L G + A QFS+F +G + +
Sbjct: 41 HGMPLGGFGGGCIGRSPKGDFNLWHLDGGEHSFNSLAACQFSIFEQTEDGSAQAYAMATE 100
Query: 147 SPGVPKKNTDSGIESWDWNLKGENCTYHALFPRAWTVYDGEPDPELRIVCRQISPFIPHN 206
SP + + W W E TY AL+PR+W Y+ + + +++C Q SP HN
Sbjct: 101 SPKY------ATLSRWLW-YPAEKGTYSALYPRSW--YEYKDVFQTQLICEQFSPVWAHN 151
Query: 207 YKESSFPASVFTFTLSNSGQTSADVTLLFTWANSVA-------------GDSGLSGHHFN 253
Y+ESS+P ++F +T+ N ++++ TW N++ D G + +
Sbjct: 152 YQESSYPIALFEWTVHNPTDKPITISIMLTWQNTIGWFTNAIKSPVVKVRDDGSPVYEYQ 211
Query: 254 SKTMTKDGVH----------GLTLHHRTAN-GRPPVTFAVAAEETADVHVSECPCFLLSG 302
SK +G + G + N G A+A+ + V F L
Sbjct: 212 SKWGESEGNYNQWIVDNFRVGCLMRRDYENPGEGDGQMAIASITNPSLEV-----FYLGK 266
Query: 303 NSKGITAKDMWNEIKKHGSFDHLDNDKTSPSEPGSSIGAAIAASLTIPSGSTRSVTFSLA 362
+ ++W+ +GS ++++ +P++ G A+A TI G TR + F LA
Sbjct: 267 WNPAGDGSEVWDYFAMNGSLPDIEDE--TPAQKGEQTATAMAIRFTIRPGKTRKIPFILA 324
Query: 363 WDCPEVKFFEKV-YHRRYTKFYGTLGDSAARIAHDAILEHAKWECEIEAWQRPILEDKRF 421
WD P +F + + Y+RRYT F+G G +A + A+ W I WQ+PILE
Sbjct: 325 WDLPVTEFAQGINYYRRYTDFFGRNGQNAWAMVRTALKHGDTWRENIINWQKPILERSDL 384
Query: 422 PEWYPITLFNELYYLNAGGTIWTDGSPPMQSLATIRERKFSLDTPSKIHTPASSDTALGT 481
+W+ + LFNELY L GGT+WT +
Sbjct: 385 ADWFKMALFNELYLLADGGTLWTAATE--------------------------------- 411
Query: 482 RLLENGEENIGQFLYLEGAEYVMYNTYDVHFYSSFALVMLFPKLELSIQRDFAAAVMMHD 541
++ IGQF LE +Y Y + DV Y SF L+ML+P+LE ++ FA A+ D
Sbjct: 412 ------QDPIGQFGVLECIDYRWYESLDVRLYGSFGLLMLWPRLEKAVMEAFARAIPSSD 465
Query: 542 PGTMKIMSDGKWVARKCLGAVPHDIGL--DDPWFEINSYNLFNSSRWKDLNSKFVLQVYR 599
I + RK A PHD+G + PW + N + + + WKDL S FVL VYR
Sbjct: 466 DTPRIIGYNQASAIRKAKNATPHDLGAPNEHPWQKSNYTSYQDCNLWKDLGSDFVLLVYR 525
Query: 600 DFVATG--DKNFARAVWPSVYIAMAYMEQFDKDGDGMIENEGFPDQTYDAWSANGVSAYC 657
D++ TG D++F R WPS+ + + Y++ FD D DG+ EN G PDQT+D W G+SAYC
Sbjct: 526 DYLLTGAKDEDFLRECWPSIVLTLQYLKTFDLDNDGIPENSGAPDQTFDDWRLQGISAYC 585
Query: 658 GGLWVAALQAASALANDVGDHASASYFWVRYQKAKAVY-DSLWNGSYFNYDNSDGSSSTS 716
GGLW+AAL+A + +G+ + W+ Q+++A+Y +LWNG Y+N D+ GS S
Sbjct: 586 GGLWIAALEATLKIGAILGEDTAIFASWL--QQSRAIYHQTLWNGEYYNLDS--GSGSDV 641
Query: 717 IQADQLAGQWYARACGLLPIADEAKVKKALTKIYDFNVLKVKGGMCGAMNGMQPDGRIDM 776
+ DQL GQ+YAR L + + + AL+KIY+ LK GG GA NG++PDG +
Sbjct: 642 VMTDQLCGQFYARLLSLPDVVENQYTQSALSKIYEACFLKFHGGKFGAANGLKPDGTPEN 701
Query: 777 -SGLQAREIWPGVTYGLAASMIQEEMVDMAFQTAAGVYEVAWSEDGLGYSFQTPESWNNN 835
E+W G+ +GL A ++Q M +Q V + + E+GL F+TPE+ +
Sbjct: 702 PEATHPLEVWVGINFGLVAFLLQMGMEKQGWQITEAVVQQIY-ENGL--QFRTPEAITSV 758
Query: 836 DEYRSLCYMRPLTIWAMQWALTK 858
+R+ Y+R + IWA+ LT+
Sbjct: 759 GTFRACHYLRAMAIWAIYGVLTE 781
>gi|427716943|ref|YP_007064937.1| glucosylceramidase [Calothrix sp. PCC 7507]
gi|427349379|gb|AFY32103.1| Glucosylceramidase [Calothrix sp. PCC 7507]
Length = 800
Score = 399 bits (1024), Expect = e-108, Method: Compositional matrix adjust.
Identities = 273/813 (33%), Positives = 394/813 (48%), Gaps = 107/813 (13%)
Query: 87 QGIPLGGIGAGSIGRSYRGEFQRFKLFHGICDDAPVLANQFSVFVSRPNGEKFSSVLCPR 146
G+PLGG GAG IGRS RG+F + + G + A QFSVF S + + L +
Sbjct: 40 HGMPLGGFGAGCIGRSSRGDFNLWHIDGGEHTFQNIPACQFSVFESHGTSSQ-AYALSTQ 98
Query: 147 SPGVPKKNTDSGIESWDWNLKGENCTYHALFPRAWTVYDGEPDPELRIVCRQISPFIPHN 206
+P D + W W TYHAL+PR+W VY E + ++ C Q SP +N
Sbjct: 99 AP------EDGSLAGWKW-YPASGGTYHALYPRSWFVY--ENVFQAQLTCEQFSPVWANN 149
Query: 207 YKESSFPASVFTFTLSNSGQTSADVTLLFTWANSVA-------------GDSGLSGHHFN 253
Y+E+S+P ++F + N ++++ TW N V D G + +
Sbjct: 150 YQETSYPVAIFLWKAHNPTDAPITLSIMLTWENMVGWFTNALKSPEVRIRDDGSPVYEYQ 209
Query: 254 SKTMTKDGVHGLTLHHRTANGRPPVTFAVAAEETADVHVSECPCFLLSGNSKGI------ 307
+ G + + + G F + A V + + + I
Sbjct: 210 PRLGESQGNYNQLIENSGQVG----CFLSQVDRDASVAEGDGSWCIATLKHPQIEVFHHT 265
Query: 308 ------TAKDMWNEIKKHGSFDHLDNDKTSPSEPGSSIGAAIAASLTIPSGSTRSVTFSL 361
T +D+W GS + + +P+ + IG AIA T+ G T + FSL
Sbjct: 266 RWNPVGTGEDVWQSFAADGSLPNYQD--ATPAIENTRIGGAIAIRFTLQPGQTIEIPFSL 323
Query: 362 AWDCPEVKFFEKV-YHRRYTKFYGTLGDSAARIAHDAILEHAKWECEIEAWQRPILEDKR 420
AWD P +F V Y+RRYT F+G G++A IA A+ E+ W +I+ WQ+PILE +
Sbjct: 324 AWDFPVTEFAAGVNYYRRYTDFFGRSGENAWAIASTALQEYQNWRSQIQRWQKPILERED 383
Query: 421 FPEWYPITLFNELYYLNAGGTIWTDGSPPMQSLATIRERKFSLDTPSKIHTPASSDTALG 480
P+W+ + LFNELY L +GGT+W+ S LD
Sbjct: 384 LPQWFKMALFNELYDLTSGGTLWSAAS--------------KLDP--------------- 414
Query: 481 TRLLENGEENIGQFLYLEGAEYVMYNTYDVHFYSSFALVMLFPKLELSIQRDFAAAVMMH 540
+GQF LE +Y Y + DV Y SFAL+MLFP+LE S+ R FA A+
Sbjct: 415 ----------VGQFAVLECLDYRWYESLDVRLYGSFALLMLFPELEKSVIRAFARAIPHS 464
Query: 541 DPGT------MKIMSDGKWVARKCLGAVPHDIGLDDP--WFEINSYNLFNSSRWKDLNSK 592
D I +D RK GA PHD+G + W + N + + WKDL S
Sbjct: 465 DAHQRVIGYYYTIGADTTTAVRKVPGATPHDLGAPNEHVWEKTNYTCYQDCNLWKDLGSD 524
Query: 593 FVLQVYRDFVATG--DKNFARAVWPSVYIAMAYMEQFDKDGDGMIENEGFPDQTYDAWSA 650
FVLQVYRDF+ TG D F WP++ + Y++ FD+D DG+ EN G PDQT+D W
Sbjct: 525 FVLQVYRDFLLTGADDVEFLTDCWPAIVETLNYLKTFDQDDDGIPENSGAPDQTFDDWRL 584
Query: 651 NGVSAYCGGLWVAALQAASALAN--------DVGDHA-SASYFWVRYQKAKAVYDS-LWN 700
GVSAYCGGLW+AAL+AA A++N D+G+ A S + +++ +Y+ LWN
Sbjct: 585 QGVSAYCGGLWLAALEAAIAISNILLNQNLADLGNLALQKSIYETWLAQSRPIYEEKLWN 644
Query: 701 GSYFNYDNSDGSSSTSIQADQLAGQWYARACGLLPIADEAKVKKALTKIYDFNVLKVKGG 760
G Y+ D+ GS + ADQL GQ+YAR GL I + AL +Y+ LK G
Sbjct: 645 GQYYRLDSKSGSD--VVMADQLCGQFYARLLGLPDIVASDRALSALQTVYEACFLKFYDG 702
Query: 761 MCGAMNGMQPDGRIDM-SGLQAREIWPGVTYGLAASMIQEEMVDMAFQTAAGVYEVAWSE 819
GA NG++PDG + E+W G+ +GLAA ++Q M D A + V + +
Sbjct: 703 QFGAANGVRPDGSPENPQATHPLEVWTGINFGLAAFLVQMGMQDEALKLTQAVVKQIYDN 762
Query: 820 DGLGYSFQTPESWNNNDEYRSLCYMRPLTIWAM 852
G F+TPE+ +R+ Y+R + IWA+
Sbjct: 763 ---GLQFRTPEAITTVGTFRASTYLRAMAIWAI 792
>gi|428773649|ref|YP_007165437.1| glucosylceramidase [Cyanobacterium stanieri PCC 7202]
gi|428687928|gb|AFZ47788.1| Glucosylceramidase [Cyanobacterium stanieri PCC 7202]
Length = 800
Score = 399 bits (1024), Expect = e-108, Method: Compositional matrix adjust.
Identities = 265/822 (32%), Positives = 389/822 (47%), Gaps = 110/822 (13%)
Query: 86 DQGIPLGGIGAGSIGRSYRGEFQRFKLFHGICDDAPVLANQFSVFVSRPNGEKFSSVLCP 145
+ G+PLGG GAG IGR+ G+F + + G + A QF+VF + L
Sbjct: 39 NHGMPLGGFGAGCIGRAPSGDFNLWHIDGGNHTFETLPACQFAVFEHSEGEDAQVYALST 98
Query: 146 RSPGVPKKNTDSGIESWDWNLKGENCTYHALFPRAWTVYDGEPDPELRIVCRQISPFIPH 205
+P D + SW W +G+ TY AL+PR+W Y+G +L VC Q SP
Sbjct: 99 NAP------EDGSLSSWSWYPQGKG-TYSALYPRSWFSYEGVFKSQL--VCEQFSPVWAG 149
Query: 206 NYKESSFPASVFTFTLSNSGQTSADVTLLFTWANSVA-------------GDSGLSGHHF 252
+Y+ESS+P +VF +++ N ++++ +W N+V D G + +
Sbjct: 150 SYQESSYPMAVFEWSIHNPTDKPITMSIMVSWQNTVGWFTNAIKSPTVKVRDDGSPEYEY 209
Query: 253 ---------NSKTMTKDGVHGLTLHHRTANGRPPV----TFAVAAEETADVHVSECPCFL 299
N +DG L R P +A+A + V +
Sbjct: 210 KPRWGNSTGNLNQWIQDGFRVGCLFDRIRLEDEPTEGEGQWAIATTTNPTLEV----FYH 265
Query: 300 LSGNSKGITAKDMWNEIKKHGSFDHLDNDKTSPSEPGSSIGAAIAASLTIPSGSTRSVTF 359
N KG ++W+ +GS ++ +P+EPG I A+A TI G T+ + F
Sbjct: 266 TRWNPKG-DGSEIWDRFAMNGSLPDFQDE--TPAEPGEQIAGAMAIRFTIRPGKTKKIPF 322
Query: 360 SLAWDCPEVKFFEKV-YHRRYTKFYGTLGDSAARIAHDAILEHAKWECEIEAWQRPILED 418
L WD P +F + V Y RRYT F+G G + + + H W+ +++ WQ PIL
Sbjct: 323 ILGWDFPVTEFAQGVNYFRRYTDFFGRNGRNVWSMIRTGLKHHDMWQNKVKMWQEPILSR 382
Query: 419 KRFPEWYPITLFNELYYLNAGGTIWTDGSPPMQSLATIRERKFSLDTPSKIHTPASSDTA 478
P+W + LFNELY L GGT+WT + D P
Sbjct: 383 SNLPDWLKMALFNELYLLTQGGTLWTAATE---------------DDP------------ 415
Query: 479 LGTRLLENGEENIGQFLYLEGAEYVMYNTYDVHFYSSFALVMLFPKLELSIQRDFAAAVM 538
+GQF LE +Y Y + DV Y SF L+ LFP+LE S+ F+ A+
Sbjct: 416 ------------VGQFGVLECIDYRWYESLDVRLYGSFGLLQLFPRLEKSVMEAFSRAIP 463
Query: 539 MHDPGTMKIMSDGKWVARKCLGAVPHDIGL--DDPWFEINSYNLFNSSRWKDLNSKFVLQ 596
D I +G RK GA PHD+G + PW + N + + + WKDL S FVL
Sbjct: 464 TADDTPRIIGYNGASAIRKAKGATPHDLGAPNEHPWIKSNYTSYQDCNLWKDLGSDFVLL 523
Query: 597 VYRDFVATG--DKNFARAVWPSVYIAMAYMEQFDKDGDGMIENEGFPDQTYDAWSANGVS 654
VYRD++ TG D++F W SV + Y++ FD D DG+ EN G PDQT+D W G+S
Sbjct: 524 VYRDYLYTGKNDQDFLWECWDSVVETLHYVKGFDLDNDGIPENSGAPDQTFDDWRLQGIS 583
Query: 655 AYCGGLWVAALQAASALANDVGDH-----------------ASASYFWVRYQKAKAVY-D 696
AYCGGLW+ AL+AA A+ + D+ + + F +++A+Y D
Sbjct: 584 AYCGGLWICALEAAIAIGTILLDNPPVNPALEKENAQENIQGAIALFRQWLHQSRAIYHD 643
Query: 697 SLWNGSYFNYDNSDGSSSTSIQADQLAGQWYARACGLLPIADEAKVKKALTKIYDFNVLK 756
+LWNG Y+N D+ GS + ADQL GQ+YA+ L + ++ V L KIYD LK
Sbjct: 644 TLWNGEYYNLDSKSGSD--VVMADQLCGQYYAQLLKLPDVVEKEYVTATLNKIYDACYLK 701
Query: 757 VKGGMCGAMNGMQPDGRIDMSG-LQAREIWPGVTYGLAASMIQEEMVDMAFQTAAGVYEV 815
G G NG++PDG + EIW G+ YG+ A MI M + + A V
Sbjct: 702 FHNGTLGIANGVKPDGSPEKENDTHPLEIWTGINYGIVAFMILNGMKEEGLKVAETVVNQ 761
Query: 816 AWSEDGLGYSFQTPESWNNNDEYRSLCYMRPLTIWAMQWALT 857
+S G F+TPE+ + +R+ Y+R + IWA+ AL+
Sbjct: 762 VYSN---GLQFRTPEAITAVNTFRASHYLRAMAIWAIYQALS 800
>gi|282897193|ref|ZP_06305195.1| Protein of unknown function DUF608 [Raphidiopsis brookii D9]
gi|281197845|gb|EFA72739.1| Protein of unknown function DUF608 [Raphidiopsis brookii D9]
Length = 798
Score = 398 bits (1022), Expect = e-108, Method: Compositional matrix adjust.
Identities = 274/824 (33%), Positives = 398/824 (48%), Gaps = 126/824 (15%)
Query: 87 QGIPLGGIGAGSIGRSYRGEFQRFKLFHGICDDAPVLANQFSVFVSRPNGEKFSSVLCPR 146
G+PLGG GAG IGRS RG+F + + G V QFS+F S G+ ++ P
Sbjct: 38 HGMPLGGFGAGCIGRSSRGDFNLWHIDGGEHVFKNVPGCQFSIFTS---GQAYALSTQP- 93
Query: 147 SPGVPKKNTDSGIESWDW---------NLKGENCTYHALFPRAWTVYDGEPDPELRIVCR 197
P+ NT +++W W K + TYHAL+PR+W VY E L+ C
Sbjct: 94 ----PEDNT---LQTWQWYPNVVHSSTGTKTDGGTYHALYPRSWFVY--ENLFPLQFTCE 144
Query: 198 QISPFIPHNYKESSFPASVFTFTLSNSGQTSADVTLLFTWANSVA-------------GD 244
Q SP HNY+E+S+P +VF + + N +++ +W N D
Sbjct: 145 QFSPVWAHNYRETSYPVAVFLWNIHNPTNQPITASIMLSWENMTGWFTNALKSPQVKIRD 204
Query: 245 SGLSGHHFNSKTMTKDG--------------VHGLTLHHRTANGRPPVTFAVAAEETADV 290
G + + + G V G +H G T+ +A + V
Sbjct: 205 DGSPVYEYQPRWGESQGNYNWLAENDQYLGCVLGRATNHPVQEGDG--TWCIATVKNPQV 262
Query: 291 HVSECPCFLLSGNSKGITAKDMWNEIKKHGSFDHLDNDKTSPSEPGSSIGAAIAASLTIP 350
+ F GN +++W + ++GS + ++ +P++ S +GAA+A TI
Sbjct: 263 ELFYHCQFNPVGN-----GEELWRDFSQNGSLSNYQDE--TPADVNSRLGAAVAVRFTIS 315
Query: 351 SGSTRSVTFSLAWDCPEVKFFEKV-YHRRYTKFYGTLGDSAARIAHDAILEHAKWECEIE 409
G T +V F L+WD P +F V Y+RRYT F+GT GD+A +IA A+ E W IE
Sbjct: 316 PGETFTVPFVLSWDFPVTEFAAGVNYYRRYTDFFGTTGDNAWQIATCALKECYNWRSHIE 375
Query: 410 AWQRPILEDKRFPEWYPITLFNELYYLNAGGTIWTDGSPPMQSLATIRERKFSLDTPSKI 469
+WQ+PILE + P W+ + LFNELY L +GGT+W+ G+
Sbjct: 376 SWQKPILEREDLPSWFKMALFNELYDLTSGGTLWSAGTE--------------------- 414
Query: 470 HTPASSDTALGTRLLENGEENIGQFLYLEGAEYVMYNTYDVHFYSSFALVMLFPKLELSI 529
++ +GQF LE +Y Y + DV Y SF L++LFP+LE ++
Sbjct: 415 ------------------KDPMGQFAVLECLDYRWYESLDVRLYGSFGLLILFPELEKAV 456
Query: 530 QRDFAAAVMMHDPGT------MKIMSDGKWVARKCLGAVPHDIGLDDPW-FEINSYNLF- 581
R FA A+ D + + I S RK GA PHD+G + W +E +Y +
Sbjct: 457 IRAFARAIPQSDDRSRIIGYYLTINSPSPVALRKVAGATPHDLGAPNEWVWEQTNYTSYQ 516
Query: 582 NSSRWKDLNSKFVLQVYRDFVATG--DKNFARAVWPSVYIAMAYMEQFDKDGDGMIENEG 639
+ + WKDL FVLQVYRDF+ TG D F W + + + Y++QFD D DG+ EN G
Sbjct: 517 DCNLWKDLGCDFVLQVYRDFLFTGGNDIEFLVDCWYGIVLTLDYLKQFDIDADGIPENSG 576
Query: 640 FPDQTYDAWSANGVSAYCGGLWVA----------ALQAASALANDVGDHASASYFWVRYQ 689
PDQT+D W +GVSAYCG LW+A L L+N VG+ S W+ Q
Sbjct: 577 APDQTFDDWRLSGVSAYCGALWLAALEAAIAICDVLMNHPELSN-VGEQRSIYENWLN-Q 634
Query: 690 KAKAVYDSLWNGSYFNYDNSDGSSSTSIQADQLAGQWYARACGLLPIADEAKVKKALTKI 749
LWNG ++ D+ GS+ + ADQL+GQ+YAR L I + + ALT I
Sbjct: 635 SLPVYQQKLWNGKFYRLDSESGSN--VVMADQLSGQFYARLLNLPDIVPQDRALSALTTI 692
Query: 750 YDFNVLKVKGGMCGAMNGMQPDGRIDM-SGLQAREIWPGVTYGLAASMIQEEMVDMAFQT 808
YD LK + G GA NG+ PDG + E+W G+ +GLAA ++Q M D +
Sbjct: 693 YDACFLKFQDGKFGAANGVLPDGSPENPQATHPLEVWTGINFGLAAFLLQMGMKDQGLRL 752
Query: 809 AAGVYEVAWSEDGLGYSFQTPESWNNNDEYRSLCYMRPLTIWAM 852
V + G F+TPE+ +R+ Y+RP+ IWA+
Sbjct: 753 TQAVVRQIYDH---GLQFRTPEAITAAGTFRASTYLRPMAIWAI 793
>gi|47212590|emb|CAG12815.1| unnamed protein product [Tetraodon nigroviridis]
Length = 777
Score = 398 bits (1022), Expect = e-108, Method: Compositional matrix adjust.
Identities = 238/613 (38%), Positives = 335/613 (54%), Gaps = 44/613 (7%)
Query: 47 QEFRHLAHIGFRLYRY-SKQEEAKGKVPVFNVFRKHHITSDQGIPLGGIGAGSIGRSYRG 105
Q +HL +G R R+ K+ + K P ++F + G PLGGIG G+I R +RG
Sbjct: 43 QVLKHLG-LGIRYLRWWYKKTRVEKKAPFIDMFGALPLRQIYGAPLGGIGGGTITRGWRG 101
Query: 106 EFQRFKLFHGICDDAPVLANQFSVFVSRPNGEKFSSVLCPRSPGVPKKNTDSGIESWDWN 165
EF R++L G V+ NQF+V + R + VL P ++ W+W
Sbjct: 102 EFCRWQLNPGKYHYKTVIENQFTVCLRRDGQTVYQQVLSVERPPT--------LQGWNWG 153
Query: 166 LKGENCTYHALFPRAWTVYDGEPDPELRIVCRQISPFIPHNYKESSFPASVFTFTLSNSG 225
GE YHAL+PRAWTVY P + + CRQISP IPH+Y+++S P +VF + + N
Sbjct: 154 FCGEYAYYHALYPRAWTVYH-LPGQNVTLTCRQISPVIPHDYQDASLPVAVFVWDVENKN 212
Query: 226 QTSADVTLLFTWANSVAGDSGLSGHHFNSK-TMTKDG--VHGLTLHHRTANGRPPVTFAV 282
+ DV+++FT N SG H+N + K+G V G+ LHH TA P T +
Sbjct: 213 DYALDVSIMFTALNGSGHRDDKSGGHWNQPFRLEKEGEAVSGVLLHHCTAVN--PYTLCI 270
Query: 283 AAEETADVHVSECPCFLLSGNSKGITAKDMWNEIKKHGSFDHLDNDKTSPSEPGSSIGAA 342
AA E D VS F G G +WN++ G D + P+ G + AA
Sbjct: 271 AAREQPDRDVSHQTAFSPKGTCNG-----LWNDLITDGRLDS-PTGSSPPTAKGERVAAA 324
Query: 343 IAASLTIPSGSTRSVTFSLAWDCPEVKFF--EKVYHRRYTKFYGTLGDSAARIAHDAILE 400
+AA + + + + FSLAWD P++ F E+ + RRYT++YG+ GD++ ++H A+ +
Sbjct: 325 LAAGCAVSAQNRNCLEFSLAWDMPKITFGAGEQQHSRRYTRYYGSKGDASPLLSHHALTQ 384
Query: 401 HAKWECEIEAWQRPILEDKRFPEWYPITLFNELYYLNAGGTIWTDGSPPMQSLATIRERK 460
+ WE IE WQ P+L+D P WY LFNELY++ GGT+WT+ + +R
Sbjct: 385 YRGWEKRIEEWQTPVLQDSSLPAWYKSALFNELYFVADGGTVWTELAEDADVSGGLR--- 441
Query: 461 FSLDTPSKIHTPASSDTALGTRLLENGEENIGQFLYLEGAEYVMYNTYDVHFYSSFALVM 520
P PA + G+F YLEG EY MYNTYDVH+Y+SFAL+M
Sbjct: 442 -----PEDGGLPAQPAVI----------KEYGRFAYLEGQEYRMYNTYDVHYYASFALIM 486
Query: 521 LFPKLELSIQRDFAAAVMMHDPGTMKIMSDGKWVARKCLGAVPHDIG--LDDPWFEINSY 578
L+PKL LS+Q D A +V+ DP + +G+ K G VPHDIG D+PW +N+Y
Sbjct: 487 LWPKLALSLQYDIAGSVVQQDPTERLYLMNGRCSPVKTKGVVPHDIGDPEDEPWQRVNAY 546
Query: 579 NLFNSSRWKDLNSKFVLQVYRDFVATGDKNFARAVWPSVYIAMAYMEQFDKDGDGMIENE 638
+ +++ WKDLN KFVLQVYRDF T D + R +WP + QFD DGDG+IEN
Sbjct: 547 LIHDTAGWKDLNLKFVLQVYRDFHLTQDAQYLRDMWPICEMVTESELQFDLDGDGLIENS 606
Query: 639 GFPDQTYDAWSAN 651
G+ DQTYD W+A+
Sbjct: 607 GYADQTYDGWAAS 619
Score = 124 bits (312), Expect = 2e-25, Method: Compositional matrix adjust.
Identities = 60/116 (51%), Positives = 77/116 (66%), Gaps = 2/116 (1%)
Query: 741 KVKKALTKIYDFNVLKVKGGMCGAMNGMQPDGRIDMSGLQAREIWPGVTYGLAASMIQEE 800
K++ AL I+D NV+ GG GA+NGM+P+G D S +Q+ E+W GV YGLAA+MI E
Sbjct: 663 KIQTALKSIFDLNVMSFAGGQMGAVNGMRPEGVPDRSSVQSDEVWIGVVYGLAATMIHEG 722
Query: 801 MVDMAFQTAAGVYEVAWSEDGLGYSFQTPESWNNNDEYRSLCYMRPLTIWAMQWAL 856
M++ +TA G Y W LG +FQTPE++ YRSL YMRPL IWAMQ AL
Sbjct: 723 MLEEGMRTAEGCYRTVWER--LGMAFQTPEAYCEKGIYRSLAYMRPLCIWAMQLAL 776
>gi|428307493|ref|YP_007144318.1| glucosylceramidase [Crinalium epipsammum PCC 9333]
gi|428249028|gb|AFZ14808.1| Glucosylceramidase [Crinalium epipsammum PCC 9333]
Length = 811
Score = 394 bits (1013), Expect = e-106, Method: Compositional matrix adjust.
Identities = 274/829 (33%), Positives = 396/829 (47%), Gaps = 126/829 (15%)
Query: 87 QGIPLGGIGAGSIGRSYRGEFQRFKLFHGICDDAPVLANQFSVFVSRPNGEKFSSVLCPR 146
G+PLGG GAG +GRS RG+F + + G + A QFSVF N + +C
Sbjct: 41 HGMPLGGFGAGCLGRSSRGDFNLWHIDGGEHTFKSLPACQFSVFEQAGNSAPQAYAMCTE 100
Query: 147 SPGVPKKNTDSGIESWDWNLKG-----ENCTYHALFPRAWTVYDGEPDPELRIVCRQISP 201
P D + +W W K + TYHAL+PR+W VYDG +L C Q SP
Sbjct: 101 PP------EDGTLSTWQWYPKTGKGTESSGTYHALYPRSWFVYDGVFQAQL--TCEQFSP 152
Query: 202 FIPHNYKESSFPASVFTFTLSNSGQTSADVTLLFTWANSVA-------------GDSGLS 248
NY+ESS+P VF +T N ++++ TW NSV D G
Sbjct: 153 IWADNYQESSYPVGVFEWTAHNPTDQPITLSIMLTWQNSVGWFTNAIKTPQVKVRDDGSP 212
Query: 249 GHHFNSKTMTKDG---------------VHGLTLHHRTANGRPPVTFAVAAEETADVHVS 293
+ + + G + + +H G A + +V
Sbjct: 213 VYEYQPRWKDSTGNLNQWIVDHFRAGCLLDRVRIHDEVQEGEGQWAIATITNPSVEV--- 269
Query: 294 ECPCFLLSGNSKGITAKDMWNEIKKHGSFDHLDNDKTSPSEPGSSIGAAIAASLTIPSGS 353
F + + + +++W+ GS D + +P++PG I AA+ T+ G
Sbjct: 270 ----FYHTRWNPAGSGEEIWDNFAFDGSLR--DKEDETPADPGEQIAAAMTIRFTVRPGR 323
Query: 354 TRSVTFSLAWDCPEVKFFEKV-YHRRYTKFYGTLGDSAARIAHDAILEHAKWECEIEAWQ 412
TR + F LAWD P +F V Y+RRYT F+G G++A I A+ W+ +I++WQ
Sbjct: 324 TRKIPFVLAWDFPVTEFATGVNYYRRYTDFFGRTGNNAWSIVRTALKHDDTWKEQIQSWQ 383
Query: 413 RPILEDKRFPEWYPITLFNELYYLNAGGTIWTDGSPPMQSLATIRERKFSLDTPSKIHTP 472
L++ P+W+ + LFNELY L GGT+WT
Sbjct: 384 ELTLKNANLPDWFKMALFNELYILADGGTLWT---------------------------- 415
Query: 473 ASSDTALGTRLLENGEENIGQFLYLEGAEYVMYNTYDVHFYSSFALVMLFPKLELSIQRD 532
A+S+T + IGQF LE +Y Y + DV Y SFAL ML+PKL+ ++
Sbjct: 416 AASET-----------DGIGQFGVLECLDYRWYESLDVRLYGSFALAMLWPKLDKAVLEA 464
Query: 533 FAAAVMMHDPGTMKIMSDGKWVARKCLGAVPHDIGL--DDPWFEINSYNLFNSSRWKDLN 590
+A A+ D I + RK A PHD+G + PW + N + + + WKDL
Sbjct: 465 YARAIPTSDNTPRIIGYNQASAIRKAANATPHDLGAPNEHPWEKTNYTSYQDCNLWKDLG 524
Query: 591 SKFVLQVYRDFVATG--DKNFARAVWPSVYIAMAYMEQFDKDGDGMIENEGFPDQTYDAW 648
S FVLQVYRD++ TG D F W ++ +AY+++FD DGDG+ EN G PDQT+D W
Sbjct: 525 SDFVLQVYRDYLLTGATDIEFLSECWSAIVKTLAYLKEFDLDGDGIPENSGAPDQTFDDW 584
Query: 649 SANGVSAYCGGLWVAALQAASALANDVGDH-----------ASASY-FWVRYQKAKAVY- 695
GVSAYCGGLW+AAL+AA A+ + +H A A+Y W+ +++K +Y
Sbjct: 585 RLQGVSAYCGGLWLAALEAAIAIGEILINHGVELEDGDFESAIATYKSWL--EQSKPLYQ 642
Query: 696 DSLWNGSYFNYDNSDGSSSTSIQADQLAGQWYARACGLLPIADEAKVKKALTKIYDFNVL 755
+ LWNG Y+ D+ GS+ + ADQL GQ+YAR L I V+ AL +YD L
Sbjct: 643 EKLWNGQYYQLDSESGSN--VVMADQLCGQFYARLLNLPDIVPNDCVESALKTVYDSCFL 700
Query: 756 KVKGGM-----------CGAMNGMQPDGRIDM-SGLQAREIWPGVTYGLAASMIQEEMVD 803
K + GA NG+ DG + E+W G+ +GLAA MIQ M D
Sbjct: 701 KFNQKLSQHSESLTPTFIGAANGVLADGSPENPQATHPLEVWTGINFGLAAFMIQMGMKD 760
Query: 804 MAFQTAAGVYEVAWSEDGLGYSFQTPESWNNNDEYRSLCYMRPLTIWAM 852
AFQ V + ++ G F+TPE+ +R+ Y+R + IWA+
Sbjct: 761 EAFQLTEAVVQQIYNN---GLQFRTPEAITAAGTFRASHYLRAMAIWAI 806
>gi|427733995|ref|YP_007053539.1| putative bile acid beta-glucosidase [Rivularia sp. PCC 7116]
gi|427369036|gb|AFY52992.1| putative bile acid beta-glucosidase [Rivularia sp. PCC 7116]
Length = 799
Score = 394 bits (1013), Expect = e-106, Method: Compositional matrix adjust.
Identities = 266/820 (32%), Positives = 392/820 (47%), Gaps = 118/820 (14%)
Query: 87 QGIPLGGIGAGSIGRSYRGEFQRFKLFHGICDDAPVLANQFSVFVSRPNGEKFSSVLCPR 146
QG+PLGG GAG IGRS RG+F + + G + A QFS+F S N ++
Sbjct: 40 QGMPLGGFGAGCIGRSSRGDFNLWHIDGGEHTFGNIPACQFSLFESSSNRNVAYAL---- 95
Query: 147 SPGVPKKNTDSGIESWDW---NLKGENC-TYHALFPRAWTVYDGEPDPELRIVCRQISPF 202
+ + DS + +W+W KG++ TYHAL+PR+W VY E + R+ C Q SP
Sbjct: 96 ---STEPSNDSTLSAWEWYPSRSKGDSSGTYHALYPRSWFVY--ENVFQSRLSCEQFSPI 150
Query: 203 IPHNYKESSFPASVFTFTLSNSGQTSADVTLLFTWANSVA-------------------- 242
NY+ESS+P ++F + SN ++++ TW N V
Sbjct: 151 WADNYQESSYPVAIFLWNASNPTSEPVTLSIMLTWQNMVGWFTNTLKSPEVEMRDDGSPV 210
Query: 243 ----GDSGLSGHHFNSKTMTKDGVHGLTLHHRTANGRP---PVTFAVAAEETADVHVSEC 295
+ G S +FN + +T G L + P ++ +A + V +
Sbjct: 211 YDYQPNLGESKGNFN-RLVTNSEYIGCVLD-KIGTAEPIEGEGSWCIATAKHPKVEI--- 265
Query: 296 PCFLLSGNSKGITAKDMWNEIKKHGSFDHLDNDKTSPSEPGSSIGAAIAASLTIPSGSTR 355
F + + T +D+W K GS ++ ++ +P+ +GAAIA T+ G +
Sbjct: 266 --FHHNRWNPTSTGEDIWRSFAKTGSLANISDE--TPAISNEQVGAAIAVRFTLQPGESL 321
Query: 356 SVTFSLAWDCPEVKFFEKV-YHRRYTKFYGTLGDSAARIAHDAILEHAKWECEIEAWQRP 414
+ F++AWD P +F V Y RRYT F+G G +A IA A+ E+ W +++AWQ+P
Sbjct: 322 QIPFAVAWDFPITEFVSGVTYKRRYTDFFGDNGKNAWNIATTALEEYQNWLSQVQAWQQP 381
Query: 415 ILEDKRFPEWYPITLFNELYYLNAGGTIWTDGSPPMQSLATIRERKFSLDTPSKIHTPAS 474
I+E P+W+ + LFNELY L AGGT+W+ S
Sbjct: 382 IVERPDLPDWFKMALFNELYDLTAGGTLWSAASK-------------------------- 415
Query: 475 SDTALGTRLLENGEENIGQFLYLEGAEYVMYNTYDVHFYSSFALVMLFPKLELSIQRDFA 534
+ IGQF LE +Y Y + DV Y SF L+ LFP+LE ++ R FA
Sbjct: 416 -------------RDPIGQFAVLECLDYRWYESLDVRLYGSFGLLNLFPELEKAVMRAFA 462
Query: 535 AAVMMHDPGT------MKIMSDGKWVARKCLGAVPHDIGLDDP--WFEINSYNLFNSSRW 586
A+ D T I ++ RK GA PHD+G + W + N + + ++W
Sbjct: 463 RAIPESDEKTRVIGYYYTIGAESPLAVRKAAGATPHDLGAPNEHVWEKTNYTSYQDCNQW 522
Query: 587 KDLNSKFVLQVYRDFVATG--DKNFARAVWPSVYIAMAYMEQFDKDGDGMIENEGFPDQT 644
KDL FVLQVYRDF+ TG D F W ++ + Y++ FDKDGDG+ EN G PDQT
Sbjct: 523 KDLGCDFVLQVYRDFIFTGSNDIEFLADCWNAIVETLNYLKAFDKDGDGIPENSGAPDQT 582
Query: 645 YDAWSANGVSAYCGGLW----------VAALQAASALANDVGDHASASYFWVRYQKAKAV 694
+D W GVSAYCGGLW L + ++ S W+ +K++ V
Sbjct: 583 FDDWRLLGVSAYCGGLWLAALEAAIAIAEVLSSKQVQVENLAAQKSIYEGWL--EKSRPV 640
Query: 695 Y-DSLWNGSYFNYDNSDGSSSTSIQADQLAGQWYARACGLLPIADEAKVKKALTKIYDFN 753
Y + LWNG ++ D+ GS + ADQL GQ+YAR L I AL +YD
Sbjct: 641 YQEKLWNGKFYRLDSESGSD--VVMADQLCGQFYARLLKLPDIVPPECTMSALNSVYDAC 698
Query: 754 VLKVKGGMCGAMNGMQPDGRIDM-SGLQAREIWPGVTYGLAASMIQEEMVDMAFQTAAGV 812
LK G GA NG++ DG + E+W G+ +G+AA ++Q M AF+ V
Sbjct: 699 FLKFNQGKLGAANGLRLDGSPENPDATHPLEVWTGINFGIAAFLVQMGMKSEAFKLTEAV 758
Query: 813 YEVAWSEDGLGYSFQTPESWNNNDEYRSLCYMRPLTIWAM 852
+ G F+TPE+ +R+ Y+R + IWA+
Sbjct: 759 VNQIYEH---GLQFRTPEAITATRNFRACVYLRAMAIWAI 795
>gi|170079447|ref|YP_001736085.1| hypothetical protein SYNPCC7002_A2862 [Synechococcus sp. PCC 7002]
gi|169887116|gb|ACB00830.1| Conserved hypothetical protein (DUF608) [Synechococcus sp. PCC
7002]
Length = 803
Score = 394 bits (1012), Expect = e-106, Method: Compositional matrix adjust.
Identities = 277/822 (33%), Positives = 386/822 (46%), Gaps = 114/822 (13%)
Query: 87 QGIPLGGIGAGSIGRSYRGEFQRFKLFHGICDDAPVLANQFSVFVSRPNGEKFSSVLCPR 146
G+PLGG GAG IGRS RG+F + L G A QFSVF +P GE +
Sbjct: 41 HGMPLGGFGAGCIGRSPRGDFNLWHLDAGEHTFRNAGACQFSVF-EQPEGEAAQAYAM-- 97
Query: 147 SPGVPKKNTDSGIESWDWNLKGENCTYHALFPRAWTVYDGEPDPELRIVCRQISPFIPHN 206
S P+ T ++SW W + TY AL+PR+W Y G EL C Q SP I N
Sbjct: 98 STVAPEDGT---LDSWQW-YPAKKGTYAALYPRSWYEYQGAFAAEL--TCEQFSPIIAQN 151
Query: 207 YKESSFPASVFTFTLSNSGQTSADVTLLFTWANSVA-------------GDSGLSGHHFN 253
Y+E+S+P +VF +T N V+++ TW N V D G + +
Sbjct: 152 YQETSYPVAVFEWTAHNPTDKPITVSIMMTWQNMVGWFTKAQKTPEVVLRDDGSPVYEYQ 211
Query: 254 SKTMTKDG---------------VHGLTLHHRTANGRPPVTFAVAAEETADVHVSECPCF 298
S+ G ++ + H + G + FA T +V
Sbjct: 212 SRWGDSTGNLNQWVQDRYRVGCLLNRVRPHADLSEGEGQLCFATVTNPTLEVFYHN---- 267
Query: 299 LLSGNSKGITAKDMWNEIKKHGSFDHLDNDKTSPSEPGSSIGAAIAASLTIPSGSTRSVT 358
N G +D+W+ +GS ++++ +P+ PG I AIA T+ G T+ +
Sbjct: 268 --RWNPVG-DGRDVWDYFAGNGSLPDIEDE--TPAAPGEQIAGAIAVRFTLRPGKTKKIP 322
Query: 359 FSLAWDCPEVKFFEKV-YHRRYTKFYGTLGDSAARIAHDAILEHAKWECEIEAWQRPILE 417
F LAWD P ++F + Y RRYT F+G G++ + A+ W+ I AWQ+PIL
Sbjct: 323 FFLAWDLPVMEFAPDITYFRRYTDFFGRSGNNVWTMIRTAMKHSDLWKERIAAWQQPILG 382
Query: 418 DKRFPEWYPITLFNELYYLNAGGTIWTDGSPPMQSLATIRERKFSLDTPSKIHTPASSDT 477
P+W + LFNELY L GG++WT +
Sbjct: 383 RDDLPDWLKMALFNELYLLTQGGSLWTAAT------------------------------ 412
Query: 478 ALGTRLLENGEENIGQFLYLEGAEYVMYNTYDVHFYSSFALVMLFPKLELSIQRDFAAAV 537
EN + +GQF LE +Y Y + DV Y SF L ML+P+L+ ++ FA A+
Sbjct: 413 -------EN--DPVGQFGVLECLDYCWYESLDVRMYGSFGLTMLWPRLDKAVLEAFARAI 463
Query: 538 MMHDPGTMKIMSDGKWVARKCLGAVPHDIGL--DDPWFEINSYNLFNSSRWKDLNSKFVL 595
D I + RK GA PHD+G + PW N + + WKDL S FVL
Sbjct: 464 PTSDDTQRMIGYNKTMATRKVAGATPHDLGAPNEHPWEATNYTAYQDCNLWKDLGSDFVL 523
Query: 596 QVYRDFVATG--DKNFARAVWPSVYIAMAYMEQFDKDGDGMIENEGFPDQTYDAWSANGV 653
QVYRDFV TG D F + W SV A+ Y++ FD DGDG+ EN G PDQT+D W G+
Sbjct: 524 QVYRDFVMTGAEDTEFLWSCWGSVLTALDYLKTFDLDGDGIPENGGAPDQTFDDWKLKGI 583
Query: 654 SAYCGGLWVAA----------LQAASALANDVGDHASASYF---WVRYQ----KAKAVY- 695
SAYCGGLW+AA LQ + + + SA+ F YQ +A+ +Y
Sbjct: 584 SAYCGGLWIAALEAAIAMAKILQQSPSHFEAHQNQVSAAEFEQAIATYQTWLDQARPLYQ 643
Query: 696 DSLWNGSYFNYDNSDGSSSTSIQADQLAGQWYARACGLLPIADEAKVKKALTKIYDFNVL 755
D LWNGSY+ D GS+S + ADQL GQ+Y + L + + L IY
Sbjct: 644 DILWNGSYYTLDT--GSNSRVVMADQLCGQYYTQLLSLPDVNPGDRTATTLATIYTACFE 701
Query: 756 KVKGGMCGAMNGMQPDGRIDMSG-LQAREIWPGVTYGLAASMIQEEMVDMAFQTAAGVYE 814
K GG GA NG+ PDG + E+W G+ +G+AA MI+ M A + V E
Sbjct: 702 KFHGGQFGAANGLNPDGTPEKENDTHPLEVWTGINFGIAALMIRNGMQTEALRMVEAVVE 761
Query: 815 VAWSEDGLGYSFQTPESWNNNDEYRSLCYMRPLTIWAMQWAL 856
+ G F+TPE+ N +R+ Y+R + IWA+ AL
Sbjct: 762 QVYDN---GLQFRTPEAITANGTFRACHYLRAMGIWAIYDAL 800
>gi|220909879|ref|YP_002485190.1| hypothetical protein Cyan7425_4519 [Cyanothece sp. PCC 7425]
gi|219866490|gb|ACL46829.1| protein of unknown function DUF608 [Cyanothece sp. PCC 7425]
Length = 814
Score = 392 bits (1008), Expect = e-106, Method: Compositional matrix adjust.
Identities = 277/824 (33%), Positives = 402/824 (48%), Gaps = 118/824 (14%)
Query: 87 QGIPLGGIGAGSIGRSYRGEFQRFKLFHGICDDAPVLANQFSVF--VSRPNGEKFS-SVL 143
G+PLGG GAG +GRS+RG+F + + G + A QFSVF V + G+ +S S +
Sbjct: 42 HGMPLGGFGAGCVGRSHRGDFNLWHIDGGEHVFQSLPACQFSVFEQVGQSPGKAYSLSTV 101
Query: 144 CPRSPGVPKKNTDSGIESWDWNLKGENCTYHALFPRAWTVYDGEPDPELRIVCRQISPFI 203
P D + SW W + +YHAL+PR+W VY +L VC Q SP +
Sbjct: 102 APE---------DGSLNSWQWYPQ-SGGSYHALYPRSWFVYQNVFQADL--VCEQFSPIL 149
Query: 204 PHNYKESSFPASVFTFTLSNSGQTSADVTLLFTWANSVA-------------GDSGLSGH 250
NY+E+S+P +VF +T N ++L+ TW N V D G +
Sbjct: 150 TENYQETSYPVAVFHWTAHNPTDQPLTLSLMLTWQNVVGWFTNADKSPVVKVRDDGSPVY 209
Query: 251 HFNSKTMTKDGVHGLTLHHRTANGRPPVTFAVAAEETADVHVSE---------------C 295
+ + G + NGR V F + + + ++E
Sbjct: 210 TYQPRLGESQGNFNRVIDD---NGR--VGFVLDRLRSEEAAIAEGEGQWAVATFASPYTS 264
Query: 296 PCFLLSGNSKGITAKDMWNEIKKHGSFDHLDNDKTSPSEPGSSIGAAIAASLTIPSGSTR 355
F + + ++W GS + D++ +P+ G I AAIA T+P G TR
Sbjct: 265 EVFYYTRWNPVGDGSELWQTFAADGSLANCDDE--TPAGVGEQIAAAIAVRFTLPPGQTR 322
Query: 356 SVTFSLAWDCPEVKFFEKV-YHRRYTKFYGTLGDSAARIAHDAILEHAKWECEIEAWQRP 414
+ F LAWD P +F V Y RRYT F+G GD IA + H W +I+AWQ+P
Sbjct: 323 QIPFVLAWDFPVTEFAAGVNYFRRYTDFFGRTGDHGWAIARTGLKMHQTWTDQIQAWQQP 382
Query: 415 ILEDKRFPEWYPITLFNELYYLNAGGTIWTDGSPPMQSLATIRERKFSLDTPSKIHTPAS 474
IL D P+W+ + L NELY L +GGT+W+ ER
Sbjct: 383 IL-DSNEPDWFKMALLNELYDLTSGGTLWSAAD----------ER--------------- 416
Query: 475 SDTALGTRLLENGEENIGQFLYLEGAEYVMYNTYDVHFYSSFALVMLFPKLELSIQRDFA 534
+ IGQF LE +Y Y + DV Y SFAL++L+P+LE ++ FA
Sbjct: 417 --------------DPIGQFAVLECLDYRWYESLDVRLYGSFALLLLWPRLEKAVMEAFA 462
Query: 535 AAVMMHDPGTMKI---MSDG---KWVARKCLGAVPHDIGL--DDPWFEINSYNLFNSSRW 586
A+ D T I + G K A PHD+G + PW + N + + + W
Sbjct: 463 RAIPTQDDRTRVIGYYYTQGMPSPTALHKIANATPHDLGAPNEHPWEKTNYTSYQDCNLW 522
Query: 587 KDLNSKFVLQVYRDFVATG--DKNFARAVWPSVYIAMAYMEQFDKDGDGMIENEGFPDQT 644
KDL + FV+ VYRD++ TG D +F WP+++ +AY++ FD DGDG+ EN G PDQT
Sbjct: 523 KDLPADFVILVYRDYLLTGGTDYDFLTTCWPAIHATLAYLKTFDLDGDGIPENSGAPDQT 582
Query: 645 YDAWSANGVSAYCGGLWVAALQAASALANDVGDHA--------SASYFWVRYQKAKAVY- 695
+D W G+SAYCGGLW+AAL+AA A+ + A S + + +++ +Y
Sbjct: 583 FDDWRLQGISAYCGGLWIAALEAAIAIIKLLLVQAPDLPHLQESLTTYQTWLAQSRPLYQ 642
Query: 696 DSLWNGSYFNYDNSDGSSSTSIQADQLAGQWYARACGLLPIADEAKVKKALTKIYDFNVL 755
+LWNG Y+ D+ GS S + ADQL GQ+YAR GL + E + AL +Y L
Sbjct: 643 QTLWNGQYYRLDS--GSGSDVVMADQLCGQFYARLVGLPDVVPEDCARAALDMVYQTCFL 700
Query: 756 KVKGGMCGAMNGMQPDGRIDM-SGLQAREIWPGVTYGLAASMIQEEMVDMAFQ-TAAGVY 813
+ G GA NG++P G + E+W G+ +GLAA ++Q A Q TAA VY
Sbjct: 701 QFHNGTLGAANGLRPGGLAEKPDATHPLEVWTGINFGLAAFLVQMGRKAEAMQLTAAVVY 760
Query: 814 EVAWSEDGLGYSFQTPESWNNNDEYRSLCYMRPLTIWAMQWALT 857
++ E GL F+TPE+ +R+ Y+RP+ IWA+ LT
Sbjct: 761 QI--YEHGL--QFRTPEAITAEQTFRAGMYLRPMAIWAIYHLLT 800
>gi|17230308|ref|NP_486856.1| hypothetical protein alr2816 [Nostoc sp. PCC 7120]
gi|17131910|dbj|BAB74515.1| alr2816 [Nostoc sp. PCC 7120]
Length = 804
Score = 392 bits (1007), Expect = e-106, Method: Compositional matrix adjust.
Identities = 273/820 (33%), Positives = 390/820 (47%), Gaps = 114/820 (13%)
Query: 87 QGIPLGGIGAGSIGRSYRGEFQRFKLFHG--ICDDAPVLANQFSVFVSRPNGEKFSSVLC 144
G+PLGG GAG IGRS RG+F + + G I + P A QFSVF NG +
Sbjct: 40 HGMPLGGFGAGCIGRSSRGDFNLWHIDGGEHIFQNFP--ACQFSVF--EANGTSSQAYAL 95
Query: 145 PRSPGVPKKNTDSGIESWDWNLKGENC----TYHALFPRAWTVYDGEPDPELRIVCRQIS 200
P D ++SW W TYHAL+PR+W VY+ EL C Q S
Sbjct: 96 STQP-----TDDESLKSWQWYPASSETQTTGTYHALYPRSWFVYENVFQAEL--TCEQFS 148
Query: 201 PFIPHNYKESSFPASVFTFTLSNSGQTSADVTLLFTWANSVA-------------GDSGL 247
P +NY+E+S+P +VF + N ++++ TW N V D G
Sbjct: 149 PIWANNYQETSYPVAVFAWQAHNPTNAPITLSIMLTWQNMVGWFTNALKSPDVRVRDDGS 208
Query: 248 SGHHFNSKTMTKDGVHGLT----LHHRTANGRPPVTFAVAAEE----TADVHVSECPCFL 299
+ + + G + H GR +T + E A + S+ F
Sbjct: 209 PVYEYQPRLGESGGNYNYLDESPQHLGCFLGRVGMTEPLQEGEGSWCIATLKHSQVEVFY 268
Query: 300 LSGNSKGITAKDMWNEIKKHGSF-DHLDNDKTSPSEPGSSIGAAIAASLTIPSGSTRSVT 358
+ + T +++W GS +H+D + +E +GAAIA T+ G T +
Sbjct: 269 HTRWNPVGTGEEVWQSFAADGSLANHIDTSSVTENE---QLGAAIAVRFTLQPGETLEIP 325
Query: 359 FSLAWDCPEVKFFEKV-YHRRYTKFYGTLGDSAARIAHDAILEHAKWECEIEAWQRPILE 417
F ++WD P +F V Y+RRYT F+G GD A IA A+ ++ W+ +IE+WQ PIL
Sbjct: 326 FVVSWDLPVTEFAAGVNYYRRYTDFFGKTGDHAWAIATTALEQYKTWQQQIESWQDPILN 385
Query: 418 DKRFPEWYPITLFNELYYLNAGGTIWTDGSPPMQSLATIRERKFSLDTPSKIHTPASSDT 477
P+W+ + LFNELY L +GGT+W+ +P
Sbjct: 386 RPDLPDWFKMALFNELYDLTSGGTLWSAATPC---------------------------- 417
Query: 478 ALGTRLLENGEENIGQFLYLEGAEYVMYNTYDVHFYSSFALVMLFPKLELSIQRDFAAAV 537
+ IGQF LE +Y Y + DV Y SF L+ LFP+LE ++ R FA A+
Sbjct: 418 -----------DPIGQFAVLECLDYRWYESLDVRLYGSFGLLQLFPELEKAVMRAFARAI 466
Query: 538 MMHD--PGTM----KIMSDGKWVARKCLGAVPHDIGLDDP--WFEINSYNLFNSSRWKDL 589
D P + I ++ RK GA PHD+G + W + N + + + WKDL
Sbjct: 467 PQGDDTPRVIGYYYTIGAESPIAVRKTPGATPHDLGAPNEHVWEKTNYTSYQDCNLWKDL 526
Query: 590 NSKFVLQVYRDFVATG--DKNFARAVWPSVYIAMAYMEQFDKDGDGMIENEGFPDQTYDA 647
FVLQVYRDF+ TG D F R W ++ + Y++ FD DGDG+ EN G PDQT+D
Sbjct: 527 GCDFVLQVYRDFLLTGADDVQFLRDCWDAIVETLDYVKTFDLDGDGIPENSGAPDQTFDD 586
Query: 648 WSANGVSAYCGGLW----VAALQAASALANDVGDHASASYFWVRYQKAK---------AV 694
W GVSAYCGGLW AA+ + L + D A+ + + YQK+ +
Sbjct: 587 WRLQGVSAYCGGLWLAALAAAIAISDILLQNHQDSATKEH--LLYQKSTYETWLTKSLPI 644
Query: 695 Y-DSLWNGSYFNYDNSDGSSSTSIQADQLAGQWYARACGLLPIADEAKVKKALTKIYDFN 753
Y + LWNG Y+ D+ GS + ADQL GQ+YA L I + AL +YD
Sbjct: 645 YQEKLWNGKYYRLDSESGSD--VVMADQLCGQFYANLLELPDIVPSDRAISALQTVYDAC 702
Query: 754 VLKVKGGMCGAMNGMQPDGRIDM-SGLQAREIWPGVTYGLAASMIQEEMVDMAFQTAAGV 812
LK G GA NG++PDG + E+W G+ +GLAA ++Q M D F+ V
Sbjct: 703 FLKFYDGQFGAANGVRPDGSPENPKATHPLEVWTGINFGLAAFLVQMGMKDEGFRLTQAV 762
Query: 813 YEVAWSEDGLGYSFQTPESWNNNDEYRSLCYMRPLTIWAM 852
E ++ G F+TPE+ +R+ Y+RP+ IWA+
Sbjct: 763 VEQIYNN---GLQFRTPEAITAAGTFRASTYLRPMAIWAI 799
>gi|428223619|ref|YP_007107716.1| glucosylceramidase [Geitlerinema sp. PCC 7407]
gi|427983520|gb|AFY64664.1| Glucosylceramidase [Geitlerinema sp. PCC 7407]
Length = 821
Score = 392 bits (1006), Expect = e-106, Method: Compositional matrix adjust.
Identities = 280/831 (33%), Positives = 397/831 (47%), Gaps = 122/831 (14%)
Query: 87 QGIPLGGIGAGSIGRSYRGEFQRFKLFHGICDDAPVLANQFSVFVSRPNGEKFSSVLCPR 146
G+PLGG GAG +GRS RG+F + + G + A QFSVF +G + LC
Sbjct: 39 HGMPLGGFGAGCLGRSPRGDFNLWHIDGGEHVFQSMPACQFSVFEQGADGTSQAYALCSE 98
Query: 147 SPGVPKKNTDSGIESWDW----NLKGENCTYHALFPRAWTVYDGEPDPELRIVCRQISPF 202
+P + W W + TY AL+PR + +D E R+ C Q SP
Sbjct: 99 APA-------ESLGQWQWYPASMAERSTGTYSALYPR--SRFDYEGVFRARLACEQFSPI 149
Query: 203 IPHNYKESSFPASVFTFTLSNSGQTSADVTLLFTWANSVA-------------GDSGLSG 249
NY+ESS+P +VF +T N +++L TW N V D G
Sbjct: 150 WADNYQESSYPVAVFEWTAHNPTDQLLTLSILLTWQNMVGWFTNALKSPEVKVRDDGSPV 209
Query: 250 HHF---------NSKTMTKDGVHGLTLHHRTANGRPPV----TFAVAAEETADVHVSECP 296
+ + N +G L R P +A+A + E P
Sbjct: 210 YEYQPRLGESEGNFNQWIAEGAWVGCLMDRAGRAEEPQEGEGQWAIATQ-------CEGP 262
Query: 297 CFLLSGNSKGITAKD---MWNEIKKHGSFDHLDNDKTSPSEPGSSIGAAIAASLTIPSGS 353
+ +++ A D +W + G+ + + +P++ IG AIA T+ G
Sbjct: 263 GTEIFYHTRWNPAGDGGELWRTFAQTGALE--NRIDGTPAQGEEQIGGAIALRFTLAPGE 320
Query: 354 TRSVTFSLAWDCPEVKFFE-KVYHRRYTKFYGTLGDSAARIAHDAILEHAKWECEIEAWQ 412
TR V F LAWD P +F +RRYT F+G G + IA A+ E++ W+ I AWQ
Sbjct: 321 TRRVPFVLAWDFPVTEFAAGTAAYRRYTDFFGRSGRNGWAIARTALEEYSAWQARIAAWQ 380
Query: 413 RPILEDKRFPEWYPITLFNELYYLNAGGTIWTDGSPPMQSLATIRERKFSLDTPSKIHTP 472
+PIL+ P W+ + LFNELY L +GGT+W+
Sbjct: 381 QPILDRADLPNWFKMALFNELYDLTSGGTLWS---------------------------- 412
Query: 473 ASSDTALGTRLLENGEENIGQFLYLEGAEYVMYNTYDVHFYSSFALVMLFPKLELSIQRD 532
A+SD + +GQF LE +Y Y + DV Y SF L+ML+P+LE S+ R
Sbjct: 413 AASD-----------RDPVGQFAVLECIDYRWYESLDVRLYGSFGLLMLWPELEKSVMRA 461
Query: 533 FAAAVMMHDPGTMKIM------SDGKWVARKCLGAVPHDIGL--DDPWFEINSYNLFNSS 584
FA A+ D I ++ RK L A PHD+G + PW + N + + +
Sbjct: 462 FARAIPTADDRQRVIGYYYTQGTESPLALRKALDATPHDLGAPNEHPWEKTNYTSYQDCN 521
Query: 585 RWKDLNSKFVLQVYRDFVATG--DKNFARAVWPSVYIAMAYMEQFDKDGDGMIENEGFPD 642
WKDL S FVLQVYRDF TG D F WP+V A+AY++ FD+DGDG+ EN G PD
Sbjct: 522 LWKDLGSDFVLQVYRDFRLTGSTDDAFIAECWPAVTRALAYLKAFDQDGDGIPENSGAPD 581
Query: 643 QTYDAWSANGVSAYCGGLWVAALQAASALA-----NDVGDHASASYFWVRYQ-------- 689
QT+D W GVSAYCGGLW+AAL+AA AL + V DH S + +
Sbjct: 582 QTFDDWRLQGVSAYCGGLWIAALEAAIALGERLQQHPVSDHPELSQGAIAHHLSTYRTWL 641
Query: 690 -KAKAVY-DSLWNGSYFNYDNSDGSSSTSIQADQLAGQWYARACGLLPIADEAKVKKALT 747
+++AVY D+LWNG Y+ D+ GS S + ADQL GQ+YAR L + + AL
Sbjct: 642 AQSRAVYDDALWNGRYYRLDS--GSGSEVVMADQLCGQFYARLLDLPDVVASDRALSALK 699
Query: 748 KIYDFNVLKVKGGMCGAMNGMQPDGRIDM-SGLQAREIWPGVTYGLAASMIQEEMVDMAF 806
+YD L+ G GA NG++PDG + E+W G+ +G+AA ++Q + D F
Sbjct: 700 TVYDSCFLRFHDGKFGAANGVKPDGSPENPKATHPLEVWTGINFGIAAFLVQMGLRDEGF 759
Query: 807 QTAAGVYEVAWSEDGLGYSFQTPESWNNNDEYRSLCYMRPLTIWAMQWALT 857
+ V E + G F+TPE+ +R+ Y+RP+ IWA+ LT
Sbjct: 760 RLTQAVVEQIYEH---GLQFRTPEAITAVGTFRACHYLRPMAIWAVYGVLT 807
>gi|434407175|ref|YP_007150060.1| putative bile acid beta-glucosidase [Cylindrospermum stagnale PCC
7417]
gi|428261430|gb|AFZ27380.1| putative bile acid beta-glucosidase [Cylindrospermum stagnale PCC
7417]
Length = 816
Score = 391 bits (1005), Expect = e-106, Method: Compositional matrix adjust.
Identities = 270/828 (32%), Positives = 390/828 (47%), Gaps = 122/828 (14%)
Query: 87 QGIPLGGIGAGSIGRSYRGEFQRFKLFHGICDDAPVLANQFSVFVSRPNGEKFSSVLCPR 146
G+PLGG GAG IGRS RG+F + + G + A QFSVF S + + L
Sbjct: 40 HGMPLGGFGAGCIGRSSRGDFNLWHIDGGEHTFKNIPACQFSVFESHGASSQ-AYALSTE 98
Query: 147 SPGVPKKNTDSGIESWDWNLKG------------ENCTYHALFPRAWTVYDGEPDPELRI 194
P D + +W W + + TYHAL+PR+W VY+ +L
Sbjct: 99 PP------EDGTLSAWQWYPRSAESKAGKMPTPQDTGTYHALYPRSWFVYENVFQAQLS- 151
Query: 195 VCRQISPFIPHNYKESSFPASVFTFTLSNSGQTSADVTLLFTWANSVA------------ 242
C Q SP +NY+E+S+P +VF + N ++++ TW N V
Sbjct: 152 -CEQFSPVWANNYQETSYPVAVFNWKAHNPTDAPITLSIMLTWENMVGWFTNALKSPKVQ 210
Query: 243 -GDSGLSGHHFNSKTMTKDGVHGLTLHH---------RTANGRPPV----TFAVAAEETA 288
D G + + + G + + + R A+ P T+ +A +
Sbjct: 211 IRDDGSPVYEYQPRLGKSQGNYHQLIENTQQVGCVLGRVASDAPVQEGDGTWCIATRKHP 270
Query: 289 DVHVSECPCFLLSGNSKGITAKDMWNEIKKHGSFDHLDNDKTSPSEPGSSIGAAIAASLT 348
+ V F + + T ++W GS + D+T+ +E + IGAA+A T
Sbjct: 271 LIEV-----FHHTRWNPVGTGDEVWQSFAADGSLPNY-VDETAAAE-NTRIGAALAVRFT 323
Query: 349 IPSGSTRSVTFSLAWDCPEVKFFEKV-YHRRYTKFYGTLGDSAARIAHDAILEHAKWECE 407
+ G V F LAWD P +F V Y+RRYT F+G G +A IA ++ E+ W +
Sbjct: 324 LQPGECLEVPFVLAWDFPVTEFAAGVNYYRRYTDFFGRSGQNAEAIAFLSLQEYQNWRSQ 383
Query: 408 IEAWQRPILEDKRFPEWYPITLFNELYYLNAGGTIWTDGSPPMQSLATIRERKFSLDTPS 467
I+ WQ+PIL + P W+ + LFNELY L +GGT+W+ S LD
Sbjct: 384 IQNWQKPILVREDLPPWFKMALFNELYDLTSGGTLWSAAS--------------ELDP-- 427
Query: 468 KIHTPASSDTALGTRLLENGEENIGQFLYLEGAEYVMYNTYDVHFYSSFALVMLFPKLEL 527
GQF LE +Y Y + DV Y SFAL+MLFP+LE
Sbjct: 428 -----------------------FGQFAVLECLDYRWYESLDVRLYGSFALLMLFPELEK 464
Query: 528 SIQRDFAAAVMMHDPGT------MKIMSDGKWVARKCLGAVPHDIGLDDP--WFEINSYN 579
S+ R FA A+ D I +D RK GA PHD+G + W + N
Sbjct: 465 SVIRAFARAIPQSDDHQRIIGYYYTIGADTTTAVRKIAGATPHDLGAPNEHVWEKTNYTC 524
Query: 580 LFNSSRWKDLNSKFVLQVYRDFVATG--DKNFARAVWPSVYIAMAYMEQFDKDGDGMIEN 637
+ + WKDL S FVLQVYRDF+ TG D F W ++ + Y++ FDKDGDG+ EN
Sbjct: 525 YQDCNLWKDLGSDFVLQVYRDFLFTGADDVEFLADCWDAIVQTLDYLKGFDKDGDGIPEN 584
Query: 638 EGFPDQTYDAWSANGVSAYCGGLWVAALQAASALANDVGDHASASYFWVRYQKAKAVY-- 695
G PDQT+D W GVSAYCGGLW+AAL+ A A+ + + + + + +++Y
Sbjct: 585 SGAPDQTFDDWRLQGVSAYCGGLWLAALECAIAICDILLTNRQDAKNAKNLETQRSIYGA 644
Query: 696 ----------DSLWNGSYFNYDNSDGSSSTSIQADQLAGQWYARACGLLPIADEAKVKKA 745
+ LWNG Y+ D+ G+ + ADQL GQ+YAR GL I + A
Sbjct: 645 WLAQSLTVYEEKLWNGQYYRLDSESGTE--VVMADQLCGQFYARLLGLPDIVKCDRALSA 702
Query: 746 LTKIYDFNVLKVKGGMCGAMNGMQPDGRIDM-SGLQAREIWPGVTYGLAASMIQEEMVDM 804
L +YD LK G GA NG++PDG + + E+W G+ +GLAA ++Q M D
Sbjct: 703 LQTVYDACFLKFFDGQFGAANGVRPDGVPENPNATHPLEVWTGINFGLAAFLVQMGMQDE 762
Query: 805 AFQTAAGVYEVAWSEDGLGYSFQTPESWNNNDEYRSLCYMRPLTIWAM 852
A + V + G F+TPE+ +R+ Y+RP+ IWA+
Sbjct: 763 ALRLTQAVVSQIYDN---GLQFRTPEAITTTGTFRASTYLRPMAIWAI 807
>gi|427725201|ref|YP_007072478.1| glucosylceramidase [Leptolyngbya sp. PCC 7376]
gi|427356921|gb|AFY39644.1| Glucosylceramidase [Leptolyngbya sp. PCC 7376]
Length = 807
Score = 390 bits (1003), Expect = e-105, Method: Compositional matrix adjust.
Identities = 267/829 (32%), Positives = 389/829 (46%), Gaps = 120/829 (14%)
Query: 87 QGIPLGGIGAGSIGRSYRGEFQRFKLFHGICDDAPVLANQFSVFVSRPNGEKFSSVLCPR 146
G+PLGG GAG IGRS RG+F + L G A QF V+ GE + +
Sbjct: 41 HGMPLGGFGAGCIGRSPRGDFNLWHLDAGEHTFRNAGACQFGVYEYVEGGEAQAYAMATE 100
Query: 147 SPGVPKKNTDSGIESWDWNLKGENCTYHALFPRAWTVYDGEPDPELRIVCRQISPFIPHN 206
P D +ESWDW E TY AL+PR+W Y + EL C Q SP + +
Sbjct: 101 PP------DDGTLESWDW-YPTEKGTYSALYPRSWYEYKDVFNVEL--TCEQFSPIVAES 151
Query: 207 YKESSFPASVFTFTLSNSGQTSADVTLLFTWANSVA-------------GDSGLSGHHFN 253
Y+E+S+P ++F ++ N V+++ +W N V D G + +
Sbjct: 152 YQETSYPTAIFEWSAHNPTDKPITVSIMMSWQNMVGWFTKATKTPKVELRDDGSPVYEYQ 211
Query: 254 SKTMTKDG---------------VHGLTLHHRTANGRPPVTFAVAAEETADVHVSECPCF 298
S+ G + + H + + G + FA T +V F
Sbjct: 212 SRWGDSTGNLNQWIQDRYRVGCLLTRVRPHDQISEGEGQICFATVTNPTLEV-------F 264
Query: 299 LLSGNSKGITAKDMWNEIKKHGSFDHLDNDKTSPSEPGSSIGAAIAASLTIPSGSTRSVT 358
+ + +D+WN +GS ++++ +P+ PG I AIA T+ G T+ +
Sbjct: 265 YHNRWNPSGDGRDVWNYFAGNGSLPDVEDE--TPASPGEQIAGAIAVRFTVRPGRTKKIP 322
Query: 359 FSLAWDCPEVKFFEKV-YHRRYTKFYGTLGDSAARIAHDAILEHAKWECEIEAWQRPILE 417
F LAWD P ++F + Y RRYT F+G G + I A+ W+ IE WQ+P L
Sbjct: 323 FFLAWDFPVMEFAPNITYFRRYTDFFGRSGHNVWSIIRTAMKHSDLWKERIEEWQKPFLS 382
Query: 418 DKRFPEWYPITLFNELYYLNAGGTIWTDGSPPMQSLATIRERKFSLDTPSKIHTPASSDT 477
P+W+ ++L NELY L GG++WT +
Sbjct: 383 RADLPDWFKMSLCNELYLLAQGGSLWTAATE----------------------------- 413
Query: 478 ALGTRLLENGEENIGQFLYLEGAEYVMYNTYDVHFYSSFALVMLFPKLELSIQRDFAAAV 537
++ +GQF LE +Y Y + DV Y SFAL ML+PKL+ S+ FA A+
Sbjct: 414 ----------DDPVGQFGVLECLDYSWYESLDVRMYGSFALAMLWPKLDKSVLEAFARAI 463
Query: 538 MMHDP-----GTMKIMS-DGKWVARKCLGAVPHDIGL--DDPWFEINSYNLFNSSRWKDL 589
D G K+M D ARK GA PHD+G + PW N + + WKDL
Sbjct: 464 PSSDDTQRAIGYNKLMGYDNFMAARKRAGATPHDLGAPNEHPWEATNYTAYQDCNLWKDL 523
Query: 590 NSKFVLQVYRDFVATGDKN--FARAVWPSVYIAMAYMEQFDKDGDGMIENEGFPDQTYDA 647
+ FVLQVYRDFV TG K+ F + W V A+ Y++ FD D DG+ EN G PDQT+D
Sbjct: 524 GADFVLQVYRDFVMTGSKDTEFLWSCWEGVVQALDYLKTFDLDCDGIPENGGSPDQTFDD 583
Query: 648 WSANGVSAYCGGLW----------VAALQAASALANDVGDHASASYF---WVRYQ----K 690
W G+SAYCGGLW L+ +++ + + S F +YQ +
Sbjct: 584 WKLKGISAYCGGLWIAALEAAIAIANILKNKASIYEEHQNQVSTEEFEAAIAKYQDWLKQ 643
Query: 691 AKAVY-DSLWNGSYFNYDNSDGSSSTSIQADQLAGQWYARACGLLPIADEAKVKKALTKI 749
++ +Y D+LWNGSY+ D GS S + +DQL G++Y + L + E + L I
Sbjct: 644 SRPLYQDTLWNGSYYTLDT--GSDSKVVMSDQLCGEFYTQLLNLPNVNPENRTAITLQTI 701
Query: 750 YDFNVLKVKGGMCGAMNGMQPDGRIDMSG-LQAREIWPGVTYGLAASMIQEEMVDMAFQT 808
Y+ LK G GA NG+ PDG + E+W G+ +G+AA MI+ M D A +
Sbjct: 702 YEACFLKFHDGQFGAANGLNPDGSPEKENDTHPLEVWTGINFGIAAFMIRNGMKDEALKM 761
Query: 809 AAGVYEVAWSEDGLGYSFQTPESWNNNDEYRSLCYMRPLTIWAMQWALT 857
V +S G F+TPE+ N+ +R+ Y+R + IWA+ ALT
Sbjct: 762 TEAVIHQVYSN---GLQFRTPEAITANNTFRACHYLRAMGIWAIYDALT 807
>gi|414077215|ref|YP_006996533.1| beta-glucosidase [Anabaena sp. 90]
gi|413970631|gb|AFW94720.1| putative beta-glucosidase [Anabaena sp. 90]
Length = 795
Score = 390 bits (1002), Expect = e-105, Method: Compositional matrix adjust.
Identities = 274/818 (33%), Positives = 398/818 (48%), Gaps = 117/818 (14%)
Query: 87 QGIPLGGIGAGSIGRSYRGEFQRFKLFHGICDDAPVLANQFSVFVSRPNGEKFSSVLCPR 146
G+PLGG GAG IGRS RG+F + + G V A QFS+F S G+ ++ P
Sbjct: 38 HGMPLGGFGAGCIGRSSRGDFNLWHIDGGEHTFKTVPACQFSIFES---GQAYALSTQP- 93
Query: 147 SPGVPKKNTDSGIESWDWNLKGE---NCTYHALFPRAWTVYDGEPDPELRIVCRQISPFI 203
P+ +T ++SW W + + + TYHAL+PR+W VY+ +L C Q SP
Sbjct: 94 ----PEDHT---LQSWQWYPQSQEQNSGTYHALYPRSWFVYENVLKTQL--TCEQFSPIW 144
Query: 204 PHNYKESSFPASVFTFTLSNSGQTSADVTLLFTWANSVA-------------GDSGLSGH 250
NY+ESS+P +VF + N ++++ TW N V D G +
Sbjct: 145 ADNYQESSYPVAVFLWKAHNPTNAPITISIMLTWENMVGWFTNTLKSPEVRVRDDGSPVY 204
Query: 251 HF---------NSKTMTKD-----GVHGLTLHHRTANGRPPVTFAVAAEETADVHVSECP 296
+ N + +D V G H+T T+ +A + V +
Sbjct: 205 EYLPRWGESQGNYNCLAEDHQYFGCVLGQVSDHQTVQ-EGDGTWCIATAKNPQVEIFYHS 263
Query: 297 CFLLSGNSKGITAKDMWNEIKKHGSFDHLDNDKTSPSEPGSSIGAAIAASLTIPSGSTRS 356
F GN +D+W +GS + ++ +P++ + +GAAIA T+ G
Sbjct: 264 RFNPVGN-----GEDVWQSFSANGSLPNYIDE--TPADENTRLGAAIAVRFTLQPGENLE 316
Query: 357 VTFSLAWDCPEVKFFEKV-YHRRYTKFYGTLGDSAARIAHDAILEHAKWECEIEAWQRPI 415
V L+WD P +F V Y+RRYT F+G G++A +IA A+ E+ W I+ WQ PI
Sbjct: 317 VPLVLSWDFPVTEFAAGVNYYRRYTDFFGCNGENAWQIATTALKEYQNWRSLIQDWQNPI 376
Query: 416 LEDKRFPEWYPITLFNELYYLNAGGTIWTDGSPPMQSLATIRERKFSLDTPSKIHTPASS 475
L P+W+ + LFNELY L +GGT+W+ A+S
Sbjct: 377 LAKSDLPDWFKMALFNELYDLTSGGTLWS----------------------------AAS 408
Query: 476 DTALGTRLLENGEENIGQFLYLEGAEYVMYNTYDVHFYSSFALVMLFPKLELSIQRDFAA 535
+ + +GQF LE +Y Y + DV Y SF L+MLFP+LE S+ R FA
Sbjct: 409 EI-----------DPVGQFAVLECLDYRWYESLDVRLYGSFGLLMLFPELEKSVIRAFAR 457
Query: 536 AVMMHDPGT------MKIMSDGKWVARKCLGAVPHDIGLDDP--WFEINSYNLFNSSRWK 587
A+ D T + I S+ RK GA PHD+G + W + N + + + WK
Sbjct: 458 AIPQSDDRTRIIGYYLTIKSESPNAVRKIAGATPHDLGAPNEHVWEKTNYTSYQDCNLWK 517
Query: 588 DLNSKFVLQVYRDFVATG--DKNFARAVWPSVYIAMAYMEQFDKDGDGMIENEGFPDQTY 645
DL FVLQVYRDF+ TG D F W ++ + Y++ FD DGD + EN G PDQT+
Sbjct: 518 DLGCDFVLQVYRDFLFTGANDLQFLADCWDAIVQTLDYVKTFDLDGDSIPENSGAPDQTF 577
Query: 646 DAWSANGVSAYCGGLWVAALQAASAL---------ANDVGDHASASYFWVRYQKAKAVY- 695
D W GVSAYCGGLW+AAL+AA A+ A D S + ++K +Y
Sbjct: 578 DDWRLQGVSAYCGGLWLAALEAAIAISKILTNRRGAEDAEVEVQGSIYQGWLSQSKPIYQ 637
Query: 696 DSLWNGSYFNYDNSDGSSSTSIQADQLAGQWYARACGLLPIADEAKVKKALTKIYDFNVL 755
+ LWNG Y+ D+ GS + ADQL GQ+YAR L I + ALTK+YD L
Sbjct: 638 EKLWNGEYYRLDSDSGSD--VVMADQLCGQFYARLLSLPDIVPSDRALSALTKVYDSCFL 695
Query: 756 KVKGGMCGAMNGMQPDGRIDM-SGLQAREIWPGVTYGLAASMIQEEMVDMAFQTAAGVYE 814
K + G GA NG+ P+G + + E+W G+ +GLAA ++Q M + A + V
Sbjct: 696 KFQNGEFGAANGVLPNGLPENPNSTHPLEVWTGINFGLAAFLVQMNMKNEAMRLTEAVVR 755
Query: 815 VAWSEDGLGYSFQTPESWNNNDEYRSLCYMRPLTIWAM 852
+ G F+TPE+ N +R+ Y+R + IW +
Sbjct: 756 QIYDN---GLQFRTPEAITANGTFRASTYLRAMAIWGI 790
>gi|186681980|ref|YP_001865176.1| hypothetical protein Npun_R1544 [Nostoc punctiforme PCC 73102]
gi|186464432|gb|ACC80233.1| protein of unknown function DUF608 [Nostoc punctiforme PCC 73102]
Length = 801
Score = 388 bits (996), Expect = e-104, Method: Compositional matrix adjust.
Identities = 275/818 (33%), Positives = 396/818 (48%), Gaps = 113/818 (13%)
Query: 87 QGIPLGGIGAGSIGRSYRGEFQRFKLFHGICDDAPVLANQFSVFVSRPNGEKFSSVLCPR 146
G+PLGG GAG IGRS RG+F + + G V A QFSVF S + + L +
Sbjct: 40 HGMPLGGFGAGCIGRSSRGDFNLWHIDGGEHTFKNVPACQFSVFESNGTSTQ-AYALSTQ 98
Query: 147 SPGVPKKNTDSGIESWDW--NLKGENCT--YHALFPRAWTVYDGEPDPELRIVCRQISPF 202
P D +++W W L T YHAL+PR+W VY E + ++ C Q SP
Sbjct: 99 GP------NDGALKAWQWYPTLPSTEGTGNYHALYPRSWFVY--ENIFQAQLTCEQFSPV 150
Query: 203 IPHNYKESSFPASVFTFTLSNSGQTSADVTLLFTWANSVA-------------GDSGLSG 249
NY+E+S+P ++F + N ++++ TW N V D G
Sbjct: 151 WAGNYQETSYPVAIFLWNAHNPTNAPITLSIMLTWQNMVGWFTNALKSPEVRVRDDGSPV 210
Query: 250 HHFNSKTMTKDGVHGLTLHHRTANG------------RPPVTFAVAAEETADVHVSECPC 297
+ + + G + + + G T+ +A + V +
Sbjct: 211 YEYQPRWGESQGNYNQIVENPQQFGCLLSRVGSDALQEGDGTWCIATLKHPQVEI----F 266
Query: 298 FLLSGNSKGITAKDMWNEIKKHGSF-DHLDNDKTSPSEPGSSIGAAIAASLTIPSGSTRS 356
+ N++G T ++W K GS +++D +P G +GAA+A T+ G T
Sbjct: 267 YHTRWNAEG-TGDEVWQSFAKDGSLSNYID---ANPVGEGEQLGAAMALRFTLQPGETLE 322
Query: 357 VTFSLAWDCPEVKFFEKV-YHRRYTKFYGTLGDSAARIAHDAILEHAKWECEIEAWQRPI 415
+ F LAWD P +F V Y+RRYT F+ G++A IA A+ E+ W +I+ WQ+PI
Sbjct: 323 IPFVLAWDLPITEFAAGVNYYRRYTDFFDRSGNNAWAIASTALQEYQTWRSQIQTWQQPI 382
Query: 416 LEDKRFPEWYPITLFNELYYLNAGGTIWTDGSPPMQSLATIRERKFSLDTPSKIHTPASS 475
L+ + P W+ + LFNELY L +GGT+W+ S LD
Sbjct: 383 LDREDLPNWFKMALFNELYDLTSGGTLWSAAS--------------ELDP---------- 418
Query: 476 DTALGTRLLENGEENIGQFLYLEGAEYVMYNTYDVHFYSSFALVMLFPKLELSIQRDFAA 535
IGQF LE +Y Y + DV Y SFAL+MLFP+LE S+ R FA
Sbjct: 419 ---------------IGQFAVLECLDYRWYESLDVRLYGSFALLMLFPELEKSVMRAFAR 463
Query: 536 AVMMHDPGT------MKIMSDGKWVARKCLGAVPHDIGLDDP--WFEINSYNLFNSSRWK 587
A+ D M I ++ RK GA PHD+G + W + N + + + WK
Sbjct: 464 AIPQGDDTPRIIGYYMTIKAESPIAVRKVAGATPHDLGAPNEHVWEKTNYTSYQDCNLWK 523
Query: 588 DLNSKFVLQVYRDFVATG--DKNFARAVWPSVYIAMAYMEQFDKDGDGMIENEGFPDQTY 645
DL S FVLQVYRDF+ TG D F W ++ + Y++ FD DGDG+ EN G PDQT+
Sbjct: 524 DLGSDFVLQVYRDFLLTGADDVEFLADCWNAIVQTLDYLKTFDLDGDGIPENSGAPDQTF 583
Query: 646 DAWSANGVSAYCGGLWVAALQAASA-----LANDVGD----HASASYFWVRYQKAKAVY- 695
D W GVSAYCGGLW+AAL+AA A L + +GD S + Q++ +Y
Sbjct: 584 DDWRLQGVSAYCGGLWLAALEAAIAISDLLLTHKLGDLGGLAVQKSIYETWLQQSLPIYQ 643
Query: 696 DSLWNGSYFNYDNSDGSSSTSIQADQLAGQWYARACGLLPIADEAKVKKALTKIYDFNVL 755
+ LWNG Y+ D+ GS + ADQL GQ+YAR L I + AL +YD L
Sbjct: 644 EKLWNGQYYQLDSQSGSD--VVMADQLCGQFYARLLDLPDIVPSDRALSALKTVYDACFL 701
Query: 756 KVKGGMCGAMNGMQPDGRIDM-SGLQAREIWPGVTYGLAASMIQEEMVDMAFQTAAGVYE 814
K G G NG++PDG + E+W G+ +GLAA ++Q M D A + V +
Sbjct: 702 KFCNGEFGTANGVRPDGSPENPKATHPLEVWTGINFGLAAFLVQMGMKDEALRLTQAVVQ 761
Query: 815 VAWSEDGLGYSFQTPESWNNNDEYRSLCYMRPLTIWAM 852
+ E+GL F+TPE+ +R+ Y+R + IW +
Sbjct: 762 QIY-ENGL--QFRTPEAITAAGTFRASTYLRAMAIWGI 796
>gi|427709805|ref|YP_007052182.1| glucosylceramidase [Nostoc sp. PCC 7107]
gi|427362310|gb|AFY45032.1| Glucosylceramidase [Nostoc sp. PCC 7107]
Length = 802
Score = 387 bits (995), Expect = e-104, Method: Compositional matrix adjust.
Identities = 270/819 (32%), Positives = 392/819 (47%), Gaps = 103/819 (12%)
Query: 87 QGIPLGGIGAGSIGRSYRGEFQRFKLFHGICDDAPVLANQFSVFVSRPNGEKFSSVLCPR 146
G+PLGG GAG IGRS RG+F + + G V A QFSVF S + ++
Sbjct: 40 HGMPLGGFGAGCIGRSSRGDFNLWHIDGGEHIFQNVPACQFSVFESYGTSSQAYAL---- 95
Query: 147 SPGVPKKNTDSGIESWDW----NLKGENCTYHALFPRAWTVYDGEPDPELRIVCRQISPF 202
+ D + +W W + TYHAL+PR+W VY+ +L C Q SP
Sbjct: 96 ---STQSLEDGSLNAWQWYPASSESTSTGTYHALYPRSWFVYENVFQSQL--TCEQFSPI 150
Query: 203 IPHNYKESSFPASVFTFTLSNSGQTSADVTLLFTWANSVA-------------GDSGLSG 249
+NY+E+S+P +VF + N ++++ TW N V D G
Sbjct: 151 WANNYQETSYPVAVFLWKAHNPSNAPITLSIMLTWENMVGWFTNALKSPEVKIRDDGSPV 210
Query: 250 HHFNSKTMTKDGVHGLTLHHRTANG--------RPPVTFAVAAEETADVHVSECPCFLLS 301
+ + + G + + + G PV + A V + F S
Sbjct: 211 YEYQPRWGESQGNYNQVVENSEYFGCFLGRVGITAPVQEGDGSWCIATVKHPQVEIFHHS 270
Query: 302 GNSKGITAKDMWNEIKKHGSF-DHLDNDKTSPSEPGSSIGAAIAASLTIPSGSTRSVTFS 360
T ++W GS ++LD T+P +G+AIA + G T + F
Sbjct: 271 KWHPEGTGDEVWQSFAADGSLPNYLD---TNPVGENQRLGSAIAVRFNVQPGETLEIPFV 327
Query: 361 LAWDCPEVKFFEKV-YHRRYTKFYGTLGDSAARIAHDAILEHAKWECEIEAWQRPILEDK 419
LAWDCP +F V Y+RRYT F+G G+ A IA A+ + W +I++WQ PIL +
Sbjct: 328 LAWDCPVTEFAAGVNYYRRYTDFFGKDGNQAWEIATTALGNYQTWRSQIQSWQAPILNRE 387
Query: 420 RFPEWYPITLFNELYYLNAGGTIWTDGSPPMQSLATIRERKFSLDTPSKIHTPASSDTAL 479
P W+ + LFNELY L +GGT+W+ A+S+
Sbjct: 388 DLPAWFKMALFNELYDLTSGGTLWS----------------------------AASEI-- 417
Query: 480 GTRLLENGEENIGQFLYLEGAEYVMYNTYDVHFYSSFALVMLFPKLELSIQRDFAAAVMM 539
+ IGQF LE +Y Y + DV Y SF L++LFP+LE S+ R FA +
Sbjct: 418 ---------DPIGQFAVLECLDYRWYESLDVRLYGSFGLLLLFPELEKSVMRAFARGIPQ 468
Query: 540 HDPGT------MKIMSDGKWVARKCLGAVPHDIGLDDP--WFEINSYNLFNSSRWKDLNS 591
D I +D ARK GA PHD+G + W + N + + WKDL+
Sbjct: 469 SDDHQRIIGYYYTIGADTTTAARKVAGATPHDLGAPNEHVWEKTNYTCYQDCNLWKDLSC 528
Query: 592 KFVLQVYRDFVATG--DKNFARAVWPSVYIAMAYMEQFDKDGDGMIENEGFPDQTYDAWS 649
FVLQVYRD++ TG D F W +V + Y++ FDKDGDG+ EN G PDQT+D W
Sbjct: 529 DFVLQVYRDYLLTGADDVEFLADCWDAVVQTLDYLKTFDKDGDGIPENSGAPDQTFDDWR 588
Query: 650 ANGVSAYCGGLWVAALQAASALAND-VGDH-------ASASYFWVRYQKAKAVY-DSLWN 700
GVSAYCGGLW+AAL+AA A+++ + +H S + V +++ +Y + LWN
Sbjct: 589 LLGVSAYCGGLWLAALEAAIAISDILITNHKVVESAEKQKSIYEVWLAQSRPIYQEKLWN 648
Query: 701 GSYFNYDNSDGSSSTSIQADQLAGQWYARACGLLPIADEAKVKKALTKIYDFNVLKVKGG 760
G Y+ D+ GS+ + ADQL GQ+YAR L I + AL +YD LK + G
Sbjct: 649 GQYYRLDSESGSA--VVMADQLCGQFYARLLELPDIVPSDRALSALKTVYDACFLKFQNG 706
Query: 761 MCGAMNGMQPDGRIDM-SGLQAREIWPGVTYGLAASMIQEEMVDMAFQTAAGVYEVAWSE 819
GA NG+ PDG + E+W G+ +GLAA ++Q M D A + V E ++
Sbjct: 707 EFGAANGVLPDGSPENPKATHPLEVWTGINFGLAAFLVQMGMQDEALRLTQAVVEQVYNN 766
Query: 820 DGLGYSFQTPESWNNNDEYRSLCYMRPLTIWAMQWALTK 858
G F+TPE+ +R+ Y+R + IW + ++K
Sbjct: 767 ---GLQFRTPEAITATGTFRASTYLRAMAIWGIYLVMSK 802
>gi|256082537|ref|XP_002577511.1| bile acid beta-glucosidase-related [Schistosoma mansoni]
Length = 892
Score = 386 bits (991), Expect = e-104, Method: Compositional matrix adjust.
Identities = 283/852 (33%), Positives = 420/852 (49%), Gaps = 87/852 (10%)
Query: 49 FRHLAHIGFRLYRYSKQEEAKGKVPVFNVFRKHHITSDQGIPLGGIGAGSIGRSYRGEFQ 108
F L + + + Y K+ + ++P + G+P+GGIG+GSIGR YRGEF
Sbjct: 63 FSKLRAVFSKRFFYIKKRFIQHRLPFIDAVSHVPWRPIYGVPIGGIGSGSIGRGYRGEFC 122
Query: 109 RFKLFHGIC--DDAPVLANQFSVFVSRPNGEKFSSVLCP--RSPGVPKKNTDSGIESWDW 164
R L G+ D PV +QF V V + + VL P + P KK ++ W W
Sbjct: 123 RSSLIPGMYSYDVQPV--DQFIVTVRKNGVTVYHQVLSPLTKPPSNAKK-----LKKWKW 175
Query: 165 NLKGENCTYHALFPRAWTVYDGEPDPELRIVCRQISPFIPHNYKESSFPASVFTFTLSNS 224
E Y L+PR+WTVY+ P+ +L +V +QISP IPH+Y+ + P +VF + + +
Sbjct: 176 GFDPEKGHYLGLYPRSWTVYE-IPELQLVLVYQQISPVIPHDYQTTCLPVAVFYWKVISW 234
Query: 225 GQTSADVTLLFTWANSVAGDSGLSGHHFNSKTMTKDGVHGLTLHHRTA-NGRPPVTFAVA 283
+ + VT+ F+W S S+ + H TA N P +
Sbjct: 235 NKENLKVTITFSWHGPNHRKRTKSSDFSKSEDGNNSSENTTNTHVSTAENAFPTCSQRST 294
Query: 284 AEETADVHVSECPCFLLSGNSKGITAKDMWNEIKKHGSFDHLDNDKTSPSEPGSSIGAAI 343
A T D L N +G K ++ G S ++ + A+
Sbjct: 295 AFSTGD---------FLKLNCQGKKLKTF-TDLSWLGWLVGYTVPLDSKAKKSPKLAIAV 344
Query: 344 AASLTIP--------SGSTRSVTFSLAWDCPEVKFF--EKVYHRRYTKFYGTLGDSAARI 393
+A+ +P + S + F++ W P VKF + VY RRY +++ S A++
Sbjct: 345 SATTNVPRCNITDNLNPSQSELEFAITWHSPIVKFRTGDVVYTRRYVRWFPVDRISGAKL 404
Query: 394 --AHDAILEHAKWECEIEAWQRPILEDKRFPEWYPITLFNELYYLNAGGTIWTDGSPPMQ 451
AH AI +W +IE WQ PIL ++ P WY LFNELYYL+ GGT+W D P+Q
Sbjct: 405 LLAH-AIDNWRQWVQKIEDWQNPILSNQSLPSWYKSALFNELYYLSDGGTVWLD---PIQ 460
Query: 452 SLATIRERKFSLDTPSKIHTPASSDTALGTRLLENGEENIGQFLYLEGAEYVMYNTYDVH 511
T + S D +++ IG F YLEG EY MYNTYDVH
Sbjct: 461 VDLTEDMKVDSWDHRARL------------------SREIGLFGYLEGHEYRMYNTYDVH 502
Query: 512 FYSSFALVMLFPKLELSIQRDFAAAVMMHDPGTMKIMSDGKWVARKCLGAVPHDIG--LD 569
Y+S+AL+ L+PK++L++ D A + D ++ + +GK + R AV HD G D
Sbjct: 503 HYASWALIKLWPKIQLALNYDCADLTIAEDSTSVYYIGNGKALFRSSECAVVHDFGDPED 562
Query: 570 DPWFEINSYNLFNSSRWKDLNSKFVLQVYRDFVATGDKNFARAVWPSVY-IAMAYMEQFD 628
+PW N+Y +F + WKDLNSKF+LQV+RD+ T D + + P V I + +D
Sbjct: 563 EPWRCTNAYIMFPTDTWKDLNSKFILQVWRDWRITQDHQYLLYMLPIVLRILRKSLVAWD 622
Query: 629 KDGDGMIENEGFPDQTYDAWSANGVSAYCGGLWVAALQAA---------------SALAN 673
D DG+IEN GFPDQTYD W+A G++AY GG+W++ L A + N
Sbjct: 623 SDDDGLIENSGFPDQTYDVWTAKGLTAYTGGMWLSCLYATFDMLSWCLKSDSPVYDQIVN 682
Query: 674 DVGDHASASYFWVR------YQKAKAVYDS-LWNGSYFNYDNSDGSSSTSIQADQLAGQW 726
+ D S+ ++ + KA+ Y++ LW GSY+ + I ADQL+G W
Sbjct: 683 NTDD-TQRSWSEIKDEVQTLFTKARDSYNAKLWTGSYYAFQTYCTPRREVIMADQLSGYW 741
Query: 727 YARACGLLP--IADEAKVKKALTKIYDFNVLKVKGGMCGAMNGMQPDGRIDMSGLQAREI 784
++R G+ P I + V K L I + N +K G GA+NG P + D+S +QA E
Sbjct: 742 FSRITGVPPNLILPKNHVVKTLQTISNCNWHGIKNGTIGAINGCLPVCKPDLSSVQAEEF 801
Query: 785 WPGVTYGLAASMIQEEMVDMAFQTAAGVYEVAWSEDGLGYSFQTPESWNNNDEYRSLCYM 844
W V Y L++ MI E M+D Y+ ++ GL Y QTPE++ ++ +R YM
Sbjct: 802 WVAVNYSLSSLMIAEGMIDEGLALGEKCYDTIYNLYGLQY--QTPEAYMSDGRFRCPGYM 859
Query: 845 RPLTIWAMQWAL 856
R L IW++Q AL
Sbjct: 860 RALAIWSIQQAL 871
>gi|428317101|ref|YP_007114983.1| Glucosylceramidase [Oscillatoria nigro-viridis PCC 7112]
gi|428240781|gb|AFZ06567.1| Glucosylceramidase [Oscillatoria nigro-viridis PCC 7112]
Length = 809
Score = 385 bits (989), Expect = e-104, Method: Compositional matrix adjust.
Identities = 280/829 (33%), Positives = 395/829 (47%), Gaps = 115/829 (13%)
Query: 87 QGIPLGGIGAGSIGRSYRGEFQRFKLFHGICDDAPVLANQFSVFVSRPNGEKFSSVLCPR 146
G+PLGG GAG IGRS RG+F + + G + A QFSVF G+K + L
Sbjct: 41 HGMPLGGFGAGCIGRSSRGDFNLWHIDGGEHIFRSLPACQFSVF-EETQGKKQAFALSTE 99
Query: 147 SPGVPKKNTDSGIESWDW-------------NLKGENCTYHALFPRAWTVYDGEPDPELR 193
P TD + SW W + + TYHAL+PR+W VY+ +L
Sbjct: 100 PP------TDGSLSSWKWYPSEGRRKKEEGRREESTSGTYHALYPRSWFVYENVFQAQLS 153
Query: 194 IVCRQISPFIPHNYKESSFPASVFTFTLSNSGQTSADVTLLFTWANSVA----------- 242
C Q SP +P+NY+ESS+P ++F + N ++++ TW N V
Sbjct: 154 --CEQFSPILPNNYQESSYPVAMFEWVAHNPTDAPITLSIMLTWENIVGWFTNANKSPKV 211
Query: 243 --GDSGLSGHHFNSK--------TMTKDGVH--GLTLHHRTANGRPPV---TFAVAAEET 287
D G + + + + + H G + TA+ P +A+A
Sbjct: 212 KVRDDGSPFYDYQPRWGESAGNLNLLVEDFHRIGCVMTRLTADDEPQEGDGQWAIATITN 271
Query: 288 ADVHVSECPCFLLSGNSKGITAKDMWNEIKKHGSFDHLDNDKTSPSEPGSSIGAAIAASL 347
V V + +GN D+W G LD+ + + G + AIA
Sbjct: 272 PAVEVFYQTRWNPAGN-----GDDIWRNFSAEGFL--LDDRDETAAAAGEQLACAIAVRF 324
Query: 348 TIPSGSTRSVTFSLAWDCPEVKFFEKV-YHRRYTKFYGTLGDSAARIAHDAILEHAKWEC 406
TI G TR + F +AWD P +F V Y+RRYT F+G G +A I A+ W
Sbjct: 325 TIRPGKTRKIPFIIAWDLPVTEFASGVWYYRRYTDFFGRNGKNAWPIIRTAMKHSDTWRE 384
Query: 407 EIEAWQRPILEDKRFPEWYPITLFNELYYLNAGGTIWTDGSPPMQSLATIRERKFSLDTP 466
IEAWQ+PILE + + + + LFNELY L GGT+W S A R+ K
Sbjct: 385 NIEAWQKPILERQDLSDSFKMALFNELYALTDGGTLW--------SAADERDPK------ 430
Query: 467 SKIHTPASSDTALGTRLLENGEENIGQFLYLEGAEYVMYNTYDVHFYSSFALVMLFPKLE 526
GQF LE +Y Y + DV Y SFAL+ML+P LE
Sbjct: 431 -------------------------GQFAVLECIDYRWYESLDVRLYGSFALLMLWPDLE 465
Query: 527 LSIQRDFAAAVMMHDPGTMKIMSDGKWVARKCLGAVPHDIGL--DDPWFEINSYNLFNSS 584
S+ FA A+ D I + RK +GA PHD+G + PW + N + + +
Sbjct: 466 KSVVLAFARAISAGDDTPRIIGWNQASAIRKAVGATPHDLGAPNEHPWEKTNYTSYQDCN 525
Query: 585 RWKDLNSKFVLQVYRDFVATG--DKNFARAVWPSVYIAMAYMEQFDKDGDGMIENEGFPD 642
WKDL FVLQVYRDF+ TG D F + WP++ A+AY++ FD+DGDG+ EN G PD
Sbjct: 526 LWKDLPCDFVLQVYRDFLLTGGTDCEFLQECWPAIVEALAYLKTFDEDGDGIPENSGAPD 585
Query: 643 QTYDAWSANGVSAYCGGLWVAALQAASALANDVGDHAS----------ASYFWVRYQKAK 692
QT+D W G+SAYCGGLW+AAL+AA A+ + D A A Y Q
Sbjct: 586 QTFDDWQMRGISAYCGGLWLAALEAAIAIGEILEDAAPEIIPNPQAKIALYHSWLLQARP 645
Query: 693 AVYDSLWNGSYFNYDNSDGSSSTSIQADQLAGQWYARACGLLPIADEAKVKKALTKIYDF 752
+ + LWNG Y+ D+ S+S + ADQL GQ+YA+ GL I AL +Y+
Sbjct: 646 SYQEKLWNGQYYRLDSE--SNSEVVMADQLCGQFYAKLLGLPDIVPPECAVSALETVYES 703
Query: 753 NVLKVKGGMCGAMNGMQPDGRIDM-SGLQAREIWPGVTYGLAASMIQEEMVDMAFQTAAG 811
LK G GA NG+ +G + + E+W G+ +GLAA ++Q M + AF+
Sbjct: 704 CFLKFNDGEFGAANGVMLNGSPENPNATHPLEVWTGINFGLAAFLVQMGMEEKAFKLTDA 763
Query: 812 VYEVAWSEDGLGYSFQTPESWNNNDEYRSLCYMRPLTIWAMQWALTKPK 860
V + + E+GL F+TPE+ +R+ Y+R + IWA+ LT K
Sbjct: 764 VVKQIY-ENGL--QFRTPEAITAGGTFRASHYLRAMAIWAIYGVLTNFK 809
>gi|119510228|ref|ZP_01629365.1| hypothetical protein N9414_13977 [Nodularia spumigena CCY9414]
gi|119465077|gb|EAW45977.1| hypothetical protein N9414_13977 [Nodularia spumigena CCY9414]
Length = 814
Score = 384 bits (987), Expect = e-104, Method: Compositional matrix adjust.
Identities = 273/814 (33%), Positives = 390/814 (47%), Gaps = 105/814 (12%)
Query: 87 QGIPLGGIGAGSIGRSYRGEFQRFKLFHGICDDAPVLANQFSVFVSRPNGEKFSSVLCPR 146
G+PLGG GAG IGRS RG+F + + G + A QFSVF S + + L
Sbjct: 40 HGMPLGGFGAGCIGRSSRGDFNLWHIDGGEHTFQNIPACQFSVFESDGTSAQ-AYALATE 98
Query: 147 SPGVPKKNTDSGIESWDW----NLKGENCTYHALFPRAWTVYDGEPDPELRIVCRQISPF 202
+P + + +W W TYHAL+PR+W VY E + ++ C Q SP
Sbjct: 99 AP------ENGSLSAWKWYPASTATQSTGTYHALYPRSWFVY--ENVLQAQLTCEQFSPI 150
Query: 203 IPHNYKESSFPASVFTFTLSNSGQTSADVTLLFTWANSVA-------------GDSGLSG 249
NY+ESS+P +VF + N ++++ TW N V D G
Sbjct: 151 WAENYQESSYPVAVFNWKAHNPTNAPITLSIMLTWQNMVGWFTNALKSPEVRIRDDGSPV 210
Query: 250 HHFNSKTM-TKDGVHGLTLHHRTANGRPPVTFAVAAEETADVHVSEC------PCFLLSG 302
+ + + ++D + +T T N + V + A+ S C P F +
Sbjct: 211 YEYQPRLGDSQDNYNCIT--ENTENFSCFCSRVVDDDSVAEGDGSWCIATIKHPQFKVFY 268
Query: 303 NS---KGITAKDMWNEIKKHGSFDHLDNDKTSPSEPGSSIGAAIAASLTIPSGSTRSVTF 359
N+ T D+W GS + + +P+ + IG AIA T+ G T ++ F
Sbjct: 269 NTCWNPAGTGADVWETFAADGSLTNYQD--ATPALENTQIGVAIALRFTLQPGETLAIPF 326
Query: 360 SLAWDCPEVKFFEKV-YHRRYTKFYGTLGDSAARIAHDAILEHAKWECEIEAWQRPILED 418
L+WD P +F + Y+RRYT F+G GD A IA A+ ++ W IE WQ+PI++
Sbjct: 327 VLSWDFPVTEFAAGINYYRRYTDFFGRGGDHAWAIASTALTQYQTWYKNIENWQQPIIDR 386
Query: 419 KRFPEWYPITLFNELYYLNAGGTIWTDGSPPMQSLATIRERKFSLDTPSKIHTPASSDTA 478
+ P+W+ + LFNELY L +GGT+W+ S LD
Sbjct: 387 EDLPDWFKMALFNELYDLTSGGTLWSAAS--------------ELDP------------- 419
Query: 479 LGTRLLENGEENIGQFLYLEGAEYVMYNTYDVHFYSSFALVMLFPKLELSIQRDFAAAVM 538
IGQF LE +Y Y + DV Y SF L+ LFP+LE S+ R FA A+
Sbjct: 420 ------------IGQFAVLECLDYRWYESLDVRLYGSFGLLHLFPELEKSVIRAFARAIP 467
Query: 539 MHDPGT------MKIMSDGKWVARKCLGAVPHDIGLDDP--WFEINSYNLFNSSRWKDLN 590
D I +D RK GA PHD+G + W + N + + WKDL
Sbjct: 468 HSDNHQRVIGYYYTIGADTTTAVRKVAGATPHDLGAPNEHVWEKTNYTCYQDCNLWKDLG 527
Query: 591 SKFVLQVYRDFVATG--DKNFARAVWPSVYIAMAYMEQFDKDGDGMIENEGFPDQTYDAW 648
S FVLQVYRDF+ TG D F W + + Y++ FD DGDG+ EN G PDQT+D W
Sbjct: 528 SDFVLQVYRDFLLTGADDVQFLADCWAGIVQTLDYLKSFDLDGDGIPENSGAPDQTFDDW 587
Query: 649 SANGVSAYCGGLWVAALQAASA-----LANDVGDHAS---ASYFWVRYQKAKAVY-DSLW 699
GVSAYCGGLW+AAL+AA A L N G + + V +++ +Y + LW
Sbjct: 588 RLQGVSAYCGGLWLAALEAAIAISDILLTNHRGAEDTEELGQRYRVWLAQSRTIYQEKLW 647
Query: 700 NGSYFNYDNSDGSSSTSIQADQLAGQWYARACGLLPIADEAKVKKALTKIYDFNVLKVKG 759
NG YF D+ GS + ADQL GQ+YAR GL I + AL +Y +K
Sbjct: 648 NGQYFRLDSESGSD--VVMADQLCGQFYARLLGLPDIVPSDRALSALQTVYHACFVKFCN 705
Query: 760 GMCGAMNGMQPDGRIDM-SGLQAREIWPGVTYGLAASMIQEEMVDMAFQTAAGVYEVAWS 818
G GA NG++PDG + + E+W G+ +GLAA ++Q M D A + V + +
Sbjct: 706 GEFGAANGVRPDGSAENPNATHPLEVWTGINFGLAAFLVQMGMQDEALKLTGAVVQQIYH 765
Query: 819 EDGLGYSFQTPESWNNNDEYRSLCYMRPLTIWAM 852
G F+TPE+ + +R+ Y+R + IWA+
Sbjct: 766 N---GLQFRTPEAITASGTFRASTYLRAMAIWAI 796
>gi|353232147|emb|CCD79502.1| bile acid beta-glucosidase-related [Schistosoma mansoni]
Length = 900
Score = 384 bits (987), Expect = e-104, Method: Compositional matrix adjust.
Identities = 284/852 (33%), Positives = 422/852 (49%), Gaps = 79/852 (9%)
Query: 49 FRHLAHIGFRLYRYSKQEEAKGKVPVFNVFRKHHITSDQGIPLGGIGAGSIGRSYRGEFQ 108
F L + + + Y K+ + ++P + G+P+GGIG+GSIGR YRGEF
Sbjct: 63 FSKLRAVFSKRFFYIKKRFIQHRLPFIDAVSHVPWRPIYGVPIGGIGSGSIGRGYRGEFC 122
Query: 109 RFKLFHGIC--DDAPVLANQFSVFVSRPNGEKFSSVLCP--RSPGVPKKNTDSGIESWDW 164
R L G+ D PV +QF V V + + VL P + P KK ++ W W
Sbjct: 123 RSSLIPGMYSYDVQPV--DQFIVTVRKNGVTVYHQVLSPLTKPPSNAKK-----LKKWKW 175
Query: 165 NLKGENCTYHALFPRAWTVYDGEPDPELRIVCRQISPFIPHNYKESSFPASVFTFTLSNS 224
E Y L+PR+WTVY+ P+ +L +V +QISP IPH+Y+ + P +VF + + +
Sbjct: 176 GFDPEKGHYLGLYPRSWTVYE-IPELQLVLVYQQISPVIPHDYQTTCLPVAVFYWKVISW 234
Query: 225 GQTSADVTLLFTWANSVAGDSGLSGHHFNSKTMTKDGVHGLTLHHRTA-NGRPPVTFAVA 283
+ + VT+ F+W S S+ + H TA N P +
Sbjct: 235 NKENLKVTITFSWHGPNHRKRTKSSDFSKSEDGNNSSENTTNTHVSTAENAFPTCSQRST 294
Query: 284 AEETADVHVSECPCFLLSGNSKGITAKDMWNEIKKHGSFDHLDNDKTSPSEPGSSIGAAI 343
A T D L N +G K ++ G S ++ + A+
Sbjct: 295 AFSTGD---------FLKLNCQGKKLKTF-TDLSWLGWLVGYTVPLDSKAKKSPKLAIAV 344
Query: 344 AASLTIP--------SGSTRSVTFSLAWDCPEVKFF--EKVYHRRYTKFYGTLGDSAARI 393
+A+ +P + S + F++ W P VKF + VY RRY +++ S A++
Sbjct: 345 SATTNVPRCNITDNLNPSQSELEFAITWHSPIVKFRTGDVVYTRRYVRWFPVDRISGAKL 404
Query: 394 --AHDAILEHAKWECEIEAWQRPILEDKRFPEWYPITLFNELYYLNAGGTIWTDGSPPMQ 451
AH AI +W +IE WQ PIL ++ P WY LFNELYYL+ GGT+W D P+Q
Sbjct: 405 LLAH-AIDNWRQWVQKIEDWQNPILSNQSLPSWYKSALFNELYYLSDGGTVWLD---PIQ 460
Query: 452 SLATIRERKFSLDTPSKIHTPASSDTALGTRLLENGEENIGQFLYLEGAEYVMYNTYDVH 511
+ + +L K+ + RL IG F YLEG EY MYNTYDVH
Sbjct: 461 -VDCFKSDLLNLTEDMKVDSWDHR-----ARL----SREIGLFGYLEGHEYRMYNTYDVH 510
Query: 512 FYSSFALVMLFPKLELSIQRDFAAAVMMHDPGTMKIMSDGKWVARKCLGAVPHDIG--LD 569
Y+S+AL+ L+PK++L++ D A + D ++ + +GK + R AV HD G D
Sbjct: 511 HYASWALIKLWPKIQLALNYDCADLTIAEDSTSVYYIGNGKALFRSSECAVVHDFGDPED 570
Query: 570 DPWFEINSYNLFNSSRWKDLNSKFVLQVYRDFVATGDKNFARAVWPSVY-IAMAYMEQFD 628
+PW N+Y +F + WKDLNSKF+LQV+RD+ T D + + P V I + +D
Sbjct: 571 EPWRCTNAYIMFPTDTWKDLNSKFILQVWRDWRITQDHQYLLYMLPIVLRILRKSLVAWD 630
Query: 629 KDGDGMIENEGFPDQTYDAWSANGVSAYCGGLWVAALQAA---------------SALAN 673
D DG+IEN GFPDQTYD W+A G++AY GG+W++ L A + N
Sbjct: 631 SDDDGLIENSGFPDQTYDVWTAKGLTAYTGGMWLSCLYATFDMLSWCLKSDSPVYDQIVN 690
Query: 674 DVGDHASASYFWVR------YQKAKAVYDS-LWNGSYFNYDNSDGSSSTSIQADQLAGQW 726
+ D S+ ++ + KA+ Y++ LW GSY+ + I ADQL+G W
Sbjct: 691 NTDD-TQRSWSEIKDEVQTLFTKARDSYNAKLWTGSYYAFQTYCTPRREVIMADQLSGYW 749
Query: 727 YARACGLLP--IADEAKVKKALTKIYDFNVLKVKGGMCGAMNGMQPDGRIDMSGLQAREI 784
++R G+ P I + V K L I + N +K G GA+NG P + D+S +QA E
Sbjct: 750 FSRITGVPPNLILPKNHVVKTLQTISNCNWHGIKNGTIGAINGCLPVCKPDLSSVQAEEF 809
Query: 785 WPGVTYGLAASMIQEEMVDMAFQTAAGVYEVAWSEDGLGYSFQTPESWNNNDEYRSLCYM 844
W V Y L++ MI E M+D Y+ ++ GL Y QTPE++ ++ +R YM
Sbjct: 810 WVAVNYSLSSLMIAEGMIDEGLALGEKCYDTIYNLYGLQY--QTPEAYMSDGRFRCPGYM 867
Query: 845 RPLTIWAMQWAL 856
R L IW++Q AL
Sbjct: 868 RALAIWSIQQAL 879
>gi|428770169|ref|YP_007161959.1| glucosylceramidase [Cyanobacterium aponinum PCC 10605]
gi|428684448|gb|AFZ53915.1| Glucosylceramidase [Cyanobacterium aponinum PCC 10605]
Length = 802
Score = 384 bits (986), Expect = e-103, Method: Compositional matrix adjust.
Identities = 269/824 (32%), Positives = 400/824 (48%), Gaps = 110/824 (13%)
Query: 85 SDQGIPLGGIGAGSIGRSYRGEFQRFKLFHGICDDAPVLANQFSVFVSRPNGEKFSSVLC 144
++ G+PLGG G G +GR GEF + + G + A QFSVF +
Sbjct: 38 ANHGMPLGGFGGGCVGRGVNGEFNLWHIDGGNHIYQSLPACQFSVFEQVEGEPPQVYAMS 97
Query: 145 PRSPGVPKKNTDSGIESWDWNLKGENCTYHALFPRAWTVYDGEPDPELRIVCRQISPFIP 204
SP TD + SW W +G+ TY AL+PR+W Y G + R++C Q SP
Sbjct: 98 TNSP------TDGSLSSWAWYPQGKG-TYSALYPRSWFDYQGVF--KSRLICEQFSPIWA 148
Query: 205 HNYKESSFPASVFTFTLSNSGQTSADVTLLFTWANSVA-------------GDSGLSGHH 251
+Y+ESS+P +VF +++ N ++++FTW NSV D G +
Sbjct: 149 GSYQESSYPLAVFEWSVHNPTDKEITLSIMFTWQNSVGWFTNAIKSPTVKVRDDGSPEYE 208
Query: 252 FNSK---------TMTKDGVHGLTLHHRTANGRPPV----TFAVAAEETADVHVSECPCF 298
+ K +D L R P +A+++ + V +
Sbjct: 209 YRPKWGDSTGNLNQWIQDNFRVGCLFDRIRMEEQPQEGEGQWAISSITNPSLEV----FY 264
Query: 299 LLSGNSKGITAKDMWNEIKKHGSFDHLDNDKTSPSEPGSSIGAAIAASLTIPSGSTRSVT 358
N KG A D+W+ +GS D +P+EPG I A+A T+ G T+ +
Sbjct: 265 HSRWNPKGDGA-DIWDYFAMNGSLP--DYQDETPAEPGEQIAGAMAIRFTVKPGKTKKIP 321
Query: 359 FSLAWDCPEVKFFEKV-YHRRYTKFYGTLGDSAARIAHDAILEHAKWECEIEAWQRPILE 417
F LAWD P +F + + Y+RRYT F+G G +A I A+ H W+ I+ WQ+PIL+
Sbjct: 322 FILAWDFPVTEFAQGITYYRRYTDFFGRNGRNAWAIVRTALKHHDMWQNRIKEWQKPILD 381
Query: 418 DKRFPEWYPITLFNELYYLNAGGTIWTDGSPPMQSLATIRERKFSLDTPSKIHTPASSDT 477
PEW + LFNELY L+ GG++WT +
Sbjct: 382 RGDLPEWLKMALFNELYLLSEGGSLWTAAT------------------------------ 411
Query: 478 ALGTRLLENGEENIGQFLYLEGAEYVMYNTYDVHFYSSFALVMLFPKLELSIQRDFAAAV 537
EN + +GQF LE +Y Y + DV Y SF L+M++P+L+ SI F+ A+
Sbjct: 412 -------EN--DPVGQFGVLECIDYRWYESLDVRLYGSFPLIMMWPRLDKSIMEAFSRAI 462
Query: 538 MMHDPGTMKIMSDGKWVARKCLGAVPHDIGL--DDPWFEINSYNLFNSSRWKDLNSKFVL 595
D I + RK GA PHD+G + PW + N + + + WKDL FVL
Sbjct: 463 PTEDKTPRIIGYNQTQAIRKAKGATPHDLGAPNEHPWEKSNYTSYQDCNLWKDLGCDFVL 522
Query: 596 QVYRDFVATG--DKNFARAVWPSVYIAMAYMEQFDKDGDGMIENEGFPDQTYDAWSANGV 653
VYRD++ TG D++F W SV + Y++ FD DGDG+ EN G PDQT+D W G+
Sbjct: 523 LVYRDYLFTGGLDQDFLWECWDSVVETLHYVKTFDLDGDGIPENSGAPDQTFDDWRLQGI 582
Query: 654 SAYCGGLWVAALQAASALANDV-----------------GDHASASYFWVRYQKAKAVY- 695
SAYCGGLW+A L+AA A+ N + G + + F +A+++Y
Sbjct: 583 SAYCGGLWIAGLEAAIAIGNTLIANPPMNPSLQPDDYPQGIKDAIAVFQQWLTQARSLYH 642
Query: 696 DSLWNGSYFNYDNSDGSSSTSIQADQLAGQWYARACGLLPIADEAKVKKALTKIYDFNVL 755
++LWNG Y+ D+ GS + ADQL GQ+YA+ GL + + VK L K+YD L
Sbjct: 643 ETLWNGEYYRLDSESGSD--VVMADQLCGQFYAQLLGLPDVVESKYVKSTLRKVYDACFL 700
Query: 756 KVKGGMCGAMNGMQPDGR-IDMSGLQAREIWPGVTYGLAASMIQEEMVDMAFQTAAGVYE 814
K G G NG++PDG+ + + E+W G+ YG+ A MI M + A V +
Sbjct: 701 KFHDGKFGIANGVKPDGKPVKENDTHPLEVWTGINYGIVAFMILNGMKQEGLRVAETVVK 760
Query: 815 VAWSEDGLGYSFQTPESWNNNDEYRSLCYMRPLTIWAMQWALTK 858
+ E+GL F+TPE+ +R+ Y+R + +WA+ ++L K
Sbjct: 761 QVY-ENGL--QFRTPEAITAVGTFRASHYLRAMAVWAIYYSLEK 801
>gi|327288418|ref|XP_003228923.1| PREDICTED: non-lysosomal glucosylceramidase-like, partial [Anolis
carolinensis]
Length = 667
Score = 384 bits (985), Expect = e-103, Method: Compositional matrix adjust.
Identities = 205/435 (47%), Positives = 258/435 (59%), Gaps = 28/435 (6%)
Query: 372 EKVYHRRYTKFYGTLGDSAARIAHDAILEHAKWECEIEAWQRPILEDKRFPEWYPITLFN 431
EKV YT+F+G GD+ +AH + + +WE IEAWQRPILED P WY LFN
Sbjct: 221 EKVTMLWYTRFFGREGDACPALAHYMLTHYPEWEKRIEAWQRPILEDSDLPAWYKSALFN 280
Query: 432 ELYYLNAGGTIWTDGSPPMQSLATIRERKFSLDTPSKIHTPASSDTALGTRLLENGEENI 491
ELY+L+ GGT+W + P D PS G L +
Sbjct: 281 ELYFLSDGGTLWVEFQPE--------------DAPSLA-------AGQGLSGLLPVLKEY 319
Query: 492 GQFLYLEGAEYVMYNTYDVHFYSSFALVMLFPKLELSIQRDFAAAVMMHDPGTMKIMSDG 551
G+F YLEG EY MYNTYDVHFY+SFAL ML+PKLELS+Q D A AV+ DP + +G
Sbjct: 320 GRFAYLEGQEYRMYNTYDVHFYASFALAMLWPKLELSLQYDMAVAVLSEDPRPRVYLMNG 379
Query: 552 KWVARKCLGAVPHDIGLDD--PWFEINSYNLFNSSRWKDLNSKFVLQVYRDFVATGDKNF 609
+ K VPHDIG D PW +N+Y + +++ WKDLN KFVLQVYRDF T D +
Sbjct: 380 QTAQVKLRNVVPHDIGEPDEEPWQRVNAYLIHDTADWKDLNLKFVLQVYRDFFLTEDCTY 439
Query: 610 ARAVWPSVYIAMAYMEQFDKDGDGMIENEGFPDQTYDAWSANGVSAYCGGLWVAALQAAS 669
R +WP M +FD DGDG+IEN GF DQTYDAW G SAYCGGLW+AA+
Sbjct: 440 LRDMWPVCQAVMESELKFDTDGDGLIENSGFADQTYDAWVVTGASAYCGGLWLAAVCMMC 499
Query: 670 ALANDVGDHASASYFWVRYQKAKAVYDS-LWNGSYFNYDNSDGSSSTSIQADQLAGQWYA 728
+A +G+ A+ + A ++ LWNG Y+NYD+ G SS SI +DQLAGQW+
Sbjct: 500 RMAEILGEGAALQRYSAILSSGAAAFERLLWNGRYYNYDSGGGPSSNSIMSDQLAGQWFL 559
Query: 729 RACGL----LPIADEAKVKKALTKIYDFNVLKVKGGMCGAMNGMQPDGRIDMSGLQAREI 784
RACGL + + V+ AL IY+ NVL G GA+NGM+PDG D S +Q+ E+
Sbjct: 560 RACGLGEGKYQVFPQEHVQSALRTIYEMNVLGFSEGAMGAVNGMRPDGVPDTSSVQSDEV 619
Query: 785 WPGVTYGLAASMIQE 799
W GV Y LAA+MIQE
Sbjct: 620 WVGVVYSLAATMIQE 634
Score = 132 bits (332), Expect = 9e-28, Method: Compositional matrix adjust.
Identities = 80/231 (34%), Positives = 122/231 (52%), Gaps = 19/231 (8%)
Query: 60 YRYSKQEEAKGKVPVFNVFRKHHITSDQGIPLGGIGAGSIGRSYRGEFQRFKLFHGICDD 119
YR ++ E+ K P ++F + G PLGG G G+I R ++G+F R++L G+
Sbjct: 6 YRKTRLEK---KQPFIDLFNSQPLRQIYGCPLGGFGGGTITRGWKGDFCRWQLNPGLYHY 62
Query: 120 APVLANQFSVFVSRPNGEKFSSVLCPRSPGVPKKNTDSGIESWDWNLKGENCTYHALFPR 179
V+A+QF+V + R + VL P S ++ W+W+ G YHAL+PR
Sbjct: 63 KRVVADQFTVCLRRKGQTAYQQVLSVEQP--------SCLQGWNWSFCGHYAFYHALYPR 114
Query: 180 AWTVYDGEPDPELRIVCRQISPFIPHNYKESSFPASVFTFTLSNSGQTSADVTLLFTWAN 239
AW+VY P +L + CRQ++P IPH+YKE+S P VF + + N +V+++FT+ N
Sbjct: 115 AWSVYQ-LPGQDLVLTCRQVTPIIPHDYKETSLPVGVFVWEVENLRDEEVEVSIMFTFQN 173
Query: 240 SVAGDSGLSGHHFNSKTMTKDGVH---GLTLHH-RTANGRPPVTFAVAAEE 286
G H+N ++G H G+ LHH NG T A+AA E
Sbjct: 174 GTETKEDRRGGHWNEPFALREGGHCVRGVLLHHCLPVNG---YTLAIAARE 221
>gi|334116939|ref|ZP_08491031.1| Glucosylceramidase [Microcoleus vaginatus FGP-2]
gi|333461759|gb|EGK90364.1| Glucosylceramidase [Microcoleus vaginatus FGP-2]
Length = 818
Score = 383 bits (984), Expect = e-103, Method: Compositional matrix adjust.
Identities = 283/837 (33%), Positives = 396/837 (47%), Gaps = 123/837 (14%)
Query: 87 QGIPLGGIGAGSIGRSYRGEFQRFKLFHGICDDAPVLANQFSVFVSRPNGEKFSSVLCPR 146
G+PLGG GAG IGRS RG+F + + G + A QFSVF G+K + L
Sbjct: 42 HGMPLGGFGAGCIGRSSRGDFNLWHIDGGEHIFRSLPACQFSVF-EETQGKKQAFALSTE 100
Query: 147 SPGVPKKNTDSGIESWDW---------------------NLKGENCTYHALFPRAWTVYD 185
P TD + SW W + +YHAL+PR+W VY+
Sbjct: 101 PP------TDGSLSSWKWYPCEGRRKKEEGRREEEERTGEESNSSGSYHALYPRSWFVYE 154
Query: 186 GEPDPELRIVCRQISPFIPHNYKESSFPASVFTFTLSNSGQTSADVTLLFTWANSVA--- 242
+L C Q SP +P+NY+ESS+P ++F + N ++++ TW N V
Sbjct: 155 NVFHAQLS--CEQFSPILPNNYQESSYPVAMFEWVAHNPTDAPITLSIMLTWENIVGWFT 212
Query: 243 ----------GDSGLSGHHFNSK--------TMTKDGVH--GLTLHHRTANGRPPV---T 279
D G + + + + + H G + TA P
Sbjct: 213 NANKSPKVMVRDDGSPFYDYQPRWGESAGNLNLLVEDFHRIGCVMTRLTAEDEPQEGDGQ 272
Query: 280 FAVAAEETADVHVSECPCFLLSGNSKGITAKDMWNEIKKHGSFDHLDNDKTSPSEPGSSI 339
+A+A V V + +GN +D+W G F DND+T+ + G +
Sbjct: 273 WAIATITNPAVEVFYQTRWNPAGN-----GEDIWRNFSAEG-FLLDDNDETAAAA-GEQL 325
Query: 340 GAAIAASLTIPSGSTRSVTFSLAWDCPEVKFFEKV-YHRRYTKFYGTLGDSAARIAHDAI 398
AIA T+ G TR + F +AWD P +F V Y+RRYT F+G G +A I A+
Sbjct: 326 ACAIAVRFTLRPGKTRKIPFIIAWDLPVTEFASGVWYYRRYTDFFGRNGKNAWPIIRTAM 385
Query: 399 LEHAKWECEIEAWQRPILEDKRFPEWYPITLFNELYYLNAGGTIWTDGSPPMQSLATIRE 458
W IEAWQ+PILE K + + + LFNELY L GGT+W S A R+
Sbjct: 386 KHSDTWRENIEAWQKPILERKDLSDSFKMALFNELYALTDGGTLW--------SAADERD 437
Query: 459 RKFSLDTPSKIHTPASSDTALGTRLLENGEENIGQFLYLEGAEYVMYNTYDVHFYSSFAL 518
K GQF LE +Y Y + DV Y SFAL
Sbjct: 438 PK-------------------------------GQFAVLECIDYRWYESLDVRLYGSFAL 466
Query: 519 VMLFPKLELSIQRDFAAAVMMHDPGTMKIMSDGKWVARKCLGAVPHDIGL--DDPWFEIN 576
+ML+P LE S+ FA A+ D I + RK +GA PHD+G + PW + N
Sbjct: 467 LMLWPDLEKSVVLAFARAISAGDDTPRIIGWNQASAIRKAVGATPHDLGAPNEHPWEKTN 526
Query: 577 SYNLFNSSRWKDLNSKFVLQVYRDFVATG--DKNFARAVWPSVYIAMAYMEQFDKDGDGM 634
+ + + WKDL FVLQVYRDF+ TG D F + WP++ A+AY++ FD+DGDG+
Sbjct: 527 YTSYQDCNLWKDLPCDFVLQVYRDFLLTGGTDSEFLQECWPAIVEALAYLKTFDEDGDGI 586
Query: 635 IENEGFPDQTYDAWSANGVSAYCGGLWVAALQAASALANDVGDHAS----------ASYF 684
EN G PDQT+D W G+SAYCGGLW+AAL+AA A+ + D A A Y
Sbjct: 587 PENSGAPDQTFDDWQMRGISAYCGGLWLAALEAAIAIGEILEDAAPQIIPNPQAKIALYH 646
Query: 685 WVRYQKAKAVYDSLWNGSYFNYDNSDGSSSTSIQADQLAGQWYARACGLLPIADEAKVKK 744
Q + + LWNG Y+ D+ S+S + ADQL GQ+YA+ GL I
Sbjct: 647 SWLLQARPSYQEKLWNGQYYRLDSE--SNSEVVMADQLCGQFYAKLLGLPDIVPPECAVS 704
Query: 745 ALTKIYDFNVLKVKGGMCGAMNGMQPDGRIDM-SGLQAREIWPGVTYGLAASMIQEEMVD 803
AL +Y+ LK G GA NG+ +G + + E+W G+ +GLAA ++Q M D
Sbjct: 705 ALETVYESCFLKFNEGEFGAANGVMLNGSPENPNATHPLEVWTGINFGLAAFLVQMGMED 764
Query: 804 MAFQTAAGVYEVAWSEDGLGYSFQTPESWNNNDEYRSLCYMRPLTIWAMQWALTKPK 860
AF+ V + E+GL F+TPE+ +R+ Y+R + IWA+ LT K
Sbjct: 765 KAFKLTDAVVNQIY-ENGL--QFRTPEAITAGGTFRASHYLRAMAIWAIYGVLTNFK 818
>gi|339252284|ref|XP_003371365.1| AP-1 complex subunit gamma-1 [Trichinella spiralis]
gi|316968412|gb|EFV52690.1| AP-1 complex subunit gamma-1 [Trichinella spiralis]
Length = 1615
Score = 383 bits (983), Expect = e-103, Method: Compositional matrix adjust.
Identities = 262/807 (32%), Positives = 383/807 (47%), Gaps = 119/807 (14%)
Query: 69 KGKVPVFNVFRKHHITSDQGIPLGGIGAGSIGRSYRGEFQRFKLFHGIC--DDAPVLANQ 126
+G + FR + G+P G IG+GSIGR G F +F L GI D + V A+
Sbjct: 899 RGTSMTIDSFRPLQLKPIYGVPCGTIGSGSIGRHILGGFCKFSLVPGIVEHDVSSVPADN 958
Query: 127 FSVFVSRPNGEKFSSVLCPRSPGVPKKNTDSGIESWDWNLKGENCTYHALFPRAWTVYDG 186
F V V + VL K + SW +N E +Y +P +WT Y
Sbjct: 959 FIVTVRDAGVTIYQQVL-----SAVKVENKKFLSSWLFNFPAECMSYLGRYPMSWTTYKL 1013
Query: 187 EPDPELRIVCRQISPFIPHNYKESSFPASVFTFTLSNSGQTSADVT--LLFTWANSVAGD 244
P L ++C Q SP IPHNYK+S P ++F +T+ + S+D+T + FT+ N
Sbjct: 1014 -PCHNLILICHQYSPVIPHNYKDSCLPLTLFKWTVMSEESLSSDLTVSITFTFRNGTGRP 1072
Query: 245 SGLSGHHFNSKTMTK---DGVH--GLTLHHRTANGRPPVTFAVAAEETADVHVSECPCFL 299
H + + + VH G++LH +T +G + VAA T + V+
Sbjct: 1073 RVDDQHQMRANSFLERMSSSVHCVGVSLH-QTIDGMN-CDYCVAALNTKSISVTHS---- 1126
Query: 300 LSGNSKGITAKDMWNEIKKHGSFDHLDNDKTSPSEPGSSIGAAIAASLTIPSGSTRSVTF 359
+ N +G ++W + + G + D+ E + A++ I S + F
Sbjct: 1127 -TFNPEG-CGSEIWKSLYETGRLSNESTDEFVVVE-DRPLAIAVSCCTNIRSPCQEGIVF 1183
Query: 360 SLAWDCPEVKFFEK--VYHRRYTKFYGTLGDSAARIAHDAILEHAKWECEIEAWQRPILE 417
SL W PE+ F K Y RRY+ ++ D+A R++ A+ + +I+ WQ PIL+
Sbjct: 1184 SLVWHMPEIYFGNKRRTYLRRYSDWFDESSDAAKRLSIYAMENREIFLSKIDEWQEPILK 1243
Query: 418 DKRFPEWYPITLFNELYYLNAGGTIWTDGSPPMQSLATIRERKFSLDTPSKIHTPASSDT 477
+ P+WY +FNELYYL GG++W ++++ D P
Sbjct: 1244 NDNLPKWYKSAIFNELYYLTDGGSVWVK-----------YDQRWRDDEPHLTW------- 1285
Query: 478 ALGTRLLENGEENIGQFLYLEGAEYVMYNTYDVHFYSSFALVMLFPKLELSIQRDFAAAV 537
EY MYNTYDVHFY+SFAL LFP++E SIQ +FA
Sbjct: 1286 -----------------------EYRMYNTYDVHFYASFALAKLFPQIEHSIQAEFADQF 1322
Query: 538 MMHDPGTMKIMSDGKWVARKCLGAVPHDIGLDDPWFEINSYNLFNSSRWKDLNSKFVLQV 597
+ +S G +V +PW E+N+Y ++S WKDLN KFVL
Sbjct: 1323 YNVTQTNVYHISPGTYVF--------------EPWLEVNAYAFHDTSEWKDLNLKFVLTS 1368
Query: 598 YRDFVATG-DKNFARAVWPSVY-IAMAYMEQFDKDGDGMIENEGFPDQTYDAWSANGVSA 655
YRD++ ++F R VWP+V I ++ +D DGD ++EN+G DQTYDAW +
Sbjct: 1369 YRDYLMLNKSESFLRHVWPAVKEIIETALQNWDHDGDKLVENDGTCDQTYDAWCMH---- 1424
Query: 656 YCGGLWVAALQAASALANDVGDHASASYFWVRYQKAKAVYDSLW--NGSYFNYDNSDGSS 713
W ++ A L QK + + W G+Y+ +D +S
Sbjct: 1425 -----WTLLVELAKRL---------------EMQKKSLISEQCWMMRGTYYQFDQQSCNS 1464
Query: 714 STSIQADQLAGQWYARACGL----LPIADEAKVKKALTKIYDFNVLKVKGGMCGAMNGMQ 769
ST + ADQL G Y + CGL LP+ + VKK L IY+ NV G GA+NGM
Sbjct: 1465 ST-VMADQLCGYAYLKICGLQTDILPLDN---VKKVLETIYNLNVCSFGNGTLGAVNGML 1520
Query: 770 PDGRIDMSGLQAREIWPGVTYGLAASMIQEEMVDMAFQTAAGVYEVAWSEDGLGYSFQTP 829
G D S LQA E W GVTY L+A MI E V+ F TA+G+Y+ + + G +QTP
Sbjct: 1521 YSGEKDTSSLQADEAWTGVTYFLSAHMISEGFVEQGFSTASGIYKSCF--ESFGMHYQTP 1578
Query: 830 ESWNNNDEYRSLCYMRPLTIWAMQWAL 856
E+ +R++ YMRPL+IWA+QW L
Sbjct: 1579 EALYEKKWFRAVGYMRPLSIWAIQWYL 1605
>gi|254410084|ref|ZP_05023864.1| conserved hypothetical protein [Coleofasciculus chthonoplastes PCC
7420]
gi|196183120|gb|EDX78104.1| conserved hypothetical protein [Coleofasciculus chthonoplastes PCC
7420]
Length = 814
Score = 383 bits (983), Expect = e-103, Method: Compositional matrix adjust.
Identities = 267/834 (32%), Positives = 384/834 (46%), Gaps = 123/834 (14%)
Query: 87 QGIPLGGIGAGSIGRSYRGEFQRFKLFHGICDDAPVLANQFSVFVSRPNGEKFSSVLCPR 146
G+PLGG GAG +GRS RG+F + L G + A QFSVF N + + LC
Sbjct: 41 HGMPLGGFGAGCLGRSPRGDFNLWHLDGGEHTFKRLPACQFSVFEQPENQDAQAYALCTE 100
Query: 147 SPGVPKKNTDSGIESWDWNLKGENC-----------TYHALFPRAWTVYDGEPDPELRIV 195
P D + W W N TYHAL+PR+W VY+ EL
Sbjct: 101 PP------EDGSLGKWQWYPTSPNVETRHGASLRTGTYHALYPRSWFVYENVFQTEL--T 152
Query: 196 CRQISPFIPHNYKESSFPASVFTFTLSNSGQTSADVTLLFTWANSVA------------- 242
C+Q SP Y+ESS+P +VF +T N ++++ TW N V
Sbjct: 153 CQQFSPIWADCYQESSYPVAVFEWTAHNPTDQPITLSIMLTWQNMVGWFTNAIQHPQVQV 212
Query: 243 GDSGLSGHHFNSKTMTKDGVHG---------------LTLHHRTANGRPPVTFAVAAEET 287
D G + + + G + + L G + FA +
Sbjct: 213 RDDGSPVYEYQPRWGNSTGNYNQWIVDNFRVGCLLDRVRLRDEIEEGEGQLCFATVTNPS 272
Query: 288 ADVHVSECPCFLLSGNSKGITAKDMWNEIKKHGSFDHLDNDKTSPSEPGSSIGAAIAASL 347
+V + N G ++WN GS + ++ +P+EPG I AIA +
Sbjct: 273 QEVF------YHCRWNPTG-DGSEVWNTFASDGSLSDVQDE--TPAEPGEQIAGAIAVRV 323
Query: 348 TIPSGSTRSVTFSLAWDCPEVKFFEKV-YHRRYTKFYGTLGDSAARIAHDAILEHAKWEC 406
+ G TR + F LAWD P +F V Y+RRYT F+G G++A + A+ W+
Sbjct: 324 RLRPGRTRKIPFILAWDFPVTEFASGVKYYRRYTDFFGRTGNNAWSMVRTALKHSDLWQE 383
Query: 407 EIEAWQRPILEDKRFPEWYPITLFNELYYLNAGGTIWTDGSPPMQSLATIRERKFSLDTP 466
I +WQ PIL+ P W+ + LFNELY L+AGGTIW+
Sbjct: 384 RILSWQNPILQRHDLPNWFKMALFNELYLLSAGGTIWSSA-------------------- 423
Query: 467 SKIHTPASSDTALGTRLLENGEENIGQFLYLEGAEYVMYNTYDVHFYSSFALVMLFPKLE 526
N + +G+F LE +Y Y + DV Y SFAL ML+ +L+
Sbjct: 424 -------------------NERDPVGKFGVLESFDYRWYESLDVRLYGSFALAMLWSRLD 464
Query: 527 LSIQRDFAAAVMMHDPGTMKIMSDGKWVARKCLGAVPHDIGL--DDPWFEINSYNLFNSS 584
++ FA A+ D I + RK A PHD+G + PW + N + + +
Sbjct: 465 KAVLEAFARAIPTSDDTPRIIGYNQATAIRKAADATPHDLGAANEHPWEKTNYTSYQDCN 524
Query: 585 RWKDLNSKFVLQVYRDFVATGDKN--FARAVWPSVYIAMAYMEQFDKDGDGMIENEGFPD 642
+WKDL FVLQVYR FV TG K+ F W ++ A+AY++ FD DGDG+ EN G PD
Sbjct: 525 QWKDLPCDFVLQVYRTFVFTGSKDIEFLAECWSAIVRALAYLKSFDHDGDGIPENSGAPD 584
Query: 643 QTYDAWSANGVSAYCGGLWVA----------------ALQAASALANDVGDHASASYFWV 686
QT+D W GVSAYCGGLW+A L +A L + + + +
Sbjct: 585 QTFDDWQLQGVSAYCGGLWLAALEAASAIAQILLDNYTLVSAINLPDPESIQHTLNTYHK 644
Query: 687 RYQKAKAVY-DSLWNGSYFNYDNSDGSSSTSIQADQLAGQWYARACGLLPIADEAKVKKA 745
Q+A+ +Y + LWNG Y+ D+ S S + ADQL GQ+YA+ GL I + A
Sbjct: 645 WLQQAQPIYQEKLWNGQYYRLDSD--SDSDVVMADQLCGQFYAQLLGLPDIVPPECTQSA 702
Query: 746 LTKIYDFNVLKVKGGMCGAMNGMQPDGRIDM-SGLQAREIWPGVTYGLAASMIQEEMVDM 804
L +Y+ LK GA NG++PDG + E+W G+ +GLAA +IQ M D
Sbjct: 703 LKTVYESCFLKFHNAQFGAANGVKPDGSPEHPDATHPLEVWTGINFGLAAFLIQLGMKDE 762
Query: 805 AFQTAAGVYEVAWSEDGLGYSFQTPESWNNNDEYRSLCYMRPLTIWAMQWALTK 858
A + V + E+GL F+TPE+ +R+ Y+R + IWA+ L++
Sbjct: 763 ALKLTESVVRQVY-ENGL--QFRTPEAITPVGTFRAGHYLRAMAIWAVYLTLSQ 813
>gi|300866990|ref|ZP_07111661.1| conserved hypothetical protein [Oscillatoria sp. PCC 6506]
gi|300335025|emb|CBN56827.1| conserved hypothetical protein [Oscillatoria sp. PCC 6506]
Length = 863
Score = 382 bits (980), Expect = e-103, Method: Compositional matrix adjust.
Identities = 282/851 (33%), Positives = 397/851 (46%), Gaps = 139/851 (16%)
Query: 87 QGIPLGGIGAGSIGRSYRGEFQRFKLFHGICDDAPVLANQFSVFVSRPNGEKFSSVLCPR 146
G+PLGG GAG IGRS+RG+F + + G + A QFSVF +G+K + L
Sbjct: 66 HGMPLGGFGAGCIGRSHRGDFNLWHIDGGEHVFRSLPACQFSVF-EEIDGKKQAYALSTE 124
Query: 147 SPGVPKKNTDSGIESWDW----------NLKGE---------NCTYHALFPRAWTVYDGE 187
P TD + +W W LK E +YHAL+PR+W VY+
Sbjct: 125 PP------TDGSLSAWKWYPSELKIKNEKLKSEEQNSDSSLSTGSYHALYPRSWFVYENV 178
Query: 188 PDPELRIVCRQISPFIPHNYKESSFPASVFTFTLSNSGQTSADVTLLFTWANSVA----- 242
+L C Q SP NY+ESS+P ++F + N ++++ TW N+V
Sbjct: 179 FQAQLS--CEQFSPIWAGNYQESSYPVAIFEWIAHNPTDRPITLSIMLTWQNTVGWFTNA 236
Query: 243 --------GDSGLSGHHFNSKTMTKDGVHGLTLH--HRTANGRPPVTFAVAAEET----- 287
D G + +NS+ G + HR +T A+EE
Sbjct: 237 NKSPEIKVRDDGSPVYEYNSRWGESGGNFNRLVEDFHRIGC---VMTRLSASEEVKEGEG 293
Query: 288 ----ADVHVSECPCFLLSGNSKGITAKDMWNEIKKHGSFDHLDNDKTSPSEPGSSIGAAI 343
A V + F + + + +D+W + G LD+ +P+ G + AI
Sbjct: 294 QWAIATVANAAVEVFYQTRWNPAGSGEDIWRPFSQEGFL--LDDPDETPAAGGEQLACAI 351
Query: 344 AASLTIPSGSTRSVTFSLAWDCPEVKFFEKV-YHRRYTKFYGTLGDSAARIAHDAILEHA 402
A TI G TR + F LAWD P +F +V Y+RRYT F+G G + I A+
Sbjct: 352 AVRFTIRPGKTRKIPFILAWDFPITEFAPEVWYYRRYTDFFGRNGKNVWSIIRTAMKHSD 411
Query: 403 KWECEIEAWQRPILEDKRFPEWYPITLFNELYYLNAGGTIWTDGSPPMQSLATIRERKFS 462
W I+AWQ+PIL + P+ + +FNELY L GGT+W S A R+ K
Sbjct: 412 TWRENIQAWQQPILAREDLPDIIKMAMFNELYALTDGGTLW--------SAADERDPK-- 461
Query: 463 LDTPSKIHTPASSDTALGTRLLENGEENIGQFLYLEGAEYVMYNTYDVHFYSSFALVMLF 522
GQF LE +Y Y + DV Y SFAL+ML+
Sbjct: 462 -----------------------------GQFAVLECIDYRWYESLDVRLYGSFALLMLW 492
Query: 523 PKLELSIQRDFAAAVMMHDPGTMKIMSDGKWVARKCLGAVPHDIGL--DDPWFEINSYNL 580
P+L+ ++ FA A+ D I + RK GA PHD+G + PW + N +
Sbjct: 493 PELDKAVLLAFARAIFTGDDTPRVIGWNQASAVRKIAGATPHDLGAPNEHPWEKTNYTSY 552
Query: 581 FNSSRWKDLNSKFVLQVYRDFVATG--DKNFARAVWPSVYIAMAYMEQFDKDGDGMIENE 638
+ + WKDL FVLQVYRDF+ TG D F + WP++ A+AY++ FD DGDG+ EN
Sbjct: 553 QDCNLWKDLPCDFVLQVYRDFLLTGSTDYEFLQECWPAIVEALAYLKTFDLDGDGIPENS 612
Query: 639 GFPDQTYDAWSANGVSAYCGGLWVAALQAA-------------------SALANDVGDHA 679
G PDQT+D W GVSAYCGGLW+AAL+AA S + N D
Sbjct: 613 GAPDQTFDDWQMRGVSAYCGGLWLAALEAAIAIGSLLIKHQSSVEELNQSPITNFNKDFK 672
Query: 680 SASYFWV-RYQ-----------KAKAVYDS-LWNGSYFNYDNSDGSSSTSIQADQLAGQW 726
S F + YQ +AK +Y LWNG Y+ D+ GS + ADQL GQ+
Sbjct: 673 DVSLFPITNYQLPITNYQNWLEQAKPIYQQRLWNGQYYRLDSESGSD--VVMADQLCGQF 730
Query: 727 YARACGLLPIADEAKVKKALTKIYDFNVLKVKGGMCGAMNGMQPDGRIDMSG-LQAREIW 785
YAR GL I E AL +Y+ +K G GA NG+ DG + G E+W
Sbjct: 731 YARLLGLPDIVPEECAISALETVYESCFIKFNQGEFGAANGVMLDGSPEKPGATHPLEVW 790
Query: 786 PGVTYGLAASMIQEEMVDMAFQTAAGVYEVAWSEDGLGYSFQTPESWNNNDEYRSLCYMR 845
G+ +GLAA M+Q M + A + V + ++ G F+TPE+ +R+ Y+R
Sbjct: 791 TGINFGLAAFMMQMGMKEKALELTEVVVKQIYAN---GLQFRTPEAITATGTFRASHYLR 847
Query: 846 PLTIWAMQWAL 856
+ IWA+ + L
Sbjct: 848 AMAIWAIYYEL 858
>gi|209526014|ref|ZP_03274547.1| protein of unknown function DUF608 [Arthrospira maxima CS-328]
gi|376002240|ref|ZP_09780079.1| conserved hypothetical protein [Arthrospira sp. PCC 8005]
gi|423067988|ref|ZP_17056778.1| hypothetical protein SPLC1_S630380 [Arthrospira platensis C1]
gi|209493540|gb|EDZ93862.1| protein of unknown function DUF608 [Arthrospira maxima CS-328]
gi|375329367|emb|CCE15832.1| conserved hypothetical protein [Arthrospira sp. PCC 8005]
gi|406710526|gb|EKD05737.1| hypothetical protein SPLC1_S630380 [Arthrospira platensis C1]
Length = 799
Score = 380 bits (977), Expect = e-102, Method: Compositional matrix adjust.
Identities = 270/826 (32%), Positives = 390/826 (47%), Gaps = 120/826 (14%)
Query: 87 QGIPLGGIGAGSIGRSYRGEFQRFKLFHGICDDAPVLANQFSVFVSRPNGEKFSSVLCPR 146
G+PLGG GAG IGRS+RG+F + L G P+ QFSVF +G++ + L +
Sbjct: 41 HGMPLGGFGAGCIGRSHRGDFNLWHLDGGEHIFRPLPGCQFSVF-EEIDGKRQAYALSTQ 99
Query: 147 SPGVPKKNTDSGIESWDW----NLKGENCTYHALFPRAWTVYDGEPDPELRIVCRQISPF 202
P D + +W W + + TY+AL+PR+W VY E + ++ C Q SP
Sbjct: 100 PP------QDGSLSAWSWYPTSDQHTQTGTYYALYPRSWFVY--ENVFQTQLTCEQFSPV 151
Query: 203 IPHNYKESSFPASVFTFTLSNSGQTSADVTLLFTWANSVA-------------------- 242
NY+E+S+P +VF + N +++L TW N++
Sbjct: 152 WGGNYQETSYPIAVFEWIAHNPTDKLIILSILLTWENTIGWFTNSLENPTVKVRDDGSPV 211
Query: 243 -------GDSG------LSGHHFNSKTMTKDGVHGLTLHHRTANGRPPVTFAVAAEETAD 289
GDS + H MT+ L ++ + G + A + +
Sbjct: 212 YEYQPRWGDSQDNLNRFVEDFHRVGCVMTR-----LNINDQPREGEGQIAIATISNPAVE 266
Query: 290 VHVSECPCFLLSGNSKGITAKDMWNEIKKHGSFDHLDNDKTSPSEPGSSIGAAIAASLTI 349
V+ N G D+W K GS + +D+ P+ G IGAA+A +
Sbjct: 267 VYYHN------RWNPNG-NGYDIWEYFSKDGSLIDIQDDR--PAVEGERIGAALAVRCIL 317
Query: 350 PSGSTRSVTFSLAWDCPEVKFFEKV-YHRRYTKFYGTLGDSAARIAHDAILEHAKWECEI 408
G R + F +AWD P +F V Y+R YT F+G G +A + A+ W I
Sbjct: 318 RPGKIRKIPFFIAWDLPVCEFEAGVSYYRHYTDFFGRNGRNAWSMIRTAMKHSDTWRENI 377
Query: 409 EAWQRPILEDKRFPEWYPITLFNELYYLNAGGTIWTDGSPPMQSLATIRERKFSLDTPSK 468
EAWQ PIL + P W+ + LFNELY L GG++WT AT ++ +
Sbjct: 378 EAWQNPILNRQDLPPWFKMALFNELYLLTDGGSLWT---------ATDEDQPW------- 421
Query: 469 IHTPASSDTALGTRLLENGEENIGQFLYLEGAEYVMYNTYDVHFYSSFALVMLFPKLELS 528
G+F LE +Y Y + DV Y SF L+M +P+LE S
Sbjct: 422 -----------------------GRFAVLECLDYRWYESLDVRLYGSFGLLMSWPELEKS 458
Query: 529 IQRDFAAAVMMHDPGTMKIMSDGKWVARKCLGAVPHDIGL--DDPWFEINSYNLFNSSRW 586
+ R FA A+ D I + RK GA PHD+G + PW + N + + + W
Sbjct: 459 VIRTFAKAIATADHTPRIIGYNQASAIRKAAGATPHDLGAPNEHPWEKTNYTSYQDCNLW 518
Query: 587 KDLNSKFVLQVYRDFVATG--DKNFARAVWPSVYIAMAYMEQFDKDGDGMIENEGFPDQT 644
KDL FVLQVYRD++ TG D F WP++ + Y++ FD+D DG+ EN G PDQT
Sbjct: 519 KDLPCDFVLQVYRDYLLTGADDIQFLVECWPAIVETLDYLKTFDRDQDGIPENGGAPDQT 578
Query: 645 YDAWSANGVSAYCGGLWVAALQAASALANDVGDH--------ASASYFWVRY-QKAKAVY 695
+D W G+SAYCGGLW+AAL+AA +AN + H A W + + AK +Y
Sbjct: 579 FDDWRLVGISAYCGGLWLAALEAAIEIANILLSHNRDLTPDTTKAIATWTNWLETAKPLY 638
Query: 696 D-SLWNGSYFNYDNSDGSSSTSIQADQLAGQWYARACGLLPIADEAKVKKALTKIYDFNV 754
D +LWNGSY+N D+ GS + ADQL GQ+YA GL I + + AL IY
Sbjct: 639 DQTLWNGSYYNLDSQSGSD--VVMADQLCGQFYAALLGLPDIVPPHRAQVALQTIYQACF 696
Query: 755 LKVKGGMCGAMNGMQPDGR-IDMSGLQAREIWPGVTYGLAASMIQEEMVDMAFQTAAGVY 813
G GA NG++ G ++ + E+W G+ +GL A IQ M + AFQ A V
Sbjct: 697 QNFHNGKFGAANGVRTTGEPMNPNDTHPLEVWTGINFGLGAFFIQMGMKEEAFQLAEAVI 756
Query: 814 EVAWSEDGLGYSFQTPESWNNNDEYRSLCYMRPLTIWAMQWALTKP 859
+ G F+TPE+ +R+ Y+R + IWAM + + P
Sbjct: 757 TQVYQN---GLQFRTPEAITAAGTFRASHYLRAMAIWAMYYQIQHP 799
>gi|409991122|ref|ZP_11274412.1| hypothetical protein APPUASWS_08895 [Arthrospira platensis str.
Paraca]
gi|291568810|dbj|BAI91082.1| hypothetical protein [Arthrospira platensis NIES-39]
gi|409938009|gb|EKN79383.1| hypothetical protein APPUASWS_08895 [Arthrospira platensis str.
Paraca]
Length = 799
Score = 380 bits (975), Expect = e-102, Method: Compositional matrix adjust.
Identities = 275/826 (33%), Positives = 395/826 (47%), Gaps = 120/826 (14%)
Query: 87 QGIPLGGIGAGSIGRSYRGEFQRFKLFHGICDDAPVLANQFSVFVSRPNGEKFSSVLCPR 146
G+PLGG GAG IGRS+RG+F + L G P+ QFSVF +G++ + L +
Sbjct: 41 HGMPLGGFGAGCIGRSHRGDFNLWHLDGGEHIFRPLPGCQFSVF-EEIDGKRQAYALSTQ 99
Query: 147 SPGVPKKNTDSGIESWDW----NLKGENCTYHALFPRAWTVYDGEPDPELRIVCRQISPF 202
P D + +W+W + + TYHAL+PR+W VY E + ++ C Q SP
Sbjct: 100 PP------EDGSLSTWNWYPSSDQHTQTGTYHALYPRSWFVY--ENVFQTQLTCEQFSPV 151
Query: 203 IPHNYKESSFPASVFTFTLSNSGQTSADVTLLFTWANSVA-------------------- 242
NY+E+S+P +VF + N +++L TW N++
Sbjct: 152 WGGNYQETSYPVAVFEWIAHNPTDKLIILSILLTWENTIGWFTNRLDTPAVKVRDDGSPV 211
Query: 243 -------GDSGLSGHHFNSK------TMTKDGVHGLTLHHRTANGRPPVTFAVAAEETAD 289
GDS + +HF MT+ L ++ + G + A +
Sbjct: 212 YEYQPRWGDSQDNCNHFVEDFHRVGCVMTR-----LNINDQLGEGDGQMAIATITNPVVE 266
Query: 290 VHVSECPCFLLSGNSKGITAKDMWNEIKKHGSFDHLDNDKTSPSEPGSSIGAAIAASLTI 349
V+ + SGN D+W + GS + +D P+ G IGAA+A +
Sbjct: 267 VYCHNR--WRPSGN-----GYDIWEYFSQDGSLIDIHDD--GPALEGERIGAALAVRCIL 317
Query: 350 PSGSTRSVTFSLAWDCPEVKFFEKV-YHRRYTKFYGTLGDSAARIAHDAILEHAKWECEI 408
G TR + F +AWD P +F V Y+R YT F+G G +A + A+ W I
Sbjct: 318 RPGKTRKIPFFIAWDLPVTEFEAGVTYYRHYTDFFGRNGRNAWSMIRTAMKHSDTWRENI 377
Query: 409 EAWQRPILEDKRFPEWYPITLFNELYYLNAGGTIWTDGSPPMQSLATIRERKFSLDTPSK 468
EAWQ PIL + P W+ + LFNELY L GG++WT ++D
Sbjct: 378 EAWQNPILNRQDLPPWFKMALFNELYLLTDGGSLWT-----------------AIDE--- 417
Query: 469 IHTPASSDTALGTRLLENGEENIGQFLYLEGAEYVMYNTYDVHFYSSFALVMLFPKLELS 528
+E G+F LE +Y Y + DV Y SF L+M +P+LE S
Sbjct: 418 -------------------DEPWGRFAVLECLDYRWYESLDVRLYGSFGLLMWWPELEKS 458
Query: 529 IQRDFAAAVMMHDPGTMKIMSDGKWVARKCLGAVPHDIGL--DDPWFEINSYNLFNSSRW 586
+ R FA A+ D I + RK GA PHD+G + PW + N + + + W
Sbjct: 459 VMRAFAKAIATDDDTPRIIGYNQASAIRKAAGATPHDLGAPNEHPWEKTNYTSYQDCNLW 518
Query: 587 KDLNSKFVLQVYRDFVATG--DKNFARAVWPSVYIAMAYMEQFDKDGDGMIENEGFPDQT 644
KDL FVLQVYRD++ TG D F WP++ + Y++ FD+D D + EN G PDQT
Sbjct: 519 KDLPCDFVLQVYRDYLLTGADDIQFLVECWPAIVQTLDYLKTFDRDRDCIPENGGAPDQT 578
Query: 645 YDAWSANGVSAYCGGLWVAALQAASALANDVGDHAS--------ASYFWVRY-QKAKAVY 695
+D W G+SAYCGGLW+AAL+AA A+AN + H S A W + + AK +Y
Sbjct: 579 FDDWRMMGISAYCGGLWLAALEAAIAIANILLSHHSDITPDTTKAIATWTNWLETAKPLY 638
Query: 696 D-SLWNGSYFNYDNSDGSSSTSIQADQLAGQWYARACGLLPIADEAKVKKALTKIYDFNV 754
D +LWNGSY+N D+ GS + ADQL GQ+YA GL I + + AL IY
Sbjct: 639 DQTLWNGSYYNLDSQSGSD--VVMADQLCGQFYAALLGLPDIVPHHRARVALQTIYQACF 696
Query: 755 LKVKGGMCGAMNGMQPDGR-IDMSGLQAREIWPGVTYGLAASMIQEEMVDMAFQTAAGVY 813
G GA NG++ G I+ E+W G+ +GL A +IQ M + AFQ A V
Sbjct: 697 QNFHNGKFGAANGVRTTGEPINPQDTHPLEVWTGINFGLGAFLIQMGMKEEAFQLAEAVI 756
Query: 814 EVAWSEDGLGYSFQTPESWNNNDEYRSLCYMRPLTIWAMQWALTKP 859
+ G F+TPE+ +R+ Y+R + IWAM + + P
Sbjct: 757 TQVYQN---GLQFRTPEAITAAGTFRASHYLRAMAIWAMYYQIQHP 799
>gi|22297737|ref|NP_680984.1| hypothetical protein tlr0193 [Thermosynechococcus elongatus BP-1]
gi|22293914|dbj|BAC07746.1| tlr0193 [Thermosynechococcus elongatus BP-1]
Length = 806
Score = 379 bits (974), Expect = e-102, Method: Compositional matrix adjust.
Identities = 279/799 (34%), Positives = 393/799 (49%), Gaps = 93/799 (11%)
Query: 88 GIPLGGIGAGSIGRSYRGEFQRFKLFHGICDDAPVLANQFSVFVSRPNGEKFSSVLCPRS 147
G+PLGG GAG IGRS G+F + + G + A QFS+F + ++ P+
Sbjct: 57 GMPLGGFGAGCIGRSSAGDFNLWHVDGGEHIFGTLPACQFSLFEQGEQTQAYALGSAPK- 115
Query: 148 PGVPKKNTDSGIESWDWNLKGENCTYHALFPRAWTVYDGEPDPELRIVCRQISPFIPHNY 207
D + SW W G+ TY +PR+W VY+G +I C Q SP +PHNY
Sbjct: 116 --------DGRLSSWQWYPAGKG-TYAVRYPRSWFVYEGVF--RAQITCEQFSPILPHNY 164
Query: 208 KESSFPASVFTFTLSNSGQTSADVTLLFTWANSVA-------------GDSGLSGHHFNS 254
+E+S+P +VF +T SN S ++L+ +W N+V D G + +
Sbjct: 165 QETSYPVAVFLWTFSNPTDQSLTLSLMLSWQNTVGWFCNTTPSSAIAIRDDGSPVYTYTP 224
Query: 255 KTMTKDGVHGLTLHHRTANG-------RPPVTFAVAAEETADVHVSECPCFLLSGNSKGI 307
+ DG + + G P E A + F S
Sbjct: 225 RWGQSDGNFNELIQTESFQGWRLRRMPHPNPPQEGDGEWAALIPTGLGEFFGCSRWQPEG 284
Query: 308 TAKDMWNEIKKHGSFDHLDNDKTSPSEPGSSIGAAIAASLTIPSGSTRSVTFSLAWDCPE 367
+W GS + ND T P+ G + AA A ++ G + + LAWD P
Sbjct: 285 DGAHLWQSFSVDGSLPFV-NDPT-PAAAGEQVAAAFALRFSLAPGERKQIPVVLAWDFPV 342
Query: 368 VKFFEKV-YHRRYTKFYGTLGDSAARIAHDAILEHAKWECEIEAWQRPILEDKRFPEWYP 426
+F + V Y+RRYT F G +A +A A+ +A W+ +I WQ PIL +P+W+
Sbjct: 343 TEFGKGVIYYRRYTDFCDRHGTNAVTLAAQALAAYATWQEQIRTWQAPILSHPDWPDWFK 402
Query: 427 ITLFNELYYLNAGGTIWTDGSPPMQSLATIRERKFSLDTPSKIHTPASSDTALGTRLLEN 486
+ L NELY L++GG++W+ A+SD
Sbjct: 403 MALCNELYVLSSGGSLWS----------------------------AASD---------- 424
Query: 487 GEENIGQFLYLEGAEYVMYNTYDVHFYSSFALVMLFPKLELSIQRDFAAAVMMHDPGTMK 546
+ +GQF LE +Y Y + DV Y SFAL+ L+P+LE S+ R FA A+ DP T++
Sbjct: 425 -RDPVGQFAVLECLDYRWYESLDVRLYGSFALLQLWPELEKSVMRAFARAIPTADP-TLR 482
Query: 547 IMS--------DGKWVARKCLGAVPHDIGL--DDPWFEINSYNLFNSSRWKDLNSKFVLQ 596
I+ RK AVPHD+G + PW + N + + WKDL S FVL
Sbjct: 483 IIGYFYRGDPETAYKAPRKLANAVPHDLGAPNEHPWEKTNYTAYQDCNLWKDLASDFVLL 542
Query: 597 VYRDFVATG--DKNFARAVWPSVYIAMAYMEQFDKDGDGMIENEGFPDQTYDAWSANGVS 654
VYRDF+ TG D NFAR WP+V A+ +++QFD+DGDG+ EN G PDQTYD W GVS
Sbjct: 543 VYRDFLFTGGTDLNFARECWPAVVAALDHLKQFDQDGDGLPENGGAPDQTYDDWKLQGVS 602
Query: 655 AYCGGLWVAALQAASALANDVGDHASASYFWVRYQKAKAVYDSLWNGSYFNYDNSDGSSS 714
AYCGGLW+AAL+AA AL + Y Q + LWNG Y+ D GS S
Sbjct: 603 AYCGGLWLAALEAAIALGTLLQQPQVEIYRQWLSQARPRYHQLLWNGEYYRLDT--GSGS 660
Query: 715 TSIQADQLAGQWYARACGLLPIADEAKVKKALTKIYDFNVLKVKGGMCGAMNGMQPDGRI 774
I ADQL GQ+YA+ GL+ I +AL KIYD LK G GA NG+ P+G+
Sbjct: 661 DVIMADQLCGQFYAQLLGLVDIVPPDCCDRALRKIYDTCFLKFHNGQFGAANGLLPNGQP 720
Query: 775 DMS-GLQAREIWPGVTYGLAASMIQEEMVDMAFQTAAGVYEVAWSEDGLGYSFQTPESWN 833
+ E+W G+ +GLAA + Q M+D A++ A V + E+GL F+TPE+
Sbjct: 721 ENPHATHPLEVWTGINFGLAAFLWQRGMIDEAWRLAEVVVRQIY-ENGL--QFRTPEAIT 777
Query: 834 NNDEYRSLCYMRPLTIWAM 852
N +R+ Y+RP+ IWA+
Sbjct: 778 ANGTFRACMYLRPMAIWAL 796
>gi|443311911|ref|ZP_21041533.1| putative bile acid beta-glucosidase [Synechocystis sp. PCC 7509]
gi|442777986|gb|ELR88257.1| putative bile acid beta-glucosidase [Synechocystis sp. PCC 7509]
Length = 804
Score = 378 bits (971), Expect = e-102, Method: Compositional matrix adjust.
Identities = 258/816 (31%), Positives = 398/816 (48%), Gaps = 110/816 (13%)
Query: 87 QGIPLGGIGAGSIGRSYRGEFQRFKLFHGICDDAPVLANQFSVFVSRPNGEKFSSVLCPR 146
G+PLGG GAG IGRS +G+F + + G + A QFSVF + N + LC
Sbjct: 41 HGMPLGGFGAGCIGRSSKGDFNLWHIDGGEHTFKSIPACQFSVFEQQSN---LAYALCTE 97
Query: 147 SPGVPKKNTDSGIESWDWNLKGENCTYHALFPRAWTVYDGEPDPELRIVCRQISPFIPHN 206
+P D +++W W + + TYHAL+PR+W VY+ +L C Q SP +P N
Sbjct: 98 AP------EDGSLKAWQWYPQNKG-TYHALYPRSWFVYENVFLSQL--TCEQFSPILPDN 148
Query: 207 YKESSFPASVFTFTLSNSGQTSADVTLLFTWANSVA-------------GDSGLSGHHFN 253
Y+E+S+P ++F + N ++++ TW N V D G + +
Sbjct: 149 YRETSYPVAIFVWKAHNPTDQPLTLSIMLTWQNMVGWFTNSIKSPQVRVRDDGSPVYEYE 208
Query: 254 SKTMTKDGVHGLTLHHRTANGRPPVTFAVAAEETA-------DVHVSECPCFLLSGNSK- 305
+ G + + TA+ + V +++ A + S P + +S +++
Sbjct: 209 PRLGQSAGNYNKLV--ETADSLGCILARVGSDDVALEGIGQWVIATSINPKYQISHHTRW 266
Query: 306 --GITAKDMWNEIKKHGSFDHLDNDKTSPSEPGSSIGAAIAASLTIPSGSTRSVTFSLAW 363
T D+W GS + ++ +P G + +AIA T+ G + + F LAW
Sbjct: 267 NPTGTGADVWQSFSTDGSLPNYIDE--TPIADGEQLASAIAIKFTLEPGESLEIPFVLAW 324
Query: 364 DCPEVKFFEKV-YHRRYTKFYGTLGDSAARIAHDAILEHAKWECEIEAWQRPILEDKRFP 422
D P +F V Y+RRYT F+G G +A IA +A+ + W+ +++AWQ+PI++ P
Sbjct: 325 DFPITEFAPGVNYYRRYTDFFGRNGSNAEAIAFNALQHYQTWQQQVQAWQQPIIDRTDLP 384
Query: 423 EWYPITLFNELYYLNAGGTIWTDGSPPMQSLATIRERKFSLDTPSKIHTPASSDTALGTR 482
+W+ + LFNELY L +GGT+W+ ER
Sbjct: 385 DWFKMALFNELYDLTSGGTLWSAAD----------ER----------------------- 411
Query: 483 LLENGEENIGQFLYLEGAEYVMYNTYDVHFYSSFALVMLFPKLELSIQRDFAAAVMMHDP 542
+ +GQF LE +Y Y + DV Y SF L+ML+ KL+ SI R +A A+ D
Sbjct: 412 ------DPVGQFAVLECLDYRWYESLDVRLYGSFGLLMLWSKLDKSILRAYARAISTSDR 465
Query: 543 GTMKIMSDGKWVARKCLGAVPHDIGL--DDPWFEINSYNLFNSSRWKDLNSKFVLQVYRD 600
+I + RK A PHD+G + PW + N + + + WKDL S FVLQVYRD
Sbjct: 466 TLREIGYNQATGLRKVANATPHDLGAPNEHPWEKTNYTSYQDCNLWKDLPSDFVLQVYRD 525
Query: 601 FVATG--DKNFARAVWPSVYIAMAYMEQFDKDGDGMIENEGFPDQTYDAWSANGVSAYCG 658
FV TG D W ++ + Y++ FD D DG+ EN G PDQT+D W GVSAYCG
Sbjct: 526 FVLTGSTDWELLWDCWGAIVQTLNYLKTFDLDNDGIPENSGAPDQTFDDWRLQGVSAYCG 585
Query: 659 GLWVAALQAASALA--------------------NDVGDHASASYFWVRYQKAKAVY-DS 697
GLW+A L++A A+ N + + ++++ VY +
Sbjct: 586 GLWLAGLESAIAIGKTLISYPQEHPLGEMLTSPLNQPPIEETIGIYQTWLEQSRPVYQEK 645
Query: 698 LWNGSYFNYDNSDGSSSTSIQADQLAGQWYARACGLLPIADEAKVKKALTKIYDFNVLKV 757
LWNG Y+ D+ GS + ADQL GQ+YAR G+ I + + +L +Y+ LK
Sbjct: 646 LWNGQYYKLDSESGSD--VVMADQLCGQFYARLLGVPDIVESDRTVSSLKTVYEACFLKF 703
Query: 758 KGGMCGAMNGMQPDGRIDM-SGLQAREIWPGVTYGLAASMIQEEMVDMAFQTAAGVYEVA 816
G GA NG++ DG + E+W G+ +GLAA ++Q +M + AF+ V E
Sbjct: 704 HNGAFGAANGLKIDGSPENPDATHPLEVWTGINFGLAAFLLQMDMKEEAFKLTQTVVEQI 763
Query: 817 WSEDGLGYSFQTPESWNNNDEYRSLCYMRPLTIWAM 852
++ G F+TPE+ +R+ Y+RP+ IWA+
Sbjct: 764 YNN---GLQFRTPEAITAVGTFRACHYLRPMAIWAV 796
>gi|434391596|ref|YP_007126543.1| Glucosylceramidase [Gloeocapsa sp. PCC 7428]
gi|428263437|gb|AFZ29383.1| Glucosylceramidase [Gloeocapsa sp. PCC 7428]
Length = 809
Score = 377 bits (969), Expect = e-101, Method: Compositional matrix adjust.
Identities = 272/829 (32%), Positives = 401/829 (48%), Gaps = 118/829 (14%)
Query: 87 QGIPLGGIGAGSIGRSYRGEFQRFKLFHGICDDAPVLANQFSVFVSRPNGEKFSSVLCPR 146
G+PLGG GAG IGR+ RG+F + + G + A QFS+F + + + + LC
Sbjct: 40 HGMPLGGFGAGCIGRASRGDFNLWHIDGGEHTFKSLPACQFSIFEQQASTSQ-AFALCTL 98
Query: 147 SPGVPKKNTDSGIESWDWNLK-GENCTYHALFPRAWTVYDGEPDPELRIVCRQISPFIPH 205
P D + +W W + EN YHAL+PR+W VY+ L C Q SP +P
Sbjct: 99 PPA------DGTLTTWQWYPQHCENGVYHALYPRSWFVYENVFQAHLS--CEQFSPILPE 150
Query: 206 NYKESSFPASVFTFTLSNSGQTSADVTLLFTWANSVA-------------GDSGLSGHHF 252
NY+E+S+P +VF + N ++++ TW N+V D G + +
Sbjct: 151 NYQETSYPVAVFVWRAHNPTNAPLTLSIMLTWQNTVGWFQNAIKNPEIRMRDDGSPVYEY 210
Query: 253 N----------SKTMTKDGVHGLTLHHRTANGRPPV-TFAVAAEETADVHVSECPCFLLS 301
++ + KD + G L ++ + + +A+A + ++ V + S
Sbjct: 211 EPCLGNNLGNFNRLIAKDNLCGCVLERVASDLQDGIGQWAIATPKQPNIEVFYHTRWNPS 270
Query: 302 GNSKGITAKDMWNEIKKHGSFDHLDNDKTSPSEPGSSIGAAIAASLTIPSGSTRSVTFSL 361
G+ ++W+ + GS + + DKTS SE + AAIA T+ G TR + F++
Sbjct: 271 GD-----GAEIWHSFARDGSLPNTE-DKTSASE-QEQLAAAIAVRFTLQPGETRELPFAI 323
Query: 362 AWDCPEVKFFEKV-YHRRYTKFYGTLGDSAARIAHDAILEHAKWECEIEAWQRPILEDKR 420
AWD P +F V Y RRYT F+G GD+A IA A+ + W+ +I+AWQ+PIL+
Sbjct: 324 AWDFPITEFAPGVKYFRRYTDFFGRGGDNAVAIAQTALQNYQSWQQQIQAWQQPILDRDD 383
Query: 421 FPEWYPITLFNELYYLNAGGTIWTDGSPPMQSLATIRERKFSLDTPSKIHTPASSDTALG 480
P + + LFNELY L +GGT+W+ A+ D
Sbjct: 384 LPSVFKMALFNELYDLTSGGTLWS----------------------------AADD---- 411
Query: 481 TRLLENGEENIGQFLYLEGAEYVMYNTYDVHFYSSFALVMLFPKLELSIQRDFAAAVMMH 540
+ +GQF LE +Y Y + DV Y SFAL+ML+ KLE S+ R FA A+
Sbjct: 412 -------RDPVGQFAVLECFDYRWYESLDVRLYGSFALLMLWSKLEKSVIRAFARAIGQR 464
Query: 541 DPGTMKIMS------DGKWVARKCLGAVPHDIGL--DDPWFEINSYNLFNSSRWKDLNSK 592
D T I + RK GA PHD+G + PW N + + + WKDL+
Sbjct: 465 DDRTRVIGYYYTQGLESPTALRKVAGATPHDLGAPNEHPWEATNYTSYQDCNLWKDLSCD 524
Query: 593 FVLQVYRDFVATG--DKNFARAVWPSVYIAMAYMEQFDKDGDGMIENEGFPDQTYDAWSA 650
FVLQVYRDFV TG D W +V + Y++ FD D DG+ EN G PDQT+D W
Sbjct: 525 FVLQVYRDFVLTGGTDWELLWDCWSAVVQTLTYLKTFDLDEDGIPENSGAPDQTFDDWRL 584
Query: 651 NGVSAYCGGLW----VAALQAASALANDVGDHASAS----------------YFWVRYQK 690
GVSAYCGGLW AA+ L + DH ++ F +
Sbjct: 585 QGVSAYCGGLWLAALEAAIAIGKTLLSYPDDHPASKILASAPDYPPIPETLDVFQSWLMR 644
Query: 691 AKAVY-DSLWNGSYFNYDNSDGSSSTSIQADQLAGQWYARACGLLPIADEAKVKKALTKI 749
++ +Y + LWNG Y+ D+ GS + ADQL GQ+YAR GL I +AL +
Sbjct: 645 SRPIYQEKLWNGQYYRLDSKSGSD--VVMADQLCGQFYARLLGLPDIVPPECATQALKTV 702
Query: 750 YDFNVLKVKGGMCGAMNGMQPDGRIDM-SGLQAREIWPGVTYGLAASMIQEEMVDMAFQT 808
YD +K G GA NG++PDG + + E+W G+ +GLAA ++Q M A++
Sbjct: 703 YDACFVKFHNGQFGAANGLKPDGLPENPNATHPLEVWTGINFGLAAFLMQMGMKSEAWRI 762
Query: 809 AAGVYEVAWSEDGLGYSFQTPESWNNNDEYRSLCYMRPLTIWAMQWALT 857
V + + G F+TPE+ +R+ Y+RP+ IWA+ LT
Sbjct: 763 TQAVVQQVYDN---GLQFRTPEAITAKGTFRACHYLRPMAIWAIYRVLT 808
>gi|427727368|ref|YP_007073605.1| putative bile acid beta-glucosidase [Nostoc sp. PCC 7524]
gi|427363287|gb|AFY46008.1| putative bile acid beta-glucosidase [Nostoc sp. PCC 7524]
Length = 803
Score = 377 bits (969), Expect = e-101, Method: Compositional matrix adjust.
Identities = 271/818 (33%), Positives = 399/818 (48%), Gaps = 102/818 (12%)
Query: 87 QGIPLGGIGAGSIGRSYRGEFQRFKLFHGICDDAPVLANQFSVFVSRPNGEKFSSVLCPR 146
G+PLGG GAG IGRS RG+F + + G + A QFSVF S + + L
Sbjct: 40 HGMPLGGFGAGCIGRSSRGDFNLWHIDGGEHLFQNIPACQFSVFESHGTSSQ-AYALATT 98
Query: 147 SPGVPKKNTDSGIESWDW----NLKGENCTYHALFPRAWTVYDGEPDPELRIVCRQISPF 202
+P D +++W W E YHAL+PR+W VY E + ++ C Q SP
Sbjct: 99 AP------EDGSLQAWQWYPASTATQETGAYHALYPRSWFVY--ENVLQAQLTCEQFSPV 150
Query: 203 IPHNYKESSFPASVFTFTLSNSGQTSADVTLLFTWANSVA-------------GDSGLSG 249
HNY+E+S+P +VF + N ++++ TW N D G
Sbjct: 151 YAHNYQETSYPVAVFVWKAHNPTNAPITLSIMLTWQNMTGWFTNALKSPEVKVRDDGSPV 210
Query: 250 HHFNSKTMTKDGVHGLTLHHRTANG----RPPVTFAVAAEETADVHVSE----CPCFLLS 301
+ + + G + + + G R +T + E + V++ F +
Sbjct: 211 YEYQPRLGESQGNYNYLVESPQSLGCFLGRVGITEPLQEGEGSWCIVTQKHPQVEVFHHT 270
Query: 302 GNSKGITAKDMWNEIKKHGSF-DHLDNDKTSPSEPGSSIGAAIAASLTIPSGSTRSVTFS 360
+ ++W GS D++D T+P+ IG AIA T+ G + F
Sbjct: 271 RWNAAGKGDEIWQSFAADGSLPDYID---TTPAGENEQIGTAIAVRFTLQPGENLEIPFV 327
Query: 361 LAWDCPEVKFFEKV-YHRRYTKFYGTLGDSAARIAHDAILEHAKWECEIEAWQRPILEDK 419
LAWD P +F V YHRRYT F+GT G++A IA A+ ++ W+ +I+AWQ+PI+E +
Sbjct: 328 LAWDFPVTEFAAGVNYHRRYTDFFGTNGNNAWAIATLALEQYQTWQQQIQAWQQPIIERE 387
Query: 420 RFPEWYPITLFNELYYLNAGGTIWTDGSPPMQSLATIRERKFSLDTPSKIHTPASSDTAL 479
P W + LFNELY L +GGT+W S AT R+
Sbjct: 388 DLPNWLKMALFNELYDLTSGGTLW--------SAATERDP-------------------- 419
Query: 480 GTRLLENGEENIGQFLYLEGAEYVMYNTYDVHFYSSFALVMLFPKLELSIQRDFAAAVMM 539
IGQF LE +Y Y + DV Y SF LV+LFP+LE S+ R FA A+
Sbjct: 420 -----------IGQFAVLECLDYRWYESLDVRLYGSFGLVLLFPELEKSVMRAFARAIPQ 468
Query: 540 HD--PGTM----KIMSDGKWVARKCLGAVPHDIGLDDP--WFEINSYNLFNSSRWKDLNS 591
D P + I ++ RK +GA PHD+G + W + N + + + WKDL
Sbjct: 469 GDDTPRVIGYYYTIGAESPVAVRKAVGATPHDLGAPNEHVWQKTNYTSYQDCNLWKDLGC 528
Query: 592 KFVLQVYRDFVATG--DKNFARAVWPSVYIAMAYMEQFDKDGDGMIENEGFPDQTYDAWS 649
FVLQVYRD++ TG D F + W ++ + Y++ FD DGDG+ EN G PDQT+D W
Sbjct: 529 DFVLQVYRDYLFTGANDVEFLQDCWDAIVQTLDYVKTFDLDGDGIPENSGAPDQTFDDWR 588
Query: 650 ANGVSAYCGGLWVAALQAASALANDVGDH-------ASASYFWVRYQKAKAVY-DSLWNG 701
GVSAYCGGLW+AAL+AA A++ + H S + +K+ +Y + LWNG
Sbjct: 589 LQGVSAYCGGLWMAALEAAIAISEILSQHQPDSALLKQKSIYATWLEKSLPIYQEKLWNG 648
Query: 702 SYFNYDNSDGSSSTSIQADQLAGQWYARACGLLPIADEAKVKKALTKIYDFNVLKVKGGM 761
Y+ D++ GS + ADQL GQ+YA L I + AL +YD LK G
Sbjct: 649 QYYRLDSNSGSD--VVMADQLCGQFYAALLNLPDIVPGDRAISALRTVYDACFLKFYNGQ 706
Query: 762 CGAMNGMQPDGRIDM-SGLQAREIWPGVTYGLAASMIQEEMVDMAFQTAAGVYEVAWSED 820
GA NG++PDG + E+W G+ +GLAA ++Q M D A + V + +S
Sbjct: 707 FGAANGVRPDGSPENPQATHPLEVWTGINFGLAAFLVQMGMKDEALRLTEAVVQQIYSN- 765
Query: 821 GLGYSFQTPESWNNNDEYRSLCYMRPLTIWAMQWALTK 858
G F+TPE+ + +R+ Y+R + IW + +++
Sbjct: 766 --GLQFRTPEAITPSGTFRASTYLRAMAIWGVYLVMSE 801
>gi|428210868|ref|YP_007084012.1| putative bile acid beta-glucosidase [Oscillatoria acuminata PCC
6304]
gi|427999249|gb|AFY80092.1| putative bile acid beta-glucosidase [Oscillatoria acuminata PCC
6304]
Length = 820
Score = 377 bits (968), Expect = e-101, Method: Compositional matrix adjust.
Identities = 270/831 (32%), Positives = 380/831 (45%), Gaps = 127/831 (15%)
Query: 87 QGIPLGGIGAGSIGRSYRGEFQRFKLFHGICDDAPVLANQFSVFVSRPNGEKFSSVLCPR 146
G+PLGG GAG IGRS RG+F + L G + A QFS+F N ++ + LC
Sbjct: 41 HGMPLGGFGAGCIGRSPRGDFNLWHLDGGEHTFQSLPACQFSIFEETQN-QRHAYALCTE 99
Query: 147 SPGVPKKNTDSGIESWDW----NLKGENCTYHALFPRAWTVYDGEPDPELRIVCRQISPF 202
P +D + SW W + TYHAL+PR+W VY +L C Q SP
Sbjct: 100 PP------SDGSLNSWQWYPASTEENPTGTYHALYPRSWFVYQNVLKAQL--TCEQFSPI 151
Query: 203 IPHNYKESSFPASVFTFTLSNSGQTSADVTLLFTWANSVA-------------GDSGLSG 249
+P Y+E S+P ++F +T N +++L TW N V D G
Sbjct: 152 LPGKYQECSYPLAIFEWTAYNPTDEPITLSILLTWENVVGWFTNAIKSPTVKVRDDGSPV 211
Query: 250 HHFNSKTMTKDG---------------VHGLTLHHRTANGRPPVTFAVAAEETADVHVSE 294
+ + SK G + G L G + A A D +
Sbjct: 212 YEYQSKWGQSQGNSNRYIEDFHRIGCIMAGGHLGEVPQEGEGELAIATATNPFLDTFKNT 271
Query: 295 CPCFLLSGNSKGITAKDMWNEIKKHGSFDHLDNDKTSPSEPGSSIGAAIAASLTIPSGST 354
N G + K++W + GS LD +P+ G IG A+A TI G T
Sbjct: 272 ------RWNPTG-SGKEVWEQFSHDGSL--LDCTDETPAAKGERIGVALAVRFTIRPGKT 322
Query: 355 RSVTFSLAWDCPEVKFFEKVY-HRRYTKFYGTLGDSAARIAHDAILEHAKWECEIEAWQR 413
R + F LAWD P +F V+ +RRYT F+G G ++ + A+ W I WQ+
Sbjct: 323 RKIPFILAWDFPVTEFAAGVWEYRRYTDFFGRHGKNSWSMIRTALKHSEMWRENIITWQQ 382
Query: 414 PILEDKRFPEWYPITLFNELYYLNAGGTIWTDGSPPMQSLATIRERKFSLDTPSKIHTPA 473
IL+ + + + LFNELY L GGT+WT
Sbjct: 383 SILDREDLSDSLKMALFNELYILTDGGTLWT----------------------------- 413
Query: 474 SSDTALGTRLLENGEENIGQFLYLEGAEYVMYNTYDVHFYSSFALVMLFPKLELSIQRDF 533
++ EE G+F LE +Y Y + DV Y SFAL+ML+P+LE S+ R F
Sbjct: 414 ----------AQDAEEPRGKFAVLECIDYRWYESLDVRLYGSFALLMLWPELEKSVMRGF 463
Query: 534 AAAVMMHDPGTMKIMSDGKWVARKCLGAVPHDIGL--DDPWFEINSYNLFNSSRWKDLNS 591
A A+ D I + RK A PHD+G + PW + N + + + WKDL
Sbjct: 464 ARAIPTGDDTPRIIGYNQAAAIRKVTDATPHDLGAPNEHPWVKTNYTSYQDCNLWKDLPC 523
Query: 592 KFVLQVYRDFVATG--DKNFARAVWPSVYIAMAYMEQFDKDGDGMIENEGFPDQTYDAWS 649
F L VYRDF+ TG D F WPS+ A+AY++ FD DGDG+ EN G PDQT+D W
Sbjct: 524 DFALLVYRDFLLTGGDDYEFLWDCWPSMVQAIAYLKTFDTDGDGIPENGGAPDQTFDDWP 583
Query: 650 ANGVSAYCGGLWVAAL--------------------------QAASALANDVGDHASASY 683
GVSAYCGGLW+AAL Q S L A+ S
Sbjct: 584 MRGVSAYCGGLWIAALQAAIAIGEVILNNPPHSGVFTDTIYEQKRSQLLEPATIQATLST 643
Query: 684 FWVRYQKAKAVYD-SLWNGSYFNYDNSDGSSSTSIQADQLAGQWYARACGLLPIADEAKV 742
+ ++++ +Y LWNG+Y+ D+ GS + ADQL GQ+YAR GL I
Sbjct: 644 YQDWLEQSRPLYQHKLWNGTYYRLDSESGSEV--VMADQLCGQFYARLLGLADIVPIDCA 701
Query: 743 KKALTKIYDFNVLKVKGGMCGAMNGMQPDGR-IDMSGLQAREIWPGVTYGLAASMIQEEM 801
+ AL +Y+ LK G GA NG+ DG ++ E+W G+ +GLAA ++Q M
Sbjct: 702 ESALKTVYESCFLKFHNGEFGAANGVLIDGSPVNPDATHPLEVWTGINFGLAAFLLQMGM 761
Query: 802 VDMAFQTAAGVYEVAWSEDGLGYSFQTPESWNNNDEYRSLCYMRPLTIWAM 852
+ F+ V + + E+GL F+TPE+ +R+ Y+R + IWA+
Sbjct: 762 KEEGFKLTETVIQQVY-ENGL--QFRTPEAITAVGTFRASHYLRAMAIWAV 809
>gi|113477064|ref|YP_723125.1| hypothetical protein Tery_3568 [Trichodesmium erythraeum IMS101]
gi|110168112|gb|ABG52652.1| protein of unknown function DUF608 [Trichodesmium erythraeum
IMS101]
Length = 816
Score = 375 bits (964), Expect = e-101, Method: Compositional matrix adjust.
Identities = 277/827 (33%), Positives = 394/827 (47%), Gaps = 120/827 (14%)
Query: 87 QGIPLGGIGAGSIGRSYRGEFQRFKLFHG--ICDDAPVLANQFSVFVSRPNGEKFSSVLC 144
G+PLGG GAG+ GRS RG+F + L G I + P A QFSVF G+K + L
Sbjct: 41 HGMPLGGFGAGATGRSPRGDFNLWHLDGGEHIYQNLP--ACQFSVF-EETKGQKQAYALS 97
Query: 145 PRSPGVPKKNTDSGIESWDWNLKGE----NCTYHALFPRAWTVYDGEPDPELRIVCRQIS 200
P TD + +W W + + TYHAL+PR+W VY+ +L C Q S
Sbjct: 98 TELP------TDGSLSAWQWYPREKAGLKTGTYHALYPRSWFVYENVFTAQLS--CEQFS 149
Query: 201 PFIPHNYKESSFPASVFTFTLSNSGQTSADVTLLFTWANSVA-------------GDSGL 247
P + NY+E+S+P ++F +T N + ++++ +W N+V D G
Sbjct: 150 PILAGNYQETSYPIAIFEWTAHNPTDEAITLSIMLSWQNTVGWFTNSVKTPEVRVRDDGS 209
Query: 248 SGHHFNSKTMTKDGVHGLTLHHRTANGRPPVTFAVAAEETADVHVSECPCFLLSG----- 302
+ + + L + G ++A E F+ +G
Sbjct: 210 PVYEYKPRWGESTDNFNLLVEDFHRIGCTMTKLSIADEPAEGEGQMAIATFINAGMEVFY 269
Query: 303 ----NSKGITAKDMWNEIKKHGSFDHLDNDKTSPSEPGSSIGAAIAASLTIPSGSTRSVT 358
N G T +D+W+ GS +D + P+ G IG AI+ TI G R +
Sbjct: 270 HTRWNPTG-TGEDIWHYFALDGSL--IDEENELPATEGEQIGVAISVRFTIRPGKNRKIP 326
Query: 359 FSLAWDCPEVKFFEKV-YHRRYTKFYGTLGDSAARIAHDAILEHAKWECEIEAWQRPILE 417
F L WD P F V Y+RRYT FYG G +A + A+ + W IEAWQ PIL+
Sbjct: 327 FFLVWDLPVTDFGNGVSYYRRYTDFYGRNGKNAWSMIRTAMKHYQTWRENIEAWQNPILQ 386
Query: 418 DKRFPEWYPITLFNELYYLNAGGTIWTDGSPPMQSLATIRERKFSLDTPSKIHTPASSDT 477
+ P W + L NELY L +GGTIW A+SD+
Sbjct: 387 REDLPNWLKMALLNELYDLTSGGTIWA----------------------------AASDS 418
Query: 478 ALGTRLLENGEENIGQFLYLEGAEYVMYNTYDVHFYSSFALVMLFPKLELSIQRDFAAAV 537
A GQF LE +Y Y + DV Y SF L++L+P+LE S+ FA AV
Sbjct: 419 A-----------PYGQFAVLECLDYRWYESLDVRLYGSFGLLILWPELEKSVLVAFARAV 467
Query: 538 MMHDPGTMKIMSDGKWVA-RKCLGAVPHDIGL--DDPWFEINSYNLFNSSRWKDLNSKFV 594
D T++I+ + A RK GA PHD+G + PW N + + ++WKDL+S FV
Sbjct: 468 STAD-DTLRIIGYNQVSAVRKVAGATPHDLGAPNEHPWEMTNYTSYQDCNQWKDLSSDFV 526
Query: 595 LQVYRDFVATG--DKNFARAVWPSVYIAMAYMEQFDKDGDGMIENEGFPDQTYDAWSANG 652
LQVYRDF+ TG D F W ++ +AY++ FDKD DG+ ENEG PDQT+D W G
Sbjct: 527 LQVYRDFLLTGADDYEFLWQSWSAITETLAYLKGFDKDNDGIPENEGAPDQTFDDWQLRG 586
Query: 653 VSAYCGGLWVAALQAASALANDVGDHASASYFW---------------------VRYQ-- 689
VSAYCGGLW+AAL+AA A+ + +H ++ YQ
Sbjct: 587 VSAYCGGLWLAALEAAIAIGKVLIEHPREIPYYPPKGFYSEVDKNSVDAINNQVYLYQGW 646
Query: 690 --KAKAVY-DSLWNGSYFNYDNSDGSSSTSIQADQLAGQWYARACGLLPIADEAKVKKAL 746
K +Y + LWNG Y+ D+ S+S + ADQL+GQ+YA+ L I AL
Sbjct: 647 LKKGLPIYQEKLWNGEYYRLDSE--SNSEVVMADQLSGQFYAKLLNLEDIVPAECALSAL 704
Query: 747 TKIYDFNVLKVKGGMCGAMNGMQPDGRIDM-SGLQAREIWPGVTYGLAASMIQEEMVDMA 805
+Y+ G GA NG+ PDG + + E+W G+ +GLAA M+Q M A
Sbjct: 705 KTVYNSCFKNFHNGKFGAANGVLPDGSPENPNATHPLEVWTGINFGLAAFMVQIGMKKEA 764
Query: 806 FQTAAGVYEVAWSEDGLGYSFQTPESWNNNDEYRSLCYMRPLTIWAM 852
+ V + E+GL F+TPE+ +R+ Y+R + IWA+
Sbjct: 765 LEITEVVVGQIY-ENGL--QFRTPEAITVMGTFRASHYLRAMAIWAI 808
>gi|434389621|ref|YP_007100232.1| putative bile acid beta-glucosidase [Chamaesiphon minutus PCC 6605]
gi|428020611|gb|AFY96705.1| putative bile acid beta-glucosidase [Chamaesiphon minutus PCC 6605]
Length = 818
Score = 375 bits (962), Expect = e-101, Method: Compositional matrix adjust.
Identities = 260/833 (31%), Positives = 392/833 (47%), Gaps = 128/833 (15%)
Query: 87 QGIPLGGIGAGSIGRSYRGEFQRFKLFHGICDDAPVLANQFSVFVSRPNGEKFSSVLCPR 146
G+PLGG GAG IGRS+RGEF + + G + A QFSVF NG+ + + +
Sbjct: 41 HGMPLGGFGAGCIGRSHRGEFNLWHIDGGEHVFKNLSACQFSVFEQLANGDSQAYAMSTQ 100
Query: 147 SPGVPKKNTDSGIESWDW----NLKGENC-TYHALFPRAWTVYDGEPDPELRIVCRQISP 201
+PG D + W W G++ +Y AL+PR+W Y E + ++ C Q SP
Sbjct: 101 APG------DGSLNRWQWYPQTTPTGQSTGSYQALYPRSWYSY--ENVFQAQLTCEQFSP 152
Query: 202 FIPHNYKESSFPASVFTFTLSNSGQTSADVTLLFTWANSVA-------------GDSGLS 248
NY+E+S+P +VF +T N ++++ TW N+V D G
Sbjct: 153 IWADNYQETSYPIAVFEWTAHNPTDKPITLSIMLTWENTVGWFANTSPSSKVEIRDDGSP 212
Query: 249 GHHFNSKTMTKDGVHG------------LTLHHRT---ANGRPPVTFAVAAEETADVHVS 293
+ + + +G L H++ + G + A A + +V
Sbjct: 213 VYDYQPRIGESEGNFNQWIVDNYRVGCLLDKAHKSDIPSEGEGQWSIASIANPSLEV--- 269
Query: 294 ECPCFLLSGNSKGITAKDMWNEIKKHGSFDHLDNDKTSPSEPGSSIGAAIAASLTIPSGS 353
F S + ++W+ GS +D++ S ++ I AI T+ G
Sbjct: 270 ----FYHSRWNPIGNGGEIWDSFAADGSLPDIDDE--SEAKQRERIATAITIRFTVKPGQ 323
Query: 354 TRSVTFSLAWDCPEVKFFEKV-YHRRYTKFYGTLGDSAARIAHDAILEHAKWECEIEAWQ 412
T+ + F LAWD P +F + + Y+RRYT F+G G+++ + A+ W+ +I WQ
Sbjct: 324 TKKIPFILAWDFPVTEFGKGINYYRRYTDFFGRSGENSWAMVRTALKHSDTWQEQIINWQ 383
Query: 413 RPILEDKRFPEWYPITLFNELYYLNAGGTIWTDGSPPMQSLATIRERKFSLDTPSKIHTP 472
+PIL+ P+W+ + LFNELY L GGTIW+
Sbjct: 384 QPILDRSDLPDWFKMALFNELYLLTDGGTIWS---------------------------- 415
Query: 473 ASSDTALGTRLLENGEENIGQFLYLEGAEYVMYNTYDVHFYSSFALVMLFPKLELSIQRD 532
A+ D + +GQF LE +Y Y + DV Y SF++++L+PKLE SI
Sbjct: 416 AADD-----------RDPVGQFGVLECLDYRWYESLDVRLYGSFSVLLLWPKLEKSILIA 464
Query: 533 FAAAVMMHDPGT-----------------MKIMSDGKWVARKCLGAVPHDIGL--DDPWF 573
FA A+ D + D RK L A PHD+G + PW
Sbjct: 465 FARAIDTEDTNKRLIGYYVQTAFGYEDRPQNVDRDAMMAPRKTLHATPHDLGAPNEHPWE 524
Query: 574 EINSYNLFNSSRWKDLNSKFVLQVYRDFVATGDKN--FARAVWPSVYIAMAYMEQFDKDG 631
+ N + + WKDL S FVLQVYR ++ TG K+ F W ++ +AY++QFD D
Sbjct: 525 KTNYTCYQDCNLWKDLGSDFVLQVYRAYIFTGKKDLEFLAECWTAIVDTLAYLKQFDLDN 584
Query: 632 DGMIENEGFPDQTYDAWSANGVSAYCGGLWVAALQAASALANDVGDH------ASASYFW 685
DG+IEN G PDQT+D W G+SAYCGGLW+AAL+AA ++ N + +
Sbjct: 585 DGIIENSGAPDQTFDDWRLQGISAYCGGLWLAALEAAISIGNILTQQYHGELTPKPAEVI 644
Query: 686 VRYQ----KAKAVY-DSLWNGSYFNYDNSDGSSSTSIQADQLAGQWYARACGLLPIADEA 740
+YQ K +Y LWNGSY+ D GS S + ADQL GQ+Y + L I
Sbjct: 645 PQYQAWLDSGKPIYQQQLWNGSYYRIDT--GSGSEVVMADQLCGQFYTQLLELDDIVPPD 702
Query: 741 KVKKALTKIYDFNVLKVKGGMCGAMNGMQPDGR-IDMSGLQAREIWPGVTYGLAASMIQE 799
K AL +Y L +GG GA NG++ DG + E+W G+ +G+AA ++Q
Sbjct: 703 CAKTALQTVYQSCFLNFQGGKFGAANGVKIDGSPSNPKDTHPLEVWTGINFGIAAFLVQM 762
Query: 800 EMVDMAFQTAAGVYEVAWSEDGLGYSFQTPESWNNNDEYRSLCYMRPLTIWAM 852
M + A + A + E +S G F+TPE+ + +R+ Y+R + IW +
Sbjct: 763 GMQEEALKLAETIVEQVYSN---GLQFRTPEAITSAGTFRASHYLRAMAIWGI 812
>gi|428299644|ref|YP_007137950.1| glucosylceramidase [Calothrix sp. PCC 6303]
gi|428236188|gb|AFZ01978.1| Glucosylceramidase [Calothrix sp. PCC 6303]
Length = 804
Score = 374 bits (961), Expect = e-101, Method: Compositional matrix adjust.
Identities = 263/823 (31%), Positives = 381/823 (46%), Gaps = 117/823 (14%)
Query: 87 QGIPLGGIGAGSIGRSYRGEFQRFKLFHGICDDAPVLANQFSVFVSRPNGEKFSSVLCPR 146
G+PLGG GAG IGRS+ G+F + + G + A QFS+F S + ++
Sbjct: 40 HGMPLGGFGAGCIGRSHGGDFNFWHIDGGEHVFKNIPACQFSLFESVGESSQAYAL---- 95
Query: 147 SPGVPKKNTDSGIESWDW----NLKGENCTYHALFPRAWTVYDGEPDPELRIVCRQISPF 202
D ++SW + + + TYHAL+PR+W Y G + I C+Q SP
Sbjct: 96 ---ATAAPADGTLQSWKYYPQSSPEQNTGTYHALYPRSWFEYGGVY--QANITCQQFSPI 150
Query: 203 IPHNYKESSFPASVFTFTLSNSGQTSADVTLLFTWANSVA-------------GDSGLSG 249
NY+ESS+P +VF + L N ++++ TW N V D G
Sbjct: 151 WADNYQESSYPVAVFLWNLHNPTNAPITLSIMLTWENMVGWFTNTSKSPEVIVRDDGSPV 210
Query: 250 HHFNSKTMTKDGVHGLTLHHRTANGRPPVTFAVAAEETADVHVSECP---CFLLSG---- 302
+ + + G + NG T V + V E CFL S
Sbjct: 211 YEYQPRLGQSQG----NFNEIVDNGN---TIGVVLSGISGEVVQEGDGQWCFLTSKQAGV 263
Query: 303 --------NSKGITAKDMWNEIKKHGSFDHLDNDKTSPSEPGSSIGAAIAASLTIPSGST 354
N G +++W GS + + + E + A+A LT+ G T
Sbjct: 264 EIFSHSRWNPTG-NGEEIWQSFAADGSLSNYQDATPATDE---RLAGALAVRLTLQPGET 319
Query: 355 RSVTFSLAWDCPEVKFFEKV-YHRRYTKFYGTLGDSAARIAHDAILEHAKWECEIEAWQR 413
V F L WD P +F V Y+RRYT F+G+ G +A +IA A+ ++ W +I+ WQ+
Sbjct: 320 LEVPFVLTWDFPVTEFAAGVNYNRRYTDFFGSNGKNALQIAETALEQYRDWLEKIQTWQK 379
Query: 414 PILEDKRFPEWYPITLFNELYYLNAGGTIWTDGSPPMQSLATIRERKFSLDTPSKIHTPA 473
PIL P+W + LFNELY L AGGT+W+ SP
Sbjct: 380 PILNRHDLPDWMKMALFNELYDLTAGGTLWSAASP------------------------- 414
Query: 474 SSDTALGTRLLENGEENIGQFLYLEGAEYVMYNTYDVHFYSSFALVMLFPKLELSIQRDF 533
++ +GQF LE +Y Y + DV Y SF L+MLFP+LE ++ R F
Sbjct: 415 --------------KDPVGQFAVLECLDYRWYESLDVRLYGSFGLLMLFPELEKAVMRAF 460
Query: 534 AAAVMMHDPGT------MKIMSDGKWVARKCLGAVPHDIGLDDP--WFEINSYNLFNSSR 585
A A+ D T I ++ RK GA PHD+G + W + N + + +
Sbjct: 461 ARAIPASDEKTRVIGYYYTIGAESPLAIRKAAGATPHDLGAPNEHVWEKTNYTSYQDCNL 520
Query: 586 WKDLNSKFVLQVYRDFVATGDKN--FARAVWPSVYIAMAYMEQFDKDGDGMIENEGFPDQ 643
WKDL S FVLQVYRDFV TG K+ F W ++ + + Y++ FDKD DG+ EN G PDQ
Sbjct: 521 WKDLGSDFVLQVYRDFVFTGSKDIEFLAECWDAIALTLEYLKAFDKDHDGIPENSGAPDQ 580
Query: 644 TYDAWSANGVSAYCGGLW----VAALQAASALANDVGDHASASYFWVRYQ----KAKAVY 695
T+D W GVSAYCGGLW AA+ L + GD + + Y K++ +Y
Sbjct: 581 TFDDWKLQGVSAYCGGLWLAALEAAIAICDILLENHGDQEKLTAQKIIYTDWLAKSRPIY 640
Query: 696 DS-LWNGSYFNYDNSDGSSSTSIQADQLAGQWYARACGLLPIADEAKVKKALTKIYDFNV 754
LWNG Y+ D+ GS + ADQL GQ+YAR L I A+ +Y+
Sbjct: 641 QQKLWNGQYYRLDSESGSD--VVMADQLCGQFYARLLKLADIVPHDCAISAVRTVYEACF 698
Query: 755 LKVKGGMCGAMNGMQPDGRIDM-SGLQAREIWPGVTYGLAASMIQEEMVDMAFQTAAGVY 813
+ G GA NG++PDG + E+W G+ +GLAA ++Q M D A++ V
Sbjct: 699 INFNDGKFGAANGLRPDGSPENPKATHPLEVWTGINFGLAAFLMQMGMQDEAWKLTEVVV 758
Query: 814 EVAWSEDGLGYSFQTPESWNNNDEYRSLCYMRPLTIWAMQWAL 856
+ + G F+TPE+ +R+ Y+R + IW + L
Sbjct: 759 QQIYDN---GLQFRTPEAITATGTFRACTYLRAMAIWGIYLVL 798
>gi|427712531|ref|YP_007061155.1| putative bile acid beta-glucosidase [Synechococcus sp. PCC 6312]
gi|427376660|gb|AFY60612.1| putative bile acid beta-glucosidase [Synechococcus sp. PCC 6312]
Length = 799
Score = 374 bits (961), Expect = e-100, Method: Compositional matrix adjust.
Identities = 261/817 (31%), Positives = 382/817 (46%), Gaps = 117/817 (14%)
Query: 87 QGIPLGGIGAGSIGRSYRGEFQRFKLFHGICDDAPVLANQFSVFVSRPNGEKFSSVLCPR 146
G+PLGG GAG IGRS RG+F + L G + QFSVF + + LC
Sbjct: 38 HGMPLGGFGAGCIGRSSRGDFNLWHLDGGEHIFQTMPGCQFSVF-EQAGDHSQAYALCTE 96
Query: 147 SPGVPKKNTDSGIESWDWNLKGENCTYHALFPRAWTVYDGEPDPELRIVCRQISPFIPHN 206
P +D + +W+W + TY AL+PR+W Y G + + C Q SP IP N
Sbjct: 97 PP------SDGTLGAWNWYPTAKG-TYGALYPRSWFQYQGVF--QAGLTCEQFSPIIPDN 147
Query: 207 YKESSFPASVFTFTLSNSGQTSADVTLLFTWANSVAGDS--------------------- 245
Y+E+S+P +VF + N ++++ +W N V S
Sbjct: 148 YQETSYPVAVFVWAAHNPTDQPITLSIMLSWQNLVGWFSNATPASEIRIRDDGSPVYDYT 207
Query: 246 ---GLSGHHFNSKTMTKDGVHGLTLHHRTAN-----GRPPVTFAVAAEETADVHVSECPC 297
G S ++NS ++ L R + G F+V + ++
Sbjct: 208 PRLGQSQGNYNSFHQSQTQASCLLQRQRDPHTAIKEGEGEWAFSVPVRDDLEIF------ 261
Query: 298 FLLSGNSKGITAKDMWNEIKKHGSFDHLDNDKTSPSEPGSSIGAAIAASLTIPSGSTRSV 357
+ N G ++W+ + GS +L+N++ P+ IGA +A T+ G ++ +
Sbjct: 262 YHTRWNPVG-DGGELWSSFSRDGSLSNLENNQ--PASDIEQIGATLAVRFTLAPGESKEI 318
Query: 358 TFSLAWDCPEVKFFEKV-YHRRYTKFYGTLGDSAARIAHDAILEHAKWECEIEAWQRPIL 416
F LAWD P +F V Y RRYT F+ G +A IA A+ + KW +I AWQ+PIL
Sbjct: 319 PFGLAWDLPVTEFAPNVNYFRRYTDFWNRSGTNAETIAQVALENYQKWSEQIGAWQQPIL 378
Query: 417 EDKRFPEWYPITLFNELYYLNAGGTIWTDGSPPMQSLATIRERKFSLDTPSKIHTPASSD 476
E +P+ + + L NELY L +GG++WT S LD
Sbjct: 379 EHPDWPDDFKMALINELYDLTSGGSLWTAAS--------------ELDP----------- 413
Query: 477 TALGTRLLENGEENIGQFLYLEGAEYVMYNTYDVHFYSSFALVMLFPKLELSIQRDFAAA 536
+GQ+ LE +Y Y + DV Y SFAL+ L+P+LE + R FA A
Sbjct: 414 --------------LGQYAVLECLDYRWYESLDVRLYGSFALLYLWPELEKGVMRAFARA 459
Query: 537 VMMHDPGTMKIM--------SDGKWVARKCLGAVPHDIGL--DDPWFEINSYNLFNSSRW 586
+ DP ++I+ RK A PHD+G + PW N + + + W
Sbjct: 460 IPTEDP-KLRIIGYYYQGDPETAHRAPRKIAHATPHDLGAPNEHPWQASNYTSYQDCNLW 518
Query: 587 KDLNSKFVLQVYRDFVATG--DKNFARAVWPSVYIAMAYMEQFDKDGDGMIENEGFPDQT 644
KDL S FVL VYRD++ TG D F WP++ ++ Y++ FD DGDG+ EN G PDQT
Sbjct: 519 KDLASDFVLLVYRDYLLTGATDLEFLTDCWPAIVSSLDYLKTFDTDGDGLPENSGAPDQT 578
Query: 645 YDAWSANGVSAYCGGLW-------VAALQAASALANDVGDHASASYFWVRYQKAKAVYDS 697
YD W GVSAYCGGLW +A Q N+ W+ +++ Y +
Sbjct: 579 YDDWQLKGVSAYCGGLWIAALEAALAIAQILEKQGNNTKLEQEKWQAWL--SQSRPTYQA 636
Query: 698 -LWNGSYFNYDNSDGSSSTSIQADQLAGQWYARACGLLPIADEAKVKKALTKIYDFNVLK 756
LWNG ++ D+ S+S + ADQL GQ+YAR L + + AL +YD LK
Sbjct: 637 KLWNGQFYRLDSD--SNSQVVMADQLCGQYYARLLNLPDVVPIECAESALKTVYDVCFLK 694
Query: 757 VKGGMCGAMNGMQPDGRIDM-SGLQAREIWPGVTYGLAASMIQEEMVDMAFQTAAGVYEV 815
G GA NG+ PDG + + E+W G+ YGLAA + Q + D A++ V
Sbjct: 695 FHEGQFGAANGLLPDGSPEKPNATHPLEVWIGINYGLAAFLYQMGLEDGAWRLTQTVIRQ 754
Query: 816 AWSEDGLGYSFQTPESWNNNDEYRSLCYMRPLTIWAM 852
+ G F+TPE+ +R+ Y+RP+ IWAM
Sbjct: 755 IYDN---GLQFRTPEAITAVGTFRACMYLRPMAIWAM 788
>gi|366162568|ref|ZP_09462323.1| hypothetical protein AcelC_02756 [Acetivibrio cellulolyticus CD2]
Length = 842
Score = 371 bits (953), Expect = e-99, Method: Compositional matrix adjust.
Identities = 258/777 (33%), Positives = 388/777 (49%), Gaps = 103/777 (13%)
Query: 88 GIPLGGIGAGSIGRSYRGEFQRFKLFHGICDDAPVLANQFSVFVSRPNGEKFSSVLCPRS 147
G P+GG GAG+I + G F + +L G + + F ++ +P+G+ S
Sbjct: 70 GAPIGGFGAGTITWKFDGSFTKGRLDIGSNELSSDADCAFYMY-QKPSGQNIS------- 121
Query: 148 PGVPKKNTDSGIESWDWNLKGENCTYHALFPRAWTVYDGEPDPELRIVCRQISPFIPHNY 207
K D+G ++ TY++LFP++W Y G + + Q SP IP NY
Sbjct: 122 ----MKKLDAG------SIGAGQATYYSLFPKSWVDYSGS-NFYCKAKVTQFSPIIPGNY 170
Query: 208 KESSFPASVFTFTLSNSGQTSADVTLLFTWANSVAGDSGLSGHHFNSK--TMTKDGVHGL 265
+ +S+P V+ + ++N G +S DV ++ +W N G NSK + ++G
Sbjct: 171 ETTSYPVGVYKWEITNPGSSSCDVAIMLSWKNEFGGSFAEGQTSGNSKGIILRRNGTSNA 230
Query: 266 TLHHRTANGRPPVTFAVAAEETADVHVSECPCFLLSGNSKGITAKDMWNEIKKHGSFDHL 325
T+ ++ F++AAE+ V V+ G++ G + + D L
Sbjct: 231 TMENQGE-------FSLAAEQKDGVKVTY-------GSASGTST------LSSDFGTDGL 270
Query: 326 DNDKTSPSEPGSSIGAAIAASLTIPSGSTRSVTFSLAWDCPEVKFFE-KVYHRRYTKFYG 384
D GS+ A+A T+ G T +V +L+WD P + ++R+YT+FYG
Sbjct: 271 LGDTV-----GSNNIGAVAFKATLEPGQTITVPVALSWDIPISQAGSGSKWYRKYTRFYG 325
Query: 385 TLGDSAARIAHDAILEHAKWECEIEAWQRPILEDKRFPEWYPITLFNELYYLNAGGTIWT 444
G ++ +IA DAI ++ WE I++WQ +L D ++P+W T+FNELYY GGT W
Sbjct: 326 RSGLNSWKIAKDAIDNYSSWESSIDSWQNSVLNDTKYPQWLKTTMFNELYYYFTGGTYWE 385
Query: 445 DGSPPMQSLATIRERKFSLDTPSKIHTPASSDTALGTRLLENGEENIGQFLYLEGAEYVM 504
G+ Q +N +E++ F +LE +Y
Sbjct: 386 AGAASGQK--------------------------------DNPDEDM--FSHLECYKYDF 411
Query: 505 YNTYDVHFYSSFALVMLFPKLELSIQRDFAAAVMMHDPGTMKIMSDGKWVARKCLGAVPH 564
Y T DV FY S+ L++++P ++ R FA +V + T K +G H
Sbjct: 412 YGTSDVRFYGSWPLILMWPDIDKQAVRQFADSV--YHTRTDK---------PAAIGTTAH 460
Query: 565 DIGLDDPWFEI-NSYNLFNSSRWKDLNSKFVLQVYRDFVATG--DKNFARAVWPSVYIAM 621
DIG + FE N+Y +S+ WKDLNSK VL VYRD+ TG D +F + W V AM
Sbjct: 461 DIGNANNVFETWNAYVYRDSTNWKDLNSKLVLMVYRDWALTGKTDMDFLKYCWIPVQKAM 520
Query: 622 AYMEQFDKDGDGMIENEGFPDQTYDAWSANGVSAYCGGLWVAALQAASALANDVGDHASA 681
++ D DGD + + G DQTYD G +AYCGGL+VAA AA +A +GD A
Sbjct: 521 DKVKSQDSDGDYLPNSSGI-DQTYDDMQLQGNTAYCGGLFVAACNAAKEIATAMGDSTKA 579
Query: 682 SYFWVRYQKAKAVYDS-LWNGSYFNYDNSDGSSSTS-IQADQLAGQWYARACGLLPIADE 739
+ + Y+K+K Y + LWNGSYF D GS TS I +DQL G WYA+ACGL I +
Sbjct: 580 TQYQSWYEKSKENYIAKLWNGSYFKIDT--GSYDTSRIMSDQLCGHWYAKACGLEGIVPD 637
Query: 740 AKVKKALTKIYDFNVLKVKGGMCGAMNGMQPDGRIDMSGLQAREIWPGVTYGLAASMIQE 799
KK+ +K+Y+FN K G+ G +N M+ DG +DMS Q E W G +YG+ A MIQE
Sbjct: 638 EYAKKSYSKVYEFNNKKFDNGLHGFVNIMKADGNVDMSHAQTAEAWVGTSYGVIAGMIQE 697
Query: 800 EMVDMAFQTAAGVYEVAWSEDGLGYSFQTPESWNNN-DEYRSLCYMRPLTIWAMQWA 855
+ A Q + + + W + + F+TPE+W E R+ YMR IWA++ A
Sbjct: 698 GLETEASQVGSNLADTVWKTNDMW--FRTPEAWRQGVSEVRAPYYMRANCIWAVKHA 752
>gi|290990718|ref|XP_002677983.1| predicted protein [Naegleria gruberi]
gi|284091593|gb|EFC45239.1| predicted protein [Naegleria gruberi]
Length = 937
Score = 370 bits (950), Expect = 2e-99, Method: Compositional matrix adjust.
Identities = 263/812 (32%), Positives = 395/812 (48%), Gaps = 117/812 (14%)
Query: 115 GICDDAPVLANQFSVFVSRPNGEKFSSVL---CPRSPGVPKKNTDSGIE-SW----DWNL 166
GI D + NQFS+ V R NGE S VL C G K+ ++ +W D
Sbjct: 178 GIYSDTVLPVNQFSLRVER-NGEIESFVLNPICEFKDGTMDKDLIKKLKHNWMFINDERR 236
Query: 167 KGENCTY----HALFPRAWTVYDGEPDPELRIVCRQISPFIPHNYKESSFPASVFTFTLS 222
+ + ++LFP++WTVY D + ++C + SP HNY+ESSFP F +T+
Sbjct: 237 RKNKMQFSQQVYSLFPKSWTVYQT-SDEKFTVICERYSPVWAHNYQESSFPTCNFEWTII 295
Query: 223 NSGQTSADVTLLFTWANSVAGDSGLSGHHFNSKTMTK--DGVHGLTLHHRTANG---RPP 277
N+ A+V+LLFT + + + S D V LT+ + +
Sbjct: 296 NNTDEEANVSLLFT----IQSEEKFRKKNEQSIIARNETDSVKCLTIETTVTDSFSHKDT 351
Query: 278 VTFAVAAEETADVHVSECPCFLLSGNSKGIT-AKDMWNEIKKHGSF------DHLDNDKT 330
+ F + A + FL N + D+W E +G F +
Sbjct: 352 IQFCIGANTSLG-----SITFLDEFNGTNESHLHDLWQEFNNNGEFSTEFCENFAKRHDW 406
Query: 331 SPSEPGSSIGAAIAASLTIPSGSTRSVTFSLAWDCPEVKF---FEKVYHRRYTKFYGTLG 387
+ S+ AI + + IPS S++ V F+L W+ P V+F E Y R YT+F
Sbjct: 407 TKSQSKHLSCHAICSKVQIPSNSSKQVCFNLNWNAPFVRFGPTMEFRYLRHYTQFCTDAI 466
Query: 388 DSAARIAHDAILEHAK--------WECEIEAWQRPILED--KRFPEWYPITLFNELYYLN 437
S + A E K W+ EI W + +++ K P LFNELYY+
Sbjct: 467 SSITQHKDTACFEMCKRSLDSSKQWKDEISQWHQGVIDQVGKDHPTAVA-ALFNELYYIV 525
Query: 438 AGGTIWTDGSPPMQSLATIRERKFSLDTPSKIHTPASSDTALGTRLLENG--EENIGQFL 495
GGT+WT+G L ENG +++ F
Sbjct: 526 DGGTLWTNGG----------------------------------ELDENGNSQDSKDYFF 551
Query: 496 YLEGAEYVMYNTYDVHFYSSFALVMLFPKLELSIQRDFAAAVMMHDPGTMKIMSDGKWVA 555
YLEG EY+M NTYDVHFY+S A++M +P+++LSIQRD AV + +
Sbjct: 552 YLEGHEYLMCNTYDVHFYASHAMIMNWPEIQLSIQRDIIRAVREENNQEVTFYLSNMKNR 611
Query: 556 RKCLGAVPHDIGL--DDPWFEINSYNLFNSSRWKDLNSKFVLQVYRDFVATGDKNFARAV 613
RK G+VPHD+G + PW +N+YN + ++WKDLNS+F+L ++RDF+ T ++ F
Sbjct: 612 RKVKGSVPHDVGTPYERPWKLVNAYNFQDVNKWKDLNSQFILTIFRDFMITKNREFLEEA 671
Query: 614 WPSVYIAMAY-MEQFDKDGDGMIENEGFPDQTYDAWSANGVSAYCGGLWVAALQAASALA 672
+PS+ +A+ Y + FDKDGDG+IENE FPD TYDAW GVSAY G LW+A+L+A +A
Sbjct: 672 FPSIILALNYSLTNFDKDGDGLIENESFPDTTYDAWKVTGVSAYSGMLWIASLKAILEMA 731
Query: 673 NDVGDHASASYFWVRYQKAKAVYDS-LWN--GSYFNYDNSDGSSSTSIQADQLAGQW--- 726
++ D + + ++S LW+ Y++YD S SI +D L G +
Sbjct: 732 KELDDENTRQKVEPLIESGLKTFESKLWDETNQYYHYDASTEPQHDSIMSDHLHGTFMLL 791
Query: 727 ----YARACG-------LLPIADEAKVKKALTKIYDFNVLKVK--GGMCGAMNGMQPDGR 773
Y + G L+P E +V K+L KI + N +K + + G +NGM+P
Sbjct: 792 LIGNYVKNSGRCEDSHYLIPFNME-RVSKSLNKILENNFVKYQQITKLGGCVNGMRPTQE 850
Query: 774 IDMSGLQAREIWPGVTYGLAASMI----QEEMVDMAFQTAAGVYEVAWSEDGLGYSFQTP 829
+D + LQ+RE+W G +Y +AA I +EE +D+ V++ WS + + FQTP
Sbjct: 851 VDKTSLQSREMWTGTSYVVAALTILLGRREEGIDL----MKSVFDKCWSSNS-AFWFQTP 905
Query: 830 ESWNNNDEYRSLCYMRPLTIWAMQWALTKPKL 861
E+ EYR+L Y RPL+IW+ A+ K+
Sbjct: 906 EAVTETGEYRALGYCRPLSIWSCLSAMMNKKI 937
>gi|443318654|ref|ZP_21047901.1| putative bile acid beta-glucosidase [Leptolyngbya sp. PCC 6406]
gi|442781756|gb|ELR91849.1| putative bile acid beta-glucosidase [Leptolyngbya sp. PCC 6406]
Length = 857
Score = 369 bits (946), Expect = 6e-99, Method: Compositional matrix adjust.
Identities = 273/864 (31%), Positives = 393/864 (45%), Gaps = 158/864 (18%)
Query: 87 QGIPLGGIGAGSIGRSYRGEFQRFKLFHGICDDAPVLANQFSVFVSRPNGEKFSSVLCPR 146
G PLGG GAG IGRS G+F + L G A QFSVF + ++ + L
Sbjct: 39 HGAPLGGFGAGCIGRSPHGDFNLWHLDGGEHTFQTFPACQFSVF-EQWGDQRQAYALSTE 97
Query: 147 SPGVPKKNTDSGIESWDW---NLKGEN--------------------CTYHALFPRAWTV 183
+P TD + W W L+ N TYHAL+PR+W V
Sbjct: 98 AP------TDGTLGQWQWYPATLQNSNQNSKFKIQNSELPASPPLSTGTYHALYPRSWYV 151
Query: 184 YDGEPDPELRIVCRQISPFIPHNYKESSFPASVFTFTLSNSGQTSADVTLLFTWANSVA- 242
Y E + ++ C Q SP NY+E+S+P +VF +T N +++L +W N V
Sbjct: 152 Y--ENVFQAQLTCEQFSPIWARNYQEASYPVAVFLWTAHNPTDQPITLSILLSWQNMVGW 209
Query: 243 ------------GDSGLSGHHF---------NSKTMTKDGVH-GLTLHHRTANGRPPV-- 278
D G + + N+ + + G G ++ + G P
Sbjct: 210 FTNTAKSPQVLQRDDGSPYYDYVSAIGQSAGNANRLVQTGAAVGYVMNGAISPGITPTEG 269
Query: 279 --TFAVAAEETADVHVSECPCFLLSGNSKGITAKDMWNEIKKHGSFDHLDNDKTSPSEPG 336
+A+A V V + N G D+WN GS + +D+ P+ G
Sbjct: 270 EGQWAIATLTQPGVEV----FYHTRWNPVG-DGGDLWNGFAIDGSLVNSGDDR--PAAAG 322
Query: 337 SSIGAAIAASLTIPSGSTRSVTFSLAWDCPEVKFFEKVY-HRRYTKFYGTLGDSAARIAH 395
IG AIA T+ G TR + +L WD P ++ V +RRYT F+G G +A IA
Sbjct: 323 EQIGGAIAVRFTLQPGETRQMPMTLVWDLPVTEYAAGVADYRRYTDFFGREGQNAWVIAQ 382
Query: 396 DAILEHAKWECEIEAWQRPILEDKRFPEWYPITLFNELYYLNAGGTIWTDGSPPMQSLAT 455
A+ + W +I WQ+PIL+ FP+W+ + LFNELY L +GGT+WT
Sbjct: 383 TALANYTTWRQQIINWQQPILDQPDFPDWFKMALFNELYDLTSGGTLWT----------- 431
Query: 456 IRERKFSLDTPSKIHTPASSDTALGTRLLENGEENIGQFLYLEGAEYVMYNTYDVHFYSS 515
A+++T + +GQF LE +Y Y + DV Y
Sbjct: 432 -----------------AATET-----------DPVGQFGVLECVDYRWYESLDVRLYGG 463
Query: 516 FALVMLFPKLELSIQRDFAAAVMMHD--PGTM----KIMSDGKWVARKCLGAVPHDIGL- 568
FA ++L+P+LE ++ R FA A+ D P + I + RK GA PHD+G
Sbjct: 464 FATLILWPELEKAVIRAFARAIPTADDHPRIIGYYYTIGETDHYAPRKLPGATPHDLGAP 523
Query: 569 -DDPWFEINSYNLFNSSRWKDLNSKFVLQVYRDFVATG--DKNFARAVWPSVYIAMAYME 625
+ P+ + N + + ++WKDL S FV+QVYR + TG D +F WP+V + Y++
Sbjct: 524 NEHPFVKTNYTSYQDCNQWKDLPSDFVIQVYRAYKMTGATDGDFLADCWPAVTETLKYLK 583
Query: 626 QFDKDGDGMIENEGFPDQTYDAWSANGVSAYCGGLWVAALQAASALANDVGDHASA---- 681
+FD D DG+ EN G PDQT+D W G+SAYCGGLW+AAL++A A+A+ + A
Sbjct: 584 RFDTDSDGIPENGGAPDQTFDDWQLKGLSAYCGGLWMAALESAIAIADTLTAAGRAPGNT 643
Query: 682 -----SYFWVRYQKAKAVYDSLWNGSYFNYDNSDGSSSTSIQADQLAGQWYARACGLLPI 736
Y Q K + LWNG Y+ + GS S+ + ADQL GQ+ AR L +
Sbjct: 644 PILVTQYRRWLEQSRKVYHKRLWNGRYYRLET--GSDSSVVMADQLCGQFCARLMELPDL 701
Query: 737 ADEAKVKKALTKIYDFNVLKVK---------------------------GGMCGAMNGMQ 769
+E V+ AL IYD LK G GA NG+
Sbjct: 702 VEEKYVQSALAAIYDACFLKFNNYAATQARPQEQKFIGSQTGTFRSAQPGIPIGAANGVL 761
Query: 770 PDGRI-DMSGLQAREIWPGVTYGLAASMIQEEMVDMAFQTAAGVYEVAWSEDGLGYSFQT 828
PDG D G E+W G+ +GLA Q VD AF V + ++ G F+T
Sbjct: 762 PDGSPEDPDGTHQLEVWTGINFGLATFFAQMGQVDKAFAITEAVVQQIYTH---GLQFRT 818
Query: 829 PESWNNNDEYRSLCYMRPLTIWAM 852
PE+ +R+ Y+RPL IW +
Sbjct: 819 PEAITALGTFRACHYLRPLAIWGL 842
>gi|75907319|ref|YP_321615.1| hypothetical protein Ava_1096 [Anabaena variabilis ATCC 29413]
gi|75701044|gb|ABA20720.1| Protein of unknown function DUF608 [Anabaena variabilis ATCC 29413]
Length = 804
Score = 368 bits (944), Expect = 1e-98, Method: Compositional matrix adjust.
Identities = 270/818 (33%), Positives = 389/818 (47%), Gaps = 110/818 (13%)
Query: 87 QGIPLGGIGAGSIGRSYRGEFQRFKLFHG--ICDDAPVLANQFSVFVSRPNGEKFSSVLC 144
G+PLGG GAG IGRS RG+F + + G I + P A QFSVF S NG +
Sbjct: 40 HGMPLGGFGAGCIGRSSRGDFNLWHIDGGEHIFQNFP--ACQFSVFES--NGTSSQAYAL 95
Query: 145 PRSPGVPKKNTDSGIESWDW----NLKGENCTYHALFPRAWTVYDGEPDPELRIVCRQIS 200
P D ++SW W TYHAL+PR+W VY+ EL C Q S
Sbjct: 96 STQP-----TDDGSLKSWQWYPASTATQTTGTYHALYPRSWFVYENVFQAEL--TCEQFS 148
Query: 201 PFIPHNYKESSFPASVFTFTLSNSGQTSADVTLLFTWANSVA-------------GDSGL 247
P +NY+E+S+P +VF + N ++++ TW N V D G
Sbjct: 149 PIWANNYQETSYPVAVFVWQAHNPTNAPITLSIMLTWQNMVGWFTNALKSPDVRVRDDGS 208
Query: 248 SGHHFNSKTMTKDGVHGLT----LHHRTANGRPPVTFAVAAEETADVHVS----ECPCFL 299
+ + + G + H GR + + E + V+ + F
Sbjct: 209 PVYEYQPRLGESGGNYNYLDESPQHLGCFLGRVGMAEPLQEGEGSWCIVTRKHPQVEIFH 268
Query: 300 LSGNSKGITAKDMWNEIKKHGSF-DHLDNDKTSPSEPGSSIGAAIAASLTIPSGSTRSVT 358
+ + T +++W GS +++D TSP +GAAIA T+ G T +
Sbjct: 269 HTRWNPVGTGEEVWQSFAADGSLANYID---TSPVSENEQLGAAIAVRFTLQPGETLEIP 325
Query: 359 FSLAWDCPEVKFFEKV-YHRRYTKFYGTLGDSAARIAHDAILEHAKWECEIEAWQRPILE 417
F ++WD P +F V Y+RRYT F+G G+ A IA A+ ++ W+ +I+AWQ PIL
Sbjct: 326 FVVSWDLPVTEFAAGVNYYRRYTDFFGKSGNHAWAIATIALEQYQTWQQQIQAWQDPILN 385
Query: 418 DKRFPEWYPITLFNELYYLNAGGTIWTDGSPPMQSLATIRERKFSLDTPSKIHTPASSDT 477
P+W+ + LFNELY L +GGT+W+ +P
Sbjct: 386 RDDLPDWFKMALFNELYDLTSGGTLWSAATP----------------------------- 416
Query: 478 ALGTRLLENGEENIGQFLYLEGAEYVMYNTYDVHFYSSFALVMLFPKLELSIQRDFAAAV 537
+ IGQF LE +Y Y + DV Y SF L+ LFP+LE ++ R FA A+
Sbjct: 417 ----------SDPIGQFAVLECLDYRWYESLDVRLYGSFGLLQLFPELEKAVMRAFARAI 466
Query: 538 MMHD--PGTM----KIMSDGKWVARKCLGAVPHDIGLDDP--WFEINSYNLFNSSRWKDL 589
D P + I ++ RK GA PHD+G + W + N + + + WKDL
Sbjct: 467 PQGDDTPRVIGYYYTIGAESPIAVRKTPGATPHDLGAPNEHVWEKTNYTSYQDCNLWKDL 526
Query: 590 NSKFVLQVYRDFVATG--DKNFARAVWPSVYIAMAYMEQFDKDGDGMIENEGFPDQTYDA 647
FVLQVYRDF+ TG D F R W ++ + Y++ FD DGDG+ EN G PDQT+D
Sbjct: 527 GCDFVLQVYRDFLLTGADDVQFLRDCWDAIVETLDYVKTFDLDGDGIPENSGAPDQTFDD 586
Query: 648 WSANGVSAYCGGLWVAALQAASALANDVGDHASASYFWVRYQKAKAVYDS---------- 697
W GVSAYCGGLW+AAL AA A+++ + + S + K+ Y++
Sbjct: 587 WRLQGVSAYCGGLWMAALAAAIAISDILLQNHQDSETKEKLLLQKSTYETWLTKSLPIYQ 646
Query: 698 --LWNGSYFNYDNSDGSSSTSIQADQLAGQWYARACGLLPIADEAKVKKALTKIYDFNVL 755
LWNG Y+ D+ GS + ADQL GQ+YA L I + AL +YD L
Sbjct: 647 EKLWNGKYYRLDSESGSD--VVMADQLCGQFYANLLELPDIVPSDRAISALQTVYDACFL 704
Query: 756 KVKGGMCGAMNGMQPDGRIDM-SGLQAREIWPGVTYGLAASMIQEEMVDMAFQTAAGVYE 814
K G GA NG++PDG + E+W G+ +GLAA ++Q M D F+ V
Sbjct: 705 KFYDGQFGAANGVRPDGSPENPKATHPLEVWTGINFGLAAFLVQMGMKDEGFRLTQAVVA 764
Query: 815 VAWSEDGLGYSFQTPESWNNNDEYRSLCYMRPLTIWAM 852
++ G F+TPE+ +R+ Y+R + IWA+
Sbjct: 765 QIYNN---GLQFRTPEAITAAGTFRASTYLRAMAIWAI 799
>gi|81299663|ref|YP_399871.1| hypothetical protein Synpcc7942_0854 [Synechococcus elongatus PCC
7942]
gi|81168544|gb|ABB56884.1| conserved hypothetical protein [Synechococcus elongatus PCC 7942]
Length = 798
Score = 367 bits (942), Expect = 2e-98, Method: Compositional matrix adjust.
Identities = 261/824 (31%), Positives = 385/824 (46%), Gaps = 122/824 (14%)
Query: 87 QGIPLGGIGAGSIGRSYRGEFQRFKLFHGICDDAPVLANQFSVFVSRPNGEKFSSVLCPR 146
G PLGG GAG GRS RG+ + L G + A QF+V+ S S+
Sbjct: 36 HGAPLGGFGAGCWGRSPRGDVTLWHLDGGEHWYGSIPACQFAVYESGTGAYALST----- 90
Query: 147 SPGVPKKNTDSGIESWDW----NLKGENCTYHALFPRAWTVYDGEPDPELRIVCRQISPF 202
+ +D + SW+W + Y AL+PR+ Y + E I CRQ SP
Sbjct: 91 -----EAPSDGSLSSWNWYPASTAERSTGEYSALYPRSQFSY--QQVFEAEIHCRQFSPI 143
Query: 203 IPHNYKESSFPASVFTFTLSNSGQTSADVTLLFTWAN-------------SVAGDSGLSG 249
+PH+Y+ +S+P ++F + L N +++L +W N V D G
Sbjct: 144 LPHDYQATSYPTAIFRWQLHNPSDRPLTISILLSWENLCGWFTNTNKAPEVVYRDDGSPV 203
Query: 250 HHFN----------SKTMTKDGVHGLTLHHRTANGRPPV---TFAVAAEETADVHVSECP 296
+ + ++ + +G GL L +T + P +A+A E + + C
Sbjct: 204 YDYVPALGQSVGNLNQRIAGEGWQGLLLD-QTRSQDPEEGDGQWAIAIAEAEGLEIFRCD 262
Query: 297 CFLLSGNSKGITAKDMWNEIKKHGSFDHLDNDKTSPSEPGSSIGAAIAASLTIPSGSTRS 356
+ +G+ ++W GS D + P+ G + AIA LT+ G +R
Sbjct: 263 RWDPTGD-----GSELWQSFALDGSIP--DRQDSQPAAAGERLAGAIAVRLTLQPGESRE 315
Query: 357 VTFSLAWDCPEVKFFEKV-YHRRYTKFYGTLGDSAARIAHDAILEHAKWECEIEAWQRPI 415
+ FS+AWD P +F V RRYT F+G G +AA IA + + +WE I AWQ+PI
Sbjct: 316 IPFSIAWDLPVTEFAAGVKAFRRYTDFFGRDGRNAAAIAATGLKHYDEWEQAIAAWQQPI 375
Query: 416 LEDKRFPEWYPITLFNELYYLNAGGTIWTDGSPPMQSLATIRERKFSLDTPSKIHTPASS 475
L+ + + + LFNELY L +GG++WT S
Sbjct: 376 LDRDDLSDTFKLGLFNELYDLCSGGSLWTAASE--------------------------- 408
Query: 476 DTALGTRLLENGEENIGQFLYLEGAEYVMYNTYDVHFYSSFALVMLFPKLELSIQRDFAA 535
+ +GQF LE +Y Y + DV Y +F L+ML+P+LE ++ R FA
Sbjct: 409 ------------ADPVGQFAVLECLDYAWYESLDVRLYGAFGLLMLWPELEKAVMRAFAR 456
Query: 536 AVMMHDPGT------MKIMSDGKWVARKCLGAVPHDIGL--DDPWFEINSYNLFNSSRWK 587
A+ D T I + RK A PHD+G + PW + N + + WK
Sbjct: 457 AIPQADDRTRVIGYWFTIGQESPLAKRKLANATPHDLGAPNESPWLQTNYTGYQDCNLWK 516
Query: 588 DLNSKFVLQVYRDFVATG--DKNFARAVWPSVYIAMAYMEQFDKDGDGMIENEGFPDQTY 645
DL FVLQV+RD+ TG D+ F WP+ A+ Y++ FD DGDG+ EN G PDQT+
Sbjct: 517 DLGCDFVLQVWRDYQLTGSSDRGFLSDCWPAAVAALRYLKDFDLDGDGIPENSGAPDQTF 576
Query: 646 DAWSANGVSAYCGGLWVAALQAA---------SALANDVGDHASASYFWVRYQKAKAVY- 695
D W G+SAYCGGLW+AAL+AA SA D D S W+ +A+++Y
Sbjct: 577 DDWRLQGISAYCGGLWIAALEAALAIADVLELSAEDRDRQDFQS----WL--AQARSLYH 630
Query: 696 DSLWNGSYFNYDNSDGSSSTSIQADQLAGQWYARACGLLPIADEAKVKKALTKIYDFNVL 755
D+LWNG Y+ D+ GS S + ADQL G +Y+R L P+A + +Y+
Sbjct: 631 DTLWNGRYYQLDS--GSGSQVVMADQLCGDFYSRLLQLPPVAPLEAAQSTADMVYEACFQ 688
Query: 756 KVKGGMCGAMNGMQPDGR-IDMSGLQAREIWPGVTYGLAASMIQEEMVDMAFQTAAGVYE 814
K G G NG+ PDG +D G E+W G+ +G+AA + D F+ V
Sbjct: 689 KFHSGQFGLANGLLPDGSPVDPKGTHPLEVWTGINFGIAAYWLLLGHRDRCFEVTETVIR 748
Query: 815 VAWSEDGLGYSFQTPESWNNNDEYRSLCYMRPLTIWAMQWALTK 858
+ G F+TPE+ N +R+ Y+RP+ IWA+ LT
Sbjct: 749 QIYDN---GLQFRTPEAITANATFRASHYLRPMAIWAVYGVLTN 789
>gi|148670516|gb|EDL02463.1| glucosidase beta 2, isoform CRA_c [Mus musculus]
Length = 467
Score = 366 bits (940), Expect = 3e-98, Method: Compositional matrix adjust.
Identities = 198/441 (44%), Positives = 254/441 (57%), Gaps = 35/441 (7%)
Query: 419 KRFPEWYPITLFNELYYLNAGGTIWTDGSPPMQSLATIRERKFSLDTPSKIHTPASSDTA 478
+ P WY LFNELY+L GGT+W L+ P+ S
Sbjct: 10 RTLPAWYKSALFNELYFLADGGTVW-------------------LEVPAD-----SLPEG 45
Query: 479 LG--TRLLENGEENIGQFLYLEGAEYVMYNTYDVHFYSSFALVMLFPKLELSIQRDFAAA 536
LG R L + ++ G+F YLEG EY MYNTYDVHFY+SFALVML+PKLELS+Q D A A
Sbjct: 46 LGGSMRQLRSTLQDYGRFGYLEGQEYRMYNTYDVHFYASFALVMLWPKLELSLQYDMALA 105
Query: 537 VMMHDPGTMKIMSDGKWVARKCLGAVPHDIGL--DDPWFEINSYNLFNSSRWKDLNSKFV 594
+ D + + G K +PHDIG D+PW +N+Y + +++ WKDLN KFV
Sbjct: 106 TLKEDLTRRRYLMSGVVAPVKRRNVIPHDIGDPDDEPWLRVNAYLIHDTADWKDLNLKFV 165
Query: 595 LQVYRDFVATGDKNFARAVWPSVYIAMAYMEQFDKDGDGMIENEGFPDQTYDAWSANGVS 654
LQ+YRD+ TGD+ F +WP M +FDKD DG+IEN G+ DQTYDAW G S
Sbjct: 166 LQIYRDYYLTGDQGFLEDMWPVCLAVMESEMKFDKDQDGLIENGGYADQTYDAWVTTGPS 225
Query: 655 AYCGGLWVAALQAASALANDVGDHASASYFWVRYQKAKAVYDS-LWNGSYFNYDNSDGSS 713
AYCGGLW+AA+ +A G F + + Y+ LWNG Y+NYD+S
Sbjct: 226 AYCGGLWLAAVAVMVQMAVLCGAQDVQERFASILCRGREAYERLLWNGRYYNYDSSSHPQ 285
Query: 714 STSIQADQLAGQWYARACGL----LPIADEAKVKKALTKIYDFNVLKVKGGMCGAMNGMQ 769
S SI +DQ AGQW+ RACGL + V +AL I++ NV GG GA+NGM
Sbjct: 286 SRSIMSDQCAGQWFLRACGLGEGDTEVFPTLHVVRALQTIFELNVQAFAGGAMGAVNGMH 345
Query: 770 PDGRIDMSGLQAREIWPGVTYGLAASMIQEEMVDMAFQTAAGVYEVAWSEDGLGYSFQTP 829
P G D S +Q+ E+W GV YGLAA+MIQE + F+TA G Y W LG +FQTP
Sbjct: 346 PHGVPDRSSVQSDEVWVGVVYGLAATMIQEGLTWEGFRTAEGCYRTVWER--LGLAFQTP 403
Query: 830 ESWNNNDEYRSLCYMRPLTIW 850
E++ +RSL YMRPL+IW
Sbjct: 404 EAYCQQQVFRSLAYMRPLSIW 424
>gi|56750695|ref|YP_171396.1| hypothetical protein syc0686_c [Synechococcus elongatus PCC 6301]
gi|56685654|dbj|BAD78876.1| hypothetical protein [Synechococcus elongatus PCC 6301]
Length = 798
Score = 365 bits (936), Expect = 8e-98, Method: Compositional matrix adjust.
Identities = 260/824 (31%), Positives = 384/824 (46%), Gaps = 122/824 (14%)
Query: 87 QGIPLGGIGAGSIGRSYRGEFQRFKLFHGICDDAPVLANQFSVFVSRPNGEKFSSVLCPR 146
G PLGG GAG GRS RG+ + L G + A QF+V+ S S+
Sbjct: 36 HGAPLGGFGAGCWGRSPRGDVTLWHLDGGEHWYGSIPACQFAVYESGTGAYALST----- 90
Query: 147 SPGVPKKNTDSGIESWDW----NLKGENCTYHALFPRAWTVYDGEPDPELRIVCRQISPF 202
+ +D + SW+W + Y AL+PR+ Y + E I CRQ SP
Sbjct: 91 -----EAPSDGSLSSWNWYPASTAERSTGEYSALYPRSQFSY--QQVFEAEIHCRQFSPI 143
Query: 203 IPHNYKESSFPASVFTFTLSNSGQTSADVTLLFTWAN-------------SVAGDSGLSG 249
+PH+Y+ +S+P ++F + L N +++L +W N V D G
Sbjct: 144 LPHDYQATSYPTAIFRWQLHNPSDRPLTISILLSWENLCGWFTNTNKAPEVVYRDDGSPV 203
Query: 250 HHFN----------SKTMTKDGVHGLTLHHRTANGRPPV---TFAVAAEETADVHVSECP 296
+ + ++ + +G GL L +T + P +A+A E + + C
Sbjct: 204 YDYVPALGQSVGNLNQRIAGEGWQGLLLD-QTRSQDPEEGDGQWAIAIAEAEGLEIFRCD 262
Query: 297 CFLLSGNSKGITAKDMWNEIKKHGSFDHLDNDKTSPSEPGSSIGAAIAASLTIPSGSTRS 356
+ +G+ ++W GS D + P+ G + AIA LT+ G +R
Sbjct: 263 RWDPTGD-----GSELWQSFALDGSIP--DRQDSQPAAAGERLAGAIAVRLTLQPGESRE 315
Query: 357 VTFSLAWDCPEVKFFEKV-YHRRYTKFYGTLGDSAARIAHDAILEHAKWECEIEAWQRPI 415
+ FS+AWD P +F V RRYT F+G G +AA IA + + +WE I AWQ+PI
Sbjct: 316 IPFSIAWDLPVTEFAAGVKAFRRYTDFFGRDGRNAAAIAATGLKHYDEWEQAIAAWQQPI 375
Query: 416 LEDKRFPEWYPITLFNELYYLNAGGTIWTDGSPPMQSLATIRERKFSLDTPSKIHTPASS 475
L+ + + + LFNELY L +GG++WT S
Sbjct: 376 LDRDDLSDTFKLGLFNELYDLCSGGSLWTAASE--------------------------- 408
Query: 476 DTALGTRLLENGEENIGQFLYLEGAEYVMYNTYDVHFYSSFALVMLFPKLELSIQRDFAA 535
+ +GQF LE +Y Y + DV Y +F L+ML+P+LE ++ R FA
Sbjct: 409 ------------ADPVGQFAVLECLDYAWYESLDVRLYGAFGLLMLWPELEKAVMRAFAR 456
Query: 536 AVMMHDPGT------MKIMSDGKWVARKCLGAVPHDIGL--DDPWFEINSYNLFNSSRWK 587
A+ D T I + RK A PHD+G + PW + N + + WK
Sbjct: 457 AIPQADDRTRVIGYWFTIGQESPLAKRKLANATPHDLGAPNESPWLQTNYTGYQDCNLWK 516
Query: 588 DLNSKFVLQVYRDFVATG--DKNFARAVWPSVYIAMAYMEQFDKDGDGMIENEGFPDQTY 645
DL FVLQV+RD+ TG D+ F WP+ A+ Y++ FD DGDG+ EN G PDQT+
Sbjct: 517 DLGCDFVLQVWRDYQLTGSSDRGFLSDCWPAAVAALRYLKDFDLDGDGIPENSGAPDQTF 576
Query: 646 DAWSANGVSAYCGGLWVAALQAA---------SALANDVGDHASASYFWVRYQKAKAVY- 695
D W G+SAYCGGLW+AAL+AA SA D D S W+ +A+++Y
Sbjct: 577 DDWRLQGISAYCGGLWIAALEAALAIADVLELSAEDRDRQDFQS----WL--AQARSLYH 630
Query: 696 DSLWNGSYFNYDNSDGSSSTSIQADQLAGQWYARACGLLPIADEAKVKKALTKIYDFNVL 755
D+LWNG Y+ D+ GS S + ADQL G +Y+R L P+A + +Y+
Sbjct: 631 DTLWNGRYYQLDS--GSGSQVVMADQLCGDFYSRLLQLPPVAPLEAAQSTADMVYEACFQ 688
Query: 756 KVKGGMCGAMNGMQPDGR-IDMSGLQAREIWPGVTYGLAASMIQEEMVDMAFQTAAGVYE 814
K G G NG+ PDG + G E+W G+ +G+AA + D F+ V
Sbjct: 689 KFHSGQFGLANGLLPDGSPVGPKGTHPLEVWTGINFGIAAYWLLLGHRDRCFEVTETVIR 748
Query: 815 VAWSEDGLGYSFQTPESWNNNDEYRSLCYMRPLTIWAMQWALTK 858
+ G F+TPE+ N +R+ Y+RP+ IWA+ LT
Sbjct: 749 QIYDN---GLQFRTPEAITANATFRASHYLRPMAIWAVYGVLTN 789
>gi|427419169|ref|ZP_18909352.1| putative bile acid beta-glucosidase [Leptolyngbya sp. PCC 7375]
gi|425761882|gb|EKV02735.1| putative bile acid beta-glucosidase [Leptolyngbya sp. PCC 7375]
Length = 835
Score = 360 bits (925), Expect = 2e-96, Method: Compositional matrix adjust.
Identities = 271/861 (31%), Positives = 395/861 (45%), Gaps = 159/861 (18%)
Query: 86 DQGIPLGGIGAGSIGRSYRGEFQRFKLFHGICDDAPVLANQFSVFVSRPNGEKF------ 139
+ G+PLGG+GAG IGR+ G F + L G A QF+V+ +
Sbjct: 36 EHGMPLGGLGAGCIGRAPNGTFNLWHLDGGEHTFEHFPACQFAVYEEAAGASQTYAMATA 95
Query: 140 --------------SSVLCPRSPGVPKKNTDSGIESWDWNLKGENCTYHALFPRAWTVYD 185
SV PG P +T S I+ TY+AL+PR+W Y
Sbjct: 96 GDGLERWQWYPAENRSVGTAHPPGQPGNDTTSTIK---------GGTYNALYPRSWYSYA 146
Query: 186 GEPDPELRIVCRQISPFIPHNYKESSFPASVFTFTLSNSGQTSADVTLLFTWANSVAGDS 245
L C Q SP P NY+ESS+P +VF +T N +++L +W N V
Sbjct: 147 DVFKANL--TCEQFSPIWPDNYQESSYPVAVFVWTAHNPTDQPITLSILLSWQNMV---- 200
Query: 246 GLSGHHFNSKT-----MTKDGV----HGLTLHHRTANGRPPVTFAVAAEETADVHVSECP 296
G NS++ +DG +G L N V+ A D ++ P
Sbjct: 201 ---GWFTNSESSPEIQQREDGSPFYDYGPPLRQSAGNFNQLVSDERAIGMVMDGAWTDSP 257
Query: 297 -------CFLLSGNSKGIT----------AKDMWNEIKKHGSFDHLDNDKTSPSEPGSSI 339
C + ++GIT D+W HG+ ++ + T+ + +
Sbjct: 258 GEGDGQFCIAVPA-TEGITYHTRWNPHGDGGDLWESFAGHGTLSNVAD--TTSAVKDEQV 314
Query: 340 GAAIAASLTIPSGSTRSVTFSLAWDCPEVKFFE-KVYHRRYTKFYGTLGDSAARIAHDAI 398
G AIA T+ G T+ + F L WD P +F K+ +RRYT F G +A ++A +A+
Sbjct: 315 GGAIAIKFTLAPGETQELPFVLTWDLPVTEFATGKLAYRRYTDFCDCSGRNAWQLAQEAL 374
Query: 399 LEHAKWECEIEAWQRPILEDKRFPEWYPITLFNELYYLNAGGTIWTDGSPPMQSLATIRE 458
+ W I WQ+PIL+ P+W+ + LFNELY + +GGT+W+
Sbjct: 375 ANYPTWRQNIIDWQQPILDRPDLPDWFKMALFNELYDMASGGTLWS-------------- 420
Query: 459 RKFSLDTPSKIHTPASSDTALGTRLLENGEENIGQFLYLEGAEYVMYNTYDVHFYSSFAL 518
A+SD + +GQF LE +Y Y + DV Y F+
Sbjct: 421 --------------AASDI-----------DPVGQFGVLECIDYRWYESLDVRLYGGFST 455
Query: 519 VMLFPKLELSIQRDFAAAVMMHDP-----GTMKIMSDGKWVA-RKCLGAVPHDIGL--DD 570
+ML+P+LE +I R FA A+ DP G + + + +A RK GA PHD+G ++
Sbjct: 456 LMLWPELEKAIIRAFARAIPTEDPKKRIIGYYYTVGESEHLAPRKLKGATPHDLGAPNEN 515
Query: 571 PWFEINSYNLFNSSRWKDLNSKFVLQVYRDFVATG--DKNFARAVWPSVYIAMAYMEQFD 628
PW N + + ++WKDL S FVLQVYR FV TG D F WP+V + Y++ FD
Sbjct: 516 PWIATNYTSYQDCNQWKDLPSDFVLQVYRAFVLTGGDDLEFLAECWPAVTQTLQYLKLFD 575
Query: 629 KDGDGMIENEGFPDQTYDAWSANGVSAYCGGLWVAALQAASALANDVGDH--ASASYFWV 686
DGDG+ EN G PDQT+D W G+SAYCGGLW+AAL+AA + + + H S S+ V
Sbjct: 576 LDGDGVPENGGAPDQTFDDWQLKGLSAYCGGLWLAALEAAIKMGDLLQQHRKISGSFAIV 635
Query: 687 RYQ------KAKAVY-DSLWNGSYFNYDNSDGSSSTSIQADQLAGQWYARACGLLPIADE 739
+ Q + K VY + LW GS++ D+ S+S + ADQL GQ+ AR L I D+
Sbjct: 636 KSQYQRWIKQGKIVYQNKLWTGSFYRLDSE--SNSDVVMADQLCGQFCARLMDLPDIVDQ 693
Query: 740 AKVKKALTKIYD-----FNVLKVK----------------------GGMCGAMNGMQPDG 772
+ AL IYD FN K G GA NG++PDG
Sbjct: 694 EFIDIALDTIYDACFVKFNEYAAKHSTPQNQKFEGSQAGFFSAAQLGMAIGAANGVKPDG 753
Query: 773 RIDM-SGLQAREIWPGVTYGLAASMIQEEMVDMAFQTAAGVYEVAWSEDGLGYSFQTPES 831
+ E+W G+ +G+AA + Q D A V + ++ G F+TPE+
Sbjct: 754 SPENPDDTHQLEVWTGINFGVAAFLAQMGKRDQAMAITEAVVQQIYN---YGLQFRTPEA 810
Query: 832 WNNNDEYRSLCYMRPLTIWAM 852
+R+ Y+RP+ IW +
Sbjct: 811 ITAMGTFRACHYLRPMAIWGL 831
>gi|162452686|ref|YP_001615053.1| hypothetical protein sce4410 [Sorangium cellulosum So ce56]
gi|161163268|emb|CAN94573.1| hypothetical protein sce4410 [Sorangium cellulosum So ce56]
Length = 782
Score = 358 bits (920), Expect = 6e-96, Method: Compositional matrix adjust.
Identities = 260/775 (33%), Positives = 374/775 (48%), Gaps = 101/775 (13%)
Query: 86 DQGIPLGGIGAGSIGRSYRGEFQRFKLFHGICDDAPVLANQFSVFVSRPNGEKFSSVLCP 145
+ G P+GG GAG+I + G F + +L G D + F ++ G+ + L
Sbjct: 95 EDGAPIGGFGAGTITWRFDGRFYKGRLDIGSNDQNVDNSAGFFMYQKAEGGDAQTKRLDK 154
Query: 146 RSPGVPKKNTDSGIESWDWNLKGENCTYHALFPRAWTVYDGEPDPELRIVCRQISPFIPH 205
G +G+ TY +LFPR+W Y G P + Q SP IP
Sbjct: 155 DGLG-----------------QGQ-ATYSSLFPRSWVDYHGAAFP-CKAKVEQYSPIIPG 195
Query: 206 NYKESSFPASVFTFTLSNSGQTSADVTLLFTWANSVAGDSGLSGHHFNSKTMTKDGVH-G 264
+YK++S+P V+ + ++N DV ++ TW N H N T+ G G
Sbjct: 196 DYKKTSYPVGVYRWEITNPTGAPCDVAVMLTWKND---------HGGNGATVQTSGDDVG 246
Query: 265 LTLHHRTANGRPPVTFAVAAEETADVHVSECPCFLLSGNSKGITAKDMWNEIKKHGSFDH 324
L L T G P + + + P ++S S A + +E+ G+ +
Sbjct: 247 LVL---TRGGGAPSS---EGQGEFTLASRNAPGVVVSYQSADSVAA-LQSELANDGALAN 299
Query: 325 LDNDKTSPSEPGSSIGAAIAASLTIPSGSTRSVTFSLAWDCPEVKFFE-KVYHRRYTKFY 383
+ G S+GA IA T+ G + V LAWD P + ++R YT+ +
Sbjct: 300 --------TTGGHSLGA-IAFKATVEPGKSVIVPIVLAWDIPVTQAGSGDRWYRAYTRHF 350
Query: 384 GTLGDSAARIAHDAILEHAKWECEIEAWQRPILEDKRFPEWYPITLFNELYYLNAGGTIW 443
G G+++ IA +A+ EH W I+ WQ +L+D ++PEW LFNELYY GT W
Sbjct: 351 GRTGNASWTIAAEALAEHEPWLWAIQDWQASVLDDPKYPEWLKGALFNELYYYAIAGTYW 410
Query: 444 TDGSPPMQSLATIRERKFSLDTPSKIHTPASSDTALGTRLLENGEENIGQFLYLEGAEYV 503
G+ Q D P +E++ F LE Y
Sbjct: 411 EAGAASGQP-----------DDP---------------------DEDM--FTSLESYIYP 436
Query: 504 MYNTYDVHFYSSFALVMLFPKLELSIQRDFAAAVMMHDPGTMKIMSDGKWVARKCLGAVP 563
Y T DV FY S+AL L+P ++ R F +V SD + +G
Sbjct: 437 FYGTSDVRFYGSWALFSLWPDIDKQEVRQFCDSVTT-------TRSD----RPRAIGTTA 485
Query: 564 HDIGLDDPWF-EINSYNLFNSSRWKDLNSKFVLQVYRDFVATG--DKNFARAVWPSVYIA 620
HD G ++ F + N+Y +S+ WKDLNSK VL VYRD+ TG DK F WP+V +A
Sbjct: 486 HDFGDNESVFRKWNAYTYRDSTNWKDLNSKLVLMVYRDWALTGKTDKAFLDYCWPAVKMA 545
Query: 621 MAYMEQFDKDGDGMIENEGFPDQTYDAWSANGVSAYCGGLWVAALQAASALANDVGDHAS 680
M ++ D DGDG+ E+ G DQTYD +G +AYCGGL++AA +AA LA VGD +
Sbjct: 546 MQKVKGQDGDGDGLPESNGI-DQTYDDMDLHGNTAYCGGLFIAASEAAKELATAVGDASL 604
Query: 681 ASYFWVRYQKAKAVYDS-LWNGSYFNYDNSDGSSSTS-IQADQLAGQWYARACGLLPIAD 738
AS + +++K ++S LW GSY+ D GS T+ I +DQLAGQWYARA GL PI D
Sbjct: 605 ASTYQTWVEQSKGGFESKLWTGSYYKIDT--GSRDTNRIMSDQLAGQWYARALGLPPIVD 662
Query: 739 EAKVKKALTKIYDFNVLKVKGGMCGAMNGMQPDGRIDMSGLQAREIWPGVTYGLAASMIQ 798
+ A TKIY+ N + GG G +N M +G +D + Q RE W G ++G+ A MIQ
Sbjct: 663 PSHAASAFTKIYENNFKRFDGGTRGVVNVMTANGSVDGTSNQTRECWVGTSWGVVAGMIQ 722
Query: 799 EEMVDMAFQTAAGVYEVAWSEDGLGYSFQTPESWNNNDEYRSLCYMRPLTIWAMQ 853
E + A + A + + W DGL F+TPE+W N R+ YMR T+WA +
Sbjct: 723 EGLAAQAGEIGASLVDTIWKTDGL--WFRTPEAWEGNGSIRAPYYMRATTLWAAK 775
>gi|443710865|gb|ELU04898.1| hypothetical protein CAPTEDRAFT_216398 [Capitella teleta]
Length = 449
Score = 355 bits (910), Expect = 9e-95, Method: Compositional matrix adjust.
Identities = 192/491 (39%), Positives = 272/491 (55%), Gaps = 72/491 (14%)
Query: 375 YHRRYTKFYGTLGDSAARIAHDAILEHAKWECEIEAWQRPILEDKRFPEWYPITLFNELY 434
YHRRY +F+G G + R+A A+ WE +I WQ+P L+ + P WY LFNE Y
Sbjct: 13 YHRRYARFFGHDGCATPRLAAHALNHFTSWEEKIAEWQQPTLKANKLPAWYKSALFNETY 72
Query: 435 YLNAGGTIWTDGSPPMQSLATIRERKFSLDTPSKIHTPASSDTALGTRLLENGEENIGQF 494
+++ GG++W D + + E F + G+F
Sbjct: 73 FISDGGSVWVDS---IDVKDAVSEHPFV--------------------------QEFGKF 103
Query: 495 LYLEGAEYVMYNTYDVHFYSSFALVMLFPKLELSIQRDFAAAVMMHDPGTMKIMSDGKWV 554
YLEG EY MYNT DVH+Y+SFAL+ L+P L+LS+Q D
Sbjct: 104 AYLEGHEYRMYNTLDVHYYASFALMKLWPNLQLSLQYDID-------------------- 143
Query: 555 ARKCLGAVPHDIGLDDPWFEINSYNLFNSSRWKDLNSKFVLQVYRDFVATGDKNFARAVW 614
++PW +N+Y + ++ WKDLN KFVLQVYRD+V T D+ + ++
Sbjct: 144 --------------NEPWSRLNAYIIHPTNEWKDLNPKFVLQVYRDYVFTKDEQYLYEMY 189
Query: 615 PSVYIAMAYMEQFDKDGDGMIENEGFPDQTYDAWSANGVSAYCGGLWVAALQAASALAND 674
P M + ++D D DG+I+N GF DQTYDAW+ G SAYCGGLW+AA++A + +A+
Sbjct: 190 PQCKAVMDHSLRWDVDRDGIIDNSGFADQTYDAWTVTGASAYCGGLWLAAVKAFTEMAHR 249
Query: 675 VGDHASASYFWVRYQKAKAVY-DSLWNG----SYFNYDNSDGSSSTSIQADQLAGQWYAR 729
+G + ++ K + + LWNG Y+NYD S+ SI ADQ+ GQW+ +
Sbjct: 250 LGVVEDLMKYEEILERGKKNFEEKLWNGWVDCEYYNYDCSESGHHDSIMADQMCGQWFLK 309
Query: 730 ACGLL--PIADEAKVKKALTKIYDFNVLKVKGGMCGAMNGMQPDGRIDMSGLQAREIWPG 787
ACG+ + A VK+AL ++ NVL GG GA+NGM+P+ + D++ Q+ E W G
Sbjct: 310 ACGVADNAVFPSANVKRALEAVFKHNVLPFDGGRMGAINGMRPNAKKDVTSCQSDEFWTG 369
Query: 788 VTYGLAASMIQEEMVDMAFQTAAGVYEVAWSEDGLGYSFQTPESWNNNDEYRSLCYMRPL 847
VTY L A+MIQ M+D FQTA G Y W GL +FQTPE++ +N +RSL YMRPL
Sbjct: 370 VTYALGATMIQVGMIDKGFQTAYGAYHTCWERYGL--AFQTPEAYFDNRRFRSLGYMRPL 427
Query: 848 TIWAMQWALTK 858
IWA+Q A+ K
Sbjct: 428 AIWAIQHAVEK 438
>gi|411117353|ref|ZP_11389840.1| putative bile acid beta-glucosidase [Oscillatoriales cyanobacterium
JSC-12]
gi|410713456|gb|EKQ70957.1| putative bile acid beta-glucosidase [Oscillatoriales cyanobacterium
JSC-12]
Length = 907
Score = 352 bits (903), Expect = 5e-94, Method: Compositional matrix adjust.
Identities = 244/774 (31%), Positives = 362/774 (46%), Gaps = 143/774 (18%)
Query: 87 QGIPLGGIGAGSIGRSYRGEFQRFKLFHGICDDAPVLANQFSVFVSRPNGEKFSSVLCPR 146
G+PLGG GAG IGRS++G+F + + G P+ A QFS+F + + + LC
Sbjct: 41 HGMPLGGFGAGCIGRSHQGDFNLWHIDGGEHVFQPMPACQFSIF-EQVEQDAQAYALCTE 99
Query: 147 SPGVPKKNTDSGIESWDWNLKGENC--------------------TYHALFPRAWTVYDG 186
+P TD + +W W E+ TYHAL+PR+W Y
Sbjct: 100 AP------TDGSLAAWSWYPTVEDADELGETSLDQARSLLSPHTGTYHALYPRSWFTYQN 153
Query: 187 EPDPELRIVCRQISPFIPHNYKESSFPASVFTFTLSNSGQTSADVTLLFTWANSVAG--- 243
+L C Q SP +PHNY+E+S+P ++F +T+ N ++L+ TW N +AG
Sbjct: 154 VFLADL--TCEQFSPILPHNYQETSYPVAIFEWTVHNPTNQPLTLSLMLTWQN-MAGWFT 210
Query: 244 -----------DSGLSGHHF---------NSKTMTKDGVHGLTL-----HHRTANGRPPV 278
D G + + N + K+G H + + G
Sbjct: 211 NASKSPDVQMRDDGSPVYDYQPRLYESAGNWNQIVKEGQHVAIVMAGARNQELEEGDGQW 270
Query: 279 TFAVAAEETADVHVSECPCFLLSGNSKGITAKDMWNEIKKHGSFDHLDNDKTSPSEPGSS 338
AV A+V + N G A D+W ++GS ++ N+ +P+
Sbjct: 271 AIAVKLPTQAEVF------YHTHWNPVGDGA-DIWQSFSQNGSLTNVSNN--TPAAKDEQ 321
Query: 339 IGAAIAASLTIPSGSTRSVTFSLAWDCPEVKFFEKV-YHRRYTKFYGTLGDSAARIAHDA 397
IG AIA + G T + F L+WD P ++F E V Y +RYT F+ G +A IA A
Sbjct: 322 IGGAIAVRFMLQPGETWQIPFCLSWDFPVIEFAEGVTYFKRYTDFFDRSGQNAWTIAQTA 381
Query: 398 ILEHAKWECEIEAWQRPILEDKRFPEWYPITLFNELYYLNAGGTIWTDGSPPMQSLATIR 457
+ + W+ +I WQ+PILE P+W+ + LFNELY L GGT+WT +
Sbjct: 382 LNHYKTWQEQIHTWQQPILERNDLPDWFKMALFNELYNLTDGGTLWTAAT---------- 431
Query: 458 ERKFSLDTPSKIHTPASSDTALGTRLLENGEENIGQFLYLEGAEYVMYNTYDVHFYSSFA 517
ER + +GQF LE +Y Y + DV Y SFA
Sbjct: 432 ER-----------------------------DPVGQFAVLECIDYRWYESLDVRLYGSFA 462
Query: 518 LVMLFPKLELSIQRDFAAAVMMHDPGT------MKIMSDGKWVARKCLGAVPHDIGLDDP 571
L+MLFP+LE ++ R FA A+ D T I ++ RK GA PHD+G +
Sbjct: 463 LLMLFPELEKAVIRAFARAIPASDDRTRIIGYYYTIGAESPQAVRKVAGATPHDLGAPNE 522
Query: 572 --WFEINSYNLFNSSRWKDLNSKFVLQVYRDFVATGDKN--FARAVWPSVYIAMAYMEQF 627
W + N + + + WKDL FVLQVYRDF+ TG K+ F + W ++ + Y++ F
Sbjct: 523 HVWQKTNYTSYQDCNLWKDLPCDFVLQVYRDFLYTGSKDIAFLQECWSAIAQTLHYLKTF 582
Query: 628 DKDGDGMIENEGFPDQTYDAWSANGVSAYCGGLW-----------------VAALQAASA 670
D D DG+ EN G PDQT+D W +G+SAYCGGLW ++ +
Sbjct: 583 DYDFDGIPENSGAPDQTFDDWKLDGISAYCGGLWIAALEAALAIGRILAEYTTNMRQLAT 642
Query: 671 LANDVGDHASASYFWVRYQKAKAVY-DSLWNGSYFNYDNSDGSSSTSIQADQLAGQWYAR 729
L+ND+G+ F V ++A+++Y D+LWNG Y+ D++ S S + ADQL GQ+YAR
Sbjct: 643 LSNDIGN------FQVWLEQARSIYHDTLWNGQYYRLDSN--SCSEVVMADQLCGQFYAR 694
Query: 730 ACGLLPIADEAKVKKALTKIYDFNVLKVKGGMCGAMNGMQPDGRIDMSGLQARE 783
L + D + AL IYD +K + +Q + + + RE
Sbjct: 695 LLKLPDVVDLNCAQTALKTIYDTCFIKFNNYLKSGALKLQQFTTVQEARIAERE 748
Score = 58.5 bits (140), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 31/91 (34%), Positives = 51/91 (56%), Gaps = 4/91 (4%)
Query: 763 GAMNGMQPDGRIDM-SGLQAREIWPGVTYGLAASMIQEEMVDMAFQTAAGVYEVAWSEDG 821
GA NG++PDG + + E+W G+ +GLAA ++Q M D A + V + ++
Sbjct: 811 GAANGVKPDGSPENPNSTHPLEVWTGINFGLAAFLVQMGMRDEAMELTQAVVQQIYTN-- 868
Query: 822 LGYSFQTPESWNNNDEYRSLCYMRPLTIWAM 852
G F+TPE+ + +R+ Y+R L IWA+
Sbjct: 869 -GLQFRTPEAITPSGTFRACHYLRALAIWAI 898
>gi|88809081|ref|ZP_01124590.1| hypothetical protein WH7805_05296 [Synechococcus sp. WH 7805]
gi|88787023|gb|EAR18181.1| hypothetical protein WH7805_05296 [Synechococcus sp. WH 7805]
Length = 849
Score = 348 bits (893), Expect = 7e-93, Method: Compositional matrix adjust.
Identities = 254/823 (30%), Positives = 402/823 (48%), Gaps = 112/823 (13%)
Query: 87 QGIPLGGIGAGSIGRSYRGEFQRFKLFHGICDDAPVLANQFSVFVSRPNGEKFSSVLCPR 146
G+PLGG GAG IGRS RG+F + L G + QFS+F + + ++ ++
Sbjct: 70 HGMPLGGFGAGCIGRSSRGDFNLWHLDGGEHWYGTIPDCQFSLFENNGSSKRAHALALKP 129
Query: 147 SPGVPKKNTDSGIESWDW--NLKGENCT--YHALFPRAWTVYDGEPDPELRIVCRQISPF 202
+ N+ + +W+W GE T Y A +P +WT Y+G D E++ C SP
Sbjct: 130 EADASRSNSGEPLAAWNWYPASTGEQSTGTYAARYPLSWTHYEGVFDAEVQ--CEAFSPI 187
Query: 203 IPHNYKESSFPASVFTFTLSNSGQTSADVTLLFTWANSVA-------------GDSGLSG 249
+P +Y+ +S+P +VF +TL N D++LL +W N+V D G
Sbjct: 188 LPGDYQRTSYPVAVFRWTLRNPTDQPLDLSLLLSWRNTVGWFFNTDPSAEVHFRDDGSPE 247
Query: 250 HHF----------NSKTMTKDGVHGLTLHHRTAN--GRPPVTFAVAAEETAD-VHVSECP 296
H++ ++ + ++G+ G+ L + + G + +A +T D V V C
Sbjct: 248 HNYAPAIGHGEGQRNRWLDQEGIRGIVLEGQRSEPIGEGQGQWCLAVPDTLDGVEVLRCS 307
Query: 297 CFLLSGNSKGITAKDMWNEIKKHGSFDHLDNDKTSPSEPGSSIGAAIAASLTIPSGSTRS 356
+ +G+ +++W G + +ND+ S G AAIA + T+ G +R
Sbjct: 308 RWDPTGD-----GQELWQPFAAEGRIPNSNNDRAS--RAGEHASAAIALTFTLAPGESRE 360
Query: 357 VTFSLAWDCPEVKFFEKVYH-RRYTKFYGTLGDSAARIAHDAILEHAKWECEIEAWQRPI 415
+ +++WD P F V RRYT F+G G +AA +A +A+ + W +I+AWQ P+
Sbjct: 361 IPIAISWDLPVTAFASGVRDLRRYTDFFGADGRNAAAMATEALRDWRHWRDQIDAWQAPV 420
Query: 416 LEDKRFPEWYPITLFNELYYLNAGGTIWTDGSPPMQSLATIRERKFSLDTPSKIHTPASS 475
L K PE + LFNELY L +GG++WT S
Sbjct: 421 LARKELPEALRMALFNELYDLASGGSLWTAAS---------------------------- 452
Query: 476 DTALGTRLLENGEENIGQFLYLEGAEYVMYNTYDVHFYSSFALVMLFPKLELSIQRDFAA 535
G++ G+F LE +Y Y + DV Y S L+ L+P+L+ ++ R FA
Sbjct: 453 -----------GKDPHGRFGVLECLDYAWYESLDVRLYGSLGLLQLWPELDKAVLRSFAR 501
Query: 536 AVMMHDPGTMKI-----MSDGKWVA-RKCLGAVPHDIGL-DDPWFEINSYNLF-NSSRWK 587
A+ D I G+ A RK GA PHD+G ++ F+ +Y + + + WK
Sbjct: 502 AIPAADATQRPIGWYFTQGRGRVEADRKVEGATPHDLGAPNEVPFDATNYTAYQDCNLWK 561
Query: 588 DLNSKFVLQVYRDF--VATG-DKNFARAVWPSVYIAMAYMEQFDKDGDGMIENEGFPDQT 644
DL S FVLQV+R F TG D F WP+ A+ Y+++FD + DG+ +N G PDQT
Sbjct: 562 DLASDFVLQVWRTFKLAPTGEDLTFLADCWPAAVQALHYLKRFDVNNDGLPDNGGAPDQT 621
Query: 645 YDAWSANGVSAYCGGLWVAAL--------QAASALANDVGDHASASYFWVRYQKAKAVYD 696
+D W GVSAYCG LW+AAL + L D G W+ ++++A +D
Sbjct: 622 FDDWPLKGVSAYCGALWIAALEAALAMAQRLQQELGLDTGSEQHTFSNWL--EQSRANFD 679
Query: 697 S-LWNGSYFNYDNSDGSSSTSIQADQLAGQWYARACGLLPIADEAKVKKALTKIYDFNVL 755
+ LWNG Y++ D G+ + ADQL G +YAR GL P+ +A + L + +
Sbjct: 680 ALLWNGEYYDIDAESGTP--VVMADQLCGDFYARLLGLEPVVSDANSRSTLKAVKESCFE 737
Query: 756 KVKGGMCGAMNGMQPDGR-IDMSGLQAREIWPGVTYGLAA--SMIQEEMVDMAFQTAAGV 812
K +GG G NG++ DG +D +G E+W G+ +G+A+ ++ E A +A V
Sbjct: 738 KFQGGSLGVANGLRRDGTPLDPNGTHPLEVWTGINFGIASYYQLMGEGQTAEAICSAV-V 796
Query: 813 YEVAWSEDGLGYSFQTPESWNNNDEYRSLCYMRPLTIWAMQWA 855
++V G F+TPE+ + +R+ Y+R + IW + WA
Sbjct: 797 HQVY----SGGLQFRTPEAITAVNTFRACHYLRAMAIWGL-WA 834
>gi|254423918|ref|ZP_05037636.1| conserved hypothetical protein [Synechococcus sp. PCC 7335]
gi|196191407|gb|EDX86371.1| conserved hypothetical protein [Synechococcus sp. PCC 7335]
Length = 828
Score = 347 bits (891), Expect = 1e-92, Method: Compositional matrix adjust.
Identities = 264/855 (30%), Positives = 388/855 (45%), Gaps = 161/855 (18%)
Query: 86 DQGIPLGGIGAGSIGRSYRGEFQRFKLFHG--ICDDAPVLANQFSVFVSRPNGEKFSSVL 143
+ G PLGG GAG IGRS G F + L G + D P QFSVF +GE + +
Sbjct: 40 EHGAPLGGFGAGCIGRSPHGTFNLWHLDGGEHVFDSFP--GCQFSVFEQSADGETKAYAM 97
Query: 144 CPRSPGVPKKNTDSGIESWDWNLKGENC----TYHALFPRAWTVYDGEPDPELRIVCRQI 199
P ++ + SW W G+ TYHAL+PR+W +Y+ +L C Q+
Sbjct: 98 SSDKP-------ETELASWQWYPAGKTTENTGTYHALYPRSWYLYENIFKADL--TCEQL 148
Query: 200 SPFIPHNYKESSFPASVFTFTLSNSGQTSADVTLLFTWANSVAGDSGLSGHHFNSKTMTK 259
SP +NY ESS+P + F +T N ++++ +W N V G NS+T +
Sbjct: 149 SPIWANNYIESSYPTATFIWTAHNHTDQPLTLSIMLSWENMV-------GWFTNSETSPE 201
Query: 260 DGVHGLTLHHRTANGRPPVTFA-VAAEETADVHVSECP-----CFLLSGNSKGITAK--- 310
+ R +G P + + E + + C +L GN G A+
Sbjct: 202 -------IQQR-EDGSPFYDYVPLLRESKGNYNTLVCEEGVARGVVLDGNWTGSPAEGQG 253
Query: 311 --------------------------DMWNEIKKHGSFDHLDNDKTSPSEPGSSIGAAIA 344
++WN+ G +L++ S G IG AIA
Sbjct: 254 QLAIAAHHLEAEVSYHTRWHPDGDGSELWNDFSTTGRLSNLEDVDQSG---GERIGGAIA 310
Query: 345 ASLTIPSGSTRSVTFSLAWDCPEVKFFE-KVYHRRYTKFYGTLGDSAARIAHDAILEHAK 403
T+ G T+ + F+L+WD P +F + K +RRYT F+ G +A IA A+ +A
Sbjct: 311 LRFTLAPGETKQIPFTLSWDLPVTEFAKGKRAYRRYTDFFDRSGRNAWVIASTALNNYAI 370
Query: 404 WECEIEAWQRPILEDKRFPEWYPITLFNELYYLNAGGTIWTDGSPPMQSLATIRERKFSL 463
W+ I WQ+PIL+ P W+ + LFNELY L +GGT+W+ S
Sbjct: 371 WQQNIINWQQPILDRTDLPNWFKMALFNELYDLASGGTLWSAASK--------------- 415
Query: 464 DTPSKIHTPASSDTALGTRLLENGEENIGQFLYLEGAEYVMYNTYDVHFYSSFALVMLFP 523
+ +GQF LE +Y Y + DV Y +AL++L+P
Sbjct: 416 ------------------------SDPVGQFGVLECIDYRWYESLDVRLYGGYALLLLWP 451
Query: 524 KLELSIQRDFAAAVMMHDP-----GTMKIMSDGKWVA-RKCLGAVPHDIGL--DDPWFEI 575
+LE SI R FA A+ + G + + +A RK GA PHD+G ++PW
Sbjct: 452 ELEKSILRAFARAIPTANAKERIIGYYYTVGEADHMAPRKLKGATPHDLGAPNENPWMAT 511
Query: 576 NSYNLFNSSRWKDLNSKFVLQVYRDFVATG--DKNFARAVWPSVYIAMAYMEQFDKDGDG 633
N + + ++WKDL+S FVLQVYR F +TG D +F WP++ + Y + FD DGDG
Sbjct: 512 NYTSYQDCNQWKDLSSDFVLQVYRAFESTGSVDMDFLLDCWPAIVETLGYTKLFDTDGDG 571
Query: 634 MIENEGFPDQTYDAWSANGVSAYCGGLWVAALQAASALANDVGDHASA--------SYFW 685
+IEN G PDQT+D W G+SAYCGGLW+AAL AA + + + + +
Sbjct: 572 LIENSGAPDQTFDDWKLQGISAYCGGLWIAALSAAIKIGEILQESKRVEGTIVILLAQYR 631
Query: 686 VRYQKAKAVYD-SLWNGSYFNYDNSDGSSSTSIQADQLAGQWYARACGLLPIADEAKVKK 744
+Q + Y LWNG Y+ D GS S + ADQL GQ+ A GL + D V+
Sbjct: 632 RWWQHGRTAYQKQLWNGEYYRLDT--GSGSDVVMADQLCGQFCASTMGLPDVVDSEFVEP 689
Query: 745 ALTKIYD-----FNVLKVK---------------------GGMCGAMNGMQPDGR-IDMS 777
L IY+ FN + G G NG++PDG +
Sbjct: 690 TLRAIYEACFVRFNQYTAQLGPQNQKFEGAQTGYFSASELGVKVGCANGVRPDGSPQNPD 749
Query: 778 GLQAREIWPGVTYGLAASMIQEEMVDMAFQTAAGVYEVAWSEDGLGYSFQTPESWNNNDE 837
E+W G+ +GLAA + E + A + V + G F+TPE+
Sbjct: 750 DTHQLEVWTGINFGLAAFLAHEGKLHEAMEITEAVVRQVYEH---GLQFRTPEAITAVGT 806
Query: 838 YRSLCYMRPLTIWAM 852
+R+ Y+RP+ IWA+
Sbjct: 807 FRACHYLRPMAIWAV 821
>gi|254431466|ref|ZP_05045169.1| conserved hypothetical protein [Cyanobium sp. PCC 7001]
gi|197625919|gb|EDY38478.1| conserved hypothetical protein [Cyanobium sp. PCC 7001]
Length = 837
Score = 346 bits (887), Expect = 4e-92, Method: Compositional matrix adjust.
Identities = 259/825 (31%), Positives = 387/825 (46%), Gaps = 110/825 (13%)
Query: 86 DQGIPLGGIGAGSIGRSYRGEFQRFKLFHGICDDAPVLANQFSVFVSRPNGEKFSSVLCP 145
+ G+PLGG GAG IGR+ G F + L G + QF++F NG + ++
Sbjct: 51 NHGMPLGGFGAGCIGRAPDGSFNLWNLDGGEHWFGTLPDCQFALFERDGNGSRAHALAVA 110
Query: 146 RSPGVPKKNTDSGIESWDW----NLKGENCTYHALFPRAWTVYDGEPDPELRIVCRQISP 201
+ + + +W W TY A +P +WT + G R+ C SP
Sbjct: 111 PEADASRPDAGPPLSAWSWYPASTADTSTGTYAARYPLSWTSFSGVF--SARVGCEAFSP 168
Query: 202 FIPHNYKESSFPASVFTFTLSNSGQTSADVTLLFTWANSVA-------------GDSGLS 248
+P +Y+ SS+P +VFT++LSN + +++LL +W N+ D G
Sbjct: 169 ILPGDYERSSYPVAVFTWSLSNPTRRPLELSLLLSWRNTTGWFTNTDPAAAVHFRDDGSP 228
Query: 249 GHHFNSKTMTKDG----------VHGLTLH----HRTANGRPPVTFAVAAEETA---DVH 291
H++ +G + G+ L A G+ A A A +
Sbjct: 229 EHNYVPAIGRSEGQRNRWVDDGPLKGVLLEGPISDPIAEGQGQWCIATDAGLEARHPGLR 288
Query: 292 VSECPCFLLSGNSKGITAKDMWNEIKKHGSFDHLDNDKTSPSEPGSSIGAAIAASLTIPS 351
+ C + +G+ ++W + GS +ND+ S + +S AA+A L +
Sbjct: 289 IHRCSRWNPAGD-----GAELWESFARDGSIPDSNNDRHSGANDPAS--AALAVQLRLEP 341
Query: 352 GSTRSVTFSLAWDCPEVKFFEKVYH-RRYTKFYGTLGDSAARIAHDAILEHAKWECEIEA 410
G++ + ++WD P F RRYT +G G +AA IA +A+ W +I A
Sbjct: 342 GASIDIPLVISWDLPVTAFATGSRALRRYTDHFGADGTNAAAIAAEALGSWPSWREQIAA 401
Query: 411 WQRPILEDKRFPEWYPITLFNELYYLNAGGTIWTDGSPPMQSLATIRERKFSLDTPSKIH 470
WQ+P+LE PE + LFNELY L +GGT+WT SP
Sbjct: 402 WQQPVLERTDLPEPLRMALFNELYDLASGGTLWTAASP---------------------- 439
Query: 471 TPASSDTALGTRLLENGEENIGQFLYLEGAEYVMYNTYDVHFYSSFALVMLFPKLELSIQ 530
E+ +G+F LE +Y Y + DV Y SFAL+ L+P+L+ ++
Sbjct: 440 -----------------EDPVGRFGVLECLDYAWYESLDVRLYGSFALLQLWPELDKAVL 482
Query: 531 RDFAAAVMMHDPGTMKI---MSDGKW---VARKCLGAVPHDIGL--DDPWFEINSYNLFN 582
R FA A+ DP I + GK RK GA PHD+G + PW N +
Sbjct: 483 RSFARAIPAADPTPRPIGWYFTQGKGRVEAPRKVAGATPHDLGAPNERPWDATNYTAYQD 542
Query: 583 SSRWKDLNSKFVLQVYRDF--VATG-DKNFARAVWPSVYIAMAYMEQFDKDGDGMIENEG 639
+ WKDL S FVLQV+R F TG D F WP+ A+ Y++ FD + DG+ +N G
Sbjct: 543 CNLWKDLASDFVLQVWRTFRLAPTGEDLRFLADCWPAAVEALRYLKGFDANSDGLPDNGG 602
Query: 640 FPDQTYDAWSANGVSAYCGGLWVAALQAASALAN----DVGDHASASY--FWVRYQKAKA 693
PDQT+D W GVSAYCG LW+AAL+AA A+ ++G S+ F ++++A
Sbjct: 603 APDQTFDDWPLKGVSAYCGALWIAALEAALAMGQRLQLELGLDTSSQQRDFSSWLEQSRA 662
Query: 694 VYDS-LWNGSYFNYDNSDGSSSTSIQADQLAGQWYARACGLLPIADEAKVKKALTKIYDF 752
+D LWNG Y+N D G+ + ADQL G +YAR L P+ E + + AL I +
Sbjct: 663 NFDRLLWNGEYYNIDADSGTP--VVMADQLCGDFYARLLELPPVVAEERARSALRAIREA 720
Query: 753 NVLKVKGGMCGAMNGMQPDGR-IDMSGLQAREIWPGVTYGLAASMIQEEMVDMAFQ-TAA 810
+GG G NG++ DG +D G E+W G+ +GLAA D A T A
Sbjct: 721 CFEGFQGGSLGVANGLRRDGTPLDPDGTHPLEVWTGINFGLAAYYRLMGETDTALAITGA 780
Query: 811 GVYEVAWSEDGLGYSFQTPESWNNNDEYRSLCYMRPLTIWAMQWA 855
V++V G G F+TPE+ + +R+ Y+R + IWA+ WA
Sbjct: 781 VVHQVY----GGGLQFRTPEAITAVNTFRACHYLRAMAIWAL-WA 820
>gi|318042148|ref|ZP_07974104.1| hypothetical protein SCB01_10580 [Synechococcus sp. CB0101]
Length = 836
Score = 345 bits (885), Expect = 7e-92, Method: Compositional matrix adjust.
Identities = 254/834 (30%), Positives = 392/834 (47%), Gaps = 133/834 (15%)
Query: 86 DQGIPLGGIGAGSIGRSYRGEFQRFKL-----FHGICDDAPVLANQFSVFVSRPNGEKFS 140
+ G+PLGG GAG IGR+ G F + L + G+ D QF++F S NG++
Sbjct: 58 NHGMPLGGFGAGCIGRAPDGSFNLWHLDGGEHWFGVLPDC-----QFALFES--NGQQSR 110
Query: 141 SVLCPRSPGVPKKNTDSG--IESWDW----NLKGENCTYHALFPRAWTVYDGEPDPELRI 194
+ P D+G + SW W + TY A +P +W+ Y G D E +
Sbjct: 111 AHALAVQPERDASRPDAGAPLPSWSWYPASTAERTTGTYAARYPLSWSHYQGVFDAE--V 168
Query: 195 VCRQISPFIPHNYKESSFPASVFTFTLSNSGQTSADVTLLFTWANSVA------------ 242
C+ SP +P NY+++S+P +VF +TL N + D++LL +W N+
Sbjct: 169 SCQAFSPILPGNYQQTSYPLAVFVWTLHNPTRKPLDLSLLLSWRNTSGWFTNTDSSAEVH 228
Query: 243 -GDSGLSGHHFNSKTMTKDG----------VHGLTLHHRTANGRPPVT-----FAVAAEE 286
D G H++ T +G + G+ L +N PV + +A E
Sbjct: 229 FRDDGSPEHNYAPAIGTTEGQRNRWVDDGPLKGVVLERDVSN---PVAEGEGQWCIATGE 285
Query: 287 TADVHVSECPCFLLSGNSKGITAKDMWNEIKKHGSFDHLDNDK-TSPSEPGSSIGAAIAA 345
V + C + G+ ++W+ +GS +ND+ +S S+P S AA+A
Sbjct: 286 QPGVRIQRCSRWNPHGD-----GSELWSSFSANGSIPDSNNDRRSSQSDPAS---AALAV 337
Query: 346 SLTIPSGSTRSVTFSLAWDCPEVKFFEKVYH-RRYTKFYGTLGDSAARIAHDAILEHAKW 404
+ G + + ++WD P F RRYT F+G+ G +A IA + + + +W
Sbjct: 338 QCRLEPGQSIEIPVVISWDLPVTAFATGTSALRRYTDFFGSSGGNATAIAAEGLRDWRRW 397
Query: 405 ECEIEAWQRPILEDKRFPEWYPITLFNELYYLNAGGTIWTDGSPPMQSLATIRERKFSLD 464
+IEAWQ+P+LE PE + LFNELY L +GG++W+ S
Sbjct: 398 REQIEAWQQPVLERSDLPEPLRMALFNELYDLCSGGSLWSAAS----------------- 440
Query: 465 TPSKIHTPASSDTALGTRLLENGEENIGQFLYLEGAEYVMYNTYDVHFYSSFALVMLFPK 524
E+ +G+F LE +Y Y + DV Y S AL+ L+P+
Sbjct: 441 ----------------------REDPVGRFGVLECLDYAWYESLDVRLYGSLALLQLWPE 478
Query: 525 LELSIQRDFAAAVMMHDPGTMKI---MSDGKW---VARKCLGAVPHDIGL--DDPWFEIN 576
L+ ++ R FA A+ DP I + GK ARK GA PHD+G + PW N
Sbjct: 479 LDKAVLRSFARAIPAADPSPRPIGWYFTQGKGRVEAARKVKGATPHDLGAPNERPWDATN 538
Query: 577 SYNLFNSSRWKDLNSKFVLQVYRDFVATG---DKNFARAVWPSVYIAMAYMEQFDKDGDG 633
+ + WKDL S +VLQV+R F D NF WP+ A+ Y++ FD + DG
Sbjct: 539 YTAYQDCNLWKDLASDYVLQVWRTFKLAPNGEDINFLAECWPAAVEALHYLKGFDANNDG 598
Query: 634 MIENEGFPDQTYDAWSANGVSAYCGGLWVAA----------LQAASALANDVGDHASASY 683
+ +N G PDQT+D W GVSAYCG LW+AA LQ ++ L H +S+
Sbjct: 599 LPDNGGAPDQTFDDWPLKGVSAYCGALWIAALEAALAIAQQLQLSTGLDTASEQHTFSSW 658
Query: 684 FWVRYQKAKAVYDS-LWNGSYFNYDNSDGSSSTSIQADQLAGQWYARACGLLPIADEAKV 742
++++A +D LWNG Y++ D G+ + ADQL G +YAR GL + EA
Sbjct: 659 L----EQSRANFDHLLWNGEYYDIDAESGTP--VVMADQLCGDFYARLLGLPAVVSEANS 712
Query: 743 KKALTKIYDFNVLKVKGGMCGAMNGMQPDGR-IDMSGLQAREIWPGVTYGLAASMIQEEM 801
+ L + + + +GG G NG++ DG +D +G E+W G+ +G+A+
Sbjct: 713 RSTLKAVKEACFEQFEGGKLGVANGLRRDGTPLDPNGTHPLEVWTGINFGIASYYRLMGD 772
Query: 802 VDMAFQTAAGVYEVAWSEDGLGYSFQTPESWNNNDEYRSLCYMRPLTIWAMQWA 855
D A A V ++ G F+TPE+ + +R+ Y+R + IW + WA
Sbjct: 773 TDTALAITAAVVNQVYAG---GLQFRTPEAITAVNTFRACHYLRAMAIWGV-WA 822
>gi|326934726|ref|XP_003213436.1| PREDICTED: non-lysosomal glucosylceramidase-like, partial
[Meleagris gallopavo]
Length = 639
Score = 343 bits (880), Expect = 3e-91, Method: Compositional matrix adjust.
Identities = 188/394 (47%), Positives = 247/394 (62%), Gaps = 18/394 (4%)
Query: 472 PASSDTALGTRLLENG--EENIGQFLYLEGAEYVMYNTYDVHFYSSFALVMLFPKLELSI 529
PA S + LL++G E IGQ EY MYNTYDVHFY+SFALVML+PKL++S+
Sbjct: 235 PAGSGRDVWQDLLQDGKLESPIGQ-------EYRMYNTYDVHFYASFALVMLWPKLQISL 287
Query: 530 QRDFAAAVMMHDPGTMKIMSDGKWVARKCLGAVPHDIGL--DDPWFEINSYNLFNSSRWK 587
Q D A V+ D + + GK K VPHDIG D+PW +N+Y + +++ WK
Sbjct: 288 QYDIAVTVLNEDMQPRQYLVCGKTAQVKLKNVVPHDIGDPDDEPWQRVNAYLMHDTANWK 347
Query: 588 DLNSKFVLQVYRDFVATGDKNFARAVWPSVYIAMAYMEQFDKDGDGMIENEGFPDQTYDA 647
DLN KFVLQVYRD+ T D + + +WP M +FD D DG+IEN G DQTYDA
Sbjct: 348 DLNLKFVLQVYRDYYLTHDALYLQDMWPVCQAVMESELKFDTDNDGLIENGGTADQTYDA 407
Query: 648 WSANGVSAYCGGLWVAALQAASALANDVGDHASASYFWVRYQKAKAVYDS-LWNGSYFNY 706
W NG SAYCGGLW+AA++ +A +GD + + QK K ++ LWNG Y+NY
Sbjct: 408 WVVNGASAYCGGLWLAAVRMMCEMAEVLGDAETQQKYGAILQKGKESFERLLWNGKYYNY 467
Query: 707 DNSDGSSSTSIQADQLAGQWYARACGL----LPIADEAKVKKALTKIYDFNVLKVKGGMC 762
D+S S+S+SI +DQ AGQW+ ACGL L + ++ + AL I++ NV+ GG
Sbjct: 468 DSSGSSTSSSIMSDQCAGQWFLGACGLDQKELEVFPKSHIVSALKTIFEKNVMSFAGGTM 527
Query: 763 GAMNGMQPDGRIDMSGLQAREIWPGVTYGLAASMIQEEMVDMAFQTAAGVYEVAWSEDGL 822
GA+NGM+PDG D S +Q+ E+W GV Y LAA+MIQE +V F TA G Y W + L
Sbjct: 528 GAVNGMRPDGVPDTSSVQSSEVWVGVVYALAATMIQEGLVQEGFHTAEGCYRTVW--ENL 585
Query: 823 GYSFQTPESWNNNDEYRSLCYMRPLTIWAMQWAL 856
G +FQTPE++ YRSL YMRPL+IW+MQ AL
Sbjct: 586 GMAFQTPEAYCEKKVYRSLAYMRPLSIWSMQLAL 619
Score = 136 bits (343), Expect = 5e-29, Method: Compositional matrix adjust.
Identities = 118/401 (29%), Positives = 179/401 (44%), Gaps = 41/401 (10%)
Query: 88 GIPLGGIGAGSIGRSYRGEFQRFKLFHGICDDAPVLANQFSVFVSRPNGEKFSSVLCPRS 147
G PLGGIG G+I R +RGEF R++L G V+ +QF+V + R + VL
Sbjct: 31 GCPLGGIGGGTITRGWRGEFCRWQLNPGKYHYETVITDQFTVCLRRKGQTVYQQVLSVEK 90
Query: 148 PGVPKKNTDSGIESWDWNLKGENCTYHALFPRAWTVYDGEPDPELRIVCRQISPFIPHNY 207
P S ++ W+W G YHAL+PRAW VY+ P + + CRQISP IPH+Y
Sbjct: 91 P--------SALQGWNWGYCGRYAFYHALYPRAWMVYE-LPGQNVVLTCRQISPVIPHDY 141
Query: 208 KESSFPASVFTFTLSNSGQTSADVTLLFTWANSVAGDSGLSGHHFNSKTMTKDG---VHG 264
K+SS P VF + + N+ + DV+++FT N SG H+N +DG V G
Sbjct: 142 KDSSLPVGVFIWEVENNSEEPVDVSIMFTLQNGTGTKGDGSGGHWNEPFALQDGGERVTG 201
Query: 265 LTLHHRTANGRPPVTFAVAAEETADVHVSECPCFLLSGNSKGITAKDMWNEIKKHGSFDH 324
+ LHH T P TFAVAA A ++ F +G + +D+W ++ + G +
Sbjct: 202 VLLHHCTPIN--PFTFAVAAXXQAGTVITHLTAFDPAG-----SGRDVWQDLLQDGKLES 254
Query: 325 -LDNDKTSPSEPGSSIGAAIAASLTIPSGSTRSVTFSLAWDCPEVKFFEKVYHRRYTKFY 383
+ + + A+ A + P + SL +D E + R+Y
Sbjct: 255 PIGQEYRMYNTYDVHFYASFALVMLWP-----KLQISLQYDIAVTVLNEDMQPRQY---- 305
Query: 384 GTLGDSAARIAHDAILEHAKWECEIEAWQRP---ILEDKRFPEWYPITLFNEL-----YY 435
+ A++ ++ H + + E WQR ++ D W + L L YY
Sbjct: 306 -LVCGKTAQVKLKNVVPHDIGDPDDEPWQRVNAYLMHDT--ANWKDLNLKFVLQVYRDYY 362
Query: 436 LNAGGTIWTDGSPPMQSLATIRERKFSLDTPSKIHTPASSD 476
L D P Q++ E KF D I ++D
Sbjct: 363 LTHDALYLQDMWPVCQAVME-SELKFDTDNDGLIENGGTAD 402
>gi|87125468|ref|ZP_01081313.1| hypothetical protein RS9917_01806 [Synechococcus sp. RS9917]
gi|86166768|gb|EAQ68030.1| hypothetical protein RS9917_01806 [Synechococcus sp. RS9917]
Length = 832
Score = 342 bits (876), Expect = 8e-91, Method: Compositional matrix adjust.
Identities = 251/824 (30%), Positives = 378/824 (45%), Gaps = 113/824 (13%)
Query: 86 DQGIPLGGIGAGSIGRSYRGEFQRFKLFHGICDDAPVLANQFSVFVSRPNGEKFSSVLCP 145
+ G+PLGG GAG GR+ G F + L G + QF++F NG +
Sbjct: 54 NHGMPLGGFGAGCFGRAPDGSFNLWHLDGGEHWFGALPDCQFALFER--NGSSTRAHALA 111
Query: 146 RSPGVPKKNTDSG--IESWDW----NLKGENCTYHALFPRAWTVYDGEPDPELRIVCRQI 199
P D G + +WDW + TY A +P +WT YDG D E+R C
Sbjct: 112 VKPEADASRPDGGEPLAAWDWYPASTPERSTGTYAARYPLSWTHYDGVYDAEVR--CEAF 169
Query: 200 SPFIPHNYKESSFPASVFTFTLSNSGQTSADVTLLFTWANSVA-------------GDSG 246
SP +P +Y+ +S+P +VF +TL N D++LL +W N+ D G
Sbjct: 170 SPILPGDYQRTSYPVAVFVWTLRNPTPKPLDLSLLLSWRNTSGWFTNTDASAEVHFRDDG 229
Query: 247 LSGHHFNSKTMTKDG----------VHGLTLHHRTANGRPPVT-----FAVAAEETADVH 291
H++ T DG + G+ L +N P+ + +A E V
Sbjct: 230 SPEHNYAPAIGTTDGQRNRWIDDGNLKGVVLEGNVSN---PIAEGEGQWCIATAEQPGVT 286
Query: 292 VSECPCFLLSGNSKGITAKDMWNEIKKHGSFDHLDNDKTSPSEPGSSIGAAIAASLTIPS 351
+ C + G+ ++W+ + GS +ND+ S + AA+A +
Sbjct: 287 IQRCSRWNPHGD-----GNELWSSFRADGSIPDSNNDRRSGHN--DPLSAALAVQCQLAP 339
Query: 352 GSTRSVTFSLAWDCPEVKFFEKVYH-RRYTKFYGTLGDSAARIAHDAILEHAKWECEIEA 410
G + + ++WD P F RRYT F+G G AA IA +A+ + W +IEA
Sbjct: 340 GQSLEIPVVISWDLPVTGFASGSQALRRYTDFFGAEGHQAAAIAAEALRDWRDWRQQIEA 399
Query: 411 WQRPILEDKRFPEWYPITLFNELYYLNAGGTIWTDGSPPMQSLATIRERKFSLDTPSKIH 470
WQ+P+LE PE + LFNELY L +GG++W S TP H
Sbjct: 400 WQQPVLERSDLPEPVRMALFNELYDLCSGGSLW------------------SAATPMDPH 441
Query: 471 TPASSDTALGTRLLENGEENIGQFLYLEGAEYVMYNTYDVHFYSSFALVMLFPKLELSIQ 530
G+F LE +Y Y + DV Y S AL+ L+P+L+ ++
Sbjct: 442 ---------------------GRFGVLECIDYAWYESLDVRLYGSLALLQLWPELDKAVL 480
Query: 531 RDFAAAVMMHDPGTMKI---MSDGKW---VARKCLGAVPHDIGL--DDPWFEINSYNLFN 582
R FA A+ D I + GK RK GA PHD+G ++PW N +
Sbjct: 481 RSFARAIPAADATQRPIGWYFTQGKGRVEADRKVKGATPHDLGAPNENPWDATNYTAYQD 540
Query: 583 SSRWKDLNSKFVLQVYRDFVATG---DKNFARAVWPSVYIAMAYMEQFDKDGDGMIENEG 639
+ WKDL S +VLQV+R F + D NF WP+ A+ Y++ FD + DG+ +N G
Sbjct: 541 CNLWKDLASDYVLQVWRTFKLSPSGEDINFLAECWPAAVEALTYLKGFDTNNDGLPDNGG 600
Query: 640 FPDQTYDAWSANGVSAYCGGLWVAALQAASA------LANDVGDHASASYFWVRYQKAKA 693
PDQT+D W GVSAYCG LW+AAL+AA A LA + A F ++++A
Sbjct: 601 APDQTFDDWPLKGVSAYCGALWIAALEAALAMGQQLQLATGLDTAAEQHTFSTWLEQSRA 660
Query: 694 VYDS-LWNGSYFNYDNSDGSSSTSIQADQLAGQWYARACGLLPIADEAKVKKALTKIYDF 752
+D+ LWNG Y++ D G+ + ADQL G +YAR L + +A + L + +
Sbjct: 661 NFDTLLWNGEYYDIDAESGTP--VVMADQLCGDFYARLLKLPAVVSDANCRSTLKAVKES 718
Query: 753 NVLKVKGGMCGAMNGMQPDGR-IDMSGLQAREIWPGVTYGLAASMIQEEMVDMAFQTAAG 811
+GG G NG++ DG +D G E+W G+ +G+A+ + A +
Sbjct: 719 CFEAFQGGSLGVANGLRRDGTPLDPQGTHPLEVWTGINFGIASYYRLMDDTSTALAITSA 778
Query: 812 VYEVAWSEDGLGYSFQTPESWNNNDEYRSLCYMRPLTIWAMQWA 855
V ++ G F+TPE+ + +R+ Y+R + IW + WA
Sbjct: 779 VVNQVYAG---GLQFRTPEAITAVNTFRACHYLRAMAIWGL-WA 818
>gi|324504438|gb|ADY41918.1| Non-lysosomal glucosylceramidase [Ascaris suum]
Length = 534
Score = 340 bits (872), Expect = 2e-90, Method: Compositional matrix adjust.
Identities = 208/556 (37%), Positives = 288/556 (51%), Gaps = 48/556 (8%)
Query: 329 KTSPSEPGSSIGAAIAASLTIPSGSTRSVTFSLAWDCPEVKFFEKVYHRRYTK--FYGTL 386
K +E +G AI A S+ V F LAWD P VKF R F+
Sbjct: 7 KKERTEGEVDLGVAICAEFPAKCKSSAEVEFVLAWDMPIVKFGAGRRQYRRRYARFFPDA 66
Query: 387 GDSAARIAHDAILEHAKWECEIEAWQRPILEDKRFPEWYPITLFNELYYLNAGGTIWTDG 446
++ A++ +WE +I+AWQ+ IL D P+WY LFNE Y+L GGT W +
Sbjct: 67 SKRVEQMCSRALMSRIEWERKIDAWQQRILSDDSLPDWYKSALFNESYFLTDGGTCWFEY 126
Query: 447 SPPMQSLATIRERKFSLDTPSKIHTPASSDTALGTRLLENGEENIGQFLYLEGAEYVMYN 506
+S ER+ S ++ + G+F YLE EY M N
Sbjct: 127 DDEWRST----ERQMSNESAKYF-------------------KEFGRFAYLEAWEYYMLN 163
Query: 507 TYDVHFYSSFALVMLFPKLELSIQRDFAAAVMMHDPGTMKIMSDGKWVARKCLGAVPHDI 566
TYDVHFYSSFAL+ +P +EL+IQ DFA V+ +++ A K LG +PHD+
Sbjct: 164 TYDVHFYSSFALLENWPLIELAIQLDFADQVLSSCDRKSVNINESTRTAVKRLGRLPHDL 223
Query: 567 G--LDDPWFEINSYNLFNSSRWKDLNSKFVLQVYRDF-----VATGDKN-----FARAVW 614
G +D+PW +N+Y L ++ WKDLN KFVL YRD+ + D N R +
Sbjct: 224 GNPMDEPWLHLNAYALSDTCEWKDLNLKFVLTCYRDYEKIVKIYFNDDNEMKGCLLRRFY 283
Query: 615 PSVYIAMAYMEQFDKDGDGMIENEGFPDQTYDAWSANGVSAYCGGLWVAALQAASALAND 674
+A + +D DGD +IEN G PDQTYD WS +G SAYCGGLW+ AL+ +A
Sbjct: 284 DLSSGIIADAKAWDVDGDDLIENAGQPDQTYDVWSMHGSSAYCGGLWLCALECVRRMALT 343
Query: 675 VGDHASASYFWVRYQKAKAVYD-SLWNGSYFNYDNSDGSSSTSIQADQLAGQWYARACGL 733
+G+ A F + A+ Y+ LWNG YF++D + SI ADQL G W+ C
Sbjct: 344 LGEVVDAQKFANKLNNARKAYERKLWNGKYFDFDEH-STDHKSIMADQLCGFWF--MCIT 400
Query: 734 LPIADEAKVKK-----ALTKIYDFNVLKVKGGMCGAMNGMQPDGRIDMSGLQAREIWPGV 788
D+ + + +L I+++NV K G G +N M P G +D +G+Q+ E+W GV
Sbjct: 401 DGKVDDVIITRQQICASLKTIFEYNVEKFANGQLGPVNAMMPSGVVDSTGIQSEEVWGGV 460
Query: 789 TYGLAASMIQEEMVDMAFQTAAGVYEVAWSEDGLGYSFQTPESWNNNDEYRSLCYMRPLT 848
Y LA+ + E + AF+TA G Y W GL Y Q+PE+ N + YR++ YMRPL
Sbjct: 461 AYALASFHLLVEENESAFKTAEGWYRSCWERYGLQY--QSPEAINESSYYRAIGYMRPLA 518
Query: 849 IWAMQWALTKPKLSRQ 864
IWAMQ AL + RQ
Sbjct: 519 IWAMQSALDALRNKRQ 534
>gi|345289089|gb|AEN81036.1| AT1G33700-like protein, partial [Capsella rubella]
gi|345289091|gb|AEN81037.1| AT1G33700-like protein, partial [Capsella rubella]
gi|345289093|gb|AEN81038.1| AT1G33700-like protein, partial [Capsella rubella]
gi|345289095|gb|AEN81039.1| AT1G33700-like protein, partial [Capsella rubella]
gi|345289097|gb|AEN81040.1| AT1G33700-like protein, partial [Capsella rubella]
gi|345289101|gb|AEN81042.1| AT1G33700-like protein, partial [Capsella rubella]
gi|345289103|gb|AEN81043.1| AT1G33700-like protein, partial [Capsella rubella]
Length = 191
Score = 339 bits (870), Expect = 4e-90, Method: Compositional matrix adjust.
Identities = 155/190 (81%), Positives = 175/190 (92%)
Query: 489 ENIGQFLYLEGAEYVMYNTYDVHFYSSFALVMLFPKLELSIQRDFAAAVMMHDPGTMKIM 548
ENIGQFLYLEG +Y+MYNTYDVHFYSSFAL+MLFPKLELSIQRDFAAAV+MHD ++M
Sbjct: 2 ENIGQFLYLEGTQYLMYNTYDVHFYSSFALLMLFPKLELSIQRDFAAAVLMHDSSRKQVM 61
Query: 549 SDGKWVARKCLGAVPHDIGLDDPWFEINSYNLFNSSRWKDLNSKFVLQVYRDFVATGDKN 608
S G++V RK LGAVPHDIGL+DPWFE+N+YNLFN+ RWKDLNSKFVLQVYRD VATGD N
Sbjct: 62 SSGEFVTRKVLGAVPHDIGLNDPWFEVNAYNLFNTDRWKDLNSKFVLQVYRDVVATGDLN 121
Query: 609 FARAVWPSVYIAMAYMEQFDKDGDGMIENEGFPDQTYDAWSANGVSAYCGGLWVAALQAA 668
FA+AVWPSVY A+AY++QFDKDGDGMIENEGFPDQTYDAWS +GVSAYCGGLWVAALQA
Sbjct: 122 FAKAVWPSVYTAIAYLDQFDKDGDGMIENEGFPDQTYDAWSCSGVSAYCGGLWVAALQAG 181
Query: 669 SALANDVGDH 678
SALA ++GD+
Sbjct: 182 SALAREIGDN 191
>gi|148240031|ref|YP_001225418.1| hypothetical protein SynWH7803_1695 [Synechococcus sp. WH 7803]
gi|147848570|emb|CAK24121.1| Conserved hypothetical protein [Synechococcus sp. WH 7803]
Length = 833
Score = 339 bits (869), Expect = 4e-90, Method: Compositional matrix adjust.
Identities = 251/821 (30%), Positives = 397/821 (48%), Gaps = 108/821 (13%)
Query: 87 QGIPLGGIGAGSIGRSYRGEFQRFKLFHGICDDAPVLANQFSVFVSRPNGEKFSSVLCPR 146
G+PLGG GAG IGRS RG+F + L G + QF++F S + ++ ++
Sbjct: 55 HGMPLGGFGAGCIGRSSRGDFNLWHLDGGEHWYGTIPDCQFALFESNGSSQRAHALAVKP 114
Query: 147 SPGVPKKNTDSGIESWDW----NLKGENCTYHALFPRAWTVYDGEPDPELRIVCRQISPF 202
+ N+ + +WDW + TY A +P +WT Y+G D E + C SP
Sbjct: 115 EADASRPNSAEPLAAWDWYPASTPERSTGTYAARYPLSWTHYEGVYDAE--VSCEAFSPI 172
Query: 203 IPHNYKESSFPASVFTFTLSNSGQTSADVTLLFTWANSVA-------------GDSGLSG 249
+P +Y+ +S+P +VF +TL N D++LL +W N+V D G
Sbjct: 173 LPGDYQRTSYPVAVFRWTLRNPSDQPLDLSLLLSWRNTVGWFFNTDPSAEVHFRDDGSPE 232
Query: 250 HHF----------NSKTMTKDGVHGLTLHHRTAN--GRPPVTFAVAAEETAD-VHVSECP 296
H++ ++ + ++GV G+ L + + G + +A + + V V C
Sbjct: 233 HNYAPAIGQGEGQRNRWLDQEGVRGIVLEGQRSEPIGEGQGQWCLAVPDALEGVEVLRCS 292
Query: 297 CFLLSGNSKGITAKDMWNEIKKHGSFDHLDNDKTSPSEPGSSIGAAIAASLTIPSGSTRS 356
+ +G+ +++W G + +ND++S G AAIA T+ G +R
Sbjct: 293 RWDPTGD-----GQELWQSFAAEGRIPNSNNDRSS--RAGEQASAAIALKFTLAPGESRE 345
Query: 357 VTFSLAWDCPEVKFFEKVYH-RRYTKFYGTLGDSAARIAHDAILEHAKWECEIEAWQRPI 415
+ +++WD P F V RRYT F+G G +AA +A +A+ + +W +I+AWQ P+
Sbjct: 346 IPIAISWDLPVTAFATGVRDLRRYTDFFGADGCNAAALAAEALRDWRQWRDQIDAWQAPV 405
Query: 416 LEDKRFPEWYPITLFNELYYLNAGGTIWTDGSPPMQSLATIRERKFSLDTPSKIHTPASS 475
L + PE + LFNELY L +GG++WT S
Sbjct: 406 LARQELPEALRMALFNELYDLASGGSLWTAAS---------------------------- 437
Query: 476 DTALGTRLLENGEENIGQFLYLEGAEYVMYNTYDVHFYSSFALVMLFPKLELSIQRDFAA 535
G++ G+F LE +Y Y + DV Y SFAL+ L+P+L+ ++ R FA
Sbjct: 438 -----------GKDPHGRFGVLECLDYAWYESLDVRLYGSFALLQLWPELDKAVLRSFAR 486
Query: 536 AVMMHDPGTMKI-----MSDGKWVA-RKCLGAVPHDIGL-DDPWFEINSYNLF-NSSRWK 587
A+ D I G+ A RK GA PHD+G ++ F+ +Y + + + WK
Sbjct: 487 AIPAADATQRPIGWYFTQGRGRVEADRKVKGATPHDLGAPNEVPFDATNYTAYQDCNLWK 546
Query: 588 DLNSKFVLQVYRDF--VATG-DKNFARAVWPSVYIAMAYMEQFDKDGDGMIENEGFPDQT 644
DL S FVLQV+R F TG D F WP+ A+ Y+++FD + DG+ +N G PDQT
Sbjct: 547 DLASDFVLQVWRTFKLAPTGEDLKFLADCWPAAVQALHYLKRFDVNNDGLPDNGGAPDQT 606
Query: 645 YDAWSANGVSAYCGGLWVAAL--------QAASALANDVGDHASASYFWVRYQKAKAVYD 696
+D W GVSAYCG LW+AAL + L D G W+ +++++ +D
Sbjct: 607 FDDWPLKGVSAYCGALWIAALEAALAMAQRLQQELGLDTGSDQHTFSAWL--EQSRSNFD 664
Query: 697 S-LWNGSYFNYDNSDGSSSTSIQADQLAGQWYARACGLLPIADEAKVKKALTKIYDFNVL 755
LWNG Y++ D G+ + ADQL G +YAR GL P+ EA + L + +
Sbjct: 665 RLLWNGEYYDIDAESGTP--VVMADQLCGDFYARLLGLEPVVSEANSRSTLKAVKESCFE 722
Query: 756 KVKGGMCGAMNGMQPDGR-IDMSGLQAREIWPGVTYGLAASMIQEEMVDMAFQTAAGVYE 814
K +GG G NG++ DG +D +G E+W G+ +G+A+ A + V +
Sbjct: 723 KFQGGSLGVANGLRRDGTPLDPNGTHPLEVWTGINFGIASYYRLMGEGQTAEAICSAVVD 782
Query: 815 VAWSEDGLGYSFQTPESWNNNDEYRSLCYMRPLTIWAMQWA 855
+S G F+TPE+ + +R+ Y+R + IW + WA
Sbjct: 783 QVYSG---GLQFRTPEAITAVNTFRACHYLRAMAIWGL-WA 819
>gi|345289099|gb|AEN81041.1| AT1G33700-like protein, partial [Capsella rubella]
Length = 191
Score = 338 bits (866), Expect = 1e-89, Method: Compositional matrix adjust.
Identities = 154/190 (81%), Positives = 175/190 (92%)
Query: 489 ENIGQFLYLEGAEYVMYNTYDVHFYSSFALVMLFPKLELSIQRDFAAAVMMHDPGTMKIM 548
ENIGQFLYLEG +Y+MYNTYDVHFYSSFAL+MLFPKLELSIQRDFAAAV+MHD ++M
Sbjct: 2 ENIGQFLYLEGTQYLMYNTYDVHFYSSFALLMLFPKLELSIQRDFAAAVLMHDSSRKQVM 61
Query: 549 SDGKWVARKCLGAVPHDIGLDDPWFEINSYNLFNSSRWKDLNSKFVLQVYRDFVATGDKN 608
S G++V RK LGAVPHDIGL+DPWFE+N+YNLFN+ RWKDLNSKFVLQVYRD VATGD N
Sbjct: 62 SSGEFVTRKVLGAVPHDIGLNDPWFEVNAYNLFNTDRWKDLNSKFVLQVYRDVVATGDLN 121
Query: 609 FARAVWPSVYIAMAYMEQFDKDGDGMIENEGFPDQTYDAWSANGVSAYCGGLWVAALQAA 668
FA+AVWPSVY A+AY++QFDKDGDGMIEN+GFPDQTYDAWS +GVSAYCGGLWVAALQA
Sbjct: 122 FAKAVWPSVYTAIAYLDQFDKDGDGMIENDGFPDQTYDAWSCSGVSAYCGGLWVAALQAG 181
Query: 669 SALANDVGDH 678
SALA ++GD+
Sbjct: 182 SALAREIGDN 191
>gi|317969095|ref|ZP_07970485.1| hypothetical protein SCB02_06140 [Synechococcus sp. CB0205]
Length = 837
Score = 336 bits (861), Expect = 4e-89, Method: Compositional matrix adjust.
Identities = 249/825 (30%), Positives = 384/825 (46%), Gaps = 115/825 (13%)
Query: 86 DQGIPLGGIGAGSIGRSYRGEFQRFKL-----FHGICDDAPVLANQFSVFVSRPNGEKFS 140
+ G+PLGG GAG IGR+ G F + L + G+ D QFS+F S ++
Sbjct: 59 NHGMPLGGFGAGCIGRAPDGSFNLWHLDGGEHWFGVLPDC-----QFSLFESNGTEQRAH 113
Query: 141 SVLCPRSPGVPKKNTDSGIESWDW----NLKGENCTYHALFPRAWTVYDGEPDPELRIVC 196
++ + + + SW W + + + TY A +P +W+ Y G + E + C
Sbjct: 114 ALAVKPNRDDSRPEAADPLSSWSWYPASSEQRQTGTYAARYPLSWSHYQGVFEAE--VSC 171
Query: 197 RQISPFIPHNYKESSFPASVFTFTLSNSGQTSADVTLLFTWANSVA-------------G 243
SP +P NY+++S+P + F +TL N + + D++L+ +W N+
Sbjct: 172 EAFSPILPGNYQQTSYPLATFVWTLRNPTRQALDLSLMLSWRNTCGWFTNTDSSAEVHFR 231
Query: 244 DSGLSGHHF-----NSKTMTKDGVHGLTLHHRTANG--RPPVT-----FAVAAEETADVH 291
D G H++ S+ D + +L G PV + +A +E A V
Sbjct: 232 DDGSPEHNYAPAIGKSEGHRNDWIDDGSLKGVVLGGDVSSPVAEGEGQWCIATQEQAGVS 291
Query: 292 VSECPCFLLSGNSKGITAKDMWNEIKKHGSFDHLDNDKTS-PSEPGSSIGAAIAASLTIP 350
+ C + G+ D+W + GS +N++ S +EP + AA+A +
Sbjct: 292 IQRCSRWNPQGD-----GSDLWGSFSRDGSIPESNNNRRSGAAEP---VSAALAVRCRLE 343
Query: 351 SGSTRSVTFSLAWDCPEVKFFEKVYH-RRYTKFYGTLGDSAARIAHDAILEHAKWECEIE 409
G + + ++WD P F RRYT F+G G +AA IA + + + W +IE
Sbjct: 344 PGQSIEIPVVISWDLPVTSFATGTSALRRYTDFFGASGKNAAAIAAEGLRDWRSWRQQIE 403
Query: 410 AWQRPILEDKRFPEWYPITLFNELYYLNAGGTIWTDGSPPMQSLATIRERKFSLDTPSKI 469
AWQ+P+LE PE + LFNELY L +GG++W+ S
Sbjct: 404 AWQQPVLERSDLPEPLRMALFNELYDLCSGGSLWSAASR--------------------- 442
Query: 470 HTPASSDTALGTRLLENGEENIGQFLYLEGAEYVMYNTYDVHFYSSFALVMLFPKLELSI 529
E+ +G+F LE +Y Y + DV Y S AL+ L+P+L+ ++
Sbjct: 443 ------------------EDPVGRFGVLECLDYAWYESLDVRLYGSLALLQLWPELDKAV 484
Query: 530 QRDFAAAVMMHDPGTMKI---MSDGKW---VARKCLGAVPHDIGL--DDPWFEINSYNLF 581
R FA A+ D I + GK ARK GA PHD+G + PW N
Sbjct: 485 LRSFARAIPAADATPRPIGWYFTQGKGRVEAARKVKGATPHDLGAPNEKPWDATNYTAYQ 544
Query: 582 NSSRWKDLNSKFVLQVYRDFVATG---DKNFARAVWPSVYIAMAYMEQFDKDGDGMIENE 638
+ + WKDL S +VLQV+R F D F WP+ A+ Y++ FD + DG+ +N
Sbjct: 545 DCNLWKDLASDYVLQVWRTFKLAPNGEDLRFLADCWPAAVEALRYLKTFDINNDGLPDNG 604
Query: 639 GFPDQTYDAWSANGVSAYCGGLWVAALQAASA------LANDVGDHASASYFWVRYQKAK 692
G PDQT+D W GVSAYCG LW+AAL AA A L+ + A F +A+
Sbjct: 605 GAPDQTFDDWPLKGVSAYCGALWIAALDAALAIAQQLQLSTGLDTSAEQQEFSAWLAQAR 664
Query: 693 AVYDS-LWNGSYFNYDNSDGSSSTSIQADQLAGQWYARACGLLPIADEAKVKKALTKIYD 751
+D LWNG Y++ D G + + ADQL G +YAR GL P+ EA + L + D
Sbjct: 665 GNFDRLLWNGEYYDIDAESG--TPVVMADQLCGDFYARLLGLEPVVSEANSRSTLNAVKD 722
Query: 752 FNVLKVKGGMCGAMNGMQPDGR-IDMSGLQAREIWPGVTYGLAASMIQEEMVDMAFQTAA 810
K +GG G NG++ DG +D +G E+W G+ +G+A+ + A +
Sbjct: 723 SCFEKFEGGSLGVANGLRRDGTPLDPNGTHPLEVWTGINFGIASYFRLMGETETALAITS 782
Query: 811 GVYEVAWSEDGLGYSFQTPESWNNNDEYRSLCYMRPLTIWAMQWA 855
V +S G F+TPE+ + +R+ Y+R + IW + WA
Sbjct: 783 AVVNQVYSG---GLQFRTPEAITAVNTFRACHYLRAMAIWGV-WA 823
>gi|116073260|ref|ZP_01470522.1| hypothetical protein RS9916_32457 [Synechococcus sp. RS9916]
gi|116068565|gb|EAU74317.1| hypothetical protein RS9916_32457 [Synechococcus sp. RS9916]
Length = 832
Score = 333 bits (855), Expect = 2e-88, Method: Compositional matrix adjust.
Identities = 258/829 (31%), Positives = 393/829 (47%), Gaps = 123/829 (14%)
Query: 86 DQGIPLGGIGAGSIGRSYRGEFQRFKL-----FHGICDDAPVLANQFSVFVSRPNGEKFS 140
+ G+PLGG GAG IGR+ G + L + G+ D QF++F S +G++
Sbjct: 54 NHGMPLGGFGAGCIGRAPDGNINLWHLDGGEHWFGVLPDC-----QFALFESNGSGKRAH 108
Query: 141 SVLCPRSPGVPKKNTDSGIESWDW----NLKGENCTYHALFPRAWTVYDGEPDPELRIVC 196
++ S + + + +WDW + TY A +P +WT Y+G D E+R C
Sbjct: 109 ALAVKPSADASRPESGEPLPAWDWYPASTAERSTGTYAARYPLSWTSYEGVYDAEVR--C 166
Query: 197 RQISPFIPHNYKESSFPASVFTFTLSNSGQTSADVTLLFTWANSVA-------------G 243
SP +P +Y+ +S+P +VF +TL N D++LL +W N+
Sbjct: 167 EAFSPILPGDYQRTSYPVAVFVWTLHNPTDQPLDLSLLLSWRNTTGWFTNTDASAEVHFR 226
Query: 244 DSGLSGHHFN---SKTMTK------DG-VHGLTLHHRTANGRPPVT-----FAVAAEETA 288
D G H++ KT+ + DG + G+ L +N PV + +A E
Sbjct: 227 DDGSPEHNYAPAIGKTVGQRNRWVDDGNLKGVVLEGNVSN---PVAEGEGQWCIATTEQP 283
Query: 289 DVHVSECPCFLLSGNSKGITAKDMWNEIKKHGSFDHLDNDKTSPSEPGSSIGAAIAASLT 348
V + C + SG+ ++WN GS +ND+TS + + AA+A
Sbjct: 284 GVSIQRCSRWNPSGD-----GSELWNSFSADGSIPESNNDRTSGAN--DPLSAALAVQCQ 336
Query: 349 IPSGSTRSVTFSLAWDCPEVKFFEKVYH-RRYTKFYGTLGDSAARIAHDAILEHAKWECE 407
+ G + + ++WD P F RRYT F+GT GD AA IA +A+ + +W +
Sbjct: 337 LAPGQSLEIPVMISWDLPVTAFATGSQALRRYTDFFGTGGDQAAAIAAEALRDWKQWREQ 396
Query: 408 IEAWQRPILEDKRFPEWYPITLFNELYYLNAGGTIWTDGSPPMQSLATIRERKFSLDTPS 467
I AWQ+P+L+ PE + LFNELY L +GG++W+ SP
Sbjct: 397 IGAWQQPVLQRSELPEPLRMALFNELYDLCSGGSLWSAASP------------------- 437
Query: 468 KIHTPASSDTALGTRLLENGEENIGQFLYLEGAEYVMYNTYDVHFYSSFALVMLFPKLEL 527
E+ G+F LE +Y Y + DV Y S AL+ L+P+L+
Sbjct: 438 --------------------EDPHGRFGVLECLDYAWYESLDVRLYGSLALLQLWPELDK 477
Query: 528 SIQRDFAAAVMMHDPGTMKI---MSDGKW---VARKCLGAVPHDIGLDD--PWFEINSYN 579
++ R FA A+ D I + GK RK GA PHD+G + PW N
Sbjct: 478 AVLRSFARAIPAADATQRPIGWYFTQGKGRVEADRKVKGATPHDLGAPNELPWDATNYTA 537
Query: 580 LFNSSRWKDLNSKFVLQVYRDFVATG---DKNFARAVWPSVYIAMAYMEQFDKDGDGMIE 636
+ + WKDL S +VLQV+R F + D F WPS A+ Y++ FD + DG+ +
Sbjct: 538 YQDCNLWKDLGSDYVLQVWRTFKLSPSGEDVRFLADCWPSAVEALRYLKTFDVNDDGLPD 597
Query: 637 NEGFPDQTYDAWSANGVSAYCGGLWVAALQAASA------LANDVGDHASASYFWVRYQK 690
N G PDQT+D W GVSAYCG LW+AAL+AA A L + A F ++
Sbjct: 598 NGGAPDQTFDDWPLKGVSAYCGALWIAALEAALAIAQTLQLKTGLDTSAEQREFSGWLEQ 657
Query: 691 AKAVYDS-LWNGSYFNYDNSDGSSSTSIQADQLAGQWYARACGLLPIADEAKVKKALTKI 749
++ +D LWNG Y++ D G+ + ADQL G +YAR GL P+ EA + L +
Sbjct: 658 SRGNFDRLLWNGEYYDIDAESGTP--VVMADQLCGDFYARLLGLEPVVSEANSRSTLKAV 715
Query: 750 YDFNVLKVKGGMCGAMNGMQPDGR-IDMSGLQAREIWPGVTYGLAA--SMIQEEMVDMAF 806
+ K GG G NG++ DG +D +G E+W G+ +G+A+ ++ E A
Sbjct: 716 REACFEKFDGGKLGVANGLRRDGTPLDPNGTHPLEVWTGINFGIASYYRLMGEGQTAEAI 775
Query: 807 QTAAGVYEVAWSEDGLGYSFQTPESWNNNDEYRSLCYMRPLTIWAMQWA 855
+A V +V G F+TPE+ + +R+ Y+R + IW + WA
Sbjct: 776 CSAV-VNQVY----AGGLQFRTPEAITAVNTFRACHYLRAMAIWGL-WA 818
>gi|113953483|ref|YP_730032.1| hypothetical protein sync_0818 [Synechococcus sp. CC9311]
gi|113880834|gb|ABI45792.1| conserved hypothetical protein [Synechococcus sp. CC9311]
Length = 833
Score = 331 bits (848), Expect = 1e-87, Method: Compositional matrix adjust.
Identities = 259/828 (31%), Positives = 391/828 (47%), Gaps = 122/828 (14%)
Query: 87 QGIPLGGIGAGSIGRSYRGEFQRFKLFHGICDDAPVLANQFSVFVSRPNGEKFSSVLCPR 146
G+PLGG GAG IGRS RG+F + L G + QF+++ + G++
Sbjct: 55 HGMPLGGFGAGCIGRSSRGDFNLWHLDGGEHWYGSIPDCQFALWEKQ--GDQVRVHALAT 112
Query: 147 SPGVPKKNTDSG--IESWDW----NLKGENCTYHALFPRAWTVYDGEPDPELRIVCRQIS 200
P +SG + SW W + TY A +P +W Y G E + C S
Sbjct: 113 EPARDDSRPESGKPLSSWQWYPASTEENSTGTYAARYPLSWNHYKGVFRAE--VSCEAFS 170
Query: 201 PFIPHNYKESSFPASVFTFTLSNSGQTSADVTLLFTWANSVA-------------GDSGL 247
P +P +Y+ SS+P +VF +TL+N +V+LL +W N+V D G
Sbjct: 171 PILPGDYQRSSYPVAVFRWTLTNPTNKPLEVSLLLSWRNTVGWFTNTDASAEVHFRDDGS 230
Query: 248 SGHHF----------NSKTMTKDGVHGLTLHHRTANGRPPVT-----FAVAAEETAD-VH 291
H++ +++ + DG+ G+ L + + PV + +A +T + V
Sbjct: 231 PEHNYAPAIGDGEGQSNRWIDGDGLSGVLLDGKRST---PVAEGEGQWCLALPDTLEGVE 287
Query: 292 VSECPCFLLSGNSKGITAKDMWNEIKKHGSFDHLDNDKTSPSEPGSSIGAAIAASLTIPS 351
+ C + S + ++W G +ND+ S G AAIA T+
Sbjct: 288 LMRCSRWDPSSD-----GAELWQPFAADGVIPDSNNDRAS--RKGEHASAAIAVKFTLAP 340
Query: 352 GSTRSVTFSLAWDCPEVKFFEKVYH-RRYTKFYGTLGDSAARIAHDAILEHAKWECEIEA 410
G TR + +++WD P F V RRY FYG G AA IA +A+ + W +IEA
Sbjct: 341 GETREIPVAISWDLPVTSFATGVRDLRRYCDFYGVDGCHAAAIASEALRDWRSWHQQIEA 400
Query: 411 WQRPILEDKRFPEWYPITLFNELYYLNAGGTIWTDGSPPMQSLATIRERKFSLDTPSKIH 470
WQ P+L K PE + LFNELY L +GG++WT +
Sbjct: 401 WQEPVLARKELPEELRMALFNELYDLASGGSLWTAAT----------------------- 437
Query: 471 TPASSDTALGTRLLENGEENIGQFLYLEGAEYVMYNTYDVHFYSSFALVMLFPKLELSIQ 530
++ G+F LE +Y Y + DV Y SFAL+ L+P+L+ ++
Sbjct: 438 ----------------SKDPYGRFGVLECLDYAWYESLDVRLYGSFALLQLWPELDKAVI 481
Query: 531 RDFAAAVMMHDPGTMKI-----MSDGKWVA-RKCLGAVPHDIGL-DDPWFEINSYNLF-N 582
R FA A+ D I G+ A RK GA PHD+G ++ F+ +Y + +
Sbjct: 482 RSFARAIPAADATQRPIGWYFTQGRGRVEADRKVKGATPHDLGAPNEVPFDATNYTAYQD 541
Query: 583 SSRWKDLNSKFVLQVYRDF--VATG-DKNFARAVWPSVYIAMAYMEQFDKDGDGMIENEG 639
+ WKDL S +VLQV+R F TG D +F WP+ A+ Y++QFD + DG+ +N G
Sbjct: 542 CNLWKDLASDYVLQVWRTFKLAPTGEDLSFLAECWPAAVQALHYLKQFDVNDDGLPDNGG 601
Query: 640 FPDQTYDAWSANGVSAYCGGLWVAALQAASA--------LANDVGDHASASYFWVRYQKA 691
PDQT+D W GVSAYCG LW+AAL+AA A L D GD W+ +++
Sbjct: 602 APDQTFDDWPLKGVSAYCGALWIAALEAALAMAQRLQLELGLDTGDEQHTFSGWL--EQS 659
Query: 692 KAVYDS-LWNGSYFNYDNSDGSSSTSIQADQLAGQWYARACGLLPIADEAKVKKALTKIY 750
+A +D LWNG Y++ D G+ + ADQL G +YAR GL P+ EA + L +
Sbjct: 660 RANFDKLLWNGEYYDIDAESGTP--VVMADQLCGDFYARLLGLPPVVSEANSRSTLKAVK 717
Query: 751 DFNVLKVKGGMCGAMNGMQPDGR-IDMSGLQAREIWPGVTYGLAA--SMIQEEMVDMAFQ 807
+ GG G NG++ DG +D +G E+W G+ +G+A+ ++ E A
Sbjct: 718 EACFDNFAGGSLGVANGLRRDGTPLDPNGTHPLEVWTGINFGIASYYRLMGEGKTAEAIC 777
Query: 808 TAAGVYEVAWSEDGLGYSFQTPESWNNNDEYRSLCYMRPLTIWAMQWA 855
+A A G F+TPE+ + +R+ Y+R + IW + WA
Sbjct: 778 SAVVTQVYA-----GGLQFRTPEAITAVNTFRACHYLRAMAIWGL-WA 819
>gi|449275451|gb|EMC84314.1| Non-lysosomal glucosylceramidase [Columba livia]
Length = 692
Score = 331 bits (848), Expect = 1e-87, Method: Compositional matrix adjust.
Identities = 172/365 (47%), Positives = 233/365 (63%), Gaps = 7/365 (1%)
Query: 497 LEGAEYVMYNTYDVHFYSSFALVMLFPKLELSIQRDFAAAVMMHDPGTMKIMSDGKWVAR 556
L G EY MYNTYDVHFY+SFAL+ML+PKL++S+Q D A AV+ D + + G+
Sbjct: 314 LPGQEYRMYNTYDVHFYASFALIMLWPKLQISLQYDIAVAVVNEDVQPRQYLMGGQTAPV 373
Query: 557 KCLGAVPHDIG--LDDPWFEINSYNLFNSSRWKDLNSKFVLQVYRDFVATGDKNFARAVW 614
K VPHDIG D+PW +N+Y + +++ WKDLN KFVLQVYRD+ T D + R +W
Sbjct: 374 KTKNVVPHDIGDPADEPWQRVNAYLIHDTANWKDLNLKFVLQVYRDYYLTHDSLYLRDMW 433
Query: 615 PSVYIAMAYMEQFDKDGDGMIENEGFPDQTYDAWSANGVSAYCGGLWVAALQAASALAND 674
P M +FD D DG+IEN GF DQTYDAW NG SAYCGGLW+AA+ +A
Sbjct: 434 PVCQAVMESELKFDMDNDGLIENGGFADQTYDAWVVNGASAYCGGLWLAAVCMMCKMAEV 493
Query: 675 VGDHASASYFWVRYQKAKAVYDS-LWNGSYFNYDNSDGSSSTSIQADQLAGQWYARACGL 733
+GD + +K K ++ LWNG Y+NYD+S ++S+SI +DQ AG + +
Sbjct: 494 LGDAEIQQKYMDILRKGKETFERLLWNGKYYNYDSSGSNTSSSIMSDQCAG--HPKPPLS 551
Query: 734 LPIADEAKVKKALTKIYDFNVLKVKGGMCGAMNGMQPDGRIDMSGLQAREIWPGVTYGLA 793
+ + ++ V AL I++ NV+ GG GA+NGM+P G D S +Q+ E+W GV Y LA
Sbjct: 552 MQVFPKSHVVSALKTIFEKNVMGFAGGTMGAVNGMRPSGVPDTSSVQSNEVWVGVVYALA 611
Query: 794 ASMIQEEMVDMAFQTAAGVYEVAWSEDGLGYSFQTPESWNNNDEYRSLCYMRPLTIWAMQ 853
A+MIQE +V+ F+TA G Y W + LG +FQTPE++ YRSL YMRPL+IW+MQ
Sbjct: 612 ATMIQEGLVEEGFRTAEGCYRTVWEQ--LGMAFQTPEAYREKKVYRSLAYMRPLSIWSMQ 669
Query: 854 WALTK 858
AL +
Sbjct: 670 LALER 674
Score = 116 bits (291), Expect = 5e-23, Method: Compositional matrix adjust.
Identities = 71/210 (33%), Positives = 104/210 (49%), Gaps = 19/210 (9%)
Query: 126 QFSVFVSRPNGEKFSSVLCPRSPGVPKKNTDSGIESWDWNLKGENCTYHALFPRAWTVYD 185
QF+V + + VL P S ++ W+W G YHAL+PRAW VY+
Sbjct: 6 QFTVCLRCKGQTVYQQVLSVERP--------STLQGWNWGYCGHYAFYHALYPRAWMVYE 57
Query: 186 GEPDPELRIVCRQISPFIPHNYKESSFPASVFTFTLSNSGQTSADVTLLFTWANSVAGDS 245
P + + CRQISP IPH+YK+SS P VF + + N DV+++F+ N
Sbjct: 58 -LPGQNVVLTCRQISPVIPHDYKDSSLPVGVFIWEVENGRDEDVDVSIMFSLQNGTGTKE 116
Query: 246 GLSGHHFNSK-TMTKDG--VHGLTLHHRTANGRPPVTFAVAAEETADVHVSECPCFLLSG 302
SG H+N T K+G V G+ LHH T P TFA++A E A V V+ F +G
Sbjct: 117 DRSGGHWNEPFTFEKEGERVAGVLLHHCTHVN--PFTFAISAREKAGVVVTHLTAFNPAG 174
Query: 303 NSKGITAKDMWNEIKKHGSFDHLDNDKTSP 332
+ +++W ++ + G D D + P
Sbjct: 175 -----SGREVWQDLLQDGQLDSSDGESLHP 199
>gi|87301122|ref|ZP_01083963.1| hypothetical protein WH5701_14591 [Synechococcus sp. WH 5701]
gi|87284090|gb|EAQ76043.1| hypothetical protein WH5701_14591 [Synechococcus sp. WH 5701]
Length = 830
Score = 330 bits (846), Expect = 2e-87, Method: Compositional matrix adjust.
Identities = 258/823 (31%), Positives = 390/823 (47%), Gaps = 114/823 (13%)
Query: 86 DQGIPLGGIGAGSIGRSYRGEFQRFKL-----FHGICDDAPVLANQFSVFVSRPNGEKFS 140
+ G PLGG GAG IGR G F + L + G+ D QF++F +
Sbjct: 54 NHGAPLGGFGAGCIGRGPDGSFNLWHLDGGEHWFGVLPDC-----QFALFEHDGTASRAR 108
Query: 141 SV-LCPRSPGVPKKNTDSGIESWDWNLKGENCTYHALFPRAWTVYDGEPDPELRIVCRQI 199
++ + P + ++ + W W L+GE TY A +P W +D + + + C
Sbjct: 109 ALAMAPERDDSRPEASERPLGGWQW-LEGEAGTYAARYPLHW--FDYQDNFHTEVSCEAF 165
Query: 200 SPFIPHNYKESSFPASVFTFTLSNSGQTSADVTLLFTWANSVA-------------GDSG 246
SP +P +Y+ +S+P +VF + L+NS + D++LL +W N+V D G
Sbjct: 166 SPILPGDYQRTSYPVAVFRWRLANSTKKPLDLSLLLSWRNTVGWFTNTDPSAAVTFRDDG 225
Query: 247 LSGHHF----------NSKTMTKDGVHGLTLH----HRTANGRPPVTFAVAAEETADVHV 292
H++ ++ + G+ G+ L A G AV ++ V V
Sbjct: 226 SPEHNYLPAIGAGVGQRNRLIDAPGLKGVLLEGAPSQPIAEGEGQWCLAVP-DQLEGVEV 284
Query: 293 SECPCFLLSGNSKGITAKDMWNEIKKHGSFDHLDNDKTSPSEPGSSIGAAIAASLTIPSG 352
C + SG+ ++W + GS +ND+ S +S AAIA L + G
Sbjct: 285 MRCSRWDPSGD-----GAEIWEPFARDGSIPDSNNDRRSTGSEQAS--AAIALRLRLEPG 337
Query: 353 STRSVTFSLAWDCPEVKFFEKVYH-RRYTKFYGTLGDSAARIAHDAILEHAKWECEIEAW 411
+ + ++WD P F RRYT F+G G SA IA +A+ + +W IEAW
Sbjct: 338 AEIELPVVISWDLPVTAFATGTSALRRYTDFFGAAGTSAVAIAAEALRDWRQWRDAIEAW 397
Query: 412 QRPILEDKRFPEWYPITLFNELYYLNAGGTIWTDGSPPMQSLATIRERKFSLDTPSKIHT 471
Q+P+LE PE + L NELY L +GG++WT
Sbjct: 398 QQPVLERSDLPEPLRMALLNELYDLTSGGSLWT--------------------------- 430
Query: 472 PASSDTALGTRLLENGEENIGQFLYLEGAEYVMYNTYDVHFYSSFALVMLFPKLELSIQR 531
A+S T + +G+F LE +Y Y + DV Y SFAL+ L+P+L+ ++ R
Sbjct: 431 -AASST-----------DPVGRFGVLECLDYAWYESLDVRLYGSFALLQLWPELDKAVLR 478
Query: 532 DFAAAVMMHDPGTMKI---MSDGKW---VARKCLGAVPHDIGL-DDPWFEINSYNLF-NS 583
FA A+ D I + G+ RK GA PHD+G ++ F+ +Y + +
Sbjct: 479 SFARAIPAADATPRPIGWYFTQGRGRVEAPRKVAGATPHDLGAPNERPFDATNYTAYQDC 538
Query: 584 SRWKDLNSKFVLQVYRDF--VATG-DKNFARAVWPSVYIAMAYMEQFDKDGDGMIENEGF 640
+ WKDL S FVLQV+R F TG D F WP+V A+ Y++QFD + DG+ +N G
Sbjct: 539 NLWKDLASDFVLQVWRTFSLAPTGEDLRFLADCWPAVVEALRYLKQFDVNDDGLPDNGGA 598
Query: 641 PDQTYDAWSANGVSAYCGGLWVAALQAASA----LANDVGDHASASY--FWVRYQKAKAV 694
PDQT+D W GVSAYCG LW+AAL+AA A L D+G A F + ++A
Sbjct: 599 PDQTFDDWPLQGVSAYCGALWIAALEAALAMAQRLQLDLGLDTGAEQREFGSWLEPSRAN 658
Query: 695 YDS-LWNGSYFNYDNSDGSSSTSIQADQLAGQWYARACGLLPIADEAKVKKALTKIYDFN 753
+D LWNG Y+ D GS + + ADQL G +YAR GL + + + +L I +
Sbjct: 659 FDRLLWNGEYYRIDA--GSGTPVVMADQLCGDFYARLLGLPQVVSDERALSSLKAIREAC 716
Query: 754 VLKVKGGMCGAMNGMQPDGR-IDMSGLQAREIWPGVTYGLAASMIQEEMVDMAFQTAAGV 812
+ +GG G NG++ DG +D G E+W G+ +GLAA D A + V
Sbjct: 717 FERFEGGRLGVANGLRRDGTPLDPKGTHPLEVWTGINFGLAAYYRLMGQTDTALAITSAV 776
Query: 813 YEVAWSEDGLGYSFQTPESWNNNDEYRSLCYMRPLTIWAMQWA 855
++ G F+TPE+ +R+ Y+R + IWA+ WA
Sbjct: 777 VGQVYAG---GLQFRTPEAITAVGTFRACHYLRAMAIWAL-WA 815
>gi|158299437|ref|XP_319575.4| AGAP008830-PA [Anopheles gambiae str. PEST]
gi|157013519|gb|EAA14848.4| AGAP008830-PA [Anopheles gambiae str. PEST]
Length = 913
Score = 329 bits (844), Expect = 4e-87, Method: Compositional matrix adjust.
Identities = 192/582 (32%), Positives = 304/582 (52%), Gaps = 66/582 (11%)
Query: 45 SFQEFRHLAHIGFRLYRYSKQEEAKGKVPVFNVFRKHHITSDQGIPLGGIGAGSIGRSYR 104
SF++ + + R Y + +G+ + + + + G PLGGIGAG+IGR +
Sbjct: 35 SFRQILSMVPLALRYIPYYWKVAREGRQVLMDYWYTENGKQIYGAPLGGIGAGTIGRGFA 94
Query: 105 GEFQRFKLFHGICDDAPVLANQFSVFVSRPNGEK-FSSVLCPRSPGVPKKNT-------- 155
GEF R++L G+ + V ANQF V + G F S+L V K ++
Sbjct: 95 GEFCRYQLKPGLYEYNTVHANQFIVTIKDETGATIFHSLLSTYRSYVQKIDSHRLAKAFV 154
Query: 156 ------------DSGIESWDWNLKGENCTYHALFPRAWTVYDGEPDPELRIVCRQISPFI 203
+ + SW+ L C+Y AL+PRAW+ YD + +++V RQISP I
Sbjct: 155 LRVWAGFISYRPKTPLASWESGLDASRCSYTALYPRAWSEYD-LSEHGVKLVQRQISPII 213
Query: 204 PHNYKESSFPASVFTFTLSNSGQTSADVTLLFTWANSVAGDSGLSGHHFNSKTMTKDGVH 263
PH+YKESS P +VF +T+ N VTL FT+ N + + T+
Sbjct: 214 PHDYKESSLPCAVFVWTVENVCDKDRQVTLTFTFKNGTGTKKQDAEGGSETSAFTQGNAR 273
Query: 264 GLTLHHRTANGRPPVTFAVAAEETADVHVSECPCFLLSGNSKGITAKDMWNEIKKHGSFD 323
G+++ A P T+ V+ +++++++ C F +GN + +WN++K++G
Sbjct: 274 GVSIRQTIA--EMPCTYCVSCRSSSEINLTRCERFDPTGN-----GEKLWNDLKENGHLT 326
Query: 324 HLDNDKTSPSEPGSSIGAAIAASLTIPSGSTRSVTFSLAWDCPEVKFFE--KVYHRRYTK 381
ND+T S+ + A++ + + G+T + F+L WD P+V F + K YHR YTK
Sbjct: 327 EKSNDETLKSK---DVAVAVSGQILVQPGTTSQLEFALVWDMPKVHFQKRGKEYHRYYTK 383
Query: 382 FYGTLGDSAARIAHDAILEHAKWECEIEAWQRPILEDKRFPEWYPITLFNELYYLNAGGT 441
++G GD+ +I+ A+ + KWE I+ WQRPILED P+WY +FNELY++ GG+
Sbjct: 384 YFGKSGDAGPQISDYALNNYGKWERLIDEWQRPILEDADLPDWYKSAIFNELYFIADGGS 443
Query: 442 IWTDGSPPMQSLATIRERKFSLDTPSKIHTPASSDTALGTRLLENGEENIGQFLYLEGAE 501
+W F++D + + ++ G++ YLEG E
Sbjct: 444 VW-----------------FTMDDQTDLP-------------FDDPRLAYGRYSYLEGHE 473
Query: 502 YVMYNTYDVHFYSSFALVMLFPKLELSIQRDFAAAVMMHDPGTMKIMSDGKWVARKCLGA 561
Y MY TYDVHFY+S AL L+P L++SIQ D+ ++ K + DGK + RK +
Sbjct: 474 YRMYTTYDVHFYASHALASLWPNLQVSIQYDYKDSIEREITEGRKHLYDGKVIPRKIKNS 533
Query: 562 VPHDIG--LDDPWFEINSYNLFNSSRWKDLNSKFVLQVYRDF 601
VPHD+G ++P+ IN+Y + + S W+DLN KF+LQVYRD+
Sbjct: 534 VPHDLGDPAEEPFDLINAYPIHDVSEWRDLNLKFILQVYRDY 575
Score = 213 bits (543), Expect = 3e-52, Method: Compositional matrix adjust.
Identities = 111/263 (42%), Positives = 165/263 (62%), Gaps = 11/263 (4%)
Query: 609 FARAVWPSVYIAMAYMEQFDKDGDGMIENEGFPDQTYDAWSANGVSAYCGGLWVAALQAA 668
+ +A++P+ + + + ++DKDGDG+IEN PDQTYD W +G SAYCGGLW+A+L
Sbjct: 653 YLKAMYPACRVVLEHSLEWDKDGDGLIENSKAPDQTYDTWVMDGPSAYCGGLWLASLHCM 712
Query: 669 SALANDVGDHASASYFWVRYQKAKAVY-DSLWNGSYFNYDNSDGSSSTSIQADQLAGQWY 727
SA+A+ + + + + K +A + + LWNG+Y+ +D + +S SI +DQL G WY
Sbjct: 713 SAMASLLDQNEDSDRYKAVLDKGRASFEEKLWNGTYYRFD-AQSASKNSIMSDQLCGHWY 771
Query: 728 ARACGL-LPIADEAKVKKALTKIYDFNVLKVKGGMCGAMNGMQP------DGRIDMSGLQ 780
R+CG + + V+ A+ IY+ NV++ GG GA+NG P DGR D +Q
Sbjct: 772 LRSCGFDYDVFPKENVRLAMRTIYENNVMRFCGGQLGAVNGYVPSGQPNKDGRPDTVSIQ 831
Query: 781 AREIWPGVTYGLAASMIQEEMVDMAFQTAAGVYEVAWSEDGLGYSFQTPESWNNNDEYRS 840
E+W GVTY LA++MI E M + AF+TA G+Y+ A SE +G +F+TPE+ YR+
Sbjct: 832 GEEVWTGVTYALASTMIHEGMFEEAFKTAGGLYQ-ALSER-IGMNFETPEAVYAERHYRA 889
Query: 841 LCYMRPLTIWAMQWALTKPKLSR 863
+ YMRPL+IW+MQ A KL R
Sbjct: 890 IGYMRPLSIWSMQTAWEMKKLRR 912
>gi|352096901|ref|ZP_08957615.1| Glucosylceramidase [Synechococcus sp. WH 8016]
gi|351675485|gb|EHA58646.1| Glucosylceramidase [Synechococcus sp. WH 8016]
Length = 833
Score = 322 bits (825), Expect = 6e-85, Method: Compositional matrix adjust.
Identities = 257/823 (31%), Positives = 394/823 (47%), Gaps = 112/823 (13%)
Query: 87 QGIPLGGIGAGSIGRSYRGEFQRFKLFHGICDDAPVLANQFSVFVSRPNGEKFSSVLCPR 146
G+PLGG GAG IGRS RG+F + L G + QF+++ + + + ++
Sbjct: 55 HGMPLGGFGAGCIGRSSRGDFNLWHLDGGEHWYGSIPDCQFALWEKQDDQVRVHALATEP 114
Query: 147 SPGVPKKNTDSGIESWDW----NLKGENCTYHALFPRAWTVYDGEPDPELRIVCRQISPF 202
+ + + + SW W + TY A +P +W Y+G E + C SP
Sbjct: 115 TRDDSRPESGKPLSSWQWYPASTEESSTGTYAARYPLSWNHYEGVFRAE--VSCEAFSPI 172
Query: 203 IPHNYKESSFPASVFTFTLSNSGQTSADVTLLFTWANSVA-------------GDSGLSG 249
+P +Y+ SS+P +VF +TL+N +V+LL +W N+V D G
Sbjct: 173 LPGDYQRSSYPVAVFRWTLTNPTTKPLEVSLLLSWRNTVGWFTNTDSSAEVHFRDDGSPE 232
Query: 250 HHF----------NSKTMTKDGVHGLTLHHR--TANGRPPVTFAVAAEETAD-VHVSECP 296
H++ +++ + DG+ G+ L + T + +A + D V + C
Sbjct: 233 HNYAPAIGVGEGQSNRWIDGDGLCGVLLDGKRSTPLAEGEGQWCLALPDRLDGVELMRCS 292
Query: 297 CFLLSGNSKGITAKDMWNEIKKHGSFDHLDNDKTSPSEPGSSIGAAIAASLTIPSGSTRS 356
+ SG+ ++W G +ND+ S G AAIA LT+ G TR
Sbjct: 293 RWDPSGD-----GAELWQPFAADGLIPDSNNDRAS--RAGEHASAAIAVKLTLAPGETRE 345
Query: 357 VTFSLAWDCPEVKFFEKVYH-RRYTKFYGTLGDSAARIAHDAILEHAKWECEIEAWQRPI 415
+ +++WD P F V RRY+ FYG G AA IA +A+ + W +IEAWQ P+
Sbjct: 346 IPVAISWDLPVTSFATGVRDLRRYSDFYGADGCHAAAIAAEALRDWRSWHEQIEAWQAPV 405
Query: 416 LEDKRFPEWYPITLFNELYYLNAGGTIWTDGSPPMQSLATIRERKFSLDTPSKIHTPASS 475
L K PE + LFNELY L +GG++WT AT ++
Sbjct: 406 LARKELPEELRMALFNELYDLASGGSLWT--------AATTKD----------------- 440
Query: 476 DTALGTRLLENGEENIGQFLYLEGAEYVMYNTYDVHFYSSFALVMLFPKLELSIQRDFAA 535
G+F LE +Y Y + DV Y SFAL+ L+P+L+ ++ R FA
Sbjct: 441 --------------PYGRFGVLECLDYAWYESLDVRLYGSFALLQLWPELDKAVIRSFAR 486
Query: 536 AVMMHDPGTMKI-----MSDGKWVA-RKCLGAVPHDIGL-DDPWFEINSYNLF-NSSRWK 587
A+ D I G+ A RK GA PHD+G ++ F+ +Y + + + WK
Sbjct: 487 AIPAADATQRPIGWYFTQGRGRVEADRKVKGATPHDLGAPNEVPFDATNYTAYQDCNLWK 546
Query: 588 DLNSKFVLQVYRDF--VATG-DKNFARAVWPSVYIAMAYMEQFDKDGDGMIENEGFPDQT 644
DL S +VLQV+R F TG D +F WP+ A+ Y++QFD + DG+ +N G PDQT
Sbjct: 547 DLASDYVLQVWRTFKLAPTGEDLSFLAECWPAAVQALHYLKQFDVNDDGLPDNGGAPDQT 606
Query: 645 YDAWSANGVSAYCGGLWVAALQAASA--------LANDVGDHASASYFWVRYQKAKAVYD 696
+D W GVSAYCG LW+AAL+AA A L D GD W+ ++++A +D
Sbjct: 607 FDDWPLKGVSAYCGALWIAALEAALAMAQRLQLELGLDTGDEQHTFSGWL--EQSRANFD 664
Query: 697 S-LWNGSYFNYDNSDGSSSTSIQADQLAGQWYARACGLLPIADEAKVKKALTKIYDFNVL 755
+ LWNG Y++ D G+ + ADQL G +YAR L P+ EA + L + +
Sbjct: 665 TLLWNGEYYDIDAESGTP--VVMADQLCGDFYARLLELPPVVSEANSRSTLKAVKEACFD 722
Query: 756 KVKGGMCGAMNGMQPDGR-IDMSGLQAREIWPGVTYGLAA--SMIQEEMVDMAFQTAAGV 812
GG+ G NG++ DG +D +G E+W G+ +G+A+ ++ E A +A
Sbjct: 723 NFAGGLLGVANGLRRDGTPLDPNGTHPLEVWTGINFGIASYYRLMGEGPTAEAICSAVVT 782
Query: 813 YEVAWSEDGLGYSFQTPESWNNNDEYRSLCYMRPLTIWAMQWA 855
A G F+TPE+ + +R+ Y+R + IW + WA
Sbjct: 783 QVYA-----GGLQFRTPEAITAVNTFRACHYLRAMAIWGL-WA 819
>gi|260434763|ref|ZP_05788733.1| conserved hypothetical protein [Synechococcus sp. WH 8109]
gi|260412637|gb|EEX05933.1| conserved hypothetical protein [Synechococcus sp. WH 8109]
Length = 832
Score = 321 bits (822), Expect = 1e-84, Method: Compositional matrix adjust.
Identities = 241/829 (29%), Positives = 385/829 (46%), Gaps = 123/829 (14%)
Query: 86 DQGIPLGGIGAGSIGRSYRGEFQRFKL-----FHGICDDAPVLANQFSVFVSRPNGEKFS 140
+ G+PLGG GAG GR+ G + L + G+ D QF++F G++
Sbjct: 54 NHGMPLGGFGAGCFGRAPDGNINLWHLDGGEHWFGVLPDC-----QFALFEGNGRGKRAH 108
Query: 141 SVLCPRSPGVPKKNTDSGIESWDW----NLKGENCTYHALFPRAWTVYDGEPDPELRIVC 196
++ ++ +++W+W TY A +P +WT Y+G D E+R C
Sbjct: 109 ALAVQPDKDASRQEAGQPLKAWEWYPASTPDRSTGTYAARYPFSWTSYEGVYDAEVR--C 166
Query: 197 RQISPFIPHNYKESSFPASVFTFTLSNSGQTSADVTLLFTWANSVA-------------G 243
SP +P +Y+ +S+P +VF +TL N D++LL W N+
Sbjct: 167 EAFSPILPGDYQRTSYPVAVFIWTLRNPTDQPLDLSLLLCWRNTTGWFTNTDASAEVHFR 226
Query: 244 DSGLSGHHF----------NSKTMTKDGVHGLTLHHRTANGRPPVT-----FAVAAEETA 288
D G H++ ++ + + G+ L +N PV + +AA E
Sbjct: 227 DDGSPEHNYAPAIGSTAGQRNRCIDYGSLKGVVLEGNASN---PVAEGEGQWCIAAAEQP 283
Query: 289 DVHVSECPCFLLSGNSKGITAKDMWNEIKKHGSFDHLDNDKTSPSEPGSSIGAAIAASLT 348
V + C + SG+ +++W+ GS +ND+ S S+ + AA+A
Sbjct: 284 GVTIQRCSRWNPSGD-----GRELWDSFSADGSIPESNNDRRSGSD--DPLSAALAVQCQ 336
Query: 349 IPSGSTRSVTFSLAWDCPEVKFFEKVYH-RRYTKFYGTLGDSAARIAHDAILEHAKWECE 407
+ G + + ++WD P F RRYT F+ + A IA +++ + W +
Sbjct: 337 LAPGQSIEIPLVISWDLPVTGFATGSQALRRYTDFFAADANQAVAIAAESLRDWRSWRQK 396
Query: 408 IEAWQRPILEDKRFPEWYPITLFNELYYLNAGGTIWTDGSPPMQSLATIRERKFSLDTPS 467
I+AWQ P+L+ + PE + LFNELY L +GG++W+ SP
Sbjct: 397 IDAWQEPVLQRQDLPEPLRMALFNELYDLCSGGSLWSAASP------------------- 437
Query: 468 KIHTPASSDTALGTRLLENGEENIGQFLYLEGAEYVMYNTYDVHFYSSFALVMLFPKLEL 527
E+ G+F LE +Y Y + DV Y S AL+ L+P+L+
Sbjct: 438 --------------------EDPYGRFGVLECLDYAWYESLDVRLYGSLALLQLWPELDK 477
Query: 528 SIQRDFAAAVMMHDPGTMKI---MSDGKW---VARKCLGAVPHDIGLDD--PWFEINSYN 579
++ R FA A+ D I + GK RK GA PHD+G + PW N
Sbjct: 478 AVLRSFARAIPAADATQRPIGWYFTQGKGRVEADRKVKGATPHDLGAPNEIPWDATNYTA 537
Query: 580 LFNSSRWKDLNSKFVLQVYRDF--VATG-DKNFARAVWPSVYIAMAYMEQFDKDGDGMIE 636
+ + WKDL S FVLQV+R F +G D F WP+ A+ +++ FD + DG+ +
Sbjct: 538 YQDCNLWKDLGSDFVLQVWRSFKLAPSGEDIRFLADCWPAAVEALRFLKTFDVNNDGLPD 597
Query: 637 NEGFPDQTYDAWSANGVSAYCGGLWVAALQAASA------LANDVGDHASASYFWVRYQK 690
N G PDQT+D W GVSAYCG LW+AAL+AA A L+ + A F ++
Sbjct: 598 NGGAPDQTFDDWPLKGVSAYCGALWIAALEAALAIAQTLQLSTGLDTSAEQREFSGWLEQ 657
Query: 691 AKAVYDS-LWNGSYFNYDNSDGSSSTSIQADQLAGQWYARACGLLPIADEAKVKKALTKI 749
++ +D LWNG Y++ D G+ + ADQL G +YAR GL P+ ++ + L +
Sbjct: 658 SRGNFDKLLWNGEYYDIDAESGTP--VVMADQLCGDFYARLLGLPPVVSDSNSRSTLKVV 715
Query: 750 YDFNVLKVKGGMCGAMNGMQPDGR-IDMSGLQAREIWPGVTYGLAA--SMIQEEMVDMAF 806
+ GG G NG++ DG +D +G E+W G+ +G+A+ ++ ++ A
Sbjct: 716 KEACFEAFDGGSLGVANGLRRDGTPLDPNGTHPLEVWTGINFGIASYYRLMGDKQTAQAI 775
Query: 807 QTAAGVYEVAWSEDGLGYSFQTPESWNNNDEYRSLCYMRPLTIWAMQWA 855
+A V E +S G F+TPE+ + +R+ Y+R + IW WA
Sbjct: 776 CSA--VVEQVYSG---GLQFRTPEAITAVNTFRACHYLRAMAIWGF-WA 818
>gi|78212458|ref|YP_381237.1| hypothetical protein Syncc9605_0920 [Synechococcus sp. CC9605]
gi|78196917|gb|ABB34682.1| conserved hypothetical protein [Synechococcus sp. CC9605]
Length = 832
Score = 320 bits (819), Expect = 3e-84, Method: Compositional matrix adjust.
Identities = 240/824 (29%), Positives = 373/824 (45%), Gaps = 113/824 (13%)
Query: 86 DQGIPLGGIGAGSIGRSYRGEFQRFKL-----FHGICDDAPVLANQFSVFVSRPNGEKFS 140
+ G+PLGG GAG GR+ G + L + G+ D QF++F G++
Sbjct: 54 NHGMPLGGFGAGCFGRAPDGNINLWHLDGGEHWFGVLPDC-----QFALFEGNGRGKRAH 108
Query: 141 SVLCPRSPGVPKKNTDSGIESWDW----NLKGENCTYHALFPRAWTVYDGEPDPELRIVC 196
++ + ++ W+W TY A +P +WT Y+G D E+R C
Sbjct: 109 ALAVQPEQDASRPEAGQPLKVWEWYPASTPDRSTGTYAARYPLSWTSYEGVYDAEVR--C 166
Query: 197 RQISPFIPHNYKESSFPASVFTFTLSNSGQTSADVTLLFTWANSVA-------------G 243
SP +P +Y+ +S+P VF +TL N D++LL +W N+
Sbjct: 167 EAFSPILPGDYQRTSYPVVVFVWTLRNPTDQPLDLSLLLSWRNTTGWFTNTDASAEVHFR 226
Query: 244 DSGLSGHHFNSKTMTKDGVHGLTLHHRTANG-------RPPVT-----FAVAAEETADVH 291
D G H++ + G + + G PV + +A E V
Sbjct: 227 DDGSPEHNYAPAIGSTAGQRNCCIDDGSLKGVLLEGNVSDPVAEGEGQWCIATAEQPGVT 286
Query: 292 VSECPCFLLSGNSKGITAKDMWNEIKKHGSFDHLDNDKTSPSEPGSSIGAAIAASLTIPS 351
+ C + SG+ +++W+ GS +ND+ S S+ + AA+A +
Sbjct: 287 IQRCSRWNPSGD-----GRELWDSFSADGSIPESNNDRRSGSD--DPLSAALAVQCQLAP 339
Query: 352 GSTRSVTFSLAWDCPEVKFFEKVYH-RRYTKFYGTLGDSAARIAHDAILEHAKWECEIEA 410
G + + ++WD P F RRYT F+ + A IA +A+ + +W +IEA
Sbjct: 340 GQSIEIPLVISWDLPVTAFATGSQALRRYTDFFAADANQAVAIAAEALRDWRRWRQQIEA 399
Query: 411 WQRPILEDKRFPEWYPITLFNELYYLNAGGTIWTDGSPPMQSLATIRERKFSLDTPSKIH 470
WQ+P+L+ + P+ + LFNELY L +GG++W+ SP
Sbjct: 400 WQQPVLQRQDLPKPLRMALFNELYDLCSGGSLWSAASP---------------------- 437
Query: 471 TPASSDTALGTRLLENGEENIGQFLYLEGAEYVMYNTYDVHFYSSFALVMLFPKLELSIQ 530
E+ G+F LE +Y Y + DV Y S AL+ L+P+L+ ++
Sbjct: 438 -----------------EDPYGRFGVLECLDYAWYESLDVRLYGSLALLQLWPELDKAVL 480
Query: 531 RDFAAAVMMHDPGTMKI---MSDGKW---VARKCLGAVPHDIGLDD--PWFEINSYNLFN 582
R FA A+ D I + GK RK GA PHD+G + PW N +
Sbjct: 481 RSFARAIPAADATQRPIGWYFTQGKGRVEADRKVKGATPHDLGAPNEIPWDATNYTAYQD 540
Query: 583 SSRWKDLNSKFVLQVYRDF--VATG-DKNFARAVWPSVYIAMAYMEQFDKDGDGMIENEG 639
+ WKDL S FVLQV+R F +G D F WP+ A+ Y++ FD + DG+ +N G
Sbjct: 541 CNLWKDLGSDFVLQVWRTFKLAPSGEDIRFLADCWPAAVEALRYLKTFDVNNDGLPDNGG 600
Query: 640 FPDQTYDAWSANGVSAYCGGLWVAALQAASA------LANDVGDHASASYFWVRYQKAKA 693
PDQT+D W GVSAYCG LW+AAL+AA A L+ + A F ++++
Sbjct: 601 APDQTFDDWPLKGVSAYCGALWIAALEAALAIAQTLQLSTGLDTSAEQREFSGWLEQSRG 660
Query: 694 VYDS-LWNGSYFNYDNSDGSSSTSIQADQLAGQWYARACGLLPIADEAKVKKALTKIYDF 752
+D LWNG Y++ D G+ + ADQL G +YAR L P+ +A L + +
Sbjct: 661 NFDKLLWNGEYYDIDAESGTP--VVMADQLCGDFYARLLALPPVVSDANSLSTLKAVKEA 718
Query: 753 NVLKVKGGMCGAMNGMQPDGR-IDMSGLQAREIWPGVTYGLAASMIQEEMVDMAFQTAAG 811
GG G NG++ DG +D + E+W G+ +G+A+ A +
Sbjct: 719 CFEAFDGGSLGVANGLRRDGTPLDPNATHPLEVWTGINFGIASYYRLMGDKQTAQSICSA 778
Query: 812 VYEVAWSEDGLGYSFQTPESWNNNDEYRSLCYMRPLTIWAMQWA 855
V E +S G F+TPE+ + YR+ Y+R + IW + WA
Sbjct: 779 VVEQVYSS---GLQFRTPEAITAVNTYRACHYLRAMAIWGL-WA 818
>gi|78185056|ref|YP_377491.1| hypothetical protein Syncc9902_1489 [Synechococcus sp. CC9902]
gi|78169350|gb|ABB26447.1| conserved hypothetical protein [Synechococcus sp. CC9902]
Length = 823
Score = 318 bits (815), Expect = 9e-84, Method: Compositional matrix adjust.
Identities = 249/823 (30%), Positives = 388/823 (47%), Gaps = 110/823 (13%)
Query: 86 DQGIPLGGIGAGSIGRSYRGEFQRFKLFHGICDDAPVLANQFSVFVSRPNGEKFSSVLCP 145
+ G+PLGG+GAG GR+ G + L G + QFS++ + + + ++
Sbjct: 44 NHGMPLGGLGAGCFGRAPDGNTNLWHLDGGEHWFGTLPDCQFSLWEKQGDDVRAHALAVA 103
Query: 146 RSPGVPKKNTDSGIESWDWNLKGEN--CT--YHALFPRAWTVYDGEPDPELR--IVCRQI 199
+ + + + +W W N CT Y A +P +WT YDG R ++C
Sbjct: 104 PEGDASRPDAAASLPAWSWYPASTNKRCTGTYAARYPLSWTHYDGV----FRSGVLCEAF 159
Query: 200 SPFIPHNYKESSFPASVFTFTLSNSGQTSADVTLLFTWANSVA-------------GDSG 246
SP +P +Y+ SS+P +VF + L+N +++L+ +W N+V D G
Sbjct: 160 SPILPGDYQRSSYPVAVFCWQLANPTDQPLELSLMVSWRNTVGWFTNTDASAEVHFRDDG 219
Query: 247 LSGHHFN----------SKTMTKDGVHGLTLHHRTAN----GRPPVTFAVAAEETADVHV 292
H+++ ++ ++ G+ G+ + R++ G AV ++ A V V
Sbjct: 220 SPEHNYSPAIGRGAGQRNRAHSESGLTGVLMEGRSSEPIQEGEGQWCLAVP-DDLAGVEV 278
Query: 293 SECPCFLLSGNSKGITAKDMWNEIKKHGSFDHLDNDKTSPSEPGSSIGAAIAASLTIPSG 352
C + SG+ D+W G DND+ S G AA+ + G
Sbjct: 279 LRCSRWDPSGD-----GSDIWTPFSAEGRIPESDNDRES--RAGEQASAAMVVKFILAPG 331
Query: 353 STRSVTFSLAWDCPEVKFFEKVY-HRRYTKFYGTLGDSAARIAHDAILEHAKWECEIEAW 411
+R + ++WD P F RRYT F+G+ G +AA IA +A+ + + W IEAW
Sbjct: 332 ESREIPVVISWDLPITAFASGCADRRRYTDFFGSDGRNAAAIAAEALRDWSDWRGRIEAW 391
Query: 412 QRPILEDKRFPEWYPITLFNELYYLNAGGTIWTDGSPPMQSLATIRERKFSLDTPSKIHT 471
Q+P+LE K PE + LFNELY L +GG++WT +
Sbjct: 392 QQPVLERKALPEPVRMALFNELYDLCSGGSLWTAAT------------------------ 427
Query: 472 PASSDTALGTRLLENGEENIGQFLYLEGAEYVMYNTYDVHFYSSFALVMLFPKLELSIQR 531
++ G+F LE +Y Y + DV Y S AL+ L+P+L+ ++ R
Sbjct: 428 ---------------AQDPYGRFGVLECLDYAWYESLDVRLYGSLALLQLWPELDKAVLR 472
Query: 532 DFAAAVMMHDPGTMKI-----MSDGKWVA-RKCLGAVPHDIGLDDPW-FEINSYNLF-NS 583
F+ A+ HD I G+ A RK GA PHD+G + F+ +Y + +
Sbjct: 473 SFSRAIPAHDATQRPIGWYFTQGRGRVEADRKVAGATPHDLGAPNELPFDATNYTAYQDC 532
Query: 584 SRWKDLNSKFVLQVYRDF--VATG-DKNFARAVWPSVYIAMAYMEQFDKDGDGMIENEGF 640
+ WKDL S FVLQV+R F TG D NF WPS A+ Y++ FD + DG+ +N G
Sbjct: 533 NLWKDLASDFVLQVWRTFQLAPTGEDLNFLADCWPSAVRALDYLKGFDVNDDGLPDNGGA 592
Query: 641 PDQTYDAWSANGVSAYCGGLWVAALQAASA----LANDVGDHASASY--FWVRYQKAKAV 694
PDQT+D W GVSAYCG LW+AAL+AA A L D+G + F +++++
Sbjct: 593 PDQTFDDWPLKGVSAYCGALWIAALEAALAMAQRLQLDLGLDTTVEQRRFGQWLEQSRSN 652
Query: 695 YDS-LWNGSYFNYDNSDGSSSTSIQADQLAGQWYARACGLLPIADEAKVKKALTKIYDFN 753
+D LWNG Y+ D G+ + ADQL G +YAR GL + +A L + D
Sbjct: 653 FDRLLWNGEYYEIDAESGTP--VVMADQLCGDFYARLLGLPSVVSDANALSTLNVVRDAC 710
Query: 754 VLKVKGGMCGAMNGMQPDGR-IDMSGLQAREIWPGVTYGLAASMIQEEMVDMAFQTAAGV 812
+ +GG G NG++ DG +D +G E+W G+ +G+A+ D A V
Sbjct: 711 FERFEGGTLGVANGLRRDGTPLDPNGTHPLEVWTGINFGIASYFRLMGQSDTALAITGAV 770
Query: 813 YEVAWSEDGLGYSFQTPESWNNNDEYRSLCYMRPLTIWAMQWA 855
+S G F+TPE+ + +R+ Y+R + IW + WA
Sbjct: 771 VNQVYSG---GLQFRTPEAITAVNTFRACHYLRAMAIWGL-WA 809
>gi|115633999|ref|XP_780575.2| PREDICTED: non-lysosomal glucosylceramidase-like, partial
[Strongylocentrotus purpuratus]
Length = 597
Score = 314 bits (804), Expect = 2e-82, Method: Compositional matrix adjust.
Identities = 183/492 (37%), Positives = 253/492 (51%), Gaps = 42/492 (8%)
Query: 70 GKVPVFNVFRKHHITSDQGIPLGGIGAGSIGRSYRGEFQRFKLFHGICDDAPVLANQFSV 129
GK P +V G+P+GGIG G+I R ++G+F R+ L G V NQF+V
Sbjct: 136 GKRPFIDVLHPMKFKPVYGVPVGGIGCGAIDRGWKGDFCRWSLSPGRYTYDNVELNQFTV 195
Query: 130 FVSRPNGEKFSSVLCPRSPGVPKKNTDSGIESWDWNLKGENCTYHALFPRAWTVYDGEPD 189
V R + VL P P N D ++W+W G +YH L+PRAWT Y P
Sbjct: 196 CVRRGGKTVYQQVLSPNKP----HNKDLA-QAWEWQFPGNRASYHGLYPRAWTKY-LLPG 249
Query: 190 PELRIVCRQISPFIPHNYKESSFPASVFTFTLSNSGQTSADVTLLFTWANSVAGDSGLSG 249
+ +VCRQISP PHNY+E+S P VF + + N G +ADV+++FTW N + G
Sbjct: 250 QNITLVCRQISPVFPHNYQETSLPVGVFVWDVYNHGTAAADVSIMFTWKNGTGTSADKKG 309
Query: 250 HHFNSKTMTKDGVHGLTLHHRTANGRP--PVTFAVAAEETADVHVSECPCFLLSGNSKGI 307
+N+ + G+ H +G P P T +AA + DV V+ C F S
Sbjct: 310 GRWNTG-FSSASTSGVMFH----DGHPTQPCTMCIAAADGDDVTVTTCASFNPSSQ---- 360
Query: 308 TAKDMWNEIKKHGSFDHLDNDKTSPSEPGSSIGAAIAASLTIPSGSTRSVTFSLAWDCPE 367
K++WN++K+ G + T +E G ++ A+ +S I + F+L+WD P+
Sbjct: 361 -LKELWNDLKEDGQLN-TTKGPTERTEKGETLAGAVVSSRHIEPNQQGLLEFALSWDMPK 418
Query: 368 VKFFEKVYHRRYTKFYGTLGDSAARIAHDAILEHAKWECEIEAWQRPILEDKRFPEWYPI 427
V+F Y RRYT+++G GDSA + A+ E+ WE I WQ PILE+ R P WY
Sbjct: 419 VQFDSCSYFRRYTRWFGRDGDSALALCSHALEEYGDWEKNINEWQNPILENDRLPAWYKS 478
Query: 428 TLFNELYYLNAGGTIWTDGSPPMQSLATIRERKFSLDTPSKIHTPASSDTALGTRLLENG 487
LFNELY++ GG IW D + + HT ASS
Sbjct: 479 ALFNELYFMTDGGGIWVDVDDEYP------------NGENNAHTRASSLV---------- 516
Query: 488 EENIGQFLYLEGAEYVMYNTYDVHFYSSFALVMLFPKLELSIQRDFAAAVMMHDPGTMKI 547
G+F YLEG EY MYNTYDVHFY+SFAL+ML+PKLELS+Q DF V D + +
Sbjct: 517 -REYGRFGYLEGHEYRMYNTYDVHFYASFALIMLWPKLELSMQYDFGEYVDHKDEDSFQD 575
Query: 548 MSDGKWVARKCL 559
+S G R+ L
Sbjct: 576 LSSGLRSIRRVL 587
>gi|390344694|ref|XP_001200450.2| PREDICTED: uncharacterized protein LOC764213 [Strongylocentrotus
purpuratus]
Length = 1283
Score = 313 bits (803), Expect = 2e-82, Method: Compositional matrix adjust.
Identities = 183/492 (37%), Positives = 253/492 (51%), Gaps = 42/492 (8%)
Query: 70 GKVPVFNVFRKHHITSDQGIPLGGIGAGSIGRSYRGEFQRFKLFHGICDDAPVLANQFSV 129
GK P +V G+P+GGIG G+I R ++G+F R+ L G V NQF+V
Sbjct: 136 GKRPFIDVLHPMKFKPVYGVPVGGIGCGAIDRGWKGDFCRWSLSPGRYTYDNVELNQFTV 195
Query: 130 FVSRPNGEKFSSVLCPRSPGVPKKNTDSGIESWDWNLKGENCTYHALFPRAWTVYDGEPD 189
V R + VL P P N D ++W+W G +YH L+PRAWT Y P
Sbjct: 196 CVRRGGKTVYQQVLSPNKP----HNKDLA-QAWEWQFPGNRASYHGLYPRAWTKY-LLPG 249
Query: 190 PELRIVCRQISPFIPHNYKESSFPASVFTFTLSNSGQTSADVTLLFTWANSVAGDSGLSG 249
+ +VCRQISP PHNY+E+S P VF + + N G +ADV+++FTW N + G
Sbjct: 250 QNITLVCRQISPVFPHNYQETSLPVGVFVWDVYNHGTAAADVSIMFTWKNGTGTSADKKG 309
Query: 250 HHFNSKTMTKDGVHGLTLHHRTANGRP--PVTFAVAAEETADVHVSECPCFLLSGNSKGI 307
+N+ + G+ H +G P P T +AA + DV V+ C F S
Sbjct: 310 GRWNTG-FSSASTSGVMFH----DGHPTQPCTMCIAAADGDDVTVTTCASFNPSSQ---- 360
Query: 308 TAKDMWNEIKKHGSFDHLDNDKTSPSEPGSSIGAAIAASLTIPSGSTRSVTFSLAWDCPE 367
K++WN++K+ G + T +E G ++ A+ +S I + F+L+WD P+
Sbjct: 361 -LKELWNDLKEDGQLNTTKG-PTERTEKGETLAGAVVSSRHIEPNQQGLLEFALSWDMPK 418
Query: 368 VKFFEKVYHRRYTKFYGTLGDSAARIAHDAILEHAKWECEIEAWQRPILEDKRFPEWYPI 427
V+F Y RRYT+++G GDSA + A+ E+ WE I WQ PILE+ R P WY
Sbjct: 419 VQFDSCSYFRRYTRWFGRDGDSALALCSHALEEYGDWEKNINEWQNPILENDRLPAWYKS 478
Query: 428 TLFNELYYLNAGGTIWTDGSPPMQSLATIRERKFSLDTPSKIHTPASSDTALGTRLLENG 487
LFNELY++ GG IW D + + HT ASS
Sbjct: 479 ALFNELYFMTDGGGIWVDVDDEYP------------NGENNAHTRASSLV---------- 516
Query: 488 EENIGQFLYLEGAEYVMYNTYDVHFYSSFALVMLFPKLELSIQRDFAAAVMMHDPGTMKI 547
G+F YLEG EY MYNTYDVHFY+SFAL+ML+PKLELS+Q DF V D + +
Sbjct: 517 -REYGRFGYLEGHEYRMYNTYDVHFYASFALIMLWPKLELSMQYDFGEYVDHKDEDSFQD 575
Query: 548 MSDGKWVARKCL 559
+S G R+ L
Sbjct: 576 LSSGLRSIRRVL 587
>gi|195436933|ref|XP_002066400.1| GK18271 [Drosophila willistoni]
gi|194162485|gb|EDW77386.1| GK18271 [Drosophila willistoni]
Length = 955
Score = 313 bits (803), Expect = 2e-82, Method: Compositional matrix adjust.
Identities = 184/552 (33%), Positives = 286/552 (51%), Gaps = 68/552 (12%)
Query: 103 YRGEFQRFKLFHGICDDAPVLANQFSVFVSRPNG-EKFSSVLCPRSPGVPKKNTDSG--- 158
Y GEF RF++ GI + VLANQF V + G F S+L S + +T++
Sbjct: 137 YAGEFCRFQMRPGIYEYNTVLANQFIVTIKDQKGCTIFQSLLSKCSSIINNNDTEADEDE 196
Query: 159 -------------------IESWDWNLKGENCTYHALFPRAWTVYDGEPDPELRIVCRQI 199
+ +W N+ C+Y L+PR+WT YD +R++CRQI
Sbjct: 197 KSRTKCQLPNCSSSRSRQPLSTWHSNIDDSRCSYTGLYPRSWTEYDLS-HYGVRLICRQI 255
Query: 200 SPFIPHNYKESSFPASVFTFTLSNSGQTSADVTLLFTWANSVAGDSGLSGHHFNSKTMTK 259
SP IPH+YKESS P +VF ++ N V++ FT+ N + S+ + +
Sbjct: 256 SPVIPHDYKESSLPCAVFVWSAENVCDQERKVSITFTFKNGTGNKKQDAEGGAESQLINE 315
Query: 260 DGVHGLTLHHRTANGRPPVTFAVAAEETADVHVSECPCFLLSGNSKGITAKDMWNEIKKH 319
G+ + + A+ P ++ +A D+ ++ CP F +GN + +W ++K+H
Sbjct: 316 GHAKGVAIRQKIADM--PCSYNLACRVLPDISITRCPQFDPAGNGEA-----LWAQLKEH 368
Query: 320 GSFDHLDNDKTSPSEPGSSIGAAIAASLTIPSGSTRSVTFSLAWDCPEVKFFEKV--YHR 377
G L ++ T + IG A+ A + + ++ + F LAWD P+++F K+ + R
Sbjct: 369 G---QLSDEPTQETLKTKDIGVAVCAHIALKPQASHELEFVLAWDMPKIQFPRKLKTHTR 425
Query: 378 RYTKFYGTLGDSAARIAHDAILEHAKWECEIEAWQRPILEDKRFPEWYPITLFNELYYLN 437
YTK++ G++ RI A+ + WE I+AWQRPIL D+ PEWY +FN+LY+++
Sbjct: 426 YYTKYFDDSGEAGPRICEYALKHYQSWERLIDAWQRPILNDEGLPEWYKCAIFNQLYFIS 485
Query: 438 AGGTIWTDGSPPMQSLATIRERKFSLDTPSKIHTPASSDTALGTRL-LENGEENIGQFLY 496
GGTIW D++LG L ++ G+F Y
Sbjct: 486 DGGTIWL-----------------------------KCDSSLGKELAYDDPRLAYGRFGY 516
Query: 497 LEGAEYVMYNTYDVHFYSSFALVMLFPKLELSIQRDFAAAVMMHDPGTMKIMSDGKWVAR 556
LEG EY MYNTYDVHFY+S AL L+P L++S+Q DF A+ T K++ DGK + R
Sbjct: 517 LEGHEYRMYNTYDVHFYASPALAHLWPNLQVSLQFDFKDAIAAELNDTRKMLYDGKIMPR 576
Query: 557 KCLGAVPHDIGLDD--PWFEINSYNLFNSSRWKDLNSKFVLQVYRDFVATGDKNFARAVW 614
K VPHD+G D P+ IN YN+ + + WKDLN+KFVLQVYRD+ + A+A
Sbjct: 577 KVKNCVPHDLGDPDEEPFTLINCYNIHDVNEWKDLNTKFVLQVYRDYYVLNELAQAQADN 636
Query: 615 PSVYIAMAYMEQ 626
S + ++ ++++
Sbjct: 637 ASKFSSIEFIDK 648
Score = 207 bits (528), Expect = 2e-50, Method: Compositional matrix adjust.
Identities = 113/264 (42%), Positives = 157/264 (59%), Gaps = 12/264 (4%)
Query: 609 FARAVWPSVYIAMAYMEQFDKDGDGMIENEGFPDQTYDAWSANGVSAYCGGLWVAALQAA 668
+ ++++ + M ++DKD DG+IEN PDQTYD+W +G SAYC GLW+A+LQA
Sbjct: 694 YLKSMYAACKAIMERTIEYDKDNDGLIENTKMPDQTYDSWVMDGPSAYCSGLWLASLQAM 753
Query: 669 SALANDVGDHASASYFWVRYQKAK-AVYDSLWNGSYFNYDNSDGSSSTSIQADQLAGQWY 727
SA+A + + +K K ++ + LWNGSY+ +D S T I ADQL G WY
Sbjct: 754 SAMATILDQPNDCLRYQDILEKGKHSLEEKLWNGSYYRFDLSHSHRDT-IMADQLCGHWY 812
Query: 728 ARACGL-LPIADEAKVKKALTKIYDFNVLKVKGGMCGAMNGM-----QPD--GRIDMSGL 779
++CG I + V+ AL +IYD NV+ G GA NG +P G +D S +
Sbjct: 813 LKSCGFDYEIYPKENVRTALKRIYDNNVMGFHDGNIGAANGFIANVSEPSKAGHVDNSNI 872
Query: 780 QAREIWPGVTYGLAASMIQEEMVDMAFQTAAGVYEVAWSEDGLGYSFQTPESWNNNDEYR 839
QA E+WPGV Y LAA+MIQE M D AFQTA G+Y+ +G +F+TPE+ YR
Sbjct: 873 QAEEVWPGVVYALAATMIQEGMFDEAFQTAGGMYKTI--SQRIGMNFETPEALYGEKRYR 930
Query: 840 SLCYMRPLTIWAMQWALTKPKLSR 863
S+ YMRPL+IW+MQ A + + R
Sbjct: 931 SIGYMRPLSIWSMQVAWERRRALR 954
>gi|116072378|ref|ZP_01469645.1| hypothetical protein BL107_11341 [Synechococcus sp. BL107]
gi|116064900|gb|EAU70659.1| hypothetical protein BL107_11341 [Synechococcus sp. BL107]
Length = 820
Score = 312 bits (799), Expect = 6e-82, Method: Compositional matrix adjust.
Identities = 248/823 (30%), Positives = 386/823 (46%), Gaps = 110/823 (13%)
Query: 86 DQGIPLGGIGAGSIGRSYRGEFQRFKLFHGICDDAPVLANQFSVFVSRPNGEKFSSVLCP 145
+ G+PLGG GAG GR+ G + L G + QFS++ + + + ++
Sbjct: 41 NHGMPLGGFGAGCFGRAPDGNTNLWHLDGGEHWFGTLPDCQFSLWEKQGDDLRAHALAVS 100
Query: 146 RSPGVPKKNTDSGIESWDW-NLKGENC---TYHALFPRAWTVYDGEPDPELR--IVCRQI 199
+ + + + +W W + C TY A +P +WT YDG R ++C
Sbjct: 101 PEGDASRPDAAASLPAWSWYPASTDQCSTGTYAARYPLSWTHYDGV----FRSGVLCEAF 156
Query: 200 SPFIPHNYKESSFPASVFTFTLSNSGQTSADVTLLFTWANSVA-------------GDSG 246
SP +P +Y+ SS+P +VF + L+N +++L+ +W N+V D G
Sbjct: 157 SPILPGDYQRSSYPVAVFRWQLANPTDQPLELSLMVSWRNTVGWFTNTDAAAEVHFRDDG 216
Query: 247 LSGHHFN----------SKTMTKDGVHGLTLHHRT----ANGRPPVTFAVAAEETADVHV 292
H+++ ++ + + GV G+ L R+ A G AV ++ V V
Sbjct: 217 SPEHNYSPAIGRGEGQRNRAINEPGVTGVLLEGRSSEPIAEGEGQWCLAVP-DDLDGVDV 275
Query: 293 SECPCFLLSGNSKGITAKDMWNEIKKHGSFDHLDNDKTSPSEPGSSIGAAIAASLTIPSG 352
C + G+ D+W G DND+ S G AA+ T+ G
Sbjct: 276 LRCSRWDPCGD-----GSDIWTPFAAEGRIPESDNDRES--RAGEQASAAMVVKFTLAPG 328
Query: 353 STRSVTFSLAWDCPEVKFFEKVY-HRRYTKFYGTLGDSAARIAHDAILEHAKWECEIEAW 411
+R + ++WD P F RRYT F+G+ G +AA IA +A+ + + W IEAW
Sbjct: 329 ESREIPVVISWDLPITAFASGCADRRRYTDFFGSDGRNAAAIAAEALRDWSDWRGRIEAW 388
Query: 412 QRPILEDKRFPEWYPITLFNELYYLNAGGTIWTDGSPPMQSLATIRERKFSLDTPSKIHT 471
Q+P+LE K PE + LFNELY L +GG++WT +
Sbjct: 389 QQPVLERKALPEPVRMALFNELYDLCSGGSLWTAAT------------------------ 424
Query: 472 PASSDTALGTRLLENGEENIGQFLYLEGAEYVMYNTYDVHFYSSFALVMLFPKLELSIQR 531
++ G+F LE +Y Y + DV Y SFAL+ L+P+L+ ++ R
Sbjct: 425 ---------------AKDPYGRFGVLECLDYAWYESLDVRLYGSFALLQLWPELDKAVLR 469
Query: 532 DFAAAVMMHDPGTMKI-----MSDGKWVA-RKCLGAVPHDIGL-DDPWFEINSYNLF-NS 583
F+ A+ D I G+ A RK GA PHD+G ++ F+ +Y + +
Sbjct: 470 SFSRAIPARDATQRPIGWYFTQGRGRVEADRKVAGATPHDLGAPNEVPFDATNYTAYQDC 529
Query: 584 SRWKDLNSKFVLQVYRDF--VATG-DKNFARAVWPSVYIAMAYMEQFDKDGDGMIENEGF 640
+ WKDL S FVLQV+R F TG D NF WPS A+ Y++ FD + DG+ +N G
Sbjct: 530 NLWKDLASDFVLQVWRTFQLAPTGEDLNFLAECWPSAVQALDYLKGFDVNDDGLPDNGGA 589
Query: 641 PDQTYDAWSANGVSAYCGGLWVAALQAASA----LANDVGDHASASY--FWVRYQKAKAV 694
PDQT+D W GVSAYCG LW+AAL+AA A L D+G + F +++++
Sbjct: 590 PDQTFDDWPLKGVSAYCGALWIAALEAALAMAQRLQLDLGLDTTVEQRRFGQWLEQSRSN 649
Query: 695 YDS-LWNGSYFNYDNSDGSSSTSIQADQLAGQWYARACGLLPIADEAKVKKALTKIYDFN 753
+D LWNG Y+ D G+ + ADQL G +YAR GL + +A L + +
Sbjct: 650 FDRLLWNGEYYEIDAESGTP--VVMADQLCGDFYARLLGLPSVVSDANALSTLNVVREAC 707
Query: 754 VLKVKGGMCGAMNGMQPDGR-IDMSGLQAREIWPGVTYGLAASMIQEEMVDMAFQTAAGV 812
+ + G G NG++ DG +D +G E+W G+ +G+A+ D A V
Sbjct: 708 FERFEAGTLGVANGLRRDGTPLDPNGTHPLEVWTGINFGIASYFRLMGQSDTALAITGAV 767
Query: 813 YEVAWSEDGLGYSFQTPESWNNNDEYRSLCYMRPLTIWAMQWA 855
+S G F+TPE+ + +R+ Y+R + IW + WA
Sbjct: 768 VNQVYSG---GLQFRTPEAITAVNTFRACHYLRAMAIWGL-WA 806
>gi|195579186|ref|XP_002079443.1| GD23959 [Drosophila simulans]
gi|194191452|gb|EDX05028.1| GD23959 [Drosophila simulans]
Length = 947
Score = 312 bits (799), Expect = 6e-82, Method: Compositional matrix adjust.
Identities = 182/549 (33%), Positives = 286/549 (52%), Gaps = 64/549 (11%)
Query: 103 YRGEFQRFKLFHGICDDAPVLANQFSVFVSRPNG-EKFSSVLCPRSPGVPKKNTDSG--- 158
Y GEF RF++ GI + V ANQF V + P G F S+L S + D+
Sbjct: 131 YAGEFCRFQMRPGIYEYNVVQANQFIVTIKDPKGCTIFQSLLSKCSTRDKTSDPDADPDG 190
Query: 159 -----------------IESWDWNLKGENCTYHALFPRAWTVYDGEPDPELRIVCRQISP 201
+ +W N++ C+Y L+PR+WT YD +R+ CRQ+SP
Sbjct: 191 ERTKCQLPNCSSRAKQPLSAWHSNIEDSRCSYTGLYPRSWTEYDLS-HYGVRLTCRQVSP 249
Query: 202 FIPHNYKESSFPASVFTFTLSNSGQTSADVTLLFTWANSVAGDSGLSGHHFNSKTMTKDG 261
IPH Y+ESS P +VF +++ N V++ FT+ N + S+ +++
Sbjct: 250 VIPHEYRESSQPCAVFVWSVENVCDQERKVSITFTFKNGTGNKKQDAEGGAESQLISEGN 309
Query: 262 VHGLTLHHRTANGRPPVTFAVAAEETADVHVSECPCFLLSGNSKGITAKDMWNEIKKHGS 321
G+++ + A+ P ++ +A ++ ++ CP F +GN + +W ++K+HG
Sbjct: 310 AKGVSIRQKIADM--PCSYNLACRVLPEISITRCPQFDPAGN-----GEQLWAQLKEHG- 361
Query: 322 FDHLDNDKTSPSEPGSSIGAAIAASLTIPSGSTRSVTFSLAWDCPEVKFFEKV--YHRRY 379
L TS + IG A+ A + + ++ + F LAWD P+++F K+ + R Y
Sbjct: 362 --QLSEQPTSEALKTKDIGVAVCAQVALKPMASHDLEFVLAWDMPKIQFPRKMQTHTRYY 419
Query: 380 TKFYGTLGDSAARIAHDAILEHAKWECEIEAWQRPILEDKRFPEWYPITLFNELYYLNAG 439
TK++ GDS RI A+ +++ WE I+AWQRPIL D+ P+WY +FN+LY+++ G
Sbjct: 420 TKYFDDSGDSGPRICEYALRQYSTWERLIDAWQRPILNDETLPDWYKCAIFNQLYFISDG 479
Query: 440 GTIWTDGSPPMQSLATIRERKFSLDTPSKIHTPASSDTALGTRLLENGEENIGQFLYLEG 499
GTIW + ++ S D P + G+F YLEG
Sbjct: 480 GTIWLKCESSLG-------KELSYDDPRLAY---------------------GRFGYLEG 511
Query: 500 AEYVMYNTYDVHFYSSFALVMLFPKLELSIQRDFAAAVMMHDPGTMKIMSDGKWVARKCL 559
EY MYNTYDVHFY+S AL L+P L++S+Q DF A+ T K++ DGK + RK
Sbjct: 512 HEYRMYNTYDVHFYASPALAHLWPNLQVSLQYDFKDAIAAELNDTRKMLYDGKVMPRKVK 571
Query: 560 GAVPHDIGLDD--PWFEINSYNLFNSSRWKDLNSKFVLQVYRDFVATGDKNFARAVWPSV 617
VPHD+G D P+ IN YN+ + + WKDLN+KFVLQVYRD+ + A+A S
Sbjct: 572 NCVPHDLGDPDEEPFTLINCYNIHDVNDWKDLNTKFVLQVYRDYYVLNELAQAQADNASK 631
Query: 618 YIAMAYMEQ 626
+ ++ ++++
Sbjct: 632 FSSIEFIDK 640
Score = 207 bits (526), Expect = 3e-50, Method: Compositional matrix adjust.
Identities = 115/268 (42%), Positives = 159/268 (59%), Gaps = 20/268 (7%)
Query: 609 FARAVWPSVYIAMAYMEQFDKDGDGMIENEGFPDQTYDAWSANGVSAYCGGLWVAALQAA 668
+ +A++ S + ++DKD DG+IEN PDQTYD+W +G SAYC GLW+AALQA
Sbjct: 686 YLKAMYASCKAILERTIEYDKDNDGLIENTKMPDQTYDSWVMDGPSAYCSGLWLAALQAM 745
Query: 669 SALANDVGDHASASYFWVRYQK-----AKAVYDSLWNGSYFNYDNSDGSSSTSIQADQLA 723
SA+A + D + +RYQ +++ + LWNGSY+ +D S T I ADQL
Sbjct: 746 SAMAT-ILDQPNDC---LRYQDILEKGKRSLEEKLWNGSYYRFDLSHSHRDT-IMADQLC 800
Query: 724 GQWYARACGL-LPIADEAKVKKALTKIYDFNVLKVKGGMCGAMNGMQPD-------GRID 775
G WY ++CG I + V+ AL +IYD NV+ G GA NG + G +D
Sbjct: 801 GHWYLKSCGFDYEIYPKENVRTALKRIYDNNVMGFHEGNIGAANGFIANAGEPTKPGHVD 860
Query: 776 MSGLQAREIWPGVTYGLAASMIQEEMVDMAFQTAAGVYEVAWSEDGLGYSFQTPESWNNN 835
S +QA E+WPGV Y LAA+MIQE M + AFQTA G+Y+ +G +F+TPE+
Sbjct: 861 NSNIQAEEVWPGVVYALAATMIQEGMFEEAFQTAGGMYKTL--SQRIGMNFETPEALYGE 918
Query: 836 DEYRSLCYMRPLTIWAMQWALTKPKLSR 863
YRS+ YMRPL+I +MQ AL + + R
Sbjct: 919 KRYRSIGYMRPLSILSMQVALERRRAQR 946
>gi|195338407|ref|XP_002035816.1| GM15538 [Drosophila sechellia]
gi|194129696|gb|EDW51739.1| GM15538 [Drosophila sechellia]
Length = 947
Score = 311 bits (798), Expect = 8e-82, Method: Compositional matrix adjust.
Identities = 182/549 (33%), Positives = 286/549 (52%), Gaps = 64/549 (11%)
Query: 103 YRGEFQRFKLFHGICDDAPVLANQFSVFVSRPNG-EKFSSVLCPRSPGVPKKNTDSG--- 158
Y GEF RF++ GI + V ANQF V + P G F S+L S + D+
Sbjct: 131 YAGEFCRFQMRPGIYEYNVVQANQFIVTIKDPKGCTIFQSLLSKCSTRDKTSDPDADPDG 190
Query: 159 -----------------IESWDWNLKGENCTYHALFPRAWTVYDGEPDPELRIVCRQISP 201
+ +W N++ C+Y L+PR+WT YD +R+ CRQ+SP
Sbjct: 191 ERTKCQLPNCSSRAKQPLSAWHSNIEDSRCSYTGLYPRSWTEYDLS-HYGVRLTCRQVSP 249
Query: 202 FIPHNYKESSFPASVFTFTLSNSGQTSADVTLLFTWANSVAGDSGLSGHHFNSKTMTKDG 261
IPH Y+ESS P +VF +++ N V++ FT+ N + S+ +++
Sbjct: 250 VIPHEYRESSQPCAVFVWSVENVCDQERKVSITFTFKNGTGNKKQDAEGGAESQLISEGN 309
Query: 262 VHGLTLHHRTANGRPPVTFAVAAEETADVHVSECPCFLLSGNSKGITAKDMWNEIKKHGS 321
G+++ + A+ P ++ +A ++ ++ CP F +GN + +W ++K+HG
Sbjct: 310 AKGVSIRQKIADM--PCSYNLACRVLPEISITRCPQFDPAGN-----GEQLWAQLKEHG- 361
Query: 322 FDHLDNDKTSPSEPGSSIGAAIAASLTIPSGSTRSVTFSLAWDCPEVKFFEKV--YHRRY 379
L TS + IG A+ A + + ++ + F LAWD P+++F K+ + R Y
Sbjct: 362 --QLSEQPTSEALKTKDIGVAVCAQVALKPMASHDLEFVLAWDMPKIQFPRKMQTHTRYY 419
Query: 380 TKFYGTLGDSAARIAHDAILEHAKWECEIEAWQRPILEDKRFPEWYPITLFNELYYLNAG 439
TK++ GDS RI A+ +++ WE I+AWQRPIL D+ P+WY +FN+LY+++ G
Sbjct: 420 TKYFDDSGDSGPRICEYALRQYSTWERLIDAWQRPILNDETLPDWYKCAIFNQLYFISDG 479
Query: 440 GTIWTDGSPPMQSLATIRERKFSLDTPSKIHTPASSDTALGTRLLENGEENIGQFLYLEG 499
GTIW + ++ S D P + G+F YLEG
Sbjct: 480 GTIWLKCESSLG-------KELSYDDPRLAY---------------------GRFGYLEG 511
Query: 500 AEYVMYNTYDVHFYSSFALVMLFPKLELSIQRDFAAAVMMHDPGTMKIMSDGKWVARKCL 559
EY MYNTYDVHFY+S AL L+P L++S+Q DF A+ T K++ DGK + RK
Sbjct: 512 HEYRMYNTYDVHFYASPALAHLWPNLQVSLQYDFKDAIAAELNDTRKMLYDGKVMPRKVK 571
Query: 560 GAVPHDIGLDD--PWFEINSYNLFNSSRWKDLNSKFVLQVYRDFVATGDKNFARAVWPSV 617
VPHD+G D P+ IN YN+ + + WKDLN+KFVLQVYRD+ + A+A S
Sbjct: 572 NCVPHDLGDPDEEPFTLINCYNIHDVNDWKDLNTKFVLQVYRDYYVLNELAQAQADNASK 631
Query: 618 YIAMAYMEQ 626
+ ++ ++++
Sbjct: 632 FSSIEFIDK 640
Score = 211 bits (537), Expect = 2e-51, Method: Compositional matrix adjust.
Identities = 116/268 (43%), Positives = 160/268 (59%), Gaps = 20/268 (7%)
Query: 609 FARAVWPSVYIAMAYMEQFDKDGDGMIENEGFPDQTYDAWSANGVSAYCGGLWVAALQAA 668
+ +A++ S + ++DKD DG+IEN PDQTYD+W +G SAYC GLW+AALQA
Sbjct: 686 YLKAMYASCKAILERTIEYDKDNDGLIENTKMPDQTYDSWVMDGPSAYCSGLWLAALQAM 745
Query: 669 SALANDVGDHASASYFWVRYQK-----AKAVYDSLWNGSYFNYDNSDGSSSTSIQADQLA 723
SA+A + D + +RYQ +++ + LWNGSY+ +D S T I ADQL
Sbjct: 746 SAMAT-ILDQPNDC---LRYQDILEKGKRSLEEKLWNGSYYRFDLSHSHRDT-IMADQLC 800
Query: 724 GQWYARACGL-LPIADEAKVKKALTKIYDFNVLKVKGGMCGAMNGMQPD-------GRID 775
G WY ++CG I + V+ AL +IYD NV+ G GA NG + G +D
Sbjct: 801 GHWYLKSCGFDYEIYPKENVRTALKRIYDNNVMGFHEGNIGAANGFIANAGEPTKPGHVD 860
Query: 776 MSGLQAREIWPGVTYGLAASMIQEEMVDMAFQTAAGVYEVAWSEDGLGYSFQTPESWNNN 835
S +QA E+WPGV Y LAA+MIQE M + AFQTA G+Y+ +G +F+TPE+
Sbjct: 861 NSNIQAEEVWPGVVYALAATMIQEGMFEEAFQTAGGMYKTL--SQRIGMNFETPEALYGE 918
Query: 836 DEYRSLCYMRPLTIWAMQWALTKPKLSR 863
YRS+ YMRPL+IW+MQ AL + + R
Sbjct: 919 KRYRSIGYMRPLSIWSMQVALERRRAQR 946
>gi|33866127|ref|NP_897686.1| hypothetical protein SYNW1593 [Synechococcus sp. WH 8102]
gi|33639102|emb|CAE08108.1| conserved hypothetical protein [Synechococcus sp. WH 8102]
Length = 833
Score = 310 bits (794), Expect = 2e-81, Method: Compositional matrix adjust.
Identities = 249/827 (30%), Positives = 390/827 (47%), Gaps = 118/827 (14%)
Query: 86 DQGIPLGGIGAGSIGRSYRGEFQRFKLFHGICDDAPVLANQFSVFVSRPNGEKFSSVLCP 145
D G+PLGG GAG +GRS G F + L G + QF+++ + GE +
Sbjct: 54 DHGMPLGGFGAGCLGRSPSGAFNLWNLDGGEHWFGSIPDCQFALWEQQ--GEDVRAHALA 111
Query: 146 RSPGVPKKNTDSG--IESWDW---NLKGENC-TYHALFPRAWTVYDGEPDPELRIVCRQI 199
+P + G + +W W + +G + TY A +P +WT + E ++C
Sbjct: 112 TAPDRDDTDPSGGTPLSAWQWYPASTQGRSTGTYAARYPLSWTHH--EDVYRAGVLCEAF 169
Query: 200 SPFIPHNYKESSFPASVFTFTLSNSGQTSADVTLLFTWANSVA-------------GDSG 246
SP +P +Y+ +S+P +VF + SN +++L+ W N+V D G
Sbjct: 170 SPILPGDYQRTSYPVAVFRWQFSNPTDQPLELSLMLCWRNTVGWFTNTDAAAEVHFRDDG 229
Query: 247 LSGHHF----------NSKTMTKDGVHGLTLHHRTANGRPPVTFA--------VAAEETA 288
H++ ++ + + G G+ L G+P + A ++
Sbjct: 230 SPEHNYAPAIGRGEGQRNRQVDQPGFSGVLLE-----GQPSMPLAEGEGQWCLALPDDLD 284
Query: 289 DVHVSECPCFLLSGNSKGITAKDMWNEIKKHGSFDHLDNDKTSPSEPGSSIGAAIAASLT 348
VHV C + SG+ ++W G +ND+ S S G AAIA T
Sbjct: 285 GVHVMRCSRWDPSGD-----GSELWTPFAADGRIPDSNNDRASRS--GEQASAAIAVKFT 337
Query: 349 IPSGSTRSVTFSLAWDCPEVKFFE-KVYHRRYTKFYGTLGDSAARIAHDAILEHAKWECE 407
+ G++R + ++WD P F RRYT F+G+ G +AA IA +A+ + +W +
Sbjct: 338 LAPGASREIPLVISWDLPVTAFASGSSALRRYTDFFGSSGSNAAAIAAEALRDWRQWREQ 397
Query: 408 IEAWQRPILEDKRFPEWYPITLFNELYYLNAGGTIWTDGSPPMQSLATIRERKFSLDTPS 467
I+AWQ+P+L + PE + LFNELY L +GG++WT SP
Sbjct: 398 IDAWQQPVLARMQLPEPLRMALFNELYDLCSGGSLWTAASP------------------- 438
Query: 468 KIHTPASSDTALGTRLLENGEENIGQFLYLEGAEYVMYNTYDVHFYSSFALVMLFPKLEL 527
+ G+F LE +Y Y + DV Y S AL+ L+P+L+
Sbjct: 439 --------------------ADPHGRFGVLECLDYAWYESLDVRLYGSLALLQLWPELDK 478
Query: 528 SIQRDFAAAVMMHDPGTMKI-----MSDGKWVA-RKCLGAVPHDIGL-DDPWFEINSYNL 580
++ R FA A+ D I G+ A RK GA PHD+G ++ F+ +Y
Sbjct: 479 AVLRSFARAIPAADGSQRPIGWYFTQGRGRVEADRKVAGATPHDLGAPNERPFDATNYTA 538
Query: 581 F-NSSRWKDLNSKFVLQVYRDF--VATG-DKNFARAVWPSVYIAMAYMEQFDKDGDGMIE 636
+ + + WKDL S ++LQV+R F TG D F WPS A+ Y++ FD + DG+ +
Sbjct: 539 YQDCNLWKDLASDYMLQVWRTFRLAPTGEDLRFLAECWPSAVQALDYLKGFDVNNDGLPD 598
Query: 637 NEGFPDQTYDAWSANGVSAYCGGLWVAALQAASA----LANDVGDHASASY--FWVRYQK 690
N G PDQT+D W GVSAYCG LW+AAL+AA A L D+G +A F ++
Sbjct: 599 NGGAPDQTFDDWPLQGVSAYCGALWIAALEAALAMAQRLQLDLGLDTAAEQRRFGAWLEQ 658
Query: 691 AKAVYDS-LWNGSYFNYDNSDGSSSTSIQADQLAGQWYARACGLLPIADEAKVKKALTKI 749
++A +D LWNG Y+ D G+ + ADQL G +YAR GL P+ +A+ + L +
Sbjct: 659 SRANFDDLLWNGEYYRIDAESGTP--VVMADQLCGDFYARLLGLPPVVSDARARSTLQAV 716
Query: 750 YDFNVLKVKGGMCGAMNGMQPDGR-IDMSGLQAREIWPGVTYGLAASMIQEEMVDMAFQT 808
+ + + G G NG++ DG +D +G E+W G+ +GLA+ A
Sbjct: 717 KEACFERFESGRLGVANGLRRDGSPLDPNGTHPLEVWTGINFGLASYYRLMGESKTALAI 776
Query: 809 AAGVYEVAWSEDGLGYSFQTPESWNNNDEYRSLCYMRPLTIWAMQWA 855
+ V +S G F+TPE+ + +R+ Y+R + IW + WA
Sbjct: 777 CSAVVGQVYSG---GLQFRTPEAITAVNTFRACHYLRAMAIWGL-WA 819
>gi|195474003|ref|XP_002089281.1| GE19029 [Drosophila yakuba]
gi|194175382|gb|EDW88993.1| GE19029 [Drosophila yakuba]
Length = 947
Score = 310 bits (794), Expect = 2e-81, Method: Compositional matrix adjust.
Identities = 180/550 (32%), Positives = 286/550 (52%), Gaps = 66/550 (12%)
Query: 103 YRGEFQRFKLFHGICDDAPVLANQFSVFVSRPNG-EKFSSVLCPRSPGVPKKNTDSG--- 158
Y GEF RF++ GI + V ANQF V + P G F S+L S + D+
Sbjct: 131 YAGEFCRFQMRPGIYEYNVVQANQFIVTIKDPKGCTIFQSLLSKCSTRDKSSDPDADPDG 190
Query: 159 -----------------IESWDWNLKGENCTYHALFPRAWTVYDGEPDPELRIVCRQISP 201
+ +W N++ C+Y L+PR+WT YD +R+ CRQ+SP
Sbjct: 191 ERTKCQLPNCSSRAKQPLSAWHSNIEDSRCSYTGLYPRSWTEYDLS-HYGVRLTCRQVSP 249
Query: 202 FIPHNYKESSFPASVFTFTLSNSGQTSADVTLLFTWANSVAGDSGLSGHHFNSKTMTKDG 261
IPH Y++SS P +VF +++ N V++ F++ N + S+ +++
Sbjct: 250 VIPHEYRDSSLPCAVFVWSVENVCDQERKVSITFSFKNGTGNKKQDAEGGAESQLISEGN 309
Query: 262 VHGLTLHHRTANGRPPVTFAVAAEETADVHVSECPCFLLSGNSKGITAKDMWNEIKKHGS 321
G+++ + A P ++ +A ++ ++ CP F +GN + +W ++K+HG
Sbjct: 310 AKGVSIRQKIAEM--PCSYNLACRVLPEISITRCPQFDPAGN-----GEQLWAQLKEHG- 361
Query: 322 FDHLDNDKTSPSEPGSSIGAAIAASLTIPSGSTRSVTFSLAWDCPEVKFFEKV--YHRRY 379
L TS + IG A+ A + + ++ + F LAWD P+++F K+ + R Y
Sbjct: 362 --QLSEQPTSEALKTKDIGVAVCAQVALKPMASHDLEFVLAWDMPKIQFPRKMQTHTRYY 419
Query: 380 TKFYGTLGDSAARIAHDAILEHAKWECEIEAWQRPILEDKRFPEWYPITLFNELYYLNAG 439
TK++ GDS +I A+ +++ WE I+AWQRPIL D+ P+WY +FN+LY+++ G
Sbjct: 420 TKYFDASGDSGPKICEYALRQYSTWERLIDAWQRPILNDESLPDWYKCAIFNQLYFISDG 479
Query: 440 GTIWTDGSPPMQSLATIRERKFSLDTPSKIHTPASSDTALGTRL-LENGEENIGQFLYLE 498
GTIW D++LG L ++ G+F YLE
Sbjct: 480 GTIWL-----------------------------KCDSSLGKELTYDDPRLAYGRFGYLE 510
Query: 499 GAEYVMYNTYDVHFYSSFALVMLFPKLELSIQRDFAAAVMMHDPGTMKIMSDGKWVARKC 558
G EY MYNTYDVHFY+S AL L+P L++S+Q DF A+ T K++ DGK + RK
Sbjct: 511 GHEYRMYNTYDVHFYASPALAHLWPNLQVSLQYDFKDAIAAELSDTRKMLYDGKVMPRKV 570
Query: 559 LGAVPHDIGLDD--PWFEINSYNLFNSSRWKDLNSKFVLQVYRDFVATGDKNFARAVWPS 616
VPHD+G D P+ IN YN+ + + WKDLN+KFVLQVYRD+ + A+A S
Sbjct: 571 KNCVPHDLGDPDEEPFTLINCYNIHDVNDWKDLNTKFVLQVYRDYYVLNELAQAQADNAS 630
Query: 617 VYIAMAYMEQ 626
+ ++ ++++
Sbjct: 631 KFSSIEFIDK 640
Score = 210 bits (535), Expect = 2e-51, Method: Compositional matrix adjust.
Identities = 116/268 (43%), Positives = 159/268 (59%), Gaps = 20/268 (7%)
Query: 609 FARAVWPSVYIAMAYMEQFDKDGDGMIENEGFPDQTYDAWSANGVSAYCGGLWVAALQAA 668
+ +A++ M ++DKD DG+IEN PDQTYD+W +G SAYC GLW+AALQA
Sbjct: 686 YLKAMYAPCKAIMERTIEYDKDNDGLIENTKMPDQTYDSWVMDGPSAYCSGLWLAALQAM 745
Query: 669 SALANDVGDHASASYFWVRYQK-----AKAVYDSLWNGSYFNYDNSDGSSSTSIQADQLA 723
SA+A + D + +RYQ +++ + LWNGSY+ +D S T I ADQL
Sbjct: 746 SAMAT-ILDQPNDC---LRYQDILEKGKRSLEEKLWNGSYYRFDLSHSHRDT-IMADQLC 800
Query: 724 GQWYARACGL-LPIADEAKVKKALTKIYDFNVLKVKGGMCGAMNGMQPD-------GRID 775
G WY ++CG I + V+ AL +IYD NV+ G GA NG + G +D
Sbjct: 801 GHWYLKSCGFDYEIYPKENVRTALKRIYDNNVMGFHEGNIGAANGFIANAGEPTKPGHVD 860
Query: 776 MSGLQAREIWPGVTYGLAASMIQEEMVDMAFQTAAGVYEVAWSEDGLGYSFQTPESWNNN 835
S +QA E+WPGV Y LAA+MIQE M + AFQTA G+Y+ +G +F+TPE+
Sbjct: 861 NSNIQAEEVWPGVVYALAATMIQEGMFEEAFQTAGGMYKTL--SQRIGMNFETPEALYGE 918
Query: 836 DEYRSLCYMRPLTIWAMQWALTKPKLSR 863
YRS+ YMRPL+IW+MQ AL + + R
Sbjct: 919 KRYRSIGYMRPLSIWSMQVALERRRAQR 946
>gi|195397814|ref|XP_002057523.1| GJ18180 [Drosophila virilis]
gi|194141177|gb|EDW57596.1| GJ18180 [Drosophila virilis]
Length = 869
Score = 310 bits (793), Expect = 3e-81, Method: Compositional matrix adjust.
Identities = 186/556 (33%), Positives = 284/556 (51%), Gaps = 72/556 (12%)
Query: 103 YRGEFQRFKLFHGICDDAPVLANQFSVFVSRPNG-EKFSSVL--CPRSPGVPKKNTDSG- 158
Y GEF RF++ GI + V ANQF V + G F S+L C K N +
Sbjct: 47 YAGEFCRFQMRPGIYEYNVVHANQFIVTIKDHKGCTIFQSLLSRCSTKSFKAKTNNNDAE 106
Query: 159 -----------------------IESWDWNLKGENCTYHALFPRAWTVYDGEPDPELRIV 195
+ +W N++ C+Y L+PRAWT YD +R++
Sbjct: 107 ADEADAQKTKCQLPNCSSRPRQPLSAWHSNIEDSRCSYTGLYPRAWTEYDLS-HYGVRLI 165
Query: 196 CRQISPFIPHNYKESSFPASVFTFTLSNSGQTSADVTLLFTWANSVAGDSGLSGHHFNSK 255
CRQISP IPH+YKESS P +VF +++ N V++ FT+ N + + S+
Sbjct: 166 CRQISPVIPHDYKESSLPCAVFVWSVQNVCDQERKVSITFTFKNGTGNKKQDADGNAESQ 225
Query: 256 TMTKDGVHGLTLHHRTANGRPPVTFAVAAEETADVHVSECPCFLLSGNSKGITAKDMWNE 315
+++ G+ + A+ P ++ +A ++ ++ CP F +GN + +W +
Sbjct: 226 LISEGNAKGVAIRQNIADM--PCSYNLACRVLPEISITRCPQFDPAGN-----GEQLWAQ 278
Query: 316 IKKHGSFDHLDNDKTSPSEPGSSIGAAIAASLTIPSGSTRSVTFSLAWDCPEVKFFEKV- 374
+K+HG L T+ + IG A+ A L + + + F LAWD P+++F K+
Sbjct: 279 LKEHG---QLSEQPTAETLRSKDIGVAVCAQLALKPQAYHELEFVLAWDMPKIQFPRKLQ 335
Query: 375 -YHRRYTKFYGTLGDSAARIAHDAILEHAKWECEIEAWQRPILEDKRFPEWYPITLFNEL 433
+ R YTK++ G+S RI A+ +A WE I+AWQRPIL D P+WY +FN+L
Sbjct: 336 THTRYYTKYFDDSGESGPRICEYALKHYASWERLIDAWQRPILNDDGLPDWYKCAIFNQL 395
Query: 434 YYLNAGGTIWTDGSPPMQSLATIRERKFSLDTPSKIHTPASSDTALGTRL-LENGEENIG 492
Y+++ GGTIW D++LG L ++ G
Sbjct: 396 YFISDGGTIWL-----------------------------KCDSSLGQSLAFDDPRLAYG 426
Query: 493 QFLYLEGAEYVMYNTYDVHFYSSFALVMLFPKLELSIQRDFAAAVMMHDPGTMKIMSDGK 552
+F YLEG EY MYNTYDVHFY+S AL L+P L++S+Q DF A+ T K++ DGK
Sbjct: 427 RFGYLEGHEYRMYNTYDVHFYASPALAHLWPNLQVSLQYDFKDAIAAELNDTRKMLYDGK 486
Query: 553 WVARKCLGAVPHDIGLDD--PWFEINSYNLFNSSRWKDLNSKFVLQVYRDFVATGDKNFA 610
+ RK VPHD+G D P+ IN YN+ + + WKDLN+KFVLQVYRD+ + A
Sbjct: 487 VMPRKVKNCVPHDLGDPDEEPFTLINCYNIHDVNDWKDLNTKFVLQVYRDYYVLNELAQA 546
Query: 611 RAVWPSVYIAMAYMEQ 626
+A S + ++ ++++
Sbjct: 547 QADNASKFSSIEFIDK 562
Score = 202 bits (514), Expect = 7e-49, Method: Compositional matrix adjust.
Identities = 111/264 (42%), Positives = 153/264 (57%), Gaps = 12/264 (4%)
Query: 609 FARAVWPSVYIAMAYMEQFDKDGDGMIENEGFPDQTYDAWSANGVSAYCGGLWVAALQAA 668
+ +A++ + M ++DKD DG+IEN PDQTYD+W +G SAYC GLW+AALQ
Sbjct: 608 YLKAMYGACKAIMERTIEYDKDNDGLIENTKMPDQTYDSWVMDGPSAYCAGLWLAALQTM 667
Query: 669 SALANDVGDHASASYFWVRYQKAK-AVYDSLWNGSYFNYDNSDGSSSTSIQADQLAGQWY 727
S +A + + +K K ++ + LWNGSY+ +D S T I ADQL G WY
Sbjct: 668 SVMATLLDQPNDCLRYQDILEKGKHSLEEKLWNGSYYRFDLSHNHRDT-IMADQLCGHWY 726
Query: 728 ARACGL-LPIADEAKVKKALTKIYDFNVLKVKGGMCGAMNGMQPD-------GRIDMSGL 779
++CG I + V+ AL +IYD NV+ G GA NG + G +D S +
Sbjct: 727 LKSCGFDYEIYPKENVRTALKRIYDNNVMGFHEGNIGAANGFIANASEPSKPGHVDNSNI 786
Query: 780 QAREIWPGVTYGLAASMIQEEMVDMAFQTAAGVYEVAWSEDGLGYSFQTPESWNNNDEYR 839
QA E+WPGV Y LAA+MIQE M + AFQTA G+Y+ G+ Y +TPE+ YR
Sbjct: 787 QAEEVWPGVVYALAATMIQEGMFEEAFQTAGGMYKTISQRIGMNY--ETPEALYGEKRYR 844
Query: 840 SLCYMRPLTIWAMQWALTKPKLSR 863
S+ YMRPL+IW+MQ A + + R
Sbjct: 845 SIGYMRPLSIWSMQVAWERRRALR 868
>gi|194860149|ref|XP_001969520.1| GG23916 [Drosophila erecta]
gi|190661387|gb|EDV58579.1| GG23916 [Drosophila erecta]
Length = 947
Score = 309 bits (792), Expect = 4e-81, Method: Compositional matrix adjust.
Identities = 180/550 (32%), Positives = 287/550 (52%), Gaps = 66/550 (12%)
Query: 103 YRGEFQRFKLFHGICDDAPVLANQFSVFVSRPNG-EKFSSVLCPRSPGVPKKNTDSG--- 158
Y GEF RF++ GI + V ANQF V + P G F S+L S + D+
Sbjct: 131 YAGEFCRFQMRPGIYEYNVVQANQFIVTIKDPKGCTIFQSLLSKCSTRDKTSDPDADPDG 190
Query: 159 -----------------IESWDWNLKGENCTYHALFPRAWTVYDGEPDPELRIVCRQISP 201
+ +W N++ C+Y L+PR+WT YD +R++CRQ+SP
Sbjct: 191 ERTKCQLPNCSSRAKQPLSAWHSNIEDSRCSYTGLYPRSWTEYDLS-HYGVRLICRQVSP 249
Query: 202 FIPHNYKESSFPASVFTFTLSNSGQTSADVTLLFTWANSVAGDSGLSGHHFNSKTMTKDG 261
IPH Y+ESS P +VF +++ N V++ F++ N + S+ +++
Sbjct: 250 VIPHEYRESSLPCAVFVWSVENVCDQERKVSITFSFKNGTGNKKQDAEGGAESQLISEGN 309
Query: 262 VHGLTLHHRTANGRPPVTFAVAAEETADVHVSECPCFLLSGNSKGITAKDMWNEIKKHGS 321
G+++ + A P ++ +A ++ ++ CP F +GN + +W ++K+HG
Sbjct: 310 AKGVSIRQKIAEM--PCSYNLACRVLPEISITRCPQFDPAGN-----GEPLWAQLKEHG- 361
Query: 322 FDHLDNDKTSPSEPGSSIGAAIAASLTIPSGSTRSVTFSLAWDCPEVKFFEKV--YHRRY 379
L TS + IG A+ A + + ++ + F LAWD P+++F K+ + R Y
Sbjct: 362 --QLSEQPTSEALKTKDIGVAVCAQVALKPMASHDLEFVLAWDMPKIQFPRKMQTHTRYY 419
Query: 380 TKFYGTLGDSAARIAHDAILEHAKWECEIEAWQRPILEDKRFPEWYPITLFNELYYLNAG 439
TK++ GD+ +I A+ +++ WE I+AWQRPIL D+ P+WY +FN+LY+++ G
Sbjct: 420 TKYFDDSGDAGPKICEYALRQYSTWERLIDAWQRPILNDESLPDWYKCAIFNQLYFISDG 479
Query: 440 GTIWTDGSPPMQSLATIRERKFSLDTPSKIHTPASSDTALGTRL-LENGEENIGQFLYLE 498
GTIW D++LG L ++ G+F YLE
Sbjct: 480 GTIWL-----------------------------KCDSSLGKELAYDDPRLAYGRFGYLE 510
Query: 499 GAEYVMYNTYDVHFYSSFALVMLFPKLELSIQRDFAAAVMMHDPGTMKIMSDGKWVARKC 558
G EY MYNTYDVHFY+S AL L+P L++S+Q DF A+ T K++ DGK + RK
Sbjct: 511 GHEYRMYNTYDVHFYASPALAHLWPNLQVSLQYDFKDAIAAELGDTRKMLYDGKVMPRKV 570
Query: 559 LGAVPHDIGLDD--PWFEINSYNLFNSSRWKDLNSKFVLQVYRDFVATGDKNFARAVWPS 616
VPHD+G D P+ IN YN+ + + WKDLN+KFVLQVYRD+ + A+A S
Sbjct: 571 KNCVPHDLGDPDEEPFTLINCYNIHDVNDWKDLNTKFVLQVYRDYYVLNELAQAQADNAS 630
Query: 617 VYIAMAYMEQ 626
+ ++ ++++
Sbjct: 631 KFSSIEFIDK 640
Score = 210 bits (534), Expect = 3e-51, Method: Compositional matrix adjust.
Identities = 115/264 (43%), Positives = 154/264 (58%), Gaps = 12/264 (4%)
Query: 609 FARAVWPSVYIAMAYMEQFDKDGDGMIENEGFPDQTYDAWSANGVSAYCGGLWVAALQAA 668
+ +A++ M ++DKD DG+IEN PDQTYD+W +G SAYC GLW+AALQA
Sbjct: 686 YLKAMYAPCKAIMERTIEYDKDNDGLIENTKMPDQTYDSWVMDGPSAYCSGLWLAALQAM 745
Query: 669 SALANDVGDHASASYFWVRYQKAKAVYD-SLWNGSYFNYDNSDGSSSTSIQADQLAGQWY 727
SA+A + + +K K D LWNGSY+ +D S T I ADQL G WY
Sbjct: 746 SAMATILDQPNDCLRYQDILEKGKRSLDEKLWNGSYYRFDLSHSHRDT-IMADQLCGHWY 804
Query: 728 ARACGL-LPIADEAKVKKALTKIYDFNVLKVKGGMCGAMNGMQPD-------GRIDMSGL 779
++CG I + V+ AL +IYD NV+ G GA NG + G +D S +
Sbjct: 805 LKSCGFDYEIYPKENVRTALKRIYDNNVMGFHEGNIGAANGFIANAGEPTKPGHVDNSNI 864
Query: 780 QAREIWPGVTYGLAASMIQEEMVDMAFQTAAGVYEVAWSEDGLGYSFQTPESWNNNDEYR 839
QA E+WPGV Y LAA+MIQE M + AFQTA G+Y+ +G +F+TPE+ YR
Sbjct: 865 QAEEVWPGVVYALAATMIQEGMFEEAFQTAGGMYKTL--SQRIGMNFETPEALYGEKRYR 922
Query: 840 SLCYMRPLTIWAMQWALTKPKLSR 863
S+ YMRPL+IW+MQ AL + + R
Sbjct: 923 SIGYMRPLSIWSMQVALERRRAQR 946
>gi|194760205|ref|XP_001962332.1| GF15415 [Drosophila ananassae]
gi|190616029|gb|EDV31553.1| GF15415 [Drosophila ananassae]
Length = 894
Score = 308 bits (790), Expect = 7e-81, Method: Compositional matrix adjust.
Identities = 183/552 (33%), Positives = 283/552 (51%), Gaps = 67/552 (12%)
Query: 103 YRGEFQRFKLFHGICDDAPVLANQFSVFVSRPNG-EKFSSVLCPRS---------PGVPK 152
Y GEF RF++ GI + V ANQF V + P G F S+L S P P
Sbjct: 75 YAGEFCRFQMRPGIYEYNVVQANQFIVTIKDPKGCTIFQSLLSKCSTRGSSGGGNPADPD 134
Query: 153 KN--------------TDSGIESWDWNLKGENCTYHALFPRAWTVYDGEPDPELRIVCRQ 198
+ T + +W N++ C Y L+PR+WT YD +R+VCRQ
Sbjct: 135 ADGEKTKCQLPNCSSRTKQPLSAWHSNIEDSRCQYTGLYPRSWTEYDLS-HYGVRLVCRQ 193
Query: 199 ISPFIPHNYKESSFPASVFTFTLSNSGQTSADVTLLFTWANSVAGDSGLSGHHFNSKTMT 258
+SP IPH Y+ESS P +VF + + N V++ FT+ N + S+ ++
Sbjct: 194 VSPVIPHEYRESSLPCAVFVWDVENVCDQERKVSITFTFKNGTGNKKQDAEGGAESQLIS 253
Query: 259 KDGVHGLTLHHRTANGRPPVTFAVAAEETADVHVSECPCFLLSGNSKGITAKDMWNEIKK 318
+ G+ + + A P ++ +A ++ +S CP F +GN + +W ++K+
Sbjct: 254 EGNAKGVAIRQKIAEM--PCSYNLACRVRPEISISRCPQFDPAGN-----GEQLWAQLKE 306
Query: 319 HGSFDHLDNDKTSPSEPGSSIGAAIAASLTIPSGSTRSVTFSLAWDCPEVKFFEKV--YH 376
HG L+ T + IG A+ A + + ++ + F LAWD P+++F K+ +
Sbjct: 307 HG---QLNEQPTPETLKTKDIGVAVCAQVALKPKASHKLEFVLAWDMPKIQFPRKLQTHT 363
Query: 377 RRYTKFYGTLGDSAARIAHDAILEHAKWECEIEAWQRPILEDKRFPEWYPITLFNELYYL 436
R YTK++ G+S +I A+ ++ WE I+AWQRPIL D+ P+WY +FN+LY++
Sbjct: 364 RYYTKYFDDSGESGPKICEYALKQYPSWERLIDAWQRPILNDETLPDWYKCAIFNQLYFI 423
Query: 437 NAGGTIWTDGSPPMQSLATIRERKFSLDTPSKIHTPASSDTALGTRLLENGEENIGQFLY 496
+ GGTIW + ++ S D P + G+F Y
Sbjct: 424 SDGGTIWL-------KCDSFLGKELSYDDPRLAY---------------------GRFGY 455
Query: 497 LEGAEYVMYNTYDVHFYSSFALVMLFPKLELSIQRDFAAAVMMHDPGTMKIMSDGKWVAR 556
LEG EY MYNTYDVHFY+S AL L+P L++S+Q DF A+ T K++ DGK + R
Sbjct: 456 LEGHEYRMYNTYDVHFYASPALAHLWPNLQVSLQYDFKDAIAADLTDTRKMLYDGKVMPR 515
Query: 557 KCLGAVPHDIGLDD--PWFEINSYNLFNSSRWKDLNSKFVLQVYRDFVATGDKNFARAVW 614
K VPHD+G D P+ IN YN+ + + WKDLN+KFVLQVYRD+ + A+A
Sbjct: 516 KVKNCVPHDLGDPDEEPFTLINCYNIHDVNDWKDLNTKFVLQVYRDYYVLNELAQAQADN 575
Query: 615 PSVYIAMAYMEQ 626
S + ++ ++++
Sbjct: 576 ASKFSSIEFIDK 587
Score = 213 bits (541), Expect = 5e-52, Method: Compositional matrix adjust.
Identities = 114/264 (43%), Positives = 158/264 (59%), Gaps = 12/264 (4%)
Query: 609 FARAVWPSVYIAMAYMEQFDKDGDGMIENEGFPDQTYDAWSANGVSAYCGGLWVAALQAA 668
+ +A++ S M ++DKD DG+IEN PDQTYD+W +G S+YC GLW+AALQA
Sbjct: 633 YLKAMYASCKAIMERTIEYDKDNDGLIENTKMPDQTYDSWVMDGPSSYCSGLWLAALQAM 692
Query: 669 SALANDVGDHASASYFWVRYQKAK-AVYDSLWNGSYFNYDNSDGSSSTSIQADQLAGQWY 727
SA+A + + +K K ++ + LWNGSY+ +D S G T I ADQL G WY
Sbjct: 693 SAMATILDQPNDCMRYQDILEKGKRSLEEKLWNGSYYRFDLSHGHRDT-IMADQLCGHWY 751
Query: 728 ARACGL-LPIADEAKVKKALTKIYDFNVLKVKGGMCGAMNGMQPD-------GRIDMSGL 779
++CG I + V+ AL +IYD NV+ G GA NG + G +D S +
Sbjct: 752 LKSCGFDYEIYPKENVRTALKRIYDNNVMGFHDGNIGAANGFIANASDPSKPGHVDNSNI 811
Query: 780 QAREIWPGVTYGLAASMIQEEMVDMAFQTAAGVYEVAWSEDGLGYSFQTPESWNNNDEYR 839
Q+ E+WPGV Y LAA+MIQE M + AFQTA G+Y+ +G +F+TPE+ YR
Sbjct: 812 QSEEVWPGVVYALAATMIQEGMFEEAFQTAGGMYKTI--SQRIGMNFETPEALYGEKRYR 869
Query: 840 SLCYMRPLTIWAMQWALTKPKLSR 863
S+ YMRPL+IW+MQ AL + + R
Sbjct: 870 SIGYMRPLSIWSMQVALERRRAQR 893
>gi|301628717|ref|XP_002943495.1| PREDICTED: non-lysosomal glucosylceramidase [Xenopus (Silurana)
tropicalis]
Length = 692
Score = 302 bits (774), Expect = 6e-79, Method: Compositional matrix adjust.
Identities = 175/443 (39%), Positives = 228/443 (51%), Gaps = 72/443 (16%)
Query: 419 KRFPEWYPITLFNELYYLNAGGTIWTDGSPPMQSLATIRERKFSLDTPSKIHTPASSDTA 478
K P WY LFNELY++ GGT+W + S +R + D P
Sbjct: 301 KDLPAWYKSALFNELYFMADGGTVWVEVSAADDDDNLMR--RGCKDLPG----------- 347
Query: 479 LGTRLLENGEENIGQFLYLEGAEYVMYNTYDVHFYSSFALVMLFPKLELSIQRDFAAAVM 538
+++ + G+F YLEG EY MYNTYDVHFY+SFAL+ML+P+LE+S+Q D AA+V+
Sbjct: 348 -----MKDVLQEYGRFAYLEGQEYRMYNTYDVHFYASFALIMLWPQLEISLQYDMAASVL 402
Query: 539 MHDPGTMKIMSDGKWVARKCLGAVPHDIGLDDPWFEINSYNLFNSSRWKDLNSKFVLQVY 598
DP K + G + S + + S LQ
Sbjct: 403 QEDPELRKYLMSGTVAP-------------------VKSRSYYYS------GPSLALQT- 436
Query: 599 RDFVATGDKNFARAVWPSVYIAMAYMEQFDKDGDGMIENEGFPDQTYDAWSANGVSAYCG 658
M +FD+DGDG+IEN GF DQTYD W G S+YCG
Sbjct: 437 ---------------------VMETSLKFDEDGDGLIENSGFADQTYDDWVMTGPSSYCG 475
Query: 659 GLWVAALQAASALANDVGDHASASYFWVRYQKAKAVYDSL-WNGSYFNYDNSDGSSSTSI 717
GLW+AA+ +A +GD A+ F + KA ++ L WNG Y+N+D D S S+
Sbjct: 476 GLWLAAVCMMCKMAEVLGDGAAQEKFTDILHRGKAAFEKLLWNGKYYNFDCGDQPYSNSV 535
Query: 718 QADQLAGQWYARACGLLPIADEA----KVKKALTKIYDFNVLKVKGGMCGAMNGMQPDGR 773
DQ AG W+ RACGL E V AL I++ NV + G GA+NGM+PDG
Sbjct: 536 MVDQCAGNWFLRACGLGAGESEVFPKDHVVSALRTIFELNVKQFADGQMGAVNGMRPDGT 595
Query: 774 IDMSGLQAREIWPGVTYGLAASMIQEEMVDMAFQTAAGVYEVAWSEDGLGYSFQTPESWN 833
D S +Q+ E+W GV YGLAA+MI E +V F TA G Y W LG SFQTPE++
Sbjct: 596 TDTSSVQSDEVWIGVVYGLAATMIHEGLVQEGFATAEGCYRTVWER--LGMSFQTPEAYC 653
Query: 834 NNDEYRSLCYMRPLTIWAMQWAL 856
+RSL YMRPL+IWAMQ AL
Sbjct: 654 EKQVFRSLAYMRPLSIWAMQLAL 676
>gi|390344669|ref|XP_001200431.2| PREDICTED: non-lysosomal glucosylceramidase-like, partial
[Strongylocentrotus purpuratus]
Length = 347
Score = 302 bits (773), Expect = 7e-79, Method: Compositional matrix adjust.
Identities = 156/325 (48%), Positives = 202/325 (62%), Gaps = 9/325 (2%)
Query: 541 DPGTMKIMSDGKWVARKCLGAVPHDIG--LDDPWFEINSYNLFNSSRWKDLNSKFVLQVY 598
D + + +S G RK G VPHD G +DPW ++N Y +++ WKDLN K VL VY
Sbjct: 13 DEDSFQDLSSGLRSIRKSAGCVPHDSGNPEEDPWLKLNCYWFHDTAEWKDLNLKMVLMVY 72
Query: 599 RDFVATGDKNFARAVWPSVYIAMAYMEQFDKDGDGMIENEGFPDQTYDAWSANGVSAYCG 658
RD+ AT D+ F +WP MA DKDGDG+I++ G DQTYD W+A G SAYCG
Sbjct: 73 RDYFATKDREFLDFMWPKCMTVMAKANSQDKDGDGLIDHFGTADQTYDVWTAKGCSAYCG 132
Query: 659 GLWVAALQAASALANDVGDHASASYFWVRYQKAKAVYD-SLWNGSYFNYDNSDGSSSTSI 717
GL++AAL+ +A+ +GD + + Q+ KA Y+ LWNG YFNYD+SD S SI
Sbjct: 133 GLYLAALKCMCEMADLLGDDKAKRQYSEILQRGKATYEKKLWNGRYFNYDSSDQSYHDSI 192
Query: 718 QADQLAGQWYARACGLLPIADEA----KVKKALTKIYDFNVLKVKGGMCGAMNGMQPDGR 773
ADQ G WY RAC L+P D+ V+ AL I+D NV+ VK G GAMNG++P G+
Sbjct: 193 MADQTCGHWYLRACDLVPERDQVFQVDHVRSALRTIFDMNVMGVKEGNFGAMNGIRPSGK 252
Query: 774 IDMSGLQAREIWPGVTYGLAASMIQEEMVDMAFQTAAGVYEVAWSEDGLGYSFQTPESWN 833
D + LQ E+W G TYGLA +MIQE M + F+TA G Y + + GL Y Q PE++
Sbjct: 253 PDHTSLQGEEVWVGTTYGLAGNMIQEGMWEEGFRTAKGCYTTCYEQAGLAY--QVPEAYM 310
Query: 834 NNDEYRSLCYMRPLTIWAMQWALTK 858
+ YRSL YMRPL IWAMQWA+ K
Sbjct: 311 SKKIYRSLGYMRPLAIWAMQWAVEK 335
>gi|427702545|ref|YP_007045767.1| bile acid beta-glucosidase [Cyanobium gracile PCC 6307]
gi|427345713|gb|AFY28426.1| putative bile acid beta-glucosidase [Cyanobium gracile PCC 6307]
Length = 835
Score = 300 bits (769), Expect = 2e-78, Method: Compositional matrix adjust.
Identities = 247/831 (29%), Positives = 383/831 (46%), Gaps = 122/831 (14%)
Query: 86 DQGIPLGGIGAGSIGRSYRGEFQRFKL-----FHGICDDAPVLANQFSVFVSRPNGEKFS 140
+ G+PLGG GAG +GR G F + L + G+ D QF++F +G +
Sbjct: 51 NHGMPLGGFGAGCLGRGLDGAFNLWHLDGGEHWFGVLPDC-----QFALFEQDGDGRRAH 105
Query: 141 SVLCPRSPGVPKKNTDSGIESWDW----NLKGENCTYHALFPRAWTVYDGEPDPELRIVC 196
++ + + + SW+W + + T +P + Y ++R C
Sbjct: 106 ALATAPQADDSRPGSGPPLASWNWYPAATPEQPSGTVAVRYPISSHHYAAVFSADVR--C 163
Query: 197 RQISPFIPHNYKESSFPASVFTFTLSNSGQTSADVTLLFTWANSVA-------------G 243
SP +P +Y+ +S+P +VF +T SN +++LL +W N+V
Sbjct: 164 EAFSPILPGDYRRTSYPVAVFAWTFSNPTNQPLELSLLLSWRNTVGWFTNTDPAAEVHFR 223
Query: 244 DSGLSGHHF----------NSKTMTKDGVHGLTLH-HRT---ANGRPPVTFAVAAEETAD 289
D G H++ ++ + G+ GL L RT A G AV E +
Sbjct: 224 DDGSPEHNYVPAIGRGKGQRNRWVDGAGLKGLLLEGERTQPLAEGEGQWCLAVPDEAALN 283
Query: 290 ---VHVSECPCFLLSGNSKGITAKDMWNEIKKHGSFDHLDNDKTSPSEPGSSIGAAIAAS 346
V V C + SG+ ++W + GS DND+ S + + AA+A
Sbjct: 284 HPGVEVFRCSRWDPSGD-----GAELWEPFSRDGSIPDSDNDRRSTGD--DPLSAALAVR 336
Query: 347 LTIPSGSTRSVTFSLAWDCPEVKFFEKVYH-RRYTKFYGTLGDSAARIAHDAILEHAKWE 405
+ G++ + ++WD P F RRYT F+ GDSAA IA +A+ + W+
Sbjct: 337 FRLEPGASLEIPVVISWDLPVTAFATGTRALRRYTDFFSAEGDSAAAIAGEALADWRDWQ 396
Query: 406 CEIEAWQRPILEDKRFPEWYPITLFNELYYLNAGGTIWTDGSPPMQSLATIRERKFSLDT 465
IEAWQ P+++ E + L NELY L +GG++WT SP
Sbjct: 397 AAIEAWQAPVVQRHDLAEPLRMALLNELYDLASGGSLWTAASP----------------- 439
Query: 466 PSKIHTPASSDTALGTRLLENGEENIGQFLYLEGAEYVMYNTYDVHFYSSFALVMLFPKL 525
+ +G+F LE +Y Y + DV Y SFAL+ L+P+L
Sbjct: 440 ----------------------GDPVGRFGVLECLDYAWYESLDVRLYGSFALLQLWPEL 477
Query: 526 ELSIQRDFAAAVMMHDPGTMKI---MSDGKW---VARKCLGAVPHDIGL-DDPWFEINSY 578
+ ++ R FA A+ DP I + G+ ARK GA PHD+G ++ F+ +Y
Sbjct: 478 DKAVLRSFARAIPAADPTPRPIGWYFTQGRGRVEAARKVAGATPHDLGAPNERPFDATNY 537
Query: 579 NLF-NSSRWKDLNSKFVLQVYRDFV---ATGDKNFARAVWPSVYIAMAYMEQFDKDGDGM 634
+ + + WKDL S +VLQV+R + + D F WP+ A+ Y++QFD + DG+
Sbjct: 538 TAYQDCNLWKDLASDYVLQVWRTYSLAPSGPDIRFLADCWPAAVTALDYLKQFDVNDDGL 597
Query: 635 IENEGFPDQTYDAWSANGVSAYCGGLWVAALQAASA--------LANDVGDHASASYFWV 686
+N G PDQT+D W GVSAYCG LW+AAL+AA A L D + W+
Sbjct: 598 PDNGGAPDQTFDDWPLQGVSAYCGALWIAALEAALAIGQRLQLELGLDTAEEQRRFGGWL 657
Query: 687 RYQKAKAVYDS-LWNGSYFNYDNSDGSSSTSIQADQLAGQWYARACGLLPIADEAKVKKA 745
+++++ +D+ LWNG Y+ D GS + + ADQL G +YAR L P+ + + +
Sbjct: 658 --EQSRSHFDTLLWNGEYYAIDA--GSGTPVVMADQLCGDFYARLLALPPVVADERALSS 713
Query: 746 LTKIYDFNVLKVKGGMCGAMNGMQPDGR-IDMSGLQAREIWPGVTYGLAASMIQEEMVDM 804
L I +GG G NG++ DG +D G E+W G+ +GLAA
Sbjct: 714 LKAIRQACFESFEGGRLGVANGLRRDGTPLDPEGTHPLEVWTGINFGLAAYYRLMGQSST 773
Query: 805 AFQTAAGVYEVAWSEDGLGYSFQTPESWNNNDEYRSLCYMRPLTIWAMQWA 855
AF V + E GL F+TPE+ +R+ Y+R + IWA+ WA
Sbjct: 774 AFAITGAVVRQVY-EGGL--QFRTPEAITAAGTFRACHYLRAMAIWAL-WA 820
>gi|198473190|ref|XP_002133205.1| GA28784 [Drosophila pseudoobscura pseudoobscura]
gi|198139343|gb|EDY70607.1| GA28784 [Drosophila pseudoobscura pseudoobscura]
Length = 865
Score = 299 bits (765), Expect = 5e-78, Method: Compositional matrix adjust.
Identities = 174/551 (31%), Positives = 279/551 (50%), Gaps = 67/551 (12%)
Query: 103 YRGEFQRFKLFHGICDDAPVLANQFSVFVSRPNGEKFSSVLCPRSPGVPKKNTDSG---- 158
+ GEF RF++ GI + V ANQF V + G L + + + N S
Sbjct: 48 FAGEFCRFQMRPGIYEYNVVQANQFIVTIKDQKGCTIFQSLLSKCSTMSRGNNGSDPDAD 107
Query: 159 ------------------IESWDWNLKGENCTYHALFPRAWTVYDGEPDPELRIVCRQIS 200
+ +W N++ C+Y L+PR+WT YD +R+VCRQ+S
Sbjct: 108 GEKTKCQLPNCSSRPKQPLSAWHSNIEDARCSYTGLYPRSWTEYDLS-HYGVRLVCRQVS 166
Query: 201 PFIPHNYKESSFPASVFTFTLSNSGQTSADVTLLFTWANSVAGDSGLSGHHFNSKTMTKD 260
P IPH Y++SS P +VF +++ N V++ F++ N + S+ ++
Sbjct: 167 PVIPHEYRDSSLPCAVFVWSVENVCDQERIVSITFSFKNGTGNKKQDAEGGAESQLISAG 226
Query: 261 GVHGLTLHHRTANGRPPVTFAVAAEETADVHVSECPCFLLSGNSKGITAKDMWNEIKKHG 320
G+ + + A+ P ++ +A ++ ++ CP F +G + + +W ++K+HG
Sbjct: 227 NAKGVAIRQKIADM--PCSYNLACRVLPEISITRCPQFDPAG-----SGEQLWAQLKEHG 279
Query: 321 SFDHLDNDKTSPSEPGSSIGAAIAASLTIPSGSTRSVTFSLAWDCPEVKFFEKV--YHRR 378
L T + IG A+ A L + ++ + F LAWD P ++F K+ + R
Sbjct: 280 ---QLSEQPTGETLKTKDIGVAVCAQLALKPQASHDLEFVLAWDMPRIQFPRKMQTHMRY 336
Query: 379 YTKFYGTLGDSAARIAHDAILEHAKWECEIEAWQRPILEDKRFPEWYPITLFNELYYLNA 438
YTK++ G+S +I A+ ++ WE I+AWQRPIL D P+WY +FN+LY+++
Sbjct: 337 YTKYFDDSGESGPKICEYALKQYPSWERLIDAWQRPILSDDTLPDWYKCAIFNQLYFISD 396
Query: 439 GGTIWTDGSPPMQSLATIRERKFSLDTPSKIHTPASSDTALGTRL-LENGEENIGQFLYL 497
GGTIW D++LG L ++ G+F YL
Sbjct: 397 GGTIWL-----------------------------KCDSSLGKELAYDDPRLAYGRFGYL 427
Query: 498 EGAEYVMYNTYDVHFYSSFALVMLFPKLELSIQRDFAAAVMMHDPGTMKIMSDGKWVARK 557
EG EY MYNTYDVHFY+S AL L+P L++S+Q DF A+ T K++ DGK + RK
Sbjct: 428 EGHEYRMYNTYDVHFYASPALAHLWPNLQVSLQYDFKDAIAAELSDTRKMLYDGKVMPRK 487
Query: 558 CLGAVPHDIGLDD--PWFEINSYNLFNSSRWKDLNSKFVLQVYRDFVATGDKNFARAVWP 615
VPHD+G D P+ IN YN+ + + WKDLN+KFVLQVYRD+ + ++A
Sbjct: 488 VKNCVPHDLGDPDEEPFTLINCYNIHDVNDWKDLNTKFVLQVYRDYYVLNELAQSQADNA 547
Query: 616 SVYIAMAYMEQ 626
S + ++ ++++
Sbjct: 548 SKFSSIEFIDK 558
Score = 210 bits (535), Expect = 3e-51, Method: Compositional matrix adjust.
Identities = 116/264 (43%), Positives = 158/264 (59%), Gaps = 12/264 (4%)
Query: 609 FARAVWPSVYIAMAYMEQFDKDGDGMIENEGFPDQTYDAWSANGVSAYCGGLWVAALQAA 668
+ +A++ + M ++DKD DG+IEN PDQTYD+W +G SAYC GLW+AALQA
Sbjct: 604 YLKAMYGACKAIMERTIEYDKDNDGLIENTKMPDQTYDSWVMDGPSAYCSGLWLAALQAM 663
Query: 669 SALANDVGDHASASYFWVRYQKAK-AVYDSLWNGSYFNYDNSDGSSSTSIQADQLAGQWY 727
SA+A + F +K K ++ + LWNGSY+ +D S S SI ADQL G WY
Sbjct: 664 SAMATILDQPNDCLRFQDILEKGKRSLEEKLWNGSYYRFDLSP-SHRDSIMADQLCGHWY 722
Query: 728 ARACGL-LPIADEAKVKKALTKIYDFNVLKVKGGMCGAMNGMQPD-------GRIDMSGL 779
++CG I + V+ AL IYD NV+ G GA NG + G +D S +
Sbjct: 723 LKSCGFDYEIYPKENVRTALKTIYDNNVMGFHEGNIGAANGFIANAADPSKPGHVDNSNI 782
Query: 780 QAREIWPGVTYGLAASMIQEEMVDMAFQTAAGVYEVAWSEDGLGYSFQTPESWNNNDEYR 839
QA E+WPGV Y LAA+MIQE M + AFQTA G+Y+ + +G +F+TPE+ YR
Sbjct: 783 QAEEVWPGVVYALAATMIQEGMFEEAFQTAGGMYKTL--SERIGMNFETPEALYGEKRYR 840
Query: 840 SLCYMRPLTIWAMQWALTKPKLSR 863
S+ YMRPL+IW+MQ AL + + R
Sbjct: 841 SIGYMRPLSIWSMQVALERRRALR 864
>gi|195164866|ref|XP_002023267.1| GL21266 [Drosophila persimilis]
gi|194105352|gb|EDW27395.1| GL21266 [Drosophila persimilis]
Length = 867
Score = 299 bits (765), Expect = 5e-78, Method: Compositional matrix adjust.
Identities = 174/551 (31%), Positives = 279/551 (50%), Gaps = 67/551 (12%)
Query: 103 YRGEFQRFKLFHGICDDAPVLANQFSVFVSRPNGEKFSSVLCPRSPGVPKKNTDSG---- 158
+ GEF RF++ GI + V ANQF V + G L + + + N S
Sbjct: 50 FAGEFCRFQMRPGIYEYNVVQANQFIVTIKDQKGCTIFQSLLSKCSTMSRGNNGSDPDAD 109
Query: 159 ------------------IESWDWNLKGENCTYHALFPRAWTVYDGEPDPELRIVCRQIS 200
+ +W N++ C+Y L+PR+WT YD +R+VCRQ+S
Sbjct: 110 GEKTKCQLPNCSSRPKQPLSAWHSNIEDARCSYTGLYPRSWTEYDLS-HYGVRLVCRQVS 168
Query: 201 PFIPHNYKESSFPASVFTFTLSNSGQTSADVTLLFTWANSVAGDSGLSGHHFNSKTMTKD 260
P IPH Y++SS P +VF +++ N V++ F++ N + S+ ++
Sbjct: 169 PVIPHEYRDSSLPCAVFVWSVENVCDQERIVSITFSFKNGTGNKKQDAEGGAESQLISAG 228
Query: 261 GVHGLTLHHRTANGRPPVTFAVAAEETADVHVSECPCFLLSGNSKGITAKDMWNEIKKHG 320
G+ + + A+ P ++ +A ++ ++ CP F +G + + +W ++K+HG
Sbjct: 229 NAKGVAIRQKIADM--PCSYNLACRVLPEISITRCPQFDPAG-----SGEQLWAQLKEHG 281
Query: 321 SFDHLDNDKTSPSEPGSSIGAAIAASLTIPSGSTRSVTFSLAWDCPEVKFFEKV--YHRR 378
L T + IG A+ A L + ++ + F LAWD P ++F K+ + R
Sbjct: 282 ---QLSEQPTGETLKTKDIGVAVCAQLALKPQASHDLEFVLAWDMPRIQFPRKMQTHMRY 338
Query: 379 YTKFYGTLGDSAARIAHDAILEHAKWECEIEAWQRPILEDKRFPEWYPITLFNELYYLNA 438
YTK++ G+S +I A+ ++ WE I+AWQRPIL D P+WY +FN+LY+++
Sbjct: 339 YTKYFDDSGESGPKICEYALKQYPSWERLIDAWQRPILSDDTLPDWYKCAIFNQLYFISD 398
Query: 439 GGTIWTDGSPPMQSLATIRERKFSLDTPSKIHTPASSDTALGTRL-LENGEENIGQFLYL 497
GGTIW D++LG L ++ G+F YL
Sbjct: 399 GGTIWL-----------------------------KCDSSLGKELAYDDPRLAYGRFGYL 429
Query: 498 EGAEYVMYNTYDVHFYSSFALVMLFPKLELSIQRDFAAAVMMHDPGTMKIMSDGKWVARK 557
EG EY MYNTYDVHFY+S AL L+P L++S+Q DF A+ T K++ DGK + RK
Sbjct: 430 EGHEYRMYNTYDVHFYASPALAHLWPNLQVSLQYDFKDAIAAELSDTRKMLYDGKVMPRK 489
Query: 558 CLGAVPHDIGLDD--PWFEINSYNLFNSSRWKDLNSKFVLQVYRDFVATGDKNFARAVWP 615
VPHD+G D P+ IN YN+ + + WKDLN+KFVLQVYRD+ + ++A
Sbjct: 490 VKNCVPHDLGDPDEEPFTLINCYNIHDVNDWKDLNTKFVLQVYRDYYVLNELAQSQADNA 549
Query: 616 SVYIAMAYMEQ 626
S + ++ ++++
Sbjct: 550 SKFSSIEFIDK 560
Score = 210 bits (535), Expect = 3e-51, Method: Compositional matrix adjust.
Identities = 116/264 (43%), Positives = 158/264 (59%), Gaps = 12/264 (4%)
Query: 609 FARAVWPSVYIAMAYMEQFDKDGDGMIENEGFPDQTYDAWSANGVSAYCGGLWVAALQAA 668
+ +A++ + M ++DKD DG+IEN PDQTYD+W +G SAYC GLW+AALQA
Sbjct: 606 YLKAMYGACKAIMERTIEYDKDNDGLIENTKMPDQTYDSWVMDGPSAYCSGLWLAALQAM 665
Query: 669 SALANDVGDHASASYFWVRYQKAK-AVYDSLWNGSYFNYDNSDGSSSTSIQADQLAGQWY 727
SA+A + F +K K ++ + LWNGSY+ +D S S SI ADQL G WY
Sbjct: 666 SAMATILDQPNDCLRFQDILEKGKRSLEEKLWNGSYYRFDLSP-SHRDSIMADQLCGHWY 724
Query: 728 ARACGL-LPIADEAKVKKALTKIYDFNVLKVKGGMCGAMNGMQPD-------GRIDMSGL 779
++CG I + V+ AL IYD NV+ G GA NG + G +D S +
Sbjct: 725 LKSCGFDYEIYPKENVRTALKTIYDNNVMGFHEGNIGAANGFIANAADPSKPGHVDNSNI 784
Query: 780 QAREIWPGVTYGLAASMIQEEMVDMAFQTAAGVYEVAWSEDGLGYSFQTPESWNNNDEYR 839
QA E+WPGV Y LAA+MIQE M + AFQTA G+Y+ + +G +F+TPE+ YR
Sbjct: 785 QAEEVWPGVVYALAATMIQEGMFEEAFQTAGGMYKTL--SERIGMNFETPEALYGEKRYR 842
Query: 840 SLCYMRPLTIWAMQWALTKPKLSR 863
S+ YMRPL+IW+MQ AL + + R
Sbjct: 843 SIGYMRPLSIWSMQVALERRRALR 866
>gi|256082539|ref|XP_002577512.1| bile acid beta-glucosidase-related [Schistosoma mansoni]
Length = 717
Score = 297 bits (760), Expect = 2e-77, Method: Compositional matrix adjust.
Identities = 217/640 (33%), Positives = 319/640 (49%), Gaps = 60/640 (9%)
Query: 49 FRHLAHIGFRLYRYSKQEEAKGKVPVFNVFRKHHITSDQGIPLGGIGAGSIGRSYRGEFQ 108
F L + + + Y K+ + ++P + G+P+GGIG+GSIGR YRGEF
Sbjct: 63 FSKLRAVFSKRFFYIKKRFIQHRLPFIDAVSHVPWRPIYGVPIGGIGSGSIGRGYRGEFC 122
Query: 109 RFKLFHGIC--DDAPVLANQFSVFVSRPNGEKFSSVLCP--RSPGVPKKNTDSGIESWDW 164
R L G+ D PV +QF V V + + VL P + P KK ++ W W
Sbjct: 123 RSSLIPGMYSYDVQPV--DQFIVTVRKNGVTVYHQVLSPLTKPPSNAKK-----LKKWKW 175
Query: 165 NLKGENCTYHALFPRAWTVYDGEPDPELRIVCRQISPFIPHNYKESSFPASVFTFTLSNS 224
E Y L+PR+WTVY+ P+ +L +V +QISP IPH+Y+ + P +VF + + +
Sbjct: 176 GFDPEKGHYLGLYPRSWTVYEI-PELQLVLVYQQISPVIPHDYQTTCLPVAVFYWKVISW 234
Query: 225 GQTSADVTLLFTWANSVAGDSGLSGHHFNSKTMTKDGVHGLTLHHRTA-NGRPPVTFAVA 283
+ + VT+ F+W S S+ + H TA N P +
Sbjct: 235 NKENLKVTITFSWHGPNHRKRTKSSDFSKSEDGNNSSENTTNTHVSTAENAFPTCSQRST 294
Query: 284 AEETADVHVSECPCFLLSGNSKGITAKDMWNEIKKHGSFDHLDNDKTSPSEPGSSIGAAI 343
A T D L N +G K ++ G S ++ + A+
Sbjct: 295 AFSTGD---------FLKLNCQGKKLKTF-TDLSWLGWLVGYTVPLDSKAKKSPKLAIAV 344
Query: 344 AASLTIP--------SGSTRSVTFSLAWDCPEVKFF--EKVYHRRYTKFYGTLGDSAARI 393
+A+ +P + S + F++ W P VKF + VY RRY +++ S A++
Sbjct: 345 SATTNVPRCNITDNLNPSQSELEFAITWHSPIVKFRTGDVVYTRRYVRWFPVDRISGAKL 404
Query: 394 --AHDAILEHAKWECEIEAWQRPILEDKRFPEWYPITLFNELYYLNAGGTIWTDGSPPMQ 451
AH AI +W +IE WQ PIL ++ P WY LFNELYYL+ GGT+W D P+Q
Sbjct: 405 LLAH-AIDNWRQWVQKIEDWQNPILSNQSLPSWYKSALFNELYYLSDGGTVWLD---PIQ 460
Query: 452 SLATIRERKFSLDTPSKIHTPASSDTALGTRLLENGEENIGQFLYLEGAEYVMYNTYDVH 511
T + S D +++ IG F YLEG EY MYNTYDVH
Sbjct: 461 VDLTEDMKVDSWDHRARL------------------SREIGLFGYLEGHEYRMYNTYDVH 502
Query: 512 FYSSFALVMLFPKLELSIQRDFAAAVMMHDPGTMKIMSDGKWVARKCLGAVPHDIG--LD 569
Y+S+AL+ L+PK++L++ D A + D ++ + +GK + R AV HD G D
Sbjct: 503 HYASWALIKLWPKIQLALNYDCADLTIAEDSTSVYYIGNGKALFRSSECAVVHDFGDPED 562
Query: 570 DPWFEINSYNLFNSSRWKDLNSKFVLQVYRDFVATGDKNFARAVWPSVY-IAMAYMEQFD 628
+PW N+Y +F + WKDLNSKF+LQV+RD+ T D + + P V I + +D
Sbjct: 563 EPWRCTNAYIMFPTDTWKDLNSKFILQVWRDWRITQDHQYLLYMLPIVLRILRKSLVAWD 622
Query: 629 KDGDGMIENEGFPDQTYDAWSANGVSAYCGGLWVAALQAA 668
D DG+IEN GFPDQTYD W+A G++AY GG+W++ L A
Sbjct: 623 SDDDGLIENSGFPDQTYDVWTAKGLTAYTGGMWLSCLYAT 662
>gi|353232146|emb|CCD79501.1| bile acid beta-glucosidase-related [Schistosoma mansoni]
Length = 725
Score = 295 bits (755), Expect = 9e-77, Method: Compositional matrix adjust.
Identities = 218/640 (34%), Positives = 321/640 (50%), Gaps = 52/640 (8%)
Query: 49 FRHLAHIGFRLYRYSKQEEAKGKVPVFNVFRKHHITSDQGIPLGGIGAGSIGRSYRGEFQ 108
F L + + + Y K+ + ++P + G+P+GGIG+GSIGR YRGEF
Sbjct: 63 FSKLRAVFSKRFFYIKKRFIQHRLPFIDAVSHVPWRPIYGVPIGGIGSGSIGRGYRGEFC 122
Query: 109 RFKLFHGIC--DDAPVLANQFSVFVSRPNGEKFSSVLCP--RSPGVPKKNTDSGIESWDW 164
R L G+ D PV +QF V V + + VL P + P KK ++ W W
Sbjct: 123 RSSLIPGMYSYDVQPV--DQFIVTVRKNGVTVYHQVLSPLTKPPSNAKK-----LKKWKW 175
Query: 165 NLKGENCTYHALFPRAWTVYDGEPDPELRIVCRQISPFIPHNYKESSFPASVFTFTLSNS 224
E Y L+PR+WTVY+ P+ +L +V +QISP IPH+Y+ + P +VF + + +
Sbjct: 176 GFDPEKGHYLGLYPRSWTVYEI-PELQLVLVYQQISPVIPHDYQTTCLPVAVFYWKVISW 234
Query: 225 GQTSADVTLLFTWANSVAGDSGLSGHHFNSKTMTKDGVHGLTLHHRTA-NGRPPVTFAVA 283
+ + VT+ F+W S S+ + H TA N P +
Sbjct: 235 NKENLKVTITFSWHGPNHRKRTKSSDFSKSEDGNNSSENTTNTHVSTAENAFPTCSQRST 294
Query: 284 AEETADVHVSECPCFLLSGNSKGITAKDMWNEIKKHGSFDHLDNDKTSPSEPGSSIGAAI 343
A T D L N +G K ++ G S ++ + A+
Sbjct: 295 AFSTGD---------FLKLNCQGKKLKTF-TDLSWLGWLVGYTVPLDSKAKKSPKLAIAV 344
Query: 344 AASLTIP--------SGSTRSVTFSLAWDCPEVKFF--EKVYHRRYTKFYGTLGDSAARI 393
+A+ +P + S + F++ W P VKF + VY RRY +++ S A++
Sbjct: 345 SATTNVPRCNITDNLNPSQSELEFAITWHSPIVKFRTGDVVYTRRYVRWFPVDRISGAKL 404
Query: 394 --AHDAILEHAKWECEIEAWQRPILEDKRFPEWYPITLFNELYYLNAGGTIWTDGSPPMQ 451
AH AI +W +IE WQ PIL ++ P WY LFNELYYL+ GGT+W D P+Q
Sbjct: 405 LLAH-AIDNWRQWVQKIEDWQNPILSNQSLPSWYKSALFNELYYLSDGGTVWLD---PIQ 460
Query: 452 SLATIRERKFSLDTPSKIHTPASSDTALGTRLLENGEENIGQFLYLEGAEYVMYNTYDVH 511
+ + +L K+ + RL IG F YLEG EY MYNTYDVH
Sbjct: 461 -VDCFKSDLLNLTEDMKVDSWDHR-----ARL----SREIGLFGYLEGHEYRMYNTYDVH 510
Query: 512 FYSSFALVMLFPKLELSIQRDFAAAVMMHDPGTMKIMSDGKWVARKCLGAVPHDIG--LD 569
Y+S+AL+ L+PK++L++ D A + D ++ + +GK + R AV HD G D
Sbjct: 511 HYASWALIKLWPKIQLALNYDCADLTIAEDSTSVYYIGNGKALFRSSECAVVHDFGDPED 570
Query: 570 DPWFEINSYNLFNSSRWKDLNSKFVLQVYRDFVATGDKNFARAVWPSVY-IAMAYMEQFD 628
+PW N+Y +F + WKDLNSKF+LQV+RD+ T D + + P V I + +D
Sbjct: 571 EPWRCTNAYIMFPTDTWKDLNSKFILQVWRDWRITQDHQYLLYMLPIVLRILRKSLVAWD 630
Query: 629 KDGDGMIENEGFPDQTYDAWSANGVSAYCGGLWVAALQAA 668
D DG+IEN GFPDQTYD W+A G++AY GG+W++ L A
Sbjct: 631 SDDDGLIENSGFPDQTYDVWTAKGLTAYTGGMWLSCLYAT 670
>gi|357630375|gb|EHJ78536.1| hypothetical protein KGM_16417 [Danaus plexippus]
Length = 912
Score = 295 bits (754), Expect = 1e-76, Method: Compositional matrix adjust.
Identities = 188/522 (36%), Positives = 263/522 (50%), Gaps = 69/522 (13%)
Query: 103 YRGEFQRFKLFHGICDDAPVLANQFSVFVSRPNGEK-FSSVLCPRSPGVPKKNTDSGIES 161
++GEF RF+L+ G+ + V QF V + E F SVL S PKK + S
Sbjct: 51 FKGEFCRFQLYPGLYEYVTVPECQFIVNIRDAKKETIFQSVLSTYSK--PKK----VLPS 104
Query: 162 WDWNLKGENCTYHALFPRAWTVYDGEPDPELRIVCRQISPFIPHNYKESSFPASVFTFTL 221
W+WN+KG +C Y AL+PRAWT YD ++++CRQISP IPHNYK+SS P +VF F+
Sbjct: 105 WEWNIKGADCEYTALYPRAWTTYD-LTKYGIKLICRQISPVIPHNYKDSSLPCAVFVFSA 163
Query: 222 SNSGQTSADVTLLFTWANSVAGDSGLSGHHFNSKTMTKDGVHGLTLHHRTANGRPPVTFA 281
N DV++ FTW + G + + + + G+TL + A+ P TF
Sbjct: 164 KNISNEQRDVSITFTWTECLGESKKKGGCKLDLERYSAENTCGVTLEQKIADT--PCTFT 221
Query: 282 VAAEETADVHVSECPCFLLSGNSKGITAKDMWNEIKKHGSF-------------DHLDND 328
+A + + + E C S + G +W +KKHG D D D
Sbjct: 222 IAKNKPDNETIHEGYCLWNSTKTSGY----VWECLKKHGRLEPDPSQTPPPPKVDKTDKD 277
Query: 329 KTSPSEPGSSI--GAAIAASLTI---PSGSTRSVTFSLAWDCPEVKFFE--KVYHRRYTK 381
E I +IA S TI P GS S F L WD P +K+ + KV+ R YTK
Sbjct: 278 NKEDKEKAKKIYKNDSIALSSTISLDPRGSG-STEFCLVWDMPVIKYKKDSKVHKRYYTK 336
Query: 382 FYGTLGDSAARIAHDAILEHAKWECEIEAWQRPILEDKRFPEWYPITLFNELYYLNAGGT 441
++G+ G + A IA A+ + +WE ++ WQ PIL + P+W L NELY++ GGT
Sbjct: 337 YFGSDGIAGASIASYALRSYRRWEKQLAEWQDPILNNSCIPDWLKSALMNELYFVADGGT 396
Query: 442 IWTDGSPPMQSLATIRERKFSLDTPSKIHTPASSDTALGTRLLENGEENIGQFLYLEGAE 501
IW D + D P DT + G F YLEG E
Sbjct: 397 IWFD---------------VAEDYP---------DT--------DPRHEFGVFGYLEGHE 424
Query: 502 YVMYNTYDVHFYSSFALVMLFPKLELSIQRDFAAAVMMHDPGTMKIMSDGKWVARKCLGA 561
Y MYNTYDVHFY+SFAL L+P L++ +Q F ++ + + DGK K +
Sbjct: 425 YRMYNTYDVHFYASFALAQLWPNLQVVLQYMFRESISVEINKLRPSLYDGKSCKFKTKDS 484
Query: 562 VPHDIGLDD--PWFEINSYNLFNSSRWKDLNSKFVLQVYRDF 601
+PHD+G D P+ IN+YN+ + S W+DLN KF+LQV RD+
Sbjct: 485 IPHDLGEPDGVPFSHINAYNIHDVSEWRDLNLKFILQVMRDY 526
Score = 216 bits (550), Expect = 4e-53, Method: Compositional matrix adjust.
Identities = 121/263 (46%), Positives = 159/263 (60%), Gaps = 9/263 (3%)
Query: 602 VATGDKNFARAVWPSVYIAMAYMEQFDKDGDGMIENEGFPDQTYDAWSANGVSAYCGGLW 661
V K + ++PS + +D+DGDG+IEN GFPDQTYDAW G SAYCGGLW
Sbjct: 635 VVWNSKQYLSDMYPSCVTLLRRGLDWDRDGDGLIENGGFPDQTYDAWVMTGPSAYCGGLW 694
Query: 662 VAALQAASALANDVGDHASASYFWVRYQKAKAVYD-SLWNGSYFNYDNSDGSSSTSIQAD 720
VA++ A A+A +G F +KA+ Y+ LWNGSY+ +D +S + AD
Sbjct: 695 VASVSAVHAMAKILGFTDDEKEFSTLLEKARDSYERKLWNGSYYKFDTKPCNSEV-VMAD 753
Query: 721 QLAGQWYARACGLL-PIADEAKVKKALTKIYDFNVLKVKGGMCGAMNGMQ--PDGRIDMS 777
QLAGQW+ RA G P+ EA VKKAL IY+ NV + G GA+NG P ID +
Sbjct: 754 QLAGQWFLRASGWTEPVFPEANVKKALHTIYENNVQRFLNGRMGAVNGFVRGPRPGIDTT 813
Query: 778 GLQAREIWPGVTYGLAASMIQEEMVDMAFQTAAGVYEVAWSEDGLGYSFQTPESWNNNDE 837
+Q+ E+W GVTYGLAA MI E M + AF TA G+Y +G +F+TPE+ N
Sbjct: 814 AIQSEEVWTGVTYGLAALMIYEGMHEQAFSTAGGLYNTLMK---MGLAFETPEALYENGN 870
Query: 838 YRSLCYMRPLTIWAMQWA-LTKP 859
+RS+ YMRPL+IW+M A +TKP
Sbjct: 871 HRSVAYMRPLSIWSMYHAIITKP 893
>gi|124023625|ref|YP_001017932.1| bile acid beta-glucosidase [Prochlorococcus marinus str. MIT 9303]
gi|123963911|gb|ABM78667.1| Predicted bile acid beta-glucosidase [Prochlorococcus marinus str.
MIT 9303]
Length = 837
Score = 293 bits (751), Expect = 2e-76, Method: Compositional matrix adjust.
Identities = 244/820 (29%), Positives = 387/820 (47%), Gaps = 106/820 (12%)
Query: 87 QGIPLGGIGAGSIGRSYRGEFQRFKLFHGICDDAPVLANQFSVFVSRPNGEKFSSVLCPR 146
G+PLGG GAG IGRS RG+F + L G + QF++F + + + ++
Sbjct: 55 HGMPLGGFGAGCIGRSPRGDFNLWNLDAGEHWQGSIPDCQFALFERQGDQVRAHALATAP 114
Query: 147 SPGVPKKNTDSGIESWDW----NLKGENCTYHALFPRAWTVYDGEPDPELRIVCRQISPF 202
+ D + SW W + +Y A +P +WT Y G EL VC SP
Sbjct: 115 QTDDSQPELDKPLSSWSWYPASTEQRTTGSYSARYPLSWTHYKGVFAAEL--VCEAFSPI 172
Query: 203 IPHNYKESSFPASVFTFTLSNSGQTSADVTLLFTWANSVA-------------GDSGLSG 249
+P +Y+ +S+P +VF + L N S +++LL +W N+ D G
Sbjct: 173 LPGDYRRTSYPLAVFRWQLQNPTSQSLELSLLLSWRNTCGWFTNTDPSASVHFRDDGSPE 232
Query: 250 HHF----------NSKTMTKDGVHGLTLHHRTAN----GRPPVTFAVAAEETADVHVSEC 295
H + ++ + + G+ G+ + A+ G+ AV + V V C
Sbjct: 233 HSYVPAIGRGEGQCNRWIDQPGLIGVLMEGERADPLAEGQGQWCLAVP-DHLPGVEVMRC 291
Query: 296 PCFLLSGNSKGITAKDMWNEIKKHGSFDHLDNDKTSPSEPGSSIGAAIAASLTIPSGSTR 355
+ SG+ ++W+ G+ +++ T S G AA+A +T+ G +
Sbjct: 292 SRWDPSGD-----GSELWSSFASEGTIP--NSNDTHKSLAGEQTSAALAVKVTLAPGESL 344
Query: 356 SVTFSLAWDCPEVKFFEKVYH-RRYTKFYGTLGDSAARIAHDAILEHAKWECEIEAWQRP 414
+ ++WD P F V RRYT F+G+ G +AA +A +A+ + W +I+AWQ P
Sbjct: 345 EIPVVISWDLPVTAFATGVRDLRRYTDFHGSDGQAAAALAAEALRDWPDWREQIDAWQAP 404
Query: 415 ILEDKRFPEWYPITLFNELYYLNAGGTIWTDGSPPMQSLATIRERKFSLDTPSKIHTPAS 474
+L + PE + LFNELY L +GG++WT P
Sbjct: 405 VLAREDLPERLRMALFNELYDLASGGSLWTAARP-------------------------- 438
Query: 475 SDTALGTRLLENGEENIGQFLYLEGAEYVMYNTYDVHFYSSFALVMLFPKLELSIQRDFA 534
+ +G+F LE +Y Y + DV Y S AL+ L+P+L+ ++ R FA
Sbjct: 439 -------------GDPVGRFGVLECFDYAWYESLDVRLYGSLALLQLWPELDKAVLRSFA 485
Query: 535 AAVMMHDPGTMKI---MSDGKW---VARKCLGAVPHDIGL-DDPWFEINSYNLF-NSSRW 586
A+ D I + G+ RK A PHD+G ++ F+ +Y + + + W
Sbjct: 486 RAIPAADATPRPIGWYFTQGRGRVEAPRKRAAATPHDLGAPNESPFDATNYTAYQDCNLW 545
Query: 587 KDLNSKFVLQVYRDFVATG---DKNFARAVWPSVYIAMAYMEQFDKDGDGMIENEGFPDQ 643
KDL S +VLQV+R F+ + D +F WP+ A++Y+++FD + DG+ +N G PDQ
Sbjct: 546 KDLASDYVLQVWRTFLLSPNGEDLSFLAECWPAAVQALSYLKRFDVNHDGLPDNGGAPDQ 605
Query: 644 TYDAWSANGVSAYCGGLWVAALQAASA----LANDVGDHASAS--YFWVRYQKAKAVYDS 697
T+D W GVSAYCG LW+AAL+AA A L D+G + + F ++++A +D
Sbjct: 606 TFDDWPLQGVSAYCGALWIAALEAALAMAQRLQLDLGLNTAEEQHQFSGWLEQSRANFDR 665
Query: 698 -LWNGSYFNYDNSDGSSSTSIQADQLAGQWYARACGLLPIADEAKVKKALTKIYDFNVLK 756
LWNG Y+ D G+ + ADQL G +YAR GL + + + + +L + +
Sbjct: 666 LLWNGEYYKIDAESGTP--VVMADQLCGDFYARLLGLPSVVADERSRSSLNAVKEACFEG 723
Query: 757 VKGGMCGAMNGMQPDGR-IDMSGLQAREIWPGVTYGLAASMIQEEMVDMAFQTAAGVYEV 815
+GG G NG+ DG +D G E+W G+ +GLAA ++ A A V
Sbjct: 724 FEGGRLGVANGLCRDGMPLDPKGTHPLEVWTGINFGLAAYY---RLMGDATTATAICSAV 780
Query: 816 AWSEDGLGYSFQTPESWNNNDEYRSLCYMRPLTIWAMQWA 855
G G F+TPE+ YR+ Y+R + IWA+ WA
Sbjct: 781 VNQVYGGGLQFRTPEAITAVKTYRACHYLRAMAIWAL-WA 819
>gi|148670515|gb|EDL02462.1| glucosidase beta 2, isoform CRA_b [Mus musculus]
Length = 377
Score = 293 bits (750), Expect = 3e-76, Method: Compositional matrix adjust.
Identities = 150/310 (48%), Positives = 190/310 (61%), Gaps = 7/310 (2%)
Query: 499 GAEYVMYNTYDVHFYSSFALVMLFPKLELSIQRDFAAAVMMHDPGTMKIMSDGKWVARKC 558
G EY MYNTYDVHFY+SFALVML+PKLELS+Q D A A + D + + G K
Sbjct: 19 GQEYRMYNTYDVHFYASFALVMLWPKLELSLQYDMALATLKEDLTRRRYLMSGVVAPVKR 78
Query: 559 LGAVPHDIGL--DDPWFEINSYNLFNSSRWKDLNSKFVLQVYRDFVATGDKNFARAVWPS 616
+PHDIG D+PW +N+Y + +++ WKDLN KFVLQ+YRD+ TGD+ F +WP
Sbjct: 79 RNVIPHDIGDPDDEPWLRVNAYLIHDTADWKDLNLKFVLQIYRDYYLTGDQGFLEDMWPV 138
Query: 617 VYIAMAYMEQFDKDGDGMIENEGFPDQTYDAWSANGVSAYCGGLWVAALQAASALANDVG 676
M +FDKD DG+IEN G+ DQTYDAW G SAYCGGLW+AA+ +A G
Sbjct: 139 CLAVMESEMKFDKDQDGLIENGGYADQTYDAWVTTGPSAYCGGLWLAAVAVMVQMAVLCG 198
Query: 677 DHASASYFWVRYQKAKAVYDS-LWNGSYFNYDNSDGSSSTSIQADQLAGQWYARACGL-- 733
F + + Y+ LWNG Y+NYD+S S SI +DQ AGQW+ RACGL
Sbjct: 199 AQDVQERFASILCRGREAYERLLWNGRYYNYDSSSHPQSRSIMSDQCAGQWFLRACGLGE 258
Query: 734 --LPIADEAKVKKALTKIYDFNVLKVKGGMCGAMNGMQPDGRIDMSGLQAREIWPGVTYG 791
+ V +AL I++ NV GG GA+NGM P G D S +Q+ E+W GV YG
Sbjct: 259 GDTEVFPTLHVVRALQTIFELNVQAFAGGAMGAVNGMHPHGVPDRSSVQSDEVWVGVVYG 318
Query: 792 LAASMIQEEM 801
LAA+MIQE M
Sbjct: 319 LAATMIQEVM 328
>gi|33862646|ref|NP_894206.1| hypothetical protein PMT0373 [Prochlorococcus marinus str. MIT
9313]
gi|33634562|emb|CAE20548.1| conserved hypothetical protein [Prochlorococcus marinus str. MIT
9313]
Length = 837
Score = 292 bits (748), Expect = 5e-76, Method: Compositional matrix adjust.
Identities = 239/824 (29%), Positives = 382/824 (46%), Gaps = 114/824 (13%)
Query: 87 QGIPLGGIGAGSIGRSYRGEFQRFKLFHGICDDAPVLANQFSVFVSRPNGEKFSSVLCPR 146
G+PLGG GAG IGRS RG+F + L G + QF++F + + + ++
Sbjct: 55 HGMPLGGFGAGCIGRSPRGDFNLWNLDGGEHWQGSIPDCQFALFERQGDQVRAHALATAP 114
Query: 147 SPGVPKKNTDSGIESWDW----NLKGENCTYHALFPRAWTVYDGEPDPELRIVCRQISPF 202
+ D + SW W + + +Y A +P +WT Y G EL VC SP
Sbjct: 115 QTDDSQPELDKPLSSWSWYPASTEQRKTGSYSARYPLSWTHYKGVFAAEL--VCEAFSPI 172
Query: 203 IPHNYKESSFPASVFTFTLSNSGQTSADVTLLFTWANSVA-------------GDSGLSG 249
+P +Y+ +S+P +VF + L N S +++LL +W N+ D G
Sbjct: 173 LPGDYRRTSYPLAVFRWQLQNPTSQSLELSLLLSWRNTCGWFTNTDPSASVHFRDDGSPE 232
Query: 250 HHF----------NSKTMTKDGVHGLTLHHRTAN----GRPPVTFAVAAEETADVHVSEC 295
H + ++ + + G+ G+ + A+ G+ AV + V V C
Sbjct: 233 HSYVPAIGRGEGQCNRWIDQPGLIGVLMEGERADPLAEGQGQWCLAVP-DHLPGVEVMRC 291
Query: 296 PCFLLSGNSKGITAKDMWNEIKKHGSFDHLDNDKTSPSEPGSSIGAAIAASLTIPSGSTR 355
+ SG+ ++W+ G+ +++ S G AA+A +T+ G +
Sbjct: 292 SRWDPSGD-----GSELWSSFASEGTIP--NSNTAHKSLAGEQTSAALAVKVTLAPGESL 344
Query: 356 SVTFSLAWDCPEVKFFEKVYH-RRYTKFYGTLGDSAARIAHDAILEHAKWECEIEAWQRP 414
+ ++WD P F V RRYT F+G+ G +AA +A +A+ + + W +I+AWQ P
Sbjct: 345 EIPVVISWDLPVTAFATGVRDLRRYTDFHGSDGQAAAALAAEALSDWSDWREQIDAWQAP 404
Query: 415 ILEDKRFPEWYPITLFNELYYLNAGGTIWTDGSPPMQSLATIRERKFSLDTPSKIHTPAS 474
+L + PE + LFNELY L +GG++WT P
Sbjct: 405 VLAREDLPERLRMALFNELYDLASGGSLWTAARP-------------------------- 438
Query: 475 SDTALGTRLLENGEENIGQFLYLEGAEYVMYNTYDVHFYSSFALVMLFPKLELSIQRDFA 534
+ +G+F LE +Y Y + DV Y S AL+ L+P+L+ ++ R FA
Sbjct: 439 -------------GDPVGRFGVLECFDYAWYESLDVRLYGSLALLQLWPELDKAVLRSFA 485
Query: 535 AAVMMHDPGTMKI---MSDGKW---VARKCLGAVPHDIGL-DDPWFEINSYNLF-NSSRW 586
A+ D I + G+ RK A PHD+G ++ F+ +Y + + + W
Sbjct: 486 RAIPAADATPRPIGWYFTQGRGRVEAPRKRAAATPHDLGAPNESPFDATNYTAYQDCNLW 545
Query: 587 KDLNSKFVLQVYRDFVATG---DKNFARAVWPSVYIAMAYMEQFDKDGDGMIENEGFPDQ 643
KDL S +VLQV+R F+ + D +F WP+V A++Y+++FD + DG+ +N G PDQ
Sbjct: 546 KDLASDYVLQVWRTFLLSPNGEDLSFLAECWPAVVQALSYLKRFDVNHDGLPDNGGAPDQ 605
Query: 644 TYDAWSANGVSAYCGGLWVAA----------LQAASALANDVGDHASASYFWVRYQKAKA 693
T+D W GVSAYCG LW+AA LQ L H + + ++++
Sbjct: 606 TFDDWPLQGVSAYCGALWIAALEAALAMAQRLQLDLGLNTAEEQHQFSGWL----EQSRV 661
Query: 694 VYDS-LWNGSYFNYDNSDGSSSTSIQADQLAGQWYARACGLLPIADEAKVKKALTKIYDF 752
+D LWNG Y+ D G+ + ADQL G +YAR GL + + + + +L + +
Sbjct: 662 NFDRLLWNGEYYKIDAESGTP--VVMADQLCGDFYARLLGLPSVVADERSRSSLNAVKEA 719
Query: 753 NVLKVKGGMCGAMNGMQPDGR-IDMSGLQAREIWPGVTYGLAASMIQEEMVDMAFQTAAG 811
+GG G NG+ DG +D G E+W G+ +GLAA ++ A A
Sbjct: 720 CFEGFEGGRLGVANGLCRDGMPLDPKGTHPLEVWTGINFGLAAYY---RLMGDATTATAI 776
Query: 812 VYEVAWSEDGLGYSFQTPESWNNNDEYRSLCYMRPLTIWAMQWA 855
V G G F+TPE+ YR+ Y+R + IWA+ WA
Sbjct: 777 CSAVVNQVYGGGLQFRTPEAITAVKTYRACHYLRAMAIWAL-WA 819
>gi|255073149|ref|XP_002500249.1| hypothetical protein MICPUN_55788 [Micromonas sp. RCC299]
gi|226515511|gb|ACO61507.1| hypothetical protein MICPUN_55788 [Micromonas sp. RCC299]
Length = 954
Score = 288 bits (736), Expect = 1e-74, Method: Compositional matrix adjust.
Identities = 257/881 (29%), Positives = 387/881 (43%), Gaps = 122/881 (13%)
Query: 88 GIPLGGIGAGSIGRSYRGEFQRFKLFHGICDDAPVLANQFSVFVSRPNGEKFSSVLCPRS 147
G P+GG+GAG++GR+YRG+F R+ + G P ++F +R +G P
Sbjct: 63 GAPIGGMGAGTMGRTYRGDFARWHMRVGGHTHRPA---AHTLFAARVDGTATVLSALPSH 119
Query: 148 P---GVPKKN-TDSGIESWDWNLKGENCTYHALFPRAWTVYDGE---PDPELRIVCRQIS 200
GVP +D G TYHAL+PR+W YD P +R+ Q S
Sbjct: 120 AVMEGVPGGGGSDRTHSRRVLPPDGTGGTYHALYPRSWYEYDPSALAPGGRVRLSQTQFS 179
Query: 201 PFIPHNYKESSFPASVFTFTLSNSGQTSADVTLLFTWANSVA-------------GDSGL 247
P IP Y+++S P VF F SN G ADV L F++ N +A G L
Sbjct: 180 PVIPGEYRDASMPVGVFRFLASNEGDVPADVALAFSFENVLATTDVAPVDVGPPRGAWRL 239
Query: 248 SGHHFNSKTMTKDGVHGLTLHHRTAN----GRPPVT-----FAVAAEETADVHVSECPCF 298
+ H T + G +H T + R P AVA ++
Sbjct: 240 APHELTHSTFERSGGRVAGVHMHTDDVVYGTRSPSEPWHGGIAVAVGGNGELGGLNVTTL 299
Query: 299 LL--SGNSKGITAKDMWNEIKKHGSFDH------LDNDKTSPSEPGSSIGAAIAASLTIP 350
L +G++ I +K W ++ G F ++D P+ P S+ A++A+ T+
Sbjct: 300 PLYDAGDADSIESK-FWRPFERAGGFGRDESAAANEHDGGLPAGPLSAC--AVSAAFTLA 356
Query: 351 SGSTRSVTFSLAWDCPEVKFFEKV-YHRRYTKFYGTL--------GDSAARIAHDAILEH 401
G R+VTF+ AWD P F V + +RYT+++G+ G +A IA +A+L
Sbjct: 357 PGERRNVTFAAAWDLPASSFPRGVRWAKRYTRYHGSFIHRRDVPRGAAAPDIATEALLRA 416
Query: 402 AKWECEIEAWQRPILEDK----------------------------RFPEWYPITLFNEL 433
+WE +I WQR ++ R P W +FNEL
Sbjct: 417 DEWEGDIARWQRGFVQSAAAKDANLDDPAGGRGRNYPGRVRDRPTLRRPAWLVGAMFNEL 476
Query: 434 YYLNAGGTIWTDGSPPMQSLATIRERKFSLDTPSKIHTPASSDTALGTRLLENGEE---N 490
YYL GGT+W + E + S P + + NG E
Sbjct: 477 YYLVDGGTLWGRPLTVAGLEGDVEESEASAAAPGGWRG-EDGEGGDAQDVAGNGGEWGGT 535
Query: 491 IGQFLYLEGAEYVMYNTYDVHFYSSFALVMLFPKLELSIQRDFAAAVMMHDPGTMKIMSD 550
+G+F LE + +YN DVHFY S+ L ML+P L+L++ D AA V D + +
Sbjct: 536 LGRFGLLESFDAPVYNALDVHFYGSWPLAMLWPGLDLAVLADLAAGVDSEDDEERALSWN 595
Query: 551 GKWVAR-------KCLGAVPHDIG--LDDPWFEI-NSYNLFNSSRWKDLNSKFVLQVYRD 600
+ R K G+ P D+G D P N+++L + +RWKDL K +L + R
Sbjct: 596 ARDAGRAPEMRKRKVRGSAPRDLGAPFDAPLTSSPNAFDLTDVNRWKDLAPKLMLTLARA 655
Query: 601 FVATGD-----------KNFARAVWPSVYIAMA-YMEQFDKDGDGMIE----NEGFPDQT 644
G ++ R ++ Y ++A + +FD +GDG+IE +E PD +
Sbjct: 656 HALRGGFRGPNDPGGIPRDVLRRMFRPCYESLAAQLRRFDGNGDGLIEHDDDDEAGPDHS 715
Query: 645 YDAWSANG-VSAYCGGLWVAALQAASALANDVGD-HASASYFWVRYQKAKAVYDSLWNG- 701
+ AW A G SAY GGLW+AAL+A + LA D+ + HA S A A +LW G
Sbjct: 716 FVAWRARGDTSAYAGGLWLAALRAGAGLARDLNETHARDSLEQTMRAAAVAHDAALWLGD 775
Query: 702 -----SYFNYDNSDGSSSTSIQADQLAGQWYARACGLLPIADEAKVKKALTKIYDFNVLK 756
Y+ +D S A Q+ G+W G + KV+ AL + NV
Sbjct: 776 DDGGRGYYRFDASGTEGGDVSSAAQVMGEWALAVIGARGVLPARKVRAALASVARLNVAA 835
Query: 757 VKGGMCGAMNGMQPDGRIDMSGLQAREIWPGVTYGLAASMIQEEMVDMAFQTAAGVYEVA 816
G G +D + E+WPGV+Y +A+ M+ M D A++ A GV
Sbjct: 836 AGGAGGAVNGAAA-SGSVDAVSAHSNEVWPGVSYAVASHMLLAGMDDEAWELARGVAAGT 894
Query: 817 WSEDGLGYSFQTPESWNNNDEYRSLCYMRPLTIWAMQWALT 857
+ G +F+TPE+W+ +RS R ++WA++ ALT
Sbjct: 895 YER---GLAFRTPEAWDARGGFRSAMSQRAGSVWAIEHALT 932
>gi|312382731|gb|EFR28085.1| hypothetical protein AND_04399 [Anopheles darlingi]
Length = 777
Score = 285 bits (730), Expect = 6e-74, Method: Compositional matrix adjust.
Identities = 154/445 (34%), Positives = 240/445 (53%), Gaps = 46/445 (10%)
Query: 161 SWDWNLKGENCTYHALFPRAWTVYDGEPDPELRIVCRQISPFIPHNYKESSFPASVFTFT 220
SW+ L C+Y AL+PRAW+ YD + +R+V RQISP IPH+YKESS P +VF +T
Sbjct: 37 SWESALDASRCSYTALYPRAWSEYDLS-EYGVRLVQRQISPIIPHDYKESSMPCAVFVWT 95
Query: 221 LSNSGQTSADVTLLFTWANSVAGDSGLSGHHFNSKTMTKDGVHGLTLHHRTANGRPPVTF 280
+ N VT+ FT+ N + + T+ G+++ A+ T+
Sbjct: 96 VENVCDRDRQVTITFTFKNGTGTKKQDAEGGSETGAFTQGSARGVSIRQTIAD--MACTY 153
Query: 281 AVAAEETADVHVSECPCFLLSGNSKGITAKDMWNEIKKHGSFDHLDNDKTSPSEPGSSIG 340
++ +++++++ C F SG T + +WN++K+HG HL + +
Sbjct: 154 CLSCRSSSEINLTRCERFDPSG-----TGEKLWNDLKEHG---HLTEQSVDETVKTKDVA 205
Query: 341 AAIAASLTIPSGSTRSVTFSLAWDCPEVKFFE--KVYHRRYTKFYGTLGDSAARIAHDAI 398
A++ + + G+T + F+L WD P+V F + K YHR YTK++G GD+ ++ A+
Sbjct: 206 VAVSGQILVQPGTTSELQFALVWDMPKVHFQKRGKEYHRYYTKYFGKTGDAGPTMSDYAL 265
Query: 399 LEHAKWECEIEAWQRPILEDKRFPEWYPITLFNELYYLNAGGTIWTDGSPPMQSLATIRE 458
+ KWE I+ WQRPIL+D P+WY +FNELY++ GG++W T+ +
Sbjct: 266 TNYGKWERLIDEWQRPILDDADLPDWYKSAIFNELYFIADGGSVWL----------TMED 315
Query: 459 RKFSLDTPSKIHTPASSDTALGTRLLENGEENIGQFLYLEGAEYVMYNTYDVHFYSSFAL 518
D P + G++ YLEG EY MY TYDVHFY+S AL
Sbjct: 316 SDLPYDDPRLAY---------------------GRYSYLEGHEYRMYTTYDVHFYASHAL 354
Query: 519 VMLFPKLELSIQRDFAAAVMMHDPGTMKIMSDGKWVARKCLGAVPHDIG--LDDPWFEIN 576
L+P L++SIQ D+ ++ K + DGK + RK +VPHD+G ++P+ IN
Sbjct: 355 ASLWPNLQVSIQYDYKDSIGREISEGRKHLYDGKVIPRKIQNSVPHDLGDPAEEPFDLIN 414
Query: 577 SYNLFNSSRWKDLNSKFVLQVYRDF 601
+Y + + S W+DLN KF+LQVYRD+
Sbjct: 415 AYPIHDVSEWRDLNIKFILQVYRDY 439
Score = 208 bits (530), Expect = 1e-50, Method: Compositional matrix adjust.
Identities = 108/263 (41%), Positives = 161/263 (61%), Gaps = 11/263 (4%)
Query: 609 FARAVWPSVYIAMAYMEQFDKDGDGMIENEGFPDQTYDAWSANGVSAYCGGLWVAALQAA 668
+ +A++P+ + + ++DKD DG+IEN PDQTYD+W +G SAYCGGLW+A+L
Sbjct: 517 YLKAMYPACKLVLERSREWDKDDDGLIENSRSPDQTYDSWVMDGPSAYCGGLWLASLHCM 576
Query: 669 SALANDVGDHASASYFWVRYQKAKAVYDS-LWNGSYFNYDNSDGSSSTSIQADQLAGQWY 727
+ +A+ + A+ + K + ++ LWNG+Y+ +D + +S SI +DQL G WY
Sbjct: 577 ATMASLLDQTEDAANYQAILDKGRRSFEEKLWNGTYYKFD-AQSASKNSIMSDQLCGHWY 635
Query: 728 ARACGL-LPIADEAKVKKALTKIYDFNVLKVKGGMCGAMNGMQP------DGRIDMSGLQ 780
RACG + + V+ A+ IY+ NV++ GG GA+NG P +GR D+S +Q
Sbjct: 636 LRACGFDYDVFPKENVRLAMRTIYENNVMRFCGGNLGAVNGYVPNAQPNKEGRPDVSNIQ 695
Query: 781 AREIWPGVTYGLAASMIQEEMVDMAFQTAAGVYEVAWSEDGLGYSFQTPESWNNNDEYRS 840
E+W GVTY LAA+MI E M + AFQTA G+Y + +G +F+TPE+ YR+
Sbjct: 696 GEEVWTGVTYALAATMIHEGMFEEAFQTAGGLYRSL--SEKIGMNFETPEAVYAERHYRA 753
Query: 841 LCYMRPLTIWAMQWALTKPKLSR 863
+ YMRPL+IW+MQ A KL R
Sbjct: 754 IGYMRPLSIWSMQTAWELRKLRR 776
>gi|345293667|gb|AEN83325.1| AT5G49900-like protein, partial [Capsella rubella]
gi|345293669|gb|AEN83326.1| AT5G49900-like protein, partial [Capsella rubella]
gi|345293671|gb|AEN83327.1| AT5G49900-like protein, partial [Capsella rubella]
gi|345293673|gb|AEN83328.1| AT5G49900-like protein, partial [Capsella rubella]
gi|345293675|gb|AEN83329.1| AT5G49900-like protein, partial [Capsella rubella]
Length = 156
Score = 279 bits (714), Expect = 5e-72, Method: Compositional matrix adjust.
Identities = 127/156 (81%), Positives = 141/156 (90%)
Query: 502 YVMYNTYDVHFYSSFALVMLFPKLELSIQRDFAAAVMMHDPGTMKIMSDGKWVARKCLGA 561
Y M+NTYDVHFY+SFALVMLFPKLELSIQRDFAAAVM+HDP +K +S+G+WV RK LGA
Sbjct: 1 YRMWNTYDVHFYASFALVMLFPKLELSIQRDFAAAVMLHDPTKVKTLSEGQWVQRKVLGA 60
Query: 562 VPHDIGLDDPWFEINSYNLFNSSRWKDLNSKFVLQVYRDFVATGDKNFARAVWPSVYIAM 621
VPHD+G++DPWFE+N YNL N+ RWKDLN KFVLQVYRD VATGDK FA AVWPSVY+AM
Sbjct: 61 VPHDLGINDPWFEVNGYNLHNTDRWKDLNPKFVLQVYRDVVATGDKKFAVAVWPSVYVAM 120
Query: 622 AYMEQFDKDGDGMIENEGFPDQTYDAWSANGVSAYC 657
AYM QFDKDGDGMIENEGFPDQTYD WSA+GVSAYC
Sbjct: 121 AYMAQFDKDGDGMIENEGFPDQTYDTWSASGVSAYC 156
>gi|345293661|gb|AEN83322.1| AT5G49900-like protein, partial [Capsella grandiflora]
gi|345293663|gb|AEN83323.1| AT5G49900-like protein, partial [Capsella grandiflora]
gi|345293665|gb|AEN83324.1| AT5G49900-like protein, partial [Capsella grandiflora]
Length = 156
Score = 277 bits (708), Expect = 2e-71, Method: Compositional matrix adjust.
Identities = 126/156 (80%), Positives = 141/156 (90%)
Query: 502 YVMYNTYDVHFYSSFALVMLFPKLELSIQRDFAAAVMMHDPGTMKIMSDGKWVARKCLGA 561
Y M+NTYDVHFY+SFALVMLFPKLELSIQRDFAAAVM+HDP +K +S+G+WV RK LGA
Sbjct: 1 YRMWNTYDVHFYASFALVMLFPKLELSIQRDFAAAVMLHDPTKVKTLSEGQWVQRKVLGA 60
Query: 562 VPHDIGLDDPWFEINSYNLFNSSRWKDLNSKFVLQVYRDFVATGDKNFARAVWPSVYIAM 621
VPHD+G++DPWFE+N Y+L N+ RWKDLN KFVLQVYRD VATGDK FA AVWPSVY+AM
Sbjct: 61 VPHDLGINDPWFEVNGYSLHNTDRWKDLNPKFVLQVYRDVVATGDKKFAVAVWPSVYVAM 120
Query: 622 AYMEQFDKDGDGMIENEGFPDQTYDAWSANGVSAYC 657
AYM QFDKDGDGMIENEGFPDQTYD WSA+GVSAYC
Sbjct: 121 AYMAQFDKDGDGMIENEGFPDQTYDTWSASGVSAYC 156
>gi|449674846|ref|XP_002159451.2| PREDICTED: non-lysosomal glucosylceramidase-like, partial [Hydra
magnipapillata]
Length = 751
Score = 276 bits (707), Expect = 3e-71, Method: Compositional matrix adjust.
Identities = 167/446 (37%), Positives = 240/446 (53%), Gaps = 36/446 (8%)
Query: 262 VHGLTLHHRTANGRPPVTFAVAAEETADVHVSECPCFLLSGNSKGITAKDMWNEIKKHGS 321
V G+ + H R T A+AA + + A ++W ++ G
Sbjct: 336 VRGVLMKHNALKKR--FTLAIAALQVIHNTIDVVQVSTKVAFDAQSPAAELWEDLNDDGV 393
Query: 322 FDHLDNDKTSPSEPGSSIGAAIAASLTIPSGSTRSVTFSLAWDCPEVKFFEKV---YHRR 378
D+ D + S PG + AA+AA T+ +G+ + + F L+WD P V F Y RR
Sbjct: 394 LDNKRGD-SCYSYPGQLLAAAVAAKTTVHAGAKQDLKFVLSWDQPFVTFGSDSGLSYCRR 452
Query: 379 YTKFYGTLGDSAARIAHDAILEHAKWECEIEAWQRPILEDKRFPEWYPITLFNELYYLNA 438
YT+F+G+ G+++ + A+ WE IE WQ PIL D P W+ LFNELY+++
Sbjct: 453 YTRFFGSNGNASPSLCCYALQNIDFWEDNIERWQNPILNDSELPAWFKSALFNELYFISD 512
Query: 439 GGTIWTDGSPPMQSLATIRERKFSLDTPSKIHTPASSDTALGTRLLENGEENIGQFLYLE 498
GGT+W D I+ +K S H SD + G+F YLE
Sbjct: 513 GGTVWLD----------IKRKKDS-------HGNFISDLV----------KLYGRFAYLE 545
Query: 499 GAEYVMYNTYDVHFYSSFALVMLFPKLELSIQRDFAAAVMMHDPGTMKIMSDGKWVARKC 558
G EY MYNTYDVHFY+S+AL +L+P L++S+Q D +V+ D IM +G RK
Sbjct: 546 GHEYRMYNTYDVHFYASWALALLWPNLQISMQYDICNSVLNVDDEKRPIMMEGGIAPRKV 605
Query: 559 LGAVPHDIGL--DDPWFEINSYNLFNSSRWKDLNSKFVLQVYRDFVATGDKNFARAVWPS 616
+G VPHD+G D+P+ +N+Y + ++S+WKDLN FVLQVYRDF T +K F +WP+
Sbjct: 606 IGCVPHDLGDPDDEPFVRVNAYCIHDTSQWKDLNLHFVLQVYRDFYITKNKQFLMDMWPA 665
Query: 617 VYIAMAYMEQFDKDGDGMIENEGFPDQTYDAWSANGVSAYCGGLWVAALQAASALANDVG 676
+ AMA+ D DGDG+IEN G D T+D W G SAY GGLW++AL+ +A+ +G
Sbjct: 666 MKAAMAHSLAQDIDGDGIIENSGIADSTFDTWVVTGPSAYTGGLWLSALRCIIEIAHILG 725
Query: 677 DHASASYFWVRYQKAKAVYD-SLWNG 701
S S + ++ K Y+ LWNG
Sbjct: 726 LSQSISKYKSILERGKKSYEKKLWNG 751
Score = 146 bits (369), Expect = 5e-32, Method: Compositional matrix adjust.
Identities = 77/214 (35%), Positives = 118/214 (55%), Gaps = 3/214 (1%)
Query: 61 RYSKQEEAKGKVPVFNVFRKHHITSDQGIPLGGIGAGSIGRSYRGEFQRFKLFHGICDDA 120
+++++ + KGK+P + + G+PLGGIGAGSIGR ++G+F R++L GI
Sbjct: 2 KFTRKVKKKGKLPFIDPYAVSACKQIYGVPLGGIGAGSIGRGWKGDFNRWQLRPGIYTYD 61
Query: 121 PVLANQFSVFVSRPNGEKFSSVLCPRSPGVPKKNTDSGIESWDWNLKGENCTYHALFPRA 180
V NQF+V + + + VL P P V KN +S ++SW W+ G YHAL+PRA
Sbjct: 62 TVDVNQFTVCIRKCGKTIYQQVLAPYYP-VAIKNVES-LQSWTWSFDGGRGYYHALYPRA 119
Query: 181 WTVYDGEPDPELRIVCRQISPFIPHNYKESSFPASVFTFTLSNSGQTSADVTLLFTWANS 240
WTVY + +++ CRQ+SP P NYK++S P VF +T+ N G DV+++ T+ N
Sbjct: 120 WTVYH-ITEHNIKLTCRQVSPIFPDNYKDTSLPVGVFVWTIENIGMHDLDVSIMLTFQNG 178
Query: 241 VAGDSGLSGHHFNSKTMTKDGVHGLTLHHRTANG 274
+ G H N + K + L +N
Sbjct: 179 DGTNEDCVGGHNNEEFFCKGVYSEIRLQKDESNA 212
>gi|7470943|pir||T31071 conserved hypothetical protein - Anabaena sp. (fragment)
gi|2828799|gb|AAC38131.1|AAC38131 unknown [Nostoc sp. PCC 7120]
Length = 439
Score = 266 bits (679), Expect = 6e-68, Method: Compositional matrix adjust.
Identities = 172/476 (36%), Positives = 243/476 (51%), Gaps = 67/476 (14%)
Query: 400 EHAKWECEIEAWQRPILEDKRFPEWYPITLFNELYYLNAGGTIWTDGSPPMQSLATIRER 459
++ W+ +IE+WQ PIL P+W+ + LFNELY L +GGT+W+ +P
Sbjct: 3 QYKTWQQQIESWQDPILNRPDLPDWFKMALFNELYDLTSGGTLWSAATPC---------- 52
Query: 460 KFSLDTPSKIHTPASSDTALGTRLLENGEENIGQFLYLEGAEYVMYNTYDVHFYSSFALV 519
+ IGQF LE +Y Y + DV Y SF L+
Sbjct: 53 -----------------------------DPIGQFAVLECLDYRWYESLDVRLYGSFGLL 83
Query: 520 MLFPKLELSIQRDFAAAVMMHD--PGTM----KIMSDGKWVARKCLGAVPHDIGLDDP-- 571
LFP+LE ++ R FA A+ D P + I ++ RK GA PHD+G +
Sbjct: 84 QLFPELEKAVMRAFARAIPQGDDTPRVIGYYYTIGAESPIAVRKTPGATPHDLGAPNEHV 143
Query: 572 WFEINSYNLFNSSRWKDLNSKFVLQVYRDFVATG--DKNFARAVWPSVYIAMAYMEQFDK 629
W + N + + + WKDL FVLQVYRDF+ TG D F R W ++ + Y++ FD
Sbjct: 144 WEKTNYTSYQDCNLWKDLGCDFVLQVYRDFLLTGADDVQFLRDCWDAIVETLDYVKTFDL 203
Query: 630 DGDGMIENEGFPDQTYDAWSANGVSAYCGGLWVAALQAASALANDV--GDHASASYFWVR 687
DGDG+ EN G PDQT+D W GVSAYCGGLW+AAL AA A+++ + SA+ +
Sbjct: 204 DGDGIPENSGAPDQTFDDWRLQGVSAYCGGLWLAALAAAIAISDILLQNHQDSATKGHLL 263
Query: 688 YQKAK---------AVY-DSLWNGSYFNYDNSDGSSSTSIQADQLAGQWYARACGLLPIA 737
YQK+ +Y + LWNG Y+ D+ GS + ADQL GQ+YA L I
Sbjct: 264 YQKSTYETWLTKSLPIYQEKLWNGKYYRLDSESGSD--VVMADQLCGQFYANLLELPDIV 321
Query: 738 DEAKVKKALTKIYDFNVLKVKGGMCGAMNGMQPDGRIDM-SGLQAREIWPGVTYGLAASM 796
+ AL +YD LK G GA NG++PDG + E+W G+ +GLAA +
Sbjct: 322 PSDRAISALQTVYDACFLKFYDGQFGAANGVRPDGSPENPKATHPLEVWTGINFGLAAFL 381
Query: 797 IQEEMVDMAFQTAAGVYEVAWSEDGLGYSFQTPESWNNNDEYRSLCYMRPLTIWAM 852
+Q M D F+ V E ++ G F+TPE+ +R+ Y+RP+ IWA+
Sbjct: 382 VQMGMKDEGFRLTQAVVEQIYNN---GLQFRTPEAITAAGTFRASTYLRPMAIWAI 434
>gi|12005896|gb|AAG44660.1|AF258662_1 AD035 [Homo sapiens]
Length = 367
Score = 245 bits (626), Expect = 8e-62, Method: Compositional matrix adjust.
Identities = 153/392 (39%), Positives = 213/392 (54%), Gaps = 43/392 (10%)
Query: 234 LFTWANSVAGDSGLSGHHFNSK-TMTKDG--VHGLTLHHRTANGRPPVTFAVAAEETADV 290
+F+ N + G G +N + + G V GL LHH T P T AVAA TA
Sbjct: 1 MFSMRNGLGGGDDAPGGLWNEPFCLERSGETVRGLLLHHPTLPN--PYTMAVAARVTAAT 58
Query: 291 HVSECPCFLLSGNSKGITAKDMWNEIKKHGSFDHLDNDKTSPSEPGSSIGAAIAASLTI- 349
V+ F T + +W ++ + G D +++P++ G I A+ S +
Sbjct: 59 TVTHITAF-----DPDSTGQQVWQDLLQDGQLDS-PTGQSTPTQKGVGIAGAVCVSSKLR 112
Query: 350 PSGSTRSVTFSLAWDCPEVKFFEK--VYHRRYTKFYGTLGDSAARIAHDAILEHAKWECE 407
P G R + FSLAWD P + F K V++RRYT+F+G GD+A ++H A+ +A+WE
Sbjct: 113 PRGQCR-LEFSLAWDMPRIMFGAKGQVHYRRYTRFFGQDGDAAPALSHYALCRYAEWEER 171
Query: 408 IEAWQRPILEDKRFPEWYPITLFNELYYLNAGGTIWTDGSPPMQSLATIRERKFSLDTPS 467
I AWQ P+L+D+ P WY LFNELY+L GGT+W L + +
Sbjct: 172 ISAWQSPVLDDRSLPAWYKSALFNELYFLADGGTVW---------LEVLED--------- 213
Query: 468 KIHTPASSDTALGTRL--LENGEENIGQFLYLEGAEYVMYNTYDVHFYSSFALVMLFPKL 525
S LG + L + G+F YLEG EY MYNTYDVHFY+SFAL+ML+PKL
Sbjct: 214 ------SLPEELGRNMCHLRPTLRDYGRFGYLEGQEYRMYNTYDVHFYASFALIMLWPKL 267
Query: 526 ELSIQRDFAAAVMMHDPGTMKIMSDGKWVARKCLGAVPHDIG--LDDPWFEINSYNLFNS 583
ELS+Q D A A + D + + G K +PHDIG D+PW +N+Y + ++
Sbjct: 268 ELSLQYDMALATLREDLTRRRYLMSGVMAPVKRRNVIPHDIGDPDDEPWLRVNAYLIHDT 327
Query: 584 SRWKDLNSKFVLQVYRDFVATGDKNFARAVWP 615
+ WKDLN KFVLQVYRD+ TGD+NF + +WP
Sbjct: 328 ADWKDLNLKFVLQVYRDYYLTGDQNFLKDMWP 359
>gi|288901049|gb|ADC68237.1| hypothetical protein [Dendrobium nobile]
Length = 207
Score = 234 bits (597), Expect = 2e-58, Method: Compositional matrix adjust.
Identities = 120/201 (59%), Positives = 157/201 (78%), Gaps = 1/201 (0%)
Query: 606 DKNFARAVWPSVYIAMAYMEQFDKDGDGMIENEGFPDQTYDAWSANGVSAYCGGLWVAAL 665
D F R VWP+V AM YM QFD+DGDG+IEN+GFPDQTYDAW+ +G+SAYCG LW+A+L
Sbjct: 6 DMAFGREVWPAVCAAMDYMNQFDRDGDGLIENDGFPDQTYDAWTVHGISAYCGCLWLASL 65
Query: 666 QAASALANDVGDHASASYFWVRYQKAKAVYDS-LWNGSYFNYDNSDGSSSTSIQADQLAG 724
QAA+A+A+ +GDHA A +++ AK ++S LWNGSYFNYD+ + S+S SIQADQLAG
Sbjct: 66 QAAAAMAHRLGDHAYAEKCTIKFLTAKHAFESKLWNGSYFNYDSGNSSNSKSIQADQLAG 125
Query: 725 QWYARACGLLPIADEAKVKKALTKIYDFNVLKVKGGMCGAMNGMQPDGRIDMSGLQAREI 784
QWYA + GL P+ DE K++ L KI+DFNV+KV+GG GA+NGM P+G++D S +Q+REI
Sbjct: 126 QWYAASSGLPPLFDENKIRSTLQKIFDFNVMKVRGGRIGAVNGMHPNGKVDESCMQSREI 185
Query: 785 WPGVTYGLAASMIQEEMVDMA 805
W GVTY L+A+MI M + A
Sbjct: 186 WTGVTYALSATMILAGMEEQA 206
>gi|85857488|gb|ABC86280.1| RE08574p [Drosophila melanogaster]
Length = 544
Score = 227 bits (579), Expect = 2e-56, Method: Compositional matrix adjust.
Identities = 133/434 (30%), Positives = 211/434 (48%), Gaps = 64/434 (14%)
Query: 103 YRGEFQRFKLFHGICDDAPVLANQFSVFVSRPNGEKFSSVLCPRSPGVPKKNTDSG---- 158
Y GEF RF++ GI + VLANQF V + P G L + K + G
Sbjct: 132 YAGEFCRFQMRPGIYEYNVVLANQFIVTIKDPKGCTIFQSLLSKCSTRDKTSDPDGDPDG 191
Query: 159 -----------------IESWDWNLKGENCTYHALFPRAWTVYDGEPDPELRIVCRQISP 201
+ +W N++ C+Y L+PR+WT YD +R+ CRQ+SP
Sbjct: 192 ERTKCQLPNCSSRAKQPLSAWHSNIEDTRCSYTGLYPRSWTEYDLS-HYGVRLTCRQVSP 250
Query: 202 FIPHNYKESSFPASVFTFTLSNSGQTSADVTLLFTWANSVAGDSGLSGHHFNSKTMTKDG 261
IPH Y+ESS P +VF +++ N V++ FT+ N + S+ +++
Sbjct: 251 VIPHEYRESSLPCAVFVWSVENVCDQERKVSITFTFKNGTGNKKQDAEGGAESQLISEGN 310
Query: 262 VHGLTLHHRTANGRPPVTFAVAAEETADVHVSECPCFLLSGNSKGITAKDMWNEIKKHGS 321
G+++ + + P ++ +A ++ ++ CP F +GN + +W ++K+HG
Sbjct: 311 AKGVSIRQKISEM--PCSYNLACRVLPEISITRCPQFDPAGN-----GEQLWAQLKEHG- 362
Query: 322 FDHLDNDKTSPSEPGSSIGAAIAASLTIPSGSTRSVTFSLAWDCPEVKFFEKV--YHRRY 379
L TS + IG A+ + + ++ + F LAWD P+++F K+ + R Y
Sbjct: 363 --QLSEHPTSEALKTKDIGVAVCGQVALKPMASHDLEFVLAWDMPKIQFPRKMQTHTRYY 420
Query: 380 TKFYGTLGDSAARIAHDAILEHAKWECEIEAWQRPILEDKRFPEWYPITLFNELYYLNAG 439
TK++ GDS RI A+ +++ WE I+AWQRPIL D+ P+WY +FN+LY+++ G
Sbjct: 421 TKYFDDSGDSGPRICEYALRQYSTWERLIDAWQRPILNDETLPDWYKCAIFNQLYFISDG 480
Query: 440 GTIWTDGSPPMQSLATIRERKFSLDTPSKIHTPASSDTALGTRL-LENGEENIGQFLYLE 498
GTIW D++LG L ++ G+F YLE
Sbjct: 481 GTIWL-----------------------------KCDSSLGKELAYDDPRLAYGRFGYLE 511
Query: 499 GAEYVMYNTYDVHF 512
G EY MYNTYDVHF
Sbjct: 512 GHEYRMYNTYDVHF 525
>gi|147788373|emb|CAN61188.1| hypothetical protein VITISV_019327 [Vitis vinifera]
Length = 550
Score = 221 bits (564), Expect = 1e-54, Method: Compositional matrix adjust.
Identities = 100/217 (46%), Positives = 150/217 (69%), Gaps = 3/217 (1%)
Query: 19 DPGQPPQLTWQRKLDSNVKVPTGFTLSFQEFRHLAHIGFRLYRYSKQEEAKGKVPVFNVF 78
D PP+ W+R+L+S+ + F+++F E + +G RL+ Y ++E ++G+ + F
Sbjct: 31 DSAAPPEQAWRRRLNSHANILKEFSVTFTEAIKMIRLGIRLWSYIREEASQGRKAPIDPF 90
Query: 79 RKHHI--TSDQGIPLGGIGAGSIGRSYRGEFQRFKLFHGICDDAPVLANQFSVFVSRPNG 136
+ ++ QG+PLGG+G+GSI R +RGEF+ +++ G CD +P++ANQFS+F+SR G
Sbjct: 91 TRETCKPSASQGVPLGGMGSGSISRGFRGEFRHWQIVPGTCDASPIMANQFSIFISREGG 150
Query: 137 -EKFSSVLCPRSPGVPKKNTDSGIESWDWNLKGENCTYHALFPRAWTVYDGEPDPELRIV 195
+K++SVL P K+ D GI SW WNL G++ TYHALFPRAWT+YDGEPDPEL++
Sbjct: 151 NKKYASVLAPGQHEGLGKSGDQGISSWGWNLSGQHSTYHALFPRAWTIYDGEPDPELKVS 210
Query: 196 CRQISPFIPHNYKESSFPASVFTFTLSNSGQTSADVT 232
CRQISPFIPHNY++SS P +VF +T + + S+ T
Sbjct: 211 CRQISPFIPHNYRDSSLPTAVFVYTPLTTFEVSSKFT 247
>gi|444729905|gb|ELW70308.1| Non-lysosomal glucosylceramidase [Tupaia chinensis]
Length = 279
Score = 216 bits (549), Expect = 5e-53, Method: Compositional matrix adjust.
Identities = 115/240 (47%), Positives = 145/240 (60%), Gaps = 10/240 (4%)
Query: 621 MAYME---QFDKDGDGMIENEGFPDQTYDAWSANGVSAYCGGLWVAALQAASALANDVGD 677
MA ME +FDKD DG+IEN G+ DQTYD W G SAYCGGLW+AA+ +A G
Sbjct: 1 MAVMESEMKFDKDHDGLIENGGYADQTYDGWVTTGPSAYCGGLWLAAVAVMVQMAALCGA 60
Query: 678 HASASYFWVRYQKAKAVYDSL-WNGSYFNYDNSDGSSSTSIQADQLAGQWYARACGL--- 733
F + + Y+ L WNG Y+NYD+S S SI +DQ AGQW+ +ACGL
Sbjct: 61 QDVQEKFTSILTRGQEAYERLLWNGRYYNYDSSSQPQSRSIMSDQCAGQWFLKACGLGEG 120
Query: 734 -LPIADEAKVKKALTKIYDFNVLKVKGGMCGAMNGMQPDGRIDMSGLQAREIWPGVTYGL 792
+ V +AL I++FNV GG GA+NGMQP G D S +Q+ E+W GV YGL
Sbjct: 121 DTEVFPTPHVVRALQTIFEFNVQAFAGGAMGAVNGMQPHGVPDRSSVQSDEVWVGVVYGL 180
Query: 793 AASMIQEEMVDMAFQTAAGVYEVAWSEDGLGYSFQTPESWNNNDEYRSLCYMRPLTIWAM 852
AA+MIQE + F+TA G Y W LG +FQTPE++ +RSL YMRPL+IWAM
Sbjct: 181 AATMIQEGLTWEGFRTAEGCYRTVWER--LGLAFQTPEAYCQQRVFRSLAYMRPLSIWAM 238
>gi|288558766|gb|ADC53514.1| MIP16949p [Drosophila melanogaster]
Length = 432
Score = 213 bits (541), Expect = 5e-52, Method: Compositional matrix adjust.
Identities = 117/268 (43%), Positives = 160/268 (59%), Gaps = 20/268 (7%)
Query: 609 FARAVWPSVYIAMAYMEQFDKDGDGMIENEGFPDQTYDAWSANGVSAYCGGLWVAALQAA 668
+ +A++ S M ++DKD DG+IEN PDQTYD+W +G SAYC GLW+AALQA
Sbjct: 171 YLKAMYASCKAIMERTIEYDKDNDGLIENTKMPDQTYDSWVMDGPSAYCSGLWLAALQAM 230
Query: 669 SALANDVGDHASASYFWVRYQK-----AKAVYDSLWNGSYFNYDNSDGSSSTSIQADQLA 723
SA+A + D + +RYQ +++ + LWNGSY+ +D S T I ADQL
Sbjct: 231 SAMAT-ILDQPNDC---LRYQDILEKGKRSLEEKLWNGSYYRFDLSHSHRDT-IMADQLC 285
Query: 724 GQWYARACGL-LPIADEAKVKKALTKIYDFNVLKVKGGMCGAMNGMQPD-------GRID 775
G WY ++CG I + V+ AL +IYD NV+ G GA NG + G +D
Sbjct: 286 GHWYLKSCGFDYEIYPKENVRTALKRIYDNNVMGFHEGNIGAANGFIANASEPTKPGHVD 345
Query: 776 MSGLQAREIWPGVTYGLAASMIQEEMVDMAFQTAAGVYEVAWSEDGLGYSFQTPESWNNN 835
S +QA E+WPGV Y LAA+MIQE M + AFQTA G+Y+ +G +F+TPE+
Sbjct: 346 NSNIQAEEVWPGVVYALAATMIQEGMFEEAFQTAGGMYKTL--SQRIGMNFETPEALYGE 403
Query: 836 DEYRSLCYMRPLTIWAMQWALTKPKLSR 863
YRS+ YMRPL+IW+MQ AL + + R
Sbjct: 404 KRYRSIGYMRPLSIWSMQVALERRRAQR 431
Score = 107 bits (266), Expect = 4e-20, Method: Compositional matrix adjust.
Identities = 55/125 (44%), Positives = 81/125 (64%), Gaps = 2/125 (1%)
Query: 504 MYNTYDVHFYSSFALVMLFPKLELSIQRDFAAAVMMHDPGTMKIMSDGKWVARKCLGAVP 563
MYNTYDVHFY+S AL L+P L++S+Q DF A+ T K++ DGK + RK VP
Sbjct: 1 MYNTYDVHFYASPALAHLWPNLQVSLQYDFKDAIAAELNDTRKMLYDGKVMPRKVKNCVP 60
Query: 564 HDIGLDD--PWFEINSYNLFNSSRWKDLNSKFVLQVYRDFVATGDKNFARAVWPSVYIAM 621
HD+G D P+ IN YN+ + + WKDLN+KFVLQVYRD+ + A++ S + ++
Sbjct: 61 HDLGDPDEEPFTLINCYNIHDVNDWKDLNTKFVLQVYRDYYVLNELAQAQSDNASKFSSI 120
Query: 622 AYMEQ 626
++++
Sbjct: 121 EFIDK 125
>gi|345291087|gb|AEN82035.1| AT3G24180-like protein, partial [Capsella rubella]
gi|345291089|gb|AEN82036.1| AT3G24180-like protein, partial [Capsella rubella]
gi|345291091|gb|AEN82037.1| AT3G24180-like protein, partial [Capsella rubella]
gi|345291093|gb|AEN82038.1| AT3G24180-like protein, partial [Capsella rubella]
gi|345291095|gb|AEN82039.1| AT3G24180-like protein, partial [Capsella rubella]
gi|345291097|gb|AEN82040.1| AT3G24180-like protein, partial [Capsella rubella]
gi|345291099|gb|AEN82041.1| AT3G24180-like protein, partial [Capsella rubella]
gi|345291101|gb|AEN82042.1| AT3G24180-like protein, partial [Capsella rubella]
Length = 191
Score = 213 bits (541), Expect = 6e-52, Method: Compositional matrix adjust.
Identities = 113/190 (59%), Positives = 142/190 (74%), Gaps = 1/190 (0%)
Query: 609 FARAVWPSVYIAMAYMEQFDKDGDGMIENEGFPDQTYDAWSANGVSAYCGGLWVAALQAA 668
F VWP+V AM YMEQFD+D D +IEN+GFPDQTYD W+ +GVSAYCG LW+AALQAA
Sbjct: 1 FGIDVWPAVRAAMEYMEQFDRDNDDLIENDGFPDQTYDTWTVHGVSAYCGCLWLAALQAA 60
Query: 669 SALANDVGDHASASYFWVRYQKAKAVYDS-LWNGSYFNYDNSDGSSSTSIQADQLAGQWY 727
+A+A +GD A ++ AKA + LWNGSYFNYD+ S+S SIQ DQLAGQWY
Sbjct: 61 AAMALQIGDKFFAELCKNKFLNAKAALEKKLWNGSYFNYDSGASSNSKSIQTDQLAGQWY 120
Query: 728 ARACGLLPIADEAKVKKALTKIYDFNVLKVKGGMCGAMNGMQPDGRIDMSGLQAREIWPG 787
A + GL PI +E+K+K + KI+DFNV+K KGG GA+NGM PDG++D + +Q+REIW G
Sbjct: 121 AASSGLPPIFEESKIKSTMQKIFDFNVMKTKGGKMGAVNGMHPDGKVDDTCMQSREIWTG 180
Query: 788 VTYGLAASMI 797
VTY AA+MI
Sbjct: 181 VTYAAAATMI 190
>gi|295829670|gb|ADG38504.1| AT3G24180-like protein [Capsella grandiflora]
gi|295829672|gb|ADG38505.1| AT3G24180-like protein [Capsella grandiflora]
gi|295829674|gb|ADG38506.1| AT3G24180-like protein [Capsella grandiflora]
gi|295829676|gb|ADG38507.1| AT3G24180-like protein [Capsella grandiflora]
Length = 190
Score = 212 bits (540), Expect = 7e-52, Method: Compositional matrix adjust.
Identities = 113/190 (59%), Positives = 142/190 (74%), Gaps = 1/190 (0%)
Query: 609 FARAVWPSVYIAMAYMEQFDKDGDGMIENEGFPDQTYDAWSANGVSAYCGGLWVAALQAA 668
F VWP+V AM YMEQFD+D D +IEN+GFPDQTYD W+ +GVSAYCG LW+AALQAA
Sbjct: 1 FGIDVWPAVRAAMEYMEQFDRDNDDLIENDGFPDQTYDTWTVHGVSAYCGCLWLAALQAA 60
Query: 669 SALANDVGDHASASYFWVRYQKAKAVYDS-LWNGSYFNYDNSDGSSSTSIQADQLAGQWY 727
+A+A +GD A ++ AKA + LWNGSYFNYD+ S+S SIQ DQLAGQWY
Sbjct: 61 AAMALQIGDKFFAELCKNKFLNAKAALEKKLWNGSYFNYDSGASSNSKSIQTDQLAGQWY 120
Query: 728 ARACGLLPIADEAKVKKALTKIYDFNVLKVKGGMCGAMNGMQPDGRIDMSGLQAREIWPG 787
A + GL PI +E+K+K + KI+DFNV+K KGG GA+NGM PDG++D + +Q+REIW G
Sbjct: 121 AASSGLPPIFEESKIKSTMQKIFDFNVMKTKGGKMGAVNGMHPDGKVDDTCMQSREIWTG 180
Query: 788 VTYGLAASMI 797
VTY AA+MI
Sbjct: 181 VTYAAAATMI 190
>gi|295829678|gb|ADG38508.1| AT3G24180-like protein [Neslia paniculata]
Length = 190
Score = 212 bits (539), Expect = 8e-52, Method: Compositional matrix adjust.
Identities = 112/190 (58%), Positives = 142/190 (74%), Gaps = 1/190 (0%)
Query: 609 FARAVWPSVYIAMAYMEQFDKDGDGMIENEGFPDQTYDAWSANGVSAYCGGLWVAALQAA 668
F VWP+V AM YMEQFD+D D +IEN+GFPDQTYD W+ +GVSAYCG LW+AALQAA
Sbjct: 1 FGIDVWPAVRAAMEYMEQFDRDNDDLIENDGFPDQTYDTWTVHGVSAYCGCLWLAALQAA 60
Query: 669 SALANDVGDHASASYFWVRYQKAKAVYDS-LWNGSYFNYDNSDGSSSTSIQADQLAGQWY 727
+A+A +GD A ++ AKA + LWNGSYFNYD+ S+S SIQ DQLAGQWY
Sbjct: 61 AAMALQIGDKFFAELCKNKFLNAKAALEKKLWNGSYFNYDSGASSNSKSIQTDQLAGQWY 120
Query: 728 ARACGLLPIADEAKVKKALTKIYDFNVLKVKGGMCGAMNGMQPDGRIDMSGLQAREIWPG 787
A + GL PI +E+K++ + KI+DFNV+K KGG GA+NGM PDG++D + +Q+REIW G
Sbjct: 121 AASSGLPPIFEESKIRSTMQKIFDFNVMKTKGGRMGAVNGMHPDGKVDETCMQSREIWTG 180
Query: 788 VTYGLAASMI 797
VTY AA+MI
Sbjct: 181 VTYAAAATMI 190
>gi|188570621|gb|ACD64346.1| hypothetical protein [Helianthus petiolaris]
gi|188570623|gb|ACD64347.1| hypothetical protein [Helianthus petiolaris]
Length = 210
Score = 211 bits (538), Expect = 1e-51, Method: Compositional matrix adjust.
Identities = 118/210 (56%), Positives = 158/210 (75%), Gaps = 1/210 (0%)
Query: 651 NGVSAYCGGLWVAALQAASALANDVGDHASASYFWVRYQKAKAVYDS-LWNGSYFNYDNS 709
+G+SAYCG LW+AALQAA+A+A ++ D +A + ++ KAK+ ++S LWNGSYFNYD+
Sbjct: 1 HGISAYCGCLWLAALQAAAAMAVELNDKFAAQKYKKKFLKAKSSFESKLWNGSYFNYDSG 60
Query: 710 DGSSSTSIQADQLAGQWYARACGLLPIADEAKVKKALTKIYDFNVLKVKGGMCGAMNGMQ 769
S+S SIQADQLAGQWY + GL + + K++ +L KI+DFNV+KV+GG GA+NGM
Sbjct: 61 SSSNSKSIQADQLAGQWYTASAGLPNLFGDDKIQSSLQKIFDFNVMKVRGGRMGAVNGMH 120
Query: 770 PDGRIDMSGLQAREIWPGVTYGLAASMIQEEMVDMAFQTAAGVYEVAWSEDGLGYSFQTP 829
P+G++D + +Q+RE+W GVTYG+AA+MI EM D AF TA G++ WSE+G GY+FQTP
Sbjct: 121 PNGKVDETCMQSREVWTGVTYGVAATMIHAEMEDNAFTTAEGIFIAGWSEEGFGYAFQTP 180
Query: 830 ESWNNNDEYRSLCYMRPLTIWAMQWALTKP 859
E W + YRSL YMRPL IW MQ AL KP
Sbjct: 181 EGWTMDGSYRSLVYMRPLAIWGMQQALEKP 210
>gi|188570649|gb|ACD64360.1| hypothetical protein [Helianthus petiolaris]
Length = 210
Score = 211 bits (537), Expect = 1e-51, Method: Compositional matrix adjust.
Identities = 118/210 (56%), Positives = 158/210 (75%), Gaps = 1/210 (0%)
Query: 651 NGVSAYCGGLWVAALQAASALANDVGDHASASYFWVRYQKAKAVYDS-LWNGSYFNYDNS 709
+G+SAYCG LW+AALQAA+A+A ++ D +A + ++ KAK+ ++S LWNGSYFNYD+
Sbjct: 1 HGISAYCGCLWLAALQAAAAMAVELNDKFAAQKYKKKFLKAKSSFESKLWNGSYFNYDSG 60
Query: 710 DGSSSTSIQADQLAGQWYARACGLLPIADEAKVKKALTKIYDFNVLKVKGGMCGAMNGMQ 769
S+S SIQADQLAGQWY + GL + D+ K++ +L KI+DFNV+KV+GG GA+NGM
Sbjct: 61 SSSNSKSIQADQLAGQWYTASAGLPNLFDDDKIQSSLQKIFDFNVMKVRGGRMGAVNGMH 120
Query: 770 PDGRIDMSGLQAREIWPGVTYGLAASMIQEEMVDMAFQTAAGVYEVAWSEDGLGYSFQTP 829
P+G++D + +Q+RE+W GVTYG+AA+MI M D AF TA G++ WSE+G GY+FQTP
Sbjct: 121 PNGKVDETCMQSREVWTGVTYGVAATMIHAGMEDNAFTTAEGIFIAGWSEEGFGYAFQTP 180
Query: 830 ESWNNNDEYRSLCYMRPLTIWAMQWALTKP 859
E W + YRSL YMRPL IW MQ AL KP
Sbjct: 181 EGWTMDGSYRSLVYMRPLAIWGMQQALEKP 210
>gi|292384326|gb|ADE21416.1| unknown [Helianthus neglectus]
Length = 210
Score = 210 bits (534), Expect = 3e-51, Method: Compositional matrix adjust.
Identities = 118/210 (56%), Positives = 157/210 (74%), Gaps = 1/210 (0%)
Query: 651 NGVSAYCGGLWVAALQAASALANDVGDHASASYFWVRYQKAKAVYDS-LWNGSYFNYDNS 709
+G+SAYCG LW+AALQAA+A+A ++ D +A + ++ KAK+ ++S LWNGSYFNYD+
Sbjct: 1 HGISAYCGCLWLAALQAAAAMAVELNDKFAAQKYKKKFLKAKSSFESKLWNGSYFNYDSG 60
Query: 710 DGSSSTSIQADQLAGQWYARACGLLPIADEAKVKKALTKIYDFNVLKVKGGMCGAMNGMQ 769
S+S SIQADQLAGQWY + GL + E K++ +L KI+DFNV+KV+GG GA+NGM
Sbjct: 61 SSSNSKSIQADQLAGQWYTASAGLPNLFGEDKIQSSLQKIFDFNVMKVRGGRMGAVNGMH 120
Query: 770 PDGRIDMSGLQAREIWPGVTYGLAASMIQEEMVDMAFQTAAGVYEVAWSEDGLGYSFQTP 829
P+G++D + +Q+RE+W GVTYG+AA+MI M D AF TA G++ WSE+G GY+FQTP
Sbjct: 121 PNGKVDETCMQSREVWTGVTYGVAATMIHAGMEDNAFTTAEGIFIAGWSEEGFGYAFQTP 180
Query: 830 ESWNNNDEYRSLCYMRPLTIWAMQWALTKP 859
E W + YRSL YMRPL IW MQ AL KP
Sbjct: 181 EGWTMDGSYRSLVYMRPLAIWGMQQALEKP 210
>gi|188570631|gb|ACD64351.1| hypothetical protein [Helianthus petiolaris]
Length = 210
Score = 209 bits (531), Expect = 8e-51, Method: Compositional matrix adjust.
Identities = 117/210 (55%), Positives = 157/210 (74%), Gaps = 1/210 (0%)
Query: 651 NGVSAYCGGLWVAALQAASALANDVGDHASASYFWVRYQKAKAVYDS-LWNGSYFNYDNS 709
+G+SAYCG LW+AALQAA+A+A ++ D +A + ++ KAK+ ++S LWNGSYFNYD+
Sbjct: 1 HGISAYCGCLWLAALQAAAAMAVELNDKLAAQKYKKKFLKAKSSFESKLWNGSYFNYDSG 60
Query: 710 DGSSSTSIQADQLAGQWYARACGLLPIADEAKVKKALTKIYDFNVLKVKGGMCGAMNGMQ 769
S+S SIQADQLAGQWY + GL + + K++ +L KI+DFNV+KV+GG GA+NGM
Sbjct: 61 SSSNSKSIQADQLAGQWYTASAGLPNLFGDDKIQSSLQKIFDFNVMKVRGGRMGAVNGMH 120
Query: 770 PDGRIDMSGLQAREIWPGVTYGLAASMIQEEMVDMAFQTAAGVYEVAWSEDGLGYSFQTP 829
P+G++D + +Q+RE+W GVTYG+AA+MI M D AF TA G++ WSE+G GY+FQTP
Sbjct: 121 PNGKVDETCMQSREVWTGVTYGVAATMIHAGMEDNAFTTAEGIFIAGWSEEGFGYAFQTP 180
Query: 830 ESWNNNDEYRSLCYMRPLTIWAMQWALTKP 859
E W + YRSL YMRPL IW MQ AL KP
Sbjct: 181 EGWTMDGSYRSLVYMRPLAIWGMQQALEKP 210
>gi|188570553|gb|ACD64312.1| hypothetical protein [Helianthus annuus]
Length = 210
Score = 208 bits (530), Expect = 1e-50, Method: Compositional matrix adjust.
Identities = 117/210 (55%), Positives = 157/210 (74%), Gaps = 1/210 (0%)
Query: 651 NGVSAYCGGLWVAALQAASALANDVGDHASASYFWVRYQKAKAVYDS-LWNGSYFNYDNS 709
+G+SAYCG LW+AALQAA+A+A ++ D +A + ++ KAK+ ++S LWNGSYFNYD+
Sbjct: 1 HGISAYCGCLWLAALQAATAMAVELNDKFAAQKYKKKFLKAKSSFESKLWNGSYFNYDSG 60
Query: 710 DGSSSTSIQADQLAGQWYARACGLLPIADEAKVKKALTKIYDFNVLKVKGGMCGAMNGMQ 769
S+S SIQADQLAGQWY + GL + + K++ +L KI+DFNV+KV+GG GA+NGM
Sbjct: 61 SSSNSKSIQADQLAGQWYTASAGLPNLFGDDKIQSSLQKIFDFNVMKVRGGRMGAVNGMH 120
Query: 770 PDGRIDMSGLQAREIWPGVTYGLAASMIQEEMVDMAFQTAAGVYEVAWSEDGLGYSFQTP 829
P+G++D + +Q+RE+W GVTYG+AA+MI M D AF TA G++ WSE+G GY+FQTP
Sbjct: 121 PNGKVDETCMQSREVWTGVTYGVAATMIHAGMEDNAFTTAEGIFIAGWSEEGFGYAFQTP 180
Query: 830 ESWNNNDEYRSLCYMRPLTIWAMQWALTKP 859
E W + YRSL YMRPL IW MQ AL KP
Sbjct: 181 EGWTMDGSYRSLVYMRPLAIWGMQQALEKP 210
>gi|188570503|gb|ACD64287.1| hypothetical protein [Helianthus annuus]
gi|188570505|gb|ACD64288.1| hypothetical protein [Helianthus annuus]
gi|188570507|gb|ACD64289.1| hypothetical protein [Helianthus annuus]
gi|188570509|gb|ACD64290.1| hypothetical protein [Helianthus annuus]
gi|188570513|gb|ACD64292.1| hypothetical protein [Helianthus annuus]
gi|188570515|gb|ACD64293.1| hypothetical protein [Helianthus annuus]
gi|188570517|gb|ACD64294.1| hypothetical protein [Helianthus annuus]
gi|188570519|gb|ACD64295.1| hypothetical protein [Helianthus annuus]
gi|188570521|gb|ACD64296.1| hypothetical protein [Helianthus annuus]
gi|188570523|gb|ACD64297.1| hypothetical protein [Helianthus annuus]
gi|188570525|gb|ACD64298.1| hypothetical protein [Helianthus annuus]
gi|188570527|gb|ACD64299.1| hypothetical protein [Helianthus annuus]
gi|188570529|gb|ACD64300.1| hypothetical protein [Helianthus annuus]
gi|188570533|gb|ACD64302.1| hypothetical protein [Helianthus annuus]
gi|188570535|gb|ACD64303.1| hypothetical protein [Helianthus annuus]
gi|188570537|gb|ACD64304.1| hypothetical protein [Helianthus annuus]
gi|188570539|gb|ACD64305.1| hypothetical protein [Helianthus annuus]
gi|188570541|gb|ACD64306.1| hypothetical protein [Helianthus annuus]
gi|188570543|gb|ACD64307.1| hypothetical protein [Helianthus annuus]
gi|188570545|gb|ACD64308.1| hypothetical protein [Helianthus annuus]
gi|188570547|gb|ACD64309.1| hypothetical protein [Helianthus annuus]
gi|188570549|gb|ACD64310.1| hypothetical protein [Helianthus annuus]
gi|188570551|gb|ACD64311.1| hypothetical protein [Helianthus annuus]
gi|188570555|gb|ACD64313.1| hypothetical protein [Helianthus annuus]
gi|188570557|gb|ACD64314.1| hypothetical protein [Helianthus annuus]
gi|188570559|gb|ACD64315.1| hypothetical protein [Helianthus annuus]
gi|188570561|gb|ACD64316.1| hypothetical protein [Helianthus annuus]
gi|188570563|gb|ACD64317.1| hypothetical protein [Helianthus annuus]
gi|188570565|gb|ACD64318.1| hypothetical protein [Helianthus annuus]
gi|188570567|gb|ACD64319.1| hypothetical protein [Helianthus annuus]
gi|188570569|gb|ACD64320.1| hypothetical protein [Helianthus annuus]
gi|188570571|gb|ACD64321.1| hypothetical protein [Helianthus annuus]
gi|188570573|gb|ACD64322.1| hypothetical protein [Helianthus annuus]
gi|188570575|gb|ACD64323.1| hypothetical protein [Helianthus annuus]
gi|188570577|gb|ACD64324.1| hypothetical protein [Helianthus annuus]
gi|188570579|gb|ACD64325.1| hypothetical protein [Helianthus annuus]
gi|188570581|gb|ACD64326.1| hypothetical protein [Helianthus annuus]
gi|188570583|gb|ACD64327.1| hypothetical protein [Helianthus annuus]
gi|188570585|gb|ACD64328.1| hypothetical protein [Helianthus annuus]
gi|188570587|gb|ACD64329.1| hypothetical protein [Helianthus annuus]
gi|188570589|gb|ACD64330.1| hypothetical protein [Helianthus annuus]
gi|188570591|gb|ACD64331.1| hypothetical protein [Helianthus annuus]
gi|188570593|gb|ACD64332.1| hypothetical protein [Helianthus annuus]
gi|188570595|gb|ACD64333.1| hypothetical protein [Helianthus annuus]
gi|188570597|gb|ACD64334.1| hypothetical protein [Helianthus annuus]
gi|188570599|gb|ACD64335.1| hypothetical protein [Helianthus annuus]
gi|188570601|gb|ACD64336.1| hypothetical protein [Helianthus annuus]
gi|188570603|gb|ACD64337.1| hypothetical protein [Helianthus petiolaris]
gi|188570605|gb|ACD64338.1| hypothetical protein [Helianthus petiolaris]
gi|188570607|gb|ACD64339.1| hypothetical protein [Helianthus petiolaris]
gi|188570609|gb|ACD64340.1| hypothetical protein [Helianthus petiolaris]
gi|188570611|gb|ACD64341.1| hypothetical protein [Helianthus petiolaris]
gi|188570613|gb|ACD64342.1| hypothetical protein [Helianthus petiolaris]
gi|188570615|gb|ACD64343.1| hypothetical protein [Helianthus petiolaris]
gi|188570617|gb|ACD64344.1| hypothetical protein [Helianthus petiolaris]
gi|188570619|gb|ACD64345.1| hypothetical protein [Helianthus petiolaris]
gi|188570625|gb|ACD64348.1| hypothetical protein [Helianthus petiolaris]
gi|188570627|gb|ACD64349.1| hypothetical protein [Helianthus petiolaris]
gi|188570629|gb|ACD64350.1| hypothetical protein [Helianthus petiolaris]
gi|188570633|gb|ACD64352.1| hypothetical protein [Helianthus petiolaris]
gi|188570637|gb|ACD64354.1| hypothetical protein [Helianthus petiolaris]
gi|188570641|gb|ACD64356.1| hypothetical protein [Helianthus petiolaris]
gi|188570643|gb|ACD64357.1| hypothetical protein [Helianthus petiolaris]
gi|188570645|gb|ACD64358.1| hypothetical protein [Helianthus petiolaris]
gi|188570651|gb|ACD64361.1| hypothetical protein [Helianthus petiolaris]
gi|188570653|gb|ACD64362.1| hypothetical protein [Helianthus petiolaris]
gi|188570655|gb|ACD64363.1| hypothetical protein [Helianthus petiolaris]
gi|188570657|gb|ACD64364.1| hypothetical protein [Helianthus petiolaris]
gi|188570659|gb|ACD64365.1| hypothetical protein [Helianthus petiolaris]
gi|188570661|gb|ACD64366.1| hypothetical protein [Helianthus petiolaris]
gi|188570663|gb|ACD64367.1| hypothetical protein [Helianthus petiolaris]
gi|188570665|gb|ACD64368.1| hypothetical protein [Helianthus petiolaris]
gi|292384298|gb|ADE21402.1| unknown [Helianthus neglectus]
gi|292384300|gb|ADE21403.1| unknown [Helianthus neglectus]
gi|292384302|gb|ADE21404.1| unknown [Helianthus neglectus]
gi|292384304|gb|ADE21405.1| unknown [Helianthus neglectus]
gi|292384312|gb|ADE21409.1| unknown [Helianthus neglectus]
gi|292384318|gb|ADE21412.1| unknown [Helianthus neglectus]
gi|292384320|gb|ADE21413.1| unknown [Helianthus neglectus]
gi|292384328|gb|ADE21417.1| unknown [Helianthus neglectus]
Length = 210
Score = 208 bits (530), Expect = 1e-50, Method: Compositional matrix adjust.
Identities = 117/210 (55%), Positives = 157/210 (74%), Gaps = 1/210 (0%)
Query: 651 NGVSAYCGGLWVAALQAASALANDVGDHASASYFWVRYQKAKAVYDS-LWNGSYFNYDNS 709
+G+SAYCG LW+AALQAA+A+A ++ D +A + ++ KAK+ ++S LWNGSYFNYD+
Sbjct: 1 HGISAYCGCLWLAALQAAAAMAVELNDKFAAQKYKKKFLKAKSSFESKLWNGSYFNYDSG 60
Query: 710 DGSSSTSIQADQLAGQWYARACGLLPIADEAKVKKALTKIYDFNVLKVKGGMCGAMNGMQ 769
S+S SIQADQLAGQWY + GL + + K++ +L KI+DFNV+KV+GG GA+NGM
Sbjct: 61 SSSNSKSIQADQLAGQWYTASAGLPNLFGDDKIQSSLQKIFDFNVMKVRGGRMGAVNGMH 120
Query: 770 PDGRIDMSGLQAREIWPGVTYGLAASMIQEEMVDMAFQTAAGVYEVAWSEDGLGYSFQTP 829
P+G++D + +Q+RE+W GVTYG+AA+MI M D AF TA G++ WSE+G GY+FQTP
Sbjct: 121 PNGKVDETCMQSREVWTGVTYGVAATMIHAGMEDNAFTTAEGIFIAGWSEEGFGYAFQTP 180
Query: 830 ESWNNNDEYRSLCYMRPLTIWAMQWALTKP 859
E W + YRSL YMRPL IW MQ AL KP
Sbjct: 181 EGWTMDGSYRSLVYMRPLAIWGMQQALEKP 210
>gi|188570671|gb|ACD64371.1| hypothetical protein [Bahiopsis reticulata]
gi|188570673|gb|ACD64372.1| hypothetical protein [Bahiopsis reticulata]
Length = 210
Score = 208 bits (529), Expect = 1e-50, Method: Compositional matrix adjust.
Identities = 117/210 (55%), Positives = 155/210 (73%), Gaps = 1/210 (0%)
Query: 651 NGVSAYCGGLWVAALQAASALANDVGDHASASYFWVRYQKAKAVYDS-LWNGSYFNYDNS 709
+G+SAYCG LW+AALQAA+A+A ++ D SA + ++ KAK ++S LWNGSYFNYD+
Sbjct: 1 HGISAYCGCLWLAALQAAAAMAVELNDKFSAQKYKKKFMKAKTSFESKLWNGSYFNYDSG 60
Query: 710 DGSSSTSIQADQLAGQWYARACGLLPIADEAKVKKALTKIYDFNVLKVKGGMCGAMNGMQ 769
S+S SIQADQLAGQWY + GL + + K++ +L KI+DFNV+KV+GG GA+NGM
Sbjct: 61 SSSNSKSIQADQLAGQWYTASAGLPNLFGDDKIQSSLQKIFDFNVMKVRGGRMGAVNGMH 120
Query: 770 PDGRIDMSGLQAREIWPGVTYGLAASMIQEEMVDMAFQTAAGVYEVAWSEDGLGYSFQTP 829
P+G++D + +Q+RE+W GVTYG+AA+MI M D AF TA G++ WSE+G GY+FQTP
Sbjct: 121 PNGKVDETCMQSREVWTGVTYGVAATMIHAGMEDSAFTTAEGIFTAGWSEEGFGYAFQTP 180
Query: 830 ESWNNNDEYRSLCYMRPLTIWAMQWALTKP 859
E W + YRSL YMRPL IW MQ AL P
Sbjct: 181 EGWTMDGSYRSLVYMRPLAIWGMQQALEAP 210
>gi|188570647|gb|ACD64359.1| hypothetical protein [Helianthus petiolaris]
Length = 210
Score = 207 bits (527), Expect = 2e-50, Method: Compositional matrix adjust.
Identities = 117/210 (55%), Positives = 156/210 (74%), Gaps = 1/210 (0%)
Query: 651 NGVSAYCGGLWVAALQAASALANDVGDHASASYFWVRYQKAKAVYDS-LWNGSYFNYDNS 709
+G+SAYCG LW+AALQAA+A+A ++ D +A + ++ KAK+ ++S LWNGSYFNYD+
Sbjct: 1 HGISAYCGCLWLAALQAAAAMAVELNDKFAAQKYKKKFLKAKSSFESKLWNGSYFNYDSG 60
Query: 710 DGSSSTSIQADQLAGQWYARACGLLPIADEAKVKKALTKIYDFNVLKVKGGMCGAMNGMQ 769
S+S SIQADQLAGQWY + GL + + K+ +L KI+DFNV+KV+GG GA+NGM
Sbjct: 61 SSSNSKSIQADQLAGQWYTASAGLPNLFGDDKIPSSLQKIFDFNVMKVRGGRMGAVNGMH 120
Query: 770 PDGRIDMSGLQAREIWPGVTYGLAASMIQEEMVDMAFQTAAGVYEVAWSEDGLGYSFQTP 829
P+G++D + +Q+RE+W GVTYG+AA+MI M D AF TA G++ WSE+G GY+FQTP
Sbjct: 121 PNGKVDETCMQSREVWTGVTYGVAATMIHAGMEDNAFTTAEGIFIAGWSEEGFGYAFQTP 180
Query: 830 ESWNNNDEYRSLCYMRPLTIWAMQWALTKP 859
E W + YRSL YMRPL IW MQ AL KP
Sbjct: 181 EGWTMDGSYRSLVYMRPLAIWGMQQALEKP 210
>gi|16183333|gb|AAL13692.1| GH26380p [Drosophila melanogaster]
Length = 391
Score = 207 bits (526), Expect = 3e-50, Method: Compositional matrix adjust.
Identities = 137/374 (36%), Positives = 187/374 (50%), Gaps = 83/374 (22%)
Query: 504 MYNTYDVHFYSSFALVMLFPKLELSIQRDFAAAVMMHDPGTMKIMSDGKWVARKCLGAVP 563
MYNTYDVHFY+S AL L+P L++S+Q DF A+ T K++ DGK + RK VP
Sbjct: 1 MYNTYDVHFYASPALAHLWPNLQVSLQYDFKDAIAAELNDTRKMLYDGKVMPRKVKNCVP 60
Query: 564 HDIGLDD--PWFEINSYNLFNSSRWKDLNSKFVLQVYRDFVA------------------ 603
HD+G D P+ IN YN+ + + WKDLN+KFVLQVYRD+
Sbjct: 61 HDLGDPDEEPFTLINCYNIHDVNDWKDLNTKFVLQVYRDYYVLNELAQAQSDNASKFSSI 120
Query: 604 --------------------TGDKNFARAVWPSVYI-----------AMAYMEQFDKDGD 632
+ D+ S+YI A+ Y++
Sbjct: 121 EFIDKESLYELYSQDNKRKNSADEKQQNRKSASMYINETNGKVYLMDAIGYLKAMYASCK 180
Query: 633 GMIE--------NEGFPDQT------YDAWSANGVSAYCGGLWVAALQAASALANDVGDH 678
++E N+G + T YD+W +G SAYC GLW+AALQA SA+A + D
Sbjct: 181 AIMERTIEYDKDNDGLIENTKMPDQTYDSWVMDGPSAYCSGLWLAALQAMSAMAT-ILDQ 239
Query: 679 ASASYFWVRYQK-----AKAVYDSLWNGSYFNYDNSDGSSSTSIQADQLAGQWYARACGL 733
+ +RYQ +++ + LWNGSY+ +D S T I ADQL G WY ++CG
Sbjct: 240 PNDC---LRYQDILEKGKRSLEEKLWNGSYYRFDLSHSHRDT-IMADQLCGHWYLKSCGF 295
Query: 734 -LPIADEAKVKKALTKIYDFNVLKVKGGMCGAMNGMQPD-------GRIDMSGLQAREIW 785
I + V+ AL +IYD NV+ G GA NG + G +D S +QA E+W
Sbjct: 296 DYEIYPKENVRTALKRIYDNNVMGFHEGNIGAANGFIANASEPTKPGHVDNSNIQAEEVW 355
Query: 786 PGVTYGLAASMIQE 799
PGV Y LAA+MIQE
Sbjct: 356 PGVVYALAATMIQE 369
>gi|188570511|gb|ACD64291.1| hypothetical protein [Helianthus annuus]
Length = 210
Score = 206 bits (525), Expect = 3e-50, Method: Compositional matrix adjust.
Identities = 116/210 (55%), Positives = 157/210 (74%), Gaps = 1/210 (0%)
Query: 651 NGVSAYCGGLWVAALQAASALANDVGDHASASYFWVRYQKAKAVYDS-LWNGSYFNYDNS 709
+G+SAYCG LW+AALQAA+A+A ++ D +A + ++ KAK+ ++S LWNGSYFNYD+
Sbjct: 1 HGISAYCGCLWLAALQAAAAMAVELNDKFAAQKYKKKFLKAKSSFESKLWNGSYFNYDSG 60
Query: 710 DGSSSTSIQADQLAGQWYARACGLLPIADEAKVKKALTKIYDFNVLKVKGGMCGAMNGMQ 769
S+S SIQADQLAGQWY + GL + + K++ +L KI+DFNV+KV+GG GA+NGM
Sbjct: 61 SSSNSKSIQADQLAGQWYTASAGLPNLFGDDKIQSSLQKIFDFNVMKVRGGRMGAVNGMH 120
Query: 770 PDGRIDMSGLQAREIWPGVTYGLAASMIQEEMVDMAFQTAAGVYEVAWSEDGLGYSFQTP 829
P+G++D + +Q+RE+W GVTYG+AA+MI M D AF TA G++ WSE+G GY+FQTP
Sbjct: 121 PNGKVDETCMQSREVWTGVTYGVAATMIHAGMEDNAFTTAEGIFIAGWSEEGFGYAFQTP 180
Query: 830 ESWNNNDEYRSLCYMRPLTIWAMQWALTKP 859
E W + YRSL YMRPL IW +Q AL KP
Sbjct: 181 EGWTMDGSYRSLVYMRPLAIWGIQQALEKP 210
>gi|188570531|gb|ACD64301.1| hypothetical protein [Helianthus annuus]
Length = 210
Score = 206 bits (525), Expect = 3e-50, Method: Compositional matrix adjust.
Identities = 116/210 (55%), Positives = 157/210 (74%), Gaps = 1/210 (0%)
Query: 651 NGVSAYCGGLWVAALQAASALANDVGDHASASYFWVRYQKAKAVYDS-LWNGSYFNYDNS 709
+G+SAYCG LW+AALQAA+A+A ++ D +A + ++ KAK+ ++S LWNGSYFNYD+
Sbjct: 1 HGISAYCGCLWLAALQAAAAMAVELNDKFAAQKYKKKFLKAKSSFESKLWNGSYFNYDSG 60
Query: 710 DGSSSTSIQADQLAGQWYARACGLLPIADEAKVKKALTKIYDFNVLKVKGGMCGAMNGMQ 769
S+S SIQADQLAGQWY + GL + + K++ +L KI++FNV+KV+GG GA+NGM
Sbjct: 61 SSSNSKSIQADQLAGQWYTASAGLPNLFGDDKIQSSLQKIFNFNVMKVRGGRMGAVNGMH 120
Query: 770 PDGRIDMSGLQAREIWPGVTYGLAASMIQEEMVDMAFQTAAGVYEVAWSEDGLGYSFQTP 829
P+G++D + +Q+RE+W GVTYG+AA+MI M D AF TA G++ WSE+G GY+FQTP
Sbjct: 121 PNGKVDETCMQSREVWTGVTYGVAATMIHAGMEDNAFTTAEGIFIAGWSEEGFGYAFQTP 180
Query: 830 ESWNNNDEYRSLCYMRPLTIWAMQWALTKP 859
E W + YRSL YMRPL IW MQ AL KP
Sbjct: 181 EGWTMDGSYRSLVYMRPLAIWGMQQALEKP 210
>gi|188570635|gb|ACD64353.1| hypothetical protein [Helianthus petiolaris]
gi|188570639|gb|ACD64355.1| hypothetical protein [Helianthus petiolaris]
Length = 211
Score = 206 bits (524), Expect = 5e-50, Method: Compositional matrix adjust.
Identities = 118/211 (55%), Positives = 158/211 (74%), Gaps = 2/211 (0%)
Query: 651 NGVSAYCGGLWVAALQAASALANDVGDH-ASASYFWVRYQKAKAVYDS-LWNGSYFNYDN 708
+G+SAYCG LW+AALQAA+A+A ++ D A+A + ++ KAK+ ++S LWNGSYFNYD+
Sbjct: 1 HGISAYCGCLWLAALQAAAAMAVELNDKFAAAQKYKKKFLKAKSSFESKLWNGSYFNYDS 60
Query: 709 SDGSSSTSIQADQLAGQWYARACGLLPIADEAKVKKALTKIYDFNVLKVKGGMCGAMNGM 768
S+S SIQADQLAGQWY + GL + + K++ +L KI+DFNV+KV+GG GA+NGM
Sbjct: 61 GSSSNSKSIQADQLAGQWYTASAGLPNLFGDDKIQSSLQKIFDFNVMKVRGGRMGAVNGM 120
Query: 769 QPDGRIDMSGLQAREIWPGVTYGLAASMIQEEMVDMAFQTAAGVYEVAWSEDGLGYSFQT 828
P+G++D + +Q+RE+W GVTYG+AA+MI M D AF TA G++ WSE+G GY+FQT
Sbjct: 121 HPNGKVDETCMQSREVWTGVTYGVAATMIHAGMEDNAFTTAEGIFIAGWSEEGFGYAFQT 180
Query: 829 PESWNNNDEYRSLCYMRPLTIWAMQWALTKP 859
PE W + YRSL YMRPL IW MQ AL KP
Sbjct: 181 PEGWTMDGSYRSLVYMRPLAIWGMQQALEKP 211
>gi|292384310|gb|ADE21408.1| unknown [Helianthus neglectus]
Length = 210
Score = 206 bits (524), Expect = 5e-50, Method: Compositional matrix adjust.
Identities = 117/210 (55%), Positives = 157/210 (74%), Gaps = 1/210 (0%)
Query: 651 NGVSAYCGGLWVAALQAASALANDVGDHASASYFWVRYQKAKAVYDS-LWNGSYFNYDNS 709
+G+SAYCG LW+AALQAA+A+A ++ D +A + ++ KAK+ ++S LWNGSYFNYD+
Sbjct: 1 HGISAYCGCLWLAALQAAAAMAVELNDKFAAQKYKKKFFKAKSSFESKLWNGSYFNYDSG 60
Query: 710 DGSSSTSIQADQLAGQWYARACGLLPIADEAKVKKALTKIYDFNVLKVKGGMCGAMNGMQ 769
S+S SIQADQLAGQWY + GL + + K++ +L KI+DFNV+KV+GG GA+NGM
Sbjct: 61 SSSNSKSIQADQLAGQWYTASAGLPNLFGDDKIQSSLQKIFDFNVMKVRGGRMGAVNGMH 120
Query: 770 PDGRIDMSGLQAREIWPGVTYGLAASMIQEEMVDMAFQTAAGVYEVAWSEDGLGYSFQTP 829
P+G++D + +Q+RE+W GVTYG+AA+MI M D AF TA G++ WSE+G GY+FQTP
Sbjct: 121 PNGKVDETCMQSREVWTGVTYGVAATMIHAGMEDNAFTTAEGIFIAGWSEEGFGYAFQTP 180
Query: 830 ESWNNNDEYRSLCYMRPLTIWAMQWALTKP 859
E W + YRSL YMRPL IW MQ AL KP
Sbjct: 181 EGWTMDGSYRSLVYMRPLAIWGMQQALEKP 210
>gi|292384324|gb|ADE21415.1| unknown [Helianthus neglectus]
Length = 210
Score = 205 bits (521), Expect = 1e-49, Method: Compositional matrix adjust.
Identities = 116/210 (55%), Positives = 156/210 (74%), Gaps = 1/210 (0%)
Query: 651 NGVSAYCGGLWVAALQAASALANDVGDHASASYFWVRYQKAKAVYDS-LWNGSYFNYDNS 709
+G+SA CG LW+AALQAA+A+A ++ D +A + ++ KAK+ ++S LWNGSYFNYD+
Sbjct: 1 HGISACCGCLWLAALQAAAAMAVELNDKFAAQKYKKKFLKAKSSFESKLWNGSYFNYDSG 60
Query: 710 DGSSSTSIQADQLAGQWYARACGLLPIADEAKVKKALTKIYDFNVLKVKGGMCGAMNGMQ 769
S+S SIQADQLAGQWY + GL + + K++ +L KI+DFNV+KV+GG GA+NGM
Sbjct: 61 SSSNSKSIQADQLAGQWYTASAGLPNLFGDDKIQSSLQKIFDFNVMKVRGGRMGAVNGMH 120
Query: 770 PDGRIDMSGLQAREIWPGVTYGLAASMIQEEMVDMAFQTAAGVYEVAWSEDGLGYSFQTP 829
P+G++D + +Q+RE+W GVTYG+AA+MI M D AF TA G++ WSE+G GY+FQTP
Sbjct: 121 PNGKVDETCMQSREVWTGVTYGVAATMIHAGMEDNAFTTAEGIFIAGWSEEGFGYAFQTP 180
Query: 830 ESWNNNDEYRSLCYMRPLTIWAMQWALTKP 859
E W + YRSL YMRPL IW MQ AL KP
Sbjct: 181 EGWTMDGSYRSLVYMRPLAIWGMQQALEKP 210
>gi|292384322|gb|ADE21414.1| unknown [Helianthus neglectus]
Length = 207
Score = 204 bits (520), Expect = 1e-49, Method: Compositional matrix adjust.
Identities = 116/207 (56%), Positives = 154/207 (74%), Gaps = 1/207 (0%)
Query: 654 SAYCGGLWVAALQAASALANDVGDHASASYFWVRYQKAKAVYDS-LWNGSYFNYDNSDGS 712
SAYCG LW+AALQAA+A+A ++ D +A + ++ KAK+ ++S LWNGSYFNYD+ S
Sbjct: 1 SAYCGCLWLAALQAAAAMAVELNDKFAAQKYKKKFLKAKSSFESKLWNGSYFNYDSGSSS 60
Query: 713 SSTSIQADQLAGQWYARACGLLPIADEAKVKKALTKIYDFNVLKVKGGMCGAMNGMQPDG 772
+S SIQADQLAGQWY + GL + + K++ +L KI+DFNV+KV+GG GA+NGM P+G
Sbjct: 61 NSKSIQADQLAGQWYTASAGLPNLFGDDKIQSSLQKIFDFNVMKVRGGRMGAVNGMHPNG 120
Query: 773 RIDMSGLQAREIWPGVTYGLAASMIQEEMVDMAFQTAAGVYEVAWSEDGLGYSFQTPESW 832
++D + +Q+RE+W GVTYG+AA+MI M D AF TA G++ WSE+G GY+FQTPE W
Sbjct: 121 KVDETCMQSREVWTGVTYGVAATMIHAGMEDNAFTTAEGIFIAGWSEEGFGYAFQTPEGW 180
Query: 833 NNNDEYRSLCYMRPLTIWAMQWALTKP 859
+ YRSL YMRPL IW MQ AL KP
Sbjct: 181 TMDGSYRSLVYMRPLAIWGMQQALEKP 207
>gi|188570667|gb|ACD64369.1| hypothetical protein [Bahiopsis lanata]
gi|188570669|gb|ACD64370.1| hypothetical protein [Bahiopsis lanata]
Length = 208
Score = 202 bits (515), Expect = 5e-49, Method: Compositional matrix adjust.
Identities = 116/208 (55%), Positives = 152/208 (73%), Gaps = 1/208 (0%)
Query: 653 VSAYCGGLWVAALQAASALANDVGDHASASYFWVRYQKAKAVYDS-LWNGSYFNYDNSDG 711
+SAYCG LW+AALQAA+A+A + D SA + ++ KAK ++S LWNGSYFNYD+
Sbjct: 1 ISAYCGCLWLAALQAAAAMAVALNDKFSAQKYKKKFMKAKTSFESKLWNGSYFNYDSGSS 60
Query: 712 SSSTSIQADQLAGQWYARACGLLPIADEAKVKKALTKIYDFNVLKVKGGMCGAMNGMQPD 771
S+S SIQADQLAGQWY + GL + + K++ +L KI+DFNV+KV+GG GA+NGM P+
Sbjct: 61 SNSKSIQADQLAGQWYTASAGLPNLFGDDKIQSSLHKIFDFNVMKVRGGRMGAVNGMHPN 120
Query: 772 GRIDMSGLQAREIWPGVTYGLAASMIQEEMVDMAFQTAAGVYEVAWSEDGLGYSFQTPES 831
G++D + +Q+RE+W GVTYG+AA+MI M D AF TA G++ WSE+G GY+FQTPE
Sbjct: 121 GKVDETCMQSREVWTGVTYGVAATMIHAGMEDKAFTTAEGIFTAGWSEEGFGYAFQTPEG 180
Query: 832 WNNNDEYRSLCYMRPLTIWAMQWALTKP 859
W + YRSL YMRPL IW MQ AL P
Sbjct: 181 WTMDGCYRSLVYMRPLAIWGMQQALEAP 208
>gi|292384308|gb|ADE21407.1| unknown [Helianthus neglectus]
Length = 205
Score = 202 bits (515), Expect = 5e-49, Method: Compositional matrix adjust.
Identities = 115/204 (56%), Positives = 153/204 (75%), Gaps = 1/204 (0%)
Query: 654 SAYCGGLWVAALQAASALANDVGDHASASYFWVRYQKAKAVYDS-LWNGSYFNYDNSDGS 712
SAYCG LW+AALQAA+A+A ++ D +A + ++ KAK+ ++S LWNGSYFNYD+ S
Sbjct: 1 SAYCGCLWLAALQAAAAMAVELNDKFAAQKYKKKFMKAKSSFESKLWNGSYFNYDSGSSS 60
Query: 713 SSTSIQADQLAGQWYARACGLLPIADEAKVKKALTKIYDFNVLKVKGGMCGAMNGMQPDG 772
+S SIQADQLAGQWY + GL + D+ K++ +L KI+DFNV+KV+GG GA+NGM P+G
Sbjct: 61 NSKSIQADQLAGQWYTASAGLPNLFDDDKIQSSLQKIFDFNVMKVRGGRMGAVNGMHPNG 120
Query: 773 RIDMSGLQAREIWPGVTYGLAASMIQEEMVDMAFQTAAGVYEVAWSEDGLGYSFQTPESW 832
++D + +Q+RE+W GVTYG+AA+MI M D AF TA G++ WSE+G GY+FQTPE W
Sbjct: 121 KVDETCMQSREVWTGVTYGVAATMIHAGMEDNAFTTAEGIFIAGWSEEGFGYAFQTPEGW 180
Query: 833 NNNDEYRSLCYMRPLTIWAMQWAL 856
+ YRSL YMRPL IW MQ AL
Sbjct: 181 TMDGSYRSLVYMRPLAIWGMQQAL 204
>gi|292384306|gb|ADE21406.1| unknown [Helianthus neglectus]
Length = 205
Score = 200 bits (509), Expect = 3e-48, Method: Compositional matrix adjust.
Identities = 114/204 (55%), Positives = 152/204 (74%), Gaps = 1/204 (0%)
Query: 654 SAYCGGLWVAALQAASALANDVGDHASASYFWVRYQKAKAVYDS-LWNGSYFNYDNSDGS 712
SAYCG LW+AALQAA+A+A ++ D +A + ++ KAK+ ++S LWNGSYFNYD+ S
Sbjct: 1 SAYCGCLWLAALQAAAAMAVELNDKFTAQKYKKKFLKAKSSFESKLWNGSYFNYDSGSSS 60
Query: 713 SSTSIQADQLAGQWYARACGLLPIADEAKVKKALTKIYDFNVLKVKGGMCGAMNGMQPDG 772
+S SIQADQLAGQWY + GL + + K++ +L KI+DFNV+KV+GG GA+NGM P+G
Sbjct: 61 NSKSIQADQLAGQWYTASAGLPNLFGDDKIQSSLQKIFDFNVMKVRGGRMGAVNGMHPNG 120
Query: 773 RIDMSGLQAREIWPGVTYGLAASMIQEEMVDMAFQTAAGVYEVAWSEDGLGYSFQTPESW 832
++D + +Q+RE+W GVTYG+AA+MI M D AF TA G++ WSE+G GY+FQTPE W
Sbjct: 121 KVDETCMQSREVWTGVTYGVAATMIHAGMEDNAFTTAEGIFIAGWSEEGFGYAFQTPEGW 180
Query: 833 NNNDEYRSLCYMRPLTIWAMQWAL 856
+ YRSL YMRPL IW MQ AL
Sbjct: 181 TMDGSYRSLVYMRPLAIWGMQQAL 204
>gi|292384314|gb|ADE21410.1| unknown [Helianthus neglectus]
Length = 205
Score = 200 bits (508), Expect = 3e-48, Method: Compositional matrix adjust.
Identities = 114/204 (55%), Positives = 152/204 (74%), Gaps = 1/204 (0%)
Query: 654 SAYCGGLWVAALQAASALANDVGDHASASYFWVRYQKAKAVYDS-LWNGSYFNYDNSDGS 712
SAYCG LW+AALQAA+A+A ++ D +A + ++ KAK+ ++S LWNGSYFNYD+ S
Sbjct: 1 SAYCGCLWLAALQAAAAMAVELNDKFAAQKYKKKFLKAKSSFESKLWNGSYFNYDSGSSS 60
Query: 713 SSTSIQADQLAGQWYARACGLLPIADEAKVKKALTKIYDFNVLKVKGGMCGAMNGMQPDG 772
+S SIQADQLAGQWY + GL + + K++ +L KI+DFNV+KV+GG GA+NGM P+G
Sbjct: 61 NSKSIQADQLAGQWYTASAGLPNLFGDDKIQSSLQKIFDFNVMKVRGGRMGAVNGMHPNG 120
Query: 773 RIDMSGLQAREIWPGVTYGLAASMIQEEMVDMAFQTAAGVYEVAWSEDGLGYSFQTPESW 832
++D + +Q+RE+W GVTYG+AA+MI M D AF TA G++ WSE+G GY+FQTPE W
Sbjct: 121 KVDETCMQSREVWTGVTYGVAATMIHAGMEDNAFTTAEGIFIAGWSEEGFGYAFQTPEGW 180
Query: 833 NNNDEYRSLCYMRPLTIWAMQWAL 856
+ YRSL YMRPL IW MQ AL
Sbjct: 181 TMDGSYRSLVYMRPLAIWGMQQAL 204
>gi|292384316|gb|ADE21411.1| unknown [Helianthus neglectus]
Length = 205
Score = 198 bits (503), Expect = 1e-47, Method: Compositional matrix adjust.
Identities = 93/159 (58%), Positives = 119/159 (74%)
Query: 698 LWNGSYFNYDNSDGSSSTSIQADQLAGQWYARACGLLPIADEAKVKKALTKIYDFNVLKV 757
LWNGSYFNYD+ S+S SIQADQLAGQWY + GL + + K++ +L KI+DFNV+KV
Sbjct: 46 LWNGSYFNYDSGSSSNSKSIQADQLAGQWYTASAGLPNLFGDDKIQSSLQKIFDFNVMKV 105
Query: 758 KGGMCGAMNGMQPDGRIDMSGLQAREIWPGVTYGLAASMIQEEMVDMAFQTAAGVYEVAW 817
+GG GA+NGM P+G++D + +Q+RE+W GVTYG+AA+MI M D AF TA G++ W
Sbjct: 106 RGGRMGAVNGMHPNGKVDETCMQSREVWTGVTYGVAATMIHAGMEDNAFTTAEGIFIAGW 165
Query: 818 SEDGLGYSFQTPESWNNNDEYRSLCYMRPLTIWAMQWAL 856
SE+G GY+FQTPE W + YRSL YMRPL IW MQ AL
Sbjct: 166 SEEGFGYAFQTPEGWTMDGSYRSLVYMRPLAIWGMQQAL 204
>gi|295828610|gb|ADG37974.1| AT1G33700-like protein [Capsella grandiflora]
Length = 105
Score = 195 bits (496), Expect = 9e-47, Method: Compositional matrix adjust.
Identities = 89/105 (84%), Positives = 99/105 (94%)
Query: 570 DPWFEINSYNLFNSSRWKDLNSKFVLQVYRDFVATGDKNFARAVWPSVYIAMAYMEQFDK 629
DPWFE+N+YNLFN++RWKDLNSKFVLQVYRD VATGD NFA+AVWPSVY A+AY++QFDK
Sbjct: 1 DPWFEVNAYNLFNTNRWKDLNSKFVLQVYRDVVATGDLNFAKAVWPSVYTAIAYLDQFDK 60
Query: 630 DGDGMIENEGFPDQTYDAWSANGVSAYCGGLWVAALQAASALAND 674
DGDGMIENEGFPDQTYDAWS +GVSAYCGGLWVAALQA SALA +
Sbjct: 61 DGDGMIENEGFPDQTYDAWSCSGVSAYCGGLWVAALQAGSALARE 105
>gi|295828604|gb|ADG37971.1| AT1G33700-like protein [Capsella grandiflora]
gi|295828608|gb|ADG37973.1| AT1G33700-like protein [Capsella grandiflora]
Length = 105
Score = 195 bits (496), Expect = 9e-47, Method: Compositional matrix adjust.
Identities = 89/105 (84%), Positives = 98/105 (93%)
Query: 570 DPWFEINSYNLFNSSRWKDLNSKFVLQVYRDFVATGDKNFARAVWPSVYIAMAYMEQFDK 629
DPWFE+N+YNLFN+ RWKDLNSKFVLQVYRD VATGD NFA+AVWPSVY A+AY++QFDK
Sbjct: 1 DPWFEVNAYNLFNTBRWKDLNSKFVLQVYRDVVATGDLNFAKAVWPSVYTAIAYLDQFDK 60
Query: 630 DGDGMIENEGFPDQTYDAWSANGVSAYCGGLWVAALQAASALAND 674
DGDGMIENEGFPDQTYDAWS +GVSAYCGGLWVAALQA SALA +
Sbjct: 61 DGDGMIENEGFPDQTYDAWSCSGVSAYCGGLWVAALQAGSALARE 105
>gi|295828602|gb|ADG37970.1| AT1G33700-like protein [Capsella grandiflora]
gi|295828606|gb|ADG37972.1| AT1G33700-like protein [Capsella grandiflora]
gi|295828612|gb|ADG37975.1| AT1G33700-like protein [Capsella grandiflora]
gi|295828614|gb|ADG37976.1| AT1G33700-like protein [Neslia paniculata]
Length = 105
Score = 195 bits (495), Expect = 1e-46, Method: Compositional matrix adjust.
Identities = 89/105 (84%), Positives = 98/105 (93%)
Query: 570 DPWFEINSYNLFNSSRWKDLNSKFVLQVYRDFVATGDKNFARAVWPSVYIAMAYMEQFDK 629
DPWFE+N+YNLFN+ RWKDLNSKFVLQVYRD VATGD NFA+AVWPSVY A+AY++QFDK
Sbjct: 1 DPWFEVNAYNLFNTDRWKDLNSKFVLQVYRDVVATGDLNFAKAVWPSVYTAIAYLDQFDK 60
Query: 630 DGDGMIENEGFPDQTYDAWSANGVSAYCGGLWVAALQAASALAND 674
DGDGMIENEGFPDQTYDAWS +GVSAYCGGLWVAALQA SALA +
Sbjct: 61 DGDGMIENEGFPDQTYDAWSCSGVSAYCGGLWVAALQAGSALARE 105
>gi|324503709|gb|ADY41606.1| Non-lysosomal glucosylceramidase [Ascaris suum]
Length = 261
Score = 183 bits (465), Expect = 3e-43, Method: Compositional matrix adjust.
Identities = 103/250 (41%), Positives = 143/250 (57%), Gaps = 11/250 (4%)
Query: 621 MAYMEQFDKDGDGMIENEGFPDQTYDAWSANGVSAYCGGLWVAALQAASALANDVGDHAS 680
+A + +D DGD +IEN G PDQTYD WS +G SAYCGGLW+ AL+ +A +G+
Sbjct: 17 IADAKAWDVDGDDLIENAGQPDQTYDVWSMHGSSAYCGGLWLCALECVRRMALTLGEVVD 76
Query: 681 ASYFWVRYQKAKAVYD-SLWNGSYFNYDNSDGSSSTSIQADQLAGQWYARACGLLPIADE 739
A F + A+ Y+ LWNG YF++D + SI ADQL G W+ C D+
Sbjct: 77 AQKFANKLNNARKAYERKLWNGKYFDFDEH-STDHKSIMADQLCGFWF--MCITDGKVDD 133
Query: 740 AKVKK-----ALTKIYDFNVLKVKGGMCGAMNGMQPDGRIDMSGLQAREIWPGVTYGLAA 794
+ + +L I+++NV K G G +N M P G +D +G+Q+ E+W GV Y LA+
Sbjct: 134 VIITRQQICASLKTIFEYNVEKFANGQLGPVNAMMPSGVVDSTGIQSEEVWGGVAYALAS 193
Query: 795 SMIQEEMVDMAFQTAAGVYEVAWSEDGLGYSFQTPESWNNNDEYRSLCYMRPLTIWAMQW 854
+ E + AF+TA G Y W GL Y Q+PE+ N + YR++ YMRPL IWAMQ
Sbjct: 194 FHLLVEENESAFKTAEGWYRSCWERYGLQY--QSPEAINESSYYRAIGYMRPLAIWAMQS 251
Query: 855 ALTKPKLSRQ 864
AL + RQ
Sbjct: 252 ALDALRNKRQ 261
>gi|344251774|gb|EGW07878.1| Non-lysosomal glucosylceramidase [Cricetulus griseus]
Length = 226
Score = 157 bits (398), Expect = 2e-35, Method: Compositional matrix adjust.
Identities = 84/183 (45%), Positives = 109/183 (59%), Gaps = 10/183 (5%)
Query: 672 ANDVGDHASASYFWVRYQKAKAVYDSLWNGSYFNYDNSDGSSSTSIQADQLAGQWYARAC 731
A DV D S+ + + +A LWNG Y+NYD+S S SI +DQ AGQW+ RAC
Sbjct: 7 AQDVQDKFSS----ILCRGREAYERLLWNGRYYNYDSSSQPQSRSIMSDQCAGQWFLRAC 62
Query: 732 GL----LPIADEAKVKKALTKIYDFNVLKVKGGMCGAMNGMQPDGRIDMSGLQAREIWPG 787
GL + V +AL I++ NV GG GA+NGMQP G D S +Q+ E+W G
Sbjct: 63 GLGEGDTEVFPTLHVVRALQTIFELNVQAFAGGAMGAVNGMQPHGVPDRSSVQSDEVWVG 122
Query: 788 VTYGLAASMIQEEMVDMAFQTAAGVYEVAWSEDGLGYSFQTPESWNNNDEYRSLCYMRPL 847
V YGLAA+MIQE + F+TA G Y W + LG +FQTPE++ +RSL YMRPL
Sbjct: 123 VVYGLAATMIQEGLTREGFRTAEGCYRTVW--ERLGLAFQTPEAYCQQQVFRSLAYMRPL 180
Query: 848 TIW 850
+IW
Sbjct: 181 SIW 183
>gi|303285310|ref|XP_003061945.1| hypothetical protein MICPUCDRAFT_48400 [Micromonas pusilla CCMP1545]
gi|226456356|gb|EEH53657.1| hypothetical protein MICPUCDRAFT_48400 [Micromonas pusilla CCMP1545]
Length = 1073
Score = 157 bits (398), Expect = 2e-35, Method: Compositional matrix adjust.
Identities = 175/671 (26%), Positives = 262/671 (39%), Gaps = 161/671 (23%)
Query: 333 SEPGSSIGAAIAASLTIPSGSTRSVTFSLAWDCPEVKFFE-KVYHRRYTKFYGTLGDSAA 391
+ PG++ AA++A ++ G RS+ F++AWD P F + +RY G AA
Sbjct: 378 ASPGAA--AAVSAKFSLKPGERRSIAFAIAWDLPVATFPRGPAFLKRYAWRRRRRGGGAA 435
Query: 392 RIAHDAILEHAKWECEIEAWQRPILEDK---------RFPEWYPITLFNELYYLNAGGTI 442
+A A+ A WE + AWQ P ++ R P W+ LFNELYYL GGT+
Sbjct: 436 DVAMSALRRAASWERRVRAWQAPYVDSAANADSKSPVRRPGWFVTALFNELYYLVDGGTL 495
Query: 443 WTDGSP-PMQSLATIRERKFSLDTPSKIHTPASSDTA----LGTRLLENGEE-------- 489
W G P + L D S+D A L R GE
Sbjct: 496 W--GRPLSVGGLDGDEFDDDEDDEVEDALDGDSADDAEVRLLPIRPRSRGERRSLRRLGL 553
Query: 490 -----------------------NIGQFLYLEGAEYVMYNTYDVHFYSSFALVMLFPKLE 526
+G+F + YN D +FY S+ L L+P L+
Sbjct: 554 LGGEDGDGGDARDVIGNGGAWGGTLGRFGSASDVDRATYNALDEYFYGSWPLATLWPGLD 613
Query: 527 LSIQRDFAAAVMMHDPGTMKIMSDGKWVA------------------------RKCLGAV 562
L++ D A AV D +++ W A RK G
Sbjct: 614 LAVIGDLAHAVDAADDTERRVL----WRALGSAKKSAGGTGATTTTIASQTKRRKIRGVA 669
Query: 563 PHDIGL--DDPWFEI-NSYNLFNSSRWKDLNSKFVLQVYRDFVATG-------------- 605
P D+G D P N+ + +S+ DL K +L V R G
Sbjct: 670 PRDLGTSRDAPLTSAPNALSDVDSNALLDLAPKLMLLVARAHALRGGFRGPEAMASFGGG 729
Query: 606 -DKNFARAVWPSVYIAMA-YMEQFDKDGDGMIENEGFPDQT---YDAWSANG-VSAYCGG 659
+++ R V+ Y A+A + D +GDG+ E+ T YD W +G S+Y GG
Sbjct: 730 LERDALRRVFEPAYRALATQLRARDLNGDGLPEHGDADAPTLSGYDRWVVSGNASSYGGG 789
Query: 660 LWVAALQAASALANDVGD-HASASYFWVRYQKAKAVYDSLWNG----------------- 701
LW+AAL+A + +A D+ + A S AK+ ++LW G
Sbjct: 790 LWLAALRAGAGIARDLDETDARTSLEETLRIAAKSFDEALWRGGGGGGGGGVRKSQSRSR 849
Query: 702 -------SYFNYDNSDGSSSTSIQADQLAGQW---YARACGLLPIADEAKVKKALTKIYD 751
Y+ D S + + A Q+ G+W RA G+L D KV+ AL+ Y+
Sbjct: 850 RGTHALEGYYAADASGTDAGDVVLAGQVMGEWALGMIRAPGVL---DPRKVRAALSTTYE 906
Query: 752 FNVLKVKGGM----------------CGAMNGMQPDGRIDMSGL----------QAREIW 785
NV GA+NG + D D + L QARE
Sbjct: 907 RNVKAFGRARGGSGSGSDDEPSTYVPSGAVNGARVDDATDAAALGVGAGDGIPAQARESR 966
Query: 786 PGVTYGLAASMIQEEMVDMAFQTAAGVYEVAWSEDGLGYSFQTPESWNNNDEYRSLCYMR 845
G +Y LA+ +I D + A G Y V + + G++F+TPE ++ +R R
Sbjct: 967 VGQSYALASHLILAGFSDEGWDVARGAYRVTYED---GFAFRTPEIFDAERRFRGAISGR 1023
Query: 846 PLTIWAMQWAL 856
+WA++ +L
Sbjct: 1024 AGAVWAIERSL 1034
Score = 57.4 bits (137), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 50/159 (31%), Positives = 75/159 (47%), Gaps = 25/159 (15%)
Query: 101 RSYRGEFQRFKLFHGICDDAPVLANQFSVFVSRPNGEKFSSVLCPRSP------GVPKKN 154
RSY G+F R+ L G V A+ F+ R NG ++VL SP GVP
Sbjct: 90 RSYLGDFSRWHLKVG-SHTHRVAAHTFA--ACRVNG--VATVLSA-SPAHASLEGVPSSE 143
Query: 155 TDSGIESWDWNL------KGENCTYHALFPRAWTVYDGEPD---PELRIVCRQI--SPFI 203
D G Y AL+PRAW Y+ PD P+ R+V Q+ SP +
Sbjct: 144 DDDDSPPPRTRRRRTLPEDGIGGNYTALYPRAW--YEYAPDALHPDARVVVSQVQFSPVL 201
Query: 204 PHNYKESSFPASVFTFTLSNSGQTSADVTLLFTWANSVA 242
P +Y+ S+ P V F N G+ +A+V+++ ++ + +A
Sbjct: 202 PGDYRASAVPTGVLRFVARNDGRATANVSIMLSFESVLA 240
>gi|148242847|ref|YP_001228004.1| hypothetical protein SynRCC307_1748 [Synechococcus sp. RCC307]
gi|147851157|emb|CAK28651.1| Conserved hypothetical protein [Synechococcus sp. RCC307]
Length = 606
Score = 148 bits (374), Expect = 1e-32, Method: Compositional matrix adjust.
Identities = 129/487 (26%), Positives = 200/487 (41%), Gaps = 81/487 (16%)
Query: 87 QGIPLGGIGAGSIGRSYRGEFQRFKLFHGICDDAPVLANQFSVFVSRPNGEKFSSVLCPR 146
G PLGG GAG IGRS +G+F + + G + QF++F + + ++
Sbjct: 54 HGAPLGGFGAGCIGRSSKGDFNLWNIDGGEHWFGSLPDCQFALFEDNGSDTRAHALAVAP 113
Query: 147 SPGVPKKNTDSGIESWDW----NLKGENCTYHALFPRAWTVYDGEPDPELRIVCRQISPF 202
K ++ + SW W + +Y A +P W Y G E C SP
Sbjct: 114 EADSSKPEGEAPLSSWSWYPASTAEKPTGSYSARYPLNWFHYSGVFGCE--ASCESFSPI 171
Query: 203 IPHNYKESSFPASVFTFTLSNSGQTSADVTLLFTWANSVA-------------GDSGLSG 249
IP NY++++ P +VF + N + ++LL +W N+ D G
Sbjct: 172 IPGNYRDTALPVAVFRWRFRNPSRKPLKLSLLLSWRNTTGWFCNTDPSAAVHFRDDGSPE 231
Query: 250 HHF----------NSKTMTKDGVHGLTLH----HRTANGRPPVTFAVAAEETADVHVSEC 295
H++ ++ + G+ G+ + A G+ AV EE V + C
Sbjct: 232 HNYLPAIGATQGQRNRAINAPGLRGVVMEGPMSEPIAEGQGQWCIAVP-EELEGVELMRC 290
Query: 296 PCFLLSGNSKGITAKDMWNEIKKHGSFDHLDNDKTSPSEPGSSIGAAIAASLTIPSGSTR 355
+ G+ ++W GS +ND+ S + + AA+A + G+
Sbjct: 291 SRWNPHGD-----GAEIWQRFAADGSVPESNNDRRSGNN--DPVSAAMALRFELAPGAEL 343
Query: 356 SVTFSLAWDCPEVKFFEKVYH-RRYTKFYGTLGDSAARIAHDAILEHAKWECEIEAWQRP 414
+ ++WD P F RRYT F+G GD AA IA DA+L +W +IE WQ P
Sbjct: 344 ELPVVISWDLPVTAFATGSQCLRRYTDFFGPSGDQAAAIAADALLRWPRWRQQIEEWQAP 403
Query: 415 ILEDKRFPEWYPITLFNELYYLNAGGTIWTDGSPPMQSLATIRERKFSLDTPSKIHTPAS 474
+L PE + L NELY L +GGT+W+ S
Sbjct: 404 VLLRDDLPEELRMALCNELYDLASGGTLWSAAS--------------------------- 436
Query: 475 SDTALGTRLLENGEENIGQFLYLEGAEYVMYNTYDVHFYSSFALVMLFPKLELSIQRDFA 534
+ +G+F LE +Y Y + DV Y SFAL+ L+P L+ S+ R FA
Sbjct: 437 ------------ASDPVGRFGVLECLDYAWYESLDVRLYGSFALLQLWPALDRSVLRSFA 484
Query: 535 AAVMMHD 541
A+ D
Sbjct: 485 RAIPAAD 491
>gi|449691194|ref|XP_004212588.1| PREDICTED: non-lysosomal glucosylceramidase-like, partial [Hydra
magnipapillata]
Length = 169
Score = 142 bits (359), Expect = 7e-31, Method: Compositional matrix adjust.
Identities = 73/162 (45%), Positives = 98/162 (60%), Gaps = 7/162 (4%)
Query: 702 SYFNYDNSDGSSSTSIQADQLAGQWYARACGLLPIADE-----AKVKKALTKIYDFNVLK 756
+Y+NYD S+ I A Q AGQWY AC L D+ +V L I+ NV++
Sbjct: 1 TYYNYDGSENDYHNCIMAGQCAGQWYLHACDLGQTEDDLVFRPEQVHSTLKVIFQNNVMR 60
Query: 757 VKGGMCGAMNGMQPDGRIDMSGLQAREIWPGVTYGLAASMIQEEMVDMAFQTAAGVYEVA 816
+ G GA+NGM+ DG +D + LQA E+W GVTY LAA+MIQ MV F+TA G+Y
Sbjct: 61 FEDGFMGAVNGMRADGTVDKNSLQAEEVWTGVTYALAATMIQHGMVREGFRTAEGIYRTV 120
Query: 817 WSEDGLGYSFQTPESWNNNDEYRSLCYMRPLTIWAMQWALTK 858
+ + GLG FQTPE+ N +RS YMRPL+IW++ A +K
Sbjct: 121 YEKWGLG--FQTPEAITANKTFRSRGYMRPLSIWSIYHAYSK 160
>gi|226444526|gb|ACO57837.1| unknown [Helianthus annuus]
gi|226444528|gb|ACO57838.1| unknown [Helianthus annuus]
gi|226444560|gb|ACO57854.1| unknown [Helianthus petiolaris]
gi|226444568|gb|ACO57858.1| unknown [Helianthus petiolaris]
Length = 103
Score = 135 bits (339), Expect = 1e-28, Method: Compositional matrix adjust.
Identities = 60/103 (58%), Positives = 76/103 (73%)
Query: 756 KVKGGMCGAMNGMQPDGRIDMSGLQAREIWPGVTYGLAASMIQEEMVDMAFQTAAGVYEV 815
KV+GG GA+NGM P+G++D + +Q+RE+W GVTYG+AA+MI M D AF TA G++
Sbjct: 1 KVRGGRMGAVNGMHPNGKVDETCMQSREVWTGVTYGVAATMIHAGMEDKAFTTAEGIFIA 60
Query: 816 AWSEDGLGYSFQTPESWNNNDEYRSLCYMRPLTIWAMQWALTK 858
WSE+G GY+FQTPE W + YRSL YMRPL IW MQ AL K
Sbjct: 61 GWSEEGFGYAFQTPEGWTMDGSYRSLVYMRPLAIWGMQQALEK 103
>gi|86144926|ref|ZP_01063258.1| hypothetical protein MED222_10953 [Vibrio sp. MED222]
gi|85837825|gb|EAQ55937.1| hypothetical protein MED222_10953 [Vibrio sp. MED222]
Length = 1024
Score = 135 bits (339), Expect = 2e-28, Method: Compositional matrix adjust.
Identities = 150/637 (23%), Positives = 258/637 (40%), Gaps = 109/637 (17%)
Query: 153 KNTDSGIESWDWNLKGENCTYHALFPRAWTVYDGEPDPELRIVCRQISPFIPHNYKESSF 212
+N D+G ++ G + Y AL+P A Y D ++I + +SP + + + S
Sbjct: 200 ENNDTG------SVNGRDIEYQALYPLAEYKYSSFDD--IKIKRKVVSPIVKEDKRLCSL 251
Query: 213 PASVFTFTLSNSGQTSADVTLLFTWANSVAG--DSGLSGHHFNSKTMTKDGVHGLTLHHR 270
P F L+N+ + +TL N + G G ++ T++++ + H+
Sbjct: 252 PMHWNHFALTNNSSQTRVITLAQPLKNLIGSTYQKGRDGIQDSACTLSQNPI---AQQHQ 308
Query: 271 TANGRPPVTFAVAAEETADVHVSECPCFLLSGNSKGITAKDMWNEIKKHGSFD------- 323
T N L G S T + ++ +
Sbjct: 309 TVN--------------------------LKGESHSFTGVQLTSQSPYQSDIEGEVVFGV 342
Query: 324 HLDNDKTSPSEPGSSIGAAIAASLTIPSGSTRSVTFSLAWDCPEVKFFEKVYHRR----- 378
DN T + S+ + +S T F+L +F +Y R
Sbjct: 343 QADNQLTESGKVSISVKPTLYSSKT-----AEQTEFALKTGRTNAEFQTGIYTGREALSA 397
Query: 379 --YTKFYGTLGDSAARIAHDAILEHAKWECEIEAWQRPILEDKRFPEWYPITLFNELYYL 436
+ G+S + ++ H+K + W DK + ++YP T
Sbjct: 398 LVVVQVELEAGESV-DLRFTQVMAHSK--VMLNGWHS----DKAYTQFYPQT-------- 442
Query: 437 NAGGTIWTDGSPPMQSLA---TIRERKFSLDTPSKIHTPASS---DTALGTRLLENGEEN 490
+ D P ++++ ++ F SKI P S+ T L E
Sbjct: 443 QPALPMLEDMLPKLETIEQQIVAQQTAFLEQAQSKISQPESALRYATMAMNSLSFLAEST 502
Query: 491 I----GQFLYLEGAEYVMYNTYDVHFYSSFALVMLFPKLELSIQRDFAAAVMMHD----- 541
+ +FL E +Y +N+ DV+FY SF+L+ L P+L+ + ++F+ A++ D
Sbjct: 503 VWDKEDKFLVKECVDYPFFNSLDVYFYGSFSLLYLLPELDGCVMKEFSKAILAEDFTERR 562
Query: 542 ------PGTMKIMSDGKWVARKCLGAVPHDIGLDDPW-FEINSYNLFNSSRWKDLNSKFV 594
+++ D R GAV HD+G P+ + ++Y+ N WKDL K++
Sbjct: 563 YWEYEATPNAELIDDKYQGVRAIRGAVIHDLG--SPFDIQPDAYSWHNVKEWKDLAPKYI 620
Query: 595 LQVYRDFVATGDKNFARAVWPSVYIAMAYMEQFDKDGDGMIENEGFPDQTYDAWSANGVS 654
L VYR + T D + + W +V ++ ++ +GD + G D T+D +++G+S
Sbjct: 621 LMVYRHYQNTQDISVVKECWQAVTESIDFLSNLIAEGDDLPLTRG-TDDTFDNLASHGIS 679
Query: 655 AYCGGLWVAALQAASALANDVGDHASASYFWVRYQKAKA-VYDSLWNGS--YFNYDNSDG 711
YC LWVA LQAAS LA +G+ AS + R +KA A V SLW+ Y+++
Sbjct: 680 IYCASLWVAGLQAASELAQLMGESELASGYLTRSKKALATVEQSLWDDKEGYYHF----- 734
Query: 712 SSSTSIQADQLAGQWYA--RACGLLPIADEAKVKKAL 746
T +QA L G+ Y GL+ D K L
Sbjct: 735 -FVTPVQAKHLTGEGYQALETLGLILTGDAIADKNTL 770
>gi|226444514|gb|ACO57831.1| unknown [Helianthus annuus]
gi|226444516|gb|ACO57832.1| unknown [Helianthus annuus]
gi|226444518|gb|ACO57833.1| unknown [Helianthus annuus]
gi|226444520|gb|ACO57834.1| unknown [Helianthus annuus]
gi|226444522|gb|ACO57835.1| unknown [Helianthus annuus]
gi|226444530|gb|ACO57839.1| unknown [Helianthus annuus]
gi|226444532|gb|ACO57840.1| unknown [Helianthus annuus]
gi|226444534|gb|ACO57841.1| unknown [Helianthus annuus]
gi|226444536|gb|ACO57842.1| unknown [Helianthus annuus]
gi|226444538|gb|ACO57843.1| unknown [Helianthus annuus]
gi|226444540|gb|ACO57844.1| unknown [Helianthus annuus]
gi|226444542|gb|ACO57845.1| unknown [Helianthus annuus]
gi|226444544|gb|ACO57846.1| unknown [Helianthus annuus]
gi|226444546|gb|ACO57847.1| unknown [Helianthus petiolaris]
gi|226444548|gb|ACO57848.1| unknown [Helianthus petiolaris]
gi|226444550|gb|ACO57849.1| unknown [Helianthus petiolaris]
gi|226444552|gb|ACO57850.1| unknown [Helianthus petiolaris]
gi|226444554|gb|ACO57851.1| unknown [Helianthus petiolaris]
gi|226444556|gb|ACO57852.1| unknown [Helianthus petiolaris]
gi|226444558|gb|ACO57853.1| unknown [Helianthus petiolaris]
gi|226444562|gb|ACO57855.1| unknown [Helianthus petiolaris]
gi|226444564|gb|ACO57856.1| unknown [Helianthus petiolaris]
gi|226444566|gb|ACO57857.1| unknown [Helianthus petiolaris]
gi|226444570|gb|ACO57859.1| unknown [Helianthus argophyllus]
gi|226444572|gb|ACO57860.1| unknown [Helianthus argophyllus]
gi|226444576|gb|ACO57862.1| unknown [Helianthus argophyllus]
gi|226444578|gb|ACO57863.1| unknown [Helianthus argophyllus]
gi|226444580|gb|ACO57864.1| unknown [Helianthus argophyllus]
gi|226444582|gb|ACO57865.1| unknown [Helianthus argophyllus]
gi|226444584|gb|ACO57866.1| unknown [Helianthus argophyllus]
gi|226444586|gb|ACO57867.1| unknown [Helianthus argophyllus]
gi|226444588|gb|ACO57868.1| unknown [Helianthus argophyllus]
gi|226444590|gb|ACO57869.1| unknown [Helianthus argophyllus]
gi|226444592|gb|ACO57870.1| unknown [Helianthus argophyllus]
Length = 103
Score = 134 bits (338), Expect = 2e-28, Method: Compositional matrix adjust.
Identities = 60/103 (58%), Positives = 76/103 (73%)
Query: 756 KVKGGMCGAMNGMQPDGRIDMSGLQAREIWPGVTYGLAASMIQEEMVDMAFQTAAGVYEV 815
KV+GG GA+NGM P+G++D + +Q+RE+W GVTYG+AA+MI M D AF TA G++
Sbjct: 1 KVRGGRMGAVNGMHPNGKVDETCMQSREVWTGVTYGVAATMIHAGMEDNAFTTAEGIFIA 60
Query: 816 AWSEDGLGYSFQTPESWNNNDEYRSLCYMRPLTIWAMQWALTK 858
WSE+G GY+FQTPE W + YRSL YMRPL IW MQ AL K
Sbjct: 61 GWSEEGFGYAFQTPEGWTMDGSYRSLVYMRPLAIWGMQQALEK 103
>gi|226444524|gb|ACO57836.1| unknown [Helianthus annuus]
Length = 103
Score = 133 bits (334), Expect = 6e-28, Method: Compositional matrix adjust.
Identities = 59/101 (58%), Positives = 75/101 (74%)
Query: 756 KVKGGMCGAMNGMQPDGRIDMSGLQAREIWPGVTYGLAASMIQEEMVDMAFQTAAGVYEV 815
KV+GG GA+NGM P+G++D + +Q+RE+W GVTYG+AA+MI M D AF TA G++
Sbjct: 1 KVRGGRMGAVNGMHPNGKVDETCMQSREVWTGVTYGVAATMIHAGMEDNAFTTAEGIFIA 60
Query: 816 AWSEDGLGYSFQTPESWNNNDEYRSLCYMRPLTIWAMQWAL 856
WSE+G GY+FQTPE W + YRSL YMRPL IW MQ AL
Sbjct: 61 GWSEEGFGYAFQTPEGWTMDGSYRSLVYMRPLAIWGMQQAL 101
>gi|226444574|gb|ACO57861.1| unknown [Helianthus argophyllus]
Length = 103
Score = 132 bits (331), Expect = 1e-27, Method: Compositional matrix adjust.
Identities = 59/103 (57%), Positives = 75/103 (72%)
Query: 756 KVKGGMCGAMNGMQPDGRIDMSGLQAREIWPGVTYGLAASMIQEEMVDMAFQTAAGVYEV 815
KV+GG GA+NGM P+G++D + +Q+RE+W GVTYG+AA+MI M D AF TA G++
Sbjct: 1 KVRGGRMGAVNGMHPNGKVDETCMQSREVWTGVTYGVAATMIHAGMEDNAFTTAEGIFIA 60
Query: 816 AWSEDGLGYSFQTPESWNNNDEYRSLCYMRPLTIWAMQWALTK 858
WSE+G GY+FQTPE W + YRSL YMRPL IW Q AL K
Sbjct: 61 GWSEEGFGYAFQTPEGWTMDGSYRSLVYMRPLAIWGKQQALEK 103
>gi|47196687|emb|CAF89164.1| unnamed protein product [Tetraodon nigroviridis]
Length = 239
Score = 131 bits (329), Expect = 2e-27, Method: Compositional matrix adjust.
Identities = 69/158 (43%), Positives = 89/158 (56%), Gaps = 24/158 (15%)
Query: 496 YLEGAEYVMYNTYDVHFYSSFALVMLFPKLELSIQRDFAAAVMMHDPGTMKIMSDGKWVA 555
+ G EY MYNTYDVH+Y+SFAL+ML+PKL LS+Q D A +V+ DP + +G+
Sbjct: 1 FFPGQEYRMYNTYDVHYYASFALIMLWPKLALSLQYDIAGSVVQQDPTERLYLMNGRCSP 60
Query: 556 RKCLGAVPHDIG--LDDPWFEINSYNLFNSSRWKDLNSKFVLQVYRDFVATGDKNFARAV 613
K G VPHDIG D+PW +VYRDF T D + R +
Sbjct: 61 VKTKGVVPHDIGDPEDEPW----------------------QRVYRDFHLTQDAQYLRDM 98
Query: 614 WPSVYIAMAYMEQFDKDGDGMIENEGFPDQTYDAWSAN 651
WP + QFD DGDG+IEN G+ DQTYD W+A+
Sbjct: 99 WPICEMVTESELQFDLDGDGLIENSGYADQTYDGWAAS 136
Score = 70.5 bits (171), Expect = 4e-09, Method: Compositional matrix adjust.
Identities = 31/59 (52%), Positives = 42/59 (71%)
Query: 741 KVKKALTKIYDFNVLKVKGGMCGAMNGMQPDGRIDMSGLQAREIWPGVTYGLAASMIQE 799
K++ AL I+D NV+ GG GA+NGM+P+G D S +Q+ E+W GV YGLAA+MI E
Sbjct: 180 KIQTALKSIFDLNVMSFAGGQMGAVNGMRPEGVPDRSSVQSDEVWIGVVYGLAATMIHE 238
>gi|261417086|ref|YP_003250769.1| glucosylceramidase [Fibrobacter succinogenes subsp. succinogenes
S85]
gi|385791890|ref|YP_005823013.1| hypothetical protein FSU_3272 [Fibrobacter succinogenes subsp.
succinogenes S85]
gi|261373542|gb|ACX76287.1| Glucosylceramidase [Fibrobacter succinogenes subsp. succinogenes
S85]
gi|302325406|gb|ADL24607.1| conserved domain protein [Fibrobacter succinogenes subsp.
succinogenes S85]
Length = 1047
Score = 130 bits (327), Expect = 3e-27, Method: Compositional matrix adjust.
Identities = 147/587 (25%), Positives = 243/587 (41%), Gaps = 120/587 (20%)
Query: 173 YHALFPRAWTVYDGEPDPELRIVCRQISPFIPHNYKESSFPASVFTFTLSNSGQTSADVT 232
Y AL+P + TVY+G+ + I Q S P + + SS P + FTL N+ + + +VT
Sbjct: 227 YTALYPVSETVYEGKG---VAITKTQSSYVTPGDERLSSLPVNATVFTLENNTKETREVT 283
Query: 233 LLFTWANSVAG-----------DSGL---------SGHHFNSKTMTKDGVHGLTLHHRTA 272
++ +S+ G DS G F+ + V G+ ++ A
Sbjct: 284 IV-QIQDSITGYMAKKDRQGVQDSSFVLVPAARFPKGVQFDKELEDGRSVRGIEFYNEKA 342
Query: 273 ------NGRPPVTFAVAAEETADVHVSECPCF------------LLSGNSKGITAKDMWN 314
NG V+ VA + +++VS P F L SG G K++++
Sbjct: 343 LAESDFNGCMGVS--VAWNKKDNLNVSVKPMFYQDDAKSVLKAALQSGRVAGSWVKNVYS 400
Query: 315 EIKKHGSFDHLDNDKTSPSEPGSSIGAAIAASLTIPSGSTRSVTFSLAWDCPEVKFFEKV 374
+I AIA + + S F++ D PE+K +
Sbjct: 401 --------------------GRETIAGAIAVTAVLKPKQKVSFQFNMVLDFPEIKLNKLT 440
Query: 375 YHRRYTKFYGTLGDSAARIAHDAILEHAKWECEIEAWQRPILEDKRFPEWYPITLFNELY 434
++YT FY + +A+ ++ ++A++ + P +LY
Sbjct: 441 SAKKYTAFYPEAYGRVVALLTEALAADKTFDDRLKAFENLV----------PKKPVAKLY 490
Query: 435 YLNAGGTIWTDGSPPMQSLATIRERKFSLDTPSKIHTPASSDTALGTRLLENGEENIGQF 494
A +SLA ++T S + D +F
Sbjct: 491 KTAAK-------QAEFKSLA--------INTLSFLAEATVWDKE-------------DRF 522
Query: 495 LYLEGAEYVMYNTYDVHFYSSFALVMLFPKLELSIQRDFAAAVMMHDPGTMK-------- 546
L E A+Y +N+ DV+FY SF+L+ L P+L+ + + F A++ + +
Sbjct: 523 LVRECADYPFFNSLDVYFYGSFSLLALMPRLDGVVMKRFGDAILAVNNNRRRHHEFVNLP 582
Query: 547 --IMSDGKWVA-RKCLGAVPHDIGLDDPW-FEINSYNLFNSSRWKDLNSKFVLQVYRDFV 602
+ D K R GAV HD+G P+ E ++Y+ N WKDL K+VL VYR +
Sbjct: 583 YADLPDPKLEGPRAVRGAVIHDLG--SPFDAEPDAYDWHNVKEWKDLAPKYVLMVYRHYH 640
Query: 603 ATGDKNFARAVWPSVYIAMAYMEQFDKDGDGMIENEGFPDQTYDAWSANGVSAYCGGLWV 662
T D +VY AM Y+E+ G+ G D T+D + G+S YCG LW+
Sbjct: 641 KTKDMQCLADCKEAVYAAMEYLEKMVNPGENFPLTHG-TDDTFDNLCSYGISVYCGSLWI 699
Query: 663 AALQAASALANDVGDHASASYFWVRYQKA-KAVYDSLW--NGSYFNY 706
A L+AA+ +A +GD+ A+ + + + A K +SLW N YF +
Sbjct: 700 AGLRAAAKIAELLGDNEQAAKWNEKSEAANKEFTESLWDENEGYFRF 746
Score = 50.1 bits (118), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 38/138 (27%), Positives = 61/138 (44%), Gaps = 18/138 (13%)
Query: 719 ADQLAGQWYARACGLLPIADEAKVKKALTKIYDFNVLKVKGGMCGAMNGMQPDGR-IDMS 777
AD + Y R L PI D AK K L +I++ N K + GA N ++ DG +D
Sbjct: 838 ADTMLADTYLRLLDLEPITDSAKAKSNLLRIFNTN-YKANSPLIGAANLVRKDGSPLDEF 896
Query: 778 GLQAREIWPGVTYGLAASM----IQEEMVDMAFQTAAGVYEVAWSEDGLGYSFQTPESWN 833
QA ++W G+ Y + +M +++E +A +Y+ A F PE +N
Sbjct: 897 NFQAHDVWIGIQYSIMTAMMFHGLEKEASVLADSMIGNLYDEA------RVPFAAPEGFN 950
Query: 834 NNDEYRSLCYMRPLTIWA 851
+ C + P + A
Sbjct: 951 GS------CRLHPEALVA 962
>gi|159042199|ref|YP_001541451.1| hypothetical protein Cmaq_1638 [Caldivirga maquilingensis IC-167]
gi|157921034|gb|ABW02461.1| protein of unknown function DUF608 [Caldivirga maquilingensis
IC-167]
Length = 661
Score = 130 bits (326), Expect = 4e-27, Method: Compositional matrix adjust.
Identities = 93/298 (31%), Positives = 140/298 (46%), Gaps = 32/298 (10%)
Query: 515 SFALVMLFPKLELSIQRDFAAAVMMHDPGTMKIMSDGKWVARKCLGAVPHDIG---LDDP 571
SFAL+ LFP +L+++ D +H+ G VPHD+G ++DP
Sbjct: 339 SFALLDLFP--DLTVKMDELLGFYIHN------------------GEVPHDLGEESIEDP 378
Query: 572 WFEINSYNLFNSSRWKDLNSKFVLQVYRDFVATGDKNFARAVWPSVYIAMAYMEQFDKDG 631
Y W DL S ++L +YRD++ TG+ R + + ++ D DG
Sbjct: 379 -----IYGASYLYPWNDLGSTWILMIYRDYLLTGNVEVLRRNIDKMREVIDWLISRDYDG 433
Query: 632 DGMIENEGFPDQTYDAWSANGVSAYCGGLWVAALQAASALANDVGDHASASYFWVRYQKA 691
D + ++ G D +YD + G S+Y L++ +LQA A +G S Y K
Sbjct: 434 DCIPDSRGGFDNSYDGTNMYGASSYIASLFLCSLQAFIKSAEVLGVRLSDRYESC-LSKG 492
Query: 692 KAVYDSLWNGSYFNYDNSDGSSSTSIQADQLAGQWYARACGLLPIADEAKVKKALTKIYD 751
+ +SLWNG YF S G+S+ S QL GQ++ L P+ DE K+K AL IY+
Sbjct: 493 RETLNSLWNGRYFMAWKSSGNSNESCMNSQLLGQFWCDFLKLPPVVDEDKIKVALRSIYE 552
Query: 752 FNVLKVKGGMCGAMNGMQPDGRIDMSGLQAREIWPGVTYGLAASMIQEEMVDMAFQTA 809
N K N ++P G ID S Q R WP V++ + A M+ MV+ + A
Sbjct: 553 LN---HKSSPHCLPNSVKPSGEIDTSSGQMRSCWPRVSFVVTAHMVLRGMVNEGLEIA 607
>gi|148976285|ref|ZP_01813009.1| hypothetical protein VSWAT3_18783 [Vibrionales bacterium SWAT-3]
gi|145964379|gb|EDK29634.1| hypothetical protein VSWAT3_18783 [Vibrionales bacterium SWAT-3]
Length = 982
Score = 129 bits (324), Expect = 8e-27, Method: Compositional matrix adjust.
Identities = 80/250 (32%), Positives = 132/250 (52%), Gaps = 24/250 (9%)
Query: 493 QFLYLEGAEYVMYNTYDVHFYSSFALVMLFPKLELSIQRDFAAAVMMHD----------- 541
+FL E +Y +N+ DV+FY SF+L+ L P+L+ + ++F+ A++ D
Sbjct: 467 KFLVKECVDYPFFNSLDVYFYGSFSLLYLLPELDGCVMKEFSKAILAEDFTQRRYWEYEA 526
Query: 542 PGTMKIMSDGKWVARKCLGAVPHDIGLDDPW-FEINSYNLFNSSRWKDLNSKFVLQVYRD 600
+++ + R GAV HD+G P+ + ++Y+ N WKDL K++L VYR
Sbjct: 527 TPNAELIDEKYQGVRAIRGAVIHDLG--SPFDIQPDAYSWHNVKEWKDLAPKYILMVYRH 584
Query: 601 FVATGDKNFARAVWPSVYIAMAYMEQFDKDGDGMIENEGFPDQTYDAWSANGVSAYCGGL 660
+ T D + + W +V ++ ++ DGD + G D T+D +++G+S YC L
Sbjct: 585 YQNTQDMSVVKECWQAVTESIDFLSNLIADGDDLPLTRG-TDDTFDNLASHGISIYCASL 643
Query: 661 WVAALQAASALANDVGDHASASYFWVRYQKAKA-VYDSLWNGS--YFNYDNSDGSSSTSI 717
WVA LQAAS LA +G+H + + R +KA A V SLW+ Y+++ T +
Sbjct: 644 WVAGLQAASELAKLMGEHDLGAGYLTRSKKALATVEQSLWDEKEGYYHF------FVTPV 697
Query: 718 QADQLAGQWY 727
QA L G+ Y
Sbjct: 698 QAKHLTGEGY 707
>gi|332796021|ref|YP_004457521.1| hypothetical protein Ahos_0332 [Acidianus hospitalis W1]
gi|332693756|gb|AEE93223.1| conserved hypothetical protein [Acidianus hospitalis W1]
Length = 652
Score = 129 bits (323), Expect = 1e-26, Method: Compositional matrix adjust.
Identities = 97/349 (27%), Positives = 166/349 (47%), Gaps = 43/349 (12%)
Query: 515 SFALVMLFPKLELSIQRDFAAAVMMHDPGTMKIMSDGKWVARKCLGAVPHDIGLDDPWFE 574
SFAL+ FP+L + F ++ G VPHD+G + E
Sbjct: 334 SFALLEYFPELVKKMDEHFGNFIID--------------------GEVPHDLGEES--IE 371
Query: 575 INSYNLFNSSRWKDLNSKFVLQVYRDFVATGDKNFARAVWPSVYIAMAYMEQFDKDGDGM 634
Y W DL ++L +YRD+ T D +F + + + + ++ + D+D DG+
Sbjct: 372 NPIYGASYPYSWNDLGPTWILMIYRDYKFTNDLSFLKRNYNKMKEVIDWLIKKDEDNDGI 431
Query: 635 IENEGFPDQTYDAWSANGVSAYCGGLWVAALQAASALANDVGDHASASYFWVRYQ----K 690
+++G D +YD G S+Y G L++ AL+A + SY + +Y+ K
Sbjct: 432 PDSKGGFDNSYDGTYMYGTSSYIGSLFLCALKAF------IESSKILSYDYSKYEEILNK 485
Query: 691 AKAVYDSLWNGSYF-NYDNSDGSSSTSIQADQLAGQWYARACGLLPIADEAKVKKALTKI 749
AK+ +SLWNG YF N+ D + + + + QL G+++ GL + DE K+K AL I
Sbjct: 486 AKSSLESLWNGRYFINWKYKDQKNESCLNS-QLLGEFWCNLLGLGNVIDEDKIKTALKYI 544
Query: 750 YDFNVLKVKGGMCGAMNGMQPDGRIDMSGLQAREIWPGVTYGLAASMIQEEMVDMAFQTA 809
Y+ N K +N + PDG ID S Q + WP +++ +A+ MI + M++ + A
Sbjct: 545 YEHN---GKASKYCLVNSVNPDGSIDESTDQMKSCWPRISFAIASHMIMKGMINEGIEIA 601
Query: 810 AGVYEVAWSEDGLGYSFQTPESWN--NNDEYRSLCYMRPLTIWAMQWAL 856
+E S Y + P N N + + Y+ L+I+ +++A+
Sbjct: 602 RKEWETISSR----YPYNQPSKINAFNGEHFGLPYYIGSLSIYLVKYAI 646
>gi|156333782|ref|XP_001619412.1| hypothetical protein NEMVEDRAFT_v1g224207 [Nematostella vectensis]
gi|156202556|gb|EDO27312.1| predicted protein [Nematostella vectensis]
Length = 173
Score = 128 bits (322), Expect = 1e-26, Method: Compositional matrix adjust.
Identities = 62/108 (57%), Positives = 78/108 (72%), Gaps = 2/108 (1%)
Query: 757 VKGGMCGAMNGMQPDGRIDMSGLQAREIWPGVTYGLAASMIQEEMVDMAFQTAAGVYEVA 816
K G GA+NG++PDG++D S LQA E+W GVTY +AASMIQE +VD F+TA+G+Y
Sbjct: 49 TKEGTMGAVNGIRPDGQLDTSSLQAEEVWTGVTYAVAASMIQEGLVDEGFKTASGIYNTC 108
Query: 817 WSEDGLGYSFQTPESWNNNDEYRSLCYMRPLTIWAMQWALTKPKLSRQ 864
+ LG +FQTPE+ N YRSL YMRPL+IWAMQWAL K K R+
Sbjct: 109 FER--LGMNFQTPEAIVANGNYRSLAYMRPLSIWAMQWALEKRKNKRE 154
Score = 61.2 bits (147), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 24/45 (53%), Positives = 35/45 (77%)
Query: 571 PWFEINSYNLFNSSRWKDLNSKFVLQVYRDFVATGDKNFARAVWP 615
PW +N+Y++ ++S+WKDLN KFVLQVYRD+V T D + + +WP
Sbjct: 3 PWDHVNAYHIHDTSKWKDLNLKFVLQVYRDYVFTNDVYYLQDMWP 47
>gi|417950561|ref|ZP_12593680.1| hypothetical protein VISP3789_13285 [Vibrio splendidus ATCC 33789]
gi|342806239|gb|EGU41472.1| hypothetical protein VISP3789_13285 [Vibrio splendidus ATCC 33789]
Length = 1046
Score = 128 bits (321), Expect = 2e-26, Method: Compositional matrix adjust.
Identities = 87/274 (31%), Positives = 142/274 (51%), Gaps = 25/274 (9%)
Query: 493 QFLYLEGAEYVMYNTYDVHFYSSFALVMLFPKLELSIQRDFAAAVMMHDPGTMKI----- 547
+FL E +Y +N+ DV+FY SF+L+ L P+L+ + ++F+ A++ D +
Sbjct: 531 KFLVKECVDYPFFNSLDVYFYGSFSLLYLLPELDGCVMKEFSKAILAEDFTQRRYWEYEA 590
Query: 548 -----MSDGKWV-ARKCLGAVPHDIGLDDPW-FEINSYNLFNSSRWKDLNSKFVLQVYRD 600
+ D K+ R GAV HD+G P+ + ++Y+ N WKDL K++L VYR
Sbjct: 591 TPNAELIDEKYQGVRAIRGAVIHDLG--SPFDIQPDAYSWHNVKEWKDLAPKYILMVYRH 648
Query: 601 FVATGDKNFARAVWPSVYIAMAYMEQFDKDGDGMIENEGFPDQTYDAWSANGVSAYCGGL 660
+ T D + + W +V ++ ++ +GD + G D T+D +++G+S YC L
Sbjct: 649 YQKTQDISVVKECWQAVTESIDFLSNLIAEGDDLPLTRG-TDDTFDNLASHGISIYCASL 707
Query: 661 WVAALQAASALANDVGDHASASYFWVRYQKAKA-VYDSLWNGS--YFNYDNSDGSSSTSI 717
WVA LQAAS LA +G+H + + R +KA A V SLW+ Y+++ T +
Sbjct: 708 WVAGLQAASELAKLMGEHDLGAGYLTRSKKALATVEQSLWDEKEGYYHF------FVTPV 761
Query: 718 QADQLAGQWYARACGL-LPIADEAKVKKALTKIY 750
QA L G+ Y L L + +A V K + Y
Sbjct: 762 QAKHLTGEGYQALETLGLVLTGDAIVDKNMLNDY 795
>gi|407069309|ref|ZP_11100147.1| hypothetical protein VcycZ_07137 [Vibrio cyclitrophicus ZF14]
Length = 1024
Score = 127 bits (320), Expect = 2e-26, Method: Compositional matrix adjust.
Identities = 145/613 (23%), Positives = 248/613 (40%), Gaps = 119/613 (19%)
Query: 165 NLKGENCTYHALFPRAWTVYDGEPDPELRIVCRQISPFIPHNYKESSFPASVFTFTLSNS 224
++ G + Y AL+P A Y D + I + +SP + N + S P F L+N+
Sbjct: 206 SVNGRDIEYQALYPLAEYKYSSFDD--INIKRKVVSPIVKENKRLCSLPMHWNHFELTNN 263
Query: 225 GQTSADVTLLFTWANSVAG--DSGLSGHHFNSKTMTKDGVHGLTLHHRTAN--------- 273
+ +TL N + G G ++ T++++ + H+T N
Sbjct: 264 SSQTRVITLAQPLQNLIGSTYQKGRDGIQDSACTLSQNPI---AQQHQTVNLKGESHSFT 320
Query: 274 -----GRPP--------VTFAVAAE----ETADVHVSECPCFLLSGNSKGITAKDMWNEI 316
+ P V F V A+ E+ V VS P F S +K +
Sbjct: 321 GVQLSSQSPYQSDIEGEVVFGVQADNRLTESGKVSVSVKPTFYTSKTTK--QTEFALKTG 378
Query: 317 KKHGSFDHLDNDKTSPSEPGSSIGAAIAASLTIPSGSTRSVTFSLAWDCPEVKFFEKVYH 376
+ + F +T ++ A + + + +G + + F+ +V
Sbjct: 379 RTNTEF------QTGIYTGREALSALVVVQVELEAGESVDLRFAQVMAHSKVMLNGWHSE 432
Query: 377 RRYTKFYGTLGDSAARIAHDAILEHAKWECEIEAWQRPILEDKRF----PE---WYPITL 429
+ YT+FY A + D + E E +I Q LE + P+ Y
Sbjct: 433 KAYTQFYPQ-AKPALLMLEDVLPELEAIEQQIVKQQTAFLEQAQSKISQPDSALRYATMA 491
Query: 430 FNELYYLNAGGTIWTDGSPPMQSLATIRERKFSLDTPSKIHTPASSDTALGTRLLENGEE 489
N L +L A T+W +E KF
Sbjct: 492 MNSLSFL-AESTVWD------------KEDKF---------------------------- 510
Query: 490 NIGQFLYLEGAEYVMYNTYDVHFYSSFALVMLFPKLELSIQRDFAAAVMMHD-------- 541
L E +Y +N+ DV+FY SF+L+ L P+L+ + ++F+ A++ D
Sbjct: 511 -----LVKECVDYPFFNSLDVYFYGSFSLLYLLPELDGCVMKEFSKAILAEDFTQRRYWE 565
Query: 542 ---PGTMKIMSDGKWVARKCLGAVPHDIGLDDPW-FEINSYNLFNSSRWKDLNSKFVLQV 597
+++ + R GAV HD+G P+ + ++Y+ N WKDL K++L V
Sbjct: 566 YEATPNAELIDEKYQGVRAIRGAVIHDLG--SPFDIQPDAYSWHNVKEWKDLAPKYILMV 623
Query: 598 YRDFVATGDKNFARAVWPSVYIAMAYMEQFDKDGDGMIENEGFPDQTYDAWSANGVSAYC 657
YR + T D + + W +V ++ ++ +GD + G D T+D +++G+S YC
Sbjct: 624 YRHYQNTQDLSVVKECWQAVTESIEFLSNLVAEGDDLPLTRG-TDDTFDNLASHGISIYC 682
Query: 658 GGLWVAALQAASALANDVGDHASASYFWVRYQKAKA-VYDSLWNGS--YFNYDNSDGSSS 714
LWVA LQAA+ LA + ++ A+ + R +KA A V SLW+ Y+++
Sbjct: 683 ASLWVAGLQAANELALLMNENELAADYLARSKKALATVEQSLWDDKEGYYHF------FV 736
Query: 715 TSIQADQLAGQWY 727
T +QA L GQ Y
Sbjct: 737 TPVQAKHLTGQGY 749
>gi|422908697|ref|ZP_16943374.1| hypothetical protein VCHE09_0171 [Vibrio cholerae HE-09]
gi|341639159|gb|EGS63785.1| hypothetical protein VCHE09_0171 [Vibrio cholerae HE-09]
Length = 981
Score = 127 bits (320), Expect = 2e-26, Method: Compositional matrix adjust.
Identities = 164/663 (24%), Positives = 264/663 (39%), Gaps = 124/663 (18%)
Query: 114 HGICDDAPVLANQFSVFVSRPNGEKFSSVLCPRSPGVPKKNTDSGIESWDWNLKGENCTY 173
I +D L Q V + NG ++ R+ + N S +ES D + Y
Sbjct: 118 QAITNDPSSLETQLLVLIDFFNGLLVNN--SARALSL-TANNGSSVESID----PSDIEY 170
Query: 174 HALFPRAWTVYDGEPDPELRIVCRQ-ISPFIPHNYKESSFPASVFTFTLSNSGQTSADVT 232
AL+P A Y G E + RQ ISP + ++ S P F +SN + VT
Sbjct: 171 RALYPIAEYEYQG---LESVALTRQVISPIVKNDPMLCSLPMHWNHFEMSNRSDKTQLVT 227
Query: 233 LLFTWANSVAG-----------------DSGLSGHHFNSKTMTKDGVHGLTLHHRTANGR 275
L+ N + + +S H + +T G G H +
Sbjct: 228 LVQPLRNLIGSTYKKAREGVQDSCCHLVQNAISQQH---QPVTISGA-GFEFHGVSMTSN 283
Query: 276 PPVTFAVAAEETADVHVSECPCFLLSGNSKGITA------------KDMWNEIKKHGSFD 323
P ++ E VH G ++ +T + + +K +
Sbjct: 284 SPYDGDISGEVLYGVHTQ-----FEGGAAEQVTVTVKPQVYCSKADQQVERALKTGRTNA 338
Query: 324 HLDNDKTSPSEPGSSIGAAIAASLTIPSGSTRSVTFSLAWDCPEVKF----FEKVYHRRY 379
H D S EP + A + +T+ G T ++ F+ D ++ EK Y R
Sbjct: 339 HFDRGIYSGREP---LSALVCVQVTLKPGETIALRFAQVMDHSKILLDGWESEKAYAR-- 393
Query: 380 TKFYGTLGDSAARIAHDAILEHAKWECEIEAWQRPILEDKRFPEWYPITLFNELYYLNAG 439
Y A I + + ++ E I + QR L+ NA
Sbjct: 394 ---YFDAQSRAQSILNQTLPRLSEIEQNIISQQREFLD-------------------NAS 431
Query: 440 GTIWTDGSPPMQSLATIRERKFSLDTPSKIHTPASSDTALGTRLLENGEENIGQFLYLEG 499
T + D ++ AT+ S S + +EN+ L E
Sbjct: 432 QT-FVDNDSALR-FATMAMNSLSFLAESTVW----------------DKENV--LLVKEC 471
Query: 500 AEYVMYNTYDVHFYSSFALVMLFPKLELSIQRDFAAAVMMHDPGTMKI----------MS 549
+Y +N+ DV+FY SF+L+ L P+L+ + + FA A++ D + +
Sbjct: 472 VDYPFFNSLDVYFYGSFSLLYLLPELDGCVMKHFAKAILNSDEKLRRYWEYEDKPYAELV 531
Query: 550 DGKWV-ARKCLGAVPHDIGLDDPW-FEINSYNLFNSSRWKDLNSKFVLQVYRDFVATGDK 607
D K+ +R GAV HD+G P+ + ++Y+ N WKDL KF+L VYR + TGD+
Sbjct: 532 DEKYQGSRAERGAVIHDLG--SPFDIDPDAYSWHNVKEWKDLAPKFILMVYRHWQKTGDR 589
Query: 608 NFARAVWPSVYIAMAYMEQFDKDGDGMIENEGFPDQTYDAWSANGVSAYCGGLWVAALQA 667
+ W SV ++ Y+ +DGD + G D T+D +++GVS YCG LWVA L+A
Sbjct: 590 EVVESCWSSVKESIDYLINLIEDGDTLPLTRG-TDDTFDNLASHGVSIYCGSLWVAGLRA 648
Query: 668 ASALANDVGDHASASYFWVRYQKA-KAVYDSLWN--GSYFNYDNSDGSSSTSIQADQLAG 724
AS LA VG+ A + R + A + + LW+ Y+++ T +QA L G
Sbjct: 649 ASELALLVGEEELAQGYTQRSKDALETLERGLWDEQKGYYHF------FITPVQAKHLTG 702
Query: 725 QWY 727
+
Sbjct: 703 NGF 705
>gi|15921102|ref|NP_376771.1| hypothetical protein ST0868 [Sulfolobus tokodaii str. 7]
gi|15621887|dbj|BAB65880.1| putative glycosidase [Sulfolobus tokodaii str. 7]
Length = 683
Score = 127 bits (319), Expect = 3e-26, Method: Compositional matrix adjust.
Identities = 176/733 (24%), Positives = 284/733 (38%), Gaps = 128/733 (17%)
Query: 86 DQGIPLGGIGAGSIGRSYRGEFQRFKLFHGICDDAPVLANQFSVFVSRPNGEKFSSVLCP 145
+ G+PLGGIGAG I S +G + + + F VF+ + E F
Sbjct: 10 EAGVPLGGIGAGKIEISNKGRMINLTIANNWSFPIKEMLG-FHVFILPNDSEPFF----- 63
Query: 146 RSPGVPKKNTDSGIESWDWNLKGENCTYHALFPRAWTVYDGEPDPELRIVCRQISPFIPH 205
S + D N Y +P A + G + ++ S IP
Sbjct: 64 ---------LQSDLIFLDLNKLAVPLEYEGKYPFA--MIKGVKN-SVKAELEAFSALIPE 111
Query: 206 NYKESSFPASVFTFTLSNSGQTSADVTLLFTWANSVAGDSGLSGHHFNSKTMTKDGVHGL 265
N +SS PA + +S S +++ ++ G S + ++ K +G+
Sbjct: 112 NLYDSSLPAVGISVKVSGSKSGLIAISM-----GNITGISKIGRYNEGLK-------NGI 159
Query: 266 TLHHRTANGRPPVTF-AVAAEETADVHVSECPCFLLSGNSKGITAKDMWNEIKKHGSFDH 324
+ + AN P V E+ V + + G K + + I+ +
Sbjct: 160 IMKNSKANDLDPYNGETVLVTESPKAIVKQYNFHVNRGLEKTLNLHRL---IENEKPWMD 216
Query: 325 LDNDKTSPSEPGSSIGAAIAASLTIPSG-------STRSVTFSLAWDCPEVKFFEK--VY 375
L + K SE G S G S +P+G V F +W FF K VY
Sbjct: 217 LMSGKIPESEDGESTG-----SYYLPAGMIIDEYEKNEEVKFVFSW------FFNKPWVY 265
Query: 376 HRRYTKFYGTLGDSAARIAHDAILEHAKWECEIEAWQRPILEDKRFPEWYPITLFNELYY 435
+ Y +Y SA +A + + + + W ++ D P+W + N Y
Sbjct: 266 YP-YKHYYSNFFSSAKEVASYFLENFDRLKEKTRRWHEDLI-DPSLPDWLSDAIINSTYI 323
Query: 436 LNAGGTIWTDGSPPMQSLATIRERKFSLDTPSKIHTPASSDTALGTRLLENGEENIGQFL 495
L++ + W D + +F + T +GT L IG
Sbjct: 324 LSS--STWLD-----------EKGRFGI----------MEGTQVGTML-----STIGGVC 355
Query: 496 YLEGAEYVMYNTYDVHFYSSFALVMLFPKLELSIQRDFAAAVMMHDPGTMKIMSDGKWVA 555
Y G S +V++FP LE S F M D
Sbjct: 356 YETG---------------SLPVVLMFPMLEKSTIEQFILN-MRED-------------- 385
Query: 556 RKCLGAVPHDIGLDDPWFEINSYNLFNSSRWKDLNSKFVLQVYRDFVATGDKNFARAVWP 615
G +PHD+G F+ S +WKD N+ FVL VYR ++ T DK F ++V+P
Sbjct: 386 ----GYIPHDLGTYS--FDAPSDGTTAPPKWKDTNTTFVLMVYRYYLRTKDKEFLKSVYP 439
Query: 616 SVYIAMAYMEQFDKDGDGMIENEGFPDQTYDAWSANGVSAYCGGLWVAALQAASALANDV 675
V AM+++ DKDGDG+ E +G DQ +D GV +Y +++AAL+A +A V
Sbjct: 440 YVKKAMSWIISKDKDGDGLPEVDGSTDQGFDCVPIEGVCSYISTVYIAALEAMIKIAEIV 499
Query: 676 GDHASASYFWVRYQKAKAVYDSLWNGSYFNYDNSDGSSSTSIQADQLAGQWYARACGLLP 735
GD S + K+ + L++G F + ++ + Q+ GQW+A L
Sbjct: 500 GDSTSYYSSLLSKAKSSLM--KLFDGKKFIPWTGKPNHHKAVFSAQIFGQWWAYLLDLDV 557
Query: 736 IADEAKVKKALTKIYDFNVLKVKGGMCGAMNGMQPDGRIDMSGLQAREIWPGVTYGLAA- 794
+AD + + AL +IY N K C + + D+ Q WP + + L+A
Sbjct: 558 VADRSAILSALDEIYRVNGHASK--YCTPNMAKEGESLDDLDS-QLTSSWPRLVFSLSAL 614
Query: 795 --SMIQEEMVDMA 805
S+ + E +D+A
Sbjct: 615 GYSLGKREWLDIA 627
>gi|84386772|ref|ZP_00989797.1| hypothetical protein V12B01_19101 [Vibrio splendidus 12B01]
gi|84378300|gb|EAP95158.1| hypothetical protein V12B01_19101 [Vibrio splendidus 12B01]
Length = 908
Score = 126 bits (317), Expect = 5e-26, Method: Compositional matrix adjust.
Identities = 80/250 (32%), Positives = 130/250 (52%), Gaps = 24/250 (9%)
Query: 493 QFLYLEGAEYVMYNTYDVHFYSSFALVMLFPKLELSIQRDFAAAVMMHD----------- 541
+FL E +Y +N+ DV+FY SF L+ L P+L+ + ++F+ A++ D
Sbjct: 393 KFLVKECVDYPFFNSLDVYFYGSFTLLYLLPELDGCVMKEFSKAILAEDFTERRYWEYEA 452
Query: 542 PGTMKIMSDGKWVARKCLGAVPHDIGLDDPW-FEINSYNLFNSSRWKDLNSKFVLQVYRD 600
+++ D R GAV HD+G P+ + ++Y+ N WKDL K++L VYR
Sbjct: 453 TPNAELIDDKYQGVRAIRGAVIHDLG--SPFDIQPDAYSWHNVKEWKDLAPKYILMVYRH 510
Query: 601 FVATGDKNFARAVWPSVYIAMAYMEQFDKDGDGMIENEGFPDQTYDAWSANGVSAYCGGL 660
+ T D + + W +V ++ ++ +GD + G D T+D +++G+S YC L
Sbjct: 511 YQNTQDISVVKECWQAVTESIDFLSNLIAEGDDLPLTRG-TDDTFDNLASHGISIYCASL 569
Query: 661 WVAALQAASALANDVGDHASASYFWVRYQKAKA-VYDSLWNGS--YFNYDNSDGSSSTSI 717
WVA LQAAS LA +G+ A+ + R +KA A V SLW+ Y+++ T +
Sbjct: 570 WVAGLQAASELAQLMGESERANGYLTRSKKALATVEQSLWDDKEGYYHF------FVTPV 623
Query: 718 QADQLAGQWY 727
QA L G Y
Sbjct: 624 QAKHLTGAGY 633
>gi|218677125|ref|YP_002395944.1| hypothetical protein VS_II1384 [Vibrio splendidus LGP32]
gi|218325393|emb|CAV27494.1| hypothetical protein VS_II1384 [Vibrio splendidus LGP32]
Length = 1069
Score = 126 bits (317), Expect = 6e-26, Method: Compositional matrix adjust.
Identities = 84/274 (30%), Positives = 139/274 (50%), Gaps = 25/274 (9%)
Query: 493 QFLYLEGAEYVMYNTYDVHFYSSFALVMLFPKLELSIQRDFAAAVMMHD----------- 541
+FL E +Y +N+ DV+FY SF+L+ L P+L+ + ++F+ A++ D
Sbjct: 554 KFLVKECVDYPFFNSLDVYFYGSFSLLYLLPELDGCVMKEFSKAILAEDFTQRRYWEYEA 613
Query: 542 PGTMKIMSDGKWVARKCLGAVPHDIGLDDPW-FEINSYNLFNSSRWKDLNSKFVLQVYRD 600
+++ D R GAV HD+G P+ + ++Y+ N WKDL K++L VYR
Sbjct: 614 TPNAELIDDKYQGVRAIRGAVIHDLG--SPFDIQPDAYSWHNVKEWKDLAPKYILMVYRH 671
Query: 601 FVATGDKNFARAVWPSVYIAMAYMEQFDKDGDGMIENEGFPDQTYDAWSANGVSAYCGGL 660
+ T D + + W +V ++ ++ +GD + G D T+D +++G+S YC L
Sbjct: 672 YQNTQDISVVKECWQAVTESIDFLSNLIAEGDDLPLTRG-TDDTFDNLASHGISIYCASL 730
Query: 661 WVAALQAASALANDVGDHASASYFWVRYQKAKA-VYDSLWNGS--YFNYDNSDGSSSTSI 717
WVA LQAAS LA + + A+ + R +KA A V SLW+ Y+++ T +
Sbjct: 731 WVAGLQAASELAKLMNESELAAGYLTRSKKALATVEQSLWDDKEGYYHF------FVTPV 784
Query: 718 QADQLAGQWYARACGL-LPIADEAKVKKALTKIY 750
QA L G+ Y L L + +A K + Y
Sbjct: 785 QAKHLTGEGYQALETLGLSLTGDAIADKTMLNTY 818
>gi|159041841|ref|YP_001541093.1| hypothetical protein Cmaq_1276 [Caldivirga maquilingensis IC-167]
gi|157920676|gb|ABW02103.1| protein of unknown function DUF608 [Caldivirga maquilingensis
IC-167]
Length = 688
Score = 125 bits (314), Expect = 1e-25, Method: Compositional matrix adjust.
Identities = 164/683 (24%), Positives = 260/683 (38%), Gaps = 134/683 (19%)
Query: 88 GIPLGGIGAGSIGRSYRGEFQRFKLFHGICDDAPVLANQFSVFVSRPNGEKFSSVLCPRS 147
GIPLGGIG+G + S G + + L F +F+ +P+ + C
Sbjct: 13 GIPLGGIGSGGVEVSNDGRLINARFANNWAYPIRDLRG-FHIFI-KPHDASGFFMHC--- 67
Query: 148 PGVPKKNTDSGIESWDWNLKGENCTYHALFPRAW--TVYDGEPDPELRIVCRQISPFIPH 205
+ G+E +G + +P AW +G + + SP IP
Sbjct: 68 -----RVNVLGLEG-----RGALIGFEGRWPFAWLRAFRNG-----VNVEVEAFSPIIPG 112
Query: 206 NYKESSFPASVFTFTLSNSGQTSADVTLLFTWANSVAGDSGLSGHHFNSKTMTKDGVHGL 265
N K+S+ P FT + S +A N + G N G+
Sbjct: 113 NLKDSTLPVIGFTIRVKGSDALAAVSVPNVVGTNPI----GRINRSING---------GV 159
Query: 266 TLHHRTANGRPPVTFAVAAEETADVHVSECPCFLLSGNSKGITAKDMWNEIKKHGSFDH- 324
+ A P +A ++E P F ++ + + G+F++
Sbjct: 160 LFTNNKAPDNDPAKGNIAL-------ITEEPRFTITQYNINSKPEHALKARTWKGAFENP 212
Query: 325 ---LDNDK---TSPSEPGSSIG----AAIAASLTIPSGSTRSVTFSLAWDCPEVKFFEKV 374
L DK + EP G A +L +P+G V F+L+W FF
Sbjct: 213 EPWLTIDKGGVPTGEEPHEVTGLWDDPAGLIALNVPNGG--EVRFTLSW------FFNGR 264
Query: 375 YHR-RYTKFYGTLGDSAARIAHDAILEHAKWECEIEAWQRPILEDKRFPEWYPITLFNEL 433
+H Y +Y ++ +A + E + WQ ++ D P+W + N
Sbjct: 265 WHLYNYGHYYENFFKDSSEVARYVLDEFDRLRTSTLDWQNSLI-DPALPDWLRDAVVNST 323
Query: 434 YYLNAGGTIWTDGSPPMQSLATIRERKFSLDTPSKIHTPASSDTALGTRLLENGEENIGQ 493
Y L S R+ +FS+ LE G E
Sbjct: 324 YILTT-------------STWLTRDGRFSI--------------------LE-GVEVCPC 349
Query: 494 FLYLEGAEYVMYNTYDVHFYSSFALVMLFPKLELSIQRDFAAAVMMHDPGTMKIMSDGKW 553
L GA Y S +V++FP+LE S+ R F A+
Sbjct: 350 HGTLAGACYET---------GSLPVVLMFPELEKSLLRQFTEAM---------------- 384
Query: 554 VARKCLGAVPHDIGLDDPWFEINSYNLFNSS--RWKDLNSKFVLQVYRDFVATGDKNFAR 611
+ G +PH +G+ + ++ ++ RWKDLNS ++L V+R F + D F +
Sbjct: 385 ---RSDGYIPHSLGI----YSLDHIEDGTTAPPRWKDLNSTYILLVHRYFKRSNDVEFIK 437
Query: 612 AVWPSVYIAMAYMEQFDKDGDGMIENEGFPDQTYDAWSANGVSAYCGGLWVAALQAASAL 671
++P + A ++ DKDGDG+ E G D +DA S G +Y LW+AAL L
Sbjct: 438 EIYPKLIKAFEWVLVQDKDGDGVPELSGDGDTGFDAMSVKGFDSYTTSLWIAALMVMGEL 497
Query: 672 ANDVGDHASASYFWVRYQKAKAVYDSLWNGSYFN-YDNSDGSSSTSIQADQLAGQWYARA 730
A +GD A+ S KA+ Y+ W G F +D D ++ + Q+ G+W++
Sbjct: 498 AKLMGDQATLSKVESTLLKARDSYNRRWLGDRFKAWDEPDMGKASFLA--QIWGEWWSLM 555
Query: 731 CGLLPIADEAKVKKALTKIYDFN 753
GL I DE KVK A+ I N
Sbjct: 556 LGLGHITDEDKVKAAMGTIIRVN 578
>gi|119578750|gb|EAW58346.1| glucosidase, beta (bile acid) 2, isoform CRA_b [Homo sapiens]
Length = 370
Score = 122 bits (307), Expect = 8e-25, Method: Compositional matrix adjust.
Identities = 81/227 (35%), Positives = 120/227 (52%), Gaps = 17/227 (7%)
Query: 52 LAHIGFRLYRY----SKQEEAKGKVPVFNVFRKHHITSDQGIPLGGIGAGSIGRSYRGEF 107
+ HIG L RY ++ + K P ++ + G PLGGIG G+I R +RG+F
Sbjct: 112 IKHIGMGL-RYLQWWYRKTHVEKKTPFIDMINSVPLRQIYGCPLGGIGGGTITRGWRGQF 170
Query: 108 QRFKLFHGICDDAPVLANQFSVFVSRPNGEKFSSVLCPRSPGVPKKNTDSGIESWDWNLK 167
R++L G+ V+A+QF+V + R + VL P V + SW+W L
Sbjct: 171 CRWQLNPGMYQHRTVIADQFTVCLRREGQTVYQQVLSLERPSV--------LRSWNWGLC 222
Query: 168 GENCTYHALFPRAWTVYDGEPDPELRIVCRQISPFIPHNYKESSFPASVFTFTLSNSGQT 227
G YHAL+PRAWTVY P + + CRQI+P +PH+Y++SS P VF + + N G
Sbjct: 223 GYFAFYHALYPRAWTVYQ-LPGQNVTLTCRQITPILPHDYQDSSLPVGVFVWDVENEGDE 281
Query: 228 SADVTLLFTWANSVAGDSGLSGHHFNSK-TMTKDG--VHGLTLHHRT 271
+ DV+++F+ N + G G +N + + G V GL LHH T
Sbjct: 282 ALDVSIMFSMRNGLGGGDDAPGGLWNEPFCLERSGETVRGLLLHHPT 328
>gi|15922862|ref|NP_378531.1| hypothetical protein ST2526 [Sulfolobus tokodaii str. 7]
gi|15623653|dbj|BAB67640.1| putative glycosidase [Sulfolobus tokodaii str. 7]
Length = 646
Score = 121 bits (303), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 86/306 (28%), Positives = 140/306 (45%), Gaps = 25/306 (8%)
Query: 560 GAVPHDIGLDDPWFEINSYNLFNSSRWKDLNSKFVLQVYRDFVATGDKNFARAVWPSVYI 619
G VPHD+G + E Y W DL S +VL +YRD+ T D F + + +
Sbjct: 354 GEVPHDLGEES--IEAPIYGASYGYPWTDLGSTWVLMIYRDYKFTNDLAFLKRNYRKMKE 411
Query: 620 AMAYMEQFDKDGDGMIENEGFPDQTYDAWSANGVSAYCGGLWVAALQAASALANDVGDHA 679
+ ++ DKD D + +++G D +YD G S+Y G +++ AL+A + + +G
Sbjct: 412 VIDWLISLDKDKDCIPDSKGGFDNSYDGTYMYGASSYVGSMFLCALRAFISASKILGMEY 471
Query: 680 SASYFWVRYQKAKAVYDSLWNGSYFNYDNSDGSSSTSIQADQLAGQWYARACGLLPIADE 739
S +R + ++SLWNG YF SD S + Q+ GQ++ GL PI DE
Sbjct: 472 SIYEDCLR--RGIMTFNSLWNGKYFIAWKSDNRKKESCMSSQILGQFWCDILGLEPIIDE 529
Query: 740 AKVKKALTKIYDFNVLKVKGGMCGAMNGMQPDGRIDMSGLQAREIWPGVTYGLAASMIQE 799
K+ +AL IY+ N K +N + PDG ID Q R W V + ++A MI
Sbjct: 530 DKIVQALRSIYELN---GKASKFCLVNSVNPDGSIDTETDQMRSCWSRVAFAVSAHMIIR 586
Query: 800 EMVDMAFQTAAGVYEVAWSEDGLGYSFQTPESWNNNDEYRSLC--------YMRPLTIWA 851
+ + + A +E + ++ WN + + Y+ + W
Sbjct: 587 GLKNEGIEIANREWE----------TIKSLGKWNQSSRIDGMTGNKVGLPYYIGSASTWF 636
Query: 852 MQWALT 857
+++ALT
Sbjct: 637 IKFALT 642
>gi|15899397|ref|NP_344002.1| hypothetical protein SSO2674 [Sulfolobus solfataricus P2]
gi|384433011|ref|YP_005642369.1| hypothetical protein [Sulfolobus solfataricus 98/2]
gi|13815992|gb|AAK42792.1| Conserved hypothetical protein [Sulfolobus solfataricus P2]
gi|261601165|gb|ACX90768.1| protein of unknown function DUF608 [Sulfolobus solfataricus 98/2]
Length = 648
Score = 119 bits (299), Expect = 7e-24, Method: Compositional matrix adjust.
Identities = 79/273 (28%), Positives = 131/273 (47%), Gaps = 12/273 (4%)
Query: 545 MKIMSDGKWVARKCLGAVPHDIG---LDDPWFEINSYNLFNSSRWKDLNSKFVLQVYRDF 601
+ I +D ++ G PHD+G ++DP + SY + W DL VL +YRD+
Sbjct: 341 LVISADNYFMNYVNYGETPHDVGEESIEDPIYGA-SYPYW----WTDLGPTLVLMLYRDY 395
Query: 602 VATGDKNFARAVWPSVYIAMAYMEQFDKDGDGMIENEGFPDQTYDAWSANGVSAYCGGLW 661
V T ++ + + + ++ + D D D + +++G D +YD G S+Y ++
Sbjct: 396 VFTSNREILEKNYNKIKEIIDWLIRKDMDNDCIPDSKGGYDNSYDGTHMYGASSYIASMF 455
Query: 662 VAALQAASALANDVGDHASASYFWVRYQKAKAVYDSLWNGSYFNYDNSDGSSSTSIQADQ 721
++AL A ++ ++ D ++ + K ++SLWNG YF + +TS Q
Sbjct: 456 LSALTAFIKMS-EILDVKIDDKYYRFLECGKKTFNSLWNGKYFILWKKNDEENTSCLNSQ 514
Query: 722 LAGQWYARACGLLPIADEAKVKKALTKIYDFNVLKVKGGMCGAMNGMQPDGRIDMSGLQA 781
L GQ++ GL PI D K+ AL IY+ N K N ++ DG +D S Q
Sbjct: 515 LLGQFWCDILGLPPITDHDKINTALRSIYELN---FKASKYCLTNAVREDGSVDSSTAQL 571
Query: 782 REIWPGVTYGLAASMIQEEMVDMAFQTAAGVYE 814
R WP V++ +AA MI MV + A +E
Sbjct: 572 RSCWPRVSFAVAAHMILRGMVKEGIEVAKREWE 604
>gi|284173198|ref|ZP_06387167.1| hypothetical protein Ssol98_00875 [Sulfolobus solfataricus 98/2]
Length = 647
Score = 119 bits (298), Expect = 7e-24, Method: Compositional matrix adjust.
Identities = 79/273 (28%), Positives = 131/273 (47%), Gaps = 12/273 (4%)
Query: 545 MKIMSDGKWVARKCLGAVPHDIG---LDDPWFEINSYNLFNSSRWKDLNSKFVLQVYRDF 601
+ I +D ++ G PHD+G ++DP + SY + W DL VL +YRD+
Sbjct: 340 LVISADNYFMNYVNYGETPHDVGEESIEDPIYGA-SYPYW----WTDLGPTLVLMLYRDY 394
Query: 602 VATGDKNFARAVWPSVYIAMAYMEQFDKDGDGMIENEGFPDQTYDAWSANGVSAYCGGLW 661
V T ++ + + + ++ + D D D + +++G D +YD G S+Y ++
Sbjct: 395 VFTSNREILEKNYNKIKEIIDWLIRKDMDNDCIPDSKGGYDNSYDGTHMYGASSYIASMF 454
Query: 662 VAALQAASALANDVGDHASASYFWVRYQKAKAVYDSLWNGSYFNYDNSDGSSSTSIQADQ 721
++AL A ++ ++ D ++ + K ++SLWNG YF + +TS Q
Sbjct: 455 LSALTAFIKMS-EILDVKIDDKYYRFLECGKKTFNSLWNGKYFILWKKNDEENTSCLNSQ 513
Query: 722 LAGQWYARACGLLPIADEAKVKKALTKIYDFNVLKVKGGMCGAMNGMQPDGRIDMSGLQA 781
L GQ++ GL PI D K+ AL IY+ N K N ++ DG +D S Q
Sbjct: 514 LLGQFWCDILGLPPITDHDKINTALRSIYELN---FKASKYCLTNAVREDGSVDSSTAQL 570
Query: 782 REIWPGVTYGLAASMIQEEMVDMAFQTAAGVYE 814
R WP V++ +AA MI MV + A +E
Sbjct: 571 RSCWPRVSFAVAAHMILRGMVKEGIEVAKREWE 603
>gi|374921957|gb|AFA26156.1| non-lysosomal glucosylceramidase-like protein, partial [Lolium
perenne]
Length = 109
Score = 117 bits (293), Expect = 3e-23, Method: Compositional matrix adjust.
Identities = 70/108 (64%), Positives = 84/108 (77%), Gaps = 1/108 (0%)
Query: 620 AMAYMEQFDKDGDGMIENEGFPDQTYDAWSANGVSAYCGGLWVAALQAASALANDVGDHA 679
AM YMEQFD+D D MIEN+GFPDQTYDAW+ GVSAYCG LW+A+LQAA+A+A +G
Sbjct: 2 AMEYMEQFDRDDDSMIENDGFPDQTYDAWTVLGVSAYCGCLWLASLQAAAAMARSLGHAD 61
Query: 680 SASYFWVRYQKAKAVYDS-LWNGSYFNYDNSDGSSSTSIQADQLAGQW 726
A V++ KAK V+++ LWNGSYFNYD+ SS SIQADQLAGQW
Sbjct: 62 YAERCMVKFAKAKHVFEAKLWNGSYFNYDSGTSYSSRSIQADQLAGQW 109
>gi|385777223|ref|YP_005649791.1| hypothetical protein [Sulfolobus islandicus REY15A]
gi|323475970|gb|ADX86576.1| conserved hypothetical protein [Sulfolobus islandicus REY15A]
Length = 661
Score = 117 bits (292), Expect = 3e-23, Method: Compositional matrix adjust.
Identities = 81/273 (29%), Positives = 130/273 (47%), Gaps = 12/273 (4%)
Query: 545 MKIMSDGKWVARKCLGAVPHDIG---LDDPWFEINSYNLFNSSRWKDLNSKFVLQVYRDF 601
+ I +D V G PHDIG ++DP + SY + W DL VL +YRD+
Sbjct: 354 LVISADNHLVNYINNGEAPHDIGEESIEDPIYGA-SYPYW----WTDLGPTLVLMLYRDY 408
Query: 602 VATGDKNFARAVWPSVYIAMAYMEQFDKDGDGMIENEGFPDQTYDAWSANGVSAYCGGLW 661
V T D+ + + + ++ + D D D + +++G D +YD G S+Y ++
Sbjct: 409 VFTSDRGILEKNYNKIKEIIDWLIRKDMDNDCIPDSKGGYDNSYDGTHMYGASSYVASMF 468
Query: 662 VAALQAASALANDVGDHASASYFWVRYQKAKAVYDSLWNGSYFNYDNSDGSSSTSIQADQ 721
++AL A ++ ++ D + ++ + K ++SLWNG YF + + S Q
Sbjct: 469 LSALAAFIKIS-EILDVKIDTKYYRFLECGKKTFNSLWNGKYFVLWKKNDEENRSCLNSQ 527
Query: 722 LAGQWYARACGLLPIADEAKVKKALTKIYDFNVLKVKGGMCGAMNGMQPDGRIDMSGLQA 781
L GQ++ GL PI D K+ AL IY+ N K + N ++ D ID S Q
Sbjct: 528 LLGQFWCDILGLPPITDNDKINTALRSIYELNFRASKYCLT---NAVKEDRDIDTSTGQL 584
Query: 782 REIWPGVTYGLAASMIQEEMVDMAFQTAAGVYE 814
R WP V++ +AA MI MV + A +E
Sbjct: 585 RSCWPRVSFAVAAHMILRGMVKEGMEVAKREWE 617
>gi|227828867|ref|YP_002830647.1| hypothetical protein M1425_2640 [Sulfolobus islandicus M.14.25]
gi|227460663|gb|ACP39349.1| protein of unknown function DUF608 [Sulfolobus islandicus M.14.25]
Length = 661
Score = 117 bits (292), Expect = 4e-23, Method: Compositional matrix adjust.
Identities = 81/275 (29%), Positives = 130/275 (47%), Gaps = 12/275 (4%)
Query: 545 MKIMSDGKWVARKCLGAVPHDIG---LDDPWFEINSYNLFNSSRWKDLNSKFVLQVYRDF 601
+ I +D V G PHDIG ++DP + SY + W DL VL +YRD+
Sbjct: 354 LVISADNHLVNYINNGEAPHDIGEESIEDPIYGA-SYPYW----WTDLGPTLVLMLYRDY 408
Query: 602 VATGDKNFARAVWPSVYIAMAYMEQFDKDGDGMIENEGFPDQTYDAWSANGVSAYCGGLW 661
V T D+ + + + ++ + D D D + +++G D +YD G S+Y ++
Sbjct: 409 VFTSDRGILEKNYNKIKEIIDWLIRKDMDNDCIPDSKGGYDNSYDGTHMYGASSYVASMF 468
Query: 662 VAALQAASALANDVGDHASASYFWVRYQKAKAVYDSLWNGSYFNYDNSDGSSSTSIQADQ 721
++AL A ++ ++ D + ++ + K ++SLWNG YF + + S Q
Sbjct: 469 LSALAAFIKIS-EILDVKIDTKYYRFLECGKKTFNSLWNGKYFVLWKKNDEENRSCLNSQ 527
Query: 722 LAGQWYARACGLLPIADEAKVKKALTKIYDFNVLKVKGGMCGAMNGMQPDGRIDMSGLQA 781
L GQ++ GL PI D K+ AL IY+ N K + N ++ D ID S Q
Sbjct: 528 LLGQFWCDILGLPPITDNDKINTALRSIYELNFRASKYCLT---NAVKEDRDIDTSTGQL 584
Query: 782 REIWPGVTYGLAASMIQEEMVDMAFQTAAGVYEVA 816
R WP V++ +AA MI MV + A +E
Sbjct: 585 RSCWPRVSFAVAAHMILRGMVKEGMEVAKREWETV 619
>gi|385774575|ref|YP_005647144.1| hypothetical protein [Sulfolobus islandicus HVE10/4]
gi|323478691|gb|ADX83929.1| conserved hypothetical protein [Sulfolobus islandicus HVE10/4]
Length = 661
Score = 117 bits (292), Expect = 4e-23, Method: Compositional matrix adjust.
Identities = 78/258 (30%), Positives = 125/258 (48%), Gaps = 12/258 (4%)
Query: 560 GAVPHDIG---LDDPWFEINSYNLFNSSRWKDLNSKFVLQVYRDFVATGDKNFARAVWPS 616
G PHDIG ++DP + SY + W DL VL +YRD+V T D+ +
Sbjct: 369 GEAPHDIGEESIEDPIYGA-SYPYW----WTDLGPTLVLMLYRDYVFTSDRGILEKNYNK 423
Query: 617 VYIAMAYMEQFDKDGDGMIENEGFPDQTYDAWSANGVSAYCGGLWVAALQAASALANDVG 676
+ + ++ + D D D + +++G D +YD G S+Y ++++AL A ++ ++
Sbjct: 424 IKEIIDWLIRKDMDNDCIPDSKGGYDNSYDGTHMYGASSYVASMFLSALAAFIKIS-EIL 482
Query: 677 DHASASYFWVRYQKAKAVYDSLWNGSYFNYDNSDGSSSTSIQADQLAGQWYARACGLLPI 736
D + ++ + K ++SLWNG YF + + S QL GQ++ GL PI
Sbjct: 483 DVKIDTKYYRFLECGKKTFNSLWNGKYFVLWKKNDEENRSCLNSQLLGQFWCDILGLPPI 542
Query: 737 ADEAKVKKALTKIYDFNVLKVKGGMCGAMNGMQPDGRIDMSGLQAREIWPGVTYGLAASM 796
D K+ AL IY+ N K + N ++ D ID S Q R WP V++ +AA M
Sbjct: 543 TDNDKINTALRSIYELNFRASKYCLT---NAVKEDRDIDTSTGQLRSCWPRVSFAVAAHM 599
Query: 797 IQEEMVDMAFQTAAGVYE 814
I MV + A +E
Sbjct: 600 ILRGMVKEGMEVAKREWE 617
>gi|229586074|ref|YP_002844576.1| hypothetical protein M1627_2694 [Sulfolobus islandicus M.16.27]
gi|228021124|gb|ACP56531.1| protein of unknown function DUF608 [Sulfolobus islandicus M.16.27]
Length = 661
Score = 117 bits (292), Expect = 4e-23, Method: Compositional matrix adjust.
Identities = 81/273 (29%), Positives = 130/273 (47%), Gaps = 12/273 (4%)
Query: 545 MKIMSDGKWVARKCLGAVPHDIG---LDDPWFEINSYNLFNSSRWKDLNSKFVLQVYRDF 601
+ I +D V G PHDIG ++DP + SY + W DL VL +YRD+
Sbjct: 354 LVISADNHLVNYINNGEAPHDIGEESIEDPIYGA-SYPYW----WTDLGPTLVLMLYRDY 408
Query: 602 VATGDKNFARAVWPSVYIAMAYMEQFDKDGDGMIENEGFPDQTYDAWSANGVSAYCGGLW 661
V T D+ + + + ++ + D D D + +++G D +YD G S+Y ++
Sbjct: 409 VFTSDRGILEKNYNKIKEIIDWLIRKDMDNDCIPDSKGGYDNSYDGTHMYGASSYVASMF 468
Query: 662 VAALQAASALANDVGDHASASYFWVRYQKAKAVYDSLWNGSYFNYDNSDGSSSTSIQADQ 721
++AL A ++ ++ D + ++ + K ++SLWNG YF + + S Q
Sbjct: 469 LSALAAFIKIS-EILDVKIDTKYYRFLECGKKTFNSLWNGKYFVLWKKNDEENRSCLNSQ 527
Query: 722 LAGQWYARACGLLPIADEAKVKKALTKIYDFNVLKVKGGMCGAMNGMQPDGRIDMSGLQA 781
L GQ++ GL PI D K+ AL IY+ N K + N ++ D ID S Q
Sbjct: 528 LLGQFWCDILGLPPITDNDKINTALRSIYELNFRASKYCLT---NAVKEDRDIDTSTGQL 584
Query: 782 REIWPGVTYGLAASMIQEEMVDMAFQTAAGVYE 814
R WP V++ +AA MI MV + A +E
Sbjct: 585 RSCWPRVSFAVAAHMILRGMVKEGMEVAKREWE 617
>gi|227831601|ref|YP_002833381.1| hypothetical protein LS215_2801 [Sulfolobus islandicus L.S.2.15]
gi|229580553|ref|YP_002838953.1| hypothetical protein YG5714_2812 [Sulfolobus islandicus Y.G.57.14]
gi|238621059|ref|YP_002915885.1| hypothetical protein M164_2625 [Sulfolobus islandicus M.16.4]
gi|284999153|ref|YP_003420921.1| hypothetical protein [Sulfolobus islandicus L.D.8.5]
gi|227458049|gb|ACP36736.1| protein of unknown function DUF608 [Sulfolobus islandicus L.S.2.15]
gi|228011269|gb|ACP47031.1| protein of unknown function DUF608 [Sulfolobus islandicus
Y.G.57.14]
gi|238382129|gb|ACR43217.1| protein of unknown function DUF608 [Sulfolobus islandicus M.16.4]
gi|284447049|gb|ADB88551.1| protein of unknown function DUF608 [Sulfolobus islandicus L.D.8.5]
Length = 661
Score = 117 bits (292), Expect = 4e-23, Method: Compositional matrix adjust.
Identities = 81/273 (29%), Positives = 130/273 (47%), Gaps = 12/273 (4%)
Query: 545 MKIMSDGKWVARKCLGAVPHDIG---LDDPWFEINSYNLFNSSRWKDLNSKFVLQVYRDF 601
+ I +D V G PHDIG ++DP + SY + W DL VL +YRD+
Sbjct: 354 LVISADNHLVNYINNGEAPHDIGEESIEDPIYGA-SYPYW----WTDLGPTLVLMLYRDY 408
Query: 602 VATGDKNFARAVWPSVYIAMAYMEQFDKDGDGMIENEGFPDQTYDAWSANGVSAYCGGLW 661
V T D+ + + + ++ + D D D + +++G D +YD G S+Y ++
Sbjct: 409 VFTSDRGILEKNYNKIKEIIDWLIRKDMDNDCIPDSKGGYDNSYDGTHMYGASSYVASMF 468
Query: 662 VAALQAASALANDVGDHASASYFWVRYQKAKAVYDSLWNGSYFNYDNSDGSSSTSIQADQ 721
++AL A ++ ++ D + ++ + K ++SLWNG YF + + S Q
Sbjct: 469 LSALAAFIKIS-EILDVKIDTKYYRFLECGKKTFNSLWNGKYFVLWKKNDEENRSCLNSQ 527
Query: 722 LAGQWYARACGLLPIADEAKVKKALTKIYDFNVLKVKGGMCGAMNGMQPDGRIDMSGLQA 781
L GQ++ GL PI D K+ AL IY+ N K + N ++ D ID S Q
Sbjct: 528 LLGQFWCDILGLPPITDNDKINTALRSIYELNFRASKYCLT---NAVKEDRDIDTSTGQL 584
Query: 782 REIWPGVTYGLAASMIQEEMVDMAFQTAAGVYE 814
R WP V++ +AA MI MV + A +E
Sbjct: 585 RSCWPRVSFAVAAHMILRGMVKEGMEVAKREWE 617
>gi|229583411|ref|YP_002841810.1| hypothetical protein YN1551_3001 [Sulfolobus islandicus Y.N.15.51]
gi|228014127|gb|ACP49888.1| protein of unknown function DUF608 [Sulfolobus islandicus
Y.N.15.51]
Length = 661
Score = 116 bits (290), Expect = 6e-23, Method: Compositional matrix adjust.
Identities = 80/273 (29%), Positives = 130/273 (47%), Gaps = 12/273 (4%)
Query: 545 MKIMSDGKWVARKCLGAVPHDIG---LDDPWFEINSYNLFNSSRWKDLNSKFVLQVYRDF 601
+ I +D V G PHDIG ++DP + SY + W DL VL +YRD+
Sbjct: 354 LVISADNHLVNYINNGEAPHDIGEESIEDPIYGA-SYPYW----WTDLGPTLVLMLYRDY 408
Query: 602 VATGDKNFARAVWPSVYIAMAYMEQFDKDGDGMIENEGFPDQTYDAWSANGVSAYCGGLW 661
+ T D+ + + + ++ + D D D + +++G D +YD G S+Y ++
Sbjct: 409 IFTSDRGILEKNYNKIKEIIDWLIRKDMDNDCIPDSKGGYDNSYDGTHMYGASSYVASMF 468
Query: 662 VAALQAASALANDVGDHASASYFWVRYQKAKAVYDSLWNGSYFNYDNSDGSSSTSIQADQ 721
++AL A ++ ++ D + ++ + K ++SLWNG YF + + S Q
Sbjct: 469 LSALAAFIKIS-EILDIKIDTKYYRFLECGKKTFNSLWNGKYFVLWKKNDEENRSCLNSQ 527
Query: 722 LAGQWYARACGLLPIADEAKVKKALTKIYDFNVLKVKGGMCGAMNGMQPDGRIDMSGLQA 781
L GQ++ GL PI D K+ AL IY+ N K + N ++ D ID S Q
Sbjct: 528 LLGQFWCDILGLPPITDNDKINTALRSIYELNFRASKYCLT---NAVKEDRDIDTSTGQL 584
Query: 782 REIWPGVTYGLAASMIQEEMVDMAFQTAAGVYE 814
R WP V++ +AA MI MV + A +E
Sbjct: 585 RSCWPRVSFAVAAHMILRGMVKEGMEVAKREWE 617
>gi|327310049|ref|YP_004336946.1| hypothetical protein TUZN_0133 [Thermoproteus uzoniensis 768-20]
gi|326946528|gb|AEA11634.1| hypothetical protein TUZN_0133 [Thermoproteus uzoniensis 768-20]
Length = 660
Score = 114 bits (285), Expect = 3e-22, Method: Compositional matrix adjust.
Identities = 88/300 (29%), Positives = 131/300 (43%), Gaps = 32/300 (10%)
Query: 514 SSFALVMLFPKLELSIQRDFAAAVMMHDPGTMKIMSDGKWVARKCLGAVPHDIG---LDD 570
+ FAL+ LFP L + FA M+ G G +PHD G ++D
Sbjct: 334 AGFALLELFPDLVVRADEVFA----MYRRG----------------GEIPHDYGEESIED 373
Query: 571 PWFEINSYNLFNSSRWKDLNSKFVLQVYRDFVATGDKNFARAVWPSVYIAMAYMEQFDKD 630
P Y + WKDL S +VL +YRD+ TGD R ++ A+ ++ D+D
Sbjct: 374 P-----IYGATYLTPWKDLASTWVLMIYRDYYYTGDVEILRRNIDAMKDAVDWLMSLDRD 428
Query: 631 GDGMIENEGFPDQTYDAWSANGVSAYCGGLWVAALQAASALANDVGDHASASYFWVRYQK 690
GD + ++ G D +YD + G AY L++ AL A A +G Y ++
Sbjct: 429 GDCVPDSRGRNDNSYDGSNMYGRVAYVASLFLCALTAYIKAAERLGLKPEPKYLEC-LER 487
Query: 691 AKAVYDSLWNGSYFNYDNSDGSSSTSIQADQLAGQWYARACGLLPIADEAKVKKALTKIY 750
K D LWNG Y+ + + QL GQ++ L PIA + KV AL IY
Sbjct: 488 GKRSLDELWNGRYYVAWADGPRRKDACMSSQLLGQFWCDMLDLPPIAPDDKVAAALRSIY 547
Query: 751 DFNVLKVKGGMCGAMNGMQPDGRIDMSGLQAREIWPGVTYGLAASMIQEEMVDMAFQTAA 810
+ L ++ N + PDG+ D Q R W V + +AA M+ M + A+
Sbjct: 548 E---LGMRASKYCIPNSVTPDGKPDEETPQLRSCWTRVNFAVAAHMLLRGMEREGMEVAS 604
>gi|148242848|ref|YP_001228005.1| hypothetical protein SynRCC307_1749 [Synechococcus sp. RCC307]
gi|147851158|emb|CAK28652.1| Conserved hypothetical protein [Synechococcus sp. RCC307]
Length = 220
Score = 112 bits (280), Expect = 9e-22, Method: Composition-based stats.
Identities = 72/211 (34%), Positives = 109/211 (51%), Gaps = 14/211 (6%)
Query: 653 VSAYCGGLWVAALQAASA----LANDVGDHASASY--FWVRYQKAKAVYDSL-WNGSYFN 705
+SAYCG LW+AAL+AA A LA ++G +A F Q+++ +D L WNG ++
Sbjct: 1 MSAYCGALWIAALEAALAMGQTLALELGVDTAAEQRDFSQWLQQSRGNFDQLLWNGEFYK 60
Query: 706 YDNSDGSSSTSIQADQLAGQWYARACGLLPIADEAKVKKALTKIYDFNVLKVKGGMCGAM 765
D G+ + ADQL G +YAR GL P+ EA + +L I + GG G
Sbjct: 61 IDAESGTPV--VMADQLCGDFYARLLGLPPVVSEANARSSLQAIKEACFEGFAGGRLGVA 118
Query: 766 NGMQPDGR-IDMSGLQAREIWPGVTYGLAASMIQEEMVDMAFQTAAGVYEVAWSEDGLGY 824
NG++ DG +D +G E+W G+ +GLAA + A V ++ G
Sbjct: 119 NGLRRDGTPLDPNGTHPLEVWTGINFGLAAYYRLMGETNTAVAITQAVVNQVYAG---GL 175
Query: 825 SFQTPESWNNNDEYRSLCYMRPLTIWAMQWA 855
F+TPE+ + +R+ Y+R + IWA+ WA
Sbjct: 176 QFRTPEALTGQNTFRACHYLRAMAIWAL-WA 205
>gi|284997264|ref|YP_003419031.1| hypothetical protein [Sulfolobus islandicus L.D.8.5]
gi|284445159|gb|ADB86661.1| protein of unknown function DUF608 [Sulfolobus islandicus L.D.8.5]
Length = 740
Score = 112 bits (279), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 73/239 (30%), Positives = 117/239 (48%), Gaps = 14/239 (5%)
Query: 560 GAVPHDIG---LDDPWFEINSYNLFNSSRWKDLNSKFVLQVYRDFVATGDKNFARAVWPS 616
G VPHD+G LD P S +WKD N F+L VYR + T D +F + ++P
Sbjct: 450 GYVPHDLGIHSLDHPVDGTTS-----PPKWKDTNPTFILLVYRYYKFTNDLDFLKEMYPK 504
Query: 617 VYIAMAYMEQFDKDGDGMIENEGFPDQTYDAWSANGVSAYCGGLWVAALQAASALANDVG 676
+ AM + DKD DG+ E EG D +DA S G+ +Y +++A++ A A +
Sbjct: 505 MLKAMKWELTQDKDKDGVPELEGQGDTGFDAMSVKGIDSYTTSIYIASIIALKETAKILK 564
Query: 677 DHASASYFWVRYQKAKAVYDSLWNGSYFNYDNSDGSSSTS-IQADQLAGQWYARACGLLP 735
D+ + + + +K++ ++ L+NG YF D G I QL G+W++ GL P
Sbjct: 565 DNDTLNEMTILLEKSRKIFSRLFNGKYF--DPWIGEPEIKGIFLGQLVGEWWSEILGLEP 622
Query: 736 IADEAKVKKALTKIYDFNVLKVKGGMCGAMNGMQPDGRIDMSGLQAREIWPGVTYGLAA 794
I +E K+ AL + N C N + DG+I Q+ WP + + +++
Sbjct: 623 IIEEEKISSALEAMLSINA--NSSNFCTP-NIVSEDGKIIDISPQSYSSWPRLVFAISS 678
Score = 48.9 bits (115), Expect = 0.014, Method: Compositional matrix adjust.
Identities = 41/140 (29%), Positives = 63/140 (45%), Gaps = 19/140 (13%)
Query: 88 GIPLGGIGAGSIGRSYRGEFQRFKLFHGICDDAPVLANQFSVFVSRPNGEKFSSVLCPRS 147
G+PLGGIGAG I + +G+ +F+ P + F + + +G F L
Sbjct: 74 GVPLGGIGAGKIEINNKGKVVNVTIFNNKGFPIPNIRG-FHILIKPDDGNPF--FLEKDL 130
Query: 148 PGVPKKNTDSGIESWDWNLKGENCTYHALFPRAWTVYDGEPDPELRIVCRQISPFIPHNY 207
P + K+ +S I Y L+P A+ + G D ++R SP IP N
Sbjct: 131 PFISLKHLESDI-----------LIYEGLYPFAYLI--GIKD-KIRAKLTAFSPLIPRNI 176
Query: 208 KESSFPASVFTFTL--SNSG 225
+S+ PA F+ + SNSG
Sbjct: 177 NDSTLPAVGFSLKIEGSNSG 196
>gi|149196161|ref|ZP_01873217.1| hypothetical protein LNTAR_23484 [Lentisphaera araneosa HTCC2155]
gi|149141008|gb|EDM29405.1| hypothetical protein LNTAR_23484 [Lentisphaera araneosa HTCC2155]
Length = 1052
Score = 111 bits (278), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 123/543 (22%), Positives = 220/543 (40%), Gaps = 93/543 (17%)
Query: 169 ENCTYHALFPRAWTVYDGEPDPELRIVCRQISPFIPHNYKESSFPASVFTFTLSNSGQTS 228
E L+P A Y+ + ++RI SP + ++ K S P ++ L+N TS
Sbjct: 219 EKIQNRLLYPYAELKYEDDAH-DVRIEKLHFSPIVRNDEKSCSAPLNLTEIKLTN---TS 274
Query: 229 ADVTLL-FTW---------------ANSVAGDSGLSGHHFNSKTM----TKDGV-HGLTL 267
A V +L W AN AG + + T++G+ G+TL
Sbjct: 275 AKVKILTLAWDQENLSGFSVVKKRPANQDAGFLLQKTVRYQENNLCDIETENGLFQGITL 334
Query: 268 H----HRTANGRPPVTFAVAAEETADVHVSECPCFLLSGNSKGITAKDMWNEIKKHGSFD 323
H + + + +TF + + D+ VS + + E + G +
Sbjct: 335 GNKKGHESGDIKGELTFGILTQNDPDICVSRRSSYYSVERDSTVC------EALQGGRVN 388
Query: 324 HLDNDKTSPSEPGSSIGAAIAASLTIPSGSTRSVTFSLAWDCPEVKFFEKVYHRRYTKFY 383
++ DK + + +T+ G+ +++ F D PE+ E ++YT F+
Sbjct: 389 NVFFDKIYTGR--EQLSGIVTVQVTLQPGAVKNIPFVQILDYPEISLGEYQSQKKYTSFF 446
Query: 384 GTLGDSAARIAHDAILEHAKWECEIEAWQRPILEDKRFPEWYPITLFNELYYLNAGGTIW 443
A + + K+ C+ R ++ + NEL+ L ++
Sbjct: 447 P---------AENRTEDLVKFGCDNFYSTREQIDADQ----------NELHELLLNSDVY 487
Query: 444 TDGSPPMQSLATIRERKFSLDTPSKIHTPASSDTALGTRLLENGEENIGQFLYLEGAEYV 503
+ L T+ + S + + +FL E ++Y
Sbjct: 488 KGKAKSAAELTTMAQNTLSFTADATVWDIDD------------------KFLVRECSDYP 529
Query: 504 MYNTYDVHFYSSFALVMLFPKLELSIQRDFAAAVMMHDPGTMKI-----MSDGKW----- 553
+N+ DV+FY SF+++ L P+++ + R F A+M + + + + K
Sbjct: 530 FFNSLDVYFYGSFSMMWLLPQVDTNTMRCFRDAIMAENDELRRFYVYLELPNAKLPHPKY 589
Query: 554 ---VARKCLGAVPHDIGLD-DPWFEINSYNLFNSSRWKDLNSKFVLQVYRDFVATGDKNF 609
ARK GAV HD+G DP ++YN N + WKDL K++L + R++ T D
Sbjct: 590 EGPRARK--GAVIHDLGSPFDP--RPDAYNWHNVAEWKDLAPKYILMLLRNYHFTKDVYL 645
Query: 610 ARAVWPSVYIAMAYMEQFDKDGDGMIENEGFPDQTYDAWSANGVSAYCGGLWVAALQAAS 669
W SV ++ Y++ +G + G D T+D S+ G++ YCG LWVA L+AA
Sbjct: 646 LEECWDSVQASLQYLKDMILEGHSIPLTNG-TDDTFDNLSSFGITLYCGSLWVAGLKAAG 704
Query: 670 ALA 672
+A
Sbjct: 705 EIA 707
>gi|327309977|ref|YP_004336874.1| hypothetical protein TUZN_0057 [Thermoproteus uzoniensis 768-20]
gi|326946456|gb|AEA11562.1| hypothetical protein TUZN_0057 [Thermoproteus uzoniensis 768-20]
Length = 694
Score = 110 bits (276), Expect = 3e-21, Method: Compositional matrix adjust.
Identities = 183/726 (25%), Positives = 271/726 (37%), Gaps = 134/726 (18%)
Query: 88 GIPLGGIGAGSIGRSYRGEFQRFKLFHGICDDAPVLANQFSVFVSRPNGEKF-----SSV 142
G+PLGGIGAG + +G + L F VFV + +K SV
Sbjct: 13 GVPLGGIGAGKVELDNKGRLVNATFANNWSRPIREL-RGFHVFVYAEDLDKHLFAQDVSV 71
Query: 143 LCPRSPGVPKKNTDSGIESWDWNLKGENCTYHALFPRAWTVYDGEPDPELRIVCRQISPF 202
L +P + +E+ Y +P W G L + S
Sbjct: 72 LGT----LPTLGSAGFVEA----------EYEGRWPFVWL--RGRRG-GLEVEVEGFSAV 114
Query: 203 IPHNYKESSFPASVFTFTLSNSGQTSADVTLL-FTWANSVAGDSGLSGHHF---NSKTMT 258
+P + K+SS PA+ + S + A V++ AN+V + F N++
Sbjct: 115 VPGDLKDSSLPAAGLAVRVRGSRRGIAAVSIPNVVGANNVGRINEPVPGGFKARNARAHE 174
Query: 259 KDGVHG-LTLHHRTANGRPPVTFAVAAEETADVHVSECP--CFLLSGNSKGITAKDMWNE 315
D +G L L + PV AV + DV SE P C L G + W
Sbjct: 175 YDPQNGELALLAKD-----PV--AVIPQFNLDVPRSEAPKVCAL------GPEDERPWLY 221
Query: 316 IKKHGSFDHLDNDKTSPSEPGSSIGAAIAASLTIPSGSTRSVTFSLAWDCPEVKFFEKVY 375
+ F P +S G A + S F L+W +F+K +
Sbjct: 222 LMGKADF--------GPRVAEASGGGEQPAGVVAASYEGGEARFVLSW------YFDKPW 267
Query: 376 HRR-YTKFYGTLGDSAARIAHDAILEHAKWECEIEAWQRPILEDKRFPEWYPITLFNELY 434
Y +Y AA + + E+ + WQ ++ D PEW + N Y
Sbjct: 268 ALYPYGHYYENFFKDAADVGRYFLDEYDRLRRRTAEWQEGLV-DPALPEWLRDAVINSAY 326
Query: 435 YLNAGGTIWTDGSPPMQSLATIRERKFSLDTPSKIHTPASSDTALGTRLLENGEENIGQF 494
L+AG + DG +FS I+ GT +G
Sbjct: 327 ILSAGTWLTKDG-------------RFS------IYEAPEVAPMHGT---------VGGL 358
Query: 495 LYLEGAEYVMYNTYDVHFYSSFALVMLFPKLELSIQRDFAAAVMMHDPGTMKIMSDGKWV 554
Y G S +V++FP+LE R F + H M D
Sbjct: 359 CYETG---------------SLPVVLMFPQLE----RSFLEQLARH-------MRDD--- 389
Query: 555 ARKCLGAVPHDIG---LDDPWFEINSYNLFNSSRWKDLNSKFVLQVYRDFVATGDKNFAR 611
G VPHD+G LD P S RWKD N F+L VYR + TGD +F +
Sbjct: 390 -----GYVPHDLGTHSLDAP-----SDGTTAPPRWKDTNPTFILLVYRYYKRTGDIDFVK 439
Query: 612 AVWPSVYIAMAYMEQFDKDGDGMIENEGFPDQTYDAWSANGVSAYCGGLWVAALQAASAL 671
+V+ + A +M D+DGDG+ E EG D +D G SAY LW+AAL A L
Sbjct: 440 SVYGRLVAAYRWMRGQDRDGDGLPELEGSGDSGFDCAKIRGPSAYTSSLWIAALVALRDL 499
Query: 672 ANDVGDHASASYFWVRYQKAKAVYDSLWNGSYFNYDNSDGSSSTSIQADQLAGQWYARAC 731
A VGD +A ++A+ ++ ++G +Y DG+ ++ A G W+
Sbjct: 500 AEAVGDSKTAEEAGSDLERARNSFERSFDG--VSYKAWDGAPEGAVFAALAVGDWWLYLA 557
Query: 732 GLLP-IADEAKVKKALTKIYDFNVLKVKGGMCGAMNGM-QPDGRI-DMSGLQAREIWPGV 788
G A ++ AL+ + N A G + DG I + G Q WP +
Sbjct: 558 GSREGAARRDRLASALSAVLRVNGSASANCAPNASTGRPEADGEISERVGRQLTASWPRL 617
Query: 789 TYGLAA 794
+ +AA
Sbjct: 618 IFSVAA 623
>gi|194359640|gb|ACF57667.1| bile acid beta-glucosidase [Ctenopharyngodon idella]
Length = 89
Score = 110 bits (276), Expect = 3e-21, Method: Composition-based stats.
Identities = 46/89 (51%), Positives = 62/89 (69%)
Query: 571 PWFEINSYNLFNSSRWKDLNSKFVLQVYRDFVATGDKNFARAVWPSVYIAMAYMEQFDKD 630
PW +N+Y +++++ WKDLN KFVLQV+RD+ T D+ + + +WP M +FDKD
Sbjct: 1 PWVRVNAYLIYDTADWKDLNLKFVLQVFRDYHLTQDEQYLKDMWPICQTVMETELKFDKD 60
Query: 631 GDGMIENEGFPDQTYDAWSANGVSAYCGG 659
GDG+IEN G+ DQTYD W G SAYCGG
Sbjct: 61 GDGLIENSGYADQTYDGWKVTGPSAYCGG 89
>gi|355689818|gb|AER98955.1| glucosidase, beta 2 [Mustela putorius furo]
Length = 157
Score = 109 bits (273), Expect = 6e-21, Method: Composition-based stats.
Identities = 61/166 (36%), Positives = 89/166 (53%), Gaps = 12/166 (7%)
Query: 109 RFKLFHGICDDAPVLANQFSVFVSRPNGEKFSSVLCPRSPGVPKKNTDSGIESWDWNLKG 168
R++L G+ V+A+QF+V + R + VL P V + SW+W L G
Sbjct: 1 RWQLNPGMYQHRTVIADQFTVCLRREGKTVYQQVLSVERPSV--------LRSWNWGLCG 52
Query: 169 ENCTYHALFPRAWTVYDGEPDPELRIVCRQISPFIPHNYKESSFPASVFTFTLSNSGQTS 228
YHAL+PRAWTVY P + + CRQI+P +PH+Y++SS P VF + + N G +
Sbjct: 53 YFAFYHALYPRAWTVYQ-LPGQNVTLTCRQITPILPHDYQDSSLPVGVFVWDVENEGDEA 111
Query: 229 ADVTLLFTWANSVA-GDSGLSGHHFNSKTMTKDG--VHGLTLHHRT 271
DV+++F+ N + GD G + +DG V GL LHH T
Sbjct: 112 LDVSIMFSMRNGLGVGDDAPGGLWNEPFCLERDGETVQGLLLHHPT 157
>gi|302348409|ref|YP_003816047.1| trehalase [Acidilobus saccharovorans 345-15]
gi|302328821|gb|ADL19016.1| Putative trehalase [Acidilobus saccharovorans 345-15]
Length = 928
Score = 106 bits (264), Expect = 6e-20, Method: Compositional matrix adjust.
Identities = 177/827 (21%), Positives = 313/827 (37%), Gaps = 127/827 (15%)
Query: 74 VFNVFRKHHITSDQGIPLGGIGAGSIGRSYRGEFQRFKLFH-----GICDDAPVLANQFS 128
V ++ ++ TS G+P+GG+G GS G F+ + +F+ G +
Sbjct: 2 VLHLRISYYDTSLAGVPMGGLGTGSFEIGPDGRFREWLIFNNRPWAGYGQPQWFMTPDDL 61
Query: 129 VFVSR--PNGEK--FSSVLCPRSPGVPKKNTDSGIESW--------DWNLKGENCTYHAL 176
VF R P G++ + G + T G W W E
Sbjct: 62 VFFLRIEPEGQEPILRGLFTGLWYGSAEDYTYRGCGPWVVAEPYHLPWAKGVEGVEMDVR 121
Query: 177 FPRAWTVYDGEPDPEL--RIVCRQISPFIPHNYKESSFPASVFTFTLSNSGQTSADVTLL 234
+PR Y EL + SP +P + K SS PA + F ++N G + V+L+
Sbjct: 122 YPRVTLSYRDSLLDELGVEVSLDAYSPLLPGDLKASSTPAVILRFRVTNRGTSRVKVSLM 181
Query: 235 FTWANSVAGDSGLSGHHFNSKTMTKDGVHGLTLHHRTANGRPPVTFAVAAEETADVHVSE 294
N +G +S+ + + + R P+ A + S
Sbjct: 182 AVARNLARS----AGGRPSSEVREGPDWTAMIMSACCVDERHPMARGALAVASLGQQSSG 237
Query: 295 CPCFLLSGNSKGITAKDMWNEIKKHGSFDHLDNDKTSPSEPGSSI---GAAIAASLTIPS 351
+ S N + + G D T PS S + A + +
Sbjct: 238 AVGSVDSSNRPQLVNLLRRLLVDFRG-----DGTLTGPSSMTSDLDVFSAVATRPIELTP 292
Query: 352 GSTRSVTFSLAWDCP-EVKFFEKVYHRRYTKFYGTLGDSAARIAHDAILEHAKWECEIEA 410
GS+ + +++AW P + + Y F+G + + I + A+ E A
Sbjct: 293 GSSEELYWAIAWHFPNHIDASGQRVGHYYENFFGNVEQVISYIFENKHELTARLEAFSSA 352
Query: 411 WQRPILEDKRFPEWY------PITLFNELYYLNAGG--TIWTDGSPPMQSLATIRERKFS 462
+ D + +W +T +L +L G +W +G P L T+ ++
Sbjct: 353 -----MYDASYDDWVIDLVTAQLTSIPKLSWLTRDGHFGLW-EGGPGCCGLNTVDVVLWA 406
Query: 463 LDTPSKIHTPASSDTALGTRLLENGEENIGQFLYLEGAEYVMYNTYDVHFYSSFALVMLF 522
++ L++ +++ + + L ++ Y + + F+ + +L
Sbjct: 407 FAGVINMYP----------ELVKVAVKDVVRHI-LRPDKHYYYELFALGFHENMSLYREM 455
Query: 523 PKLELSIQRDFAAAVMMHDPGTMKIMSDGKWVARKCLGAVPHDIGLDDPWFEINSYNLFN 582
+ + SIQ D A + I + + G VPH S+NL +
Sbjct: 456 LRKDPSIQNDPAKF-------SEAIAEVVRRTGKDPTGRVPHSFR--------ASFNLID 500
Query: 583 SSRWKDLNSKFVLQVYRDFVATGDKNFARAVWPSVYIAM-AYMEQFDKDGDGMIEN---- 637
S DL +F+L + ATGD+ + R++W V + Q D G G+IE+
Sbjct: 501 SYDRNDLMPEFLLIALLPYYATGDREYLRSLWNDVLTVIDGTRRQHDSRGTGLIEHYMPS 560
Query: 638 --EGF----------------------------PD------QTYDAWSANGVSAYCGGLW 661
EG P+ T+D +S GV+++ G LW
Sbjct: 561 DYEGLSRVAQDAAEKGLVPGFYGGNVVKLLFSGPEFVMNSVNTFDTFSLLGVASFTGDLW 620
Query: 662 VAALQAASALANDVGDHASASYFWVRYQKAKAVYDSLWNGSYFN--YDNSDGSSSTSIQA 719
A+L+AA+ A+ G + + + +++ LWNGSYF+ YD G +I +
Sbjct: 621 AASLRAAAEAASREGLARAEELRQLSLRAYESMVKLLWNGSYFDDWYDPESGLRDRAILS 680
Query: 720 DQLAGQWYARACGLLPIADEAKVKKALTKIYDFNVLKVKGGMCGAMNGM----------Q 769
QL G+WY + GL D KV+ AL ++Y N + +G + G G +
Sbjct: 681 AQLTGEWYLQVLGLGDGVDSEKVRSALREVYRANFRRWEGLLNGTYPGTPRPSMVGDVEE 740
Query: 770 PDGR--IDMSGLQAREIWPGVTYGLAASMIQEEMVDMAFQTAAGVYE 814
P+G ++ QA W GV +G+A+ M+ E MV+ A + +++
Sbjct: 741 PNGTKILNRVSSQADTPWTGVEFGVASQMLYEGMVEEAMEVLRSIHD 787
>gi|320109368|ref|YP_004184958.1| hypothetical protein AciPR4_4219 [Terriglobus saanensis SP1PR4]
gi|319927889|gb|ADV84964.1| hypothetical protein AciPR4_4219 [Terriglobus saanensis SP1PR4]
Length = 898
Score = 105 bits (263), Expect = 8e-20, Method: Compositional matrix adjust.
Identities = 171/762 (22%), Positives = 283/762 (37%), Gaps = 147/762 (19%)
Query: 76 NVFRKHHITSDQGIPLGGIGAGSIGRSYRGEFQRFKLFHGICDDAPVLANQFSVFVSRP- 134
N H G+PLGGIG G++ + G F R + + D A P
Sbjct: 56 NYLALHDAPGPSGVPLGGIGVGAVDLAPDGHFTRIAVNNWATDSGVARAR-----AENPE 110
Query: 135 -NGEKFSSVLCPRSPG-------VPKKNTDSGIESWDWNLKGENCTYHALFPRAWTVYDG 186
+ E F +V S G + T G+ ++ + TY LFP A +D
Sbjct: 111 WDAEAFLAVWEKDSSGNVAAVRLQRDRRTAYGMGAYG------HSTYRGLFPTARIAFD- 163
Query: 187 EPD---PELRIVCRQISPFIPHNYKESSFPASVFTFTLSNSGQTSADVTLLFTWAN---- 239
+P P + S P N K+SS P TL+NS + ++ +W +
Sbjct: 164 DPSANSPHSLVSVFAYSGLAPQNVKDSSLPGFWIEVTLANSDSKPVEASVALSWPDLVGR 223
Query: 240 ---SVAGDSGLSGHH---------------FNSKTMTKDGVHGLTLHHRTANGRPPVTF- 280
VA + +G+ + + D +HGL TF
Sbjct: 224 GIYDVAPGAEKTGYDRAFDIEKLVAVPSPATQVEALQMDSLHGLRQFSTATPSIRQATFQ 283
Query: 281 ------AVAAEETADVHVSECPCFLLSGNSKGITAKDMWNEIKKHGSFDHLDNDKTSPSE 334
A+ AE VS P + SK T D W ++ G+F T+ S+
Sbjct: 284 NYINAVAILAEVPEGGSVSTNPAW-----SK--TDADSWKSFREDGAFPS-RTGTTALSQ 335
Query: 335 PGSSIGAAIAA-SLTIPSGSTRSVTFSLAWDCP-----EVKFFEKVYHRRYTKFYGTLGD 388
G GA++ A I G ++ F + W P V+F Y R + ++
Sbjct: 336 SGKLSGASVVAIKAHIAGGGKQTFRFLITWSIPTLPHASVQFGTSDYGRYFQNYFPDF-- 393
Query: 389 SAARIAHDAILEHAKWECEIEAWQRPILEDKRFPEWYPITLFNELYYLNAGGTIWTDGSP 448
+ H E A+ E +WQ+PIL D P+W L +N+G TI+T+
Sbjct: 394 --QSLIHYEFSERARILRETLSWQKPIL-DSTLPDWLKFKL------INSGYTIYTN--- 441
Query: 449 PMQSLATIRERKFSLDTPSKIHTPASSDTALGTRLLENGEENIGQFLYLEGAEYVMYNTY 508
T L ++G+ F +EG + T
Sbjct: 442 --------------------------------TMLNKSGD-----FSVMEGGMGGLAGTM 464
Query: 509 DVHFYSSFALVMLFPKLELSIQRDFAAAVMMHDPGTMKIMSDGKWVARKCLGAVPHDIGL 568
D + F +L+ + + F A D G + DG + +GL
Sbjct: 465 DQRLVAHPFYQKFFTQLDRAELQRF-ADTQDTDGGILHF--DGSYF-----------VGL 510
Query: 569 DDPWFEINSYNLFNSSRWKDLNSKFVLQVYRDFVATGDKNFARAVWPSVYIAMAYMEQFD 628
D + + + D +V+Q+ +D+ TGD F + + AYM+
Sbjct: 511 ADD----HGHTPVPHEKMIDNTGAWVIQLAKDYQQTGDDTFIKKNADHIRRGFAYMKAQI 566
Query: 629 KDGDGMIENEGFPDQTYDAWSANGVSAYCGGLWVAALQAASALANDVGDHASASYFWVRY 688
+D D I G QTYD + +S Y G +++AAL+A L + +GD + ++
Sbjct: 567 RD-DSFIPVGG---QTYDDFPHPEISVYTGTVYLAALRAGVVLGDALGDTQMSHEAETQF 622
Query: 689 QKAKA-VYDSLWNGSYFNYDNSDGSS---STSIQADQLAGQWYARACGLLPIADEAKVKK 744
++ +A + +LWNG +F Y G + + + QLAGQ+ +R G + + K
Sbjct: 623 KQTQAGLIHALWNGRFFAYGTDIGGAHRRDDRLFSGQLAGQFLSRYAGWGDVLPHDQAKS 682
Query: 745 ALTKIYDFNVLKVKGGMCGAMNGMQPDGRIDMSGLQAREIWP 786
A+ + +VL + ++ +DM G ++ WP
Sbjct: 683 AIYEQLHTSVLGTPDFYAPKVWDLEMQRGVDMPGSRS---WP 721
>gi|158316208|ref|YP_001508716.1| hypothetical protein Franean1_4431 [Frankia sp. EAN1pec]
gi|158111613|gb|ABW13810.1| protein of unknown function DUF608 [Frankia sp. EAN1pec]
Length = 934
Score = 105 bits (261), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 169/800 (21%), Positives = 296/800 (37%), Gaps = 117/800 (14%)
Query: 141 SVLCPRSPGVPKKNTDSGIESWDWNLKGENCTYHALFPRAWTVYDGEPDPELRIVCRQIS 200
++ P + P + + +W+ LK + +Y LFP + Y G + ++ + S
Sbjct: 163 TLAVPHTNTAPDRTFGDVLPAWN-TLKPGDGSYAVLFPFGYMTYSGF---QSKVSTKIWS 218
Query: 201 PFIPHNYKESSFPASVFTFTLSNSGQTSADVTLLFTWANSVAGDSG--LSGHHFNSKTMT 258
P + + + +S P + F ++N ++++ T+ N+ A +G +G + +
Sbjct: 219 PIVANEDERTSMPVAFFDMLMNNPTAKPIKISVMLTFPNAPAFATGSVRTGLYSRFDRDS 278
Query: 259 KDGVHGLTLHHRTANGRPPVT---FAVAAEETADVHVSECPCFLLSGNSKGITAKDMWNE 315
G+ G+TL + P + +AA A ++ C + SG+ I A ++
Sbjct: 279 ASGIGGVTLGSDSPENTPDTVTSEWTIAAHPFAGQTLTYCTSWDGSGDGSDIYAP--FSA 336
Query: 316 IKKHGSFDHLDNDKTSPSEPGSSIGAAIAASLTIPSGSTRSVTFSLAWDCPEVKF----- 370
G + D D+ S+ A+A +LT+ T++V F+L+WD P+V +
Sbjct: 337 AGADGKLPNGDIDQ-------SASAGAVAVALTLEPDQTQTVRFALSWDFPQVYYDGEDA 389
Query: 371 -FEKVYHRRYTKFYGTLGDSAARIAHD----------AILEHAKWE---CEIEAWQRPIL 416
V+ RRYT F G D A E A+++ +E+W +PI
Sbjct: 390 TTRAVWMRRYTAFLGGKTSRTNDYVQDSYPFRQGFTIARKELARYDDSLAAVESWWKPIA 449
Query: 417 EDKRFPEWYPITLFNELYYLNAGGTIWTDGSPPMQSLATIRERKFSLDTPSKIHTPASSD 476
E+ + P W NELY++ G+ W G P S +
Sbjct: 450 ENPQVPPWLRKASLNELYHMIFNGSFWESGL-------------------VSNTMPMSVE 490
Query: 477 TALGTRLLENGEENIGQFLYLEGAEYVMYNTYDVHFYSSFALVMLFPKLELSIQRDFAAA 536
RL E F G N D+ LF LEL R
Sbjct: 491 EGTSPRLGSAIPETHIYFHADGGDGGAQTNEVDMDSSGYLVFAKLFRSLELGRVRPLLQM 550
Query: 537 VMMHDPGTMKIM-----SDGKWVARKCLGAVPHDIGLDDPWFEINSY--------NLFNS 583
V + G +++ S G ++ + A ++ P N +LF
Sbjct: 551 VRQNPLGIGRVIQQTFRSSGPYITQT---ASFQNLPFSKPPTAGNPPAPPTRDLGDLFAD 607
Query: 584 SR---WKDLNSKFVLQVYRDFVATGDKNFARAVWPSVYIAMAYMEQFDKDGDGMIENEGF 640
++D K + + Y D + + + A+ Y + F G + +
Sbjct: 608 EAGDPFRDCPHKLIYRTYALIKFYDDDDLLEYGYAPMLKALTYSQFFRPTGSHLPADPAS 667
Query: 641 --PDQTYDAWSANGVSAYCGGLWVAALQAASALANDVG----DHASASY---FWVRYQKA 691
P T D NG Y GL++ +LQ S L D A+ A
Sbjct: 668 NNPPNTMDQAVVNGHGIYNCGLYLLSLQILSTLTPQAARLGVDEATPEIQKELDEELAAA 727
Query: 692 KAVYDSL-WNGSYFNYDNSDGSSST------------------SIQADQLAGQWYARACG 732
K ++ + WN + Y DG+ +I + AGQ A G
Sbjct: 728 KEEFERIFWNPATGRYRYCDGTGGIGDRTGTIRGRFKPVPPPDAIWLESFAGQLVAMELG 787
Query: 733 LLPIADEAKVKKALTKIYDFNV--LKVKGGMCGAMNGMQPDGRIDMSGLQAREI---WPG 787
L + D + L D V +G + G ++PD I S L+ EI PG
Sbjct: 788 LPDVVDLDHARTHLKNTLDSFVRFRDPEGNLMGGPIILKPDFSIYPSSLRTTEINEVIPG 847
Query: 788 VTYGLAASMIQ-------EEMVDMAFQTAAGVYEVAWSEDGLGYSFQTPESWNNNDEY-- 838
+ + AA + +++ + A + G + + GY+F TPESW +D +
Sbjct: 848 IAFLAAAGAFRIGAKVKDKDITEKALKLGEGCALQIYDIESNGYAFATPESWFVDDHHIS 907
Query: 839 RSLCYMRPLTIWAMQWALTK 858
R Y R ++W++ A+++
Sbjct: 908 RFPGYTRTRSVWSLYDAVSE 927
>gi|224535965|ref|ZP_03676504.1| hypothetical protein BACCELL_00829 [Bacteroides cellulosilyticus
DSM 14838]
gi|224522420|gb|EEF91525.1| hypothetical protein BACCELL_00829 [Bacteroides cellulosilyticus
DSM 14838]
Length = 899
Score = 104 bits (260), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 157/697 (22%), Positives = 261/697 (37%), Gaps = 152/697 (21%)
Query: 88 GIPLGGIGAGSIGRSYRGEFQRFKLFHGICDDAPVLANQFSVFV--SRPNGEKFSSVLCP 145
G+PLGGIG G+I + G+F R + + P+ ++ S F +R EK +V
Sbjct: 64 GVPLGGIGVGNINFAPSGKFTRIGMNN---IHTPIKRSEHSFFSLWTRKGNEK-EAVRLV 119
Query: 146 RSPGVPKKNTDSGIESWDWNLKG-ENCTYHALFPRAWTVYDGEPDPELRI--VCRQISPF 202
R V + +KG E+ Y LFP A Y D L++ V R S
Sbjct: 120 RDNHVL------------YGMKGVEHTQYKGLFPTAELSY---ADNNLKVTPVIRAYSGL 164
Query: 203 IPHNYKESSFPASVFTFTLSNSGQTSADVTLLFTWANSVA-GDSGLSGHHFNSKTMTKDG 261
+PHN K+SS P F L + Q + L F+W + + + S F++ + +G
Sbjct: 165 VPHNVKDSSLPVVWFEVDLIS--QEDMEAALAFSWEDFIGLFNDPKSLEGFDNGQLLSEG 222
Query: 262 ---------------------------VHGLTLHHRTANGRPPVTF-------AVAAEET 287
+ GL + + +TF +A EE
Sbjct: 223 RANINNGENWPLREKAKTYVEPYQMGSLKGLIQYAADSLQPRKLTFQNYVNQVVIAVEE- 281
Query: 288 ADVHVSECPCFLLSGNSKGITAKDMWNEIKKHGSFDHLDNDKTSPSEPGSSIGAAIAASL 347
+ +VS P + NS+ W++ + +G F + +S +A+A
Sbjct: 282 -EKNVSYLPTY--RSNSEA------WDQFRNNGEFTSSLTKNVLTEQSQTSSASALAVKT 332
Query: 348 TIPSGSTRSVTFSLAWDCPEVKFFEKVY--------HRRYTKFYGTLGDSAARIAHDAIL 399
+ +G +++ F LAW PE++ Y K+Y + + A+
Sbjct: 333 QLKAGQKKTIRFMLAWYAPELQIDAAALPIGSYWPCGADYNKYYHNYFNCMNSMVSYAVS 392
Query: 400 EHAKWECEIEAWQRPILEDKRFPEWYPITLFNELYYLNAGGTIWTDGSPPMQSLATIRER 459
A+ + WQ P+LE P+WY L +N+G I+T+
Sbjct: 393 NRARIARQTTEWQIPVLESS-LPDWYKFKL------INSGYVIYTN-------------- 431
Query: 460 KFSLDTPSKIHTPASSDTALGTRLLENGEENIGQFLYLEGAEYVMYNTYDVHFYSSFALV 519
+L G G + EGA T D S
Sbjct: 432 ----------------------MVLTKG----GDVMVNEGAMGGFAGTMDQRLSSHPFYQ 465
Query: 520 MLFPKLELSIQRDFAAAVMMHDPGTMKIMSDGKWVARKCLGAVPHDIGLDDPWFEINSYN 579
F +L+ S FA A+ DP + G + +G V + + W
Sbjct: 466 KFFTQLDRSEMDIFADAM---DPEGYILHFIGHYYV--GMGTVGGRVPTEKGWM------ 514
Query: 580 LFNSSRWKDLNSKFVLQVYRDFVATGDKNFARAVWPSVYIAMAYMEQFDKDGDGMIENEG 639
L N+S W ++Q+ +D+ TGD + +A + AM ++ G +
Sbjct: 515 LDNASGW-------IIQLVKDYEQTGDTEYLKAHLTGLKRAMKFLYSRMPQGSTI----P 563
Query: 640 FPDQTYDAWSANGVSAYCGGLWVAALQAASALANDVGDHASASYFWVRYQKA-KAVYDSL 698
TYD ++ + +Y G+W+ L+A A+ +GD + ++ + K + L
Sbjct: 564 VGPTTYDDFTHPPLYSYYAGVWLTTLKAYEAIGKAIGDESIVKQAQQQFATSQKEALEKL 623
Query: 699 WNGSYFNYD-NSDGSS--STSIQADQLAGQWYARACG 732
WNG +F Y DGS + QLAGQ+ +R CG
Sbjct: 624 WNGRFFAYGCEPDGSKRLDNVLFTGQLAGQFLSRYCG 660
>gi|423226023|ref|ZP_17212489.1| hypothetical protein HMPREF1062_04675 [Bacteroides cellulosilyticus
CL02T12C19]
gi|392630880|gb|EIY24861.1| hypothetical protein HMPREF1062_04675 [Bacteroides cellulosilyticus
CL02T12C19]
Length = 899
Score = 103 bits (258), Expect = 3e-19, Method: Compositional matrix adjust.
Identities = 156/697 (22%), Positives = 260/697 (37%), Gaps = 152/697 (21%)
Query: 88 GIPLGGIGAGSIGRSYRGEFQRFKLFHGICDDAPVLANQFSVFV--SRPNGEKFSSVLCP 145
G+PLGGIG G+I + G+F R + + P+ ++ S F +R EK +V
Sbjct: 64 GVPLGGIGVGNINFAPSGKFTRIGMNN---IHTPIKRSEHSFFSLWTRKGNEK-EAVRLV 119
Query: 146 RSPGVPKKNTDSGIESWDWNLKG-ENCTYHALFPRAWTVYDGEPDPELRI--VCRQISPF 202
R V + +KG E+ Y LFP A Y D L++ V R S
Sbjct: 120 RDNHVL------------YGMKGVEHTQYKGLFPTAELSY---ADNNLKVTPVIRAYSGL 164
Query: 203 IPHNYKESSFPASVFTFTLSNSGQTSADVTLLFTWANSVA-GDSGLSGHHFNSKTMTKDG 261
+PHN K+SS P V F + Q + L F+W + + + S F++ + +G
Sbjct: 165 VPHNVKDSSLP--VVWFEVDLIAQEDMEAALAFSWEDFIGLFNDPKSLEGFDNGQLLSEG 222
Query: 262 ---------------------------VHGLTLHHRTANGRPPVTF-------AVAAEET 287
+ GL + + +TF +A EE
Sbjct: 223 RANINNGENWPLREKAKTYVEPYQMGSLKGLIQYAADSLQPRKLTFQNYVNQVVIAVEE- 281
Query: 288 ADVHVSECPCFLLSGNSKGITAKDMWNEIKKHGSFDHLDNDKTSPSEPGSSIGAAIAASL 347
+ +VS P + NS+ W + + +G F + +S + +A
Sbjct: 282 -EKNVSYLPAY--RSNSEA------WEQFRNNGEFTSSLTKNVLTEQSQTSSASVLAVKT 332
Query: 348 TIPSGSTRSVTFSLAWDCPEVKFFEKVY--------HRRYTKFYGTLGDSAARIAHDAIL 399
+ +G +++ F LAW PE++ Y K+Y ++ + A+
Sbjct: 333 QLKAGQKKTIRFMLAWYAPELQIDAAALPIGSYWPCGADYNKYYHNYFNNMNSMVSYAVS 392
Query: 400 EHAKWECEIEAWQRPILEDKRFPEWYPITLFNELYYLNAGGTIWTDGSPPMQSLATIRER 459
A+ + WQ P+LE P+WY L +N+G I+T+
Sbjct: 393 NRARIARQTTEWQIPVLESS-LPDWYKFKL------INSGYVIYTN-------------- 431
Query: 460 KFSLDTPSKIHTPASSDTALGTRLLENGEENIGQFLYLEGAEYVMYNTYDVHFYSSFALV 519
+L G G + EGA T D S
Sbjct: 432 ----------------------MVLTKG----GDVMVNEGAMGGFAGTMDQRLSSHPFYQ 465
Query: 520 MLFPKLELSIQRDFAAAVMMHDPGTMKIMSDGKWVARKCLGAVPHDIGLDDPWFEINSYN 579
F +L+ S FA A+ DP + G + +G V + + W
Sbjct: 466 KFFTQLDRSEMDIFADAM---DPEGYILHFIGHYYV--GMGTVGGRVPTEKGWM------ 514
Query: 580 LFNSSRWKDLNSKFVLQVYRDFVATGDKNFARAVWPSVYIAMAYMEQFDKDGDGMIENEG 639
L N+S W ++Q+ +D+ TGD + +A + AM ++ G +
Sbjct: 515 LDNASGW-------IIQLVKDYEQTGDTEYLKAHLTGLKRAMKFLYSRMPQGSTI----P 563
Query: 640 FPDQTYDAWSANGVSAYCGGLWVAALQAASALANDVGDHASASYFWVRYQKA-KAVYDSL 698
TYD ++ + +Y G+W+ L+A A+ +GD + ++ + K + L
Sbjct: 564 VGPTTYDDFTHPPLYSYYAGVWLTTLKAYEAIGKAIGDESIVKQAQQQFATSQKEALEKL 623
Query: 699 WNGSYFNYD-NSDGSS--STSIQADQLAGQWYARACG 732
WNG +F Y DGS + QLAGQ+ +R CG
Sbjct: 624 WNGRFFAYGCEPDGSKRLDNVLFTGQLAGQFLSRYCG 660
>gi|385775519|ref|YP_005648087.1| hypothetical protein [Sulfolobus islandicus REY15A]
gi|323474267|gb|ADX84873.1| conserved hypothetical protein [Sulfolobus islandicus REY15A]
Length = 661
Score = 100 bits (250), Expect = 3e-18, Method: Compositional matrix adjust.
Identities = 81/297 (27%), Positives = 129/297 (43%), Gaps = 35/297 (11%)
Query: 513 YSSFALVMLFPKLELSIQRDFAAAVMMHDPGTMKIMSDGKWVARKCLGAVPHDIG---LD 569
+ S ++++FP+LE S + + V + G VPHD+G LD
Sbjct: 350 FGSLPVILMFPELEKSFLKLLISYV-------------------RNDGYVPHDLGFHSLD 390
Query: 570 DPWFEINSYNLFNSSRWKDLNSKFVLQVYRDFVATGDKNFARAVWPSVYIAMAYMEQFDK 629
P S +WKD+N VL VYR F T D +F + V+P++ M + + +
Sbjct: 391 SPIDGTTS-----PPKWKDMNPSLVLLVYRYFKFTNDIDFLKEVYPTIVKVMDW--ELRQ 443
Query: 630 DGDGMIENEGFPDQTYDAWSANGVSAYCGGLWVAALQAASALANDVGDHASASYFWVRYQ 689
DG+ EG D +DA G +Y L++A+L A +A VGD + +
Sbjct: 444 CRDGLPFMEGEMDNAFDATIIKGHDSYTSSLFIASLIAMREIAKLVGDSNYVGFINEKLS 503
Query: 690 KAKAVYDSLWNGSYFNYDNSDGSSSTSIQADQLAGQWYARACGLLPIADEAKVKKALTKI 749
A+ + ++NG YF DG S A QL G+W+ L IADE +K AL I
Sbjct: 504 VAREAFRKMFNGKYFK--AWDGVDKASFLA-QLYGEWFTTLLELENIADENMIKSALESI 560
Query: 750 YDFNVLKVKGGMCGAMNGMQPDGRIDMSGLQAREIWPGVTYGLAASMIQEEMVDMAF 806
N N + +G+I +Q WP + + + ++ + D++F
Sbjct: 561 IRLN---GNASPYCVPNLVDENGKIVNLSVQTYSSWPRLVFAICWLAYKKGVGDLSF 614
>gi|238619389|ref|YP_002914214.1| hypothetical protein M164_0933 [Sulfolobus islandicus M.16.4]
gi|238380458|gb|ACR41546.1| protein of unknown function DUF608 [Sulfolobus islandicus M.16.4]
Length = 661
Score = 100 bits (248), Expect = 5e-18, Method: Compositional matrix adjust.
Identities = 80/297 (26%), Positives = 128/297 (43%), Gaps = 35/297 (11%)
Query: 513 YSSFALVMLFPKLELSIQRDFAAAVMMHDPGTMKIMSDGKWVARKCLGAVPHDIG---LD 569
+ S ++++FP+LE S + + V + G VPHD+G LD
Sbjct: 350 FGSLPVILMFPELEKSFLKLLISYV-------------------RNDGYVPHDLGFHSLD 390
Query: 570 DPWFEINSYNLFNSSRWKDLNSKFVLQVYRDFVATGDKNFARAVWPSVYIAMAYMEQFDK 629
P S +WKD+N VL VYR F T D +F + V+P++ M + + +
Sbjct: 391 SPIDGTTS-----PPKWKDMNPSLVLLVYRYFKFTNDIDFLKEVYPTIVKVMNW--ELRQ 443
Query: 630 DGDGMIENEGFPDQTYDAWSANGVSAYCGGLWVAALQAASALANDVGDHASASYFWVRYQ 689
DG+ EG D +DA G +Y L++A+L A +A VGD + +
Sbjct: 444 CRDGLPFMEGEMDNAFDATIIKGHDSYTSSLFIASLIAMREIAKLVGDSNYVGFINEKLS 503
Query: 690 KAKAVYDSLWNGSYFNYDNSDGSSSTSIQADQLAGQWYARACGLLPIADEAKVKKALTKI 749
A+ + ++NG YF DG S A QL G+W+ L I DE +K AL I
Sbjct: 504 VAREAFRKMFNGKYFK--AWDGVDKASFLA-QLYGEWFTTLLELENIVDENMIKSALESI 560
Query: 750 YDFNVLKVKGGMCGAMNGMQPDGRIDMSGLQAREIWPGVTYGLAASMIQEEMVDMAF 806
N N + +G+I +Q WP + + + ++ + D++F
Sbjct: 561 IRLN---GNASPYCVPNLVDENGKIVNLSVQTYSSWPRLVFAICWLAYKKGVGDLSF 614
>gi|229584458|ref|YP_002842959.1| hypothetical protein M1627_0991 [Sulfolobus islandicus M.16.27]
gi|228019507|gb|ACP54914.1| protein of unknown function DUF608 [Sulfolobus islandicus M.16.27]
Length = 661
Score = 99.8 bits (247), Expect = 6e-18, Method: Compositional matrix adjust.
Identities = 80/297 (26%), Positives = 128/297 (43%), Gaps = 35/297 (11%)
Query: 513 YSSFALVMLFPKLELSIQRDFAAAVMMHDPGTMKIMSDGKWVARKCLGAVPHDIG---LD 569
+ S ++++FP+LE S + + V + G VPHD+G LD
Sbjct: 350 FGSLPVILMFPELEKSFLKLLISYV-------------------RNDGYVPHDLGFHSLD 390
Query: 570 DPWFEINSYNLFNSSRWKDLNSKFVLQVYRDFVATGDKNFARAVWPSVYIAMAYMEQFDK 629
P S +WKD+N VL VYR F T D +F + V+P++ M + + +
Sbjct: 391 SPIDGTTS-----PPKWKDMNPSLVLLVYRYFKFTNDIDFLKEVYPTIVKVMDW--ELRQ 443
Query: 630 DGDGMIENEGFPDQTYDAWSANGVSAYCGGLWVAALQAASALANDVGDHASASYFWVRYQ 689
DG+ EG D +DA G +Y L++A+L A +A VGD + +
Sbjct: 444 CRDGLPFMEGEMDNAFDATIIKGHDSYTSSLFIASLIAMREIAKLVGDSNYVGFINEKLS 503
Query: 690 KAKAVYDSLWNGSYFNYDNSDGSSSTSIQADQLAGQWYARACGLLPIADEAKVKKALTKI 749
A+ + ++NG YF DG S A QL G+W+ L I DE +K AL I
Sbjct: 504 VAREAFRKMFNGKYFK--AWDGVDKASFLA-QLYGEWFTTLLELENIVDENMIKSALESI 560
Query: 750 YDFNVLKVKGGMCGAMNGMQPDGRIDMSGLQAREIWPGVTYGLAASMIQEEMVDMAF 806
N N + +G+I +Q WP + + + ++ + D++F
Sbjct: 561 IRLN---GNASPYCVPNLVDENGKIVNLSVQTYSSWPRLVFAICWLAYKKGVGDLSF 614
>gi|227827237|ref|YP_002829016.1| hypothetical protein M1425_0924 [Sulfolobus islandicus M.14.25]
gi|227459032|gb|ACP37718.1| protein of unknown function DUF608 [Sulfolobus islandicus M.14.25]
Length = 661
Score = 99.8 bits (247), Expect = 6e-18, Method: Compositional matrix adjust.
Identities = 80/297 (26%), Positives = 128/297 (43%), Gaps = 35/297 (11%)
Query: 513 YSSFALVMLFPKLELSIQRDFAAAVMMHDPGTMKIMSDGKWVARKCLGAVPHDIG---LD 569
+ S ++++FP+LE S + + V + G VPHD+G LD
Sbjct: 350 FGSLPVILMFPELEKSFLKLLISYV-------------------RNDGYVPHDLGFHSLD 390
Query: 570 DPWFEINSYNLFNSSRWKDLNSKFVLQVYRDFVATGDKNFARAVWPSVYIAMAYMEQFDK 629
P S +WKD+N VL VYR F T D +F + V+P++ M + + +
Sbjct: 391 SPIDGTTS-----PPKWKDMNPSLVLLVYRYFKFTNDIDFLKEVYPTIVKVMDW--ELRQ 443
Query: 630 DGDGMIENEGFPDQTYDAWSANGVSAYCGGLWVAALQAASALANDVGDHASASYFWVRYQ 689
DG+ EG D +DA G +Y L++A+L A +A VGD + +
Sbjct: 444 CRDGLPFMEGEMDNAFDATIIKGHDSYTSSLFIASLIAMREIAKLVGDSNYVGFINEKLS 503
Query: 690 KAKAVYDSLWNGSYFNYDNSDGSSSTSIQADQLAGQWYARACGLLPIADEAKVKKALTKI 749
A+ + ++NG YF DG S A QL G+W+ L I DE +K AL I
Sbjct: 504 VAREAFRKMFNGKYFK--AWDGVDKASFLA-QLYGEWFTTLLELENIVDENMIKSALESI 560
Query: 750 YDFNVLKVKGGMCGAMNGMQPDGRIDMSGLQAREIWPGVTYGLAASMIQEEMVDMAF 806
N N + +G+I +Q WP + + + ++ + D++F
Sbjct: 561 IRLN---GNASPYCVPNLVDENGKIVNLSVQTYSSWPRLVFAICWLAYKKGVGDLSF 614
>gi|385772401|ref|YP_005644967.1| hypothetical protein [Sulfolobus islandicus HVE10/4]
gi|323476515|gb|ADX81753.1| conserved hypothetical protein [Sulfolobus islandicus HVE10/4]
Length = 661
Score = 99.4 bits (246), Expect = 9e-18, Method: Compositional matrix adjust.
Identities = 80/297 (26%), Positives = 129/297 (43%), Gaps = 35/297 (11%)
Query: 513 YSSFALVMLFPKLELSIQRDFAAAVMMHDPGTMKIMSDGKWVARKCLGAVPHDIG---LD 569
+ S ++++FP+LE S + + V + G VPHD+G LD
Sbjct: 350 FGSLPVILMFPELEKSFLKLLISYV-------------------RNDGYVPHDLGFHSLD 390
Query: 570 DPWFEINSYNLFNSSRWKDLNSKFVLQVYRDFVATGDKNFARAVWPSVYIAMAYMEQFDK 629
P S +WKD+N VL VYR F T D +F + V+P++ M + + +
Sbjct: 391 SPIDGTTS-----PPKWKDMNPSLVLLVYRYFKFTNDIDFLKEVYPTIVKVMDW--ELRQ 443
Query: 630 DGDGMIENEGFPDQTYDAWSANGVSAYCGGLWVAALQAASALANDVGDHASASYFWVRYQ 689
DG+ EG D +DA G +Y L++A+L A +A VGD + +
Sbjct: 444 CRDGLPFMEGEMDNAFDATIIKGHDSYTSSLFIASLIAMREIAKLVGDSNYVGFINEKLS 503
Query: 690 KAKAVYDSLWNGSYFNYDNSDGSSSTSIQADQLAGQWYARACGLLPIADEAKVKKALTKI 749
A+ + ++NG YF DG S A QL G+W+ L IA+E +K AL I
Sbjct: 504 VAREAFRKMFNGKYFK--AWDGVDKASFLA-QLYGEWFTTLLELENIANENMIKSALESI 560
Query: 750 YDFNVLKVKGGMCGAMNGMQPDGRIDMSGLQAREIWPGVTYGLAASMIQEEMVDMAF 806
N N + +G+I +Q WP + + + ++ + D++F
Sbjct: 561 IRLN---GNASPYCVPNLVDENGKIVNLSVQTYSSWPRLVFAICWLAYKKGVGDLSF 614
>gi|15898194|ref|NP_342799.1| hypothetical protein SSO1353 [Sulfolobus solfataricus P2]
gi|13814563|gb|AAK41589.1| Hypothetical protein SSO1353 [Sulfolobus solfataricus P2]
Length = 663
Score = 96.3 bits (238), Expect = 6e-17, Method: Compositional matrix adjust.
Identities = 80/300 (26%), Positives = 133/300 (44%), Gaps = 41/300 (13%)
Query: 513 YSSFALVMLFPKLELSIQRDFAAAVMMHDPGTMKIMSDGKWVARKCLGAVPHDIG---LD 569
+ S ++++FP+LE + F ++ H I DG VPHD+G LD
Sbjct: 352 FGSLPVILMFPELE----KSFLKLLIRH------IREDG---------YVPHDLGYHSLD 392
Query: 570 DPWFEINSYNLFNSSRWKDLNSKFVLQVYRDFVATGDKNFARAVWPSVYIAMAYMEQFDK 629
P S RWKD+N +L VYR F T D F + V+P + M + + K
Sbjct: 393 SPIDGTTS-----PPRWKDMNPSLILLVYRYFKFTNDIEFLKEVYPILVKVMDWELRQCK 447
Query: 630 DGDGMIENEGFPDQTYDAWSANGVSAYCGGLWVAALQAASALANDVGDHASASYFWVRYQ 689
+E E D +DA G +Y L++ +L A +A VGD + +
Sbjct: 448 GNLPFMEGE--MDNAFDATIIKGHDSYTSSLFIGSLIAMREIAKLVGDSNYVDFISEKLS 505
Query: 690 KAKAVYDSLWNGSYFN-YDNSDGSSSTSIQADQLAGQWYARACGLLPIADEAKVKKALTK 748
A+ + ++NG YF +D+ D +S + QL G+W+ GL I +E +KKAL
Sbjct: 506 SAREAFRRMFNGRYFKAWDSVDNASFLA----QLYGEWFTTLVGLEDIVEEDIIKKALES 561
Query: 749 IYDFNVLKVKGGMC--GAMNGMQPDGRIDMSGLQAREIWPGVTYGLAASMIQEEMVDMAF 806
I +++ G N + +G+I +Q WP + + + ++ + D++F
Sbjct: 562 I-----IRLNGNASPHCVPNLVDDNGKIVGLSVQTYSSWPRMVFAICWLAYKKGVGDLSF 616
>gi|15898746|ref|NP_343351.1| hypothetical protein SSO1948 [Sulfolobus solfataricus P2]
gi|229578524|ref|YP_002836922.1| hypothetical protein YG5714_0710 [Sulfolobus islandicus Y.G.57.14]
gi|229582969|ref|YP_002841368.1| hypothetical protein YN1551_2495 [Sulfolobus islandicus Y.N.15.51]
gi|284173504|ref|ZP_06387473.1| hypothetical protein Ssol98_02460 [Sulfolobus solfataricus 98/2]
gi|384435081|ref|YP_005644439.1| hypothetical protein [Sulfolobus solfataricus 98/2]
gi|13815223|gb|AAK42141.1| Hypothetical protein SSO1948 [Sulfolobus solfataricus P2]
gi|228009238|gb|ACP45000.1| protein of unknown function DUF608 [Sulfolobus islandicus
Y.G.57.14]
gi|228013685|gb|ACP49446.1| protein of unknown function DUF608 [Sulfolobus islandicus
Y.N.15.51]
gi|261603235|gb|ACX92838.1| protein of unknown function DUF608 [Sulfolobus solfataricus 98/2]
Length = 661
Score = 96.3 bits (238), Expect = 8e-17, Method: Compositional matrix adjust.
Identities = 72/250 (28%), Positives = 112/250 (44%), Gaps = 16/250 (6%)
Query: 560 GAVPHDIG---LDDPWFEINSYNLFNSSRWKDLNSKFVLQVYRDFVATGDKNFARAVWPS 616
G VPHD+G LD P S +WKD+N +L VYR F T D +F + V+P+
Sbjct: 378 GYVPHDLGFHSLDSPIDGTTS-----PPKWKDMNPSLILLVYRYFKFTNDIDFLKEVYPT 432
Query: 617 VYIAMAYMEQFDKDGDGMIENEGFPDQTYDAWSANGVSAYCGGLWVAALQAASALANDVG 676
+ M + + + DG+ EG D +DA G +Y L++A+L A +A VG
Sbjct: 433 IVKVMDW--ELRQCRDGLPFMEGEMDNAFDATIIKGHDSYTSSLFIASLIAMREIAKLVG 490
Query: 677 DHASASYFWVRYQKAKAVYDSLWNGSYFNYDNSDGSSSTSIQADQLAGQWYARACGLLPI 736
D + + A+ + ++NG YF DG S A QL G+W+ L I
Sbjct: 491 DSNYVGFINEKLNVAREAFRKMFNGKYFK--AWDGVDKASFLA-QLYGEWFTTLLELENI 547
Query: 737 ADEAKVKKALTKIYDFNVLKVKGGMCGAMNGMQPDGRIDMSGLQAREIWPGVTYGLAASM 796
DE +K AL I N N + +G+I +Q WP + + +
Sbjct: 548 VDENMIKSALESIIRLN---GNASPYCVPNLVDENGKIVNLSVQTYSSWPRLVFAICWLA 604
Query: 797 IQEEMVDMAF 806
++ + D++F
Sbjct: 605 YKKGVGDLSF 614
>gi|227829616|ref|YP_002831395.1| hypothetical protein LS215_0672 [Sulfolobus islandicus L.S.2.15]
gi|227456063|gb|ACP34750.1| protein of unknown function DUF608 [Sulfolobus islandicus L.S.2.15]
Length = 661
Score = 96.3 bits (238), Expect = 8e-17, Method: Compositional matrix adjust.
Identities = 72/250 (28%), Positives = 112/250 (44%), Gaps = 16/250 (6%)
Query: 560 GAVPHDIG---LDDPWFEINSYNLFNSSRWKDLNSKFVLQVYRDFVATGDKNFARAVWPS 616
G VPHD+G LD P S +WKD+N +L VYR F T D +F + V+P+
Sbjct: 378 GYVPHDLGFHSLDSPIDGTTS-----PPKWKDMNPSLILLVYRYFKFTNDIDFLKEVYPT 432
Query: 617 VYIAMAYMEQFDKDGDGMIENEGFPDQTYDAWSANGVSAYCGGLWVAALQAASALANDVG 676
+ M + + + DG+ EG D +DA G +Y L++A+L A +A VG
Sbjct: 433 IVKVMDW--ELRQCRDGLPFMEGEMDNAFDATIIKGHDSYTSSLFIASLIAMREIAKLVG 490
Query: 677 DHASASYFWVRYQKAKAVYDSLWNGSYFNYDNSDGSSSTSIQADQLAGQWYARACGLLPI 736
D + + A+ + ++NG YF DG S A QL G+W+ L I
Sbjct: 491 DSNYVGFINEKLNVAREAFRKMFNGKYFK--AWDGVDKASFLA-QLYGEWFTTLLELENI 547
Query: 737 ADEAKVKKALTKIYDFNVLKVKGGMCGAMNGMQPDGRIDMSGLQAREIWPGVTYGLAASM 796
DE +K AL I N N + +G+I +Q WP + + +
Sbjct: 548 VDENMIKSALESIIRLN---GNASPYCVPNLVDENGKIVNLSVQTYSSWPRLVFAICWLA 604
Query: 797 IQEEMVDMAF 806
++ + D++F
Sbjct: 605 YKKGVGDLSF 614
>gi|15922946|ref|NP_378615.1| hypothetical protein ST2609 [Sulfolobus tokodaii str. 7]
gi|15623737|dbj|BAB67724.1| putative glycosidase [Sulfolobus tokodaii str. 7]
Length = 661
Score = 95.1 bits (235), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 71/245 (28%), Positives = 117/245 (47%), Gaps = 26/245 (10%)
Query: 560 GAVPHDIG---LDDPWFEINSYNLFNSSRWKDLNSKFVLQVYRDFVATGDKNFARAVWPS 616
G VPHD+G LD P + +WKDLN ++L +YR + TGD F ++V+
Sbjct: 364 GYVPHDLGYCSLDSP-----TDGTTAPPKWKDLNPTYILLIYRYYKLTGDIEFLKSVYDK 418
Query: 617 VYIAMAYMEQFDKDGDGMIENEGFPDQTYDAWSANGVSAYCGGLWVAALQAASALANDVG 676
V A + +F + G EG D +D G+++Y L++A+L A ++ G
Sbjct: 419 VKKAFEWELKFSRYG-----LEGKMDSAFDVTPIKGINSYTLSLYIASLFAMREISKTAG 473
Query: 677 DHASASYFWVRYQKAKAVYDSLWNGSYFNYDNSDGSSSTSIQADQLAGQWYARACGLLPI 736
D+ + + ++AK ++ ++NG YF +G A Q+ G+W+ GL PI
Sbjct: 474 DNLNLDE---QIKEAKEAFEKMFNGKYF--IAWEGMEDAVFLA-QVFGEWWTTLLGLEPI 527
Query: 737 ADEAKVKKALTKIYDFNVLKVKGGMCG--AMNGMQPDGRIDMSGLQAREIWPGVTYGLAA 794
ADE K+K AL I +KV G N ++ DG++ Q WP + + +
Sbjct: 528 ADEEKIKSALRWI-----IKVNGNASKYCTPNLVKEDGKVVSLSPQTYSSWPRLVFAICW 582
Query: 795 SMIQE 799
I++
Sbjct: 583 LSIEK 587
>gi|383764296|ref|YP_005443278.1| hypothetical protein CLDAP_33410 [Caldilinea aerophila DSM 14535 =
NBRC 104270]
gi|381384564|dbj|BAM01381.1| hypothetical protein CLDAP_33410 [Caldilinea aerophila DSM 14535 =
NBRC 104270]
Length = 905
Score = 92.4 bits (228), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 140/614 (22%), Positives = 231/614 (37%), Gaps = 126/614 (20%)
Query: 194 IVCRQISPFIPHNYKESSFPASVFTFTLSNSGQTSADVTLLFTWANSV-AGDSGLSGHH- 251
+ R SPFIP N ++SS PA+ F +T++NS + S D T++ N + A + + HH
Sbjct: 162 VKLRAFSPFIPLNDRDSSLPAAFFEWTITNSTEESIDYTVVGVLCNPLPANNLNVVRHHP 221
Query: 252 -FNSKTMTKDGVHGLTLHHRTANGRPPVTFAVAAEETADVHVSECPCFLLSGNSKGITAK 310
+ MT DGV T + +T A A D +S + +
Sbjct: 222 WGCALHMTSDGVQPDTPSYGD------LTLATNANLVPDAAMSWQQYWFRGAWFDSLEV- 274
Query: 311 DMWNEIKKHGSFDHLDNDKTSPSEPGSSIGAAIAASLTIPSGSTRSVTFSLAWDCPEVKF 370
W ++ G F N G +A ++ G +++V F + W+ P +
Sbjct: 275 -YWKDLISGGRF---QNRVYPLQRAGEHNEGVLAVHFSLQPGESKTVNFVITWNFPICEN 330
Query: 371 FEKVYHRRYTKFYGTLGDSAARIAHDAILEHAKWECEIEAWQRPILEDKRFPEWYPITLF 430
+ K T + A + D+ A + + W+R + E RF E LF
Sbjct: 331 YWKPEQVLPQGISPTWKNYYATVWKDS---QASAQYALSEWERLLDETHRFKE----ALF 383
Query: 431 NELYYLNAGGTIWTDGSPPMQSLATIRERKFSLDTPSKIHTPASSDTALGTRLLENGEEN 490
S P L + L +P+ + T G E +
Sbjct: 384 A--------------SSLPPAVLDAVSANMAVLKSPTVLRL--EDGTLYG---WEGCHPD 424
Query: 491 IGQFLYLEGAEYVMYNTYDVHFYSSFALVMLFPKLELSIQR-DFAAAVMMHDPGTMKI-- 547
G EG+ ++N L LFPKLE S++ DFA + +I
Sbjct: 425 SG---CCEGSCTHVWNYQQT-------LPFLFPKLERSMREADFAYNLRPDGGMAFRIQL 474
Query: 548 -MSDGKWVARKCLGAVPHDIGLDDPWFEINSYNLFNSSRWKDLNSKFVLQVYRDFVATGD 606
+ G+W R C D VL+ YRD+ GD
Sbjct: 475 PLGAGRWDFRPC----------------------------ADGQFGGVLKTYRDWKICGD 506
Query: 607 KNFARAVWPSVYIAMAY------MEQFDKDGDGMIENEGFPDQTYDAWSANGVSAYCGGL 660
+ R +WPSV ++ + ++++D + G++ G T D G +++ +
Sbjct: 507 NEWLRRLWPSVKKSLEFAWHPNNIDRWDPEKTGVL--WGRQHHTLDM-ELFGPNSWLTSM 563
Query: 661 WVAALQAASALANDVGDHASASYFWVRYQKAKAVYDS-LWNGSYF--------------- 704
++AAL+A + +A +GD A+A + + + K D+ L+NG Y+
Sbjct: 564 YLAALKAGAEMAEYLGDEAAAIEYREIFARGKKWADANLFNGEYYIQRINLHDRSIVEAF 623
Query: 705 ---------------NYDNSDGSSSTSI----QADQLAGQWYARACGLLPIADEAKVKKA 745
+D+ G I DQ+ GQW+A GL I D +V+ A
Sbjct: 624 ADDESVLIGGSTLEAYWDDEHGEIKYQIGEGSSIDQVLGQWHASLYGLGEIFDPTQVRSA 683
Query: 746 LTKIYDFNVLKVKG 759
L IY +N + V G
Sbjct: 684 LAAIYKYNFIPVMG 697
>gi|332796035|ref|YP_004457535.1| bile acid beta-glucosidase [Acidianus hospitalis W1]
gi|332693770|gb|AEE93237.1| bile acid beta-glucosidase [Acidianus hospitalis W1]
Length = 840
Score = 91.3 bits (225), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 92/389 (23%), Positives = 159/389 (40%), Gaps = 74/389 (19%)
Query: 493 QFLYLEGAEYVMYNTYDVHFYSSFALVMLFPKLELSIQRDFAAAVMMHDPGTMKIMS--- 549
Q+ Y +G + NT DV YS ++ LFP+L I F ++ + + S
Sbjct: 343 QYPYTDGPLHTALNTIDVLTYSLPTILNLFPELAEKIILQFKDKLIEENTPEHVVYSLSF 402
Query: 550 -------------------DGKWVARKCLGAVPHDIGLDD----PWFEINSYNLFNSSRW 586
DG+ + + + + G D P F ++ + R
Sbjct: 403 EENREKFLEKLSKDPSLPTDGEKLY-STINEIVKETGKDPKGRVPHFFTDNLRVDEYHR- 460
Query: 587 KDLNSKFVLQVYRDFVATGDKNFARAVWPSV------------YIAMAY----------- 623
DLN +F+L Y TGD NF + ++P + Y + Y
Sbjct: 461 VDLNPEFILMTYLIAKTTGDLNFLKEIFPKMKEALESTMKTQTYDGLIYHTLPAGLEWLR 520
Query: 624 -------MEQFDKDGDGMIENEGFP--DQTYDAWSANGVSAYCGGLWVAALQAASALAND 674
+ + D + ++ + P QT+D WS G++++ LW++++QA + ++
Sbjct: 521 YVNNKLNLPRGDNNSASILGHNLIPLSMQTFDDWSMIGITSFTSILWISSIQAVNDACSN 580
Query: 675 VGDHASASYFWVRYQKAKAVYDSLWNGSYFN--YDNSDGSSSTSIQADQLAGQWYARACG 732
+ + S Y K + D LWNG YF+ YD + A Q+ G WY+ G
Sbjct: 581 LKINCSYDY----ESLVKKLIDYLWNGEYFDLWYDPKSKMRDKACNASQILGHWYSTLLG 636
Query: 733 LLPIADEAKVKKALTKIYDFNVLKVKGGMCGAM-NGMQPDGRIDMSGL------QAREIW 785
L D++ VK L I +N+ + +G + GA NG +P R + L Q W
Sbjct: 637 -LRFLDDSLVKTTLKSIVKYNLKEEEGLLNGAYPNGYRPLKRNYQNQLNLPATTQIDTPW 695
Query: 786 PGVTYGLAASMIQEEMVDMAFQTAAGVYE 814
GV + +A+ +I E++ D + +YE
Sbjct: 696 SGVEFYVASHLIYEKLRDEGEKILRNIYE 724
Score = 48.1 bits (113), Expect = 0.021, Method: Compositional matrix adjust.
Identities = 36/141 (25%), Positives = 64/141 (45%), Gaps = 22/141 (15%)
Query: 88 GIPLGGIGAGSIGRSYRGEFQRFKLFH--GICDDAPVLANQFSVFVSRPNGEKFSSVLCP 145
GIPLGGIG GS+ G + +F+ G + + +F+ ++ + S
Sbjct: 12 GIPLGGIGTGSVEIRADGRLYEWTIFNNGGYAERQDL---RFTYYLDEQD-----SFFAV 63
Query: 146 RSPGVPKKNTDSGIESWDWNLKGENCTYHALFPRAWTVYDGEPD-------PELRIVCRQ 198
R G+ + ++++D+N G L P ++GEP ++ + +
Sbjct: 64 RQKGIVRV-----LQAFDYNFGGSPYILPWLRPVERVEFNGEPPIAYLEFVDKISVNMKA 118
Query: 199 ISPFIPHNYKESSFPASVFTF 219
SPFIP + K SS P+++FT
Sbjct: 119 FSPFIPTDLKNSSLPSAIFTL 139
>gi|402589212|gb|EJW83144.1| hypothetical protein WUBG_05948, partial [Wuchereria bancrofti]
Length = 174
Score = 89.4 bits (220), Expect = 9e-15, Method: Composition-based stats.
Identities = 56/159 (35%), Positives = 79/159 (49%), Gaps = 20/159 (12%)
Query: 89 IPLGGIGAGSIGRSYRGEFQRFKLFHGICDD--APVLANQFSVFVSRPNG---EKFSSVL 143
+P G IG+GSIGR +RG F +F L GI + V ANQF+ PN + VL
Sbjct: 1 VPCGTIGSGSIGRDFRGGFCKFGLRPGITEQKIGVVKANQFN-----PNNLWETVYQKVL 55
Query: 144 CPRSPGVPKKNTDSG---IESWDWNLKGENCTYHALFPRAWTVYDGEPDPELRIVCRQIS 200
C RS + SG + SWD++ + Y L+PR WT Y P+ + +Q+S
Sbjct: 56 CARS------SVSSGRGELTSWDFSFPPDKLIYRGLYPRFWTYYS-IPEFNFILCIQQVS 108
Query: 201 PFIPHNYKESSFPASVFTFTLSNSGQTSADVTLLFTWAN 239
P IP+NY+ES P ++F N + FT+ N
Sbjct: 109 PVIPNNYEESLLPVTLFIIDAENQSSVDLQAAVAFTFRN 147
>gi|150009713|ref|YP_001304456.1| hypothetical protein BDI_3128 [Parabacteroides distasonis ATCC
8503]
gi|149938137|gb|ABR44834.1| conserved hypothetical protein [Parabacteroides distasonis ATCC
8503]
Length = 901
Score = 87.4 bits (215), Expect = 3e-14, Method: Compositional matrix adjust.
Identities = 93/406 (22%), Positives = 157/406 (38%), Gaps = 67/406 (16%)
Query: 75 FNVFRKHHITSDQGIPLGGIGAGSIGRSYRGEFQRFKLFHGICDDAPVLANQFSVFVSRP 134
F+ + + + G+PLGGIG G++ + G F R L + P+ + S F
Sbjct: 47 FDYLKLENHPGNSGVPLGGIGVGNVEFAPDGRFARIGLNN---IHLPIRKSTASFFSLWY 103
Query: 135 NGEKFSSVLCPRSPGVPKKNTDSGIESWDWNLKGENCTYHALFPRAWTVYDGEPDPELRI 194
G K ++V+ +P+ L E+ Y LFPR + G P +
Sbjct: 104 KGAKETAVVRLVRDSLPQYGL----------LGVEDIYYTGLFPRVEMDF-GNSIPGINA 152
Query: 195 VCRQISPFIPHNYKESSFPASVFTFTLSNSGQTSADVTLLFTWANSVAGD-------SGL 247
R SP IPH+ K SS P F L + Q +V++ F+W + + G+
Sbjct: 153 TIRAFSPLIPHDVKNSSLPVVFFDVDLVS--QEGGEVSIAFSWEDFIGRGIKEPDDIKGM 210
Query: 248 SGHHFN---------------------SKTMTKDGVHGLT------LHHRTANGRPPVT- 279
G F S++ + + G+ L + AN + V
Sbjct: 211 DGQLFGQERNSLCNGEAWPERPLEQTYSESWSCGKMQGIRQFASGPLKPKRANFQNYVNE 270
Query: 280 FAVAAEETADVHVSECPCFLLSGNSKGITAKDMWNEIKKHGSFDHLDNDKTSPSEPGSSI 339
AV AE DV VS P F + + W+ K+ G F+ + D + S PG +
Sbjct: 271 VAVLAESREDVEVSVLPSFNIK------ETHEAWDLFKQEGRFEDISKDVSLLSIPGKQM 324
Query: 340 -GAAIAASLTIPSGSTRSVTFSLAWDCPEVKFFEK--------VYHRRYTKFYGTLGDSA 390
G+A+A + + ++V F L W PE+ + V Y +++ +
Sbjct: 325 GGSAVALKTKLKAKEKKTVRFMLVWFYPEMMIDRENDPLEYYWVGGSDYGRYFHNYFNRL 384
Query: 391 ARIAHDAILEHAKWECEIEAWQRPILEDKRFPEWYPITLFNELYYL 436
++ ++ + WQ P+L + P+WY L N Y +
Sbjct: 385 EQLVDYGYGNRERFREKTTEWQLPVL-NSTMPDWYKFKLINSGYVI 429
Score = 55.5 bits (132), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 47/203 (23%), Positives = 87/203 (42%), Gaps = 11/203 (5%)
Query: 588 DLNSKFVLQVYRDFVATGDKNFARAVWPSVYIAMAYMEQFDKDGDGMIENEGFPDQTYDA 647
D +++Q+ +D+ TGD ++ VY M +++ +G N TYD
Sbjct: 517 DNTGGWIIQLVKDYEQTGDFHYLSRYAGRVYNGMDFLKSLMPEG----LNIPIGGTTYDD 572
Query: 648 WSANGVSAYCGGLWVAALQAASALANDVGDHASASYFWVRYQKA-KAVYDSLWNGSYFNY 706
++ V +Y +++A L+AA +A + D + + ++ K K + LWNG +F+Y
Sbjct: 573 FTHPPVYSYGASIYLATLRAAQVVAQAMKDTVRENEYKEQFDKTQKDMIRLLWNGRFFSY 632
Query: 707 D---NSDGSSSTSIQADQLAGQWYARACGLLPIADEAKVKKALTKIYDFNVLKVKGGMCG 763
+ G + QL GQ+ +R CG + K +L + ++ K
Sbjct: 633 GCELDGSGRKDNLLFTGQLGGQFVSRYCGWGDVIPMEMTKASLVSQFKISLSKTPDYYAN 692
Query: 764 AMNGMQPDGRIDMSGLQAREIWP 786
+ + + ID G Q WP
Sbjct: 693 KVWDITLNRGIDNRGSQC---WP 712
>gi|284174574|ref|ZP_06388543.1| hypothetical protein Ssol98_07941 [Sulfolobus solfataricus 98/2]
gi|356934758|gb|AET42959.1| hypothetical protein [Sulfolobus solfataricus 98/2]
Length = 803
Score = 86.3 bits (212), Expect = 7e-14, Method: Compositional matrix adjust.
Identities = 99/383 (25%), Positives = 151/383 (39%), Gaps = 69/383 (18%)
Query: 491 IGQFLYLEGAEYVMYNTYDVHFYSSFALVMLFPKLELSIQRDFAAAVMMHDP-------- 542
+G++ Y G E NT DV Y+ +++LFP+L +I RD + + D
Sbjct: 332 VGKYPYTGGPENTALNTIDVLLYALPGVMLLFPELAKNIVRDLSNRALKEDTPEYVLFSL 391
Query: 543 ----GTMK----------IMSDGKWV---ARKCLGAVPHDIGLDDPWFEINSYNLFNSSR 585
MK I +D K + +K + D P + +S + R
Sbjct: 392 AFPENLMKYKEEVKKDPTISTDLKKLYDTIKKIIKETGKDPKGRMPHYIRHSLTVDTYER 451
Query: 586 WKDLNSKFVLQVYRDFVATGDKNFARAVW------------------------PSVYIAM 621
D+N +FVL Y TGD+ F +V+ PS M
Sbjct: 452 -VDINPEFVLLYYLIAKYTGDREFLTSVYEVARNAIESIMRTQTLDGLPYLTLPSGIEWM 510
Query: 622 AYMEQFDKDGDGM----IENEGFPDQTYDAWSANGVSAYCGGLWVAALQA---ASALAND 674
++ K D QT D WS G S Y LW+AAL+A AS L N+
Sbjct: 511 RHVNNMLKVSDAHKILGYHTLALSMQTLDDWSWLGFSPYVSFLWIAALEALNEASKLLNN 570
Query: 675 VGDHASASYFWVRYQKAKAVYDSLWNGSYFN--YDNSDGSSSTSIQADQLAGQWYARACG 732
+ ++ + + V LWNG Y+ YD S+ A Q+ G WY + G
Sbjct: 571 LHNYEVK-------ELTEKVNKYLWNGEYYINWYDPISNLRDDSLNASQMTGDWYVQLLG 623
Query: 733 LLPIADEAKVKKALTKIYDFNVLKVKGGMCGAMNG-MQPDGRIDMSGLQAREIWPGVTYG 791
L D K K + I +N + +G G+ N + P G + +S +Q++ W G+ Y
Sbjct: 624 LPEFLDHEKRKSIFSSIIKYNYTEEEGVKNGSSNKEITPLG-VKLS-VQSKTPWSGIEYY 681
Query: 792 LAASMIQEEMVDMAFQTAAGVYE 814
LA+ M + A + VYE
Sbjct: 682 LASHMFYNGFDEYAKKILRNVYE 704
>gi|15899746|ref|NP_344351.1| hypothetical protein SSO3039 [Sulfolobus solfataricus P2]
gi|13816440|gb|AAK43141.1| Hypothetical protein SSO3039 [Sulfolobus solfataricus P2]
Length = 803
Score = 86.3 bits (212), Expect = 7e-14, Method: Compositional matrix adjust.
Identities = 99/383 (25%), Positives = 151/383 (39%), Gaps = 69/383 (18%)
Query: 491 IGQFLYLEGAEYVMYNTYDVHFYSSFALVMLFPKLELSIQRDFAAAVMMHDP-------- 542
+G++ Y G E NT DV Y+ +++LFP+L +I RD + + D
Sbjct: 332 VGKYPYTGGPENTALNTIDVLLYALPGVMLLFPELAKNIVRDLSNRALKEDTPEYVLFSL 391
Query: 543 ----GTMK----------IMSDGKWV---ARKCLGAVPHDIGLDDPWFEINSYNLFNSSR 585
MK I +D K + +K + D P + +S + R
Sbjct: 392 AFPENLMKYKEEVKKDPTISTDLKKLYDTIKKIIKETGKDPKGRMPHYIRHSLTVDTYER 451
Query: 586 WKDLNSKFVLQVYRDFVATGDKNFARAVW------------------------PSVYIAM 621
D+N +FVL Y TGD+ F +V+ PS M
Sbjct: 452 -VDINPEFVLLYYLIAKYTGDREFLTSVYEVARNAIESIMRTQTLDGLPYLTLPSGIEWM 510
Query: 622 AYMEQFDKDGDGM----IENEGFPDQTYDAWSANGVSAYCGGLWVAALQA---ASALAND 674
++ K D QT D WS G S Y LW+AAL+A AS L N+
Sbjct: 511 RHVNNMLKVSDAHKILGYHTLALSMQTLDDWSWLGFSPYVSFLWIAALEALNEASKLLNN 570
Query: 675 VGDHASASYFWVRYQKAKAVYDSLWNGSYFN--YDNSDGSSSTSIQADQLAGQWYARACG 732
+ ++ + + V LWNG Y+ YD S+ A Q+ G WY + G
Sbjct: 571 LHNYEVK-------ELTEKVNKYLWNGEYYINWYDPISNLRDDSLNASQMTGDWYVQLLG 623
Query: 733 LLPIADEAKVKKALTKIYDFNVLKVKGGMCGAMNG-MQPDGRIDMSGLQAREIWPGVTYG 791
L D K K + I +N + +G G+ N + P G + +S +Q++ W G+ Y
Sbjct: 624 LPEFLDHEKRKSIFSSIIKYNYTEEEGVKNGSSNKEITPLG-VKLS-VQSKTPWSGIEYY 681
Query: 792 LAASMIQEEMVDMAFQTAAGVYE 814
LA+ M + A + VYE
Sbjct: 682 LASHMFYNGFDEYAKKILRNVYE 704
>gi|332796016|ref|YP_004457516.1| hypothetical protein Ahos_0327 [Acidianus hospitalis W1]
gi|332693751|gb|AEE93218.1| conserved hypothetical protein [Acidianus hospitalis W1]
Length = 662
Score = 85.9 bits (211), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 83/309 (26%), Positives = 130/309 (42%), Gaps = 50/309 (16%)
Query: 492 GQFLYLEGAEY--VMYNTYDVHFYSSFALVMLFPKLELSIQRDFAAAVMMHDPGTMKIMS 549
G+F LE E V+ + + + +++LFP LE ++ A V+ D
Sbjct: 332 GRFGVLEAPEVFPVVGTIAGLCYEGALPVLLLFPCLEKEFIKELAN-VIRED-------- 382
Query: 550 DGKWVARKCLGAVPHDIGLDDPW-FEINSYNLFNSSRWKDLNSKFVLQVYRDFVATGDKN 608
G VPHD+G+ W F+ RWKDLN F+L VYR F TGD +
Sbjct: 383 ----------GYVPHDLGI---WSFDSPIEGTTAPPRWKDLNPTFILLVYRYFKFTGDVD 429
Query: 609 FARAVWPSVYIAMAYMEQFDKDGDGMIENEGFPDQTYDAWSANGVSAYCGGLWVA---AL 665
F R +P + A +M IE EG D +D G + L++A AL
Sbjct: 430 FLRETYPKMLKAYEWMLTRS------IEAEGSGDTAFDVLPIKGKNPMLLTLFIASALAL 483
Query: 666 QAASALANDVGDHASASYFWVRYQKAKAVYDSLWNGSYF-NYDNSDGSSSTSIQADQLAG 724
+ + N+ + S K + + +SL+NG YF ++ +G I QL G
Sbjct: 484 RETKKVLNEKDESTDLS-------KLREMLNSLYNGKYFIAWEGQEG-----IFMAQLLG 531
Query: 725 QWYARACGLLPIADEAKVKKALTKIYDFNVLKVKGGMCGAMNGMQPDGRIDMSGLQAREI 784
+W+ GL + DE K+ AL + + N K N ++ +G + QA
Sbjct: 532 EWWTELLGLENVTDEEKISSALRYMLEVN---GKASEYCTPNLVKENGEVVKISPQAYSS 588
Query: 785 WPGVTYGLA 793
WP + + +
Sbjct: 589 WPRLVFAMG 597
>gi|327311335|ref|YP_004338232.1| hypothetical protein TUZN_1447 [Thermoproteus uzoniensis 768-20]
gi|326947814|gb|AEA12920.1| hypothetical protein TUZN_1447 [Thermoproteus uzoniensis 768-20]
Length = 892
Score = 85.9 bits (211), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 198/872 (22%), Positives = 308/872 (35%), Gaps = 204/872 (23%)
Query: 88 GIPLGGIGAGSIGRSYRGEFQRFKLFHG---ICDDA-----PVLANQFSVFVSRPNGEKF 139
G+ LGGIGAGS+ G + +F+ DA P+ F + V GE
Sbjct: 12 GVALGGIGAGSVEIRADGLLHDWLIFNNGPWTTIDAHRKLYPLDEKGFLLAVRVDGGE-- 69
Query: 140 SSVLCPRSPGVPKKNTDSGIESWDWNLKGENCTYHALFPRAWTVYDGEP---------DP 190
P V + ++S + L G+ L P Y GEP D
Sbjct: 70 -------GPLVRQ------LQSAGYILGGDPYKAPWLKPVKGVDYRGEPPLALLKYVDDL 116
Query: 191 ELRIVCRQISPFIPHNYKESSFPASVFTFTLSNSGQTSADVTLLFTWANSVAGDSGLSGH 250
+ I +SPFIP + K S+ PA + N DV+L WA +
Sbjct: 117 PVEIEAEFLSPFIPGDLKNSALPAFIAVVKFRNRSDKPLDVSL---WA--------MLKS 165
Query: 251 HFNSKTMTKDGVHGLTLHHRTANGRPPV---TFAVAAEETADVH--VSECPCFLLSGNSK 305
F SK + K + L G P+ + A+A +VH V P + G +
Sbjct: 166 PF-SKAVAKAKGDTVVLQDGGEPGDSPLRDGSMALAVR-AGEVHAAVVRAPGYDEYGFQR 223
Query: 306 GITAKDM---WNEIKKHGSFDHLDNDKTSPSEPGSSIGAAIAASLTIPSGSTRSVTFSLA 362
+ +M W + ++ G + + G + A + A L + +SV LA
Sbjct: 224 YDDSLEMLQAWIDFRREGRV------RGPAARSGDGLWAVVTAPLKLAPSEEKSVVLVLA 277
Query: 363 WDCPEV--KFFEKVYHRRYTKFYGTLGDSAARIAHDAILEHAKWECEIEAWQRPILEDKR 420
W P +F E++ H Y F+ G A + + + K + + + + K
Sbjct: 278 WFFPNHVDQFGERLGHY-YENFFSDAGSVARYVLDNFEYLYGK----TKKFHDALYDVKG 332
Query: 421 FPEWYPITLFNELYYLNAGGTIWTDGSPPMQSLATIRERKFSLDTPSKIHTPASSDTALG 480
W + ++L L + DG +F+L ALG
Sbjct: 333 VESWIADLVASQLSTLVKISWLTKDG-------------RFALWE------------ALG 367
Query: 481 TRLLENGEENIGQFLYLEGAEYVMYNTYDVHFYSSFALVMLFPKLELSIQRDFAAAVMMH 540
+ Y G E +NT DV Y++ ALV LFP+L + +A +
Sbjct: 368 DK-------------YYGGPERNAFNTTDVIAYATPALVSLFPELAVKYLVQHSAFALRR 414
Query: 541 DPGTMKI----MSDGKWVARKCLGAVPHDIGLD-----------------DPWFEI---- 575
I + + + K L P+ + LD DP I
Sbjct: 415 GTPEWVIYALAIPENRREFEKALEREPY-LALDWQRLVEAVSRIVERTGKDPAGRIAHFL 473
Query: 576 -NSYNLFNSSRWKDLNSKFVLQVYRDFVATGDKNFARAVW-------------------- 614
S + DL K+VL Y TG+ R +W
Sbjct: 474 NKSIKGIDGYHMIDLMPKYVLMAYATAKWTGNAELLRNLWDVLDGAVDSVAKAQSVEGLP 533
Query: 615 ----PSVYIAMAYMEQFDKDGDGMIENE------------GFPDQTYDAWSANGVSAYCG 658
P+ M ++ K+ I GF QT+D W+ GVS+Y
Sbjct: 534 YHTTPAGIEWMRAVQAIFKESASQISTAVVQLLSQRAVLMGF--QTFDTWAFYGVSSYVL 591
Query: 659 GLWVAALQAASALANDVGDHASASYFWVRYQKAKAVYDSLWNGSYFN--YDNSDGSSSTS 716
LW AAL+A A G S + ++A + LWNG YF+ +D G +
Sbjct: 592 FLWAAALKAMVEGAKLTGR--SPEKYAELLRRAFEGLERLWNGEYFDLWWDPVTGERDRA 649
Query: 717 IQADQLAGQWYARACGLLPIADEAKVKKALTKIYDFNVLKVKGGMCGAMNGMQPDGRID- 775
A QL GQ A L ++D+ +V AL + +N+ + G +NGM PD R
Sbjct: 650 SMAAQLFGQLLAHVADLGYLSDKQRVISALRAVAKYNLAPDE----GLINGMYPDRRRPS 705
Query: 776 ---------------MSGLQAREIWPGVTYGLAASMIQEEMVDMAFQTAAGVYEVAWSED 820
+ Q W GV Y +A M E +V+ ++E
Sbjct: 706 FVGPTLYENFTRGPYLPTWQMDTPWTGVEYAVAGHMFYEGLVEEGTAVLKALHE------ 759
Query: 821 GLGYSFQTPESWNNNDEYRSLCYMRPLTIWAM 852
++ + N+ E+ + YMRPL+ WA+
Sbjct: 760 ----RYERGGHYWNHIEWGT-HYMRPLSAWAV 786
>gi|76156398|gb|AAX27603.2| SJCHGC03819 protein [Schistosoma japonicum]
Length = 192
Score = 85.5 bits (210), Expect = 1e-13, Method: Composition-based stats.
Identities = 49/117 (41%), Positives = 69/117 (58%), Gaps = 8/117 (6%)
Query: 103 YRGEFQRFKLFHGI-CDD-APVLANQFSVFVSRPNGEKFSSVLCPRSPGVPKKNTDSGIE 160
+RGEF R L GI C D PV +QF V V + N ++ VL P + P N G+
Sbjct: 74 FRGEFCRSSLIPGIYCYDIQPV--DQFIVTVRKNNVIIYNQVLSPLTK--PPSN-GKGLR 128
Query: 161 SWDWNLKGENCTYHALFPRAWTVYDGEPDPELRIVCRQISPFIPHNYKESSFPASVF 217
W W + EN Y L+PR+WTVY+ P+ +L +VC+QISP IPH+Y+ + P ++F
Sbjct: 129 KWIWGFRPENGYYIGLYPRSWTVYE-IPELQLTLVCQQISPVIPHDYQVTCLPVAIF 184
>gi|160934323|ref|ZP_02081710.1| hypothetical protein CLOLEP_03194 [Clostridium leptum DSM 753]
gi|156866996|gb|EDO60368.1| hypothetical protein CLOLEP_03194 [Clostridium leptum DSM 753]
Length = 806
Score = 82.8 bits (203), Expect = 8e-13, Method: Compositional matrix adjust.
Identities = 164/786 (20%), Positives = 281/786 (35%), Gaps = 161/786 (20%)
Query: 91 LGGIGAGSIGRSYRGEFQRFKLFHGICDDAPVLANQFSVFVSRPNGEKFSSVLCPRSPGV 150
LGGIG G++ RG + +++F F+++ N + +L R
Sbjct: 22 LGGIGTGNVSVGTRGNYCDWEIFGQPGKGIHFPYTFFALWAKSKNTQPIVRILESRIQ-- 79
Query: 151 PKKNTDSGIESWDWNLKG----ENCTYHALFPRAWTVYDGEPDPELRIVCRQISPFIPHN 206
P G S D L G E ++ A +P + P L + +PFIP N
Sbjct: 80 PPYTCSHGFISAD--LAGLPRFERSSFSAAYPFVNVALEDSKLP-LSVELEAFTPFIPLN 136
Query: 207 YKESSFPASVFTFTLSNSGQTSADVTLLFTWANSVAGDSGLSGHHFNSKTMTKDGVHGLT 266
K+S P +V + + N+ + + T+ A+SVA G +G+ + K V
Sbjct: 137 AKDSGIPGAVIRYKVKNNSGEALEATV----ASSVANAVGFNGYDLFDNLLVKRPVKNQY 192
Query: 267 LHHRTANGRPPVTFAVAAEETADVHVSECPCFLLSGNSKGITAK-------------DMW 313
L G + E A+ + +++G +T K D W
Sbjct: 193 LDQDQMKG-----LLYTSPELAEDDFTNGSIAVMTGEDS-VTVKEEWLSGAWWDGIHDFW 246
Query: 314 NEIKKHGSFD---HLDNDKTSPSEPGSSIGAAIAASLTIPSGSTRSVTFSLAWDCPEVKF 370
++ + G + ++ ++ ++ TIP+ F L W
Sbjct: 247 DDFSEDGRLEFHSEFQGKESKFNDLSDLRIGSLGVCKTIPAYGEAVFEFILTW------- 299
Query: 371 FEKVYHRRYTKFYGTLGDSAARIAHDAILEHAKWECEIEAWQRPILEDKRFPEWYPITLF 430
Y K++G G + +EH
Sbjct: 300 ----YFPNRPKYWG--GHVCPENRNKKEIEHN---------------------------- 325
Query: 431 NELYYLNAGGTIWTDGSPPMQSLATIRERKFSLDTPSKIHTPASSDTALGTRLLENGEEN 490
YY + W G ++L SL+ S+ T A ++ L ++E N
Sbjct: 326 ---YYADFQKNAWDYGVYLYKNLE-------SLEKASRSFTKALYESTLPDYVIEALANN 375
Query: 491 I-------------GQFLYLEGA----EYVMYNTYDVHFYSSFALVMLFPKLELSIQRDF 533
I G FL EG+ N V Y+ A LFP+LE S++R
Sbjct: 376 ITVLRSPTCFRISDGTFLSWEGSFPQHGSCEGNCTHVWNYAQTA-AFLFPELEQSMRRT- 433
Query: 534 AAAVMMHDPG-----TMKIMSDGKWVARKCLGAVPHDIGLDDPWFEINSYNLFNSSRWKD 588
+ D G T+++ D KW + + AV +G
Sbjct: 434 EFLLETDDTGRMAFRTLQVFGDKKW---EMIPAVDGQMGT-------------------- 470
Query: 589 LNSKFVLQVYRDFVATGDKNFARAVWPSVYIAMAY-MEQFDKDGDGMIENEGFPDQTYDA 647
+L++YRD+ + W +A+ + +D D D +++++ TYD
Sbjct: 471 -----ILRLYRDWKLGAGDALLKETWEKAALALDFAFTYWDSDKDCVLDDQ--QHNTYDI 523
Query: 648 WSANGVSAYCGGLWVAALQAASALANDVGDHASASYFWVRYQKAKAVYDS-LWNGSYFNY 706
G ++ L++AAL+A +A +GD A A + ++K DS LW G Y+
Sbjct: 524 -EFYGPNSLSNSLFLAALKAGKEMAAYLGDTAHAEKYAEAFRKGSEKMDSLLWGGDYYIQ 582
Query: 707 DNSDGSS-----STSIQADQLAGQWYARACGLLPIADEAKVKKALTKIYDFNVLK----- 756
D ++ T ADQ+ GQ A L + E +KKA+ ++ +N +
Sbjct: 583 VIDDINAYKYQYGTGCLADQVFGQMLAHLNHLGYVLPEEHIKKAIHSVFQYNFKETMEDH 642
Query: 757 VKGGMCGAMNGMQ--------PDGRIDMSGLQAREIWPGVTYGLAASMIQEEMVDMAFQT 808
V A+N + GR + + + E+W G+ Y +AA +I E V+
Sbjct: 643 VNVQRTYALNQEKGLVLCSWPQGGRPKLPFVYSDEVWSGIEYQVAAHLIFEGFVEEGLAV 702
Query: 809 AAGVYE 814
V E
Sbjct: 703 VKAVRE 708
>gi|238620760|ref|YP_002915586.1| hypothetical protein M164_2319 [Sulfolobus islandicus M.16.4]
gi|238381830|gb|ACR42918.1| protein of unknown function DUF608 [Sulfolobus islandicus M.16.4]
Length = 818
Score = 82.4 bits (202), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 102/441 (23%), Positives = 174/441 (39%), Gaps = 100/441 (22%)
Query: 491 IGQFLYLEGAEYVMYNTYDVHFYSSFALVMLFPKLELSIQRDFAAAVM------------ 538
+G++ Y +G E NT DV Y+ +++LFP L +I +D + +
Sbjct: 347 VGKYPYTDGPENTALNTIDVLLYALPGVMLLFPDLAKNIVKDLSNRALKEDTPEYVIFSL 406
Query: 539 -------------MHDP----------GTMKIMSDGKWVARKCLGAVPHDIGLDDPWFEI 575
M DP GT+K +++ + G +PH I +
Sbjct: 407 AFPENLMKYKEEIMKDPTISTDLKKLYGTIKRIANE--TGKDPKGRMPHYIRYS---LTV 461
Query: 576 NSYNLFNSSRWKDLNSKFVLQVYRDFVATGDKNFARAVWPSV-----------------Y 618
++Y D+N +FVL Y TGD+ ++V+ Y
Sbjct: 462 DTYERI------DINPEFVLLYYLIAKYTGDRELLKSVYEVARNAIESIMRTQTVDGLPY 515
Query: 619 IAMAYMEQFDKDGDGMIENE-----------GFPDQTYDAWSANGVSAYCGGLWVAALQA 667
+ + ++ ++ + M+ QT D WS G S Y LW++AL+A
Sbjct: 516 LTLPSGIEWIRNVNSMLRANDAHKILGYHSLALSMQTLDDWSWLGFSPYVSFLWISALEA 575
Query: 668 ---ASALANDVGDHASASYFWVRYQKAKAVYDSLWNGSYFN--YDNSDGSSSTSIQADQL 722
AS L N+ ++ + + V LWNG Y+ YD S A Q+
Sbjct: 576 LNEASKLLNNPQNYEVK-------ELIEKVNKYLWNGEYYMDWYDPISNLRDDSSNASQI 628
Query: 723 AGQWYARACGLLPIADEAKVKKALTKIYDFNVLKVKGGMCGAMN-GMQPDGRIDMSGLQA 781
G WY + L D + K + I +N +G G+ N + P G + +S +Q+
Sbjct: 629 TGDWYVQMLDLPEFLDYERRKSVFSSIMKYNYSAEEGVRNGSSNDDITPLG-VKLS-IQS 686
Query: 782 REIWPGVTYGLAASMIQEEMVDMAFQTAAGVYEVAWSEDGLGYSFQTPESWNNNDEYRSL 841
+ W GV Y LA+ M + A + VYE ++ ++ N+ E+ +
Sbjct: 687 KAPWSGVEYYLASHMFSSGFDEYAKKILRNVYE----------RYEIAGNFWNHIEWGA- 735
Query: 842 CYMRPLTIWAMQWALTKPKLS 862
YMRPL +M +A+ K++
Sbjct: 736 RYMRPLVALSMIYAIEGMKVN 756
>gi|385774246|ref|YP_005646813.1| bile acid beta-glucosidase [Sulfolobus islandicus HVE10/4]
gi|323478361|gb|ADX83599.1| bile acid beta-glucosidase [Sulfolobus islandicus HVE10/4]
Length = 818
Score = 82.4 bits (202), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 105/441 (23%), Positives = 172/441 (39%), Gaps = 100/441 (22%)
Query: 491 IGQFLYLEGAEYVMYNTYDVHFYSSFALVMLFPKLELSIQRDFAAAVM------------ 538
+G++ Y +G E NT DV Y+ +++LFP L +I +D + +
Sbjct: 347 VGKYPYTDGPENTALNTIDVLLYALPGVLLLFPDLAKNIVKDLSNRALKEDTPEYVIFSL 406
Query: 539 -------------MHDP----------GTMKIMSDGKWVARKCLGAVPHDIGLDDPWFEI 575
M DP GT+K +++ + G +PH I +
Sbjct: 407 AFPENLMKYKEEIMKDPTISTDLKKLYGTIKRIANE--TGKDPKGRMPHYIRYS---LTV 461
Query: 576 NSYNLFNSSRWKDLNSKFVLQVYRDFVATGDKNFARAVW--------------------- 614
++Y D+N +FVL Y TGD+ ++V+
Sbjct: 462 DTYERI------DINPEFVLLYYLIAKYTGDRELLKSVYEVARNAIESIMRTQTMDGLPY 515
Query: 615 ---PSVYIAMAYMEQFDKDGDGM----IENEGFPDQTYDAWSANGVSAYCGGLWVAALQA 667
PS + Y+ + D + QT D WS G S Y LW++AL+A
Sbjct: 516 LTLPSGIEWIRYVNSMLRANDAHKILGYHSLALSMQTLDDWSWLGFSPYVSFLWISALEA 575
Query: 668 ---ASALANDVGDHASASYFWVRYQKAKAVYDSLWNGSYFN--YDNSDGSSSTSIQADQL 722
AS L N+ ++ + + V LWNG Y+ YD S A Q+
Sbjct: 576 LNEASKLLNNPQNYEVK-------ELIEKVNKYLWNGEYYMDWYDPISNLRDDSSNASQI 628
Query: 723 AGQWYARACGLLPIADEAKVKKALTKIYDFNVLKVKGGMCGAMN-GMQPDGRIDMSGLQA 781
G WY + L D + K + I +N +G G+ N + P G + +S +Q+
Sbjct: 629 TGDWYVQMLDLPEFLDYERRKSVFSSIMKYNYSGEEGVRNGSSNDDITPLG-VKLS-IQS 686
Query: 782 REIWPGVTYGLAASMIQEEMVDMAFQTAAGVYEVAWSEDGLGYSFQTPESWNNNDEYRSL 841
+ W GV Y LA+ M + A + VYE ++ S+ N+ E+ +
Sbjct: 687 KAPWSGVEYYLASHMFYSGFDEYAKKILRNVYE----------RYEIAGSFWNHIEWGA- 735
Query: 842 CYMRPLTIWAMQWALTKPKLS 862
YMRPL +M +A+ K++
Sbjct: 736 RYMRPLVALSMIYAIEGMKVN 756
>gi|385776902|ref|YP_005649470.1| bile acid beta-glucosidase [Sulfolobus islandicus REY15A]
gi|323475650|gb|ADX86256.1| bile acid beta-glucosidase [Sulfolobus islandicus REY15A]
Length = 818
Score = 82.0 bits (201), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 105/441 (23%), Positives = 172/441 (39%), Gaps = 100/441 (22%)
Query: 491 IGQFLYLEGAEYVMYNTYDVHFYSSFALVMLFPKLELSIQRDFAAAVM------------ 538
+G++ Y +G E NT DV Y+ +++LFP L +I +D + +
Sbjct: 347 VGKYPYTDGPENTALNTIDVLLYALPGVMLLFPDLAKNIVKDLSNRALKEDTPEYVIFSL 406
Query: 539 -------------MHDP----------GTMKIMSDGKWVARKCLGAVPHDIGLDDPWFEI 575
M DP GT+K +++ + G +PH I +
Sbjct: 407 AFPENLMKYKEEIMKDPTISTDLKKLYGTIKRIANE--TGKDPKGRMPHYIRYS---LTV 461
Query: 576 NSYNLFNSSRWKDLNSKFVLQVYRDFVATGDKNFARAVW--------------------- 614
++Y D+N +FVL Y TGD+ ++V+
Sbjct: 462 DTYERI------DINPEFVLLYYLIAKYTGDRELLKSVYEVARNAIESIMRTQTMDGLPY 515
Query: 615 ---PSVYIAMAYMEQFDKDGDGM----IENEGFPDQTYDAWSANGVSAYCGGLWVAALQA 667
PS + Y+ + D + QT D WS G S Y LW++AL+A
Sbjct: 516 LTLPSGIEWIRYVNSMLRANDAHKILGYHSLALSMQTLDDWSWLGFSPYVSFLWISALEA 575
Query: 668 ---ASALANDVGDHASASYFWVRYQKAKAVYDSLWNGSYFN--YDNSDGSSSTSIQADQL 722
AS L N+ ++ + + V LWNG Y+ YD S A Q+
Sbjct: 576 LNEASKLLNNPQNYEVK-------ELIEKVNKYLWNGEYYMDWYDPISNLRDDSSNASQI 628
Query: 723 AGQWYARACGLLPIADEAKVKKALTKIYDFNVLKVKGGMCGAMN-GMQPDGRIDMSGLQA 781
G WY + L D + K + I +N +G G+ N + P G + +S +Q+
Sbjct: 629 TGDWYVQMLDLPEFLDYERRKSVFSSIMKYNYSGEEGVRNGSSNDDITPLG-VKLS-IQS 686
Query: 782 REIWPGVTYGLAASMIQEEMVDMAFQTAAGVYEVAWSEDGLGYSFQTPESWNNNDEYRSL 841
+ W GV Y LA+ M + A + VYE ++ S+ N+ E+ +
Sbjct: 687 KAPWSGVEYYLASHMFYSGFDEYAKKILRNVYE----------RYEIAGSFWNHIEWGA- 735
Query: 842 CYMRPLTIWAMQWALTKPKLS 862
YMRPL +M +A+ K++
Sbjct: 736 RYMRPLVALSMIYAIEGMKVN 756
>gi|229585796|ref|YP_002844298.1| hypothetical protein M1627_2396 [Sulfolobus islandicus M.16.27]
gi|228020846|gb|ACP56253.1| protein of unknown function DUF608 [Sulfolobus islandicus M.16.27]
Length = 818
Score = 81.6 bits (200), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 105/441 (23%), Positives = 172/441 (39%), Gaps = 100/441 (22%)
Query: 491 IGQFLYLEGAEYVMYNTYDVHFYSSFALVMLFPKLELSIQRDFAAAVM------------ 538
+G++ Y +G E NT DV Y+ +++LFP L +I +D + +
Sbjct: 347 VGKYPYTDGPENTALNTIDVLLYALPGVMLLFPDLAKNIVKDLSNRALKEDTPEYVIFSL 406
Query: 539 -------------MHDP----------GTMKIMSDGKWVARKCLGAVPHDIGLDDPWFEI 575
M DP GT+K +++ + G +PH I +
Sbjct: 407 AFPENLMKYKEEIMKDPTISTDLKKLYGTIKRIANE--TGKDPKGRMPHYIRYS---LTV 461
Query: 576 NSYNLFNSSRWKDLNSKFVLQVYRDFVATGDKNFARAVW--------------------- 614
++Y D+N +FVL Y TGD+ ++V+
Sbjct: 462 DTYERI------DINPEFVLLYYLIAKYTGDRELLKSVYEVARNAIESIMRTQTMDGLPY 515
Query: 615 ---PSVYIAMAYMEQFDKDGDGM----IENEGFPDQTYDAWSANGVSAYCGGLWVAALQA 667
PS + Y+ + D + QT D WS G S Y LW++AL+A
Sbjct: 516 LTLPSGIEWIRYVNSMLRANDAHKILGYHSLALSMQTLDDWSWLGFSPYVSFLWISALEA 575
Query: 668 ---ASALANDVGDHASASYFWVRYQKAKAVYDSLWNGSYFN--YDNSDGSSSTSIQADQL 722
AS L N+ ++ + + V LWNG Y+ YD S A Q+
Sbjct: 576 LNEASKLLNNPQNYEVK-------ELIEKVNKYLWNGEYYMDWYDPISNLRDDSSNASQI 628
Query: 723 AGQWYARACGLLPIADEAKVKKALTKIYDFNVLKVKGGMCGAMN-GMQPDGRIDMSGLQA 781
G WY + L D + K + I +N +G G+ N + P G I +S +Q+
Sbjct: 629 TGDWYVQMLDLPEFLDYERRKSVFSSIMKYNYSGEEGVRNGSSNDDITPLG-IKLS-IQS 686
Query: 782 REIWPGVTYGLAASMIQEEMVDMAFQTAAGVYEVAWSEDGLGYSFQTPESWNNNDEYRSL 841
+ W GV Y LA+ M + A + VYE ++ ++ N+ E+ +
Sbjct: 687 KAPWSGVEYYLASHMFSSGFDEYAKKILRNVYE----------RYEIAGNFWNHIEWGA- 735
Query: 842 CYMRPLTIWAMQWALTKPKLS 862
YMRPL +M +A+ K++
Sbjct: 736 RYMRPLVALSMIYAIEGMKVN 756
>gi|297741933|emb|CBI33368.3| unnamed protein product [Vitis vinifera]
Length = 108
Score = 81.6 bits (200), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 34/60 (56%), Positives = 42/60 (70%)
Query: 801 MVDMAFQTAAGVYEVAWSEDGLGYSFQTPESWNNNDEYRSLCYMRPLTIWAMQWALTKPK 860
M + AF TA G++ WSE+G GY FQTPE W + +RSL YMRPL IW MQWAL+ P+
Sbjct: 1 MEEQAFTTAEGIFTAGWSEEGYGYWFQTPEGWTIDGHFRSLIYMRPLAIWGMQWALSMPR 60
>gi|227828566|ref|YP_002830346.1| hypothetical protein M1425_2317 [Sulfolobus islandicus M.14.25]
gi|227460362|gb|ACP39048.1| protein of unknown function DUF608 [Sulfolobus islandicus M.14.25]
Length = 818
Score = 81.3 bits (199), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 105/441 (23%), Positives = 172/441 (39%), Gaps = 100/441 (22%)
Query: 491 IGQFLYLEGAEYVMYNTYDVHFYSSFALVMLFPKLELSIQRDFAAAVM------------ 538
+G++ Y +G E NT DV Y+ +++LFP L +I +D + +
Sbjct: 347 VGKYPYTDGPENTALNTIDVLLYALPGVMLLFPDLAKNIVKDLSNRALKEDTPEYVIFSL 406
Query: 539 -------------MHDP----------GTMKIMSDGKWVARKCLGAVPHDIGLDDPWFEI 575
M DP GT+K +++ + G +PH I +
Sbjct: 407 AFPENLMKYKEEIMKDPTISTDLKKLYGTIKRIANE--TGKDPKGRMPHYIRYS---LTV 461
Query: 576 NSYNLFNSSRWKDLNSKFVLQVYRDFVATGDKNFARAVW--------------------- 614
++Y D+N +FVL Y TGD+ ++V+
Sbjct: 462 DTYERI------DINPEFVLLYYLIAKYTGDRELLKSVYEVARNAIESIMRTQTMDGLPY 515
Query: 615 ---PSVYIAMAYMEQFDKDGDGM----IENEGFPDQTYDAWSANGVSAYCGGLWVAALQA 667
PS + Y+ + D + QT D WS G S Y LW++AL+A
Sbjct: 516 LTLPSGIEWIRYVNSMLRANDAHKILGYHSLALSMQTLDDWSWLGFSPYVSFLWISALEA 575
Query: 668 ---ASALANDVGDHASASYFWVRYQKAKAVYDSLWNGSYFN--YDNSDGSSSTSIQADQL 722
AS L N+ ++ + + V LWNG Y+ YD S A Q+
Sbjct: 576 LNEASKLLNNPQNYEVK-------ELIEKVDKYLWNGEYYMDWYDPISNLRDDSSNASQI 628
Query: 723 AGQWYARACGLLPIADEAKVKKALTKIYDFNVLKVKGGMCGAMN-GMQPDGRIDMSGLQA 781
G WY + L D + K + I +N +G G+ N + P G I +S +Q+
Sbjct: 629 TGDWYVQMLDLPEFLDYERRKSVFSSIMKYNYSGEEGVRNGSSNDDITPLG-IKLS-IQS 686
Query: 782 REIWPGVTYGLAASMIQEEMVDMAFQTAAGVYEVAWSEDGLGYSFQTPESWNNNDEYRSL 841
+ W GV Y LA+ M + A + VYE ++ ++ N+ E+ +
Sbjct: 687 KAPWSGVEYYLASHMFSSGFDEYAKKILRNVYE----------RYEIAGNFWNHIEWGA- 735
Query: 842 CYMRPLTIWAMQWALTKPKLS 862
YMRPL +M +A+ K++
Sbjct: 736 RYMRPLVALSMIYAIEGMKVN 756
>gi|10440618|gb|AAG16856.1|AC069145_5 hypothetical protein [Oryza sativa Japonica Group]
Length = 143
Score = 80.9 bits (198), Expect = 3e-12, Method: Compositional matrix adjust.
Identities = 42/126 (33%), Positives = 69/126 (54%), Gaps = 21/126 (16%)
Query: 9 GENAASMPKVDPGQPPQLTWQRKLDSNVKVPTGFTLSFQEFRHLAHIGFRLYRYSKQEEA 68
G A + D G PP+ W+R+L+S+ + F+++F E +
Sbjct: 21 GRTALQLLDFDGGSPPEQAWRRRLNSHANLLKEFSVTFMEAMRM---------------- 64
Query: 69 KGKVPVFNVFRKHHI--TSDQGIPLGGIGAGSIGRSYRGEFQRFKLFHGICDDAPVLANQ 126
K P+ + F K ++ QG+PLGG+G+GSI R +RGEF+ + + G+C+ +PV+ NQ
Sbjct: 65 --KAPI-DPFTKEKCKPSASQGVPLGGMGSGSISRGFRGEFKNWHIIPGLCETSPVMENQ 121
Query: 127 FSVFVS 132
FSV +S
Sbjct: 122 FSVMLS 127
>gi|374815490|ref|ZP_09719227.1| hypothetical protein TpriZ_16630 [Treponema primitia ZAS-1]
Length = 814
Score = 80.1 bits (196), Expect = 6e-12, Method: Compositional matrix adjust.
Identities = 79/325 (24%), Positives = 133/325 (40%), Gaps = 55/325 (16%)
Query: 513 YSSFALVMLFPKLELSIQR-DFAAAVMMHDPGTMKI--MSDGKWVARKCLGAVPHDIGLD 569
Y + A+ LFP+LE S++R DF +D G M+ M + + L AV +G
Sbjct: 421 YYAQAMAYLFPELEQSMRRTDFLTET--NDEGEMQFRAMRELNGASWGFLPAVDGQMGT- 477
Query: 570 DPWFEINSYNLFNSSRWKDLNSKFVLQVYRDFVATGDKNFARAVWPSVYIAMAY-MEQFD 628
++++YRD++ +GD F + +WP +A+ Y + +D
Sbjct: 478 ------------------------IVRLYRDWLISGDDAFLKELWPKAKLALDYGIRTWD 513
Query: 629 KDGDGMIENEGFPDQTYDAWSANGVSAYCGGLWVAALQAASALANDVGDHASASYFWVRY 688
DGD +++ D + + + + C ++AAL+AA +A + D + + + Y
Sbjct: 514 TDGDFVLDGMKHVDYDVEFYGIDPLGNLC---YLAALKAAVKIAEYLKDEGAHNRYQTIY 570
Query: 689 QKAKAVYDS-LWN--GSYF-----NYDNSDGSSSTSIQADQLAGQWYARACGLLPIADEA 740
A D +W+ YF + D T ADQL GQ+YA GL P+ D
Sbjct: 571 DTASVQADKFMWDSKAGYFIQVLKDVDEFKYQHGTGCLADQLIGQFYAHLTGLGPLVDAG 630
Query: 741 KVKKALTKIYDFNVLK-------------VKGGMCGAMNGMQPDGRIDMSGLQAREIWPG 787
+K+A I+ +N + V M GR + E W
Sbjct: 631 HIKQAAQSIFKYNFISDFTDHPNMQRGYAVNDDKGLLMTTWPLGGRPRYPFFYSEEAWSR 690
Query: 788 VTYGLAASMIQEEMVDMAFQTAAGV 812
Y +A+++I E +D A V
Sbjct: 691 TEYHVASTLIYEGFLDEGLTIAKAV 715
Score = 56.6 bits (135), Expect = 6e-05, Method: Compositional matrix adjust.
Identities = 72/298 (24%), Positives = 123/298 (41%), Gaps = 40/298 (13%)
Query: 91 LGGIGAGSIGRSYRGEFQRFKLFHGICDDAPVLANQFSVFVSRPNGEKFSSVLCPRSPGV 150
LGGIG GSI RG+ F+LF+ + L ++ F GE + S
Sbjct: 30 LGGIGTGSISLGSRGQLTDFELFN---HPSKGLRFPYTFFAVHAQGEGVNVTKVLESRLT 86
Query: 151 PKKNTDSGIESWDWN-----LKGENCTYHALFPRAWTVYDGEPDPELRIVCRQISPFIPH 205
P +G+ + + + L+ E FP A + + P ++ +PFIP
Sbjct: 87 PPYGAANGLPAQELSGLPRFLESE---MQVQFPFASIQFMDDRLP-VKAELEAFNPFIPL 142
Query: 206 NYKESSFPASVFTFTLSNSGQTSADVTLLFTWANSVAGDSGL--------SGHHFNSKTM 257
N +S PA++ + + N+ +++ + N+ G +G+ +G FN
Sbjct: 143 NANDSGIPAAILRYRIKNTSAAPLQISVAGSLYNA-GGFTGIEPFGNFINTGRPFNVYQK 201
Query: 258 TKDGVHGLTLHHRTANGRPP--VTFAVAAEETADVHVSECPCFLLSGNSKGITAKDMWNE 315
++ + G+ L + +G PP + + A T+D VS + + GI + W++
Sbjct: 202 EQE-IRGIFLDN---DGLPPASIEYGNLALMTSDESVSYKAEWQMGQRIDGI--HNFWDD 255
Query: 316 IKKHGSFDHLDNDKTSPSEPGS-------SIGAAIAASLTIPSGSTRSVTFSLAWDCP 366
+ G LDND G IG ++A S T+ G TF+LAW P
Sbjct: 256 FSEDGK---LDNDAPKMGVDGKLDKKILIKIG-SLAISRTLAPGEEYCYTFALAWYFP 309
>gi|15922864|ref|NP_378533.1| hypothetical protein ST2529 [Sulfolobus tokodaii str. 7]
gi|15623655|dbj|BAB67642.1| putative glycosidase [Sulfolobus tokodaii str. 7]
Length = 836
Score = 78.2 bits (191), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 104/440 (23%), Positives = 173/440 (39%), Gaps = 102/440 (23%)
Query: 488 EENIGQFLYLEGAEYVMYNTYDVHFYSSFALVMLFPKLELSIQRDFAAAVMMHDPGTMKI 547
E +G+ Y +G NT DV Y + L +L+P+L ++ + + + +D
Sbjct: 333 ERKVGEHPYTDGPVNTALNTIDVSTYFIYTLTVLYPQLAKNLLLTSSRSALSYDNPLYIF 392
Query: 548 MS------DGKWVAR---------------KCLGAVPHDIGLDDPWFEINSYNL----FN 582
S K+V R + + + + G DP I Y F+
Sbjct: 393 YSLAMPENRSKYVERVIKDPSIPSSIEKLLQTVKEIAKETG-KDPKGRIAHYIYRDLKFD 451
Query: 583 SSRWKDLNSKFVLQVYRDFVATGDKNFARAVWPSVYIAMAYM------------------ 624
DLN +FVL TGD F ++++P AM +
Sbjct: 452 EYGRNDLNPEFVLMWGLVSKYTGDIEFMKSLYPVAKEAMESVLRTHSYEGLIYSRLPSGF 511
Query: 625 ----EQFDKDGDGMIENEGF-------------PDQTYDAWSANGVSAYCGGLWVAALQA 667
+ F D I + F T+D W+ G++++ L ++AL+
Sbjct: 512 EWNRQVFSYFKDVNIYDNLFLVVSLLGIDSFHMSVNTFDDWTTIGINSFVSLLGISALKI 571
Query: 668 ASALANDVGDHASASYFWVRYQKAKAVYDS-LWNGSYFN--YDNSDGSSSTSIQADQLAG 724
+ L GD + + A ++Y+S LWNG YF+ YD G + QA QL G
Sbjct: 572 LNELG---GDKYNV-------ENALSLYESMLWNGEYFDLWYDPISGYRDKTCQASQLLG 621
Query: 725 QWYARACGLLPIADEAKVKKALTKIYDFNVLKVKGGMCGAMNGMQPDG------------ 772
++Y G + D K +K L I +N+ + +G +NG PDG
Sbjct: 622 EFYLNLLGY-SLLDREKTRKTLLSIVKYNLKEEEG----VINGAYPDGYRPLMREYENPL 676
Query: 773 RIDMSGLQAREIWPGVTYGLAASMIQEEMVDMAFQTAAGVYEVAWSEDGLGYSFQTPESW 832
+I + + W GV + LA+ +I E+M+D A + VY+ +S G +F W
Sbjct: 677 KIKEASIHQDTPWSGVEFYLASHLIYEKMIDEAKKVLKEVYD-RYSIAG---NFWNHWEW 732
Query: 833 NNNDEYRSLCYMRPLTIWAM 852
++ Y RPL+ W +
Sbjct: 733 GSH-------YSRPLSSWLI 745
>gi|227831316|ref|YP_002833096.1| hypothetical protein LS215_2489 [Sulfolobus islandicus L.S.2.15]
gi|229580221|ref|YP_002838621.1| hypothetical protein YG5714_2457 [Sulfolobus islandicus Y.G.57.14]
gi|227457764|gb|ACP36451.1| protein of unknown function DUF608 [Sulfolobus islandicus L.S.2.15]
gi|228010937|gb|ACP46699.1| protein of unknown function DUF608 [Sulfolobus islandicus
Y.G.57.14]
Length = 818
Score = 77.8 bits (190), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 103/430 (23%), Positives = 169/430 (39%), Gaps = 78/430 (18%)
Query: 491 IGQFLYLEGAEYVMYNTYDVHFYSSFALVMLFPKLELSIQRDFAAAVMMHDP-------- 542
+G++ Y +G E NT DV Y+ +++LFP L +I +D + + D
Sbjct: 347 VGKYPYTDGPENTALNTIDVLLYALPGVMLLFPDLAKNIVKDLSNRALKEDTPEYVIFSL 406
Query: 543 ----GTMK----IMSDGKWVA--RKCLGAVPHDIGL--DDPWFEINSYNLF----NSSRW 586
MK IM D +K G + + DP + Y + ++
Sbjct: 407 AFPENLMKYKEEIMKDPTISTDLKKLYGTIKRIVNETGKDPKGRMPHYIRYSLTVDTYER 466
Query: 587 KDLNSKFVLQVYRDFVATGDKNFARAVW------------------------PSVYIAMA 622
D+N +FVL Y TGD+ ++V+ PS +
Sbjct: 467 IDINPEFVLLYYLIAKYTGDRELLKSVYEVARNAIESIMRTQTMDGLPYLTLPSGIEWIR 526
Query: 623 YMEQFDKDGDGM----IENEGFPDQTYDAWSANGVSAYCGGLWVAALQA---ASALANDV 675
Y+ + D + QT D WS G S Y LW++AL+A AS L ++
Sbjct: 527 YVNSMLRANDAHKILGYHSLALSMQTLDDWSWLGFSPYVSFLWISALEALNEASKLLSNP 586
Query: 676 GDHASASYFWVRYQKAKAVYDSLWNGSYFN--YDNSDGSSSTSIQADQLAGQWYARACGL 733
++ + + V LWNG Y+ YD S A Q+ G WY + L
Sbjct: 587 QNYEVK-------ELIEKVNKYLWNGEYYMDWYDPISNLRDDSSNASQITGDWYVQMLDL 639
Query: 734 LPIADEAKVKKALTKIYDFNVLKVKGGMCGAMN-GMQPDGRIDMSGLQAREIWPGVTYGL 792
D + K + I +N +G G+ N + P G + +S +Q++ W GV Y L
Sbjct: 640 PEFLDYERRKSVFSSIMKYNYSGEEGVRNGSSNDDITPLG-VKLS-IQSKAPWSGVEYYL 697
Query: 793 AASMIQEEMVDMAFQTAAGVYEVAWSEDGLGYSFQTPESWNNNDEYRSLCYMRPLTIWAM 852
A M + A + VYE ++ ++ N+ E+ + YMRPL +M
Sbjct: 698 ALHMFYSGFDEYAKKILRNVYE----------RYEIAGNFWNHIEWGA-RYMRPLVALSM 746
Query: 853 QWALTKPKLS 862
+A+ K++
Sbjct: 747 IYAIEGMKVN 756
>gi|269925332|ref|YP_003321955.1| hypothetical protein Tter_0211 [Thermobaculum terrenum ATCC
BAA-798]
gi|269788992|gb|ACZ41133.1| protein of unknown function DUF608 [Thermobaculum terrenum ATCC
BAA-798]
Length = 811
Score = 77.4 bits (189), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 66/232 (28%), Positives = 106/232 (45%), Gaps = 22/232 (9%)
Query: 594 VLQVYRDFVATGDKNFARAVWPSVYIAMAYMEQF-DKDGDGMIENEGFPDQTYDAWSANG 652
V++VYR++ +GD F + +WP V A+ Y + DKD D ++E G TYD G
Sbjct: 471 VIRVYREWKFSGDDAFLKELWPKVKSAVNYSSLYWDKDQDFILE--GRQHNTYDI-EFYG 527
Query: 653 VSAYCGGLWVAALQAASALANDVGDHASASYFWVRYQKAKAVYDSLWNGSYFNYDNSDGS 712
+ G L++ AL+AA +A +G +++SY +K + LWNG Y+ D +
Sbjct: 528 PNPLTGILFLGALRAAEEMAKYLGSESASSYAQAFEASSKKLDQLLWNGEYYIQKLDDPN 587
Query: 713 S-----STSIQADQLAGQWYARACGLLPIADEAKVKKALTKIYDFNV------------L 755
++ +DQL GQ A GL + + + +AL I+ +N +
Sbjct: 588 EHRYQHASGCLSDQLFGQTLASLTGLGYLLPKEHISRALESIFAYNFKPNFWNHTNTQRV 647
Query: 756 KVKGGMCGAMNGMQPDG-RIDMSGLQAREIWPGVTYGLAASMIQEEMVDMAF 806
G G + P G R + E+W G Y +A MI E ++D A
Sbjct: 648 YALGDDAGLVMCTWPFGDRPSFPFPYSDEVWSGTEYQVATLMIYEGLLDEAL 699
Score = 65.9 bits (159), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 74/319 (23%), Positives = 127/319 (39%), Gaps = 36/319 (11%)
Query: 88 GIPLGGIGAGSIGRSYRGEFQRFKLFHGICDDAPVLANQFSVFVSRPNGEKFSSVLCPRS 147
PLGGIG G++ RGE + +++F+ L N + +RP GE + +
Sbjct: 25 AFPLGGIGTGNVSLGARGELRDWEIFNRPAKGFS-LPNTYFALWARPEGESPVTKVLQSL 83
Query: 148 PGVPKKNTDSGIESWDWNLKGENCT-YHALFPRAWTVYDGEPDPELRIVCRQISPFIPHN 206
P P + + L T + +P AW VY + D +++ +PF+P N
Sbjct: 84 PPPPHTQSHGYHPTSGLGLPHFTSTLFRGEYPIAW-VYFEDSDLPIKVTLESFTPFVPLN 142
Query: 207 YKESSFPASVFTFTLSNSGQTSADVTLLFTWANSV--------AGDSGLSGHHFNSKTMT 258
++S P + + + N DV + + N V G S G N+
Sbjct: 143 AEDSGIPGAYLIYRVENISPKRIDVAIAASITNPVGEVRRDRFGGFSVERGG--NANDFR 200
Query: 259 KDG-VHGLTLHHRTANGRP--PVTFAVAAEETADVHVSECPCFLLSGNSKGITAKDMWNE 315
+ G + GL L +N P + + + T + HV+ +L S + + ++ W +
Sbjct: 201 RQGRILGLFLR---SNSYPLYDLRYGDLSLVTMNSHVTFKRVWLPSRWACD-SVEEFWRD 256
Query: 316 IKKHGSFDHLDNDKTSPSEPGSSIGAAIAASLTIPSGSTRSVTFSLAW------------ 363
G + L + PSE G + +IAA TI G + F L+W
Sbjct: 257 FSTDGKLEDLGIE--DPSEEGQTYTGSIAAMETIAPGEVKDFVFILSWFFPNRIKSWDSD 314
Query: 364 --DCPEVKFFEKVYHRRYT 380
D +V+ Y RR++
Sbjct: 315 AEDLEDVEIVRNHYSRRFS 333
>gi|229581136|ref|YP_002839535.1| hypothetical protein YN1551_0454 [Sulfolobus islandicus Y.N.15.51]
gi|228011852|gb|ACP47613.1| protein of unknown function DUF608 [Sulfolobus islandicus
Y.N.15.51]
Length = 816
Score = 77.4 bits (189), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 103/430 (23%), Positives = 169/430 (39%), Gaps = 78/430 (18%)
Query: 491 IGQFLYLEGAEYVMYNTYDVHFYSSFALVMLFPKLELSIQRDFAAAVMMHDP-------- 542
+G++ Y +G E NT DV Y+ +++LFP L +I +D + + D
Sbjct: 345 VGKYPYTDGPENTALNTIDVLLYALPGVMLLFPDLAKNIVKDLSNRALKEDTPEYVIFSL 404
Query: 543 ----GTMK----IMSDGKWVA--RKCLGAVPHDIGL--DDPWFEINSYNLF----NSSRW 586
MK IM D +K G + + DP + Y + ++
Sbjct: 405 AFPENLMKYKEEIMKDPTISTDLKKLYGTIKRIVNETGKDPKGRMPHYIRYSLTVDTYER 464
Query: 587 KDLNSKFVLQVYRDFVATGDKNFARAVW------------------------PSVYIAMA 622
D+N +FVL Y TGD+ ++V+ PS +
Sbjct: 465 IDINPEFVLLYYLIAKYTGDRELLKSVYEVARNAIESIMRTQTMDGLPYLTLPSGIEWIR 524
Query: 623 YMEQFDKDGDGM----IENEGFPDQTYDAWSANGVSAYCGGLWVAALQA---ASALANDV 675
Y+ + D + QT D WS G S Y LW++AL+A AS L ++
Sbjct: 525 YVNSMLRANDAHKILGYHSLALSMQTLDDWSWLGFSPYVSFLWISALEALNEASKLLSNP 584
Query: 676 GDHASASYFWVRYQKAKAVYDSLWNGSYFN--YDNSDGSSSTSIQADQLAGQWYARACGL 733
++ + + V LWNG Y+ YD S A Q+ G WY + L
Sbjct: 585 QNYEVK-------ELIEKVNKYLWNGEYYMDWYDPISNLRDDSSNASQITGDWYVQMLDL 637
Query: 734 LPIADEAKVKKALTKIYDFNVLKVKGGMCGAMN-GMQPDGRIDMSGLQAREIWPGVTYGL 792
D + K + I +N +G G+ N + P G + +S +Q++ W GV Y L
Sbjct: 638 PEFLDYERRKSVFSSIMKYNYSGEEGVRNGSSNDDITPLG-VKLS-IQSKAPWSGVEYYL 695
Query: 793 AASMIQEEMVDMAFQTAAGVYEVAWSEDGLGYSFQTPESWNNNDEYRSLCYMRPLTIWAM 852
A M + A + VYE ++ ++ N+ E+ + YMRPL +M
Sbjct: 696 ALHMFYSGFDEYAKKILRNVYE----------RYEIAGNFWNHIEWGA-RYMRPLVALSM 744
Query: 853 QWALTKPKLS 862
+A+ K++
Sbjct: 745 IYAIEGMKVN 754
>gi|284998829|ref|YP_003420597.1| hypothetical protein [Sulfolobus islandicus L.D.8.5]
gi|284446725|gb|ADB88227.1| hypothetical protein LD85_2606 [Sulfolobus islandicus L.D.8.5]
Length = 803
Score = 77.4 bits (189), Expect = 4e-11, Method: Compositional matrix adjust.
Identities = 103/430 (23%), Positives = 169/430 (39%), Gaps = 78/430 (18%)
Query: 491 IGQFLYLEGAEYVMYNTYDVHFYSSFALVMLFPKLELSIQRDFAAAVMMHDP-------- 542
+G++ Y +G E NT DV Y+ +++LFP L +I +D + + D
Sbjct: 332 VGKYPYTDGPENTALNTIDVLLYALPGVMLLFPDLAKNIVKDLSNRALKEDTPEYVIFSL 391
Query: 543 ----GTMK----IMSDGKWVA--RKCLGAVPHDIGL--DDPWFEINSYNLF----NSSRW 586
MK IM D +K G + + DP + Y + ++
Sbjct: 392 AFPENLMKYKEEIMKDPTISTDLKKLYGTIKRIVNETGKDPKGRMPHYIRYSLTVDTYER 451
Query: 587 KDLNSKFVLQVYRDFVATGDKNFARAVW------------------------PSVYIAMA 622
D+N +FVL Y TGD+ ++V+ PS +
Sbjct: 452 IDINPEFVLLYYLIAKYTGDRELLKSVYEVARNAIESIMRTQTMDGLPYLTLPSGIEWIR 511
Query: 623 YMEQFDKDGDGM----IENEGFPDQTYDAWSANGVSAYCGGLWVAALQA---ASALANDV 675
Y+ + D + QT D WS G S Y LW++AL+A AS L ++
Sbjct: 512 YVNSMLRANDAHKILGYHSLALSMQTLDDWSWLGFSPYVSFLWISALEALNEASKLLSNP 571
Query: 676 GDHASASYFWVRYQKAKAVYDSLWNGSYFN--YDNSDGSSSTSIQADQLAGQWYARACGL 733
++ + + V LWNG Y+ YD S A Q+ G WY + L
Sbjct: 572 QNYEVK-------ELIEKVNKYLWNGEYYMDWYDPISNLRDDSSNASQITGDWYVQMLDL 624
Query: 734 LPIADEAKVKKALTKIYDFNVLKVKGGMCGAMN-GMQPDGRIDMSGLQAREIWPGVTYGL 792
D + K + I +N +G G+ N + P G + +S +Q++ W GV Y L
Sbjct: 625 PEFLDYERRKSVFSSIMKYNYSGEEGVRNGSSNDDITPLG-VKLS-IQSKAPWSGVEYYL 682
Query: 793 AASMIQEEMVDMAFQTAAGVYEVAWSEDGLGYSFQTPESWNNNDEYRSLCYMRPLTIWAM 852
A M + A + VYE ++ ++ N+ E+ + YMRPL +M
Sbjct: 683 ALHMFYSGFDEYAKKILRNVYE----------RYEIAGNFWNHIEWGA-RYMRPLVALSM 731
Query: 853 QWALTKPKLS 862
+A+ K++
Sbjct: 732 IYAIEGMKVN 741
>gi|284045431|ref|YP_003395771.1| hypothetical protein Cwoe_3979 [Conexibacter woesei DSM 14684]
gi|283949652|gb|ADB52396.1| protein of unknown function DUF608 [Conexibacter woesei DSM 14684]
Length = 870
Score = 77.4 bits (189), Expect = 4e-11, Method: Compositional matrix adjust.
Identities = 69/248 (27%), Positives = 111/248 (44%), Gaps = 41/248 (16%)
Query: 594 VLQVYRDFVATGDKNFARAVWPSVYIAMAYMEQ-FDKDGDGMIENEGFPDQTYDAWSANG 652
VL+ YR+ G + + W ++ M ++++ ++ GDG++ E TYD + +G
Sbjct: 509 VLKTYREARQGGGEAWLADRWDALVRLMDHVDRTWNSAGDGVLRGE--QPCTYDI-ALHG 565
Query: 653 VSAYCGGLWVAALQAASALANDVGDHASASYFWVRYQKAKAVYDSLWNGSYFN------- 705
+ GGLW+AAL++ +A +G +A Y VR+++A+ Y+ L+NG Y+
Sbjct: 566 PNLLIGGLWLAALRSMEEIALRLGVAGAAGYR-VRFEQARGGYEKLFNGEYYAQPVTGEP 624
Query: 706 YDNSDGSSSTSIQADQLAGQWYARACGLLPIADEAKVKKALTKIYDFNVLKVKGGMCGAM 765
+D DG S DQL GQW+A L + D +V+ AL I N ++ G +
Sbjct: 625 HDFGDGCLS-----DQLLGQWWAHQLELGHLLDPERVRSALRAIVAHN---LREGFGARV 676
Query: 766 NGMQP---------------------DGRIDMSGLQAREIWPGVTYGLAASMIQEEMVDM 804
QP GR D+ E+W GV Y +AA I E + D
Sbjct: 677 ADEQPPGHRVFADGEDSGLVVCSWPRGGRPDVPLRYCDEVWSGVEYAVAAHCIDEGLEDE 736
Query: 805 AFQTAAGV 812
V
Sbjct: 737 GLALVEAV 744
>gi|336426603|ref|ZP_08606612.1| hypothetical protein HMPREF0994_02618 [Lachnospiraceae bacterium
3_1_57FAA_CT1]
gi|336010619|gb|EGN40601.1| hypothetical protein HMPREF0994_02618 [Lachnospiraceae bacterium
3_1_57FAA_CT1]
Length = 850
Score = 77.0 bits (188), Expect = 5e-11, Method: Compositional matrix adjust.
Identities = 178/787 (22%), Positives = 284/787 (36%), Gaps = 171/787 (21%)
Query: 89 IPLGGIGAGSIGRSYRGEFQRFKLFHGICDDAPVLA--NQFSVFVSRPNGEK-------F 139
P+GG+G+G IG + G ++++ + P + N +S F + E
Sbjct: 16 FPVGGLGSGCIGIAGNGCLIDWQVY-----NRPAMGTRNGYSHFAIKAQDETGLLDARVL 70
Query: 140 SSVLCPRSPGVPKKNTDSGIESWDWNLKGENCTYHALFPRAWTVYDGEPDPELRIVCRQI 199
S L G + T G+ + + +P A + E P ++ R
Sbjct: 71 VSDLEKPYMGQVDEKTMLGMPHF------RELVFEGRYPAAKLSFLDERFPG-KVTLRYY 123
Query: 200 SPFIPHNYKESSFPASVFTFTLSNSGQTSADVTLLFTWANSVAGDSGLSGHHFNSKTMTK 259
+PFIP N K+SS P + N+ T N G + S H+F K
Sbjct: 124 NPFIPLNDKDSSIPGLFADIIVENTSDHPIRYTAAGVITNPNKG-TNTSNHYFEDK---- 178
Query: 260 DGVHGLT-----LHHRTANGRPPVTFAVAAEETADVHVSECPCFLLSGNSKGITAKDMWN 314
G+ G+ + R A G T A +EE +C + GN + + W
Sbjct: 179 -GIRGIRFATDGMDERDA-GYSDFTLATDSEE------GDCQEYWFRGNWRD-HVETYWR 229
Query: 315 EIKKHGSFDHLDNDKTSPSEPGSSIGAAIAASLTIPSGSTRSVTFSLAWDCPEVKFFEKV 374
E S L N + P + A I G + + L W+ P
Sbjct: 230 EF---ASEQPLKNRRYETGLPDYKDTGMLCAVQDIGPGEKKEFHYVLTWNVPNC------ 280
Query: 375 YHRRYTKFYGTLGDSAARIAHDAILEHAKWECEIEAWQRPILEDKRFPEWYPITLFNEL- 433
++ L S + I +D C+ W+ Y LF +
Sbjct: 281 -----NNYWSGLDISTSNIHYDV--------CDRTPWKN-----------YYAVLFADSG 316
Query: 434 ----YYLNAGGTIWTDGSPPMQSLATIRERKFSLDTPSKIHTPASSDTALGTRLLENGE- 488
Y L +W + M ++ + + +LD + S T L LE+G
Sbjct: 317 ASASYGLENRARLWEETKSFMDAVYSTTIPEEALDAVTANLAILRSPTCL---RLEDGSF 373
Query: 489 -------ENIGQFLYLEGAEYVMYNTYDVHFYSSFALVMLFPKLELSIQRDFAAAVMMHD 541
+++G EG ++N ++AL LFP L SI RD D
Sbjct: 374 YGWEGCFDDVGS---CEGTCTHVWNY-------AYALPFLFPALARSI-RDLEFTYSQQD 422
Query: 542 PGTMKIMSDGKWVARKCLGAVPHDIGLDDPWFEINSYNLFNSSRWKDLNSKFVLQVYRDF 601
G ++ R LG+ F+S L S V+ YRD+
Sbjct: 423 DGGVEFR------LRLPLGSTG---------------KRFHSCVDGQLGS--VMLTYRDY 459
Query: 602 VATGDKNFARAVWPSVYIAMAYM------EQFDKDGDGMIENEGFPDQTYDAWSANGVSA 655
+GD+ + W SV A+AY + +D+ G+++ G T D G +A
Sbjct: 460 KISGDREWLEGKWESVKKAIAYAWSEENPDAWDRKKQGVLD--GRQHHTLDM-ELFGPNA 516
Query: 656 YCGGLWVAALQAASALANDVGDHASASYFWVRYQKAKAVYD-SLWNGSYF--NYDNSDGS 712
+ G +V AL+AA+ +A +GD + + + KA + L+NG YF D D S
Sbjct: 517 WLQGFYVGALEAAAEMAAILGDGEAEAEYRALAAGGKAYLNKELFNGEYFYQKIDVCDRS 576
Query: 713 SSTSIQA----------------------DQLAGQWYARACGLLPIADEAKVKKALTKIY 750
+ A DQ+ QW+ CGL I ++ KV+KAL I+
Sbjct: 577 AVDRFNASKEYWNEEKGQIKYQIAEGCGIDQVLAQWHGNLCGLPEIFEKEKVRKALASIF 636
Query: 751 DFNVLKVKGGM------------CGAMNGMQPDG--RIDMSGLQAREIWPGVTYGLAASM 796
+N G + GA+ P+G + + A+E G+ Y AA M
Sbjct: 637 RYNYKPALGDIYNPWRIFAIDEESGAIMCEWPEGTDKPAIPLTYAQECMDGMQYQAAAHM 696
Query: 797 IQEEMVD 803
IQE M +
Sbjct: 697 IQEGMTE 703
>gi|253761387|ref|XP_002489099.1| hypothetical protein SORBIDRAFT_0073s002040 [Sorghum bicolor]
gi|241947398|gb|EES20543.1| hypothetical protein SORBIDRAFT_0073s002040 [Sorghum bicolor]
Length = 175
Score = 76.3 bits (186), Expect = 7e-11, Method: Compositional matrix adjust.
Identities = 55/172 (31%), Positives = 73/172 (42%), Gaps = 63/172 (36%)
Query: 19 DPGQPPQLTWQRKLDSNVKVPTGFTLSFQEFRHLAHIGFRLYRYSKQEEAKGKVPVFNVF 78
D G PP+ W+RKL S+ F ++F+E + +G RL+ Y ++E
Sbjct: 31 DDGSPPEHAWRRKLSSHANRLKEFNVTFREAIRMMKLGLRLWSYIREE------------ 78
Query: 79 RKHHITSDQGIPLGGIGAGSIGRSYRGEFQRFKLFHGICDDAPVLANQFSVFVSRPNGEK 138
S GR AP+ + F+ +RP+
Sbjct: 79 ------------------ASHGRK-----------------API--DPFTREGNRPSA-- 99
Query: 139 FSSVLCPRSPGVP----KKNTDSGIESWDWNLKGENCTYHALFPRAWTVYDG 186
S GVP K+ D GI SWDW LKG+ TYHALFPRAWTVYDG
Sbjct: 100 --------SQGVPLGGMGKSIDDGISSWDWKLKGDRSTYHALFPRAWTVYDG 143
>gi|297725625|ref|NP_001175176.1| Os07g0444000 [Oryza sativa Japonica Group]
gi|255677726|dbj|BAH93904.1| Os07g0444000 [Oryza sativa Japonica Group]
Length = 87
Score = 72.0 bits (175), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 32/77 (41%), Positives = 53/77 (68%), Gaps = 2/77 (2%)
Query: 55 IGFRLYRYSKQEEAKGKVPVFNVFRKH--HITSDQGIPLGGIGAGSIGRSYRGEFQRFKL 112
+G RL+ Y ++E + G+ + F + ++ QG+PLGG+G+GSI R +RGEF+ +++
Sbjct: 3 LGLRLWSYIREEASHGRKAPIDPFTRESDKPSASQGVPLGGMGSGSISRGFRGEFKHWQI 62
Query: 113 FHGICDDAPVLANQFSV 129
G C+ +PV+ANQFSV
Sbjct: 63 IPGSCEMSPVMANQFSV 79
>gi|160932020|ref|ZP_02079412.1| hypothetical protein CLOLEP_00853 [Clostridium leptum DSM 753]
gi|156869062|gb|EDO62434.1| hypothetical protein CLOLEP_00853 [Clostridium leptum DSM 753]
Length = 822
Score = 71.2 bits (173), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 173/795 (21%), Positives = 309/795 (38%), Gaps = 138/795 (17%)
Query: 91 LGGIGAGSIGRSYRGEFQRFKLFHGICDDAPVLANQFSVFVSRPNGEKFSSVLCPR-SPG 149
LGGIG G++ RGE + +++F+ + F++ GE S +L + +P
Sbjct: 32 LGGIGTGNVSVGARGELRSWQIFNEPGQFNFLPYTFFALRAEVEGGEAVSKILESKLNP- 90
Query: 150 VPKKNTDSGIESWDWNL-KGENCTYHALFPRAWTVYDGEPDPELRIVCRQISPFIPHNYK 208
P + + + L + E T A +P + + P +++ +PFIP N
Sbjct: 91 -PFTHPEGTLRCELGGLPRFEKSTMSAEYPFVRVKLEDKQVP-VQVEMEAFTPFIPLNAD 148
Query: 209 ESSFPASVFTFTLSNSGQTSADVTLLFTWANSVAGDS-------GLSGHHFNSKTMTKDG 261
+S P + + + N V++ + AN+V + L+G+ N + ++G
Sbjct: 149 DSGIPGAYLNYRVKNPTNQPVTVSIAGSLANAVGFEKYDIWNHMKLNGNPVN-EYQERNG 207
Query: 262 VHGLTLH------HRTANGRPPVTFAVAAEETADVHVSECPCFLLSGNSKGITAKDMWNE 315
+ GL ANG T +A T++ V+ P + + G A+D W++
Sbjct: 208 LKGLFFRGENVPEKHIANG----TMCLA---TSEDDVTAKPLWFQGEWTDG--AQDFWDD 258
Query: 316 IKKHGSFDHLDNDKTSPSEPGSSIGAAIAASLTIPSGSTRSVTFSLAWDCPEVKFFEKVY 375
GS D +D+ K + G +I S S S S+ E + F +
Sbjct: 259 FISDGSLD-VDSGKV---QEGCAIAKIYDFSYLHFSDSIGSICIKKVLKPGEERVFPFLL 314
Query: 376 HRRYTKFYGTLGDSAARIAHDAILEHAKWECEIEAWQRPILEDKRFPEWYPITLFNELY- 434
+ GD+ +H K ++ + I+ + Y TLF + +
Sbjct: 315 TWNFPNRVRDWGDTD---------KHVK---AVQEYHYEIVGN------YYSTLFQDAWT 356
Query: 435 ---YLNAGGTIWT-DGSPPMQSLATIRERKFSLDTPSKIHTPASSDTALGTRLLENGEEN 490
Y+N I D Q+ + + L+ + T S T T ENG+
Sbjct: 357 VADYMNQKKEILEGDSRKFSQAFFSSTLPGYVLEAVADNITILRSPTCFRT---ENGD-- 411
Query: 491 IGQFLYLEGAEYVM---YNTYDVHFYSSFALVMLFPKLELSIQRDFAAAVMMHDPGTMKI 547
F EG E + T + + + LFP+LE S++R ++
Sbjct: 412 ---FFGWEGVENTVGCGAGTCTHVWNYAQTVAFLFPELEQSMRR-------------IEF 455
Query: 548 MSDGKWVARKCLGAVPHDIGLDDPWFEINSYNLFNSSRWKDLNSK-----FVLQVYRDFV 602
+ + + D + +Y F W+ L S V+++YR++
Sbjct: 456 LQETQ----------------ADGYMPFRTYRPFGLPAWEMLPSADGQLGAVVRLYREWR 499
Query: 603 ATGDKNFARAVWPSVYIAMAY-MEQFDKDGDGMIENEGFPDQ----TYDAWSANGVSAYC 657
+GD W V +M Y + +D DGD + PD TYD G+++
Sbjct: 500 LSGDNRLIEDCWDGVVRSMEYAIRTWDTDGDFV------PDSAQHVTYDT-ELYGMTSMV 552
Query: 658 GGLWVAALQAASALANDVGDHASASYFWVRYQK-AKAVYDSLWNGSYF-----NYDNSDG 711
L+ AAL AA+ +A +GD A + +K AK + + WNG Y+ + D
Sbjct: 553 STLFFAALIAAAEMAEYMGDQERARRYRDGAEKGAKLMDEKCWNGEYYIQLIDDVDKYRY 612
Query: 712 SSSTSIQADQLAGQWYARACGLLPIADEAKVKKALTKIYDFNVLKVKGGMCGAMNGMQPD 771
+DQL GQ+ A+ GL + + VKKA I+ +N ++ +
Sbjct: 613 QYGKGCLSDQLLGQFLAQEAGLGYVLPKNHVKKAAESIFRYNFIERASDFGHVQRAYIFN 672
Query: 772 GRIDMS------GLQAR-------EIWPGVTYGLAASMIQEEMVDMAFQTAAGVYEVA-- 816
+ ++ G + R E+W G+ Y +AA +I+E++++ V +
Sbjct: 673 DEMGLTPCTWPLGERPRFPFIYFGEVWTGIEYEVAALLIREDLLEEGLTIVKAVRDRQDG 732
Query: 817 -----WSEDGLGYSF 826
WSE+ GY +
Sbjct: 733 YRRNPWSENESGYYY 747
>gi|410638454|ref|ZP_11349015.1| hypothetical protein GLIP_3609 [Glaciecola lipolytica E3]
gi|410141863|dbj|GAC16220.1| hypothetical protein GLIP_3609 [Glaciecola lipolytica E3]
Length = 1124
Score = 71.2 bits (173), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 85/352 (24%), Positives = 148/352 (42%), Gaps = 43/352 (12%)
Query: 594 VLQVYRDFVATGDKNFARAVWPSVYIAMAY-MEQFDKDGDGMIENEGFPDQTYDAWSANG 652
+L YR+ + + ++ ++ WP V A+ + + ++D + G + N T D + G
Sbjct: 726 ILNTYREHQCSANDSWLKSQWPRVKKAVEWGINEWDPNRIGFMSNT--QHNTLDG-NITG 782
Query: 653 VSAYCGGLWVAALQAASALANDVGDHASAS-YFWVRYQKAKAVYDSLWNGSYFNYDNSDG 711
S++ G L++ +L+AA+ +A+ VG+ +A Y +R K LWNG Y Y +G
Sbjct: 783 CSSWIGSLYLTSLEAAARMADIVGEPQTAKEYRAIRESGKKLQNKRLWNGEY--YIQEEG 840
Query: 712 SSS-----TSIQADQLAGQWYARACGLLPIADEAKVKKALTKIYDFNV--------LKVK 758
DQL G+W+A + + ++A++ + +N LK +
Sbjct: 841 KERFQDYLDGCHIDQLLGEWWADQLNIDRNYPRERAQQAMSALLKYNFRADFYGQSLKPR 900
Query: 759 GGMCGAMNGMQ----PDGRIDMSGLQ-AREIWPGVTYGLAASMIQEEMVD---MAFQTAA 810
GM+ P G+ + G++ E+ G YG A +M+Q M+D M + A
Sbjct: 901 QYCEIEDGGMKMITWPRGQQPIPGMKYGDEVMTGFEYGAAVTMMQNGMIDEGLMLLKVVA 960
Query: 811 GVYEVAWSEDGLGYSFQTPESWNNNDEYRSLC---YMRPLTIWAMQWALTKPKLSRQEIK 867
Y+ +G+ P ++ N C Y R L++W+ AL Q
Sbjct: 961 DRYDGRLRTEGVSNVVNGPWGYSGNPYGDDECGKYYGRSLSVWSALTAL-------QGFI 1013
Query: 868 HE-----ISDRDSSYLEQHAAFSKVASLLKLPKEEASKGFLKVVYDFTIGRV 914
++ I R E HA+F VA L ++ S L V G V
Sbjct: 1014 YDGPAGLIGFRPILQPENHASFFSVAQGYGLFEQTQSNNALTAVIAMREGSV 1065
Score = 60.1 bits (144), Expect = 5e-06, Method: Compositional matrix adjust.
Identities = 47/178 (26%), Positives = 76/178 (42%), Gaps = 17/178 (9%)
Query: 194 IVCRQISPFIPHNYKESSFPASVFTFTLSNSGQTSADVTLLFTWANSVAGDSGLSGHH-F 252
+ +PFIP + + SS P +++ T+ N+ ++L + N++ G G
Sbjct: 409 VALEVFNPFIPMDLQNSSIPCAIYQVTVKNTSTDEVTTSVLSSQKNALGYTEGEKGEFGS 468
Query: 253 NSKTMTKDGVHGLTLHHRTANGRPPVTFAVAAEETADVHVSECPCFLLSGNSKGITAKDM 312
N T+ K G TL H T G P V + AD +G + + +
Sbjct: 469 NQNTLLKKG--NTTLLHMTRAGSHPKADMVLLTQAADA----------TGTTNWTSPAEF 516
Query: 313 WNEIKKHGSFDHLDNDKTSP-SEPGSSIGAAIAASLTIPSGSTRSVTFSLAWDCPEVK 369
E +G +N SP S G ++ A++A LT+ G ++SVTF L W E K
Sbjct: 517 KREFTDNGGLTGPNN---SPISRAGQTVNGALSAPLTLAPGESKSVTFVLTWYFTETK 571
>gi|326436882|gb|EGD82452.1| hypothetical protein PTSG_11966 [Salpingoeca sp. ATCC 50818]
Length = 1720
Score = 70.9 bits (172), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 85/337 (25%), Positives = 145/337 (43%), Gaps = 58/337 (17%)
Query: 90 PLGGIGAGSIGRSYRGEFQRFKLFHGICDDAPVLANQFSVFVSRPNGEKFSSVLCPRSPG 149
PLG G G I + G +++++ + IC DA V + F+++ + G + VL
Sbjct: 886 PLGPCGGGCIALAGDGGLRQWQILNRICHDAHVPDSFFALYTNDGKGGVDARVL-QLLKD 944
Query: 150 VPKKNTDSGIESWDWNLKGENCTYHALFPRAWTVYDGEPDPELRIVCRQISPFIPHNYKE 209
+P S IE ++G+ + + +T + + +C P IP + K
Sbjct: 945 LPHL---SAIE-----IEGK----YPVMDVDYTSLNNGIKASMECMC----PCIPLDSKN 988
Query: 210 SSFPASVFTFTLSNSGQTSADVTLLFTWANSVAGDSGLS----------GHHFNSKTMTK 259
S P F FTL+N+ + DV+LL + N V D + G + N+K MT
Sbjct: 989 SGLPVIFFNFTLTNTTSDAVDVSLLMSQHNLVGWDGETAIENETEFAGYGGNVNAK-MTM 1047
Query: 260 DGVHGLTLHHRTANGRPPVT--FAVAAEETADVHVSECPCFLLSGNSKGITA-------K 310
DGV + + P A+AA L+G+S ++ K
Sbjct: 1048 DGVTAVDMTSTQFRDTDPQAGHIAIAA---------------LTGDSSEVSTMLQYDAHK 1092
Query: 311 DMWNEIKKHGSFDHLDNDKTSPSEPGSSIGAAIAASLTIPSGSTRSVTFSLAWDCPE--V 368
+W+ K G +++PS G + A+ S T+ +G+T ++TF LAW P V
Sbjct: 1093 TLWSHFAKTGKLP--GQGQSTPSPSGKTWNGAVCVSRTLAAGATTTITFVLAWHFPNRYV 1150
Query: 369 KFFEKVYHRRYTKFYGTLGDSAARI--AHDAILEHAK 403
+ +K ++ TK LG++ A + +LE+AK
Sbjct: 1151 NWSQKGFYIDDTKSQYWLGNAYANFWPSITEVLEYAK 1187
>gi|423347418|ref|ZP_17325105.1| hypothetical protein HMPREF1060_02777 [Parabacteroides merdae
CL03T12C32]
gi|409217877|gb|EKN10851.1| hypothetical protein HMPREF1060_02777 [Parabacteroides merdae
CL03T12C32]
Length = 837
Score = 69.7 bits (169), Expect = 8e-09, Method: Compositional matrix adjust.
Identities = 76/320 (23%), Positives = 128/320 (40%), Gaps = 49/320 (15%)
Query: 594 VLQVYRDFVATGDKNFARAVWPSVYIAMAYM---EQFDKDGDGMIENEGFPDQTYDAWSA 650
V+++YRD+ +GD F + W V +AY + +D + DG++E G T D +
Sbjct: 476 VMKIYRDWQLSGDDEFLQKNWGQVKKVLAYAWTDKGWDGNQDGVME--GLQHNTMDV-NY 532
Query: 651 NGVSAYCGGLWVAALQAASALANDVGDHASASYFWVRYQKAKAVYDS-LWNGSYFNYDNS 709
G + G ++ AL+AA +A + D A +++ D+ L+NG Y+ + +
Sbjct: 533 FGPNPQMGFWYMGALKAAEKMALAMKDKTFAKKCNTLFRQGSTWMDANLFNGEYYEHKIT 592
Query: 710 DGSSSTSIQ------------------ADQLAGQWYARACGLLPIADEAKVKKALTKIYD 751
D + + DQL GQ+ A CGL + D+ ++ L I
Sbjct: 593 DPETFEYLDMRNPDVKVPPFQLGKGCLVDQLVGQYMAHICGLGYLGDKEHIRTTLGSIMK 652
Query: 752 FNVLK------------VKGGMCGAMNGMQPDGRIDMSGLQAREIWPGVTYGLAASMIQE 799
+N +K V G G + P GR+++ E+ G Y A MI E
Sbjct: 653 YNYVKDFSRHFNNMRSYVMGDESGLLMASWPKGRLEVPFPYFAEVMTGFEYCAAVGMIYE 712
Query: 800 EMVDMAFQTAAGVYEVAWSEDGLGYS-FQTPESWNNNDEYRSLCYMRPLTIWAMQWALTK 858
M A + + DG + F E ++ Y R + WA AL+
Sbjct: 713 GMEKEALTCIRAIRD---RHDGAKRNPFSEAECGHH--------YARSMASWAAVIALSD 761
Query: 859 PKLSRQEIKHEISDRDSSYL 878
+ S + + +DR Y
Sbjct: 762 FQYSGVDRRMHFTDRPGCYF 781
Score = 59.3 bits (142), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 68/322 (21%), Positives = 128/322 (39%), Gaps = 43/322 (13%)
Query: 73 PVFNVFRKHHITSDQGIPLGGIGAGSIGRSYRGEFQRFKLFH---------GICDDAPVL 123
PV + + H+ +PLGGIG G++ RGE + +++ + ++AP
Sbjct: 18 PVLKQYDREHLYQ-VALPLGGIGTGTVSLGGRGELRDWEIMNVPGKKYSTVTTGNNAPF- 75
Query: 124 ANQFSVFVSRPNGEKFSSVLCPRSPGVPKKNTDSGIESWDWNLKG----ENCTYHALFPR 179
F+++ GE +++L P P++ E N G + T+ A +P
Sbjct: 76 ---FAIYAKPQTGEAMTTLLA--GPLYPQEYLH--YEGRPVNHHGMPRFADATFEAAYPF 128
Query: 180 AWTVYDGEPDPELRIVCRQISPFIPHNYKESSFPASVFTFTLSNSGQTSADVTLLFTWAN 239
V+ + +++ + +P +P + S P +V + ++N+ +V + + N
Sbjct: 129 G-QVHLSDRQLPVKVTVKGFNPLVPGDADASGMPIAVLAYEVTNTTDRPLEVAVCGSMRN 187
Query: 240 SVA-----------GDSGLSGHHFNSKTM-TKDGVHGLTLHHRTANGRPPVTFAVAAEET 287
+ GD +G N K+G+ GL L+ + P VA
Sbjct: 188 FIGKDGSKFRTDWKGDYIPTGAKDNKNQFRKKNGLQGLYLYSDGVDKNDPAYGTVALTTQ 247
Query: 288 ADVHVSECPCFLLSGNSKGITAKDMWNEIKKHGSFDHLDNDKTSPSEPGSSIGAAIAASL 347
A V+ + GI + W++ G + D+ +P +S+ A
Sbjct: 248 AVEGVTYRTSSRADDWNNGIL--NFWDDFSADGEL--TERDRQEDEDPMASL----AVKK 299
Query: 348 TIPSGSTRSVTFSLAWDCPEVK 369
T+ GST + TF L W+ P K
Sbjct: 300 TVGPGSTETFTFYLTWNFPNRK 321
>gi|154490957|ref|ZP_02030898.1| hypothetical protein PARMER_00874 [Parabacteroides merdae ATCC
43184]
gi|423724888|ref|ZP_17699030.1| hypothetical protein HMPREF1078_02927 [Parabacteroides merdae
CL09T00C40]
gi|154088705|gb|EDN87749.1| hypothetical protein PARMER_00874 [Parabacteroides merdae ATCC
43184]
gi|409236060|gb|EKN28869.1| hypothetical protein HMPREF1078_02927 [Parabacteroides merdae
CL09T00C40]
Length = 837
Score = 69.3 bits (168), Expect = 9e-09, Method: Compositional matrix adjust.
Identities = 76/320 (23%), Positives = 128/320 (40%), Gaps = 49/320 (15%)
Query: 594 VLQVYRDFVATGDKNFARAVWPSVYIAMAYM---EQFDKDGDGMIENEGFPDQTYDAWSA 650
V+++YRD+ +GD F + W V +AY + +D + DG++E G T D +
Sbjct: 476 VMKIYRDWQLSGDDEFLQKNWGQVKKVLAYAWTDKGWDGNQDGVME--GLQHNTMDV-NY 532
Query: 651 NGVSAYCGGLWVAALQAASALANDVGDHASASYFWVRYQKAKAVYDS-LWNGSYFNYDNS 709
G + G ++ AL+AA +A + D A +++ D+ L+NG Y+ + +
Sbjct: 533 FGPNPQMGFWYMGALKAAEKMALAMKDKTFAKKCNTLFRQGSTWMDANLFNGEYYEHKIT 592
Query: 710 DGSSSTSIQ------------------ADQLAGQWYARACGLLPIADEAKVKKALTKIYD 751
D + + DQL GQ+ A CGL + D+ ++ L I
Sbjct: 593 DPETFEYLDMRNPDVKVPPFQLGKGCLVDQLVGQYMAHICGLGYLGDKEHIRTTLGSIMK 652
Query: 752 FNVLK------------VKGGMCGAMNGMQPDGRIDMSGLQAREIWPGVTYGLAASMIQE 799
+N +K V G G + P GR+++ E+ G Y A MI E
Sbjct: 653 YNYVKDFSRHFNNMRSYVMGDESGLLMASWPKGRLEVPFPYFAEVMTGFEYCAAVGMIYE 712
Query: 800 EMVDMAFQTAAGVYEVAWSEDGLGYS-FQTPESWNNNDEYRSLCYMRPLTIWAMQWALTK 858
M A + + DG + F E ++ Y R + WA AL+
Sbjct: 713 GMEKEALTCIRAIRD---RHDGAKRNPFSEAECGHH--------YARSMASWAAVIALSD 761
Query: 859 PKLSRQEIKHEISDRDSSYL 878
+ S + + +DR Y
Sbjct: 762 FQYSGVDRRMHFTDRPGCYF 781
Score = 59.7 bits (143), Expect = 8e-06, Method: Compositional matrix adjust.
Identities = 68/322 (21%), Positives = 128/322 (39%), Gaps = 43/322 (13%)
Query: 73 PVFNVFRKHHITSDQGIPLGGIGAGSIGRSYRGEFQRFKLFH---------GICDDAPVL 123
PV + + H+ +PLGGIG G++ RGE + +++ + ++AP
Sbjct: 18 PVLKQYDREHLYQ-VALPLGGIGTGTVSLGGRGELRDWEIMNVPGKKYSTVTTGNNAPF- 75
Query: 124 ANQFSVFVSRPNGEKFSSVLCPRSPGVPKKNTDSGIESWDWNLKG----ENCTYHALFPR 179
F+++ GE +++L P P++ E N G + T+ A +P
Sbjct: 76 ---FAIYAKPQTGEAMTTLLA--GPLYPQEYLH--YEGRPVNHHGMPRFADATFEAAYPF 128
Query: 180 AWTVYDGEPDPELRIVCRQISPFIPHNYKESSFPASVFTFTLSNSGQTSADVTLLFTWAN 239
V+ + +++ + +P +P + S P +V + ++N+ +V + + N
Sbjct: 129 G-QVHLSDRQLPVKVTVKGFNPLVPGDADASGMPIAVLAYEVTNTTDRPLEVAVCGSMRN 187
Query: 240 SVA-----------GDSGLSGHHFNSKTM-TKDGVHGLTLHHRTANGRPPVTFAVAAEET 287
+ GD +G N K+G+ GL L+ + P VA
Sbjct: 188 FIGKDGSKFRTDWKGDYIPTGAKDNKNQFRKKNGLQGLYLYSDGVDKNDPAYGTVALTTQ 247
Query: 288 ADVHVSECPCFLLSGNSKGITAKDMWNEIKKHGSFDHLDNDKTSPSEPGSSIGAAIAASL 347
A V+ + GI + W++ G + D+ +P +S+ A
Sbjct: 248 AVEGVTYRTSSRADNWNNGIL--NFWDDFSADGEL--TERDRQEDEDPMASL----AVKK 299
Query: 348 TIPSGSTRSVTFSLAWDCPEVK 369
T+ GST + TF L W+ P K
Sbjct: 300 TVGPGSTETFTFYLTWNFPNRK 321
>gi|160931643|ref|ZP_02079037.1| hypothetical protein CLOLEP_00474 [Clostridium leptum DSM 753]
gi|156869288|gb|EDO62660.1| hypothetical protein CLOLEP_00474 [Clostridium leptum DSM 753]
Length = 804
Score = 68.9 bits (167), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 63/241 (26%), Positives = 108/241 (44%), Gaps = 26/241 (10%)
Query: 594 VLQVYRDFVATGDKNFARAVWPSVYIAMAYMEQF-DKDGDGMIENEGFPDQTYDAWSANG 652
+L+VYR++ +GD +F + +W V A+ + + D + D ++ +G TYD G
Sbjct: 477 ILRVYREWKISGDDDFLKKIWNQVVSALEFSASYWDSNQDCVL--DGQQHNTYDI-EFYG 533
Query: 653 VSAYCGGLWVAALQAASALANDVGDHASASYFWVRYQKA--KAVYDSLWNGSYF------ 704
V++ ++ AAL+A + +A +G+H S W ++A K + + L+NG Y+
Sbjct: 534 VNSLGNSIYYAALKAGAEMAEYLGEH-ERSQKWRSMEQAGCKRMDEMLFNGEYYRQVTDG 592
Query: 705 NYDNSDGSSSTSIQADQLAGQWYARACGLLPIADEAKVKKALTKIYDFNVLKVKGGMCGA 764
+ D +DQL GQ A GL + E VK A+ IY +N + G
Sbjct: 593 DIDEYKYQYGEGCLSDQLLGQTLAHLYGLGHLFPEDHVKSAVFAIYKYNFKERMGSHKSL 652
Query: 765 MNGM----QP---------DGRIDMSGLQAREIWPGVTYGLAASMIQEEMVDMAFQTAAG 811
G +P GR + + E+W G+ Y +AA +I E + +
Sbjct: 653 QRGYAYQDEPGLLLCSWPSGGRPKQPFVYSDEVWTGIEYQVAAGLIYEGFLQEGLEIVRA 712
Query: 812 V 812
V
Sbjct: 713 V 713
Score = 54.7 bits (130), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 80/334 (23%), Positives = 136/334 (40%), Gaps = 43/334 (12%)
Query: 91 LGGIGAGSIGRSYRGEFQRFKLFHGICDDAPVLANQFSVFVSRPNGEKFSSVLCPRSPGV 150
LGGIG G+I + RGE F++F+ + + F+++ S P+GE + VL + G+
Sbjct: 24 LGGIGTGNISLNARGELCDFEIFNRQSKGLKLPYSFFALWYSLPSGESDALVLEAKPQGL 83
Query: 151 ---PKKNTDS---GIESWDWNLKGENCTYHAL-FPRAWTVYDGEPDPELRIVCRQISPFI 203
P S G+ +D +C A F DG P + + +PFI
Sbjct: 84 CDAPMGVAPSLVPGLPRFD-----SSCFSSAYPFANIQLKKDGLP---FEVTLQAYTPFI 135
Query: 204 PHNYKESSFPASVFTFTLSNSGQTSADVTLLFTWANSVAGDS----GLSGHHFN----SK 255
P + S P F + L N + T N+ DS GL+ + ++
Sbjct: 136 PLDDVNSGIPGIRFVYRLKNLLDQPVKAAVSATMPNACGFDSFSTDGLNKLYLKGRPENR 195
Query: 256 TMTKDGVHGLTLHHRTANGRPPVTFAVAAEETADVH-VSECPCFLLSGNSKGITAKDMWN 314
+ K+G+ G+ + T + + F A T + S P +L G G A++ W
Sbjct: 196 VVKKNGLQGIC--YSTDLEKEQLGFGTMALLTDEQEGFSAKPRWLNEGWWDG--AEEYWQ 251
Query: 315 EIKKHGSFDHLDN-----DKTSPSEPGSSIGAAIAASLTIPSGSTRSVTFSLAWDCP-EV 368
++ + G + + + +PS S+IG +IA + G ++ F L W P
Sbjct: 252 DLMEDGCLGEVVSGGKPVSRINPSGEESAIG-SIAVQKVLQPGESKDFVFYLTWHFPNRY 310
Query: 369 KFFEKVYHRR--------YTKFYGTLGDSAARIA 394
++ + R Y +Y TL D A +A
Sbjct: 311 GWWPDGHDARKPIPCADIYQNYYSTLWDDALAVA 344
>gi|116624646|ref|YP_826802.1| hypothetical protein Acid_5570 [Candidatus Solibacter usitatus
Ellin6076]
gi|116227808|gb|ABJ86517.1| protein of unknown function DUF608 [Candidatus Solibacter usitatus
Ellin6076]
Length = 844
Score = 67.8 bits (164), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 51/187 (27%), Positives = 82/187 (43%), Gaps = 30/187 (16%)
Query: 594 VLQVYRDFVATGDKNFARAVWPSVYIAMAYM---EQFDKDGDGMIENEGFPDQTYDAWSA 650
++ Y D+ +GD + R +WP V A+ + +D D DG++E G TYD
Sbjct: 480 IMHAYLDWSLSGDDAWLREMWPRVKKAIEFAWIPGGWDGDRDGVLE--GAQHNTYDV-EF 536
Query: 651 NGVSAYCGGLWVAALQAASALANDVGDHASASYFWVRYQKAKAVYDS-LWNGSYF----- 704
G + CG ++ AL+A +A VGD ASA+ + ++ + D+ L+NG Y+
Sbjct: 537 YGPNPMCGIYYLGALRAGEEMARAVGDTASAAEYRRLFESGRQWIDAKLFNGEYYIQQIR 596
Query: 705 ------------------NYDNSDGSSSTSIQADQLAGQWYARACGLLPIADEAKVKKAL 746
+ +N DQL GQ+ A GL P+ ++K L
Sbjct: 597 GVAKDKVAKAIIGTMGSDDTENPQYQVGAGCLVDQLVGQYMAEVAGLGPLVSPENIRKTL 656
Query: 747 TKIYDFN 753
IY +N
Sbjct: 657 QSIYRYN 663
Score = 52.4 bits (124), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 70/299 (23%), Positives = 119/299 (39%), Gaps = 28/299 (9%)
Query: 76 NVFRKHHITSDQGIPLGGIGAGSIGRSYRGEFQRFKLFHGICDDAPVLANQF-SVFVSRP 134
VF +T+ PLGG+ AGSI RG+ + +++F+ D + F +++V
Sbjct: 34 RVFSGRKLTA-IAFPLGGVCAGSISLGGRGQLRDWEIFN-RSDKGNAPSYAFPAIWVQSG 91
Query: 135 NGEKFSSVLCPRSPGVPKKNTDSGIESWDWNLKGENCTYHALFPRAWTVYDGEPDPELRI 194
+ + VL R + G S ++ T+ FP A + P +++
Sbjct: 92 ARKPVARVLESRIQPPYEGQNGLGTRSAPGLVRLAGATFTGEFPLARIDFTDAALP-VKV 150
Query: 195 VCRQISPFIPHNYKESSFPASVFTFTLSNSGQTSADVTLLFTWANSVAGDSGLSGHHFNS 254
+PFIPH +S P +V + +SN G +A V++ ++ +S G + + N
Sbjct: 151 SLEAFTPFIPHEPDDSGLPVAVLRYRVSNPGAAAAKVSIAWS-IDSPVGRTQAADTRTN- 208
Query: 255 KTMTKDGVHGLTLHHRTANGRPPV------TFAVAAEETADVHVSECPCFLLSGNSKGIT 308
+ G+ GL + N P +FA+AA TA V+ +G
Sbjct: 209 EYREAGGLAGLMM----TNAELPAADALKGSFALAAMGTAGARVTHL---------RGWE 255
Query: 309 AKDMWNEIKKHGSFDHLDND-KTSPSEPGSSIGAAIAASLTIPSGSTRSVTFSLAWDCP 366
WN + +D D + P + A+ TI +G F LAW P
Sbjct: 256 QGRWWNSPMLY--WDDFSADGELGPEAAKMNAVGALCVQRTIAAGGHADFPFLLAWHFP 312
>gi|269839491|ref|YP_003324183.1| hypothetical protein Tter_2468 [Thermobaculum terrenum ATCC
BAA-798]
gi|269791221|gb|ACZ43361.1| protein of unknown function DUF608 [Thermobaculum terrenum ATCC
BAA-798]
Length = 807
Score = 67.0 bits (162), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 72/242 (29%), Positives = 107/242 (44%), Gaps = 30/242 (12%)
Query: 594 VLQVYRDFVATGDKNFARAVWPSVYIAMAYME-QFDKDGDGMIENEGFPDQTYDAWSANG 652
+L+V R++ +GD + R VWP V A+AY Q+D DGDG+++ TYD G
Sbjct: 462 ILRVLREWRLSGDMEWLRRVWPGVKSALAYASAQWDGDGDGVLDGR--QHNTYDV-EFYG 518
Query: 653 VSAYCGGLWVAALQAASALANDVGDHASASYFWVRYQKAKAVYDS-LWNGSYF----NYD 707
+ CG ++AAL+A LA +G+ A +++ D LWNG Y+ + D
Sbjct: 519 PNPLCGLYYLAALRAVEELAEVLGEANLARRCREVFERGSHRLDELLWNGEYYEQLGDED 578
Query: 708 NSDGSSSTSIQADQLAGQWYARACGLLPIADEAKVKKALTKIYDFN-------------- 753
ADQL GQ A G+ + E V++AL I+ +N
Sbjct: 579 AWPYQHGKGCLADQLLGQLQAHVLGMGYLVPEEHVRQALRSIWRYNFRADLSEHVNCQRT 638
Query: 754 -VLKVKGG--MCGAMNGMQPDGRIDMSGLQAREIWPGVTYGLAASMIQEEMVDMAFQTAA 810
L + G MC G QP S E++ G Y +AA MI E +++ Q
Sbjct: 639 FALNDEAGLLMCTWPRGGQPTFPFPYSD----EVFTGSEYQVAAHMIYEGLLEEGLQIVG 694
Query: 811 GV 812
V
Sbjct: 695 AV 696
Score = 58.2 bits (139), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 81/385 (21%), Positives = 142/385 (36%), Gaps = 70/385 (18%)
Query: 89 IPLGGIGAGSIGRSYRGEFQRFKLFHGICDDAPVLANQFSVFVSRPNGEKFSSVLCPRSP 148
PLGGIG G++ RG+ + +++ + + F+++ +G + VL P
Sbjct: 17 FPLGGIGTGNVSLGARGDLRDWEICNRPAKGNRLPNTFFALWTRTASGRCVTRVL--EGP 74
Query: 149 GVPKKNTDSGIE-SWDWNL-KGENCTYHALFPRAWTVYDGEPDPELRIVCRQISPFIPHN 206
P + G S L + + + +P AW V +P +R+ +P IP
Sbjct: 75 IQPPHDGSHGYHPSTGAGLPRCRHSRFRGEYPLAW-VELADPALPVRVSLEAYTPLIPLC 133
Query: 207 YKESSFPASVFTFTLSNSGQTSADVTLLFTWANSVAGDS-----GLSGHHFNSKTMTKDG 261
++S P ++ ++T+ N+ +TL+ + N V G + G N +
Sbjct: 134 PEDSGIPCAILSYTVENTSDEVVALTLVGSLTNPVGGVTFDRFGNYRGAGGNQNEYRESE 193
Query: 262 VH-GLTLHHRT-------------ANGRPPVTFAVAAEETADVHVSECPCFLLSGNSKGI 307
V+ GL L T A P +T+ A L G +
Sbjct: 194 VYRGLWLRSTTLPSDDLRYGDMTLATDHPVITYKRA---------------WLRG-AWWD 237
Query: 308 TAKDMWNEIKKHGSFDHLDNDKTSPSEPGSSIGAAIAASLTIPSGSTRSVTFSLAWDCPE 367
++ W ++ + G + L + SPSEPG + ++ + G T + F L W P
Sbjct: 238 YLREFWQDLSEDGRLEDLGYE--SPSEPGKTDTCSLGLRDELAPGETHTYRFVLTWYFPN 295
Query: 368 V---------KFFEKVYHRRYTKFYGTLG---------DSAARIAHDAILEHAKWECEIE 409
+ Y R+ + G + ++ HDA+ E +C +E
Sbjct: 296 RPHSWREEAGEIVRNHYASRFASSWDVAGYVFTHLARLEQGTKLFHDALFESTLPDCVLE 355
Query: 410 AWQRPI----------LEDKRFPEW 424
A I LED RF W
Sbjct: 356 ALSANIVPLRSNTCFWLEDGRFLAW 380
>gi|284997210|ref|YP_003418977.1| hypothetical protein [Sulfolobus islandicus L.D.8.5]
gi|284445105|gb|ADB86607.1| hypothetical protein LD85_0902 [Sulfolobus islandicus L.D.8.5]
Length = 498
Score = 66.6 bits (161), Expect = 6e-08, Method: Compositional matrix adjust.
Identities = 48/167 (28%), Positives = 76/167 (45%), Gaps = 29/167 (17%)
Query: 513 YSSFALVMLFPKLELSIQRDFAAAVMMHDPGTMKIMSDGKWVARKCLGAVPHDIG---LD 569
+ S ++++FP+LE S + + V + G VPHD+G LD
Sbjct: 350 FGSLPVILMFPELEKSFLKLLISYV-------------------RSDGYVPHDLGFHSLD 390
Query: 570 DPWFEINSYNLFNSSRWKDLNSKFVLQVYRDFVATGDKNFARAVWPSVYIAMAYMEQFDK 629
P S +WKD+N +L VYR F T D +F + V+P++ M + + +
Sbjct: 391 SPIDGTTS-----PPKWKDMNPSLILLVYRYFKFTNDIDFLKEVYPTIVKVMDW--ELRQ 443
Query: 630 DGDGMIENEGFPDQTYDAWSANGVSAYCGGLWVAALQAASALANDVG 676
DG+ EG D +DA G +Y L++A+L A +A VG
Sbjct: 444 CRDGLPFMEGEMDNAFDATIIKGHDSYTSSLFIASLIAMREIAKLVG 490
>gi|423217835|ref|ZP_17204331.1| hypothetical protein HMPREF1061_01104 [Bacteroides caccae
CL03T12C61]
gi|392627338|gb|EIY21373.1| hypothetical protein HMPREF1061_01104 [Bacteroides caccae
CL03T12C61]
Length = 851
Score = 65.1 bits (157), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 72/322 (22%), Positives = 132/322 (40%), Gaps = 53/322 (16%)
Query: 594 VLQVYRDFVATGDKNFARAVWPSVYIAMAYM---EQFDKDGDGMIENEGFPDQTYDAWSA 650
++++YR++ +GD +F + W V ++Y + +D + DG++E G T D +
Sbjct: 488 IMKIYREWQLSGDNDFLKNNWEQVKKVLSYAWIEKGWDGNQDGVME--GSQHNTMDV-NY 544
Query: 651 NGVSAYCGGLWVAALQAASALANDVGDHASASYFWVRYQKAKAVYD-SLWNGSYFNYDNS 709
G + G ++ AL+AA ++ + D A ++K D +L+NG Y+ + +
Sbjct: 545 FGPNPQMGFWYMGALKAAEKMSIAMKDKNFAKKCRTLFEKGSEWMDENLFNGEYYEHKIT 604
Query: 710 DGSS------------------STSIQADQLAGQWYARACGLLPIADEAKVKKALTKIYD 751
D + DQL GQ+ A CGL + D+ ++ + I
Sbjct: 605 DPKTFEFLDMNDPDVKIPGFQLGQGCLVDQLVGQYMAHLCGLGYLGDKKNIQTTMKSIMK 664
Query: 752 FNVLK------------VKGGMCGAMNGMQPDGRIDMSGLQAREIWPGVTYGLAASMIQE 799
+N ++ V G G + P GR+++ E+ G Y A M+ E
Sbjct: 665 YNFVEDFSRHFNNMRSYVMGDEAGLLMASWPKGRLEVPFPYFSEVMTGFEYCAAVGMLYE 724
Query: 800 EMVDMAFQTAAGVYEVAWSEDGLGYSFQTPESWNNNDEYRSLC---YMRPLTIWAMQWAL 856
M + A + DG + + P S S C Y R + W+ AL
Sbjct: 725 GMEEDALTCINAIRR---RHDG---AKRNPFS-------ESECGHHYARSMASWSAIIAL 771
Query: 857 TKPKLSRQEIKHEISDRDSSYL 878
++ + S + +I+DR +Y
Sbjct: 772 SEFQYSGVDKSMKITDRPGNYF 793
Score = 46.6 bits (109), Expect = 0.056, Method: Compositional matrix adjust.
Identities = 67/338 (19%), Positives = 132/338 (39%), Gaps = 39/338 (11%)
Query: 65 QEEAKGKVPVFNVFRKHHITSDQGIPLGGIGAGSIGRSYRGEFQRFKLFHGICDDAPVLA 124
QE + PV + K H+ +PLGGIG G++ RGE + +++ + V A
Sbjct: 22 QEWKPQQWPVLKKYDKEHLYQ-IALPLGGIGTGTVSLGGRGELRDWEIMN-------VPA 73
Query: 125 NQFSVFVSRPNGEKFSSVLCPRSPGVPKKNTDSGIESWDW-NLKGENCTYHAL------- 176
++S + N F+ P++ + S ++ + +G +H L
Sbjct: 74 KKYSTVTTGNNAPFFAIYAKPQNQKATTTLLAGPLYSHEYLHYEGRPVNHHGLPRFEQAS 133
Query: 177 FPRAW---TVYDGEPDPELRIVCRQISPFIPHNYKESSFPASVFTFTLSNSGQTSADVTL 233
F A+ V + + +++ + +PFIP + S P +V ++ ++N+ +V++
Sbjct: 134 FEAAYPFGQVSLSDSNLPVKVKVKGFNPFIPGDADASGLPVAVLSYEVTNTTDQPMNVSI 193
Query: 234 LFTWANSVA-----------GDSGLSGHHFN-SKTMTKDGVHGLTLHHRTANGRPPVTFA 281
+ N + GD +G N +K + + + G+ L+ +
Sbjct: 194 CGSIRNFIGKDGSKFRTDWKGDYIPTGAKNNKNKYVESNNIKGIYLYSDGVDKDDSAWGT 253
Query: 282 VAAEETADVHVSECPCFLLSGNSKGITAKDMWNEIKKHGSFDHLDNDKTSPSEPGSSIGA 341
VA A V+ + I + W++ G + K +P A
Sbjct: 254 VALTTQAVNGVTYRTSSKADNWNNSIL--NFWDDFSADGML--TERSKQEDEDP----MA 305
Query: 342 AIAASLTIPSGSTRSVTFSLAWDCPEVKFFEKVYHRRY 379
+++ + TI ST + TF + W P K + V Y
Sbjct: 306 SLSVNKTIAPQSTETFTFYITWSFPNRKAWSSVVVGNY 343
>gi|153807893|ref|ZP_01960561.1| hypothetical protein BACCAC_02179 [Bacteroides caccae ATCC 43185]
gi|149129502|gb|EDM20716.1| hypothetical protein BACCAC_02179 [Bacteroides caccae ATCC 43185]
Length = 851
Score = 65.1 bits (157), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 72/322 (22%), Positives = 132/322 (40%), Gaps = 53/322 (16%)
Query: 594 VLQVYRDFVATGDKNFARAVWPSVYIAMAYM---EQFDKDGDGMIENEGFPDQTYDAWSA 650
++++YR++ +GD +F + W V ++Y + +D + DG++E G T D +
Sbjct: 488 IMKIYREWQLSGDNDFLKNNWEQVKKVLSYAWIEKGWDGNQDGVME--GSQHNTMDV-NY 544
Query: 651 NGVSAYCGGLWVAALQAASALANDVGDHASASYFWVRYQKAKAVYD-SLWNGSYFNYDNS 709
G + G ++ AL+AA ++ + D A ++K D +L+NG Y+ + +
Sbjct: 545 FGPNPQMGFWYMGALKAAEKMSIAMKDKNFAKKCRTLFEKGSEWMDENLFNGEYYEHKIT 604
Query: 710 DGSS------------------STSIQADQLAGQWYARACGLLPIADEAKVKKALTKIYD 751
D + DQL GQ+ A CGL + D+ ++ + I
Sbjct: 605 DPKTFEFLDMNDPDVKIPGFQLGQGCLVDQLVGQYMAHLCGLGYLGDKKNIQTTMKSIMK 664
Query: 752 FNVLK------------VKGGMCGAMNGMQPDGRIDMSGLQAREIWPGVTYGLAASMIQE 799
+N ++ V G G + P GR+++ E+ G Y A M+ E
Sbjct: 665 YNFVEDFSRHFNNMRSYVMGDEAGLLMASWPKGRLEVPFPYFSEVMTGFEYCAAVGMLYE 724
Query: 800 EMVDMAFQTAAGVYEVAWSEDGLGYSFQTPESWNNNDEYRSLC---YMRPLTIWAMQWAL 856
M + A + DG + + P S S C Y R + W+ AL
Sbjct: 725 GMEEDALTCINAIRR---RHDG---AKRNPFS-------ESECGHHYARSMASWSAIIAL 771
Query: 857 TKPKLSRQEIKHEISDRDSSYL 878
++ + S + +I+DR +Y
Sbjct: 772 SEFQYSGVDKSMKITDRPGNYF 793
Score = 46.6 bits (109), Expect = 0.059, Method: Compositional matrix adjust.
Identities = 67/338 (19%), Positives = 132/338 (39%), Gaps = 39/338 (11%)
Query: 65 QEEAKGKVPVFNVFRKHHITSDQGIPLGGIGAGSIGRSYRGEFQRFKLFHGICDDAPVLA 124
QE + PV + K H+ +PLGGIG G++ RGE + +++ + V A
Sbjct: 22 QEWKPQQWPVLKKYDKEHLYQ-IALPLGGIGTGTVSLGGRGELRDWEIMN-------VPA 73
Query: 125 NQFSVFVSRPNGEKFSSVLCPRSPGVPKKNTDSGIESWDW-NLKGENCTYHAL------- 176
++S + N F+ P++ + S ++ + +G +H L
Sbjct: 74 KKYSTVTTGNNAPFFAIYAKPQNQKATTTLLAGPLYSHEYLHYEGRPVNHHGLPRFEQAS 133
Query: 177 FPRAW---TVYDGEPDPELRIVCRQISPFIPHNYKESSFPASVFTFTLSNSGQTSADVTL 233
F A+ V + + +++ + +PFIP + S P +V ++ ++N+ +V++
Sbjct: 134 FEAAYPFGQVSLSDSNLPVKVKVKGFNPFIPGDADASGLPVAVLSYEVTNTTDQPMNVSI 193
Query: 234 LFTWANSVA-----------GDSGLSGHHFN-SKTMTKDGVHGLTLHHRTANGRPPVTFA 281
+ N + GD +G N +K + + + G+ L+ +
Sbjct: 194 CGSIRNFIGKDGSKFRTDWKGDYIPTGAKNNKNKYVESNNIKGIYLYSDGVDKDDSAWGT 253
Query: 282 VAAEETADVHVSECPCFLLSGNSKGITAKDMWNEIKKHGSFDHLDNDKTSPSEPGSSIGA 341
VA A V+ + I + W++ G + K +P A
Sbjct: 254 VALTTQAVNGVTYRTSSKADNWNNSIL--NFWDDFSADGML--TERSKQEDEDP----MA 305
Query: 342 AIAASLTIPSGSTRSVTFSLAWDCPEVKFFEKVYHRRY 379
+++ + TI ST + TF + W P K + V Y
Sbjct: 306 SLSVNKTIAPQSTETFTFYITWSFPNRKAWSSVVVGNY 343
>gi|348173841|ref|ZP_08880735.1| hypothetical protein SspiN1_25464 [Saccharopolyspora spinosa NRRL
18395]
Length = 995
Score = 65.1 bits (157), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 122/509 (23%), Positives = 193/509 (37%), Gaps = 117/509 (22%)
Query: 341 AAIAASLTIPSGSTRSVTFSLAW-----DCPEVKFFE--KVYHRRYTKFYGTLGDSAARI 393
A + A +T+ G R + F LAW D + F E + R Y +G+ D A +
Sbjct: 427 AVVRAGMTLAPGEQRMIRFVLAWFFPVPDREGLSFLEHSRELLRHYAARFGSAQDVVAHL 486
Query: 394 AHDAILEHAKWECEIEAWQRPILEDKRFPEWYPITLFNELYYLNAGGTIWTDGSPPMQSL 453
A E + AWQR WY D + P L
Sbjct: 487 AR----EQERLIDGTRAWQR---------TWY------------------EDSTLPHWLL 515
Query: 454 ATIRERKFSLDTPSKIHTPASSDTALGTRLLENGEENIGQFLYLEGAEYVMYNTYDVHFY 513
+ A++ T L T + G+F EGA Y T ++
Sbjct: 516 ERV----------------ATNTTILSTSTCYRFAD--GRFYGWEGA-YCCAGTCTHVWH 556
Query: 514 SSFALVMLFPKLELSIQRDFAAAVMMHDPGTMKIMSDGKWVARKCLGAVPHDIGLDDPWF 573
+ AL LFP +E + R LG IG
Sbjct: 557 YAHALARLFPTVERDTRE------------------------RVDLG-----IGFHAATG 587
Query: 574 EINSYNLFNSSRWKDLNSKFVLQVYRDFVATGDKNFARAVWPSVYIAMAYMEQFDKDGDG 633
+++ + S D + +L++YR+ + D + + VWP A+ Y+ D + DG
Sbjct: 588 QLSMRGEADRSPAVDGQAGTILRIYREHQMSPDSQWLQRVWPRTRQAVEYLIASDAEPDG 647
Query: 634 MIENEGFPDQTYDAWSANGVSAYCGGLWVAALQAASALANDVGDHASASYFW-VRYQKAK 692
++ +G T DA + G +++ GL++AAL+A +A+A +VGD A A + +
Sbjct: 648 IL--DGAQPNTQDA-TWFGRNSWLSGLYLAALRAGAAMAGEVGDDAFARRCTELATSGTE 704
Query: 693 AVYDSLWNGSYFNYD---NSDGSSSTS--IQADQLAGQWYARACGLLPIADEAKVKKALT 747
+ L+NG YF ++ GS +T+ ADQL G +A GL + + AL
Sbjct: 705 VIVRDLFNGEYFVHELDPAHPGSVNTNRGCFADQLLGPSWATQLGLPRVLPPEPTRSALR 764
Query: 748 KIYDFNVL----------KVKGG------------MCGAMNGMQPDGRIDMSGLQAREIW 785
I+ N + ++GG MC NG + + S E W
Sbjct: 765 SIWRHNFVPRPMEYRQNSPIEGGRIFYDADVPALVMCTWPNGGGDEAGDNWSVSYFNEAW 824
Query: 786 PGVTYGLAASMIQEEMVDMAFQTAAGVYE 814
G+ Y +AA +I E MVD A V++
Sbjct: 825 HGIEYQVAAHLIAEGMVDEGLAVARSVHD 853
Score = 50.8 bits (120), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 51/187 (27%), Positives = 77/187 (41%), Gaps = 26/187 (13%)
Query: 88 GIPLGGIGAGSIGRSYRGEFQRFKLFHGICDDAPVLANQFSVFVSRPNGEKFSSVLCPRS 147
G+P+GGIG G + G +L+ D+ AN + +RP + P S
Sbjct: 28 GLPIGGIGCGQLHLGGDG-----RLWLWDVDNRTAPANINDLHFTRP--------MLPSS 74
Query: 148 P---GVPKKNTDSGIESWDW-NLKGENCTYHALFPRAWTVYDGEPDPELRIVCRQISPFI 203
P G + TD W + +G A P A TV +P +++ SPFI
Sbjct: 75 PVEQGFAVRVTDGDDVRTRWLDARGFPDATFAGRPPAATVDYADPQEPVQVRLDACSPFI 134
Query: 204 PHNYKESSFPASVFTFTLSNSGQTSADVTLLFTWANSVAGDSGLSGHH-----FNSKTMT 258
P +SS+PA +T +N+G + DV + AN V L+ H S+
Sbjct: 135 PTEIDDSSYPAVFLDYTATNTGPATVDVAIAGFLANPVC----LTSRHTRPLRLRSQEFA 190
Query: 259 KDGVHGL 265
DG G+
Sbjct: 191 LDGAAGV 197
>gi|160890984|ref|ZP_02071987.1| hypothetical protein BACUNI_03431 [Bacteroides uniformis ATCC 8492]
gi|156859205|gb|EDO52636.1| hypothetical protein BACUNI_03431 [Bacteroides uniformis ATCC 8492]
Length = 858
Score = 64.7 bits (156), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 83/368 (22%), Positives = 137/368 (37%), Gaps = 75/368 (20%)
Query: 594 VLQVYRDFVATGDKNFARAVWPSVYIAMAY-MEQFDKDGDGMIENEGFPDQTYDAWSANG 652
+++VYR++ GD N+ + ++P+V +M Y + +D G +E + W ANG
Sbjct: 498 IMKVYREWRIMGDINWLKMMYPAVKKSMDYCIRTWDPREVGAVEEPHHNTYDIEFWGANG 557
Query: 653 VSAYCGGLWVAALQAASALANDVGDHASASYFWVRYQ----KAKAVYDS-LWNGSYF--- 704
+ +V ALQA + VGD S RY+ K+KA +S L++G YF
Sbjct: 558 MIT---SFYVGALQAICLMGKTVGDDIS------RYEELMNKSKAYLESKLFDGEYFIQN 608
Query: 705 ----NYDNSDGSSSTSI-------------------------QADQLAGQWYARACGLLP 735
D D + S +D + G W CG+
Sbjct: 609 IQWTGLDAPDPVDAQSFVTHYTPEALKILQEEGPKYQYGKGCLSDGILGSWMTLVCGMPE 668
Query: 736 IADEAKVKKALTKIYDFNVLK-------------VKGGMCGAMNGMQPD-GRIDMSGLQA 781
+ D KVK L ++ +N K G G + P G++ + + +
Sbjct: 669 VVDRLKVKSHLISVHKYNFKKSLSNHVNPQRSTFALGEDGGLLLCTWPKGGKLKLPFVYS 728
Query: 782 REIWPGVTYGLAASMIQEEMVDMAFQTAAGVYEVAWSEDGLGYSFQTPESWNNNDEYRSL 841
E+W G+ Y +AA ++ E V+ + V D + P N+
Sbjct: 729 NEVWTGIEYQVAAHLMFEGEVEKGLEI------VRTCRDRYNGRVRNP----FNEYECGA 778
Query: 842 CYMRPLTIWAMQWALTKPKLSRQE----IKHEISDRDSSYLEQHAAFSKVASLLKLPKEE 897
Y R + +AM LT + + I I D +S+L F V P +
Sbjct: 779 WYARAMASYAMLEGLTGIRYDAVDKILYIDSRIGDDFTSFLSTETGFGNVGLKEGKPFID 838
Query: 898 ASKGFLKV 905
G + V
Sbjct: 839 VKYGVIDV 846
>gi|270294306|ref|ZP_06200508.1| conserved hypothetical protein [Bacteroides sp. D20]
gi|270275773|gb|EFA21633.1| conserved hypothetical protein [Bacteroides sp. D20]
Length = 851
Score = 64.7 bits (156), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 84/369 (22%), Positives = 138/369 (37%), Gaps = 77/369 (20%)
Query: 594 VLQVYRDFVATGDKNFARAVWPSVYIAMAY-MEQFDKDGDGMIENEGFPDQTYDAWSANG 652
+++VYR++ GD N+ + ++P+V +M Y + +D G +E + W ANG
Sbjct: 491 IMKVYREWRIMGDINWLKMMYPAVKKSMDYCIRTWDPREVGAVEEPHHNTYDIEFWGANG 550
Query: 653 VSAYCGGLWVAALQAASALANDVGDHASASYFWVRYQ----KAKAVYDS-LWNGSYF--- 704
+ +V ALQA + VGD S RY+ K+KA +S L++G YF
Sbjct: 551 MIT---SFYVGALQAICLMGKTVGDDIS------RYEELMNKSKAYLESKLFDGEYFIQN 601
Query: 705 ----NYDNSDGSSSTSI-------------------------QADQLAGQWYARACGLLP 735
D D + S +D + G W CG+
Sbjct: 602 IQWTGLDAPDPVDAQSFVTHYTPEALKILQEEGPKYQYGKGCLSDGILGSWMTLVCGMPE 661
Query: 736 IADEAKVKKALTKIYDFNVLK-------------VKGGMCGAMNGMQPD-GRIDMSGLQA 781
+ D KVK L ++ +N K G G + P G++ + + +
Sbjct: 662 VVDRLKVKSHLISVHKYNFKKSLSNHVNPQRSTFALGEDGGLLLCTWPKGGKLKLPFVYS 721
Query: 782 REIWPGVTYGLAASMIQEEMVDMAFQTAAGVYEVAWSEDGLGYSFQTPESWNNNDEYRSL 841
E+W G+ Y +AA ++ E V+ + V D + P +EY
Sbjct: 722 NEVWTGIEYQVAAHLMFEGEVEKGLEI------VRTCRDRYNGRVRNP-----FNEYECG 770
Query: 842 C-YMRPLTIWAMQWALTKPKLSRQE----IKHEISDRDSSYLEQHAAFSKVASLLKLPKE 896
Y R + +AM LT + + I I D +S+L F V P
Sbjct: 771 AWYARAMASYAMLEGLTGIRYDAVDKILYIDSRIGDDFTSFLSTETGFGNVGLKEGKPFI 830
Query: 897 EASKGFLKV 905
+ G + V
Sbjct: 831 DVKYGVIDV 839
>gi|317480931|ref|ZP_07940011.1| hypothetical protein HMPREF1007_03130 [Bacteroides sp. 4_1_36]
gi|316902824|gb|EFV24698.1| hypothetical protein HMPREF1007_03130 [Bacteroides sp. 4_1_36]
Length = 851
Score = 64.7 bits (156), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 83/368 (22%), Positives = 137/368 (37%), Gaps = 75/368 (20%)
Query: 594 VLQVYRDFVATGDKNFARAVWPSVYIAMAY-MEQFDKDGDGMIENEGFPDQTYDAWSANG 652
+++VYR++ GD N+ + ++P+V +M Y + +D G +E + W ANG
Sbjct: 491 IMKVYREWRIMGDINWLKMMYPAVKKSMDYCIRTWDPREVGAVEEPHHNTYDIEFWGANG 550
Query: 653 VSAYCGGLWVAALQAASALANDVGDHASASYFWVRYQ----KAKAVYDS-LWNGSYF--- 704
+ +V ALQA + VGD S RY+ K+KA +S L++G YF
Sbjct: 551 MIT---SFYVGALQAICLMGKTVGDDIS------RYEELMNKSKAYLESKLFDGEYFIQN 601
Query: 705 ----NYDNSDGSSSTSI-------------------------QADQLAGQWYARACGLLP 735
D D + S +D + G W CG+
Sbjct: 602 IQWTGLDAPDPVDAQSFVTHYTPEALKILQEEGPKYQYGKGCLSDGILGSWMTLVCGMPE 661
Query: 736 IADEAKVKKALTKIYDFNVLK-------------VKGGMCGAMNGMQPD-GRIDMSGLQA 781
+ D KVK L ++ +N K G G + P G++ + + +
Sbjct: 662 VVDRLKVKSHLISVHKYNFKKSLSNHVNPQRSTFALGEDGGLLLCTWPKGGKLKLPFVYS 721
Query: 782 REIWPGVTYGLAASMIQEEMVDMAFQTAAGVYEVAWSEDGLGYSFQTPESWNNNDEYRSL 841
E+W G+ Y +AA ++ E V+ + V D + P N+
Sbjct: 722 NEVWTGIEYQVAAHLMFEGEVEKGLEI------VRTCRDRYNGRVRNP----FNEYECGA 771
Query: 842 CYMRPLTIWAMQWALTKPKLSRQE----IKHEISDRDSSYLEQHAAFSKVASLLKLPKEE 897
Y R + +AM LT + + I I D +S+L F V P +
Sbjct: 772 WYARAMASYAMLEGLTGIRYDAVDKILYIDSRIGDDFTSFLSTETGFGNVGLKEGKPFID 831
Query: 898 ASKGFLKV 905
G + V
Sbjct: 832 VKYGVIDV 839
>gi|332981217|ref|YP_004462658.1| hypothetical protein Mahau_0633 [Mahella australiensis 50-1 BON]
gi|332698895|gb|AEE95836.1| protein of unknown function DUF608 [Mahella australiensis 50-1 BON]
Length = 880
Score = 64.3 bits (155), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 172/815 (21%), Positives = 305/815 (37%), Gaps = 180/815 (22%)
Query: 89 IPLGGIGAGSIGRSYRGEFQRFKLFHGICDDAPVLANQFSVFVSRPNGEKFSSVLC---- 144
PLGGIG G IG + G +++ + + + F+V V + N + +L
Sbjct: 18 FPLGGIGTGCIGLAGNGRLIDWEIQNRPNKGSYNGYSHFAVKVEKENEVIDTRILNGDLH 77
Query: 145 -PRSPGV-----------PKKNTDSGIESW-DWNLKGENCTYHALFPRAWTVYDGEPDPE 191
P S P + SG+ + D KGE FP A + + P
Sbjct: 78 PPYSGNYAASHFNSFGFGPPRAYLSGVPHFRDVEFKGE-------FPIAELSFIDKTFPG 130
Query: 192 LRIVCRQISPFIPHNYKESSFPASVFTFTLSNSGQTSADVTLLFTWANSVAGDSGLSGHH 251
+ +PFIP + K+SS PA+ F +++N+ + T++ N G
Sbjct: 131 -NVKMTAFNPFIPLHDKDSSLPAAFFEISITNTTNETLHYTVVSCMGNVNGG-------- 181
Query: 252 FNSKTMTKDGVHGLTLHHRTANGRPP--VTFA--VAAEETADVHVSECPCFLLSGNSKGI 307
++ K G+ + T G P V + A + DV E +G
Sbjct: 182 -YNEYAKKGPAQGIKM---TCIGLTPDDVRYGDLTVATDMDDVSYQEYW-------YRGS 230
Query: 308 TAKDM---WNEIKKHGSF---DHLDNDKTSPSEPGSSIG-AAIAASLTIPSGSTRSVTFS 360
D+ W+++ K G F + D ++ + G+++ + +A ++ +G T++V F
Sbjct: 231 WFDDLEIYWSDLNKLGKFINRSYKDKGYSNGTGSGNAVDMSTLATHFSLDAGQTKNVRFI 290
Query: 361 LAWDCPEVKFFEKVYHRRYTKFYGTLGDSAARIAHDAILEHAKWECEIEAWQRPILEDKR 420
L W P + +S + +W+ + + ED
Sbjct: 291 LTWYYPNCANYWSQ-------------NSQQSCCGCQTSQKGQWK----NYYASLFEDSF 333
Query: 421 FPEWYPITLFNELYYLNAGGTIWTDGSPPMQSLATIRERKFSLDTPSKIHTPASSD-TAL 479
Y + ++ LY + + FS PS I S++ + L
Sbjct: 334 ACAQYCLKNWDRLY----------------EQTLCFKNALFSSTLPSAIIDAVSANLSIL 377
Query: 480 GTRLLENGEENIGQFLYLEGA--EYVMYNTYDVHFYS-SFALVMLFPKLELSIQRDFAAA 536
T E+ G F EG + H ++ ++AL LFPKLE S+ R+
Sbjct: 378 KTPTCLRLED--GSFYGFEGCHCDSGCCEGSCTHVWNYAYALPFLFPKLERSM-RELDYK 434
Query: 537 VMMHDPGTMKIMSDGKWVARKCLGAVPHDIGLDDPWFEINSYNLFNSSRWKDLNSKF--V 594
M + G M + + L P SS + +F V
Sbjct: 435 YNMREDGGM---------------SFRLQLPLGSP----------RSSFRPCADGQFGGV 469
Query: 595 LQVYRDFVATGDKNFARAVWPSVYIAMAYM------EQFDKDGDGMIENEGFPDQTYDAW 648
++ YR++ +GD + +++WP V ++ Y +++D + G++ G T D
Sbjct: 470 IKTYREWKISGDDEWLKSLWPRVKKSIEYAWVDTNEDRWDPEQKGVLT--GRQHHTLDM- 526
Query: 649 SANGVSAYCGGLWVAALQAASALANDVGDHASASYFWVRYQKAKAVYDS-LWNGSYFNY- 706
G +++ G ++AAL+A + +A+ VGD +A + +++ KA DS L+NG Y+ +
Sbjct: 527 ELFGPNSWLTGFYLAALKAGAEMASYVGDEQAAQRYTDIFRRGKAWADSNLFNGEYYYHL 586
Query: 707 -DNSDGS--------------SSTSIQA-------------------DQLAGQWYARACG 732
D D S ++I+A DQ+ QW+A G
Sbjct: 587 IDLQDKSILERFDTDKTGTLIGDSTIKAYWDEERGQIKYQVGEGCIIDQVLAQWHANLIG 646
Query: 733 LLPIADEAKVKKALTKIYDFNVLKVK-----------GGMCGAMNGMQPDGRID--MSGL 779
L I D +KA+ +Y N K++ G + P GR +S
Sbjct: 647 LGEIFDPGNTRKAVYAVYKNNFKKMREVFNPCRVFSLNDESGMIICSYPAGRKKPAISVP 706
Query: 780 QAREIWPGVTYGLAASMIQEEMVDMAFQTAAGVYE 814
A+E G Y A +IQ+ + + + E
Sbjct: 707 YAQETMNGFEYQAAEHLIQKGFIREGEEVVRAIRE 741
>gi|283835266|ref|ZP_06355007.1| conserved hypothetical protein [Citrobacter youngae ATCC 29220]
gi|291068424|gb|EFE06533.1| conserved hypothetical protein [Citrobacter youngae ATCC 29220]
Length = 896
Score = 63.9 bits (154), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 66/268 (24%), Positives = 110/268 (41%), Gaps = 65/268 (24%)
Query: 594 VLQVYRDFVATGDKNFARAVWPSVYIAMAYM------EQFDKDGDGMIENEGFPDQTYDA 647
VL +YRD++ GD+ + W +V + + +Q+D + G++ G T D
Sbjct: 470 VLAIYRDWLLCGDRAWLEEKWSAVRDMLRFAWNPDNPDQWDPEQRGVLT--GRQHHTLDM 527
Query: 648 WSANGVSAYCGGLWVAALQAASALANDVGDHASASYFW--VRYQKAKAVYDSLWNGSYF- 704
G + + GL++AAL+A A+AN +G H + W + + + + L+NG YF
Sbjct: 528 -ELFGPNPWLTGLYIAALEAGCAIANHLG-HNDDAMLWQDIARRGRVTLRERLFNGEYFQ 585
Query: 705 ---NYDNSDGSS---------------------------------STSIQADQLAGQWYA 728
+ N D + + DQL GQW+A
Sbjct: 586 QVIDLANRDTLAPWRTSAESADYGLSGDVWQQYWCEENHEVKYQFGSGCHIDQLMGQWFA 645
Query: 729 RACGLLPIADEAKVKKALTKIYDFNVLKVKGGM--CG--AMNGMQ-------PDGRIDMS 777
CGL P+ D + ++AL ++ +N + + C +N Q P G DM
Sbjct: 646 GLCGLPPVFDPGQTQRALRALWQYNFVSARHHTNPCRLYGLNDEQGVSICSWPTG--DMP 703
Query: 778 GLQ---AREIWPGVTYGLAASMIQEEMV 802
+ A+E+ G Y LAA M+ M
Sbjct: 704 AIPVPYAQEMMTGFEYQLAALMVASGMA 731
Score = 50.8 bits (120), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 73/318 (22%), Positives = 119/318 (37%), Gaps = 48/318 (15%)
Query: 89 IPLGGIGAGSIGRSYRGEFQRFKLF--------HGICDDAPVLANQFSVFVSRPNGEKFS 140
PLGGIG G I + G ++F HG A ++ +R +
Sbjct: 14 FPLGGIGTGCISLAGNGHLTDVEIFNHPDKGYHHGYTHFAVKAERNGALVDARILQGDWQ 73
Query: 141 SVLCPR---SPGVPKKNTDSGIESWDWNLKGENCTYHALFPR-AWTVYDGE--------P 188
C + + K++T G+ G + A P A + +DGE
Sbjct: 74 GNACGQWQIHDRLRKRHTGYGV--------GPDAETLAGMPHFAHSEFDGEFPMAQMSFS 125
Query: 189 DPELRIVCRQIS--PFIPHNYKESSFPASVFTFTLSNSGQTSADVTLLFTWANSVAGDSG 246
DPE R ++ PFIP + SS P + F + N+ + D TL + N +A DS
Sbjct: 126 DPEFPGNIRMLAFNPFIPCDEDNSSLPLAWFELEVENTTDAALDYTLCLSLCNGLAPDST 185
Query: 247 LSGHHFNSKTMTKDGVHGLTLHHRTANGRPPVTFAVAAEETADVHVSECPCFLLSG---N 303
+ H ++ H +TL R P +A T D + G +
Sbjct: 186 HNRHEI------REDRHMMTLTQRHYPADDPRYAELAT--TTDAQDVSWQTYWYRGDWFD 237
Query: 304 SKGITAKDMWNEIKKHGSFDHLDNDKTSPSEPGSSIGAAIAASLTIPSGSTRSVTFSLAW 363
S + +D + + L + + S+PG + A++AA L + +G V F W
Sbjct: 238 SLTMFWRDFCGQPR-------LPEREYTQSQPGFADHASLAAHLHLAAGERGRVRFWTGW 290
Query: 364 DCPEVKFFEKVYHRRYTK 381
CP + + R T+
Sbjct: 291 YCPVTYNYWTDWRDRATQ 308
>gi|402589209|gb|EJW83141.1| hypothetical protein WUBG_05945 [Wuchereria bancrofti]
Length = 90
Score = 63.9 bits (154), Expect = 4e-07, Method: Composition-based stats.
Identities = 35/76 (46%), Positives = 45/76 (59%), Gaps = 5/76 (6%)
Query: 719 ADQLAGQW--YARACGLLPIADEAKVKKALTKIYDFNVLKVKGGMCGAMNGMQPDGRIDM 776
ADQ W +RA LLP++ +K AL +IY N+L GG GA+NGM+ DG +
Sbjct: 8 ADQTYDAWKLLSRARALLPLS---HIKSALNQIYSSNILNFAGGRLGAVNGMRKDGTVGR 64
Query: 777 SGLQAREIWPGVTYGL 792
LQA E+W GVTY L
Sbjct: 65 RHLQADEMWVGVTYAL 80
>gi|374373788|ref|ZP_09631448.1| hypothetical protein NiasoDRAFT_2604 [Niabella soli DSM 19437]
gi|373234761|gb|EHP54554.1| hypothetical protein NiasoDRAFT_2604 [Niabella soli DSM 19437]
Length = 897
Score = 62.8 bits (151), Expect = 9e-07, Method: Compositional matrix adjust.
Identities = 66/252 (26%), Positives = 107/252 (42%), Gaps = 36/252 (14%)
Query: 594 VLQVYRDFVATGDKNFARAVWPSVYIAMAYMEQFDKDGDGMIENEGFP-DQTYDA-WSAN 651
VL YR+ + D F + W + A ++ D++ DGM E P + T DA W +
Sbjct: 504 VLGCYREHQMSADDGFLKRNWDKIKRATQFIINQDRNKDGM---EDTPLENTLDAIW--D 558
Query: 652 GVSAYCGGLWVAALQAASALANDVGDHASASYFWVRYQK-AKAVYDSLWNGSYFNY--DN 708
G A+ GL +AA++A A+A ++ D A A+ QK ++ + D L+NG YF + D
Sbjct: 559 GEIAWLVGLCIAAVKAGQAMAEEMNDTAFAAVCKDYVQKGSRHMSDQLFNGEYFIHRPDP 618
Query: 709 SDG----SSSTSIQADQLAGQWYARACGLLPIADEAKVKKALTKIYDFNVLKVKGGMCGA 764
G S + DQ+ GQ +A GL + D+ + AL ++ +N +K G
Sbjct: 619 VKGRKGLGSYNTCHIDQVLGQSWAFQVGLGRVIDQQQTLSALKALWKYNFIKDMGIYAET 678
Query: 765 MNGMQPDGRIDMSGLQA----------------------REIWPGVTYGLAASMIQEEMV 802
+P SGL E G + +A+ M+ E M
Sbjct: 679 HTDGRPYALYGESGLVMNTNPKNEPKPFGDGAAWQIGYFNECMTGFEHQVASHMMAEGMT 738
Query: 803 DMAFQTAAGVYE 814
D A +++
Sbjct: 739 DEALTITKAIHD 750
Score = 46.2 bits (108), Expect = 0.088, Method: Compositional matrix adjust.
Identities = 47/164 (28%), Positives = 67/164 (40%), Gaps = 23/164 (14%)
Query: 88 GIPLGGIGAGSIGRSYRGEFQRFKLFH----GICD-DAP---VLANQFSVFVSRPNGEKF 139
G+P+GGI AG + S G + +F+ GI D P + + SR G
Sbjct: 69 GMPVGGINAGGMYLSGDGRLWLWDVFNLNQEGINPVDIPWKTSVQGELRKVRSRDGGSYI 128
Query: 140 S----SVLCPRSPGVPKKNTDSGIESW------DWNLKGENCTYHALFPRAWTVYDGEPD 189
+ L P G K G E DW + + A +P A Y +
Sbjct: 129 APPKAKELRPLDQGFALKLEYEGKEQTRQLREEDW----DEVRFEATYPVATITYSSKDI 184
Query: 190 PELRIVCRQISPFIPHNYKESSFPASVFTFTLSNSGQTSADVTL 233
P ++ V + SPFIP + S P SVFT T+ N+G+ TL
Sbjct: 185 P-VQAVLQAYSPFIPLDTDRSGLPVSVFTITIKNAGKHPVKATL 227
>gi|89068658|ref|ZP_01156050.1| hypothetical protein OG2516_16434 [Oceanicola granulosus HTCC2516]
gi|89045845|gb|EAR51906.1| hypothetical protein OG2516_16434 [Oceanicola granulosus HTCC2516]
Length = 876
Score = 62.4 bits (150), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 74/308 (24%), Positives = 122/308 (39%), Gaps = 50/308 (16%)
Query: 89 IPLGGIGAGSIGRSYRGEFQRFKLFHGICDDAPVLANQFSVFVSRP-------------- 134
PLGGIGAG++G G Q +++F+ A N FS F R
Sbjct: 19 FPLGGIGAGTVGIGADGRLQDWEIFN---RPAKGSVNGFSHFAVRAERAGEVVDTRVLHG 75
Query: 135 --NGEKFSSVLCPRSPGV---PKKNTDSGIESWDWNLKGENCTYHALFPRAWTVYDGEPD 189
G + S L P G P++ +G S+ + + +P A ++ +P
Sbjct: 76 PFTGNRTGSFLAPEFNGYGFGPRRQQMTGFPSF------RSSSLAGPYPAADLTFE-DPA 128
Query: 190 PELRIVCRQISPFIPHNYKESSFPASVFTFTLSNSGQTSADVTLLFTWANSVAGDSGLSG 249
+ R +SPF+P ++SS P ++F T +N G T D TL A G +
Sbjct: 129 FPGAVDMRALSPFVPLEDRDSSMPVAMFEVTFNNPGDTPLDYTLFGCLAFDFVDLEGNA- 187
Query: 250 HHFNSKTMTKDGVHGLTLHHRTANGRPPVTFAVAAEETADVHVSECPCFLLSGNSKGITA 309
+ T + + G T ++ A ++T+ H L GN T
Sbjct: 188 -EVDIATGDRTILTGRTTLGEDHPDHRQLSLATDHDDTSHQH------HLYRGNWFD-TM 239
Query: 310 KDMWNEIKKHGSFDHLDNDKTSPSEPGSSIG--------AAIAASLTIPSGSTRSVTFSL 361
+ W +I + G D+ PS S + +AA +T+P+G +R+V +L
Sbjct: 240 EVYWADIARGGRL----RDRKYPSAKSSVANNQADAIEHSLLAAHVTVPAGESRTVRMAL 295
Query: 362 AWDCPEVK 369
+W P +
Sbjct: 296 SWYVPNFR 303
Score = 55.5 bits (132), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 51/212 (24%), Positives = 94/212 (44%), Gaps = 46/212 (21%)
Query: 594 VLQVYRDFVATGDKNFARAVWPSVYIAMAYM------EQFDKDGDGMIENEGFPDQTYDA 647
V++ YRD+ +GD + R++WPS+ A+ + +++D D G++ G T D
Sbjct: 474 VMKAYRDWKLSGDDAWLRSLWPSIKSAIEFAWHPDNPDRWDPDQTGVL--SGRQHHTLDM 531
Query: 648 WSANGVSAYCGGLWVAALQAASALANDVGDHASASYFWVRYQKAKA-VYDSLWNGSYF-- 704
G +++ G ++ AL A + +A VG+ +A+ + ++K KA V + L+NG +F
Sbjct: 532 -ELFGPNSWLAGFYLGALSAGAEMAEFVGEADTAAQYRALFEKGKAWVGEELFNGEWFAQ 590
Query: 705 NYDNSD----------GSSSTSIQA------------------------DQLAGQWYARA 730
D +D G ++ I DQ+ +W+AR
Sbjct: 591 KIDIADRDTLAPFQEPGGDASHISGSIFEQYWSEEHGELKYQIGAGCAIDQVLAEWHARL 650
Query: 731 CGLLPIADEAKVKKALTKIYDFNVLKVKGGMC 762
GL + D A+ IY+ N ++ G +
Sbjct: 651 YGLDHLYDPNDFATAVRAIYEHNFVERLGDLA 682
>gi|224025536|ref|ZP_03643902.1| hypothetical protein BACCOPRO_02276 [Bacteroides coprophilus DSM
18228]
gi|224018772|gb|EEF76770.1| hypothetical protein BACCOPRO_02276 [Bacteroides coprophilus DSM
18228]
Length = 846
Score = 62.0 bits (149), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 59/253 (23%), Positives = 102/253 (40%), Gaps = 37/253 (14%)
Query: 594 VLQVYRDFVATGDKNFARAVWPSVYIAMAYM---EQFDKDGDGMIENEGFPDQTYDAWSA 650
+++ YRD+ +GD F + W V ++Y +D + DG++E G T D +
Sbjct: 484 IMKFYRDWQLSGDSRFLKENWAQVKKVLSYAWTEHGWDDNQDGVME--GSQHNTMDV-NY 540
Query: 651 NGVSAYCGGLWVAALQAASALANDVGDHASASYFWVRYQKAKAVYDS-LWNGSYFNYDNS 709
G + G ++ AL+AA +A+ + D A ++ D+ L+NG Y+ + +
Sbjct: 541 FGPNPQMGFWYLGALKAAEKMAHAMKDRAFEKKCRALFENGSEWLDNNLFNGEYYEHKIT 600
Query: 710 DGSS------------------STSIQADQLAGQWYARACGLLPIADEAKVKKALTKIYD 751
D + DQL GQ+ A CGL +A + ++ L I
Sbjct: 601 DPETFEFLDMNDPDTKIPDFQLGKGCLVDQLVGQYMAHICGLGYLAKKEHIRTTLKSIMR 660
Query: 752 FNVLK------------VKGGMCGAMNGMQPDGRIDMSGLQAREIWPGVTYGLAASMIQE 799
+N + V G G + P GR+++ E+ G Y A MI E
Sbjct: 661 YNFKEDFSRHFNNMRSYVLGDEAGLLMASWPKGRLEVPFPYFAEVMTGFEYCAATGMIYE 720
Query: 800 EMVDMAFQTAAGV 812
M + A + +
Sbjct: 721 GMEEDALKCIRAI 733
Score = 55.5 bits (132), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 69/334 (20%), Positives = 129/334 (38%), Gaps = 51/334 (15%)
Query: 65 QEEAKGKVPVFNVFRKHHITSDQGIPLGGIGAGSIGRSYRGEFQRFKLFH---------G 115
QE G PV + + H+ +PLGGIG G++ RGE + +++ +
Sbjct: 18 QEWKAGDWPVLRHYDQQHLYQ-IALPLGGIGTGTVSLGGRGELRDWEIMNVPGKKYSTVT 76
Query: 116 ICDDAPVLANQFSVFVSRPNGEKFSSVLCPRSPGVPKKNTDSGIESWDWNLKG----ENC 171
++AP F+++ GE +++L G E N G +N
Sbjct: 77 TGNNAPF----FAIYTKPEKGEAQTTLLA----GALYDQEYLHYEGRPVNHHGLPRFKNS 128
Query: 172 TYHALFPRAWTVYDGEPDPELRIVCRQISPFIPHNYKESSFPASVFTFTLSNSGQTSADV 231
++ A +P V+ + + + + +P +P + + S P +V + ++N+ QT DV
Sbjct: 129 SFDAAYPFG-QVHLSDKGIPVEVTVKGFNPLVPGDAEASGIPIAVLCYEVTNTSQTPVDV 187
Query: 232 TLLFTWANSVAGDSGLSGHHFNSK---TMTKDGVHGLTLHHRTANGRPPVTFAVAAEETA 288
++ + N + D G F + G +++A G + F E
Sbjct: 188 SICGSIRNFIGKD----GSKFVTDWKGDYIPVGCKDNRNIYKSAQGLQGIYFCSEGVEKT 243
Query: 289 DVHVSECPCFLLSGNSKGITAKD-------------MWNEIKKHGSFDHLDNDKTSPSEP 335
D L++ +G+T + W++ G + K ++P
Sbjct: 244 DPAYGNMT--LVTNAPEGVTYRTSSKADDWSNGILGFWDDFSADGELTEM--SKQYDADP 299
Query: 336 GSSIGAAIAASLTIPSGSTRSVTFSLAWDCPEVK 369
A++A + G TRS TF L W+ P K
Sbjct: 300 ----MASLAVKKKLAPGETRSFTFFLTWNFPNRK 329
>gi|340616465|ref|YP_004734918.1| hypothetical protein zobellia_454 [Zobellia galactanivorans]
gi|339731262|emb|CAZ94527.1| Conserved hypothetical protein [Zobellia galactanivorans]
Length = 923
Score = 61.6 bits (148), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 154/715 (21%), Positives = 259/715 (36%), Gaps = 196/715 (27%)
Query: 177 FPRAWTVYDGEPDP-ELRIVCRQISPFIPHNYKESSFPASVFTFTLSNSGQTSADVTLL- 234
+P A Y + D + I SPF+P + + S+ P +V + ++N+ + DV+++
Sbjct: 165 YPIAQVNYRKKSDRLPIEISSEVFSPFVPLSVRSSANPVTVVKYKITNTSKDKIDVSIIG 224
Query: 235 ------------FTWANSVAGDSGLSGHHFNSKTMTKDGVHGLTLHHRTANGRPPVTFAV 282
N+ G+SG F + M+ D V L H + F++
Sbjct: 225 ELKNIAFPQPSKVKQVNTALRSKGVSGISFEMQPMSNDSV--LAKHPQLGG------FSL 276
Query: 283 AA-EETADVHVSECPCFLLSGNSKGITAKDMWNEIKKHGSFDHLDNDKTSPSEPGSSIGA 341
+ + AD+ V+ + + K S I
Sbjct: 277 SVLNQKADLLVNA-----------------------------NEKDSKKKRSSSEDPISG 307
Query: 342 AIAASLTIPSGSTRSVTFSLAWDCPEVKFFEKVYHRRYTKFYGTLGDSAARIAHDAILEH 401
++ +++++ SG T+ +TF ++W P ++E +RY DA+ E
Sbjct: 308 SVGSTVSLRSGQTKEITFLVSWYFP--NYYEN--GKRY---------------KDAMAEA 348
Query: 402 AKWECEI-EAWQRPILEDKRFPEWYPITLFNELYYLNAGGTIWTDGSPPMQSLATIRERK 460
W I W + + Y + FN LY A ++ D T +
Sbjct: 349 PGWVGHIYNNWYANSFDVTK----YVASNFNTLY---ADTKLFRD---------TYNDNT 392
Query: 461 FSLDTPSKIHTPASSDTALGTRLLENGE----ENIGQFLYLEGAEYVMYNTYDVHFYSSF 516
++I P S+ A + +NG E IG + +G +YN F +
Sbjct: 393 LPYWLANRITMPVSTLAAGNIAIWKNGRMYGYEGIG---FCQGTCGHVYN-----FVA-- 442
Query: 517 ALVMLFPKLELSIQ--RDFAAAVMM--HDPGTMKIMSDGKWVARKCLGAVPHDIGLDDPW 572
A+ LFP+LE S++ +DF + + P + +I G V + VPH
Sbjct: 443 AISKLFPELERSVRLLQDFNSTEKYSGYSP-SGRINFRGYGVNKP---EVPH-------- 490
Query: 573 FEINSYNLFNSSRWKDLNSKFVLQVYRDFVATGDKNFARAVWPSVYIAMAYMEQFDKDG- 631
S D S +VL+ YR+ + + + F VW V +A+ Y KDG
Sbjct: 491 -----------SYASDAQSGYVLKAYREHLNSPNNEFLDTVWEKVKMAIGY--HIFKDGA 537
Query: 632 ------DGMIENEGFPDQTYDAWSANGVSAYCGGLWVAALQAASALANDVGDHASASYFW 685
+G++E E QT+ G + Y L++AAL+AA +A G+ AS +
Sbjct: 538 EIGLEPNGVLEGE----QTFWDPMWYGPNPYNNTLYLAALRAAEEMAKVKGEPELASRYH 593
Query: 686 VRYQKA-KAVYDSLWNGSY-----------------------------------FNYDNS 709
++ K + + +WNG + FN S
Sbjct: 594 KLFESGQKFMNEEMWNGEFYIHLYPVGLNGNVGIKNGFKLPSEIDDNAQSYIKAFNEGRS 653
Query: 710 DGSSSTSIQADQLAGQWYARACGLLPIADEAKVKKA------------LTKIYDFN---- 753
T+ A QL GQ +A GL I K + A + IYDF
Sbjct: 654 GYFPGTACDAQQLFGQNWANQLGLGYILSPEKSEIAAKSIFKYNWTPDIATIYDFEKPNA 713
Query: 754 -VLKVKGGMCGAMNGMQPDGRIDMSGLQAR-EIWPGVTYGLAASMIQEEMVDMAF 806
VL KG G + + +W G+ Y A MI E +++ A
Sbjct: 714 RVLAAKGEAAMINGGWTKEKPKSFENTHDKTNVWTGLEYEAACDMINEGLLEEAI 768
>gi|312086024|ref|XP_003144914.1| hypothetical protein LOAG_09338 [Loa loa]
gi|307759923|gb|EFO19157.1| hypothetical protein LOAG_09338, partial [Loa loa]
Length = 84
Score = 60.8 bits (146), Expect = 3e-06, Method: Composition-based stats.
Identities = 29/76 (38%), Positives = 42/76 (55%)
Query: 632 DGMIENEGFPDQTYDAWSANGVSAYCGGLWVAALQAASALANDVGDHASASYFWVRYQKA 691
DGMIEN G DQTYD NGVS YCG LW+ L A + A+++ D S + + +A
Sbjct: 9 DGMIENSGTADQTYDTLKMNGVSTYCGSLWLTLLHAEVSFASEIDDIRSGNLYTGFLLRA 68
Query: 692 KAVYDSLWNGSYFNYD 707
+++S + Y +
Sbjct: 69 GDIFESEYYNIYLQME 84
>gi|410096859|ref|ZP_11291844.1| hypothetical protein HMPREF1076_01022 [Parabacteroides goldsteinii
CL02T12C30]
gi|409225476|gb|EKN18395.1| hypothetical protein HMPREF1076_01022 [Parabacteroides goldsteinii
CL02T12C30]
Length = 847
Score = 60.1 bits (144), Expect = 5e-06, Method: Compositional matrix adjust.
Identities = 68/320 (21%), Positives = 130/320 (40%), Gaps = 49/320 (15%)
Query: 594 VLQVYRDFVATGDKNFARAVWPSVYIAMAYM---EQFDKDGDGMIENEGFPDQTYDAWSA 650
++++YR++ +GD +F + W + ++Y + +D + DG++E G T D +
Sbjct: 486 IMKIYREWQLSGDNDFLKNNWGQIKKVLSYAWTEKGWDGNQDGVME--GSQHNTMDV-NY 542
Query: 651 NGVSAYCGGLWVAALQAASALANDVGDHASASYFWVRYQKAKAVYD-SLWNGSYFNYDNS 709
G + G ++ AL+AA +A + D A +++ D +L+NG Y+ + +
Sbjct: 543 FGPNPQMGFWYMGALKAAEKMAIAMKDKGFAQKCRTLFEQGSNWMDKNLFNGEYYEHKIT 602
Query: 710 DGSSSTSIQ------------------ADQLAGQWYARACGLLPIADEAKVKKALTKIYD 751
D + + DQL GQ+ + CGL + ++ ++ + I
Sbjct: 603 DPETFEYLDMNNPNVKIPSFQLGPGCLVDQLVGQYMSHLCGLGYLGNKDHIQTTMNSIMK 662
Query: 752 FNVLK------------VKGGMCGAMNGMQPDGRIDMSGLQAREIWPGVTYGLAASMIQE 799
+N + V G G + P GR+++ E+ G Y A MI E
Sbjct: 663 YNYVSDFSRHFNNMRSYVMGNESGLLMASWPKGRLEVPFPYFAEVMTGFEYCAAVGMIYE 722
Query: 800 EMVDMAFQTAAGVYEVAWSEDGLGYS-FQTPESWNNNDEYRSLCYMRPLTIWAMQWALTK 858
M A + + DG + F PE ++ Y R + W+ AL+
Sbjct: 723 GMEKEALTCIRSIRD---RHDGAKRNPFSEPECGHH--------YARSMASWSAIIALSD 771
Query: 859 PKLSRQEIKHEISDRDSSYL 878
+ S + +I+ R +Y
Sbjct: 772 FQYSGIDKSMKITARPGNYF 791
Score = 54.3 bits (129), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 68/346 (19%), Positives = 135/346 (39%), Gaps = 44/346 (12%)
Query: 63 SKQEEAKGKVPVFNVFRKHHITSDQGIPLGGIGAGSIGRSYRGEFQRFKLFHGICDDAPV 122
+ QE + PV + + H+ +PLGGIG G++ RGE + +++ + V
Sbjct: 18 ASQEWKPQEWPVLKNYDQEHLYQ-IALPLGGIGTGTVSLGGRGELRDWEIMN-------V 69
Query: 123 LANQFSVFVSRPNGEKFSSVLCPRSPGVPKKNTDSGIESWDW-NLKGENCTYHALFPRAW 181
A ++S + N F+ P++ + ++ + +G +H L A
Sbjct: 70 PAKKYSTVTTGNNAPFFAIYAKPQNQEATTTLLAGPLYPQEYLHYEGRPVNHHGLPRFAQ 129
Query: 182 TVYDG----------EPDPELRIVCRQISPFIPHNYKESSFPASVFTFTLSNSGQTSADV 231
+D + D +++ + +P IP + + S P +V ++ ++N ++
Sbjct: 130 ASFDAAYPFGQVHLSDKDLPVKVTIKGFNPLIPGDAEASGLPVAVLSYEVTNITSQPMEI 189
Query: 232 TLLFTWANSVA-----------GDSGLSGHHFN-SKTMTKDGVHGLTLHHRTANGRPPVT 279
+ + N + GD +G N +K + G+ G+ L+ + P
Sbjct: 190 AICGSMRNFIGKDGSKFRTDWKGDYIPTGVKDNKNKYVENKGIKGIYLYSDGVDKNDPAW 249
Query: 280 FAVAAEETADVHVSECPCFLLSGNSKGITAKDMWNEIKKHGSFDHLDNDKTSPSEPGSSI 339
VA A VS + GI + W++ G + +K +P
Sbjct: 250 GTVALTTQATSGVSYRTSSKSDNWNNGIL--NFWDDFSADGML--TERNKQEDEDP---- 301
Query: 340 GAAIAASLTIPSGSTRSVTFSLAWDCPEVKFFEKV-----YHRRYT 380
A+++ T+ ST + TF + W+ P K + Y R+YT
Sbjct: 302 MASLSVKKTVKPQSTETFTFYITWNFPNRKAWSSTVVGNYYSRQYT 347
>gi|402583425|gb|EJW77369.1| hypothetical protein WUBG_11725, partial [Wuchereria bancrofti]
Length = 109
Score = 59.7 bits (143), Expect = 8e-06, Method: Composition-based stats.
Identities = 29/60 (48%), Positives = 38/60 (63%), Gaps = 3/60 (5%)
Query: 733 LLPIADEAKVKKALTKIYDFNVLKVKGGMCGAMNGMQPDGRIDMSGLQAREIWPGVTYGL 792
LLP++ +K AL +IY N+L GG GA+NGM+ DG + LQA E+W GVTY L
Sbjct: 1 LLPLS---HIKSALNQIYSSNILNFAGGRLGAVNGMRKDGTVGRRHLQADEMWVGVTYAL 57
>gi|333382634|ref|ZP_08474302.1| hypothetical protein HMPREF9455_02468 [Dysgonomonas gadei ATCC
BAA-286]
gi|332828576|gb|EGK01276.1| hypothetical protein HMPREF9455_02468 [Dysgonomonas gadei ATCC
BAA-286]
Length = 859
Score = 58.9 bits (141), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 58/270 (21%), Positives = 111/270 (41%), Gaps = 69/270 (25%)
Query: 594 VLQVYRDFVATGDKNFARAVWPSVYIAMAY-MEQFDKDGDGMIENEGFPDQTYDAWSANG 652
++++YR++ +GD N+ + ++P V +M Y ++ +D G IE + W A+G
Sbjct: 495 IMKIYREWRISGDNNWLKEIYPFVKASMDYCIKTWDSREVGAIEEPHHNTYDIEFWGADG 554
Query: 653 VSAYCGGLWVAALQAASALANDVGDHASASYFWVRYQ----KAKAVYDS-LWNGSYF--- 704
+ + ALQA + +G+ S RY+ K+K ++ L++G YF
Sbjct: 555 MIT---SFYTGALQAMCLMGEYLGEDIS------RYEALMDKSKIYMETKLFDGEYFIQN 605
Query: 705 -NYDNSDGSSSTSIQA----------------------------DQLAGQWYARACGLLP 735
+ + T +QA D + G W + CG+
Sbjct: 606 IQWKGLNADDPTKVQAFNSGYSDEALKILEAEGPKYQYGKGCLSDGILGSWMSLVCGMPE 665
Query: 736 IADEAKVKKALTKIYDFNV----------------LKVKGG--MCGAMNGMQPDGRIDMS 777
I D+ KV L ++ +N+ L +GG +C G G++ +
Sbjct: 666 IIDKTKVTSHLNAVHKYNLKHDLREHANPQRPTFALGNEGGLLLCSWPKG----GKLQLP 721
Query: 778 GLQAREIWPGVTYGLAASMIQEEMVDMAFQ 807
+ + E+W G+ Y +AA ++ E V+ +
Sbjct: 722 FVYSDEVWTGIEYQVAAHLMFEGEVEKGLE 751
>gi|326798039|ref|YP_004315858.1| hypothetical protein [Sphingobacterium sp. 21]
gi|326548803|gb|ADZ77188.1| hypothetical protein Sph21_0609 [Sphingobacterium sp. 21]
Length = 877
Score = 58.2 bits (139), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 54/265 (20%), Positives = 104/265 (39%), Gaps = 59/265 (22%)
Query: 594 VLQVYRDFVATGDKNFARAVWPSVYIAMAY-MEQFDKDGDGMIENEGFPDQTYDAWSANG 652
+++VYRD+ +GD + + ++P V ++ Y ++ +D +G G++E + W NG
Sbjct: 517 IMKVYRDWRISGDDEWMKKLYPMVQKSLDYCIQTWDPNGKGILEEPHHNTYDIEFWGPNG 576
Query: 653 VSAYCGGLWVAALQAASALANDVGDHASASYFWVRYQKAKA-VYDSLWNGSYFNY----- 706
+ C ++ AL+A + + + Y +A + D L+NG YF
Sbjct: 577 M---CTSFYLGALKALIEMGRY--QRKNMKRYETLYDSGRAYMEDQLYNGEYFIQKIQWT 631
Query: 707 -----DNSDGSS----------------------STSIQADQLAGQWYARACGLLPIADE 739
D + S T +D + G W A CGL D+
Sbjct: 632 GLKAPDPTKAQSFHTQYTPEAIEILKKEGPKYQYGTGCLSDGVIGAWMAAMCGLENPLDQ 691
Query: 740 AKVKKALTKIYDFNVLKVKGGMCGAMNGMQPD-----------------GRIDMSGLQAR 782
KV L ++ +N +K + +N +P G + + + +
Sbjct: 692 EKVNSHLRAVHQYN---LKNDLTDHINPQRPTYAMGHEGGLLLCTWPKGGMLSLPFVYSN 748
Query: 783 EIWPGVTYGLAASMIQEEMVDMAFQ 807
E+W G+ Y +A+ ++Q D +
Sbjct: 749 EVWTGIEYQVASHLMQIGETDKGLE 773
Score = 45.1 bits (105), Expect = 0.21, Method: Compositional matrix adjust.
Identities = 73/329 (22%), Positives = 116/329 (35%), Gaps = 65/329 (19%)
Query: 61 RYSKQEEAKGKVPVFNVFRKHHITSDQGIPLGGIGAGSIGRSYRGEFQRFKLFHGICDDA 120
+++K+E + VFN H P+GGIGAG G + H
Sbjct: 46 QFTKEEHDR----VFNGPYVGHHLDKIAFPIGGIGAGMFCLEGSGAISHVSIRH-----K 96
Query: 121 PVLANQFSVFVSRPNGEKFSSVLCPRSPGVPKKNTDSGIESWDWNLKG------------ 168
P L ++ ++F + + +S P VP DW L G
Sbjct: 97 PELFHEPTIFGAVAIKGRANSAKVLEGP-VP-----------DWKLFGYKDYGGGSGGTT 144
Query: 169 ------ENCTYHALFPRAWTVYDGEPDPELRIVCRQISPFIPHNYKESSFPASVFTFTLS 222
E + A FP A + P L++ SPF+P + SS P T+
Sbjct: 145 YGLPHFEQAEFLARFPFAEIQLRDDKFP-LQVRLEAWSPFVPSDADNSSLPVGALTYRFI 203
Query: 223 NSGQTSADVTLLFTWANSVAGDSGLSGHHFNSKTMTKDGVHGLTLHHRTANGRPPVT--F 280
N+ Q + + F N V G + K +G L G+ + F
Sbjct: 204 NTSQENLEAVFSFNSRNFVVQPEG--------EGSIKKIQNGFLLSQAKTTGKNNLATDF 255
Query: 281 AVAAEETADVHVSECPCFLLSGNSKGITAKDMWNEIKKHG--SFDHLDNDKTSPSEPGSS 338
AV A+ +V V C+ G +T +WN IK + D +D D +
Sbjct: 256 AVFADTGDEVIVDY--CWFRGGWFDPLTM--VWNTIKDANVKAVDPIDKD---------A 302
Query: 339 IGAAIAASLTIPSGSTRSVTFSLAWDCPE 367
GA++ + + + + +AW PE
Sbjct: 303 PGASLFIPFQLGANEEKEIRILMAWYVPE 331
>gi|405382777|ref|ZP_11036555.1| putative bile acid beta-glucosidase [Rhizobium sp. CF142]
gi|397320811|gb|EJJ25241.1| putative bile acid beta-glucosidase [Rhizobium sp. CF142]
Length = 863
Score = 58.2 bits (139), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 69/276 (25%), Positives = 112/276 (40%), Gaps = 61/276 (22%)
Query: 594 VLQVYRDFVATGDKNFARAVWPSVYIAMAYM------EQFDKDGDGMIENEGFPDQTYDA 647
+++ YRD+ GD ++ R WP+V A+ Y +++D D G++ G QT D
Sbjct: 475 IIKTYRDWKLCGDTDWLRRYWPNVKRAIEYAWSSENPDRWDPDQTGIL--SGRQHQTLDM 532
Query: 648 WSANGVSAYCGGLWVAALQAASALANDVGDHASASYFWVRYQKAKAVY--DSLWNGSYF- 704
G +++ ++VAAL S +A +GD S R KA A Y L+NG +F
Sbjct: 533 -ELFGPNSWLASMYVAALLGVSEMAAVLGD-TELSEKTARMGKAGAAYINSELFNGRWFI 590
Query: 705 -NYDNSD---------GSSSTSIQ------------------------ADQLAGQWYARA 730
D SD G ++ + +DQ+ GQW+A
Sbjct: 591 QKVDLSDKGVLIPFDVGRAAGVLADGFMETYWSEEFQELKYQMGEGCISDQILGQWHAEV 650
Query: 731 CGLLPIADEAKVKKALTKIYDFN---VLKVKGGMC---------GAMNGMQPDG-RIDM- 776
G+ D KV+ AL ++ N L+ C G + P+G R M
Sbjct: 651 AGIGAFLDADKVRTALKSVHRNNFRPTLEDHFNPCRNYAYEDEAGLLIATYPEGIRQPMV 710
Query: 777 SGLQAREIWPGVTYGLAASMIQEEMVDMAFQTAAGV 812
+ A E+W G+ Y A+ +I +++ G
Sbjct: 711 AAPYAEEVWTGIEYMAASHLIMHGLIEEGMDIVCGA 746
Score = 47.8 bits (112), Expect = 0.030, Method: Compositional matrix adjust.
Identities = 55/209 (26%), Positives = 87/209 (41%), Gaps = 39/209 (18%)
Query: 173 YHALFPRAWTVYDGEPDP-ELRIVCRQISPFIPHNYKESSFPASVFTFTLSNSGQTSADV 231
++ FP A V+ + P ++R+ +SPFIPHN ++SS P ++F F + N +AD
Sbjct: 119 FYGRFPIADLVFKDQRFPGDIRLTA--LSPFIPHNDRDSSMPVAMFEFEIVND---TAD- 172
Query: 232 TLLFTWANSVAGDSGLSGHHFNSKTMTKDGVHGLTLHHRT-------ANGRPPVTFAVAA 284
L +T A ++ SG H T ++G G++ H T A R +T A A
Sbjct: 173 ELTYTLAGTLGNYGSNSGIH----TFRQEG--GISSLHLTSSDTTLPATQRGDLTIATDA 226
Query: 285 EET--ADVHVSECPCFLLSGNSKGITAKDM---WNEIKKHGSF--DHLDNDKTSPSEPGS 337
++ D H +G D+ W E + G H D +TS
Sbjct: 227 DDVDHTDHHY------------RGQWFDDLSVYWKEFARPGRLPKRHYDEPRTSRHMSLQ 274
Query: 338 SIGAAIAASLTIPSGSTRSVTFSLAWDCP 366
+ A + G R V F + W+ P
Sbjct: 275 PEHGTLGARFAVAPGERRKVRFVITWNFP 303
>gi|156383620|ref|XP_001632931.1| predicted protein [Nematostella vectensis]
gi|156219994|gb|EDO40868.1| predicted protein [Nematostella vectensis]
Length = 1090
Score = 57.4 bits (137), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 48/164 (29%), Positives = 77/164 (46%), Gaps = 12/164 (7%)
Query: 594 VLQVYRDFVATGDKNFARAVWPSV-YIAMAYMEQFDKDGDGMIENEGFPDQTYDAWSANG 652
VL+ YR+ K + +WP V I +M D +G G+I G TYD + G
Sbjct: 640 VLKTYREVRQGAPKTWFNKMWPQVKRIMKRWMTDLD-NGTGVIP--GPQPNTYDC-TLYG 695
Query: 653 VSAYCGGLWVAALQAASALANDVGDHASASYFWVRYQKAKAVYD-SLWNGSYFNYDNSDG 711
V+ Y G ++A L+AA +A + A+ + R++ A D +L+NGS++ D
Sbjct: 696 VNVYISGYYLAGLRAAEKMAKIQNEPELAATYHQRFELGSAKIDKTLFNGSWY-IQEVDS 754
Query: 712 SSSTSIQADQ-----LAGQWYARACGLLPIADEAKVKKALTKIY 750
++ D L GQW+A A GL I + ++ L I+
Sbjct: 755 EHPVNVVGDATWVDCLVGQWWAHALGLGYILSQKNIQSTLQNIF 798
>gi|332665583|ref|YP_004448371.1| hypothetical protein [Haliscomenobacter hydrossis DSM 1100]
gi|332334397|gb|AEE51498.1| protein of unknown function DUF608 [Haliscomenobacter hydrossis DSM
1100]
Length = 857
Score = 57.0 bits (136), Expect = 5e-05, Method: Compositional matrix adjust.
Identities = 58/253 (22%), Positives = 104/253 (41%), Gaps = 37/253 (14%)
Query: 594 VLQVYRDFVATGDKNFARAVWPSVYIAMAYM---EQFDKDGDGMIENEGFPDQTYDAWSA 650
V++ YR+++ +GD + + W V AM+Y +D + DG+ E + D +
Sbjct: 491 VMRFYREWILSGDNEWLKKHWARVKAAMSYAWVPGGWDANQDGVQEGKQHNTMDVDYYGP 550
Query: 651 NGVSAYCGGLWVAALQAASALANDVGDHASASYF-WVRYQKAKAVYDSLWNGSYFNYDNS 709
N + + AL+A++A+A + D A V + +K V ++L+NG Y+ + +
Sbjct: 551 NPQMQFW---YFGALKASAAMARAMNDPVFAKQCEQVLAKGSKWVDENLFNGEYYEHKIT 607
Query: 710 D---------GSSSTSIQ---------ADQLAGQWYARACGLLPIADEAKVKKALTKIYD 751
D + ST+I DQ+ GQ+ A CGL +A + ++ AL +
Sbjct: 608 DPKTFEFLDRSNPSTAIPNFQLGEGCLLDQVVGQYMAHVCGLGYLAKKENIQTALQTVMK 667
Query: 752 FNVLK------------VKGGMCGAMNGMQPDGRIDMSGLQAREIWPGVTYGLAASMIQE 799
+N + V G + P GR+ + E G Y A M+ E
Sbjct: 668 YNFVPRFDNVFNNMRSYVMDKESGLIMASWPKGRLKVPFPYFAESMSGFEYAAAIGMLYE 727
Query: 800 EMVDMAFQTAAGV 812
+ Q +
Sbjct: 728 GQTEAGLQCIQSI 740
Score = 51.6 bits (122), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 69/343 (20%), Positives = 132/343 (38%), Gaps = 47/343 (13%)
Query: 51 HLAHIGFRLYRYSKQEEAKGKVPVFNVFRKHHITSDQGIPLGGIGAGSIGRSYRGEFQRF 110
H + F ++ +A+ PV + + H+ + +PLGGIG G++ RGE + +
Sbjct: 2 HQSFYLFAFLLFATAIQAQTSWPVLKHYDQDHLL-NIALPLGGIGTGTVSLGGRGELRDW 60
Query: 111 KL-------FHGICD--DAPVLANQFSVFVSRPNGEKFSS--VLCPRSPGVPKKNTDSGI 159
++ F I D AP F+V+V +P G+ S +L P P + +
Sbjct: 61 EIMNKPGKKFSTITDGNQAPF----FAVYV-KPIGKTAQSKALLGPFHPSEYQHYEGRPV 115
Query: 160 ESWDWNLKGENCTYHALFPRAWTVYDGEPDPELRIVCRQISPFIPHNYKESSFPASVFTF 219
+ ++ A +P + P + + + +P IP + SS P ++ +
Sbjct: 116 NHHGMP-RFSVASFDAAYPFGQVNLSDDVLP-ISVKIKGFNPLIPGDVAASSIPLAILKY 173
Query: 220 TLSNSGQTSADVTLLFTWANSVAGDSG------------LSGHHFNSKTMTKDGVHGLTL 267
++N+ +V + T N + D L + ++ + G+ +
Sbjct: 174 EVTNNTDAPLEVAICGTMRNFIGIDGSKQSKNWKGDLVYLGAKNNQNEYRESPDLKGIFM 233
Query: 268 HHRTANGRPP----VTFAVAAEETADVHVSECPCFLLSGNSKGITAKDMWNEIKKHGSFD 323
+ + + P + + A E S P N + D W++ G
Sbjct: 234 SSQGVDNQDPAWGTIALSTAERELVSFRRSSTP------NDWENSILDFWDDFSADGVLS 287
Query: 324 HLDNDKTSPSEPGSSIGAAIAASLTIPSGSTRSVTFSLAWDCP 366
D DK P A++A TIP+ S+ + TF ++W P
Sbjct: 288 --DKDKLVDDNP----MASLAVKKTIPARSSLTFTFYISWHFP 324
>gi|429739681|ref|ZP_19273428.1| hypothetical protein HMPREF9151_01886 [Prevotella saccharolytica
F0055]
gi|429156127|gb|EKX98766.1| hypothetical protein HMPREF9151_01886 [Prevotella saccharolytica
F0055]
Length = 989
Score = 56.6 bits (135), Expect = 6e-05, Method: Compositional matrix adjust.
Identities = 54/266 (20%), Positives = 105/266 (39%), Gaps = 60/266 (22%)
Query: 594 VLQVYRDFVATGDKNFARAVWPSVYIAMAY-MEQFDKDGDGMIENEGFPDQTYDAWSANG 652
+++ YRD+ +GD + + ++P++ ++ Y + +D G IE + W A+G
Sbjct: 633 IMKAYRDWRISGDTEWLKKLYPAIKASLDYCIRTWDPRQKGCIEEPHHNTYDIEFWGADG 692
Query: 653 VSAYCGGLWVAALQAASALANDVGDHASASYFWVRYQKAKAVYDSLWNGSYF-------- 704
+ C +V AL+A ++ D S Y + + + + L+NG Y+
Sbjct: 693 M---CTSFYVGALEAFIRMS-DALHRPSTPYRKLLKKSVELMNTQLFNGEYYIQKTQWTG 748
Query: 705 --------------NYDNSDG-----------SSSTSIQADQLAGQWYARACGLLPIADE 739
NY N + T +D + G W A CGL ++
Sbjct: 749 LRAKSPTEVLSFGSNYSNPEALQLLQAEGPKYQYGTGCLSDGVIGMWMASVCGLPEPLNQ 808
Query: 740 AKVKKALTKIYDFNV----------------LKVKGG--MCGAMNGMQPDGRIDMSGLQA 781
A+ L +Y +NV L +GG +C G +P + + +
Sbjct: 809 AQTLSHLNAVYKYNVREDLYRHYNTQRPSYALGHEGGLLLCTWPKGKKP----TLPFVYS 864
Query: 782 REIWPGVTYGLAASMIQEEMVDMAFQ 807
E+W G+ Y +A+ ++ + +D +
Sbjct: 865 NEVWTGIEYQVASHLLMKGEIDKGLK 890
>gi|282877348|ref|ZP_06286172.1| conserved hypothetical protein [Prevotella buccalis ATCC 35310]
gi|281300533|gb|EFA92878.1| conserved hypothetical protein [Prevotella buccalis ATCC 35310]
Length = 997
Score = 56.6 bits (135), Expect = 7e-05, Method: Compositional matrix adjust.
Identities = 51/262 (19%), Positives = 104/262 (39%), Gaps = 52/262 (19%)
Query: 594 VLQVYRDFVATGDKNFARAVWPSVYIAMAY-MEQFDKDGDGMIENEGFPDQTYDAWSANG 652
+++VYRD+ +G+ + ++++P+V ++ Y + +D G + + W A+G
Sbjct: 638 IMKVYRDWRISGNDEWLKSLYPAVKASLDYCIRTWDPHEKGSLTEPHHNTYDIEFWGADG 697
Query: 653 VSAYCGGLWVAALQAASALANDVGDHASASYFWVRYQKAKAVYDSLWNGSYF-------- 704
+ C + AL+A ++ + A Y + + + L+NG Y+
Sbjct: 698 M---CTSFYTGALEAFVQMSEHLNKQA-GKYKKLLKRSISCMDKDLFNGEYYFQKTQWTG 753
Query: 705 --------------NYDNSDGSS-----------STSIQADQLAGQWYARACGLLPIADE 739
NY +++ T +D + G W A CGL ++
Sbjct: 754 LEAKSPTEIPSFGSNYRHAEAQELLKTEGPKYQYDTGCLSDGILGMWMASVCGLPEPLNQ 813
Query: 740 AKVKKALTKIYDFNVLK-------------VKGGMCGAMNGMQPDG-RIDMSGLQAREIW 785
VK L +Y +N+ K GG G + P G + + + + E+W
Sbjct: 814 THVKSHLNAVYQYNLRKNMYNHYNTQRPSYAMGGDGGLLLCTWPKGAKPTLPFVYSNEVW 873
Query: 786 PGVTYGLAASMIQEEMVDMAFQ 807
G+ Y +A+ +I + V+ +
Sbjct: 874 TGIEYQVASHLIMKGEVEKGLE 895
>gi|284041275|ref|YP_003391205.1| hypothetical protein Slin_6449 [Spirosoma linguale DSM 74]
gi|283820568|gb|ADB42406.1| protein of unknown function DUF608 [Spirosoma linguale DSM 74]
Length = 857
Score = 56.6 bits (135), Expect = 7e-05, Method: Compositional matrix adjust.
Identities = 65/256 (25%), Positives = 102/256 (39%), Gaps = 34/256 (13%)
Query: 594 VLQVYRDFVATGDKNFARAVWPSVYIAMAYM---EQFDKDGDGMIENEGFPDQTYDAWSA 650
+++ YRD+ +GD +F R WP V A+A+ +D + DG++E G T D
Sbjct: 484 IMKFYRDWQLSGDTDFLRRYWPQVKQALAFAWVPGGWDANRDGVME--GAQHNTMDV-EY 540
Query: 651 NGVSAYCGGLWVAALQAASALANDVGDHASASYFWVRYQKAKAVYDS-LWNGSYFNYD-- 707
G + ++ AL+AASA+A + D + A +QK D+ L+NG Y+
Sbjct: 541 FGPNPQMQLWYLGALKAASAMATAMNDRSFAQTCDQIFQKGSTWTDANLFNGDYYEQQVI 600
Query: 708 ---------------NSDGSSSTSIQ------ADQLAGQWYARACGLLPIADEAKVKKAL 746
N + + Q DQL GQ+ A CGL + + +K L
Sbjct: 601 SPAGHLIAKGTFSGFNKVEQNDPAYQLAKGCLVDQLVGQFMAHVCGLGYLVNPDHIKTTL 660
Query: 747 TKIYDFNVLKVKGGMCGAMNGMQPDGRIDMSGLQAREIWPGVTYGLAASMIQEEMVDMAF 806
I +N + M N M+ D + L WP + E M +
Sbjct: 661 KSILKYN---YRASMTDHFNNMRSYALGDEASLLMAS-WPKGRPKVPFPYFSEVMTGFEY 716
Query: 807 QTAAGVYEVAWSEDGL 822
A G+ +E GL
Sbjct: 717 TAAIGMLYENQTEPGL 732
>gi|255013072|ref|ZP_05285198.1| hypothetical protein B2_04148 [Bacteroides sp. 2_1_7]
gi|410102535|ref|ZP_11297461.1| hypothetical protein HMPREF0999_01233 [Parabacteroides sp. D25]
gi|409238607|gb|EKN31398.1| hypothetical protein HMPREF0999_01233 [Parabacteroides sp. D25]
Length = 620
Score = 55.1 bits (131), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 41/145 (28%), Positives = 66/145 (45%), Gaps = 8/145 (5%)
Query: 588 DLNSKFVLQVYRDFVATGDKNFARAVWPSVYIAMAYMEQFDKDGDGMIENEGFPDQTYDA 647
D+N+ F+L++ R + +K WP++ AM ++ D D G+ F D
Sbjct: 346 DINAFFILRIARFYRYYHNKQDLINYWPAMKNAMNWLISRDTDKIGLPAQVSFWGDWKDV 405
Query: 648 WSANG--VSAYCGGLWVAALQAASALANDVGDHASASYFWVRYQKAKAVYDS------LW 699
G S + ++++AL+ LA + GD A A+Y+ YQK + LW
Sbjct: 406 KGIEGRKYSPFTCLIYLSALKEMVYLAEECGDQAGANYYQSAYQKGYKTMNKDVREGGLW 465
Query: 700 NGSYFNYDNSDGSSSTSIQADQLAG 724
NG+Y+ DGS + I DQ G
Sbjct: 466 NGNYYCQIWKDGSVNDRILQDQTIG 490
>gi|340616467|ref|YP_004734920.1| hypothetical protein zobellia_456 [Zobellia galactanivorans]
gi|339731264|emb|CAZ94529.1| Conserved hypothetical protein [Zobellia galactanivorans]
Length = 822
Score = 54.7 bits (130), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 82/368 (22%), Positives = 132/368 (35%), Gaps = 75/368 (20%)
Query: 89 IPLGGIGAGSIGRSYRGEFQRFKLFH---GICDDAPVLANQFSVFVSRPNGEKFSSVLCP 145
+PLGG GAGS+ G F + +F+ +D L + F + +G+K +
Sbjct: 94 VPLGGFGAGSVELRADGGFMDWDIFNNSPATGNDKIQLNDAFMGLYIQADGKKAFATTLR 153
Query: 146 RSPGVPKKNTDSGIESWDWNLKGENCTYHALFPRAWTVYDGEPDPELRIVCRQISPFIPH 205
P PKK + Y FP + P L+ S +
Sbjct: 154 THP--PKKLPAV-----------QQLEYSGAFPVSRLNIQDSNIP-LKTKLYAYSEYKIR 199
Query: 206 NYKESSFPASVFTFTLSNSGQTSADVTLLFTWANSVAG----DSGLSGHHFNSKTMTKDG 261
N +S+ P ++F+ L N LF N + G D GL +TK G
Sbjct: 200 NSIQSATPCALFSLELENVTDDEYQAAFLFNLPNHIQGLFNVDQGL--------VLTKKG 251
Query: 262 VH----GLTLHHRTANGRPPVTFAVAAEETADVHVSECPCFLLSGNSKGITAKDMWNEIK 317
+TL ANG VT++VA + +W+E
Sbjct: 252 KEPSSGDMTL---AANGADEVTYSVADD-----------------------LNKLWDEFS 285
Query: 318 KHGSFDHLDNDKTSPSEPGSSIGAAIAASLTIPSGSTRSVTFSLAWDCPEVKFFEKVYHR 377
+G+F +D S AI+A +++ G R++T ++ W PE ++
Sbjct: 286 DNGNFQEDIHD--------SGQYGAISAKVSLAPGEKRTITIAMGWYFPERPISVEIVGN 337
Query: 378 RYTKFYGTLGDSAARIAHDAILEHAK-WECEIEAWQRPILEDKRFPEWYPITLFNELYYL 436
YT + S+ + A+ + W+ IE W D P+W + N +
Sbjct: 338 YYTNLF----RSSTEVTEKALSRLPETWKGIIE-WNETCF-DNSLPDWLQDAMVNSTATI 391
Query: 437 NAGGTIWT 444
G IWT
Sbjct: 392 VKTG-IWT 398
Score = 47.8 bits (112), Expect = 0.033, Method: Compositional matrix adjust.
Identities = 43/166 (25%), Positives = 73/166 (43%), Gaps = 14/166 (8%)
Query: 588 DLNSKFVLQVYRDFVATGDKNFARAVWPSVYIAMAYMEQFDKDGDGMIENEGFPDQ---T 644
D ++ F+L VY + GD+ F +WP V A + Q ++ G PD+ T
Sbjct: 476 DCSTSFILSVYASHIWDGDEEFVNEMWPHVKRAAQW--QINRS-----RKFGLPDRLAAT 528
Query: 645 YD--AWSANGVSAYCGGLWVAALQAASALANDVGDHASASYFWVRYQKAKAVYDS-LWNG 701
YD + + +Y + +AA++AA+ LA + S + A+ + W G
Sbjct: 529 YDLSGFGKKDLVSYNAFMHLAAMKAATELAKEREKEDLVSLYKKNIAAAQNTLKAHFWTG 588
Query: 702 SYF-NYDNSDGSSSTSIQADQLAGQWYARACGLLPIADEAKVKKAL 746
+YF N+ N D + I D GQ ++ GL + + +K L
Sbjct: 589 TYFRNWWNKDEPKNDDIHIDTQFGQVWSYLLGLGDVIEPELMKSHL 634
>gi|229820204|ref|YP_002881730.1| hypothetical protein Bcav_1712 [Beutenbergia cavernae DSM 12333]
gi|229566117|gb|ACQ79968.1| protein of unknown function DUF608 [Beutenbergia cavernae DSM
12333]
Length = 865
Score = 54.7 bits (130), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 45/157 (28%), Positives = 68/157 (43%), Gaps = 16/157 (10%)
Query: 79 RKHHITSDQGIPLGGIGAGSIGRSYRGEFQRFKLFHGICDDAPVLANQFSVFVSRPNGEK 138
R HI +PLGGIG G++ G ++++L + + + F+V VS+
Sbjct: 11 RGRHIA----MPLGGIGTGNVAIGADGGLRQWQLHNIGNHHGDLPGSLFAVRVSQWE-PP 65
Query: 139 FSSVLCPRSPGVPKKNT------DSGIESWDWNLKGEN----CTYHALFPRAWTVYDGEP 188
V ++P P NT D + W L + T A +P A YD
Sbjct: 66 LDEVRVLQAPAAPAGNTRTPMVDDDHVPDWQRELAAAHGVTATTISATYPVARVAYDLGE 125
Query: 189 DPELRIVCRQISPFIPHNYKESSFPASVFTFTLSNSG 225
P L I +P +P + SS P ++FTFTL N+G
Sbjct: 126 VP-LAIELEATTPLVPLDEAASSLPVAMFTFTLRNAG 161
Score = 54.3 bits (129), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 79/330 (23%), Positives = 130/330 (39%), Gaps = 68/330 (20%)
Query: 517 ALVMLFPKLELSIQRDFAAAVMMHDPGTMKIMSDGKWVARKCLGAVPHDIGLDDPWFEIN 576
AL LFP LE S++ + + + T GAVPH + L P +
Sbjct: 444 ALARLFPGLEASMRATELDVMQVPERHTHA-------------GAVPHRVLL--PRYVPQ 488
Query: 577 SYNL-FNSSRWKDLNSKF--VLQVYRDFV--ATGDKNFARAVWPSVYIAMAYME-QFDKD 630
+++ L+ +L+ YR+ A G + A A WP + + + ++ ++D
Sbjct: 489 MWDVPIGGPEEPALDGMLGVLLKTYREVRSGAIGLGHLA-ASWPRLMLLLDHVRGRWDPS 547
Query: 631 GDGMIENEGFPDQTYDAWSANGVSAYCGGLWVAALQA--------------ASALANDVG 676
G G++ G T+D GV+ + G LW+AAL+A A AL
Sbjct: 548 GSGVLR--GVQPSTHDI-DLRGVNTFMGTLWLAALRACEEMALRLAADPHHADALGGATL 604
Query: 677 DHASASYFWVRYQKAKAVYDSL-WNGSYF----------NYDNSDGSSSTSIQADQLAGQ 725
A+ +++ A YD + + G Y+ + DG S DQL GQ
Sbjct: 605 GRELAAELHALFERGSAAYDEICFTGEYYRQVLEPDEPTQFQWGDGCLS-----DQLIGQ 659
Query: 726 WYARACGLLPIADEAKVKKALTKIYDFNVLK------------VKGGMCGAMNGMQP-DG 772
W+A L + A V+ AL + N+ + G G + P G
Sbjct: 660 WWAHLLDLGHVLPAAHVRSALRAVVGHNLRRGFDGWEHTQRVYADGDDVGLLMCTWPHGG 719
Query: 773 RIDMSGLQAREIWPGVTYGLAASMIQEEMV 802
R +++ E+W GV Y +AA + E +
Sbjct: 720 RPEVATRYCDEVWSGVEYEVAAHCLLEGLT 749
>gi|156376779|ref|XP_001630536.1| predicted protein [Nematostella vectensis]
gi|156217559|gb|EDO38473.1| predicted protein [Nematostella vectensis]
Length = 666
Score = 53.9 bits (128), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 49/177 (27%), Positives = 83/177 (46%), Gaps = 19/177 (10%)
Query: 588 DLNSKFVLQVYRDFVATGDKNFARAVWPSVYIAMAYMEQFDKDGDGMIENEGFPDQTYDA 647
D ++ FV++VY F TGD R +WP+V A+ ++ G G+ P +
Sbjct: 277 DSSALFVVEVYHYFKWTGDSALLRRLWPNVKRAVTWLIGESTKGTGL------PFRKVST 330
Query: 648 WSANGVSAYC-----GGLWVAALQAASALANDVGD---HASASYFWVRYQKAKAVYDSLW 699
+ G+ Y G+++ AL+AA LA + D +A+ ++ +R + V +LW
Sbjct: 331 YDLVGIDKYDHVMYEAGVFLLALRAAQHLAVHLKDKPFNATTTHALIRATQLVDV--TLW 388
Query: 700 NGS---YFNYDNSDGSSSTSIQADQLAGQWYARACGLLPIADEAKVKKALTKIYDFN 753
N Y + +S+ +S I +D L GQ +A GL + E K+K L +N
Sbjct: 389 NEERQFYRAWWDSEYGASDWIMSDSLYGQVWANTLGLGKLLPEEKMKVHLQNEIKYN 445
>gi|312373134|gb|EFR20945.1| hypothetical protein AND_18252 [Anopheles darlingi]
Length = 734
Score = 52.4 bits (124), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 28/57 (49%), Positives = 36/57 (63%), Gaps = 1/57 (1%)
Query: 88 GIPLGGIGAGSIGRSYRGEFQRFKLFHGICDDAPVLANQFSVFVSRPNGEK-FSSVL 143
G PLGGIGAG+IGR + GEF R++L G+ + V ANQF V + G F S+L
Sbjct: 674 GAPLGGIGAGTIGRGFAGEFCRYQLRPGLYEYNTVHANQFIVTIKDETGATIFQSLL 730
>gi|167524276|ref|XP_001746474.1| hypothetical protein [Monosiga brevicollis MX1]
gi|163775236|gb|EDQ88861.1| predicted protein [Monosiga brevicollis MX1]
Length = 775
Score = 52.4 bits (124), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 54/221 (24%), Positives = 93/221 (42%), Gaps = 30/221 (13%)
Query: 618 YIAMAYME---QFDKDGDGMIENEGFPDQTYDAWSANGVSAYCGGLWVAALQAASALAND 674
++A A ME + + GDG++ G TYD + G + + G L++ AL A+ +A
Sbjct: 468 HLAAATMELLARLYRPGDGVVR--GAQPSTYDV-ALYGANTFIGSLYLCALCCAAEMATR 524
Query: 675 VGDHASASYFWVRYQKAKAVYD--SLWNGSYFNY----DNSDGSSSTSIQADQLAGQWYA 728
+G A+ + RY+ + A YD NG ++ D++ S D L GQW+A
Sbjct: 525 LGQSELAATYTARYKLSVANYDKACFTNGKWYTQVVDPDHATNVVSDGTFVDALLGQWWA 584
Query: 729 RACGLLPIADEAKVKKALTKIYDFNVL---------------KVKGGM--CGAMNGMQPD 771
G+ + +A V ++ +++ N + GM C QP
Sbjct: 585 HLLGIGDMLPQAHVASNVSSVFEANHVDEFDVDRQAPRKFCDNRDAGMYICTWGGAAQPP 644
Query: 772 GRIDMSGLQAREIWPGVTYGLAASMIQEEMVDMAFQTAAGV 812
+ + +W G+ Y A +Q VD+A Q + V
Sbjct: 645 PSNALL-YTSEAMWSGLAYPFAGLALQTGQVDVALQVLSDV 684
>gi|410867532|ref|YP_006982143.1| hypothetical protein PACID_30230 [Propionibacterium acidipropionici
ATCC 4875]
gi|410824173|gb|AFV90788.1| hypothetical protein PACID_30230 [Propionibacterium acidipropionici
ATCC 4875]
Length = 845
Score = 52.0 bits (123), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 59/229 (25%), Positives = 96/229 (41%), Gaps = 25/229 (10%)
Query: 594 VLQVYRDFV--ATGDKNFARAVWPSVYIAMAYME-QFDKDGDGMIENEGFPDQTYDAWSA 650
VL+ YR+ A G + R W ++ M ++ ++D + G++ E T+D S
Sbjct: 497 VLKTYREVRRGAVG-AGWLRDRWNNLCRLMDHVAARWDPECSGLLTGE--QPSTHDI-SL 552
Query: 651 NGVSAYCGGLWVAALQAASALANDVGDHASASYFWVRYQKAKAVYDS-LWNGSYF----- 704
G + + G LW+AAL+A+ +A +G A + R+ + A YD+ L+ G Y+
Sbjct: 553 YGANMFMGSLWLAALRASEEMALILGATDRAMGWRERFVLSSAAYDTALFRGGYYVQAAD 612
Query: 705 NYDNSDGSSSTSIQADQLAGQWYARACGLLPIADEAKVKKALTKIYDFNV---------- 754
D ADQL GQW+A L + V+ AL + N+
Sbjct: 613 ETHGDDDEFGDGCLADQLIGQWWAHLLDLGHLLPAEHVRSALQAVVAHNLNPADETRHSQ 672
Query: 755 --LKVKGGMCGAMNGMQPDGRIDMSGLQAREIWPGVTYGLAASMIQEEM 801
V+G M GR + E+W G Y +AA + E +
Sbjct: 673 RAYAVRGEQGLVMCSWPRGGRPARATRYCDEVWTGCEYEVAALCLAEGL 721
>gi|156383694|ref|XP_001632968.1| predicted protein [Nematostella vectensis]
gi|156220031|gb|EDO40905.1| predicted protein [Nematostella vectensis]
Length = 1641
Score = 50.8 bits (120), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 55/221 (24%), Positives = 92/221 (41%), Gaps = 15/221 (6%)
Query: 553 WVARKCLGAVPHDIGLD-DPWFEINSYNLFNSS-RWKDLNSKFVLQVYRDFVATGDKNFA 610
W + G +P +G W + N ++ D+N F+ QVY ++ TG+K
Sbjct: 639 WAEYRRDGGIPEALGHQYGCWRKTNRFDKAGGRLNMADVNPAFLSQVYHLYMWTGNKKLL 698
Query: 611 RAVWPSVYIAMAYMEQFDKDGDGMIENEGFPDQTYD--AWSANGVSAYCGGLWVAALQAA 668
+WP++ M + +G + TYD + VS Y + + L A
Sbjct: 699 DDLWPAIKDVTHAMVTWTTNGTSLPLRM---TNTYDILQFQRYDVSMYNSVMHLLGLTVA 755
Query: 669 SALANDVGDHASASYFWVRYQKAKAVYDS-LWN---GSYFNYDNSDGSSSTSIQADQLAG 724
A+ + +R +KA + D LW+ G Y + + S++++ AD L G
Sbjct: 756 RAIGQVRKERKFVKDLTLRIRKATSAIDRLLWDEKRGYYRAWWDHAQPSTSALMADSLYG 815
Query: 725 QWYARACGLLPIADEAKVKKALTKIYDFNV----LKVKGGM 761
Q +A GL + DE K+ L K + N LKV G+
Sbjct: 816 QVWAYTLGLGNLLDENKMASHLRKEAELNDTPFGLKVMSGL 856
>gi|361067543|gb|AEW08083.1| Pinus taeda anonymous locus 0_18489_02 genomic sequence
gi|383162873|gb|AFG64132.1| Pinus taeda anonymous locus 0_18489_02 genomic sequence
gi|383162874|gb|AFG64133.1| Pinus taeda anonymous locus 0_18489_02 genomic sequence
gi|383162875|gb|AFG64134.1| Pinus taeda anonymous locus 0_18489_02 genomic sequence
gi|383162876|gb|AFG64135.1| Pinus taeda anonymous locus 0_18489_02 genomic sequence
gi|383162877|gb|AFG64136.1| Pinus taeda anonymous locus 0_18489_02 genomic sequence
gi|383162878|gb|AFG64137.1| Pinus taeda anonymous locus 0_18489_02 genomic sequence
gi|383162879|gb|AFG64138.1| Pinus taeda anonymous locus 0_18489_02 genomic sequence
gi|383162880|gb|AFG64139.1| Pinus taeda anonymous locus 0_18489_02 genomic sequence
gi|383162881|gb|AFG64140.1| Pinus taeda anonymous locus 0_18489_02 genomic sequence
gi|383162882|gb|AFG64141.1| Pinus taeda anonymous locus 0_18489_02 genomic sequence
Length = 79
Score = 50.8 bits (120), Expect = 0.004, Method: Composition-based stats.
Identities = 26/55 (47%), Positives = 32/55 (58%), Gaps = 1/55 (1%)
Query: 838 YRSLCYMRPLTIWAMQWALTKPKLSRQEIKHEISDRDSSYLEQHAAFSKVASLLK 892
YRSL YMRPL IWAMQWAL PK + K + +R+ + HA + VA K
Sbjct: 3 YRSLIYMRPLAIWAMQWALNPPKAVLEAPKINLMEREHPS-QFHAGLAAVAEAFK 56
>gi|220925399|ref|YP_002500701.1| amylo-alpha-16-glucosidase [Methylobacterium nodulans ORS 2060]
gi|219950006|gb|ACL60398.1| Amylo-alpha-16-glucosidase [Methylobacterium nodulans ORS 2060]
Length = 765
Score = 50.4 bits (119), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 38/130 (29%), Positives = 62/130 (47%), Gaps = 18/130 (13%)
Query: 588 DLNSKFVLQVYRDFVATGDKNFARAVWPSVYIAMAYMEQF-DKDGDGMIE---------- 636
D FV+ + F TGDK+ +A+WP++ A+ +++ + D+DGDG +E
Sbjct: 413 DATPLFVMLAGQYFEQTGDKDTIQAIWPNILAALEWIDTYGDRDGDGFVEYARETDKGLA 472
Query: 637 NEGFPDQ-----TYDAWSANGVSAYC--GGLWVAALQAASALANDVGDHASASYFWVRYQ 689
N+G+ D D A G A C G AA +AAS LA +G+ A A +
Sbjct: 473 NQGWKDSHDSIFHADGRMAQGPIALCEVQGYVYAAKRAASKLAATLGNAALAERLKDAAE 532
Query: 690 KAKAVYDSLW 699
+ + +D +
Sbjct: 533 RLRVAFDEAF 542
>gi|338212680|ref|YP_004656735.1| hypothetical protein [Runella slithyformis DSM 19594]
gi|336306501|gb|AEI49603.1| hypothetical protein Runsl_3226 [Runella slithyformis DSM 19594]
Length = 855
Score = 50.1 bits (118), Expect = 0.006, Method: Compositional matrix adjust.
Identities = 77/378 (20%), Positives = 145/378 (38%), Gaps = 47/378 (12%)
Query: 71 KVPVFNVFRKHHITSDQGIPLGGIGAGSIGRSYRGEFQRFKLFH----GICDDAP-VLAN 125
K P+ + + H+T +P+GGIG G++ RG+ Q +++ + G P +A
Sbjct: 23 KWPMLKHYDQQHLTRI-ALPIGGIGTGTVSLGGRGQLQDWEIMNRPGKGFSTTLPGNMAP 81
Query: 126 QFSVFVSRPNGEKFSSVLCPRSPGVPKKNTDSGIESWDWNLKG----ENCTYHALFPRAW 181
F+++V +P + + L G + + +E + G ++ A +P
Sbjct: 82 FFAIYVKKPTEKAQAKALI----GPLETHEYQHMEGRPVDHHGLPRFAKASFDAAYPFGQ 137
Query: 182 TVYDGEPDPELRIVCRQISPFIPHNYKESSFPASVFTFTLSNSGQTSADVTLLFTWANSV 241
E P + + + +P IP + + S P + T+ ++N+ V++ + N V
Sbjct: 138 VNLSDEKLP-ISVQIKAFNPLIPGDAERSGIPMAALTYEVTNTSNQPLTVSVAGSMRNFV 196
Query: 242 A-----------GDSGLSGHHFNSKTMTKDGV-HGLTLHHRTANGRPPV--TFAVAAEET 287
G+ SG N + G+ ++ + P T A++
Sbjct: 197 GKDGSKGSKSWKGEVNPSGAKQNKNQFRQGKTGQGIFMYSDGVDKADPAWGTIALSTNAV 256
Query: 288 ADV--HVSECPCFLLSGNSKGITAKDMWNEIKKHGSFDHLDNDKTSPSEPGSSIGAAIAA 345
DV S P N+ D W++ G +KT+ ++ A++A
Sbjct: 257 GDVSYRTSSVP------NAWENALLDFWDDFTDDGKL----TEKTTVADEDPM--ASLAV 304
Query: 346 SLTIPSGSTRSVTFSLAWDCPEVKFFEKVYHRRYTKFYGTLGDSAARIAHDAILEHAKWE 405
TI T S TF + W P + K R +Y T A +A A+ + + E
Sbjct: 305 KKTIQPKQTESFTFFITWHFPNRFAWSK---ERVGNYYTTRYADAWDVAEKAVPQLPQLE 361
Query: 406 CEIEAWQRPILEDKRFPE 423
E + L+ +PE
Sbjct: 362 NETLLFVNSFLKSS-YPE 378
Score = 49.3 bits (116), Expect = 0.009, Method: Compositional matrix adjust.
Identities = 73/328 (22%), Positives = 123/328 (37%), Gaps = 60/328 (18%)
Query: 594 VLQVYRDFVATGDKNFARAVWPSVYIAMAYM---EQFDKDGDGMIENEGFPDQTYDAWSA 650
+++ YRD+ +G+ F + WP + A+A+ + +D + DG+++ G T D
Sbjct: 484 IMKFYRDWQLSGNTAFLQKNWPKIKRALAFAWVPDGWDGNLDGVMD--GAQHNTMDV-EY 540
Query: 651 NGVSAYCGGLWVAALQAASALANDVGDHASASYFWVRYQKAKAVYDS-LWNGSYF----- 704
G + ++ AL+AA +A + D A A +Q D L+NG Y+
Sbjct: 541 FGPNPQMQIWYLGALKAAEKMAEALHDAAFAQQCKTLFQSGSTWADQQLFNGEYYEQQVM 600
Query: 705 -----NYDNSDGSSSTSIQAD-------------QLAGQWYARACGLLPIADEAKVKKAL 746
+ S S Q D QL GQ+ A CGL + V+ +L
Sbjct: 601 SPVGRKFAKGTVSGWNSTQGDFPPFQLAKGCLVDQLVGQFMAHVCGLGYLVKPQNVQTSL 660
Query: 747 TKIYDFNVLKVKGGMCGAMNGMQ---------------PDGRIDMSGLQAREIWPGVTYG 791
I +N + + N M+ P GR + E+ G Y
Sbjct: 661 KSILKYN---YRATLTEHFNNMRSYALADEPALLMASWPKGRPTVPFPYFAEVMTGFEYT 717
Query: 792 LAASMIQEEMVDMAFQTAAGVYEVAWSEDGLGYS-FQTPESWNNNDEYRSLCYMRPLTIW 850
A M+ E M D + + + + DG S F E ++ Y R + W
Sbjct: 718 AAIGMLYENMTDDGLKC---IQNIRYRYDGAKRSPFDEAECGHH--------YARAMISW 766
Query: 851 AMQWALTKPKLSRQEIKHEISDRDSSYL 878
A AL+ + S E +D+ ++
Sbjct: 767 ASTLALSGFQYSGVEKTLHFNDQPGTFF 794
>gi|428168440|gb|EKX37385.1| hypothetical protein GUITHDRAFT_145081 [Guillardia theta CCMP2712]
Length = 887
Score = 49.3 bits (116), Expect = 0.011, Method: Compositional matrix adjust.
Identities = 58/226 (25%), Positives = 87/226 (38%), Gaps = 62/226 (27%)
Query: 510 VHFYSSFALVMLFPKLELSIQRDFAAAVMMHDPGTMKIMSDGKWVARKCLGAVPHDIGLD 569
+H Y S L +LFP+L R L P +D
Sbjct: 528 LHLYRSIPLALLFPQL-----------------------------VRNILDTGP----ID 554
Query: 570 DPWFEINSYNLFNSSRWKDLNSKFVLQVYRDFVATGDKNFARAVWPSVYIAMAYMEQFDK 629
+P F S + +S N FVL V F++TGD+ + + ++E
Sbjct: 555 EPKFPCGSRMMGDS------NPGFVLSVLNLFMSTGDRR----------VLLDFLEPV-- 596
Query: 630 DGDGMIENEGFP---DQTYD--AWSANGVSAYCGGLWVAALQAASALANDVGDHASASYF 684
M G P TYD + VSAY L++A L AA +A V D + Y
Sbjct: 597 ----MSSRFGLPRHLQNTYDWFHFHLKDVSAYNAVLYLAQLAAAERIAALVQDDSFQHYC 652
Query: 685 WVRYQKAK-AVYDSLWNGSYFN-YDNSDGSSSTSIQADQLAGQWYA 728
+ KA+ V + LWNG + + N +G ++ + L GQ +A
Sbjct: 653 QLHLSKAQEKVKELLWNGRFLRAWWNEEGELVEALHTETLYGQLWA 698
>gi|406663141|ref|ZP_11071211.1| putative bile acid beta-glucosidase [Cecembia lonarensis LW9]
gi|405552803|gb|EKB48138.1| putative bile acid beta-glucosidase [Cecembia lonarensis LW9]
Length = 937
Score = 48.1 bits (113), Expect = 0.019, Method: Compositional matrix adjust.
Identities = 48/178 (26%), Positives = 84/178 (47%), Gaps = 14/178 (7%)
Query: 588 DLNSKFVLQVYRDFVATGDKNFARAVWPSVYIAMAYMEQFDKD----GDGMIENEGFPDQ 643
D + VL+ YRD+ + D F + W ++ +A+ Y+ DK+ +GMI E
Sbjct: 534 DGQAGVVLRAYRDYQMSEDDTFLKNNWDNIKLALKYLIDLDKEYGETANGMIYGE--QHN 591
Query: 644 TYDA-WSANGVSAYCGGLWVAALQAASALANDVGDHASASYFWVRYQKAKAVYDSLWNGS 702
T DA W G L++AAL AA +A GD +S + + + +SL+NG
Sbjct: 592 TLDAEWF--GYIPAITSLYLAALAAAVEMAKATGDISSEKEYLSILESGRKNIESLFNGE 649
Query: 703 YF-NYDNSDGSSSTSIQA----DQLAGQWYARACGLLPIADEAKVKKALTKIYDFNVL 755
YF ++ D + I DQ+ GQ +A L + ++ ++ +L ++ +N +
Sbjct: 650 YFIQEEDPDHLDAIGIGKGCYIDQVFGQGWAFQLNLGRLYNKKMIQSSLDSLWKYNFV 707
>gi|270295078|ref|ZP_06201279.1| conserved hypothetical protein [Bacteroides sp. D20]
gi|270274325|gb|EFA20186.1| conserved hypothetical protein [Bacteroides sp. D20]
Length = 970
Score = 46.2 bits (108), Expect = 0.078, Method: Compositional matrix adjust.
Identities = 49/260 (18%), Positives = 103/260 (39%), Gaps = 57/260 (21%)
Query: 594 VLQVYRDFVATGDKNFARAVWPSVYIAMAY-MEQFDKDGDGMIENEGFPDQTYDAWSANG 652
+++VYR++ +GD + + ++P+V ++ Y + +D G +E + W +G
Sbjct: 612 IMKVYREWRISGDTQWMKDLFPAVKKSLDYCIRTWDPLHKGYLEEPHHNTYDIEFWGPDG 671
Query: 653 VSAYCGGLWVAALQAASALANDVGDHASASYFWVRYQKAKAVYDSLWNGSYF-------- 704
+ C ++ AL A + ++ Y + + K + +L++G YF
Sbjct: 672 M---CTSFYLGALTAFIEMGKEL-KQPVKEYTALLSKGKKYMETALFDGEYFIQKIQWEG 727
Query: 705 -------NYDNSDGSSS-----------------TSIQADQLAGQWYARACGLLPIADEA 740
+ + GS S T +D + G W A CGL + D
Sbjct: 728 LQAPNPVDVMSFGGSYSDEALKLLKEEGPKYQYGTGCLSDGILGMWMASVCGLDEVLDNE 787
Query: 741 KVKKALTKIYDFNVLKVKGGMCGAMNGMQP-----------------DGRIDMSGLQARE 783
KV+ L ++ +N +K + N +P G + + + + E
Sbjct: 788 KVRSHLVAVHKYN---LKHDLVDHFNPQRPVYACGKDGGLLLCTWPKGGMLSLPFVYSNE 844
Query: 784 IWPGVTYGLAASMIQEEMVD 803
+W G+ Y +A+ ++ + V+
Sbjct: 845 VWTGIEYQVASHLMMKGEVE 864
>gi|160887666|ref|ZP_02068669.1| hypothetical protein BACUNI_00066 [Bacteroides uniformis ATCC 8492]
gi|156862844|gb|EDO56275.1| hypothetical protein BACUNI_00066 [Bacteroides uniformis ATCC 8492]
Length = 992
Score = 46.2 bits (108), Expect = 0.078, Method: Compositional matrix adjust.
Identities = 49/260 (18%), Positives = 103/260 (39%), Gaps = 57/260 (21%)
Query: 594 VLQVYRDFVATGDKNFARAVWPSVYIAMAY-MEQFDKDGDGMIENEGFPDQTYDAWSANG 652
+++VYR++ +GD + + ++P+V ++ Y + +D G +E + W +G
Sbjct: 634 IMKVYREWRISGDTQWMKDLFPAVKKSLDYCIRTWDPLHKGYLEEPHHNTYDIEFWGPDG 693
Query: 653 VSAYCGGLWVAALQAASALANDVGDHASASYFWVRYQKAKAVYDSLWNGSYF-------- 704
+ C ++ AL A + ++ Y + + K + +L++G YF
Sbjct: 694 M---CTSFYLGALTAFIEMGKEL-KQPVKEYTALLSKGKKYMETALFDGEYFIQKIQWEG 749
Query: 705 -------NYDNSDGSSS-----------------TSIQADQLAGQWYARACGLLPIADEA 740
+ + GS S T +D + G W A CGL + D
Sbjct: 750 LQAPNPVDVMSFGGSYSEEALKLLKEEGPKYQYGTGCLSDGILGMWMASVCGLDEVLDNE 809
Query: 741 KVKKALTKIYDFNVLKVKGGMCGAMNGMQP-----------------DGRIDMSGLQARE 783
KV+ L ++ +N +K + N +P G + + + + E
Sbjct: 810 KVRSHLVAVHKYN---LKHDLIDHFNPQRPVYACGKDGGLLLCTWPKGGMLSLPFVYSNE 866
Query: 784 IWPGVTYGLAASMIQEEMVD 803
+W G+ Y +A+ ++ + V+
Sbjct: 867 VWTGIEYQVASHLMMKGEVE 886
>gi|424029315|ref|ZP_17768855.1| hypothetical protein VCHENC01_3267 [Vibrio cholerae HENC-01]
gi|408887281|gb|EKM25904.1| hypothetical protein VCHENC01_3267 [Vibrio cholerae HENC-01]
Length = 924
Score = 46.2 bits (108), Expect = 0.084, Method: Compositional matrix adjust.
Identities = 44/170 (25%), Positives = 70/170 (41%), Gaps = 19/170 (11%)
Query: 578 YNLFNSSRWKDLNSKFVLQVYRDFVATGDKNFARAVWPSVYIAMAYMEQFDKDGDGMIEN 637
YN + + W + +++ +GD FA+ + P + + +E+ D DG++ +
Sbjct: 480 YNTTDGTPW------MIREIFEYINYSGDTQFAQQMLPIIERFIEGVEKHYLDDDGLMRH 533
Query: 638 EGFPDQTYDA-------WSANGVSAY-CGGLWVAALQAASALANDVGDHASASYFWVRYQ 689
PD DA WS G A LW +LQ A+ LA GDHA A +
Sbjct: 534 RD-PDTWMDAKINGQIPWSPRGPKANDIEALWFESLQVANQLAKWNGDHALAEHCQTLAN 592
Query: 690 KAKAVY-DSLWNGSYF---NYDNSDGSSSTSIQADQLAGQWYARACGLLP 735
K KA + D W+ ++ D SI+ +QL L+P
Sbjct: 593 KVKASFIDKFWDSESMCLADHLKQDDVKDISIRPNQLMTLTIPTNQALIP 642
>gi|431796995|ref|YP_007223899.1| glycogen debranching protein [Echinicola vietnamensis DSM 17526]
gi|430787760|gb|AGA77889.1| glycogen debranching enzyme [Echinicola vietnamensis DSM 17526]
Length = 1213
Score = 46.2 bits (108), Expect = 0.087, Method: Compositional matrix adjust.
Identities = 42/141 (29%), Positives = 68/141 (48%), Gaps = 16/141 (11%)
Query: 588 DLNSKFVLQVYRDFVATGDKNFARAVWPSVYIAMAYMEQ-FDKDGDGMIENEGFPDQTYD 646
D+N F Q+ R F TGDK + +AVWPS+ +A+ ++ FD DGDG+ +
Sbjct: 419 DMNQVFFDQLLRHFDWTGDKAYLQAVWPSIERHLAWEKRNFDPDGDGLY------NAYAS 472
Query: 647 AWSANGVSAYCGGLWVA------ALQAASALANDVGDHASASYFWVRYQKAKAVYDSLW- 699
W+++ + GG+ A A + A+ LA +G+ Y KA+ ++LW
Sbjct: 473 IWASDALQYNSGGVAHASAYNYFANKKAAELAAFIGEDGK-QYAAEAEAILKAMNETLWL 531
Query: 700 -NGSYFNYDNSDGSSSTSIQA 719
+G Y Y + G + QA
Sbjct: 532 PSGWYAEYKDFLGPQNLHPQA 552
>gi|317478064|ref|ZP_07937244.1| hypothetical protein HMPREF1007_00360 [Bacteroides sp. 4_1_36]
gi|316905767|gb|EFV27541.1| hypothetical protein HMPREF1007_00360 [Bacteroides sp. 4_1_36]
Length = 970
Score = 46.2 bits (108), Expect = 0.090, Method: Compositional matrix adjust.
Identities = 49/257 (19%), Positives = 103/257 (40%), Gaps = 51/257 (19%)
Query: 594 VLQVYRDFVATGDKNFARAVWPSVYIAMAY-MEQFDKDGDGMIENEGFPDQTYDAWSANG 652
+++VYR++ +GD + + ++P+V ++ Y + +D G +E + W +G
Sbjct: 612 IMKVYREWRISGDTQWMKDLFPAVKKSLDYCIRTWDPLHKGYLEEPHHNTYDIEFWGPDG 671
Query: 653 VSAYCGGLWVAALQAASALANDVGDHASASYFWVRYQKAKAVYDSLWNGSYF-------- 704
+ C ++ AL A + ++ Y + + K + +L++G YF
Sbjct: 672 M---CTSFYLGALTAFIEMGKEL-KQPVKEYTALLSKGKKYMETALFDGEYFIQKIQWEG 727
Query: 705 -------NYDNSDGSSS-----------------TSIQADQLAGQWYARACGLLPIADEA 740
+ + GS S T +D + G W A CGL + D
Sbjct: 728 LQAPNPVDVMSFGGSYSEEALKLLKEEGPKYQYGTGCLSDGILGMWMASVCGLDEVLDNE 787
Query: 741 KVKKALTKIYDFNVL---------KVKGGMCGAMNGM-----QPDGRIDMSGLQAREIWP 786
KV+ L ++ +N+ + CG G+ G + + + + E+W
Sbjct: 788 KVRSHLVAVHKYNLKHDLIDHFNPQRPVYACGKDGGLLLCTWPKGGMLSLPFVYSNEVWT 847
Query: 787 GVTYGLAASMIQEEMVD 803
G+ Y +A+ ++ + V+
Sbjct: 848 GIEYQVASHLMMKGEVE 864
>gi|268317317|ref|YP_003291036.1| amylo-alpha-16-glucosidase [Rhodothermus marinus DSM 4252]
gi|262334851|gb|ACY48648.1| Amylo-alpha-16-glucosidase [Rhodothermus marinus DSM 4252]
Length = 868
Score = 45.8 bits (107), Expect = 0.10, Method: Compositional matrix adjust.
Identities = 38/147 (25%), Positives = 65/147 (44%), Gaps = 19/147 (12%)
Query: 560 GAVPHDI----GLDDPWFEINSYNLFNSSRWKDLNSKFVLQVYRDFVATGDKNFARAVWP 615
G +PH+I L D WFE Y ++ D F++ + A+GD ++ R W
Sbjct: 363 GKIPHEISQSAALID-WFEDYPYPWASA----DATPLFIIAHADYWQASGDLDYIREHWD 417
Query: 616 SVYIAMAYMEQFDKDGDGMIENEGFPDQTYDAWSANGV------SAYCGGLWVAALQAAS 669
++ A + D DG+ ++EN G W G+ Y G+W+AAL+
Sbjct: 418 ALVRAYRFTAGTDTDGNDLVENTG----VGHGWVEGGLLYPPHEELYQQGVWLAALEGME 473
Query: 670 ALANDVGDHASASYFWVRYQKAKAVYD 696
A+A +G+ A+ R + +A +
Sbjct: 474 AMATALGETELAAEVRQRAVRVRAAIE 500
>gi|345303570|ref|YP_004825472.1| Amylo-alpha-16-glucosidase [Rhodothermus marinus SG0.5JP17-172]
gi|345112803|gb|AEN73635.1| Amylo-alpha-16-glucosidase [Rhodothermus marinus SG0.5JP17-172]
Length = 868
Score = 45.8 bits (107), Expect = 0.12, Method: Compositional matrix adjust.
Identities = 38/147 (25%), Positives = 65/147 (44%), Gaps = 19/147 (12%)
Query: 560 GAVPHDI----GLDDPWFEINSYNLFNSSRWKDLNSKFVLQVYRDFVATGDKNFARAVWP 615
G +PH+I L D WFE Y ++ D F++ + A+GD ++ R W
Sbjct: 363 GKIPHEISQSAALID-WFEDYPYPWASA----DATPLFIIAHADYWQASGDLDYIREHWD 417
Query: 616 SVYIAMAYMEQFDKDGDGMIENEGFPDQTYDAWSANGV------SAYCGGLWVAALQAAS 669
++ A + D DG+ ++EN G W G+ Y G+W+AAL+
Sbjct: 418 ALVRAYRFTAGTDTDGNDLVENTG----VGHGWVEGGLLYPPHEELYQQGVWLAALEGME 473
Query: 670 ALANDVGDHASASYFWVRYQKAKAVYD 696
A+A +G+ A+ R + +A +
Sbjct: 474 AMATALGETELAAEVRQRAVRVRAAIE 500
>gi|170743420|ref|YP_001772075.1| amylo-alpha-16-glucosidase [Methylobacterium sp. 4-46]
gi|168197694|gb|ACA19641.1| Amylo-alpha-16-glucosidase [Methylobacterium sp. 4-46]
Length = 764
Score = 45.4 bits (106), Expect = 0.13, Method: Compositional matrix adjust.
Identities = 36/127 (28%), Positives = 59/127 (46%), Gaps = 18/127 (14%)
Query: 588 DLNSKFVLQVYRDFVATGDKNFARAVWPSVYIAMAYMEQF-DKDGDGMIE---------- 636
D FV+ + F TGD+ +A+WP++ A+ +++ + D+DGDG +E
Sbjct: 412 DATPLFVMLAGQYFEQTGDRATVQAIWPNILAALEWIDTYGDRDGDGFVEYARETDKGLA 471
Query: 637 NEGFPDQ-----TYDAWSANGVSAYC--GGLWVAALQAASALANDVGDHASASYFWVRYQ 689
N+G+ D D A G A C G AA +AAS LA +G A + +
Sbjct: 472 NQGWKDSHDSIFHADGRMAQGPIALCEVQGYVYAAKRAASKLAGVLGHVALSERLKDAAE 531
Query: 690 KAKAVYD 696
+ + +D
Sbjct: 532 RLRTAFD 538
>gi|154246650|ref|YP_001417608.1| amylo-alpha-16-glucosidase [Xanthobacter autotrophicus Py2]
gi|154160735|gb|ABS67951.1| Amylo-alpha-16-glucosidase [Xanthobacter autotrophicus Py2]
Length = 733
Score = 45.4 bits (106), Expect = 0.14, Method: Compositional matrix adjust.
Identities = 36/124 (29%), Positives = 61/124 (49%), Gaps = 30/124 (24%)
Query: 604 TGDKNFARAVWPSVYIAMAYMEQF-DKDGDGMIE----------NEGFPDQTYDAWSANG 652
TGD + AR +WP+V A+ ++ + D+DGDG +E N+G+ D + A+G
Sbjct: 397 TGDLDTARLLWPAVAAALGWITDYGDRDGDGFVEYGRRSGDGLVNQGWKDSHDSVFHADG 456
Query: 653 -----------VSAYCGGLWVAALQAASALANDVGDHASASYFWVRYQKAKAVYDSLWNG 701
V AY G W QAA+++A +G A F +KA + D+ ++
Sbjct: 457 RLARGPIALVEVQAYVFGAW----QAAASIARRLGQAPDAMVF---ERKADLLRDA-FDA 508
Query: 702 SYFN 705
++F+
Sbjct: 509 AFFD 512
>gi|449681370|ref|XP_002168270.2| PREDICTED: uncharacterized protein LOC100212022 [Hydra
magnipapillata]
Length = 455
Score = 45.4 bits (106), Expect = 0.16, Method: Compositional matrix adjust.
Identities = 68/268 (25%), Positives = 104/268 (38%), Gaps = 35/268 (13%)
Query: 594 VLQVYRDFVATGDKNFARAVW-PSVYIAMAYMEQFDKDGDGMIENEGFPDQTYDAWSANG 652
VL+ YR+ K++ VW P + +++ D D +GMI G TYD + G
Sbjct: 67 VLKTYREIRQGASKSWFNTVWNPIKLLINRWLDVMDTDKNGMIY--GPQPNTYDV-ATYG 123
Query: 653 VSAYCGGLWVAALQAASALANDVGDHASASYFWVRYQKAKAVYD--SLWNGSYFNYDNSD 710
S G L++ AL+A +A D A R+ D NG ++ +
Sbjct: 124 YSTLIGLLYLCALRATEEMAKLQNDMNFAKVCRDRFNLGTVNLDKACFTNGKWYTHVMDP 183
Query: 711 G------SSSTSIQADQLAGQWYARACGLLPIADEAKVKKALTKIYDFNVL-------KV 757
S T I + GQW+A L + + +K L Y N + +
Sbjct: 184 AHPFNVLGSCTFIGG--MYGQWWAHILNLGYLLPQEHIKSHLKYTYTRNFVPSFDPKTQS 241
Query: 758 KGGMC-----GAMNGMQPDGRIDMSGLQ-AREI-WPGVTYGLAASMIQE----EMVDMAF 806
C G + G+ DG + L + EI W GV Y AA +I+E E +D+
Sbjct: 242 PRAFCDQRDSGWVIGVWDDGTKPANPLYYSSEIAWSGVIYPFAAMLIEEGLTKEGLDVLE 301
Query: 807 QTAA---GVYEVAWSEDGLGYSFQTPES 831
+T G W+E G + P S
Sbjct: 302 KTRYFYDGTRRSPWNEIECGDHYARPLS 329
>gi|300770983|ref|ZP_07080860.1| conserved hypothetical protein [Sphingobacterium spiritivorum ATCC
33861]
gi|300762256|gb|EFK59075.1| conserved hypothetical protein [Sphingobacterium spiritivorum ATCC
33861]
Length = 1205
Score = 45.1 bits (105), Expect = 0.17, Method: Compositional matrix adjust.
Identities = 26/78 (33%), Positives = 43/78 (55%), Gaps = 15/78 (19%)
Query: 588 DLNSKFVLQVYRDFVATGDKNFARAVWPSVYIAMAYMEQ-FDKDGDGMIENEGFPDQTYD 646
D+N F Q++ F TGD+ F + VWP++ +A+ ++ FD DGDG+ YD
Sbjct: 418 DMNLVFFDQLFTHFQYTGDRTFLKEVWPAIKRHLAWEKRNFDMDGDGL----------YD 467
Query: 647 A----WSANGVSAYCGGL 660
A W+++G+ GG+
Sbjct: 468 AYCAIWASDGLQYSGGGV 485
>gi|424041235|ref|ZP_17779213.1| hypothetical protein VCHENC02_5228 [Vibrio cholerae HENC-02]
gi|408890954|gb|EKM28915.1| hypothetical protein VCHENC02_5228 [Vibrio cholerae HENC-02]
Length = 924
Score = 45.1 bits (105), Expect = 0.17, Method: Compositional matrix adjust.
Identities = 44/170 (25%), Positives = 69/170 (40%), Gaps = 19/170 (11%)
Query: 578 YNLFNSSRWKDLNSKFVLQVYRDFVATGDKNFARAVWPSVYIAMAYMEQFDKDGDGMIEN 637
YN + + W + +++ +GD FA+ + P + + +E+ D DG++ +
Sbjct: 480 YNTTDGTPW------MIREIFEYINYSGDTQFAQQMLPIIERFIEGVEKHYLDDDGLMRH 533
Query: 638 EGFPDQTYDA-------WSANGVSAY-CGGLWVAALQAASALANDVGDHASASYFWVRYQ 689
PD DA WS G A LW +LQ A LA GDHA A +
Sbjct: 534 RD-PDTWMDAKINGQIPWSPRGPKANDIEALWFESLQVAIQLAKWNGDHALAEHCQTLAN 592
Query: 690 KAKAVY-DSLWNGSYF---NYDNSDGSSSTSIQADQLAGQWYARACGLLP 735
K KA + D W+ ++ D SI+ +QL L+P
Sbjct: 593 KVKASFIDKFWDSESMCLADHLKQDDVKDISIRPNQLMTLTIPTNQALIP 642
>gi|227537591|ref|ZP_03967640.1| conserved hypothetical protein [Sphingobacterium spiritivorum ATCC
33300]
gi|227242593|gb|EEI92608.1| conserved hypothetical protein [Sphingobacterium spiritivorum ATCC
33300]
Length = 1205
Score = 45.1 bits (105), Expect = 0.19, Method: Compositional matrix adjust.
Identities = 26/78 (33%), Positives = 43/78 (55%), Gaps = 15/78 (19%)
Query: 588 DLNSKFVLQVYRDFVATGDKNFARAVWPSVYIAMAYMEQ-FDKDGDGMIENEGFPDQTYD 646
D+N F Q++ F TGD+ F + VWP++ +A+ ++ FD DGDG+ YD
Sbjct: 418 DMNLVFFDQLFTHFQYTGDRTFLKEVWPAIKRHLAWEKRNFDMDGDGL----------YD 467
Query: 647 A----WSANGVSAYCGGL 660
A W+++G+ GG+
Sbjct: 468 AYCAIWASDGLQYSGGGV 485
>gi|159040682|ref|YP_001539934.1| glycoside hydrolase 15-like protein [Caldivirga maquilingensis
IC-167]
gi|157919517|gb|ABW00944.1| glycoside hydrolase 15-related [Caldivirga maquilingensis IC-167]
Length = 604
Score = 45.1 bits (105), Expect = 0.20, Method: Compositional matrix adjust.
Identities = 36/120 (30%), Positives = 49/120 (40%), Gaps = 11/120 (9%)
Query: 588 DLNSKFVLQVYRDFVATGDKNFARAVWPSVYIAMAYMEQFDKDGDGMIENEGFPDQTYDA 647
D + VL Y F T D+ F + W V A Y+ + ++G T
Sbjct: 346 DETASVVLATYVHFKLTLDRQFLKNAWVMVRKAAEYLA-------ANVSDDGLTTPTVGP 398
Query: 648 WSAN-GVSAYCGGLWVAALQAASALANDVGDHASASYFWVRYQKA--KAVYDSLWNGSYF 704
W + GV Y AAL +AS LA ++GD A W RY + WNGS+F
Sbjct: 399 WEEHLGVHTYTNASVYAALSSASYLAGEIGDRNRAVE-WGRYASVIRSTTLNQAWNGSFF 457
>gi|398838157|ref|ZP_10595439.1| glycogen debranching enzyme [Pseudomonas sp. GM102]
gi|398116719|gb|EJM06477.1| glycogen debranching enzyme [Pseudomonas sp. GM102]
Length = 1066
Score = 44.7 bits (104), Expect = 0.22, Method: Compositional matrix adjust.
Identities = 34/132 (25%), Positives = 57/132 (43%), Gaps = 26/132 (19%)
Query: 588 DLNSKFVLQVYRDFVATGDKNFARAVWPSVYIAMAYMEQF-DKDGDGMIE---------- 636
D FV+ T D R +WP++ A++++E++ D+DGDG +E
Sbjct: 714 DSTPLFVMLAAAYLERTNDLGTLRQLWPNIEAALSWIEEYGDRDGDGFVEYGRQSAEGLI 773
Query: 637 NEGFPDQTYDAWSANG-----------VSAYCGGLWVAALQAASALANDVGDHASASYFW 685
N+G+ D + ANG V AY G W A+ +A +GD A
Sbjct: 774 NQGWKDSHDSVFHANGQLAVGPIAIVEVQAYVYGAWT----GAAKIARRLGDPDRAQRLK 829
Query: 686 VRYQKAKAVYDS 697
+ Q+ + +D+
Sbjct: 830 YKAQRLREEFDN 841
>gi|326797904|ref|YP_004315723.1| hypothetical protein [Sphingobacterium sp. 21]
gi|326548668|gb|ADZ77053.1| hypothetical protein Sph21_0471 [Sphingobacterium sp. 21]
Length = 1132
Score = 44.3 bits (103), Expect = 0.35, Method: Compositional matrix adjust.
Identities = 31/101 (30%), Positives = 50/101 (49%), Gaps = 21/101 (20%)
Query: 588 DLNSKFVLQVYRDFVATGDKNFARAVWPSVYIAMAYMEQ-FDKDGDGMIENEGFPDQTYD 646
D+N F Q+Y F TGD F +A WP++ +A+ ++ FD+DGDG+ YD
Sbjct: 416 DMNLVFFDQLYNHFQYTGDTAFIKASWPAIKRHLAWEKRNFDQDGDGL----------YD 465
Query: 647 A----WSANGVSAYCGGLW------VAALQAASALANDVGD 677
A W+++ + GG+ + A+ LA VG+
Sbjct: 466 AYCAIWASDALQYSGGGVTHTSAYNYRGFEMAAKLATIVGE 506
>gi|170749555|ref|YP_001755815.1| amylo-alpha-16-glucosidase [Methylobacterium radiotolerans JCM
2831]
gi|170656077|gb|ACB25132.1| Amylo-alpha-16-glucosidase [Methylobacterium radiotolerans JCM
2831]
Length = 781
Score = 43.9 bits (102), Expect = 0.40, Method: Compositional matrix adjust.
Identities = 36/133 (27%), Positives = 57/133 (42%), Gaps = 19/133 (14%)
Query: 593 FVLQVYRDFVATGDKNFARAVWPSVYIAMAYMEQF-DKDGDGMI----------ENEGFP 641
FV+ + + TGD RA+WP++ +A+ +M+++ D+DGDG + EN+G+
Sbjct: 434 FVMLAWEYYAVTGDLETVRAIWPALELALEWMDRYGDRDGDGFVEYARDTDKGLENQGWK 493
Query: 642 DQ-----TYDAWSANGVSAYC--GGLWVAALQAASALANDVGDHASASYFWVRYQKAKAV 694
D D A G A C G AA + LA +G A +
Sbjct: 494 DSHDSIFHADGHLAKGPIALCEVQGYVYAAKNGMAKLAALIGHDPMAERLTQEAASLRER 553
Query: 695 YD-SLWNGSYFNY 706
+D + WN Y
Sbjct: 554 FDAAFWNEEIGTY 566
>gi|373851062|ref|ZP_09593863.1| transcriptional regulator, GntR family [Opitutaceae bacterium TAV5]
gi|372477227|gb|EHP37236.1| transcriptional regulator, GntR family [Opitutaceae bacterium TAV5]
Length = 938
Score = 43.9 bits (102), Expect = 0.46, Method: Compositional matrix adjust.
Identities = 29/114 (25%), Positives = 57/114 (50%), Gaps = 8/114 (7%)
Query: 604 TGDKNFARAVWPSVYIAMAYMEQFDKDGDGMIENEG---FPDQTYDA----WSANGVSAY 656
TGD + WP++ +A+ + F++ DG++ + D DA + V A
Sbjct: 602 TGDDSLLAHWWPNLLLAIRWFSAFERPDDGLLARVPHWIYIDMGRDAAGRGMGSGEVLAT 661
Query: 657 CGGLWVAALQAASALANDVGDHASASYFWVRYQK-AKAVYDSLWNGSYFNYDNS 709
+++AL+A + AN +GD AS ++F R ++ A ++ + WN + Y ++
Sbjct: 662 LNLQYLSALRAIAGYANRLGDQASHAWFASRAERLASSLREQFWNETEHGYADA 715
>gi|393766406|ref|ZP_10354962.1| amylo-alpha-16-glucosidase [Methylobacterium sp. GXF4]
gi|392728187|gb|EIZ85496.1| amylo-alpha-16-glucosidase [Methylobacterium sp. GXF4]
Length = 780
Score = 43.5 bits (101), Expect = 0.50, Method: Compositional matrix adjust.
Identities = 18/45 (40%), Positives = 31/45 (68%), Gaps = 1/45 (2%)
Query: 593 FVLQVYRDFVATGDKNFARAVWPSVYIAMAYMEQF-DKDGDGMIE 636
FV+ + + TGD RA+WPS+ +A+ +M+++ D+DGDG +E
Sbjct: 433 FVMLAWEYYAVTGDLETIRAIWPSLELALEWMDRYGDRDGDGFVE 477
>gi|391229074|ref|ZP_10265280.1| transcriptional regulator [Opitutaceae bacterium TAV1]
gi|391218735|gb|EIP97155.1| transcriptional regulator [Opitutaceae bacterium TAV1]
Length = 939
Score = 43.5 bits (101), Expect = 0.54, Method: Compositional matrix adjust.
Identities = 29/114 (25%), Positives = 57/114 (50%), Gaps = 8/114 (7%)
Query: 604 TGDKNFARAVWPSVYIAMAYMEQFDKDGDGMIENEG---FPDQTYDA----WSANGVSAY 656
TGD + WP++ +A+ + F++ DG++ + D DA + V A
Sbjct: 603 TGDDSLLAHWWPNLLLAIRWFSAFERPDDGLLARVPHWIYIDMGRDAAGRGMGSGEVLAT 662
Query: 657 CGGLWVAALQAASALANDVGDHASASYFWVRYQK-AKAVYDSLWNGSYFNYDNS 709
+++AL+A + AN +GD AS ++F R ++ A ++ + WN + Y ++
Sbjct: 663 LNLQYLSALRAVAGYANRLGDQASHAWFSGRAERLATSLREQFWNETEHGYADA 716
>gi|75676297|ref|YP_318718.1| amylo-alpha-1,6-glucosidase [Nitrobacter winogradskyi Nb-255]
gi|74421167|gb|ABA05366.1| amylo-alpha-1,6-glucosidase [Nitrobacter winogradskyi Nb-255]
Length = 741
Score = 43.5 bits (101), Expect = 0.57, Method: Compositional matrix adjust.
Identities = 29/90 (32%), Positives = 42/90 (46%), Gaps = 22/90 (24%)
Query: 604 TGDKNFARAVWPSVYIAMAYMEQF-DKDGDGMIE----------NEGFPDQTYDAWSANG 652
TGD R +WP + A+ ++EQ+ D+DGDG +E N+G+ D + A+G
Sbjct: 405 TGDVVTIRRLWPHIEAALTWIEQYGDRDGDGFVEYHRQTSEGLINQGWKDSYDSVFHADG 464
Query: 653 -----------VSAYCGGLWVAALQAASAL 671
V AY G W AA + A L
Sbjct: 465 TFAKGPIALVEVQAYVYGAWCAAAKIAEQL 494
>gi|384263211|ref|YP_005418399.1| amylo-alpha-16-glucosidase [Rhodospirillum photometricum DSM 122]
gi|378404313|emb|CCG09429.1| Amylo-alpha-16-glucosidase [Rhodospirillum photometricum DSM 122]
Length = 728
Score = 42.7 bits (99), Expect = 0.84, Method: Compositional matrix adjust.
Identities = 28/74 (37%), Positives = 41/74 (55%), Gaps = 12/74 (16%)
Query: 604 TGDKNFARAVWPSVYIAMAYMEQF-DKDGDGMIE----------NEGFPDQTYDAWSANG 652
TGD FARA+WP+V A+A+++ D+DGDG +E N+G+ D + A+G
Sbjct: 413 TGDGAFARALWPAVDAALAWIDHHGDRDGDGFVEYGRRCDDGLVNQGWKDSDDSVFHADG 472
Query: 653 VSAYCGGLWVAALQ 666
A G L V +Q
Sbjct: 473 RLAQ-GPLAVCEVQ 485
>gi|403511592|ref|YP_006643230.1| bacterial alpha-L-rhamnosidase family protein [Nocardiopsis alba
ATCC BAA-2165]
gi|402801909|gb|AFR09319.1| bacterial alpha-L-rhamnosidase family protein [Nocardiopsis alba
ATCC BAA-2165]
Length = 712
Score = 42.7 bits (99), Expect = 0.87, Method: Compositional matrix adjust.
Identities = 36/112 (32%), Positives = 53/112 (47%), Gaps = 18/112 (16%)
Query: 604 TGDKNFARAVWPSVYIAMAYMEQFDKDGDGMIENEGFPDQTYDA---------WS----A 650
TGD FARA+ P A++++E D DG+ E D+ D WS A
Sbjct: 346 TGDTGFARAMLPVARRAISFLESR-TDADGLFREE--EDRRTDPEAPLAMVANWSPLDLA 402
Query: 651 NGVSAYCGGLWVAALQAASALANDVGDHASASY-FWVRYQK-AKAVYDSLWN 700
+GV A + ALQ +AL DV D A++ R ++ +A+ D LW+
Sbjct: 403 SGVDALTNAVLYDALQGLAALERDVADDVDAAHRLEERAERLRRALVDRLWD 454
>gi|334345035|ref|YP_004553587.1| amylo-alpha-16-glucosidase [Sphingobium chlorophenolicum L-1]
gi|334101657|gb|AEG49081.1| Amylo-alpha-16-glucosidase [Sphingobium chlorophenolicum L-1]
Length = 744
Score = 42.7 bits (99), Expect = 1.00, Method: Compositional matrix adjust.
Identities = 34/118 (28%), Positives = 53/118 (44%), Gaps = 26/118 (22%)
Query: 604 TGDKNFARAVWPSVYIAMAYM-EQFDKDGDGMIE----------NEGFPDQTYDAWSANG 652
TGD F ++ P V A+A++ E D+DGDG +E N+G+ D + A+G
Sbjct: 408 TGDSAFIASILPHVRAALAWIDEHGDRDGDGFVEYGRLTDQGLVNQGWKDSHDSVFHADG 467
Query: 653 -----------VSAYCGGLWVAALQAASALANDVGDHASASYFWVRYQKAKAVYDSLW 699
V AY G W +AA A+ +GD A +F R + +D+ +
Sbjct: 468 SIARGPIALAEVQAYVYGAW----RAAEAIFVTLGDPDRALHFRARADDLRERFDAAF 521
>gi|429221453|ref|YP_007173779.1| glycogen debranching protein [Deinococcus peraridilitoris DSM
19664]
gi|429132316|gb|AFZ69330.1| glycogen debranching enzyme [Deinococcus peraridilitoris DSM 19664]
Length = 430
Score = 42.4 bits (98), Expect = 1.2, Method: Compositional matrix adjust.
Identities = 28/94 (29%), Positives = 44/94 (46%), Gaps = 1/94 (1%)
Query: 588 DLNSKFVLQVYRDFVATGDKNFARAVWPSVYIAMAYMEQFDKDGDGMIENEGFPDQTYDA 647
D N FVL Y T D F RA WP + A+ ++ D + G++E D D
Sbjct: 99 DANMWFVLGHYACHRRTNDTAFLRAAWPRIQAALLWLRYQDMNDCGLLEAPEAADWA-DL 157
Query: 648 WSANGVSAYCGGLWVAALQAASALANDVGDHASA 681
++ + Y L+V L+AA L+ +G+ +A
Sbjct: 158 YATRYNTLYANALYVGTLEAAGYLSAALGEDGAA 191
>gi|449672634|ref|XP_002169551.2| PREDICTED: uncharacterized protein LOC100201428 [Hydra
magnipapillata]
Length = 502
Score = 42.4 bits (98), Expect = 1.3, Method: Compositional matrix adjust.
Identities = 44/165 (26%), Positives = 75/165 (45%), Gaps = 9/165 (5%)
Query: 588 DLNSKFVLQVYRDFVATGDKNFARAVWPSVYIAMAYMEQFDKDGDGMIENEGFPDQTYDA 647
D+N F++Q + + D F ++P AM ++ Q G G+ + PD TYD
Sbjct: 107 DVNVGFIVQALLMYKWSNDTVFFDEIYPFTVRAMNWLMQDATKGTGLPYRK--PD-TYDL 163
Query: 648 WSANGVS--AYCGGLWVAALQAASALANDVGDHASASYFWVRYQKA-KAVYDSLWN---G 701
+ AY ++ AL A ++A+ D+ + +A K + +WN G
Sbjct: 164 FDLEKYDHCAYNSISYLLALHAMKSMASMQKDNNTIKNVNAALARATKQLEVEMWNEKKG 223
Query: 702 SYFNYDNSDGSSSTSIQADQLAGQWYARACGLLPIADEAKVKKAL 746
Y + + + S T + +D L GQ +A GL + D+ K+KK L
Sbjct: 224 FYHAWFDDEWGSPTWLMSDVLYGQVWAYTLGLGDLLDKDKMKKHL 268
>gi|217978466|ref|YP_002362613.1| amylo-alpha-16-glucosidase [Methylocella silvestris BL2]
gi|217503842|gb|ACK51251.1| Amylo-alpha-16-glucosidase [Methylocella silvestris BL2]
Length = 729
Score = 42.4 bits (98), Expect = 1.4, Method: Compositional matrix adjust.
Identities = 35/115 (30%), Positives = 51/115 (44%), Gaps = 18/115 (15%)
Query: 588 DLNSKFVLQVYRDFVATGDKNFARAVWPSVYIAMAYMEQF-DKDGDGMIE---------- 636
D FV+ F TGD +WP++ A+ ++++F D+DGDG +E
Sbjct: 377 DATPLFVMLAGMYFERTGDSETIAKIWPNILAALRWIDEFGDRDGDGFVEYLRETENGLA 436
Query: 637 NEGFPDQ-----TYDAWSANGVSAYC--GGLWVAALQAASALANDVGDHASASYF 684
N+G+ D D A G A C G AA AA LA +GD + + F
Sbjct: 437 NQGWKDSHDSIFHADGSLAEGPIALCEVQGYVYAAKSAAGRLARLMGDDDAGAKF 491
>gi|150376166|ref|YP_001312762.1| amylo-alpha-16-glucosidase [Sinorhizobium medicae WSM419]
gi|150030713|gb|ABR62829.1| Amylo-alpha-16-glucosidase [Sinorhizobium medicae WSM419]
Length = 734
Score = 42.0 bits (97), Expect = 1.4, Method: Compositional matrix adjust.
Identities = 34/140 (24%), Positives = 63/140 (45%), Gaps = 32/140 (22%)
Query: 604 TGDKNFARAVWPSVYIAMAYMEQF-DKDGDGMIE----------NEGFPDQTYDAWSANG 652
TGD + R +WP++ A+ ++++F D+DGDG +E N+ + D + A+G
Sbjct: 398 TGDTDTVRGLWPNIVAALDWIDRFGDRDGDGFVEYGSRTAKGLVNQCWKDSHDSIFHADG 457
Query: 653 -----------VSAYCGGLWVAALQAASALANDVGDHASASYFWVRYQKAKAVYDSLWNG 701
V AY G W +AA+ L+ +G A R + + +D+ +
Sbjct: 458 RLAKGPVATAEVQAYIFGAW----RAAARLSRKLGHAEDALRLEQRAEDLRIRFDAAF-- 511
Query: 702 SYFNYDNSDGSSSTSIQADQ 721
+D G+ S ++ D+
Sbjct: 512 ----FDEELGTYSLALDGDK 527
>gi|408404211|ref|YP_006862194.1| glycogen debranching enzyme [Candidatus Nitrososphaera gargensis
Ga9.2]
gi|408364807|gb|AFU58537.1| putative glycogen debranching enzyme [Candidatus Nitrososphaera
gargensis Ga9.2]
Length = 863
Score = 42.0 bits (97), Expect = 1.5, Method: Compositional matrix adjust.
Identities = 70/312 (22%), Positives = 126/312 (40%), Gaps = 49/312 (15%)
Query: 575 INSYNLFNSSRWKDLNSKFVLQ-VYRDFV-ATGDKNFARAVWPSVYIAMAYMEQFDKDGD 632
I+ +++ + +S F+ R++V TGD + + W S+ + Y D DGD
Sbjct: 498 ISGKAFLHTTTYGSADSTFLFPWAIREYVLGTGDIGYLKKRWDSIVDLVNYGFLKDVDGD 557
Query: 633 GMIEN------EGFPDQTYDAW--------SANGVSAYCGGLWVAALQAASALANDVGDH 678
G+IE+ E P Q W SAN + A L+ +L+ S LA GD
Sbjct: 558 GLIEHGFTGTAEKLPIQD-STWMDHIDRRKSANDIQA----LFYESLRIGSELAAMAGDS 612
Query: 679 ASASYFWVRYQKAKAVYD-SLWN---GSYFNYDNSDGSSSTSIQADQLAGQWYARACGLL 734
+ + + + ++ + D WN G Y++ DGS SI+ + L LL
Sbjct: 613 ENEARWSSKAKELQETIDREYWNTSAGFYYDTIRKDGSKDPSIRPNAL--------VLLL 664
Query: 735 P--IADEAKVKKALTKIYDFNVLKVKGGMCGAMNGMQPDGRIDMSGLQAREIWPGVTYGL 792
+ + ++ L++I + ++ G D + + +WP VT
Sbjct: 665 SDVVRERSRADSVLSRIEEQDMTTA----WGVRTLSSKDQKYQPTLYHDGAVWPLVTGWA 720
Query: 793 AASMIQEEMVDMAFQTAAGVYEVAWSEDGL---GYSFQTPESWNNNDEYRSLCYMRPLTI 849
AAS I+ + A + + E E+G+ Y PE +N+ C ++ ++
Sbjct: 721 AASEIKFGRHEQALRYMGSMAERILHENGMFAETYRGDRPEPFNS-------CILQAWSV 773
Query: 850 WAMQWALTKPKL 861
+AL + L
Sbjct: 774 GMYAYALREMML 785
>gi|170060875|ref|XP_001865995.1| conserved hypothetical protein [Culex quinquefasciatus]
gi|167879232|gb|EDS42615.1| conserved hypothetical protein [Culex quinquefasciatus]
Length = 183
Score = 42.0 bits (97), Expect = 1.5, Method: Compositional matrix adjust.
Identities = 30/99 (30%), Positives = 42/99 (42%), Gaps = 29/99 (29%)
Query: 575 INSYNLFNSSRWKDLNSKFVLQVYRDFVATGDKNFARAVWPSVYI--------------- 619
+ +Y + + WKDLNSKF+LQVYRD+ +A +I
Sbjct: 54 VRAYPIHDVYEWKDLNSKFILQVYRDYYTQLKAEYAGKFSSIEFINKESFRNRMSRTERD 113
Query: 620 -------------AMAYMEQFDKDGDGMIENEGFPDQTY 645
A Y ++D D DG+IEN+ PD TY
Sbjct: 114 VLLEENVSGLQAGAGEYPLEWDMDNDGLIENKP-PDLTY 151
>gi|418404986|ref|ZP_12978419.1| amylo-alpha-16-glucosidase [Sinorhizobium meliloti CCNWSX0020]
gi|359501068|gb|EHK73697.1| amylo-alpha-16-glucosidase [Sinorhizobium meliloti CCNWSX0020]
Length = 734
Score = 42.0 bits (97), Expect = 1.6, Method: Compositional matrix adjust.
Identities = 35/140 (25%), Positives = 64/140 (45%), Gaps = 32/140 (22%)
Query: 604 TGDKNFARAVWPSVYIAMAYMEQF-DKDGDGMIE----------NEGFPDQTYDAWSANG 652
TGD + R +WP+V A+ ++++F D+DGDG +E N+ + D + A G
Sbjct: 398 TGDLDTVRNLWPNVVAALDWIDRFGDRDGDGFVEYGSRTAKGLVNQCWKDSHDSIFHAGG 457
Query: 653 -----------VSAYCGGLWVAALQAASALANDVGDHASASYFWVRYQKAKAVYDSLWNG 701
V AY G W QAA+ L+ +G A R ++ + +D+ +
Sbjct: 458 NLASGPIATAEVQAYVFGAW----QAAARLSRKLGHVEDALRLERRAEELRIRFDTAF-- 511
Query: 702 SYFNYDNSDGSSSTSIQADQ 721
+D+ G+ S ++ ++
Sbjct: 512 ----FDDELGTYSLALDGEK 527
>gi|16263864|ref|NP_436656.1| hypothetical protein SM_b20116 [Sinorhizobium meliloti 1021]
gi|334319999|ref|YP_004556628.1| amylo-alpha-16-glucosidase [Sinorhizobium meliloti AK83]
gi|384539727|ref|YP_005723811.1| hypothetical protein SM11_pD1478 [Sinorhizobium meliloti SM11]
gi|407724161|ref|YP_006843822.1| hypothetical protein BN406_06540 [Sinorhizobium meliloti Rm41]
gi|15139988|emb|CAC48516.1| CONSERVED HYPOTHETICAL PROTEIN [Sinorhizobium meliloti 1021]
gi|334097738|gb|AEG55748.1| Amylo-alpha-16-glucosidase [Sinorhizobium meliloti AK83]
gi|336038380|gb|AEH84310.1| conserved hypothetical protein [Sinorhizobium meliloti SM11]
gi|407324221|emb|CCM72822.1| hypothetical protein BN406_06540 [Sinorhizobium meliloti Rm41]
Length = 734
Score = 42.0 bits (97), Expect = 1.6, Method: Compositional matrix adjust.
Identities = 35/140 (25%), Positives = 64/140 (45%), Gaps = 32/140 (22%)
Query: 604 TGDKNFARAVWPSVYIAMAYMEQF-DKDGDGMIE----------NEGFPDQTYDAWSANG 652
TGD + R +WP+V A+ ++++F D+DGDG +E N+ + D + A G
Sbjct: 398 TGDLDTVRNLWPNVVAALDWIDRFGDRDGDGFVEYGSRTAKGLVNQCWKDSHDSIFHAGG 457
Query: 653 -----------VSAYCGGLWVAALQAASALANDVGDHASASYFWVRYQKAKAVYDSLWNG 701
V AY G W QAA+ L+ +G A R ++ + +D+ +
Sbjct: 458 NLASGPIATAEVQAYVFGAW----QAAARLSRKLGHVEDALRLERRAEELRIRFDTAF-- 511
Query: 702 SYFNYDNSDGSSSTSIQADQ 721
+D+ G+ S ++ ++
Sbjct: 512 ----FDDELGTYSLALDGEK 527
>gi|374372806|ref|ZP_09630467.1| hypothetical protein NiasoDRAFT_3452 [Niabella soli DSM 19437]
gi|373234882|gb|EHP54674.1| hypothetical protein NiasoDRAFT_3452 [Niabella soli DSM 19437]
Length = 872
Score = 41.6 bits (96), Expect = 1.9, Method: Compositional matrix adjust.
Identities = 58/310 (18%), Positives = 112/310 (36%), Gaps = 31/310 (10%)
Query: 64 KQEEAKGKVPVFNVFRKHHITSDQGIPLGGIGAGSIGRSYRGEFQRFKLFHGICDDAPVL 123
K+E + + +R H+ S P+GG+GAG G + ++ P +
Sbjct: 44 KEETPRTSRAYNSAYRDEHL-SRVAFPIGGLGAGMFCLEGSGAVSHMSI-----NNHPEI 97
Query: 124 ANQFSVF--VSRPNGEKFSSVL---CPRSPGVPKKNTDSGIESWDWNL-KGENCTYHALF 177
+ ++F ++ + + VL P ++ T +G ++ L + +N ++ A F
Sbjct: 98 FFEPNIFAAIAIKGAQPMAKVLEGPVPEWKIFGQRGTGNGASGTNYGLPRFDNASFLARF 157
Query: 178 PRAWTVYDGEPDPELRIVCRQISPFIPHNYKESSFPASVFTFTLSNSGQTSADVTLLFTW 237
P T+ + + + + SPFIP + S P +T +N + + +
Sbjct: 158 PFG-TIELKDNEMPMDVAINGWSPFIPGDEDNSGIPVGALEYTFANRSSKTVEAVFSYNA 216
Query: 238 ANSVAGDSGLSGHHFNSKTMTKDGVHGLTLHHRTANGRPPVTFAVAAEETADVHVSECPC 297
N VA K + +G L +P A TA+ C
Sbjct: 217 KNFVA---------RGEKAGIRKLENGFILSQGQGKDKPFTQTDFAIYTTANEQPVVDYC 267
Query: 298 FLLSGNSKGITAKDMWNEIKKHGSFDHLDNDKTSPSEPGSSIGAAIAASLTIPSGSTRSV 357
+ G +T +WN I +P + GA++ + G+ + +
Sbjct: 268 WFRGGWFDPVTM--IWNTIAD-------GKVSNTPPVAADAPGASLFVPFKLAPGAKKII 318
Query: 358 TFSLAWDCPE 367
+AW PE
Sbjct: 319 RVMMAWYVPE 328
>gi|433611718|ref|YP_007195179.1| Glycogen debranching enzyme [Sinorhizobium meliloti GR4]
gi|429556660|gb|AGA11580.1| Glycogen debranching enzyme [Sinorhizobium meliloti GR4]
Length = 734
Score = 41.6 bits (96), Expect = 2.0, Method: Compositional matrix adjust.
Identities = 35/140 (25%), Positives = 64/140 (45%), Gaps = 32/140 (22%)
Query: 604 TGDKNFARAVWPSVYIAMAYMEQF-DKDGDGMIE----------NEGFPDQTYDAWSANG 652
TGD + R +WP+V A+ ++++F D+DGDG +E N+ + D + A G
Sbjct: 398 TGDLDTVRNLWPNVVAALDWIDRFGDRDGDGFVEYGSRTAKGLVNQCWKDSHDSIFHAGG 457
Query: 653 -----------VSAYCGGLWVAALQAASALANDVGDHASASYFWVRYQKAKAVYDSLWNG 701
V AY G W QAA+ L+ +G A R ++ + +D+ +
Sbjct: 458 NLASGPIATAEVQAYVFGAW----QAAARLSRRLGHVEDALRLERRAEELRIRFDTAF-- 511
Query: 702 SYFNYDNSDGSSSTSIQADQ 721
+D+ G+ S ++ ++
Sbjct: 512 ----FDDELGTYSLALDGEK 527
>gi|373951743|ref|ZP_09611703.1| alpha-L-rhamnosidase [Mucilaginibacter paludis DSM 18603]
gi|373888343|gb|EHQ24240.1| alpha-L-rhamnosidase [Mucilaginibacter paludis DSM 18603]
Length = 1252
Score = 41.2 bits (95), Expect = 2.6, Method: Compositional matrix adjust.
Identities = 39/144 (27%), Positives = 68/144 (47%), Gaps = 35/144 (24%)
Query: 588 DLNSKFVLQVYRDFVATGDKNFARAVWPSVYIAMAYMEQ-FDKDGDGMIENEGFPDQTYD 646
D+N F+ ++ F TGD ++ + +WP + +A+ ++ FD DGDG+ YD
Sbjct: 442 DMNLVFIDELLNHFNWTGDLDYVKQMWPVIKRHLAWEKRNFDADGDGL----------YD 491
Query: 647 A----WSANGVSAYCGGLWVA------ALQAASALANDVGDHASASYFWVRYQKA----- 691
A W+++ + GG+ + A +AA+ LA +G+ A+ YQK
Sbjct: 492 AYCAIWASDALQYSGGGVTHSSAYNYRANKAAAQLAKLIGEDAAP------YQKEAEHIL 545
Query: 692 KAVYDSLW---NGSYFNYDNSDGS 712
A+ +LW G Y Y + G+
Sbjct: 546 NAINKTLWMPQTGQYAEYQDLLGN 569
>gi|398384505|ref|ZP_10542535.1| glycogen debranching enzyme [Sphingobium sp. AP49]
gi|397722664|gb|EJK83200.1| glycogen debranching enzyme [Sphingobium sp. AP49]
Length = 738
Score = 41.2 bits (95), Expect = 2.7, Method: Compositional matrix adjust.
Identities = 34/118 (28%), Positives = 52/118 (44%), Gaps = 26/118 (22%)
Query: 604 TGDKNFARAVWPSVYIAMAYMEQF-DKDGDGMIE----------NEGFPDQTYDAWSANG 652
TGD F A+ P V A+A+++ D+DGDG +E N+G+ D + A+G
Sbjct: 402 TGDAAFLAAILPHVEAALAWIDAHGDRDGDGFVEYGRLTDQGLQNQGWKDSHDSIFHADG 461
Query: 653 -----------VSAYCGGLWVAALQAASALANDVGDHASASYFWVRYQKAKAVYDSLW 699
V AY G W QAA A+ VG A+ + R + +D+ +
Sbjct: 462 SIARGPIALAEVQAYVYGAW----QAAGAIFAAVGQEDKATGYRARADDLRTRFDAAF 515
>gi|294012573|ref|YP_003546033.1| amylo-alpha-1,6-glucosidase [Sphingobium japonicum UT26S]
gi|292675903|dbj|BAI97421.1| amylo-alpha-1,6-glucosidase [Sphingobium japonicum UT26S]
Length = 749
Score = 41.2 bits (95), Expect = 2.8, Method: Compositional matrix adjust.
Identities = 32/118 (27%), Positives = 53/118 (44%), Gaps = 26/118 (22%)
Query: 604 TGDKNFARAVWPSVYIAMAYM-EQFDKDGDGMIE----------NEGFPDQTYDAWSANG 652
TGD F ++ P V A++++ E D+DGDG +E N+G+ D + A+G
Sbjct: 413 TGDSGFIASILPHVEAALSWIDEHGDRDGDGFVEYGRLTDQGLVNQGWKDSHDSVFHADG 472
Query: 653 -----------VSAYCGGLWVAALQAASALANDVGDHASASYFWVRYQKAKAVYDSLW 699
V AY G W +AA A+ +GD A + +R + +D+ +
Sbjct: 473 GIARGPIALAEVQAYVYGAW----RAAEAIFTTLGDPDKARTYRMRADDLRERFDAAF 526
>gi|333978445|ref|YP_004516390.1| glycoside hydrolase 15-like protein [Desulfotomaculum kuznetsovii
DSM 6115]
gi|333821926|gb|AEG14589.1| glycoside hydrolase 15-related protein [Desulfotomaculum
kuznetsovii DSM 6115]
Length = 671
Score = 41.2 bits (95), Expect = 3.0, Method: Compositional matrix adjust.
Identities = 28/106 (26%), Positives = 45/106 (42%), Gaps = 11/106 (10%)
Query: 598 YRDFVATGDKNFARAVWPSVYIAMAYMEQFDKDGDGMIENEGFPDQTYDAWSANGV-SAY 656
+ F T D+ F + +WPS ++ AY+ + + G P+ D W N + S Y
Sbjct: 381 HHHFKLTRDREFLQKIWPSTFLGAAYLVEH-------LSPNGLPEPGLDLWEDNFIQSTY 433
Query: 657 CGGLWVAALQAASALANDVGDHASASYFWVRYQKA--KAVYDSLWN 700
L+ A+ALA +G A W + K++ LWN
Sbjct: 434 AAAAIYGGLRGAAALAGSIGAPEKAKK-WSEAAETVRKSILTHLWN 478
>gi|384533984|ref|YP_005716648.1| amylo-alpha-16-glucosidase [Sinorhizobium meliloti BL225C]
gi|333816160|gb|AEG08827.1| Amylo-alpha-16-glucosidase [Sinorhizobium meliloti BL225C]
Length = 734
Score = 41.2 bits (95), Expect = 3.0, Method: Compositional matrix adjust.
Identities = 35/136 (25%), Positives = 62/136 (45%), Gaps = 32/136 (23%)
Query: 604 TGDKNFARAVWPSVYIAMAYMEQF-DKDGDGMIE----------NEGFPDQTYDAWSANG 652
TGD + R +WP+V A+ ++++F D+DGDG +E N+ + D + A G
Sbjct: 398 TGDLDTVRNLWPNVVAALDWIDRFGDRDGDGFVEYGSRTAKGLVNQCWKDSHDSIFHAGG 457
Query: 653 -----------VSAYCGGLWVAALQAASALANDVGDHASASYFWVRYQKAKAVYDSLWNG 701
V AY G W QAA+ L+ +G A R ++ + +D+ +
Sbjct: 458 NLASGPIATAEVQAYVFGAW----QAAARLSRKLGHVEDALRLERRAEELRIRFDTAF-- 511
Query: 702 SYFNYDNSDGSSSTSI 717
+D+ G+ S ++
Sbjct: 512 ----FDDELGTYSLAL 523
>gi|330507058|ref|YP_004383486.1| glycogen debranching protein [Methanosaeta concilii GP6]
gi|328927866|gb|AEB67668.1| glycogen debranching enzyme, archaeal type, putative [Methanosaeta
concilii GP6]
Length = 621
Score = 40.8 bits (94), Expect = 3.5, Method: Compositional matrix adjust.
Identities = 42/163 (25%), Positives = 67/163 (41%), Gaps = 37/163 (22%)
Query: 532 DFAAAVMMHDPGTMKIMSDGKWVARKCL----------GAVPHDIGLDDPWFEINSYNLF 581
D+ M+ PG + +S G++ A + + G +P+D+G SYN
Sbjct: 288 DWGRDAMIALPGIL--LSTGRYEAARLVLSSFAGAMKDGVLPNDLG-------ARSYNTV 338
Query: 582 NSSRWKDLNSKFVLQVYRDFVATGDKNFARAVWPSVYIAMAYM----EQFDKDGDGMIEN 637
++S W FV R F ATGD +F +WP + + + F D DG+I N
Sbjct: 339 DASLW------FVQATSRYFDATGDLDFLAELWPKLLDVVERYSRPGDDFGADEDGLI-N 391
Query: 638 EG----FPDQTYDAWSANGVSAYC---GGLWVAALQAASALAN 673
G + D D + C LW +AL+ +L+
Sbjct: 392 SGPALTWMDARVDGRPVTSRTGKCCEINALWYSALRRTESLSR 434
>gi|390166485|ref|ZP_10218748.1| amylo-alpha-1,6-glucosidase [Sphingobium indicum B90A]
gi|389590882|gb|EIM68867.1| amylo-alpha-1,6-glucosidase [Sphingobium indicum B90A]
Length = 749
Score = 40.8 bits (94), Expect = 3.6, Method: Compositional matrix adjust.
Identities = 32/118 (27%), Positives = 52/118 (44%), Gaps = 26/118 (22%)
Query: 604 TGDKNFARAVWPSVYIAMAYM-EQFDKDGDGMIE----------NEGFPDQTYDAWSANG 652
TGD F ++ P V A++++ E D+DGDG +E N+G+ D + A+G
Sbjct: 413 TGDSGFIASILPHVEAALSWIDEHGDRDGDGFVEYGRLTDQGLVNQGWKDSHDSVFHADG 472
Query: 653 -----------VSAYCGGLWVAALQAASALANDVGDHASASYFWVRYQKAKAVYDSLW 699
V AY G W +AA A+ +GD A + R + +D+ +
Sbjct: 473 GIARGPIALAEVQAYVYGAW----RAAEAIFTTLGDPDKARAYRTRADDLRERFDAAF 526
>gi|323495701|ref|ZP_08100771.1| glycogen debranching enzyme [Vibrio sinaloensis DSM 21326]
gi|323319168|gb|EGA72109.1| glycogen debranching enzyme [Vibrio sinaloensis DSM 21326]
Length = 924
Score = 40.4 bits (93), Expect = 4.2, Method: Compositional matrix adjust.
Identities = 34/105 (32%), Positives = 48/105 (45%), Gaps = 10/105 (9%)
Query: 604 TGDKNFARAVWPSVYIAMAYMEQFDKDGDGMIENEGFPDQTYDA-------WSANGVSAY 656
+GD +FA+ ++P V +A +E+ D DG+I++ PD DA WS G A
Sbjct: 498 SGDIDFAKRIYPVVQRFIAGVERNYLDQDGLIKHR-HPDTWMDAKIDGQIPWSPRGPKAN 556
Query: 657 -CGGLWVAALQAASALANDVGDHASASYFWVRYQKAKAVY-DSLW 699
LW AL A LA D A + Q+AK + D W
Sbjct: 557 DIQALWFDALHVAQTLAQLNQDDVYADHVVSLAQQAKQSFIDKFW 601
>gi|257052439|ref|YP_003130272.1| glycosyltransferase 36 [Halorhabdus utahensis DSM 12940]
gi|256691202|gb|ACV11539.1| glycosyltransferase 36 [Halorhabdus utahensis DSM 12940]
Length = 815
Score = 40.4 bits (93), Expect = 4.4, Method: Compositional matrix adjust.
Identities = 34/124 (27%), Positives = 56/124 (45%), Gaps = 17/124 (13%)
Query: 637 NEGFPDQTYDAWSANGVSAYCGGLWVAALQAASALA--NDVGDHASASYFWVRYQKAKAV 694
+E F +D S + GG +V ALQ + LA D+ ++ Y + A+AV
Sbjct: 497 DESFQTAEHDVEEERAESVFIGGQFVYALQELAELAEQTDILPESAEEYQGYADEMAEAV 556
Query: 695 YDSLWNGSYF-----NYDNSDGSSSTSIQADQLAGQWYARACGLLPIA----DEAKVKKA 745
D+ W+G +F +Y + GSS GQ + + G+ +A DE KV+ A
Sbjct: 557 ADAGWDGEWFRRAYDHYSDPIGSSEND------EGQIFVESNGMCGMAGIGRDEGKVQDA 610
Query: 746 LTKI 749
+ +
Sbjct: 611 MDSV 614
>gi|240138186|ref|YP_002962658.1| hypothetical protein MexAM1_META1p1519 [Methylobacterium extorquens
AM1]
gi|418061390|ref|ZP_12699252.1| Amylo-alpha-16-glucosidase [Methylobacterium extorquens DSM 13060]
gi|240008155|gb|ACS39381.1| conserved hypothetical protein, putative glycosyltransferase
[Methylobacterium extorquens AM1]
gi|373565054|gb|EHP91121.1| Amylo-alpha-16-glucosidase [Methylobacterium extorquens DSM 13060]
Length = 778
Score = 40.4 bits (93), Expect = 4.8, Method: Compositional matrix adjust.
Identities = 31/103 (30%), Positives = 51/103 (49%), Gaps = 18/103 (17%)
Query: 593 FVLQVYRDFVATGDKNFARAVWPSVYIAMAYMEQF-DKDGDGMIE----------NEGFP 641
FV+ + TGD + RA+WP + +A+ +++ + D+DGDG +E N+G+
Sbjct: 431 FVMLAAQYHEVTGDLDTIRAIWPQLKLALEWIDTYGDRDGDGFVEYARETEQGLANQGWK 490
Query: 642 DQ-----TYDAWSANGVSAYC--GGLWVAALQAASALANDVGD 677
D D A G A C G AA + A+ LA+ +G+
Sbjct: 491 DSHDSIFHADGAMAKGPIALCEVQGYVYAAKRGAARLASMLGE 533
>gi|218529882|ref|YP_002420698.1| amylo-alpha-16-glucosidase [Methylobacterium extorquens CM4]
gi|218522185|gb|ACK82770.1| Amylo-alpha-16-glucosidase [Methylobacterium extorquens CM4]
Length = 778
Score = 40.4 bits (93), Expect = 4.8, Method: Compositional matrix adjust.
Identities = 31/103 (30%), Positives = 51/103 (49%), Gaps = 18/103 (17%)
Query: 593 FVLQVYRDFVATGDKNFARAVWPSVYIAMAYMEQF-DKDGDGMIE----------NEGFP 641
FV+ + TGD + RA+WP + +A+ +++ + D+DGDG +E N+G+
Sbjct: 431 FVMLAAQYHEVTGDLDTIRAIWPQLKLALEWIDTYGDRDGDGFVEYARETEQGLANQGWK 490
Query: 642 DQ-----TYDAWSANGVSAYC--GGLWVAALQAASALANDVGD 677
D D A G A C G AA + A+ LA+ +G+
Sbjct: 491 DSHDSIFHADGAMAKGPIALCEVQGYVYAAKRGAARLASMLGE 533
>gi|254560746|ref|YP_003067841.1| hypothetical protein METDI2293 [Methylobacterium extorquens DM4]
gi|254268024|emb|CAX23895.1| conserved hypothetical protein, putative glycosyltransferase
[Methylobacterium extorquens DM4]
Length = 778
Score = 40.4 bits (93), Expect = 5.1, Method: Compositional matrix adjust.
Identities = 31/103 (30%), Positives = 51/103 (49%), Gaps = 18/103 (17%)
Query: 593 FVLQVYRDFVATGDKNFARAVWPSVYIAMAYMEQF-DKDGDGMIE----------NEGFP 641
FV+ + TGD + RA+WP + +A+ +++ + D+DGDG +E N+G+
Sbjct: 431 FVMLAAQYHEVTGDLDTIRAIWPQLKLALEWIDTYGDRDGDGFVEYARETEQGLANQGWK 490
Query: 642 DQ-----TYDAWSANGVSAYC--GGLWVAALQAASALANDVGD 677
D D A G A C G AA + A+ LA+ +G+
Sbjct: 491 DSHDSIFHADGAMAKGPIALCEVQGYVYAAKRGAARLASMLGE 533
>gi|403382326|ref|ZP_10924383.1| alpha-L-rhamnosidase [Paenibacillus sp. JC66]
Length = 1003
Score = 40.0 bits (92), Expect = 5.5, Method: Compositional matrix adjust.
Identities = 33/116 (28%), Positives = 54/116 (46%), Gaps = 5/116 (4%)
Query: 604 TGDKNFARAVWPSV-YIAMAYMEQFDKDGDGMIENEGFPDQTYDAWSANGVSAYCGGLWV 662
TGD +AR++WP V Y Y+E+ D+ G ++ F D +GV ++ V
Sbjct: 692 TGDTGYARSIWPHVRYTLEHYLERLDERGLLAMKAWNFLDWAPMDQPRSGVVSHQNMFLV 751
Query: 663 AALQAASALANDVGDHASASYFWVRYQKAK-AVYDSLWN---GSYFNYDNSDGSSS 714
AL++A+ LA G F ++ K A+ LW+ +Y + + DG S
Sbjct: 752 KALRSAALLAELAGQAEEGDGFLDHAERLKQAIGQHLWSEERKAYLDCIHEDGRRS 807
>gi|337288158|ref|YP_004627630.1| alpha-L-rhamnosidase [Thermodesulfobacterium sp. OPB45]
gi|334901896|gb|AEH22702.1| alpha-L-rhamnosidase [Thermodesulfobacterium geofontis OPF15]
Length = 707
Score = 40.0 bits (92), Expect = 6.0, Method: Compositional matrix adjust.
Identities = 29/112 (25%), Positives = 52/112 (46%), Gaps = 17/112 (15%)
Query: 588 DLNSKFVLQVYRDFVATGDKNFARAVWPSVYIAMAYMEQF-DKDGDGMIE---------N 637
D FV+ + F TGD +F +++W ++ +A+++++ + D DGDG +E N
Sbjct: 358 DATPLFVILASKYFERTGDIDFIKSIWKNIVLAISWIDNYGDVDGDGFVEYFPSKKGLTN 417
Query: 638 EGFPDQTYDAWSANGVSAY-------CGGLWVAALQAASALANDVGDHASAS 682
+G+ D + +G G A + AS LA +G+ AS
Sbjct: 418 KGWKDSEDSIFYEDGTLVKPPVALVEVQGYVYKAKREASKLARILGEKELAS 469
>gi|261368895|ref|ZP_05981778.1| cellulose degradation product phosphorylase [Subdoligranulum
variabile DSM 15176]
gi|282568993|gb|EFB74528.1| glycosyltransferase family 36 [Subdoligranulum variabile DSM 15176]
Length = 803
Score = 40.0 bits (92), Expect = 6.3, Method: Compositional matrix adjust.
Identities = 46/174 (26%), Positives = 67/174 (38%), Gaps = 25/174 (14%)
Query: 597 VYRDFVATGDKNFARAVWP-------SVYIAMAYMEQFDKDGDGMIENEGFPDQTYDAWS 649
VY+ TGDK F V P +VY + QF D G G P Y W+
Sbjct: 438 VYKYVAETGDKAFLDEVIPFANRDEGTVYEHLKRAVQFSLDHLG---PHGLPAGLYADWN 494
Query: 650 ------ANGVSAYCGGLWVAALQAASALANDVGDHASASYFWVRY-QKAKAVYDSLWNGS 702
ANG S + + AL+ A D GD A+ + A+ + W+G
Sbjct: 495 DCLRLGANGESTFVALQFYYALEILRRFAADRGDEAACRHLEATMADTARKIQSLCWDGD 554
Query: 703 YFNYDNSDGSSSTSIQADQLAGQW-----YARACGLLPIADEAKVKKALTKIYD 751
F ++ AD A W +A GL AD+ + +A+ ++YD
Sbjct: 555 RFIRGFTEEGQRIGAAADPEANLWLNPQSWAVISGL---ADKDQADRAMAQVYD 605
>gi|441502511|ref|ZP_20984521.1| Glycogen debranching enzyme [Photobacterium sp. AK15]
gi|441429689|gb|ELR67141.1| Glycogen debranching enzyme [Photobacterium sp. AK15]
Length = 935
Score = 40.0 bits (92), Expect = 6.7, Method: Compositional matrix adjust.
Identities = 41/157 (26%), Positives = 70/157 (44%), Gaps = 19/157 (12%)
Query: 578 YNLFNSSRWKDLNSKFVLQVYRDFVATGDKNFARAVWPSVYIAMAYMEQFDKDGDGMIEN 637
YN + + W V +V+ +GDK++AR+++P V + +E+ D G++ +
Sbjct: 489 YNTTDGTPW------MVREVWDYLRYSGDKDYARSIYPVVKTYIDGIEKNYLDQHGLMTH 542
Query: 638 EGFPDQTYDA-------WSANGVSAY-CGGLWVAALQAASALANDVGDHASASYFWVRYQ 689
PD DA WSA G A LW +L+ A LA D ++ +
Sbjct: 543 RD-PDTWMDAKLEGKLPWSARGNRANDIQALWYTSLKVAVELAKLNQDGEGEQHYLAMAE 601
Query: 690 KAKAVYDSL-WNGS---YFNYDNSDGSSSTSIQADQL 722
K +A + L W+G + ++D S I+ +QL
Sbjct: 602 KVQASFQQLFWDGELSLLADRLHADDSRDLQIRPNQL 638
>gi|373456251|ref|ZP_09548018.1| Amylo-alpha-16-glucosidase [Caldithrix abyssi DSM 13497]
gi|371717915|gb|EHO39686.1| Amylo-alpha-16-glucosidase [Caldithrix abyssi DSM 13497]
Length = 1123
Score = 39.7 bits (91), Expect = 7.6, Method: Compositional matrix adjust.
Identities = 24/76 (31%), Positives = 37/76 (48%), Gaps = 4/76 (5%)
Query: 603 ATGDKNFARAVWPSVYIAMAYMEQFDKDGDGMIENEGFPDQTYDA---WSANGVSAYCGG 659
A+GD F + W + A+ ++ D DGD +IEN+ + W AN + Y G
Sbjct: 666 ASGDLTFIQQSWSHIKRAIDFLYSTDTDGDMLIENKNVGHGWVEGGKLWGANP-TFYLAG 724
Query: 660 LWVAALQAASALANDV 675
LW L+ A+ LA +
Sbjct: 725 LWAQTLKDAAYLAKKL 740
>gi|163851052|ref|YP_001639095.1| amylo-alpha-16-glucosidase [Methylobacterium extorquens PA1]
gi|163662657|gb|ABY30024.1| Amylo-alpha-16-glucosidase [Methylobacterium extorquens PA1]
Length = 778
Score = 39.7 bits (91), Expect = 7.6, Method: Compositional matrix adjust.
Identities = 31/103 (30%), Positives = 50/103 (48%), Gaps = 18/103 (17%)
Query: 593 FVLQVYRDFVATGDKNFARAVWPSVYIAMAYMEQF-DKDGDGMIE----------NEGFP 641
FV+ + TGD RA+WP + +A+ +++ + D+DGDG +E N+G+
Sbjct: 431 FVMLAAQYHEVTGDLETIRAIWPQLKLALEWIDTYGDRDGDGFVEYARETEQGLANQGWK 490
Query: 642 DQ-----TYDAWSANGVSAYC--GGLWVAALQAASALANDVGD 677
D D A G A C G AA + A+ LA+ +G+
Sbjct: 491 DSHDSIFHADGAMAKGPIALCEVQGYVYAAKRGAARLASMLGE 533
>gi|40850651|gb|AAR96047.1| alpha-L-rhamnosidase B [Thermomicrobia bacterium PRI-1686]
Length = 954
Score = 39.7 bits (91), Expect = 8.2, Method: Compositional matrix adjust.
Identities = 36/143 (25%), Positives = 58/143 (40%), Gaps = 2/143 (1%)
Query: 604 TGDKNFARAVWPSV-YIAMAYMEQFDKDGDGMIENEGFPDQTYDAWSANGVSAYCGGLWV 662
TGD F + +WP + Y Y++ + DG I D +GV + +V
Sbjct: 647 TGDLAFVQDIWPDIQYTLDHYLQHINDDGLLEISAWNLLDWAPIDQPNSGVVTHQNCFFV 706
Query: 663 AALQAASALANDVGDHASASYFWVRYQKAKAVYDSLWNGSYFNYDNSDGSSSTSIQADQL 722
AL+ A L GD + Y + A A+ LW+ + Y +S + ST +
Sbjct: 707 RALKDADELGQSAGDETAGRYAERARELAAAINTHLWSDEHKAYIDSIHADSTRSSVISM 766
Query: 723 AGQWYARACGLLPIADEAKVKKA 745
Q A G+ D A+V ++
Sbjct: 767 QTQVVALLTGVAE-GDRAEVVRS 788
>gi|373457216|ref|ZP_09548983.1| Amylo-alpha-16-glucosidase [Caldithrix abyssi DSM 13497]
gi|371718880|gb|EHO40651.1| Amylo-alpha-16-glucosidase [Caldithrix abyssi DSM 13497]
Length = 976
Score = 39.3 bits (90), Expect = 9.3, Method: Compositional matrix adjust.
Identities = 43/195 (22%), Positives = 89/195 (45%), Gaps = 30/195 (15%)
Query: 557 KCLGAVPHDIGLDDPWFEINSYNLFNSSRWKDLNSKFVLQVYRDFVATGDKNFARAVWPS 616
+ LG +P+ I ++ YN + + W F+ +Y F+ TGD++F + ++P
Sbjct: 316 RELGRIPNRITNNE-----TIYNTADGTWW------FIRAIYEYFLYTGDQSFLQEMFPY 364
Query: 617 VYIAM--AYMEQFD-------KDGDGMIENEGFPDQTYDAWSANGVSAY-CGGLWVAALQ 666
V A+ A ++ D +D + ++ G + WS G A LW ALQ
Sbjct: 365 VRRALEGAIGKRVDELGFLTHRDAETWMDAVG----SEGPWSPRGNRAVEIQALWYTALQ 420
Query: 667 AASALANDV-GDHASASYFWVRYQKAKAVYDS-LWN---GSYFNYDNSDGSSSTSIQADQ 721
+ +A ++ G+ + + + +K K ++ WN + +++ N+DG++ I+ +Q
Sbjct: 421 IGAQMAKNIEGEDSQGQKWLLLSRKVKDHFNRYYWNETRQALYDHLNADGTADQQIRPNQ 480
Query: 722 LAGQWYARACGLLPI 736
+ G+ P+
Sbjct: 481 MFAVTVPDLPGIEPL 495
>gi|94500338|ref|ZP_01306871.1| Predicted peptidoglycan hydrolase, FlgJ family protein
[Oceanobacter sp. RED65]
gi|94427637|gb|EAT12614.1| Predicted peptidoglycan hydrolase, FlgJ family protein
[Oceanobacter sp. RED65]
Length = 251
Score = 39.3 bits (90), Expect = 9.7, Method: Composition-based stats.
Identities = 37/170 (21%), Positives = 72/170 (42%), Gaps = 18/170 (10%)
Query: 341 AAIAASLTIPSGSTRSVTFSLAWDCPEVKFFEKVYHRRYTKFYGTLGDSAARIAHDAILE 400
++ A P+ + + F + P+ + +E V ++ F L ++A++ ILE
Sbjct: 17 SSFTAGYLFPTNTEKKYKFEVE-TMPDFRQYENVQKKKQDFFNFLL--PRVKVANENILE 73
Query: 401 HAKWECEIEAWQRPILEDKRFPEWYPITLFNELYYLNAGGTIWTDGSPPMQSLATIRERK 460
+W E++ +++R E E Y + D + L T+ +R+
Sbjct: 74 ERRWLLELDLSNLSDDQEQRLNE------LAEKYEV--------DDYENSEQLFTVLKRR 119
Query: 461 FSLDTPSKIHTPASSDTALGT-RLLENGEENIGQFLYLEGAEYVMYNTYD 509
+ PS + A++++A GT R G GQ+ Y+EG + N D
Sbjct: 120 IDVLPPSMVLAQAANESAWGTSRFARKGNNLFGQWCYVEGCGLIPKNRND 169
Database: nr
Posted date: Mar 3, 2013 10:45 PM
Number of letters in database: 999,999,864
Number of sequences in database: 2,912,245
Database: /local_scratch/syshi//blastdatabase/nr.01
Posted date: Mar 3, 2013 10:52 PM
Number of letters in database: 999,999,666
Number of sequences in database: 2,912,720
Database: /local_scratch/syshi//blastdatabase/nr.02
Posted date: Mar 3, 2013 10:58 PM
Number of letters in database: 999,999,938
Number of sequences in database: 3,014,250
Database: /local_scratch/syshi//blastdatabase/nr.03
Posted date: Mar 3, 2013 11:03 PM
Number of letters in database: 999,999,780
Number of sequences in database: 2,805,020
Database: /local_scratch/syshi//blastdatabase/nr.04
Posted date: Mar 3, 2013 11:08 PM
Number of letters in database: 999,999,551
Number of sequences in database: 2,816,253
Database: /local_scratch/syshi//blastdatabase/nr.05
Posted date: Mar 3, 2013 11:13 PM
Number of letters in database: 999,999,897
Number of sequences in database: 2,981,387
Database: /local_scratch/syshi//blastdatabase/nr.06
Posted date: Mar 3, 2013 11:18 PM
Number of letters in database: 999,999,649
Number of sequences in database: 2,911,476
Database: /local_scratch/syshi//blastdatabase/nr.07
Posted date: Mar 3, 2013 11:24 PM
Number of letters in database: 999,999,452
Number of sequences in database: 2,920,260
Database: /local_scratch/syshi//blastdatabase/nr.08
Posted date: Mar 3, 2013 11:25 PM
Number of letters in database: 64,230,274
Number of sequences in database: 189,558
Lambda K H
0.318 0.134 0.419
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 15,894,215,217
Number of Sequences: 23463169
Number of extensions: 700295105
Number of successful extensions: 1444822
Number of sequences better than 100.0: 545
Number of HSP's better than 100.0 without gapping: 406
Number of HSP's successfully gapped in prelim test: 139
Number of HSP's that attempted gapping in prelim test: 1440787
Number of HSP's gapped (non-prelim): 1253
length of query: 920
length of database: 8,064,228,071
effective HSP length: 152
effective length of query: 768
effective length of database: 8,792,793,679
effective search space: 6752865545472
effective search space used: 6752865545472
T: 11
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)
S2: 82 (36.2 bits)