BLASTP 2.2.26 [Sep-21-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= 003095
(848 letters)
Database: nr
23,463,169 sequences; 8,064,228,071 total letters
Searching..................................................done
>gi|255578884|ref|XP_002530296.1| beta-galactosidase, putative [Ricinus communis]
gi|223530194|gb|EEF32103.1| beta-galactosidase, putative [Ricinus communis]
Length = 842
Score = 1449 bits (3751), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 699/848 (82%), Positives = 764/848 (90%), Gaps = 6/848 (0%)
Query: 1 MASKEILLLVLCWGFVVLATTSFGANVTYDHRAVVIGGKRRVLISGSIHYPRSTPEMWPD 60
M KEIL++ VVLA TSF ANVTYDHRA++I GKRRVLISGSIHYPRSTPEMWP
Sbjct: 1 MGRKEILVVFFF--SVVLAETSFAANVTYDHRALLIDGKRRVLISGSIHYPRSTPEMWPG 58
Query: 61 LIQKSKDGGLDVIETYVFWNLHEPVRNQYNFEGRYDLVKFVKLVAEAGLYAHLRIGPYVC 120
LIQKSKDGGLDVIETYVFWN HEPVRNQYNFEGRYDLVKFVKLVAEAGLY H+RIGPYVC
Sbjct: 59 LIQKSKDGGLDVIETYVFWNGHEPVRNQYNFEGRYDLVKFVKLVAEAGLYVHIRIGPYVC 118
Query: 121 AEWNFGGFPLWLHFIPGIQFRTDNEPFKAEMQRFTAKIVDMMKQEKLYASQGGPIILSQI 180
AEWN+GGFPLWLHFIPGI+FRTDNEPFKAEMQRFTAKIVDMMKQEKLYASQGGPIILSQI
Sbjct: 119 AEWNYGGFPLWLHFIPGIKFRTDNEPFKAEMQRFTAKIVDMMKQEKLYASQGGPIILSQI 178
Query: 181 ENEYGNIDSAYGAAGKSYIKWAAGMALSLDTGVPWVMCQQSDAPDPIINTCNGFYCDQFT 240
ENEYGNIDSA+G A K+YI WAAGMA+SLDTGVPWVMCQQ+DAPDP+INTCNGFYCDQFT
Sbjct: 179 ENEYGNIDSAFGPAAKTYINWAAGMAISLDTGVPWVMCQQADAPDPVINTCNGFYCDQFT 238
Query: 241 PNSNNKPKMWTENWSGWFLSFGGAVPYRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNFD 300
PNS NKPKMWTENWSGWF SFGGAVPYRPVEDLAFAVARF+Q GTFQNYYMYHGGTNF
Sbjct: 239 PNSKNKPKMWTENWSGWFQSFGGAVPYRPVEDLAFAVARFYQLSGTFQNYYMYHGGTNFG 298
Query: 301 RTSGGPFISTSYDYDAPLDEYGLIRQPKWGHLKDLHKAIKLCEAALVATDPTYPSLGPNL 360
RT+GGPFISTSYDYDAPLDEYGL+RQPKWGHLKD+HKAIKLCE AL+ATDPT SLG NL
Sbjct: 299 RTTGGPFISTSYDYDAPLDEYGLLRQPKWGHLKDVHKAIKLCEEALIATDPTTTSLGSNL 358
Query: 361 EATVYKTGSGLCSAFLANIGTNSDVTVKFNGNSYLLPAWSVSILPDCKNVVFNTAKINSV 420
EATVYKTGS LC+AFLANI T +D TV FNGNSY LPAWSVSILPDCKNV NTAKINSV
Sbjct: 359 EATVYKTGS-LCAAFLANIAT-TDKTVTFNGNSYNLPAWSVSILPDCKNVALNTAKINSV 416
Query: 421 TLVPSFSRQSLQVAADSSDAIGSGWSYINEPVGISKDDAFTKPGLLEQINTTADQSDYLW 480
T+VPSF+RQSL DSS AIGSGWS+INEPVGISK+DAF K GLLEQINTTAD+SDYLW
Sbjct: 417 TIVPSFARQSLVGDVDSSKAIGSGWSWINEPVGISKNDAFVKSGLLEQINTTADKSDYLW 476
Query: 481 YSLSTNIKADEPLLEDGSKTVLHVQSLGHALHAFINGKLVGSGYGSSSNAKVTVDFPIAL 540
YSLSTNIK DEP LEDGS+TVLHV+SLGHALHAFINGKL GSG G SSNAKVTVD PI L
Sbjct: 477 YSLSTNIKGDEPFLEDGSQTVLHVESLGHALHAFINGKLAGSGTGKSSNAKVTVDIPITL 536
Query: 541 APGKNTFDLLSLTVGLQNYGAFYEKTGAGITGPVQLKGSGNGTNIDLSSQQWTYQTGLKG 600
PGKNT DLLSLTVGLQNYGAFYE TGAGITGPV+LK NG +DLSSQQWTYQ GLKG
Sbjct: 537 TPGKNTIDLLSLTVGLQNYGAFYELTGAGITGPVKLKAQ-NGNTVDLSSQQWTYQIGLKG 595
Query: 601 EELNFPSGSSTQWDSKSTLPKLQPLVWYKTTFDAPAGSEPVAIDFTGMGKGEAWVNGQSI 660
E+ SGSS++W S+ TLPK QPL+WYKT+FDAPAG++PVAIDFTGMGKGEAWVNGQSI
Sbjct: 596 EDSGISSGSSSEWVSQPTLPKNQPLIWYKTSFDAPAGNDPVAIDFTGMGKGEAWVNGQSI 655
Query: 661 GRYWPTYVSQNGGCTDSCNYRGAYSSNKCLKNCGKPSQSLYHVPRSWLKSSGNTLVLFEE 720
GRYWPT VS + GC DSCNYRG YSSNKCLKNCGKPSQ+ YH+PRSW+KSSGN LVL EE
Sbjct: 656 GRYWPTNVSPSSGCADSCNYRGGYSSNKCLKNCGKPSQTFYHIPRSWIKSSGNILVLLEE 715
Query: 721 IGGDPTKISFVTKQLGSSLCSHVTDSHPLPVDMWGSDSKIQRKPGPVLSLECPNPNQVIS 780
IGGDPT+I+F T+Q+G SLCSHV++SHP PVDMW +DS+ ++ GPVLSL+CP+P++VIS
Sbjct: 716 IGGDPTQIAFATRQVG-SLCSHVSESHPQPVDMWNTDSEGGKRSGPVLSLQCPHPDKVIS 774
Query: 781 SIKFASFGTPLGTCGSFSRGRCSSARSLSVVRQACVGSKSCSIGVSVNTFGDPCKGVMKS 840
SIKFASFGTP G+CGS+S G+CSS +LS+V++ACVGSKSC++GVS+NTFGDPC+GV KS
Sbjct: 775 SIKFASFGTPHGSCGSYSHGKCSSTSALSIVQKACVGSKSCNVGVSINTFGDPCRGVKKS 834
Query: 841 LAVEASCT 848
LAVEASCT
Sbjct: 835 LAVEASCT 842
>gi|359478691|ref|XP_002285084.2| PREDICTED: beta-galactosidase 8-like [Vitis vinifera]
gi|297746241|emb|CBI16297.3| unnamed protein product [Vitis vinifera]
Length = 846
Score = 1437 bits (3721), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 679/842 (80%), Positives = 748/842 (88%), Gaps = 1/842 (0%)
Query: 7 LLLVLCWGFVVLATTSFGANVTYDHRAVVIGGKRRVLISGSIHYPRSTPEMWPDLIQKSK 66
+ VL +ATTSF + VTYDHRA+VI GKRRVLISGSIHYPRSTP+MWPDLIQKSK
Sbjct: 6 FVFVLVSLLGAIATTSFASTVTYDHRALVIDGKRRVLISGSIHYPRSTPDMWPDLIQKSK 65
Query: 67 DGGLDVIETYVFWNLHEPVRNQYNFEGRYDLVKFVKLVAEAGLYAHLRIGPYVCAEWNFG 126
DGGLDVIETYVFWNLHEPVR QY+F+GR DLVKFVK VAEAGLY HLRIGPYVCAEWN+G
Sbjct: 66 DGGLDVIETYVFWNLHEPVRRQYDFKGRNDLVKFVKTVAEAGLYVHLRIGPYVCAEWNYG 125
Query: 127 GFPLWLHFIPGIQFRTDNEPFKAEMQRFTAKIVDMMKQEKLYASQGGPIILSQIENEYGN 186
GFPLWLHFIPGIQFRTDN PFK EMQ FTAKIVDMMK+E LYASQGGPIILSQIENEYGN
Sbjct: 126 GFPLWLHFIPGIQFRTDNGPFKEEMQIFTAKIVDMMKKENLYASQGGPIILSQIENEYGN 185
Query: 187 IDSAYGAAGKSYIKWAAGMALSLDTGVPWVMCQQSDAPDPIINTCNGFYCDQFTPNSNNK 246
IDSAYG+A KSYI+WAA MA SLDTGVPWVMCQQ+DAPDP+INTCNGFYCDQFTPNS K
Sbjct: 186 IDSAYGSAAKSYIQWAASMATSLDTGVPWVMCQQADAPDPMINTCNGFYCDQFTPNSVKK 245
Query: 247 PKMWTENWSGWFLSFGGAVPYRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNFDRTSGGP 306
PKMWTENW+GWFLSFGGAVPYRPVED+AFAVARFFQ GGTFQNYYMYHGGTNF RT+GGP
Sbjct: 246 PKMWTENWTGWFLSFGGAVPYRPVEDIAFAVARFFQLGGTFQNYYMYHGGTNFGRTTGGP 305
Query: 307 FISTSYDYDAPLDEYGLIRQPKWGHLKDLHKAIKLCEAALVATDPTYPSLGPNLEATVYK 366
FI+TSYDYDAP+DEYGL+RQPKWGHLKDLHKAIKLCEAAL+ATDPT SLG NLEA+VYK
Sbjct: 306 FIATSYDYDAPIDEYGLLRQPKWGHLKDLHKAIKLCEAALIATDPTITSLGTNLEASVYK 365
Query: 367 TGSGLCSAFLANIGTNSDVTVKFNGNSYLLPAWSVSILPDCKNVVFNTAKINSVTLVPSF 426
TG+G C+AFLAN+ TNSD TV F+GNSY LPAWSVSILPDCKNV NTA+INS+ ++P F
Sbjct: 366 TGTGSCAAFLANVRTNSDATVNFSGNSYHLPAWSVSILPDCKNVALNTAQINSMAVMPRF 425
Query: 427 SRQSLQVAADSSDAIGSGWSYINEPVGISKDDAFTKPGLLEQINTTADQSDYLWYSLSTN 486
+QSL+ DSSD SGWS+++EPVGISK++AFTK GLLEQIN TAD+SDYLWYSLST
Sbjct: 426 MQQSLKNDIDSSDGFQSGWSWVDEPVGISKNNAFTKLGLLEQINITADKSDYLWYSLSTE 485
Query: 487 IKADEPLLEDGSKTVLHVQSLGHALHAFINGKLVGSGYGSSSNAKVTVDFPIALAPGKNT 546
I+ DEP LEDGS+TVLHV+SLGHALHAFINGKL GSG G+S NAKVTVD P+ L GKNT
Sbjct: 486 IQGDEPFLEDGSQTVLHVESLGHALHAFINGKLAGSGTGNSGNAKVTVDIPVTLIHGKNT 545
Query: 547 FDLLSLTVGLQNYGAFYEKTGAGITGPVQLKGSGNGTNIDLSSQQWTYQTGLKGEELNFP 606
DLLSLTVGLQNYGAFY+K GAGITGP++LKG NGT +DLSSQQWTYQ GL+GEEL P
Sbjct: 546 IDLLSLTVGLQNYGAFYDKQGAGITGPIKLKGLANGTTVDLSSQQWTYQVGLQGEELGLP 605
Query: 607 SGSSTQWDSKSTLPKLQPLVWYKTTFDAPAGSEPVAIDFTGMGKGEAWVNGQSIGRYWPT 666
SGSS++W + STLPK QPL+WYKTTFDAPAG++PVA+DF GMGKGEAWVNGQSIGRYWP
Sbjct: 606 SGSSSKWVAGSTLPKKQPLIWYKTTFDAPAGNDPVALDFMGMGKGEAWVNGQSIGRYWPA 665
Query: 667 YVSQNGGCTDSCNYRGAYSSNKCLKNCGKPSQSLYHVPRSWLKSSGNTLVLFEEIGGDPT 726
YVS NGGCT SCNYRG YSSNKCLKNCGKPSQ LYHVPRSWL+ SGNTLVLFEEIGGDPT
Sbjct: 666 YVSSNGGCTSSCNYRGPYSSNKCLKNCGKPSQQLYHVPRSWLQPSGNTLVLFEEIGGDPT 725
Query: 727 KISFVTKQLGSSLCSHVTDSHPLPVDMWGSDSKIQRKPGPVLSLECPNPNQVISSIKFAS 786
+ISF TKQ+ SLCS V++ HPLPVDMWGSD RK P+LSLECP PNQVISSIKFAS
Sbjct: 726 QISFATKQV-ESLCSRVSEYHPLPVDMWGSDLTTGRKSSPMLSLECPFPNQVISSIKFAS 784
Query: 787 FGTPLGTCGSFSRGRCSSARSLSVVRQACVGSKSCSIGVSVNTFGDPCKGVMKSLAVEAS 846
FGTP GTCGSFS +CSS +LS+V++AC+GSKSCSIGVS++TFGDPC G+ KSLAVEAS
Sbjct: 785 FGTPRGTCGSFSHSKCSSRTALSIVQEACIGSKSCSIGVSIDTFGDPCSGIAKSLAVEAS 844
Query: 847 CT 848
CT
Sbjct: 845 CT 846
>gi|224106752|ref|XP_002314274.1| predicted protein [Populus trichocarpa]
gi|222850682|gb|EEE88229.1| predicted protein [Populus trichocarpa]
Length = 849
Score = 1425 bits (3688), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 675/833 (81%), Positives = 747/833 (89%), Gaps = 4/833 (0%)
Query: 15 FVVLATTSFGANVTYDHRAVVIGGKRRVLISGSIHYPRSTPEMWPDLIQKSKDGGLDVIE 74
+ LATTS+G NVTYDHRA++I GKRRVL+SGSIHYPRST EMW DLIQKSKDGGLDVIE
Sbjct: 20 LLTLATTSYGVNVTYDHRALLIDGKRRVLVSGSIHYPRSTVEMWADLIQKSKDGGLDVIE 79
Query: 75 TYVFWNLHEPVRNQYNFEGRYDLVKFVKLVAEAGLYAHLRIGPYVCAEWNFGGFPLWLHF 134
TYVFWN HEPV+NQYNFEGRYDLVKF+KLV EAGLYAHLRIGPYVCAEWN+GGFPLWLHF
Sbjct: 80 TYVFWNAHEPVQNQYNFEGRYDLVKFIKLVGEAGLYAHLRIGPYVCAEWNYGGFPLWLHF 139
Query: 135 IPGIQFRTDNEPFKAEMQRFTAKIVDMMKQEKLYASQGGPIILSQIENEYGNIDSAYGAA 194
+PGI+FRTDNEPFKAEMQRFTAKIVDMMKQEKLYASQGGPIILSQIENEYGNIDS+YG A
Sbjct: 140 VPGIKFRTDNEPFKAEMQRFTAKIVDMMKQEKLYASQGGPIILSQIENEYGNIDSSYGPA 199
Query: 195 GKSYIKWAAGMALSLDTGVPWVMCQQSDAPDPIINTCNGFYCDQFTPNSNNKPKMWTENW 254
KSYI WAA MA+SLDTGVPWVMCQQ+DAPDPIINTCNGFYCDQFTPNS NKPKMWTENW
Sbjct: 200 AKSYINWAASMAVSLDTGVPWVMCQQADAPDPIINTCNGFYCDQFTPNSKNKPKMWTENW 259
Query: 255 SGWFLSFGGAVPYRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNFDRTSGGPFISTSYDY 314
SGWFLSFGGAVPYRPVEDLAFAVARF+Q GGTFQNYYMYHGGTNF R++GGPFISTSYDY
Sbjct: 260 SGWFLSFGGAVPYRPVEDLAFAVARFYQLGGTFQNYYMYHGGTNFGRSTGGPFISTSYDY 319
Query: 315 DAPLDEYGLIRQPKWGHLKDLHKAIKLCEAALVATDPTYPSLGPNLEATVYKTGSGLCSA 374
DAPLDEYGL RQPKWGHLKDLHK+IKLCE ALVATDP SLG NLEATVYKTG+GLCSA
Sbjct: 320 DAPLDEYGLTRQPKWGHLKDLHKSIKLCEEALVATDPVTSSLGQNLEATVYKTGTGLCSA 379
Query: 375 FLANIGTNSDVTVKFNGNSYLLPAWSVSILPDCKNVVFNTAKINSVTLVPSFSRQSLQVA 434
FLAN GT SD TV FNGNSY LP WSVSILPDCKNV NTAKINS+T++P+F QSL
Sbjct: 380 FLANFGT-SDKTVNFNGNSYNLPGWSVSILPDCKNVALNTAKINSMTVIPNFVHQSLIGD 438
Query: 435 ADSSDAIGSGWSYINEPVGISKDDAFTKPGLLEQINTTADQSDYLWYSLSTNIKADEPLL 494
ADS+D +GS WS+I EPVGISK+DAF KPGLLEQINTTAD+SDYLWYSLST IK +EP L
Sbjct: 439 ADSADTLGSSWSWIYEPVGISKNDAFVKPGLLEQINTTADKSDYLWYSLSTVIKDNEPFL 498
Query: 495 EDGSKTVLHVQSLGHALHAFINGKLVGSGYGSSSNAKVTVDFPIALAPGKNTFDLLSLTV 554
EDGS+TVLHV+SLGHALHAF+NGKL GSG G++ NAKV V+ P+ L PGKNT DLLSLT
Sbjct: 499 EDGSQTVLHVESLGHALHAFVNGKLAGSGTGNAGNAKVAVEIPVTLLPGKNTIDLLSLTA 558
Query: 555 GLQNYGAFYEKTGAGITGPVQLKGSGNGTNIDLSSQQWTYQTGLKGEELNFPSGSSTQWD 614
GLQNYGAF+E GAGITGPV+L+G NGT +DLSS QWTYQ GLKGEEL SG+S QW
Sbjct: 559 GLQNYGAFFELEGAGITGPVKLEGLKNGTTVDLSSLQWTYQIGLKGEELGLSSGNS-QWV 617
Query: 615 SKSTLPKLQPLVWYKTTFDAPAGSEPVAIDFTGMGKGEAWVNGQSIGRYWPTYVSQNGGC 674
++ LP QPL+WYKT+F+APAG++P+AIDF+GMGKGEAWVNGQSIGRYWPT VS GC
Sbjct: 618 TQPALPTKQPLIWYKTSFNAPAGNDPIAIDFSGMGKGEAWVNGQSIGRYWPTKVSPTSGC 677
Query: 675 TDSCNYRGAYSSNKCLKNCGKPSQSLYHVPRSWLKSSGNTLVLFEEIGGDPTKISFVTKQ 734
++ CNYRG+YSS+KCLKNC KPSQ+LYHVPRSW++SSGNTLVLFEEIGGDPT+I+F TKQ
Sbjct: 678 SN-CNYRGSYSSSKCLKNCAKPSQTLYHVPRSWVESSGNTLVLFEEIGGDPTQIAFATKQ 736
Query: 735 LGSSLCSHVTDSHPLPVDMWGSDSKIQRKPGPVLSLECPNPNQVISSIKFASFGTPLGTC 794
+SLCSHV++SHPLPVDMW S+S+ +RK GPVLSLECP PNQVISSIKFASFGTP GTC
Sbjct: 737 -SASLCSHVSESHPLPVDMWSSNSEAERKAGPVLSLECPFPNQVISSIKFASFGTPRGTC 795
Query: 795 GSFSRGRCSSARSLSVVRQACVGSKSCSIGVSVNTFGDPCKGVMKSLAVEASC 847
GSFS G+C S R+LS+V++AC+GSKSCSIG S +TFGDPC+GV KSLAVEASC
Sbjct: 796 GSFSHGQCKSTRALSIVQKACIGSKSCSIGASASTFGDPCRGVAKSLAVEASC 848
>gi|356539454|ref|XP_003538213.1| PREDICTED: beta-galactosidase 8-like [Glycine max]
Length = 838
Score = 1383 bits (3579), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 661/847 (78%), Positives = 737/847 (87%), Gaps = 10/847 (1%)
Query: 1 MASKEILLLVLCWGFVVLATTSFGANVTYDHRAVVIGGKRRVLISGSIHYPRSTPEMWPD 60
M +IL + L W F V A +SF ANVTYDHRA+VI GKRRVL+SGSIHYPRSTPEMWPD
Sbjct: 1 MRGTQILFVGLLWFFCVYAPSSFCANVTYDHRALVIDGKRRVLVSGSIHYPRSTPEMWPD 60
Query: 61 LIQKSKDGGLDVIETYVFWNLHEPVRNQYNFEGRYDLVKFVKLVAEAGLYAHLRIGPYVC 120
LIQKSKDGGLDVIETYVFWNLHEPV+ QYNFEGR DLVKFVK VA AGLY HLRIGPY C
Sbjct: 61 LIQKSKDGGLDVIETYVFWNLHEPVQGQYNFEGRADLVKFVKAVAAAGLYVHLRIGPYAC 120
Query: 121 AEWNFGGFPLWLHFIPGIQFRTDNEPFKAEMQRFTAKIVDMMKQEKLYASQGGPIILSQI 180
AEWN+GGFPLWLHFIPGIQFRTDN+PF+AEM+RFT KIVDMMKQE LYASQGGPIILSQ+
Sbjct: 121 AEWNYGGFPLWLHFIPGIQFRTDNKPFEAEMKRFTVKIVDMMKQESLYASQGGPIILSQV 180
Query: 181 ENEYGNIDSAYGAAGKSYIKWAAGMALSLDTGVPWVMCQQSDAPDPIINTCNGFYCDQFT 240
ENEYGNID+AYG A KSYIKWAA MA SLDTGVPWVMCQQ+DAPDPIINTCNGFYCDQFT
Sbjct: 181 ENEYGNIDAAYGPAAKSYIKWAASMATSLDTGVPWVMCQQADAPDPIINTCNGFYCDQFT 240
Query: 241 PNSNNKPKMWTENWSGWFLSFGGAVPYRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNFD 300
PNSN KPKMWTENWSGWFLSFGGAVPYRPVEDLAFAVARF+QRGGTFQNYYMYHGGTNF
Sbjct: 241 PNSNAKPKMWTENWSGWFLSFGGAVPYRPVEDLAFAVARFYQRGGTFQNYYMYHGGTNFG 300
Query: 301 RTSGGPFISTSYDYDAPLDEYGLIRQPKWGHLKDLHKAIKLCEAALVATDPTYPSLGPNL 360
RT+GGPFISTSYDYDAP+D+YG+IRQPKWGHLKD+HKAIKLCE AL+ATDPT S GPN+
Sbjct: 301 RTTGGPFISTSYDYDAPIDQYGIIRQPKWGHLKDVHKAIKLCEEALIATDPTITSPGPNI 360
Query: 361 EATVYKTGSGLCSAFLANIGTNSDVTVKFNGNSYLLPAWSVSILPDCKNVVFNTAKINSV 420
EA VYKTGS +C+AFLANI T SD TV FNGNSY LPAWSVSILPDCKNVV NTAKINS
Sbjct: 361 EAAVYKTGS-ICAAFLANIAT-SDATVTFNGNSYHLPAWSVSILPDCKNVVLNTAKINSA 418
Query: 421 TLVPSFSRQSLQVAADSSDAIGSGWSYINEPVGISKDDAFTKPGLLEQINTTADQSDYLW 480
+++ SF+ +S + S D GSGWS+I+EP+GISK D+F+K GLLEQINTTAD+SDYLW
Sbjct: 419 SMISSFTTESFKEEVGSLDDSGSGWSWISEPIGISKSDSFSKFGLLEQINTTADKSDYLW 478
Query: 481 YSLSTNIKADEPLLEDGSKTVLHVQSLGHALHAFINGKLVGSGYGSSSNAKVTVDFPIAL 540
YS+S +++ D GS+TVLH++SLGHALHAFINGK+ GSG G+S AKV VD P+ L
Sbjct: 479 YSISIDVEGDS-----GSQTVLHIESLGHALHAFINGKIAGSGTGNSGKAKVNVDIPVTL 533
Query: 541 APGKNTFDLLSLTVGLQNYGAFYEKTGAGITGPVQLKGSGNGTNIDLSSQQWTYQTGLKG 600
GKN+ DLLSLTVGLQNYGAF++ GAGITGPV LKG NG+ +DLSSQQWTYQ GLK
Sbjct: 534 VAGKNSIDLLSLTVGLQNYGAFFDTWGAGITGPVILKGLKNGSTVDLSSQQWTYQVGLKY 593
Query: 601 EELNFPSGSSTQWDSKSTLPKLQPLVWYKTTFDAPAGSEPVAIDFTGMGKGEAWVNGQSI 660
E+L +GSS QW+S+STLP Q L+WYKT F AP+GS PVAIDFTGMGKGEAWVNGQSI
Sbjct: 594 EDLGPSNGSSGQWNSQSTLPTNQSLIWYKTNFVAPSGSNPVAIDFTGMGKGEAWVNGQSI 653
Query: 661 GRYWPTYVSQNGGCTDSCNYRGAYSSNKCLKNCGKPSQSLYHVPRSWLKSSGNTLVLFEE 720
GRYWPTYVS NGGCTDSCNYRGAYSS+KCLKNCGKPSQ+LYH+PRSWL+ NTLVLFEE
Sbjct: 654 GRYWPTYVSPNGGCTDSCNYRGAYSSSKCLKNCGKPSQTLYHIPRSWLQPDSNTLVLFEE 713
Query: 721 IGGDPTKISFVTKQLGSSLCSHVTDSHPLPVDMWGSDSKIQRKPGPVLSLECPNPNQVIS 780
GGDPT+ISF TKQ+G S+CSHV++SHP PVD+W SD RK GPVLSLECP PNQ+IS
Sbjct: 714 SGGDPTQISFATKQIG-SMCSHVSESHPPPVDLWNSDKG--RKVGPVLSLECPYPNQLIS 770
Query: 781 SIKFASFGTPLGTCGSFSRGRCSSARSLSVVRQACVGSKSCSIGVSVNTFGDPCKGVMKS 840
SIKFASFGTP GTCG+F GRC S ++LS+V++AC+GS SC IG+S+NTFGDPCKGV KS
Sbjct: 771 SIKFASFGTPYGTCGNFKHGRCRSNKALSIVQKACIGSSSCRIGISINTFGDPCKGVTKS 830
Query: 841 LAVEASC 847
LAVEASC
Sbjct: 831 LAVEASC 837
>gi|14970841|emb|CAC44501.1| beta-galactosidase [Fragaria x ananassa]
Length = 840
Score = 1371 bits (3548), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 655/829 (79%), Positives = 722/829 (87%), Gaps = 11/829 (1%)
Query: 19 ATTSFGANVTYDHRAVVIGGKRRVLISGSIHYPRSTPEMWPDLIQKSKDGGLDVIETYVF 78
AT S+ V+YDHRA+VI GKRRVL+SGSIHYPRSTPEMWPDLIQKSKDGGLDVIETYVF
Sbjct: 22 ATASYCTTVSYDHRALVIDGKRRVLVSGSIHYPRSTPEMWPDLIQKSKDGGLDVIETYVF 81
Query: 79 WNLHEPVRNQYNFEGRYDLVKFVKLVAEAGLYAHLRIGPYVCAEWNFGGFPLWLHFIPGI 138
WNLHEPVR QYNFEGR DLV FVK VAEAGLY HLRIGPYVCAEWN+GGFPLWLHFIPGI
Sbjct: 82 WNLHEPVRGQYNFEGRNDLVGFVKAVAEAGLYVHLRIGPYVCAEWNYGGFPLWLHFIPGI 141
Query: 139 QFRTDNEPFKAEMQRFTAKIVDMMKQEKLYASQGGPIILSQIENEYGNIDSAYGAAGKSY 198
+ RTDNEP+KAEM RFTAKIV+MMK EKLYASQGGPIILSQIENEYGNID AYG A K+Y
Sbjct: 142 KLRTDNEPYKAEMHRFTAKIVEMMKNEKLYASQGGPIILSQIENEYGNIDKAYGPAAKTY 201
Query: 199 IKWAAGMALSLDTGVPWVMCQQSDAPDPIINTCNGFYCDQFTPNSNNKPKMWTENWSGWF 258
I WAA MA+SLDTGVPWVMCQQ+DAP +INTCNGFYCDQF+PNSN+ PK+WTENWSGWF
Sbjct: 202 INWAANMAVSLDTGVPWVMCQQADAPSSVINTCNGFYCDQFSPNSNSTPKIWTENWSGWF 261
Query: 259 LSFGGAVPYRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNFDRTSGGPFISTSYDYDAPL 318
LSFGGAVP RPVEDLAFAVARF+QRGGTFQNYYMYHGGTNF R+SGGPFI+TSYDYDAPL
Sbjct: 262 LSFGGAVPQRPVEDLAFAVARFYQRGGTFQNYYMYHGGTNFGRSSGGPFIATSYDYDAPL 321
Query: 319 DEYGLIRQPKWGHLKDLHKAIKLCEAALVATDPTYPSLGPNLEATVYKTGSGLCSAFLAN 378
DEYGL+RQPKWGHLKD+HKAIKLCE A+VATDPT SLG N+EA VYKTGS +CSAFLAN
Sbjct: 322 DEYGLLRQPKWGHLKDVHKAIKLCEPAMVATDPTISSLGQNIEAAVYKTGS-VCSAFLAN 380
Query: 379 IGTNSDVTVKFNGNSYLLPAWSVSILPDCKNVVFNTAKINSVTLVPSFSRQSLQVAADSS 438
+ T SD TV FNGNSY LPAWSVSILPDCKNVV NTAKIN+ T+VPSF+RQS+ + +
Sbjct: 381 VDTKSDATVTFNGNSYQLPAWSVSILPDCKNVVINTAKINTATMVPSFTRQSISADVEPT 440
Query: 439 DAIGSGWSYINEPVGISKDDAFTKPGLLEQINTTADQSDYLWYSLSTNIKADEPLLEDGS 498
+A+GSGWS+INEPVGISK DAFT+ GLLEQINTTAD+SDYLWYS S ++K G
Sbjct: 441 EAVGSGWSWINEPVGISKGDAFTRVGLLEQINTTADKSDYLWYSTSIDVKG-------GY 493
Query: 499 KTVLHVQSLGHALHAFINGKLVGSGYGSSSNAKVTVDFPIALAPGKNTFDLLSLTVGLQN 558
K LHVQSLGHALHAF+NGKL GSG G+S NAKV+V+ P+ A GKNT DLLSLTVGLQN
Sbjct: 494 KADLHVQSLGHALHAFVNGKLAGSGTGNSGNAKVSVEIPVEFASGKNTIDLLSLTVGLQN 553
Query: 559 YGAFYEKTGAGITGPVQLKGSGNGTNIDLSSQQWTYQTGLKGEELNFPSGSSTQWDSKST 618
YGAF++ GAGITGPVQLKGS NGT IDLSSQQWTYQ GLKGE+ + PSGSS QW S+ T
Sbjct: 554 YGAFFDLVGAGITGPVQLKGSANGTTIDLSSQQWTYQIGLKGEDEDLPSGSS-QWISQPT 612
Query: 619 LPKLQPLVWYKTTFDAPAGSEPVAIDFTGMGKGEAWVNGQSIGRYWPTYVSQNGGCTDSC 678
LPK QPL WYKT FDAP GS PVA+DFTGMGKGEAWVNGQSIGRYWPT V+ GCTD C
Sbjct: 613 LPKNQPLTWYKTQFDAPGGSNPVALDFTGMGKGEAWVNGQSIGRYWPTNVAPKTGCTD-C 671
Query: 679 NYRGAYSSNKCLKNCGKPSQSLYHVPRSWLKSSGNTLVLFEEIGGDPTKISFVTKQLGSS 738
NYRGAYS++KC KNCG PSQ LYHVPRSW+KSSGNTLVLFEE+GGDPT++SF T+Q+ S
Sbjct: 672 NYRGAYSADKCRKNCGMPSQKLYHVPRSWMKSSGNTLVLFEEVGGDPTQLSFATRQV-ES 730
Query: 739 LCSHVTDSHPLPVDMWGSDSKIQRKPGPVLSLECPNPNQVISSIKFASFGTPLGTCGSFS 798
LCSHV++SHP PVDMW SDSK K P LSLECP PNQVISSIKFAS+G P GTCGSFS
Sbjct: 731 LCSHVSESHPSPVDMWSSDSKAGSKSRPRLSLECPFPNQVISSIKFASYGRPSGTCGSFS 790
Query: 799 RGRCSSARSLSVVRQACVGSKSCSIGVSVNTFGDPCKGVMKSLAVEASC 847
G C S+R+LS+V++ACVGSKSCSI VS +TFGDPCKG+ KSLAVEASC
Sbjct: 791 HGSCRSSRALSIVQKACVGSKSCSIEVSTHTFGDPCKGLAKSLAVEASC 839
>gi|356550171|ref|XP_003543462.1| PREDICTED: beta-galactosidase 8-like isoform 1 [Glycine max]
Length = 840
Score = 1363 bits (3528), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 659/841 (78%), Positives = 737/841 (87%), Gaps = 7/841 (0%)
Query: 7 LLLVLCWGFVVLATTSFGANVTYDHRAVVIGGKRRVLISGSIHYPRSTPEMWPDLIQKSK 66
++LVL W + + T F ANV YDHRA+VI GKRRVLISGSIHYPRSTPEMWPDLIQKSK
Sbjct: 6 IVLVLFWLLCIHSPTLFCANVEYDHRALVIDGKRRVLISGSIHYPRSTPEMWPDLIQKSK 65
Query: 67 DGGLDVIETYVFWNLHEPVRNQYNFEGRYDLVKFVKLVAEAGLYAHLRIGPYVCAEWNFG 126
DGGLDVIETYVFWNL+EPVR QY+F+GR DLVKFVK VA AGLY HLRIGPYVCAEWN+G
Sbjct: 66 DGGLDVIETYVFWNLNEPVRGQYDFDGRKDLVKFVKTVAAAGLYVHLRIGPYVCAEWNYG 125
Query: 127 GFPLWLHFIPGIQFRTDNEPFKAEMQRFTAKIVDMMKQEKLYASQGGPIILSQIENEYGN 186
GFPLWLHFIPGI+FRTDNEPFKAEM+RFTAKIVDM+K+E LYASQGGP+ILSQIENEYGN
Sbjct: 126 GFPLWLHFIPGIKFRTDNEPFKAEMKRFTAKIVDMIKEENLYASQGGPVILSQIENEYGN 185
Query: 187 IDSAYGAAGKSYIKWAAGMALSLDTGVPWVMCQQSDAPDPIINTCNGFYCDQFTPNSNNK 246
IDSAYGAAGKSYIKWAA MA SLDTGVPWVMCQQ+DAPDPIINTCNGFYCDQFTPNSN K
Sbjct: 186 IDSAYGAAGKSYIKWAATMATSLDTGVPWVMCQQADAPDPIINTCNGFYCDQFTPNSNTK 245
Query: 247 PKMWTENWSGWFLSFGGAVPYRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNFDRTSGGP 306
PKMWTENWSGWFL FGGAVPYRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNFDRTSGGP
Sbjct: 246 PKMWTENWSGWFLPFGGAVPYRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNFDRTSGGP 305
Query: 307 FISTSYDYDAPLDEYGLIRQPKWGHLKDLHKAIKLCEAALVATDPTYPSLGPNLEATVYK 366
FI+TSYDYDAP+DEYG+IRQPKWGHLK++HKAIKLCE AL+ATDPT SLGPNLEA VYK
Sbjct: 306 FIATSYDYDAPIDEYGIIRQPKWGHLKEVHKAIKLCEEALIATDPTITSLGPNLEAAVYK 365
Query: 367 TGSGLCSAFLANIGTNSDVTVKFNGNSYLLPAWSVSILPDCKNVVFNTAKINSVTLVPSF 426
TGS +C+AFLAN+ T SDVTV F+GNSY LPAWSVSILPDCKNVV NTAKINS + + SF
Sbjct: 366 TGS-VCAAFLANVDTKSDVTVNFSGNSYHLPAWSVSILPDCKNVVLNTAKINSASAISSF 424
Query: 427 SRQSLQVAADSSDAIGSGWSYINEPVGISKDDAFTKPGLLEQINTTADQSDYLWYSLSTN 486
+ +SL+ SS+A +GWS+I+EPVGISK D+F + GLLEQINTTAD+SDYLWYSLS +
Sbjct: 425 TTESLKEDIGSSEASSTGWSWISEPVGISKADSFPQTGLLEQINTTADKSDYLWYSLSID 484
Query: 487 IKADEPLLEDGSKTVLHVQSLGHALHAFINGKLVGSGYGSSSNAKVTVDFPIALAPGKNT 546
K D GS+TVLH++SLGHALHAFINGKL GS G+S K TVD P+ L GKNT
Sbjct: 485 YKGDA-----GSQTVLHIESLGHALHAFINGKLAGSQTGNSGKYKFTVDIPVTLVAGKNT 539
Query: 547 FDLLSLTVGLQNYGAFYEKTGAGITGPVQLKGSGNGTNIDLSSQQWTYQTGLKGEELNFP 606
DLLSLTVGLQNYGAF++ GAGITGPV LKG NG +DLS Q+WTYQ GLKGE+L
Sbjct: 540 IDLLSLTVGLQNYGAFFDTWGAGITGPVILKGLANGNTLDLSYQKWTYQVGLKGEDLGLS 599
Query: 607 SGSSTQWDSKSTLPKLQPLVWYKTTFDAPAGSEPVAIDFTGMGKGEAWVNGQSIGRYWPT 666
SGSS QW+S+ST PK QPL+WYKTTF AP+GS+PVAIDFTGMGKGEAWVNGQSIGRYWPT
Sbjct: 600 SGSSGQWNSQSTFPKNQPLIWYKTTFAAPSGSDPVAIDFTGMGKGEAWVNGQSIGRYWPT 659
Query: 667 YVSQNGGCTDSCNYRGAYSSNKCLKNCGKPSQSLYHVPRSWLKSSGNTLVLFEEIGGDPT 726
YV+ + GCTDSCNYRG YS++KC +NCGKPSQ+LYHVPRSWLK SGN LVLFEE GGDPT
Sbjct: 660 YVASDAGCTDSCNYRGPYSASKCRRNCGKPSQTLYHVPRSWLKPSGNILVLFEEKGGDPT 719
Query: 727 KISFVTKQLGSSLCSHVTDSHPLPVDMWGSDSKIQRKPGPVLSLECPNPNQVISSIKFAS 786
+ISFVTKQ SLC+HV+DSHP PVD+W SD++ RK GPVLSL CP+ NQVISSIKFAS
Sbjct: 720 QISFVTKQT-ESLCAHVSDSHPPPVDLWNSDTESGRKVGPVLSLTCPHDNQVISSIKFAS 778
Query: 787 FGTPLGTCGSFSRGRCSSARSLSVVRQACVGSKSCSIGVSVNTFGDPCKGVMKSLAVEAS 846
+GTPLGTCG+F GRCSS ++LS+V++AC+GS SCS+GVS TFG+PC+GV KSLAVEA+
Sbjct: 779 YGTPLGTCGNFYHGRCSSNKALSIVQKACIGSSSCSVGVSSETFGNPCRGVAKSLAVEAT 838
Query: 847 C 847
C
Sbjct: 839 C 839
>gi|61162203|dbj|BAD91083.1| beta-D-galactosidase [Pyrus pyrifolia]
Length = 842
Score = 1360 bits (3521), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 653/830 (78%), Positives = 721/830 (86%), Gaps = 3/830 (0%)
Query: 19 ATTSFGANVTYDHRAVVIGGKRRVLISGSIHYPRSTPEMWPDLIQKSKDGGLDVIETYVF 78
AT S+ A VTYDHRA+VI GKRRVL+SGSIHYPRSTPEMWPDLIQKSKDGGLDVIETYVF
Sbjct: 14 ATASYCAKVTYDHRALVIDGKRRVLVSGSIHYPRSTPEMWPDLIQKSKDGGLDVIETYVF 73
Query: 79 WNLHEPVRNQYNFEGRYDLVKFVKLVAEAGLYAHLRIGPYVCAEWNFGGFPLWLHFIPGI 138
WNLHE VR QY+F GR DLVKFVK VAEAGLY HLRIGPYVCAEWN+GGFPLWLHFIPGI
Sbjct: 74 WNLHEAVRGQYDFGGRKDLVKFVKTVAEAGLYVHLRIGPYVCAEWNYGGFPLWLHFIPGI 133
Query: 139 QFRTDNEPFKAEMQRFTAKIVDMMKQEKLYASQGGPIILSQIENEYGNIDSAYGAAGKSY 198
Q RTDNEPFKAEMQRFTAKIVDMMK+EKLYASQGGPIILSQIENEYGNID AYGAA ++Y
Sbjct: 134 QLRTDNEPFKAEMQRFTAKIVDMMKKEKLYASQGGPIILSQIENEYGNIDRAYGAAAQTY 193
Query: 199 IKWAAGMALSLDTGVPWVMCQQSDAPDPIINTCNGFYCDQFTPN-SNNKPKMWTENWSGW 257
IKWAA MA+SLDTGVPWVMCQQ DAP +I+TCNGFYCDQ+TP +PKMWTENWSGW
Sbjct: 194 IKWAADMAVSLDTGVPWVMCQQDDAPPSVISTCNGFYCDQWTPRLPEKRPKMWTENWSGW 253
Query: 258 FLSFGGAVPYRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNFDRTSGGPFISTSYDYDAP 317
FLSFGGAVP RPVEDLAFAVARFFQRGGTFQNYYMYHGGTNF R++GGPFI+TSYDYDAP
Sbjct: 254 FLSFGGAVPQRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNFGRSTGGPFIATSYDYDAP 313
Query: 318 LDEYGLIRQPKWGHLKDLHKAIKLCEAALVATDPTYPSLGPNLEATVYKTGSGLCSAFLA 377
+DEYGL+RQPKWGHLKD+HKAIKLCE A+VATDP Y S GPN+EATVYKTGS C+AFLA
Sbjct: 314 IDEYGLLRQPKWGHLKDVHKAIKLCEEAMVATDPKYSSFGPNVEATVYKTGSA-CAAFLA 372
Query: 378 NIGTNSDVTVKFNGNSYLLPAWSVSILPDCKNVVFNTAKINSVTLVPSFSRQSLQVAADS 437
N T SD TV FNGNSY LPAWSVSILPDCKNVV NTAKINS ++PSF S+ DS
Sbjct: 373 NSDTKSDATVTFNGNSYHLPAWSVSILPDCKNVVLNTAKINSAAMIPSFMHHSVLDDIDS 432
Query: 438 SDAIGSGWSYINEPVGISKDDAFTKPGLLEQINTTADQSDYLWYSLSTNIKADEPLLEDG 497
S+A+GSGWS+INEPVGISK DAFT+ GLLEQINTTAD+SDYLWYSLS ++ + + L+DG
Sbjct: 433 SEALGSGWSWINEPVGISKKDAFTRVGLLEQINTTADKSDYLWYSLSIDVTSSDTFLQDG 492
Query: 498 SKTVLHVQSLGHALHAFINGKLVGSGYGSSSNAKVTVDFPIALAPGKNTFDLLSLTVGLQ 557
S+T+LHV+SLGHALHAFINGK G G +++N K++VD P+ A GKNT DLLSLT+GLQ
Sbjct: 493 SQTILHVESLGHALHAFINGKPAGRGIITANNGKISVDIPVTFASGKNTIDLLSLTIGLQ 552
Query: 558 NYGAFYEKTGAGITGPVQLKGSGNGTNIDLSSQQWTYQTGLKGEELNFPSGSSTQWDSKS 617
NYGAF++K+GAGITGPVQLKG NGT DLSSQ+WTYQ GL+GE+ F SGSS+QW S+
Sbjct: 553 NYGAFFDKSGAGITGPVQLKGLKNGTTTDLSSQRWTYQIGLQGEDSGFSSGSSSQWISQP 612
Query: 618 TLPKLQPLVWYKTTFDAPAGSEPVAIDFTGMGKGEAWVNGQSIGRYWPTYVSQNGGCTDS 677
TLPK QPL WYK TF+AP GS PVA+DFTGMGKGEAWVNGQSIGRYWPT + GC DS
Sbjct: 613 TLPKKQPLTWYKATFNAPDGSNPVALDFTGMGKGEAWVNGQSIGRYWPTNNAPTSGCPDS 672
Query: 678 CNYRGAYSSNKCLKNCGKPSQSLYHVPRSWLKSSGNTLVLFEEIGGDPTKISFVTKQLGS 737
CN+RG Y SNKC KNCGKPSQ LYHVPRSWLK SGNTLVLFEEIGGDPT+ISF T+Q+
Sbjct: 673 CNFRGPYDSNKCRKNCGKPSQELYHVPRSWLKPSGNTLVLFEEIGGDPTQISFATRQI-E 731
Query: 738 SLCSHVTDSHPLPVDMWGSDSKIQRKPGPVLSLECPNPNQVISSIKFASFGTPLGTCGSF 797
SLCSHV++SHP PVD W SDSK RK GPVLSLECP PNQVISSIKFAS+G P GTCGSF
Sbjct: 732 SLCSHVSESHPSPVDTWSSDSKAGRKLGPVLSLECPFPNQVISSIKFASYGKPQGTCGSF 791
Query: 798 SRGRCSSARSLSVVRQACVGSKSCSIGVSVNTFGDPCKGVMKSLAVEASC 847
S G+C S +LS+V++ACVGSKSCSI VSV TFGDPCKGV KSLAVEASC
Sbjct: 792 SHGQCKSTSALSIVQKACVGSKSCSIEVSVKTFGDPCKGVAKSLAVEASC 841
>gi|56201401|dbj|BAD20774.2| beta-galactosidase [Raphanus sativus]
Length = 851
Score = 1358 bits (3516), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 647/842 (76%), Positives = 719/842 (85%), Gaps = 7/842 (0%)
Query: 6 ILLLVLCWGFVVLATTSFGANVTYDHRAVVIGGKRRVLISGSIHYPRSTPEMWPDLIQKS 65
+ LLVL V++ + A+VTYDHRA+VI GKR++LISGSIHYPRSTPEMWPDLIQKS
Sbjct: 16 VSLLVL----VMMTAAATAASVTYDHRALVIDGKRKILISGSIHYPRSTPEMWPDLIQKS 71
Query: 66 KDGGLDVIETYVFWNLHEPVRNQYNFEGRYDLVKFVKLVAEAGLYAHLRIGPYVCAEWNF 125
KDGGLDVIETYVFWN HEP +N+YNFEGRYDLVKFVKL A+AGLY HLRIGPY CAEWN+
Sbjct: 72 KDGGLDVIETYVFWNGHEPEKNKYNFEGRYDLVKFVKLAAKAGLYVHLRIGPYACAEWNY 131
Query: 126 GGFPLWLHFIPGIQFRTDNEPFKAEMQRFTAKIVDMMKQEKLYASQGGPIILSQIENEYG 185
GGFP+WLHF+PGI+FRTDNEPFKAEMQRFTAKIVD+MKQEKLYASQGGPIILSQIENEYG
Sbjct: 132 GGFPVWLHFVPGIKFRTDNEPFKAEMQRFTAKIVDLMKQEKLYASQGGPIILSQIENEYG 191
Query: 186 NIDSAYGAAGKSYIKWAAGMALSLDTGVPWVMCQQSDAPDPIINTCNGFYCDQFTPNSNN 245
NIDS+YGAAGKSY+KW+A MALSLDTGVPW MCQQ DAPDPIINTCNGFYCDQFTPNSNN
Sbjct: 192 NIDSSYGAAGKSYMKWSASMALSLDTGVPWNMCQQGDAPDPIINTCNGFYCDQFTPNSNN 251
Query: 246 KPKMWTENWSGWFLSFGGAVPYRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNFDRTSGG 305
KPKMWTENWSGWFL FG PYRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNF+RTSGG
Sbjct: 252 KPKMWTENWSGWFLGFGEPSPYRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNFERTSGG 311
Query: 306 PFISTSYDYDAPLDEYGLIRQPKWGHLKDLHKAIKLCEAALVATDPTYPSLGPNLEATVY 365
P ISTSYDYDAP+DEYGL+RQPKWGHL+DLHKAIKLCE AL+ATDP SLG NLEA VY
Sbjct: 312 PLISTSYDYDAPIDEYGLLRQPKWGHLRDLHKAIKLCEDALIATDPKITSLGSNLEAAVY 371
Query: 366 KTGSGLCSAFLANIGTNSDVTVKFNGNSYLLPAWSVSILPDCKNVVFNTAKINSVTLVPS 425
KT +G C+AFLANIGT SD TV FNG SY LPAWSVSILPDCKNV FNTAKINS T +
Sbjct: 372 KTSTGSCAAFLANIGTKSDATVTFNGKSYRLPAWSVSILPDCKNVAFNTAKINSATESTA 431
Query: 426 FSRQSLQVAADSSDAIGSGWSYINEPVGISKDDAFTKPGLLEQINTTADQSDYLWYSLST 485
F+RQSL+ ADSS +GS WSYI EPVGISK DAF KPGLLEQINTTAD+SDYLWYSL
Sbjct: 432 FARQSLKPNADSSAELGSQWSYIKEPVGISKADAFVKPGLLEQINTTADKSDYLWYSLRM 491
Query: 486 NIKADEPLLEDGSKTVLHVQSLGHALHAFINGKLVGSGYGSSSNAKVTVDFPIALAPGKN 545
+IK DE L++GSK VLHVQS+G ++AFINGKL GSG G K+++D PI L GKN
Sbjct: 492 DIKGDETFLDEGSKAVLHVQSIGQLVYAFINGKLAGSGNG---KQKISLDIPINLVTGKN 548
Query: 546 TFDLLSLTVGLQNYGAFYEKTGAGITGPVQLKGSGNGTNIDLSSQQWTYQTGLKGEELNF 605
T DLLS+TVGL NYG F++ TGAGITGPV LK + G++ DLSSQQWTYQ GLKGE+
Sbjct: 549 TIDLLSVTVGLANYGPFFDLTGAGITGPVSLKSAKTGSSTDLSSQQWTYQVGLKGEDKGL 608
Query: 606 PSGSSTQWDSKSTLPKLQPLVWYKTTFDAPAGSEPVAIDFTGMGKGEAWVNGQSIGRYWP 665
SG S++W S S LP QPL+WYKTTFDAP+GS+PVAIDFTG GKG AWVNGQSIGRYWP
Sbjct: 609 GSGDSSEWVSNSPLPTSQPLIWYKTTFDAPSGSDPVAIDFTGTGKGIAWVNGQSIGRYWP 668
Query: 666 TYVSQNGGCTDSCNYRGAYSSNKCLKNCGKPSQSLYHVPRSWLKSSGNTLVLFEEIGGDP 725
T +++ GC SC+YRG+Y SNKCLKNCGKPSQ+LYHVPRSW+K SGNTLVL EE+GGDP
Sbjct: 669 TSIARTDGCVGSCDYRGSYRSNKCLKNCGKPSQTLYHVPRSWIKPSGNTLVLLEEMGGDP 728
Query: 726 TKISFVTKQLGSSLCSHVTDSHPLPVDMWGSDSKIQRKPGPVLSLECPNPNQVISSIKFA 785
TKISF TKQ GS+LC V+ SHP PVD W SDSK + PVLSL+CP QVISSI+FA
Sbjct: 729 TKISFATKQTGSNLCLTVSQSHPAPVDTWISDSKFSNRTSPVLSLKCPVSTQVISSIRFA 788
Query: 786 SFGTPLGTCGSFSRGRCSSARSLSVVRQACVGSKSCSIGVSVNTFGDPCKGVMKSLAVEA 845
SFGTP GTCGSFS G CSSARSLSVV++ACVGS+SC + VS FG+PC+GV+KSLAVEA
Sbjct: 789 SFGTPTGTCGSFSYGHCSSARSLSVVQKACVGSRSCKVEVSTRVFGEPCRGVVKSLAVEA 848
Query: 846 SC 847
SC
Sbjct: 849 SC 850
>gi|297822423|ref|XP_002879094.1| beta-glactosidase 8 [Arabidopsis lyrata subsp. lyrata]
gi|297324933|gb|EFH55353.1| beta-glactosidase 8 [Arabidopsis lyrata subsp. lyrata]
Length = 846
Score = 1357 bits (3513), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 645/845 (76%), Positives = 729/845 (86%), Gaps = 10/845 (1%)
Query: 6 ILLLVLCWGFVVLATTSFGANVTYDHRAVVIGGKRRVLISGSIHYPRSTPEMWPDLIQKS 65
ILLL+L +++A T+ NVTYDHRA+VI GKR+VLISGSIHYPRSTPEMWP+LI+KS
Sbjct: 10 ILLLILQ---IMMAATA--VNVTYDHRALVIDGKRKVLISGSIHYPRSTPEMWPELIKKS 64
Query: 66 KDGGLDVIETYVFWNLHEPVRNQYNFEGRYDLVKFVKLVAEAGLYAHLRIGPYVCAEWNF 125
KDGGLDVIETYVFW+ HEP +N+YNFEGRYDLVKFVKLV EAGLY HLRIGPYVCAEWN+
Sbjct: 65 KDGGLDVIETYVFWSGHEPEKNKYNFEGRYDLVKFVKLVEEAGLYVHLRIGPYVCAEWNY 124
Query: 126 GGFPLWLHFIPGIQFRTDNEPFKAEMQRFTAKIVDMMKQEKLYASQGGPIILSQIENEYG 185
GGFP+WLHF+PGI+FRTDNEPFK EMQRFT KIVD+MKQEKLYASQGGPIILSQIENEYG
Sbjct: 125 GGFPVWLHFVPGIKFRTDNEPFKEEMQRFTTKIVDLMKQEKLYASQGGPIILSQIENEYG 184
Query: 186 NIDSAYGAAGKSYIKWAAGMALSLDTGVPWVMCQQSDAPDPIINTCNGFYCDQFTPNSNN 245
NIDSAYGAA K YIKW+A MALSLDTGVPW MCQQ+DAPDP+INTCNGFYCDQFTPNSN+
Sbjct: 185 NIDSAYGAAAKIYIKWSASMALSLDTGVPWNMCQQADAPDPMINTCNGFYCDQFTPNSNS 244
Query: 246 KPKMWTENWSGWFLSFGGAVPYRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNFDRTSGG 305
KPKMWTENWSGWFL FG PYRPVEDLAFAVARF+QRGGTFQNYYMYHGGTNFDRTSGG
Sbjct: 245 KPKMWTENWSGWFLGFGDPSPYRPVEDLAFAVARFYQRGGTFQNYYMYHGGTNFDRTSGG 304
Query: 306 PFISTSYDYDAPLDEYGLIRQPKWGHLKDLHKAIKLCEAALVATDPTYPSLGPNLEATVY 365
P ISTSYDYDAP+DEYGL+RQPKWGHL+DLHKAIKLCE AL+ATDPT SLG NLEA VY
Sbjct: 305 PLISTSYDYDAPIDEYGLLRQPKWGHLRDLHKAIKLCEDALIATDPTISSLGSNLEAAVY 364
Query: 366 KTGSGLCSAFLANIGTNSDVTVKFNGNSYLLPAWSVSILPDCKNVVFNTAKINSVTLVPS 425
KT SG C+AFLAN+GT SD TV FNG SY LPAWSVSILPDCKNV FNTAKINS T +
Sbjct: 365 KTASGSCAAFLANVGTKSDATVSFNGESYHLPAWSVSILPDCKNVAFNTAKINSATEPTA 424
Query: 426 FSRQSLQVAADSSDAIGSGWSYINEPVGISKDDAFTKPGLLEQINTTADQSDYLWYSLST 485
F+RQSL+ SS +GS WSYI EP+GISK DAF KPGLLEQINTTAD+SDYLWYSL
Sbjct: 425 FARQSLKPDGGSSAELGSEWSYIKEPIGISKADAFLKPGLLEQINTTADKSDYLWYSLRM 484
Query: 486 NIKADEPLLEDGSKTVLHVQSLGHALHAFINGKLVGSGYGSSSNAKVTVDFPIALAPGKN 545
+IK DE L++GSK VLH++SLG ++AFINGKL GSG+G K+++D PI LA GKN
Sbjct: 485 DIKGDETFLDEGSKAVLHIESLGQVVYAFINGKLAGSGHGKQ---KISLDIPINLAAGKN 541
Query: 546 TFDLLSLTVGLQNYGAFYEKTGAGITGPVQLKGSGNGTNIDLSSQQWTYQTGLKGEELNF 605
T DLLS+TVGL NYGAF++ GAGITGPV LK + G++IDL+SQQWTYQ GLKGE+
Sbjct: 542 TVDLLSVTVGLANYGAFFDLVGAGITGPVTLKSAKGGSSIDLASQQWTYQVGLKGEDTGL 601
Query: 606 PSGSSTQWDSKSTLPKLQPLVWYKTTFDAPAGSEPVAIDFTGMGKGEAWVNGQSIGRYWP 665
+ S++W SKS LP QPL+WYKTTFDAP+GSEPVAIDFTG GKG AWVNGQSIGRYWP
Sbjct: 602 ATVDSSEWVSKSPLPTKQPLIWYKTTFDAPSGSEPVAIDFTGTGKGIAWVNGQSIGRYWP 661
Query: 666 TYVSQNGGCTDSCNYRGAYSSNKCLKNCGKPSQSLYHVPRSWLKSSGNTLVLFEEIGGDP 725
T ++ NGGCTDSC+YRG+Y +NKCLKNCGKPSQ+LYHVPRSWLK SGNTLVLFEE+GGDP
Sbjct: 662 TSIAGNGGCTDSCDYRGSYRANKCLKNCGKPSQTLYHVPRSWLKPSGNTLVLFEEMGGDP 721
Query: 726 TKISFVTKQLGSSLCSHVTDSHPLPVDMWGSDSKI--QRKPGPVLSLECPNPNQVISSIK 783
T+ISF TKQ GS+LC V+ SHP PVD W SDSKI + + PVLSL+CP QVISSIK
Sbjct: 722 TQISFGTKQTGSNLCLMVSQSHPPPVDTWTSDSKISNRNRTRPVLSLKCPVSTQVISSIK 781
Query: 784 FASFGTPLGTCGSFSRGRCSSARSLSVVRQACVGSKSCSIGVSVNTFGDPCKGVMKSLAV 843
FASFGTP GTCGSF+ G C+S+RSLSVV++AC+GS+SC++ VS FG+PC+GV+KSLAV
Sbjct: 782 FASFGTPQGTCGSFTHGHCNSSRSLSVVQKACIGSRSCNVEVSTRVFGEPCRGVIKSLAV 841
Query: 844 EASCT 848
EASC+
Sbjct: 842 EASCS 846
>gi|334184536|ref|NP_001189624.1| beta-galactosidase 8 [Arabidopsis thaliana]
gi|330253034|gb|AEC08128.1| beta-galactosidase 8 [Arabidopsis thaliana]
Length = 846
Score = 1346 bits (3484), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 634/827 (76%), Positives = 715/827 (86%), Gaps = 5/827 (0%)
Query: 24 GANVTYDHRAVVIGGKRRVLISGSIHYPRSTPEMWPDLIQKSKDGGLDVIETYVFWNLHE 83
ANVTYDHRA+VI GKR+VLISGSIHYPRSTPEMWP+LIQKSKDGGLDVIETYVFW+ HE
Sbjct: 23 AANVTYDHRALVIDGKRKVLISGSIHYPRSTPEMWPELIQKSKDGGLDVIETYVFWSGHE 82
Query: 84 PVRNQYNFEGRYDLVKFVKLVAEAGLYAHLRIGPYVCAEWNFGGFPLWLHFIPGIQFRTD 143
P +N+YNFEGRYDLVKFVKL A+AGLY HLRIGPYVCAEWN+GGFP+WLHF+PGI+FRTD
Sbjct: 83 PEKNKYNFEGRYDLVKFVKLAAKAGLYVHLRIGPYVCAEWNYGGFPVWLHFVPGIKFRTD 142
Query: 144 NEPFKAEMQRFTAKIVDMMKQEKLYASQGGPIILSQIENEYGNIDSAYGAAGKSYIKWAA 203
NEPFK EMQRFT KIVD+MKQEKLYASQGGPIILSQIENEYGNIDSAYGAA KSYIKW+A
Sbjct: 143 NEPFKEEMQRFTTKIVDLMKQEKLYASQGGPIILSQIENEYGNIDSAYGAAAKSYIKWSA 202
Query: 204 GMALSLDTGVPWVMCQQSDAPDPIINTCNGFYCDQFTPNSNNKPKMWTENWSGWFLSFGG 263
MALSLDTGVPW MCQQ+DAPDP+INTCNGFYCDQFTPNSNNKPKMWTENWSGWFL FG
Sbjct: 203 SMALSLDTGVPWNMCQQTDAPDPMINTCNGFYCDQFTPNSNNKPKMWTENWSGWFLGFGD 262
Query: 264 AVPYRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNFDRTSGGPFISTSYDYDAPLDEYGL 323
PYRPVEDLAFAVARF+QRGGTFQNYYMYHGGTNFDRTSGGP ISTSYDYDAP+DEYGL
Sbjct: 263 PSPYRPVEDLAFAVARFYQRGGTFQNYYMYHGGTNFDRTSGGPLISTSYDYDAPIDEYGL 322
Query: 324 IRQPKWGHLKDLHKAIKLCEAALVATDPTYPSLGPNLEATVYKTGSGLCSAFLANIGTNS 383
+RQPKWGHL+DLHKAIKLCE AL+ATDPT SLG NLEA VYKT SG C+AFLAN+ T S
Sbjct: 323 LRQPKWGHLRDLHKAIKLCEDALIATDPTITSLGSNLEAAVYKTESGSCAAFLANVDTKS 382
Query: 384 DVTVKFNGNSYLLPAWSVSILPDCKNVVFNTAKINSVTLVPSFSRQSLQVAADSSDAIGS 443
D TV FNG SY LPAWSVSILPDCKNV FNTAKINS T +F+RQSL+ SS +GS
Sbjct: 383 DATVTFNGKSYNLPAWSVSILPDCKNVAFNTAKINSATESTAFARQSLKPDGGSSAELGS 442
Query: 444 GWSYINEPVGISKDDAFTKPGLLEQINTTADQSDYLWYSLSTNIKADEPLLEDGSKTVLH 503
WSYI EP+GISK DAF KPGLLEQINTTAD+SDYLWYSL T+IK DE L++GSK VLH
Sbjct: 443 QWSYIKEPIGISKADAFLKPGLLEQINTTADKSDYLWYSLRTDIKGDETFLDEGSKAVLH 502
Query: 504 VQSLGHALHAFINGKLVGSGYGSSSNAKVTVDFPIALAPGKNTFDLLSLTVGLQNYGAFY 563
++SLG ++AFINGKL GSG+G K+++D PI L G NT DLLS+TVGL NYGAF+
Sbjct: 503 IESLGQVVYAFINGKLAGSGHGKQ---KISLDIPINLVTGTNTIDLLSVTVGLANYGAFF 559
Query: 564 EKTGAGITGPVQLKGSGNGTNIDLSSQQWTYQTGLKGEELNFPSGSSTQWDSKSTLPKLQ 623
+ GAGITGPV LK + G++IDL+SQQWTYQ GLKGE+ + S++W SKS LP Q
Sbjct: 560 DLVGAGITGPVTLKSAKGGSSIDLASQQWTYQVGLKGEDTGLATVDSSEWVSKSPLPTKQ 619
Query: 624 PLVWYKTTFDAPAGSEPVAIDFTGMGKGEAWVNGQSIGRYWPTYVSQNGGCTDSCNYRGA 683
PL+WYKTTFDAP+GSEPVAIDFTG GKG AWVNGQSIGRYWPT ++ NGGCT+SC+YRG+
Sbjct: 620 PLIWYKTTFDAPSGSEPVAIDFTGTGKGIAWVNGQSIGRYWPTSIAGNGGCTESCDYRGS 679
Query: 684 YSSNKCLKNCGKPSQSLYHVPRSWLKSSGNTLVLFEEIGGDPTKISFVTKQLGSSLCSHV 743
Y +NKCLKNCGKPSQ+LYHVPRSWLK SGN LVLFEE+GGDPT+ISF TKQ GS+LC V
Sbjct: 680 YRANKCLKNCGKPSQTLYHVPRSWLKPSGNILVLFEEMGGDPTQISFATKQTGSNLCLTV 739
Query: 744 TDSHPLPVDMWGSDSKI--QRKPGPVLSLECPNPNQVISSIKFASFGTPLGTCGSFSRGR 801
+ SHP PVD W SDSKI + + PVLSL+CP QVI SIKFASFGTP GTCGSF++G
Sbjct: 740 SQSHPPPVDTWTSDSKISNRNRTRPVLSLKCPISTQVIFSIKFASFGTPKGTCGSFTQGH 799
Query: 802 CSSARSLSVVRQACVGSKSCSIGVSVNTFGDPCKGVMKSLAVEASCT 848
C+S+RSLS+V++AC+G +SC++ VS FG+PC+GV+KSLAVEASC+
Sbjct: 800 CNSSRSLSLVQKACIGLRSCNVEVSTRVFGEPCRGVVKSLAVEASCS 846
>gi|30683905|ref|NP_850121.1| beta-galactosidase 8 [Arabidopsis thaliana]
gi|152013364|sp|Q9SCV4.2|BGAL8_ARATH RecName: Full=Beta-galactosidase 8; Short=Lactase 8; AltName:
Full=Protein AR782; Flags: Precursor
gi|330253033|gb|AEC08127.1| beta-galactosidase 8 [Arabidopsis thaliana]
Length = 852
Score = 1346 bits (3484), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 634/827 (76%), Positives = 715/827 (86%), Gaps = 5/827 (0%)
Query: 24 GANVTYDHRAVVIGGKRRVLISGSIHYPRSTPEMWPDLIQKSKDGGLDVIETYVFWNLHE 83
ANVTYDHRA+VI GKR+VLISGSIHYPRSTPEMWP+LIQKSKDGGLDVIETYVFW+ HE
Sbjct: 29 AANVTYDHRALVIDGKRKVLISGSIHYPRSTPEMWPELIQKSKDGGLDVIETYVFWSGHE 88
Query: 84 PVRNQYNFEGRYDLVKFVKLVAEAGLYAHLRIGPYVCAEWNFGGFPLWLHFIPGIQFRTD 143
P +N+YNFEGRYDLVKFVKL A+AGLY HLRIGPYVCAEWN+GGFP+WLHF+PGI+FRTD
Sbjct: 89 PEKNKYNFEGRYDLVKFVKLAAKAGLYVHLRIGPYVCAEWNYGGFPVWLHFVPGIKFRTD 148
Query: 144 NEPFKAEMQRFTAKIVDMMKQEKLYASQGGPIILSQIENEYGNIDSAYGAAGKSYIKWAA 203
NEPFK EMQRFT KIVD+MKQEKLYASQGGPIILSQIENEYGNIDSAYGAA KSYIKW+A
Sbjct: 149 NEPFKEEMQRFTTKIVDLMKQEKLYASQGGPIILSQIENEYGNIDSAYGAAAKSYIKWSA 208
Query: 204 GMALSLDTGVPWVMCQQSDAPDPIINTCNGFYCDQFTPNSNNKPKMWTENWSGWFLSFGG 263
MALSLDTGVPW MCQQ+DAPDP+INTCNGFYCDQFTPNSNNKPKMWTENWSGWFL FG
Sbjct: 209 SMALSLDTGVPWNMCQQTDAPDPMINTCNGFYCDQFTPNSNNKPKMWTENWSGWFLGFGD 268
Query: 264 AVPYRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNFDRTSGGPFISTSYDYDAPLDEYGL 323
PYRPVEDLAFAVARF+QRGGTFQNYYMYHGGTNFDRTSGGP ISTSYDYDAP+DEYGL
Sbjct: 269 PSPYRPVEDLAFAVARFYQRGGTFQNYYMYHGGTNFDRTSGGPLISTSYDYDAPIDEYGL 328
Query: 324 IRQPKWGHLKDLHKAIKLCEAALVATDPTYPSLGPNLEATVYKTGSGLCSAFLANIGTNS 383
+RQPKWGHL+DLHKAIKLCE AL+ATDPT SLG NLEA VYKT SG C+AFLAN+ T S
Sbjct: 329 LRQPKWGHLRDLHKAIKLCEDALIATDPTITSLGSNLEAAVYKTESGSCAAFLANVDTKS 388
Query: 384 DVTVKFNGNSYLLPAWSVSILPDCKNVVFNTAKINSVTLVPSFSRQSLQVAADSSDAIGS 443
D TV FNG SY LPAWSVSILPDCKNV FNTAKINS T +F+RQSL+ SS +GS
Sbjct: 389 DATVTFNGKSYNLPAWSVSILPDCKNVAFNTAKINSATESTAFARQSLKPDGGSSAELGS 448
Query: 444 GWSYINEPVGISKDDAFTKPGLLEQINTTADQSDYLWYSLSTNIKADEPLLEDGSKTVLH 503
WSYI EP+GISK DAF KPGLLEQINTTAD+SDYLWYSL T+IK DE L++GSK VLH
Sbjct: 449 QWSYIKEPIGISKADAFLKPGLLEQINTTADKSDYLWYSLRTDIKGDETFLDEGSKAVLH 508
Query: 504 VQSLGHALHAFINGKLVGSGYGSSSNAKVTVDFPIALAPGKNTFDLLSLTVGLQNYGAFY 563
++SLG ++AFINGKL GSG+G K+++D PI L G NT DLLS+TVGL NYGAF+
Sbjct: 509 IESLGQVVYAFINGKLAGSGHGKQ---KISLDIPINLVTGTNTIDLLSVTVGLANYGAFF 565
Query: 564 EKTGAGITGPVQLKGSGNGTNIDLSSQQWTYQTGLKGEELNFPSGSSTQWDSKSTLPKLQ 623
+ GAGITGPV LK + G++IDL+SQQWTYQ GLKGE+ + S++W SKS LP Q
Sbjct: 566 DLVGAGITGPVTLKSAKGGSSIDLASQQWTYQVGLKGEDTGLATVDSSEWVSKSPLPTKQ 625
Query: 624 PLVWYKTTFDAPAGSEPVAIDFTGMGKGEAWVNGQSIGRYWPTYVSQNGGCTDSCNYRGA 683
PL+WYKTTFDAP+GSEPVAIDFTG GKG AWVNGQSIGRYWPT ++ NGGCT+SC+YRG+
Sbjct: 626 PLIWYKTTFDAPSGSEPVAIDFTGTGKGIAWVNGQSIGRYWPTSIAGNGGCTESCDYRGS 685
Query: 684 YSSNKCLKNCGKPSQSLYHVPRSWLKSSGNTLVLFEEIGGDPTKISFVTKQLGSSLCSHV 743
Y +NKCLKNCGKPSQ+LYHVPRSWLK SGN LVLFEE+GGDPT+ISF TKQ GS+LC V
Sbjct: 686 YRANKCLKNCGKPSQTLYHVPRSWLKPSGNILVLFEEMGGDPTQISFATKQTGSNLCLTV 745
Query: 744 TDSHPLPVDMWGSDSKI--QRKPGPVLSLECPNPNQVISSIKFASFGTPLGTCGSFSRGR 801
+ SHP PVD W SDSKI + + PVLSL+CP QVI SIKFASFGTP GTCGSF++G
Sbjct: 746 SQSHPPPVDTWTSDSKISNRNRTRPVLSLKCPISTQVIFSIKFASFGTPKGTCGSFTQGH 805
Query: 802 CSSARSLSVVRQACVGSKSCSIGVSVNTFGDPCKGVMKSLAVEASCT 848
C+S+RSLS+V++AC+G +SC++ VS FG+PC+GV+KSLAVEASC+
Sbjct: 806 CNSSRSLSLVQKACIGLRSCNVEVSTRVFGEPCRGVVKSLAVEASCS 852
>gi|6686888|emb|CAB64744.1| putative beta-galactosidase [Arabidopsis thaliana]
Length = 852
Score = 1346 bits (3483), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 634/827 (76%), Positives = 715/827 (86%), Gaps = 5/827 (0%)
Query: 24 GANVTYDHRAVVIGGKRRVLISGSIHYPRSTPEMWPDLIQKSKDGGLDVIETYVFWNLHE 83
ANVTYDHRA+VI GKR+VLISGSIHYPRSTPEMWP+LIQKSKDGGLDVIETYVFW+ HE
Sbjct: 29 AANVTYDHRALVIDGKRKVLISGSIHYPRSTPEMWPELIQKSKDGGLDVIETYVFWSGHE 88
Query: 84 PVRNQYNFEGRYDLVKFVKLVAEAGLYAHLRIGPYVCAEWNFGGFPLWLHFIPGIQFRTD 143
P +N+YNFEGRYDLVKFVKL A+AGLY HLRIGPYVCAEWN+GGFP+WLHF+PGI+FRTD
Sbjct: 89 PEKNKYNFEGRYDLVKFVKLAAKAGLYVHLRIGPYVCAEWNYGGFPVWLHFVPGIKFRTD 148
Query: 144 NEPFKAEMQRFTAKIVDMMKQEKLYASQGGPIILSQIENEYGNIDSAYGAAGKSYIKWAA 203
NEPFK EMQRFT KIVD+MKQEKLYASQGGPIILSQIENEYGNIDSAYGAA KSYIKW+A
Sbjct: 149 NEPFKEEMQRFTTKIVDLMKQEKLYASQGGPIILSQIENEYGNIDSAYGAAAKSYIKWSA 208
Query: 204 GMALSLDTGVPWVMCQQSDAPDPIINTCNGFYCDQFTPNSNNKPKMWTENWSGWFLSFGG 263
MALSLDTGVPW MCQQ+DAPDP+INTCNGFYCDQFTPNSNNKPKMWTENWSGWFL FG
Sbjct: 209 SMALSLDTGVPWNMCQQTDAPDPMINTCNGFYCDQFTPNSNNKPKMWTENWSGWFLGFGD 268
Query: 264 AVPYRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNFDRTSGGPFISTSYDYDAPLDEYGL 323
PYRPVEDLAFAVARF+QRGGTFQNYYMYHGGTNFDRTSGGP ISTSYDYDAP+DEYGL
Sbjct: 269 PSPYRPVEDLAFAVARFYQRGGTFQNYYMYHGGTNFDRTSGGPLISTSYDYDAPIDEYGL 328
Query: 324 IRQPKWGHLKDLHKAIKLCEAALVATDPTYPSLGPNLEATVYKTGSGLCSAFLANIGTNS 383
+RQPKWGHL+DLHKAIKLCE AL+ATDPT SLG NLEA VYKT SG C+AFLAN+ T S
Sbjct: 329 LRQPKWGHLRDLHKAIKLCEDALIATDPTITSLGSNLEAAVYKTESGSCAAFLANVDTKS 388
Query: 384 DVTVKFNGNSYLLPAWSVSILPDCKNVVFNTAKINSVTLVPSFSRQSLQVAADSSDAIGS 443
D TV FNG SY LPAWSVSILPDCKNV FNTAKINS T +F+RQSL+ SS +GS
Sbjct: 389 DATVTFNGKSYNLPAWSVSILPDCKNVAFNTAKINSATESTAFARQSLKPDGGSSAELGS 448
Query: 444 GWSYINEPVGISKDDAFTKPGLLEQINTTADQSDYLWYSLSTNIKADEPLLEDGSKTVLH 503
WSYI EP+GISK DAF KPGLLEQINTTAD+SDYLWYSL T+IK DE L++GSK VLH
Sbjct: 449 QWSYIKEPIGISKADAFLKPGLLEQINTTADKSDYLWYSLRTDIKGDETFLDEGSKAVLH 508
Query: 504 VQSLGHALHAFINGKLVGSGYGSSSNAKVTVDFPIALAPGKNTFDLLSLTVGLQNYGAFY 563
++SLG ++AFINGKL GSG+G K+++D PI L G NT DLLS+TVGL NYGAF+
Sbjct: 509 IESLGQVVYAFINGKLAGSGHGKQ---KISLDIPINLVTGTNTIDLLSVTVGLANYGAFF 565
Query: 564 EKTGAGITGPVQLKGSGNGTNIDLSSQQWTYQTGLKGEELNFPSGSSTQWDSKSTLPKLQ 623
+ GAGITGPV LK + G++IDL+SQQWTYQ GLKGE+ + S++W SKS LP Q
Sbjct: 566 DLMGAGITGPVTLKSAKGGSSIDLASQQWTYQVGLKGEDTGLATVDSSEWVSKSPLPTKQ 625
Query: 624 PLVWYKTTFDAPAGSEPVAIDFTGMGKGEAWVNGQSIGRYWPTYVSQNGGCTDSCNYRGA 683
PL+WYKTTFDAP+GSEPVAIDFTG GKG AWVNGQSIGRYWPT ++ NGGCT+SC+YRG+
Sbjct: 626 PLIWYKTTFDAPSGSEPVAIDFTGTGKGIAWVNGQSIGRYWPTSIAGNGGCTESCDYRGS 685
Query: 684 YSSNKCLKNCGKPSQSLYHVPRSWLKSSGNTLVLFEEIGGDPTKISFVTKQLGSSLCSHV 743
Y +NKCLKNCGKPSQ+LYHVPRSWLK SGN LVLFEE+GGDPT+ISF TKQ GS+LC V
Sbjct: 686 YRANKCLKNCGKPSQTLYHVPRSWLKPSGNILVLFEEMGGDPTQISFATKQTGSNLCLTV 745
Query: 744 TDSHPLPVDMWGSDSKI--QRKPGPVLSLECPNPNQVISSIKFASFGTPLGTCGSFSRGR 801
+ SHP PVD W SDSKI + + PVLSL+CP QVI SIKFASFGTP GTCGSF++G
Sbjct: 746 SQSHPPPVDTWTSDSKISNRNRTRPVLSLKCPISTQVIFSIKFASFGTPKGTCGSFTQGH 805
Query: 802 CSSARSLSVVRQACVGSKSCSIGVSVNTFGDPCKGVMKSLAVEASCT 848
C+S+RSLS+V++AC+G +SC++ VS FG+PC+GV+KSLAVEASC+
Sbjct: 806 CNSSRSLSLVQKACIGLRSCNVEVSTRVFGEPCRGVVKSLAVEASCS 852
>gi|356543464|ref|XP_003540180.1| PREDICTED: beta-galactosidase 8-like isoform 1 [Glycine max]
Length = 840
Score = 1338 bits (3463), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 648/841 (77%), Positives = 729/841 (86%), Gaps = 7/841 (0%)
Query: 7 LLLVLCWGFVVLATTSFGANVTYDHRAVVIGGKRRVLISGSIHYPRSTPEMWPDLIQKSK 66
++LVL W + F ANV YDHRA+VI GKRRVLISGSIHYPRSTPEMWPDLIQKSK
Sbjct: 6 IVLVLFWLLCIHTPKLFCANVEYDHRALVIDGKRRVLISGSIHYPRSTPEMWPDLIQKSK 65
Query: 67 DGGLDVIETYVFWNLHEPVRNQYNFEGRYDLVKFVKLVAEAGLYAHLRIGPYVCAEWNFG 126
DGGLDVIETYVFWNLHEPVR QY+F+GR DLVKFVK VA AGLY HLRIGPYVCAEWN+G
Sbjct: 66 DGGLDVIETYVFWNLHEPVRGQYDFDGRKDLVKFVKTVAAAGLYVHLRIGPYVCAEWNYG 125
Query: 127 GFPLWLHFIPGIQFRTDNEPFKAEMQRFTAKIVDMMKQEKLYASQGGPIILSQIENEYGN 186
GFP+WLHFIPGI+FRTDNEPFKAEM+RFTAKIVDM+KQEKLYASQGGP+ILSQIENEYGN
Sbjct: 126 GFPVWLHFIPGIKFRTDNEPFKAEMKRFTAKIVDMIKQEKLYASQGGPVILSQIENEYGN 185
Query: 187 IDSAYGAAGKSYIKWAAGMALSLDTGVPWVMCQQSDAPDPIINTCNGFYCDQFTPNSNNK 246
ID+AYGAAGKSYIKWAA MA SLDTGVPWVMC Q+DAPDPIINT NGFY D+FTPNSN K
Sbjct: 186 IDTAYGAAGKSYIKWAATMATSLDTGVPWVMCLQADAPDPIINTWNGFYGDEFTPNSNTK 245
Query: 247 PKMWTENWSGWFLSFGGAVPYRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNFDRTSGGP 306
PKMWTENWSGWFL FGGAVPYRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNFDR SGGP
Sbjct: 246 PKMWTENWSGWFLVFGGAVPYRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNFDRASGGP 305
Query: 307 FISTSYDYDAPLDEYGLIRQPKWGHLKDLHKAIKLCEAALVATDPTYPSLGPNLEATVYK 366
FI+TSYDYDAP+DEYG+IRQPKWGHLK++HKAIKLCE AL+ATDPT SLGPNLEA VYK
Sbjct: 306 FIATSYDYDAPIDEYGIIRQPKWGHLKEVHKAIKLCEEALIATDPTITSLGPNLEAAVYK 365
Query: 367 TGSGLCSAFLANIGTNSDVTVKFNGNSYLLPAWSVSILPDCKNVVFNTAKINSVTLVPSF 426
TGS +C+AFLAN+GT SDVTV F+GNSY LPAWSVSILPDCK+VV NTAKINS + + SF
Sbjct: 366 TGS-VCAAFLANVGTKSDVTVNFSGNSYHLPAWSVSILPDCKSVVLNTAKINSASAISSF 424
Query: 427 SRQSLQVAADSSDAIGSGWSYINEPVGISKDDAFTKPGLLEQINTTADQSDYLWYSLSTN 486
+ +S + SS+A +GWS+I+EPVGISK D+F++ GLLEQINTTAD+SDYLWYSLS +
Sbjct: 425 TTESSKEDIGSSEASSTGWSWISEPVGISKTDSFSQTGLLEQINTTADKSDYLWYSLSID 484
Query: 487 IKADEPLLEDGSKTVLHVQSLGHALHAFINGKLVGSGYGSSSNAKVTVDFPIALAPGKNT 546
KAD S+TVLH++SLGHALHAFINGKL GS G+S K TVD P+ L GKNT
Sbjct: 485 YKADA-----SSQTVLHIESLGHALHAFINGKLAGSQPGNSGKYKFTVDIPVTLVAGKNT 539
Query: 547 FDLLSLTVGLQNYGAFYEKTGAGITGPVQLKGSGNGTNIDLSSQQWTYQTGLKGEELNFP 606
DLLSLTVGLQNYGAF++ G GITGPV LKG NG +DLSSQ+WTYQ GL+GE+L
Sbjct: 540 IDLLSLTVGLQNYGAFFDTWGVGITGPVILKGFANGNTLDLSSQKWTYQVGLQGEDLGLS 599
Query: 607 SGSSTQWDSKSTLPKLQPLVWYKTTFDAPAGSEPVAIDFTGMGKGEAWVNGQSIGRYWPT 666
SGSS QW+ +ST PK QPL WYKTTF AP+GS+PVAIDFTGMGKGEAWVNGQ IGRYWPT
Sbjct: 600 SGSSGQWNLQSTFPKNQPLTWYKTTFSAPSGSDPVAIDFTGMGKGEAWVNGQRIGRYWPT 659
Query: 667 YVSQNGGCTDSCNYRGAYSSNKCLKNCGKPSQSLYHVPRSWLKSSGNTLVLFEEIGGDPT 726
YV+ + CTDSCNYRG YS++KC KNC KPSQ+LYHVPRSWLK SGN LVLFEE GGDPT
Sbjct: 660 YVASDASCTDSCNYRGPYSASKCRKNCEKPSQTLYHVPRSWLKPSGNILVLFEERGGDPT 719
Query: 727 KISFVTKQLGSSLCSHVTDSHPLPVDMWGSDSKIQRKPGPVLSLECPNPNQVISSIKFAS 786
+ISFVTKQ SLC+HV+DSHP PVD+W S+++ RK GPVLSL CP+ NQVISSIKFAS
Sbjct: 720 QISFVTKQT-ESLCAHVSDSHPPPVDLWNSETESGRKVGPVLSLTCPHDNQVISSIKFAS 778
Query: 787 FGTPLGTCGSFSRGRCSSARSLSVVRQACVGSKSCSIGVSVNTFGDPCKGVMKSLAVEAS 846
+GTPLGTCG+F GRCSS ++LS+V++AC+GS SCS+GVS +TFGDPC+G+ KSLAVEA+
Sbjct: 779 YGTPLGTCGNFYHGRCSSNKALSIVQKACIGSSSCSVGVSSDTFGDPCRGMAKSLAVEAT 838
Query: 847 C 847
C
Sbjct: 839 C 839
>gi|385203117|gb|ADO34790.3| beta-galactosidase STBG5 [Solanum lycopersicum]
Length = 852
Score = 1335 bits (3455), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 625/846 (73%), Positives = 720/846 (85%), Gaps = 9/846 (1%)
Query: 8 LLVLCWGFV---VLATTSFGANVTYDHRAVVIGGKRRVLISGSIHYPRSTPEMWPDLIQK 64
+++L +G V L TSF ANVTYDHRA+V+ G+RRVLISGSIHYPRSTP+MWPDLIQK
Sbjct: 11 VIMLVFGVVFLHCLVMTSFAANVTYDHRALVVDGRRRVLISGSIHYPRSTPDMWPDLIQK 70
Query: 65 SKDGGLDVIETYVFWNLHEPVRNQYNFEGRYDLVKFVKLVAEAGLYAHLRIGPYVCAEWN 124
SKDGGLDVIETYVFWNLHEPVRNQY+FEGR DL+ FVKLV +AGL+ H+RIGPYVCAEWN
Sbjct: 71 SKDGGLDVIETYVFWNLHEPVRNQYDFEGRKDLINFVKLVEKAGLFVHIRIGPYVCAEWN 130
Query: 125 FGGFPLWLHFIPGIQFRTDNEPFKAEMQRFTAKIVDMMKQEKLYASQGGPIILSQIENEY 184
+GGFPLWLHFIPGI+FRTDNEPFKAEM+RFTAKIVDM+KQE LYASQGGP+ILSQIENEY
Sbjct: 131 YGGFPLWLHFIPGIEFRTDNEPFKAEMKRFTAKIVDMIKQENLYASQGGPVILSQIENEY 190
Query: 185 GN--IDSAYGAAGKSYIKWAAGMALSLDTGVPWVMCQQSDAPDPIINTCNGFYCDQFTPN 242
GN I+S YG K Y+ WAA MA SL+TGVPWVMCQQ DAP +INTCNGFYCDQF N
Sbjct: 191 GNGDIESRYGPRAKPYVNWAASMATSLNTGVPWVMCQQPDAPPSVINTCNGFYCDQFKQN 250
Query: 243 SNNKPKMWTENWSGWFLSFGGAVPYRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNFDRT 302
S+ PKMWTENW+GWFLSFGG VPYRPVED+AFAVARFFQRGGTFQNYYMYHGGTNF RT
Sbjct: 251 SDKTPKMWTENWTGWFLSFGGPVPYRPVEDIAFAVARFFQRGGTFQNYYMYHGGTNFGRT 310
Query: 303 SGGPFISTSYDYDAPLDEYGLIRQPKWGHLKDLHKAIKLCEAALVATDPTYPSLGPNLEA 362
SGGPFI+TSYDYDAPLDEYGLI QPKWGHLKDLHKAIKLCEAA+VAT+P SLG N+E
Sbjct: 311 SGGPFIATSYDYDAPLDEYGLINQPKWGHLKDLHKAIKLCEAAMVATEPNITSLGSNIEV 370
Query: 363 TVYKTGSGLCSAFLANIGTNSDVTVKFNGNSYLLPAWSVSILPDCKNVVFNTAKINSVTL 422
+VYKT S C+AFLAN T SD V FNGNSY LP WSVSILPDCKNV F+TAKINS +
Sbjct: 371 SVYKTDS-QCAAFLANTATQSDAAVSFNGNSYHLPPWSVSILPDCKNVAFSTAKINSAST 429
Query: 423 VPSFSRQSLQVAADSSDAIGSGWSYINEPVGISKDDAFTKPGLLEQINTTADQSDYLWYS 482
+ +F +S + AD+S SGW+ +NEPVGIS ++AFT+ GLLEQINTTAD+SDYLWYS
Sbjct: 430 ISTFVTRSSE--ADASGGSLSGWTSVNEPVGISNENAFTRMGLLEQINTTADKSDYLWYS 487
Query: 483 LSTNIKADEPLLEDGSKTVLHVQSLGHALHAFINGKLVGSGYGSSSNAKVTVDFPIALAP 542
LS NIK DEP L+DGS TVLHV++LGH LHA+INGKL GSG G+S ++ T++ P+ L P
Sbjct: 488 LSVNIKNDEPFLQDGSATVLHVKTLGHVLHAYINGKLSGSGKGNSRHSNFTIEVPVTLVP 547
Query: 543 GKNTFDLLSLTVGLQNYGAFYEKTGAGITGPVQLKGSGNGTNIDLSSQQWTYQTGLKGEE 602
G+N DLLS TVGLQNYGAF++ GAGITGPVQLKG NG+ DLSS+QWTYQ GLKGE+
Sbjct: 548 GENKIDLLSATVGLQNYGAFFDLKGAGITGPVQLKGFKNGSTTDLSSKQWTYQVGLKGED 607
Query: 603 LNFPSGSSTQWDSKSTLPKLQPLVWYKTTFDAPAGSEPVAIDFTGMGKGEAWVNGQSIGR 662
L +G ST W S++ LP QPL+WYK +FDAPAG P+++DFTGMGKGEAWVNGQSIGR
Sbjct: 608 LGLSNGGSTLWKSQTALPTNQPLIWYKASFDAPAGDTPLSMDFTGMGKGEAWVNGQSIGR 667
Query: 663 YWPTYVSQNGGCTDSCNYRGAYSSNKCLKNCGKPSQSLYHVPRSWLKSSGNTLVLFEEIG 722
+WP Y++ N GCTD CNYRG Y++ KCLKNCGKPSQ LYHVPRSWLKSSGN LVLFEE+G
Sbjct: 668 FWPAYIAPNDGCTDPCNYRGGYNAEKCLKNCGKPSQLLYHVPRSWLKSSGNVLVLFEEMG 727
Query: 723 GDPTKISFVTKQLGSSLCSHVTDSHPLPVDMWGSDSKIQRKPGPVLSLECPNPNQVISSI 782
GDPTK+SF T+++ S+CS ++D+HPLP+DMW S+ ++K GP LSLECP+PNQVISSI
Sbjct: 728 GDPTKLSFATREI-QSVCSRISDAHPLPIDMWASEDDARKKSGPTLSLECPHPNQVISSI 786
Query: 783 KFASFGTPLGTCGSFSRGRCSSARSLSVVRQACVGSKSCSIGVSVNTFGDPCKGVMKSLA 842
KFASFGTP GTCGSF GRCSS+ +LS+V++AC+GSKSCS+GVS+N FGDPCKGV KSLA
Sbjct: 787 KFASFGTPQGTCGSFIHGRCSSSNALSIVKKACIGSKSCSLGVSINAFGDPCKGVAKSLA 846
Query: 843 VEASCT 848
VEASCT
Sbjct: 847 VEASCT 852
>gi|356550173|ref|XP_003543463.1| PREDICTED: beta-galactosidase 8-like isoform 2 [Glycine max]
Length = 830
Score = 1335 bits (3454), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 652/841 (77%), Positives = 727/841 (86%), Gaps = 17/841 (2%)
Query: 7 LLLVLCWGFVVLATTSFGANVTYDHRAVVIGGKRRVLISGSIHYPRSTPEMWPDLIQKSK 66
++LVL W + + T F ANV YDHRA+VI GKRRVLISGSIHYPRSTPEMWPDLIQKSK
Sbjct: 6 IVLVLFWLLCIHSPTLFCANVEYDHRALVIDGKRRVLISGSIHYPRSTPEMWPDLIQKSK 65
Query: 67 DGGLDVIETYVFWNLHEPVRNQYNFEGRYDLVKFVKLVAEAGLYAHLRIGPYVCAEWNFG 126
DGGLDVIETYVFWNL+EPVR QY+F+GR DLVKFVK VA AGLY HLRIGPYVCAEWN+G
Sbjct: 66 DGGLDVIETYVFWNLNEPVRGQYDFDGRKDLVKFVKTVAAAGLYVHLRIGPYVCAEWNYG 125
Query: 127 GFPLWLHFIPGIQFRTDNEPFKAEMQRFTAKIVDMMKQEKLYASQGGPIILSQIENEYGN 186
GFPLWLHFIPGI+FRTDNEPFKAEM+RFTAKIVDM+K+E LYASQGGP+ILSQIENEYGN
Sbjct: 126 GFPLWLHFIPGIKFRTDNEPFKAEMKRFTAKIVDMIKEENLYASQGGPVILSQIENEYGN 185
Query: 187 IDSAYGAAGKSYIKWAAGMALSLDTGVPWVMCQQSDAPDPIINTCNGFYCDQFTPNSNNK 246
IDSAYGAAGKSYIKWAA MA SLDTGVPWVMCQQ+DAPDPIINTCNGFYCDQFTPNSN K
Sbjct: 186 IDSAYGAAGKSYIKWAATMATSLDTGVPWVMCQQADAPDPIINTCNGFYCDQFTPNSNTK 245
Query: 247 PKMWTENWSGWFLSFGGAVPYRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNFDRTSGGP 306
PKMWTENWSGWFL FGGAVPYRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNFDRTSGGP
Sbjct: 246 PKMWTENWSGWFLPFGGAVPYRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNFDRTSGGP 305
Query: 307 FISTSYDYDAPLDEYGLIRQPKWGHLKDLHKAIKLCEAALVATDPTYPSLGPNLEATVYK 366
FI+TSYDYDAP+DEYG+IRQPKWGHLK++HKAIKLCE AL+ATDPT SLGPNLEA VYK
Sbjct: 306 FIATSYDYDAPIDEYGIIRQPKWGHLKEVHKAIKLCEEALIATDPTITSLGPNLEAAVYK 365
Query: 367 TGSGLCSAFLANIGTNSDVTVKFNGNSYLLPAWSVSILPDCKNVVFNTAKINSVTLVPSF 426
TGS +C+AFLAN+ T SDVTV F+GNSY LPAWSVSILPDCKNVV NTAK+ + F
Sbjct: 366 TGS-VCAAFLANVDTKSDVTVNFSGNSYHLPAWSVSILPDCKNVVLNTAKVCLTNFISMF 424
Query: 427 SRQSLQVAADSSDAIGSGWSYINEPVGISKDDAFTKPGLLEQINTTADQSDYLWYSLSTN 486
+ SS +GWS+I+EPVGISK D+F + GLLEQINTTAD+SDYLWYSLS +
Sbjct: 425 ------MWLPSS----TGWSWISEPVGISKADSFPQTGLLEQINTTADKSDYLWYSLSID 474
Query: 487 IKADEPLLEDGSKTVLHVQSLGHALHAFINGKLVGSGYGSSSNAKVTVDFPIALAPGKNT 546
K D GS+TVLH++SLGHALHAFINGKL GS G+S K TVD P+ L GKNT
Sbjct: 475 YKGDA-----GSQTVLHIESLGHALHAFINGKLAGSQTGNSGKYKFTVDIPVTLVAGKNT 529
Query: 547 FDLLSLTVGLQNYGAFYEKTGAGITGPVQLKGSGNGTNIDLSSQQWTYQTGLKGEELNFP 606
DLLSLTVGLQNYGAF++ GAGITGPV LKG NG +DLS Q+WTYQ GLKGE+L
Sbjct: 530 IDLLSLTVGLQNYGAFFDTWGAGITGPVILKGLANGNTLDLSYQKWTYQVGLKGEDLGLS 589
Query: 607 SGSSTQWDSKSTLPKLQPLVWYKTTFDAPAGSEPVAIDFTGMGKGEAWVNGQSIGRYWPT 666
SGSS QW+S+ST PK QPL+WYKTTF AP+GS+PVAIDFTGMGKGEAWVNGQSIGRYWPT
Sbjct: 590 SGSSGQWNSQSTFPKNQPLIWYKTTFAAPSGSDPVAIDFTGMGKGEAWVNGQSIGRYWPT 649
Query: 667 YVSQNGGCTDSCNYRGAYSSNKCLKNCGKPSQSLYHVPRSWLKSSGNTLVLFEEIGGDPT 726
YV+ + GCTDSCNYRG YS++KC +NCGKPSQ+LYHVPRSWLK SGN LVLFEE GGDPT
Sbjct: 650 YVASDAGCTDSCNYRGPYSASKCRRNCGKPSQTLYHVPRSWLKPSGNILVLFEEKGGDPT 709
Query: 727 KISFVTKQLGSSLCSHVTDSHPLPVDMWGSDSKIQRKPGPVLSLECPNPNQVISSIKFAS 786
+ISFVTKQ SLC+HV+DSHP PVD+W SD++ RK GPVLSL CP+ NQVISSIKFAS
Sbjct: 710 QISFVTKQT-ESLCAHVSDSHPPPVDLWNSDTESGRKVGPVLSLTCPHDNQVISSIKFAS 768
Query: 787 FGTPLGTCGSFSRGRCSSARSLSVVRQACVGSKSCSIGVSVNTFGDPCKGVMKSLAVEAS 846
+GTPLGTCG+F GRCSS ++LS+V++AC+GS SCS+GVS TFG+PC+GV KSLAVEA+
Sbjct: 769 YGTPLGTCGNFYHGRCSSNKALSIVQKACIGSSSCSVGVSSETFGNPCRGVAKSLAVEAT 828
Query: 847 C 847
C
Sbjct: 829 C 829
>gi|357453873|ref|XP_003597217.1| Beta-galactosidase [Medicago truncatula]
gi|355486265|gb|AES67468.1| Beta-galactosidase [Medicago truncatula]
Length = 833
Score = 1335 bits (3454), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 648/842 (76%), Positives = 726/842 (86%), Gaps = 16/842 (1%)
Query: 7 LLLVLCWGFVVLATTSFGANVTYDHRAVVIGGKRRVLISGSIHYPRSTPEMWPDLIQKSK 66
++LVL W F NV YDHRA+VI GKRRVLISGSIHYPRSTP+MWPDLIQKSK
Sbjct: 6 IVLVLLW----FLPKMFCTNVDYDHRALVIDGKRRVLISGSIHYPRSTPQMWPDLIQKSK 61
Query: 67 DGGLDVIETYVFWNLHEPVRNQYNFEGRYDLVKFVKLVAEAGLYAHLRIGPYVCAEWNFG 126
DGGLDVIETYVFWNLHEPV+ QY+F+GR DLVKFVK VAEAGLY HLRIGPYVCAEWN+G
Sbjct: 62 DGGLDVIETYVFWNLHEPVKGQYDFDGRKDLVKFVKAVAEAGLYVHLRIGPYVCAEWNYG 121
Query: 127 GFPLWLHFIPGIQFRTDNEPFKAEMQRFTAKIVDMMKQEKLYASQGGPIILSQIENEYGN 186
GFPLWLHFIPGI+FRTDNEPFKAEM+RFTAKIVD+MKQEKLYASQGGPIILSQIENEYGN
Sbjct: 122 GFPLWLHFIPGIKFRTDNEPFKAEMKRFTAKIVDLMKQEKLYASQGGPIILSQIENEYGN 181
Query: 187 IDSAYGAAGKSYIKWAAGMALSLDTGVPWVMCQQSDAPDPIINTCNGFYCDQFTPNSNNK 246
IDS YG+AGKSYI WAA MA SLDTGVPWVMCQQ DAPDPIINTCNGFYCDQFTPNSN K
Sbjct: 182 IDSHYGSAGKSYINWAAKMATSLDTGVPWVMCQQGDAPDPIINTCNGFYCDQFTPNSNTK 241
Query: 247 PKMWTENWSGWFLSFGGAVPYRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNFDRTSGGP 306
PKMWTENWSGWFLSFGGAVP+RPVEDLAFAVARFFQRGGTFQNYYMYHGGTNFDR++GGP
Sbjct: 242 PKMWTENWSGWFLSFGGAVPHRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNFDRSTGGP 301
Query: 307 FISTSYDYDAPLDEYGLIRQPKWGHLKDLHKAIKLCEAALVATDPTYPSLGPNLEATVYK 366
FI+TSYDYDAP+DEYG+IRQ KWGHLKD+HKAIKLCE AL+ATDP SLG NLEA VYK
Sbjct: 302 FIATSYDYDAPIDEYGIIRQQKWGHLKDVHKAIKLCEEALIATDPKISSLGQNLEAAVYK 361
Query: 367 TGSGLCSAFLANIGTNSDVTVKFNGNSYLLPAWSVSILPDCKNVVFNTAKINSVTLVPSF 426
TGS +C+AFLAN+ T +D TV F+GNSY LPAWSVSILPDCKNVV NTAKINS + + +F
Sbjct: 362 TGS-VCAAFLANVDTKNDKTVNFSGNSYHLPAWSVSILPDCKNVVLNTAKINSASAISNF 420
Query: 427 SRQSLQVAADSSDAIGSGWSYINEPVGISKDDAFTKPGLLEQINTTADQSDYLWYSLSTN 486
+ + SS S WS+INEPVGISKDD +K GLLEQINTTAD+SDYLWYSLS +
Sbjct: 421 VTEDISSLETSS----SKWSWINEPVGISKDDILSKTGLLEQINTTADRSDYLWYSLSLD 476
Query: 487 IKADEPLLEDGSKTVLHVQSLGHALHAFINGKLVGSGYGSSSNAKVTVDFPIALAPGKNT 546
+ AD+P GS+TVLH++SLGHALHAFINGKL G+ G+S +K+ VD PIAL GKN
Sbjct: 477 L-ADDP----GSQTVLHIESLGHALHAFINGKLAGNQAGNSDKSKLNVDIPIALVSGKNK 531
Query: 547 FDLLSLTVGLQNYGAFYEKTGAGITGPVQLKGSGNGTN-IDLSSQQWTYQTGLKGEELNF 605
DLLSLTVGLQNYGAF++ GAGITGPV LKG NG N +DLSS++WTYQ GLKGE+L
Sbjct: 532 IDLLSLTVGLQNYGAFFDTVGAGITGPVILKGLKNGNNTLDLSSRKWTYQIGLKGEDLGL 591
Query: 606 PSGSSTQWDSKSTLPKLQPLVWYKTTFDAPAGSEPVAIDFTGMGKGEAWVNGQSIGRYWP 665
SGSS W+S+ST PK QPLVWYKT FDAP+GS PVAIDFTGMGKGEAWVNGQSIGRYWP
Sbjct: 592 SSGSSGGWNSQSTYPKNQPLVWYKTNFDAPSGSNPVAIDFTGMGKGEAWVNGQSIGRYWP 651
Query: 666 TYVSQNGGCTDSCNYRGAYSSNKCLKNCGKPSQSLYHVPRSWLKSSGNTLVLFEEIGGDP 725
TYV+ N GCTDSCNYRG Y+S+KC KNCGKPSQ+LYHVPRS+LK +GNTLVLFEE GGDP
Sbjct: 652 TYVASNAGCTDSCNYRGPYTSSKCRKNCGKPSQTLYHVPRSFLKPNGNTLVLFEENGGDP 711
Query: 726 TKISFVTKQLGSSLCSHVTDSHPLPVDMWGSDSKIQRKPGPVLSLECPNPNQVISSIKFA 785
T+ISF TKQL S+CSHV+DSHP +D+W D++ K GP L L CPN NQVISSIKFA
Sbjct: 712 TQISFATKQL-ESVCSHVSDSHPPQIDLWNQDTESGGKVGPALLLSCPNHNQVISSIKFA 770
Query: 786 SFGTPLGTCGSFSRGRCSSARSLSVVRQACVGSKSCSIGVSVNTFGDPCKGVMKSLAVEA 845
S+GTPLGTCG+F RGRCSS ++LS+V++AC+GS+SCS+GVS +TFGDPC+GV KSLAVEA
Sbjct: 771 SYGTPLGTCGNFYRGRCSSNKALSIVKKACIGSRSCSVGVSTDTFGDPCRGVPKSLAVEA 830
Query: 846 SC 847
+C
Sbjct: 831 TC 832
>gi|449525184|ref|XP_004169598.1| PREDICTED: LOW QUALITY PROTEIN: beta-galactosidase 8-like [Cucumis
sativus]
Length = 844
Score = 1333 bits (3450), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 631/826 (76%), Positives = 726/826 (87%), Gaps = 4/826 (0%)
Query: 22 SFGANVTYDHRAVVIGGKRRVLISGSIHYPRSTPEMWPDLIQKSKDGGLDVIETYVFWNL 81
S NVTYDHRA+VI GKR+VL+SGS+HYPRSTPEMWP +IQKSKDGGLDVIETYVFWNL
Sbjct: 22 SLAVNVTYDHRALVIDGKRKVLVSGSLHYPRSTPEMWPGIIQKSKDGGLDVIETYVFWNL 81
Query: 82 HEPVRNQYNFEGRYDLVKFVKLVAEAGLYAHLRIGPYVCAEWNFGGFPLWLHFIPGIQFR 141
HEPVRNQY+FEGR DLVKF+KLV AGLY H+RIGPYVCAEWN+GGFP+WLHF+PG+QFR
Sbjct: 82 HEPVRNQYDFEGRKDLVKFIKLVGAAGLYVHVRIGPYVCAEWNYGGFPVWLHFVPGVQFR 141
Query: 142 TDNEPFKAEMQRFTAKIVDMMKQEKLYASQGGPIILSQIENEYGNIDSAYGAAGKSYIKW 201
TDNEPFKAEM+RFTAKIVD++KQEKLYASQGGPIILSQIENEYGN+ S++G+A KSY++W
Sbjct: 142 TDNEPFKAEMKRFTAKIVDVLKQEKLYASQGGPIILSQIENEYGNVQSSFGSAAKSYVQW 201
Query: 202 AAGMALSLDTGVPWVMCQQSDAPDPIINTCNGFYCDQFTPNSNNKPKMWTENWSGWFLSF 261
AA MA SL+TGVPWVMC Q DAPDPIINTCNGFYCDQFTPNSNNKPKMWTENWSGWFLSF
Sbjct: 202 AATMATSLNTGVPWVMCNQPDAPDPIINTCNGFYCDQFTPNSNNKPKMWTENWSGWFLSF 261
Query: 262 GGAVPYRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNFDRTSGGPFISTSYDYDAPLDEY 321
GGA+PYRPVEDLAFAVARF+Q GG+ QNYYMYHGGTNF RTSGGPFI+TSYDYDAP+DEY
Sbjct: 262 GGALPYRPVEDLAFAVARFYQTGGSLQNYYMYHGGTNFGRTSGGPFIATSYDYDAPIDEY 321
Query: 322 GLIRQPKWGHLKDLHKAIKLCEAALVATDPTYPSLGPNLEATVYKTGSGLCSAFLANIGT 381
GL+RQPKWGHL+D+HKAIK+CE ALV+TDP SLGPNLEATVYK+GS CSAFLAN+ T
Sbjct: 322 GLVRQPKWGHLRDVHKAIKMCEEALVSTDPAVTSLGPNLEATVYKSGSQ-CSAFLANVDT 380
Query: 382 NSDVTVKFNGNSYLLPAWSVSILPDCKNVVFNTAKINSVTLVPSFSRQSLQVAADSSDAI 441
SD TV FNGNSY LPAWSVSILPDCKNVV NTAKINSVT PSFS Q L+V +S+A
Sbjct: 381 QSDKTVTFNGNSYHLPAWSVSILPDCKNVVLNTAKINSVTTRPSFSNQPLKVDVSASEAF 440
Query: 442 GSGWSYINEPVGISKDDAFTKPGLLEQINTTADQSDYLWYSLSTNIKADEPLLEDGSKTV 501
SGWS+I+EP+GISK+++F GL EQINTTAD+SDYLWYSLST+IK DEP L +GS TV
Sbjct: 441 DSGWSWIDEPIGISKNNSFANLGLSEQINTTADKSDYLWYSLSTDIKGDEPYLANGSNTV 500
Query: 502 LHVQSLGHALHAFINGKLVGSGYGSSSNAKVTVDFPIALAPGKNTFDLLSLTVGLQNYGA 561
LHV SLGH LH FIN KL GSG GS ++KV++D PI L PGKNT DLLSLTVGLQNYGA
Sbjct: 501 LHVDSLGHVLHVFINKKLAGSGKGSGGSSKVSLDIPITLVPGKNTIDLLSLTVGLQNYGA 560
Query: 562 FYEKTGAGITGPVQLKGSGNGTNIDLSSQQWTYQTGLKGEELNFPSGSSTQWDSKSTLPK 621
F+E GAG+TGPV+L+ N +DLSS QWTYQ GL+GE+L PSGS++QW S+ LPK
Sbjct: 561 FFELRGAGVTGPVKLENXKNNITVDLSSGQWTYQIGLEGEDLGLPSGSTSQWLSQPNLPK 620
Query: 622 LQPLVWYKTTFDAPAGSEPVAIDFTGMGKGEAWVNGQSIGRYWPTYVSQNGGCTDSCNYR 681
+PL WYKTTFDAPAGS+P+A+DFTG GKGEAW+NG SIGRYWP+Y++ +G CT C+Y+
Sbjct: 621 NKPLTWYKTTFDAPAGSDPLALDFTGFGKGEAWINGHSIGRYWPSYIA-SGQCTSYCDYK 679
Query: 682 GAYSSNKCLKNCGKPSQSLYHVPRSWLKSSGNTLVLFEEIGGDPTKISFVTKQLGSSLCS 741
GAYS+NKCL+NCGKPSQ+LYHVP+SWLK +GNTLVLFEEIG DPT+++F +KQLG SLCS
Sbjct: 680 GAYSANKCLRNCGKPSQTLYHVPQSWLKPTGNTLVLFEEIGSDPTRLTFASKQLG-SLCS 738
Query: 742 HVTDSHPLPVDMWGSDSKIQRKPGPVLSLECPNPNQVISSIKFASFGTPLGTCGSFSRGR 801
HV++SHP PV+MW SDSK Q+K GPVLSLECP+P+QVISSIKFASFGTP GTCGSFS G+
Sbjct: 739 HVSESHPPPVEMWSSDSK-QQKTGPVLSLECPSPSQVISSIKFASFGTPRGTCGSFSHGQ 797
Query: 802 CSSARSLSVVRQACVGSKSCSIGVSVNTFGDPCKGVMKSLAVEASC 847
CS+ +LS+V++AC+GSKSCSI VS+ FGDPC+G KSLAVEA C
Sbjct: 798 CSTRNALSIVQKACIGSKSCSIDVSIKAFGDPCRGKTKSLAVEAYC 843
>gi|350537827|ref|NP_001234312.1| TBG5 protein precursor [Solanum lycopersicum]
gi|7939623|gb|AAF70824.1|AF154423_1 putative beta-galactosidase [Solanum lycopersicum]
Length = 852
Score = 1333 bits (3449), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 624/846 (73%), Positives = 718/846 (84%), Gaps = 9/846 (1%)
Query: 8 LLVLCWGFV---VLATTSFGANVTYDHRAVVIGGKRRVLISGSIHYPRSTPEMWPDLIQK 64
+++L +G V L TSF ANVTYDHRA+V+ G+RRVLISGSIHYPRSTP+MWPDLIQK
Sbjct: 11 VIMLVFGVVFLHCLVMTSFAANVTYDHRALVVDGRRRVLISGSIHYPRSTPDMWPDLIQK 70
Query: 65 SKDGGLDVIETYVFWNLHEPVRNQYNFEGRYDLVKFVKLVAEAGLYAHLRIGPYVCAEWN 124
SKDGGLDVIETYVFWNLHEPVRNQY+FEGR DL+ FVKLV AGL+ H+RIGPYVCAEWN
Sbjct: 71 SKDGGLDVIETYVFWNLHEPVRNQYDFEGRKDLINFVKLVERAGLFVHIRIGPYVCAEWN 130
Query: 125 FGGFPLWLHFIPGIQFRTDNEPFKAEMQRFTAKIVDMMKQEKLYASQGGPIILSQIENEY 184
+GGFPLWLHFIPGI+FRTDNEPFKAEM+RFTAKIVDM+KQE LYASQGGP+ILSQIENEY
Sbjct: 131 YGGFPLWLHFIPGIEFRTDNEPFKAEMKRFTAKIVDMIKQENLYASQGGPVILSQIENEY 190
Query: 185 GN--IDSAYGAAGKSYIKWAAGMALSLDTGVPWVMCQQSDAPDPIINTCNGFYCDQFTPN 242
GN I+S YG K Y+ WAA MA SL+TGVPWVMCQQ DAP +INTCNGFYCDQF N
Sbjct: 191 GNGDIESRYGPRAKPYVNWAASMATSLNTGVPWVMCQQPDAPPSVINTCNGFYCDQFKQN 250
Query: 243 SNNKPKMWTENWSGWFLSFGGAVPYRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNFDRT 302
S+ PKMWTENW+GWFLSFGG VPYRPVED+AFAVARFFQRGGTFQNYYMYHGGTNF RT
Sbjct: 251 SDKTPKMWTENWTGWFLSFGGPVPYRPVEDIAFAVARFFQRGGTFQNYYMYHGGTNFGRT 310
Query: 303 SGGPFISTSYDYDAPLDEYGLIRQPKWGHLKDLHKAIKLCEAALVATDPTYPSLGPNLEA 362
SGGPFI+TSYDYDAPLDEYGLI QPKWGHLKDLHKAIKLCEAA+VAT+P SLG N+E
Sbjct: 311 SGGPFIATSYDYDAPLDEYGLINQPKWGHLKDLHKAIKLCEAAMVATEPNVTSLGSNIEV 370
Query: 363 TVYKTGSGLCSAFLANIGTNSDVTVKFNGNSYLLPAWSVSILPDCKNVVFNTAKINSVTL 422
+VYKT S C+AFLAN T SD V FNGNSY LP WSVSILPDCKNV F+TAKINS +
Sbjct: 371 SVYKTDS-QCAAFLANTATQSDAAVSFNGNSYHLPPWSVSILPDCKNVAFSTAKINSAST 429
Query: 423 VPSFSRQSLQVAADSSDAIGSGWSYINEPVGISKDDAFTKPGLLEQINTTADQSDYLWYS 482
+ +F +S + AD+S SGW+ +NEPVGIS ++AFT+ GLLEQINTTAD+SDYLWYS
Sbjct: 430 ISTFVTRSSE--ADASGGSLSGWTSVNEPVGISNENAFTRMGLLEQINTTADKSDYLWYS 487
Query: 483 LSTNIKADEPLLEDGSKTVLHVQSLGHALHAFINGKLVGSGYGSSSNAKVTVDFPIALAP 542
LS NIK DEP L+DGS TVLHV++LGH LHA+ING+L GSG G+S ++ T++ P+ L P
Sbjct: 488 LSVNIKNDEPFLQDGSATVLHVKTLGHVLHAYINGRLSGSGKGNSRHSNFTIEVPVTLVP 547
Query: 543 GKNTFDLLSLTVGLQNYGAFYEKTGAGITGPVQLKGSGNGTNIDLSSQQWTYQTGLKGEE 602
G+N DLLS TVGLQNYGAF++ GAGITGPVQLKG NG+ DLSS+QWTYQ GLKGE+
Sbjct: 548 GENKIDLLSATVGLQNYGAFFDLKGAGITGPVQLKGFKNGSTTDLSSKQWTYQVGLKGED 607
Query: 603 LNFPSGSSTQWDSKSTLPKLQPLVWYKTTFDAPAGSEPVAIDFTGMGKGEAWVNGQSIGR 662
L +G ST W S++ LP QPL+WYK +FDAPAG P+++DFTGMGKGEAWVNGQSIGR
Sbjct: 608 LGLSNGGSTLWKSQTALPTNQPLIWYKASFDAPAGDTPLSMDFTGMGKGEAWVNGQSIGR 667
Query: 663 YWPTYVSQNGGCTDSCNYRGAYSSNKCLKNCGKPSQSLYHVPRSWLKSSGNTLVLFEEIG 722
+WP Y++ N GCTD CNYRG Y++ KCLKNCGKPSQ LYHVPRSWLKSSGN LVLFEE+G
Sbjct: 668 FWPAYIAPNDGCTDPCNYRGGYNAEKCLKNCGKPSQLLYHVPRSWLKSSGNVLVLFEEMG 727
Query: 723 GDPTKISFVTKQLGSSLCSHVTDSHPLPVDMWGSDSKIQRKPGPVLSLECPNPNQVISSI 782
GDPTK+SF T+++ S+CS +D+HPLP+DMW S+ ++K GP LSLECP+PNQVISSI
Sbjct: 728 GDPTKLSFATREI-QSVCSRTSDAHPLPIDMWASEDDARKKSGPTLSLECPHPNQVISSI 786
Query: 783 KFASFGTPLGTCGSFSRGRCSSARSLSVVRQACVGSKSCSIGVSVNTFGDPCKGVMKSLA 842
KFASFGTP GTCGSF GRCSS+ +LS+V++AC+GSKSCS+GVS+N FGDPCKGV KSLA
Sbjct: 787 KFASFGTPQGTCGSFIHGRCSSSNALSIVKKACIGSKSCSLGVSINAFGDPCKGVAKSLA 846
Query: 843 VEASCT 848
VEASCT
Sbjct: 847 VEASCT 852
>gi|449462081|ref|XP_004148770.1| PREDICTED: beta-galactosidase 8-like [Cucumis sativus]
Length = 844
Score = 1333 bits (3449), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 631/826 (76%), Positives = 726/826 (87%), Gaps = 4/826 (0%)
Query: 22 SFGANVTYDHRAVVIGGKRRVLISGSIHYPRSTPEMWPDLIQKSKDGGLDVIETYVFWNL 81
S NVTYDHRA+VI GKR+VL+SGS+HYPRSTPEMWP +IQKSKDGGLDVIETYVFWNL
Sbjct: 22 SLAVNVTYDHRALVIDGKRKVLVSGSLHYPRSTPEMWPGIIQKSKDGGLDVIETYVFWNL 81
Query: 82 HEPVRNQYNFEGRYDLVKFVKLVAEAGLYAHLRIGPYVCAEWNFGGFPLWLHFIPGIQFR 141
HEPVRNQY+FEGR DLVKF+KLV AGLY H+RIGPYVCAEWN+GGFP+WLHF+PG+QFR
Sbjct: 82 HEPVRNQYDFEGRKDLVKFIKLVGAAGLYVHVRIGPYVCAEWNYGGFPVWLHFVPGVQFR 141
Query: 142 TDNEPFKAEMQRFTAKIVDMMKQEKLYASQGGPIILSQIENEYGNIDSAYGAAGKSYIKW 201
TDNEPFKAEM+RFTAKIVD++KQEKLYASQGGPIILSQIENEYGN+ S++G+A KSY++W
Sbjct: 142 TDNEPFKAEMKRFTAKIVDVLKQEKLYASQGGPIILSQIENEYGNVQSSFGSAAKSYVQW 201
Query: 202 AAGMALSLDTGVPWVMCQQSDAPDPIINTCNGFYCDQFTPNSNNKPKMWTENWSGWFLSF 261
AA MA SL+TGVPWVMC Q DAPDPIINTCNGFYCDQFTPNSNNKPKMWTENWSGWFLSF
Sbjct: 202 AATMATSLNTGVPWVMCNQPDAPDPIINTCNGFYCDQFTPNSNNKPKMWTENWSGWFLSF 261
Query: 262 GGAVPYRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNFDRTSGGPFISTSYDYDAPLDEY 321
GGA+PYRPVEDLAFAVARF+Q GG+ QNYYMYHGGTNF RTSGGPFI+TSYDYDAP+DEY
Sbjct: 262 GGALPYRPVEDLAFAVARFYQTGGSLQNYYMYHGGTNFGRTSGGPFIATSYDYDAPIDEY 321
Query: 322 GLIRQPKWGHLKDLHKAIKLCEAALVATDPTYPSLGPNLEATVYKTGSGLCSAFLANIGT 381
GL+RQPKWGHL+D+HKAIK+CE ALV+TDP SLGPNLEATVYK+GS CSAFLAN+ T
Sbjct: 322 GLVRQPKWGHLRDVHKAIKMCEEALVSTDPAVTSLGPNLEATVYKSGSQ-CSAFLANVDT 380
Query: 382 NSDVTVKFNGNSYLLPAWSVSILPDCKNVVFNTAKINSVTLVPSFSRQSLQVAADSSDAI 441
SD TV FNGNSY LPAWSVSILPDCKNVV NTAKINSVT PSFS Q L+V +S+A
Sbjct: 381 QSDKTVTFNGNSYHLPAWSVSILPDCKNVVLNTAKINSVTTRPSFSNQPLKVDVSASEAF 440
Query: 442 GSGWSYINEPVGISKDDAFTKPGLLEQINTTADQSDYLWYSLSTNIKADEPLLEDGSKTV 501
SGWS+I+EP+GISK+++F GL EQINTTAD+SDYLWYSLST+IK DEP L +GS TV
Sbjct: 441 DSGWSWIDEPIGISKNNSFANLGLSEQINTTADKSDYLWYSLSTDIKGDEPYLANGSNTV 500
Query: 502 LHVQSLGHALHAFINGKLVGSGYGSSSNAKVTVDFPIALAPGKNTFDLLSLTVGLQNYGA 561
LHV SLGH LH FIN KL GSG GS ++KV++D PI L PGKNT DLLSLTVGLQNYGA
Sbjct: 501 LHVDSLGHVLHVFINKKLAGSGKGSGGSSKVSLDIPITLVPGKNTIDLLSLTVGLQNYGA 560
Query: 562 FYEKTGAGITGPVQLKGSGNGTNIDLSSQQWTYQTGLKGEELNFPSGSSTQWDSKSTLPK 621
F+E GAG+TGPV+L+ N +DLSS QWTYQ GL+GE+L PSGS++QW S+ LPK
Sbjct: 561 FFELRGAGVTGPVKLENQKNNITVDLSSGQWTYQIGLEGEDLGLPSGSTSQWLSQPNLPK 620
Query: 622 LQPLVWYKTTFDAPAGSEPVAIDFTGMGKGEAWVNGQSIGRYWPTYVSQNGGCTDSCNYR 681
+PL WYKTTFDAPAGS+P+A+DFTG GKGEAW+NG SIGRYWP+Y++ +G CT C+Y+
Sbjct: 621 NKPLTWYKTTFDAPAGSDPLALDFTGFGKGEAWINGHSIGRYWPSYIA-SGQCTSYCDYK 679
Query: 682 GAYSSNKCLKNCGKPSQSLYHVPRSWLKSSGNTLVLFEEIGGDPTKISFVTKQLGSSLCS 741
GAYS+NKCL+NCGKPSQ+LYHVP+SWLK +GNTLVLFEEIG DPT+++F +KQLG SLCS
Sbjct: 680 GAYSANKCLRNCGKPSQTLYHVPQSWLKPTGNTLVLFEEIGSDPTRLTFASKQLG-SLCS 738
Query: 742 HVTDSHPLPVDMWGSDSKIQRKPGPVLSLECPNPNQVISSIKFASFGTPLGTCGSFSRGR 801
HV++SHP PV+MW SDSK Q+K GPVLSLECP+P+QVISSIKFASFGTP GTCGSFS G+
Sbjct: 739 HVSESHPPPVEMWSSDSK-QQKTGPVLSLECPSPSQVISSIKFASFGTPRGTCGSFSHGQ 797
Query: 802 CSSARSLSVVRQACVGSKSCSIGVSVNTFGDPCKGVMKSLAVEASC 847
CS+ +LS+V++AC+GSKSCSI VS+ FGDPC+G KSLAVEA C
Sbjct: 798 CSTRNALSIVQKACIGSKSCSIDVSIKAFGDPCRGKTKSLAVEAYC 843
>gi|356539132|ref|XP_003538054.1| PREDICTED: beta-galactosidase 8-like [Glycine max]
Length = 836
Score = 1329 bits (3440), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 636/841 (75%), Positives = 719/841 (85%), Gaps = 11/841 (1%)
Query: 7 LLLVLCWGFVVLATTSFGANVTYDHRAVVIGGKRRVLISGSIHYPRSTPEMWPDLIQKSK 66
+LLVL W F + A +SFGANVTYDHRA+VI GKRRVL+SGSIHYPRSTPEMWPDLIQKSK
Sbjct: 6 ILLVLLWFFCIYAPSSFGANVTYDHRALVIDGKRRVLVSGSIHYPRSTPEMWPDLIQKSK 65
Query: 67 DGGLDVIETYVFWNLHEPVRNQYNFEGRYDLVKFVKLVAEAGLYAHLRIGPYVCAEWNFG 126
DGGLDVIETYVFWNLHEPVR QYNFEGR DLVKFVK+VA AGLY HLRIGPY CAEWN+G
Sbjct: 66 DGGLDVIETYVFWNLHEPVRGQYNFEGRGDLVKFVKVVAAAGLYVHLRIGPYACAEWNYG 125
Query: 127 GFPLWLHFIPGIQFRTDNEPFKAEMQRFTAKIVDMMKQEKLYASQGGPIILSQIENEYGN 186
GFPLWLHFIPGIQFRTDN+PF+AEM++FTAKIVD+MKQE LYASQGGPIILSQIENEYGN
Sbjct: 126 GFPLWLHFIPGIQFRTDNKPFEAEMKQFTAKIVDLMKQENLYASQGGPIILSQIENEYGN 185
Query: 187 IDSAYGAAGKSYIKWAAGMALSLDTGVPWVMCQQSDAPDPIINTCNGFYCDQFTPNSNNK 246
I++ YG A KSYIKWAA MA SL TGVPWVMCQQ +APDPIIN CNGFYCDQF PNSN K
Sbjct: 186 IEADYGPAAKSYIKWAASMATSLGTGVPWVMCQQQNAPDPIINACNGFYCDQFKPNSNTK 245
Query: 247 PKMWTENWSGWFLSFGGAVPYRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNFDRTSGGP 306
PK+WTE ++GWFL+FG AVP+RPVEDLAFAVARF+QRGGTFQNYYMYHGGTNF R SGGP
Sbjct: 246 PKIWTEGYTGWFLAFGDAVPHRPVEDLAFAVARFYQRGGTFQNYYMYHGGTNFGRASGGP 305
Query: 307 FISTSYDYDAPLDEYGLIRQPKWGHLKDLHKAIKLCEAALVATDPTYPSLGPNLEATVYK 366
F+++SYDYDAP+DEYG IRQPKWGHLKD+HKAIKLCE AL+ATDPT SLGPN+EA VYK
Sbjct: 306 FVASSYDYDAPIDEYGFIRQPKWGHLKDVHKAIKLCEEALIATDPTITSLGPNIEAAVYK 365
Query: 367 TGSGLCSAFLANIGTNSDVTVKFNGNSYLLPAWSVSILPDCKNVVFNTAKINSVTLVPSF 426
TG +C+AFLANI T SD TV FNGNSY LPAWSVSILPDCKNVV NTAKI S +++ SF
Sbjct: 366 TGV-VCAAFLANIAT-SDATVTFNGNSYHLPAWSVSILPDCKNVVLNTAKITSASMISSF 423
Query: 427 SRQSLQVAADSSDAIGSGWSYINEPVGISKDDAFTKPGLLEQINTTADQSDYLWYSLSTN 486
+ +SL+ D+ GS WS+I+EP+GISK D+F+ GLLEQINTTAD+SDYLWYSLS +
Sbjct: 424 TTESLKDVGSLDDS-GSRWSWISEPIGISKADSFSTFGLLEQINTTADRSDYLWYSLSID 482
Query: 487 IKADEPLLEDGSKTVLHVQSLGHALHAFINGKLVGSGYGSSSNAKVTVDFPIALAPGKNT 546
+ A G++T LH++SLGHALHAFINGKL GSG G+ A V VD PI L GKNT
Sbjct: 483 LDA-------GAQTFLHIKSLGHALHAFINGKLAGSGTGNHEKANVEVDIPITLVSGKNT 535
Query: 547 FDLLSLTVGLQNYGAFYEKTGAGITGPVQLKGSGNGTNIDLSSQQWTYQTGLKGEELNFP 606
DLLSLTVGLQNYGAF++ GAGITGPV LK NG+N+DLSS+QWTYQ GLK E+L
Sbjct: 536 IDLLSLTVGLQNYGAFFDTWGAGITGPVILKCLKNGSNVDLSSKQWTYQVGLKNEDLGLS 595
Query: 607 SGSSTQWDSKSTLPKLQPLVWYKTTFDAPAGSEPVAIDFTGMGKGEAWVNGQSIGRYWPT 666
SG S QW+S+STLP QPL WYKT F AP+G+ PVAIDFTGMGKGEAWVNGQSIGRYWPT
Sbjct: 596 SGCSGQWNSQSTLPTNQPLTWYKTNFVAPSGNNPVAIDFTGMGKGEAWVNGQSIGRYWPT 655
Query: 667 YVSQNGGCTDSCNYRGAYSSNKCLKNCGKPSQSLYHVPRSWLKSSGNTLVLFEEIGGDPT 726
Y S GGCTDSCNYRGAY ++KCLKNCGKPSQ+LYHVPRSWL+ NTLVLFEE GG+P
Sbjct: 656 YASPKGGCTDSCNYRGAYDASKCLKNCGKPSQTLYHVPRSWLRPDRNTLVLFEESGGNPK 715
Query: 727 KISFVTKQLGSSLCSHVTDSHPLPVDMWGSDSKIQRKPGPVLSLECPNPNQVISSIKFAS 786
+ISF TKQ+G S+CSHV++SHP PVD W S+++ RK PV+SLECP PNQV+SSIKFAS
Sbjct: 716 QISFATKQIG-SVCSHVSESHPPPVDSWNSNTESGRKVVPVVSLECPYPNQVVSSIKFAS 774
Query: 787 FGTPLGTCGSFSRGRCSSARSLSVVRQACVGSKSCSIGVSVNTFGDPCKGVMKSLAVEAS 846
FGTPLGTCG+F G CSS ++LS+V++AC+GS SC I +SVNTFGDPCKGV KSLAVEAS
Sbjct: 775 FGTPLGTCGNFKHGLCSSNKALSIVQKACIGSSSCRIELSVNTFGDPCKGVAKSLAVEAS 834
Query: 847 C 847
C
Sbjct: 835 C 835
>gi|357472237|ref|XP_003606403.1| Beta-galactosidase [Medicago truncatula]
gi|355507458|gb|AES88600.1| Beta-galactosidase [Medicago truncatula]
Length = 839
Score = 1328 bits (3436), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 646/844 (76%), Positives = 722/844 (85%), Gaps = 12/844 (1%)
Query: 7 LLLVLCWGFVVLATTSFGANVTYDHRAVVIGGKRRVLISGSIHYPRSTPEMWPDLIQKSK 66
++ VL W V SF +NVTYDHRA+VI GKRRVL+SGSIHYPRSTP+MWPDLIQKSK
Sbjct: 6 IVFVLLWFLGVYVPASFCSNVTYDHRALVIDGKRRVLMSGSIHYPRSTPQMWPDLIQKSK 65
Query: 67 DGGLDVIETYVFWNLHEPVRNQYNFEGRYDLVKFVKLVAEAGLYAHLRIGPYVCAEWNFG 126
DGG+DVIETYVFWNLHEPVR QYNFEGR DLV FVK VA AGLY HLRIGPYVCAEWN+G
Sbjct: 66 DGGIDVIETYVFWNLHEPVRGQYNFEGRGDLVGFVKAVAAAGLYVHLRIGPYVCAEWNYG 125
Query: 127 GFPLWLHFIPGIQFRTDNEPFKAEMQRFTAKIVDMMKQEKLYASQGGPIILSQIENEYGN 186
GFPLWLHFI GI+FRT+NEPFKAEM+RFTAKIVDMMKQE LYASQGGPIILSQIENEYGN
Sbjct: 126 GFPLWLHFIAGIKFRTNNEPFKAEMKRFTAKIVDMMKQENLYASQGGPIILSQIENEYGN 185
Query: 187 IDSAYGAAGKSYIKWAAGMALSLDTGVPWVMCQQSDAPDPIINTCNGFYCDQFTPNSNNK 246
ID+ A KSYI WAA MA SLDTGVPW+MCQQ++APDPIINTCN FYCDQFTPNS+NK
Sbjct: 186 IDTHDARAAKSYIDWAASMATSLDTGVPWIMCQQANAPDPIINTCNSFYCDQFTPNSDNK 245
Query: 247 PKMWTENWSGWFLSFGGAVPYRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNFDRTSGGP 306
PKMWTENWSGWFL+FGGAVPYRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNF RT+GGP
Sbjct: 246 PKMWTENWSGWFLAFGGAVPYRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNFGRTTGGP 305
Query: 307 FISTSYDYDAPLDEYGLIRQPKWGHLKDLHKAIKLCEAALVATDPTYPSLGPNLEATVYK 366
FISTSYDYDAP+DEYG IRQPKWGHLKDLHKAIKLCE AL+A+DPT S GPNLE VYK
Sbjct: 306 FISTSYDYDAPIDEYGDIRQPKWGHLKDLHKAIKLCEEALIASDPTITSPGPNLETAVYK 365
Query: 367 TGSGLCSAFLANIGTNSDVTVKFNGNSYLLPAWSVSILPDCKNVVFNTAKINSVTLVPSF 426
TG+ +CSAFLANIG SD TV FNGNSY LP WSVSILPDCKNVV NTAK+N+ +++ SF
Sbjct: 366 TGA-VCSAFLANIGM-SDATVTFNGNSYHLPGWSVSILPDCKNVVLNTAKVNTASMISSF 423
Query: 427 SRQSLQVAADSSDAIGSGWSYINEPVGISKDDAFTKPGLLEQINTTADQSDYLWYSLSTN 486
+ +SL+ DS D+ SGWS+I+EPVGIS DAFTK GLLEQINTTAD+SDYLWYSLS
Sbjct: 424 ATESLKEKVDSLDSSSSGWSWISEPVGISTPDAFTKSGLLEQINTTADRSDYLWYSLSI- 482
Query: 487 IKADEPLLED--GSKTVLHVQSLGHALHAFINGKLVGSGYGSSSNAKVTVDFPIALAPGK 544
+ ED G + VLH++SLGHALHAF+NGKL GS GSS NAKV VD PI L GK
Sbjct: 483 ------VYEDNAGDQPVLHIESLGHALHAFVNGKLAGSKAGSSGNAKVNVDIPITLVTGK 536
Query: 545 NTFDLLSLTVGLQNYGAFYEKTGAGITGPVQLKGSGNGTNIDLSSQQWTYQTGLKGEELN 604
NT DLLSLTVGLQNYGAFY+ GAGITGPV LKG NG+++DL+SQQWTYQ GL+GE +
Sbjct: 537 NTIDLLSLTVGLQNYGAFYDTVGAGITGPVILKGLKNGSSVDLTSQQWTYQVGLQGEFVG 596
Query: 605 FPSGSSTQWDSKSTLPKLQPLVWYKTTFDAPAGSEPVAIDFTGMGKGEAWVNGQSIGRYW 664
SG+ QW+S+S LP QPL WYKT F AP+GS PVAIDFTGMGKGEAWVNGQSIGRYW
Sbjct: 597 LSSGNVGQWNSQSNLPANQPLTWYKTNFVAPSGSNPVAIDFTGMGKGEAWVNGQSIGRYW 656
Query: 665 PTYVSQNGGCTDSCNYRGAYSSNKCLKNCGKPSQSLYHVPRSWLKSSGNTLVLFEEIGGD 724
PTY+S N GCTDSCNYRG YS++KCLKNCGKPSQ+LYHVPR+WLK NT VLFEE GGD
Sbjct: 657 PTYISPNSGCTDSCNYRGTYSASKCLKNCGKPSQTLYHVPRAWLKPDSNTFVLFEESGGD 716
Query: 725 PTKISFVTKQLGSSLCSHVTDSHPLPVDMWGSDSKIQRKPGPVLSLECPNPNQVISSIKF 784
PTKISF TKQ+ S+CSHVT+SHP PVD W S+++ +RK GPVLSLECP PNQ ISSIKF
Sbjct: 717 PTKISFGTKQI-ESVCSHVTESHPPPVDTWNSNAESERKVGPVLSLECPYPNQAISSIKF 775
Query: 785 ASFGTPLGTCGSFSRGRCSSARSLSVVRQACVGSKSCSIGVSVNTFGDPCKGVMKSLAVE 844
ASFGTP GTCG+++ G CSS R+LS+V++AC+GS SC+IGVS+NTFG+PC+GV KSLAVE
Sbjct: 776 ASFGTPRGTCGNYNHGSCSSNRALSIVQKACIGSSSCNIGVSINTFGNPCRGVTKSLAVE 835
Query: 845 ASCT 848
A+CT
Sbjct: 836 AACT 839
>gi|356543466|ref|XP_003540181.1| PREDICTED: beta-galactosidase 8-like isoform 2 [Glycine max]
Length = 848
Score = 1327 bits (3433), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 646/849 (76%), Positives = 727/849 (85%), Gaps = 15/849 (1%)
Query: 7 LLLVLCWGFVVLATTSFGANVTYDHRAVVIGGKRRVLISGSIHYPRSTPEMWPDLIQKSK 66
++LVL W + F ANV YDHRA+VI GKRRVLISGSIHYPRSTPEMWPDLIQKSK
Sbjct: 6 IVLVLFWLLCIHTPKLFCANVEYDHRALVIDGKRRVLISGSIHYPRSTPEMWPDLIQKSK 65
Query: 67 DGGLDVIETYVFWNLHEPVRNQYNFEGRYDLVKFVKLVAEAGLYAHLRIGPYVCAEWNFG 126
DGGLDVIETYVFWNLHEPVR QY+F+GR DLVKFVK VA AGLY HLRIGPYVCAEWN+G
Sbjct: 66 DGGLDVIETYVFWNLHEPVRGQYDFDGRKDLVKFVKTVAAAGLYVHLRIGPYVCAEWNYG 125
Query: 127 GFPLWLHFIPGIQFRTDNEPFKAEMQRFTAKIVDMMKQEKLYASQGGPIILSQIENEYGN 186
GFP+WLHFIPGI+FRTDNEPFKAEM+RFTAKIVDM+KQEKLYASQGGP+ILSQIENEYGN
Sbjct: 126 GFPVWLHFIPGIKFRTDNEPFKAEMKRFTAKIVDMIKQEKLYASQGGPVILSQIENEYGN 185
Query: 187 IDSAYGAAGKSYIKWAAGMALSLDTGVPWVMCQQSDAPDPIINTCNGFYCDQFTPNSNNK 246
ID+AYGAAGKSYIKWAA MA SLDTGVPWVMC Q+DAPDPIINT NGFY D+FTPNSN K
Sbjct: 186 IDTAYGAAGKSYIKWAATMATSLDTGVPWVMCLQADAPDPIINTWNGFYGDEFTPNSNTK 245
Query: 247 PKMWTENWSGWFLSFGGAVPYRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNFDRTSGGP 306
PKMWTENWSGWFL FGGAVPYRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNFDR SGGP
Sbjct: 246 PKMWTENWSGWFLVFGGAVPYRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNFDRASGGP 305
Query: 307 FISTSYDYDAPLDEYGLIRQPKWGHLKDLHKAIKLCEAALVATDPTYPSLGPNLEATVYK 366
FI+TSYDYDAP+DEYG+IRQPKWGHLK++HKAIKLCE AL+ATDPT SLGPNLEA VYK
Sbjct: 306 FIATSYDYDAPIDEYGIIRQPKWGHLKEVHKAIKLCEEALIATDPTITSLGPNLEAAVYK 365
Query: 367 TGSGLCSAFLANIGTNSDVTVKFNGNSYLLPAWSVSILPDCKNVVFNTAKINSVTLVPSF 426
TGS +C+AFLAN+GT SDVTV F+GNSY LPAWSVSILPDCK+VV NTAKINS + + SF
Sbjct: 366 TGS-VCAAFLANVGTKSDVTVNFSGNSYHLPAWSVSILPDCKSVVLNTAKINSASAISSF 424
Query: 427 SRQSLQVAADSSDAIGSGWSYINEPVGISKDDAFTKPGLLEQINTTADQSDYLWYSLSTN 486
+ +S + SS+A +GWS+I+EPVGISK D+F++ GLLEQINTTAD+SDYLWYSLS +
Sbjct: 425 TTESSKEDIGSSEASSTGWSWISEPVGISKTDSFSQTGLLEQINTTADKSDYLWYSLSID 484
Query: 487 IKADEPLLEDGSKTVLHVQSLGHALHAFINGKLVGSG--------YGSSSNAKVTVDFPI 538
KAD S+TVLH++SLGHALHAFINGKL G +S K TVD P+
Sbjct: 485 YKADA-----SSQTVLHIESLGHALHAFINGKLAGKYKLKHSQLIICNSGKYKFTVDIPV 539
Query: 539 ALAPGKNTFDLLSLTVGLQNYGAFYEKTGAGITGPVQLKGSGNGTNIDLSSQQWTYQTGL 598
L GKNT DLLSLTVGLQNYGAF++ G GITGPV LKG NG +DLSSQ+WTYQ GL
Sbjct: 540 TLVAGKNTIDLLSLTVGLQNYGAFFDTWGVGITGPVILKGFANGNTLDLSSQKWTYQVGL 599
Query: 599 KGEELNFPSGSSTQWDSKSTLPKLQPLVWYKTTFDAPAGSEPVAIDFTGMGKGEAWVNGQ 658
+GE+L SGSS QW+ +ST PK QPL WYKTTF AP+GS+PVAIDFTGMGKGEAWVNGQ
Sbjct: 600 QGEDLGLSSGSSGQWNLQSTFPKNQPLTWYKTTFSAPSGSDPVAIDFTGMGKGEAWVNGQ 659
Query: 659 SIGRYWPTYVSQNGGCTDSCNYRGAYSSNKCLKNCGKPSQSLYHVPRSWLKSSGNTLVLF 718
IGRYWPTYV+ + CTDSCNYRG YS++KC KNC KPSQ+LYHVPRSWLK SGN LVLF
Sbjct: 660 RIGRYWPTYVASDASCTDSCNYRGPYSASKCRKNCEKPSQTLYHVPRSWLKPSGNILVLF 719
Query: 719 EEIGGDPTKISFVTKQLGSSLCSHVTDSHPLPVDMWGSDSKIQRKPGPVLSLECPNPNQV 778
EE GGDPT+ISFVTKQ SLC+HV+DSHP PVD+W S+++ RK GPVLSL CP+ NQV
Sbjct: 720 EERGGDPTQISFVTKQT-ESLCAHVSDSHPPPVDLWNSETESGRKVGPVLSLTCPHDNQV 778
Query: 779 ISSIKFASFGTPLGTCGSFSRGRCSSARSLSVVRQACVGSKSCSIGVSVNTFGDPCKGVM 838
ISSIKFAS+GTPLGTCG+F GRCSS ++LS+V++AC+GS SCS+GVS +TFGDPC+G+
Sbjct: 779 ISSIKFASYGTPLGTCGNFYHGRCSSNKALSIVQKACIGSSSCSVGVSSDTFGDPCRGMA 838
Query: 839 KSLAVEASC 847
KSLAVEA+C
Sbjct: 839 KSLAVEATC 847
>gi|4510395|gb|AAD21482.1| putative beta-galactosidase [Arabidopsis thaliana]
Length = 839
Score = 1326 bits (3431), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 628/829 (75%), Positives = 709/829 (85%), Gaps = 16/829 (1%)
Query: 24 GANVTYDHRAVVIGGKRRVLISGSIHYPRSTPEMWPDLIQKSKDGGLDVIETYVFWNLHE 83
ANVTYDHRA+VI GKR+VLISGSIHYPRSTPEMWP+LIQKSKDGGLDVIETYVFW+ HE
Sbjct: 23 AANVTYDHRALVIDGKRKVLISGSIHYPRSTPEMWPELIQKSKDGGLDVIETYVFWSGHE 82
Query: 84 PVRNQYNFEGRYDLVKFVKLVAEAGLYAHLRIGPYVCAEWNFGGFPLWLHFIPGIQFRTD 143
P +N+YNFEGRYDLVKFVKL A+AGLY HLRIGPYVCAEWN+GGFP+WLHF+PGI+FRTD
Sbjct: 83 PEKNKYNFEGRYDLVKFVKLAAKAGLYVHLRIGPYVCAEWNYGGFPVWLHFVPGIKFRTD 142
Query: 144 NEPFKAEMQRFTAKIVDMMKQEKLYASQGGPIILSQIENEYGNIDSAYGAAGKSYIKWAA 203
NEPFK EMQRFT KIVD+MKQEKLYASQGGPIILSQIENEYGNIDSAYGAA KSYIKW+A
Sbjct: 143 NEPFKEEMQRFTTKIVDLMKQEKLYASQGGPIILSQIENEYGNIDSAYGAAAKSYIKWSA 202
Query: 204 GMALSLDTGVPWVMCQQSDAPDPIINTCNGFYCDQFTPNSNNKPKMWTENWSGWFLSFGG 263
MALSLDTGVPW MCQQ+DAPDP+INTCNGFYCDQFTPNSNNKPKMWTENWSGWFL FG
Sbjct: 203 SMALSLDTGVPWNMCQQTDAPDPMINTCNGFYCDQFTPNSNNKPKMWTENWSGWFLGFGD 262
Query: 264 AVPYRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNFDRTSGGPFISTSYDYDAPLDEYGL 323
PYRPVEDLAFAVARF+QRGGTFQNYYMYHGGTNFDRTSGGP ISTSYDYDAP+DEYGL
Sbjct: 263 PSPYRPVEDLAFAVARFYQRGGTFQNYYMYHGGTNFDRTSGGPLISTSYDYDAPIDEYGL 322
Query: 324 IRQPKWGHLKDLHKAIKLCEAALVATDPTYPSLGPNLEATVYKTGSGLCSAFLANIGTNS 383
+RQPKWGHL+DLHKAIKLCE AL+ATDPT SLG NLEA VYKT SG C+AFLAN+ T S
Sbjct: 323 LRQPKWGHLRDLHKAIKLCEDALIATDPTITSLGSNLEAAVYKTESGSCAAFLANVDTKS 382
Query: 384 DVTVKFNGNSYLLPAWSVSILPDCKNVVFNTAKI--NSVTLVPSFSRQSLQVAADSSDAI 441
D TV FNG SY LPAWSVSILPDCKNV FNTAK+ NS++ P SS +
Sbjct: 383 DATVTFNGKSYNLPAWSVSILPDCKNVAFNTAKVKFNSISKTPD---------GGSSAEL 433
Query: 442 GSGWSYINEPVGISKDDAFTKPGLLEQINTTADQSDYLWYSLSTNIKADEPLLEDGSKTV 501
GS WSYI EP+GISK DAF KPGLLEQINTTAD+SDYLWYSL T+IK DE L++GSK V
Sbjct: 434 GSQWSYIKEPIGISKADAFLKPGLLEQINTTADKSDYLWYSLRTDIKGDETFLDEGSKAV 493
Query: 502 LHVQSLGHALHAFINGKLVGSGYGSSSNAKVTVDFPIALAPGKNTFDLLSLTVGLQNYGA 561
LH++SLG ++AFINGKL GSG+G K+++D PI L G NT DLLS+TVGL NYGA
Sbjct: 494 LHIESLGQVVYAFINGKLAGSGHGKQ---KISLDIPINLVTGTNTIDLLSVTVGLANYGA 550
Query: 562 FYEKTGAGITGPVQLKGSGNGTNIDLSSQQWTYQTGLKGEELNFPSGSSTQWDSKSTLPK 621
F++ GAGITGPV LK + G++IDL+SQQWTYQ GLKGE+ + S++W SKS LP
Sbjct: 551 FFDLVGAGITGPVTLKSAKGGSSIDLASQQWTYQVGLKGEDTGLATVDSSEWVSKSPLPT 610
Query: 622 LQPLVWYKTTFDAPAGSEPVAIDFTGMGKGEAWVNGQSIGRYWPTYVSQNGGCTDSCNYR 681
QPL+WYKTTFDAP+GSEPVAIDFTG GKG AWVNGQSIGRYWPT ++ NGGCT+SC+YR
Sbjct: 611 KQPLIWYKTTFDAPSGSEPVAIDFTGTGKGIAWVNGQSIGRYWPTSIAGNGGCTESCDYR 670
Query: 682 GAYSSNKCLKNCGKPSQSLYHVPRSWLKSSGNTLVLFEEIGGDPTKISFVTKQLGSSLCS 741
G+Y +NKCLKNCGKPSQ+LYHVPRSWLK SGN LVLFEE+GGDPT+ISF TKQ GS+LC
Sbjct: 671 GSYRANKCLKNCGKPSQTLYHVPRSWLKPSGNILVLFEEMGGDPTQISFATKQTGSNLCL 730
Query: 742 HVTDSHPLPVDMWGSDSKI--QRKPGPVLSLECPNPNQVISSIKFASFGTPLGTCGSFSR 799
V+ SHP PVD W SDSKI + + PVLSL+CP QVI SIKFASFGTP GTCGSF++
Sbjct: 731 TVSQSHPPPVDTWTSDSKISNRNRTRPVLSLKCPISTQVIFSIKFASFGTPKGTCGSFTQ 790
Query: 800 GRCSSARSLSVVRQACVGSKSCSIGVSVNTFGDPCKGVMKSLAVEASCT 848
G C+S+RSLS+V++AC+G +SC++ VS FG+PC+GV+KSLAVEASC+
Sbjct: 791 GHCNSSRSLSLVQKACIGLRSCNVEVSTRVFGEPCRGVVKSLAVEASCS 839
>gi|357453869|ref|XP_003597215.1| Beta-galactosidase [Medicago truncatula]
gi|355486263|gb|AES67466.1| Beta-galactosidase [Medicago truncatula]
Length = 866
Score = 1308 bits (3384), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 633/865 (73%), Positives = 714/865 (82%), Gaps = 35/865 (4%)
Query: 7 LLLVLCWGFVVLATTSFGANVTYDHRAVVIGGKRRVLISGSIHYPRSTPEMWPDLIQKSK 66
++LVL W F NV YDHRA+VI GKRRVLISGSIHYPRSTP+MWPDLIQKSK
Sbjct: 6 IVLVLLW----FLPKMFCTNVDYDHRALVIDGKRRVLISGSIHYPRSTPQMWPDLIQKSK 61
Query: 67 DGGLDVIETYVFWNLHEPVRNQYNFEGRYDLVKFVKLVAEAGLYAHLRIGPYVCAEWNFG 126
DGGLDVIETYVFWNLHEPV+ QY+F+GR DLVKFVK VAEAGLY HLRIGPYVCAEWN+G
Sbjct: 62 DGGLDVIETYVFWNLHEPVKGQYDFDGRKDLVKFVKAVAEAGLYVHLRIGPYVCAEWNYG 121
Query: 127 GFPLWLHFIPGIQFRTDNEPFK--AEMQRFTAKIVDMMKQEKLYASQGGPIILSQIENEY 184
GFPLWLHFIPGI+FRTDNEPFK AEM+RFTAKIVD+MKQEKLYASQGGPIILSQIENEY
Sbjct: 122 GFPLWLHFIPGIKFRTDNEPFKVEAEMKRFTAKIVDLMKQEKLYASQGGPIILSQIENEY 181
Query: 185 GNIDSAYGAAGKSYIKWAAGMALSLDTGVPWVMCQQSDAPDPIINTCNGFYCDQFTPNSN 244
G+IDSAYG+AGKSYI WAA MA SLDTGVPWVMCQQ DAPD IINTCNGFYCDQFTPNSN
Sbjct: 182 GDIDSAYGSAGKSYINWAAKMATSLDTGVPWVMCQQEDAPDSIINTCNGFYCDQFTPNSN 241
Query: 245 NKPKMWTENWSGWFLSFGGAVPYRPVEDLAFAVARFFQRGGTFQNYYM------------ 292
KPKMWTENWS W+L FGG P+RPVEDLAFAVARFFQRGGTFQNYYM
Sbjct: 242 TKPKMWTENWSAWYLLFGGGFPHRPVEDLAFAVARFFQRGGTFQNYYMVLQPEMFFTSSI 301
Query: 293 ---------YHGGTNFDRTSGGPFISTSYDYDAPLDEYGLIRQPKWGHLKDLHKAIKLCE 343
YHGGTNFDR++GGPFI+TSYD+DAP+DEYG+IRQPKWGHLKDLHKA+KLCE
Sbjct: 302 YYMVLFLRPYHGGTNFDRSTGGPFIATSYDFDAPIDEYGIIRQPKWGHLKDLHKAVKLCE 361
Query: 344 AALVATDPTYPSLGPNLEATVYKTGSGLCSAFLANIGTNSDVTVKFNGNSYLLPAWSVSI 403
AL+AT+P SLGPNLEA VYKTGS +C+AFLAN+ T SD TV F+GNSY LPAWSVSI
Sbjct: 362 EALIATEPKITSLGPNLEAAVYKTGS-VCAAFLANVDTKSDKTVNFSGNSYHLPAWSVSI 420
Query: 404 LPDCKNVVFNTAKINSVTLVPSFSRQSLQVAADSSDAIGSGWSYINEPVGISKDDAFTKP 463
LPDCKNVV NTAKINS + + +F +S + S + S WS+INEPVGISKDD F+K
Sbjct: 421 LPDCKNVVLNTAKINSASAISNFVTKSSKEDISSLETSSSKWSWINEPVGISKDDIFSKT 480
Query: 464 GLLEQINTTADQSDYLWYSLSTNIKADEPLLEDGSKTVLHVQSLGHALHAFINGKLVGSG 523
GLLEQIN TAD+SDYLWYSLS ++K D GS+TVLH++SLGHALHAF+NGKL GS
Sbjct: 481 GLLEQINITADRSDYLWYSLSVDLKDDL-----GSQTVLHIESLGHALHAFVNGKLAGSH 535
Query: 524 YGSSSNAKVTVDFPIALAPGKNTFDLLSLTVGLQNYGAFYEKTGAGITGPVQLKGSGNGT 583
G+ K+ VD PI + G N DLLSLTVGLQNYGAF+++ GAGITGPV LKG NG
Sbjct: 536 TGNKDKPKLNVDIPIKVIYGNNQIDLLSLTVGLQNYGAFFDRWGAGITGPVTLKGLKNGN 595
Query: 584 N-IDLSSQQWTYQTGLKGEELNFPSGSSTQWDSKSTLPKLQPLVWYKTTFDAPAGSEPVA 642
N +DLSSQ+WTYQ GLKGE+L SGSS W+S+ST PK QPL+WYKT FDAP+GS PVA
Sbjct: 596 NTLDLSSQKWTYQVGLKGEDLGLSSGSSEGWNSQSTFPKNQPLIWYKTNFDAPSGSNPVA 655
Query: 643 IDFTGMGKGEAWVNGQSIGRYWPTYVSQNGGCTDSCNYRGAYSSNKCLKNCGKPSQSLYH 702
IDFTGMGKGEAWVNGQSIGRYWPTYV+ N CTDSCNYRG ++ KC NCGKPSQ+LYH
Sbjct: 656 IDFTGMGKGEAWVNGQSIGRYWPTYVASNADCTDSCNYRGPFTQTKCHMNCGKPSQTLYH 715
Query: 703 VPRSWLKSSGNTLVLFEEIGGDPTKISFVTKQLGSSLCSHVTDSHPLPVDMWGSDSKIQR 762
VPRS+LK +GNTLVLFEE GGDPT+I+F TKQL SLC+HV+DSHP +D+W D+
Sbjct: 716 VPRSFLKPNGNTLVLFEENGGDPTQIAFATKQL-ESLCAHVSDSHPPQIDLWNQDTTSWG 774
Query: 763 KPGPVLSLECPNPNQVISSIKFASFGTPLGTCGSFSRGRCSSARSLSVVRQACVGSKSCS 822
K GP L L CPN NQVI SIKFAS+GTPLGTCG+F RGRCSS ++LS+V++AC+GS+SCS
Sbjct: 775 KVGPALLLNCPNHNQVIFSIKFASYGTPLGTCGNFYRGRCSSNKALSIVKKACIGSRSCS 834
Query: 823 IGVSVNTFGDPCKGVMKSLAVEASC 847
IGVS +TFGDPC+GV KSLAVEA+C
Sbjct: 835 IGVSTDTFGDPCRGVPKSLAVEATC 859
>gi|61614851|gb|AAQ21371.2| beta-galactosidase [Sandersonia aurantiaca]
Length = 818
Score = 1253 bits (3241), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 601/821 (73%), Positives = 684/821 (83%), Gaps = 10/821 (1%)
Query: 35 VIGGKRRVLISGSIHYPRSTPEMWPDLIQKSKDGGLDVIETYVFWNLHEPVRNQYNFEGR 94
VI G RRVLISGSIHYPRSTPEMWPDLI KSK GGLD+IETYVFW+LHEP++ QY+F+GR
Sbjct: 1 VIDGTRRVLISGSIHYPRSTPEMWPDLIDKSKSGGLDIIETYVFWDLHEPLQGQYDFQGR 60
Query: 95 YDLVKFVKLVAEAGLYAHLRIGPYVCAEWNFGGFPLWLHFIPGIQFRTDNEPFKAEMQRF 154
DLV+F+K V EAGLY HLRIGPY CAEWN+GGFPLWLHFIPGI+FRTDN+PFK EMQRF
Sbjct: 61 KDLVRFIKTVGEAGLYVHLRIGPYACAEWNYGGFPLWLHFIPGIKFRTDNKPFKDEMQRF 120
Query: 155 TAKIVDMMKQEKLYASQGGPIILSQIENEYGNIDSAYGAAGKSYIKWAAGMALSLDTGVP 214
T KIVD+MKQE LYASQGGPIILSQIENEYGNID AYGAA KSYI WAA MA SLDTGVP
Sbjct: 121 TTKIVDLMKQENLYASQGGPIILSQIENEYGNIDFAYGAAAKSYINWAASMATSLDTGVP 180
Query: 215 WVMCQQSDAPDPIINTCNGFYCDQFTPNSNNKPKMWTENWSGWFLSFGGAVPYRPVEDLA 274
WVMCQQ+DAPDPIINTCNGFYCDQF+PNSNNKPK+WTENWSGWFLSFGG VP RPVEDLA
Sbjct: 181 WVMCQQTDAPDPIINTCNGFYCDQFSPNSNNKPKIWTENWSGWFLSFGGPVPQRPVEDLA 240
Query: 275 FAVARFFQRGGTFQNYYMYHGGTNFDRTSGGPFISTSYDYDAPLDEYGLIRQPKWGHLKD 334
FAVARFFQRGGTFQNYYMY G NF TSGGPFI+TSYDYDAP+DEYG+ RQPKWGHLK+
Sbjct: 241 FAVARFFQRGGTFQNYYMYTWGNNFGHTSGGPFIATSYDYDAPIDEYGITRQPKWGHLKE 300
Query: 335 LHKAIKLCEAALVATDPTYPSLGPNLEATVYKTGSGLCSAFLANIGTNSDVTVKFNGNSY 394
LHKAIKLCE ALVATD LGPNLEA VYKT SG+C+AFLANIGT SD TV FNG SY
Sbjct: 301 LHKAIKLCEPALVATDHHTLRLGPNLEAHVYKTASGVCAAFLANIGTQSDATVTFNGKSY 360
Query: 395 LLPAWSVSILPDCKNVVFNTAKINSVTL---VPSFSRQSLQVAAD--SSDAIGSGWSYIN 449
LPAWSVSILPDC+ VVFNTA+INS + + + +SL SS+ S WS++
Sbjct: 361 SLPAWSVSILPDCRTVVFNTAQINSQAIHSEMKYLNSESLTSDQQIGSSEVFQSDWSFVI 420
Query: 450 EPVGISKDDAFTKPGLLEQINTTADQSDYLWYSLSTNIKADEPLLEDGSKTVLHVQSLGH 509
EPVGISK +A K GLLEQINTTAD SDYLWYS+S I DEP L +G+++ LH +SLGH
Sbjct: 421 EPVGISKSNAIRKTGLLEQINTTADVSDYLWYSISIAIDGDEPFLSNGTQSNLHAESLGH 480
Query: 510 ALHAFINGKLVGSGYGSSSNAKVTVDFPIALAPGKNTFDLLSLTVGLQNYGAFYEKTGAG 569
LHAF+NGKL GSG G+S NAK+ + I L PG N+ DLLS TVGLQNYGAF++ GAG
Sbjct: 481 VLHAFVNGKLAGSGIGNSGNAKIIFEKLIMLTPGNNSIDLLSATVGLQNYGAFFDLMGAG 540
Query: 570 ITGPVQLKGSGNGTNIDLSSQQWTYQTGLKGEELNFP--SGSSTQWDSKSTLPKLQPLVW 627
ITGPV+LKG NGT +DLSS WTYQ GLKGE+L+ SG +QW S+STLPK QPL+W
Sbjct: 541 ITGPVKLKGQ-NGT-LDLSSNAWTYQIGLKGEDLSLHENSGDVSQWISESTLPKNQPLIW 598
Query: 628 YKTTFDAPAGSEPVAIDFTGMGKGEAWVNGQSIGRYWPTYVSQNGGCTDSCNYRGAYSSN 687
YKTTF+AP G++PVAIDFTGMGKGEAWVNGQSIGRYWPTY S GC+ +CNYRG YS++
Sbjct: 599 YKTTFNAPDGNDPVAIDFTGMGKGEAWVNGQSIGRYWPTYSSPQNGCSTACNYRGPYSAS 658
Query: 688 KCLKNCGKPSQSLYHVPRSWLKSSGNTLVLFEEIGGDPTKISFVTKQLGSSLCSHVTDSH 747
KC+KNCGKPSQ LYHVPRS+++S NTLVLFEE+GGDPT+IS TKQ+ +SLC+HV++SH
Sbjct: 659 KCIKNCGKPSQILYHVPRSFIQSESNTLVLFEEMGGDPTQISLATKQM-TSLCAHVSESH 717
Query: 748 PLPVDMWGSDSKIQRKPGPVLSLECPNPNQVISSIKFASFGTPLGTCGSFSRGRCSSARS 807
P PVD W S + +K GP + LECP PNQVISSIKFASFGTP G CGSF+ +CSSA
Sbjct: 718 PAPVDTWLSLQQKGKKSGPTIQLECPYPNQVISSIKFASFGTPSGMCGSFNHSQCSSASV 777
Query: 808 LSVVRQACVGSKSCSIGVSVNTFGDPCKGVMKSLAVEASCT 848
L+VV++ACVGSK CS+G+S T GDPC+GV+KSLAVEA+C+
Sbjct: 778 LAVVQKACVGSKRCSVGISSKTLGDPCRGVIKSLAVEAACS 818
>gi|115451981|ref|NP_001049591.1| Os03g0255100 [Oryza sativa Japonica Group]
gi|108707232|gb|ABF95027.1| Beta-galactosidase precursor, putative, expressed [Oryza sativa
Japonica Group]
gi|113548062|dbj|BAF11505.1| Os03g0255100 [Oryza sativa Japonica Group]
gi|215695246|dbj|BAG90437.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 956
Score = 1217 bits (3150), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 584/835 (69%), Positives = 686/835 (82%), Gaps = 12/835 (1%)
Query: 22 SFGANVTYDHRAVVIGGKRRVLISGSIHYPRSTPEMWPDLIQKSKDGGLDVIETYVFWNL 81
S ANVTYDHRAVVI G RRVL+SGSIHYPRSTP+MWP LIQKSKDGGLDVIETYVFW++
Sbjct: 126 SRAANVTYDHRAVVIDGVRRVLVSGSIHYPRSTPDMWPGLIQKSKDGGLDVIETYVFWDI 185
Query: 82 HEPVRNQYNFEGRYDLVKFVKLVAEAGLYAHLRIGPYVCAEWNFGGFPLWLHFIPGIQFR 141
HE VR QY+FEGR DLV+FVK VA+AGLY HLRIGPYVCAEWN+GGFP+WLHF+PGI+FR
Sbjct: 186 HEAVRGQYDFEGRKDLVRFVKAVADAGLYVHLRIGPYVCAEWNYGGFPVWLHFVPGIKFR 245
Query: 142 TDNEPFKAEMQRFTAKIVDMMKQEKLYASQGGPIILSQIENEYGNIDSAYGAAGKSYIKW 201
TDNE FKAEMQRFT K+VD MK LYASQGGPIILSQIENEYGNIDSAYGAAGK+Y++W
Sbjct: 246 TDNEAFKAEMQRFTEKVVDTMKGAGLYASQGGPIILSQIENEYGNIDSAYGAAGKAYMRW 305
Query: 202 AAGMALSLDTGVPWVMCQQSDAPDPIINTCNGFYCDQFTPNSNNKPKMWTENWSGWFLSF 261
AAGMA+SLDTGVPWVMCQQSDAPDP+INTCNGFYCDQFTPNS +KPKMWTENWSGWFLSF
Sbjct: 306 AAGMAVSLDTGVPWVMCQQSDAPDPLINTCNGFYCDQFTPNSKSKPKMWTENWSGWFLSF 365
Query: 262 GGAVPYRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNFDRTSGGPFISTSYDYDAPLDEY 321
GGAVPYRP EDLAFAVARF+QRGGTFQNYYMYHGGTNF R++GGPFI+TSYDYDAP+DEY
Sbjct: 366 GGAVPYRPAEDLAFAVARFYQRGGTFQNYYMYHGGTNFGRSTGGPFIATSYDYDAPIDEY 425
Query: 322 GLIRQPKWGHLKDLHKAIKLCEAALVATDPTYPSLGPNLEATVYKTG-SGLCSAFLANIG 380
G++RQPKWGHL+D+HKAIKLCE AL+A +P+Y SLG N EATVY+T + +C+AFLAN+
Sbjct: 426 GMVRQPKWGHLRDVHKAIKLCEPALIAAEPSYSSLGQNTEATVYQTADNSICAAFLANVD 485
Query: 381 TNSDVTVKFNGNSYLLPAWSVSILPDCKNVVFNTAKINSVTLVPSFSR--QSLQVAADS- 437
SD TVKFNGN+Y LPAWSVSILPDCKNVV NTA+INS S+Q DS
Sbjct: 486 AQSDKTVKFNGNTYKLPAWSVSILPDCKNVVLNTAQINSQVTTSEMRSLGSSIQDTDDSL 545
Query: 438 --SDAIGSGWSYINEPVGISKDDAFTKPGLLEQINTTADQSDYLWYSLSTNIKADEPLLE 495
+ +GWSY EPVGI+K++A TKPGL+EQINTTAD SD+LWYS S +K DEP L
Sbjct: 546 ITPELATAGWSYAIEPVGITKENALTKPGLMEQINTTADASDFLWYSTSIVVKGDEPYL- 604
Query: 496 DGSKTVLHVQSLGHALHAFINGKLVGSGYGSSSNAKVTVDFPIALAPGKNTFDLLSLTVG 555
+GS++ L V SLGH L +INGKL GS GS+S++ +++ P+ L PGKN DLLS TVG
Sbjct: 605 NGSQSNLLVNSLGHVLQIYINGKLAGSAKGSASSSLISLQTPVTLVPGKNKIDLLSTTVG 664
Query: 556 LQNYGAFYEKTGAGITGPVQLKGSGNGTNIDLSSQQWTYQTGLKGEELNF--PSGSSTQW 613
L NYGAF++ GAG+TGPV+L G NG ++LSS WTYQ GL+GE+L+ PS +S +W
Sbjct: 665 LSNYGAFFDLVGAGVTGPVKLSGP-NGA-LNLSSTDWTYQIGLRGEDLHLYNPSEASPEW 722
Query: 614 DSKSTLPKLQPLVWYKTTFDAPAGSEPVAIDFTGMGKGEAWVNGQSIGRYWPTYVSQNGG 673
S + P QPL+WYKT F APAG +PVAIDFTGMGKGEAWVNGQSIGRYWPT ++ G
Sbjct: 723 VSDNAYPTNQPLIWYKTKFTAPAGDDPVAIDFTGMGKGEAWVNGQSIGRYWPTNLAPQSG 782
Query: 674 CTDSCNYRGAYSSNKCLKNCGKPSQSLYHVPRSWLKSSGNTLVLFEEIGGDPTKISFVTK 733
C +SCNYRGAYSSNKCLK CG+PSQ+LYHVPRS+L+ N LVLFE+ GGDP+ ISF T+
Sbjct: 783 CVNSCNYRGAYSSNKCLKKCGQPSQTLYHVPRSFLQPGSNDLVLFEQFGGDPSMISFTTR 842
Query: 734 QLGSSLCSHVTDSHPLPVDMWGSDSKIQRKPGPVLSLECPNPNQVISSIKFASFGTPLGT 793
Q SS+C+HV++ HP +D W S + + GP L LECP QVIS+IKFASFGTP GT
Sbjct: 843 QT-SSICAHVSEMHPAQIDSWISPQQTSQTQGPALRLECPREGQVISNIKFASFGTPSGT 901
Query: 794 CGSFSRGRCSSARSLSVVRQACVGSKSCSIGVSVNTFGDPCKGVMKSLAVEASCT 848
CG+++ G CSS+++L+VV++ACVG +CS+ VS N FGDPC GV KSL VEA+C+
Sbjct: 902 CGNYNHGECSSSQALAVVQEACVGMTNCSVPVSSNNFGDPCSGVTKSLVVEAACS 956
>gi|152013362|sp|Q10NX8.2|BGAL6_ORYSJ RecName: Full=Beta-galactosidase 6; Short=Lactase 6; Flags:
Precursor
Length = 858
Score = 1216 bits (3147), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 584/835 (69%), Positives = 686/835 (82%), Gaps = 12/835 (1%)
Query: 22 SFGANVTYDHRAVVIGGKRRVLISGSIHYPRSTPEMWPDLIQKSKDGGLDVIETYVFWNL 81
S ANVTYDHRAVVI G RRVL+SGSIHYPRSTP+MWP LIQKSKDGGLDVIETYVFW++
Sbjct: 28 SRAANVTYDHRAVVIDGVRRVLVSGSIHYPRSTPDMWPGLIQKSKDGGLDVIETYVFWDI 87
Query: 82 HEPVRNQYNFEGRYDLVKFVKLVAEAGLYAHLRIGPYVCAEWNFGGFPLWLHFIPGIQFR 141
HE VR QY+FEGR DLV+FVK VA+AGLY HLRIGPYVCAEWN+GGFP+WLHF+PGI+FR
Sbjct: 88 HEAVRGQYDFEGRKDLVRFVKAVADAGLYVHLRIGPYVCAEWNYGGFPVWLHFVPGIKFR 147
Query: 142 TDNEPFKAEMQRFTAKIVDMMKQEKLYASQGGPIILSQIENEYGNIDSAYGAAGKSYIKW 201
TDNE FKAEMQRFT K+VD MK LYASQGGPIILSQIENEYGNIDSAYGAAGK+Y++W
Sbjct: 148 TDNEAFKAEMQRFTEKVVDTMKGAGLYASQGGPIILSQIENEYGNIDSAYGAAGKAYMRW 207
Query: 202 AAGMALSLDTGVPWVMCQQSDAPDPIINTCNGFYCDQFTPNSNNKPKMWTENWSGWFLSF 261
AAGMA+SLDTGVPWVMCQQSDAPDP+INTCNGFYCDQFTPNS +KPKMWTENWSGWFLSF
Sbjct: 208 AAGMAVSLDTGVPWVMCQQSDAPDPLINTCNGFYCDQFTPNSKSKPKMWTENWSGWFLSF 267
Query: 262 GGAVPYRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNFDRTSGGPFISTSYDYDAPLDEY 321
GGAVPYRP EDLAFAVARF+QRGGTFQNYYMYHGGTNF R++GGPFI+TSYDYDAP+DEY
Sbjct: 268 GGAVPYRPAEDLAFAVARFYQRGGTFQNYYMYHGGTNFGRSTGGPFIATSYDYDAPIDEY 327
Query: 322 GLIRQPKWGHLKDLHKAIKLCEAALVATDPTYPSLGPNLEATVYKTG-SGLCSAFLANIG 380
G++RQPKWGHL+D+HKAIKLCE AL+A +P+Y SLG N EATVY+T + +C+AFLAN+
Sbjct: 328 GMVRQPKWGHLRDVHKAIKLCEPALIAAEPSYSSLGQNTEATVYQTADNSICAAFLANVD 387
Query: 381 TNSDVTVKFNGNSYLLPAWSVSILPDCKNVVFNTAKINSVTLVPSFSR--QSLQVAADS- 437
SD TVKFNGN+Y LPAWSVSILPDCKNVV NTA+INS S+Q DS
Sbjct: 388 AQSDKTVKFNGNTYKLPAWSVSILPDCKNVVLNTAQINSQVTTSEMRSLGSSIQDTDDSL 447
Query: 438 --SDAIGSGWSYINEPVGISKDDAFTKPGLLEQINTTADQSDYLWYSLSTNIKADEPLLE 495
+ +GWSY EPVGI+K++A TKPGL+EQINTTAD SD+LWYS S +K DEP L
Sbjct: 448 ITPELATAGWSYAIEPVGITKENALTKPGLMEQINTTADASDFLWYSTSIVVKGDEPYL- 506
Query: 496 DGSKTVLHVQSLGHALHAFINGKLVGSGYGSSSNAKVTVDFPIALAPGKNTFDLLSLTVG 555
+GS++ L V SLGH L +INGKL GS GS+S++ +++ P+ L PGKN DLLS TVG
Sbjct: 507 NGSQSNLLVNSLGHVLQIYINGKLAGSAKGSASSSLISLQTPVTLVPGKNKIDLLSTTVG 566
Query: 556 LQNYGAFYEKTGAGITGPVQLKGSGNGTNIDLSSQQWTYQTGLKGEELNF--PSGSSTQW 613
L NYGAF++ GAG+TGPV+L G NG ++LSS WTYQ GL+GE+L+ PS +S +W
Sbjct: 567 LSNYGAFFDLVGAGVTGPVKLSGP-NGA-LNLSSTDWTYQIGLRGEDLHLYNPSEASPEW 624
Query: 614 DSKSTLPKLQPLVWYKTTFDAPAGSEPVAIDFTGMGKGEAWVNGQSIGRYWPTYVSQNGG 673
S + P QPL+WYKT F APAG +PVAIDFTGMGKGEAWVNGQSIGRYWPT ++ G
Sbjct: 625 VSDNAYPTNQPLIWYKTKFTAPAGDDPVAIDFTGMGKGEAWVNGQSIGRYWPTNLAPQSG 684
Query: 674 CTDSCNYRGAYSSNKCLKNCGKPSQSLYHVPRSWLKSSGNTLVLFEEIGGDPTKISFVTK 733
C +SCNYRGAYSSNKCLK CG+PSQ+LYHVPRS+L+ N LVLFE+ GGDP+ ISF T+
Sbjct: 685 CVNSCNYRGAYSSNKCLKKCGQPSQTLYHVPRSFLQPGSNDLVLFEQFGGDPSMISFTTR 744
Query: 734 QLGSSLCSHVTDSHPLPVDMWGSDSKIQRKPGPVLSLECPNPNQVISSIKFASFGTPLGT 793
Q SS+C+HV++ HP +D W S + + GP L LECP QVIS+IKFASFGTP GT
Sbjct: 745 QT-SSICAHVSEMHPAQIDSWISPQQTSQTQGPALRLECPREGQVISNIKFASFGTPSGT 803
Query: 794 CGSFSRGRCSSARSLSVVRQACVGSKSCSIGVSVNTFGDPCKGVMKSLAVEASCT 848
CG+++ G CSS+++L+VV++ACVG +CS+ VS N FGDPC GV KSL VEA+C+
Sbjct: 804 CGNYNHGECSSSQALAVVQEACVGMTNCSVPVSSNNFGDPCSGVTKSLVVEAACS 858
>gi|125543160|gb|EAY89299.1| hypothetical protein OsI_10800 [Oryza sativa Indica Group]
Length = 861
Score = 1216 bits (3147), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 585/838 (69%), Positives = 687/838 (81%), Gaps = 15/838 (1%)
Query: 22 SFGANVTYDHRAVVIGGKRRVLISGSIHYPRSTPEMWPDLIQKSKDGGLDVIETYVFWNL 81
S ANVTYDHRAVVI G RRVL+SGSIHYPRSTP+MWP LIQKSKDGGLDVIETYVFW++
Sbjct: 28 SRAANVTYDHRAVVIDGVRRVLVSGSIHYPRSTPDMWPGLIQKSKDGGLDVIETYVFWDI 87
Query: 82 HEPVR---NQYNFEGRYDLVKFVKLVAEAGLYAHLRIGPYVCAEWNFGGFPLWLHFIPGI 138
HEPVR QY+FEGR DLV+FVK VA+AGLY HLRIGPYVCAEWN+GGFP+WLHF+PGI
Sbjct: 88 HEPVRGQAQQYDFEGRKDLVRFVKAVADAGLYVHLRIGPYVCAEWNYGGFPVWLHFVPGI 147
Query: 139 QFRTDNEPFKAEMQRFTAKIVDMMKQEKLYASQGGPIILSQIENEYGNIDSAYGAAGKSY 198
+FRTDNE FKAEMQRFT K+VD MK LYASQGGPIILSQIENEYGNIDSAYGAAGK+Y
Sbjct: 148 KFRTDNEAFKAEMQRFTEKVVDTMKGAGLYASQGGPIILSQIENEYGNIDSAYGAAGKAY 207
Query: 199 IKWAAGMALSLDTGVPWVMCQQSDAPDPIINTCNGFYCDQFTPNSNNKPKMWTENWSGWF 258
++WAAGMA+SLDTGVPWVMCQQSDAPDP+INTCNGFYCDQFTPNS +KPKMWTENWSGWF
Sbjct: 208 MRWAAGMAVSLDTGVPWVMCQQSDAPDPLINTCNGFYCDQFTPNSKSKPKMWTENWSGWF 267
Query: 259 LSFGGAVPYRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNFDRTSGGPFISTSYDYDAPL 318
LSFGGAVPYRP EDLAFAVARF+QRGGTFQNYYMYHGGTNF R++GGPFI+TSYDYDAP+
Sbjct: 268 LSFGGAVPYRPAEDLAFAVARFYQRGGTFQNYYMYHGGTNFGRSTGGPFIATSYDYDAPI 327
Query: 319 DEYGLIRQPKWGHLKDLHKAIKLCEAALVATDPTYPSLGPNLEATVYKTG-SGLCSAFLA 377
DEYG++RQPKWGHL+D+HKAIKLCE AL+A +P+Y SLG N EATVY+T + +C+AFLA
Sbjct: 328 DEYGMVRQPKWGHLRDVHKAIKLCEPALIAAEPSYSSLGQNTEATVYQTADNSICAAFLA 387
Query: 378 NIGTNSDVTVKFNGNSYLLPAWSVSILPDCKNVVFNTAKINSVTLVPSFSR--QSLQVAA 435
N+ SD VKFNGN+Y LPAWSVSILPDCKNVV NTA+INS S+Q
Sbjct: 388 NVDAQSDKAVKFNGNTYKLPAWSVSILPDCKNVVLNTAQINSQVTTSEMRSLGSSIQDTD 447
Query: 436 DS---SDAIGSGWSYINEPVGISKDDAFTKPGLLEQINTTADQSDYLWYSLSTNIKADEP 492
DS + +GWSY EPVGI+K++A TKPGL+EQINTTAD SD+LWYS S +K DEP
Sbjct: 448 DSLITPELATAGWSYAIEPVGITKENALTKPGLMEQINTTADASDFLWYSTSIVVKGDEP 507
Query: 493 LLEDGSKTVLHVQSLGHALHAFINGKLVGSGYGSSSNAKVTVDFPIALAPGKNTFDLLSL 552
L +GS++ L V SLGH L +INGKL GS GS+S++ +++ P+ L PGKN DLLS
Sbjct: 508 YL-NGSQSNLLVNSLGHVLQVYINGKLAGSAKGSASSSLISLQTPVTLVPGKNKIDLLST 566
Query: 553 TVGLQNYGAFYEKTGAGITGPVQLKGSGNGTNIDLSSQQWTYQTGLKGEELNF--PSGSS 610
TVGL NYGAF++ GAG+TGPV+L G NG ++LSS WTYQ GL+GE+L+ PS +S
Sbjct: 567 TVGLSNYGAFFDLIGAGVTGPVKLSGP-NGA-LNLSSTDWTYQIGLRGEDLHLYNPSEAS 624
Query: 611 TQWDSKSTLPKLQPLVWYKTTFDAPAGSEPVAIDFTGMGKGEAWVNGQSIGRYWPTYVSQ 670
+W S + P QPL+WYKT F APAG +PVAIDFTGMGKGEAWVNGQSIGRYWPT ++
Sbjct: 625 PEWVSDNAYPTNQPLIWYKTKFTAPAGDDPVAIDFTGMGKGEAWVNGQSIGRYWPTNLAP 684
Query: 671 NGGCTDSCNYRGAYSSNKCLKNCGKPSQSLYHVPRSWLKSSGNTLVLFEEIGGDPTKISF 730
GC +SCNYRGAYSSNKCLK CG+PSQ+LYHVPRS+L+ N LVLFE+ GGDP+ ISF
Sbjct: 685 QSGCVNSCNYRGAYSSNKCLKKCGQPSQTLYHVPRSFLQPGSNDLVLFEQFGGDPSMISF 744
Query: 731 VTKQLGSSLCSHVTDSHPLPVDMWGSDSKIQRKPGPVLSLECPNPNQVISSIKFASFGTP 790
T+Q SS+C+HV++ HP +D W S + + PGP L LECP QVIS+IKFASFGTP
Sbjct: 745 TTRQT-SSICAHVSEMHPAQIDSWISPQQTSQTPGPALRLECPREGQVISNIKFASFGTP 803
Query: 791 LGTCGSFSRGRCSSARSLSVVRQACVGSKSCSIGVSVNTFGDPCKGVMKSLAVEASCT 848
GTCG+++ G CSS+++L+VV++ACVG +CS+ VS N FGDPC GV KSL VEA+C+
Sbjct: 804 SGTCGNYNHGECSSSQALAVVQEACVGMTNCSVPVSSNNFGDPCSGVTKSLVVEAACS 861
>gi|116787095|gb|ABK24373.1| unknown [Picea sitchensis]
Length = 861
Score = 1215 bits (3143), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 578/857 (67%), Positives = 688/857 (80%), Gaps = 17/857 (1%)
Query: 7 LLLVLCWGFVVLATTSF----GANVTYDHRAVVIGGKRRVLISGSIHYPRSTPEMWPDLI 62
L LVL + F+ + ANVTYDHR+++I G+RRVLISGSIHYPRSTPEMWPD+I
Sbjct: 7 LRLVLIYAFLFNGFYYWKHVSAANVTYDHRSLLIDGQRRVLISGSIHYPRSTPEMWPDII 66
Query: 63 QKSKDGGLDVIETYVFWNLHEPVRNQYNFEGRYDLVKFVKLVAEAGLYAHLRIGPYVCAE 122
QK+KDGGLDVIE+YVFWN+HEP +N+Y FE R+DLVKFVK+V +AGL HLRIGPY CAE
Sbjct: 67 QKAKDGGLDVIESYVFWNMHEPKQNEYYFEDRFDLVKFVKIVQQAGLLVHLRIGPYACAE 126
Query: 123 WNFGGFPLWLHFIPGIQFRTDNEPFKAEMQRFTAKIVDMMKQEKLYASQGGPIILSQIEN 182
WN+GGFP+WLH IPGI FRTDNEPFK EMQRFTAKIVDMMKQEKL+ASQGGPIIL+QIEN
Sbjct: 127 WNYGGFPVWLHLIPGIHFRTDNEPFKNEMQRFTAKIVDMMKQEKLFASQGGPIILAQIEN 186
Query: 183 EYGNIDSAYGAAGKSYIKWAAGMALSLDTGVPWVMCQQSDAPDPIINTCNGFYCDQFTPN 242
EYGNID YGAAGKSY+KWAA MA+ L+TGVPWVMCQQ+DAPDPIINTCNGFYCD FTPN
Sbjct: 187 EYGNIDGPYGAAGKSYVKWAASMAVGLNTGVPWVMCQQADAPDPIINTCNGFYCDAFTPN 246
Query: 243 SNNKPKMWTENWSGWFLSFGGAVPYRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNFDRT 302
S NKPKMWTENWSGWFLSFGG +P+RP EDLAF+VARFFQRGGTFQNYYMYHGGTNF RT
Sbjct: 247 SPNKPKMWTENWSGWFLSFGGRLPFRPTEDLAFSVARFFQRGGTFQNYYMYHGGTNFGRT 306
Query: 303 SGGPFISTSYDYDAPLDEYGLIRQPKWGHLKDLHKAIKLCEAALVATDPTYPSLGPNLEA 362
+GGPFI+TSYDYDAP+DEYG++RQPKWGHLK+LHKAIKLCEAALV + Y SLG LEA
Sbjct: 307 TGGPFIATSYDYDAPIDEYGIVRQPKWGHLKELHKAIKLCEAALVNAESNYTSLGSGLEA 366
Query: 363 TVYKTGSGLCSAFLANIGTNSDVTVKFNGNSYLLPAWSVSILPDCKNVVFNTAKINSVTL 422
VY GSG C+AFLAN T SD TVKFNGNSY LPAWSVSILPDCKNVVFNTAKI S T
Sbjct: 367 HVYSPGSGTCAAFLANSNTQSDATVKFNGNSYHLPAWSVSILPDCKNVVFNTAKIGSQTT 426
Query: 423 VPSFSRQSLQVAADSS-----DAIGSGWSYINEPVGISKDDAFTKPGLLEQINTTADQSD 477
+ +L +A +S A + WS+++E +GI + F+KPGLLEQINTT D SD
Sbjct: 427 SVQMNPANLILAGSNSMKGTDSANAASWSWLHEQIGIGGSNTFSKPGLLEQINTTVDSSD 486
Query: 478 YLWYSLSTNIKADEPLLEDGSKTVLHVQSLGHALHAFINGKLVGSGYGSSSNAKVTVDFP 537
YLWY+ S + +EP L +G++ VLHVQSLGHALH FING+ G G GSSS++K+ + P
Sbjct: 487 YLWYTTSIQVDDNEPFLHNGTQPVLHVQSLGHALHVFINGEFAGRGAGSSSSSKIALQTP 546
Query: 538 IALAPGKNTFDLLSLTVGLQNYGAFYEKTGAGITGPVQLKGSGNGTNIDLSSQQWTYQTG 597
I L GKN DLLS+TVGLQNYG+F++ GAGITGPV L+G +G + DLS+QQWTYQ G
Sbjct: 547 ITLKSGKNNIDLLSITVGLQNYGSFFDTWGAGITGPVILQGFKDGEH-DLSTQQWTYQIG 605
Query: 598 LKGEELNFPSG---SSTQWDSKSTLPKLQPLVWYKTTFDAPAGSEPVAIDFTGMGKGEAW 654
L GE+L SG +S QW + S LP QP++WYKT FDAP+G++PVA++ GMGKG AW
Sbjct: 606 LTGEQLGIYSGDTKASAQWVAGSDLPTKQPMIWYKTNFDAPSGNDPVALNLLGMGKGVAW 665
Query: 655 VNGQSIGRYWPTYVSQNGGCTDSCNYRGAYSSNKCLKNCGKPSQSLYHVPRSWLKSSGNT 714
VNGQSIGRYWP+Y++ GCTDSC+YRGAYSS KC NCG+PSQ LYHVPRSW++ +GN
Sbjct: 666 VNGQSIGRYWPSYIASQSGCTDSCDYRGAYSSTKCQTNCGQPSQKLYHVPRSWIQPTGNV 725
Query: 715 LVLFEEIGGDPTKISFVTKQLGSSLCSHVTDSHPLPVDMWGSDSKI---QRKPGPVLSLE 771
LVLFEE+GGDPT+ISF+T+ +G SLC+ V+++H PVD W S + KP L L
Sbjct: 726 LVLFEELGGDPTQISFMTRSVG-SLCAQVSETHLPPVDSWKSSATSGLEVNKPKAELQLH 784
Query: 772 CPNPNQVISSIKFASFGTPLGTCGSFSRGRCSSARSLSVVRQACVGSKSCSIGVSVNTFG 831
CP+ +I SIKFASFGT G+CGSF+ G C++ ++S+V +AC+G +SCS+ VS+ FG
Sbjct: 785 CPSSRHLIKSIKFASFGTSKGSCGSFTYGHCNTNSTMSIVEEACIGRESCSVEVSIEKFG 844
Query: 832 DPCKGVMKSLAVEASCT 848
DPCKG +K+LAVEASC+
Sbjct: 845 DPCKGTVKNLAVEASCS 861
>gi|125583741|gb|EAZ24672.1| hypothetical protein OsJ_08441 [Oryza sativa Japonica Group]
Length = 861
Score = 1214 bits (3140), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 584/838 (69%), Positives = 686/838 (81%), Gaps = 15/838 (1%)
Query: 22 SFGANVTYDHRAVVIGGKRRVLISGSIHYPRSTPEMWPDLIQKSKDGGLDVIETYVFWNL 81
S ANVTYDHRAVVI G RRVL+SGSIHYPRSTP+MWP LIQKSKDGGLDVIETYVFW++
Sbjct: 28 SRAANVTYDHRAVVIDGVRRVLVSGSIHYPRSTPDMWPGLIQKSKDGGLDVIETYVFWDI 87
Query: 82 HEPVR---NQYNFEGRYDLVKFVKLVAEAGLYAHLRIGPYVCAEWNFGGFPLWLHFIPGI 138
HE VR QY+FEGR DLV+FVK VA+AGLY HLRIGPYVCAEWN+GGFP+WLHF+PGI
Sbjct: 88 HEAVRGQAQQYDFEGRKDLVRFVKAVADAGLYVHLRIGPYVCAEWNYGGFPVWLHFVPGI 147
Query: 139 QFRTDNEPFKAEMQRFTAKIVDMMKQEKLYASQGGPIILSQIENEYGNIDSAYGAAGKSY 198
+FRTDNE FKAEMQRFT K+VD MK LYASQGGPIILSQIENEYGNIDSAYGAAGK+Y
Sbjct: 148 KFRTDNEAFKAEMQRFTEKVVDTMKGAGLYASQGGPIILSQIENEYGNIDSAYGAAGKAY 207
Query: 199 IKWAAGMALSLDTGVPWVMCQQSDAPDPIINTCNGFYCDQFTPNSNNKPKMWTENWSGWF 258
++WAAGMA+SLDTGVPWVMCQQSDAPDP+INTCNGFYCDQFTPNS +KPKMWTENWSGWF
Sbjct: 208 MRWAAGMAVSLDTGVPWVMCQQSDAPDPLINTCNGFYCDQFTPNSKSKPKMWTENWSGWF 267
Query: 259 LSFGGAVPYRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNFDRTSGGPFISTSYDYDAPL 318
LSFGGAVPYRP EDLAFAVARF+QRGGTFQNYYMYHGGTNF R++GGPFI+TSYDYDAP+
Sbjct: 268 LSFGGAVPYRPAEDLAFAVARFYQRGGTFQNYYMYHGGTNFGRSTGGPFIATSYDYDAPI 327
Query: 319 DEYGLIRQPKWGHLKDLHKAIKLCEAALVATDPTYPSLGPNLEATVYKTG-SGLCSAFLA 377
DEYG++RQPKWGHL+D+HKAIKLCE AL+A +P+Y SLG N EATVY+T + +C+AFLA
Sbjct: 328 DEYGMVRQPKWGHLRDVHKAIKLCEPALIAAEPSYSSLGQNTEATVYQTADNSICAAFLA 387
Query: 378 NIGTNSDVTVKFNGNSYLLPAWSVSILPDCKNVVFNTAKINSVTLVPSFSR--QSLQVAA 435
N+ SD TVKFNGN+Y LPAWSVSILPDCKNVV NTA+INS S+Q
Sbjct: 388 NVDAQSDKTVKFNGNTYKLPAWSVSILPDCKNVVLNTAQINSQVTTSEMRSLGSSIQDTD 447
Query: 436 DS---SDAIGSGWSYINEPVGISKDDAFTKPGLLEQINTTADQSDYLWYSLSTNIKADEP 492
DS + +GWSY EPVGI+K++A TKPGL+EQINTTAD SD+LWYS S +K DEP
Sbjct: 448 DSLITPELATAGWSYAIEPVGITKENALTKPGLMEQINTTADASDFLWYSTSIVVKGDEP 507
Query: 493 LLEDGSKTVLHVQSLGHALHAFINGKLVGSGYGSSSNAKVTVDFPIALAPGKNTFDLLSL 552
L +GS++ L V SLGH L +INGKL GS GS+S++ +++ P+ L PGKN DLLS
Sbjct: 508 YL-NGSQSNLLVNSLGHVLQIYINGKLAGSAKGSASSSLISLQTPVTLVPGKNKIDLLST 566
Query: 553 TVGLQNYGAFYEKTGAGITGPVQLKGSGNGTNIDLSSQQWTYQTGLKGEELNF--PSGSS 610
TVGL NYGAF++ GAG+TGPV+L G NG ++LSS WTYQ GL+GE+L+ PS +S
Sbjct: 567 TVGLSNYGAFFDLVGAGVTGPVKLSGP-NGA-LNLSSTDWTYQIGLRGEDLHLYNPSEAS 624
Query: 611 TQWDSKSTLPKLQPLVWYKTTFDAPAGSEPVAIDFTGMGKGEAWVNGQSIGRYWPTYVSQ 670
+W S + P QPL+WYKT F APAG +PVAIDFTGMGKGEAWVNGQSIGRYWPT ++
Sbjct: 625 PEWVSDNAYPTNQPLIWYKTKFTAPAGDDPVAIDFTGMGKGEAWVNGQSIGRYWPTNLAP 684
Query: 671 NGGCTDSCNYRGAYSSNKCLKNCGKPSQSLYHVPRSWLKSSGNTLVLFEEIGGDPTKISF 730
GC +SCNYRGAYSSNKCLK CG+PSQ+LYHVPRS+L+ N LVLFE+ GGDP+ ISF
Sbjct: 685 QSGCVNSCNYRGAYSSNKCLKKCGQPSQTLYHVPRSFLQPGSNDLVLFEQFGGDPSMISF 744
Query: 731 VTKQLGSSLCSHVTDSHPLPVDMWGSDSKIQRKPGPVLSLECPNPNQVISSIKFASFGTP 790
T+Q SS+C+HV++ HP +D W S + + GP L LECP QVIS+IKFASFGTP
Sbjct: 745 TTRQT-SSICAHVSEMHPAQIDSWISPQQTSQTQGPALRLECPREGQVISNIKFASFGTP 803
Query: 791 LGTCGSFSRGRCSSARSLSVVRQACVGSKSCSIGVSVNTFGDPCKGVMKSLAVEASCT 848
GTCG+++ G CSS+++L+VV++ACVG +CS+ VS N FGDPC GV KSL VEA+C+
Sbjct: 804 SGTCGNYNHGECSSSQALAVVQEACVGMTNCSVPVSSNNFGDPCSGVTKSLVVEAACS 861
>gi|242036283|ref|XP_002465536.1| hypothetical protein SORBIDRAFT_01g040750 [Sorghum bicolor]
gi|241919390|gb|EER92534.1| hypothetical protein SORBIDRAFT_01g040750 [Sorghum bicolor]
Length = 860
Score = 1209 bits (3129), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 591/830 (71%), Positives = 686/830 (82%), Gaps = 12/830 (1%)
Query: 26 NVTYDHRAVVIGGKRRVLISGSIHYPRSTPEMWPDLIQKSKDGGLDVIETYVFWNLHEPV 85
NVTYDHRA+VI G RRVL+SGSIHYPRSTP+MWP +IQK+KDGGLDVIETYVFW++HEPV
Sbjct: 36 NVTYDHRALVIDGVRRVLVSGSIHYPRSTPDMWPGIIQKAKDGGLDVIETYVFWDIHEPV 95
Query: 86 RNQYNFEGRYDLVKFVKLVAEAGLYAHLRIGPYVCAEWNFGGFPLWLHFIPGIQFRTDNE 145
R QY+FEGR DL FVK VA+AGLY HLRIGPYVCAEWN+GGFPLWLHFIPGI+FRTDNE
Sbjct: 96 RGQYDFEGRKDLAAFVKTVADAGLYVHLRIGPYVCAEWNYGGFPLWLHFIPGIKFRTDNE 155
Query: 146 PFKAEMQRFTAKIVDMMKQEKLYASQGGPIILSQIENEYGNIDSAYGAAGKSYIKWAAGM 205
PFK EMQRFTAK+VD MK LYASQGGPIILSQIENEYGNIDSAYGAAGK+Y++WAAGM
Sbjct: 156 PFKTEMQRFTAKVVDTMKGAGLYASQGGPIILSQIENEYGNIDSAYGAAGKAYMRWAAGM 215
Query: 206 ALSLDTGVPWVMCQQSDAPDPIINTCNGFYCDQFTPNSNNKPKMWTENWSGWFLSFGGAV 265
A+SLDTGVPWVMCQQ+DAPDP+INTCNGFYCDQFTPNS KPKMWTENWSGWFLSFGGAV
Sbjct: 216 AISLDTGVPWVMCQQTDAPDPLINTCNGFYCDQFTPNSAAKPKMWTENWSGWFLSFGGAV 275
Query: 266 PYRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNFDRTSGGPFISTSYDYDAPLDEYGLIR 325
PYRPVEDLAFAVARF+QRGGTFQNYYMYHGGTN DR+SGGPFI+TSYDYDAP+DEYGL+R
Sbjct: 276 PYRPVEDLAFAVARFYQRGGTFQNYYMYHGGTNLDRSSGGPFIATSYDYDAPIDEYGLVR 335
Query: 326 QPKWGHLKDLHKAIKLCEAALVATDPTYPSLGPNLEATVYKTGSGLCSAFLANIGTNSDV 385
+PKWGHL+D+HKAIKLCE AL+ATDP+Y SLG N EA VYKTGS +C+AFLANI SD
Sbjct: 336 EPKWGHLRDVHKAIKLCEPALIATDPSYTSLGQNAEAAVYKTGS-VCAAFLANIDGQSDK 394
Query: 386 TVKFNGNSYLLPAWSVSILPDCKNVVFNTAKINS-VTLVPSFSRQSLQVAADSS----DA 440
TV FNG Y LPAWSVSILPDCKNVV NTA+INS VT +S +A+D S +
Sbjct: 395 TVTFNGRMYRLPAWSVSILPDCKNVVLNTAQINSQVTSSEMRYLESSNMASDGSFITPEL 454
Query: 441 IGSGWSYINEPVGISKDDAFTKPGLLEQINTTADQSDYLWYSLSTNIKADEPLLEDGSKT 500
SGWSY EPVGI+KD+A TK GL+EQINTTAD SD+LWYS S +K DEP L +GS++
Sbjct: 455 AVSGWSYAIEPVGITKDNALTKAGLMEQINTTADASDFLWYSTSITVKGDEPYL-NGSQS 513
Query: 501 VLHVQSLGHALHAFINGKLVGSGYGSSSNAKVTVDFPIALAPGKNTFDLLSLTVGLQNYG 560
L V SLGH L +INGK+ GS GS+S++ ++ PI L PGKN DLLS TVGL NYG
Sbjct: 514 NLVVNSLGHVLQVYINGKIAGSAQGSASSSLISWQKPIELVPGKNKIDLLSATVGLSNYG 573
Query: 561 AFYEKTGAGITGPVQLKGSGNGTNIDLSSQQWTYQTGLKGEELNF--PSGSSTQWDSKST 618
AF++ GAGITGPV+L G+ NG +DLSS +WTYQ GL+GE+L+ PS +S +W S +
Sbjct: 574 AFFDLVGAGITGPVKLSGT-NGA-LDLSSAEWTYQIGLRGEDLHLYDPSEASPEWVSANA 631
Query: 619 LPKLQPLVWYKTTFDAPAGSEPVAIDFTGMGKGEAWVNGQSIGRYWPTYVSQNGGCTDSC 678
P QPL+WYKT F PAG +PVAIDFTGMGKGEAWVNGQSIGRYWPT ++ GC +SC
Sbjct: 632 YPINQPLIWYKTKFTPPAGDDPVAIDFTGMGKGEAWVNGQSIGRYWPTNLAPQSGCVNSC 691
Query: 679 NYRGAYSSNKCLKNCGKPSQSLYHVPRSWLKSSGNTLVLFEEIGGDPTKISFVTKQLGSS 738
NYRG+Y+SNKCLK CG+PSQ+LYHVPRS+L+ N +VLFE+ GGDP+KISFV +Q G S
Sbjct: 692 NYRGSYNSNKCLKKCGQPSQTLYHVPRSFLQPGSNDIVLFEQFGGDPSKISFVIRQTG-S 750
Query: 739 LCSHVTDSHPLPVDMWGSDSKIQRKPGPVLSLECPNPNQVISSIKFASFGTPLGTCGSFS 798
+C+ V++ HP +D W S + ++ GP L LECP QVISSIKFASFGTP GTCGS+S
Sbjct: 751 VCAQVSEEHPAQIDSWNSSQQTMQRYGPELRLECPKDGQVISSIKFASFGTPSGTCGSYS 810
Query: 799 RGRCSSARSLSVVRQACVGSKSCSIGVSVNTFGDPCKGVMKSLAVEASCT 848
G CSS ++LSVV++AC+G SCS+ VS N FG+PC GV KSLAVEA+C+
Sbjct: 811 HGECSSTQALSVVQEACIGVSSCSVPVSSNYFGNPCTGVTKSLAVEAACS 860
>gi|226503159|ref|NP_001146370.1| uncharacterized protein LOC100279948 precursor [Zea mays]
gi|219886857|gb|ACL53803.1| unknown [Zea mays]
gi|414865885|tpg|DAA44442.1| TPA: beta-galactosidase [Zea mays]
Length = 852
Score = 1209 bits (3128), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 595/838 (71%), Positives = 687/838 (81%), Gaps = 13/838 (1%)
Query: 18 LATTSFGANVTYDHRAVVIGGKRRVLISGSIHYPRSTPEMWPDLIQKSKDGGLDVIETYV 77
+A + ANVTYDHRA+VI G RRVL+SGSIHYPRSTP+MWP LIQK+KDGGLDVIETYV
Sbjct: 21 IAGGARAANVTYDHRALVIDGVRRVLVSGSIHYPRSTPDMWPGLIQKAKDGGLDVIETYV 80
Query: 78 FWNLHEPVRNQYNFEGRYDLVKFVKLVAEAGLYAHLRIGPYVCAEWNFGGFPLWLHFIPG 137
FW++HEPVR QY+FEGR DL FVK VA+AGLY HLRIGPYVCAEWN+GGFPLWLHFIPG
Sbjct: 81 FWDIHEPVRGQYDFEGRKDLAAFVKTVADAGLYVHLRIGPYVCAEWNYGGFPLWLHFIPG 140
Query: 138 IQFRTDNEPFKAEMQRFTAKIVDMMKQEKLYASQGGPIILSQIENEYGNIDSAYGAAGKS 197
I+FRTDNEPFKAEMQRFTAK+VD MK LYASQGGPIILSQIENEYGNIDSAYGA GK+
Sbjct: 141 IKFRTDNEPFKAEMQRFTAKVVDTMKGAGLYASQGGPIILSQIENEYGNIDSAYGAPGKA 200
Query: 198 YIKWAAGMALSLDTGVPWVMCQQSDAPDPIINTCNGFYCDQFTPNSNNKPKMWTENWSGW 257
Y++WAAGMA+SLDTGVPWVMCQQ+DAPDP+INTCNGFYCDQFTPNS KPKMWTENWSGW
Sbjct: 201 YMRWAAGMAVSLDTGVPWVMCQQADAPDPLINTCNGFYCDQFTPNSAAKPKMWTENWSGW 260
Query: 258 FLSFGGAVPYRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNFDRTSGGPFISTSYDYDAP 317
FLSFGGAVPYRPVEDLAFAVARF+QRGGTFQNYYMYHGGTN DR+SGGPFI+TSYDYDAP
Sbjct: 261 FLSFGGAVPYRPVEDLAFAVARFYQRGGTFQNYYMYHGGTNLDRSSGGPFIATSYDYDAP 320
Query: 318 LDEYGLIRQPKWGHLKDLHKAIKLCEAALVATDPTYPSLGPNLEATVYKTGSGLCSAFLA 377
+DEYGL+RQPKWGHL+D+HKAIKLCE AL+ATDP+Y SLGPN+EA VYK GS +C+AFLA
Sbjct: 321 IDEYGLVRQPKWGHLRDVHKAIKLCEPALIATDPSYTSLGPNVEAAVYKVGS-VCAAFLA 379
Query: 378 NIGTNSDVTVKFNGNSYLLPAWSVSILPDCKNVVFNTAKINSVTLVPSFSR-QSLQVAAD 436
NI SD TV FNG Y LPAWSVSILPDCKNVV NTA+INS T +S VA+D
Sbjct: 380 NIDGQSDKTVTFNGKMYRLPAWSVSILPDCKNVVLNTAQINSQTTGSEMRYLESSNVASD 439
Query: 437 SS----DAIGSGWSYINEPVGISKDDAFTKPGLLEQINTTADQSDYLWYSLSTNIKADEP 492
S + S WSY EPVGI+KD+A TK GL+EQINTTAD SD+LWYS S +K DEP
Sbjct: 440 GSFVTPELAVSDWSYAIEPVGITKDNALTKAGLMEQINTTADASDFLWYSTSITVKGDEP 499
Query: 493 LLEDGSKTVLHVQSLGHALHAFINGKLVGSGYGSSSNAKVTVDFPIALAPGKNTFDLLSL 552
L +GS++ L V SLGH L +INGK+ GS GS+S++ ++ PI L PGKN DLLS
Sbjct: 500 YL-NGSQSNLAVNSLGHVLQVYINGKIAGSAQGSASSSLISWQKPIELVPGKNKIDLLSA 558
Query: 553 TVGLQNYGAFYEKTGAGITGPVQLKGSGNGTNIDLSSQQWTYQTGLKGEELNF--PSGSS 610
TVGL NYGAF++ GAGITGPV+L G NG +DLSS +WTYQ GL+GE+L+ PS +S
Sbjct: 559 TVGLSNYGAFFDLVGAGITGPVKLSGL-NGA-LDLSSAEWTYQIGLRGEDLHLYDPSEAS 616
Query: 611 TQWDSKSTLPKLQPLVWYKTTFDAPAGSEPVAIDFTGMGKGEAWVNGQSIGRYWPTYVSQ 670
+W S + P PL+WYKT F PAG +PVAIDFTGMGKGEAWVNGQSIGRYWPT ++
Sbjct: 617 PEWVSANAYPINHPLIWYKTKFTPPAGDDPVAIDFTGMGKGEAWVNGQSIGRYWPTNLAP 676
Query: 671 NGGCTDSCNYRGAYSSNKCLKNCGKPSQSLYHVPRSWLKSSGNTLVLFEEIGGDPTKISF 730
GC +SCNYRGAYSS+KCLK CG+PSQ+LYHVPRS+L+ N LVLFE GGDP+KISF
Sbjct: 677 QSGCVNSCNYRGAYSSSKCLKKCGQPSQTLYHVPRSFLQPGSNDLVLFEHFGGDPSKISF 736
Query: 731 VTKQLGSSLCSHVTDSHPLPVDMWGSDSKIQRKPGPVLSLECPNPNQVISSIKFASFGTP 790
V +Q G S+C+ V+++HP +D W S +QR GP L LECP QVISS+KFASFGTP
Sbjct: 737 VMRQTG-SVCAQVSEAHPAQIDSWSSQQPMQRY-GPALRLECPKEGQVISSVKFASFGTP 794
Query: 791 LGTCGSFSRGRCSSARSLSVVRQACVGSKSCSIGVSVNTFGDPCKGVMKSLAVEASCT 848
GTCGS+S G CSS ++LS+V++AC+G SCS+ VS N FG+PC GV KSLAVEA+C+
Sbjct: 795 SGTCGSYSHGECSSTQALSIVQEACIGVSSCSVPVSSNYFGNPCTGVTKSLAVEAACS 852
>gi|357113057|ref|XP_003558321.1| PREDICTED: LOW QUALITY PROTEIN: beta-galactosidase 6-like
[Brachypodium distachyon]
Length = 852
Score = 1206 bits (3121), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 577/839 (68%), Positives = 674/839 (80%), Gaps = 16/839 (1%)
Query: 19 ATTSFGANVTYDHRAVVIGGKRRVLISGSIHYPRSTPEMWPDLIQKSKDGGLDVIETYVF 78
A S NVTYDHRA+VI G RRVL+SGSIHYPRSTP+MWP L+QK+KDGGLDV+ETYVF
Sbjct: 21 AGASSATNVTYDHRALVIDGVRRVLVSGSIHYPRSTPDMWPGLMQKAKDGGLDVVETYVF 80
Query: 79 WNLHEPVRNQYNFEGRYDLVKFVKLVAEAGLYAHLRIGPYVCAEWNFGGFPLWLHFIPGI 138
W++HE QY+FEGR DLV+FVK A+ GLY HLRIGPYVCAEWN+GGFPLWLHFIPGI
Sbjct: 81 WDIHETATXQYDFEGRKDLVRFVKAAADTGLYVHLRIGPYVCAEWNYGGFPLWLHFIPGI 140
Query: 139 QFRTDNEPFKAEMQRFTAKIVDMMKQEKLYASQGGPIILSQIENEYGNIDSAYGAAGKSY 198
+FRTDNEPFK EMQRFT K+V MK LYASQGGPIILSQIENEYGNIDSAYGAAGKSY
Sbjct: 141 KFRTDNEPFKTEMQRFTEKVVATMKGAGLYASQGGPIILSQIENEYGNIDSAYGAAGKSY 200
Query: 199 IKWAAGMALSLDTGVPWVMCQQSDAPDPIINTCNGFYCDQFTPNSNNKPKMWTENWSGWF 258
I+WAAGMA++LDTGVPWVMCQQ+DAPDP+INTCNGFYCDQFTPNSN+KPK+WTENWSGWF
Sbjct: 201 IRWAAGMAVALDTGVPWVMCQQADAPDPLINTCNGFYCDQFTPNSNSKPKLWTENWSGWF 260
Query: 259 LSFGGAVPYRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNFDRTSGGPFISTSYDYDAPL 318
LSFGGAVPYRP EDLAFAVARF+QRGGT QNYYMYHGGTNF R+SGGPFISTSYDYDAP+
Sbjct: 261 LSFGGAVPYRPTEDLAFAVARFYQRGGTLQNYYMYHGGTNFGRSSGGPFISTSYDYDAPI 320
Query: 319 DEYGLIRQPKWGHLKDLHKAIKLCEAALVATDPTYPSLGPNLEATVYKTGSGLCSAFLAN 378
DEYGL+RQPKWGHLKD+HKAIK CE AL+ATDP+Y S+G N EA VYK GS +C+AFLAN
Sbjct: 321 DEYGLVRQPKWGHLKDVHKAIKQCEPALIATDPSYMSMGQNAEAHVYKAGS-VCAAFLAN 379
Query: 379 IGTNSDVTVKFNGNSYLLPAWSVSILPDCKNVVFNTAKINSVTLVPSFSRQSLQVAADSS 438
+ T SD TV FNGN+Y LPAWSVSILPDCKNVV NTA+INS T +SL + +S
Sbjct: 380 MDTQSDKTVTFNGNAYKLPAWSVSILPDCKNVVLNTAQINSQTTTSEM--RSLGSSTKAS 437
Query: 439 DAIG-------SGWSYINEPVGISKDDAFTKPGLLEQINTTADQSDYLWYSLSTNIKADE 491
D SGWSY EPVGI+ ++A TKPGL+EQINTTAD SD+LWYS S +K E
Sbjct: 438 DGSSIETELALSGWSYAIEPVGITTENALTKPGLMEQINTTADASDFLWYSTSVVVKGGE 497
Query: 492 PLLEDGSKTVLHVQSLGHALHAFINGKLVGSGYGSSSNAKVTVDFPIALAPGKNTFDLLS 551
P L +GS++ L V SLGH L A+INGK GS GS++++ +++ PI L PGKN DLLS
Sbjct: 498 PYL-NGSQSNLLVNSLGHVLQAYINGKFAGSAKGSATSSLISLQTPITLVPGKNKIDLLS 556
Query: 552 LTVGLQNYGAFYEKTGAGITGPVQLKGSGNGTNIDLSSQQWTYQTGLKGEELNF--PSGS 609
TVGL NYGAF++ GAGITGPV+L SG +DLSS WTYQ GL+GE L+ PS +
Sbjct: 557 GTVGLSNYGAFFDLVGAGITGPVKL--SGPKGVLDLSSTDWTYQVGLRGEGLHLYNPSEA 614
Query: 610 STQWDSKSTLPKLQPLVWYKTTFDAPAGSEPVAIDFTGMGKGEAWVNGQSIGRYWPTYVS 669
S +W S P QPL+WYK+ F PAG +PVAIDFTGMGKGEAWVNGQSIGRYWPT ++
Sbjct: 615 SPEWVSDKAYPTNQPLIWYKSKFTTPAGDDPVAIDFTGMGKGEAWVNGQSIGRYWPTNLA 674
Query: 670 QNGGCTDSCNYRGAYSSNKCLKNCGKPSQSLYHVPRSWLKSSGNTLVLFEEIGGDPTKIS 729
GC +SCNYRG YSS+KCLK CG+PSQ+LYHVPRS+L+ N +VLFE+ GGDP+KIS
Sbjct: 675 PQSGCVNSCNYRGPYSSSKCLKKCGQPSQTLYHVPRSFLQPGSNDIVLFEQFGGDPSKIS 734
Query: 730 FVTKQLGSSLCSHVTDSHPLPVDMWGSDSKIQRKPGPVLSLECPNPNQVISSIKFASFGT 789
F TKQ +S+C+HV++ HP +D W S + ++ GP L LECP QVISSIKFASFGT
Sbjct: 735 FTTKQT-ASVCAHVSEDHPDQIDSWISPQQKVQRSGPALRLECPKAGQVISSIKFASFGT 793
Query: 790 PLGTCGSFSRGRCSSARSLSVVRQACVGSKSCSIGVSVNTFGDPCKGVMKSLAVEASCT 848
P GTCG+++ G CSS ++L+V ++AC+G SCS+ VS FGDPC GV KSL VEA+C+
Sbjct: 794 PSGTCGNYNHGECSSPQALAVAQEACIGVSSCSVPVSTKNFGDPCTGVTKSLVVEAACS 852
>gi|326506982|dbj|BAJ95568.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 853
Score = 1189 bits (3075), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 571/835 (68%), Positives = 676/835 (80%), Gaps = 12/835 (1%)
Query: 21 TSFGANVTYDHRAVVIGGKRRVLISGSIHYPRSTPEMWPDLIQKSKDGGLDVIETYVFWN 80
TS NVTYDHRA+VI G RRVL+SGSIHYPRSTP+MWP L+QK+KDGGLDV+ETYVFW+
Sbjct: 24 TSAATNVTYDHRALVIDGVRRVLVSGSIHYPRSTPDMWPGLMQKAKDGGLDVVETYVFWD 83
Query: 81 LHEPVRNQYNFEGRYDLVKFVKLVAEAGLYAHLRIGPYVCAEWNFGGFPLWLHFIPGIQF 140
+HEPVR QY+FEGR DLV+FVK A+AGLY HLRIGPYVCAEWN+GGFPLWLHFIPGI+
Sbjct: 84 VHEPVRGQYDFEGRNDLVRFVKAAADAGLYVHLRIGPYVCAEWNYGGFPLWLHFIPGIKL 143
Query: 141 RTDNEPFKAEMQRFTAKIVDMMKQEKLYASQGGPIILSQIENEYGNIDSAYGAAGKSYIK 200
RTDNEPFK EMQRFT K+V MK LYASQGGPIILSQIENEYGNI ++YGAAGKSYI+
Sbjct: 144 RTDNEPFKTEMQRFTEKVVATMKGAGLYASQGGPIILSQIENEYGNIAASYGAAGKSYIR 203
Query: 201 WAAGMALSLDTGVPWVMCQQSDAPDPIINTCNGFYCDQFTPNSNNKPKMWTENWSGWFLS 260
WAAGMA++LDTGVPWVMCQQ+DAP+P+INTCNGFYCDQFTP+ ++PK+WTENWSGWFLS
Sbjct: 204 WAAGMAVALDTGVPWVMCQQTDAPEPLINTCNGFYCDQFTPSLPSRPKLWTENWSGWFLS 263
Query: 261 FGGAVPYRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNFDRTSGGPFISTSYDYDAPLDE 320
FGGAVPYRP EDLAFAVARF+QRGGT QNYYMYHGGTNF R+SGGPFISTSYDYDAP+DE
Sbjct: 264 FGGAVPYRPTEDLAFAVARFYQRGGTLQNYYMYHGGTNFGRSSGGPFISTSYDYDAPIDE 323
Query: 321 YGLIRQPKWGHLKDLHKAIKLCEAALVATDPTYPSLGPNLEATVYKTGSGLCSAFLANIG 380
YGL+RQPKWGHL+D+HKAIK+CE AL+ATDP+Y SLG N EA VYK+GS LC+AFLANI
Sbjct: 324 YGLVRQPKWGHLRDVHKAIKMCEPALIATDPSYMSLGQNAEAHVYKSGS-LCAAFLANID 382
Query: 381 TNSDVTVKFNGNSYLLPAWSVSILPDCKNVVFNTAKINSVTLVPS-----FSRQSLQVAA 435
SD TV FNG +Y LPAWSVSILPDCKNVV NTA+INS FS Q+ ++
Sbjct: 383 DQSDKTVTFNGKAYKLPAWSVSILPDCKNVVLNTAQINSQVASTQMRNLGFSTQASDGSS 442
Query: 436 DSSDAIGSGWSYINEPVGISKDDAFTKPGLLEQINTTADQSDYLWYSLSTNIKADEPLLE 495
++ S WSY EPVGI+K++A TKPGL+EQINTTAD SD+LWYS S + EP L
Sbjct: 443 VEAELAASSWSYAVEPVGITKENALTKPGLMEQINTTADASDFLWYSTSIVVAGGEPYL- 501
Query: 496 DGSKTVLHVQSLGHALHAFINGKLVGSGYGSSSNAKVTVDFPIALAPGKNTFDLLSLTVG 555
+GS++ L V SLGH L FINGKL GS GS+S++ +++ P+ L GKN DLLS TVG
Sbjct: 502 NGSQSNLLVNSLGHVLQVFINGKLAGSSKGSASSSLISLTTPVTLVTGKNKIDLLSATVG 561
Query: 556 LQNYGAFYEKTGAGITGPVQLKGSGNGTNIDLSSQQWTYQTGLKGEELNF--PSGSSTQW 613
L NYGAF++ GAGITGPV+L G GT +DLSS +WTYQ GL+GE+L+ PS +S +W
Sbjct: 562 LTNYGAFFDLVGAGITGPVKLTGP-KGT-LDLSSAEWTYQIGLRGEDLHLYNPSEASPEW 619
Query: 614 DSKSTLPKLQPLVWYKTTFDAPAGSEPVAIDFTGMGKGEAWVNGQSIGRYWPTYVSQNGG 673
S ++ P PL WYK+ F APAG +PVAIDFTGMGKGEAWVNGQSIGRYWPT ++ G
Sbjct: 620 VSDNSYPTNNPLTWYKSKFTAPAGDDPVAIDFTGMGKGEAWVNGQSIGRYWPTNIAPQSG 679
Query: 674 CTDSCNYRGAYSSNKCLKNCGKPSQSLYHVPRSWLKSSGNTLVLFEEIGGDPTKISFVTK 733
C +SCNYRG+YS+ KCLK CG+PSQ LYHVPRS+L+ N +VLFE+ GG+P+KISF TK
Sbjct: 680 CVNSCNYRGSYSATKCLKKCGQPSQILYHVPRSFLQPGSNDIVLFEQFGGNPSKISFTTK 739
Query: 734 QLGSSLCSHVTDSHPLPVDMWGSDSKIQRKPGPVLSLECPNPNQVISSIKFASFGTPLGT 793
Q S+C+HV++ HP +D W S + ++ GP L LECP QVISSIKFASFGTP GT
Sbjct: 740 QT-ESVCAHVSEDHPDQIDSWVSSQQKLQRSGPALRLECPKEGQVISSIKFASFGTPSGT 798
Query: 794 CGSFSRGRCSSARSLSVVRQACVGSKSCSIGVSVNTFGDPCKGVMKSLAVEASCT 848
CGS+S G CSS+++L+V ++ACVG SCS+ VS FGDPC+GV KSL VEA+C+
Sbjct: 799 CGSYSHGECSSSQALAVAQEACVGVSSCSVPVSAKNFGDPCRGVTKSLVVEAACS 853
>gi|414865886|tpg|DAA44443.1| TPA: hypothetical protein ZEAMMB73_968467 [Zea mays]
Length = 830
Score = 1164 bits (3010), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 578/838 (68%), Positives = 669/838 (79%), Gaps = 35/838 (4%)
Query: 18 LATTSFGANVTYDHRAVVIGGKRRVLISGSIHYPRSTPEMWPDLIQKSKDGGLDVIETYV 77
+A + ANVTYDHRA+VI G RRVL+SGSIHYPRSTP+MWP LIQK+KDGGLDVIETYV
Sbjct: 21 IAGGARAANVTYDHRALVIDGVRRVLVSGSIHYPRSTPDMWPGLIQKAKDGGLDVIETYV 80
Query: 78 FWNLHEPVRNQYNFEGRYDLVKFVKLVAEAGLYAHLRIGPYVCAEWNFGGFPLWLHFIPG 137
FW++HEPVR QY+FEGR DL FVK VA+AGLY HLRIGPYVCAEWN+GGFPLWLHFIPG
Sbjct: 81 FWDIHEPVRGQYDFEGRKDLAAFVKTVADAGLYVHLRIGPYVCAEWNYGGFPLWLHFIPG 140
Query: 138 IQFRTDNEPFKAEMQRFTAKIVDMMKQEKLYASQGGPIILSQIENEYGNIDSAYGAAGKS 197
I+FRTDNEPFKAEMQRFTAKI ENEYGNIDSAYGA GK+
Sbjct: 141 IKFRTDNEPFKAEMQRFTAKI----------------------ENEYGNIDSAYGAPGKA 178
Query: 198 YIKWAAGMALSLDTGVPWVMCQQSDAPDPIINTCNGFYCDQFTPNSNNKPKMWTENWSGW 257
Y++WAAGMA+SLDTGVPWVMCQQ+DAPDP+INTCNGFYCDQFTPNS KPKMWTENWSGW
Sbjct: 179 YMRWAAGMAVSLDTGVPWVMCQQADAPDPLINTCNGFYCDQFTPNSAAKPKMWTENWSGW 238
Query: 258 FLSFGGAVPYRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNFDRTSGGPFISTSYDYDAP 317
FLSFGGAVPYRPVEDLAFAVARF+QRGGTFQNYYMYHGGTN DR+SGGPFI+TSYDYDAP
Sbjct: 239 FLSFGGAVPYRPVEDLAFAVARFYQRGGTFQNYYMYHGGTNLDRSSGGPFIATSYDYDAP 298
Query: 318 LDEYGLIRQPKWGHLKDLHKAIKLCEAALVATDPTYPSLGPNLEATVYKTGSGLCSAFLA 377
+DEYGL+RQPKWGHL+D+HKAIKLCE AL+ATDP+Y SLGPN+EA VYK GS +C+AFLA
Sbjct: 299 IDEYGLVRQPKWGHLRDVHKAIKLCEPALIATDPSYTSLGPNVEAAVYKVGS-VCAAFLA 357
Query: 378 NIGTNSDVTVKFNGNSYLLPAWSVSILPDCKNVVFNTAKINSVTLVPSFSR-QSLQVAAD 436
NI SD TV FNG Y LPAWSVSILPDCKNVV NTA+INS T +S VA+D
Sbjct: 358 NIDGQSDKTVTFNGKMYRLPAWSVSILPDCKNVVLNTAQINSQTTGSEMRYLESSNVASD 417
Query: 437 SS----DAIGSGWSYINEPVGISKDDAFTKPGLLEQINTTADQSDYLWYSLSTNIKADEP 492
S + S WSY EPVGI+KD+A TK GL+EQINTTAD SD+LWYS S +K DEP
Sbjct: 418 GSFVTPELAVSDWSYAIEPVGITKDNALTKAGLMEQINTTADASDFLWYSTSITVKGDEP 477
Query: 493 LLEDGSKTVLHVQSLGHALHAFINGKLVGSGYGSSSNAKVTVDFPIALAPGKNTFDLLSL 552
L +GS++ L V SLGH L +INGK+ GS GS+S++ ++ PI L PGKN DLLS
Sbjct: 478 YL-NGSQSNLAVNSLGHVLQVYINGKIAGSAQGSASSSLISWQKPIELVPGKNKIDLLSA 536
Query: 553 TVGLQNYGAFYEKTGAGITGPVQLKGSGNGTNIDLSSQQWTYQTGLKGEELNF--PSGSS 610
TVGL NYGAF++ GAGITGPV+L G NG +DLSS +WTYQ GL+GE+L+ PS +S
Sbjct: 537 TVGLSNYGAFFDLVGAGITGPVKLSGL-NGA-LDLSSAEWTYQIGLRGEDLHLYDPSEAS 594
Query: 611 TQWDSKSTLPKLQPLVWYKTTFDAPAGSEPVAIDFTGMGKGEAWVNGQSIGRYWPTYVSQ 670
+W S + P PL+WYKT F PAG +PVAIDFTGMGKGEAWVNGQSIGRYWPT ++
Sbjct: 595 PEWVSANAYPINHPLIWYKTKFTPPAGDDPVAIDFTGMGKGEAWVNGQSIGRYWPTNLAP 654
Query: 671 NGGCTDSCNYRGAYSSNKCLKNCGKPSQSLYHVPRSWLKSSGNTLVLFEEIGGDPTKISF 730
GC +SCNYRGAYSS+KCLK CG+PSQ+LYHVPRS+L+ N LVLFE GGDP+KISF
Sbjct: 655 QSGCVNSCNYRGAYSSSKCLKKCGQPSQTLYHVPRSFLQPGSNDLVLFEHFGGDPSKISF 714
Query: 731 VTKQLGSSLCSHVTDSHPLPVDMWGSDSKIQRKPGPVLSLECPNPNQVISSIKFASFGTP 790
V +Q G S+C+ V+++HP +D W S +QR GP L LECP QVISS+KFASFGTP
Sbjct: 715 VMRQTG-SVCAQVSEAHPAQIDSWSSQQPMQRY-GPALRLECPKEGQVISSVKFASFGTP 772
Query: 791 LGTCGSFSRGRCSSARSLSVVRQACVGSKSCSIGVSVNTFGDPCKGVMKSLAVEASCT 848
GTCGS+S G CSS ++LS+V++AC+G SCS+ VS N FG+PC GV KSLAVEA+C+
Sbjct: 773 SGTCGSYSHGECSSTQALSIVQEACIGVSSCSVPVSSNYFGNPCTGVTKSLAVEAACS 830
>gi|108707233|gb|ABF95028.1| Beta-galactosidase precursor, putative, expressed [Oryza sativa
Japonica Group]
Length = 796
Score = 1156 bits (2991), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 555/800 (69%), Positives = 655/800 (81%), Gaps = 12/800 (1%)
Query: 57 MWPDLIQKSKDGGLDVIETYVFWNLHEPVRNQYNFEGRYDLVKFVKLVAEAGLYAHLRIG 116
MWP LIQKSKDGGLDVIETYVFW++HE VR QY+FEGR DLV+FVK VA+AGLY HLRIG
Sbjct: 1 MWPGLIQKSKDGGLDVIETYVFWDIHEAVRGQYDFEGRKDLVRFVKAVADAGLYVHLRIG 60
Query: 117 PYVCAEWNFGGFPLWLHFIPGIQFRTDNEPFKAEMQRFTAKIVDMMKQEKLYASQGGPII 176
PYVCAEWN+GGFP+WLHF+PGI+FRTDNE FKAEMQRFT K+VD MK LYASQGGPII
Sbjct: 61 PYVCAEWNYGGFPVWLHFVPGIKFRTDNEAFKAEMQRFTEKVVDTMKGAGLYASQGGPII 120
Query: 177 LSQIENEYGNIDSAYGAAGKSYIKWAAGMALSLDTGVPWVMCQQSDAPDPIINTCNGFYC 236
LSQIENEYGNIDSAYGAAGK+Y++WAAGMA+SLDTGVPWVMCQQSDAPDP+INTCNGFYC
Sbjct: 121 LSQIENEYGNIDSAYGAAGKAYMRWAAGMAVSLDTGVPWVMCQQSDAPDPLINTCNGFYC 180
Query: 237 DQFTPNSNNKPKMWTENWSGWFLSFGGAVPYRPVEDLAFAVARFFQRGGTFQNYYMYHGG 296
DQFTPNS +KPKMWTENWSGWFLSFGGAVPYRP EDLAFAVARF+QRGGTFQNYYMYHGG
Sbjct: 181 DQFTPNSKSKPKMWTENWSGWFLSFGGAVPYRPAEDLAFAVARFYQRGGTFQNYYMYHGG 240
Query: 297 TNFDRTSGGPFISTSYDYDAPLDEYGLIRQPKWGHLKDLHKAIKLCEAALVATDPTYPSL 356
TNF R++GGPFI+TSYDYDAP+DEYG++RQPKWGHL+D+HKAIKLCE AL+A +P+Y SL
Sbjct: 241 TNFGRSTGGPFIATSYDYDAPIDEYGMVRQPKWGHLRDVHKAIKLCEPALIAAEPSYSSL 300
Query: 357 GPNLEATVYKTG-SGLCSAFLANIGTNSDVTVKFNGNSYLLPAWSVSILPDCKNVVFNTA 415
G N EATVY+T + +C+AFLAN+ SD TVKFNGN+Y LPAWSVSILPDCKNVV NTA
Sbjct: 301 GQNTEATVYQTADNSICAAFLANVDAQSDKTVKFNGNTYKLPAWSVSILPDCKNVVLNTA 360
Query: 416 KINSVTLVPSFSR--QSLQVAADS---SDAIGSGWSYINEPVGISKDDAFTKPGLLEQIN 470
+INS S+Q DS + +GWSY EPVGI+K++A TKPGL+EQIN
Sbjct: 361 QINSQVTTSEMRSLGSSIQDTDDSLITPELATAGWSYAIEPVGITKENALTKPGLMEQIN 420
Query: 471 TTADQSDYLWYSLSTNIKADEPLLEDGSKTVLHVQSLGHALHAFINGKLVGSGYGSSSNA 530
TTAD SD+LWYS S +K DEP L +GS++ L V SLGH L +INGKL GS GS+S++
Sbjct: 421 TTADASDFLWYSTSIVVKGDEPYL-NGSQSNLLVNSLGHVLQIYINGKLAGSAKGSASSS 479
Query: 531 KVTVDFPIALAPGKNTFDLLSLTVGLQNYGAFYEKTGAGITGPVQLKGSGNGTNIDLSSQ 590
+++ P+ L PGKN DLLS TVGL NYGAF++ GAG+TGPV+L G NG ++LSS
Sbjct: 480 LISLQTPVTLVPGKNKIDLLSTTVGLSNYGAFFDLVGAGVTGPVKLSGP-NGA-LNLSST 537
Query: 591 QWTYQTGLKGEELNF--PSGSSTQWDSKSTLPKLQPLVWYKTTFDAPAGSEPVAIDFTGM 648
WTYQ GL+GE+L+ PS +S +W S + P QPL+WYKT F APAG +PVAIDFTGM
Sbjct: 538 DWTYQIGLRGEDLHLYNPSEASPEWVSDNAYPTNQPLIWYKTKFTAPAGDDPVAIDFTGM 597
Query: 649 GKGEAWVNGQSIGRYWPTYVSQNGGCTDSCNYRGAYSSNKCLKNCGKPSQSLYHVPRSWL 708
GKGEAWVNGQSIGRYWPT ++ GC +SCNYRGAYSSNKCLK CG+PSQ+LYHVPRS+L
Sbjct: 598 GKGEAWVNGQSIGRYWPTNLAPQSGCVNSCNYRGAYSSNKCLKKCGQPSQTLYHVPRSFL 657
Query: 709 KSSGNTLVLFEEIGGDPTKISFVTKQLGSSLCSHVTDSHPLPVDMWGSDSKIQRKPGPVL 768
+ N LVLFE+ GGDP+ ISF T+Q SS+C+HV++ HP +D W S + + GP L
Sbjct: 658 QPGSNDLVLFEQFGGDPSMISFTTRQT-SSICAHVSEMHPAQIDSWISPQQTSQTQGPAL 716
Query: 769 SLECPNPNQVISSIKFASFGTPLGTCGSFSRGRCSSARSLSVVRQACVGSKSCSIGVSVN 828
LECP QVIS+IKFASFGTP GTCG+++ G CSS+++L+VV++ACVG +CS+ VS N
Sbjct: 717 RLECPREGQVISNIKFASFGTPSGTCGNYNHGECSSSQALAVVQEACVGMTNCSVPVSSN 776
Query: 829 TFGDPCKGVMKSLAVEASCT 848
FGDPC GV KSL VEA+C+
Sbjct: 777 NFGDPCSGVTKSLVVEAACS 796
>gi|326534200|dbj|BAJ89450.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 763
Score = 1069 bits (2765), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 518/768 (67%), Positives = 615/768 (80%), Gaps = 12/768 (1%)
Query: 88 QYNFEGRYDLVKFVKLVAEAGLYAHLRIGPYVCAEWNFGGFPLWLHFIPGIQFRTDNEPF 147
QY+FEGR DLV+FVK A+AGLY HLRIGPYVCAEWN+GGFPLWLHFIPGI+ RTDNEPF
Sbjct: 1 QYDFEGRNDLVRFVKAAADAGLYVHLRIGPYVCAEWNYGGFPLWLHFIPGIKLRTDNEPF 60
Query: 148 KAEMQRFTAKIVDMMKQEKLYASQGGPIILSQIENEYGNIDSAYGAAGKSYIKWAAGMAL 207
K EMQRFT K+V MK LYASQGGPIILSQIENEYGNI ++YGAAGKSYI+WAAGMA+
Sbjct: 61 KTEMQRFTEKVVATMKGAGLYASQGGPIILSQIENEYGNIAASYGAAGKSYIRWAAGMAV 120
Query: 208 SLDTGVPWVMCQQSDAPDPIINTCNGFYCDQFTPNSNNKPKMWTENWSGWFLSFGGAVPY 267
+LDTGVPWVMCQQ+DAP+P+INTCNGFYCDQFTP+ ++PK+WTENWSGWFLSFGGAVPY
Sbjct: 121 ALDTGVPWVMCQQTDAPEPLINTCNGFYCDQFTPSLPSRPKLWTENWSGWFLSFGGAVPY 180
Query: 268 RPVEDLAFAVARFFQRGGTFQNYYMYHGGTNFDRTSGGPFISTSYDYDAPLDEYGLIRQP 327
RP EDLAFAVARF+QRGGT QNYYMYHGGTNF R+SGGPFISTSYDYDAP+DEYGL+RQP
Sbjct: 181 RPTEDLAFAVARFYQRGGTLQNYYMYHGGTNFGRSSGGPFISTSYDYDAPIDEYGLVRQP 240
Query: 328 KWGHLKDLHKAIKLCEAALVATDPTYPSLGPNLEATVYKTGSGLCSAFLANIGTNSDVTV 387
KWGHL+D+HKAIK+CE AL+ATDP+Y SLG N EA VYK+GS LC+AFLANI SD TV
Sbjct: 241 KWGHLRDVHKAIKMCEPALIATDPSYMSLGQNAEAHVYKSGS-LCAAFLANIDDQSDKTV 299
Query: 388 KFNGNSYLLPAWSVSILPDCKNVVFNTAKINSVTLVPS-----FSRQSLQVAADSSDAIG 442
FNG +Y LPAWSVSILPDCKNVV NTA+INS FS Q+ ++ ++
Sbjct: 300 TFNGKAYKLPAWSVSILPDCKNVVLNTAQINSQVASTQMRNLGFSTQASDGSSVEAELAA 359
Query: 443 SGWSYINEPVGISKDDAFTKPGLLEQINTTADQSDYLWYSLSTNIKADEPLLEDGSKTVL 502
S WSY EPVGI+K++A TKPGL+EQINTTAD SD+LWYS S + EP L +GS++ L
Sbjct: 360 SSWSYAVEPVGITKENALTKPGLMEQINTTADASDFLWYSTSIVVAGGEPYL-NGSQSNL 418
Query: 503 HVQSLGHALHAFINGKLVGSGYGSSSNAKVTVDFPIALAPGKNTFDLLSLTVGLQNYGAF 562
V SLGH L FINGKL GS GS+S++ +++ P+ L GKN DLLS TVGL NYGAF
Sbjct: 419 PVNSLGHVLQVFINGKLAGSSKGSASSSLISLTTPVTLVTGKNKIDLLSATVGLTNYGAF 478
Query: 563 YEKTGAGITGPVQLKGSGNGTNIDLSSQQWTYQTGLKGEELNF--PSGSSTQWDSKSTLP 620
++ GAGITGPV+L G GT +DLSS +WTYQ GL+GE+L+ PS +S +W S ++ P
Sbjct: 479 FDLVGAGITGPVKLTGP-KGT-LDLSSAEWTYQIGLRGEDLHLYNPSEASPEWVSDNSYP 536
Query: 621 KLQPLVWYKTTFDAPAGSEPVAIDFTGMGKGEAWVNGQSIGRYWPTYVSQNGGCTDSCNY 680
PL WYK+ F APAG +PVAIDFTGMGKGEAWVNGQSIGRYWPT ++ C +SCNY
Sbjct: 537 TNNPLTWYKSKFTAPAGDDPVAIDFTGMGKGEAWVNGQSIGRYWPTNIAPQSDCVNSCNY 596
Query: 681 RGAYSSNKCLKNCGKPSQSLYHVPRSWLKSSGNTLVLFEEIGGDPTKISFVTKQLGSSLC 740
RG+YS+ KCLK CG+PSQ LYHVPRS+L+ N +VLFE+ GG+P+KISF TKQ S+C
Sbjct: 597 RGSYSATKCLKKCGQPSQILYHVPRSFLQPGSNDIVLFEQFGGNPSKISFTTKQT-ESVC 655
Query: 741 SHVTDSHPLPVDMWGSDSKIQRKPGPVLSLECPNPNQVISSIKFASFGTPLGTCGSFSRG 800
+HV++ HP +D W S + ++ GP L LECP QVISSIKFASFGTP GTCGS+S G
Sbjct: 656 AHVSEDHPDQIDSWVSSQQKLQRSGPALRLECPKEGQVISSIKFASFGTPSGTCGSYSHG 715
Query: 801 RCSSARSLSVVRQACVGSKSCSIGVSVNTFGDPCKGVMKSLAVEASCT 848
CSS+++L+V ++ACVG SCS+ VS FGDPC+GV KSL VEA+C+
Sbjct: 716 ECSSSQALAVAQEACVGVSSCSVPVSAKNFGDPCRGVTKSLVVEAACS 763
>gi|255560830|ref|XP_002521428.1| beta-galactosidase, putative [Ricinus communis]
gi|223539327|gb|EEF40918.1| beta-galactosidase, putative [Ricinus communis]
Length = 841
Score = 1036 bits (2679), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 509/856 (59%), Positives = 627/856 (73%), Gaps = 24/856 (2%)
Query: 1 MASKEILLLVLCWGFVVLATTSF-----GANVTYDHRAVVIGGKRRVLISGSIHYPRSTP 55
M SK L+L+L FV + S+ V+YDHRA+VI GKRRVL SGSIHYPR+TP
Sbjct: 1 MGSKNSLVLILL--FVSIFACSYLERGWSGKVSYDHRALVIDGKRRVLQSGSIHYPRTTP 58
Query: 56 EMWPDLIQKSKDGGLDVIETYVFWNLHEPVRNQYNFEGRYDLVKFVKLVAEAGLYAHLRI 115
E+WPD+I+KSK+GGLDVIETYVFWN HEPV+ QY FEGR+DLV+FVK + EAGL HLRI
Sbjct: 59 EVWPDIIRKSKEGGLDVIETYVFWNYHEPVKGQYYFEGRFDLVRFVKTIQEAGLLVHLRI 118
Query: 116 GPYVCAEWNFGGFPLWLHFIPGIQFRTDNEPFKAEMQRFTAKIVDMMKQEKLYASQGGPI 175
GPY CAEWN+GGFPLWLHFIPGIQFRT NE FK EM+ F KIV+MMK+E L+ASQGGPI
Sbjct: 119 GPYACAEWNYGGFPLWLHFIPGIQFRTTNELFKEEMKLFLTKIVNMMKEENLFASQGGPI 178
Query: 176 ILSQIENEYGNIDSAYGAAGKSYIKWAAGMALSLDTGVPWVMCQQSDAPDPIINTCNGFY 235
IL+Q+ENEYGN++ AYGAAG+ Y+KWAA A+SL+T VPWVMC Q DAPDPIINTCNGFY
Sbjct: 179 ILAQVENEYGNVEWAYGAAGELYVKWAAETAVSLNTSVPWVMCAQVDAPDPIINTCNGFY 238
Query: 236 CDQFTPNSNNKPKMWTENWSGWFLSFGGAVPYRPVEDLAFAVARFFQRGGTFQNYYMYHG 295
CD+F+PNS +KPKMWTEN+SGWFLSFG A+PYRPVEDLAFAVARFF+ GGTFQNYYMY G
Sbjct: 239 CDRFSPNSPSKPKMWTENYSGWFLSFGYAIPYRPVEDLAFAVARFFETGGTFQNYYMYFG 298
Query: 296 GTNFDRTSGGPFISTSYDYDAPLDEYGLIRQPKWGHLKDLHKAIKLCEAALVATDPTYPS 355
GTNF RT+GGP ++TSYDYDAP+DEYG IRQPKWGHL+DLHKAIK CE L+++DP +
Sbjct: 299 GTNFGRTAGGPLVATSYDYDAPIDEYGFIRQPKWGHLRDLHKAIKQCEEHLISSDPIHQQ 358
Query: 356 LGPNLEATVYKTGSGLCSAFLANIGTNSDVTVKFNGNSYLLPAWSVSILPDCKNVVFNTA 415
LG NLEA +Y S C+AFLAN ++SD V FNGN Y LPAWSVSILPDCKNV+FNTA
Sbjct: 359 LGNNLEAHIYYKSSNDCAAFLANYDSSSDANVTFNGNIYFLPAWSVSILPDCKNVIFNTA 418
Query: 416 KINSVTLVPSFSRQSLQVAADSSDAIGSGWSYINEPVGISKDDAFTKPGLLEQINTTADQ 475
K+ + L F S V + I WS+ E VGI +++FT PGLLEQINTT D
Sbjct: 419 KVLILNLGDDFFAHSTSVNEIPLEQI--VWSWYKEEVGIWGNNSFTAPGLLEQINTTKDI 476
Query: 476 SDYLWYSLSTNIKADEPLLEDGSKTVLHVQSLGHALHAFINGKLVGSGYGSSSNAKVTVD 535
SD+LWYS S ++ AD+ +L+++SLGHA F+N LVG YG+ +A ++
Sbjct: 477 SDFLWYSTSISVNADQV-----KDIILNIESLGHAALVFVNKVLVGK-YGNHDDASFSLT 530
Query: 536 FPIALAPGKNTFDLLSLTVGLQNYGAFYEKTGAGITGPVQLKGSGNGTNIDLSSQQWTYQ 595
I+L G NT DLLS+ +G+QNYG +++ GAGI V L G IDLSS++WTYQ
Sbjct: 531 EKISLIEGNNTLDLLSMMIGVQNYGPWFDVQGAGIYA-VLLVGQSK-VKIDLSSEKWTYQ 588
Query: 596 TGLKGEELNFPS---GSSTQWDSKSTLPKLQPLVWYKTTFDAPAGSEPVAIDFTGMGKGE 652
GL+GE +S+ W ++ P + L+WYK TF AP G P+A++ GMGKG+
Sbjct: 589 VGLEGEYFGLDKVSLANSSLWTQGASPPINKSLIWYKGTFVAPEGKGPLALNLAGMGKGQ 648
Query: 653 AWVNGQSIGRYWPTYVSQNGGCTDSCNYRGAYSSNKCLKNCGKPSQSLYHVPRSWLKSSG 712
AWVNGQSIGRYWP Y+S + GC DSC+YRGAY S KCLK CG+P+Q+LYH+PR+W+
Sbjct: 649 AWVNGQSIGRYWPAYLSPSTGCNDSCDYRGAYDSFKCLKKCGQPAQTLYHIPRTWVHPGE 708
Query: 713 NTLVLFEEIGGDPTKISFVTKQLGSSLCSHVTDSHPLPVDMWGSDSKIQRKPGPVLSLEC 772
N LVL EE+GGDP+KIS +T+ G +CS V++ P P D W S S+ + + P + L C
Sbjct: 709 NLLVLHEELGGDPSKISVLTRT-GHEICSIVSEDDPPPADSWKSSSEFKSQ-NPEVRLTC 766
Query: 773 PNPNQVISSIKFASFGTPLGTCGSFSRGRCSSARSLSVVRQACVGSKSCSIGVSVNTFGD 832
I SI FASFGTP G CG+F+ G C A L +V++AC+G + CSI +S GD
Sbjct: 767 EQ-GWHIKSINFASFGTPAGICGTFNPGSC-HADMLDIVQKACIGQEGCSISISAANLGD 824
Query: 833 PCKGVMKSLAVEASCT 848
PC GV+K AVEA C+
Sbjct: 825 PCPGVLKRFAVEARCS 840
>gi|148906967|gb|ABR16628.1| unknown [Picea sitchensis]
Length = 836
Score = 1025 bits (2649), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 495/838 (59%), Positives = 609/838 (72%), Gaps = 20/838 (2%)
Query: 16 VVLATTSFGANVTYDHRAVVIGGKRRVLISGSIHYPRSTPEMWPDLIQKSKDGGLDVIET 75
V+LA VTYDH+A+VI G+RR+LISGSIHYPRST EMWPDL +K+KDGGLDVI+T
Sbjct: 14 VMLAVGGVECGVTYDHKALVINGERRILISGSIHYPRSTAEMWPDLFRKAKDGGLDVIQT 73
Query: 76 YVFWNLHEPVRNQYNFEGRYDLVKFVKLVAEAGLYAHLRIGPYVCAEWNFGGFPLWLHFI 135
YVFWN+HEP YNFEGR+DLVKFVKL EAGLY HLRIGPYVCAEWNFGGFP+WL ++
Sbjct: 74 YVFWNMHEPSPGNYNFEGRFDLVKFVKLAQEAGLYVHLRIGPYVCAEWNFGGFPVWLKYV 133
Query: 136 PGIQFRTDNEPFKAEMQRFTAKIVDMMKQEKLYASQGGPIILSQIENEYGNIDSAYGAAG 195
PGI FRTDNEPFK M+ FT K+VD+MK E L+ SQGGPIIL+Q+ENEY + YG AG
Sbjct: 134 PGISFRTDNEPFKNAMEGFTKKVVDLMKSEGLFESQGGPIILAQVENEYKPEEMEYGLAG 193
Query: 196 KSYIKWAAGMALSLDTGVPWVMCQQSDAPDPIINTCNGFYCDQFTPNSNNKPKMWTENWS 255
Y+ WAA MA+ +DTGVPWVMC+Q DAPDP+INTCNGFYCD F PN KP MWTE WS
Sbjct: 194 AQYMNWAAQMAVGMDTGVPWVMCKQDDAPDPVINTCNGFYCDNFVPNKPYKPTMWTEAWS 253
Query: 256 GWFLSFGGAVPYRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNFDRTSGGPFISTSYDYD 315
GW+ FGGA P+RPVEDLAFAVARFF +GG+F NYYMYHGGTNF RT+GGPFI+TSYDYD
Sbjct: 254 GWYTEFGGASPHRPVEDLAFAVARFFVKGGSFVNYYMYHGGTNFGRTAGGPFIATSYDYD 313
Query: 316 APLDEYGLIRQPKWGHLKDLHKAIKLCEAALVATDPTYPSLGPNLEATVYKTGSGLCSAF 375
AP+DEYGLIRQPKWGHLK+LHKAIKLCE ALV+ DP SLG +A VY G+G C+AF
Sbjct: 314 APIDEYGLIRQPKWGHLKELHKAIKLCEPALVSGDPVVTSLGHFQQAYVYSAGAGNCAAF 373
Query: 376 LANIGTNSDVTVKFNGNSYLLPAWSVSILPDCKNVVFNTAKINSVTLVPSFSRQSLQVAA 435
+ N +NS V FNG Y + WSVSILPDC+NVVFNTAK++ T S+ +
Sbjct: 374 IVNYDSNSVGRVIFNGQRYKIAPWSVSILPDCRNVVFNTAKVDVQT-----SQMKMTPVG 428
Query: 436 DSSDAIGSGWSYINEPVGISKDDAFTKPGLLEQINTTADQSDYLWYSLSTNIKADEPLLE 495
G GW I+E + +D++ + GLLEQIN T D +DYLWY S + DEP ++
Sbjct: 429 ------GFGWESIDENIASFEDNSISAVGLLEQINITRDNTDYLWYITSVEVDEDEPFIK 482
Query: 496 DGSKTVLHVQSLGHALHAFINGKLVGSGYGSSSNAKVTVDFPIALAPGKNTFDLLSLTVG 555
+G VL VQS G ALH FIN L GS YG N KV + L G N LLS+TVG
Sbjct: 483 NGGLPVLTVQSAGDALHVFINDDLAGSQYGRKENPKVRFSSGVRLNVGTNKISLLSMTVG 542
Query: 556 LQNYGAFYEKTGAGITGPVQLKGSGNGTNIDLSSQQWTYQTGLKGEELNF-PSGSST-QW 613
LQN G +E AG+ GP+ L G +GT DLSSQ+W+YQ GLKGE +N SG +T +W
Sbjct: 543 LQNIGPHFEMANAGVLGPITLSGFKDGTR-DLSSQRWSYQIGLKGETMNLHTSGDNTVEW 601
Query: 614 DSKSTLPKLQPLVWYKTTFDAPAGSEPVAIDFTGMGKGEAWVNGQSIGRYWPTYVSQNGG 673
+P+ QPL WYK FDAPAG +P+ +D + MGKG+AWVNGQSIGRYWP+Y+++ G
Sbjct: 602 MKGVAVPQSQPLRWYKAEFDAPAGEDPLGLDLSSMGKGQAWVNGQSIGRYWPSYLAE-GV 660
Query: 674 CTDSCNYRGAYSSNKCLKNCGKPSQSLYHVPRSWLKSSGNTLVLFEEIGGDPTKISFVTK 733
C+D C+Y G Y +KC NCG+ SQ YHVPRSWL+ SGNTLVLFEEIGG+P+ +S VT+
Sbjct: 661 CSDGCSYEGTYRPHKCDTNCGQSSQRWYHVPRSWLQPSGNTLVLFEEIGGNPSGVSLVTR 720
Query: 734 QLGSSLCSHVTDSHPLPVDMWGSDS--KIQRKPGPVLSLECPNPNQVISSIKFASFGTPL 791
+ S+C+HV++SH ++ W +S ++Q+ P + L+C + Q IS+IKFASFGTP
Sbjct: 721 SV-DSVCAHVSESHSQSINFWRLESTDQVQKLHIPKVHLQC-SKGQRISAIKFASFGTPQ 778
Query: 792 GTCGSFSRGRCSSARSLSVVRQACVGSKSCSIGVSVNTF-GDPCKGVMKSLAVEASCT 848
G CGSF +G C S S++ +++ C+G + CS+ VS F GDPC GV K +A+EA C+
Sbjct: 779 GLCGSFQQGDCHSPNSVATIQKKCMGLRKCSLSVSEKIFGGDPCPGVRKGVAIEAVCS 836
>gi|224116208|ref|XP_002317239.1| predicted protein [Populus trichocarpa]
gi|222860304|gb|EEE97851.1| predicted protein [Populus trichocarpa]
Length = 849
Score = 1001 bits (2588), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 484/829 (58%), Positives = 610/829 (73%), Gaps = 23/829 (2%)
Query: 27 VTYDHRAVVIGGKRRVLISGSIHYPRSTPEMWPDLIQKSKDGGLDVIETYVFWNLHEPVR 86
VTYDH+A+VI GKRRVL SGSIHYPR+TPE+WP++I+KSK+GGLDVIETYVFWN HEPVR
Sbjct: 36 VTYDHKALVIDGKRRVLQSGSIHYPRTTPEVWPEIIRKSKEGGLDVIETYVFWNYHEPVR 95
Query: 87 NQYNFEGRYDLVKFVKLVAEAGLYAHLRIGPYVCAEWNFGGFPLWLHFIPGIQFRTDNEP 146
QY FEGR+DLV+FVK V EAGL+ HLRIGPY CAEWN+GGFPLWLHFIPG+QFRT N+
Sbjct: 96 GQYYFEGRFDLVRFVKTVQEAGLFVHLRIGPYACAEWNYGGFPLWLHFIPGVQFRTSNDI 155
Query: 147 FKAEMQRFTAKIVDMMKQEKLYASQGGPIILSQIENEYGNIDSAYGAAGKSYIKWAAGMA 206
FK M+ F KIVD+MK + L+ASQGGPIIL+Q+ENEYGN+ AYG G+ Y+KWAA A
Sbjct: 156 FKNAMKSFLTKIVDLMKDDNLFASQGGPIILAQVENEYGNVQWAYGVGGELYVKWAAETA 215
Query: 207 LSLDTGVPWVMCQQSDAPDPIINTCNGFYCDQFTPNSNNKPKMWTENWSGWFLSFGGAVP 266
+SL+T VPWVMC Q DAPDP+INTCNGFYCDQFTPNS +KPKMWTEN+SGWFL+FG AVP
Sbjct: 216 ISLNTTVPWVMCVQEDAPDPVINTCNGFYCDQFTPNSPSKPKMWTENYSGWFLAFGYAVP 275
Query: 267 YRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNFDRTSGGPFISTSYDYDAPLDEYGLIRQ 326
YRPVEDLAFAVARFF+ GG+FQNYYMY GGTNF RT+GGP ++TSYDYDAP+DEYG IRQ
Sbjct: 276 YRPVEDLAFAVARFFEYGGSFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYGFIRQ 335
Query: 327 PKWGHLKDLHKAIKLCEAALVATDPTYPSLGPNLEATVYKTGSGLCSAFLANIGTNSDVT 386
PKWGHL+DLH AIK CE LV++DP + LG LEA VY S C+AFLAN + SD
Sbjct: 336 PKWGHLRDLHSAIKQCEEYLVSSDPVHQQLGNKLEAHVYYKHSNDCAAFLANYDSGSDAN 395
Query: 387 VKFNGNSYLLPAWSVSILPDCKNVVFNTAKINSVTLVPS--FSRQSLQVAADSSDAIGSG 444
V FNGN+Y LPAWSVSIL DCKNV+FNTAK+ + + FSR + D + S
Sbjct: 396 VTFNGNTYFLPAWSVSILADCKNVIFNTAKVVTQRHIGDALFSRST---TVDGNLVAASP 452
Query: 445 WSYINEPVGISKDDAFTKPGLLEQINTTADQSDYLWYSLSTNIKADEPLLEDGSKTVLHV 504
WS+ E VGI +++FTKPGLLEQINTT D SD+LWYS S ++A + + +L++
Sbjct: 453 WSWYKEEVGIWGNNSFTKPGLLEQINTTKDTSDFLWYSTSLYVEAGQ-----DKEHLLNI 507
Query: 505 QSLGHALHAFINGKLVGSGYGSSSNAKVTVDFPIALAPGKNTFDLLSLTVGLQNYGAFYE 564
+SLGHA F+N + V GYG+ +A ++ I+L G NT D+LS+ +G+QNYG +++
Sbjct: 508 ESLGHAALVFVNKRFVAFGYGNHDDASFSLTREISLEEGNNTLDVLSMLIGVQNYGPWFD 567
Query: 565 KTGAGITGP--VQLKGSGNGTNIDLSSQQWTYQTGLKGEEL---NFPSGSSTQWDSKSTL 619
GAGI V L S DLSS +WTYQ GL+GE L N +S+ W ++L
Sbjct: 568 VQGAGIHSVFLVDLHKSKK----DLSSGKWTYQVGLEGEYLGLDNVSLANSSLWSQGTSL 623
Query: 620 PKLQPLVWYKTTFDAPAGSEPVAIDFTGMGKGEAWVNGQSIGRYWPTYVSQNGGCTDSCN 679
P + L+WYK T AP G+ P+A++ MGKG+AW+NGQSIGRYW Y+S + GCTD+C+
Sbjct: 624 PVNKSLIWYKATIIAPEGNGPLALNLASMGKGQAWINGQSIGRYWSAYLSPSAGCTDNCD 683
Query: 680 YRGAYSSNKCLKNCGKPSQSLYHVPRSWLKSSGNTLVLFEEIGGDPTKISFVTKQLGSSL 739
YRGAY+S KC K CG+P+Q+LYH+PR+W+ N LVL EE+GGDP++IS +T+ G +
Sbjct: 684 YRGAYNSFKCQKKCGQPAQTLYHIPRTWVHPGENLLVLHEELGGDPSQISLLTRT-GQDI 742
Query: 740 CSHVTDSHPLPVDMWGSDSKIQRKPGPVLSLECPNPNQVISSIKFASFGTPLGTCGSFSR 799
CS V++ P P D W + + + P + L C + I++I FASFGTP G CG+F+
Sbjct: 743 CSIVSEDDPPPADSWKPNLEFMSQ-SPEVRLTCEH-GWHIAAINFASFGTPEGKCGTFTP 800
Query: 800 GRCSSARSLSVVRQACVGSKSCSIGVSVNTFGDPCKGVMKSLAVEASCT 848
G C A L++V++AC+G + CSI +S GDPC GV+K VEA C+
Sbjct: 801 GNC-HADMLTIVQKACIGHERCSIPISAAKLGDPCPGVVKRFVVEALCS 848
>gi|61162201|dbj|BAD91082.1| beta-D-galactosidase [Pyrus pyrifolia]
Length = 854
Score = 987 bits (2551), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 486/829 (58%), Positives = 586/829 (70%), Gaps = 24/829 (2%)
Query: 27 VTYDHRAVVIGGKRRVLISGSIHYPRSTPEMWPDLIQKSKDGGLDVIETYVFWNLHEPVR 86
VTYD +A+VI G+RR+LISGSIHYPRSTPEMW DLIQK+KDGGLDV+ETYVFWN+HEP
Sbjct: 28 VTYDRKAIVINGQRRILISGSIHYPRSTPEMWEDLIQKAKDGGLDVVETYVFWNVHEPTP 87
Query: 87 NQYNFEGRYDLVKFVKLVAEAGLYAHLRIGPYVCAEWNFGGFPLWLHFIPGIQFRTDNEP 146
YNFEGRYDLV+F+K + +AGLYAHLRIGPYVCAEWNFGGFP+WL ++PGI FRTDNEP
Sbjct: 88 GNYNFEGRYDLVRFLKTIQKAGLYAHLRIGPYVCAEWNFGGFPVWLKYVPGISFRTDNEP 147
Query: 147 FKAEMQRFTAKIVDMMKQEKLYASQGGPIILSQIENEYGNIDSAYGAAGKSYIKWAAGMA 206
FK MQ FT KIV +MK E L+ SQGGPIILSQIENEYG +GAAG +YI WAA MA
Sbjct: 148 FKRAMQGFTQKIVGLMKSESLFESQGGPIILSQIENEYGAQSKLFGAAGHNYITWAAEMA 207
Query: 207 LSLDTGVPWVMCQQSDAPDPIINTCNGFYCDQFTPNSNNKPKMWTENWSGWFLSFGGAVP 266
+ LDTGVPWVMC++ DAPDP+INTCNGFYCD F+PN KP +WTE WSGWF FGG +
Sbjct: 208 VGLDTGVPWVMCKEEDAPDPVINTCNGFYCDSFSPNRPYKPTIWTETWSGWFTEFGGPIH 267
Query: 267 YRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNFDRTSGGPFISTSYDYDAPLDEYGLIRQ 326
RPV+DLA+AVA F Q+GG+F NYYMYHGGTNF RT+GGPFI+TSYDYDAPLDEYGLIRQ
Sbjct: 268 QRPVQDLAYAVATFIQKGGSFVNYYMYHGGTNFGRTAGGPFITTSYDYDAPLDEYGLIRQ 327
Query: 327 PKWGHLKDLHKAIKLCEAALVATDPTYPSLGPNLEATVYKTGSGLCSAFLANIGTNSDVT 386
PK+GHLK+LHKAIK+CE ALV+ DP SLG +A VY + SG CSAFL+N + S
Sbjct: 328 PKYGHLKELHKAIKMCERALVSADPIITSLGNFQQAYVYTSESGDCSAFLSNHDSKSAAR 387
Query: 387 VKFNGNSYLLPAWSVSILPDCKNVVFNTAKINSVTLVPSFSRQSLQVAADSSDAIGSGW- 445
V FN Y LP WS+SILPDC+NVVFNTAK+ Q+ Q+ ++ W
Sbjct: 388 VMFNNMHYNLPPWSISILPDCRNVVFNTAKVGV---------QTSQMQMLPTNIPMLSWE 438
Query: 446 SYINEPVGISKDDAFTKPGLLEQINTTADQSDYLWYSLSTNIKADEPLLEDGSKTVLHVQ 505
SY + + T PGLLEQIN T D +DYLWY S +I + E L G L VQ
Sbjct: 439 SYDEDLTSMDDSSTMTAPGLLEQINVTRDSTDYLWYITSVDIDSSESFLHGGELPTLIVQ 498
Query: 506 SLGHALHAFINGKLVGSGYGSSSNAKVTVDFPIALAPGKNTFDLLSLTVGLQNYGAFYEK 565
S GHA+H FING+L GS +G+ + + T + L G N LLS+ VGL N G +E
Sbjct: 499 STGHAVHIFINGQLTGSAFGTRESRRFTYTGKVNLRAGTNKIALLSVAVGLPNVGGHFEA 558
Query: 566 TGAGITGPVQLKGSGNGTNIDLSSQQWTYQTGLKGEELNFPSG---SSTQWDSKSTLP-- 620
GI GPV L G G DLS Q+WTYQ GLKGE +N S SS +W S S +
Sbjct: 559 WNTGILGPVALHGLNQG-KWDLSWQKWTYQVGLKGEAMNLVSQNAFSSVEWISGSLIAQK 617
Query: 621 KLQPLVWYKTTFDAPAGSEPVAIDFTGMGKGEAWVNGQSIGRYWPTYVSQNGGCTDSCNY 680
K QPL W+KT F+ P GSEP+A+D GMGKG+ W+NGQSIGRYW + NG C + C+Y
Sbjct: 618 KQQPLTWHKTIFNEPEGSEPLALDMEGMGKGQIWINGQSIGRYWTAFA--NGNC-NGCSY 674
Query: 681 RGAYSSNKCLKNCGKPSQSLYHVPRSWLKSSGNTLVLFEEIGGDPTKISFVTKQLGSSLC 740
G + KC CGKP+Q YHVPRSWLK + N LVLFEE+GGDP++IS V + + SS+C
Sbjct: 675 AGGFRPTKCQSGCGKPTQRYYHVPRSWLKPTQNLLVLFEELGGDPSRISLVKRAV-SSVC 733
Query: 741 SHVTDSHPLPVDMWGSDS--KIQRKPGPVLSLECPNPNQVISSIKFASFGTPLGTCGSFS 798
S V + HP + W +S K++ P + L C NP Q ISSIKFASFGTPLGTCGS+
Sbjct: 734 SEVAEYHPT-IKNWHIESYGKVEDFHSPKVHLRC-NPGQAISSIKFASFGTPLGTCGSYQ 791
Query: 799 RGRCSSARSLSVVRQACVGSKSCSIGVSVNTFGDPCKGVMKSLAVEASC 847
G C + S SVV++ C+G + C++ +S + FGDPC V+K L+VEA C
Sbjct: 792 EGTCHATTSYSVVQKKCIGKQRCAVTISNSNFGDPCPKVLKRLSVEAVC 840
>gi|255546097|ref|XP_002514108.1| beta-galactosidase, putative [Ricinus communis]
gi|223546564|gb|EEF48062.1| beta-galactosidase, putative [Ricinus communis]
Length = 840
Score = 983 bits (2540), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 492/836 (58%), Positives = 583/836 (69%), Gaps = 29/836 (3%)
Query: 22 SFGANVTYDHRAVVIGGKRRVLISGSIHYPRSTPEMWPDLIQKSKDGGLDVIETYVFWNL 81
S A V+YDHRA+ I G+RR+LISGSIHYPRSTPEMWPDLIQK+KDGGLDVI+TYVFWN
Sbjct: 25 SILATVSYDHRAITINGQRRILISGSIHYPRSTPEMWPDLIQKAKDGGLDVIQTYVFWNG 84
Query: 82 HEPVRNQYNFEGRYDLVKFVKLVAEAGLYAHLRIGPYVCAEWNFGGFPLWLHFIPGIQFR 141
HEP Y FE RYDLVKF+K+V AGLY HLRIGPY+CAEWNFGGFP+WL ++PGI+FR
Sbjct: 85 HEPSPGNYYFEDRYDLVKFIKVVQAAGLYVHLRIGPYICAEWNFGGFPVWLKYVPGIEFR 144
Query: 142 TDNEPFKAEMQRFTAKIVDMMKQEKLYASQGGPIILSQIENEYGNIDSAYGAAGKSYIKW 201
TDN PFKA MQ+FT KIV MMK EKL+ SQGGPIILSQIENE+G ++ GA GK+Y KW
Sbjct: 145 TDNGPFKAAMQKFTEKIVSMMKSEKLFESQGGPIILSQIENEFGPVEWEIGAPGKAYTKW 204
Query: 202 AAGMALSLDTGVPWVMCQQSDAPDPIINTCNGFYCDQFTPNSNNKPKMWTENWSGWFLSF 261
AA MA+ L TGVPWVMC+Q DAPDP+INTCNGFYC+ F PN + KPK+WTENW+GW+ F
Sbjct: 205 AADMAVKLGTGVPWVMCKQDDAPDPVINTCNGFYCENFKPNKDYKPKLWTENWTGWYTEF 264
Query: 262 GGAVPYRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNFDRTSGGPFISTSYDYDAPLDEY 321
GGAVPYRP EDLAF+VARF Q GG+F NYYMYHGGTNF RTS G FI+TSYDYDAPLDEY
Sbjct: 265 GGAVPYRPAEDLAFSVARFIQNGGSFMNYYMYHGGTNFGRTSAGLFIATSYDYDAPLDEY 324
Query: 322 GLIRQPKWGHLKDLHKAIKLCEAALVATDPTYPSLGPNLEATVYKTGSGLCSAFLANIGT 381
GL R PKWGHL+DLHKAIKLCE ALV+ DPT SLG N EA V+++ S C+AFLAN T
Sbjct: 325 GLTRDPKWGHLRDLHKAIKLCEPALVSVDPTVKSLGSNQEAHVFQSKSS-CAAFLANYDT 383
Query: 382 NSDVTVKFNGNSYLLPAWSVSILPDCKNVVFNTAKINSVTLVPSFSRQSLQVAADSSDAI 441
V V F Y LP WS+SILPDCK VFNTA++ + QS Q+
Sbjct: 384 KYSVKVTFGNGQYDLPPWSISILPDCKTAVFNTARLGA---------QSSQMKMTPVGGA 434
Query: 442 GSGWSYINEPVGISKDDAFTKPGLLEQINTTADQSDYLWYSLSTNIKADEPLLEDGSKTV 501
S SYI E DD T GL EQIN T D SDYLWY + NI +DE L++G V
Sbjct: 435 LSWQSYIEEAATGYTDDTTTLEGLWEQINVTRDASDYLWYMTNVNIDSDEGFLKNGDSPV 494
Query: 502 LHVQSLGHALHAFINGKLVGSGYGSSSNAKVTVDFPIALAPGKNTFDLLSLTVGLQNYGA 561
L + S GH+LH FING+L G+ YGS N K+T + L G N LLS+ VGL N G
Sbjct: 495 LTIFSAGHSLHVFINGQLAGTVYGSLENPKLTFSQNVKLTAGINKISLLSVAVGLPNVGV 554
Query: 562 FYEKTGAGITGPVQLKGSGNGTNIDLSSQQWTYQTGLKGEELNFPS---GSSTQWDSKST 618
+EK AGI GPV LKG GT DLS +W+Y+ GLKGE L+ + SS +W S
Sbjct: 555 HFEKWNAGILGPVTLKGLNEGTR-DLSGWKWSYKIGLKGEALSLHTVTGSSSVEWVEGSL 613
Query: 619 LPKLQPLVWYKTTFDAPAGSEPVAIDFTGMGKGEAWVNGQSIGRYWPTYVSQNGGCTDSC 678
K QPL WYK TFDAP G++PVA+D + MGKG+ WVNGQSIGR+WP Y ++ G C+ +C
Sbjct: 614 SAKKQPLTWYKATFDAPEGNDPVALDMSSMGKGQIWVNGQSIGRHWPAYTAR-GSCS-AC 671
Query: 679 NYRGAYSSNKCLKNCGKPSQSLYHVPRSWLKSSGNTLVLFEEIGGDPTKISFVTKQLGSS 738
NY G Y KC NCG+PSQ YHVPRSWL SGN LV+FEE GG+P+ IS V + G S
Sbjct: 672 NYAGTYDDKKCRSNCGEPSQRWYHVPRSWLNPSGNLLVVFEEWGGEPSGISLVKRTTG-S 730
Query: 739 LCSHVTDSHPLPVDMW-----GSDSKIQRKPGPVLSLECPNPNQVISSIKFASFGTPLGT 793
+C+ + + P + W G +Q P L CP+ Q IS IKFAS+G+P GT
Sbjct: 731 VCADIFEGQP-ALKNWQMIALGRLDHLQ----PKAHLWCPH-GQKISKIKFASYGSPQGT 784
Query: 794 CGSFSRGRCSSARSLSVVRQACVGSKSCSIGVSVNTF-GDPCKGVMKSLAVEASCT 848
CGSF G C + +S + C+G +SCS+ V+ F GDPC K L+VEA CT
Sbjct: 785 CGSFKAGSCHAHKSYDAFEKKCIGKQSCSVTVAAEVFGGDPCPDSSKKLSVEAVCT 840
>gi|1168654|sp|P45582.1|BGAL_ASPOF RecName: Full=Beta-galactosidase; Short=Lactase; Flags: Precursor
gi|452712|emb|CAA54525.1| beta-galactosidase [Asparagus officinalis]
Length = 832
Score = 982 bits (2539), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 492/832 (59%), Positives = 588/832 (70%), Gaps = 34/832 (4%)
Query: 25 ANVTYDHRAVVIGGKRRVLISGSIHYPRSTPEMWPDLIQKSKDGGLDVIETYVFWNLHEP 84
A+VTYDH++V+I G+RR+LISGSIHYPRSTPEMWPDLIQK+KDGGLDVI+TYVFWN HEP
Sbjct: 25 ASVTYDHKSVIINGQRRILISGSIHYPRSTPEMWPDLIQKAKDGGLDVIQTYVFWNGHEP 84
Query: 85 VRNQYNFEGRYDLVKFVKLVAEAGLYAHLRIGPYVCAEWNFGGFPLWLHFIPGIQFRTDN 144
QY F GRYDLV+F+KLV +AGLYAHLRIGPYVCAEWNFGGFP+WL ++PGI FRTDN
Sbjct: 85 SPGQYYFGGRYDLVRFLKLVKQAGLYAHLRIGPYVCAEWNFGGFPVWLKYVPGIHFRTDN 144
Query: 145 EPFKAEMQRFTAKIVDMMKQEKLYASQGGPIILSQIENEYGNIDSAYGAAGKSYIKWAAG 204
PFKA M +FT KIV MMK E LY +QGGPIILSQIENEYG ++ GAAGKSY WAA
Sbjct: 145 GPFKAAMGKFTEKIVSMMKAEGLYETQGGPIILSQIENEYGPVEYYDGAAGKSYTNWAAK 204
Query: 205 MALSLDTGVPWVMCQQSDAPDPIINTCNGFYCDQFTPNSNNKPKMWTENWSGWFLSFGGA 264
MA+ L+TGVPWVMC+Q DAPDP+INTCNGFYCD F+PN +NKPKMWTE W+GWF FGGA
Sbjct: 205 MAVGLNTGVPWVMCKQDDAPDPVINTCNGFYCDYFSPNKDNKPKMWTEAWTGWFTGFGGA 264
Query: 265 VPYRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNFDRTSGGPFISTSYDYDAPLDEYGLI 324
VP RP ED+AFAVARF Q+GG+F NYYMYHGGTNF RT+GGPFISTSYDYDAP+DEYGL+
Sbjct: 265 VPQRPAEDMAFAVARFIQKGGSFINYYMYHGGTNFGRTAGGPFISTSYDYDAPIDEYGLL 324
Query: 325 RQPKWGHLKDLHKAIKLCEAALVATDPTYPSLGPNLEATVYKTGSGLCSAFLANIGTNSD 384
RQPKWGHL+DLHKAIKLCE ALV+ +PT SLG N E+ VY++ S C+AFLAN +
Sbjct: 325 RQPKWGHLRDLHKAIKLCEPALVSGEPTITSLGQNQESYVYRSKSS-CAAFLANFNSRYY 383
Query: 385 VTVKFNGNSYLLPAWSVSILPDCKNVVFNTAKINSVTLVPSFSRQSLQVAADSSDAIGSG 444
TV FNG Y LP WSVSILPDCK VFNTA++ + T + +Q G
Sbjct: 384 ATVTFNGMHYNLPPWSVSILPDCKTTVFNTARVGAQT-----TTMKMQYLG------GFS 432
Query: 445 WSYINEPVGISKDDAFTKPGLLEQINTTADQSDYLWYSLSTNIKADEPLLEDGSKTVLHV 504
W E D+ FTK GL+EQ++TT D+SDYLWY+ +I +E L+ G L V
Sbjct: 433 WKAYTEDTDALNDNTFTKDGLVEQLSTTWDRSDYLWYTTYVDIAKNEEFLKTGKYPYLTV 492
Query: 505 QSLGHALHAFINGKLVGSGYGSSSNAKVTVDFPIALAPGKNTFDLLSLTVGLQNYGAFYE 564
S GHA+H FING+L G+ YGS N K+T L G N +LS++VGL N G +E
Sbjct: 493 MSAGHAVHVFINGQLSGTAYGSLDNPKLTYSGSAKLWAGSNKISILSVSVGLPNVGNHFE 552
Query: 565 KTGAGITGPVQLKGSGNGTNIDLSSQQWTYQTGLKGEELNFPS---GSSTQWDSKSTLPK 621
G+ GPV L G G DLS Q+WTYQ GL GE L+ S S+ +W S +
Sbjct: 553 TWNTGVLGPVTLTGLNEGKR-DLSLQKWTYQIGLHGETLSLHSLTGSSNVEWGEAS---Q 608
Query: 622 LQPLVWYKTTFDAPAGSEPVAIDFTGMGKGEAWVNGQSIGRYWPTYVSQNGGCTDSCNYR 681
QPL WYKT F+AP G+EP+A+D MGKG+ W+NGQSIGRYWP Y + G SC+YR
Sbjct: 609 KQPLTWYKTFFNAPPGNEPLALDMNTMGKGQIWINGQSIGRYWPAY--KASGSCGSCDYR 666
Query: 682 GAYSSNKCLKNCGKPSQSLYHVPRSWLKSSGNTLVLFEEIGGDPTKISFVTKQLGSSLCS 741
G Y+ KCL NCG+ SQ YHVPRSWL +GN LV+ EE GGDPT IS V + + +S+C+
Sbjct: 667 GTYNEKKCLSNCGEASQRWYHVPRSWLIPTGNFLVVLEEWGGDPTGISMVKRSV-ASVCA 725
Query: 742 HVTDSHPLPVDMWGSDSKIQRKPGPVLSLECPNPNQVISSIKFASFGTPLGTCGSFSRGR 801
V + P +D W + + P + L C +P Q +S IKFASFGTP GTCGSFS G
Sbjct: 726 EVEELQPT-MDNW----RTKAYGRPKVHLSC-DPGQKMSKIKFASFGTPQGTCGSFSEGS 779
Query: 802 CSSARSLSVVRQA-----CVGSKSCSIGVSVNTF-GDPCKGVMKSLAVEASC 847
C + +S Q CVG + CS+ V+ F GDPC G MK LAVEA C
Sbjct: 780 CHAHKSYDAFEQEGLMQNCVGQEFCSVNVAPEVFGGDPCPGTMKKLAVEAIC 831
>gi|108707234|gb|ABF95029.1| Beta-galactosidase precursor, putative, expressed [Oryza sativa
Japonica Group]
gi|108707235|gb|ABF95030.1| Beta-galactosidase precursor, putative, expressed [Oryza sativa
Japonica Group]
Length = 702
Score = 981 bits (2536), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 477/706 (67%), Positives = 568/706 (80%), Gaps = 12/706 (1%)
Query: 151 MQRFTAKIVDMMKQEKLYASQGGPIILSQIENEYGNIDSAYGAAGKSYIKWAAGMALSLD 210
MQRFT K+VD MK LYASQGGPIILSQIENEYGNIDSAYGAAGK+Y++WAAGMA+SLD
Sbjct: 1 MQRFTEKVVDTMKGAGLYASQGGPIILSQIENEYGNIDSAYGAAGKAYMRWAAGMAVSLD 60
Query: 211 TGVPWVMCQQSDAPDPIINTCNGFYCDQFTPNSNNKPKMWTENWSGWFLSFGGAVPYRPV 270
TGVPWVMCQQSDAPDP+INTCNGFYCDQFTPNS +KPKMWTENWSGWFLSFGGAVPYRP
Sbjct: 61 TGVPWVMCQQSDAPDPLINTCNGFYCDQFTPNSKSKPKMWTENWSGWFLSFGGAVPYRPA 120
Query: 271 EDLAFAVARFFQRGGTFQNYYMYHGGTNFDRTSGGPFISTSYDYDAPLDEYGLIRQPKWG 330
EDLAFAVARF+QRGGTFQNYYMYHGGTNF R++GGPFI+TSYDYDAP+DEYG++RQPKWG
Sbjct: 121 EDLAFAVARFYQRGGTFQNYYMYHGGTNFGRSTGGPFIATSYDYDAPIDEYGMVRQPKWG 180
Query: 331 HLKDLHKAIKLCEAALVATDPTYPSLGPNLEATVYKTG-SGLCSAFLANIGTNSDVTVKF 389
HL+D+HKAIKLCE AL+A +P+Y SLG N EATVY+T + +C+AFLAN+ SD TVKF
Sbjct: 181 HLRDVHKAIKLCEPALIAAEPSYSSLGQNTEATVYQTADNSICAAFLANVDAQSDKTVKF 240
Query: 390 NGNSYLLPAWSVSILPDCKNVVFNTAKINSVTLVPSFSR--QSLQVAADS---SDAIGSG 444
NGN+Y LPAWSVSILPDCKNVV NTA+INS S+Q DS + +G
Sbjct: 241 NGNTYKLPAWSVSILPDCKNVVLNTAQINSQVTTSEMRSLGSSIQDTDDSLITPELATAG 300
Query: 445 WSYINEPVGISKDDAFTKPGLLEQINTTADQSDYLWYSLSTNIKADEPLLEDGSKTVLHV 504
WSY EPVGI+K++A TKPGL+EQINTTAD SD+LWYS S +K DEP L +GS++ L V
Sbjct: 301 WSYAIEPVGITKENALTKPGLMEQINTTADASDFLWYSTSIVVKGDEPYL-NGSQSNLLV 359
Query: 505 QSLGHALHAFINGKLVGSGYGSSSNAKVTVDFPIALAPGKNTFDLLSLTVGLQNYGAFYE 564
SLGH L +INGKL GS GS+S++ +++ P+ L PGKN DLLS TVGL NYGAF++
Sbjct: 360 NSLGHVLQIYINGKLAGSAKGSASSSLISLQTPVTLVPGKNKIDLLSTTVGLSNYGAFFD 419
Query: 565 KTGAGITGPVQLKGSGNGTNIDLSSQQWTYQTGLKGEELNF--PSGSSTQWDSKSTLPKL 622
GAG+TGPV+L G NG ++LSS WTYQ GL+GE+L+ PS +S +W S + P
Sbjct: 420 LVGAGVTGPVKLSGP-NGA-LNLSSTDWTYQIGLRGEDLHLYNPSEASPEWVSDNAYPTN 477
Query: 623 QPLVWYKTTFDAPAGSEPVAIDFTGMGKGEAWVNGQSIGRYWPTYVSQNGGCTDSCNYRG 682
QPL+WYKT F APAG +PVAIDFTGMGKGEAWVNGQSIGRYWPT ++ GC +SCNYRG
Sbjct: 478 QPLIWYKTKFTAPAGDDPVAIDFTGMGKGEAWVNGQSIGRYWPTNLAPQSGCVNSCNYRG 537
Query: 683 AYSSNKCLKNCGKPSQSLYHVPRSWLKSSGNTLVLFEEIGGDPTKISFVTKQLGSSLCSH 742
AYSSNKCLK CG+PSQ+LYHVPRS+L+ N LVLFE+ GGDP+ ISF T+Q SS+C+H
Sbjct: 538 AYSSNKCLKKCGQPSQTLYHVPRSFLQPGSNDLVLFEQFGGDPSMISFTTRQT-SSICAH 596
Query: 743 VTDSHPLPVDMWGSDSKIQRKPGPVLSLECPNPNQVISSIKFASFGTPLGTCGSFSRGRC 802
V++ HP +D W S + + GP L LECP QVIS+IKFASFGTP GTCG+++ G C
Sbjct: 597 VSEMHPAQIDSWISPQQTSQTQGPALRLECPREGQVISNIKFASFGTPSGTCGNYNHGEC 656
Query: 803 SSARSLSVVRQACVGSKSCSIGVSVNTFGDPCKGVMKSLAVEASCT 848
SS+++L+VV++ACVG +CS+ VS N FGDPC GV KSL VEA+C+
Sbjct: 657 SSSQALAVVQEACVGMTNCSVPVSSNNFGDPCSGVTKSLVVEAACS 702
>gi|224087947|ref|XP_002308268.1| predicted protein [Populus trichocarpa]
gi|222854244|gb|EEE91791.1| predicted protein [Populus trichocarpa]
Length = 838
Score = 981 bits (2535), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 484/832 (58%), Positives = 585/832 (70%), Gaps = 27/832 (3%)
Query: 25 ANVTYDHRAVVIGGKRRVLISGSIHYPRSTPEMWPDLIQKSKDGGLDVIETYVFWNLHEP 84
A+V+YDH+AV+I G+RR+LISGSIHYPRSTPEMWPDLIQK+KDGG+DVI+TYVFWN HEP
Sbjct: 26 ASVSYDHKAVIINGQRRILISGSIHYPRSTPEMWPDLIQKAKDGGVDVIQTYVFWNGHEP 85
Query: 85 VRNQYNFEGRYDLVKFVKLVAEAGLYAHLRIGPYVCAEWNFGGFPLWLHFIPGIQFRTDN 144
Y FE RYDLVKF+KLV +AGLY HLRIGPY+CAEWNFGGFP+WL ++PGI+FRTDN
Sbjct: 86 SPGNYYFEDRYDLVKFIKLVQQAGLYLHLRIGPYICAEWNFGGFPVWLKYVPGIEFRTDN 145
Query: 145 EPFKAEMQRFTAKIVDMMKQEKLYASQGGPIILSQIENEYGNIDSAYGAAGKSYIKWAAG 204
PFKA MQ+FT KIV MMK EKL+ +QGGPIILSQIENEYG ++ GA GK+Y KWAA
Sbjct: 146 GPFKAAMQKFTEKIVGMMKSEKLFENQGGPIILSQIENEYGPVEWEIGAPGKAYTKWAAD 205
Query: 205 MALSLDTGVPWVMCQQSDAPDPIINTCNGFYCDQFTPNSNNKPKMWTENWSGWFLSFGGA 264
MA+ L TGVPW+MC+Q DAPDP+I+TCNGFYC+ F PN + KPK+WTE W+GW+ FGGA
Sbjct: 206 MAVKLGTGVPWIMCKQEDAPDPMIDTCNGFYCENFKPNKDYKPKIWTEAWTGWYTEFGGA 265
Query: 265 VPYRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNFDRTSGGPFISTSYDYDAPLDEYGLI 324
VP+RP ED+AF+VARF Q GG++ NYYMYHGGTNF RT+GGPFI+TSYDYDAPLDE+GL
Sbjct: 266 VPHRPAEDMAFSVARFIQNGGSYINYYMYHGGTNFGRTAGGPFIATSYDYDAPLDEFGLP 325
Query: 325 RQPKWGHLKDLHKAIKLCEAALVATDPTYPSLGPNLEATVYKTGSGLCSAFLANIGTNSD 384
R+PKWGHL+DLHKAIKLCE ALV+ DPT SLG N EA V+K+ S +C+AFLAN T
Sbjct: 326 REPKWGHLRDLHKAIKLCEPALVSVDPTVTSLGSNQEAHVFKSKS-VCAAFLANYDTKYS 384
Query: 385 VTVKFNGNSYLLPAWSVSILPDCKNVVFNTAKINS----VTLVPSFSRQSLQVAADSSDA 440
V V F Y LP WSVSILPDCK V+NTA++ S + +VP+ S S Q
Sbjct: 385 VKVTFGNGQYELPPWSVSILPDCKTAVYNTARLGSQSSQMKMVPASSSFSWQ-------- 436
Query: 441 IGSGWSYINEPVGISKDDAFTKPGLLEQINTTADQSDYLWYSLSTNIKADEPLLEDGSKT 500
SY E DD T GL EQIN T D +DYLWY I ADE L+ G
Sbjct: 437 -----SYNEETASADDDDTTTMNGLWEQINVTRDATDYLWYLTDVKIDADEGFLKSGQNP 491
Query: 501 VLHVQSLGHALHAFINGKLVGSGYGSSSNAKVTVDFPIALAPGKNTFDLLSLTVGLQNYG 560
+L + S GHALH FING+L G+ YG SN K+T I L G N LLS+ VGL N G
Sbjct: 492 LLTIFSAGHALHVFINGQLAGTAYGGLSNPKLTFSQNIKLTEGINKISLLSVAVGLPNVG 551
Query: 561 AFYEKTGAGITGPVQLKGSGNGTNIDLSSQQWTYQTGLKGEELNFPSGS---STQWDSKS 617
+E AG+ GP+ LKG GT DLS Q+W+Y+ GLKGE L+ + S S +W S
Sbjct: 552 LHFETWNAGVLGPITLKGLNEGTR-DLSGQKWSYKIGLKGESLSLHTASGSESVEWVEGS 610
Query: 618 TLPKLQPLVWYKTTFDAPAGSEPVAIDFTGMGKGEAWVNGQSIGRYWPTYVSQNGGCTDS 677
L + Q L WYKT FDAP G++P+A+D + MGKG+ W+NGQ+IGR+WP Y++ +G C D
Sbjct: 611 LLAQKQALTWYKTAFDAPQGNDPLALDMSSMGKGQMWINGQNIGRHWPGYIA-HGSCGD- 668
Query: 678 CNYRGAYSSNKCLKNCGKPSQSLYHVPRSWLKSSGNTLVLFEEIGGDPTKISFVTKQLGS 737
CNY G + KC NCG+PSQ YHVPRSWLK SGN L +FEE GGDPT ISFV K+ +
Sbjct: 669 CNYAGTFDDKKCRTNCGEPSQRWYHVPRSWLKPSGNLLAVFEEWGGDPTGISFV-KRTTA 727
Query: 738 SLCSHVTDSHPLPVDMWGSDSKIQRKPGPVLSLECPNPNQVISSIKFASFGTPLGTCGSF 797
S+C+ + + P + S P P L CP Q IS IKFASFG P GTCGSF
Sbjct: 728 SVCADIFEGQPALKNWQAIASGKVISPQPKAHLWCPT-GQKISQIKFASFGMPQGTCGSF 786
Query: 798 SRGRCSSARSLSVVRQACVGSKSCSIGVSVNTF-GDPCKGVMKSLAVEASCT 848
G C + +S + CVG +SCS+ V+ F GDPC K L+VEA C+
Sbjct: 787 REGSCHAHKSYDAFERNCVGKQSCSVTVAPEVFGGDPCPDSAKKLSVEAVCS 838
>gi|350539595|ref|NP_001234465.1| beta-galactosidase precursor [Solanum lycopersicum]
gi|1352077|sp|P48980.1|BGAL_SOLLC RecName: Full=Beta-galactosidase; AltName: Full=Acid
beta-galactosidase; Short=Lactase; AltName:
Full=Exo-(1-->4)-beta-D-galactanase; Flags: Precursor
gi|6649906|gb|AAF21626.1|AF023847_1 beta-galactosidase precursor [Solanum lycopersicum]
gi|971485|emb|CAA58734.1| putative beta-galactosidase/galactanase [Solanum lycopersicum]
gi|4138139|emb|CAA10174.1| ss-galactosidase [Solanum lycopersicum]
Length = 835
Score = 979 bits (2530), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 486/830 (58%), Positives = 587/830 (70%), Gaps = 22/830 (2%)
Query: 25 ANVTYDHRAVVIGGKRRVLISGSIHYPRSTPEMWPDLIQKSKDGGLDVIETYVFWNLHEP 84
A+V+YDH+A+++ G+R++LISGSIHYPRSTPEMWPDLIQK+K+GG+DVI+TYVFWN HEP
Sbjct: 22 ASVSYDHKAIIVNGQRKILISGSIHYPRSTPEMWPDLIQKAKEGGVDVIQTYVFWNGHEP 81
Query: 85 VRNQYNFEGRYDLVKFVKLVAEAGLYAHLRIGPYVCAEWNFGGFPLWLHFIPGIQFRTDN 144
+Y FE RYDLVKF+K+V EAGLY HLRIGPY CAEWNFGGFP+WL ++PGI FRT+N
Sbjct: 82 EEGKYYFEERYDLVKFIKVVQEAGLYVHLRIGPYACAEWNFGGFPVWLKYVPGISFRTNN 141
Query: 145 EPFKAEMQRFTAKIVDMMKQEKLYASQGGPIILSQIENEYGNIDSAYGAAGKSYIKWAAG 204
EPFKA MQ+FT KIVDMMK EKLY +QGGPIILSQIENEYG ++ G GK Y +WAA
Sbjct: 142 EPFKAAMQKFTTKIVDMMKAEKLYETQGGPIILSQIENEYGPMEWELGEPGKVYSEWAAK 201
Query: 205 MALSLDTGVPWVMCQQSDAPDPIINTCNGFYCDQFTPNSNNKPKMWTENWSGWFLSFGGA 264
MA+ L TGVPW+MC+Q D PDPIINTCNGFYCD FTPN NKPKMWTE W+ WF FGG
Sbjct: 202 MAVDLGTGVPWIMCKQDDVPDPIINTCNGFYCDYFTPNKANKPKMWTEAWTAWFTEFGGP 261
Query: 265 VPYRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNFDRTSGGPFISTSYDYDAPLDEYGLI 324
VPYRP ED+AFAVARF Q GG+F NYYMYHGGTNF RTSGGPFI+TSYDYDAPLDE+G +
Sbjct: 262 VPYRPAEDMAFAVARFIQTGGSFINYYMYHGGTNFGRTSGGPFIATSYDYDAPLDEFGSL 321
Query: 325 RQPKWGHLKDLHKAIKLCEAALVATDPTYPSLGPNLEATVYKTGSGLCSAFLANIGTNSD 384
RQPKWGHLKDLH+AIKLCE ALV+ DPT SLG EA V+K+ SG C+AFLAN +S
Sbjct: 322 RQPKWGHLKDLHRAIKLCEPALVSVDPTVTSLGNYQEARVFKSESGACAAFLANYNQHSF 381
Query: 385 VTVKFNGNSYLLPAWSVSILPDCKNVVFNTAKINSVTLVPSFSRQSLQVAADSSDAIGSG 444
V F Y LP WS+SILPDCKN V+NTA++ + QS Q+ + + G
Sbjct: 382 AKVAFGNMHYNLPPWSISILPDCKNTVYNTARVGA---------QSAQMKM-TPVSRGFS 431
Query: 445 WSYINEPVGISKDDAFTKPGLLEQINTTADQSDYLWYSLSTNIKADEPLLEDGSKTVLHV 504
W NE +DD FT GLLEQIN T D SDYLWY I E L G+ L V
Sbjct: 432 WESFNEDAASHEDDTFTVVGLLEQINITRDVSDYLWYMTDIEIDPTEGFLNSGNWPWLTV 491
Query: 505 QSLGHALHAFINGKLVGSGYGSSSNAKVTVDFPIALAPGKNTFDLLSLTVGLQNYGAFYE 564
S GHALH F+NG+L G+ YGS N K+T I L G N LLS+ VGL N G +E
Sbjct: 492 FSAGHALHVFVNGQLAGTVYGSLENPKLTFSNGINLRAGVNKISLLSIAVGLPNVGPHFE 551
Query: 565 KTGAGITGPVQLKGSGNGTNIDLSSQQWTYQTGLKGEELNFPSGS---STQWDSKSTLPK 621
AG+ GPV L G GT DL+ Q+W Y+ GLKGE L+ S S S +W S + +
Sbjct: 552 TWNAGVLGPVSLNGLNEGTR-DLTWQKWFYKVGLKGEALSLHSLSGSPSVEWVEGSLVAQ 610
Query: 622 LQPLVWYKTTFDAPAGSEPVAIDFTGMGKGEAWVNGQSIGRYWPTYVSQNGGCTDSCNYR 681
QPL WYKTTF+AP G+EP+A+D MGKG+ W+NGQS+GR+WP Y S +G C+ CNY
Sbjct: 611 KQPLSWYKTTFNAPDGNEPLALDMNTMGKGQVWINGQSLGRHWPAYKS-SGSCS-VCNYT 668
Query: 682 GAYSSNKCLKNCGKPSQSLYHVPRSWLKSSGNTLVLFEEIGGDPTKISFVTKQLGSSLCS 741
G + KCL NCG+ SQ YHVPRSWL +GN LV+FEE GGDP I+ V +++G S+C+
Sbjct: 669 GWFDEKKCLTNCGEGSQRWYHVPRSWLYPTGNLLVVFEEWGGDPYGITLVKREIG-SVCA 727
Query: 742 HVTDSHPLPVDMWGS--DSKIQRKPGPVLSLECPNPNQVISSIKFASFGTPLGTCGSFSR 799
+ + P ++ W K R P L+C P Q ISSIKFASFGTP G CG+F +
Sbjct: 728 DIYEWQPQLLN-WQRLVSGKFDRPLRPKAHLKCA-PGQKISSIKFASFGTPEGVCGNFQQ 785
Query: 800 GRCSSARSLSVVRQACVGSKSCSIGVSVNTF-GDPCKGVMKSLAVEASCT 848
G C + RS ++ CVG +SCS+ V+ F GDPC+ V+K L+VEA C+
Sbjct: 786 GSCHAPRSYDAFKKNCVGKESCSVQVTPENFGGDPCRNVLKKLSVEAICS 835
>gi|359482511|ref|XP_002279310.2| PREDICTED: beta-galactosidase-like [Vitis vinifera]
Length = 828
Score = 977 bits (2525), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 484/829 (58%), Positives = 585/829 (70%), Gaps = 22/829 (2%)
Query: 26 NVTYDHRAVVIGGKRRVLISGSIHYPRSTPEMWPDLIQKSKDGGLDVIETYVFWNLHEPV 85
NV+YD RA+VI G+RR+LISGSIHYPRS+PEMWPDLIQK+K+GGLDVI+TYVFWN HEP
Sbjct: 16 NVSYDRRAIVINGQRRILISGSIHYPRSSPEMWPDLIQKAKEGGLDVIQTYVFWNGHEPS 75
Query: 86 RNQYNFEGRYDLVKFVKLVAEAGLYAHLRIGPYVCAEWNFGGFPLWLHFIPGIQFRTDNE 145
+ +Y FEGRYDLV+F+KLV +AGLY +LRIGPYVCAEWNFGGFP+WL ++ GI FRT+NE
Sbjct: 76 QGKYYFEGRYDLVRFIKLVKQAGLYVNLRIGPYVCAEWNFGGFPVWLKYVQGINFRTNNE 135
Query: 146 PFKAEMQRFTAKIVDMMKQEKLYASQGGPIILSQIENEYGNIDSAYGAAGKSYIKWAAGM 205
PFK MQRFT KIVDMMK E L+ SQGGPIILSQIENEYG ++ GA G++Y +WAA M
Sbjct: 136 PFKWHMQRFTKKIVDMMKSEGLFESQGGPIILSQIENEYGPMEYEIGAPGRAYTEWAAKM 195
Query: 206 ALSLDTGVPWVMCQQSDAPDPIINTCNGFYCDQFTPNSNNKPKMWTENWSGWFLSFGGAV 265
A+ L TGVPWVMC+Q DAPDPIINTCNGFYCD F+PN KPKMWTE W+GWF FGGAV
Sbjct: 196 AVGLGTGVPWVMCKQDDAPDPIINTCNGFYCDYFSPNKAYKPKMWTEAWTGWFTEFGGAV 255
Query: 266 PYRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNFDRTSGGPFISTSYDYDAPLDEYGLIR 325
P+RP EDLAF+VARF Q+GG+F NYYMYHGGTNF RT+GGPFI+TSYDYDAPLDE+GL+R
Sbjct: 256 PHRPAEDLAFSVARFIQKGGSFINYYMYHGGTNFGRTAGGPFIATSYDYDAPLDEFGLLR 315
Query: 326 QPKWGHLKDLHKAIKLCEAALVATDPTYPSLGPNLEATVYKTGSGLCSAFLANIGTNSDV 385
QPKWGHLKDLH+AIKLCE AL++ DPT SLG EA V+ + SG C+AFLAN S
Sbjct: 316 QPKWGHLKDLHRAIKLCEPALISGDPTVTSLGNYEEAHVFHSKSGACAAFLANYNPRSYA 375
Query: 386 TVKFNGNSYLLPAWSVSILPDCKNVVFNTAKINSVTLVPSFSRQSLQVAADSSDAIGSGW 445
V F Y LP WS+SILPDCKN V+NTA++ + + + S + GW
Sbjct: 376 KVSFRNMHYNLPPWSISILPDCKNTVYNTARLGAQSATMKMTPVSGRF----------GW 425
Query: 446 SYINEPVGISKDDAFTKPGLLEQINTTADQSDYLWYSLSTNIKADEPLLEDGSKTVLHVQ 505
NE D +F GLLEQINTT D SDYLWYS I +E L+ G VL V
Sbjct: 426 QSYNEETASYDDSSFAAVGLLEQINTTRDVSDYLWYSTDVKIGYNEGFLKSGRYPVLTVL 485
Query: 506 SLGHALHAFINGKLVGSGYGSSSNAKVTVDFPIALAPGKNTFDLLSLTVGLQNYGAFYEK 565
S GHALH FING+L G+ YGS N K+T + L G NT LLS+ VGL N G +E
Sbjct: 486 SAGHALHVFINGRLSGTAYGSLENPKLTFSQGVKLRAGVNTIALLSIAVGLPNVGPHFET 545
Query: 566 TGAGITGPVQLKGSGNGTNIDLSSQQWTYQTGLKGEE---LNFPSGSSTQWDSKSTLPKL 622
AG+ GPV L G G DLS Q+W+Y+ GLKGE + SS +W S + +
Sbjct: 546 WNAGVLGPVSLNGLNEGRR-DLSWQKWSYKVGLKGEALSLHSLSGSSSVEWVEGSLMARG 604
Query: 623 QPLVWYKTTFDAPAGSEPVAIDFTGMGKGEAWVNGQSIGRYWPTYVSQNGGCTDSCNYRG 682
QPL WYKTTF+AP G+ P+A+D MGKG+ W+NGQ++GRYWP Y + GGC D CNY G
Sbjct: 605 QPLTWYKTTFNAPGGNTPLALDMGSMGKGQIWINGQNVGRYWPAYKA-TGGCGD-CNYAG 662
Query: 683 AYSSNKCLKNCGKPSQSLYHVPRSWLKSSGNTLVLFEEIGGDPTKISFVTKQLGSSLCSH 742
YS KCL NCG+PSQ YHVP SWL +GN LV+FEE GG+P IS V +++ S+C+
Sbjct: 663 TYSEKKCLSNCGEPSQRWYHVPHSWLSPTGNLLVVFEESGGNPAGISLVEREI-ESVCAD 721
Query: 743 VTDSHP--LPVDMWGSDSKIQRKPGPVLSLECPNPNQVISSIKFASFGTPLGTCGSFSRG 800
+ + P + +M S K+ + P L C P Q ISSIKFASFGTP G CGS+ G
Sbjct: 722 IYEWQPTLMNYEMQAS-GKVNKPLRPKAHLWCA-PGQKISSIKFASFGTPEGVCGSYREG 779
Query: 801 RCSSARSLSVVRQACVGSKSCSIGVSVNTF-GDPCKGVMKSLAVEASCT 848
C + +S ++C+G SCS+ V+ F GDPC VMK L+VEA C+
Sbjct: 780 SCHAHKSYDAFERSCIGMNSCSVTVAPEIFGGDPCPSVMKKLSVEAICS 828
>gi|297743077|emb|CBI35944.3| unnamed protein product [Vitis vinifera]
Length = 841
Score = 977 bits (2525), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 484/830 (58%), Positives = 586/830 (70%), Gaps = 22/830 (2%)
Query: 25 ANVTYDHRAVVIGGKRRVLISGSIHYPRSTPEMWPDLIQKSKDGGLDVIETYVFWNLHEP 84
A+V+YD RA+VI G+RR+LISGSIHYPRS+PEMWPDLIQK+K+GGLDVI+TYVFWN HEP
Sbjct: 28 ASVSYDRRAIVINGQRRILISGSIHYPRSSPEMWPDLIQKAKEGGLDVIQTYVFWNGHEP 87
Query: 85 VRNQYNFEGRYDLVKFVKLVAEAGLYAHLRIGPYVCAEWNFGGFPLWLHFIPGIQFRTDN 144
+ +Y FEGRYDLV+F+KLV +AGLY +LRIGPYVCAEWNFGGFP+WL ++ GI FRT+N
Sbjct: 88 SQGKYYFEGRYDLVRFIKLVKQAGLYVNLRIGPYVCAEWNFGGFPVWLKYVQGINFRTNN 147
Query: 145 EPFKAEMQRFTAKIVDMMKQEKLYASQGGPIILSQIENEYGNIDSAYGAAGKSYIKWAAG 204
EPFK MQRFT KIVDMMK E L+ SQGGPIILSQIENEYG ++ GA G++Y +WAA
Sbjct: 148 EPFKWHMQRFTKKIVDMMKSEGLFESQGGPIILSQIENEYGPMEYEIGAPGRAYTEWAAK 207
Query: 205 MALSLDTGVPWVMCQQSDAPDPIINTCNGFYCDQFTPNSNNKPKMWTENWSGWFLSFGGA 264
MA+ L TGVPWVMC+Q DAPDPIINTCNGFYCD F+PN KPKMWTE W+GWF FGGA
Sbjct: 208 MAVGLGTGVPWVMCKQDDAPDPIINTCNGFYCDYFSPNKAYKPKMWTEAWTGWFTEFGGA 267
Query: 265 VPYRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNFDRTSGGPFISTSYDYDAPLDEYGLI 324
VP+RP EDLAF+VARF Q+GG+F NYYMYHGGTNF RT+GGPFI+TSYDYDAPLDE+GL+
Sbjct: 268 VPHRPAEDLAFSVARFIQKGGSFINYYMYHGGTNFGRTAGGPFIATSYDYDAPLDEFGLL 327
Query: 325 RQPKWGHLKDLHKAIKLCEAALVATDPTYPSLGPNLEATVYKTGSGLCSAFLANIGTNSD 384
RQPKWGHLKDLH+AIKLCE AL++ DPT SLG EA V+ + SG C+AFLAN S
Sbjct: 328 RQPKWGHLKDLHRAIKLCEPALISGDPTVTSLGNYEEAHVFHSKSGACAAFLANYNPRSY 387
Query: 385 VTVKFNGNSYLLPAWSVSILPDCKNVVFNTAKINSVTLVPSFSRQSLQVAADSSDAIGSG 444
V F Y LP WS+SILPDCKN V+NTA++ + + + S + G
Sbjct: 388 AKVSFRNMHYNLPPWSISILPDCKNTVYNTARLGAQSATMKMTPVSGRF----------G 437
Query: 445 WSYINEPVGISKDDAFTKPGLLEQINTTADQSDYLWYSLSTNIKADEPLLEDGSKTVLHV 504
W NE D +F GLLEQINTT D SDYLWYS I +E L+ G VL V
Sbjct: 438 WQSYNEETASYDDSSFAAVGLLEQINTTRDVSDYLWYSTDVKIGYNEGFLKSGRYPVLTV 497
Query: 505 QSLGHALHAFINGKLVGSGYGSSSNAKVTVDFPIALAPGKNTFDLLSLTVGLQNYGAFYE 564
S GHALH FING+L G+ YGS N K+T + L G NT LLS+ VGL N G +E
Sbjct: 498 LSAGHALHVFINGRLSGTAYGSLENPKLTFSQGVKLRAGVNTIALLSIAVGLPNVGPHFE 557
Query: 565 KTGAGITGPVQLKGSGNGTNIDLSSQQWTYQTGLKGEE---LNFPSGSSTQWDSKSTLPK 621
AG+ GPV L G G DLS Q+W+Y+ GLKGE + SS +W S + +
Sbjct: 558 TWNAGVLGPVSLNGLNEGRR-DLSWQKWSYKVGLKGEALSLHSLSGSSSVEWVEGSLMAR 616
Query: 622 LQPLVWYKTTFDAPAGSEPVAIDFTGMGKGEAWVNGQSIGRYWPTYVSQNGGCTDSCNYR 681
QPL WYKTTF+AP G+ P+A+D MGKG+ W+NGQ++GRYWP Y + GGC D CNY
Sbjct: 617 GQPLTWYKTTFNAPGGNTPLALDMGSMGKGQIWINGQNVGRYWPAYKA-TGGCGD-CNYA 674
Query: 682 GAYSSNKCLKNCGKPSQSLYHVPRSWLKSSGNTLVLFEEIGGDPTKISFVTKQLGSSLCS 741
G YS KCL NCG+PSQ YHVP SWL +GN LV+FEE GG+P IS V +++ S+C+
Sbjct: 675 GTYSEKKCLSNCGEPSQRWYHVPHSWLSPTGNLLVVFEESGGNPAGISLVEREI-ESVCA 733
Query: 742 HVTDSHP--LPVDMWGSDSKIQRKPGPVLSLECPNPNQVISSIKFASFGTPLGTCGSFSR 799
+ + P + +M S K+ + P L C P Q ISSIKFASFGTP G CGS+
Sbjct: 734 DIYEWQPTLMNYEMQAS-GKVNKPLRPKAHLWCA-PGQKISSIKFASFGTPEGVCGSYRE 791
Query: 800 GRCSSARSLSVVRQACVGSKSCSIGVSVNTF-GDPCKGVMKSLAVEASCT 848
G C + +S ++C+G SCS+ V+ F GDPC VMK L+VEA C+
Sbjct: 792 GSCHAHKSYDAFERSCIGMNSCSVTVAPEIFGGDPCPSVMKKLSVEAICS 841
>gi|168001886|ref|XP_001753645.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162695052|gb|EDQ81397.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 929
Score = 976 bits (2523), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 491/865 (56%), Positives = 614/865 (70%), Gaps = 41/865 (4%)
Query: 15 FVVLATTSFGANVTYDHRAVVIGGKRRVLISGSIHYPRSTPEMWPDLIQKSKDGGLDVIE 74
V +A+ NVTYD RA++I G+RR+LIS IHYPR+TPEMWP L+QKSK+GG DV++
Sbjct: 23 IVPIASARKPINVTYDQRALIINGQRRMLISAGIHYPRATPEMWPSLVQKSKEGGADVVQ 82
Query: 75 TYVFWNLHEPVRNQYNFEGRYDLVKFVKLVAEAGLYAHLRIGPYVCAEWNFGGFPLWLHF 134
+YVFWN HEP + QYNFEGRYDLVKF+K+V +AGLY HLRIGPYVCAEWNFGGFP WL
Sbjct: 83 SYVFWNGHEPKQGQYNFEGRYDLVKFIKVVQQAGLYFHLRIGPYVCAEWNFGGFPYWLKD 142
Query: 135 IPGIQFRTDNEPFKAEMQRFTAKIVDMMKQEKLYASQGGPIILSQIENEYGNIDSAYGAA 194
IPGI FRTDNEPFK M+ F +KIV++MK+ +L+A QGGPII++QIENEYGNI+ A+G
Sbjct: 143 IPGIVFRTDNEPFKVAMEGFVSKIVNLMKENQLFAWQGGPIIMAQIENEYGNIEWAFGDG 202
Query: 195 GKSYIKWAAGMALSLDTGVPWVMCQQSDAPDPIINTCNGFYCDQFTPNSNNKPKMWTENW 254
GK Y WAA +AL LD GVPWVMCQQ DAP IINTCNG+YCD F N+ KP WTE+W
Sbjct: 203 GKRYAMWAAELALGLDAGVPWVMCQQDDAPGNIINTCNGYYCDGFKANTATKPAFWTEDW 262
Query: 255 SGWFLSFGGAVPYRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNFDRTSGGPFISTSYDY 314
+GWF +G +VP+RPVED AFA+ARFFQRGG+FQNYYMY GGTNF RT+GGPF++TSYDY
Sbjct: 263 NGWFQYWGQSVPHRPVEDNAFAIARFFQRGGSFQNYYMYFGGTNFARTAGGPFMTTSYDY 322
Query: 315 DAPLDEYGLIRQPKWGHLKDLHKAIKLCEAALVATD--PTYPSLGPNLEATVYKTGSGLC 372
DAPLDEYGLIRQPKWGHL+DLH AIKLCE AL A D P LGPN+EA VY +G G C
Sbjct: 323 DAPLDEYGLIRQPKWGHLRDLHAAIKLCEPALTAVDEVPLSTWLGPNVEAHVY-SGRGQC 381
Query: 373 SAFLANIGTNSDVTVKFNGNSYLLPAWSVSILPDCKNVVFNTAKINSVTLVPSF----SR 428
+AFLANI + TV+F G +Y+LP WSVSILPDCKNVVFNTA++ + T + S+
Sbjct: 382 AAFLANIDSWKIATVQFKGKAYVLPPWSVSILPDCKNVVFNTAQVGAQTTLTRMTIVRSK 441
Query: 429 QSLQVAADSS--------DAIGSG--WSYINEPVGISKDDAFTKPGLLEQINTTADQSDY 478
+V S+ +GSG W EPVGI LLEQ+N T D +DY
Sbjct: 442 LEGEVVMPSNMLRKHAPESIVGSGLKWEASVEPVGIRGAATLVSNRLLEQLNITKDSTDY 501
Query: 479 LWYSLS--TNIKADEPLLEDGSKTVLHVQSLGHALHAFINGKLVGSGYGSSSNAKVTVDF 536
LWYS+S +++A L + S+ +L + S+ A+H F+N +LVGS GS V V
Sbjct: 502 LWYSISIKVSVEAVTALSKTKSQAILVLGSMRDAVHIFVNRQLVGSAMGSD----VQVVQ 557
Query: 537 PIALAPGKNTFDLLSLTVGLQNYGAFYEKTGAGITGPVQLKGSGNGTNIDLSSQQWTYQT 596
P+ L GKN DLLS+TVGLQNYGA+ E GAGI G L+G +G +DLS+++W+YQ
Sbjct: 558 PVPLKEGKNDIDLLSMTVGLQNYGAYLETWGAGIRGSALLRGLPSGV-LDLSTERWSYQV 616
Query: 597 GLKGEELN-FPSGSS--TQWDSKSTLPKLQPLVWYKTTFDAPAGSEPVAIDFTGMGKGEA 653
G++GEE F +G++ QWDS S+ P L WYKTTFDAP G++PVA+D MGKG+A
Sbjct: 617 GIQGEEKRLFETGTADGIQWDSSSSFPNASALTWYKTTFDAPKGTDPVALDLGSMGKGQA 676
Query: 654 WVNGQSIGRYWPTYVSQNGGCTDSCNYRGAYSSNKCLKNCGKPSQ-----SLYHVPRSWL 708
WVNG +GRYWP+ ++ GC+ +C+YRGAY ++KC NCGKPSQ +YH+PR+WL
Sbjct: 677 WVNGHHMGRYWPSVLASQSGCS-TCDYRGAYDADKCRTNCGKPSQRWQYVDMYHIPRAWL 735
Query: 709 KSSGNTLVLFEEIGGDPTKISFVTKQLGSSLCSHVTDSHPLPVDMWGSDSKIQ---RKPG 765
+ S N LVLFEEIGGD +K+S VT+ ++C+HV +S P PV W ++S + + G
Sbjct: 736 QLSNNLLVLFEEIGGDVSKVSLVTRS-APAVCTHVHESQPPPVLFWPANSSMDAMSSRSG 794
Query: 766 PVLSLECPNPNQVISSIKFASFGTPLGTCGSFSRGRCSSARSLSVVRQACVGSKSCSIGV 825
+ LEC Q I IKFASFG P G+CG+F RG C + +SL V R+AC+G CSI V
Sbjct: 795 EAV-LECI-AGQHIRHIKFASFGNPKGSCGNFQRGTCHAMKSLEVARKACMGMHRCSIPV 852
Query: 826 SVNTFG--DPCKGVMKSLAVEASCT 848
TFG DPC V KSLAV+ C+
Sbjct: 853 QWQTFGEFDPCPDVSKSLAVQVFCS 877
>gi|57232107|gb|AAW47739.1| beta-galactosidase [Prunus persica]
Length = 853
Score = 976 bits (2523), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 489/850 (57%), Positives = 593/850 (69%), Gaps = 30/850 (3%)
Query: 6 ILLLVLCWGFVVLATTSFGANVTYDHRAVVIGGKRRVLISGSIHYPRSTPEMWPDLIQKS 65
L LV GF ++ T VTYD RA+VI G+RR+LISGSIHYPRSTPEMW DLIQK+
Sbjct: 12 FLGLVCFLGFQLVQCT-----VTYDRRAIVINGQRRILISGSIHYPRSTPEMWEDLIQKA 66
Query: 66 KDGGLDVIETYVFWNLHEPVRNQYNFEGRYDLVKFVKLVAEAGLYAHLRIGPYVCAEWNF 125
KDGGLDV+ETYVFWN+HEP YNF+GRYDLV+F+K + +AGLYAHLRIGPYVCAEWNF
Sbjct: 67 KDGGLDVVETYVFWNVHEPSPGNYNFKGRYDLVRFLKTIQKAGLYAHLRIGPYVCAEWNF 126
Query: 126 GGFPLWLHFIPGIQFRTDNEPFKAEMQRFTAKIVDMMKQEKLYASQGGPIILSQIENEYG 185
GGFP+WL ++PGI FRTDNEPFK MQ FT KIV +MK EKL+ SQGGPIILSQIENEYG
Sbjct: 127 GGFPVWLKYVPGISFRTDNEPFKRAMQGFTEKIVGLMKSEKLFESQGGPIILSQIENEYG 186
Query: 186 NIDSAYGAAGKSYIKWAAGMALSLDTGVPWVMCQQSDAPDPIINTCNGFYCDQFTPNSNN 245
+GAAG +Y+ WAA MA+ L TGVPWVMC++ DAPDP+INTCNGFYCD F PN
Sbjct: 187 AQSKLFGAAGHNYMTWAANMAVGLGTGVPWVMCKEEDAPDPVINTCNGFYCDSFAPNKPY 246
Query: 246 KPKMWTENWSGWFLSFGGAVPYRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNFDRTSGG 305
KP +WTE WSGWF FGG + RPV+DLA+AVARF Q+GG+F NYYMYHGGTNF RT+GG
Sbjct: 247 KPTIWTEAWSGWFSEFGGPIHQRPVQDLAYAVARFIQKGGSFVNYYMYHGGTNFGRTAGG 306
Query: 306 PFISTSYDYDAPLDEYGLIRQPKWGHLKDLHKAIKLCEAALVATDPTYPSLGPNLEATVY 365
PFI+TSYDYDAPLDEYGLIRQPK+GHLK+LH+AIK+CE ALV+ DP SLG +A VY
Sbjct: 307 PFITTSYDYDAPLDEYGLIRQPKYGHLKELHRAIKMCERALVSADPIITSLGNFQQAYVY 366
Query: 366 KTGSGLCSAFLANIGTNSDVTVKFNGNSYLLPAWSVSILPDCKNVVFNTAKINSVTLVPS 425
+ SG CSAFL+N + S V FN Y LP WS+SILPDC+NVVFNTAK+
Sbjct: 367 TSESGDCSAFLSNHDSKSAARVMFNNMHYNLPPWSISILPDCRNVVFNTAKVGV------ 420
Query: 426 FSRQSLQVAADSSDAIGSGWSYINEPVGISKDDA--FTKPGLLEQINTTADQSDYLWYSL 483
Q+ Q+ ++ W +E + S DD+ T PGLLEQIN T D +DYLWY
Sbjct: 421 ---QTSQMGMLPTNIQMLSWESYDEDI-TSLDDSSTITAPGLLEQINVTRDSTDYLWYKT 476
Query: 484 STNIKADEPLLEDGSKTVLHVQSLGHALHAFINGKLVGSGYGSSSNAKVTVDFPIALAPG 543
S +I + E L G L VQS GHA+H FING+L GS +G+ + + T + L G
Sbjct: 477 SVDIGSSESFLRGGELPTLIVQSTGHAVHIFINGQLSGSSFGTRESRRFTYTGKVNLHAG 536
Query: 544 KNTFDLLSLTVGLQNYGAFYEKTGAGITGPVQLKGSGNGTNIDLSSQQWTYQTGLKGEEL 603
N LLS+ VGL N G +E GI GPV L G G DLS Q+WTYQ GLKGE +
Sbjct: 537 TNRIALLSVAVGLPNVGGHFEAWNTGILGPVALHGLDQG-KWDLSWQKWTYQVGLKGEAM 595
Query: 604 NFPSG---SSTQWDSKS-TLPKLQPLVWYKTTFDAPAGSEPVAIDFTGMGKGEAWVNGQS 659
N S SS W S K QPL W+KT F+AP G EP+A+D GMGKG+ W+NGQS
Sbjct: 596 NLVSPNSISSVDWMRGSLAAQKQQPLTWHKTLFNAPEGDEPLALDMEGMGKGQIWINGQS 655
Query: 660 IGRYWPTYVSQNGGCTDSCNYRGAYSSNKCLKNCGKPSQSLYHVPRSWLKSSGNTLVLFE 719
IGRYW + NG C + C+Y G + KC CG+P+Q +YHVPRSWLK N LV+FE
Sbjct: 656 IGRYWTAFA--NGNC-NGCSYAGGFRPPKCQVGCGQPTQRVYHVPRSWLKPMQNLLVIFE 712
Query: 720 EIGGDPTKISFVTKQLGSSLCSHVTDSHPLPVDMWGSDS--KIQRKPGPVLSLECPNPNQ 777
E GGDP++IS V + + SS+C+ V + HP + W +S K + P + L C NP Q
Sbjct: 713 EFGGDPSRISLVKRSV-SSVCAEVAEYHPT-IKNWHIESYGKAEDFHSPKVHLRC-NPGQ 769
Query: 778 VISSIKFASFGTPLGTCGSFSRGRCSSARSLSVVRQACVGSKSCSIGVSVNTFGDPCKGV 837
ISSIKFASFGTPLGTCGS+ G C +A S SV+++ C+G + C++ +S + FGDPC V
Sbjct: 770 AISSIKFASFGTPLGTCGSYQEGTCHAATSYSVLQKKCIGKQRCAVTISNSNFGDPCPKV 829
Query: 838 MKSLAVEASC 847
+K L+VEA C
Sbjct: 830 LKRLSVEAVC 839
>gi|114217395|dbj|BAF31233.1| beta-D-galactosidase [Persea americana]
Length = 849
Score = 976 bits (2522), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 482/827 (58%), Positives = 582/827 (70%), Gaps = 21/827 (2%)
Query: 26 NVTYDHRAVVIGGKRRVLISGSIHYPRSTPEMWPDLIQKSKDGGLDVIETYVFWNLHEPV 85
+V+YDH+A++I G+RR+LISGSIHYPRSTPEMWPDLIQK+KDGGLDVI+TYVFWN HEP
Sbjct: 38 SVSYDHKAIIINGQRRILISGSIHYPRSTPEMWPDLIQKAKDGGLDVIQTYVFWNGHEPS 97
Query: 86 RNQYNFEGRYDLVKFVKLVAEAGLYAHLRIGPYVCAEWNFGGFPLWLHFIPGIQFRTDNE 145
+Y FEGRYDLVKF+KLV EAGLY HLRIGPY CAEWNFGGFP+WL +IPGI FRTDNE
Sbjct: 98 PGEYYFEGRYDLVKFIKLVKEAGLYVHLRIGPYACAEWNFGGFPVWLKYIPGISFRTDNE 157
Query: 146 PFKAEMQRFTAKIVDMMKQEKLYASQGGPIILSQIENEYGNIDSAYGAAGKSYIKWAAGM 205
PFK M FT KIVDMMK+E+L+ +QGGPIILSQIENEYG ++ GA G++Y KWAA M
Sbjct: 158 PFKTAMAGFTKKIVDMMKEEELFETQGGPIILSQIENEYGPVEWEIGAPGQAYTKWAANM 217
Query: 206 ALSLDTGVPWVMCQQSDAPDPIINTCNGFYCDQFTPNSNNKPKMWTENWSGWFLSFGGAV 265
A+ L TGVPWVMC+Q DAPDPIINTCN YCD F+PN N KP MWTE W+ WF +FGG V
Sbjct: 218 AVGLGTGVPWVMCKQDDAPDPIINTCNDHYCDWFSPNKNYKPTMWTEAWTSWFTAFGGPV 277
Query: 266 PYRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNFDRTSGGPFISTSYDYDAPLDEYGLIR 325
PYRP ED+AFA+A+F QRGG+F NYYMYHGGTNF RT+GGPF++TSYDYDAP+DEYGLIR
Sbjct: 278 PYRPAEDMAFAIAKFIQRGGSFINYYMYHGGTNFGRTAGGPFVATSYDYDAPIDEYGLIR 337
Query: 326 QPKWGHLKDLHKAIKLCEAALVATDPTYPSLGPNLEATVYKTGSGLCSAFLANIGTNSDV 385
QPKWGHLKDLHKAIK+CEAALV+ DP SLG + E+ V+K+ SG C+AFLAN S
Sbjct: 338 QPKWGHLKDLHKAIKMCEAALVSGDPIVTSLGSSQESHVFKSESGDCAAFLANYDEKSFA 397
Query: 386 TVKFNGNSYLLPAWSVSILPDCKNVVFNTAKINSVTLVPSFSRQSLQVAADSSDAIGSGW 445
V F G Y LP WS+SILPDC N VFNTA++ + Q+ + S + G W
Sbjct: 398 KVAFQGMHYNLPPWSISILPDCVNTVFNTARVGA---------QTSSMTMTSVNPDGFSW 448
Query: 446 SYINEPVGISKDDAFTKPGLLEQINTTADQSDYLWYSLSTNIKADEPLLEDGSKTVLHVQ 505
NE D + T GLLEQIN T D +DYLWY+ I +E L++G VL V
Sbjct: 449 ETYNEETASYDDASITMEGLLEQINVTRDVTDYLWYTTDITIDPNEGFLKNGEYPVLTVM 508
Query: 506 SLGHALHAFINGKLVGSGYGSSSNAKVTVDFPIALAPGKNTFDLLSLTVGLQNYGAFYEK 565
S GHALH FING+L G+ YGS N K+T + L G N +LS+ VGL N GA +E
Sbjct: 509 SAGHALHIFINGELSGTVYGSVDNPKLTYTGSVKLLAGNNKISVLSIAVGLPNIGAHFET 568
Query: 566 TGAGITGPVQLKGSGNGTNIDLSSQQWTYQTGLKGEELNFPS---GSSTQWDSKSTLPKL 622
G+ GPV L G G DLS Q W+Y+ GLKGE L S SS +W S + +
Sbjct: 569 WNTGVLGPVVLNGLNEGRR-DLSWQNWSYKIGLKGEALQLHSLTGSSSVEW--SSLIAQK 625
Query: 623 QPLVWYKTTFDAPAGSEPVAIDFTGMGKGEAWVNGQSIGRYWPTYVSQNGGCTDSCNYRG 682
QPL WYKTTF+AP G+ P A+D + MGKG+ W+NGQSIGRYWP Y + G C + C+Y G
Sbjct: 626 QPLTWYKTTFNAPEGNGPFALDMSMMGKGQIWINGQSIGRYWPAYKAY-GNCGE-CSYTG 683
Query: 683 AYSSNKCLKNCGKPSQSLYHVPRSWLKSSGNTLVLFEEIGGDPTKISFVTKQLGSSLCSH 742
Y+ KCL NCG+ SQ YHVP SWL + N LV+FEE GGDPT IS V + GS+ C+
Sbjct: 684 RYNEKKCLANCGEASQRWYHVPSSWLYPTANLLVVFEEWGGDPTGISLVRRTTGSA-CAF 742
Query: 743 VTDSHPLPVDMWGSD-SKIQRKPGPVLSLECPNPNQVISSIKFASFGTPLGTCGSFSRGR 801
+++ HP D + +R P L C + Q ISSIKFASFGTP G CG+F+ G
Sbjct: 743 ISEWHPTLRKWHIKDYGRAERPRRPKAHLSCAD-GQKISSIKFASFGTPQGVCGNFTEGS 801
Query: 802 CSSARSLSVVRQACVGSKSCSIGVSVNTF-GDPCKGVMKSLAVEASC 847
C + +S + + CVG + CS+ +S + F GDPC VMK+LAVEA C
Sbjct: 802 CHAHKSYDIFEKNCVGQQWCSVTISPDVFGGDPCPNVMKNLAVEAIC 848
>gi|183238710|gb|ACC60981.1| beta-galactosidase 1 precursor [Petunia x hybrida]
Length = 842
Score = 975 bits (2520), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 480/855 (56%), Positives = 597/855 (69%), Gaps = 25/855 (2%)
Query: 1 MASKEILLLVLCWGFVVLATTSFG--ANVTYDHRAVVIGGKRRVLISGSIHYPRSTPEMW 58
M S+ ++ VL V+L++ F A+V+YDH+A+++ G+RR+LISGSIHYPRSTPEMW
Sbjct: 6 MVSRLVMWNVL---LVLLSSCVFSGLASVSYDHKAIIVNGQRRILISGSIHYPRSTPEMW 62
Query: 59 PDLIQKSKDGGLDVIETYVFWNLHEPVRNQYNFEGRYDLVKFVKLVAEAGLYAHLRIGPY 118
PDLIQK+K+GG+DVI+TYVFWN HEP + +Y FE RYDLVKF+KLV +AGLY +LR+GPY
Sbjct: 63 PDLIQKAKEGGVDVIQTYVFWNGHEPEQGKYYFEERYDLVKFIKLVHQAGLYVNLRVGPY 122
Query: 119 VCAEWNFGGFPLWLHFIPGIQFRTDNEPFKAEMQRFTAKIVDMMKQEKLYASQGGPIILS 178
CAEWNFGGFP+WL ++PGI FRTDNEPFKA MQ+FT KIV+MMK E+LY SQGGPIILS
Sbjct: 123 ACAEWNFGGFPVWLKYVPGISFRTDNEPFKAAMQKFTTKIVNMMKAERLYESQGGPIILS 182
Query: 179 QIENEYGNIDSAYGAAGKSYIKWAAGMALSLDTGVPWVMCQQSDAPDPIINTCNGFYCDQ 238
QIENEYG ++ +G GKSY +WAA MAL L TGVPW+MC+Q DAPDP+INTCNGFYCD
Sbjct: 183 QIENEYGPLEVRFGEQGKSYAEWAAKMALDLGTGVPWLMCKQDDAPDPVINTCNGFYCDY 242
Query: 239 FTPNSNNKPKMWTENWSGWFLSFGGAVPYRPVEDLAFAVARFFQRGGTFQNYYMYHGGTN 298
F PN KPK+WTE W+ WF FG VPYRPVEDLAF VA F Q GG+F NYYMYHGGTN
Sbjct: 243 FYPNKAYKPKIWTEAWTAWFTEFGSPVPYRPVEDLAFGVANFIQTGGSFINYYMYHGGTN 302
Query: 299 FDRTSGGPFISTSYDYDAPLDEYGLIRQPKWGHLKDLHKAIKLCEAALVATDPTYPSLGP 358
F RT+GGPF++TSYDYDAPLDE+GL+RQPKWGHLKDLH+AIKLCE ALV+ DPT +LG
Sbjct: 303 FGRTAGGPFVATSYDYDAPLDEFGLLRQPKWGHLKDLHRAIKLCEPALVSGDPTVTALGN 362
Query: 359 NLEATVYKTGSGLCSAFLANIGTNSDVTVKFNGNSYLLPAWSVSILPDCKNVVFNTAKIN 418
+A V+++ SG C+AFLAN NS TV F Y LP WS+SILPDCK+ V+NTA++
Sbjct: 363 YQKAHVFRSTSGACAAFLANNDPNSFATVAFGNKHYNLPPWSISILPDCKHTVYNTARVG 422
Query: 419 SVTLVPSFSRQSLQVAADSSDAIGSGWSYINEPVGISKDDAFTKPGLLEQINTTADQSDY 478
++ +L +++ G W N+ D+AFT GLLEQ+NTT D SDY
Sbjct: 423 --------AQSALMKMTPANE--GYSWQSYNDQTAFYDDNAFTVVGLLEQLNTTRDVSDY 472
Query: 479 LWYSLSTNIKADEPLLEDGSKTVLHVQSLGHALHAFINGKLVGSGYGSSSNAKVTVDFPI 538
LWY I E L G+ L V S G ALH F+NG+L G+ YGS K+T +
Sbjct: 473 LWYMTDVKIDPSEGFLRSGNWPWLTVSSAGDALHVFVNGQLAGTVYGSLKKQKITFSKAV 532
Query: 539 ALAPGKNTFDLLSLTVGLQNYGAFYEKTGAGITGPVQLKGSGNGTNIDLSSQQWTYQTGL 598
L G N LLS+ VGL N G +E G+ GPV L G G DL+ Q+W+Y+ GL
Sbjct: 533 NLRAGVNKISLLSIAVGLPNIGPHFETWNTGVLGPVSLSGLDEGKR-DLTWQKWSYKVGL 591
Query: 599 KGEELNFPS---GSSTQWDSKSTLPKLQPLVWYKTTFDAPAGSEPVAIDFTGMGKGEAWV 655
KGE LN S SS +W S + + QPL WYKTTF+APAG+EP+A+D MGKG+ W+
Sbjct: 592 KGEALNLHSLSGSSSVEWVEGSLVAQRQPLTWYKTTFNAPAGNEPLALDMNSMGKGQVWI 651
Query: 656 NGQSIGRYWPTYVSQNGGCTDSCNYRGAYSSNKCLKNCGKPSQSLYHVPRSWLKSSGNTL 715
NGQSIGRYWP Y + G D+CNY G ++ KCL NCG SQ YHVPRSWL +GN L
Sbjct: 652 NGQSIGRYWPGY--KASGTCDACNYAGPFNEKKCLSNCGDASQRWYHVPRSWLHPTGNLL 709
Query: 716 VLFEEIGGDPTKISFVTKQLGSSLCSHVTDSHPLPVD-MWGSDSKIQRKPGPVLSLECPN 774
V+FEE GGDP IS V ++L +S+C+ + + P V+ + K+ + P L C +
Sbjct: 710 VVFEEWGGDPNGISLVKREL-ASVCADINEWQPQLVNWQLQASGKVDKPLRPKAHLSCTS 768
Query: 775 PNQVISSIKFASFGTPLGTCGSFSRGRCSSARSLSVVRQACVGSKSCSIGVSVNTF-GDP 833
Q I+SIKFASFGTP G CGSFS G C + S + C+G +SC++ V+ F GDP
Sbjct: 769 -GQKITSIKFASFGTPQGVCGSFSEGSCHAHHSYDAFEKYCIGQESCTVPVTPEIFGGDP 827
Query: 834 CKGVMKSLAVEASCT 848
C VMK L+VEA C+
Sbjct: 828 CPSVMKKLSVEAVCS 842
>gi|356564794|ref|XP_003550633.1| PREDICTED: beta-galactosidase-like [Glycine max]
Length = 839
Score = 974 bits (2517), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 478/842 (56%), Positives = 592/842 (70%), Gaps = 23/842 (2%)
Query: 11 LCWGFVVLATTSFGANVTYDHRAVVIGGKRRVLISGSIHYPRSTPEMWPDLIQKSKDGGL 70
C +VL + A+VTYDH+A+V+ G+RR+LISGSIHYPRSTPEMWPDLIQK+KDGGL
Sbjct: 15 FCTLLLVLWVCAVTASVTYDHKAIVVNGQRRILISGSIHYPRSTPEMWPDLIQKAKDGGL 74
Query: 71 DVIETYVFWNLHEPVRNQYNFEGRYDLVKFVKLVAEAGLYAHLRIGPYVCAEWNFGGFPL 130
DVI+TYVFWN HEP +Y FE RYDLVKF+KLV +AGLY HLRIGPY+CAEWNFGGFP+
Sbjct: 75 DVIQTYVFWNGHEPSPGKYYFEDRYDLVKFIKLVQQAGLYVHLRIGPYICAEWNFGGFPV 134
Query: 131 WLHFIPGIQFRTDNEPFKAEMQRFTAKIVDMMKQEKLYASQGGPIILSQIENEYGNIDSA 190
WL ++PGI FRTDNEPFKA MQ+FT KIV +MK+EKL+ +QGGPII+SQIENEYG ++
Sbjct: 135 WLKYVPGIAFRTDNEPFKAAMQKFTEKIVSIMKEEKLFQTQGGPIIMSQIENEYGPVEWE 194
Query: 191 YGAAGKSYIKWAAGMALSLDTGVPWVMCQQSDAPDPIINTCNGFYCDQFTPNSNNKPKMW 250
GA GK+Y KW + MA+ LDTGVPW+MC+Q D PDP+I+TCNG+YC+ FTPN KPKMW
Sbjct: 195 IGAPGKAYTKWFSQMAVGLDTGVPWIMCKQQDTPDPLIDTCNGYYCENFTPNKKYKPKMW 254
Query: 251 TENWSGWFLSFGGAVPYRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNFDRTSGGPFIST 310
TENW+GW+ FGGAVP RP ED+AF+VARF Q GG+F NYYMYHGGTNFDRTS G FI+T
Sbjct: 255 TENWTGWYTEFGGAVPRRPAEDMAFSVARFVQNGGSFVNYYMYHGGTNFDRTSSGLFIAT 314
Query: 311 SYDYDAPLDEYGLIRQPKWGHLKDLHKAIKLCEAALVATDPTYPSLGPNLEATVYKTGSG 370
SYDYD P+DEYGL+ +PKWGHL+DLHKAIKLCE ALV+ DPT G NLE V+KT SG
Sbjct: 315 SYDYDGPIDEYGLLNEPKWGHLRDLHKAIKLCEPALVSVDPTVTWPGNNLEVHVFKT-SG 373
Query: 371 LCSAFLANIGTNSDVTVKFNGNSYLLPAWSVSILPDCKNVVFNTAKINSVTLVPSFSRQS 430
C+AFLAN T S +VKF Y LP WS+SILPDCK VFNTA++ + QS
Sbjct: 374 ACAAFLANYDTKSSASVKFGNGQYDLPPWSISILPDCKTAVFNTARLGA---------QS 424
Query: 431 LQVAADSSDAIGSGWSYINEPVGISKDDAFTKPGLLEQINTTADQSDYLWYSLSTNIKAD 490
+ + ++ SY EP ++DD+ T L EQIN T D +DYLWY NI A+
Sbjct: 425 SLMKMTAVNSAFDWQSYNEEPASSNEDDSLTAYALWEQINVTRDSTDYLWYMTDVNIDAN 484
Query: 491 EPLLEDGSKTVLHVQSLGHALHAFINGKLVGSGYGSSSNAKVTVDFPIALAPGKNTFDLL 550
E +++G VL V S GH LH IN +L G+ YG + K+T + L G N LL
Sbjct: 485 EGFIKNGQSPVLTVMSAGHVLHVLINDQLSGTVYGGLDSHKLTFSDSVKLRVGNNKISLL 544
Query: 551 SLTVGLQNYGAFYEKTGAGITGPVQLKGSGNGTNIDLSSQQWTYQTGLKGEELNFPS--- 607
S+ VGL N G +E AG+ GPV LKG GT DLS Q+W+Y+ GLKGE LN +
Sbjct: 545 SIAVGLPNVGPHFETWNAGVLGPVTLKGLNEGTR-DLSKQKWSYKIGLKGEALNLNTVSG 603
Query: 608 GSSTQWDSKSTLPKLQPLVWYKTTFDAPAGSEPVAIDFTGMGKGEAWVNGQSIGRYWPTY 667
SS +W S L K QPL WYKTTF PAG++P+A+D MGKG+AW+NG+SIGR+WP Y
Sbjct: 604 SSSVEWVQGSLLAKQQPLAWYKTTFSTPAGNDPLALDMISMGKGQAWINGRSIGRHWPGY 663
Query: 668 VSQNGGCTDSCNYRGAYSSNKCLKNCGKPSQSLYHVPRSWLKSSGNTLVLFEEIGGDPTK 727
+++ G C D C Y G Y+ KC NCG+PSQ YH+PRSWL SGN LV+FEE GGDPT
Sbjct: 664 IAR-GNCGD-CYYAGTYTDKKCRTNCGEPSQRWYHIPRSWLNPSGNYLVVFEEWGGDPTG 721
Query: 728 ISFVTKQLGSSLCSHVTDSHPLPVDMWGSDS-KIQRKPGPVLSLECPNPNQVISSIKFAS 786
I+ V K+ +S+C+ + P + DS K+ R P L CP P + IS IKFAS
Sbjct: 722 ITLV-KRTTASVCADIYQGQPTLKNRQMLDSGKVVR---PKAHLWCP-PGKNISQIKFAS 776
Query: 787 FGTPLGTCGSFSRGRCSSARSLSVVRQACVGSKSCSIGVSVNTF-GDPCKGVMKSLAVEA 845
+G P GTCG+F G C + +S ++ C+G +SC + V+ F GDPC G+ K L++EA
Sbjct: 777 YGLPQGTCGNFREGSCHAHKSYDAPQKNCIGKQSCLVTVAPEVFGGDPCPGIAKKLSLEA 836
Query: 846 SC 847
C
Sbjct: 837 LC 838
>gi|15231354|ref|NP_187988.1| beta galactosidase 1 [Arabidopsis thaliana]
gi|75274602|sp|Q9SCW1.1|BGAL1_ARATH RecName: Full=Beta-galactosidase 1; Short=Lactase 1; Flags:
Precursor
gi|6686874|emb|CAB64737.1| putative beta-galactosidase [Arabidopsis thaliana]
gi|9294020|dbj|BAB01923.1| beta-galactosidase [Arabidopsis thaliana]
gi|332641886|gb|AEE75407.1| beta galactosidase 1 [Arabidopsis thaliana]
Length = 847
Score = 971 bits (2511), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 489/853 (57%), Positives = 591/853 (69%), Gaps = 24/853 (2%)
Query: 1 MASKEILLLVLCWGFVVLATTSFGANVTYDHRAVVIGGKRRVLISGSIHYPRSTPEMWPD 60
MA+ L L+ GF+V S +V+YD RA+ I GKRR+LISGSIHYPRSTPEMWPD
Sbjct: 14 MAAVSALFLL---GFLV---CSVSGSVSYDSRAITINGKRRILISGSIHYPRSTPEMWPD 67
Query: 61 LIQKSKDGGLDVIETYVFWNLHEPVRNQYNFEGRYDLVKFVKLVAEAGLYAHLRIGPYVC 120
LI+K+K+GGLDVI+TYVFWN HEP +Y FEG YDLVKFVKLV ++GLY HLRIGPYVC
Sbjct: 68 LIRKAKEGGLDVIQTYVFWNGHEPSPGKYYFEGNYDLVKFVKLVQQSGLYLHLRIGPYVC 127
Query: 121 AEWNFGGFPLWLHFIPGIQFRTDNEPFKAEMQRFTAKIVDMMKQEKLYASQGGPIILSQI 180
AEWNFGGFP+WL +IPGI FRTDN PFKA+MQRFT KIV+MMK E+L+ SQGGPIILSQI
Sbjct: 128 AEWNFGGFPVWLKYIPGISFRTDNGPFKAQMQRFTTKIVNMMKAERLFESQGGPIILSQI 187
Query: 181 ENEYGNIDSAYGAAGKSYIKWAAGMALSLDTGVPWVMCQQSDAPDPIINTCNGFYCDQFT 240
ENEYG ++ GA G+SY WAA MA+ L TGVPWVMC+Q DAPDPIIN CNGFYCD F+
Sbjct: 188 ENEYGPMEYELGAPGRSYTNWAAKMAVGLGTGVPWVMCKQDDAPDPIINACNGFYCDYFS 247
Query: 241 PNSNNKPKMWTENWSGWFLSFGGAVPYRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNFD 300
PN KPKMWTE W+GWF FGG VPYRP ED+AF+VARF Q+GG+F NYYMYHGGTNF
Sbjct: 248 PNKAYKPKMWTEAWTGWFTKFGGPVPYRPAEDMAFSVARFIQKGGSFINYYMYHGGTNFG 307
Query: 301 RTSGGPFISTSYDYDAPLDEYGLIRQPKWGHLKDLHKAIKLCEAALVATDPTYPSLGPNL 360
RT+GGPFI+TSYDYDAPLDEYGL RQPKWGHLKDLH+AIKLCE ALV+ +PT LG
Sbjct: 308 RTAGGPFIATSYDYDAPLDEYGLERQPKWGHLKDLHRAIKLCEPALVSGEPTRMPLGNYQ 367
Query: 361 EATVYKTGSGLCSAFLANIGTNSDVTVKFNGNSYLLPAWSVSILPDCKNVVFNTAKINSV 420
EA VYK+ SG CSAFLAN S V F N Y LP WS+SILPDCKN V+NTA++ +
Sbjct: 368 EAHVYKSKSGACSAFLANYNPKSYAKVSFGNNHYNLPPWSISILPDCKNTVYNTARVGAQ 427
Query: 421 TLVPSFSRQSLQVAADSSDAIGSGWSYINEPVGISKDDAFTKPGLLEQINTTADQSDYLW 480
T R + G W NE D++FT GL+EQINTT D SDYLW
Sbjct: 428 TSRMKMVRVPVHG--------GLSWQAYNEDPSTYIDESFTMVGLVEQINTTRDTSDYLW 479
Query: 481 YSLSTNIKADEPLLEDGSKTVLHVQSLGHALHAFINGKLVGSGYGSSSNAKVTVDFPIAL 540
Y + A+E L +G L V S GHA+H FING+L GS YGS + K+T + L
Sbjct: 480 YMTDVKVDANEGFLRNGDLPTLTVLSAGHAMHVFINGQLSGSAYGSLDSPKLTFRKGVNL 539
Query: 541 APGKNTFDLLSLTVGLQNYGAFYEKTGAGITGPVQLKGSGNGTNIDLSSQQWTYQTGLKG 600
G N +LS+ VGL N G +E AG+ GPV L G NG DLS Q+WTY+ GLKG
Sbjct: 540 RAGFNKIAILSIAVGLPNVGPHFETWNAGVLGPVSLNGL-NGGRRDLSWQKWTYKVGLKG 598
Query: 601 E---ELNFPSGSSTQWDSKSTLPKLQPLVWYKTTFDAPAGSEPVAIDFTGMGKGEAWVNG 657
E + SS +W + + + QPL WYKTTF APAG P+A+D MGKG+ W+NG
Sbjct: 599 ESLSLHSLSGSSSVEWAEGAFVAQKQPLTWYKTTFSAPAGDSPLAVDMGSMGKGQIWING 658
Query: 658 QSIGRYWPTYVSQNGGCTDSCNYRGAYSSNKCLKNCGKPSQSLYHVPRSWLKSSGNTLVL 717
QS+GR+WP Y + G C++ C+Y G + +KCL+NCG+ SQ YHVPRSWLK SGN LV+
Sbjct: 659 QSLGRHWPAYKAV-GSCSE-CSYTGTFREDKCLRNCGEASQRWYHVPRSWLKPSGNLLVV 716
Query: 718 FEEIGGDPTKISFVTKQLGSSLCSHVTDSHPLPVD-MWGSDSKIQRKPGPVLSLECPNPN 776
FEE GGDP I+ V +++ S+C+ + + V+ + K+ + P L+C P
Sbjct: 717 FEEWGGDPNGITLVRREV-DSVCADIYEWQSTLVNYQLHASGKVNKPLHPKAHLQC-GPG 774
Query: 777 QVISSIKFASFGTPLGTCGSFSRGRCSSARSLSVVRQACVGSKSCSIGVSVNTF-GDPCK 835
Q I+++KFASFGTP GTCGS+ +G C + S + CVG CS+ V+ F GDPC
Sbjct: 775 QKITTVKFASFGTPEGTCGSYRQGSCHAHHSYDAFNKLCVGQNWCSVTVAPEMFGGDPCP 834
Query: 836 GVMKSLAVEASCT 848
VMK LAVEA C
Sbjct: 835 NVMKKLAVEAVCA 847
>gi|20260596|gb|AAM13196.1| galactosidase, putative [Arabidopsis thaliana]
Length = 847
Score = 971 bits (2510), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 489/853 (57%), Positives = 591/853 (69%), Gaps = 24/853 (2%)
Query: 1 MASKEILLLVLCWGFVVLATTSFGANVTYDHRAVVIGGKRRVLISGSIHYPRSTPEMWPD 60
MA+ L L+ GF+V S +V+YD RA+ I GKRR+LISGSIHYPRSTPEMWPD
Sbjct: 14 MAAVSALFLL---GFLV---CSVSGSVSYDSRAITINGKRRILISGSIHYPRSTPEMWPD 67
Query: 61 LIQKSKDGGLDVIETYVFWNLHEPVRNQYNFEGRYDLVKFVKLVAEAGLYAHLRIGPYVC 120
LI+K+K+GGLDVI+TYVFWN HEP +Y FEG YDLVKFVKLV ++GLY HLRIGPYVC
Sbjct: 68 LIRKAKEGGLDVIQTYVFWNGHEPSPGKYYFEGNYDLVKFVKLVQQSGLYLHLRIGPYVC 127
Query: 121 AEWNFGGFPLWLHFIPGIQFRTDNEPFKAEMQRFTAKIVDMMKQEKLYASQGGPIILSQI 180
AEWNFGGFP+WL +IPGI FRTDN PFKA+MQRFT KIV+MMK E+L+ SQGGPIILSQI
Sbjct: 128 AEWNFGGFPVWLKYIPGISFRTDNGPFKAQMQRFTTKIVNMMKAERLFESQGGPIILSQI 187
Query: 181 ENEYGNIDSAYGAAGKSYIKWAAGMALSLDTGVPWVMCQQSDAPDPIINTCNGFYCDQFT 240
ENEYG ++ GA G+SY WAA MA+ L TGVPWVMC+Q DAPDPIIN CNGFYCD F+
Sbjct: 188 ENEYGPMEYELGAPGRSYTNWAAKMAVGLGTGVPWVMCKQDDAPDPIINACNGFYCDYFS 247
Query: 241 PNSNNKPKMWTENWSGWFLSFGGAVPYRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNFD 300
PN KPKMWTE W+GWF FGG VPYRP ED+AF+VARF Q+GG+F NYYMYHGGTNF
Sbjct: 248 PNKAYKPKMWTEAWTGWFTKFGGPVPYRPAEDMAFSVARFIQKGGSFINYYMYHGGTNFG 307
Query: 301 RTSGGPFISTSYDYDAPLDEYGLIRQPKWGHLKDLHKAIKLCEAALVATDPTYPSLGPNL 360
RT+GGPFI+TSYDYDAPLDEYGL RQPKWGHLKDLH+AIKLCE ALV+ +PT LG
Sbjct: 308 RTAGGPFIATSYDYDAPLDEYGLERQPKWGHLKDLHRAIKLCEPALVSGEPTRMPLGNYQ 367
Query: 361 EATVYKTGSGLCSAFLANIGTNSDVTVKFNGNSYLLPAWSVSILPDCKNVVFNTAKINSV 420
EA VYK+ SG CSAFLAN S V F N Y LP WS+SILPDCKN V+NTA++ +
Sbjct: 368 EAHVYKSKSGACSAFLANYNPKSYAKVSFGNNHYNLPPWSISILPDCKNTVYNTARVGAQ 427
Query: 421 TLVPSFSRQSLQVAADSSDAIGSGWSYINEPVGISKDDAFTKPGLLEQINTTADQSDYLW 480
T R + G W NE D++FT GL+EQINTT D SDYLW
Sbjct: 428 TSRMKMVRVPVHG--------GLSWQAYNEDPSTYIDESFTMVGLVEQINTTRDTSDYLW 479
Query: 481 YSLSTNIKADEPLLEDGSKTVLHVQSLGHALHAFINGKLVGSGYGSSSNAKVTVDFPIAL 540
Y + A+E L +G L V S GHA+H FING+L GS YGS + K+T + L
Sbjct: 480 YMTDVKVDANEGFLRNGDLPTLTVLSAGHAMHLFINGQLSGSAYGSLDSPKLTFRKGVNL 539
Query: 541 APGKNTFDLLSLTVGLQNYGAFYEKTGAGITGPVQLKGSGNGTNIDLSSQQWTYQTGLKG 600
G N +LS+ VGL N G +E AG+ GPV L G NG DLS Q+WTY+ GLKG
Sbjct: 540 RAGFNKIAILSIAVGLPNVGPHFETWNAGVLGPVSLNGL-NGGRRDLSWQKWTYKVGLKG 598
Query: 601 E---ELNFPSGSSTQWDSKSTLPKLQPLVWYKTTFDAPAGSEPVAIDFTGMGKGEAWVNG 657
E + SS +W + + + QPL WYKTTF APAG P+A+D MGKG+ W+NG
Sbjct: 599 ESLSLHSLSGSSSVEWAEGAFVAQKQPLTWYKTTFSAPAGDSPLAVDMGSMGKGQIWING 658
Query: 658 QSIGRYWPTYVSQNGGCTDSCNYRGAYSSNKCLKNCGKPSQSLYHVPRSWLKSSGNTLVL 717
QS+GR+WP Y + G C++ C+Y G + +KCL+NCG+ SQ YHVPRSWLK SGN LV+
Sbjct: 659 QSLGRHWPAYKAV-GSCSE-CSYTGTFREDKCLRNCGEASQRWYHVPRSWLKPSGNLLVV 716
Query: 718 FEEIGGDPTKISFVTKQLGSSLCSHVTDSHPLPVD-MWGSDSKIQRKPGPVLSLECPNPN 776
FEE GGDP I+ V +++ S+C+ + + V+ + K+ + P L+C P
Sbjct: 717 FEEWGGDPNGITLVRREV-DSVCADIYEWQSTLVNYQLHASGKVNKPLHPKAHLQC-GPG 774
Query: 777 QVISSIKFASFGTPLGTCGSFSRGRCSSARSLSVVRQACVGSKSCSIGVSVNTF-GDPCK 835
Q I+++KFASFGTP GTCGS+ +G C + S + CVG CS+ V+ F GDPC
Sbjct: 775 QKITTVKFASFGTPEGTCGSYRQGSCHAHHSYDAFNKLCVGQNWCSVTVAPEMFGGDPCP 834
Query: 836 GVMKSLAVEASCT 848
VMK LAVEA C
Sbjct: 835 NVMKKLAVEAVCA 847
>gi|18419821|ref|NP_568001.1| beta-galactosidase 3 [Arabidopsis thaliana]
gi|75202767|sp|Q9SCV9.1|BGAL3_ARATH RecName: Full=Beta-galactosidase 3; Short=Lactase 3; Flags:
Precursor
gi|6686878|emb|CAB64739.1| putative beta-galactosidase [Arabidopsis thaliana]
gi|15810493|gb|AAL07134.1| putative beta-galactosidase [Arabidopsis thaliana]
gi|20259271|gb|AAM14371.1| putative beta-galactosidase [Arabidopsis thaliana]
gi|332661246|gb|AEE86646.1| beta-galactosidase 3 [Arabidopsis thaliana]
Length = 856
Score = 971 bits (2510), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 476/855 (55%), Positives = 594/855 (69%), Gaps = 28/855 (3%)
Query: 3 SKEILLLVLCWGFVVLATTSFGANVTYDHRAVVIGGKRRVLISGSIHYPRSTPEMWPDLI 62
S L+L C GF++L VTYD +A++I G+RR+L SGSIHYPRSTP+MW DLI
Sbjct: 9 SASRLILWFCLGFLILGVGFVQCGVTYDRKALLINGQRRILFSGSIHYPRSTPDMWEDLI 68
Query: 63 QKSKDGGLDVIETYVFWNLHEPVRNQYNFEGRYDLVKFVKLVAEAGLYAHLRIGPYVCAE 122
QK+KDGG+DVIETYVFWNLHEP +Y+FEGR DLV+FVK + +AGLYAHLRIGPYVCAE
Sbjct: 69 QKAKDGGIDVIETYVFWNLHEPSPGKYDFEGRNDLVRFVKTIHKAGLYAHLRIGPYVCAE 128
Query: 123 WNFGGFPLWLHFIPGIQFRTDNEPFKAEMQRFTAKIVDMMKQEKLYASQGGPIILSQIEN 182
WNFGGFP+WL ++PGI FRTDNEPFK M+ FT +IV++MK E L+ SQGGPIILSQIEN
Sbjct: 129 WNFGGFPVWLKYVPGISFRTDNEPFKRAMKGFTERIVELMKSENLFESQGGPIILSQIEN 188
Query: 183 EYGNIDSAYGAAGKSYIKWAAGMALSLDTGVPWVMCQQSDAPDPIINTCNGFYCDQFTPN 242
EYG GA G +Y+ WAA MA++ +TGVPWVMC++ DAPDP+INTCNGFYCD F PN
Sbjct: 189 EYGRQGQLLGAEGHNYMTWAAKMAIATETGVPWVMCKEDDAPDPVINTCNGFYCDSFAPN 248
Query: 243 SNNKPKMWTENWSGWFLSFGGAVPYRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNFDRT 302
KP +WTE WSGWF FGG + +RPV+DLAF VARF Q+GG+F NYYMYHGGTNF RT
Sbjct: 249 KPYKPLIWTEAWSGWFTEFGGPMHHRPVQDLAFGVARFIQKGGSFVNYYMYHGGTNFGRT 308
Query: 303 SGGPFISTSYDYDAPLDEYGLIRQPKWGHLKDLHKAIKLCEAALVATDPTYPSLGPNLEA 362
+GGPF++TSYDYDAP+DEYGLIRQPK+GHLK+LH+AIK+CE ALV+ DP S+G +A
Sbjct: 309 AGGPFVTTSYDYDAPIDEYGLIRQPKYGHLKELHRAIKMCEKALVSADPVVTSIGNKQQA 368
Query: 363 TVYKTGSGLCSAFLANIGTNSDVTVKFNGNSYLLPAWSVSILPDCKNVVFNTAKINSVTL 422
VY SG CSAFLAN T S V FN Y LP WS+SILPDC+N VFNTAK+
Sbjct: 369 HVYSAESGDCSAFLANYDTESAARVLFNNVHYNLPPWSISILPDCRNAVFNTAKVGV--- 425
Query: 423 VPSFSRQSLQVAADSSDAIGSGW-SYINEPVGISKDDAFTKPGLLEQINTTADQSDYLWY 481
Q+ Q+ +D W SY+ + + FT GLLEQIN T D SDYLWY
Sbjct: 426 ------QTSQMEMLPTDTKNFQWESYLEDLSSLDDSSTFTTHGLLEQINVTRDTSDYLWY 479
Query: 482 SLSTNIKADEPLLEDGSKTVLHVQSLGHALHAFINGKLVGSGYGSSSNAKVTVDFPIALA 541
S +I E L G L +QS GHA+H F+NG+L GS +G+ N + T I L
Sbjct: 480 MTSVDIGDSESFLHGGELPTLIIQSTGHAVHIFVNGQLSGSAFGTRQNRRFTYQGKINLH 539
Query: 542 PGKNTFDLLSLTVGLQNYGAFYEKTGAGITGPVQLKGSGNGTNIDLSSQQWTYQTGLKGE 601
G N LLS+ VGL N G +E GI GPV L G G +DLS Q+WTYQ GLKGE
Sbjct: 540 SGTNRIALLSVAVGLPNVGGHFESWNTGILGPVALHGLSQG-KMDLSWQKWTYQVGLKGE 598
Query: 602 ELN--FPSGS-STQW-DSKSTLPKLQPLVWYKTTFDAPAGSEPVAIDFTGMGKGEAWVNG 657
+N FP+ + S W D+ T+ K QPL W+KT FDAP G+EP+A+D GMGKG+ WVNG
Sbjct: 599 AMNLAFPTNTPSIGWMDASLTVQKPQPLTWHKTYFDAPEGNEPLALDMEGMGKGQIWVNG 658
Query: 658 QSIGRYWPTYVSQNGGCTDSCNYRGAYSSNKCLKNCGKPSQSLYHVPRSWLKSSGNTLVL 717
+SIGRYW + + G C+ C+Y G Y NKC CG+P+Q YHVPR+WLK S N LV+
Sbjct: 659 ESIGRYWTAFAT--GDCSH-CSYTGTYKPNKCQTGCGQPTQRWYHVPRAWLKPSQNLLVI 715
Query: 718 FEEIGGDPTKISFVTKQLGSSLCSHVTDSHP----LPVDMWGSDSKIQRKPGPVLSLECP 773
FEE+GG+P+ +S V + + S +C+ V++ HP ++ +G R P + L+C
Sbjct: 716 FEELGGNPSTVSLVKRSV-SGVCAEVSEYHPNIKNWQIESYGKGQTFHR---PKVHLKC- 770
Query: 774 NPNQVISSIKFASFGTPLGTCGSFSRGRCSSARSLSVVRQACVGSKSCSIGVSVNTFG-D 832
+P Q I+SIKFASFGTPLGTCGS+ +G C +A S +++ + CVG C++ +S + FG D
Sbjct: 771 SPGQAIASIKFASFGTPLGTCGSYQQGECHAATSYAILERKCVGKARCAVTISNSNFGKD 830
Query: 833 PCKGVMKSLAVEASC 847
PC V+K L VEA C
Sbjct: 831 PCPNVLKRLTVEAVC 845
>gi|297829920|ref|XP_002882842.1| hypothetical protein ARALYDRAFT_897617 [Arabidopsis lyrata subsp.
lyrata]
gi|297328682|gb|EFH59101.1| hypothetical protein ARALYDRAFT_897617 [Arabidopsis lyrata subsp.
lyrata]
Length = 847
Score = 971 bits (2509), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 489/853 (57%), Positives = 589/853 (69%), Gaps = 24/853 (2%)
Query: 1 MASKEILLLVLCWGFVVLATTSFGANVTYDHRAVVIGGKRRVLISGSIHYPRSTPEMWPD 60
MA+ L L+ GF+V S +V+YD RA+ I GKRR+LISGSIHYPRSTPEMWPD
Sbjct: 14 MAAVSALFLL---GFLV---CSVSGSVSYDSRAITINGKRRILISGSIHYPRSTPEMWPD 67
Query: 61 LIQKSKDGGLDVIETYVFWNLHEPVRNQYNFEGRYDLVKFVKLVAEAGLYAHLRIGPYVC 120
LI+K+K+GGLDVI+TYVFWN HEP +Y FEG YDLV+FVKLV ++GLY HLRIGPYVC
Sbjct: 68 LIRKAKEGGLDVIQTYVFWNGHEPSPGKYYFEGNYDLVRFVKLVQQSGLYLHLRIGPYVC 127
Query: 121 AEWNFGGFPLWLHFIPGIQFRTDNEPFKAEMQRFTAKIVDMMKQEKLYASQGGPIILSQI 180
AEWNFGGFP+WL +IPGI FRTDN PFKA+MQRFT KIV+MMK E+L+ SQGGPIILSQI
Sbjct: 128 AEWNFGGFPVWLKYIPGISFRTDNGPFKAQMQRFTTKIVNMMKAERLFESQGGPIILSQI 187
Query: 181 ENEYGNIDSAYGAAGKSYIKWAAGMALSLDTGVPWVMCQQSDAPDPIINTCNGFYCDQFT 240
ENEYG ++ GA G+SY WAA MA+ L TGVPWVMC+Q DAPDPIIN CNGFYCD F+
Sbjct: 188 ENEYGPMEYELGAPGRSYTNWAAKMAVGLGTGVPWVMCKQDDAPDPIINACNGFYCDYFS 247
Query: 241 PNSNNKPKMWTENWSGWFLSFGGAVPYRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNFD 300
PN KPKMWTE W+GWF FGG VPYRP ED+AF+VARF Q+GG+F NYYMYHGGTNF
Sbjct: 248 PNKAYKPKMWTEAWTGWFTKFGGPVPYRPAEDMAFSVARFIQKGGSFINYYMYHGGTNFG 307
Query: 301 RTSGGPFISTSYDYDAPLDEYGLIRQPKWGHLKDLHKAIKLCEAALVATDPTYPSLGPNL 360
RT+GGPFI+TSYDYDAPLDEYGL RQPKWGHLKDLH+AIKLCE ALV+ +PT LG
Sbjct: 308 RTAGGPFIATSYDYDAPLDEYGLERQPKWGHLKDLHRAIKLCEPALVSGEPTRMPLGNYQ 367
Query: 361 EATVYKTGSGLCSAFLANIGTNSDVTVKFNGNSYLLPAWSVSILPDCKNVVFNTAKINSV 420
EA VYK SG CSAFLAN S V F N Y LP WS+SILPDCKN V+NTA++ +
Sbjct: 368 EAHVYKAKSGACSAFLANYNPKSYAKVSFGSNHYNLPPWSISILPDCKNTVYNTARVGAQ 427
Query: 421 TLVPSFSRQSLQVAADSSDAIGSGWSYINEPVGISKDDAFTKPGLLEQINTTADQSDYLW 480
T R + G W NE D++FT GL+EQINTT D SDYLW
Sbjct: 428 TSRMKMVRVPVHG--------GLSWQAYNEDPSTYIDESFTMVGLVEQINTTRDTSDYLW 479
Query: 481 YSLSTNIKADEPLLEDGSKTVLHVQSLGHALHAFINGKLVGSGYGSSSNAKVTVDFPIAL 540
Y I A+E L +G L V S GHA+H FING+L GS YGS + K+T + L
Sbjct: 480 YMTDVKIDANEGFLRNGDLPTLTVLSAGHAMHVFINGQLSGSAYGSLDSPKLTFRKGVNL 539
Query: 541 APGKNTFDLLSLTVGLQNYGAFYEKTGAGITGPVQLKGSGNGTNIDLSSQQWTYQTGLKG 600
G N +LS+ VGL N G +E AG+ GPV L G G DLS Q+WTY+ GLKG
Sbjct: 540 RAGFNKIAILSIAVGLPNVGPHFETWNAGVLGPVSLNGLSGGRR-DLSWQKWTYKVGLKG 598
Query: 601 E---ELNFPSGSSTQWDSKSTLPKLQPLVWYKTTFDAPAGSEPVAIDFTGMGKGEAWVNG 657
E + SS +W + + + QPL WYKTTF APAG P+A+D MGKG+ W+NG
Sbjct: 599 ESLSLHSLSGSSSVEWAEGAFVAQKQPLTWYKTTFSAPAGDSPLAVDMGSMGKGQIWING 658
Query: 658 QSIGRYWPTYVSQNGGCTDSCNYRGAYSSNKCLKNCGKPSQSLYHVPRSWLKSSGNTLVL 717
QS+GR+WP Y + G C++ C+Y G + +KCL+NCG+ SQ YHVPRSWLK SGN LV+
Sbjct: 659 QSLGRHWPAYKAV-GSCSE-CSYTGTFREDKCLRNCGEASQRWYHVPRSWLKPSGNLLVV 716
Query: 718 FEEIGGDPTKISFVTKQLGSSLCSHVTDSHPLPVD-MWGSDSKIQRKPGPVLSLECPNPN 776
FEE GGDP IS V +++ S+C+ + + V+ + K+ + P + L+C P
Sbjct: 717 FEEWGGDPNGISLVRREV-DSVCADIYEWQSTLVNYQLHASGKVNKPLHPKVHLQC-GPG 774
Query: 777 QVISSIKFASFGTPLGTCGSFSRGRCSSARSLSVVRQACVGSKSCSIGVSVNTF-GDPCK 835
Q I+++KFASFGTP GTCGS+ +G C S + CVG CS+ V+ F GDPC
Sbjct: 775 QKITTVKFASFGTPEGTCGSYRQGSCHDHHSYDAFNKLCVGQNWCSVTVAPEMFGGDPCP 834
Query: 836 GVMKSLAVEASCT 848
VMK LAVEA C
Sbjct: 835 NVMKKLAVEAVCA 847
>gi|4006924|emb|CAB16852.1| beta-galactosidase like protein [Arabidopsis thaliana]
gi|7270584|emb|CAB80302.1| beta-galactosidase like protein [Arabidopsis thaliana]
Length = 853
Score = 970 bits (2508), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 476/855 (55%), Positives = 594/855 (69%), Gaps = 28/855 (3%)
Query: 3 SKEILLLVLCWGFVVLATTSFGANVTYDHRAVVIGGKRRVLISGSIHYPRSTPEMWPDLI 62
S L+L C GF++L VTYD +A++I G+RR+L SGSIHYPRSTP+MW DLI
Sbjct: 6 SASRLILWFCLGFLILGVGFVQCGVTYDRKALLINGQRRILFSGSIHYPRSTPDMWEDLI 65
Query: 63 QKSKDGGLDVIETYVFWNLHEPVRNQYNFEGRYDLVKFVKLVAEAGLYAHLRIGPYVCAE 122
QK+KDGG+DVIETYVFWNLHEP +Y+FEGR DLV+FVK + +AGLYAHLRIGPYVCAE
Sbjct: 66 QKAKDGGIDVIETYVFWNLHEPSPGKYDFEGRNDLVRFVKTIHKAGLYAHLRIGPYVCAE 125
Query: 123 WNFGGFPLWLHFIPGIQFRTDNEPFKAEMQRFTAKIVDMMKQEKLYASQGGPIILSQIEN 182
WNFGGFP+WL ++PGI FRTDNEPFK M+ FT +IV++MK E L+ SQGGPIILSQIEN
Sbjct: 126 WNFGGFPVWLKYVPGISFRTDNEPFKRAMKGFTERIVELMKSENLFESQGGPIILSQIEN 185
Query: 183 EYGNIDSAYGAAGKSYIKWAAGMALSLDTGVPWVMCQQSDAPDPIINTCNGFYCDQFTPN 242
EYG GA G +Y+ WAA MA++ +TGVPWVMC++ DAPDP+INTCNGFYCD F PN
Sbjct: 186 EYGRQGQLLGAEGHNYMTWAAKMAIATETGVPWVMCKEDDAPDPVINTCNGFYCDSFAPN 245
Query: 243 SNNKPKMWTENWSGWFLSFGGAVPYRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNFDRT 302
KP +WTE WSGWF FGG + +RPV+DLAF VARF Q+GG+F NYYMYHGGTNF RT
Sbjct: 246 KPYKPLIWTEAWSGWFTEFGGPMHHRPVQDLAFGVARFIQKGGSFVNYYMYHGGTNFGRT 305
Query: 303 SGGPFISTSYDYDAPLDEYGLIRQPKWGHLKDLHKAIKLCEAALVATDPTYPSLGPNLEA 362
+GGPF++TSYDYDAP+DEYGLIRQPK+GHLK+LH+AIK+CE ALV+ DP S+G +A
Sbjct: 306 AGGPFVTTSYDYDAPIDEYGLIRQPKYGHLKELHRAIKMCEKALVSADPVVTSIGNKQQA 365
Query: 363 TVYKTGSGLCSAFLANIGTNSDVTVKFNGNSYLLPAWSVSILPDCKNVVFNTAKINSVTL 422
VY SG CSAFLAN T S V FN Y LP WS+SILPDC+N VFNTAK+
Sbjct: 366 HVYSAESGDCSAFLANYDTESAARVLFNNVHYNLPPWSISILPDCRNAVFNTAKVGV--- 422
Query: 423 VPSFSRQSLQVAADSSDAIGSGW-SYINEPVGISKDDAFTKPGLLEQINTTADQSDYLWY 481
Q+ Q+ +D W SY+ + + FT GLLEQIN T D SDYLWY
Sbjct: 423 ------QTSQMEMLPTDTKNFQWESYLEDLSSLDDSSTFTTHGLLEQINVTRDTSDYLWY 476
Query: 482 SLSTNIKADEPLLEDGSKTVLHVQSLGHALHAFINGKLVGSGYGSSSNAKVTVDFPIALA 541
S +I E L G L +QS GHA+H F+NG+L GS +G+ N + T I L
Sbjct: 477 MTSVDIGDSESFLHGGELPTLIIQSTGHAVHIFVNGQLSGSAFGTRQNRRFTYQGKINLH 536
Query: 542 PGKNTFDLLSLTVGLQNYGAFYEKTGAGITGPVQLKGSGNGTNIDLSSQQWTYQTGLKGE 601
G N LLS+ VGL N G +E GI GPV L G G +DLS Q+WTYQ GLKGE
Sbjct: 537 SGTNRIALLSVAVGLPNVGGHFESWNTGILGPVALHGLSQG-KMDLSWQKWTYQVGLKGE 595
Query: 602 ELN--FPSGS-STQW-DSKSTLPKLQPLVWYKTTFDAPAGSEPVAIDFTGMGKGEAWVNG 657
+N FP+ + S W D+ T+ K QPL W+KT FDAP G+EP+A+D GMGKG+ WVNG
Sbjct: 596 AMNLAFPTNTPSIGWMDASLTVQKPQPLTWHKTYFDAPEGNEPLALDMEGMGKGQIWVNG 655
Query: 658 QSIGRYWPTYVSQNGGCTDSCNYRGAYSSNKCLKNCGKPSQSLYHVPRSWLKSSGNTLVL 717
+SIGRYW + + G C+ C+Y G Y NKC CG+P+Q YHVPR+WLK S N LV+
Sbjct: 656 ESIGRYWTAFAT--GDCSH-CSYTGTYKPNKCQTGCGQPTQRWYHVPRAWLKPSQNLLVI 712
Query: 718 FEEIGGDPTKISFVTKQLGSSLCSHVTDSHP----LPVDMWGSDSKIQRKPGPVLSLECP 773
FEE+GG+P+ +S V + + S +C+ V++ HP ++ +G R P + L+C
Sbjct: 713 FEELGGNPSTVSLVKRSV-SGVCAEVSEYHPNIKNWQIESYGKGQTFHR---PKVHLKC- 767
Query: 774 NPNQVISSIKFASFGTPLGTCGSFSRGRCSSARSLSVVRQACVGSKSCSIGVSVNTFG-D 832
+P Q I+SIKFASFGTPLGTCGS+ +G C +A S +++ + CVG C++ +S + FG D
Sbjct: 768 SPGQAIASIKFASFGTPLGTCGSYQQGECHAATSYAILERKCVGKARCAVTISNSNFGKD 827
Query: 833 PCKGVMKSLAVEASC 847
PC V+K L VEA C
Sbjct: 828 PCPNVLKRLTVEAVC 842
>gi|356556730|ref|XP_003546676.1| PREDICTED: beta-galactosidase 1-like [Glycine max]
Length = 840
Score = 970 bits (2507), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 487/852 (57%), Positives = 592/852 (69%), Gaps = 26/852 (3%)
Query: 7 LLLVLCWGFVVLATTSF----GANVTYDHRAVVIGGKRRVLISGSIHYPRSTPEMWPDLI 62
L L++ W +L S A+V+YD +A+ I G+RR+LISGSIHYPRSTPEMWPDLI
Sbjct: 5 LKLIIMWNVALLLVFSLIGSAKASVSYDSKAITINGQRRILISGSIHYPRSTPEMWPDLI 64
Query: 63 QKSKDGGLDVIETYVFWNLHEPVRNQYNFEGRYDLVKFVKLVAEAGLYAHLRIGPYVCAE 122
QK+KDGGLDVI+TYVFWN HEP +Y FEG YDLVKF+KLV +AGLY HLRIGPYVCAE
Sbjct: 65 QKAKDGGLDVIQTYVFWNGHEPSPGKYYFEGNYDLVKFIKLVQQAGLYVHLRIGPYVCAE 124
Query: 123 WNFGGFPLWLHFIPGIQFRTDNEPFKAEMQRFTAKIVDMMKQEKLYASQGGPIILSQIEN 182
WNFGGFP+WL +IPGI FRTDNEPFK +MQ+FT KIVD+MK E+LY SQGGPII+SQIEN
Sbjct: 125 WNFGGFPVWLKYIPGISFRTDNEPFKHQMQKFTTKIVDLMKAERLYESQGGPIIMSQIEN 184
Query: 183 EYGNIDSAYGAAGKSYIKWAAGMALSLDTGVPWVMCQQSDAPDPIINTCNGFYCDQFTPN 242
EYG ++ GAAGK+Y KWAA MA+ L TGVPWVMC+Q D PDP+INTCNGFYCD F+PN
Sbjct: 185 EYGPMEYEIGAAGKAYTKWAAEMAMGLGTGVPWVMCKQDDTPDPLINTCNGFYCDYFSPN 244
Query: 243 SNNKPKMWTENWSGWFLSFGGAVPYRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNFDRT 302
KPKMWTE W+GWF FGG VP+RP EDLAF+VARF Q+GG+F NYYMYHGGTNF RT
Sbjct: 245 KAYKPKMWTEAWTGWFTEFGGPVPHRPAEDLAFSVARFIQKGGSFINYYMYHGGTNFGRT 304
Query: 303 SGGPFISTSYDYDAPLDEYGLIRQPKWGHLKDLHKAIKLCEAALVATDPTYPSLGPNLEA 362
+GGPFI+TSYDYDAPLDEYGL+RQPKWGHLKDLH+AIKLCE ALV+ DPT +G EA
Sbjct: 305 AGGPFIATSYDYDAPLDEYGLLRQPKWGHLKDLHRAIKLCEPALVSGDPTVTKIGNYQEA 364
Query: 363 TVYKTGSGLCSAFLANIGTNSDVTVKFNGNSYLLPAWSVSILPDCKNVVFNTAKINSVTL 422
V+K+ SG C+AFLAN S TV F Y LP WS+SILPDCKN V+NTA++ S +
Sbjct: 365 HVFKSKSGACAAFLANYNPKSYATVAFGNMHYNLPPWSISILPDCKNTVYNTARVGSQSA 424
Query: 423 VPSFSRQSLQVAADSSDAIGSGWSYINEPVGISKDDAFTKPGLLEQINTTADQSDYLWYS 482
+R + G W NE + D +FT GLLEQ+NTT D SDYLWYS
Sbjct: 425 QMKMTRVPIHG--------GFSWLSFNEETTTTDDSSFTMTGLLEQLNTTRDLSDYLWYS 476
Query: 483 LSTNIKADEPLLEDGSKTVLHVQSLGHALHAFINGKLVGSGYGSSSNAKVTVDFPIALAP 542
+ +E L +G VL V S GHALH FING+L G+ YGS K+T + + L
Sbjct: 477 TDVVLDPNEGFLRNGKDPVLTVFSAGHALHVFINGQLSGTAYGSLEFPKLTFNEGVKLRA 536
Query: 543 GKNTFDLLSLTVGLQNYGAFYEKTGAGITGPVQLKGSGNGTNIDLSSQQWTYQTGLKGE- 601
G N LLS+ VGL N G +E AG+ GP+ L G G DLS Q+W+Y+ GLKGE
Sbjct: 537 GVNKISLLSVAVGLPNVGPHFETWNAGVLGPISLSGLNEGRR-DLSWQKWSYKVGLKGEI 595
Query: 602 --ELNFPSGSSTQWDSKSTLPKLQPLVWYKTTFDAPAGSEPVAIDFTGMGKGEAWVNGQS 659
+ SS +W S + + QPL WYKTTFDAPAG+ P+A+D MGKG+ W+NGQ+
Sbjct: 596 LSLHSLSGSSSVEWIQGSLVSQRQPLTWYKTTFDAPAGTAPLALDMDSMGKGQVWLNGQN 655
Query: 660 IGRYWPTYVSQNGGCTDSCNYRGAYSSNKCLKNCGKPSQSLYHVPRSWLKSSGNTLVLFE 719
+GRYWP Y + G D C+Y G Y+ NKC NCG+ SQ YHVP+SWLK +GN LV+FE
Sbjct: 656 LGRYWPAY--KASGTCDYCDYAGTYNENKCRSNCGEASQRWYHVPQSWLKPTGNLLVVFE 713
Query: 720 EIGGDPTKISFVTKQLGSSLCSHVTDSHP--LPVDMWGSDSKIQRKPGPVLSLECPNPNQ 777
E+GGDP I V + + S+C+ + + P + M S R P + L C +P Q
Sbjct: 714 ELGGDPNGIFLVRRDI-DSVCADIYEWQPNLISYQMQTSGKAPVR---PKVHLSC-SPGQ 768
Query: 778 VISSIKFASFGTPLGTCGSFSRGRCSSARSLSVVRQACVGSKSCSIGVSVNTF-GDPCKG 836
ISSIKFASFGTP G+CG+F G C + +S + CVG C++ VS F GDPC
Sbjct: 769 KISSIKFASFGTPAGSCGNFHEGSCHAHKSYDAFERNCVGQNWCTVTVSPENFGGDPCPN 828
Query: 837 VMKSLAVEASCT 848
V+K L+VEA C+
Sbjct: 829 VLKKLSVEAICS 840
>gi|14970839|emb|CAC44500.1| beta-galactosidase [Fragaria x ananassa]
Length = 843
Score = 970 bits (2507), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 482/832 (57%), Positives = 572/832 (68%), Gaps = 18/832 (2%)
Query: 22 SFGANVTYDHRAVVIGGKRRVLISGSIHYPRSTPEMWPDLIQKSKDGGLDVIETYVFWNL 81
S A+V+YD +A+VI G+RR+LISGSIHYPRSTPEMWPDLIQ++KDGGLDVI+TYVFWN
Sbjct: 25 SVRASVSYDSKAIVINGQRRILISGSIHYPRSTPEMWPDLIQRAKDGGLDVIQTYVFWNG 84
Query: 82 HEPVRNQYNFEGRYDLVKFVKLVAEAGLYAHLRIGPYVCAEWNFGGFPLWLHFIPGIQFR 141
HEP +Y FE YDLVKF+KLV +AGLY HLRIGPYVCAEWNFGGFP+WL ++PGIQFR
Sbjct: 85 HEPSPGKYYFEDNYDLVKFIKLVQQAGLYVHLRIGPYVCAEWNFGGFPVWLKYVPGIQFR 144
Query: 142 TDNEPFKAEMQRFTAKIVDMMKQEKLYASQGGPIILSQIENEYGNIDSAYGAAGKSYIKW 201
TDN PFK +MQRFT KIV+MMK E+L+ S GGPIILSQIENEYG ++ GA GK+Y W
Sbjct: 145 TDNGPFKDQMQRFTTKIVNMMKAERLFESHGGPIILSQIENEYGPMEYEIGAPGKAYTDW 204
Query: 202 AAGMALSLDTGVPWVMCQQSDAPDPIINTCNGFYCDQFTPNSNNKPKMWTENWSGWFLSF 261
AA MA+ L TGVPWVMC+Q DAPDP+IN CNGFYCD F+PN KPKMWTE W+GWF F
Sbjct: 205 AAQMAVGLGTGVPWVMCKQDDAPDPVINACNGFYCDYFSPNKAYKPKMWTEAWTGWFTEF 264
Query: 262 GGAVPYRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNFDRTSGGPFISTSYDYDAPLDEY 321
GGAVPYRP EDLAF+VA+F Q+GG F NYYMYHGGTNF RT+GGPFI+TSYDYDAPLDEY
Sbjct: 265 GGAVPYRPAEDLAFSVAKFLQKGGAFINYYMYHGGTNFGRTAGGPFIATSYDYDAPLDEY 324
Query: 322 GLIRQPKWGHLKDLHKAIKLCEAALVATDPTYPSLGPNLEATVYKTGSGLCSAFLANIGT 381
GL+RQPKWGHLKDLH+AIKLCE ALV++DPT LG EA V+K+ SG C+AFLAN
Sbjct: 325 GLLRQPKWGHLKDLHRAIKLCEPALVSSDPTVTPLGTYQEAHVFKSNSGACAAFLANYNR 384
Query: 382 NSDVTVKFNGNSYLLPAWSVSILPDCKNVVFNTAKINSVTLVPSFSRQSLQVAADSSDAI 441
S V F Y LP WS+SILPDCKN V+NTA+I + T R +
Sbjct: 385 KSFAKVAFGNMHYNLPPWSISILPDCKNTVYNTARIGAQTARMKMPRVPIHG-------- 436
Query: 442 GSGWSYINEPVGISKDDAFTKPGLLEQINTTADQSDYLWYSLSTNIKADEPLLEDGSKTV 501
G W N+ D +FT GLLEQIN T D +DYLWY I E L G+ V
Sbjct: 437 GFSWQAYNDETATYSDTSFTTAGLLEQINITRDATDYLWYMTDVKIDPSEDFLRSGNYPV 496
Query: 502 LHVQSLGHALHAFINGKLVGSGYGSSSNAKVTVDFPIALAPGKNTFDLLSLTVGLQNYGA 561
L V S GHAL FING+L G+ YGS K+T + L G N LLS+ VGL N G
Sbjct: 497 LTVLSAGHALRVFINGQLAGTAYGSLETPKLTFKQGVNLRAGINQIALLSIAVGLPNVGP 556
Query: 562 FYEKTGAGITGPVQLKGSGNGTNIDLSSQQWTYQTGLKGEELNFPS---GSSTQWDSKST 618
+E AGI GPV L G G DLS Q+W+Y+ GLKGE L+ S SS +W S
Sbjct: 557 HFETWNAGILGPVILNGLNEGRR-DLSWQKWSYKIGLKGEALSLHSLTGSSSVEWTEGSF 615
Query: 619 LPKLQPLVWYKTTFDAPAGSEPVAIDFTGMGKGEAWVNGQSIGRYWPTYVSQNGGCTDSC 678
+ + QPL WYKTTF+ PAG+ P+A+D MGKG+ W+N +SIGRYWP Y + G C
Sbjct: 616 VAQRQPLTWYKTTFNRPAGNSPLALDMGSMGKGQVWINDRSIGRYWPAY--KASGTCGEC 673
Query: 679 NYRGAYSSNKCLKNCGKPSQSLYHVPRSWLKSSGNTLVLFEEIGGDPTKISFVTKQLGSS 738
NY G +S KCL NCG+ SQ YHVPRSWL +GN LV+ EE GGDP I V +++ S
Sbjct: 674 NYAGTFSEKKCLSNCGEASQRWYHVPRSWLNPTGNLLVVLEEWGGDPNGIFLVRREV-DS 732
Query: 739 LCSHVTDSHPLPVDMWGSDSKIQRKP-GPVLSLECPNPNQVISSIKFASFGTPLGTCGSF 797
+C+ + + P + S KP P L C P Q ISSIKFASFGTP G CGSF
Sbjct: 733 VCADIYEWQPNLMSWQMQVSGRVNKPLRPKAHLSC-GPGQKISSIKFASFGTPEGVCGSF 791
Query: 798 SRGRCSSARSLSVVRQACVGSKSCSIGVSVNTF-GDPCKGVMKSLAVEASCT 848
G C + +S + ++C+G SCS+ VS F GDPC VMK L+VEA C+
Sbjct: 792 REGGCHAHKSYNAFERSCIGQNSCSVTVSPENFGGDPCPNVMKKLSVEAICS 843
>gi|114217397|dbj|BAF31234.1| beta-D-galactosidase [Persea americana]
Length = 849
Score = 969 bits (2504), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 479/844 (56%), Positives = 589/844 (69%), Gaps = 28/844 (3%)
Query: 15 FVVLATTSFGANVTYDHRAVVIGGKRRVLISGSIHYPRSTPEMWPDLIQKSKDGGLDVIE 74
+VL +VTYD +A++I G+R++LISGSIHYPRSTP+MW L+QK+KDGGLDVI+
Sbjct: 18 LLVLHFQLIQCSVTYDRKAIIINGQRKILISGSIHYPRSTPDMWEGLMQKAKDGGLDVIQ 77
Query: 75 TYVFWNLHEPVRNQYNFEGRYDLVKFVKLVAEAGLYAHLRIGPYVCAEWNFGGFPLWLHF 134
TYVFWN+HEP YNFEGRYDLV+FVK V +AGLY HLRIGPYVCAEWNFGGFP+WL +
Sbjct: 78 TYVFWNVHEPSPGNYNFEGRYDLVRFVKTVQKAGLYMHLRIGPYVCAEWNFGGFPVWLKY 137
Query: 135 IPGIQFRTDNEPFKAEMQRFTAKIVDMMKQEKLYASQGGPIILSQIENEYGNIDSAYGAA 194
+PGI FRTDNEPFK MQ FT KIV MMK E L+ SQGGPIILSQIENEYG+ A GA
Sbjct: 138 VPGISFRTDNEPFKMAMQGFTEKIVQMMKSESLFESQGGPIILSQIENEYGSESKALGAP 197
Query: 195 GKSYIKWAAGMALSLDTGVPWVMCQQSDAPDPIINTCNGFYCDQFTPNSNNKPKMWTENW 254
G +Y+ WAA MA+ L TGVPWVMC++ DAPDP+INTCNGFYCD FTPN KP MWTE W
Sbjct: 198 GHAYMTWAAKMAVGLRTGVPWVMCKEDDAPDPVINTCNGFYCDAFTPNKPYKPTMWTEAW 257
Query: 255 SGWFLSFGGAVPYRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNFDRTSGGPFISTSYDY 314
SGWF FGG V RPVEDLAFAVARF Q+GG+F NYYMYHGGTNF RT+GGPFI+TSYDY
Sbjct: 258 SGWFTEFGGTVHERPVEDLAFAVARFIQKGGSFINYYMYHGGTNFGRTAGGPFITTSYDY 317
Query: 315 DAPLDEYGLIRQPKWGHLKDLHKAIKLCEAALVATDPTYPSLGPNLEATVYKTGSGLCSA 374
DAP+DEYGLIRQPK+GHLK+LH+AIKLCE AL++ DP SLGP ++ V+ +G+G C+A
Sbjct: 318 DAPIDEYGLIRQPKYGHLKELHRAIKLCEPALISADPIVTSLGPYQQSHVFSSGTGGCAA 377
Query: 375 FLANIGTNSDVTVKFNGNSYLLPAWSVSILPDCKNVVFNTAKINSVTLVPSFSRQSLQVA 434
FL+N NS V FN Y LP WS+SILPDC+NVVFNTAK+ Q+ Q+
Sbjct: 378 FLSNYNPNSVARVMFNNMHYSLPPWSISILPDCRNVVFNTAKVGV---------QTSQMH 428
Query: 435 ADSSDAIGSGWSYINEPVGISKDDAF-TKPGLLEQINTTADQSDYLWYSLSTNIKADEPL 493
+ + W +E + D++ T GLLEQ+N T D SDYLWY S +I E
Sbjct: 429 MSAGETKLLSWEMYDEDIASLGDNSMITAVGLLEQLNVTRDTSDYLWYMTSVDISPSESS 488
Query: 494 LEDGSKTVLHVQSLGHALHAFINGKLVGSGYGSSSNAKVTVDFPIALAPGKNTFDLLSLT 553
L G VL VQS GHALH +ING+L GS +GS N + T + + G N LLS+
Sbjct: 489 LRGGRPPVLTVQSAGHALHVYINGQLSGSAHGSRENRRFTFTGDVNMRAGINRIALLSIA 548
Query: 554 VGLQNYGAFYEKTGAGITGPVQLKGSGNGTNIDLSSQQWTYQTGLKGEELNF--PSG-SS 610
V L N G YE T G+ GPV L G G DL+ Q+W+YQ GLKGE +N PSG S
Sbjct: 549 VELPNVGLHYESTNTGVLGPVVLHGLDQGKR-DLTWQKWSYQVGLKGEAMNLVAPSGISY 607
Query: 611 TQWDSKS-TLPKLQPLVWYKTTFDAPAGSEPVAIDFTGMGKGEAWVNGQSIGRYWPTYVS 669
+W S KLQPL WYK F+AP G EP+A+D MGKG+ W+NG+SIGRYW +
Sbjct: 608 VEWMQASFATQKLQPLTWYKAYFNAPGGDEPLALDLGSMGKGQVWINGESIGRYWT--AA 665
Query: 670 QNGGCTDSCNYRGAYSSNKCLKNCGKPSQSLYHVPRSWLKSSGNTLVLFEEIGGDPTKIS 729
NG C + C+Y G Y + KC CG+P+Q YHVPRSWL+ + N LV+FEEIGGD + IS
Sbjct: 666 ANGDC-NHCSYAGTYRAPKCQTGCGQPTQRWYHVPRSWLQPTKNLLVIFEEIGGDASGIS 724
Query: 730 FVTKQLGSSLCSHVTDSHPL----PVDMWGSDSKIQRKPGPVLSLECPNPNQVISSIKFA 785
V + + SS+C+ V++ HP ++ +G ++ R P + L C Q IS+IKFA
Sbjct: 725 LVKRSV-SSVCADVSEWHPTIKNWHIESYGRSEELHR---PKVHLRCAM-GQSISAIKFA 779
Query: 786 SFGTPLGTCGSFSRGRCSSARSLSVVRQACVGSKSCSIGVSVNTF-GDPCKGVMKSLAVE 844
SFGTPLGTCGSF +G C S S +++ + C+G + C++ +S+N F GDPC VMK +AVE
Sbjct: 780 SFGTPLGTCGSFQQGPCHSPNSHAILEKKCIGQQRCAVTISMNNFGGDPCPNVMKRVAVE 839
Query: 845 ASCT 848
A CT
Sbjct: 840 AICT 843
>gi|225444920|ref|XP_002282132.1| PREDICTED: beta-galactosidase [Vitis vinifera]
Length = 836
Score = 967 bits (2501), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 480/848 (56%), Positives = 586/848 (69%), Gaps = 27/848 (3%)
Query: 6 ILLLVLCWGFVVLATTSFGANVTYDHRAVVIGGKRRVLISGSIHYPRSTPEMWPDLIQKS 65
+ +L+ W + A+VTYD R+ +I G+R++LISGSIHYPRSTPEMWPDLIQK+
Sbjct: 11 VFILIFSW------VSHGSASVTYDKRSFIINGQRKILISGSIHYPRSTPEMWPDLIQKA 64
Query: 66 KDGGLDVIETYVFWNLHEPVRNQYNFEGRYDLVKFVKLVAEAGLYAHLRIGPYVCAEWNF 125
KDGGLDVI+TYVFWN HEP R +Y FEGRYDLV+F+K+V AGLY HLRIGPY+CAEWNF
Sbjct: 65 KDGGLDVIQTYVFWNGHEPSRGKYYFEGRYDLVRFIKVVQAAGLYVHLRIGPYICAEWNF 124
Query: 126 GGFPLWLHFIPGIQFRTDNEPFKAEMQRFTAKIVDMMKQEKLYASQGGPIILSQIENEYG 185
GGFP+WL ++PGI FRTDN PFK MQ FT KIVDMMK EKL+ QGGPII+SQIENEYG
Sbjct: 125 GGFPVWLKYVPGIAFRTDNGPFKVAMQGFTQKIVDMMKSEKLFQPQGGPIIMSQIENEYG 184
Query: 186 NIDSAYGAAGKSYIKWAAGMALSLDTGVPWVMCQQSDAPDPIINTCNGFYCDQFTPNSNN 245
++ GA GK+Y KWAA MA+ L TGVPWVMC+Q DAPDP+I+ CNGFYC+ F PN +
Sbjct: 185 PVEYEIGAPGKAYTKWAAEMAVQLGTGVPWVMCKQEDAPDPVIDACNGFYCENFFPNKDY 244
Query: 246 KPKMWTENWSGWFLSFGGAVPYRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNFDRTSGG 305
KPKM+TE W+GW+ FGGA+P RP EDLA++VARF Q G+F NYYMYHGGTNF RT+GG
Sbjct: 245 KPKMFTEAWTGWYTEFGGAIPNRPAEDLAYSVARFIQNRGSFINYYMYHGGTNFGRTAGG 304
Query: 306 PFISTSYDYDAPLDEYGLIRQPKWGHLKDLHKAIKLCEAALVATDPTYPSLGPNLEATVY 365
PFISTSYDYDAP+DEYGL +PKWGHL+DLHKAIKLCE ALV+ DPT LG NLEA VY
Sbjct: 305 PFISTSYDYDAPIDEYGLPSEPKWGHLRDLHKAIKLCEPALVSADPTVTYLGTNLEAHVY 364
Query: 366 KTGSGLCSAFLANIGTNSDVTVKFNGNSYLLPAWSVSILPDCKNVVFNTAKINSVTLVPS 425
K SG C+AFLAN S V F Y LP WSVSILPDCKNVVFNTA+I +
Sbjct: 365 KAKSGACAAFLANYDPKSSAKVTFGNTQYDLPPWSVSILPDCKNVVFNTARIGA------ 418
Query: 426 FSRQSLQVAADSSDAIGSGWSYINEPVGISKDDAFTKPGLLEQINTTADQSDYLWYSLST 485
QS Q+ + S SY E +D T GLLEQIN T D +DYLWY
Sbjct: 419 ---QSSQMKMNPVSTF-SWQSYNEETASAYTEDTTTMDGLLEQINITRDTTDYLWYMTEV 474
Query: 486 NIKADEPLLEDGSKTVLHVQSLGHALHAFINGKLVGSGYGSSSNAKVTVDFPIALAPGKN 545
+IK DE L+ G VL V S GHALH FING+L G+ YG SN KVT + L G N
Sbjct: 475 HIKPDEGFLKTGQYPVLTVMSAGHALHVFINGQLSGTVYGELSNPKVTFSDNVKLTVGTN 534
Query: 546 TFDLLSLTVGLQNYGAFYEKTGAGITGPVQLKGSGNGTNIDLSSQQWTYQTGLKGEELNF 605
LLS+ +GL N G +E AG+ GPV LKG GT +D+SS +W+Y+ GLKGE LN
Sbjct: 535 KISLLSVAMGLPNVGLHFETWNAGVLGPVTLKGLNEGT-VDMSSWKWSYKIGLKGEALNL 593
Query: 606 PS---GSSTQWDSKSTLPKLQPLVWYKTTFDAPAGSEPVAIDFTGMGKGEAWVNGQSIGR 662
+ SS +W S L + QPL WYKTTF+AP G++P+A+D + MGKG+ W+NG+SIGR
Sbjct: 594 QAITGSSSDEWVEGSLLAQKQPLTWYKTTFNAPGGNDPLALDMSSMGKGQIWINGESIGR 653
Query: 663 YWPTYVSQNGGCTDSCNYRGAYSSNKCLKNCGKPSQSLYHVPRSWLKSSGNTLVLFEEIG 722
+WP Y + +G C + CNY G ++ KC CG PSQ YHVPRSWLK SGN L++FEE+G
Sbjct: 654 HWPAYTA-HGNC-NGCNYAGIFNDKKCQTGCGGPSQRWYHVPRSWLKPSGNQLIVFEELG 711
Query: 723 GDPTKISFVTKQLGSSLCSHVTDSHP-LPVDMWGSDSKIQRKPGPVLSLECPNPNQVISS 781
G+P I+ V + + +C+ + + P L SK+ L C P IS
Sbjct: 712 GNPAGITLVKRTM-DRVCADIFEGQPSLKNSQIIGSSKVNSLQSKA-HLWCA-PGLKISK 768
Query: 782 IKFASFGTPLGTCGSFSRGRCSSARSLSVVRQACVGSKSCSIGVSVNTF-GDPCKGVMKS 840
I+FASFG P GTCGSF G C + +S +++ C+G +SCS+ V+ F GDPC G MK
Sbjct: 769 IQFASFGVPQGTCGSFREGSCHAHKSYDALQRNCIGKQSCSVSVAPEVFGGDPCPGSMKK 828
Query: 841 LAVEASCT 848
L+VEA C+
Sbjct: 829 LSVEALCS 836
>gi|297738667|emb|CBI27912.3| unnamed protein product [Vitis vinifera]
Length = 833
Score = 967 bits (2499), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 480/848 (56%), Positives = 586/848 (69%), Gaps = 27/848 (3%)
Query: 6 ILLLVLCWGFVVLATTSFGANVTYDHRAVVIGGKRRVLISGSIHYPRSTPEMWPDLIQKS 65
+ +L+ W + A+VTYD R+ +I G+R++LISGSIHYPRSTPEMWPDLIQK+
Sbjct: 8 VFILIFSW------VSHGSASVTYDKRSFIINGQRKILISGSIHYPRSTPEMWPDLIQKA 61
Query: 66 KDGGLDVIETYVFWNLHEPVRNQYNFEGRYDLVKFVKLVAEAGLYAHLRIGPYVCAEWNF 125
KDGGLDVI+TYVFWN HEP R +Y FEGRYDLV+F+K+V AGLY HLRIGPY+CAEWNF
Sbjct: 62 KDGGLDVIQTYVFWNGHEPSRGKYYFEGRYDLVRFIKVVQAAGLYVHLRIGPYICAEWNF 121
Query: 126 GGFPLWLHFIPGIQFRTDNEPFKAEMQRFTAKIVDMMKQEKLYASQGGPIILSQIENEYG 185
GGFP+WL ++PGI FRTDN PFK MQ FT KIVDMMK EKL+ QGGPII+SQIENEYG
Sbjct: 122 GGFPVWLKYVPGIAFRTDNGPFKVAMQGFTQKIVDMMKSEKLFQPQGGPIIMSQIENEYG 181
Query: 186 NIDSAYGAAGKSYIKWAAGMALSLDTGVPWVMCQQSDAPDPIINTCNGFYCDQFTPNSNN 245
++ GA GK+Y KWAA MA+ L TGVPWVMC+Q DAPDP+I+ CNGFYC+ F PN +
Sbjct: 182 PVEYEIGAPGKAYTKWAAEMAVQLGTGVPWVMCKQEDAPDPVIDACNGFYCENFFPNKDY 241
Query: 246 KPKMWTENWSGWFLSFGGAVPYRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNFDRTSGG 305
KPKM+TE W+GW+ FGGA+P RP EDLA++VARF Q G+F NYYMYHGGTNF RT+GG
Sbjct: 242 KPKMFTEAWTGWYTEFGGAIPNRPAEDLAYSVARFIQNRGSFINYYMYHGGTNFGRTAGG 301
Query: 306 PFISTSYDYDAPLDEYGLIRQPKWGHLKDLHKAIKLCEAALVATDPTYPSLGPNLEATVY 365
PFISTSYDYDAP+DEYGL +PKWGHL+DLHKAIKLCE ALV+ DPT LG NLEA VY
Sbjct: 302 PFISTSYDYDAPIDEYGLPSEPKWGHLRDLHKAIKLCEPALVSADPTVTYLGTNLEAHVY 361
Query: 366 KTGSGLCSAFLANIGTNSDVTVKFNGNSYLLPAWSVSILPDCKNVVFNTAKINSVTLVPS 425
K SG C+AFLAN S V F Y LP WSVSILPDCKNVVFNTA+I +
Sbjct: 362 KAKSGACAAFLANYDPKSSAKVTFGNTQYDLPPWSVSILPDCKNVVFNTARIGA------ 415
Query: 426 FSRQSLQVAADSSDAIGSGWSYINEPVGISKDDAFTKPGLLEQINTTADQSDYLWYSLST 485
QS Q+ + S SY E +D T GLLEQIN T D +DYLWY
Sbjct: 416 ---QSSQMKMNPVSTF-SWQSYNEETASAYTEDTTTMDGLLEQINITRDTTDYLWYMTEV 471
Query: 486 NIKADEPLLEDGSKTVLHVQSLGHALHAFINGKLVGSGYGSSSNAKVTVDFPIALAPGKN 545
+IK DE L+ G VL V S GHALH FING+L G+ YG SN KVT + L G N
Sbjct: 472 HIKPDEGFLKTGQYPVLTVMSAGHALHVFINGQLSGTVYGELSNPKVTFSDNVKLTVGTN 531
Query: 546 TFDLLSLTVGLQNYGAFYEKTGAGITGPVQLKGSGNGTNIDLSSQQWTYQTGLKGEELNF 605
LLS+ +GL N G +E AG+ GPV LKG GT +D+SS +W+Y+ GLKGE LN
Sbjct: 532 KISLLSVAMGLPNVGLHFETWNAGVLGPVTLKGLNEGT-VDMSSWKWSYKIGLKGEALNL 590
Query: 606 PS---GSSTQWDSKSTLPKLQPLVWYKTTFDAPAGSEPVAIDFTGMGKGEAWVNGQSIGR 662
+ SS +W S L + QPL WYKTTF+AP G++P+A+D + MGKG+ W+NG+SIGR
Sbjct: 591 QAITGSSSDEWVEGSLLAQKQPLTWYKTTFNAPGGNDPLALDMSSMGKGQIWINGESIGR 650
Query: 663 YWPTYVSQNGGCTDSCNYRGAYSSNKCLKNCGKPSQSLYHVPRSWLKSSGNTLVLFEEIG 722
+WP Y + +G C + CNY G ++ KC CG PSQ YHVPRSWLK SGN L++FEE+G
Sbjct: 651 HWPAYTA-HGNC-NGCNYAGIFNDKKCQTGCGGPSQRWYHVPRSWLKPSGNQLIVFEELG 708
Query: 723 GDPTKISFVTKQLGSSLCSHVTDSHP-LPVDMWGSDSKIQRKPGPVLSLECPNPNQVISS 781
G+P I+ V + + +C+ + + P L SK+ L C P IS
Sbjct: 709 GNPAGITLVKRTM-DRVCADIFEGQPSLKNSQIIGSSKVNSLQSKA-HLWCA-PGLKISK 765
Query: 782 IKFASFGTPLGTCGSFSRGRCSSARSLSVVRQACVGSKSCSIGVSVNTF-GDPCKGVMKS 840
I+FASFG P GTCGSF G C + +S +++ C+G +SCS+ V+ F GDPC G MK
Sbjct: 766 IQFASFGVPQGTCGSFREGSCHAHKSYDALQRNCIGKQSCSVSVAPEVFGGDPCPGSMKK 825
Query: 841 LAVEASCT 848
L+VEA C+
Sbjct: 826 LSVEALCS 833
>gi|308550948|gb|ADO34788.1| beta-galactosidase STBG3 [Solanum lycopersicum]
Length = 838
Score = 967 bits (2499), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 474/829 (57%), Positives = 586/829 (70%), Gaps = 20/829 (2%)
Query: 25 ANVTYDHRAVVIGGKRRVLISGSIHYPRSTPEMWPDLIQKSKDGGLDVIETYVFWNLHEP 84
A+V+YDHRA+++ G+RR+LISGS+HYPRSTPEMWP +IQK+K+GG+DVI+TYVFWN HEP
Sbjct: 25 ASVSYDHRAIIVNGQRRILISGSVHYPRSTPEMWPGIIQKAKEGGVDVIQTYVFWNGHEP 84
Query: 85 VRNQYNFEGRYDLVKFVKLVAEAGLYAHLRIGPYVCAEWNFGGFPLWLHFIPGIQFRTDN 144
+ +Y FEGRYDLVKF+KLV +AGLY HLR+GPY CAEWNFGGFP+WL ++PGI FRTDN
Sbjct: 85 QQGKYYFEGRYDLVKFIKLVHQAGLYVHLRVGPYACAEWNFGGFPVWLKYVPGISFRTDN 144
Query: 145 EPFKAEMQRFTAKIVDMMKQEKLYASQGGPIILSQIENEYGNIDSAYGAAGKSYIKWAAG 204
PFKA MQ+FTAKIV+MMK E+LY +QGGPIILSQIENEYG ++ GA GKSY +WAA
Sbjct: 145 GPFKAAMQKFTAKIVNMMKAERLYETQGGPIILSQIENEYGPMEWELGAPGKSYAQWAAK 204
Query: 205 MALSLDTGVPWVMCQQSDAPDPIINTCNGFYCDQFTPNSNNKPKMWTENWSGWFLSFGGA 264
MA+ LDTGVPWVMC+Q DAPDPIIN CNGFYCD F+PN KPK+WTE W+ WF FG
Sbjct: 205 MAVGLDTGVPWVMCKQDDAPDPIINACNGFYCDYFSPNKAYKPKIWTEAWTAWFTGFGNP 264
Query: 265 VPYRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNFDRTSGGPFISTSYDYDAPLDEYGLI 324
VPYRP EDLAF+VA+F Q+GG+F NYYMYHGGTNF RT+GGPFI+TSYDYDAPLDEYGL+
Sbjct: 265 VPYRPAEDLAFSVAKFIQKGGSFINYYMYHGGTNFGRTAGGPFIATSYDYDAPLDEYGLL 324
Query: 325 RQPKWGHLKDLHKAIKLCEAALVATDPTYPSLGPNLEATVYKTGSGLCSAFLANIGTNSD 384
RQPKWGHLKDLH+AIKLCE ALV+ DP +LG EA V+++ +G C+AFLAN +S
Sbjct: 325 RQPKWGHLKDLHRAIKLCEPALVSGDPAVTALGHQQEAHVFRSKAGSCAAFLANYDQHSF 384
Query: 385 VTVKFNGNSYLLPAWSVSILPDCKNVVFNTAKINSVTLVPSFSRQSLQVAADSSDAIGSG 444
TV F Y LP WS+SILPDCKN VFNTA+I + QS Q+ + + G
Sbjct: 385 ATVSFANRHYNLPPWSISILPDCKNTVFNTARIGA---------QSAQMKM-TPVSRGLP 434
Query: 445 WSYINEPVGISKDDAFTKPGLLEQINTTADQSDYLWYSLSTNIKADEPLLEDGSKTVLHV 504
W NE +D +FT GLLEQINTT D SDYLWYS I + E L G L +
Sbjct: 435 WQSFNEETSSYEDSSFTVVGLLEQINTTRDVSDYLWYSTDVKIDSREKFLRGGKWPWLTI 494
Query: 505 QSLGHALHAFINGKLVGSGYGSSSNAKVTVDFPIALAPGKNTFDLLSLTVGLQNYGAFYE 564
S GHALH F+NG+L G+ YGS K+T + L G N LLS+ VGL N G +E
Sbjct: 495 MSAGHALHVFVNGQLAGTAYGSLEKPKLTFSKAVNLRAGVNKISLLSIAVGLPNIGPHFE 554
Query: 565 KTGAGITGPVQLKGSGNGTNIDLSSQQWTYQTGLKGEE---LNFPSGSSTQWDSKSTLPK 621
AG+ GPV L G G DL+ Q+W+Y+ GLKGE + SS +W S + +
Sbjct: 555 TWNAGVLGPVSLTGLDEGKR-DLTWQKWSYKVGLKGEALSLHSLSGSSSVEWVEGSLVAQ 613
Query: 622 LQPLVWYKTTFDAPAGSEPVAIDFTGMGKGEAWVNGQSIGRYWPTYVSQNGGCTDSCNYR 681
QPL WYK+TF+APAG++P+A+D MGKG+ W+NGQS+GRYWP Y + +G C +CNY
Sbjct: 614 RQPLTWYKSTFNAPAGNDPLALDLNTMGKGQVWINGQSLGRYWPGYKA-SGNC-GACNYA 671
Query: 682 GAYSSNKCLKNCGKPSQSLYHVPRSWLKSSGNTLVLFEEIGGDPTKISFVTKQLGSSLCS 741
G ++ KCL NCG+ SQ YHVPRSWL +GN LVLFEE GG+P IS V +++ +S+C+
Sbjct: 672 GWFNEKKCLSNCGEASQRWYHVPRSWLYPTGNLLVLFEEWGGEPHGISLVKREV-ASVCA 730
Query: 742 HVTDSHPLPVD-MWGSDSKIQRKPGPVLSLECPNPNQVISSIKFASFGTPLGTCGSFSRG 800
+ + P V+ + K+ + P L C P Q I+SIKFASFGTP G CGSF G
Sbjct: 731 DINEWQPQLVNWQMQASGKVDKPLRPKAHLSCA-PGQKITSIKFASFGTPQGVCGSFREG 789
Query: 801 RCSSARSLSVVRQACVGSKSCSIGVSVNTF-GDPCKGVMKSLAVEASCT 848
C + S + C+G SCS+ V+ F GDPC VMK L+VE C+
Sbjct: 790 SCHAFHSYDAFERYCIGQNSCSVPVTPEIFGGDPCPHVMKKLSVEVICS 838
>gi|297798272|ref|XP_002867020.1| hypothetical protein ARALYDRAFT_491000 [Arabidopsis lyrata subsp.
lyrata]
gi|297312856|gb|EFH43279.1| hypothetical protein ARALYDRAFT_491000 [Arabidopsis lyrata subsp.
lyrata]
Length = 853
Score = 966 bits (2497), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 474/855 (55%), Positives = 590/855 (69%), Gaps = 28/855 (3%)
Query: 3 SKEILLLVLCWGFVVLATTSFGANVTYDHRAVVIGGKRRVLISGSIHYPRSTPEMWPDLI 62
S L+L C G ++L VTYD +A++I G+RR+L SGSIHYPRSTP+MW LI
Sbjct: 6 SASRLILWFCLGLLILGVGFVQCGVTYDRKALLINGQRRILFSGSIHYPRSTPDMWEGLI 65
Query: 63 QKSKDGGLDVIETYVFWNLHEPVRNQYNFEGRYDLVKFVKLVAEAGLYAHLRIGPYVCAE 122
QK+KDGG+DVIETYVFWNLHEP +Y+FEGR DLV+FVK + +AGLYAHLRIGPYVCAE
Sbjct: 66 QKAKDGGIDVIETYVFWNLHEPTPGKYDFEGRNDLVRFVKTIHKAGLYAHLRIGPYVCAE 125
Query: 123 WNFGGFPLWLHFIPGIQFRTDNEPFKAEMQRFTAKIVDMMKQEKLYASQGGPIILSQIEN 182
WNFGGFP+WL ++PGI FRTDNEPFK M+ FT +IV++MK E L+ SQGGPIILSQIEN
Sbjct: 126 WNFGGFPVWLKYVPGISFRTDNEPFKRAMKGFTERIVELMKSENLFESQGGPIILSQIEN 185
Query: 183 EYGNIDSAYGAAGKSYIKWAAGMALSLDTGVPWVMCQQSDAPDPIINTCNGFYCDQFTPN 242
EYG GA G +Y+ WAA MA++ +TGVPWVMC++ DAPDP+INTCNGFYCD F PN
Sbjct: 186 EYGRQGQLLGAEGHNYMTWAAKMAIATETGVPWVMCKEDDAPDPVINTCNGFYCDSFAPN 245
Query: 243 SNNKPKMWTENWSGWFLSFGGAVPYRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNFDRT 302
KP +WTE WSGWF FGG + +RPV+DLAF VARF Q+GG+F NYYMYHGGTNF RT
Sbjct: 246 KPYKPLIWTEAWSGWFTEFGGPMHHRPVQDLAFGVARFIQKGGSFVNYYMYHGGTNFGRT 305
Query: 303 SGGPFISTSYDYDAPLDEYGLIRQPKWGHLKDLHKAIKLCEAALVATDPTYPSLGPNLEA 362
+GGPF++TSYDYDAP+DEYGLIR+PK+GHLK+LH+AIK+CE ALV+ DP S+G +A
Sbjct: 306 AGGPFVTTSYDYDAPIDEYGLIREPKYGHLKELHRAIKMCEKALVSADPVVTSIGNKQQA 365
Query: 363 TVYKTGSGLCSAFLANIGTNSDVTVKFNGNSYLLPAWSVSILPDCKNVVFNTAKINSVTL 422
VY SG CSAFLAN T S V FN Y LP WS+SILPDC+N VFNTAK+
Sbjct: 366 HVYSAESGDCSAFLANYDTESAARVLFNNVHYNLPPWSISILPDCRNAVFNTAKVGV--- 422
Query: 423 VPSFSRQSLQVAADSSDAIGSGW-SYINEPVGISKDDAFTKPGLLEQINTTADQSDYLWY 481
Q+ Q+ +D W SY+ + + FT GLLEQIN T D SDYLWY
Sbjct: 423 ------QTSQMEMLPTDTKNFQWQSYLEDLSSLDDSSTFTTQGLLEQINVTRDTSDYLWY 476
Query: 482 SLSTNIKADEPLLEDGSKTVLHVQSLGHALHAFINGKLVGSGYGSSSNAKVTVDFPIALA 541
S +I E L G L +QS GHA+H F+NG+L GS +G+ N + T I L
Sbjct: 477 MTSVDIGDTESFLHGGELPTLIIQSTGHAVHIFVNGQLSGSAFGTRQNRRFTYQGKINLH 536
Query: 542 PGKNTFDLLSLTVGLQNYGAFYEKTGAGITGPVQLKGSGNGTNIDLSSQQWTYQTGLKGE 601
G N LLS+ VGL N G +E GI GPV L G G DLS Q+WTYQ GLKGE
Sbjct: 537 SGTNRIALLSVAVGLPNVGGHFESWNTGILGPVALHGLSQGKR-DLSWQKWTYQVGLKGE 595
Query: 602 ELN--FPSGS-STQW-DSKSTLPKLQPLVWYKTTFDAPAGSEPVAIDFTGMGKGEAWVNG 657
+N FP+ + S W D+ T+ K QPL W+KT FDAP G+EP+A+D GMGKG+ WVNG
Sbjct: 596 AMNLAFPTNTRSIGWMDASLTVQKPQPLTWHKTYFDAPEGNEPLALDMEGMGKGQIWVNG 655
Query: 658 QSIGRYWPTYVSQNGGCTDSCNYRGAYSSNKCLKNCGKPSQSLYHVPRSWLKSSGNTLVL 717
+SIGRYW + + G C+ C+Y G Y NKC CG+P+Q YHVPRSWLK S N LV+
Sbjct: 656 ESIGRYWTAFAT--GDCSQ-CSYTGTYKPNKCQTGCGQPTQRYYHVPRSWLKPSQNLLVI 712
Query: 718 FEEIGGDPTKISFVTKQLGSSLCSHVTDSHP----LPVDMWGSDSKIQRKPGPVLSLECP 773
FEE+GG+P+ +S V + + S +C+ V++ HP ++ +G R P + L+C
Sbjct: 713 FEELGGNPSSVSLVKRSV-SGVCAEVSEYHPNIKNWQIESYGKGQTFHR---PKVHLKC- 767
Query: 774 NPNQVISSIKFASFGTPLGTCGSFSRGRCSSARSLSVVRQACVGSKSCSIGVSVNTFG-D 832
+P Q I+SIKFASFGTPLGTCGS+ +G C +A S +++ + CVG C++ +S FG D
Sbjct: 768 SPGQAIASIKFASFGTPLGTCGSYQQGECHAATSYAILERKCVGKARCAVTISNTNFGKD 827
Query: 833 PCKGVMKSLAVEASC 847
PC V+K L VEA C
Sbjct: 828 PCPNVLKRLTVEAVC 842
>gi|30690633|ref|NP_849506.1| beta-galactosidase 3 [Arabidopsis thaliana]
gi|332661247|gb|AEE86647.1| beta-galactosidase 3 [Arabidopsis thaliana]
Length = 855
Score = 966 bits (2497), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 476/855 (55%), Positives = 594/855 (69%), Gaps = 29/855 (3%)
Query: 3 SKEILLLVLCWGFVVLATTSFGANVTYDHRAVVIGGKRRVLISGSIHYPRSTPEMWPDLI 62
S L+L C GF++L VTYD +A++I G+RR+L SGSIHYPRSTP+MW DLI
Sbjct: 9 SASRLILWFCLGFLILGVGFVQCGVTYDRKALLINGQRRILFSGSIHYPRSTPDMWEDLI 68
Query: 63 QKSKDGGLDVIETYVFWNLHEPVRNQYNFEGRYDLVKFVKLVAEAGLYAHLRIGPYVCAE 122
QK+KDGG+DVIETYVFWNLHEP +Y+FEGR DLV+FVK + +AGLYAHLRIGPYVCAE
Sbjct: 69 QKAKDGGIDVIETYVFWNLHEPSPGKYDFEGRNDLVRFVKTIHKAGLYAHLRIGPYVCAE 128
Query: 123 WNFGGFPLWLHFIPGIQFRTDNEPFKAEMQRFTAKIVDMMKQEKLYASQGGPIILSQIEN 182
WNFGGFP+WL ++PGI FRTDNEPFK M+ FT +IV++MK E L+ SQGGPIILSQIEN
Sbjct: 129 WNFGGFPVWLKYVPGISFRTDNEPFKRAMKGFTERIVELMKSENLFESQGGPIILSQIEN 188
Query: 183 EYGNIDSAYGAAGKSYIKWAAGMALSLDTGVPWVMCQQSDAPDPIINTCNGFYCDQFTPN 242
EYG GA G +Y+ WAA MA++ +TGVPWVMC++ DAPDP+INTCNGFYCD F PN
Sbjct: 189 EYGRQGQLLGAEGHNYMTWAAKMAIATETGVPWVMCKEDDAPDPVINTCNGFYCDSFAPN 248
Query: 243 SNNKPKMWTENWSGWFLSFGGAVPYRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNFDRT 302
KP +WTE WSGWF FGG + +RPV+DLAF VARF Q+GG+F NYYMYHGGTNF RT
Sbjct: 249 KPYKPLIWTEAWSGWFTEFGGPMHHRPVQDLAFGVARFIQKGGSFVNYYMYHGGTNFGRT 308
Query: 303 SGGPFISTSYDYDAPLDEYGLIRQPKWGHLKDLHKAIKLCEAALVATDPTYPSLGPNLEA 362
+GGPF++TSYDYDAP+DEYGLIRQPK+GHLK+LH+AIK+CE ALV+ DP S+G +A
Sbjct: 309 AGGPFVTTSYDYDAPIDEYGLIRQPKYGHLKELHRAIKMCEKALVSADPVVTSIGNKQQA 368
Query: 363 TVYKTGSGLCSAFLANIGTNSDVTVKFNGNSYLLPAWSVSILPDCKNVVFNTAKINSVTL 422
VY SG CSAFLAN T S V FN Y LP WS+SILPDC+N VFNTAK+
Sbjct: 369 HVYSAESGDCSAFLANYDTESAARVLFNNVHYNLPPWSISILPDCRNAVFNTAKVGV--- 425
Query: 423 VPSFSRQSLQVAADSSDAIGSGW-SYINEPVGISKDDAFTKPGLLEQINTTADQSDYLWY 481
Q+ Q+ +D W SY+ + + FT GLLEQIN T D SDYLWY
Sbjct: 426 ------QTSQMEMLPTDTKNFQWESYLEDLSSLDDSSTFTTHGLLEQINVTRDTSDYLWY 479
Query: 482 SLSTNIKADEPLLEDGSKTVLHVQSLGHALHAFINGKLVGSGYGSSSNAKVTVDFPIALA 541
S +I E L G L +QS GHA+H F+NG+L GS +G+ N + T I L
Sbjct: 480 MTSVDIGDSESFLHGGELPTLIIQSTGHAVHIFVNGQLSGSAFGTRQNRRFTYQGKINLH 539
Query: 542 PGKNTFDLLSLTVGLQNYGAFYEKTGAGITGPVQLKGSGNGTNIDLSSQQWTYQTGLKGE 601
G N LLS+ VGL N G +E GI GPV L G G +DLS Q+WTYQ GLKGE
Sbjct: 540 SGTNRIALLSVAVGLPNVGGHFESWNTGILGPVALHGLSQG-KMDLSWQKWTYQVGLKGE 598
Query: 602 ELN--FPSGS-STQW-DSKSTLPKLQPLVWYKTTFDAPAGSEPVAIDFTGMGKGEAWVNG 657
+N FP+ + S W D+ T+ K QPL W+KT FDAP G+EP+A+D GMGKG+ WVNG
Sbjct: 599 AMNLAFPTNTPSIGWMDASLTVQKPQPLTWHKTYFDAPEGNEPLALDMEGMGKGQIWVNG 658
Query: 658 QSIGRYWPTYVSQNGGCTDSCNYRGAYSSNKCLKNCGKPSQSLYHVPRSWLKSSGNTLVL 717
+SIGRYW + + G C+ C+Y G Y NKC CG+P+Q YHVPR+WLK S N LV+
Sbjct: 659 ESIGRYWTAFAT--GDCSH-CSYTGTYKPNKCQTGCGQPTQRWYHVPRAWLKPSQNLLVI 715
Query: 718 FEEIGGDPTKISFVTKQLGSSLCSHVTDSHP----LPVDMWGSDSKIQRKPGPVLSLECP 773
FEE+GG+P+ +S V + + S +C+ V++ HP ++ +G R P + L+C
Sbjct: 716 FEELGGNPSTVSLVKRSV-SGVCAEVSEYHPNIKNWQIESYGKGQTFHR---PKVHLKC- 770
Query: 774 NPNQVISSIKFASFGTPLGTCGSFSRGRCSSARSLSVVRQACVGSKSCSIGVSVNTFG-D 832
+P Q I+SIKFASFGTPLGTCGS+ +G C +A S +++ + CVG C++ +S + FG D
Sbjct: 771 SPGQAIASIKFASFGTPLGTCGSYQQGECHAATSYAILER-CVGKARCAVTISNSNFGKD 829
Query: 833 PCKGVMKSLAVEASC 847
PC V+K L VEA C
Sbjct: 830 PCPNVLKRLTVEAVC 844
>gi|356561185|ref|XP_003548865.1| PREDICTED: beta-galactosidase 3-like [Glycine max]
Length = 848
Score = 966 bits (2496), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 479/831 (57%), Positives = 587/831 (70%), Gaps = 24/831 (2%)
Query: 25 ANVTYDHRAVVIGGKRRVLISGSIHYPRSTPEMWPDLIQKSKDGGLDVIETYVFWNLHEP 84
A+VTYD +A++I G+RR+L SGSIHYPRSTP+MW DLI K+K+GGLDV+ETYVFWN+HEP
Sbjct: 25 ASVTYDRKAILINGQRRILFSGSIHYPRSTPDMWEDLILKAKEGGLDVVETYVFWNVHEP 84
Query: 85 VRNQYNFEGRYDLVKFVKLVAEAGLYAHLRIGPYVCAEWNFGGFPLWLHFIPGIQFRTDN 144
YNFEGRYDLV+FVK + +AGLYAHLRIGPYVCAEWNFGGFP+WL ++PGI FRTDN
Sbjct: 85 SPGNYNFEGRYDLVRFVKTIQKAGLYAHLRIGPYVCAEWNFGGFPVWLKYVPGISFRTDN 144
Query: 145 EPFKAEMQRFTAKIVDMMKQEKLYASQGGPIILSQIENEYGNIDSAYGAAGKSYIKWAAG 204
EPFK MQ FT KIV MMK E+L+ SQGGPIILSQIENEYG G AG++Y+ WAA
Sbjct: 145 EPFKTAMQGFTEKIVGMMKSERLFESQGGPIILSQIENEYGAQSKLQGDAGQNYVNWAAK 204
Query: 205 MALSLDTGVPWVMCQQSDAPDPIINTCNGFYCDQFTPNSNNKPKMWTENWSGWFLSFGGA 264
MA+ + TGVPWVMC++ DAPDP+INTCNGFYCD+FTPN KP +WTE WSGWF FGG
Sbjct: 205 MAVEMGTGVPWVMCKEDDAPDPVINTCNGFYCDKFTPNRPYKPMIWTEAWSGWFTEFGGP 264
Query: 265 VPYRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNFDRTSGGPFISTSYDYDAPLDEYGLI 324
+ RPV+DLAFAVARF RGG+F NYYMYHGGTNF RT+GGPFI+TSYDYDAPLDEYGLI
Sbjct: 265 IHKRPVQDLAFAVARFIIRGGSFVNYYMYHGGTNFGRTAGGPFIATSYDYDAPLDEYGLI 324
Query: 325 RQPKWGHLKDLHKAIKLCEAALVATDPTYPSLGPNLEATVYKTGSGLCSAFLANIGTNSD 384
RQPK+GHLK+LH+AIK+CE ALV+TDP SLG + +A VY T SG C+AFL+N + S
Sbjct: 325 RQPKYGHLKELHRAIKMCERALVSTDPIITSLGESQQAHVYTTESGDCAAFLSNYDSKSS 384
Query: 385 VTVKFNGNSYLLPAWSVSILPDCKNVVFNTAKINSVTLVPSFSRQSLQVAADSSDAIGSG 444
V FN Y LP WSVSILPDC+NVVFNTAK+ Q+ Q+ ++
Sbjct: 385 ARVMFNNMHYNLPPWSVSILPDCRNVVFNTAKVGV---------QTSQMQMLPTNTQLFS 435
Query: 445 WSYINEPV-GISKDDAFTKPGLLEQINTTADQSDYLWYSLSTNIKADEPLLEDGSKTVLH 503
W +E V + A PGLLEQIN T D SDYLWY S +I + E L G L
Sbjct: 436 WESFDEDVYSVDDSSAIMAPGLLEQINVTKDASDYLWYITSVDIGSSESFLRGGELPTLI 495
Query: 504 VQSLGHALHAFINGKLVGSGYGSSSNAKVTVDFPIALAPGKNTFDLLSLTVGLQNYGAFY 563
VQS GHA+H FING+L GS YG+ + + L G N LLS+ +GL N G +
Sbjct: 496 VQSRGHAVHVFINGQLSGSAYGTREYRRFMYTGKVNLRAGINRIALLSVAIGLPNVGEHF 555
Query: 564 EKTGAGITGPVQLKGSGNGTNIDLSSQQWTYQTGLKGEELNF--PSG-SSTQW-DSKSTL 619
E GI GPV L G G DLS Q+WTYQ GLKGE ++ P+G SS W S +
Sbjct: 556 ESWSTGILGPVALHGLDQG-KWDLSGQKWTYQVGLKGEAMDLASPNGISSVAWMQSAIVV 614
Query: 620 PKLQPLVWYKTTFDAPAGSEPVAIDFTGMGKGEAWVNGQSIGRYWPTYVSQNGGCTDSCN 679
+ QPL W+KT FDAP G EP+A+D GMGKG+ W+NGQSIGRYW T+ + G C D CN
Sbjct: 615 QRNQPLTWHKTHFDAPEGDEPLALDMEGMGKGQIWINGQSIGRYWTTFAT--GNCND-CN 671
Query: 680 YRGAYSSNKCLKNCGKPSQSLYHVPRSWLKSSGNTLVLFEEIGGDPTKISFVTKQLGSSL 739
Y G++ KC CG+P+Q YHVPRSWLK + N LV+FEE+GG+P+KIS V + + SS+
Sbjct: 672 YAGSFRPPKCQLGCGQPTQRWYHVPRSWLKPTQNLLVIFEELGGNPSKISLVKRSV-SSV 730
Query: 740 CSHVTDSHPLPVDMWGSDS--KIQRKPGPVLSLECPNPNQVISSIKFASFGTPLGTCGSF 797
C+ V++ HP + W +S K + P + L C +P Q ISSIKFASFGTPLGTCG++
Sbjct: 731 CADVSEYHP-NIKNWHIESYGKSEEFHPPKVHLHC-SPGQTISSIKFASFGTPLGTCGNY 788
Query: 798 SRGRCSSARSLSVVRQACVGSKSCSIGVSVNTFG-DPCKGVMKSLAVEASC 847
+G C S S +++ + C+G C++ VS + FG DPC V+K L+VEA C
Sbjct: 789 EQGACHSPASYAILEKRCIGKPRCTVTVSNSNFGQDPCPKVLKRLSVEAVC 839
>gi|356502950|ref|XP_003520277.1| PREDICTED: beta-galactosidase 3-like [Glycine max]
Length = 848
Score = 964 bits (2493), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 477/831 (57%), Positives = 588/831 (70%), Gaps = 24/831 (2%)
Query: 25 ANVTYDHRAVVIGGKRRVLISGSIHYPRSTPEMWPDLIQKSKDGGLDVIETYVFWNLHEP 84
A+VTYD +A++I G+RR+L SGSIHYPRSTP+MW DLI K+K+GG+DV+ETYVFWN+HEP
Sbjct: 25 ASVTYDRKALLINGQRRILFSGSIHYPRSTPDMWEDLILKAKEGGIDVVETYVFWNVHEP 84
Query: 85 VRNQYNFEGRYDLVKFVKLVAEAGLYAHLRIGPYVCAEWNFGGFPLWLHFIPGIQFRTDN 144
YNFEGRYDLV+FVK + +AGLYAHLRIGPYVCAEWNFGGFP+WL ++PGI FRTDN
Sbjct: 85 SPGNYNFEGRYDLVRFVKTIQKAGLYAHLRIGPYVCAEWNFGGFPVWLKYVPGISFRTDN 144
Query: 145 EPFKAEMQRFTAKIVDMMKQEKLYASQGGPIILSQIENEYGNIDSAYGAAGKSYIKWAAG 204
EPFK MQ FT KIV MMK E+L+ SQGGPIILSQIENEYG GAAG++Y+ WAA
Sbjct: 145 EPFKRAMQGFTEKIVGMMKSERLFESQGGPIILSQIENEYGAQSKLQGAAGQNYVNWAAK 204
Query: 205 MALSLDTGVPWVMCQQSDAPDPIINTCNGFYCDQFTPNSNNKPKMWTENWSGWFLSFGGA 264
MA+ + TGVPWVMC++ DAPDP+INTCNGFYCD+FTPN KP +WTE WSGWF FGG
Sbjct: 205 MAVEMGTGVPWVMCKEDDAPDPVINTCNGFYCDKFTPNRPYKPMIWTEAWSGWFTEFGGP 264
Query: 265 VPYRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNFDRTSGGPFISTSYDYDAPLDEYGLI 324
+ RPV+DLAFA ARF RGG+F NYYMYHGGTNF RT+GGPFI+TSYDYDAPLDEYGLI
Sbjct: 265 IHKRPVQDLAFAAARFIIRGGSFVNYYMYHGGTNFGRTAGGPFIATSYDYDAPLDEYGLI 324
Query: 325 RQPKWGHLKDLHKAIKLCEAALVATDPTYPSLGPNLEATVYKTGSGLCSAFLANIGTNSD 384
RQPK+GHLK+LH+AIK+CE ALV+TDP SLG +A VY T SG C+AFL+N + S
Sbjct: 325 RQPKYGHLKELHRAIKMCERALVSTDPIVTSLGEFQQAHVYTTESGDCAAFLSNYDSKSS 384
Query: 385 VTVKFNGNSYLLPAWSVSILPDCKNVVFNTAKINSVTLVPSFSRQSLQVAADSSDAIGSG 444
V FN Y LP WSVSILPDC+NVVFNTAK+ Q+ Q+ ++
Sbjct: 385 ARVMFNNMHYSLPPWSVSILPDCRNVVFNTAKVGV---------QTSQMQMLPTNTQLFS 435
Query: 445 WSYINEPV-GISKDDAFTKPGLLEQINTTADQSDYLWYSLSTNIKADEPLLEDGSKTVLH 503
W +E + + + A T PGLLEQIN T D SDYLWY S +I + E L G L
Sbjct: 436 WESFDEDIYSVDESSAITAPGLLEQINVTKDASDYLWYITSVDIGSSESFLRGGELPTLI 495
Query: 504 VQSLGHALHAFINGKLVGSGYGSSSNAKVTVDFPIALAPGKNTFDLLSLTVGLQNYGAFY 563
VQS GHA+H FING+L GS +G+ + T + L G N LLS+ +GL N G +
Sbjct: 496 VQSTGHAVHVFINGQLSGSAFGTREYRRFTYTGKVNLLAGINRIALLSVAIGLPNVGEHF 555
Query: 564 EKTGAGITGPVQLKGSGNGTNIDLSSQQWTYQTGLKGEELNF--PSG-SSTQW-DSKSTL 619
E GI GPV L G G DLS Q+WTYQ GLKGE ++ P+G SS W S +
Sbjct: 556 ESWSTGILGPVALHGLDKG-KWDLSGQKWTYQVGLKGEAMDLASPNGISSVAWMQSAIVV 614
Query: 620 PKLQPLVWYKTTFDAPAGSEPVAIDFTGMGKGEAWVNGQSIGRYWPTYVSQNGGCTDSCN 679
+ QPL W+KT FDAP G EP+A+D GMGKG+ W+NGQSIGRYW + + G C D CN
Sbjct: 615 QRNQPLTWHKTYFDAPEGDEPLALDMEGMGKGQIWINGQSIGRYWTAFAT--GNCND-CN 671
Query: 680 YRGAYSSNKCLKNCGKPSQSLYHVPRSWLKSSGNTLVLFEEIGGDPTKISFVTKQLGSSL 739
Y G++ KC CG+P+Q YHVPRSWLK++ N LV+FEE+GG+P+KIS V + + SS+
Sbjct: 672 YAGSFRPPKCQLGCGQPTQRWYHVPRSWLKTTQNLLVIFEELGGNPSKISLVKRSV-SSV 730
Query: 740 CSHVTDSHPLPVDMWGSDS--KIQRKPGPVLSLECPNPNQVISSIKFASFGTPLGTCGSF 797
C+ V++ HP + W +S K + P + L C +P Q ISSIKFASFGTPLGTCG++
Sbjct: 731 CADVSEYHP-NIKNWHIESYGKSEEFRPPKVHLHC-SPGQTISSIKFASFGTPLGTCGNY 788
Query: 798 SRGRCSSARSLSVVRQACVGSKSCSIGVSVNTFG-DPCKGVMKSLAVEASC 847
+G C S S ++ + C+G C++ VS + FG DPC V+K L+VEA C
Sbjct: 789 EQGACHSPASYVILEKRCIGKPRCTVTVSNSNFGQDPCPKVLKRLSVEAVC 839
>gi|350537661|ref|NP_001234303.1| beta-galactosidase precursor [Solanum lycopersicum]
gi|7939619|gb|AAF70822.1|AF154421_1 beta-galactosidase [Solanum lycopersicum]
gi|4138137|emb|CAA10173.1| ss-galactosidase [Solanum lycopersicum]
Length = 838
Score = 964 bits (2493), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 473/829 (57%), Positives = 586/829 (70%), Gaps = 20/829 (2%)
Query: 25 ANVTYDHRAVVIGGKRRVLISGSIHYPRSTPEMWPDLIQKSKDGGLDVIETYVFWNLHEP 84
A+V+YDHRA+++ G+RR+LISGS+HYPRSTPEMWP +IQK+K+GG+DVI+TYVFWN HEP
Sbjct: 25 ASVSYDHRAIIVNGQRRILISGSVHYPRSTPEMWPGIIQKAKEGGVDVIQTYVFWNGHEP 84
Query: 85 VRNQYNFEGRYDLVKFVKLVAEAGLYAHLRIGPYVCAEWNFGGFPLWLHFIPGIQFRTDN 144
+ +Y FEGRYDLVKF+KLV +AGLY HLR+GPY CAEWNFGGFP+WL ++PGI FRTDN
Sbjct: 85 QQGKYYFEGRYDLVKFIKLVHQAGLYVHLRVGPYACAEWNFGGFPVWLKYVPGISFRTDN 144
Query: 145 EPFKAEMQRFTAKIVDMMKQEKLYASQGGPIILSQIENEYGNIDSAYGAAGKSYIKWAAG 204
PFKA MQ+FTAKIV+MMK E+LY +QGGPIILSQIENEYG ++ GA GKSY +WAA
Sbjct: 145 GPFKAAMQKFTAKIVNMMKAERLYETQGGPIILSQIENEYGPMEWELGAPGKSYAQWAAK 204
Query: 205 MALSLDTGVPWVMCQQSDAPDPIINTCNGFYCDQFTPNSNNKPKMWTENWSGWFLSFGGA 264
MA+ LDTGVPWVMC+Q DAPDPIIN CNGFYCD F+PN KPK+WTE W+ WF FG
Sbjct: 205 MAVGLDTGVPWVMCKQDDAPDPIINACNGFYCDYFSPNKAYKPKIWTEAWTAWFTGFGNP 264
Query: 265 VPYRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNFDRTSGGPFISTSYDYDAPLDEYGLI 324
VPYRP EDLAF+VA+F Q+GG+F NYYMYHGGTNF RT+GGPFI+TSYDYDAPLDEYGL+
Sbjct: 265 VPYRPAEDLAFSVAKFIQKGGSFINYYMYHGGTNFGRTAGGPFIATSYDYDAPLDEYGLL 324
Query: 325 RQPKWGHLKDLHKAIKLCEAALVATDPTYPSLGPNLEATVYKTGSGLCSAFLANIGTNSD 384
RQPKWGHLKDLH+AIKLCE ALV+ DP +LG EA V+++ +G C+AFLAN +S
Sbjct: 325 RQPKWGHLKDLHRAIKLCEPALVSGDPAVTALGHQQEAHVFRSKAGSCAAFLANYDQHSF 384
Query: 385 VTVKFNGNSYLLPAWSVSILPDCKNVVFNTAKINSVTLVPSFSRQSLQVAADSSDAIGSG 444
TV F Y LP WS+SILPDCKN VFNTA+I + QS Q+ + + G
Sbjct: 385 ATVSFANRHYNLPPWSISILPDCKNTVFNTARIGA---------QSAQMKM-TPVSRGLP 434
Query: 445 WSYINEPVGISKDDAFTKPGLLEQINTTADQSDYLWYSLSTNIKADEPLLEDGSKTVLHV 504
W NE +D +FT GLLEQINTT D SDYLWYS I + E L G L +
Sbjct: 435 WQSFNEETSSYEDSSFTVVGLLEQINTTRDVSDYLWYSTDVKIDSREKFLRGGKWPWLTI 494
Query: 505 QSLGHALHAFINGKLVGSGYGSSSNAKVTVDFPIALAPGKNTFDLLSLTVGLQNYGAFYE 564
S GHALH F+NG+L G+ YGS K+T + L G N LLS+ VGL N G +E
Sbjct: 495 MSAGHALHVFVNGQLAGTAYGSLEKPKLTFSKAVNLRAGVNKISLLSIAVGLPNIGPHFE 554
Query: 565 KTGAGITGPVQLKGSGNGTNIDLSSQQWTYQTGLKGEE---LNFPSGSSTQWDSKSTLPK 621
AG+ GPV L G G DL+ Q+W+Y+ GLKGE + SS +W S + +
Sbjct: 555 TWNAGVLGPVSLTGLDEGKR-DLTWQKWSYKVGLKGEALSLHSLSGSSSVEWVEGSLVAQ 613
Query: 622 LQPLVWYKTTFDAPAGSEPVAIDFTGMGKGEAWVNGQSIGRYWPTYVSQNGGCTDSCNYR 681
QPL WYK+TF+APAG++P+A+D MGKG+ W+NGQS+GRYWP Y + +G C +CNY
Sbjct: 614 RQPLTWYKSTFNAPAGNDPLALDLNTMGKGQVWINGQSLGRYWPGYKA-SGNC-GACNYA 671
Query: 682 GAYSSNKCLKNCGKPSQSLYHVPRSWLKSSGNTLVLFEEIGGDPTKISFVTKQLGSSLCS 741
G ++ KCL NCG+ SQ YHVPRSWL +GN LVLFEE GG+P IS V +++ +S+C+
Sbjct: 672 GWFNEKKCLSNCGEASQRWYHVPRSWLYPTGNLLVLFEEWGGEPHGISLVKREV-ASVCA 730
Query: 742 HVTDSHPLPVD-MWGSDSKIQRKPGPVLSLECPNPNQVISSIKFASFGTPLGTCGSFSRG 800
+ + P V+ + K+ + P L C + Q I+SIKFASFGTP G CGSF G
Sbjct: 731 DINEWQPQLVNWQMQASGKVDKPLRPKAHLSCAS-GQKITSIKFASFGTPQGVCGSFREG 789
Query: 801 RCSSARSLSVVRQACVGSKSCSIGVSVNTF-GDPCKGVMKSLAVEASCT 848
C + S + C+G SCS+ V+ F GDPC VMK L+VE C+
Sbjct: 790 SCHAFHSYDAFERYCIGQNSCSVPVTPEIFGGDPCPHVMKKLSVEVICS 838
>gi|356550446|ref|XP_003543598.1| PREDICTED: beta-galactosidase 1-like [Glycine max]
Length = 841
Score = 964 bits (2491), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 483/851 (56%), Positives = 590/851 (69%), Gaps = 26/851 (3%)
Query: 8 LLVLCWGFVVLATTSF----GANVTYDHRAVVIGGKRRVLISGSIHYPRSTPEMWPDLIQ 63
L ++ W +L S A+V+YD +A+ I G+RR+LISGSIHYPRSTPEMWPDLIQ
Sbjct: 7 LKLIMWNVALLLAFSLIGSAKASVSYDSKAITINGQRRILISGSIHYPRSTPEMWPDLIQ 66
Query: 64 KSKDGGLDVIETYVFWNLHEPVRNQYNFEGRYDLVKFVKLVAEAGLYAHLRIGPYVCAEW 123
K+KDGGLDVI+TYVFWN HEP +Y FEG YDLVKF+KLV +AGLY HLRIGPYVCAEW
Sbjct: 67 KAKDGGLDVIQTYVFWNGHEPSPGKYYFEGNYDLVKFIKLVQQAGLYVHLRIGPYVCAEW 126
Query: 124 NFGGFPLWLHFIPGIQFRTDNEPFKAEMQRFTAKIVDMMKQEKLYASQGGPIILSQIENE 183
NFGGFP+WL +IPGI FRTDNEPFK +MQ+FT KIVD+MK E+LY SQGGPII+SQIENE
Sbjct: 127 NFGGFPVWLKYIPGISFRTDNEPFKVQMQKFTTKIVDLMKAERLYESQGGPIIMSQIENE 186
Query: 184 YGNIDSAYGAAGKSYIKWAAGMALSLDTGVPWVMCQQSDAPDPIINTCNGFYCDQFTPNS 243
YG ++ GAAGK+Y KWAA MA+ L TGVPW+MC+Q D PDP+INTCNGFYCD F+PN
Sbjct: 187 YGPMEYEIGAAGKAYTKWAAEMAMELGTGVPWIMCKQDDTPDPLINTCNGFYCDYFSPNK 246
Query: 244 NNKPKMWTENWSGWFLSFGGAVPYRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNFDRTS 303
KPKMWTE W+GWF FGG VP+RP EDLAF+VARF Q+GG+F NYYMYHGGTNF RT+
Sbjct: 247 AYKPKMWTEAWTGWFTEFGGPVPHRPAEDLAFSVARFIQKGGSFINYYMYHGGTNFGRTA 306
Query: 304 GGPFISTSYDYDAPLDEYGLIRQPKWGHLKDLHKAIKLCEAALVATDPTYPSLGPNLEAT 363
GGPFI+TSYDYDAPLDEYGL+RQPKWGHLKDLH+AIKLCE ALV+ DPT +G EA
Sbjct: 307 GGPFIATSYDYDAPLDEYGLLRQPKWGHLKDLHRAIKLCEPALVSGDPTVTKIGNYQEAH 366
Query: 364 VYKTGSGLCSAFLANIGTNSDVTVKFNGNSYLLPAWSVSILPDCKNVVFNTAKINSVTLV 423
V+K+ SG C+AFLAN S TV F Y LP WS+SILP+CKN V+NTA++ S +
Sbjct: 367 VFKSMSGACAAFLANYNPKSYATVAFGNMHYNLPPWSISILPNCKNTVYNTARVGSQSAQ 426
Query: 424 PSFSRQSLQVAADSSDAIGSGWSYINEPVGISKDDAFTKPGLLEQINTTADQSDYLWYSL 483
+R + G W NE + D +FT GLLEQ+NTT D SDYLWYS
Sbjct: 427 MKMTRVPIHG--------GLSWLSFNEETTTTDDSSFTMTGLLEQLNTTRDLSDYLWYST 478
Query: 484 STNIKADEPLLEDGSKTVLHVQSLGHALHAFINGKLVGSGYGSSSNAKVTVDFPIALAPG 543
+ +E L +G VL V S GHALH FING+L G+ YGS K+T + + L G
Sbjct: 479 DVVLDPNEGFLRNGKDPVLTVFSAGHALHVFINGQLSGTAYGSLEFPKLTFNEGVKLRTG 538
Query: 544 KNTFDLLSLTVGLQNYGAFYEKTGAGITGPVQLKGSGNGTNIDLSSQQWTYQTGLKGE-- 601
N LLS+ VGL N G +E AG+ GP+ L G G DLS Q+W+Y+ GLKGE
Sbjct: 539 VNKISLLSVAVGLPNVGPHFETWNAGVLGPISLSGLNEGRR-DLSWQKWSYKVGLKGETL 597
Query: 602 -ELNFPSGSSTQWDSKSTLPKLQPLVWYKTTFDAPAGSEPVAIDFTGMGKGEAWVNGQSI 660
+ SS +W S + + QPL WYKTTFDAP G+ P+A+D MGKG+ W+NGQ++
Sbjct: 598 SLHSLGGSSSVEWIQGSLVSQRQPLTWYKTTFDAPDGTAPLALDMNSMGKGQVWLNGQNL 657
Query: 661 GRYWPTYVSQNGGCTDSCNYRGAYSSNKCLKNCGKPSQSLYHVPRSWLKSSGNTLVLFEE 720
GRYWP Y + G D C+Y G Y+ NKC NCG+ SQ YHVP+SWLK +GN LV+FEE
Sbjct: 658 GRYWPAY--KASGTCDYCDYAGTYNENKCRSNCGEASQRWYHVPQSWLKPTGNLLVVFEE 715
Query: 721 IGGDPTKISFVTKQLGSSLCSHVTDSHP--LPVDMWGSDSKIQRKPGPVLSLECPNPNQV 778
+GGD IS V + + S+C+ + + P + M S R P + L C +P Q
Sbjct: 716 LGGDLNGISLVRRDI-DSVCADIYEWQPNLISYQMQTSGKAPVR---PKVHLSC-SPGQK 770
Query: 779 ISSIKFASFGTPLGTCGSFSRGRCSSARSLSVVRQACVGSKSCSIGVSVNTF-GDPCKGV 837
ISSIKFASFGTP+G+CG+F G C + S + CVG C++ VS F GDPC V
Sbjct: 771 ISSIKFASFGTPVGSCGNFHEGSCHAHMSYDAFERNCVGQNLCTVAVSPENFGGDPCPNV 830
Query: 838 MKSLAVEASCT 848
+K L+VEA C+
Sbjct: 831 LKKLSVEAICS 841
>gi|356526021|ref|XP_003531618.1| PREDICTED: beta-galactosidase 1-like [Glycine max]
Length = 843
Score = 964 bits (2491), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 482/842 (57%), Positives = 583/842 (69%), Gaps = 21/842 (2%)
Query: 15 FVVLATTSFG---ANVTYDHRAVVIGGKRRVLISGSIHYPRSTPEMWPDLIQKSKDGGLD 71
VV A + G A+V+YDH+A++I G+RR+L+SGSIHYPRSTPEMWPDLIQK+K+GGLD
Sbjct: 15 LVVFACSLLGQASASVSYDHKAIIINGQRRILLSGSIHYPRSTPEMWPDLIQKAKEGGLD 74
Query: 72 VIETYVFWNLHEPVRNQYNFEGRYDLVKFVKLVAEAGLYAHLRIGPYVCAEWNFGGFPLW 131
VI+TYVFWN HEP +Y F G YDLV+F+KLV +AGLY +LRIGPYVCAEWNFGGFP+W
Sbjct: 75 VIQTYVFWNGHEPSPGKYYFGGNYDLVRFIKLVQQAGLYVNLRIGPYVCAEWNFGGFPVW 134
Query: 132 LHFIPGIQFRTDNEPFKAEMQRFTAKIVDMMKQEKLYASQGGPIILSQIENEYGNIDSAY 191
L +IPGI FRTDN PFK +M++FT KIVDMMK E+L+ SQGGPIILSQIENEYG ++
Sbjct: 135 LKYIPGISFRTDNGPFKFQMEKFTKKIVDMMKAERLFESQGGPIILSQIENEYGPMEYEI 194
Query: 192 GAAGKSYIKWAAGMALSLDTGVPWVMCQQSDAPDPIINTCNGFYCDQFTPNSNNKPKMWT 251
GA G+SY +WAA MA+ L TGVPW+MC+Q DAPDPIINTCNGFYCD F+PN KPKMWT
Sbjct: 195 GAPGRSYTQWAAHMAVGLGTGVPWIMCKQDDAPDPIINTCNGFYCDYFSPNKAYKPKMWT 254
Query: 252 ENWSGWFLSFGGAVPYRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNFDRTSGGPFISTS 311
E W+GWF FGGAVP+RP EDLAF++ARF Q+GG+F NYYMYHGGTNF RT+GGPFI+TS
Sbjct: 255 EAWTGWFTEFGGAVPHRPAEDLAFSIARFIQKGGSFVNYYMYHGGTNFGRTAGGPFIATS 314
Query: 312 YDYDAPLDEYGLIRQPKWGHLKDLHKAIKLCEAALVATDPTYPSLGPNLEATVYKTGSGL 371
YDYDAPLDEYGL RQPKWGHLKDLH+AIKLCE ALV+ D T LG EA V+++ SG
Sbjct: 315 YDYDAPLDEYGLARQPKWGHLKDLHRAIKLCEPALVSGDSTVQRLGNYEEAHVFRSKSGA 374
Query: 372 CSAFLANIGTNSDVTVKFNGNSYLLPAWSVSILPDCKNVVFNTAKINSVTLVPSFSRQSL 431
C+AFLAN S TV F Y LP WS+SILP+CK+ V+NTA++ S + +R +
Sbjct: 375 CAAFLANYNPQSYATVAFGNQHYNLPPWSISILPNCKHTVYNTARVGSQSTTMKMTRVPI 434
Query: 432 QVAADSSDAIGSGWSYINEPVGISKDDAFTKPGLLEQINTTADQSDYLWYSLSTNIKADE 491
G W NE + D +FT GLLEQIN T D SDYLWYS I ++E
Sbjct: 435 HG--------GLSWKAFNEETTTTDDSSFTVTGLLEQINATRDLSDYLWYSTDVVINSNE 486
Query: 492 PLLEDGSKTVLHVQSLGHALHAFINGKLVGSGYGSSSNAKVTVDFPIALAPGKNTFDLLS 551
L +G VL V S GHALH FIN +L G+ YGS K+T + L G N LLS
Sbjct: 487 GFLRNGKNPVLTVLSAGHALHVFINNQLSGTAYGSLEAPKLTFSESVRLRAGVNKISLLS 546
Query: 552 LTVGLQNYGAFYEKTGAGITGPVQLKGSGNGTNIDLSSQQWTYQTGLKGEELNFPS---G 608
+ VGL N G +E+ AG+ GP+ L G G DL+ Q+W+Y+ GLKGE LN S
Sbjct: 547 VAVGLPNVGPHFERWNAGVLGPITLSGLNEGRR-DLTWQKWSYKVGLKGEALNLHSLSGS 605
Query: 609 SSTQWDSKSTLPKLQPLVWYKTTFDAPAGSEPVAIDFTGMGKGEAWVNGQSIGRYWPTYV 668
SS +W + + QPL WYKTTFDAPAG P+A+D MGKG+ W+NGQS+GRYWP Y
Sbjct: 606 SSVEWLQGFLVSRRQPLTWYKTTFDAPAGVAPLALDMGSMGKGQVWINGQSLGRYWPAY- 664
Query: 669 SQNGGCTDSCNYRGAYSSNKCLKNCGKPSQSLYHVPRSWLKSSGNTLVLFEEIGGDPTKI 728
+ G CNY G Y+ KC NCG+ SQ YHVP SWLK SGN LV+FEE+GGDP I
Sbjct: 665 -KASGSCGYCNYAGTYNEKKCGSNCGEASQRWYHVPHSWLKPSGNLLVVFEELGGDPNGI 723
Query: 729 SFVTKQLGSSLCSHVTDSHPLPVDMWGSDSKIQRKP-GPVLSLECPNPNQVISSIKFASF 787
V + + S+C+ + + P V S R P P L C P Q ISSIKFASF
Sbjct: 724 FLVRRDI-DSVCADIYEWQPNLVSYEMQASGKVRSPVRPKAHLSC-GPGQKISSIKFASF 781
Query: 788 GTPLGTCGSFSRGRCSSARSLSVVRQACVGSKSCSIGVSVNTF-GDPCKGVMKSLAVEAS 846
GTP+G+CGS+ G C + +S + CVG C++ VS F GDPC VMK L+VEA
Sbjct: 782 GTPVGSCGSYREGSCHAHKSYDAFLKNCVGQSWCTVTVSPEIFGGDPCPRVMKKLSVEAI 841
Query: 847 CT 848
CT
Sbjct: 842 CT 843
>gi|157313304|gb|ABV32545.1| beta-galactosidase protein 2 [Prunus persica]
Length = 841
Score = 964 bits (2491), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 479/829 (57%), Positives = 574/829 (69%), Gaps = 18/829 (2%)
Query: 25 ANVTYDHRAVVIGGKRRVLISGSIHYPRSTPEMWPDLIQKSKDGGLDVIETYVFWNLHEP 84
A+V+YD +A+VI G+RR+LISGSIHYPRS+PEMWPDLIQK+K+GGLDVI+TYVFWN HEP
Sbjct: 26 ASVSYDSKAIVINGQRRILISGSIHYPRSSPEMWPDLIQKAKEGGLDVIQTYVFWNGHEP 85
Query: 85 VRNQYNFEGRYDLVKFVKLVAEAGLYAHLRIGPYVCAEWNFGGFPLWLHFIPGIQFRTDN 144
+Y FE YDLVKF+KL+ +AGLY HLRIGPYVCAEWNFGGFP+WL +IPGIQFRTDN
Sbjct: 86 SPGKYYFEDNYDLVKFIKLIQQAGLYVHLRIGPYVCAEWNFGGFPVWLKYIPGIQFRTDN 145
Query: 145 EPFKAEMQRFTAKIVDMMKQEKLYASQGGPIILSQIENEYGNIDSAYGAAGKSYIKWAAG 204
PFKA+MQRFT KIV+MMK E+L+ SQGGPIILSQIENEYG ++ GA GK Y WAA
Sbjct: 146 GPFKAQMQRFTTKIVNMMKAERLFQSQGGPIILSQIENEYGPMEYELGAPGKVYTDWAAH 205
Query: 205 MALSLDTGVPWVMCQQSDAPDPIINTCNGFYCDQFTPNSNNKPKMWTENWSGWFLSFGGA 264
MAL L TGVPWVMC+Q DAPDPIIN CNGFYCD F+PN KPKMWTE W+GW+ FGGA
Sbjct: 206 MALGLGTGVPWVMCKQDDAPDPIINACNGFYCDYFSPNKAYKPKMWTEAWTGWYTEFGGA 265
Query: 265 VPYRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNFDRTSGGPFISTSYDYDAPLDEYGLI 324
VP RP EDLAF+VARF Q+GG+F NYYMYHGGTNF RT+GGPFI+TSYDYDAPLDEYGL+
Sbjct: 266 VPSRPAEDLAFSVARFIQKGGSFINYYMYHGGTNFGRTAGGPFIATSYDYDAPLDEYGLL 325
Query: 325 RQPKWGHLKDLHKAIKLCEAALVATDPTYPSLGPNLEATVYKTGSGLCSAFLANIGTNSD 384
RQPKWGHLKDLH+AIKLCE ALV+ DPT LG EA V+K+ SG C+AFLAN S
Sbjct: 326 RQPKWGHLKDLHRAIKLCEPALVSADPTVTPLGTYQEAHVFKSKSGACAAFLANYNPRSF 385
Query: 385 VTVKFNGNSYLLPAWSVSILPDCKNVVFNTAKINSVTLVPSFSRQSLQVAADSSDAIGSG 444
V F Y LP WS+SILPDCKN V+NTA++ + + R L A
Sbjct: 386 AKVAFGNMHYNLPPWSISILPDCKNTVYNTARVGAQSAQMKMPRVPLHGAFS-------- 437
Query: 445 WSYINEPVGISKDDAFTKPGLLEQINTTADQSDYLWYSLSTNIKADEPLLEDGSKTVLHV 504
W N+ D +FT GLLEQINTT D SDYLWY I +E L G VL +
Sbjct: 438 WQAYNDETATYADTSFTTAGLLEQINTTRDSSDYLWYLTDVKIDPNEEFLRSGKYPVLTI 497
Query: 505 QSLGHALHAFINGKLVGSGYGSSSNAKVTVDFPIALAPGKNTFDLLSLTVGLQNYGAFYE 564
S GHAL FING+L G+ YGS K+T + L G N LLS+ VGL N G +E
Sbjct: 498 LSAGHALRVFINGQLAGTSYGSLEFPKLTFSQGVNLRAGINQIALLSIAVGLPNVGPHFE 557
Query: 565 KTGAGITGPVQLKGSGNGTNIDLSSQQWTYQTGLKGEE---LNFPSGSSTQWDSKSTLPK 621
AG+ GPV L G G DLS Q+W+Y+ GLKGE + SS +W S + +
Sbjct: 558 TWNAGVLGPVILNGLNEGRR-DLSWQKWSYKVGLKGEALSLHSLSGSSSVEWIQGSLVTR 616
Query: 622 LQPLVWYKTTFDAPAGSEPVAIDFTGMGKGEAWVNGQSIGRYWPTYVSQNGGCTDSCNYR 681
QPL WYKTTF+APAG+ P+A+D MGKG+ W+NG+SIGRYWP Y + G +CNY
Sbjct: 617 RQPLTWYKTTFNAPAGNSPLALDMGSMGKGQVWINGRSIGRYWPAY--KASGSCGACNYA 674
Query: 682 GAYSSNKCLKNCGKPSQSLYHVPRSWLKSSGNTLVLFEEIGGDPTKISFVTKQLGSSLCS 741
G+Y KCL NCG+ SQ YHVPR+WL +GN LV+ EE GGDP I V +++ S+C+
Sbjct: 675 GSYHEKKCLSNCGEASQRWYHVPRTWLNPTGNLLVVLEEWGGDPNGIFLVRREI-DSICA 733
Query: 742 HVTDSHP-LPVDMWGSDSKIQRKPGPVLSLECPNPNQVISSIKFASFGTPLGTCGSFSRG 800
+ + P L + K+++ P L C P Q ISSIKFASFGTP G CGSF G
Sbjct: 734 DIYEWQPNLMSWQMQASGKVKKPVRPKAHLSC-GPGQKISSIKFASFGTPEGGCGSFREG 792
Query: 801 RCSSARSLSVVRQACVGSKSCSIGVSVNTF-GDPCKGVMKSLAVEASCT 848
C + S +++C+G SCS+ V+ F GDPC VMK L+VEA C+
Sbjct: 793 SCHAHNSYDAFQRSCIGQNSCSVTVAPENFGGDPCPNVMKKLSVEAICS 841
>gi|356522482|ref|XP_003529875.1| PREDICTED: beta-galactosidase 1-like [Glycine max]
Length = 845
Score = 963 bits (2489), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 476/830 (57%), Positives = 582/830 (70%), Gaps = 20/830 (2%)
Query: 25 ANVTYDHRAVVIGGKRRVLISGSIHYPRSTPEMWPDLIQKSKDGGLDVIETYVFWNLHEP 84
A+V+YDH+A+ I G+RR+L+SGSIHYPRSTPEMWPDLIQK+K+GGLDVI+TYVFWN HEP
Sbjct: 30 ASVSYDHKAITINGQRRILLSGSIHYPRSTPEMWPDLIQKAKEGGLDVIQTYVFWNGHEP 89
Query: 85 VRNQYNFEGRYDLVKFVKLVAEAGLYAHLRIGPYVCAEWNFGGFPLWLHFIPGIQFRTDN 144
+Y F G YDLV+F+KLV +AGLY +LRIGPYVCAEWNFGGFP+WL +IPGI FRTDN
Sbjct: 90 SPGKYYFGGNYDLVRFIKLVQQAGLYVNLRIGPYVCAEWNFGGFPVWLKYIPGISFRTDN 149
Query: 145 EPFKAEMQRFTAKIVDMMKQEKLYASQGGPIILSQIENEYGNIDSAYGAAGKSYIKWAAG 204
PFK +M++FT KIVDMMK E+L+ SQGGPIILSQIENEYG ++ GA G++Y +WAA
Sbjct: 150 GPFKFQMEKFTKKIVDMMKAERLFESQGGPIILSQIENEYGPMEYEIGAPGRAYTQWAAH 209
Query: 205 MALSLDTGVPWVMCQQSDAPDPIINTCNGFYCDQFTPNSNNKPKMWTENWSGWFLSFGGA 264
MA+ L TGVPW+MC+Q DAPDPIINTCNGFYCD F+PN KPKMWTE W+GWF FGGA
Sbjct: 210 MAVGLGTGVPWIMCKQEDAPDPIINTCNGFYCDYFSPNKAYKPKMWTEAWTGWFTEFGGA 269
Query: 265 VPYRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNFDRTSGGPFISTSYDYDAPLDEYGLI 324
VP+RP EDLAF++ARF Q+GG+F NYYMYHGGTNF RT+GGPFI+TSYDYDAPLDEYGL
Sbjct: 270 VPHRPAEDLAFSIARFIQKGGSFVNYYMYHGGTNFGRTAGGPFIATSYDYDAPLDEYGLP 329
Query: 325 RQPKWGHLKDLHKAIKLCEAALVATDPTYPSLGPNLEATVYKTGSGLCSAFLANIGTNSD 384
RQPKWGHLKDLH+AIKLCE ALV+ DPT LG EA V+++ SG C+AFLAN S
Sbjct: 330 RQPKWGHLKDLHRAIKLCEPALVSGDPTVQQLGNYEEAHVFRSKSGACAAFLANYNPQSY 389
Query: 385 VTVKFNGNSYLLPAWSVSILPDCKNVVFNTAKINSVTLVPSFSRQSLQVAADSSDAIGSG 444
TV F Y LP WS+SILP+CK+ V+NTA++ S + +R + G
Sbjct: 390 ATVAFGNQRYNLPPWSISILPNCKHTVYNTARVGSQSTTMKMTRVPIHG--------GLS 441
Query: 445 WSYINEPVGISKDDAFTKPGLLEQINTTADQSDYLWYSLSTNIKADEPLLEDGSKTVLHV 504
W NE + D +FT GLLEQIN T D SDYLWYS I ++E L +G VL V
Sbjct: 442 WKAFNEETTTTDDSSFTVTGLLEQINATRDLSDYLWYSTDVVINSNEGFLRNGKNPVLTV 501
Query: 505 QSLGHALHAFINGKLVGSGYGSSSNAKVTVDFPIALAPGKNTFDLLSLTVGLQNYGAFYE 564
S GHALH FIN +L G+ YGS K+T + L G N LLS+ VGL N G +E
Sbjct: 502 LSAGHALHVFINNQLSGTAYGSLEAPKLTFSESVRLRAGVNKISLLSVAVGLPNVGPHFE 561
Query: 565 KTGAGITGPVQLKGSGNGTNIDLSSQQWTYQTGLKGEELNFPS---GSSTQWDSKSTLPK 621
+ AG+ GP+ L G G DL+ Q+W+Y+ GLKGE LN S SS +W + +
Sbjct: 562 RWNAGVLGPITLSGLNEGRR-DLTWQKWSYKVGLKGEALNLHSLSGSSSVEWLQGFLVSR 620
Query: 622 LQPLVWYKTTFDAPAGSEPVAIDFTGMGKGEAWVNGQSIGRYWPTYVSQNGGCTDSCNYR 681
QPL WYKTTFDAPAG P+A+D MGKG+ W+NGQS+GRYWP Y + G CNY
Sbjct: 621 RQPLTWYKTTFDAPAGVAPLALDMGSMGKGQVWINGQSLGRYWPAY--KASGSCGYCNYA 678
Query: 682 GAYSSNKCLKNCGKPSQSLYHVPRSWLKSSGNTLVLFEEIGGDPTKISFVTKQLGSSLCS 741
G Y+ KC NCG+ SQ YHVP SWLK +GN LV+FEE+GGDP I V + + S+C+
Sbjct: 679 GTYNEKKCGSNCGQASQRWYHVPHSWLKPTGNLLVVFEELGGDPNGIFLVRRDI-DSVCA 737
Query: 742 HVTDSHP--LPVDMWGSDSKIQRKPGPVLSLECPNPNQVISSIKFASFGTPLGTCGSFSR 799
+ + P + DM S K++ P L C P Q ISSIKFASFGTP+G+CG++
Sbjct: 738 DIYEWQPNLVSYDMQAS-GKVRSPVRPKAHLSC-GPGQKISSIKFASFGTPVGSCGNYRE 795
Query: 800 GRCSSARSLSVVRQACVGSKSCSIGVSVNTF-GDPCKGVMKSLAVEASCT 848
G C + +S ++ CVG C++ VS F GDPC VMK L+VEA CT
Sbjct: 796 GSCHAHKSYDAFQKNCVGQSWCTVTVSPEIFGGDPCPSVMKKLSVEAICT 845
>gi|255572957|ref|XP_002527409.1| beta-galactosidase, putative [Ricinus communis]
gi|223533219|gb|EEF34975.1| beta-galactosidase, putative [Ricinus communis]
Length = 845
Score = 962 bits (2487), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 484/851 (56%), Positives = 592/851 (69%), Gaps = 21/851 (2%)
Query: 3 SKEILLLVLCWGFVVLATTSFGANVTYDHRAVVIGGKRRVLISGSIHYPRSTPEMWPDLI 62
+ IL++ L G V + +S +V+YD +A+ I G+RR+LISGSIHYPRS+PEMWPDLI
Sbjct: 11 NNNILVVFLLLGLWVCSVSS---SVSYDSKAITINGQRRILISGSIHYPRSSPEMWPDLI 67
Query: 63 QKSKDGGLDVIETYVFWNLHEPVRNQYNFEGRYDLVKFVKLVAEAGLYAHLRIGPYVCAE 122
QK+K+GGLDVI+TYVFWN HEP +Y FEG YDLVKF+KLV +AGLY HLRIGPYVCAE
Sbjct: 68 QKAKEGGLDVIQTYVFWNGHEPSPGKYYFEGNYDLVKFIKLVKQAGLYVHLRIGPYVCAE 127
Query: 123 WNFGGFPLWLHFIPGIQFRTDNEPFKAEMQRFTAKIVDMMKQEKLYASQGGPIILSQIEN 182
WNFGGFP+WL ++PGI FRTDN PFKA+MQRFT KIV+MMK E+L+ SQGGPIILSQIEN
Sbjct: 128 WNFGGFPVWLKYVPGINFRTDNGPFKAQMQRFTTKIVNMMKAERLFESQGGPIILSQIEN 187
Query: 183 EYGNIDSAYGAAGKSYIKWAAGMALSLDTGVPWVMCQQSDAPDPIINTCNGFYCDQFTPN 242
EYG ++ GA G++Y KWAA MA+ L TGVPWVMC+Q DAPDP+INTCNGFYCD F+PN
Sbjct: 188 EYGPMEYELGAPGQAYSKWAAKMAVGLGTGVPWVMCKQDDAPDPVINTCNGFYCDYFSPN 247
Query: 243 SNNKPKMWTENWSGWFLSFGGAVPYRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNFDRT 302
KPKMWTE W+GWF FGGAVPYRP EDLAF+VARF Q+GG F NYYMYHGGTNF RT
Sbjct: 248 KPYKPKMWTEAWTGWFTEFGGAVPYRPAEDLAFSVARFIQKGGAFINYYMYHGGTNFGRT 307
Query: 303 SGGPFISTSYDYDAPLDEYGLIRQPKWGHLKDLHKAIKLCEAALVATDPTYPSLGPNLEA 362
+GGPFI+TSYDYDAPLDEYGL+RQPKWGHLKDLH+AIKLCE ALV+ P+ LG EA
Sbjct: 308 AGGPFIATSYDYDAPLDEYGLLRQPKWGHLKDLHRAIKLCEPALVSGAPSVMPLGNYQEA 367
Query: 363 TVYKTGSGLCSAFLANIGTNSDVTVKFNGNSYLLPAWSVSILPDCKNVVFNTAKINSVTL 422
V+K+ SG C+AFLAN S V F Y LP WS+SILPDCKN V+NTA+I + +
Sbjct: 368 HVFKSKSGACAAFLANYNQRSFAKVSFGNMHYNLPPWSISILPDCKNTVYNTARIGAQSA 427
Query: 423 VPSFSRQSLQVAADSSDAIGSGWSYINEPVGISKDDAFTKPGLLEQINTTADQSDYLWYS 482
S ++ G W +E D+ F GLLEQINTT D SDYLWYS
Sbjct: 428 RMKMSPIPMRG--------GFSWQAYSEEASTEGDNTFMMVGLLEQINTTRDVSDYLWYS 479
Query: 483 LSTNIKADEPLLEDGSKTVLHVQSLGHALHAFINGKLVGSGYGSSSNAKVTVDFPIALAP 542
I ++E L G VL V S GHALH F+NG+L G+ YGS + K+T + +
Sbjct: 480 TDVRIDSNEGFLRSGKYPVLTVLSAGHALHVFVNGQLSGTAYGSLESPKLTFSQGVKMRA 539
Query: 543 GKNTFDLLSLTVGLQNYGAFYEKTGAGITGPVQLKGSGNGTNIDLSSQQWTYQTGLKGEE 602
G N LLS+ VGL N G +E AG+ GPV L G G DLS Q+WTY+ GL GE
Sbjct: 540 GINRIYLLSIAVGLPNVGPHFETWNAGVLGPVTLNGLNEGRR-DLSWQKWTYKIGLHGEA 598
Query: 603 ---LNFPSGSSTQWDSKSTLPKLQPLVWYKTTFDAPAGSEPVAIDFTGMGKGEAWVNGQS 659
+ SS +W S + + QPL+WYKTTF+APAG+ P+A+D MGKG+ W+NGQS
Sbjct: 599 LSLHSLSGSSSVEWAQGSFVSRKQPLMWYKTTFNAPAGNSPLALDMGSMGKGQVWINGQS 658
Query: 660 IGRYWPTYVSQNGGCTDSCNYRGAYSSNKCLKNCGKPSQSLYHVPRSWLKSSGNTLVLFE 719
+GRYWP Y + +G C CNY G ++ KCL NCG+ SQ YHVPRSWL ++GN LV+FE
Sbjct: 659 VGRYWPAYKA-SGNC-GVCNYAGTFNEKKCLTNCGEASQRWYHVPRSWLNTAGNLLVVFE 716
Query: 720 EIGGDPTKISFVTKQLGSSLCSHVTDSHPLPVD-MWGSDSKIQRKPGPVLSLECPNPNQV 778
E GGDP IS V +++ S+C+ + + P ++ M S K+ + P + L+C Q
Sbjct: 717 EWGGDPNGISLVRREV-DSVCADIYEWQPTLMNYMMQSSGKVNKPLRPKVHLQC-GAGQK 774
Query: 779 ISSIKFASFGTPLGTCGSFSRGRCSSARSLSVVRQACVGSKSCSIGVSVNTF-GDPCKGV 837
IS IKFASFGTP G CGS+ +G C + S + CVG CS+ V+ F GDPC V
Sbjct: 775 ISLIKFASFGTPEGVCGSYRQGSCHAFHSYDAFNRLCVGQNWCSVTVAPEMFGGDPCPNV 834
Query: 838 MKSLAVEASCT 848
MK LAVEA C+
Sbjct: 835 MKKLAVEAVCS 845
>gi|449458175|ref|XP_004146823.1| PREDICTED: beta-galactosidase 1-like [Cucumis sativus]
gi|449515710|ref|XP_004164891.1| PREDICTED: beta-galactosidase 1-like [Cucumis sativus]
Length = 841
Score = 962 bits (2486), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 482/848 (56%), Positives = 584/848 (68%), Gaps = 22/848 (2%)
Query: 6 ILLLVLCWGFVVLATTSFGANVTYDHRAVVIGGKRRVLISGSIHYPRSTPEMWPDLIQKS 65
+++ LC+ F VL S A+V+YD +A++I G RR+LISGSIHYPRST EMWPDLIQK+
Sbjct: 11 VIMGFLCF-FGVL---SVQASVSYDSKAIIINGHRRILISGSIHYPRSTSEMWPDLIQKA 66
Query: 66 KDGGLDVIETYVFWNLHEPVRNQYNFEGRYDLVKFVKLVAEAGLYAHLRIGPYVCAEWNF 125
K+GGLDVIETYVFWN HEP +Y FEG YDLV+FVKLV +AGLY HLRIGPYVCAEWNF
Sbjct: 67 KEGGLDVIETYVFWNGHEPEPGKYYFEGNYDLVRFVKLVHQAGLYVHLRIGPYVCAEWNF 126
Query: 126 GGFPLWLHFIPGIQFRTDNEPFKAEMQRFTAKIVDMMKQEKLYASQGGPIILSQIENEYG 185
GGFP+WL +IPGI FRTDN PFK +M+RFT KIV+MMK E+LY SQGGPIILSQIENEYG
Sbjct: 127 GGFPVWLKYIPGISFRTDNAPFKFQMERFTRKIVNMMKAERLYESQGGPIILSQIENEYG 186
Query: 186 NIDSAYGAAGKSYIKWAAGMALSLDTGVPWVMCQQSDAPDPIINTCNGFYCDQFTPNSNN 245
++ GA GK+Y KWAA MAL L TGVPWVMC+Q DAPDPIINTCNGFYCD F+PN
Sbjct: 187 PMEYELGAPGKAYSKWAAQMALGLGTGVPWVMCKQDDAPDPIINTCNGFYCDYFSPNKAY 246
Query: 246 KPKMWTENWSGWFLSFGGAVPYRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNFDRTSGG 305
KPKMWTE W+GWF FGGAVP+RP ED+AFAVARF Q+GG NYYMYHGGTNF RT+GG
Sbjct: 247 KPKMWTEAWTGWFTQFGGAVPHRPAEDMAFAVARFIQKGGALINYYMYHGGTNFGRTAGG 306
Query: 306 PFISTSYDYDAPLDEYGLIRQPKWGHLKDLHKAIKLCEAALVATDPTYPSLGPNLEATVY 365
PFI+TSYDYDAP+DEYGL+RQPKWGHLKDL++AIKLCE ALV+ DP LG EA V+
Sbjct: 307 PFIATSYDYDAPIDEYGLLRQPKWGHLKDLNRAIKLCEPALVSGDPIVTRLGNYQEAHVF 366
Query: 366 KTGSGLCSAFLANIGTNSDVTVKFNGNSYLLPAWSVSILPDCKNVVFNTAKINSVTLVPS 425
K+ SG C+AFL+N S TV F Y +P WS+SILPDCKN VFNTA++ + T +
Sbjct: 367 KSKSGACAAFLSNYNPRSYATVAFGNMHYNIPPWSISILPDCKNTVFNTARVGAQTAIMK 426
Query: 426 FSRQSLQVAADSSDAIGSGWSYINEPVGISKDDAFTKPGLLEQINTTADQSDYLWYSLST 485
S + + W NE + AFT GLLEQINTT D +DYLWY+
Sbjct: 427 MSPVPMHESFS--------WQAYNEEPASYNEKAFTTVGLLEQINTTRDATDYLWYTTDV 478
Query: 486 NIKADEPLLEDGSKTVLHVQSLGHALHAFINGKLVGSGYGSSSNAKVTVDFPIALAPGKN 545
+I A+E L G VL V S GHA+H F+NG+L G+ YGS K+T + L G N
Sbjct: 479 HIDANEGFLRSGKYPVLTVLSAGHAMHVFVNGQLAGTAYGSLDFPKLTFSRGVNLRAGNN 538
Query: 546 TFDLLSLTVGLQNYGAFYEKTGAGITGPVQLKGSGNGTNIDLSSQQWTYQTGLKGEEL-- 603
LLS+ VGL N G +E AGI GPV L G G DL+ Q+WTY+ GL GE +
Sbjct: 539 KIALLSIAVGLPNVGPHFEMWNAGILGPVNLNGLDEGRR-DLTWQKWTYKIGLDGEAMSL 597
Query: 604 -NFPSGSSTQWDSKSTLPKLQPLVWYKTTFDAPAGSEPVAIDFTGMGKGEAWVNGQSIGR 662
+ SS +W S + + QPL W+KTTF+APAG+ P+A+D MGKG+ W+NGQS+GR
Sbjct: 598 HSLSGSSSVEWIQGSLVAQKQPLTWFKTTFNAPAGNSPLALDMGSMGKGQIWLNGQSLGR 657
Query: 663 YWPTYVSQNGGCTDSCNYRGAYSSNKCLKNCGKPSQSLYHVPRSWLKSSGNTLVLFEEIG 722
YWP Y S G SC+Y G Y+ KC NCG+ SQ YHVPRSWL +GN LV+FEE G
Sbjct: 658 YWPAYKST--GSCGSCDYTGTYNEKKCSSNCGEASQRWYHVPRSWLNPTGNLLVVFEEWG 715
Query: 723 GDPTKISFVTKQLGSSLCSHVTDSHPLPVD-MWGSDSKIQRKPGPVLSLECPNPNQVISS 781
GDP I V + + S+C ++ + P ++ S K+ + P L C P Q ISS
Sbjct: 716 GDPNGIHLVRRDV-DSVCVNINEWQPTLMNWQMQSSGKVNKPLRPKAHLSC-GPGQKISS 773
Query: 782 IKFASFGTPLGTCGSFSRGRCSSARSLSVVRQACVGSKSCSIGVSVNTF-GDPCKGVMKS 840
+KFASFGTP G CGSF G C + S ++ CVG C++ V+ F GDPC VMK
Sbjct: 774 VKFASFGTPEGECGSFREGSCHAHHSYDAFQRTCVGQNFCTVTVAPEMFGGDPCPNVMKK 833
Query: 841 LAVEASCT 848
L+VE C+
Sbjct: 834 LSVEVICS 841
>gi|312283357|dbj|BAJ34544.1| unnamed protein product [Thellungiella halophila]
Length = 856
Score = 961 bits (2485), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 474/855 (55%), Positives = 593/855 (69%), Gaps = 28/855 (3%)
Query: 3 SKEILLLVLCWGFVVLATTSFGANVTYDHRAVVIGGKRRVLISGSIHYPRSTPEMWPDLI 62
S L+L C G ++L VTYD +A++I G+RR+L SGSIHYPRSTP+MW LI
Sbjct: 9 SASRLILWCCLGLLILGVGFVQCGVTYDRKALLINGQRRILFSGSIHYPRSTPDMWEGLI 68
Query: 63 QKSKDGGLDVIETYVFWNLHEPVRNQYNFEGRYDLVKFVKLVAEAGLYAHLRIGPYVCAE 122
QK+KDGG+DVIETYVFWNLHEP +Y+FEGR DLV+FVK + +AGLYAHLRIGPYVCAE
Sbjct: 69 QKAKDGGIDVIETYVFWNLHEPSPGKYDFEGRNDLVRFVKAIHKAGLYAHLRIGPYVCAE 128
Query: 123 WNFGGFPLWLHFIPGIQFRTDNEPFKAEMQRFTAKIVDMMKQEKLYASQGGPIILSQIEN 182
WNFGGFP+WL ++PGI FRTDNEPFK M+ FT +IV++MK E L+ SQGGPIILSQIEN
Sbjct: 129 WNFGGFPVWLKYVPGISFRTDNEPFKRAMKGFTERIVELMKSENLFESQGGPIILSQIEN 188
Query: 183 EYGNIDSAYGAAGKSYIKWAAGMALSLDTGVPWVMCQQSDAPDPIINTCNGFYCDQFTPN 242
EYG GA G +Y+ WAA MA++ +TGVPWVMC++ DAPDP+I+TCNGFYCD F PN
Sbjct: 189 EYGRQGQILGAEGHNYMTWAAKMAIATETGVPWVMCKEDDAPDPVISTCNGFYCDSFAPN 248
Query: 243 SNNKPKMWTENWSGWFLSFGGAVPYRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNFDRT 302
KP +WTE WSGWF FGG + +RPV+DLAFAVARF Q+GG+F NYYMYHGGTNF RT
Sbjct: 249 KPYKPTIWTEAWSGWFTEFGGPMHHRPVQDLAFAVARFIQKGGSFVNYYMYHGGTNFGRT 308
Query: 303 SGGPFISTSYDYDAPLDEYGLIRQPKWGHLKDLHKAIKLCEAALVATDPTYPSLGPNLEA 362
+GGPF++TSYDYDAP+DEYGLIRQPK+GHLK+LH+AIK+CE ALV+TDP SLG +A
Sbjct: 309 AGGPFVTTSYDYDAPIDEYGLIRQPKYGHLKELHRAIKMCEKALVSTDPVVTSLGNKQQA 368
Query: 363 TVYKTGSGLCSAFLANIGTNSDVTVKFNGNSYLLPAWSVSILPDCKNVVFNTAKINSVTL 422
VY + SG CSAFLAN T S V FN Y LP WS+SILPDC+N VFNTAK+
Sbjct: 369 HVYSSESGDCSAFLANYDTESAARVLFNNVHYNLPPWSISILPDCRNAVFNTAKVGV--- 425
Query: 423 VPSFSRQSLQVAADSSDAIGSGW-SYINEPVGISKDDAFTKPGLLEQINTTADQSDYLWY 481
Q+ Q+ + W SY+ + + FT GLLEQIN T D SDYLWY
Sbjct: 426 ------QTSQMEMLPTSTGSFQWQSYLEDLSSLDDSSTFTTQGLLEQINVTRDTSDYLWY 479
Query: 482 SLSTNIKADEPLLEDGSKTVLHVQSLGHALHAFINGKLVGSGYGSSSNAKVTVDFPIALA 541
S +I E L G L +QS GHA+H F+NG+L GS +G+ N + T I L
Sbjct: 480 MTSVDIGETESFLHGGELPTLIIQSTGHAVHIFVNGQLSGSAFGTRQNRRFTYKGKINLH 539
Query: 542 PGKNTFDLLSLTVGLQNYGAFYEKTGAGITGPVQLKGSGNGTNIDLSSQQWTYQTGLKGE 601
G N LLS+ VGL N G +E GI GPV L G G DLS Q+WTYQ GLKGE
Sbjct: 540 SGTNRIALLSVAVGLPNVGGHFESWNTGILGPVALHGLSQGKR-DLSWQKWTYQVGLKGE 598
Query: 602 ELN--FPSGS-STQW-DSKSTLPKLQPLVWYKTTFDAPAGSEPVAIDFTGMGKGEAWVNG 657
+N +P+ + S W D+ T+ K QPL W+KT FDAP G+EP+A+D GMGKG+ WVNG
Sbjct: 599 AMNLAYPTNTPSFGWMDASLTVQKPQPLTWHKTYFDAPEGNEPLALDMEGMGKGQIWVNG 658
Query: 658 QSIGRYWPTYVSQNGGCTDSCNYRGAYSSNKCLKNCGKPSQSLYHVPRSWLKSSGNTLVL 717
+SIGRYW + + + G C+Y G Y NKC CG+P+Q YHVPRSWLK S N LV+
Sbjct: 659 ESIGRYWTAFATGDCG---HCSYTGTYKPNKCNSGCGQPTQKWYHVPRSWLKPSQNLLVI 715
Query: 718 FEEIGGDPTKISFVTKQLGSSLCSHVTDSHP----LPVDMWGSDSKIQRKPGPVLSLECP 773
FEE+GG+P+ +S V + + S +C+ V++ HP ++ +G +R P + L+C
Sbjct: 716 FEELGGNPSTVSLVKRSV-SGVCAEVSEYHPNIKNWQIESYGKGQTFRR---PKVHLKC- 770
Query: 774 NPNQVISSIKFASFGTPLGTCGSFSRGRCSSARSLSVVRQACVGSKSCSIGVSVNTFG-D 832
+P Q IS+IKFASFGTPLGTCGS+ +G C +A S +++ + CVG C++ +S + FG D
Sbjct: 771 SPGQAISAIKFASFGTPLGTCGSYQQGDCHAATSYAILERKCVGKARCAVTISNSNFGKD 830
Query: 833 PCKGVMKSLAVEASC 847
PC V+K L VEA C
Sbjct: 831 PCPNVLKRLTVEAVC 845
>gi|224082924|ref|XP_002306893.1| predicted protein [Populus trichocarpa]
gi|222856342|gb|EEE93889.1| predicted protein [Populus trichocarpa]
Length = 853
Score = 961 bits (2484), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 479/854 (56%), Positives = 585/854 (68%), Gaps = 35/854 (4%)
Query: 6 ILLLVLCWGFVVLATTSFGANVTYDHRAVVIGGKRRVLISGSIHYPRSTPEMWPDLIQKS 65
+ L+VL G ++ T VTYD +A++I G+RR+LISGSIHYPRSTP+MW DL+QK+
Sbjct: 12 LFLMVLIVGSKLIHCT-----VTYDKKAIIIDGQRRILISGSIHYPRSTPDMWEDLVQKA 66
Query: 66 KDGGLDVIETYVFWNLHEPVRNQYNFEGRYDLVKFVKLVAEAGLYAHLRIGPYVCAEWNF 125
KDGGLDVI+TYVFWN+HEP YNFEGR+DLV+F+K V + GLY HLRIGPYVCAEWNF
Sbjct: 67 KDGGLDVIDTYVFWNVHEPSPGNYNFEGRFDLVRFIKTVQKGGLYVHLRIGPYVCAEWNF 126
Query: 126 GGFPLWLHFIPGIQFRTDNEPFKAEMQRFTAKIVDMMKQEKLYASQGGPIILSQIENEYG 185
GGFP+WL ++PGI FRTDN PFKA MQ FT KIV MMK E+L+ SQGGPII SQIENEYG
Sbjct: 127 GGFPVWLKYVPGISFRTDNGPFKAAMQGFTQKIVQMMKDERLFQSQGGPIIFSQIENEYG 186
Query: 186 NIDSAYGAAGKSYIKWAAGMALSLDTGVPWVMCQQSDAPDPIINTCNGFYCDQFTPNSNN 245
A+GAAG SYI WAA MA+ L TGVPWVMC++ DAPDP+INTCNGFYCD F+PN
Sbjct: 187 PESRAFGAAGHSYINWAAQMAVGLKTGVPWVMCKEDDAPDPVINTCNGFYCDAFSPNKPY 246
Query: 246 KPKMWTENWSGWFLSFGGAVPYRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNFDRTSGG 305
KP MWTE WSGWF FGGA +RPV+DLAFAVARF Q+GG+F NYYMYHGGTNF R++GG
Sbjct: 247 KPTMWTEAWSGWFTEFGGAFHHRPVQDLAFAVARFIQKGGSFVNYYMYHGGTNFGRSAGG 306
Query: 306 PFISTSYDYDAPLDEYGLIRQPKWGHLKDLHKAIKLCEAALVATDPTYPSLGPNLEATVY 365
PFI+TSYDYDAP+DEYGLIR+PK+GHLK+LH+AIKLCE LV++DPT LG +A V+
Sbjct: 307 PFITTSYDYDAPIDEYGLIREPKYGHLKELHRAIKLCEHELVSSDPTITLLGTYQQAHVF 366
Query: 366 KTGSGLCSAFLANIGTNSDVTVKFNGNSYLLPAWSVSILPDCKNVVFNTAKI----NSVT 421
+G CSAFLAN T S V FN Y+LP WS+SILPDC+NVVFNTAK+ + V
Sbjct: 367 SSGKRSCSAFLANYHTQSAARVMFNNMHYVLPPWSISILPDCRNVVFNTAKVGVQTSHVQ 426
Query: 422 LVPSFSRQSLQVAADSSDAIGSGWSYINEPVGISKDDAFTKPGLLEQINTTADQSDYLWY 481
++P+ SR S SY + + T GL+EQIN T D +DYLWY
Sbjct: 427 MLPTGSR------------FFSWESYDEDISSLGASSRMTALGLMEQINVTRDTTDYLWY 474
Query: 482 SLSTNIKADEPLLEDGSKTVLHVQSLGHALHAFINGKLVGSGYGSSSNAKVTVDFPIALA 541
S NI E L G L V+S GHALH FING+ GS +G+ N + T P+ L
Sbjct: 475 ITSVNINPSESFLRGGQWPTLTVESAGHALHVFINGQFSGSAFGTRENREFTFTGPVNLR 534
Query: 542 PGKNTFDLLSLTVGLQNYGAFYEKTGAGITGPVQLKGSGNGTNIDLSSQQWTYQTGLKGE 601
G N LLS+ VGL N G YE GI GPV L G G N DL+ QQW+YQ GLKGE
Sbjct: 535 AGTNRIALLSIAVGLPNVGVHYETWKTGILGPVMLHGLNQG-NKDLTWQQWSYQVGLKGE 593
Query: 602 ELNFPS---GSSTQWDSKSTLPKLQPLVWYKTTFDAPAGSEPVAIDFTGMGKGEAWVNGQ 658
+N S SS W S + QPL WYK FDAP G+EP+A+D MGKG+ W+NGQ
Sbjct: 594 AMNLVSPNRASSVDWIQGSLATRQQPLKWYKAYFDAPGGNEPLALDMRSMGKGQVWINGQ 653
Query: 659 SIGRYWPTYVSQNGGCTDSCNYRGAYSSNKCLKNCGKPSQSLYHVPRSWLKSSGNTLVLF 718
SIGRYW +Y G C+ SC Y G + KC CG+P+Q YHVPRSWLK N LV+F
Sbjct: 654 SIGRYWLSYA--KGDCS-SCGYSGTFRPPKCQLGCGQPTQRWYHVPRSWLKPKQNLLVIF 710
Query: 719 EEIGGDPTKISFVTKQLGSSLCSHVTDSHPLPVDMWGSDSKIQRKPG---PVLSLECPNP 775
EE+GGD +KIS V K+ +S+C+ + HP ++ + ++S + + + L C P
Sbjct: 711 EELGGDASKISLV-KRSTTSVCADAFEHHPT-IENYNTESNGESERNLHQAKVHLRCA-P 767
Query: 776 NQVISSIKFASFGTPLGTCGSFSRGRCSSARSLSVVRQACVGSKSCSIGVSVNTFG-DPC 834
Q IS+I FASFGTP GTCGSF G C + S SVV + C+G +SC + +S + FG DPC
Sbjct: 768 GQSISAINFASFGTPTGTCGSFQEGTCHAPNSHSVVEKKCIGRESCMVAISNSNFGADPC 827
Query: 835 KGVMKSLAVEASCT 848
+K L+VEA C+
Sbjct: 828 PSKLKKLSVEAVCS 841
>gi|359474925|ref|XP_002263382.2| PREDICTED: beta-galactosidase 3-like [Vitis vinifera]
gi|297744764|emb|CBI38026.3| unnamed protein product [Vitis vinifera]
Length = 846
Score = 961 bits (2483), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 469/849 (55%), Positives = 588/849 (69%), Gaps = 29/849 (3%)
Query: 8 LLVLCWGFVVLATTSFGANVTYDHRAVVIGGKRRVLISGSIHYPRSTPEMWPDLIQKSKD 67
L+ W F+ + T +VTYD +A++I G+RR+L SGSIHYPRSTP+MW LIQK+KD
Sbjct: 10 FLLCMWVFLCIQLTQ--CSVTYDRKALIINGQRRILFSGSIHYPRSTPQMWEGLIQKAKD 67
Query: 68 GGLDVIETYVFWNLHEPVRNQYNFEGRYDLVKFVKLVAEAGLYAHLRIGPYVCAEWNFGG 127
GGLD I+TYVFWNLHEP +YNFEGRYDLV+F+KL+ +AGLY HLRIGPY+CAEWNFGG
Sbjct: 68 GGLDAIDTYVFWNLHEPSPGKYNFEGRYDLVRFIKLIQKAGLYVHLRIGPYICAEWNFGG 127
Query: 128 FPLWLHFIPGIQFRTDNEPFKAEMQRFTAKIVDMMKQEKLYASQGGPIILSQIENEYGNI 187
FP+WL F+PG+ FRTDNEPFK MQRFT KIV MMK EKL+ SQGGPII+SQIENEYG+
Sbjct: 128 FPVWLKFVPGVSFRTDNEPFKMAMQRFTQKIVQMMKNEKLFESQGGPIIISQIENEYGHE 187
Query: 188 DSAYGAAGKSYIKWAAGMALSLDTGVPWVMCQQSDAPDPIINTCNGFYCDQFTPNSNNKP 247
A+GA G +Y+ WAA MA+++DTGVPWVMC++ DAPDP+INTCNGFYCD F+PN NKP
Sbjct: 188 SRAFGAPGYAYLTWAAKMAVAMDTGVPWVMCKEDDAPDPVINTCNGFYCDYFSPNKPNKP 247
Query: 248 KMWTENWSGWFLSFGGAVPYRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNFDRTSGGPF 307
+WTE WSGWF F G + RPVEDL+FAV RF Q+GG+F NYYMYHGGTNF RT+GGPF
Sbjct: 248 TLWTEAWSGWFTEFAGPIQQRPVEDLSFAVTRFIQKGGSFVNYYMYHGGTNFGRTAGGPF 307
Query: 308 ISTSYDYDAPLDEYGLIRQPKWGHLKDLHKAIKLCEAALVATDPTYPSLGPNLEATVYKT 367
I+TSYDYDAP+DEYGLIRQPK+GHLK+LHKAIKLCE AL++ DP SLG +A V+ +
Sbjct: 308 ITTSYDYDAPIDEYGLIRQPKYGHLKELHKAIKLCERALLSADPAETSLGTYAKAQVFYS 367
Query: 368 GSGLCSAFLANIGTNSDVTVKFNGNSYLLPAWSVSILPDCKNVVFNTAKINSVTLVPSFS 427
SG C+AFL+N S V FN Y L WS+SILPDCKNVVFNTA + T
Sbjct: 368 ESGGCAAFLSNYNPTSAARVTFNSMHYNLAPWSISILPDCKNVVFNTATVGVQT------ 421
Query: 428 RQSLQVAADSSDAIGSGWSYINEPVGISKDDA-FTKPGLLEQINTTADQSDYLWYSLSTN 486
+Q+ +S+ + W NE + + DD+ T GLLEQ+N T D SDYLWYS +
Sbjct: 422 -SQMQMLPTNSELL--SWETFNEDISSADDDSTITVVGLLEQLNVTRDTSDYLWYSTRID 478
Query: 487 IKADEPLLEDGSKTVLHVQSLGHALHAFINGKLVGSGYGSSSNAKVTVDFPIALAPGKNT 546
I + E L G L VQS GHA+H FING L GS +G+ + + T + L G N
Sbjct: 479 ISSSESFLHGGQHPTLIVQSTGHAMHVFINGHLSGSAFGTREDRRFTFTGDVNLQTGSNI 538
Query: 547 FDLLSLTVGLQNYGAFYEKTGAGITGPVQLKGSGNGTNIDLSSQQWTYQTGLKGEELNFP 606
+LS+ VGL N G +E G+ GPV L G G DLS Q+W+YQ GLKGE +N
Sbjct: 539 ISVLSIAVGLPNNGPHFETWSTGVLGPVVLHGLDEGKK-DLSWQKWSYQVGLKGEAMNLV 597
Query: 607 SG---SSTQWDSKSTLP-KLQPLVWYKTTFDAPAGSEPVAIDFTGMGKGEAWVNGQSIGR 662
S S+ W S K QPL WYK FDAP G EP+A+D MGKG+ W+NGQSIGR
Sbjct: 598 SPNVISNIDWMKGSLFAQKQQPLTWYKAYFDAPDGDEPLALDMGSMGKGQVWINGQSIGR 657
Query: 663 YWPTYVSQNGGCTDSCNYRGAYSSNKCLKNCGKPSQSLYHVPRSWLKSSGNTLVLFEEIG 722
YW Y G C+ C+Y G + + KC CG+P+Q YHVPRSWLK + N LVLFEE+G
Sbjct: 658 YWTAYA--KGNCS-GCSYSGTFRTTKCQFGCGQPTQRWYHVPRSWLKPTQNLLVLFEELG 714
Query: 723 GDPTKISFVTKQLGSSLCSHVTDSHPLPVDMWGSDSKIQRKP----GPVLSLECPNPNQV 778
GD +KISF+ + + +++C+ V++ HP + W +S Q +P P + L C + Q
Sbjct: 715 GDASKISFMKRSV-TTVCAEVSEHHP-NIKNWHIES--QERPEEMSKPKVHLHCAS-GQS 769
Query: 779 ISSIKFASFGTPLGTCGSFSRGRCSSARSLSVVRQACVGSKSCSIGVSVNTFGDPCKGVM 838
IS+IKFASFGTP GTCG+F +G C + S +V+ + C+G + CS+ VS + F +PC +
Sbjct: 770 ISAIKFASFGTPSGTCGNFQKGTCHAPTSQAVLEKKCIGQQKCSVAVSSSNFANPCPNMF 829
Query: 839 KSLAVEASC 847
K L+VEA C
Sbjct: 830 KKLSVEAVC 838
>gi|357483611|ref|XP_003612092.1| Beta-galactosidase [Medicago truncatula]
gi|355513427|gb|AES95050.1| Beta-galactosidase [Medicago truncatula]
Length = 843
Score = 959 bits (2479), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 483/843 (57%), Positives = 584/843 (69%), Gaps = 26/843 (3%)
Query: 15 FVVLATTSFGA---NVTYDHRAVVIGGKRRVLISGSIHYPRSTPEMWPDLIQKSKDGGLD 71
F+ ++ T F A +VTYD +A++I G+RR+L SGSIHYPRSTP+MW DLI K+K+GGLD
Sbjct: 11 FLFVSLTLFLAVYSDVTYDRKAIIINGQRRILFSGSIHYPRSTPDMWEDLIYKAKEGGLD 70
Query: 72 VIETYVFWNLHEPVRNQYNFEGRYDLVKFVKLVAEAGLYAHLRIGPYVCAEWNFGGFPLW 131
VIETYVFWN+HEP YNFEGR DLV+F++ V +AGLYAHLRIGPYVCAEWNFGGFP+W
Sbjct: 71 VIETYVFWNVHEPSPGNYNFEGRNDLVRFIQTVHKAGLYAHLRIGPYVCAEWNFGGFPVW 130
Query: 132 LHFIPGIQFRTDNEPFKAEMQRFTAKIVDMMKQEKLYASQGGPIILSQIENEYGNIDSAY 191
L ++PGI FR DNEPFK MQ FT KIV MMK E+LY SQGGPIILSQIENEYG
Sbjct: 131 LKYVPGISFRQDNEPFKKAMQGFTEKIVGMMKSERLYESQGGPIILSQIENEYGAQSKML 190
Query: 192 GAAGKSYIKWAAGMALSLDTGVPWVMCQQSDAPDPIINTCNGFYCDQFTPNSNNKPKMWT 251
G G +Y+ WAA MA+ + TGVPW+MC++ DAPDP+INTCNGFYCD+FTPN KP MWT
Sbjct: 191 GPVGYNYMSWAAKMAVEMGTGVPWIMCKEDDAPDPVINTCNGFYCDKFTPNKPYKPTMWT 250
Query: 252 ENWSGWFLSFGGAVPYRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNFDRTSGGPFISTS 311
E WSGWF FGG + RPV+DLAFAVARF Q+GG+F NYYMYHGGTNF RT+GGPFI+TS
Sbjct: 251 EAWSGWFSEFGGPIHKRPVQDLAFAVARFIQKGGSFVNYYMYHGGTNFGRTAGGPFITTS 310
Query: 312 YDYDAPLDEYGLIRQPKWGHLKDLHKAIKLCEAALVATDPTYPSLGPNLEATVYKTGSGL 371
YDYDAPLDEYGLIRQPK+GHLK+LHKAIK+CE AL++TDP SLG +A VY T SG
Sbjct: 311 YDYDAPLDEYGLIRQPKYGHLKELHKAIKMCEKALISTDPVVTSLGNFQQAYVYTTESGD 370
Query: 372 CSAFLANIGTNSDVTVKFNGNSYLLPAWSVSILPDCKNVVFNTAKINSVTLVPSFSRQSL 431
CSAFL+N + S V FN Y LP WSVSILPDC+N VFNTAK+ T +
Sbjct: 371 CSAFLSNYDSKSSARVMFNNMHYNLPPWSVSILPDCRNAVFNTAKVGVQT-------SQM 423
Query: 432 QVAADSSDAIGSGWSYINEPVGISKDDAFTKPGLLEQINTTADQSDYLWYSLSTNIKADE 491
Q+ +S+ W E S T GLLEQIN T D SDYLWY S ++ + E
Sbjct: 424 QMLPTNSERF--SWESFEEDTSSSSATTITASGLLEQINVTRDTSDYLWYITSVDVGSSE 481
Query: 492 PLLEDGSKTVLHVQSLGHALHAFINGKLVGSGYGSSSNAKVTVDFPIALAPGKNTFDLLS 551
L G L VQS GHA+H FING+L GS YG+ + + + L G NT LLS
Sbjct: 482 SFLHGGKLPSLIVQSTGHAVHVFINGRLSGSAYGTREDRRFRYTGDVNLRAGTNTIALLS 541
Query: 552 LTVGLQNYGAFYEKTGAGITGPVQLKGSGNGTNIDLSSQQWTYQTGLKGEELNF--PSG- 608
+ VGL N G +E GI GPV + G G +DLS Q+WTYQ GLKGE +N P G
Sbjct: 542 VAVGLPNVGGHFETWNTGILGPVVIHGLDKG-KLDLSWQKWTYQVGLKGEAMNLASPDGI 600
Query: 609 SSTQW-DSKSTLPKLQPLVWYKTTFDAPAGSEPVAIDFTGMGKGEAWVNGQSIGRYWPTY 667
SS +W S + + QPL W+KT FDAP G EP+A+D GMGKG+ W+NG SIGRYW
Sbjct: 601 SSVEWMQSAVVVQRNQPLTWHKTFFDAPEGEEPLALDMDGMGKGQIWINGISIGRYWTAI 660
Query: 668 VSQNGGCTDSCNYRGAYSSNKCLKNCGKPSQSLYHVPRSWLKSSGNTLVLFEEIGGDPTK 727
+ G C D CNY G++ KC CG+P+Q YHVPRSWLK + N LV+FEE+GGDP+K
Sbjct: 661 AT--GSCND-CNYAGSFRPPKCQLGCGQPTQRWYHVPRSWLKQNHNLLVVFEELGGDPSK 717
Query: 728 ISFVTKQLGSSLCSHVTDSHPLPVDMWGSDS--KIQRKPGPVLSLECPNPNQVISSIKFA 785
IS + + SS+C+ V++ HP + W DS K + P + L C NP Q ISSIKFA
Sbjct: 718 ISLAKRSV-SSVCADVSEYHP-NLKNWHIDSYGKSENFRPPKVHLHC-NPGQAISSIKFA 774
Query: 786 SFGTPLGTCGSFSRGRCSSARSLSVVRQACVGSKSCSIGVSVNTFG-DPCKGVMKSLAVE 844
SFGTPLGTCGS+ +G C S+ S ++ Q C+G C + VS + FG DPC V+K L+VE
Sbjct: 775 SFGTPLGTCGSYEQGACHSSSSYDILEQKCIGKPRCIVTVSNSNFGRDPCPNVLKRLSVE 834
Query: 845 ASC 847
A C
Sbjct: 835 AVC 837
>gi|18403090|ref|NP_565755.1| beta galactosidase 9 [Arabidopsis thaliana]
gi|75265632|sp|Q9SCV3.1|BGAL9_ARATH RecName: Full=Beta-galactosidase 9; Short=Lactase 9; Flags:
Precursor
gi|6686890|emb|CAB64745.1| putative beta-galactosidase [Arabidopsis thaliana]
gi|20197062|gb|AAC04500.2| putative beta-galactosidase [Arabidopsis thaliana]
gi|330253650|gb|AEC08744.1| beta galactosidase 9 [Arabidopsis thaliana]
Length = 887
Score = 959 bits (2478), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 478/869 (55%), Positives = 592/869 (68%), Gaps = 35/869 (4%)
Query: 7 LLLVLCWGFVVLATTSFGA-NVTYDHRAVVIGGKRRVLISGSIHYPRSTPEMWPDLIQKS 65
L++ L F +L+ + F NV+YDHRA++I GKRR+L+S IHYPR+TPEMW DLI KS
Sbjct: 17 LIIALLVYFPILSGSYFKPFNVSYDHRALIIAGKRRMLVSAGIHYPRATPEMWSDLIAKS 76
Query: 66 KDGGLDVIETYVFWNLHEPVRNQYNFEGRYDLVKFVKLVAEAGLYAHLRIGPYVCAEWNF 125
K+GG DV++TYVFWN HEPV+ QYNFEGRYDLVKFVKL+ +GLY HLRIGPYVCAEWNF
Sbjct: 77 KEGGADVVQTYVFWNGHEPVKGQYNFEGRYDLVKFVKLIGSSGLYLHLRIGPYVCAEWNF 136
Query: 126 GGFPLWLHFIPGIQFRTDNEPFKAEMQRFTAKIVDMMKQEKLYASQGGPIILSQIENEYG 185
GGFP+WL IPGI+FRTDNEPFK EMQ+F KIVD+M++ KL+ QGGPII+ QIENEYG
Sbjct: 137 GGFPVWLRDIPGIEFRTDNEPFKKEMQKFVTKIVDLMREAKLFCWQGGPIIMLQIENEYG 196
Query: 186 NIDSAYGAAGKSYIKWAAGMALSLDTGVPWVMCQQSDAPDPIINTCNGFYCDQFTPNSNN 245
+++ +YG GK Y+KWAA MAL L GVPWVMC+Q+DAP+ II+ CNG+YCD F PNS
Sbjct: 197 DVEKSYGQKGKDYVKWAASMALGLGAGVPWVMCKQTDAPENIIDACNGYYCDGFKPNSRT 256
Query: 246 KPKMWTENWSGWFLSFGGAVPYRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNFDRTSGG 305
KP +WTE+W GW+ +GG++P+RP EDLAFAVARF+QRGG+FQNYYMY GGTNF RTSGG
Sbjct: 257 KPVLWTEDWDGWYTKWGGSLPHRPAEDLAFAVARFYQRGGSFQNYYMYFGGTNFGRTSGG 316
Query: 306 PFISTSYDYDAPLDEYGLIRQPKWGHLKDLHKAIKLCEAALVATD-PTYPSLGPNLEATV 364
PF TSYDYDAPLDEYGL +PKWGHLKDLH AIKLCE ALVA D P Y LG EA +
Sbjct: 317 PFYITSYDYDAPLDEYGLRSEPKWGHLKDLHAAIKLCEPALVAADAPQYRKLGSKQEAHI 376
Query: 365 Y----KTGSGLCSAFLANIGTNSDVTVKFNGNSYLLPAWSVSILPDCKNVVFNTAKINSV 420
Y +TG +C+AFLANI + VKFNG SY LP WSVSILPDC++V FNTAK+ +
Sbjct: 377 YHGDGETGGKVCAAFLANIDEHKSAHVKFNGQSYTLPPWSVSILPDCRHVAFNTAKVGAQ 436
Query: 421 TLV-------PSFSRQSLQ---VAADSSDAIGSGWSYINEPVGISKDDAFTKPGLLEQIN 470
T V PS S+ V D+ I W + EP+GI ++ FT GLLE +N
Sbjct: 437 TSVKTVESARPSLGSMSILQKVVRQDNVSYISKSWMALKEPIGIWGENNFTFQGLLEHLN 496
Query: 471 TTADQSDYLWYSLSTNIKADEPLL--EDGSKTVLHVQSLGHALHAFINGKLVGSGYGSSS 528
T D+SDYLW+ ++ D+ ++G + + + S+ L F+N +L GS G
Sbjct: 497 VTKDRSDYLWHKTRISVSEDDISFWKKNGPNSTVSIDSMRDVLRVFVNKQLAGSIVGHWV 556
Query: 529 NAKVTVDFPIALAPGKNTFDLLSLTVGLQNYGAFYEKTGAGITGPVQLKGSGNGTNIDLS 588
A P+ G N LL+ TVGLQNYGAF EK GAG G +L G NG ++DLS
Sbjct: 557 KAVQ----PVRFIQGNNDLLLLTQTVGLQNYGAFLEKDGAGFRGKAKLTGFKNG-DLDLS 611
Query: 589 SQQWTYQTGLKGEE---LNFPSGSSTQWDSKSTLPKLQPLVWYKTTFDAPAGSEPVAIDF 645
WTYQ GLKGE +W + T +WYKT FD PAG++PV ++
Sbjct: 612 KSSWTYQVGLKGEADKIYTVEHNEKAEWSTLETDASPSIFMWYKTYFDPPAGTDPVVLNL 671
Query: 646 TGMGKGEAWVNGQSIGRYWPTYVSQNGGCTDSCNYRGAYSSNKCLKNCGKPSQSLYHVPR 705
MG+G+AWVNGQ IGRYW +SQ GC +C+YRGAY+S+KC NCGKP+Q+ YHVPR
Sbjct: 672 ESMGRGQAWVNGQHIGRYW-NIISQKDGCDRTCDYRGAYNSDKCTTNCGKPTQTRYHVPR 730
Query: 706 SWLKSSGNTLVLFEEIGGDPTKISFVTKQLGSSLCSHVTDSHPLPVDMWGSDSKIQ---- 761
SWLK S N LVLFEE GG+P KIS T G LC V++SH P+ W + I
Sbjct: 731 SWLKPSSNLLVLFEETGGNPFKISVKTVTAG-ILCGQVSESHYPPLRKWSTPDYINGTMS 789
Query: 762 -RKPGPVLSLECPNPNQVISSIKFASFGTPLGTCGSFSRGRCSSARSLSVVRQACVGSKS 820
P + L C + VISSI+FAS+GTP G+C FS G+C ++ SLS+V +AC G S
Sbjct: 790 INSVAPEVHLHCED-GHVISSIEFASYGTPRGSCDGFSIGKCHASNSLSIVSEACKGRNS 848
Query: 821 CSIGVSVNTF-GDPCKGVMKSLAVEASCT 848
C I VS F DPC G +K+LAV + C+
Sbjct: 849 CFIEVSNTAFISDPCSGTLKTLAVMSRCS 877
>gi|356540789|ref|XP_003538867.1| PREDICTED: beta-galactosidase 3-like [Glycine max]
Length = 853
Score = 959 bits (2478), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 479/834 (57%), Positives = 584/834 (70%), Gaps = 28/834 (3%)
Query: 26 NVTYDHRAVVIGGKRRVLISGSIHYPRSTPEMWPDLIQKSKDGGLDVIETYVFWNLHEPV 85
+VTYD +A++I G+RR+L SGSIHYPRSTP+MW DLI K+K+GGLDVIETY+FWN+HEP
Sbjct: 31 SVTYDRKAILINGQRRILFSGSIHYPRSTPDMWEDLIYKAKEGGLDVIETYIFWNVHEPS 90
Query: 86 RNQYNFEGRYDLVKFVKLVAEAGLYAHLRIGPYVCAEWNFGGFPLWLHFIPGIQFRTDNE 145
R YNFEGRYDLV+FVK + +AGLYAHLRIGPYVCAEWNFGGFP+WL ++PGI FRTDNE
Sbjct: 91 RGNYNFEGRYDLVRFVKTIQKAGLYAHLRIGPYVCAEWNFGGFPVWLKYVPGISFRTDNE 150
Query: 146 PFKAEMQRFTAKIVDMMKQEKLYASQGGPIILSQIENEYGNIDSAYGAAGKSYIKWAAGM 205
PFK MQ FT KIV MMK E+LY SQGGPIILSQIENEYG G AG++Y+ WAA M
Sbjct: 151 PFKKAMQGFTEKIVGMMKSERLYESQGGPIILSQIENEYGAQSKLLGPAGQNYVNWAAKM 210
Query: 206 ALSLDTGVPWVMCQQSDAPDPIINTCNGFYCDQFTPNSNNKPKMWTENWSGWFLSFGGAV 265
A+ TGVPWVMC++ DAPDP+INTCNGFYCD FTPN KP +WTE WSGWF FGG
Sbjct: 211 AVETGTGVPWVMCKEDDAPDPVINTCNGFYCDYFTPNKPYKPSIWTEAWSGWFSEFGGPN 270
Query: 266 PYRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNFDRTSGGPFISTSYDYDAPLDEYGLIR 325
RPV+DLAF VARF Q+GG+F NYYMYHGGTNF RT+GGPFI+TSYDYDAPLDEYGLIR
Sbjct: 271 HERPVQDLAFGVARFIQKGGSFVNYYMYHGGTNFGRTAGGPFITTSYDYDAPLDEYGLIR 330
Query: 326 QPKWGHLKDLHKAIKLCEAALVATDPTYPSLGPNLEATVYKTGSGLCSAFLANIGTNSDV 385
QPK+GHLK+LHKAIK+CE ALV+ DP S+G +A VY T SG C+AFL+N T S V
Sbjct: 331 QPKYGHLKELHKAIKMCERALVSADPAVTSMGNFQQAHVYTTKSGDCAAFLSNFDTKSSV 390
Query: 386 TVKFNGNSYLLPAWSVSILPDCKNVVFNTAKINSVTLVPSFSRQSLQVAADSSDAIGSGW 445
V FN Y LP WS+SILPDC+NVVFNTAK+ Q+ Q+ ++ W
Sbjct: 391 RVMFNNMHYNLPPWSISILPDCRNVVFNTAKVGV---------QTSQMQMLPTNTHMFSW 441
Query: 446 SYINEPVGISKDDA----FTKPGLLEQINTTADQSDYLWYSLSTNIKADEPLLEDGSKTV 501
+E + S DD T GLLEQIN T D SDYLWY S +I + E L G
Sbjct: 442 ESFDEDIS-SLDDGSAITITTSGLLEQINVTRDTSDYLWYITSVDIGSSESFLRGGKLPT 500
Query: 502 LHVQSLGHALHAFINGKLVGSGYGSSSNAKVTVDFPIALAPGKNTFDLLSLTVGLQNYGA 561
L VQS GHA+H FING+L GS YG+ + + + L G N LLS+ VGL N G
Sbjct: 501 LIVQSTGHAVHVFINGQLSGSAYGTREDRRFRYTGTVNLRAGTNRIALLSVAVGLPNVGG 560
Query: 562 FYEKTGAGITGPVQLKGSGNGTNIDLSSQQWTYQTGLKGEELNF--PSG-SSTQWDSKST 618
+E GI GPV L+G G +DLS Q+WTYQ GLKGE +N P+G SS +W +
Sbjct: 561 HFETWNTGILGPVVLRGLNQG-KLDLSWQKWTYQVGLKGEAMNLASPNGISSVEWMQSAL 619
Query: 619 L-PKLQPLVWYKTTFDAPAGSEPVAIDFTGMGKGEAWVNGQSIGRYWPTYVSQNGGCTDS 677
+ K QPL W+KT FDAP G EP+A+D GMGKG+ W+NG SIGRYW + G +
Sbjct: 620 VSEKNQPLTWHKTYFDAPDGDEPLALDMEGMGKGQIWINGLSIGRYW---TAPAAGICNG 676
Query: 678 CNYRGAYSSNKCLKNCGKPSQSLYHVPRSWLKSSGNTLVLFEEIGGDPTKISFVTKQLGS 737
C+Y G + KC CG+P+Q YHVPRSWLK + N LV+FEE+GGDP+KIS V + + S
Sbjct: 677 CSYAGTFRPPKCQVGCGQPTQRWYHVPRSWLKPNHNLLVVFEELGGDPSKISLVKRSV-S 735
Query: 738 SLCSHVTDSHPLPVDMWGSDS--KIQRKPGPVLSLECPNPNQVISSIKFASFGTPLGTCG 795
S+C+ V++ HP + W DS K + P + L C +P+Q ISSIKFASFGTPLGTCG
Sbjct: 736 SICADVSEYHP-NIRNWHIDSYGKSEEFHPPKVHLHC-SPSQAISSIKFASFGTPLGTCG 793
Query: 796 SFSRGRCSSARSLSVVRQACVGSKSCSIGVSVNTFG-DPCKGVMKSLAVEASCT 848
++ +G C S S + + + C+G C++ VS + FG DPC V+K L+VEA C+
Sbjct: 794 NYEKGVCHSPTSYATLEKKCIGKPRCTVTVSNSNFGQDPCPNVLKRLSVEAVCS 847
>gi|449460229|ref|XP_004147848.1| PREDICTED: beta-galactosidase 3-like [Cucumis sativus]
gi|449476862|ref|XP_004154857.1| PREDICTED: beta-galactosidase 3-like [Cucumis sativus]
Length = 844
Score = 958 bits (2477), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 474/831 (57%), Positives = 578/831 (69%), Gaps = 35/831 (4%)
Query: 27 VTYDHRAVVIGGKRRVLISGSIHYPRSTPEMWPDLIQKSKDGGLDVIETYVFWNLHEPVR 86
VTYD +A++I G+RR+LISGSIHYPRSTPEMW DL+QK+KDGGLDV++TYVFWN+HEP
Sbjct: 29 VTYDKKAILINGQRRILISGSIHYPRSTPEMWDDLMQKAKDGGLDVVDTYVFWNVHEPSP 88
Query: 87 NQYNFEGRYDLVKFVKLVAEAGLYAHLRIGPYVCAEWNFGGFPLWLHFIPGIQFRTDNEP 146
Y+FEGRYDLV+F+K GLY HLRIGPYVCAEWNFGGFP+WL ++PGI FRTDN P
Sbjct: 89 GNYDFEGRYDLVRFIKTAQRVGLYVHLRIGPYVCAEWNFGGFPVWLKYVPGISFRTDNGP 148
Query: 147 FKAEMQRFTAKIVDMMKQEKLYASQGGPIILSQIENEYGNIDSAYGAAGKSYIKWAAGMA 206
FK MQ FT KIV MMK EKL+ASQGGPIILSQIENEYG A GAAG +Y+ WAA MA
Sbjct: 149 FKMAMQGFTQKIVQMMKSEKLFASQGGPIILSQIENEYGPQSKALGAAGHAYMNWAAKMA 208
Query: 207 LSLDTGVPWVMCQQSDAPDPIINTCNGFYCDQFTPNSNNKPKMWTENWSGWFLSFGGAVP 266
+ L+TGVPWVMC++ DAPDP+IN+CNGFYCD F+PN KP +WTE WSGWF FGG V
Sbjct: 209 VGLNTGVPWVMCKEDDAPDPVINSCNGFYCDYFSPNKPYKPTLWTEAWSGWFTEFGGPVY 268
Query: 267 YRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNFDRTSGGPFISTSYDYDAPLDEYGLIRQ 326
RPV+DLAFAVARF Q+GG+ NYYMYHGGTNF RT+GGPFI+TSYDYDAPLDEYG++RQ
Sbjct: 269 GRPVQDLAFAVARFVQKGGSLFNYYMYHGGTNFGRTAGGPFITTSYDYDAPLDEYGMLRQ 328
Query: 327 PKWGHLKDLHKAIKLCEAALVATDPTYPSLGPNLEATVYKTGSGLCSAFLANIGTNSDVT 386
PK+GHLK+LH+AIKLCE ALV++DPT SLG +A V+ +G G C+AFLAN TNS T
Sbjct: 329 PKYGHLKNLHRAIKLCEHALVSSDPTVTSLGAYEQAHVFSSGPGRCAAFLANYHTNSAAT 388
Query: 387 VKFNGNSYLLPAWSVSILPDCKNVVFNTAK----INSVTLVPSFSRQSLQVAADSSDAIG 442
V FN Y LPAWS+SILPDCK VVFNTA+ I ++P+ S+ S + + + ++G
Sbjct: 389 VVFNNMRYALPAWSISILPDCKRVVFNTAQVGVHIAQTQMLPTISKLSWETYNEDTYSLG 448
Query: 443 SGWSYINEPVGISKDDAFTKPGLLEQINTTADQSDYLWYSLSTNIKADEPLLEDGSKTVL 502
G S+ T GLLEQIN T D SDYLWY S I + E L G K L
Sbjct: 449 ----------GSSR---MTVAGLLEQINVTRDTSDYLWYMTSVGISSSEAFLRGGQKPTL 495
Query: 503 HVQSLGHALHAFINGKLVGSGYGSSSNAKVTVDFPIALAPGKNTFDLLSLTVGLQNYGAF 562
V+S GHA+H FING+ GS YGS + T PI L G N LLS+ VGL N G
Sbjct: 496 SVRSAGHAVHVFINGQFSGSAYGSREHPAFTYTGPINLRAGMNKIALLSIAVGLPNVGLH 555
Query: 563 YEKTGAGITGPVQLKGSGNGTNIDLSSQQWTYQTGLKGEELNFPS---GSSTQWDSKSTL 619
+EK GI GP+ + G NG DL+ Q+W+YQ GLKGE +N S +S W S L
Sbjct: 556 FEKWQTGILGPISISGL-NGGKKDLTWQKWSYQVGLKGEAMNLVSPTEATSVDWIKGSLL 614
Query: 620 PKLQPLVWYKTTFDAPAGSEPVAIDFTGMGKGEAWVNGQSIGRYWPTYVSQNGGCTDSCN 679
+PL WYK +F+AP G+EP+A+D MGKG+AW+NGQSIGRYW Y GGC+ C
Sbjct: 615 QGQRPLTWYKASFNAPRGNEPLALDLRSMGKGQAWINGQSIGRYWMAYA--KGGCS-RCT 671
Query: 680 YRGAYSSNKCLKNCGKPSQSLYHVPRSWLKSSGNTLVLFEEIGGDPTKISFVTKQLGSSL 739
Y G Y C CG+P+Q YHVPRSWLK + N LVLFEE+GGD +KIS + + + + L
Sbjct: 672 YAGTYRPPTCENGCGQPTQRWYHVPRSWLKPTNNVLVLFEELGGDASKISLMRRSV-TGL 730
Query: 740 CSHVTDSHPLPVDMWGSDSKIQRKPGPV--LSLECPNPNQVISSIKFASFGTPLGTCGSF 797
C + H +DS I + L L+C NP QVIS+IKFASFGTP GTCGS+
Sbjct: 731 CGEAVEYHA------KNDSYIIESNEELDSLHLQC-NPGQVISAIKFASFGTPSGTCGSY 783
Query: 798 SRGRCSSARSLSVVRQACVGSKSCSIGVSVNTFG-DPCKGVMKSLAVEASC 847
+G C + S +++ + C+G KSCS+ + + FG DPC +K L VE C
Sbjct: 784 QKGTCHAPDSHAIIEKKCIGLKSCSVSTTRDNFGVDPCPNELKQLLVEVDC 834
>gi|165906266|gb|ABY71826.1| beta-galactosidase [Prunus salicina]
Length = 836
Score = 958 bits (2477), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 471/829 (56%), Positives = 587/829 (70%), Gaps = 23/829 (2%)
Query: 25 ANVTYDHRAVVIGGKRRVLISGSIHYPRSTPEMWPDLIQKSKDGGLDVIETYVFWNLHEP 84
A+V+YDH+A++I G++R+LISGSIHYPRSTPEMWPDLIQKSKDGGLDVI+TYVFWN HEP
Sbjct: 26 ASVSYDHKAIIINGQKRILISGSIHYPRSTPEMWPDLIQKSKDGGLDVIQTYVFWNGHEP 85
Query: 85 VRNQYNFEGRYDLVKFVKLVAEAGLYAHLRIGPYVCAEWNFGGFPLWLHFIPGIQFRTDN 144
+Y FE RYDLVKF+KLV +AGLY +LRIGPYVCAEWNFGGFP+WL ++PGI FRTDN
Sbjct: 86 SPGKYYFEDRYDLVKFIKLVHQAGLYVNLRIGPYVCAEWNFGGFPVWLKYVPGIVFRTDN 145
Query: 145 EPFKAEMQRFTAKIVDMMKQEKLYASQGGPIILSQIENEYGNIDSAYGAAGKSYIKWAAG 204
EPFKA MQ+FT KIV MMK E+L+ SQGGPIILSQIENE+G ++ GA GK+Y KWAA
Sbjct: 146 EPFKAAMQKFTEKIVSMMKAEQLFQSQGGPIILSQIENEFGPVEWEIGAPGKAYTKWAAQ 205
Query: 205 MALSLDTGVPWVMCQQSDAPDPIINTCNGFYCDQFTPNSNNKPKMWTENWSGWFLSFGGA 264
MA+ L+TGVPW+MC+Q DAPDP+I+TCNGFYC+ FTPN N KPKMWTE W+GW+ FGGA
Sbjct: 206 MAVGLNTGVPWIMCKQEDAPDPVIDTCNGFYCENFTPNKNYKPKMWTEVWTGWYTEFGGA 265
Query: 265 VPYRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNFDRTSGGPFISTSYDYDAPLDEYGLI 324
VP RP EDLAF++ARF Q+GG+F NYYMYHGGTNF RT+GGPF++TSYDYDAPLDEYGL
Sbjct: 266 VPTRPAEDLAFSIARFIQKGGSFVNYYMYHGGTNFGRTAGGPFMATSYDYDAPLDEYGLP 325
Query: 325 RQPKWGHLKDLHKAIKLCEAALVATDPTYPSLGPNLEATVYKTGSGLCSAFLANIGTNSD 384
R+PKWGHL+DLHKAIK E+ALV+ +P+ SLG + EA V+K+ SG C+AFLAN T S
Sbjct: 326 REPKWGHLRDLHKAIKSSESALVSAEPSVTSLGNSQEAHVFKSKSG-CAAFLANYDTKSS 384
Query: 385 VTVKFNGNSYLLPAWSVSILPDCKNVVFNTAKINSVTLVPSFSRQSLQVAADSSDAIGSG 444
V F Y LP WS+SILPDC+ V+NTA++ S QS Q+ +
Sbjct: 385 AKVSFGNGQYELPPWSISILPDCRTAVYNTARLGS---------QSSQMKMTPVKSALPW 435
Query: 445 WSYINEPVGISKDDAFTKPGLLEQINTTADQSDYLWYSLSTNIKADEPLLEDGSKTVLHV 504
S+I E + D T GL EQIN T D +DY WY I DE ++ G +L +
Sbjct: 436 QSFIEESASSDESDTTTLDGLWEQINVTRDTTDYSWYMTDITISPDEGFIKRGESPLLTI 495
Query: 505 QSLGHALHAFINGKLVGSGYGSSSNAKVTVDFPIALAPGKNTFDLLSLTVGLQNYGAFYE 564
S GHALH FING+L G+ YG+ N K+T + L G N LLS++VGL N G +E
Sbjct: 496 YSAGHALHVFINGQLSGTVYGALENPKLTFSQNVKLRSGINKLALLSISVGLPNVGLHFE 555
Query: 565 KTGAGITGPVQLKGSGNGTNIDLSSQQWTYQTGLKGEELNFPS---GSSTQWDSKSTLPK 621
AG+ GPV LKG +GT D+S +WTY+ GLKGE L + SS +W ++ +
Sbjct: 556 TWNAGVLGPVTLKGLNSGT-WDMSRWKWTYKVGLKGEALGLHTVSGSSSVEWAEGPSMAQ 614
Query: 622 LQPLVWYKTTFDAPAGSEPVAIDFTGMGKGEAWVNGQSIGRYWPTYVSQNGGCTDSCNYR 681
QPL WY+ TF+AP G+ P+A+D + MGKG+ W+NGQSIGR+WP Y ++ G C + C Y
Sbjct: 615 KQPLTWYRATFNAPPGNGPLALDMSSMGKGQIWINGQSIGRHWPAYTAR-GNCGN-CYYA 672
Query: 682 GAYSSNKCLKNCGKPSQSLYHVPRSWLKSSGNTLVLFEEIGGDPTKISFVTKQLGSSLCS 741
G Y KC +CG+PSQ YHVPRSWL +SGN LV+FEE GGDPTKIS V ++ SS+C+
Sbjct: 673 GTYDDKKCRTHCGEPSQRWYHVPRSWLTTSGNLLVVFEEWGGDPTKISLVERRT-SSVCA 731
Query: 742 HVTDSHP-LPVDMWGSDSKIQRKPGPVLSLECPNPNQVISSIKFASFGTPLGTCGSFSRG 800
+ + P L + K+ R P L CP P QVIS IKFAS+G GTCGSF G
Sbjct: 732 DIFEGQPTLTNSQKLASGKLNR---PKAHLWCP-PGQVISDIKFASYGLSQGTCGSFQEG 787
Query: 801 RCSSARSLSVVRQACVGSKSCSIGVSVNTF-GDPCKGVMKSLAVEASCT 848
C + +S ++ C+G +SCS+ V+ F GDPC G K L+VEA C+
Sbjct: 788 SCHAHKSYDAPKRNCIGKQSCSVTVAPEVFGGDPCPGSTKKLSVEAVCS 836
>gi|157313306|gb|ABV32546.1| beta-galactosidase protein 1 [Prunus persica]
Length = 836
Score = 958 bits (2477), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 472/829 (56%), Positives = 585/829 (70%), Gaps = 23/829 (2%)
Query: 25 ANVTYDHRAVVIGGKRRVLISGSIHYPRSTPEMWPDLIQKSKDGGLDVIETYVFWNLHEP 84
A+V+YDH+A++I G++R+LISGSIHYPRSTPEMWPDLIQKSKDGGLDVI+TYVFWN HEP
Sbjct: 26 ASVSYDHKAIIINGQKRILISGSIHYPRSTPEMWPDLIQKSKDGGLDVIQTYVFWNGHEP 85
Query: 85 VRNQYNFEGRYDLVKFVKLVAEAGLYAHLRIGPYVCAEWNFGGFPLWLHFIPGIQFRTDN 144
+Y FE RYDLVKF+KLV +AGLY +LRIGPYVCAEWNFGGFP+WL ++PGI FRTDN
Sbjct: 86 SPGKYYFEDRYDLVKFIKLVHQAGLYVNLRIGPYVCAEWNFGGFPVWLKYVPGIVFRTDN 145
Query: 145 EPFKAEMQRFTAKIVDMMKQEKLYASQGGPIILSQIENEYGNIDSAYGAAGKSYIKWAAG 204
EPFKA MQ+FT KIV MMK E+L+ SQGGPIILSQIENE+G ++ GA GK+Y KWAA
Sbjct: 146 EPFKAAMQKFTEKIVSMMKAEQLFQSQGGPIILSQIENEFGPVEWEIGAPGKAYTKWAAQ 205
Query: 205 MALSLDTGVPWVMCQQSDAPDPIINTCNGFYCDQFTPNSNNKPKMWTENWSGWFLSFGGA 264
MA+ L+TGVPW+MC+Q DAPDP+I+TCNGFYC+ FTPN N KPKMWTE W+GW+ FGGA
Sbjct: 206 MAVGLNTGVPWIMCKQEDAPDPVIDTCNGFYCENFTPNKNYKPKMWTEVWTGWYTEFGGA 265
Query: 265 VPYRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNFDRTSGGPFISTSYDYDAPLDEYGLI 324
VP RP EDLAF++ARF Q+GG+F NYYMYHGGTNF RT+GGPF++TSYDYDAPLDEYGL
Sbjct: 266 VPTRPAEDLAFSIARFIQKGGSFVNYYMYHGGTNFGRTAGGPFMATSYDYDAPLDEYGLP 325
Query: 325 RQPKWGHLKDLHKAIKLCEAALVATDPTYPSLGPNLEATVYKTGSGLCSAFLANIGTNSD 384
R+PKWGHL+DLHKAIK E+ALV+ +P+ SLG EA V+K+ SG C+AFLAN T S
Sbjct: 326 REPKWGHLRDLHKAIKSSESALVSAEPSVTSLGNGQEAHVFKSKSG-CAAFLANYDTKSS 384
Query: 385 VTVKFNGNSYLLPAWSVSILPDCKNVVFNTAKINSVTLVPSFSRQSLQVAADSSDAIGSG 444
V F Y LP W +SILPDCK V+NTA++ S QS Q+ +
Sbjct: 385 AKVSFGNGQYELPPWPISILPDCKTAVYNTARLGS---------QSSQMKMTPVKSALPW 435
Query: 445 WSYINEPVGISKDDAFTKPGLLEQINTTADQSDYLWYSLSTNIKADEPLLEDGSKTVLHV 504
S++ E + D T GL EQIN T D +DYLWY I DE ++ G +L +
Sbjct: 436 QSFVEESASSDESDTTTLDGLWEQINVTRDTTDYLWYMTDITISPDEGFIKRGESPLLTI 495
Query: 505 QSLGHALHAFINGKLVGSGYGSSSNAKVTVDFPIALAPGKNTFDLLSLTVGLQNYGAFYE 564
S GHALH FING+L G+ YG+ N K+T + G N LLS++VGL N G +E
Sbjct: 496 YSAGHALHVFINGQLSGTVYGALENPKLTFSQNVKPRSGINKLALLSISVGLPNVGLHFE 555
Query: 565 KTGAGITGPVQLKGSGNGTNIDLSSQQWTYQTGLKGEELNFPS---GSSTQWDSKSTLPK 621
AG+ GPV LKG +GT D+S +WTY+ GLKGE L + SS +W ++ +
Sbjct: 556 TWNAGVLGPVTLKGLNSGT-WDMSRWKWTYKIGLKGEALGLHTVSGSSSVEWAEGPSMAQ 614
Query: 622 LQPLVWYKTTFDAPAGSEPVAIDFTGMGKGEAWVNGQSIGRYWPTYVSQNGGCTDSCNYR 681
QPL WYK TF+AP G+ P+A+D + MGKG+ W+NGQSIGR+WP Y ++ G C + C Y
Sbjct: 615 KQPLTWYKATFNAPPGNGPLALDMSSMGKGQIWINGQSIGRHWPAYTAR-GNCGN-CYYA 672
Query: 682 GAYSSNKCLKNCGKPSQSLYHVPRSWLKSSGNTLVLFEEIGGDPTKISFVTKQLGSSLCS 741
G Y KC +CG+PSQ YHVPRSWL SGN LV+FEE GGDPTKIS V ++ SS+C+
Sbjct: 673 GTYDDKKCRTHCGEPSQRWYHVPRSWLTPSGNLLVVFEEWGGDPTKISLVERRT-SSVCA 731
Query: 742 HVTDSHP-LPVDMWGSDSKIQRKPGPVLSLECPNPNQVISSIKFASFGTPLGTCGSFSRG 800
+ + P L + K+ R P L CP P QVIS IKFAS+G P GTCGSF G
Sbjct: 732 DIFEGQPTLTNSQKLASGKLNR---PKAHLWCP-PGQVISDIKFASYGLPQGTCGSFQEG 787
Query: 801 RCSSARSLSVVRQACVGSKSCSIGVSVNTF-GDPCKGVMKSLAVEASCT 848
C + +S ++ C+G +SCS+ V+ F GDPC G K L+VEA C+
Sbjct: 788 SCHAHKSYDAPKRNCIGKQSCSVAVAPEVFGGDPCPGSTKKLSVEAVCS 836
>gi|357454655|ref|XP_003597608.1| Beta-galactosidase [Medicago truncatula]
gi|124360385|gb|ABN08398.1| D-galactoside/L-rhamnose binding SUEL lectin; Galactose-binding
like [Medicago truncatula]
gi|355486656|gb|AES67859.1| Beta-galactosidase [Medicago truncatula]
Length = 841
Score = 958 bits (2476), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 471/829 (56%), Positives = 575/829 (69%), Gaps = 18/829 (2%)
Query: 25 ANVTYDHRAVVIGGKRRVLISGSIHYPRSTPEMWPDLIQKSKDGGLDVIETYVFWNLHEP 84
A+V+YD +A+ I G+ R+LISGSIHYPRSTPEMWPDLIQK+K+GGLDVI+TYVFWN HEP
Sbjct: 26 ASVSYDSKAITINGQSRILISGSIHYPRSTPEMWPDLIQKAKEGGLDVIQTYVFWNGHEP 85
Query: 85 VRNQYNFEGRYDLVKFVKLVAEAGLYAHLRIGPYVCAEWNFGGFPLWLHFIPGIQFRTDN 144
+Y FEG YDLVKF+KLV +AGLY HLRIGPYVCAEWNFGGFP+WL +IPGI FRTDN
Sbjct: 86 SPGKYYFEGNYDLVKFIKLVQQAGLYVHLRIGPYVCAEWNFGGFPVWLKYIPGISFRTDN 145
Query: 145 EPFKAEMQRFTAKIVDMMKQEKLYASQGGPIILSQIENEYGNIDSAYGAAGKSYIKWAAG 204
EPFK +MQ+FT KIVDMMK ++L+ SQGGPII+SQIENEYG ++ GA GKSY KWAA
Sbjct: 146 EPFKFQMQKFTEKIVDMMKADRLFESQGGPIIMSQIENEYGPMEYEIGAPGKSYTKWAAD 205
Query: 205 MALSLDTGVPWVMCQQSDAPDPIINTCNGFYCDQFTPNSNNKPKMWTENWSGWFLSFGGA 264
MA+ L TGVPW+MC+Q DAPDP+INTCNGFYCD F+PN + KPKMWTE W+GWF FGG
Sbjct: 206 MAVGLGTGVPWIMCKQDDAPDPVINTCNGFYCDYFSPNKDYKPKMWTEAWTGWFTEFGGP 265
Query: 265 VPYRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNFDRTSGGPFISTSYDYDAPLDEYGLI 324
VP+RP ED+AF+VARF Q+GG+F NYYMYHGGTNF RT+GGPFI+TSYDYDAPLDEYGL+
Sbjct: 266 VPHRPAEDMAFSVARFIQKGGSFINYYMYHGGTNFGRTAGGPFIATSYDYDAPLDEYGLL 325
Query: 325 RQPKWGHLKDLHKAIKLCEAALVATDPTYPSLGPNLEATVYKTGSGLCSAFLANIGTNSD 384
+QPKWGHLKDLH+AIKL E AL++ DPT +G EA V+K+ SG C+AFL N +
Sbjct: 326 QQPKWGHLKDLHRAIKLSEPALISGDPTVTRIGNYQEAHVFKSKSGACAAFLGNYNPKAF 385
Query: 385 VTVKFNGNSYLLPAWSVSILPDCKNVVFNTAKINSVTLVPSFSRQSLQVAADSSDAIGSG 444
TV F Y LP WS+SILPDCKN V+NTA++ S + +R + G
Sbjct: 386 ATVAFGNMHYNLPPWSISILPDCKNTVYNTARVGSQSAQMKMTRVPIHG--------GLS 437
Query: 445 WSYINEPVGISKDDAFTKPGLLEQINTTADQSDYLWYSLSTNIKADEPLLEDGSKTVLHV 504
W E + D +FT GLLEQ+NTT D +DYLWYS I +E L G VL V
Sbjct: 438 WQVFTEQTASTDDSSFTMTGLLEQLNTTRDLTDYLWYSTDVVIDPNEGFLRSGKDPVLTV 497
Query: 505 QSLGHALHAFINGKLVGSGYGSSSNAKVTVDFPIALAPGKNTFDLLSLTVGLQNYGAFYE 564
S GHALH FIN +L G+ YGS K+T + L PG N LLS+ VGL N G +E
Sbjct: 498 LSAGHALHVFINSQLSGTIYGSLEFPKLTFSQNVKLIPGVNKISLLSVAVGLPNVGPHFE 557
Query: 565 KTGAGITGPVQLKGSGNGTNIDLSSQQWTYQTGLKGEE---LNFPSGSSTQWDSKSTLPK 621
AG+ GP+ L G G DLS Q+W+Y+ GL GE + SS +W S + +
Sbjct: 558 TWNAGVLGPITLNGLDEGRR-DLSWQKWSYKVGLHGEALSLHSLGGSSSVEWVQGSLVSR 616
Query: 622 LQPLVWYKTTFDAPAGSEPVAIDFTGMGKGEAWVNGQSIGRYWPTYVSQNGGCTDSCNYR 681
+QPL WYKTTFDAP G P A+D MGKG+ W+NGQ++GRYWP Y + G D+C+Y
Sbjct: 617 MQPLTWYKTTFDAPDGIAPFALDMGSMGKGQVWLNGQNLGRYWPAY--KASGTCDNCDYA 674
Query: 682 GAYSSNKCLKNCGKPSQSLYHVPRSWLKSSGNTLVLFEEIGGDPTKISFVTKQLGSSLCS 741
G Y+ NKC NCG+ SQ YHVP SWL +GN LV+FEE+GGDP I V + + S+C+
Sbjct: 675 GTYNENKCRSNCGEASQRWYHVPHSWLIPTGNLLVVFEELGGDPNGIFLVRRDI-DSVCA 733
Query: 742 HVTDSHPLPVDMWGSDSKIQRKP-GPVLSLECPNPNQVISSIKFASFGTPLGTCGSFSRG 800
+ + P + S KP P L C P Q ISSIKFASFGTP+G+CG+F G
Sbjct: 734 DIYEWQPNLISYQMQTSGKTNKPVRPKAHLSC-GPGQKISSIKFASFGTPVGSCGNFHEG 792
Query: 801 RCSSARSLSVVRQACVGSKSCSIGVSVNTF-GDPCKGVMKSLAVEASCT 848
C + +S + + CVG SC + VS F GDPC V+K L+VEA CT
Sbjct: 793 SCHAHKSYNTFEKNCVGQNSCKVTVSPENFGGDPCPNVLKKLSVEAICT 841
>gi|316995681|emb|CAA07236.2| beta-galactosidase precursor [Cicer arietinum]
Length = 839
Score = 958 bits (2476), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 476/831 (57%), Positives = 580/831 (69%), Gaps = 18/831 (2%)
Query: 23 FGANVTYDHRAVVIGGKRRVLISGSIHYPRSTPEMWPDLIQKSKDGGLDVIETYVFWNLH 82
F A+V+YD++A+ I G+R++L+SGSIHYPRSTPEMWPDLIQK+K+GGLDVI+TYVFWN H
Sbjct: 22 FEASVSYDYKAITINGQRKILLSGSIHYPRSTPEMWPDLIQKAKEGGLDVIQTYVFWNGH 81
Query: 83 EPVRNQYNFEGRYDLVKFVKLVAEAGLYAHLRIGPYVCAEWNFGGFPLWLHFIPGIQFRT 142
EP +Y FEG YDLVKF++LV +AGLY HLRIGPY CAEWNFGGFP+WL +IPGI FRT
Sbjct: 82 EPSPGKYYFEGNYDLVKFIRLVQQAGLYVHLRIGPYACAEWNFGGFPVWLKYIPGISFRT 141
Query: 143 DNEPFKAEMQRFTAKIVDMMKQEKLYASQGGPIILSQIENEYGNIDSAYGAAGKSYIKWA 202
DN PFK +MQ+FT KIV++MK E+LY SQGGPIILSQIENEYG ++ GA GK+Y +WA
Sbjct: 142 DNGPFKFQMQKFTTKIVNIMKAERLYESQGGPIILSQIENEYGPMEYELGAPGKAYAQWA 201
Query: 203 AGMALSLDTGVPWVMCQQSDAPDPIINTCNGFYCDQFTPNSNNKPKMWTENWSGWFLSFG 262
A MA+ L TGVPWVMC+Q DAPDP+INTCNGFYCD F+PN KPKMWTE W+GWF FG
Sbjct: 202 AHMAIGLGTGVPWVMCKQDDAPDPVINTCNGFYCDYFSPNKAYKPKMWTEAWTGWFTGFG 261
Query: 263 GAVPYRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNFDRTSGGPFISTSYDYDAPLDEYG 322
G VP+RP EDLAF+VARF Q+GG+F NYYMYHGGTNF RT+GGPFI+TSYDYDAPLDEYG
Sbjct: 262 GTVPHRPAEDLAFSVARFIQKGGSFINYYMYHGGTNFGRTAGGPFIATSYDYDAPLDEYG 321
Query: 323 LIRQPKWGHLKDLHKAIKLCEAALVATDPTYPSLGPNLEATVYKTGSGLCSAFLANIGTN 382
L+RQPKWGHLKDLH+AIKLCE ALV+ DPT LG EA V+K+ SG C+AFLAN +
Sbjct: 322 LLRQPKWGHLKDLHRAIKLCEPALVSADPTVTRLGNYQEAHVFKSKSGACAAFLANYNPH 381
Query: 383 SDVTVKFNGNSYLLPAWSVSILPDCKNVVFNTAKINSVTLVPSFSRQSLQVAADSSDAIG 442
S TV F Y LP WS+SILP+CK+ V+NTA++ S + +R + G
Sbjct: 382 SYSTVAFGNQHYNLPPWSISILPNCKHTVYNTARLGSQSAQMKMTRVPIHG--------G 433
Query: 443 SGWSYINEPVGISKDDAFTKPGLLEQINTTADQSDYLWYSLSTNIKADEPLLEDGSKTVL 502
W NE + D +FT GLLEQIN T D SDYLWYS I DE +G VL
Sbjct: 434 LSWKAFNEETTTTDDSSFTVTGLLEQINATRDLSDYLWYSTDVVINPDEGYFRNGKNPVL 493
Query: 503 HVQSLGHALHAFINGKLVGSGYGSSSNAKVTVDFPIALAPGKNTFDLLSLTVGLQNYGAF 562
V S GHALH FING+L G+ YGS K+T + L G N LLS+ VGL N G
Sbjct: 494 TVLSAGHALHVFINGQLSGTVYGSLDFPKLTFSESVNLRAGVNKISLLSVAVGLPNVGPH 553
Query: 563 YEKTGAGITGPVQLKGSGNGTNIDLSSQQWTYQTGLKGEE---LNFPSGSSTQWDSKSTL 619
+E AG+ GP+ L G G DL+ Q+W+Y+ GLKGE+ + SS W +
Sbjct: 554 FETWNAGVLGPITLNGLNEGRR-DLTWQKWSYKVGLKGEDLSLHSLSGSSSVDWLQGYLV 612
Query: 620 PKLQPLVWYKTTFDAPAGSEPVAIDFTGMGKGEAWVNGQSIGRYWPTYVSQNGGCTDSCN 679
+ QPL WYKTTFDAPAG P+A+D MGKG+ W+NGQS+GRYWP Y + G D CN
Sbjct: 613 SRRQPLTWYKTTFDAPAGVAPLALDMNSMGKGQVWLNGQSLGRYWPAYKAT--GSCDYCN 670
Query: 680 YRGAYSSNKCLKNCGKPSQSLYHVPRSWLKSSGNTLVLFEEIGGDPTKISFVTKQLGSSL 739
Y G Y+ KC NCG+ SQ YHVP SWLK +GN LV+FEE+GGDP + V + + S+
Sbjct: 671 YAGTYNEKKCGTNCGEASQRWYHVPHSWLKPTGNLLVMFEELGGDPNGVFLVRRDI-DSV 729
Query: 740 CSHVTDSHPLPVD-MWGSDSKIQRKPGPVLSLECPNPNQVISSIKFASFGTPLGTCGSFS 798
C+ + + P V + K+ R P L C P Q ISSIKFASFGTP+G+CG++
Sbjct: 730 CADIYEWQPNLVSYQMQASGKVSRPVSPKAHLSC-GPGQKISSIKFASFGTPVGSCGNYR 788
Query: 799 RGRCSSARSLSVVRQACVGSKSCSIGVSVNTF-GDPCKGVMKSLAVEASCT 848
G C + +S ++ CVG SC++ VS F GDPC VMK L+VEA CT
Sbjct: 789 EGSCHAHKSYDAFQRNCVGQSSCTVTVSPEIFGGDPCPNVMKKLSVEAICT 839
>gi|332105893|gb|AEE01408.1| beta-galactosidase STBG2 [Solanum lycopersicum]
Length = 892
Score = 957 bits (2474), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 478/876 (54%), Positives = 598/876 (68%), Gaps = 42/876 (4%)
Query: 6 ILLLVLCWGFVVLATTSFGA-NVTYDHRAVVIGGKRRVLISGSIHYPRSTPEMWPDLIQK 64
++L VL FV++A F NVTYD+RA++IGGKRR+LIS IHYPR+TPEMWP LI +
Sbjct: 15 LILTVLTIHFVIVAGEYFKPFNVTYDNRALIIGGKRRMLISAGIHYPRATPEMWPTLIAR 74
Query: 65 SKDGGLDVIETYVFWNLHEPVRNQYNFEGRYDLVKFVKLVAEAGLYAHLRIGPYVCAEWN 124
SK+GG DVIETY FWN HEP R QYNFEGRYD+VKF KLV GL+ +RIGPY CAEWN
Sbjct: 75 SKEGGADVIETYTFWNGHEPTRGQYNFEGRYDIVKFAKLVGSHGLFLFIRIGPYACAEWN 134
Query: 125 FGGFPLWLHFIPGIQFRTDNEPFKAEMQRFTAKIVDMMKQEKLYASQGGPIILSQIENEY 184
FGGFP+WL IPGI+FRTDN PFK EM+R+ KIVD+M E L++ QGGPIIL QIENEY
Sbjct: 135 FGGFPIWLRDIPGIEFRTDNAPFKEEMERYVKKIVDLMISESLFSWQGGPIILLQIENEY 194
Query: 185 GNIDSAYGAAGKSYIKWAAGMALSLDTGVPWVMCQQSDAPDPIINTCNGFYCDQFTPNSN 244
GN++S +G GK Y+KWAA MA+ L GVPWVMC+Q+DAP+ II+TCN +YCD FTPNS
Sbjct: 195 GNVESTFGPKGKLYMKWAAEMAVGLGAGVPWVMCRQTDAPEYIIDTCNAYYCDGFTPNSE 254
Query: 245 NKPKMWTENWSGWFLSFGGAVPYRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNFDRTSG 304
KPK+WTENW+GWF +G +PYRP ED+AFA+ARFFQRGG+ QNYYMY GGTNF RT+G
Sbjct: 255 KKPKIWTENWNGWFADWGERLPYRPSEDIAFAIARFFQRGGSLQNYYMYFGGTNFGRTAG 314
Query: 305 GPFISTSYDYDAPLDEYGLIRQPKWGHLKDLHKAIKLCEAALVATD-PTYPSLGPNLEAT 363
GP TSYDYDAPLDEYGL+RQPKWGHLKDLH AIKLCE ALVA D P Y LGP EA
Sbjct: 315 GPTQITSYDYDAPLDEYGLLRQPKWGHLKDLHAAIKLCEPALVAADSPQYIKLGPKQEAH 374
Query: 364 VYKTGS-----------GLCSAFLANIGTNSDVTVKFNGNSYLLPAWSVSILPDCKNVVF 412
VY+ S G+C+AF+ANI + TVKF G + LP WSVSILPDC+N F
Sbjct: 375 VYRGTSNNIGQYMSLNEGICAAFIANIDEHESATVKFYGQEFTLPPWSVSILPDCRNTAF 434
Query: 413 NTAKINSVTLVPSFSRQSLQVAADS----------SDAIGSGWSYINEPVGISKDDAFTK 462
NTAK+ + T + + S+ V +S ++ W + EP+G+ D FT
Sbjct: 435 NTAKVGAQTSIKTVGSDSVSVGNNSLFLQVITKSKLESFSQSWMTLKEPLGVWGDKNFTS 494
Query: 463 PGLLEQINTTADQSDYLWYSLSTNIKADEPLL--EDGSKTVLHVQSLGHALHAFINGKLV 520
G+LE +N T DQSDYLWY I D+ E+ + + S+ + F+NG+L
Sbjct: 495 KGILEHLNVTKDQSDYLWYLTRIYISDDDISFWEENDVSPTIDIDSMRDFVRIFVNGQLA 554
Query: 521 GSGYGSSSNAKVTVDFPIALAPGKNTFDLLSLTVGLQNYGAFYEKTGAGITGPVQLKGSG 580
GS G + V P+ L G N LLS TVGLQNYGAF EK GAG G ++L G
Sbjct: 555 GSVKGKW----IKVVQPVKLVQGYNDILLLSETVGLQNYGAFLEKDGAGFKGQIKLTGCK 610
Query: 581 NGTNIDLSSQQWTYQTGLKGEELNFPSGSSTQ---WDSKSTLPKLQPLVWYKTTFDAPAG 637
+G +I+L++ WTYQ GL+GE L +ST+ W T WYKT FDAP G
Sbjct: 611 SG-DINLTTSLWTYQVGLRGEFLEVYDVNSTESAGWTEFPTGTTPSVFSWYKTKFDAPGG 669
Query: 638 SEPVAIDFTGMGKGEAWVNGQSIGRYWPTYVSQNGGCTDSCNYRGAYSSNKCLKNCGKPS 697
++PVA+DF+ MGKG+AWVNG +GRYW T V+ N GC +C+YRGAY S+KC NCG+ +
Sbjct: 670 TDPVALDFSSMGKGQAWVNGHHVGRYW-TLVAPNNGCGRTCDYRGAYHSDKCRTNCGEIT 728
Query: 698 QSLYHVPRSWLKSSGNTLVLFEEIGGDPTKISFVTKQLGSSLCSHVTDSHPLPVDMWGSD 757
Q+ YH+PRSWLK+ N LV+FEEI P IS T+ ++C+ V++ H P+ W S
Sbjct: 729 QAWYHIPRSWLKTLNNVLVIFEEIDKTPFDISISTRST-ETICAQVSEKHYPPLHKW-SH 786
Query: 758 SKIQRK-----PGPVLSLECPNPNQVISSIKFASFGTPLGTCGSFSRGRCSSARSLSVVR 812
S+ RK P + L+C + ISSI+FAS+G+P G+C FS+G+C +A SLSVV
Sbjct: 787 SEFDRKLSLMDKTPEMHLQC-DEGHTISSIEFASYGSPNGSCQKFSQGKCHAANSLSVVS 845
Query: 813 QACVGSKSCSIGVSVNTFGDPCKGVMKSLAVEASCT 848
QAC+G SCSIG+S FGDPC+ V+KSLAV+A C+
Sbjct: 846 QACIGRTSCSIGISNGVFGDPCRHVVKSLAVQAKCS 881
>gi|356496697|ref|XP_003517202.1| PREDICTED: beta-galactosidase 3-like [Glycine max]
Length = 849
Score = 957 bits (2474), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 482/833 (57%), Positives = 585/833 (70%), Gaps = 28/833 (3%)
Query: 26 NVTYDHRAVVIGGKRRVLISGSIHYPRSTPEMWPDLIQKSKDGGLDVIETYVFWNLHEPV 85
+VTYD +A++I G+RR+L SGSIHYPRSTP+MW DLI K+K+GGLDVIETYVFWN+HEP
Sbjct: 31 SVTYDRKAILINGQRRILFSGSIHYPRSTPDMWEDLIYKAKEGGLDVIETYVFWNVHEPS 90
Query: 86 RNQYNFEGRYDLVKFVKLVAEAGLYAHLRIGPYVCAEWNFGGFPLWLHFIPGIQFRTDNE 145
R YNFEGRYDLV+FVK + +AGLYA+LRIGPYVCAEWNFGGFP+WL ++PGI FRTDNE
Sbjct: 91 RGNYNFEGRYDLVRFVKTIQKAGLYANLRIGPYVCAEWNFGGFPVWLKYVPGISFRTDNE 150
Query: 146 PFKAEMQRFTAKIVDMMKQEKLYASQGGPIILSQIENEYGNIDSAYGAAGKSYIKWAAGM 205
PFK MQ FT KIV MMK E+LY SQGGPIILSQIENEYG G+AG++Y+ WAA M
Sbjct: 151 PFKKAMQGFTEKIVGMMKSERLYESQGGPIILSQIENEYGAQSKLLGSAGQNYVNWAAKM 210
Query: 206 ALSLDTGVPWVMCQQSDAPDPIINTCNGFYCDQFTPNSNNKPKMWTENWSGWFLSFGGAV 265
A+ TGVPWVMC++ DAPDP+INTCNGFYCD FTPN KP +WTE WSGWF FGG
Sbjct: 211 AVETGTGVPWVMCKEDDAPDPVINTCNGFYCDYFTPNKPYKPSIWTEAWSGWFSEFGGPN 270
Query: 266 PYRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNFDRTSGGPFISTSYDYDAPLDEYGLIR 325
RPV+DLAF VARF Q+GG+F NYYMYHGGTNF RT+GGPFI+TSYDYDAPLDEYGLIR
Sbjct: 271 HERPVQDLAFGVARFIQKGGSFVNYYMYHGGTNFGRTAGGPFITTSYDYDAPLDEYGLIR 330
Query: 326 QPKWGHLKDLHKAIKLCEAALVATDPTYPSLGPNLEATVYKTGSGLCSAFLANIGTNSDV 385
QPK+GHLK+LHKAIK+CE ALV+TDP SLG +A VY SG C+AFL+N T S V
Sbjct: 331 QPKYGHLKELHKAIKMCERALVSTDPAVTSLGNFQQAHVYSAKSGDCAAFLSNFDTKSSV 390
Query: 386 TVKFNGNSYLLPAWSVSILPDCKNVVFNTAKINSVTLVPSFSRQSLQVAADSSDAIGSGW 445
V FN Y LP WS+SILPDC+NVVFNTAK+ Q+ Q+ ++ W
Sbjct: 391 RVMFNNMHYNLPPWSISILPDCRNVVFNTAKVGV---------QTSQMQMLPTNTRMFSW 441
Query: 446 SYINEPVGISKDDA----FTKPGLLEQINTTADQSDYLWYSLSTNIKADEPLLEDGSKTV 501
+E + S DD T GLLEQIN T D SDYLWY S +I + E L G
Sbjct: 442 ESFDEDIS-SLDDGSSITTTTSGLLEQINVTRDTSDYLWYITSVDIGSSESFLRGGKLPT 500
Query: 502 LHVQSLGHALHAFINGKLVGSGYGSSSNAKVTVDFPIALAPGKNTFDLLSLTVGLQNYGA 561
L VQS GHA+H FING+L GS YG+ + + T + L G N LLS+ VGL N G
Sbjct: 501 LIVQSTGHAVHVFINGQLSGSAYGTREDRRFTYTGTVNLRAGTNRIALLSVAVGLPNVGG 560
Query: 562 FYEKTGAGITGPVQLKGSGNGTNIDLSSQQWTYQTGLKGEELNF--PSG-SSTQWDSKST 618
+E GI GPV L+G G +DLS Q+WTYQ GLKGE +N P+G SS +W +
Sbjct: 561 HFETWNTGILGPVVLRGFDQG-KLDLSWQKWTYQVGLKGEAMNLASPNGISSVEWMQSAL 619
Query: 619 LP-KLQPLVWYKTTFDAPAGSEPVAIDFTGMGKGEAWVNGQSIGRYWPTYVSQNGGCTDS 677
+ K QPL W+KT FDAP G EP+A+D GMGKG+ W+NG SIGRYW + G C +
Sbjct: 620 VSDKNQPLTWHKTYFDAPDGDEPLALDMEGMGKGQIWINGLSIGRYWTALAA--GNC-NG 676
Query: 678 CNYRGAYSSNKCLKNCGKPSQSLYHVPRSWLKSSGNTLVLFEEIGGDPTKISFVTKQLGS 737
C+Y G + KC CG+P+Q YHVPRSWLK N LV+FEE+GGDP+KIS V + + S
Sbjct: 677 CSYAGTFRPPKCQVGCGQPTQRWYHVPRSWLKPDHNLLVVFEELGGDPSKISLVKRSV-S 735
Query: 738 SLCSHVTDSHPLPVDMWGSDS--KIQRKPGPVLSLECPNPNQVISSIKFASFGTPLGTCG 795
S+C+ V++ HP + W DS K + P + L C +P Q ISSIKFASFGTPLGTCG
Sbjct: 736 SVCADVSEYHP-NIRNWHIDSYGKSEEFHPPKVHLHC-SPGQTISSIKFASFGTPLGTCG 793
Query: 796 SFSRGRCSSARSLSVVRQACVGSKSCSIGVSVNTFG-DPCKGVMKSLAVEASC 847
++ +G C S+ S + + + C+G C++ VS + FG DPC V+K L+VEA C
Sbjct: 794 NYEKGVCHSSTSHATLEKKCIGKPRCTVTVSNSNFGQDPCPNVLKRLSVEAVC 846
>gi|118488890|gb|ABK96254.1| unknown [Populus trichocarpa x Populus deltoides]
Length = 846
Score = 956 bits (2472), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 478/832 (57%), Positives = 573/832 (68%), Gaps = 18/832 (2%)
Query: 22 SFGANVTYDHRAVVIGGKRRVLISGSIHYPRSTPEMWPDLIQKSKDGGLDVIETYVFWNL 81
S A+V+YD +A+ I G+RR+LISGSIHYPRS+PEMWPDLIQK+K+GGLDVI+TYVFWN
Sbjct: 28 SVTASVSYDSKAITINGQRRILISGSIHYPRSSPEMWPDLIQKAKEGGLDVIQTYVFWNG 87
Query: 82 HEPVRNQYNFEGRYDLVKFVKLVAEAGLYAHLRIGPYVCAEWNFGGFPLWLHFIPGIQFR 141
HEP +Y FEG YDLVKFVKL EAGLY HLRIGPY+CAEWNFGGFP+WL +IPGI FR
Sbjct: 88 HEPSPGKYYFEGNYDLVKFVKLAKEAGLYVHLRIGPYICAEWNFGGFPVWLKYIPGINFR 147
Query: 142 TDNEPFKAEMQRFTAKIVDMMKQEKLYASQGGPIILSQIENEYGNIDSAYGAAGKSYIKW 201
TDN PFKA+MQ+FT KIV+MMK E+L+ +QGGPIILSQIENEYG ++ G+ GK+Y KW
Sbjct: 148 TDNGPFKAQMQKFTTKIVNMMKAERLFETQGGPIILSQIENEYGPMEYEIGSPGKAYTKW 207
Query: 202 AAGMALSLDTGVPWVMCQQSDAPDPIINTCNGFYCDQFTPNSNNKPKMWTENWSGWFLSF 261
AA MA+ L TGVPWVMC+Q DAPDPIINTCNGFYCD F+PN KPKMWTE W+GWF F
Sbjct: 208 AAEMAVGLRTGVPWVMCKQDDAPDPIINTCNGFYCDYFSPNKAYKPKMWTEAWTGWFTQF 267
Query: 262 GGAVPYRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNFDRTSGGPFISTSYDYDAPLDEY 321
GG VP+RP ED+AF+VARF Q+GG+F NYYMYHGGTNF RT+GGPFI+TSYDYDAPLDEY
Sbjct: 268 GGPVPHRPAEDMAFSVARFIQKGGSFINYYMYHGGTNFGRTAGGPFIATSYDYDAPLDEY 327
Query: 322 GLIRQPKWGHLKDLHKAIKLCEAALVATDPTYPSLGPNLEATVYKTGSGLCSAFLANIGT 381
GL+RQPKWGHLKDLH+AIKLCE ALV+ D T LG EA V+ +G C+AFLAN
Sbjct: 328 GLLRQPKWGHLKDLHRAIKLCEPALVSGDATVIPLGNYQEAHVFNYKAGGCAAFLANYHQ 387
Query: 382 NSDVTVKFNGNSYLLPAWSVSILPDCKNVVFNTAKINSVTLVPSFSRQSLQVAADSSDAI 441
S V F Y LP WS+SILPDCKN V+NTA++ + + + +
Sbjct: 388 RSFAKVSFRNMHYNLPPWSISILPDCKNTVYNTARVGAQSARMKMTPVPMHG-------- 439
Query: 442 GSGWSYINEPVGISKDDAFTKPGLLEQINTTADQSDYLWYSLSTNIKADEPLLEDGSKTV 501
G W NE S D FT GLLEQINTT D SDYLWY +I E L G V
Sbjct: 440 GFSWQAYNEEPSASGDSTFTMVGLLEQINTTRDVSDYLWYMTDVHIDPSEGFLRSGKYPV 499
Query: 502 LHVQSLGHALHAFINGKLVGSGYGSSSNAKVTVDFPIALAPGKNTFDLLSLTVGLQNYGA 561
L V S GHALH FING+L G+ YGS K+T + L G N LLS+ VGL N G
Sbjct: 500 LGVLSAGHALHVFINGQLSGTAYGSLDFPKLTFTQGVKLRAGVNKISLLSIAVGLPNVGP 559
Query: 562 FYEKTGAGITGPVQLKGSGNGTNIDLSSQQWTYQTGLKGEELNFPS---GSSTQWDSKST 618
+E AGI GPV L G G DLS Q+W+Y+ GL GE L S SS +W S
Sbjct: 560 HFETWNAGILGPVTLNGLNEGRR-DLSWQKWSYKIGLHGEALGLHSISGSSSVEWAEGSL 618
Query: 619 LPKLQPLVWYKTTFDAPAGSEPVAIDFTGMGKGEAWVNGQSIGRYWPTYVSQNGGCTDSC 678
+ + QPL WYKTTF+APAG+ P+A+D MGKG+ W+NGQ +GR+WP Y + +G C D C
Sbjct: 619 VAQRQPLSWYKTTFNAPAGNSPLALDMGSMGKGQIWINGQHVGRHWPAYKA-SGTCGD-C 676
Query: 679 NYRGAYSSNKCLKNCGKPSQSLYHVPRSWLKSSGNTLVLFEEIGGDPTKISFVTKQLGSS 738
+Y G Y+ KC NCG+ SQ YHVP+SWLK +GN LV+FEE GGDP IS V + + S
Sbjct: 677 SYIGTYNEKKCSTNCGEASQRWYHVPQSWLKPTGNLLVVFEEWGGDPNGISLVRRDV-DS 735
Query: 739 LCSHVTDSHPLPVD-MWGSDSKIQRKPGPVLSLECPNPNQVISSIKFASFGTPLGTCGSF 797
+C+ + + P ++ + K+ + P L C P Q I SIKFASFGTP G CGS+
Sbjct: 736 VCADIYEWQPTLMNYQMQASGKVNKPLRPKAHLSC-GPGQKIRSIKFASFGTPEGVCGSY 794
Query: 798 SRGRCSSARSLSVVRQACVGSKSCSIGVSVNTF-GDPCKGVMKSLAVEASCT 848
+G C + S CVG SCS+ V+ F GDPC VMK LAVEA C+
Sbjct: 795 RQGSCHAFHSYDAFNNLCVGQNSCSVTVAPEMFGGDPCLNVMKKLAVEAICS 846
>gi|168045621|ref|XP_001775275.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162673356|gb|EDQ59880.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 916
Score = 956 bits (2471), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 486/867 (56%), Positives = 608/867 (70%), Gaps = 44/867 (5%)
Query: 14 GFVVLATTSFGANVTYDHRAVVIGGKRRVLISGSIHYPRSTPEMWPDLIQKSKDGGLDVI 73
F A NVTYD RAV+I G+RR+LIS IHYPR+TPEMWP +IQ +KDGG DV+
Sbjct: 19 AFTTRACVRKPVNVTYDQRAVLIDGERRMLISAGIHYPRATPEMWPSIIQHAKDGGADVV 78
Query: 74 ETYVFWNLHEPVRNQYNFEGRYDLVKFVKLVAEAGLYAHLRIGPYVCAEWNFGGFPLWLH 133
+TYVFWN HEP + QYNFEGRYDLVKF+KLV +AGLY HLRIGPYVCAEWNFGGFP WL
Sbjct: 79 QTYVFWNGHEPEQGQYNFEGRYDLVKFIKLVKQAGLYFHLRIGPYVCAEWNFGGFPYWLK 138
Query: 134 FIPGIQFRTDNEPFKAEMQRFTAKIVDMMKQEKLYASQGGPIILSQIENEYGNIDSAYGA 193
IPGI FRTDNEPFK MQ FT+KIV++MK+ +L++ QGGPII++QIENEYG+I+S +G
Sbjct: 139 EIPGIVFRTDNEPFKVAMQGFTSKIVNLMKENELFSWQGGPIIMAQIENEYGDIESQFGD 198
Query: 194 AGKSYIKWAAGMALSLDTGVPWVMCQQSDAPDPIINTCNGFYCDQFTPNSNNKPKMWTEN 253
GK Y++WAA MALSLDT VPW+MC+Q DAP IINTCNGFYCD + PN+ KP +WTE+
Sbjct: 199 GGKRYVQWAADMALSLDTRVPWIMCKQEDAPANIINTCNGFYCDGWKPNTALKPILWTED 258
Query: 254 WSGWFLSFGGAVPYRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNFDRTSGGPFISTSYD 313
W+GWF ++G A P+RPVED AFAVARFFQRGG+FQNYYMY GGTNF RT+GGPF++T+YD
Sbjct: 259 WNGWFQNWGQAAPHRPVEDNAFAVARFFQRGGSFQNYYMYFGGTNFARTAGGPFMTTTYD 318
Query: 314 YDAPLDEYGLIRQPKWGHLKDLHKAIKLCEAALVATD--PTYPSLGPNLEATVYKTGSGL 371
YDAP+DEYGLIRQPKWGHLKDLH AIKLCE AL A D P +G N EA Y + +G
Sbjct: 319 YDAPIDEYGLIRQPKWGHLKDLHAAIKLCEPALTAVDTVPQSTWIGSNQEAHEY-SANGH 377
Query: 372 CSAFLANIGTNSDVTVKFNGNSYLLPAWSVSILPDCKNVVFNTAKINSVTLV------PS 425
C+AFLANI + + VTV+F G SY+LPAWSVSILPDCKNV FNTA+I + T V PS
Sbjct: 378 CAAFLANIDSENSVTVQFQGESYVLPAWSVSILPDCKNVAFNTAQIGAQTTVTRMRIAPS 437
Query: 426 FSRQSLQVAADS--SDAIGSG-------WSYINEPVGISKDDAFTKPGLLEQINTTADQS 476
SR + + +++ D I G W EP GI LLEQ+N T D S
Sbjct: 438 NSRGDIFLPSNTLVHDHISDGGVFANLKWQASAEPFGIRGSGTTVSNSLLEQLNITKDTS 497
Query: 477 DYLWYSLSTNIKADEPLLED--GSKTVLHVQSLGHALHAFINGKLVGSGYGSSSNAKVTV 534
DYLWYS S I + E + D G++ L + ++ A+H F+NGKL GS G + + V
Sbjct: 498 DYLWYSTSITITS-EGVTSDVSGTEANLVLGTMRDAVHIFVNGKLAGSAMGWN----IQV 552
Query: 535 DFPIALAPGKNTFDLLSLTVGLQNYGAFYEKTGAGITGPVQLKGSGNGTNIDLSSQQWTY 594
PI L GKN+ DLLS+T+GLQNYGA+ E GAGI G V + G G N+ LS+ +W+Y
Sbjct: 553 VQPITLKDGKNSIDLLSMTLGLQNYGAYLETWGAGIRGSVSVTGLPYG-NLSLSTAEWSY 611
Query: 595 QTGLKGEELN-FPSGSST--QWDSKSTLPKLQPLVWYKTTFDAPAGSEPVAIDFTGMGKG 651
Q GL+GEEL F +G++ WDS S+ L WYKTTFDAP G++PVA+D MGKG
Sbjct: 612 QVGLRGEELKLFHNGTADGFSWDS-SSFTNASYLTWYKTTFDAPGGTDPVALDLGSMGKG 670
Query: 652 EAWVNGQSIGRYWPTYVSQNGGCTDSCNYRGAYSSNKCLKNCGKPSQ-------SLYHVP 704
+AW+NG +GRY+ V+ GC ++C+YRGAY++NKC NCG+PSQ +YH+P
Sbjct: 671 QAWINGHHLGRYF-LMVAPQSGC-ETCDYRGAYNTNKCRTNCGEPSQRWQVIHFQMYHIP 728
Query: 705 RSWLKSSGNTLVLFEEIGGDPTKISFVTKQLGSSLCSHVTDSHPLPVDMWGSDSKIQRKP 764
R+WL+++GN LVLFEEIGGD +K+S VT+ ++C+H+ +S P P+ W I
Sbjct: 729 RAWLQATGNLLVLFEEIGGDISKVSVVTRS-AHAVCAHINESQPPPIRTWRPHRSIDAFN 787
Query: 765 GPV-LSLECPNPNQVISSIKFASFGTPLGTCGSFSRGRCSSARSLSVVRQACVGSKSCSI 823
P + LEC Q I+ IKFASFG P G+CG F G C + +S+ VR+ C+G + C I
Sbjct: 788 NPAEMLLECA-AGQHITKIKFASFGNPRGSCGHFQHGTCHANKSMEAVRKVCIGKQQCYI 846
Query: 824 GVSVNTFG--DPCKGVMKSLAVEASCT 848
V FG DPC GV KSLAV+ C+
Sbjct: 847 PVQRKFFGSIDPCPGVSKSLAVQVHCS 873
>gi|297826725|ref|XP_002881245.1| hypothetical protein ARALYDRAFT_902346 [Arabidopsis lyrata subsp.
lyrata]
gi|297327084|gb|EFH57504.1| hypothetical protein ARALYDRAFT_902346 [Arabidopsis lyrata subsp.
lyrata]
Length = 887
Score = 955 bits (2469), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 478/869 (55%), Positives = 588/869 (67%), Gaps = 35/869 (4%)
Query: 7 LLLVLCWGFVVLATTSFGA-NVTYDHRAVVIGGKRRVLISGSIHYPRSTPEMWPDLIQKS 65
L++ L F +++ + F NV+YDHRA++I KRR+L+S IHYPR+TPEMW DLI+KS
Sbjct: 17 LIIALLVYFPIVSGSFFKPFNVSYDHRALIIADKRRMLVSAGIHYPRATPEMWSDLIEKS 76
Query: 66 KDGGLDVIETYVFWNLHEPVRNQYNFEGRYDLVKFVKLVAEAGLYAHLRIGPYVCAEWNF 125
K+GG DVI+TYVFW+ HEPV+ QYNFEGRYDLVKFVKL+ +GLY HLRIGPYVCAEWNF
Sbjct: 77 KEGGADVIQTYVFWSGHEPVKGQYNFEGRYDLVKFVKLIGSSGLYLHLRIGPYVCAEWNF 136
Query: 126 GGFPLWLHFIPGIQFRTDNEPFKAEMQRFTAKIVDMMKQEKLYASQGGPIILSQIENEYG 185
GGFP+WL IPGIQFRTDNEPFK EMQ+F KIVD+M+ KL+ QGGPII+ QIENEYG
Sbjct: 137 GGFPVWLRDIPGIQFRTDNEPFKKEMQKFVTKIVDLMRDAKLFCWQGGPIIMLQIENEYG 196
Query: 186 NIDSAYGAAGKSYIKWAAGMALSLDTGVPWVMCQQSDAPDPIINTCNGFYCDQFTPNSNN 245
+++ +YG GK Y+KWAA MAL L GVPWVMC+Q+DAP+ II+ CNG+YCD F PNS
Sbjct: 197 DVEKSYGQKGKDYVKWAASMALGLGAGVPWVMCKQTDAPENIIDACNGYYCDGFKPNSQM 256
Query: 246 KPKMWTENWSGWFLSFGGAVPYRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNFDRTSGG 305
KP +WTE+W GW+ +GG++P+RP EDLAFAVARF+QRGG+FQNYYMY GGTNF RTSGG
Sbjct: 257 KPILWTEDWDGWYTKWGGSLPHRPAEDLAFAVARFYQRGGSFQNYYMYFGGTNFGRTSGG 316
Query: 306 PFISTSYDYDAPLDEYGLIRQPKWGHLKDLHKAIKLCEAALVATD-PTYPSLGPNLEATV 364
PF TSYDYDAPLDEYGL +PKWGHLKDLH AIKLCE ALVA D P Y LG N EA +
Sbjct: 317 PFYITSYDYDAPLDEYGLRSEPKWGHLKDLHAAIKLCEPALVAADAPQYRKLGSNQEAHI 376
Query: 365 YK----TGSGLCSAFLANIGTNSDVTVKFNGNSYLLPAWSVSILPDCKNVVFNTAKINSV 420
Y+ TG +C+AFLANI + VKFNG SY LP WSVSILPDC++V FNTAK+ +
Sbjct: 377 YRGDGETGGKVCAAFLANIDEHKSAHVKFNGQSYTLPPWSVSILPDCRHVAFNTAKVGAQ 436
Query: 421 TLV-------PSFSRQSLQ---VAADSSDAIGSGWSYINEPVGISKDDAFTKPGLLEQIN 470
T V PS +S+ V D+ I W + EP+GI ++ FT GLLE +N
Sbjct: 437 TSVKTVESARPSLGSKSILQKVVRQDNVSYISKSWMALKEPIGIWGENNFTFQGLLEHLN 496
Query: 471 TTADQSDYLWYSLSTNIKADEPLL--EDGSKTVLHVQSLGHALHAFINGKLVGSGYGSSS 528
T D+SDYLW+ + D+ ++G+ + + S+ L F+N +L GS G
Sbjct: 497 VTKDRSDYLWHKTRITVSEDDISFWKKNGANPTVSIDSMRDVLRVFVNKQLSGSVVGHWV 556
Query: 529 NAKVTVDFPIALAPGKNTFDLLSLTVGLQNYGAFYEKTGAGITGPVQLKGSGNGTNIDLS 588
A P+ G N LL+ TVGLQNYGAF EK GAG G +L G NG ++DL+
Sbjct: 557 KAVQ----PVRFMQGNNDLLLLTQTVGLQNYGAFLEKDGAGFRGKAKLTGFKNG-DMDLA 611
Query: 589 SQQWTYQTGLKGEE---LNFPSGSSTQWDSKSTLPKLQPLVWYKTTFDAPAGSEPVAIDF 645
WTYQ GLKGE +W + T +WYKT FD PAG++PV +D
Sbjct: 612 KSSWTYQVGLKGEAEKIYTVEHNEKAEWSTLETDASPSIFMWYKTYFDTPAGTDPVVLDL 671
Query: 646 TGMGKGEAWVNGQSIGRYWPTYVSQNGGCTDSCNYRGAYSSNKCLKNCGKPSQSLYHVPR 705
MGKG+AWVNG IGRYW +SQ GC +C+YRGAY S+KC NCGKP+Q+ YHVPR
Sbjct: 672 ESMGKGQAWVNGHHIGRYW-NIISQKDGCERTCDYRGAYYSDKCTTNCGKPTQTRYHVPR 730
Query: 706 SWLKSSGNTLVLFEEIGGDPTKISFVTKQLGSSLCSHVTDSHPLPVDMWGSDSKIQ---- 761
SWLK S N LVLFEE GG+P IS T G LC V +SH P+ W + I
Sbjct: 731 SWLKPSSNLLVLFEETGGNPFNISVKTVTAG-ILCGQVLESHYPPLRKWSTPDYINGTMS 789
Query: 762 -RKPGPVLSLECPNPNQVISSIKFASFGTPLGTCGSFSRGRCSSARSLSVVRQACVGSKS 820
P + L C + VISSI+FAS+GTP G+C FS G+C ++ SLS+V +AC G S
Sbjct: 790 INSVAPEVYLHCED-GHVISSIEFASYGTPRGSCDRFSIGKCHASNSLSIVSEACKGRTS 848
Query: 821 CSIGVSVNTF-GDPCKGVMKSLAVEASCT 848
C I VS F DPC G +K+LAV A C+
Sbjct: 849 CFIEVSNTAFRSDPCSGTLKTLAVMARCS 877
>gi|224134551|ref|XP_002327432.1| predicted protein [Populus trichocarpa]
gi|222835986|gb|EEE74407.1| predicted protein [Populus trichocarpa]
Length = 839
Score = 955 bits (2469), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 477/832 (57%), Positives = 573/832 (68%), Gaps = 18/832 (2%)
Query: 22 SFGANVTYDHRAVVIGGKRRVLISGSIHYPRSTPEMWPDLIQKSKDGGLDVIETYVFWNL 81
S A+V+YD +A+ I G+RR+LISGSIHYPRS+PEMWPDLIQK+K+GGLDVI+TYVFWN
Sbjct: 21 SVTASVSYDSKAITINGQRRILISGSIHYPRSSPEMWPDLIQKAKEGGLDVIQTYVFWNG 80
Query: 82 HEPVRNQYNFEGRYDLVKFVKLVAEAGLYAHLRIGPYVCAEWNFGGFPLWLHFIPGIQFR 141
HEP +Y FEG YDLVKFVKL EAGLY HLRIGPY+CAEWNFGGFP+WL +IPGI FR
Sbjct: 81 HEPSPGKYYFEGNYDLVKFVKLAKEAGLYVHLRIGPYICAEWNFGGFPVWLKYIPGINFR 140
Query: 142 TDNEPFKAEMQRFTAKIVDMMKQEKLYASQGGPIILSQIENEYGNIDSAYGAAGKSYIKW 201
TDN PFKA+MQ+FT K+V+MMK E+L+ +QGGPIILSQIENEYG ++ G+ GK+Y KW
Sbjct: 141 TDNGPFKAQMQKFTTKVVNMMKAERLFETQGGPIILSQIENEYGPMEYEIGSPGKAYTKW 200
Query: 202 AAGMALSLDTGVPWVMCQQSDAPDPIINTCNGFYCDQFTPNSNNKPKMWTENWSGWFLSF 261
AA MA+ L TGVPWVMC+Q DAPDPIINTCNGFYCD F+PN KPKMWTE W+GWF F
Sbjct: 201 AAEMAVGLRTGVPWVMCKQDDAPDPIINTCNGFYCDYFSPNKAYKPKMWTEAWTGWFTQF 260
Query: 262 GGAVPYRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNFDRTSGGPFISTSYDYDAPLDEY 321
GG VP+RP ED+AF+VARF Q+GG+F NYYMYHGGTNF RT+GGPFI+TSYDYDAPLDEY
Sbjct: 261 GGPVPHRPAEDMAFSVARFIQKGGSFINYYMYHGGTNFGRTAGGPFIATSYDYDAPLDEY 320
Query: 322 GLIRQPKWGHLKDLHKAIKLCEAALVATDPTYPSLGPNLEATVYKTGSGLCSAFLANIGT 381
GL+RQPKWGHLKDLH+AIKLCE ALV+ D T LG EA V+ +G C+AFLAN
Sbjct: 321 GLLRQPKWGHLKDLHRAIKLCEPALVSGDATVIPLGNYQEAHVFNYKAGGCAAFLANYHQ 380
Query: 382 NSDVTVKFNGNSYLLPAWSVSILPDCKNVVFNTAKINSVTLVPSFSRQSLQVAADSSDAI 441
S V F Y LP WS+SILPDCKN V+NTA++ + + + +
Sbjct: 381 RSFAKVSFRNMHYNLPPWSISILPDCKNTVYNTARVGAQSARMKMTPVPMHG-------- 432
Query: 442 GSGWSYINEPVGISKDDAFTKPGLLEQINTTADQSDYLWYSLSTNIKADEPLLEDGSKTV 501
G W NE S D FT GLLEQINTT D SDYLWY +I E L G V
Sbjct: 433 GFSWQAYNEEPSASGDSTFTMVGLLEQINTTRDVSDYLWYMTDVHIDPSEGFLRSGKYPV 492
Query: 502 LHVQSLGHALHAFINGKLVGSGYGSSSNAKVTVDFPIALAPGKNTFDLLSLTVGLQNYGA 561
L V S GHALH FING+L G+ YGS K+T + L G N LLS+ VGL N G
Sbjct: 493 LGVLSAGHALHVFINGQLSGTAYGSLDFPKLTFTQGVKLRAGVNKISLLSIAVGLPNVGP 552
Query: 562 FYEKTGAGITGPVQLKGSGNGTNIDLSSQQWTYQTGLKGEELNFPS---GSSTQWDSKST 618
+E AGI GPV L G G DLS Q+W+Y+ GL GE L S SS +W S
Sbjct: 553 HFETWNAGILGPVTLNGLNEGRR-DLSWQKWSYKIGLHGEALGLHSISGSSSVEWAEGSL 611
Query: 619 LPKLQPLVWYKTTFDAPAGSEPVAIDFTGMGKGEAWVNGQSIGRYWPTYVSQNGGCTDSC 678
+ + QPL WYKTTF+APAG+ P+A+D MGKG+ W+NGQ +GR+WP Y + +G C D C
Sbjct: 612 VAQRQPLSWYKTTFNAPAGNSPLALDMGSMGKGQIWINGQHVGRHWPAYKA-SGTCGD-C 669
Query: 679 NYRGAYSSNKCLKNCGKPSQSLYHVPRSWLKSSGNTLVLFEEIGGDPTKISFVTKQLGSS 738
+Y G Y+ KC NCG+ SQ YHVP+SWLK +GN LV+FEE GGDP IS V + + S
Sbjct: 670 SYIGTYNEKKCSTNCGEASQRWYHVPQSWLKPTGNLLVVFEEWGGDPNGISLVRRDV-DS 728
Query: 739 LCSHVTDSHPLPVD-MWGSDSKIQRKPGPVLSLECPNPNQVISSIKFASFGTPLGTCGSF 797
+C+ + + P ++ + K+ + P L C P Q I SIKFASFGTP G CGS+
Sbjct: 729 VCADIYEWQPTLMNYQMQASGKVNKPLRPKAHLSC-GPGQKIRSIKFASFGTPEGVCGSY 787
Query: 798 SRGRCSSARSLSVVRQACVGSKSCSIGVSVNTF-GDPCKGVMKSLAVEASCT 848
+G C + S CVG SCS+ V+ F GDPC VMK LAVEA C+
Sbjct: 788 RQGSCHAFHSYDAFNNLCVGQNSCSVTVAPEMFGGDPCLNVMKKLAVEAICS 839
>gi|359476858|ref|XP_002274449.2| PREDICTED: beta-galactosidase 3 [Vitis vinifera]
Length = 898
Score = 955 bits (2468), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 476/850 (56%), Positives = 587/850 (69%), Gaps = 27/850 (3%)
Query: 7 LLLVLCWGFVVLATTSFGANVTYDHRAVVIGGKRRVLISGSIHYPRSTPEMWPDLIQKSK 66
L LVLC + L + +VTYD +A+VI G+RR+LISGSIHYPRSTP+MW D+IQK+K
Sbjct: 62 LFLVLCM-VLQLGSQLIQCSVTYDRKAIVINGQRRILISGSIHYPRSTPDMWEDIIQKAK 120
Query: 67 DGGLDVIETYVFWNLHEPVRNQYNFEGRYDLVKFVKLVAEAGLYAHLRIGPYVCAEWNFG 126
DGGLDV+ETYVFWN+HEP YNFEGRYDLV+F++ V +AGLYAHLRIGPYVCAEWNFG
Sbjct: 121 DGGLDVVETYVFWNVHEPSPGSYNFEGRYDLVRFIRTVQKAGLYAHLRIGPYVCAEWNFG 180
Query: 127 GFPLWLHFIPGIQFRTDNEPFKAEMQRFTAKIVDMMKQEKLYASQGGPIILSQIENEYGN 186
GFP+WL ++PGI FRTDNEPFK MQ FT KIV +MK E+L+ SQGGPIILSQIENEYG
Sbjct: 181 GFPVWLKYVPGISFRTDNEPFKRAMQGFTEKIVGLMKSERLFESQGGPIILSQIENEYGV 240
Query: 187 IDSAYGAAGKSYIKWAAGMALSLDTGVPWVMCQQSDAPDPIINTCNGFYCDQFTPNSNNK 246
G AG Y+ WAA MA+ L TGVPWVMC++ DAPDP+INTCNGFYCD F+PN K
Sbjct: 241 QSKLLGDAGHDYMTWAANMAVGLGTGVPWVMCKEEDAPDPVINTCNGFYCDAFSPNKPYK 300
Query: 247 PKMWTENWSGWFLSFGGAVPYRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNFDRTSGGP 306
P +WTE WSGWF FGG + RPV+DLAFAVARF Q+GG+F NYYMYHGGTNF RT+GGP
Sbjct: 301 PTIWTEAWSGWFNEFGGPLHQRPVQDLAFAVARFIQKGGSFVNYYMYHGGTNFGRTAGGP 360
Query: 307 FISTSYDYDAPLDEYGLIRQPKWGHLKDLHKAIKLCEAALVATDPTYPSLGPNLEATVYK 366
FI+TSYDYDAP+DEYGL+RQPK+GHLK+LH++IKLCE ALV+ DP SLG +A VY
Sbjct: 361 FITTSYDYDAPIDEYGLVRQPKYGHLKELHRSIKLCERALVSADPIVSSLGSFQQAHVYS 420
Query: 367 TGSGLCSAFLANIGTNSDVTVKFNGNSYLLPAWSVSILPDCKNVVFNTAKINSVTLVPSF 426
+ +G C+AFL+N T S V FN Y LP WS+SILPDC+N VFNTAK+
Sbjct: 421 SDAGDCAAFLSNYDTKSSARVMFNNMHYNLPPWSISILPDCRNAVFNTAKVGV------- 473
Query: 427 SRQSLQVAADSSDAIGSGWSYINEPVGISKDDA--FTKPGLLEQINTTADQSDYLWYSLS 484
Q+ + ++A W +E + S DD+ FT GLLEQIN T D SDYLWY
Sbjct: 474 --QTAHMEMLPTNAEMLSWESYDEDIS-SLDDSSTFTTLGLLEQINVTRDASDYLWYITR 530
Query: 485 TNIKADEPLLEDGSKTVLHVQSLGHALHAFINGKLVGSGYGSSSNAKVTVDFPIALAPGK 544
+I + E L G L +Q+ GHA+H FING+L GS +G+ + T + L G
Sbjct: 531 IDIGSSESFLRGGELPTLILQTTGHAVHVFINGQLTGSAFGTREYRRFTFTEKVNLHAGT 590
Query: 545 NTFDLLSLTVGLQNYGAFYEKTGAGITGPVQLKGSGNGTNIDLSSQQWTYQTGLKGEELN 604
NT LLS+ VGL N G +E GI GPV L G G DLS Q+WTY+ GLKGE +N
Sbjct: 591 NTIALLSVAVGLPNVGGHFETWNTGILGPVALHGLNQG-KWDLSWQRWTYKVGLKGEAMN 649
Query: 605 F--PSG-SSTQWDSKS-TLPKLQPLVWYKTTFDAPAGSEPVAIDFTGMGKGEAWVNGQSI 660
P+G SS W S + QPL W+K F+AP G EP+A+D GMGKG+ W+NGQSI
Sbjct: 650 LVSPNGISSVDWMQGSLAAQRQQPLTWHKAFFNAPEGDEPLALDMEGMGKGQVWINGQSI 709
Query: 661 GRYWPTYVSQNGGCTDSCNYRGAYSSNKCLKNCGKPSQSLYHVPRSWLKSSGNTLVLFEE 720
GRYW Y NG C C+Y G Y KC CG+P+Q YHVPRSWLK + N LV+FEE
Sbjct: 710 GRYWTAYA--NGNC-QGCSYSGTYRPPKCQLGCGQPTQRWYHVPRSWLKPTQNLLVVFEE 766
Query: 721 IGGDPTKISFVTKQLGSSLCSHVTDSHPLPVDMWGSDS--KIQRKPGPVLSLECPNPNQV 778
+GGDP++IS V + + +S+C+ V + HP + W +S K + P + L C P Q
Sbjct: 767 LGGDPSRISLVRRSM-TSVCADVFEYHP-NIKNWHIESYGKTEELHKPKVHLRC-GPGQS 823
Query: 779 ISSIKFASFGTPLGTCGSFSRGRCSSARSLSVVRQACVGSKSCSIGVSVNTFG-DPCKGV 837
ISSIKFAS+GTPLGTCGSF +G C + S ++V + C+G + C++ +S F DPC V
Sbjct: 824 ISSIKFASYGTPLGTCGSFEQGPCHAPDSYAIVEKRCIGRQRCAVTISNTNFAQDPCPNV 883
Query: 838 MKSLAVEASC 847
+K L+VEA C
Sbjct: 884 LKRLSVEAVC 893
>gi|224094887|ref|XP_002310279.1| predicted protein [Populus trichocarpa]
gi|222853182|gb|EEE90729.1| predicted protein [Populus trichocarpa]
Length = 847
Score = 955 bits (2468), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 465/829 (56%), Positives = 578/829 (69%), Gaps = 23/829 (2%)
Query: 26 NVTYDHRAVVIGGKRRVLISGSIHYPRSTPEMWPDLIQKSKDGGLDVIETYVFWNLHEPV 85
+VTYD +A++I G+RR+L SGSIHYPRSTP+MW DLIQK+KDGG+DVIETYVFWN+HEP
Sbjct: 28 SVTYDRKAIMINGQRRILFSGSIHYPRSTPDMWEDLIQKAKDGGIDVIETYVFWNVHEPT 87
Query: 86 RNQYNFEGRYDLVKFVKLVAEAGLYAHLRIGPYVCAEWNFGGFPLWLHFIPGIQFRTDNE 145
Y+FEGRYD+V+F+K + AGLYAHLRIGPYVCAEWNFGGFP+WL ++PGI FRTDNE
Sbjct: 88 PGNYHFEGRYDIVRFMKTIQRAGLYAHLRIGPYVCAEWNFGGFPVWLKYVPGISFRTDNE 147
Query: 146 PFKAEMQRFTAKIVDMMKQEKLYASQGGPIILSQIENEYGNIDSAYGAAGKSYIKWAAGM 205
PFK MQ FT KIV +MK E L+ SQGGPIILSQIENEYG +GAAG +Y+ WAA M
Sbjct: 148 PFKRAMQGFTEKIVGLMKAENLFESQGGPIILSQIENEYGVQSKLFGAAGYNYMTWAANM 207
Query: 206 ALSLDTGVPWVMCQQSDAPDPIINTCNGFYCDQFTPNSNNKPKMWTENWSGWFLSFGGAV 265
A+ TGVPWVMC++ DAPDP+INTCNGFYCD F PN KP +WTE WSGWF FGG +
Sbjct: 208 AIQTGTGVPWVMCKEDDAPDPVINTCNGFYCDSFAPNKPYKPTIWTEAWSGWFSEFGGTI 267
Query: 266 PYRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNFDRTSGGPFISTSYDYDAPLDEYGLIR 325
RPV+DLAFAVA+F Q+GG+F NYYM+HGGTNF R++GGPFI+TSYDYDAP+DEYGLIR
Sbjct: 268 HQRPVQDLAFAVAKFIQKGGSFINYYMFHGGTNFGRSAGGPFITTSYDYDAPIDEYGLIR 327
Query: 326 QPKWGHLKDLHKAIKLCEAALVATDPTYPSLGPNLEATVYKTGSGLCSAFLANIGTNSDV 385
QPK+GHLK+LH++IK+CE ALV+ DP LG + VY T SG C+AFLAN T S
Sbjct: 328 QPKYGHLKELHRSIKMCERALVSVDPIVTQLGTYQQVHVYSTESGDCAAFLANYDTKSAA 387
Query: 386 TVKFNGNSYLLPAWSVSILPDCKNVVFNTAKINSVTLVPSFSRQSLQVAADSSDAIGSGW 445
V FN Y LP WS+SILPDC+NVVFNTAK+ Q+ Q+ ++ I S
Sbjct: 388 RVLFNNMHYNLPPWSISILPDCRNVVFNTAKVGV---------QTSQMEMLPTNGIFSWE 438
Query: 446 SYINEPVGISKDDAFTKPGLLEQINTTADQSDYLWYSLSTNIKADEPLLEDGSKTVLHVQ 505
SY + + FT GLLEQIN T D SDYLWY S +I + E L G L +Q
Sbjct: 439 SYDEDISSLDDSSTFTTAGLLEQINVTRDASDYLWYMTSVDIGSSESFLHGGELPTLIIQ 498
Query: 506 SLGHALHAFINGKLVGSGYGSSSNAKVTVDFPIALAPGKNTFDLLSLTVGLQNYGAFYEK 565
S GHA+H FING+L GS +G+ N + T + L PG N LLS+ VGL N G YE
Sbjct: 499 STGHAVHIFINGQLSGSAFGTRENRRFTYTGKVNLRPGTNRIALLSVAVGLPNVGGHYES 558
Query: 566 TGAGITGPVQLKGSGNGTNIDLSSQQWTYQTGLKGEELNF---PSGSSTQW-DSKSTLPK 621
GI GPV L G G DLS Q+WTYQ GLKGE +N S +S +W S +
Sbjct: 559 WNTGILGPVALHGLDQG-KWDLSWQKWTYQVGLKGEAMNLLSPDSVTSVEWMQSSLAAQR 617
Query: 622 LQPLVWYKTTFDAPAGSEPVAIDFTGMGKGEAWVNGQSIGRYWPTYVSQNGGCTDSCNYR 681
QPL W+K F+AP G EP+A+D GMGKG+ W+NGQSIGRYW Y S N + C+Y
Sbjct: 618 PQPLTWHKAYFNAPEGDEPLALDMEGMGKGQIWINGQSIGRYWTAYASGN---CNGCSYA 674
Query: 682 GAYSSNKCLKNCGKPSQSLYHVPRSWLKSSGNTLVLFEEIGGDPTKISFVTKQLGSSLCS 741
G + KC CG+P+Q YHVPRSWLK + N LV+FEE+GGDP++IS V + L +S+C+
Sbjct: 675 GTFRPTKCQLGCGQPTQRWYHVPRSWLKPTNNLLVVFEELGGDPSRISLVKRSL-ASVCA 733
Query: 742 HVTDSHPLPVDMWGSDS--KIQRKPGPVLSLECPNPNQVISSIKFASFGTPLGTCGSFSR 799
V++ HP + W +S + + P + L C Q I+SIKFASFGTPLGTCGS+ +
Sbjct: 734 EVSEFHPT-IKNWQIESYGRAEEFHSPKVHLRCSG-GQSITSIKFASFGTPLGTCGSYQQ 791
Query: 800 GRCSSARSLSVVRQACVGSKSCSIGVSVNTFG-DPCKGVMKSLAVEASC 847
G C ++ S +++ + C+G + C++ +S + FG DPC VMK L+VEA C
Sbjct: 792 GACHASTSYAILEKKCIGKQRCAVTISNSNFGQDPCPNVMKKLSVEAVC 840
>gi|34148077|gb|AAQ62586.1| putative beta-galactosidase [Glycine max]
Length = 909
Score = 954 bits (2467), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 486/859 (56%), Positives = 587/859 (68%), Gaps = 46/859 (5%)
Query: 26 NVTYDHRAVVIGGKRRVLISGSIHYPRSTPEMWPDLIQKSKDGGLDVIETYVFWNLHEPV 85
NV+YDHRA+++ GKRR LIS IHYPR+TPEMWPDLI KSK+GG DVIETYVFWN HEPV
Sbjct: 46 NVSYDHRALILNGKRRFLISAGIHYPRATPEMWPDLIAKSKEGGADVIETYVFWNGHEPV 105
Query: 86 RNQYNFEGRYDLVKFVKLVAEAGLYAHLRIGPYVCAEWNFGGFPLWLHFIPGIQFRTDNE 145
R QYNFEGRYDLVKFV+L A GLY LRIGPY CAEWNFGGFP+WL IPGI+FRT+N
Sbjct: 106 RGQYNFEGRYDLVKFVRLAASHGLYFFLRIGPYACAEWNFGGFPVWLRDIPGIEFRTNNA 165
Query: 146 PFKAEMQRFTAKIVDMMKQEKLYASQGGPIILSQIENEYGNIDSAYGAAGKSYIKWAAGM 205
PFK EM+RF +K+V++M++E+L++ QGGPIIL QIENEYGNI+++YG GK Y+KWAA M
Sbjct: 166 PFKEEMKRFVSKVVNLMREERLFSWQGGPIILLQIENEYGNIENSYGKGGKEYMKWAAKM 225
Query: 206 ALSLDTGVPWVMCQQSDAPDPIINTCNGFYCDQFTPNSNNKPKMWTENWSGWFLSFGGAV 265
ALSL GVPWVMC+Q DAP II+TCN +YCD F PNS+NKP MWTENW GW+ +G +
Sbjct: 226 ALSLGAGVPWVMCRQQDAPYDIIDTCNAYYCDGFKPNSHNKPTMWTENWDGWYTQWGERL 285
Query: 266 PYRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNFDRTSGGPFISTSYDYDAPLDEYGLIR 325
P+RPVEDLAFAVARFFQRGG+FQNYYMY GGTNF RT+GGP TSYDYDAP+DEYGL+R
Sbjct: 286 PHRPVEDLAFAVARFFQRGGSFQNYYMYFGGTNFGRTAGGPLQITSYDYDAPIDEYGLLR 345
Query: 326 QPKWGHLKDLHKAIKLCEAALVATD-PTYPSLGPNLEATVYKTG-------------SGL 371
+PKWGHLKDLH A+KLCE ALVATD PTY LGP EA VY+ S +
Sbjct: 346 EPKWGHLKDLHAALKLCEPALVATDSPTYIKLGPKQEAHVYQANVHLEGLNLSMFESSSI 405
Query: 372 CSAFLANIGTNSDVTVKFNGNSYLLPAWSVSILPDCKNVVFNTAKIN---SVTLVPS--- 425
CSAFLANI + TV F G Y +P WSVS+LPDC+N VFNTAK+ SV LV S
Sbjct: 406 CSAFLANIDEWKEATVTFRGQRYTIPPWSVSVLPDCRNTVFNTAKVRAQTSVKLVESYLP 465
Query: 426 -----FSRQSLQVAADSSDAIGSGWSYINEPVGISKDDAFTKPGLLEQINTTADQSDYLW 480
F Q L+ D I W EP+ I +FT G+ E +N T DQSDYLW
Sbjct: 466 TVSNIFPAQQLRHQNDFY-YISKSWMTTKEPLNIWSKSSFTVEGIWEHLNVTKDQSDYLW 524
Query: 481 YSLSTNIKADEPLL--EDGSKTVLHVQSLGHALHAFINGKLVGSGYGSSSNAKVTVDFPI 538
YS + + L E+ L + + L FING+L+G+ G T+ F
Sbjct: 525 YSTRVYVSDSDILFWEENDVHPKLTIDGVRDILRVFINGQLIGNVVGHWIKVVQTLQF-- 582
Query: 539 ALAPGKNTFDLLSLTVGLQNYGAFYEKTGAGITGPVQLKGSGNGTNIDLSSQQWTYQTGL 598
PG N LL+ TVGLQNYGAF EK GAGI G +++ G NG +IDLS WTYQ GL
Sbjct: 583 --LPGYNDLTLLTQTVGLQNYGAFLEKDGAGIRGKIKITGFENG-DIDLSKSLWTYQVGL 639
Query: 599 KGEELNFPSGSSTQWDSKSTLPKLQP--LVWYKTTFDAPAGSEPVAIDFTGMGKGEAWVN 656
+GE L F S + + P P WYKT FD P G +PVA+DF MGKG+AWVN
Sbjct: 640 QGEFLKFYSEENENSEWVELTPDAIPSTFTWYKTYFDVPGGIDPVALDFKSMGKGQAWVN 699
Query: 657 GQSIGRYWPTYVSQNGGCTDSCNYRGAYSSNKCLKNCGKPSQSLYHVPRSWLKSSGNTLV 716
GQ IGRYW T VS GC C+YRGAY+S+KC NCGKP+Q+LYHVPRSWLK++ N LV
Sbjct: 700 GQHIGRYW-TRVSPKSGCQQVCDYRGAYNSDKCSTNCGKPTQTLYHVPRSWLKATNNLLV 758
Query: 717 LFEEIGGDPTKISFVTKQLGSSLCSHVTDSHPLPV------DMWGSDSKIQRKPGPVLSL 770
+ EE GG+P +IS V +C+ V++S+ P+ D+ G + P L L
Sbjct: 759 ILEETGGNPFEIS-VKLHSSRIICAQVSESNYPPLQKLVNADLIGEEVSANNMI-PELHL 816
Query: 771 ECPNPNQVISSIKFASFGTPLGTCGSFSRGRCSSARSLSVVRQACVGSKSCSIGVSVNTF 830
C ISS+ FASFGTP G+C +FSRG C + S+S+V +AC G +SCSI +S + F
Sbjct: 817 HC-QQGHTISSVAFASFGTPGGSCQNFSRGNCHAPSSMSIVSEACQGKRSCSIKISDSAF 875
Query: 831 G-DPCKGVMKSLAVEASCT 848
G DPC GV+K+L+VEA CT
Sbjct: 876 GVDPCPGVVKTLSVEARCT 894
>gi|449459196|ref|XP_004147332.1| PREDICTED: beta-galactosidase 3-like [Cucumis sativus]
gi|449497145|ref|XP_004160325.1| PREDICTED: beta-galactosidase 3-like [Cucumis sativus]
Length = 844
Score = 954 bits (2465), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 472/850 (55%), Positives = 586/850 (68%), Gaps = 16/850 (1%)
Query: 5 EILLLVLCWGFVVLATTSFGANVTYDHRAVVIGGKRRVLISGSIHYPRSTPEMWPDLIQK 64
++ LVLC + ANVTYD R+++I G R++LIS SIHYPRS P MWP LIQ
Sbjct: 4 KLSFLVLC----LFLPLCLAANVTYDRRSLIIDGHRKLLISASIHYPRSVPAMWPSLIQN 59
Query: 65 SKDGGLDVIETYVFWNLHEPVRNQYNFEGRYDLVKFVKLVAEAGLYAHLRIGPYVCAEWN 124
+K+GG+DVIETYVFWN HE + Y+F+GR+DLVKF+ +V AGLY LRIGP+V AEWN
Sbjct: 60 AKEGGVDVIETYVFWNGHELSPDNYHFDGRFDLVKFINIVHNAGLYLILRIGPFVAAEWN 119
Query: 125 FGGFPLWLHFIPGIQFRTDNEPFKAEMQRFTAKIVDMMKQEKLYASQGGPIILSQIENEY 184
FGG P+WLH+IP FRTDN FK MQ+FT IV +MK+EKL+ASQGGPIILSQ+ENEY
Sbjct: 120 FGGVPVWLHYIPNTVFRTDNASFKFYMQKFTTYIVSLMKKEKLFASQGGPIILSQVENEY 179
Query: 185 GNIDSAYGAAGKSYIKWAAGMALSLDTGVPWVMCQQSDAPDPIINTCNGFYCDQFTPNSN 244
G+I+ YG GK Y WAA MA+S + GVPW+MCQQ DAPDP+INTCN FYCDQFTPNS
Sbjct: 180 GDIERVYGEGGKPYAMWAAQMAVSQNIGVPWIMCQQYDAPDPVINTCNSFYCDQFTPNSP 239
Query: 245 NKPKMWTENWSGWFLSFGGAVPYRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNFDRTSG 304
NKPKMWTENW GWF +FG P+RP ED+AF+VARFFQ+GG+ QNYYMYHGGTNF RT+G
Sbjct: 240 NKPKMWTENWPGWFKTFGARDPHRPPEDIAFSVARFFQKGGSLQNYYMYHGGTNFGRTAG 299
Query: 305 GPFISTSYDYDAPLDEYGLIRQPKWGHLKDLHKAIKLCEAALVATDPTYPSLGPNLEATV 364
GPFI+TSYDYDAP+DEYGL R PKWGHLK+LH+AIKL E L+ ++PTY SLGP+LEA V
Sbjct: 300 GPFITTSYDYDAPIDEYGLPRLPKWGHLKELHRAIKLTERVLLNSEPTYVSLGPSLEADV 359
Query: 365 YKTGSGLCSAFLANIGTNSDVTVKFNGNSYLLPAWSVSILPDCKNVVFNTAKINSVTLVP 424
Y SG C+AF+ANI D TV+F SY LPAWSVSILPDCKNVVFNTA I S T +
Sbjct: 360 YTDSSGACAAFIANIDEKDDKTVQFRNISYHLPAWSVSILPDCKNVVFNTAMIRSQTAMV 419
Query: 425 SFSRQSLQVAADSS--DAIGSGWSYINEPVGISKDDAFTKPGLLEQINTTADQSDYLWYS 482
+ LQ +AD++ D W E GI F K L++ +NTT D +DYLWY+
Sbjct: 420 EMVPEELQPSADATNKDLKALKWEVFVEQPGIWGKADFVKNVLVDHLNTTKDTTDYLWYT 479
Query: 483 LSTNIKADEPLLEDGSKTVLHVQSLGHALHAFINGKLVGSGYGSSSNAKVTVDFPIALAP 542
S + +E L+ GS+ VL V+S GHALHAFIN KL S G+ S+ I+L
Sbjct: 480 TSIFVNENEKFLK-GSQPVLVVESKGHALHAFINKKLQVSATGNGSDITFKFKQAISLKA 538
Query: 543 GKNTFDLLSLTVGLQNYGAFYEKTGAGITGPVQLKGSGNGTNIDLSSQQWTYQTGLKGEE 602
GKN LLS+TVGLQN G FYE GAG++ V ++G NG +DLSS W+Y+ GL+GE
Sbjct: 539 GKNEIALLSMTVGLQNAGPFYEWVGAGLSK-VVIEGFNNGP-VDLSSYAWSYKIGLQGEH 596
Query: 603 LNF--PSG-SSTQWDSKSTLPKLQPLVWYKTTFDAPAGSEPVAIDFTGMGKGEAWVNGQS 659
L P G + +W S PK QPL WYK D P+G+EPV +D MGKG AW+NG+
Sbjct: 597 LGIYKPDGIKNVKWLSSREPPKQQPLTWYKVILDPPSGNEPVGLDMVHMGKGLAWLNGEE 656
Query: 660 IGRYWPTYVSQNGGCTDSCNYRGAYSSNKCLKNCGKPSQSLYHVPRSWLKSSGNTLVLFE 719
IGRYWPT S + C C+YRG + +KCL CG+P+Q YHVPRSW K SGN LV+FE
Sbjct: 657 IGRYWPTKSSIHDVCVQKCDYRGKFRPDKCLTGCGEPTQRWYHVPRSWFKPSGNILVIFE 716
Query: 720 EIGGDPTKISFVTKQLGSSLCSHVTDSHPLPVDMWGSDSKIQRKPGPVLSLECPNPNQVI 779
E GGDPT+I +++ +C+H+ + HP ++ W ++RK + L+CP+ N I
Sbjct: 717 EKGGDPTQIRLSKRKV-LGICAHLGEGHP-SIESWSEAENVERKSKATVDLKCPD-NGRI 773
Query: 780 SSIKFASFGTPLGTCGSFSRGRCSSARSLSVVRQACVGSKSCSIGVSVNTFGD-PCKGVM 838
+ IKFASFGTP G+CGS+S G C S+S+V + C+ C I + F C
Sbjct: 774 AKIKFASFGTPQGSCGSYSIGDCHDPNSISLVEKVCLNRNECRIELGEEGFNKGLCPTAS 833
Query: 839 KSLAVEASCT 848
K LAVEA C+
Sbjct: 834 KKLAVEAMCS 843
>gi|326512146|dbj|BAJ96054.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 847
Score = 953 bits (2464), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 472/831 (56%), Positives = 590/831 (70%), Gaps = 27/831 (3%)
Query: 27 VTYDHRAVVIGGKRRVLISGSIHYPRSTPEMWPDLIQKSKDGGLDVIETYVFWNLHEPVR 86
VTYD +AV+I G+RR+L SGSIHYPRSTPEMW LIQK+KDGGLDVI+TYVFWN HEP
Sbjct: 32 VTYDRKAVLINGQRRILFSGSIHYPRSTPEMWEGLIQKAKDGGLDVIQTYVFWNGHEPTP 91
Query: 87 NQYNFEGRYDLVKFVKLVAEAGLYAHLRIGPYVCAEWNFGGFPLWLHFIPGIQFRTDNEP 146
YNFEGRYDLVKF+K +AGL+ HLRIGPY+C EWNFGGFP+WL ++PGI FRTDNEP
Sbjct: 92 GSYNFEGRYDLVKFIKTAQKAGLFVHLRIGPYICGEWNFGGFPVWLKYVPGISFRTDNEP 151
Query: 147 FKAEMQRFTAKIVDMMKQEKLYASQGGPIILSQIENEYGNIDSAYGAAGKSYIKWAAGMA 206
FKA MQ FT KIV MMK E+L+ASQGGPIILSQIENEYG + +GAAGKSY WAA MA
Sbjct: 152 FKAAMQGFTEKIVGMMKSEELFASQGGPIILSQIENEYGPEEKEFGAAGKSYSDWAAKMA 211
Query: 207 LSLDTGVPWVMCQQSDAPDPIINTCNGFYCDQFTPNSNNKPKMWTENWSGWFLSFGGAVP 266
+ LDTGVPWVMC+Q DAPDP+IN CNGFYCD FTPN+ +KP MWTE W+GWF FGG +
Sbjct: 212 VGLDTGVPWVMCKQEDAPDPVINACNGFYCDAFTPNTPSKPTMWTEAWTGWFTEFGGTIR 271
Query: 267 YRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNFDRTSGGPFISTSYDYDAPLDEYGLIRQ 326
RPVEDL+FAVARF Q+GG+F NYYMYHGGTNF RT+GGPFI+TSYDYDAPLDEYGL R+
Sbjct: 272 KRPVEDLSFAVARFVQKGGSFINYYMYHGGTNFGRTAGGPFITTSYDYDAPLDEYGLARE 331
Query: 327 PKWGHLKDLHKAIKLCEAALVATDPTYPSLGPNLEATVYKTGSGLCSAFLANIGTNSDVT 386
PK+GHLK+LHKAIKLCE ALV+ DPT SLG EA VY++ SG C+AFLAN +NS
Sbjct: 332 PKYGHLKELHKAIKLCEQALVSVDPTVTSLGSMQEAHVYRSPSG-CAAFLANYNSNSHAK 390
Query: 387 VKFNGNSYLLPAWSVSILPDCKNVVFNTAKINSVTLVPSFSRQSLQVAADSSDAIGSGWS 446
+ F+ Y LP WS+SILPDCK VV+NTA + T +Q+ +D + ++ W
Sbjct: 391 IVFDNEHYSLPPWSISILPDCKTVVYNTATVGVQT-------SQMQMWSDGASSM--MWE 441
Query: 447 YINEPVG-ISKDDAFTKPGLLEQINTTADQSDYLWYSLSTNIKADEPLLEDGSKTVLHVQ 505
+E VG ++ T GLLEQ+N T D SDYLWY S ++ E L+ G L VQ
Sbjct: 442 RYDEEVGSLAAAPLLTTTGLLEQLNATRDTSDYLWYMTSVDVSPSEKSLQGGKPLSLTVQ 501
Query: 506 SLGHALHAFINGKLVGSGYGSSSNAKVTVDFPIALAPGKNTFDLLSLTVGLQNYGAFYEK 565
S GHALH F+NG+L GS G+ + +++ + L G N LLS+ GL N G YE
Sbjct: 502 SAGHALHIFVNGQLQGSASGTREDKRISYKGDVKLRAGTNKISLLSVACGLPNIGVHYET 561
Query: 566 TGAGITGPVQLKGSGNGTNIDLSSQQWTYQTGLKGEELNFPS---GSSTQWDSKSTLPKL 622
G+ GPV L G G+ DL+ Q WTYQ GLKGE++N S SS +W S + +
Sbjct: 562 WNTGVNGPVVLHGLDEGSR-DLTWQTWTYQVGLKGEQMNLNSLEGASSVEWMQGSLIAQN 620
Query: 623 Q-PLVWYKTTFDAPAGSEPVAIDFTGMGKGEAWVNGQSIGRYWPTYVSQNGGCTDSCNYR 681
Q PL WY+ FD P+G EP+A+D MGKG+ W+NGQSIGRY Y + G C D C+Y
Sbjct: 621 QMPLAWYRAYFDTPSGDEPLALDMGSMGKGQIWINGQSIGRYSLAYAT--GDCKD-CSYT 677
Query: 682 GAYSSNKCLKNCGKPSQSLYHVPRSWLKSSGNTLVLFEEIGGDPTKISFVTKQLGSSLCS 741
G++ + KC CG+P+Q YHVP+SWL+ + N LV+FEE+GGD +KIS V + + S++C+
Sbjct: 678 GSFRAIKCQAGCGQPTQRWYHVPKSWLQPTRNLLVVFEELGGDTSKISLVKRSV-SNVCA 736
Query: 742 HVTDSHPLPVDMWGSDSKIQRKPGPVLS---LECPNPNQVISSIKFASFGTPLGTCGSFS 798
V++ HP + W +++ + KP S L C P Q IS+IKFASFGTPLGTCGSF
Sbjct: 737 DVSEFHP-SIKNWQTENSGEAKPELRRSKVHLRCA-PGQSISAIKFASFGTPLGTCGSFE 794
Query: 799 RGRCSSARSLSVVRQACVGSKSCSIGVSVNTF-GDPCKGVMKSLAVEASCT 848
+G+C S +S +V+ + C+G + C++ +S + F GDPC VMK +AVEA C+
Sbjct: 795 QGQCHSTKSQTVL-ENCIGKQRCAVTISPDNFGGDPCPNVMKRVAVEAVCS 844
>gi|297735069|emb|CBI17431.3| unnamed protein product [Vitis vinifera]
Length = 845
Score = 953 bits (2463), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 476/850 (56%), Positives = 587/850 (69%), Gaps = 27/850 (3%)
Query: 7 LLLVLCWGFVVLATTSFGANVTYDHRAVVIGGKRRVLISGSIHYPRSTPEMWPDLIQKSK 66
L LVLC + L + +VTYD +A+VI G+RR+LISGSIHYPRSTP+MW D+IQK+K
Sbjct: 9 LFLVLCM-VLQLGSQLIQCSVTYDRKAIVINGQRRILISGSIHYPRSTPDMWEDIIQKAK 67
Query: 67 DGGLDVIETYVFWNLHEPVRNQYNFEGRYDLVKFVKLVAEAGLYAHLRIGPYVCAEWNFG 126
DGGLDV+ETYVFWN+HEP YNFEGRYDLV+F++ V +AGLYAHLRIGPYVCAEWNFG
Sbjct: 68 DGGLDVVETYVFWNVHEPSPGSYNFEGRYDLVRFIRTVQKAGLYAHLRIGPYVCAEWNFG 127
Query: 127 GFPLWLHFIPGIQFRTDNEPFKAEMQRFTAKIVDMMKQEKLYASQGGPIILSQIENEYGN 186
GFP+WL ++PGI FRTDNEPFK MQ FT KIV +MK E+L+ SQGGPIILSQIENEYG
Sbjct: 128 GFPVWLKYVPGISFRTDNEPFKRAMQGFTEKIVGLMKSERLFESQGGPIILSQIENEYGV 187
Query: 187 IDSAYGAAGKSYIKWAAGMALSLDTGVPWVMCQQSDAPDPIINTCNGFYCDQFTPNSNNK 246
G AG Y+ WAA MA+ L TGVPWVMC++ DAPDP+INTCNGFYCD F+PN K
Sbjct: 188 QSKLLGDAGHDYMTWAANMAVGLGTGVPWVMCKEEDAPDPVINTCNGFYCDAFSPNKPYK 247
Query: 247 PKMWTENWSGWFLSFGGAVPYRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNFDRTSGGP 306
P +WTE WSGWF FGG + RPV+DLAFAVARF Q+GG+F NYYMYHGGTNF RT+GGP
Sbjct: 248 PTIWTEAWSGWFNEFGGPLHQRPVQDLAFAVARFIQKGGSFVNYYMYHGGTNFGRTAGGP 307
Query: 307 FISTSYDYDAPLDEYGLIRQPKWGHLKDLHKAIKLCEAALVATDPTYPSLGPNLEATVYK 366
FI+TSYDYDAP+DEYGL+RQPK+GHLK+LH++IKLCE ALV+ DP SLG +A VY
Sbjct: 308 FITTSYDYDAPIDEYGLVRQPKYGHLKELHRSIKLCERALVSADPIVSSLGSFQQAHVYS 367
Query: 367 TGSGLCSAFLANIGTNSDVTVKFNGNSYLLPAWSVSILPDCKNVVFNTAKINSVTLVPSF 426
+ +G C+AFL+N T S V FN Y LP WS+SILPDC+N VFNTAK+
Sbjct: 368 SDAGDCAAFLSNYDTKSSARVMFNNMHYNLPPWSISILPDCRNAVFNTAKVGV------- 420
Query: 427 SRQSLQVAADSSDAIGSGWSYINEPVGISKDDA--FTKPGLLEQINTTADQSDYLWYSLS 484
Q+ + ++A W +E + S DD+ FT GLLEQIN T D SDYLWY
Sbjct: 421 --QTAHMEMLPTNAEMLSWESYDEDIS-SLDDSSTFTTLGLLEQINVTRDASDYLWYITR 477
Query: 485 TNIKADEPLLEDGSKTVLHVQSLGHALHAFINGKLVGSGYGSSSNAKVTVDFPIALAPGK 544
+I + E L G L +Q+ GHA+H FING+L GS +G+ + T + L G
Sbjct: 478 IDIGSSESFLRGGELPTLILQTTGHAVHVFINGQLTGSAFGTREYRRFTFTEKVNLHAGT 537
Query: 545 NTFDLLSLTVGLQNYGAFYEKTGAGITGPVQLKGSGNGTNIDLSSQQWTYQTGLKGEELN 604
NT LLS+ VGL N G +E GI GPV L G G DLS Q+WTY+ GLKGE +N
Sbjct: 538 NTIALLSVAVGLPNVGGHFETWNTGILGPVALHGLNQG-KWDLSWQRWTYKVGLKGEAMN 596
Query: 605 F--PSG-SSTQWDSKS-TLPKLQPLVWYKTTFDAPAGSEPVAIDFTGMGKGEAWVNGQSI 660
P+G SS W S + QPL W+K F+AP G EP+A+D GMGKG+ W+NGQSI
Sbjct: 597 LVSPNGISSVDWMQGSLAAQRQQPLTWHKAFFNAPEGDEPLALDMEGMGKGQVWINGQSI 656
Query: 661 GRYWPTYVSQNGGCTDSCNYRGAYSSNKCLKNCGKPSQSLYHVPRSWLKSSGNTLVLFEE 720
GRYW Y NG C C+Y G Y KC CG+P+Q YHVPRSWLK + N LV+FEE
Sbjct: 657 GRYWTAYA--NGNC-QGCSYSGTYRPPKCQLGCGQPTQRWYHVPRSWLKPTQNLLVVFEE 713
Query: 721 IGGDPTKISFVTKQLGSSLCSHVTDSHPLPVDMWGSDS--KIQRKPGPVLSLECPNPNQV 778
+GGDP++IS V + + +S+C+ V + HP + W +S K + P + L C P Q
Sbjct: 714 LGGDPSRISLVRRSM-TSVCADVFEYHP-NIKNWHIESYGKTEELHKPKVHLRC-GPGQS 770
Query: 779 ISSIKFASFGTPLGTCGSFSRGRCSSARSLSVVRQACVGSKSCSIGVSVNTFG-DPCKGV 837
ISSIKFAS+GTPLGTCGSF +G C + S ++V + C+G + C++ +S F DPC V
Sbjct: 771 ISSIKFASYGTPLGTCGSFEQGPCHAPDSYAIVEKRCIGRQRCAVTISNTNFAQDPCPNV 830
Query: 838 MKSLAVEASC 847
+K L+VEA C
Sbjct: 831 LKRLSVEAVC 840
>gi|2961390|emb|CAA18137.1| beta-galactosidase like protein [Arabidopsis thaliana]
Length = 853
Score = 952 bits (2461), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 474/866 (54%), Positives = 590/866 (68%), Gaps = 53/866 (6%)
Query: 3 SKEILLLVLCWGFVVLATTSFGANVTYDHRAVVIGGKRRVLISGSIHYPRSTPEMWPDLI 62
S L+L C GF++L VTYD +A++I G+RR+L SGSIHYPRSTP+MW DLI
Sbjct: 9 SASRLILWFCLGFLILGVGFVQCGVTYDRKALLINGQRRILFSGSIHYPRSTPDMWEDLI 68
Query: 63 QKSKDGGLDVIETYVFWNLHEPVRNQYNFEGRYDLVKFVKLVAEAGLYAHLRIGPYVCAE 122
QK+KDGG+DVIETYVFWNLHEP +Y+FEGR DLV+FVK + +AGLYAHLRIGPYVCAE
Sbjct: 69 QKAKDGGIDVIETYVFWNLHEPSPGKYDFEGRNDLVRFVKTIHKAGLYAHLRIGPYVCAE 128
Query: 123 WNFGGFPLWLHFIPGIQFRTDNEPFKAEMQRFTAKIVDMMKQEKLYASQGGPIILSQIEN 182
WNFGGFP+WL ++PGI FRTDNEPFK M+ FT +IV++MK E L+ SQGGPIILSQIEN
Sbjct: 129 WNFGGFPVWLKYVPGISFRTDNEPFKRAMKGFTERIVELMKSENLFESQGGPIILSQIEN 188
Query: 183 EYGNIDSAYGAAGKSYIKWAAGMALSLDTGVPWVMCQQSDAPDPIINTCNGFYCDQFTPN 242
EYG GA G +Y+ WAA MA++ +TGVPWVMC++ DAPDP+INTCNGFYCD F PN
Sbjct: 189 EYGRQGQLLGAEGHNYMTWAAKMAIATETGVPWVMCKEDDAPDPVINTCNGFYCDSFAPN 248
Query: 243 SNNKPKMWTENWSGWFLSFGGAVPYRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNFDRT 302
KP +WTE WSGWF FGG + +RPV+DLAF VARF Q+GG+F NYYMYHGGTNF RT
Sbjct: 249 KPYKPLIWTEAWSGWFTEFGGPMHHRPVQDLAFGVARFIQKGGSFVNYYMYHGGTNFGRT 308
Query: 303 SGGPFISTSYDYDAPLDEYGLIRQPKWGHLKDLHKAIKLCEAALVATDPTYPSLGPNLE- 361
+GGPF++TSYDYDAP+DEYGLIRQPK+GHLK+LH+AIK+CE ALV+ DP S+G +
Sbjct: 309 AGGPFVTTSYDYDAPIDEYGLIRQPKYGHLKELHRAIKMCEKALVSADPVVTSIGNKQQV 368
Query: 362 -------ATVYKTGSGLCSAFLANIGTNSDVTVKFNGNSYLLPAWSVSILPDCKNVVFNT 414
A VY SG CSAFLAN T S V FN Y LP WS+SILPDC+N VFNT
Sbjct: 369 WIYYERFAHVYSAESGDCSAFLANYDTESAARVLFNNVHYNLPPWSISILPDCRNAVFNT 428
Query: 415 AKINSVTLVPSFSRQSLQVAADSSDAIGSGW-SYINEPVGISKDDAFTKPGLLEQINTTA 473
AK+++ W SY+ + + FT GLLEQIN T
Sbjct: 429 AKVSNFQ-----------------------WESYLEDLSSLDDSSTFTTHGLLEQINVTR 465
Query: 474 DQSDYLWYSLSTNIKADEPLLEDGSKTVLHVQSLGHALHAFINGKLVGSGYGSSSNAKVT 533
D SDYLWY S +I E L G L +QS GHA+H F+NG+L GS +G+ N + T
Sbjct: 466 DTSDYLWYMTSVDIGDSESFLHGGELPTLIIQSTGHAVHIFVNGQLSGSAFGTRQNRRFT 525
Query: 534 VDFPIALAPGKNTFDLLSLTVGLQNYGAFYEKTGAGITGPVQLKGSGNGTNIDLSSQQWT 593
I L G N LLS+ VGL N G +E GI GPV L G G +DLS Q+WT
Sbjct: 526 YQGKINLHSGTNRIALLSVAVGLPNVGGHFESWNTGILGPVALHGLSQG-KMDLSWQKWT 584
Query: 594 YQTGLKGEELN--FPSGS-STQW-DSKSTLPKLQPLVWYKTTFDAPAGSEPVAIDFTGMG 649
YQ GLKGE +N FP+ + S W D+ T+ K QPL W+KT FDAP G+EP+A+D GMG
Sbjct: 585 YQVGLKGEAMNLAFPTNTPSIGWMDASLTVQKPQPLTWHKTYFDAPEGNEPLALDMEGMG 644
Query: 650 KGEAWVNGQSIGRYWPTYVSQNGGCTDSCNYRGAYSSNKCLKNCGKPSQSLYHVPRSWLK 709
KG+ WVNG+SIGRYW + + G C+ C+Y G Y NKC CG+P+Q YHVPR+WLK
Sbjct: 645 KGQIWVNGESIGRYWTAFAT--GDCSH-CSYTGTYKPNKCQTGCGQPTQRWYHVPRAWLK 701
Query: 710 SSGNTLVLFEEIGGDPTKISFVTKQLGSSLCSHVTDSHP----LPVDMWGSDSKIQRKPG 765
S N LV+FEE+GG+P+ +S V + + S +C+ V++ HP ++ +G R
Sbjct: 702 PSQNLLVIFEELGGNPSTVSLVKRSV-SGVCAEVSEYHPNIKNWQIESYGKGQTFHR--- 757
Query: 766 PVLSLECPNPNQVISSIKFASFGTPLGTCGSFSRGRCSSARSLSVVR---QACVGSKSCS 822
P + L+C +P Q I+SIKFASFGTPLGTCGS+ +G C +A S +++ Q CVG C+
Sbjct: 758 PKVHLKC-SPGQAIASIKFASFGTPLGTCGSYQQGECHAATSYAILERYMQKCVGKARCA 816
Query: 823 IGVSVNTFG-DPCKGVMKSLAVEASC 847
+ +S + FG DPC V+K L VEA C
Sbjct: 817 VTISNSNFGKDPCPNVLKRLTVEAVC 842
>gi|225433463|ref|XP_002263385.1| PREDICTED: beta-galactosidase 9-like [Vitis vinifera]
Length = 882
Score = 952 bits (2461), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 483/876 (55%), Positives = 591/876 (67%), Gaps = 41/876 (4%)
Query: 4 KEILLLVLCWGFVVLATTSFGA-NVTYDHRAVVIGGKRRVLISGSIHYPRSTPEMWPDLI 62
+ + +LC+ + SF NV+YDHRA++I GKRR+L+S IHYPR+TPEMWPDLI
Sbjct: 5 RALFAALLCFSLTIQLGVSFAPFNVSYDHRALLIDGKRRMLVSAGIHYPRATPEMWPDLI 64
Query: 63 QKSKDGGLDVIETYVFWNLHEPVRNQYNFEGRYDLVKFVKLVAEAGLYAHLRIGPYVCAE 122
KSK+GG DVI+TYVFWN HEPVR QYNFEGRYD+VKFVKLV +GLY HLRIGPYVCAE
Sbjct: 65 AKSKEGGADVIQTYVFWNGHEPVRRQYNFEGRYDIVKFVKLVGSSGLYLHLRIGPYVCAE 124
Query: 123 WNFGGFPLWLHFIPGIQFRTDNEPFKAEMQRFTAKIVDMMKQEKLYASQGGPIILSQIEN 182
WNFGGFP+WL IPGI+FRTDN PFK EMQRF KIVD+M++E L++ QGGPII+ QIEN
Sbjct: 125 WNFGGFPVWLRDIPGIEFRTDNAPFKDEMQRFVKKIVDLMQKEMLFSWQGGPIIMLQIEN 184
Query: 183 EYGNIDSAYGAAGKSYIKWAAGMALSLDTGVPWVMCQQSDAPDPIINTCNGFYCDQFTPN 242
EYGN++S++G GK Y+KWAA MAL LD GVPWVMCQQ+DAPD IIN CNGFYCD F PN
Sbjct: 185 EYGNVESSFGQRGKDYVKWAARMALELDAGVPWVMCQQADAPDIIINACNGFYCDAFWPN 244
Query: 243 SNNKPKMWTENWSGWFLSFGGAVPYRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNFDRT 302
S NKPK+WTE+W+GWF S+GG P RPVED+AFAVARFFQRGG+F NYYMY GGTNF R+
Sbjct: 245 SANKPKLWTEDWNGWFASWGGRTPKRPVEDIAFAVARFFQRGGSFHNYYMYFGGTNFGRS 304
Query: 303 SGGPFISTSYDYDAPLDEYGLIRQPKWGHLKDLHKAIKLCEAALVATD-PTYPSLGPNLE 361
SGGPF TSYDYDAP+DEYGL+ QPKWGHLK+LH AIKLCE ALVA D P Y LGP E
Sbjct: 305 SGGPFYVTSYDYDAPIDEYGLLSQPKWGHLKELHAAIKLCEPALVAVDSPQYIKLGPMQE 364
Query: 362 ATVYKTGSGL----------CSAFLANIGTNSDVTVKFNGNSYLLPAWSVSILPDCKNVV 411
A VY+ L CSAFLANI + +V F G Y LP WSVSILPDC+ V
Sbjct: 365 AHVYRVKESLYSTQSGNGSSCSAFLANIDEHKTASVTFLGQIYKLPPWSVSILPDCRTTV 424
Query: 412 FNTAKINSVT----------LVPSFSRQSLQVAADSSDAIGSGWSYINEPVGISKDDAFT 461
FNTAK+ + T LV + S + + + W + EP+ + ++ FT
Sbjct: 425 FNTAKVGAQTSIKTVEFDLPLVRNISVTQPLMVQNKISYVPKTWMTLKEPISVWSENNFT 484
Query: 462 KPGLLEQINTTADQSDYLWYSLSTNIKADEPLL--EDGSKTVLHVQSLGHALHAFINGKL 519
G+LE +N T D SDYLW N+ A++ E+ L + S+ LH F+NG+L
Sbjct: 485 IQGVLEHLNVTKDHSDYLWRITRINVSAEDISFWEENQVSPTLSIDSMRDILHIFVNGQL 544
Query: 520 VGSGYGSSSNAKVTVDFPIALAPGKNTFDLLSLTVGLQNYGAFYEKTGAGITGPVQLKGS 579
+GS G V V PI L G N LLS TVGLQNYGAF EK GAG G V+L G
Sbjct: 545 IGSVIGHW----VKVVQPIQLLQGYNDLVLLSQTVGLQNYGAFLEKDGAGFKGQVKLTGF 600
Query: 580 GNGTNIDLSSQQWTYQTGLKGE-ELNFPSGSSTQWDSKSTLPKLQP--LVWYKTTFDAPA 636
NG IDLS WTYQ GL+GE + + S + + P P WYKT FDAP
Sbjct: 601 KNG-EIDLSEYSWTYQVGLRGEFQKIYMIDESEKAEWTDLTPDASPSTFTWYKTFFDAPN 659
Query: 637 GSEPVAIDFTGMGKGEAWVNGQSIGRYWPTYVSQNGGCTDSCNYRGAYSSNKCLKNCGKP 696
G PVA+D MGKG+AWVNG IGRYW T V+ GC C+YRG Y ++KC NCG P
Sbjct: 660 GENPVALDLGSMGKGQAWVNGHHIGRYW-TRVAPKDGC-GKCDYRGHYHTSKCATNCGNP 717
Query: 697 SQSLYHVPRSWLKSSGNTLVLFEEIGGDPTKISFVTKQLGSSLCSHVTDSHPLPVDMWGS 756
+Q YH+PRSWL++S N LVLFEE GG P +IS V + ++C+ V++SH + W
Sbjct: 718 TQIWYHIPRSWLQASNNLLVLFEETGGKPFEIS-VKSRSTQTICAEVSESHYPSLQNWSP 776
Query: 757 ----DSKIQRKPGPVLSLECPNPNQVISSIKFASFGTPLGTCGSFSRGRCSSARSLSVVR 812
D + K P + L+C + ISSI+FAS+GTP G+C FS+G+C + SL++V
Sbjct: 777 SDFIDQNSKNKMTPEMHLQC-DDGHTISSIEFASYGTPQGSCQMFSQGQCHAPNSLALVS 835
Query: 813 QACVGSKSCSIGVSVNTF-GDPCKGVMKSLAVEASC 847
+AC G SC I + + F GDPC+G++K+LAVEA C
Sbjct: 836 KACQGKGSCVIRILNSAFGGDPCRGIVKTLAVEAKC 871
>gi|449491392|ref|XP_004158882.1| PREDICTED: beta-galactosidase 3-like [Cucumis sativus]
Length = 854
Score = 952 bits (2460), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 472/852 (55%), Positives = 589/852 (69%), Gaps = 31/852 (3%)
Query: 6 ILLLVLCWGFVVLATTSFGANVTYDHRAVVIGGKRRVLISGSIHYPRSTPEMWPDLIQKS 65
+L+L L W +L +VTYD +A++I G+RRVL SGSIHYPRSTPEMW LIQK+
Sbjct: 11 MLVLGLFW---LLGVQFVQCSVTYDRKAILINGQRRVLFSGSIHYPRSTPEMWEGLIQKA 67
Query: 66 KDGGLDVIETYVFWNLHEPVRNQYNFEGRYDLVKFVKLVAEAGLYAHLRIGPYVCAEWNF 125
K+GGLDV+ETYVFWN+HEP YNFEGRYDL +F+K + +AGLYA+LRIGPYVCAEWNF
Sbjct: 68 KEGGLDVVETYVFWNVHEPSPGNYNFEGRYDLARFIKTIQKAGLYANLRIGPYVCAEWNF 127
Query: 126 GGFPLWLHFIPGIQFRTDNEPFKAEMQRFTAKIVDMMKQEKLYASQGGPIILSQIENEYG 185
GGFP+WL ++PGI FRTDNEPFK MQ FT KIV +MK E L+ SQGGPIILSQIENEYG
Sbjct: 128 GGFPVWLKYVPGISFRTDNEPFKRAMQGFTEKIVGLMKSENLFESQGGPIILSQIENEYG 187
Query: 186 NIDSAYGAAGKSYIKWAAGMALSLDTGVPWVMCQQSDAPDPIINTCNGFYCDQFTPNSNN 245
+GAAG++Y+ WAA MA+ L TGVPWVMC++ DAPDP+INTCNGFYCD F+PN
Sbjct: 188 VQSKLFGAAGQNYMTWAAKMAVGLGTGVPWVMCKEEDAPDPVINTCNGFYCDAFSPNRPY 247
Query: 246 KPKMWTENWSGWFLSFGGAVPYRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNFDRTSGG 305
KP MWTE WSGWF FGG + RPV+DLAFAVARF Q+GG+F NYYMYHGGTNF RT+GG
Sbjct: 248 KPTMWTEAWSGWFNEFGGPIHQRPVQDLAFAVARFIQKGGSFINYYMYHGGTNFGRTAGG 307
Query: 306 PFISTSYDYDAPLDEYGLIRQPKWGHLKDLHKAIKLCEAALVATDPTYPSLGPNLEATVY 365
PFI+TSYDYDAP+DEYGLIRQPK+GHLK+LH+A+K+CE ALV+ DP SLG + +A VY
Sbjct: 308 PFITTSYDYDAPIDEYGLIRQPKYGHLKELHRAVKMCEKALVSADPIVTSLGSSQQAYVY 367
Query: 366 KTGSGLCSAFLANIGTNSDVTVKFNGNSYLLPAWSVSILPDCKNVVFNTAKINSVTLVPS 425
+ SG C+AFL+N T+S V FN Y LP WS+SILPDC+NVVFNTAK+
Sbjct: 368 TSESGNCAAFLSNYDTDSAARVMFNNMHYNLPPWSISILPDCRNVVFNTAKVGV------ 421
Query: 426 FSRQSLQVAADSSDAIGSGWSYINEPVGISKDD-AFTKPGLLEQINTTADQSDYLWYSLS 484
Q+ Q+ +++ W NE V D T GLLEQIN T D SDYLWY S
Sbjct: 422 ---QTSQLEMLPTNSPMLLWESYNEDVSAEDDSTTMTASGLLEQINVTKDTSDYLWYITS 478
Query: 485 TNIKADEPLLEDGSKTVLHVQSLGHALHAFINGKLVGSGYGSSSNAKVTVDFPIALAPGK 544
+I + E L G L VQS GHA+H FING+L GS +GS N + T + G+
Sbjct: 479 VDIGSTESFLHGGELPTLIVQSTGHAVHIFINGRLSGSAFGSRENRRFTYTGKVNFRAGR 538
Query: 545 NTFDLLSLTVGLQNYGAFYEKTGAGITGPVQLKGSGNGTNIDLSSQQWTYQTGLKGEELN 604
NT LLS+ VGL N G +E GI GPV L G G +DLS +WTY+ GLKGE +N
Sbjct: 539 NTIALLSVAVGLPNVGGHFETWNTGILGPVALHGLDQG-KLDLSWAKWTYKVGLKGEAMN 597
Query: 605 F--PSG-SSTQWDSKSTLPKL-QPLVWYKTTFDAPAGSEPVAIDFTGMGKGEAWVNGQSI 660
P+G SS +W S + QPL W+K+ FDAP G EP+AID GMGKG+ W+NG SI
Sbjct: 598 LVSPNGISSVEWMEGSLAAQAPQPLTWHKSNFDAPEGDEPLAIDMRGMGKGQIWINGVSI 657
Query: 661 GRYWPTYVSQNGGCTDSCNYRGAYSSNKCLKNCGKPSQSLYHVPRSWLKSSGNTLVLFEE 720
GRYW Y + G C D CNY G + KC + CG+P+Q YHVPR+WLK N LV+FEE
Sbjct: 658 GRYWTAYAT--GNC-DKCNYAGTFRPPKCQQGCGQPTQRWYHVPRAWLKPKDNLLVVFEE 714
Query: 721 IGGDPTKISFVTKQLGSSLCSHVTDSHPL----PVDMWGSDSKIQRKPGPVLSLECPNPN 776
+GG+PT IS V + + + +C+ V++ HP ++ +G + R P + L+C +
Sbjct: 715 LGGNPTSISLVKRSV-TGVCADVSEYHPTLKNWHIESYGKSEDLHR---PKVHLKC-SAG 769
Query: 777 QVISSIKFASFGTPLGTCGSFSRGRCSSARSLSVVRQACVGSKSCSIGVSVNTFG-DPCK 835
I+SIKFASFGTPLGTCGS+ +G C + S ++ + C+G + C++ +S FG DPC
Sbjct: 770 YSITSIKFASFGTPLGTCGSYQQGTCHAPMSYDILEKRCIGKQRCAVTISNTNFGQDPCP 829
Query: 836 GVMKSLAVEASC 847
V+K L+VE C
Sbjct: 830 NVLKRLSVEVVC 841
>gi|326515822|dbj|BAK07157.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 847
Score = 951 bits (2458), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 471/831 (56%), Positives = 589/831 (70%), Gaps = 27/831 (3%)
Query: 27 VTYDHRAVVIGGKRRVLISGSIHYPRSTPEMWPDLIQKSKDGGLDVIETYVFWNLHEPVR 86
VTYD +AV+I G+RR+L SGSIHYPRSTPEMW LIQK+KDGGLDVI+TYVFWN HEP
Sbjct: 32 VTYDRKAVLINGQRRILFSGSIHYPRSTPEMWEGLIQKAKDGGLDVIQTYVFWNGHEPTP 91
Query: 87 NQYNFEGRYDLVKFVKLVAEAGLYAHLRIGPYVCAEWNFGGFPLWLHFIPGIQFRTDNEP 146
YNFEGRYDLVKF+K +AGL+ HLRIGPY+C EWNFGGFP+WL ++PGI FRTDNEP
Sbjct: 92 GSYNFEGRYDLVKFIKTAQKAGLFVHLRIGPYICGEWNFGGFPVWLKYVPGISFRTDNEP 151
Query: 147 FKAEMQRFTAKIVDMMKQEKLYASQGGPIILSQIENEYGNIDSAYGAAGKSYIKWAAGMA 206
FKA MQ FT KIV MMK E+L+ASQGGPIILSQIENEYG + +GAAGKSY WAA MA
Sbjct: 152 FKAAMQGFTEKIVGMMKSEELFASQGGPIILSQIENEYGPEEKEFGAAGKSYSDWAAKMA 211
Query: 207 LSLDTGVPWVMCQQSDAPDPIINTCNGFYCDQFTPNSNNKPKMWTENWSGWFLSFGGAVP 266
+ LDTGVPWVMC+Q DAPDP+IN CNGFYCD FTPN+ +KP MWTE W+GWF FGG +
Sbjct: 212 VGLDTGVPWVMCKQEDAPDPVINACNGFYCDAFTPNTPSKPTMWTEAWTGWFTEFGGTIR 271
Query: 267 YRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNFDRTSGGPFISTSYDYDAPLDEYGLIRQ 326
RPVEDL+FAVARF Q+GG+F NYYMYHGGTNF RT+GGPFI+TSYDYDAPLDEYGL R+
Sbjct: 272 KRPVEDLSFAVARFVQKGGSFINYYMYHGGTNFGRTAGGPFITTSYDYDAPLDEYGLARE 331
Query: 327 PKWGHLKDLHKAIKLCEAALVATDPTYPSLGPNLEATVYKTGSGLCSAFLANIGTNSDVT 386
PK+GHLK+LHKAIKLCE ALV+ DPT SLG EA VY++ SG C+AFLAN +NS
Sbjct: 332 PKYGHLKELHKAIKLCEQALVSVDPTVTSLGSMQEAHVYRSPSG-CAAFLANYNSNSHAK 390
Query: 387 VKFNGNSYLLPAWSVSILPDCKNVVFNTAKINSVTLVPSFSRQSLQVAADSSDAIGSGWS 446
+ F+ Y LP WS+SILPDCK VV+NTA + T +Q+ +D + ++ W
Sbjct: 391 IVFDNEHYSLPPWSISILPDCKTVVYNTATVGVQT-------SQMQMWSDGASSM--MWE 441
Query: 447 YINEPVG-ISKDDAFTKPGLLEQINTTADQSDYLWYSLSTNIKADEPLLEDGSKTVLHVQ 505
+E VG ++ T GLLEQ+N T D SDYLWY S ++ E L+ G L VQ
Sbjct: 442 RYDEEVGSLAAAPLLTTTGLLEQLNATRDTSDYLWYMTSVDVSPSEKSLQGGKPLSLTVQ 501
Query: 506 SLGHALHAFINGKLVGSGYGSSSNAKVTVDFPIALAPGKNTFDLLSLTVGLQNYGAFYEK 565
S GHALH F+NG+L GS G+ + +++ + L G N LLS+ GL N G YE
Sbjct: 502 SAGHALHIFVNGQLQGSASGTREDKRISYKGDVKLRAGTNKISLLSVACGLPNIGVHYET 561
Query: 566 TGAGITGPVQLKGSGNGTNIDLSSQQWTYQTGLKGEELNFPS---GSSTQWDSKSTLPKL 622
G+ GPV L G G+ DL+ Q WTYQ GLKGE++N S SS +W S + +
Sbjct: 562 WNTGVNGPVVLHGLDEGSR-DLTWQTWTYQVGLKGEQMNLNSLEGASSVEWMQGSLIAQN 620
Query: 623 Q-PLVWYKTTFDAPAGSEPVAIDFTGMGKGEAWVNGQSIGRYWPTYVSQNGGCTDSCNYR 681
Q PL WY+ FD P+G EP+A+D MGKG+ W+NGQSIGRY Y + G C D C+Y
Sbjct: 621 QMPLAWYRAYFDTPSGDEPLALDMGSMGKGQIWINGQSIGRYSLAYAT--GDCKD-CSYT 677
Query: 682 GAYSSNKCLKNCGKPSQSLYHVPRSWLKSSGNTLVLFEEIGGDPTKISFVTKQLGSSLCS 741
G++ + KC CG+P+Q YHVP+ WL+ + N LV+FEE+GGD +KIS V + + S++C+
Sbjct: 678 GSFRAIKCQAGCGQPTQRWYHVPKPWLQPTRNLLVVFEELGGDTSKISLVKRSV-SNVCA 736
Query: 742 HVTDSHPLPVDMWGSDSKIQRKPGPVLS---LECPNPNQVISSIKFASFGTPLGTCGSFS 798
V++ HP + W +++ + KP S L C P Q IS+IKFASFGTPLGTCGSF
Sbjct: 737 DVSEFHP-SIKNWQTENSGEAKPELRRSKVHLRCA-PGQSISAIKFASFGTPLGTCGSFE 794
Query: 799 RGRCSSARSLSVVRQACVGSKSCSIGVSVNTF-GDPCKGVMKSLAVEASCT 848
+G+C S +S +V+ + C+G + C++ +S + F GDPC VMK +AVEA C+
Sbjct: 795 QGQCHSTKSQTVL-ENCIGKQRCAVTISPDNFGGDPCPNVMKRVAVEAVCS 844
>gi|449464526|ref|XP_004149980.1| PREDICTED: beta-galactosidase 3-like [Cucumis sativus]
Length = 854
Score = 951 bits (2457), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 471/852 (55%), Positives = 588/852 (69%), Gaps = 31/852 (3%)
Query: 6 ILLLVLCWGFVVLATTSFGANVTYDHRAVVIGGKRRVLISGSIHYPRSTPEMWPDLIQKS 65
+L+L L W +L +VTYD +A++I G+RRVL SGSIHYPRSTPEMW LIQK+
Sbjct: 11 MLVLGLFW---LLGVQFVQCSVTYDRKAILINGQRRVLFSGSIHYPRSTPEMWEGLIQKA 67
Query: 66 KDGGLDVIETYVFWNLHEPVRNQYNFEGRYDLVKFVKLVAEAGLYAHLRIGPYVCAEWNF 125
K+GGLDV+ETYVFWN+HEP YNFEGRYDLV+F+K + +AGLYA+LRIGPYVCAEWNF
Sbjct: 68 KEGGLDVVETYVFWNVHEPSPGNYNFEGRYDLVRFIKTIQKAGLYANLRIGPYVCAEWNF 127
Query: 126 GGFPLWLHFIPGIQFRTDNEPFKAEMQRFTAKIVDMMKQEKLYASQGGPIILSQIENEYG 185
GGFP+WL ++PGI FRTDNEPFK MQ FT KIV +MK E L+ SQGGPIILSQIENEYG
Sbjct: 128 GGFPVWLKYVPGISFRTDNEPFKRAMQGFTEKIVGLMKSENLFESQGGPIILSQIENEYG 187
Query: 186 NIDSAYGAAGKSYIKWAAGMALSLDTGVPWVMCQQSDAPDPIINTCNGFYCDQFTPNSNN 245
+GAAG++Y+ WAA MA+ L TGVPWVMC++ DAPDP+INTCNGFYCD F+PN
Sbjct: 188 VQSKLFGAAGQNYMTWAAKMAVGLGTGVPWVMCKEEDAPDPVINTCNGFYCDAFSPNRPY 247
Query: 246 KPKMWTENWSGWFLSFGGAVPYRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNFDRTSGG 305
KP MWTE WSGWF FGG + RPV+DLAFAVA F Q+GG+F NYYMYHGGTNF RT+GG
Sbjct: 248 KPTMWTEAWSGWFNEFGGPIHQRPVQDLAFAVALFIQKGGSFINYYMYHGGTNFGRTAGG 307
Query: 306 PFISTSYDYDAPLDEYGLIRQPKWGHLKDLHKAIKLCEAALVATDPTYPSLGPNLEATVY 365
PFI+TSYDYDAP+DEYGLIRQPK+GHLK+LH+A+K+CE ALV+ DP SLG + +A VY
Sbjct: 308 PFITTSYDYDAPIDEYGLIRQPKYGHLKELHRAVKMCEKALVSADPIVTSLGSSQQAYVY 367
Query: 366 KTGSGLCSAFLANIGTNSDVTVKFNGNSYLLPAWSVSILPDCKNVVFNTAKINSVTLVPS 425
+ SG C+AFL+N T+S V FN Y LP WS+SILPDC+NVVFNTAK+
Sbjct: 368 TSESGNCAAFLSNYDTDSAARVMFNNMHYNLPPWSISILPDCRNVVFNTAKVGV------ 421
Query: 426 FSRQSLQVAADSSDAIGSGWSYINEPVGISKDD-AFTKPGLLEQINTTADQSDYLWYSLS 484
Q+ Q+ +++ W NE V D T GLLEQIN T D SDYLWY S
Sbjct: 422 ---QTSQLEMLPTNSPMLLWESYNEDVSAEDDSTTMTASGLLEQINVTKDTSDYLWYITS 478
Query: 485 TNIKADEPLLEDGSKTVLHVQSLGHALHAFINGKLVGSGYGSSSNAKVTVDFPIALAPGK 544
+I + E L G L VQS GHA+H FING+L GS +GS N + T + G+
Sbjct: 479 VDIGSTESFLHGGELPTLIVQSTGHAVHIFINGRLSGSAFGSRENRRFTYTGKVNFRAGR 538
Query: 545 NTFDLLSLTVGLQNYGAFYEKTGAGITGPVQLKGSGNGTNIDLSSQQWTYQTGLKGEELN 604
NT LLS+ VGL N G +E GI GPV L G G +DLS +WTY+ GLKGE +N
Sbjct: 539 NTIALLSVAVGLPNVGGHFETWNTGILGPVALHGLDQG-KLDLSWAKWTYKVGLKGEAMN 597
Query: 605 F--PSG-SSTQWDSKSTLPKL-QPLVWYKTTFDAPAGSEPVAIDFTGMGKGEAWVNGQSI 660
P+G SS +W S + QPL W+K+ FDAP G EP+AID GMGKG+ W+NG SI
Sbjct: 598 LVSPNGISSVEWMEGSLAAQAPQPLTWHKSNFDAPEGDEPLAIDMRGMGKGQIWINGVSI 657
Query: 661 GRYWPTYVSQNGGCTDSCNYRGAYSSNKCLKNCGKPSQSLYHVPRSWLKSSGNTLVLFEE 720
GRYW Y + N D CNY G + KC + CG+P+Q YHVPR+WLK N LV+FEE
Sbjct: 658 GRYWTAYATGN---CDKCNYAGTFRPPKCQQGCGQPTQRWYHVPRAWLKPKDNLLVVFEE 714
Query: 721 IGGDPTKISFVTKQLGSSLCSHVTDSHPL----PVDMWGSDSKIQRKPGPVLSLECPNPN 776
+GG+PT IS V + + + +C+ V++ HP ++ +G + R P + L+C +
Sbjct: 715 LGGNPTSISLVKRSV-TGVCADVSEYHPTLKNWHIESYGKSEDLHR---PKVHLKC-SAG 769
Query: 777 QVISSIKFASFGTPLGTCGSFSRGRCSSARSLSVVRQACVGSKSCSIGVSVNTFG-DPCK 835
I+SIKFASFGTPLGTCGS+ +G C + S ++ + C+G + C++ +S FG DPC
Sbjct: 770 YSITSIKFASFGTPLGTCGSYQQGTCHAPMSYDILEKRCIGKQRCAVTISNTNFGQDPCP 829
Query: 836 GVMKSLAVEASC 847
V+K L+VE C
Sbjct: 830 NVLKRLSVEVVC 841
>gi|449457508|ref|XP_004146490.1| PREDICTED: beta-galactosidase 3-like [Cucumis sativus]
gi|449500002|ref|XP_004160975.1| PREDICTED: beta-galactosidase 3-like [Cucumis sativus]
Length = 846
Score = 949 bits (2454), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 472/852 (55%), Positives = 584/852 (68%), Gaps = 24/852 (2%)
Query: 1 MASKEILLLVLCWGFVVLATTSFGANVTYDHRAVVIGGKRRVLISGSIHYPRSTPEMWPD 60
MA+ + F++ + + VTYD +A++I G+RR+L SGSIHYPRSTPEMW D
Sbjct: 1 MATHYYCFPLFLIAFLLANSHLIHSTVTYDRKAILINGQRRILFSGSIHYPRSTPEMWED 60
Query: 61 LIQKSKDGGLDVIETYVFWNLHEPVRNQYNFEGRYDLVKFVKLVAEAGLYAHLRIGPYVC 120
LI K+K+GGLDV+ETYVFWN+HEP YNFEGR+DLV+F+K + +AGLYA+LRIGPYVC
Sbjct: 61 LILKAKNGGLDVVETYVFWNVHEPYPGIYNFEGRFDLVRFIKTIQKAGLYANLRIGPYVC 120
Query: 121 AEWNFGGFPLWLHFIPGIQFRTDNEPFKAEMQRFTAKIVDMMKQEKLYASQGGPIILSQI 180
AEWNFGGFP+WL ++PGI FRTDNE FK MQ FT KIV +MK E L+ SQGGPIIL+QI
Sbjct: 121 AEWNFGGFPVWLKYVPGISFRTDNEAFKNAMQGFTEKIVALMKSENLFESQGGPIILAQI 180
Query: 181 ENEYGNIDSAYGAAGKSYIKWAAGMALSLDTGVPWVMCQQSDAPDPIINTCNGFYCDQFT 240
ENEYG +G AG +Y+ WAA MA+ L TGVPWVMC+++DAPDP+INTCNGFYCD F+
Sbjct: 181 ENEYGTESKLFGEAGYNYMTWAANMAVGLQTGVPWVMCKEADAPDPVINTCNGFYCDTFS 240
Query: 241 PNSNNKPKMWTENWSGWFLSFGGAVPYRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNFD 300
PN KP MWTE W+GWF FGG + RPV+DLAFAVARF QRGG+ NYYMYHGGTNF
Sbjct: 241 PNKPYKPTMWTEAWTGWFSEFGGPLHQRPVQDLAFAVARFIQRGGSLVNYYMYHGGTNFG 300
Query: 301 RTSGGPFISTSYDYDAPLDEYGLIRQPKWGHLKDLHKAIKLCEAALVATDPTYPSLGPNL 360
RT+GGPFI+TSYDYDAP+DEYGL+RQPK+GHLK+LH+AIK+CE ALV+ DP SLG
Sbjct: 301 RTAGGPFITTSYDYDAPIDEYGLLRQPKYGHLKELHRAIKMCEPALVSADPIVTSLGDYQ 360
Query: 361 EATVYKTGSGLCSAFLANIGTNSDVTVKFNGNSYLLPAWSVSILPDCKNVVFNTAKINSV 420
+A VY + SG C+AFL+N T S V FN Y LP WS+SILPDCKN VFNTAK+
Sbjct: 361 QAHVYSSESGGCAAFLSNYDTKSFARVLFNNRHYNLPPWSISILPDCKNAVFNTAKVGV- 419
Query: 421 TLVPSFSRQSLQVAADSSDAIGSGW-SYINEPVGISKDDAFTKPGLLEQINTTADQSDYL 479
Q+ Q+ +++ W SY + + T PGLLEQIN T D SDYL
Sbjct: 420 --------QTAQMGMLPAESTTLSWESYFEDISALDDRSMMTSPGLLEQINVTRDTSDYL 471
Query: 480 WYSLSTNIKADEPLLEDGSKTVLHVQSLGHALHAFINGKLVGSGYGSSSNAKVTVDFPIA 539
WY S +I + EP L G L VQS GHA+H FING+L GS GS + + T +
Sbjct: 472 WYITSVDISSSEPFLHGGELPTLLVQSTGHAVHVFINGQLSGSVSGSRKSRRFTYSGKVN 531
Query: 540 LAPGKNTFDLLSLTVGLQNYGAFYEKTGAGITGPVQLKGSGNGTNIDLSSQQWTYQTGLK 599
L G N LLS+ VGL N G +E GI GPV L G G DLSSQ+WTY+ GLK
Sbjct: 532 LHAGTNKIGLLSVAVGLPNVGGHFETWNTGILGPVVLYGLRQG-KWDLSSQKWTYKVGLK 590
Query: 600 GEELNF--PSG-SSTQWDSKSTLPKL-QPLVWYKTTFDAPAGSEPVAIDFTGMGKGEAWV 655
GE +N PSG S +W S + QPL W+K FDAP G EP+A+D GMGKG+ W+
Sbjct: 591 GEAMNLISPSGFSPVEWMQASLAAQTPQPLTWHKAYFDAPEGEEPLALDMEGMGKGQIWI 650
Query: 656 NGQSIGRYWPTYVSQNGGCTDSCNYRGAYSSNKCLKNCGKPSQSLYHVPRSWLKSSGNTL 715
NGQSIGRYW Y G C+ CNY A+ KC CG+P+Q YHVPRSWL+ N L
Sbjct: 651 NGQSIGRYWTAYA--RGNCS-RCNYATAFRPPKCQLGCGQPTQRWYHVPRSWLRPEQNLL 707
Query: 716 VLFEEIGGDPTKISFVTKQLGSSLCSHVTDSHPLPVDMWGSDSKIQRKPGPVLSLECPNP 775
V+FEE+GG+P++IS V K+L +S+C+ V++ HP W +K P + L C +P
Sbjct: 708 VVFEEVGGNPSRISIV-KRLVTSVCADVSEFHPT-FKNWHITAKFIT---PKVHLSC-DP 761
Query: 776 NQVISSIKFASFGTPLGTCGSFSRGRCSSARSLSVVRQACVGSKSCSIGVSVNTFGDPCK 835
Q ISSIKFASFGTPLGTCGS+ +G C + S ++ + CVG + C++ VS + F DPC
Sbjct: 762 GQYISSIKFASFGTPLGTCGSYQQGTCHAPSSSGILEKKCVGKQRCAVTVSNSNFEDPCP 821
Query: 836 GVMKSLAVEASC 847
+MK L+VEA C
Sbjct: 822 NMMKRLSVEAVC 833
>gi|359480881|ref|XP_003632537.1| PREDICTED: beta-galactosidase 3-like [Vitis vinifera]
gi|296082595|emb|CBI21600.3| unnamed protein product [Vitis vinifera]
Length = 847
Score = 949 bits (2452), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 461/836 (55%), Positives = 583/836 (69%), Gaps = 11/836 (1%)
Query: 20 TTSFGANVTYDHRAVVIGGKRRVLISGSIHYPRSTPEMWPDLIQKSKDGGLDVIETYVFW 79
T+S ANVTYD R+++I G+R++LIS SIHYPRS P MWP L++ +K+GG+DVIETYVFW
Sbjct: 16 TSSLAANVTYDRRSLIIDGQRKLLISASIHYPRSVPGMWPGLVKTAKEGGIDVIETYVFW 75
Query: 80 NLHEPVRNQYNFEGRYDLVKFVKLVAEAGLYAHLRIGPYVCAEWNFGGFPLWLHFIPGIQ 139
N HE + Y F GRYDL+KFVK+V +A +Y LR+GP+V AEWNFGG P+WLH++PG
Sbjct: 76 NGHELSPDNYYFGGRYDLLKFVKIVQQARMYLILRVGPFVAAEWNFGGVPVWLHYVPGTV 135
Query: 140 FRTDNEPFKAEMQRFTAKIVDMMKQEKLYASQGGPIILSQIENEYGNIDSAYGAAGKSYI 199
FRT++EPFK MQ+F IV++MK+EKL+ASQGGPIIL+Q+ENEYG+ + YG GK Y
Sbjct: 136 FRTNSEPFKYHMQKFMTLIVNIMKKEKLFASQGGPIILAQVENEYGDTERIYGDGGKPYA 195
Query: 200 KWAAGMALSLDTGVPWVMCQQSDAPDPIINTCNGFYCDQFTPNSNNKPKMWTENWSGWFL 259
WAA MALS + GVPW+MCQQ DAPDP+INTCN FYCDQFTPNS NKPKMWTENW GWF
Sbjct: 196 MWAANMALSQNIGVPWIMCQQYDAPDPVINTCNSFYCDQFTPNSPNKPKMWTENWPGWFK 255
Query: 260 SFGGAVPYRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNFDRTSGGPFISTSYDYDAPLD 319
+FG P+RP ED+AF+VARFFQ+GG+ QNYYMYHGGTNF RTSGGPFI+TSYDY+AP+D
Sbjct: 256 TFGAPDPHRPHEDIAFSVARFFQKGGSLQNYYMYHGGTNFGRTSGGPFITTSYDYNAPID 315
Query: 320 EYGLIRQPKWGHLKDLHKAIKLCEAALVATDPTYPSLGPNLEATVYKTGSGLCSAFLANI 379
EYGL R PKWGHLK+LH+AIK CE L+ +P SLGP+ E VY SG C+AF++N+
Sbjct: 316 EYGLARLPKWGHLKELHRAIKSCEHVLLYGEPINLSLGPSQEVDVYTDSSGGCAAFISNV 375
Query: 380 GTNSDVTVKFNGNSYLLPAWSVSILPDCKNVVFNTAKINSVTLVPSFSRQSLQ--VAADS 437
D + F SY +PAWSVSILPDCKNVVFNTAK+ S T + LQ + +
Sbjct: 376 DEKEDKIIVFQNVSYHVPAWSVSILPDCKNVVFNTAKVGSQTSQVEMVPEELQPSLVPSN 435
Query: 438 SDAIGSGWSYINEPVGISKDDAFTKPGLLEQINTTADQSDYLWYSLSTNIKADEPLLEDG 497
D G W E GI + F K G ++ INTT D +DYLWY++S + E L++
Sbjct: 436 KDLKGLQWETFVEKAGIWGEADFVKNGFVDHINTTKDTTDYLWYTVSLTVGESENFLKEI 495
Query: 498 SKTVLHVQSLGHALHAFINGKLVGSGYGSSSNAKVTVDFPIALAPGKNTFDLLSLTVGLQ 557
S+ VL V+S GHALHAF+N KL GS G+ S++ + PI+L GKN LLS+TVGLQ
Sbjct: 496 SQPVLLVESKGHALHAFVNQKLQGSASGNGSHSPFKFECPISLKAGKNDIALLSMTVGLQ 555
Query: 558 NYGAFYEKTGAGITGPVQLKGSGNGTNIDLSSQQWTYQTGLKGEELNF--PSG-SSTQWD 614
N G FYE GAG+T V++KG NG +DLS+ WTY+ GL+GE L P G +S +W
Sbjct: 556 NAGPFYEWVGAGLTS-VKIKGLNNGI-MDLSTYTWTYKIGLQGEHLLIYKPEGLNSVKWL 613
Query: 615 SKSTLPKLQPLVWYKTTFDAPAGSEPVAIDFTGMGKGEAWVNGQSIGRYWPTYVSQNGGC 674
S PK QPL WYK D P+G+EP+ +D MGKG AW+NG+ IGRYWP S + C
Sbjct: 614 STPEPPKQQPLTWYKAVVDPPSGNEPIGLDMVHMGKGLAWLNGEEIGRYWPRKSSIHDKC 673
Query: 675 TDSCNYRGAYSSNKCLKNCGKPSQSLYHVPRSWLKSSGNTLVLFEEIGGDPTKISFVTKQ 734
C+YRG + NKC CG+P+Q YHVPRSW K SGN LV+FEE GGDPTKI F +++
Sbjct: 674 VQECDYRGKFMPNKCSTGCGEPTQRWYHVPRSWFKPSGNILVIFEEKGGDPTKIRF-SRR 732
Query: 735 LGSSLCSHVTDSHP-LPVDMWGSDSKIQRKPGPVLSLECPNPNQVISSIKFASFGTPLGT 793
+ +C+ V++ HP ++ W D+ K + L+CP N ISS+KFAS+GTP G
Sbjct: 733 KTTGVCALVSEDHPTYELESWHKDANENNKNKATIHLKCPE-NTHISSVKFASYGTPTGK 791
Query: 794 CGSFSRGRCSSARSLSVVRQACVGSKSCSIGVSVNTFG-DPCKGVMKSLAVEASCT 848
CGS+S+G C S SVV + C+ C+I ++ F D C K LAVEA C+
Sbjct: 792 CGSYSQGDCHDPNSASVVEKLCIRKNDCAIELAEKNFSKDLCPSTTKKLAVEAVCS 847
>gi|255538780|ref|XP_002510455.1| beta-galactosidase, putative [Ricinus communis]
gi|223551156|gb|EEF52642.1| beta-galactosidase, putative [Ricinus communis]
Length = 846
Score = 949 bits (2452), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 475/849 (55%), Positives = 586/849 (69%), Gaps = 26/849 (3%)
Query: 6 ILLLVLCWGFVVLATTSFGANVTYDHRAVVIGGKRRVLISGSIHYPRSTPEMWPDLIQKS 65
L+VL G ++ T VTYD +A++I G+RR+LISGSIHYPRSTPEMW DLIQK+
Sbjct: 12 FFLMVLLMGSKLVQCT-----VTYDKKAIIINGQRRILISGSIHYPRSTPEMWEDLIQKA 66
Query: 66 KDGGLDVIETYVFWNLHEPVRNQYNFEGRYDLVKFVKLVAEAGLYAHLRIGPYVCAEWNF 125
KDGGLDVI+TYVFW++HE YNF+GRYDLV+F+K V + GLYAHLRIGPYVCAEWNF
Sbjct: 67 KDGGLDVIDTYVFWDVHETSPGNYNFDGRYDLVRFIKTVQKVGLYAHLRIGPYVCAEWNF 126
Query: 126 GGFPLWLHFIPGIQFRTDNEPFKAEMQRFTAKIVDMMKQEKLYASQGGPIILSQIENEYG 185
GGFP+WL ++PGI FRTDNEPFKA MQ FT KIV MMK E L+ASQGGPIILSQIENEYG
Sbjct: 127 GGFPVWLKYVPGISFRTDNEPFKAAMQGFTQKIVQMMKNENLFASQGGPIILSQIENEYG 186
Query: 186 NIDSAYGAAGKSYIKWAAGMALSLDTGVPWVMCQQSDAPDPIINTCNGFYCDQFTPNSNN 245
A GAAG+SYI WAA MA+ LDTGVPWVMC++ DAPDP+INTCNGFYCD F PN
Sbjct: 187 PESRALGAAGRSYINWAAKMAVGLDTGVPWVMCKEDDAPDPMINTCNGFYCDAFAPNKPY 246
Query: 246 KPKMWTENWSGWFLSFGGAVPYRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNFDRTSGG 305
KP +WTE WSGWF FGG + RPVEDLAFAVARF Q+GG++ NYYMYHGGTNF R++GG
Sbjct: 247 KPTLWTEAWSGWFTEFGGPIHQRPVEDLAFAVARFIQKGGSYFNYYMYHGGTNFGRSAGG 306
Query: 306 PFISTSYDYDAPLDEYGLIRQPKWGHLKDLHKAIKLCEAALVATDPTYPSLGPNLEATVY 365
PFI+TSYDYDAP+DEYGLIR+PK+GHLK LHKAIKLCE ALV++DP+ SLG +A V+
Sbjct: 307 PFITTSYDYDAPIDEYGLIREPKYGHLKALHKAIKLCEHALVSSDPSITSLGTYQQAHVF 366
Query: 366 KTGSGLCSAFLANIGTNSDVTVKFNGNSYLLPAWSVSILPDCKNVVFNTAKINSVTLVPS 425
+G C+AFLAN S V FN Y LP WS+SILPDC+NVVFNTA++ + TL
Sbjct: 367 SSGRS-CAAFLANYNAKSAARVMFNNMHYDLPPWSISILPDCRNVVFNTARVGAQTL--- 422
Query: 426 FSRQSLQVAADSSDAIGSGWSYINEPVGISKDDAFTKPGLLEQINTTADQSDYLWYSLST 485
+Q+ S+ S +Y E ++ T GLLEQIN T D SDYLWY S
Sbjct: 423 ----RMQMLPTGSELF-SWETYDEEISSLTDSSRITALGLLEQINVTRDTSDYLWYLTSV 477
Query: 486 NIKADEPLLEDGSKTVLHVQSLGHALHAFINGKLVGSGYGSSSNAKVTVDFPIALAPGKN 545
+I E L +G K L VQS GH LH FING+ GS +G+ N ++T P+ L G N
Sbjct: 478 DISPSEAFLRNGQKPSLTVQSAGHGLHVFINGQFSGSAFGTRENRQLTFTGPVNLRAGTN 537
Query: 546 TFDLLSLTVGLQNYGAFYEKTGAGITGPVQLKGSGNGTNIDLSSQQWTYQTGLKGEELNF 605
LLS+ VGL N G YE G+ GPV L G G DL+ Q+W+YQ GLKGE +N
Sbjct: 538 RIALLSIAVGLPNVGLHYETWKTGVQGPVLLNGLNQGKK-DLTWQKWSYQVGLKGEAMNL 596
Query: 606 --PSG-SSTQWDSKSTL-PKLQPLVWYKTTFDAPAGSEPVAIDFTGMGKGEAWVNGQSIG 661
P+G SS W S + Q L W+K FDAP G+EP+A+D MGKG+ W+NGQSIG
Sbjct: 597 VSPNGVSSVDWIEGSLASSQGQALKWHKAYFDAPRGNEPLALDMRSMGKGQVWINGQSIG 656
Query: 662 RYWPTYVSQNGGCTDSCNYRGAYSSNKCLKNCGKPSQSLYHVPRSWLKSSGNTLVLFEEI 721
RYW Y G C +SC+Y + +KC CG+P+Q YHVPRSWLK + N LV+FEE+
Sbjct: 657 RYWMAYA--KGDC-NSCSYIWTFRPSKCQLGCGEPTQRWYHVPRSWLKPTKNLLVVFEEL 713
Query: 722 GGDPTKISFVTKQLGSSLCSHVTDSHPLPVDM-WGSDSKIQRKPGPVLSLECPNPNQVIS 780
GGD +KIS V + + +C+ + HP + G + + + + L C P Q I+
Sbjct: 714 GGDASKISLVKRSI-EGVCADAYEHHPATKNYNTGGNDESSKLHQAKIHLRCA-PGQFIA 771
Query: 781 SIKFASFGTPLGTCGSFSRGRCSSARSLSVVRQACVGSKSCSIGVSVNTFG-DPCKGVMK 839
+IKFASFGTP GTCGSF +G C + + SV+ + C+G +SC + +S + FG DPC V+K
Sbjct: 772 AIKFASFGTPSGTCGSFQQGTCHAPNTHSVIEKKCIGQESCMVTISNSNFGADPCPNVLK 831
Query: 840 SLAVEASCT 848
L+VEA C+
Sbjct: 832 KLSVEAVCS 840
>gi|357131396|ref|XP_003567324.1| PREDICTED: beta-galactosidase 3-like [Brachypodium distachyon]
Length = 916
Score = 946 bits (2444), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 460/827 (55%), Positives = 578/827 (69%), Gaps = 17/827 (2%)
Query: 27 VTYDHRAVVIGGKRRVLISGSIHYPRSTPEMWPDLIQKSKDGGLDVIETYVFWNLHEPVR 86
VTYD R+++I G+RR+LIS SIHYPRS P MWP L+ ++KDGG D IETYVFWN HE
Sbjct: 102 VTYDGRSLIISGRRRLLISTSIHYPRSVPAMWPKLVAEAKDGGADCIETYVFWNGHETAP 161
Query: 87 NQYNFEGRYDLVKFVKLVAEAGLYAHLRIGPYVCAEWNFGGFPLWLHFIPGIQFRTDNEP 146
+Y FE R+DLV+F K+V +AGLY LRIGP+V AEWNFGG P+WLH+IPG FRT+NEP
Sbjct: 162 GEYYFEDRFDLVRFAKVVKDAGLYLMLRIGPFVAAEWNFGGVPVWLHYIPGAVFRTNNEP 221
Query: 147 FKAEMQRFTAKIVDMMKQEKLYASQGGPIILSQIENEYGNIDSAYGAAGKSYIKWAAGMA 206
FK+ M+ FT KIVDMMK+E+ +ASQGG IIL+QIENEYG+ + AYGA GK+Y WAA MA
Sbjct: 222 FKSHMKSFTTKIVDMMKRERFFASQGGHIILAQIENEYGDTEQAYGADGKAYAMWAASMA 281
Query: 207 LSLDTGVPWVMCQQSDAPDPIINTCNGFYCDQFTPNSNNKPKMWTENWSGWFLSFGGAVP 266
L+ +TGVPW+MCQQ DAP+ +INTCN FYCDQF NS KPK+WTENW GWF +FG + P
Sbjct: 282 LAQNTGVPWIMCQQYDAPEHVINTCNSFYCDQFKTNSPTKPKIWTENWPGWFQTFGESNP 341
Query: 267 YRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNFDRTSGGPFISTSYDYDAPLDEYGLIRQ 326
+RP ED+AF+VARFFQ+GG+ QNYY+YHGGTNF RT+GGPFI+TSYDYDAP+DEYGL R
Sbjct: 342 HRPPEDVAFSVARFFQKGGSVQNYYVYHGGTNFGRTTGGPFITTSYDYDAPIDEYGLTRL 401
Query: 327 PKWGHLKDLHKAIKLCEAALVATDPTYPSLGPNLEATVYKTGSGLCSAFLANIGTNSDVT 386
PKW HL+DLHK+IKLCE +L+ + T SLG EA VY SG C AFLANI +D
Sbjct: 402 PKWAHLRDLHKSIKLCEHSLLYGNLTSLSLGTKQEADVYTDHSGGCVAFLANIDPENDTV 461
Query: 387 VKFNGNSYLLPAWSVSILPDCKNVVFNTAKINSVTLVPSFSRQSLQVAADSSDAIGSGWS 446
V F Y LPAWSVSILPDCKN VFNTAK+ S TL+ ++LQ WS
Sbjct: 462 VTFRSRQYDLPAWSVSILPDCKNAVFNTAKVQSQTLMVDMVPETLQSTKPDR------WS 515
Query: 447 YINEPVGISKDDAFTKPGLLEQINTTADQSDYLWYSLSTNIKADEPLLEDGSKTVLHVQS 506
E GI + F + G ++ INTT D +DYLW++ S N+ P +G++ +L + S
Sbjct: 516 IFREKTGIWDKNDFIRNGFVDHINTTKDSTDYLWHTTSFNVDRSYPT--NGNRELLSIDS 573
Query: 507 LGHALHAFINGKLVGSGYGSSSNAKVTVDFPIALAPGKNTFDLLSLTVGLQNYGAFYEKT 566
GHA+HAF+N +L+GS YG+ S + V PI L PGKN LLS+TVGLQN G YE
Sbjct: 574 KGHAVHAFLNNELIGSAYGNGSKSSFNVHMPIKLKPGKNEIALLSMTVGLQNAGPHYEWV 633
Query: 567 GAGITGPVQLKGSGNGTNIDLSSQQWTYQTGLKGEELNF---PSGSSTQWDSKSTLPKLQ 623
GAG+T V + G NG+ IDLSS W Y+ GL+GE G++ +W +S PK Q
Sbjct: 634 GAGLTS-VNISGMKNGS-IDLSSNNWAYKIGLEGEHYGLFKPDQGNNQRWSPQSEPPKGQ 691
Query: 624 PLVWYKTTFDAPAGSEPVAIDFTGMGKGEAWVNGQSIGRYWPTYVSQNGGCTDSCNYRGA 683
PL WYK D P G +PV ID MGKG AW+NG +IGRYWP S + CT SCNYRG
Sbjct: 692 PLTWYKVNVDVPQGDDPVGIDMQSMGKGLAWLNGNAIGRYWPRTSSSDDRCTPSCNYRGP 751
Query: 684 YSSNKCLKNCGKPSQSLYHVPRSWLKSSGNTLVLFEEIGGDPTKISFVTKQLGSSLCSHV 743
++ +KC CGKP+Q YHVPRSW SGNTLV+FEE GGDPTKI+F ++++ + +CS V
Sbjct: 752 FNPSKCRTGCGKPTQRWYHVPRSWFHPSGNTLVVFEEQGGDPTKITF-SRRVATKVCSFV 810
Query: 744 TDSHP-LPVDMWGSDSKIQRKPGPVLSLECPNPNQVISSIKFASFGTPLGTCGSFSRGRC 802
++++P + ++ W K + L CP + ISS+KFASFG P GTC S+ +GRC
Sbjct: 811 SENYPSIDLESWDKSISDDGKDTAKVQLSCPK-GKNISSVKFASFGDPSGTCRSYQQGRC 869
Query: 803 SSARSLSVVRQACVGSKSCSIGVSVNTFG-DPCKGVMKSLAVEASCT 848
SLSVV +AC+ SC++ +S FG D C GV K+LA+EA C+
Sbjct: 870 HHPSSLSVVEKACLNINSCTVSLSDEGFGKDLCPGVAKTLAIEADCS 916
>gi|84579373|dbj|BAE72075.1| pear beta-galactosidase3 [Pyrus communis]
Length = 894
Score = 945 bits (2443), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 480/881 (54%), Positives = 595/881 (67%), Gaps = 51/881 (5%)
Query: 7 LLLVLCWGFVVLATTSFGA--NVTYDHRAVVIGGKRRVLISGSIHYPRSTPEMWPDLIQK 64
L L L F + A + NV+YDHRA++I GKRR+L+S IHYPR+TPEMWPDLI K
Sbjct: 14 LFLCLAVQFALEAAAEYFKPFNVSYDHRALIIDGKRRMLVSAGIHYPRATPEMWPDLIAK 73
Query: 65 SKDGGLDVIETYVFWNLHEPVRNQYNFEGRYDLVKFVKLVAEAGLYAHLRIGPYVCAEWN 124
SK+GG+DVI+TY FW+ HEPVR QYNFEGRYD+VKF LV +GLY HLRIGPYVCAEWN
Sbjct: 74 SKEGGVDVIQTYAFWSGHEPVRGQYNFEGRYDIVKFANLVGASGLYLHLRIGPYVCAEWN 133
Query: 125 FGGFPLWLHFIPGIQFRTDNEPFKAEMQRFTAKIVDMMKQEKLYASQGGPIILSQIENEY 184
FGGFP+WL IPGI+FRT+N FK EMQRF K+VD+M++E+L + QGGPII+ QIENEY
Sbjct: 134 FGGFPVWLRDIPGIEFRTNNALFKEEMQRFVKKMVDLMQEEELLSWQGGPIIMLQIENEY 193
Query: 185 GNIDSAYGAAGKSYIKWAAGMALSLDTGVPWVMCQQSDAPDPIINTCNGFYCDQFTPNSN 244
GNI+ +G GK YIKWAA MAL L GVPWVMC+Q DAP II+ CNG+YCD + PNS
Sbjct: 194 GNIEGQFGQKGKEYIKWAAEMALGLGAGVPWVMCKQVDAPGSIIDACNGYYCDGYKPNSY 253
Query: 245 NKPKMWTENWSGWFLSFGGAVPYRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNFDRTSG 304
NKP MWTE+W GW+ S+GG +P+RPVEDLAFAVARF+QRGG+FQNYYMY GGTNF RTSG
Sbjct: 254 NKPTMWTEDWDGWYASWGGRLPHRPVEDLAFAVARFYQRGGSFQNYYMYFGGTNFGRTSG 313
Query: 305 GPFISTSYDYDAPLDEYGLIRQPKWGHLKDLHKAIKLCEAALVATD-PTYPSLGPNLEAT 363
GPF TSYDYDAP+DEYGL+ +PKWGHLKDLH AIKLCE ALVA D P Y LGP EA
Sbjct: 314 GPFYITSYDYDAPIDEYGLLSEPKWGHLKDLHAAIKLCEPALVAADSPNYIKLGPKQEAH 373
Query: 364 VYKTGS---GL----------CSAFLANIGTNSDVTVKFNGNSYLLPAWSVSILPDCKNV 410
VY+ S GL CSAFLANI + +V F G Y LP WSVSILPDC+NV
Sbjct: 374 VYRMNSHTEGLNITSYGSQISCSAFLANIDEHKAASVTFLGQKYNLPPWSVSILPDCRNV 433
Query: 411 VFNTAKINSVT----------LVPSFSRQSLQVAADSSDAIGSGWSYINEPVGISKDDAF 460
V+NTAK+ + T L S Q + + I W + EPVG+ ++ F
Sbjct: 434 VYNTAKVGAQTSIKTVEFDLPLYSGISSQQQFITKNDDLFITKSWMTVKEPVGVWSENNF 493
Query: 461 TKPGLLEQINTTADQSDYLWYSLSTNIKADEPLL--EDGSKTVLHVQSLGHALHAFINGK 518
T G+LE +N T DQSDYLW+ + D+ ++ + + S+ L F+NG+
Sbjct: 494 TVQGILEHLNVTKDQSDYLWHITRIFVSEDDISFWEKNNISAAVSIDSMRDVLRVFVNGQ 553
Query: 519 LVGSGYGSSSNAKVTVDFPIALAPGKNTFDLLSLTVGLQNYGAFYEKTGAGITGPVQLKG 578
L GS G V V+ P+ G N LL+ TVGLQNYGAF EK GAG G ++L G
Sbjct: 554 LTGSVIGHW----VKVEQPVKFLKGYNDLVLLTQTVGLQNYGAFLEKDGAGFRGQIKLTG 609
Query: 579 SGNGTNIDLSSQQWTYQTGLKGEELNFPSGSSTQWDSKSTLPKLQP------LVWYKTTF 632
NG +ID S WTYQ GLKGE L + + + K++ +L P +WYKT F
Sbjct: 610 FKNG-DIDFSKLLWTYQVGLKGEFLKI---YTIEENEKASWAELSPDDDPSTFIWYKTYF 665
Query: 633 DAPAGSEPVAIDFTGMGKGEAWVNGQSIGRYWPTYVSQNGGCTDSCNYRGAYSSNKCLKN 692
D+PAG++PVA+D MGKG+AWVNG IGRYW T V+ GC + C+YRGAY S+KC N
Sbjct: 666 DSPAGTDPVALDLGSMGKGQAWVNGHHIGRYW-TLVAPEDGCPEICDYRGAYDSDKCSFN 724
Query: 693 CGKPSQSLYHVPRSWLKSSGNTLVLFEEIGGDPTKISFVTKQLGSSLCSHVTDSHPLPVD 752
CGKP+Q+LYHVPRSWL+SS N LV+ EE GG+P IS + G LC+ V++SH PV
Sbjct: 725 CGKPTQTLYHVPRSWLQSSSNLLVILEETGGNPFDISIKLRSAG-VLCAQVSESHYPPVQ 783
Query: 753 MWGSDSKIQRKP-----GPVLSLECPNPNQVISSIKFASFGTPLGTCGSFSRGRCSSARS 807
W + + K P + L+C + ISSI+FAS+GTP G+C FS G C + S
Sbjct: 784 KWFNPDSVDEKITVNDLTPEMHLQCQD-GFTISSIEFASYGTPQGSCQKFSMGNCHATNS 842
Query: 808 LSVVRQACVGSKSCSIGVSVNTF-GDPCKGVMKSLAVEASC 847
S+V ++C+G SCS+ +S +F GDPC+GV+K+LAVEA C
Sbjct: 843 SSIVSKSCLGKNSCSVEISNISFGGDPCRGVVKTLAVEARC 883
>gi|224129140|ref|XP_002328900.1| predicted protein [Populus trichocarpa]
gi|222839330|gb|EEE77667.1| predicted protein [Populus trichocarpa]
Length = 891
Score = 943 bits (2437), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 483/877 (55%), Positives = 592/877 (67%), Gaps = 43/877 (4%)
Query: 5 EILLLVLCWGFVVLATTSFGA-NVTYDHRAVVIGGKRRVLISGSIHYPRSTPEMWPDLIQ 63
+ L L F ++++ F NVTYDHRA++I G+RR+L S IHYPR+TPEMWPDLI
Sbjct: 13 QFLSFYLIIQFTLISSNFFEPFNVTYDHRALIIDGRRRILNSAGIHYPRATPEMWPDLIA 72
Query: 64 KSKDGGLDVIETYVFWNLHEPVRNQYNFEGRYDLVKFVKLVAEAGLYAHLRIGPYVCAEW 123
KSK+GG DV++TYVFW HEPV+ QY FEGRYDLVKFVKLV E+GLY HLRIGPYVCAEW
Sbjct: 73 KSKEGGADVVQTYVFWGGHEPVKGQYYFEGRYDLVKFVKLVGESGLYLHLRIGPYVCAEW 132
Query: 124 NFGGFPLWLHFIPGIQFRTDNEPFKAEMQRFTAKIVDMMKQEKLYASQGGPIILSQIENE 183
NFGGFP+WL +PG+ FRTDN PFK EMQ+F KIVD+M++E L + QGGPII+ QIENE
Sbjct: 133 NFGGFPVWLRDVPGVVFRTDNAPFKEEMQKFVTKIVDLMREEMLLSWQGGPIIMFQIENE 192
Query: 184 YGNIDSAYGAAGKSYIKWAAGMALSLDTGVPWVMCQQSDAPDPIINTCNGFYCDQFTPNS 243
YGNI+ ++G GK Y+KWAAGMAL+LD GVPWVMC+Q+DAP+ II+ CNG+YCD F PNS
Sbjct: 193 YGNIEHSFGQGGKEYMKWAAGMALALDAGVPWVMCKQTDAPENIIDACNGYYCDGFKPNS 252
Query: 244 NNKPKMWTENWSGWFLSFGGAVPYRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNFDRTS 303
KP WTE+W GW+ ++GG +P+RPVEDLAFAVARFFQRGG+FQNYYMY GGTNF RTS
Sbjct: 253 PKKPIFWTEDWDGWYTTWGGRLPHRPVEDLAFAVARFFQRGGSFQNYYMYFGGTNFGRTS 312
Query: 304 GGPFISTSYDYDAPLDEYGLIRQPKWGHLKDLHKAIKLCEAALVATDPT-YPSLGPNLEA 362
GGPF TSYDYDAP+DEYGL+ +PKWGHLKDLH AIKLCE ALVA D Y LGP EA
Sbjct: 313 GGPFYITSYDYDAPIDEYGLLSEPKWGHLKDLHAAIKLCEPALVAADSAQYIKLGPKQEA 372
Query: 363 TVY------------KTGS-GLCSAFLANIGTNSDVTVKFNGNSYLLPAWSVSILPDCKN 409
VY + GS CSAFLANI TV+F G S+ LP WSVSILPDC+N
Sbjct: 373 HVYGGSLSIQGMNFSQYGSQSKCSAFLANIDERQAATVRFLGQSFTLPPWSVSILPDCRN 432
Query: 410 VVFNTAK------INSVTLVPSFSRQSL--QVAADSSDAIGS-GWSYINEPVGISKDDAF 460
VFNTAK I +V V S SL Q + D+ S W EP+ + ++ F
Sbjct: 433 TVFNTAKVAAQTHIKTVEFVLPLSNSSLLPQFIVQNEDSPQSTSWLIAKEPITLWSEENF 492
Query: 461 TKPGLLEQINTTADQSDYLWYSLSTNIKADEPLLEDGSKT--VLHVQSLGHALHAFINGK 518
T G+LE +N T D+SDYLWY + D+ + +K + + S+ L FING+
Sbjct: 493 TVKGILEHLNVTKDESDYLWYFTRIYVSDDDIAFWEKNKVSPAVSIDSMRDVLRVFINGQ 552
Query: 519 LVGSGYGSSSNAKVTVDFPIALAPGKNTFDLLSLTVGLQNYGAFYEKTGAGITGPVQLKG 578
L GS G A P+ G N LLS TVGLQNYGAF E+ GAG G ++L G
Sbjct: 553 LTGSVVGHWVKAVQ----PVQFQKGYNELVLLSQTVGLQNYGAFLERDGAGFKGQIKLTG 608
Query: 579 SGNGTNIDLSSQQWTYQTGLKGEELNFPS---GSSTQWDSKSTLPKLQPLVWYKTTFDAP 635
NG +IDLS+ WTYQ GLKGE L S +W + WYKT FDAP
Sbjct: 609 FKNG-DIDLSNLSWTYQVGLKGEFLKVYSTGDNEKFEWSELAVDATPSTFTWYKTFFDAP 667
Query: 636 AGSEPVAIDFTGMGKGEAWVNGQSIGRYWPTYVSQNGGCTDSCNYRGAYSSNKCLKNCGK 695
+G +PVA+D MGKG+AWVNG IGRYW T VS GC SC+YRGAYSS KC NCG
Sbjct: 668 SGVDPVALDLGSMGKGQAWVNGHHIGRYW-TVVSPKDGC-GSCDYRGAYSSGKCRTNCGN 725
Query: 696 PSQSLYHVPRSWLKSSGNTLVLFEEIGGDPTKISFVTKQLGSSLCSHVTDSHPLPVDMWG 755
P+Q+ YHVPR+WL++S N LV+FEE GG+P +IS V + +C+ V++SH P+ W
Sbjct: 726 PTQTWYHVPRAWLEASNNLLVVFEETGGNPFEIS-VKLRSAKVICAQVSESHYPPLRKWS 784
Query: 756 ----SDSKIQRKP-GPVLSLECPNPNQVISSIKFASFGTPLGTCGSFSRGRCSSARSLSV 810
+ I R P + L+C + ++SSI+FAS+GTP G+C FSRG C ++ S SV
Sbjct: 785 RADLTGGNISRNDMTPEMHLKCQD-GHIMSSIEFASYGTPNGSCQKFSRGNCHASNSSSV 843
Query: 811 VRQACVGSKSCSIGVSVNTFGDPCKGVMKSLAVEASC 847
V +AC G C I +S FGDPC+GV+K+LAVEA C
Sbjct: 844 VTEACQGKNKCDIAISNAVFGDPCRGVIKTLAVEARC 880
>gi|61162194|dbj|BAD91079.1| beta-D-galactosidase [Pyrus pyrifolia]
Length = 903
Score = 943 bits (2437), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 479/881 (54%), Positives = 596/881 (67%), Gaps = 50/881 (5%)
Query: 7 LLLVLCWGFVVLATTSFGA--NVTYDHRAVVIGGKRRVLISGSIHYPRSTPEMWPDLIQK 64
L L L F + A + NV+YDHRA++I GKRR+L+S IHYPR+TPEMWPDLI K
Sbjct: 14 LFLCLAVQFALEAAAEYFKPFNVSYDHRALIIDGKRRMLVSAGIHYPRATPEMWPDLIAK 73
Query: 65 SKDGGLDVIETYVFWNLHEPVRNQYNFEGRYDLVKFVKLVAEAGLYAHLRIGPYVCAEWN 124
SK+GG+DVI+TY FW+ HEPVR QYNFEGRYD+VKF LV +GLY HLRIGPYVCAEWN
Sbjct: 74 SKEGGVDVIQTYAFWSGHEPVRGQYNFEGRYDIVKFANLVGASGLYLHLRIGPYVCAEWN 133
Query: 125 FGGFPLWLHFIPGIQFRTDNEPFKAEMQRFTAKIVDMMKQEKLYASQGGPIILSQIENEY 184
FGGFP+WL IPGI+FRT+N FK EMQRF K+VD+M++E+L + QGGPII+ QIENEY
Sbjct: 134 FGGFPVWLRDIPGIEFRTNNALFKEEMQRFVKKMVDLMQEEELLSWQGGPIIMMQIENEY 193
Query: 185 GNIDSAYGAAGKSYIKWAAGMALSLDTGVPWVMCQQSDAPDPIINTCNGFYCDQFTPNSN 244
GNI+ +G GK YIKWAA MAL L GVPWVMC+Q DAP II+ CNG+YCD + PNS
Sbjct: 194 GNIEGQFGQKGKEYIKWAAEMALGLGAGVPWVMCKQVDAPGSIIDACNGYYCDGYKPNSY 253
Query: 245 NKPKMWTENWSGWFLSFGGAVPYRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNFDRTSG 304
NKP +WTE+W GW+ S+GG +P+RPVEDLAFAVARF+QRGG+FQNYYMY GGTNF RTSG
Sbjct: 254 NKPTLWTEDWDGWYASWGGRLPHRPVEDLAFAVARFYQRGGSFQNYYMYFGGTNFGRTSG 313
Query: 305 GPFISTSYDYDAPLDEYGLIRQPKWGHLKDLHKAIKLCEAALVATD-PTYPSLGPNLEAT 363
GPF TSYDYDAP+DEYGL+ +PKWGHLKDLH AIKLCE ALVA D P Y LGP EA
Sbjct: 314 GPFYITSYDYDAPIDEYGLLSEPKWGHLKDLHAAIKLCEPALVAADSPNYIKLGPKQEAH 373
Query: 364 VYKTGS---GL----------CSAFLANIGTNSDVTVKFNGNSYLLPAWSVSILPDCKNV 410
VY+ S GL CSAFLANI + +V F G Y LP WSVSILPDC+NV
Sbjct: 374 VYRVNSHTEGLNITSYGSQISCSAFLANIDEHKAASVTFLGQKYNLPPWSVSILPDCRNV 433
Query: 411 VFNTAKINSVT----------LVPSFSRQSLQVAADSSDAIGSGWSYINEPVGISKDDAF 460
V+NTAK+ + T L S Q + + I W + EPVG+ ++ F
Sbjct: 434 VYNTAKVGAQTSIKTVEFDLPLYSGISSQQQFITKNDDLFITKSWMTVKEPVGVWSENNF 493
Query: 461 TKPGLLEQINTTADQSDYLWYSLSTNIKADEPLL--EDGSKTVLHVQSLGHALHAFINGK 518
T G+LE +N T DQSDYLW+ + D+ ++ + + S+ L F+NG+
Sbjct: 494 TVQGILEHLNVTKDQSDYLWHITRIFVSEDDISFWEKNNISAAVSIDSMRDVLRVFVNGQ 553
Query: 519 LVGSGYGSSSNAKVTVDFPIALAPGKNTFDLLSLTVGLQNYGAFYEKTGAGITGPVQLKG 578
L GS V V+ P+ G N LL+ TVGLQNYGAF EK GAG G ++L G
Sbjct: 554 LTE---GSVIGHWVKVEQPVKFLKGYNDLVLLTQTVGLQNYGAFLEKDGAGFRGQIKLTG 610
Query: 579 SGNGTNIDLSSQQWTYQTGLKGEELNFPSGSSTQWDSKSTLPKLQP------LVWYKTTF 632
NG +IDLS WTYQ GLKGE F + + + K+ +L P +WYKT F
Sbjct: 611 FKNG-DIDLSKLLWTYQVGLKGE---FFKIYTIEENEKAGWAELSPDDDPSTFIWYKTYF 666
Query: 633 DAPAGSEPVAIDFTGMGKGEAWVNGQSIGRYWPTYVSQNGGCTDSCNYRGAYSSNKCLKN 692
D+PAG++PVA+D MGKG+AWVNG IGRYW T V+ GC + C+YRGAY+S+KC N
Sbjct: 667 DSPAGTDPVALDLGSMGKGQAWVNGHHIGRYW-TLVAPEDGCPEICDYRGAYNSDKCSFN 725
Query: 693 CGKPSQSLYHVPRSWLKSSGNTLVLFEEIGGDPTKISFVTKQLGSSLCSHVTDSHPLPVD 752
CGKP+Q+LYHVPRSWL+SS N LV+ EE GG+P IS + G LC+ V++SH PV
Sbjct: 726 CGKPTQTLYHVPRSWLQSSSNLLVILEETGGNPFDISIKLRSAG-VLCAQVSESHYPPVQ 784
Query: 753 MWGSDSKIQRKP-----GPVLSLECPNPNQVISSIKFASFGTPLGTCGSFSRGRCSSARS 807
W + + K P + L+C + ISSI+FAS+GTP G+C FS G C + S
Sbjct: 785 KWFNPDSVDEKITVNDLTPEMHLQCQD-GFTISSIEFASYGTPQGSCQKFSMGNCHATNS 843
Query: 808 LSVVRQACVGSKSCSIGVSVNTF-GDPCKGVMKSLAVEASC 847
S+V ++C+G SCS+ +S N+F GDPC+G++K+LAVEA C
Sbjct: 844 SSIVSKSCLGKNSCSVEISNNSFGGDPCRGIVKTLAVEARC 884
>gi|114217393|dbj|BAF31232.1| beta-D-galactosidase [Persea americana]
Length = 889
Score = 941 bits (2433), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 485/881 (55%), Positives = 591/881 (67%), Gaps = 49/881 (5%)
Query: 5 EILLLVLCWGFVVLATTSFGA--NVTYDHRAVVIGGKRRVLISGSIHYPRSTPEMWPDLI 62
E LL+V+ + A T F NV+YDHRA++I GKRR+LIS IHYPR+TPEMWPDLI
Sbjct: 9 EFLLVVMT--LQIAACTEFFKPFNVSYDHRALIIDGKRRMLISSGIHYPRATPEMWPDLI 66
Query: 63 QKSKDGGLDVIETYVFWNLHEPVRNQYNFEGRYDLVKFVKLVAEAGLYAHLRIGPYVCAE 122
KSK+GG D+I+TY FWN HEP+R QYNFEGRYD+VKF+KL AGLY HLRIGPYVCAE
Sbjct: 67 AKSKEGGADLIQTYAFWNGHEPIRGQYNFEGRYDIVKFIKLAGSAGLYFHLRIGPYVCAE 126
Query: 123 WNFGGFPLWLHFIPGIQFRTDNEPFKAEMQRFTAKIVDMMKQEKLYASQGGPIILSQIEN 182
WNFGGFP+WL IPGI+FRTDN P+K EMQRF KIVD+M+QE L++ QGGPIIL QIEN
Sbjct: 127 WNFGGFPVWLRDIPGIEFRTDNAPYKDEMQRFVKKIVDLMRQEMLFSWQGGPIILLQIEN 186
Query: 183 EYGNIDSAYGAAGKSYIKWAAGMALSLDTGVPWVMCQQSDAPDPIINTCNGFYCDQFTPN 242
EYGNI+ YG GK Y+KWAA MA+ L GVPWVMC+Q+DAP+ II+ CN FYCD F PN
Sbjct: 187 EYGNIERLYGQRGKDYVKWAADMAIGLGAGVPWVMCRQTDAPENIIDACNAFYCDGFKPN 246
Query: 243 SNNKPKMWTENWSGWFLSFGGAVPYRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNFDRT 302
S KP +WTE+W+GW+ S+GG VP+RPVED AFAVARFFQRGG++ NYYM+ GGTNF RT
Sbjct: 247 SYRKPALWTEDWNGWYTSWGGRVPHRPVEDNAFAVARFFQRGGSYHNYYMFFGGTNFGRT 306
Query: 303 SGGPFISTSYDYDAPLDEYGLIRQPKWGHLKDLHKAIKLCEAALVATD--PTYPSLGPNL 360
SGGPF TSYDYDAP+DEYGL+ QPKWGHLKDLH AIKLCE ALVA D P Y LGP
Sbjct: 307 SGGPFYVTSYDYDAPIDEYGLLSQPKWGHLKDLHSAIKLCEPALVAVDDAPQYIRLGPMQ 366
Query: 361 EATVYKTGS-------------GLCSAFLANIGTNSDVTVKFNGNSYLLPAWSVSILPDC 407
EA VY+ S LCSAFLANI ++ VKF G Y LP WSVSILPDC
Sbjct: 367 EAHVYRHSSYVEDQSSSTLGNGTLCSAFLANIDEHNSANVKFLGQVYSLPPWSVSILPDC 426
Query: 408 KNVVFNTAKINSVTLVPS--FSRQSLQ--------VAADSSDAIGSGWSYINEPVGISKD 457
KNV FNTAK+ S V + FS ++ + D I + W + EP+G
Sbjct: 427 KNVAFNTAKVASQISVKTVEFSSPFIENTTEPGYLLLHDGVHHISTNWMILKEPIGEWGG 486
Query: 458 DAFTKPGLLEQINTTADQSDYLWYSLSTNIKADEPLLEDGSKTV--LHVQSLGHALHAFI 515
+ FT G+LE +N T D SDYLWY + +I ++ + S+ L + S+ + F+
Sbjct: 487 NNFTAEGILEHLNVTKDTSDYLWYIMRLHISDEDISFWEASEVSPKLIIDSMRDVVRIFV 546
Query: 516 NGKLVGSGYGSSSNAKVTVDFPIALAPGKNTFDLLSLTVGLQNYGAFYEKTGAGITGPVQ 575
NG+L GS G V V+ P+ L G N +LS TVGLQNYGAF EK GAG G ++
Sbjct: 547 NGQLAGSHVGRW----VRVEQPVDLVQGYNELAILSETVGLQNYGAFLEKDGAGFKGQIK 602
Query: 576 LKGSGNGTNIDLSSQQWTYQTGLKGEEL---NFPSGSSTQWDSKSTLPKLQPLVWYKTTF 632
L G +G DL++ W YQ GL+GE + + S W WYKT F
Sbjct: 603 LTGLKSG-EYDLTNSLWVYQVGLRGEFMKIFSLEEHESADWVDLPNDSVPSAFTWYKTFF 661
Query: 633 DAPAGSEPVAIDFTGMGKGEAWVNGQSIGRYWPTYVSQNGGCTDSCNYRGAYSSNKCLKN 692
DAP G +PV++ MGKG+AWVNG SIGRYW + V+ GC SC+YRGAY +KC N
Sbjct: 662 DAPQGKDPVSLYLGSMGKGQAWVNGHSIGRYW-SLVAPVDGC-QSCDYRGAYHESKCATN 719
Query: 693 CGKPSQSLYHVPRSWLKSSGNTLVLFEEIGGDPTKISFVTKQLGSSLCSHVTDSHPLPVD 752
CGKP+QS YH+PRSWL+ S N LV+FEE GG+P +IS V SS+C+ V++SH P+
Sbjct: 720 CGKPTQSWYHIPRSWLQPSKNLLVIFEETGGNPLEIS-VKLHSTSSICTKVSESHYPPLH 778
Query: 753 MWGSDSKIQRKPG-----PVLSLECPNPNQVISSIKFASFGTPLGTCGSFSRGRCSSARS 807
+W + K P + L+C N Q ISSI FASFGTP G+C FS+G C + S
Sbjct: 779 LWSHKDIVNGKVSISNAVPEIHLQCDN-GQRISSIMFASFGTPQGSCQRFSQGDCHAPNS 837
Query: 808 LSVVRQACVGSKSCSIGVSVNTF-GDPCKGVMKSLAVEASC 847
SVV +AC G +CSIGVS F GDPC+GV+K+LAVEA C
Sbjct: 838 FSVVSEACQGRNNCSIGVSNKVFGGDPCRGVVKTLAVEAKC 878
>gi|357130338|ref|XP_003566806.1| PREDICTED: beta-galactosidase 2-like [Brachypodium distachyon]
Length = 831
Score = 941 bits (2431), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 460/824 (55%), Positives = 571/824 (69%), Gaps = 23/824 (2%)
Query: 27 VTYDHRAVVIGGKRRVLISGSIHYPRSTPEMWPDLIQKSKDGGLDVIETYVFWNLHEPVR 86
VTYD +AVV+ G+RR+L+SGSIHYPRS PEMWPDLIQK+KDGGLDV++TYVFWN HEP
Sbjct: 29 VTYDRKAVVVNGQRRILLSGSIHYPRSVPEMWPDLIQKAKDGGLDVVQTYVFWNGHEPSP 88
Query: 87 NQYNFEGRYDLVKFVKLVAEAGLYAHLRIGPYVCAEWNFGGFPLWLHFIPGIQFRTDNEP 146
QY+FEGRYDLV F+KLV +AGLY HLRIGPYVCAEWNFGGFP+WL ++PGI FRTDNEP
Sbjct: 89 GQYHFEGRYDLVHFIKLVKQAGLYVHLRIGPYVCAEWNFGGFPIWLKYVPGISFRTDNEP 148
Query: 147 FKAEMQRFTAKIVDMMKQEKLYASQGGPIILSQIENEYGNIDSAYGAAGKSYIKWAAGMA 206
FKAEMQ+FT KIV MMK E+L+ QGGPIILSQIENE+G ++ G K Y WAA MA
Sbjct: 149 FKAEMQKFTTKIVQMMKSERLFEWQGGPIILSQIENEFGPLEWDQGEPAKDYASWAANMA 208
Query: 207 LSLDTGVPWVMCQQSDAPDPIINTCNGFYCDQFTPNSNNKPKMWTENWSGWFLSFGGAVP 266
++L+TGVPW+MC++ DAPDPIINTCNGFYCD F+PN +KP MWTE W+ W+ FG VP
Sbjct: 209 MALNTGVPWIMCKEDDAPDPIINTCNGFYCDWFSPNKPHKPTMWTEAWTAWYTGFGIPVP 268
Query: 267 YRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNFDRTSGGPFISTSYDYDAPLDEYGLIRQ 326
+RPVEDLA+ VA+F Q+GG+F NYYMYHGGTNF+RT+GGPFI+TSYDYDAPLDEYGL+R+
Sbjct: 269 HRPVEDLAYGVAKFIQKGGSFVNYYMYHGGTNFERTAGGPFIATSYDYDAPLDEYGLLRE 328
Query: 327 PKWGHLKDLHKAIKLCEAALVATDPTYPSLGPNLEATVYKTGSGLCSAFLANIGTNSDVT 386
PKWGHLK+LH+AIKLCE ALVA DP SLG +A+V+++ +G C+AFL N S
Sbjct: 329 PKWGHLKELHRAIKLCEPALVAADPILSSLGNAQKASVFRSSTGACAAFLENKHKLSYAR 388
Query: 387 VKFNGNSYLLPAWSVSILPDCKNVVFNTAKINSVTLVPSFSRQSLQVAADSSDAIGSGWS 446
V FNG Y LP WS+SILPDCK VFNTA++ S S+ ++ A G W
Sbjct: 389 VSFNGMHYDLPPWSISILPDCKTTVFNTARVGS-----QISQMKMEWAG------GLTWQ 437
Query: 447 YINEPVG-ISKDDAFTKPGLLEQINTTADQSDYLWYSLSTNIKADEPLLEDGSKTVLHVQ 505
NE + S+ ++FT GLLEQIN T D +DYLWY+ ++ DE L G L V
Sbjct: 438 SYNEEINSFSELESFTTVGLLEQINMTRDNTDYLWYTTYVDVAKDEQFLTSGKNPKLTVM 497
Query: 506 SLGHALHAFINGKLVGSGYGSSSNAKVTVDFPIALAPGKNTFDLLSLTVGLQNYGAFYEK 565
S GHALH FING+L G+ YGS N K+T + L G NT LS+ VGL N G +E
Sbjct: 498 SAGHALHVFINGQLSGTVYGSVENPKLTYTGKVKLWSGSNTISCLSIAVGLPNVGEHFET 557
Query: 566 TGAGITGPVQLKGSGNGTNIDLSSQQWTYQTGLKGEELNFPSGSSTQWDSKSTLPKLQPL 625
AGI GPV L G G DL+ Q+WTYQ GLKGE ++ S S + + QPL
Sbjct: 558 WNAGILGPVTLDGLNEGKR-DLTWQKWTYQVGLKGEAMSLHSLSGSSSVEWGEPVQKQPL 616
Query: 626 VWYKTTFDAPAGSEPVAIDFTGMGKGEAWVNGQSIGRYWPTYVSQNGGCTDSCNYRGAYS 685
WYK F+AP G EP+A+D MGKG+ W+NGQ IGRYWP Y + G C+YRG Y+
Sbjct: 617 TWYKAFFNAPDGDEPLALDMNSMGKGQIWINGQGIGRYWPGY--KASGTCGHCDYRGEYN 674
Query: 686 SNKCLKNCGKPSQSLYHVPRSWLKSSGNTLVLFEEIGGDPTKISFVTKQLGSSLCSHVTD 745
KC NCG PSQ YHVPR WL +GN LV+FEE GGDPT IS V + G S+C+ V++
Sbjct: 675 ETKCQTNCGDPSQRWYHVPRPWLNPTGNLLVIFEEWGGDPTGISMVKRTTG-SVCADVSE 733
Query: 746 SHPLPVDMWGSDSKIQRKPGPVLSLECPNPNQVISSIKFASFGTPLGTCGSFSRGRCSSA 805
P + W + + + + L+C + + I+ IKFASFGTP G+CG++S G C +
Sbjct: 734 WQP-SIKNWRTKDYEKAE----VHLQC-DHGRKITEIKFASFGTPQGSCGNYSEGGCHAH 787
Query: 806 RSLSVVRQACVGSKSCSIGVSVNTF-GDPCKGVMKSLAVEASCT 848
RS + ++ C+ + C + V F GDPC G MK VE +C+
Sbjct: 788 RSYDIFKKNCINQEWCGVSVVPEAFGGDPCPGTMKRAVVEVTCS 831
>gi|15081596|gb|AAK81874.1| putative beta-galactosidase BG1 [Vitis vinifera]
Length = 854
Score = 940 bits (2430), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 465/846 (54%), Positives = 580/846 (68%), Gaps = 32/846 (3%)
Query: 15 FVVLATTSFGANVTYDHRAVVIGGKRRVLISGSIHYPRSTPEMWPDLIQKSKDGGLDVIE 74
+ L + +VTYD +A+VI G+RR+LISGSIHYPRSTP+MW DLI+K+KDGGLDVI+
Sbjct: 17 LMFLHSQLIQCSVTYDKKAIVINGQRRILISGSIHYPRSTPDMWEDLIRKAKDGGLDVID 76
Query: 75 TYVFWNLHEPVRNQYNFEGRYDLVKFVKLVAEAGLYAHLRIGPYVCAEWNFGGFPLWLHF 134
TY+FWN+HEP YNFEGRYDLV+F+K V + GLY HLRIGPYVCAEWNFGGFP+WL F
Sbjct: 77 TYIFWNVHEPSPGNYNFEGRYDLVRFIKTVQKVGLYVHLRIGPYVCAEWNFGGFPVWLKF 136
Query: 135 IPGIQFRTDNEPFKAEMQRFTAKIVDMMKQEKLYASQGGPIILSQIENEYGNIDSAYGAA 194
+PGI FRT+NEPFK MQ FT KIV MMK E L+ASQGGPIILSQIENEYG GAA
Sbjct: 137 VPGISFRTNNEPFKMAMQGFTQKIVHMMKSENLFASQGGPIILSQIENEYGPESRELGAA 196
Query: 195 GKSYIKWAAGMALSLDTGVPWVMCQQSDAPDPIINTCNGFYCDQFTPNSNNKPKMWTENW 254
G +YI WAA MA+ LDTGVPWVMC++ DAPDP+IN CNGFYCD F+PN KP++WTE W
Sbjct: 197 GHAYINWAAKMAVGLDTGVPWVMCKEDDAPDPVINACNGFYCDAFSPNKPYKPRIWTEAW 256
Query: 255 SGWFLSFGGAVPYRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNFDRTSGGPFISTSYDY 314
SGWF FGG + RPV+DLAF VARF Q GG+F NYYMYHGGTNF R++GGPFI+TSYDY
Sbjct: 257 SGWFTEFGGTIHRRPVQDLAFGVARFIQNGGSFVNYYMYHGGTNFGRSAGGPFITTSYDY 316
Query: 315 DAPLDEYGLIRQPKWGHLKDLHKAIKLCEAALVATDPTYPSLGPNLEATVYKTGSGLCSA 374
DAP+DEYGLIRQPK+GHLK+LHKAIKLCE A+V+ DPT SLG +A V+ +G G C+A
Sbjct: 317 DAPIDEYGLIRQPKYGHLKELHKAIKLCEHAVVSADPTVISLGSYQQAHVFSSGRGNCAA 376
Query: 375 FLANIGTNSDVTVKFNGNSYLLPAWSVSILPDCKNVVFNTAKI----NSVTLVPSFSR-Q 429
FL+N S V FN Y LPAWS+SILPDC+ VVFNTA++ + + + P+ S+
Sbjct: 377 FLSNYNPKSSARVIFNNVHYDLPAWSISILPDCRTVVFNTARVGVQTSHMRMFPTNSKLH 436
Query: 430 SLQVAADSSDAIGSGWSYINEPVGISKDDAFTKPGLLEQINTTADQSDYLWYSLSTNIKA 489
S + + ++GS T GLLEQIN T D +DYLWY S NI +
Sbjct: 437 SWETYGEDISSLGS-------------SGTMTAGGLLEQINITRDSTDYLWYMTSVNIDS 483
Query: 490 DEPLLEDGSKTVLHVQSLGHALHAFINGKLVGSGYGSSSNAKVTVDFPIALAPGKNTFDL 549
E L G L VQS GHA+H FING+ GS YG+ N K T L G N L
Sbjct: 484 SESFLRRGQTPTLTVQSKGHAVHVFINGQYSGSAYGTRENRKFTYTGAANLHAGTNRIAL 543
Query: 550 LSLTVGLQNYGAFYEKTGAGITGPVQLKGSGNGTNIDLSSQQWTYQTGLKGEELNF--PS 607
LS+ VGL N G +E GI GPV L G G DLS Q+W+YQ GLKGE +N P+
Sbjct: 544 LSIAVGLPNVGLHFETWKTGILGPVLLHGIDQGKR-DLSWQKWSYQVGLKGEAMNLVSPN 602
Query: 608 G-SSTQWDSKSTLPK-LQPLVWYKTTFDAPAGSEPVAIDFTGMGKGEAWVNGQSIGRYWP 665
G S+ +W S + QPL WYK F+AP G EP+A+D MGKG+ W+NGQSIGRYW
Sbjct: 603 GVSAVEWVRGSLAAQGQQPLKWYKAYFNAPEGDEPLALDMRSMGKGQVWINGQSIGRYWM 662
Query: 666 TYVSQNGGCTDSCNYRGAYSSNKCLKNCGKPSQSLYHVPRSWLKSSGNTLVLFEEIGGDP 725
Y G C + C+Y G Y KC CG P+Q YHVPRSWLK + N L++FEE+GGD
Sbjct: 663 AYA--KGDC-NVCSYSGTYRPPKCQHGCGHPTQRWYHVPRSWLKPTQNLLIIFEELGGDA 719
Query: 726 TKISFVTKQLGSSLCSHVTDSHPLPVDMWGSDSKIQRKP--GPVLSLECPNPNQVISSIK 783
+KI+ + + + S+C+ + HP ++ W ++S + + + L+C P Q IS+I
Sbjct: 720 SKIALMKRAM-KSVCADANEHHPT-LENWHTESPSESEELHQASVHLQCA-PGQSISTIM 776
Query: 784 FASFGTPLGTCGSFSRGRCSSARSLSVVRQACVGSKSCSIGVSVNTFG-DPCKGVMKSLA 842
FASFGTP GTCGSF +G C + S +++ + C+G + CS+ +S + FG DPC V+K L+
Sbjct: 777 FASFGTPSGTCGSFQKGTCHAPNSQAILEKNCIGQEKCSVPISNSYFGADPCPNVLKRLS 836
Query: 843 VEASCT 848
VEA+C+
Sbjct: 837 VEAACS 842
>gi|449433177|ref|XP_004134374.1| PREDICTED: beta-galactosidase 9-like [Cucumis sativus]
Length = 890
Score = 940 bits (2430), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 474/879 (53%), Positives = 583/879 (66%), Gaps = 44/879 (5%)
Query: 5 EILLLVLCWGFVVLATTSFGA-NVTYDHRAVVIGGKRRVLISGSIHYPRSTPEMWPDLIQ 63
+++ L L +V++ F NV+YDHRA++I GKRR+LIS +HYPR++PEMWPD+I+
Sbjct: 10 QLMSLTLTIHLLVVSGEFFKPFNVSYDHRALIIDGKRRMLISAGVHYPRASPEMWPDIIE 69
Query: 64 KSKDGGLDVIETYVFWNLHEPVRNQYNFEGRYDLVKFVKLVAEAGLYAHLRIGPYVCAEW 123
KSK+GG DVI++YVFWN HEP + QYNF+GRYDLVKF++LV +GLY HLRIGPYVCAEW
Sbjct: 70 KSKEGGADVIQSYVFWNGHEPTKGQYNFDGRYDLVKFIRLVGSSGLYLHLRIGPYVCAEW 129
Query: 124 NFGGFPLWLHFIPGIQFRTDNEPFKAEMQRFTAKIVDMMKQEKLYASQGGPIILSQIENE 183
NFGGFPLWL +PGI+FRTDN PFK EMQRF KIVD+++ EKL+ QGGP+I+ Q+ENE
Sbjct: 130 NFGGFPLWLRDVPGIEFRTDNAPFKEEMQRFVKKIVDLLRDEKLFCWQGGPVIMLQVENE 189
Query: 184 YGNIDSAYGAAGKSYIKWAAGMALSLDTGVPWVMCQQSDAPDPIINTCNGFYCDQFTPNS 243
YGNI+S+YG G+ YIKW MAL L VPWVMCQQ DAP IIN+CNG+YCD F NS
Sbjct: 190 YGNIESSYGKRGQEYIKWVGNMALGLGAEVPWVMCQQKDAPSTIINSCNGYYCDGFKANS 249
Query: 244 NNKPKMWTENWSGWFLSFGGAVPYRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNFDRTS 303
+KP WTENW+GWF S+G P+RPVEDLAF+VARFFQR G+FQNYYMY GGTNF RT+
Sbjct: 250 PSKPIFWTENWNGWFTSWGERSPHRPVEDLAFSVARFFQREGSFQNYYMYFGGTNFGRTA 309
Query: 304 GGPFISTSYDYDAPLDEYGLIRQPKWGHLKDLHKAIKLCEAALVATD-PTYPSLGPNLEA 362
GGPF TSYDYD+P+DEYGLIR+PKWGHLKDLH A+KLCE ALV+ D P Y LGP EA
Sbjct: 310 GGPFYITSYDYDSPIDEYGLIREPKWGHLKDLHTALKLCEPALVSADSPQYIKLGPKQEA 369
Query: 363 TVYKTGSGL-------------CSAFLANIGTNSDVTVKFNGNSYLLPAWSVSILPDCKN 409
VY S CSAFLANI V VKFNG +Y LP WSVSILPDC+N
Sbjct: 370 HVYHMKSQTDDLTLSKLGTLRNCSAFLANIDERKAVAVKFNGQTYNLPPWSVSILPDCQN 429
Query: 410 VVFNTAKINSVTLV-------PSFSRQSLQVAADSSDA---IGSGWSYINEPVGISKDDA 459
VVFNTAK+ + T + P + SL++ A + I + W + EP+GI D
Sbjct: 430 VVFNTAKVAAQTSIKILELYAPLSANVSLKLHATDQNELSIIANSWMTVKEPIGIWSDQN 489
Query: 460 FTKPGLLEQINTTADQSDYLWYSLSTNIKADEPLL--EDGSKTVLHVQSLGHALHAFING 517
FT G+LE +N T D+SDYLWY ++ D+ E + + S+ F+NG
Sbjct: 490 FTVKGILEHLNVTKDRSDYLWYMTRIHVSNDDIRFWKERNITPTITIDSVRDVFRVFVNG 549
Query: 518 KLVGSGYGSSSNAKVTVDFPIALAPGKNTFDLLSLTVGLQNYGAFYEKTGAGITGPVQLK 577
KL GS G V P+ G N LLS +GLQN GAF EK GAGI G ++L
Sbjct: 550 KLTGSAIGQW----VKFVQPVQFLEGYNDLLLLSQAMGLQNSGAFIEKDGAGIRGRIKLT 605
Query: 578 GSGNGTNIDLSSQQWTYQTGLKGEELNFPS---GSSTQWDSKSTLPKLQPLVWYKTTFDA 634
G NG +IDLS WTYQ GLKGE LNF S W S WYK F +
Sbjct: 606 GFKNG-DIDLSKSLWTYQVGLKGEFLNFYSLEENEKADWTELSVDAIPSTFTWYKAYFSS 664
Query: 635 PAGSEPVAIDFTGMGKGEAWVNGQSIGRYWPTYVSQNGGCTDSCNYRGAYSSNKCLKNCG 694
P G++PVAI+ MGKG+AWVNG IGRYW + VS GC C+YRGAY+S KC NCG
Sbjct: 665 PDGTDPVAINLGSMGKGQAWVNGHHIGRYW-SVVSPKDGCPRKCDYRGAYNSGKCATNCG 723
Query: 695 KPSQSLYHVPRSWLKSSGNTLVLFEEIGGDPTKISFVTKQLGSSLCSHVTDSH-----PL 749
+P+QS YH+PRSWLK S N LVLFEE GG+P +I G +C V++SH L
Sbjct: 724 RPTQSWYHIPRSWLKESSNLLVLFEETGGNPLEIVVKLYSTG-VICGQVSESHYPSLRKL 782
Query: 750 PVDMWGSDSKIQRKPGPVLSLECPNPNQVISSIKFASFGTPLGTCGSFSRGRCSSARSLS 809
D + + P + L C + VISS++FAS+GTP G+C FSRG C + SLS
Sbjct: 783 SNDYISDGETLSNRANPEMFLHC-DDGHVISSVEFASYGTPQGSCNKFSRGPCHATNSLS 841
Query: 810 VVRQACVGSKSCSIGVSVNTF-GDPCKGVMKSLAVEASC 847
VV QAC+G SC++ +S + F GDPC ++K+LAVEA C
Sbjct: 842 VVSQACLGKNSCTVEISNSAFGGDPCHSIVKTLAVEARC 880
>gi|147818153|emb|CAN78072.1| hypothetical protein VITISV_013292 [Vitis vinifera]
Length = 854
Score = 940 bits (2430), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 465/846 (54%), Positives = 580/846 (68%), Gaps = 32/846 (3%)
Query: 15 FVVLATTSFGANVTYDHRAVVIGGKRRVLISGSIHYPRSTPEMWPDLIQKSKDGGLDVIE 74
+ L + +VTYD +A+VI G+RR+LISGSIHYPRSTP+MW DLI+K+KDGGLDVI+
Sbjct: 17 LMFLHSQLIQCSVTYDKKAIVINGQRRILISGSIHYPRSTPDMWEDLIRKAKDGGLDVID 76
Query: 75 TYVFWNLHEPVRNQYNFEGRYDLVKFVKLVAEAGLYAHLRIGPYVCAEWNFGGFPLWLHF 134
TY+FWN+HEP YNFEGRYDLV+F+K V + GLY HLRIGPYVCAEWNFGGFP+WL F
Sbjct: 77 TYIFWNVHEPSPGNYNFEGRYDLVRFIKTVQKVGLYVHLRIGPYVCAEWNFGGFPVWLKF 136
Query: 135 IPGIQFRTDNEPFKAEMQRFTAKIVDMMKQEKLYASQGGPIILSQIENEYGNIDSAYGAA 194
+PGI FRT+NEPFK MQ FT KIV MMK E L+ASQGGPIILSQIENEYG GAA
Sbjct: 137 VPGISFRTNNEPFKMAMQGFTQKIVHMMKSENLFASQGGPIILSQIENEYGPESRELGAA 196
Query: 195 GKSYIKWAAGMALSLDTGVPWVMCQQSDAPDPIINTCNGFYCDQFTPNSNNKPKMWTENW 254
G +YI WAA MA+ LDTGVPWVMC++ DAPDP+IN CNGFYCD F+PN KP++WTE W
Sbjct: 197 GHAYINWAAKMAVGLDTGVPWVMCKEDDAPDPVINACNGFYCDAFSPNKPYKPRIWTEAW 256
Query: 255 SGWFLSFGGAVPYRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNFDRTSGGPFISTSYDY 314
SGWF FGG + RPV+DLAF VARF Q GG+F NYYMYHGGTNF R++GGPFI+TSYDY
Sbjct: 257 SGWFTEFGGTIHRRPVQDLAFGVARFIQNGGSFVNYYMYHGGTNFGRSAGGPFITTSYDY 316
Query: 315 DAPLDEYGLIRQPKWGHLKDLHKAIKLCEAALVATDPTYPSLGPNLEATVYKTGSGLCSA 374
DAP+DEYGLIRQPK+GHLK+LHKAIKLCE A+V+ DPT SLG +A V+ +G G C+A
Sbjct: 317 DAPIDEYGLIRQPKYGHLKELHKAIKLCEHAVVSADPTVISLGSYQQAHVFSSGRGNCAA 376
Query: 375 FLANIGTNSDVTVKFNGNSYLLPAWSVSILPDCKNVVFNTAKI----NSVTLVPSFSR-Q 429
FL+N S V FN Y LPAWS+SILPDC+ VVFNTA++ + + + P+ S+
Sbjct: 377 FLSNYNPKSSARVIFNNVHYDLPAWSISILPDCRTVVFNTARVGVQTSHMRMFPTNSKLH 436
Query: 430 SLQVAADSSDAIGSGWSYINEPVGISKDDAFTKPGLLEQINTTADQSDYLWYSLSTNIKA 489
S + + ++GS T GLLEQIN T D +DYLWY S NI +
Sbjct: 437 SWETYGEDISSLGS-------------SGTMTAGGLLEQINITRDSTDYLWYMTSVNIDS 483
Query: 490 DEPLLEDGSKTVLHVQSLGHALHAFINGKLVGSGYGSSSNAKVTVDFPIALAPGKNTFDL 549
E L G L VQS GHA+H FING+ GS YG+ N K T L G N L
Sbjct: 484 SESFLRRGQTPTLTVQSKGHAVHVFINGQYSGSAYGTRENRKFTYTGAANLHAGTNRIAL 543
Query: 550 LSLTVGLQNYGAFYEKTGAGITGPVQLKGSGNGTNIDLSSQQWTYQTGLKGEELNF--PS 607
LS+ VGL N G +E GI GPV L G G DLS Q+W+YQ GLKGE +N P+
Sbjct: 544 LSIAVGLPNVGLHFETWKTGILGPVLLHGIDQGKR-DLSWQKWSYQVGLKGEAMNLVSPN 602
Query: 608 G-SSTQWDSKSTLPK-LQPLVWYKTTFDAPAGSEPVAIDFTGMGKGEAWVNGQSIGRYWP 665
G S+ +W S + QPL WYK F+AP G EP+A+D MGKG+ W+NGQSIGRYW
Sbjct: 603 GVSAVEWVRGSLAAQGQQPLKWYKAYFNAPEGDEPLALDMRSMGKGQVWINGQSIGRYWM 662
Query: 666 TYVSQNGGCTDSCNYRGAYSSNKCLKNCGKPSQSLYHVPRSWLKSSGNTLVLFEEIGGDP 725
Y G C + C+Y G Y KC CG P+Q YHVPRSWLK + N L++FEE+GGD
Sbjct: 663 AYA--KGDC-NVCSYSGTYRPPKCQHGCGHPTQRWYHVPRSWLKPTQNLLIIFEELGGDA 719
Query: 726 TKISFVTKQLGSSLCSHVTDSHPLPVDMWGSDSKIQRKP--GPVLSLECPNPNQVISSIK 783
+KI+ + + + S+C+ + HP ++ W ++S + + + L+C P Q IS+I
Sbjct: 720 SKIALMKRAM-KSVCADANEHHPT-LENWHTESPSESEELHZASVHLQCA-PGQSISTIM 776
Query: 784 FASFGTPLGTCGSFSRGRCSSARSLSVVRQACVGSKSCSIGVSVNTFG-DPCKGVMKSLA 842
FASFGTP GTCGSF +G C + S +++ + C+G + CS+ +S + FG DPC V+K L+
Sbjct: 777 FASFGTPSGTCGSFQKGTCHAPNSQAILEKNCIGQEKCSVPISNSYFGADPCPNVLKRLS 836
Query: 843 VEASCT 848
VEA+C+
Sbjct: 837 VEAACS 842
>gi|225458151|ref|XP_002280715.1| PREDICTED: beta-galactosidase 3 [Vitis vinifera]
gi|302142564|emb|CBI19767.3| unnamed protein product [Vitis vinifera]
Length = 854
Score = 940 bits (2429), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 465/846 (54%), Positives = 580/846 (68%), Gaps = 32/846 (3%)
Query: 15 FVVLATTSFGANVTYDHRAVVIGGKRRVLISGSIHYPRSTPEMWPDLIQKSKDGGLDVIE 74
+ L + +VTYD +A+VI G+RR+LISGSIHYPRSTP+MW DLI+K+KDGGLDVI+
Sbjct: 17 LMFLHSQLIQCSVTYDKKAIVINGQRRILISGSIHYPRSTPDMWEDLIRKAKDGGLDVID 76
Query: 75 TYVFWNLHEPVRNQYNFEGRYDLVKFVKLVAEAGLYAHLRIGPYVCAEWNFGGFPLWLHF 134
TY+FWN+HEP YNFEGRYDLV+F+K V + GLY HLRIGPYVCAEWNFGGFP+WL F
Sbjct: 77 TYIFWNVHEPSPGNYNFEGRYDLVRFIKTVQKVGLYVHLRIGPYVCAEWNFGGFPVWLKF 136
Query: 135 IPGIQFRTDNEPFKAEMQRFTAKIVDMMKQEKLYASQGGPIILSQIENEYGNIDSAYGAA 194
+PGI FRT+NEPFK MQ FT KIV MMK E L+ASQGGPIILSQIENEYG GAA
Sbjct: 137 VPGISFRTNNEPFKMAMQGFTQKIVHMMKSENLFASQGGPIILSQIENEYGPESRELGAA 196
Query: 195 GKSYIKWAAGMALSLDTGVPWVMCQQSDAPDPIINTCNGFYCDQFTPNSNNKPKMWTENW 254
G +YI WAA MA+ LDTGVPWVMC++ DAPDP+IN CNGFYCD F+PN KP++WTE W
Sbjct: 197 GHAYINWAAKMAVGLDTGVPWVMCKEDDAPDPVINACNGFYCDAFSPNKPYKPRIWTEAW 256
Query: 255 SGWFLSFGGAVPYRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNFDRTSGGPFISTSYDY 314
SGWF FGG + RPV+DLAF VARF Q GG+F NYYMYHGGTNF R++GGPFI+TSYDY
Sbjct: 257 SGWFTEFGGTIHRRPVQDLAFGVARFIQNGGSFVNYYMYHGGTNFGRSAGGPFITTSYDY 316
Query: 315 DAPLDEYGLIRQPKWGHLKDLHKAIKLCEAALVATDPTYPSLGPNLEATVYKTGSGLCSA 374
DAP+DEYGLIRQPK+GHLK+LHKAIKLCE A+V+ DPT SLG +A V+ +G G C+A
Sbjct: 317 DAPIDEYGLIRQPKYGHLKELHKAIKLCEHAVVSADPTVISLGSYQQAHVFSSGRGNCAA 376
Query: 375 FLANIGTNSDVTVKFNGNSYLLPAWSVSILPDCKNVVFNTAKI----NSVTLVPSFSR-Q 429
FL+N S V FN Y LPAWS+SILPDC+ VVFNTA++ + + + P+ S+
Sbjct: 377 FLSNYNPKSSARVIFNNVHYDLPAWSISILPDCRTVVFNTARVGVQTSHMRMFPTNSKLH 436
Query: 430 SLQVAADSSDAIGSGWSYINEPVGISKDDAFTKPGLLEQINTTADQSDYLWYSLSTNIKA 489
S + + ++GS T GLLEQIN T D +DYLWY S NI +
Sbjct: 437 SWETYGEDISSLGS-------------SGTMTAGGLLEQINITRDSTDYLWYMTSVNIDS 483
Query: 490 DEPLLEDGSKTVLHVQSLGHALHAFINGKLVGSGYGSSSNAKVTVDFPIALAPGKNTFDL 549
E L G L VQS GHA+H FING+ GS YG+ N K T L G N L
Sbjct: 484 SESFLRRGQTPTLTVQSKGHAVHVFINGQYSGSAYGTRENRKFTYTGAANLHAGTNRIAL 543
Query: 550 LSLTVGLQNYGAFYEKTGAGITGPVQLKGSGNGTNIDLSSQQWTYQTGLKGEELNF--PS 607
LS+ VGL N G +E GI GPV L G G DLS Q+W+YQ GLKGE +N P+
Sbjct: 544 LSIAVGLPNVGLHFETWKTGILGPVLLHGIDQGKR-DLSWQKWSYQVGLKGEAMNLVSPN 602
Query: 608 G-SSTQWDSKSTLPK-LQPLVWYKTTFDAPAGSEPVAIDFTGMGKGEAWVNGQSIGRYWP 665
G S+ +W S + QPL WYK F+AP G EP+A+D MGKG+ W+NGQSIGRYW
Sbjct: 603 GVSAVEWVRGSLAAQGQQPLKWYKAYFNAPEGDEPLALDMRSMGKGQVWINGQSIGRYWM 662
Query: 666 TYVSQNGGCTDSCNYRGAYSSNKCLKNCGKPSQSLYHVPRSWLKSSGNTLVLFEEIGGDP 725
Y G C + C+Y G Y KC CG P+Q YHVPRSWLK + N L++FEE+GGD
Sbjct: 663 AYA--KGDC-NVCSYSGTYRPPKCQHGCGHPTQRWYHVPRSWLKPTQNLLIIFEELGGDA 719
Query: 726 TKISFVTKQLGSSLCSHVTDSHPLPVDMWGSDSKIQRKP--GPVLSLECPNPNQVISSIK 783
+KI+ + + + S+C+ + HP ++ W ++S + + + L+C P Q IS+I
Sbjct: 720 SKIALMKRAM-KSVCADANEHHPT-LENWHTESPSESEELHEASVHLQCA-PGQSISTIM 776
Query: 784 FASFGTPLGTCGSFSRGRCSSARSLSVVRQACVGSKSCSIGVSVNTFG-DPCKGVMKSLA 842
FASFGTP GTCGSF +G C + S +++ + C+G + CS+ +S + FG DPC V+K L+
Sbjct: 777 FASFGTPSGTCGSFQKGTCHAPNSQAILEKNCIGQEKCSVPISNSYFGADPCPNVLKRLS 836
Query: 843 VEASCT 848
VEA+C+
Sbjct: 837 VEAACS 842
>gi|115450935|ref|NP_001049068.1| Os03g0165400 [Oryza sativa Japonica Group]
gi|122247496|sp|Q10RB4.1|BGAL5_ORYSJ RecName: Full=Beta-galactosidase 5; Short=Lactase 5; Flags:
Precursor
gi|108706354|gb|ABF94149.1| Beta-galactosidase precursor, putative, expressed [Oryza sativa
Japonica Group]
gi|113547539|dbj|BAF10982.1| Os03g0165400 [Oryza sativa Japonica Group]
gi|215717073|dbj|BAG95436.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 841
Score = 939 bits (2427), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 464/829 (55%), Positives = 578/829 (69%), Gaps = 24/829 (2%)
Query: 27 VTYDHRAVVIGGKRRVLISGSIHYPRSTPEMWPDLIQKSKDGGLDVIETYVFWNLHEPVR 86
VTYD +AV++ G+RR+L SGSIHYPRSTPEMW LI+K+KDGGLDVI+TYVFWN HEP
Sbjct: 27 VTYDKKAVLVDGQRRILFSGSIHYPRSTPEMWDGLIEKAKDGGLDVIQTYVFWNGHEPTP 86
Query: 87 NQYNFEGRYDLVKFVKLVAEAGLYAHLRIGPYVCAEWNFGGFPLWLHFIPGIQFRTDNEP 146
YNFEGRYDLV+F+K V +AG++ HLRIGPY+C EWNFGGFP+WL ++PGI FRTDNEP
Sbjct: 87 GNYNFEGRYDLVRFIKTVQKAGMFVHLRIGPYICGEWNFGGFPVWLKYVPGISFRTDNEP 146
Query: 147 FKAEMQRFTAKIVDMMKQEKLYASQGGPIILSQIENEYGNIDSAYGAAGKSYIKWAAGMA 206
FK MQ FT KIV MMK E L+ASQGGPIILSQIENEYG +GAAGK+YI WAA MA
Sbjct: 147 FKNAMQGFTEKIVGMMKSENLFASQGGPIILSQIENEYGPEGKEFGAAGKAYINWAAKMA 206
Query: 207 LSLDTGVPWVMCQQSDAPDPIINTCNGFYCDQFTPNSNNKPKMWTENWSGWFLSFGGAVP 266
+ LDTGVPWVMC++ DAPDP+IN CNGFYCD F+PN KP MWTE WSGWF FGG +
Sbjct: 207 VGLDTGVPWVMCKEDDAPDPVINACNGFYCDTFSPNKPYKPTMWTEAWSGWFTEFGGTIR 266
Query: 267 YRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNFDRTSGGPFISTSYDYDAPLDEYGLIRQ 326
RPVEDLAF VARF Q+GG+F NYYMYHGGTNF RT+GGPFI+TSYDYDAPLDEYGL R+
Sbjct: 267 QRPVEDLAFGVARFVQKGGSFINYYMYHGGTNFGRTAGGPFITTSYDYDAPLDEYGLARE 326
Query: 327 PKWGHLKDLHKAIKLCEAALVATDPTYPSLGPNLEATVYKTGSGLCSAFLANIGTNSDVT 386
PK+GHLK+LH+A+KLCE LV+ DPT +LG EA V+++ SG C+AFLAN +NS
Sbjct: 327 PKFGHLKELHRAVKLCEQPLVSADPTVTTLGSMQEAHVFRSSSG-CAAFLANYNSNSYAK 385
Query: 387 VKFNGNSYLLPAWSVSILPDCKNVVFNTAKINSVTLVPSFSRQSLQVAADSSDAIGSGWS 446
V FN +Y LP WS+SILPDCKNVVFNTA + T +Q+ AD + ++ W
Sbjct: 386 VIFNNENYSLPPWSISILPDCKNVVFNTATVGVQT-------NQMQMWADGASSM--MWE 436
Query: 447 YINEPV-GISKDDAFTKPGLLEQINTTADQSDYLWYSLSTNIKADEPLLEDGSKTVLHVQ 505
+E V ++ T GLLEQ+N T D SDYLWY S + E L+ G+ L VQ
Sbjct: 437 KYDEEVDSLAAAPLLTSTGLLEQLNVTRDTSDYLWYITSVEVDPSEKFLQGGTPLSLTVQ 496
Query: 506 SLGHALHAFINGKLVGSGYGSSSNAKVTVDFPIALAPGKNTFDLLSLTVGLQNYGAFYEK 565
S GHALH FING+L GS YG+ + K++ L G N LLS+ GL N G YE
Sbjct: 497 SAGHALHVFINGQLQGSAYGTREDRKISYSGNANLRAGTNKVALLSVACGLPNVGVHYET 556
Query: 566 TGAGITGPVQLKGSGNGTNIDLSSQQWTYQTGLKGEELNFPS---GSSTQWDSKSTLPK- 621
G+ GPV + G G+ DL+ Q W+YQ GLKGE++N S S +W S + +
Sbjct: 557 WNTGVVGPVVIHGLDEGSR-DLTWQTWSYQVGLKGEQMNLNSLEGSGSVEWMQGSLVAQN 615
Query: 622 LQPLVWYKTTFDAPAGSEPVAIDFTGMGKGEAWVNGQSIGRYWPTYVSQNGGCTDSCNYR 681
QPL WY+ FD P+G EP+A+D MGKG+ W+NGQSIGRYW Y G C C+Y
Sbjct: 616 QQPLAWYRAYFDTPSGDEPLALDMGSMGKGQIWINGQSIGRYWTAYAE--GDC-KGCHYT 672
Query: 682 GAYSSNKCLKNCGKPSQSLYHVPRSWLKSSGNTLVLFEEIGGDPTKISFVTKQLGSSLCS 741
G+Y + KC CG+P+Q YHVPRSWL+ + N LV+FEE+GGD +KI+ + + S +C+
Sbjct: 673 GSYRAPKCQAGCGQPTQRWYHVPRSWLQPTRNLLVVFEELGGDSSKIALAKRTV-SGVCA 731
Query: 742 HVTDSHPLPVDMWGSDSKIQRK-PGPVLSLECPNPNQVISSIKFASFGTPLGTCGSFSRG 800
V++ HP + W +S + + + L+C P Q IS+IKFASFGTPLGTCG+F +G
Sbjct: 732 DVSEYHP-NIKNWQIESYGEPEFHTAKVHLKCA-PGQTISAIKFASFGTPLGTCGTFQQG 789
Query: 801 RCSSARSLSVVRQACVGSKSCSIGVSVNTF-GDPCKGVMKSLAVEASCT 848
C S S SV+ + C+G + C + +S + F GDPC VMK +AVEA C+
Sbjct: 790 ECHSINSNSVLEKKCIGLQRCVVAISPSNFGGDPCPEVMKRVAVEAVCS 838
>gi|215734965|dbj|BAG95687.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 919
Score = 939 bits (2426), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 450/829 (54%), Positives = 578/829 (69%), Gaps = 18/829 (2%)
Query: 25 ANVTYDHRAVVIGGKRRVLISGSIHYPRSTPEMWPDLIQKSKDGGLDVIETYVFWNLHEP 84
++VTYDHR+++I G+RR+LIS SIHYPRS PEMWP L+ ++KDGG D +ETYVFWN HEP
Sbjct: 104 SSVTYDHRSLIISGRRRLLISTSIHYPRSVPEMWPKLVAEAKDGGADCVETYVFWNGHEP 163
Query: 85 VRNQYNFEGRYDLVKFVKLVAEAGLYAHLRIGPYVCAEWNFGGFPLWLHFIPGIQFRTDN 144
+ QY FE R+DLV+F K+V +AGLY LRIGP+V AEW FGG P+WLH+ PG FRT+N
Sbjct: 164 AQGQYYFEERFDLVRFAKIVKDAGLYMILRIGPFVAAEWTFGGVPVWLHYAPGTVFRTNN 223
Query: 145 EPFKAEMQRFTAKIVDMMKQEKLYASQGGPIILSQIENEYGNIDSAYGAAGKSYIKWAAG 204
EPFK+ M+RFT IVDMMK+E+ +ASQGG IIL+Q+ENEYG+++ AYGA K Y WAA
Sbjct: 224 EPFKSHMKRFTTYIVDMMKKEQFFASQGGHIILAQVENEYGDMEQAYGAGAKPYAMWAAS 283
Query: 205 MALSLDTGVPWVMCQQSDAPDPIINTCNGFYCDQFTPNSNNKPKMWTENWSGWFLSFGGA 264
MAL+ +TGVPW+MCQQ DAPDP+INTCN FYCDQF PNS KPK WTENW GWF +FG +
Sbjct: 284 MALAQNTGVPWIMCQQYDAPDPVINTCNSFYCDQFKPNSPTKPKFWTENWPGWFQTFGES 343
Query: 265 VPYRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNFDRTSGGPFISTSYDYDAPLDEYGLI 324
P+RP ED+AF+VARFF +GG+ QNYY+YHGGTNF RT+GGPFI+TSYDYDAP+DEYGL
Sbjct: 344 NPHRPPEDVAFSVARFFGKGGSLQNYYVYHGGTNFGRTTGGPFITTSYDYDAPIDEYGLR 403
Query: 325 RQPKWGHLKDLHKAIKLCEAALVATDPTYPSLGPNLEATVYKTGSGLCSAFLANIGTNSD 384
R PKW HL+DLHK+IKL E L+ + ++ SLGP EA VY SG C AFL+N+ + D
Sbjct: 404 RLPKWAHLRDLHKSIKLGEHTLLYGNSSFVSLGPQQEADVYTDQSGGCVAFLSNVDSEKD 463
Query: 385 VTVKFNGNSYLLPAWSVSILPDCKNVVFNTAKINSVTLVPSFSRQSLQVAADSSDAIGSG 444
V F SY LPAWSVSILPDCKNV FNTAK+ S TL+ V A+ + G
Sbjct: 464 KVVTFQSRSYDLPAWSVSILPDCKNVAFNTAKVRSQTLMMDM------VPANLESSKVDG 517
Query: 445 WSYINEPVGISKDDAFTKPGLLEQINTTADQSDYLWYSLSTNIKADEPLLEDGSKTVLHV 504
WS E GI + + G ++ INTT D +DYLWY+ S ++ G VLH+
Sbjct: 518 WSIFREKYGIWGNIDLVRNGFVDHINTTKDSTDYLWYTTSFDVDGSHLA---GGNHVLHI 574
Query: 505 QSLGHALHAFINGKLVGSGYGSSSNAKVTVDFPIALAPGKNTFDLLSLTVGLQNYGAFYE 564
+S GHA+ AF+N +L+GS YG+ S + +V+ P+ L GKN LLS+TVGLQN G YE
Sbjct: 575 ESKGHAVQAFLNNELIGSAYGNGSKSNFSVEMPVNLRAGKNKLSLLSMTVGLQNGGPMYE 634
Query: 565 KTGAGITGPVQLKGSGNGTNIDLSSQQWTYQTGLKGEELNF---PSGSSTQWDSKSTLPK 621
GAGIT V++ G N IDLSS +W Y+ GL+GE + G +W +S PK
Sbjct: 635 WAGAGITS-VKISGMENRI-IDLSSNKWEYKIGLEGEYYSLFKADKGKDIRWMPQSEPPK 692
Query: 622 LQPLVWYKTTFDAPAGSEPVAIDFTGMGKGEAWVNGQSIGRYWPTYVSQNGGCTDSCNYR 681
QP+ WYK D P G +PV +D MGKG AW+NG +IGRYWP + CT SC+YR
Sbjct: 693 NQPMTWYKVNVDVPQGDDPVGLDMQSMGKGLAWLNGNAIGRYWPRISPVSDRCTSSCDYR 752
Query: 682 GAYSSNKCLKNCGKPSQSLYHVPRSWLKSSGNTLVLFEEIGGDPTKISFVTKQLGSSLCS 741
G +S NKC + CG+P+Q YHVPRSW SGNTLV+FEE GGDPTKI+F +++ +S+CS
Sbjct: 753 GTFSPNKCRRGCGQPTQRWYHVPRSWFHPSGNTLVIFEEKGGDPTKITF-SRRTVASVCS 811
Query: 742 HVTDSHP-LPVDMWGSDSKIQRKPGPVLSLECPNPNQVISSIKFASFGTPLGTCGSFSRG 800
V++ +P + ++ W +++ + + L CP + ISS+KF SFG P GTC S+ +G
Sbjct: 812 FVSEHYPSIDLESWDRNTQNDGRDAAKVQLSCPK-GKSISSVKFVSFGNPSGTCRSYQQG 870
Query: 801 RCSSARSLSVVRQACVGSKSCSIGVSVNTFG-DPCKGVMKSLAVEASCT 848
C S+SVV +AC+ C++ +S FG D C GV K+LA+EA C+
Sbjct: 871 SCHHPNSISVVEKACLNMNGCTVSLSDEGFGEDLCPGVTKTLAIEADCS 919
>gi|115441369|ref|NP_001044964.1| Os01g0875500 [Oryza sativa Japonica Group]
gi|75103778|sp|Q5N8X6.1|BGAL3_ORYSJ RecName: Full=Beta-galactosidase 3; Short=Lactase 3; Flags:
Precursor
gi|56784847|dbj|BAD82087.1| putative beta-galactosidase [Oryza sativa Japonica Group]
gi|113534495|dbj|BAF06878.1| Os01g0875500 [Oryza sativa Japonica Group]
gi|222619622|gb|EEE55754.1| hypothetical protein OsJ_04267 [Oryza sativa Japonica Group]
Length = 851
Score = 939 bits (2426), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 450/829 (54%), Positives = 578/829 (69%), Gaps = 18/829 (2%)
Query: 25 ANVTYDHRAVVIGGKRRVLISGSIHYPRSTPEMWPDLIQKSKDGGLDVIETYVFWNLHEP 84
++VTYDHR+++I G+RR+LIS SIHYPRS PEMWP L+ ++KDGG D +ETYVFWN HEP
Sbjct: 36 SSVTYDHRSLIISGRRRLLISTSIHYPRSVPEMWPKLVAEAKDGGADCVETYVFWNGHEP 95
Query: 85 VRNQYNFEGRYDLVKFVKLVAEAGLYAHLRIGPYVCAEWNFGGFPLWLHFIPGIQFRTDN 144
+ QY FE R+DLV+F K+V +AGLY LRIGP+V AEW FGG P+WLH+ PG FRT+N
Sbjct: 96 AQGQYYFEERFDLVRFAKIVKDAGLYMILRIGPFVAAEWTFGGVPVWLHYAPGTVFRTNN 155
Query: 145 EPFKAEMQRFTAKIVDMMKQEKLYASQGGPIILSQIENEYGNIDSAYGAAGKSYIKWAAG 204
EPFK+ M+RFT IVDMMK+E+ +ASQGG IIL+Q+ENEYG+++ AYGA K Y WAA
Sbjct: 156 EPFKSHMKRFTTYIVDMMKKEQFFASQGGHIILAQVENEYGDMEQAYGAGAKPYAMWAAS 215
Query: 205 MALSLDTGVPWVMCQQSDAPDPIINTCNGFYCDQFTPNSNNKPKMWTENWSGWFLSFGGA 264
MAL+ +TGVPW+MCQQ DAPDP+INTCN FYCDQF PNS KPK WTENW GWF +FG +
Sbjct: 216 MALAQNTGVPWIMCQQYDAPDPVINTCNSFYCDQFKPNSPTKPKFWTENWPGWFQTFGES 275
Query: 265 VPYRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNFDRTSGGPFISTSYDYDAPLDEYGLI 324
P+RP ED+AF+VARFF +GG+ QNYY+YHGGTNF RT+GGPFI+TSYDYDAP+DEYGL
Sbjct: 276 NPHRPPEDVAFSVARFFGKGGSLQNYYVYHGGTNFGRTTGGPFITTSYDYDAPIDEYGLR 335
Query: 325 RQPKWGHLKDLHKAIKLCEAALVATDPTYPSLGPNLEATVYKTGSGLCSAFLANIGTNSD 384
R PKW HL+DLHK+IKL E L+ + ++ SLGP EA VY SG C AFL+N+ + D
Sbjct: 336 RLPKWAHLRDLHKSIKLGEHTLLYGNSSFVSLGPQQEADVYTDQSGGCVAFLSNVDSEKD 395
Query: 385 VTVKFNGNSYLLPAWSVSILPDCKNVVFNTAKINSVTLVPSFSRQSLQVAADSSDAIGSG 444
V F SY LPAWSVSILPDCKNV FNTAK+ S TL+ V A+ + G
Sbjct: 396 KVVTFQSRSYDLPAWSVSILPDCKNVAFNTAKVRSQTLMMDM------VPANLESSKVDG 449
Query: 445 WSYINEPVGISKDDAFTKPGLLEQINTTADQSDYLWYSLSTNIKADEPLLEDGSKTVLHV 504
WS E GI + + G ++ INTT D +DYLWY+ S ++ G VLH+
Sbjct: 450 WSIFREKYGIWGNIDLVRNGFVDHINTTKDSTDYLWYTTSFDVDGSHLA---GGNHVLHI 506
Query: 505 QSLGHALHAFINGKLVGSGYGSSSNAKVTVDFPIALAPGKNTFDLLSLTVGLQNYGAFYE 564
+S GHA+ AF+N +L+GS YG+ S + +V+ P+ L GKN LLS+TVGLQN G YE
Sbjct: 507 ESKGHAVQAFLNNELIGSAYGNGSKSNFSVEMPVNLRAGKNKLSLLSMTVGLQNGGPMYE 566
Query: 565 KTGAGITGPVQLKGSGNGTNIDLSSQQWTYQTGLKGEELNF---PSGSSTQWDSKSTLPK 621
GAGIT V++ G N IDLSS +W Y+ GL+GE + G +W +S PK
Sbjct: 567 WAGAGITS-VKISGMENRI-IDLSSNKWEYKIGLEGEYYSLFKADKGKDIRWMPQSEPPK 624
Query: 622 LQPLVWYKTTFDAPAGSEPVAIDFTGMGKGEAWVNGQSIGRYWPTYVSQNGGCTDSCNYR 681
QP+ WYK D P G +PV +D MGKG AW+NG +IGRYWP + CT SC+YR
Sbjct: 625 NQPMTWYKVNVDVPQGDDPVGLDMQSMGKGLAWLNGNAIGRYWPRISPVSDRCTSSCDYR 684
Query: 682 GAYSSNKCLKNCGKPSQSLYHVPRSWLKSSGNTLVLFEEIGGDPTKISFVTKQLGSSLCS 741
G +S NKC + CG+P+Q YHVPRSW SGNTLV+FEE GGDPTKI+F +++ +S+CS
Sbjct: 685 GTFSPNKCRRGCGQPTQRWYHVPRSWFHPSGNTLVIFEEKGGDPTKITF-SRRTVASVCS 743
Query: 742 HVTDSHP-LPVDMWGSDSKIQRKPGPVLSLECPNPNQVISSIKFASFGTPLGTCGSFSRG 800
V++ +P + ++ W +++ + + L CP + ISS+KF SFG P GTC S+ +G
Sbjct: 744 FVSEHYPSIDLESWDRNTQNDGRDAAKVQLSCPK-GKSISSVKFVSFGNPSGTCRSYQQG 802
Query: 801 RCSSARSLSVVRQACVGSKSCSIGVSVNTFG-DPCKGVMKSLAVEASCT 848
C S+SVV +AC+ C++ +S FG D C GV K+LA+EA C+
Sbjct: 803 SCHHPNSISVVEKACLNMNGCTVSLSDEGFGEDLCPGVTKTLAIEADCS 851
>gi|227053553|gb|ACP18875.1| beta-galactosidase pBG(a) [Carica papaya]
Length = 836
Score = 938 bits (2425), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 471/839 (56%), Positives = 580/839 (69%), Gaps = 36/839 (4%)
Query: 25 ANVTYDHRAVVIGGKRRVLISGSIHYPRSTPEMWPDLIQKSKDGGLDVIETYVFWNLHEP 84
A+V+YDH+A+ I GKRR+L+SGSIHYPRSTPEMWPDLIQK+K+GGLDVI+TYVFWN HEP
Sbjct: 19 ASVSYDHKAITINGKRRILLSGSIHYPRSTPEMWPDLIQKAKEGGLDVIQTYVFWNGHEP 78
Query: 85 VRNQYNFEGRYDLVKFVKLVAEAGLYAHLRIGPYVCAEWNFGGFPLWLHFIPGIQFRTDN 144
+Y F G YDLV+F+KLV +AGLY HLRIGPYVCAEWNFGGFP+WL +IPGI FRT+N
Sbjct: 79 SPGKYYFGGNYDLVRFIKLVKQAGLYVHLRIGPYVCAEWNFGGFPVWLKYIPGIAFRTNN 138
Query: 145 EPFKAEMQRFTAKIVDMMKQEKLYASQGGPIILSQIENEYGNIDSAYGAAGKSYIKWAAG 204
PFKA MQRFT KIVDMMK E L+ SQGGPIILSQIENEYG ++ GAAG++Y +WAA
Sbjct: 139 GPFKAYMQRFTKKIVDMMKAEGLFESQGGPIILSQIENEYGPMEYELGAAGRAYSQWAAQ 198
Query: 205 MALSLDTGVPWVMCQQSDAPDPIINTCNGFYCDQFTPNSNNKPKMWTENWSGWFLSFGGA 264
MA+ L TGVPWVMC+Q DAPDPIIN+CNGFYCD F+PN KPKMWTE W+GWF FGGA
Sbjct: 199 MAVGLGTGVPWVMCKQDDAPDPIINSCNGFYCDYFSPNKAYKPKMWTEAWTGWFTEFGGA 258
Query: 265 VPYRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNFDRTSGGPFISTSYDYDAPLDEYGLI 324
VPYRPVEDLAF+VARF Q+GG+F NYYMYHGGTNF RT+GGPFI+TSYDYDAPLDEYGL+
Sbjct: 259 VPYRPVEDLAFSVARFIQKGGSFINYYMYHGGTNFGRTAGGPFIATSYDYDAPLDEYGLV 318
Query: 325 RQPKWGHLKDLHKAIKLCEAALVATDPTYPSLGPNLEATVYKTGSGLCSAFLANIGTNSD 384
RQPKWGHLKDLH+AIKLCE ALV+ DP+ LG EA V+K+ G C+AFLAN S
Sbjct: 319 RQPKWGHLKDLHRAIKLCEPALVSGDPSVMPLGRFQEAHVFKSKYGHCAAFLANYNPRSF 378
Query: 385 VTVKFNGNSYLLPAWSVSILPDCKNVVFNTAKINS----VTLVP-----SFSRQSLQVAA 435
V F Y LP WS+SILPDCKN V+NTA++ + + +VP +FS Q+ A
Sbjct: 379 AKVAFGNMHYNLPPWSISILPDCKNTVYNTARVGAQSARMKMVPVPIHGAFSWQAYNEEA 438
Query: 436 DSSDAIGSGWSYINEPVGISKDDAFTKPGLLEQINTTADQSDYLWYSLSTNIKADEPLLE 495
SS+ + +FT GL+EQINTT D SDYLWYS I DE L+
Sbjct: 439 PSSNG----------------ERSFTTVGLVEQINTTRDVSDYLWYSTDVKIDPDEGFLK 482
Query: 496 DGSKTVLHVQSLGHALHAFINGKLVGSGYGSSSNAKVTVDFPIALAPGKNTFDLLSLTVG 555
G L V S GHALH F+N +L G+ YGS K+T + L G N +LS+ VG
Sbjct: 483 TGKYPTLTVLSAGHALHVFVNDQLSGTAYGSLEFPKITFSKGVNLRAGINKISILSIAVG 542
Query: 556 LQNYGAFYEKTGAGITGPVQLKGSGNGTNIDLSSQQWTYQTGLKGEEL---NFPSGSSTQ 612
L N G +E AG+ GPV L G G DLS Q+W+Y+ G++GE + + SS +
Sbjct: 543 LPNVGPHFETWNAGVLGPVTLNGLNEGRR-DLSWQKWSYKVGVEGEAMSLHSLSGSSSVE 601
Query: 613 WDSKSTLPKLQPLVWYKTTFDAPAGSEPVAIDFTGMGKGEAWVNGQSIGRYWPTYVSQNG 672
W + S + + QPL W+KTTF+APAG+ P+A+D MGKG+ W+NG+SIGR+WP Y +
Sbjct: 602 WTAGSFVARRQPLTWFKTTFNAPAGNSPLALDMNSMGKGQIWINGKSIGRHWPAY--KAS 659
Query: 673 GCTDSCNYRGAYSSNKCLKNCGKPSQSLYHVPRSWLKSSGNTLVLFEEIGGDPTKISFVT 732
G C+Y G ++ KCL NCG+ SQ YHVPRSW +GN LV+FEE GGDP IS V
Sbjct: 660 GSCGWCDYAGTFNEKKCLSNCGEASQRWYHVPRSWPNPTGNLLVVFEEWGGDPNGISLVR 719
Query: 733 KQLGSSLCSHVTDSHPLPVD-MWGSDSKIQRKPGPVLSLECPNPNQVISSIKFASFGTPL 791
+++ S+C+ + + P ++ + K+ + P L+C P Q ISS+KFASFGTP
Sbjct: 720 REV-DSVCADIYEWQPTLMNYQMQASGKVNKPLRPKAHLQC-GPGQKISSVKFASFGTPE 777
Query: 792 GTCGSFSRGRCSSARSLSVVRQACVGSKSCSIG-VSVNTFGD-PCKGVMKSLAVEASCT 848
G CGS+ G C + S + CVG CS+ V N G+ P VMK LAVE C+
Sbjct: 778 GACGSYREGSCHAHHSYDAFERLCVGQNWCSVTVVPRNVSGEIPAPSVMKKLAVEVVCS 836
>gi|115437888|ref|NP_001043405.1| Os01g0580200 [Oryza sativa Japonica Group]
gi|75272679|sp|Q8W0A1.1|BGAL2_ORYSJ RecName: Full=Beta-galactosidase 2; Short=Lactase 2; Flags:
Precursor
gi|18461259|dbj|BAB84455.1| putative beta-galactosidase [Oryza sativa Japonica Group]
gi|113532936|dbj|BAF05319.1| Os01g0580200 [Oryza sativa Japonica Group]
gi|215736924|dbj|BAG95853.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 827
Score = 937 bits (2423), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 465/824 (56%), Positives = 573/824 (69%), Gaps = 28/824 (3%)
Query: 28 TYDHRAVVIGGKRRVLISGSIHYPRSTPEMWPDLIQKSKDGGLDVIETYVFWNLHEPVRN 87
TYD +AVV+ G+RR+LISGSIHYPRSTPEMWPDLI+K+KDGGLDV++TYVFWN HEP
Sbjct: 27 TYDRKAVVVNGQRRILISGSIHYPRSTPEMWPDLIEKAKDGGLDVVQTYVFWNGHEPSPG 86
Query: 88 QYNFEGRYDLVKFVKLVAEAGLYAHLRIGPYVCAEWNFGGFPLWLHFIPGIQFRTDNEPF 147
QY FEGRYDLV F+KLV +AGLY +LRIGPYVCAEWNFGGFP+WL ++PGI FRTDNEPF
Sbjct: 87 QYYFEGRYDLVHFIKLVKQAGLYVNLRIGPYVCAEWNFGGFPVWLKYVPGISFRTDNEPF 146
Query: 148 KAEMQRFTAKIVDMMKQEKLYASQGGPIILSQIENEYGNIDSAYGAAGKSYIKWAAGMAL 207
KAEMQ+FT KIV+MMK E L+ QGGPIILSQIENE+G ++ G K+Y WAA MA+
Sbjct: 147 KAEMQKFTTKIVEMMKSEGLFEWQGGPIILSQIENEFGPLEWDQGEPAKAYASWAANMAV 206
Query: 208 SLDTGVPWVMCQQSDAPDPIINTCNGFYCDQFTPNSNNKPKMWTENWSGWFLSFGGAVPY 267
+L+T VPW+MC++ DAPDPIINTCNGFYCD F+PN +KP MWTE W+ W+ FG VP+
Sbjct: 207 ALNTSVPWIMCKEDDAPDPIINTCNGFYCDWFSPNKPHKPTMWTEAWTAWYTGFGIPVPH 266
Query: 268 RPVEDLAFAVARFFQRGGTFQNYYMYHGGTNFDRTSGGPFISTSYDYDAPLDEYGLIRQP 327
RPVEDLA+ VA+F Q+GG+F NYYMYHGGTNF RT+GGPFI+TSYDYDAP+DEYGL+R+P
Sbjct: 267 RPVEDLAYGVAKFIQKGGSFVNYYMYHGGTNFGRTAGGPFIATSYDYDAPIDEYGLLREP 326
Query: 328 KWGHLKDLHKAIKLCEAALVATDPTYPSLGPNLEATVYKTGSGLCSAFLANIGTNSDVTV 387
KWGHLK LHKAIKLCE ALVA DP SLG +++V+++ +G C+AFL N S V
Sbjct: 327 KWGHLKQLHKAIKLCEPALVAGDPIVTSLGNAQKSSVFRSSTGACAAFLENKDKVSYARV 386
Query: 388 KFNGNSYLLPAWSVSILPDCKNVVFNTAKINSVTLVPSFSRQSLQVAADSSDAIGSGWSY 447
FNG Y LP WS+SILPDCK VFNTA++ S S+ ++ A G W
Sbjct: 387 AFNGMHYDLPPWSISILPDCKTTVFNTARVGS-----QISQMKMEWAG------GFAWQS 435
Query: 448 INEPVGISKDDAFTKPGLLEQINTTADQSDYLWYSLSTNIKADEPLLEDGSKTVLHVQSL 507
NE + +D T GLLEQIN T D +DYLWY+ ++ DE L +G L V S
Sbjct: 436 YNEEINSFGEDPLTTVGLLEQINVTRDNTDYLWYTTYVDVAQDEQFLSNGENLKLTVMSA 495
Query: 508 GHALHAFINGKLVGSGYGSSSNAKVTVDFPIALAPGKNTFDLLSLTVGLQNYGAFYEKTG 567
GHALH FING+L G+ YGS + K+T + L G NT LS+ VGL N G +E
Sbjct: 496 GHALHIFINGQLKGTVYGSVDDPKLTYTGNVKLWAGSNTISCLSIAVGLPNVGEHFETWN 555
Query: 568 AGITGPVQLKGSGNGTNIDLSSQQWTYQTGLKGEELNFP--SGSST-QWDSKSTLPKLQP 624
AGI GPV L G G DL+ Q+WTYQ GLKGE ++ SGSST +W + QP
Sbjct: 556 AGILGPVTLDGLNEGRR-DLTWQKWTYQVGLKGESMSLHSLSGSSTVEWGEPV---QKQP 611
Query: 625 LVWYKTTFDAPAGSEPVAIDFTGMGKGEAWVNGQSIGRYWPTYVSQNGGCTDSCNYRGAY 684
L WYK F+AP G EP+A+D + MGKG+ W+NGQ IGRYWP Y + +G C +C+YRG Y
Sbjct: 612 LTWYKAFFNAPDGDEPLALDMSSMGKGQIWINGQGIGRYWPGYKA-SGNC-GTCDYRGEY 669
Query: 685 SSNKCLKNCGKPSQSLYHVPRSWLKSSGNTLVLFEEIGGDPTKISFVTKQLGSSLCSHVT 744
KC NCG SQ YHVPRSWL +GN LV+FEE GGDPT IS V + +G S+C+ V+
Sbjct: 670 DETKCQTNCGDSSQRWYHVPRSWLSPTGNLLVIFEEWGGDPTGISMVKRSIG-SVCADVS 728
Query: 745 DSHPLPVDMWGSDSKIQRKPGPVLSLECPNPNQVISSIKFASFGTPLGTCGSFSRGRCSS 804
+ P + W + + K + L+C N Q I+ IKFASFGTP G+CGS++ G C +
Sbjct: 729 EWQP-SMKNWHTKDYEKAK----VHLQCDN-GQKITEIKFASFGTPQGSCGSYTEGGCHA 782
Query: 805 ARSLSVVRQACVGSKSCSIGVSVNTF-GDPCKGVMKSLAVEASC 847
+S + + CVG + C + V F GDPC G MK VEA C
Sbjct: 783 HKSYDIFWKNCVGQERCGVSVVPEIFGGDPCPGTMKRAVVEAIC 826
>gi|414879448|tpg|DAA56579.1| TPA: beta-galactosidase isoform 1 [Zea mays]
gi|414879449|tpg|DAA56580.1| TPA: beta-galactosidase isoform 2 [Zea mays]
Length = 844
Score = 937 bits (2423), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 461/831 (55%), Positives = 580/831 (69%), Gaps = 18/831 (2%)
Query: 24 GANVTYDHRAVVIGGKRRVLISGSIHYPRSTPEMWPDLIQKSKDGGLDVIETYVFWNLHE 83
+NVTYDHR+++I G+RR++IS SIHYPRS PEMWP L+ ++KDGG D IETYVFWN HE
Sbjct: 26 ASNVTYDHRSLIISGRRRLVISTSIHYPRSVPEMWPKLVAEAKDGGADCIETYVFWNGHE 85
Query: 84 PVRNQYNFEGRYDLVKFVKLVAEAGLYAHLRIGPYVCAEWNFGGFPLWLHFIPGIQFRTD 143
QY FE R+DLV+FVK+V +AGL LRIGPYV AEWN+GG P+WLH++PG FRT+
Sbjct: 86 IAPGQYYFEDRFDLVRFVKVVRDAGLLLILRIGPYVAAEWNYGGVPVWLHYVPGTVFRTN 145
Query: 144 NEPFKAEMQRFTAKIVDMMKQEKLYASQGGPIILSQIENEYGNI-DSAYGAAGKSYIKWA 202
NEPFK ++ FT IVDMMK+E+L+ASQGG IIL+QIENEYG+ + AYGA GK Y WA
Sbjct: 146 NEPFKNHVKSFTTYIVDMMKKEQLFASQGGNIILAQIENEYGDYYEQAYGAGGKPYAMWA 205
Query: 203 AGMALSLDTGVPWVMCQQSDAPDPIINTCNGFYCDQFTPNSNNKPKMWTENWSGWFLSFG 262
A MAL+ +TGVPW+MCQ+SDAPDP+IN+CNGFYCD F PNS KPK+WTENW GWF +FG
Sbjct: 206 ASMALAQNTGVPWIMCQESDAPDPVINSCNGFYCDGFQPNSPTKPKIWTENWPGWFQTFG 265
Query: 263 GAVPYRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNFDRTSGGPFISTSYDYDAPLDEYG 322
+ P+RP ED+AFAVARFF++GG+ QNYY+YHGGTNF RT+GGPFI+TSYDYDAP+DEYG
Sbjct: 266 ESNPHRPPEDVAFAVARFFEKGGSVQNYYVYHGGTNFGRTTGGPFITTSYDYDAPIDEYG 325
Query: 323 LIRQPKWGHLKDLHKAIKLCEAALVATDPTYPSLGPNLEATVYKTGSGLCSAFLANIGTN 382
L R PKW HL+DLHK+I+LCE L+ + T+ SLGP EA +Y SG C AFLANI +
Sbjct: 326 LRRFPKWAHLRDLHKSIRLCEHTLLYGNTTFLSLGPKQEADIYSDQSGGCVAFLANIDSA 385
Query: 383 SDVTVKFNGNSYLLPAWSVSILPDCKNVVFNTAKINSVTLVPSFSRQSLQVAADSSDAIG 442
+D V F Y LPAWSVSILPDC+NVVFNTAK+ S T + + +SLQ +
Sbjct: 386 NDKVVTFRNRQYDLPAWSVSILPDCRNVVFNTAKVQSQTSMVTMVPESLQASKPER---- 441
Query: 443 SGWSYINEPVGISKDDAFTKPGLLEQINTTADQSDYLWYSLSTNIKADEPLLEDGSKTVL 502
WS E GI + F + G ++ INTT D +DYLWY +T+ D GS VL
Sbjct: 442 --WSIFRERTGIWGKNDFVRNGFVDHINTTKDSTDYLWY--TTSFSVDGSYSSKGSHAVL 497
Query: 503 HVQSLGHALHAFINGKLVGSGYGSSSNAKVTVDFPIALAPGKNTFDLLSLTVGLQNYGAF 562
++ S GH +HAF+N L+GS YG+ S ++ +V PI L GKN LLS+TVGLQN G
Sbjct: 498 NIDSNGHGVHAFLNNVLIGSAYGNGSQSRFSVKLPINLRTGKNELALLSMTVGLQNAGFA 557
Query: 563 YEKTGAGITGPVQLKGSGNGTNIDLSSQQWTYQTGLKGEELNF--PSGSSTQ-WDSKSTL 619
YE GAG T V + G GT IDLSS W Y+ GL+GE N P ++ Q W +S
Sbjct: 558 YEWIGAGFTN-VNISGVRTGT-IDLSSNNWAYKIGLEGEYYNLFKPDQTNNQRWIPQSEP 615
Query: 620 PKLQPLVWYKTTFDAPAGSEPVAIDFTGMGKGEAWVNGQSIGRYWPTYVSQNGGCTDSCN 679
PK QPL WYK D P G +PV ID MGKG AW+NG +IGRYWP S N CT SCN
Sbjct: 616 PKNQPLTWYKVNVDVPQGDDPVGIDMQSMGKGLAWLNGNAIGRYWPRTSSINDRCTPSCN 675
Query: 680 YRGAYSSNKCLKNCGKPSQSLYHVPRSWLKSSGNTLVLFEEIGGDPTKISFVTKQLGSSL 739
YRG + +KC CG+P+Q YH+PRSW SGN LV+FEE GGDPTKI+F +++ +S+
Sbjct: 676 YRGTFIPDKCRTGCGQPTQRWYHIPRSWFHPSGNILVVFEEKGGDPTKITF-SRRAVTSV 734
Query: 740 CSHVTDSHP-LPVDMWGSDSKIQRKPGPVLSLECPNPNQVISSIKFASFGTPLGTCGSFS 798
CS V++ P + ++ W + + P L CP + ISS+KFAS G P GTC S+
Sbjct: 735 CSFVSEHFPSIDLESWDESAMTEGTPPAKAQLFCPE-GKSISSVKFASLGNPSGTCRSYQ 793
Query: 799 RGRCSSARSLSVVRQACVGSKSCSIGVSVNTFG-DPCKGVMKSLAVEASCT 848
GRC SLSVV +AC+ + SC++ ++ +FG D C GV K+LA+EA C+
Sbjct: 794 MGRCHHPNSLSVVEKACLNTNSCTVSLTDESFGKDLCPGVTKTLAIEADCS 844
>gi|218189464|gb|EEC71891.1| hypothetical protein OsI_04635 [Oryza sativa Indica Group]
Length = 851
Score = 937 bits (2422), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 450/829 (54%), Positives = 578/829 (69%), Gaps = 18/829 (2%)
Query: 25 ANVTYDHRAVVIGGKRRVLISGSIHYPRSTPEMWPDLIQKSKDGGLDVIETYVFWNLHEP 84
++VTYD R+++I G+RR+LIS SIHYPRS PEMWP L+ ++KDGG D +ETYVFWN HEP
Sbjct: 36 SSVTYDQRSLIISGRRRLLISTSIHYPRSVPEMWPKLVAEAKDGGADCVETYVFWNGHEP 95
Query: 85 VRNQYNFEGRYDLVKFVKLVAEAGLYAHLRIGPYVCAEWNFGGFPLWLHFIPGIQFRTDN 144
+ QY FE R+DLV+F K+V +AGLY LRIGP+V AEW FGG P+WLH+ PG FRT+N
Sbjct: 96 AQGQYYFEERFDLVRFAKIVKDAGLYMILRIGPFVAAEWTFGGVPVWLHYAPGTVFRTNN 155
Query: 145 EPFKAEMQRFTAKIVDMMKQEKLYASQGGPIILSQIENEYGNIDSAYGAAGKSYIKWAAG 204
EPFK+ M+RFT IVDMMK+E+ +ASQGG IIL+Q+ENEYG+++ AYGA K Y WAA
Sbjct: 156 EPFKSHMKRFTTYIVDMMKKEQFFASQGGHIILAQVENEYGDMEQAYGAGAKPYAMWAAS 215
Query: 205 MALSLDTGVPWVMCQQSDAPDPIINTCNGFYCDQFTPNSNNKPKMWTENWSGWFLSFGGA 264
MAL+ +TGVPW+MCQQ DAPDP+INTCN FYCDQF PNS KPK WTENW GWF +FG +
Sbjct: 216 MALAQNTGVPWIMCQQYDAPDPVINTCNSFYCDQFKPNSPTKPKFWTENWPGWFQTFGES 275
Query: 265 VPYRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNFDRTSGGPFISTSYDYDAPLDEYGLI 324
P+RP ED+AF+VARFF +GG+ QNYY+YHGGTNF RT+GGPFI+TSYDYDAP+DEYGL
Sbjct: 276 NPHRPPEDVAFSVARFFGKGGSLQNYYVYHGGTNFGRTTGGPFITTSYDYDAPIDEYGLR 335
Query: 325 RQPKWGHLKDLHKAIKLCEAALVATDPTYPSLGPNLEATVYKTGSGLCSAFLANIGTNSD 384
R PKW HL+DLHK+IKL E L+ + ++ SLGP EA VY SG C AFL+N+ + D
Sbjct: 336 RLPKWAHLRDLHKSIKLGEHTLLYGNSSFVSLGPQQEADVYTDQSGGCVAFLSNVDSEKD 395
Query: 385 VTVKFNGNSYLLPAWSVSILPDCKNVVFNTAKINSVTLVPSFSRQSLQVAADSSDAIGSG 444
V F SY LPAWSVSILPDCKNV FNTAK+ S TL+ V A+ + G
Sbjct: 396 KVVTFQSRSYDLPAWSVSILPDCKNVAFNTAKVRSQTLMMDM------VPANLESSKVDG 449
Query: 445 WSYINEPVGISKDDAFTKPGLLEQINTTADQSDYLWYSLSTNIKADEPLLEDGSKTVLHV 504
WS E GI + + G ++ INTT D +DYLWY+ S ++ G VLH+
Sbjct: 450 WSIFREKYGIWGNIDLVRNGFVDHINTTKDSTDYLWYTTSFDVDGSHLA---GGNHVLHI 506
Query: 505 QSLGHALHAFINGKLVGSGYGSSSNAKVTVDFPIALAPGKNTFDLLSLTVGLQNYGAFYE 564
+S GHA+ AF+N +L+GS YG+ S + +V+ P+ L GKN LLS+TVGLQN G YE
Sbjct: 507 ESKGHAVQAFLNNELIGSAYGNGSKSNFSVEMPVNLRAGKNKLSLLSMTVGLQNGGPMYE 566
Query: 565 KTGAGITGPVQLKGSGNGTNIDLSSQQWTYQTGLKGEELNF---PSGSSTQWDSKSTLPK 621
GAGIT V++ G N IDLSS +W Y+ GL+GE + G +W +S PK
Sbjct: 567 WAGAGITS-VKISGMENRI-IDLSSNKWEYKIGLEGEYYSLFKADKGKDIRWMPQSEPPK 624
Query: 622 LQPLVWYKTTFDAPAGSEPVAIDFTGMGKGEAWVNGQSIGRYWPTYVSQNGGCTDSCNYR 681
QP+ WYK D P G +PV +D MGKG AW+NG +IGRYWP + CT SC+YR
Sbjct: 625 NQPMTWYKVNVDVPQGDDPVGLDMQSMGKGLAWLNGNAIGRYWPRISPVSDRCTSSCDYR 684
Query: 682 GAYSSNKCLKNCGKPSQSLYHVPRSWLKSSGNTLVLFEEIGGDPTKISFVTKQLGSSLCS 741
G +S NKC + CG+P+Q YHVPRSW SGNTLV+FEE GGDPTKI+F +++ +S+CS
Sbjct: 685 GTFSPNKCRRGCGQPTQRWYHVPRSWFHPSGNTLVIFEEKGGDPTKITF-SRRTVASVCS 743
Query: 742 HVTDSHP-LPVDMWGSDSKIQRKPGPVLSLECPNPNQVISSIKFASFGTPLGTCGSFSRG 800
V++ +P + ++ W +++ + + L CP + ISS+KFASFG P GTC S+ +G
Sbjct: 744 FVSEHYPSIDLESWDRNTQNDGRDAAKVQLSCPK-GKSISSVKFASFGNPSGTCRSYQQG 802
Query: 801 RCSSARSLSVVRQACVGSKSCSIGVSVNTFG-DPCKGVMKSLAVEASCT 848
C S+SVV +AC+ C++ +S FG D C GV K+LA+EA C+
Sbjct: 803 SCHHPNSISVVEKACLNMNGCTLSLSDEGFGEDLCPGVTKTLAIEADCS 851
>gi|61162206|dbj|BAD91084.1| beta-D-galactosidase [Pyrus pyrifolia]
Length = 852
Score = 934 bits (2414), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 475/857 (55%), Positives = 591/857 (68%), Gaps = 37/857 (4%)
Query: 5 EILLLVLCWGFVVLATTSFGANVTYDHRAVVIGGKRRVLISGSIHYPRSTPEMWPDLIQK 64
+IL+L L + + VTYD +A++I G+RR+LISGSIHYPRSTPEMW LIQK
Sbjct: 8 KILVLFLTMTLFMASELIHCTTVTYDKKAILINGQRRLLISGSIHYPRSTPEMWEGLIQK 67
Query: 65 SKDGGLDVIETYVFWNLHEPVRNQYNFEGRYDLVKFVKLVAEAGLYAHLRIGPYVCAEWN 124
+KDGGLDVI+TYVFWN HEP Y FEGRYDLV+F+K V +AGL+ HLRIGPYVCAEWN
Sbjct: 68 AKDGGLDVIDTYVFWNGHEPSPGNYYFEGRYDLVRFIKTVQKAGLFLHLRIGPYVCAEWN 127
Query: 125 FGGFPLWLHFIPGIQFRTDNEPFKAEMQRFTAKIVDMMKQEKLYASQGGPIILSQIENEY 184
FGGFP+WL ++PGI FRTDN PFK MQ FT KIV MMK EKL+ASQGGPIILSQIENEY
Sbjct: 128 FGGFPVWLKYVPGISFRTDNGPFKVAMQGFTQKIVQMMKNEKLFASQGGPIILSQIENEY 187
Query: 185 GNIDSAYGAAGKSYIKWAAGMALSLDTGVPWVMCQQSDAPDPIINTCNGFYCDQFTPNSN 244
G A GA G++YI WAA MA+ LDTGVPWVMC++ DAPDP+IN CNGFYCD FTPN
Sbjct: 188 GPERKALGAPGQNYINWAAKMAVGLDTGVPWVMCKEDDAPDPMINACNGFYCDGFTPNKP 247
Query: 245 NKPKMWTENWSGWFLSFGGAVPYRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNFDRTSG 304
KP MWTE WSGWFL FGG + +RPV+DLAFAVARF QRGG++ NYYMYHGGTNF RT+G
Sbjct: 248 YKPTMWTEAWSGWFLEFGGTIHHRPVQDLAFAVARFIQRGGSYVNYYMYHGGTNFGRTAG 307
Query: 305 GPFISTSYDYDAPLDEYGLIRQPKWGHLKDLHKAIKLCEAALVATDPTYPSLGPNLEATV 364
GPFI+TSYDYDAP+DEYGLIRQPK+GHLK+LHKAIKLCE +L++++PT SLG +A V
Sbjct: 308 GPFITTSYDYDAPIDEYGLIRQPKYGHLKELHKAIKLCEHSLLSSEPTVTSLGTYHQAYV 367
Query: 365 YKTGSGLCSAFLANIGTNSDVTVKFNGNSYLLPAWSVSILPDCKNVVFNTAKI----NSV 420
+ +G C+AFL+N + + V FN Y LP WSVSILPDC+N V+NTAK+ + V
Sbjct: 368 FNSGPRRCAAFLSNFHS-VEARVTFNNKHYDLPPWSVSILPDCRNEVYNTAKVGVQTSHV 426
Query: 421 TLVPSFSRQSLQVAADSSDAIGSGWSYINEPVGISKDDAFTKPGLLEQINTTADQSDYLW 480
++P+ SR + S +Y + + + + GLLEQIN T D SDYLW
Sbjct: 427 QMIPTNSR------------LFSWQTYDEDISSVHERSSIPAIGLLEQINVTRDTSDYLW 474
Query: 481 YSLSTNIKADEPLLEDGSKTVLHVQSLGHALHAFINGKLVGSGYGSSSNAKVTVDFPIAL 540
Y + +I + + L G K L VQS GHALH F+NG+ GS +G+ + T P+ L
Sbjct: 475 YMTNVDISSSD--LSGGKKPTLTVQSAGHALHVFVNGQFSGSAFGTREQRQFTFADPVNL 532
Query: 541 APGKNTFDLLSLTVGLQNYGAFYEKTGAGITGPVQLKGSGNGTNIDLSSQQWTYQTGLKG 600
G N LLS+ VGL N G YE GI GPV L G GNG DL+ +W + GLKG
Sbjct: 533 HAGINRIALLSIAVGLPNVGLHYESWKTGIQGPVFLDGLGNGKK-DLTLHKWFNKVGLKG 591
Query: 601 EELNF--PSG-SSTQWDSKSTLPKL-QPLVWYKTTFDAPAGSEPVAIDFTGMGKGEAWVN 656
E +N P+G SS W +S + Q L WYK F+AP G+EP+A+D MGKG+ W+N
Sbjct: 592 EAMNLVSPNGASSVGWIRRSLATQTKQTLKWYKAYFNAPGGNEPLALDMRRMGKGQVWIN 651
Query: 657 GQSIGRYWPTYVSQNGGCTDSCNYRGAYSSNKCLKNCGKPSQSLYHVPRSWLKSSGNTLV 716
GQSIGRYW Y G C+ SC+Y G + KC +CG+P+Q YHVPRSWLK + N +V
Sbjct: 652 GQSIGRYWMAYA--KGDCS-SCSYIGTFRPTKCQLHCGRPTQRWYHVPRSWLKPTQNLVV 708
Query: 717 LFEEIGGDPTKISFVTKQLGSSLCSHVTDSHP----LPVDMWGSDSKIQRKPGPVLSLEC 772
+FEE+GGDP+KI+ V + + + +C + ++HP VD DSK + + L C
Sbjct: 709 VFEELGGDPSKITLVRRSV-AGVCGDLHENHPNAENFDVDG-NEDSKTLHQAQ--VHLHC 764
Query: 773 PNPNQVISSIKFASFGTPLGTCGSFSRGRCSSARSLSVVRQACVGSKSCSIGVSVNTF-G 831
P Q ISSIKFASFGTP GTCGSF +G C + S +VV + C+G +SCS+ VS +TF
Sbjct: 765 A-PGQSISSIKFASFGTPSGTCGSFQQGTCHATNSHAVVEKNCIGRESCSVAVSNSTFET 823
Query: 832 DPCKGVMKSLAVEASCT 848
DPC V+K L+VEA C+
Sbjct: 824 DPCPNVLKRLSVEAVCS 840
>gi|226494417|ref|NP_001151478.1| LOC100285111 precursor [Zea mays]
gi|195647054|gb|ACG42995.1| beta-galactosidase precursor [Zea mays]
Length = 844
Score = 934 bits (2413), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 459/831 (55%), Positives = 578/831 (69%), Gaps = 18/831 (2%)
Query: 24 GANVTYDHRAVVIGGKRRVLISGSIHYPRSTPEMWPDLIQKSKDGGLDVIETYVFWNLHE 83
+NVTYDHR+++I G+RR++IS SIHYPRS PEMWP L+ ++KDGG D IETYVFWN HE
Sbjct: 26 ASNVTYDHRSLIISGRRRLVISTSIHYPRSVPEMWPKLVAEAKDGGADCIETYVFWNGHE 85
Query: 84 PVRNQYNFEGRYDLVKFVKLVAEAGLYAHLRIGPYVCAEWNFGGFPLWLHFIPGIQFRTD 143
QY FE R+DLV+FVK+V +AGL LRIGPYV AEWN+GG P+WLH++PG FRT+
Sbjct: 86 IAPGQYYFEDRFDLVRFVKVVRDAGLLLILRIGPYVAAEWNYGGVPVWLHYVPGTVFRTN 145
Query: 144 NEPFKAEMQRFTAKIVDMMKQEKLYASQGGPIILSQIENEYGNI-DSAYGAAGKSYIKWA 202
NEPFK M+ FT IVDMMK+E+L+ASQGG IIL+QIENEYG+ + AYGA GK Y WA
Sbjct: 146 NEPFKNHMKSFTTYIVDMMKKEQLFASQGGNIILAQIENEYGDYYEQAYGAGGKPYAMWA 205
Query: 203 AGMALSLDTGVPWVMCQQSDAPDPIINTCNGFYCDQFTPNSNNKPKMWTENWSGWFLSFG 262
A MAL+ +TGVPW+MCQ+SDAPDP+IN+CNGFYCD F PNS KPK+WTENW GWF +FG
Sbjct: 206 ASMALAQNTGVPWIMCQESDAPDPVINSCNGFYCDGFQPNSPTKPKIWTENWPGWFQTFG 265
Query: 263 GAVPYRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNFDRTSGGPFISTSYDYDAPLDEYG 322
+ P+RP ED+AFAVARFF++GG+ QNYY+YHGGTNF RT+GGPFI+TSYDYDAP+DEYG
Sbjct: 266 ESNPHRPPEDVAFAVARFFEKGGSVQNYYVYHGGTNFGRTTGGPFITTSYDYDAPIDEYG 325
Query: 323 LIRQPKWGHLKDLHKAIKLCEAALVATDPTYPSLGPNLEATVYKTGSGLCSAFLANIGTN 382
L R PKW HL++LHK+I+LCE L+ + T+ SLGP EA +Y SG C AFLANI +
Sbjct: 326 LRRFPKWAHLRELHKSIRLCEHTLLYGNTTFLSLGPKQEADIYSDQSGGCVAFLANIDSA 385
Query: 383 SDVTVKFNGNSYLLPAWSVSILPDCKNVVFNTAKINSVTLVPSFSRQSLQVAADSSDAIG 442
+D V F Y LPAWSVSILPDC+NVVFNTAK+ S T + + +SLQ +
Sbjct: 386 NDKVVTFRNRQYDLPAWSVSILPDCRNVVFNTAKVQSQTSMVTMVPESLQASKPER---- 441
Query: 443 SGWSYINEPVGISKDDAFTKPGLLEQINTTADQSDYLWYSLSTNIKADEPLLEDGSKTVL 502
WS E GI + F + G ++ INTT D +DYLWY +T+ D GS VL
Sbjct: 442 --WSIFRERTGIWGKNDFVRNGFVDHINTTKDSTDYLWY--TTSFSVDGSYSSKGSHAVL 497
Query: 503 HVQSLGHALHAFINGKLVGSGYGSSSNAKVTVDFPIALAPGKNTFDLLSLTVGLQNYGAF 562
++ S GH +HAF+N L+GS YG+ S ++ +V I L GKN LLS+TVGLQN G
Sbjct: 498 NIDSNGHGVHAFLNNVLIGSAYGNGSQSRFSVKLTINLRTGKNELALLSMTVGLQNAGFA 557
Query: 563 YEKTGAGITGPVQLKGSGNGTNIDLSSQQWTYQTGLKGEELNF--PSGSSTQ-WDSKSTL 619
YE GAG T V + G G IDLSS W Y+ GL+GE N P ++ Q W +S
Sbjct: 558 YEWIGAGFTN-VNISGVRTGI-IDLSSNNWAYKIGLEGEYYNLFKPDQTNNQRWIPQSEP 615
Query: 620 PKLQPLVWYKTTFDAPAGSEPVAIDFTGMGKGEAWVNGQSIGRYWPTYVSQNGGCTDSCN 679
PK QPL WYK D P G +PV ID MGKG AW+NG +IGRYWP S N CT SCN
Sbjct: 616 PKNQPLTWYKVNVDVPQGDDPVGIDMQSMGKGLAWLNGNAIGRYWPRTSSINDRCTPSCN 675
Query: 680 YRGAYSSNKCLKNCGKPSQSLYHVPRSWLKSSGNTLVLFEEIGGDPTKISFVTKQLGSSL 739
YRG + +KC CG+P+Q YH+PRSW SGN LV+FEE GGDPTKI+F +++ +S+
Sbjct: 676 YRGTFIPDKCRTGCGQPTQRWYHIPRSWFHPSGNILVVFEEKGGDPTKITF-SRRAVTSV 734
Query: 740 CSHVTDSHP-LPVDMWGSDSKIQRKPGPVLSLECPNPNQVISSIKFASFGTPLGTCGSFS 798
CS V++ P + ++ W + + P L CP + ISS+KFAS G P GTC S+
Sbjct: 735 CSFVSEHFPSIDLESWDESAMNEGTPPAKAQLSCPE-GKSISSVKFASLGNPSGTCRSYQ 793
Query: 799 RGRCSSARSLSVVRQACVGSKSCSIGVSVNTFG-DPCKGVMKSLAVEASCT 848
GRC SLSVV +AC+ + SC++ ++ +FG D C GV K+LA+EA C+
Sbjct: 794 MGRCHHPNSLSVVEKACLNTNSCTVSLTDESFGKDLCHGVTKTLAIEADCS 844
>gi|350537729|ref|NP_001234307.1| beta-galactosidase, chloroplastic precursor [Solanum lycopersicum]
gi|7939621|gb|AAF70823.1|AF154422_1 beta-galactosidase [Solanum lycopersicum]
Length = 870
Score = 934 bits (2413), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 456/837 (54%), Positives = 570/837 (68%), Gaps = 12/837 (1%)
Query: 20 TTSFGAN-VTYDHRAVVIGGKRRVLISGSIHYPRSTPEMWPDLIQKSKDGGLDVIETYVF 78
T+ G + VTYD R+++I G+R++LIS SIHYPRS P MWP L++ +K+GG+DVIETYVF
Sbjct: 38 VTTIGTDSVTYDRRSLIINGQRKLLISASIHYPRSVPAMWPGLVRLAKEGGVDVIETYVF 97
Query: 79 WNLHEPVRNQYNFEGRYDLVKFVKLVAEAGLYAHLRIGPYVCAEWNFGGFPLWLHFIPGI 138
WN HEP Y F GR+DLVKF K++ +AG+Y LRIGP+V AEWNFGG P+WLH++PG
Sbjct: 98 WNGHEPSPGNYYFGGRFDLVKFCKIIQQAGMYMILRIGPFVAAEWNFGGLPVWLHYVPGT 157
Query: 139 QFRTDNEPFKAEMQRFTAKIVDMMKQEKLYASQGGPIILSQIENEYGNIDSAYGAAGKSY 198
FRTD+EPFK MQ+F V++MK+E+L+ASQGGPIILSQ+ENEYG ++AYG GK Y
Sbjct: 158 TFRTDSEPFKYHMQKFMTYTVNLMKRERLFASQGGPIILSQVENEYGYYENAYGEGGKRY 217
Query: 199 IKWAAGMALSLDTGVPWVMCQQSDAPDPIINTCNGFYCDQFTPNSNNKPKMWTENWSGWF 258
WAA MALS +TGVPW+MCQQ DAPDP+I+TCN FYCDQF P S NKPK+WTENW GWF
Sbjct: 218 ALWAAKMALSQNTGVPWIMCQQYDAPDPVIDTCNSFYCDQFKPISPNKPKIWTENWPGWF 277
Query: 259 LSFGGAVPYRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNFDRTSGGPFISTSYDYDAPL 318
+FG P+RP ED+A++VARFFQ+GG+ QNYYMYHGGTNF RT+GGPFI+TSYDYDAP+
Sbjct: 278 KTFGARDPHRPAEDVAYSVARFFQKGGSVQNYYMYHGGTNFGRTAGGPFITTSYDYDAPI 337
Query: 319 DEYGLIRQPKWGHLKDLHKAIKLCEAALVATDPTYPSLGPNLEATVYKTGSGLCSAFLAN 378
DEYGL R PKWGHLK+LHK IK CE AL+ DPT SLGP EA VY+ SG C+AFLAN
Sbjct: 338 DEYGLPRFPKWGHLKELHKVIKSCEHALLNNDPTLLSLGPLQEADVYEDASGACAAFLAN 397
Query: 379 IGTNSDVTVKFNGNSYLLPAWSVSILPDCKNVVFNTAKINSVTLVPSFSRQSLQVAADS- 437
+ +D V+F SY LPAWSVSILPDCKNV FNTAK+ T + + + L A S
Sbjct: 398 MDDKNDKVVQFRHVSYHLPAWSVSILPDCKNVAFNTAKVGCQTSIVNMAPIDLHPTASSP 457
Query: 438 -SDAIGSGWSYINEPVGISKDDAFTKPGLLEQINTTADQSDYLWYSLSTNIKADEPLLED 496
D W E G+ FTK G ++ INTT D +DYLWY+ S + A+E L +
Sbjct: 458 KRDIKSLQWEVFKETAGVWGVADFTKNGFVDHINTTKDATDYLWYTTSIFVHAEEDFLRN 517
Query: 497 GSKTVLHVQSLGHALHAFINGKLVGSGYGSSSNAKVTVDFPIALAPGKNTFDLLSLTVGL 556
+L V+S GHA+H FIN KL S G+ + + PIAL GKN LLS+TVGL
Sbjct: 518 RGTAMLFVESKGHAMHVFINKKLQASASGNGTVPQFKFGTPIALKAGKNEISLLSMTVGL 577
Query: 557 QNYGAFYEKTGAGITGPVQLKGSGNGTNIDLSSQQWTYQTGLKGEELNFPSG---SSTQW 613
Q GAFYE GAG T V++ G GT +DL++ WTY+ GL+GE L S W
Sbjct: 578 QTAGAFYEWIGAGPTS-VKVAGFKTGT-MDLTASAWTYKIGLQGEHLRIQKSYNLKSKIW 635
Query: 614 DSKSTLPKLQPLVWYKTTFDAPAGSEPVAIDFTGMGKGEAWVNGQSIGRYWPTYVSQNGG 673
S PK QPL WYK DAP G+EPVA+D MGKG AW+NGQ IGRYWP S+
Sbjct: 636 APTSQPPKQQPLTWYKAVVDAPPGNEPVALDMIHMGKGMAWLNGQEIGRYWPRRTSKYEN 695
Query: 674 CTDSCNYRGAYSSNKCLKNCGKPSQSLYHVPRSWLKSSGNTLVLFEEIGGDPTKISFVTK 733
C C+YRG ++ +KC+ CG+P+Q YHVPRSW K SGN L++FEEIGGDP++I F +
Sbjct: 696 CVTQCDYRGKFNPDKCVTGCGQPTQRWYHVPRSWFKPSGNVLIIFEEIGGDPSQIRFSMR 755
Query: 734 QLGSSLCSHVTDSHP-LPVDMWGSDSKIQRKPGPVLSLECPNPNQVISSIKFASFGTPLG 792
++ S C H++ HP V+ K P LSL+CP N ISS+KFASFG P G
Sbjct: 756 KV-SGACGHLSVDHPSFDVENLQGSEIENDKNRPTLSLKCPT-NTNISSVKFASFGNPNG 813
Query: 793 TCGSFSRGRCSSARSLSVVRQACVGSKSCSIGVSVNTFG-DPCKGVMKSLAVEASCT 848
TCGS+ G C S ++V + C+ C++ +S F C +K LAVE +C+
Sbjct: 814 TCGSYMLGDCHDQNSAALVEKVCLNQNECALEMSSANFNMQLCPSTVKKLAVEVNCS 870
>gi|357113908|ref|XP_003558743.1| PREDICTED: beta-galactosidase 5-like [Brachypodium distachyon]
Length = 839
Score = 933 bits (2412), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 468/831 (56%), Positives = 574/831 (69%), Gaps = 30/831 (3%)
Query: 27 VTYDHRAVVIGGKRRVLISGSIHYPRSTPEMWPDLIQKSKDGGLDVIETYVFWNLHEPVR 86
VTYD +AV+I G+RR+L SGSIHYPRSTPEMW L QK+KDGGLDVI+TYVFWN HEP
Sbjct: 27 VTYDKKAVLIDGQRRILFSGSIHYPRSTPEMWEGLFQKAKDGGLDVIQTYVFWNGHEPTP 86
Query: 87 NQYNFEGRYDLVKFVKLVAEAGLYAHLRIGPYVCAEWNFGGFPLWLHFIPGIQFRTDNEP 146
YNFEGRYDLVKF+K +AGL+ HLRIGPY+C EWNFGGFP+WL ++PGI FRTDNEP
Sbjct: 87 GNYNFEGRYDLVKFIKTAQKAGLFVHLRIGPYICGEWNFGGFPVWLKYVPGISFRTDNEP 146
Query: 147 FKAEMQRFTAKIVDMMKQEKLYASQGGPIILSQIENEYGNIDSAYGAAGKSYIKWAAGMA 206
FK MQ FT KIV MMK E+L+ASQGGPIILSQIENEYG ++GAAGKSY WAA MA
Sbjct: 147 FKTAMQGFTEKIVGMMKSEELFASQGGPIILSQIENEYGPEGKSFGAAGKSYSNWAAKMA 206
Query: 207 LSLDTGVPWVMCQQSDAPDPIINTCNGFYCDQFTPNSNNKPKMWTENWSGWFLSFGGAVP 266
+ LDTGVPWVMC+Q DAPDP+IN CNGFYCD F+PN KP MWTE W+GWF FGG +
Sbjct: 207 VGLDTGVPWVMCKQDDAPDPVINACNGFYCDAFSPNKPYKPTMWTEAWTGWFTEFGGTIR 266
Query: 267 YRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNFDRTSGGPFISTSYDYDAPLDEYGLIRQ 326
RPVEDL+FAVARF Q+GG+F NYYMYHGGTNF RT+GGPFI+TSYDYDAPLDEYGL R+
Sbjct: 267 KRPVEDLSFAVARFVQKGGSFINYYMYHGGTNFGRTAGGPFITTSYDYDAPLDEYGLARE 326
Query: 327 PKWGHLKDLHKAIKLCEAALVATDPTYPSLGPNLEATVYKTGSGLCSAFLANIGTNSDVT 386
PK+GHLK+LH+A+KLCE ALV+ DP +LG EA V+++ S C+AFLAN +NS
Sbjct: 327 PKYGHLKELHRAVKLCEPALVSVDPAVTTLGSMQEAHVFRSPSS-CAAFLANYNSNSHAN 385
Query: 387 VKFNGNSYLLPAWSVSILPDCKNVVFNTAKINSVTLVPSFSRQSLQVAADSSDAIGSGWS 446
V FN Y LP WS+SILPDCK VVFNTA + T +Q+ AD ++ W
Sbjct: 386 VVFNNEHYSLPPWSISILPDCKTVVFNTATVGVQT-------SQMQMWADGESSM--MWE 436
Query: 447 YINEPVG-ISKDDAFTKPGLLEQINTTADQSDYLWYSLSTNIKADEPLLEDGSKTVLHVQ 505
+E VG ++ T GLLEQ+N T D SDYLWY S ++ E L+ G L VQ
Sbjct: 437 RYDEEVGSLAAAPLLTTTGLLEQLNVTRDSSDYLWYITSVDVSPSEKFLQGGEPLSLTVQ 496
Query: 506 SLGHALHAFINGKLVGSGYGSSSNAKVTVDFPIALAPGKNTFDLLSLTVGLQNYGAFYEK 565
S GHALH FING+L GS G+ K + L G N LLS+ GL N G YE
Sbjct: 497 SAGHALHIFINGQLQGSASGTREAKKFSYKGNANLRAGTNKIALLSIACGLPNVGVHYET 556
Query: 566 TGAGITGPVQLKGSGNGTNIDLSSQQWTYQTGLKGEELNFPS---GSSTQWDSKSTLPKL 622
GI GPV L G G+ DL+ Q W+YQ GLKGE++N S SS +W S L +
Sbjct: 557 WNTGIVGPVVLHGLDVGSR-DLTWQTWSYQVGLKGEQMNLNSLEGASSVEWMQGSLLAQ- 614
Query: 623 QPLVWYKTTFDAPAGSEPVAIDFTGMGKGEAWVNGQSIGRYWPTYVSQNGGCTDSCNYRG 682
PL WY+ FD P G EP+A+D MGKG+ W+NGQSIGRY +Y S G C +C+Y G
Sbjct: 615 APLSWYRAYFDTPTGDEPLALDMGSMGKGQIWINGQSIGRYSTSYAS--GDC-KACSYAG 671
Query: 683 AYSSNKCLKNCGKPSQSLYHVPRSWLKSSGNTLVLFEEIGGDPTKISFVTKQLGSSLCSH 742
+Y + KC CG+P+Q YHVP+SWL+ S N LV+FEE+GGD +KIS V + + SS+C+
Sbjct: 672 SYRAPKCQAGCGQPTQRWYHVPKSWLQPSRNLLVVFEELGGDSSKISLVKRSV-SSVCAD 730
Query: 743 VTDSHPLPVDMW----GSDSKIQRKPGPVLSLECPNPNQVISSIKFASFGTPLGTCGSFS 798
V++ H + W + + R P + L C P Q IS+IKFASFGTPLGTCG+F
Sbjct: 731 VSEYH-TNIKNWQIENAGEVEFHR---PKVHLRCA-PGQTISAIKFASFGTPLGTCGNFQ 785
Query: 799 RGRCSSARSLSVVRQACVGSKSCSIGVSVNTF-GDPCKGVMKSLAVEASCT 848
+G C S +S +V+ + C+G + C++ +S + F GDPC MK +AVEA C+
Sbjct: 786 QGDCHSTKSHAVLEKNCIGQQRCAVTISPDNFGGDPCPKEMKKVAVEAVCS 836
>gi|20514290|gb|AAM22973.1|AF499737_1 beta-galactosidase [Oryza sativa Japonica Group]
gi|21070357|gb|AAM34271.1|AF508799_1 beta-galactosidase [Oryza sativa Japonica Group]
Length = 843
Score = 933 bits (2411), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 463/831 (55%), Positives = 577/831 (69%), Gaps = 26/831 (3%)
Query: 27 VTYDHRAVVIGGKRRVLISGSIHYPRSTPEMWPDLIQKSKDGGLDVIETYVFWNLHEPVR 86
VTYD +AV++ G+RR+L SGSIHYPRSTPEMW LI+K+KDGGLDVI+TYVFWN HEP
Sbjct: 27 VTYDKKAVLVDGQRRILFSGSIHYPRSTPEMWDGLIEKAKDGGLDVIQTYVFWNGHEPTP 86
Query: 87 NQYNFEGRYDLVKFVKLVAEAGLYAHLRIGPYVCAEWNFGGFPLWLHFIPGIQFRTDNEP 146
YNFEGRYDLV+F+K V +AG++ HLRIGPY+C EWNFGGFP+WL ++PGI FRTDNEP
Sbjct: 87 GNYNFEGRYDLVRFIKTVQKAGMFVHLRIGPYICGEWNFGGFPVWLKYVPGISFRTDNEP 146
Query: 147 FKAEMQRFTAKIVDMMKQEKLYASQGGPIILSQIENEYGNIDSAYGAAGKSYIKWAAGMA 206
FK MQ FT KIV MMK E L+ASQGGPIILSQIENEYG +GAAGK+YI WAA MA
Sbjct: 147 FKNAMQGFTEKIVGMMKSENLFASQGGPIILSQIENEYGPEGKEFGAAGKAYINWAAKMA 206
Query: 207 LSLDTGVPWVMCQQSDAPDPIINTCNGFYCDQFTPNSNNKPKMWTENWSGWFLSFGGAVP 266
+ LDTGVPWVMC++ DAPDP+IN CNGFYCD F+PN KP MWTE WSGWF FGG +
Sbjct: 207 VGLDTGVPWVMCKEDDAPDPVINACNGFYCDTFSPNKPYKPTMWTEAWSGWFTEFGGTIR 266
Query: 267 YRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNFDRTSGGPFISTSYDYDAPLDEYGLIRQ 326
RPVEDLAF VARF Q+GG+F NYYMYHGGTNF RT+GGPFI+TSYDYDAPLDEYGL R+
Sbjct: 267 QRPVEDLAFGVARFVQKGGSFINYYMYHGGTNFGRTAGGPFITTSYDYDAPLDEYGLARE 326
Query: 327 PKWGHLKDLHKAIKLCEAALVATDPTYPSLGPNLEATVYKTGSGLCSAFLANIGTNSDVT 386
PK+GHLK+LH+A+KLCE LV+ DPT +LG EA V+++ SG C+AFLAN +NS
Sbjct: 327 PKFGHLKELHRAVKLCEQPLVSADPTVTTLGSMQEAHVFRSSSG-CAAFLANYNSNSYAK 385
Query: 387 VKFNGNSYLLPAWSVSILPDCKNVVFNTAKINSVTLVPSFSRQSLQVAADSSDAIGSGWS 446
V FN +Y LP WS+SILPDCKNVVFNTA + T +Q+ AD + ++ W
Sbjct: 386 VIFNNENYSLPPWSISILPDCKNVVFNTATVGVQT-------NQMQMWADGASSM--MWE 436
Query: 447 YINEPV-GISKDDAFTKPGLLEQINTTADQSDYLWYSLSTNIKADEPLLEDGSKTVLHVQ 505
+E V ++ T GLLEQ+N T D SDYLWY + E L+ G+ L VQ
Sbjct: 437 KYDEEVDSLAAAPLLTSTGLLEQLNVTRDTSDYLWYITRVEVDPSEKFLQGGTPLSLTVQ 496
Query: 506 SLGHALHAFINGKLVGSGYGSSSNAKVTVDFPIALAPGKNTFDLLSLTVGLQNYGAFYEK 565
S GHALH FING+L GS YG+ + K++ L G N LLS+ GL N G YE
Sbjct: 497 SAGHALHVFINGQLQGSAYGTREDRKISYSGNANLRAGTNKVALLSVACGLPNVGVHYET 556
Query: 566 TGAGITGPVQLKGSGNGTNIDLSSQQWTY--QTGLKGEELNFPS---GSSTQWDSKSTLP 620
G+ GPV + G G+ DL+ Q W+Y Q GLKGE++N S S +W S +
Sbjct: 557 WNTGVVGPVVIHGLDEGSR-DLTWQTWSYQFQVGLKGEQMNLNSLEGSGSVEWMQGSLVA 615
Query: 621 K-LQPLVWYKTTFDAPAGSEPVAIDFTGMGKGEAWVNGQSIGRYWPTYVSQNGGCTDSCN 679
+ QPL WY+ FD P+G EP+A+D MGKG+ W+NGQSIGRYW Y G C C+
Sbjct: 616 QNQQPLAWYRAYFDTPSGDEPLALDMGSMGKGQIWINGQSIGRYWTAYAE--GDC-KGCH 672
Query: 680 YRGAYSSNKCLKNCGKPSQSLYHVPRSWLKSSGNTLVLFEEIGGDPTKISFVTKQLGSSL 739
Y G+Y + KC CG+P+Q YHVPRSWL+ + N LV+FEE+GGD +KI+ + + S +
Sbjct: 673 YTGSYRAPKCQAGCGQPTQRWYHVPRSWLQPTRNLLVVFEELGGDSSKIALAKRTV-SGV 731
Query: 740 CSHVTDSHPLPVDMWGSDSKIQRK-PGPVLSLECPNPNQVISSIKFASFGTPLGTCGSFS 798
C+ V++ HP + W +S + + + L+C P Q IS+IKFASFGTPLGTCG+F
Sbjct: 732 CADVSEYHP-NIKNWQIESYGEPEFHTAKVHLKCA-PGQTISAIKFASFGTPLGTCGTFQ 789
Query: 799 RGRCSSARSLSVVRQACVGSKSCSIGVSVNTF-GDPCKGVMKSLAVEASCT 848
+G C S S SV+ + C+G + C + +S + F GDPC VMK +AVEA C+
Sbjct: 790 QGECHSINSNSVLEKKCIGLQRCVVAISPSNFGGDPCPEVMKRVAVEAVCS 840
>gi|308550956|gb|ADO34792.1| beta-galactosidase STBG7 [Solanum lycopersicum]
Length = 870
Score = 933 bits (2411), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 456/837 (54%), Positives = 570/837 (68%), Gaps = 12/837 (1%)
Query: 20 TTSFGAN-VTYDHRAVVIGGKRRVLISGSIHYPRSTPEMWPDLIQKSKDGGLDVIETYVF 78
T+ G + VTYD R+++I G+R++LIS SIHYPRS P MWP L++ +K+GG+DVIETYVF
Sbjct: 38 VTTIGTDSVTYDRRSLIINGQRKLLISASIHYPRSVPAMWPGLVRLAKEGGVDVIETYVF 97
Query: 79 WNLHEPVRNQYNFEGRYDLVKFVKLVAEAGLYAHLRIGPYVCAEWNFGGFPLWLHFIPGI 138
WN HEP Y F GR+DLVKF K++ +AG+Y LRIGP+V AEWNFGG P+WLH++PG
Sbjct: 98 WNGHEPSPGNYYFGGRFDLVKFCKIIQQAGMYMILRIGPFVAAEWNFGGLPVWLHYVPGT 157
Query: 139 QFRTDNEPFKAEMQRFTAKIVDMMKQEKLYASQGGPIILSQIENEYGNIDSAYGAAGKSY 198
FRTD+EPFK MQ+F V++MK+E+L+ASQGGPIILSQ+ENEYG ++AYG GK Y
Sbjct: 158 TFRTDSEPFKYHMQKFMTYTVNLMKRERLFASQGGPIILSQVENEYGYYENAYGEGGKRY 217
Query: 199 IKWAAGMALSLDTGVPWVMCQQSDAPDPIINTCNGFYCDQFTPNSNNKPKMWTENWSGWF 258
WAA MALS +TGVPW+MCQQ DAPDP+I+TCN FYCDQF P S NKPK+WTENW GWF
Sbjct: 218 ALWAAKMALSQNTGVPWIMCQQYDAPDPVIDTCNSFYCDQFKPISPNKPKIWTENWPGWF 277
Query: 259 LSFGGAVPYRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNFDRTSGGPFISTSYDYDAPL 318
+FG P+RP ED+A++VARFFQ+GG+ QNYYMYHGGTNF RT+GGPFI+TSYDYDAP+
Sbjct: 278 KTFGARDPHRPAEDVAYSVARFFQKGGSVQNYYMYHGGTNFGRTAGGPFITTSYDYDAPI 337
Query: 319 DEYGLIRQPKWGHLKDLHKAIKLCEAALVATDPTYPSLGPNLEATVYKTGSGLCSAFLAN 378
DEYGL R PKWGHLK+LHK IK CE AL+ DPT SLGP EA VY+ SG C+AFLAN
Sbjct: 338 DEYGLPRFPKWGHLKELHKVIKSCEHALLNNDPTLLSLGPLQEADVYEDASGACAAFLAN 397
Query: 379 IGTNSDVTVKFNGNSYLLPAWSVSILPDCKNVVFNTAKINSVTLVPSFSRQSLQVAADS- 437
+ +D V+F SY LPAWSVSILPDCKNV FNTAK+ T + + + L A S
Sbjct: 398 MDDKNDKVVQFRHVSYHLPAWSVSILPDCKNVAFNTAKVGCQTSIVNMAPIDLHPTASSP 457
Query: 438 -SDAIGSGWSYINEPVGISKDDAFTKPGLLEQINTTADQSDYLWYSLSTNIKADEPLLED 496
D W E G+ FTK G ++ INTT D +DYLWY+ S + A+E L +
Sbjct: 458 KRDIKSLQWEVFKETAGVWGVADFTKNGFVDHINTTKDATDYLWYTTSIFVHAEEDFLRN 517
Query: 497 GSKTVLHVQSLGHALHAFINGKLVGSGYGSSSNAKVTVDFPIALAPGKNTFDLLSLTVGL 556
+L V+S GHA+H FIN KL S G+ + + PIAL GKN LLS+TVGL
Sbjct: 518 RGTAMLFVESKGHAMHVFINKKLQASASGNGTVPQFKFGTPIALKAGKNEIALLSMTVGL 577
Query: 557 QNYGAFYEKTGAGITGPVQLKGSGNGTNIDLSSQQWTYQTGLKGEELNFPSG---SSTQW 613
Q GAFYE GAG T V++ G GT +DL++ WTY+ GL+GE L S W
Sbjct: 578 QTAGAFYEWIGAGPTS-VKVAGFKTGT-MDLTASAWTYKIGLQGEHLRIQKSYNLKSKIW 635
Query: 614 DSKSTLPKLQPLVWYKTTFDAPAGSEPVAIDFTGMGKGEAWVNGQSIGRYWPTYVSQNGG 673
S PK QPL WYK DAP G+EPVA+D MGKG AW+NGQ IGRYWP S+
Sbjct: 636 APTSQPPKQQPLTWYKAVVDAPPGNEPVALDMIHMGKGMAWLNGQEIGRYWPRRTSKYEN 695
Query: 674 CTDSCNYRGAYSSNKCLKNCGKPSQSLYHVPRSWLKSSGNTLVLFEEIGGDPTKISFVTK 733
C C+YRG ++ +KC+ CG+P+Q YHVPRSW K SGN L++FEEIGGDP++I F +
Sbjct: 696 CVTQCDYRGKFNPDKCVTGCGQPTQRWYHVPRSWFKPSGNVLIIFEEIGGDPSQIRFSMR 755
Query: 734 QLGSSLCSHVTDSHP-LPVDMWGSDSKIQRKPGPVLSLECPNPNQVISSIKFASFGTPLG 792
++ S C H++ HP V+ K P LSL+CP N ISS+KFASFG P G
Sbjct: 756 KV-SGACGHLSVDHPSFDVENLQGSEIESDKNRPTLSLKCPT-NTNISSVKFASFGNPNG 813
Query: 793 TCGSFSRGRCSSARSLSVVRQACVGSKSCSIGVSVNTFG-DPCKGVMKSLAVEASCT 848
TCGS+ G C S ++V + C+ C++ +S F C +K LAVE +C+
Sbjct: 814 TCGSYMLGDCHDQNSAALVEKVCLNQNECALEMSSANFNMQLCPSTVKKLAVEVNCS 870
>gi|350537913|ref|NP_001234317.1| TBG6 protein precursor [Solanum lycopersicum]
gi|7939625|gb|AAF70825.1|AF154424_1 putative beta-galactosidase [Solanum lycopersicum]
Length = 845
Score = 932 bits (2408), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 461/848 (54%), Positives = 583/848 (68%), Gaps = 25/848 (2%)
Query: 10 VLCWGFVVLATTSF-GANVTYDHRAVVIGGKRRVLISGSIHYPRSTPEMWPDLIQKSKDG 68
VL W V+ ++ +VTYD +A+VI G+RR+L SGSIHYPRSTPEMW DLI K+K+G
Sbjct: 10 VLLWCIVLFISSGLVHCDVTYDRKAIVINGQRRLLFSGSIHYPRSTPEMWEDLINKAKEG 69
Query: 69 GLDVIETYVFWNLHEPVRNQYNFEGRYDLVKFVKLVAEAGLYAHLRIGPYVCAEWNFGGF 128
GLDV+ETYVFWN+HEP YNFEGRYDLV+FVK + +AGLYAHLRIGPYVCAEWNFGGF
Sbjct: 70 GLDVVETYVFWNVHEPSPGNYNFEGRYDLVRFVKTIQKAGLYAHLRIGPYVCAEWNFGGF 129
Query: 129 PLWLHFIPGIQFRTDNEPFKAEMQRFTAKIVDMMKQEKLYASQGGPIILSQIENEYGNID 188
P+WL ++PGI FR DNEPFK M+ + KIV++MK L+ SQGGPIILSQIENEYG
Sbjct: 130 PVWLKYVPGISFRADNEPFKNAMKGYAEKIVNLMKSHNLFESQGGPIILSQIENEYGPQA 189
Query: 189 SAYGAAGKSYIKWAAGMALSLDTGVPWVMCQQSDAPDPIINTCNGFYCDQFTPNSNNKPK 248
GA G Y WAA MA+ LDTGVPWVMC++ DAPDP+INTCNGFYCD F PN KP
Sbjct: 190 KVLGAPGHQYSTWAANMAVGLDTGVPWVMCKEEDAPDPVINTCNGFYCDNFFPNKPYKPA 249
Query: 249 MWTENWSGWFLSFGGAVPYRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNFDRTSGGPFI 308
+WTE WSGWF FGG + RPV+DLAFAVA+F QRGG+F NYYMYHGGTNF RT+GGPFI
Sbjct: 250 IWTEAWSGWFSEFGGPLHQRPVQDLAFAVAQFIQRGGSFVNYYMYHGGTNFGRTAGGPFI 309
Query: 309 STSYDYDAPLDEYGLIRQPKWGHLKDLHKAIKLCEAALVATDPTYPSLGPNLEATVYKTG 368
+TSYDYDAP+DEYGLIRQPK+GHLK+LH+A+K+CE ++V+ DP SLG +A VY +
Sbjct: 310 TTSYDYDAPIDEYGLIRQPKYGHLKELHRAVKMCEKSIVSADPAITSLGNLQQAYVYSSE 369
Query: 369 SGLCSAFLANIGTNSDVTVKFNGNSYLLPAWSVSILPDCKNVVFNTAKINSVTLVPSFSR 428
+G C+AFL+N S V FN Y LP WS+SILPDC+NVVFNTAK+ T
Sbjct: 370 TGGCAAFLSNNDWKSAARVMFNNMHYNLPPWSISILPDCRNVVFNTAKVGVQT------- 422
Query: 429 QSLQVAADSSDAIGSGWSYINEPVGISKDDAFTKP-GLLEQINTTADQSDYLWYSLSTNI 487
+++ +S+ + W +E + D + + GLLEQIN T D SDYLWY S +I
Sbjct: 423 SKMEMLPTNSEML--SWETYSEDISALDDSSSIRSFGLLEQINVTRDTSDYLWYITSVDI 480
Query: 488 KADEPLLEDGSKTVLHVQSLGHALHAFINGKLVGSGYGSSSNAKVTVDFPIALAPGKNTF 547
+ E L G L V++ GHA+H FING+L GS +G+ N + + L G N
Sbjct: 481 GSTESFLHGGELPTLIVETTGHAMHVFINGQLSGSAFGTRKNRRFVFKGKVNLRAGSNRI 540
Query: 548 DLLSLTVGLQNYGAFYEKTGAGITGPVQLKGSGNGTNIDLSSQQWTYQTGLKGEELNFPS 607
LLS+ VGL N G +E G+ GPV ++G +G DLS +WTYQ GLKGE +N S
Sbjct: 541 ALLSVAVGLPNIGGHFETWSTGVLGPVAIQGLDHG-KWDLSWAKWTYQVGLKGEAMNLVS 599
Query: 608 G---SSTQWDSKSTLP-KLQPLVWYKTTFDAPAGSEPVAIDFTGMGKGEAWVNGQSIGRY 663
S+ W S + K QPL W+K F+ P G EP+A+D + MGKG+ W+NGQSIGRY
Sbjct: 600 TNGISAVDWMQGSLIAQKQQPLTWHKAYFNTPEGDEPLALDMSSMGKGQVWINGQSIGRY 659
Query: 664 WPTYVSQNGGCTDSCNYRGAYSSNKCLKNCGKPSQSLYHVPRSWLKSSGNTLVLFEEIGG 723
W Y + G C + C Y G + KC CG+P+Q YHVPRSWLK + N LVLFEE+GG
Sbjct: 660 WTAYAT--GDC-NGCQYSGVFRPPKCQLGCGEPTQKWYHVPRSWLKPTQNLLVLFEELGG 716
Query: 724 DPTKISFVTKQLGSSLCSHVTDSHPLPVDMWGSDS--KIQRKPGPVLSLECPNPNQVISS 781
DPT+IS V + + +++CS+V + HP + W ++ K + P + + C P Q ISS
Sbjct: 717 DPTRISLVKRSV-TNVCSNVAEYHP-NIKNWQIENYGKTEEFHLPKVRIHCA-PGQSISS 773
Query: 782 IKFASFGTPLGTCGSFSRGRCSSARSLSVVRQACVGSKSCSIGVSVNTFG-DPCKGVMKS 840
IKFASFGTPLGTCGSF +G C + S +VV + C+G ++C++ +S + FG DPC V+K
Sbjct: 774 IKFASFGTPLGTCGSFKQGTCHAPDSHAVVEKKCLGRQTCAVTISNSNFGEDPCPNVLKR 833
Query: 841 LAVEASCT 848
L+VEA CT
Sbjct: 834 LSVEAHCT 841
>gi|326503960|dbj|BAK02766.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 845
Score = 931 bits (2406), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 448/827 (54%), Positives = 573/827 (69%), Gaps = 17/827 (2%)
Query: 27 VTYDHRAVVIGGKRRVLISGSIHYPRSTPEMWPDLIQKSKDGGLDVIETYVFWNLHEPVR 86
VTYDHR++VI G+RR+LIS SIHYPRS P MWP L+ ++K+GG D IETYVFWN HE
Sbjct: 31 VTYDHRSLVISGRRRLLISASIHYPRSVPAMWPKLVAEAKEGGADCIETYVFWNGHETAP 90
Query: 87 NQYNFEGRYDLVKFVKLVAEAGLYAHLRIGPYVCAEWNFGGFPLWLHFIPGIQFRTDNEP 146
+Y FE R+DLV+F ++V +AGL+ LRIGP+V AEWNFGG P WLH+IPG FRT+NEP
Sbjct: 91 GKYYFEDRFDLVQFARVVKDAGLFLMLRIGPFVAAEWNFGGVPAWLHYIPGTVFRTNNEP 150
Query: 147 FKAEMQRFTAKIVDMMKQEKLYASQGGPIILSQIENEYGNIDSAYGAAGKSYIKWAAGMA 206
FK+ M+ FT KIVDMMK+++ +ASQGG IIL+QIENEYG AYGA GK+Y WA MA
Sbjct: 151 FKSHMKSFTTKIVDMMKEQRFFASQGGHIILAQIENEYGYYQQAYGAGGKAYAMWAGSMA 210
Query: 207 LSLDTGVPWVMCQQSDAPDPIINTCNGFYCDQFTPNSNNKPKMWTENWSGWFLSFGGAVP 266
+ +TGVPW+MCQQ D PD +INTCN FYCDQF PNS +PK+WTENW GWF +FG + P
Sbjct: 211 QAQNTGVPWIMCQQYDVPDRVINTCNSFYCDQFKPNSPTQPKIWTENWPGWFQTFGESNP 270
Query: 267 YRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNFDRTSGGPFISTSYDYDAPLDEYGLIRQ 326
+RP ED+AF+VARFF +GG+ QNYY+YHGGTNFDRT+GGPFI+TSYDYDAP+DEYGL R
Sbjct: 271 HRPPEDVAFSVARFFGKGGSVQNYYVYHGGTNFDRTAGGPFITTSYDYDAPIDEYGLRRL 330
Query: 327 PKWGHLKDLHKAIKLCEAALVATDPTYPSLGPNLEATVYKTGSGLCSAFLANIGTNSDVT 386
PKW HLK+LH++IKLCE +L+ + T SLGP EA VY SG C AFLANI + D
Sbjct: 331 PKWAHLKELHQSIKLCEHSLLFGNSTLLSLGPQQEADVYTDHSGGCVAFLANIDSEKDRV 390
Query: 387 VKFNGNSYLLPAWSVSILPDCKNVVFNTAKINSVTLVPSFSRQSLQVAADSSDAIGSGWS 446
V F Y LPAWSVSILPDCKNVVFNTAK+ S TL+ +LQ + WS
Sbjct: 391 VTFRNRQYDLPAWSVSILPDCKNVVFNTAKVRSQTLMVDMVPGTLQASKPDQ------WS 444
Query: 447 YINEPVGISKDDAFTKPGLLEQINTTADQSDYLWYSLSTNIKADEPLLEDGSKTVLHVQS 506
E +G+ + F + ++ INTT D +DYLW++ S ++ + P G+ VL++ S
Sbjct: 445 IFTERIGVWDKNDFVRNEFVDHINTTKDSTDYLWHTTSFDVDRNYP--SSGNHPVLNIDS 502
Query: 507 LGHALHAFINGKLVGSGYGSSSNAKVTVDFPIALAPGKNTFDLLSLTVGLQNYGAFYEKT 566
GHA+HAF+N L+GS YG+ S + + PI L GKN +LS+TVGL++ G +YE
Sbjct: 503 KGHAVHAFLNNMLIGSAYGNGSESSFSAHMPINLKAGKNEIAILSMTVGLKSAGPYYEWV 562
Query: 567 GAGITGPVQLKGSGNGTNIDLSSQQWTYQTGLKGEE---LNFPSGSSTQWDSKSTLPKLQ 623
GAG+T V + G NGT DLSS W Y+ GL+GE G++ +W +S PK Q
Sbjct: 563 GAGLTS-VNISGMKNGT-TDLSSNNWAYKVGLEGEHYGLFKHDQGNNQRWRPQSQPPKHQ 620
Query: 624 PLVWYKTTFDAPAGSEPVAIDFTGMGKGEAWVNGQSIGRYWPTYVSQNGGCTDSCNYRGA 683
PL WYK D P G +PV +D MGKG W+NG +IGRYWP N CT SC+YRG
Sbjct: 621 PLTWYKVNVDVPQGDDPVGLDMQSMGKGLVWLNGNAIGRYWPRTSPTNDRCTTSCDYRGK 680
Query: 684 YSSNKCLKNCGKPSQSLYHVPRSWLKSSGNTLVLFEEIGGDPTKISFVTKQLGSSLCSHV 743
+S NKC CGKP+Q YHVPRSW SGNTLV+FEE GGDPTKI+F ++++ +S+CS V
Sbjct: 681 FSPNKCRVGCGKPTQRWYHVPRSWFHPSGNTLVVFEEQGGDPTKITF-SRRVATSVCSFV 739
Query: 744 TDSHP-LPVDMWGSDSKIQRKPGPVLSLECPNPNQVISSIKFASFGTPLGTCGSFSRGRC 802
++++P + ++ W + + L CP + ISS+KFASFG P GTC S+ +G C
Sbjct: 740 SENYPSIDLESWDKSISDDGRVAAKVQLSCPK-GKNISSVKFASFGDPSGTCRSYQQGSC 798
Query: 803 SSARSLSVVRQACVGSKSCSIGVSVNTFG-DPCKGVMKSLAVEASCT 848
S+SVV +AC+ SC++ +S FG DPC GV K+LA+EA C+
Sbjct: 799 HHPDSVSVVEKACMNMNSCTVSLSDEGFGEDPCPGVTKTLAIEADCS 845
>gi|222624250|gb|EEE58382.1| hypothetical protein OsJ_09539 [Oryza sativa Japonica Group]
Length = 851
Score = 931 bits (2406), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 464/839 (55%), Positives = 578/839 (68%), Gaps = 34/839 (4%)
Query: 27 VTYDHRAVVIGGKRRVLISGSIHYPRSTPEMWPDLIQKSKDGGLDVIETYVFWNLHEPVR 86
VTYD +AV++ G+RR+L SGSIHYPRSTPEMW LI+K+KDGGLDVI+TYVFWN HEP
Sbjct: 27 VTYDKKAVLVDGQRRILFSGSIHYPRSTPEMWDGLIEKAKDGGLDVIQTYVFWNGHEPTP 86
Query: 87 NQYNFEGRYDLVKFVKLVAEAGLYAHLRIGPYVCAEWNFGGFPLWLHFIPGIQFRTDNEP 146
YNFEGRYDLV+F+K V +AG++ HLRIGPY+C EWNFGGFP+WL ++PGI FRTDNEP
Sbjct: 87 GNYNFEGRYDLVRFIKTVQKAGMFVHLRIGPYICGEWNFGGFPVWLKYVPGISFRTDNEP 146
Query: 147 FKAEMQRFTAKIVDMMKQEKLYASQGGPIILSQ----------IENEYGNIDSAYGAAGK 196
FK MQ FT KIV MMK E L+ASQGGPIILSQ IENEYG +GAAGK
Sbjct: 147 FKNAMQGFTEKIVGMMKSENLFASQGGPIILSQASAKLCFPCHIENEYGPEGKEFGAAGK 206
Query: 197 SYIKWAAGMALSLDTGVPWVMCQQSDAPDPIINTCNGFYCDQFTPNSNNKPKMWTENWSG 256
+YI WAA MA+ LDTGVPWVMC++ DAPDP+IN CNGFYCD F+PN KP MWTE WSG
Sbjct: 207 AYINWAAKMAVGLDTGVPWVMCKEDDAPDPVINACNGFYCDTFSPNKPYKPTMWTEAWSG 266
Query: 257 WFLSFGGAVPYRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNFDRTSGGPFISTSYDYDA 316
WF FGG + RPVEDLAF VARF Q+GG+F NYYMYHGGTNF RT+GGPFI+TSYDYDA
Sbjct: 267 WFTEFGGTIRQRPVEDLAFGVARFVQKGGSFINYYMYHGGTNFGRTAGGPFITTSYDYDA 326
Query: 317 PLDEYGLIRQPKWGHLKDLHKAIKLCEAALVATDPTYPSLGPNLEATVYKTGSGLCSAFL 376
PLDEYGL R+PK+GHLK+LH+A+KLCE LV+ DPT +LG EA V+++ SG C+AFL
Sbjct: 327 PLDEYGLAREPKFGHLKELHRAVKLCEQPLVSADPTVTTLGSMQEAHVFRSSSG-CAAFL 385
Query: 377 ANIGTNSDVTVKFNGNSYLLPAWSVSILPDCKNVVFNTAKINSVTLVPSFSRQSLQVAAD 436
AN +NS V FN +Y LP WS+SILPDCKNVVFNTA + T +Q+ AD
Sbjct: 386 ANYNSNSYAKVIFNNENYSLPPWSISILPDCKNVVFNTATVGVQT-------NQMQMWAD 438
Query: 437 SSDAIGSGWSYINEPV-GISKDDAFTKPGLLEQINTTADQSDYLWYSLSTNIKADEPLLE 495
+ ++ W +E V ++ T GLLEQ+N T D SDYLWY S + E L+
Sbjct: 439 GASSM--MWEKYDEEVDSLAAAPLLTSTGLLEQLNVTRDTSDYLWYITSVEVDPSEKFLQ 496
Query: 496 DGSKTVLHVQSLGHALHAFINGKLVGSGYGSSSNAKVTVDFPIALAPGKNTFDLLSLTVG 555
G+ L VQS GHALH FING+L GS YG+ + K++ L G N LLS+ G
Sbjct: 497 GGTPLSLTVQSAGHALHVFINGQLQGSAYGTREDRKISYSGNANLRAGTNKVALLSVACG 556
Query: 556 LQNYGAFYEKTGAGITGPVQLKGSGNGTNIDLSSQQWTYQTGLKGEELNFPS---GSSTQ 612
L N G YE G+ GPV + G G+ DL+ Q W+YQ GLKGE++N S S +
Sbjct: 557 LPNVGVHYETWNTGVVGPVVIHGLDEGSR-DLTWQTWSYQVGLKGEQMNLNSLEGSGSVE 615
Query: 613 WDSKSTLPK-LQPLVWYKTTFDAPAGSEPVAIDFTGMGKGEAWVNGQSIGRYWPTYVSQN 671
W S + + QPL WY+ FD P+G EP+A+D MGKG+ W+NGQSIGRYW Y
Sbjct: 616 WMQGSLVAQNQQPLAWYRAYFDTPSGDEPLALDMGSMGKGQIWINGQSIGRYWTAYAE-- 673
Query: 672 GGCTDSCNYRGAYSSNKCLKNCGKPSQSLYHVPRSWLKSSGNTLVLFEEIGGDPTKISFV 731
G C C+Y G+Y + KC CG+P+Q YHVPRSWL+ + N LV+FEE+GGD +KI+
Sbjct: 674 GDC-KGCHYTGSYRAPKCQAGCGQPTQRWYHVPRSWLQPTRNLLVVFEELGGDSSKIALA 732
Query: 732 TKQLGSSLCSHVTDSHPLPVDMWGSDSKIQRK-PGPVLSLECPNPNQVISSIKFASFGTP 790
+ + S +C+ V++ HP + W +S + + + L+C P Q IS+IKFASFGTP
Sbjct: 733 KRTV-SGVCADVSEYHP-NIKNWQIESYGEPEFHTAKVHLKCA-PGQTISAIKFASFGTP 789
Query: 791 LGTCGSFSRGRCSSARSLSVVRQACVGSKSCSIGVSVNTF-GDPCKGVMKSLAVEASCT 848
LGTCG+F +G C S S SV+ + C+G + C + +S + F GDPC VMK +AVEA C+
Sbjct: 790 LGTCGTFQQGECHSINSNSVLEKKCIGLQRCVVAISPSNFGGDPCPEVMKRVAVEAVCS 848
>gi|218192153|gb|EEC74580.1| hypothetical protein OsI_10152 [Oryza sativa Indica Group]
Length = 851
Score = 931 bits (2405), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 464/839 (55%), Positives = 578/839 (68%), Gaps = 34/839 (4%)
Query: 27 VTYDHRAVVIGGKRRVLISGSIHYPRSTPEMWPDLIQKSKDGGLDVIETYVFWNLHEPVR 86
VTYD +AV++ G+RR+L SGSIHYPRSTPEMW LI+K+KDGGLDVI+TYVFWN HEP
Sbjct: 27 VTYDKKAVLVDGQRRILFSGSIHYPRSTPEMWDGLIEKAKDGGLDVIQTYVFWNGHEPTP 86
Query: 87 NQYNFEGRYDLVKFVKLVAEAGLYAHLRIGPYVCAEWNFGGFPLWLHFIPGIQFRTDNEP 146
YNFEGRYDLV+F+K V +AG++ HLRIGPY+C EWNFGGFP+WL ++PGI FRTDNEP
Sbjct: 87 GNYNFEGRYDLVRFIKTVQKAGMFVHLRIGPYICGEWNFGGFPVWLKYVPGISFRTDNEP 146
Query: 147 FKAEMQRFTAKIVDMMKQEKLYASQGGPIILSQ----------IENEYGNIDSAYGAAGK 196
FK MQ FT KIV MMK E L+ASQGGPIILSQ IENEYG +GAAGK
Sbjct: 147 FKNAMQGFTEKIVGMMKSENLFASQGGPIILSQASAKLCFPCHIENEYGPEGKEFGAAGK 206
Query: 197 SYIKWAAGMALSLDTGVPWVMCQQSDAPDPIINTCNGFYCDQFTPNSNNKPKMWTENWSG 256
+YI WAA MA+ LDTGVPWVMC++ DAPDP+IN CNGFYCD F+PN KP MWTE WSG
Sbjct: 207 AYINWAAKMAVGLDTGVPWVMCKEDDAPDPVINACNGFYCDTFSPNKPYKPTMWTEAWSG 266
Query: 257 WFLSFGGAVPYRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNFDRTSGGPFISTSYDYDA 316
WF FGG + RPVEDLAF VARF Q+GG+F NYYMYHGGTNF RT+GGPFI+TSYDYDA
Sbjct: 267 WFTEFGGTIRQRPVEDLAFGVARFVQKGGSFINYYMYHGGTNFGRTAGGPFITTSYDYDA 326
Query: 317 PLDEYGLIRQPKWGHLKDLHKAIKLCEAALVATDPTYPSLGPNLEATVYKTGSGLCSAFL 376
PLDEYGL R+PK+GHLK+LH+A+KLCE LV+ DPT +LG EA V+++ SG C+AFL
Sbjct: 327 PLDEYGLAREPKFGHLKELHRAVKLCEQPLVSADPTVTTLGSMQEAHVFRSSSG-CAAFL 385
Query: 377 ANIGTNSDVTVKFNGNSYLLPAWSVSILPDCKNVVFNTAKINSVTLVPSFSRQSLQVAAD 436
AN +NS V FN +Y LP WS+SILPDCKNVVFNTA + T +Q+ AD
Sbjct: 386 ANYNSNSYAKVIFNNENYSLPPWSISILPDCKNVVFNTATVGVQT-------NQMQMWAD 438
Query: 437 SSDAIGSGWSYINEPV-GISKDDAFTKPGLLEQINTTADQSDYLWYSLSTNIKADEPLLE 495
+ ++ W +E V ++ T GLLEQ+N T D SDYLWY S + E L+
Sbjct: 439 GASSM--MWEKYDEEVDSLAAAPLLTSTGLLEQLNVTRDTSDYLWYITSVEVDPSEKFLQ 496
Query: 496 DGSKTVLHVQSLGHALHAFINGKLVGSGYGSSSNAKVTVDFPIALAPGKNTFDLLSLTVG 555
G+ L VQS GHALH FING+L GS YG+ + K++ L G N LLS+ G
Sbjct: 497 GGTPLSLTVQSAGHALHVFINGQLQGSAYGTREDRKISYSGNANLRAGTNKVALLSVACG 556
Query: 556 LQNYGAFYEKTGAGITGPVQLKGSGNGTNIDLSSQQWTYQTGLKGEELNFPS---GSSTQ 612
L N G YE G+ GPV + G G+ DL+ Q W+YQ GLKGE++N S S +
Sbjct: 557 LPNVGVHYETWNTGVVGPVVIHGLDEGSR-DLTWQTWSYQVGLKGEQMNLNSLEGSGSVE 615
Query: 613 WDSKSTLPK-LQPLVWYKTTFDAPAGSEPVAIDFTGMGKGEAWVNGQSIGRYWPTYVSQN 671
W S + + QPL WY+ FD P+G EP+A+D MGKG+ W+NGQSIGRYW Y
Sbjct: 616 WMQGSLVAQNQQPLAWYRAYFDTPSGDEPLALDMGSMGKGQIWINGQSIGRYWTAYAE-- 673
Query: 672 GGCTDSCNYRGAYSSNKCLKNCGKPSQSLYHVPRSWLKSSGNTLVLFEEIGGDPTKISFV 731
G C C+Y G+Y + KC CG+P+Q YHVPRSWL+ + N LV+FEE+GGD +KI+
Sbjct: 674 GDC-KGCHYTGSYRAPKCQAGCGQPTQRWYHVPRSWLQPTRNLLVVFEELGGDSSKIALA 732
Query: 732 TKQLGSSLCSHVTDSHPLPVDMWGSDSKIQRK-PGPVLSLECPNPNQVISSIKFASFGTP 790
+ + S +C+ V++ HP + W +S + + + L+C P Q IS+IKFASFGTP
Sbjct: 733 KRTV-SGVCADVSEYHP-NIKNWQIESYGEPEFHTAKVHLKCA-PGQTISAIKFASFGTP 789
Query: 791 LGTCGSFSRGRCSSARSLSVVRQACVGSKSCSIGVSVNTF-GDPCKGVMKSLAVEASCT 848
LGTCG+F +G C S S SV+ + C+G + C + +S + F GDPC VMK +AVEA C+
Sbjct: 790 LGTCGTFQQGECHSINSNSVLERKCIGLERCVVAISPSNFGGDPCPEVMKRVAVEAVCS 848
>gi|308550954|gb|ADO34791.1| beta-galactosidase STBG6 [Solanum lycopersicum]
Length = 845
Score = 930 bits (2403), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 461/848 (54%), Positives = 581/848 (68%), Gaps = 25/848 (2%)
Query: 10 VLCWGFVVLATTSF-GANVTYDHRAVVIGGKRRVLISGSIHYPRSTPEMWPDLIQKSKDG 68
VL W V+ ++ +VTYD A+VI G+RR+L SGSIHYPRSTPEMW DLI K+K+G
Sbjct: 10 VLLWCIVLFISSGLVHCDVTYDREAIVINGQRRLLFSGSIHYPRSTPEMWEDLINKAKEG 69
Query: 69 GLDVIETYVFWNLHEPVRNQYNFEGRYDLVKFVKLVAEAGLYAHLRIGPYVCAEWNFGGF 128
GLDV+ETYVFWN+HEP YNFEGRYDLV+FVK + +AGLYAHLRIGPYVCAEWNFGGF
Sbjct: 70 GLDVVETYVFWNVHEPSPGNYNFEGRYDLVRFVKTIQKAGLYAHLRIGPYVCAEWNFGGF 129
Query: 129 PLWLHFIPGIQFRTDNEPFKAEMQRFTAKIVDMMKQEKLYASQGGPIILSQIENEYGNID 188
P+WL ++PGI FR DNEPFK M+ + KIV++MK L+ SQGGPIILSQIENEYG
Sbjct: 130 PVWLKYVPGISFRADNEPFKNAMKGYAEKIVNLMKSHNLFESQGGPIILSQIENEYGPQA 189
Query: 189 SAYGAAGKSYIKWAAGMALSLDTGVPWVMCQQSDAPDPIINTCNGFYCDQFTPNSNNKPK 248
GA G Y WAA MA+ LDTGVPWVMC++ DAPDP+INTCNGFYCD F PN KP
Sbjct: 190 KVLGAPGHQYSTWAANMAVGLDTGVPWVMCKEEDAPDPVINTCNGFYCDNFFPNKPYKPA 249
Query: 249 MWTENWSGWFLSFGGAVPYRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNFDRTSGGPFI 308
WTE WSGWF FGG + RPV+DLAFAVA+F QRGG+F NYYMYHGGTNF RT+GGPFI
Sbjct: 250 TWTEAWSGWFSEFGGPLHQRPVQDLAFAVAQFIQRGGSFVNYYMYHGGTNFGRTAGGPFI 309
Query: 309 STSYDYDAPLDEYGLIRQPKWGHLKDLHKAIKLCEAALVATDPTYPSLGPNLEATVYKTG 368
+TSYDYDAP+DEYGLIRQPK+GHLK+LH+A+K+CE ++V+ DP SLG +A VY +
Sbjct: 310 TTSYDYDAPIDEYGLIRQPKYGHLKELHRAVKMCEKSIVSADPAITSLGNLQQAYVYSSE 369
Query: 369 SGLCSAFLANIGTNSDVTVKFNGNSYLLPAWSVSILPDCKNVVFNTAKINSVTLVPSFSR 428
+G C+AFL+N S V FN Y LP WS+SILPDC+NVVFNTAK+ T
Sbjct: 370 TGGCAAFLSNNDWKSAARVMFNNMHYNLPPWSISILPDCRNVVFNTAKVGVQT------- 422
Query: 429 QSLQVAADSSDAIGSGWSYINEPVGISKDDAFTKP-GLLEQINTTADQSDYLWYSLSTNI 487
+++ +S+ + W +E + D + + GLLEQIN T D SDYLWY S +I
Sbjct: 423 SKMEMLPTNSEML--SWETYSEDISALDDSSSIRSFGLLEQINVTRDTSDYLWYITSVDI 480
Query: 488 KADEPLLEDGSKTVLHVQSLGHALHAFINGKLVGSGYGSSSNAKVTVDFPIALAPGKNTF 547
+ E L G L V++ GHA+H FING+L GS +G+ N + + L G N
Sbjct: 481 GSTESFLHGGELPTLIVETTGHAMHVFINGQLSGSAFGTRKNRRFVFKGKVNLRAGSNRI 540
Query: 548 DLLSLTVGLQNYGAFYEKTGAGITGPVQLKGSGNGTNIDLSSQQWTYQTGLKGEELNFPS 607
LLS+ VGL N G +E G+ GPV ++G +G DLS +WTYQ GLKGE +N S
Sbjct: 541 ALLSVAVGLPNIGGHFETWSTGVLGPVAIQGLDHG-KWDLSWAKWTYQVGLKGEAMNLVS 599
Query: 608 G---SSTQWDSKSTLP-KLQPLVWYKTTFDAPAGSEPVAIDFTGMGKGEAWVNGQSIGRY 663
S+ W S + K QPL W+K F+ P G EP+A+D + MGKG+ W+NGQSIGRY
Sbjct: 600 TNGISAVDWMQGSLIAQKQQPLTWHKAYFNTPEGDEPLALDMSSMGKGQVWINGQSIGRY 659
Query: 664 WPTYVSQNGGCTDSCNYRGAYSSNKCLKNCGKPSQSLYHVPRSWLKSSGNTLVLFEEIGG 723
W Y + G C + C Y G + KC CG+P+Q YHVPRSWLK + N LVLFEE+GG
Sbjct: 660 WTAYAT--GDC-NGCQYSGVFRPPKCQLGCGEPTQKWYHVPRSWLKPTQNLLVLFEELGG 716
Query: 724 DPTKISFVTKQLGSSLCSHVTDSHPLPVDMWGSDS--KIQRKPGPVLSLECPNPNQVISS 781
DPT+IS V + + +++CS+V + HP + W ++ K + P + + C P Q ISS
Sbjct: 717 DPTRISLVKRSV-TNVCSNVAEYHP-NIKNWQIENYGKTEEFHLPKVRIHCA-PGQSISS 773
Query: 782 IKFASFGTPLGTCGSFSRGRCSSARSLSVVRQACVGSKSCSIGVSVNTFG-DPCKGVMKS 840
IKFASFGTPLGTCGSF +G C + S +VV + C+G ++C++ +S + FG DPC V+K
Sbjct: 774 IKFASFGTPLGTCGSFKQGTCHAPDSHAVVEKKCLGRQTCAVTISNSNFGEDPCPNVLKR 833
Query: 841 LAVEASCT 848
L+VEA CT
Sbjct: 834 LSVEAHCT 841
>gi|414881557|tpg|DAA58688.1| TPA: hypothetical protein ZEAMMB73_223728 [Zea mays]
Length = 830
Score = 929 bits (2402), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 459/821 (55%), Positives = 567/821 (69%), Gaps = 22/821 (2%)
Query: 28 TYDHRAVVIGGKRRVLISGSIHYPRSTPEMWPDLIQKSKDGGLDVIETYVFWNLHEPVRN 87
TYD +AVV+ G+RR+L+SGSIHYPRS PEMWPDLIQK+KDGGLDV++TYVFWN HEP R
Sbjct: 30 TYDRKAVVVNGQRRILMSGSIHYPRSVPEMWPDLIQKAKDGGLDVVQTYVFWNGHEPSRR 89
Query: 88 QYNFEGRYDLVKFVKLVAEAGLYAHLRIGPYVCAEWNFGGFPLWLHFIPGIQFRTDNEPF 147
QY FEGRYDLV F+KLV +AGLY HLRIGPYVCAEWNFGGFP+WL ++PGI FRTDNEPF
Sbjct: 90 QYYFEGRYDLVHFIKLVKQAGLYVHLRIGPYVCAEWNFGGFPVWLKYVPGISFRTDNEPF 149
Query: 148 KAEMQRFTAKIVDMMKQEKLYASQGGPIILSQIENEYGNIDSAYGAAGKSYIKWAAGMAL 207
KAEMQ FT KIVDMMK E L+ QGGPIILSQIENE+G ++ G K+Y WAA MA+
Sbjct: 150 KAEMQNFTTKIVDMMKSEGLFEWQGGPIILSQIENEFGPLEWDQGEPAKAYASWAANMAV 209
Query: 208 SLDTGVPWVMCQQSDAPDPIINTCNGFYCDQFTPNSNNKPKMWTENWSGWFLSFGGAVPY 267
+L+T VPWVMC++ DAPDPIINTCNGFYCD F+PN +KP MWTE W+ W+ FG VP+
Sbjct: 210 ALNTSVPWVMCKEDDAPDPIINTCNGFYCDWFSPNKPHKPTMWTEAWTSWYTGFGIPVPH 269
Query: 268 RPVEDLAFAVARFFQRGGTFQNYYMYHGGTNFDRTSGGPFISTSYDYDAPLDEYGLIRQP 327
RPVEDLA+ VA+F Q+GG+F NYYMYHGGTNF RT+GGPFI+TSYDYDAP+DEYGL+R+P
Sbjct: 270 RPVEDLAYGVAKFIQKGGSFVNYYMYHGGTNFGRTAGGPFIATSYDYDAPIDEYGLLREP 329
Query: 328 KWGHLKDLHKAIKLCEAALVATDPTYPSLGPNLEATVYKTGSGLCSAFLANIGTNSDVTV 387
KWGHLK+LHKAIKLCE ALVA DP SLG +A+V+++ + C AFL N S V
Sbjct: 330 KWGHLKELHKAIKLCEPALVAGDPIVTSLGNAQQASVFRSSTDACVAFLENKDKVSYARV 389
Query: 388 KFNGNSYLLPAWSVSILPDCKNVVFNTAKINSVTLVPSFSRQSLQVAADSSDAIGSGWSY 447
FNG Y LP WS+SILPDCK V+NTA + S S+ ++ A G W
Sbjct: 390 SFNGMHYDLPPWSISILPDCKTTVYNTASVGS-----QISQMKMEWAG------GFTWQS 438
Query: 448 INEPVGISKDDAFTKPGLLEQINTTADQSDYLWYSLSTNIKADEPLLEDGSKTVLHVQSL 507
NE + D++F GLLEQIN T D +DYLWY+ +I DE L +G +L V S
Sbjct: 439 YNEDINSLGDESFATVGLLEQINVTRDNTDYLWYTTYVDIAQDEQFLSNGKNPMLTVMSA 498
Query: 508 GHALHAFINGKLVGSGYGSSSNAKVTVDFPIALAPGKNTFDLLSLTVGLQNYGAFYEKTG 567
GHALH F+NG+L G+ YGS + K+T + L G NT LS+ VGL N G +E
Sbjct: 499 GHALHIFVNGQLTGTVYGSVEDPKLTYSGNVKLWSGSNTISCLSIAVGLPNVGEHFETWN 558
Query: 568 AGITGPVQLKGSGNGTNIDLSSQQWTYQTGLKGEELNFPSGSSTQWDSKSTLPKLQPLVW 627
AGI GPV L G G DL+ Q+WTY+ GLKGE L+ S S + + QPL W
Sbjct: 559 AGILGPVTLDGLNEGRR-DLTWQKWTYKVGLKGEALSLHSLSGSSSVEWGEPVQKQPLSW 617
Query: 628 YKTTFDAPAGSEPVAIDFTGMGKGEAWVNGQSIGRYWPTYVSQNGGCTDSCNYRGAYSSN 687
YK F+AP G EP+A+D + MGKG+ W+NGQ IGRYWP Y + G C+YRG Y
Sbjct: 618 YKAFFNAPDGDEPLALDMSSMGKGQIWINGQGIGRYWPGY--KASGTCGICDYRGEYDEK 675
Query: 688 KCLKNCGKPSQSLYHVPRSWLKSSGNTLVLFEEIGGDPTKISFVTKQLGSSLCSHVTDSH 747
KC NCG SQ YHVPRSWL +GN LV+FEE GGDPT IS V K++ S+C+ V++
Sbjct: 676 KCQTNCGDSSQRWYHVPRSWLNPTGNLLVIFEEWGGDPTGISMV-KRIAGSICADVSEWQ 734
Query: 748 PLPVDMWGSDSKIQRKPGPVLSLECPNPNQVISSIKFASFGTPLGTCGSFSRGRCSSARS 807
P + W + + K + L+C + + ++ IKFASFGTP G+CGS+S G C + +S
Sbjct: 735 PSMAN-WRTKGYEKAK----VHLQC-DHGRKMTHIKFASFGTPQGSCGSYSEGGCHAHKS 788
Query: 808 LSVVRQACVGSKSCSIGVSVNTF-GDPCKGVMKSLAVEASC 847
+ ++C+G + C + V + F GDPC G MK VEA C
Sbjct: 789 YDIFWKSCIGQERCGVSVVPDAFGGDPCPGTMKRAVVEAIC 829
>gi|255554022|ref|XP_002518051.1| beta-galactosidase, putative [Ricinus communis]
gi|223542647|gb|EEF44184.1| beta-galactosidase, putative [Ricinus communis]
Length = 897
Score = 928 bits (2399), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 467/858 (54%), Positives = 582/858 (67%), Gaps = 44/858 (5%)
Query: 26 NVTYDHRAVVIGGKRRVLISGSIHYPRSTPEMWPDLIQKSKDGGLDVIETYVFWNLHEPV 85
NV+YDHRA++I G RR+LISG IHYPR+TP+MWPDLI KSK+GG+DVI+TYVFWN HEPV
Sbjct: 39 NVSYDHRALIIDGHRRMLISGGIHYPRATPQMWPDLIAKSKEGGVDVIQTYVFWNGHEPV 98
Query: 86 RNQYNFEGRYDLVKFVKLVAEAGLYAHLRIGPYVCAEWNFGGFPLWLHFIPGIQFRTDNE 145
+ QY FEG+YDLVKFVKLV +GLY HLRIGPYVCAEWNFGGFP+WL IPGI FRTDN
Sbjct: 99 KGQYIFEGQYDLVKFVKLVGVSGLYLHLRIGPYVCAEWNFGGFPVWLRDIPGIVFRTDNS 158
Query: 146 PFKAEMQRFTAKIVDMMKQEKLYASQGGPIILSQIENEYGNIDSAYGAAGKSYIKWAAGM 205
PF EMQ+F KIVD+M++E L++ QGGPII+ QIENEYGNI+ ++G GK Y+KWAA M
Sbjct: 159 PFMEEMQQFVKKIVDLMREEMLFSWQGGPIIMLQIENEYGNIEHSFGPGGKEYVKWAARM 218
Query: 206 ALSLDTGVPWVMCQQSDAPDPIINTCNGFYCDQFTPNSNNKPKMWTENWSGWFLSFGGAV 265
AL L GVPWVMC+Q+DAP II+ CN +YCD + PNSN KP +WTE+W GW+ ++GG++
Sbjct: 219 ALGLGAGVPWVMCRQTDAPGSIIDACNEYYCDGYKPNSNKKPILWTEDWDGWYTTWGGSL 278
Query: 266 PYRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNFDRTSGGPFISTSYDYDAPLDEYGLIR 325
P+RPVEDLAFAVARFFQRGG+FQNYYMY GGTNF RT+GGPF TSYDYDAP+DEYGL+
Sbjct: 279 PHRPVEDLAFAVARFFQRGGSFQNYYMYFGGTNFARTAGGPFYITSYDYDAPIDEYGLLS 338
Query: 326 QPKWGHLKDLHKAIKLCEAALVATDPT-YPSLGPNLEATVYKT-----GSGL-------- 371
+PKWGHLKDLH AIKLCE ALVA D Y LG EA VY+ G L
Sbjct: 339 EPKWGHLKDLHAAIKLCEPALVAADSAQYIKLGSKQEAHVYRANVHAEGQNLTQHGSQSK 398
Query: 372 CSAFLANIGTNSDVTVKFNGNSYLLPAWSVSILPDCKNVVFNTAKINSVTLV-------P 424
CSAFLANI + VTV+F G SY LP WSVS+LPDC+N VFNTAK+ + T + P
Sbjct: 399 CSAFLANIDEHKAVTVRFLGQSYTLPPWSVSVLPDCRNAVFNTAKVAAQTSIKSMELALP 458
Query: 425 SFSRQSLQ---VAADSSDAIGSGWSYINEPVGISKDDAFTKPGLLEQINTTADQSDYLWY 481
FS S +A + + S W + EP+ + + FT G+LE +N T D SDYLWY
Sbjct: 459 QFSGISAPKQLMAQNEGSYMSSSWMTVKEPISVWSGNNFTVEGILEHLNVTKDHSDYLWY 518
Query: 482 SLSTNIKADEPLL--EDGSKTVLHVQSLGHALHAFINGKLVGSGYGSSSNAKVTVDFPIA 539
+ D+ E+ + + S+ L FING+L GS G + V P+
Sbjct: 519 FTRIYVSDDDIAFWEENNVHPAIKIDSMRDVLRVFINGQLTGSVIGRW----IKVVQPVQ 574
Query: 540 LAPGKNTFDLLSLTVGLQNYGAFYEKTGAGITGPVQLKGSGNGTNIDLSSQQWTYQTGLK 599
G N LLS TVGLQNYGAF E+ GAG G +L G +G +IDLS+ +WTYQ GL+
Sbjct: 575 FQKGYNELVLLSQTVGLQNYGAFLERDGAGFRGHTKLTGFRDG-DIDLSNLEWTYQVGLQ 633
Query: 600 GEELNF---PSGSSTQWDSKSTLPKLQPLVWYKTTFDAPAGSEPVAIDFTGMGKGEAWVN 656
GE + +W + WYKT FDAP+G++PVA+D MGKG+AWVN
Sbjct: 634 GENQKIYTTENNEKAEWTDLTLDDIPSTFTWYKTYFDAPSGADPVALDLGSMGKGQAWVN 693
Query: 657 GQSIGRYWPTYVSQNGGCTDSCNYRGAYSSNKCLKNCGKPSQSLYHVPRSWLKSSGNTLV 716
IGRYW T V+ GC C+YRGAY+S KC NCGKP+Q YH+PRSWL+ S N LV
Sbjct: 694 DHHIGRYW-TLVAPEEGC-QKCDYRGAYNSEKCRTNCGKPTQIWYHIPRSWLQPSNNLLV 751
Query: 717 LFEEIGGDPTKISFVTKQLGSSLCSHVTDSHPLPVDMWGSDSKIQ-----RKPGPVLSLE 771
+FEE GG+P +IS + S +C+ V+++H P+ W I + P + L
Sbjct: 752 IFEETGGNPFEISIKLRS-ASVVCAQVSETHYPPLQRWIHTDFIYGNVSGKDMTPEIQLR 810
Query: 772 CPNPNQVISSIKFASFGTPLGTCGSFSRGRCSSARSLSVVRQACVGSKSCSIGVSVNTF- 830
C + VISSI+FAS+GTP G+C FSRG C + SLSVV +AC G +C+I +S F
Sbjct: 811 CQD-GYVISSIEFASYGTPQGSCQKFSRGNCHAPNSLSVVSKACQGRDTCNIAISNAVFG 869
Query: 831 GDPCKGVMKSLAVEASCT 848
GDPC+G++K+LAVEA C+
Sbjct: 870 GDPCRGIVKTLAVEAKCS 887
>gi|222618730|gb|EEE54862.1| hypothetical protein OsJ_02342 [Oryza sativa Japonica Group]
Length = 839
Score = 928 bits (2398), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 465/836 (55%), Positives = 573/836 (68%), Gaps = 40/836 (4%)
Query: 28 TYDHRAVVIGGKRRVLISGSIHYPRSTPE------------MWPDLIQKSKDGGLDVIET 75
TYD +AVV+ G+RR+LISGSIHYPRSTPE MWPDLI+K+KDGGLDV++T
Sbjct: 27 TYDRKAVVVNGQRRILISGSIHYPRSTPEARRTRFPFLLLTMWPDLIEKAKDGGLDVVQT 86
Query: 76 YVFWNLHEPVRNQYNFEGRYDLVKFVKLVAEAGLYAHLRIGPYVCAEWNFGGFPLWLHFI 135
YVFWN HEP QY FEGRYDLV F+KLV +AGLY +LRIGPYVCAEWNFGGFP+WL ++
Sbjct: 87 YVFWNGHEPSPGQYYFEGRYDLVHFIKLVKQAGLYVNLRIGPYVCAEWNFGGFPVWLKYV 146
Query: 136 PGIQFRTDNEPFKAEMQRFTAKIVDMMKQEKLYASQGGPIILSQIENEYGNIDSAYGAAG 195
PGI FRTDNEPFKAEMQ+FT KIV+MMK E L+ QGGPIILSQIENE+G ++ G
Sbjct: 147 PGISFRTDNEPFKAEMQKFTTKIVEMMKSEGLFEWQGGPIILSQIENEFGPLEWDQGEPA 206
Query: 196 KSYIKWAAGMALSLDTGVPWVMCQQSDAPDPIINTCNGFYCDQFTPNSNNKPKMWTENWS 255
K+Y WAA MA++L+T VPW+MC++ DAPDPIINTCNGFYCD F+PN +KP MWTE W+
Sbjct: 207 KAYASWAANMAVALNTSVPWIMCKEDDAPDPIINTCNGFYCDWFSPNKPHKPTMWTEAWT 266
Query: 256 GWFLSFGGAVPYRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNFDRTSGGPFISTSYDYD 315
W+ FG VP+RPVEDLA+ VA+F Q+GG+F NYYMYHGGTNF RT+GGPFI+TSYDYD
Sbjct: 267 AWYTGFGIPVPHRPVEDLAYGVAKFIQKGGSFVNYYMYHGGTNFGRTAGGPFIATSYDYD 326
Query: 316 APLDEYGLIRQPKWGHLKDLHKAIKLCEAALVATDPTYPSLGPNLEATVYKTGSGLCSAF 375
AP+DEYGL+R+PKWGHLK LHKAIKLCE ALVA DP SLG +++V+++ +G C+AF
Sbjct: 327 APIDEYGLLREPKWGHLKQLHKAIKLCEPALVAGDPIVTSLGNAQKSSVFRSSTGACAAF 386
Query: 376 LANIGTNSDVTVKFNGNSYLLPAWSVSILPDCKNVVFNTAKINSVTLVPSFSRQSLQVAA 435
L N S V FNG Y LP WS+SILPDCK VFNTA++ S S+ ++ A
Sbjct: 387 LENKDKVSYARVAFNGMHYDLPPWSISILPDCKTTVFNTARVGS-----QISQMKMEWAG 441
Query: 436 DSSDAIGSGWSYINEPVGISKDDAFTKPGLLEQINTTADQSDYLWYSLSTNIKADEPLLE 495
G W NE + +D T GLLEQIN T D +DYLWY+ ++ DE L
Sbjct: 442 ------GFAWQSYNEEINSFGEDPLTTVGLLEQINVTRDNTDYLWYTTYVDVAQDEQFLS 495
Query: 496 DGSKTVLHVQSLGHALHAFINGKLVGSGYGSSSNAKVTVDFPIALAPGKNTFDLLSLTVG 555
+G L V S GHALH FING+L G+ YGS + K+T + L G NT LS+ VG
Sbjct: 496 NGENLKLTVMSAGHALHIFINGQLKGTVYGSVDDPKLTYTGNVKLWAGSNTISCLSIAVG 555
Query: 556 LQNYGAFYEKTGAGITGPVQLKGSGNGTNIDLSSQQWTYQTGLKGEELNFP--SGSST-Q 612
L N G +E AGI GPV L G G DL+ Q+WTYQ GLKGE ++ SGSST +
Sbjct: 556 LPNVGEHFETWNAGILGPVTLDGLNEGRR-DLTWQKWTYQVGLKGESMSLHSLSGSSTVE 614
Query: 613 WDSKSTLPKLQPLVWYKTTFDAPAGSEPVAIDFTGMGKGEAWVNGQSIGRYWPTYVSQNG 672
W + QPL WYK F+AP G EP+A+D + MGKG+ W+NGQ IGRYWP Y + +G
Sbjct: 615 WGEPV---QKQPLTWYKAFFNAPDGDEPLALDMSSMGKGQIWINGQGIGRYWPGYKA-SG 670
Query: 673 GCTDSCNYRGAYSSNKCLKNCGKPSQSLYHVPRSWLKSSGNTLVLFEEIGGDPTKISFVT 732
C +C+YRG Y KC NCG SQ YHVPRSWL +GN LV+FEE GGDPT IS V
Sbjct: 671 NC-GTCDYRGEYDETKCQTNCGDSSQRWYHVPRSWLSPTGNLLVIFEEWGGDPTGISMVK 729
Query: 733 KQLGSSLCSHVTDSHPLPVDMWGSDSKIQRKPGPVLSLECPNPNQVISSIKFASFGTPLG 792
+ +G S+C+ V++ P + W + + K + L+C N Q I+ IKFASFGTP G
Sbjct: 730 RSIG-SVCADVSEWQP-SMKNWHTKDYEKAK----VHLQCDN-GQKITEIKFASFGTPQG 782
Query: 793 TCGSFSRGRCSSARSLSVVRQACVGSKSCSIGVSVNTF-GDPCKGVMKSLAVEASC 847
+CGS++ G C + +S + + CVG + C + V F GDPC G MK VEA C
Sbjct: 783 SCGSYTEGGCHAHKSYDIFWKNCVGQERCGVSVVPEIFGGDPCPGTMKRAVVEAIC 838
>gi|357518749|ref|XP_003629663.1| Beta-galactosidase [Medicago truncatula]
gi|355523685|gb|AET04139.1| Beta-galactosidase [Medicago truncatula]
Length = 912
Score = 928 bits (2398), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 480/865 (55%), Positives = 582/865 (67%), Gaps = 54/865 (6%)
Query: 26 NVTYDHRAVVIGGKRRVLISGSIHYPRSTPEMWPDLIQKSKDGGLDVIETYVFWNLHEPV 85
NVTYDHRA++I G RR+LIS IHYPR+TPEMWPDLI K+K+GG+DVIETYVFWN H+PV
Sbjct: 49 NVTYDHRALIIDGHRRMLISAGIHYPRATPEMWPDLIAKAKEGGVDVIETYVFWNGHQPV 108
Query: 86 RNQYNFEGRYDLVKFVKLVAEAGLYAHLRIGPYVCAEWNFGGFPLWLHFIPGIQFRTDNE 145
+ QYNFEGRYDLVKF KLVA GLY LRIGPY CAEWNFGGFP+WL IPGI+FRT+N
Sbjct: 109 KGQYNFEGRYDLVKFAKLVASNGLYFFLRIGPYACAEWNFGGFPVWLRDIPGIEFRTNNA 168
Query: 146 PFKAEMQRFTAKIVDMMKQEKLYASQGGPIILSQ------IENEYGNIDSAYGAAGKSYI 199
PFK EM+RF +K+V++M++E L++ QGGPIIL Q IENEYGN++S+YG GK Y+
Sbjct: 169 PFKEEMKRFVSKVVNLMREEMLFSWQGGPIILLQVRREYGIENEYGNLESSYGNEGKEYV 228
Query: 200 KWAAGMALSLDTGVPWVMCQQSDAPDPIINTCNGFYCDQFTPNSNNKPKMWTENWSGWFL 259
KWAA MALSL GVPWVMC+Q DAP II+TCN +YCD F PNS NKP WTENW GW+
Sbjct: 229 KWAASMALSLGAGVPWVMCKQPDAPYDIIDTCNAYYCDGFKPNSRNKPIFWTENWDGWYT 288
Query: 260 SFGGAVPYRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNFDRTSGGPFISTSYDYDAPLD 319
+G +P+RPVEDLAFAVARFFQRGG+ QNYYMY GGTNF RT+GGP TSYDYDAP+D
Sbjct: 289 QWGERLPHRPVEDLAFAVARFFQRGGSLQNYYMYFGGTNFGRTAGGPLQITSYDYDAPID 348
Query: 320 EYGLIRQPKWGHLKDLHKAIKLCEAALVATD-PTYPSLGPNLEATVYKTG---------- 368
EYGL+ +PKWGHLKDLH A+KLCE ALVA D PTY LG EA VY+
Sbjct: 349 EYGLLNEPKWGHLKDLHAALKLCEPALVAADSPTYIKLGSKQEAHVYQENVHREGLNLSI 408
Query: 369 ---SGLCSAFLANIGTNSDVTVKFNGNSYLLPAWSVSILPDCKNVVFNTAKI---NSVTL 422
S CSAFLANI TV F G +Y LP WSVSILPDC++ +FNTAK+ SV L
Sbjct: 409 SQISNKCSAFLANIDERKAATVTFRGQTYTLPPWSVSILPDCRSAIFNTAKVGAQTSVKL 468
Query: 423 VPS---------FSRQSLQVAADSSDAIGSGWSYINEPVGISKDDAFTKPGLLEQINTTA 473
V S S+QS+ S I W EP+ I + +FT G+ E +N T
Sbjct: 469 VGSNLPLTSNLLLSQQSIDHNGISH--ISKSWMTTKEPINIWINSSFTAEGIWEHLNVTK 526
Query: 474 DQSDYLWYSLSTNIKADEPLL--EDGSKTVLHVQSLGHALHAFINGKLVGSGYGSSSNAK 531
DQSDYLWYS + + L E+ + L + S+ L F+NG+L+G+ G A
Sbjct: 527 DQSDYLWYSTRIYVSDGDILFWKENAAHPKLAIDSVRDILRVFVNGQLIGNVVGHWVKAV 586
Query: 532 VTVDFPIALAPGKNTFDLLSLTVGLQNYGAFYEKTGAGITGPVQLKGSGNGTNIDLSSQQ 591
T+ F PG N LL+ TVGLQNYGAF EK GAGI G +++ G NG +IDLS
Sbjct: 587 QTLQF----QPGYNDLTLLTQTVGLQNYGAFIEKDGAGIRGTIKITGFENG-HIDLSKPL 641
Query: 592 WTYQTGLKGEELNFPSGSSTQWDSKSTLPKLQP--LVWYKTTFDAPAGSEPVAIDFTGMG 649
WTYQ GL+GE L F + S P P WYKT FD P G++PVA+D MG
Sbjct: 642 WTYQVGLQGEFLKFYNEESENAGWVELTPDAIPSTFTWYKTYFDVPGGNDPVALDLESMG 701
Query: 650 KGEAWVNGQSIGRYWPTYVSQNGGCTDSCNYRGAYSSNKCLKNCGKPSQSLYHVPRSWLK 709
KG+AWVNG IGRYW T VS GC C+YRGAY S+KC NCGKP+Q+LYHVPRSWLK
Sbjct: 702 KGQAWVNGHHIGRYW-TRVSPKTGC-QVCDYRGAYDSDKCTTNCGKPTQTLYHVPRSWLK 759
Query: 710 SSGNTLVLFEEIGGDPTKISFVTKQLGSSLCSHVTDSHPLPVD------MWGSDSKIQRK 763
+S N LV+ EE GG+P IS V S +C+ V+ S+ P+ + G
Sbjct: 760 ASNNFLVILEETGGNPLGIS-VKLHSASIVCAQVSQSYYPPMQKLLNASLLGQQEVSSND 818
Query: 764 PGPVLSLECPNPNQVISSIKFASFGTPLGTCGSFSRGRCSSARSLSVVRQACVGSKSCSI 823
P ++L C + N +ISSI FASFGTP G+C SFSRG C + S S+V +AC+G +SCSI
Sbjct: 819 MIPEMNLRCRDGN-IISSITFASFGTPGGSCQSFSRGNCHAPSSKSIVSKACLGKRSCSI 877
Query: 824 GVSVNTF-GDPCKGVMKSLAVEASC 847
+S + F GDPC+ V+K+L+VEA C
Sbjct: 878 KISSDVFGGDPCQDVVKTLSVEARC 902
>gi|356518796|ref|XP_003528063.1| PREDICTED: beta-galactosidase 10-like [Glycine max]
Length = 898
Score = 927 bits (2397), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 457/838 (54%), Positives = 582/838 (69%), Gaps = 12/838 (1%)
Query: 17 VLATTSFGANVTYDHRAVVIGGKRRVLISGSIHYPRSTPEMWPDLIQKSKDGGLDVIETY 76
V T + ANV+YD R+++I +R++LIS SIHYPRS P MWP L+Q +K+GG+DVIETY
Sbjct: 67 VTFTVASSANVSYDGRSLIIDAQRKLLISASIHYPRSVPAMWPGLVQTAKEGGVDVIETY 126
Query: 77 VFWNLHEPVRNQYNFEGRYDLVKFVKLVAEAGLYAHLRIGPYVCAEWNFGGFPLWLHFIP 136
VFWN HE Y F GR+DLVKF + V +AG+Y LRIGP+V AEWNFGG P+WLH++P
Sbjct: 127 VFWNGHELSPGNYYFGGRFDLVKFAQTVQQAGMYLILRIGPFVAAEWNFGGVPVWLHYVP 186
Query: 137 GIQFRTDNEPFKAEMQRFTAKIVDMMKQEKLYASQGGPIILSQIENEYGNIDSAYGAAGK 196
G FRT N+PF MQ+FT IV++MKQEKL+ASQGGPIIL+QIENEYG ++ Y GK
Sbjct: 187 GTVFRTYNQPFMYHMQKFTTYIVNLMKQEKLFASQGGPIILAQIENEYGYYENFYKEDGK 246
Query: 197 SYIKWAAGMALSLDTGVPWVMCQQSDAPDPIINTCNGFYCDQFTPNSNNKPKMWTENWSG 256
Y WAA MA+S +TGVPW+MCQQ DAPDP+I+TCN FYCDQFTP S N+PK+WTENW G
Sbjct: 247 KYALWAAKMAVSQNTGVPWIMCQQWDAPDPVIDTCNSFYCDQFTPTSPNRPKIWTENWPG 306
Query: 257 WFLSFGGAVPYRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNFDRTSGGPFISTSYDYDA 316
WF +FGG P+RP ED+AF+VARFFQ+GG+ NYYMYHGGTNF RT+GGPFI+TSYDYDA
Sbjct: 307 WFKTFGGRDPHRPAEDVAFSVARFFQKGGSVHNYYMYHGGTNFGRTAGGPFITTSYDYDA 366
Query: 317 PLDEYGLIRQPKWGHLKDLHKAIKLCEAALVATDPTYPSLGPNLEATVYKTGSGLCSAFL 376
P+DEYGL R PKWGHLK+LH+AIKLCE L+ SLGP++EA VY SG C+AF+
Sbjct: 367 PVDEYGLPRLPKWGHLKELHRAIKLCEHVLLNGKSVNISLGPSVEADVYTDSSGACAAFI 426
Query: 377 ANIGTNSDVTVKFNGNSYLLPAWSVSILPDCKNVVFNTAKINSVTLVPSFSRQSLQVAAD 436
+N+ +D TV+F S+ LPAWSVSILPDCKNVVFNTAK+ S T V + +SLQ +
Sbjct: 427 SNVDDKNDKTVEFRNASFHLPAWSVSILPDCKNVVFNTAKVTSQTSVVAMVPESLQQSDK 486
Query: 437 SSDAIGSGWSYINEPVGISKDDAFTKPGLLEQINTTADQSDYLWYSLSTNIKADEPLLED 496
++ W + E GI F K G ++ INTT D +DYLW++ S + +E L+
Sbjct: 487 VVNSF--KWDIVKEKPGIWGKADFVKNGFVDLINTTKDTTDYLWHTTSIFVSENEEFLKK 544
Query: 497 GSKTVLHVQSLGHALHAFINGKLVGSGYGSSSNAKVTVDFPIALAPGKNTFDLLSLTVGL 556
G+K VL ++S GHALHAF+N + G+G G+ ++A T PI+L GKN LL LTVGL
Sbjct: 545 GNKPVLLIESTGHALHAFVNQEYEGTGSGNGTHAPFTFKNPISLRAGKNEIALLCLTVGL 604
Query: 557 QNYGAFYEKTGAGITGPVQLKGSGNGTNIDLSSQQWTYQTGLKGEELNFPSG---SSTQW 613
Q G FY+ GAG+T V++KG NGT IDLSS WTY+ G++GE L G ++ W
Sbjct: 605 QTAGPFYDFVGAGLTS-VKIKGLNNGT-IDLSSYAWTYKIGVQGEYLRLYQGNGLNNVNW 662
Query: 614 DSKSTLPKLQPLVWYKTTFDAPAGSEPVAIDFTGMGKGEAWVNGQSIGRYWPTYVS-QNG 672
S S PK+QPL WYK DAP G EPV +D MGKG AW+NG+ IGRYWP ++
Sbjct: 663 TSTSEPPKMQPLTWYKAIVDAPPGDEPVGLDMLHMGKGLAWLNGEEIGRYWPRKSEFKSE 722
Query: 673 GCTDSCNYRGAYSSNKCLKNCGKPSQSLYHVPRSWLKSSGNTLVLFEEIGGDPTKISFVT 732
C C+YRG ++ +KC CG+P+Q YHVPRSW K SGN LVLFEE GGDP KI FV
Sbjct: 723 DCVKECDYRGKFNPDKCDTGCGEPTQRWYHVPRSWFKPSGNILVLFEEKGGDPEKIKFVR 782
Query: 733 KQLGSSLCSHVTDSHPLPVDMWGSDSKIQ-RKPGPVLSLECPNPNQVISSIKFASFGTPL 791
+++ S C+ V + +P + + KIQ K P L CP N IS++KFASFG+P
Sbjct: 783 RKV-SGACALVAEDYPSVALVSQGEDKIQSNKNIPFARLACPG-NTRISAVKFASFGSPS 840
Query: 792 GTCGSFSRGRCSSARSLSVVRQACVGSKSCSIGVSVNTF-GDPCKGVMKSLAVEASCT 848
GTCGS+ +G C S ++V +AC+ C I ++ F + C G+ + LAVEA C+
Sbjct: 841 GTCGSYLKGDCHDPNSSTIVEKACLNKNDCVIKLTEENFKSNLCPGLSRKLAVEAVCS 898
>gi|334184642|ref|NP_001189660.1| beta galactosidase 9 [Arabidopsis thaliana]
gi|330253651|gb|AEC08745.1| beta galactosidase 9 [Arabidopsis thaliana]
Length = 859
Score = 926 bits (2394), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 460/834 (55%), Positives = 570/834 (68%), Gaps = 34/834 (4%)
Query: 7 LLLVLCWGFVVLATTSFGA-NVTYDHRAVVIGGKRRVLISGSIHYPRSTPEMWPDLIQKS 65
L++ L F +L+ + F NV+YDHRA++I GKRR+L+S IHYPR+TPEMW DLI KS
Sbjct: 17 LIIALLVYFPILSGSYFKPFNVSYDHRALIIAGKRRMLVSAGIHYPRATPEMWSDLIAKS 76
Query: 66 KDGGLDVIETYVFWNLHEPVRNQYNFEGRYDLVKFVKLVAEAGLYAHLRIGPYVCAEWNF 125
K+GG DV++TYVFWN HEPV+ QYNFEGRYDLVKFVKL+ +GLY HLRIGPYVCAEWNF
Sbjct: 77 KEGGADVVQTYVFWNGHEPVKGQYNFEGRYDLVKFVKLIGSSGLYLHLRIGPYVCAEWNF 136
Query: 126 GGFPLWLHFIPGIQFRTDNEPFKAEMQRFTAKIVDMMKQEKLYASQGGPIILSQIENEYG 185
GGFP+WL IPGI+FRTDNEPFK EMQ+F KIVD+M++ KL+ QGGPII+ QIENEYG
Sbjct: 137 GGFPVWLRDIPGIEFRTDNEPFKKEMQKFVTKIVDLMREAKLFCWQGGPIIMLQIENEYG 196
Query: 186 NIDSAYGAAGKSYIKWAAGMALSLDTGVPWVMCQQSDAPDPIINTCNGFYCDQFTPNSNN 245
+++ +YG GK Y+KWAA MAL L GVPWVMC+Q+DAP+ II+ CNG+YCD F PNS
Sbjct: 197 DVEKSYGQKGKDYVKWAASMALGLGAGVPWVMCKQTDAPENIIDACNGYYCDGFKPNSRT 256
Query: 246 KPKMWTENWSGWFLSFGGAVPYRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNFDRTSGG 305
KP +WTE+W GW+ +GG++P+RP EDLAFAVARF+QRGG+FQNYYMY GGTNF RTSGG
Sbjct: 257 KPVLWTEDWDGWYTKWGGSLPHRPAEDLAFAVARFYQRGGSFQNYYMYFGGTNFGRTSGG 316
Query: 306 PFISTSYDYDAPLDEYGLIRQPKWGHLKDLHKAIKLCEAALVATD-PTYPSLGPNLEATV 364
PF TSYDYDAPLDEYGL +PKWGHLKDLH AIKLCE ALVA D P Y LG EA +
Sbjct: 317 PFYITSYDYDAPLDEYGLRSEPKWGHLKDLHAAIKLCEPALVAADAPQYRKLGSKQEAHI 376
Query: 365 Y----KTGSGLCSAFLANIGTNSDVTVKFNGNSYLLPAWSVSILPDCKNVVFNTAKINSV 420
Y +TG +C+AFLANI + VKFNG SY LP WSVSILPDC++V FNTAK+ +
Sbjct: 377 YHGDGETGGKVCAAFLANIDEHKSAHVKFNGQSYTLPPWSVSILPDCRHVAFNTAKVGAQ 436
Query: 421 TLV-------PSFSRQSLQ---VAADSSDAIGSGWSYINEPVGISKDDAFTKPGLLEQIN 470
T V PS S+ V D+ I W + EP+GI ++ FT GLLE +N
Sbjct: 437 TSVKTVESARPSLGSMSILQKVVRQDNVSYISKSWMALKEPIGIWGENNFTFQGLLEHLN 496
Query: 471 TTADQSDYLWYSLSTNIKADEPLL--EDGSKTVLHVQSLGHALHAFINGKLVGSGYGSSS 528
T D+SDYLW+ ++ D+ ++G + + + S+ L F+N +L GS G
Sbjct: 497 VTKDRSDYLWHKTRISVSEDDISFWKKNGPNSTVSIDSMRDVLRVFVNKQLAGSIVGHWV 556
Query: 529 NAKVTVDFPIALAPGKNTFDLLSLTVGLQNYGAFYEKTGAGITGPVQLKGSGNGTNIDLS 588
A P+ G N LL+ TVGLQNYGAF EK GAG G +L G NG ++DLS
Sbjct: 557 KAVQ----PVRFIQGNNDLLLLTQTVGLQNYGAFLEKDGAGFRGKAKLTGFKNG-DLDLS 611
Query: 589 SQQWTYQTGLKGEE---LNFPSGSSTQWDSKSTLPKLQPLVWYKTTFDAPAGSEPVAIDF 645
WTYQ GLKGE +W + T +WYKT FD PAG++PV ++
Sbjct: 612 KSSWTYQVGLKGEADKIYTVEHNEKAEWSTLETDASPSIFMWYKTYFDPPAGTDPVVLNL 671
Query: 646 TGMGKGEAWVNGQSIGRYWPTYVSQNGGCTDSCNYRGAYSSNKCLKNCGKPSQSLYHVPR 705
MG+G+AWVNGQ IGRYW +SQ GC +C+YRGAY+S+KC NCGKP+Q+ YHVPR
Sbjct: 672 ESMGRGQAWVNGQHIGRYW-NIISQKDGCDRTCDYRGAYNSDKCTTNCGKPTQTRYHVPR 730
Query: 706 SWLKSSGNTLVLFEEIGGDPTKISFVTKQLGSSLCSHVTDSHPLPVDMWGSDSKIQ---- 761
SWLK S N LVLFEE GG+P KIS T G LC V++SH P+ W + I
Sbjct: 731 SWLKPSSNLLVLFEETGGNPFKISVKTVTAG-ILCGQVSESHYPPLRKWSTPDYINGTMS 789
Query: 762 -RKPGPVLSLECPNPNQVISSIKFASFGTPLGTCGSFSRGRCSSARSLSVVRQA 814
P + L C + VISSI+FAS+GTP G+C FS G+C ++ SLS+V +
Sbjct: 790 INSVAPEVHLHCED-GHVISSIEFASYGTPRGSCDGFSIGKCHASNSLSIVSEV 842
>gi|242055159|ref|XP_002456725.1| hypothetical protein SORBIDRAFT_03g041450 [Sorghum bicolor]
gi|241928700|gb|EES01845.1| hypothetical protein SORBIDRAFT_03g041450 [Sorghum bicolor]
Length = 843
Score = 926 bits (2392), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 456/831 (54%), Positives = 579/831 (69%), Gaps = 19/831 (2%)
Query: 24 GANVTYDHRAVVIGGKRRVLISGSIHYPRSTPEMWPDLIQKSKDGGLDVIETYVFWNLHE 83
+NVTYDHR+++I G+RR++IS SIHYPRS PEMWP L+ ++KDGG D IETYVFWN HE
Sbjct: 26 ASNVTYDHRSLIISGRRRLIISTSIHYPRSVPEMWPKLVAEAKDGGADCIETYVFWNGHE 85
Query: 84 PVRNQYNFEGRYDLVKFVKLVAEAGLYAHLRIGPYVCAEWNFGGFPLWLHFIPGIQFRTD 143
QY FE R+DLV+FVK+V +AGL LRIGP+V AEWNFGG P+WLH++PG FRTD
Sbjct: 86 IAPGQYYFEDRFDLVRFVKVVKDAGLLLILRIGPFVAAEWNFGGVPVWLHYVPGTVFRTD 145
Query: 144 NEPFKAEMQRFTAKIVDMMKQEKLYASQGGPIILSQIENEYGNI-DSAYGAAGKSYIKWA 202
NEPFK+ M+ FT IV+MMK+E+L+ASQGG IIL+QIENEYG+ + AY GK Y WA
Sbjct: 146 NEPFKSHMKSFTTYIVNMMKKEQLFASQGGNIILAQIENEYGDYYEQAYAPGGKPYAMWA 205
Query: 203 AGMALSLDTGVPWVMCQQSDAPDPIINTCNGFYCDQFTPNSNNKPKMWTENWSGWFLSFG 262
A MA++ +TGVPW+MCQ+SDAPDP+IN+CNGFYCD F PNS KPK+WTENW GWF +FG
Sbjct: 206 ASMAVAQNTGVPWIMCQESDAPDPVINSCNGFYCDGFQPNSPTKPKLWTENWPGWFQTFG 265
Query: 263 GAVPYRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNFDRTSGGPFISTSYDYDAPLDEYG 322
+ P+RP ED+AFAVARFF++GG+ QNYY+YHGGTNF RT+GGPFI+TSYDYDAP+DEYG
Sbjct: 266 ESNPHRPPEDVAFAVARFFEKGGSVQNYYVYHGGTNFGRTTGGPFITTSYDYDAPIDEYG 325
Query: 323 LIRQPKWGHLKDLHKAIKLCEAALVATDPTYPSLGPNLEATVYKTGSGLCSAFLANIGTN 382
L R PKW HL+DLHK+I+LCE L+ + T+ SLGP EA +Y SG C AFLANI +
Sbjct: 326 LRRFPKWAHLRDLHKSIRLCEHTLLYGNTTFLSLGPKQEADIYSDQSGGCVAFLANIDSA 385
Query: 383 SDVTVKFNGNSYLLPAWSVSILPDCKNVVFNTAKINSVTLVPSFSRQSLQVAADSSDAIG 442
+D V F Y LPAWSVSILPDC+NVVFNTAK+ S T + + +SLQ +
Sbjct: 386 NDKVVTFRNRQYDLPAWSVSILPDCRNVVFNTAKVQSQTSMVAMVPESLQASKPER---- 441
Query: 443 SGWSYINEPVGISKDDAFTKPGLLEQINTTADQSDYLWYSLSTNIKADEPLLEDGSKTVL 502
W+ E GI + F + G ++ INTT D +DYLWY +T+ DE GS VL
Sbjct: 442 --WNIFRERTGIWGKNDFVRNGFVDHINTTKDSTDYLWY--TTSFSVDES-YSKGSHVVL 496
Query: 503 HVQSLGHALHAFINGKLVGSGYGSSSNAKVTVDFPIALAPGKNTFDLLSLTVGLQNYGAF 562
++ S GH +HAF+N + +GS YG+ S + +V PI L GKN LLS+TVGLQN G
Sbjct: 497 NIDSKGHGVHAFLNNEFIGSAYGNGSQSSFSVKLPINLRTGKNELALLSMTVGLQNAGFS 556
Query: 563 YEKTGAGITGPVQLKGSGNGTNIDLSSQQWTYQTGLKGE--ELNFPSGSSTQ-WDSKSTL 619
YE GAG T V + G NGT I+LSS W Y+ GL+GE L P + Q W +S
Sbjct: 557 YEWIGAGFTN-VNISGVRNGT-INLSSNNWAYKIGLEGEYYSLFKPDQRNNQRWIPQSEP 614
Query: 620 PKLQPLVWYKTTFDAPAGSEPVAIDFTGMGKGEAWVNGQSIGRYWPTYVSQNGGCTDSCN 679
PK QPL WYK D P G +PV ID MGKG W+NG +IGRYWP S + CT SC+
Sbjct: 615 PKNQPLTWYKVNVDVPQGDDPVGIDMQSMGKGLVWLNGNAIGRYWPRTSSIDDRCTPSCD 674
Query: 680 YRGAYSSNKCLKNCGKPSQSLYHVPRSWLKSSGNTLVLFEEIGGDPTKISFVTKQLGSSL 739
YRG ++ NKC CG+P+Q YH+PRSW SGN LV+FEE GGDPTKI+F +++ +S+
Sbjct: 675 YRGEFNPNKCRTGCGQPTQRWYHIPRSWFHPSGNILVIFEEKGGDPTKITF-SRRAVTSV 733
Query: 740 CSHVTDSHP-LPVDMWGSDSKIQRKPGPVLSLECPNPNQVISSIKFASFGTPLGTCGSFS 798
CS V++ P + ++ W + + L CP + ISS+KFAS GTP GTC S+
Sbjct: 734 CSFVSEHFPSIDLESWDGSATNEGTSPAKAQLSCP-IGKNISSLKFASLGTPSGTCRSYQ 792
Query: 799 RGRCSSARSLSVVRQACVGSKSCSIGVSVNTFG-DPCKGVMKSLAVEASCT 848
+G C SLSVV +AC+ + SC++ +S +FG D C GV K+LA+EA C+
Sbjct: 793 KGSCHHPNSLSVVEKACLNTNSCTVSLSDESFGKDLCPGVTKTLAIEADCS 843
>gi|238009208|gb|ACR35639.1| unknown [Zea mays]
Length = 677
Score = 925 bits (2391), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 470/705 (66%), Positives = 549/705 (77%), Gaps = 35/705 (4%)
Query: 151 MQRFTAKIVDMMKQEKLYASQGGPIILSQIENEYGNIDSAYGAAGKSYIKWAAGMALSLD 210
MQRFTAKI ENEYGNIDSAYGA GK+Y++WAAGMA+SLD
Sbjct: 1 MQRFTAKI----------------------ENEYGNIDSAYGAPGKAYMRWAAGMAVSLD 38
Query: 211 TGVPWVMCQQSDAPDPIINTCNGFYCDQFTPNSNNKPKMWTENWSGWFLSFGGAVPYRPV 270
TGVPWVMCQQ+DAPDP+INTCNGFYCDQFTPNS KPKMWTENWSGWFLSFGGAVPYRPV
Sbjct: 39 TGVPWVMCQQADAPDPLINTCNGFYCDQFTPNSAAKPKMWTENWSGWFLSFGGAVPYRPV 98
Query: 271 EDLAFAVARFFQRGGTFQNYYMYHGGTNFDRTSGGPFISTSYDYDAPLDEYGLIRQPKWG 330
EDLAFAVARF+QRGGTFQNYYMYHGGTN DR+SGGPFI+TSYDYDAP+DEYGL+RQPKWG
Sbjct: 99 EDLAFAVARFYQRGGTFQNYYMYHGGTNLDRSSGGPFIATSYDYDAPIDEYGLVRQPKWG 158
Query: 331 HLKDLHKAIKLCEAALVATDPTYPSLGPNLEATVYKTGSGLCSAFLANIGTNSDVTVKFN 390
HL+D+HKAIKLCE AL+ATDP+Y SLGPN+EA VYK GS +C+AFLANI SD TV FN
Sbjct: 159 HLRDVHKAIKLCEPALIATDPSYTSLGPNVEAAVYKVGS-VCAAFLANIDGQSDKTVTFN 217
Query: 391 GNSYLLPAWSVSILPDCKNVVFNTAKINSVTLVPSFSR-QSLQVAADSS----DAIGSGW 445
G Y LPAWSVSILPDCKNVV NTA+INS T +S VA+D S + S W
Sbjct: 218 GKMYRLPAWSVSILPDCKNVVLNTAQINSQTTGSEMRYLESSNVASDGSFVTPELAVSDW 277
Query: 446 SYINEPVGISKDDAFTKPGLLEQINTTADQSDYLWYSLSTNIKADEPLLEDGSKTVLHVQ 505
SY EPVGI+KD+A TK GL+EQINTTAD SD+LWYS S +K DEP L +GS++ L V
Sbjct: 278 SYAIEPVGITKDNALTKAGLMEQINTTADASDFLWYSTSITVKGDEPYL-NGSQSNLAVN 336
Query: 506 SLGHALHAFINGKLVGSGYGSSSNAKVTVDFPIALAPGKNTFDLLSLTVGLQNYGAFYEK 565
SLGH L +INGK+ GS GS+S++ ++ PI L PGKN DLLS TVGL NYGAF++
Sbjct: 337 SLGHVLQVYINGKIAGSAQGSASSSLISWQKPIELVPGKNKIDLLSATVGLSNYGAFFDL 396
Query: 566 TGAGITGPVQLKGSGNGTNIDLSSQQWTYQTGLKGEELNF--PSGSSTQWDSKSTLPKLQ 623
GAGITGPV+L G NG +DLSS +WTYQ GL+GE+L+ PS +S +W S + P
Sbjct: 397 VGAGITGPVKLSGL-NGA-LDLSSAEWTYQIGLRGEDLHLYDPSEASPEWVSANAYPINH 454
Query: 624 PLVWYKTTFDAPAGSEPVAIDFTGMGKGEAWVNGQSIGRYWPTYVSQNGGCTDSCNYRGA 683
PL+WYKT F PAG +PVAIDFTGMGKGEAWVNGQSIGRYWPT ++ GC +SCNYRGA
Sbjct: 455 PLIWYKTKFTPPAGDDPVAIDFTGMGKGEAWVNGQSIGRYWPTNLAPQSGCVNSCNYRGA 514
Query: 684 YSSNKCLKNCGKPSQSLYHVPRSWLKSSGNTLVLFEEIGGDPTKISFVTKQLGSSLCSHV 743
YSS+KCLK CG+PSQ+LYHVPRS+L+ N LVLFE GGDP+KISFV +Q G S+C+ V
Sbjct: 515 YSSSKCLKKCGQPSQTLYHVPRSFLQPGSNDLVLFEHFGGDPSKISFVMRQTG-SVCAQV 573
Query: 744 TDSHPLPVDMWGSDSKIQRKPGPVLSLECPNPNQVISSIKFASFGTPLGTCGSFSRGRCS 803
+++HP +D W S +QR GP L LECP QVISS+KFASFGTP GTCGS+S G CS
Sbjct: 574 SEAHPAQIDSWSSQQPMQRY-GPALRLECPKEGQVISSVKFASFGTPSGTCGSYSHGECS 632
Query: 804 SARSLSVVRQACVGSKSCSIGVSVNTFGDPCKGVMKSLAVEASCT 848
S ++LS+V++AC+G SCS+ VS N FG+PC GV KSLAVEA+C+
Sbjct: 633 STQALSIVQEACIGVSSCSVPVSSNYFGNPCTGVTKSLAVEAACS 677
>gi|356508931|ref|XP_003523206.1| PREDICTED: beta-galactosidase 10-like [Glycine max]
Length = 843
Score = 925 bits (2391), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 457/835 (54%), Positives = 581/835 (69%), Gaps = 12/835 (1%)
Query: 20 TTSFGANVTYDHRAVVIGGKRRVLISGSIHYPRSTPEMWPDLIQKSKDGGLDVIETYVFW 79
T + NV+YD R+++I G+R++LIS SIHYPRS P MWP L+Q +K+GG+DVIETYVFW
Sbjct: 15 TVALSGNVSYDGRSLLIDGQRKLLISASIHYPRSVPAMWPGLVQTAKEGGVDVIETYVFW 74
Query: 80 NLHEPVRNQYNFEGRYDLVKFVKLVAEAGLYAHLRIGPYVCAEWNFGGFPLWLHFIPGIQ 139
N HE Y F GR+DLVKF K V +AG+Y LRIGP+V AEWNFGG P+WLH++PG
Sbjct: 75 NGHELSPGNYYFGGRFDLVKFAKTVQQAGMYLILRIGPFVAAEWNFGGVPVWLHYVPGTV 134
Query: 140 FRTDNEPFKAEMQRFTAKIVDMMKQEKLYASQGGPIILSQIENEYGNIDSAYGAAGKSYI 199
FRT N+PF MQ+FT IV++MKQEKL+ASQGGPIILSQIENEYG ++ Y GK Y
Sbjct: 135 FRTYNQPFMYHMQKFTTYIVNLMKQEKLFASQGGPIILSQIENEYGYYENFYKEDGKKYA 194
Query: 200 KWAAGMALSLDTGVPWVMCQQSDAPDPIINTCNGFYCDQFTPNSNNKPKMWTENWSGWFL 259
WAA MA+S +TGVPW+MCQQ DAPDP+I+TCN FYCDQFTP S N+PK+WTENW GWF
Sbjct: 195 LWAAKMAVSQNTGVPWIMCQQWDAPDPVIDTCNSFYCDQFTPTSPNRPKIWTENWPGWFK 254
Query: 260 SFGGAVPYRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNFDRTSGGPFISTSYDYDAPLD 319
+FGG P+RP ED+AF+VARFFQ+GG+ NYYMYHGGTNF RT+GGPFI+TSYDYDAP+D
Sbjct: 255 TFGGRDPHRPAEDVAFSVARFFQKGGSVHNYYMYHGGTNFGRTAGGPFITTSYDYDAPVD 314
Query: 320 EYGLIRQPKWGHLKDLHKAIKLCEAALVATDPTYPSLGPNLEATVYKTGSGLCSAFLANI 379
EYGL R PKWGHLK+LH+AIKLCE L+ SLGP++EA VY SG C+AF++N+
Sbjct: 315 EYGLPRLPKWGHLKELHRAIKLCEHVLLNGKSVNISLGPSVEADVYTDSSGACAAFISNV 374
Query: 380 GTNSDVTVKFNGNSYLLPAWSVSILPDCKNVVFNTAKINSVTLVPSFSRQSLQVAADSSD 439
+D TV+F SY LPAWSVSILPDCKNVVFNTAK+ S T V + +SLQ + +
Sbjct: 375 DDKNDKTVEFRNASYHLPAWSVSILPDCKNVVFNTAKVTSQTNVVAMIPESLQQSDKGVN 434
Query: 440 AIGSGWSYINEPVGISKDDAFTKPGLLEQINTTADQSDYLWYSLSTNIKADEPLLEDGSK 499
++ W + E GI F K G ++ INTT D +DYLW++ S + +E L+ GSK
Sbjct: 435 SL--KWDIVKEKPGIWGKADFVKSGFVDLINTTKDTTDYLWHTTSIFVSENEEFLKKGSK 492
Query: 500 TVLHVQSLGHALHAFINGKLVGSGYGSSSNAKVTVDFPIALAPGKNTFDLLSLTVGLQNY 559
VL ++S GHALHAF+N + G+G G+ +++ + PI+L GKN LL LTVGLQ
Sbjct: 493 PVLLIESTGHALHAFVNQEYQGTGTGNGTHSPFSFKNPISLRAGKNEIALLCLTVGLQTA 552
Query: 560 GAFYEKTGAGITGPVQLKGSGNGTNIDLSSQQWTYQTGLKGEELNFPSG---SSTQWDSK 616
G FY+ GAG+T V++KG NGT IDLSS WTY+ G++GE L G + W S
Sbjct: 553 GPFYDFIGAGLTS-VKIKGLKNGT-IDLSSYAWTYKIGVQGEYLRLYQGNGLNKVNWTST 610
Query: 617 STLPKLQPLVWYKTTFDAPAGSEPVAIDFTGMGKGEAWVNGQSIGRYWPTYVS-QNGGCT 675
S K+QPL WYK DAP G EPV +D MGKG AW+NG+ IGRYWP ++ C
Sbjct: 611 SEPQKMQPLTWYKAIVDAPPGDEPVGLDMLHMGKGLAWLNGEEIGRYWPRKSEFKSEDCV 670
Query: 676 DSCNYRGAYSSNKCLKNCGKPSQSLYHVPRSWLKSSGNTLVLFEEIGGDPTKISFVTKQL 735
C+YRG ++ +KC CG+P+Q YHVPRSW K SGN LVLFEE GGDP KI FV +++
Sbjct: 671 KECDYRGKFNPDKCDTGCGEPTQRWYHVPRSWFKPSGNILVLFEEKGGDPEKIKFVRRKV 730
Query: 736 GSSLCSHVTDSHPLPVDMWGSDSKIQ-RKPGPVLSLECPNPNQVISSIKFASFGTPLGTC 794
S C+ V + +P + + KIQ K P L CP+ N IS++KFASFGTP G+C
Sbjct: 731 -SGACALVAEDYPSVGLLSQGEDKIQNNKNVPFAHLTCPS-NTRISAVKFASFGTPSGSC 788
Query: 795 GSFSRGRCSSARSLSVVRQACVGSKSCSIGVSVNTFG-DPCKGVMKSLAVEASCT 848
GS+ +G C S ++V +AC+ C I ++ F + C G+ + LAVEA C+
Sbjct: 789 GSYLKGDCHDPNSSTIVEKACLNKNDCVIKLTEENFKTNLCPGLSRKLAVEAVCS 843
>gi|242036825|ref|XP_002465807.1| hypothetical protein SORBIDRAFT_01g046160 [Sorghum bicolor]
gi|241919661|gb|EER92805.1| hypothetical protein SORBIDRAFT_01g046160 [Sorghum bicolor]
Length = 842
Score = 925 bits (2390), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 458/829 (55%), Positives = 579/829 (69%), Gaps = 25/829 (3%)
Query: 28 TYDHRAVVIGGKRRVLISGSIHYPRSTPEMWPDLIQKSKDGGLDVIETYVFWNLHEPVRN 87
TYD +AV+I G+RR+L SGSIHYPRSTP+MW LIQK+KDGGLDVI+TYVFWN HEP
Sbjct: 28 TYDKKAVLIDGQRRILFSGSIHYPRSTPDMWEGLIQKAKDGGLDVIQTYVFWNGHEPTPG 87
Query: 88 QYNFEGRYDLVKFVKLVAEAGLYAHLRIGPYVCAEWNFGGFPLWLHFIPGIQFRTDNEPF 147
Y FE RYDLV+F+K V +AGL+ HLRIGPY+C EWNFGGFP+WL ++PGI FRTDNEPF
Sbjct: 88 NYYFEERYDLVRFIKTVQKAGLFVHLRIGPYICGEWNFGGFPVWLKYVPGISFRTDNEPF 147
Query: 148 KAEMQRFTAKIVDMMKQEKLYASQGGPIILSQIENEYGNIDSAYGAAGKSYIKWAAGMAL 207
K MQ FT KIV MMK EKL+ASQGGPIILSQIENEYG GAAG++YI WAA MA+
Sbjct: 148 KTAMQGFTEKIVGMMKSEKLFASQGGPIILSQIENEYGPEGKELGAAGQAYINWAAKMAI 207
Query: 208 SLDTGVPWVMCQQSDAPDPIINTCNGFYCDQFTPNSNNKPKMWTENWSGWFLSFGGAVPY 267
L TGVPWVMC++ DAPDP+IN CNGFYCD F+PN KP MWTE WSGWF FGG +
Sbjct: 208 GLGTGVPWVMCKEEDAPDPVINACNGFYCDAFSPNKPYKPTMWTEAWSGWFTEFGGTIRQ 267
Query: 268 RPVEDLAFAVARFFQRGGTFQNYYMYHGGTNFDRTSGGPFISTSYDYDAPLDEYGLIRQP 327
RPVEDLAFAVARF Q+GG+F NYYMYHGGTNF RT+GGPFI+TSYDYDAP+DEYGL+R+P
Sbjct: 268 RPVEDLAFAVARFVQKGGSFINYYMYHGGTNFGRTAGGPFITTSYDYDAPIDEYGLVREP 327
Query: 328 KWGHLKDLHKAIKLCEAALVATDPTYPSLGPNLEATVYKTGSGLCSAFLANIGTNSDVTV 387
K HLK+LH+A+KLCE ALV+ DP +LG EA V+++ SG C+AFLAN +NS V
Sbjct: 328 KHSHLKELHRAVKLCEQALVSVDPAITTLGTMQEAHVFRSPSG-CAAFLANYNSNSYAKV 386
Query: 388 KFNGNSYLLPAWSVSILPDCKNVVFNTAKINSVTLVPSFSRQSLQVAADSSDAIGSGWSY 447
FN Y LP WS+SILPDCKNVVFN+A + T +Q+ D + ++ W
Sbjct: 387 VFNNEQYSLPPWSISILPDCKNVVFNSATVGVQT-------SQMQMWGDGASSM--MWER 437
Query: 448 INEPV-GISKDDAFTKPGLLEQINTTADQSDYLWYSLSTNIKADEPLLEDGSKTV-LHVQ 505
+E V ++ T GLLEQ+N T D SDYLWY S +I E L+ G K + L V
Sbjct: 438 YDEEVDSLAAAPLLTTTGLLEQLNVTRDSSDYLWYITSVDISPSENFLQGGGKPLSLSVL 497
Query: 506 SLGHALHAFINGKLVGSGYGSSSNAKVTVDFPIALAPGKNTFDLLSLTVGLQNYGAFYEK 565
S GHALH F+NG+L GS YG+ + ++ + L G N LLS+ GL N G YE
Sbjct: 498 SAGHALHVFVNGELQGSAYGTREDRRIKYNGNANLRAGTNKIALLSVACGLPNVGVHYET 557
Query: 566 TGAGITGPVQLKGSGNGTNIDLSSQQWTYQTGLKGEELNFPS---GSSTQWDSKSTLPK- 621
G+ GPV L G G+ DL+ Q W+YQ GLKGE++N S +S +W S + +
Sbjct: 558 WNTGVGGPVGLHGLNEGSR-DLTWQTWSYQVGLKGEQMNLNSLEGSTSVEWMQGSLIAQN 616
Query: 622 LQPLVWYKTTFDAPAGSEPVAIDFTGMGKGEAWVNGQSIGRYWPTYVSQNGGCTDSCNYR 681
QPL WY+ F+ P+G EP+A+D MGKG+ W+NGQSIGRYW Y +G C + C+Y
Sbjct: 617 QQPLSWYRAYFETPSGDEPLALDMGSMGKGQIWINGQSIGRYWTAYA--DGDCKE-CSYT 673
Query: 682 GAYSSNKCLKNCGKPSQSLYHVPRSWLKSSGNTLVLFEEIGGDPTKISFVTKQLGSSLCS 741
G + + KC CG+P+Q YHVPRSWL+ + N LV+FEE+GGD +KI+ V + + SS+C+
Sbjct: 674 GTFRAPKCQAGCGQPTQRWYHVPRSWLQPTRNLLVVFEELGGDSSKIALVKRSV-SSVCA 732
Query: 742 HVTDSHPLPVDMWGSDSKIQRKPGPV-LSLECPNPNQVISSIKFASFGTPLGTCGSFSRG 800
V++ HP + W +S +R+ + L C +P Q IS+IKFASFGTP+GTCG+F +G
Sbjct: 733 DVSEDHP-NIKNWQIESYGEREYHRAKVHLRC-SPGQSISAIKFASFGTPMGTCGNFQQG 790
Query: 801 RCSSARSLSVVRQACVGSKSCSIGVSVNTF-GDPCKGVMKSLAVEASCT 848
C SA S +V+ + C+G + C++ +S +F GDPC V K +AVEA C+
Sbjct: 791 DCHSANSHTVLEKKCIGLQRCAVAISPESFGGDPCPRVTKRVAVEAVCS 839
>gi|242084926|ref|XP_002442888.1| hypothetical protein SORBIDRAFT_08g004410 [Sorghum bicolor]
gi|241943581|gb|EES16726.1| hypothetical protein SORBIDRAFT_08g004410 [Sorghum bicolor]
Length = 923
Score = 925 bits (2390), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 465/853 (54%), Positives = 577/853 (67%), Gaps = 39/853 (4%)
Query: 26 NVTYDHRAVVIGGKRRVLISGSIHYPRSTPEMWPDLIQKSKDGGLDVIETYVFWNLHEPV 85
NVTYDHRA+++GGKRR+L+S +HYPR+TPEMWP LI K+K+GG+DVIETY+FWN HEP
Sbjct: 68 NVTYDHRALILGGKRRMLVSAGLHYPRATPEMWPSLIAKAKEGGVDVIETYIFWNGHEPA 127
Query: 86 RNQYNFEGRYDLVKFVKLVAEAGLYAHLRIGPYVCAEWNFGGFPLWLHFIPGIQFRTDNE 145
+ QY FEGR+D+V+F KLVA GL+ LRIGPY CAEWNFGGFP+WL IPGI+FRTDNE
Sbjct: 128 KGQYYFEGRFDIVRFAKLVAAEGLFLFLRIGPYACAEWNFGGFPVWLRDIPGIEFRTDNE 187
Query: 146 PFKAEMQRFTAKIVDMMKQEKLYASQGGPIILSQIENEYGNIDSAYGAAGKSYIKWAAGM 205
P+KAEMQ F KIVD+MK+EKLY+ QGGPIIL QIENEYGNI YG AGK Y++WAA M
Sbjct: 188 PYKAEMQNFVTKIVDIMKEEKLYSWQGGPIILQQIENEYGNIQGKYGQAGKRYMQWAAQM 247
Query: 206 ALSLDTGVPWVMCQQSDAPDPIINTCNGFYCDQFTPNSNNKPKMWTENWSGWFLSFGGAV 265
AL+LDTGVPWVMC+Q+DAP+ I++TCN FYCD F PNS NKP +WTE+W GW+ +G A+
Sbjct: 248 ALALDTGVPWVMCRQTDAPEQILDTCNAFYCDGFKPNSYNKPTIWTEDWDGWYADWGEAL 307
Query: 266 PYRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNFDRTSGGPFISTSYDYDAPLDEYGLIR 325
P+RP +D AFAVARF+QRGG+FQNYYMY GGTNF+RT+GGP TSYDYDAP+DEYG++R
Sbjct: 308 PHRPAQDSAFAVARFYQRGGSFQNYYMYFGGTNFERTAGGPLQITSYDYDAPIDEYGILR 367
Query: 326 QPKWGHLKDLHKAIKLCEAALVATD--PTYPSLGPNLEATVYKT-----------GSGLC 372
QPKWGHLKDLH AIKLCE AL A D P Y LGP EA VY + + C
Sbjct: 368 QPKWGHLKDLHAAIKLCEPALTAVDGSPRYIKLGPMQEAHVYSSENVHTNGSISGNAQFC 427
Query: 373 SAFLANIGTNSDVTVKFNGNSYLLPAWSVSILPDCKNVVFNTAKINSVTLV-------PS 425
SAFLANI + +V G SY LP WSVSILPDC+ V FNTA++ + T PS
Sbjct: 428 SAFLANIDEHKYASVWIFGKSYSLPPWSVSILPDCETVAFNTARVGTQTSFFNVESGSPS 487
Query: 426 F-SRQSLQVAADSSDAIGSGWSYINEPVGISKDDAFTKPGLLEQINTTADQSDYLWYSLS 484
+ SR ++ + + S W EPVGI +D F G+LE +N T D SDYL Y+
Sbjct: 488 YSSRHKPRILSLGGPYLSSTWWASKEPVGIWSEDIFAAQGILEHLNVTKDISDYLSYTTR 547
Query: 485 TNIKADEPLL---EDGSKTVLHVQSLGHALHAFINGKLVGSGYGSSSNAKVTVDFPIALA 541
NI +DE +L +G L + + + F+NGKL GS G V+++ P+ L
Sbjct: 548 VNI-SDEDVLYWNSEGLLPSLTIDQIRDVVRIFVNGKLAGSQVGH----WVSLNQPLQLV 602
Query: 542 PGKNTFDLLSLTVGLQNYGAFYEKTGAGITGPVQLKGSGNGTNIDLSSQQWTYQTGLKGE 601
G N LLS VGLQNYGAF EK GAG G V+L G NG +IDL++ WTYQ GLKGE
Sbjct: 603 QGLNELTLLSEIVGLQNYGAFLEKDGAGFRGQVKLTGLSNG-DIDLTNSLWTYQIGLKGE 661
Query: 602 ELNFPSGS---STQWDSKSTLPKLQPLVWYKTTFDAPAGSEPVAIDFTGMGKGEAWVNGQ 658
S S W S L P W+KTTFDAP G+ PVAID MGKG+AWVNG
Sbjct: 662 FSRIYSPEKQGSAGWSSMQNDDTLSPFTWFKTTFDAPEGNGPVAIDLGSMGKGQAWVNGH 721
Query: 659 SIGRYWPTYVSQNGGCTDSCNYRGAYSSNKCLKNCGKPSQSLYHVPRSWLKSSGNTLVLF 718
IGRYW + V+ GC SCNY G Y +KC NCG +QS YH+PR WL+ S N LVLF
Sbjct: 722 LIGRYW-SLVAPESGCPSSCNYAGNYGDSKCRSNCGIATQSWYHIPREWLQESDNLLVLF 780
Query: 719 EEIGGDPTKISFVTKQLGSSLCSHVTDSHPLPVDMWGSDSKIQ---RKPGPVLSLECPNP 775
EE GGDP++IS ++CS +++++ P+ W + + P L L+C +
Sbjct: 781 EETGGDPSQISLEV-HYTKTICSKISETYYPPLSAWSRAANGRPSVNTVAPELRLQC-DE 838
Query: 776 NQVISSIKFASFGTPLGTCGSFSRGRCSSARSLSVVRQACVGSKSCSIGVSVNTFGDPCK 835
VIS I FAS+GTP G C +FS G C ++ +L +V +AC G C+I V+ + FGDPC+
Sbjct: 839 GHVISKITFASYGTPTGDCQNFSVGNCHASTTLDLVAEACEGKNRCAISVTNDVFGDPCR 898
Query: 836 GVMKSLAVEASCT 848
V+K LAV A C+
Sbjct: 899 KVVKDLAVVAECS 911
>gi|61162208|dbj|BAD91085.1| beta-D-galactosidase [Pyrus pyrifolia]
Length = 848
Score = 924 bits (2388), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 459/853 (53%), Positives = 580/853 (67%), Gaps = 30/853 (3%)
Query: 6 ILLLVLCWGFVVLATTSFGANVTYDHRAVVIGGKRRVLISGSIHYPRSTPEMWPDLIQKS 65
+L + W V + T NV YD +A+VI G+RR+L SGSIHYPRSTPEMW LIQK+
Sbjct: 12 LLCCCIVWSSVYVEVTK--CNVVYDRKALVIDGQRRLLFSGSIHYPRSTPEMWEGLIQKA 69
Query: 66 KDGGLDVIETYVFWNLHEPVRNQYNFEGRYDLVKFVKLVAEAGLYAHLRIGPYVCAEWNF 125
KDGGLD I+TYVFWNLHEP YNFEGR DLV+F+K V +AGLY HLRIGPY+C+EWNF
Sbjct: 70 KDGGLDAIDTYVFWNLHEPSPGNYNFEGRNDLVRFIKTVHKAGLYVHLRIGPYICSEWNF 129
Query: 126 GGFPLWLHFIPGIQFRTDNEPFKAEMQRFTAKIVDMMKQEKLYASQGGPIILSQIENEYG 185
GGFP+WL F+PGI FRTDNEPFK+ MQ+FT K+V +MK EKL+ SQGGPIILSQIENEY
Sbjct: 130 GGFPVWLKFVPGISFRTDNEPFKSAMQKFTQKVVQLMKNEKLFESQGGPIILSQIENEYE 189
Query: 186 NIDSAYGAAGKSYIKWAAGMALSLDTGVPWVMCQQSDAPDPIINTCNGFYCDQFTPNSNN 245
A+GA+G +Y+ WAA MA+ + TGVPWVMC++ DAPDP+INTCNGFYCD F+PN
Sbjct: 190 PESKAFGASGYAYMTWAAKMAVGMGTGVPWVMCKEDDAPDPVINTCNGFYCDYFSPNKPY 249
Query: 246 KPKMWTENWSGWFLSFGGAVPYRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNFDRTSGG 305
KP MWTE WSGWF FGG + RPVEDL FAVARF Q+GG+F NYYMYHGGTNF RT+GG
Sbjct: 250 KPTMWTEAWSGWFTEFGGPIYQRPVEDLTFAVARFIQKGGSFINYYMYHGGTNFGRTAGG 309
Query: 306 PFISTSYDYDAPLDEYGLIRQPKWGHLKDLHKAIKLCEAALVATDPTYPSLGPNLEATVY 365
PFI+TSYDYDAP+DEYGLIR+PK+GHLK+LHKA+KLCE AL+ DPT +LG +A V+
Sbjct: 310 PFITTSYDYDAPIDEYGLIRRPKYGHLKELHKAVKLCELALLNADPTVTTLGSYEQAHVF 369
Query: 366 KTGSGLCSAFLANIGTNSDVTVKFNGNSYLLPAWSVSILPDCKNVVFNTAKINSVTLVPS 425
+ SG + FL+N T S V FN ++ LP WS+SILPDCKNV FNTA++ T
Sbjct: 370 SSKSGSGAVFLSNFNTKSATKVTFNNMNFHLPPWSISILPDCKNVAFNTARVGVQTSQTQ 429
Query: 426 FSRQSLQVAADSSDAIGSGWSYINEPV-GISKDDAFTKPGLLEQINTTADQSDYLWYSLS 484
R + ++ + W NE V ++ D T GLL+Q+N T D SDYLWY+ S
Sbjct: 430 LLRTNSELHS---------WGIFNEDVSSVAGDTTITVTGLLDQLNITRDSSDYLWYTTS 480
Query: 485 TNIKADEPLLEDGSKTVLHVQSLGHALHAFINGKLVGSGYGSSSNAKVTVDFPIALAPGK 544
+I E L G L VQS G A+H FIN +L GS G+ + + T + L G
Sbjct: 481 VDIDPSESFLGGGQHPSLTVQSAGDAMHVFINDQLSGSASGTREHRRFTFTGNVNLHAGL 540
Query: 545 NTFDLLSLTVGLQNYGAFYEKTGAGITGPVQLKGSGNGTNIDLSSQQWTYQTGLKGEELN 604
N LLS+ VGL N G +E G+ GPV L G +GT DLS Q+W+YQ GLKGE N
Sbjct: 541 NKISLLSIAVGLANNGPHFETRNTGVLGPVALHGLDHGTR-DLSWQKWSYQVGLKGEATN 599
Query: 605 FPSG---SSTQWDSKSTLP-KLQPLVWYKTTFDAPAGSEPVAIDFTGMGKGEAWVNGQSI 660
S S+ W + S + K QPL WYK FD P G EP+A+D MGKG+ W+NGQSI
Sbjct: 600 LDSPNSISAVDWMTGSLVAQKQQPLTWYKAYFDEPNGDEPLALDMGSMGKGQVWINGQSI 659
Query: 661 GRYWPTYVSQNGGCTDSCNYRGAYSSNKCLKNCGKPSQSLYHVPRSWLKSSGNTLVLFEE 720
GRYW Y + C+ +C Y G + KC C P+Q YHVPRSWLK S N LV+FEE
Sbjct: 660 GRYWTIYADSD--CS-ACTYSGTFRPKKCQFGCQHPTQQWYHVPRSWLKPSKNLLVVFEE 716
Query: 721 IGGDPTKISFVTKQLGSSLCSHVTDSHPLPVDMWGSDS----KIQRKPGPVLSLECPNPN 776
IGGD +K++ V K + +S+C+ V+++HP + W ++S ++Q+KP +SL C +
Sbjct: 717 IGGDVSKVALVKKSV-TSVCAEVSENHPR-ITNWHTESHGQTEVQQKPE--ISLHCTD-G 771
Query: 777 QVISSIKFASFGTPLGTCGSFSRGRCSSARSLSVVRQACVGSKSCSIGVSVNTFG-DPCK 835
IS+IKF+SFGTP G+CG F G C + S +V+++ C+G + CS+ +S FG DPC
Sbjct: 772 HSISAIKFSSFGTPSGSCGKFQHGTCHAPNSNAVLQKECLGKQKCSVTISNTNFGADPCP 831
Query: 836 GVMKSLAVEASCT 848
+K L+VEA C+
Sbjct: 832 SKLKKLSVEAVCS 844
>gi|414864995|tpg|DAA43552.1| TPA: hypothetical protein ZEAMMB73_935084 [Zea mays]
Length = 845
Score = 924 bits (2388), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 461/830 (55%), Positives = 576/830 (69%), Gaps = 26/830 (3%)
Query: 28 TYDHRAVVIGGKRRVLISGSIHYPRSTPEMWPDLIQKSKDGGLDVIETYVFWNLHEPVRN 87
TYD +AV+I G+RR+L SGSIHYPRSTP+MW LIQK+KDGGLDVI+TYVFWN HEP
Sbjct: 30 TYDKKAVLIDGQRRILFSGSIHYPRSTPDMWEGLIQKAKDGGLDVIQTYVFWNGHEPTPG 89
Query: 88 QYNFEGRYDLVKFVKLVAEAGLYAHLRIGPYVCAEWNFGGFPLWLHFIPGIQFRTDNEPF 147
Y FE RYDLV+FVK V +AGL+ HLRIGPY+C EWNFGGFP+WL ++PGI FRTDNEPF
Sbjct: 90 NYYFEERYDLVRFVKTVQKAGLFVHLRIGPYICGEWNFGGFPVWLKYVPGISFRTDNEPF 149
Query: 148 KAEMQRFTAKIVDMMKQEKLYASQGGPIILSQIENEYGNIDSAYGAAGKSYIKWAAGMAL 207
K MQ FT KIV MMK E L+ASQGGPIILSQIENEYG +GAAG++YI WAA MA+
Sbjct: 150 KTAMQGFTEKIVGMMKSENLFASQGGPIILSQIENEYGPEGKEFGAAGQAYINWAAKMAV 209
Query: 208 SLDTGVPWVMCQQSDAPDPIINTCNGFYCDQFTPNSNNKPKMWTENWSGWFLSFGGAVPY 267
LDTGVPWVMC++ DAPDP+IN CNGFYCD F+PN KP MWTE WSGWF FGG +
Sbjct: 210 GLDTGVPWVMCKEEDAPDPVINACNGFYCDAFSPNKPYKPTMWTEAWSGWFTEFGGTIRQ 269
Query: 268 RPVEDLAFAVARFFQRGGTFQNYYMYHGGTNFDRTSGGPFISTSYDYDAPLDEYGLIRQP 327
RPVEDLAFAVARF Q+GG+F NYYMYHGGTNF RT+GGPFI+TSYDYDAP+DEYGLIR+P
Sbjct: 270 RPVEDLAFAVARFVQKGGSFINYYMYHGGTNFGRTAGGPFITTSYDYDAPIDEYGLIREP 329
Query: 328 KWGHLKDLHKAIKLCEAALVATDPTYPSLGPNLEATVYKTGSGLCSAFLANIGTNSDVTV 387
K HLK+LH+A+KLCE ALV+ DPT +LG EA V+++ SG C+AFLAN +NS V
Sbjct: 330 KHSHLKELHRAVKLCEQALVSVDPTITTLGTMQEAHVFRSPSG-CAAFLANYNSNSHAKV 388
Query: 388 KFNGNSYLLPAWSVSILPDCKNVVFNTAKINSVTLVPSFSRQSLQVAADSSDAIGSGWSY 447
FN Y LP WS+SILPDCKNVVFN+A + T +Q+ D + ++ W
Sbjct: 389 VFNNEQYSLPPWSISILPDCKNVVFNSATVGVQT-------SQMQMWGDGATSM--MWER 439
Query: 448 INEPV-GISKDDAFTKPGLLEQINTTADQSDYLWYSLSTNIKADEPLLEDGSK-TVLHVQ 505
+E V ++ T GLLEQ+N T D SDYLWY S +I E L+ G K L VQ
Sbjct: 440 YDEEVDSLAAAPLLTTTGLLEQLNVTRDSSDYLWYITSVDISPSENFLQGGGKPPSLSVQ 499
Query: 506 SLGHALHAFINGKLVGSGYGSSSNAKVTVDFPIALAPGKNTFDLLSLTVGLQNYGAFYEK 565
S GHALH F+NG+L GS YG+ + ++ + + L G N LLS+ GL N G YE
Sbjct: 500 SAGHALHVFVNGQLQGSSYGTREDRRIKYNGNVNLRAGTNKIALLSVACGLPNVGVHYET 559
Query: 566 TGAGITGPVQLKGSGNGTNIDLSSQQWTYQTGLKGEELNFPS---GSSTQWDSKSTLP-K 621
G+ GPV L G G+ DL+ Q W+YQ GLKGE++N S S +W S + K
Sbjct: 560 WNTGVGGPVVLHGLNEGSR-DLTWQTWSYQVGLKGEQMNLNSVEGSGSVEWMQGSLIAQK 618
Query: 622 LQPLVWYKTTFDAPAGSEPVAIDFTGMGKGEAWVNGQSIGRYWPTYVSQNGGCTDSCNYR 681
QPL WYK F+ P+G EP+A+D MGKG+ W+NGQSIGRYW Y +G C C+Y
Sbjct: 619 QQPLAWYKAYFETPSGDEPLALDMGSMGKGQVWINGQSIGRYWTAYA--DGDC-KGCSYT 675
Query: 682 GAYSSNKCLKNCGKPSQSLYHVPRSWLKSSGNTLVLFEEI-GGDPTKISFVTKQLGSSLC 740
G + + KC CG+P+Q YHVPRSWL+ S N LV+ EE+ GGD +KI+ + + SS+C
Sbjct: 676 GTFRAPKCQAGCGQPTQRWYHVPRSWLQPSRNLLVVLEELGGGDSSKIALAKRSV-SSVC 734
Query: 741 SHVTDSHPLPVDMWGSDSKIQRKPGPV-LSLECPNPNQVISSIKFASFGTPLGTCGSFSR 799
+ V++ HP + W +S +R+ + L C + Q IS+I+FASFGTP+GTCG+F +
Sbjct: 735 ADVSEDHP-NIKKWQIESYGEREHRRAKVHLRCAH-GQSISAIRFASFGTPVGTCGNFQQ 792
Query: 800 GRCSSARSLSVVRQACVGSKSCSIGVSVNTF-GDPCKGVMKSLAVEASCT 848
G C SA S +V+ + C+G + C + +S + F GDPC V K +AVEA C+
Sbjct: 793 GGCHSASSHAVLEKRCIGLQRCVVAISPDNFGGDPCPSVTKRVAVEAVCS 842
>gi|350537549|ref|NP_001234298.1| beta-galactosidase precursor [Solanum lycopersicum]
gi|7939617|gb|AAF70821.1|AF154420_1 beta-galactosidase [Solanum lycopersicum]
Length = 892
Score = 922 bits (2384), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 469/880 (53%), Positives = 592/880 (67%), Gaps = 50/880 (5%)
Query: 6 ILLLVLCWGFVVLATTSFGA-NVTYDHRAVVIGGKRRVLISGSIHYPRSTPEMWPDLIQK 64
++L VL FV++A F NVTYD+RA++IGGKRR+LIS IHYPR+TPEMWP LI +
Sbjct: 15 LILTVLTIHFVIVAGEYFKPFNVTYDNRALIIGGKRRMLISAGIHYPRATPEMWPTLIAR 74
Query: 65 SKDGGLDVIETYVFWNLHEPVRNQYNFEGRYDLVKFVKLVAEAGLYAHLRIGPYVCAEWN 124
SK+GG DVIETY FWN HEP R QYNFEGRYD+VKF KLV GL+ +RIGPY CAEWN
Sbjct: 75 SKEGGADVIETYTFWNGHEPTRGQYNFEGRYDIVKFAKLVGSHGLFLFIRIGPYACAEWN 134
Query: 125 FGGFPLWLHFIPGIQFRTDNEPFKAEMQRFTAKIVDMMKQEKLYASQGGPIILSQIENEY 184
FGGFP+WL IPGI+FRTDN PFK EM+R+ KIVD+M E L++ QGGPIIL QIENEY
Sbjct: 135 FGGFPIWLRDIPGIEFRTDNAPFKEEMERYVKKIVDLMISESLFSWQGGPIILLQIENEY 194
Query: 185 GNIDSAYGAAGKSYIKWAAGMALSLDTGVPWVMCQQSDAPDPIINTCNGFYCDQFTPNSN 244
GN++S++G GK Y+KWAA MA+ L GVPWVMC+Q+DAP+ II+TCN +YCD FTPNS
Sbjct: 195 GNVESSFGPKGKLYMKWAAEMAVGLGAGVPWVMCRQTDAPEYIIDTCNAYYCDGFTPNSE 254
Query: 245 NKPKMWTENWSGWFLSFGGAVPYRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNFDRTSG 304
KPK+WTENW+GWF +G +PYRP ED+AFA+ARFFQRGG+ QNYYMY GGTNF RT+G
Sbjct: 255 KKPKIWTENWNGWFADWGERLPYRPSEDIAFAIARFFQRGGSLQNYYMYFGGTNFGRTAG 314
Query: 305 GPFISTSYDYDAPLDEYGLIRQPKWGHLKDLHKAIKLCEAALVATD-PTYPSLGPNLEAT 363
GP TSYDYDAPLDEYGL+RQPKWGHLKDLH AIKLCE ALVA D P Y LGP EA
Sbjct: 315 GPTQITSYDYDAPLDEYGLLRQPKWGHLKDLHAAIKLCEPALVAADSPQYIKLGPKQEAH 374
Query: 364 VYKTGS-----------GLCSAFLANIGTNSDVTVKFNGNSYLLPAWSVSILPDCKNVVF 412
VY+ S G+C+AF+ANI + TVKF G + LP WSV + +
Sbjct: 375 VYRGTSNNIGQYMSLNEGICAAFIANIDEHESATVKFYGQEFTLPPWSV-VFCQIAEIQL 433
Query: 413 NTA-----KINS---------VTLVPSFSRQSLQVAADSSDAIGSGWSYINEPVGISKDD 458
+T K+ S + ++ F + SL+ SS++ W + EP+G+ D
Sbjct: 434 STQLRWGHKLQSKQWAQILFQLGIILCFYKLSLKA---SSESFSQSWMTLKEPLGVWGDK 490
Query: 459 AFTKPGLLEQINTTADQSDYLWYSLSTNIKADEPLL--EDGSKTVLHVQSLGHALHAFIN 516
FT G+LE +N T DQSDYLWY I D+ E+ + + S+ + F+N
Sbjct: 491 NFTSKGILEHLNVTKDQSDYLWYLTRIYISDDDISFWEENDVSPTIDIDSMRDFVRIFVN 550
Query: 517 GKLVGSGYGSSSNAKVTVDFPIALAPGKNTFDLLSLTVGLQNYGAFYEKTGAGITGPVQL 576
G+L GS G + V P+ L G N LLS TVGLQNYGAF EK GAG G ++L
Sbjct: 551 GQLAGSVKGKW----IKVVQPVKLVQGYNDILLLSETVGLQNYGAFLEKDGAGFKGQIKL 606
Query: 577 KGSGNGTNIDLSSQQWTYQTGLKGEELNFPSGSSTQ---WDSKSTLPKLQPLVWYKTTFD 633
G +G +I+L++ WTYQ GL+GE L +ST+ W T WYKT FD
Sbjct: 607 TGCKSG-DINLTTSLWTYQVGLRGEFLEVYDVNSTESAGWTEFPTGTTPSVFSWYKTKFD 665
Query: 634 APAGSEPVAIDFTGMGKGEAWVNGQSIGRYWPTYVSQNGGCTDSCNYRGAYSSNKCLKNC 693
AP G++PVA+DF+ MGKG+AWVNG +GRYW T V+ N GC +C+YRGAY S+KC NC
Sbjct: 666 APGGTDPVALDFSSMGKGQAWVNGHHVGRYW-TLVAPNNGCGRTCDYRGAYHSDKCRTNC 724
Query: 694 GKPSQSLYHVPRSWLKSSGNTLVLFEEIGGDPTKISFVTKQLGSSLCSHVTDSHPLPVDM 753
G+ +Q+ YH+PRSWLK+ N LV+FEE P IS T+ ++C+ V++ H P+
Sbjct: 725 GEITQAWYHIPRSWLKTLNNVLVIFEETDKTPFDISISTRST-ETICAQVSEKHYPPLHK 783
Query: 754 WGSDSKIQRK-----PGPVLSLECPNPNQVISSIKFASFGTPLGTCGSFSRGRCSSARSL 808
W S S+ RK P + L+C + ISSI+FAS+G+P G+C FS+G+C +A SL
Sbjct: 784 W-SHSEFDRKLSLMDKTPEMHLQC-DEGHTISSIEFASYGSPNGSCQKFSQGKCHAANSL 841
Query: 809 SVVRQACVGSKSCSIGVSVNTFGDPCKGVMKSLAVEASCT 848
SVV QAC+G SCSIG+S FGDPC+ V+KSLAV+A C+
Sbjct: 842 SVVSQACIGRTSCSIGISNGVFGDPCRHVVKSLAVQAKCS 881
>gi|357153898|ref|XP_003576603.1| PREDICTED: beta-galactosidase 15-like [Brachypodium distachyon]
Length = 908
Score = 922 bits (2384), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 468/855 (54%), Positives = 576/855 (67%), Gaps = 41/855 (4%)
Query: 26 NVTYDHRAVVIGGKRRVLISGSIHYPRSTPEMWPDLIQKSKDGGLDVIETYVFWNLHEPV 85
NV+YDHRAV +GG+RR+L+S +HYPR+TPEMWP +I K K+GG DVIETY+FWN HEP
Sbjct: 51 NVSYDHRAVRVGGERRMLVSAGVHYPRATPEMWPSIIAKCKEGGADVIETYIFWNGHEPA 110
Query: 86 RNQYNFEGRYDLVKFVKLVAEAGLYAHLRIGPYVCAEWNFGGFPLWLHFIPGIQFRTDNE 145
+ QY FE R+DLV+F+KLVA GL+ LRIGPY CAEWNFGGFP+WL IPGI+FRTDNE
Sbjct: 111 KGQYYFEERFDLVRFIKLVAAEGLFLFLRIGPYACAEWNFGGFPVWLRDIPGIEFRTDNE 170
Query: 146 PFKAEMQRFTAKIVDMMKQEKLYASQGGPIILSQIENEYGNIDSAYGAAGKSYIKWAAGM 205
P+KAEMQ F KIVDMMK EKLY+ QGGPIIL QIENEYGNI YG AGK Y++WAA M
Sbjct: 171 PYKAEMQTFVTKIVDMMKDEKLYSWQGGPIILQQIENEYGNIQGKYGQAGKRYMQWAAQM 230
Query: 206 ALSLDTGVPWVMCQQSDAPDPIINTCNGFYCDQFTPNSNNKPKMWTENWSGWFLSFGGAV 265
AL LDTG+PWVMC+Q+DAP+ I++TCN FYCD F PNS NKP +WTE+W GW+ +GG +
Sbjct: 231 ALGLDTGIPWVMCRQTDAPEQILDTCNAFYCDGFKPNSYNKPTIWTEDWDGWYADWGGPL 290
Query: 266 PYRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNFDRTSGGPFISTSYDYDAPLDEYGLIR 325
P+RP ED AFAVARF+QRGG+ QNYYMY GGTNF RT+GGP TSYDYDAP++EYG++R
Sbjct: 291 PHRPAEDSAFAVARFYQRGGSLQNYYMYFGGTNFARTAGGPLQITSYDYDAPINEYGMLR 350
Query: 326 QPKWGHLKDLHKAIKLCEAALVATD--PTYPSLGPNLEATVYKTG-----------SGLC 372
QPKWGHLKDLH AIKLCE AL+A D P Y LG EA +Y + + +C
Sbjct: 351 QPKWGHLKDLHTAIKLCEPALIAVDGSPQYVKLGSMQEAHIYSSAKVHTNGSTAGNAQIC 410
Query: 373 SAFLANIGTNSDVTVKFNGNSYLLPAWSVSILPDCKNVVFNTAKINSVTLVPSF------ 426
SAFLANI + V+V G SY LP WSVSILPDC+NV FNTA++ + T V +F
Sbjct: 411 SAFLANIDEHKYVSVWIFGKSYNLPPWSVSILPDCENVAFNTARVGAQTSVFTFESGSPS 470
Query: 427 --SRQ--SLQVAADSSDAIGSGWSYINEPVGISKDDAFTKPGLLEQINTTADQSDYLWYS 482
SR+ S+ + + S W E +G D +F G+LE +N T D SDYLWY+
Sbjct: 471 HSSRREPSVLLPGVRGSYLSSTWWTSKETIGTWGDGSFATQGILEHLNVTKDISDYLWYT 530
Query: 483 LSTNIKADEPLLEDGSKTVLH---VQSLGHALHAFINGKLVGSGYGSSSNAKVTVDFPIA 539
S NI +DE + SK VL + + F+NGKL GS G V++ PI
Sbjct: 531 TSVNI-SDEDVAFWSSKGVLPSLIIDQIRDVARVFVNGKLAGSQVGH----WVSLKQPIQ 585
Query: 540 LAPGKNTFDLLSLTVGLQNYGAFYEKTGAGITGPVQLKGSGNGTNIDLSSQQWTYQTGLK 599
G N LLS VGLQNYGAF EK GAG G V+L G NG + DL++ WTYQ GLK
Sbjct: 586 FVRGLNELTLLSEIVGLQNYGAFLEKDGAGFKGQVKLTGLSNG-DTDLTNSAWTYQVGLK 644
Query: 600 GE--ELNFPSGSS-TQWDSKSTLPKLQPLVWYKTTFDAPAGSEPVAIDFTGMGKGEAWVN 656
GE + P +W + T P WYKT DAP G++PVAID MGKG+AWVN
Sbjct: 645 GEFSMIYTPEKQECAEWSAMQTDNIQSPFTWYKTMVDAPEGTDPVAIDLGSMGKGQAWVN 704
Query: 657 GQSIGRYWPTYVSQNGGCTDSCNYRGAYSSNKCLKNCGKPSQSLYHVPRSWLKSSGNTLV 716
G+ IGRYW + V+ GC SCNY GAYS KC NCG P+QS YH+PR WL+ S N LV
Sbjct: 705 GRLIGRYW-SLVAPESGCPSSCNYPGAYSETKCQSNCGMPTQSWYHIPREWLQESNNLLV 763
Query: 717 LFEEIGGDPTKISFVTKQLGSSLCSHVTDSH--PLPVDMWGSDSKIQRKP-GPVLSLECP 773
LFEE GGDP+KIS ++CS +++++ PL W ++ P L L C
Sbjct: 764 LFEETGGDPSKISLEV-HYTKTICSRISENYYPPLSAWSWLDTGRVSVDSVAPELLLRCD 822
Query: 774 NPNQVISSIKFASFGTPLGTCGSFSRGRCSSARSLSVVRQACVGSKSCSIGVSVNTFGDP 833
+ + IS I FAS+GTP G C +FS+G+C +A +L V +ACVG C+I VS + FGDP
Sbjct: 823 DGYE-ISRITFASYGTPSGGCQNFSKGKCHAASTLDFVTEACVGKNKCAISVSNDVFGDP 881
Query: 834 CKGVMKSLAVEASCT 848
C+GV+K LAVEA C+
Sbjct: 882 CRGVLKDLAVEAECS 896
>gi|115488372|ref|NP_001066673.1| Os12g0429200 [Oryza sativa Japonica Group]
gi|122234131|sp|Q0INM3.1|BGL15_ORYSJ RecName: Full=Beta-galactosidase 15; Short=Lactase 15; Flags:
Precursor
gi|113649180|dbj|BAF29692.1| Os12g0429200 [Oryza sativa Japonica Group]
Length = 919
Score = 921 bits (2380), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 471/854 (55%), Positives = 573/854 (67%), Gaps = 40/854 (4%)
Query: 26 NVTYDHRAVVIGGKRRVLISGSIHYPRSTPEMWPDLIQKSKDGGLDVIETYVFWNLHEPV 85
NVTYDHRAV+IGGKRR+L+S +HYPR+TPEMWP LI K K+GG DVIETYVFWN HEP
Sbjct: 63 NVTYDHRAVLIGGKRRMLVSAGLHYPRATPEMWPSLIAKCKEGGADVIETYVFWNGHEPA 122
Query: 86 RNQYNFEGRYDLVKFVKLVAEAGLYAHLRIGPYVCAEWNFGGFPLWLHFIPGIQFRTDNE 145
+ QY FE R+DLVKF KLVA GL+ LRIGPY CAEWNFGGFP+WL IPGI+FRTDNE
Sbjct: 123 KGQYYFEERFDLVKFAKLVAAEGLFLFLRIGPYACAEWNFGGFPVWLRDIPGIEFRTDNE 182
Query: 146 PFKAEMQRFTAKIVDMMKQEKLYASQGGPIILSQIENEYGNIDSAYGAAGKSYIKWAAGM 205
PFKAEMQ F KIV +MK+EKLY+ QGGPIIL QIENEYGNI YG AGK Y++WAA M
Sbjct: 183 PFKAEMQTFVTKIVTLMKEEKLYSWQGGPIILQQIENEYGNIQGNYGQAGKRYMQWAAQM 242
Query: 206 ALSLDTGVPWVMCQQSDAPDPIINTCNGFYCDQFTPNSNNKPKMWTENWSGWFLSFGGAV 265
A+ LDTG+PWVMC+Q+DAP+ II+TCN FYCD F PNS NKP +WTE+W GW+ +GGA+
Sbjct: 243 AIGLDTGIPWVMCRQTDAPEEIIDTCNAFYCDGFKPNSYNKPTIWTEDWDGWYADWGGAL 302
Query: 266 PYRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNFDRTSGGPFISTSYDYDAPLDEYGLIR 325
P+RP ED AFAVARF+QRGG+ QNYYMY GGTNF RT+GGP TSYDYDAP+DEYG++R
Sbjct: 303 PHRPAEDSAFAVARFYQRGGSLQNYYMYFGGTNFARTAGGPLQITSYDYDAPIDEYGILR 362
Query: 326 QPKWGHLKDLHKAIKLCEAALVATD--PTYPSLGPNLEATVYKTG-----------SGLC 372
QPKWGHLKDLH AIKLCE AL+A D P Y LG EA VY TG + +C
Sbjct: 363 QPKWGHLKDLHTAIKLCEPALIAVDGSPQYIKLGSMQEAHVYSTGEVHTNGSMAGNAQIC 422
Query: 373 SAFLANIGTNSDVTVKFNGNSYLLPAWSVSILPDCKNVVFNTAKINSVTLV-------PS 425
SAFLANI + +V G SY LP WSVSILPDC+NV FNTA+I + T V PS
Sbjct: 423 SAFLANIDEHKYASVWIFGKSYSLPPWSVSILPDCENVAFNTARIGAQTSVFTVESGSPS 482
Query: 426 FS---RQSLQVAADSSDAIGSGWSYINEPVGISKDDAFTKPGLLEQINTTADQSDYLWYS 482
S + S+ + S W E +G + F G+LE +N T D SDYLWY+
Sbjct: 483 RSSRHKPSILSLTSGGPYLSSTWWTSKETIGTWGGNNFAVQGILEHLNVTKDISDYLWYT 542
Query: 483 LSTNIKADEPLLEDGSKTV---LHVQSLGHALHAFINGKLVGSGYGSSSNAKVTVDFPIA 539
NI +D + SK V L + + F+NGKL GS G V++ PI
Sbjct: 543 TRVNI-SDADVAFWSSKGVLPSLTIDKIRDVARVFVNGKLAGSQVGH----WVSLKQPIQ 597
Query: 540 LAPGKNTFDLLSLTVGLQNYGAFYEKTGAGITGPVQLKGSGNGTNIDLSSQQWTYQTGLK 599
L G N LLS VGLQNYGAF EK GAG G V L G +G ++DL++ WTYQ GLK
Sbjct: 598 LVEGLNELTLLSEIVGLQNYGAFLEKDGAGFRGQVTLTGLSDG-DVDLTNSLWTYQVGLK 656
Query: 600 GE--ELNFPSGSSTQWDSKSTLPKLQPLVWYKTTFDAPAGSEPVAIDFTGMGKGEAWVNG 657
GE + P S+ +QP WYKT F P G++PVAID MGKG+AWVNG
Sbjct: 657 GEFSMIYAPEKQGCAGWSRMQKDSVQPFTWYKTMFSTPKGTDPVAIDLGSMGKGQAWVNG 716
Query: 658 QSIGRYWPTYVSQNGGCTDSCNYRGAYSSNKCLKNCGKPSQSLYHVPRSWLKSSGNTLVL 717
IGRYW + V+ GC+ SC Y GAY+ KC NCG P+Q+ YH+PR WLK S N LVL
Sbjct: 717 HLIGRYW-SLVAPESGCSSSCYYPGAYNERKCQSNCGMPTQNWYHIPREWLKESDNLLVL 775
Query: 718 FEEIGGDPTKISFVTKQLGSSLCSHVTDSHPLPVDMWGSDSKIQ---RKPGPVLSLECPN 774
FEE GGDP+ IS + ++CS +++++ P+ W S + P L L+C +
Sbjct: 776 FEETGGDPSLIS-LEAHYAKTVCSRISENYYPPLSAWSHLSSGRASVNAATPELRLQC-D 833
Query: 775 PNQVISSIKFASFGTPLGTCGSFSRGRCSSARSLSVVRQACVGSKSCSIGVSVNTFGDPC 834
VIS I FAS+GTP G C +FS+G C ++ +L +V +ACVG+ C+I VS + FGDPC
Sbjct: 834 DGHVISEITFASYGTPSGGCLNFSKGNCHASSTLDLVTEACVGNTKCAISVSNDVFGDPC 893
Query: 835 KGVMKSLAVEASCT 848
+GV+K LAVEA C+
Sbjct: 894 RGVLKDLAVEAKCS 907
>gi|61162196|dbj|BAD91080.1| beta-D-galactosidase [Pyrus pyrifolia]
Length = 851
Score = 920 bits (2378), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 452/829 (54%), Positives = 575/829 (69%), Gaps = 13/829 (1%)
Query: 26 NVTYDHRAVVIGGKRRVLISGSIHYPRSTPEMWPDLIQKSKDGGLDVIETYVFWNLHEPV 85
NV+YD R+++I G+R++LIS +IHYPRS PEMWP L+Q +K+GG+DVIETYVFWN HEP
Sbjct: 28 NVSYDSRSLIIDGQRKLLISAAIHYPRSVPEMWPKLVQTAKEGGVDVIETYVFWNGHEPS 87
Query: 86 RNQYNFEGRYDLVKFVKLVAEAGLYAHLRIGPYVCAEWNFGGFPLWLHFIPGIQFRTDNE 145
Y F GRYDLVKFVK+V +AG++ LRIGP+V AEW FGG P+WLH++PG FRT+N+
Sbjct: 88 PGNYYFGGRYDLVKFVKIVEQAGMHLILRIGPFVAAEWYFGGIPVWLHYVPGTVFRTENK 147
Query: 146 PFKAEMQRFTAKIVDMMKQEKLYASQGGPIILSQIENEYGNIDSAYGAAGKSYIKWAAGM 205
PFK MQ+FT IVD+MKQEK +ASQGGPIIL+Q+ENEYG + YG GK Y WAA M
Sbjct: 148 PFKYHMQKFTTFIVDLMKQEKFFASQGGPIILAQVENEYGYYEKDYGEGGKQYAMWAASM 207
Query: 206 ALSLDTGVPWVMCQQSDAPDPIINTCNGFYCDQFTPNSNNKPKMWTENWSGWFLSFGGAV 265
A+S + GVPW+MCQQ DAP+ +INTCN FYCDQFTP NKPK+WTENW GWF +FGG
Sbjct: 208 AVSQNIGVPWIMCQQFDAPESVINTCNSFYCDQFTPIYQNKPKIWTENWPGWFKTFGGWN 267
Query: 266 PYRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNFDRTSGGPFISTSYDYDAPLDEYGLIR 325
P+RP ED+AF+VARFFQ+GG+ NYYMYHGGTNF RTSGGPFI+TSYDY+AP+DEYGL R
Sbjct: 268 PHRPAEDIAFSVARFFQKGGSVHNYYMYHGGTNFGRTSGGPFITTSYDYEAPIDEYGLPR 327
Query: 326 QPKWGHLKDLHKAIKLCEAALVATDPTYPSLGPNLEATVYKTGSGLCSAFLANIGTNSDV 385
PKWGHLK LH+AIKLCE ++ + PT SLGP+LEA V+ SG C+AF+AN+ +D
Sbjct: 328 LPKWGHLKQLHRAIKLCEHIMLNSQPTNVSLGPSLEADVFTNSSGACAAFIANMDDKNDK 387
Query: 386 TVKFNGNSYLLPAWSVSILPDCKNVVFNTAKINSVTLVPSFSRQSLQVAADSSDA--IGS 443
TV+F SY LPAWSVSILPDCKNVVFNTAK+ S + V +SLQ++ S+D
Sbjct: 388 TVEFRNMSYHLPAWSVSILPDCKNVVFNTAKVGSQSSVVEMLPESLQLSVGSADKSLKDL 447
Query: 444 GWSYINEPVGISKDDAFTKPGLLEQINTTADQSDYLWYSLSTNIKADEPLLEDGSKTVLH 503
W E GI + F K GL++ INTT +DYLWY+ S + +E L+ GS VL
Sbjct: 448 KWDVFVEKAGIWGEADFVKSGLVDHINTTKFTTDYLWYTTSILVGENEEFLKKGSSPVLL 507
Query: 504 VQSLGHALHAFINGKLVGSGYGSSSNAKVTVDFPIALAPGKNTFDLLSLTVGLQNYGAFY 563
++S GHA+HAF+N +L S G+ ++ + PI+L GKN LLS+TVGLQN G+FY
Sbjct: 508 IESKGHAVHAFVNQELQASAAGNGTHFPFKLKAPISLKEGKNDIALLSMTVGLQNAGSFY 567
Query: 564 EKTGAGITGPVQLKGSGNGTNIDLSSQQWTYQTGLKGEE--LNFPSG-SSTQWDSKSTLP 620
E GAG+T V+++G NGT IDLS+ WTY+ GL+GE L+ G + W S S P
Sbjct: 568 EWVGAGLTS-VKIQGFNNGT-IDLSAYNWTYKIGLEGEHQGLDKEEGFGNVNWISASEPP 625
Query: 621 KLQPLVWYKTTFDAPAGSEPVAIDFTGMGKGEAWVNGQSIGRYWPTYVSQNGGCTDSCNY 680
K QPL WYK D P G +PV +D MGKG AW+NG+ IGRYWP + GC CNY
Sbjct: 626 KEQPLTWYKVIVDPPPGDDPVGLDMIHMGKGLAWLNGEEIGRYWPRKGPLH-GCVKECNY 684
Query: 681 RGAYSSNKCLKNCGKPSQSLYHVPRSWLKSSGNTLVLFEEIGGDPTKISFVTKQLGSSLC 740
RG + +KC CG+P+Q YHVPRSW K SGN LV+FEE GGDP+KI F +++ + +C
Sbjct: 685 RGKFDPDKCNTGCGEPTQRWYHVPRSWFKQSGNVLVIFEEKGGDPSKIEFSRRKI-TGVC 743
Query: 741 SHVTDSHP-LPVDMWGSDSKIQRKPGPVLSLECPNPNQVISSIKFASFGTPLGTCGSFSR 799
+ V +++P + ++ W +D K + L CP ISS+KFASFG P G C S+++
Sbjct: 744 ALVAENYPSIDLESW-NDGSGSNKTVATIHLGCPEDTH-ISSVKFASFGNPTGACRSYTQ 801
Query: 800 GRCSSARSLSVVRQACVGSKSCSIGVSVNTFGD-PCKGVMKSLAVEASC 847
G C S+SVV + C+ C I ++ F C K LAVE C
Sbjct: 802 GDCHDPNSISVVEKVCLNKNRCDIELTGENFNKGSCLSEPKKLAVEVQC 850
>gi|414865884|tpg|DAA44441.1| TPA: hypothetical protein ZEAMMB73_968467 [Zea mays]
Length = 641
Score = 919 bits (2374), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 451/623 (72%), Positives = 515/623 (82%), Gaps = 11/623 (1%)
Query: 18 LATTSFGANVTYDHRAVVIGGKRRVLISGSIHYPRSTPEMWPDLIQKSKDGGLDVIETYV 77
+A + ANVTYDHRA+VI G RRVL+SGSIHYPRSTP+MWP LIQK+KDGGLDVIETYV
Sbjct: 21 IAGGARAANVTYDHRALVIDGVRRVLVSGSIHYPRSTPDMWPGLIQKAKDGGLDVIETYV 80
Query: 78 FWNLHEPVRNQYNFEGRYDLVKFVKLVAEAGLYAHLRIGPYVCAEWNFGGFPLWLHFIPG 137
FW++HEPVR QY+FEGR DL FVK VA+AGLY HLRIGPYVCAEWN+GGFPLWLHFIPG
Sbjct: 81 FWDIHEPVRGQYDFEGRKDLAAFVKTVADAGLYVHLRIGPYVCAEWNYGGFPLWLHFIPG 140
Query: 138 IQFRTDNEPFKAEMQRFTAKIVDMMKQEKLYASQGGPIILSQIENEYGNIDSAYGAAGKS 197
I+FRTDNEPFKAEMQRFTAK+VD MK LYASQGGPIILSQIENEYGNIDSAYGA GK+
Sbjct: 141 IKFRTDNEPFKAEMQRFTAKVVDTMKGAGLYASQGGPIILSQIENEYGNIDSAYGAPGKA 200
Query: 198 YIKWAAGMALSLDTGVPWVMCQQSDAPDPIINTCNGFYCDQFTPNSNNKPKMWTENWSGW 257
Y++WAAGMA+SLDTGVPWVMCQQ+DAPDP+INTCNGFYCDQFTPNS KPKMWTENWSGW
Sbjct: 201 YMRWAAGMAVSLDTGVPWVMCQQADAPDPLINTCNGFYCDQFTPNSAAKPKMWTENWSGW 260
Query: 258 FLSFGGAVPYRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNFDRTSGGPFISTSYDYDAP 317
FLSFGGAVPYRPVEDLAFAVARF+QRGGTFQNYYMYHGGTN DR+SGGPFI+TSYDYDAP
Sbjct: 261 FLSFGGAVPYRPVEDLAFAVARFYQRGGTFQNYYMYHGGTNLDRSSGGPFIATSYDYDAP 320
Query: 318 LDEYGLIRQPKWGHLKDLHKAIKLCEAALVATDPTYPSLGPNLEATVYKTGSGLCSAFLA 377
+DEYGL+RQPKWGHL+D+HKAIKLCE AL+ATDP+Y SLGPN+EA VYK GS +C+AFLA
Sbjct: 321 IDEYGLVRQPKWGHLRDVHKAIKLCEPALIATDPSYTSLGPNVEAAVYKVGS-VCAAFLA 379
Query: 378 NIGTNSDVTVKFNGNSYLLPAWSVSILPDCKNVVFNTAKINSVTLVPSFSR-QSLQVAAD 436
NI SD TV FNG Y LPAWSVSILPDCKNVV NTA+INS T +S VA+D
Sbjct: 380 NIDGQSDKTVTFNGKMYRLPAWSVSILPDCKNVVLNTAQINSQTTGSEMRYLESSNVASD 439
Query: 437 SS----DAIGSGWSYINEPVGISKDDAFTKPGLLEQINTTADQSDYLWYSLSTNIKADEP 492
S + S WSY EPVGI+KD+A TK GL+EQINTTAD SD+LWYS S +K DEP
Sbjct: 440 GSFVTPELAVSDWSYAIEPVGITKDNALTKAGLMEQINTTADASDFLWYSTSITVKGDEP 499
Query: 493 LLEDGSKTVLHVQSLGHALHAFINGKLVGSGYGSSSNAKVTVDFPIALAPGKNTFDLLSL 552
L +GS++ L V SLGH L +INGK+ GS GS+S++ ++ PI L PGKN DLLS
Sbjct: 500 YL-NGSQSNLAVNSLGHVLQVYINGKIAGSAQGSASSSLISWQKPIELVPGKNKIDLLSA 558
Query: 553 TVGLQNYGAFYEKTGAGITGPVQLKGSGNGTNIDLSSQQWTYQTGLKGEELNF--PSGSS 610
TVGL NYGAF++ GAGITGPV+L G NG +DLSS +WTYQ GL+GE+L+ PS +S
Sbjct: 559 TVGLSNYGAFFDLVGAGITGPVKLSGL-NGA-LDLSSAEWTYQIGLRGEDLHLYDPSEAS 616
Query: 611 TQWDSKSTLPKLQPLVWYKTTFD 633
+W S + P PL+WYK + +
Sbjct: 617 PEWVSANAYPINHPLIWYKVSME 639
>gi|33521214|gb|AAQ21369.1| beta-galactosidase [Sandersonia aurantiaca]
Length = 826
Score = 918 bits (2372), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 461/826 (55%), Positives = 579/826 (70%), Gaps = 29/826 (3%)
Query: 26 NVTYDHRAVVIGGKRRVLISGSIHYPRSTPEMWPDLIQKSKDGGLDVIETYVFWNLHEPV 85
NV YD RA+ I G+RR+L+SGSIHYPRSTPEMWPDLIQK+KDGGLDVI+TYVFWN HEP
Sbjct: 25 NVWYDSRAITINGQRRILMSGSIHYPRSTPEMWPDLIQKAKDGGLDVIQTYVFWNGHEPS 84
Query: 86 RNQYNFEGRYDLVKFVKLVAEAGLYAHLRIGPYVCAEWNFGGFPLWLHFIPGIQFRTDNE 145
+Y FEG YDLV+F+KLV + GLY HLRIGPYVCAEWNFGGFP+WL ++PGI FRTDNE
Sbjct: 85 PGKYYFEGNYDLVRFIKLVQQGGLYLHLRIGPYVCAEWNFGGFPVWLKYVPGIHFRTDNE 144
Query: 146 PFKAEMQRFTAKIVDMMKQEKLYASQGGPIILSQIENEYGNIDSAYGAAGKSYIKWAAGM 205
PFKAEM++FT+ IV+MMK EKL+ QGGPIILSQIENE+G ++ GA K+Y WAA M
Sbjct: 145 PFKAEMEKFTSHIVNMMKAEKLFHWQGGPIILSQIENEFGPLEYDQGAPAKAYAAWAAKM 204
Query: 206 ALSLDTGVPWVMCQQSDAPDPIINTCNGFYCDQFTPNSNNKPKMWTENWSGWFLSFGGAV 265
A+ L+TGVPWVMC++ DAPDP+INT NGFY D F PN KP MWTENW+GWF +G V
Sbjct: 205 AVDLETGVPWVMCKEDDAPDPVINTWNGFYADGFYPNKRYKPMMWTENWTGWFTGYGVPV 264
Query: 266 PYRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNFDRTSGGPFISTSYDYDAPLDEYGLIR 325
P+RPVEDLAF+VA+F Q+GG++ NYYMYHGGTNF RT+GGPFI+TSYDYDAPLDEYG++R
Sbjct: 265 PHRPVEDLAFSVAKFVQKGGSYVNYYMYHGGTNFGRTAGGPFIATSYDYDAPLDEYGMLR 324
Query: 326 QPKWGHLKDLHKAIKLCEAALVATDPTYPSLGPNLEATVYKTGSGLCSAFLANIGTNSDV 385
QPK+GHL DLHKAIKLCE ALV+ P SLG N E+ V+++ SG C+AFLAN T
Sbjct: 325 QPKYGHLTDLHKAIKLCEPALVSGYPVVTSLGNNQESNVFRSNSGACAAFLANYDTKYYA 384
Query: 386 TVKFNGNSYLLPAWSVSILPDCKNVVFNTAKINSVTLVPSFSRQSLQVAADSSDAIGSGW 445
TV FNG Y LP WS+SILPDCK VFNTA++ + Q+ Q+ + G W
Sbjct: 385 TVTFNGMRYNLPPWSISILPDCKTTVFNTARVGA---------QTTQMQMTTVG--GFSW 433
Query: 446 SYINEPVGISKDDAFTKPGLLEQINTTADQSDYLWYSLSTNIKADEPLLEDGSKTVLHVQ 505
NE D +FTK GL+EQI+ T D +DYLWY+ NI +E L++G VL Q
Sbjct: 434 VSYNEDPNSIDDGSFTKLGLVEQISMTRDSTDYLWYTTYVNIDQNEQFLKNGQYPVLTAQ 493
Query: 506 SLGHALHAFINGKLVGSGYGSSSNAKVTVDFPIALAPGKNTFDLLSLTVGLQNYGAFYEK 565
S GH+LH FING+L+G+ YGS + ++T + L G N LS+ VGL N G +E
Sbjct: 494 SAGHSLHVFINGQLIGTAYGSVEDPRLTYTGNVKLFAGSNKISFLSIAVGLPNVGEHFET 553
Query: 566 TGAGITGPVQLKGSGNGTNIDLSSQQWTYQTGLKGEELNFP--SGSS-TQWDSKSTLPKL 622
G+ GPV L G G DL+ Q+WTY+ GLKGE L+ SGSS +W S +
Sbjct: 554 WNTGLLGPVTLNGLNEGKR-DLTWQKWTYKIGLKGEALSLHTLSGSSNVEWGDAS---RK 609
Query: 623 QPLVWYKTTFDAPAGSEPVAIDFTGMGKGEAWVNGQSIGRYWPTYVSQNGGCTDSCNYRG 682
QPL WYK F+AP GSEP+A+D + MGKG+ W+NGQSIGRYWP Y ++ G C+Y G
Sbjct: 610 QPLAWYKGFFNAPGGSEPLALDMSTMGKGQVWINGQSIGRYWPAYKAR--GSCPKCDYEG 667
Query: 683 AYSSNKCLKNCGKPSQSLYHVPRSWLKSSGNTLVLFEEIGGDPTKISFVTKQLGSSLCSH 742
Y KC NCG SQ YHVPRSWL +GN +V+FEE GG+PT IS V + + S+ C++
Sbjct: 668 TYEETKCQSNCGDSSQRWYHVPRSWLNPTGNLIVVFEEWGGEPTGISLVKRSMRSA-CAY 726
Query: 743 VTDSHPLPVDMWGSDSKIQRKPGPVLSLECPNPNQVISSIKFASFGTPLGTCGSFSRGRC 802
V+ P ++ W + + + L C +P ++ IKFAS+GTP G C S+S GRC
Sbjct: 727 VSQGQP-SMNNWHTKYAESK-----VHLSC-DPGLKMTQIKFASYGTPQGACESYSEGRC 779
Query: 803 SSARSLSVVRQACVGSKSCSIGVSVNTF-GDPCKGVMKSLAVEASC 847
+ +S + ++ C+G + CS+ V F GDPC G+MKS+AV+ASC
Sbjct: 780 HAHKSYDIFQKNCIGQQVCSVTVVPEVFGGDPCPGIMKSVAVQASC 825
>gi|414878434|tpg|DAA55565.1| TPA: hypothetical protein ZEAMMB73_938277 [Zea mays]
Length = 918
Score = 915 bits (2365), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 460/853 (53%), Positives = 570/853 (66%), Gaps = 38/853 (4%)
Query: 26 NVTYDHRAVVIGGKRRVLISGSIHYPRSTPEMWPDLIQKSKDGGLDVIETYVFWNLHEPV 85
NVTYDHRA+++GGKRR+L+S +HYPR+TPEMWP LI K K+GG+D IETYVFWN HEP
Sbjct: 62 NVTYDHRALILGGKRRMLVSAGLHYPRATPEMWPSLIAKCKEGGVDAIETYVFWNGHEPA 121
Query: 86 RNQYNFEGRYDLVKFVKLVAEAGLYAHLRIGPYVCAEWNFGGFPLWLHFIPGIQFRTDNE 145
+ QY FEGR+D+V+F KLVA GL+ LRIGPY CAEWNFGGFP+WL +PGI+FRTDNE
Sbjct: 122 KGQYYFEGRFDIVRFAKLVAAEGLFLFLRIGPYACAEWNFGGFPVWLRDVPGIEFRTDNE 181
Query: 146 PFKAEMQRFTAKIVDMMKQEKLYASQGGPIILSQIENEYGNIDSAYGAAGKSYIKWAAGM 205
P+KAEMQ F KIVD+MK+EKLY+ QGGPIIL QIENEYGNI YG AGK Y+ WAA M
Sbjct: 182 PYKAEMQIFVTKIVDIMKEEKLYSWQGGPIILQQIENEYGNIQGHYGQAGKRYMLWAAQM 241
Query: 206 ALSLDTGVPWVMCQQSDAPDPIINTCNGFYCDQFTPNSNNKPKMWTENWSGWFLSFGGAV 265
AL+LDTGVPWVMC+Q+DAP+ I+NTCN FYCD F PNS NKP +WTE+W GW+ +G ++
Sbjct: 242 ALALDTGVPWVMCRQTDAPEQILNTCNAFYCDGFKPNSYNKPTIWTEDWDGWYADWGESL 301
Query: 266 PYRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNFDRTSGGPFISTSYDYDAPLDEYGLIR 325
P+RP +D AFAVARF+QRGG+ QNYYMY GGTNF+RT+GGP TSYDYDAP+DEYG++R
Sbjct: 302 PHRPAQDSAFAVARFYQRGGSLQNYYMYFGGTNFERTAGGPLQITSYDYDAPIDEYGILR 361
Query: 326 QPKWGHLKDLHKAIKLCEAALVATD--PTYPSLGPNLEATVYKT-----------GSGLC 372
QPKWGHLKDLH AIKLCE+AL A D P Y LGP EA VY + S C
Sbjct: 362 QPKWGHLKDLHAAIKLCESALTAVDGSPHYVKLGPMQEAHVYSSENVHTNGSISGNSQFC 421
Query: 373 SAFLANIGTNSDVTVKFNGNSYLLPAWSVSILPDCKNVVFNTAKINSVTLV-------PS 425
SAFLANI + +V G SY LP WSVSILPDC+ V FNTA++ + T PS
Sbjct: 422 SAFLANIDEHKYASVWIFGKSYSLPPWSVSILPDCETVAFNTARVGTQTSFFNVESGSPS 481
Query: 426 FS--RQSLQVAADSSDAIGSGWSYINEPVGISKDDAFTKPGLLEQINTTADQSDYLWYSL 483
+S + ++ + + W EPVGI + FT G+LE +N T D SDYL Y+
Sbjct: 482 YSSRHKPRILSLIGVPYLSTTWWTFKEPVGIWGEGIFTAQGILEHLNVTKDISDYLSYTT 541
Query: 484 STNIKADEPLL--EDGSKTVLHVQSLGHALHAFINGKLVGSGYGSSSNAKVTVDFPIALA 541
NI ++ L G L + + F+NGKL GS G V+++ P+ L
Sbjct: 542 RVNISEEDVLYWNSKGFLPSLTIDQIRDVARVFVNGKLAGSKVGH----WVSLNQPLQLV 597
Query: 542 PGKNTFDLLSLTVGLQNYGAFYEKTGAGITGPVQLKGSGNGTNIDLSSQQWTYQTGLKGE 601
G N LLS VGLQNYGAF EK GAG G V+L G NG +IDL++ WTYQ GLKGE
Sbjct: 598 QGLNELTLLSEIVGLQNYGAFLEKDGAGFRGQVKLTGLSNG-DIDLTNSLWTYQIGLKGE 656
Query: 602 ELNFPSGS---STQWDSKSTLPKLQPLVWYKTTFDAPAGSEPVAIDFTGMGKGEAWVNGQ 658
S S +W S + P W+KT FDAP G+ PV ID MGKG+AWVNG
Sbjct: 657 FSRIYSPEYQGSAEWSSMQNDDTVSPFTWFKTMFDAPEGNGPVTIDLGSMGKGQAWVNGH 716
Query: 659 SIGRYWPTYVSQNGGCTDSCNYRGAYSSNKCLKNCGKPSQSLYHVPRSWLKSSGNTLVLF 718
IGRYW + V+ GC SCNY G YS +KC NCG +QS YH+PR WL+ SGN LVLF
Sbjct: 717 LIGRYW-SLVAPESGCPSSCNYAGTYSDSKCRSNCGIATQSWYHIPREWLQESGNLLVLF 775
Query: 719 EEIGGDPTKISFVTKQLGSSLCSHVTDSHPLPVDMWGSDSKIQ---RKPGPVLSLECPNP 775
EE GGDP++IS ++CS +++++ P+ W + + P L L+C +
Sbjct: 776 EETGGDPSQISLEV-HYTKTICSKISETYYPPLSAWSRAANGRPSVNTVAPELRLQC-DD 833
Query: 776 NQVISSIKFASFGTPLGTCGSFSRGRCSSARSLSVVRQACVGSKSCSIGVSVNTFGDPCK 835
VIS I FAS+GTP G C +FS G C ++ +L +V +AC G C+I V+ FGDPC+
Sbjct: 834 GHVISKITFASYGTPTGGCQNFSVGNCHASTTLDLVVEACEGKNRCAISVTNEVFGDPCR 893
Query: 836 GVMKSLAVEASCT 848
V+K LAVEA C+
Sbjct: 894 KVVKDLAVEAECS 906
>gi|449435860|ref|XP_004135712.1| PREDICTED: beta-galactosidase-like [Cucumis sativus]
Length = 723
Score = 912 bits (2358), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 447/729 (61%), Positives = 532/729 (72%), Gaps = 16/729 (2%)
Query: 8 LLVLCWGFVVLATTSFGANVTYDHRAVVIGGKRRVLISGSIHYPRSTPEMWPDLIQKSKD 67
++V+ G V+ +S A+VTYDH+A+VI GKRR+LISGSIHYPRSTP+MWPDLIQK+KD
Sbjct: 7 IMVVFLGLVLWVCSSVMASVTYDHKALVIDGKRRILISGSIHYPRSTPQMWPDLIQKAKD 66
Query: 68 GGLDVIETYVFWNLHEPVRNQYNFEGRYDLVKFVKLVAEAGLYAHLRIGPYVCAEWNFGG 127
GGLDVIETYVFWN HEP QY FE RY+LV+FVKLV +AGLY HLRIGPYVCAEWNFGG
Sbjct: 67 GGLDVIETYVFWNGHEPSPGQYYFEDRYELVRFVKLVQQAGLYVHLRIGPYVCAEWNFGG 126
Query: 128 FPLWLHFIPGIQFRTDNEPFKAEMQRFTAKIVDMMKQEKLYASQGGPIILSQIENEYGNI 187
FP+WL ++PGI FRTDN PFKA MQ+FTAKIV MMK EKLY SQGGPIILSQIENEYG +
Sbjct: 127 FPVWLKYVPGIAFRTDNGPFKAAMQKFTAKIVSMMKGEKLYHSQGGPIILSQIENEYGPV 186
Query: 188 DSAYGAAGKSYIKWAAGMALSLDTGVPWVMCQQSDAPDPIINTCNGFYCDQFTPNSNNKP 247
+ GA GKSY KWAA MAL LDTGVPWVMC+Q DAPDP+I+TCNGFYC+ F PN KP
Sbjct: 187 EWEIGAPGKSYTKWAAQMALGLDTGVPWVMCKQEDAPDPMIDTCNGFYCENFEPNKAYKP 246
Query: 248 KMWTENWSGWFLSFGGAVPYRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNFDRTSGGPF 307
KMWTE W+GWF FGG VPYRPVEDLA+AVARF Q G+ NYYMYHGGTNF RT+GGPF
Sbjct: 247 KMWTEAWTGWFTEFGGPVPYRPVEDLAYAVARFIQNRGSLINYYMYHGGTNFGRTAGGPF 306
Query: 308 ISTSYDYDAPLDEYGLIRQPKWGHLKDLHKAIKLCEAALVATDPTYPSLGPNLEATVYKT 367
I+TSYDYDAP+DEYGLIRQPKWGHL+DLHKAIKLCE ALV+ DPT SLG EA VY T
Sbjct: 307 IATSYDYDAPIDEYGLIRQPKWGHLRDLHKAIKLCEPALVSVDPTVSSLGSKQEAHVYNT 366
Query: 368 GSGLCSAFLANIGTNSDVTVKFNGNSYLLPAWSVSILPDCKNVVFNTAKINSVTLVPSFS 427
SG C+AFLAN ++ V V F + Y LP WSVSILPDCK VVFNTAK+N+ PS+
Sbjct: 367 RSGECAAFLANYDPSTSVRVTFGNHPYDLPPWSVSILPDCKTVVFNTAKVNA----PSYW 422
Query: 428 RQSLQVAADSSDAIGSGWSYINEPVGISKDDAFTKPGLLEQINTTADQSDYLWYSLSTNI 487
+ +++ S SY E DD T GL+EQI+ T D +DYLWY I
Sbjct: 423 PKMTPISSFSWH------SYNEETASAYADDTTTMAGLVEQISITRDATDYLWYMTDIRI 476
Query: 488 KADEPLLEDGSKTVLHVQSLGHALHAFINGKLVGSGYGSSSNAKVTVDFPIALAPGKNTF 547
++E L+ G +L + S GHALH FING+L G+ YG N K+T + L PG N
Sbjct: 477 DSNEGFLKSGQWPLLTIFSAGHALHVFINGQLSGTVYGGLDNPKLTFSKYVNLRPGVNKL 536
Query: 548 DLLSLTVGLQNYGAFYEKTGAGITGPVQLKGSGNGTNIDLSSQQWTYQTGLKGEELNFPS 607
+LS+ VGL N G +E AGI GPV LKG GT D+S +W+Y+ GLKGE LN +
Sbjct: 537 SMLSVAVGLPNVGVHFETWNAGILGPVTLKGLNEGTR-DMSGYKWSYKVGLKGEALNLHT 595
Query: 608 ---GSSTQWDSKSTLPKLQPLVWYKTTFDAPAGSEPVAIDFTGMGKGEAWVNGQSIGRYW 664
SS +W + S + + QPL WYKTTF+AP G+EP+A+D MGKG+ W+NG+SIGR+W
Sbjct: 596 VSGSSSVEWMTGSLVSQKQPLTWYKTTFNAPGGNEPLALDMGSMGKGQVWINGESIGRHW 655
Query: 665 PTYVSQNGGCTDSCNYRGAYSSNKCLKNCGKPSQSLYHVPRSWLKSSGNTLVLFEEIGGD 724
P Y ++ G C Y G ++ KC +CG+PSQ YHVPR+WLK SGN LV+FEE GG+
Sbjct: 656 PAYTAR--GSCGKCYYGGIFTEKKCHFSCGEPSQRWYHVPRAWLKPSGNILVIFEEWGGN 713
Query: 725 PTKISFVTK 733
P IS V +
Sbjct: 714 PDGISLVKR 722
>gi|224096113|ref|XP_002310540.1| predicted protein [Populus trichocarpa]
gi|222853443|gb|EEE90990.1| predicted protein [Populus trichocarpa]
Length = 827
Score = 912 bits (2358), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 455/856 (53%), Positives = 580/856 (67%), Gaps = 43/856 (5%)
Query: 7 LLLVLCWGFVVLATTSFGANVTYDHRAVVIGGKRRVLISGSIHYPRSTPEMWPDLIQKSK 66
+ L L + F + T F NV+YD R+++I G+R++LIS +IHYPRS P MWP+L++ +K
Sbjct: 1 MALGLIFFFSLCFTLCFAGNVSYDSRSLIINGERKLLISAAIHYPRSVPAMWPELVKTAK 60
Query: 67 DGGLDVIETYVFWNLHEPVR-NQYNFEGRYDLVKFVKLVAEAGLYAHLRIGPYVCAEWNF 125
+GG+DVIETYVFWN+H+P ++Y+F+GR+DLVKF+ +V EAG+Y LRIGP+V AEWNF
Sbjct: 61 EGGVDVIETYVFWNVHQPTSPSEYHFDGRFDLVKFINIVQEAGMYLILRIGPFVAAEWNF 120
Query: 126 GGFPLWLHFIPGIQFRTDNEPFKAEMQRFTAKIVDMMKQEKLYASQGGPIILSQ--IENE 183
GG P+WLH++ G FRTDN FK M+ FT IV +MK+EKL+ASQGGPIILSQ +ENE
Sbjct: 121 GGIPVWLHYVNGTVFRTDNYNFKYYMEEFTTYIVKLMKKEKLFASQGGPIILSQAKVENE 180
Query: 184 YGNIDSAYGAAGKSYIKWAAGMALSLDTGVPWVMCQQSDAPDPIINTCNGFYCDQFTPNS 243
YG + AYG GK Y WAA MA+S +TGVPW+MCQQ DAP +INTCN FYCDQF P
Sbjct: 181 YGYYEGAYGEGGKRYAAWAAQMAVSQNTGVPWIMCQQFDAPPSVINTCNSFYCDQFKPIF 240
Query: 244 NNKPKMWTENWSGWFLSFGGAVPYRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNFDRTS 303
+KPK+WTENW GWF +FG P+RP ED+AF+VARFFQ+GG+ QNYYMYHGGTNF RT+
Sbjct: 241 PDKPKIWTENWPGWFQTFGAPNPHRPAEDVAFSVARFFQKGGSVQNYYMYHGGTNFGRTA 300
Query: 304 GGPFISTSYDYDAPLDEYGLIRQPKWGHLKDLHKAIKLCEAALVATDPTYPSLGPNLEAT 363
GGPFI+TSYDY+AP+DEYGL R PKWGHLK+LHKAIKLCE L+ + P SLGP+ EA
Sbjct: 301 GGPFITTSYDYEAPIDEYGLPRLPKWGHLKELHKAIKLCEHVLLNSKPVNLSLGPSQEAD 360
Query: 364 VYKTGSGLCSAFLANIGTNSDVTVKFNGNSYLLPAWSVSILPDCKNVVFNTAKINSVTLV 423
VY SG C AFLANI +D TV F SY LPAWSVSILPDCKNVV+NTAK
Sbjct: 361 VYADASGGCVAFLANIDDKNDKTVDFQNVSYKLPAWSVSILPDCKNVVYNTAK------- 413
Query: 424 PSFSRQSLQVAADSSDAIGSGWSYINEPVGISKDDAFTKPGLLEQINTTADQSDYLWYSL 483
D S A+ W E GI + F K G ++ INTT D +DYLWY+
Sbjct: 414 ----------QKDGSKAL--KWEVFVEKAGIWGEPDFMKNGFVDHINTTKDTTDYLWYTT 461
Query: 484 STNIKADEPLLEDGSKTVLHVQSLGHALHAFINGKLVGSGYGSSSNAKVTVDFPIALAPG 543
S + +E L++G VL ++S+GHALHAF+N +L GS G+ S++ PI+L G
Sbjct: 462 SIVVGENEEFLKEGRHPVLLIESMGHALHAFVNQELQGSASGNGSHSPFKFKNPISLKAG 521
Query: 544 KNTFDLLSLTVGLQNYGAFYEKTGAGITGPVQLKGSGNGTNIDLSSQQWTYQTGLKGEEL 603
N LLS+TVGL N G+FYE GAG+T V+++G NGT +DLS W Y+ GL+GE+L
Sbjct: 522 NNEIALLSMTVGLPNAGSFYEWVGAGLTS-VRIEGFNNGT-VDLSHFNWIYKIGLQGEKL 579
Query: 604 NF--PSG-SSTQWDSKSTLPKLQPLVWYKTTFDAPAGSEPVAIDFTGMGKGEAWVNGQSI 660
P G +S W + S PK QPL WYK D PAG+EPV +D MGKG AW+NG+ I
Sbjct: 580 GIYKPEGVNSVSWVATSEPPKKQPLTWYKVVLDPPAGNEPVGLDMLHMGKGLAWLNGEEI 639
Query: 661 GRYWPTYVSQNGGCTDSCNYRGAYSSNKCLKNCGKPSQSLYHVPRSWLKSSGNTLVLFEE 720
GRYWP S + C C+YRG + +KC CG+P+Q YHVPRSW K SGN LV+FEE
Sbjct: 640 GRYWPRKSSVHEKCVTECDYRGKFMPDKCFTGCGQPTQRWYHVPRSWFKPSGNLLVIFEE 699
Query: 721 IGGDPTKISFVTKQLGSSLCSHVTDSHPLPVDMWGSDSKIQRKPG-------PVLSLECP 773
GGDP KI+F +++ SS+C+ + + +P +D K ++ G + L CP
Sbjct: 700 KGGDPEKITFSRRKM-SSICALIAEDYP------SADRKSLQEAGSKNSNSKASVHLGCP 752
Query: 774 NPNQVISSIKFASFGTPLGTCGSFSRGRCSSARSLSVVRQACVGSKSCSIGVSVNTFGDP 833
N VIS++KFASFGTP G CGS+S G C S+SVV +AC+ C+I ++ F
Sbjct: 753 Q-NAVISAVKFASFGTPTGKCGSYSEGECHDPNSISVVEKACLNKTECTIELTEENFNKG 811
Query: 834 -CKGVMKSLAVEASCT 848
C + LAVEA C+
Sbjct: 812 LCPDFTRRLAVEAVCS 827
>gi|302789848|ref|XP_002976692.1| hypothetical protein SELMODRAFT_268001 [Selaginella moellendorffii]
gi|300155730|gb|EFJ22361.1| hypothetical protein SELMODRAFT_268001 [Selaginella moellendorffii]
Length = 802
Score = 912 bits (2358), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 454/826 (54%), Positives = 566/826 (68%), Gaps = 46/826 (5%)
Query: 26 NVTYDHRAVVIGGKRRVLISGSIHYPRSTPEMWPDLIQKSKDGGLDVIETYVFWNLHEPV 85
NV+YDHR++++ GKRR+L+SGS+HYPR+TPEMWP +IQK+K+GGLDVIETYVFW+ HEP
Sbjct: 19 NVSYDHRSLILNGKRRILLSGSVHYPRATPEMWPGIIQKAKEGGLDVIETYVFWDRHEPS 78
Query: 86 RNQYNFEGRYDLVKFVKLVAEAGLYAHLRIGPYVCAEWNFGGFPLWLHFIPGIQFRTDNE 145
QY FEGRYDLVKFVKLV +AGL +LRIGPYVCAEWN GGFP+WL IP I FRTDNE
Sbjct: 79 PGQYYFEGRYDLVKFVKLVQQAGLLVNLRIGPYVCAEWNLGGFPIWLRDIPHIVFRTDNE 138
Query: 146 PFKAEMQRFTAKIVDMMKQEKLYASQGGPIILSQIENEYGNIDSAYGAAGKSYIKWAAGM 205
PFK MQ F KIV+MMK+E L+ASQGGPIIL+Q+ENEYGN+DS YG AG YI WAA M
Sbjct: 139 PFKKYMQSFLTKIVNMMKEENLFASQGGPIILAQVENEYGNVDSHYGEAGVRYINWAAEM 198
Query: 206 ALSLDTGVPWVMCQQSDAPDPIINTCNGFYCDQFTPNSNNKPKMWTENWSGWFLSFGGAV 265
A + +TGVPW+MC QS P+ II+TCNG YCD + P KP MWTE+++GWF +G +
Sbjct: 199 AQAQNTGVPWIMCAQSKVPEYIIDTCNGMYCDGWNPTLYKKPTMWTESYTGWFTYYGWPL 258
Query: 266 PYRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNFDRTSGGPFISTSYDYDAPLDEYGLIR 325
P+RPVED+AFAVARFF+RGG+F NYYMY GGTNF RTSGGP++++SYDYDAPLDEYG+
Sbjct: 259 PHRPVEDIAFAVARFFERGGSFHNYYMYFGGTNFGRTSGGPYVASSYDYDAPLDEYGMQH 318
Query: 326 QPKWGHLKDLHKAIKLCEAALVATDPTYPSLGPNLEATVYKTGSGLCSAFLANIGTNSDV 385
PKWGHLKDLH+ +KL E +++++ + LGPN EA VY G+G C AFLAN+ + +D
Sbjct: 319 LPKWGHLKDLHETLKLGEEVILSSEGQHSELGPNQEAHVYSYGNG-CVAFLANVDSMNDT 377
Query: 386 TVKFNGNSYLLPAWSVSILPDCKNVVFNTAKINSVTLVPSF--SRQSLQVAADSSDAIGS 443
V+F SY LPAWSVSI+ DCK V FN+AK+ S + V S S+ SL
Sbjct: 378 VVEFRNVSYSLPAWSVSIVLDCKTVAFNSAKVKSQSAVVSMNPSKSSLS----------- 426
Query: 444 GWSYINEPVGISKDDAFTKPGLLEQINTTADQSDYLWYSLSTNIKADEPLLEDGSKTVLH 503
W+ +EPVGIS +F LLEQ+ TT D SDYLWY+ T L
Sbjct: 427 -WTSFDEPVGIS-GSSFKAKQLLEQMETTKDTSDYLWYTTRYATGT--------GSTWLS 476
Query: 504 VQSLGHALHAFINGKLVGSGYGSSSNAKVTVDFPIALAPGKNTFDLLSLTVGLQNYGAFY 563
++S+ +H F+NG+ S + S S +V+ PI LAPG NT LLS TVGLQN+GAF
Sbjct: 477 IESMRDVVHIFVNGQFQSSWHTSKSVLYNSVEAPIKLAPGSNTIALLSATVGLQNFGAFI 536
Query: 564 EKTGAGITGPVQLKGSGNGTNIDLSSQQWTYQTGLKGEELNFPSGSSTQWDSKSTLPKLQ 623
E AG++G + LKG G +LS Q+WTYQ GLKGE+L + ++ + S + +
Sbjct: 537 ETWSAGLSGSLILKGLPGGDQ-NLSKQEWTYQVGLKGEDLKLFTVEGSRSVNWSAVSTKK 595
Query: 624 PLVWYKTTFDAPAGSEPVAIDFTGMGKGEAWVNGQSIGRYWPTYVSQNGGCTDSCNYRGA 683
PL WY T FDAP G +PVA+D MGKG+AWVNGQSIGRYWP Y + + C +SC+YRG+
Sbjct: 596 PLTWYMTEFDAPPGDDPVALDLASMGKGQAWVNGQSIGRYWPAYKAADSVCPESCDYRGS 655
Query: 684 YSSNKCLKNCGKPSQSLYHVPRSWLKSSGNTLVLFEEIGGDPTKISFVTKQLGSSLCSHV 743
Y NKCL CG+ SQ YHVPRSW+K GN LVLFEE GGDP+ I FVT+ + +C+ V
Sbjct: 656 YDQNKCLTGCGQSSQRWYHVPRSWMKPRGNLLVLFEETGGDPSSIDFVTRST-NVICARV 714
Query: 744 TDSHPLPVDMWGSDSKIQRKPGPVLSLECPNPNQVISSIKFASFGTPLGTCGSFSRGRCS 803
+SHP V +W CP QVIS I+FAS G P G+CGSF G C
Sbjct: 715 YESHPASVKLW-----------------CPGEKQVISQIRFASLGNPEGSCGSFKEGSCH 757
Query: 804 SARSLSVVRQACVGSKSCSIGVSVNTFGDPCKGVM-KSLAVEASCT 848
+ + V +ACVG +SCS+ T C GV K LAVEA C+
Sbjct: 758 TNDLSNTVEKACVGQRSCSLAPDFTT--SACPGVREKFLAVEALCS 801
>gi|449489943|ref|XP_004158465.1| PREDICTED: beta-galactosidase-like [Cucumis sativus]
Length = 1225
Score = 911 bits (2355), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 447/729 (61%), Positives = 532/729 (72%), Gaps = 16/729 (2%)
Query: 8 LLVLCWGFVVLATTSFGANVTYDHRAVVIGGKRRVLISGSIHYPRSTPEMWPDLIQKSKD 67
++V+ G V+ +S A+VTYDH+A+VI GKRR+LISGSIHYPRSTP+MWPDLIQK+KD
Sbjct: 7 IMVVFLGLVLWVCSSVMASVTYDHKALVIDGKRRILISGSIHYPRSTPQMWPDLIQKAKD 66
Query: 68 GGLDVIETYVFWNLHEPVRNQYNFEGRYDLVKFVKLVAEAGLYAHLRIGPYVCAEWNFGG 127
GGLDVIETYVFWN HEP QY FE RY+LV+FVKLV +AGLY HLRIGPYVCAEWNFGG
Sbjct: 67 GGLDVIETYVFWNGHEPSPGQYYFEDRYELVRFVKLVQQAGLYVHLRIGPYVCAEWNFGG 126
Query: 128 FPLWLHFIPGIQFRTDNEPFKAEMQRFTAKIVDMMKQEKLYASQGGPIILSQIENEYGNI 187
FP+WL ++PGI FRTDN PFKA MQ+FTAKIV MMK EKLY SQGGPIILSQIENEYG +
Sbjct: 127 FPVWLKYVPGIAFRTDNGPFKAAMQKFTAKIVSMMKGEKLYHSQGGPIILSQIENEYGPV 186
Query: 188 DSAYGAAGKSYIKWAAGMALSLDTGVPWVMCQQSDAPDPIINTCNGFYCDQFTPNSNNKP 247
+ GA GKSY KWAA MAL LDTGVPWVMC+Q DAPDP+I+TCNGFYC+ F PN KP
Sbjct: 187 EWEIGAPGKSYTKWAAQMALGLDTGVPWVMCKQEDAPDPMIDTCNGFYCENFEPNKAYKP 246
Query: 248 KMWTENWSGWFLSFGGAVPYRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNFDRTSGGPF 307
KMWTE W+GWF FGG VPYRPVEDLA+AVARF Q G+ NYYMYHGGTNF RT+GGPF
Sbjct: 247 KMWTEAWTGWFTEFGGPVPYRPVEDLAYAVARFIQNRGSLINYYMYHGGTNFGRTAGGPF 306
Query: 308 ISTSYDYDAPLDEYGLIRQPKWGHLKDLHKAIKLCEAALVATDPTYPSLGPNLEATVYKT 367
I+TSYDYDAP+DEYGLIRQPKWGHL+DLHKAIKLCE ALV+ DPT SLG EA VY T
Sbjct: 307 IATSYDYDAPIDEYGLIRQPKWGHLRDLHKAIKLCEPALVSVDPTVSSLGSKQEAHVYNT 366
Query: 368 GSGLCSAFLANIGTNSDVTVKFNGNSYLLPAWSVSILPDCKNVVFNTAKINSVTLVPSFS 427
SG C+AFLAN ++ V V F + Y LP WSVSILPDCK VVFNTAK+N+ PS+
Sbjct: 367 RSGECAAFLANYDPSTSVRVTFGNHPYDLPPWSVSILPDCKTVVFNTAKVNA----PSYW 422
Query: 428 RQSLQVAADSSDAIGSGWSYINEPVGISKDDAFTKPGLLEQINTTADQSDYLWYSLSTNI 487
+ +++ S SY E DD T GL+EQI+ T D +DYLWY I
Sbjct: 423 PKMTPISSFSWH------SYNEETASAYADDTTTMAGLVEQISITRDATDYLWYMTDIRI 476
Query: 488 KADEPLLEDGSKTVLHVQSLGHALHAFINGKLVGSGYGSSSNAKVTVDFPIALAPGKNTF 547
++E L+ G +L + S GHALH FING+L G+ YG N K+T + L PG N
Sbjct: 477 DSNEGFLKSGQWPLLTIFSAGHALHVFINGQLSGTVYGGLDNPKLTFSKYVNLRPGVNKL 536
Query: 548 DLLSLTVGLQNYGAFYEKTGAGITGPVQLKGSGNGTNIDLSSQQWTYQTGLKGEELNFPS 607
+LS+ VGL N G +E AGI GPV LKG GT D+S +W+Y+ GLKGE LN +
Sbjct: 537 SMLSVAVGLPNVGVHFETWNAGILGPVTLKGLNEGTR-DMSGYKWSYKVGLKGEALNLHT 595
Query: 608 ---GSSTQWDSKSTLPKLQPLVWYKTTFDAPAGSEPVAIDFTGMGKGEAWVNGQSIGRYW 664
SS +W + S + + QPL WYKTTF+AP G+EP+A+D MGKG+ W+NG+SIGR+W
Sbjct: 596 VSGSSSVEWMTGSLVSQKQPLTWYKTTFNAPGGNEPLALDMGSMGKGQVWINGESIGRHW 655
Query: 665 PTYVSQNGGCTDSCNYRGAYSSNKCLKNCGKPSQSLYHVPRSWLKSSGNTLVLFEEIGGD 724
P Y ++ G C Y G ++ KC +CG+PSQ YHVPR+WLK SGN LV+FEE GG+
Sbjct: 656 PAYTAR--GSCGKCYYGGIFTEKKCHFSCGEPSQRWYHVPRAWLKPSGNILVIFEEWGGN 713
Query: 725 PTKISFVTK 733
P IS V +
Sbjct: 714 PDGISLVKR 722
Score = 527 bits (1357), Expect = e-146, Method: Compositional matrix adjust.
Identities = 271/511 (53%), Positives = 345/511 (67%), Gaps = 16/511 (3%)
Query: 228 INTCNGFYCDQFTPNSNNKPKMWTENWSGWFLSFGGAVPYRPVEDLAFAVARFFQRGGTF 287
I+TCNGFYC+ F PN KPK+WTENWSGW+ +FGG PYRP ED+AF+VARF Q GG+
Sbjct: 723 IDTCNGFYCENFKPNQIYKPKIWTENWSGWYTAFGGPTPYRPPEDVAFSVARFIQNGGSL 782
Query: 288 QNYYMYHGGTNFDRTSGGPFISTSYDYDAPLDEYGLIRQPKWGHLKDLHKAIKLCEAALV 347
NYYMYHGGTNF RTSG F++TSYD+DAP+DEYGL+R+PKWGHL+DLHKAIKLCE ALV
Sbjct: 783 VNYYMYHGGTNFGRTSG-LFVTTSYDFDAPIDEYGLLREPKWGHLRDLHKAIKLCEPALV 841
Query: 348 ATDPTYPSLGPNLEATVYKTGSGLCSAFLANIGTNSDVTVKFNGNSYLLPAWSVSILPDC 407
+ DPT LG + EA V+K+ SG C+AFLAN T++ V V F + Y LP WS+SILPDC
Sbjct: 842 SADPTSTWLGKDQEARVFKSSSGACAAFLANYDTSAFVRVNFWNHPYDLPPWSISILPDC 901
Query: 408 KNVVFNTAKINSVTLVPSFSRQSLQVAADSSDAIGSGW--SYINEPVGISKDDAFTKPGL 465
K V FNTA++ P +L +A + I S W SY EP D TK GL
Sbjct: 902 KTVTFNTARVRR---DPKLFIPNLLMAKMT--PISSFWWLSYKEEPASAYAKDTTTKDGL 956
Query: 466 LEQINTTADQSDYLWYSLSTNIKADEPLLEDGSKTVLHVQSLGHALHAFINGKLVGSGYG 525
+EQ++ T D +DYLWY I + E L+ G +L V S GH LH FING+L GS YG
Sbjct: 957 VEQVSVTWDTTDYLWYMTDIRIDSTEGFLKSGQWPLLTVNSAGHILHVFINGQLSGSVYG 1016
Query: 526 SSSNAKVTVDFPIALAPGKNTFDLLSLTVGLQNYGAFYEKTGAGITGPVQLKGSGNGTNI 585
S + ++T + L G N +LS+TVGL N G ++ AG+ GPV LKG GT
Sbjct: 1017 SLEDPRITFSKYVNLKQGVNKLSMLSVTVGLPNVGLHFDTWNAGVLGPVTLKGLNEGTR- 1075
Query: 586 DLSSQQWTYQTGLKGEELNFPS---GSSTQWDSKSTLPKLQPLVWYKTTFDAPAGSEPVA 642
D+S +W+Y+ GL+GE LN S +S QW K + K QPL WYKTTF+ PAG+EP+A
Sbjct: 1076 DMSKYKWSYKVGLRGEILNLYSVKGSNSVQW-MKGSFQK-QPLTWYKTTFNTPAGNEPLA 1133
Query: 643 IDFTGMGKGEAWVNGQSIGRYWPTYVSQNGGCTDSCNYRGAYSSNKCLKNCGKPSQSLYH 702
+D + M KG+ WVNG+SIGRY+P Y++ +G C + C+Y G ++ KCL NCG PSQ YH
Sbjct: 1134 LDMSSMSKGQIWVNGRSIGRYFPGYIA-SGKC-NKCSYTGFFTEKKCLWNCGGPSQKWYH 1191
Query: 703 VPRSWLKSSGNTLVLFEEIGGDPTKISFVTK 733
+PR WL +GN L++ EEIGG+P IS V +
Sbjct: 1192 IPRDWLSPNGNLLIILEEIGGNPQGISLVKR 1222
>gi|242053381|ref|XP_002455836.1| hypothetical protein SORBIDRAFT_03g025990 [Sorghum bicolor]
gi|241927811|gb|EES00956.1| hypothetical protein SORBIDRAFT_03g025990 [Sorghum bicolor]
Length = 785
Score = 911 bits (2355), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 449/805 (55%), Positives = 554/805 (68%), Gaps = 22/805 (2%)
Query: 44 ISGSIHYPRSTPEMWPDLIQKSKDGGLDVIETYVFWNLHEPVRNQYNFEGRYDLVKFVKL 103
+SGS+HYPRS PEMWPDLIQK+KDGGLDV++TYVFWN HEP R QY FEGRYDLV F+KL
Sbjct: 1 MSGSVHYPRSVPEMWPDLIQKAKDGGLDVVQTYVFWNGHEPSRGQYYFEGRYDLVHFIKL 60
Query: 104 VAEAGLYAHLRIGPYVCAEWNFGGFPLWLHFIPGIQFRTDNEPFKAEMQRFTAKIVDMMK 163
V +AGLY HLRIGPYVCAEWNFGGFP+WL ++PGI FRTDNEPFKAEMQ+FT KIVDMMK
Sbjct: 61 VKQAGLYVHLRIGPYVCAEWNFGGFPVWLKYVPGISFRTDNEPFKAEMQKFTTKIVDMMK 120
Query: 164 QEKLYASQGGPIILSQIENEYGNIDSAYGAAGKSYIKWAAGMALSLDTGVPWVMCQQSDA 223
E L+ QGGPIILSQIENE+G ++ G K+Y WAA MA++L+T VPWVMC++ DA
Sbjct: 121 SEGLFEWQGGPIILSQIENEFGPLEWDQGEPAKAYASWAANMAVALNTSVPWVMCKEDDA 180
Query: 224 PDPIINTCNGFYCDQFTPNSNNKPKMWTENWSGWFLSFGGAVPYRPVEDLAFAVARFFQR 283
PDPIINTCNGFYCD F+PN +KP MWTE W+ W+ FG VP+RPVEDLA+ VA+F Q+
Sbjct: 181 PDPIINTCNGFYCDWFSPNKPHKPTMWTEAWTSWYTGFGIPVPHRPVEDLAYGVAKFIQK 240
Query: 284 GGTFQNYYMYHGGTNFDRTSGGPFISTSYDYDAPLDEYGLIRQPKWGHLKDLHKAIKLCE 343
GG+F NYYMYHGGTNF RT+GGPFI+TSYDYDAP+DEYGL+R+PKWGHLK+LHKAIKLCE
Sbjct: 241 GGSFVNYYMYHGGTNFGRTAGGPFIATSYDYDAPIDEYGLLREPKWGHLKELHKAIKLCE 300
Query: 344 AALVATDPTYPSLGPNLEATVYKTGSGLCSAFLANIGTNSDVTVKFNGNSYLLPAWSVSI 403
ALVA DP SLG +A+V+++ + C AFL N S V FNG Y LP WS+SI
Sbjct: 301 PALVAGDPIVTSLGNAQQASVFRSSTDACVAFLENKDKVSYARVSFNGMHYNLPPWSISI 360
Query: 404 LPDCKNVVFNTAKINSVTLVPSFSRQSLQVAADSSDAIGSGWSYINEPVGISKDDAFTKP 463
LPDCK V+NTA++ S S+ ++ A G W NE + D++F
Sbjct: 361 LPDCKTTVYNTARVGS-----QISQMKMEWAG------GFTWQSYNEDINSLGDESFVTV 409
Query: 464 GLLEQINTTADQSDYLWYSLSTNIKADEPLLEDGSKTVLHVQSLGHALHAFINGKLVGSG 523
GLLEQIN T D +DYLWY+ ++ DE L +G VL V S GHALH F+NG+L G+
Sbjct: 410 GLLEQINVTRDNTDYLWYTTYVDVAQDEQFLSNGKNPVLTVMSAGHALHIFVNGQLTGTV 469
Query: 524 YGSSSNAKVTVDFPIALAPGKNTFDLLSLTVGLQNYGAFYEKTGAGITGPVQLKGSGNGT 583
YGS + K+T + L PG NT LS+ VGL N G +E AGI GPV L G G
Sbjct: 470 YGSVDDPKLTYRGNVKLWPGSNTISCLSIAVGLPNVGEHFETWNAGILGPVTLDGLNEGR 529
Query: 584 NIDLSSQQWTYQTGLKGEELNFPSGSSTQWDSKSTLPKLQPLVWYKTTFDAPAGSEPVAI 643
DL+ Q+WTY+ GLKGE+L+ S S + + QPL WYK F+AP G EP+A+
Sbjct: 530 R-DLTWQKWTYKVGLKGEDLSLHSLSGSSSVEWGEPMQKQPLTWYKAFFNAPDGDEPLAL 588
Query: 644 DFTGMGKGEAWVNGQSIGRYWPTYVSQNGGCTDSCNYRGAYSSNKCLKNCGKPSQSLYHV 703
D + MGKG+ W+NGQ IGRYWP Y + G C+YRG Y KC NCG SQ YHV
Sbjct: 589 DMSSMGKGQIWINGQGIGRYWPGY--KASGTCGICDYRGEYDEKKCQTNCGDSSQRWYHV 646
Query: 704 PRSWLKSSGNTLVLFEEIGGDPTKISFVTKQLGSSLCSHVTDSHPLPVDMWGSDSKIQRK 763
PRSWL +GN LV+FEE GGDPT IS V + G S+C+ V++ P + D + +
Sbjct: 647 PRSWLNPTGNLLVIFEEWGGDPTGISMVKRTTG-SICADVSEWQPSMTNWRTKDYEKAK- 704
Query: 764 PGPVLSLECPNPNQVISSIKFASFGTPLGTCGSFSRGRCSSARSLSVVRQACVGSKSCSI 823
+ L+C + + ++ IKFASFGTP G+CGS+S G C + +S + + C+G + C +
Sbjct: 705 ----IHLQC-DHGRKMTDIKFASFGTPQGSCGSYSEGGCHAHKSYDIFWKNCIGQERCGV 759
Query: 824 GVSVNTF-GDPCKGVMKSLAVEASC 847
V N F GDPC G MK VEA C
Sbjct: 760 SVVPNVFGGDPCPGTMKRAVVEAIC 784
>gi|449489867|ref|XP_004158444.1| PREDICTED: LOW QUALITY PROTEIN: beta-galactosidase-like [Cucumis
sativus]
Length = 725
Score = 910 bits (2353), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 441/731 (60%), Positives = 528/731 (72%), Gaps = 16/731 (2%)
Query: 8 LLVLCWGFVVLATTSFGANVTYDHRAVVIGGKRRVLISGSIHYPRSTPEMWPDLIQKSKD 67
++V+ G + +S A+VTYDH+A++I G+RR+LISGSIHYPRS P+MWPDLIQK+KD
Sbjct: 7 IMVVFLGLFLWVCSSVMASVTYDHKAIIINGRRRILISGSIHYPRSIPQMWPDLIQKAKD 66
Query: 68 GGLDVIETYVFWNLHEPVRNQYNFEGRYDLVKFVKLVAEAGLYAHLRIGPYVCAEWNFGG 127
GGLDVIETYVFWN HEP QYNFE RYDLV+FVKLV +AGLY HLRIGPYVCAEWNFGG
Sbjct: 67 GGLDVIETYVFWNGHEPSPGQYNFEDRYDLVRFVKLVHQAGLYVHLRIGPYVCAEWNFGG 126
Query: 128 FPLWLHFIPGIQFRTDNEPFKAEMQRFTAKIVDMMKQEKLYASQGGPIILSQIENEYGNI 187
FP+WL ++PGI FRTDN PFKA MQ+FT KIV +MK EKLY SQGGPIILSQIENEYG +
Sbjct: 127 FPVWLKYVPGIAFRTDNGPFKAAMQKFTEKIVGLMKGEKLYESQGGPIILSQIENEYGPV 186
Query: 188 DSAYGAAGKSYIKWAAGMALSLDTGVPWVMCQQSDAPDPIINTCNGFYCDQFTPNSNNKP 247
+ GA GKSY KWAA MAL L+TGVPWVMC+Q DAPDP+I+TCNGFYC+ F PN KP
Sbjct: 187 EWEIGAPGKSYTKWAAQMALGLNTGVPWVMCKQDDAPDPVIDTCNGFYCENFKPNKVYKP 246
Query: 248 KMWTENWSGWFLSFGGAVPYRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNFDRTSGGPF 307
KMWTE W+GWF FGG PYRPVED+A++VARF Q GG+F NYYMYHGGTNF RT+GGPF
Sbjct: 247 KMWTEAWTGWFTEFGGPAPYRPVEDMAYSVARFIQNGGSFINYYMYHGGTNFGRTAGGPF 306
Query: 308 ISTSYDYDAPLDEYGLIRQPKWGHLKDLHKAIKLCEAALVATDPTYPSLGPNLEATVYKT 367
I+TSYDYDAP+DEYGL+R+PKW HL+DLHKAIKLCE ALV+ DPT LG N EA V+KT
Sbjct: 307 IATSYDYDAPIDEYGLLREPKWSHLRDLHKAIKLCEPALVSVDPTVSYLGSNQEAHVFKT 366
Query: 368 GSGLCSAFLANIGTNSDVTVKFNGNSYLLPAWSVSILPDCKNVVFNTAKINSVTLVPSFS 427
SG C+AFLAN +S TV F N Y LP WSVSILPDCK+V+FNTAK+ + T P +
Sbjct: 367 RSGSCAAFLANYDASSSATVTFGNNQYDLPPWSVSILPDCKSVIFNTAKVGAPTSQPKMT 426
Query: 428 RQSLQVAADSSDAIGSGWSYINEPVGISKDDAFTKPGLLEQINTTADQSDYLWYSLSTNI 487
S S SY E +D T GL+EQI+ T D +DYLWY I
Sbjct: 427 PVSSF----------SWLSYNEETASAYTEDTTTMAGLVEQISVTRDSTDYLWYMTDIRI 476
Query: 488 KADEPLLEDGSKTVLHVQSLGHALHAFINGKLVGSGYGSSSNAKVTVDFPIALAPGKNTF 547
+E L+ G +L V S GHALH FING+L G+ YG S N K+T + L G N
Sbjct: 477 DPNEGFLKSGQWPLLTVFSAGHALHVFINGQLSGTTYGGSENYKLTFSKYVNLRAGINKL 536
Query: 548 DLLSLTVGLQNYGAFYEKTGAGITGPVQLKGSGNGTNIDLSSQQWTYQTGLKGEELNFPS 607
+LS+ VGL N G YE G+ GPV LKG T D+S +W+Y+ GLKGE LN S
Sbjct: 537 SILSVAVGLPNGGLHYETWNTGVLGPVTLKGLNEDTR-DMSGYKWSYKIGLKGEALNLHS 595
Query: 608 ---GSSTQWDSKSTLPKLQPLVWYKTTFDAPAGSEPVAIDFTGMGKGEAWVNGQSIGRYW 664
SS +W + S + + QPL WYKTTFD+P G+EP+A+D + MGKG+ W+NGQSIGR+W
Sbjct: 596 VSGSSSVEWVTGSLVAQKQPLTWYKTTFDSPKGNEPLALDMSSMGKGQIWINGQSIGRHW 655
Query: 665 PTYVSQNGGCTDSCNYRGAYSSNKCLKNCGKPSQSLYHVPRSWLKSSGNTLVLFEEIGGD 724
P Y ++ G CNY G ++ KC CG+PSQ YHVPR+WLKSSGN LV+FEE GG+
Sbjct: 656 PAYTAK--GSCGKCNYGGIFNEKKCHSXCGEPSQRWYHVPRAWLKSSGNVLVIFEEWGGN 713
Query: 725 PTKISFVTKQL 735
P IS V + +
Sbjct: 714 PEGISLVKRSI 724
>gi|302782774|ref|XP_002973160.1| hypothetical protein SELMODRAFT_413650 [Selaginella moellendorffii]
gi|300158913|gb|EFJ25534.1| hypothetical protein SELMODRAFT_413650 [Selaginella moellendorffii]
Length = 805
Score = 910 bits (2352), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 453/826 (54%), Positives = 565/826 (68%), Gaps = 43/826 (5%)
Query: 26 NVTYDHRAVVIGGKRRVLISGSIHYPRSTPEMWPDLIQKSKDGGLDVIETYVFWNLHEPV 85
NV+YDHR++++ GKRR+L+SGS+HYPR+TPEMWP +IQK+K+GGLDVIETYVFW+ HEP
Sbjct: 19 NVSYDHRSLILNGKRRILLSGSVHYPRATPEMWPGIIQKAKEGGLDVIETYVFWDRHEPS 78
Query: 86 RNQYNFEGRYDLVKFVKLVAEAGLYAHLRIGPYVCAEWNFGGFPLWLHFIPGIQFRTDNE 145
QY FEGRYDLVKFVKLV +AGL +LRIGPYVCAEWN GGFP+WL IP I FRTDNE
Sbjct: 79 PGQYYFEGRYDLVKFVKLVQQAGLLMNLRIGPYVCAEWNLGGFPIWLRDIPHIVFRTDNE 138
Query: 146 PFKAEMQRFTAKIVDMMKQEKLYASQGGPIILSQIENEYGNIDSAYGAAGKSYIKWAAGM 205
PFK MQ F KIV+MMK+E L+ASQGGPIIL+Q+ENEYGN+DS YG AG YI WAA M
Sbjct: 139 PFKKYMQSFLTKIVNMMKEENLFASQGGPIILAQVENEYGNVDSHYGEAGVRYINWAAEM 198
Query: 206 ALSLDTGVPWVMCQQSDAPDPIINTCNGFYCDQFTPNSNNKPKMWTENWSGWFLSFGGAV 265
A + +TGVPW+MC QS P+ II+TCNG YCD + P KP MWTE+++GWF +G +
Sbjct: 199 AQAQNTGVPWIMCAQSKVPEYIIDTCNGMYCDGWNPILYKKPTMWTESYTGWFTYYGWPI 258
Query: 266 PYRPVEDLAFAVARFFQRGGTFQNYYM--YHGGTNFDRTSGGPFISTSYDYDAPLDEYGL 323
P+RPVED+AFAVARFF+RGG+F NYYM Y GGTNF RTSGGP++++SYDYDAPLDEYG+
Sbjct: 259 PHRPVEDIAFAVARFFERGGSFHNYYMVWYFGGTNFGRTSGGPYVASSYDYDAPLDEYGM 318
Query: 324 IRQPKWGHLKDLHKAIKLCEAALVATDPTYPSLGPNLEATVYKTGSGLCSAFLANIGTNS 383
PKWGHLKDLH+ +KL E +++++ + LGPN EA VY G+G C AFLAN+ + +
Sbjct: 319 QHLPKWGHLKDLHETLKLGEEVILSSEGQHSELGPNQEAHVYSYGNG-CVAFLANVDSMN 377
Query: 384 DVTVKFNGNSYLLPAWSVSILPDCKNVVFNTAKINSVTLVPSFSRQSLQVAADSSDAIGS 443
D V+F SY LPAWSVSIL DCK V FN+AK+ S + V S S ++
Sbjct: 378 DTVVEFRNVSYSLPAWSVSILLDCKTVAFNSAKVKSQSAVVSMSPSKSTLS--------- 428
Query: 444 GWSYINEPVGISKDDAFTKPGLLEQINTTADQSDYLWYSLSTNIKADEPLLEDGSKTVLH 503
W+ +EPVGIS +F LLEQ+ TT D SDYLWY+ S T L
Sbjct: 429 -WTSFDEPVGIS-GSSFKAKQLLEQMETTKDTSDYLWYTTSVEATGT-------GSTWLS 479
Query: 504 VQSLGHALHAFINGKLVGSGYGSSSNAKVTVDFPIALAPGKNTFDLLSLTVGLQNYGAFY 563
++S+ +H F+NG+ S + S S +V+ PI LAPG NT LLS TVGLQN+GAF
Sbjct: 480 IESMRDVVHIFVNGQFQSSWHTSKSVLYNSVEAPITLAPGSNTIALLSATVGLQNFGAFI 539
Query: 564 EKTGAGITGPVQLKGSGNGTNIDLSSQQWTYQTGLKGEELNFPSGSSTQWDSKSTLPKLQ 623
E AG++G + LKG G +LS Q+WTYQ GLKGE+L + ++ + S + +
Sbjct: 540 ETWSAGLSGSLILKGLPGGDQ-NLSKQEWTYQVGLKGEDLKLFTVEGSRSVNWSAVSTEK 598
Query: 624 PLVWYKTTFDAPAGSEPVAIDFTGMGKGEAWVNGQSIGRYWPTYVSQNGGCTDSCNYRGA 683
PL WY T FDAP G +PVA+D MGKG+AWVNGQSIGRYWP Y + + C +SC+YRG+
Sbjct: 599 PLTWYMTEFDAPPGDDPVALDLASMGKGQAWVNGQSIGRYWPAYKAADSVCPESCDYRGS 658
Query: 684 YSSNKCLKNCGKPSQSLYHVPRSWLKSSGNTLVLFEEIGGDPTKISFVTKQLGSSLCSHV 743
Y NKCL CG+ SQ YHVPRSW+K GN LVLFEE GGDP+ I FVT+ + +C+ V
Sbjct: 659 YDQNKCLTGCGQSSQRWYHVPRSWMKPRGNLLVLFEETGGDPSSIDFVTRST-NVICARV 717
Query: 744 TDSHPLPVDMWGSDSKIQRKPGPVLSLECPNPNQVISSIKFASFGTPLGTCGSFSRGRCS 803
+SHP V +W CP QVIS I+FAS G P G+CGSF G C
Sbjct: 718 YESHPASVKLW-----------------CPGEKQVISQIRFASLGNPEGSCGSFKEGSCH 760
Query: 804 SARSLSVVRQACVGSKSCSIGVSVNTFGDPCKGVM-KSLAVEASCT 848
+ + V +ACVG +SCS+ C GV K LAVEA C+
Sbjct: 761 TNDLSNTVEKACVGQRSCSLAPDFTI--SACPGVREKFLAVEALCS 804
>gi|218188525|gb|EEC70952.1| hypothetical protein OsI_02561 [Oryza sativa Indica Group]
Length = 822
Score = 907 bits (2345), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 456/825 (55%), Positives = 568/825 (68%), Gaps = 30/825 (3%)
Query: 27 VTYDHRAVVIGGKRRVLISGSIHYPRSTPEMWPDLIQKSKDGGLDVIETYVFWNLHEPVR 86
+TYD +AVV+ G+RR+LISGSIHYPRSTPEMWPDLI+K+KDGGLDV++TYVFWN HEP
Sbjct: 23 LTYDRKAVVVNGQRRILISGSIHYPRSTPEMWPDLIEKAKDGGLDVVQTYVFWNGHEPSP 82
Query: 87 NQYNFEGRYDLVKFVKLVAEAGLYAHLRIGPYVCAEWNFGGFPLWLHFIPGIQFRTDNEP 146
QY FEGRYDLV F+KLV +AGLY +LRIGPYVCAEWNFGGFP+WL ++PGI FRTDNEP
Sbjct: 83 GQYYFEGRYDLVHFIKLVKQAGLYVNLRIGPYVCAEWNFGGFPVWLKYVPGISFRTDNEP 142
Query: 147 FKAEMQRFTAKIVDMMKQEKLYASQGGPIILSQIENEYGNIDSAYGAAGKSYIKWAAGMA 206
FKAEMQ+FT KIV+MMK E L+ QGGPIILSQIENE+G ++ G K+Y WAA MA
Sbjct: 143 FKAEMQKFTTKIVEMMKSEGLFEWQGGPIILSQIENEFGPLEWDQGEPAKAYASWAANMA 202
Query: 207 LSLDTGVPWVMCQQSDAPDPIINTCNGFYCDQFTPNSNNKPKMWTENWSGWFLSFGGAVP 266
++L+TGVPW+MC++ DAPDPIINTCNGFYCD F+PN +KP MWTE W+ W+ FG VP
Sbjct: 203 VALNTGVPWIMCKEDDAPDPIINTCNGFYCDWFSPNKPHKPTMWTEAWTAWYTGFGIPVP 262
Query: 267 YRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNFDRTSGGPFISTSYDYDAPLDEYGLIRQ 326
+RPVEDLA+ VA+F Q+GG+F NYYM+HGGTNF RT+GGPFI+TSYDYDAP+DEYGL+R+
Sbjct: 263 HRPVEDLAYGVAKFIQKGGSFVNYYMFHGGTNFGRTAGGPFIATSYDYDAPIDEYGLLRE 322
Query: 327 PKWGHLKDLHKAIKLCEAALVATDPTYPSLGPNLEATVYKTGSGLCSAFLANIGTNSDVT 386
PKWGHLK LHKAIKLCE ALVA DP SLG +++V+++ +G C+AFL N S
Sbjct: 323 PKWGHLKQLHKAIKLCEPALVAGDPIVTSLGNAQKSSVFRSSTGACAAFLDNKDKVSYAR 382
Query: 387 VKFNGNSYLLPAWSVSILPDCKNVVFNTAKINSVTLVPSFSRQSLQVAADSSDAIGSGWS 446
V FNG Y LP WS+SILPDCK VFNTA++ S S+ ++ A G W
Sbjct: 383 VAFNGMHYDLPPWSISILPDCKTTVFNTARVGS-----QISQMKMEWAG------GFAWQ 431
Query: 447 YINEPVGISKDDAFTKPGLLEQINTTADQSDYLWYSLSTNIKADEPLLEDGSKTVLHVQS 506
NE + +D FT GLLEQIN T D +DYLWY+ ++ D+ L +G L V
Sbjct: 432 SYNEEINSFGEDPFTTVGLLEQINVTRDNTDYLWYTTYVDVAQDDQFLSNGENPKLTV-- 489
Query: 507 LGHALHAFINGKLVGSGYGSSSNAKVTVDFPIALAPGKNTFDLLSLTVGLQNYGAFYEKT 566
+ + + L G+ YGS + K+T + L G NT LS+ VGL N G +E
Sbjct: 490 MCFLILNILFNLLAGTVYGSVDDPKLTYTGNVKLWAGSNTISCLSIAVGLPNVGEHFETW 549
Query: 567 GAGITGPVQLKGSGNGTNIDLSSQQWTYQTGLKGEELNFP--SGSST-QWDSKSTLPKLQ 623
AGI GPV L G G DL+ Q+WTYQ GLKGE ++ SGSST +W + Q
Sbjct: 550 NAGILGPVTLDGLNEGRR-DLTWQKWTYQVGLKGESMSLHSLSGSSTVEWGEPV---QKQ 605
Query: 624 PLVWYKTTFDAPAGSEPVAIDFTGMGKGEAWVNGQSIGRYWPTYVSQNGGCTDSCNYRGA 683
PL WYK F+AP G EP+A+D + MGKG+ W+NGQ IGRYWP Y + +G C +C+YRG
Sbjct: 606 PLTWYKAFFNAPDGDEPLALDMSSMGKGQIWINGQGIGRYWPGYKA-SGNC-GTCDYRGE 663
Query: 684 YSSNKCLKNCGKPSQSLYHVPRSWLKSSGNTLVLFEEIGGDPTKISFVTKQLGSSLCSHV 743
Y KC NCG SQ YHVPRSWL +GN LV+FEE GGDPT IS V + +G S+C+ V
Sbjct: 664 YDETKCQTNCGDSSQRWYHVPRSWLSPTGNLLVIFEEWGGDPTGISMVKRSIG-SVCADV 722
Query: 744 TDSHPLPVDMWGSDSKIQRKPGPVLSLECPNPNQVISSIKFASFGTPLGTCGSFSRGRCS 803
++ P + W + + K + L+C N Q I+ IKFASFGTP G+CGS+S G C
Sbjct: 723 SEWQP-SMKNWHTKDYEKAK----VHLQCDN-GQKITEIKFASFGTPQGSCGSYSEGGCH 776
Query: 804 SARSLSVVRQACVGSKSCSIGVSVNTF-GDPCKGVMKSLAVEASC 847
+ +S + + CVG + C + V F GDPC G MK VEA C
Sbjct: 777 AHKSYDIFWKNCVGQERCGVSVVPEIFGGDPCPGTMKRAVVEAIC 821
>gi|224128630|ref|XP_002329051.1| predicted protein [Populus trichocarpa]
gi|222839722|gb|EEE78045.1| predicted protein [Populus trichocarpa]
Length = 830
Score = 907 bits (2344), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 461/842 (54%), Positives = 565/842 (67%), Gaps = 32/842 (3%)
Query: 15 FVVLATTSFGANVTYDHRAVVIGGKRRVLISGSIHYPRSTPEMWPDLIQKSKDGGLDVIE 74
F+ S A+V+YD +A+ I G+RR+LISGSIHYPRS+PEMWPDLIQK+K+GGLDVI+
Sbjct: 13 FLASLVCSVTASVSYDSKAITINGQRRILISGSIHYPRSSPEMWPDLIQKAKEGGLDVIQ 72
Query: 75 TYVFWNLHEPVRNQYNFEGRYDLVKFVKLVAEAGLYAHLRIGPYVCAEWNFGGFPLWLHF 134
TYVFWN HEP +Y FEG YDLVKFVKLV EAGLY +LRIGPY+CAEWNFG
Sbjct: 73 TYVFWNGHEPSPGKYYFEGNYDLVKFVKLVKEAGLYVNLRIGPYICAEWNFGH------- 125
Query: 135 IPGIQFRTDNEPFK---AEMQRFTAKIVDMMKQEKLYASQGGPIILSQIENEYGNIDSAY 191
QF+ PF+ A+M++FT KIV+MMK E+L+ SQGGPIILSQIENEYG ++
Sbjct: 126 ----QFQNGQWPFQGEAAQMRKFTTKIVNMMKAERLFESQGGPIILSQIENEYGPMEYEL 181
Query: 192 GAAGKSYIKWAAGMALSLDTGVPWVMCQQSDAPDPIINTCNGFYCDQFTPNSNNKPKMWT 251
G+ G++Y KWAA MA+ L TGVPWVMC+Q DAPDPIINTCNGFYCD F+PN KPKMWT
Sbjct: 182 GSPGQAYTKWAAQMAVGLRTGVPWVMCKQDDAPDPIINTCNGFYCDYFSPNKAYKPKMWT 241
Query: 252 ENWSGWFLSFGGAVPYRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNFDRTSGGPFISTS 311
E W+GWF FGG VP+RP ED+AF+VARF Q+GG+F NYYMYHGGTNF RT+GGPFI+TS
Sbjct: 242 EAWTGWFTQFGGPVPHRPAEDMAFSVARFIQKGGSFINYYMYHGGTNFGRTAGGPFIATS 301
Query: 312 YDYDAPLDEYGLIRQPKWGHLKDLHKAIKLCEAALVATDPTYPSLGPNLEATVYKTGSGL 371
YDYDAPLDEYGL+RQPKWGHLKDLH+AIKLCE ALV+ D T LG EA V+ +G
Sbjct: 302 YDYDAPLDEYGLLRQPKWGHLKDLHRAIKLCEPALVSGDATVIPLGNYQEAHVFNYKAGG 361
Query: 372 CSAFLANIGTNSDVTVKFNGNSYLLPAWSVSILPDCKNVVFNTAKINSVTLVPSFSRQSL 431
C+AFLAN S V F Y LP WS+SILPDCKN V+NTA++ + + + +
Sbjct: 362 CAAFLANYHQRSFAKVSFRNMHYNLPPWSISILPDCKNTVYNTARVGAQSATIKMTPVPM 421
Query: 432 QVAADSSDAIGSGWSYINEPVGISKDDAFTKPGLLEQINTTADQSDYLWYSLSTNIKADE 491
G W NE S D+ FT GLLEQINTT D SDYLWY +I E
Sbjct: 422 HG--------GLSWQTYNEEPSSSGDNTFTMVGLLEQINTTRDVSDYLWYMTDVHIDPSE 473
Query: 492 PLLEDGSKTVLHVQSLGHALHAFINGKLVGSGYGSSSNAKVTVDFPIALAPGKNTFDLLS 551
L+ G VL V S GHALH FING+L G+ YGS K+T ++L G N LLS
Sbjct: 474 GFLKSGKYPVLTVLSAGHALHVFINGQLSGTAYGSLDFPKLTFSQGVSLRAGVNKISLLS 533
Query: 552 LTVGLQNYGAFYEKTGAGITGPVQLKGSGNGTNIDLSSQQWTYQTGLKGEEL---NFPSG 608
+ VGL N G +E AGI GPV L G G +DLS Q+W+Y+ GL GE L +
Sbjct: 534 IAVGLPNVGPHFETWNAGILGPVTLNGLNEG-RMDLSWQKWSYKIGLHGEALSLHSISGS 592
Query: 609 SSTQWDSKSTLPKLQPLVWYKTTFDAPAGSEPVAIDFTGMGKGEAWVNGQSIGRYWPTYV 668
SS +W S + + QPL WYKTTF+APAG+ P+A+D MGKG+ W+NGQ +GR+WP Y
Sbjct: 593 SSVEWAEGSLVAQKQPLSWYKTTFNAPAGNSPLALDMGSMGKGQIWINGQHVGRHWPAY- 651
Query: 669 SQNGGCTDSCNYRGAYSSNKCLKNCGKPSQSLYHVPRSWLKSSGNTLVLFEEIGGDPTKI 728
+ G C Y G Y+ NKC NCG+ SQ YHVP+SWLK +GN LV+FEE GGDP +
Sbjct: 652 -KASGTCGECTYIGTYNENKCSTNCGEASQRWYHVPQSWLKPTGNLLVVFEEWGGDPNGV 710
Query: 729 SFVTKQLGSSLCSHVTDSHPLPVD-MWGSDSKIQRKPGPVLSLECPNPNQVISSIKFASF 787
S V +++ S+C+ + + P ++ + K+ + P L C P Q I SIKFASF
Sbjct: 711 SLVRREV-DSVCADIYEWQPTLMNYQMQASGKVNKPLRPKAHLSC-GPGQKIRSIKFASF 768
Query: 788 GTPLGTCGSFSRGRCSSARSLSVVRQACVGSKSCSIGVSVNTF-GDPCKGVMKSLAVEAS 846
GTP G CGS+++G C + S CVG SCS+ V+ F GDPC VMK LA EA
Sbjct: 769 GTPEGVCGSYNQGSCHAFHSYDAFNNLCVGQNSCSVTVAPEMFGGDPCPSVMKKLAAEAI 828
Query: 847 CT 848
C+
Sbjct: 829 CS 830
>gi|449464712|ref|XP_004150073.1| PREDICTED: beta-galactosidase 3-like [Cucumis sativus]
Length = 848
Score = 905 bits (2339), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 446/851 (52%), Positives = 570/851 (66%), Gaps = 27/851 (3%)
Query: 6 ILLLVLCWGFVVLATTSFGANVTYDHRAVVIGGKRRVLISGSIHYPRSTPEMWPDLIQKS 65
++ LCW T NVTYD +A++I G+R++L SGSIHYPRS P+MW LI+K+
Sbjct: 11 VVFFFLCWSLHFQLTNC--ENVTYDGKALIINGQRKILFSGSIHYPRSVPDMWESLIEKA 68
Query: 66 KDGGLDVIETYVFWNLHEPVRNQYNFEGRYDLVKFVKLVAEAGLYAHLRIGPYVCAEWNF 125
K GGLDV++TYVFWNLHEP Y+FEGR DLVKF+KLV +AGLY HLRIGPY+C EWNF
Sbjct: 69 KMGGLDVVDTYVFWNLHEPSPGIYDFEGRNDLVKFIKLVEKAGLYVHLRIGPYICGEWNF 128
Query: 126 GGFPLWLHFIPGIQFRTDNEPFKAEMQRFTAKIVDMMKQEKLYASQGGPIILSQIENEYG 185
GGFP WL F+PGI FRTDNEPFK M +FT KIV MMK E+L+ SQGGPIILSQIENEY
Sbjct: 129 GGFPAWLKFVPGISFRTDNEPFKLAMAKFTKKIVQMMKDERLFQSQGGPIILSQIENEYE 188
Query: 186 NIDSAYGAAGKSYIKWAAGMALSLDTGVPWVMCQQSDAPDPIINTCNGFYCDQFTPNSNN 245
D +G AG +Y+ WAA MA+ +DTGVPWVMC+Q DAPDP+INTCNGFYCD F+PN
Sbjct: 189 TEDKVFGEAGFAYMNWAAKMAVQMDTGVPWVMCKQDDAPDPMINTCNGFYCDYFSPNKPY 248
Query: 246 KPKMWTENWSGWFLSFGGAVPYRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNFDRTSGG 305
KP WTE W+ WF +FGG RPVEDLAF VARF Q+GG+ NYYMYHGGTNF RT+GG
Sbjct: 249 KPNFWTEAWTAWFNNFGGPNHKRPVEDLAFGVARFIQKGGSLVNYYMYHGGTNFGRTAGG 308
Query: 306 PFISTSYDYDAPLDEYGLIRQPKWGHLKDLHKAIKLCEAALVATDPTYPSLGPNLEATVY 365
PFI+TSYDYDAP+DEYGLIRQPK+GHLK LH A+KLCE AL+ +P +L +A V+
Sbjct: 309 PFITTSYDYDAPIDEYGLIRQPKFGHLKRLHDAVKLCEKALLTGEPHDYTLATYQKAKVF 368
Query: 366 KTGSGLCSAFLANIGTNSDVTVKFNGNSYLLPAWSVSILPDCKNVVFNTAKINSVTLVPS 425
+ SG C+AFL+N +N+ V FNG Y LP WS+SILPDCK+V++NTA++ T S
Sbjct: 369 SSSSGDCAAFLSNYHSNNTARVTFNGRHYTLPPWSISILPDCKSVIYNTAQVQVQTNQLS 428
Query: 426 FSRQSLQVAADSSDAIGSGWSYINEPV-GISKDDAFTKPGLLEQINTTADQSDYLWYSLS 484
F ++ + W NE + I +D + + GLLEQ+ T D SDYLWY+ S
Sbjct: 429 FLPTKVESFS---------WETYNENISSIEEDSSMSYDGLLEQLTITKDNSDYLWYTTS 479
Query: 485 TNIKADEPLLEDGSKTVLHVQSLGHALHAFINGKLVGSGYGSSSNAKVTVDFPIALAPGK 544
N+ +E L G L S GH +H FINGKL GS +G+ N+K T I L G
Sbjct: 480 VNVDPNESYLRGGKFPTLTATSKGHGMHVFINGKLAGSSFGTHDNSKFTFTGRINLQAGV 539
Query: 545 NTFDLLSLTVGLQNYGAFYEKTGAGITGPVQLKGSGNGTNIDLSSQQWTYQTGLKGEELN 604
N LLS+ GL N G YE+ G+ GPV + G G +DLS Q+W+Y+ GLKGE +N
Sbjct: 540 NKVSLLSIAGGLPNNGPHYEEREMGVLGPVAIHGLDKG-KMDLSRQKWSYKVGLKGENMN 598
Query: 605 FPSGSSTQ---WDSKSTLPK--LQPLVWYKTTFDAPAGSEPVAIDFTGMGKGEAWVNGQS 659
S SS Q W +K +L + QPL WYK FDAP G EP+A+D M KG+ W+NGQ+
Sbjct: 599 LGSPSSVQAVDW-AKDSLKQENAQPLTWYKAYFDAPEGDEPLALDMGSMQKGQVWINGQN 657
Query: 660 IGRYWPTYVSQNGGCTDSCNYRGAYSSNKCLKNCGKPSQSLYHVPRSWLKSSGNTLVLFE 719
+GRYW ++ NG CTD C+Y G Y KC CG+P+Q YHVPRSWL + N +V+FE
Sbjct: 658 VGRYWT--ITANGNCTD-CSYSGTYRPRKCQFGCGQPTQQWYHVPRSWLMPTKNLIVVFE 714
Query: 720 EIGGDPTKISFVTKQLGSSLCSHVTDSHPL--PVDMWGSDSKIQRKPGPVLSLECPNPNQ 777
E+GG+P++IS V + + +S+C+ + P+ V M ++ ++ + ++L C Q
Sbjct: 715 EVGGNPSRISLVKRSV-TSICTEASQYRPVIKNVHMHQNNGELNEQNVLKINLHCA-AGQ 772
Query: 778 VISSIKFASFGTPLGTCGSFSRGRCSSARSLSVVRQACVGSKSCSIGVSVNTFG-DPCKG 836
IS+IKFASFGTP G CGS +G C S +S V+++ CVG + C + + FG DPC
Sbjct: 773 FISAIKFASFGTPSGACGSHKQGTCHSPKSDYVLQKLCVGRQRCLATIPTSIFGEDPCPN 832
Query: 837 VMKSLAVEASC 847
+ K L+ E C
Sbjct: 833 LRKKLSAEVVC 843
>gi|255546099|ref|XP_002514109.1| beta-galactosidase, putative [Ricinus communis]
gi|223546565|gb|EEF48063.1| beta-galactosidase, putative [Ricinus communis]
Length = 827
Score = 904 bits (2336), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 458/847 (54%), Positives = 576/847 (68%), Gaps = 33/847 (3%)
Query: 6 ILLLVLCWGFVVLATTSFGANVTYDHRAVVIGGKRRVLISGSIHYPRSTPEMWPDLIQKS 65
+LL V W V AT V YDH+A+ I +RR+LISGSIHYPRSTPEMWP LIQK+
Sbjct: 10 LLLFVTAWVCNVTAT------VWYDHKAITINNQRRILISGSIHYPRSTPEMWPGLIQKA 63
Query: 66 KDGGLDVIETYVFWNLHEPVRNQYNFEGRYDLVKFVKLVAEAGLYAHLRIGPYVCAEWNF 125
K+GG++VI+TYVFWN HEP QY F+ RYDLVKF+KLV +AGLY HLRIGPYVCAEWNF
Sbjct: 64 KEGGIEVIQTYVFWNGHEPSPGQYYFQDRYDLVKFIKLVQQAGLYVHLRIGPYVCAEWNF 123
Query: 126 GGFPLWLHFIPGIQFRTDNEPFKAEMQRFTAKIVDMMKQEKLYASQGGPIILSQIENEYG 185
GGFP+WL ++PGI+FRTDN PFKA MQ+F IV+MMK++KL+ +QGGPIILSQIENEYG
Sbjct: 124 GGFPMWLKYVPGIEFRTDNGPFKAAMQKFVTLIVNMMKEQKLFQTQGGPIILSQIENEYG 183
Query: 186 NIDSAYGAAGKSYIKWAAGMALSLDTGVPWVMCQQSDAPDPIINTCNGFYCDQFTPNSNN 245
++ GA GK+Y KWAA MA L+TGVPW+MC+Q DAPDP I+TCNGFYC+ + PN+ N
Sbjct: 184 PVEWTIGAPGKAYTKWAAAMATGLNTGVPWIMCKQEDAPDPTIDTCNGFYCEGYKPNNYN 243
Query: 246 KPKMWTENWSGWFLSFGGAVPYRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNFDRTSGG 305
KPK+WTENW+GW+ +G +VPYRP ED AF+VARF G+F NYYMYHGGTNFDRT+ G
Sbjct: 244 KPKVWTENWTGWYTEWGASVPYRPPEDTAFSVARFIAASGSFVNYYMYHGGTNFDRTA-G 302
Query: 306 PFISTSYDYDAPLDEYGLIRQPKWGHLKDLHKAIKLCEAALVATDPTYPSLGPNLEATVY 365
F++TSYDYDAPLDEYGL PKWGHL+DLH+AIK E ALV+ DPT SLG N EA V+
Sbjct: 303 LFMATSYDYDAPLDEYGLTHDPKWGHLRDLHRAIKQSERALVSADPTVISLGKNQEAHVF 362
Query: 366 KTGSGLCSAFLANIGTNSDVTVKFNGNSYLLPAWSVSILPDCKNVVFNTAKINSVTLVPS 425
++ G C+AFLAN T V F Y LP WS+S+LPDCK VV+NTAKI++ S
Sbjct: 363 QSKMG-CAAFLANYDTQYSARVNFWNKPYSLPRWSISVLPDCKTVVYNTAKISA----QS 417
Query: 426 FSRQSLQVAADSSDAIGSGW-SYINE-PVGISKDDAFTKPGLLEQINTTADQSDYLWYSL 483
+ + VA+ G W S+I+E PVG S FTK GL EQ T D++DYLWY
Sbjct: 418 TQKWMMPVAS------GFSWQSHIDEVPVGYSA-GTFTKVGLWEQKYLTGDKTDYLWYMT 470
Query: 484 STNIKADEPLLEDGSKTVLHVQSLGHALHAFINGKLVGSGYGSSSNAKVTVDFPIALAPG 543
I ++E L G L V S GH LH FING L GS YGS N K+T + L G
Sbjct: 471 DVTINSNEGFLRSGKNPFLTVASAGHVLHVFINGHLAGSAYGSLENPKLTFSQNVKLVGG 530
Query: 544 KNTFDLLSLTVGLQNYGAFYEKTGAGITGPVQLKGSGNGTNIDLSSQQWTYQTGLKGEEL 603
N LLS TVGL N G Y+ G+ GPV L+G GT +D++ +W+Y+ GLKGE+L
Sbjct: 531 VNKIALLSATVGLANVGVHYDTWNVGVLGPVTLQGLNQGT-LDMTKWKWSYKIGLKGEDL 589
Query: 604 N-FPSGSSTQWDSKSTLPKLQPLVWYKTTFDAPAGSEPVAIDFTGMGKGEAWVNGQSIGR 662
F G++ W + L K PL WYKT +AP G++PVA+ MGKG+ ++NG+SIGR
Sbjct: 590 KLFSGGANVGWAQGAQLAKKTPLTWYKTFINAPPGNDPVALYMGSMGKGQMYINGRSIGR 649
Query: 663 YWPTYVSQNGGCTDSCNYRGAYSSNKCLKNCGKPSQSLYHVPRSWLKSSGNTLVLFEEIG 722
+WP Y ++ G C D C+Y G Y KC CG+P Q YHVPRSWLK +GN LV+FEE+G
Sbjct: 650 HWPAYTAK-GNCKD-CDYAGYYDDQKCRSGCGQPPQQWYHVPRSWLKPTGNLLVVFEEMG 707
Query: 723 GDPTKISFVTKQLGSSLCSHVTDSHPLPVDMWGSDSKIQRKPGPVLSLECPNPNQVISSI 782
GDPT IS V + +G S+C+ + D P + W + + P L CP P Q S I
Sbjct: 708 GDPTGISLVKRVVG-SVCADIDDDQP-EMKSWTENIPVT----PKAHLWCP-PGQKFSKI 760
Query: 783 KFASFGTPLGTCGSFSRGRCSSARSLSVVRQACVGSKSCSIGVSVNTF-GDPCKGVMKSL 841
FAS+G P G CG++ +G+C + +S ++ C+G +C I V+ TF GDPC G K L
Sbjct: 761 VFASYGWPQGRCGAYRQGKCHALKSWDPFQKYCIGKGACDIDVAPATFGGDPCPGSAKRL 820
Query: 842 AVEASCT 848
+V+ C+
Sbjct: 821 SVQLQCS 827
>gi|108706355|gb|ABF94150.1| Beta-galactosidase precursor, putative, expressed [Oryza sativa
Japonica Group]
Length = 819
Score = 903 bits (2334), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 447/793 (56%), Positives = 554/793 (69%), Gaps = 23/793 (2%)
Query: 27 VTYDHRAVVIGGKRRVLISGSIHYPRSTPEMWPDLIQKSKDGGLDVIETYVFWNLHEPVR 86
VTYD +AV++ G+RR+L SGSIHYPRSTPEMW LI+K+KDGGLDVI+TYVFWN HEP
Sbjct: 27 VTYDKKAVLVDGQRRILFSGSIHYPRSTPEMWDGLIEKAKDGGLDVIQTYVFWNGHEPTP 86
Query: 87 NQYNFEGRYDLVKFVKLVAEAGLYAHLRIGPYVCAEWNFGGFPLWLHFIPGIQFRTDNEP 146
YNFEGRYDLV+F+K V +AG++ HLRIGPY+C EWNFGGFP+WL ++PGI FRTDNEP
Sbjct: 87 GNYNFEGRYDLVRFIKTVQKAGMFVHLRIGPYICGEWNFGGFPVWLKYVPGISFRTDNEP 146
Query: 147 FKAEMQRFTAKIVDMMKQEKLYASQGGPIILSQIENEYGNIDSAYGAAGKSYIKWAAGMA 206
FK MQ FT KIV MMK E L+ASQGGPIILSQIENEYG +GAAGK+YI WAA MA
Sbjct: 147 FKNAMQGFTEKIVGMMKSENLFASQGGPIILSQIENEYGPEGKEFGAAGKAYINWAAKMA 206
Query: 207 LSLDTGVPWVMCQQSDAPDPIINTCNGFYCDQFTPNSNNKPKMWTENWSGWFLSFGGAVP 266
+ LDTGVPWVMC++ DAPDP+IN CNGFYCD F+PN KP MWTE WSGWF FGG +
Sbjct: 207 VGLDTGVPWVMCKEDDAPDPVINACNGFYCDTFSPNKPYKPTMWTEAWSGWFTEFGGTIR 266
Query: 267 YRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNFDRTSGGPFISTSYDYDAPLDEYGLIRQ 326
RPVEDLAF VARF Q+GG+F NYYMYHGGTNF RT+GGPFI+TSYDYDAPLDEYGL R+
Sbjct: 267 QRPVEDLAFGVARFVQKGGSFINYYMYHGGTNFGRTAGGPFITTSYDYDAPLDEYGLARE 326
Query: 327 PKWGHLKDLHKAIKLCEAALVATDPTYPSLGPNLEATVYKTGSGLCSAFLANIGTNSDVT 386
PK+GHLK+LH+A+KLCE LV+ DPT +LG EA V+++ SG C+AFLAN +NS
Sbjct: 327 PKFGHLKELHRAVKLCEQPLVSADPTVTTLGSMQEAHVFRSSSG-CAAFLANYNSNSYAK 385
Query: 387 VKFNGNSYLLPAWSVSILPDCKNVVFNTAKINSVTLVPSFSRQSLQVAADSSDAIGSGWS 446
V FN +Y LP WS+SILPDCKNVVFNTA + T +Q+ AD + ++ W
Sbjct: 386 VIFNNENYSLPPWSISILPDCKNVVFNTATVGVQT-------NQMQMWADGASSM--MWE 436
Query: 447 YINEPV-GISKDDAFTKPGLLEQINTTADQSDYLWYSLSTNIKADEPLLEDGSKTVLHVQ 505
+E V ++ T GLLEQ+N T D SDYLWY S + E L+ G+ L VQ
Sbjct: 437 KYDEEVDSLAAAPLLTSTGLLEQLNVTRDTSDYLWYITSVEVDPSEKFLQGGTPLSLTVQ 496
Query: 506 SLGHALHAFINGKLVGSGYGSSSNAKVTVDFPIALAPGKNTFDLLSLTVGLQNYGAFYEK 565
S GHALH FING+L GS YG+ + K++ L G N LLS+ GL N G YE
Sbjct: 497 SAGHALHVFINGQLQGSAYGTREDRKISYSGNANLRAGTNKVALLSVACGLPNVGVHYET 556
Query: 566 TGAGITGPVQLKGSGNGTNIDLSSQQWTYQTGLKGEELNFPS---GSSTQWDSKSTLPK- 621
G+ GPV + G G+ DL+ Q W+YQ GLKGE++N S S +W S + +
Sbjct: 557 WNTGVVGPVVIHGLDEGSR-DLTWQTWSYQVGLKGEQMNLNSLEGSGSVEWMQGSLVAQN 615
Query: 622 LQPLVWYKTTFDAPAGSEPVAIDFTGMGKGEAWVNGQSIGRYWPTYVSQNGGCTDSCNYR 681
QPL WY+ FD P+G EP+A+D MGKG+ W+NGQSIGRYW Y G C C+Y
Sbjct: 616 QQPLAWYRAYFDTPSGDEPLALDMGSMGKGQIWINGQSIGRYWTAYAE--GDC-KGCHYT 672
Query: 682 GAYSSNKCLKNCGKPSQSLYHVPRSWLKSSGNTLVLFEEIGGDPTKISFVTKQLGSSLCS 741
G+Y + KC CG+P+Q YHVPRSWL+ + N LV+FEE+GGD +KI+ + + S +C+
Sbjct: 673 GSYRAPKCQAGCGQPTQRWYHVPRSWLQPTRNLLVVFEELGGDSSKIALAKRTV-SGVCA 731
Query: 742 HVTDSHPLPVDMWGSDSKIQRK-PGPVLSLECPNPNQVISSIKFASFGTPLGTCGSFSRG 800
V++ HP + W +S + + + L+C P Q IS+IKFASFGTPLGTCG+F +G
Sbjct: 732 DVSEYHP-NIKNWQIESYGEPEFHTAKVHLKCA-PGQTISAIKFASFGTPLGTCGTFQQG 789
Query: 801 RCSSARSLSVVRQ 813
C S S SV+ +
Sbjct: 790 ECHSINSNSVLEK 802
>gi|302799737|ref|XP_002981627.1| hypothetical protein SELMODRAFT_421090 [Selaginella moellendorffii]
gi|300150793|gb|EFJ17442.1| hypothetical protein SELMODRAFT_421090 [Selaginella moellendorffii]
Length = 874
Score = 903 bits (2333), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 460/862 (53%), Positives = 579/862 (67%), Gaps = 60/862 (6%)
Query: 26 NVTYDHRAVVIGGKRRVLISGSIHYPRSTPEMWPDLIQKSKDGGLDVIETYVFWNLHEPV 85
N++YDHRA++IGG+RR+LISG IHYPR++P+MWP LI+ +K+GGLD+I+TYVFW+ HEP
Sbjct: 22 NISYDHRAIIIGGQRRILISGCIHYPRASPQMWPALIRNAKEGGLDMIDTYVFWDGHEPS 81
Query: 86 RNQYNFEGRYDLVKFVKLVAEAGLYAHLRIGPYVCAEWNFGGFPLWLHFIPGIQFRTDNE 145
YNF+GRYDL++F+KLV +AGLY +LRIGPYVCAEWNFGGFP WL +PGIQFRT N
Sbjct: 82 PGIYNFQGRYDLIRFLKLVHQAGLYVNLRIGPYVCAEWNFGGFPAWLLKLPGIQFRTHNR 141
Query: 146 PFKAEMQRFTAKIVDMMKQEKLYASQGGPIILSQIENEYGNIDSAYGAAGKSYIKWAAGM 205
F+ +M+ F KIVDM+K E+L+ASQGGP++ SQIENEYGN+ +YG GK+Y+ WAA M
Sbjct: 142 AFEDKMEEFVRKIVDMVKSEQLFASQGGPVLFSQIENEYGNVQGSYGINGKTYMLWAARM 201
Query: 206 ALSLDTGVPWVMCQQSDAPDPIINTCNGFYCDQFTPNSNNKPKMWTENWSGWFLSFGGAV 265
A L+TGVPW+MC+Q DAPD IINTCNG+YCD + PNS +KP MWTENWSGW+ S+G A
Sbjct: 202 AKDLETGVPWIMCKQPDAPDYIINTCNGYYCDGWKPNSRDKPAMWTENWSGWYQSWGEAA 261
Query: 266 PYRPVEDLAFAVARFFQRGGTFQNYYM------------------YHGGTNFDRTSGGPF 307
PYR VED+AFAVARFFQRGG QNYYM Y GGTNF RTSGGPF
Sbjct: 262 PYRTVEDVAFAVARFFQRGGVAQNYYMVRTLHDLEQRLLMPERCQYFGGTNFGRTSGGPF 321
Query: 308 ISTSYDYDAPLDEYGLIRQPKWGHLKDLHKAIKLCEAALVATDPTYPSLG---PNLEATV 364
I+TSYDYDAPLDE+G++RQPKWGHLK+LH A+KLCE AL + DP Y +LG ++A V
Sbjct: 322 ITTSYDYDAPLDEFGMLRQPKWGHLKELHAALKLCETALTSNDPVYYTLGRMQEMVQAHV 381
Query: 365 YKTGS---------GLCSAFLANIGTNSDVTVKFNGNSYLLPAWSVSILPDCKNVVFNTA 415
Y GS C+AFLANI T+S +VKF G Y LP WSVSILPDC+NVVFNTA
Sbjct: 382 YSDGSLEANFSNLATPCAAFLANIDTSS-ASVKFGGKVYNLPPWSVSILPDCRNVVFNTA 440
Query: 416 KIN---SVTLVPSFSRQSLQVAADSSDAIG----SGWSYINEPVGISKDDAFTKPGLLEQ 468
+++ SVT + + + SL S G W + EPVG S + LLEQ
Sbjct: 441 QVSAQTSVTKMVAVQKPSLIEEVSGSYTPGLVEQLAWEWFQEPVGGSGINKILAHALLEQ 500
Query: 469 INTTADQSDYLWYSLSTNIKADEPLLEDGSKTVLHVQSLGHALHAFINGKLVGSGYG-SS 527
I+TT D +DY+WYS I D+ L G VL + S+ +H F+NG+ GS S
Sbjct: 501 ISTTNDSTDYMWYSTRFEI-LDQEL--KGGDPVLVITSMRDMVHIFVNGEFAGSTSTLKS 557
Query: 528 SNAKVTVDFPIALAPGKNTFDLLSLTVGLQNYGAFYEKTGAGITGPVQLKGSGNGTNIDL 587
V PI L G N +LS TVGLQNYGA E GAGITG + ++G GT +L
Sbjct: 558 GGLYARVQQPIHLKAGVNHLAILSATVGLQNYGAHLETHGAGITGSIWIQGLSTGTR-NL 616
Query: 588 SSQQWTYQTGLKGEELNFPSGSSTQWDSKSTLPKLQPLVWYKTTFDAPAGSEPVAIDFTG 647
+S W +Q GL GE + W S ++LP QPLVWYK F+ P G +PVAI
Sbjct: 617 TSALWLHQVGLNGEH------DAITWSSTTSLPFFQPLVWYKANFNIPDGDDPVAIHLGS 670
Query: 648 MGKGEAWVNGQSIGRYWPTYVSQNGGCTDSCNYRGAYSSNKCLKNCGKPSQSLYHVPRSW 707
MGKG+AWVNG S+GR+WP + + GC+D C+YRG Y S+KCL +CG PSQ YHVPR W
Sbjct: 671 MGKGQAWVNGHSLGRFWPVITAPSTGCSDRCDYRGTYYSSKCLSSCGLPSQEWYHVPREW 730
Query: 708 LKSSGNTLVLFEEIGGDPTKISFVTKQLGSSLCSHVTDSHPLPVDMWGSDSKIQRKPGPV 767
L + NTLVL EEIGG+ + +SF ++ + +C+ V++ PV + S P
Sbjct: 731 LVNEKNTLVLLEEIGGNVSGVSFASRVV-DRVCAQVSEYSLPPVAQFSSL--------PE 781
Query: 768 LSLECPNPNQVISSIKFASFGTPLGTCGSFSRGRCSSARSLSVVRQACVGSKSCSIGVSV 827
L L C +P Q ISSI FASFG P G CG+F +G C + S ++V +AC+G +SCS +
Sbjct: 782 LGLSC-SPGQFISSIFFASFGNPKGRCGAFQKGSCHALESETIVEKACIGRQSCSFEIFW 840
Query: 828 NTFG-DPCKGVMKSLAVEASCT 848
FG DPC G K+LAVEA+CT
Sbjct: 841 KNFGTDPCPGKAKTLAVEAACT 862
>gi|302814772|ref|XP_002989069.1| hypothetical protein SELMODRAFT_269483 [Selaginella moellendorffii]
gi|300143170|gb|EFJ09863.1| hypothetical protein SELMODRAFT_269483 [Selaginella moellendorffii]
Length = 722
Score = 903 bits (2333), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 434/713 (60%), Positives = 529/713 (74%), Gaps = 19/713 (2%)
Query: 27 VTYDHRAVVIGGKRRVLISGSIHYPRSTPEMWPDLIQKSKDGGLDVIETYVFWNLHEPVR 86
V YDHR ++I G+ R+LIS SIHYPR+ P+MW LI +K GG+DVIETYVFW+ H+P R
Sbjct: 24 VAYDHRGLIINGQHRMLISASIHYPRAAPQMWSQLISNAKAGGIDVIETYVFWDGHQPTR 83
Query: 87 NQYNFEGRYDLVKFVKLVAEAGLYAHLRIGPYVCAEWNFGGFPLWLHFIPGIQFRTDNEP 146
+ YNFEGR+DLV FVKLV EAGLYA+LRIGPYVCAEWN GGFP+WL +PGI+FRT+N+P
Sbjct: 84 DTYNFEGRFDLVSFVKLVHEAGLYANLRIGPYVCAEWNLGGFPVWLKDVPGIEFRTNNQP 143
Query: 147 FKAEMQRFTAKIVDMMKQEKLYASQGGPIILSQIENEYGNIDSAYGAAGKSYIKWAAGMA 206
FKAEMQ F KIV MMK +KL+A QGGPIIL+QIENEYGNID+AYGAAGK Y++WAA MA
Sbjct: 144 FKAEMQAFVEKIVAMMKHDKLFAPQGGPIILAQIENEYGNIDAAYGAAGKEYMEWAANMA 203
Query: 207 LSLDTGVPWVMCQQSDAPDPIINTCNGFYCDQFTPNSNNKPKMWTENWSGWFLSFGGAVP 266
L TGVPW+MCQQSDAPD I++TCNGFYCD + PN+ KPKMWTENWSGWF +G A P
Sbjct: 204 QGLGTGVPWIMCQQSDAPDYILDTCNGFYCDAWAPNNKKKPKMWTENWSGWFQKWGEASP 263
Query: 267 YRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNFDRTSGGPFISTSYDYDAPLDEYGLIRQ 326
+RPVED+AFAVARFFQRGG+FQNYYMY GGTNF R+SGGP+++TSYDYDAP+DE+G+IRQ
Sbjct: 264 HRPVEDVAFAVARFFQRGGSFQNYYMYFGGTNFGRSSGGPYVTTSYDYDAPIDEFGVIRQ 323
Query: 327 PKWGHLKDLHKAIKLCEAALVATDPTYPSLGPNLEATVY-KTGSGLCSAFLANIGTNSDV 385
PKWGHLK LH AIKLCEAAL + DPTY SLG EA VY T SG C+AFLANI ++SD
Sbjct: 324 PKWGHLKQLHAAIKLCEAALGSNDPTYISLGQLQEAHVYGSTSSGACAAFLANIDSSSDA 383
Query: 386 TVKFNGNSYLLPAWSVSILPDCKNVVFNTAKINSVTLVPSFSRQSLQVAADSSDAIGSGW 445
TVKFN +YLLPAWSVSILPDCK V NTAK++ T +P+ G W
Sbjct: 384 TVKFNSRTYLLPAWSVSILPDCKTVSHNTAKVHVQTAMPTM----------KPSITGLAW 433
Query: 446 SYINEPVGISKDDAFTKPGLLEQINTTADQSDYLWYSLSTNI-KADEPLLEDGSKTVLHV 504
EPVG+ D LLEQINTT D SDYLWY+ S +I +AD K +L +
Sbjct: 434 ESYPEPVGVWSDSGIVASALLEQINTTKDTSDYLWYTTSLDISQADAA----SGKALLSL 489
Query: 505 QSLGHALHAFINGKLVGSGYGSSSNAKVTVDFPIALAPGKNTFDLLSLTVGLQNYGAFYE 564
+S+ +H F+NGKL GS + V+ PI LA G N+ +L TVGLQNYG F E
Sbjct: 490 ESMRDVVHVFVNGKLAGSASTKGTQLYAAVEQPIELASGHNSLAILCATVGLQNYGPFIE 549
Query: 565 KTGAGITGPVQLKGSGNGTNIDLSSQQWTYQTGLKGEELN-FPSGSSTQWDSKSTLPKLQ 623
GAGI G V +KG +G IDL++++W +Q GLKGE L F S + S +P+ Q
Sbjct: 550 TWGAGINGSVIVKGLPSG-QIDLTAEEWIHQVGLKGESLAIFTESGSQRVRWSSAVPQGQ 608
Query: 624 PLVWYKTTFDAPAGSEPVAIDFTGMGKGEAWVNGQSIGRYWPTYVSQN-GGCTDSCNYRG 682
LVWYK FD+P+G++PVA+D MGKG+AW+NGQSIGR+WP+ + + GC +C+YRG
Sbjct: 609 ALVWYKAHFDSPSGNDPVALDLESMGKGQAWINGQSIGRFWPSLRAPDTAGCPQTCDYRG 668
Query: 683 AYSSNKCLKNCGKPSQSLYHVPRSWLKSSGNTLVLFEEIGGDPTKISFVTKQL 735
+YSS+KC CG+PSQ YHVPRSWL+ SGN +VLFEE GG P+ +SFVT+ +
Sbjct: 669 SYSSSKCRSGCGQPSQRWYHVPRSWLQDSGNLVVLFEEEGGKPSGVSFVTRTV 721
>gi|357449771|ref|XP_003595162.1| Beta-galactosidase [Medicago truncatula]
gi|124360798|gb|ABN08770.1| Galactose-binding like [Medicago truncatula]
gi|355484210|gb|AES65413.1| Beta-galactosidase [Medicago truncatula]
Length = 726
Score = 902 bits (2331), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 438/733 (59%), Positives = 531/733 (72%), Gaps = 20/733 (2%)
Query: 4 KEILLLVLCWGFVVLATTSFGANVTYDHRAVVIGGKRRVLISGSIHYPRSTPEMWPDLIQ 63
+ +L LC+ FV T A+VTYDH+A+VI GKRR+LISGSIHYPRSTP+MWPDLIQ
Sbjct: 10 RNCYILFLCF-FVCYVT----ASVTYDHKAIVINGKRRILISGSIHYPRSTPQMWPDLIQ 64
Query: 64 KSKDGGLDVIETYVFWNLHEPVRNQYNFEGRYDLVKFVKLVAEAGLYAHLRIGPYVCAEW 123
K+KDGG+DVIETYVFWN HEP + +Y FE R+DLVKF+K+V +AGLY HLRIGPYVCAEW
Sbjct: 65 KAKDGGVDVIETYVFWNGHEPSQGKYYFEDRFDLVKFIKVVQQAGLYVHLRIGPYVCAEW 124
Query: 124 NFGGFPLWLHFIPGIQFRTDNEPFKAEMQRFTAKIVDMMKQEKLYASQGGPIILSQIENE 183
NFGGFP+WL ++PG+ FRTDNEPFKA MQ+FT KIV +MK E L+ SQGGPIILSQIENE
Sbjct: 125 NFGGFPVWLKYVPGVAFRTDNEPFKAAMQKFTTKIVSIMKSENLFQSQGGPIILSQIENE 184
Query: 184 YGNIDSAYGAAGKSYIKWAAGMALSLDTGVPWVMCQQSDAPDPIINTCNGFYCDQFTPNS 243
YG ++ GA GKSY KW + MA+ L+TGVPWVMC+Q DAPDPII+TCNG+YC+ F+PN
Sbjct: 185 YGPVEWEIGAPGKSYTKWFSQMAVGLNTGVPWVMCKQEDAPDPIIDTCNGYYCENFSPNK 244
Query: 244 NNKPKMWTENWSGWFLSFGGAVPYRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNFDRTS 303
N KPKMWTENW+GW+ FG AVPYRP EDLAF+VARF Q G++ NYYMYHGGTNF RTS
Sbjct: 245 NYKPKMWTENWTGWYTDFGTAVPYRPAEDLAFSVARFVQNRGSYVNYYMYHGGTNFGRTS 304
Query: 304 GGPFISTSYDYDAPLDEYGLIRQPKWGHLKDLHKAIKLCEAALVATDPTYPSLGPNLEAT 363
G FI+TSYDYDAP+DEYGLI +PKWGHL+DLHKAIK CE+ALV+ DPT G NLE
Sbjct: 305 SGLFIATSYDYDAPIDEYGLISEPKWGHLRDLHKAIKQCESALVSVDPTVSWPGKNLEVH 364
Query: 364 VYKTGSGLCSAFLANIGTNSDVTVKFNGNSYLLPAWSVSILPDCKNVVFNTAKINSVTLV 423
+YKT G C+AFLAN T S V F Y LP WS+SILPDCK VFNTAK+ +
Sbjct: 365 LYKTSFGACAAFLANYDTGSWAKVAFGNGHYDLPPWSISILPDCKTEVFNTAKVRA---- 420
Query: 424 PSFSRQSLQVAADSSDAIGSGWSYINEPVGISKDDAFTKPGLLEQINTTADQSDYLWYSL 483
P R + +++ + SY +P + ++T GLLEQ++ T D+SDYLWY
Sbjct: 421 PRVHR-----SMTPANSAFNWQSYNEQPAFSGESGSWTANGLLEQLSQTWDKSDYLWYMT 475
Query: 484 STNIKADEPLLEDGSKTVLHVQSLGHALHAFINGKLVGSGYGSSSNAKVTVDFPIALAPG 543
NI +E +++G VL S GH LH FING+ G+ YGS N K+T + L G
Sbjct: 476 DVNISPNEGFIKNGQNPVLTAMSAGHVLHVFINGQFWGTAYGSLDNPKLTFSNSVKLRVG 535
Query: 544 KNTFDLLSLTVGLQNYGAFYEKTGAGITGPVQLKGSGNGTNIDLSSQQWTYQTGLKGEEL 603
N LLS+ VGL N G YEK G+ GPV LKG GT DLS Q+W+Y+ GLKGE L
Sbjct: 536 NNKISLLSVAVGLSNVGVHYEKWNVGVLGPVTLKGLNEGTR-DLSKQKWSYKIGLKGESL 594
Query: 604 NFPS---GSSTQWDSKSTLPKLQPLVWYKTTFDAPAGSEPVAIDFTGMGKGEAWVNGQSI 660
N + SS +W S L K QPL WYKTTF+APAG++P+A+D + MGKGE WVNGQSI
Sbjct: 595 NLHTTSGSSSVKWTQGSFLSKKQPLTWYKTTFNAPAGNDPLALDMSSMGKGEIWVNGQSI 654
Query: 661 GRYWPTYVSQNGGCTDSCNYRGAYSSNKCLKNCGKPSQSLYHVPRSWLKSSGNTLVLFEE 720
GR+WP Y+++ G C SCNY G ++ KC NCG+P+Q YH+PRSWL SGN LV+ EE
Sbjct: 655 GRHWPAYIAR-GNC-GSCNYAGTFTDKKCRTNCGQPTQKWYHIPRSWLNPSGNVLVVLEE 712
Query: 721 IGGDPTKISFVTK 733
GGDPT IS V +
Sbjct: 713 WGGDPTGISLVKR 725
>gi|302759477|ref|XP_002963161.1| hypothetical protein SELMODRAFT_404798 [Selaginella moellendorffii]
gi|300168429|gb|EFJ35032.1| hypothetical protein SELMODRAFT_404798 [Selaginella moellendorffii]
Length = 874
Score = 902 bits (2330), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 461/862 (53%), Positives = 579/862 (67%), Gaps = 60/862 (6%)
Query: 26 NVTYDHRAVVIGGKRRVLISGSIHYPRSTPEMWPDLIQKSKDGGLDVIETYVFWNLHEPV 85
N++YDHRA++IGG+RR+LISG +HYPR++P+MWP LI+ +K+GGLD+I+TYVFW+ HEP
Sbjct: 22 NISYDHRAIIIGGQRRILISGCLHYPRASPQMWPALIRNAKEGGLDMIDTYVFWDGHEPS 81
Query: 86 RNQYNFEGRYDLVKFVKLVAEAGLYAHLRIGPYVCAEWNFGGFPLWLHFIPGIQFRTDNE 145
YNF+GRYDL++F+KLV +AGLY +LRIGPYVCAEWNFGGFP WL +PGIQFRT N
Sbjct: 82 PGIYNFQGRYDLIRFLKLVHQAGLYVNLRIGPYVCAEWNFGGFPAWLLKLPGIQFRTHNR 141
Query: 146 PFKAEMQRFTAKIVDMMKQEKLYASQGGPIILSQIENEYGNIDSAYGAAGKSYIKWAAGM 205
F+ +M+ F KIVDM+K E+L+ASQGGP++ SQIENEYGN+ +YG GK+Y+ WAA M
Sbjct: 142 AFEDKMEEFVRKIVDMVKSEQLFASQGGPVLFSQIENEYGNVQGSYGTNGKTYMLWAARM 201
Query: 206 ALSLDTGVPWVMCQQSDAPDPIINTCNGFYCDQFTPNSNNKPKMWTENWSGWFLSFGGAV 265
A L+TGVPW+MC+Q DAPD IINTCNG+YCD + PNS +KP MWTENWSGW+ +G A
Sbjct: 202 AKDLETGVPWIMCKQPDAPDYIINTCNGYYCDGWKPNSRDKPAMWTENWSGWYQLWGEAA 261
Query: 266 PYRPVEDLAFAVARFFQRGGTFQNYYM------------------YHGGTNFDRTSGGPF 307
PYR VED+AFAVARFFQRGG QNYYM Y GGTNF RTSGGPF
Sbjct: 262 PYRTVEDVAFAVARFFQRGGVAQNYYMVRMLHDLEQHLLMPERCQYFGGTNFGRTSGGPF 321
Query: 308 ISTSYDYDAPLDEYGLIRQPKWGHLKDLHKAIKLCEAALVATDPTYPSLG---PNLEATV 364
I+TSYDYDAPLDE+G++RQPKWGHLK+LH A+KLCE AL + DP Y +LG ++A V
Sbjct: 322 ITTSYDYDAPLDEFGMLRQPKWGHLKELHAALKLCETALTSNDPLYYTLGRMQEMVQAHV 381
Query: 365 YKTGS---------GLCSAFLANIGTNSDVTVKFNGNSYLLPAWSVSILPDCKNVVFNTA 415
Y GS C+AFLANI T+S +VKF GN Y LP WSVSILPDC+NVVFNTA
Sbjct: 382 YSDGSLEANFSNLATPCAAFLANIDTSS-ASVKFGGNVYNLPPWSVSILPDCRNVVFNTA 440
Query: 416 KIN---SVTLVPSFSRQSLQVAADSSDAIG----SGWSYINEPVGISKDDAFTKPGLLEQ 468
+++ SVT + + + SL S G W + EPVG S + LLEQ
Sbjct: 441 QVSAQTSVTKMVAVQKPSLIEEVSGSYTPGLVEQLAWEWFQEPVGGSGINKILAHALLEQ 500
Query: 469 INTTADQSDYLWYSLSTNIKADEPLLEDGSKTVLHVQSLGHALHAFINGKLVGSGYG-SS 527
I+TT D +DYLWYS I +D+ L G VL + S+ +H F+NG+ GS S
Sbjct: 501 ISTTNDSTDYLWYSTRFEI-SDQEL--KGGDPVLVITSMRDMVHIFVNGEFAGSTSTLKS 557
Query: 528 SNAKVTVDFPIALAPGKNTFDLLSLTVGLQNYGAFYEKTGAGITGPVQLKGSGNGTNIDL 587
V PI L G N +LS TVGLQNYGA E GAGITG V ++G GT +L
Sbjct: 558 GGLYARVQQPIHLKAGVNHLAILSATVGLQNYGAHLETHGAGITGSVWIQGLSTGTR-NL 616
Query: 588 SSQQWTYQTGLKGEELNFPSGSSTQWDSKSTLPKLQPLVWYKTTFDAPAGSEPVAIDFTG 647
+S W +Q GL GE + W S ++LP QPLVWYK F+ P G +PVAI
Sbjct: 617 TSALWLHQVGLNGEH------DAITWSSTTSLPFFQPLVWYKANFNIPDGDDPVAIHLGS 670
Query: 648 MGKGEAWVNGQSIGRYWPTYVSQNGGCTDSCNYRGAYSSNKCLKNCGKPSQSLYHVPRSW 707
MGKG+AWVNG S+GR+WP + + GC+D C+YRG Y S+KCL CG PSQ YHVPR W
Sbjct: 671 MGKGQAWVNGHSLGRFWPAITAPSTGCSDRCDYRGTYYSSKCLSGCGLPSQEWYHVPREW 730
Query: 708 LKSSGNTLVLFEEIGGDPTKISFVTKQLGSSLCSHVTDSHPLPVDMWGSDSKIQRKPGPV 767
L + NTLVL EEIGG+ + +SF ++ + +C+ V++ PV + S P
Sbjct: 731 LVNEKNTLVLLEEIGGNVSGVSFASRVV-DRVCAQVSEYSLPPVAQFSSL--------PE 781
Query: 768 LSLECPNPNQVISSIKFASFGTPLGTCGSFSRGRCSSARSLSVVRQACVGSKSCSIGVSV 827
L L C +P Q ISSI FASFG P G CG+F +G C + S ++V +AC+G +SCS +
Sbjct: 782 LGLSC-SPGQFISSIFFASFGNPKGRCGAFQKGSCHALESETIVEKACIGRQSCSFEIFW 840
Query: 828 NTFG-DPCKGVMKSLAVEASCT 848
FG DPC G K+LAVEA+CT
Sbjct: 841 KNFGTDPCPGKAKTLAVEAACT 862
>gi|7682677|gb|AAF67341.1| beta galactosidase [Vigna radiata]
Length = 721
Score = 900 bits (2327), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 442/732 (60%), Positives = 525/732 (71%), Gaps = 25/732 (3%)
Query: 6 ILLLVLCWGFVVLATTSFGANVTYDHRAVVIGGKRRVLISGSIHYPRSTPEMWPDLIQKS 65
+L+L+ W V T A+VTYDH+A+VI GKRR+LISGSIHYPRSTP+MWPDLIQK+
Sbjct: 10 VLMLLFFW---VCGVT---ASVTYDHKAIVIDGKRRILISGSIHYPRSTPQMWPDLIQKA 63
Query: 66 KDGGLDVIETYVFWNLHEPVRNQYNFEGRYDLVKFVKLVAEAGLYAHLRIGPYVCAEWNF 125
KDGGLDVI+TYVFWN HEP +Y FE RYDLV+FVKL +AGLY HLRIGPY+CAEWNF
Sbjct: 64 KDGGLDVIQTYVFWNGHEPSPGKYYFEDRYDLVRFVKLAQQAGLYVHLRIGPYICAEWNF 123
Query: 126 GGFPLWLHFIPGIQFRTDNEPFKAEMQRFTAKIVDMMKQEKLYASQGGPIILSQIENEYG 185
GGFP+WL ++PGI FRTDNEPFKA MQ+FTAKIV +MK+E+L+ SQGGPIILSQIENEYG
Sbjct: 124 GGFPVWLKYVPGIAFRTDNEPFKAAMQKFTAKIVSLMKEERLFQSQGGPIILSQIENEYG 183
Query: 186 NIDSAYGAAGKSYIKWAAGMALSLDTGVPWVMCQQSDAPDPIINTCNGFYCDQFTPNSNN 245
++ GA GKSY KWAA MA+ LDTGVPWVMC+Q DAPDP+I+TCNGFYC+ F PN N
Sbjct: 184 PVEWEIGAPGKSYTKWAAQMAVGLDTGVPWVMCKQEDAPDPVIDTCNGFYCENFKPNKNT 243
Query: 246 KPKMWTENWSGWFLSFGGAVPYRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNFDRTSGG 305
KPKMWTENW+GW+ FGGA P RP EDLAF+VARF Q GG+F NYYMYHGGTNF RTSGG
Sbjct: 244 KPKMWTENWTGWYTDFGGASPIRPAEDLAFSVARFIQNGGSFVNYYMYHGGTNFGRTSGG 303
Query: 306 PFISTSYDYDAPLDEYGLIRQPKWGHLKDLHKAIKLCEAALVATDPTYPSLGPNLEATVY 365
FI+TSYDYDAPLDEYGL +PKWGHL+ LHKAIK E ALV+TDP SLG NLEA V+
Sbjct: 304 LFIATSYDYDAPLDEYGLQNEPKWGHLRALHKAIKQSEPALVSTDPKVTSLGYNLEAHVF 363
Query: 366 KTGSGLCSAFLANIGTNSDVTVKFNGNSYLLPAWSVSILPDCKNVVFNTAKINSVTLVPS 425
T G C+AF+AN T S F Y LP WS+SILPDCK VV+NTA++ +
Sbjct: 364 ST-PGACAAFIANYDTKSSAKATFGSGQYDLPPWSISILPDCKTVVYNTARVGN-----G 417
Query: 426 FSRQSLQVAADSSDAIGSGW-SYINEPVGISKDDAFTKPGLLEQINTTADQSDYLWYSLS 484
+ ++ V + G W SY EP S+DD+ L EQ+N T D SDYLWY
Sbjct: 418 WVKKMTPVNS------GFAWQSYNEEPASSSQDDSIAAEALWEQVNVTRDSSDYLWYMTD 471
Query: 485 TNIKADEPLLEDGSKTVLHVQSLGHALHAFINGKLVGSGYGSSSNAKVTVDFPIALAPGK 544
I +E L++G VL V S GH LH FING+L G+ YG N K+T + L G
Sbjct: 472 VYINGNEGFLKNGRSPVLTVMSAGHLLHVFINGQLSGTVYGGLGNPKLTFSDNVNLRVGN 531
Query: 545 NTFDLLSLTVGLQNYGAFYEKTGAGITGPVQLKGSGNGTNIDLSSQQWTYQTGLKGEELN 604
N LLS+ VGL N G +E AG+ GPV LKG GT DLS Q+W+Y+ GLKGE LN
Sbjct: 532 NKLSLLSVAVGLPNVGVHFETWNAGVLGPVTLKGLNEGTR-DLSRQKWSYKVGLKGEALN 590
Query: 605 FPS---GSSTQWDSKSTLPKLQPLVWYKTTFDAPAGSEPVAIDFTGMGKGEAWVNGQSIG 661
+ SS +W S + K QPL WYK TF APAG++P+A+D MGKGE WVNG+SIG
Sbjct: 591 LHTESGSSSVEWIQGSLVAKKQPLTWYKATFSAPAGNDPLALDLGSMGKGEVWVNGRSIG 650
Query: 662 RYWPTYVSQNGGCTDSCNYRGAYSSNKCLKNCGKPSQSLYHVPRSWLKSSGNTLVLFEEI 721
R+WP Y++ G ++CNY G Y+ KC NCGKPSQ YHVPRSWL S GN+LV+FEE
Sbjct: 651 RHWPGYIAH--GSCNACNYAGYYTDQKCRTNCGKPSQRWYHVPRSWLNSGGNSLVVFEEW 708
Query: 722 GGDPTKISFVTK 733
GGDP I+ V +
Sbjct: 709 GGDPNGIALVKR 720
>gi|414864994|tpg|DAA43551.1| TPA: beta-galactosidase [Zea mays]
Length = 897
Score = 900 bits (2325), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 461/882 (52%), Positives = 576/882 (65%), Gaps = 78/882 (8%)
Query: 28 TYDHRAVVIGGKRRVLISGSIHYPRSTPE------------------------------- 56
TYD +AV+I G+RR+L SGSIHYPRSTP+
Sbjct: 30 TYDKKAVLIDGQRRILFSGSIHYPRSTPDVISCILQNLSFFFSPLLPRGGGEFMAVVSCV 89
Query: 57 ---------------------MWPDLIQKSKDGGLDVIETYVFWNLHEPVRNQYNFEGRY 95
MW LIQK+KDGGLDVI+TYVFWN HEP Y FE RY
Sbjct: 90 LDAMLSKANCFPTLAVPLYSTMWEGLIQKAKDGGLDVIQTYVFWNGHEPTPGNYYFEERY 149
Query: 96 DLVKFVKLVAEAGLYAHLRIGPYVCAEWNFGGFPLWLHFIPGIQFRTDNEPFKAEMQRFT 155
DLV+FVK V +AGL+ HLRIGPY+C EWNFGGFP+WL ++PGI FRTDNEPFK MQ FT
Sbjct: 150 DLVRFVKTVQKAGLFVHLRIGPYICGEWNFGGFPVWLKYVPGISFRTDNEPFKTAMQGFT 209
Query: 156 AKIVDMMKQEKLYASQGGPIILSQIENEYGNIDSAYGAAGKSYIKWAAGMALSLDTGVPW 215
KIV MMK E L+ASQGGPIILSQIENEYG +GAAG++YI WAA MA+ LDTGVPW
Sbjct: 210 EKIVGMMKSENLFASQGGPIILSQIENEYGPEGKEFGAAGQAYINWAAKMAVGLDTGVPW 269
Query: 216 VMCQQSDAPDPIINTCNGFYCDQFTPNSNNKPKMWTENWSGWFLSFGGAVPYRPVEDLAF 275
VMC++ DAPDP+IN CNGFYCD F+PN KP MWTE WSGWF FGG + RPVEDLAF
Sbjct: 270 VMCKEEDAPDPVINACNGFYCDAFSPNKPYKPTMWTEAWSGWFTEFGGTIRQRPVEDLAF 329
Query: 276 AVARFFQRGGTFQNYYMYHGGTNFDRTSGGPFISTSYDYDAPLDEYGLIRQPKWGHLKDL 335
AVARF Q+GG+F NYYMYHGGTNF RT+GGPFI+TSYDYDAP+DEYGLIR+PK HLK+L
Sbjct: 330 AVARFVQKGGSFINYYMYHGGTNFGRTAGGPFITTSYDYDAPIDEYGLIREPKHSHLKEL 389
Query: 336 HKAIKLCEAALVATDPTYPSLGPNLEATVYKTGSGLCSAFLANIGTNSDVTVKFNGNSYL 395
H+A+KLCE ALV+ DPT +LG EA V+++ SG C+AFLAN +NS V FN Y
Sbjct: 390 HRAVKLCEQALVSVDPTITTLGTMQEAHVFRSPSG-CAAFLANYNSNSHAKVVFNNEQYS 448
Query: 396 LPAWSVSILPDCKNVVFNTAKINSVTLVPSFSRQSLQVAADSSDAIGSGWSYINEPV-GI 454
LP WS+SILPDCKNVVFN+A + T +Q+ D + ++ W +E V +
Sbjct: 449 LPPWSISILPDCKNVVFNSATVGVQT-------SQMQMWGDGATSM--MWERYDEEVDSL 499
Query: 455 SKDDAFTKPGLLEQINTTADQSDYLWYSLSTNIKADEPLLEDGSK-TVLHVQSLGHALHA 513
+ T GLLEQ+N T D SDYLWY S +I E L+ G K L VQS GHALH
Sbjct: 500 AAAPLLTTTGLLEQLNVTRDSSDYLWYITSVDISPSENFLQGGGKPPSLSVQSAGHALHV 559
Query: 514 FINGKLVGSGYGSSSNAKVTVDFPIALAPGKNTFDLLSLTVGLQNYGAFYEKTGAGITGP 573
F+NG+L GS YG+ + ++ + + L G N LLS+ GL N G YE G+ GP
Sbjct: 560 FVNGQLQGSSYGTREDRRIKYNGNVNLRAGTNKIALLSVACGLPNVGVHYETWNTGVGGP 619
Query: 574 VQLKGSGNGTNIDLSSQQWTYQTGLKGEELNFPS---GSSTQWDSKSTLP-KLQPLVWYK 629
V L G G+ DL+ Q W+YQ GLKGE++N S S +W S + K QPL WYK
Sbjct: 620 VVLHGLNEGSR-DLTWQTWSYQVGLKGEQMNLNSVEGSGSVEWMQGSLIAQKQQPLAWYK 678
Query: 630 TTFDAPAGSEPVAIDFTGMGKGEAWVNGQSIGRYWPTYVSQNGGCTDSCNYRGAYSSNKC 689
F+ P+G EP+A+D MGKG+ W+NGQSIGRYW Y +G C C+Y G + + KC
Sbjct: 679 AYFETPSGDEPLALDMGSMGKGQVWINGQSIGRYWTAYA--DGDC-KGCSYTGTFRAPKC 735
Query: 690 LKNCGKPSQSLYHVPRSWLKSSGNTLVLFEEI-GGDPTKISFVTKQLGSSLCSHVTDSHP 748
CG+P+Q YHVPRSWL+ S N LV+ EE+ GGD +KI+ + + SS+C+ V++ HP
Sbjct: 736 QAGCGQPTQRWYHVPRSWLQPSRNLLVVLEELGGGDSSKIALAKRSV-SSVCADVSEDHP 794
Query: 749 LPVDMWGSDSKIQRKPGPV-LSLECPNPNQVISSIKFASFGTPLGTCGSFSRGRCSSARS 807
+ W +S +R+ + L C + Q IS+I+FASFGTP+GTCG+F +G C SA S
Sbjct: 795 -NIKKWQIESYGEREHRRAKVHLRCAH-GQSISAIRFASFGTPVGTCGNFQQGGCHSASS 852
Query: 808 LSVVRQACVGSKSCSIGVSVNTF-GDPCKGVMKSLAVEASCT 848
+V+ + C+G + C + +S + F GDPC V K +AVEA C+
Sbjct: 853 HAVLEKRCIGLQRCVVAISPDNFGGDPCPSVTKRVAVEAVCS 894
>gi|3641863|emb|CAA06309.1| beta-galactosidase [Cicer arietinum]
Length = 730
Score = 896 bits (2316), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 432/731 (59%), Positives = 521/731 (71%), Gaps = 21/731 (2%)
Query: 7 LLLVLCWGFVVLATTSFGANVTYDHRAVVIGGKRRVLISGSIHYPRSTPEMWPDLIQKSK 66
L+L LC L S A+VTYDH+A+VI G+RR+LISGSIHYPRSTP+MWPDLIQK+K
Sbjct: 16 LVLFLC-----LFVFSVTASVTYDHKAIVINGQRRILISGSIHYPRSTPQMWPDLIQKAK 70
Query: 67 DGGLDVIETYVFWNLHEPVRNQYNFEGRYDLVKFVKLVAEAGLYAHLRIGPYVCAEWNFG 126
DGG+DVI+TYVFWN HEP Y FE R+DLVKFVK+V +AGLY +LRIGPYVCAEWNFG
Sbjct: 71 DGGVDVIQTYVFWNGHEPSPGNYYFEDRFDLVKFVKVVQQAGLYVNLRIGPYVCAEWNFG 130
Query: 127 GFPLWLHFIPGIQFRTDNEPFKAEMQRFTAKIVDMMKQEKLYASQGGPIILSQIENEYGN 186
GFP+WL ++PG+ FRTDNEPFKA MQ+FTAKIV MMK E L+ SQGGPII+SQIENEYG
Sbjct: 131 GFPVWLKYVPGVAFRTDNEPFKAAMQKFTAKIVSMMKAENLFESQGGPIIMSQIENEYGP 190
Query: 187 IDSAYGAAGKSYIKWAAGMALSLDTGVPWVMCQQSDAPDPIINTCNGFYCDQFTPNSNNK 246
++ GA GK+Y KW + MA+ LDTGVPW+MC+Q DAPDPII+TCNG+YC+ FTPN N K
Sbjct: 191 VEWEIGAPGKAYTKWFSQMAIGLDTGVPWIMCKQEDAPDPIIDTCNGYYCENFTPNKNYK 250
Query: 247 PKMWTENWSGWFLSFGGAVPYRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNFDRTSGGP 306
PKMWTENWSGW+ FG AVPYRP +D+AF+VARF Q G++ NYYMYHGGTNF RTS G
Sbjct: 251 PKMWTENWSGWYTDFGSAVPYRPAQDVAFSVARFIQNRGSYVNYYMYHGGTNFGRTSAGL 310
Query: 307 FISTSYDYDAPLDEYGLIRQPKWGHLKDLHKAIKLCEAALVATDPTYPSLGPNLEATVYK 366
FI+TSYDYDAP+DEYGL+ +PKWGHL++LHKAIK CE LV+ DPT G NLE VYK
Sbjct: 311 FIATSYDYDAPIDEYGLLSEPKWGHLRNLHKAIKQCEPILVSVDPTVSWPGKNLEVHVYK 370
Query: 367 TGSGLCSAFLANIGTNSDVTVKFNGNSYLLPAWSVSILPDCKNVVFNTAKINSVTLVPSF 426
T +G C+AFLAN T S V F Y LP WS+SILPDCK VFNTAK+ + VPSF
Sbjct: 371 TSTGACAAFLANYDTTSPAKVTFGNGQYDLPPWSISILPDCKTAVFNTAKVGT---VPSF 427
Query: 427 SRQSLQVAADSSDAIGSGWSYINE-PVGISKDDAFTKPGLLEQINTTADQSDYLWYSLST 485
R+ V++ W NE P DD+ T LLEQI T D SDYLWY
Sbjct: 428 HRKMTPVSS------AFDWQSYNEAPASSGIDDSTTANALLEQIKVTRDSSDYLWYMTDV 481
Query: 486 NIKADEPLLEDGSKTVLHVQSLGHALHAFINGKLVGSGYGSSSNAKVTVDFPIALAPGKN 545
NI +E +++G VL S GH LH F+NG+ G+ YG N K+T + L G N
Sbjct: 482 NISPNEGFIKNGQYPVLTAMSAGHVLHVFVNGQFSGTAYGGLENPKLTFSNSVKLRVGNN 541
Query: 546 TFDLLSLTVGLQNYGAFYEKTGAGITGPVQLKGSGNGTNIDLSSQQWTYQTGLKGEELNF 605
LLS+ VGL N G YE G+ GPV LKG GT DLS Q+W+Y+ GLKGE LN
Sbjct: 542 KISLLSVAVGLSNVGLHYETWNVGVLGPVTLKGLNEGTR-DLSGQKWSYKIGLKGETLNL 600
Query: 606 PS---GSSTQWDSKSTLPKLQPLVWYKTTFDAPAGSEPVAIDFTGMGKGEAWVNGQSIGR 662
+ SS QW S+L K QPL WYK TFDAPAG++P+A+D + MGKGE WVNG+SIGR
Sbjct: 601 HTLIGSSSVQWTKGSSLVKKQPLTWYKATFDAPAGNDPLALDMSSMGKGEIWVNGESIGR 660
Query: 663 YWPTYVSQNGGCTDSCNYRGAYSSNKCLKNCGKPSQSLYHVPRSWLKSSGNTLVLFEEIG 722
+WP Y+++ G CNY G ++ KC +CG+P+Q YH+PRSW+ GN LV+ EE G
Sbjct: 661 HWPAYIAR--GSCGGCNYAGTFTDKKCRTSCGQPTQKWYHIPRSWVNPRGNFLVVLEEWG 718
Query: 723 GDPTKISFVTK 733
GDP+ IS V +
Sbjct: 719 GDPSGISLVKR 729
>gi|356556286|ref|XP_003546457.1| PREDICTED: beta-galactosidase-like [Glycine max]
Length = 721
Score = 894 bits (2310), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 437/734 (59%), Positives = 525/734 (71%), Gaps = 28/734 (3%)
Query: 6 ILLLVLCWGFVVLATTSFGANVTYDHRAVVIGGKRRVLISGSIHYPRSTPEMWPDLIQKS 65
++L+ LC L A+VTYDH+A+V+ GKRR+LISGSIHYPRSTP+MWPDLIQK+
Sbjct: 9 VVLMSLC-----LWVCGVTASVTYDHKAIVVDGKRRILISGSIHYPRSTPQMWPDLIQKA 63
Query: 66 KDGGLDVIETYVFWNLHEPVRNQYNFEGRYDLVKFVKLVAEAGLYAHLRIGPYVCAEWNF 125
KDGGLDVI+TYVFWN HEP QY FE R+DLVKFVKLV +AGLY HLRIGPY+CAEWNF
Sbjct: 64 KDGGLDVIQTYVFWNGHEPSPGQYYFEDRFDLVKFVKLVQQAGLYVHLRIGPYICAEWNF 123
Query: 126 GGFPLWLHFIPGIQFRTDNEPFKAEMQRFTAKIVDMMKQEKLYASQGGPIILSQIENEYG 185
GGFP+WL ++PGI FRTDNEPFKA MQ+FTAKIV +MK+ +L+ SQGGPII+SQIENEYG
Sbjct: 124 GGFPVWLKYVPGIAFRTDNEPFKAAMQKFTAKIVSLMKENRLFQSQGGPIIMSQIENEYG 183
Query: 186 NIDSAYGAAGKSYIKWAAGMALSLDTGVPWVMCQQSDAPDPIINTCNGFYCDQFTPNSNN 245
++ GA GK+Y KWAA MA+ LDTGVPWVMC+Q DAPDP+I+TCNG+YC+ F PN N
Sbjct: 184 PVEWEIGAPGKAYTKWAAQMAVGLDTGVPWVMCKQEDAPDPVIDTCNGYYCENFKPNKNT 243
Query: 246 KPKMWTENWSGWFLSFGGAVPYRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNFDRTSGG 305
KPKMWTENW+GW+ FGGAVP RP EDLAF+VARF Q GG+F NYYMYHGGTNF RTSGG
Sbjct: 244 KPKMWTENWTGWYTDFGGAVPRRPAEDLAFSVARFIQNGGSFVNYYMYHGGTNFGRTSGG 303
Query: 306 PFISTSYDYDAPLDEYGLIRQPKWGHLKDLHKAIKLCEAALVATDPTYPSLGPNLEATVY 365
FI+TSYDYDAPLDEYGL +PK+ HL++LHKAIK CE ALVATDP SLG NLEA V+
Sbjct: 304 LFIATSYDYDAPLDEYGLQNEPKYEHLRNLHKAIKQCEPALVATDPKVQSLGYNLEAHVF 363
Query: 366 KTGSGLCSAFLANIGTNSDVTVKFNGNSYLLPAWSVSILPDCKNVVFNTAKINSV---TL 422
T G C+AF+AN T S F Y LP WS+SILPDCK VV+NTAK+ + +
Sbjct: 364 ST-PGACAAFIANYDTKSYAKATFGNGQYDLPPWSISILPDCKTVVYNTAKVGNSWLKKM 422
Query: 423 VPSFSRQSLQVAADSSDAIGSGWSYINEPVGISKDDAFTKPGLLEQINTTADQSDYLWYS 482
P S + Q SY EP S+ D+ L EQ+N T D SDYLWY
Sbjct: 423 TPVNSAFAWQ-------------SYNEEPASSSQADSIAAYALWEQVNVTRDSSDYLWYM 469
Query: 483 LSTNIKADEPLLEDGSKTVLHVQSLGHALHAFINGKLVGSGYGSSSNAKVTVDFPIALAP 542
I A+E L++G VL S GH LH FIN +L G+ +G +N K+T + L
Sbjct: 470 TDVYINANEGFLKNGQSPVLTAMSAGHVLHVFINDQLAGTVWGGLANPKLTFSDNVKLRV 529
Query: 543 GKNTFDLLSLTVGLQNYGAFYEKTGAGITGPVQLKGSGNGTNIDLSSQQWTYQTGLKGEE 602
G N LLS+ VGL N G +E AG+ GPV LKG GT DLSSQ+W+Y+ GLKGE
Sbjct: 530 GNNKLSLLSVAVGLPNVGVHFETWNAGVLGPVTLKGLNEGTR-DLSSQKWSYKVGLKGES 588
Query: 603 LNFPS---GSSTQWDSKSTLPKLQPLVWYKTTFDAPAGSEPVAIDFTGMGKGEAWVNGQS 659
L+ + SS +W S + K QPL WYKTTF APAG++P+A+D MGKGE WVNG+S
Sbjct: 589 LSLHTESGSSSVEWIRGSLVAKKQPLTWYKTTFSAPAGNDPLALDLGSMGKGEVWVNGRS 648
Query: 660 IGRYWPTYVSQNGGCTDSCNYRGAYSSNKCLKNCGKPSQSLYHVPRSWLKSSGNTLVLFE 719
IGR+WP Y++ G ++CNY G Y+ KC NCG+PSQ YHVPRSWL S GN+LV+FE
Sbjct: 649 IGRHWPGYIAH--GSCNACNYAGFYTDTKCRTNCGQPSQRWYHVPRSWLSSGGNSLVVFE 706
Query: 720 EIGGDPTKISFVTK 733
E GGDP I+ V +
Sbjct: 707 EWGGDPNGIALVKR 720
>gi|193850557|gb|ACF22882.1| beta-galactosidase [Glycine max]
Length = 721
Score = 892 bits (2306), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 436/734 (59%), Positives = 523/734 (71%), Gaps = 28/734 (3%)
Query: 6 ILLLVLCWGFVVLATTSFGANVTYDHRAVVIGGKRRVLISGSIHYPRSTPEMWPDLIQKS 65
++L++LC L A+VTYDH+A+V+ GKRR+LISGSIHYPRSTP+MWPDLIQK+
Sbjct: 9 VVLMMLC-----LWVCGVTASVTYDHKAIVVDGKRRILISGSIHYPRSTPQMWPDLIQKA 63
Query: 66 KDGGLDVIETYVFWNLHEPVRNQYNFEGRYDLVKFVKLVAEAGLYAHLRIGPYVCAEWNF 125
KDGGLDVI+TYVFWN HEP QY FE R+DLVKFVKL +AGLY HLRIGPY+CAEWN
Sbjct: 64 KDGGLDVIQTYVFWNGHEPSPGQYYFEDRFDLVKFVKLAQQAGLYVHLRIGPYICAEWNL 123
Query: 126 GGFPLWLHFIPGIQFRTDNEPFKAEMQRFTAKIVDMMKQEKLYASQGGPIILSQIENEYG 185
GGFP+WL ++PGI FRTDNEPFKA MQ+FTAKIV +MK+ +L+ SQGGPIILSQIENEYG
Sbjct: 124 GGFPVWLKYVPGIAFRTDNEPFKAAMQKFTAKIVSLMKENRLFQSQGGPIILSQIENEYG 183
Query: 186 NIDSAYGAAGKSYIKWAAGMALSLDTGVPWVMCQQSDAPDPIINTCNGFYCDQFTPNSNN 245
++ GA GK+Y KWAA MA+ LDTGVPWVMC+Q DAPDP+I+TCNGFYC+ F PN N
Sbjct: 184 PVEWEIGAPGKAYTKWAAQMAVGLDTGVPWVMCKQEDAPDPVIDTCNGFYCENFKPNKNT 243
Query: 246 KPKMWTENWSGWFLSFGGAVPYRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNFDRTSGG 305
KPKMWTENW+GW+ FGGAVP RP EDLAF+VARF Q GG+F NYYMYHGGTNF RTSGG
Sbjct: 244 KPKMWTENWTGWYTDFGGAVPRRPAEDLAFSVARFIQNGGSFVNYYMYHGGTNFGRTSGG 303
Query: 306 PFISTSYDYDAPLDEYGLIRQPKWGHLKDLHKAIKLCEAALVATDPTYPSLGPNLEATVY 365
FI+TSYDYDAPLDEYGL +PK+ HL+ LHKAIK E ALVATDP SLG NLEA V+
Sbjct: 304 LFIATSYDYDAPLDEYGLENEPKYEHLRALHKAIKQSEPALVATDPKVQSLGYNLEAHVF 363
Query: 366 KTGSGLCSAFLANIGTNSDVTVKFNGNSYLLPAWSVSILPDCKNVVFNTAKINS---VTL 422
+ G C+AF+AN T S KF Y LP WS+SILPDCK VV+NTAK+ +
Sbjct: 364 -SAPGACAAFIANYDTKSYAKAKFGNGQYDLPPWSISILPDCKTVVYNTAKVGYGWLKKM 422
Query: 423 VPSFSRQSLQVAADSSDAIGSGWSYINEPVGISKDDAFTKPGLLEQINTTADQSDYLWYS 482
P S + Q SY EP S+ D+ L EQ+N T D SDYLWY
Sbjct: 423 TPVNSAFAWQ-------------SYNEEPASSSQADSIAAYALWEQVNVTRDSSDYLWYM 469
Query: 483 LSTNIKADEPLLEDGSKTVLHVQSLGHALHAFINGKLVGSGYGSSSNAKVTVDFPIALAP 542
N+ A+E L++G +L V S GH LH FING+L G+ +G N K+T + L
Sbjct: 470 TDVNVNANEGFLKNGQSPLLTVMSAGHVLHVFINGQLAGTVWGGLGNPKLTFSDNVKLRA 529
Query: 543 GKNTFDLLSLTVGLQNYGAFYEKTGAGITGPVQLKGSGNGTNIDLSSQQWTYQTGLKGEE 602
G N LLS+ VGL N G +E AG+ GPV LKG GT DLS Q+W+Y+ GLKGE
Sbjct: 530 GNNKLSLLSVAVGLPNVGVHFETWNAGVLGPVTLKGLNEGTR-DLSRQKWSYKVGLKGES 588
Query: 603 LNFPS---GSSTQWDSKSTLPKLQPLVWYKTTFDAPAGSEPVAIDFTGMGKGEAWVNGQS 659
L+ + SS +W S + K QPL WYKTTF APAG++P+A+D MGKGE WVNG+S
Sbjct: 589 LSLHTESGSSSVEWIQGSLVAKKQPLTWYKTTFSAPAGNDPLALDLGSMGKGEVWVNGRS 648
Query: 660 IGRYWPTYVSQNGGCTDSCNYRGAYSSNKCLKNCGKPSQSLYHVPRSWLKSSGNTLVLFE 719
IGR+WP Y++ G ++CNY G Y+ KC NCG+PSQ YHVPRSWL S GN+LV+FE
Sbjct: 649 IGRHWPGYIAH--GSCNACNYAGYYTDTKCRTNCGQPSQRWYHVPRSWLSSGGNSLVVFE 706
Query: 720 EIGGDPTKISFVTK 733
E GGDP I+ V +
Sbjct: 707 EWGGDPNGIALVKR 720
>gi|3641865|emb|CAA09457.1| beta-galactosidase [Cicer arietinum]
Length = 723
Score = 891 bits (2302), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 431/722 (59%), Positives = 519/722 (71%), Gaps = 18/722 (2%)
Query: 15 FVVLATTSFGANVTYDHRAVVIGGKRRVLISGSIHYPRSTPEMWPDLIQKSKDGGLDVIE 74
F V A T A+VTYDH+ +VI G+RR+LISGSIHYPRSTPEMWP L QK+K+GGLDVI+
Sbjct: 16 FWVCAVT---ASVTYDHKTIVIDGQRRILISGSIHYPRSTPEMWPALFQKAKEGGLDVIQ 72
Query: 75 TYVFWNLHEPVRNQYNFEGRYDLVKFVKLVAEAGLYAHLRIGPYVCAEWNFGGFPLWLHF 134
TYVFWN HEP +Y FE R+DLVKF+KL +AGLY HLRIGPYVCAEWNFGGFP+WL +
Sbjct: 73 TYVFWNGHEPSPGKYYFEDRFDLVKFIKLAQQAGLYVHLRIGPYVCAEWNFGGFPVWLKY 132
Query: 135 IPGIQFRTDNEPFKAEMQRFTAKIVDMMKQEKLYASQGGPIILSQIENEYGNIDSAYGAA 194
+PGI FRTDNEPFKA MQ+FT KIV MMK E L+ +QGGPII+SQIENEYG ++ GA
Sbjct: 133 VPGISFRTDNEPFKAAMQKFTTKIVSMMKAENLFQNQGGPIIMSQIENEYGPVEWNIGAP 192
Query: 195 GKSYIKWAAGMALSLDTGVPWVMCQQSDAPDPIINTCNGFYCDQFTPNSNNKPKMWTENW 254
GK+Y WAA MA+ LDTGVPW MC+Q DAPDP+I+TCNG+YC+ FTPN N KPKMWTENW
Sbjct: 193 GKAYTNWAAQMAVGLDTGVPWDMCKQEDAPDPVIDTCNGYYCENFTPNKNYKPKMWTENW 252
Query: 255 SGWFLSFGGAVPYRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNFDRTSGGPFISTSYDY 314
SGW+ FG A+ YRPVEDLA++VARF Q G+F NYYMYHGGTNF RTS G FI+TSYDY
Sbjct: 253 SGWYTDFGNAICYRPVEDLAYSVARFIQNRGSFVNYYMYHGGTNFGRTSSGLFIATSYDY 312
Query: 315 DAPLDEYGLIRQPKWGHLKDLHKAIKLCEAALVATDPTYPSLGPNLEATVYKTGSGLCSA 374
DAP+DEYGL +PKW HL+DLHKAIK CE ALV+ DPT SLG LEA VY TG+ +C+A
Sbjct: 313 DAPIDEYGLTNEPKWSHLRDLHKAIKQCEPALVSVDPTITSLGNKLEAHVYSTGTSVCAA 372
Query: 375 FLANIGTNSDVTVKFNGNSYLLPAWSVSILPDCKNVVFNTAKINSVTLVPSFSRQSLQVA 434
FLAN T S TV F Y LP WSVSILPDCK VFNTAK+ + QS Q
Sbjct: 373 FLANYDTKSAATVTFGNGKYDLPPWSVSILPDCKTDVFNTAKVGA---------QSSQKT 423
Query: 435 ADSSDAIGSGWSYINEPVGISKDDAFTKPGLLEQINTTADQSDYLWYSLSTNIKADEPLL 494
S+++ SYI EP S+DD+ T L EQIN T D SDYLWY NI +E +
Sbjct: 424 MISTNSTFDWQSYIEEPAFSSEDDSITAEALWEQINVTRDSSDYLWYLTDVNISPNEDFI 483
Query: 495 EDGSKTVLHVQSLGHALHAFINGKLVGSGYGSSSNAKVTVDFPIALAPGKNTFDLLSLTV 554
++G +L+V S GH LH F+NG+L G+ YG N K+T + L G N LLS+ V
Sbjct: 484 KNGQYPILNVMSAGHVLHVFVNGQLSGTVYGVLDNPKLTFSNSVNLTVGNNKISLLSVAV 543
Query: 555 GLQNYGAFYEKTGAGITGPVQLKGSGNGTNIDLSSQQWTYQTGLKGEELNFPS---GSST 611
GL N G +E G+ GPV LKG GT DLS Q+W+Y+ GLKGE L+ + GSS
Sbjct: 544 GLPNVGLHFETWNVGVLGPVTLKGLNEGTR-DLSWQKWSYKVGLKGESLSLHTITGGSSV 602
Query: 612 QWDSKSTLPKLQPLVWYKTTFDAPAGSEPVAIDFTGMGKGEAWVNGQSIGRYWPTYVSQN 671
W S L K QPL WYK TF+APAG++P+ +D + MGKGE WVN QSIGR+WP Y++
Sbjct: 603 DWTQGSLLAKKQPLTWYKATFNAPAGNDPLGLDMSSMGKGEIWVNDQSIGRHWPGYIAH- 661
Query: 672 GGCTDSCNYRGAYSSNKCLKNCGKPSQSLYHVPRSWLKSSGNTLVLFEEIGGDPTKISFV 731
G C D C+Y G +++ KC NCG P+Q+ YH+PRSWL +GN LV+ EE GGDP+ IS +
Sbjct: 662 GSCGD-CDYAGTFTNTKCRTNCGNPTQTWYHIPRSWLNPTGNVLVVLEEWGGDPSGISLL 720
Query: 732 TK 733
+
Sbjct: 721 KR 722
>gi|326517964|dbj|BAK07234.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 616
Score = 887 bits (2292), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 432/620 (69%), Positives = 504/620 (81%), Gaps = 11/620 (1%)
Query: 88 QYNFEGRYDLVKFVKLVAEAGLYAHLRIGPYVCAEWNFGGFPLWLHFIPGIQFRTDNEPF 147
QY+FEGR DLV+FVK A+AGLY HLRIGPYVCAEWN+GGFPLWLHFIPGI+ RTDNEPF
Sbjct: 1 QYDFEGRNDLVRFVKAAADAGLYVHLRIGPYVCAEWNYGGFPLWLHFIPGIKLRTDNEPF 60
Query: 148 KAEMQRFTAKIVDMMKQEKLYASQGGPIILSQIENEYGNIDSAYGAAGKSYIKWAAGMAL 207
K EMQRFT K+V MK LYASQGGPIILSQIENEYGNI ++YGAAGKSYI+WAAGMA+
Sbjct: 61 KTEMQRFTEKVVATMKGAGLYASQGGPIILSQIENEYGNIAASYGAAGKSYIRWAAGMAV 120
Query: 208 SLDTGVPWVMCQQSDAPDPIINTCNGFYCDQFTPNSNNKPKMWTENWSGWFLSFGGAVPY 267
+LDTGVPWVMCQQ+DAP+P+INTCNGFYCDQFTP+ ++PK+WTENWSGWFLSFGGAVPY
Sbjct: 121 ALDTGVPWVMCQQTDAPEPLINTCNGFYCDQFTPSLPSRPKLWTENWSGWFLSFGGAVPY 180
Query: 268 RPVEDLAFAVARFFQRGGTFQNYYMYHGGTNFDRTSGGPFISTSYDYDAPLDEYGLIRQP 327
RP EDLAFAVARF+QRGGT QNYYMYHGGTNF R+SGGPFISTSYDYDAP+DEYGL+RQP
Sbjct: 181 RPTEDLAFAVARFYQRGGTLQNYYMYHGGTNFGRSSGGPFISTSYDYDAPIDEYGLVRQP 240
Query: 328 KWGHLKDLHKAIKLCEAALVATDPTYPSLGPNLEATVYKTGSGLCSAFLANIGTNSDVTV 387
KWGHL+D+HKAIK+CE AL+ATDP+Y SLG N EA VYK+GS LC+AFLANI SD TV
Sbjct: 241 KWGHLRDVHKAIKMCEPALIATDPSYMSLGQNAEAHVYKSGS-LCAAFLANIDDQSDKTV 299
Query: 388 KFNGNSYLLPAWSVSILPDCKNVVFNTAKINSVTLVPS-----FSRQSLQVAADSSDAIG 442
FNG +Y LPAWSVSILPDCKNVV NTA+INS FS Q+ ++ ++
Sbjct: 300 TFNGKAYKLPAWSVSILPDCKNVVLNTAQINSQVASTQMRNLGFSTQASDGSSVEAELAA 359
Query: 443 SGWSYINEPVGISKDDAFTKPGLLEQINTTADQSDYLWYSLSTNIKADEPLLEDGSKTVL 502
S WSY EPVGI+K++A TKPGL+EQINTTAD SD+LWYS S + EP L +GS++ L
Sbjct: 360 SSWSYAVEPVGITKENALTKPGLMEQINTTADASDFLWYSTSIVVAGGEPYL-NGSQSNL 418
Query: 503 HVQSLGHALHAFINGKLVGSGYGSSSNAKVTVDFPIALAPGKNTFDLLSLTVGLQNYGAF 562
V SLGH L FINGKL GS GS+S++ +++ P+ L GKN DLLS TVGL NYGAF
Sbjct: 419 LVNSLGHVLQVFINGKLAGSSKGSASSSLISLTTPVTLVTGKNKIDLLSATVGLTNYGAF 478
Query: 563 YEKTGAGITGPVQLKGSGNGTNIDLSSQQWTYQTGLKGEELNF--PSGSSTQWDSKSTLP 620
++ GAGITGPV+L G GT +DLSS +WTYQ GL+GE+L+ PS +S +W S ++ P
Sbjct: 479 FDLVGAGITGPVKLTGP-KGT-LDLSSAEWTYQIGLRGEDLHLYNPSEASPEWVSDNSYP 536
Query: 621 KLQPLVWYKTTFDAPAGSEPVAIDFTGMGKGEAWVNGQSIGRYWPTYVSQNGGCTDSCNY 680
PL WYK+ F APAG +PVAIDFTGMGKGEAWVNGQSIGRYWPT ++ GC +SCNY
Sbjct: 537 TNNPLTWYKSKFTAPAGDDPVAIDFTGMGKGEAWVNGQSIGRYWPTNIAPQSGCVNSCNY 596
Query: 681 RGAYSSNKCLKNCGKPSQSL 700
RG+YS+ KCLK CG+PSQ L
Sbjct: 597 RGSYSATKCLKKCGQPSQIL 616
>gi|302824860|ref|XP_002994069.1| hypothetical protein SELMODRAFT_187747 [Selaginella moellendorffii]
gi|300138075|gb|EFJ04856.1| hypothetical protein SELMODRAFT_187747 [Selaginella moellendorffii]
Length = 741
Score = 886 bits (2290), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 431/730 (59%), Positives = 527/730 (72%), Gaps = 36/730 (4%)
Query: 27 VTYDHRAVVIGGKRRVLISGSIHYPRSTPEMWPDLIQKSKDGGLDVIETYVFWNLHEPVR 86
V YDHR ++I G+ R+LIS SIHYPR+ P+MW LI +K GG+DVIETYVFW+ H+P R
Sbjct: 26 VAYDHRGLIINGQHRMLISASIHYPRAAPQMWSQLISNAKAGGIDVIETYVFWDGHQPTR 85
Query: 87 NQYNFEGRYDLVKFVKLVAEAGLYAHLRIGPYVCAEWNFGGFPLWLHFIPGIQFRTDNEP 146
+ YNFEGR+DLV FVKLV EAGLYA+LRIGPYVCAEWN GGFP+WL + GI+FRT+N+P
Sbjct: 86 DTYNFEGRFDLVSFVKLVHEAGLYANLRIGPYVCAEWNLGGFPVWLKDVAGIEFRTNNQP 145
Query: 147 FKAEMQRFTAKIVDMMKQEKLYASQGGPIILSQIENEYGNIDSAYGAAGKSYIKWAAGMA 206
FKAEMQ F KIV MMK +KL+A QGGPIIL+QIENEYGNID+AYGAAGK Y+ WAA M+
Sbjct: 146 FKAEMQTFVEKIVAMMKHDKLFAPQGGPIILAQIENEYGNIDAAYGAAGKEYMVWAANMS 205
Query: 207 LSLDTGVPWVMCQQSDAPDPIINTCNGFYCDQFTPNSNNKPKMWTENWSGWFLSFGGAVP 266
L TGVPW+MCQQSDAPD I++TCNGFYCD + PN+ KPKMWTENWSGWF +G A P
Sbjct: 206 QGLGTGVPWIMCQQSDAPDYILDTCNGFYCDAWAPNNKKKPKMWTENWSGWFQKWGEASP 265
Query: 267 YRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNFDRTSGGPFISTSYDYDAPLDEYGLIRQ 326
+RPVED+AFAVARFFQRGG+FQNYYMY GGTNF R+SGGP+++TSYDYDAP+DE+G+IRQ
Sbjct: 266 HRPVEDVAFAVARFFQRGGSFQNYYMYFGGTNFGRSSGGPYVTTSYDYDAPIDEFGVIRQ 325
Query: 327 PKWGHLKDLHKAIKLCEAALVATDPTYPSLGPNLEATVY-KTGSGLCSAFLANIGTNSDV 385
PKWGHLK LH AIKLCEAAL + DPTY SLG EA VY T SG C+AFLANI ++SD
Sbjct: 326 PKWGHLKQLHAAIKLCEAALGSNDPTYISLGQLQEAHVYGSTSSGACAAFLANIDSSSDA 385
Query: 386 TVKFNGNSYLLPAWSVSILPDCKNVVFNTAKINSVTLVPSFSRQSLQVAADSSDAIGSGW 445
TVKFN +YLLPAWSVSILPDCK V NTAK++ T +P+ G W
Sbjct: 386 TVKFNSRTYLLPAWSVSILPDCKTVSHNTAKVDVQTAMPTM----------KPSITGLAW 435
Query: 446 SYINEPVGISKDDAFTKPGLLEQINTTADQSDYLWYSLSTNI-KADEPLLEDGSKTVLHV 504
EPVG+ D LLEQINTT D SDYLWY+ S +I +AD K +L++
Sbjct: 436 ESYPEPVGVWSDSGIVASALLEQINTTKDTSDYLWYTTSLDISQADAA----SGKALLYL 491
Query: 505 QSLGHALHAFINGKLVGSGYGSSSNAKVTVDFPIALAPGKNTFDLLSLTVGLQNYGAFYE 564
+S+ +H F+NGKL GS + V+ PI LA G N+ +L TVGLQNYG F E
Sbjct: 492 ESMRDVVHVFVNGKLAGSASTKGTQLYAAVEQPIELASGHNSLAILCATVGLQNYGPFIE 551
Query: 565 KTGAGITGPVQLKGSGNGTNIDLSSQQWTYQTGLKGEELN-FPSGSSTQWDSKSTLPKLQ 623
GAGI G V +KG +G IDL++++W +Q GLKGE L F S + S +P+ Q
Sbjct: 552 TWGAGINGSVIVKGLPSG-QIDLTAEEWIHQVGLKGESLAIFTESGSQRVRWSSAVPQGQ 610
Query: 624 PLVWYKTT-----------------FDAPAGSEPVAIDFTGMGKGEAWVNGQSIGRYWPT 666
LVWYK FD+P+G++PVA+D MGKG+AW+NGQSIGR+WP+
Sbjct: 611 ALVWYKVIFQHHGITCIVWIAMQAHFDSPSGNDPVALDLESMGKGQAWINGQSIGRFWPS 670
Query: 667 YVSQN-GGCTDSCNYRGAYSSNKCLKNCGKPSQSLYHVPRSWLKSSGNTLVLFEEIGGDP 725
+ + GC +C+YRG+YSS+KC CG+PSQ YHVPRSWL+ GN +VLFEE GG P
Sbjct: 671 LRAPDTAGCPQTCDYRGSYSSSKCRSGCGQPSQRWYHVPRSWLQDGGNLVVLFEEEGGKP 730
Query: 726 TKISFVTKQL 735
+ +SFVT+ +
Sbjct: 731 SGVSFVTRTV 740
>gi|359484258|ref|XP_002276918.2| PREDICTED: beta-galactosidase 7-like [Vitis vinifera]
gi|297738528|emb|CBI27773.3| unnamed protein product [Vitis vinifera]
Length = 835
Score = 886 bits (2289), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 438/846 (51%), Positives = 578/846 (68%), Gaps = 39/846 (4%)
Query: 9 LVLCWGFVVLATTSFGANVTYDHRAVVIGGKRRVLISGSIHYPRSTPEMWPDLIQKSKDG 68
+ C FV+L + V+YD RA++I GKRRVL SGSIHYPRSTPEMWPDLI+K+K G
Sbjct: 22 ISFCVLFVLLNVLASAVEVSYDGRALIIDGKRRVLQSGSIHYPRSTPEMWPDLIRKAKAG 81
Query: 69 GLDVIETYVFWNLHEPVRNQYNFEGRYDLVKFVKLVAEAGLYAHLRIGPYVCAEWNFGGF 128
GLD IETYVFWN+HEP+R +Y+F G DL++F++ + GLYA LRIGPYVCAEW +GGF
Sbjct: 82 GLDAIETYVFWNVHEPLRREYDFSGNLDLIRFIQTIQAEGLYAVLRIGPYVCAEWTYGGF 141
Query: 129 PLWLHFIPGIQFRTDNEPFKAEMQRFTAKIVDMMKQEKLYASQGGPIILSQIENEYGNID 188
P+WLH +PGI+FRT N+ F EMQ FT IVDM KQEKL+ASQGGPII++QIENEYGNI
Sbjct: 142 PMWLHNMPGIEFRTANKVFMNEMQNFTTLIVDMAKQEKLFASQGGPIIIAQIENEYGNIM 201
Query: 189 SAYGAAGKSYIKWAAGMALSLDTGVPWVMCQQSDAPDPIINTCNGFYCDQFTPNSNNKPK 248
+ YG AGK Y+ W A MA SLD GVPW+MCQQSDAP P+INTCNG+YCD FTPN+ N PK
Sbjct: 202 APYGDAGKVYVDWCAAMANSLDIGVPWIMCQQSDAPQPMINTCNGWYCDSFTPNNPNSPK 261
Query: 249 MWTENWSGWFLSFGGAVPYRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNFDRTSGGPFI 308
MWTENW+GWF ++GG P+R EDL+++VARFFQ GGTFQNYYMYHGGTNF R +GGP+I
Sbjct: 262 MWTENWTGWFKNWGGKDPHRTAEDLSYSVARFFQTGGTFQNYYMYHGGTNFGRVAGGPYI 321
Query: 309 STSYDYDAPLDEYGLIRQPKWGHLKDLHKAIKLCEAALVATDPTYPSLGPNLEATVYKTG 368
+TSYDYDAPLDE+G + QPKWGHLKDLH +K E L + T +G ++E TVY T
Sbjct: 322 TTSYDYDAPLDEFGNLNQPKWGHLKDLHTVLKSMEETLTEGNITTIDMGNSVEVTVYAT- 380
Query: 369 SGLCSAFLANIGTNSDVTVKFNGNSYLLPAWSVSILPDCKNVVFNTAKINSVTLVPSFSR 428
+ S F +N T +D T + G Y +PAWSVSILPDCK V+NTAK+N+ T V ++
Sbjct: 381 QKVSSCFFSNSNTTNDATFTYGGTEYTVPAWSVSILPDCKKEVYNTAKVNAQTSVMVKNK 440
Query: 429 QSLQVAADSSDAIGSGW--SYINEPVGISKDDAFTKPGLLEQINTTADQSDYLWYSLSTN 486
A D ++ W I++ + K + L++Q TT D+SDYLWY S +
Sbjct: 441 NE---AEDQPASLKWSWRPEMIDDTAVLGKGQV-SANRLIDQ-KTTNDRSDYLWYMNSVD 495
Query: 487 IKADEPLLEDGSKTVLHVQSLGHALHAFINGKLVGSGYGSSSNAKVTVDFPIALAPGKNT 546
+ D+ + D L V + GH LHA++NG+ +GS + ++ + + L PGKN
Sbjct: 496 LSEDDLVWTD--NMTLRVNATGHILHAYVNGEYLGSQWATNGIFNYVFEEKVKLKPGKNL 553
Query: 547 FDLLSLTVGLQNYGAFYEKTGAGITGPVQLKGSGNGTNI--DLSSQQWTYQTGLKGEELN 604
LLS T+G QNYGAFY+ +GI+GPV++ G I DLSS +W+Y+ G+ G +
Sbjct: 554 IALLSATIGFQNYGAFYDLVQSGISGPVEIVGRKGDETIIKDLSSHKWSYKVGMHGMAMK 613
Query: 605 -FPSGSSTQWDSKSTLPKLQPLVWYKTTFDAPAGSEPVAIDFTGMGKGEAWVNGQSIGRY 663
+ S +W+ + +P + L WYKTTF AP G++ V +D G+GKGEAWVNGQS+GRY
Sbjct: 614 LYDPESPYKWE-EGNVPLNRNLTWYKTTFKAPLGTDAVVVDLQGLGKGEAWVNGQSLGRY 672
Query: 664 WPTYVSQNGGCTDSCNYRGAYSSNKCLKNCGKPSQSLYHVPRSWLKSSGNTLVLFEEIGG 723
WP+ ++++ GC +C+YRG Y++ KC++NCG P+Q YHVPRS+L + NTLVLFEE GG
Sbjct: 673 WPSSIAED-GCNATCDYRGPYTNTKCVRNCGNPTQRWYHVPRSFLTADENTLVLFEEFGG 731
Query: 724 DPTKISFVTKQLGSSLCSHVTDSHPLPVDMWGSDSKIQRKPGPVLSLECPNPNQVISSIK 783
+P+ ++F T +G++ C + +++ VL L C N+ IS IK
Sbjct: 732 NPSLVNFQTVTIGTA-CGNAYENN-------------------VLELAC--QNRPISDIK 769
Query: 784 FASFGTPLGTCGSFSRGRCSSAR-SLSVVRQACVGSKSCSIGVSVNTFGD-PCKGVMKSL 841
FASFG P G+CGSFS+G C + +L ++++ACVG +SCS+ VS FG C + K L
Sbjct: 770 FASFGDPQGSCGSFSKGSCEGNKDALDIIKKACVGKESCSLDVSEKAFGSTSCGSIPKRL 829
Query: 842 AVEASC 847
AVEA C
Sbjct: 830 AVEAVC 835
>gi|68161828|emb|CAJ09953.1| beta-galactosidase [Mangifera indica]
Length = 827
Score = 885 bits (2288), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 460/852 (53%), Positives = 576/852 (67%), Gaps = 45/852 (5%)
Query: 10 VLCWGF-VVLATTSFGANVTYDHRAVVIGGKRRVLISGSIHYPRSTPEMWPDLIQKSKDG 68
+LC F V + S NV++D RA++I G+RRVL+SGSIHYPRSTPEMWPDLI+K+K+G
Sbjct: 7 LLCLLFQAVFISLSCAYNVSHDGRAIIIDGQRRVLLSGSIHYPRSTPEMWPDLIRKAKEG 66
Query: 69 GLDVIETYVFWNLHEPVRNQYNFEGRYDLVKFVKLVAEAGLYAHLRIGPYVCAEWNFGGF 128
GLD IETYVFWN HEP R QY+F G DL++F+K + + GLYA LRIGPYVCAEWN+GGF
Sbjct: 67 GLDAIETYVFWNAHEPARRQYDFSGHLDLIRFIKTIQDEGLYAVLRIGPYVCAEWNYGGF 126
Query: 129 PLWLHFIPGIQ-FRTDNEPFKAEMQRFTAKIVDMMKQEKLYASQGGPIILSQIENEYGNI 187
P+WLH +PG+Q FRT NE F EMQ FT IVDM+KQEKL+ASQGGPII++QIENEYGN+
Sbjct: 127 PVWLHNMPGVQEFRTVNEVFMNEMQNFTTLIVDMVKQEKLFASQGGPIIIAQIENEYGNM 186
Query: 188 DSAYGAAGKSYIKWAAGMALSLDTGVPWVMCQQSDAPDPIINTCNGFYCDQFTPNSNNKP 247
S YG AGK YI W A MA SLD GVPW+MCQ+SDAP P+INTCNG+YCD FTPN N P
Sbjct: 187 ISNYGDAGKVYIDWCAKMAESLDIGVPWIMCQESDAPQPMINTCNGWYCDSFTPNDPNSP 246
Query: 248 KMWTENWSGWFLSFGGAVPYRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNFDRTSGGPF 307
KMWTENW+GWF S+GG P+R EDLAF+VARFFQ GGTFQNYYMYHGGTNF RTSGGP+
Sbjct: 247 KMWTENWTGWFKSWGGKDPHRTAEDLAFSVARFFQTGGTFQNYYMYHGGTNFGRTSGGPY 306
Query: 308 ISTSYDYDAPLDEYGLIRQPKWGHLKDLHKAIKLCEAALVATDPTYPSLGPNLEATVYKT 367
++TSYDYDAPLDE+G + QPKWGHLK+LH +K E L + + G ++ ATVY T
Sbjct: 307 LTTSYDYDAPLDEFGNLNQPKWGHLKELHTVLKAMEKTLTHGNVSTTDFGNSVTATVYAT 366
Query: 368 GSGLCSAFLANIGTNSDVTVKFNGNSYLLPAWSVSILPDCKNVVFNTAKINSVTLVPSFS 427
G S F N T D T+ F G+ Y++PAWSVSILPDCK +NTAK+N+ T S
Sbjct: 367 EEG-SSCFFGNANTTGDATITFQGSDYVVPAWSVSILPDCKTEAYNTAKVNTQT---SVI 422
Query: 428 RQSLQVAADSSDAIGSGW--SYINEPVGISKDDAFTKPGLLEQINTTADQSDYLWYSLST 485
+ A + ++ W I+EPV + +F+ L++Q D SDYLWY S
Sbjct: 423 VKKPNQAENEPSSLKWVWRPEAIDEPV-VQGKGSFSASFLIDQ-KVINDASDYLWYMTSV 480
Query: 486 NIKADEPLLEDGSKTVLHVQSLGHALHAFINGKLVGSGYGSSSNAKVTVDFPIALAPGKN 545
++K D+ + D L V + G LHAF+NG+ VGS + K + L PGKN
Sbjct: 481 DLKPDDIIWSD--NMTLRVNTTGIVLHAFVNGEHVGSQWTKYGVFKDVFQQQVKLNPGKN 538
Query: 546 TFDLLSLTVGLQNYGAFYEKTGAGITGPVQLKGS-GNGTNI-DLSSQQWTYQTGLKGEEL 603
LLS+TVGLQNYG ++ AGITGPV+L G G+ T I DLS +WTY+ GL G E
Sbjct: 539 QISLLSVTVGLQNYGPMFDMVQAGITGPVELIGQKGDETVIKDLSCHKWTYEVGLTGLED 598
Query: 604 N-FPSGSSTQ----WDSKSTLPKLQPLVWYKTTFDAPAGSEPVAIDFTGMGKGEAWVNGQ 658
N F S +ST W S +P + WYKTTF AP G++PV +D GMGKG AWVNG
Sbjct: 599 NKFYSKASTNETCGW-SAENVPSNSKMTWYKTTFKAPLGNDPVVLDLQGMGKGFAWVNGY 657
Query: 659 SIGRYWPTYVSQNGGC-TDSCNYRGAYSSNKCLKNCGKPSQSLYHVPRSWLKSSGNTLVL 717
++GRYWP+Y+++ GC +D C+YRG Y +NKC+ NCG+PSQ YHVPRS+L+ NTLVL
Sbjct: 658 NLGRYWPSYLAEADGCSSDPCDYRGQYDNNKCVTNCGQPSQRWYHVPRSFLQDGENTLVL 717
Query: 718 FEEIGGDPTKISFVTKQLGSSLCSHVTDSHPLPVDMWGSDSKIQRKPGPVLSLECPNPNQ 777
FEE GG+P +++F T +G S+C + + L L C +
Sbjct: 718 FEEFGGNPWQVNFQTLVVG-SVCGNAHEKK-------------------TLELSC--NGR 755
Query: 778 VISSIKFASFGTPLGTCGSFSRGRCSSARS-LSVVRQACVGSKSCSIGVSVNTFGDP-CK 835
IS+IKFASFG P GTCGSF G C + + L V++Q CVG ++CSI +S + G C
Sbjct: 756 PISAIKFASFGDPQGTCGSFQAGTCQTEQDILPVLQQECVGKETCSIDISEDKLGKTNCG 815
Query: 836 GVMKSLAVEASC 847
V+K LAVEA C
Sbjct: 816 SVVKKLAVEAVC 827
>gi|224053294|ref|XP_002297749.1| predicted protein [Populus trichocarpa]
gi|222845007|gb|EEE82554.1| predicted protein [Populus trichocarpa]
Length = 823
Score = 884 bits (2285), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 441/854 (51%), Positives = 579/854 (67%), Gaps = 38/854 (4%)
Query: 1 MASKEILLLVLCWGFVVLATTSFGANVTYDHRAVVIGGKRRVLISGSIHYPRSTPEMWPD 60
M ++LL L F LA + + VTYD RA++I GK R+L+SGSIHYPRST +MWPD
Sbjct: 1 MHPSKVLLATLF--FFTLAPWATASKVTYDGRAIIIDGKHRLLVSGSIHYPRSTAQMWPD 58
Query: 61 LIQKSKDGGLDVIETYVFWNLHEPVRNQYNFEGRYDLVKFVKLVAEAGLYAHLRIGPYVC 120
L++KS++GGLD IETYVFW+ HEP R +Y+F G DL++F+K + + GLYA LRIGPYVC
Sbjct: 59 LVKKSREGGLDAIETYVFWDSHEPARREYDFSGNLDLIRFLKTIQDEGLYAVLRIGPYVC 118
Query: 121 AEWNFGGFPLWLHFIPGIQFRTDNEPFKAEMQRFTAKIVDMMKQEKLYASQGGPIILSQI 180
AEWN+GGFP+WLH +PG+Q RT N+ F EM+ FT IV+M+KQE L+ASQGGP+IL+QI
Sbjct: 119 AEWNYGGFPVWLHNMPGVQMRTANDVFMNEMRNFTTLIVNMVKQENLFASQGGPVILAQI 178
Query: 181 ENEYGNIDSAYGAAGKSYIKWAAGMALSLDTGVPWVMCQQSDAPDPIINTCNGFYCDQFT 240
ENEYGN+ S+YG GK+YI+W A MA SL GVPW+MCQQSDAP+P+INTCNG+YCDQFT
Sbjct: 179 ENEYGNVMSSYGDEGKAYIEWCANMAQSLHIGVPWLMCQQSDAPEPMINTCNGWYCDQFT 238
Query: 241 PNSNNKPKMWTENWSGWFLSFGGAVPYRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNFD 300
PN PKMWTENW+GWF S+GG P+R EDLAF+VARF+Q GGTFQNYYMYHGGTNF
Sbjct: 239 PNRPTSPKMWTENWTGWFKSWGGKDPHRTAEDLAFSVARFYQLGGTFQNYYMYHGGTNFG 298
Query: 301 RTSGGPFISTSYDYDAPLDEYGLIRQPKWGHLKDLHKAIKLCEAALVATDPTYPSLGPNL 360
RT+GGP+I+TSYDYDAPLDEYG + QPKWGHLK+LH + E L + + G ++
Sbjct: 299 RTAGGPYITTSYDYDAPLDEYGNLNQPKWGHLKELHDVLHSMEDTLTRGNISSVDFGNSV 358
Query: 361 EATVYKTGSGLCSAFLANIGTNSDVTVKFNGNSYLLPAWSVSILPDCKNVVFNTAKINSV 420
T+Y T G S FL N + +D T+ F G Y +PAWSVSILPDC++VV+NTAK+++
Sbjct: 359 SGTIYSTEKG-SSCFLTNTDSRNDTTINFQGLDYEVPAWSVSILPDCQDVVYNTAKVSAQ 417
Query: 421 TLVPSFSRQSLQVAADSSDAIGSGWS-YINEPVGISKDDAFTKPGLLEQINTTADQSDYL 479
T S + VA D A+ W N+ + + +L+Q + D SDYL
Sbjct: 418 T---SVMVKKKNVAEDEPAALTWSWRPETNDKSILFGKGEVSVNQILDQKDAANDLSDYL 474
Query: 480 WYSLSTNIKADEPLLEDGSKTVLHVQSLGHALHAFINGKLVGSGYGSSSNAKVTVDFPIA 539
+Y S ++K D+P+ G L + G LH F+NG+ +GS + + I
Sbjct: 475 FYMTSVSLKEDDPIW--GDNMTLRITGSGQVLHVFVNGEFIGSQWAKYGVFDYVFEQQIK 532
Query: 540 LAPGKNTFDLLSLTVGLQNYGAFYEKTGAGITGPVQLKGSGNGTNI--DLSSQQWTYQTG 597
L GKNT LLS TVG NYGA ++ T AG+ GPV+L G + I DLSS +W+Y+ G
Sbjct: 533 LNKGKNTITLLSATVGFANYGANFDLTQAGVRGPVELVGYHDDEIIIKDLSSHKWSYKVG 592
Query: 598 LKGEELNFPSGSSTQWDSKSTLPKLQPLVWYKTTFDAPAGSEPVAIDFTGMGKGEAWVNG 657
L+G N S S++W + P + WYK TF AP G++PV +D G+GKG AWVNG
Sbjct: 593 LEGLRQNLYSSDSSKW-QQDNYPTNKMFTWYKATFKAPLGTDPVVVDLLGLGKGLAWVNG 651
Query: 658 QSIGRYWPTYVSQNGGCTDSCNYRGAYSSNKCLKNCGKPSQSLYHVPRSWLKSSG-NTLV 716
SIGRYWP++++++G D C+YRG+Y +NKC+ NCGKP+Q YHVPRS+L + G NTLV
Sbjct: 652 NSIGRYWPSFIAEDGCSLDPCDYRGSYDNNKCVTNCGKPTQRWYHVPRSFLNNEGDNTLV 711
Query: 717 LFEEIGGDPTKISFVTKQLGSSLCSHVTDSHPLPVDMWGSDSKIQRKPGPVLSLECPNPN 776
LFEE GGDP+ ++F T +GS+ C + + + + Q +P
Sbjct: 712 LFEEFGGDPSSVNFQTTAIGSA-CVNAEEKKKIEL-------SCQGRP------------ 751
Query: 777 QVISSIKFASFGTPLGTCGSFSRGRC-SSARSLSVVRQACVGSKSCSIGVSVNTFGDPCK 835
IS+IKFASFG PLGTCGSFS+G C +S +LS+V++ACVG +SC+I VS +TFG
Sbjct: 752 --ISAIKFASFGNPLGTCGSFSKGTCEASNDALSIVQKACVGQESCTIDVSEDTFGSTTC 809
Query: 836 G--VMKSLAVEASC 847
G V+K+L+VEA C
Sbjct: 810 GDDVIKTLSVEAIC 823
>gi|20384648|gb|AAK31801.1| beta-galactosidase [Citrus sinensis]
Length = 737
Score = 884 bits (2285), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 432/739 (58%), Positives = 527/739 (71%), Gaps = 25/739 (3%)
Query: 2 ASKEILLLVLCWGFVVLATTSFGANVTYDHRAVVIGGKRRVLISGSIHYPRSTPEMWPDL 61
A+ ++ +LVL F + A+V+YDH+AV+I G++R+LISGSIHYPRSTPEMWPDL
Sbjct: 15 ANVKVSMLVLL-SFCSWEISFVKASVSYDHKAVIINGQKRILISGSIHYPRSTPEMWPDL 73
Query: 62 IQKSKDGGLDVIETYVFWNLHEPVRNQYNFEGRYDLVKFVKLVAEAGLYAHLRIGPYVCA 121
IQK+KDGGLDVI+TYVFWN HEP + Y F+ RYDLV+F+KLV +AGLY HLRIGPYVCA
Sbjct: 74 IQKAKDGGLDVIQTYVFWNGHEPTQGNYYFQDRYDLVRFIKLVQQAGLYVHLRIGPYVCA 133
Query: 122 EWNFGGFPLWLHFIPGIQFRTDNEPFKAEMQRFTAKIVDMMKQEKLYASQGGPIILSQIE 181
EWN+GGFP+WL ++PGI+FRTDN PFKA M +FT KIV MMK EKL+ +QGGPIILSQIE
Sbjct: 134 EWNYGGFPVWLKYVPGIEFRTDNGPFKAAMHKFTEKIVSMMKAEKLFQTQGGPIILSQIE 193
Query: 182 NEYGNIDSAYGAAGKSYIKWAAGMALSLDTGVPWVMCQQSDAPDPIINTCNGFYCDQFTP 241
NE+G ++ GA GK+Y KWAA MA+ L+TGVPWVMC+Q DAPDP+INTCNGFYC++F P
Sbjct: 194 NEFGPVEWDIGAPGKAYAKWAAQMAVGLNTGVPWVMCKQDDAPDPVINTCNGFYCEKFVP 253
Query: 242 NSNNKPKMWTENWSGWFLSFGGAVPYRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNFDR 301
N N KPKMWTE W+GWF FG AVP RP EDL F+VARF Q GG+F NYYMYHGGTNF R
Sbjct: 254 NQNYKPKMWTEAWTGWFTEFGSAVPTRPAEDLVFSVARFIQSGGSFINYYMYHGGTNFGR 313
Query: 302 TSGGPFISTSYDYDAPLDEYGLIRQPKWGHLKDLHKAIKLCEAALVATDPTYPSLGPNLE 361
TSGG F++TSYDYDAP+DEYGL+ +PKWGHL+ LHKAIKLCE ALV+ DPT SLG N E
Sbjct: 314 TSGG-FVATSYDYDAPIDEYGLLNEPKWGHLRGLHKAIKLCEPALVSVDPTVKSLGENQE 372
Query: 362 ATVYKTGSGLCSAFLANIGTNSDVTVKFNGNSYLLPAWSVSILPDCKNVVFNTAKI---- 417
A V+ + SG C+AFLAN T V F Y LP WS+S+LPDCK VFNTA++
Sbjct: 373 AHVFNSISGKCAAFLANYDTTFSAKVSFGNAQYDLPPWSISVLPDCKTAVFNTARVGVQS 432
Query: 418 NSVTLVPSFSRQSLQVAADSSDAIGSGWSYINEPVGISKDDAFTKPGLLEQINTTADQSD 477
+ VP + S Q SYI E + D+ FTK GL EQ+ TAD SD
Sbjct: 433 SQKKFVPVINAFSWQ-------------SYIEETASSTDDNTFTKDGLWEQVYLTADASD 479
Query: 478 YLWYSLSTNIKADEPLLEDGSKTVLHVQSLGHALHAFINGKLVGSGYGSSSNAKVTVDFP 537
YLWY NI ++E L++G +L + S GHAL FING+L G+ YGS N K+T
Sbjct: 480 YLWYMTDVNIGSNEGFLKNGQDPLLTIWSAGHALQVFINGQLSGTVYGSLENPKLTFSKN 539
Query: 538 IALAPGKNTFDLLSLTVGLQNYGAFYEKTGAGITGPVQLKGSGNGTNIDLSSQQWTYQTG 597
+ L G N LLS +VGL N G +EK AG+ GPV LKG GT D+S Q+WTY+ G
Sbjct: 540 VKLRAGVNKISLLSTSVGLPNVGTHFEKWNAGVLGPVTLKGLNEGTR-DISKQKWTYKIG 598
Query: 598 LKGEELNFPS---GSSTQWDSKSTLPKLQPLVWYKTTFDAPAGSEPVAIDFTGMGKGEAW 654
LKGE L+ + SS +W ++L + QP+ WYKTTF+ P G++P+A+D MGKG W
Sbjct: 599 LKGEALSLHTVSGSSSVEWAQGASLAQKQPMTWYKTTFNVPPGNDPLALDMGAMGKGMVW 658
Query: 655 VNGQSIGRYWPTYVSQNGGCTDSCNYRGAYSSNKCLKNCGKPSQSLYHVPRSWLKSSGNT 714
+NGQSIGR+WP Y+ NG C CNY G Y+ KC CGKPSQ YHVPRS LK SGN
Sbjct: 659 INGQSIGRHWPGYIG-NGNC-GGCNYAGTYTEKKCRTYCGKPSQRWYHVPRSRLKPSGNL 716
Query: 715 LVLFEEIGGDPTKISFVTK 733
LV+FEE GG+P IS + +
Sbjct: 717 LVVFEEWGGEPHWISLLKR 735
>gi|186461094|gb|ACC78255.1| beta-galactosidase [Carica papaya]
Length = 721
Score = 879 bits (2272), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 432/737 (58%), Positives = 523/737 (70%), Gaps = 23/737 (3%)
Query: 1 MASKEILLLVLC-WGFVVLATTSFGANVTYDHRAVVIGGKRRVLISGSIHYPRSTPEMWP 59
+ + +L L+ C W + V AT V+YDH+A++I G+RR+LISGSIHYPRSTP+MWP
Sbjct: 2 LKTNLVLFLLFCSWLWSVEAT------VSYDHKAIIINGRRRILISGSIHYPRSTPQMWP 55
Query: 60 DLIQKSKDGGLDVIETYVFWNLHEPVRNQYNFEGRYDLVKFVKLVAEAGLYAHLRIGPYV 119
DLIQ +K+GGLDVI+TYVFWN HEP Y FE RYDLVKF+KLV +AGLY HLRIGPY+
Sbjct: 56 DLIQNAKEGGLDVIQTYVFWNGHEPSPGNYYFEDRYDLVKFIKLVHQAGLYVHLRIGPYI 115
Query: 120 CAEWNFGGFPLWLHFIPGIQFRTDNEPFKAEMQRFTAKIVDMMKQEKLYASQGGPIILSQ 179
C EWNFGGFP+WL ++PGIQFRTDN PFKA+MQ+FT KIV+MMK EKL+ QGGPII+SQ
Sbjct: 116 CGEWNFGGFPVWLKYVPGIQFRTDNGPFKAQMQKFTEKIVNMMKAEKLFEPQGGPIIMSQ 175
Query: 180 IENEYGNIDSAYGAAGKSYIKWAAGMALSLDTGVPWVMCQQSDAPDPIINTCNGFYCDQF 239
IENEYG I+ GA GK+Y KWAA MA+ L TGVPW+MC+Q DAPDPII+TCNGFYC+ F
Sbjct: 176 IENEYGPIEWEIGAPGKAYTKWAAQMAVGLGTGVPWIMCKQEDAPDPIIDTCNGFYCENF 235
Query: 240 TPNSNNKPKMWTENWSGWFLSFGGAVPYRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNF 299
PN+N KPKM+TE W+GW+ FGG VPYRP ED+A++VARF Q G+F NYYMYHGGTNF
Sbjct: 236 MPNANYKPKMFTEAWTGWYTEFGGPVPYRPAEDMAYSVARFIQNRGSFINYYMYHGGTNF 295
Query: 300 DRTSGGPFISTSYDYDAPLDEYGLIRQPKWGHLKDLHKAIKLCEAALVATDPTYPSLGPN 359
RT+GGPFI+TSYDYDAPLDEYGL R+PKWGHL+DLHK IKLCE +LV+ DP SLG N
Sbjct: 296 GRTAGGPFIATSYDYDAPLDEYGLRREPKWGHLRDLHKTIKLCEPSLVSVDPKVTSLGSN 355
Query: 360 LEATVYKTGSGLCSAFLANIGTNSDVTVKFNGNSYLLPAWSVSILPDCKNVVFNTAKINS 419
EA V+ T + C+AFLAN V V F Y LP WSVSILPDCK VVFNTAK+
Sbjct: 356 QEAHVFWTKTS-CAAFLANYDLKYSVRVTFQNLPYDLPPWSVSILPDCKTVVFNTAKV-- 412
Query: 420 VTLVPSFSRQSLQVAADSSDAIGSGWSYINEPVGISKDDAFTKPGLLEQINTTADQSDYL 479
S+ SL + A S SY E + D FTK GL EQI+ T D +DYL
Sbjct: 413 ------VSQGSLAKMIAVNSAF-SWQSYNEETPSANYDAVFTKDGLWEQISVTRDATDYL 465
Query: 480 WYSLSTNIKADEPLLEDGSKTVLHVQSLGHALHAFINGKLVGSGYGSSSNAKVTVDFPIA 539
WY I DE L++G +L V S GHALH F+NG+L G+ YG N K+ +
Sbjct: 466 WYMTDVTIGPDEAFLKNGQDPILTVMSAGHALHVFVNGQLSGTVYGQLENPKLAFSGKVK 525
Query: 540 LAPGKNTFDLLSLTVGLQNYGAFYEKTGAGITGPVQLKGSGNGTNIDLSSQQWTYQTGLK 599
L G N LLS+ VGL N G +E AG+ GPV LKG +GT D+S +W+Y+ GLK
Sbjct: 526 LRAGVNKVSLLSIAVGLPNVGLHFETWNAGVLGPVTLKGVNSGT-WDMSKWKWSYKIGLK 584
Query: 600 GEELNFPS---GSSTQWDSKSTLPKLQPLVWYKTTFDAPAGSEPVAIDFTGMGKGEAWVN 656
GE L+ + SS +W S L + QPL+WYKTTF+AP G++P+A+D MGKG+ W+N
Sbjct: 585 GEALSLHTVSGSSSVEWVEGSLLAQRQPLIWYKTTFNAPVGNDPLALDMNSMGKGQIWIN 644
Query: 657 GQSIGRYWPTYVSQNGGCTDSCNYRGAYSSNKCLKNCGKPSQSLYHVPRSWLKSSGNTLV 716
GQSIGR+WP Y ++ G +CNY G Y KC NCGK SQ YHVPRSWL + N LV
Sbjct: 645 GQSIGRHWPGYKAR--GSCGACNYAGIYDEKKCHSNCGKASQRWYHVPRSWLNPTANLLV 702
Query: 717 LFEEIGGDPTKISFVTK 733
+FEE GGDPTKIS V +
Sbjct: 703 VFEEWGGDPTKISLVKR 719
>gi|293332101|ref|NP_001168664.1| uncharacterized protein LOC100382452 [Zea mays]
gi|223950023|gb|ACN29095.1| unknown [Zea mays]
Length = 815
Score = 878 bits (2269), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 440/801 (54%), Positives = 550/801 (68%), Gaps = 26/801 (3%)
Query: 57 MWPDLIQKSKDGGLDVIETYVFWNLHEPVRNQYNFEGRYDLVKFVKLVAEAGLYAHLRIG 116
MW LIQK+KDGGLDVI+TYVFWN HEP Y FE RYDLV+FVK V +AGL+ HLRIG
Sbjct: 29 MWEGLIQKAKDGGLDVIQTYVFWNGHEPTPGNYYFEERYDLVRFVKTVQKAGLFVHLRIG 88
Query: 117 PYVCAEWNFGGFPLWLHFIPGIQFRTDNEPFKAEMQRFTAKIVDMMKQEKLYASQGGPII 176
PY+C EWNFGGFP+WL ++PGI FRTDNEPFK MQ FT KIV MMK E L+ASQGGPII
Sbjct: 89 PYICGEWNFGGFPVWLKYVPGISFRTDNEPFKTAMQGFTEKIVGMMKSENLFASQGGPII 148
Query: 177 LSQIENEYGNIDSAYGAAGKSYIKWAAGMALSLDTGVPWVMCQQSDAPDPIINTCNGFYC 236
LSQIENEYG +GAAG++YI WAA MA+ LDTGVPWVMC++ DAPDP+IN CNGFYC
Sbjct: 149 LSQIENEYGPEGKEFGAAGQAYINWAAKMAVGLDTGVPWVMCKEEDAPDPVINACNGFYC 208
Query: 237 DQFTPNSNNKPKMWTENWSGWFLSFGGAVPYRPVEDLAFAVARFFQRGGTFQNYYMYHGG 296
D F+PN KP MWTE WSGWF FGG + RPVEDLAFAVARF Q+GG+F NYYMYHGG
Sbjct: 209 DAFSPNKPYKPTMWTEAWSGWFTEFGGTIRQRPVEDLAFAVARFVQKGGSFINYYMYHGG 268
Query: 297 TNFDRTSGGPFISTSYDYDAPLDEYGLIRQPKWGHLKDLHKAIKLCEAALVATDPTYPSL 356
TNF RT+GGPFI+TSYDYDAP+DEYGLIR+PK HLK+LH+A+KLCE ALV+ DPT +L
Sbjct: 269 TNFGRTAGGPFITTSYDYDAPIDEYGLIREPKHSHLKELHRAVKLCEQALVSVDPTITTL 328
Query: 357 GPNLEATVYKTGSGLCSAFLANIGTNSDVTVKFNGNSYLLPAWSVSILPDCKNVVFNTAK 416
G EA V+++ SG C+AFLAN +NS V FN Y LP WS+SILPDCKNVVFN+A
Sbjct: 329 GTMQEAHVFRSPSG-CAAFLANYNSNSHAKVVFNNEQYSLPPWSISILPDCKNVVFNSAT 387
Query: 417 INSVTLVPSFSRQSLQVAADSSDAIGSGWSYINEPV-GISKDDAFTKPGLLEQINTTADQ 475
+ T +Q+ D + ++ W +E V ++ T GLLEQ+N T D
Sbjct: 388 VGVQT-------SQMQMWGDGATSM--MWERYDEEVDSLAAAPLLTTTGLLEQLNVTRDS 438
Query: 476 SDYLWYSLSTNIKADEPLLEDGSK-TVLHVQSLGHALHAFINGKLVGSGYGSSSNAKVTV 534
SDYLWY S +I E L+ G K L VQS GHALH F+NG+L GS YG+ + ++
Sbjct: 439 SDYLWYITSVDISPSENFLQGGGKPPSLSVQSAGHALHVFVNGQLQGSSYGTREDRRIKY 498
Query: 535 DFPIALAPGKNTFDLLSLTVGLQNYGAFYEKTGAGITGPVQLKGSGNGTNIDLSSQQWTY 594
+ + L G N LLS+ GL N G YE G+ GPV L G G+ DL+ Q W+Y
Sbjct: 499 NGNVNLRAGTNKIALLSVACGLPNVGVHYETWNTGVGGPVVLHGLNEGSR-DLTWQTWSY 557
Query: 595 QTGLKGEELNFPS---GSSTQWDSKSTLP-KLQPLVWYKTTFDAPAGSEPVAIDFTGMGK 650
Q GLKGE++N S S +W S + K QPL WYK F+ P+G EP+A+D MGK
Sbjct: 558 QVGLKGEQMNLNSVEGSGSVEWMQGSLIAQKQQPLAWYKAYFETPSGDEPLALDMGSMGK 617
Query: 651 GEAWVNGQSIGRYWPTYVSQNGGCTDSCNYRGAYSSNKCLKNCGKPSQSLYHVPRSWLKS 710
G+ W+NGQSIGRYW Y +G C C+Y G + + KC CG+P+Q YHVPRSWL+
Sbjct: 618 GQVWINGQSIGRYWTAYA--DGDC-KGCSYTGTFRAPKCQAGCGQPTQRWYHVPRSWLQP 674
Query: 711 SGNTLVLFEEI-GGDPTKISFVTKQLGSSLCSHVTDSHPLPVDMWGSDSKIQRKPGPV-L 768
S N LV+ EE+ GGD +KI+ + + SS+C+ V++ HP + W +S +R+ +
Sbjct: 675 SRNLLVVLEELGGGDSSKIALAKRSV-SSVCADVSEDHP-NIKKWQIESYGEREHRRAKV 732
Query: 769 SLECPNPNQVISSIKFASFGTPLGTCGSFSRGRCSSARSLSVVRQACVGSKSCSIGVSVN 828
L C + Q IS+I+FASFGTP+GTCG+F +G C SA S +V+ + C+G + C + +S +
Sbjct: 733 HLRCAH-GQSISAIRFASFGTPVGTCGNFQQGGCHSASSHAVLEKRCIGLQRCVVAISPD 791
Query: 829 TF-GDPCKGVMKSLAVEASCT 848
F GDPC V K +AVEA C+
Sbjct: 792 NFGGDPCPSVTKRVAVEAVCS 812
>gi|318136780|gb|ADV41669.1| beta-D-galactosidase [Actinidia deliciosa var. deliciosa]
Length = 728
Score = 878 bits (2268), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 433/733 (59%), Positives = 521/733 (71%), Gaps = 16/733 (2%)
Query: 5 EILLLVLCWG-FVVLATTSFGANVTYDHRAVVIGGKRRVLISGSIHYPRSTPEMWPDLIQ 63
I +L +C G F +L S A+VTYD +A+ I G+RR+L SGSIHYPRSTPEMWP LIQ
Sbjct: 6 RIKVLFVCVGLFFLLCCCSVTASVTYDGKAIKINGQRRILFSGSIHYPRSTPEMWPGLIQ 65
Query: 64 KSKDGGLDVIETYVFWNLHEPVRNQYNFEGRYDLVKFVKLVAEAGLYAHLRIGPYVCAEW 123
K+K+GGLDVI+TYVFWN HEP QY FEGRYDLV+F+KL +AGLY HLRIG YVCAEW
Sbjct: 66 KAKEGGLDVIQTYVFWNGHEPSPGQYYFEGRYDLVRFIKLAQQAGLYVHLRIGLYVCAEW 125
Query: 124 NFGGFPLWLHFIPGIQFRTDNEPFKAEMQRFTAKIVDMMKQEKLYASQGGPIILSQIENE 183
NFGGFP+WL ++PGI FRTDN PFKA MQ+FT KIV++MK EKL+ SQGGPII+SQIENE
Sbjct: 126 NFGGFPVWLKYVPGIAFRTDNGPFKAAMQKFTEKIVNLMKSEKLFESQGGPIIMSQIENE 185
Query: 184 YGNIDSAYGAAGKSYIKWAAGMALSLDTGVPWVMCQQSDAPDPIINTCNGFYCDQFTPNS 243
YG ++ GA GK+Y KWAA MA+ LDTGVPW+MC+Q DAPDPII+TCNGFYC+ FTPN
Sbjct: 186 YGPVEWEIGAPGKAYTKWAAEMAVGLDTGVPWIMCKQEDAPDPIIDTCNGFYCEGFTPNK 245
Query: 244 NNKPKMWTENWSGWFLSFGGAVPYRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNFDRTS 303
N KPKMWTE W+GW+ FGG + RPVEDLA++VARF Q G+F NYYMYHGGTNF RT+
Sbjct: 246 NYKPKMWTEAWTGWYTEFGGPIHNRPVEDLAYSVARFIQNNGSFVNYYMYHGGTNFGRTA 305
Query: 304 GGPFISTSYDYDAPLDEYGLIRQPKWGHLKDLHKAIKLCEAALVATDPTYPSLGPNLEAT 363
G F++TSYDYDAP+DEYGL R+PKWGHL+DLHKAIKLCE +LV+ PT G NLE
Sbjct: 306 AGLFVATSYDYDAPIDEYGLPREPKWGHLRDLHKAIKLCEPSLVSAYPTVTWPGKNLEVH 365
Query: 364 VYKTGSGLCSAFLANIGTNSDVTVKFNGNSYLLPAWSVSILPDCKNVVFNTAKINSVTLV 423
V+K+ S C+AFLAN +S V F Y LP WS+SILPDCKN VFNTA+++
Sbjct: 366 VFKSKSS-CAAFLANYDPSSPAKVTFQNMQYDLPPWSISILPDCKNAVFNTARVS----- 419
Query: 424 PSFSRQSLQVAADSSDAIGSGWSYINEPVGISKDDAFTKPGLLEQINTTADQSDYLWYSL 483
S+ S S S SYI E V D K GL EQI+ T D SDYLWY
Sbjct: 420 ---SKSSQMKMTPVSGGAFSWQSYIEETVSADDSDTIAKNGLWEQISITRDGSDYLWYLT 476
Query: 484 STNIKADEPLLEDGSKTVLHVQSLGHALHAFINGKLVGSGYGSSSNAKVTVDFPIALAPG 543
NI +E L++G VL V S GHALH FING+L G+ YGS N K+T + L G
Sbjct: 477 DVNIHPNEGFLKNGQSPVLTVMSAGHALHVFINGQLAGTVYGSLENPKLTFSNNVKLRAG 536
Query: 544 KNTFDLLSLTVGLQNYGAFYEKTGAGITGPVQLKGSGNGTNIDLSSQQWTYQTGLKGEEL 603
N LLS VGL N G +E G+ GPV LKG GT DL+ Q+W+Y+ GLKGE+L
Sbjct: 537 INKISLLSAAVGLPNVGLHFETWNTGVLGPVTLKGLNEGTR-DLTKQKWSYKVGLKGEDL 595
Query: 604 NFPS---GSSTQWDSKSTLPKLQPLVWYKTTFDAPAGSEPVAIDFTGMGKGEAWVNGQSI 660
+ + SS +W S L + QPL WYK TF+AP G++P+A+D MGKG+ W+NG+SI
Sbjct: 596 SLHTLSGSSSVEWVQGSLLAQKQPLTWYKATFNAPEGNDPLALDMNTMGKGQIWINGESI 655
Query: 661 GRYWPTYVSQNGGCTDSCNYRGAYSSNKCLKNCGKPSQSLYHVPRSWLKSSGNTLVLFEE 720
GR+WP Y +G C C+Y G Y+ KCL NCG+ SQ YHVPRSWLK SGN LV+FEE
Sbjct: 656 GRHWPEY-KASGNC-GGCSYAGIYTEKKCLSNCGEASQRWYHVPRSWLKPSGNFLVVFEE 713
Query: 721 IGGDPTKISFVTK 733
+GGDPT ISFV +
Sbjct: 714 LGGDPTGISFVRR 726
>gi|3869280|gb|AAC77377.1| beta-galactosidase precursor [Carica papaya]
Length = 721
Score = 877 bits (2266), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 431/737 (58%), Positives = 522/737 (70%), Gaps = 23/737 (3%)
Query: 1 MASKEILLLVLC-WGFVVLATTSFGANVTYDHRAVVIGGKRRVLISGSIHYPRSTPEMWP 59
+ + +L L+ C W + V AT V+YDH+A++I G+RR+LISGSIHYPRSTP+MWP
Sbjct: 2 LKTNLVLFLLFCSWLWSVEAT------VSYDHKAIIINGRRRILISGSIHYPRSTPQMWP 55
Query: 60 DLIQKSKDGGLDVIETYVFWNLHEPVRNQYNFEGRYDLVKFVKLVAEAGLYAHLRIGPYV 119
DLIQ +K+GGLDVI+TYVFWN HEP Y FE RYDLVKF+KLV +AGLY HLRI PY+
Sbjct: 56 DLIQNAKEGGLDVIQTYVFWNGHEPSPGNYYFEDRYDLVKFIKLVHQAGLYVHLRISPYI 115
Query: 120 CAEWNFGGFPLWLHFIPGIQFRTDNEPFKAEMQRFTAKIVDMMKQEKLYASQGGPIILSQ 179
C EWNFGGFP+WL ++PGIQFRTDN PFKA+MQ+FT KIV+MMK EKL+ QGGPII+SQ
Sbjct: 116 CGEWNFGGFPVWLKYVPGIQFRTDNGPFKAQMQKFTEKIVNMMKAEKLFEPQGGPIIMSQ 175
Query: 180 IENEYGNIDSAYGAAGKSYIKWAAGMALSLDTGVPWVMCQQSDAPDPIINTCNGFYCDQF 239
IENEYG I+ GA GK+Y KWAA MA+ L TGVPW+MC+Q DAPDPII+TCNGFYC+ F
Sbjct: 176 IENEYGPIEWEIGAPGKAYTKWAAQMAVGLGTGVPWIMCKQEDAPDPIIDTCNGFYCENF 235
Query: 240 TPNSNNKPKMWTENWSGWFLSFGGAVPYRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNF 299
PN+N KPKM+TE W+GW+ FGG VPYRP ED+A++VARF Q G+F NYYMYHGGTNF
Sbjct: 236 MPNANYKPKMFTEAWTGWYTEFGGPVPYRPAEDMAYSVARFIQNRGSFINYYMYHGGTNF 295
Query: 300 DRTSGGPFISTSYDYDAPLDEYGLIRQPKWGHLKDLHKAIKLCEAALVATDPTYPSLGPN 359
RT+GGPFI+TSYDYDAPLDEYGL R+PKWGHL+DLHK IKLCE +LV+ DP SLG N
Sbjct: 296 GRTAGGPFIATSYDYDAPLDEYGLRREPKWGHLRDLHKTIKLCEPSLVSVDPKVTSLGSN 355
Query: 360 LEATVYKTGSGLCSAFLANIGTNSDVTVKFNGNSYLLPAWSVSILPDCKNVVFNTAKINS 419
EA V+ T + C+AFLAN V V F Y LP WSVSILPDCK VVFNTAK+
Sbjct: 356 QEAHVFWTKTS-CAAFLANYDLKYSVRVTFQNLPYDLPPWSVSILPDCKTVVFNTAKV-- 412
Query: 420 VTLVPSFSRQSLQVAADSSDAIGSGWSYINEPVGISKDDAFTKPGLLEQINTTADQSDYL 479
S+ SL + A S SY E + D FTK GL EQI+ T D +DYL
Sbjct: 413 ------VSQGSLAKMIAVNSAF-SWQSYNEETPSANYDAVFTKDGLWEQISVTRDATDYL 465
Query: 480 WYSLSTNIKADEPLLEDGSKTVLHVQSLGHALHAFINGKLVGSGYGSSSNAKVTVDFPIA 539
WY I DE L++G +L V S GHALH F+NG+L G+ YG N K+ +
Sbjct: 466 WYMTDVTIGPDEAFLKNGQDPILTVMSAGHALHVFVNGQLSGTVYGQLENPKLAFSGKVK 525
Query: 540 LAPGKNTFDLLSLTVGLQNYGAFYEKTGAGITGPVQLKGSGNGTNIDLSSQQWTYQTGLK 599
L G N LLS+ VGL N G +E AG+ GPV LKG +GT D+S +W+Y+ GLK
Sbjct: 526 LRAGVNKVSLLSIAVGLPNVGLHFETWNAGVLGPVTLKGVNSGT-WDMSKWKWSYKIGLK 584
Query: 600 GEELNFPS---GSSTQWDSKSTLPKLQPLVWYKTTFDAPAGSEPVAIDFTGMGKGEAWVN 656
GE L+ + SS +W S L + QPL+WYKTTF+AP G++P+A+D MGKG+ W+N
Sbjct: 585 GEALSLHTVSGSSSVEWVEGSLLAQRQPLIWYKTTFNAPVGNDPLALDMNSMGKGQIWIN 644
Query: 657 GQSIGRYWPTYVSQNGGCTDSCNYRGAYSSNKCLKNCGKPSQSLYHVPRSWLKSSGNTLV 716
GQSIGR+WP Y ++ G +CNY G Y KC NCGK SQ YHVPRSWL + N LV
Sbjct: 645 GQSIGRHWPGYKAR--GSCGACNYAGIYDEKKCHSNCGKASQRWYHVPRSWLNPTANLLV 702
Query: 717 LFEEIGGDPTKISFVTK 733
+FEE GGDPTKIS V +
Sbjct: 703 VFEEWGGDPTKISLVKR 719
>gi|168008096|ref|XP_001756743.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162691981|gb|EDQ78340.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 836
Score = 877 bits (2265), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 446/840 (53%), Positives = 576/840 (68%), Gaps = 39/840 (4%)
Query: 22 SFGANVTYDHRAVVIGGKRRVLISGSIHYPRSTPEMWPDLIQKSKDGGLDVIETYVFWNL 81
S V+YDHRA+ + G+RR+L+SGSIHYPRSTP MWP LI K+K+GGLDVI+TYVFWN
Sbjct: 23 SVAVTVSYDHRALKLDGQRRMLVSGSIHYPRSTPLMWPGLIAKAKEGGLDVIQTYVFWNG 82
Query: 82 HEPVRNQYNFEGRYDLVKFVKLVAEAGLYAHLRIGPYVCAEWNFGGFPLWLHFIPGIQFR 141
HEP R YN+ GRY+L KF++LV EAG+Y +LRIGPYVCAEWN GGFP WL FIPGI+FR
Sbjct: 83 HEPTRGVYNYAGRYNLPKFIRLVYEAGMYVNLRIGPYVCAEWNSGGFPAWLRFIPGIEFR 142
Query: 142 TDNEPFKAEMQRFTAKIVDMMKQEKLYASQGGPIILSQIENEYGNIDSAYGAAGKSYIKW 201
TDNEPFK E QRF +V +K+EKL+A QGGPII++QIENEYGNID++YG AG+ Y+ W
Sbjct: 143 TDNEPFKNETQRFVNHLVRKLKREKLFAWQGGPIIMAQIENEYGNIDASYGEAGQRYLNW 202
Query: 202 AAGMALSLDTGVPWVMCQQSDAPDPIINTCNGFYCDQFTPNSNNKPKMWTENWSGWFLSF 261
A MA++ +T VPW+MCQQ +AP +INTCNGFYCD + PNS +KP WTENW+GWF S+
Sbjct: 203 IANMAVATNTSVPWIMCQQPEAPQLVINTCNGFYCDGWRPNSEDKPAFWTENWTGWFQSW 262
Query: 262 GGAVPYRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNFDRTSGGPFISTSYDYDAPLDEY 321
GG P RPV+D+AF+VARFF++GG+F NYYMYHGGTNF+RT G ++TSYDYDAP+DEY
Sbjct: 263 GGGAPTRPVQDIAFSVARFFEKGGSFMNYYMYHGGTNFERT-GVESVTTSYDYDAPIDEY 321
Query: 322 GLIRQPKWGHLKDLHKAIKLCEAALVATD--PTYPSLGPNLEATVYKTGSGLCSAFLANI 379
+RQPKWGHLKDLH A+KLCE ALV D PT SLGPN EA VY++ SG C+AFLA+
Sbjct: 322 D-VRQPKWGHLKDLHAALKLCEPALVEVDTVPTGISLGPNQEAHVYQSSSGTCAAFLASW 380
Query: 380 GTNSDVTVKFNGNSYLLPAWSVSILPDCKNVVFNTAKINSVTLVPSFSRQSLQVAADSSD 439
TN D V F G Y LPAWSVSILPDCK+VVFNTAK+ + +++ ++Q A ++
Sbjct: 381 DTN-DSLVTFQGQPYDLPAWSVSILPDCKSVVFNTAKVGAQSVI-----MTMQGAVPVTN 434
Query: 440 AIGSGWSYINEPVGISKDDAFTKPGLLEQINTTADQSDYLWYSLSTNIKADEPLLED-GS 498
W +EP+G F+ GLLEQI TT D +DYLWY TN++ E + + +
Sbjct: 435 -----WVSYHEPLG-PWGSVFSTNGLLEQIATTKDTTDYLWY--MTNVQVAESDVRNISA 486
Query: 499 KTVLHVQSLGHALHAFINGKLVGSGYGSSSNAKVTVDFPIALAPGKNTFDLLSLTVGLQN 558
+ L + SL A H F+NG G+ + +A+ PI+L PG N +LS+T+GLQ
Sbjct: 487 QATLVMSSLRDAAHTFVNGFYTGTSHQQFMHARQ----PISLRPGSNNITVLSMTMGLQG 542
Query: 559 YGAFYEKTGAGITGPVQLKGSGNGTNIDLSSQQWTYQTGLKGE--ELNFPSGSST-QWDS 615
YG F E AGI V+++ +GT I+L WTYQ GL+GE +L +GS T +W++
Sbjct: 543 YGPFLENEKAGIQYGVRIEDLPSGT-IELGGSTWTYQVGLQGESKQLFEVNGSLTAEWNT 601
Query: 616 KSTLPKLQPLVWYKTTFDAPAGSEPVAIDFTGMGKGEAWVNGQSIGRYWPTYVSQNGGCT 675
S + L W KT FD PAG+ +A+D + MGKG WVNG ++GRYW ++ +Q GC
Sbjct: 602 ISEVSDQNFLFWIKTRFDMPAGNGSIALDLSSMGKGVVWVNGVNLGRYWSSFTAQRDGCD 661
Query: 676 DSCNYRGAYSSNKCLKNCGKPSQSLYHVPRSWLKSSGNTLVLFEEIGGDPTKISFVTKQL 735
SC+YRG+Y+ +KCL C +PSQ+ YH+PR WL N +VLFEE GG+P IS T+ +
Sbjct: 662 ASCDYRGSYTQSKCLTKCNQPSQNWYHIPRQWLLPKNNFIVLFEEKGGNPKDISIATR-M 720
Query: 736 GSSLCSHVTDSHPLPVDM--WGSD----SKIQRKPGPVLSLECPNPNQVISSIKFASFGT 789
+CSH++ SHP P + W S + R P L+LEC Q IS I FAS+GT
Sbjct: 721 PQQICSHISQSHPFPFSLTSWTKRDNLTSTLLRAP---LTLECAEGQQ-ISRICFASYGT 776
Query: 790 PLGTCGSFSRGRCSSARSLSVVRQACVGSKSCSIGVSVNTFG-DPCKGVMKSLAVEASCT 848
P G C F C + S V+ +ACVG + CS+ + + FG DPC G+ KSLA A C+
Sbjct: 777 PSGDCEGFVLSSCHANTSYDVLTKACVGRQKCSVPIVSSIFGDDPCPGLSKSLAATAECS 836
>gi|356564721|ref|XP_003550597.1| PREDICTED: beta-galactosidase 7-like [Glycine max]
Length = 831
Score = 874 bits (2259), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 435/836 (52%), Positives = 568/836 (67%), Gaps = 39/836 (4%)
Query: 22 SFGANVTYDHRAVVIGGKRRVLISGSIHYPRSTPEMWPDLIQKSKDGGLDVIETYVFWNL 81
+ NV++D RA+ I GKRRVLISGSIHYPRSTPEMWP+LIQK+K+GGLD IETYVFWN
Sbjct: 25 ALHTNVSHDGRAIKIDGKRRVLISGSIHYPRSTPEMWPELIQKAKEGGLDAIETYVFWNA 84
Query: 82 HEPVRNQYNFEGRYDLVKFVKLVAEAGLYAHLRIGPYVCAEWNFGGFPLWLHFIPGIQFR 141
HEP R Y+F G D+++F+K + E+GLY LRIGPYVCAEWN+GG P+W+H +P ++ R
Sbjct: 85 HEPSRRVYDFSGNNDIIRFLKTIQESGLYGVLRIGPYVCAEWNYGGIPVWVHNLPDVEIR 144
Query: 142 TDNEPFKAEMQRFTAKIVDMMKQEKLYASQGGPIILSQIENEYGNIDSAYGAAGKSYIKW 201
T N F EMQ FT IVDM+K+EKL+ASQGGPIIL+QIENEYGN+ S YG AGK+Y+ W
Sbjct: 145 TANSVFMNEMQNFTTLIVDMLKKEKLFASQGGPIILTQIENEYGNVISQYGDAGKAYMNW 204
Query: 202 AAGMALSLDTGVPWVMCQQSDAPDPIINTCNGFYCDQFTPNSNNKPKMWTENWSGWFLSF 261
A MA SL GVPW+MCQ+SDAP P+INTCNG+YCD F PNS N PKMWTENW GWF ++
Sbjct: 205 CANMAESLKVGVPWIMCQESDAPQPMINTCNGWYCDNFEPNSFNSPKMWTENWIGWFKNW 264
Query: 262 GGAVPYRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNFDRTSGGPFISTSYDYDAPLDEY 321
GG P+R ED+AFAVARFFQ GGTFQNYYMYHGGTNF RT+GGP+I+TSYDYDAPLDEY
Sbjct: 265 GGRDPHRTAEDVAFAVARFFQTGGTFQNYYMYHGGTNFGRTAGGPYITTSYDYDAPLDEY 324
Query: 322 GLIRQPKWGHLKDLHKAIKLCEAALVATDPTYPSLGPNLEATVYKTGSGLCSAFLANIGT 381
G I QPKWGHLK+LH A+K E AL + + + LG +++ T+Y T +G S FL+N T
Sbjct: 325 GNIAQPKWGHLKELHSALKAMEEALTSGNVSETDLGNSVKVTIYAT-NGSSSCFLSNTNT 383
Query: 382 NSDVTVKFNGNSYLLPAWSVSILPDCKNVVFNTAKINSVTLVPSFSRQSLQVAADSSDAI 441
+D T+ F GN+Y +PAWSVSILPDC++ +NTAK+ T V + + A +
Sbjct: 384 TADATLTFRGNNYTVPAWSVSILPDCQHEEYNTAKVKEQTSVMTKENSKAEKEAAILKWV 443
Query: 442 GSGWSYINEPVGISKDDAFTKPGLLEQINTTADQSDYLWYSLSTNIKADEPLLEDGSKTV 501
W N + + LL+Q + D SDYLWY ++K D+P+ +
Sbjct: 444 ---WRSENIDKALHGKSNVSAHRLLDQKDAANDASDYLWYMTKLHVKHDDPVWSE--NMT 498
Query: 502 LHVQSLGHALHAFINGKLVGSGYGSSSNAKVTVDFPIALAPGKNTFDLLSLTVGLQNYGA 561
L + GH +HAF+NG+ + S + + + I L G NT LLS+TVGLQNYGA
Sbjct: 499 LRINGSGHVIHAFVNGEYIDSHWATYGIHNDKFEPKIKLKHGTNTISLLSVTVGLQNYGA 558
Query: 562 FYEKTGAGITGPVQLKG-SGNGTNI-DLSSQQWTYQTGLKGEELNF-----PSGSSTQWD 614
F++ AG+ GP++L G T I +LSS +W+Y+ GL G + P + ++W+
Sbjct: 559 FFDTWHAGLVGPIELVSVKGEETIIKNLSSHKWSYKIGLHGWDHKLFSDDSPFAAQSKWE 618
Query: 615 SKSTLPKLQPLVWYKTTFDAPAGSEPVAIDFTGMGKGEAWVNGQSIGRYWPTYVSQNGGC 674
S+ LP + L WYKTTF AP G++PV +D GMGKG AWVNG++IGR WP+Y ++ GC
Sbjct: 619 SEK-LPTNRMLTWYKTTFKAPLGTDPVVVDLQGMGKGYAWVNGKNIGRIWPSYNAEEDGC 677
Query: 675 TDS-CNYRGAYSSNKCLKNCGKPSQSLYHVPRSWLKSSGNTLVLFEEIGGDPTKISFVTK 733
+D C+YRG YS +KC+ NCGKP+Q YHVPRS+LK NTLVLF E+GG+P+ ++F T
Sbjct: 678 SDEPCDYRGEYSDSKCVTNCGKPTQRWYHVPRSYLKDGANTLVLFAELGGNPSLVNFQTV 737
Query: 734 QLGSSLCSHVTDSHPLPVDMWGSDSKIQRKPGPVLSLECPNPNQVISSIKFASFGTPLGT 793
+G ++C++ ++ L + G RK IS+IKFASFG P G
Sbjct: 738 VVG-NVCANAYENKTLELSCQG------RK---------------ISAIKFASFGDPKGV 775
Query: 794 CGSFSRGRCSS-ARSLSVVRQACVGSKSCSIGVSVNTFG-DPCKGVMKSLAVEASC 847
CG+F+ G C S + +L +V++ACVG ++CSI +S TFG C + K LAVEA C
Sbjct: 776 CGAFTNGSCESKSNALPIVQKACVGKEACSIDLSEKTFGATACGNLAKRLAVEAVC 831
>gi|448278449|gb|AGE44111.1| beta-galactosidase 101 [Malus x domestica]
Length = 725
Score = 874 bits (2257), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 428/735 (58%), Positives = 520/735 (70%), Gaps = 30/735 (4%)
Query: 6 ILLLVLCWGFVVLATTSFGANVTYDHRAVVIGGKRRVLISGSIHYPRSTPEMWPDLIQKS 65
ILLL+ C + + S A+V YDH+A++I G+RR+LISGSIHYPRSTPEMWPDLIQK+
Sbjct: 11 ILLLLSC----IFSAAS--ASVGYDHKAIIINGQRRILISGSIHYPRSTPEMWPDLIQKA 64
Query: 66 KDGGLDVIETYVFWNLHEPVRNQYNFEGRYDLVKFVKLVAEAGLYAHLRIGPYVCAEWNF 125
K GGLDVI+TYVFWN HEP +Y FE RYDLVKF+KLV +AGL+ +LRIGPYVCAEWNF
Sbjct: 65 KAGGLDVIQTYVFWNGHEPSPGKYYFEDRYDLVKFIKLVQQAGLFVNLRIGPYVCAEWNF 124
Query: 126 GGFPLWLHFIPGIQFRTDNEPFKAEMQRFTAKIVDMMKQEKLYASQGGPIILSQIENEYG 185
GGFP+WL ++PGI FRTDNEPFKA MQ+FT KIV+MMK EKL+ ++GGPIILSQIENEYG
Sbjct: 125 GGFPIWLKYVPGIAFRTDNEPFKAAMQKFTEKIVNMMKAEKLFQTEGGPIILSQIENEYG 184
Query: 186 NIDSAYGAAGKSYIKWAAGMALSLDTGVPWVMCQQSDAPDPIINTCNGFYCDQFTPNSNN 245
++ GA GK+Y KWAA MA+ L+TGVPW+MC+Q DAPDP+I+TCNG+YC+ F PN
Sbjct: 185 PVEWEIGAPGKAYTKWAAQMAVGLNTGVPWIMCKQEDAPDPVIDTCNGYYCENFKPNKVY 244
Query: 246 KPKMWTENWSGWFLSFGGAVPYRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNFDRTSGG 305
KPKMWTE W+GW+ FGGA+P RPVEDLAF+VARF Q GG+F NYYMYHGGTNF RT+GG
Sbjct: 245 KPKMWTEVWTGWYTEFGGAIPTRPVEDLAFSVARFIQSGGSFFNYYMYHGGTNFGRTAGG 304
Query: 306 PFISTSYDYDAPLDEYGLIRQPKWGHLKDLHKAIKLCEAALVATDPTYPSLGPNLEATVY 365
PF++TSYDYDAPLDEYGL++QPKWGHLKDLHKAIK CE ALVA DP+ LG N EA V+
Sbjct: 305 PFMATSYDYDAPLDEYGLLQQPKWGHLKDLHKAIKSCEYALVAVDPSVTKLGNNQEAHVF 364
Query: 366 KTGSGLCSAFLANIGTNSDVTVKFNGNSYLLPAWSVSILPDCKNVVFNTAKI----NSVT 421
T SG C+AFLAN T V V F Y LP WS+SILPDCK VFNTAK+ + V
Sbjct: 365 NTKSG-CAAFLANYDTKYPVRVSFGQGQYDLPPWSISILPDCKTAVFNTAKVTWKTSQVQ 423
Query: 422 LVPSFSRQSLQVAADSSDAIGSGWSYINEPVGISKDDAFTKPGLLEQINTTADQSDYLWY 481
+ P +SR Q S+I E + T GL EQI T D +DYLWY
Sbjct: 424 MKPVYSRLPWQ-------------SFIEETTTSDESGTTTLDGLYEQIYMTRDATDYLWY 470
Query: 482 SLSTNIKADEPLLEDGSKTVLHVQSLGHALHAFINGKLVGSGYGSSSNAKVTVDFPIALA 541
I +DE L +G +L + S HALH FING+L G+ YGS N K+T + L
Sbjct: 471 MTDITIGSDEAFLNNGKFPLLTIFSACHALHVFINGQLSGTVYGSLENPKLTFSQNVKLR 530
Query: 542 PGKNTFDLLSLTVGLQNYGAFYEKTGAGITGPVQLKGSGNGTNIDLSSQQWTYQTGLKGE 601
PG N LLS++VGL N G +E AG+ GP+ LKG GT D+S +WTY+ G+KGE
Sbjct: 531 PGINKLALLSISVGLPNVGTHFETWNAGVLGPISLKGLNTGT-WDMSRWKWTYKIGMKGE 589
Query: 602 ELNFPS---GSSTQWDSKSTLPKLQPLVWYKTTFDAPAGSEPVAIDFTGMGKGEAWVNGQ 658
L + SS W ++ K QPL WYK TF+AP G P+A+D MGKG+ W+NGQ
Sbjct: 590 ALGLHTVTGSSSVDWAEGPSMAKKQPLTWYKATFNAPPGHAPLALDMGSMGKGQIWINGQ 649
Query: 659 SIGRYWPTYVSQNGGCTDSCNYRGAYSSNKCLKNCGKPSQSLYHVPRSWLKSSGNTLVLF 718
S+GR+WP Y++Q G +CNY G + KC CGKPSQ YH+PRSWL +GN LV+F
Sbjct: 650 SVGRHWPGYIAQ--GSCGTCNYAGTFYDKKCRTYCGKPSQRWYHIPRSWLTPTGNLLVVF 707
Query: 719 EEIGGDPTKISFVTK 733
EE GGDP +S V +
Sbjct: 708 EEWGGDPQWMSLVER 722
>gi|3860420|emb|CAA09467.1| exo galactanase [Lupinus angustifolius]
Length = 730
Score = 873 bits (2255), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 428/732 (58%), Positives = 524/732 (71%), Gaps = 26/732 (3%)
Query: 6 ILLLVLCWGFVVLATTSFGANVTYDHRAVVIGGKRRVLISGSIHYPRSTPEMWPDLIQKS 65
+LLL+ W V A+ VTYDH+A++I G+RR+LISGSIHYPRSTP+MWPDLIQK+
Sbjct: 20 VLLLLFFWVCYVTAS------VTYDHKAIMINGQRRILISGSIHYPRSTPQMWPDLIQKA 73
Query: 66 KDGGLDVIETYVFWNLHEPVRNQYNFEGRYDLVKFVKLVAEAGLYAHLRIGPYVCAEWNF 125
KDGGLDVIETYVFWN HEP +Y FE R+DLV F+KLV +AGL+ HLRIGP++CAEWNF
Sbjct: 74 KDGGLDVIETYVFWNGHEPSPGKYYFEDRFDLVGFIKLVQQAGLFVHLRIGPFICAEWNF 133
Query: 126 GGFPLWLHFIPGIQFRTDNEPFKAEMQRFTAKIVDMMKQEKLYASQGGPIILSQIENEYG 185
GGFP+WL ++PGI FRTDNEPFK MQ+FT KIV++MK EKL+ SQGGPIILSQIENEYG
Sbjct: 134 GGFPVWLKYVPGIAFRTDNEPFKEAMQKFTEKIVNIMKAEKLFQSQGGPIILSQIENEYG 193
Query: 186 NIDSAYGAAGKSYIKWAAGMALSLDTGVPWVMCQQSDAPDPIINTCNGFYCDQFTPNSNN 245
++ GA GK+Y KWAA MA+ LDTGVPWVMC+Q DAPDPII+TCNGFYC+ FTPN N
Sbjct: 194 PVEWEIGAPGKAYTKWAAQMAVGLDTGVPWVMCKQEDAPDPIIDTCNGFYCENFTPNKNY 253
Query: 246 KPKMWTENWSGWFLSFGGAVPYRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNFDRTSGG 305
KPK+WTENW+GW+ +FGGA PYRP ED+AF+VARF Q G+ NYYMYHGGTNF RTS G
Sbjct: 254 KPKLWTENWTGWYTAFGGATPYRPAEDIAFSVARFIQNRGSLFNYYMYHGGTNFGRTSNG 313
Query: 306 PFISTSYDYDAPLDEYGLIRQPKWGHLKDLHKAIKLCEAALVATDPTYPSLGPNLEATVY 365
F++TSYDYDAP+DEYGL+ +PKWGHL++LH+AIK CE+ALV+ DPT G NLE +Y
Sbjct: 314 LFVATSYDYDAPIDEYGLLNEPKWGHLRELHRAIKQCESALVSVDPTVSWPGKNLEVHLY 373
Query: 366 KTGSGLCSAFLANIGTNSDVTVKFNGNSYLLPAWSVSILPDCKNVVFNTAKINSVTLVPS 425
KT S C+AFLAN T+ VKF Y LP WS+SILPDCK VFNTAK+NS P
Sbjct: 374 KTESA-CAAFLANYNTDYSTQVKFGNGQYDLPPWSISILPDCKTEVFNTAKVNS----PR 428
Query: 426 FSRQSLQVAADSSDAIGSGW-SYINEPVGISKDDAFTKPGLLEQINTTADQSDYLWYSLS 484
R+ V + W SY EP S++D T L EQ+ T D SDYLWY
Sbjct: 429 LHRKMTPVNS------AFAWQSYNEEPASSSENDPVTGYALWEQVGVTRDSSDYLWYLTD 482
Query: 485 TNIKADEPLLEDGSKTVLHVQSLGHALHAFINGKLVGSGYGSSSNAKVTVDFPIALAPGK 544
NI ++ ++DG VL S GH L+ FING+ G+ YGS + ++T + L G
Sbjct: 483 VNIGPND--IKDGKWPVLTAMSAGHVLNVFINGQYAGTAYGSLDDPRLTFSQSVNLRVGN 540
Query: 545 NTFDLLSLTVGLQNYGAFYEKTGAGITGPVQLKGSGNGTNIDLSSQQWTYQTGLKGEELN 604
N LLS++VGL N G +E G+ GPV L G +GT DLS Q+W+Y+ GLKGE L+
Sbjct: 541 NKISLLSVSVGLANVGTHFETWNTGVLGPVTLTGLSSGT-WDLSKQKWSYKIGLKGESLS 599
Query: 605 FPS---GSSTQWDSKSTLPKLQPLVWYKTTFDAPAGSEPVAIDFTGMGKGEAWVNGQSIG 661
+ +S +W S + K QPL WYKTTF APAG++P+A+D MGKGE WVNGQSIG
Sbjct: 600 LHTEAGSNSVEWVQGSLVAKKQPLAWYKTTFSAPAGNDPLALDLGSMGKGEVWVNGQSIG 659
Query: 662 RYWPTYVSQNGGCTDSCNYRGAYSSNKCLKNCGKPSQSLYHVPRSWLKSSGNTLVLFEEI 721
R+WP ++ G C + CNY G Y+ KCL NCG+PSQ YHVPRSWL+S GN LV+ EE
Sbjct: 660 RHWPGNKAR-GNCGN-CNYAGTYTDTKCLANCGQPSQRWYHVPRSWLRSGGNYLVVLEEW 717
Query: 722 GGDPTKISFVTK 733
GGDP I+ V +
Sbjct: 718 GGDPNGIALVER 729
>gi|12583687|dbj|BAB21492.1| beta-D-galactosidase [Pyrus pyrifolia]
Length = 731
Score = 872 bits (2253), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 427/731 (58%), Positives = 529/731 (72%), Gaps = 22/731 (3%)
Query: 6 ILLLVLCWGFVVLATTSFGANVTYDHRAVVIGGKRRVLISGSIHYPRSTPEMWPDLIQKS 65
ILLL C + + S A+V+YDH+A++I G++R+LISGSIHYPRSTPEMWPDLIQK+
Sbjct: 11 ILLLFSC----IFSAAS--ASVSYDHKAIIINGQKRILISGSIHYPRSTPEMWPDLIQKA 64
Query: 66 KDGGLDVIETYVFWNLHEPVRNQYNFEGRYDLVKFVKLVAEAGLYAHLRIGPYVCAEWNF 125
KDGGLDVI+TYVFWN HEP +Y FE RYDLVKF+KLV +AGL+ +LRIGPYVCAEWNF
Sbjct: 65 KDGGLDVIQTYVFWNGHEPSPGKYYFEDRYDLVKFIKLVQQAGLFVNLRIGPYVCAEWNF 124
Query: 126 GGFPLWLHFIPGIQFRTDNEPFKAEMQRFTAKIVDMMKQEKLYASQGGPIILSQIENEYG 185
GGFP+WL ++PGI FRTDNEPFKA MQ+FT KIV MMK EKL+ +QGGPIILSQIENE+G
Sbjct: 125 GGFPVWLKYVPGIAFRTDNEPFKAAMQKFTEKIVSMMKAEKLFQTQGGPIILSQIENEFG 184
Query: 186 NIDSAYGAAGKSYIKWAAGMALSLDTGVPWVMCQQSDAPDPIINTCNGFYCDQFTPNSNN 245
++ GA GK+Y KWAA MA+ LDTGVPW+MC+Q DAPDP+I+TCNGFYC+ F PN +
Sbjct: 185 PVEWEIGAPGKAYTKWAAQMAVGLDTGVPWIMCKQEDAPDPVIDTCNGFYCENFKPNKDY 244
Query: 246 KPKMWTENWSGWFLSFGGAVPYRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNFDRTSGG 305
KPKMWTE W+GW+ FGGAVP RP ED+AF+VARF Q GG+F NYYMYHGGTNF RT+GG
Sbjct: 245 KPKMWTEVWTGWYTEFGGAVPTRPAEDVAFSVARFIQSGGSFLNYYMYHGGTNFGRTAGG 304
Query: 306 PFISTSYDYDAPLDEYGLIRQPKWGHLKDLHKAIKLCEAALVATDPTYPSLGPNLEATVY 365
PF++TSYDYDAPLDEYGL+R+PKWGHL+DLHKAIK CE+ALV+ DP+ LG N EA V+
Sbjct: 305 PFMATSYDYDAPLDEYGLLREPKWGHLRDLHKAIKSCESALVSVDPSVTKLGSNQEAHVF 364
Query: 366 KTGSGLCSAFLANIGTNSDVTVKFNGNSYLLPAWSVSILPDCKNVVFNTAKINSVTLVPS 425
K+ S C+AFLAN V V F G Y LP WS+SILPDCK V++TAK+ S
Sbjct: 365 KSESD-CAAFLANYDAKYSVKVSFGGGQYDLPPWSISILPDCKTEVYSTAKVGS------ 417
Query: 426 FSRQSLQVAADSSDAIGSGWSYINEPVGISKDDAFTKPGLLEQINTTADQSDYLWYSLST 485
QS QV + S+I E + D T GL EQIN T D +DYLWY
Sbjct: 418 ---QSSQVQMTPVHSGFPWQSFIEETTSSDETDTTTLDGLYEQINITRDTTDYLWYMTDI 474
Query: 486 NIKADEPLLEDGSKTVLHVQSLGHALHAFINGKLVGSGYGSSSNAKVTVDFPIALAPGKN 545
I +DE L++G +L + S GHAL+ FING+L G+ YGS N K++ + L G N
Sbjct: 475 TIGSDEAFLKNGKSPLLTIFSAGHALNVFINGQLSGTVYGSLENPKLSFSQNVNLRSGIN 534
Query: 546 TFDLLSLTVGLQNYGAFYEKTGAGITGPVQLKGSGNGTNIDLSSQQWTYQTGLKGEELNF 605
LLS++VGL N G +E AG+ GP+ LKG +GT D+S +WTY+TGLKGE L
Sbjct: 535 KLALLSISVGLPNVGTHFETWNAGVLGPITLKGLNSGT-WDMSGWKWTYKTGLKGEALGL 593
Query: 606 PS---GSSTQWDSKSTLPKLQPLVWYKTTFDAPAGSEPVAIDFTGMGKGEAWVNGQSIGR 662
+ SS +W ++ K QPL WYK TF+AP G P+A+D MGKG+ W+NGQS+GR
Sbjct: 594 HTVTGSSSVEWVEGPSMAKKQPLTWYKATFNAPPGDAPLALDMGSMGKGQIWINGQSVGR 653
Query: 663 YWPTYVSQNGGCTDSCNYRGAYSSNKCLKNCGKPSQSLYHVPRSWLKSSGNTLVLFEEIG 722
+WP Y+++ G C D C+Y G Y KC +CG+PSQ YH+PRSWL +GN LV+FEE G
Sbjct: 654 HWPGYIAR-GSCGD-CSYAGTYDDKKCRTHCGEPSQRWYHIPRSWLTPNGNLLVVFEEWG 711
Query: 723 GDPTKISFVTK 733
GDP++IS V +
Sbjct: 712 GDPSRISLVER 722
>gi|84579369|dbj|BAE72073.1| pear beta-galactosidase1 [Pyrus communis]
Length = 731
Score = 872 bits (2253), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 429/731 (58%), Positives = 527/731 (72%), Gaps = 22/731 (3%)
Query: 6 ILLLVLCWGFVVLATTSFGANVTYDHRAVVIGGKRRVLISGSIHYPRSTPEMWPDLIQKS 65
ILLL C + + S A+V+YDH+A++I G++R+LISGSIHYPRSTPEMWPDLIQK+
Sbjct: 11 ILLLFSC----IFSAAS--ASVSYDHKAIIINGQKRILISGSIHYPRSTPEMWPDLIQKA 64
Query: 66 KDGGLDVIETYVFWNLHEPVRNQYNFEGRYDLVKFVKLVAEAGLYAHLRIGPYVCAEWNF 125
KDGGLDVI+TYVFWN HEP +Y FE RYDLVKF+KLV +AGL+ +LRIGPYVCAEWNF
Sbjct: 65 KDGGLDVIQTYVFWNGHEPSPGKYYFEDRYDLVKFIKLVQQAGLFVNLRIGPYVCAEWNF 124
Query: 126 GGFPLWLHFIPGIQFRTDNEPFKAEMQRFTAKIVDMMKQEKLYASQGGPIILSQIENEYG 185
GGFP+WL ++PGI FRTDNEPFKA MQ+FT KIV MMK EKL+ SQGGPIILSQIENE+G
Sbjct: 125 GGFPVWLKYVPGIAFRTDNEPFKAAMQKFTEKIVSMMKAEKLFQSQGGPIILSQIENEFG 184
Query: 186 NIDSAYGAAGKSYIKWAAGMALSLDTGVPWVMCQQSDAPDPIINTCNGFYCDQFTPNSNN 245
++ GA GK+Y KWAA MA+ LDTGVPW+MC+Q DAPDP+I+TCNGFYC+ F PN +
Sbjct: 185 PVEWEIGAPGKAYTKWAAQMAVGLDTGVPWIMCKQEDAPDPVIDTCNGFYCENFKPNKDY 244
Query: 246 KPKMWTENWSGWFLSFGGAVPYRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNFDRTSGG 305
KPKMWTE W+GW+ FGGAVP RP ED+AF+VARF Q GG+F NYYMYHGGTNF RT+GG
Sbjct: 245 KPKMWTEVWTGWYTEFGGAVPTRPAEDVAFSVARFIQSGGSFLNYYMYHGGTNFGRTAGG 304
Query: 306 PFISTSYDYDAPLDEYGLIRQPKWGHLKDLHKAIKLCEAALVATDPTYPSLGPNLEATVY 365
PF++TSYDYDAPLDEYGL R+PKWGHL+DLHKAIK CE+ALV+ DP+ LG N EA V+
Sbjct: 305 PFMATSYDYDAPLDEYGLPREPKWGHLRDLHKAIKPCESALVSVDPSVTKLGSNQEAHVF 364
Query: 366 KTGSGLCSAFLANIGTNSDVTVKFNGNSYLLPAWSVSILPDCKNVVFNTAKINSVTLVPS 425
K+ S C+AFLAN V V F G Y LP WS+SILPDCK V+NTAK+ S
Sbjct: 365 KSESD-CAAFLANYDAKYSVKVSFGGGQYDLPPWSISILPDCKTEVYNTAKVGS------ 417
Query: 426 FSRQSLQVAADSSDAIGSGWSYINEPVGISKDDAFTKPGLLEQINTTADQSDYLWYSLST 485
QS QV + S+I E + D T GL EQIN T D +DYLWY
Sbjct: 418 ---QSSQVQMTPVHSGFPWQSFIEETTSSDETDTTTLDGLYEQINITRDTTDYLWYMTDI 474
Query: 486 NIKADEPLLEDGSKTVLHVQSLGHALHAFINGKLVGSGYGSSSNAKVTVDFPIALAPGKN 545
I +DE L++G +L + S GHAL+ FING+L G+ YGS N K++ + L G N
Sbjct: 475 TIGSDEAFLKNGKSPLLTISSAGHALNVFINGQLSGTVYGSLENPKLSFSQNVNLRSGIN 534
Query: 546 TFDLLSLTVGLQNYGAFYEKTGAGITGPVQLKGSGNGTNIDLSSQQWTYQTGLKGEELNF 605
LLS++VGL N G +E AG+ GP+ LKG +GT D+S +WTY+TGLKGE L
Sbjct: 535 KLALLSISVGLPNVGTHFETWNAGVLGPITLKGLNSGT-WDMSGWKWTYKTGLKGEALGL 593
Query: 606 PS---GSSTQWDSKSTLPKLQPLVWYKTTFDAPAGSEPVAIDFTGMGKGEAWVNGQSIGR 662
+ SS +W ++ K QPL WYK TF+AP G P+A+D MGKG+ W+NGQS+GR
Sbjct: 594 HTVTGSSSVEWVEGPSMAKKQPLTWYKATFNAPPGDAPLALDMGSMGKGQIWINGQSVGR 653
Query: 663 YWPTYVSQNGGCTDSCNYRGAYSSNKCLKNCGKPSQSLYHVPRSWLKSSGNTLVLFEEIG 722
+WP Y+++ G C D C+Y G Y KC +CG+PSQ YH+PRSWL +GN LV+FEE G
Sbjct: 654 HWPGYIAR-GSCGD-CSYAGTYDDKKCRTHCGEPSQRWYHIPRSWLTPTGNLLVVFEEWG 711
Query: 723 GDPTKISFVTK 733
GDP+ IS V +
Sbjct: 712 GDPSGISLVER 722
>gi|297816572|ref|XP_002876169.1| AT3g52840/F8J2_10 [Arabidopsis lyrata subsp. lyrata]
gi|297322007|gb|EFH52428.1| AT3g52840/F8J2_10 [Arabidopsis lyrata subsp. lyrata]
Length = 728
Score = 871 bits (2251), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 429/733 (58%), Positives = 529/733 (72%), Gaps = 23/733 (3%)
Query: 6 ILLLVLCWGFVVLATTSFGANVTYDHRAVVIGGKRRVLISGSIHYPRSTPEMWPDLIQKS 65
I L +LC+ ++ +T A VTYDH+A++I G+RR+LISGSIHYPRSTPEMWPDLI+K+
Sbjct: 11 IFLAILCFSSLIWSTE---AVVTYDHKALIINGQRRILISGSIHYPRSTPEMWPDLIKKA 67
Query: 66 KDGGLDVIETYVFWNLHEPVRNQYNFEGRYDLVKFVKLVAEAGLYAHLRIGPYVCAEWNF 125
K+GGLDVI+TYVFWN HEP Y F+ RYDLVKF KLV +AGLY LRIGPYVCAEWNF
Sbjct: 68 KEGGLDVIQTYVFWNGHEPSPGNYYFQDRYDLVKFTKLVHQAGLYLDLRIGPYVCAEWNF 127
Query: 126 GGFPLWLHFIPGIQFRTDNEPFKAEMQRFTAKIVDMMKQEKLYASQGGPIILSQIENEYG 185
GGFP+WL ++PGI FRTDNEPFK MQRFT KIVDMMK+EKL+ +QGGPIILSQIENEYG
Sbjct: 128 GGFPVWLKYVPGIVFRTDNEPFKIAMQRFTKKIVDMMKEEKLFETQGGPIILSQIENEYG 187
Query: 186 NIDSAYGAAGKSYIKWAAGMALSLDTGVPWVMCQQSDAPDPIINTCNGFYCDQFTPNSNN 245
++ GAAGK+Y KW A MAL L TGVPW+MC+Q DAP PII+TCNGFYC+ F PNS+N
Sbjct: 188 PMEWEMGAAGKAYSKWTAEMALGLSTGVPWIMCKQEDAPYPIIDTCNGFYCEGFKPNSDN 247
Query: 246 KPKMWTENWSGWFLSFGGAVPYRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNFDRTSGG 305
KPK+WTENW+GWF FGGA+P RPVED+AF+VARF Q GG+F NYYMY+GGTNFDRT+ G
Sbjct: 248 KPKLWTENWTGWFTEFGGAIPNRPVEDIAFSVARFIQNGGSFLNYYMYYGGTNFDRTA-G 306
Query: 306 PFISTSYDYDAPLDEYGLIRQPKWGHLKDLHKAIKLCEAALVATDPTYPSLGPNLEATVY 365
FI+TSYDYDAPLDEYGL+R+PK+ HLK+LHK IKLCE ALV+ DPT SLG E V+
Sbjct: 307 VFIATSYDYDAPLDEYGLLREPKYSHLKELHKVIKLCEPALVSVDPTITSLGDKQEVHVF 366
Query: 366 KTGSGLCSAFLANIGTNSDVTVKFNGNSYLLPAWSVSILPDCKNVVFNTAKINSVTLVPS 425
K+ + C+AFL+N T+S + F G Y LP WSVSILPDCK +NTAKI + T++
Sbjct: 367 KSKTS-CAAFLSNYDTSSAARIMFRGFPYDLPPWSVSILPDCKTEYYNTAKIRAPTILMK 425
Query: 426 FSRQSLQVAADSSDAIGSGWSYINEPVGISKDD-AFTKPGLLEQINTTADQSDYLWYSLS 484
S + + W NE S DD F K GL+EQI+ T D++DY WY
Sbjct: 426 MVPTSTKFS----------WESYNEGSPSSNDDGTFVKDGLVEQISMTRDKTDYFWYLTD 475
Query: 485 TNIKADEPLLEDGSKTVLHVQSLGHALHAFINGKLVGSGYGSSSNAKVTVDFPIALAPGK 544
I +DE L+ G +L + S GHALH F+NG L G+ YG+ SN+K+T I L+ G
Sbjct: 476 ITIGSDESFLKTGDDPLLTIFSAGHALHVFVNGLLAGTSYGALSNSKLTFSQKIKLSVGI 535
Query: 545 NTFDLLSLTVGLQNYGAFYEKTGAGITGPVQLKGSGNGTNIDLSSQQWTYQTGLKGEELN 604
N LLS VGL N G YE G+ GPV LKG +GT D+S +W+Y+ G++GE ++
Sbjct: 536 NKLALLSTAVGLPNAGVHYETWNTGVLGPVTLKGVNSGT-WDMSKWKWSYKIGIRGEAMS 594
Query: 605 FPS--GSST--QWDSKSTLPKLQPLVWYKTTFDAPAGSEPVAIDFTGMGKGEAWVNGQSI 660
F + GSS W S + K +PL WYK++FD P G+EP+A+D MGKG+ WVNG +I
Sbjct: 595 FHTIAGSSAVKWWIKGSFVVKKEPLTWYKSSFDTPKGNEPLALDMNTMGKGQVWVNGHNI 654
Query: 661 GRYWPTYVSQNGGCTDSCNYRGAYSSNKCLKNCGKPSQSLYHVPRSWLKSSGNTLVLFEE 720
GR+WP Y ++ G C CNY G Y+ KCL +CG+PSQ YHVPRSWLK GN LV+FEE
Sbjct: 655 GRHWPAYTAR-GNC-GRCNYAGIYNEKKCLSHCGEPSQRWYHVPRSWLKPFGNLLVIFEE 712
Query: 721 IGGDPTKISFVTK 733
GGDP+ IS V +
Sbjct: 713 WGGDPSGISLVKR 725
>gi|380450408|gb|AFD54987.1| beta-galactosidase [Momordica charantia]
Length = 719
Score = 871 bits (2251), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 424/736 (57%), Positives = 518/736 (70%), Gaps = 25/736 (3%)
Query: 3 SKEILLLVLCWGFVVLATTSFGANVTYDHRAVVIGGKRRVLISGSIHYPRSTPEMWPDLI 62
SK +LL + +V A A VTYD +A++I GKRR+L+SGSIHYPRSTP+MWP LI
Sbjct: 2 SKCVLLFLGLLSWVCYAM----ATVTYDQKAIIINGKRRILVSGSIHYPRSTPQMWPSLI 57
Query: 63 QKSKDGGLDVIETYVFWNLHEPVRNQYNFEGRYDLVKFVKLVAEAGLYAHLRIGPYVCAE 122
Q +KDGGLD+IETYVFWN HEP + +Y FE RYDLV+F+KLV +AGLY HLRIGPYVCAE
Sbjct: 58 QNAKDGGLDIIETYVFWNGHEPTQGKYYFEDRYDLVRFIKLVQQAGLYVHLRIGPYVCAE 117
Query: 123 WNFGGFPLWLHFIPGIQFRTDNEPFKAEMQRFTAKIVDMMKQEKLYASQGGPIILSQIEN 182
WN+GGFP+WL +PGI FRT+NEPFKA MQ+FT KIV MMK EKLY SQGGPIILSQIEN
Sbjct: 118 WNYGGFPIWLKHVPGIVFRTENEPFKAAMQKFTEKIVGMMKSEKLYESQGGPIILSQIEN 177
Query: 183 EYGNIDSAYGAAGKSYIKWAAGMALSLDTGVPWVMCQQSDAPDPIINTCNGFYCDQFTPN 242
EYG ++ GA GKSY KWAA MAL LDTGVPWVMC+Q DAPDP+I+TCNGFYC+ F PN
Sbjct: 178 EYGPVEWEIGAPGKSYTKWAAQMALGLDTGVPWVMCKQEDAPDPVIDTCNGFYCENFKPN 237
Query: 243 SNNKPKMWTENWSGWFLSFGGAVPYRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNFDRT 302
NKPK+WTE WSGW+ +FGGAVPYRP EDLAF+VARF Q GG+ NYYMYHGGTNF R+
Sbjct: 238 RENKPKIWTEVWSGWYTAFGGAVPYRPAEDLAFSVARFVQNGGSLFNYYMYHGGTNFGRS 297
Query: 303 SGGPFISTSYDYDAPLDEYGLIRQPKWGHLKDLHKAIKLCEAALVATDPTYPSLGPNLEA 362
S G FI+ SYD+DAP+DEYGL R+PKW HL+DLHKAIKLCE ALV+ DP LG NLEA
Sbjct: 298 S-GLFIANSYDFDAPIDEYGLKREPKWEHLRDLHKAIKLCEPALVSADPNVTWLGKNLEA 356
Query: 363 TVYKTGSGLCSAFLANIGTNSDVTVKFNGNSYLLPAWSVSILPDCKNVVFNTAKINSVTL 422
V+K+ SG C+AFLAN ++ V F Y LP WS+SIL DCK+ +FNTA+I +
Sbjct: 357 RVFKSSSGACAAFLANYDISTSSKVSFWNTQYDLPPWSISILSDCKSAIFNTARIGA--- 413
Query: 423 VPSFSRQSLQVAADSSDAIGSGW--SYINEPVGISKDDAFTKPGLLEQINTTADQSDYLW 480
Q A + S W SY E D TK GL+EQ+N T D +DYLW
Sbjct: 414 ---------QSAPMKMMLVSSFWWLSYKEEVASGYATDTTTKDGLVEQVNFTWDSTDYLW 464
Query: 481 YSLSTNIKADEPLLEDGSKTVLHVQSLGHALHAFINGKLVGSGYGSSSNAKVTVDFPIAL 540
Y I +E ++ G +L++ S GH LH F+NG+L G+ YGS N KV + L
Sbjct: 465 YMTDIQIDPNEAFIKSGQWPLLNISSAGHVLHVFVNGQLSGTVYGSLENPKVAFSKYVNL 524
Query: 541 APGKNTFDLLSLTVGLQNYGAFYEKTGAGITGPVQLKGSGNGTNIDLSSQQWTYQTGLKG 600
G N +LS+TVGL N G +E AG+ GPV LKG G D+S +W+++ GLKG
Sbjct: 525 KAGVNKLSMLSVTVGLPNVGLHFESWNAGVLGPVTLKGLNEGIR-DMSGYKWSHKVGLKG 583
Query: 601 EELNFPS---GSSTQWDSKSTLPKLQPLVWYKTTFDAPAGSEPVAIDFTGMGKGEAWVNG 657
E +N + +S QW S L + QPL WYKT F+ PAG+EP+A+D + MGKG+ W+NG
Sbjct: 584 ENMNLHTIGGSNSVQWAKGSGLVQKQPLTWYKTNFNTPAGNEPLALDMSSMGKGQIWING 643
Query: 658 QSIGRYWPTYVSQNGGCTDSCNYRGAYSSNKCLKNCGKPSQSLYHVPRSWLKSSGNTLVL 717
+SIGRYWP Y + G C+Y G ++ KCL NCG+PSQ YHVPR WL+S GN LV+
Sbjct: 644 RSIGRYWPAYAAS--GSCGKCSYAGIFTEKKCLSNCGQPSQKWYHVPREWLESKGNFLVV 701
Query: 718 FEEIGGDPTKISFVTK 733
FEE+GG+P IS V +
Sbjct: 702 FEELGGNPGGISLVKR 717
>gi|51507377|emb|CAH18936.1| beta-galactosidase [Pyrus communis]
Length = 724
Score = 870 bits (2249), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 427/731 (58%), Positives = 526/731 (71%), Gaps = 22/731 (3%)
Query: 6 ILLLVLCWGFVVLATTSFGANVTYDHRAVVIGGKRRVLISGSIHYPRSTPEMWPDLIQKS 65
ILLL C + + S A+V+YDH+A++I G++R+LISGSIHYPRSTPEMWPDLIQK+
Sbjct: 4 ILLLFSC----IFSAAS--ASVSYDHKAIIINGQKRILISGSIHYPRSTPEMWPDLIQKA 57
Query: 66 KDGGLDVIETYVFWNLHEPVRNQYNFEGRYDLVKFVKLVAEAGLYAHLRIGPYVCAEWNF 125
KDGGLDVI+TYVFWN HEP +Y FE RYDLVKF+KLV +AGL+ +LRIGPYVCAEWNF
Sbjct: 58 KDGGLDVIQTYVFWNGHEPSPGKYYFEDRYDLVKFIKLVQQAGLFVNLRIGPYVCAEWNF 117
Query: 126 GGFPLWLHFIPGIQFRTDNEPFKAEMQRFTAKIVDMMKQEKLYASQGGPIILSQIENEYG 185
GGFP+WL ++PGI FRTDNEPFKA MQ+FT KIV MMK EKL+ SQGGPIILSQIENE+G
Sbjct: 118 GGFPVWLKYVPGIAFRTDNEPFKAAMQKFTEKIVSMMKAEKLFQSQGGPIILSQIENEFG 177
Query: 186 NIDSAYGAAGKSYIKWAAGMALSLDTGVPWVMCQQSDAPDPIINTCNGFYCDQFTPNSNN 245
++ GA GK+Y KWAA MA+ LDTGVPW+MC+Q DAPDP+I+TCNGFYC+ F PN +
Sbjct: 178 PVEWEIGAPGKAYTKWAAQMAVGLDTGVPWIMCKQEDAPDPVIDTCNGFYCENFKPNKDY 237
Query: 246 KPKMWTENWSGWFLSFGGAVPYRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNFDRTSGG 305
KPKMWTE W+GW+ FGGAVP RP ED+AF+VARF Q GG+F NYYMYHGGTNF RT+GG
Sbjct: 238 KPKMWTEVWTGWYTEFGGAVPTRPAEDVAFSVARFIQSGGSFLNYYMYHGGTNFGRTAGG 297
Query: 306 PFISTSYDYDAPLDEYGLIRQPKWGHLKDLHKAIKLCEAALVATDPTYPSLGPNLEATVY 365
PF++TSYDYDAPLDEYGL R+PKWGHL+DLHKAIK CE+ALV+ DP+ LG N EA V+
Sbjct: 298 PFMATSYDYDAPLDEYGLPREPKWGHLRDLHKAIKPCESALVSVDPSVTKLGSNQEAHVF 357
Query: 366 KTGSGLCSAFLANIGTNSDVTVKFNGNSYLLPAWSVSILPDCKNVVFNTAKINSVTLVPS 425
K+ S C+AFLAN V V F G Y LP WS+SILPDCK V+NTAK+ S
Sbjct: 358 KSESD-CAAFLANYDAKYSVKVSFGGGQYDLPPWSISILPDCKTEVYNTAKVGS------ 410
Query: 426 FSRQSLQVAADSSDAIGSGWSYINEPVGISKDDAFTKPGLLEQINTTADQSDYLWYSLST 485
QS QV + S+I E + D GL EQIN T D +DYLWY
Sbjct: 411 ---QSSQVQMTPVHSGFPWQSFIEETTSSDETDTTYMDGLYEQINITRDTTDYLWYMTDI 467
Query: 486 NIKADEPLLEDGSKTVLHVQSLGHALHAFINGKLVGSGYGSSSNAKVTVDFPIALAPGKN 545
I +DE L++G +L + S GHAL+ FING+L G+ YGS N K++ + L G N
Sbjct: 468 TIGSDEAFLKNGKSPLLTISSAGHALNVFINGQLSGTVYGSLENPKLSFSQNVNLRSGIN 527
Query: 546 TFDLLSLTVGLQNYGAFYEKTGAGITGPVQLKGSGNGTNIDLSSQQWTYQTGLKGEELNF 605
LLS++VGL N G +E AG+ GP+ LKG +GT D+S +WTY+TGLKGE L
Sbjct: 528 KLALLSISVGLPNVGTHFETWNAGVLGPITLKGLNSGT-WDMSGWKWTYKTGLKGEALGL 586
Query: 606 PS---GSSTQWDSKSTLPKLQPLVWYKTTFDAPAGSEPVAIDFTGMGKGEAWVNGQSIGR 662
+ SS +W ++ K QPL W+K TF+AP G P+A+D MGKG+ W+NGQS+GR
Sbjct: 587 HTVTGSSSVEWVEGPSMAKKQPLTWHKATFNAPPGDAPLALDMGSMGKGQIWINGQSVGR 646
Query: 663 YWPTYVSQNGGCTDSCNYRGAYSSNKCLKNCGKPSQSLYHVPRSWLKSSGNTLVLFEEIG 722
+WP Y+++ G C D C+Y G Y KC +CG+PSQ YH+PRSWL +GN LV+FEE G
Sbjct: 647 HWPGYIAR-GSCGD-CSYAGTYDDKKCRTHCGEPSQRWYHIPRSWLTPTGNLLVVFEEWG 704
Query: 723 GDPTKISFVTK 733
GDP+ IS V +
Sbjct: 705 GDPSGISLVER 715
>gi|449452747|ref|XP_004144120.1| PREDICTED: beta-galactosidase-like [Cucumis sativus]
Length = 782
Score = 870 bits (2249), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 431/728 (59%), Positives = 525/728 (72%), Gaps = 31/728 (4%)
Query: 15 FVVLATTSFGANVTYDHRAVVIGGKRRVLISGSIHYPRSTPEMWPDLIQKSKDGGLDVIE 74
F LA+ S +VTYDH+A++I G+RR+LISGSIHYPRSTP+MWPDLIQK+KDGGLD+IE
Sbjct: 74 FSGLASAS--RSVTYDHKAIIINGQRRILISGSIHYPRSTPQMWPDLIQKAKDGGLDIIE 131
Query: 75 TYVFWNLHEPVRNQYNFEGRYDLVKFVKLVAEAGLYAHLRIGPYVCAEWNFGGFPLWLHF 134
TYVFWN HEP +Y FE RYDLV+F+KLV +AGLY HLRIGPYVCAEWN+GGFPLWL F
Sbjct: 132 TYVFWNGHEPSPGKYYFEERYDLVRFIKLVQQAGLYVHLRIGPYVCAEWNYGGFPLWLKF 191
Query: 135 IPGIQFRTDNEPFKAEMQRFTAKIVDMMKQEKLYASQGGPIILSQIENEYGNIDSAYGAA 194
+PGI FRTDN PFKA MQ+F KIVDMMK EKL+ +QGGPIILSQIENEYG ++ GA
Sbjct: 192 VPGIAFRTDNAPFKAAMQKFVYKIVDMMKWEKLFHTQGGPIILSQIENEYGPVEWEIGAP 251
Query: 195 GKSYIKWAAGMALSLDTGVPWVMCQQSDAPDPIINTCNGFYCDQFTPNSNNKPKMWTENW 254
GKSY KWAA MA+ L TGVPWVMC+Q DAPDP+I+TCNGFYC+ F PN KPK+WTENW
Sbjct: 252 GKSYTKWAAQMAVGLKTGVPWVMCKQEDAPDPLIDTCNGFYCENFKPNQIYKPKIWTENW 311
Query: 255 SGWFLSFGGAVPYRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNFDRTSGGPFISTSYDY 314
SGW+ +FGG PYRP ED+AF+VARF Q GG+ NYYMYHGGTNF RTS G F++TSYD+
Sbjct: 312 SGWYTAFGGPTPYRPPEDVAFSVARFIQNGGSLVNYYMYHGGTNFGRTS-GLFVTTSYDF 370
Query: 315 DAPLDEYGLIRQPKWGHLKDLHKAIKLCEAALVATDPTYPSLGPNLEATVYKTGSGLCSA 374
DAP+DEYGL+R+PKWGHL+DLHKAIKLCE ALV+ DPT LG N EA V+K+ SG C+A
Sbjct: 371 DAPIDEYGLLREPKWGHLRDLHKAIKLCEPALVSADPTSTWLGKNQEARVFKSSSGACAA 430
Query: 375 FLANIGTNSDVTVKFNGNSYLLPAWSVSILPDCKNVVFNTAKINSVTLVPSFSRQSLQVA 434
FLAN T++ V V F + Y LP WS+SILPDCK V FNT SLQ+
Sbjct: 431 FLANYDTSAFVRVNFWNHPYDLPPWSISILPDCKTVTFNTG--------------SLQIG 476
Query: 435 ADSSDA----IGSGW--SYINEPVGISKDDAFTKPGLLEQINTTADQSDYLWYSLSTNIK 488
S +A I S W SY EP D TK GL+EQ++ T D +DYLWY LS I
Sbjct: 477 VKSYEAKMTPISSFWWLSYKEEPASAYAQDTTTKDGLVEQVSVTWDTTDYLWYILSIRID 536
Query: 489 ADEPLLEDGSKTVLHVQSLGHALHAFINGKLVGSGYGSSSNAKVTVDFPIALAPGKNTFD 548
+ E L+ G +L V S GH LH FING+L GS YGS + ++T + L G N
Sbjct: 537 STEGFLKSGQWPLLTVNSAGHILHVFINGQLSGSVYGSLEDPRITFSKYVNLKQGVNKLS 596
Query: 549 LLSLTVGLQNYGAFYEKTGAGITGPVQLKGSGNGTNIDLSSQQWTYQTGLKGEELNFPS- 607
+LS+TVGL N G ++ AG+ GPV LKG GT D+S +W+Y+ GL+GE LN S
Sbjct: 597 MLSVTVGLPNVGLHFDTWNAGVLGPVTLKGLNEGTR-DMSKYKWSYKVGLRGEILNLYSV 655
Query: 608 --GSSTQWDSKSTLPKLQPLVWYKTTFDAPAGSEPVAIDFTGMGKGEAWVNGQSIGRYWP 665
+S QW K + K QPL WYKTTF+ PAG+EP+A+D + M KG+ WVNG+SIGRY+P
Sbjct: 656 KGSNSVQW-MKGSFQK-QPLTWYKTTFNTPAGNEPLALDMSSMSKGQIWVNGRSIGRYFP 713
Query: 666 TYVSQNGGCTDSCNYRGAYSSNKCLKNCGKPSQSLYHVPRSWLKSSGNTLVLFEEIGGDP 725
Y+++ G C + C+Y G ++ KCL NCG PSQ YH+PR WL +GN L++ EEIGG+P
Sbjct: 714 GYIAR-GKC-NKCSYTGFFTEKKCLWNCGGPSQKWYHIPRDWLSPNGNLLIILEEIGGNP 771
Query: 726 TKISFVTK 733
IS V +
Sbjct: 772 QGISLVKR 779
>gi|255550373|ref|XP_002516237.1| beta-galactosidase, putative [Ricinus communis]
gi|223544723|gb|EEF46239.1| beta-galactosidase, putative [Ricinus communis]
Length = 825
Score = 869 bits (2246), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 440/857 (51%), Positives = 574/857 (66%), Gaps = 42/857 (4%)
Query: 1 MASKEILLLVLCWGFVVLATTSFGANVTYDHRAVVIGGKRRVLISGSIHYPRSTPEMWPD 60
MAS + LL + F ++ A +++D RA+ I GKRRVL+SGSIHYPRSTP+MWPD
Sbjct: 1 MASLKFLLAISFSLFTFHLVSA--AVISHDGRAITIDGKRRVLLSGSIHYPRSTPQMWPD 58
Query: 61 LIQKSKDGGLDVIETYVFWNLHEPVRNQYNFEGRYDLVKFVKLVAEAGLYAHLRIGPYVC 120
LI+KSK+GGLD IETYVFWN+HEP R QY+F G DLV+F+K V + GLYA LRIGPYVC
Sbjct: 59 LIKKSKEGGLDAIETYVFWNVHEPSRRQYDFGGNLDLVRFIKAVQDEGLYAVLRIGPYVC 118
Query: 121 AEWNFGGFPLWLHFIPGIQFRTDNEPFKAEMQRFTAKIVDMMKQEKLYASQGGPIILSQI 180
AEWN+GGFP+WLH +PGI+ RT N F EMQ FT+ IVDMMKQE+L+ASQGGPII++Q+
Sbjct: 119 AEWNYGGFPVWLHNMPGIELRTANSIFMNEMQNFTSLIVDMMKQEQLFASQGGPIIIAQV 178
Query: 181 ENEYGNIDSAYGAAGKSYIKWAAGMALSLDTGVPWVMCQQSDAPDPIINTCNGFYCDQFT 240
ENEYGN+ S+YGAAGK+YI W A MA SL+ GVPW+MCQQSDAPDP+INTCNG+YCDQFT
Sbjct: 179 ENEYGNVMSSYGAAGKAYIDWCANMAESLNIGVPWIMCQQSDAPDPMINTCNGWYCDQFT 238
Query: 241 PNSNNKPKMWTENWSGWFLSFGGAVPYRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNFD 300
P++ N PKMWTENW+GWF S+GG P+R ED+AFAVARFFQ GGTFQNYYMYHGGTNF
Sbjct: 239 PSNPNSPKMWTENWTGWFKSWGGKDPHRTAEDVAFAVARFFQTGGTFQNYYMYHGGTNFG 298
Query: 301 RTSGGPFISTSYDYDAPLDEYGLIRQPKWGHLKDLHKAIKLCEAALVATDPTYPSLGPNL 360
RT+GGP+I+TSYDYDAPLDE+G + QPKWGHLK LH + E L + + ++
Sbjct: 299 RTAGGPYITTSYDYDAPLDEFGNLNQPKWGHLKQLHDVLHSMEEILTSGTVSSVDYDNSV 358
Query: 361 EATVYKTGSGLCSAFLANIGTNSDVTVKFNGNSYLLPAWSVSILPDCKNVVFNTAKINSV 420
AT+Y T S FL+N SD T++F G +Y +PAWSVSILPDC NV +NTAK+ +
Sbjct: 359 TATIYATDKE-SSCFLSNANETSDATIEFKGTTYTIPAWSVSILPDCANVGYNTAKVKTQ 417
Query: 421 TLVPSFSRQSLQVAADSSDAIGSGW--SYINEPVGISKDDAFTKPGLLEQINTTADQSDY 478
T S + A D ++ W +++ V + + K +++Q D SDY
Sbjct: 418 T---SVMVKRDNKAEDEPTSLNWSWRPENVDKTVLLGQGHIHAKQ-IVDQKAVANDASDY 473
Query: 479 LWYSLSTNIKADEPLLEDGSKTVLHVQSLGHALHAFINGKLVGSGYGSSSNAKVTVDFPI 538
LWY S ++K D+ + + + GH LHA++NG+ +GS + S + + +
Sbjct: 474 LWYMTSVDLKKDDLIWS--KDMSIRINGSGHILHAYVNGEYLGSQWSEYSVSNYVFEKSV 531
Query: 539 ALAPGKNTFDLLSLTVGLQNYGAFYEKTGAGITGPVQLKGSGNGTNI--DLSSQQWTYQT 596
L G+N LLS TVGL NYGA Y+ AGI GPV+L G I DLS+ +W+Y+
Sbjct: 532 KLKHGRNLITLLSATVGLANYGANYDLIQAGILGPVELVGRKGDETIIKDLSNNRWSYKV 591
Query: 597 GLKGEELNF---PSGSSTQWDSKSTLPKLQPLVWYKTTFDAPAGSEPVAIDFTGMGKGEA 653
GL G E S +++W + LP + L WYKTTF AP G++PV +D G+GKG A
Sbjct: 592 GLLGLEDKLYLSDSKHASKWQEQE-LPTNKMLTWYKTTFKAPLGTDPVVLDLQGLGKGMA 650
Query: 654 WVNGQSIGRYWPTYVSQNGGC-TDSCNYRGAYSSNKCLKNCGKPSQSLYHVPRSWLKSSG 712
W+NG SIGRYWP++++++ GC TD C+YRG Y +NKC+ NCGKP+Q YHVPRS+L+ +
Sbjct: 651 WINGNSIGRYWPSFLAEDDGCSTDLCDYRGPYDNNKCVSNCGKPTQRWYHVPRSFLQDNE 710
Query: 713 NTLVLFEEIGGDPTKISFVTKQLGSSLCSHVTDSHPLPVDMWGSDSKIQRKPGPVLSLEC 772
NTLVLFEE GG+P++++F T G + S G V+ + C
Sbjct: 711 NTLVLFEEFGGNPSQVNFQTVVTGVACVS--------------------GDEGEVVEISC 750
Query: 773 PNPNQVISSIKFASFGTPLGTCGSFSRGRCSSAR-SLSVVRQACVGSKSCSIGVSVNTFG 831
Q IS+++FASFG P GTCGS +G C +L +V++ACVG++SCS+ VS FG
Sbjct: 751 --NGQSISAVQFASFGDPQGTCGSSVKGSCEGTEDALLIVQKACVGNESCSLEVSHKLFG 808
Query: 832 D-PCKGVMKSLAVEASC 847
C + LAVE C
Sbjct: 809 STSCDNGVNRLAVEVLC 825
>gi|1352078|sp|P48981.1|BGAL_MALDO RecName: Full=Beta-galactosidase; AltName: Full=Acid
beta-galactosidase; Short=Lactase; AltName:
Full=Exo-(1-->4)-beta-D-galactanase; Flags: Precursor
gi|507278|gb|AAA62324.1| b-galactosidase-related protein; putative [Malus x domestica]
Length = 731
Score = 868 bits (2243), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 426/731 (58%), Positives = 526/731 (71%), Gaps = 22/731 (3%)
Query: 6 ILLLVLCWGFVVLATTSFGANVTYDHRAVVIGGKRRVLISGSIHYPRSTPEMWPDLIQKS 65
ILLL C + + S A+V+YDH+A++I G++R+LISGSIHYPRSTPEMWPDLIQK+
Sbjct: 11 ILLLFSC----IFSAAS--ASVSYDHKAIIINGQKRILISGSIHYPRSTPEMWPDLIQKA 64
Query: 66 KDGGLDVIETYVFWNLHEPVRNQYNFEGRYDLVKFVKLVAEAGLYAHLRIGPYVCAEWNF 125
KDGGLDVI+TYVFWN HEP Y FE RYDLVKF+KLV + GL+ +LRIGPYVCAEWNF
Sbjct: 65 KDGGLDVIQTYVFWNGHEPSPGNYYFEERYDLVKFIKLVQQEGLFVNLRIGPYVCAEWNF 124
Query: 126 GGFPLWLHFIPGIQFRTDNEPFKAEMQRFTAKIVDMMKQEKLYASQGGPIILSQIENEYG 185
GGFP+WL ++PGI FRTDNEPFKA MQ+FT KIV MMK EKL+ +QGGPIILSQIENE+G
Sbjct: 125 GGFPVWLKYVPGIAFRTDNEPFKAAMQKFTEKIVSMMKAEKLFQTQGGPIILSQIENEFG 184
Query: 186 NIDSAYGAAGKSYIKWAAGMALSLDTGVPWVMCQQSDAPDPIINTCNGFYCDQFTPNSNN 245
++ GA GK+Y KWAA MA+ LDTGVPW+MC+Q DAPDP+I+TCNGFYC+ F PN +
Sbjct: 185 PVEWEIGAPGKAYTKWAAQMAVGLDTGVPWIMCKQEDAPDPVIDTCNGFYCENFKPNKDY 244
Query: 246 KPKMWTENWSGWFLSFGGAVPYRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNFDRTSGG 305
KPKMWTE W+GW+ FGGAVP RP ED+AF+VARF Q GG+F NYYMYHGGTNF RT+GG
Sbjct: 245 KPKMWTEVWTGWYTEFGGAVPTRPAEDVAFSVARFIQSGGSFLNYYMYHGGTNFGRTAGG 304
Query: 306 PFISTSYDYDAPLDEYGLIRQPKWGHLKDLHKAIKLCEAALVATDPTYPSLGPNLEATVY 365
PF++TSYDYDAPLDEYGL R+PKWGHL+DLHKAIK CE+ALV+ DP+ LG N EA V+
Sbjct: 305 PFMATSYDYDAPLDEYGLPREPKWGHLRDLHKAIKSCESALVSVDPSVTKLGSNQEAHVF 364
Query: 366 KTGSGLCSAFLANIGTNSDVTVKFNGNSYLLPAWSVSILPDCKNVVFNTAKINSVTLVPS 425
K+ S C+AFLAN V V F G Y LP WS+SILPDCK V+NTAK+ S
Sbjct: 365 KSESD-CAAFLANYDAKYSVKVSFGGGQYDLPPWSISILPDCKTEVYNTAKVGS------ 417
Query: 426 FSRQSLQVAADSSDAIGSGWSYINEPVGISKDDAFTKPGLLEQINTTADQSDYLWYSLST 485
QS QV + S+I E + D T GL EQIN T D +DYLWY
Sbjct: 418 ---QSSQVQMTPVHSGFPWQSFIEETTSSDETDTTTLDGLYEQINITRDTTDYLWYMTDI 474
Query: 486 NIKADEPLLEDGSKTVLHVQSLGHALHAFINGKLVGSGYGSSSNAKVTVDFPIALAPGKN 545
I +DE L++G +L + S GHAL+ FING+L G+ YGS N K++ + L G N
Sbjct: 475 TIGSDEAFLKNGKSPLLTIFSAGHALNVFINGQLSGTVYGSLENPKLSFSQNVNLRSGIN 534
Query: 546 TFDLLSLTVGLQNYGAFYEKTGAGITGPVQLKGSGNGTNIDLSSQQWTYQTGLKGEELNF 605
LLS++VGL N G +E AG+ GP+ LKG +GT D+S +WTY+TGLKGE L
Sbjct: 535 KLALLSISVGLPNVGTHFETWNAGVLGPITLKGLNSGT-WDMSGWKWTYKTGLKGEALGL 593
Query: 606 PS---GSSTQWDSKSTLPKLQPLVWYKTTFDAPAGSEPVAIDFTGMGKGEAWVNGQSIGR 662
+ SS +W ++ + QPL WYK TF+AP G P+A+D MGKG+ W+NGQS+GR
Sbjct: 594 HTVTGSSSVEWVEGPSMAEKQPLTWYKATFNAPPGDAPLALDMGSMGKGQIWINGQSVGR 653
Query: 663 YWPTYVSQNGGCTDSCNYRGAYSSNKCLKNCGKPSQSLYHVPRSWLKSSGNTLVLFEEIG 722
+WP Y+++ G C D C+Y G Y KC +CG+PSQ YH+PRSWL +GN LV+FEE G
Sbjct: 654 HWPGYIAR-GSCGD-CSYAGTYDDKKCRTHCGEPSQRWYHIPRSWLTPTGNLLVVFEEWG 711
Query: 723 GDPTKISFVTK 733
GDP++IS V +
Sbjct: 712 GDPSRISLVER 722
>gi|224077880|ref|XP_002305449.1| predicted protein [Populus trichocarpa]
gi|222848413|gb|EEE85960.1| predicted protein [Populus trichocarpa]
Length = 731
Score = 868 bits (2242), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 426/729 (58%), Positives = 510/729 (69%), Gaps = 20/729 (2%)
Query: 16 VVLATTS--FGANVTYDHRAVVIGGKRRVLISGSIHYPRSTPEMWPDLIQKSKDGGLDVI 73
VVL T+ NVTYD +A++I G+R+VL SGSIHYPRSTPEMW LIQK+KDGGLDVI
Sbjct: 15 VVLLTSLQLIQCNVTYDKKALIINGQRKVLFSGSIHYPRSTPEMWEGLIQKAKDGGLDVI 74
Query: 74 ETYVFWNLHEPVRNQYNFEGRYDLVKFVKLVAEAGLYAHLRIGPYVCAEWNFGGFPLWLH 133
+TYVFWNLHEP YNF+GRYDLV+F+KLV EAGLY HLRIGPY+CAEWNFGGFP+WL
Sbjct: 75 DTYVFWNLHEPSPGNYNFDGRYDLVRFIKLVHEAGLYVHLRIGPYICAEWNFGGFPVWLK 134
Query: 134 FIPGIQFRTDNEPFKAEMQRFTAKIVDMMKQEKLYASQGGPIILSQIENEYGNIDSAYGA 193
++PGI FRTDNEPFK+ MQ+FT KIV MMK E L+ SQGGPIILSQIENEY A+G+
Sbjct: 135 YVPGISFRTDNEPFKSAMQKFTQKIVQMMKDENLFESQGGPIILSQIENEYEPESKAFGS 194
Query: 194 AGKSYIKWAAGMALSLDTGVPWVMCQQSDAPDPIINTCNGFYCDQFTPNSNNKPKMWTEN 253
G +Y+ WAA MA+S+DTGVPWVMC++ DAPDP+INTCNGFYCD F+PN KP MWTE
Sbjct: 195 PGHAYMTWAAHMAISMDTGVPWVMCKEFDAPDPVINTCNGFYCDYFSPNKPYKPTMWTEA 254
Query: 254 WSGWFLSFGGAVPYRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNFDRTSGGPFISTSYD 313
W+GWF FGG RP EDLAFAVARF Q+GG+ NYYMYHGGTNF RTSGGPFI+TSYD
Sbjct: 255 WTGWFTDFGGPNHQRPAEDLAFAVARFIQKGGSLVNYYMYHGGTNFGRTSGGPFITTSYD 314
Query: 314 YDAPLDEYGLIRQPKWGHLKDLHKAIKLCEAALVATDPTYPSLGPNLEATVYKTGSGLCS 373
YDAP+DEYGLIRQPK+GHLK+LHKAIKLCE AL+A D T SLG +A V+ + SG C+
Sbjct: 315 YDAPIDEYGLIRQPKYGHLKELHKAIKLCEKALLAADSTVTSLGSYEQAHVFSSDSGGCA 374
Query: 374 AFLANIGTNSDVTVKFNGNSYLLPAWSVSILPDCKNVVFNTAKINSVTLVPSFSRQSLQV 433
AFL+N T VKFN Y LP WS+SILPDCKNVVFNTA + Q+ QV
Sbjct: 375 AFLSNYNTKQAARVKFNNIQYSLPPWSISILPDCKNVVFNTAHVGV---------QTSQV 425
Query: 434 AADSSDAIGSGWSYINEPV-GISKDDAFTKPGLLEQINTTADQSDYLWYSLSTNIKADEP 492
+D+ W NE + + D T GLLEQ+N T D SDYLWY+ S +I + E
Sbjct: 426 HMLPTDSELLSWETFNEDISSVDDDKMITVAGLLEQLNITRDTSDYLWYTTSVHISSSES 485
Query: 493 LLEDGSKTVLHVQSLGHALHAFINGKLVGSGYGSSSNAKVTVDFPIALAPGKNTFDLLSL 552
L G VL VQS GHALH FING+L GS +G+ + T + GKN LLS+
Sbjct: 486 FLRGGRLPVLTVQSAGHALHVFINGELSGSAHGTREQRRFTFTEDMKFHAGKNRISLLSV 545
Query: 553 TVGLQNYGAFYEKTGAGITGPVQLKGSGNGTNIDLSSQQWTYQTGLKGEELNFPSGSS-- 610
VGL N G +E GI GPV L G G DL+ Q+W+Y+ GLKGE++N S S
Sbjct: 546 AVGLPNNGPRFETWNTGILGPVTLHGLDEGQR-DLTWQKWSYKVGLKGEDMNLRSRKSVS 604
Query: 611 -TQWDSKSTLP-KLQPLVWYKTTFDAPAGSEPVAIDFTGMGKGEAWVNGQSIGRYWPTYV 668
W S + K QPL WYK F++P G +P+A+D MGKG+ W+NG SIGRYW Y
Sbjct: 605 LVDWIQGSLMVGKQQPLTWYKAYFNSPKGDDPLALDMGSMGKGQVWINGHSIGRYWTLYA 664
Query: 669 SQNGGCTDSCNYRGAYSSNKCLKNCGKPSQSLYHVPRSWLKSSGNTLVLFEEIGGDPTKI 728
G C+ C+Y + +C CG+P+Q YHVPRSWLKS+ N LVLFEEIGGD ++I
Sbjct: 665 E--GNCS-GCSYSATFRPARCQLGCGQPTQKWYHVPRSWLKSTRNLLVLFEEIGGDASRI 721
Query: 729 SFVTKQLGS 737
S V + + S
Sbjct: 722 SLVKRLVTS 730
>gi|357450109|ref|XP_003595331.1| Beta-galactosidase [Medicago truncatula]
gi|355484379|gb|AES65582.1| Beta-galactosidase [Medicago truncatula]
Length = 830
Score = 868 bits (2242), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 445/858 (51%), Positives = 579/858 (67%), Gaps = 41/858 (4%)
Query: 1 MASKEILL-LVLCWGFVVLATTSFGANVTYDHRAVVIGGKRRVLISGSIHYPRSTPEMWP 59
MASK + LC+ F+ L T + V++D RA+ I GKRRVLISGSIHYPRSTP+MWP
Sbjct: 1 MASKCFVFPFFLCYIFLALYGT-YAVEVSHDGRAIKIDGKRRVLISGSIHYPRSTPQMWP 59
Query: 60 DLIQKSKDGGLDVIETYVFWNLHEPVRNQYNFEGRYDLVKFVKLVAEAGLYAHLRIGPYV 119
DLI+K+K+GGLD IETYVFWN HEP+R +Y+F G DL++F+K + + GL+A LRIGPYV
Sbjct: 60 DLIKKAKEGGLDAIETYVFWNAHEPIRREYDFSGNNDLIRFLKTIQDEGLFAVLRIGPYV 119
Query: 120 CAEWNFGGFPLWLHFIPGIQFRTDNEPFKAEMQRFTAKIVDMMKQEKLYASQGGPIILSQ 179
CAEWN+GG P+W++ +PG++ RT N+ F EMQ FT IVDM+++EKL+ASQGGPIILSQ
Sbjct: 120 CAEWNYGGIPVWVYNLPGVEIRTANKVFMNEMQNFTTLIVDMVRKEKLFASQGGPIILSQ 179
Query: 180 IENEYGNIDSAYGAAGKSYIKWAAGMALSLDTGVPWVMCQQSDAPDPIINTCNGFYCDQF 239
IENEYGN+ SAYG GK+YI W A MA S + GVPW+MCQQ DAP P+INTCNG+YC F
Sbjct: 180 IENEYGNVMSAYGDEGKAYINWCANMADSFNIGVPWIMCQQPDAPQPMINTCNGWYCHDF 239
Query: 240 TPNSNNKPKMWTENWSGWFLSFGGAVPYRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNF 299
PN+ N PKMWTENW GWF ++GG P+R ED+A++VARFF+ GGTFQNYYMYHGGTNF
Sbjct: 240 EPNNPNSPKMWTENWVGWFKNWGGKDPHRTAEDIAYSVARFFETGGTFQNYYMYHGGTNF 299
Query: 300 DRTSGGPFISTSYDYDAPLDEYGLIRQPKWGHLKDLHKAIKLCEAALVATDPTYPSLGPN 359
RT+GGP+I+TSYDYDAPLDEYG I QPKWGHLK+LH +K E +L + + LG
Sbjct: 300 GRTAGGPYITTSYDYDAPLDEYGNIAQPKWGHLKELHLVLKSMENSLTNGNVSKIDLGSY 359
Query: 360 LEATVYKTGSGLCSAFLANIGTNSDVTVKFNGNSYLLPAWSVSILPDCKNVVFNTAKINS 419
++ATVY T S FL N T +D TV F GN+Y +PAWSVSILPDC+ +NTAK+N
Sbjct: 360 VKATVYATNDS-SSCFLTNTNTTTDATVTFKGNTYNVPAWSVSILPDCQTEEYNTAKVNV 418
Query: 420 VTLVPSFSRQSLQVAADSSDAIGSGWSYINEPVGISKDDAFTKPGLLEQINTTADQSDYL 479
T S + A D +A+ W N + + +K +++Q D SDYL
Sbjct: 419 QT---SIMVKRENKAEDEPEALKWVWRAENVHNSLIGKSSVSKNTIVDQKIAANDSSDYL 475
Query: 480 WYSLSTNIKADEPLLEDGSKTVLHVQSLGHALHAFINGKLVGSGYGSSSNAKVTVDFPIA 539
WY +I +P+ + T+L + GH +HAF+NG+ +GS + + + I
Sbjct: 476 WYMTRLDINQKDPVWTN--NTILRINGTGHVIHAFVNGEHIGSHWATYGIHNDQFETNIK 533
Query: 540 LAPGKNTFDLLSLTVGLQNYGAFYEKTGAGITGPVQLKGSGNGTNI--DLSSQQWTYQTG 597
L G+N LLS+TVGLQNYG Y+K G+ P++L G+ I DLSS +WTY+ G
Sbjct: 534 LKHGRNDISLLSVTVGLQNYGKEYDKWQDGLVSPIELIGTKGDETIIKDLSSHKWTYKVG 593
Query: 598 LKGEELNFPS-----GSSTQWDSKSTLPKLQPLVWYKTTFDAPAGSEPVAIDFTGMGKGE 652
L G E F S SS++W+S + LP + L WYKTTF AP S+P+ +D GMGKG
Sbjct: 594 LHGWENKFFSQDTFFASSSKWES-NELPINKMLTWYKTTFKAPLESDPIVVDLQGMGKGY 652
Query: 653 AWVNGQSIGRYWPTYVSQNGGCTDS-CNYRGAYSSNKCLKNCGKPSQSLYHVPRSWLKSS 711
AWVNG S+GRYWP+Y + GC+D C+YRG Y+ KC+ NCGKPSQ YHVPR +++
Sbjct: 653 AWVNGHSLGRYWPSYNADEDGCSDDPCDYRGEYNDTKCVSNCGKPSQRWYHVPRDFIEDG 712
Query: 712 GNTLVLFEEIGGDPTKISFVTKQLGSSLCSHVTDSHPLPVDMWGSDSKIQRKPGPVLSLE 771
NTLVLFEEIGG+P++I+F T +GS+ C++ ++ L + G
Sbjct: 713 VNTLVLFEEIGGNPSQINFQTVIVGSA-CANAYENKTLELSCHG---------------- 755
Query: 772 CPNPNQVISSIKFASFGTPLGTCGSFSRGRC-SSARSLSVVRQACVGSKSCSIGVSVNTF 830
+ IS IKFASFG P GTCG+F++G C S+ +LS+V++ACVG +SCSI VS TF
Sbjct: 756 -----RSISDIKFASFGNPQGTCGAFTKGSCESNNEALSLVQKACVGKESCSIDVSEKTF 810
Query: 831 GDP-CKGVMKSLAVEASC 847
G C ++K LAVEA C
Sbjct: 811 GATNCGNMVKRLAVEAVC 828
>gi|242093394|ref|XP_002437187.1| hypothetical protein SORBIDRAFT_10g022620 [Sorghum bicolor]
gi|241915410|gb|EER88554.1| hypothetical protein SORBIDRAFT_10g022620 [Sorghum bicolor]
Length = 725
Score = 867 bits (2241), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 420/721 (58%), Positives = 511/721 (70%), Gaps = 20/721 (2%)
Query: 16 VVLATTSFGANVTYDHRAVVIGGKRRVLISGSIHYPRSTPEMWPDLIQKSKDGGLDVIET 75
+++A + A V+YDHRAVVI G+RR+LISGSIHYPRSTPEMWPDL+QK+KDGGLDV++T
Sbjct: 20 IIVAPSPANAAVSYDHRAVVINGQRRILISGSIHYPRSTPEMWPDLLQKAKDGGLDVVQT 79
Query: 76 YVFWNLHEPVRNQYNFEGRYDLVKFVKLVAEAGLYAHLRIGPYVCAEWNFGGFPLWLHFI 135
YVFWN HEP + QY F RYDLV+FVKL +AGL+ HLRIGPYVCAEWNFGGFP+WL ++
Sbjct: 80 YVFWNGHEPQQGQYYFGDRYDLVRFVKLAKQAGLFVHLRIGPYVCAEWNFGGFPVWLKYV 139
Query: 136 PGIQFRTDNEPFKAEMQRFTAKIVDMMKQEKLYASQGGPIILSQIENEYGNIDSAYGAAG 195
PG+ FRTDN PFKA MQ F KIV MMK E L+ QGGPIIL+Q+ENEYG ++S G
Sbjct: 140 PGVSFRTDNAPFKAAMQAFVEKIVSMMKAEGLFEWQGGPIILAQVENEYGPMESVMGGGA 199
Query: 196 KSYIKWAAGMALSLDTGVPWVMCQQSDAPDPIINTCNGFYCDQFTPNSNNKPKMWTENWS 255
K Y WAA MA++ GVPWVMC+Q DAPDP+INTCNGFYCD F+PNSN+KP MWTE W+
Sbjct: 200 KPYANWAAKMAVATGAGVPWVMCKQDDAPDPVINTCNGFYCDYFSPNSNSKPTMWTEAWT 259
Query: 256 GWFLSFGGAVPYRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNFDRTSGGPFISTSYDYD 315
GWF +FGGAVP+RPVED+AFAVARF Q+GG+F NYYMYHGGTNFDRTSGGPFI+TSYDYD
Sbjct: 260 GWFTAFGGAVPHRPVEDMAFAVARFIQKGGSFVNYYMYHGGTNFDRTSGGPFIATSYDYD 319
Query: 316 APLDEYGLIRQPKWGHLKDLHKAIKLCEAALVATDPTYPSLGPNLEATVYKTGSGLCSAF 375
AP+DEYGL+RQPKWGHL+DLHKAIK E ALV+ DPT ++G +A VYK+ SG C+AF
Sbjct: 320 APIDEYGLLRQPKWGHLRDLHKAIKQAEPALVSGDPTIQTIGNYEKAYVYKSSSGACAAF 379
Query: 376 LANIGTNSDVTVKFNGNSYLLPAWSVSILPDCKNVVFNTAKINSVTLVPSFSRQSLQVAA 435
L+N TN+ V FNG Y LPAWS+S+LPDC+ VFNTA ++S PS A
Sbjct: 380 LSNYHTNAAARVVFNGRRYDLPAWSISVLPDCRTAVFNTATVSS----PS-------APA 428
Query: 436 DSSDAIGSGWSYINEPVGISKDDAFTKPGLLEQINTTADQSDYLWYSLSTNIKADEPLLE 495
+ A G W +E D AFTK GL+EQ++ T D+SDYLWY+ NI ++E L+
Sbjct: 429 RMTPAGGFSWQSYSEATNSLDDRAFTKDGLVEQLSMTWDKSDYLWYTTYVNINSNEQFLK 488
Query: 496 DGSKTVLHVQSLGHALHAFINGKLVGSGYGSSSNAKVTVDFPIALAPGKNTFDLLSLTVG 555
G L + S GHAL F+NG+ G+ YG + K+T + + G N +LS VG
Sbjct: 489 SGQWPQLTIYSAGHALQVFVNGQSYGAAYGGYDSPKLTYSGYVKMWQGSNKISILSAAVG 548
Query: 556 LQNYGAFYEKTGAGITGPVQLKGSGNGTNIDLSSQQWTYQTGLKGEELNFPS---GSSTQ 612
L N G YE G+ GPV L G G DLS+Q+WTYQ GL GE L S SS +
Sbjct: 549 LPNQGTHYEAWNVGVLGPVTLSGLNEGKR-DLSNQKWTYQIGLHGESLGVHSVAGSSSVE 607
Query: 613 WDSKSTLPKLQPLVWYKTTFDAPAGSEPVAIDFTGMGKGEAWVNGQSIGRYWPTYVSQNG 672
W S + QPL W+K F+AP+G+ PVA+D + MGKG+AWVNG IGRYW +Y + G
Sbjct: 608 WGSAA---GKQPLTWHKAYFNAPSGNAPVALDMSSMGKGQAWVNGHHIGRYW-SYKATGG 663
Query: 673 GCTDSCNYRGAYSSNKCLKNCGKPSQSLYHVPRSWLKSSGNTLVLFEEIGGDPTKISFVT 732
C C+Y G YS KC CG SQ YHVPRSWL SGN LV+ EE GGD + + VT
Sbjct: 664 SC-GGCSYAGTYSETKCQTGCGDVSQRYYHVPRSWLNPSGNLLVVLEEFGGDLSGVKLVT 722
Query: 733 K 733
+
Sbjct: 723 R 723
>gi|186510990|ref|NP_190852.2| beta-galactosidase 2 [Arabidopsis thaliana]
gi|332278160|sp|Q9LFA6.2|BGAL2_ARATH RecName: Full=Beta-galactosidase 2; Short=Lactase 2; Flags:
Precursor
gi|13605857|gb|AAK32914.1|AF367327_1 AT3g52840/F8J2_10 [Arabidopsis thaliana]
gi|6686876|emb|CAB64738.1| putative beta-galactosidase [Arabidopsis thaliana]
gi|23308221|gb|AAN18080.1| At3g52840/F8J2_10 [Arabidopsis thaliana]
gi|332645478|gb|AEE78999.1| beta-galactosidase 2 [Arabidopsis thaliana]
Length = 727
Score = 867 bits (2239), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 427/738 (57%), Positives = 531/738 (71%), Gaps = 34/738 (4%)
Query: 6 ILLLVLCWGFVVLATTSFGANVTYDHRAVVIGGKRRVLISGSIHYPRSTPEMWPDLIQKS 65
I+L +LC+ ++ +T A VTYDH+A++I G+RR+LISGSIHYPRSTPEMWPDLI+K+
Sbjct: 11 IILAILCFSSLIHSTE---AVVTYDHKALIINGQRRILISGSIHYPRSTPEMWPDLIKKA 67
Query: 66 KDGGLDVIETYVFWNLHEPVRNQYNFEGRYDLVKFVKLVAEAGLYAHLRIGPYVCAEWNF 125
K+GGLDVI+TYVFWN HEP Y F+ RYDLVKF KLV +AGLY LRIGPYVCAEWNF
Sbjct: 68 KEGGLDVIQTYVFWNGHEPSPGNYYFQDRYDLVKFTKLVHQAGLYLDLRIGPYVCAEWNF 127
Query: 126 GGFPLWLHFIPGIQFRTDNEPFKAEMQRFTAKIVDMMKQEKLYASQGGPIILSQIENEYG 185
GGFP+WL ++PG+ FRTDNEPFK MQ+FT KIVDMMK+EKL+ +QGGPIILSQIENEYG
Sbjct: 128 GGFPVWLKYVPGMVFRTDNEPFKIAMQKFTKKIVDMMKEEKLFETQGGPIILSQIENEYG 187
Query: 186 NIDSAYGAAGKSYIKWAAGMALSLDTGVPWVMCQQSDAPDPIINTCNGFYCDQFTPNSNN 245
+ GAAGK+Y KW A MAL L TGVPW+MC+Q DAP PII+TCNGFYC+ F PNS+N
Sbjct: 188 PMQWEMGAAGKAYSKWTAEMALGLSTGVPWIMCKQEDAPYPIIDTCNGFYCEGFKPNSDN 247
Query: 246 KPKMWTENWSGWFLSFGGAVPYRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNFDRTSGG 305
KPK+WTENW+GWF FGGA+P RPVED+AF+VARF Q GG+F NYYMY+GGTNFDRT+ G
Sbjct: 248 KPKLWTENWTGWFTEFGGAIPNRPVEDIAFSVARFIQNGGSFMNYYMYYGGTNFDRTA-G 306
Query: 306 PFISTSYDYDAPLDEYGLIRQPKWGHLKDLHKAIKLCEAALVATDPTYPSLGPNLEATVY 365
FI+TSYDYDAP+DEYGL+R+PK+ HLK+LHK IKLCE ALV+ DPT SLG E V+
Sbjct: 307 VFIATSYDYDAPIDEYGLLREPKYSHLKELHKVIKLCEPALVSVDPTITSLGDKQEIHVF 366
Query: 366 KTGSGLCSAFLANIGTNSDVTVKFNGNSYLLPAWSVSILPDCKNVVFNTAKINSVT---- 421
K+ + C+AFL+N T+S V F G Y LP WSVSILPDCK +NTAKI + T
Sbjct: 367 KSKTS-CAAFLSNYDTSSAARVMFRGFPYDLPPWSVSILPDCKTEYYNTAKIRAPTILMK 425
Query: 422 LVPS---FSRQSLQVAADSSDAIGSGWSYINEPVGISKDDAFTKPGLLEQINTTADQSDY 478
++P+ FS +S + SS+ G+ F K GL+EQI+ T D++DY
Sbjct: 426 MIPTSTKFSWESYNEGSPSSNEAGT----------------FVKDGLVEQISMTRDKTDY 469
Query: 479 LWYSLSTNIKADEPLLEDGSKTVLHVQSLGHALHAFINGKLVGSGYGSSSNAKVTVDFPI 538
WY I +DE L+ G +L + S GHALH F+NG L G+ YG+ SN+K+T I
Sbjct: 470 FWYFTDITIGSDESFLKTGDNPLLTIFSAGHALHVFVNGLLAGTSYGALSNSKLTFSQNI 529
Query: 539 ALAPGKNTFDLLSLTVGLQNYGAFYEKTGAGITGPVQLKGSGNGTNIDLSSQQWTYQTGL 598
L+ G N LLS VGL N G YE GI GPV LKG +GT D+S +W+Y+ GL
Sbjct: 530 KLSVGINKLALLSTAVGLPNAGVHYETWNTGILGPVTLKGVNSGT-WDMSKWKWSYKIGL 588
Query: 599 KGEELNFPS---GSSTQWDSKSTLPKLQPLVWYKTTFDAPAGSEPVAIDFTGMGKGEAWV 655
+GE ++ + S+ +W K + K QPL WYK++FD P G+EP+A+D MGKG+ WV
Sbjct: 589 RGEAMSLHTLAGSSAVKWWIKGFVVKKQPLTWYKSSFDTPRGNEPLALDMNTMGKGQVWV 648
Query: 656 NGQSIGRYWPTYVSQNGGCTDSCNYRGAYSSNKCLKNCGKPSQSLYHVPRSWLKSSGNTL 715
NG +IGR+WP Y ++ G C CNY G Y+ KCL +CG+PSQ YHVPRSWLK GN L
Sbjct: 649 NGHNIGRHWPAYTAR-GNC-GRCNYAGIYNEKKCLSHCGEPSQRWYHVPRSWLKPFGNLL 706
Query: 716 VLFEEIGGDPTKISFVTK 733
V+FEE GGDP+ IS V +
Sbjct: 707 VIFEEWGGDPSGISLVKR 724
>gi|18148449|dbj|BAB83260.1| beta-D-galactosidase [Persea americana]
Length = 766
Score = 866 bits (2237), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 420/743 (56%), Positives = 535/743 (72%), Gaps = 22/743 (2%)
Query: 26 NVTYDHRAVVIGGKRRVLISGSIHYPRSTPEMWPDLIQKSKDGGLDVIETYVFWNLHEPV 85
+VTYD +A+VI G+RR+LISGSIHYPRSTPEMWPDLIQK+K+GGLDVI+TYVFW+ HEP
Sbjct: 36 SVTYDRKAIVINGQRRILISGSIHYPRSTPEMWPDLIQKAKEGGLDVIQTYVFWDGHEPS 95
Query: 86 RNQYNFEGRYDLVKFVKLVAEAGLYAHLRIGPYVCAEWNFGGFPLWLHFIPGIQFRTDNE 145
+Y FEGRYDLVKF+KLV +AGLY +LRIGPY+CAEWN GGFP+WL +IPGI FRTDNE
Sbjct: 96 PGKYYFEGRYDLVKFIKLVKQAGLYVNLRIGPYICAEWNLGGFPVWLKYIPGISFRTDNE 155
Query: 146 PFKAEMQRFTAKIVDMMKQEKLYASQGGPIILSQIENEYGNIDSAYGAAGKSYIKWAAGM 205
PFK M FT KIV+MMK E L+ QGGPII+SQIENEYG ++ GA GK Y +WAA M
Sbjct: 156 PFKRYMAGFTKKIVEMMKAESLFEPQGGPIIMSQIENEYGPVEWEIGAIGKVYTRWAASM 215
Query: 206 ALSLDTGVPWVMCQQSDAPDPIINTCNGFYCDQFTPNSNNKPKMWTENWSGWFLSFGGAV 265
A++L+TGVPW+MC+Q + PDPIINTCNGFYCD F PN + KP MWTE W+GWF +FGG V
Sbjct: 216 AVNLNTGVPWIMCKQDEVPDPIINTCNGFYCDWFKPNKDYKPIMWTELWTGWFTAFGGPV 275
Query: 266 PYRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNFDRTSGGPFISTSYDYDAPLDEYGLIR 325
PYRPVED+A+AV +F Q+GG+F NYYMYHGGTNF RT+GGPFI+TSYDYDAPLDEYGL R
Sbjct: 276 PYRPVEDVAYAVVKFIQKGGSFINYYMYHGGTNFGRTAGGPFIATSYDYDAPLDEYGLKR 335
Query: 326 QPKWGHLKDLHKAIKLCEAALVATDPTYPSLGPNLEATVYKTGSGLCSAFLANIGTNSDV 385
+PKWGHL+DLH+AIK+CE ALV+ DPT +G + EA V+K SG CSAFL N + V
Sbjct: 336 EPKWGHLRDLHRAIKMCEPALVSNDPTVTKIGDSQEAHVFKFESGACSAFLENKDETNFV 395
Query: 386 TVKFNGNSYLLPAWSVSILPDCKNVVFNTAKINSVTLVPSFSRQSLQVAADSSDAIGSGW 445
V F G Y LP WS+SILPDC NVV+NT ++ + T S ++ A+++ + W
Sbjct: 396 KVTFQGMQYELPPWSISILPDCVNVVYNTGRVGTQT-----SMMTMLSASNNEFS----W 446
Query: 446 SYINEPVGISKDDAFTKPGLLEQINTTADQSDYLWYSLSTNIKADEPLLEDGSKTVLHVQ 505
+ NE +++ T GL EQI+ T D +DYL Y+ I +E L++G VL V
Sbjct: 447 ASYNEDTASYNEESMTIEGLSEQISITKDSTDYLRYTTDVTIGQNEGFLKNGEYPVLTVN 506
Query: 506 SLGHALHAFINGKLVGSGYGSSSNAKVTVDFPIALAPGKNTFDLLSLTVGLQNYGAFYEK 565
S GHAL F+NG+L G+ YGS ++ ++T + L G N LLS VGL N G +E
Sbjct: 507 SAGHALQVFVNGQLSGTAYGSVNDPRLTFSGKVKLWAGNNKISLLSSAVGLPNVGTHFET 566
Query: 566 TGAGITGPVQLKGSGNGTNIDLSSQQWTYQTGLKGE--ELNFPSGSST-QWDSKSTLPKL 622
G+ GPV L G G DLS Q+W+Y+ G+ GE +L+ P+GSS+ +W S ++ K+
Sbjct: 567 WNYGVLGPVTLNGLNEGKR-DLSLQKWSYKVGVIGEALQLHSPTGSSSVEWGSSTS--KI 623
Query: 623 QPLVWYKTTFDAPAGSEPVAIDFTGMGKGEAWVNGQSIGRYWPTYVSQNGGCTDSCNYRG 682
QP WYKTTF+AP G++P+A+D MGKG+ W+NGQSIGRYWP Y + NG C+ +C+Y G
Sbjct: 624 QPFTWYKTTFNAPGGNDPLALDMNTMGKGQIWINGQSIGRYWPAYKA-NGKCS-ACHYTG 681
Query: 683 AYSSNKCLKNCGKPSQSLYHVPRSWLKSSGNTLVLFEEIGGDPTKISFVTKQLGSSLCSH 742
Y KC NCG+ SQ YH+PRSWL +GN LV+FEE GGDPT I+ V + +GS+ C++
Sbjct: 682 WYDEKKCGFNCGEASQRWYHIPRSWLNPTGNLLVVFEEWGGDPTGITLVRRTIGSA-CAY 740
Query: 743 VTDSHPL----PVDMWGSDSKIQ 761
+ + HP ++ WG K Q
Sbjct: 741 INEWHPTVKNWKIENWGKAEKWQ 763
>gi|356545784|ref|XP_003541315.1| PREDICTED: beta-galactosidase-like [Glycine max]
Length = 826
Score = 865 bits (2236), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 438/853 (51%), Positives = 580/853 (67%), Gaps = 40/853 (4%)
Query: 6 ILLLVLCWGFVVLA-TTSFGANVTYDHRAVVIGGKRRVLISGSIHYPRSTPEMWPDLIQK 64
L L + + FV+L+ S V++D RA++I GKRRVL+SGSIHYPRSTPEMWP+LIQK
Sbjct: 3 FLSLSVWFCFVILSFIGSNAVEVSHDGRAIIIDGKRRVLLSGSIHYPRSTPEMWPELIQK 62
Query: 65 SKDGGLDVIETYVFWNLHEPVRNQYNFEGRYDLVKFVKLVAEAGLYAHLRIGPYVCAEWN 124
+K+GGLD IETYVFWN HEP R Y+F G D+++F+K + E+GLY LRIGPYVCAEWN
Sbjct: 63 AKEGGLDAIETYVFWNAHEPSRRVYDFSGNNDIIRFLKTIQESGLYGVLRIGPYVCAEWN 122
Query: 125 FGGFPLWLHFIPGIQFRTDNEPFKAEMQRFTAKIVDMMKQEKLYASQGGPIILSQIENEY 184
+GG P+W+H +P ++ RT N + EMQ FT IVDM+K+EKL+ASQGGPIIL+QIENEY
Sbjct: 123 YGGIPVWVHNLPDVEIRTANSVYMNEMQNFTTLIVDMVKKEKLFASQGGPIILTQIENEY 182
Query: 185 GNIDSAYGAAGKSYIKWAAGMALSLDTGVPWVMCQQSDAPDPIINTCNGFYCDQFTPNSN 244
GN+ S YG AGK+Y+ W A MA SL+ GVPW+MCQ+SDAP +INTCNGFYCD F PN+
Sbjct: 183 GNVISHYGDAGKAYMNWCANMAESLNVGVPWIMCQESDAPQSMINTCNGFYCDNFEPNNP 242
Query: 245 NKPKMWTENWSGWFLSFGGAVPYRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNFDRTSG 304
+ PKMWTENW GWF ++GG P+R ED+AFAVARFFQ GGTFQNYYMYHGGTNFDRT+G
Sbjct: 243 SSPKMWTENWVGWFKNWGGRDPHRTAEDVAFAVARFFQTGGTFQNYYMYHGGTNFDRTAG 302
Query: 305 GPFISTSYDYDAPLDEYGLIRQPKWGHLKDLHKAIKLCEAALVATDPTYPSLGPNLEATV 364
GP+I+TSYDYDAPLDEYG I QPKWGHLK+LH +K E L + + + G +++AT+
Sbjct: 303 GPYITTSYDYDAPLDEYGNIAQPKWGHLKELHNVLKSMEETLTSGNVSETDFGNSVKATI 362
Query: 365 YKTGSGLCSAFLANIGTNSDVTVKFNGNSYLLPAWSVSILPDCKNVVFNTAKINSVTLVP 424
Y T +G S FL++ T +D T+ F G +Y +PAWSVSILPDC++ +NTAK+N T
Sbjct: 363 YAT-NGSSSCFLSSTNTTTDATLTFRGKNYTVPAWSVSILPDCEHEEYNTAKVNVQT--- 418
Query: 425 SFSRQSLQVAADSSDAIGSGWSYINEPVGISKDDAFTKPGLLEQINTTADQSDYLWYSLS 484
S + A + + A+ W N + + LL+Q + D SDYLWY
Sbjct: 419 SVMVKENSKAEEEATALKWVWRSENIDNALHGKSNVSANRLLDQKDAANDASDYLWYMTK 478
Query: 485 TNIKADEPLLEDGSKTVLHVQSLGHALHAFINGKLVGSGYGSSSNAKVTVDFPIALAPGK 544
++K D+P+ G L + S GH +HAF+NG+ +GS + + + I L G
Sbjct: 479 LHVKHDDPVW--GENMTLRINSSGHVIHAFVNGEHIGSHWATYGIHNDKFEPKIKLKHGT 536
Query: 545 NTFDLLSLTVGLQNYGAFYEKTGAGITGPVQLKG-SGNGTNI-DLSSQQWTYQTGLKGEE 602
NT LLS+TVGLQNYGAF++ AG+ P++L G+ T I +LSS +W+Y+ GL G +
Sbjct: 537 NTISLLSVTVGLQNYGAFFDTWHAGLVEPIELVSVKGDETIIKNLSSNKWSYKVGLHGWD 596
Query: 603 LNF-----PSGSSTQWDSKSTLPKLQPLVWYKTTFDAPAGSEPVAIDFTGMGKGEAWVNG 657
P + +W+S+ LP + L WYKTTF+AP G++PV +D GMGKG AWVNG
Sbjct: 597 HKLFSDDSPFAAPNKWESEK-LPTDRMLTWYKTTFNAPLGTDPVVVDLQGMGKGYAWVNG 655
Query: 658 QSIGRYWPTYVSQNGGCTDS-CNYRGAYSSNKCLKNCGKPSQSLYHVPRSWLKSSGNTLV 716
Q+IGR WP+Y ++ GC+D C+YRG Y+ +KC+ NCGKP+Q YHVPRS+LK N LV
Sbjct: 656 QNIGRIWPSYNAEEDGCSDEPCDYRGEYTDSKCVTNCGKPTQRWYHVPRSYLKDGANNLV 715
Query: 717 LFEEIGGDPTKISFVTKQLGSSLCSHVTDSHPLPVDMWGSDSKIQRKPGPVLSLECPNPN 776
LF E+GG+P++++F T +G ++C++ ++ L + G RK
Sbjct: 716 LFAELGGNPSQVNFQTVVVG-TVCANAYENKTLELSCQG------RK------------- 755
Query: 777 QVISSIKFASFGTPLGTCGSFSRGRCSS-ARSLSVVRQACVGSKSCSIGVSVNTFG-DPC 834
IS+IKFASFG P G CG+F+ G C S + +LS+V++ACVG ++CS VS TFG C
Sbjct: 756 --ISAIKFASFGDPEGVCGAFTNGSCESKSNALSIVQKACVGKQACSFDVSEKTFGPTAC 813
Query: 835 KGVMKSLAVEASC 847
V K LAVEA C
Sbjct: 814 GNVAKRLAVEAVC 826
>gi|14970843|emb|CAC44502.1| beta-galactosidase [Fragaria x ananassa]
Length = 722
Score = 865 bits (2234), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 423/739 (57%), Positives = 519/739 (70%), Gaps = 32/739 (4%)
Query: 3 SKEILLLVLCWGFVVLATTSFGANVTYDHRAVVIGGKRRVLISGSIHYPRSTPEMWPDLI 62
S+ + L+ F+V +S A+V YDHRA+++ GKRR+LISGSIHYPRSTPEMWPDL+
Sbjct: 7 SRNMFFLL----FLVSWLSSALASVGYDHRAIIVNGKRRILISGSIHYPRSTPEMWPDLL 62
Query: 63 QKSKDGGLDVIETYVFWNLHEPVRNQYNFEGRYDLVKFVKLVAEAGLYAHLRIGPYVCAE 122
QK+KDGGLDV++TYVFWN HEP +Y FE RYDLVKF+KL + GLY HLRIGPY+CAE
Sbjct: 63 QKAKDGGLDVLQTYVFWNGHEPSPGKYYFEDRYDLVKFIKLAQQHGLYVHLRIGPYICAE 122
Query: 123 WNFGGFPLWLHFIPGIQFRTDNEPFKAEMQRFTAKIVDMMKQEKLYASQGGPIILSQIEN 182
WNFGGFP+WL ++PGI FRTDN PF A M++FT KIV MMK E+L+ +QGGPIILSQIEN
Sbjct: 123 WNFGGFPVWLKYVPGIAFRTDNRPFMAAMEKFTQKIVYMMKAERLFQTQGGPIILSQIEN 182
Query: 183 EYGNIDSAYGAAGKSYIKWAAGMALSLDTGVPWVMCQQSDAPDPIINTCNGFYCDQFTPN 242
EYG ++ GA GKSY +WAA MA+ L+TGVPWVMC+Q DAPDPII+TCNGFYC+ FTPN
Sbjct: 183 EYGPVEWEIGAPGKSYTQWAAKMAVGLNTGVPWVMCKQEDAPDPIIDTCNGFYCENFTPN 242
Query: 243 SNNKPKMWTENWSGWFLSFGGAVPYRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNFDRT 302
N KPKMWTE W+GW+ FGGAVP RP +DLAF+VARF Q GG+F NYYMYHGGTNF RT
Sbjct: 243 KNYKPKMWTEIWTGWYTEFGGAVPTRPAQDLAFSVARFIQNGGSFANYYMYHGGTNFGRT 302
Query: 303 SGGPFISTSYDYDAPLDEYGLIRQPKWGHLKDLHKAIKLCEAALVATDPTYPSLGPNLEA 362
+GGPFI+TSYDYDAPLDEYGL R+PK+ HLK +HKAIK+ E AL+ATD LG N EA
Sbjct: 303 AGGPFIATSYDYDAPLDEYGLPREPKYSHLKYMHKAIKMAEPALLATDAAVSKLGNNQEA 362
Query: 363 TVYKTGSGLCSAFLANIGTNSDVTVKFNGNSYLLPAWSVSILPDCKNVVFNTAKI----- 417
VY++ SG C+AFLAN T V V F Y LP WS+SILPDCK VFNTA++
Sbjct: 363 HVYQSRSG-CAAFLANYDTKYPVRVTFWNKQYNLPPWSISILPDCKTEVFNTARVGQSPP 421
Query: 418 NSVTLVPSFSRQSLQVAADSSDAIGSGWSYINEPVGISKDDAFTKPGLLEQINTTADQSD 477
+T V S Q +YI + + D+AFT GL EQI+ T D +D
Sbjct: 422 TKMTPVAHLSWQ----------------AYIEDVATSADDNAFTSVGLREQISLTWDNTD 465
Query: 478 YLWYSLSTNIKADEPLLEDGSKTVLHVQSLGHALHAFINGKLVGSGYGSSSNAKVTVDFP 537
YLWY I +E L G L V S GHALH FING+L GS YG+ + K+ +
Sbjct: 466 YLWYMTDITIGPNEQFLRTGKYPTLKVDSAGHALHVFINGQLSGSAYGTLAFPKLEFNQG 525
Query: 538 IALAPGKNTFDLLSLTVGLQNYGAFYEKTGAGITGPVQLKGSGNGTNIDLSSQQWTYQTG 597
+ L G N LLS++VGL N G +E G+ GPV L G +GT D++ QWTY+ G
Sbjct: 526 VKLRAGINKLALLSVSVGLANVGLHFETWNTGVLGPVTLAGVNSGT-WDMTRWQWTYKIG 584
Query: 598 LKGEELNFPS---GSSTQWDSKSTLPKLQPLVWYKTTFDAPAGSEPVAIDFTGMGKGEAW 654
++GE+++ + SS +W S L + +PL WYK +AP G+ P+A+D MGKG+ W
Sbjct: 585 MRGEDMSLHTVSGSSSVEWVQGSLLAQYRPLTWYKAILNAPPGNAPLALDMGSMGKGQMW 644
Query: 655 VNGQSIGRYWPTYVSQNGGCTDSCNYRGAYSSNKCLKNCGKPSQSLYHVPRSWLKSSGNT 714
+NGQSIGR+WP Y + G +C Y G Y+ NKC NCG+PSQ YHVPRSWLKSSGN
Sbjct: 645 INGQSIGRHWPAYKAH--GSCGACYYAGTYTENKCRTNCGQPSQRWYHVPRSWLKSSGNL 702
Query: 715 LVLFEEIGGDPTKISFVTK 733
LV+FEE GGDPTKIS V +
Sbjct: 703 LVVFEEWGGDPTKISLVAR 721
>gi|326497687|dbj|BAK05933.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 716
Score = 865 bits (2234), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 417/706 (59%), Positives = 500/706 (70%), Gaps = 15/706 (2%)
Query: 28 TYDHRAVVIGGKRRVLISGSIHYPRSTPEMWPDLIQKSKDGGLDVIETYVFWNLHEPVRN 87
+YDHRAVVI G+RR+L+SGSIHYPRSTPEMWPDLIQK+KDGGLDVI+TYVFWN HEP R
Sbjct: 24 SYDHRAVVINGQRRILMSGSIHYPRSTPEMWPDLIQKAKDGGLDVIQTYVFWNGHEPARG 83
Query: 88 QYNFEGRYDLVKFVKLVAEAGLYAHLRIGPYVCAEWNFGGFPLWLHFIPGIQFRTDNEPF 147
QY+F RYDLV+FVKL +AGLY HLRIGPYVCAEWNFGGFP+WL ++PGI FRTDN PF
Sbjct: 84 QYHFADRYDLVRFVKLARQAGLYVHLRIGPYVCAEWNFGGFPVWLKYVPGISFRTDNGPF 143
Query: 148 KAEMQRFTAKIVDMMKQEKLYASQGGPIILSQIENEYGNIDSAYGAAGKSYIKWAAGMAL 207
KAEMQRF KIV MMK E L+ QGGPIIL+Q+ENEYG ++SA GA K Y WAA MA+
Sbjct: 144 KAEMQRFVEKIVSMMKSEGLFEWQGGPIILAQVENEYGPMESAMGAGAKPYANWAANMAV 203
Query: 208 SLDTGVPWVMCQQSDAPDPIINTCNGFYCDQFTPNSNNKPKMWTENWSGWFLSFGGAVPY 267
+ D GVPWVMC+Q DAPDP+INTCNGFYCD FTPNSN+KP MWTE W+GWF +FGG VP+
Sbjct: 204 ATDAGVPWVMCKQDDAPDPVINTCNGFYCDYFTPNSNSKPTMWTEAWTGWFTAFGGPVPH 263
Query: 268 RPVEDLAFAVARFFQRGGTFQNYYMYHGGTNFDRTSGGPFISTSYDYDAPLDEYGLIRQP 327
RPVED+AFAVARF Q+GG+F NYYMYHGGTNFDRT+GGPFI+TSYDYDAP+DEYGLIRQP
Sbjct: 264 RPVEDMAFAVARFIQKGGSFVNYYMYHGGTNFDRTAGGPFIATSYDYDAPIDEYGLIRQP 323
Query: 328 KWGHLKDLHKAIKLCEAALVATDPTYPSLGPNLEATVYKTGSGLCSAFLANIGTNSDVTV 387
KWGHL+DLHKAIK E ALV+ DPT +G +A V+K+ +G C+AFL+N T+S +
Sbjct: 324 KWGHLRDLHKAIKQAEPALVSGDPTIQRIGNYEKAYVFKSSTGACAAFLSNYHTSSAARI 383
Query: 388 KFNGNSYLLPAWSVSILPDCKNVVFNTAKINSVTLVPSFSRQSLQVAADSSDAIGSGWSY 447
+NG Y LPAWS+SILPDCK VFNTA + T A + A G W
Sbjct: 384 VYNGRRYDLPAWSISILPDCKTAVFNTATVKEPT-----------APAKMNPAGGFAWQS 432
Query: 448 INEPVGISKDDAFTKPGLLEQINTTADQSDYLWYSLSTNIKADEPLLEDGSKTVLHVQSL 507
+E AFTK GL+EQ++ T D+SDYLWY+ NI + E L+ G L + S
Sbjct: 433 YSEDTNALDSSAFTKDGLVEQLSMTWDKSDYLWYTTYVNIDSSEQFLKTGQWPQLTINSA 492
Query: 508 GHALHAFINGKLVGSGYGSSSNAKVTVDFPIALAPGKNTFDLLSLTVGLQNYGAFYEKTG 567
GH++ F+NG+ G YG ++ K+T P+ + G N +LS +GL N G YE
Sbjct: 493 GHSVQVFVNGQSFGVAYGGYNSPKLTYSKPVKMWQGSNKISILSSAMGLPNQGTHYEAWN 552
Query: 568 AGITGPVQLKGSGNGTNIDLSSQQWTYQTGLKGEELNFPSGSSTQWDSKSTLPKLQPLVW 627
G+ GPV L G G DLS+Q+WTYQ GLKGE L S S + S+ QPL W
Sbjct: 553 VGVLGPVTLSGLNQGKR-DLSNQKWTYQIGLKGESLGVNSISGSSSVEWSSASGAQPLTW 611
Query: 628 YKTTFDAPAGSEPVAIDFTGMGKGEAWVNGQSIGRYWPTYVSQNGGCTDSCNYRGAYSSN 687
+K F APAGS PVA+D MGKG+ WVNG + GRYW S G C+Y G +S
Sbjct: 612 HKAYFAAPAGSAPVALDMGSMGKGQIWVNGNNAGRYWSYRAS---GSCGGCSYAGTFSEA 668
Query: 688 KCLKNCGKPSQSLYHVPRSWLKSSGNTLVLFEEIGGDPTKISFVTK 733
KC NCG SQ YHVPRSWLK SGN LV+ EE GGD + ++ +T+
Sbjct: 669 KCQTNCGDISQRWYHVPRSWLKPSGNLLVVLEEFGGDLSGVTLMTR 714
>gi|255550411|ref|XP_002516256.1| beta-galactosidase, putative [Ricinus communis]
gi|223544742|gb|EEF46258.1| beta-galactosidase, putative [Ricinus communis]
Length = 848
Score = 864 bits (2232), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 441/859 (51%), Positives = 562/859 (65%), Gaps = 50/859 (5%)
Query: 2 ASKEILLLVLCWGFVVLATTSFGANVTYDHRAVVIGGKRRVLISGSIHYPRSTPEMWPDL 61
+SK ++ + C V AT V++D RA+ I GKRRVLISGSIHYPRST EMWPDL
Sbjct: 27 SSKSVVAIFFCLFTFVSATI-----VSHDGRAITIDGKRRVLISGSIHYPRSTAEMWPDL 81
Query: 62 IQKSKDGGLDVIETYVFWNLHEPVRNQYNFEGRYDLVKFVKLVAEAGLYAHLRIGPYVCA 121
I+KSK+GGLD IETYVFWN HEP R QY+F G DLV+F+K + GLYA LRIGPYVCA
Sbjct: 82 IKKSKEGGLDAIETYVFWNSHEPSRRQYDFSGNLDLVRFIKTIQAEGLYAVLRIGPYVCA 141
Query: 122 EWNFGGFPLWLHFIPGIQFRTDNEPFKAEMQRFTAKIVDMMKQEKLYASQGGPIILSQIE 181
EWN+GGFP+WLH +PG + RT N F EMQ FT+ IVDMMK E L+ASQGGPIIL+Q+E
Sbjct: 142 EWNYGGFPMWLHNLPGCELRTANSVFMNEMQNFTSLIVDMMKDENLFASQGGPIILAQVE 201
Query: 182 NEYGNIDSAYGAAGKSYIKWAAGMALSLDTGVPWVMCQQSDAPDPIINTCNGFYCDQFTP 241
NEYGN+ SAYGAAGK+YI W + MA SLD GVPW+MCQQSDAP P+INTCNG+YCDQFTP
Sbjct: 202 NEYGNVMSAYGAAGKTYIDWCSNMAESLDIGVPWIMCQQSDAPQPMINTCNGWYCDQFTP 261
Query: 242 NSNNKPKMWTENWSGWFLSFGGAVPYRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNFDR 301
N+ N PKMWTENW+GWF S+GG P+R ED+AFAVARFFQ GGTFQNYYMYHGGTNF R
Sbjct: 262 NNANSPKMWTENWTGWFKSWGGKDPHRTAEDVAFAVARFFQTGGTFQNYYMYHGGTNFGR 321
Query: 302 TSGGPFISTSYDYDAPLDEYGLIRQPKWGHLKDLHKAIKLCEAALVATDPTYPSLGPNLE 361
T+GGP+I+TSYDYDAPLDEYG + QPKWGHLK LH + E L + + ++
Sbjct: 322 TAGGPYITTSYDYDAPLDEYGNLNQPKWGHLKQLHDILHSMEYTLTHGNISTIDYDNSVT 381
Query: 362 ATVYKTGSGLCSAFLANIGTNSDVTVKFNGNSYLLPAWSVSILPDCKNVVFNTAKINSVT 421
AT+Y T + F N SD T+ F G Y +PAWSVSILPDC+NV +NTAK+ + T
Sbjct: 382 ATIYATDKE-SACFFGNANETSDATIVFKGTEYNVPAWSVSILPDCENVGYNTAKVKTQT 440
Query: 422 LVPSFSRQSLQVAADSSDAIGSGWSYINEPVG----ISKDDAFTKPGLLEQINTTADQSD 477
+ + A D ++ WS+I E + K A + L++Q D SD
Sbjct: 441 AIMVKQKNE---AEDQPSSL--KWSWIPENTHTTSLLGKGHAHARQ-LIDQKAAANDASD 494
Query: 478 YLWYSLSTNIKADEPLLEDGSKTVLHVQSLGHALHAFINGKLVGSGYGSSSNAKVTVDFP 537
YLWY S +IK D+P+ S L V GH LHA++NGK +GS + +
Sbjct: 495 YLWYMTSLHIKKDDPVWS--SDMSLRVNGSGHVLHAYVNGKHLGSQFAKYGVFSYVFEKS 552
Query: 538 IALAPGKNTFDLLSLTVGLQNYGAFYEKTGAGITGPVQLKGSGNGTNI--DLSSQQWTYQ 595
+ L PGKN LLS TVGLQNYG ++ GI GPV++ G + DLSS +W+Y
Sbjct: 553 LKLRPGKNVISLLSATVGLQNYGPMFDLVQTGIPGPVEIIGHRGDEKVVKDLSSHKWSYS 612
Query: 596 TGLKG---EELNFPSGSSTQWDSKSTLPKLQPLVWYKTTFDAPAGSEPVAIDFTGMGKGE 652
GL G E + S +++W + LP + ++WYKTTF AP G +PV +D GMGKG
Sbjct: 613 VGLNGFHNELYSSNSRHASRW-VEQDLPTNKMMIWYKTTFKAPLGKDPVVLDLQGMGKGF 671
Query: 653 AWVNGQSIGRYWPTYVSQNGGC-TDSCNYRGAYSSNKCLKNCGKPSQSLYHVPRSWLKSS 711
AWVNG +IGRYWP+++++ GC T+ C+YRGAY +NKC+ NCGKP+Q YHVPRS+
Sbjct: 672 AWVNGNNIGRYWPSFLAEEDGCSTEVCDYRGAYDNNKCVTNCGKPTQRWYHVPRSFFNDY 731
Query: 712 GNTLVLFEEIGGDPTKISFVTKQLGSSLCSHVTDSHPLPVDMWGSDSKIQRKPGPVLSLE 771
NTLVLFEE GG+P ++F T +G K+ G ++E
Sbjct: 732 ENTLVLFEEFGGNPAGVNFQTVTVG----------------------KVSGSAGEGETIE 769
Query: 772 CPNPNQVISSIKFASFGTPLGTCGSFSRGRCS-SARSLSVVRQACVGSKSCSIGVSVNTF 830
+ IS+I+FASFG P GT G++ +G C S + S+V++ACVG ++C + S + F
Sbjct: 770 LSCNGKSISAIEFASFGDPQGTSGAYVKGTCEGSNDAFSIVQKACVGKETCKLEASKDVF 829
Query: 831 GDPCKG--VMKSLAVEASC 847
G G V+ +LAV+A+C
Sbjct: 830 GPTSCGSDVVNTLAVQATC 848
>gi|357139090|ref|XP_003571118.1| PREDICTED: beta-galactosidase 4-like [Brachypodium distachyon]
Length = 787
Score = 864 bits (2232), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 420/712 (58%), Positives = 510/712 (71%), Gaps = 21/712 (2%)
Query: 25 ANVTYDHRAVVIGGKRRVLISGSIHYPRSTPEMWPDLIQKSKDGGLDVIETYVFWNLHEP 84
A V+YDHR++VI G+RR+LISGSIHYPRSTPEMWP LIQK+KDGGLDV++TYVFWN HEP
Sbjct: 92 AAVSYDHRSLVINGRRRILISGSIHYPRSTPEMWPGLIQKAKDGGLDVVQTYVFWNGHEP 151
Query: 85 VRNQYNFEGRYDLVKFVKLVAEAGLYAHLRIGPYVCAEWNFGGFPLWLHFIPGIQFRTDN 144
V+ QY F RYDL++FVKLV +AGLY HLRIGPYVCAEWNFGGFP+WL ++PGI FRTDN
Sbjct: 152 VKGQYYFSDRYDLIRFVKLVKQAGLYVHLRIGPYVCAEWNFGGFPVWLKYVPGISFRTDN 211
Query: 145 EPFKAEMQRFTAKIVDMMKQEKLYASQGGPIILSQIENEYGNIDSAYGAAGKSYIKWAAG 204
PFKAEMQRF KIV MMK E+L+ QGGPII+SQ+ENE+G ++SA G K Y WAA
Sbjct: 212 GPFKAEMQRFVEKIVSMMKSERLFEWQGGPIIMSQVENEFGPMESAGGVGAKPYANWAAK 271
Query: 205 MALSLDTGVPWVMCQQSDAPDPIINTCNGFYCDQFTPNSNNKPKMWTENWSGWFLSFGGA 264
MA++ +TGVPWVMC+Q DAPDP+INTCNGFYCD FTPN NKP MWTE W+GWF SFGGA
Sbjct: 272 MAVATNTGVPWVMCKQEDAPDPVINTCNGFYCDYFTPNKKNKPAMWTEAWTGWFTSFGGA 331
Query: 265 VPYRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNFDRTSGGPFISTSYDYDAPLDEYGLI 324
VP+RPVED+AFAVARF Q+GG+F NYYMYHGGTNF RT+GGPF++TSYDYDAP+DE+GL+
Sbjct: 332 VPHRPVEDMAFAVARFIQKGGSFVNYYMYHGGTNFGRTAGGPFVATSYDYDAPIDEFGLL 391
Query: 325 RQPKWGHLKDLHKAIKLCEAALVATDPTYPSLGPNLEATVYKTGSGLCSAFLANIGTNSD 384
RQPKWGHL+DLHKAIK E LV+ DPT SLG +A V+K+ +G C+AFL+N NS
Sbjct: 392 RQPKWGHLRDLHKAIKQAEPTLVSGDPTIQSLGNYEKAYVFKSKNGACAAFLSNYHMNSA 451
Query: 385 VTVKFNGNSYLLPAWSVSILPDCKNVVFNTAKINSVTLVPSFSRQSLQVAADSSDAIGSG 444
V V+FNG Y LPAWS+SILPDCK VVFNTA + TL+P +
Sbjct: 452 VKVRFNGRHYDLPAWSISILPDCKTVVFNTATVKEPTLLPKM-----------HPVVRFT 500
Query: 445 WSYINEPVGISKDDAFTKPGLLEQINTTADQSDYLWYSLSTNIKADEPLLEDGSKTVLHV 504
W +E D AFTK GL+EQ++ T D+SDYLWY+ NI E L ++G L V
Sbjct: 501 WQSYSEDTNSLDDSAFTKDGLVEQLSMTWDKSDYLWYTTFVNIGPGE-LSKNGQWPQLTV 559
Query: 505 QSLGHALHAFINGKLVGSGYGSSSNAKVTVDFPIALAPGKNTFDLLSLTVGLQNYGAFYE 564
S GH++ F+NGK GS YG N K+T D + + G N +LS VGL N G +E
Sbjct: 560 YSAGHSMQVFVNGKSYGSVYGGFENPKLTYDGHVKMWQGSNKISILSSAVGLPNVGDHFE 619
Query: 565 KTGAGITGPVQLKGSGNGTNIDLSSQQWTYQTGLKGEELNFP--SGSS-TQWDSKSTLPK 621
+ G+ GPV L G G DLS Q+WTYQ GLKGE L SGSS +W +
Sbjct: 620 RWNVGVLGPVTLSGLSEGKR-DLSHQKWTYQVGLKGESLGIHTVSGSSAVEWGGPGS--- 675
Query: 622 LQPLVWYKTTFDAPAGSEPVAIDFTGMGKGEAWVNGQSIGRYWPTYVSQNGGCTDSCNYR 681
QPL W+K F+AP+GS+PVA+D MGKG+ WVNG +GRYW +Y + + GC C+Y
Sbjct: 676 KQPLTWHKALFNAPSGSDPVALDMGSMGKGQMWVNGHHVGRYW-SYKAPSRGC-GGCSYA 733
Query: 682 GAYSSNKCLKNCGKPSQSLYHVPRSWLKSSGNTLVLFEEIGGDPTKISFVTK 733
G Y +KC +CG+ SQ YHVPRSWLK GN LV+ EE GGD ++ T+
Sbjct: 734 GTYREDKCRSSCGELSQRWYHVPRSWLKPGGNLLVVLEEYGGDVAGVTLATR 785
>gi|297799386|ref|XP_002867577.1| beta-galactosidase 12 [Arabidopsis lyrata subsp. lyrata]
gi|297313413|gb|EFH43836.1| beta-galactosidase 12 [Arabidopsis lyrata subsp. lyrata]
Length = 728
Score = 864 bits (2232), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 426/734 (58%), Positives = 524/734 (71%), Gaps = 18/734 (2%)
Query: 4 KEILLLVLCWGFVVLATTSFGANVTYDHRAVVIGGKRRVLISGSIHYPRSTPEMWPDLIQ 63
K +LL + W ++ S A VTYD +AV+I G+RR+L+SGSIHYPRSTPEMWPDLIQ
Sbjct: 8 KAWILLGILWCSSLIY--SVKAMVTYDRKAVIINGQRRILLSGSIHYPRSTPEMWPDLIQ 65
Query: 64 KSKDGGLDVIETYVFWNLHEPVRNQYNFEGRYDLVKFVKLVAEAGLYAHLRIGPYVCAEW 123
K+KDGGLDVI+TYVFWN HEP QY FE RYDLVKF+KLV +AGLY HLRIGPYVCAEW
Sbjct: 66 KAKDGGLDVIQTYVFWNGHEPSPGQYYFEDRYDLVKFIKLVQQAGLYVHLRIGPYVCAEW 125
Query: 124 NFGGFPLWLHFIPGIQFRTDNEPFKAEMQRFTAKIVDMMKQEKLYASQGGPIILSQIENE 183
NFGGFP+WL ++P + FRTDNEPFKA MQ+FT KIV MMK+EKL+ +QGGPIILSQIENE
Sbjct: 126 NFGGFPVWLKYVPDMVFRTDNEPFKAAMQKFTEKIVGMMKEEKLFETQGGPIILSQIENE 185
Query: 184 YGNIDSAYGAAGKSYIKWAAGMALSLDTGVPWVMCQQSDAPDPIINTCNGFYCDQFTPNS 243
YG I+ GA GK+Y KW A MA L TGVPW+MC+Q DAP+ IINTCNGFYC+ F PNS
Sbjct: 186 YGPIEWEIGAPGKAYTKWVAKMAQGLSTGVPWIMCKQDDAPNSIINTCNGFYCENFKPNS 245
Query: 244 NNKPKMWTENWSGWFLSFGGAVPYRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNFDRTS 303
+ KPKMWTENW+GWF FGGAVPYRP ED+A +VARF Q GG+F NYYMYHGGTNFDRT+
Sbjct: 246 DKKPKMWTENWTGWFTEFGGAVPYRPAEDIALSVARFIQNGGSFINYYMYHGGTNFDRTA 305
Query: 304 GGPFISTSYDYDAPLDEYGLIRQPKWGHLKDLHKAIKLCEAALVATDPTYPSLGPNLEAT 363
G FI+TSYDYDAPLDEYGL R+PK+ HLK LHK IKLCE ALV+ DPT SLG EA
Sbjct: 306 -GEFIATSYDYDAPLDEYGLPREPKYSHLKRLHKVIKLCEPALVSADPTVTSLGDKQEAQ 364
Query: 364 VYKTGSGLCSAFLANIGTNSDVTVKFNGNSYLLPAWSVSILPDCKNVVFNTAKINSVTLV 423
V+K+ S C+AFL+N T+S V F G++Y LP WSVSILPDCK +NTAK+ T
Sbjct: 365 VFKSQSS-CAAFLSNYNTSSAARVSFGGSTYDLPPWSVSILPDCKTEYYNTAKVQVRT-- 421
Query: 424 PSFSRQSLQVAADSSDAIGSGWSYINEPVGISKDDAFTKPGLLEQINTTADQSDYLWYSL 483
S+ + ++ + S SY E + + F++ GL+EQI+ T D++DY WY
Sbjct: 422 -----SSIHMKMVPTNTLFSWGSYNEEIPSANDNGTFSQDGLVEQISITRDKTDYFWYLT 476
Query: 484 STNIKADEPLLEDGSKTVLHVQSLGHALHAFINGKLVGSGYGSSSNAKVTVDFPIALAPG 543
I DE L G +L++ S GHALH F+NG+L G+ YGS K+T I L G
Sbjct: 477 DITISPDEKFL-TGEDPLLNIGSAGHALHVFVNGQLAGTAYGSLEKPKLTFSQKIKLHAG 535
Query: 544 KNTFDLLSLTVGLQNYGAFYEKTGAGITGPVQLKGSGNGTNIDLSSQQWTYQTGLKGEEL 603
N LLS+ GL N G YE G+ GPV LKG +GT D+S +W+Y+ G KGE L
Sbjct: 536 VNKLALLSIAAGLPNVGVHYETWNTGVLGPVTLKGVNSGT-WDMSQWKWSYKIGTKGEAL 594
Query: 604 NFP--SGSST-QWDSKSTLPKLQPLVWYKTTFDAPAGSEPVAIDFTGMGKGEAWVNGQSI 660
+ +GSST +W S + QPL WYK+TFD PAG+EP+A+D MGKG+ W+NGQ+I
Sbjct: 595 SIHTVTGSSTVEWKQGSLVATKQPLTWYKSTFDTPAGNEPLALDMNTMGKGQTWINGQNI 654
Query: 661 GRYWPTYVSQNGGCTDSCNYRGAYSSNKCLKNCGKPSQSLYHVPRSWLKSSGNTLVLFEE 720
GR+WP Y ++ G C + C+Y G ++ NKCL NCG+ SQ YHVPRSWLK + N +V+ EE
Sbjct: 655 GRHWPAYTAR-GKC-ERCSYAGTFTENKCLSNCGEASQRWYHVPRSWLKPTNNLVVVLEE 712
Query: 721 IGGDPTKISFVTKQ 734
GG+P IS V ++
Sbjct: 713 WGGEPNGISLVKRR 726
>gi|7529708|emb|CAB86888.1| beta-galactosidase precursor-like protein [Arabidopsis thaliana]
Length = 727
Score = 863 bits (2230), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 426/738 (57%), Positives = 530/738 (71%), Gaps = 34/738 (4%)
Query: 6 ILLLVLCWGFVVLATTSFGANVTYDHRAVVIGGKRRVLISGSIHYPRSTPEMWPDLIQKS 65
I+L +LC+ ++ +T A VTYDH+A++I G+RR+LISGSIHYPRSTPEMWPDLI+K+
Sbjct: 11 IILAILCFSSLIHSTE---AVVTYDHKALIINGQRRILISGSIHYPRSTPEMWPDLIKKA 67
Query: 66 KDGGLDVIETYVFWNLHEPVRNQYNFEGRYDLVKFVKLVAEAGLYAHLRIGPYVCAEWNF 125
K+GGLDVI+TYVFWN HEP Y F+ RYDLVKF KLV +AGLY LRIGPYVCAEWNF
Sbjct: 68 KEGGLDVIQTYVFWNGHEPSPGNYYFQDRYDLVKFTKLVHQAGLYLDLRIGPYVCAEWNF 127
Query: 126 GGFPLWLHFIPGIQFRTDNEPFKAEMQRFTAKIVDMMKQEKLYASQGGPIILSQIENEYG 185
GGFP+WL ++PG+ FRTDNEPFK MQ+FT KIVDMMK+EKL+ +QGGPIILSQIENEYG
Sbjct: 128 GGFPVWLKYVPGMVFRTDNEPFKIAMQKFTKKIVDMMKEEKLFETQGGPIILSQIENEYG 187
Query: 186 NIDSAYGAAGKSYIKWAAGMALSLDTGVPWVMCQQSDAPDPIINTCNGFYCDQFTPNSNN 245
+ GAAGK+Y KW A MAL L TGVPW+M +Q DAP PII+TCNGFYC+ F PNS+N
Sbjct: 188 PMQWEMGAAGKAYSKWTAEMALGLSTGVPWIMSKQEDAPYPIIDTCNGFYCEGFKPNSDN 247
Query: 246 KPKMWTENWSGWFLSFGGAVPYRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNFDRTSGG 305
KPK+WTENW+GWF FGGA+P RPVED+AF+VARF Q GG+F NYYMY+GGTNFDRT+ G
Sbjct: 248 KPKLWTENWTGWFTEFGGAIPNRPVEDIAFSVARFIQNGGSFMNYYMYYGGTNFDRTA-G 306
Query: 306 PFISTSYDYDAPLDEYGLIRQPKWGHLKDLHKAIKLCEAALVATDPTYPSLGPNLEATVY 365
FI+TSYDYDAP+DEYGL+R+PK+ HLK+LHK IKLCE ALV+ DPT SLG E V+
Sbjct: 307 VFIATSYDYDAPIDEYGLLREPKYSHLKELHKVIKLCEPALVSVDPTITSLGDKQEIHVF 366
Query: 366 KTGSGLCSAFLANIGTNSDVTVKFNGNSYLLPAWSVSILPDCKNVVFNTAKINSVT---- 421
K+ + C+AFL+N T+S V F G Y LP WSVSILPDCK +NTAKI + T
Sbjct: 367 KSKTS-CAAFLSNYDTSSAARVMFRGFPYDLPPWSVSILPDCKTEYYNTAKIRAPTILMK 425
Query: 422 LVPS---FSRQSLQVAADSSDAIGSGWSYINEPVGISKDDAFTKPGLLEQINTTADQSDY 478
++P+ FS +S + SS+ G+ F K GL+EQI+ T D++DY
Sbjct: 426 MIPTSTKFSWESYNEGSPSSNEAGT----------------FVKDGLVEQISMTRDKTDY 469
Query: 479 LWYSLSTNIKADEPLLEDGSKTVLHVQSLGHALHAFINGKLVGSGYGSSSNAKVTVDFPI 538
WY I +DE L+ G +L + S GHALH F+NG L G+ YG+ SN+K+T I
Sbjct: 470 FWYFTDITIGSDESFLKTGDNPLLTIFSAGHALHVFVNGLLAGTSYGALSNSKLTFSQNI 529
Query: 539 ALAPGKNTFDLLSLTVGLQNYGAFYEKTGAGITGPVQLKGSGNGTNIDLSSQQWTYQTGL 598
L+ G N LLS VGL N G YE GI GPV LKG +GT D+S +W+Y+ GL
Sbjct: 530 KLSVGINKLALLSTAVGLPNAGVHYETWNTGILGPVTLKGVNSGT-WDMSKWKWSYKIGL 588
Query: 599 KGEELNFPS---GSSTQWDSKSTLPKLQPLVWYKTTFDAPAGSEPVAIDFTGMGKGEAWV 655
+GE ++ + S+ +W K + K QPL WYK++FD P G+EP+A+D MGKG+ WV
Sbjct: 589 RGEAMSLHTLAGSSAVKWWIKGFVVKKQPLTWYKSSFDTPRGNEPLALDMNTMGKGQVWV 648
Query: 656 NGQSIGRYWPTYVSQNGGCTDSCNYRGAYSSNKCLKNCGKPSQSLYHVPRSWLKSSGNTL 715
NG +IGR+WP Y ++ G C CNY G Y+ KCL +CG+PSQ YHVPRSWLK GN L
Sbjct: 649 NGHNIGRHWPAYTAR-GNC-GRCNYAGIYNEKKCLSHCGEPSQRWYHVPRSWLKPFGNLL 706
Query: 716 VLFEEIGGDPTKISFVTK 733
V+FEE GGDP+ IS V +
Sbjct: 707 VIFEEWGGDPSGISLVKR 724
>gi|3860321|emb|CAA10128.1| beta-galactosidase [Cicer arietinum]
Length = 745
Score = 863 bits (2229), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 421/715 (58%), Positives = 512/715 (71%), Gaps = 18/715 (2%)
Query: 26 NVTYDHRAVVIGGKRRVLISGSIHYPRSTPEMWPDLIQKSKDGGLDVIETYVFWNLHEPV 85
+VTYD +A++I G+RR+LISGSIHYPRSTPEMW DLIQK+K GGLDVI+TYVFWN+HEP
Sbjct: 27 SVTYDRKAIIINGQRRILISGSIHYPRSTPEMWEDLIQKAKVGGLDVIDTYVFWNVHEPS 86
Query: 86 RNQYNFEGRYDLVKFVKLVAEAGLYAHLRIGPYVCAEWNFGGFPLWLHFIPGIQFRTDNE 145
+ YNFEGRYDLV+F+K V + GLY HLRIGPYVCAEWNFGGFP+WL ++PGI FRTDN
Sbjct: 87 PSNYNFEGRYDLVRFIKTVQKVGLYVHLRIGPYVCAEWNFGGFPVWLKYVPGISFRTDNG 146
Query: 146 PFKAEMQRFTAKIVDMMKQEKLYASQGGPIILSQIENEYGNIDSAYGAAGKSYIKWAAGM 205
PFKA MQ FT KIV MMK EKL+ SQGGPIILSQIENEYG A GA G +Y WAA M
Sbjct: 147 PFKAAMQGFTQKIVQMMKNEKLFQSQGGPIILSQIENEYGPQGRALGAVGHAYSNWAAKM 206
Query: 206 ALSLDTGVPWVMCQQSDAPDPIINTCNGFYCDQFTPNSNNKPKMWTENWSGWFLSFGGAV 265
A+ L TGVPWVMC++ DAPDP+IN+CNGFYCD F+PN KPK+WTE+WSGWF FGG V
Sbjct: 207 AVGLGTGVPWVMCKEDDAPDPVINSCNGFYCDDFSPNKPYKPKLWTESWSGWFSEFGGPV 266
Query: 266 PYRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNFDRTSGGPFISTSYDYDAPLDEYGLIR 325
P RP +DLAFAVARF Q+GG+F NYYMYHGGTNF R++GGPFI+TSYDYDAP+DEYGL+R
Sbjct: 267 PQRPAQDLAFAVARFIQKGGSFFNYYMYHGGTNFGRSAGGPFITTSYDYDAPIDEYGLLR 326
Query: 326 QPKWGHLKDLHKAIKLCEAALVATDPTYPSLGPNLEATVYKTGSGLCSAFLANIGTNSDV 385
+PK+GHLKDLHKAIK CE ALV++DPT SLG +A V+ +G+ C+AFLAN +NS
Sbjct: 327 EPKYGHLKDLHKAIKQCEHALVSSDPTVTSLGAYEQAHVFSSGTQTCAAFLANYHSNSAA 386
Query: 386 TVKFNGNSYLLPAWSVSILPDCKNVVFNTAKINSVTLVPSFSRQSLQVAADSSDAIGSGW 445
V FN Y LP WS+SILPDCK VFNTA++ F +Q+ +S + W
Sbjct: 387 RVTFNNRHYDLPPWSISILPDCKTDVFNTARVR-------FQNSKIQMLPSNSKLL--SW 437
Query: 446 SYINEPV-GISKDDAFTKPGLLEQINTTADQSDYLWYSLSTNIKADEPLLEDGSKTVLHV 504
+E V +++ T GLLEQIN T D SDYLWY S +I E L G+K + V
Sbjct: 438 ETYDEDVSSLAESSRITASGLLEQINATRDTSDYLWYITSVDISPSESFLRGGNKPSISV 497
Query: 505 QSLGHALHAFINGKLVGSGYGSSSNAKVTVDFPIALAPGKNTFDLLSLTVGLQNYGAFYE 564
S G A+H FINGK GS +G+ T + PI L G N LLS+ VGL N G +E
Sbjct: 498 HSSGDAVHVFINGKFSGSAFGTREQRSCTFNGPINLHAGTNKIALLSVAVGLPNGGIHFE 557
Query: 565 KTGAGITGPVQLKGSGNGTNIDLSSQQWTYQTGLKGEELNF--PSG-SSTQWDSKSTLPK 621
GITGP+ L G +G DL+ Q+W+YQ GLKGE +N P+G SS W +S +
Sbjct: 558 SWKTGITGPILLHGLDHGQK-DLTWQKWSYQVGLKGEAMNLVSPNGVSSVDWVRESLASQ 616
Query: 622 LQP-LVWYKTTFDAPAGSEPVAIDFTGMGKGEAWVNGQSIGRYWPTYVSQNGGCTDSCNY 680
QP L W+K F+AP G+E +A+D +GMGKG+ W+NGQSIGRYW Y G C +SCNY
Sbjct: 617 NQPQLKWHKAYFNAPDGNEALALDMSGMGKGQVWINGQSIGRYWLVYA--KGNC-NSCNY 673
Query: 681 RGAYSSNKCLKNCGKPSQSLYHVPRSWLKSSGNTLVLFEEIGGDPTKISFVTKQL 735
G Y KC CG+P+Q YHVPRSWLK + N +V+FEE+GG+P KIS V + +
Sbjct: 674 AGTYRQAKCQLGCGQPTQRWYHVPRSWLKPTNNLMVVFEELGGNPWKISLVKRTI 728
>gi|61162199|dbj|BAD91081.1| beta-D-galactosidase [Pyrus pyrifolia]
Length = 725
Score = 863 bits (2229), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 424/735 (57%), Positives = 519/735 (70%), Gaps = 30/735 (4%)
Query: 6 ILLLVLCWGFVVLATTSFGANVTYDHRAVVIGGKRRVLISGSIHYPRSTPEMWPDLIQKS 65
ILLL C + + S A+V YDH+A++I G+RR+LISGSIHYPRSTP MWPDLIQK+
Sbjct: 11 ILLLFSC----IFSAAS--ASVGYDHKAIIINGQRRILISGSIHYPRSTPGMWPDLIQKA 64
Query: 66 KDGGLDVIETYVFWNLHEPVRNQYNFEGRYDLVKFVKLVAEAGLYAHLRIGPYVCAEWNF 125
K GGLDVI+TYVFWN HEP +Y FE RYDLVKF+KLV +AGL+ +LRIGPYVCAEWNF
Sbjct: 65 KAGGLDVIQTYVFWNGHEPSPGKYYFEDRYDLVKFIKLVQQAGLFVNLRIGPYVCAEWNF 124
Query: 126 GGFPLWLHFIPGIQFRTDNEPFKAEMQRFTAKIVDMMKQEKLYASQGGPIILSQIENEYG 185
GGFP+WL ++PGI FRTDNEPFKA MQ+FT KIV+MMK EKL+ +QGGPIILSQIENE+G
Sbjct: 125 GGFPIWLKYVPGIAFRTDNEPFKAAMQKFTEKIVNMMKAEKLFQTQGGPIILSQIENEFG 184
Query: 186 NIDSAYGAAGKSYIKWAAGMALSLDTGVPWVMCQQSDAPDPIINTCNGFYCDQFTPNSNN 245
++ GA GK+Y KWAA MA+ LDTGVPW+MC+Q DAPDP+I+TCNG+YC+ F PN
Sbjct: 185 PVEWEIGAPGKAYTKWAAQMAVGLDTGVPWIMCKQEDAPDPVIDTCNGYYCENFKPNKVY 244
Query: 246 KPKMWTENWSGWFLSFGGAVPYRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNFDRTSGG 305
KPKMWTE W+GW+ FGGA+P RP EDLAF+VARF Q GG+F NYYMYHGGTNF RT+GG
Sbjct: 245 KPKMWTEVWTGWYTEFGGAIPTRPAEDLAFSVARFIQSGGSFFNYYMYHGGTNFGRTAGG 304
Query: 306 PFISTSYDYDAPLDEYGLIRQPKWGHLKDLHKAIKLCEAALVATDPTYPSLGPNLEATVY 365
PF++TSYDYDAPLDEYGL++QPKWGHL+DLHKAIK CE ALVA DP+ LG N EA V+
Sbjct: 305 PFMATSYDYDAPLDEYGLLQQPKWGHLRDLHKAIKSCEHALVAVDPSVTKLGNNQEAHVF 364
Query: 366 KTGSGLCSAFLANIGTNSDVTVKFNGNSYLLPAWSVSILPDCKNVVFNTAKI----NSVT 421
+ SG C+AFLAN T V V F Y LP WS+SILPDCK VFNTAK+ + V
Sbjct: 365 NSKSG-CAAFLANHDTKYSVRVSFGHGQYDLPPWSISILPDCKTAVFNTAKVAWKASEVQ 423
Query: 422 LVPSFSRQSLQVAADSSDAIGSGWSYINEPVGISKDDAFTKPGLLEQINTTADQSDYLWY 481
+ P +SR Q S+I E + T GL EQI T D +DYLWY
Sbjct: 424 MKPVYSRLPWQ-------------SFIEETTTSDETGTTTLDGLYEQIYMTRDATDYLWY 470
Query: 482 SLSTNIKADEPLLEDGSKTVLHVQSLGHALHAFINGKLVGSGYGSSSNAKVTVDFPIALA 541
I +DE L++G +L + S GHALH FING+L G+ YGS N K+T + L
Sbjct: 471 MTDITIGSDEAFLKNGKFPLLTIFSAGHALHVFINGQLSGTVYGSLENPKLTFSQNVKLR 530
Query: 542 PGKNTFDLLSLTVGLQNYGAFYEKTGAGITGPVQLKGSGNGTNIDLSSQQWTYQTGLKGE 601
PG N LLS++VGL N G +E G+ GP+ LKG GT D+S +WTY+ G+KGE
Sbjct: 531 PGINKLALLSISVGLPNVGTHFETWNTGVLGPISLKGLNTGT-WDMSRWKWTYKIGMKGE 589
Query: 602 ELNFPS---GSSTQWDSKSTLPKLQPLVWYKTTFDAPAGSEPVAIDFTGMGKGEAWVNGQ 658
L + SS W ++ + QPL WYK TFDAP G P+A+D MGKG+ W+NGQ
Sbjct: 590 SLGLHTVTGSSSVDWAEGPSMAQKQPLTWYKATFDAPPGHAPLALDMGSMGKGQIWINGQ 649
Query: 659 SIGRYWPTYVSQNGGCTDSCNYRGAYSSNKCLKNCGKPSQSLYHVPRSWLKSSGNTLVLF 718
S+GR+WP Y++Q G +C Y G ++ KC CGKPSQ YH+PRSWL +GN LV+F
Sbjct: 650 SVGRHWPGYIAQ--GSCGNCYYAGTFNDKKCRTYCGKPSQRWYHIPRSWLTPTGNLLVVF 707
Query: 719 EEIGGDPTKISFVTK 733
EE GGDP+ +S V +
Sbjct: 708 EEWGGDPSWMSLVER 722
>gi|356529081|ref|XP_003533125.1| PREDICTED: beta-galactosidase-like [Glycine max]
Length = 832
Score = 862 bits (2227), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 433/852 (50%), Positives = 548/852 (64%), Gaps = 43/852 (5%)
Query: 8 LLVLCWGFVVLATTSFGANVTYDHRAVVIGGKRRVLISGSIHYPRSTPEMWPDLIQKSKD 67
LL+L + V LA +F V+YD RA+ I GKR+VL SGSIHYPRST EMWP LI K+K+
Sbjct: 5 LLLLSFTLVNLAINAF--EVSYDSRAITIDGKRKVLFSGSIHYPRSTAEMWPSLINKAKE 62
Query: 68 GGLDVIETYVFWNLHEPVRNQYNFEGRYDLVKFVKLVAEAGLYAHLRIGPYVCAEWNFGG 127
GGLDVIETYVFWN HEP QY+F G DLVKF+K + + GLYA LRIGPYVCAEWN+GG
Sbjct: 63 GGLDVIETYVFWNAHEPQPRQYDFSGNLDLVKFIKTIQKEGLYAMLRIGPYVCAEWNYGG 122
Query: 128 FPLWLHFIPGIQFRTDNEPFKAEMQRFTAKIVDMMKQEKLYASQGGPIILSQIENEYGNI 187
FP+WLH +P ++FRT+N + EMQ FT IVD M+ E L+ASQGGPIIL+QIENEYGNI
Sbjct: 123 FPVWLHNMPNMEFRTNNTAYMNEMQTFTTLIVDKMRHENLFASQGGPIILAQIENEYGNI 182
Query: 188 DSAYGAAGKSYIKWAAGMALSLDTGVPWVMCQQSDAPDPIINTCNGFYCDQFTPNSNNKP 247
S YG GK Y++W A +A S GVPWVMCQQSDAPDPIINTCNG+YCDQF+PNS +KP
Sbjct: 183 MSEYGENGKQYVQWCAQLAESYKIGVPWVMCQQSDAPDPIINTCNGWYCDQFSPNSKSKP 242
Query: 248 KMWTENWSGWFLSFGGAVPYRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNFDRTSGGPF 307
KMWTENW+GWF ++GG +P+R D+A+AVARFFQ GGTFQNYYMYHGGTNF RTSGGP+
Sbjct: 243 KMWTENWTGWFKNWGGPIPHRTARDVAYAVARFFQYGGTFQNYYMYHGGTNFGRTSGGPY 302
Query: 308 ISTSYDYDAPLDEYGLIRQPKWGHLKDLHKAIKLCEAALVATDPTYPSLGPNLEATVYKT 367
I+TSYDYDAPLDEYG QPKWGHLK LH+ +K E L + G L ATVY
Sbjct: 303 ITTSYDYDAPLDEYGNKNQPKWGHLKQLHELLKSMEDVLTQGTTNHTDYGNLLTATVYNY 362
Query: 368 GSGLCSAFLANIGTNSDVTVKFNGNSYLLPAWSVSILPDCKNVVFNTAKINSVTLVPSFS 427
SG + FL N +++D T+ F Y++PAWSVSILP+C N V+NTAKIN+ T +
Sbjct: 363 -SGKSACFLGNANSSNDATIMFQSTQYIVPAWSVSILPNCVNEVYNTAKINAQTSIMVMK 421
Query: 428 RQSLQVAADSSDAIGSGWSYINEPVGISKDDAF------TKPGLLEQINTTADQSDYLWY 481
+ + W +++EP KD LL+Q T D SDYLWY
Sbjct: 422 DNKSDNEEEPHSTL--NWQWMHEPHVQMKDGQVLGSVSRKAAQLLDQKVVTNDTSDYLWY 479
Query: 482 SLSTNIKADEPLLEDGSKTVLHVQSLGHALHAFINGKLVGSGYGSSSNAKVTVDFPIALA 541
S +I ++P+ + V + GH LH F+NG G YG + T + I L
Sbjct: 480 ITSVDISENDPIWSK-----IRVSTNGHVLHVFVNGAQAGYQYGQNGKYSFTYEAKIKLK 534
Query: 542 PGKNTFDLLSLTVGLQNYGAFYEKTGAGITGPVQLKGSGNGTNI--DLSSQQWTYQTGLK 599
G N LLS TVGL NYGA + G+ GPVQL N T + D+++ W Y+ GL
Sbjct: 535 KGTNEISLLSGTVGLPNYGAHFSNVSVGVCGPVQLVALQNNTEVVKDITNNTWNYKVGLH 594
Query: 600 GEELNFPSGSSTQWDSKSTLPKLQPLVWYKTTFDAPAGSEPVAIDFTGMGKGEAWVNGQS 659
GE + + + + + LP + VWYKT F +P G++PV +D G+ KG+AWVNG +
Sbjct: 595 GEIVKLYCPENNKGWNTNGLPTNRVFVWYKTLFKSPKGTDPVVVDLKGLKKGQAWVNGNN 654
Query: 660 IGRYWPTYVSQNGGCTDSCNYRGAYSSNKCLKNCGKPSQSLYHVPRSWLKSSG-NTLVLF 718
IGRYW Y++ + GCT +CNYRG YSS+KC+ CG+P+Q YHVPRS+L+ NTLVLF
Sbjct: 655 IGRYWTRYLADDNGCTATCNYRGPYSSDKCITKCGRPTQRWYHVPRSFLRQDNQNTLVLF 714
Query: 719 EEIGGDPTKISFVTKQLGSSLCSHVTDSHPLPVDMWGSDSKIQRKPGPVLSLECPNPNQV 778
EE GG P ++ F T + +C++ + G VL L C QV
Sbjct: 715 EEFGGHPNEVKFATVMV-EKICANSYE-------------------GNVLELSC-REEQV 753
Query: 779 ISSIKFASFGTPLGTCGSFSRGRCSSARSLSVVRQACVGSKSCSIGVSVNTFGDP-CK-- 835
IS IKFASFG P G CGSF + +C S +LS++ ++C+G +SCS+ VS G C+
Sbjct: 754 ISKIKFASFGVPEGECGSFKKSQCESPNALSILSKSCLGKQSCSVQVSQRMLGPTGCRMP 813
Query: 836 GVMKSLAVEASC 847
LA+EA C
Sbjct: 814 QNQNKLAIEAVC 825
>gi|212274513|ref|NP_001130532.1| uncharacterized protein LOC100191631 precursor [Zea mays]
gi|194689400|gb|ACF78784.1| unknown [Zea mays]
gi|224030521|gb|ACN34336.1| unknown [Zea mays]
gi|413922054|gb|AFW61986.1| beta-galactosidase isoform 1 [Zea mays]
gi|413922055|gb|AFW61987.1| beta-galactosidase isoform 2 [Zea mays]
gi|413954366|gb|AFW87015.1| beta-galactosidase isoform 1 [Zea mays]
gi|413954367|gb|AFW87016.1| beta-galactosidase isoform 2 [Zea mays]
Length = 722
Score = 861 bits (2225), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 422/720 (58%), Positives = 506/720 (70%), Gaps = 20/720 (2%)
Query: 17 VLATTSFGANVTYDHRAVVIGGKRRVLISGSIHYPRSTPEMWPDLIQKSKDGGLDVIETY 76
++A + A V+YDHRAVVI G+RR+LISGSIHYPRSTPEMWP L+QK+KDGGLDV++TY
Sbjct: 18 MIAPSPANAAVSYDHRAVVINGQRRILISGSIHYPRSTPEMWPGLLQKAKDGGLDVVQTY 77
Query: 77 VFWNLHEPVRNQYNFEGRYDLVKFVKLVAEAGLYAHLRIGPYVCAEWNFGGFPLWLHFIP 136
VFWN HEPVR QY F RYDLV+FVKL +AGLY HLRIGPYVCAEWNFGGFP+WL ++P
Sbjct: 78 VFWNGHEPVRGQYYFGDRYDLVRFVKLAKQAGLYVHLRIGPYVCAEWNFGGFPVWLKYVP 137
Query: 137 GIQFRTDNEPFKAEMQRFTAKIVDMMKQEKLYASQGGPIILSQIENEYGNIDSAYGAAGK 196
GI FRTDN PFKA MQ F KIV MMK E L+ QGGPIIL+Q+ENEYG ++S GA K
Sbjct: 138 GISFRTDNGPFKAAMQAFVEKIVSMMKSEGLFEWQGGPIILAQVENEYGPMESVMGAGAK 197
Query: 197 SYIKWAAGMALSLDTGVPWVMCQQSDAPDPIINTCNGFYCDQFTPNSNNKPKMWTENWSG 256
Y WAA MA++ GVPWVMC+Q DAPDP+INTCNGFYCD F+PNSN+KP MWTE W+G
Sbjct: 198 PYANWAAKMAVATGAGVPWVMCKQDDAPDPVINTCNGFYCDYFSPNSNSKPTMWTEAWTG 257
Query: 257 WFLSFGGAVPYRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNFDRTSGGPFISTSYDYDA 316
WF +FGGAVP+RPVED+AFAVARF Q+GG+F NYYMYHGGTNFDRTSGGPFI+TSYDYDA
Sbjct: 258 WFTAFGGAVPHRPVEDMAFAVARFIQKGGSFVNYYMYHGGTNFDRTSGGPFIATSYDYDA 317
Query: 317 PLDEYGLIRQPKWGHLKDLHKAIKLCEAALVATDPTYPSLGPNLEATVYKTGSGLCSAFL 376
P+DEYGL+RQPKWGHL+DLHKAIK E ALV+ DPT SLG +A V+K+ G C+AFL
Sbjct: 318 PIDEYGLLRQPKWGHLRDLHKAIKQAEPALVSGDPTIQSLGNYEKAYVFKSSGGACAAFL 377
Query: 377 ANIGTNSDVTVKFNGNSYLLPAWSVSILPDCKNVVFNTAKINSVTLVPSFSRQSLQVAAD 436
+N T++ V FNG Y LPAWS+S+LPDCK VFNTA ++ PS A
Sbjct: 378 SNYHTSAAARVVFNGRRYDLPAWSISVLPDCKAAVFNTATVSE----PS-------APAR 426
Query: 437 SSDAIGSGWSYINEPVGISKDDAFTKPGLLEQINTTADQSDYLWYSLSTNIKADEPLLED 496
S A G W +E AFTK GL+EQ++ T D+SDYLWY+ NI ++E L+
Sbjct: 427 MSPAGGFSWQSYSEATNSLDGRAFTKDGLVEQLSMTWDKSDYLWYTTYVNINSNEQFLKS 486
Query: 497 GSKTVLHVQSLGHALHAFINGKLVGSGYGSSSNAKVTVDFPIALAPGKNTFDLLSLTVGL 556
G L + S GH+L F+NG+ G+ YG + K+T + + G N +LS VGL
Sbjct: 487 GQWPQLTIYSAGHSLQVFVNGQSYGAVYGGYDSPKLTYSGYVKMWQGSNKISILSAAVGL 546
Query: 557 QNYGAFYEKTGAGITGPVQLKGSGNGTNIDLSSQQWTYQTGLKGEELNFPS---GSSTQW 613
N G YE G+ GPV L G G DLS Q+WTYQ GL GE L S SS +W
Sbjct: 547 PNQGTHYETWNVGVLGPVTLSGLNEGKR-DLSDQKWTYQIGLHGESLGVQSVAGSSSVEW 605
Query: 614 DSKSTLPKLQPLVWYKTTFDAPAGSEPVAIDFTGMGKGEAWVNGQSIGRYWPTYVSQNGG 673
S + QPL W+K F AP+G PVA+D MGKG+AWVNG+ IGRYW +Y + + G
Sbjct: 606 GSAA---GKQPLTWHKAYFSAPSGDAPVALDMGSMGKGQAWVNGRHIGRYW-SYKASSSG 661
Query: 674 CTDSCNYRGAYSSNKCLKNCGKPSQSLYHVPRSWLKSSGNTLVLFEEIGGDPTKISFVTK 733
C C+Y G YS KC CG SQ YHVPRSWL SGN LV+ EE GGD + + VT+
Sbjct: 662 C-GGCSYAGTYSETKCQTGCGDVSQRYYHVPRSWLNPSGNLLVMLEEFGGDLSGVKLVTR 720
>gi|449527779|ref|XP_004170887.1| PREDICTED: LOW QUALITY PROTEIN: beta-galactosidase-like [Cucumis
sativus]
Length = 716
Score = 860 bits (2222), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 425/734 (57%), Positives = 522/734 (71%), Gaps = 25/734 (3%)
Query: 4 KEILLLVLCWGFVVLATTSFGANVTYDHRAVVIGGKRRVLISGSIHYPRSTPEMWPDLIQ 63
K +LL + + ++ GA VTYD +A++I +RR+LISGSIHYPRSTP+MWPDLIQ
Sbjct: 3 KTVLLFL---SLLTWVGSTIGA-VTYDEKAIIINDQRRILISGSIHYPRSTPQMWPDLIQ 58
Query: 64 KSKDGGLDVIETYVFWNLHEPVRNQYNFEGRYDLVKFVKLVAEAGLYAHLRIGPYVCAEW 123
K+KDGGLD+IETYVFWN HEP +Y FE RYDLV F+KLV +AGLY HLRIGPYVCAEW
Sbjct: 59 KAKDGGLDIIETYVFWNGHEPSEGKYYFEERYDLVGFIKLVQKAGLYVHLRIGPYVCAEW 118
Query: 124 NFGGFPLWLHFIPGIQFRTDNEPFKAEMQRFTAKIVDMMKQEKLYASQGGPIILSQIENE 183
N+GGFP+WL F+PGI FRTDNEPFKA MQ+F KIVDMMK EKLY +QGGPIILSQIENE
Sbjct: 119 NYGGFPIWLKFVPGIAFRTDNEPFKAAMQKFVTKIVDMMKLEKLYHTQGGPIILSQIENE 178
Query: 184 YGNIDSAYGAAGKSYIKWAAGMALSLDTGVPWVMCQQSDAPDPIINTCNGFYCDQFTPNS 243
YG ++ GA GKSY KW A MA+ L TGVPWVMC+Q DAPDP+I+TCNGFYC+ F PN
Sbjct: 179 YGPVEWQIGAPGKSYTKWFAQMAVDLKTGVPWVMCKQEDAPDPLIDTCNGFYCENFKPNQ 238
Query: 244 NNKPKMWTENWSGWFLSFGGAVPYRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNFDRTS 303
KPK+WTENWSGW+ +FGG PYRP ED+AF+VARF Q G+ NYY+YHGGTNF RTS
Sbjct: 239 IYKPKIWTENWSGWYTAFGGPTPYRPPEDVAFSVARFIQNNGSLVNYYVYHGGTNFGRTS 298
Query: 304 GGPFISTSYDYDAPLDEYGLIRQPKWGHLKDLHKAIKLCEAALVATDPTYPSLGPNLEAT 363
G FI+TSYD+DAP+DEYGLIR+PKWGHL+DLHKAIK CE ALV+ DPT LG N EA
Sbjct: 299 -GLFIATSYDFDAPIDEYGLIREPKWGHLRDLHKAIKSCEPALVSADPTITWLGKNQEAR 357
Query: 364 VYKTGSGLCSAFLANIGTNSDVTVKFNGNSYLLPAWSVSILPDCKNVVFNTAKINSVTLV 423
V+K+ S C+AFLAN T++ V V F N Y LP WS+SILPDC V FNTA++ V
Sbjct: 358 VFKSSSA-CAAFLANYDTSASVKVNFWNNPYDLPPWSISILPDCXTVTFNTAQVG----V 412
Query: 424 PSFSRQSLQVAADSSDAIGSGW-SYINEPVGISKDDAFTKPGLLEQINTTADQSDYLWYS 482
S+ + + +++ GW SY EP D TK GL+EQ++ T D +DYLWY
Sbjct: 413 KSYQAKMMPISS-------FGWLSYKEEPASAYAKDTTTKAGLVEQVSITWDTTDYLWYM 465
Query: 483 LSTNIKADEPLLEDGSKTVLHVQSLGHALHAFINGKLVGSGYGSSSNAKVTVDFPIALAP 542
+I + E L+ G +L V S GH LH FING+L GS YGS + +T + L
Sbjct: 466 QDISIDSTEGFLKSGKWPLLSVNSAGHLLHVFINGQLSGSVYGSLEDPAITFSKNVDLKQ 525
Query: 543 GKNTFDLLSLTVGLQNYGAFYEKTGAGITGPVQLKGSGNGTNIDLSSQQWTYQTGLKGEE 602
G N +LS+TVGL N G ++ AG+ GPV L+G GT D+S +W+Y+ GL GE
Sbjct: 526 GVNKLSMLSVTVGLPNVGLHFDTWNAGVLGPVTLEGLNEGTR-DMSKYKWSYKVGLSGES 584
Query: 603 LNFPS---GSSTQWDSKSTLPKLQPLVWYKTTFDAPAGSEPVAIDFTGMGKGEAWVNGQS 659
LN S +S QW +K +L + QPL WYKTTF PAG+EP+ +D + M KG+ W+NGQS
Sbjct: 585 LNLYSDKGSNSVQW-TKGSLTQKQPLTWYKTTFKTPAGNEPLGLDMSSMSKGQIWINGQS 643
Query: 660 IGRYWPTYVSQNGGCTDSCNYRGAYSSNKCLKNCGKPSQSLYHVPRSWLKSSGNTLVLFE 719
IGRY+P Y++ NG C D C+Y G ++ KCL NCG+PSQ YH+PR WL S N LV+FE
Sbjct: 644 IGRYFPGYIA-NGKC-DKCSYAGLFTEKKCLGNCGEPSQKWYHIPRDWLSPSDNLLVIFE 701
Query: 720 EIGGDPTKISFVTK 733
EIGG P IS V +
Sbjct: 702 EIGGSPDGISLVKR 715
>gi|84579371|dbj|BAE72074.1| pear beta-galactosidase2 [Pyrus communis]
Length = 725
Score = 860 bits (2222), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 423/735 (57%), Positives = 518/735 (70%), Gaps = 30/735 (4%)
Query: 6 ILLLVLCWGFVVLATTSFGANVTYDHRAVVIGGKRRVLISGSIHYPRSTPEMWPDLIQKS 65
ILLL C + + S A+V YDH+A++I G+RR+LISGSIHYPRSTP MWPDLIQK+
Sbjct: 11 ILLLFSC----IFSAAS--ASVGYDHKAIIINGQRRILISGSIHYPRSTPGMWPDLIQKA 64
Query: 66 KDGGLDVIETYVFWNLHEPVRNQYNFEGRYDLVKFVKLVAEAGLYAHLRIGPYVCAEWNF 125
K GGLDVI+TYVFWN HEP +Y FE RYDLVKF+KLV +AGL+ +LRIGPYVCAEWNF
Sbjct: 65 KAGGLDVIQTYVFWNGHEPSPGKYYFEDRYDLVKFIKLVQQAGLFVNLRIGPYVCAEWNF 124
Query: 126 GGFPLWLHFIPGIQFRTDNEPFKAEMQRFTAKIVDMMKQEKLYASQGGPIILSQIENEYG 185
GGFP+WL ++PGI FRTDNEPFKA MQ+FT KIV+MMK EKL+ +QGGPIILSQIENE+G
Sbjct: 125 GGFPIWLKYVPGIAFRTDNEPFKAAMQKFTEKIVNMMKAEKLFQTQGGPIILSQIENEFG 184
Query: 186 NIDSAYGAAGKSYIKWAAGMALSLDTGVPWVMCQQSDAPDPIINTCNGFYCDQFTPNSNN 245
++ GA GK+Y KWAA MA+ LDTGVPW+MC+Q DAPDP+I+TCNG+YC+ F PN
Sbjct: 185 PVEWEIGAPGKAYTKWAAQMAVGLDTGVPWIMCKQEDAPDPVIDTCNGYYCENFKPNKVY 244
Query: 246 KPKMWTENWSGWFLSFGGAVPYRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNFDRTSGG 305
KPKMWTE W+GW+ FGGA+P RP EDLAF+VARF Q GG+F NYYMYHGGTNF RT+GG
Sbjct: 245 KPKMWTEVWTGWYTEFGGAIPTRPAEDLAFSVARFIQSGGSFFNYYMYHGGTNFGRTAGG 304
Query: 306 PFISTSYDYDAPLDEYGLIRQPKWGHLKDLHKAIKLCEAALVATDPTYPSLGPNLEATVY 365
PF++TSYDYDAPLDEYGL++QPKWGHL+DLHKAIK CE ALVA DP+ LG N EA V+
Sbjct: 305 PFMATSYDYDAPLDEYGLLQQPKWGHLRDLHKAIKSCEHALVAVDPSVTKLGNNQEAHVF 364
Query: 366 KTGSGLCSAFLANIGTNSDVTVKFNGNSYLLPAWSVSILPDCKNVVFNTAKI----NSVT 421
+ SG C+AFLAN T V V F Y LP WS+SILPDCK VFNTAK+ + V
Sbjct: 365 NSKSG-CAAFLANYDTKYSVRVSFGHGQYDLPPWSISILPDCKTAVFNTAKVAWKASEVQ 423
Query: 422 LVPSFSRQSLQVAADSSDAIGSGWSYINEPVGISKDDAFTKPGLLEQINTTADQSDYLWY 481
+ P +SR Q S+I E + T GL EQI T D +DYLWY
Sbjct: 424 MKPVYSRLPWQ-------------SFIEETTTSDETGTTTLDGLYEQIYMTRDATDYLWY 470
Query: 482 SLSTNIKADEPLLEDGSKTVLHVQSLGHALHAFINGKLVGSGYGSSSNAKVTVDFPIALA 541
I +DE L++G +L + S GHALH FING+L G+ YGS N K+T + L
Sbjct: 471 MTDITIGSDEAFLKNGKFPLLTIFSAGHALHVFINGQLSGTVYGSLENPKLTFSQNVKLR 530
Query: 542 PGKNTFDLLSLTVGLQNYGAFYEKTGAGITGPVQLKGSGNGTNIDLSSQQWTYQTGLKGE 601
PG N LLS++VGL N G +E G+ GP+ LKG GT D+S +WTY+ G+KGE
Sbjct: 531 PGINKLALLSISVGLPNVGTHFETWNTGVLGPISLKGLNTGT-WDMSRWKWTYKIGMKGE 589
Query: 602 ELNFPS---GSSTQWDSKSTLPKLQPLVWYKTTFDAPAGSEPVAIDFTGMGKGEAWVNGQ 658
L + SS W ++ + QPL WYK TFDAP G P+A+D MGKG+ W+NGQ
Sbjct: 590 SLGLHTVTGSSSVDWAEGPSMAQKQPLTWYKATFDAPPGHAPLALDMGSMGKGQIWINGQ 649
Query: 659 SIGRYWPTYVSQNGGCTDSCNYRGAYSSNKCLKNCGKPSQSLYHVPRSWLKSSGNTLVLF 718
S+GR+WP Y++Q G +C Y G ++ KC CGKPSQ H+PRSWL +GN LV+F
Sbjct: 650 SVGRHWPGYIAQ--GSCGNCYYAGTFNDKKCRTYCGKPSQRWCHIPRSWLTPTGNLLVVF 707
Query: 719 EEIGGDPTKISFVTK 733
EE GGDP+ +S V +
Sbjct: 708 EEWGGDPSWMSLVER 722
>gi|30687121|ref|NP_849553.1| beta-galactosidase 12 [Arabidopsis thaliana]
gi|75265630|sp|Q9SCV0.1|BGL12_ARATH RecName: Full=Beta-galactosidase 12; Short=Lactase 12; Flags:
Precursor
gi|6686896|emb|CAB64748.1| putative beta-galactosidase [Arabidopsis thaliana]
gi|332659762|gb|AEE85162.1| beta-galactosidase 12 [Arabidopsis thaliana]
Length = 728
Score = 860 bits (2221), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 424/731 (58%), Positives = 521/731 (71%), Gaps = 19/731 (2%)
Query: 6 ILLLVLCWGFVVLATTSFGANVTYDHRAVVIGGKRRVLISGSIHYPRSTPEMWPDLIQKS 65
ILL +LC ++ S A VTYD +AV+I G+RR+L+SGSIHYPRSTPEMWPDLIQK+
Sbjct: 11 ILLGILCCSSLIC---SVKAIVTYDRKAVIINGQRRILLSGSIHYPRSTPEMWPDLIQKA 67
Query: 66 KDGGLDVIETYVFWNLHEPVRNQYNFEGRYDLVKFVKLVAEAGLYAHLRIGPYVCAEWNF 125
KDGGLDVI+TYVFWN HEP QY FE RYDLVKF+K+V +AGLY HLRIGPYVCAEWNF
Sbjct: 68 KDGGLDVIQTYVFWNGHEPSPGQYYFEDRYDLVKFIKVVQQAGLYVHLRIGPYVCAEWNF 127
Query: 126 GGFPLWLHFIPGIQFRTDNEPFKAEMQRFTAKIVDMMKQEKLYASQGGPIILSQIENEYG 185
GGFP+WL ++PG+ FRTDNEPFKA MQ+FT KIV MMK+EKL+ +QGGPIILSQIENEYG
Sbjct: 128 GGFPVWLKYVPGMVFRTDNEPFKAAMQKFTEKIVRMMKEEKLFETQGGPIILSQIENEYG 187
Query: 186 NIDSAYGAAGKSYIKWAAGMALSLDTGVPWVMCQQSDAPDPIINTCNGFYCDQFTPNSNN 245
I+ GA GK+Y KW A MA L TGVPW+MC+Q DAP+ IINTCNGFYC+ F PNS+N
Sbjct: 188 PIEWEIGAPGKAYTKWVAEMAQGLSTGVPWIMCKQDDAPNSIINTCNGFYCENFKPNSDN 247
Query: 246 KPKMWTENWSGWFLSFGGAVPYRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNFDRTSGG 305
KPKMWTENW+GWF FGGAVPYRP ED+A +VARF Q GG+F NYYMYHGGTNFDRT+ G
Sbjct: 248 KPKMWTENWTGWFTEFGGAVPYRPAEDIALSVARFIQNGGSFINYYMYHGGTNFDRTA-G 306
Query: 306 PFISTSYDYDAPLDEYGLIRQPKWGHLKDLHKAIKLCEAALVATDPTYPSLGPNLEATVY 365
FI+TSYDYDAPLDEYGL R+PK+ HLK LHK IKLCE ALV+ DPT SLG EA V+
Sbjct: 307 EFIATSYDYDAPLDEYGLPREPKYSHLKRLHKVIKLCEPALVSADPTVTSLGDKQEAHVF 366
Query: 366 KTGSGLCSAFLANIGTNSDVTVKFNGNSYLLPAWSVSILPDCKNVVFNTAKINSVTLVPS 425
K+ S C+AFL+N T+S V F G++Y LP WSVSILPDCK +NTAK+ T
Sbjct: 367 KSKSS-CAAFLSNYNTSSAARVLFGGSTYDLPPWSVSILPDCKTEYYNTAKVQVRT---- 421
Query: 426 FSRQSLQVAADSSDAIGSGWSYINEPVGISKDDAFTKPGLLEQINTTADQSDYLWYSLST 485
S+ + ++ S SY E + + F++ GL+EQI+ T D++DY WY
Sbjct: 422 ---SSIHMKMVPTNTPFSWGSYNEEIPSANDNGTFSQDGLVEQISITRDKTDYFWYLTDI 478
Query: 486 NIKADEPLLEDGSKTVLHVQSLGHALHAFINGKLVGSGYGSSSNAKVTVDFPIALAPGKN 545
I DE L G +L + S GHALH F+NG+L G+ YGS K+T I L G N
Sbjct: 479 TISPDEKFL-TGEDPLLTIGSAGHALHVFVNGQLAGTAYGSLEKPKLTFSQKIKLHAGVN 537
Query: 546 TFDLLSLTVGLQNYGAFYEKTGAGITGPVQLKGSGNGTNIDLSSQQWTYQTGLKGEELNF 605
LLS GL N G YE G+ GPV L G +GT D++ +W+Y+ G KGE L+
Sbjct: 538 KLALLSTAAGLPNVGVHYETWNTGVLGPVTLNGVNSGT-WDMTKWKWSYKIGTKGEALSV 596
Query: 606 PS--GSST-QWDSKSTLPKLQPLVWYKTTFDAPAGSEPVAIDFTGMGKGEAWVNGQSIGR 662
+ GSST +W S + K QPL WYK+TFD+P G+EP+A+D MGKG+ W+NGQ+IGR
Sbjct: 597 HTLAGSSTVEWKEGSLVAKKQPLTWYKSTFDSPTGNEPLALDMNTMGKGQMWINGQNIGR 656
Query: 663 YWPTYVSQNGGCTDSCNYRGAYSSNKCLKNCGKPSQSLYHVPRSWLKSSGNTLVLFEEIG 722
+WP Y ++ G C + C+Y G ++ KCL NCG+ SQ YHVPRSWLK + N +++ EE G
Sbjct: 657 HWPAYTAR-GKC-ERCSYAGTFTEKKCLSNCGEASQRWYHVPRSWLKPTNNLVIVLEEWG 714
Query: 723 GDPTKISFVTK 733
G+P IS V +
Sbjct: 715 GEPNGISLVKR 725
>gi|54111247|dbj|BAC10578.2| beta-galactosidase [Capsicum annuum]
Length = 724
Score = 859 bits (2219), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 419/741 (56%), Positives = 516/741 (69%), Gaps = 28/741 (3%)
Query: 1 MASKEILLLVLCWGFVVLATTSFGANVTYDHRAVVIGGKRRVLISGSIHYPRSTPEMWPD 60
M S +LL+VL + L ANV+YD RA+VI GKR++LISGSIHYPRSTP+MWPD
Sbjct: 2 MKSNNVLLVVLVICSLDLLVK---ANVSYDDRAIVINGKRKILISGSIHYPRSTPQMWPD 58
Query: 61 LIQKSKDGGLDVIETYVFWNLHEPVRNQYNFEGRYDLVKFVKLVAEAGLYAHLRIGPYVC 120
LIQK+KDGGLDVIETYVFWN HEP +YNFEGRYDLVKF+KLV AGLY +LRIGPY+C
Sbjct: 59 LIQKAKDGGLDVIETYVFWNGHEPSPGKYNFEGRYDLVKFIKLVQGAGLYVNLRIGPYIC 118
Query: 121 AEWNFGGFPLWLHFIPGIQFRTDNEPFKAEMQRFTAKIVDMMKQEKLYASQGGPIILSQI 180
AEWNFGG P+WL ++ G++FRTDN+PFK MQ F KIV MMK EKL+ QGGPII++QI
Sbjct: 119 AEWNFGGLPVWLKYVSGMEFRTDNQPFKVAMQGFVQKIVSMMKSEKLFEPQGGPIIMAQI 178
Query: 181 ENEYGNIDSAYGAAGKSYIKWAAGMALSLDTGVPWVMCQQSDAPDPIINTCNGFYCDQFT 240
ENEYG ++ GA GK+Y KWAA MA+ L T VPW+MC+Q DAPDP+I+TCNGFYC+ F
Sbjct: 179 ENEYGPVEWEIGAPGKAYTKWAAQMAVGLKTDVPWIMCKQEDAPDPVIDTCNGFYCEGFR 238
Query: 241 PNSNNKPKMWTENWSGWFLSFGGAVPYRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNFD 300
PN KPKMWTE W+GWF FGG +P RP ED+AF+VARF Q G++ NYYMYHGGTNF
Sbjct: 239 PNKPYKPKMWTEVWTGWFTKFGGPIPQRPAEDIAFSVARFVQNNGSYFNYYMYHGGTNFG 298
Query: 301 RTSGGPFISTSYDYDAPLDEYGLIRQPKWGHLKDLHKAIKLCEAALVATDPTYPSLGPNL 360
RTS G FI+TSYDYDAP+DEYGL+ +PK+GHL++LHKAIK CE ALV++ PT SLG N
Sbjct: 299 RTSSGLFIATSYDYDAPIDEYGLLNEPKYGHLRELHKAIKQCEPALVSSYPTVTSLGSNQ 358
Query: 361 EATVYKTGSGLCSAFLANIGTNSDVTVKFNGNSYLLPAWSVSILPDCKNVVFNTAKI--- 417
EA VY++ SG C+AFL+N V V F Y LP WS+SILPDCK VV+NTAK+
Sbjct: 359 EAHVYRSKSGACAAFLSNYDAKYSVRVSFQNLPYDLPPWSISILPDCKTVVYNTAKVSSQ 418
Query: 418 -NSVTLVPSFSRQSLQVAADSSDAIGSGWSYINEPVGISKD-DAFTKPGLLEQINTTADQ 475
+S+ + P+ G W NE + D D GL EQ N T D
Sbjct: 419 GSSIKMTPA--------------GGGLSWQSYNEDTPTADDSDTLRANGLWEQRNVTRDS 464
Query: 476 SDYLWYSLSTNIKADEPLLEDGSKTVLHVQSLGHALHAFINGKLVGSGYGSSSNAKVTVD 535
SDYLWY NI ++E L+ G L V S GH LH F+NGKL G+ YG+ N K+T
Sbjct: 465 SDYLWYMTDINIASNEGFLKSGKDPYLTVMSAGHVLHVFVNGKLAGTVYGALDNPKLTYS 524
Query: 536 FPIALAPGKNTFDLLSLTVGLQNYGAFYEKTGAGITGPVQLKGSGNGTNIDLSSQQWTYQ 595
+ L G N LLS++VGL N G Y+ AG+ GPV L G G+ DL+ Q+W+Y+
Sbjct: 525 GNVKLNAGINKISLLSVSVGLPNVGVHYDTWNAGVLGPVTLSGLNEGSR-DLAKQKWSYK 583
Query: 596 TGLKGEELNFPS---GSSTQWDSKSTLPKLQPLVWYKTTFDAPAGSEPVAIDFTGMGKGE 652
GLKGE L+ + SS +W S + + QPL WYK TF AP G+EP+A+D MGKG+
Sbjct: 584 VGLKGESLSLHTLSGSSSVEWVQGSLVARTQPLTWYKATFSAPGGNEPLALDMASMGKGQ 643
Query: 653 AWVNGQSIGRYWPTYVSQNGGCTDSCNYRGAYSSNKCLKNCGKPSQSLYHVPRSWLKSSG 712
W+NG+ +GR+WP Y +Q G C+ C+Y G ++ KC NCG+PSQ YHVPRSWLK+SG
Sbjct: 644 IWINGEGVGRHWPGYAAQ-GDCS-KCSYAGTFNEKKCQTNCGQPSQRWYHVPRSWLKTSG 701
Query: 713 NTLVLFEEIGGDPTKISFVTK 733
N LV+FEE GGDPT IS V +
Sbjct: 702 NLLVVFEEWGGDPTGISLVRR 722
>gi|357124047|ref|XP_003563718.1| PREDICTED: beta-galactosidase 9-like isoform 1 [Brachypodium
distachyon]
Length = 719
Score = 858 bits (2216), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 418/712 (58%), Positives = 500/712 (70%), Gaps = 21/712 (2%)
Query: 25 ANVTYDHRAVVIGGKRRVLISGSIHYPRSTPEMWPDLIQKSKDGGLDVIETYVFWNLHEP 84
A V+YDH+A+VI G+RR+L+SGSIHYPRSTPEMWPDLIQK+KDGGLDVI+TYVFWN HEP
Sbjct: 24 AAVSYDHKAIVINGQRRILMSGSIHYPRSTPEMWPDLIQKAKDGGLDVIQTYVFWNGHEP 83
Query: 85 VRNQYNFEGRYDLVKFVKLVAEAGLYAHLRIGPYVCAEWNFGGFPLWLHFIPGIQFRTDN 144
V+ QY F RYDLV+FVKL +AGLY HLRIGPYVCAEWNFGGFP+WL ++PGI FRTDN
Sbjct: 84 VQGQYYFGDRYDLVRFVKLAKQAGLYVHLRIGPYVCAEWNFGGFPVWLKYVPGISFRTDN 143
Query: 145 EPFKAEMQRFTAKIVDMMKQEKLYASQGGPIILSQIENEYGNIDSAYGAAGKSYIKWAAG 204
PFKA MQ F KIV MMK E L+ QGGPIIL+Q+ENEYG ++S G K Y WAA
Sbjct: 144 GPFKAAMQTFVEKIVSMMKSEGLFEWQGGPIILAQVENEYGPMESVMGGGAKPYANWAAK 203
Query: 205 MALSLDTGVPWVMCQQSDAPDPIINTCNGFYCDQFTPNSNNKPKMWTENWSGWFLSFGGA 264
MA++ GVPWVMC+Q DAPDP+INTCNGFYCD FTPNSN KP MWTE WSGWF +FGGA
Sbjct: 204 MAVATGAGVPWVMCKQDDAPDPVINTCNGFYCDYFTPNSNGKPNMWTEAWSGWFTAFGGA 263
Query: 265 VPYRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNFDRTSGGPFISTSYDYDAPLDEYGLI 324
VP+RPVEDLAFAVARF Q+GG+F NYYMYHGGTNFDRT+GGPFI+TSYDYDAP+DEYGL+
Sbjct: 264 VPHRPVEDLAFAVARFVQKGGSFVNYYMYHGGTNFDRTAGGPFIATSYDYDAPIDEYGLL 323
Query: 325 RQPKWGHLKDLHKAIKLCEAALVATDPTYPSLGPNLEATVYKTGSGLCSAFLANIGTNSD 384
RQPKWGHL+DLHKAIK E A+V+ DPT S+G +A V+K+ +G C+AFL+N T+S
Sbjct: 324 RQPKWGHLRDLHKAIKQAEPAMVSGDPTIQSIGNYEKAYVFKSSTGACAAFLSNYHTSSP 383
Query: 385 VTVKFNGNSYLLPAWSVSILPDCKNVVFNTAKINSVTLVPSFSRQSLQVAADSSDAIGSG 444
V +NG Y LPAWS+SILPDCK V+NTA + PS A + A G
Sbjct: 384 AKVVYNGRRYELPAWSISILPDCKTAVYNTATVKE----PS-------APAKMNPAGGFS 432
Query: 445 WSYINEPVGISKDDAFTKPGLLEQINTTADQSDYLWYSLSTNIKADEPLLEDGSKTVLHV 504
W +E D AFTK GL+EQ++ T D+SD+LWY+ NI + E L+ G L +
Sbjct: 433 WQSYSEDTNSLDDSAFTKDGLVEQLSMTWDKSDFLWYTTYVNIDSSEQFLKSGQWPQLTI 492
Query: 505 QSLGHALHAFINGKLVGSGYGSSSNAKVTVDFPIALAPGKNTFDLLSLTVGLQNYGAFYE 564
S GH L F+NG+ G+GYG + K++ + + G N +LS VGL N G YE
Sbjct: 493 NSAGHTLQVFVNGQSYGAGYGGYDSPKLSYSKYVKMWQGSNKISILSSAVGLANQGTHYE 552
Query: 565 KTGAGITGPVQLKGSGNGTNIDLSSQQWTYQTGLKGEELNFPS---GSSTQWDSKSTLPK 621
G+ GPV L G G DLS+Q+WTYQ GLKGE L S SS +W S +
Sbjct: 553 NWNVGVLGPVTLSGLNQGKR-DLSNQKWTYQIGLKGESLGVHSITGSSSVEWGSAN---G 608
Query: 622 LQPLVWYKTTFDAPAGSEPVAIDFTGMGKGEAWVNGQSIGRYWPTYVSQNGGCTDSCNYR 681
QPL W+K F APAG PVA+D MGKG+ WVNG++ GRYW S G SC+Y
Sbjct: 609 AQPLTWHKAYFSAPAGGAPVALDMGSMGKGQIWVNGRNAGRYWSYKAS---GSCGSCSYT 665
Query: 682 GAYSSNKCLKNCGKPSQSLYHVPRSWLKSSGNTLVLFEEIGGDPTKISFVTK 733
G YS KC NCG SQ YHVPRSWL SGN LV+ EE GGD + + +T+
Sbjct: 666 GTYSETKCQTNCGDISQRWYHVPRSWLNPSGNLLVVLEEFGGDLSGVKLMTR 717
>gi|13936236|gb|AAK40304.1| beta-galactosidase [Capsicum annuum]
Length = 724
Score = 857 bits (2215), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 418/741 (56%), Positives = 516/741 (69%), Gaps = 28/741 (3%)
Query: 1 MASKEILLLVLCWGFVVLATTSFGANVTYDHRAVVIGGKRRVLISGSIHYPRSTPEMWPD 60
M S +LL+VL + L ANV+YD RA+VI GKR++LISGSIHYPRSTP+MWPD
Sbjct: 2 MKSNNVLLVVLVICSLDLLVK---ANVSYDDRAIVINGKRKILISGSIHYPRSTPQMWPD 58
Query: 61 LIQKSKDGGLDVIETYVFWNLHEPVRNQYNFEGRYDLVKFVKLVAEAGLYAHLRIGPYVC 120
LI+K+KDGGLDVIETYVFWN HEP +YNFEGRYDLVKF+KLV AGLY +LRIGPY+C
Sbjct: 59 LIEKAKDGGLDVIETYVFWNGHEPSPGKYNFEGRYDLVKFIKLVQGAGLYVNLRIGPYIC 118
Query: 121 AEWNFGGFPLWLHFIPGIQFRTDNEPFKAEMQRFTAKIVDMMKQEKLYASQGGPIILSQI 180
AEWNFGG P+WL ++ G++FRTDN+PFK MQ F KIV MMK EKL+ QGGPII++QI
Sbjct: 119 AEWNFGGLPVWLKYVSGMEFRTDNQPFKVAMQGFVQKIVSMMKSEKLFEPQGGPIIMAQI 178
Query: 181 ENEYGNIDSAYGAAGKSYIKWAAGMALSLDTGVPWVMCQQSDAPDPIINTCNGFYCDQFT 240
ENEYG ++ GA GK+Y KWAA MA+ L T VPW+MC+Q DAPDP+I+TCNGFYC+ F
Sbjct: 179 ENEYGPVEWEIGAPGKAYTKWAAQMAVGLKTDVPWIMCKQEDAPDPVIDTCNGFYCEGFR 238
Query: 241 PNSNNKPKMWTENWSGWFLSFGGAVPYRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNFD 300
PN KPKMWTE W+GWF FGG +P RP ED+AF+VARF Q G++ NYYMYHGGTNF
Sbjct: 239 PNKPYKPKMWTEVWTGWFTKFGGPIPQRPAEDIAFSVARFVQNNGSYFNYYMYHGGTNFG 298
Query: 301 RTSGGPFISTSYDYDAPLDEYGLIRQPKWGHLKDLHKAIKLCEAALVATDPTYPSLGPNL 360
RTS G FI+TSYDYDAP+DEYGL+ +PK+GHL++LHKAIK CE ALV++ PT SLG N
Sbjct: 299 RTSSGLFIATSYDYDAPIDEYGLLNEPKYGHLRELHKAIKQCEPALVSSYPTVTSLGSNQ 358
Query: 361 EATVYKTGSGLCSAFLANIGTNSDVTVKFNGNSYLLPAWSVSILPDCKNVVFNTAKI--- 417
EA VY++ SG C+AFL+N V V F Y LP WS+SILPDCK VV+NTAK+
Sbjct: 359 EAHVYRSKSGACAAFLSNYDAKYSVRVSFQNLPYDLPPWSISILPDCKTVVYNTAKVSSQ 418
Query: 418 -NSVTLVPSFSRQSLQVAADSSDAIGSGWSYINEPVGISKD-DAFTKPGLLEQINTTADQ 475
+S+ + P+ G W NE + D D GL EQ N T D
Sbjct: 419 GSSIKMTPA--------------GGGLSWQSYNEDTPTADDSDTLRANGLWEQRNVTRDS 464
Query: 476 SDYLWYSLSTNIKADEPLLEDGSKTVLHVQSLGHALHAFINGKLVGSGYGSSSNAKVTVD 535
SDYLWY NI ++E L+ G L V S GH LH F+NGKL G+ YG+ N K+T
Sbjct: 465 SDYLWYMTDVNIASNEGFLKSGKDPYLTVMSAGHVLHVFVNGKLAGTVYGALDNPKLTYS 524
Query: 536 FPIALAPGKNTFDLLSLTVGLQNYGAFYEKTGAGITGPVQLKGSGNGTNIDLSSQQWTYQ 595
+ L G N LLS++VGL N G Y+ AG+ GPV L G G+ DL+ Q+W+Y+
Sbjct: 525 GNVKLNAGINKISLLSVSVGLPNVGVHYDTWNAGVLGPVTLSGLNEGSR-DLAKQKWSYK 583
Query: 596 TGLKGEELNFPS---GSSTQWDSKSTLPKLQPLVWYKTTFDAPAGSEPVAIDFTGMGKGE 652
GLKGE L+ + SS +W S + + QPL WYK TF AP G+EP+A+D MGKG+
Sbjct: 584 VGLKGESLSLHTLSGSSSVEWVQGSLVARTQPLTWYKATFSAPGGNEPLALDMASMGKGQ 643
Query: 653 AWVNGQSIGRYWPTYVSQNGGCTDSCNYRGAYSSNKCLKNCGKPSQSLYHVPRSWLKSSG 712
W+NG+ +GR+WP Y +Q G C+ C+Y G ++ KC NCG+PSQ YHVPRSWLK+SG
Sbjct: 644 IWINGEGVGRHWPGYAAQ-GDCS-KCSYAGTFNEKKCQTNCGQPSQRWYHVPRSWLKTSG 701
Query: 713 NTLVLFEEIGGDPTKISFVTK 733
N LV+FEE GGDPT IS V +
Sbjct: 702 NLLVVFEEWGGDPTGISLVRR 722
>gi|357124049|ref|XP_003563719.1| PREDICTED: beta-galactosidase 9-like isoform 2 [Brachypodium
distachyon]
Length = 721
Score = 857 bits (2213), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 417/712 (58%), Positives = 502/712 (70%), Gaps = 19/712 (2%)
Query: 25 ANVTYDHRAVVIGGKRRVLISGSIHYPRSTPEMWPDLIQKSKDGGLDVIETYVFWNLHEP 84
A V+YDH+A+VI G+RR+L+SGSIHYPRSTPEMWPDLIQK+KDGGLDVI+TYVFWN HEP
Sbjct: 24 AAVSYDHKAIVINGQRRILMSGSIHYPRSTPEMWPDLIQKAKDGGLDVIQTYVFWNGHEP 83
Query: 85 VRNQYNFEGRYDLVKFVKLVAEAGLYAHLRIGPYVCAEWNFGGFPLWLHFIPGIQFRTDN 144
V+ QY F RYDLV+FVKL +AGLY HLRIGPYVCAEWNFGGFP+WL ++PGI FRTDN
Sbjct: 84 VQGQYYFGDRYDLVRFVKLAKQAGLYVHLRIGPYVCAEWNFGGFPVWLKYVPGISFRTDN 143
Query: 145 EPFKAEMQRFTAKIVDMMKQEKLYASQGGPIILSQIENEYGNIDSAYGAAGKSYIKWAAG 204
PFKA MQ F KIV MMK E L+ QGGPIIL+Q+ENEYG ++S G K Y WAA
Sbjct: 144 GPFKAAMQTFVEKIVSMMKSEGLFEWQGGPIILAQVENEYGPMESVMGGGAKPYANWAAK 203
Query: 205 MALSLDTGVPWVMCQQSDAPDPIINTCNGFYCDQFTPNSNNKPKMWTENWSGWFLSFGGA 264
MA++ GVPWVMC+Q DAPDP+INTCNGFYCD FTPNSN KP MWTE WSGWF +FGGA
Sbjct: 204 MAVATGAGVPWVMCKQDDAPDPVINTCNGFYCDYFTPNSNGKPNMWTEAWSGWFTAFGGA 263
Query: 265 VPYRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNFDRTSGGPFISTSYDYDAPLDEYGLI 324
VP+RPVEDLAFAVARF Q+GG+F NYYMYHGGTNFDRT+GGPFI+TSYDYDAP+DEYGL+
Sbjct: 264 VPHRPVEDLAFAVARFVQKGGSFVNYYMYHGGTNFDRTAGGPFIATSYDYDAPIDEYGLL 323
Query: 325 RQPKWGHLKDLHKAIKLCEAALVATDPTYPSLGPNLEATVYKTGSGLCSAFLANIGTNSD 384
RQPKWGHL+DLHKAIK E A+V+ DPT S+G +A V+K+ +G C+AFL+N T+S
Sbjct: 324 RQPKWGHLRDLHKAIKQAEPAMVSGDPTIQSIGNYEKAYVFKSSTGACAAFLSNYHTSSP 383
Query: 385 VTVKFNGNSYLLPAWSVSILPDCKNVVFNTAKINSVTLVPSFSRQSLQVAADSSDAIGSG 444
V +NG Y LPAWS+SILPDCK V+NTA T+ + + L + + A G
Sbjct: 384 AKVVYNGRRYELPAWSISILPDCKTAVYNTA-----TVRQKWKEKKLWM----NPAGGFS 434
Query: 445 WSYINEPVGISKDDAFTKPGLLEQINTTADQSDYLWYSLSTNIKADEPLLEDGSKTVLHV 504
W +E D AFTK GL+EQ++ T D+SD+LWY+ NI + E L+ G L +
Sbjct: 435 WQSYSEDTNSLDDSAFTKDGLVEQLSMTWDKSDFLWYTTYVNIDSSEQFLKSGQWPQLTI 494
Query: 505 QSLGHALHAFINGKLVGSGYGSSSNAKVTVDFPIALAPGKNTFDLLSLTVGLQNYGAFYE 564
S GH L F+NG+ G+GYG + K++ + + G N +LS VGL N G YE
Sbjct: 495 NSAGHTLQVFVNGQSYGAGYGGYDSPKLSYSKYVKMWQGSNKISILSSAVGLANQGTHYE 554
Query: 565 KTGAGITGPVQLKGSGNGTNIDLSSQQWTYQTGLKGEELNFPS---GSSTQWDSKSTLPK 621
G+ GPV L G G DLS+Q+WTYQ GLKGE L S SS +W S +
Sbjct: 555 NWNVGVLGPVTLSGLNQGKR-DLSNQKWTYQIGLKGESLGVHSITGSSSVEWGSAN---G 610
Query: 622 LQPLVWYKTTFDAPAGSEPVAIDFTGMGKGEAWVNGQSIGRYWPTYVSQNGGCTDSCNYR 681
QPL W+K F APAG PVA+D MGKG+ WVNG++ GRYW S G SC+Y
Sbjct: 611 AQPLTWHKAYFSAPAGGAPVALDMGSMGKGQIWVNGRNAGRYWSYKAS---GSCGSCSYT 667
Query: 682 GAYSSNKCLKNCGKPSQSLYHVPRSWLKSSGNTLVLFEEIGGDPTKISFVTK 733
G YS KC NCG SQ YHVPRSWL SGN LV+ EE GGD + + +T+
Sbjct: 668 GTYSETKCQTNCGDISQRWYHVPRSWLNPSGNLLVVLEEFGGDLSGVKLMTR 719
>gi|4538943|emb|CAB39679.1| putative beta-galactosidase [Arabidopsis thaliana]
gi|7269465|emb|CAB79469.1| putative beta-galactosidase [Arabidopsis thaliana]
Length = 729
Score = 857 bits (2213), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 425/732 (58%), Positives = 521/732 (71%), Gaps = 20/732 (2%)
Query: 6 ILLLVLCWGFVVLATTSFGANVTYDHRAVVIGGKRRVLISGSIHYPRSTPEMWPDLIQKS 65
ILL +LC ++ S A VTYD +AV+I G+RR+L+SGSIHYPRSTPEMWPDLIQK+
Sbjct: 11 ILLGILCCSSLIC---SVKAIVTYDRKAVIINGQRRILLSGSIHYPRSTPEMWPDLIQKA 67
Query: 66 KDGGLDVIETYVFWNLHEPVRNQYNFEGRYDLVKFVKLVAEAGLYAHLRIGPYVCAEWNF 125
KDGGLDVI+TYVFWN HEP QY FE RYDLVKF+K+V +AGLY HLRIGPYVCAEWNF
Sbjct: 68 KDGGLDVIQTYVFWNGHEPSPGQYYFEDRYDLVKFIKVVQQAGLYVHLRIGPYVCAEWNF 127
Query: 126 GGFPLWLHFIPGIQFRTDNEPFKAEMQRFTAKIVDMMKQEKLYASQGGPIILSQIENEYG 185
GGFP+WL ++PG+ FRTDNEPFKA MQ+FT KIV MMK+EKL+ +QGGPIILSQIENEYG
Sbjct: 128 GGFPVWLKYVPGMVFRTDNEPFKAAMQKFTEKIVRMMKEEKLFETQGGPIILSQIENEYG 187
Query: 186 NIDSAYGAAGKSYIKWAAGMALSLDTGVPWVMCQQSDAPDPIINTCNGFYCDQFTPNSNN 245
I+ GA GK+Y KW A MA L TGVPW+MC+Q DAP+ IINTCNGFYC+ F PNS+N
Sbjct: 188 PIEWEIGAPGKAYTKWVAEMAQGLSTGVPWIMCKQDDAPNSIINTCNGFYCENFKPNSDN 247
Query: 246 KPKMWTENWSGWFLSFGGAVPYRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNFDRTSGG 305
KPKMWTENW+GWF FGGAVPYRP ED+A +VARF Q GG+F NYYMYHGGTNFDRT+ G
Sbjct: 248 KPKMWTENWTGWFTEFGGAVPYRPAEDIALSVARFIQNGGSFINYYMYHGGTNFDRTA-G 306
Query: 306 PFISTSYDYDAPLDEYGLIRQPKWGHLKDLHKAIKLCEAALVATDPTYPSLGPNLEATVY 365
FI+TSYDYDAPLDEYGL R+PK+ HLK LHK IKLCE ALV+ DPT SLG EA V+
Sbjct: 307 EFIATSYDYDAPLDEYGLPREPKYSHLKRLHKVIKLCEPALVSADPTVTSLGDKQEAHVF 366
Query: 366 KTGSGLCSAFLANIGTNSDVTVKFNGNSYLLPAWSVSILPDCKNVVFNTAKINSVTLVPS 425
K+ S C+AFL+N T+S V F G++Y LP WSVSILPDCK +NTAK+ T
Sbjct: 367 KSKSS-CAAFLSNYNTSSAARVLFGGSTYDLPPWSVSILPDCKTEYYNTAKVQVRT---- 421
Query: 426 FSRQSLQVAADSSDAIGSGWSYINEPVGISKDDAFTKPGLLEQINTTADQSDYLWYSLST 485
S+ + ++ S SY E + + F++ GL+EQI+ T D++DY WY
Sbjct: 422 ---SSIHMKMVPTNTPFSWGSYNEEIPSANDNGTFSQDGLVEQISITRDKTDYFWYLTDI 478
Query: 486 NIKADEPLLEDGSKTVLHVQSLGHALHAFINGKLVGSGYGSSSNAKVTVDFPIALAPGKN 545
I DE L G +L + S GHALH F+NG+L G+ YGS K+T I L G N
Sbjct: 479 TISPDEKFL-TGEDPLLTIGSAGHALHVFVNGQLAGTAYGSLEKPKLTFSQKIKLHAGVN 537
Query: 546 TFDLLSLTVGLQNYGAFYEKTGAGITGPVQLKGSGNGTNIDLSSQQWTY-QTGLKGEELN 604
LLS GL N G YE G+ GPV L G +GT D++ +W+Y Q G KGE L+
Sbjct: 538 KLALLSTAAGLPNVGVHYETWNTGVLGPVTLNGVNSGT-WDMTKWKWSYKQIGTKGEALS 596
Query: 605 FPS--GSST-QWDSKSTLPKLQPLVWYKTTFDAPAGSEPVAIDFTGMGKGEAWVNGQSIG 661
+ GSST +W S + K QPL WYK+TFD+P G+EP+A+D MGKG+ W+NGQ+IG
Sbjct: 597 VHTLAGSSTVEWKEGSLVAKKQPLTWYKSTFDSPTGNEPLALDMNTMGKGQMWINGQNIG 656
Query: 662 RYWPTYVSQNGGCTDSCNYRGAYSSNKCLKNCGKPSQSLYHVPRSWLKSSGNTLVLFEEI 721
R+WP Y ++ G C + C+Y G ++ KCL NCG+ SQ YHVPRSWLK + N +++ EE
Sbjct: 657 RHWPAYTAR-GKC-ERCSYAGTFTEKKCLSNCGEASQRWYHVPRSWLKPTNNLVIVLEEW 714
Query: 722 GGDPTKISFVTK 733
GG+P IS V +
Sbjct: 715 GGEPNGISLVKR 726
>gi|356509962|ref|XP_003523711.1| PREDICTED: beta-galactosidase 3-like isoform 2 [Glycine max]
Length = 729
Score = 856 bits (2212), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 418/714 (58%), Positives = 513/714 (71%), Gaps = 29/714 (4%)
Query: 26 NVTYDHRAVVIGGKRRVLISGSIHYPRSTPEMWPDLIQKSKDGGLDVIETYVFWNLHEPV 85
NVTYD ++++I G+RR+LISGSIHYPRSTPEMW DLI K+K GGLDVI+TYVFW++HEP
Sbjct: 29 NVTYDRKSLLINGQRRILISGSIHYPRSTPEMWEDLIWKAKHGGLDVIDTYVFWDVHEPS 88
Query: 86 RNQYNFEGRYDLVKFVKLVAEAGLYAHLRIGPYVCAEWNFGGFPLWLHFIPGIQFRTDNE 145
Y+FEGRYDLV+F+K V + GLYA+LRIGPYVCAEWNFGG P+WL ++PG+ FRTDNE
Sbjct: 89 PGNYDFEGRYDLVRFIKTVQKVGLYANLRIGPYVCAEWNFGGIPVWLKYVPGVSFRTDNE 148
Query: 146 PFKAEMQRFTAKIVDMMKQEKLYASQGGPIILSQIENEYGNIDSAYGAAGKSYIKWAAGM 205
PFKA MQ FT KIV MMK EKL+ SQGGPIILSQIENEYG + GAAG++Y+ WAA M
Sbjct: 149 PFKAAMQGFTQKIVQMMKSEKLFQSQGGPIILSQIENEYG--PESRGAAGRAYVNWAASM 206
Query: 206 ALSLDTGVPWVMCQQSDAPDPIINTCNGFYCDQFTPNSNNKPKMWTENWSGWFLSFGGAV 265
A+ L TGVPWVMC+++DAPDP+IN+CNGFYCD F+PN KP MWTE WSGWF FGG +
Sbjct: 207 AVGLGTGVPWVMCKENDAPDPVINSCNGFYCDDFSPNKPYKPSMWTETWSGWFTEFGGPI 266
Query: 266 PYRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNFDRTSGGPFISTSYDYDAPLDEYGLIR 325
RPVEDL+FAVARF Q+GG++ NYYMYHGGTNF R++GGPFI+TSYDYDAP+DEYGLIR
Sbjct: 267 HQRPVEDLSFAVARFIQKGGSYVNYYMYHGGTNFGRSAGGPFITTSYDYDAPIDEYGLIR 326
Query: 326 QPKWGHLKDLHKAIKLCEAALVATDPTYPSLGPNLEATVYKTGSGLCSAFLANIGTNSDV 385
QPK+ HLK+LHKAIK CE ALV+ DPT SLG L+A V+ +G+G C+AFLAN S
Sbjct: 327 QPKYSHLKELHKAIKRCEHALVSLDPTVLSLGTLLQAHVFSSGTGTCAAFLANYNAQSAA 386
Query: 386 TVKFNGNSYLLPAWSVSILPDCKNVVFNTAKINSVTLVPS-FSRQSLQVAADSSDAIGSG 444
TV FN Y LP WS+SILPDCK VFNTAK+ + + P FS +
Sbjct: 387 TVTFNNRHYDLPPWSISILPDCKIDVFNTAKVKMLPVKPKLFSWE--------------- 431
Query: 445 WSYINEPVGISKDDAFTKPGLLEQINTTADQSDYLWYSLSTNIKADEPLLEDGSKTVLHV 504
SY + +++ T PGLLEQ+N T D SDYLWY S +I + E L G K ++V
Sbjct: 432 -SYDEDLSSLAESSRITAPGLLEQLNVTRDTSDYLWYITSVDISSSESFLRGGQKPSINV 490
Query: 505 QSLGHALHAFINGKLVGSGYGSSSNAKVTVDFPIALAPGKNTFDLLSLTVGLQNYGAFYE 564
QS GHA+H F+NG+ GS +G+ T + P+ L G N LLS+TVGLQN G YE
Sbjct: 491 QSAGHAVHVFVNGQFSGSAFGTREQRSCTYNGPVDLRAGANKIALLSVTVGLQNVGRHYE 550
Query: 565 KTGAGITGPVQLKGSGNGTNIDLSSQQWTYQTGLKGEELNF--PSG-SSTQW--DSKSTL 619
AGITGPV L G G DL+ +W+Y+ GL+GE +N P+G SS W +S++T
Sbjct: 551 TWEAGITGPVLLHGLDQGQK-DLTWNKWSYKVGLRGEAMNLVSPNGVSSVDWVQESQATQ 609
Query: 620 PKLQPLVWYKTTFDAPAGSEPVAIDFTGMGKGEAWVNGQSIGRYWPTYVSQNGGCTDSCN 679
+ Q L WYK FDAP G EP+A+D MGKG+ W+NGQSIGRYW Y G C +SC
Sbjct: 610 SRSQ-LKWYKAYFDAPGGKEPLALDLESMGKGQVWINGQSIGRYWMAYA--KGDC-NSCT 665
Query: 680 YRGAYSSNKCLKNCGKPSQSLYHVPRSWLKSSGNTLVLFEEIGGDPTKISFVTK 733
Y G + KC CG+P+Q YHVPRSWLK + N +V+FEE+GG+P KIS V +
Sbjct: 666 YSGTFRPVKCQLGCGQPTQRWYHVPRSWLKPTKNLIVVFEELGGNPWKISLVKR 719
>gi|15219534|ref|NP_175127.1| beta-galactosidase 5 [Arabidopsis thaliana]
gi|75192251|sp|Q9MAJ7.1|BGAL5_ARATH RecName: Full=Beta-galactosidase 5; Short=Lactase 5; Flags:
Precursor
gi|7767665|gb|AAF69162.1|AC007915_14 F27F5.20 [Arabidopsis thaliana]
gi|17979002|gb|AAL47461.1| At1g45130/F27F5_20 [Arabidopsis thaliana]
gi|20334754|gb|AAM16238.1| At1g45130/F27F5_20 [Arabidopsis thaliana]
gi|332193961|gb|AEE32082.1| beta-galactosidase 5 [Arabidopsis thaliana]
Length = 732
Score = 856 bits (2211), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 430/739 (58%), Positives = 507/739 (68%), Gaps = 30/739 (4%)
Query: 6 ILLLVLCWGFVVLATTSFGANVTYDHRAVVIGGKRRVLISGSIHYPRSTPEMWPDLIQKS 65
LL + G V+ +S VTYD +A+VI G RR+L+SGSIHYPRSTPEMW DLI+K+
Sbjct: 14 FLLTTMLIGSSVIQCSS----VTYDKKAIVINGHRRILLSGSIHYPRSTPEMWEDLIKKA 69
Query: 66 KDGGLDVIETYVFWNLHEPVRNQYNFEGRYDLVKFVKLVAEAGLYAHLRIGPYVCAEWNF 125
KDGGLDVI+TYVFWN HEP YNFEGRYDLV+F+K + E GLY HLRIGPYVCAEWNF
Sbjct: 70 KDGGLDVIDTYVFWNGHEPSPGTYNFEGRYDLVRFIKTIQEVGLYVHLRIGPYVCAEWNF 129
Query: 126 GGFPLWLHFIPGIQFRTDNEPFKAEMQRFTAKIVDMMKQEKLYASQGGPIILSQIENEYG 185
GGFP+WL ++ GI FRTDN PFK+ MQ FT KIV MMK+ + +ASQGGPIILSQIENE+
Sbjct: 130 GGFPVWLKYVDGISFRTDNGPFKSAMQGFTEKIVQMMKEHRFFASQGGPIILSQIENEFE 189
Query: 186 NIDSAYGAAGKSYIKWAAGMALSLDTGVPWVMCQQSDAPDPIINTCNGFYCDQFTPNSNN 245
G AG SY+ WAA MA+ L+TGVPWVMC++ DAPDPIINTCNGFYCD FTPN
Sbjct: 190 PDLKGLGPAGHSYVNWAAKMAVGLNTGVPWVMCKEDDAPDPIINTCNGFYCDYFTPNKPY 249
Query: 246 KPKMWTENWSGWFLSFGGAVPYRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNFDRTSGG 305
KP MWTE WSGWF FGG VP RPVEDLAF VARF Q+GG++ NYYMYHGGTNF RT+GG
Sbjct: 250 KPTMWTEAWSGWFTEFGGTVPKRPVEDLAFGVARFIQKGGSYINYYMYHGGTNFGRTAGG 309
Query: 306 PFISTSYDYDAPLDEYGLIRQPKWGHLKDLHKAIKLCEAALVATDPTYPSLGPNLEATVY 365
PFI+TSYDYDAP+DEYGL+++PK+ HLK LH+AIK CEAALV++DP LG EA V+
Sbjct: 310 PFITTSYDYDAPIDEYGLVQEPKYSHLKQLHQAIKQCEAALVSSDPHVTKLGNYEEAHVF 369
Query: 366 KTGSGLCSAFLANIGTNSDVTVKFNGNSYLLPAWSVSILPDCKNVVFNT----AKINSVT 421
G G C AFL N N+ V FN Y LPAWS+SILPDC+NVVFNT AK + V
Sbjct: 370 TAGKGSCVAFLTNYHMNAPAKVVFNNRHYTLPAWSISILPDCRNVVFNTATVAAKTSHVQ 429
Query: 422 LVPSFSRQSLQVAADSSDAIGSGWSYINEPVGISKDDAFTKPGLLEQINTTADQSDYLWY 481
+VPS S L A + I +Y N T GLLEQ+N T D +DYLWY
Sbjct: 430 MVPSGS--ILYSVARYDEDIA---TYGNR-------GTITARGLLEQVNVTRDTTDYLWY 477
Query: 482 SLSTNIKADEPLLEDGSKTVLHVQSLGHALHAFINGKLVGSGYGSSSNAKVTVDFPIALA 541
+ S +IKA E L G L V S GHA+H F+NG GS +G+ N K + + L
Sbjct: 478 TTSVDIKASESFLRGGKWPTLTVDSAGHAVHVFVNGHFYGSAFGTRENRKFSFSSQVNLR 537
Query: 542 PGKNTFDLLSLTVGLQNYGAFYEKTGAGITGPVQLKGSGNGTNIDLSSQQWTYQTGLKGE 601
G N LLS+ VGL N G +E GI G V L G G N DLS Q+WTYQ GL+GE
Sbjct: 538 GGANKIALLSVAVGLPNVGPHFETWATGIVGSVVLHGLDEG-NKDLSWQKWTYQAGLRGE 596
Query: 602 ELNFPS---GSSTQWDSKSTLPKL--QPLVWYKTTFDAPAGSEPVAIDFTGMGKGEAWVN 656
+N S SS W K +L K QPL WYK FDAP G+EP+A+D MGKG+AW+N
Sbjct: 597 SMNLVSPTEDSSVDW-IKGSLAKQNKQPLTWYKAYFDAPRGNEPLALDLKSMGKGQAWIN 655
Query: 657 GQSIGRYWPTYVSQNGGCTDSCNYRGAYSSNKCLKNCGKPSQSLYHVPRSWLKSSGNTLV 716
GQSIGRYW + + G SCNY G Y NKC CG+P+Q YHVPRSWLK GN LV
Sbjct: 656 GQSIGRYWMAFAKGDCG---SCNYAGTYRQNKCQSGCGEPTQRWYHVPRSWLKPKGNLLV 712
Query: 717 LFEEIGGDPTKISFVTKQL 735
LFEE+GGD +K+S V + +
Sbjct: 713 LFEELGGDISKVSVVKRSV 731
>gi|357438127|ref|XP_003589339.1| Beta-galactosidase [Medicago truncatula]
gi|355478387|gb|AES59590.1| Beta-galactosidase [Medicago truncatula]
Length = 745
Score = 855 bits (2210), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 420/712 (58%), Positives = 510/712 (71%), Gaps = 19/712 (2%)
Query: 27 VTYDHRAVVIGGKRRVLISGSIHYPRSTPEMWPDLIQKSKDGGLDVIETYVFWNLHEPVR 86
VTYD +A++I G+RR+LISGSIHYPRSTPEMW DLIQK+KDGGLDVI+TYVFWN+HEP
Sbjct: 29 VTYDRKAIIINGQRRILISGSIHYPRSTPEMWEDLIQKAKDGGLDVIDTYVFWNVHEPSP 88
Query: 87 NQYNFEGRYDLVKFVKLVAEAGLYAHLRIGPYVCAEWNFGGFPLWLHFIPGIQFRTDNEP 146
YNFEGRYDLV+F+K V + GLY HLRIGPYVCAEWNFGGFP+WL ++PGI FRTDN P
Sbjct: 89 GNYNFEGRYDLVQFIKTVQKKGLYVHLRIGPYVCAEWNFGGFPVWLKYVPGISFRTDNGP 148
Query: 147 FKAEMQRFTAKIVDMMKQEKLYASQGGPIILSQIENEYGNIDSAYGAAGKSYIKWAAGMA 206
FKA MQ FT KIV MMK EKL+ SQGGPIILSQIENEYG A GA+G +Y WAA MA
Sbjct: 149 FKAAMQGFTQKIVQMMKNEKLFQSQGGPIILSQIENEYGPQGRALGASGHAYSNWAAKMA 208
Query: 207 LSLDTGVPWVMCQQSDAPDPIINTCNGFYCDQFTPNSNNKPKMWTENWSGWFLSFGGAVP 266
+ L TGVPWVMC++ DAPDP+IN CNGFYCD F+PN KPK+WTE+WSGWF FGG+ P
Sbjct: 209 VGLGTGVPWVMCKEDDAPDPVINACNGFYCDDFSPNKPYKPKLWTESWSGWFSEFGGSNP 268
Query: 267 YRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNFDRTSGGPFISTSYDYDAPLDEYGLIRQ 326
RPVEDLAFAVARF Q+GG+F NYYMYHGGTNF R++GGPFI+TSYDYDAP+DEYGL+R+
Sbjct: 269 QRPVEDLAFAVARFIQKGGSFFNYYMYHGGTNFGRSAGGPFITTSYDYDAPIDEYGLLRE 328
Query: 327 PKWGHLKDLHKAIKLCEAALVATDPTYPSLGPNLEATVYKTGSGLCSAFLANIGTNSDVT 386
PK+GHLKDLHKAIK CE ALV++DPT SLG +A V+ +G+ C+AFLAN +NS
Sbjct: 329 PKYGHLKDLHKAIKQCEHALVSSDPTVTSLGAYEQAHVFSSGT-TCAAFLANYHSNSAAR 387
Query: 387 VKFNGNSYLLPAWSVSILPDCKNVVFNTAKINSVTLVPSFSRQSLQVAADSSDAIGSGWS 446
V FN Y LP WS+SILPDC+ VFNTA++ F +Q+ +S + W
Sbjct: 388 VTFNNRHYDLPPWSISILPDCRTDVFNTARMR-------FQPSQIQMLPSNSKLL--SWE 438
Query: 447 YINEPV-GISKDDAFTKPGLLEQINTTADQSDYLWYSLSTNIKADEPLLEDGSKTVLHVQ 505
+E V +++ T LLEQI+ T D SDYLWY S +I + E L +K + V
Sbjct: 439 TYDEDVSSLAESSRITASRLLEQIDATRDTSDYLWYITSVDISSSESFLRGRNKPSISVH 498
Query: 506 SLGHALHAFINGKLVGSGYGSSSNAKVTVDFPIALAPGKNTFDLLSLTVGLQNYGAFYEK 565
S G A+H FINGK GS +G+ + T + PI L G N LLS+ VGL N G +E
Sbjct: 499 SSGDAVHVFINGKFSGSAFGTREDRSFTFNGPIDLRAGTNKIALLSVAVGLPNGGIHFES 558
Query: 566 TGAGITGPVQLKGSGNGTNIDLSSQQWTYQTGLKGEELNF--PSG-SSTQWDSKSTLPKL 622
+GITGPV L +G DL+ Q+W+YQ GLKGE +N P+G SS W S+S +
Sbjct: 559 WKSGITGPVLLHDLDHGQK-DLTGQKWSYQVGLKGEAMNLVSPNGVSSVDWVSESLASQN 617
Query: 623 QP-LVWYKTTFDAPAGSEPVAIDFTGMGKGEAWVNGQSIGRYWPTYVSQNGGCTDSCNYR 681
QP L W+K F+AP G EP+A+D + MGKG+ W+NGQSIGRYW Y G C +SCNY
Sbjct: 618 QPQLKWHKAHFNAPNGVEPLALDMSSMGKGQVWINGQSIGRYWMVYA--KGNC-NSCNYA 674
Query: 682 GAYSSNKCLKNCGKPSQSLYHVPRSWLKSSGNTLVLFEEIGGDPTKISFVTK 733
G Y KC CG+P+Q YHVPRSWLK N +V+FEE+GG+P KIS V +
Sbjct: 675 GTYRQAKCQVGCGQPTQRWYHVPRSWLKPKNNLMVVFEELGGNPWKISLVKR 726
>gi|6686882|emb|CAB64741.1| putative beta-galactosidase [Arabidopsis thaliana]
Length = 732
Score = 855 bits (2210), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 428/739 (57%), Positives = 505/739 (68%), Gaps = 30/739 (4%)
Query: 6 ILLLVLCWGFVVLATTSFGANVTYDHRAVVIGGKRRVLISGSIHYPRSTPEMWPDLIQKS 65
LL + G V+ +S VTYD +A+VI G RR+L+SGSIHYPRSTPEMW DLI+K+
Sbjct: 14 FLLTTMLIGSSVIQCSS----VTYDKKAIVINGHRRILLSGSIHYPRSTPEMWEDLIKKA 69
Query: 66 KDGGLDVIETYVFWNLHEPVRNQYNFEGRYDLVKFVKLVAEAGLYAHLRIGPYVCAEWNF 125
KDGGLDVI+TYVFWN HEP YNFEGRYDLV+F+K + E GLY HLRIGPYVCAEWNF
Sbjct: 70 KDGGLDVIDTYVFWNGHEPSPGTYNFEGRYDLVRFIKTIQEVGLYVHLRIGPYVCAEWNF 129
Query: 126 GGFPLWLHFIPGIQFRTDNEPFKAEMQRFTAKIVDMMKQEKLYASQGGPIILSQIENEYG 185
GGFP+WL ++ GI FRTDN PFK+ MQ FT KIV MMK+ + +ASQGGPIILSQIENE+
Sbjct: 130 GGFPVWLKYVDGISFRTDNGPFKSAMQGFTEKIVQMMKEHRFFASQGGPIILSQIENEFE 189
Query: 186 NIDSAYGAAGKSYIKWAAGMALSLDTGVPWVMCQQSDAPDPIINTCNGFYCDQFTPNSNN 245
G AG SY+ WAA MA+ L+TGVPWVMC++ DAPDPIINTCNGFYCD FTPN
Sbjct: 190 PDLKGLGPAGHSYVNWAAKMAVGLNTGVPWVMCKEDDAPDPIINTCNGFYCDYFTPNKPY 249
Query: 246 KPKMWTENWSGWFLSFGGAVPYRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNFDRTSGG 305
KP MWTE WSGWF FGG VP RPVEDLAF VARF Q+GG++ NYYMYHGGTNF RT+GG
Sbjct: 250 KPTMWTEAWSGWFTEFGGTVPKRPVEDLAFGVARFIQKGGSYINYYMYHGGTNFGRTAGG 309
Query: 306 PFISTSYDYDAPLDEYGLIRQPKWGHLKDLHKAIKLCEAALVATDPTYPSLGPNLEATVY 365
PFI+TSYDYDAP+DEYGL+++PK+ HLK LH+AIK CEAALV++DP LG EA V+
Sbjct: 310 PFITTSYDYDAPIDEYGLVQEPKYSHLKQLHQAIKQCEAALVSSDPHVTKLGNYEEAHVF 369
Query: 366 KTGSGLCSAFLANIGTNSDVTVKFNGNSYLLPAWSVSILPDCKNVVFNT----AKINSVT 421
G G C AFL N N+ V FN Y LPAWS+SILPDC+NVVFNT AK + V
Sbjct: 370 TAGKGSCVAFLTNYHMNAPAKVVFNNRHYTLPAWSISILPDCRNVVFNTATVAAKTSHVQ 429
Query: 422 LVPSFSRQSLQVAADSSDAIGSGWSYINEPVGISKDDAFTKPGLLEQINTTADQSDYLWY 481
+VPS S L A + I + T GLLEQ+N T D +DYLWY
Sbjct: 430 MVPSGS--ILYSVARYDEDIAT----------YGNPGTITARGLLEQVNVTRDTTDYLWY 477
Query: 482 SLSTNIKADEPLLEDGSKTVLHVQSLGHALHAFINGKLVGSGYGSSSNAKVTVDFPIALA 541
+ S +IKA E L G L V S GHA+H F+NG GS +G+ N K + + L
Sbjct: 478 TTSVDIKASESFLRGGKWPTLTVDSAGHAVHVFVNGHFYGSAFGTRENRKFSFSSQVNLR 537
Query: 542 PGKNTFDLLSLTVGLQNYGAFYEKTGAGITGPVQLKGSGNGTNIDLSSQQWTYQTGLKGE 601
G N LLS+ VGL N G +E GI G V L G G N DLS Q+WTYQ GL+GE
Sbjct: 538 GGANKIALLSVAVGLPNVGPHFETWATGIVGSVALHGLDEG-NKDLSWQKWTYQAGLRGE 596
Query: 602 ELNFPS---GSSTQWDSKSTLPKL--QPLVWYKTTFDAPAGSEPVAIDFTGMGKGEAWVN 656
+N S SS W K +L K QPL WYK FDAP G+EP+A+D MGKG+AW+N
Sbjct: 597 SMNLVSPTEDSSVDW-IKGSLAKQNKQPLTWYKAYFDAPRGNEPLALDLKSMGKGQAWIN 655
Query: 657 GQSIGRYWPTYVSQNGGCTDSCNYRGAYSSNKCLKNCGKPSQSLYHVPRSWLKSSGNTLV 716
GQSIGRYW + + G SCNY G Y NKC CG+P+Q YHVPRSWLK GN LV
Sbjct: 656 GQSIGRYWMAFAKGDCG---SCNYAGTYRQNKCQSGCGEPTQRWYHVPRSWLKPKGNLLV 712
Query: 717 LFEEIGGDPTKISFVTKQL 735
LFEE+GGD +K+S V + +
Sbjct: 713 LFEELGGDISKVSVVKRSV 731
>gi|356509960|ref|XP_003523710.1| PREDICTED: beta-galactosidase 3-like isoform 1 [Glycine max]
Length = 736
Score = 855 bits (2209), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 419/714 (58%), Positives = 511/714 (71%), Gaps = 22/714 (3%)
Query: 26 NVTYDHRAVVIGGKRRVLISGSIHYPRSTPEMWPDLIQKSKDGGLDVIETYVFWNLHEPV 85
NVTYD ++++I G+RR+LISGSIHYPRSTPEMW DLI K+K GGLDVI+TYVFW++HEP
Sbjct: 29 NVTYDRKSLLINGQRRILISGSIHYPRSTPEMWEDLIWKAKHGGLDVIDTYVFWDVHEPS 88
Query: 86 RNQYNFEGRYDLVKFVKLVAEAGLYAHLRIGPYVCAEWNFGGFPLWLHFIPGIQFRTDNE 145
Y+FEGRYDLV+F+K V + GLYA+LRIGPYVCAEWNFGG P+WL ++PG+ FRTDNE
Sbjct: 89 PGNYDFEGRYDLVRFIKTVQKVGLYANLRIGPYVCAEWNFGGIPVWLKYVPGVSFRTDNE 148
Query: 146 PFKAEMQRFTAKIVDMMKQEKLYASQGGPIILSQIENEYGNIDSAYGAAGKSYIKWAAGM 205
PFKA MQ FT KIV MMK EKL+ SQGGPIILSQIENEYG + GAAG++Y+ WAA M
Sbjct: 149 PFKAAMQGFTQKIVQMMKSEKLFQSQGGPIILSQIENEYG--PESRGAAGRAYVNWAASM 206
Query: 206 ALSLDTGVPWVMCQQSDAPDPIINTCNGFYCDQFTPNSNNKPKMWTENWSGWFLSFGGAV 265
A+ L TGVPWVMC+++DAPDP+IN+CNGFYCD F+PN KP MWTE WSGWF FGG +
Sbjct: 207 AVGLGTGVPWVMCKENDAPDPVINSCNGFYCDDFSPNKPYKPSMWTETWSGWFTEFGGPI 266
Query: 266 PYRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNFDRTSGGPFISTSYDYDAPLDEYGLIR 325
RPVEDL+FAVARF Q+GG++ NYYMYHGGTNF R++GGPFI+TSYDYDAP+DEYGLIR
Sbjct: 267 HQRPVEDLSFAVARFIQKGGSYVNYYMYHGGTNFGRSAGGPFITTSYDYDAPIDEYGLIR 326
Query: 326 QPKWGHLKDLHKAIKLCEAALVATDPTYPSLGPNLEATVYKTGSGLCSAFLANIGTNSDV 385
QPK+ HLK+LHKAIK CE ALV+ DPT SLG L+A V+ +G+G C+AFLAN S
Sbjct: 327 QPKYSHLKELHKAIKRCEHALVSLDPTVLSLGTLLQAHVFSSGTGTCAAFLANYNAQSAA 386
Query: 386 TVKFNGNSYLLPAWSVSILPDCKNVVFNTAKINSVTLVPSFSRQSLQVAADSSDAIGSGW 445
TV FN Y LP WS+SILPDCK VFNTAK+ Q QV W
Sbjct: 387 TVTFNNRHYDLPPWSISILPDCKIDVFNTAKVRV---------QPSQVKMLPVKPKLFSW 437
Query: 446 -SYINEPVGISKDDAFTKPGLLEQINTTADQSDYLWYSLSTNIKADEPLLEDGSKTVLHV 504
SY + +++ T PGLLEQ+N T D SDYLWY S +I + E L G K ++V
Sbjct: 438 ESYDEDLSSLAESSRITAPGLLEQLNVTRDTSDYLWYITSVDISSSESFLRGGQKPSINV 497
Query: 505 QSLGHALHAFINGKLVGSGYGSSSNAKVTVDFPIALAPGKNTFDLLSLTVGLQNYGAFYE 564
QS GHA+H F+NG+ GS +G+ T + P+ L G N LLS+TVGLQN G YE
Sbjct: 498 QSAGHAVHVFVNGQFSGSAFGTREQRSCTYNGPVDLRAGANKIALLSVTVGLQNVGRHYE 557
Query: 565 KTGAGITGPVQLKGSGNGTNIDLSSQQWTYQTGLKGEELNF--PSG-SSTQW--DSKSTL 619
AGITGPV L G G DL+ +W+Y+ GL+GE +N P+G SS W +S++T
Sbjct: 558 TWEAGITGPVLLHGLDQGQK-DLTWNKWSYKVGLRGEAMNLVSPNGVSSVDWVQESQATQ 616
Query: 620 PKLQPLVWYKTTFDAPAGSEPVAIDFTGMGKGEAWVNGQSIGRYWPTYVSQNGGCTDSCN 679
+ Q L WYK FDAP G EP+A+D MGKG+ W+NGQSIGRYW Y G C +SC
Sbjct: 617 SRSQ-LKWYKAYFDAPGGKEPLALDLESMGKGQVWINGQSIGRYWMAYA--KGDC-NSCT 672
Query: 680 YRGAYSSNKCLKNCGKPSQSLYHVPRSWLKSSGNTLVLFEEIGGDPTKISFVTK 733
Y G + KC CG+P+Q YHVPRSWLK + N +V+FEE+GG+P KIS V +
Sbjct: 673 YSGTFRPVKCQLGCGQPTQRWYHVPRSWLKPTKNLIVVFEELGGNPWKISLVKR 726
>gi|16604400|gb|AAL24206.1| At1g45130/F27F5_20 [Arabidopsis thaliana]
Length = 732
Score = 854 bits (2207), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 429/739 (58%), Positives = 506/739 (68%), Gaps = 30/739 (4%)
Query: 6 ILLLVLCWGFVVLATTSFGANVTYDHRAVVIGGKRRVLISGSIHYPRSTPEMWPDLIQKS 65
LL + G V+ +S VTYD +A+VI G RR+L+SGSIHYPRSTPEMW DLI+K+
Sbjct: 14 FLLTTMLIGSSVIQCSS----VTYDKKAIVINGHRRILLSGSIHYPRSTPEMWEDLIKKA 69
Query: 66 KDGGLDVIETYVFWNLHEPVRNQYNFEGRYDLVKFVKLVAEAGLYAHLRIGPYVCAEWNF 125
KDGGLDVI+TYVFWN HEP YNFEGRYDLV+F+K + E GLY HLRIGPYVCAEWNF
Sbjct: 70 KDGGLDVIDTYVFWNGHEPSPGTYNFEGRYDLVRFIKTIQEVGLYVHLRIGPYVCAEWNF 129
Query: 126 GGFPLWLHFIPGIQFRTDNEPFKAEMQRFTAKIVDMMKQEKLYASQGGPIILSQIENEYG 185
GGFP+WL ++ GI FRTDN PFK+ MQ FT KIV MMK+ + +ASQGGPIILSQIENE+
Sbjct: 130 GGFPVWLKYVDGISFRTDNGPFKSAMQGFTEKIVQMMKEHRFFASQGGPIILSQIENEFE 189
Query: 186 NIDSAYGAAGKSYIKWAAGMALSLDTGVPWVMCQQSDAPDPIINTCNGFYCDQFTPNSNN 245
G AG SY+ WAA MA+ L+TGVPWVMC++ DAPDPIINTCNGFYCD FTPN
Sbjct: 190 PDLKGLGPAGHSYVNWAAKMAVGLNTGVPWVMCKEDDAPDPIINTCNGFYCDYFTPNKPY 249
Query: 246 KPKMWTENWSGWFLSFGGAVPYRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNFDRTSGG 305
KP MWTE WSGWF FGG VP RPVEDLAF VARF Q+GG++ NYYMYHGGTNF RT+GG
Sbjct: 250 KPTMWTEAWSGWFTEFGGTVPKRPVEDLAFGVARFIQKGGSYINYYMYHGGTNFGRTAGG 309
Query: 306 PFISTSYDYDAPLDEYGLIRQPKWGHLKDLHKAIKLCEAALVATDPTYPSLGPNLEATVY 365
PFI+TSYDYDAP+DEYGL+++PK+ HLK LH+AIK CEAALV++DP LG EA V+
Sbjct: 310 PFITTSYDYDAPIDEYGLVQEPKYSHLKQLHQAIKQCEAALVSSDPHVTKLGNYEEAHVF 369
Query: 366 KTGSGLCSAFLANIGTNSDVTVKFNGNSYLLPAWSVSILPDCKNVVFNT----AKINSVT 421
G G C AFL N N+ V FN Y LPAWS+SILPDC+NVVFNT AK + V
Sbjct: 370 TAGKGSCVAFLTNYHMNAPAKVVFNNRHYTLPAWSISILPDCRNVVFNTATVAAKTSHVQ 429
Query: 422 LVPSFSRQSLQVAADSSDAIGSGWSYINEPVGISKDDAFTKPGLLEQINTTADQSDYLWY 481
+VPS S L A + I +Y N T GLLEQ+N T D +DYLWY
Sbjct: 430 MVPSGS--ILYSVARYDEDIA---TYGNR-------GTITARGLLEQVNVTRDTTDYLWY 477
Query: 482 SLSTNIKADEPLLEDGSKTVLHVQSLGHALHAFINGKLVGSGYGSSSNAKVTVDFPIALA 541
+ S +IKA E L G L V S GHA+H F+NG GS +G+ N K + + L
Sbjct: 478 TTSVDIKASESFLRGGKWPTLTVDSAGHAVHVFVNGHFYGSAFGTRENRKFSFSSQVNLR 537
Query: 542 PGKNTFDLLSLTVGLQNYGAFYEKTGAGITGPVQLKGSGNGTNIDLSSQQWTYQTGLKGE 601
G N LLS+ VGL N G +E GI G V L G G N DLS Q+WTYQ GL+GE
Sbjct: 538 GGANKIALLSVAVGLPNVGPHFETWATGIVGSVVLHGLDEG-NKDLSWQKWTYQAGLRGE 596
Query: 602 ELNFPS---GSSTQWDSKSTLPKL--QPLVWYKTTFDAPAGSEPVAIDFTGMGKGEAWVN 656
+N S SS W K +L K QPL WYK FD P G+EP+A+D MGKG+AW+N
Sbjct: 597 SMNLVSPTEDSSVDW-IKGSLAKQNKQPLTWYKAYFDVPRGNEPLALDLKSMGKGQAWIN 655
Query: 657 GQSIGRYWPTYVSQNGGCTDSCNYRGAYSSNKCLKNCGKPSQSLYHVPRSWLKSSGNTLV 716
GQSIGRYW + + G SCNY G Y NKC CG+P+Q YHVPRSWLK GN LV
Sbjct: 656 GQSIGRYWMAFAKGDCG---SCNYAGTYRQNKCQSGCGEPTQRWYHVPRSWLKPKGNLLV 712
Query: 717 LFEEIGGDPTKISFVTKQL 735
LFEE+GGD +K+S V + +
Sbjct: 713 LFEELGGDISKVSVVKRSV 731
>gi|195617466|gb|ACG30563.1| beta-galactosidase precursor [Zea mays]
Length = 723
Score = 854 bits (2206), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 423/720 (58%), Positives = 505/720 (70%), Gaps = 19/720 (2%)
Query: 17 VLATTSFGANVTYDHRAVVIGGKRRVLISGSIHYPRSTPEMWPDLIQKSKDGGLDVIETY 76
++A + A V+YDHRAVVI G+RR+LISGSIHYPRSTPEMWP L+QK+KDGGLDV++TY
Sbjct: 18 MIAPSPANAAVSYDHRAVVINGQRRILISGSIHYPRSTPEMWPGLLQKAKDGGLDVVQTY 77
Query: 77 VFWNLHEPVRNQYNFEGRYDLVKFVKLVAEAGLYAHLRIGPYVCAEWNFGGFPLWLHFIP 136
VFWN HEPVR QY F RYDLV+FVKL +AGLY HLRIGPYVCAEWNFGGFP+WL ++P
Sbjct: 78 VFWNGHEPVRGQYYFGDRYDLVRFVKLAKQAGLYVHLRIGPYVCAEWNFGGFPVWLKYVP 137
Query: 137 GIQFRTDNEPFKAEMQRFTAKIVDMMKQEKLYASQGGPIILSQIENEYGNIDSAYGAAGK 196
GI FRTDN PFKA MQ F KIV MMK E L+ QGGPIIL+Q+ENEYG ++S GA K
Sbjct: 138 GISFRTDNGPFKAAMQAFVEKIVSMMKSEGLFEWQGGPIILAQVENEYGPMESVMGAGAK 197
Query: 197 SYIKWAAGMALSLDTGVPWVMCQQSDAPDPIINTCNGFYCDQFTPNSNNKPKMWTENWSG 256
Y WAA MA++ GVPWVMC+Q DAPDP+INTCNGFYCD F+PNSN+KP MWTE W+G
Sbjct: 198 PYANWAAKMAVATGAGVPWVMCKQDDAPDPVINTCNGFYCDYFSPNSNSKPTMWTEAWTG 257
Query: 257 WFLSFGGAVPYRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNFDRTSGGPFISTSYDYDA 316
WF +FGGAVP+RPVED+AFAVARF Q+GG+F NYYMYHGGTNFDRTSGGPFI+TSYDYDA
Sbjct: 258 WFTAFGGAVPHRPVEDMAFAVARFIQKGGSFVNYYMYHGGTNFDRTSGGPFIATSYDYDA 317
Query: 317 PLDEYGLIRQPKWGHLKDLHKAIKLCEAALVATDPTYPSLGPNLEATVYKTGSGLCSAFL 376
P+DEYGL+RQPKWGHL+DLHKAIK E ALV+ DPT SLG +A V+K+ G C+AFL
Sbjct: 318 PIDEYGLLRQPKWGHLRDLHKAIKQAEPALVSGDPTIQSLGNYEKAYVFKSSGGACAAFL 377
Query: 377 ANIGTNSDVTVKFNGNSYLLPAWSVSILPDCKNVVFNTAKINSVTLVPSFSRQSLQVAAD 436
+N T++ V FNG Y LPAWS+S+LPDCK VFNTA ++ PS A
Sbjct: 378 SNYHTSAAARVVFNGRRYDLPAWSISVLPDCKAAVFNTATVSE----PS-------APAR 426
Query: 437 SSDAIGSGWSYINEPVGISKDDAFTKPGLLEQINTTADQSDYLWYSLSTNIKADEPLLED 496
S A G W +E AFTK GL+EQ++ T D+SDYLWY+ NI ++E L+
Sbjct: 427 MSPAGGFSWQSYSEATNSLDGRAFTKDGLVEQLSMTWDKSDYLWYTTYVNINSNEQFLKS 486
Query: 497 GSKTVLHVQSLGHALHAFINGKLVGSGYGSSSNAKVTVDFPIALAPGKNTFDLLSLTVGL 556
G L V S GH+L F+NG+ G+ YG + K+T + + G N +LS VGL
Sbjct: 487 GQWPQLTVYSAGHSLQVFVNGQSYGAVYGGYDSPKLTYSGYVKMWQGSNKISILSAAVGL 546
Query: 557 QNYGAFYEKTGAGITGPVQLKGSGNGTNIDLSSQQWTYQTGLKGEELNFPS---GSSTQW 613
N G YE G+ GPV L G G DLS+Q+WTYQ GL GE L S SS +W
Sbjct: 547 PNQGTHYETWNVGVLGPVTLSGLNEGKR-DLSNQKWTYQIGLHGESLGVQSVAGSSSVEW 605
Query: 614 DSKSTLPKLQPLVWYKTTFDAPAGSEPVAIDFTGMGKGEAWVNGQSIGRYWPTYVSQNGG 673
S + QPL W+K F AP+G PVA+D MGKG+AWVNG+ IGRYW +Y + + G
Sbjct: 606 GSAA---GKQPLTWHKAYFSAPSGDAPVALDMGSMGKGQAWVNGRHIGRYW-SYKASSSG 661
Query: 674 CTDSCNYRGAYSSNKCLKNCGKPSQSLYHVPRSWLKSSGNTLVLFEEIGGDPTKISFVTK 733
C+Y G YS KC CG SQ YHVPRSWL SGN LVL EE GGD + VT+
Sbjct: 662 GCGGCSYAGTYSETKCQTGCGDVSQRYYHVPRSWLNPSGNLLVLLEEFGGDLPGVKLVTR 721
>gi|449452767|ref|XP_004144130.1| PREDICTED: beta-galactosidase 15-like [Cucumis sativus]
Length = 827
Score = 854 bits (2206), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 425/832 (51%), Positives = 553/832 (66%), Gaps = 41/832 (4%)
Query: 27 VTYDHRAVVIGGKRRVLISGSIHYPRSTPEMWPDLIQKSKDGGLDVIETYVFWNLHEPVR 86
V+Y +R + I G+ ++ +SGSIHYPRSTP+MWPDLI+KSK+GGLD IETYVFWN HEPVR
Sbjct: 26 VSYTNRGITIDGQPKIFLSGSIHYPRSTPQMWPDLIKKSKEGGLDTIETYVFWNAHEPVR 85
Query: 87 NQYNFEGRYDLVKFVKLVAEAGLYAHLRIGPYVCAEWNFGGFPLWLHFIPGIQ-FRTDNE 145
QY+F DLV+F+K + GLYA LRIGPYVCAEWN+GGFP+WLH +PGI+ RT N
Sbjct: 86 RQYDFSANLDLVRFIKTIQNEGLYAVLRIGPYVCAEWNYGGFPVWLHNLPGIEELRTTNP 145
Query: 146 PFKAEMQRFTAKIVDMMKQEKLYASQGGPIILSQIENEYGNIDSAYGAAGKSYIKWAAGM 205
F EMQ FT IVDMMKQE L+ASQGGPIIL+QIENEYGN+ ++YG AGK+Y+ W A M
Sbjct: 146 VFMNEMQNFTTLIVDMMKQENLFASQGGPIILAQIENEYGNVMTSYGDAGKAYVNWCANM 205
Query: 206 ALSLDTGVPWVMCQQSDAPDPIINTCNGFYCDQFTPNSNNKPKMWTENWSGWFLSFGGAV 265
A S + GVPW+MCQQ DAP+P INTCNG+YCDQFTPN+ PKMWTENW+GWF S+GG
Sbjct: 206 ADSQNVGVPWIMCQQDDAPEPTINTCNGWYCDQFTPNNAKSPKMWTENWTGWFKSWGGRD 265
Query: 266 PYRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNFDRTSGGPFISTSYDYDAPLDEYGLIR 325
P R EDLAF+VARFFQ GGTFQNYYMYHGGTNFDR +GGP+I+T+YDY+APLDEYG +
Sbjct: 266 PVRTPEDLAFSVARFFQLGGTFQNYYMYHGGTNFDRMAGGPYITTTYDYNAPLDEYGNLN 325
Query: 326 QPKWGHLKDLHKAIKLCEAALVATDPTYPSLGPNLEATVYKTGSGLCSAFLANIGTNSDV 385
QPK+GHLK LH A+K E ALV+ + T L ++ T Y T G S F +NI +D
Sbjct: 326 QPKFGHLKQLHAALKSIEKALVSGNVTTTDLTDSVSITEYATDKGK-SCFFSNINETTDA 384
Query: 386 TVKFNGNSYLLPAWSVSILPDCKNVVFNTAKINSVTLVPSFSRQSLQVAADSSDAIGSGW 445
V + G + +PAWSVSILPDC+ V+NTAK+N+ T S + A + + + W
Sbjct: 385 LVNYLGKDFNVPAWSVSILPDCQEEVYNTAKVNTQT---SVMVKKENKAENEPEVLEWMW 441
Query: 446 --SYINEPVGISKDDAFTKPGLLEQINTTADQSDYLWYSLSTNIKADEPLLEDGSKTVLH 503
I+ + K T L++Q + D SDYLWY S N+K +P+ + + L
Sbjct: 442 RPENIDNTARLGKGQV-TANKLIDQKDAANDASDYLWYMTSVNLKKKDPIWSN--EMTLR 498
Query: 504 VQSLGHALHAFINGKLVGSGYGSSSNAKVTVDFPIALAPGKNTFDLLSLTVGLQNYGAFY 563
+ GH +HAF+NG+ +GS + S + + L PGKN LLS T+GL+NYGA Y
Sbjct: 499 INVSGHIVHAFVNGEHIGSQWASYDVYNYIFEQEVKLKPGKNIISLLSATIGLKNYGAQY 558
Query: 564 EKTGAGITGPVQLKGSGNGTNI--DLSSQQWTYQTGLKGEELNFPSGSS---TQWDSKST 618
+ +GI GPVQL G I DLS+ +W+Y+ GL G E S S T+W S
Sbjct: 559 DLIQSGIVGPVQLIGRHGDETIIKDLSNHKWSYEVGLHGFENRLFSPESRFATKWQS-GN 617
Query: 619 LPKLQPLVWYKTTFDAPAGSEPVAIDFTGMGKGEAWVNGQSIGRYWPTYVSQNGGCTDSC 678
LP + + WYKTTF P G++PV +D G+GKG AWVNG SIGRYWP++++++G + C
Sbjct: 618 LPVNRMMTWYKTTFKPPLGTDPVTLDLQGLGKGMAWVNGHSIGRYWPSFIAEDGCSDEPC 677
Query: 679 NYRGAYSSNKCLKNCGKPSQSLYHVPRSWLKSSGNTLVLFEEIGGDPTKISFVTKQLGSS 738
+YRG+Y++ KC+++CGKP+Q YHVPRSWL NTLVLFEE GG+P+ ++F T + +
Sbjct: 678 DYRGSYTNTKCVRDCGKPTQQWYHVPRSWLNEGDNTLVLFEEFGGNPSLVNFKTIAMEKA 737
Query: 739 LCSHVTDSHPLPVDMWGSDSKIQRKPGPVLSLECPNPNQVISSIKFASFGTPLGTCGSFS 798
C H + L + G + I+ IKFASFG P G+CG+FS
Sbjct: 738 -CGHAYEKKSLELSCQGKE---------------------ITGIKFASFGDPTGSCGNFS 775
Query: 799 RGRCSSAR-SLSVVRQACVGSKSCSIGVSVNTFG--DPCKGVMKSLAVEASC 847
+G C ++ +V C+G +SC I +S +TFG + GV+K LAVEA C
Sbjct: 776 KGSCEGKNDAMKIVEDLCIGKESCVIDISEDTFGATNCALGVVKRLAVEAVC 827
>gi|449529387|ref|XP_004171681.1| PREDICTED: LOW QUALITY PROTEIN: beta-galactosidase 15-like [Cucumis
sativus]
Length = 827
Score = 854 bits (2206), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 425/832 (51%), Positives = 553/832 (66%), Gaps = 41/832 (4%)
Query: 27 VTYDHRAVVIGGKRRVLISGSIHYPRSTPEMWPDLIQKSKDGGLDVIETYVFWNLHEPVR 86
V+Y +R + I G+ ++ +SGSIHYPRSTP+MWPDLI+KSK+GGLD IETYVFWN HEPVR
Sbjct: 26 VSYTNRGITIDGQPKIFLSGSIHYPRSTPQMWPDLIKKSKEGGLDTIETYVFWNAHEPVR 85
Query: 87 NQYNFEGRYDLVKFVKLVAEAGLYAHLRIGPYVCAEWNFGGFPLWLHFIPGIQ-FRTDNE 145
QY+F DLV+F+K + GLYA LRIGPYVCAEWN+GGFP+WLH +PGI+ RT N
Sbjct: 86 RQYDFSANLDLVRFIKTIQNEGLYAVLRIGPYVCAEWNYGGFPVWLHNLPGIEELRTTNP 145
Query: 146 PFKAEMQRFTAKIVDMMKQEKLYASQGGPIILSQIENEYGNIDSAYGAAGKSYIKWAAGM 205
F EMQ FT IVDMMKQE L+ASQGGPIIL+QIENEYGN+ ++YG AGK+Y+ W A M
Sbjct: 146 VFMNEMQNFTTLIVDMMKQENLFASQGGPIILAQIENEYGNVMTSYGDAGKAYVNWCANM 205
Query: 206 ALSLDTGVPWVMCQQSDAPDPIINTCNGFYCDQFTPNSNNKPKMWTENWSGWFLSFGGAV 265
A S + GVPW+MCQQ DAP+P INTCNG+YCDQFTPN+ PKMWTENW+GWF S+GG
Sbjct: 206 ADSQNVGVPWIMCQQDDAPEPTINTCNGWYCDQFTPNNAKSPKMWTENWTGWFKSWGGRD 265
Query: 266 PYRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNFDRTSGGPFISTSYDYDAPLDEYGLIR 325
P R EDLAF+VARFFQ GGTFQNYYMYHGGTNFDR +GGP+I+T+YDY+APLDEYG +
Sbjct: 266 PVRTPEDLAFSVARFFQLGGTFQNYYMYHGGTNFDRMAGGPYITTTYDYNAPLDEYGNLN 325
Query: 326 QPKWGHLKDLHKAIKLCEAALVATDPTYPSLGPNLEATVYKTGSGLCSAFLANIGTNSDV 385
QPK+GHLK LH A+K E ALV+ + T L ++ T Y T G S F +NI +D
Sbjct: 326 QPKFGHLKQLHAALKSIEKALVSGNVTTTDLTDSVSITEYATDKGK-SCFFSNINETTDA 384
Query: 386 TVKFNGNSYLLPAWSVSILPDCKNVVFNTAKINSVTLVPSFSRQSLQVAADSSDAIGSGW 445
V + G + +PAWSVSILPDC+ V+NTAK+N+ T S + A + + + W
Sbjct: 385 LVNYLGKDFNVPAWSVSILPDCQEEVYNTAKVNTQT---SVMVKKENKAENEPEVLEWMW 441
Query: 446 --SYINEPVGISKDDAFTKPGLLEQINTTADQSDYLWYSLSTNIKADEPLLEDGSKTVLH 503
I+ + K T L++Q + D SDYLWY S N+K +P+ + + L
Sbjct: 442 RPENIDNTARLGKGQV-TANKLIDQKDAANDASDYLWYMTSVNLKKKDPIWSN--EMTLR 498
Query: 504 VQSLGHALHAFINGKLVGSGYGSSSNAKVTVDFPIALAPGKNTFDLLSLTVGLQNYGAFY 563
+ GH +HAF+NG+ +GS + S + + L PGKN LLS T+GL+NYGA Y
Sbjct: 499 INVSGHIVHAFVNGEHIGSQWASYDVYNYIXEQEVKLKPGKNIISLLSATIGLKNYGAQY 558
Query: 564 EKTGAGITGPVQLKGSGNGTNI--DLSSQQWTYQTGLKGEELNFPSGSS---TQWDSKST 618
+ +GI GPVQL G I DLS+ +W+Y+ GL G E S S T+W S
Sbjct: 559 DLIQSGIVGPVQLIGRHGDETIIKDLSNHKWSYEVGLHGFENRLFSPESRFATKWQS-GN 617
Query: 619 LPKLQPLVWYKTTFDAPAGSEPVAIDFTGMGKGEAWVNGQSIGRYWPTYVSQNGGCTDSC 678
LP + + WYKTTF P G++PV +D G+GKG AWVNG SIGRYWP++++++G + C
Sbjct: 618 LPVNRMMTWYKTTFKPPLGTDPVTLDLQGLGKGMAWVNGHSIGRYWPSFIAEDGCSDEPC 677
Query: 679 NYRGAYSSNKCLKNCGKPSQSLYHVPRSWLKSSGNTLVLFEEIGGDPTKISFVTKQLGSS 738
+YRG+Y++ KC+++CGKP+Q YHVPRSWL NTLVLFEE GG+P+ ++F T + +
Sbjct: 678 DYRGSYTNTKCVRDCGKPTQQWYHVPRSWLNEGDNTLVLFEEFGGNPSLVNFKTIAMEKA 737
Query: 739 LCSHVTDSHPLPVDMWGSDSKIQRKPGPVLSLECPNPNQVISSIKFASFGTPLGTCGSFS 798
C H + L + G + I+ IKFASFG P G+CG+FS
Sbjct: 738 -CGHAYEKKSLELSCQGKE---------------------ITGIKFASFGDPTGSCGNFS 775
Query: 799 RGRCSSAR-SLSVVRQACVGSKSCSIGVSVNTFG--DPCKGVMKSLAVEASC 847
+G C ++ +V C+G +SC I +S +TFG + GV+K LAVEA C
Sbjct: 776 KGSCEGKNDAMKIVEDLCIGKESCVIDISEDTFGATNCALGVVKRLAVEAVC 827
>gi|297846860|ref|XP_002891311.1| hypothetical protein ARALYDRAFT_473836 [Arabidopsis lyrata subsp.
lyrata]
gi|297337153|gb|EFH67570.1| hypothetical protein ARALYDRAFT_473836 [Arabidopsis lyrata subsp.
lyrata]
Length = 732
Score = 853 bits (2205), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 418/717 (58%), Positives = 498/717 (69%), Gaps = 20/717 (2%)
Query: 25 ANVTYDHRAVVIGGKRRVLISGSIHYPRSTPEMWPDLIQKSKDGGLDVIETYVFWNLHEP 84
++VTYD +A+VI G RR+L+SGSIHYPRSTPEMW DLI+K+KDGGLDVI+TYVFWN HEP
Sbjct: 29 SSVTYDKKAIVINGHRRILLSGSIHYPRSTPEMWEDLIKKAKDGGLDVIDTYVFWNGHEP 88
Query: 85 VRNQYNFEGRYDLVKFVKLVAEAGLYAHLRIGPYVCAEWNFGGFPLWLHFIPGIQFRTDN 144
YNFEGRYDLV+F+K + E GLY HLRIGPYVCAEWNFGGFP+WL ++ GI FRTDN
Sbjct: 89 SPGTYNFEGRYDLVRFIKTIQEVGLYVHLRIGPYVCAEWNFGGFPVWLKYVDGISFRTDN 148
Query: 145 EPFKAEMQRFTAKIVDMMKQEKLYASQGGPIILSQIENEYGNIDSAYGAAGKSYIKWAAG 204
PFKA MQ FT KIV MMK+ + +ASQGGPIILSQIENE+ G AG SY+ WAA
Sbjct: 149 GPFKAAMQGFTEKIVQMMKEHRFFASQGGPIILSQIENEFEPELKGLGPAGHSYVNWAAK 208
Query: 205 MALSLDTGVPWVMCQQSDAPDPIINTCNGFYCDQFTPNSNNKPKMWTENWSGWFLSFGGA 264
MA+ L+TGVPWVMC++ DAPDPIIN+CNGFYCD FTPN KP MWTE WSGWF FGG
Sbjct: 209 MAVGLNTGVPWVMCKEDDAPDPIINSCNGFYCDYFTPNKPYKPTMWTEAWSGWFTEFGGT 268
Query: 265 VPYRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNFDRTSGGPFISTSYDYDAPLDEYGLI 324
+P RPVEDLAF VARF Q+GG++ NYYMYHGGTNF RT+GGPFI+TSYDYDAP+DEYGL+
Sbjct: 269 IPKRPVEDLAFGVARFIQKGGSYINYYMYHGGTNFGRTAGGPFITTSYDYDAPIDEYGLV 328
Query: 325 RQPKWGHLKDLHKAIKLCEAALVATDPTYPSLGPNLEATVYKTGSGLCSAFLANIGTNSD 384
++PK+ HLK LH+AIK CEAALV++DP LG EA V+ G G C AFL N N+
Sbjct: 329 QEPKYSHLKQLHQAIKQCEAALVSSDPHVTKLGNYEEAHVFTAGKGSCVAFLTNYHMNAP 388
Query: 385 VTVKFNGNSYLLPAWSVSILPDCKNVVFNTAKINSVTLVPSFSRQSLQVAADSSDAIGSG 444
V FN Y LPAWS+SILPDC+NVVFNTA + + ++ V S +I
Sbjct: 389 AKVVFNNRHYTLPAWSISILPDCRNVVFNTATV---------AAKTSHVQMMPSGSILYS 439
Query: 445 WSYINEPVGISKDDA-FTKPGLLEQINTTADQSDYLWYSLSTNIKADEPLLEDGSKTVLH 503
+ +E + D T GLLEQ+N T D +DYLWY+ S +IKA E L G L
Sbjct: 440 VARYDEDIATYGDRGTITARGLLEQVNVTRDTTDYLWYTTSVDIKASESFLRGGKWPTLT 499
Query: 504 VQSLGHALHAFINGKLVGSGYGSSSNAKVTVDFPIALAPGKNTFDLLSLTVGLQNYGAFY 563
V S GHA+H F+NG GS +G+ N K + + L G N LLS+ VGL N G +
Sbjct: 500 VDSAGHAVHVFVNGHFYGSAFGTRENRKFSFSSQVNLRGGANRIALLSVAVGLPNVGPHF 559
Query: 564 EKTGAGITGPVQLKGSGNGTNIDLSSQQWTYQTGLKGEELNFPS---GSSTQWDSKSTLP 620
E GI G V L G G N DLS Q+WTYQ GL+GE + S SS W K +L
Sbjct: 560 ETWATGIVGSVVLHGLDEG-NKDLSWQKWTYQAGLRGEAMKLVSPTEDSSVDW-IKGSLA 617
Query: 621 KL--QPLVWYKTTFDAPAGSEPVAIDFTGMGKGEAWVNGQSIGRYWPTYVSQNGGCTDSC 678
K QPL WYK FDAP G+EP+A+D MGKG+AW+NGQSIGRYW + N G SC
Sbjct: 618 KQNKQPLTWYKAYFDAPRGNEPLALDLKSMGKGQAWINGQSIGRYWMAFAKGNCG---SC 674
Query: 679 NYRGAYSSNKCLKNCGKPSQSLYHVPRSWLKSSGNTLVLFEEIGGDPTKISFVTKQL 735
NY G Y NKC CG+P+Q YHVPRSWLK GN LVLFEE+GGD +K+S V + +
Sbjct: 675 NYAGTYRQNKCQSGCGEPTQRWYHVPRSWLKPRGNLLVLFEELGGDISKVSVVKRSV 731
>gi|357453875|ref|XP_003597218.1| Beta-galactosidase [Medicago truncatula]
gi|355486266|gb|AES67469.1| Beta-galactosidase [Medicago truncatula]
Length = 2260
Score = 852 bits (2202), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 404/520 (77%), Positives = 450/520 (86%), Gaps = 12/520 (2%)
Query: 1 MASKEILLLVLCWGFVVLATTSFGANVTYDHRAVVIGGKRRVLISGSIHYPRSTPEMWPD 60
M + EI+L VL W T F NV YDHRA+VI GKRRVLISGSIHYPRSTP+MWPD
Sbjct: 1 MRATEIVL-VLLW----FLPTMFCTNVDYDHRALVIDGKRRVLISGSIHYPRSTPQMWPD 55
Query: 61 LIQKSKDGGLDVIETYVFWNLHEPVRNQYNFEGRYDLVKFVKLVAEAGLYAHLRIGPYVC 120
LIQKSKDGGLDVIETYVFWNLHEPV+ QY+F+GR DLVKFVK VAEAGLY HLRIGPYVC
Sbjct: 56 LIQKSKDGGLDVIETYVFWNLHEPVKGQYDFDGRKDLVKFVKAVAEAGLYVHLRIGPYVC 115
Query: 121 AEWNFGGFPLWLHFIPGIQFRTDNEPFKAEMQRFTAKIVDMMKQEKLYASQGGPIILSQI 180
+EWN+GGFPLWLHFIPGI+FRTDNEPFK EM+RFT KIVD+MKQEKLYASQGGPIILSQI
Sbjct: 116 SEWNYGGFPLWLHFIPGIKFRTDNEPFKVEMKRFTTKIVDLMKQEKLYASQGGPIILSQI 175
Query: 181 ENEYGNIDSAYGAAGKSYIKWAAGMALSLDTGVPWVMCQQSDAPDPI-INTCNGFYCDQF 239
ENEYG+IDSAYG+AGKSYI WAA MA SLDTGVPWVMCQQ+DAPDPI INTCNGFYCDQF
Sbjct: 176 ENEYGDIDSAYGSAGKSYINWAAKMATSLDTGVPWVMCQQADAPDPIVINTCNGFYCDQF 235
Query: 240 TPNSNNKPKMWTENWSGWFLSFGGAVPYRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNF 299
TPNS KPK+WTENWS W+L FGG P+RPVEDLAFAVARFFQRGGTFQNYYMYHGGTNF
Sbjct: 236 TPNSKTKPKLWTENWSAWYLLFGGGFPHRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNF 295
Query: 300 DRTSGGPFISTSYDYDAPLDEYGLIRQPKWGHLKDLHKAIKLCEAALVATDPTYPSLGPN 359
DR++GGPFI+TSYD+DAP+DEYG+IRQPKWGHLKD+HKAIKLCE AL+A +P LGPN
Sbjct: 296 DRSTGGPFIATSYDFDAPIDEYGVIRQPKWGHLKDVHKAIKLCEEALIAAEPKITYLGPN 355
Query: 360 LEATVYKTGSGLCSAFLANIGTNSDVTVKFNGNSYLLPAWSVSILPDCKNVVFNTAKINS 419
LEA VYKTGS +C+AFLAN+ SD TV F+GNSY LPAWSVSILPDCKNVV NTAKINS
Sbjct: 356 LEAAVYKTGS-VCAAFLANVDAKSDKTVNFSGNSYHLPAWSVSILPDCKNVVLNTAKINS 414
Query: 420 VTLVPSFSRQSLQVAADSSDAIGSGWSYINEPVGISKDDAFTKPGLLEQINTTADQSDYL 479
+ + +F +SL+ SS+ S WS+INEPVGISKDD +K GLLEQIN TAD+SDYL
Sbjct: 415 ASTISNFVTESLKEDISSSETSRSKWSWINEPVGISKDDILSKTGLLEQINITADRSDYL 474
Query: 480 WYSLSTNIKADEPLLEDGSKTVLHVQSLGHALHAFINGKL 519
WYSLS ++K D+P GS+TVLH++SLGHALHAFINGKL
Sbjct: 475 WYSLSVDLK-DDP----GSQTVLHIESLGHALHAFINGKL 509
Score = 477 bits (1228), Expect = e-131, Method: Compositional matrix adjust.
Identities = 227/329 (68%), Positives = 268/329 (81%), Gaps = 2/329 (0%)
Query: 520 VGSGYGSSSNAKVTVDFPIALAPGKNTFDLLSLTVGLQNYGAFYEKTGAGITGPVQLKGS 579
+GS G+ K+ D PI + GKN DLLSLTVGLQNYGAF++ GAGITGPV LKG
Sbjct: 1932 LGSQTGNKEKPKLNEDIPITVLSGKNKIDLLSLTVGLQNYGAFFDTWGAGITGPVILKGL 1991
Query: 580 GNGTN-IDLSSQQWTYQTGLKGEELNFPSGSSTQWDSKSTLPKLQPLVWYKTTFDAPAGS 638
NG +DLSS++WTYQ GLKGE+L SGSS W+SK+T PK QPL+WYKT FDAP+GS
Sbjct: 1992 KNGNKTLDLSSRKWTYQVGLKGEDLGLSSGSSGAWNSKTTFPKKQPLIWYKTNFDAPSGS 2051
Query: 639 EPVAIDFTGMGKGEAWVNGQSIGRYWPTYVSQNGGCTDSCNYRGAYSSNKCLKNCGKPSQ 698
PV IDFTGMGKGEAWVNGQSIGRYWPTYV+ N CTDSCNYRG ++ KC NCGKPSQ
Sbjct: 2052 NPVVIDFTGMGKGEAWVNGQSIGRYWPTYVASNVDCTDSCNYRGPFTQTKCHMNCGKPSQ 2111
Query: 699 SLYHVPRSWLKSSGNTLVLFEEIGGDPTKISFVTKQLGSSLCSHVTDSHPLPVDMWGSDS 758
+LYHVP+S+LK +GNTLVLFEE GGDPT+ISF TKQ+G S+C+HV+DSHP +D+W D+
Sbjct: 2112 TLYHVPQSFLKPNGNTLVLFEESGGDPTQISFATKQIG-SVCAHVSDSHPPQIDLWNQDT 2170
Query: 759 KIQRKPGPVLSLECPNPNQVISSIKFASFGTPLGTCGSFSRGRCSSARSLSVVRQACVGS 818
+ K GP L L CPN NQVISSIKFAS+GTPLGTCG+F RGRCSS ++LS+V++AC+GS
Sbjct: 2171 ESGGKVGPALLLNCPNHNQVISSIKFASYGTPLGTCGNFYRGRCSSNKTLSIVKKACIGS 2230
Query: 819 KSCSIGVSVNTFGDPCKGVMKSLAVEASC 847
+SCSIGVS +TFGDPCKGV KSLAVEA+C
Sbjct: 2231 RSCSIGVSTDTFGDPCKGVPKSLAVEATC 2259
>gi|7682680|gb|AAF67342.1| beta galactosidase [Vigna radiata]
Length = 739
Score = 852 bits (2201), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 419/731 (57%), Positives = 512/731 (70%), Gaps = 19/731 (2%)
Query: 8 LLVLCWGFVVLATTSFGANVTYDHRAVVIGGKRRVLISGSIHYPRSTPEMWPDLIQKSKD 67
LLVL + + L + +VTYD +A++I G+RR+LISGSIHYPRSTPEMW DLI+K+K
Sbjct: 9 LLVLVFTILFLGSELIHCSVTYDRKAIIINGQRRILISGSIHYPRSTPEMWEDLIRKAKG 68
Query: 68 GGLDVIETYVFWNLHEPVRNQYNFEGRYDLVKFVKLVAEAGLYAHLRIGPYVCAEWNFGG 127
GGLD I+TYVFWN+HEP YNFEGRYDLV+F+K V GLY HLRIGPYVCAEWNFGG
Sbjct: 69 GGLDAIDTYVFWNVHEPSPGIYNFEGRYDLVRFIKTVQRVGLYVHLRIGPYVCAEWNFGG 128
Query: 128 FPLWLHFIPGIQFRTDNEPFKAEMQRFTAKIVDMMKQEKLYASQGGPIILSQIENEYGNI 187
FP+WL ++PGI FRTDN PFKA MQ FT KIV MMK EKL+ SQGGPIILSQIENEYG+
Sbjct: 129 FPVWLKYVPGISFRTDNGPFKAAMQGFTQKIVQMMKNEKLFQSQGGPIILSQIENEYGSE 188
Query: 188 DSAYGAAGKSYIKWAAGMALSLDTGVPWVMCQQSDAPDPIINTCNGFYCDQFTPNSNNKP 247
G AG +Y WAA MA+ L+TGVPWVMC+Q DAPDP+IN CNGFYCD F+PN KP
Sbjct: 189 SKQLGGAGYAYTNWAAKMAVGLNTGVPWVMCKQDDAPDPVINACNGFYCDYFSPNKPYKP 248
Query: 248 KMWTENWSGWFLSFGGAVPYRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNFDRTSGGPF 307
+WTE+WSGWF FGG + RPV+DLAFAVARF Q+GG++ NYYMYHGGTNF R++GGPF
Sbjct: 249 TLWTESWSGWFTEFGGPIYQRPVQDLAFAVARFIQKGGSYINYYMYHGGTNFGRSAGGPF 308
Query: 308 ISTSYDYDAPLDEYGLIRQPKWGHLKDLHKAIKLCEAALVATDPTYPSLGPNLEATVYKT 367
I+TSYDYDAP+DEYGLIR+PK+GHL DLHKAIK CE ALV++DPT SLG +A V+ +
Sbjct: 309 ITTSYDYDAPIDEYGLIREPKYGHLMDLHKAIKQCERALVSSDPTVTSLGAYEQAHVFSS 368
Query: 368 GSGLCSAFLANIGTNSDVTVKFNGNSYLLPAWSVSILPDCKNVVFNTAKINSVTLVPSFS 427
+G C+AFLAN +NS V FN Y LP WS+SILPDCK VFNTA++ F
Sbjct: 369 KNGACAAFLANYHSNSAARVTFNNRKYDLPPWSISILPDCKTDVFNTARVR-------FQ 421
Query: 428 RQSLQVAADSSDAIGSGWSYINEPV-GISKDDAFTKPGLLEQINTTADQSDYLWYSLSTN 486
+Q+ +S W +E V +S+ T GLLEQ+N T D SDYLWY S +
Sbjct: 422 TTKIQMLPSNSKLF--SWETYDEDVSSLSESSKITASGLLEQLNATRDTSDYLWYITSVD 479
Query: 487 IKADEPLLEDGSKTVLHVQSLGHALHAFINGKLVGSGYGSSSNAKVTVDFPIALAPGKNT 546
I + E L G+K + V S GHA+H FING+ +GS +G+S + T + P+ L G N
Sbjct: 480 ISSSESFLRGGNKPSISVHSAGHAVHVFINGQFLGSAFGTSEDRSCTFNGPVNLRAGTNK 539
Query: 547 FDLLSLTVGLQNYGAFYEKTGAGITGPVQLKGSGNGTNIDLSSQQWTYQTGLKGEELNF- 605
LLS+ VGL N G +E AGITG V L G +G DL+ Q+W+YQ GLKGE +N
Sbjct: 540 IALLSVAVGLPNVGFHFETWKAGITG-VLLYGLDHGQK-DLTWQKWSYQIGLKGEAMNLV 597
Query: 606 -PSG-SSTQWDSKSTLPKLQP-LVWYKTTFDAPAGSEPVAIDFTGMGKGEAWVNGQSIGR 662
P+G SS W S + Q L W+K F+AP G EP+A+D + MGKG+ W+NGQSIGR
Sbjct: 598 SPNGVSSVDWVRDSLDVRSQSQLKWHKAYFNAPDGVEPLALDLSSMGKGQVWINGQSIGR 657
Query: 663 YWPTYVSQNGGCTDSCNYRGAYSSNKCLKNCGKPSQSLYHVPRSWLKSSGNTLVLFEEIG 722
YW Y G +SCNY G Y KC CG+P+Q YHVPRSWLK + N +VL EE+G
Sbjct: 658 YWMVYAK---GACNSCNYAGTYRPAKCQLGCGQPTQQWYHVPRSWLKPTNNLIVLLEELG 714
Query: 723 GDPTKISFVTK 733
G+P KIS +
Sbjct: 715 GNPWKISLQKR 725
>gi|356502275|ref|XP_003519945.1| PREDICTED: beta-galactosidase-like [Glycine max]
Length = 835
Score = 852 bits (2201), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 425/854 (49%), Positives = 553/854 (64%), Gaps = 53/854 (6%)
Query: 8 LLVLCWGFVVLATTSFGANVTYDHRAVVIGGKRRVLISGSIHYPRSTPEMWPDLIQKSKD 67
LL+LC + +A + +V+YD RA+ I GKR++L SGSIHYPRST EMWP LI+KSK+
Sbjct: 10 LLLLCSALISIAIEAI--DVSYDGRAITIDGKRKILFSGSIHYPRSTAEMWPSLIEKSKE 67
Query: 68 GGLDVIETYVFWNLHEPVRNQYNFEGRYDLVKFVKLVAEAGLYAHLRIGPYVCAEWNFGG 127
GGLDVIETYVFWN+HEP QY+F G DLV+F+K + GL+A LRIGPYVCAEWN+GG
Sbjct: 68 GGLDVIETYVFWNVHEPHPGQYDFSGNLDLVRFIKTIQNQGLHAVLRIGPYVCAEWNYGG 127
Query: 128 FPLWLHFIPGIQFRTDNEPFKAEMQRFTAKIVDMMKQEKLYASQGGPIILSQIENEYGNI 187
FP+WLH IP I+FRT+N F+ EM++FT IVDMM+ EKL+ASQGGPIIL+QIENEYGNI
Sbjct: 128 FPVWLHNIPNIEFRTNNAIFEDEMKKFTTLIVDMMRHEKLFASQGGPIILAQIENEYGNI 187
Query: 188 DSAYGAAGKSYIKWAAGMALSLDTGVPWVMCQQSDAPDPIINTCNGFYCDQFTPNSNNKP 247
+YG GK Y++W A +A S GVPW+MCQQSD PDP+INTCNGFYCDQ+ PNSNNKP
Sbjct: 188 MGSYGQNGKEYVQWCAQLAQSYQIGVPWIMCQQSDTPDPLINTCNGFYCDQWHPNSNNKP 247
Query: 248 KMWTENWSGWFLSFGGAVPYRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNFDRTSGGPF 307
KMWTE+W+GWF+ +GG P+R ED+AFAV RFFQ GGTFQNYYMYHGGTNF RTSGGP+
Sbjct: 248 KMWTEDWTGWFMHWGGPTPHRTAEDVAFAVGRFFQYGGTFQNYYMYHGGTNFGRTSGGPY 307
Query: 308 ISTSYDYDAPLDEYGLIRQPKWGHLKDLHKAIKLCEAALVATDPTYPSLGPNLEATVYK- 366
I+TSYDYDAPL+EYG + QPKWGHLK LH+ +K E L G + AT++
Sbjct: 308 ITTSYDYDAPLNEYGDLNQPKWGHLKRLHEVLKSVETTLTMGSSRNIDYGNQMTATIFSY 367
Query: 367 TGSGLCSAFLANIGTNSDVTVKFNGNSYLLPAWSVSILPDCKNVVFNTAKINSVTLVPSF 426
G +C FL N + D + F Y +PAWSVSILPDC V+NTAK+N+ T +
Sbjct: 368 AGQSVC--FLGNAHPSMDANINFQNTQYTIPAWSVSILPDCYTEVYNTAKVNAQTSI--- 422
Query: 427 SRQSLQVAADSSDAIGSGWSYINEPVGISKDD-------AFTKPGLLEQINTTADQSDYL 479
+ ++ ++ W ++ E D A T P LL+Q D SDYL
Sbjct: 423 ------MTINNENSYALDWQWMPETHLEQMKDGKVLGSVAITAPRLLDQ-KVANDTSDYL 475
Query: 480 WYSLSTNIKADEPLLEDGSKTVLHVQSLGHALHAFINGKLVGSGYGSSSNAKVTVDFPIA 539
WY S ++K +P+L K + V + GH LH F+NG +GS Y + T + I
Sbjct: 476 WYITSVDVKQGDPILSHDLK--IRVNTKGHVLHVFVNGAHIGSQYATYGKYPFTFEADIK 533
Query: 540 LAPGKNTFDLLSLTVGLQNYGAFYEKTGAGITGPVQLKGSGNGTNI--DLSSQQWTYQTG 597
L GKN L+S TVGL NYGA+++ G+TG VQL +G+ + D+S+ W Y+ G
Sbjct: 534 LKLGKNEISLVSGTVGLPNYGAYFDNIHVGVTG-VQLVSQNDGSEVTKDISTNVWHYKVG 592
Query: 598 LKGEELNF--PSGSSTQWDSKSTLPKLQPLVWYKTTFDAPAGSEPVAIDFTGMGKGEAWV 655
+ GE + PS SS +W + L + +WYKTTF P G++ V +D G+GKG+AWV
Sbjct: 593 MHGENVKLYSPSRSSEEWFTNG-LQAHKIFMWYKTTFRTPVGTDSVVLDLKGLGKGQAWV 651
Query: 656 NGQSIGRYWPTYVSQNGGCTDSCNYRGAYSSNKCLKNCGKPSQSLYHVPRSWLKSS-GNT 714
NG +IGRYW +Y++ GC+ +C+YRG Y SNKC NCG P+Q YHVP S+L+ NT
Sbjct: 652 NGNNIGRYWVSYLAGEDGCSSTCDYRGTYRSNKCTTNCGNPTQRWYHVPDSFLRDGLDNT 711
Query: 715 LVLFEEIGGDPTKISFVTKQLGSSLCSHVTDSHPLPVDMWGSDSKIQRKPGPVLSLECPN 774
LV+FEE GG+P ++ T + + C+ + H L L C
Sbjct: 712 LVVFEEQGGNPFQVKIATVTIAKA-CAKAYEGHE-------------------LELACKE 751
Query: 775 PNQVISSIKFASFGTPLGTCGSFSRGRCSSARSLSVVRQACVGSKSCSIGVSVNTFGDP- 833
NQVIS I+FASFG P G CGSF +G C S+ +LS+V++ C+G + CSI V+ G
Sbjct: 752 -NQVISEIRFASFGVPEGECGSFKKGHCESSDTLSIVKRLCLGKQQCSIHVNEKMLGPTG 810
Query: 834 CKGVMKSLAVEASC 847
C+ LA++A C
Sbjct: 811 CRVPENRLAIDALC 824
>gi|75134155|sp|Q6Z6K4.1|BGAL4_ORYSJ RecName: Full=Beta-galactosidase 4; Short=Lactase 4; Flags:
Precursor
gi|46805855|dbj|BAD17189.1| putative beta-galactosidase precursor [Oryza sativa Japonica Group]
Length = 729
Score = 850 bits (2196), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 412/712 (57%), Positives = 499/712 (70%), Gaps = 23/712 (3%)
Query: 25 ANVTYDHRAVVIGGKRRVLISGSIHYPRSTPEMWPDLIQKSKDGGLDVIETYVFWNLHEP 84
A V+YD R++VI G+RR+L+SGSIHYPRSTPEMWP LIQK+KDGGLDVI+TYVFWN HEP
Sbjct: 36 AAVSYDRRSLVINGRRRILLSGSIHYPRSTPEMWPGLIQKAKDGGLDVIQTYVFWNGHEP 95
Query: 85 VRNQYNFEGRYDLVKFVKLVAEAGLYAHLRIGPYVCAEWNFGGFPLWLHFIPGIQFRTDN 144
V+ QY F RYDLV+FVKLV +AGLY HLRIGPYVCAEWNFGGFP+WL ++PG+ FRTDN
Sbjct: 96 VQGQYYFSDRYDLVRFVKLVKQAGLYVHLRIGPYVCAEWNFGGFPVWLKYVPGVSFRTDN 155
Query: 145 EPFKAEMQRFTAKIVDMMKQEKLYASQGGPIILSQIENEYGNIDSAYGAAGKSYIKWAAG 204
PFKAEMQ+F KIV MMK E L+ QGGPII+SQ+ENE+G ++S G+ K Y WAA
Sbjct: 156 GPFKAEMQKFVEKIVSMMKSEGLFEWQGGPIIMSQVENEFGPMESVGGSGAKPYANWAAK 215
Query: 205 MALSLDTGVPWVMCQQSDAPDPIINTCNGFYCDQFTPNSNNKPKMWTENWSGWFLSFGGA 264
MA+ +TGVPWVMC+Q DAPDP+INTCNGFYCD F+PN N KP MWTE W+GWF SFGG
Sbjct: 216 MAVGTNTGVPWVMCKQDDAPDPVINTCNGFYCDYFSPNKNYKPSMWTEAWTGWFTSFGGG 275
Query: 265 VPYRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNFDRTSGGPFISTSYDYDAPLDEYGLI 324
VP+RPVEDLAFAVARF Q+GG+F NYYMYHGGTNF RT+GGPFI+TSYDYDAP+DE+GL+
Sbjct: 276 VPHRPVEDLAFAVARFIQKGGSFVNYYMYHGGTNFGRTAGGPFIATSYDYDAPIDEFGLL 335
Query: 325 RQPKWGHLKDLHKAIKLCEAALVATDPTYPSLGPNLEATVYKTGSGLCSAFLANIGTNSD 384
RQPKWGHL+DLH+AIK E LV+ DPT S+G +A V+K +G C+AFL+N N+
Sbjct: 336 RQPKWGHLRDLHRAIKQAEPVLVSADPTIESIGSYEKAYVFKAKNGACAAFLSNYHMNTA 395
Query: 385 VTVKFNGNSYLLPAWSVSILPDCKNVVFNTAKINSVTLVPSFSRQSLQVAADSSDAIGSG 444
V V+FNG Y LPAWS+SILPDCK VFNTA + TL+P + +
Sbjct: 396 VKVRFNGQQYNLPAWSISILPDCKTAVFNTATVKEPTLMPKM-----------NPVVRFA 444
Query: 445 WSYINEPVGISKDDAFTKPGLLEQINTTADQSDYLWYSLSTNIKADEPLLEDGSKTVLHV 504
W +E D AFTK GL+EQ++ T D+SDYLWY+ NI ++ L G L V
Sbjct: 445 WQSYSEDTNSLSDSAFTKDGLVEQLSMTWDKSDYLWYTTYVNIGTND--LRSGQSPQLTV 502
Query: 505 QSLGHALHAFINGKLVGSGYGSSSNAKVTVDFPIALAPGKNTFDLLSLTVGLQNYGAFYE 564
S GH++ F+NGK GS YG N K+T + + + G N +LS VGL N G +E
Sbjct: 503 YSAGHSMQVFVNGKSYGSVYGGYDNPKLTYNGRVKMWQGSNKISILSSAVGLPNVGNHFE 562
Query: 565 KTGAGITGPVQLKGSGNGTNIDLSSQQWTYQTGLKGEELNFPS---GSSTQWDSKSTLPK 621
G+ GPV L S NG DLS Q+WTYQ GLKGE L + S+ +W
Sbjct: 563 NWNVGVLGPVTLS-SLNGGTKDLSHQKWTYQVGLKGETLGLHTVTGSSAVEWGGPG---G 618
Query: 622 LQPLVWYKTTFDAPAGSEPVAIDFTGMGKGEAWVNGQSIGRYWPTYVSQNGGCTDSCNYR 681
QPL W+K F+APAG++PVA+D MGKG+ WVNG +GRYW S GGC C+Y
Sbjct: 619 YQPLTWHKAFFNAPAGNDPVALDMGSMGKGQLWVNGHHVGRYWSYKAS--GGC-GGCSYA 675
Query: 682 GAYSSNKCLKNCGKPSQSLYHVPRSWLKSSGNTLVLFEEIGGDPTKISFVTK 733
G Y +KC NCG SQ YHVPRSWLK GN LV+ EE GGD +S T+
Sbjct: 676 GTYHEDKCRSNCGDLSQRWYHVPRSWLKPGGNLLVVLEEYGGDLAGVSLATR 727
>gi|356502277|ref|XP_003519946.1| PREDICTED: beta-galactosidase-like [Glycine max]
Length = 835
Score = 850 bits (2196), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 422/836 (50%), Positives = 545/836 (65%), Gaps = 51/836 (6%)
Query: 26 NVTYDHRAVVIGGKRRVLISGSIHYPRSTPEMWPDLIQKSKDGGLDVIETYVFWNLHEPV 85
+V+YD RA+ I GKR++L SGSIHYPRST EMWP LI+KSK+GGLDVIETYVFWN+HEP
Sbjct: 26 DVSYDGRAITIDGKRKILFSGSIHYPRSTAEMWPSLIEKSKEGGLDVIETYVFWNVHEPH 85
Query: 86 RNQYNFEGRYDLVKFVKLVAEAGLYAHLRIGPYVCAEWNFGGFPLWLHFIPGIQFRTDNE 145
QY+F G DLV+F+K + GLYA LRIGPYVCAEWN+GGFP+WLH IP I+FRT+N
Sbjct: 86 PGQYDFSGNLDLVRFIKTIQNQGLYAVLRIGPYVCAEWNYGGFPVWLHNIPNIEFRTNNA 145
Query: 146 PFKAEMQRFTAKIVDMMKQEKLYASQGGPIILSQIENEYGNIDSAYGAAGKSYIKWAAGM 205
F+ EM++FT IVDMM+ EKL+ASQGGPIIL+QIENEYGNI +YG GK Y++W A +
Sbjct: 146 IFEDEMKKFTTLIVDMMRHEKLFASQGGPIILAQIENEYGNIMGSYGQNGKEYVQWCAQL 205
Query: 206 ALSLDTGVPWVMCQQSDAPDPIINTCNGFYCDQFTPNSNNKPKMWTENWSGWFLSFGGAV 265
A S GVPW+MCQQSDAPDP+INTCNGFYCDQ+ PNSNNKPKMWTE+W+GWF+ +GG
Sbjct: 206 AQSYQIGVPWIMCQQSDAPDPLINTCNGFYCDQWHPNSNNKPKMWTEDWTGWFMHWGGPT 265
Query: 266 PYRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNFDRTSGGPFISTSYDYDAPLDEYGLIR 325
P+R ED+AFAV RFFQ GGTFQNYYMYHGGTNF RTSGGP+I+TSYDYDAPL+EYG +
Sbjct: 266 PHRTAEDVAFAVGRFFQYGGTFQNYYMYHGGTNFGRTSGGPYITTSYDYDAPLNEYGDLN 325
Query: 326 QPKWGHLKDLHKAIKLCEAALVATDPTYPSLGPNLEATVYK-TGSGLCSAFLANIGTNSD 384
QPKWGHLK LH+ +K E L G + AT++ G +C FL N + D
Sbjct: 326 QPKWGHLKRLHEVLKSVETTLTMGSSRNIDYGNQMTATIFSYAGQSVC--FLGNAHPSMD 383
Query: 385 VTVKFNGNSYLLPAWSVSILPDCKNVVFNTAKINSVTLVPSFSRQSLQVAADSSDAIGSG 444
+ F Y +PAWSVSILPDC V+NTAK+N+ T + + ++ ++
Sbjct: 384 ANINFQNTQYTIPAWSVSILPDCYTEVYNTAKVNAQTSI---------MTINNENSYALD 434
Query: 445 WSYINEPVGISKDD-------AFTKPGLLEQINTTADQSDYLWYSLSTNIKADEPLLEDG 497
W ++ E D A T P LL+Q D SDYLWY S ++K +P+L
Sbjct: 435 WQWMPETHLEQMKDGKVLGSVAITAPRLLDQ-KVANDTSDYLWYITSVDVKQGDPILSHD 493
Query: 498 SKTVLHVQSLGHALHAFINGKLVGSGYGSSSNAKVTVDFPIALAPGKNTFDLLSLTVGLQ 557
K + V + GH LH F+NG +GS Y + T + I L GKN L+S TVGL
Sbjct: 494 LK--IRVNTKGHVLHVFVNGAHIGSQYATYGKYTFTFEADIKLKLGKNEISLVSGTVGLP 551
Query: 558 NYGAFYEKTGAGITGPVQLKGSGNGTNI--DLSSQQWTYQTGLKGEELNF--PSGSSTQW 613
NYGA+++ G+TG VQL +G+ + D+S+ W Y+ G+ GE + PS S+ +W
Sbjct: 552 NYGAYFDNIHVGVTG-VQLVSQNDGSEVTKDISTNVWHYKVGMHGENVKLYSPSRSTEEW 610
Query: 614 DSKSTLPKLQPLVWYKTTFDAPAGSEPVAIDFTGMGKGEAWVNGQSIGRYWPTYVSQNGG 673
+ L + +WYKTTF P G++ V +D G+GKG+AWVNG +IGRYW +Y++ G
Sbjct: 611 FTNG-LQAHKIFMWYKTTFRTPVGTDSVVLDLKGLGKGQAWVNGNNIGRYWVSYLAGEDG 669
Query: 674 CTDSCNYRGAYSSNKCLKNCGKPSQSLYHVPRSWLKSS-GNTLVLFEEIGGDPTKISFVT 732
C+ +C+YRG Y SNKC NCG P+Q YHVP S+L+ NTLV+FEE GG+P ++ T
Sbjct: 670 CSSTCDYRGTYRSNKCTTNCGNPTQRWYHVPDSFLRDGLDNTLVVFEEQGGNPFQVKIAT 729
Query: 733 KQLGSSLCSHVTDSHPLPVDMWGSDSKIQRKPGPVLSLECPNPNQVISSIKFASFGTPLG 792
+ + C+ + H L L C NQVIS IKFASFG P G
Sbjct: 730 VTIAKA-CAKAYEGHE-------------------LELAC-KENQVISEIKFASFGVPEG 768
Query: 793 TCGSFSRGRCSSARSLSVVRQACVGSKSCSIGVSVNTFGDP-CKGVMKSLAVEASC 847
CGSF +G C S+ +LS+V++ C+G + CSI V+ G C+ LA++A C
Sbjct: 769 ECGSFKKGHCESSDTLSIVKRLCLGKQQCSIQVNEKMLGPTGCRVPENRLAIDALC 824
>gi|152013361|sp|A2X2H7.1|BGAL4_ORYSI RecName: Full=Beta-galactosidase 4; Short=Lactase 4; Flags:
Precursor
gi|125538642|gb|EAY85037.1| hypothetical protein OsI_06394 [Oryza sativa Indica Group]
Length = 729
Score = 850 bits (2195), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 412/712 (57%), Positives = 499/712 (70%), Gaps = 23/712 (3%)
Query: 25 ANVTYDHRAVVIGGKRRVLISGSIHYPRSTPEMWPDLIQKSKDGGLDVIETYVFWNLHEP 84
A V+YD R++VI G+RR+L+SGSIHYPRSTPEMWP LIQK+KDGGLDVI+TYVFWN HEP
Sbjct: 36 AAVSYDRRSLVINGRRRILLSGSIHYPRSTPEMWPGLIQKAKDGGLDVIQTYVFWNGHEP 95
Query: 85 VRNQYNFEGRYDLVKFVKLVAEAGLYAHLRIGPYVCAEWNFGGFPLWLHFIPGIQFRTDN 144
V+ QY F RYDLV+FVKLV +AGLY HLRIGPYVCAEWNFGGFP+WL ++PG+ FRTDN
Sbjct: 96 VQGQYYFSDRYDLVRFVKLVKQAGLYVHLRIGPYVCAEWNFGGFPVWLKYVPGVSFRTDN 155
Query: 145 EPFKAEMQRFTAKIVDMMKQEKLYASQGGPIILSQIENEYGNIDSAYGAAGKSYIKWAAG 204
PFKAEMQ+F KIV MMK E L+ QGGPII+SQ+ENE+G ++S G+ K Y WAA
Sbjct: 156 GPFKAEMQKFVEKIVSMMKSEGLFEWQGGPIIMSQVENEFGPMESVGGSGAKPYANWAAK 215
Query: 205 MALSLDTGVPWVMCQQSDAPDPIINTCNGFYCDQFTPNSNNKPKMWTENWSGWFLSFGGA 264
MA+ +TGVPWVMC+Q DAPDP+INTCNGFYCD F+PN N KP MWTE W+GWF SFGG
Sbjct: 216 MAVRTNTGVPWVMCKQDDAPDPVINTCNGFYCDYFSPNKNYKPSMWTEAWTGWFTSFGGG 275
Query: 265 VPYRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNFDRTSGGPFISTSYDYDAPLDEYGLI 324
VP+RPVEDLAFAVARF Q+GG+F NYYMYHGGTNF RT+GGPFI+TSYDYDAP+DE+GL+
Sbjct: 276 VPHRPVEDLAFAVARFIQKGGSFVNYYMYHGGTNFGRTAGGPFIATSYDYDAPIDEFGLL 335
Query: 325 RQPKWGHLKDLHKAIKLCEAALVATDPTYPSLGPNLEATVYKTGSGLCSAFLANIGTNSD 384
RQPKWGHL+DLH+AIK E LV+ DPT S+G +A V+K +G C+AFL+N N+
Sbjct: 336 RQPKWGHLRDLHRAIKQAEPVLVSADPTIESIGSYEKAYVFKAKNGACAAFLSNYHMNTA 395
Query: 385 VTVKFNGNSYLLPAWSVSILPDCKNVVFNTAKINSVTLVPSFSRQSLQVAADSSDAIGSG 444
V V+FNG Y LPAWS+SILPDCK VFNTA + TL+P + +
Sbjct: 396 VKVRFNGQQYNLPAWSISILPDCKTAVFNTATVKEPTLMPKM-----------NPVVRFA 444
Query: 445 WSYINEPVGISKDDAFTKPGLLEQINTTADQSDYLWYSLSTNIKADEPLLEDGSKTVLHV 504
W +E D AFTK GL+EQ++ T D+SDYLWY+ NI ++ L G L V
Sbjct: 445 WQSYSEDTNSLSDSAFTKDGLVEQLSMTWDKSDYLWYTTYVNIGTND--LRSGQSPQLTV 502
Query: 505 QSLGHALHAFINGKLVGSGYGSSSNAKVTVDFPIALAPGKNTFDLLSLTVGLQNYGAFYE 564
S GH++ F+NGK GS YG N K+T + + + G N +LS VGL N G +E
Sbjct: 503 YSAGHSMQVFVNGKSYGSVYGGYDNPKLTYNGRVKMWQGSNKISILSSAVGLPNVGNHFE 562
Query: 565 KTGAGITGPVQLKGSGNGTNIDLSSQQWTYQTGLKGEELNFPS---GSSTQWDSKSTLPK 621
G+ GPV L S NG DLS Q+WTYQ GLKGE L + S+ +W
Sbjct: 563 NWNVGVLGPVTLS-SLNGGTKDLSHQKWTYQVGLKGETLGLHTVTGSSAVEWGGPG---G 618
Query: 622 LQPLVWYKTTFDAPAGSEPVAIDFTGMGKGEAWVNGQSIGRYWPTYVSQNGGCTDSCNYR 681
QPL W+K F+APAG++PVA+D MGKG+ WVNG +GRYW S GGC C+Y
Sbjct: 619 YQPLTWHKAFFNAPAGNDPVALDMGSMGKGQLWVNGHHVGRYWSYKAS--GGC-GGCSYA 675
Query: 682 GAYSSNKCLKNCGKPSQSLYHVPRSWLKSSGNTLVLFEEIGGDPTKISFVTK 733
G Y +KC NCG SQ YHVPRSWLK GN LV+ EE GGD +S T+
Sbjct: 676 GTYHEDKCRSNCGDLSQRWYHVPRSWLKPGGNLLVVLEEYGGDLAGVSLATR 727
>gi|297808143|ref|XP_002871955.1| beta-galactosidase 7 [Arabidopsis lyrata subsp. lyrata]
gi|297317792|gb|EFH48214.1| beta-galactosidase 7 [Arabidopsis lyrata subsp. lyrata]
Length = 826
Score = 849 bits (2194), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 429/850 (50%), Positives = 555/850 (65%), Gaps = 53/850 (6%)
Query: 15 FVVLATTSFGAN---VTYDHRAVVIGGKRRVLISGSIHYPRSTPEMWPDLIQKSKDGGLD 71
F +L T+ AN V++D RA+ I GKRR+L+SGSIHYPRST +MWPDLI K+KDGGLD
Sbjct: 13 FFILITSFSLANSTIVSHDERAITINGKRRILLSGSIHYPRSTADMWPDLINKAKDGGLD 72
Query: 72 VIETYVFWNLHEPVRNQYNFEGRYDLVKFVKLVAEAGLYAHLRIGPYVCAEWNFGGFPLW 131
IETYVFWN HEP R +Y+F G D+V+F+K + +AGLY+ LRIGPYVCAEWN+GGFP+W
Sbjct: 73 AIETYVFWNAHEPKRREYDFSGNLDVVRFIKTIQDAGLYSVLRIGPYVCAEWNYGGFPVW 132
Query: 132 LHFIPGIQFRTDNEPFKAEMQRFTAKIVDMMKQEKLYASQGGPIILSQIENEYGNIDSAY 191
LH +P ++FRT N F EMQ FT KIV+MMK+EKL+ASQGGPIIL+QIENEYGN+ S+Y
Sbjct: 133 LHNMPNMKFRTVNPSFMNEMQNFTTKIVEMMKEEKLFASQGGPIILAQIENEYGNVISSY 192
Query: 192 GAAGKSYIKWAAGMALSLDTGVPWVMCQQSDAPDPIINTCNGFYCDQFTPNSNNKPKMWT 251
GAAGK+YI W A MA SLD GVPW+MCQQ +AP P++ TCNGFYCDQ+ P + + PKMWT
Sbjct: 193 GAAGKAYIDWCANMANSLDIGVPWLMCQQPNAPQPMLETCNGFYCDQYEPTNPSTPKMWT 252
Query: 252 ENWSGWFLSFGGAVPYRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNFDRTSGGPFISTS 311
ENW+GWF ++GG PYR EDLAF+VARFFQ GGTFQNYYMYHGGTNF R +GGP+I+TS
Sbjct: 253 ENWTGWFKNWGGKHPYRTAEDLAFSVARFFQTGGTFQNYYMYHGGTNFGRVAGGPYITTS 312
Query: 312 YDYDAPLDEYGLIRQPKWGHLKDLHKAIKLCEAALVATDPTYPSLGPNLEATVYKTGSGL 371
YDY AP+DE+G + QPKWGHLK LH+ +K E +L + + LG +++AT+Y T G
Sbjct: 313 YDYHAPIDEFGNLNQPKWGHLKQLHRVLKSMEKSLTYGNISRIDLGNSIKATIYTTKEG- 371
Query: 372 CSAFLANIGTNSDVTVKFNGNSYLLPAWSVSILPDCKNVVFNTAKINSVTLVPSFSRQSL 431
S F+ N+ ++ V F G Y +PAWSVS+LP+C +NTAK+N+ T +
Sbjct: 372 SSCFIGNVNATANALVNFKGKDYHVPAWSVSVLPECDKEAYNTAKVNTQTSI-------- 423
Query: 432 QVAADSSDAIGSGWSYINEP----VGISKDDAFTKPGLLEQINTTADQSDYLWYSLSTNI 487
+ DSS W++ E + S D K GL++Q + T D SDYLWY ++
Sbjct: 424 -MTEDSSKPEKLEWTWRPESAQKMILKSSGDLIAK-GLVDQKDVTNDASDYLWYMTRVHL 481
Query: 488 KADEPLLEDGSKTVLHVQSLGHALHAFINGKLVGSGYGSSSNAKVTVDFPI-ALAPGKNT 546
+PL L V S H LHA++NGK VG+ + + + L G N
Sbjct: 482 DKKDPLWS--RNMTLRVHSNAHVLHAYVNGKYVGNQFVKDGKFDYRFEKKVNHLVHGTNH 539
Query: 547 FDLLSLTVGLQNYGAFYEKTGAGITGPVQLKGSGNGTNI--DLSSQQWTYQTGLKGEELN 604
LLS++VGLQNYGAF+E GI GPV L G I DLS QW Y+ GL G
Sbjct: 540 ISLLSVSVGLQNYGAFFESGPTGINGPVSLVGYKGEETIEKDLSQHQWDYKIGLNGYNNK 599
Query: 605 FPSGSST---QWDSKSTLPKLQPLVWYKTTFDAPAGSEPVAIDFTGMGKGEAWVNGQSIG 661
S S +W + P + L WYK F AP G EPV +DF G+GKGEAW+NGQSIG
Sbjct: 600 LFSTKSVGHIKW-ANEMFPTSRMLTWYKAKFKAPLGKEPVIVDFNGLGKGEAWINGQSIG 658
Query: 662 RYWPTYVSQNGGCTDSCNYRGAYSSNKCLKNCGKPSQSLYHVPRSWLKSSG-NTLVLFEE 720
RYWP++ S + GC D C+YRG Y S+KC CG+P+Q YHVPRS+LK+SG NT+ LFEE
Sbjct: 659 RYWPSFNSSDDGCKDECDYRGEYGSDKCAFMCGEPTQRWYHVPRSFLKASGHNTITLFEE 718
Query: 721 IGGDPTKISFVTKQLGSSLCSHVTDSHPLPVDMWGSDSKIQRKPGPVLSLECPNPNQVIS 780
+GG+P+ ++F T +G ++C+ + + + L C N IS
Sbjct: 719 MGGNPSMVNFKTVVVG-TVCARAHEHNK-------------------VELSC--HNHPIS 756
Query: 781 SIKFASFGTPLGTCGSFSRGRCSSAR-SLSVVRQACVGSKSCSIGVSVNTFGDP--CKGV 837
++KFASFG P+G CG+F+ G C + ++ V + CVG +C+I VS +TFG C
Sbjct: 757 AVKFASFGNPVGHCGTFAVGTCQGDKDAVKTVAKECVGKLNCTINVSSDTFGSTLDCGDS 816
Query: 838 MKSLAVEASC 847
K LAVE C
Sbjct: 817 PKKLAVELEC 826
>gi|3299896|gb|AAC25984.1| beta-galactosidase [Solanum lycopersicum]
Length = 724
Score = 849 bits (2194), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 409/717 (57%), Positives = 512/717 (71%), Gaps = 17/717 (2%)
Query: 21 TSFGANVTYDHRAVVIGGKRRVLISGSIHYPRSTPEMWPDLIQKSKDGGLDVIETYVFWN 80
+S A+V+YD RA++I GKR++LISGSIHYPRSTP+MWPDLIQK+KDGGLDVIETYVFWN
Sbjct: 19 SSVKASVSYDDRAIIINGKRKILISGSIHYPRSTPQMWPDLIQKAKDGGLDVIETYVFWN 78
Query: 81 LHEPVRNQYNFEGRYDLVKFVKLVAEAGLYAHLRIGPYVCAEWNFGGFPLWLHFIPGIQF 140
HEP +YNFEGRYDLV+F+K+V AGLY +LRIGPYVCAEWNFGGFP+WL ++PG++F
Sbjct: 79 GHEPSPGKYNFEGRYDLVRFIKMVQRAGLYVNLRIGPYVCAEWNFGGFPVWLKYVPGMEF 138
Query: 141 RTDNEPFKAEMQRFTAKIVDMMKQEKLYASQGGPIILSQIENEYGNIDSAYGAAGKSYIK 200
RT+N+PFK MQ F KIV+MMK E L+ SQGGPII++QIENEYG ++ GA GK+Y K
Sbjct: 139 RTNNQPFKVAMQGFVQKIVNMMKSENLFESQGGPIIMAQIENEYGPVEWEIGAPGKAYTK 198
Query: 201 WAAGMALSLDTGVPWVMCQQSDAPDPIINTCNGFYCDQFTPNSNNKPKMWTENWSGWFLS 260
WAA MA+ L TGVPW+MC+Q DAPDP+I+TCNGFYC+ F PN KPKMWTE W+GW+
Sbjct: 199 WAAQMAVGLKTGVPWIMCKQEDAPDPVIDTCNGFYCEGFRPNKPYKPKMWTEVWTGWYTK 258
Query: 261 FGGAVPYRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNFDRTSGGPFISTSYDYDAPLDE 320
FGG +P RP ED+AF+VARF Q G+F NYYMYHGGTNF RTS G FI+TSYDYDAPLDE
Sbjct: 259 FGGPIPQRPAEDIAFSVARFVQNNGSFFNYYMYHGGTNFGRTSSGLFIATSYDYDAPLDE 318
Query: 321 YGLIRQPKWGHLKDLHKAIKLCEAALVATDPTYPSLGPNLEATVYKTGSGLCSAFLANIG 380
YGL+ +PK+GHL+DLHKAIKL E ALV++ SLG N EA VY++ SG C+AFL+N
Sbjct: 319 YGLLNEPKYGHLRDLHKAIKLSEPALVSSYAAVTSLGSNQEAHVYRSKSGACAAFLSNYD 378
Query: 381 TNSDVTVKFNGNSYLLPAWSVSILPDCKNVVFNTAKINSVTLVPSFSRQSLQVAADSSDA 440
+ V V F Y LP WS+SILPDCK V+NTA++NS QS + +
Sbjct: 379 SRYSVKVTFQNRPYNLPPWSISILPDCKTAVYNTAQVNS---------QSSSIKMTPAGG 429
Query: 441 IGSGWSYINEPVGISKD-DAFTKPGLLEQINTTADQSDYLWYSLSTNIKADEPLLEDGSK 499
G W NE + D D T GL EQ N T D SDYLWY + NI ++E L++G
Sbjct: 430 -GLSWQSYNEETPTADDSDTLTANGLWEQKNVTRDSSDYLWYMTNVNIASNEGFLKNGKD 488
Query: 500 TVLHVQSLGHALHAFINGKLVGSGYGSSSNAKVTVDFPIALAPGKNTFDLLSLTVGLQNY 559
L V S GH LH F+NGKL G+ YG+ N K+T + L G N LLS++VGL N
Sbjct: 489 PYLTVMSAGHVLHVFVNGKLSGTVYGTLDNPKLTYSGNVKLRAGINKISLLSVSVGLPNV 548
Query: 560 GAFYEKTGAGITGPVQLKGSGNGTNIDLSSQQWTYQTGLKGE---ELNFPSGSSTQWDSK 616
G Y+ AG+ GPV L G G+ +L+ Q+W+Y+ GLKGE + SS +W
Sbjct: 549 GVHYDTWNAGVLGPVTLSGLNEGSR-NLAKQKWSYKVGLKGESLSLHSLSGSSSVEWVRG 607
Query: 617 STLPKLQPLVWYKTTFDAPAGSEPVAIDFTGMGKGEAWVNGQSIGRYWPTYVSQNGGCTD 676
S + + QPL WYK TF+AP G++P+A+D MGKG+ W+NG+ +GR+WP Y++Q G C+
Sbjct: 608 SLMAQKQPLTWYKATFNAPGGNDPLALDMASMGKGQIWINGEGVGRHWPGYIAQ-GDCS- 665
Query: 677 SCNYRGAYSSNKCLKNCGKPSQSLYHVPRSWLKSSGNTLVLFEEIGGDPTKISFVTK 733
C+Y G ++ KC NCG+PSQ YHVPRSWLK SGN LV+FEE GG+PT IS V +
Sbjct: 666 KCSYAGTFNEKKCQTNCGQPSQRWYHVPRSWLKPSGNLLVVFEEWGGNPTGISLVRR 722
>gi|334305536|gb|AEG76892.1| putative beta-galactosidase [Linum usitatissimum]
gi|334305538|gb|AEG76893.1| putative beta-galactosidase [Linum usitatissimum]
Length = 731
Score = 848 bits (2191), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 410/714 (57%), Positives = 508/714 (71%), Gaps = 17/714 (2%)
Query: 25 ANVTYDHRAVVIGGKRRVLISGSIHYPRSTPEMWPDLIQKSKDGGLDVIETYVFWNLHEP 84
A VTYD +A+++ G+RR+LI+GSIHYPRSTPEMWPDLIQK+KDGGLDVI+TYVFWN HEP
Sbjct: 29 ATVTYDGKAIIVNGQRRILIAGSIHYPRSTPEMWPDLIQKAKDGGLDVIQTYVFWNGHEP 88
Query: 85 VRNQYNFEGRYDLVKFVKLVAEAGLYAHLRIGPYVCAEWNFGGFPLWLHFIPGIQFRTDN 144
Y FE R+DLVKFVK+V +AGLY +LRIGPY CAEWNFGGFP+WL ++PG+ FRTDN
Sbjct: 89 SPGNYYFEDRFDLVKFVKVVQQAGLYVNLRIGPYACAEWNFGGFPVWLKYVPGMSFRTDN 148
Query: 145 EPFKAEMQRFTAKIVDMMKQEKLYASQGGPIILSQIENEYGNIDSAYGAAGKSYIKWAAG 204
EPFKA MQ+FT KIV+MMKQE+L+ QGGPIILSQIENEYG I+ A GK+Y +WAA
Sbjct: 149 EPFKAAMQKFTEKIVNMMKQEQLFEPQGGPIILSQIENEYGPIEWELKAPGKAYAQWAAQ 208
Query: 205 MALSLDTGVPWVMCQQSDAPDPIINTCNGFYCDQFTPNSNNKPKMWTENWSGWFLSFGGA 264
MA+ L+TGVPW+ C+Q DAPDP+I+TCN +YC++FTPN + KPKMWTE W+ WF S+G
Sbjct: 209 MAVGLNTGVPWIACKQEDAPDPLIDTCNAYYCEKFTPNKSYKPKMWTEAWTAWFTSWGNP 268
Query: 265 VPYRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNFDRTSGGPFISTSYDYDAPLDEYGLI 324
V YRP ED AF+V +F Q GG++ NYYMYHGGTNF RT+GGPF++TSYDYDAPLDEYGL
Sbjct: 269 VLYRPAEDQAFSVLKFIQSGGSYANYYMYHGGTNFGRTAGGPFVATSYDYDAPLDEYGLT 328
Query: 325 RQPKWGHLKDLHKAIKLCEAALVATDPTYPSLGPNLEATVYKTGSGLCSAFLANIGTNSD 384
PK+ HLK +HKAIK E ALV+ D T SLG N EA VY + SG C+AFLAN +
Sbjct: 329 NDPKYTHLKHMHKAIKQSEKALVSADATVTSLGTNQEAHVYSSSSG-CAAFLANYDVSYS 387
Query: 385 VTVKFNGNSYLLPAWSVSILPDCKNVVFNTAKINSVTLVPSFSRQSLQVAADSSDAIGSG 444
V V F Y LPAWS+SILPDCK V+NTAK+ L P ++ + + D
Sbjct: 388 VKVNFGSGQYDLPAWSISILPDCKTEVYNTAKV----LAPRVHKKMTPLGGFTWD----- 438
Query: 445 WSYINEPVGISKDDAFTKPGLLEQINTTADQSDYLWYSLSTNIKADEPLLEDGSKTVLHV 504
SYI+E D T+ GL EQ+ T D SDYLWY I +DE L +G L+V
Sbjct: 439 -SYIDEVASGFASDTTTEDGLWEQLYMTKDSSDYLWYMQDVKIGSDEAFLTNGKDPFLNV 497
Query: 505 QSLGHALHAFINGKLVGSGYGSSSNAKVTVDFPIALAPGKNTFDLLSLTVGLQNYGAFYE 564
QS GH L+ F+NGKL+GS YGS+ N K+T + L G N LLS +VGL N G +E
Sbjct: 498 QSAGHFLNVFVNGKLIGSAYGSNDNPKLTFSQSVKLNVGVNKIALLSASVGLANVGLHFE 557
Query: 565 KTGAGITGPVQLKGSGNGTNIDLSSQQWTYQTGLKGEELNFPS---GSSTQWDSKSTLPK 621
G+ GPV L G GT +D++ +W+Y+ G++GE+L + SS +W S L K
Sbjct: 558 NYNVGVLGPVTLTGLNQGT-VDMTKWKWSYKVGVQGEKLQLNTVAGSSSVEWVKGSMLAK 616
Query: 622 LQPLVWYKTTFDAPAGSEPVAIDFTGMGKGEAWVNGQSIGRYWPTYVSQNGGCTDSCNYR 681
QPL WYK+TF+AP G++PVA+D MGKG+ W+NGQ IGRYWP Y +Q G C C+Y
Sbjct: 617 KQPLTWYKSTFNAPEGNDPVALDMISMGKGQIWINGQGIGRYWPAYTAQ-GNC-GGCSYG 674
Query: 682 GAYSSNKCLKNCGKPSQSLYHVPRSWLKSSGNTLVLFEEIGGDPTKISFVTKQL 735
G ++ KCL CG+P+Q YHVPRSWLK +GN LV+FEE GGDPT IS V + L
Sbjct: 675 GYFTEKKCLTGCGQPTQRWYHVPRSWLKPTGNLLVVFEEWGGDPTGISMVKRTL 728
>gi|255543793|ref|XP_002512959.1| beta-galactosidase, putative [Ricinus communis]
gi|223547970|gb|EEF49462.1| beta-galactosidase, putative [Ricinus communis]
Length = 732
Score = 847 bits (2189), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 411/719 (57%), Positives = 514/719 (71%), Gaps = 19/719 (2%)
Query: 26 NVTYDHRAVVIGGKRRVLISGSIHYPRSTPEMWPDLIQKSKDGGLDVIETYVFWNLHEPV 85
NVTYD +A++I G++R+L SGSIHYPRSTP+MW LIQK+KDGGLDVI+TYVFWNLHEP
Sbjct: 27 NVTYDKKALIINGQKRILFSGSIHYPRSTPQMWEGLIQKAKDGGLDVIDTYVFWNLHEPS 86
Query: 86 RNQYNFEGRYDLVKFVKLVAEAGLYAHLRIGPYVCAEWNFGGFPLWLHFIPGIQFRTDNE 145
YNFEGR DLV+F+KLV +AGLY HLRIGPY+C EWNFGGFP+WL +IPG+ FRTDNE
Sbjct: 87 PGNYNFEGRNDLVQFIKLVHKAGLYVHLRIGPYICGEWNFGGFPVWLKYIPGMIFRTDNE 146
Query: 146 PFKAEMQRFTAKIVDMMKQEKLYASQGGPIILSQIENEYGNIDSAYGAAGKSYIKWAAGM 205
PFK +MQ+FT KIV MMK E+LY SQGGPIILSQIENEY D A+GAAG +Y+ WAA M
Sbjct: 147 PFKLQMQKFTQKIVQMMKDEQLYESQGGPIILSQIENEYEPEDKAFGAAGHAYMTWAAHM 206
Query: 206 ALSLDTGVPWVMCQQSDAPDPIINTCNGFYCDQFTPNSNNKPKMWTENWSGWFLSFGGAV 265
A+SL+TGVPWVMC++ DAPDP++NTCNGFYCD F+PN KP MWTE W+GWF FGG +
Sbjct: 207 AVSLNTGVPWVMCKEFDAPDPVVNTCNGFYCDYFSPNKAYKPTMWTEAWTGWFTDFGGPI 266
Query: 266 PYRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNFDRTSGGPFISTSYDYDAPLDEYGLIR 325
RPVEDLAFAVARF Q+GG+F NYYMYHGGTNF RT+GGPFI+TSYDYDAP+DEYGLIR
Sbjct: 267 HQRPVEDLAFAVARFIQKGGSFVNYYMYHGGTNFGRTAGGPFITTSYDYDAPIDEYGLIR 326
Query: 326 QPKWGHLKDLHKAIKLCEAALVATDPTYPSLGPNLEATVYKTGSGLCSAFLANIGTNSDV 385
QPK+GHLKDLHKAIKLCE AL+++DP +LG +A V+ + SG C+AFLAN +
Sbjct: 327 QPKYGHLKDLHKAIKLCERALLSSDPVVTTLGSYEQAHVFSSNSGDCAAFLANYNPKATA 386
Query: 386 TVKFNGNSYLLPAWSVSILPDCKNVVFNTAKINSVTLVPSFSRQSLQVAADSSDAIGSGW 445
V FN Y LP WSVSILPDCKNVVFNTA++ Q ++ ++A W
Sbjct: 387 KVTFNNMHYNLPPWSVSILPDCKNVVFNTAEVGV---------QPSKIQMLPTEARFLSW 437
Query: 446 SYINEPVGISKDDAF-TKPGLLEQINTTADQSDYLWYSLSTNIKADEPLLEDGSKTVLHV 504
++E + DD T GLLEQIN T D SDYLWY+ +I + E L+ G +L V
Sbjct: 438 EALSEDISSVDDDKIGTVAGLLEQINVTRDASDYLWYTTGVHISSSETFLDGGQPPILKV 497
Query: 505 QSLGHALHAFINGKLVGSGYGSSSNAKVTVDFPIA-LAPGKNTFDLLSLTVGLQNYGAFY 563
S GH +H F+NG+L GS YG+ N +++ + L G+N LLS+ VGL N G +
Sbjct: 498 ISAGHGIHVFVNGQLSGSVYGTRGNRRISFSGELKQLHAGRNRISLLSVAVGLPNNGPRF 557
Query: 564 EKTGAGITGPVQLKGSGNGTNIDLSSQQWTYQTGLKGEELNFPSGS---STQWDSKSTL- 619
E G+ GPV + G G DL+ Q+W+Y+ GLKGE+LN S + S W +S +
Sbjct: 558 ETWNTGVLGPVVIHGLDQGHR-DLTWQKWSYKVGLKGEDLNLGSPNSIPSINWMQESAMV 616
Query: 620 PKLQPLVWYKTTFDAPAGSEPVAIDFTGMGKGEAWVNGQSIGRYWPTYVSQNGGCTDSCN 679
+ QPL W++ FDAP G +P+A+D + M KG+ W+NG SIGRYW Y +G CT +C+
Sbjct: 617 AERQPLTWHRAFFDAPRGDDPLALDMSSMVKGQVWINGNSIGRYWTVYA--DGNCT-ACS 673
Query: 680 YRGAYSSNKCLKNCGKPSQSLYHVPRSWLKSSGNTLVLFEEIGGDPTKISFVTKQLGSS 738
Y G + + C CG+P+Q YH+PRS LK + N LV+FEEIGGD +KI V + + SS
Sbjct: 674 YSGTFRPSTCQFGCGQPTQKWYHIPRSLLKPTENLLVVFEEIGGDVSKIYLVKRLVTSS 732
>gi|115468642|ref|NP_001057920.1| Os06g0573600 [Oryza sativa Japonica Group]
gi|75112285|sp|Q5Z7L0.1|BGAL9_ORYSJ RecName: Full=Beta-galactosidase 9; Short=Lactase 9; Flags:
Precursor
gi|54291174|dbj|BAD61846.1| putative beta-galactosidase [Oryza sativa Japonica Group]
gi|113595960|dbj|BAF19834.1| Os06g0573600 [Oryza sativa Japonica Group]
Length = 715
Score = 846 bits (2185), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 408/706 (57%), Positives = 495/706 (70%), Gaps = 15/706 (2%)
Query: 28 TYDHRAVVIGGKRRVLISGSIHYPRSTPEMWPDLIQKSKDGGLDVIETYVFWNLHEPVRN 87
TYDHR++ I G+RR+LISGSIHYPRSTPEMWPDLIQK+KDGGLDVI+TYVFWN HEPV+
Sbjct: 23 TYDHRSLTINGQRRILISGSIHYPRSTPEMWPDLIQKAKDGGLDVIQTYVFWNGHEPVQG 82
Query: 88 QYNFEGRYDLVKFVKLVAEAGLYAHLRIGPYVCAEWNFGGFPLWLHFIPGIQFRTDNEPF 147
QY F RYDLV+FVKLV +AGLY +LRIGPYVCAEWN+GGFP+WL ++PGI FRTDN PF
Sbjct: 83 QYYFSDRYDLVRFVKLVKQAGLYVNLRIGPYVCAEWNYGGFPVWLKYVPGISFRTDNGPF 142
Query: 148 KAEMQRFTAKIVDMMKQEKLYASQGGPIILSQIENEYGNIDSAYGAAGKSYIKWAAGMAL 207
KA MQ F KIV MMK E L+ QGGPIIL+Q+ENEYG ++S G+ KSY+ WAA MA+
Sbjct: 143 KAAMQTFVEKIVSMMKSEGLFEWQGGPIILAQVENEYGPMESVMGSGAKSYVDWAAKMAV 202
Query: 208 SLDTGVPWVMCQQSDAPDPIINTCNGFYCDQFTPNSNNKPKMWTENWSGWFLSFGGAVPY 267
+ + GVPW+MC+Q DAPDP+INTCNGFYCD FTPNS NKP MWTE WSGWF +FGG VP
Sbjct: 203 ATNAGVPWIMCKQDDAPDPVINTCNGFYCDDFTPNSKNKPSMWTEAWSGWFTAFGGTVPQ 262
Query: 268 RPVEDLAFAVARFFQRGGTFQNYYMYHGGTNFDRTSGGPFISTSYDYDAPLDEYGLIRQP 327
RPVEDLAFAVARF Q+GG+F NYYMYHGGTNFDRT+GGPFI+TSYDYDAP+DEYGL+RQP
Sbjct: 263 RPVEDLAFAVARFIQKGGSFINYYMYHGGTNFDRTAGGPFIATSYDYDAPIDEYGLLRQP 322
Query: 328 KWGHLKDLHKAIKLCEAALVATDPTYPSLGPNLEATVYKTGSGLCSAFLANIGTNSDVTV 387
KWGHL +LHKAIK E ALVA DPT ++G +A V+++ SG C+AFL+N T++ V
Sbjct: 323 KWGHLTNLHKAIKQAETALVAGDPTVQNIGNYEKAYVFRSSSGDCAAFLSNFHTSAAARV 382
Query: 388 KFNGNSYLLPAWSVSILPDCKNVVFNTAKINSVTLVPSFSRQSLQVAADSSDAIGSGWSY 447
FNG Y LPAWS+S+LPDC+ V+NTA + + + A + A G W
Sbjct: 383 AFNGRRYDLPAWSISVLPDCRTAVYNTATVTAAS-----------SPAKMNPAGGFTWQS 431
Query: 448 INEPVGISKDDAFTKPGLLEQINTTADQSDYLWYSLSTNIKADEPLLEDGSKTVLHVQSL 507
E + AFTK GL+EQ++ T D+SDYLWY+ NI + E L+ G L V S
Sbjct: 432 YGEATNSLDETAFTKDGLVEQLSMTWDKSDYLWYTTYVNIDSGEQFLKSGQWPQLTVYSA 491
Query: 508 GHALHAFINGKLVGSGYGSSSNAKVTVDFPIALAPGKNTFDLLSLTVGLQNYGAFYEKTG 567
GH++ F+NG+ G+ YG K+T + + G N +LS VGL N G YE
Sbjct: 492 GHSVQVFVNGQYFGNAYGGYDGPKLTYSGYVKMWQGSNKISILSSAVGLPNVGTHYETWN 551
Query: 568 AGITGPVQLKGSGNGTNIDLSSQQWTYQTGLKGEELNFPSGSSTQWDSKSTLPKLQPLVW 627
G+ GPV L G G DLS Q+WTYQ GLKGE+L S S + QP+ W
Sbjct: 552 IGVLGPVTLSGLNEGKR-DLSKQKWTYQIGLKGEKLGVHSVSGSSSVEWGGAAGKQPVTW 610
Query: 628 YKTTFDAPAGSEPVAIDFTGMGKGEAWVNGQSIGRYWPTYVSQNGGCTDSCNYRGAYSSN 687
++ F+APAG PVA+D MGKG+AWVNG IGRYW S N G C+Y G YS
Sbjct: 611 HRAYFNAPAGGAPVALDLGSMGKGQAWVNGHLIGRYWSYKASGNCG---GCSYAGTYSEK 667
Query: 688 KCLKNCGKPSQSLYHVPRSWLKSSGNTLVLFEEIGGDPTKISFVTK 733
KC NCG SQ YHVPRSWL SGN +VL EE GGD + ++ +T+
Sbjct: 668 KCQANCGDASQRWYHVPRSWLNPSGNLVVLLEEFGGDLSGVTLMTR 713
>gi|79517234|ref|NP_568399.4| beta-galactosidase 7 [Arabidopsis thaliana]
gi|152013363|sp|Q9SCV5.2|BGAL7_ARATH RecName: Full=Beta-galactosidase 7; Short=Lactase 7; Flags:
Precursor
gi|332005497|gb|AED92880.1| beta-galactosidase 7 [Arabidopsis thaliana]
Length = 826
Score = 845 bits (2183), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 425/848 (50%), Positives = 549/848 (64%), Gaps = 50/848 (5%)
Query: 15 FVVLATTSFGAN--VTYDHRAVVIGGKRRVLISGSIHYPRSTPEMWPDLIQKSKDGGLDV 72
F+++ + S + V++D RA+ I GKRR+L+SGSIHYPRST +MWPDLI K+KDGGLD
Sbjct: 14 FILITSLSLAKSTIVSHDERAITINGKRRILLSGSIHYPRSTADMWPDLINKAKDGGLDA 73
Query: 73 IETYVFWNLHEPVRNQYNFEGRYDLVKFVKLVAEAGLYAHLRIGPYVCAEWNFGGFPLWL 132
IETYVFWN HEP R +Y+F G D+V+F+K + +AGLY+ LRIGPYVCAEWN+GGFP+WL
Sbjct: 74 IETYVFWNAHEPKRREYDFSGNLDVVRFIKTIQDAGLYSVLRIGPYVCAEWNYGGFPVWL 133
Query: 133 HFIPGIQFRTDNEPFKAEMQRFTAKIVDMMKQEKLYASQGGPIILSQIENEYGNIDSAYG 192
H +P ++FRT N F EMQ FT KIV MMK+EKL+ASQGGPIIL+QIENEYGN+ S+YG
Sbjct: 134 HNMPNMKFRTVNPSFMNEMQNFTTKIVKMMKEEKLFASQGGPIILAQIENEYGNVISSYG 193
Query: 193 AAGKSYIKWAAGMALSLDTGVPWVMCQQSDAPDPIINTCNGFYCDQFTPNSNNKPKMWTE 252
A GK+YI W A MA SLD GVPW+MCQQ +AP P++ TCNGFYCDQ+ P + + PKMWTE
Sbjct: 194 AEGKAYIDWCANMANSLDIGVPWLMCQQPNAPQPMLETCNGFYCDQYEPTNPSTPKMWTE 253
Query: 253 NWSGWFLSFGGAVPYRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNFDRTSGGPFISTSY 312
NW+GWF ++GG PYR EDLAF+VARFFQ GGTFQNYYMYHGGTNF R +GGP+I+TSY
Sbjct: 254 NWTGWFKNWGGKHPYRTAEDLAFSVARFFQTGGTFQNYYMYHGGTNFGRVAGGPYITTSY 313
Query: 313 DYDAPLDEYGLIRQPKWGHLKDLHKAIKLCEAALVATDPTYPSLGPNLEATVYKTGSGLC 372
DY APLDE+G + QPKWGHLK LH +K E +L + + LG +++AT+Y T G
Sbjct: 314 DYHAPLDEFGNLNQPKWGHLKQLHTVLKSMEKSLTYGNISRIDLGNSIKATIYTTKEG-S 372
Query: 373 SAFLANIGTNSDVTVKFNGNSYLLPAWSVSILPDCKNVVFNTAKINSVTLVPSFSRQSLQ 432
S F+ N+ +D V F G Y +PAWSVS+LPDC +NTAK+N+ T +
Sbjct: 373 SCFIGNVNATADALVNFKGKDYHVPAWSVSVLPDCDKEAYNTAKVNTQTSI--------- 423
Query: 433 VAADSSDAIGSGWSYINEPVG---ISKDDAFTKPGLLEQINTTADQSDYLWYSLSTNIKA 489
+ DSS W++ E + GL++Q + T D SDYLWY ++
Sbjct: 424 MTEDSSKPERLEWTWRPESAQKMILKGSGDLIAKGLVDQKDVTNDASDYLWYMTRLHLDK 483
Query: 490 DEPLLEDGSKTVLHVQSLGHALHAFINGKLVGSGYGSSSNAKVTVDFPI-ALAPGKNTFD 548
+PL L V S H LHA++NGK VG+ + + + L G N
Sbjct: 484 KDPLWS--RNMTLRVHSNAHVLHAYVNGKYVGNQFVKDGKFDYRFERKVNHLVHGTNHIS 541
Query: 549 LLSLTVGLQNYGAFYEKTGAGITGPVQLKGSGNGTNI--DLSSQQWTYQTGLKG---EEL 603
LLS++VGLQNYG F+E GI GPV L G I DLS QW Y+ GL G +
Sbjct: 542 LLSVSVGLQNYGPFFESGPTGINGPVSLVGYKGEETIEKDLSQHQWDYKIGLNGYNDKLF 601
Query: 604 NFPSGSSTQWDSKSTLPKLQPLVWYKTTFDAPAGSEPVAIDFTGMGKGEAWVNGQSIGRY 663
+ S +W + LP + L WYK F AP G EPV +D G+GKGEAW+NGQSIGRY
Sbjct: 602 SIKSVGHQKW-ANEKLPTGRMLTWYKAKFKAPLGKEPVIVDLNGLGKGEAWINGQSIGRY 660
Query: 664 WPTYVSQNGGCTDSCNYRGAYSSNKCLKNCGKPSQSLYHVPRSWLKSSG-NTLVLFEEIG 722
WP++ S + GC D C+YRGAY S+KC CGKP+Q YHVPRS+L +SG NT+ LFEE+G
Sbjct: 661 WPSFNSSDDGCKDECDYRGAYGSDKCAFMCGKPTQRWYHVPRSFLNASGHNTITLFEEMG 720
Query: 723 GDPTKISFVTKQLGSSLCSHVTDSHPLPVDMWGSDSKIQRKPGPVLSLECPNPNQVISSI 782
G+P+ ++F T +G ++C+ + + + L C N+ IS++
Sbjct: 721 GNPSMVNFKTVVVG-TVCARAHEHNK-------------------VELSC--HNRPISAV 758
Query: 783 KFASFGTPLGTCGSFSRGRCSSAR-SLSVVRQACVGSKSCSIGVSVNTFGDP--CKGVMK 839
KFASFG PLG CGSF+ G C + + V + CVG +C++ VS +TFG C K
Sbjct: 759 KFASFGNPLGHCGSFAVGTCQGDKDAAKTVAKECVGKLNCTVNVSSDTFGSTLDCGDSPK 818
Query: 840 SLAVEASC 847
LAVE C
Sbjct: 819 KLAVELEC 826
>gi|125555810|gb|EAZ01416.1| hypothetical protein OsI_23450 [Oryza sativa Indica Group]
Length = 717
Score = 845 bits (2182), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 408/706 (57%), Positives = 495/706 (70%), Gaps = 15/706 (2%)
Query: 28 TYDHRAVVIGGKRRVLISGSIHYPRSTPEMWPDLIQKSKDGGLDVIETYVFWNLHEPVRN 87
TYDHR++ I G+RR+LISGSIHYPRSTPEMWPDLIQK+KDGGLDVI+TYVFWN HEPV+
Sbjct: 25 TYDHRSLTINGQRRILISGSIHYPRSTPEMWPDLIQKAKDGGLDVIQTYVFWNGHEPVQG 84
Query: 88 QYNFEGRYDLVKFVKLVAEAGLYAHLRIGPYVCAEWNFGGFPLWLHFIPGIQFRTDNEPF 147
QY F RYDLV+FVKLV +AGLY +LRIGPYVCAEWN+GGFP+WL ++PGI FRTDN PF
Sbjct: 85 QYYFSDRYDLVRFVKLVKQAGLYVNLRIGPYVCAEWNYGGFPVWLKYVPGISFRTDNGPF 144
Query: 148 KAEMQRFTAKIVDMMKQEKLYASQGGPIILSQIENEYGNIDSAYGAAGKSYIKWAAGMAL 207
KA MQ F KIV MMK E L+ QGGPIIL+Q+ENEYG ++S G+ KSY+ WAA MA+
Sbjct: 145 KAAMQTFVEKIVSMMKSEGLFEWQGGPIILAQVENEYGPMESVMGSGAKSYVDWAAKMAV 204
Query: 208 SLDTGVPWVMCQQSDAPDPIINTCNGFYCDQFTPNSNNKPKMWTENWSGWFLSFGGAVPY 267
+ + GVPW+MC+Q DAPDP+INTCNGFYCD FTPNS NKP MWTE WSGWF +FGG VP
Sbjct: 205 ATNAGVPWIMCKQDDAPDPVINTCNGFYCDDFTPNSKNKPSMWTEAWSGWFTAFGGTVPQ 264
Query: 268 RPVEDLAFAVARFFQRGGTFQNYYMYHGGTNFDRTSGGPFISTSYDYDAPLDEYGLIRQP 327
RPVEDLAFAVARF Q+GG+F NYYMYHGGTNFDRT+GGPFI+TSYDYDAP+DEYGL+RQP
Sbjct: 265 RPVEDLAFAVARFIQKGGSFINYYMYHGGTNFDRTAGGPFIATSYDYDAPIDEYGLLRQP 324
Query: 328 KWGHLKDLHKAIKLCEAALVATDPTYPSLGPNLEATVYKTGSGLCSAFLANIGTNSDVTV 387
KWGHL +LHKAIK E ALVA DPT ++G +A V+++ SG C+AFL+N T++ V
Sbjct: 325 KWGHLTNLHKAIKQAEPALVAGDPTVQNIGNYEKAYVFRSSSGDCAAFLSNFHTSAAARV 384
Query: 388 KFNGNSYLLPAWSVSILPDCKNVVFNTAKINSVTLVPSFSRQSLQVAADSSDAIGSGWSY 447
FNG Y LPAWS+S+LPDC+ V+NTA + + + A + A G W
Sbjct: 385 AFNGRRYDLPAWSISVLPDCRTAVYNTATVTAAS-----------SPAKMNPAGGFTWQS 433
Query: 448 INEPVGISKDDAFTKPGLLEQINTTADQSDYLWYSLSTNIKADEPLLEDGSKTVLHVQSL 507
E + AFTK GL+EQ++ T D+SDYLWY+ NI + E L+ G L V S
Sbjct: 434 YGEATNSLDETAFTKDGLVEQLSMTWDKSDYLWYTTYVNIDSGEQFLKSGQWPQLTVYSA 493
Query: 508 GHALHAFINGKLVGSGYGSSSNAKVTVDFPIALAPGKNTFDLLSLTVGLQNYGAFYEKTG 567
GH++ F+NG+ G+ YG K+T + + G N +LS VGL N G YE
Sbjct: 494 GHSVQVFVNGQYFGNAYGGYDGPKLTYSGYVKMWQGSNKISILSSAVGLPNVGTHYETWN 553
Query: 568 AGITGPVQLKGSGNGTNIDLSSQQWTYQTGLKGEELNFPSGSSTQWDSKSTLPKLQPLVW 627
G+ GPV L G G DLS Q+WTYQ GLKGE+L S S + QP+ W
Sbjct: 554 IGVLGPVTLSGLNEGKR-DLSKQKWTYQIGLKGEKLGVHSVSGSSSVEWGGAAGKQPVTW 612
Query: 628 YKTTFDAPAGSEPVAIDFTGMGKGEAWVNGQSIGRYWPTYVSQNGGCTDSCNYRGAYSSN 687
++ F+APAG PVA+D MGKG+AWVNG IGRYW S N G C+Y G YS
Sbjct: 613 HRAYFNAPAGGAPVALDLGSMGKGQAWVNGHLIGRYWSYKASGNCG---GCSYAGTYSEK 669
Query: 688 KCLKNCGKPSQSLYHVPRSWLKSSGNTLVLFEEIGGDPTKISFVTK 733
KC NCG SQ YHVPRSWL SGN +VL EE GGD + ++ +T+
Sbjct: 670 KCQANCGDASQRWYHVPRSWLNPSGNLVVLLEEFGGDLSGVTLMTR 715
>gi|356522906|ref|XP_003530083.1| PREDICTED: beta-galactosidase-like [Glycine max]
Length = 846
Score = 844 bits (2180), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 427/861 (49%), Positives = 552/861 (64%), Gaps = 57/861 (6%)
Query: 8 LLVLCWGFVVLATTSFGANVTYDHRAVVIGGKRRVLISGSIHYPRSTPEMWPDLIQKSKD 67
+ +LC + +A + V+YD RA+ I GKRR+L SGSIHYPRSTPEMWP LI+K+K+
Sbjct: 11 MFLLCLSLISIAINAL--EVSYDERALTIDGKRRILFSGSIHYPRSTPEMWPYLIRKAKE 68
Query: 68 GGLDVIETYVFWNLHEPVRNQYNFEGRYDLVKFVKLVAEAGLYAHLRIGPYVCAEWNFGG 127
GGLDVIETYVFWN HEP R QY+F DLV+F++ + + GLYA +RIGPY+ +EWN+GG
Sbjct: 69 GGLDVIETYVFWNAHEPQRRQYDFSENLDLVRFIRTIQKEGLYAMIRIGPYISSEWNYGG 128
Query: 128 FPLWLHFIPGIQFRTDNEPFKAEMQRFTAKIVDMMKQEKLYASQGGPIILSQIENEYGNI 187
P+WLH IP ++FRT N F EM+ FT KIVDMM+ E L+A QGGPII++QIENEYGN+
Sbjct: 129 LPVWLHNIPNMEFRTHNRAFMEEMKTFTRKIVDMMQDETLFAVQGGPIIIAQIENEYGNV 188
Query: 188 DSAYGAAGKSYIKWAAGMALSLDTGVPWVMCQQSDAPDPIINTCNGFYCDQFTPNSNNKP 247
AYG G Y+KW A +A S +TGVPWVM QQS+AP +I++C+G+YCDQF PN N+KP
Sbjct: 189 MHAYGNNGTQYLKWCAQLADSFETGVPWVMSQQSNAPQFMIDSCDGYYCDQFQPNDNHKP 248
Query: 248 KMWTENWSGWFLSFGGAVPYRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNFDRTSGGPF 307
K+WTENW+G + ++G P+RP ED+A+AVARFFQ GGTFQNYYMYHGGTNF RT+GGP+
Sbjct: 249 KIWTENWTGGYKNWGTQNPHRPAEDVAYAVARFFQFGGTFQNYYMYHGGTNFKRTAGGPY 308
Query: 308 ISTSYDYDAPLDEYGLIRQPKWGHLKDLHKAIKLCEAALVATDPTYPSLGPNLEATVYKT 367
++TSYDYDAPLDEYG + QPKWGHL+ LH +K E L + G + ATVY T
Sbjct: 309 VTTSYDYDAPLDEYGNLNQPKWGHLRQLHNLLKSKENILTQGSSQHTDYGNMVTATVY-T 367
Query: 368 GSGLCSAFLANIGTNSDVTVKFNGNSYLLPAWSVSILPDCKNVVFNTAKINSVTLVPSFS 427
G + F+ N + D T+ F N Y +PAWSVSILP+C + +NTAK+N+ T
Sbjct: 368 YDGKSTCFIGNAHQSKDATINFRNNEYTIPAWSVSILPNCSSEAYNTAKVNTQT------ 421
Query: 428 RQSLQVAADSSD-AIGSGWSYINEPVGISKDDA------FTKPGLLEQINTTADQSDYLW 480
++ V D+ D W + EP KD T P LL+Q T D SDYLW
Sbjct: 422 --TIMVKKDNEDLEYALRWQWRQEPFVQMKDGQITGIIDLTAPKLLDQKVVTNDFSDYLW 479
Query: 481 YSLSTNIKADEPLLEDGSKT---VLHVQSLGHALHAFINGKLVGSGYGSSSNAKVTVDFP 537
Y S +IK D +D S T L V + GH LH F+NGK VG+ + + K +
Sbjct: 480 YITSIDIKGD----DDPSWTKEFRLRVHTSGHVLHVFVNGKHVGTQHAKNGQFKFVHESK 535
Query: 538 IALAPGKNTFDLLSLTVGLQNYGAFYEKTGAGITGPVQLKGSGNGTNI-------DLSSQ 590
I L GKN LLS TVGL NYG F++ G+ GPVQL + + DLS
Sbjct: 536 IKLTTGKNEISLLSTTVGLPNYGPFFDNIEVGVLGPVQLVAAVGDYDYDDDEIVKDLSKN 595
Query: 591 QWTYQTGLKGE-ELNFPSGSSTQWDSKSTLPKLQPLVWYKTTFDAPAGSEPVAIDFTGMG 649
QW+Y+ GL GE E+++ +S + +P + LVWYKTTF +P G +PV +D +G+G
Sbjct: 596 QWSYKVGLHGEHEMHYSYENSLKTWYTDAVPTDRILVWYKTTFKSPIGDDPVVVDLSGLG 655
Query: 650 KGEAWVNGQSIGRYWPTYVSQNGGCTDSCNYRGAYSSNKCLKNCGKPSQSLYHVPRSWLK 709
KG AWVNG SIGRYW +Y++ GC+ C+YRG Y+SNKCL C +PSQ YHVPRS+L+
Sbjct: 656 KGHAWVNGNSIGRYWSSYLADENGCSPKCDYRGPYTSNKCLSMCAQPSQRWYHVPRSFLR 715
Query: 710 SSG-NTLVLFEEIGGDPTKISFVTKQLGSSLCSHVTDSHPLPVDMWGSDSKIQRKPGPVL 768
NTLVLFEE+GG P ++F+T +G +C++ + G L
Sbjct: 716 DDDQNTLVLFEELGGQPYYVNFLTVTVG-KVCANAYE-------------------GNTL 755
Query: 769 SLECPNPNQVISSIKFASFGTPLGTCGSFSRGRCSSARSLSVVRQACVGSKSCSIGVSVN 828
L C N NQVIS IKFASFG P G CGSF +G C S+ +LS ++ C+G CSI VS
Sbjct: 756 ELAC-NKNQVISEIKFASFGLPKGECGSFQKGNCESSEALSAIKAQCIGKDKCSIQVSER 814
Query: 829 TFG-DPCK-GVMKSLAVEASC 847
G C+ + LAVEA C
Sbjct: 815 ALGPTRCRVAEDRRLAVEAVC 835
>gi|350538173|ref|NP_001234842.1| ss-galactosidase precursor [Solanum lycopersicum]
gi|4138141|emb|CAA10175.1| ss-galactosidase [Solanum lycopersicum]
Length = 724
Score = 843 bits (2179), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 407/717 (56%), Positives = 511/717 (71%), Gaps = 17/717 (2%)
Query: 21 TSFGANVTYDHRAVVIGGKRRVLISGSIHYPRSTPEMWPDLIQKSKDGGLDVIETYVFWN 80
+S A+V+YD RA++I GKR++LISGSIHYPRSTP+MWPDLIQK+KDGGLDVIETYVFWN
Sbjct: 19 SSVKASVSYDDRAIIINGKRKILISGSIHYPRSTPQMWPDLIQKAKDGGLDVIETYVFWN 78
Query: 81 LHEPVRNQYNFEGRYDLVKFVKLVAEAGLYAHLRIGPYVCAEWNFGGFPLWLHFIPGIQF 140
H P +YNFEGRYDLV+F+K+V AGLY +LRIGPYVCAEWNFGGFP+WL ++PG++F
Sbjct: 79 GHGPSPGKYNFEGRYDLVRFIKMVQRAGLYVNLRIGPYVCAEWNFGGFPVWLKYVPGMEF 138
Query: 141 RTDNEPFKAEMQRFTAKIVDMMKQEKLYASQGGPIILSQIENEYGNIDSAYGAAGKSYIK 200
RT+N+PFK M+ F KIV+MMK E L+ SQGGPII++QIENEYG ++ GA GK+Y K
Sbjct: 139 RTNNQPFKVAMRGFVQKIVNMMKSENLFESQGGPIIMAQIENEYGPVEWEIGAPGKAYTK 198
Query: 201 WAAGMALSLDTGVPWVMCQQSDAPDPIINTCNGFYCDQFTPNSNNKPKMWTENWSGWFLS 260
WAA MA+ L TGVPW+MC+Q DAPDP+I+TCNGFYC+ F PN KPKMWTE W+GW+
Sbjct: 199 WAAQMAVGLKTGVPWIMCKQEDAPDPVIDTCNGFYCEGFRPNKPYKPKMWTEVWTGWYTK 258
Query: 261 FGGAVPYRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNFDRTSGGPFISTSYDYDAPLDE 320
FGG +P RP ED+AF+VARF Q G+F NYYMYHGGTNF RTS G FI+TSYDYDAPLDE
Sbjct: 259 FGGPIPQRPAEDIAFSVARFVQNNGSFFNYYMYHGGTNFGRTSSGLFIATSYDYDAPLDE 318
Query: 321 YGLIRQPKWGHLKDLHKAIKLCEAALVATDPTYPSLGPNLEATVYKTGSGLCSAFLANIG 380
YGL+ +PK+GHL+DLHKAIKL E ALV++ SLG N EA VY++ SG C+AFL+N
Sbjct: 319 YGLLNEPKYGHLRDLHKAIKLSEPALVSSYAAVTSLGSNQEAHVYRSKSGACAAFLSNYD 378
Query: 381 TNSDVTVKFNGNSYLLPAWSVSILPDCKNVVFNTAKINSVTLVPSFSRQSLQVAADSSDA 440
+ V V F Y LP WS+SILPDCK V+NTA++NS QS + +
Sbjct: 379 SRYSVKVTFQNRPYNLPPWSISILPDCKTAVYNTAQVNS---------QSSSIKMTPAGG 429
Query: 441 IGSGWSYINEPVGISKD-DAFTKPGLLEQINTTADQSDYLWYSLSTNIKADEPLLEDGSK 499
G W NE + D D T GL EQ N T D SDYLWY + NI ++E L++G
Sbjct: 430 -GLSWQSYNEETPTADDSDTLTANGLWEQKNVTRDSSDYLWYMTNVNIASNEGFLKNGKD 488
Query: 500 TVLHVQSLGHALHAFINGKLVGSGYGSSSNAKVTVDFPIALAPGKNTFDLLSLTVGLQNY 559
L V S GH LH F+NGKL G+ YG+ N K+T + L G N LLS++VGL N
Sbjct: 489 PYLTVMSAGHVLHVFVNGKLSGTVYGTLDNPKLTYSGNVKLRAGINKISLLSVSVGLPNV 548
Query: 560 GAFYEKTGAGITGPVQLKGSGNGTNIDLSSQQWTYQTGLKGE---ELNFPSGSSTQWDSK 616
G Y+ AG+ GPV L G G+ +L+ Q+W+Y+ GLKGE + SS +W
Sbjct: 549 GVHYDTWNAGVLGPVTLSGLNEGSR-NLAKQKWSYKVGLKGESLSLHSLSGSSSVEWVRG 607
Query: 617 STLPKLQPLVWYKTTFDAPAGSEPVAIDFTGMGKGEAWVNGQSIGRYWPTYVSQNGGCTD 676
S + + QPL WYK TF+AP G++P+A+D MGKG+ W+NG+ +GR+WP Y++Q G C+
Sbjct: 608 SLVAQKQPLTWYKATFNAPGGNDPLALDMASMGKGQIWINGEGVGRHWPGYIAQ-GDCS- 665
Query: 677 SCNYRGAYSSNKCLKNCGKPSQSLYHVPRSWLKSSGNTLVLFEEIGGDPTKISFVTK 733
C+Y G ++ KC NCG+PSQ YHVPRSWLK SGN LV+FEE GG+PT IS V +
Sbjct: 666 KCSYAGTFNEKKCQTNCGQPSQRWYHVPRSWLKPSGNLLVVFEEWGGNPTGISLVRR 722
>gi|356522904|ref|XP_003530082.1| PREDICTED: beta-galactosidase-like [Glycine max]
Length = 923
Score = 843 bits (2178), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 427/860 (49%), Positives = 550/860 (63%), Gaps = 57/860 (6%)
Query: 9 LVLCWGFVVLATTSFGANVTYDHRAVVIGGKRRVLISGSIHYPRSTPEMWPDLIQKSKDG 68
+LC + +A + V+YD RA+ I GKRR+L S SIHYPRSTPEMWP LI+K+K+G
Sbjct: 12 FLLCLSLISIAINAL--EVSYDERALTIDGKRRILFSASIHYPRSTPEMWPYLIRKAKEG 69
Query: 69 GLDVIETYVFWNLHEPVRNQYNFEGRYDLVKFVKLVAEAGLYAHLRIGPYVCAEWNFGGF 128
GLDVIETYVFWN HEP R QY F DLV+F++ + + GLYA +RIGPY+ +EWN+GG
Sbjct: 70 GLDVIETYVFWNAHEPQRRQYEFSENLDLVRFIRTIQKEGLYAMIRIGPYISSEWNYGGL 129
Query: 129 PLWLHFIPGIQFRTDNEPFKAEMQRFTAKIVDMMKQEKLYASQGGPIILSQIENEYGNID 188
P+WLH IP ++FRT N F EM+ FT KIVDMM+ E L+A QGGPII++QIENEYGN+
Sbjct: 130 PVWLHNIPNMEFRTHNRAFMEEMKTFTTKIVDMMQDETLFAVQGGPIIIAQIENEYGNVM 189
Query: 189 SAYGAAGKSYIKWAAGMALSLDTGVPWVMCQQSDAPDPIINTCNGFYCDQFTPNSNNKPK 248
AYG G Y+KW A +A S +TGVPWVM QQS+AP +I++C+G+YCDQF PN N+KPK
Sbjct: 190 HAYGNNGTQYLKWCAQLADSFETGVPWVMSQQSNAPQFMIDSCDGYYCDQFQPNDNHKPK 249
Query: 249 MWTENWSGWFLSFGGAVPYRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNFDRTSGGPFI 308
+WTENW+G + ++G P+RP ED+A+AVARFFQ GGTFQNYYMYHGGTNF RT+GGP++
Sbjct: 250 IWTENWTGGYKNWGTQNPHRPAEDVAYAVARFFQFGGTFQNYYMYHGGTNFKRTAGGPYV 309
Query: 309 STSYDYDAPLDEYGLIRQPKWGHLKDLHKAIKLCEAALVATDPTYPSLGPNLEATVYKTG 368
+TSYDYDAPLDEYG + QPKWGHL+ LH +K E L G + ATVY T
Sbjct: 310 TTSYDYDAPLDEYGNLNQPKWGHLRQLHNLLKSKENILTQGSSQNTDYGNMVTATVY-TY 368
Query: 369 SGLCSAFLANIGTNSDVTVKFNGNSYLLPAWSVSILPDCKNVVFNTAKINSVTLVPSFSR 428
G + F+ N + D T+ F N Y +PAWSVSILP+C + +NTAK+N+ T
Sbjct: 369 DGKSTCFIGNAHQSKDATINFRNNEYTIPAWSVSILPNCSSEAYNTAKVNTQT------- 421
Query: 429 QSLQVAADSSD-AIGSGWSYINEPVGISKDDA------FTKPGLLEQINTTADQSDYLWY 481
++ V D+ D W + EP KD T P LL+Q T D SDYLWY
Sbjct: 422 -TIMVKKDNEDLEYALRWQWRQEPFVQMKDGQITGIIDLTAPKLLDQKVVTNDFSDYLWY 480
Query: 482 SLSTNIKADEPLLEDGSKT---VLHVQSLGHALHAFINGKLVGSGYGSSSNAKVTVDFPI 538
S +IK D +D S T L V + GH LH F+NGK VG+ + + K + I
Sbjct: 481 ITSIDIKGD----DDPSWTKEFRLRVHTSGHVLHVFVNGKHVGTQHAKNGQFKFVHESKI 536
Query: 539 ALAPGKNTFDLLSLTVGLQNYGAFYEKTGAGITGPVQLKGSGNGTNI-------DLSSQQ 591
L GKN LLS TVGL NYG F++ G+ GPVQL + + DLS Q
Sbjct: 537 KLTTGKNEISLLSTTVGLPNYGPFFDNIEVGVLGPVQLVAAVGDYDYDDDEIVKDLSKNQ 596
Query: 592 WTYQTGLKGE-ELNFPSGSSTQWDSKSTLPKLQPLVWYKTTFDAPAGSEPVAIDFTGMGK 650
W+Y+ GL GE E+++ +S + +P + LVWYKTTF +P G +PV +D +G+GK
Sbjct: 597 WSYKVGLHGEHEMHYSYENSLKTWYTDAVPTDRILVWYKTTFKSPIGDDPVVVDLSGLGK 656
Query: 651 GEAWVNGQSIGRYWPTYVSQNGGCTDSCNYRGAYSSNKCLKNCGKPSQSLYHVPRSWLKS 710
G AWVNG SIGRYW +Y++ GC+ C+YRG Y+SNKCL C +PSQ YHVPRS+L+
Sbjct: 657 GHAWVNGNSIGRYWSSYLADENGCSPKCDYRGPYTSNKCLSMCAQPSQRWYHVPRSFLRD 716
Query: 711 SG-NTLVLFEEIGGDPTKISFVTKQLGSSLCSHVTDSHPLPVDMWGSDSKIQRKPGPVLS 769
+ NTLVLFEE+GG P ++F+T +G +C++ + G L
Sbjct: 717 NDQNTLVLFEELGGQPYYVNFLTVTVG-KVCANAYE-------------------GNTLE 756
Query: 770 LECPNPNQVISSIKFASFGTPLGTCGSFSRGRCSSARSLSVVRQACVGSKSCSIGVSVNT 829
L C N NQVIS IKFASFG P G CGSF +G C S+ +LS ++ C+G CSI VS T
Sbjct: 757 LAC-NKNQVISEIKFASFGLPKGECGSFQKGNCESSEALSAIKAQCIGKDKCSIQVSERT 815
Query: 830 FG-DPCK-GVMKSLAVEASC 847
G C+ + LAVEA C
Sbjct: 816 LGPTRCRVAEDRRLAVEAVC 835
>gi|125581329|gb|EAZ22260.1| hypothetical protein OsJ_05915 [Oryza sativa Japonica Group]
Length = 754
Score = 843 bits (2177), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 408/703 (58%), Positives = 494/703 (70%), Gaps = 23/703 (3%)
Query: 25 ANVTYDHRAVVIGGKRRVLISGSIHYPRSTPEMWPDLIQKSKDGGLDVIETYVFWNLHEP 84
A V+YD R++VI G+RR+L+SGSIHYPRSTPEMWP LIQK+KDGGLDVI+TYVFWN HEP
Sbjct: 36 AAVSYDRRSLVINGRRRILLSGSIHYPRSTPEMWPGLIQKAKDGGLDVIQTYVFWNGHEP 95
Query: 85 VRNQYNFEGRYDLVKFVKLVAEAGLYAHLRIGPYVCAEWNFGGFPLWLHFIPGIQFRTDN 144
V+ QY F RYDLV+FVKLV +AGLY HLRIGPYVCAEWNFGGFP+WL ++PG+ FRTDN
Sbjct: 96 VQGQYYFSDRYDLVRFVKLVKQAGLYVHLRIGPYVCAEWNFGGFPVWLKYVPGVSFRTDN 155
Query: 145 EPFKAEMQRFTAKIVDMMKQEKLYASQGGPIILSQIENEYGNIDSAYGAAGKSYIKWAAG 204
PFKAEMQ+F KIV MMK E L+ QGGPII+SQ+ENE+G ++S G+ K Y WAA
Sbjct: 156 GPFKAEMQKFVEKIVSMMKSEGLFEWQGGPIIMSQVENEFGPMESVGGSGAKPYANWAAK 215
Query: 205 MALSLDTGVPWVMCQQSDAPDPIINTCNGFYCDQFTPNSNNKPKMWTENWSGWFLSFGGA 264
MA+ +TGVPWVMC+Q DAPDP+INTCNGFYCD F+PN N KP MWTE W+GWF SFGG
Sbjct: 216 MAVGTNTGVPWVMCKQDDAPDPVINTCNGFYCDYFSPNKNYKPSMWTEAWTGWFTSFGGG 275
Query: 265 VPYRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNFDRTSGGPFISTSYDYDAPLDEYGLI 324
VP+RPVEDLAFAVARF Q+GG+F NYYMYHGGTNF RT+GGPFI+TSYDYDAP+DE+GL+
Sbjct: 276 VPHRPVEDLAFAVARFIQKGGSFVNYYMYHGGTNFGRTAGGPFIATSYDYDAPIDEFGLL 335
Query: 325 RQPKWGHLKDLHKAIKLCEAALVATDPTYPSLGPNLEATVYKTGSGLCSAFLANIGTNSD 384
RQPKWGHL+DLH+AIK E LV+ DPT S+G +A V+K +G C+AFL+N N+
Sbjct: 336 RQPKWGHLRDLHRAIKQAEPVLVSADPTIESIGSYEKAYVFKAKNGACAAFLSNYHMNTA 395
Query: 385 VTVKFNGNSYLLPAWSVSILPDCKNVVFNTAKINSVTLVPSFSRQSLQVAADSSDAIGSG 444
V V+FNG Y LPAWS+SILPDCK VFNTA + TL+P + +
Sbjct: 396 VKVRFNGQQYNLPAWSISILPDCKTAVFNTATVKEPTLMPKM-----------NPVVRFA 444
Query: 445 WSYINEPVGISKDDAFTKPGLLEQINTTADQSDYLWYSLSTNIKADEPLLEDGSKTVLHV 504
W +E D AFTK GL+EQ++ T D+SDYLWY+ NI ++ L G L V
Sbjct: 445 WQSYSEDTNSLSDSAFTKDGLVEQLSMTWDKSDYLWYTTYVNIGTND--LRSGQSPQLTV 502
Query: 505 QSLGHALHAFINGKLVGSGYGSSSNAKVTVDFPIALAPGKNTFDLLSLTVGLQNYGAFYE 564
S GH++ F+NGK GS YG N K+T + + + G N +LS VGL N G +E
Sbjct: 503 YSAGHSMQVFVNGKSYGSVYGGYDNPKLTYNGRVKMWQGSNKISILSSAVGLPNVGNHFE 562
Query: 565 KTGAGITGPVQLKGSGNGTNIDLSSQQWTYQTGLKGEELNFPS---GSSTQWDSKSTLPK 621
G+ GPV L S NG DLS Q+WTYQ GLKGE L + S+ +W
Sbjct: 563 NWNVGVLGPVTLS-SLNGGTKDLSHQKWTYQVGLKGETLGLQTVTGSSAVEWGGPG---G 618
Query: 622 LQPLVWYKTTFDAPAGSEPVAIDFTGMGKGEAWVNGQSIGRYWPTYVSQNGGCTDSCNYR 681
QPL W+K F+APAG++PVA+D MGKG+ WVNG +GRYW S GGC C+Y
Sbjct: 619 YQPLTWHKAFFNAPAGNDPVALDMGSMGKGQLWVNGHHVGRYWSYKAS--GGC-GGCSYA 675
Query: 682 GAYSSNKCLKNCGKPSQSLYHVPRSWLKSSGNTLVLFEEIGGD 724
G Y +KC NCG SQ YHVPRSWLK GN LV+ EE G +
Sbjct: 676 GTYHEDKCRSNCGDLSQRWYHVPRSWLKPGGNLLVVLEEYGAN 718
>gi|15241969|ref|NP_200498.1| beta-galactosidase 4 [Arabidopsis thaliana]
gi|75265636|sp|Q9SCV8.1|BGAL4_ARATH RecName: Full=Beta-galactosidase 4; Short=Lactase 4; Flags:
Precursor
gi|6686880|emb|CAB64740.1| putative beta-galactosidase [Arabidopsis thaliana]
gi|8809655|dbj|BAA97206.1| beta-galactosidase [Arabidopsis thaliana]
gi|332009434|gb|AED96817.1| beta-galactosidase 4 [Arabidopsis thaliana]
Length = 724
Score = 843 bits (2177), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 417/732 (56%), Positives = 516/732 (70%), Gaps = 23/732 (3%)
Query: 6 ILLLVLCWGFVVLATTSFGANVTYDHRAVVIGGKRRVLISGSIHYPRSTPEMWPDLIQKS 65
I L +LC + + A+V+YD +AV+I G+RR+L+SGSIHYPRSTPEMWP LIQK+
Sbjct: 11 IFLAILC---CLSLSCIVKASVSYDRKAVIINGQRRILLSGSIHYPRSTPEMWPGLIQKA 67
Query: 66 KDGGLDVIETYVFWNLHEPVRNQYNFEGRYDLVKFVKLVAEAGLYAHLRIGPYVCAEWNF 125
K+GGLDVIETYVFWN HEP QY F RYDLVKF+KLV +AGLY +LRIGPYVCAEWNF
Sbjct: 68 KEGGLDVIETYVFWNGHEPSPGQYYFGDRYDLVKFIKLVHQAGLYVNLRIGPYVCAEWNF 127
Query: 126 GGFPLWLHFIPGIQFRTDNEPFKAEMQRFTAKIVDMMKQEKLYASQGGPIILSQIENEYG 185
GGFP+WL F+PG+ FRTDNEPFKA M++FT KIV MMK EKL+ +QGGPIIL+QIENEYG
Sbjct: 128 GGFPVWLKFVPGMAFRTDNEPFKAAMKKFTEKIVWMMKAEKLFQTQGGPIILAQIENEYG 187
Query: 186 NIDSAYGAAGKSYIKWAAGMALSLDTGVPWVMCQQSDAPDPIINTCNGFYCDQFTPNSNN 245
++ GA GK+Y KW A MAL L TGVPW+MC+Q DAP PII+TCNG+YC+ F PNS N
Sbjct: 188 PVEWEIGAPGKAYTKWVAQMALGLSTGVPWIMCKQEDAPGPIIDTCNGYYCEDFKPNSIN 247
Query: 246 KPKMWTENWSGWFLSFGGAVPYRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNFDRTSGG 305
KPKMWTENW+GW+ FGGAVPYRPVED+A++VARF Q+GG+ NYYMYHGGTNFDRT+ G
Sbjct: 248 KPKMWTENWTGWYTDFGGAVPYRPVEDIAYSVARFIQKGGSLVNYYMYHGGTNFDRTA-G 306
Query: 306 PFISTSYDYDAPLDEYGLIRQPKWGHLKDLHKAIKLCEAALVATDPTYPSLGPNLEATVY 365
F+++SYDYDAPLDEYGL R+PK+ HLK LHKAIKL E AL++ D T SLG EA V+
Sbjct: 307 EFMASSYDYDAPLDEYGLPREPKYSHLKALHKAIKLSEPALLSADATVTSLGAKQEAYVF 366
Query: 366 KTGSGLCSAFLANIGTNSDVTVKFNGNSYLLPAWSVSILPDCKNVVFNTAKINSVTLVPS 425
+ S C+AFL+N NS V F G Y LP WSVSILPDCK V+NTAK+N+ PS
Sbjct: 367 WSKSS-CAAFLSNKDENSAARVLFRGFPYDLPPWSVSILPDCKTEVYNTAKVNA----PS 421
Query: 426 FSRQSLQVAADSSDAIGSGWSYINEPVGISKD-DAFTKPGLLEQINTTADQSDYLWYSLS 484
R + S W NE + + F + GL+EQI+ T D+SDY WY
Sbjct: 422 VHRNMVPTGTKFS------WGSFNEATPTANEAGTFARNGLVEQISMTWDKSDYFWYITD 475
Query: 485 TNIKADEPLLEDGSKTVLHVQSLGHALHAFINGKLVGSGYGSSSNAKVTVDFPIALAPGK 544
I + E L+ G +L V S GHALH F+NG+L G+ YG + K+T I L G
Sbjct: 476 ITIGSGETFLKTGDSPLLTVMSAGHALHVFVNGQLSGTAYGGLDHPKLTFSQKIKLHAGV 535
Query: 545 NTFDLLSLTVGLQNYGAFYEKTGAGITGPVQLKGSGNGTNIDLSSQQWTYQTGLKGEELN 604
N LLS+ VGL N G +E+ G+ GPV LKG +GT D+S +W+Y+ G+KGE L+
Sbjct: 536 NKIALLSVAVGLPNVGTHFEQWNKGVLGPVTLKGVNSGT-WDMSKWKWSYKIGVKGEALS 594
Query: 605 FPSG---SSTQWDSKSTLPKLQPLVWYKTTFDAPAGSEPVAIDFTGMGKGEAWVNGQSIG 661
+ S +W S + K QPL WYK+TF PAG+EP+A+D MGKG+ W+NG++IG
Sbjct: 595 LHTNTESSGVRWTQGSFVAKKQPLTWYKSTFATPAGNEPLALDMNTMGKGQVWINGRNIG 654
Query: 662 RYWPTYVSQNGGCTDSCNYRGAYSSNKCLKNCGKPSQSLYHVPRSWLKSSGNTLVLFEEI 721
R+WP Y +Q G CNY G + + KCL NCG+ SQ YHVPRSWLKS N +V+FEE+
Sbjct: 655 RHWPAYKAQ--GSCGRCNYAGTFDAKKCLSNCGEASQRWYHVPRSWLKSQ-NLIVVFEEL 711
Query: 722 GGDPTKISFVTK 733
GGDP IS V +
Sbjct: 712 GGDPNGISLVKR 723
>gi|15451018|gb|AAK96780.1| beta-galactosidase [Arabidopsis thaliana]
gi|17978799|gb|AAL47393.1| beta-galactosidase [Arabidopsis thaliana]
Length = 724
Score = 842 bits (2176), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 417/732 (56%), Positives = 516/732 (70%), Gaps = 23/732 (3%)
Query: 6 ILLLVLCWGFVVLATTSFGANVTYDHRAVVIGGKRRVLISGSIHYPRSTPEMWPDLIQKS 65
I L +LC + + A+V+YD +AV+I G+RR+L+SGSIHYPRSTPEMWP LIQK+
Sbjct: 11 IFLAILC---CLSLSCIVKASVSYDRKAVIINGQRRILLSGSIHYPRSTPEMWPGLIQKA 67
Query: 66 KDGGLDVIETYVFWNLHEPVRNQYNFEGRYDLVKFVKLVAEAGLYAHLRIGPYVCAEWNF 125
K+GGLDVIETYVFWN HEP QY F RYDLVKF+KLV +AGLY +LRIGPYVCAEWNF
Sbjct: 68 KEGGLDVIETYVFWNGHEPSPGQYYFGDRYDLVKFIKLVHQAGLYVNLRIGPYVCAEWNF 127
Query: 126 GGFPLWLHFIPGIQFRTDNEPFKAEMQRFTAKIVDMMKQEKLYASQGGPIILSQIENEYG 185
GGFP+WL F+PG+ FRTDNEPFKA M++FT KIV MMK EKL+ +QGGPIIL+QIENEYG
Sbjct: 128 GGFPVWLKFVPGMAFRTDNEPFKAAMKKFTEKIVWMMKAEKLFQTQGGPIILAQIENEYG 187
Query: 186 NIDSAYGAAGKSYIKWAAGMALSLDTGVPWVMCQQSDAPDPIINTCNGFYCDQFTPNSNN 245
++ GA GK+Y KW A MAL L TGVPW+MC+Q DAP PII+TCNG+YC+ F PNS N
Sbjct: 188 PVEWEIGAPGKAYTKWVAQMALGLSTGVPWIMCKQEDAPGPIIDTCNGYYCEDFKPNSIN 247
Query: 246 KPKMWTENWSGWFLSFGGAVPYRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNFDRTSGG 305
KPKMWTENW+GW+ FGGAVPYRPVED+A++VARF Q+GG+ NYYMYHGGTNFDRT+ G
Sbjct: 248 KPKMWTENWTGWYTDFGGAVPYRPVEDIAYSVARFIQKGGSLINYYMYHGGTNFDRTA-G 306
Query: 306 PFISTSYDYDAPLDEYGLIRQPKWGHLKDLHKAIKLCEAALVATDPTYPSLGPNLEATVY 365
F+++SYDYDAPLDEYGL R+PK+ HLK LHKAIKL E AL++ D T SLG EA V+
Sbjct: 307 EFMASSYDYDAPLDEYGLPREPKYSHLKALHKAIKLSEPALLSADATVTSLGAKQEAYVF 366
Query: 366 KTGSGLCSAFLANIGTNSDVTVKFNGNSYLLPAWSVSILPDCKNVVFNTAKINSVTLVPS 425
+ S C+AFL+N NS V F G Y LP WSVSILPDCK V+NTAK+N+ PS
Sbjct: 367 WSKSS-CAAFLSNKDENSAARVLFRGFPYDLPPWSVSILPDCKTEVYNTAKVNA----PS 421
Query: 426 FSRQSLQVAADSSDAIGSGWSYINEPVGISKD-DAFTKPGLLEQINTTADQSDYLWYSLS 484
R + S W NE + + F + GL+EQI+ T D+SDY WY
Sbjct: 422 VHRNMVPTGTKFS------WGSFNEATPTANEAGTFARNGLVEQISMTWDKSDYFWYITD 475
Query: 485 TNIKADEPLLEDGSKTVLHVQSLGHALHAFINGKLVGSGYGSSSNAKVTVDFPIALAPGK 544
I + E L+ G +L V S GHALH F+NG+L G+ YG + K+T I L G
Sbjct: 476 ITIGSGETFLKTGDSPLLTVMSAGHALHVFVNGQLSGTAYGGLDHPKLTFSQKIKLHAGV 535
Query: 545 NTFDLLSLTVGLQNYGAFYEKTGAGITGPVQLKGSGNGTNIDLSSQQWTYQTGLKGEELN 604
N LLS+ VGL N G +E+ G+ GPV LKG +GT D+S +W+Y+ G+KGE L+
Sbjct: 536 NKIALLSVAVGLPNVGTHFEQWNKGVLGPVTLKGVNSGT-WDMSKWKWSYKIGVKGEALS 594
Query: 605 FPSG---SSTQWDSKSTLPKLQPLVWYKTTFDAPAGSEPVAIDFTGMGKGEAWVNGQSIG 661
+ S +W S + K QPL WYK+TF PAG+EP+A+D MGKG+ W+NG++IG
Sbjct: 595 LHTNTESSGVRWTQGSFVAKKQPLTWYKSTFATPAGNEPLALDMNTMGKGQVWINGRNIG 654
Query: 662 RYWPTYVSQNGGCTDSCNYRGAYSSNKCLKNCGKPSQSLYHVPRSWLKSSGNTLVLFEEI 721
R+WP Y +Q G CNY G + + KCL NCG+ SQ YHVPRSWLKS N +V+FEE+
Sbjct: 655 RHWPAYKAQ--GSCGRCNYAGTFDAKKCLSNCGEASQRWYHVPRSWLKSQ-NLIVVFEEL 711
Query: 722 GGDPTKISFVTK 733
GGDP IS V +
Sbjct: 712 GGDPNGISLVKR 723
>gi|308550950|gb|ADO34789.1| beta-galactosidase STBG4 [Solanum lycopersicum]
Length = 724
Score = 842 bits (2176), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 406/717 (56%), Positives = 510/717 (71%), Gaps = 17/717 (2%)
Query: 21 TSFGANVTYDHRAVVIGGKRRVLISGSIHYPRSTPEMWPDLIQKSKDGGLDVIETYVFWN 80
+S A+V+YD RA++I GKR++LISGSIHYPRSTP+MWPDLIQK+KDGGLDVIETYVFWN
Sbjct: 19 SSVKASVSYDDRAIIINGKRKILISGSIHYPRSTPQMWPDLIQKAKDGGLDVIETYVFWN 78
Query: 81 LHEPVRNQYNFEGRYDLVKFVKLVAEAGLYAHLRIGPYVCAEWNFGGFPLWLHFIPGIQF 140
HEP +YNFEGRYDLV+F+K+V AGLY +LRIGPYVCAEWNFGGFP+WL ++PG++F
Sbjct: 79 GHEPSPGKYNFEGRYDLVRFIKMVQRAGLYVNLRIGPYVCAEWNFGGFPVWLKYVPGMEF 138
Query: 141 RTDNEPFKAEMQRFTAKIVDMMKQEKLYASQGGPIILSQIENEYGNIDSAYGAAGKSYIK 200
RT+N+PFK MQ F KIV+MMK E L+ SQGGPII++QIENEYG ++ GA GK+Y K
Sbjct: 139 RTNNQPFKVAMQGFVQKIVNMMKSENLFESQGGPIIMAQIENEYGPVEWEIGAPGKAYTK 198
Query: 201 WAAGMALSLDTGVPWVMCQQSDAPDPIINTCNGFYCDQFTPNSNNKPKMWTENWSGWFLS 260
WAA MA+ L TGVPW+MC++ DAPDP+I+TCNGFYC+ F PN KPKMWTE W+GW+
Sbjct: 199 WAAQMAVGLKTGVPWIMCKREDAPDPVIDTCNGFYCEGFRPNKPYKPKMWTEVWTGWYTK 258
Query: 261 FGGAVPYRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNFDRTSGGPFISTSYDYDAPLDE 320
FGG +P RP ED+AF+VARF Q G+F NYYMYHGGTNF RTS G FI+TSYDYDAPLDE
Sbjct: 259 FGGPIPQRPAEDIAFSVARFVQNNGSFFNYYMYHGGTNFGRTSSGLFIATSYDYDAPLDE 318
Query: 321 YGLIRQPKWGHLKDLHKAIKLCEAALVATDPTYPSLGPNLEATVYKTGSGLCSAFLANIG 380
YGL+ +PK+GHL+DLHKAIKL E ALV++ SLG N EA VY++ SG C+AFL+N
Sbjct: 319 YGLLNEPKYGHLRDLHKAIKLSEPALVSSYAAVTSLGSNQEAHVYRSKSGACAAFLSNYD 378
Query: 381 TNSDVTVKFNGNSYLLPAWSVSILPDCKNVVFNTAKINSVTLVPSFSRQSLQVAADSSDA 440
+ V V F Y LP WS+SILPDCK V+NTA++NS QS + +
Sbjct: 379 SRYSVKVTFQNRPYNLPPWSISILPDCKTAVYNTAQVNS---------QSSSIKMTPAGG 429
Query: 441 IGSGWSYINEPVGISKD-DAFTKPGLLEQINTTADQSDYLWYSLSTNIKADEPLLEDGSK 499
G W NE + D D T GL EQ N T D SDYLWY + NI ++E L +G
Sbjct: 430 -GLSWQSYNEETPTADDSDTLTANGLWEQKNVTRDSSDYLWYMTNVNIASNEGFLRNGKD 488
Query: 500 TVLHVQSLGHALHAFINGKLVGSGYGSSSNAKVTVDFPIALAPGKNTFDLLSLTVGLQNY 559
L V S GH LH F+NGKL G+ YG+ N K+T + L G N LLS++VGL N
Sbjct: 489 PYLTVMSAGHVLHVFVNGKLSGTVYGTLDNPKLTYSGNVKLRAGINKISLLSVSVGLPNV 548
Query: 560 GAFYEKTGAGITGPVQLKGSGNGTNIDLSSQQWTYQTGLKGE---ELNFPSGSSTQWDSK 616
G Y+ AG+ GPV L G G+ +L+ Q+W+Y+ GLKGE + SS +W
Sbjct: 549 GVHYDTWNAGVLGPVTLSGLNEGSR-NLAKQKWSYKVGLKGESLSLHSLSGSSSVEWVRG 607
Query: 617 STLPKLQPLVWYKTTFDAPAGSEPVAIDFTGMGKGEAWVNGQSIGRYWPTYVSQNGGCTD 676
S + + QPL WYK TF+AP G++P+A+ MGKG+ W+NG+ +GR+WP Y++Q G C+
Sbjct: 608 SLVAQKQPLTWYKATFNAPGGNDPLALGMASMGKGQIWINGEGVGRHWPGYIAQ-GDCS- 665
Query: 677 SCNYRGAYSSNKCLKNCGKPSQSLYHVPRSWLKSSGNTLVLFEEIGGDPTKISFVTK 733
C+Y G ++ KC NCG+PSQ +HVPRSWLK SGN LV+FEE GG+PT IS V +
Sbjct: 666 KCSYAGTFNEKKCQTNCGQPSQRWHHVPRSWLKPSGNLLVVFEEWGGNPTGISLVRR 722
>gi|242064502|ref|XP_002453540.1| hypothetical protein SORBIDRAFT_04g007660 [Sorghum bicolor]
gi|241933371|gb|EES06516.1| hypothetical protein SORBIDRAFT_04g007660 [Sorghum bicolor]
Length = 740
Score = 841 bits (2172), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 403/709 (56%), Positives = 501/709 (70%), Gaps = 21/709 (2%)
Query: 29 YDHRAVVIGGKRRVLISGSIHYPRSTPEMWPDLIQKSKDGGLDVIETYVFWNLHEPVRNQ 88
YDHR++VI G+RR+LISGSIHYPRSTPEMWP LIQK+KDGGLDVI+TYVFWN HEPV+ Q
Sbjct: 47 YDHRSLVINGRRRILISGSIHYPRSTPEMWPGLIQKAKDGGLDVIQTYVFWNGHEPVQGQ 106
Query: 89 YNFEGRYDLVKFVKLVAEAGLYAHLRIGPYVCAEWNFGGFPLWLHFIPGIQFRTDNEPFK 148
Y+F RYDLV+FVKLV +AGLY HLRIGPYVCAEWNFGGFP+WL ++PGI+FRTDN PFK
Sbjct: 107 YHFADRYDLVRFVKLVRQAGLYVHLRIGPYVCAEWNFGGFPVWLKYVPGIRFRTDNGPFK 166
Query: 149 AEMQRFTAKIVDMMKQEKLYASQGGPIILSQIENEYGNIDSAYGAAGKSYIKWAAGMALS 208
A MQ+F KIV MMK E L+ QGGPII++Q+ENE+G ++S G+ K Y WAA MA+
Sbjct: 167 AAMQKFVEKIVSMMKSEGLFEWQGGPIIMAQVENEFGPMESVVGSGAKPYAHWAAQMAVG 226
Query: 209 LDTGVPWVMCQQSDAPDPIINTCNGFYCDQFTPNSNNKPKMWTENWSGWFLSFGGAVPYR 268
+TGVPWVMC+Q DAPDP+INTCNGFYCD FTPN KP MWTE W+GWF FGGA+P+R
Sbjct: 227 TNTGVPWVMCKQDDAPDPVINTCNGFYCDYFTPNRKYKPTMWTEAWTGWFTKFGGALPHR 286
Query: 269 PVEDLAFAVARFFQRGGTFQNYYMYHGGTNFDRTSGGPFISTSYDYDAPLDEYGLIRQPK 328
PVEDLAFAVARF Q+GG+F NYYMYHGGTNF RT+GGPFI+TSYDYDAP+DE+GL+RQPK
Sbjct: 287 PVEDLAFAVARFIQKGGSFVNYYMYHGGTNFGRTAGGPFIATSYDYDAPIDEFGLLRQPK 346
Query: 329 WGHLKDLHKAIKLCEAALVATDPTYPSLGPNLEATVYKTGSGLCSAFLANIGTNSDVTVK 388
WGHL+DLH+AIK E AL++ DPT S+G +A ++K+ +G C+AFL+N + V ++
Sbjct: 347 WGHLRDLHRAIKQAEPALISGDPTIQSIGNYEKAYIFKSKNGACAAFLSNYHMKTAVKIR 406
Query: 389 FNGNSYLLPAWSVSILPDCKNVVFNTAKINSVTLVPSFSRQSLQVAADSSDAIGSGWSYI 448
F+G Y LPAWS+SILPDCK VFNTA + TL+P + + W
Sbjct: 407 FDGRHYDLPAWSISILPDCKTAVFNTATVKEPTLLPKM-----------NPVLHFAWQSY 455
Query: 449 NEPVGISKDDAFTKPGLLEQINTTADQSDYLWYSLSTNIKADEPLLEDGSKTVLHVQSLG 508
+E D AFT+ GL+EQ++ T D+SDYLWY+ +I +E L+ G L V S G
Sbjct: 456 SEDTNSLDDSAFTRNGLVEQLSLTWDKSDYLWYTTHVSIGGNEQFLKSGQWPQLTVYSAG 515
Query: 509 HALHAFINGKLVGSGYGSSSNAKVTVDFPIALAPGKNTFDLLSLTVGLQNYGAFYEKTGA 568
H++ F+NG+ GS YG N K+T + + + G N +LS VGL N G +E
Sbjct: 516 HSMQVFVNGRSYGSVYGGYDNPKLTFNGHVKMWQGSNKISILSSAVGLPNNGNHFELWNV 575
Query: 569 GITGPVQLKGSGNGTNIDLSSQQWTYQTGLKGEELNFPS---GSSTQWDSKSTLPKLQPL 625
G+ GPV L G G DLS Q+WTYQ GLKGE L + S+ +W QPL
Sbjct: 576 GVLGPVTLSGLNEGKR-DLSHQKWTYQVGLKGESLGLHTVTGSSAVEWAGPG---GKQPL 631
Query: 626 VWYKTTFDAPAGSEPVAIDFTGMGKGEAWVNGQSIGRYWPTYVSQNGGCTDSCNYRGAYS 685
W+K F+APAGS+PVA+D MGKG+ WVNG GRYW +Y + +G C C+Y G Y
Sbjct: 632 TWHKALFNAPAGSDPVALDMGSMGKGQIWVNGHHAGRYW-SYRAYSGSCR-RCSYAGTYR 689
Query: 686 SNKCLKNCGKPSQSLYHVPRSWLKSSGNTLVLFEEI-GGDPTKISFVTK 733
++CL NCG SQ YHVPRSWLK SGN LV+ EE GGD ++ T+
Sbjct: 690 EDQCLSNCGDISQRWYHVPRSWLKPSGNLLVVLEEYGGGDLAGVTLATR 738
>gi|1352075|sp|P49676.1|BGAL_BRAOL RecName: Full=Beta-galactosidase; Short=Lactase; Flags: Precursor
gi|669059|emb|CAA59162.1| beta-galactosidase [Brassica oleracea]
Length = 828
Score = 840 bits (2170), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 429/863 (49%), Positives = 548/863 (63%), Gaps = 51/863 (5%)
Query: 1 MASKEILLLVLCWGFVVLATTSFGAN---VTYDHRAVVIGGKRRVLISGSIHYPRSTPEM 57
M K+ LL L F++L T+ AN V++D RA+ I G+RR+L+SGSIHYPRST +M
Sbjct: 1 MKMKQFNLLSL---FLILITSFGSANSTIVSHDERAITIDGQRRILLSGSIHYPRSTSDM 57
Query: 58 WPDLIQKSKDGGLDVIETYVFWNLHEPVRNQYNFEGRYDLVKFVKLVAEAGLYAHLRIGP 117
WPDLI K+KDGGLD IETYVFWN HEP R QY+F G DLV+F+K + AGLY+ LRIGP
Sbjct: 58 WPDLISKAKDGGLDTIETYVFWNAHEPSRRQYDFSGNLDLVRFIKTIQSAGLYSVLRIGP 117
Query: 118 YVCAEWNFGGFPLWLHFIPGIQFRTDNEPFKAEMQRFTAKIVDMMKQEKLYASQGGPIIL 177
YVCAEWN+GGFP+WLH +P ++FRT N F EMQ FT KIV+MMK+E L+ASQGGPIIL
Sbjct: 118 YVCAEWNYGGFPVWLHNMPDMKFRTINPGFMNEMQNFTTKIVNMMKEESLFASQGGPIIL 177
Query: 178 SQIENEYGNIDSAYGAAGKSYIKWAAGMALSLDTGVPWVMCQQSDAPDPIINTCNGFYCD 237
+QIENEYGN+ S+YGA GK+YI W A MA SLD GVPW+MCQQ AP P+I TCNGFYCD
Sbjct: 178 AQIENEYGNVISSYGAEGKAYIDWCANMANSLDIGVPWIMCQQPHAPQPMIETCNGFYCD 237
Query: 238 QFTPNSNNKPKMWTENWSGWFLSFGGAVPYRPVEDLAFAVARFFQRGGTFQNYYMYHGGT 297
Q+ P++ + PKMWTENW+GWF ++GG PYR EDLAF+VARFFQ GGTFQNYYMYHGGT
Sbjct: 238 QYKPSNPSSPKMWTENWTGWFKNWGGKHPYRTAEDLAFSVARFFQTGGTFQNYYMYHGGT 297
Query: 298 NFDRTSGGPFISTSYDYDAPLDEYGLIRQPKWGHLKDLHKAIKLCEAALVATDPTYPSLG 357
NF R +GGP+I+TSYDYDAPLDEYG + QPKWGHLK LH +K E L + + LG
Sbjct: 298 NFGRVAGGPYITTSYDYDAPLDEYGNLNQPKWGHLKQLHTLLKSMEKPLTYGNISTIDLG 357
Query: 358 PNLEATVYKTGSGLCSAFLANIGTNSDVTVKFNGNSYLLPAWSVSILPDCKNVVFNTAKI 417
++ ATVY T S F+ N+ +D V F G Y +PAWSVS+LPDC +NTA++
Sbjct: 358 NSVTATVYSTNEK-SSCFIGNVNATADALVNFKGKDYNVPAWSVSVLPDCDKEAYNTARV 416
Query: 418 NSVTLVPSFSRQSLQVAADSSDAIGSGW--SYINEPVGISKDDAFTKPGLLEQINTTADQ 475
N+ T + + + + D + + W + + + GL++Q + T D
Sbjct: 417 NTQTSIIT------EDSCDEPEKLKWTWRPEFTTQKTILKGSGDLIAKGLVDQKDVTNDA 470
Query: 476 SDYLWYSLSTNIKADEPLLEDGSKTVLHVQSLGHALHAFINGKLVGSGYGSSSNAKVTVD 535
SDYLWY ++ +P+ L V S H LHA++NGK VG+ + +
Sbjct: 471 SDYLWYMTRVHLDKKDPIWS--RNMSLRVHSNAHVLHAYVNGKYVGNQIVRDNKFDYRFE 528
Query: 536 FPIALAPGKNTFDLLSLTVGLQNYGAFYEKTGAGITGPVQLKGSGNGTNI--DLSSQQWT 593
+ L G N LLS++VGLQNYG F+E GI GPV+L G I DLS QW
Sbjct: 529 KKVNLVHGTNHLALLSVSVGLQNYGPFFESGPTGINGPVKLVGYKGDETIEKDLSKHQWD 588
Query: 594 YQTGLKGEELNFPSGSST-----QWDSKSTLPKLQPLVWYKTTFDAPAGSEPVAIDFTGM 648
Y+ GL G S S +W S LP + L WYK F AP G +PV +D G+
Sbjct: 589 YKIGLNGFNHKLFSMKSAGHHHRKW-STEKLPADRMLSWYKANFKAPLGKDPVIVDLNGL 647
Query: 649 GKGEAWVNGQSIGRYWPTYVSQNGGCTDSCNYRGAYSSNKCLKNCGKPSQSLYHVPRSWL 708
GKGE W+NGQSIGRYWP++ S + GCT+ C+YRG Y S+KC CGKP+Q YHVPRS+L
Sbjct: 648 GKGEVWINGQSIGRYWPSFNSSDEGCTEECDYRGEYGSDKCAFMCGKPTQRWYHVPRSFL 707
Query: 709 KSSG-NTLVLFEEIGGDPTKISFVTKQLGSSLCSHVTDSHPLPVDMWGSDSKIQRKPGPV 767
G NT+ LFEE+GGDP+ + F T G +C+ + +
Sbjct: 708 NDKGHNTITLFEEMGGDPSMVKFKTVVTG-RVCAKAHEHNK------------------- 747
Query: 768 LSLECPNPNQVISSIKFASFGTPLGTCGSFSRGRCSSAR-SLSVVRQACVGSKSCSIGVS 826
+ L C N+ IS++KFASFG P G CGSF+ G C A+ ++ VV + CVG +C++ VS
Sbjct: 748 VELSC--NNRPISAVKFASFGNPSGQCGSFAAGSCEGAKDAVKVVAKECVGKLNCTMNVS 805
Query: 827 VNTFGD--PCKGVMKSLAVEASC 847
+ FG C K L VE C
Sbjct: 806 SHKFGSNLDCGDSPKRLFVEVEC 828
>gi|225441062|ref|XP_002284027.1| PREDICTED: beta-galactosidase-like [Vitis vinifera]
Length = 833
Score = 840 bits (2169), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 432/835 (51%), Positives = 544/835 (65%), Gaps = 45/835 (5%)
Query: 27 VTYDHRAVVIGGKRRVLISGSIHYPRSTPEMWPDLIQKSKDGGLDVIETYVFWNLHEPVR 86
+T D R ++I G+R++LISGS+HYPRSTPEMWPDLIQKSKDGGL+ I+TYVFW+LHEP R
Sbjct: 30 ITSDARGIMINGERKILISGSVHYPRSTPEMWPDLIQKSKDGGLNTIDTYVFWDLHEPQR 89
Query: 87 NQYNFEGRYDLVKFVKLVAEAGLYAHLRIGPYVCAEWNFGGFPLWLHFIPGIQFRTDNEP 146
QY+F G DLV+F+K + GLYA LRIGPYVCAEW +GGFP+WLH P IQ RT+N
Sbjct: 90 RQYDFTGNKDLVRFIKAIQAQGLYAVLRIGPYVCAEWTYGGFPVWLHNQPSIQLRTNNTV 149
Query: 147 FKAEMQRFTAKIVDMMKQEKLYASQGGPIILSQIENEYGNIDSAYGAAGKSYIKWAAGMA 206
+ +EMQ FT IVDMMK+E+L+ASQGGPII+SQIENEYGN+ AY AG YI W A MA
Sbjct: 150 YMSEMQTFTTMIVDMMKKEQLFASQGGPIIISQIENEYGNVMRAYHDAGVQYINWCAQMA 209
Query: 207 LSLDTGVPWVMCQQSDAPDPIINTCNGFYCDQFTPNSNNKPKMWTENWSGWFLSFGGAVP 266
+LDTGVPW+MCQQ +AP P+INTCNG+YCDQFTPN+ N PKMWTENWSGW+ ++GG+ P
Sbjct: 210 AALDTGVPWIMCQQDNAPQPMINTCNGYYCDQFTPNNPNSPKMWTENWSGWYKNWGGSDP 269
Query: 267 YRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNFDRTSGGPFISTSYDYDAPLDEYGLIRQ 326
+R EDLAF+VARF+Q GGTFQNYYMYHGGTNF RT+GGP+I+TSYDYDAPL+EYG Q
Sbjct: 270 HRTAEDLAFSVARFYQLGGTFQNYYMYHGGTNFGRTAGGPYITTSYDYDAPLNEYGNKNQ 329
Query: 327 PKWGHLKDLHKAIKLCEAALVATDPTYPSLGPNLEATVYKTGSGLCSAFLANIGTNSDVT 386
PKWGHL+DLH + E AL D AT+Y G S F N + DVT
Sbjct: 330 PKWGHLRDLHLLLLSMEKALTYGDVKNVDYETLTSATIYSY-QGKSSCFFGNSNADRDVT 388
Query: 387 VKFNGNSYLLPAWSVSILPDCKNVVFNTAKINSVTLVPSFSRQSLQVAADSSDAIGSGWS 446
+ + G +Y +PAWSVSILPDC N V+NTAK+NS +F ++ + + + W+
Sbjct: 389 INYGGVNYTIPAWSVSILPDCSNEVYNTAKVNS--QYSTFVKKGSEAENEPNSL---QWT 443
Query: 447 YINEPVGISKDDAFTKPGLLEQINTTADQSDYLWYSLSTNIKADEPLLEDGSKTVLHVQS 506
+ E + FT LL+Q D SDYL+Y + +I D+P+ G L V +
Sbjct: 444 WRGETIQYITPGRFTASELLDQKTVAEDTSDYLYYMTTVDISNDDPIW--GKDLTLSVNT 501
Query: 507 LGHALHAFINGKLVGSGYGSSSNAKVTVDFPIALAPGKNTFDLLSLTVGLQNYGAFYEKT 566
GH LHAF+NG+ +G Y + + L GKN LLS TVGL NYG ++
Sbjct: 502 SGHILHAFVNGEHIGYQYALLGQFEFQFRRSVTLQLGKNEITLLSATVGLTNYGPDFDMV 561
Query: 567 GAGITGPVQLKGSGNGTNI--DLS-SQQWTYQTGLKGEELNFPSGSS--TQWDSKSTLPK 621
GI GPVQ+ S +I DLS + QW Y+ GL GE+ G + QW S + LP
Sbjct: 562 NQGIHGPVQIIASNGSADIIKDLSNNNQWAYKAGLNGEDKKIFLGRARYNQWKSDN-LPV 620
Query: 622 LQPLVWYKTTFDAPAGSEPVAIDFTGMGKGEAWVNGQSIGRYWPTYVSQNGGCTDSCNYR 681
+ VWYK TFDAP G +PV +D G+GKGEAWVNG S+GRYWP+Y+++ GC+ C+YR
Sbjct: 621 NRSFVWYKATFDAPPGEDPVVVDLMGLGKGEAWVNGHSLGRYWPSYIARGEGCSPECDYR 680
Query: 682 GAYSSNKCLKNCGKPSQSLYHVPRSWLKSSGNTLVLFEEIGGDPTKISFVTKQLGSSLCS 741
G Y + KC NCG PSQ YHVPRS+L S+ N LVLFEE GG+P+ ++F T +G++ C+
Sbjct: 681 GPYKAEKCNTNCGNPSQRWYHVPRSFLASTDNRLVLFEEFGGNPSSVTFQTVTVGNA-CA 739
Query: 742 HVTDSHPLPVDMWGSDSKIQRKPGPVLSLECPNPNQVISSIKFASFGTPLGTCGS----- 796
+ + G L L C + IS IKFASFG P GTCG
Sbjct: 740 NA-------------------REGYTLELSC--QGRAISGIKFASFGDPQGTCGKPFATG 778
Query: 797 ---FSRGRCSSARSLSVVRQACVGSKSCSIGVSVNTFGDP-CKGVMKSLAVEASC 847
F +G C +A SLS++++ CVG SCSI VS G C K LAVEA C
Sbjct: 779 SQVFEKGTCEAADSLSIIQKLCVGKYSCSIDVSEQILGPAGCTADTKRLAVEAIC 833
>gi|255563853|ref|XP_002522927.1| beta-galactosidase, putative [Ricinus communis]
gi|223537854|gb|EEF39470.1| beta-galactosidase, putative [Ricinus communis]
Length = 803
Score = 837 bits (2162), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 429/837 (51%), Positives = 547/837 (65%), Gaps = 63/837 (7%)
Query: 20 TTSFGANVTYDHRAVVIGGKRRVLISGSIHYPRSTPEMWPDLIQKSKDGGLDVIETYVFW 79
T G N+TYD R+++I G+R++LIS +IHYPRS P MWP+L+Q +K+GG+DVIETYVFW
Sbjct: 22 TLCCGGNITYDSRSLIIDGQRKLLISAAIHYPRSVPGMWPELVQTAKEGGVDVIETYVFW 81
Query: 80 NLHEPVRNQYNFEGRYDLVKFVKLVAEAGLYAHLRIGPYVCAEWNFGGFPLWLHFIPGIQ 139
N HEP + Y FE RYDLVKFVK+V +AG+Y LRIGP+V AEWNFGG P+WLH++PG
Sbjct: 82 NGHEPSPSNYYFEKRYDLVKFVKIVQQAGMYLILRIGPFVAAEWNFGGVPVWLHYVPGTV 141
Query: 140 FRTDNEPFKAEMQRFTAKIVDMMKQEKLYASQGGPIILSQIENEYGNIDSAYGAAGKSYI 199
FRTDN FK MQ+F IV++MK+EKL+ASQGGPIIL+Q+ENEYG +SAYG GK Y
Sbjct: 142 FRTDNYNFKYHMQKFMTYIVNLMKKEKLFASQGGPIILAQVENEYGFYESAYGEGGKRYA 201
Query: 200 KWAAGMALSLDTGVPWVMCQQSDAPDPIINTCNGFYCDQFTPNSNNKPKMWTENWSGWFL 259
WAA MA+S + GVPW+MCQQ DAP+ +INTCN FYCDQF P +KPK+WTENW GWF
Sbjct: 202 MWAAQMAVSQNIGVPWIMCQQFDAPNSVINTCNSFYCDQFKPIFPDKPKIWTENWPGWFQ 261
Query: 260 SFGGAVPYRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNFDRTSGGPFISTSYDYDAPLD 319
+FG P+RP ED+AF+VARFFQ+GG+ QNYYMYHGGTNF RTSGGPFI+TSYDY+AP+D
Sbjct: 262 TFGAPNPHRPAEDIAFSVARFFQKGGSVQNYYMYHGGTNFGRTSGGPFITTSYDYEAPID 321
Query: 320 EYGLIRQPKWGHLKDLHKAIKLCEAALVATDPTYPSLGPNLEATVYKTGSGLCSAFLANI 379
EYGL R PKW HLK+LHKAIKLCE L+ + P SLGP+ EA VY SG C+AFLAN+
Sbjct: 322 EYGLARLPKWAHLKELHKAIKLCELTLLNSVPVNLSLGPSQEADVYAEESGACAAFLANM 381
Query: 380 GTNSDVTVKFNGNSYLLPAWSVSILPDCKNVVFNTAKINSVTLVPSFSRQSLQVAADSSD 439
+D TV F SY LPAWSVSILPDCKNVVFNTAK+NS T + L+ + +
Sbjct: 382 DEKNDKTVVFRNMSYHLPAWSVSILPDCKNVVFNTAKVNSQTSIVEMVPDDLRSSDKGTK 441
Query: 440 AIGSGWSYINEPVGISKDDAFTKPGLLEQINTTADQSDYLWYSLSTNIKADEPLLEDGSK 499
A+ W E GI K G ++ INTT D +DYLWY+ S + +E L+ G +
Sbjct: 442 AL--KWETFVENAGIWGTSDLVKNGFVDHINTTKDTTDYLWYTTSIFVGENEEFLKKGGR 499
Query: 500 TVLHVQSLGHALHAFINGKLVGSGYGSSSNAKVTVDFPIALAPGKNTFDLLSLTVGLQNY 559
VL ++S GHALHAF+N +L G+ G+ +++ P++L GKN LLS+TVGLQN
Sbjct: 500 PVLLIESKGHALHAFVNQELQGTASGNGTHSPFKFKKPVSLVAGKNDIALLSMTVGLQNA 559
Query: 560 GAFYEKTGAGITGPVQLKGSGNGTNIDLSSQQWTYQTGLKGEELNFPSG---SSTQWDSK 616
G+FYE GAG+T V++KG NGT IDLS+ WTY+ GL+GE+L +G + W +
Sbjct: 560 GSFYEWVGAGLTS-VKMKGFNNGT-IDLSTFNWTYKIGLQGEKLGMYNGIAVETVNWVAT 617
Query: 617 STLPKLQPLVWYKTTFDAPAGSEPVAIDFTGMGKGEAWVNGQSIGRYWPTYVSQNGGCTD 676
S PK QPL WYK A M +N + I W Y
Sbjct: 618 SKPPKDQPLTWYKRQIHA-----------RQMLNWMWRINSEMI-LVWTRY--------- 656
Query: 677 SCNYRGAYSSNKCLKNCGKPSQSLYHVPRSWLKSSGNTLVLFEEIGGDPTKISFVTKQLG 736
HVPRSW K SGN LV+FEE GGDPTKI+F +++
Sbjct: 657 -------------------------HVPRSWFKPSGNILVIFEEKGGDPTKITFSRRKI- 690
Query: 737 SSLCSHVTDSHPLP----VDMWGSDSKIQRKPGPVLSLECPNPNQVISSIKFASFGTPLG 792
S +C+ V + +P+ ++ GS S + + L+CP + +IS+IKFASFG+P G
Sbjct: 691 SGVCALVAEDYPMANLESLENAGSGSSNYKAS---VHLKCPK-SSIISAIKFASFGSPAG 746
Query: 793 TCGSFSRGRCSSARSLSVVRQACVGSKSCSIGVSVNTFGDP-CKGVMKSLAVEASCT 848
CGS+S G C +S+SVV + C+ C + V+ F C G MK LAVEA C+
Sbjct: 747 ACGSYSEGECHDPKSISVVEKVCLNKNQCVVEVTEENFSKGLCPGKMKKLAVEAVCS 803
>gi|297793199|ref|XP_002864484.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
gi|297310319|gb|EFH40743.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
Length = 726
Score = 837 bits (2161), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 416/734 (56%), Positives = 518/734 (70%), Gaps = 25/734 (3%)
Query: 6 ILLLVLCWGFVVLATTSFGANVTYDHRAVVIGGKRRVLISGSIHYPRSTPEMWPDLIQKS 65
I L++LC +V A+V+YD +AV+I G+RR+L+SGSIHYPRSTPEMWP LIQK+
Sbjct: 11 IFLVILCCLSLVCIVK---ASVSYDRKAVIINGQRRILLSGSIHYPRSTPEMWPGLIQKA 67
Query: 66 KDGGLDVIETYVFWNLHEPVRNQYNFEGRYDLVKFVKLVAEAGLYAHLRIGPYVCAEWNF 125
K+GGLDVIETYVFWN HEP QY F RYDLVKF+KLV +AGLY +LRIGPYVCAEWNF
Sbjct: 68 KEGGLDVIETYVFWNGHEPSPGQYYFGDRYDLVKFIKLVHQAGLYVNLRIGPYVCAEWNF 127
Query: 126 GGFPLWLHFIPGIQFRTDNEPFKAEMQRFTAKIVDMMKQEKLYASQGGPIILS--QIENE 183
GGFP+WL F+PG+ FRTDNEPFKA M++FT KIV MMK EKL+ +QGGPIIL+ QIENE
Sbjct: 128 GGFPVWLKFVPGMAFRTDNEPFKAAMKKFTEKIVWMMKAEKLFQTQGGPIILAQGQIENE 187
Query: 184 YGNIDSAYGAAGKSYIKWAAGMALSLDTGVPWVMCQQSDAPDPIINTCNGFYCDQFTPNS 243
YG ++ GA GK+Y KW A MAL L TGVPW+MC+Q DAP PII+TCNG+YC+ F PNS
Sbjct: 188 YGPVEWEIGAPGKAYTKWVAQMALGLSTGVPWIMCKQEDAPSPIIDTCNGYYCEDFKPNS 247
Query: 244 NNKPKMWTENWSGWFLSFGGAVPYRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNFDRTS 303
+NKPKMWTENW+GW+ FGGAVPYRPVED+A++VARF Q+GG+F NYYMYHGGTNFDRT+
Sbjct: 248 SNKPKMWTENWTGWYTEFGGAVPYRPVEDIAYSVARFIQKGGSFVNYYMYHGGTNFDRTA 307
Query: 304 GGPFISTSYDYDAPLDEYGLIRQPKWGHLKDLHKAIKLCEAALVATDPTYPSLGPNLEAT 363
G F+++SYDYDAPLDEYGL R+PK+ HLK LHK IKL E AL++ D T SLG EA
Sbjct: 308 -GEFMASSYDYDAPLDEYGLPREPKYSHLKALHKVIKLSEPALLSADATVTSLGAKQEAY 366
Query: 364 VYKTGSGLCSAFLANIGTNSDVTVKFNGNSYLLPAWSVSILPDCKNVVFNTAKINSVTLV 423
V+ + S C+AFL+N +S V F G Y+LP WSVSILPDCK +NTAK+N+
Sbjct: 367 VFWSKSS-CAAFLSNKDESSAARVMFRGFPYVLPPWSVSILPDCKTEFYNTAKVNA---- 421
Query: 424 PSFSRQSLQVAADSSDAIGSGWSYINEPVGISKD-DAFTKPGLLEQINTTADQSDYLWYS 482
PS R + A S W NE + + F + GL+EQI+ T D+SDY WY
Sbjct: 422 PSVHRNMVPTGARFS------WGSFNEATPTANEAGTFARNGLVEQISMTWDKSDYFWYL 475
Query: 483 LSTNIKADEPLLEDGSKTVLHVQSLGHALHAFINGKLVGSGYGSSSNAKVTVDFPIALAP 542
I + E L+ G + V S GHALH F+NG+L G+ YG + K+T I L
Sbjct: 476 TDITIGSGETFLKTGDFPLFTVMSAGHALHVFVNGQLSGTAYGGLDHPKLTFTQKIKLHA 535
Query: 543 GKNTFDLLSLTVGLQNYGAFYEKTGAGITGPVQLKGSGNGTNIDLSSQQWTYQTGLKGEE 602
G N LLS+ VGL N G +E+ G+ GPV LKG +GT D+S +W+Y+ G+KGE
Sbjct: 536 GVNKLALLSVAVGLPNVGTHFEQWNKGVLGPVTLKGVNSGT-WDMSKWKWSYKIGVKGEA 594
Query: 603 LNFPS---GSSTQWDSKSTLPKLQPLVWYKTTFDAPAGSEPVAIDFTGMGKGEAWVNGQS 659
L+ + S +W S + K QPL WYK+TF PAG+EP+A+D MGKG+ W+NG++
Sbjct: 595 LSLHTDTESSGVRWTQGSFVAKKQPLTWYKSTFATPAGNEPLALDMNTMGKGQVWINGRN 654
Query: 660 IGRYWPTYVSQNGGCTDSCNYRGAYSSNKCLKNCGKPSQSLYHVPRSWLKSSGNTLVLFE 719
IGR+WP Y +Q G CNY G +++ KCL NCG+ SQ YHVPRSWLKS N +V+FE
Sbjct: 655 IGRHWPAYKAQ--GSCGRCNYAGTFNAKKCLSNCGEASQRWYHVPRSWLKSQ-NLIVVFE 711
Query: 720 EIGGDPTKISFVTK 733
E GGDP IS V +
Sbjct: 712 EWGGDPNGISLVKR 725
>gi|357437609|ref|XP_003589080.1| Beta-galactosidase [Medicago truncatula]
gi|355478128|gb|AES59331.1| Beta-galactosidase [Medicago truncatula]
Length = 718
Score = 835 bits (2158), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 412/713 (57%), Positives = 495/713 (69%), Gaps = 22/713 (3%)
Query: 25 ANVTYDHRAVVIGGKRRVLISGSIHYPRSTPEMWPDLIQKSKDGGLDVIETYVFWNLHEP 84
A+V+YDH+A+VI G+RR+LISGSIHYPRSTPEMWPDL QK+KDGGLDVI+TYVFWN HEP
Sbjct: 23 ASVSYDHKALVIDGQRRILISGSIHYPRSTPEMWPDLFQKAKDGGLDVIQTYVFWNGHEP 82
Query: 85 VRNQYNFEGRYDLVKFVKLVAEAGLYAHLRIGPYVCAEWNFGGFPLWLHFIPGIQFRTDN 144
Y + R D VK KL +A L HLR+ P F GFP+WL ++PG+ FRTDN
Sbjct: 83 SPGNYTLKDRLDWVKLSKLAQQAVLNVHLRMVP------TFVGFPVWLKYVPGMAFRTDN 136
Query: 145 EPFKAEMQRFTAKIVDMMKQEKLYASQGGPIILSQIENEYGNIDSAYGAAGKSYIKWAAG 204
EPFKA MQ+FT KIV MMK E L+ +QGGPII+SQIENEYG ++ GA GK+Y KWAA
Sbjct: 137 EPFKAAMQKFTTKIVTMMKAESLFQTQGGPIIMSQIENEYGPVEWEIGAPGKAYTKWAAQ 196
Query: 205 MALSLDTGVPWVMCQQSDAPDPIINTCNGFYCDQFTPNSNNKPKMWTENWSGWFLSFGGA 264
MA+ LDTGVPW MC+Q DAPDP+I+TCNG+YC+ FTPN N KPKMWTENWSGW+ FGGA
Sbjct: 197 MAVGLDTGVPWDMCKQEDAPDPVIDTCNGYYCENFTPNENFKPKMWTENWSGWYTDFGGA 256
Query: 265 VPYRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNFDRTSGGPFISTSYDYDAPLDEYGLI 324
+ +RP EDLA++VA F Q G+F NYYMYHGGTNF RTS G FI+TSYDYDAP+DEYGL
Sbjct: 257 ISHRPTEDLAYSVATFIQNRGSFVNYYMYHGGTNFGRTSSGLFIATSYDYDAPIDEYGLP 316
Query: 325 RQPKWGHLKDLHKAIKLCEAALVATDPTYPSLG-PNLEATVYKTGSGLCSAFLANIGTNS 383
+PKW HLK+LHKAIK CE AL++ DPT LG NLEA VY + +C+AFLAN T S
Sbjct: 317 NEPKWSHLKNLHKAIKQCEPALISVDPTVTWLGNKNLEAHVYYVNTSICAAFLANYDTKS 376
Query: 384 DVTVKFNGNSYLLPAWSVSILPDCKNVVFNTAKINSVTLVPSFSRQSLQVAADSSDAIGS 443
TV F Y LP WSVSILPDCK VVFNTA +N SF ++ V +
Sbjct: 377 AATVTFGNGQYDLPPWSVSILPDCKTVVFNTATVNG----HSFHKRMTPV-----ETTFD 427
Query: 444 GWSYINEPVGISKDDAFTKPGLLEQINTTADQSDYLWYSLSTNIKADEPLLEDGSKTVLH 503
SY EP S DD+ L EQIN T D SDYLWY NI E +++G L
Sbjct: 428 WQSYSEEPAYSSDDDSIIANALWEQINVTRDSSDYLWYLTDVNISPSESFIKNGQFPTLT 487
Query: 504 VQSLGHALHAFINGKLVGSGYGSSSNAKVTVDFPIALAPGKNTFDLLSLTVGLQNYGAFY 563
+ S GH LH F+NG+L G+ YG N KVT + L G N LLS+ VGL N G +
Sbjct: 488 INSAGHVLHVFVNGQLSGTVYGGLDNPKVTFSESVNLKVGNNKISLLSVAVGLPNVGLHF 547
Query: 564 EKTGAGITGPVQLKGSGNGTNIDLSSQQWTYQTGLKGEELNFPS---GSSTQWDSKSTLP 620
E G+ GPV+LKG GT DLS Q+W+Y+ GLKGE L+ + SS W S+L
Sbjct: 548 ETWNVGVLGPVRLKGLDEGTR-DLSWQKWSYKVGLKGESLSLHTITGSSSIDWTQGSSLA 606
Query: 621 KLQPLVWYKTTFDAPAGSEPVAIDFTGMGKGEAWVNGQSIGRYWPTYVSQNGGCTDSCNY 680
K QPL WYKTTFDAP+G++PVA+D + MGKGE W+N QSIGR+WP Y++ G C D CNY
Sbjct: 607 KKQPLTWYKTTFDAPSGNDPVALDMSSMGKGEIWINDQSIGRHWPAYIAH-GNC-DECNY 664
Query: 681 RGAYSSNKCLKNCGKPSQSLYHVPRSWLKSSGNTLVLFEEIGGDPTKISFVTK 733
G +++ KC NCG+P+Q YH+PRSWL SSGN LV+ EE GGDPT IS V +
Sbjct: 665 AGTFTNPKCRTNCGEPTQKWYHIPRSWLSSSGNVLVVLEEWGGDPTGISLVKR 717
>gi|449529435|ref|XP_004171705.1| PREDICTED: beta-galactosidase 7-like [Cucumis sativus]
Length = 826
Score = 835 bits (2157), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 429/837 (51%), Positives = 553/837 (66%), Gaps = 48/837 (5%)
Query: 24 GANVTYDHRAVVIGGKRRVLISGSIHYPRSTPEMWPDLIQKSKDGGLDVIETYVFWNLHE 83
G NV+YD A++I G+RR++ SGSIHYPRST EMWPDLIQK+KDGGLD IETY+FW+ HE
Sbjct: 24 GNNVSYDSNAIIINGERRIIFSGSIHYPRSTEEMWPDLIQKAKDGGLDAIETYIFWDRHE 83
Query: 84 PVRNQYNFEGRYDLVKFVKLVAEAGLYAHLRIGPYVCAEWNFGGFPLWLHFIPGIQFRTD 143
P R +Y+F G + +K+ +L+ EAGLY +RIGPYVCAEWN+GGFPLWLH +PGIQ RT+
Sbjct: 84 PHRRKYDFSGHLNFIKYFQLIQEAGLYVVMRIGPYVCAEWNYGGFPLWLHNMPGIQLRTN 143
Query: 144 NEPFKAEMQRFTAKIVDMMKQEKLYASQGGPIILSQIENEYGNIDSAYGAAGKSYIKWAA 203
N+ +K EMQ FT KIV+M KQ L+ASQGGPIIL+QIENEYGN+ + YG AGK+YI W A
Sbjct: 144 NQVYKNEMQTFTTKIVNMCKQANLFASQGGPIILAQIENEYGNVMTPYGEAGKTYINWCA 203
Query: 204 GMALSLDTGVPWVMCQQSDAPDPIINTCNGFYCDQFTPNSNNKPKMWTENWSGWFLSFGG 263
MA SL+ G+PW+MCQQSDAP PIINTCNGFYCD FTPN+ N PKM+TENW GWF +G
Sbjct: 204 QMAESLNIGIPWIMCQQSDAPQPIINTCNGFYCDNFTPNNPNSPKMFTENWVGWFKKWGD 263
Query: 264 AVPYRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNFDRTSGGPFISTSYDYDAPLDEYGL 323
P+R ED+AF+VARFFQ GG NYYMYHGGTNF RTSGGPFI+TSYDYDAPLDEYG
Sbjct: 264 KDPHRTAEDVAFSVARFFQSGGILNNYYMYHGGTNFGRTSGGPFITTSYDYDAPLDEYGN 323
Query: 324 IRQPKWGHLKDLHKAIKLCEAALVATDPTYPSLGPNLEATVY---KTGSGLCSAFLANIG 380
+ QPKWGHLK LH +IKL E L + + G ++ T + +TG C FL+N
Sbjct: 324 LNQPKWGHLKQLHASIKLGEKILTNSTRSDQDFGSSVTFTKFSNLETGEKFC--FLSNAD 381
Query: 381 TNSDVTVKFNGN-SYLLPAWSVSILPDCKNVVFNTAKINSVTLVPSFSRQSLQVAADSSD 439
N+D V G+ Y LPAWSVSIL C +FNTAK++S T + F +Q+ + A S
Sbjct: 382 ENNDAIVDMLGDRKYFLPAWSVSILDGCNKEIFNTAKVSSQTSL-FFKKQNEKENAKLS- 439
Query: 440 AIGSGWSYINEPV--GISKDDAFTKPGLLEQINTTADQSDYLWYSLSTNIKADEPLLEDG 497
W++ +EP+ + F LLEQ T D SDYLWY + N L
Sbjct: 440 -----WNWASEPMRDTLQGYGTFKANLLLEQKGATIDSSDYLWYMTNVNSNTTSSL---- 490
Query: 498 SKTVLHVQSLGHALHAFINGKLVGSGYGSSSNAKVTVDFPIALAPGKNTFDLLSLTVGLQ 557
L V + GH LHAFIN + +GS +GS+ + V + PI L G NT LLS TVGL+
Sbjct: 491 QNLTLQVNTKGHVLHAFINRRYIGSQWGSNGQSFV-FEKPIQLKLGTNTITLLSATVGLK 549
Query: 558 NYGAFYEKTGAGIT-GPVQLKGSGNGTNIDLSSQQWTYQTGLKGE--ELNFPSGSS-TQW 613
NY AFY+ GI GP+ L G GN T DLSS W+Y+ GL GE +L P S+ T+W
Sbjct: 550 NYDAFYDTVPTGIDGGPIYLIGDGNVT-TDLSSNLWSYKVGLNGERKQLYNPMFSNRTKW 608
Query: 614 DSKSTLPKLQPLVWYKTTFDAPAGSEPVAIDFTGMGKGEAWVNGQSIGRYWPTYVSQNGG 673
+ + + + W+K TF P+G++PV +D GMGKG+AWVNG+SIGR+WP++++ N
Sbjct: 609 STLNKKSIGRRMTWFKATFKTPSGTDPVVLDMQGMGKGQAWVNGRSIGRFWPSFIASNDS 668
Query: 674 CTDSCNYRGAYSSNKCLKNCGKPSQSLYHVPRSWLKSSGNTLVLFEEIGGDPTKISFVTK 733
C+++C+Y+G+Y+ NKC++NCG SQ YH+PRS++ S NTL+LFEEIGG+P +S T
Sbjct: 669 CSETCDYKGSYNPNKCVRNCGNSSQRWYHIPRSFMNDSINTLILFEEIGGNPQMVSVQTI 728
Query: 734 QLGSSLCSHVTDSHPLPVDMWGSDSKIQRKPGPVLSLECPNPNQVISSIKFASFGTPLGT 793
+G ++C + + G L L C VIS I+FAS+G P G
Sbjct: 729 TIG-TICGNANE-------------------GSTLELSCQG-GHVISEIQFASYGHPEGK 767
Query: 794 CGSFSRGRCSSARSLS-VVRQACVGSKSCSIGVSVNTFG-DPCKGVMKSLAVEASCT 848
CGSF G +S + +V +AC+G K+CSI +S N F LAV+A C+
Sbjct: 768 CGSFQSGLWDVTKSTTIIVEKACIGMKNCSIDISPNLFKLSKVAYPYAKLAVQALCS 824
>gi|2209358|gb|AAB61470.1| beta-D-galactosidase [Mangifera indica]
Length = 663
Score = 833 bits (2153), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 403/666 (60%), Positives = 488/666 (73%), Gaps = 31/666 (4%)
Query: 6 ILLLVLCWGFVVLATTSFGANVTYDHRAVVIGGKRRVLISGSIHYPRSTPEMWPDLIQKS 65
+LLL+L +V A V+YDH+A++I G+RR+LISGSIHYPRSTP+MWPDLIQK+
Sbjct: 17 MLLLMLFSSWVCFVE----ATVSYDHKAIIIDGQRRILISGSIHYPRSTPQMWPDLIQKA 72
Query: 66 KDGGLDVIETYVFWNLHEPVRNQYNFEGRYDLVKFVKLVAEAGLYAHLRIGPYVCAEWNF 125
KDG +DVI+TYVFWN HEP +Y FE RYDLV+F+KLV +AGLY HLRIGPYVCAEWNF
Sbjct: 73 KDG-VDVIQTYVFWNGHEPSPGKYYFEDRYDLVRFIKLVQQAGLYVHLRIGPYVCAEWNF 131
Query: 126 GGFPLWLHFIPGIQFRTDNEPFKAEMQRFTAKIVDMMKQEKLYASQGGPIILSQIENEYG 185
GGFP+WL ++PGI+FRTDNEPFKA MQ+FT KIV MMK EKL+ +QGGPIILSQIENE+G
Sbjct: 132 GGFPVWLKYVPGIEFRTDNEPFKAAMQKFTEKIVSMMKAEKLFETQGGPIILSQIENEFG 191
Query: 186 NIDSAYGAAGKSYIKWAAGMALSLDTGVPWVMCQQSDAPDPIINTCNGFYCDQFTPNSNN 245
++ GA GK+Y KWAA MA+ LDTGVPWVMC+Q DAPDP+INTCNGFYC+ F PN N
Sbjct: 192 PVEWEIGAPGKAYTKWAAQMAVGLDTGVPWVMCKQDDAPDPVINTCNGFYCENFVPNQKN 251
Query: 246 KPKMWTENWSGWFLSFGGAVPYRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNFDRTSGG 305
KPKMWTENW+GWF +FGG P RP ED+AF+VARF Q GG+F NYYMYHGGTNF RT+GG
Sbjct: 252 KPKMWTENWTGWFTAFGGPTPQRPAEDVAFSVARFIQNGGSFVNYYMYHGGTNFGRTAGG 311
Query: 306 PFISTSYDYDAPLDEYGLIRQPKWGHLKDLHKAIKLCEAALVATDPTYPSLGPNLEATVY 365
PFI+TSYDYDAPLDEYGL+R+PKWGHL+DLHKAIKLCE+ALV+TDPT SLG N E V+
Sbjct: 312 PFIATSYDYDAPLDEYGLLREPKWGHLRDLHKAIKLCESALVSTDPTVTSLGNNQEVHVF 371
Query: 366 KTGSGLCSAFLANIGTNSDVTVKFNGNSYLLPAWSVSILPDCKNVVFNTAKINS------ 419
SG C+AFLAN T S V F Y LP WS+SILPDCK VFNTA++ +
Sbjct: 372 NPKSGSCAAFLANYDTTSSAKVNFKIMQYELPPWSISILPDCKTAVFNTARLGAQSSLKQ 431
Query: 420 VTLVPSFSRQSLQVAADSSDAIGSGWSYINEPVGISKDDAFTKPGLLEQINTTADQSDYL 479
+T V +FS Q SYI E S D FT GL EQ+N T D SDYL
Sbjct: 432 MTPVSTFSWQ----------------SYIEESASSSDDKTFTTDGLWEQLNVTRDASDYL 475
Query: 480 WYSLSTNIKADEPLLEDGSKTVLHVQSLGHALHAFINGKLVGSGYGSSSNAKVTVDFPIA 539
WY + NI ++E L++G +L + S GHALH FING+L G+ YG N K+T +
Sbjct: 476 WYMTNINIDSNEGFLKNGQDPLLTIWSAGHALHVFINGQLSGTVYGGVDNPKLTFSQNVK 535
Query: 540 LAPGKNTFDLLSLTVGLQNYGAFYEKTGAGITGPVQLKGSGNGTNIDLSSQQWTYQTGLK 599
+ G N LLS++VGLQN G +E+ G+ GPV L+G GT DLS QQW+Y+ GLK
Sbjct: 536 MRVGVNQLSLLSISVGLQNVGTHFEQWNTGVLGPVTLRGLNEGTR-DLSKQQWSYKIGLK 594
Query: 600 GEELNFPS---GSSTQWDSKSTLPKLQPLVWYKTTFDAPAGSEPVAIDFTGMGKGEAWVN 656
GE+L+ + SS +W S+L + QPL WYKTTF+APAG+EP+A+D + MGKG W+N
Sbjct: 595 GEDLSLHTVSGSSSVEWVEGSSLAQKQPLTWYKTTFNAPAGNEPLALDMSTMGKGLIWIN 654
Query: 657 GQSIGR 662
QSIGR
Sbjct: 655 SQSIGR 660
>gi|297740029|emb|CBI30211.3| unnamed protein product [Vitis vinifera]
Length = 829
Score = 833 bits (2152), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 433/835 (51%), Positives = 545/835 (65%), Gaps = 49/835 (5%)
Query: 27 VTYDHRAVVIGGKRRVLISGSIHYPRSTPEMWPDLIQKSKDGGLDVIETYVFWNLHEPVR 86
+T D R ++I G+R++LISGS+HYPRSTPEMWPDLIQKSKDGGL+ I+TYVFW+LHEP R
Sbjct: 30 ITSDARGIMINGERKILISGSVHYPRSTPEMWPDLIQKSKDGGLNTIDTYVFWDLHEPQR 89
Query: 87 NQYNFEGRYDLVKFVKLVAEAGLYAHLRIGPYVCAEWNFGGFPLWLHFIPGIQFRTDNEP 146
QY+F G DLV+F+K + GLYA LRIGPYVCAEW +GGFP+WLH P IQ RT+N
Sbjct: 90 RQYDFTGNKDLVRFIKAIQAQGLYAVLRIGPYVCAEWTYGGFPVWLHNQPSIQLRTNNTV 149
Query: 147 FKAEMQRFTAKIVDMMKQEKLYASQGGPIILSQIENEYGNIDSAYGAAGKSYIKWAAGMA 206
+ +EMQ FT IVDMMK+E+L+ASQGGPII+SQIENEYGN+ AY AG YI W A MA
Sbjct: 150 YMSEMQTFTTMIVDMMKKEQLFASQGGPIIISQIENEYGNVMRAYHDAGVQYINWCAQMA 209
Query: 207 LSLDTGVPWVMCQQSDAPDPIINTCNGFYCDQFTPNSNNKPKMWTENWSGWFLSFGGAVP 266
+LDTGVPW+MCQQ +AP P+INTCNG+YCDQFTPN+ N PKMWTENWSGW+ ++GG+ P
Sbjct: 210 AALDTGVPWIMCQQDNAPQPMINTCNGYYCDQFTPNNPNSPKMWTENWSGWYKNWGGSDP 269
Query: 267 YRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNFDRTSGGPFISTSYDYDAPLDEYGLIRQ 326
+R EDLAF+VARF+Q GGTFQNYYMYHGGTNF RT+GGP+I+TSYDYDAPL+EYG Q
Sbjct: 270 HRTAEDLAFSVARFYQLGGTFQNYYMYHGGTNFGRTAGGPYITTSYDYDAPLNEYGNKNQ 329
Query: 327 PKWGHLKDLHKAIKLCEAALVATDPTYPSLGPNLEATVYKTGSGLCSAFLANIGTNSDVT 386
PKWGHL+DLH + E AL D AT+Y G S F N + DVT
Sbjct: 330 PKWGHLRDLHLLLLSMEKALTYGDVKNVDYETLTSATIYSY-QGKSSCFFGNSNADRDVT 388
Query: 387 VKFNGNSYLLPAWSVSILPDCKNVVFNTAKINSVTLVPSFSRQSLQVAADSSDAIGSGWS 446
+ + G +Y +PAWSVSILPDC N V+NTAK+NS +F ++ + + + W+
Sbjct: 389 INYGGVNYTIPAWSVSILPDCSNEVYNTAKVNS--QYSTFVKKGSEAENEPNSL---QWT 443
Query: 447 YINEPVGISKDDAFTKPGLLEQINTTADQSDYLWYSLSTNIKADEPLLEDGSKTVLHVQS 506
+ E + FT LL+Q D SDYL+Y ++TN D+P+ G L V +
Sbjct: 444 WRGETIQYITPGRFTASELLDQKTVAEDTSDYLYY-MTTN---DDPIW--GKDLTLSVNT 497
Query: 507 LGHALHAFINGKLVGSGYGSSSNAKVTVDFPIALAPGKNTFDLLSLTVGLQNYGAFYEKT 566
GH LHAF+NG+ +G Y + + L GKN LLS TVGL NYG ++
Sbjct: 498 SGHILHAFVNGEHIGYQYALLGQFEFQFRRSVTLQLGKNEITLLSATVGLTNYGPDFDMV 557
Query: 567 GAGITGPVQLKGSGNGTNI--DLS-SQQWTYQTGLKGEELNFPSGSS--TQWDSKSTLPK 621
GI GPVQ+ S +I DLS + QW Y+ GL GE+ G + QW S + LP
Sbjct: 558 NQGIHGPVQIIASNGSADIIKDLSNNNQWAYKAGLNGEDKKIFLGRARYNQWKSDN-LPV 616
Query: 622 LQPLVWYKTTFDAPAGSEPVAIDFTGMGKGEAWVNGQSIGRYWPTYVSQNGGCTDSCNYR 681
+ VWYK TFDAP G +PV +D G+GKGEAWVNG S+GRYWP+Y+++ GC+ C+YR
Sbjct: 617 NRSFVWYKATFDAPPGEDPVVVDLMGLGKGEAWVNGHSLGRYWPSYIARGEGCSPECDYR 676
Query: 682 GAYSSNKCLKNCGKPSQSLYHVPRSWLKSSGNTLVLFEEIGGDPTKISFVTKQLGSSLCS 741
G Y + KC NCG PSQ YHVPRS+L S+ N LVLFEE GG+P+ ++F T +G++ C+
Sbjct: 677 GPYKAEKCNTNCGNPSQRWYHVPRSFLASTDNRLVLFEEFGGNPSSVTFQTVTVGNA-CA 735
Query: 742 HVTDSHPLPVDMWGSDSKIQRKPGPVLSLECPNPNQVISSIKFASFGTPLGTCGS----- 796
+ + G L L C + IS IKFASFG P GTCG
Sbjct: 736 NA-------------------REGYTLELSC--QGRAISGIKFASFGDPQGTCGKPFATG 774
Query: 797 ---FSRGRCSSARSLSVVRQACVGSKSCSIGVSVNTFGDP-CKGVMKSLAVEASC 847
F +G C +A SLS++++ CVG SCSI VS G C K LAVEA C
Sbjct: 775 SQVFEKGTCEAADSLSIIQKLCVGKYSCSIDVSEQILGPAGCTADTKRLAVEAIC 829
>gi|357464797|ref|XP_003602680.1| Beta-galactosidase [Medicago truncatula]
gi|355491728|gb|AES72931.1| Beta-galactosidase [Medicago truncatula]
Length = 781
Score = 832 bits (2148), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 416/730 (56%), Positives = 513/730 (70%), Gaps = 8/730 (1%)
Query: 6 ILLLVLCWGFVVLATTSFGANVTYDHRAVVIGGKRRVLISGSIHYPRSTPEMWPDLIQKS 65
IL LV L G+NV+YD R+++I G+R++LIS SIHYPRS P MWP LIQ +
Sbjct: 6 ILCLVSTSLTFTLVYGGVGSNVSYDGRSLIIDGQRKLLISASIHYPRSVPAMWPALIQTA 65
Query: 66 KDGGLDVIETYVFWNLHEPVRNQYNFEGRYDLVKFVKLVAEAGLYAHLRIGPYVCAEWNF 125
K+GG+DVIETYVFWN HE Y F GR+DLV+F K+V +AG+Y LRIGP+V AEWNF
Sbjct: 66 KEGGIDVIETYVFWNGHELSPGNYYFGGRFDLVQFAKVVQDAGMYLILRIGPFVAAEWNF 125
Query: 126 GGFPLWLHFIPGIQFRTDNEPFKAEMQRFTAKIVDMMKQEKLYASQGGPIILSQIENEYG 185
GG P+WLH+IPG FRT N+PF M++FT IV++MK+EKL+ASQGGPIILSQIENEYG
Sbjct: 126 GGVPVWLHYIPGTVFRTYNQPFMHHMEKFTTYIVNLMKKEKLFASQGGPIILSQIENEYG 185
Query: 186 NIDSAYGAAGKSYIKWAAGMALSLDTGVPWVMCQQSDAPDPIINTCNGFYCDQFTPNSNN 245
++ Y GK Y WAA MA+S +T VPW+MCQQ DAPDP+I+TCN FYCDQFTP S
Sbjct: 186 YYENYYKEDGKKYALWAAKMAVSQNTSVPWIMCQQWDAPDPVIDTCNSFYCDQFTPTSPK 245
Query: 246 KPKMWTENWSGWFLSFGGAVPYRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNFDRTSGG 305
+PKMWTENW GWF +FGG P+RPVED+AF+VARFFQ+GG+ NYYMYHGGTNF RT+GG
Sbjct: 246 RPKMWTENWPGWFKTFGGRDPHRPVEDVAFSVARFFQKGGSLNNYYMYHGGTNFGRTAGG 305
Query: 306 PFISTSYDYDAPLDEYGLIRQPKWGHLKDLHKAIKLCEAALVATDPTYPSLGPNLEATVY 365
PFI+TSYDYDAP+DEYGL R PKWGHLK+LHKAIKLCE L+ SLGP++EA +Y
Sbjct: 306 PFITTSYDYDAPIDEYGLPRLPKWGHLKELHKAIKLCEHVLLYGKSVNISLGPSVEADIY 365
Query: 366 KTGSGLCSAFLANIGTNSDVTVKFNGNSYLLPAWSVSILPDCKNVVFNTAKINSVTLVPS 425
SG C+AF++N+ +D V F SY LPAWSVSILPDCKNVVFNTAK++S T + +
Sbjct: 366 TDSSGACAAFISNVDDKNDKKVVFRNASYHLPAWSVSILPDCKNVVFNTAKVSSPTNIVA 425
Query: 426 FSRQSLQVAADSSDAIGSGWSYINEPVGISKDDAFTKPGLLEQINTTADQSDYLWYSLST 485
+ LQ + + W E GI F K G ++ INTT D +DYLW++ S
Sbjct: 426 MIPEHLQQSDKGQKTL--KWDVFKENPGIWGKADFVKNGFVDHINTTKDTTDYLWHTTSI 483
Query: 486 NIKADEPLLEDGSKTVLHVQSLGHALHAFINGKLVGSGYGSSSNAKVTVDFPIALAPGKN 545
I A+E L+ GSK L ++S GH LHAF+N K G+G G+ S++ T PI+L GKN
Sbjct: 484 LIDANEEFLKKGSKPALLIESKGHTLHAFVNQKYQGTGTGNGSHSAFTFKNPISLRAGKN 543
Query: 546 TFDLLSLTVGLQNYGAFYEKTGAGITGPVQLKGSGNGTNIDLSSQQWTYQTGLKGEELNF 605
+LSLTVGLQ G FY+ GAG+T V++ G N T IDLSS W Y+ G+ GE L+
Sbjct: 544 EIAILSLTVGLQTAGPFYDFIGAGVTS-VKIIGLNNRT-IDLSSNAWAYKIGVLGEHLSI 601
Query: 606 PSG---SSTQWDSKSTLPKLQPLVWYKTTFDAPAGSEPVAIDFTGMGKGEAWVNGQSIGR 662
G +S +W S S PK Q L WYK DAP+G EPV +D MGKG AW+NG+ IGR
Sbjct: 602 YQGEGMNSVKWTSTSEPPKGQALTWYKAIVDAPSGDEPVGLDMLYMGKGLAWLNGEEIGR 661
Query: 663 YWPTYVS-QNGGCTDSCNYRGAYSSNKCLKNCGKPSQSLYHVPRSWLKSSGNTLVLFEEI 721
YWP + C C+YRG ++ +KC CG+PSQ YHVPRSW K SGN LV+FEE
Sbjct: 662 YWPRISEFKKEDCVQECDYRGKFNPDKCDTGCGEPSQKWYHVPRSWFKPSGNVLVIFEEK 721
Query: 722 GGDPTKISFV 731
GGDPTKI+FV
Sbjct: 722 GGDPTKITFV 731
>gi|6686886|emb|CAB64743.1| putative beta-galactosidase [Arabidopsis thaliana]
Length = 788
Score = 831 bits (2146), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 418/823 (50%), Positives = 534/823 (64%), Gaps = 48/823 (5%)
Query: 38 GKRRVLISGSIHYPRSTPEMWPDLIQKSKDGGLDVIETYVFWNLHEPVRNQYNFEGRYDL 97
GKRR+L+SGSIHYPRST +MWPDLI K+KDGGLD IETYVFWN HEP R +Y+F G D+
Sbjct: 1 GKRRILLSGSIHYPRSTADMWPDLINKAKDGGLDAIETYVFWNAHEPKRREYDFSGNLDV 60
Query: 98 VKFVKLVAEAGLYAHLRIGPYVCAEWNFGGFPLWLHFIPGIQFRTDNEPFKAEMQRFTAK 157
V+F+K + +AGLY+ LRIGPYVCAEWN+GGFP+WLH +P ++FRT N F EMQ FT K
Sbjct: 61 VRFIKTIQDAGLYSVLRIGPYVCAEWNYGGFPVWLHNMPNMKFRTVNPSFMNEMQNFTTK 120
Query: 158 IVDMMKQEKLYASQGGPIILSQIENEYGNIDSAYGAAGKSYIKWAAGMALSLDTGVPWVM 217
IV MMK+EKL+ASQGGPIIL+QIENEYGN+ S+YGA GK+YI W A MA SLD GVPW+M
Sbjct: 121 IVKMMKEEKLFASQGGPIILAQIENEYGNVISSYGAEGKAYIDWCANMANSLDIGVPWLM 180
Query: 218 CQQSDAPDPIINTCNGFYCDQFTPNSNNKPKMWTENWSGWFLSFGGAVPYRPVEDLAFAV 277
CQQ +AP P++ TCNGFYCDQ+ P + + PKMWTENW+GWF ++GG PYR EDLAF+V
Sbjct: 181 CQQPNAPQPMLETCNGFYCDQYEPTNPSTPKMWTENWTGWFKNWGGKHPYRTAEDLAFSV 240
Query: 278 ARFFQRGGTFQNYYMYHGGTNFDRTSGGPFISTSYDYDAPLDEYGLIRQPKWGHLKDLHK 337
ARFFQ GGTFQNYYMYHGGTNF R +GGP+I+TSYDY APLDE+G + QPKWGHLK LH
Sbjct: 241 ARFFQTGGTFQNYYMYHGGTNFGRVAGGPYITTSYDYHAPLDEFGNLNQPKWGHLKQLHT 300
Query: 338 AIKLCEAALVATDPTYPSLGPNLEATVYKTGSGLCSAFLANIGTNSDVTVKFNGNSYLLP 397
+K E +L + + LG +++AT+Y T G S F+ N+ +D V F G Y +P
Sbjct: 301 VLKSMEKSLTYGNISRIDLGNSIKATIYTTKEG-SSCFIGNVNATADALVNFKGKDYHVP 359
Query: 398 AWSVSILPDCKNVVFNTAKINSVTLVPSFSRQSLQVAADSSDAIGSGWSYINEPVG---I 454
AWSVS+LPDC +NTAK+N+ T + + DSS W++ E +
Sbjct: 360 AWSVSVLPDCDKEAYNTAKVNTQTSI---------MTEDSSKPERLEWTWRPESAQKMIL 410
Query: 455 SKDDAFTKPGLLEQINTTADQSDYLWYSLSTNIKADEPLLEDGSKTVLHVQSLGHALHAF 514
GL++Q + T D SDYLWY ++ +PL L V S H LHA+
Sbjct: 411 KGSGDLIAKGLVDQKDVTNDASDYLWYMTRLHLDKKDPLWS--RNMTLRVHSNAHVLHAY 468
Query: 515 INGKLVGSGYGSSSNAKVTVDFPI-ALAPGKNTFDLLSLTVGLQNYGAFYEKTGAGITGP 573
+NGK VG+ + + + L G N LLS++VGLQNYG F+E GI GP
Sbjct: 469 VNGKYVGNQFVKDGKFDYRFERKVNHLVHGTNHISLLSVSVGLQNYGPFFESGPTGINGP 528
Query: 574 VQLKGSGNGTNI--DLSSQQWTYQTGLKG---EELNFPSGSSTQWDSKSTLPKLQPLVWY 628
V L G I DLS QW Y+ GL G + + S +W + LP + L WY
Sbjct: 529 VSLVGYKGEETIEKDLSQHQWDYKIGLNGYNDKLFSIKSVGHQKW-ANEKLPTGRMLTWY 587
Query: 629 KTTFDAPAGSEPVAIDFTGMGKGEAWVNGQSIGRYWPTYVSQNGGCTDSCNYRGAYSSNK 688
K F AP G EPV +D G+GKGEAW+NGQSIGRYWP++ S + GC D C+YRGAY S+K
Sbjct: 588 KAKFKAPLGKEPVIVDLNGLGKGEAWINGQSIGRYWPSFNSSDDGCKDECDYRGAYGSDK 647
Query: 689 CLKNCGKPSQSLYHVPRSWLKSSG-NTLVLFEEIGGDPTKISFVTKQLGSSLCSHVTDSH 747
C CGKP+Q YHVPRS+L +SG NT+ LFEE+GG+P+ ++F T +G ++C+ + +
Sbjct: 648 CAFMCGKPTQRWYHVPRSFLNASGHNTITLFEEMGGNPSMVNFKTVVVG-TVCARAHEHN 706
Query: 748 PLPVDMWGSDSKIQRKPGPVLSLECPNPNQVISSIKFASFGTPLGTCGSFSRGRCSSAR- 806
+ L C N+ IS++KFASFG PLG CGSF+ G C +
Sbjct: 707 K-------------------VELSC--HNRPISAVKFASFGNPLGHCGSFAVGTCQGDKD 745
Query: 807 SLSVVRQACVGSKSCSIGVSVNTFGDP--CKGVMKSLAVEASC 847
+ V + CVG +C++ VS +TFG C K LAVE C
Sbjct: 746 AAKTVAKECVGKLNCTVNVSSDTFGSTLDCGDSPKKLAVELEC 788
>gi|449485873|ref|XP_004157296.1| PREDICTED: LOW QUALITY PROTEIN: beta-galactosidase 7-like [Cucumis
sativus]
Length = 813
Score = 830 bits (2145), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 431/837 (51%), Positives = 545/837 (65%), Gaps = 48/837 (5%)
Query: 24 GANVTYDHRAVVIGGKRRVLISGSIHYPRSTPEMWPDLIQKSKDGGLDVIETYVFWNLHE 83
G NV+YD A++I G+RRV++SGS+HYPRST MWPDLIQK+KDGGLD IETY+FW+ HE
Sbjct: 9 GDNVSYDSNAIIINGERRVILSGSMHYPRSTEAMWPDLIQKAKDGGLDAIETYIFWDRHE 68
Query: 84 PVRNQYNFEGRYDLVKFVKLVAEAGLYAHLRIGPYVCAEWNFGGFPLWLHFIPGIQFRTD 143
P R +Y+F GR D +KF +LV +AGLY +RIGPYVCAEWN+GGFPLWLH +PGIQFRTD
Sbjct: 69 PQRRKYDFTGRLDFIKFFQLVQDAGLYVVMRIGPYVCAEWNYGGFPLWLHNLPGIQFRTD 128
Query: 144 NEPFKAEMQRFTAKIVDMMKQEKLYASQGGPIILSQIENEYGNIDSAYGAAGKSYIKWAA 203
N+ +K EMQ FT KIV+M KQ L+ASQGGPIIL+QIENEYGN+ + YG AGKSYI W A
Sbjct: 129 NQVYKNEMQTFTTKIVNMCKQANLFASQGGPIILAQIENEYGNVMTPYGNAGKSYINWCA 188
Query: 204 GMALSLDTGVPWVMCQQSDAPDPIINTCNGFYCD-QFTPNSNNKPKMWTENWSGWFLSFG 262
MA SL+ G+PW+MCQQSDAP PIINTCNGFYCD F+PN+ PKM+TENW GWF +G
Sbjct: 189 QMAESLNIGIPWIMCQQSDAPQPIINTCNGFYCDYDFSPNNPKSPKMFTENWVGWFKKWG 248
Query: 263 GAVPYRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNFDRTSGGPFISTSYDYDAPLDEYG 322
PYR ED+AFAVARFFQ GG F NYYMYHGGTNF RT+GGPFI+TSYDY+APLDEYG
Sbjct: 249 DKDPYRSPEDVAFAVARFFQSGGVFNNYYMYHGGTNFGRTAGGPFITTSYDYNAPLDEYG 308
Query: 323 LIRQPKWGHLKDLHKAIKLCEAALVATDPTYPSLGPNLEATVYK---TGSGLCSAFLANI 379
+ QPKWGHLK LH +IK+ E L + + L + T + +G C FL+N
Sbjct: 309 NLNQPKWGHLKQLHASIKMGEKILTNSTRSDQKLXSFVTLTKFSNPTSGERFC--FLSNT 366
Query: 380 GTNSDVTVKFNGN-SYLLPAWSVSILPDCKNVVFNTAKINSVTLVPSFSRQSLQVAADSS 438
+D T+ + Y +PAWSVSIL C VFNTAKINS T ++V
Sbjct: 367 DNKNDATIDLQADGKYFVPAWSVSILDGCNKEVFNTAKINSQT------SMFVKVQNKKE 420
Query: 439 DAIGSGWSYINEPV--GISKDDAFTKPGLLEQINTTADQSDYLWYSLSTNIKADEPLLED 496
+A S W + EP+ + F LLEQ TT D SDYLWY + + A L
Sbjct: 421 NAQFS-WVWAPEPMRDTLQGKGTFKANLLLEQKGTTVDFSDYLWYMTNIDSNATSSL--- 476
Query: 497 GSKTVLHVQSLGHALHAFINGKLVGSGYGSSSNAKVTVDFPIALAPGKNTFDLLSLTVGL 556
L V + GH LHAF+N + +GS + S+ + V PI + PG NT LLS TVGL
Sbjct: 477 -QNVTLQVNTKGHMLHAFVNRRYIGSQWRSNGQSFVFXK-PILIKPGTNTITLLSATVGL 534
Query: 557 QNYGAFYEKTGAGIT-GPVQLKGSGNGTNIDLSSQQWTYQTGLKGE--ELNFPSGSS-TQ 612
+NY AFY+ GI GP+ L G GN IDLSS W+Y+ GL GE +L P S T
Sbjct: 535 KNYDAFYDTVPTGIDGGPIYLIGDGN-VKIDLSSNLWSYKVGLNGEMKQLYNPVFSQRTN 593
Query: 613 WDSKSTLPKLQPLVWYKTTFDAPAGSEPVAIDFTGMGKGEAWVNGQSIGRYWPTYVSQNG 672
W + + + + YKT F P+G +PV +D GMGKG+AWVNGQSIGR+WP++++ N
Sbjct: 594 WSTINQKSIGRRMTLYKTNFKTPSGIDPVTLDMQGMGKGQAWVNGQSIGRFWPSFIAGND 653
Query: 673 GCTDSCNYRGAYSSNKCLKNCGKPSQSLYHVPRSWLKSSGNTLVLFEEIGGDPTKISFVT 732
C+ +C+YRGAY+ +KC++NCG PSQ YH+PRS+L NTLVLFEEIGG+P ++S T
Sbjct: 654 SCSTTCDYRGAYNPSKCVENCGNPSQRWYHIPRSFLSDDTNTLVLFEEIGGNPQQVSVQT 713
Query: 733 KQLGSSLCSHVTDSHPLPVDMWGSDSKIQRKPGPVLSLECPNPNQVISSIKFASFGTPLG 792
+G ++C + + G L L C +IS I+FAS+G P G
Sbjct: 714 ITIG-TICGNANE-------------------GSTLELSCQG-GHIISEIQFASYGNPEG 752
Query: 793 TCGSFSRGRCSSARSLSVVRQACVGSKSCSIGVSVNTFG-DPCKGVMKSLAVEASCT 848
CGSF +G S +V + C+G +SCSI VS +FG + LA++A C+
Sbjct: 753 KCGSFKQGSWHVINSAILVEKLCIGMESCSIDVSAKSFGLGDVTNISARLAIQALCS 809
>gi|449436000|ref|XP_004135782.1| PREDICTED: beta-galactosidase 7-like [Cucumis sativus]
Length = 838
Score = 828 bits (2138), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 433/849 (51%), Positives = 553/849 (65%), Gaps = 52/849 (6%)
Query: 16 VVLATTSF--GANVTYDHRAVVIGGKRRVLISGSIHYPRSTPEMWPDLIQKSKDGGLDVI 73
V LA F G NV+YD A++I G+RRV++SGS+HYPRST MWPDLIQK+KDGGLD I
Sbjct: 24 VTLACFYFCKGDNVSYDSNAIIINGERRVILSGSMHYPRSTEAMWPDLIQKAKDGGLDAI 83
Query: 74 ETYVFWNLHEPVRNQYNFEGRYDLVKFVKLVAEAGLYAHLRIGPYVCAEWNFGGFPLWLH 133
ETY+FW+ HEP R +Y+F GR D +KF +LV +AGLY +RIGPYVCAEWN+GGFPLWLH
Sbjct: 84 ETYIFWDRHEPQRRKYDFTGRLDFIKFFQLVQDAGLYVVMRIGPYVCAEWNYGGFPLWLH 143
Query: 134 FIPGIQFRTDNEPFKAEMQRFTAKIVDMMKQEKLYASQGGPIILSQIENEYGNIDSAYGA 193
+PGIQFRTDN+ +K EMQ FT KIV+M KQ L+ASQGGPIIL+QIENEYGN+ + YG
Sbjct: 144 NLPGIQFRTDNQVYKNEMQTFTTKIVNMCKQANLFASQGGPIILAQIENEYGNVMTPYGN 203
Query: 194 AGKSYIKWAAGMALSLDTGVPWVMCQQSDAPDPIINTCNGFYCD-QFTPNSNNKPKMWTE 252
AGKSYI W A MA SL+ G+PW+MCQQ+DAP PIINTCNGFYCD F+PN+ PKM+TE
Sbjct: 204 AGKSYINWCAQMAESLNIGIPWIMCQQNDAPQPIINTCNGFYCDYDFSPNNPKSPKMFTE 263
Query: 253 NWSGWFLSFGGAVPYRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNFDRTSGGPFISTSY 312
NW GWF +G PYR ED+AFAVARFFQ GG F NYYMYHGGTNF RT+GGPFI+TSY
Sbjct: 264 NWVGWFKKWGDKDPYRSPEDVAFAVARFFQSGGVFNNYYMYHGGTNFGRTAGGPFITTSY 323
Query: 313 DYDAPLDEYGLIRQPKWGHLKDLHKAIKLCEAALVATDPTYPSLGPNLEATVYK---TGS 369
DY+APLDEYG + QPKWGHLK LH +IK+ E L + + + + T + +G
Sbjct: 324 DYNAPLDEYGNLNQPKWGHLKQLHASIKMGEKILTNSTRSDQKISSFVTLTKFSNPTSGE 383
Query: 370 GLCSAFLANIGTNSDVTVKF--NGNSYL-LPAWSVSILPDCKNVVFNTAKINSVTLVPSF 426
C FL+N +D T+ +G ++ +PAWSVSIL C VFNTAKINS T
Sbjct: 384 RFC--FLSNTDNKNDATIDLQADGKYFVPVPAWSVSILDGCNKEVFNTAKINSQT----- 436
Query: 427 SRQSLQVAADSSDAIGSGWSYINEPV--GISKDDAFTKPGLLEQINTTADQSDYLWYSLS 484
++V +A S W + EP+ + F LLEQ TT D SDYLWY +
Sbjct: 437 -SMFVKVQNKKENAQFS-WVWAPEPMRDTLQGKGTFKANLLLEQKGTTVDFSDYLWYMTN 494
Query: 485 TNIKADEPLLEDGSKTVLHVQSLGHALHAFINGKLVGSGYGSSSNAKVTVDFPIALAPGK 544
+ A L L V + GH LHAF+N + +GS + S+ + V + PI + PG
Sbjct: 495 IDSNATSSL----QNVTLQVNTKGHMLHAFVNRRYIGSQWRSNGQSFV-FEKPILIKPGT 549
Query: 545 NTFDLLSLTVGLQNYGAFYEKTGAGIT-GPVQLKGSGNGTNIDLSSQQWTYQTGLKGE-- 601
NT LLS TVGL+NY AFY+ GI GP+ L G GN IDLSS W+Y+ GL GE
Sbjct: 550 NTITLLSATVGLKNYDAFYDTVPTGIDGGPIYLIGDGN-VKIDLSSNLWSYKVGLNGEMK 608
Query: 602 ELNFPSGSS-TQWDSKSTLPKLQPLVWYKTTFDAPAGSEPVAIDFTGMGKGEAWVNGQSI 660
+L P S T W + + + + WYKT+F P+G + V +D GMGKG+AWVNGQSI
Sbjct: 609 QLYNPVFSQRTNWSTINQKSIGRRMTWYKTSFKTPSGIDRVTLDMQGMGKGQAWVNGQSI 668
Query: 661 GRYWPTYVSQNGGCTDSCNYRGAYSSNKCLKNCGKPSQSLYHVPRSWLKSSGNTLVLFEE 720
GR+WP++++ N C+ +C+YRGAY+ +KC++NCG PSQ YH+PRS+L NTLVLFEE
Sbjct: 669 GRFWPSFIASNDSCSTTCDYRGAYNPSKCVENCGNPSQRWYHIPRSFLSDDTNTLVLFEE 728
Query: 721 IGGDPTKISFVTKQLGSSLCSHVTDSHPLPVDMWGSDSKIQRKPGPVLSLECPNPNQVIS 780
IGG+P ++S T +G ++C + + G L L C +IS
Sbjct: 729 IGGNPQQVSVQTITIG-TICGNANE-------------------GSTLELSCQG-GHIIS 767
Query: 781 SIKFASFGTPLGTCGSFSRGRCSSARSLSVVRQACVGSKSCSIGVSVNTFG-DPCKGVMK 839
I+FAS+G P G CGSF +G S +V + C+G +SCSI VS +FG +
Sbjct: 768 EIQFASYGNPEGKCGSFKQGSWHVINSAILVEKLCIGRESCSIDVSAKSFGLGDVTNLSA 827
Query: 840 SLAVEASCT 848
LA++A C+
Sbjct: 828 RLAIQALCS 836
>gi|15242897|ref|NP_201186.1| beta-galactosidase 10 [Arabidopsis thaliana]
gi|75171772|sp|Q9FN08.1|BGL10_ARATH RecName: Full=Beta-galactosidase 10; Short=Lactase 10; Flags:
Precursor
gi|10177669|dbj|BAB11029.1| beta-galactosidase [Arabidopsis thaliana]
gi|20260438|gb|AAM13117.1| unknown protein [Arabidopsis thaliana]
gi|34098797|gb|AAQ56781.1| At5g63810 [Arabidopsis thaliana]
gi|332010417|gb|AED97800.1| beta-galactosidase 10 [Arabidopsis thaliana]
Length = 741
Score = 825 bits (2131), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 394/741 (53%), Positives = 517/741 (69%), Gaps = 17/741 (2%)
Query: 1 MASKEILLLVLCWGFVVLATTSFGANVTYDHRAVVIGGKRRVLISGSIHYPRSTPEMWPD 60
+AS IL++++ F+ + ANV+YDHR++ IG +R+++IS +IHYPRS P MWP
Sbjct: 9 IASTAILVVMV---FLFSWRSIEAANVSYDHRSLTIGNRRQLIISAAIHYPRSVPAMWPS 65
Query: 61 LIQKSKDGGLDVIETYVFWNLHEPVRNQYNFEGRYDLVKFVKLVAEAGLYAHLRIGPYVC 120
L+Q +K+GG + IE+YVFWN HEP +Y F GRY++VKF+K+V +AG++ LRIGP+V
Sbjct: 66 LVQTAKEGGCNAIESYVFWNGHEPSPGKYYFGGRYNIVKFIKIVQQAGMHMILRIGPFVA 125
Query: 121 AEWNFGGFPLWLHFIPGIQFRTDNEPFKAEMQRFTAKIVDMMKQEKLYASQGGPIILSQI 180
AEWN+GG P+WLH++PG FR DNEP+K M+ FT IV+++KQEKL+A QGGPIILSQ+
Sbjct: 126 AEWNYGGVPVWLHYVPGTVFRADNEPWKHYMESFTTYIVNLLKQEKLFAPQGGPIILSQV 185
Query: 181 ENEYGNIDSAYGAAGKSYIKWAAGMALSLDTGVPWVMCQQSDAPDPIINTCNGFYCDQFT 240
ENEYG + YG GK Y +W+A MA+S + GVPW+MCQQ DAP +I+TCNGFYCDQFT
Sbjct: 186 ENEYGYYEKDYGEGGKRYAQWSASMAVSQNIGVPWMMCQQWDAPPTVISTCNGFYCDQFT 245
Query: 241 PNSNNKPKMWTENWSGWFLSFGGAVPYRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNFD 300
PN+ +KPK+WTENW GWF +FGG P+RP ED+A++VARFF +GG+ NYYMYHGGTNF
Sbjct: 246 PNTPDKPKIWTENWPGWFKTFGGRDPHRPAEDVAYSVARFFGKGGSVHNYYMYHGGTNFG 305
Query: 301 RTSGGPFISTSYDYDAPLDEYGLIRQPKWGHLKDLHKAIKLCEAALVATDPTYPSLGPNL 360
RTSGGPFI+TSYDY+AP+DEYGL R PKWGHLKDLHKAI L E L++ + +LG +L
Sbjct: 306 RTSGGPFITTSYDYEAPIDEYGLPRLPKWGHLKDLHKAIMLSENLLISGEHQNFTLGHSL 365
Query: 361 EATVYKTGSGLCSAFLANIGTNSDVTVKFNGNSYLLPAWSVSILPDCKNVVFNTAKINSV 420
EA VY SG C+AFL+N+ +D V F SY LPAWSVSILPDCK VFNTAK+ S
Sbjct: 366 EADVYTDSSGTCAAFLSNLDDKNDKAVMFRNTSYHLPAWSVSILPDCKTEVFNTAKVTSK 425
Query: 421 TLVPSFSRQSLQVAADSSDAIGSGWSYINEPVGISKDDAFTKPGLLEQINTTADQSDYLW 480
S + + D + G W +E GI F K L++ INTT D +DYLW
Sbjct: 426 ------SSKVEMLPEDLKSSSGLKWEVFSEKPGIWGAADFVKNELVDHINTTKDTTDYLW 479
Query: 481 YSLSTNIKADEPLLEDGSKTVLHVQSLGHALHAFINGKLVGSGYGSSSNAKVTVDFPIAL 540
Y+ S + +E L+ GS VL ++S GH LH FIN + +G+ G+ ++ + P+AL
Sbjct: 480 YTTSITVSENEAFLKKGSSPVLFIESKGHTLHVFINKEYLGTATGNGTHVPFKLKKPVAL 539
Query: 541 APGKNTFDLLSLTVGLQNYGAFYEKTGAGITGPVQLKGSGNGTNIDLSSQQWTYQTGLKG 600
G+N DLLS+TVGL N G+FYE GAG+T V +KG GT ++L++ +W+Y+ G++G
Sbjct: 540 KAGENNIDLLSMTVGLANAGSFYEWVGAGLTS-VSIKGFNKGT-LNLTNSKWSYKLGVEG 597
Query: 601 EELN-FPSGSS--TQWDSKSTLPKLQPLVWYKTTFDAPAGSEPVAIDFTGMGKGEAWVNG 657
E L F G+S +W + PK QPL WYK + P+GSEPV +D MGKG AW+NG
Sbjct: 598 EHLELFKPGNSGAVKWTVTTKPPKKQPLTWYKVVIEPPSGSEPVGLDMISMGKGMAWLNG 657
Query: 658 QSIGRYWPTYV---SQNGGCTDSCNYRGAYSSNKCLKNCGKPSQSLYHVPRSWLKSSGNT 714
+ IGRYWP S N C C+YRG + +KCL CG+PSQ YHVPRSW KSSGN
Sbjct: 658 EEIGRYWPRIARKNSPNDECVKECDYRGKFMPDKCLTGCGEPSQRWYHVPRSWFKSSGNE 717
Query: 715 LVLFEEIGGDPTKISFVTKQL 735
LV+FEE GG+P KI +++
Sbjct: 718 LVIFEEKGGNPMKIKLSKRKV 738
>gi|224068510|ref|XP_002326135.1| predicted protein [Populus trichocarpa]
gi|222833328|gb|EEE71805.1| predicted protein [Populus trichocarpa]
Length = 824
Score = 825 bits (2131), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 436/836 (52%), Positives = 541/836 (64%), Gaps = 55/836 (6%)
Query: 27 VTYDHRAVVIGGKRRVLISGSIHYPRSTPEMWPDLIQKSKDGGLDVIETYVFWNLHEPVR 86
V YD AV+I G+R++++SGSIHYPRST EMW DLIQK+K+GGLD IETY+FWN HE R
Sbjct: 30 VEYDSSAVIINGQRKIILSGSIHYPRSTVEMWSDLIQKAKEGGLDTIETYIFWNAHERRR 89
Query: 87 NQYNFEGRYDLVKFVKLVAEAGLYAHLRIGPYVCAEWNFGGFPLWLHFIPGIQFRTDNEP 146
+YNF G D VKF + V EAGLY LRIGPY CAEWN+GGFP+WLH IP I+FRTDNE
Sbjct: 90 REYNFTGNLDFVKFFQKVQEAGLYGILRIGPYACAEWNYGGFPVWLHNIPEIKFRTDNEI 149
Query: 147 FKAEMQRFTAKIVDMMKQEKLYASQGGPIILSQIENEYGNIDSAYGAAGKSYIKWAAGMA 206
FK EMQ FT KIV+M K+ KL+ASQGGPIIL+QIENEYGN+ YG AGKSY++W A MA
Sbjct: 150 FKNEMQTFTTKIVNMAKEAKLFASQGGPIILAQIENEYGNVMGPYGEAGKSYVQWCAQMA 209
Query: 207 LSLDTGVPWVMCQQSDAPDPIINTCNGFYCDQFTPNSNNKPKMWTENWSGWFLSFGGAVP 266
++ + GVPW+MCQQSDAP +INTCNGFYCD FTPNS PKMWTENW+GW+ +G P
Sbjct: 210 VAQNIGVPWIMCQQSDAPSSVINTCNGFYCDTFTPNSPKSPKMWTENWTGWYKKWGQKDP 269
Query: 267 YRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNFDRTSGGPFISTSYDYDAPLDEYGLIRQ 326
+R EDLAF+VARFFQ G QNYYMY+GGTNF RTSGGPFI+TSYDYDAPLDEYG + Q
Sbjct: 270 HRTAEDLAFSVARFFQYNGVLQNYYMYYGGTNFGRTSGGPFIATSYDYDAPLDEYGNLNQ 329
Query: 327 PKWGHLKDLHKAIKLCEAALV-ATDPTYPSLGPNLEATVYKT---GSGLCSAFLANIGTN 382
PKWGHLK+LH A+KL E L +T T +E T Y + G LC FL+N +
Sbjct: 330 PKWGHLKNLHAALKLGEKILTNSTVKTTKYSDGWVELTTYTSNIDGERLC--FLSNTKMD 387
Query: 383 S-DVTVKFNGNSYLLPAWSVSILPDCKNVVFNTAKINSVTLVPSFSRQSLQVAADSSDAI 441
DV ++ +G Y +PAWSVSIL DC +NTAK+N T + ++ ++ +
Sbjct: 388 GLDVDLQQDG-KYFVPAWSVSILQDCNKETYNTAKVNVQTSL------IVKKLHENDTPL 440
Query: 442 GSGWSYINEPVG--ISKDDAFTKPGLLEQINTTADQSDYLWYSLSTNIKADEPLLEDGSK 499
W + EP + F LLEQ T D+SDYLWY S + SK
Sbjct: 441 KLSWEWAPEPTKAPLHGQGGFKATQLLEQKAATYDESDYLWYMTSVDNNG------TASK 494
Query: 500 TV-LHVQSLGHALHAFINGKLVGSGYGSSSNAKVTVDFPIALAPGKNTFDLLSLTVGLQN 558
V L V+ G LHAF+NGK +GS +G + T + P L PG N LLS TVGLQN
Sbjct: 495 NVTLRVKYSGQFLHAFVNGKEIGSQHGYT----FTFEKPALLKPGTNIISLLSATVGLQN 550
Query: 559 YGAFYEKTGAGIT-GPVQLKGSGNGTNIDLSSQQWTYQTGLKGEELNF--PSGSSTQWDS 615
YG F+++ GI GPV+L SGN T DLSS +W+Y+ GL GE F P+ +W S
Sbjct: 551 YGEFFDEGPEGIAGGPVELIDSGN-TTTDLSSNEWSYKVGLNGEGGRFYDPTSGRAKWVS 609
Query: 616 KSTLPKLQPLVWYKTTFDAPAGSEPVAIDFTGMGKGEAWVNGQSIGRYWPTYVSQNGGCT 675
L + + WYKTTF AP+G+EPV +D GMGKG AWVNG S+GR+WP + GC
Sbjct: 610 -GNLRVGRAMTWYKTTFQAPSGTEPVVVDLQGMGKGHAWVNGNSLGRFWPILTADPNGCD 668
Query: 676 DSCNYRGAYSSNKCLKNCGKPSQSLYHVPRSWLKSSGNTLVLFEEIGGDPTKISFVTKQL 735
C+YRG Y KCL NCG P+Q YHVPRS+L + NTL+LFEEIGG+P+ +SF
Sbjct: 669 GKCDYRGQYKEGKCLSNCGNPTQRWYHVPRSFLNNGSNTLILFEEIGGNPSDVSFQITAT 728
Query: 736 GSSLCSHVTDSHPLPVDMWGSDSKIQRKPGPVLSLECPNPNQVISSIKFASFGTPLG-TC 794
++C + + G L L C ++IS I++ASFG P G +C
Sbjct: 729 -ETICGNTYE-------------------GTTLELSCNGGRRIISDIQYASFGDPQGSSC 768
Query: 795 GSFSRGRCSSARSLSVVRQACVGSKSCSIGVSVNTFG-DPCKGVMKS-LAVEASCT 848
GSF RG ++RS S V +AC+G +SCSI VS TFG + GV + L V+A CT
Sbjct: 769 GSFQRGSVEASRSFSAVEKACMGKESCSINVSKATFGVEDSFGVDNNRLVVQAVCT 824
>gi|356558952|ref|XP_003547766.1| PREDICTED: beta-galactosidase 7-like [Glycine max]
Length = 826
Score = 824 bits (2129), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 425/844 (50%), Positives = 538/844 (63%), Gaps = 44/844 (5%)
Query: 15 FVVLATTSFGANVTYDHRAVVIGGKRRVLISGSIHYPRSTPEMWPDLIQKSKDGGLDVIE 74
F+ + F VTYD R+++I G+RRV+ SG++HYPRST +MWPD+IQK+KDGGLD IE
Sbjct: 16 FLAFTASCFATEVTYDARSLIINGERRVIFSGAVHYPRSTVQMWPDIIQKAKDGGLDAIE 75
Query: 75 TYVFWNLHEPVRNQYNFEGRYDLVKFVKLVAEAGLYAHLRIGPYVCAEWNFGGFPLWLHF 134
+YVFW+ HEPVR +Y+F G D +KF +++ EAGLYA LRIGPYVCAEWNFGGFPLWLH
Sbjct: 76 SYVFWDRHEPVRREYDFSGNLDFIKFFQIIQEAGLYAILRIGPYVCAEWNFGGFPLWLHN 135
Query: 135 IPGIQFRTDNEPFKAEMQRFTAKIVDMMKQEKLYASQGGPIILSQIENEYGNIDSAYGAA 194
+PGI+ RTDN +K EMQ FT KIV+M K+ KL+ASQGGPIIL+QIENEYGNI + YG A
Sbjct: 136 MPGIELRTDNPIYKNEMQIFTTKIVNMAKEAKLFASQGGPIILAQIENEYGNIMTDYGEA 195
Query: 195 GKSYIKWAAGMALSLDTGVPWVMCQQSDAPDPIINTCNGFYCDQFTPNSNNKPKMWTENW 254
GK+YIKW A MAL+ + GVPW+MCQQ DAP P+INTCNG YCD F PN+ PKM+TENW
Sbjct: 196 GKTYIKWCAQMALAQNIGVPWIMCQQHDAPQPMINTCNGHYCDSFQPNNPKSPKMFTENW 255
Query: 255 SGWFLSFGGAVPYRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNFDRTSGGPFISTSYDY 314
GWF +G VP+R ED AF+VARFFQ GG NYYMYHGGTNF RT+GGP+++TSY+Y
Sbjct: 256 IGWFQKWGERVPHRSAEDSAFSVARFFQNGGILNNYYMYHGGTNFGRTAGGPYMTTSYEY 315
Query: 315 DAPLDEYGLIRQPKWGHLKDLHKAIKLCEAALVATDPTYPSLGPNLEATVYKTGSGLCSA 374
DAPLDEYG + QPKWGHLK LH AIKL E + T G + T Y +G
Sbjct: 316 DAPLDEYGNLNQPKWGHLKQLHAAIKLGEKIITNGTRTDKDFGNEVTLTTYTHTNGERFC 375
Query: 375 FLANIGTNSDVTVKFNGN-SYLLPAWSVSILPDCKNVVFNTAKINSVTLVPSFSRQSLQV 433
FL+N + D V + +Y LPAWSV+IL C VFNTAK+NS T + ++
Sbjct: 376 FLSNTNDSKDANVDLQQDGNYFLPAWSVTILDGCNKEVFNTAKVNSQTSI------MVKK 429
Query: 434 AADSSDAIGSGWSYINEPVGISKDDAFTKPGLLEQINTTADQSDYLWYSLSTNIKADEPL 493
+ D+S+ + W + + F LLEQ T D SDYLWY S +I D +
Sbjct: 430 SDDASNKLTWAWIPEKKKDTMHGKGNFKVNQLLEQKELTFDVSDYLWYMTSVDIN-DTSI 488
Query: 494 LEDGSKTVLHVQSLGHALHAFINGKLVGSGYGSSSNAKVTVDFPIALAPGKNTFDLLSLT 553
S L V + GH L A++NG+ VG + S T + ++L G N LLS T
Sbjct: 489 W---SNATLRVNTRGHTLRAYVNGRHVGYKF-SQWGGNFTYEKYVSLKKGLNVITLLSAT 544
Query: 554 VGLQNYGAFYEKTGAGIT-GPVQLKGSGNGTNIDLSSQQWTYQTGLKGEELNFPSGS--- 609
VGL NYGA ++K GI GPVQL G+ N T IDLS+ W+Y+ GL GE+
Sbjct: 545 VGLPNYGAKFDKIKTGIAGGPVQLIGNNNET-IDLSTNLWSYKIGLNGEKKRLYDPQPRI 603
Query: 610 STQWDSKSTLPKLQPLVWYKTTFDAPAGSEPVAIDFTGMGKGEAWVNGQSIGRYWPTYVS 669
W + S P + L WYK F AP+G++PV +D G+GKGEAWVNGQSIGRYW ++++
Sbjct: 604 GVSWRTNSPYPIGRSLTWYKADFVAPSGNDPVVVDLLGLGKGEAWVNGQSIGRYWTSWIT 663
Query: 670 QNGGCTDSCNYRGAY-SSNKCLKNCGKPSQSLYHVPRSWLKSSGNTLVLFEEIGGDPTKI 728
GC+D+C+YRG Y + KC NCG PSQ YHVPRS+LK+ NTLVLFEEIGG+P +
Sbjct: 664 ATNGCSDTCDYRGKYVPAQKCNTNCGNPSQRWYHVPRSFLKNDKNTLVLFEEIGGNPQNV 723
Query: 729 SFVTKQLGSSLCSHVTDSHPLPVDMWGSDSKIQRKPGPVLSLECPNPNQVISSIKFASFG 788
SF T G ++C+ V + G +L L C + IS I+F+SFG
Sbjct: 724 SFQTVITG-TICAQVQE-------------------GALLELSCQG-GKTISQIQFSSFG 762
Query: 789 TPLGTCGSFSRGRCSSARSLSVVRQACVGSKSCSIGVSVNTFGDPCKGV-----MKSLAV 843
P G CGSF +G + SVV ACVG SC V+ FG + + LAV
Sbjct: 763 NPTGNCGSFKKGTWEATDGQSVVEAACVGRNSCGFMVTKEAFGVAIGPMNVDERVARLAV 822
Query: 844 EASC 847
+A+C
Sbjct: 823 QATC 826
>gi|267026|sp|Q00662.1|BGAL_DIACA RecName: Full=Putative beta-galactosidase; Short=Lactase; AltName:
Full=SR12 protein; Flags: Precursor
gi|18328|emb|CAA40459.1| CARSR12 [Dianthus caryophyllus]
Length = 731
Score = 823 bits (2127), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 403/735 (54%), Positives = 510/735 (69%), Gaps = 20/735 (2%)
Query: 7 LLLVLCWGFVVLATTSFGANVTYDHRAVVIGGKRRVLISGSIHYPRSTPEMWPDLIQKSK 66
++LV + + L + +G NV YD+RA+ I +RR+L+SGSIHYPRSTPEMWPD+I+K+K
Sbjct: 12 MMLVYVFVLITLISCVYG-NVWYDYRAIKINDQRRILLSGSIHYPRSTPEMWPDIIEKAK 70
Query: 67 DGGLDVIETYVFWNLHEPVRNQYNFEGRYDLVKFVKLVAEAGLYAHLRIGPYVCAEWNFG 126
D LDVI+TYVFWN HEP +Y FEGRYDLVKF+KL+ +AGL+ HLRIGP+ CAEWNFG
Sbjct: 71 DSQLDVIQTYVFWNGHEPSEGKYYFEGRYDLVKFIKLIHQAGLFVHLRIGPFACAEWNFG 130
Query: 127 GFPLWLHFIPGIQFRTDNEPFKAEMQRFTAKIVDMMKQEKLYASQGGPIILSQIENEYGN 186
GFP+WL ++PGI+FRTDN PFK +MQ FT KIVDMMK EKL+ QGGPIIL+QIENEYG
Sbjct: 131 GFPVWLKYVPGIEFRTDNGPFKEKMQVFTTKIVDMMKAEKLFHWQGGPIILNQIENEYGP 190
Query: 187 IDSAYGAAGKSYIKWAAGMALSLDTGVPWVMCQQ-SDAPDPIINTCNGFYCDQFTPNSNN 245
++ GA GK+Y WAA MA SL+ GVPW+MC+Q SD PD +I+TCNGFYC+ F P +
Sbjct: 191 VEWEIGAPGKAYTHWAAQMAQSLNAGVPWIMCKQDSDVPDNVIDTCNGFYCEGFVPKDKS 250
Query: 246 KPKMWTENWSGWFLSFGGAVPYRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNFDRTSGG 305
KPKMWTENW+GW+ +G VPYRP ED+AF+VARF Q GG+F NYYM+HGGTNF+ T+ G
Sbjct: 251 KPKMWTENWTGWYTEYGKPVPYRPAEDVAFSVARFIQNGGSFMNYYMFHGGTNFE-TTAG 309
Query: 306 PFISTSYDYDAPLDEYGLIRQPKWGHLKDLHKAIKLCEAALVATDPTYPSLGPNLEATVY 365
F+STSYDYDAPLDEYGL R+PK+ HLK+LHKAIK+CE ALV++D +LG N EA VY
Sbjct: 310 RFVSTSYDYDAPLDEYGLPREPKYTHLKNLHKAIKMCEPALVSSDAKVTNLGSNQEAHVY 369
Query: 366 KTGSGLCSAFLANIGTNSDVTVKFNGNSYLLPAWSVSILPDCKNVVFNTAKINSVTLVPS 425
+ SG C+AFLAN V V F+G + LPAWS+SILPDCK V+NTA++N + P
Sbjct: 370 SSNSGSCAAFLANYDPKWSVKVTFSGMEFELPAWSISILPDCKKEVYNTARVNEPS--PK 427
Query: 426 FSRQSLQVAADSSDAIGSGW-SYINEPVGISKDDAFTKPGLLEQINTTADQSDYLWYSLS 484
+ V ++ + W SY +E F + L EQIN T D+SDYLWY
Sbjct: 428 LHSKMTPVISNLN------WQSYSDEVPTADSPGTFREKKLYEQINMTWDKSDYLWYMTD 481
Query: 485 TNIKADEPLLEDGSKTVLHVQSLGHALHAFINGKLVGSGYGSSSNAKVTVDFPIALAPGK 544
+ +E L+ G + L V S GH LH F+NG+L G YGS + ++T + + G
Sbjct: 482 VVLDGNEGFLKKGDEPWLTVNSAGHVLHVFVNGQLQGHAYGSLAKPQLTFSQKVKMTAGV 541
Query: 545 NTFDLLSLTVGLQNYGAFYEKTGAGITGPVQLKGSGNGTNIDLSSQQWTYQTGLKGEE-- 602
N LLS VGL N G +E+ G+ GPV L G GT DL+ Q W+Y+ G KGEE
Sbjct: 542 NRISLLSAVVGLANVGWHFERYNQGVLGPVTLSGLNEGTR-DLTWQYWSYKIGTKGEEQQ 600
Query: 603 -LNFPSGSSTQWDSKSTLPKLQPLVWYKTTFDAPAGSEPVAIDFTGMGKGEAWVNGQSIG 661
N S QW + QPLVWYKTTFDAP G++P+A+D MGKG+AW+NGQSIG
Sbjct: 601 VYNSGGSSHVQWGPPAW---KQPLVWYKTTFDAPGGNDPLALDLGSMGKGQAWINGQSIG 657
Query: 662 RYWPTYVSQNGGCTDSCNYRGAYSSNKCLKNCGKPSQSLYHVPRSWLKSSGNTLVLFEEI 721
R+W +++ G C D+CNY G Y+ KCL +CGK SQ YHVPRSWL+ GN LV+FEE
Sbjct: 658 RHWSNNIAK-GSCNDNCNYAGTYTETKCLSDCGKSSQKWYHVPRSWLQPRGNLLVVFEEW 716
Query: 722 GGDPTKISFVTKQLG 736
GGD +S V + +
Sbjct: 717 GGDTKWVSLVKRTIA 731
>gi|6686892|emb|CAB64746.1| putative beta-galactosidase [Arabidopsis thaliana]
Length = 741
Score = 823 bits (2126), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 393/741 (53%), Positives = 516/741 (69%), Gaps = 17/741 (2%)
Query: 1 MASKEILLLVLCWGFVVLATTSFGANVTYDHRAVVIGGKRRVLISGSIHYPRSTPEMWPD 60
+AS IL++++ F+ + ANV+YDHR++ IG +R+++IS +IHYPRS P MWP
Sbjct: 9 IASTAILVVMV---FLFSWRSIEAANVSYDHRSLTIGNRRQLIISAAIHYPRSVPAMWPS 65
Query: 61 LIQKSKDGGLDVIETYVFWNLHEPVRNQYNFEGRYDLVKFVKLVAEAGLYAHLRIGPYVC 120
L+Q +K+GG + IE+YVFWN HEP +Y F GRY++VKF+K+V +AG++ LRIGP+V
Sbjct: 66 LVQTAKEGGCNAIESYVFWNGHEPSPGKYYFGGRYNIVKFIKIVQQAGMHMILRIGPFVA 125
Query: 121 AEWNFGGFPLWLHFIPGIQFRTDNEPFKAEMQRFTAKIVDMMKQEKLYASQGGPIILSQI 180
AEWN+GG P+WLH++PG FR DNEP+K M+ FT IV+++KQEKL+A QGGPIILSQ+
Sbjct: 126 AEWNYGGVPVWLHYVPGTVFRADNEPWKHYMESFTTYIVNLLKQEKLFAPQGGPIILSQV 185
Query: 181 ENEYGNIDSAYGAAGKSYIKWAAGMALSLDTGVPWVMCQQSDAPDPIINTCNGFYCDQFT 240
ENEYG + YG GK Y +W+A MA+S + GVPW+MCQQ DAP +I+TCNGFYCDQFT
Sbjct: 186 ENEYGYYEKDYGEGGKRYAQWSASMAVSQNIGVPWMMCQQWDAPPTVISTCNGFYCDQFT 245
Query: 241 PNSNNKPKMWTENWSGWFLSFGGAVPYRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNFD 300
PN+ +KPK+WTENW GWF +FGG P+RP ED+A++VARFF +GG+ NYYMYHGGTNF
Sbjct: 246 PNTPDKPKIWTENWPGWFKTFGGRDPHRPAEDVAYSVARFFGKGGSVHNYYMYHGGTNFG 305
Query: 301 RTSGGPFISTSYDYDAPLDEYGLIRQPKWGHLKDLHKAIKLCEAALVATDPTYPSLGPNL 360
RTSGGPFI+TSYDY+AP+DEYGL R PKWGHLKDLHKAI L E L++ + +LG +L
Sbjct: 306 RTSGGPFITTSYDYEAPIDEYGLPRLPKWGHLKDLHKAIMLSENLLISGEHQNFTLGHSL 365
Query: 361 EATVYKTGSGLCSAFLANIGTNSDVTVKFNGNSYLLPAWSVSILPDCKNVVFNTAKINSV 420
EA VY SG C+AFL+N+ +D V F SY LPAWSVSILPDCK VFNTAK+ S
Sbjct: 366 EADVYTDSSGTCAAFLSNLDDKNDKAVMFRNTSYHLPAWSVSILPDCKTEVFNTAKVTSK 425
Query: 421 TLVPSFSRQSLQVAADSSDAIGSGWSYINEPVGISKDDAFTKPGLLEQINTTADQSDYLW 480
S + + D + G W +E GI F K L++ INTT D +DYLW
Sbjct: 426 ------SSKVEMLPEDLKSSSGLKWEVFSEKPGIWGAADFVKNELVDHINTTKDTTDYLW 479
Query: 481 YSLSTNIKADEPLLEDGSKTVLHVQSLGHALHAFINGKLVGSGYGSSSNAKVTVDFPIAL 540
Y+ S + +E L+ GS VL ++S GH LH FIN + +G+ G+ ++ + P+AL
Sbjct: 480 YTTSITVSENEAFLKKGSSPVLFIESKGHTLHVFINKEYLGTATGNGTHVPFKLKKPVAL 539
Query: 541 APGKNTFDLLSLTVGLQNYGAFYEKTGAGITGPVQLKGSGNGTNIDLSSQQWTYQTGLKG 600
G+ DLLS+TVGL N G+FYE GAG+T V +KG GT ++L++ +W+Y+ G++G
Sbjct: 540 KAGETNIDLLSMTVGLANAGSFYEWVGAGLTS-VSIKGFNKGT-LNLTNSKWSYKLGVEG 597
Query: 601 EELN-FPSGSS--TQWDSKSTLPKLQPLVWYKTTFDAPAGSEPVAIDFTGMGKGEAWVNG 657
E L F G+S +W + PK QPL WYK + P+GSEPV +D MGKG AW+NG
Sbjct: 598 EHLELFKPGNSGAVKWTVTTKPPKKQPLTWYKVVIEPPSGSEPVGLDMISMGKGMAWLNG 657
Query: 658 QSIGRYWPTYV---SQNGGCTDSCNYRGAYSSNKCLKNCGKPSQSLYHVPRSWLKSSGNT 714
+ IGRYWP S N C C+YRG + +KCL CG+PSQ YHVPRSW KSSGN
Sbjct: 658 EEIGRYWPRIARKNSPNDECVKECDYRGKFMPDKCLTGCGEPSQRWYHVPRSWFKSSGNE 717
Query: 715 LVLFEEIGGDPTKISFVTKQL 735
LV+FEE GG+P KI +++
Sbjct: 718 LVIFEEKGGNPMKIKLSKRKV 738
>gi|297793967|ref|XP_002864868.1| beta-galactosidase 10 [Arabidopsis lyrata subsp. lyrata]
gi|297310703|gb|EFH41127.1| beta-galactosidase 10 [Arabidopsis lyrata subsp. lyrata]
Length = 740
Score = 823 bits (2126), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 390/718 (54%), Positives = 507/718 (70%), Gaps = 14/718 (1%)
Query: 24 GANVTYDHRAVVIGGKRRVLISGSIHYPRSTPEMWPDLIQKSKDGGLDVIETYVFWNLHE 83
ANV+YDHR++ IG +R+++IS +IHYPRS P MWP L+Q +K+GG + IE+YVFWN HE
Sbjct: 28 AANVSYDHRSLSIGNRRQLIISAAIHYPRSVPAMWPSLVQTAKEGGCNAIESYVFWNGHE 87
Query: 84 PVRNQYNFEGRYDLVKFVKLVAEAGLYAHLRIGPYVCAEWNFGGFPLWLHFIPGIQFRTD 143
P +Y F GRY++VKF+K+V +AG++ LRIGP+V AEWN+GG P+WLH++PG FR D
Sbjct: 88 PSPRKYYFGGRYNIVKFIKIVQQAGMHMILRIGPFVAAEWNYGGVPVWLHYVPGTVFRAD 147
Query: 144 NEPFKAEMQRFTAKIVDMMKQEKLYASQGGPIILSQIENEYGNIDSAYGAAGKSYIKWAA 203
NEP+K M+ FT IV+++K+EKL+A QGGPIILSQ+ENEYG + YG GK Y +W+A
Sbjct: 148 NEPWKHYMESFTTYIVNLLKKEKLFAPQGGPIILSQVENEYGYYEKDYGEGGKRYAQWSA 207
Query: 204 GMALSLDTGVPWVMCQQSDAPDPIINTCNGFYCDQFTPNSNNKPKMWTENWSGWFLSFGG 263
MA+S + GVPW+MCQQ DAP +I+TCNGFYCDQFTPN+ +KPK+WTENW GWF +FGG
Sbjct: 208 SMAVSQNIGVPWMMCQQWDAPPTVISTCNGFYCDQFTPNTPDKPKIWTENWPGWFKTFGG 267
Query: 264 AVPYRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNFDRTSGGPFISTSYDYDAPLDEYGL 323
P+RP ED+A++VARFF +GG+ NYYMYHGGTNF RTSGGPFI+TSYDY+AP+DEYGL
Sbjct: 268 RDPHRPAEDVAYSVARFFGKGGSVHNYYMYHGGTNFGRTSGGPFITTSYDYEAPIDEYGL 327
Query: 324 IRQPKWGHLKDLHKAIKLCEAALVATDPTYPSLGPNLEATVYKTGSGLCSAFLANIGTNS 383
R PKWGHLKDLHKAI L E L+ + +LG +LEA VY SG C+AFL+N+ +
Sbjct: 328 PRLPKWGHLKDLHKAIMLSENLLINGEHQNFTLGHSLEADVYTDSSGTCAAFLSNLDDKN 387
Query: 384 DVTVKFNGNSYLLPAWSVSILPDCKNVVFNTAKINSVTLVPSFSRQSLQVAADSSDAIGS 443
D TV F SY LPAWSVSILPDCKN VFNTAK+ S FS+ + + D + G
Sbjct: 388 DKTVMFRNTSYHLPAWSVSILPDCKNEVFNTAKVTS-----KFSKVEM-LPEDLRSSSGL 441
Query: 444 GWSYINEPVGISKDDAFTKPGLLEQINTTADQSDYLWYSLSTNIKADEPLLEDGSKTVLH 503
W +E GI + F K L++ INTT D +DYLWY+ S + +E L+ GS VL
Sbjct: 442 KWEVFSEKPGIWGEADFVKNELVDHINTTKDTTDYLWYTTSITVSTNEEFLKKGSPPVLF 501
Query: 504 VQSLGHALHAFINGKLVGSGYGSSSNAKVTVDFPIALAPGKNTFDLLSLTVGLQNYGAFY 563
++S GH LH FIN + +G+ G+ ++ + +AL G+N DLLS+TVGL N G+FY
Sbjct: 502 IESKGHTLHVFINKEYLGTATGNGTHVPFKLKKSVALKAGENNIDLLSMTVGLSNAGSFY 561
Query: 564 EKTGAGITGPVQLKGSGNGTNIDLSSQQWTYQTGLKGEELN-FPSGSS--TQWDSKSTLP 620
E GAG+T V +KG GT ++L++ +W+Y+ G++G L F G S +W + P
Sbjct: 562 EWVGAGLTS-VSIKGFNKGT-LNLTNSKWSYKLGVQGVHLELFKPGDSGAVKWTVTTKPP 619
Query: 621 KLQPLVWYKTTFDAPAGSEPVAIDFTGMGKGEAWVNGQSIGRYWPTYVSQ---NGGCTDS 677
K QPL WYK D P+GSEPV +D MGKG AW+NG+ IGRYWP + N C
Sbjct: 620 KKQPLTWYKVVIDPPSGSEPVGLDMMSMGKGMAWLNGEEIGRYWPRIARKSTPNDECVKE 679
Query: 678 CNYRGAYSSNKCLKNCGKPSQSLYHVPRSWLKSSGNTLVLFEEIGGDPTKISFVTKQL 735
C+YRG + +KCL CG+PSQ YHVPRSW KSSGN LV+FEE GGDP KI+ +++
Sbjct: 680 CDYRGKFMPDKCLTGCGEPSQRWYHVPRSWFKSSGNELVIFEEKGGDPMKITLSKRKV 737
>gi|413926109|gb|AFW66041.1| hypothetical protein ZEAMMB73_706783 [Zea mays]
Length = 785
Score = 820 bits (2119), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 402/759 (52%), Positives = 500/759 (65%), Gaps = 67/759 (8%)
Query: 27 VTYDHRAVVIGGKRRVLISGSIHYPRSTPEMWPDLIQKSKDGGLDVIETYVFWNLHEPVR 86
V+YDHR++VI G+RR+LISGSIHYPRS PEMWP LIQK+KDGGLDV++TYVFWN HEP +
Sbjct: 40 VSYDHRSLVINGRRRILISGSIHYPRSAPEMWPGLIQKAKDGGLDVVQTYVFWNGHEPAQ 99
Query: 87 NQYNFEGRYDLVKFVKLVAEAGLYAHLRIGPYVCAEWNFGGFPLWLHFIPGIQFRTDNEP 146
QY F RYDLV+FVKLV +AGLY HLR+GPYVCAEWNFGGFP+WL ++PGI+FRTDN P
Sbjct: 100 GQYYFADRYDLVRFVKLVRQAGLYVHLRVGPYVCAEWNFGGFPVWLKYVPGIRFRTDNGP 159
Query: 147 FKAEMQRFTAKIVDMMKQEKLYASQGGPIILSQIENEYGNIDSAYGAAGKSYIKWAAGMA 206
FKA MQ+F KIV MMK E L+ QGGPII++Q+ENE+G ++S G+ GK Y WAA MA
Sbjct: 160 FKAAMQKFVEKIVSMMKSEGLFEWQGGPIIMAQVENEFGPMESVVGSGGKPYAHWAAQMA 219
Query: 207 LSLDTGVPWVMCQQSDAPDPIINTCNGFYCDQFTPNSNNKPKMWTENWSGWFLSFGGAVP 266
+ + GVPWVMC+Q DAPDP+INTCNGFYCD FTPN+ +KP MWTE W+GWF FGGA P
Sbjct: 220 VGTNAGVPWVMCKQDDAPDPVINTCNGFYCDYFTPNNKHKPTMWTEAWTGWFTKFGGAAP 279
Query: 267 YRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNFDRTSGGPFISTSYDYDAPLDEY----- 321
+RPVEDLAFAVARF Q+GG+F NYYMYHGGTNF RT+GGPFI+TSYDYDAP+DE+
Sbjct: 280 HRPVEDLAFAVARFVQKGGSFVNYYMYHGGTNFGRTAGGPFIATSYDYDAPIDEFGMQWL 339
Query: 322 --------------------------------------------GLIRQPKWGHLKDLHK 337
GL+RQPKWGHL+++H+
Sbjct: 340 LPSLINLNSHRLPRDICRKSSQCGFYLSVVHTWNFWGGGWVYIAGLLRQPKWGHLRNMHR 399
Query: 338 AIKLCEAALVATDPTYPSLGPNLEATVYKTGSGLCSAFLANIGTNSDVTVKFNGNSYLLP 397
AIK E ALV+ DPT S+G +A V+K+ +G C+AFL+N S V ++F+G Y LP
Sbjct: 400 AIKQAEPALVSGDPTIRSIGNYEKAYVFKSKNGACAAFLSNYHVKSAVRIRFDGRHYDLP 459
Query: 398 AWSVSILPDCKNVVFNTAKINSVTLVPSFSRQSLQVAADSSDAIGSGWSYINEPVGISKD 457
AWS+SILPDCK VFNTA + TL+P S + A W +E D
Sbjct: 460 AWSISILPDCKTAVFNTATVKEPTLLPKMSPVMHRFA----------WQSYSEDTNSLDD 509
Query: 458 DAFTKPGLLEQINTTADQSDYLWYSLSTNIKADEPLLEDGSKTVLHVQSLGHALHAFING 517
AF + GL+EQ++ T D+SDYLWY+ NI ++E L+ G L V S GH++ F+NG
Sbjct: 510 SAFARDGLIEQLSLTWDKSDYLWYTTHVNIGSNERFLKSGQWPQLSVYSAGHSMQVFVNG 569
Query: 518 KLVGSGYGSSSNAKVTVDFPIALAPGKNTFDLLSLTVGLQNYGAFYEKTGAGITGPVQLK 577
+ GS YG N K+T + + G N +LS VGL N G +E G+ GPV L
Sbjct: 570 RSYGSVYGGYDNPKLTFSGYVKMWQGSNKISILSSAVGLPNNGDHFELWNVGVLGPVTLS 629
Query: 578 GSGNGTNIDLSSQQWTYQTGLKGEELNFPS---GSSTQWDSKSTLPKLQPLVWYKTTFDA 634
G G DLS Q+W YQ GLKGE L + S+ +W QPL W+K F+A
Sbjct: 630 GLNEGKR-DLSHQRWIYQVGLKGESLGLHTVTGSSAVEWAGPGG--GTQPLTWHKALFNA 686
Query: 635 PAGSEPVAIDFTGMGKGEAWVNGQSIGRYWPTYVSQNGGCTDSCNYRGAYSSNKCLKNCG 694
PAGS+PVA+D MGKG+ WVNG+ GRYW +Y + + GC C+Y G Y ++C NCG
Sbjct: 687 PAGSDPVALDMGSMGKGQVWVNGRHAGRYW-SYRAHSRGC-GRCSYAGTYREDQCTSNCG 744
Query: 695 KPSQSLYHVPRSWLKSSGNTLVLFEEIGGDPTKISFVTK 733
SQ YHVPRSWLK SGN LV+ EE GGD +S T+
Sbjct: 745 DLSQRWYHVPRSWLKPSGNLLVVLEEYGGDLAGVSLATR 783
>gi|449476344|ref|XP_004154711.1| PREDICTED: beta-galactosidase 7-like [Cucumis sativus]
Length = 803
Score = 818 bits (2114), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 420/837 (50%), Positives = 536/837 (64%), Gaps = 47/837 (5%)
Query: 23 FGANVTYDHRAVVIGGKRRVLISGSIHYPRSTPEMWPDLIQKSKDGGLDVIETYVFWNLH 82
G NV+YD A++I G+RRV+ SGSIHYPRST MWPDLIQK+KDGGLD IETY+FW+ H
Sbjct: 1 MGDNVSYDSNAIIINGERRVIFSGSIHYPRSTDAMWPDLIQKAKDGGLDAIETYIFWDRH 60
Query: 83 EPVRNQYNFEGRYDLVKFVKLVAEAGLYAHLRIGPYVCAEWNFGGFPLWLHFIPGIQFRT 142
EP R +Y+F G + +KF +LV +AGLY +RIGPYVCAEWN+GGFPLWLH +PGIQ RT
Sbjct: 61 EPQRQKYDFSGHLNFIKFFQLVQDAGLYIVMRIGPYVCAEWNYGGFPLWLHNMPGIQLRT 120
Query: 143 DNEPFKAEMQRFTAKIVDMMKQEKLYASQGGPIILSQIENEYGNIDSAYGAAGKSYIKWA 202
DN+ +K EM FT KIV+M KQ L+ASQGGPIIL+QIENEYGN+ + YG AGK+YI W
Sbjct: 121 DNQVYKNEMLTFTTKIVNMCKQANLFASQGGPIILAQIENEYGNVMTPYGNAGKAYINWC 180
Query: 203 AGMALSLDTGVPWVMCQQSDAPDPIINTCNGFYCDQFTPNSNNKPKMWTENWSGWFLSFG 262
A MA SL+ GVPW+MCQQSDAP PIINTCNGFYCD F+PN+ PKM+TENW GWF +G
Sbjct: 181 AQMAESLNIGVPWIMCQQSDAPQPIINTCNGFYCDSFSPNNPKSPKMFTENWVGWFKKWG 240
Query: 263 GAVPYRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNFDRTSGGPFISTSYDYDAPLDEYG 322
PYR ED+AF+VARFFQ GG F NYYMYHGGTNF RTSGGPFI+TSYDY+APLDEYG
Sbjct: 241 DKDPYRSAEDVAFSVARFFQSGGVFNNYYMYHGGTNFGRTSGGPFITTSYDYNAPLDEYG 300
Query: 323 LIRQPKWGHLKDLHKAIKLCEAALVATDPTYPSLGPNLEATVYK---TGSGLCSAFLANI 379
+ QPKWGHLK LH +IKL E L + + G + T + T C FL+N
Sbjct: 301 NLNQPKWGHLKQLHSSIKLGEKILTNGTHSNKTFGSFVTLTKFSNPTTKERFC--FLSNT 358
Query: 380 GTNSDVTVKFNGN-SYLLPAWSVSILPDCKNVVFNTAKINSVTLVPSFSRQSLQVAADSS 438
+D T+ + Y +PAWSVSI+ CK VFNTAKINS T + +
Sbjct: 359 DDTNDATIDLQADGKYFVPAWSVSIIDGCKKEVFNTAKINSQTSM-------FVKVQNEK 411
Query: 439 DAIGSGWSYINEPVG--ISKDDAFTKPGLLEQINTTADQSDYLWYSLSTNIKADEPLLED 496
+ + W + E + + F + LLEQ TT D SDYLWY + +
Sbjct: 412 ENVKLSWVWAPEAMSDTLQGKGTFKENLLLEQKGTTIDSSDYLWYMTNVETNGTSSI--- 468
Query: 497 GSKTVLHVQSLGHALHAFINGKLVGSGYGSSSNAKVTVDFPIALAPGKNTFDLLSLTVGL 556
L V + GH LHAF+N + +GS +G++ + V + PI L G N LLS TVGL
Sbjct: 469 -HNVTLQVNTKGHVLHAFVNTRYIGSQWGNNGQSFV-FEKPILLKAGTNIITLLSATVGL 526
Query: 557 QNYGAFYEKTGAGIT-GPVQLKGSGNGTNIDLSSQQWTYQTGLKGE--ELNFPSGS-STQ 612
+NY AFY+ GI GP+ L G GN T +LSS W+Y+ GL GE +L P S T
Sbjct: 527 KNYDAFYDTLPTGIDGGPIYLIGDGNVT-TNLSSNLWSYKVGLNGEIKQLYNPVFSQETS 585
Query: 613 WDSKSTLPKLQPLVWYKTTFDAPAGSEPVAIDFTGMGKGEAWVNGQSIGRYWPTYVSQNG 672
W++ + + + WYKT+F P+G +PV +D GMGKGEAW+NGQSIGR+WP++++ N
Sbjct: 586 WNTLNKNSIGRRMTWYKTSFKTPSGIDPVTLDMQGMGKGEAWINGQSIGRFWPSFIAGND 645
Query: 673 GCTDSCNYRGAYSSNKCLKNCGKPSQSLYHVPRSWLKSSGNTLVLFEEIGGDPTKISFVT 732
C+++C+YRGAY +KC+ NCG PSQ YH+PRS+L ++ NTLVLFEEIGG P ++S T
Sbjct: 646 NCSETCDYRGAYDPSKCVGNCGNPSQRWYHIPRSFLSNNTNTLVLFEEIGGSPQQVSVQT 705
Query: 733 KQLGSSLCSHVTDSHPLPVDMWGSDSKIQRKPGPVLSLECPNPNQVISSIKFASFGTPLG 792
+G ++C + + G L L C +IS I+FAS+G P G
Sbjct: 706 ITIG-TICGNANE-------------------GSTLELSCQGE-YIISEIQFASYGNPKG 744
Query: 793 TCGSFSRGRCSSARSLSVVRQACVGSKSCSIGVSVNTFG-DPCKGVMKSLAVEASCT 848
CGSF +G S ++ + C KSCS+ VS FG + L V+A C+
Sbjct: 745 KCGSFKQGSWDVTNSALLLEKTCKDMKSCSVDVSAKLFGLGDAVNLSARLVVQALCS 801
>gi|449442765|ref|XP_004139151.1| PREDICTED: LOW QUALITY PROTEIN: beta-galactosidase 7-like [Cucumis
sativus]
Length = 803
Score = 813 bits (2099), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 423/845 (50%), Positives = 537/845 (63%), Gaps = 63/845 (7%)
Query: 23 FGANVTYDHRAVVIGGKRRVLISGSIHYPRSTPEMWPDLIQKSKDGGLDVIETYVFWNLH 82
G NV+YD A++I G+RRV+ SGSIHYPRST MWPDLIQK+KDGGLD IETY+FW+ H
Sbjct: 1 MGDNVSYDSNAIIINGERRVIFSGSIHYPRSTDAMWPDLIQKAKDGGLDAIETYIFWDRH 60
Query: 83 EPVRNQYNFEGRYDLVKFVKLVAEAGLYAHLRIGPYVCAEWNFGGFPLWLHFIPGIQFRT 142
EP R +Y+F G + +KF +LV +AGLY +RIGPYVCAEWN+GGFPLWLH +PGIQ RT
Sbjct: 61 EPQRQKYDFSGHLNFIKFFQLVQDAGLYIVMRIGPYVCAEWNYGGFPLWLHNMPGIQLRT 120
Query: 143 DNEPFKAEMQRFTAKIVDMMKQEKLYASQGGPIILSQIENEYGNIDSAYGAAGKSYIKWA 202
DN+ +K EM FT KIV+M KQ L+ASQGGPIIL+QIENEYGN+ + YG AGK+YI W
Sbjct: 121 DNQVYKNEMLTFTTKIVNMCKQANLFASQGGPIILAQIENEYGNVMTPYGNAGKAYINWC 180
Query: 203 AGMALSLDTGVPWVMCQQSDAPDPIINTCNGFYCDQFTPNSNNKPKMWTENWSGWFLSFG 262
A MA S + GVPW+MCQQSDAP PIINTCNGFYCD F+PN+ PKM+TENW GWF +G
Sbjct: 181 AQMAESFNIGVPWIMCQQSDAPQPIINTCNGFYCDSFSPNNPKSPKMFTENWVGWFKKWG 240
Query: 263 GAVPYRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNFDRTSGGPFISTSYDYDAPLDEYG 322
PYR ED+AF+VARFFQ GG F NYYMYHGGTNF RTSGGPFI+TSYDY+APLDEYG
Sbjct: 241 DKDPYRSAEDVAFSVARFFQSGGVFNNYYMYHGGTNFGRTSGGPFITTSYDYNAPLDEYG 300
Query: 323 LIRQPKWGHLKDLHKAIKLCEAALV---------ATDPTYPSLGPNLEATVYK---TGSG 370
+ QPKWGHLK LH +IKL E L + T+ + G + T + T
Sbjct: 301 NLNQPKWGHLKQLHSSIKLGEKILTNGTHSNKTFGSFVTFKTFGSFVTLTKFSNPTTKER 360
Query: 371 LCSAFLANIGTNSDVTVKFNGNSYLLPAWSVSILPDCKNVVFNTAKINSVTLVPSFSRQS 430
C FL+N T K +G Y +PAWSVSI+ CK VFNTAKINS T +
Sbjct: 361 FC--FLSN-------TXKADG-KYFVPAWSVSIIDGCKKEVFNTAKINSQTSI------- 403
Query: 431 LQVAADSSDAIGSGWSYINEPVG--ISKDDAFTKPGLLEQINTTADQSDYLWYSLSTNIK 488
+ + + W + E + + F + LLEQ TT D SDYLWY +
Sbjct: 404 FVKVQNEKENVKLSWVWAPEAMSDTLQGKGTFKENLLLEQKGTTIDSSDYLWYMTNVETN 463
Query: 489 ADEPLLEDGSKTVLHVQSLGHALHAFINGKLVGSGYGSSSNAKVTVDFPIALAPGKNTFD 548
+ L V + GH LHAF+N + +GS +G++ + V + PI L G N
Sbjct: 464 GTSSI----HNVTLQVNTKGHVLHAFVNTRYIGSQWGNNGQSFV-FEKPILLKAGTNIIT 518
Query: 549 LLSLTVGLQNYGAFYEKTGAGIT-GPVQLKGSGNGTNIDLSSQQWTYQTGLKGE--ELNF 605
LLS TVGL+NY AFY+ GI GP+ L G GN IDLSS W+Y+ GL GE +L
Sbjct: 519 LLSATVGLKNYDAFYDTLPTGIDGGPIYLIGDGN-VKIDLSSNLWSYKVGLNGEIKQLYN 577
Query: 606 PSGS-STQWDSKSTLPKLQPLVWYKTTFDAPAGSEPVAIDFTGMGKGEAWVNGQSIGRYW 664
P S T W++ + + + WYKT+F P+G +PV +D GMGKGEAW+NGQSIGR+W
Sbjct: 578 PVFSQETSWNTLNKNSIGRRMTWYKTSFKTPSGIDPVTLDMQGMGKGEAWINGQSIGRFW 637
Query: 665 PTYVSQNGGCTDSCNYRGAYSSNKCLKNCGKPSQSLYHVPRSWLKSSGNTLVLFEEIGGD 724
P++++ N C+++C+YRGAY +KC+ NCG PSQ YH+PRS+L ++ NTLVLFEEIGG
Sbjct: 638 PSFIAGNDNCSETCDYRGAYDPSKCVGNCGNPSQRWYHIPRSFLSNNTNTLVLFEEIGGS 697
Query: 725 PTKISFVTKQLGSSLCSHVTDSHPLPVDMWGSDSKIQRKPGPVLSLECPNPNQVISSIKF 784
P ++S T +G ++C + + G L L C +IS I+F
Sbjct: 698 PQQVSVQTITIG-TICGNANE-------------------GSTLELSCQG-EYIISEIQF 736
Query: 785 ASFGTPLGTCGSFSRGRCSSARSLSVVRQACVGSKSCSIGVSVNTFG-DPCKGVMKSLAV 843
AS+G P G CGSF +G S ++ + C G KSCS+ VS FG + L V
Sbjct: 737 ASYGNPKGKCGSFKQGSWDVTNSALLLEKTCKGMKSCSVDVSAKLFGLGDAVNLSARLVV 796
Query: 844 EASCT 848
+A C+
Sbjct: 797 QALCS 801
>gi|168045683|ref|XP_001775306.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162673387|gb|EDQ59911.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 831
Score = 812 bits (2098), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 424/845 (50%), Positives = 555/845 (65%), Gaps = 37/845 (4%)
Query: 14 GFVVLATTSFGANVTYDHRAVVIGGKRRVLISGSIHYPRSTPEMWPDLIQKSKDGGLDVI 73
G ++ TS V+YD RA+ + G RR+L+SGSIHYPRSTP MWP LI K+K GGLDVI
Sbjct: 12 GALLQNHTSVAVTVSYDQRALKLDGNRRMLVSGSIHYPRSTPTMWPGLIAKAKKGGLDVI 71
Query: 74 ETYVFWNLHEPVRNQYNFEGRYDLVKFVKLVAEAGLYAHLRIGPYVCAEWNFGGFPLWLH 133
+TYVFW+ HEP + YNF GRYDL KF++LV EAG+Y +LRIGPYVCAEWNFGGFP WL
Sbjct: 72 QTYVFWSGHEPTQGVYNFAGRYDLPKFLRLVHEAGMYVNLRIGPYVCAEWNFGGFPGWLR 131
Query: 134 FIPGIQFRTDNEPFKAEM-QRFTAKIVDMMKQEKLYASQGGPIILSQIENEYGNIDSAYG 192
F+PGI+FRTDNE FK + FT+ ++ + + + Q +I +QIENEYG+ID+ YG
Sbjct: 132 FLPGIEFRTDNESFKVHLSHSFTSSLISVYSRS--FNIQ--LVICAQIENEYGSIDAVYG 187
Query: 193 AAGKSYIKWAAGMALSLDTGVPWVMCQQSDAPDPIINTCNGFYCDQFTPNSNNKPKMWTE 252
AG+ Y+ W A MA++ + VPW+MC Q DAP +I+TCNGFYCD F PNS KP +WTE
Sbjct: 188 EAGQKYLNWIANMAVATNISVPWIMCNQPDAPPSVIDTCNGFYCDGFRPNSEGKPALWTE 247
Query: 253 NWSGWFLSFGGAVPYRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNFDRTSGGPFISTSY 312
NW+GWF S+G P RPV+D+AFAVARFFQ+GG+F +YYMYHGGTNF+R S ++T+Y
Sbjct: 248 NWTGWFQSWGEGAPTRPVQDIAFAVARFFQKGGSFMHYYMYHGGTNFER-SAMEGVTTNY 306
Query: 313 DYDAPLDEYGLIRQPKWGHLKDLHKAIKLCEAALVATD--PTYPSLGPNLEATVYKTGSG 370
DYDAP+DEYG +RQPKWGHLKDLH A+KLCE LV D P+ SLGP EA VY + +G
Sbjct: 307 DYDAPIDEYGDVRQPKWGHLKDLHAALKLCELCLVGVDTVPSEISLGPYQEAHVYNSSTG 366
Query: 371 LCSAFLANIGTNSDVTVKFNGNSYLLPAWSVSILPDCKNVVFNTAKINSVTLVPSFSRQS 430
C+AFLA+ GT+ D TV F G SY LPAWSVSILPDCK+VVFNTAK+ QS
Sbjct: 367 ACAAFLASWGTD-DSTVLFQGQSYDLPAWSVSILPDCKSVVFNTAKVGV---------QS 416
Query: 431 LQVAADSSDAIGSGWSYINEPVGISKDDAFTKPGLLEQINTTADQSDYLWYSLSTNIKAD 490
+ + S+ + + W EP+ F+ L+EQI TT D +DYLWY +TN++
Sbjct: 417 MTMTMQSAIPV-TNWVSYREPLE-PWGSTFSTNELVEQIATTKDTTDYLWY--TTNVEVA 472
Query: 491 EPLLEDG-SKTVLHVQSLGHALHAFINGKLVGSGYGSSSNAKVTVDFPIALAPGKNTFDL 549
E +G ++ L + L A H F+N L G+ S A + I+L PG N+ +
Sbjct: 473 ESDAPNGLAQATLVMSYLRDAAHIFVNKWLTGTKSAHGSEASQS----ISLRPGINSVKV 528
Query: 550 LSLTVGLQNYGAFYEKTGAGITGPVQLKGSGNGTNIDLSSQQWTYQTGLKGE--ELNFPS 607
LS+T GLQ G F EK AGI ++++G +G I + WTYQ GL+GE L +
Sbjct: 529 LSMTTGLQGTGPFLEKEKAGIQFGIRVEGLPSGA-IIMQRNTWTYQVGLQGENNRLFESN 587
Query: 608 GS-STQWDSKSTLPKLQPLVWYKTTFDAPAGSEPVAIDFTGMGKGEAWVNGQSIGRYWPT 666
GS S W + + + L W+KTTFD P + VA+D + MGKG+ WVNG ++GRYW +
Sbjct: 588 GSLSAVWSTSTDVSNQMSLSWFKTTFDMPERNGTVALDLSSMGKGQVWVNGINLGRYWSS 647
Query: 667 YVSQNGGCTDSCNYRGAYSSNKCLKNCGKPSQSLYHVPRSWLKSSGNTLVLFEEIGGDPT 726
++ GC D+C+YRG++S +KCL CG+PSQS YHVPR WL S N LVLFEE G+P
Sbjct: 648 CIAHTDGCVDNCDYRGSHSESKCLTKCGQPSQSWYHVPREWLLSKQNLLVLFEEQEGNPE 707
Query: 727 KISFVTKQLGSSLCSHVTDSHPLPVDMWGSD---SKIQRKPGPVLSLECPNPNQVISSIK 783
I+ + ++ +CS +++SHP P+ + S S+ P L+LEC + Q IS I
Sbjct: 708 AIT-IAPRIPQHICSRMSESHPFPIPLSSSTKRGSQTSTPPIAPLALECAD-GQHISRIS 765
Query: 784 FASFGTPLGTCGSFSRGRCSSARSLSVVRQACVGSKSCSIG-VSVNTFGDPCKGVMKSLA 842
FAS+GTP G CG F C + S V+ +ACVG + C + VS GDPC G++KSLA
Sbjct: 766 FASYGTPSGDCGDFKLSSCHANSSKDVLSKACVGRQKCLVPIVSSICGGDPCPGMIKSLA 825
Query: 843 VEASC 847
A C
Sbjct: 826 ATAEC 830
>gi|297851602|ref|XP_002893682.1| Beta-galactosidase 15 precursor [Arabidopsis lyrata subsp. lyrata]
gi|297339524|gb|EFH69941.1| Beta-galactosidase 15 precursor [Arabidopsis lyrata subsp. lyrata]
Length = 780
Score = 811 bits (2096), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 420/847 (49%), Positives = 539/847 (63%), Gaps = 77/847 (9%)
Query: 7 LLLVLCWGFVVLATTSFGANVTYDHRAVVIGGKRRVLISGSIHYPRSTPEMWPDLIQKSK 66
L +LC +++++ ++ V++D RA+ I G RRVL+SGSIHYPRST EMWPDLI+K K
Sbjct: 5 LKFLLC--CLLVSSCAYATIVSHDGRAITIDGHRRVLLSGSIHYPRSTTEMWPDLIKKGK 62
Query: 67 DGGLDVIETYVFWNLHEPVRNQYNFEGRYDLVKFVKLVAEAGLYAHLRIGPYVCAEWNFG 126
+GGLD IETYVFWN HEP R QY+F G DL++F+K + + G+Y LRIGPYVCAEWN+G
Sbjct: 63 EGGLDAIETYVFWNAHEPTRRQYDFSGNLDLIRFLKTIQDEGMYGVLRIGPYVCAEWNYG 122
Query: 127 GFPLWLHFIPGIQFRTDNEPFKAEMQRFTAKIVDMMKQEKLYASQGGPIILSQIENEYGN 186
GFP+WLH +PG++FRT N F EMQ FT IV+M+K+EKL+ASQGGPIIL+QIENEYGN
Sbjct: 123 GFPVWLHNMPGMEFRTTNTAFMNEMQNFTTMIVEMVKKEKLFASQGGPIILAQIENEYGN 182
Query: 187 IDSAYGAAGKSYIKWAAGMALSLDTGVPWVMCQQSDAPDPIINTCNGFYCDQFTPNSNNK 246
+ +YG AGK+YIKW A MA SLD GVPW+MCQQ DAP P++NTCNG+YCD FTPN+ N
Sbjct: 183 VIGSYGEAGKAYIKWCANMANSLDVGVPWIMCQQDDAPQPMLNTCNGYYCDNFTPNNPNT 242
Query: 247 PKMWTENWSGWFLSFGGAVPYRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNFDRTSGGP 306
PKMWTENW+GW+ ++GG P+R ED+AFAVARFFQRGGTFQNYYMYHGGTNFDRT+GGP
Sbjct: 243 PKMWTENWTGWYKNWGGKDPHRTTEDVAFAVARFFQRGGTFQNYYMYHGGTNFDRTAGGP 302
Query: 307 FISTSYDYDAPLDEYGLIRQPKWGHLKDLHKAIKLCEAALVATDPTYPSLGPNLEATVYK 366
+I+T+YDYDAPLDE+G + QPK+GHLK LH + E L + + G + ATVYK
Sbjct: 303 YITTTYDYDAPLDEFGNLNQPKYGHLKQLHDVLHAMEKTLTYGNISTVDFGNLVTATVYK 362
Query: 367 TGSGLCSAFLANIGTNSDVTVKFNGNSYLLPAWSVSILPDCKNVVFNTAKINSVTLVPSF 426
T G S F+ N+ SD + F G Y +PAWSVSILPDCK +NTAKIN+ T S
Sbjct: 363 TEEG-SSCFIGNVNETSDAKINFQGTFYDVPAWSVSILPDCKTETYNTAKINTQT---SV 418
Query: 427 SRQSLQVAADSSDAIGSGWSYIN-EPVGISKDDAFTKPGLLEQINTTADQSDYLWYSLST 485
+ A + + W N + V + T L +Q + D+SDYLWY +
Sbjct: 419 MVKKANEAENEPSTLKWSWRPENIDNVLLKGKGESTMRQLFDQKVVSNDESDYLWYMTTV 478
Query: 486 NIKADEPLLEDGSKTVLHVQSLGHALHAFINGKLVGSGYGSSSNAKVTVDFPIALAPGKN 545
NIK +P+ G L + S H LHAF+NG+ +G+ + + PG N
Sbjct: 479 NIKEQDPVW--GKNMSLRINSTAHVLHAFVNGQHIGNYRAENGKFHYVFEQDAKFNPGAN 536
Query: 546 TFDLLSLTVGLQNYGAFYEKTGAGITGPVQLKGSGNGTNI--DLSSQQWTYQTGLKGEEL 603
LLS+TVGL NYGAF+E AGITGPV + G I DLS+ +W+Y+TGL G E
Sbjct: 537 VITLLSITVGLPNYGAFFENVPAGITGPVFIIGRNGDETIVKDLSTHKWSYKTGLSGFEN 596
Query: 604 NFPSGSSTQWDSKSTLPKLQPLVWYKTTFDAPAGSEPVAIDFTGMGKGEAWVNGQSIGRY 663
S S +T+ AP GSEPV +D G+GKG AW+NG +IGRY
Sbjct: 597 QLFSSES------------------PSTWSAPLGSEPVVVDLLGLGKGTAWINGNNIGRY 638
Query: 664 WPTYVSQNGGCTDSCNYRGAYSSNKCLKNCGKPSQSLYHVPRSWLKSSG-NTLVLFEEIG 722
WP +++ GC+ + YHVPRS+L S G NTLVLFEEIG
Sbjct: 639 WPAFLADIDGCS-----------------------AEYHVPRSFLNSDGDNTLVLFEEIG 675
Query: 723 GDPTKISFVTKQLGSSLCSHVTDSHPLPVDMWGSDSKIQRKPGPVLSLECPNPNQVISSI 782
G+P+ ++F T +G ++C++V + + VL L C + ISSI
Sbjct: 676 GNPSLVNFQTIGVG-NVCANVYEKN-------------------VLELSC--NGKPISSI 713
Query: 783 KFASFGTPLGTCGSFSRGRC-SSARSLSVVRQACVGSKSCSIGVSVNTFGDP-CKGVMKS 840
KFASFG P G CGSF +G C +S + +++ Q CVG + CSI VS FG C G+ K
Sbjct: 714 KFASFGNPGGNCGSFEKGTCEASNDAAAILTQECVGKEKCSIDVSEKKFGAADCGGLAKR 773
Query: 841 LAVEASC 847
LAVEA C
Sbjct: 774 LAVEAIC 780
>gi|357484129|ref|XP_003612351.1| Beta-galactosidase [Medicago truncatula]
gi|355513686|gb|AES95309.1| Beta-galactosidase [Medicago truncatula]
Length = 806
Score = 809 bits (2090), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 414/836 (49%), Positives = 535/836 (63%), Gaps = 46/836 (5%)
Query: 23 FGANVTYDHRAVVIGGKRRVLISGSIHYPRSTPEMWPDLIQKSKDGGLDVIETYVFWNLH 82
F VTYD A++I G+RR++ SG+IHYPRST EMWPDLIQK+KDGGLD IETY+FW+ H
Sbjct: 6 FATEVTYDSNALIINGERRLIFSGAIHYPRSTVEMWPDLIQKAKDGGLDAIETYIFWDRH 65
Query: 83 EPVRNQYNFEGRYDLVKFVKLVAEAGLYAHLRIGPYVCAEWNFGGFPLWLHFIPGIQFRT 142
EPVR +YNF G D VKF +L+ +AGLYA +RIGPY CAEWNFGGFP WLH +PGI+ RT
Sbjct: 66 EPVRREYNFSGNLDFVKFFQLIQKAGLYAIMRIGPYACAEWNFGGFPSWLHNMPGIELRT 125
Query: 143 DNEPFKAEMQRFTAKIVDMMKQEKLYASQGGPIILSQIENEYGNIDSAYGAAGKSYIKWA 202
+N +K EMQ FT +IV+++K+ KL+ASQGGPIIL+QIENEYG+I Y AGK+Y++WA
Sbjct: 126 NNSVYKNEMQNFTTEIVNVVKEAKLFASQGGPIILAQIENEYGDIMWNYKDAGKAYVQWA 185
Query: 203 AGMALSLDTGVPWVMCQQSDAPDPIINTCNGFYCDQFTPNSNNKPKMWTENWSGWFLSFG 262
A MAL+ + GVPW+MCQQ DAP PIINTCNG+YC F PN+ PK++TENW GWF +G
Sbjct: 186 AQMALAQNIGVPWIMCQQQDAPQPIINTCNGYYCHNFQPNNPKSPKIFTENWIGWFQKWG 245
Query: 263 GAVPYRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNFDRTSGGPFISTSYDYDAPLDEYG 322
VP+R ED AF+VARFFQ GG NYYMYHGGTNF RT+GGP+I+TSYDYDAP+DEYG
Sbjct: 246 ERVPHRSAEDSAFSVARFFQNGGVLNNYYMYHGGTNFGRTAGGPYITTSYDYDAPIDEYG 305
Query: 323 LIRQPKWGHLKDLHKAIKLCEAALVATDPTY-PSLGPNLEATVYKTGSGLCSAFLANIGT 381
+ QPKWGHLK+LH AIKL E L LG L T Y SG FL+N
Sbjct: 306 NLNQPKWGHLKNLHAAIKLGENVLTNYSARKDEDLGNGLTLTTYTNSSGARFCFLSN-NN 364
Query: 382 NSDVTVKF---NGNSYLLPAWSVSILPDCKNVVFNTAKINSVTLVPSFSRQSLQVAADSS 438
N+D+ + N Y++PAWSVSI+ C VFNTAK+NS T + + +D+
Sbjct: 365 NTDLGARVDLKNDGVYIVPAWSVSIINGCNQEVFNTAKVNSQTSM-------MVKKSDNV 417
Query: 439 DAIGSGWSYINEPV--GISKDDAFTKPGLLEQINTTADQSDYLWYSLSTNIKADEPLLED 496
+ W + EP I + + LLEQ T D SDYLWY S +I D +
Sbjct: 418 SSTNLTWEWKVEPKRDTIHGNGSLKAQKLLEQKELTLDASDYLWYMTSADIN-DTSIW-- 474
Query: 497 GSKTVLHVQSLGHALHAFINGKLVGSGYGSSSNAKVTVDFPIALAPGKNTFDLLSLTVGL 556
S L V + GH+LH ++N + VG + N + T + ++L G N LLS TVGL
Sbjct: 475 -SNATLRVNTSGHSLHGYVNQRYVGYQFSQYGN-QFTYEKQVSLKNGTNIITLLSATVGL 532
Query: 557 QNYGAFYEKTGAGIT-GPVQLKGSGNGTNIDLSSQQWTYQTGLKGEELNFPSGS---STQ 612
NYGA+++ GI+ GPV+L G N T +DLS+ W+Y+ GL GE + S
Sbjct: 533 ANYGAWFDDKKTGISGGPVELIGKNNVT-MDLSTNLWSYKIGLNGERRHLYDAQQNVSVA 591
Query: 613 WDSKST-LPKLQPLVWYKTTFDAPAGSEPVAIDFTGMGKGEAWVNGQSIGRYWPTYVSQN 671
W + S+ +P +PL+WY+ F +P G+ P+ +D G+GKG AWVNG SIGRYW +++S +
Sbjct: 592 WHTNSSYIPIGKPLIWYRAKFKSPFGTNPIVVDLQGLGKGHAWVNGHSIGRYWSSWISPS 651
Query: 672 GGCTDSCNYRGAYSSNKCLKNCGKPSQSLYHVPRSWLKSSGNTLVLFEEIGGDPTKISFV 731
GC+D+C+YRG Y KC NCG PSQ YHVPRS+L NTLVLFEEIGG+P + F
Sbjct: 652 DGCSDTCDYRGNYVPVKCNTNCGSPSQRWYHVPRSFLNHDMNTLVLFEEIGGNPQSVQFQ 711
Query: 732 TKQLGSSLCSHVTDSHPLPVDMWGSDSKIQRKPGPVLSLECPNPNQVISSIKFASFGTPL 791
T G ++C++V + G L C + QV+S I+FAS+G P
Sbjct: 712 TVTTG-TICANVYE-------------------GAQFELSCQS-GQVMSQIQFASYGNPE 750
Query: 792 GTCGSFSRGRCSSARSLSVVRQACVGSKSCSIGVSVNTFGDPCKGVMKSLAVEASC 847
G CGSF +G +A S SVV +CVG +C V+ FG + LAV+ +C
Sbjct: 751 GQCGSFKKGNFDAANSQSVVEASCVGKNNCGFNVTKEMFGVTNVSSIPRLAVQVTC 806
>gi|75169194|sp|Q9C6W4.1|BGL15_ARATH RecName: Full=Beta-galactosidase 15; Short=Lactase 15; Flags:
Precursor
gi|12597826|gb|AAG60136.1|AC074360_1 hypothetical protein [Arabidopsis thaliana]
Length = 779
Score = 806 bits (2082), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 416/847 (49%), Positives = 538/847 (63%), Gaps = 77/847 (9%)
Query: 7 LLLVLCWGFVVLATTSFGANVTYDHRAVVIGGKRRVLISGSIHYPRSTPEMWPDLIQKSK 66
L +LC V++++ ++ V++D RA+ I G RRVL+SGSIHYPRST EMWPDLI+K K
Sbjct: 4 LSFILC--CVLVSSCAYATIVSHDGRAITIDGHRRVLLSGSIHYPRSTTEMWPDLIKKGK 61
Query: 67 DGGLDVIETYVFWNLHEPVRNQYNFEGRYDLVKFVKLVAEAGLYAHLRIGPYVCAEWNFG 126
+G LD IETYVFWN HEP R QY+F G DL++F+K + G+Y LRIGPYVCAEWN+G
Sbjct: 62 EGSLDAIETYVFWNAHEPTRRQYDFSGNLDLIRFLKTIQNEGMYGVLRIGPYVCAEWNYG 121
Query: 127 GFPLWLHFIPGIQFRTDNEPFKAEMQRFTAKIVDMMKQEKLYASQGGPIILSQIENEYGN 186
GFP+WLH +PG++FRT N F EMQ FT IV+M+K+EKL+ASQGGPIIL+QIENEYGN
Sbjct: 122 GFPVWLHNMPGMEFRTTNTAFMNEMQNFTTMIVEMVKKEKLFASQGGPIILAQIENEYGN 181
Query: 187 IDSAYGAAGKSYIKWAAGMALSLDTGVPWVMCQQSDAPDPIINTCNGFYCDQFTPNSNNK 246
+ +YG AGK+YI+W A MA SLD GVPW+MCQQ DAP P++NTCNG+YCD F+PN+ N
Sbjct: 182 VIGSYGEAGKAYIQWCANMANSLDVGVPWIMCQQDDAPQPMLNTCNGYYCDNFSPNNPNT 241
Query: 247 PKMWTENWSGWFLSFGGAVPYRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNFDRTSGGP 306
PKMWTENW+GW+ ++GG P+R ED+AFAVARFFQ+ GTFQNYYMYHGGTNFDRT+GGP
Sbjct: 242 PKMWTENWTGWYKNWGGKDPHRTTEDVAFAVARFFQKEGTFQNYYMYHGGTNFDRTAGGP 301
Query: 307 FISTSYDYDAPLDEYGLIRQPKWGHLKDLHKAIKLCEAALVATDPTYPSLGPNLEATVYK 366
+I+T+YDYDAPLDE+G + QPK+GHLK LH + E L + + G + ATVY+
Sbjct: 302 YITTTYDYDAPLDEFGNLNQPKYGHLKQLHDVLHAMEKTLTYGNISTVDFGNLVTATVYQ 361
Query: 367 TGSGLCSAFLANIGTNSDVTVKFNGNSYLLPAWSVSILPDCKNVVFNTAKINSVTLVPSF 426
T G S F+ N+ SD + F G SY +PAWSVSILPDCK +NTAKIN+ T S
Sbjct: 362 TEEG-SSCFIGNVNETSDAKINFQGTSYDVPAWSVSILPDCKTETYNTAKINTQT---SV 417
Query: 427 SRQSLQVAADSSDAIGSGWSYIN-EPVGISKDDAFTKPGLLEQINTTADQSDYLWYSLST 485
+ A + + W N + V + T L +Q + D+SDYLWY +
Sbjct: 418 MVKKANEAENEPSTLKWSWRPENIDSVLLKGKGESTMRQLFDQKVVSNDESDYLWYMTTV 477
Query: 486 NIKADEPLLEDGSKTVLHVQSLGHALHAFINGKLVGSGYGSSSNAKVTVDFPIALAPGKN 545
N+K +P+L G L + S H LHAF+NG+ +G+ + + PG N
Sbjct: 478 NLKEQDPVL--GKNMSLRINSTAHVLHAFVNGQHIGNYRVENGKFHYVFEQDAKFNPGAN 535
Query: 546 TFDLLSLTVGLQNYGAFYEKTGAGITGPVQLKGSGNGTNI--DLSSQQWTYQTGLKGEEL 603
LLS+TVGL NYGAF+E AGITGPV + G I DLS+ +W+Y+TGL G E
Sbjct: 536 VITLLSITVGLPNYGAFFENFSAGITGPVFIIGRNGDETIVKDLSTHKWSYKTGLSGFEN 595
Query: 604 NFPSGSSTQWDSKSTLPKLQPLVWYKTTFDAPAGSEPVAIDFTGMGKGEAWVNGQSIGRY 663
S S +T+ AP GSEPV +D G+GKG AW+NG +IGRY
Sbjct: 596 QLFSSES------------------PSTWSAPLGSEPVVVDLLGLGKGTAWINGNNIGRY 637
Query: 664 WPTYVSQNGGCTDSCNYRGAYSSNKCLKNCGKPSQSLYHVPRSWLKSSG-NTLVLFEEIG 722
WP ++S GC+ + YHVPRS+L S G NTLVLFEEIG
Sbjct: 638 WPAFLSDIDGCS-----------------------AEYHVPRSFLNSEGDNTLVLFEEIG 674
Query: 723 GDPTKISFVTKQLGSSLCSHVTDSHPLPVDMWGSDSKIQRKPGPVLSLECPNPNQVISSI 782
G+P+ ++F T +G S+C++V + + VL L C + IS+I
Sbjct: 675 GNPSLVNFQTIGVG-SVCANVYEKN-------------------VLELSC--NGKPISAI 712
Query: 783 KFASFGTPLGTCGSFSRGRC-SSARSLSVVRQACVGSKSCSIGVSVNTFG-DPCKGVMKS 840
KFASFG P G CGSF +G C +S + +++ Q CVG + CSI VS + FG C + K
Sbjct: 713 KFASFGNPGGDCGSFEKGTCEASNNAAAILTQECVGKEKCSIDVSEDKFGAAECGALAKR 772
Query: 841 LAVEASC 847
LAVEA C
Sbjct: 773 LAVEAIC 779
>gi|357484445|ref|XP_003612510.1| Beta-galactosidase [Medicago truncatula]
gi|355513845|gb|AES95468.1| Beta-galactosidase [Medicago truncatula]
Length = 828
Score = 800 bits (2066), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 415/843 (49%), Positives = 529/843 (62%), Gaps = 61/843 (7%)
Query: 27 VTYDHRAVVIGGKRRVLISGSIHYPRSTPEMWPDLIQKSKDGGLDVIETYVFWNLHEPVR 86
V YD A++I G+RR++ SG+IHYPRST +MWPDL+QK+KDGGLD IETY+FW+ HE VR
Sbjct: 25 VKYDSNALIINGERRLIFSGAIHYPRSTVDMWPDLVQKAKDGGLDAIETYIFWDRHEQVR 84
Query: 87 NQYNFEGRYDLVKFVKLVAEAGLYAHLRIGPYVCAEWNFGGFPLWLHFIPGIQFRTDNEP 146
+YNF G D VKF K + EAGLY +RIGPY CAEWN+GGFP+WLH IPGI+ RTDN
Sbjct: 85 GRYNFSGNLDFVKFFKTIQEAGLYGIIRIGPYSCAEWNYGGFPVWLHQIPGIEMRTDNAA 144
Query: 147 FKAEMQRFTAKIVDMMKQEKLYASQGGPIILSQIENEYGNIDSAYGAAGKSYIKWAAGMA 206
+K EMQ F KI+++ K+ L+ASQGGPIIL+QIENEYG+I + GK+YIKWAA MA
Sbjct: 145 YKNEMQIFVTKIINVAKEANLFASQGGPIILAQIENEYGDIMWNFKEPGKAYIKWAAQMA 204
Query: 207 LSLDTGVPWVMCQQSDAPDPIINTCNGFYCDQFTPNSNNKPKMWTENWSGWFLSFGGAVP 266
L+ + GVPW MCQQ+DAP PIINTCNG+YC F PN+ PKM+TENW GWF +G P
Sbjct: 205 LAQNIGVPWFMCQQNDAPQPIINTCNGYYCHNFKPNNPKSPKMFTENWIGWFQKWGERAP 264
Query: 267 YRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNFDRTSGGPFISTSYDYDAPLDEYGLIRQ 326
+R ED A+AVARFFQ GG F NYYMYHGGTNF RTSGGP+I TSYDYDAP++EYG + Q
Sbjct: 265 HRTAEDSAYAVARFFQNGGVFNNYYMYHGGTNFGRTSGGPYIITSYDYDAPINEYGNLNQ 324
Query: 327 PKWGHLKDLHKAIKLCEAALVA-TDPTYPSLGPNLEATVYKTGSGLCSAFLANIGTNSDV 385
PK+GHLK LH+AIKL E L T LG + T Y G FL+N N+D
Sbjct: 325 PKYGHLKFLHEAIKLGEKVLTNYTSRNDKDLGNGITLTTYTNSVGARFCFLSNDKDNTDG 384
Query: 386 TVKF-NGNSYLLPAWSVSILPDCKNVVFNTAKINSVTLVPSFSRQSLQVAADSSDAIGSG 444
V N Y +PAWSV+IL C VFNTAK+NS T + ++ D+S
Sbjct: 385 NVDLQNDGKYFVPAWSVTILDGCNKEVFNTAKVNSQTSI-------MEKKIDNSSTNKLT 437
Query: 445 WSYINEPVGISKDDAFTKPG------LLEQINTTADQSDYLWYSLSTNIKADEPLLEDGS 498
W++I EP K D G LLEQ T D SDYLWY S +I + S
Sbjct: 438 WAWIMEP----KKDTMNGRGSIKAHQLLEQKELTLDASDYLWYMTSVDIND----TSNWS 489
Query: 499 KTVLHVQSLGHALHAFINGKLVGSGYGSSSNAKVTVDFPIALAPGKNTFDLLSLTVGLQN 558
LHV++ GH LH ++N + +G G+ N T + ++L G N LLS TVGL N
Sbjct: 490 NANLHVETSGHTLHGYVNKRYIGYGHSQFGN-NFTYEKQVSLKNGTNIITLLSATVGLAN 548
Query: 559 YGAFYEKTGAGIT-GPVQLKGSGNGTNIDLSSQQWTYQTGLKGEELNF---PSGSSTQWD 614
YGA +++ GI+ GPV+L G N IDLS+ W+++ GL GE+ F S W+
Sbjct: 549 YGARFDEIKTGISDGPVKLVGQ-NSVTIDLSTGNWSFKVGLNGEKRRFYDLQPRSGVAWN 607
Query: 615 SKSTLPKLQPLVWYKTTFDAPAGSEPVAIDFTGMGKGEAWVNGQSIGRYWPTYVSQNGGC 674
+ S+ P +PL WYKT F +P G P+ +D G+GKG AWVNG+SIGRYW ++++ GC
Sbjct: 608 T-SSYPTGKPLTWYKTQFKSPLGPNPIVVDLQGLGKGHAWVNGKSIGRYWTSWITSTAGC 666
Query: 675 TDSCNYRGAYSSNKCLKNCGKPSQSLYHVPRSWLKSSGNTLVLFEEIGGDPTKISFVTKQ 734
+D+C+YRG Y KC C PSQ YHVPRS+L NTL+LFEEIGG+P +SF+T +
Sbjct: 667 SDTCDYRGNYKKEKCNTGCASPSQRWYHVPRSFLNDDMNTLILFEEIGGNPQNVSFLT-E 725
Query: 735 LGSSLCSHVTDSHPLPVDMWGSDSKIQRKPGPVLSLECPNPNQVISSIKFASFGTPLGTC 794
++C++V + G L L C N QVI+SI FASFG P G C
Sbjct: 726 TTKTICANVYE-------------------GGKLELSCQN-GQVITSINFASFGNPQGQC 765
Query: 795 GSFSRGRCSSARSLSVVRQACVGSKSCSIGVSVNTFG---DP-------CKGVMKSLAVE 844
GSF +G S S S++ +C+G C V+ + FG DP K + LAV+
Sbjct: 766 GSFKKGSWESLNSQSMMETSCIGKTGCGFTVTRDMFGVNLDPLSASKASVKDGIPRLAVQ 825
Query: 845 ASC 847
A+C
Sbjct: 826 ATC 828
>gi|449436074|ref|XP_004135819.1| PREDICTED: beta-galactosidase-like [Cucumis sativus]
Length = 643
Score = 798 bits (2061), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 390/650 (60%), Positives = 463/650 (71%), Gaps = 16/650 (2%)
Query: 89 YNFEGRYDLVKFVKLVAEAGLYAHLRIGPYVCAEWNFGGFPLWLHFIPGIQFRTDNEPFK 148
YNFE RYDLV+FVKLV +AGLY HLRIGPYVCAEWNFGGFP+WL ++PGI FRTDN PFK
Sbjct: 6 YNFEDRYDLVRFVKLVHQAGLYVHLRIGPYVCAEWNFGGFPVWLKYVPGIAFRTDNGPFK 65
Query: 149 AEMQRFTAKIVDMMKQEKLYASQGGPIILSQIENEYGNIDSAYGAAGKSYIKWAAGMALS 208
A MQ+FT KIV +MK EKLY SQGGPIILSQIENEYG ++ GA GKSY KWAA MAL
Sbjct: 66 AAMQKFTEKIVGLMKGEKLYESQGGPIILSQIENEYGPVEWEIGAPGKSYTKWAAQMALG 125
Query: 209 LDTGVPWVMCQQSDAPDPIINTCNGFYCDQFTPNSNNKPKMWTENWSGWFLSFGGAVPYR 268
LDTGVPWVMC+Q DAPDP+I+TCNGFYC+ F PN KPKMWTE W+GWF FGG PYR
Sbjct: 126 LDTGVPWVMCKQDDAPDPVIDTCNGFYCENFKPNKVYKPKMWTEAWTGWFTEFGGPAPYR 185
Query: 269 PVEDLAFAVARFFQRGGTFQNYYMYHGGTNFDRTSGGPFISTSYDYDAPLDEYGLIRQPK 328
PVED+A++VARF Q GG+F NYYMYHGGTNF RT+GGPFI+TSYDYDAP+DEYGL+R+PK
Sbjct: 186 PVEDMAYSVARFIQNGGSFINYYMYHGGTNFGRTAGGPFIATSYDYDAPIDEYGLLREPK 245
Query: 329 WGHLKDLHKAIKLCEAALVATDPTYPSLGPNLEATVYKTGSGLCSAFLANIGTNSDVTVK 388
W HL+DLHKAIKLCE ALV+ DPT LG N EA V+KT SG C+AFLAN +S TV
Sbjct: 246 WSHLRDLHKAIKLCEPALVSVDPTVSYLGSNQEAHVFKTRSGSCAAFLANYDASSSATVT 305
Query: 389 FNGNSYLLPAWSVSILPDCKNVVFNTAKINSVTLVPSFSRQSLQVAADSSDAIGSGWSYI 448
F N Y LP WSVSILPDCK+V+FNTAK+ + T P + S S SY
Sbjct: 306 FGNNQYDLPPWSVSILPDCKSVIFNTAKVGAPTSQPKMTPVSSF----------SWLSYN 355
Query: 449 NEPVGISKDDAFTKPGLLEQINTTADQSDYLWYSLSTNIKADEPLLEDGSKTVLHVQSLG 508
E +D T GL+EQI+ T D +DYLWY I +E L+ G +L V S G
Sbjct: 356 EETASAYTEDTTTMAGLVEQISVTRDSTDYLWYMTDIRIDPNEGFLKSGQWPLLTVFSAG 415
Query: 509 HALHAFINGKLVGSGYGSSSNAKVTVDFPIALAPGKNTFDLLSLTVGLQNYGAFYEKTGA 568
HALH FING+L G+ YG S N K+T + L G N +LS+ VGL N G YE
Sbjct: 416 HALHVFINGQLSGTTYGGSENYKLTFSKYVNLRAGINKLSILSVAVGLPNGGLHYETWNT 475
Query: 569 GITGPVQLKGSGNGTNIDLSSQQWTYQTGLKGEELNFPS---GSSTQWDSKSTLPKLQPL 625
G+ GPV LKG T D+S +W+Y+ GLKGE LN S SS +W + S + + QPL
Sbjct: 476 GVLGPVTLKGLNEDTR-DMSGYKWSYKIGLKGEALNLHSVSGSSSVEWVTGSLVAQKQPL 534
Query: 626 VWYKTTFDAPAGSEPVAIDFTGMGKGEAWVNGQSIGRYWPTYVSQNGGCTDSCNYRGAYS 685
WYKTTFD+P G+EP+A+D + MGKG+ W+NGQSIGR+WP Y ++ G CNY G ++
Sbjct: 535 TWYKTTFDSPKGNEPLALDMSSMGKGQIWINGQSIGRHWPAYTAK--GSCGKCNYGGIFN 592
Query: 686 SNKCLKNCGKPSQSLYHVPRSWLKSSGNTLVLFEEIGGDPTKISFVTKQL 735
KC NCG+PSQ YHVPR+WLKSSGN LV+FEE GG+P IS V + +
Sbjct: 593 EKKCHSNCGEPSQRWYHVPRAWLKSSGNVLVIFEEWGGNPEGISLVKRSI 642
>gi|449435864|ref|XP_004135714.1| PREDICTED: LOW QUALITY PROTEIN: beta-galactosidase-like [Cucumis
sativus]
Length = 712
Score = 793 bits (2049), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 404/735 (54%), Positives = 503/735 (68%), Gaps = 31/735 (4%)
Query: 4 KEILLLVLCWGFVVLATTSFGANVTYDHRAVVIGGKRRVLISGSIHYPRSTPEMWPDLIQ 63
K +LL + + ++ GA VTYD +A++I +RR+LISGSIHYPRSTP+MWPDLIQ
Sbjct: 3 KTVLLFL---SLLTWVGSTIGA-VTYDEKAIIINDQRRILISGSIHYPRSTPQMWPDLIQ 58
Query: 64 KSKDGGLDVIETYVFWNLHEPVRNQYNFEG-RYDLVKFVKLVAEAGLYAHLRIGPYVCAE 122
K+KDGGLD+IETYVFWN HEP + +E Y+ + ++ + L P
Sbjct: 59 KAKDGGLDIIETYVFWNGHEPSEGKVTWEDFLYEQILYINC-----FHVALFXFPPYFXF 113
Query: 123 WNFGGFPLWLHFIPGIQFRTDNEPFKAEMQRFTAKIVDMMKQEKLYASQGGPIILSQIEN 182
F GFP+WL F+PGI FRTDNEPFKA MQ+F KIVDMMK EKLY +QGGPIILSQIEN
Sbjct: 114 QKFSGFPIWLKFVPGIAFRTDNEPFKAAMQKFVTKIVDMMKLEKLYHTQGGPIILSQIEN 173
Query: 183 EYGNIDSAYGAAGKSYIKWAAGMALSLDTGVPWVMCQQSDAPDPIINTCNGFYCDQFTPN 242
EYG ++ GA GKSY KW A MA+ L TGVPWVMC+Q DAPDP+I+TCNGFYC+ F PN
Sbjct: 174 EYGPVEWQIGAPGKSYTKWFAQMAVDLKTGVPWVMCKQEDAPDPLIDTCNGFYCENFKPN 233
Query: 243 SNNKPKMWTENWSGWFLSFGGAVPYRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNFDRT 302
KPK+WTENWSGW+ +FGG PYRP ED+AF+VARF Q G+ NYY+YHGGTNF RT
Sbjct: 234 QIYKPKIWTENWSGWYTAFGGPTPYRPPEDVAFSVARFIQNNGSLVNYYVYHGGTNFGRT 293
Query: 303 SGGPFISTSYDYDAPLDEYGLIRQPKWGHLKDLHKAIKLCEAALVATDPTYPSLGPNLEA 362
S G FI+TSYD+DAP+DEYGLIR+PKWGHL+DLHKAIKLCE ALV+ DPT LG N EA
Sbjct: 294 S-GLFIATSYDFDAPIDEYGLIREPKWGHLRDLHKAIKLCEPALVSADPTSTWLGKNQEA 352
Query: 363 TVYKTGSGLCSAFLANIGTNSDVTVKFNGNSYLLPAWSVSILPDCKNVVFNTAKINSVTL 422
V+K+ S C+AFLAN T++ V V F N Y LP WS+SILPDCK V FNTA+I
Sbjct: 353 RVFKSSSA-CAAFLANYDTSASVKVNFWNNPYDLPPWSISILPDCKTVTFNTAQIG---- 407
Query: 423 VPSFSRQSLQVAADSSDAIGSGW-SYINEPVGISKDDAFTKPGLLEQINTTADQSDYLWY 481
V S+ + + +++ GW SY EP D TK GL+EQ++ T D +DYLWY
Sbjct: 408 VKSYEAKMMPISS-------FGWLSYKEEPASAYAKDTTTKDGLVEQVSVTWDTTDYLWY 460
Query: 482 SLSTNIKADEPLLEDGSKTVLHVQSLGHALHAFINGKLVGSGYGSSSNAKVTVDFPIALA 541
+I + E L+ G +L V S GH LH FING+L GS YGS + ++T + L
Sbjct: 461 MQDISIDSTEGFLKSGKWPLLSVNSAGHLLHVFINGQLSGSVYGSLEDPRITFSKYVNLK 520
Query: 542 PGKNTFDLLSLTVGLQNYGAFYEKTGAGITGPVQLKGSGNGTNIDLSSQQWTYQTGLKGE 601
G N +LS+TVGL N G ++ AG+ GPV LKG GT D+S +W+Y+ GL GE
Sbjct: 521 QGVNKLSMLSVTVGLPNVGLHFDTWNAGVLGPVTLKGLNEGTR-DMSKYKWSYKVGLSGE 579
Query: 602 ELNFPS---GSSTQWDSKSTLPKLQPLVWYKTTFDAPAGSEPVAIDFTGMGKGEAWVNGQ 658
LN S +S QW +K +L + QPL WYKTTF PAG+EP+ +D + M KG+ WVNG+
Sbjct: 580 SLNLYSDKGSNSVQW-TKGSLTQKQPLTWYKTTFKTPAGNEPLGLDMSSMSKGQIWVNGR 638
Query: 659 SIGRYWPTYVSQNGGCTDSCNYRGAYSSNKCLKNCGKPSQSLYHVPRSWLKSSGNTLVLF 718
SIGRY+P Y++ NG C D C+Y G ++ KCL NCG+PSQ YH+PR WL S N LV+F
Sbjct: 639 SIGRYFPGYIA-NGKC-DKCSYAGLFTEKKCLGNCGEPSQKWYHIPRDWLSPSDNLLVIF 696
Query: 719 EEIGGDPTKISFVTK 733
EEIGG P IS V +
Sbjct: 697 EEIGGSPDGISLVKR 711
>gi|358348424|ref|XP_003638247.1| hypothetical protein MTR_122s1070, partial [Medicago truncatula]
gi|355504182|gb|AES85385.1| hypothetical protein MTR_122s1070, partial [Medicago truncatula]
Length = 771
Score = 786 bits (2030), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 402/726 (55%), Positives = 491/726 (67%), Gaps = 42/726 (5%)
Query: 43 LISGSIHYPRSTPEMWPDLIQKSKDGGLDVIETYVFWNLHEPVRNQYNFEGRYDLVKFVK 102
LIS SIHYPRS P MWP LIQ +K+GG+DVIETYVFWN HE Y F GR+DLV+F K
Sbjct: 1 LISASIHYPRSVP-MWPALIQTAKEGGIDVIETYVFWNGHELSPGNYYFGGRFDLVQFAK 59
Query: 103 LVAEAGLYAHLRIGPYVCAEWNFGG---------------------------------FP 129
+V +AG+Y LRIGP+V AEWNFGG P
Sbjct: 60 VVQDAGMYLILRIGPFVAAEWNFGGEKNGVLICEDGEERGYRERADKNNQGNSRVLCGVP 119
Query: 130 LWLHFIPGIQFRTDNEPFKAEMQRFTAKIVDMMKQEKLYASQGGPIILSQIENEYGNIDS 189
+WLH+IPG FRT N+PF M++FT IV++MK+EKL+ASQGGPIILSQIENEYG ++
Sbjct: 120 VWLHYIPGTVFRTYNQPFMHHMEKFTTYIVNLMKKEKLFASQGGPIILSQIENEYGYYEN 179
Query: 190 AYGAAGKSYIKWAAGMALSLDTGVPWVMCQQSDAPDPIINTCNGFYCDQFTPNSNNKPKM 249
Y GK Y WAA MA+S +T VPW+MCQQ DAPDP+I+TCN FYCDQFTP S +PKM
Sbjct: 180 YYKEDGKKYALWAAKMAVSQNTSVPWIMCQQWDAPDPVIDTCNSFYCDQFTPTSPKRPKM 239
Query: 250 WTENWSGWFLSFGGAVPYRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNFDRTSGGPFIS 309
WTENW GWF +FGG P+RPVED+AF+VARFFQ+GG+ NYYMYHGGTNF RT+GGPFI+
Sbjct: 240 WTENWPGWFKTFGGRDPHRPVEDVAFSVARFFQKGGSLNNYYMYHGGTNFGRTAGGPFIT 299
Query: 310 TSYDYDAPLDEYGLIRQPKWGHLKDLHKAIKLCEAALVATDPTYPSLGPNLEATVYKTGS 369
TSYDYDAP+DEYGL R PKWGHLK+LHKAIKLCE L+ SLGP++EA +Y S
Sbjct: 300 TSYDYDAPIDEYGLPRLPKWGHLKELHKAIKLCEHVLLYGKSVNISLGPSVEADIYTDSS 359
Query: 370 GLCSAFLANIGTNSDVTVKFNGNSYLLPAWSVSILPDCKNVVFNTAKINSVTLVPSFSRQ 429
G C+AF++N+ +D V F SY LPAWSVSILPDCKNVVFNTAK++S T + + +
Sbjct: 360 GACAAFISNVDDKNDKKVVFRNASYHLPAWSVSILPDCKNVVFNTAKVSSPTNIVAMIPE 419
Query: 430 SLQVAADSSDAIGSGWSYINEPVGISKDDAFTKPGLLEQINTTADQSDYLWYSLSTNIKA 489
LQ + + W E GI F K G ++ INTT D +DYLW++ S I A
Sbjct: 420 HLQQSDKGQKTL--KWDVFKENPGIWGKADFVKNGFVDHINTTKDTTDYLWHTTSILIDA 477
Query: 490 DEPLLEDGSKTVLHVQSLGHALHAFINGKLVGSGYGSSSNAKVTVDFPIALAPGKNTFDL 549
+E L+ GSK L ++S GH LHAF+N K G+G G+ S++ T PI+L GKN +
Sbjct: 478 NEEFLKKGSKPALLIESKGHTLHAFVNQKYQGTGTGNGSHSAFTFKNPISLRAGKNEIAI 537
Query: 550 LSLTVGLQNYGAFYEKTGAGITGPVQLKGSGNGTNIDLSSQQWTYQTGLKGEELNFPSG- 608
LSLTVGLQ G FY+ GAG+T V++ G N T IDLSS W Y+ G+ GE L+ G
Sbjct: 538 LSLTVGLQTAGPFYDFIGAGVTS-VKIIGLNNRT-IDLSSNAWAYKIGVLGEHLSIYQGE 595
Query: 609 --SSTQWDSKSTLPKLQPLVWYKTTFDAPAGSEPVAIDFTGMGKGEAWVNGQSIGRYWPT 666
+S +W S S PK Q L WYK DAP+G EPV +D MGKG AW+NG+ IGRYWP
Sbjct: 596 GMNSVKWTSTSEPPKGQALTWYKAIVDAPSGDEPVGLDMLYMGKGLAWLNGEEIGRYWPR 655
Query: 667 YVS-QNGGCTDSCNYRGAYSSNKCLKNCGKPSQSLYHVPRSWLKSSGNTLVLFEEIGGDP 725
+ C C+YRG ++ +KC CG+PSQ YHVPRSW K SGN LV+FEE GGDP
Sbjct: 656 ISEFKKEDCVQECDYRGKFNPDKCDTGCGEPSQKWYHVPRSWFKPSGNVLVIFEEKGGDP 715
Query: 726 TKISFV 731
TKI+FV
Sbjct: 716 TKITFV 721
>gi|22329897|ref|NP_683341.1| beta-galactosidase 15 [Arabidopsis thaliana]
gi|332193266|gb|AEE31387.1| beta-galactosidase 15 [Arabidopsis thaliana]
Length = 786
Score = 781 bits (2018), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 406/846 (47%), Positives = 525/846 (62%), Gaps = 91/846 (10%)
Query: 7 LLLVLCWGFVVLATTSFGANVTYDHRAVVIGGKRRVLISGSIHYPRSTPEMWPDLIQKSK 66
L +LC V++++ ++ V++D RA+ I G RRVL+SGSIHYPRST EMWPDLI+K K
Sbjct: 27 LSFILC--CVLVSSCAYATIVSHDGRAITIDGHRRVLLSGSIHYPRSTTEMWPDLIKKGK 84
Query: 67 DGGLDVIETYVFWNLHEPVRNQYNFEGRYDLVKFVKLVAEAGLYAHLRIGPYVCAEWNFG 126
+G LD IETYVFWN HEP R QY+F G DL++F+K + G+Y LRIGPYVCAEWN+G
Sbjct: 85 EGSLDAIETYVFWNAHEPTRRQYDFSGNLDLIRFLKTIQNEGMYGVLRIGPYVCAEWNYG 144
Query: 127 GFPLWLHFIPGIQFRTDNEPFKAEMQRFTAKIVDMMKQEKLYASQGGPIILSQIENEYGN 186
GFP+WLH +PG++FRT N F EMQ FT IV+M+K+EKL+ASQGGPIIL+QIENEYGN
Sbjct: 145 GFPVWLHNMPGMEFRTTNTAFMNEMQNFTTMIVEMVKKEKLFASQGGPIILAQIENEYGN 204
Query: 187 IDSAYGAAGKSYIKWAAGMALSLDTGVPWVMCQQSDAPDPIINTCNGFYCDQFTPNSNNK 246
+ +YG AGK+YI+W A MA SLD GVPW+MCQQ DAP P++NTCNG+YCD F+PN+ N
Sbjct: 205 VIGSYGEAGKAYIQWCANMANSLDVGVPWIMCQQDDAPQPMLNTCNGYYCDNFSPNNPNT 264
Query: 247 PKMWTENWSGWFLSFGGAVPYRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNFDRTSGGP 306
PKMWTENW+GW+ ++GG P+R ED+AFAVARFFQ+ GTFQNYYMYHGGTNFDRT+GGP
Sbjct: 265 PKMWTENWTGWYKNWGGKDPHRTTEDVAFAVARFFQKEGTFQNYYMYHGGTNFDRTAGGP 324
Query: 307 FISTSYDYDAPLDEYGLIRQPKWGHLKDLHKAIKLCEAALVATDPTYPSLGPNLEATVYK 366
+I+T+YDYDAPLDE+G + QPK+GHLK LH + E L + + G + ATVY+
Sbjct: 325 YITTTYDYDAPLDEFGNLNQPKYGHLKQLHDVLHAMEKTLTYGNISTVDFGNLVTATVYQ 384
Query: 367 TGSGLCSAFLANIGTNSDVTVKFNGNSYLLPAWSVSILPDCKNVVFNTAKINSVTLVPSF 426
T G S F+ N+ SD + F G SY +PAWSVSILPDCK +NTAKIN+ T S
Sbjct: 385 TEEG-SSCFIGNVNETSDAKINFQGTSYDVPAWSVSILPDCKTETYNTAKINTQT---SV 440
Query: 427 SRQSLQVAADSSDAIGSGWSYIN-EPVGISKDDAFTKPGLLEQINTTADQSDYLWYSLST 485
+ A + + W N + V + T L +Q + D+SDYLWY +
Sbjct: 441 MVKKANEAENEPSTLKWSWRPENIDSVLLKGKGESTMRQLFDQKVVSNDESDYLWYMTTV 500
Query: 486 NIKADEPLLEDGSKTVLHVQSLGHALHAFINGKLVGSGYGSSSNAKVTVDFPIALAPGKN 545
N+K +P+L G L + S H LHAF+NG+ +G+ + + PG N
Sbjct: 501 NLKEQDPVL--GKNMSLRINSTAHVLHAFVNGQHIGNYRVENGKFHYVFEQDAKFNPGAN 558
Query: 546 TFDLLSLTVGLQNYGAFYEKTGAGITGPVQLKGSGNGTNI--DLSSQQWTYQTGLKGEEL 603
LLS+TVGL NYGAF+E AGITGPV + G I DLS+ +W+Y+TGL G E
Sbjct: 559 VITLLSITVGLPNYGAFFENFSAGITGPVFIIGRNGDETIVKDLSTHKWSYKTGLSGFEN 618
Query: 604 NFPSGSSTQWDSKSTLPKLQPLVWYKTTFDAPAGSEPVAIDFTGMGKGEAWVNGQSIGRY 663
S S +T+ AP GSEPV +D G+GKG AW+NG +IGRY
Sbjct: 619 QLFSSES------------------PSTWSAPLGSEPVVVDLLGLGKGTAWINGNNIGRY 660
Query: 664 WPTYVSQNGGCTDSCNYRGAYSSNKCLKNCGKPSQSLYHVPRSWLKSSGNTLVLFEEIGG 723
WP ++S G NTLVLFEEIGG
Sbjct: 661 WPAFLSDIDG--------------------------------------DNTLVLFEEIGG 682
Query: 724 DPTKISFVTKQLGSSLCSHVTDSHPLPVDMWGSDSKIQRKPGPVLSLECPNPNQVISSIK 783
+P+ ++F T +G S+C++V + + VL L C + IS+IK
Sbjct: 683 NPSLVNFQTIGVG-SVCANVYEKN-------------------VLELSC--NGKPISAIK 720
Query: 784 FASFGTPLGTCGSFSRGRC-SSARSLSVVRQACVGSKSCSIGVSVNTFG-DPCKGVMKSL 841
FASFG P G CGSF +G C +S + +++ Q CVG + CSI VS + FG C + K L
Sbjct: 721 FASFGNPGGDCGSFEKGTCEASNNAAAILTQECVGKEKCSIDVSEDKFGAAECGALAKRL 780
Query: 842 AVEASC 847
AVEA C
Sbjct: 781 AVEAIC 786
>gi|125556152|gb|EAZ01758.1| hypothetical protein OsI_23787 [Oryza sativa Indica Group]
Length = 828
Score = 781 bits (2016), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 405/838 (48%), Positives = 522/838 (62%), Gaps = 51/838 (6%)
Query: 24 GANVTYDHRAVVIGGKRRVLISGSIHYPRSTPEMWPDLIQKSKDGGLDVIETYVFWNLHE 83
G VTY+ R++VI G+RR++ISGSIHYPRSTPEMWPDLI+K+K+GGLD IETYVFWN HE
Sbjct: 28 GTTVTYNDRSLVIDGERRIIISGSIHYPRSTPEMWPDLIKKAKEGGLDAIETYVFWNGHE 87
Query: 84 PVRNQYNFEGRYDLVKFVKLVAEAGLYAHLRIGPYVCAEWNFGGFPLWLHFIPGIQFRTD 143
P R QYNF G YD+V+F K + AGLYA LRIGPY+C EWN+GG P WL IPG+QFR
Sbjct: 88 PHRRQYNFVGNYDIVRFFKEIQNAGLYAILRIGPYICGEWNYGGLPAWLRDIPGMQFRLH 147
Query: 144 NEPFKAEMQRFTAKIVDMMKQEKLYASQGGPIILSQIENEYGNIDSAY--GAAGKSYIKW 201
N PF+ EM+ FT IV+ MK ++A QGGPIIL+QIENEYGNI + YI W
Sbjct: 148 NAPFENEMEIFTTLIVNKMKDANMFAGQGGPIILAQIENEYGNIMGQLNNNQSASEYIHW 207
Query: 202 AAGMALSLDTGVPWVMCQQ-SDAPDPIINTCNGFYCDQFTPNSNNKPKMWTENWSGWFLS 260
A MA + GVPW+MCQQ SD P ++NTCNGFYC + PN PK+WTENW+GWF +
Sbjct: 208 CADMANKQNVGVPWIMCQQDSDVPHNVVNTCNGFYCHDWFPNRTGIPKIWTENWTGWFKA 267
Query: 261 FGGAVPYRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNFDRTSGGPFISTSYDYDAPLDE 320
+ +R ED+AFAVA FFQ+ G+ QNYYMYHGGTNF RTSGGP+I+TSYDYDAPLDE
Sbjct: 268 WDKPDFHRSAEDIAFAVAMFFQKRGSLQNYYMYHGGTNFGRTSGGPYITTSYDYDAPLDE 327
Query: 321 YGLIRQPKWGHLKDLHKAIKLCEAALVATDPTYPSLGPNLEATVYKTGSGLCSAFLANIG 380
YG +RQPK+GHLKDLH IK E LV + + + T Y S + F+ N
Sbjct: 328 YGNLRQPKYGHLKDLHSVIKSIEKILVHGEYVDTNYSDKVTVTKYTLDS-TSACFINNRN 386
Query: 381 TNSDVTVKFNGNSYLLPAWSVSILPDCKNVVFNTAKINSVTLVPSFSRQSLQVAADSSDA 440
N DV V +G ++LLPAWSVSILPDCK V FN+AKI + T V ++ +S
Sbjct: 387 DNMDVNVTLDGTTHLLPAWSVSILPDCKTVAFNSAKIKAQTTVMVNKANMVEKEPESLK- 445
Query: 441 IGSGWSYINE---PVGISKDDAFTKPGLLEQINTTADQSDYLWYSLSTNIKADEPLLEDG 497
WS++ E P + ++ K LLEQI T+ DQSDYLWY S N K +
Sbjct: 446 ----WSWMRENLTPFMTDEKGSYRKNELLEQIVTSTDQSDYLWYRTSINHKGE------- 494
Query: 498 SKTVLHVQSLGHALHAFINGKLVGSGYGSSSNAKVTVDFPIALAPGKNTFDLLSLTVGLQ 557
+ L V + GH L+AF+NG LVG + + + ++ P L GKN LLS T+GL+
Sbjct: 495 ASYTLFVNTTGHELYAFVNGMLVGQNHSPNGHFVFQLESPAKLHDGKNYISLLSATIGLK 554
Query: 558 NYGAFYEKTGAGIT-GPVQLKGSGNGTNIDLSSQQWTYQTGLKGE--ELNFPSGSSTQWD 614
NYG +EK AGI GPV+L NG IDLS+ W+Y+ GL GE +++ T +
Sbjct: 555 NYGPLFEKMPAGIVGGPVKLI-DNNGKGIDLSNSSWSYKAGLAGEYRQIHLDKPGCTWDN 613
Query: 615 SKSTLPKLQPLVWYKTTFDAPAGSEPVAIDFTGMGKGEAWVNGQSIGRYWPTYVSQNGGC 674
+ T+P +P WYKTTF APAG + V +D G+ KG AWVNG ++GRYWP+Y + G
Sbjct: 614 NNGTVPINKPFTWYKTTFQAPAGEDTVVVDLLGLNKGVAWVNGNNLGRYWPSYTAAEMGG 673
Query: 675 TDSCNYRGAYSS----NKCLKNCGKPSQSLYHVPRSWLKSSG-NTLVLFEEIGGDPTKIS 729
C+YRG + + KCL CG+PSQ YHVPRS+LK+ NTL+LFEE GGDP+ +S
Sbjct: 674 CHHCDYRGVFQAEGDGQKCLTGCGEPSQRFYHVPRSFLKNGEPNTLILFEEAGGDPSHVS 733
Query: 730 FVTKQLGSSLCSHVTDSHPLPVDMWGSDSKIQRKPGPVLSLECPNPNQVISSIKFASFGT 789
F T GS S + G ++L C ++ IS+I SFG
Sbjct: 734 FRTVAAGSVCAS--------------------AEVGDTITLSCGQHSKTISAINMTSFGV 773
Query: 790 PLGTCGSFSRGRCSSARSLSVVRQACVGSKSCSIGVSVNTFGDPCKGVMKSLAVEASC 847
G CG++ +G C S + +AC+G +SC++ ++ G C + L V+ASC
Sbjct: 774 ARGQCGAY-KGGCESKAAYKAFTEACLGKESCTVQITNAVTGSGC--LSNVLTVQASC 828
>gi|357455519|ref|XP_003598040.1| Beta-galactosidase [Medicago truncatula]
gi|355487088|gb|AES68291.1| Beta-galactosidase [Medicago truncatula]
Length = 812
Score = 778 bits (2010), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 407/852 (47%), Positives = 528/852 (61%), Gaps = 75/852 (8%)
Query: 8 LLVLCWGFVVLATTSFGANVTYDHRAVVIGGKRRVLISGSIHYPRSTPEMWPDLIQKSKD 67
L+ C +L T S V YD A+++ G+R+++ISG+IHYPRST +MWPDLI K+KD
Sbjct: 9 FLIAC--LALLYTCSSATTVEYDSSAIILNGERKLIISGAIHYPRSTSQMWPDLIMKAKD 66
Query: 68 GGLDVIETYVFWNLHEPVRNQYNFEGRYDLVKFVKLVAEAGLYAHLRIGPYVCAEWNFGG 127
G LD IETY+FW+LHEPVR +Y+F G D +KF+K+ E GLY LRIGPYVCAEWN+GG
Sbjct: 67 GDLDAIETYIFWDLHEPVRRKYDFSGNLDFIKFLKIAQEQGLYVVLRIGPYVCAEWNYGG 126
Query: 128 FPLWLHFIPGIQFRTDNEPFKAEMQRFTAKIVDMMKQEKLYASQGGPIILSQIENEYGNI 187
FP+WLH +PGIQ RTDN FK EM+ FT KIV M K+ L+A QGGPIIL+QIENEYG++
Sbjct: 127 FPMWLHNMPGIQLRTDNAVFKEEMKIFTTKIVTMCKEAGLFAPQGGPIILAQIENEYGDV 186
Query: 188 DSAYGAAGKSYIKWAAGMALSLDTGVPWVMCQQSDAPDPIINTCNGFYCDQFTPNSNNKP 247
S YG AG SYIKW A MAL+ + GVPW+MC+Q +AP II+TCNG+YCD F PN+ P
Sbjct: 187 ISHYGEAGNSYIKWCAEMALAQNIGVPWIMCKQKNAPATIIDTCNGYYCDTFKPNNPKSP 246
Query: 248 KMWTENWSGWFLSFGGAVPYRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNFDRTSGGPF 307
K++TENW GWF +G P+R ED AF+VARFFQ GG QNYY+YHGGTNF RT+GGPF
Sbjct: 247 KIFTENWVGWFQKWGERRPHRTAEDSAFSVARFFQNGGALQNYYLYHGGTNFGRTAGGPF 306
Query: 308 ISTSYDYDAPLDEYGLIRQPKWGHLKDLHKAIKLCEAALVATDPTYPSLGPNLEATVY-K 366
I T+YDYDAPLDEYG + +PK+GHLK LH AIKL E L T+ S G +L T Y
Sbjct: 307 IITTYDYDAPLDEYGNLIEPKYGHLKRLHAAIKLGEKVLTNGTATWESHGDSLWMTTYTN 366
Query: 367 TGSGLCSAFLANIGTNSDVTVKFNGN-SYLLPAWSVSILPDCKNVVFNTAKINSVTLVPS 425
G+G FL+N T+ D V + Y +PAWS+S+L DC V+NTAK + T +
Sbjct: 367 KGTGQKFCFLSNSHTSKDAEVDLQQDGKYYVPAWSMSLLQDCNKEVYNTAKTEAQTNI-- 424
Query: 426 FSRQSLQVAADSSDAIGSGWSYINEPV--GISKDDAFTKPGLLEQINTTADQSDYLWYSL 483
+ +Q Q +S + WS+ ++P+ FT LL+Q + T SDYLWY
Sbjct: 425 YMKQLDQKLGNSPE-----WSWTSDPMEDTFQGKGTFTASQLLDQKSVTVGASDYLWY-- 477
Query: 484 STNIKADEPLLEDGS---KTVLHVQSLGHALHAFINGKLVGSGYGSSSNAKVTVDFPIAL 540
E ++ D + K + V + GH L+ FING L G+ +G+ S + I+L
Sbjct: 478 -----MTEVVVNDTNTWGKAKVQVNTTGHILYLFINGFLTGTQHGTVSQPGFIHEGNISL 532
Query: 541 APGKNTFDLLSLTVGLQNYGAFYEKTGAGIT-GPVQLKGSGNGTNI-DLSSQQWTYQTGL 598
G N LLS+TVG NYGAF++ GI GPV+L N N+ DLS W+Y+ G+
Sbjct: 533 NQGTNIISLLSVTVGHANYGAFFDMQETGIVGGPVKLFSIENPNNVLDLSKSTWSYKVGI 592
Query: 599 KGEELNFPSGSST---QWDSKSTLPKLQPLVWYKTTFDAPAGSEPVAIDFTGMGKGEAWV 655
G F +T QW + + + P+ WYKTTF P G+ PV +D G+ KGEAWV
Sbjct: 593 NGMTKKFYDPKTTIGVQWKTNNVSIGV-PMTWYKTTFKTPDGTNPVVLDLIGLQKGEAWV 651
Query: 656 NGQSIGRYWPTYVSQNGGCTDSCNYRGAYSSNKCLKNCGKPSQSLYHVPRSWLKSSGNTL 715
NGQSIGRYWP +++N GC+D+C+YRG Y+++KCL CG+PSQ YHVPRS+L + NTL
Sbjct: 652 NGQSIGRYWPAMLAENKGCSDTCDYRGEYNADKCLSGCGEPSQRFYHVPRSFLNNDVNTL 711
Query: 716 VLFEEIGGDPTKISFVTKQLGSSLCSHVTDSHPLPVDMWGSDSKIQRKPGPVLSLECPNP 775
VLFEE+G D T P
Sbjct: 712 VLFEEMGFDAT----------------------------------------------PFN 725
Query: 776 NQVISSIKFASFGTPLGTCGSFSRGRCSSARSLSVVRQACVGSKSCSIGVSVNTFGDPCK 835
+ +S I+FAS+G P G+CGSF G S S +VV +AC+G +SCSI V+ +TF
Sbjct: 726 GKTMSEIQFASYGDPEGSCGSFKIGEWESRYSKTVVEKACIGKQSCSINVTSSTFRLKKG 785
Query: 836 GVMKSLAVEASC 847
G LAV+ SC
Sbjct: 786 GTNGQLAVQLSC 797
>gi|156106159|gb|ABU49386.1| beta-galactosidase 15 [Oryza sativa Indica Group]
Length = 828
Score = 778 bits (2009), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 406/841 (48%), Positives = 530/841 (63%), Gaps = 57/841 (6%)
Query: 24 GANVTYDHRAVVIGGKRRVLISGSIHYPRSTPEMWPDLIQKSKDGGLDVIETYVFWNLHE 83
G VTY+ R++VI G+RR++ISGSIHYPRSTPEMWPDLI+K+K+GGLD IETYVFWN HE
Sbjct: 28 GTTVTYNDRSLVIDGERRIIISGSIHYPRSTPEMWPDLIKKAKEGGLDAIETYVFWNGHE 87
Query: 84 PVRNQYNFEGRYDLVKFVKLVAEAGLYAHLRIGPYVCAEWNFGGFPLWLHFIPGIQFRTD 143
P R QYNF G YD+V+F K + AGLYA LRIGPY+C EWN+GG P WL IPG+QFR
Sbjct: 88 PHRRQYNFVGNYDIVRFFKEIQNAGLYAILRIGPYICGEWNYGGLPAWLRDIPGMQFRLH 147
Query: 144 NEPFKAEMQRFTAKIVDMMKQEKLYASQGGPIILSQIENEYGNIDSAY--GAAGKSYIKW 201
N PF+ EM+ FT IV+ MK ++A QGGPIIL+QIENEYGNI + YI W
Sbjct: 148 NAPFENEMEIFTTLIVNKMKDANMFAGQGGPIILAQIENEYGNIMGQLNNNQSASEYIHW 207
Query: 202 AAGMALSLDTGVPWVMCQQ-SDAPDPIINTCNGFYCDQFTPNSNNKPKMWTENWSGWFLS 260
A MA + GVPW+MCQQ SD P ++NTCNGFYC + PN PK+WTENW+GWF +
Sbjct: 208 CADMANKQNVGVPWIMCQQDSDVPHNVVNTCNGFYCHDWFPNRTGIPKIWTENWTGWFKA 267
Query: 261 FGGAVPYRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNFDRTSGGPFISTSYDYDAPLDE 320
+ +R ED+AFAVA FFQ+ G+ QNYYMYHGGTNF RTSGGP+I+TSYDYDAPLDE
Sbjct: 268 WDKPDFHRSAEDIAFAVAMFFQKRGSLQNYYMYHGGTNFGRTSGGPYITTSYDYDAPLDE 327
Query: 321 YGLIRQPKWGHLKDLHKAIKLCEAALVATDPTYPSLGPNLEATVYKTGSGLCSAFLANIG 380
YG +RQPK+GHLKDLH IK E LV + + N+ T Y GS + F+ N
Sbjct: 328 YGNLRQPKYGHLKDLHSVIKSIEKILVHGEYVDTNYSDNVTVTKYTLGS-TSACFINNRN 386
Query: 381 TNSDVTVKFNGNSYLLPAWSVSILPDCKNVVFNTAKINSVTLVPSFSRQSLQVAADSSDA 440
N D+ V +GN++LLPAWSVSILPDCK V FN+AKI + T + +++ V + +
Sbjct: 387 DNKDLNVTLDGNTHLLPAWSVSILPDCKTVAFNSAKIKAQTTI--MVKKANMVEKEPENL 444
Query: 441 IGSGWSYINE---PVGISKDDAFTKPGLLEQINTTADQSDYLWYSLSTNIKADEPLLEDG 497
WS++ E P + ++ K LLEQI T+ DQSDYLWY S + K +
Sbjct: 445 ---KWSWMRENLTPFMTDEKGSYRKNELLEQIVTSTDQSDYLWYRTSLDHKGE------- 494
Query: 498 SKTVLHVQSLGHALHAFINGKLVGSGYGSSSNAKVTVDFPIALAPGKNTFDLLSLTVGLQ 557
+ L V + GH L+AF+NG LVG + + + ++ + L GKN LLS T+GL+
Sbjct: 495 ASYTLFVNTTGHELYAFVNGMLVGKNHSPNGHFVFQLESAVKLHDGKNYISLLSATIGLK 554
Query: 558 NYGAFYEKTGAGIT-GPVQLKGSGNGTNIDLSSQQWTYQTGLKGE----ELNFPSGSSTQ 612
NYG +EK AGI GPV+L NGT IDLS+ W+Y+ GL GE L+ P +
Sbjct: 555 NYGPLFEKMPAGIVGGPVKLI-DNNGTGIDLSNSSWSYKAGLAGEYRQIHLDKP---GYR 610
Query: 613 WDSKS-TLPKLQPLVWYKTTFDAPAGSEPVAIDFTGMGKGEAWVNGQSIGRYWPTYVSQN 671
WD+ + T+P +P WYKTTF APAG + V +D G+ KG AWVNG ++GRYWP+Y +
Sbjct: 611 WDNNNGTVPINRPFTWYKTTFQAPAGQDTVVVDLLGLNKGVAWVNGNNLGRYWPSYTAAE 670
Query: 672 GGCTDSCNYRGAYSS----NKCLKNCGKPSQSLYHVPRSWLKSSG-NTLVLFEEIGGDPT 726
G C+YRG + + KCL CG+PSQ YHVPRS+LK+ NTL+LFEE GGDP+
Sbjct: 671 MGGCHHCDYRGVFQAEGDGQKCLTGCGEPSQRYYHVPRSFLKNGEPNTLILFEEAGGDPS 730
Query: 727 KISFVTKQLGSSLCSHVTDSHPLPVDMWGSDSKIQRKPGPVLSLECPNPNQVISSIKFAS 786
++ F + G S+C + + G ++L C ++ IS+I S
Sbjct: 731 QVIFHSVVAG-SVC-------------------VSAEVGDAITLSCGQHSKTISTIDVTS 770
Query: 787 FGTPLGTCGSFSRGRCSSARSLSVVRQACVGSKSCSIGVSVNTFGDPCKGVMKSLAVEAS 846
FG G CG++ G C S + +AC+G +SC++ + G C + L V+AS
Sbjct: 771 FGVARGQCGAY-EGGCESKAAYKAFTEACLGKESCTVQIINALTGSGC--LSGVLTVQAS 827
Query: 847 C 847
C
Sbjct: 828 C 828
>gi|218184317|gb|EEC66744.1| hypothetical protein OsI_33101 [Oryza sativa Indica Group]
Length = 824
Score = 776 bits (2003), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 402/841 (47%), Positives = 529/841 (62%), Gaps = 57/841 (6%)
Query: 24 GANVTYDHRAVVIGGKRRVLISGSIHYPRSTPEMWPDLIQKSKDGGLDVIETYVFWNLHE 83
G V Y+ R++VI G+RR++ISGSIHYPRSTPEMWPDLI+K+K+GGLD IETYVFWN HE
Sbjct: 24 GTTVAYNDRSLVIDGERRIIISGSIHYPRSTPEMWPDLIKKAKEGGLDAIETYVFWNGHE 83
Query: 84 PVRNQYNFEGRYDLVKFVKLVAEAGLYAHLRIGPYVCAEWNFGGFPLWLHFIPGIQFRTD 143
P R QYNFEG YD+++F K + AGLYA LRIGPY+C EWN+GG P WL IP +QFR
Sbjct: 84 PHRRQYNFEGNYDIIRFFKEIQNAGLYAILRIGPYICGEWNYGGLPAWLRDIPQMQFRMH 143
Query: 144 NEPFKAEMQRFTAKIVDMMKQEKLYASQGGPIILSQIENEYGNIDSAY--GAAGKSYIKW 201
N PF+ EM+ FT I++ MK ++A QGGPIIL+QIENEYGN+ + YI W
Sbjct: 144 NAPFENEMENFTTLIINKMKDANMFAGQGGPIILAQIENEYGNVMGQLNNNQSASEYIHW 203
Query: 202 AAGMALSLDTGVPWVMCQQ-SDAPDPIINTCNGFYCDQFTPNSNNKPKMWTENWSGWFLS 260
A MA + GVPW+MCQQ SD P ++NTCNGFYC + PN PK+WTENW+GWF +
Sbjct: 204 CADMANKQNVGVPWIMCQQDSDVPHNVVNTCNGFYCHDWFPNRTGIPKIWTENWTGWFKA 263
Query: 261 FGGAVPYRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNFDRTSGGPFISTSYDYDAPLDE 320
+ +R ED+AFAVA FFQ+ G+ QNYYMYHGGTNF RTSGGP+I+TSYDYDAPLDE
Sbjct: 264 WDKPDFHRSAEDIAFAVAMFFQKRGSLQNYYMYHGGTNFGRTSGGPYITTSYDYDAPLDE 323
Query: 321 YGLIRQPKWGHLKDLHKAIKLCEAALVATDPTYPSLGPNLEATVYKTGSGLCSAFLANIG 380
YG +RQPK+GHLKDLH IK E LV + + N+ T Y GS + F+ N
Sbjct: 324 YGNLRQPKYGHLKDLHSVIKSIEKILVHGEYVDTNYSDNVTVTKYTLGS-TSACFINNRN 382
Query: 381 TNSDVTVKFNGNSYLLPAWSVSILPDCKNVVFNTAKINSVTLVPSFSRQSLQVAADSSDA 440
N D+ V +GN++LLPAWSVSILPDCK V FN+AKI + T + +++ V + +
Sbjct: 383 DNKDLNVTLDGNTHLLPAWSVSILPDCKTVAFNSAKIKAQTTI--MVKKANMVEKEPENL 440
Query: 441 IGSGWSYINE---PVGISKDDAFTKPGLLEQINTTADQSDYLWYSLSTNIKADEPLLEDG 497
WS++ E P + ++ K LLEQI T+ DQSDYLWY S + K +
Sbjct: 441 ---KWSWMRENLTPFMTDEKGSYRKNELLEQIVTSTDQSDYLWYRTSLDHKGE------- 490
Query: 498 SKTVLHVQSLGHALHAFINGKLVGSGYGSSSNAKVTVDFPIALAPGKNTFDLLSLTVGLQ 557
+ L V + GH L+AF+NG LVG + + + ++ + L GKN LLS T+GL+
Sbjct: 491 ASYTLFVNTTGHELYAFVNGMLVGKNHSPNGHFVFQLESAVKLHDGKNYISLLSATIGLK 550
Query: 558 NYGAFYEKTGAGIT-GPVQLKGSGNGTNIDLSSQQWTYQTGLKGE----ELNFPSGSSTQ 612
NYG +EK AGI GPV+L NGT IDLS+ W+Y+ GL GE L+ P +
Sbjct: 551 NYGPLFEKMPAGIVGGPVKLI-DNNGTGIDLSNSSWSYKAGLAGEYRQIHLDKP---GYR 606
Query: 613 WDSKS-TLPKLQPLVWYKTTFDAPAGSEPVAIDFTGMGKGEAWVNGQSIGRYWPTYVSQN 671
WD+ + T+P +P WYKTTF APAG + V +D G+ KG AWVNG ++GRYWP+Y +
Sbjct: 607 WDNNNGTVPINRPFTWYKTTFQAPAGQDTVVVDLLGLNKGVAWVNGNNLGRYWPSYTAAE 666
Query: 672 GGCTDSCNYRGAYSS----NKCLKNCGKPSQSLYHVPRSWLKSSG-NTLVLFEEIGGDPT 726
G C+YRG + + KCL CG+PSQ YHVPRS+LK+ NTL+LFEE GGDP+
Sbjct: 667 MGGCHHCDYRGVFQAEGDGQKCLTGCGEPSQRYYHVPRSFLKNGEPNTLILFEEAGGDPS 726
Query: 727 KISFVTKQLGSSLCSHVTDSHPLPVDMWGSDSKIQRKPGPVLSLECPNPNQVISSIKFAS 786
++ F + G S+C + + G ++L C ++ IS+I S
Sbjct: 727 QVIFHSVVAG-SVC-------------------VSAEVGDAITLSCGQHSKTISTIDVTS 766
Query: 787 FGTPLGTCGSFSRGRCSSARSLSVVRQACVGSKSCSIGVSVNTFGDPCKGVMKSLAVEAS 846
FG G CG++ G C S + +AC+G +SC++ + G C + L V+AS
Sbjct: 767 FGVARGQCGAY-EGGCESKAAYKAFTEACLGKESCTVQIINALTGSGC--LSGVLTVQAS 823
Query: 847 C 847
C
Sbjct: 824 C 824
>gi|255561536|ref|XP_002521778.1| beta-galactosidase, putative [Ricinus communis]
gi|223538991|gb|EEF40588.1| beta-galactosidase, putative [Ricinus communis]
Length = 828
Score = 776 bits (2003), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 408/836 (48%), Positives = 534/836 (63%), Gaps = 41/836 (4%)
Query: 24 GANVTYDHRAVVIGGKRRVLISGSIHYPRSTPEMWPDLIQKSKDGGLDVIETYVFWNLHE 83
G +VTYD R++++ G+R++L SGSIHYPRSTPEMW LI K+K+GGLDVI+TYVFWNLHE
Sbjct: 21 GGDVTYDGRSLIVDGQRKLLFSGSIHYPRSTPEMWQSLIAKAKEGGLDVIDTYVFWNLHE 80
Query: 84 PVRNQYNFEGRYDLVKFVKLVAEAGLYAHLRIGPYVCAEWNFGGFPLWLHFIPGIQFRTD 143
P QY+F GR D+V+F+K V GLY LRIGP++ EW++GG P WLH IPGI FR+D
Sbjct: 81 PQPGQYDFSGRRDIVRFIKEVQAQGLYVCLRIGPFIQGEWSYGGLPFWLHDIPGIVFRSD 140
Query: 144 NEPFKAEMQRFTAKIVDMMKQEKLYASQGGPIILSQIENEYGNIDSAYGAAGKSYIKWAA 203
NEPFK +MQ FT KIV MM+ EKLY SQGGPIILSQIENEYG ++ AY G +Y+KWAA
Sbjct: 141 NEPFKVQMQGFTTKIVTMMQSEKLYVSQGGPIILSQIENEYGTVEEAYHEKGPAYVKWAA 200
Query: 204 GMALSLDTGVPWVMCQQSDAPDPIINTCNGFYCDQ--FTPNSNNKPKMWTENWSGWFLSF 261
MA+ L+TGVPWVMC+Q+DAPDP+IN CNG C + PNS NKP +WTENW+ ++
Sbjct: 201 QMAVGLNTGVPWVMCKQNDAPDPVINACNGLRCAETFVGPNSPNKPAIWTENWTTRYVIT 260
Query: 262 GGAVPYRPVEDLAFAVARFF-QRGGTFQNYYMYHGGTNFDRTSGGPFISTSYDYDAPLDE 320
G + R VED+AF V +F + G+F NYYMYHGGTNF RT+ F+ TSY AP+DE
Sbjct: 261 GENIRIRSVEDIAFQVTQFIVAKKGSFVNYYMYHGGTNFGRTASA-FVPTSYYDQAPIDE 319
Query: 321 YGLIRQPKWGHLKDLHKAIKLCEAALVATDPTYPSLGPNLEATVYKTGSGLCSAFLANIG 380
YGLIRQPKWGHLK++H AIKLC L++ SLG +A V+ SG C+AFL N
Sbjct: 320 YGLIRQPKWGHLKEMHAAIKLCLTPLLSGGQVTISLGQQQQAFVFTGLSGECAAFLLNND 379
Query: 381 TNSDVTVKFNGNSYLLPAWSVSILPDCKNVVFNTAKINSVTLVPSFSRQSLQVAADSSDA 440
T + +V+F SY LP S+SILPDCK V FNTAK+++ S +R L D
Sbjct: 380 TANTASVQFRNASYDLPPNSISILPDCKTVAFNTAKVSTQYTTRSMTRSKLLDGEDK--- 436
Query: 441 IGSGWSYINEPVGISKDDAFTKPGLLEQINTTADQSDYLWYSLSTNIKADEPLLEDGSKT 500
W E + + + +LEQ++TT D SDYLWY+ ++ + ++
Sbjct: 437 ----WVQYQEAIVNFDETSVKSEAILEQMSTTKDASDYLWYTFRFQQESSD------TQA 486
Query: 501 VLHVQSLGHALHAFINGKLVGSGYGSSSNAKVTVDFPIALAPGKNTFDLLSLTVGLQNYG 560
VL+V+SLGH LHAF+NG+ VG GS N + T+ ++L+ G N LLS+ VG+ + G
Sbjct: 487 VLNVRSLGHVLHAFVNGQAVGYAQGSHKNPQFTLQSTVSLSEGVNNVSLLSVMVGMPDSG 546
Query: 561 AFYEKTGAGITGPVQLKGSGNGTNIDLSSQQWTYQTGLKGEELNF---PSGSSTQWDSKS 617
A+ E+ AG+ ++K N + ++ W YQ GL GE+L S QW + S
Sbjct: 547 AYMERRAAGLR---KVKIQEKEGNKEFTNYSWGYQVGLLGEKLQIFTDQGSSQVQWANFS 603
Query: 618 TLPKLQPLVWYKTTFDAPAGSEPVAIDFTGMGKGEAWVNGQSIGRYWPTYVSQNGGCTDS 677
L PL WYKT FDAP PVA++ MGKGEAWVNGQSIGRYWP+Y + +G
Sbjct: 604 K-NALNPLTWYKTLFDAPLEDAPVALNLGSMGKGEAWVNGQSIGRYWPSYRASDGSSQIW 662
Query: 678 CNYRGAYSSNKCLKNCGKPSQSLYHVPRSWLKSSGNTLVLFEEIGGDPTKISFVTKQLGS 737
Y + + ++ Y+VPRS+LK GN LV+ EE GG+P +IS T + S
Sbjct: 663 YAYFNTGAIFRAVR---------YNVPRSFLKPKGNLLVVLEESGGNPLQISVDTASI-S 712
Query: 738 SLCSHVTDSHPLPVDMW----GSDSKIQRKPGPVLSLECPNPNQVISSIKFASFGTPLGT 793
+CSHVT SH V W +D+ + P + L+CP+ N IS+I FAS+GTP GT
Sbjct: 713 KICSHVTASHLPLVSSWSKRTNTDNNNSLQARPRVKLDCPS-NTKISNILFASYGTPEGT 771
Query: 794 CG-SFSRGRCSSARSLSVVRQACVGSKSCSIGVSVNTF-GDPCKGVMKSLAVEASC 847
CG +++ G C S+ S ++V++AC+G CSI VS F GDPC KSL V A C
Sbjct: 772 CGDAYAVGMCHSSSSEAIVQKACLGQMRCSIPVSSKYFGGDPCSANEKSLLVVAEC 827
>gi|115481546|ref|NP_001064366.1| Os10g0330600 [Oryza sativa Japonica Group]
gi|122249227|sp|Q7G3T8.1|BGL13_ORYSJ RecName: Full=Beta-galactosidase 13; Short=Lactase 13; Flags:
Precursor
gi|110288895|gb|AAP53027.2| Beta-galactosidase precursor, putative, expressed [Oryza sativa
Japonica Group]
gi|113638975|dbj|BAF26280.1| Os10g0330600 [Oryza sativa Japonica Group]
Length = 828
Score = 774 bits (1999), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 401/838 (47%), Positives = 526/838 (62%), Gaps = 57/838 (6%)
Query: 27 VTYDHRAVVIGGKRRVLISGSIHYPRSTPEMWPDLIQKSKDGGLDVIETYVFWNLHEPVR 86
V Y+ R++VI G+RR++ISGSIHYPRSTPEMWPDLI+K+K+GGLD IETYVFWN HEP R
Sbjct: 31 VAYNDRSLVIDGERRIIISGSIHYPRSTPEMWPDLIKKAKEGGLDAIETYVFWNGHEPHR 90
Query: 87 NQYNFEGRYDLVKFVKLVAEAGLYAHLRIGPYVCAEWNFGGFPLWLHFIPGIQFRTDNEP 146
QYNFEG YD+++F K + AGLYA LRIGPY+C EWN+GG P WL IP +QFR N P
Sbjct: 91 RQYNFEGNYDIIRFFKEIQNAGLYAILRIGPYICGEWNYGGLPAWLRDIPQMQFRMHNAP 150
Query: 147 FKAEMQRFTAKIVDMMKQEKLYASQGGPIILSQIENEYGNIDSAY--GAAGKSYIKWAAG 204
F+ EM+ FT I++ MK ++A QGGPIIL+QIENEYGN+ + YI W A
Sbjct: 151 FENEMENFTTLIINKMKDANMFAGQGGPIILAQIENEYGNVMGQLNNNQSASEYIHWCAD 210
Query: 205 MALSLDTGVPWVMCQQ-SDAPDPIINTCNGFYCDQFTPNSNNKPKMWTENWSGWFLSFGG 263
MA + GVPW+MCQQ SD P ++NTCNGFYC + PN PK+WTENW+GWF ++
Sbjct: 211 MANKQNVGVPWIMCQQDSDVPHNVVNTCNGFYCHDWFPNRTGIPKIWTENWTGWFKAWDK 270
Query: 264 AVPYRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNFDRTSGGPFISTSYDYDAPLDEYGL 323
+R ED+AFAVA FFQ+ G+ QNYYMYHGGTNF RTSGGP+I+TSYDYDAPLDEYG
Sbjct: 271 PDFHRSAEDIAFAVAMFFQKRGSLQNYYMYHGGTNFGRTSGGPYITTSYDYDAPLDEYGN 330
Query: 324 IRQPKWGHLKDLHKAIKLCEAALVATDPTYPSLGPNLEATVYKTGSGLCSAFLANIGTNS 383
+RQPK+GHLKDLH IK E LV + + N+ T Y GS + F+ N N
Sbjct: 331 LRQPKYGHLKDLHSVIKSIEKILVHGEYVDANYSDNVTVTKYTLGS-TSACFINNRNDNK 389
Query: 384 DVTVKFNGNSYLLPAWSVSILPDCKNVVFNTAKINSVTLVPSFSRQSLQVAADSSDAIGS 443
D+ V +GN++LLPAWSVSILPDCK V FN+AKI + T + ++ +S
Sbjct: 390 DLNVTLDGNTHLLPAWSVSILPDCKTVAFNSAKIKAQTTIMVKKANMVEKEPESLK---- 445
Query: 444 GWSYINE---PVGISKDDAFTKPGLLEQINTTADQSDYLWYSLSTNIKADEPLLEDGSKT 500
WS++ E P + ++ K LLEQI T+ DQSDYLWY S + K + +
Sbjct: 446 -WSWMRENLTPFMTDEKGSYRKNELLEQIVTSTDQSDYLWYRTSLDHKGE-------ASY 497
Query: 501 VLHVQSLGHALHAFINGKLVGSGYGSSSNAKVTVDFPIALAPGKNTFDLLSLTVGLQNYG 560
L V + GH L+AF+NG LVG + + + ++ + L GKN LLS T+GL+NYG
Sbjct: 498 TLFVNTTGHELYAFVNGMLVGKNHSPNGHFVFQLESAVKLHDGKNYISLLSATIGLKNYG 557
Query: 561 AFYEKTGAGIT-GPVQLKGSGNGTNIDLSSQQWTYQTGLKGE----ELNFPSGSSTQWDS 615
+EK AGI GPV+L NGT IDLS+ W+Y+ GL GE L+ P +WD+
Sbjct: 558 PLFEKMPAGIVGGPVKLI-DNNGTGIDLSNSSWSYKAGLAGEYRQIHLDKP---GYRWDN 613
Query: 616 KS-TLPKLQPLVWYKTTFDAPAGSEPVAIDFTGMGKGEAWVNGQSIGRYWPTYVSQNGGC 674
+ T+P +P WYKTTF APAG + V +D G+ KG AWVNG ++GRYWP+Y + G
Sbjct: 614 NNGTVPINRPFTWYKTTFQAPAGQDTVVVDLLGLNKGVAWVNGNNLGRYWPSYTAAEMGG 673
Query: 675 TDSCNYRGAYSS----NKCLKNCGKPSQSLYHVPRSWLKSSG-NTLVLFEEIGGDPTKIS 729
C+YRG + + KCL CG+PSQ YHVPRS+LK+ NTL+LFEE GGDP+++
Sbjct: 674 CHHCDYRGVFQAEGDGQKCLTGCGEPSQRYYHVPRSFLKNGEPNTLILFEEAGGDPSQVI 733
Query: 730 FVTKQLGSSLCSHVTDSHPLPVDMWGSDSKIQRKPGPVLSLECPNPNQVISSIKFASFGT 789
F + G S+C + + G ++L C ++ IS+I SFG
Sbjct: 734 FHSVVAG-SVC-------------------VSAEVGDAITLSCGQHSKTISTIDVTSFGV 773
Query: 790 PLGTCGSFSRGRCSSARSLSVVRQACVGSKSCSIGVSVNTFGDPCKGVMKSLAVEASC 847
G CG++ G C S + +AC+G +SC++ + G C + L V+ASC
Sbjct: 774 ARGQCGAY-EGGCESKAAYKAFTEACLGKESCTVQIINALTGSGC--LSGVLTVQASC 828
>gi|16905220|gb|AAL31090.1|AC091749_19 putative beta-galactosidase [Oryza sativa Japonica Group]
gi|22655745|gb|AAN04162.1| Putative beta-galactosidase [Oryza sativa Japonica Group]
Length = 824
Score = 774 bits (1999), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 401/838 (47%), Positives = 526/838 (62%), Gaps = 57/838 (6%)
Query: 27 VTYDHRAVVIGGKRRVLISGSIHYPRSTPEMWPDLIQKSKDGGLDVIETYVFWNLHEPVR 86
V Y+ R++VI G+RR++ISGSIHYPRSTPEMWPDLI+K+K+GGLD IETYVFWN HEP R
Sbjct: 27 VAYNDRSLVIDGERRIIISGSIHYPRSTPEMWPDLIKKAKEGGLDAIETYVFWNGHEPHR 86
Query: 87 NQYNFEGRYDLVKFVKLVAEAGLYAHLRIGPYVCAEWNFGGFPLWLHFIPGIQFRTDNEP 146
QYNFEG YD+++F K + AGLYA LRIGPY+C EWN+GG P WL IP +QFR N P
Sbjct: 87 RQYNFEGNYDIIRFFKEIQNAGLYAILRIGPYICGEWNYGGLPAWLRDIPQMQFRMHNAP 146
Query: 147 FKAEMQRFTAKIVDMMKQEKLYASQGGPIILSQIENEYGNIDSAY--GAAGKSYIKWAAG 204
F+ EM+ FT I++ MK ++A QGGPIIL+QIENEYGN+ + YI W A
Sbjct: 147 FENEMENFTTLIINKMKDANMFAGQGGPIILAQIENEYGNVMGQLNNNQSASEYIHWCAD 206
Query: 205 MALSLDTGVPWVMCQQ-SDAPDPIINTCNGFYCDQFTPNSNNKPKMWTENWSGWFLSFGG 263
MA + GVPW+MCQQ SD P ++NTCNGFYC + PN PK+WTENW+GWF ++
Sbjct: 207 MANKQNVGVPWIMCQQDSDVPHNVVNTCNGFYCHDWFPNRTGIPKIWTENWTGWFKAWDK 266
Query: 264 AVPYRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNFDRTSGGPFISTSYDYDAPLDEYGL 323
+R ED+AFAVA FFQ+ G+ QNYYMYHGGTNF RTSGGP+I+TSYDYDAPLDEYG
Sbjct: 267 PDFHRSAEDIAFAVAMFFQKRGSLQNYYMYHGGTNFGRTSGGPYITTSYDYDAPLDEYGN 326
Query: 324 IRQPKWGHLKDLHKAIKLCEAALVATDPTYPSLGPNLEATVYKTGSGLCSAFLANIGTNS 383
+RQPK+GHLKDLH IK E LV + + N+ T Y GS + F+ N N
Sbjct: 327 LRQPKYGHLKDLHSVIKSIEKILVHGEYVDANYSDNVTVTKYTLGS-TSACFINNRNDNK 385
Query: 384 DVTVKFNGNSYLLPAWSVSILPDCKNVVFNTAKINSVTLVPSFSRQSLQVAADSSDAIGS 443
D+ V +GN++LLPAWSVSILPDCK V FN+AKI + T + ++ +S
Sbjct: 386 DLNVTLDGNTHLLPAWSVSILPDCKTVAFNSAKIKAQTTIMVKKANMVEKEPESLK---- 441
Query: 444 GWSYINE---PVGISKDDAFTKPGLLEQINTTADQSDYLWYSLSTNIKADEPLLEDGSKT 500
WS++ E P + ++ K LLEQI T+ DQSDYLWY S + K + +
Sbjct: 442 -WSWMRENLTPFMTDEKGSYRKNELLEQIVTSTDQSDYLWYRTSLDHKGE-------ASY 493
Query: 501 VLHVQSLGHALHAFINGKLVGSGYGSSSNAKVTVDFPIALAPGKNTFDLLSLTVGLQNYG 560
L V + GH L+AF+NG LVG + + + ++ + L GKN LLS T+GL+NYG
Sbjct: 494 TLFVNTTGHELYAFVNGMLVGKNHSPNGHFVFQLESAVKLHDGKNYISLLSATIGLKNYG 553
Query: 561 AFYEKTGAGIT-GPVQLKGSGNGTNIDLSSQQWTYQTGLKGE----ELNFPSGSSTQWDS 615
+EK AGI GPV+L NGT IDLS+ W+Y+ GL GE L+ P +WD+
Sbjct: 554 PLFEKMPAGIVGGPVKLI-DNNGTGIDLSNSSWSYKAGLAGEYRQIHLDKP---GYRWDN 609
Query: 616 KS-TLPKLQPLVWYKTTFDAPAGSEPVAIDFTGMGKGEAWVNGQSIGRYWPTYVSQNGGC 674
+ T+P +P WYKTTF APAG + V +D G+ KG AWVNG ++GRYWP+Y + G
Sbjct: 610 NNGTVPINRPFTWYKTTFQAPAGQDTVVVDLLGLNKGVAWVNGNNLGRYWPSYTAAEMGG 669
Query: 675 TDSCNYRGAYSS----NKCLKNCGKPSQSLYHVPRSWLKSSG-NTLVLFEEIGGDPTKIS 729
C+YRG + + KCL CG+PSQ YHVPRS+LK+ NTL+LFEE GGDP+++
Sbjct: 670 CHHCDYRGVFQAEGDGQKCLTGCGEPSQRYYHVPRSFLKNGEPNTLILFEEAGGDPSQVI 729
Query: 730 FVTKQLGSSLCSHVTDSHPLPVDMWGSDSKIQRKPGPVLSLECPNPNQVISSIKFASFGT 789
F + G S+C + + G ++L C ++ IS+I SFG
Sbjct: 730 FHSVVAG-SVC-------------------VSAEVGDAITLSCGQHSKTISTIDVTSFGV 769
Query: 790 PLGTCGSFSRGRCSSARSLSVVRQACVGSKSCSIGVSVNTFGDPCKGVMKSLAVEASC 847
G CG++ G C S + +AC+G +SC++ + G C + L V+ASC
Sbjct: 770 ARGQCGAY-EGGCESKAAYKAFTEACLGKESCTVQIINALTGSGC--LSGVLTVQASC 824
>gi|125574401|gb|EAZ15685.1| hypothetical protein OsJ_31098 [Oryza sativa Japonica Group]
Length = 824
Score = 773 bits (1996), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 401/838 (47%), Positives = 526/838 (62%), Gaps = 57/838 (6%)
Query: 27 VTYDHRAVVIGGKRRVLISGSIHYPRSTPEMWPDLIQKSKDGGLDVIETYVFWNLHEPVR 86
V Y+ R++VI G+RR++ISGSIHYPRSTPEMWPDLI+K+K+GGLD IETYVFWN HEP R
Sbjct: 27 VAYNDRSLVIDGERRIIISGSIHYPRSTPEMWPDLIKKAKEGGLDAIETYVFWNGHEPHR 86
Query: 87 NQYNFEGRYDLVKFVKLVAEAGLYAHLRIGPYVCAEWNFGGFPLWLHFIPGIQFRTDNEP 146
QYNFEG YD+++F K + AGLYA LRIGPY+C EWN+GG P WL IP +QFR N P
Sbjct: 87 RQYNFEGNYDIIRFFKEIQNAGLYAILRIGPYICGEWNYGGLPAWLRDIPQMQFRMHNAP 146
Query: 147 FKAEMQRFTAKIVDMMKQEKLYASQGGPIILSQIENEYGNIDSAY--GAAGKSYIKWAAG 204
F+ EM+ FT I++ MK ++A QGGPIIL+QIENEYGN+ + YI W A
Sbjct: 147 FENEMENFTTLIINKMKDANMFAGQGGPIILAQIENEYGNVMGQLNNNQSASEYIHWCAD 206
Query: 205 MALSLDTGVPWVMCQQ-SDAPDPIINTCNGFYCDQFTPNSNNKPKMWTENWSGWFLSFGG 263
MA + GVPW+MCQQ SD P ++NTCNGFYC + PN PK+WTENW+GWF ++
Sbjct: 207 MANKQNVGVPWIMCQQDSDVPHNVVNTCNGFYCHDWFPNRTGIPKIWTENWTGWFKAWDK 266
Query: 264 AVPYRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNFDRTSGGPFISTSYDYDAPLDEYGL 323
+R ED+AFAVA FFQ+ G+ QNYYMYHGGTNF RTSGGP+I+TSYDYDAPLDEYG
Sbjct: 267 PDFHRSAEDIAFAVAMFFQKRGSLQNYYMYHGGTNFGRTSGGPYITTSYDYDAPLDEYGN 326
Query: 324 IRQPKWGHLKDLHKAIKLCEAALVATDPTYPSLGPNLEATVYKTGSGLCSAFLANIGTNS 383
+RQPK+GHLKDLH IK E LV + + N+ T Y GS + F+ N N
Sbjct: 327 LRQPKYGHLKDLHSVIKSIEKILVHGEYVDANYSDNVTVTKYTLGS-TSACFINNRNDNK 385
Query: 384 DVTVKFNGNSYLLPAWSVSILPDCKNVVFNTAKINSVTLVPSFSRQSLQVAADSSDAIGS 443
D+ V +GN++LLPAWSVSILPDCK V FN+AKI + T + ++ +S
Sbjct: 386 DLNVTLDGNTHLLPAWSVSILPDCKTVAFNSAKIKAQTTIMVKKANMVEKEPESLK---- 441
Query: 444 GWSYINE---PVGISKDDAFTKPGLLEQINTTADQSDYLWYSLSTNIKADEPLLEDGSKT 500
WS++ E P + ++ K LLEQI T+ DQSDYLWY S + K + +
Sbjct: 442 -WSWMRENLTPFMTDEKGSYRKNELLEQIVTSTDQSDYLWYRTSLDHKGE-------ASY 493
Query: 501 VLHVQSLGHALHAFINGKLVGSGYGSSSNAKVTVDFPIALAPGKNTFDLLSLTVGLQNYG 560
L V + GH L+AF+NG LVG + + + ++ + L GKN LLS T+GL+NYG
Sbjct: 494 TLFVNTTGHELYAFVNGMLVGKNHSPNGHFVFQLESAVKLHDGKNYISLLSATIGLKNYG 553
Query: 561 AFYEKTGAGIT-GPVQLKGSGNGTNIDLSSQQWTYQTGLKGE----ELNFPSGSSTQWDS 615
+EK AGI GPV+L NGT IDLS+ W+Y+ GL GE L+ P +WD+
Sbjct: 554 PLFEKMPAGIVGGPVKLI-DNNGTGIDLSNSSWSYKAGLAGEYRQIHLDKP---GYRWDN 609
Query: 616 KS-TLPKLQPLVWYKTTFDAPAGSEPVAIDFTGMGKGEAWVNGQSIGRYWPTYVSQNGGC 674
+ T+P +P WYKTTF APAG + V +D G+ KG AWVNG ++GRYWP+Y + G
Sbjct: 610 NNGTVPINRPFTWYKTTFQAPAGQDTVVVDLLGLNKGVAWVNGNNLGRYWPSYTAAEMGG 669
Query: 675 TDSCNYRGAYSS----NKCLKNCGKPSQSLYHVPRSWLKSSG-NTLVLFEEIGGDPTKIS 729
C+YRG + + KCL CG+PSQ YHVPRS+LK+ NTL+LFEE GGDP+++
Sbjct: 670 CHHCDYRGVFQAEGDGQKCLTGCGEPSQRYYHVPRSFLKNGEPNTLILFEEAGGDPSQVI 729
Query: 730 FVTKQLGSSLCSHVTDSHPLPVDMWGSDSKIQRKPGPVLSLECPNPNQVISSIKFASFGT 789
F + G S+C + + G ++L C ++ IS+I SFG
Sbjct: 730 FHSVVAG-SVC-------------------VSAEVGDAITLSCGQHSKTISTIDVTSFGV 769
Query: 790 PLGTCGSFSRGRCSSARSLSVVRQACVGSKSCSIGVSVNTFGDPCKGVMKSLAVEASC 847
G CG++ G C S + +AC+G +SC++ + G G+ L V+ASC
Sbjct: 770 ARGQCGAY-EGGCESKAAYKAFTEACLGKESCTVQIINALTGS--GGLSGVLTVQASC 824
>gi|293332691|ref|NP_001168270.1| beta-galactosidase precursor [Zea mays]
gi|223947135|gb|ACN27651.1| unknown [Zea mays]
gi|414880417|tpg|DAA57548.1| TPA: beta-galactosidase [Zea mays]
Length = 822
Score = 772 bits (1993), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 409/866 (47%), Positives = 541/866 (62%), Gaps = 62/866 (7%)
Query: 1 MASKEILLLVLCWGFVVLATTSFGANVTYDHRAVVIGGKRRVLISGSIHYPRSTPEMWPD 60
M + + LLL L V + + VTY+ RA+VI G+RR+++SGSIHYPRSTP+MWPD
Sbjct: 1 MTALQFLLLAL----VAVTQVASATTVTYNDRALVIDGQRRIILSGSIHYPRSTPQMWPD 56
Query: 61 LIQKSKDGGLDVIETYVFWNLHEPVRNQYNFEGRYDLVKFVKLVAEAGLYAHLRIGPYVC 120
LI K+K+GGL+ IETYVFWN HEP R QYNFEG YD+++F K + AG++A LRIGPY+C
Sbjct: 57 LINKAKEGGLNTIETYVFWNGHEPRRRQYNFEGSYDIIRFFKEIQNAGMHAILRIGPYIC 116
Query: 121 AEWNFGGFPLWLHFIPGIQFRTDNEPFKAEMQRFTAKIVDMMKQEKLYASQGGPIILSQI 180
EWN+GG P WL IPG+QFR N PF+ EM+ FT IV+ MK ++A QGGPIIL+QI
Sbjct: 117 GEWNYGGLPAWLRDIPGMQFRLHNAPFEREMETFTTLIVNKMKDVNMFAGQGGPIILAQI 176
Query: 181 ENEYGNIDSAY--GAAGKSYIKWAAGMALSLDTGVPWVMCQQ-SDAPDPIINTCNGFYCD 237
ENEYGNI + YI W A MA + GVPW+MCQQ +D P +INTCNGFYC
Sbjct: 177 ENEYGNIMGQLKNNQSASQYIHWCADMANKQEVGVPWIMCQQDNDVPHNVINTCNGFYCH 236
Query: 238 QFTPNSNNKPKMWTENWSGWFLSFGGAVPYRPVEDLAFAVARFFQRGGTFQNYYMYHGGT 297
+ PN PK+WTENW+GWF ++ +R ED+AFAVA FFQ+ G+ NYYMYHGGT
Sbjct: 237 DWFPNRTGIPKIWTENWTGWFKAWDKPDFHRSAEDIAFAVAMFFQKRGSVHNYYMYHGGT 296
Query: 298 NFDRTSGGPFISTSYDYDAPLDEYGLIRQPKWGHLKDLHKAIKLCEAALVATDPTYPSLG 357
NF RTSGGP+I+TSYDYDAPLDEYG IRQPK+GHLKDLH I+ E LV S G
Sbjct: 297 NFGRTSGGPYITTSYDYDAPLDEYGNIRQPKYGHLKDLHDLIRSMEKILVHGKYNDTSYG 356
Query: 358 PNLEATVYKT-GSGLCSAFLANIGTNSDVTVKFNGNSYLLPAWSVSILPDCKNVVFNTAK 416
N+ T Y GS +C F+ N + D+ V G ++L+PAWSVSILP+CK V +NTAK
Sbjct: 357 KNVTVTKYMYGGSSVC--FINNQFVDRDMKVTLGGETHLVPAWSVSILPNCKTVAYNTAK 414
Query: 417 INSVTLVPSFSRQSLQVAADSSDAIGSGWSYINE---PVGISKDDAFTKPGLLEQINTTA 473
I + T V S++ ++ WS++ E P +F + LLEQI T+
Sbjct: 415 IKTQTSVMVKKANSVEKEPETMR-----WSWMPENLKPFMTDHRGSFRQSQLLEQIATST 469
Query: 474 DQSDYLWYSLSTNIKADEPLLEDGSKTVLHVQSLGHALHAFINGKLVGSGYGSSSNAKVT 533
DQSDYLWY S K +GS T L+V + GH ++AF+NG+LVG + +
Sbjct: 470 DQSDYLWYRTSLEHKG------EGSYT-LYVNTSGHEMYAFVNGRLVGQNHSADGAFVFQ 522
Query: 534 VDFPIALAPGKNTFDLLSLTVGLQNYGAFYEKTGAGIT-GPVQLKGSGNGTNIDLSSQQW 592
+ P+ L GKN LLS TVGL+NYG +E AGI GPV+L G+ NGT IDL+ W
Sbjct: 523 LQSPVKLHSGKNYVSLLSGTVGLKNYGPSFELVPAGIAGGPVKLVGT-NGTAIDLTKSSW 581
Query: 593 TYQTGLKGE----ELNFPSGSSTQWDSKS-TLPKLQPLVWYKTTFDAPAGSEPVAIDFTG 647
+Y++GL GE L+ P +W S + T+P +P WYKTTF+APAG E V +D G
Sbjct: 582 SYKSGLAGELRQIHLDKP---GYKWQSHNGTIPVNRPFTWYKTTFEAPAGEEAVVVDLLG 638
Query: 648 MGKGEAWVNGQSIGRYWPTYVSQNGGCTDSCNYRGAYSSN----KCLKNCGKPSQSLYHV 703
+ KG AWVNG S+GRYWP+Y + C+YRG + + +CL CG+P+Q YHV
Sbjct: 639 LNKGVAWVNGNSLGRYWPSYTAAEMPGCHVCDYRGKFIAEGDGIRCLTGCGEPAQRFYHV 698
Query: 704 PRSWLKSSG-NTLVLFEEIGGDPTKISFVTKQLGSSLCSHVTDSHPLPVDMWGSDSKIQR 762
PRS+L++ NTL+LFEE GGDPT+ +F T +G + V
Sbjct: 699 PRSFLRAGEPNTLILFEEAGGDPTRAAFHTVAVGPVCVAAV------------------- 739
Query: 763 KPGPVLSLECPNPNQVISSIKFASFGTPLGTCGSFSRGRCSSARSLSVVRQACVGSKSCS 822
+ G ++L C +V++S+ ASFG G+CG++ +G C S +L ACVG +SC+
Sbjct: 740 ELGDDVTLSCGGHGRVVASVDVASFGVARGSCGAY-KGGCESKAALKAFTDACVGRESCT 798
Query: 823 IGVSVNTFGDPCKGVMKSLAVEASCT 848
+ + G C+ +L V+A+C+
Sbjct: 799 VKYTAAFAGAGCQS--GALTVQATCS 822
>gi|297789001|ref|XP_002862517.1| hypothetical protein ARALYDRAFT_333310 [Arabidopsis lyrata subsp.
lyrata]
gi|297308086|gb|EFH38775.1| hypothetical protein ARALYDRAFT_333310 [Arabidopsis lyrata subsp.
lyrata]
Length = 534
Score = 769 bits (1986), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 374/529 (70%), Positives = 435/529 (82%), Gaps = 5/529 (0%)
Query: 322 GLIRQPKWGHLKDLHKAIKLCEAALVATDPTYPSLGPNLEATVYKTGSGLCSAFLANIGT 381
GL+RQPKWGHL+DLHKAIKLCE AL+ATDPT SLG NLEA VYKT SG C+AFLAN+GT
Sbjct: 9 GLLRQPKWGHLRDLHKAIKLCEDALIATDPTISSLGSNLEAAVYKTASGSCAAFLANVGT 68
Query: 382 NSDVTVKFNGNSYLLPAWSVSILPDCKNVVFNTAKINSVTLVPSFSRQSLQVAADSSDAI 441
SD TV FNG SY LPAWSVSILPDCKNV FNTAKINS T +F+RQSL+ SS +
Sbjct: 69 KSDATVSFNGESYHLPAWSVSILPDCKNVAFNTAKINSATEPTAFARQSLKPDGGSSAEL 128
Query: 442 GSGWSYINEPVGISKDDAFTKPGLLEQINTTADQSDYLWYSLSTNIKADEPLLEDGSKTV 501
GS WSYI EP+GISK DAF KPGLLEQINTTAD+SDYLWYSL +IK DE L++GSK V
Sbjct: 129 GSEWSYIKEPIGISKADAFLKPGLLEQINTTADKSDYLWYSLRMDIKGDETFLDEGSKAV 188
Query: 502 LHVQSLGHALHAFINGKLVGSGYGSSSNAKVTVDFPIALAPGKNTFDLLSLTVGLQNYGA 561
LH++SLG ++AFINGKL GSG+G K+++D PI L GKNT DLLS+TVGL NYGA
Sbjct: 189 LHIESLGQVVYAFINGKLAGSGHG---KQKISLDIPINLVAGKNTVDLLSVTVGLANYGA 245
Query: 562 FYEKTGAGITGPVQLKGSGNGTNIDLSSQQWTYQTGLKGEELNFPSGSSTQWDSKSTLPK 621
F++ GAGITGPV LK + G++IDL+SQQWTYQ GLKGE+ + S++W SKS LP
Sbjct: 246 FFDLVGAGITGPVTLKSAKGGSSIDLASQQWTYQVGLKGEDTGLGAVDSSEWVSKSPLPT 305
Query: 622 LQPLVWYKTTFDAPAGSEPVAIDFTGMGKGEAWVNGQSIGRYWPTYVSQNGGCTDSCNYR 681
QPL+WYKTTFDAP+GSEPVAIDFTG KG AWVNGQSIGRYWPT ++ NGGCTDSC+YR
Sbjct: 306 KQPLIWYKTTFDAPSGSEPVAIDFTGTVKGIAWVNGQSIGRYWPTSIAGNGGCTDSCDYR 365
Query: 682 GAYSSNKCLKNCGKPSQSLYHVPRSWLKSSGNTLVLFEEIGGDPTKISFVTKQLGSSLCS 741
G+Y +NKCLKNCGKPSQ+LYHVPRSWLK SGNTLVLFEE+GGDPT+ISF TKQ GS+LC
Sbjct: 366 GSYRANKCLKNCGKPSQTLYHVPRSWLKPSGNTLVLFEEMGGDPTQISFGTKQTGSNLCL 425
Query: 742 HVTDSHPLPVDMWGSDSKI--QRKPGPVLSLECPNPNQVISSIKFASFGTPLGTCGSFSR 799
V+ SHP PVD W SDSKI + + PVLSL+CP QVISSIKFASFGTP GTCGSF+
Sbjct: 426 TVSQSHPPPVDTWTSDSKISNRNRTRPVLSLQCPVSTQVISSIKFASFGTPKGTCGSFTS 485
Query: 800 GRCSSARSLSVVRQACVGSKSCSIGVSVNTFGDPCKGVMKSLAVEASCT 848
G C+S+RSLS+V++AC+GS+SC+I VS FG+PC+GV+KSLAVEASC+
Sbjct: 486 GSCNSSRSLSLVQKACIGSRSCNIEVSTRVFGEPCRGVVKSLAVEASCS 534
>gi|357142911|ref|XP_003572734.1| PREDICTED: beta-galactosidase 1-like [Brachypodium distachyon]
Length = 831
Score = 766 bits (1978), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 409/850 (48%), Positives = 527/850 (62%), Gaps = 56/850 (6%)
Query: 14 GFVVLATTSFGANVTYDHRAVVIGGKRRVLISGSIHYPRSTPEMWPDLIQKSKDGGLDVI 73
GFV A+ + V+YD RA+VI G+RR+++SGSIHYPRSTPEMWPDLIQK+KDGGL+ I
Sbjct: 23 GFVPGASCT---EVSYDERALVIDGQRRIILSGSIHYPRSTPEMWPDLIQKAKDGGLNTI 79
Query: 74 ETYVFWNLHEPVRNQYNFEGRYDLVKFVKLVAEAGLYAHLRIGPYVCAEWNFGGFPLWLH 133
ETYVFWN HEP QYNFEG YD+++F K V +AG+YA LRIGPY+C EWN+GG P WL
Sbjct: 80 ETYVFWNGHEPRPRQYNFEGNYDIMRFFKEVQKAGMYAILRIGPYICGEWNYGGLPAWLR 139
Query: 134 FIPGIQFRTDNEPFKAEMQRFTAKIVDMMKQEKLYASQGGPIILSQIENEYGNIDSAY-- 191
IP +QFR NEPF+ EM+ FT IV+ MK ++A QGGPIIL+QIENEYGN+ S
Sbjct: 140 DIPDMQFRLHNEPFEREMETFTTLIVNKMKDANMFAGQGGPIILTQIENEYGNVQSNLPD 199
Query: 192 GAAGKSYIKWAAGMALSLDTGVPWVMCQQS-DAPDPIINTCNGFYCDQFTPNSNNKPKMW 250
+ YI W A MA + GVPW+MCQQS D P +I TCNGFYC F P +N PK+W
Sbjct: 200 QESATKYIHWCADMANKQNVGVPWIMCQQSNDVPPNVIETCNGFYCHDFKPKGSNMPKIW 259
Query: 251 TENWSGWFLSFGGAVPYRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNFDRTSGGPFIST 310
TENW+GWF ++ +RP ED+A+AVA FFQ G+ QNYYMYHGGTNF RTSGGP+I+T
Sbjct: 260 TENWTGWFKAWDKPDYHRPAEDVAYAVAMFFQNRGSVQNYYMYHGGTNFGRTSGGPYITT 319
Query: 311 SYDYDAPLDEYGLIRQPKWGHLKDLHKAIKLCEAALVATDPTYPSLGPNLEATVYKTGSG 370
+YDYDAPLDEYG IRQPK+GHLK LH + E LV +L ++AT Y G
Sbjct: 320 TYDYDAPLDEYGNIRQPKYGHLKALHTVLTSMEKHLVYGQQNETNLDDKVKATKYTLDDG 379
Query: 371 LCSAFLANIGTNSDVTVKFNGNSYLLPAWSVSILPDCKNVVFNTAKINSVTLVPSFSRQS 430
+ F++N N DV V F G++Y +PAWSVS+LPDCK V +NTAK+ + T S
Sbjct: 380 SSACFISNSHDNKDVNVTFEGSAYQVPAWSVSVLPDCKTVAYNTAKVKTQT--------S 431
Query: 431 LQVAADSSDAIGSGWSYINEPVGISKDD---AFTKPGLLEQINTTADQSDYLWYSLSTNI 487
+ V +S+ G WS++ E + S D +F LLEQI T AD+SDYLWY S
Sbjct: 432 VMVKKESAAKGGLKWSWLPEFLRPSFTDSYGSFKSNELLEQIVTGADESDYLWYKTSLTR 491
Query: 488 KADEPLLEDGSKTVLHVQSLGHALHAFINGKLVGSGYGSSSNAKVTVDFPIALAPGKNTF 547
E L+V + GH L+AF+NG+L G + + + P+ L PGKN
Sbjct: 492 GPKEQF-------TLYVNTTGHELYAFVNGELAGYKHAVNGPYLFQFEAPVTLKPGKNYI 544
Query: 548 DLLSLTVGLQNYGAFYEKTGAGIT-GPVQLKGSGNGTNIDLSSQQWTYQTGLKGEELNFP 606
LLS TVGL+NYGA +E AGI GPV+L S +G IDLS+ WTY+TGL GE+
Sbjct: 545 SLLSATVGLKNYGASFELMPAGIVGGPVKLV-SAHGNTIDLSNNTWTYKTGLFGEQKQIH 603
Query: 607 SGSSTQWDSKSTLPKLQPLVWYKTTFDAPAGSEPVAIDFTGMGKGEAWVNGQSIGRYWPT 666
S +P +P WYK TF APAG+E V +D G+ KG +VNG ++GRYWP+
Sbjct: 604 LDKPGLRWSPFAVPTNRPFTWYKATFQAPAGTEAVVVDLVGLNKGVVYVNGHNLGRYWPS 663
Query: 667 YVSQNGGCTDSCNYRGAY----SSNKCLKNCGKPSQSLYHVPRSWLKSSG---NTLVLFE 719
YV+ + C+YRG Y + KCL CG+ Q YHVPRS+L ++ NT+VLFE
Sbjct: 664 YVAGDMDGCHRCDYRGEYVTWNNQEKCLTGCGEVGQRFYHVPRSFLNAAHGAPNTVVLFE 723
Query: 720 EIGGDPTKISFVTKQLGSSLCSHVTDSHPLPVDMWGSDSKIQRKPGPVLSLECPNPNQVI 779
E GGDP K++F T +G P+ D + G ++L C + + I
Sbjct: 724 EAGGDPAKVNFRTVAVG-----------PVCADA---------EKGDAVTLACAH-GRTI 762
Query: 780 SSIKFASFGTPLGTCGSFSRGR-CSSARSLSVVRQACVGSKSCSIGVSVNTFGDPCKGVM 838
SS+ ASFG G CG++ G C S +L + ACVG K C++ + CKG
Sbjct: 763 SSVDTASFGVSGGQCGAYEGGSGCESKPALEAITAACVGKKWCTVSYTDAFDSADCKG-S 821
Query: 839 KSLAVEASCT 848
L V+A+C+
Sbjct: 822 GVLTVQATCS 831
>gi|302141787|emb|CBI18990.3| unnamed protein product [Vitis vinifera]
Length = 817
Score = 766 bits (1977), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 417/849 (49%), Positives = 533/849 (62%), Gaps = 55/849 (6%)
Query: 13 WGFVVLAT---TSFGANVTYDHRAVVIGGKRRVLISGSIHYPRSTPEMWPDLIQKSKDGG 69
W F VL++ + G VTYD R+++I G+R++L SGSIHYPRSTPEMWP LI ++K GG
Sbjct: 11 WWFAVLSSAVASVCGGEVTYDGRSLIINGQRKILFSGSIHYPRSTPEMWPSLISQAKQGG 70
Query: 70 LDVIETYVFWNLHEPVRNQYNFEGRYDLVKFVKLVAEAGLYAHLRIGPYVCAEWNFGGFP 129
+DVIETYVFWN HEP QY+F GR D+V+F++ V GLYA LRIGP++ AEWN+GGFP
Sbjct: 71 IDVIETYVFWNQHEPKPGQYDFSGRRDIVRFIREVQAQGLYACLRIGPFIQAEWNYGGFP 130
Query: 130 LWLHFIPGIQFRTDNEPFKAEMQRFTAKIVDMMKQEKLYASQGGPIILSQIENEYGNIDS 189
WLH +PGI +RTDNEPFK M+ FT KIV++MK E LYASQGGPIIL QIENEY +++
Sbjct: 131 FWLHDVPGIVYRTDNEPFKFYMRNFTTKIVEIMKSENLYASQGGPIILQQIENEYKTVEA 190
Query: 190 AYGAAGKSYIKWAAGMALSLDTGVPWVMCQQSDAPDPIINTCNGFYC-DQFT-PNSNNKP 247
+G AGK Y+ WAA MA+ L+TGVPWVMC+Q DAPDP+IN+CNG C + F PNS NKP
Sbjct: 191 NFGEAGKRYVLWAANMAVGLETGVPWVMCKQDDAPDPVINSCNGRLCGETFAGPNSPNKP 250
Query: 248 KMWTENWSGWFLSFGGAVPYRPVEDLAFAVARFFQR-GGTFQNYYMYHGGTNFDRTSGGP 306
+WTENW+ + FG RPVED+AF VA F + G+F NYYMYHGGTNF RT+
Sbjct: 251 AIWTENWTSSYPLFGEDARPRPVEDIAFHVALFVAKMNGSFINYYMYHGGTNFGRTASA- 309
Query: 307 FISTSYDYDAPLDEYGLIRQPKWGHLKDLHKAIKLCEAALVATDPTYPSLGPNL-EATVY 365
++ T+Y +APLDEYGLI+QP WGHLK+LH A+KLC L+ + SLG L EA V+
Sbjct: 310 YVQTAYYDEAPLDEYGLIQQPTWGHLKELHAAVKLCSETLLQGAQSNLSLGTKLQEAYVF 369
Query: 366 KTGSGLCSAFLANIGTNSDVTVKFNGNSYLLPAWSVSILPDCKNVVFNTAKINSVTLVPS 425
+ SG C+AFL N + +DVTV F SY LP S+SILPDCKN FNTAK S
Sbjct: 370 RGQSGKCAAFLVNNDSRTDVTVVFQNTSYELPRKSISILPDCKNEAFNTAK-------AS 422
Query: 426 FSRQSLQVAADSSDAIGSGWSYINEPVGISKDDAFTKPGLLEQINTTADQSDYLWYSLST 485
F + + + W E + D + LLE +NTT D SDYLWY+
Sbjct: 423 FRPGLISIQTVTKFNSTEQWEEYKESILNFDDTSSRANTLLEHMNTTKDASDYLWYTFRY 482
Query: 486 NIKADEPLLEDGSKTVLHVQSLGHALHAFINGKLVGSGYGSSSNAKVTVDFPIALAPGKN 545
N ++P ++VL S HALHAFING+ GS +GSSSN ++D ++ G N
Sbjct: 483 N---NDP---SNGQSVLSTNSRAHALHAFINGRHTGSQHGSSSNLSFSLDNTVSFRAGIN 536
Query: 546 TFDLLSLTVGLQNYGAFYEKTGAGITGPVQLKGSGNGTNIDLSSQQWTYQTGLKGEELNF 605
LLS+ VGL + GA+ E+ AG+ +++ NG+ D ++ W YQ GL GE+L
Sbjct: 537 NVSLLSVMVGLPDSGAYLERRVAGLR---RVRIQSNGSLKDFTNNPWGYQVGLLGEKLQI 593
Query: 606 PS---GSSTQWDSKSTLPKLQPLVWYKTTFDAPAGSEPVAIDFTGMGKGEAWVNGQSIGR 662
+ QW SK L WYKT FDAPAG+EPVA++ M KGE WVNGQSIGR
Sbjct: 594 YTDVGSQKVQW-SKFGSSTSGLLTWYKTVFDAPAGNEPVALNLVSMRKGEVWVNGQSIGR 652
Query: 663 YWPTYVSQNGGCTDSCNYRGAYSSNKCLKNCGKPSQSLYHVPRSWLKSSGNTLVLFEEIG 722
YW ++++ + GKPSQ YH+PRS+LK +GN LVL EE
Sbjct: 653 YWVSFLTPS----------------------GKPSQIWYHIPRSFLKPTGNLLVLLEEET 690
Query: 723 GDPTKISFVTKQLGSSLCSHVTDSHPLPVDMWGSDSKIQRKPG--PVLSLECPNPNQVIS 780
G P IS + K +C HV++SH PV K + G P + L CP+ N+ IS
Sbjct: 691 GHPVGIS-IGKVSIPKICGHVSESHLPPVISRVIYKKHENHHGRRPKVQLRCPS-NRNIS 748
Query: 781 SIKFASFGTPLGTCGSFSRGRCSSARSLSVVRQACVGSKSCSIGVSVNTF-GDPCKGVMK 839
I FASFGTP G C S++ G C S+ S S V +AC+G CS+ +S F GDPC G K
Sbjct: 749 RILFASFGTPSGDCQSYAVGSCHSSNSRSNVEKACLGKGMCSVPLSYKRFGGDPCPGTPK 808
Query: 840 SLAVEASCT 848
+L V+ CT
Sbjct: 809 ALLVDVQCT 817
>gi|449517114|ref|XP_004165591.1| PREDICTED: beta-galactosidase 9-like, partial [Cucumis sativus]
Length = 763
Score = 764 bits (1973), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 397/761 (52%), Positives = 485/761 (63%), Gaps = 44/761 (5%)
Query: 123 WNF-GGFPLWLHFIPGIQFRTDNEPFKAEMQRFTAKIVDMMKQEKLYASQGGPIILSQIE 181
W++ GFPLWL +PGI+FRTDN PFK EMQRF KIVD+++ EKL+ QGGP+I+ Q+E
Sbjct: 1 WDYCRGFPLWLRDVPGIEFRTDNAPFKEEMQRFVKKIVDLLRDEKLFCWQGGPVIMLQVE 60
Query: 182 NEYGNIDSAYGAAGKSYIKWAAGMALSLDTGVPWVMCQQSDAPDPIINTCNGFYCDQFTP 241
NEYGNI+S+YG G+ YIKW MAL L VPWVMCQQ DAP IIN+CNG+YCD F
Sbjct: 61 NEYGNIESSYGKRGQEYIKWVGNMALGLGAEVPWVMCQQKDAPSTIINSCNGYYCDGFKA 120
Query: 242 NSNNKPKMWTENWSGWFLSFGGAVPYRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNFDR 301
NS +KP WTENW+GWF S+G P+RPVEDLAF+VARFFQR G+FQNYYMY GGTNF R
Sbjct: 121 NSPSKPIFWTENWNGWFTSWGERSPHRPVEDLAFSVARFFQREGSFQNYYMYFGGTNFGR 180
Query: 302 TSGGPFISTSYDYDAPLDEYGLIRQPKWGHLKDLHKAIKLCEAALVATD-PTYPSLGPNL 360
T+GGPF TSYDYD+P+DEYGLIR+PKWGHLKDLH A+KLCE ALV+ D P Y LGP
Sbjct: 181 TAGGPFYITSYDYDSPIDEYGLIREPKWGHLKDLHTALKLCEPALVSADSPQYIKLGPKQ 240
Query: 361 EATVYKTGSGL-------------CSAFLANIGTNSDVTVKFNGNSYLLPAWSVSILPDC 407
EA VY S CSAFLANI V VKFNG +Y LP WSVSILPDC
Sbjct: 241 EAHVYHMKSQTDDLTLSKLGTLRNCSAFLANIDERKAVAVKFNGQTYNLPPWSVSILPDC 300
Query: 408 KNVVFNTAKINSVTLV-------PSFSRQSLQVAADSSDA---IGSGWSYINEPVGISKD 457
+NVVFNTAK+ + T + P + SL++ A + I + W + EP+GI D
Sbjct: 301 QNVVFNTAKVAAQTSIKILELYAPLSANVSLKLHATDQNELSIIANSWMTVKEPIGIWSD 360
Query: 458 DAFTKPGLLEQINTTADQSDYLWYSLSTNIKADEPLL--EDGSKTVLHVQSLGHALHAFI 515
FT G+LE +N T D+SDYLWY ++ D+ E + + S+ F+
Sbjct: 361 QNFTVKGILEHLNVTKDRSDYLWYMTRIHVSNDDIRFWKERNITPTITIDSVRDVFRVFV 420
Query: 516 NGKLVGSGYGSSSNAKVTVDFPIALAPGKNTFDLLSLTVGLQNYGAFYEKTGAGITGPVQ 575
NGKL GS G V P+ G N LLS +GLQN GAF EK GAGI G ++
Sbjct: 421 NGKLTGSAIGQW----VKFVQPVQFLEGYNDLLLLSQAMGLQNSGAFIEKDGAGIRGRIK 476
Query: 576 LKGSGNGTNIDLSSQQWTYQTGLKGEELNFPS---GSSTQWDSKSTLPKLQPLVWYKTTF 632
L G NG +IDLS WTYQ GLKGE LNF S W S WYK F
Sbjct: 477 LTGFKNG-DIDLSKSLWTYQVGLKGEFLNFYSLEENEKADWTELSVDAIPSTFTWYKAYF 535
Query: 633 DAPAGSEPVAIDFTGMGKGEAWVNGQSIGRYWPTYVSQNGGCTDSCNYRGAYSSNKCLKN 692
+P G++PVAI+ MGKG+AWVNG IGRYW + VS GC C+YRGAY+S KC N
Sbjct: 536 SSPDGTDPVAINLGSMGKGQAWVNGHHIGRYW-SVVSPKDGCPRKCDYRGAYNSGKCATN 594
Query: 693 CGKPSQSLYHVPRSWLKSSGNTLVLFEEIGGDPTKISFVTKQLGSSLCSHVTDSH----- 747
CG+P+QS YH+PRSWLK S N LVLFEE GG+P +I G +C V++SH
Sbjct: 595 CGRPTQSWYHIPRSWLKESSNLLVLFEETGGNPLEIVVKLYSTG-VICGQVSESHYPSLR 653
Query: 748 PLPVDMWGSDSKIQRKPGPVLSLECPNPNQVISSIKFASFGTPLGTCGSFSRGRCSSARS 807
L D + + P + L C + VISS++FAS+GTP G+C FSRG C + S
Sbjct: 654 KLSNDYISDGETLSNRANPEMFLHC-DDGHVISSVEFASYGTPQGSCNKFSRGPCHATNS 712
Query: 808 LSVVRQACVGSKSCSIGVSVNTF-GDPCKGVMKSLAVEASC 847
LSVV QAC+G SC++ +S + F GDPC ++K+LAVEA C
Sbjct: 713 LSVVSQACLGKNSCTVEISNSAFGGDPCHSIVKTLAVEARC 753
>gi|255575455|ref|XP_002528629.1| beta-galactosidase, putative [Ricinus communis]
gi|223531918|gb|EEF33732.1| beta-galactosidase, putative [Ricinus communis]
Length = 822
Score = 763 bits (1971), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 409/849 (48%), Positives = 530/849 (62%), Gaps = 53/849 (6%)
Query: 23 FGANVTYDHRAVVIGGKRRVLISGSIHYPRSTPEMWPDLIQKSKDGGLDVIETYVFWNLH 82
FG VTYD+RA+ I G R++++SGSIHYPRSTPEMWP LI+K+K+GGL+ IETYVFWN H
Sbjct: 3 FGYEVTYDNRAIKIDGARKLILSGSIHYPRSTPEMWPQLIRKAKEGGLNTIETYVFWNAH 62
Query: 83 EPVRNQYNFEGRYDLVKFVKLVAEAGLYAHLRIGPYVCAEWNFGGFPLWLHFIPGIQFRT 142
EP + QY+F G DL++F+K + + GLYA LRIGPYVCAEWN+GGFP+WLH +PGIQ RT
Sbjct: 63 EPHQRQYDFSGNLDLIRFIKTIRDEGLYAILRIGPYVCAEWNYGGFPVWLHNLPGIQIRT 122
Query: 143 DNEPFKAEMQRFTAKIVDMMKQEKLYASQGGPIILSQIENEYGNIDSAYGAAGKSYIKWA 202
+NE +K EM+ FT IV+MMK KL+ASQGGPIILSQIENEYGN+ S+YG GK Y+KW
Sbjct: 123 NNEVYKNEMEIFTTLIVNMMKDGKLFASQGGPIILSQIENEYGNVQSSYGDEGKEYVKWC 182
Query: 203 AGMALSLDTGVPWVMCQQSDAPDPIINTCNGFYCDQFTPNSNNKPKMWTENWSGWFLSFG 262
A +A S GVPW+MCQQSDAP P+I++CNGFYCDQ+ N+ + PK+WTENW+GWF +G
Sbjct: 183 ANLAESFKVGVPWIMCQQSDAPSPMIDSCNGFYCDQYYSNNKSLPKIWTENWTGWFQDWG 242
Query: 263 GAVPYRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNFDRTSGGPFISTSYDYDAPLDEYG 322
P+R ED+AFAVARFFQ GG+ NYYMYHGGTNF T GGP+I+ SYDYDAPLDEYG
Sbjct: 243 QKNPHRSAEDVAFAVARFFQLGGSVMNYYMYHGGTNFGTTGGGPYITASYDYDAPLDEYG 302
Query: 323 LIRQPKWGHLKDLHKAIKLCEAALV---ATDPTYPSLGPNLEATVYKTGSGLCSAFLANI 379
+RQPKWGHL+DLH + E L + + YP N+ T++ G S F ++I
Sbjct: 303 NLRQPKWGHLRDLHSVLNSMEQTLTYGESKNSNYPD-NNNIFITIFAY-QGKRSCFFSSI 360
Query: 380 GTNSDVTVKFNGNSYLLPAWSVSILPDCKNVVFNTAKINSVTLVPSFSRQSLQVAADSSD 439
D T+ F G Y LPAWSVSILPDC V+NTA +N T + ++ AADS
Sbjct: 361 DY-KDQTISFEGTDYFLPAWSVSILPDCFTEVYNTATVNVQTSI----MENKANAADSFR 415
Query: 440 AIGS-GWSYINEPV-GISKDDAF-----TKPGLLEQINTTADQSDYLWYSLSTNIKADEP 492
S W + E + G+S F L++Q T SDYLW + + ++
Sbjct: 416 EPNSLQWKWRPEKIRGLSLQGDFVGNTLVANELMDQKAVTNGTSDYLWIMTNYDHNMNDS 475
Query: 493 LLEDGSKTVLHVQSLGHALHAFINGKLVGSGYGSSSNAKVTVDFP--IALAPGKNTFDLL 550
L G +L V + GH +HAF+NGK VGS S + + F I L G N L+
Sbjct: 476 LWGAGKDIILQVHTNGHVVHAFVNGKHVGSQSASIESGRFDFVFESKIKLKRGINRISLV 535
Query: 551 SLTVGLQNYGAFYEKTGAGITGPVQLKGSGNGTN-----IDLSSQQWTYQTGLKGEELNF 605
S++VGLQNYGA ++ GI GP+ + G N +D+SS +W Y+TGL GE+ F
Sbjct: 536 SVSVGLQNYGANFDTAPTGINGPITIIGRSKLGNQPDVTVDISSNRWVYKTGLHGEDQGF 595
Query: 606 PS---GSSTQWDSKSTLPKLQPLVWYKTTFDAPAGSEPVAIDFTGMGKGEAWVNGQSIGR 662
+ Q+ +K L QP VWYKT+F+AP G +PV +D G+GKG AWVNG++IGR
Sbjct: 596 QAVRPRHRRQFYTKHVLIN-QPFVWYKTSFNAPLGQDPVVVDLLGLGKGTAWVNGRNIGR 654
Query: 663 YWPTYVS-QNGGCTDSCNYRGAYSSNKCLKNCGKPSQSLYHVPRSWLKSSGNTLVLFEEI 721
+WP ++ +G C C+Y G Y +C+ CG+P+Q YH+PR WLK N LVLFEE+
Sbjct: 655 FWPKALAPDDGTCNAPCSYIGTYEPKQCVTGCGEPTQRYYHIPRDWLKPEDNKLVLFEEL 714
Query: 722 GGDPTKISFVTKQLGSSLCSHVTDSHPLPVDMWGSDSKIQRKPGPVLSLECPNPNQVISS 781
GG P +S T +G +C H + H + L C + + S
Sbjct: 715 GGTPDFVSVQTVTVG-KVCVHGYEGH-------------------TVELSCQHGRK-FSK 753
Query: 782 IKFASFGTPLGTCGSF--SRGRCSSARSLSVVRQACVGSKSCSIGVSVNTFGD-PCKGVM 838
I FASFG P G CGSF S A ++V +ACVG + CSI +S C +
Sbjct: 754 ITFASFGLPQGKCGSFTPSNNHDCHADVSTIVEKACVGKERCSIDISEKALAPIHCDARI 813
Query: 839 KSLAVEASC 847
LAVEA C
Sbjct: 814 YRLAVEAVC 822
>gi|224142776|ref|XP_002324727.1| predicted protein [Populus trichocarpa]
gi|222866161|gb|EEF03292.1| predicted protein [Populus trichocarpa]
Length = 749
Score = 762 bits (1968), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 383/775 (49%), Positives = 501/775 (64%), Gaps = 47/775 (6%)
Query: 57 MWPDLIQKSKDGGLDVIETYVFWNLHEPVRNQYNFEGRYDLVKFVKLVAEAGLYAHLRIG 116
MWP+L QK+K+GG+D IETY+FW+ HEPVR QY F G D+VKF KL EAGL+ LRIG
Sbjct: 1 MWPELFQKAKEGGIDAIETYIFWDRHEPVRRQYYFSGNQDIVKFCKLAQEAGLHVILRIG 60
Query: 117 PYVCAEWNFGGFPLWLHFIPGIQFRTDNEPFKAEMQRFTAKIVDMMKQEKLYASQGGPII 176
PYVCAEW++GGFP+WLH IPGI+ RTDNE +K EMQ FT KIVD+ K+ KL+A QGGPII
Sbjct: 61 PYVCAEWSYGGFPMWLHNIPGIELRTDNEIYKNEMQIFTTKIVDVCKEAKLFAPQGGPII 120
Query: 177 LSQIENEYGNIDSAYGAAGKSYIKWAAGMALSLDTGVPWVMCQQSDAPDPIINTCNGFYC 236
L+QIENEYGN+ YG AG+ Y+ W A MA+ + GVPW+MCQQS+AP P+INTCNGFYC
Sbjct: 121 LAQIENEYGNVMGPYGDAGRRYVNWCAQMAVGQNVGVPWIMCQQSNAPQPMINTCNGFYC 180
Query: 237 DQFTPNSNNKPKMWTENWSGWFLSFGGAVPYRPVEDLAFAVARFFQRGGTFQNYYMYHGG 296
DQF PN+ PKMWTENWSGWF +GG PYR EDLAF+VARF Q GG +YYMYHGG
Sbjct: 181 DQFKPNNPKSPKMWTENWSGWFKLWGGRDPYRTAEDLAFSVARFIQNGGVLNSYYMYHGG 240
Query: 297 TNFDRTSGGPFISTSYDYDAPLDEYGLIRQPKWGHLKDLHKAIKLCEAALVATDPTYPSL 356
TNF RT+GGP+I+TSYDY+APLDEYG + QPKWGHLK LH+AIK E L T +
Sbjct: 241 TNFGRTAGGPYITTSYDYNAPLDEYGNLNQPKWGHLKQLHEAIKQGERILTNGTVTSKNF 300
Query: 357 GPNLEATVY-KTGSGLCSAFLANIGTNSDVTVKFNGNSYLLPAWSVSILPDCKNVVFNTA 415
++ T Y G+G FL+N Y LPAWSV+IL DC ++NTA
Sbjct: 301 WGGVDQTTYTNQGTGERFCFLSNTNMEEANVDLGQDGKYSLPAWSVTILQDCNKEIYNTA 360
Query: 416 KINSVTLVPSFSRQSLQVAADSSDAIGSGWSYINEPVG--ISKDDAFTKPGLLEQINTTA 473
K+N+ T + ++ + + W++ EP+ + F LLEQ TT
Sbjct: 361 KVNTQTSI------MVKKLHEEDKPVQLSWTWAPEPMKGVLQGKGRFRATELLEQKETTV 414
Query: 474 DQSDYLWYSLSTNIKADEPLLEDGSKTVLHVQSLGHALHAFINGKLVGSGYGSSSNAKVT 533
D +DYLWY S N+ +E L+ + L V + GH LHA++N K +G+ + +NA+ +
Sbjct: 415 DTTDYLWYMTSVNL--NETTLKKWTNVTLRVGTRGHTLHAYVNKKEIGTQFSKQANAQQS 472
Query: 534 V---------DFPIALAPGKNTFDLLSLTVGLQNYGAFYEKTGAGIT-GPVQLKGSGNGT 583
V + P+ L G NT LLS TVGL NYG +Y+K GI GPVQL +G
Sbjct: 473 VKGDDYSFLFEKPVTLTSGTNTISLLSATVGLANYGQYYDKKPVGIAEGPVQLVANGK-P 531
Query: 584 NIDLSSQQWTYQTGLKGE--ELNFP-SGSSTQWDSKSTLPKLQPLVWYKTTFDAPAGSEP 640
+DL+S QW+Y+ GL GE N P S ++++ + LP + + WYKTTF +P+G+EP
Sbjct: 532 FMDLTSYQWSYKIGLSGEAKRYNDPNSPHASKFTASDNLPTGRAMTWYKTTFASPSGTEP 591
Query: 641 VAIDFTGMGKGEAWVNGQSIGRYWPTYVSQNGGCTDSCNYRGAYSSNKCLKNCGKPSQSL 700
V +D GMGKG AWVNG+S+GR+WPT ++ GC D+C+YRG+Y+ +KC+ NCG PSQ
Sbjct: 592 VVVDLLGMGKGHAWVNGKSLGRFWPTQIADAKGCPDTCDYRGSYNGDKCVTNCGNPSQRW 651
Query: 701 YHVPRSWLKSSG-NTLVLFEEIGGDPTKISFVTKQLGSSLCSHVTDSHPLPVDMWGSDSK 759
YH+PRS+L G NTL+LFEE+GG+PT +SF + ++C + +
Sbjct: 652 YHIPRSYLNKDGQNTLILFEEVGGNPTNVSFQIVAV-ETICGNAYE-------------- 696
Query: 760 IQRKPGPVLSLECPNPNQVISSIKFASFGTPLGTCGSFSRGRCSSARSLSVVRQA 814
G L L C + IS I+FAS+G P GTCG+F +G + RS +VV +
Sbjct: 697 -----GSTLELSCEG-GRTISDIQFASYGDPEGTCGAFMKGSFYATRSAAVVEKV 745
>gi|224103199|ref|XP_002312963.1| predicted protein [Populus trichocarpa]
gi|222849371|gb|EEE86918.1| predicted protein [Populus trichocarpa]
Length = 835
Score = 762 bits (1967), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 399/842 (47%), Positives = 521/842 (61%), Gaps = 66/842 (7%)
Query: 27 VTYDHRAVVIGGKRRVLISGSIHYPRSTPEMWPDLIQKSKDGGLDVIETYVFWNLHEPVR 86
VTYD R+++I GKR +L SGSIHYPRSTP+MWP+LI K+K GGL+VI+TYVFWN+HEP +
Sbjct: 31 VTYDERSLIINGKRELLFSGSIHYPRSTPDMWPELILKAKRGGLNVIQTYVFWNIHEPEQ 90
Query: 87 NQYNFEGRYDLVKFVKLVAEAGLYAHLRIGPYVCAEWNFGGFPLWLHFIPGIQFRTDNEP 146
++NFEG YDLVKF+K + E G++A LR+GP++ AEWN GG P WL IP I FR+DN P
Sbjct: 91 GKFNFEGPYDLVKFIKTIGENGMFATLRLGPFIQAEWNHGGLPYWLREIPDIIFRSDNAP 150
Query: 147 FKAEMQRFTAKIVDMMKQEKLYASQGGPIILSQIENEYGNIDSAYGAAGKSYIKWAAGMA 206
FK M++F KI+DMMK+EKL+ASQGGPIILSQIENEY + AY G SYI+WA MA
Sbjct: 151 FKHHMEKFVTKIIDMMKEEKLFASQGGPIILSQIENEYNTVQLAYKNLGVSYIQWAGNMA 210
Query: 207 LSLDTGVPWVMCQQSDAPDPIINTCNGFYC-DQFT-PNSNNKPKMWTENWSGWFLSFGGA 264
L L+TGVPWVMC+Q DAP P+INTCNG +C D FT PN NKP +WTENW+ F FG
Sbjct: 211 LGLNTGVPWVMCKQKDAPGPVINTCNGRHCGDTFTGPNKPNKPSLWTENWTAQFRVFGDP 270
Query: 265 VPYRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNFDRTSGGPFISTSYDYDAPLDEYGLI 324
R ED AF+VAR+F + G+ NYYMYHGGTNFDRT+ F++T Y +APLDEYGL
Sbjct: 271 PSQRSAEDTAFSVARWFSKNGSLVNYYMYHGGTNFDRTAAS-FVTTRYYDEAPLDEYGLQ 329
Query: 325 RQPKWGHLKDLHKAIKLCEAALVATDPTYPSLGPNLEATVYKT-GSGLCSAFLANIGTNS 383
R+PKWGHLKDLH+A+ LC+ AL+ +P L ++EA Y+ G+ +C+AFLA+ +
Sbjct: 330 REPKWGHLKDLHRALNLCKKALLWGNPNVQKLSADVEARFYEQPGTKVCAAFLASNNSKE 389
Query: 384 DVTVKFNGNSYLLPAWSVSILPDCKNVVFNTAKI----NSVTLVPSFSRQSLQVAADSSD 439
TVKF G Y LPA S+SILPDCK VV+NT + NS V S L+
Sbjct: 390 AETVKFRGQEYYLPARSISILPDCKTVVYNTMTVVSQHNSRNFVKSRKTNKLE------- 442
Query: 440 AIGSGWSYINE--PVGISKDDAFTKPGLLEQINTTADQSDYLWYSLSTNIKADEPLLEDG 497
W+ +E P + D + K E N T D++DY+W++ + N+ +
Sbjct: 443 -----WNMYSETIPAQLQVDSSLPK----ELYNLTKDKTDYVWFTTTINVDRRDMNERKR 493
Query: 498 SKTVLHVQSLGHALHAFINGKLVGSGYGSSSNAKVTVDFPIALAPGKNTFDLLSLTVGLQ 557
VL V SLGHA+ AF+NG+ +GS +GS + + L PG N LL VGL
Sbjct: 494 INPVLRVASLGHAMVAFVNGEFIGSAHGSQIEKSFVLQHSVDLKPGINFVTLLGTLVGLP 553
Query: 558 NYGAFYEKTGAGITGPVQLKGSGNGTNIDLSSQQWTYQTGLKGEELNFPSGSSTQWDSKS 617
+ GA+ E AG G V + G GT +DL+S W +Q GL GE + + K
Sbjct: 554 DSGAYMEHRYAGPRG-VSILGLNTGT-LDLTSNGWGHQVGLSGETAKL---FTKEGGGKV 608
Query: 618 TLPKLQ----PLVWYKTTFDAPAGSEPVAIDFTGMGKGEAWVNGQSIGRYWPTYVSQNGG 673
T K+Q P+ WYKT FDAP G PVA+ TGM KG W+NG+SIGRYW TYVS
Sbjct: 609 TWTKVQKAGPPVTWYKTHFDAPEGKSPVAVRMTGMNKGMIWINGKSIGRYWMTYVSP--- 665
Query: 674 CTDSCNYRGAYSSNKCLKNCGKPSQSLYHVPRSWLKSSGNTLVLFEEIGGDPTKISFVTK 733
G+P+QS YH+PRS+LK + N +V+FEE +P KI +T
Sbjct: 666 -------------------LGEPTQSEYHIPRSYLKPTDNLMVIFEEEEANPEKIEILTV 706
Query: 734 QLGSSLCSHVTDSHPLPVDMWGSDSK----IQRKPGPVLSLECPNPNQVISSIKFASFGT 789
++CS+VT+ HP V W + + P L+CPN ++I +++FASFG
Sbjct: 707 NR-DTICSYVTEYHPPSVKSWERKNNKFTPVVDNAKPAAHLKCPNQKKII-AVQFASFGD 764
Query: 790 PLGTCGSFSRGRCSSARSLSVVRQACVGSKSCSIGVSVNTFG---DPCKGVMKSLAVEAS 846
PLGTCG ++ G C S S VV + C+G SC I + F D C G+ K+LAV+
Sbjct: 765 PLGTCGDYAVGTCHSLVSKQVVEEHCLGKTSCDIPIDKGLFAGKKDDCPGISKTLAVQVK 824
Query: 847 CT 848
C+
Sbjct: 825 CS 826
>gi|242057631|ref|XP_002457961.1| hypothetical protein SORBIDRAFT_03g023500 [Sorghum bicolor]
gi|241929936|gb|EES03081.1| hypothetical protein SORBIDRAFT_03g023500 [Sorghum bicolor]
Length = 830
Score = 761 bits (1964), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 409/837 (48%), Positives = 525/837 (62%), Gaps = 54/837 (6%)
Query: 29 YDHRAVVIGGKRRVLISGSIHYPRSTPEMWPDLIQKSKDGGLDVIETYVFWNLHEPVRNQ 88
Y+ RAVVI G+RR+++SGSIHYPRSTP+MWPDLI K+K+GGL+ IETYVFWN HEP R Q
Sbjct: 30 YNDRAVVIDGQRRIILSGSIHYPRSTPQMWPDLINKAKEGGLNTIETYVFWNGHEPRRRQ 89
Query: 89 YNFEGRYDLVKFVKLVAEAGLYAHLRIGPYVCAEWNFGGFPLWLHFIPGIQFRTDNEPFK 148
YNFEG YD+V+F K + AG++A LRIGPY+C EWN+GG P WL IPG+QFR N+PF+
Sbjct: 90 YNFEGNYDIVRFFKEIQNAGMHAILRIGPYICGEWNYGGLPAWLRDIPGMQFRLHNDPFE 149
Query: 149 AEMQRFTAKIVDMMKQEKLYASQGGPIILSQIENEYGNIDSAY--GAAGKSYIKWAAGMA 206
EM+ FT IV+ MK ++A QGGPIIL+QIENEYGNI + YI W A MA
Sbjct: 150 REMETFTTLIVNKMKDANMFAGQGGPIILAQIENEYGNIMGKLENNQSASQYIHWCADMA 209
Query: 207 LSLDTGVPWVMCQQ-SDAPDPIINTCNGFYCDQFTPNSNNKPKMWTENWSGWFLSFGGAV 265
GVPW+MCQQ +D P +INTCNGFYC + PN PK+WTENW+GWF ++
Sbjct: 210 NKQKIGVPWIMCQQDNDVPHNVINTCNGFYCYDWFPNRTGIPKIWTENWTGWFKAWDKPD 269
Query: 266 PYRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNFDRTSGGPFISTSYDYDAPLDEYGLIR 325
+R ED+AFAVA FFQ+ G+ NYYMYHGGTNF RTSGGP+I+TSYDYDAPLDEYG IR
Sbjct: 270 FHRSAEDIAFAVAMFFQKRGSVHNYYMYHGGTNFGRTSGGPYITTSYDYDAPLDEYGNIR 329
Query: 326 QPKWGHLKDLHKAIKLCEAALVATDPTYPSLGPNLEATVYKT-GSGLCSAFLANIGTNSD 384
QPK+GHLKDLH +K E LV + S G N+ T Y GS +C F++N + D
Sbjct: 330 QPKYGHLKDLHNLLKSMEKILVHGEYKDTSHGKNVTVTKYTYGGSSVC--FISNQFDDRD 387
Query: 385 VTVKFNGNSYLLPAWSVSILPDCKNVVFNTAKINSVTLVPSFSRQSLQVAADSSDAIGSG 444
V V G ++L+PAWSVSILPDCK V +NTAKI + T V S++ ++
Sbjct: 388 VNVTLAG-THLVPAWSVSILPDCKTVAYNTAKIKTQTSVMVKKANSVEKEPEALR----- 441
Query: 445 WSYINE---PVGISKDDAFTKPGLLEQINTTADQSDYLWYSLSTNIKADEPLLEDGSKTV 501
WS++ E P +F + LLEQI T+ DQSDYLWY S K +GS T
Sbjct: 442 WSWMPENLKPFMTDDHGSFRQSRLLEQIATSTDQSDYLWYRTSLEHKG------EGSYT- 494
Query: 502 LHVQSLGHALHAFINGKLVGSGYGSSSNAKVTVDFPIALAPGKNTFDLLSLTVGLQNYGA 561
L+V + GH ++AF+NGKLVG S+ + P+ L GKN LLS TVGL+NYG
Sbjct: 495 LYVNTTGHKIYAFVNGKLVGQNQSSNGAFVFQLQSPVKLHSGKNYVSLLSGTVGLKNYGP 554
Query: 562 FYEKTGAGIT-GPVQLKGSGNGTNIDLSSQQWTYQTGLKGE----ELNFPSGSSTQWDSK 616
+E AGI GPV+L G+ N T IDL+ W+Y++GL GE L+ P +
Sbjct: 555 LFELVPAGIAGGPVKLVGA-NDTAIDLTHSSWSYKSGLAGEHRQIHLDKPGYKWRSHNGS 613
Query: 617 STLPKLQPLVWYKTTFDAPAGSEPVAIDFTGMGKGEAWVNGQSIGRYWPTY-VSQNGGCT 675
++P +P WYKTTF APAG E V +D G+ KG AWVNG S+GRYWP+Y ++ GGC
Sbjct: 614 GSIPVNRPFTWYKTTFAAPAGDEAVVVDLLGLNKGAAWVNGNSLGRYWPSYTAAEMGGCH 673
Query: 676 DSCNYRGAYSSN----KCLKNCGKPSQSLYHVPRSWLKSSG-NTLVLFEEIGGDPTKISF 730
+C+YRG + + +CL CG+PSQ YHVPRS+L++ NTLVLFEE GGDP + +F
Sbjct: 674 GACDYRGKFKAEGDGIRCLTGCGEPSQRFYHVPRSFLRAGEPNTLVLFEEAGGDPARAAF 733
Query: 731 VTKQLGSSLCSHVTDSHPLPVDMWGSDSKIQRKPGPVLSLECPNPNQVISSIKFASFGTP 790
T +G +C + G D LS V++S+ ASFG
Sbjct: 734 HTVAVG-HVCVAAAEV--------GDDV--------TLSCGGGLGGGVVASVDVASFGVT 776
Query: 791 LGTCGSFSRGRCSSARSLSVVRQACVGSKSCSIGVSVNTFGDPCKGVMKSLAVEASC 847
G CG + +G C S +L R ACVG +SC++ + G C+ L V+A+C
Sbjct: 777 RGGCGDY-QGGCESKAALKAFRDACVGRESCTVKYTPAFAGPGCQS--GKLTVQATC 830
>gi|302141788|emb|CBI18991.3| unnamed protein product [Vitis vinifera]
Length = 821
Score = 760 bits (1962), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 412/836 (49%), Positives = 520/836 (62%), Gaps = 56/836 (6%)
Query: 24 GANVTYDHRAVVIGGKRRVLISGSIHYPRSTPEMWPDLIQKSKDGGLDVIETYVFWNLHE 83
G +VTYD R+++I G+RR+L SGSIHYPRSTPEMWP LI K+K+GG+DVIETY FWN HE
Sbjct: 29 GGSVTYDGRSLIINGQRRLLFSGSIHYPRSTPEMWPSLISKAKEGGIDVIETYAFWNQHE 88
Query: 84 PVRNQYNFEGRYDLVKFVKLVAEAGLYAHLRIGPYVCAEWNFGGFPLWLHFIPGIQFRTD 143
P + QY+F GR D+VKF K V GLYA LRIGP++ +EWN+GG P WLH +PGI +R+D
Sbjct: 89 PKQGQYDFSGRLDIVKFFKEVQAQGLYACLRIGPFIESEWNYGGLPFWLHDVPGIIYRSD 148
Query: 144 NEPFKAEMQRFTAKIVDMMKQEKLYASQGGPIILSQIENEYGNIDSAYGAAGKSYIKWAA 203
NEPFK MQ FT KIV++MK E LYASQGGPIILSQIENEY N+++A+ G Y++WAA
Sbjct: 149 NEPFKFYMQNFTTKIVNLMKSENLYASQGGPIILSQIENEYKNVEAAFHEKGPPYVRWAA 208
Query: 204 GMALSLDTGVPWVMCQQSDAPDPIINTCNGFYCDQ--FTPNSNNKPKMWTENWSGWFLSF 261
MA+ L TGVPWVMC+Q DAPDP+IN CNG C + PN NKP +WTENW+ + +
Sbjct: 209 KMAVDLQTGVPWVMCKQDDAPDPVINACNGMKCGETFAGPNKPNKPAIWTENWTSVYEVY 268
Query: 262 GGAVPYRPVEDLAFAVARFF-QRGGTFQNYYMYHGGTNFDRTSGGPFISTSYDYDAPLDE 320
G R EDLAF VA F ++ G+F NYYMYHGGTNF RTS ++ YD APLDE
Sbjct: 269 GEDKRGRAAEDLAFQVALFIAKKNGSFINYYMYHGGTNFGRTSSSYVLTAYYD-QAPLDE 327
Query: 321 YGLIRQPKWGHLKDLHKAIKLCEAALVATDPTYPSLGPNLEATVYKTGSGLCSAFLANIG 380
YGLIRQPKWGHLK+LH IKLC L+ SLG EA ++K SG C+AFL N
Sbjct: 328 YGLIRQPKWGHLKELHAVIKLCSDTLLHGVQYNYSLGQLQEAYLFKRPSGQCAAFLVNND 387
Query: 381 TNSDVTVKFNGNSYLLPAWSVSILPDCKNVVFNTAKINSVTLVPSFSRQSLQVAADSSDA 440
+VTV F +Y L A S+SILPDCK + FNTAK+++ F+ +S+Q A
Sbjct: 388 KRRNVTVLFQNTNYELAANSISILPDCKKIAFNTAKVST-----QFNTRSVQTRATFGST 442
Query: 441 IGSGWSYINEPVGISKDDAFTKPGLLEQINTTADQSDYLWYSLSTNIKADEPLLEDGS-- 498
WS E + LLE + TT D SDYLWY+L +++ S
Sbjct: 443 --KQWSEYREGIPSFGGTPLKASMLLEHMGTTKDASDYLWYTLR--------FIQNSSNA 492
Query: 499 KTVLHVQSLGHALHAFINGKLVGSGYGSSSNAKVTVDFPIALAPGKNTFDLLSLTVGLQN 558
+ VL V SL H LHAF+NGK + S +GS N ++ + L G N LLS+ VGL +
Sbjct: 493 QPVLRVDSLAHVLHAFVNGKYIASAHGSHQNGSFSLVNKVPLNSGLNRISLLSVMVGLPD 552
Query: 559 YGAFYEKTGAGITGPVQLKGSGNGTNIDLSSQQWTYQTGLKGEELNF---PSGSSTQWDS 615
G + E AGI V+++ G+ D S W YQ GL GE+ P QW
Sbjct: 553 AGPYLEHKVAGIRR-VEIQDGGDSK--DFSKHPWGYQVGLMGEKSQIYTSPGSQKVQWHG 609
Query: 616 KSTLPKLQPLVWYKTTFDAPAGSEPVAIDFTGMGKGEAWVNGQSIGRYWPTYVSQNGGCT 675
+ + PL WYKT FDAP G++PV + F MGKGEAWVNGQSIGRYW +Y++ +
Sbjct: 610 LGSHGR-GPLTWYKTLFDAPPGNDPVVLFFGSMGKGEAWVNGQSIGRYWVSYLTPS---- 664
Query: 676 DSCNYRGAYSSNKCLKNCGKPSQSLYHVPRSWLKSSGNTLVLFEEIGGDPTKISFVTKQL 735
G+PSQ+ Y+VPR++L GN LV+ EE GDP KIS T +
Sbjct: 665 ------------------GEPSQTWYNVPRAFLNPKGNLLVVQEEESGDPLKISIGTVSV 706
Query: 736 GSSLCSHVTDSHPLPVDMW-GSDSKIQRKPG--PVLSLECPNPNQVISSIKFASFGTPLG 792
+++C HVTDSHP P+ W SD + G P + L CP P+ IS I FASFGTP+G
Sbjct: 707 -TNVCGHVTDSHPPPIISWTTSDDGNESHHGKIPKVQLRCP-PSSNISKITFASFGTPVG 764
Query: 793 TCGSFSRGRCSSARSLSVVRQACVGSKSCSIGVSVNTFG-DPCKGVMKSLAVEASC 847
C S++ G C S SL+V +AC+G CSI S+ +FG DPC G K+L V A C
Sbjct: 765 GCESYAIGSCHSPNSLAVAEKACLGKNMCSIPHSLKSFGDDPCPGTPKALLVAAQC 820
>gi|218184335|gb|EEC66762.1| hypothetical protein OsI_33138 [Oryza sativa Indica Group]
Length = 828
Score = 760 bits (1962), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 405/840 (48%), Positives = 522/840 (62%), Gaps = 61/840 (7%)
Query: 27 VTYDHRAVVIGGKRRVLISGSIHYPRSTPEMWPDLIQKSKDGGLDVIETYVFWNLHEPVR 86
V+YD R++++ G+RR++ISGSIHYPRSTPEMWPDLI+K+K+GGL+ IETYVFWN HEP R
Sbjct: 31 VSYDGRSLILDGERRIVISGSIHYPRSTPEMWPDLIKKAKEGGLNAIETYVFWNGHEPRR 90
Query: 87 NQYNFEGRYDLVKFVKLVAEAGLYAHLRIGPYVCAEWNFGGFPLWLHFIPGIQFRTDNEP 146
++NFEG YD+V+F K + AG+YA LRIGPY+C EWN+GG P+WL IPGI+FR N+P
Sbjct: 91 REFNFEGNYDVVRFFKEIQNAGMYAILRIGPYICGEWNYGGLPVWLRDIPGIKFRLHNKP 150
Query: 147 FKAEMQRFTAKIVDMMKQEKLYASQGGPIILSQIENEYG-------NIDSAYGAAGKSYI 199
F+ EM+ FT IV MK ++A QGGPIIL+QIENEYG NI SA+ YI
Sbjct: 151 FENEMEAFTTLIVKKMKDANMFAGQGGPIILAQIENEYGYTMLQPENIQSAH-----EYI 205
Query: 200 KWAAGMALSLDTGVPWVMCQQ-SDAPDPIINTCNGFYCDQFTPNSNNKPKMWTENWSGWF 258
W A MA + GVPW+MCQQ +D P ++NTCNGFYC ++ N + PKMWTENW+GW+
Sbjct: 206 HWCADMANKQNVGVPWIMCQQDNDVPPNVVNTCNGFYCHEWFSNRTSIPKMWTENWTGWY 265
Query: 259 LSFGGAVPYRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNFDRTSGGPFISTSYDYDAPL 318
+ RP ED+AFAVA FFQ G+ QNYYMYHGGTNF RT+GGP+I+TSYDYDAPL
Sbjct: 266 RDWDQPEFRRPTEDIAFAVAMFFQMRGSLQNYYMYHGGTNFGRTAGGPYITTSYDYDAPL 325
Query: 319 DEYGLIRQPKWGHLKDLHKAIKLCEAALVATDPTYPSLGPNLEATVYKTGSGLCSAFLAN 378
DEYG +RQPK+GHLK+LH + E L+ D + G N+ T Y T + + F+ N
Sbjct: 326 DEYGNLRQPKYGHLKELHSVLMSMEKILLHGDYIDTNYGDNVTVTKY-TLNATSACFINN 384
Query: 379 IGTNSDVTVKFNGNSYLLPAWSVSILPDCKNVVFNTAKINSVTLVPSFSRQSLQVAADSS 438
+ DV V +G ++ LPAWSVSILPDCK V FN+AKI + T V ++ +
Sbjct: 385 RFDDRDVNVTLDGTTHFLPAWSVSILPDCKTVAFNSAKIKTQTTVMVNKTSMVEQQTEHF 444
Query: 439 DAIGSGWSYINE---PVGISKDDAFTKPGLLEQINTTADQSDYLWYSLSTNIKADEPLLE 495
WS++ E P + F K LLEQI TT DQSDYLWY S K +
Sbjct: 445 K-----WSWMPENLRPFMTDEKGNFRKNELLEQIVTTTDQSDYLWYRTSLEHKGE----- 494
Query: 496 DGSKTVLHVQSLGHALHAFINGKLVGSGYGSSSNAKVTVDFPIALAPGKNTFDLLSLTVG 555
VL+V + GH L+AF+NGKLVG Y + N + P+ L GKN LLS TVG
Sbjct: 495 --GSYVLYVNTTGHELYAFVNGKLVGQQYSPNENFTFQLKSPVKLHDGKNYISLLSGTVG 552
Query: 556 LQNYGAFYEKTGAGIT-GPVQLKGSGNGTNIDLSSQQWTYQTGLKGEELN-FPSGSSTQW 613
L+NYG +E AGI GPV+L S +G+ IDLS+ W+Y+ GL GE + +W
Sbjct: 553 LRNYGGSFELLPAGIVGGPVKLIDS-SGSAIDLSNNSWSYKAGLAGEYRKIYLDKPGNKW 611
Query: 614 DS-KSTLPKLQPLVWYKTTFDAPAGSEPVAIDFTGMGKGEAWVNGQSIGRYWPTYVSQNG 672
S ST+P +P WYKTTF APAG + V +D G+ KG AWVNG S+GRYWP+YV+ +
Sbjct: 612 RSHNSTIPINRPFTWYKTTFQAPAGEDSVVVDLHGLNKGVAWVNGNSLGRYWPSYVAADM 671
Query: 673 GCTDSCNYRGAY----SSNKCLKNCGKPSQSLYHVPRSWL-KSSGNTLVLFEEIGGDPTK 727
C+YRG + + KCL CG+PSQ LYHVPRS+L K NTL+LFEE GGDP++
Sbjct: 672 PGCHHCDYRGVFKAEVEAQKCLTGCGEPSQQLYHVPRSFLHKGEPNTLILFEEAGGDPSE 731
Query: 728 ISFVTKQLGSSLCSHVTDSHPLPVDMWGSDSKIQRKPGPVLSLECPNPNQVISSIKFASF 787
++ T GS S + G ++L C + ISS+ ASF
Sbjct: 732 VAVRTVVEGSVCAS--------------------AELGDTVTLSCGAHGRTISSVDVASF 771
Query: 788 GTPLGTCGSFSRGRCSSARSLSVVRQACVGSKSCSIGVSVNTFGDPCKGVMKSLAVEASC 847
G G CGS+ G C S + ACVG +SC++ V+ C V L V+A+C
Sbjct: 772 GVARGRCGSYD-GGCDSKVAYDAFAAACVGKESCTVLVTDAFANAGC--VSGVLTVQATC 828
>gi|225459613|ref|XP_002284529.1| PREDICTED: beta-galactosidase 16-like [Vitis vinifera]
Length = 813
Score = 759 bits (1961), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 412/836 (49%), Positives = 520/836 (62%), Gaps = 56/836 (6%)
Query: 24 GANVTYDHRAVVIGGKRRVLISGSIHYPRSTPEMWPDLIQKSKDGGLDVIETYVFWNLHE 83
G +VTYD R+++I G+RR+L SGSIHYPRSTPEMWP LI K+K+GG+DVIETY FWN HE
Sbjct: 21 GGSVTYDGRSLIINGQRRLLFSGSIHYPRSTPEMWPSLISKAKEGGIDVIETYAFWNQHE 80
Query: 84 PVRNQYNFEGRYDLVKFVKLVAEAGLYAHLRIGPYVCAEWNFGGFPLWLHFIPGIQFRTD 143
P + QY+F GR D+VKF K V GLYA LRIGP++ +EWN+GG P WLH +PGI +R+D
Sbjct: 81 PKQGQYDFSGRLDIVKFFKEVQAQGLYACLRIGPFIESEWNYGGLPFWLHDVPGIIYRSD 140
Query: 144 NEPFKAEMQRFTAKIVDMMKQEKLYASQGGPIILSQIENEYGNIDSAYGAAGKSYIKWAA 203
NEPFK MQ FT KIV++MK E LYASQGGPIILSQIENEY N+++A+ G Y++WAA
Sbjct: 141 NEPFKFYMQNFTTKIVNLMKSENLYASQGGPIILSQIENEYKNVEAAFHEKGPPYVRWAA 200
Query: 204 GMALSLDTGVPWVMCQQSDAPDPIINTCNGFYCDQ--FTPNSNNKPKMWTENWSGWFLSF 261
MA+ L TGVPWVMC+Q DAPDP+IN CNG C + PN NKP +WTENW+ + +
Sbjct: 201 KMAVDLQTGVPWVMCKQDDAPDPVINACNGMKCGETFAGPNKPNKPAIWTENWTSVYEVY 260
Query: 262 GGAVPYRPVEDLAFAVARFF-QRGGTFQNYYMYHGGTNFDRTSGGPFISTSYDYDAPLDE 320
G R EDLAF VA F ++ G+F NYYMYHGGTNF RTS ++ YD APLDE
Sbjct: 261 GEDKRGRAAEDLAFQVALFIAKKNGSFINYYMYHGGTNFGRTSSSYVLTAYYD-QAPLDE 319
Query: 321 YGLIRQPKWGHLKDLHKAIKLCEAALVATDPTYPSLGPNLEATVYKTGSGLCSAFLANIG 380
YGLIRQPKWGHLK+LH IKLC L+ SLG EA ++K SG C+AFL N
Sbjct: 320 YGLIRQPKWGHLKELHAVIKLCSDTLLHGVQYNYSLGQLQEAYLFKRPSGQCAAFLVNND 379
Query: 381 TNSDVTVKFNGNSYLLPAWSVSILPDCKNVVFNTAKINSVTLVPSFSRQSLQVAADSSDA 440
+VTV F +Y L A S+SILPDCK + FNTAK+++ F+ +S+Q A
Sbjct: 380 KRRNVTVLFQNTNYELAANSISILPDCKKIAFNTAKVST-----QFNTRSVQTRATFGST 434
Query: 441 IGSGWSYINEPVGISKDDAFTKPGLLEQINTTADQSDYLWYSLSTNIKADEPLLEDGS-- 498
WS E + LLE + TT D SDYLWY+L +++ S
Sbjct: 435 --KQWSEYREGIPSFGGTPLKASMLLEHMGTTKDASDYLWYTLR--------FIQNSSNA 484
Query: 499 KTVLHVQSLGHALHAFINGKLVGSGYGSSSNAKVTVDFPIALAPGKNTFDLLSLTVGLQN 558
+ VL V SL H LHAF+NGK + S +GS N ++ + L G N LLS+ VGL +
Sbjct: 485 QPVLRVDSLAHVLHAFVNGKYIASAHGSHQNGSFSLVNKVPLNSGLNRISLLSVMVGLPD 544
Query: 559 YGAFYEKTGAGITGPVQLKGSGNGTNIDLSSQQWTYQTGLKGEELNF---PSGSSTQWDS 615
G + E AGI V+++ G+ D S W YQ GL GE+ P QW
Sbjct: 545 AGPYLEHKVAGIRR-VEIQDGGDSK--DFSKHPWGYQVGLMGEKSQIYTSPGSQKVQWHG 601
Query: 616 KSTLPKLQPLVWYKTTFDAPAGSEPVAIDFTGMGKGEAWVNGQSIGRYWPTYVSQNGGCT 675
+ + PL WYKT FDAP G++PV + F MGKGEAWVNGQSIGRYW +Y++ +
Sbjct: 602 LGSHGR-GPLTWYKTLFDAPPGNDPVVLFFGSMGKGEAWVNGQSIGRYWVSYLTPS---- 656
Query: 676 DSCNYRGAYSSNKCLKNCGKPSQSLYHVPRSWLKSSGNTLVLFEEIGGDPTKISFVTKQL 735
G+PSQ+ Y+VPR++L GN LV+ EE GDP KIS T +
Sbjct: 657 ------------------GEPSQTWYNVPRAFLNPKGNLLVVQEEESGDPLKISIGTVSV 698
Query: 736 GSSLCSHVTDSHPLPVDMW-GSDSKIQRKPG--PVLSLECPNPNQVISSIKFASFGTPLG 792
+++C HVTDSHP P+ W SD + G P + L CP P+ IS I FASFGTP+G
Sbjct: 699 -TNVCGHVTDSHPPPIISWTTSDDGNESHHGKIPKVQLRCP-PSSNISKITFASFGTPVG 756
Query: 793 TCGSFSRGRCSSARSLSVVRQACVGSKSCSIGVSVNTFG-DPCKGVMKSLAVEASC 847
C S++ G C S SL+V +AC+G CSI S+ +FG DPC G K+L V A C
Sbjct: 757 GCESYAIGSCHSPNSLAVAEKACLGKNMCSIPHSLKSFGDDPCPGTPKALLVAAQC 812
>gi|115437264|ref|NP_001043252.1| Os01g0533400 [Oryza sativa Japonica Group]
gi|75158475|sp|Q8RUV9.1|BGAL1_ORYSJ RecName: Full=Beta-galactosidase 1; Short=Lactase 1; Flags:
Precursor
gi|20146357|dbj|BAB89138.1| putative beta-galactosidase [Oryza sativa Japonica Group]
gi|20161405|dbj|BAB90329.1| putative beta-galactosidase [Oryza sativa Japonica Group]
gi|113532783|dbj|BAF05166.1| Os01g0533400 [Oryza sativa Japonica Group]
gi|215767421|dbj|BAG99649.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 827
Score = 757 bits (1955), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 399/838 (47%), Positives = 521/838 (62%), Gaps = 58/838 (6%)
Query: 27 VTYDHRAVVIGGKRRVLISGSIHYPRSTPEMWPDLIQKSKDGGLDVIETYVFWNLHEPVR 86
V+YD R++VI G+RR+++SGSIHYPRSTPEMWPDLI+K+K+GGLD IETY+FWN HEP R
Sbjct: 31 VSYDDRSLVIDGQRRIILSGSIHYPRSTPEMWPDLIKKAKEGGLDAIETYIFWNGHEPHR 90
Query: 87 NQYNFEGRYDLVKFVKLVAEAGLYAHLRIGPYVCAEWNFGGFPLWLHFIPGIQFRTDNEP 146
QYNFEG YD+V+F K + AG+YA LRIGPY+C EWN+GG P WL IPG+QFR NEP
Sbjct: 91 RQYNFEGNYDVVRFFKEIQNAGMYAILRIGPYICGEWNYGGLPAWLRDIPGMQFRLHNEP 150
Query: 147 FKAEMQRFTAKIVDMMKQEKLYASQGGPIILSQIENEYGNIDSAY--GAAGKSYIKWAAG 204
F+ EM+ FT IV+ MK K++A QGGPIIL+QIENEYGNI + YI W A
Sbjct: 151 FENEMETFTTLIVNKMKDSKMFAEQGGPIILAQIENEYGNIMGKLNNNQSASEYIHWCAD 210
Query: 205 MALSLDTGVPWVMCQQ-SDAPDPIINTCNGFYCDQFTPNSNNKPKMWTENWSGWFLSFGG 263
MA + GVPW+MCQQ D P ++NTCNGFYC + PN PK+WTENW+GWF ++
Sbjct: 211 MANKQNVGVPWIMCQQDDDVPHNVVNTCNGFYCHDWFPNRTGIPKIWTENWTGWFKAWDK 270
Query: 264 AVPYRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNFDRTSGGPFISTSYDYDAPLDEYGL 323
+R ED+AFAVA FFQ+ G+ QNYYMYHGGTNF RTSGGP+I+TSYDYDAPLDEYG
Sbjct: 271 PDFHRSAEDIAFAVAMFFQKRGSLQNYYMYHGGTNFGRTSGGPYITTSYDYDAPLDEYGN 330
Query: 324 IRQPKWGHLKDLHKAIKLCEAALVATDPTYPSLGPNLEATVYKTGSGLCSAFLANIGTNS 383
+RQPK+GHLK+LH +K E LV + + G N+ T Y S + F+ N +
Sbjct: 331 LRQPKYGHLKELHSVLKSMEKTLVHGEYFDTNYGDNITVTKYTLDSS-SACFINNRFDDK 389
Query: 384 DVTVKFNGNSYLLPAWSVSILPDCKNVVFNTAKINSVTLVPSFSRQSLQVAADSSDAIGS 443
DV V +G ++LLPAWSVSILPDCK V FN+AKI + T V + + +S
Sbjct: 390 DVNVTLDGATHLLPAWSVSILPDCKTVAFNSAKIKTQTSVMVKKPNTAEQEQESLK---- 445
Query: 444 GWSYINE---PVGISKDDAFTKPGLLEQINTTADQSDYLWYSLSTNIKADEPLLEDGSKT 500
WS++ E P + F K LLEQI T+ DQSDYLWY S N K +
Sbjct: 446 -WSWMPENLSPFMTDEKGNFRKNELLEQIVTSTDQSDYLWYRTSLNHKGE-------GSY 497
Query: 501 VLHVQSLGHALHAFINGKLVGSGYGSSSNAKVTVDFPIALAPGKNTFDLLSLTVGLQNYG 560
L+V + GH L+AF+NGKL+G + + + ++ P+ L GKN LLS TVGL+NYG
Sbjct: 498 KLYVNTTGHELYAFVNGKLIGKNHSADGDFVFQLESPVKLHDGKNYISLLSATVGLKNYG 557
Query: 561 AFYEKTGAGIT-GPVQLKGSGNGTNIDLSSQQWTYQTGLKGE----ELNFPSGSSTQWD- 614
+EK GI GPV+L S NGT IDLS+ W+Y+ GL E L+ P +W+
Sbjct: 558 PSFEKMPTGIVGGPVKLIDS-NGTAIDLSNSSWSYKAGLASEYRQIHLDKP---GYKWNG 613
Query: 615 SKSTLPKLQPLVWYKTTFDAPAGSEPVAIDFTGMGKGEAWVNGQSIGRYWPTYVSQNGGC 674
+ T+P +P WYK TF+AP+G + V +D G+ KG AWVNG ++GRYWP+Y +
Sbjct: 614 NNGTIPINRPFTWYKATFEAPSGEDAVVVDLLGLNKGVAWVNGNNLGRYWPSYTAAEMAG 673
Query: 675 TDSCNYRGAYSSN----KCLKNCGKPSQSLYHVPRSWLKSSG-NTLVLFEEIGGDPTKIS 729
C+YRGA+ + +CL CG+PSQ YHVPRS+L + NTL+LFEE GGDP+ ++
Sbjct: 674 CHRCDYRGAFQAEGDGTRCLTGCGEPSQRYYHVPRSFLAAGEPNTLLLFEEAGGDPSGVA 733
Query: 730 FVTKQLGSSLCSHVTDSHPLPVDMWGSDSKIQRKPGPVLSLECPNPNQVISSIKFASFGT 789
T G+ S + G ++L C +SS+ ASFG
Sbjct: 734 LRTVVPGAVCTS--------------------GEAGDAVTLSC-GGGHAVSSVDVASFGV 772
Query: 790 PLGTCGSFSRGRCSSARSLSVVRQACVGSKSCSIGVSVNTFGDPCKGVMKSLAVEASC 847
G CG + G C S + ACVG +SC++ ++ G C + L V+A+C
Sbjct: 773 GRGRCGGY-EGGCESKAAYEAFTAACVGKESCTVEITGAFAGAGC--LSGVLTVQATC 827
>gi|356532710|ref|XP_003534914.1| PREDICTED: beta-galactosidase 1-like [Glycine max]
Length = 650
Score = 756 bits (1952), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 373/629 (59%), Positives = 444/629 (70%), Gaps = 26/629 (4%)
Query: 6 ILLLVLCWGFVVLATTSFGANVTYDHRAVVIGGKRRVLISGSIHYPRSTPEMWPDLIQKS 65
++L++LC L A+VTYDH+A+V+ GKRR+LISGSIHYPRSTP+MWPDLIQK+
Sbjct: 9 VVLMMLC-----LWVCGVTASVTYDHKAIVVDGKRRILISGSIHYPRSTPQMWPDLIQKA 63
Query: 66 KDGGLDVIETYVFWNLHEPVRNQYNFEGRYDLVKFVKLVAEAGLYAHLRIGPYVCAEWNF 125
KDGGLDVI+TYVFWN HEP QY FE R+DLVKFVKL +AGLY HLRIGPY+CAEWN
Sbjct: 64 KDGGLDVIQTYVFWNGHEPSPGQYYFEDRFDLVKFVKLAQQAGLYVHLRIGPYICAEWNL 123
Query: 126 GGFPLWLHFIPGIQFRTDNEPFKAEMQRFTAKIVDMMKQEKLYASQGGPIILSQIENEYG 185
GGFP+WL ++PGI FRTDNEPFKA MQ+FTAKIV +MK+ +L+ SQGGPIILSQIENEYG
Sbjct: 124 GGFPVWLKYVPGIAFRTDNEPFKAAMQKFTAKIVSLMKENRLFQSQGGPIILSQIENEYG 183
Query: 186 NIDSAYGAAGKSYIKWAAGMALSLDTGVPWVMCQQSDAPDPIINTCNGFYCDQFTPNSNN 245
++ GA GK+Y KWAA MA+ LDTGVPWVMC+Q DAPDP+I+TCNGFYC+ F PN N
Sbjct: 184 PVEWEIGAPGKAYTKWAAQMAVGLDTGVPWVMCKQEDAPDPVIDTCNGFYCENFKPNKNT 243
Query: 246 KPKMWTENWSGWFLSFGGAVPYRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNFDRTSGG 305
KPKMWTENW+GW+ FGGAVP RP EDLAF+VARF Q GG+F NYYMYHGGTNF RTSGG
Sbjct: 244 KPKMWTENWTGWYTDFGGAVPRRPAEDLAFSVARFIQNGGSFVNYYMYHGGTNFGRTSGG 303
Query: 306 PFISTSYDYDAPLDEYGLIRQPKWGHLKDLHKAIKLCEAALVATDPTYPSLGPNLEATVY 365
FI+TSYDYDAPLDEYGL +PK+ HL+ LHKAIK E ALVATDP SLG NLEA V+
Sbjct: 304 LFIATSYDYDAPLDEYGLENEPKYEHLRALHKAIKQSEPALVATDPKVQSLGYNLEAHVF 363
Query: 366 KTGSGLCSAFLANIGTNSDVTVKFNGNSYLLPAWSVSILPDCKNVVFNTAKINS---VTL 422
+ G C+AF+AN T S KF Y LP WS+SILPDCK VV+NTAK+ +
Sbjct: 364 -SAPGACAAFIANYDTKSYAKAKFGNGQYDLPPWSISILPDCKTVVYNTAKVGYGWLKKM 422
Query: 423 VPSFSRQSLQVAADSSDAIGSGWSYINEPVGISKDDAFTKPGLLEQINTTADQSDYLWYS 482
P S + Q SY EP S+ D+ L EQ+N T D SDYLWY
Sbjct: 423 TPVNSAFAWQ-------------SYNEEPASSSQADSIAAYALWEQVNVTRDSSDYLWYM 469
Query: 483 LSTNIKADEPLLEDGSKTVLHVQSLGHALHAFINGKLVGSGYGSSSNAKVTVDFPIALAP 542
N+ A+E L++G +L V S GH LH FING+L G+ +G N K+T + L
Sbjct: 470 TDVNVNANEGFLKNGQSPLLTVMSAGHVLHVFINGQLAGTVWGGLGNPKLTFSDNVKLRA 529
Query: 543 GKNTFDLLSLTVGLQNYGAFYEKTGAGITGPVQLKGSGNGTNIDLSSQQWTYQTGLKGEE 602
G N LLS+ VGL N G +E AG+ GPV LKG GT DLS Q+W+Y+ GLKGE
Sbjct: 530 GNNKLSLLSVAVGLPNVGVHFETWNAGVLGPVTLKGLNEGTR-DLSRQKWSYKVGLKGES 588
Query: 603 LNFPS---GSSTQWDSKSTLPKLQPLVWY 628
L+ + SS +W S + K QPL WY
Sbjct: 589 LSLHTESGSSSVEWIQGSLVAKKQPLTWY 617
Score = 52.4 bits (124), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 22/33 (66%), Positives = 26/33 (78%)
Query: 701 YHVPRSWLKSSGNTLVLFEEIGGDPTKISFVTK 733
YHVPRSWL S GN+LV+FEE GGDP I+ V +
Sbjct: 617 YHVPRSWLSSGGNSLVVFEEWGGDPNGIALVKR 649
>gi|222612650|gb|EEE50782.1| hypothetical protein OsJ_31141 [Oryza sativa Japonica Group]
Length = 828
Score = 755 bits (1949), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 403/840 (47%), Positives = 521/840 (62%), Gaps = 61/840 (7%)
Query: 27 VTYDHRAVVIGGKRRVLISGSIHYPRSTPEMWPDLIQKSKDGGLDVIETYVFWNLHEPVR 86
V+YD R++++ G+RR++ISGSIHYPRSTPEMWPDLI+K+K+GGL+ IETYVFWN HEP R
Sbjct: 31 VSYDGRSLILDGERRIVISGSIHYPRSTPEMWPDLIKKAKEGGLNAIETYVFWNGHEPRR 90
Query: 87 NQYNFEGRYDLVKFVKLVAEAGLYAHLRIGPYVCAEWNFGGFPLWLHFIPGIQFRTDNEP 146
++NFEG YD+V+F K + AG+YA LRIGPY+C EWN+GG P+WL IPGI+FR N+P
Sbjct: 91 REFNFEGNYDVVRFFKEIQNAGMYAILRIGPYICGEWNYGGLPVWLRDIPGIKFRLHNKP 150
Query: 147 FKAEMQRFTAKIVDMMKQEKLYASQGGPIILSQIENEYG-------NIDSAYGAAGKSYI 199
F+ M+ FT IV MK ++A QGGPIIL+QIENEYG NI SA+ YI
Sbjct: 151 FENGMEAFTTLIVKKMKDANMFAGQGGPIILAQIENEYGYTMLQPENIQSAH-----EYI 205
Query: 200 KWAAGMALSLDTGVPWVMCQQ-SDAPDPIINTCNGFYCDQFTPNSNNKPKMWTENWSGWF 258
W A MA + GVPW+MCQQ +D P ++NTCNGFYC ++ N + PKMWTENW+GW+
Sbjct: 206 HWCADMANKQNVGVPWIMCQQDNDVPPNVVNTCNGFYCHEWFSNRTSIPKMWTENWTGWY 265
Query: 259 LSFGGAVPYRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNFDRTSGGPFISTSYDYDAPL 318
+ RP ED+AFAVA FFQ G+ QNYYMYHGGTNF RT+GGP+I+TSYDYDAPL
Sbjct: 266 RDWDQPEFRRPTEDIAFAVAMFFQMRGSLQNYYMYHGGTNFGRTAGGPYITTSYDYDAPL 325
Query: 319 DEYGLIRQPKWGHLKDLHKAIKLCEAALVATDPTYPSLGPNLEATVYKTGSGLCSAFLAN 378
DEYG +RQPK+GHLK+LH + E L+ D + G N+ T Y T + + F+ N
Sbjct: 326 DEYGNLRQPKYGHLKELHSVLMSMEKILLHGDYIDTNYGDNVTVTKY-TLNATSACFINN 384
Query: 379 IGTNSDVTVKFNGNSYLLPAWSVSILPDCKNVVFNTAKINSVTLVPSFSRQSLQVAADSS 438
+ DV V +G ++ LPAWSVSILP+CK V FN+AKI + T V ++ +
Sbjct: 385 RFDDRDVNVTLDGTTHFLPAWSVSILPNCKTVAFNSAKIKTQTTVMVNKTSMVEQQTEHF 444
Query: 439 DAIGSGWSYINE---PVGISKDDAFTKPGLLEQINTTADQSDYLWYSLSTNIKADEPLLE 495
WS++ E P + F K LLEQI TT DQSDYLWY S K +
Sbjct: 445 K-----WSWMPENLRPFMTDEKGNFRKNELLEQIVTTTDQSDYLWYRTSLEHKGE----- 494
Query: 496 DGSKTVLHVQSLGHALHAFINGKLVGSGYGSSSNAKVTVDFPIALAPGKNTFDLLSLTVG 555
VL+V + GH L+AF+NGKLVG Y + N + P+ L GKN LLS TVG
Sbjct: 495 --GSYVLYVNTTGHELYAFVNGKLVGQQYSPNENFTFQLKSPVKLHDGKNYISLLSGTVG 552
Query: 556 LQNYGAFYEKTGAGIT-GPVQLKGSGNGTNIDLSSQQWTYQTGLKGEELN-FPSGSSTQW 613
L+NYG +E AGI GPV+L S +G+ IDLS+ W+Y+ GL GE + +W
Sbjct: 553 LRNYGGSFELLPAGIVGGPVKLIDS-SGSAIDLSNNSWSYKAGLAGEYRKIYLDKPGNKW 611
Query: 614 DS-KSTLPKLQPLVWYKTTFDAPAGSEPVAIDFTGMGKGEAWVNGQSIGRYWPTYVSQNG 672
S ST+P +P WYKTTF APAG + V +D G+ KG AWVNG S+GRYWP+YV+ +
Sbjct: 612 RSHNSTIPINRPFTWYKTTFQAPAGEDSVVVDLHGLNKGVAWVNGNSLGRYWPSYVAADM 671
Query: 673 GCTDSCNYRGAY----SSNKCLKNCGKPSQSLYHVPRSWL-KSSGNTLVLFEEIGGDPTK 727
C+YRG + + KCL CG+PSQ LYHVPRS+L K NTL+LFEE GGDP++
Sbjct: 672 PGCHHCDYRGVFKAEVEAQKCLTGCGEPSQQLYHVPRSFLNKGEPNTLILFEEAGGDPSE 731
Query: 728 ISFVTKQLGSSLCSHVTDSHPLPVDMWGSDSKIQRKPGPVLSLECPNPNQVISSIKFASF 787
++ T GS S + G ++L C + ISS+ ASF
Sbjct: 732 VAVRTVVEGSVCAS--------------------AEVGDTVTLSCGAHGRTISSVDVASF 771
Query: 788 GTPLGTCGSFSRGRCSSARSLSVVRQACVGSKSCSIGVSVNTFGDPCKGVMKSLAVEASC 847
G G CGS+ G C S + ACVG +SC++ V+ C V L V+A+C
Sbjct: 772 GVARGRCGSYD-GGCESKVAYDAFAAACVGKESCTVLVTDAFANAGC--VSGVLTVQATC 828
>gi|413957070|gb|AFW89719.1| hypothetical protein ZEAMMB73_400203 [Zea mays]
Length = 809
Score = 754 bits (1946), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 395/778 (50%), Positives = 498/778 (64%), Gaps = 78/778 (10%)
Query: 27 VTYDHRAVVIGGKRRVLISGSIHYPRSTP--------------------------EMWPD 60
VTYD +AV+I G+RR+L SGSIHYPRSTP EMW
Sbjct: 27 VTYDKKAVLIDGQRRILFSGSIHYPRSTPDVTAFYKISSPPTIPWRGLWLRIYGSEMWEG 86
Query: 61 LIQKSKDGGLDVIETYVFWNLHEPVRNQYNFEGRYDLVKFVKLVAEAGLYAHLRIGPYVC 120
LIQK+KDGGLDVI+TYVFWN HEP G++ R Y
Sbjct: 87 LIQKAKDGGLDVIQTYVFWNGHEPTPGN----------------DSDGIF--FRFEQYYF 128
Query: 121 AEWNFGGFPLWLHFIPGIQFRTDNEPFKAEMQRFTAKIVDMMKQEKLYASQGGPIILSQ- 179
E GFP+WL ++PGI FRTDNEPFK MQ FT KIV MMK E L+ASQGGPIILSQ
Sbjct: 129 EE---SGFPVWLKYVPGISFRTDNEPFKTAMQGFTEKIVGMMKSENLFASQGGPIILSQA 185
Query: 180 --------IENEYGNIDSAYGAAGKSYIKWAAGMALSLDTGVPWVMCQQSDAPDPIINTC 231
IENEYG +GAAG++YI WAA MA+ L TGVPWVMC++ DAPDP+IN C
Sbjct: 186 SIIFSLDLIENEYGPEGREFGAAGQAYINWAAKMAVGLGTGVPWVMCKEEDAPDPVINAC 245
Query: 232 NGFYCDQFTPNSNNKPKMWTENWSGWFLSFGGAVPYRPVEDLAFAVARFFQRGGTFQNYY 291
NGFYCD F+PN KP MWTE WSGWF FGG + RPVEDLAFAVARF Q+GG+F NYY
Sbjct: 246 NGFYCDAFSPNKPYKPTMWTEAWSGWFTEFGGTIRQRPVEDLAFAVARFVQKGGSFINYY 305
Query: 292 MYHGGTNFDRTSGGPFISTSYDYDAPLDEYGLIRQPKWGHLKDLHKAIKLCEAALVATDP 351
MYHGGTNF RT+GGPFI+TSYDYDAP+DEYGL+R+PK HLK+LH+A+KLCE ALV+ DP
Sbjct: 306 MYHGGTNFGRTAGGPFITTSYDYDAPIDEYGLVREPKHSHLKELHRAVKLCEQALVSVDP 365
Query: 352 TYPSLGPNLEATVYKTGSGLCSAFLANIGTNSDVTVKFNGNSYLLPAWSVSILPDCKNVV 411
+LG EA V+++ SG C+AFLAN +NS V FN Y LP WS+SILPDCKNVV
Sbjct: 366 AITTLGTMQEARVFQSPSG-CAAFLANYNSNSYAKVVFNNEQYSLPPWSISILPDCKNVV 424
Query: 412 FNTAKINSVTLVPSFSRQSLQVAADSSDAIGSGWSYINEPV-GISKDDAFTKPGLLEQIN 470
FN+A + T +Q+ D + ++ W +E V ++ T GLLEQ+N
Sbjct: 425 FNSATVGVQT-------SQMQMWGDGASSM--TWERYDEEVDSLAAAPLLTTTGLLEQLN 475
Query: 471 TTADQSDYLWYSLSTNIKADEPLLEDGSKTV-LHVQSLGHALHAFINGKLVGSGYGSSSN 529
T D SDYLWY S +I + E L+ G K + L VQS GHALH F+NG+L GS YG+ +
Sbjct: 476 VTRDSSDYLWYITSVDISSSENFLQGGGKPLSLSVQSAGHALHVFVNGQLQGSAYGTRED 535
Query: 530 AKVTVDFPIALAPGKNTFDLLSLTVGLQNYGAFYEKTGAGITGPVQLKGSGNGTNIDLSS 589
++ + +L G N LLS+ GL N G YE G+ GPV L G G+ DL+
Sbjct: 536 RRIKYNGNASLRAGTNKIALLSVACGLPNVGVHYETWNTGVGGPVVLHGLDEGSR-DLTW 594
Query: 590 QQWTYQTGLKGEELNFPS---GSSTQWDSKSTLPK-LQPLVWYKTTFDAPAGSEPVAIDF 645
Q W+YQ GLKGE++N S SS +W S + + QPL WY+ F+ P+G EP+A+D
Sbjct: 595 QTWSYQVGLKGEQMNLNSIEGSSSVEWMQGSLIAQNQQPLAWYRAYFETPSGDEPLALDM 654
Query: 646 TGMGKGEAWVNGQSIGRYWPTYVSQNGGCTDSCNYRGAYSSNKCLKNCGKPSQSLYHVPR 705
MGKG+ W+NGQSIGRYW Y +G C + C+Y G + + KC CG+P+Q YHVP+
Sbjct: 655 GSMGKGQIWINGQSIGRYWTAYA--DGDCKE-CSYTGTFRAPKCQSGCGQPTQRWYHVPK 711
Query: 706 SWLKSSGNTLVLFEEIGGDPTKISFVTKQLGSSLCSHVTDSHPLPVDMWGSDSKIQRK 763
SWL+ + N LV+FEE+GGD +KI+ V + + SS+C+ V++ HP + W +S +R+
Sbjct: 712 SWLQPTRNLLVVFEELGGDSSKIALVKRSV-SSVCADVSEDHP-NIKNWQIESYGERE 767
>gi|224066807|ref|XP_002302225.1| predicted protein [Populus trichocarpa]
gi|222843951|gb|EEE81498.1| predicted protein [Populus trichocarpa]
Length = 798
Score = 753 bits (1943), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 394/837 (47%), Positives = 526/837 (62%), Gaps = 57/837 (6%)
Query: 24 GANVTYDHRAVVIGGKRRVLISGSIHYPRSTPEMWPDLIQKSKDGGLDVIETYVFWNLHE 83
G+NVTYD R++VI GK +++ SGSIHYPRSTP+MWP LI K++ GGLD I+TYVFWNLHE
Sbjct: 5 GSNVTYDSRSLVINGKHKIIFSGSIHYPRSTPQMWPYLISKARAGGLDAIDTYVFWNLHE 64
Query: 84 PVRNQYNFEGRYDLVKFVKLVAEAGLYAHLRIGPYVCAEWNFGGFPLWLHFIPGIQFRTD 143
P + QY+F GR DLV+F+K V GLY LRIGP++ +EW +GG P WLH +PGI FR+D
Sbjct: 65 PQQGQYDFSGRKDLVRFIKEVHAQGLYVCLRIGPFIESEWTYGGLPFWLHDVPGIVFRSD 124
Query: 144 NEPFKAEMQRFTAKIVDMMKQEKLYASQGGPIILSQIENEYGNIDSAYGAAGKSYIKWAA 203
N+PFK M+R+ IV M+K EKLYASQGGPIILSQIENEYGN+++A+ G Y+KWAA
Sbjct: 125 NKPFKYHMERYAKMIVKMLKAEKLYASQGGPIILSQIENEYGNVEAAFHEKGPPYVKWAA 184
Query: 204 GMALSLDTGVPWVMCQQSDAPDPIINTCNGFYC-DQFT-PNSNNKPKMWTENWSGWFLSF 261
MA+ L TGVPWVMC+Q DAPDP+IN CNG C + F+ PNS KP +WTENW+ + ++
Sbjct: 185 KMAVGLHTGVPWVMCKQDDAPDPVINACNGLRCGETFSGPNSPRKPAIWTENWTSVYQTY 244
Query: 262 GGAVPYRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNFDRTSGGPFISTSYDYDAPLDEY 321
G R ED+AF A F +GG+F NYYMYHGGTNF RT+ ++ TSY APLDEY
Sbjct: 245 GKETRSRSAEDIAFHAALFIAKGGSFVNYYMYHGGTNFGRTA-AEYVPTSYYDQAPLDEY 303
Query: 322 GLIRQPKWGHLKDLHKAIKLCEAALVATDPTYPSLGPNLEATVYKTGSGLCSAFLANIGT 381
GL+RQPK GHLK+LH AIKLC L++ SLG EA ++ S C+AFL N
Sbjct: 304 GLLRQPKHGHLKELHAAIKLCRKPLLSRKWINFSLGQLQEAFAFERNSDECAAFLVNHDG 363
Query: 382 NSDVTVKFNGNSYLLPAWSVSILPDCKNVVFNTAKINSVTLVPSFSRQSLQVAADSSDAI 441
S+ TV F G+SY LP S+SILP CK V FNTA++++ +R+ D+I
Sbjct: 364 RSNATVHFKGSSYKLPPKSISILPHCKTVAFNTAQVSTQYGTRLATRR------HKFDSI 417
Query: 442 GSGWSYINEPVGISKDDAFTKPGLLEQINTTADQSDYLWYSLSTNIKADEPLLEDGSKTV 501
W E + + LLE +NTT D SDYLWY+ + + + +V
Sbjct: 418 -EQWKEYKEYIPSFDKSSLRANTLLEHMNTTKDSSDYLWYTFRFHQNSSN------AHSV 470
Query: 502 LHVQSLGHALHAFINGKLVGSGYGSSSNAKVTVDFPIALAPGKNTFDLLSLTVGLQNYGA 561
L V SLGH LHAF+NG+ +GS +GS N T+ + L G N LLS+ GL + GA
Sbjct: 471 LTVNSLGHNLHAFVNGEFIGSAHGSHDNKSFTLQRSLPLKRGTNYVSLLSVMTGLPDAGA 530
Query: 562 FYEKTGAGITG-PVQLKGSGNGTNIDLSSQQWTYQTGLKGEELNFPSGSST---QWDSKS 617
+ E+ AG+ +Q + + D ++ W Y+ GL GE + +++ W +
Sbjct: 531 YLERRVAGLRRVTIQRQHELH----DFTTYLWGYKVGLSGENIQLHRNNASVKAYWSRYA 586
Query: 618 TLPKLQPLVWYKTTFDAPAGSEPVAIDFTGMGKGEAWVNGQSIGRYWPTYVSQNGGCTDS 677
+ + PL WYK+ FDAPAG++PVA++ MGKGEAWVNG+SIGRYW +++ +
Sbjct: 587 SSSR--PLTWYKSIFDAPAGNDPVALNLASMGKGEAWVNGRSIGRYWVSFLDSD------ 638
Query: 678 CNYRGAYSSNKCLKNCGKPSQSLYHVPRSWLKSSGNTLVLFEEIGGDPTKISFVTKQLGS 737
G P Q+ H+PRS+LK SGN LV+ EE G+P IS T + +
Sbjct: 639 ----------------GNPYQTWNHIPRSFLKPSGNLLVILEEERGNPLGISLGTMSI-T 681
Query: 738 SLCSHVTDSHPLPVDMWGSDSKI----QRKPG--PVLSLECPNPNQVISSIKFASFGTPL 791
+C HV+ SHP PV W +++I +RK G P + L CP + ISS+ F+SFGTP
Sbjct: 682 KVCGHVSISHPPPVISWQGENQINGTRKRKYGRRPKVQLRCPR-GRKISSVLFSSFGTPS 740
Query: 792 GTCGSFSRGRCSSARSLSVVRQACVGSKSCSIGVSVNTF-GDPCKGVMKSLAVEASC 847
G C +++ G C ++ S + V +AC+G + CSI VS F GDPC G+ KSL V+A C
Sbjct: 741 GDCETYAIGSCHASNSRATVEKACLGKERCSIPVSSKNFKGDPCPGIAKSLLVDAKC 797
>gi|326520505|dbj|BAK07511.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 830
Score = 752 bits (1941), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 410/845 (48%), Positives = 525/845 (62%), Gaps = 58/845 (6%)
Query: 24 GANVTYDHRAVVIGGKRRVLISGSIHYPRSTPEMWPDLIQKSKDGGLDVIETYVFWNLHE 83
G V YD RA+VI G+RR+LISGSIHYPRSTPEMWPDLI+K+K+GGLD IETYVFWN HE
Sbjct: 23 GTEVGYDDRALVIDGERRLLISGSIHYPRSTPEMWPDLIRKAKEGGLDAIETYVFWNGHE 82
Query: 84 PVRNQYNFEGRYDLVKFVKLVAEAGLYAHLRIGPYVCAEWNFGGFPLWLHFIPGIQFRTD 143
P R QYNFEG YD+V+F K V +AG+YA LRIGPY+C EWN+GG P WL I G+QFR
Sbjct: 83 PRRRQYNFEGSYDIVRFFKEVQDAGMYAILRIGPYICGEWNYGGLPAWLRDISGMQFRMH 142
Query: 144 NEPFKAEMQRFTAKIVDMMKQEKLYASQGGPIILSQIENEYGNIDSAY--GAAGKSYIKW 201
N PF+ EM+ FT IVD +K+ K++A QGGPIILSQIENEYGNI + YI W
Sbjct: 143 NHPFEQEMETFTTLIVDKLKEAKMFAGQGGPIILSQIENEYGNIMGKLNNNESASEYIHW 202
Query: 202 AAGMALSLDTGVPWVMCQQ-SDAPDPIINTCNGFYCDQFTPNSNNKPKMWTENWSGWFLS 260
A MA + GVPW+MCQQ D P +INT NGFYC + P + PK+WTENW+GWF +
Sbjct: 203 CAAMANKQNVGVPWIMCQQDDDVPSNVINTWNGFYCHDWFPKRTDIPKIWTENWTGWFKA 262
Query: 261 FGGAVPYRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNFDRTSGGPFISTSYDYDAPLDE 320
+ +R ED+AF+VA FFQ G+ QNYYMYHGGTNF RTSGGP+I+TSYDYDAPLDE
Sbjct: 263 WDKPDFHRSAEDIAFSVAMFFQTRGSLQNYYMYHGGTNFGRTSGGPYITTSYDYDAPLDE 322
Query: 321 YGLIRQPKWGHLKDLHKAIKLCEAALVATDPTYPSLGPNLEATVYK-TGSGLCSAFLANI 379
YG IRQPK+GHLKDLH +K E L+ D ++G N TV K T + F++N
Sbjct: 323 YGNIRQPKYGHLKDLHNVLKSMEKILLHGDYKDTTMG-NTNVTVTKYTLDNSSACFISNK 381
Query: 380 GTNSDVTVKF-NGNSYLLPAWSVSILPDCKNVVFNTAKINSVTLVPSFSRQSLQVAADSS 438
+ +V V NG ++ +PAWSVSILPDCK V +N+AKI + T V R + D
Sbjct: 382 FDDKEVNVTLDNGATHTVPAWSVSILPDCKTVAYNSAKIKTQTSV-MVKRPGAETVTD-- 438
Query: 439 DAIGSGWSYINE---PVGISKDDAFTKPGLLEQINTTADQSDYLWYSLSTNIKADEPLLE 495
G WS++ E P + F K LLEQI T+ DQSDYLWY S K +
Sbjct: 439 ---GLAWSWMPENLQPFMTDEKGNFRKNELLEQIATSGDQSDYLWYRTSFEHKGE----- 490
Query: 496 DGSKTVLHVQSLGHALHAFINGKLVGSGYGSSSNAKVTVDFPIALAPGKNTFDLLSLTVG 555
S LHV + GH L+AF+NGKLVG Y + ++ P+ L GKN LLS T+G
Sbjct: 491 --SNYKLHVNTTGHELYAFVNGKLVGRHYSPNGGFAFQMETPVKLHSGKNYISLLSATIG 548
Query: 556 LQNYGAFYEKTGAGIT-GPVQLKGS-GNGTNIDLSSQQWTYQTGLKGE--ELNF-PSGSS 610
L+NYGA +E AGI GPV+L + N T DLS+ W+Y+ GL GE E + +
Sbjct: 549 LKNYGALFEMMPAGIVGGPVKLVDTVTNTTAYDLSNSSWSYKAGLAGEYRETHLDKANDR 608
Query: 611 TQWDS--KSTLPKLQPLVWYKTTFDAPAGSEPVAIDFTGMGKGEAWVNGQSIGRYWPTYV 668
+QW T+P +P WYK TF+APAG EPV D G+GKG WVNG ++GRYWP+YV
Sbjct: 609 SQWSGGLNGTIPVHRPFTWYKATFEAPAGEEPVVADLLGLGKGVVWVNGNNLGRYWPSYV 668
Query: 669 SQNGGCTDSCNYRGAYSS----NKCLKNCGKPSQSLYHVPRSWLKSSG-NTLVLFEEIGG 723
+ + C+YRG + + KCL C +PSQ YHVPRS++K+ NT+VLFEE GG
Sbjct: 669 AADMDGCQRCDYRGTFKAEGDGQKCLTGCNEPSQRFYHVPRSFIKAGEPNTMVLFEEAGG 728
Query: 724 DPTKISFVTKQLGSSLCSHVTDSHPLPVDMWGSDSKIQRKPGPVLSLECPNPNQVISSIK 783
DPT++SF T +G++ ++L C + + ISS+
Sbjct: 729 DPTRVSFHTVAVGAACAEAAEVGDE-------------------VALACSH-GRTISSVD 768
Query: 784 FASFGTPLGTCGSFSRGRCSSARSLSVVRQACVGSKSCSIGVSVN-TFGDPCKGVMKSLA 842
AS G G CG++ +G C S +L+ ACVG +SC++ + + G C + L
Sbjct: 769 VASLGVARGKCGAY-QGGCESKAALAAFTAACVGKESCTVRHTEDFRAGSGCDSGV--LT 825
Query: 843 VEASC 847
V+A+C
Sbjct: 826 VQATC 830
>gi|357130214|ref|XP_003566745.1| PREDICTED: beta-galactosidase 13-like [Brachypodium distachyon]
Length = 829
Score = 752 bits (1941), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 399/867 (46%), Positives = 537/867 (61%), Gaps = 63/867 (7%)
Query: 1 MASKEILLLVLCWGFVVLATTSFGA----NVTYDHRAVVIGGKRRVLISGSIHYPRSTPE 56
MA + L++L L T + GA V Y+ RA+VI G+RR+++SGSIHYPRSTPE
Sbjct: 6 MARASLALVLL------LITAAVGAANCTTVAYNDRALVIDGQRRIVLSGSIHYPRSTPE 59
Query: 57 MWPDLIQKSKDGGLDVIETYVFWNLHEPVRNQYNFEGRYDLVKFVKLVAEAGLYAHLRIG 116
MWPDLI+K+K+GGLD IETYVFWN HEP QYNF G YD+V+F K + AG+YA LRIG
Sbjct: 60 MWPDLIKKAKEGGLDAIETYVFWNGHEPRPRQYNFAGNYDIVRFFKEIQNAGMYAILRIG 119
Query: 117 PYVCAEWNFGGFPLWLHFIPGIQFRTDNEPFKAEMQRFTAKIVDMMKQEKLYASQGGPII 176
PY+C EWN+GG P WL IPG+QFR N+PF+ EM+ FT IV+ +K ++A QGGPII
Sbjct: 120 PYICGEWNYGGLPAWLRDIPGMQFRMHNQPFEHEMETFTTLIVNKLKDANMFAGQGGPII 179
Query: 177 LSQIENEYGNIDSAYGAA--GKSYIKWAAGMALSLDTGVPWVMCQQ-SDAPDPIINTCNG 233
LSQIENEYGNI + A YI W A MA + GVPW+MCQQ +D P +INTCNG
Sbjct: 180 LSQIENEYGNIMANLTDAQSASEYIHWCAAMANKQNVGVPWIMCQQDADVPPNVINTCNG 239
Query: 234 FYCDQFTPNSNNKPKMWTENWSGWFLSFGGAVPYRPVEDLAFAVARFFQRGGTFQNYYMY 293
FYC + P + PK+WTENW+GWF ++ +R +D+AFAVA FFQ+ G+ QNYYMY
Sbjct: 240 FYCHDWFPKRTDIPKIWTENWTGWFKAWDKPDFHRSAQDIAFAVAMFFQKRGSLQNYYMY 299
Query: 294 HGGTNFDRTSGGPFISTSYDYDAPLDEYGLIRQPKWGHLKDLHKAIKLCEAALVATDPTY 353
HGGTNF RT+GGP+I+TSYDYDAPLDEYG IR+PK+GHLKDLH +K E LV D +
Sbjct: 300 HGGTNFGRTAGGPYITTSYDYDAPLDEYGNIREPKYGHLKDLHAVLKSMEKILVHGDFSD 359
Query: 354 PSLGPNLEATVYK-TGSGLCSAFLANIGTNSDVTVKFNGNSYLLPAWSVSILPDCKNVVF 412
+ G N+ T Y GS +C F++N + D +G ++++PAWSVS+LPDCK V +
Sbjct: 360 INYGRNVTVTKYTLDGSSVC--FISNQFDDRDANATIDGTTHVVPAWSVSVLPDCKAVAY 417
Query: 413 NTAKINSVTLVPSFSRQSLQVAADSSDAIGSGWSYINE---PVGISKDDAFTKPGLLEQI 469
NTAKI + T V +++ ++ WS++ E P + +F K LLEQI
Sbjct: 418 NTAKIKAQTSVMVKKPNTVEQEPENLK-----WSWMPEHLKPFMTDEKGSFRKNELLEQI 472
Query: 470 NTTADQSDYLWYSLSTNIKADEPLLEDGSKTVLHVQSLGHALHAFINGKLVGSGYGSSSN 529
T+ DQSDYLWY S K + +K L V + GH ++AF+NGKL G + +
Sbjct: 473 TTSTDQSDYLWYRTSFEHKGE-------AKYKLSVNTTGHQIYAFVNGKLAGRQHSPNGA 525
Query: 530 AKVTVDFPIALAPGKNTFDLLSLTVGLQNYGAFYEKTGAGIT-GPVQLKGSGNGTNIDLS 588
++ P+ L GKN LLS T+GL+NYGA +E AGI GPV+L NG+ IDLS
Sbjct: 526 FIFQLESPVKLHDGKNYLSLLSATMGLKNYGALFELMPAGIVGGPVKLV-DNNGSTIDLS 584
Query: 589 SQQWTYQTGLKGE--ELNFPSGSSTQWDSKSTLPKLQPLVWYKTTFDAPAGSEPVAIDFT 646
+ W+Y+ GL GE +++ T+P + WYK TF APAG E V D
Sbjct: 585 NSSWSYKAGLAGEHRQIHLDKPGYKWHGDNGTIPINRAFTWYKATFQAPAGEEAVVADLM 644
Query: 647 GMGKGEAWVNGQSIGRYWPTYVSQNGGCTDSCNYRGAYSSN----KCLKNCGKPSQSLYH 702
G+ KG AWVNG ++GRYWP+YV+ G C+YRGA+ + KCL C +P+Q YH
Sbjct: 645 GLNKGVAWVNGNNLGRYWPSYVAAEMGGCHHCDYRGAFKAEGDGLKCLTGCNEPAQRFYH 704
Query: 703 VPRSWLKSSG-NTLVLFEEIGGDPTKISFVTKQLGSSLCSHVTDSHPLPVDMWGSDSKIQ 761
VPR +L++ NT+VLFEE GGDP+++ F T +G P+ V+
Sbjct: 705 VPRVFLRAGEPNTVVLFEEAGGDPSRVGFHTVAVG-----------PVCVEA-------- 745
Query: 762 RKPGPVLSLEC-PNPNQVISSIKFASFGTPLGTCGSFSRGRCSSARSLSVVRQACVGSKS 820
+ G ++L C + + ISS+ AS+G G CG++ +G C S + +ACVG +S
Sbjct: 746 AEKGDNVTLSCGQHKGRTISSVDLASYGVTRGQCGAY-QGGCESKAAYEAFAEACVGKES 804
Query: 821 CSIGVSVNTFGDPCKGVMKSLAVEASC 847
C++ + G C+ + L V+A+C
Sbjct: 805 CTVQHTDAFSGAGCQSGV--LTVQATC 829
>gi|414878435|tpg|DAA55566.1| TPA: hypothetical protein ZEAMMB73_938277 [Zea mays]
Length = 774
Score = 751 bits (1938), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 389/752 (51%), Positives = 485/752 (64%), Gaps = 38/752 (5%)
Query: 127 GFPLWLHFIPGIQFRTDNEPFKAEMQRFTAKIVDMMKQEKLYASQGGPIILSQIENEYGN 186
GFP+WL +PGI+FRTDNEP+KAEMQ F KIVD+MK+EKLY+ QGGPIIL QIENEYGN
Sbjct: 19 GFPVWLRDVPGIEFRTDNEPYKAEMQIFVTKIVDIMKEEKLYSWQGGPIILQQIENEYGN 78
Query: 187 IDSAYGAAGKSYIKWAAGMALSLDTGVPWVMCQQSDAPDPIINTCNGFYCDQFTPNSNNK 246
I YG AGK Y+ WAA MAL+LDTGVPWVMC+Q+DAP+ I+NTCN FYCD F PNS NK
Sbjct: 79 IQGHYGQAGKRYMLWAAQMALALDTGVPWVMCRQTDAPEQILNTCNAFYCDGFKPNSYNK 138
Query: 247 PKMWTENWSGWFLSFGGAVPYRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNFDRTSGGP 306
P +WTE+W GW+ +G ++P+RP +D AFAVARF+QRGG+ QNYYMY GGTNF+RT+GGP
Sbjct: 139 PTIWTEDWDGWYADWGESLPHRPAQDSAFAVARFYQRGGSLQNYYMYFGGTNFERTAGGP 198
Query: 307 FISTSYDYDAPLDEYGLIRQPKWGHLKDLHKAIKLCEAALVATD--PTYPSLGPNLEATV 364
TSYDYDAP+DEYG++RQPKWGHLKDLH AIKLCE+AL A D P Y LGP EA V
Sbjct: 199 LQITSYDYDAPIDEYGILRQPKWGHLKDLHAAIKLCESALTAVDGSPHYVKLGPMQEAHV 258
Query: 365 YKT-----------GSGLCSAFLANIGTNSDVTVKFNGNSYLLPAWSVSILPDCKNVVFN 413
Y + S CSAFLANI + +V G SY LP WSVSILPDC+ V FN
Sbjct: 259 YSSENVHTNGSISGNSQFCSAFLANIDEHKYASVWIFGKSYSLPPWSVSILPDCETVAFN 318
Query: 414 TAKINSVTLV-------PSFS--RQSLQVAADSSDAIGSGWSYINEPVGISKDDAFTKPG 464
TA++ + T PS+S + ++ + + W EPVGI + FT G
Sbjct: 319 TARVGTQTSFFNVESGSPSYSSRHKPRILSLIGVPYLSTTWWTFKEPVGIWGEGIFTAQG 378
Query: 465 LLEQINTTADQSDYLWYSLSTNIKADEPLL--EDGSKTVLHVQSLGHALHAFINGKLVGS 522
+LE +N T D SDYL Y+ NI ++ L G L + + F+NGKL GS
Sbjct: 379 ILEHLNVTKDISDYLSYTTRVNISEEDVLYWNSKGFLPSLTIDQIRDVARVFVNGKLAGS 438
Query: 523 GYGSSSNAKVTVDFPIALAPGKNTFDLLSLTVGLQNYGAFYEKTGAGITGPVQLKGSGNG 582
G V+++ P+ L G N LLS VGLQNYGAF EK GAG G V+L G NG
Sbjct: 439 KVGH----WVSLNQPLQLVQGLNELTLLSEIVGLQNYGAFLEKDGAGFRGQVKLTGLSNG 494
Query: 583 TNIDLSSQQWTYQTGLKGEELNFPS---GSSTQWDSKSTLPKLQPLVWYKTTFDAPAGSE 639
+IDL++ WTYQ GLKGE S S +W S + P W+KT FDAP G+
Sbjct: 495 -DIDLTNSLWTYQIGLKGEFSRIYSPEYQGSAEWSSMQNDDTVSPFTWFKTMFDAPEGNG 553
Query: 640 PVAIDFTGMGKGEAWVNGQSIGRYWPTYVSQNGGCTDSCNYRGAYSSNKCLKNCGKPSQS 699
PV ID MGKG+AWVNG IGRYW + V+ GC SCNY G YS +KC NCG +QS
Sbjct: 554 PVTIDLGSMGKGQAWVNGHLIGRYW-SLVAPESGCPSSCNYAGTYSDSKCRSNCGIATQS 612
Query: 700 LYHVPRSWLKSSGNTLVLFEEIGGDPTKISFVTKQLGSSLCSHVTDSHPLPVDMWGSDSK 759
YH+PR WL+ SGN LVLFEE GGDP++IS ++CS +++++ P+ W +
Sbjct: 613 WYHIPREWLQESGNLLVLFEETGGDPSQISLEV-HYTKTICSKISETYYPPLSAWSRAAN 671
Query: 760 IQ---RKPGPVLSLECPNPNQVISSIKFASFGTPLGTCGSFSRGRCSSARSLSVVRQACV 816
+ P L L+C + VIS I FAS+GTP G C +FS G C ++ +L +V +AC
Sbjct: 672 GRPSVNTVAPELRLQC-DDGHVISKITFASYGTPTGGCQNFSVGNCHASTTLDLVVEACE 730
Query: 817 GSKSCSIGVSVNTFGDPCKGVMKSLAVEASCT 848
G C+I V+ FGDPC+ V+K LAVEA C+
Sbjct: 731 GKNRCAISVTNEVFGDPCRKVVKDLAVEAECS 762
>gi|225428017|ref|XP_002278545.1| PREDICTED: beta-galactosidase 13 [Vitis vinifera]
gi|297744615|emb|CBI37877.3| unnamed protein product [Vitis vinifera]
Length = 833
Score = 747 bits (1929), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 386/832 (46%), Positives = 509/832 (61%), Gaps = 45/832 (5%)
Query: 27 VTYDHRAVVIGGKRRVLISGSIHYPRSTPEMWPDLIQKSKDGGLDVIETYVFWNLHEPVR 86
VTYD R++++ G+R +L SGSIHYPRSTPEMWPD++QK+K GGL++I+TYVFWN+HEPV
Sbjct: 32 VTYDGRSLIVNGRRELLFSGSIHYPRSTPEMWPDILQKAKHGGLNLIQTYVFWNIHEPVE 91
Query: 87 NQYNFEGRYDLVKFVKLVAEAGLYAHLRIGPYVCAEWNFGGFPLWLHFIPGIQFRTDNEP 146
Q+NFEG YDLVKF+KL+ + GLYA LRIGP++ AEWN GGFP WL +P I FR+ NEP
Sbjct: 92 GQFNFEGNYDLVKFIKLIGDYGLYATLRIGPFIEAEWNHGGFPYWLREVPDIIFRSYNEP 151
Query: 147 FKAEMQRFTAKIVDMMKQEKLYASQGGPIILSQIENEYGNIDSAYGAAGKSYIKWAAGMA 206
FK M++++ I++MMK+ KL+A QGGPIIL+QIENEY +I AY G Y++WA MA
Sbjct: 152 FKYHMEKYSRMIIEMMKEAKLFAPQGGPIILAQIENEYNSIQLAYRELGVQYVQWAGKMA 211
Query: 207 LSLDTGVPWVMCQQSDAPDPIINTCNGFYC-DQFT-PNSNNKPKMWTENWSGWFLSFGGA 264
+ L GVPW+MC+Q DAPDP+INTCNG +C D FT PN NKP +WTENW+ + FG
Sbjct: 212 VGLGAGVPWIMCKQKDAPDPVINTCNGRHCGDTFTGPNRPNKPSLWTENWTAQYRVFGDP 271
Query: 265 VPYRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNFDRTSGGPFISTSYDYDAPLDEYGLI 324
R EDLAF+VARF + GT NYYMYHGGTNF RT G F++T Y +APLDEYGL
Sbjct: 272 PSQRAAEDLAFSVARFISKNGTLANYYMYHGGTNFGRT-GSSFVTTRYYDEAPLDEYGLQ 330
Query: 325 RQPKWGHLKDLHKAIKLCEAALVATDPTYPSLGPNLEATVY-KTGSGLCSAFLANIGTNS 383
R+PKWGHLKDLH A++LC+ AL P LG + E Y K G+ +C+AFL N +
Sbjct: 331 REPKWGHLKDLHSALRLCKKALFTGSPGVEKLGKDKEVRFYEKPGTHICAAFLTNNHSRE 390
Query: 384 DVTVKFNGNSYLLPAWSVSILPDCKNVVFNTAKINSVTLVPSFSRQSLQVAADSSDAIGS 443
T+ F G Y LP S+SILPDCK VV+NT ++ + +F + ++A +
Sbjct: 391 AATLTFRGEEYFLPPHSISILPDCKTVVYNTQRVVAQHNARNFVKS--KIANKNLK---- 444
Query: 444 GWSYINEPVGISKDDAFTKPGLLEQINTTADQSDYLWYSLSTNIKADEPLLEDGSKTVLH 503
W EP+ + D +E N D+SDY W+ S + + ++ VL
Sbjct: 445 -WEMSQEPIPVMTDMKILTKSPMELYNFLKDRSDYAWFVTSIELSNYDLPMKKDIIPVLQ 503
Query: 504 VQSLGHALHAFINGKLVGSGYGSSSNAKVTVDFPIALAPGKNTFDLLSLTVGLQNYGAFY 563
+ +LGHA+ AF+NG +GS +GS+ P+ G N LL +TVGL N GA+
Sbjct: 504 ISNLGHAMLAFVNGNFIGSAHGSNVEKNFVFRKPVKFKAGTNYIALLCMTVGLPNSGAYM 563
Query: 564 EKTGAGITGPVQLKGSGNGTNIDLSSQQWTYQTGLKGEELN-FPSGSSTQWDSKSTLPKL 622
E AGI VQ+ G GT +D+++ W Q G+ GE + + G S + + K
Sbjct: 564 EHRYAGIHS-VQILGLNTGT-LDITNNGWGQQVGVNGEHVKAYTQGGSHRVQWTAAKGKG 621
Query: 623 QPLVWYKTTFDAPAGSEPVAIDFTGMGKGEAWVNGQSIGRYWPTYVSQNGGCTDSCNYRG 682
+ WYKT FD P G++PV + T M KG AWVNG++IGRYW +Y+S
Sbjct: 622 PAMTWYKTYFDMPEGNDPVILRMTSMAKGMAWVNGKNIGRYWLSYLSP------------ 669
Query: 683 AYSSNKCLKNCGKPSQSLYHVPRSWLKSSGNTLVLFEEIGGDPTKISFVTKQLGSSLCSH 742
KPSQS YHVPR+WLK S N LV+FEE GG+P +I V ++CS
Sbjct: 670 ----------LEKPSQSEYHVPRAWLKPSDNLLVIFEETGGNPEEIE-VELVNRDTICSI 718
Query: 743 VTDSHPLPVDMWGS-DSKIQR---KPGPVLSLECPNPNQVISSIKFASFGTPLGTCGSFS 798
VT+ HP V W DSKI+ + P L+CPN +VI + FASFG PLG CG F
Sbjct: 719 VTEYHPPHVKSWQRHDSKIRAVVDEVKPKGHLKCPN-YKVIVKVDFASFGNPLGACGDFE 777
Query: 799 RGRCSSARSLSVVRQACVGSKSCSIGVSVNTF---GDPCKGVMKSLAVEASC 847
G C++ S VV Q C+G +C I + F C + K+LAV+ C
Sbjct: 778 MGNCTAPNSKKVVEQHCMGKTTCEIPMEAGIFDGNSGACSDITKTLAVQVRC 829
>gi|147843477|emb|CAN82062.1| hypothetical protein VITISV_016430 [Vitis vinifera]
Length = 773
Score = 744 bits (1920), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 401/835 (48%), Positives = 506/835 (60%), Gaps = 101/835 (12%)
Query: 27 VTYDHRAVVIGGKRRVLISGSIHYPRSTPEMWPDLIQKSKDGGLDVIETYVFWNLHEPVR 86
+T D R ++I G+R++LISGS+HYPRSTPEMWPDLIQKSKDGGL+ I+TYVFW+LHEP R
Sbjct: 26 ITSDARGIMINGERKILISGSVHYPRSTPEMWPDLIQKSKDGGLNTIDTYVFWDLHEPQR 85
Query: 87 NQYNFEGRYDLVKFVKLVAEAGLYAHLRIGPYVCAEWNFGGFPLWLHFIPGIQFRTDNEP 146
QY+F G DLV+F+K + GLYA LRIGPYVCAEW +GGFP+WLH P IQ RT+N
Sbjct: 86 RQYDFTGNKDLVRFIKAIQAQGLYAVLRIGPYVCAEWTYGGFPVWLHNQPSIQLRTNNTV 145
Query: 147 FKAEMQRFTAKIVDMMKQEKLYASQGGPIILSQIENEYGNIDSAYGAAGKSYIKWAAGMA 206
+ IENEYGN+ AY AG YI W A MA
Sbjct: 146 Y-------------------------------MIENEYGNVMRAYHDAGVQYINWCAQMA 174
Query: 207 LSLDTGVPWVMCQQSDAPDPIINTCNGFYCDQFTPNSNNKPKMWTENWSGWFLSFGGAVP 266
+LDTGVPW+MCQQ +AP P+INTCNG+YCDQFTPN+ N PKMWTENWSGW+ ++GG+ P
Sbjct: 175 AALDTGVPWIMCQQDNAPQPMINTCNGYYCDQFTPNNPNSPKMWTENWSGWYKNWGGSDP 234
Query: 267 YRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNFDRTSGGPFISTSYDYDAPLDEYGLIRQ 326
+R EDLAF+VARF+Q GGTFQNYYMYHGGTNF RT+GGP+I+TSYDYDAPL+EYG Q
Sbjct: 235 HRTAEDLAFSVARFYQLGGTFQNYYMYHGGTNFGRTAGGPYITTSYDYDAPLNEYGNKNQ 294
Query: 327 PKWGHLKDLHKAIKLCEAALVATDPTYPSLGPNLEATVYKTGSGLCSAFLANIGTNSDVT 386
PKWGHL+DLH + E AL D AT+Y G S F N + DVT
Sbjct: 295 PKWGHLRDLHLLLLSMEKALTYGDVKNVDYETLTSATIYSY-QGKSSCFFGNSNADRDVT 353
Query: 387 VKFNGNSYLLPAWSVSILPDCKNVVFNTAKINSVTLVPSFSRQSLQVAADSSDAIGSGWS 446
+ + G +Y +PAWSVSILPDC N V+NTAK+NS +F ++ + + + W+
Sbjct: 354 INYGGVNYTIPAWSVSILPDCSNEVYNTAKVNS--QYSTFVKKGSEAENEPNSL---QWT 408
Query: 447 YINEPVGISKDDAFTKPGLLEQINTTADQSDYLWYSLSTNIKADEPLLEDGSKTVLHVQS 506
+ E + + PG S +I D+P+ G L V +
Sbjct: 409 WRGETI------QYITPG-------------------SVDISNDDPIW--GKDLTLSVNT 441
Query: 507 LGHALHAFINGKLVGSGYGSSSNAKVTVDFPIALAPGKNTFDLLSLTVGLQNYGAFYEKT 566
GH LHAF+NG+ +G Y + I L GKN LLS+TVGL NYG ++
Sbjct: 442 SGHILHAFVNGEHIGYQYALLGQFEFQFRRSITLQLGKNEITLLSVTVGLTNYGPDFDMV 501
Query: 567 GAGITGPVQLKGSGNGTNI--DLSSQ-QWTYQTGLKGEELNFPSGSS--TQWDSKSTLPK 621
GI GPVQ+ S +I DLS+ QW Y+ GL GE+ G + QW S + LP
Sbjct: 502 NQGIHGPVQIIASNGSADIIKDLSNNNQWAYKAGLNGEDKKIFLGRARYNQWKSDN-LPV 560
Query: 622 LQPLVWYKTTFDAPAGSEPVAIDFTGMGKGEAWVNGQSIGRYWPTYVSQNGGCTDSCNYR 681
+ VWYK TFDAP G +PV +D G+GKGEAWVNG S+GRYWP+Y+++ GC+ C+YR
Sbjct: 561 NRSFVWYKATFDAPPGEDPVVVDLMGLGKGEAWVNGHSLGRYWPSYIARGEGCSPECDYR 620
Query: 682 GAYSSNKCLKNCGKPSQSLYHVPRSWLKSSGNTLVLFEEIGGDPTKISFVTKQLGSSLCS 741
G Y + KC NCG PSQ YHVPRS+L S+ N LVLFEE G+P+ ++F T +G++ C+
Sbjct: 621 GPYKAEKCNTNCGNPSQRWYHVPRSFLASTDNRLVLFEEFXGNPSSVTFQTVTVGNA-CA 679
Query: 742 HVTDSHPLPVDMWGSDSKIQRKPGPVLSLECPNPNQVISSIKFASFGTPLGTCGS----- 796
+ + G L L C + IS IKFASFG P GTCG
Sbjct: 680 NA-------------------REGYTLELSC--QGRAISXIKFASFGDPQGTCGKPFATG 718
Query: 797 ---FSRGRCSSARSLSVVRQACVGSKSCSIGVSVNTFGDP-CKGVMKSLAVEASC 847
F +G C +A SLS++++ CVG SCSI VS G C K LAVEA C
Sbjct: 719 SQVFEKGTCEAADSLSIIQKLCVGKYSCSIDVSEQILGPAGCTADTKRLAVEAIC 773
>gi|224082320|ref|XP_002306647.1| predicted protein [Populus trichocarpa]
gi|222856096|gb|EEE93643.1| predicted protein [Populus trichocarpa]
Length = 764
Score = 742 bits (1915), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 391/826 (47%), Positives = 525/826 (63%), Gaps = 65/826 (7%)
Query: 26 NVTYDHRAVVIGGKRRVLISGSIHYPRSTPEMWPDLIQKSKDGGLDVIETYVFWNLHEPV 85
NVTYD R+++I G+ ++L SGSIHYPRSTP+MW LI K+K GG+DVI+TYVFWNLHEP
Sbjct: 1 NVTYDGRSLIINGQHKILFSGSIHYPRSTPDMWSSLISKAKAGGIDVIQTYVFWNLHEPQ 60
Query: 86 RNQYNFEGRYDLVKFVKLVAEAGLYAHLRIGPYVCAEWNFGGFPLWLHFIPGIQFRTDNE 145
+ Q+ F GR DLV+FVK + GLYA LRIGP++ +EW +GG P WLH IPG+ +R+DN+
Sbjct: 61 QGQFYFNGRADLVRFVKEIQAQGLYACLRIGPFIESEWTYGGLPFWLHDIPGMVYRSDNQ 120
Query: 146 PFKAEMQRFTAKIVDMMKQEKLYASQGGPIILSQIENEYGNIDSAYGAAGKSYIKWAAGM 205
PFK M+RF ++IV MMK EKLYASQGGPIILSQ+ENEY N+++A+ G SY++WAA M
Sbjct: 121 PFKYHMKRFVSRIVSMMKSEKLYASQGGPIILSQVENEYKNVEAAFHEKGPSYVRWAALM 180
Query: 206 ALSLDTGVPWVMCQQSDAPDPIINTCNGFYC-DQFT-PNSNNKPKMWTENWSGWFLSFGG 263
A++L TGVPWVMC+Q DAPDP+IN+CNG C + F PNS NKP +WTE+W+ ++ +G
Sbjct: 181 AVNLQTGVPWVMCKQDDAPDPVINSCNGMRCGETFAGPNSPNKPSIWTEDWTSFYQVYGE 240
Query: 264 AVPYRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNFDRTSGGPFISTSYDYDAPLDEYGL 323
R +D+AF VA F + G++ NYYMYHGGTNF RT+ I++ YD APLDEYGL
Sbjct: 241 ETYMRSAQDIAFHVALFIAKTGSYVNYYMYHGGTNFGRTASAFTITSYYD-QAPLDEYGL 299
Query: 324 IRQPKWGHLKDLHKAIKLCEAALVATDPTYPSLGPNLEATVYKTGSGLCSAFLANIGTNS 383
IRQPKWGHLK+LH AIK C L+ SLGP +A V++ SG C+AFL N
Sbjct: 300 IRQPKWGHLKELHAAIKSCSKLLLHGAHKTFSLGPLQQAYVFQGNSGQCAAFLVNNDGKQ 359
Query: 384 DVTVKFNGNSYLLPAWSVSILPDCKNVVFNTAKINSVTLVPSFSRQSLQVAADSSDAIGS 443
+V V F NSY LP S+SILPDCK + FNTAK+N+ ++ +S++ +++G
Sbjct: 360 EVEVLFQSNSYKLPQKSISILPDCKTMTFNTAKVNA-----QYTTRSMK-PNQKFNSVGK 413
Query: 444 GWSYINEPVGISKDDAFTKPGLLEQINTTADQSDYLWYSLSTNIKADEPLLEDGSKTVLH 503
W NEP+ + LLE ++TT D SDYLWY+ + + P +++V +
Sbjct: 414 -WEEYNEPIPEFDKTSLRANRLLEHMSTTKDTSDYLWYTF--RFQQNLP----NAQSVFN 466
Query: 504 VQSLGHALHAFINGKLVGSGYGSSSNAKVTVDFPIALAPGKNTFDLLSLTVGLQNYGAFY 563
QS GH LHA++NG G G+GS N ++ + L G N+ LLS TVGL + GA+
Sbjct: 467 AQSHGHVLHAYVNGVHAGFGHGSHQNTSFSLQTTVRLKNGTNSVALLSATVGLPDSGAYL 526
Query: 564 EKTGAGITGPVQLKGSGNGTNIDLSSQQWTYQTGLKGEELNFPSGSSTQWDSKSTLPKLQ 623
E+ AG+ V+++ N D ++ W YQ GL GE L + + + + L +
Sbjct: 527 ERRVAGLR-RVRIQ------NKDFTTYTWGYQVGLLGERLQIYTENGSNKVKWNKLGTNR 579
Query: 624 PLVWYKTTFDAPAGSEPVAIDFTGMGKGEAWVNGQSIGRYWPTYVSQNGGCTDSCNYRGA 683
PL+WYKT FDAPAG++PVA++ MGKGEAWVNGQSIGRYW ++ + G
Sbjct: 580 PLMWYKTLFDAPAGNDPVALNLGSMGKGEAWVNGQSIGRYWVSFHTSQGS---------- 629
Query: 684 YSSNKCLKNCGKPSQSLYHVPRSWLKSSGNTLVLFEEIGGDPTKISFVTKQLGSSLCSHV 743
PSQ+ Y++PR++LK +GN LVL EE G P I+ T + + +C +
Sbjct: 630 ------------PSQTWYNIPRAFLKPTGNLLVLLEEEKGYPPGITVDTVSV-TKVCGYA 676
Query: 744 TDSHPLPVDMWGSDSKIQRKPGPVLSLECPNPNQVISSIKFASFGTPLGTCGSFSRGRCS 803
++SH V L CP + ISSI FASFGTP G C S++ G C
Sbjct: 677 SESHLSAVQ-----------------LSCP-LKRNISSIIFASFGTPSGNCESYAIGNCH 718
Query: 804 SARSLSVVRQACVGSKSCSIGVSVNTF-GDPCKGVMKSLAVEASCT 848
S+ S + V +AC+G +SCSI S + F GDPC G+ K L VEA CT
Sbjct: 719 SSSSKANVEKACIGKRSCSIPQSNHFFGGDPCPGIPKVLLVEAKCT 764
>gi|330689960|gb|AEC33272.1| beta-galactosidase [Ziziphus jujuba]
Length = 730
Score = 741 bits (1912), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 380/729 (52%), Positives = 475/729 (65%), Gaps = 21/729 (2%)
Query: 126 GGFPLWLHFIPGIQFRTDNEPFKAEMQRFTAKIVDMMKQEKLYASQGGPIILSQIENEYG 185
GGFP+WL ++PGI FRTDN PFK MQ FT KIV M+K E L+ASQGGPIILSQIENEYG
Sbjct: 1 GGFPVWLKYVPGISFRTDNGPFKTAMQGFTQKIVQMLKSENLFASQGGPIILSQIENEYG 60
Query: 186 NIDSAYGAAGKSYIKWAAGMALSLDTGVPWVMCQQSDAPDPIINTCNGFYCDQFTPNSNN 245
A GAAG+SYI WAA MA+ L+TGVPWVMC++ DAPDP+IN CNGFYCD F+PN
Sbjct: 61 PESKALGAAGRSYINWAAKMAVGLNTGVPWVMCKEDDAPDPVINACNGFYCDGFSPNKPY 120
Query: 246 KPKMWTENWSGWFLSFGGAVPYRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNFDRTSGG 305
KP +WTE WSGWF FGG V RPV+DLAFAVARF Q+GG++ NYYMYHGGTNF RT+GG
Sbjct: 121 KPILWTEAWSGWFTEFGGTVHQRPVQDLAFAVARFIQKGGSYFNYYMYHGGTNFGRTAGG 180
Query: 306 PFISTSYDYDAPLDEYGLIRQPKWGHLKDLHKAIKLCEAALVATDPTYPSLGPNLEATVY 365
PF++TSYDYDAP+DEYGL R+PK+ HLK+LHKAIKL E ALV+ PT SLG +A +Y
Sbjct: 181 PFVTTSYDYDAPIDEYGLTREPKYSHLKELHKAIKLSEDALVSAGPTITSLGTYEQAYIY 240
Query: 366 KTGSGLCSAFLANIGTNSDVTVKFNGNSYLLPAWSVSILPDCKNVVFNTAKINSVTLVPS 425
+G C+AFLAN + S V FN Y LP WS+SILPDC+NV +NTA +
Sbjct: 241 NSGPRKCAAFLANYNSKSAARVLFNNRHYNLPPWSISILPDCRNVAYNTALVGV------ 294
Query: 426 FSRQSLQVAADSSDAIGSGWSYINEPVGISKDDA-FTKPGLLEQINTTADQSDYLWYSLS 484
Q+ V + W +E + + A T GLLEQIN T D SDYLWY S
Sbjct: 295 ---QTSHVHMLPTGTSLLSWETYDEVISSLDERARMTAVGLLEQINVTRDTSDYLWYMTS 351
Query: 485 TNIKADEPLLEDGSKTVLHVQSLGHALHAFINGKLVGSGYGSSSNAKVTVDFPIALAPGK 544
+I + E L G K L+VQS GHA+ FING+ GS +G+ + + T P+ L G
Sbjct: 352 VDISSSESFLRGGQKPTLNVQSAGHAVRVFINGQFSGSAFGTREHRQFTFTGPVNLRAGS 411
Query: 545 NTFDLLSLTVGLQNYGAFYEKTGAGITGPVQLKGSGNGTNIDLSSQQWTYQTGLKGEELN 604
N LLS+ VGL N G YE G+ GPV L G NG DL+ Q+W+YQ GLKGE +N
Sbjct: 412 NKISLLSIAVGLPNVGFHYELWETGVLGPVFLNGLDNGKR-DLTWQKWSYQVGLKGEAMN 470
Query: 605 F--PSG-SSTQWDSKSTLPK-LQPLVWYKTTFDAPAGSEPVAIDFTGMGKGEAWVNGQSI 660
P G SS W S + +QPL WYK F+AP G+EP+A+D MGKG+ +NGQSI
Sbjct: 471 LVTPEGASSADWVRGSLAARSVQPLTWYKAYFNAPNGNEPLALDLRSMGKGQVRINGQSI 530
Query: 661 GRYWPTYVSQNGGCTDSCNYRGAYSSNKCLKNCGKPSQSLYHVPRSWLKSSGNTLVLFEE 720
GRYW Y G C ++C+Y G P+Q YHVPRSWLK N LV+FEE
Sbjct: 531 GRYWTAYA--KGDC-EACSYTGHSGRQNVNLVVASPTQRWYHVPRSWLKPKQNLLVIFEE 587
Query: 721 IGGDPTKISFVTKQLGSSLCSHVTDSHPLPVDMWGSDSKIQRKPGPVLSLECPNPNQVIS 780
+GGD +KI+ + + L +++C++ ++HP S + ++L+C P Q IS
Sbjct: 588 LGGDASKIALLRRSL-TNVCANAFENHPSMAKYSTSSQDGSKVKEATVNLQC-GPGQSIS 645
Query: 781 SIKFASFGTPLGTCGSFSRGRCSSARSLSVVRQACVGSKSCSIGVSVNTFG-DPCKGVMK 839
+I+FASFGTP GTCGSF G C + S S++ + CVG KSCS+ +S + FG DPC V+K
Sbjct: 646 AIEFASFGTPSGTCGSFHIGTCHAPNSRSIIEKKCVGQKSCSVTISNSIFGADPCPNVLK 705
Query: 840 SLAVEASCT 848
L VEA C+
Sbjct: 706 RLTVEAVCS 714
>gi|22328945|ref|NP_194344.2| beta-galactosidase 12 [Arabidopsis thaliana]
gi|20466292|gb|AAM20463.1| putative beta-galactosidase [Arabidopsis thaliana]
gi|23198118|gb|AAN15586.1| putative beta-galactosidase [Arabidopsis thaliana]
gi|332659763|gb|AEE85163.1| beta-galactosidase 12 [Arabidopsis thaliana]
Length = 636
Score = 738 bits (1904), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 371/632 (58%), Positives = 448/632 (70%), Gaps = 29/632 (4%)
Query: 6 ILLLVLCWGFVVLATTSFGANVTYDHRAVVIGGKRRVLISGSIHYPRSTPEMWPDLIQKS 65
ILL +LC ++ S A VTYD +AV+I G+RR+L+SGSIHYPRSTPEMWPDLIQK+
Sbjct: 11 ILLGILCCSSLI---CSVKAIVTYDRKAVIINGQRRILLSGSIHYPRSTPEMWPDLIQKA 67
Query: 66 KDGGLDVIETYVFWNLHEPVRNQYNFEGRYDLVKFVKLVAEAGLYAHLRIGPYVCAEWNF 125
KDGGLDVI+TYVFWN HEP QY FE RYDLVKF+K+V +AGLY HLRIGPYVCAEWNF
Sbjct: 68 KDGGLDVIQTYVFWNGHEPSPGQYYFEDRYDLVKFIKVVQQAGLYVHLRIGPYVCAEWNF 127
Query: 126 GGFPLWLHFIPGIQFRTDNEPFKAEMQRFTAKIVDMMKQEKLYASQGGPIILSQIENEYG 185
GGFP+WL ++PG+ FRTDNEPFKA MQ+FT KIV MMK+EKL+ +QGGPIILSQIENEYG
Sbjct: 128 GGFPVWLKYVPGMVFRTDNEPFKAAMQKFTEKIVRMMKEEKLFETQGGPIILSQIENEYG 187
Query: 186 NIDSAYGAAGKSYIKWAAGMALSLDTGVPWVMCQQSDAPDPIINTCNGFYCDQFTPNSNN 245
I+ GA GK+Y KW A MA L TGVPW+MC+Q DAP+ IINTCNGFYC+ F PNS+N
Sbjct: 188 PIEWEIGAPGKAYTKWVAEMAQGLSTGVPWIMCKQDDAPNSIINTCNGFYCENFKPNSDN 247
Query: 246 KPKMWTENWSGWFLSFGGAVPYRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNFDRTSGG 305
KPKMWTENW+GWF FGGAVPYRP ED+A +VARF Q GG+F NYYMYHGGTNFDRT+ G
Sbjct: 248 KPKMWTENWTGWFTEFGGAVPYRPAEDIALSVARFIQNGGSFINYYMYHGGTNFDRTA-G 306
Query: 306 PFISTSYDYDAPLDEYGLIRQPKWGHLKDLHKAIKLCEAALVATDPTYPSLGPNLEATVY 365
FI+TSYDYDAPLDEYGL R+PK+ HLK LHK IKLCE ALV+ DPT SLG EA V+
Sbjct: 307 EFIATSYDYDAPLDEYGLPREPKYSHLKRLHKVIKLCEPALVSADPTVTSLGDKQEAHVF 366
Query: 366 KTGSGLCSAFLANIGTNSDVTVKFNGNSYLLPAWSVSILPDCKNVVFNTAKINS----VT 421
K+ S C+AFL+N T+S V F G++Y LP WSVSILPDCK +NTAK+ + +
Sbjct: 367 KSKSS-CAAFLSNYNTSSAARVLFGGSTYDLPPWSVSILPDCKTEYYNTAKVRTSSIHMK 425
Query: 422 LVPSFSRQSLQVAADSSDAIGSGWSYINEPVGISKDD-AFTKPGLLEQINTTADQSDYLW 480
+VP+ + S W NE + + D+ F++ GL+EQI+ T D++DY W
Sbjct: 426 MVPTNTPFS--------------WGSYNEEIPSANDNGTFSQDGLVEQISITRDKTDYFW 471
Query: 481 YSLSTNIKADEPLLEDGSKTVLHVQSLGHALHAFINGKLVGSGYGSSSNAKVTVDFPIAL 540
Y I DE L G +L + S GHALH F+NG+L G+ YGS K+T I L
Sbjct: 472 YLTDITISPDEKFL-TGEDPLLTIGSAGHALHVFVNGQLAGTAYGSLEKPKLTFSQKIKL 530
Query: 541 APGKNTFDLLSLTVGLQNYGAFYEKTGAGITGPVQLKGSGNGTNIDLSSQQWTYQTGLKG 600
G N LLS GL N G YE G+ GPV L G +GT D++ +W+Y+ G KG
Sbjct: 531 HAGVNKLALLSTAAGLPNVGVHYETWNTGVLGPVTLNGVNSGT-WDMTKWKWSYKIGTKG 589
Query: 601 EELNFPS--GSST-QWDSKSTLPKLQPLVWYK 629
E L+ + GSST +W S + K QPL WYK
Sbjct: 590 EALSVHTLAGSSTVEWKEGSLVAKKQPLTWYK 621
>gi|224080622|ref|XP_002306183.1| predicted protein [Populus trichocarpa]
gi|222849147|gb|EEE86694.1| predicted protein [Populus trichocarpa]
Length = 838
Score = 737 bits (1903), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 390/848 (45%), Positives = 515/848 (60%), Gaps = 57/848 (6%)
Query: 17 VLATTSFGANVTYDHRAVVIGGKRRVLISGSIHYPRSTPEMWPDLIQKSKDGGLDVIETY 76
V+A VTYD R+++I GKR +L SGSIHYPRSTPEMWP+LIQK+K GGL+VI+TY
Sbjct: 21 VIAHGDKKKGVTYDGRSLIINGKRELLFSGSIHYPRSTPEMWPELIQKAKRGGLNVIQTY 80
Query: 77 VFWNLHEPVRNQYNFEGRYDLVKFVKLVAEAGLYAHLRIGPYVCAEWNFGGFPLWLHFIP 136
VFWN+HEP + ++NFEG YDLVKF+K + E G+ A +R+GP++ AEWN GG P WL IP
Sbjct: 81 VFWNIHEPEQGKFNFEGSYDLVKFIKTIGENGMSATIRLGPFIQAEWNHGGLPYWLREIP 140
Query: 137 GIQFRTDNEPFKAEMQRFTAKIVDMMKQEKLYASQGGPIILSQIENEYGNIDSAYGAAGK 196
I FR+DN PFK M+RF I++ +K+EKL+ASQGGPIIL+QIENEY + AY G
Sbjct: 141 DIIFRSDNAPFKLHMERFVTMIINKLKEEKLFASQGGPIILAQIENEYNTVQLAYRNLGV 200
Query: 197 SYIKWAAGMALSLDTGVPWVMCQQSDAPDPIINTCNGFYC-DQFT-PNSNNKPKMWTENW 254
SY++WA MAL L TGVPWVMC+Q DAP P+INTCNG +C D FT PNS +KP +WTENW
Sbjct: 201 SYVQWAGNMALGLKTGVPWVMCKQKDAPGPVINTCNGRHCGDTFTGPNSPDKPSLWTENW 260
Query: 255 SGWFLSFGGAVPYRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNFDRTSGGPFISTSYDY 314
+ F FG R ED AF+VAR+F + G+ NYYMYHGGTNFDRT+ F++T Y
Sbjct: 261 TAQFRVFGDPPSQRSAEDTAFSVARWFSKNGSLVNYYMYHGGTNFDRTAAS-FVTTRYYD 319
Query: 315 DAPLDEYGLIRQPKWGHLKDLHKAIKLCEAALVATDPTYPSLGPNLEATVYKT-GSGLCS 373
+APLDEYGL R+PKWGHLKDLH+A+ LC+ AL+ P L ++EA ++ + C+
Sbjct: 320 EAPLDEYGLQREPKWGHLKDLHRALNLCKKALLWGTPNVQRLSADVEARFFEQPRTNDCA 379
Query: 374 AFLANIGTNSDVTVKFNGNSYLLPAWSVSILPDCKNVVFNTAKINSVTLVPSFSRQSLQV 433
AFLAN T TV F G Y LPA S+SILPDCK VV+NT +T+V + ++
Sbjct: 380 AFLANNNTKDPETVTFRGKKYYLPAKSISILPDCKTVVYNT-----MTVVSQHNSRNFVK 434
Query: 434 AADSSDAIGSGWSYINE--PVGISKDDAFTKPGLLEQINTTADQSDYLWYSLSTNIKADE 491
+ + + W +E P + D + E N T D++DY W++ + N+ ++
Sbjct: 435 SRKTDGKL--EWKMFSETIPSNLLVDSRIPR----ELYNLTKDKTDYAWFTTTINVDRND 488
Query: 492 PLLEDGSKTVLHVQSLGHALHAFINGKLVGSGYGSSSNAKVTVDFPIALAPGKNTFDLLS 551
VL V SLGHA+ AFING+ +GS +GS + + L PG N LL
Sbjct: 489 LSARKDINPVLRVASLGHAMVAFINGEFIGSAHGSQIEKSFVLQHSVKLKPGINFVTLLG 548
Query: 552 LTVGLQNYGAFYEKTGAGITGPVQLKGSGNGTNIDLSSQQWTYQTGLKGEELNFPSGSST 611
VGL + GA+ E AG G V + G GT +DLSS W +Q L GE +
Sbjct: 549 SLVGLPDSGAYMEHRYAGPRG-VSILGLNTGT-LDLSSNGWGHQVALSGETAKV---FTK 603
Query: 612 QWDSKSTLPKLQ----PLVWYKTTFDAPAGSEPVAIDFTGMGKGEAWVNGQSIGRYWPTY 667
+ K T K+ P+ WYKT FDAP G PVA+ TGM KG W+NG+SIGRYW Y
Sbjct: 604 EGGRKVTWTKVNKDGPPVTWYKTRFDAPEGKSPVAVRMTGMKKGMIWINGKSIGRYWMNY 663
Query: 668 VSQNGGCTDSCNYRGAYSSNKCLKNCGKPSQSLYHVPRSWLKSSGNTLVLFEEIGGDPTK 727
+S G+P+QS YH+PRS+LK + N +V+ EE G P K
Sbjct: 664 ISP----------------------LGEPTQSEYHIPRSYLKPTNNLMVILEEEGASPEK 701
Query: 728 ISFVTKQLGSSLCSHVTDSHPLPVDMWGSDSK----IQRKPGPVLSLECPNPNQVISSIK 783
I +T ++CS+VT+ HP V W +K + P L+CPN +++ +++
Sbjct: 702 IEILTVNR-DTICSYVTEYHPPNVRSWERKNKKFTPVADDAKPAARLKCPNKKKIV-AVQ 759
Query: 784 FASFGTPLGTCGSFSRGRCSSARSLSVVRQACVGSKSCSIGVSVNTFG---DPCKGVMKS 840
FASFG P GTCG+F+ G C S S VV Q C+G SC I + F D C + K+
Sbjct: 760 FASFGDPSGTCGNFAVGTCDSPISKQVVEQHCLGKTSCDIPMDKGLFNGKKDNCPNLTKN 819
Query: 841 LAVEASCT 848
LAV+ C+
Sbjct: 820 LAVQVKCS 827
>gi|449433325|ref|XP_004134448.1| PREDICTED: LOW QUALITY PROTEIN: beta-galactosidase 7-like [Cucumis
sativus]
Length = 803
Score = 736 bits (1899), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 387/848 (45%), Positives = 516/848 (60%), Gaps = 65/848 (7%)
Query: 16 VVLATTSFGANVTYDHRAVVIGGKRRVLISGSIHYPRSTPEMWPDLIQKSKDGGLDVIET 75
V L +++ +VTYD R++ I G+R+++ISG+IHYPRS+P MWP L++K+K+GGL+ IET
Sbjct: 5 VALFSSAKKISVTYDGRSLKINGERKIIISGAIHYPRSSPGMWPMLMKKAKNGGLNAIET 64
Query: 76 YVFWNLHEPVRNQYNFEGRYDLVKFVKLVAEAGLYAHLRIGPYVCAEWNFGGFPLWLHFI 135
YVFWN HEP R QY+F G DLV+F+K V + LYA LRIGPYVCAEWN+GGFP+WLH +
Sbjct: 65 YVFWNAHEPQRGQYDFSGNNDLVQFIKAVQKERLYAILRIGPYVCAEWNYGGFPVWLHNL 124
Query: 136 PGIQFRTDNEPFKAEMQRFTAKIVDMMKQEKLYASQGGPIILSQIENEYGNIDSAYGAAG 195
PGI+FRT+N+ +K F ++ K ++ + + IENE+GN++ +YG G
Sbjct: 125 PGIKFRTNNQVYKVTFXFFFL-TKNLKKINNMF-------LKNXIENEFGNVEGSYGQEG 176
Query: 196 KSYIKWAAGMALSLDTGVPWVMCQQSDAPDPIINTCNGFYCDQFTPNSNNKPKMWTENWS 255
K Y+KW A +A S + PW+MCQQ DAP PI+ CN CDQF PN+ N PKMWTE+W+
Sbjct: 177 KEYVKWCAELAQSYNLSEPWIMCQQGDAPQPIV--CN---CDQFKPNNKNSPKMWTESWA 231
Query: 256 GWFLSFGGAVPYRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNFDRTSGGPFISTSYDYD 315
GWF +G PYR EDLAFAVARFFQ GG+ NYYMYHGGTNF R++GGP+I+TSYDY+
Sbjct: 232 GWFKGWGERDPYRTAEDLAFAVARFFQYGGSLHNYYMYHGGTNFGRSAGGPYITTSYDYN 291
Query: 316 APLDEYGLIRQPKWGHLKDLHKAIKLCEAALVATDPTYPSLGPNLEATVYKTGSGLCSAF 375
APLDEYG + QPKWGHLK LH+ I+ E L D + G + AT Y T G S F
Sbjct: 292 APLDEYGNMNQPKWGHLKQLHELIRSMEKVLTYGDVKHIDTGHSTTATSY-TYKGKSSCF 350
Query: 376 LANIGTNSDVTVKFNGNSYLLPAWSVSILPDCKNVVFNTAKINSVT----LVPSFSRQSL 431
N NSD + F Y +P WSV++LPDCK V+NTAK+N+ T +VPS +
Sbjct: 351 FGN-PENSDREITFQERKYTVPGWSVTVLPDCKTEVYNTAKVNTQTTIREMVPSLVGKHK 409
Query: 432 QVAADSSDAIGSGWSYINEPV------GISKDDAFTKPGLLEQINTTADQSDYLWYSLST 485
+ W + NE + G A T L++Q T D SDYLWY
Sbjct: 410 KPLK---------WQWRNEKIEHLTHEGDISGSAITANSLIDQKMVTNDSSDYLWYLTGF 460
Query: 486 NIKADEPLLEDGSKTVLHVQSLGHALHAFINGKLVGSGYGSSSNAKVTVDFPIA-LAPGK 544
++ ++PL G + L V++ GH LHAF+N K +G+ +G T++ + L G
Sbjct: 461 HLNGNDPLF--GKRVTLRVKTRGHILHAFVNNKHIGTQFGPYGKYSFTLEKKVRNLRHGF 518
Query: 545 NTFDLLSLTVGLQNYGAFYEKTGAGITGPVQLKGSGNGTNIDLSSQQWTYQTGLKGEELN 604
N LLS TVGL NYGA+YE GI GPV+L G T DLS+ +W Y+ GL GE+
Sbjct: 519 NQIALLSATVGLPNYGAYYENVEVGIYGPVELIADGK-TIRDLSTNEWIYKVGLDGEKYE 577
Query: 605 F--PSGSSTQWDSKSTLPKLQPLVWYKTTFDAPAGSEPVAIDFTGMGKGEAWVNGQSIGR 662
F P + + LP Q WYKT+F P G E V +D GMGKG+AWVNG+SIGR
Sbjct: 578 FFDPDHKFRKPWLSNNLPLNQNFTWYKTSFSTPKGREGVVVDLMGMGKGQAWVNGKSIGR 637
Query: 663 YWPTYVSQNGGCTDSCNYRGAYSSNKCLKNCGKPSQSLYHVPRSWLKS-SGNTLVLFEEI 721
YWP+Y++ GC+ SC+YRGAY +KC NCGKP+Q YH+PRS++ NTL+LFEE
Sbjct: 638 YWPSYLATENGCSSSCDYRGAYYGSKCATNCGKPTQRWYHIPRSYMNDGKENTLILFEEF 697
Query: 722 GGDPTKISFVTKQLGSSLCSHVTDSHPLPVDMWGSDSKIQRKPGPVLSLECPNPNQVISS 781
GG P I T ++ +C+ VD+ G L L C ++ +
Sbjct: 698 GGMPLNIEIKTTRV-KKVCA--------KVDL-----------GSKLELTC--HDRTVKR 735
Query: 782 IKFASFGTPLGTCGSFSRGRCSSARSLSVVRQACVGSKSCSIGVSVNTFG-DPCKGVMKS 840
I F FG P G C +F +G C S+ + SV+ + C+ + CSI V+ + G CK +
Sbjct: 736 IIFVGFGNPKGNCNNFHKGSCHSSEAFSVIEKECLWKRKCSIEVTKDKLGLTGCKNPKDN 795
Query: 841 -LAVEASC 847
LAV+ SC
Sbjct: 796 WLAVQVSC 803
>gi|183238712|gb|ACC60982.1| beta-galactosidase 2 precursor [Petunia x hybrida]
Length = 830
Score = 734 bits (1895), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 385/839 (45%), Positives = 516/839 (61%), Gaps = 58/839 (6%)
Query: 27 VTYDHRAVVIGGKRRVLISGSIHYPRSTPEMWPDLIQKSKDGGLDVIETYVFWNLHEPVR 86
VTYD R++++ G+R +L SGSIHYPR PEMWP++I+K+K+GGL+VI+TYVFWN+HEPV+
Sbjct: 28 VTYDGRSMIVNGERELLFSGSIHYPRMPPEMWPEIIRKAKEGGLNVIQTYVFWNIHEPVQ 87
Query: 87 NQYNFEGRYDLVKFVKLVAEAGLYAHLRIGPYVCAEWNFGGFPLWLHFIPGIQFRTDNEP 146
Q+NFEG YDLVKF+K + E GLY LRIGPY+ AEWN GGFP WL +P I FR+ NEP
Sbjct: 88 GQFNFEGNYDLVKFIKAIGEQGLYVTLRIGPYIEAEWNQGGFPYWLREVPNITFRSYNEP 147
Query: 147 FKAEMQRFTAKIVDMMKQEKLYASQGGPIILSQIENEYGNIDSAYGAAGKSYIKWAAGMA 206
F M++++ ++D++K+EKL+A QGGPII++QIENEY N+ AY GK YI+WAA MA
Sbjct: 148 FIHHMKKYSEMVIDLVKKEKLFAPQGGPIIMAQIENEYNNVQLAYRDNGKKYIEWAANMA 207
Query: 207 LSLDTGVPWVMCQQSDAPDPIINTCNGFYC-DQFT-PNSNNKPKMWTENWSGWFLSFGGA 264
SL GVPW+MC+Q DAP +INTCNG +C D FT PN NKP +WTENW+ + +FG
Sbjct: 208 TSLYNGVPWIMCKQKDAPPQVINTCNGRHCADTFTGPNGPNKPSLWTENWTAQYRTFGDP 267
Query: 265 VPYRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNFDRTSGGPFISTSYDYDAPLDEYGLI 324
R ED+AF+VARFF + GT NYYMY+GGTN+ RTS F++T Y +APLDE+GL
Sbjct: 268 PSQRAAEDIAFSVARFFAKNGTLTNYYMYYGGTNYGRTSSS-FVTTRYYDEAPLDEFGLY 326
Query: 325 RQPKWGHLKDLHKAIKLCEAALVATDPTYPSLGPNLEATVY-KTGSGLCSAFLANIGTNS 383
R+PKW HL+DLH+A++L AL+ PT + +LE TV+ K GS C+AFL N T
Sbjct: 327 REPKWSHLRDLHRALRLSRRALLWGTPTVQKINQDLEITVFEKPGSTDCAAFLTNNHTTQ 386
Query: 384 DVTVKFNGNSYLLPAWSVSILPDCKNVVFNTAKI----NSVTLVPSFSRQSLQVAADSSD 439
T+KF G Y LP SVSILPDCK VV+NT I NS + S ++L+
Sbjct: 387 PSTIKFRGKDYYLPEKSVSILPDCKTVVYNTQTIVSQHNSRNFITSEKSKNLK------- 439
Query: 440 AIGSGWSYINEPVGISKDDAFTKPGLLEQINTTADQSDYLWYSLSTNIKADEPLLEDGSK 499
W E V D LE + T D SDY WYS S ++ + +
Sbjct: 440 -----WEMYQEKVPTIADLPLKNREPLELYSLTKDTSDYAWYSTSITLERHDLPMRPDIL 494
Query: 500 TVLHVQSLGHALHAFINGKLVGSGYGSSSNAKVTVDFPIALAPGKNTFDLLSLTVGLQNY 559
VL + S+GHAL AF+NG+ VG G+G++ PI L PG NT +L+ TVG N
Sbjct: 495 PVLQIASMGHALAAFVNGEYVGFGHGNNIEKSFVFQKPIILKPGTNTITILAETVGFPNS 554
Query: 560 GAFYEKTGAGITGPVQLKGSGNGTNIDLSSQQWTYQTGLKGE--ELNFPSGS-STQWDSK 616
GA+ EK AG G V ++G GT +D++ W ++ G+ GE EL G+ QW
Sbjct: 555 GAYMEKRFAGPRG-VTIQGLMAGT-LDITQNNWGHEVGVFGEKQELFTEEGAKKVQWTPV 612
Query: 617 STLPKLQPLVWYKTTFDAPAGSEPVAIDFTGMGKGEAWVNGQSIGRYWPTYVSQNGGCTD 676
+ PK + WYKT FDAP G+ PVA+ M KG WVNG+S+GRYW +++S
Sbjct: 613 TGPPK-GAVTWYKTYFDAPEGNNPVALKMDKMEKGMMWVNGKSLGRYWTSFLSP------ 665
Query: 677 SCNYRGAYSSNKCLKNCGKPSQSLYHVPRSWLKSSGNTLVLFEEIGGDPTKISFVTKQLG 736
G+P+Q+ YH+PR++LK + N LV+FEE GG PT I T
Sbjct: 666 ----------------LGQPTQAEYHIPRAYLKPTNNLLVIFEETGGHPTNIEVQTVNR- 708
Query: 737 SSLCSHVTDSHPLPVDMW---GSD-SKIQRKPGPVLSLECPNPNQVISSIKFASFGTPLG 792
++CS +T+ HP V W G+D + L CP+ N++I ++FAS+G P G
Sbjct: 709 DTICSIITEYHPPHVKSWERSGTDFVAVVEDLKSGAHLTCPD-NKIIEKVEFASYGNPDG 767
Query: 793 TCGSFSRGRCSSARSLSVVRQACVGSKSCSIGVSVNTFG----DPCKGVMKSLAVEASC 847
CG+ G C+SA SL VV Q C+G +C+I + + DPC + K+LAV+ C
Sbjct: 768 ACGNLFNGNCNSANSLKVVEQHCLGKNTCTIPIEREIYDEPSKDPCPNIFKTLAVQVKC 826
>gi|357449773|ref|XP_003595163.1| Beta-galactosidase [Medicago truncatula]
gi|355484211|gb|AES65414.1| Beta-galactosidase [Medicago truncatula]
Length = 607
Score = 734 bits (1894), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 356/593 (60%), Positives = 430/593 (72%), Gaps = 15/593 (2%)
Query: 4 KEILLLVLCWGFVVLATTSFGANVTYDHRAVVIGGKRRVLISGSIHYPRSTPEMWPDLIQ 63
+ +L LC+ FV T A+VTYDH+A+VI GKRR+LISGSIHYPRSTP+MWPDLIQ
Sbjct: 10 RNCYILFLCF-FVCYVT----ASVTYDHKAIVINGKRRILISGSIHYPRSTPQMWPDLIQ 64
Query: 64 KSKDGGLDVIETYVFWNLHEPVRNQYNFEGRYDLVKFVKLVAEAGLYAHLRIGPYVCAEW 123
K+KDGG+DVIETYVFWN HEP + +Y FE R+DLVKF+K+V +AGLY HLRIGPYVCAEW
Sbjct: 65 KAKDGGVDVIETYVFWNGHEPSQGKYYFEDRFDLVKFIKVVQQAGLYVHLRIGPYVCAEW 124
Query: 124 NFGGFPLWLHFIPGIQFRTDNEPFKAEMQRFTAKIVDMMKQEKLYASQGGPIILSQIENE 183
NFGGFP+WL ++PG+ FRTDNEPFKA MQ+FT KIV +MK E L+ SQGGPIILSQIENE
Sbjct: 125 NFGGFPVWLKYVPGVAFRTDNEPFKAAMQKFTTKIVSIMKSENLFQSQGGPIILSQIENE 184
Query: 184 YGNIDSAYGAAGKSYIKWAAGMALSLDTGVPWVMCQQSDAPDPIINTCNGFYCDQFTPNS 243
YG ++ GA GKSY KW + MA+ L+TGVPWVMC+Q DAPDPII+TCNG+YC+ F+PN
Sbjct: 185 YGPVEWEIGAPGKSYTKWFSQMAVGLNTGVPWVMCKQEDAPDPIIDTCNGYYCENFSPNK 244
Query: 244 NNKPKMWTENWSGWFLSFGGAVPYRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNFDRTS 303
N KPKMWTENW+GW+ FG AVPYRP EDLAF+VARF Q G++ NYYMYHGGTNF RTS
Sbjct: 245 NYKPKMWTENWTGWYTDFGTAVPYRPAEDLAFSVARFVQNRGSYVNYYMYHGGTNFGRTS 304
Query: 304 GGPFISTSYDYDAPLDEYGLIRQPKWGHLKDLHKAIKLCEAALVATDPTYPSLGPNLEAT 363
G FI+TSYDYDAP+DEYGLI +PKWGHL+DLHKAIK CE+ALV+ DPT G NLE
Sbjct: 305 SGLFIATSYDYDAPIDEYGLISEPKWGHLRDLHKAIKQCESALVSVDPTVSWPGKNLEVH 364
Query: 364 VYKTGSGLCSAFLANIGTNSDVTVKFNGNSYLLPAWSVSILPDCKNVVFNTAKINSVTLV 423
+YKT G C+AFLAN T S V F Y LP WS+SILPDCK VFNTAK+ +
Sbjct: 365 LYKTSFGACAAFLANYDTGSWAKVAFGNGHYDLPPWSISILPDCKTEVFNTAKVRA---- 420
Query: 424 PSFSRQSLQVAADSSDAIGSGWSYINEPVGISKDDAFTKPGLLEQINTTADQSDYLWYSL 483
P R + +++ + SY +P + ++T GLLEQ++ T D+SDYLWY
Sbjct: 421 PRVHR-----SMTPANSAFNWQSYNEQPAFSGESGSWTANGLLEQLSQTWDKSDYLWYMT 475
Query: 484 STNIKADEPLLEDGSKTVLHVQSLGHALHAFINGKLVGSGYGSSSNAKVTVDFPIALAPG 543
NI +E +++G VL S GH LH FING+ G+ YGS N K+T + L G
Sbjct: 476 DVNISPNEGFIKNGQNPVLTAMSAGHVLHVFINGQFWGTAYGSLDNPKLTFSNSVKLRVG 535
Query: 544 KNTFDLLSLTVGLQNYGAFYEKTGAGITGPVQLKGSGNGTNIDLSSQQWTYQT 596
N LLS+ VGL N G YEK G+ GPV LKG GT DLS Q+W+Y+
Sbjct: 536 NNKISLLSVAVGLSNVGVHYEKWNVGVLGPVTLKGLNEGTR-DLSKQKWSYKV 587
>gi|75116245|sp|Q67VU7.1|BGL10_ORYSJ RecName: Full=Putative beta-galactosidase 10; Short=Lactase 10;
Flags: Precursor
gi|51535501|dbj|BAD37397.1| putative beta-galactosidase [Oryza sativa Japonica Group]
gi|51535704|dbj|BAD37722.1| putative beta-galactosidase [Oryza sativa Japonica Group]
Length = 809
Score = 733 bits (1893), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 388/838 (46%), Positives = 506/838 (60%), Gaps = 70/838 (8%)
Query: 24 GANVTYDHRAVVIGGKRRVLISGSIHYPRSTPEMWPDLIQKSKDGGLDVIETYVFWNLHE 83
G VTY+ R++VI G+RR++ISGSIHYPRSTPEMWPDLI+K+K+GGLD IETYVFWN HE
Sbjct: 28 GTTVTYNDRSLVIDGERRIIISGSIHYPRSTPEMWPDLIKKAKEGGLDAIETYVFWNGHE 87
Query: 84 PVRNQYNFEGRYDLVKFVKLVAEAGLYAHLRIGPYVCAEWNFGGFPLWLHFIPGIQFRTD 143
P R QYNF G YD+V+F K + AGLYA LRIGPY+C EWN+GG P WL IPG+QFR
Sbjct: 88 PHRRQYNFVGNYDIVRFFKEIQNAGLYAILRIGPYICGEWNYGGLPAWLRDIPGMQFRLH 147
Query: 144 NEPFKAEMQRFTAKIVDMMKQEKLYASQGGPIILSQIENEYGNIDSAY--GAAGKSYIKW 201
N PF+ EM+ FT IV+ MK ++A QGGPIIL+QIENEYGNI + YI W
Sbjct: 148 NAPFENEMEIFTTLIVNKMKDANMFAGQGGPIILAQIENEYGNIMGQLNNNQSASEYIHW 207
Query: 202 AAGMALSLDTGVPWVMCQQ-SDAPDPIINTCNGFYCDQFTPNSNNKPKMWTENWSGWFLS 260
A MA + GVPW+MCQQ SD P ++NTCNGFYC + PN PK+WTENW+GWF +
Sbjct: 208 CADMANKQNVGVPWIMCQQDSDVPHNVVNTCNGFYCHDWFPNRTGIPKIWTENWTGWFKA 267
Query: 261 FGGAVPYRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNFDRTSGGPFISTSYDYDAPLDE 320
+ +R ED+AFAVA FFQ+ GGP+I+TSYDYDAPLDE
Sbjct: 268 WDKPDFHRSAEDIAFAVAMFFQK-------------------RGGPYITTSYDYDAPLDE 308
Query: 321 YGLIRQPKWGHLKDLHKAIKLCEAALVATDPTYPSLGPNLEATVYKTGSGLCSAFLANIG 380
YG +RQPK+GHLKDLH IK E LV + + + T Y S + F+ N
Sbjct: 309 YGNLRQPKYGHLKDLHSVIKSIEKILVHGEYVDTNYSDKVTVTKYTLDS-TSACFINNRN 367
Query: 381 TNSDVTVKFNGNSYLLPAWSVSILPDCKNVVFNTAKINSVTLVPSFSRQSLQVAADSSDA 440
N DV V +G ++LLPAWSVSILPDCK V FN+AKI + T V + ++ +S
Sbjct: 368 DNMDVNVTLDGTTHLLPAWSVSILPDCKTVAFNSAKIKAQTTVMVNKAKMVEKEPESLK- 426
Query: 441 IGSGWSYINE---PVGISKDDAFTKPGLLEQINTTADQSDYLWYSLSTNIKADEPLLEDG 497
WS++ E P + ++ K LLEQI T+ DQSDYLWY S N K +
Sbjct: 427 ----WSWMRENLTPFMTDEKGSYRKNELLEQIVTSTDQSDYLWYRTSINHKGE------- 475
Query: 498 SKTVLHVQSLGHALHAFINGKLVGSGYGSSSNAKVTVDFPIALAPGKNTFDLLSLTVGLQ 557
+ L V + GH L+AF+NG LVG + + + ++ P L GKN LLS T+GL+
Sbjct: 476 ASYTLFVNTTGHELYAFVNGMLVGQNHSPNGHFVFQLESPAKLHDGKNYISLLSATIGLK 535
Query: 558 NYGAFYEKTGAGIT-GPVQLKGSGNGTNIDLSSQQWTYQTGLKGE--ELNFPSGSSTQWD 614
NYG +EK AGI GPV+L NG IDLS+ W+Y+ GL GE +++ T +
Sbjct: 536 NYGPLFEKMPAGIVGGPVKLI-DNNGKGIDLSNSSWSYKAGLAGEYRQIHLDKPGCTWDN 594
Query: 615 SKSTLPKLQPLVWYKTTFDAPAGSEPVAIDFTGMGKGEAWVNGQSIGRYWPTYVSQNGGC 674
+ T+P +P WYKTTF APAG + V +D G+ KG AWVNG ++GRYWP+Y + G
Sbjct: 595 NNGTVPINKPFTWYKTTFQAPAGEDTVVVDLLGLNKGVAWVNGNNLGRYWPSYTAAEMGG 654
Query: 675 TDSCNYRGAYSS----NKCLKNCGKPSQSLYHVPRSWLKSSG-NTLVLFEEIGGDPTKIS 729
C+YRG + + KCL CG+PSQ YHVPRS+LK+ NT++LFEE GGDP+ +S
Sbjct: 655 CHHCDYRGVFQAEGDGQKCLTGCGEPSQRFYHVPRSFLKNGEPNTVILFEEAGGDPSHVS 714
Query: 730 FVTKQLGSSLCSHVTDSHPLPVDMWGSDSKIQRKPGPVLSLECPNPNQVISSIKFASFGT 789
F T GS S + G ++L C ++ IS+I SFG
Sbjct: 715 FRTVAAGSVCAS--------------------AEVGDTITLSCGQHSKTISAINVTSFGV 754
Query: 790 PLGTCGSFSRGRCSSARSLSVVRQACVGSKSCSIGVSVNTFGDPCKGVMKSLAVEASC 847
G CG++ +G C S + +AC+G +SC++ ++ G C + L V+ASC
Sbjct: 755 ARGQCGAY-KGGCESKAAYKAFTEACLGKESCTVQITNAVTGSGC--LSNVLTVQASC 809
>gi|224135691|ref|XP_002327281.1| predicted protein [Populus trichocarpa]
gi|222835651|gb|EEE74086.1| predicted protein [Populus trichocarpa]
Length = 788
Score = 729 bits (1883), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 402/859 (46%), Positives = 516/859 (60%), Gaps = 88/859 (10%)
Query: 4 KEILLLVLCWGFVVLATTSF-GANVTYDHRAVVIGGKRRVLISGSIHYPRSTPEMWPDLI 62
+ +L LV V+ + ++ G +VTYD R+++I G+R+++ SGSIHYPRSTPEMWP LI
Sbjct: 2 RRVLFLVAAVLAVIGSGSAVRGGDVTYDGRSLIIDGQRKIVFSGSIHYPRSTPEMWPSLI 61
Query: 63 QKSKDGGLDVIETYVFWNLHEPVRNQYNFEGRYDLVKFVKLVAEAGLYAHLRIGPYVCAE 122
K+K+GGLD IETYVFWN+HEP Y+F G +D+V+F+K V GLYA LRIGP++ +E
Sbjct: 62 AKAKEGGLDAIETYVFWNVHEPQPGHYDFSGGHDIVRFIKEVQAQGLYACLRIGPFIQSE 121
Query: 123 WNFGGFPLWLHFIPGIQFRTDNEPFKAEMQRFTAKIVDMMKQEKLYASQGGPIILSQIEN 182
W++GG P WLH IPGI FR+DNEPFK MQ FTAK+V MM+ E LYASQGGPIILSQIEN
Sbjct: 122 WSYGGLPFWLHDIPGIVFRSDNEPFKVYMQNFTAKVVSMMQSENLYASQGGPIILSQIEN 181
Query: 183 EYGNIDSAYGAAGKSYIKWAAGMALSLDTGVPWVMCQQSDAPDPIINTCNGFYCDQ--FT 240
EYG + AYG G +Y++WAA MA L TGVPWVMC+Q++AP +IN+CNG C Q
Sbjct: 182 EYGTVQKAYGQEGLAYVQWAAQMAEGLQTGVPWVMCKQNNAPGHVINSCNGMKCGQTFVG 241
Query: 241 PNSNNKPKMWTENWSGWFLSFGGAVPYRPVEDLAFAVARFF-QRGGTFQNYYMYHGGTNF 299
PNS NKP +WTENW+ + ED+AF V F + G+F NYYMYHGGTNF
Sbjct: 242 PNSPNKPSIWTENWTT-----------QSAEDIAFHVTLFIAAKKGSFVNYYMYHGGTNF 290
Query: 300 DRTSGGPFISTSYDYDAPLDEYGLIRQPKWGHLKDLHKAIKLCEAALVATDPTYPSLGPN 359
RT+ F++TSY APLDEYGL QPKWGHLK+LH AIKLC L++ LGP
Sbjct: 291 GRTASA-FVTTSYYDQAPLDEYGLTTQPKWGHLKELHAAIKLCSTPLLSGVQVNLYLGPQ 349
Query: 360 LEATVYKTGSGLCSAFLANIGTNSDVTVKFNGNSYLLPAWSVSILPDCKNVVFNTAKINS 419
+A ++ SG C+AFL N +++ +V F SY LP S+SILPDCKNV + + +
Sbjct: 350 QQAYIFNAVSGECAAFLINNDSSNAASVPFRNASYDLPPMSISILPDCKNV---STQYTT 406
Query: 420 VTLVPSFSRQSLQVAADSSDAIGSGWSYINEPVGISKDDAFTKPGLLEQINTTADQSDYL 479
T+ R + AAD W E + + LLEQ+NTT D SDYL
Sbjct: 407 RTM----GRGEVLDAADV-------WQEFTEAIPNFDSTSTRSETLLEQMNTTKDSSDYL 455
Query: 480 WYSLSTNIKADEPLLEDGSKTVLHVQSLGHALHAFINGKLVGSGYGSSSNAKVTVDFPIA 539
WY+ ++ + ++ +L V SLGHALHAF+NG+ VGS GS N + + ++
Sbjct: 456 WYTFRFQHESSD------TQAILDVSSLGHALHAFVNGQAVGSVQGSRKNPRFKFETSVS 509
Query: 540 LAPGKNTFDLLSLTVGLQNYGAFYEKTGAGITGPVQLKGSGNGTNIDLSSQQWTYQTGLK 599
L+ G N LLS+ VG+ + GAF E AG+ V ++ + N D ++ W YQ GL+
Sbjct: 510 LSKGINNVSLLSVMVGMPDSGAFLENRAAGLR-TVMIRDKQD--NNDFTNYSWGYQIGLQ 566
Query: 600 GEELNF---PSGSSTQWDSKSTLPKLQPLVWYKTTFDAPAGSEPVAIDFTGMGKGEAWVN 656
GE L S QW S PL WYKT DAP G PV ++ MGKGEAWVN
Sbjct: 567 GETLQIYTEQGSSQVQWKKFSNAGN--PLTWYKTQVDAPPGDVPVGLNLASMGKGEAWVN 624
Query: 657 GQSIGRYWPTYVSQNGGCTDSCNYRGAYSSNKCLKNCGKPSQSLYHVPRSWLKSSGNTLV 716
GQSIGRYWP+ YHVPRS+LK +GN LV
Sbjct: 625 GQSIGRYWPS----------------------------------YHVPRSFLKPTGNLLV 650
Query: 717 LFEEIGGDPTKISFVTKQLGSSLCSHVTDSHPLPVDMWGSDSKIQRKPG------PVLSL 770
L EE GG+P ++S T + S +C HVT SH PV W ++ + P P + L
Sbjct: 651 LQEEEGGNPLQVSLDTVTI-SQVCGHVTASHLAPVSSWIEHNQRYKNPAKVSGRRPKVLL 709
Query: 771 ECPNPNQVISSIKFASFGTPLGTC-GSFSRGRCSSARSLSVVRQACVGSKSCSIGVSVNT 829
CP+ ++ IS I FAS+GTPLG C S + G C S S +VV +AC+G CSI VSV
Sbjct: 710 ACPSKSK-ISRISFASYGTPLGNCRNSMAVGTCHSQNSKAVVEEACLGKMKCSIPVSVRQ 768
Query: 830 F-GDPCKGVMKSLAVEASC 847
F GDPC KSL V A C
Sbjct: 769 FGGDPCPAKAKSLMVVAEC 787
>gi|30699255|ref|NP_177866.2| beta-galactosidase 16 [Arabidopsis thaliana]
gi|152013367|sp|Q8GX69.2|BGL16_ARATH RecName: Full=Beta-galactosidase 16; Short=Lactase 16; Flags:
Precursor
gi|332197854|gb|AEE35975.1| beta-galactosidase 16 [Arabidopsis thaliana]
Length = 815
Score = 728 bits (1878), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 399/856 (46%), Positives = 525/856 (61%), Gaps = 73/856 (8%)
Query: 15 FVVLATTSFG--ANVTYDHRAVVIGGKRRVLISGSIHYPRSTPEMWPDLIQKSKDGGLDV 72
V++A G ANVTYD R+++I G+ ++L SGSIHY RSTP+MWP LI K+K GG+DV
Sbjct: 11 LVLMAVIVAGDVANVTYDGRSLIIDGEHKILFSGSIHYTRSTPQMWPSLIAKAKSGGIDV 70
Query: 73 IETYVFWNLHEPVRNQYNFEGRYDLVKFVKLVAEAGLYAHLRIGPYVCAEWNFGGFPLWL 132
++TYVFWN+HEP + Q++F G D+VKF+K V GLY LRIGP++ EW++GG P WL
Sbjct: 71 VDTYVFWNVHEPQQGQFDFSGSRDIVKFIKEVKNHGLYVCLRIGPFIQGEWSYGGLPFWL 130
Query: 133 HFIPGIQFRTDNEPFKAEMQRFTAKIVDMMKQEKLYASQGGPIILSQIENEYGNIDSAYG 192
H + GI FRTDNEPFK M+R+ IV +MK E LYASQGGPIILSQIENEYG + A+
Sbjct: 131 HNVQGIVFRTDNEPFKYHMKRYAKMIVKLMKSENLYASQGGPIILSQIENEYGMVGRAFR 190
Query: 193 AAGKSYIKWAAGMALSLDTGVPWVMCQQSDAPDPIINTCNGFYCDQFT--PNSNNKPKMW 250
GKSY+KW A +A+ LDTGVPWVMC+Q DAPDP++N CNG C + PNS NKP +W
Sbjct: 191 QEGKSYVKWTAKLAVELDTGVPWVMCKQDDAPDPLVNACNGRQCGETFKGPNSPNKPAIW 250
Query: 251 TENWSGWFLSFGGAVPYRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNFDRTSGGPFIST 310
TENW+ ++ ++G R ED+AF VA F + G+F NYYMYHGGTNF R + F+ T
Sbjct: 251 TENWTSFYQTYGEEPLIRSAEDIAFHVALFIAKNGSFVNYYMYHGGTNFGRNA-SQFVIT 309
Query: 311 SYDYDAPLDEYGLIRQPKWGHLKDLHKAIKLCEAALVATDPTYPSLGPNLEATVYKTGSG 370
SY APLDEYGL+RQPKWGHLK+LH A+KLCE L++ T SLG A V+ +
Sbjct: 310 SYYDQAPLDEYGLLRQPKWGHLKELHAAVKLCEEPLLSGLQTTISLGKLQTAFVFGKKAN 369
Query: 371 LCSAFLANIGTNSDVTVKFNGNSYLLPAWSVSILPDCKNVVFNTAKINS-VTLVPSFSRQ 429
LC+A L N + TV+F +SY L SVS+LPDCKNV FNTAK+N+ +RQ
Sbjct: 370 LCAAILVN-QDKCESTVQFRNSSYRLSPKSVSVLPDCKNVAFNTAKVNAQYNTRTRKARQ 428
Query: 430 SLQVAADSSDAIGSGWSYINEPVGISKDDAFTKPGLLEQINTTADQSDYLWYSLSTNIKA 489
+L SS + W E V + + LLE +NTT D SDYLW +T +
Sbjct: 429 NL-----SSPQM---WEEFTETVPSFSETSIRSESLLEHMNTTQDTSDYLWQ--TTRFQQ 478
Query: 490 DEPLLEDGSKTVLHVQSLGHALHAFINGKLVGSGYGSSSNAKVTVDFPIALAPGKNTFDL 549
E G+ +VL V LGHALHAF+NG+ +GS +G+ + ++ ++L G N L
Sbjct: 479 SE-----GAPSVLKVNHLGHALHAFVNGRFIGSMHGTFKAHRFLLEKNMSLNNGTNNLAL 533
Query: 550 LSLTVGLQNYGAFYEKTGAGITGPVQLKGSGNGTNIDLSSQQWTYQTGLKGEELNFPS-- 607
LS+ VGL N GA E+ + G +K + ++ W YQ GLKGE+ + +
Sbjct: 534 LSVMVGLPNSGAHLERR---VVGSRSVKIWNGRYQLYFNNYSWGYQVGLKGEKFHVYTED 590
Query: 608 -GSSTQW----DSKSTLPKLQPLVWYKTTFDAPAGSEPVAIDFTGMGKGEAWVNGQSIGR 662
+ QW DSKS QPL WYK +FD P G +PVA++ MGKGEAWVNGQSIGR
Sbjct: 591 GSAKVQWKQYRDSKS-----QPLTWYKASFDTPEGEDPVALNLGSMGKGEAWVNGQSIGR 645
Query: 663 YWPTYVSQNGGCTDSCNYRGAYSSNKCLKNCGKPSQSLYHVPRSWLKSSGNTLVLF-EEI 721
YW ++ + G PSQ YH+PRS+LK + N LV+ EE
Sbjct: 646 YWVSFHTYKGN----------------------PSQIWYHIPRSFLKPNSNLLVILEEER 683
Query: 722 GGDPTKISFVTKQLGSSLCSHVTDSHPLPV--------DMWGSDSKIQRKPGPVLSLECP 773
G+P I+ T + + +C HV++++P PV + + RKP + L+CP
Sbjct: 684 EGNPLGITIDTVSV-TEVCGHVSNTNPHPVISPRKKGLNRKNLTYRYDRKPK--VQLQCP 740
Query: 774 NPNQVISSIKFASFGTPLGTCGSFSRGRCSSARSLSVVRQACVGSKSCSIGVSVNTF-GD 832
+ IS I FASFGTP G+CGS+S G C S SL+VV++AC+ CS+ V TF GD
Sbjct: 741 TGRK-ISKILFASFGTPNGSCGSYSIGSCHSPNSLAVVQKACLKKSRCSVPVWSKTFGGD 799
Query: 833 PCKGVMKSLAVEASCT 848
C +KSL V A C+
Sbjct: 800 SCPHTVKSLLVRAQCS 815
>gi|449464182|ref|XP_004149808.1| PREDICTED: beta-galactosidase 16-like [Cucumis sativus]
Length = 801
Score = 728 bits (1878), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 395/836 (47%), Positives = 518/836 (61%), Gaps = 62/836 (7%)
Query: 26 NVTYDHRAVVIGGKRRVLISGSIHYPRSTPEMWPDLIQKSKDGGLDVIETYVFWNLHEPV 85
+ TYD R++++ G+ ++L SGSIHYPRSTP+MWP LI K+K+GG+DVI+TYVFWNLHEP
Sbjct: 15 SATYDGRSLIVNGEHKLLFSGSIHYPRSTPDMWPSLIAKAKEGGIDVIQTYVFWNLHEPQ 74
Query: 86 RNQYNFEGRYDLVKFVKLVAEAGLYAHLRIGPYVCAEWNFGGFPLWLHFIPGIQFRTDNE 145
+ Y F GR D+V+FVK + GLYA LRIGP++ AEW++GG P WLH + GI +R+DNE
Sbjct: 75 QGTYEFSGRRDIVRFVKEIQAQGLYACLRIGPFIEAEWSYGGLPFWLHDVLGIVYRSDNE 134
Query: 146 PFKAEMQRFTAKIVDMMKQEKLYASQGGPIILSQIENEYGNIDSAYGAAGKSYIKWAAGM 205
PFK MQ FT KIV+MMK E LYASQGGPIILSQIENEY +++A+G G Y++WAA M
Sbjct: 135 PFKLHMQNFTTKIVNMMKSEGLYASQGGPIILSQIENEYTLVEAAFGEKGPPYVQWAAKM 194
Query: 206 ALSLDTGVPWVMCQQSDAPDPIINTCNGFYC-DQFT-PNSNNKPKMWTENWSGWFLSFGG 263
A+SL TGVPW MC+Q+DAPDP+INTCNG C + FT PNS NKP +WTENW+ ++ ++G
Sbjct: 195 AVSLQTGVPWSMCKQNDAPDPVINTCNGMRCGETFTGPNSPNKPSIWTENWTSFYQTYGE 254
Query: 264 AVPYRPVEDLAFAVARFF-QRGGTFQNYYMYHGGTNFDRTSGGPFISTSYDYDAPLDEYG 322
R E++AF VA F + GT+ NYYMYHGGTNF R++ I+ YD +PLDEYG
Sbjct: 255 EPYIRSAEEIAFHVALFIAAKNGTYVNYYMYHGGTNFGRSASAFMITGYYD-QSPLDEYG 313
Query: 323 LIRQPKWGHLKDLHKAIKLCEAALVATDPTYPSLGPNLEATVYKTGSGLCSAFLANIGTN 382
L R+PKWGHLK+LH A+KLC L+ + SLG ++EA V+KT S C+AFL N G
Sbjct: 314 LTREPKWGHLKELHAAVKLCSTPLLTGTKSNFSLGQSVEAIVFKTESNECAAFLVNRGA- 372
Query: 383 SDVTVKFNGNSYLLPAWSVSILPDCKNVVFNTAKINSVTLVPSFSRQSLQVAADSSDAIG 442
D V F +Y LP S+SILPDCKNV FNT +++ V +R + A D +
Sbjct: 373 IDSNVLFQNVTYELPLGSISILPDCKNVAFNTRRVS----VQHNTRSMM--AVQKFDLL- 425
Query: 443 SGWSYINEPVGISKDDAFTKPGLLEQINTTADQSDYLWYSLSTNIKADEPLLEDGSKTVL 502
W EP+ D LLE + TT D+SDYLWY+ ++ D P S+ L
Sbjct: 426 -EWEEFKEPIPNIDDTELRANELLEHMGTTKDRSDYLWYTF--RVQQDSP----DSQQTL 478
Query: 503 HVQSLGHALHAFINGKLVGSGYGSSSNAKVTVDFPIALAPGKNTFDLLSLTVGLQNYGAF 562
V S HALHAF+NG GS +G ++ I L G N LLS+ VGL + GAF
Sbjct: 479 EVDSRAHALHAFVNGDYAGSAHGIYKEKGFSLAKNITLRNGINNISLLSVMVGLPDSGAF 538
Query: 563 YEKTGAGITGPVQLKGSGNGTNIDLSSQQWTYQTGLKGE--ELNFPSGSS-TQWDSKSTL 619
E AG+ V ++G D S Q W Y+ GL GE ++ +GSS QW
Sbjct: 539 LETRVAGLRR-VGIQGE------DFSEQHWGYKVGLSGEQSQIFLDTGSSNVQWSRLGN- 590
Query: 620 PKLQPLVWYKTTFDAPAGSEPVAIDFTGMGKGEAWVNGQSIGRYWPTYVSQNGGCTDSCN 679
QPL WYKT FDAP G +P+A++ MGKG WVNG+ IGRYW ++++
Sbjct: 591 -SSQPLTWYKTQFDAPPGDDPIALNLGSMGKGAVWVNGRGIGRYWVSFLTPK-------- 641
Query: 680 YRGAYSSNKCLKNCGKPSQSLYHVPRSWLKSSGNTLVLFEEIGGDPTKISFVTKQLGSSL 739
G+PSQ Y+VPRS+LK + N LV+ EE G+P +IS + L +
Sbjct: 642 --------------GEPSQKWYNVPRSFLKPTDNQLVILEEETGNPVEIS-LDSVLITKT 686
Query: 740 CSHVTDSH-PLPVDMWGSDSKIQRK-----PGPVLSLECPNPNQVISSIKFASFGTPLGT 793
C V++SH PL G+ + R+ P + L CP+ + IS+I FASFGTP G
Sbjct: 687 CGQVSESHYPLVASWMGAKKQKVRRVKNRTRRPKVQLSCPSKKK-ISNILFASFGTPSGD 745
Query: 794 CGSFSRGRCSSARSLSVVRQACVGSKSCSIGVS-VNTFGDPCKGVMKSLAVEASCT 848
C S++ G C S S ++V AC+G CSI +S +N GDPC V K+L V+A CT
Sbjct: 746 CQSYAIGLCHSPNSRAIVEHACLGRAKCSIPISNLNFRGDPCPHVTKTLLVDAQCT 801
>gi|218188392|gb|EEC70819.1| hypothetical protein OsI_02284 [Oryza sativa Indica Group]
Length = 837
Score = 728 bits (1878), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 370/723 (51%), Positives = 476/723 (65%), Gaps = 34/723 (4%)
Query: 27 VTYDHRAVVIGGKRRVLISGSIHYPRSTPEMWPDLIQKSKDGGLDVIETYVFWNLHEPVR 86
V+YD R++VI G+RR+++SGSIHYPRSTPEMWPDLI+K+K+GGLD IETY+FWN HEP R
Sbjct: 31 VSYDDRSLVIDGQRRIILSGSIHYPRSTPEMWPDLIKKAKEGGLDAIETYIFWNGHEPHR 90
Query: 87 NQYNFEGRYDLVKFVKLVAEAGLYAHLRIGPYVCAEWNFGGFPLWLHFIPGIQFRTDNEP 146
QYNFEG YD+V+F K + AG+YA LRIGPY+C EWN+GG P WL IPG+QFR NEP
Sbjct: 91 RQYNFEGNYDVVRFFKEIQNAGMYAILRIGPYICGEWNYGGLPAWLRDIPGMQFRLHNEP 150
Query: 147 FKAEMQRFTAKIVDMMKQEKLYASQGGPIILSQIENEYGNIDSAY--GAAGKSYIKWAAG 204
F+ EM+ FT IV+ MK K++A QGGPIIL+QIENEYGNI + YI W A
Sbjct: 151 FENEMETFTTLIVNKMKDSKMFAEQGGPIILAQIENEYGNIMGKLNNNQSASEYIHWCAD 210
Query: 205 MALSLDTGVPWVMCQQ-SDAPDPIINTCNGFYCDQFTPNSNNKPKMWTENWSGWFLSFGG 263
MA + GVPW+MCQQ D P ++NTCNGFYC + PN PK+WTENW+GWF ++
Sbjct: 211 MANKQNVGVPWIMCQQDDDVPHNVVNTCNGFYCHDWFPNRTGIPKIWTENWTGWFKAWDK 270
Query: 264 AVPYRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNFDRTSGGPFISTSYDYDAPLDEYGL 323
+R ED+AFAVA FFQ+ G+ QNYYMYHGGTNF RTSGGP+I+TSYDYDAPLDEYG
Sbjct: 271 PDFHRSAEDIAFAVAMFFQKRGSLQNYYMYHGGTNFGRTSGGPYITTSYDYDAPLDEYGN 330
Query: 324 IRQPKWGHLKDLHKAIKLCEAALVATDPTYPSLGPNLEATVYKTGSGLCSAFLANIGTNS 383
+RQPK+GHLK+LH +K E LV + + G N+ T Y S + F+ N +
Sbjct: 331 LRQPKYGHLKELHSVLKSMEKTLVHGEYFDTNYGDNITVTKYTLDSS-SACFINNRFDDK 389
Query: 384 DVTVKFNGNSYLLPAWSVSILPDCKNVVFNTAKINSVTLVPSFSRQSLQVAADSSDAIGS 443
DV V +G ++LLPAWSVSILPDCK V FN+AKI + T V + + +S
Sbjct: 390 DVNVTLDGATHLLPAWSVSILPDCKTVAFNSAKIKTQTSVMVKKPNTAEQEQESLK---- 445
Query: 444 GWSYINE---PVGISKDDAFTKPGLLEQINTTADQSDYLWYSLSTNIKADEPLLEDGSKT 500
WS++ E P + F K LLEQI T+ DQSDYLWY S N K +
Sbjct: 446 -WSWMPENLSPFMTDEKGNFRKNELLEQIVTSTDQSDYLWYRTSLNHKGE-------GSY 497
Query: 501 VLHVQSLGHALHAFINGKLVGSGYGSSSNAKVTVDFPIALAPGKNTFDLLSLTVGLQNYG 560
L+V + GH L+AF+NGKL+G + + + ++ P+ L GKN LLS TVGL+NYG
Sbjct: 498 KLYVNTTGHELYAFVNGKLIGKNHSADGDFVFQLESPVKLHDGKNYISLLSATVGLKNYG 557
Query: 561 AFYEKTGAGIT-GPVQLKGSGNGTNIDLSSQQWTYQTGLKGE----ELNFPSGSSTQWD- 614
+EK GI GPV+L S NGT IDLS+ W+Y+ GL E L+ P +W+
Sbjct: 558 PSFEKMPTGIVGGPVKLIDS-NGTAIDLSNSSWSYKAGLASEYRQIHLDKP---GYKWNG 613
Query: 615 SKSTLPKLQPLVWYKTTFDAPAGSEPVAIDFTGMGKGEAWVNGQSIGRYWPTYVSQNGGC 674
+ T+P +P WYK TF+AP+G + V +D G+ KG AWVNG ++GRYWP+Y +
Sbjct: 614 NNGTIPINRPFTWYKATFEAPSGEDAVVVDLLGLNKGVAWVNGNNLGRYWPSYTAAEMAG 673
Query: 675 TDSCNYRGAYSSN----KCLKNCGKPSQSLYHVPRSWLKS-SGNTLVLFEEIGGDPTKIS 729
C+YRGA+ + +CL CG+PSQ YHVPRS+L + NTL+LFEE GGDP+ ++
Sbjct: 674 CHRCDYRGAFQAEGDGTRCLTGCGEPSQRYYHVPRSFLAAGEPNTLLLFEEAGGDPSGVA 733
Query: 730 FVT 732
T
Sbjct: 734 LRT 736
>gi|297842521|ref|XP_002889142.1| hypothetical protein ARALYDRAFT_476906 [Arabidopsis lyrata subsp.
lyrata]
gi|297334983|gb|EFH65401.1| hypothetical protein ARALYDRAFT_476906 [Arabidopsis lyrata subsp.
lyrata]
Length = 818
Score = 725 bits (1872), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 402/857 (46%), Positives = 522/857 (60%), Gaps = 72/857 (8%)
Query: 15 FVVLAT--TSFGANVTYDHRAVVIGGKRRVLISGSIHYPRSTPEMWPDLIQKSKDGGLDV 72
FV++A ANVTYD R+++I G+ ++L SGSIHY RSTP+MWP LI K+K GG+DV
Sbjct: 11 FVLMAVIVARDAANVTYDGRSLIIDGQHKILFSGSIHYTRSTPQMWPSLIAKAKSGGIDV 70
Query: 73 IETYVFWNLHEPVRNQYNFEGRYDLVKFVKLVAEAGLYAHLRIGPYVCAEWNFGGFPLWL 132
I+TYVFWN+HEP + Q++F GR D+VKF+K V GLY LRIGP++ EW++GG P WL
Sbjct: 71 IDTYVFWNIHEPQQGQFDFSGRRDIVKFIKEVKAHGLYVCLRIGPFIQGEWSYGGLPFWL 130
Query: 133 HFIPGIQFRTDNEPFKAEMQRFTAKIVDMMKQEKLYASQGGPIILSQIENEYGNIDSAYG 192
H + GI FRTDNEPFK M+R+ IV +MK E LYASQGGPIILSQIENEYG + A+
Sbjct: 131 HNVQGIVFRTDNEPFKYHMKRYAQMIVKLMKSENLYASQGGPIILSQIENEYGMVARAFR 190
Query: 193 AAGKSYIKWAAGMALSLDTGVPWVMCQQSDAPDPIINTCNGFYCDQFT--PNSNNKPKMW 250
GKSY+KWAA +A+ LDTGVPWVMC+Q DAPDP++N CNG C + PNS NKP +W
Sbjct: 191 QDGKSYVKWAAKLAVELDTGVPWVMCKQDDAPDPLVNACNGRQCGETFKGPNSPNKPAIW 250
Query: 251 TENWSGWFLSFGGAVPYRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNFDRTSGGPFIST 310
TENW+ ++ ++G R ED+AF VA F + G+F NYYMYHGGTNF R + F+ T
Sbjct: 251 TENWTSFYQTYGEEPLIRSAEDIAFHVALFIAKNGSFVNYYMYHGGTNFGRNA-SQFVIT 309
Query: 311 SYDYDAPLDEYGLIRQPKWGHLKDLHKAIKLCEAALVATDPTYPSLGPNLEATVYKTGSG 370
SY APLDEYGL+RQPKWGHLK+LH A+KLCE L++ T SLG A V+ +
Sbjct: 310 SYYDQAPLDEYGLLRQPKWGHLKELHAAVKLCEEPLLSGLQTTISLGKLQTAFVFGKKAN 369
Query: 371 LCSAFLANIGTNSDVTVKFNGNSYLLPAWSVSILPDCKNVVFNTAKINSVTLVPSFS-RQ 429
LC+A L N D TV+F +SY L S+S+LPDCKNV FNTAK+N+ + RQ
Sbjct: 370 LCAALLVN-QDKCDCTVQFRNSSYRLSPKSISVLPDCKNVAFNTAKVNAQYNTRTRKPRQ 428
Query: 430 SLQVAADSSDAIGSGWSYINEPVGISKDDAFTKPGLLEQINTTADQSDYLWYSLSTNIKA 489
+L SS + W E V + + LLE +NTT D SDYLW +T +
Sbjct: 429 NL-----SSPHM---WEKFTETVPSFSETSIRSESLLEHMNTTQDTSDYLWQ--TTRFEQ 478
Query: 490 DEPLLEDGSKTVLHVQSLGHALHAFINGKLVGSGYGSSSNAKVTVDFPIALAPGKNTFDL 549
E G+ +VL V LGH LHAF+N + +GS +G+ ++ ++L G N L
Sbjct: 479 SE-----GAPSVLKVNHLGHVLHAFVNERFIGSMHGTFKAHSFLLEKNMSLNNGTNNMAL 533
Query: 550 LSLTVGLQNYGAFYEKTGAGITGPVQLKGSGNGTNIDLSSQQWTYQTGLKGEELNFPS-- 607
LS+ VGL N GA E+ G GS + ++ W YQ GLKGE+ + +
Sbjct: 534 LSVMVGLPNSGAHLERRVVGSRSVNIWNGS---YQLFFNNYSWGYQVGLKGEKYHVYTED 590
Query: 608 -GSSTQW----DSKSTLPKLQPLVWYKTTFDAPAGSEPVAIDFTGMGKGEAWVNGQSIGR 662
QW DSKS QPL WYK +FD P G +PVA++ MGKGEAWVNGQSIGR
Sbjct: 591 GAKKVQWKQYRDSKS-----QPLTWYKASFDTPEGEDPVALNLGSMGKGEAWVNGQSIGR 645
Query: 663 YWPTYVSQNGGCTDSCNYRGAYSSNKCLKNCGKPSQSLYHVPRSWLKSSGNTLVLF-EEI 721
YW ++ Y+S G PSQ YH+PRS+LK + N LV+ EE
Sbjct: 646 YWVSF----------------YTSK------GNPSQIWYHIPRSFLKPNSNLLVILEEER 683
Query: 722 GGDPTKISFVTKQLGSSLCSHVTDSHPLPV---DMWGSDSKIQRK------PGPVLSLEC 772
G P I+ T + + +C HV+++HP PV G + QR P + L+C
Sbjct: 684 EGYPLGITIDTVSV-TEVCGHVSNTHPHPVISPRKKGHNRNEQRHLKYRYDRKPKVQLQC 742
Query: 773 PNPNQVISSIKFASFGTPLGTCGSFSRGRCSSARSLSVVRQACVGSKSCSIGVSVNTF-G 831
P + IS + FA+FG P G+CGS+S G C S SL+VV++AC+ CS+ V TF G
Sbjct: 743 PTGRK-ISKVLFATFGNPNGSCGSYSVGSCHSPNSLAVVQKACLRKSRCSVPVWSKTFGG 801
Query: 832 DPCKGVMKSLAVEASCT 848
D C +KSL V A C+
Sbjct: 802 DLCPQTVKSLLVRAQCS 818
>gi|75141878|sp|Q7XFK2.1|BGL14_ORYSJ RecName: Full=Beta-galactosidase 14; Short=Lactase 14; Flags:
Precursor
gi|15451595|gb|AAK98719.1|AC090483_9 Putative beta-galactosidase [Oryza sativa Japonica Group]
gi|31431327|gb|AAP53122.1| Beta-galactosidase precursor, putative, expressed [Oryza sativa
Japonica Group]
Length = 808
Score = 724 bits (1870), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 395/840 (47%), Positives = 510/840 (60%), Gaps = 81/840 (9%)
Query: 27 VTYDHRAVVIGGKRRVLISGSIHYPRSTPEMWPDLIQKSKDGGLDVIETYVFWNLHEPVR 86
V+YD R++++ G+RR++ISGSIHYPRSTPEMWPDLI+K+K+GGL+ IETYVFWN HEP R
Sbjct: 31 VSYDGRSLILDGERRIVISGSIHYPRSTPEMWPDLIKKAKEGGLNAIETYVFWNGHEPRR 90
Query: 87 NQYNFEGRYDLVKFVKLVAEAGLYAHLRIGPYVCAEWNFGGFPLWLHFIPGIQFRTDNEP 146
++NFEG YD+V+F K + AG+YA LRIGPY+C EWN+GG P+WL IPGI+FR N+P
Sbjct: 91 REFNFEGNYDVVRFFKEIQNAGMYAILRIGPYICGEWNYGGLPVWLRDIPGIKFRLHNKP 150
Query: 147 FKAEMQRFTAKIVDMMKQEKLYASQGGPIILSQIENEYG-------NIDSAYGAAGKSYI 199
F+ M+ FT IV MK ++A QGGPIIL+QIENEYG NI SA+ YI
Sbjct: 151 FENGMEAFTTLIVKKMKDANMFAGQGGPIILAQIENEYGYTMLQPENIQSAH-----EYI 205
Query: 200 KWAAGMALSLDTGVPWVMCQQ-SDAPDPIINTCNGFYCDQFTPNSNNKPKMWTENWSGWF 258
W A MA + GVPW+MCQQ +D P ++NTCNGFYC ++ N + PKMWTENW+GW+
Sbjct: 206 HWCADMANKQNVGVPWIMCQQDNDVPPNVVNTCNGFYCHEWFSNRTSIPKMWTENWTGWY 265
Query: 259 LSFGGAVPYRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNFDRTSGGPFISTSYDYDAPL 318
+ RP ED+AFAVA FFQ G+ QNYYMYHGGTNF RT+GGP+I+TSYDYDAPL
Sbjct: 266 RDWDQPEFRRPTEDIAFAVAMFFQMRGSLQNYYMYHGGTNFGRTAGGPYITTSYDYDAPL 325
Query: 319 DEYGLIRQPKWGHLKDLHKAIKLCEAALVATDPTYPSLGPNLEATVYKTGSGLCSAFLAN 378
DEYG +RQPK+GHLK+LH + E L+ D + G N+ T Y T + + F+ N
Sbjct: 326 DEYGNLRQPKYGHLKELHSVLMSMEKILLHGDYIDTNYGDNVTVTKY-TLNATSACFINN 384
Query: 379 IGTNSDVTVKFNGNSYLLPAWSVSILPDCKNVVFNTAKINSVTLVPSFSRQSLQVAADSS 438
+ DV V +G ++ LPAWSVSILP+CK V FN+AKI + T V ++ +
Sbjct: 385 RFDDRDVNVTLDGTTHFLPAWSVSILPNCKTVAFNSAKIKTQTTVMVNKTSMVEQQTEHF 444
Query: 439 DAIGSGWSYINE---PVGISKDDAFTKPGLLEQINTTADQSDYLWYSLSTNIKADEPLLE 495
WS++ E P + F K LLEQI TT DQSDYLWY S K +
Sbjct: 445 K-----WSWMPENLRPFMTDEKGNFRKNELLEQIVTTTDQSDYLWYRTSLEHKGE----- 494
Query: 496 DGSKTVLHVQSLGHALHAFINGKLVGSGYGSSSNAKVTVDFPIALAPGKNTFDLLSLTVG 555
VL+V + GH L+AF+NGKLVG Y + N TF L S
Sbjct: 495 --GSYVLYVNTTGHELYAFVNGKLVGQQYSPNENF---------------TFQLKS---- 533
Query: 556 LQNYGAFYEKTGAGIT-GPVQLKGSGNGTNIDLSSQQWTYQTGLKGEELN-FPSGSSTQW 613
NYG +E AGI GPV+L S +G+ IDLS+ W+Y+ GL GE + +W
Sbjct: 534 -PNYGGSFELLPAGIVGGPVKLIDS-SGSAIDLSNNSWSYKAGLAGEYRKIYLDKPGNKW 591
Query: 614 DS-KSTLPKLQPLVWYKTTFDAPAGSEPVAIDFTGMGKGEAWVNGQSIGRYWPTYVSQNG 672
S ST+P +P WYKTTF APAG + V +D G+ KG AWVNG S+GRYWP+YV+ +
Sbjct: 592 RSHNSTIPINRPFTWYKTTFQAPAGEDSVVVDLHGLNKGVAWVNGNSLGRYWPSYVAADM 651
Query: 673 GCTDSCNYRGAY----SSNKCLKNCGKPSQSLYHVPRSWL-KSSGNTLVLFEEIGGDPTK 727
C+YRG + + KCL CG+PSQ LYHVPRS+L K NTL+LFEE GGDP++
Sbjct: 652 PGCHHCDYRGVFKAEVEAQKCLTGCGEPSQQLYHVPRSFLNKGEPNTLILFEEAGGDPSE 711
Query: 728 ISFVTKQLGSSLCSHVTDSHPLPVDMWGSDSKIQRKPGPVLSLECPNPNQVISSIKFASF 787
++ T GS S + G ++L C + ISS+ ASF
Sbjct: 712 VAVRTVVEGSVCAS--------------------AEVGDTVTLSCGAHGRTISSVDVASF 751
Query: 788 GTPLGTCGSFSRGRCSSARSLSVVRQACVGSKSCSIGVSVNTFGDPCKGVMKSLAVEASC 847
G G CGS+ G C S + ACVG +SC++ V+ C V L V+A+C
Sbjct: 752 GVARGRCGSYD-GGCESKVAYDAFAAACVGKESCTVLVTDAFANAGC--VSGVLTVQATC 808
>gi|125597922|gb|EAZ37702.1| hypothetical protein OsJ_22044 [Oryza sativa Japonica Group]
Length = 811
Score = 723 bits (1867), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 386/840 (45%), Positives = 505/840 (60%), Gaps = 72/840 (8%)
Query: 24 GANVTYDHRAVVIGGKRRVLISGSIHYPRSTPEMWPDLIQKSKDGGLDVIETYVFWNLHE 83
G VTY+ R++VI G+RR++ISGSIHYPRSTPEMWPDLI+K+K+GGLD IETYVFWN HE
Sbjct: 28 GTTVTYNDRSLVIDGERRIIISGSIHYPRSTPEMWPDLIKKAKEGGLDAIETYVFWNGHE 87
Query: 84 PVRNQYNFEGRYDLVKFVKLVAEAGLYAHLRIGPYVCAEWNFGGFPLWLHFIPGIQFRTD 143
P R QYNF G YD+V+F K + AGLYA LRIGPY+C EWN+GG P WL IPG+QFR
Sbjct: 88 PHRRQYNFVGNYDIVRFFKEIQNAGLYAILRIGPYICGEWNYGGLPAWLRDIPGMQFRLH 147
Query: 144 NEPFKAEMQRFTAKIVDMMKQEKLYASQGGPIILSQIENEYGNIDSAY--GAAGKSYIKW 201
N PF+ EM+ FT IV+ MK ++A QGGPIIL+QIENEYGNI + YI W
Sbjct: 148 NAPFENEMEIFTTLIVNKMKDANMFAGQGGPIILAQIENEYGNIMGQLNNNQSASEYIHW 207
Query: 202 AAGMALSLDTGVPWVMCQQ-SDAPDPIINTCNGFYCDQFTPNSNNKPKMWTENWSGWFLS 260
A MA + GVPW+MCQQ SD P ++NTCNGFYC + PN PK+WTENW+GWF +
Sbjct: 208 CADMANKQNVGVPWIMCQQDSDVPHNVVNTCNGFYCHDWFPNRTGIPKIWTENWTGWFKA 267
Query: 261 FGGAVPYRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNFDRTSGGPFISTSYDYDAPLDE 320
+ +R ED+AFAVA FFQ+ GGP+I+TSYDYDAPLDE
Sbjct: 268 WDKPDFHRSAEDIAFAVAMFFQK-------------------RGGPYITTSYDYDAPLDE 308
Query: 321 YGLIRQPKWGHLKDLHKAIKLCEAALVATDPTYPSLGPNLEATVYKTGSGLCSAFLANIG 380
YG +RQPK+GHLKDLH IK E LV + + + T Y S + F+ N
Sbjct: 309 YGNLRQPKYGHLKDLHSVIKSIEKILVHGEYVDTNYSDKVTVTKYTLDS-TSACFINNRN 367
Query: 381 TNSDVTVKFNGNSYLLPAWSVSILPDCKNVVFNTAKINSVTLVPSFSRQSLQVAADSSDA 440
N DV V +G ++LLPAWSVSILPDCK V FN+AKI + T V + ++ +S
Sbjct: 368 DNMDVNVTLDGTTHLLPAWSVSILPDCKTVAFNSAKIKAQTTVMVNKAKMVEKEPESLK- 426
Query: 441 IGSGWSYINE---PVGISKDDAFTKPGLLEQINTTADQSDYLWYSLSTNIKADEPLLEDG 497
WS++ E P + ++ K LLEQI T+ DQSDYLWY S N K +
Sbjct: 427 ----WSWMRENLTPFMTDEKGSYRKNELLEQIVTSTDQSDYLWYRTSINHKGE------- 475
Query: 498 SKTVLHVQSLGHALHAFINGKLVGSGYGSSSNAKVTVDFPIALAPGKNTFDLLSLTVGLQ 557
+ L V + GH L+AF+NG LVG + + + ++ P L GKN LLS T+GL+
Sbjct: 476 ASYTLFVNTTGHELYAFVNGMLVGQNHSPNGHFVFQLESPAKLHDGKNYISLLSATIGLK 535
Query: 558 NYGAFYEKTGAGIT-GPVQLKGSGNGTNIDLSSQQWTYQTGLKGE--ELNFPSGSSTQWD 614
NYG +EK AGI GPV+L NG IDLS+ W+Y+ GL GE +++ T +
Sbjct: 536 NYGPLFEKMPAGIVGGPVKLI-DNNGKGIDLSNSSWSYKAGLAGEYRQIHLDKPGCTWDN 594
Query: 615 SKSTLPKLQPLVWYKTTFDAPAGSEPVAIDFTGMGKGEAWVNGQSIGRYWPTYVSQNG-- 672
+ T+P +P WYKTTF APAG + V +D G+ KG AWVNG ++GRYWP+Y +
Sbjct: 595 NNGTVPINKPFTWYKTTFQAPAGEDTVVVDLLGLNKGVAWVNGNNLGRYWPSYTAARSMR 654
Query: 673 GCTDSCNYRGAYSS----NKCLKNCGKPSQSLYHVPRSWLKSSG-NTLVLFEEIGGDPTK 727
+ +YRG + + KCL CG+PSQ YHVPRS+LK+ NT++LFEE GGDP+
Sbjct: 655 RLPTTAHYRGVFQAEGDGQKCLTGCGEPSQRFYHVPRSFLKNGEPNTVILFEEAGGDPSH 714
Query: 728 ISFVTKQLGSSLCSHVTDSHPLPVDMWGSDSKIQRKPGPVLSLECPNPNQVISSIKFASF 787
+SF T GS S + G ++L C ++ IS+I SF
Sbjct: 715 VSFRTVAAGSVCAS--------------------AEVGDTITLSCGQHSKTISAINVTSF 754
Query: 788 GTPLGTCGSFSRGRCSSARSLSVVRQACVGSKSCSIGVSVNTFGDPCKGVMKSLAVEASC 847
G G CG++ +G C S + +AC+G +SC++ ++ G C + L V+ASC
Sbjct: 755 GVARGQCGAY-KGGCESKAAYKAFTEACLGKESCTVQITNAVTGSGC--LSNVLTVQASC 811
>gi|222635782|gb|EEE65914.1| hypothetical protein OsJ_21762 [Oryza sativa Japonica Group]
Length = 579
Score = 719 bits (1857), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 340/569 (59%), Positives = 412/569 (72%), Gaps = 12/569 (2%)
Query: 28 TYDHRAVVIGGKRRVLISGSIHYPRSTPEMWPDLIQKSKDGGLDVIETYVFWNLHEPVRN 87
TYDHR++ I G+RR+LISGSIHYPRSTPEMWPDLIQK+KDGGLDVI+TYVFWN HEPV+
Sbjct: 23 TYDHRSLTINGQRRILISGSIHYPRSTPEMWPDLIQKAKDGGLDVIQTYVFWNGHEPVQG 82
Query: 88 QYNFEGRYDLVKFVKLVAEAGLYAHLRIGPYVCAEWNFGGFPLWLHFIPGIQFRTDNEPF 147
QY F RYDLV+FVKLV +AGLY +LRIGPYVCAEWN+GGFP+WL ++PGI FRTDN PF
Sbjct: 83 QYYFSDRYDLVRFVKLVKQAGLYVNLRIGPYVCAEWNYGGFPVWLKYVPGISFRTDNGPF 142
Query: 148 KAEMQRFTAKIVDMMKQEKLYASQGGPIILSQIENEYGNIDSAYGAAGKSYIKWAAGMAL 207
KA MQ F KIV MMK E L+ QGGPIIL+Q+ENEYG ++S G+ KSY+ WAA MA+
Sbjct: 143 KAAMQTFVEKIVSMMKSEGLFEWQGGPIILAQVENEYGPMESVMGSGAKSYVDWAAKMAV 202
Query: 208 SLDTGVPWVMCQQSDAPDPIINTCNGFYCDQFTPNSNNKPKMWTENWSGWFLSFGGAVPY 267
+ + GVPW+MC+Q DAPDP+INTCNGFYCD FTPNS NKP MWTE WSGWF +FGG VP
Sbjct: 203 ATNAGVPWIMCKQDDAPDPVINTCNGFYCDDFTPNSKNKPSMWTEAWSGWFTAFGGTVPQ 262
Query: 268 RPVEDLAFAVARFFQRGGTFQNYYMYHGGTNFDRTSGGPFISTSYDYDAPLDEYGLIRQP 327
RPVEDLAFAVARF Q+GG+F NYYMYHGGTNFDRT+GGPFI+TSYDYDAP+DEYGL+RQP
Sbjct: 263 RPVEDLAFAVARFIQKGGSFINYYMYHGGTNFDRTAGGPFIATSYDYDAPIDEYGLLRQP 322
Query: 328 KWGHLKDLHKAIKLCEAALVATDPTYPSLGPNLEATVYKTGSGLCSAFLANIGTNSDVTV 387
KWGHL +LHKAIK E ALVA DPT ++G +A V+++ SG C+AFL+N T++ V
Sbjct: 323 KWGHLTNLHKAIKQAETALVAGDPTVQNIGNYEKAYVFRSSSGDCAAFLSNFHTSAAARV 382
Query: 388 KFNGNSYLLPAWSVSILPDCKNVVFNTAKINSVTLVPSFSRQSLQVAADSSDAIGSGWSY 447
FNG Y LPAWS+S+LPDC+ V+NTA + + + A + A G W
Sbjct: 383 AFNGRRYDLPAWSISVLPDCRTAVYNTATVTAAS-----------SPAKMNPAGGFTWQS 431
Query: 448 INEPVGISKDDAFTKPGLLEQINTTADQSDYLWYSLSTNIKADEPLLEDGSKTVLHVQSL 507
E + AFTK GL+EQ++ T D+SDYLWY+ NI + E L+ G L V S
Sbjct: 432 YGEATNSLDETAFTKDGLVEQLSMTWDKSDYLWYTTYVNIDSGEQFLKSGQWPQLTVYSA 491
Query: 508 GHALHAFINGKLVGSGYGSSSNAKVTVDFPIALAPGKNTFDLLSLTVGLQNYGAFYEKTG 567
GH++ F+NG+ G+ YG K+T + + G N +LS VGL N G YE
Sbjct: 492 GHSVQVFVNGQYFGNAYGGYDGPKLTYSGYVKMWQGSNKISILSSAVGLPNVGTHYETWN 551
Query: 568 AGITGPVQLKGSGNGTNIDLSSQQWTYQT 596
G+ GPV L G G DLS Q+WTYQ
Sbjct: 552 IGVLGPVTLSGLNEGKR-DLSKQKWTYQV 579
>gi|222424809|dbj|BAH20357.1| AT5G56870 [Arabidopsis thaliana]
Length = 620
Score = 716 bits (1848), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 355/635 (55%), Positives = 442/635 (69%), Gaps = 20/635 (3%)
Query: 103 LVAEAGLYAHLRIGPYVCAEWNFGGFPLWLHFIPGIQFRTDNEPFKAEMQRFTAKIVDMM 162
LV +AGLY +LRIGPYVCAEWNFGGFP+WL F+PG+ FRTDNEPFKA M++FT KIV MM
Sbjct: 1 LVHQAGLYVNLRIGPYVCAEWNFGGFPVWLKFVPGMAFRTDNEPFKAAMKKFTEKIVWMM 60
Query: 163 KQEKLYASQGGPIILSQIENEYGNIDSAYGAAGKSYIKWAAGMALSLDTGVPWVMCQQSD 222
K EKL+ +QGGPIIL+QIENEYG ++ GA GK+Y KW A MAL L TGVPW+MC+Q D
Sbjct: 61 KAEKLFQTQGGPIILAQIENEYGPVEWEIGAPGKAYTKWVAQMALGLSTGVPWIMCKQED 120
Query: 223 APDPIINTCNGFYCDQFTPNSNNKPKMWTENWSGWFLSFGGAVPYRPVEDLAFAVARFFQ 282
AP PII+TCNG+YC+ F PNS NKPKMWTENW+GW+ +FGGAVPYRPVED+A++VARF Q
Sbjct: 121 APGPIIDTCNGYYCEDFKPNSINKPKMWTENWTGWYTNFGGAVPYRPVEDIAYSVARFIQ 180
Query: 283 RGGTFQNYYMYHGGTNFDRTSGGPFISTSYDYDAPLDEYGLIRQPKWGHLKDLHKAIKLC 342
+GG+ NYYMYHGGTNFDRT+ G F+++SYDYDAPLDEYGL R+PK+ HLK LHKAIKL
Sbjct: 181 KGGSLVNYYMYHGGTNFDRTA-GEFMASSYDYDAPLDEYGLPREPKYSHLKALHKAIKLS 239
Query: 343 EAALVATDPTYPSLGPNLEATVYKTGSGLCSAFLANIGTNSDVTVKFNGNSYLLPAWSVS 402
E AL++ D T SLG EA V+ + S C+AFL+N NS V F G Y LP WSVS
Sbjct: 240 EPALLSADATVTSLGAKQEAYVFWSKSS-CAAFLSNKDENSAARVLFRGFPYDLPPWSVS 298
Query: 403 ILPDCKNVVFNTAKINSVTLVPSFSRQSLQVAADSSDAIGSGWSYINEPVGISKD-DAFT 461
ILPDCK V+NTAK+N+ PS R + S W NE + + F
Sbjct: 299 ILPDCKTEVYNTAKVNA----PSVHRNMVPTGTKFS------WGSFNEATPTANEAGTFA 348
Query: 462 KPGLLEQINTTADQSDYLWYSLSTNIKADEPLLEDGSKTVLHVQSLGHALHAFINGKLVG 521
+ GL+EQI+ T D+SDY WY I + E L+ G +L V S GHALH F+NG+L G
Sbjct: 349 RNGLVEQISMTWDKSDYFWYITDITIGSGETFLKTGDSPLLTVMSAGHALHVFVNGQLSG 408
Query: 522 SGYGSSSNAKVTVDFPIALAPGKNTFDLLSLTVGLQNYGAFYEKTGAGITGPVQLKGSGN 581
+ YG + K+T I L G N LLS+ VGL N G +E+ G+ GPV LKG +
Sbjct: 409 TAYGGLDHPKLTFSQKIKLHAGVNKIALLSVAVGLPNVGTHFEQWNKGVLGPVTLKGVNS 468
Query: 582 GTNIDLSSQQWTYQTGLKGEELNFPSG---SSTQWDSKSTLPKLQPLVWYKTTFDAPAGS 638
GT D+S +W+Y+ G+KGE L+ + S +W S + K QPL WYK+TF PAG+
Sbjct: 469 GT-WDMSKWKWSYKIGVKGEALSLHTNTESSGVRWTQGSFVAKKQPLTWYKSTFATPAGN 527
Query: 639 EPVAIDFTGMGKGEAWVNGQSIGRYWPTYVSQNGGCTDSCNYRGAYSSNKCLKNCGKPSQ 698
EP+A+D MGKG+ W+NG++IGR+WP Y +Q G CNY G + + KCL NCG+ SQ
Sbjct: 528 EPLALDMNTMGKGQVWINGRNIGRHWPAYKAQ--GSCGRCNYAGTFDAKKCLSNCGEASQ 585
Query: 699 SLYHVPRSWLKSSGNTLVLFEEIGGDPTKISFVTK 733
YHVPRSWLKS N +V+FEE+GGDP IS V +
Sbjct: 586 RWYHVPRSWLKSQ-NLIVVFEELGGDPNGISLVKR 619
>gi|320170852|gb|EFW47751.1| beta-galactosidase [Capsaspora owczarzaki ATCC 30864]
Length = 851
Score = 716 bits (1848), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 376/871 (43%), Positives = 536/871 (61%), Gaps = 67/871 (7%)
Query: 1 MASKEILLLVLCWGFVVLATTSFGA-NVTYDHRAVVIGGKRRVLISGSIHYPRSTPEMWP 59
+A + + +C +++A S A NVTYD RA+++ G+RR+LI+G IHYPRSTPEMWP
Sbjct: 23 LAVLMVAAVAMCCSAILVALPSTSAMNVTYDSRALLLDGQRRLLIAGCIHYPRSTPEMWP 82
Query: 60 DLIQKSKDGGLDVIETYVFWNLHEPVRNQYNFEGRYDLVKFVKLVAEAGLYAHLRIGPYV 119
+L ++K GLDVI+TY+FW++++P ++ R+D V+F+KL +AGL + RIGPYV
Sbjct: 83 ELFARAKANGLDVIQTYLFWDVNQPTPGEFVMTDRFDYVRFIKLAQQAGLMVNFRIGPYV 142
Query: 120 CAEWNFGGFPLWLHFIPGIQFRTDNEPFKAEMQRFTAKIVDMMKQEKLYASQGGPIILSQ 179
CAEWN+GGFP WL I GI FR +++P+ + + K V ++K KL A+ GGP+IL Q
Sbjct: 143 CAEWNYGGFPAWLRQISGIVFRDNDKPWLDVVGPYITKTVQVLKDNKLLAADGGPVILLQ 202
Query: 180 IENEYGNIDSAYGAAGKSYIKWAAGMALSLDTGVPWVMCQQSDAPDPIINTCNGFYCDQF 239
IENEYGNI+ +Y A G +Y++W +A SL+ G W+MCQQ DAP I TCNGFYCD +
Sbjct: 203 IENEYGNIEDSY-AGGPAYVQWCGQLAASLNAGAQWIMCQQDDAPANTIATCNGFYCDNY 261
Query: 240 TPNSNNKPKMWTENWSGWFLSFGGAVPYRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNF 299
P+ +P MWTENW GWF ++G P+RP +D+AFA ARF+ +GGT+ +YYMYHGGTNF
Sbjct: 262 VPH-KGQPMMWTENWPGWFQTWGQPSPHRPAQDVAFAAARFYAKGGTYMSYYMYHGGTNF 320
Query: 300 DRTSGGPFISTSYDYDAPLDEYGLIRQPKWGHLKDLHKAIKLCEAALVATDPTYP-SLGP 358
RT+GGP I+TSYDYD LDEYG+ +PK+ HL LH + E +++ + P SLG
Sbjct: 321 GRTAGGPGITTSYDYDVALDEYGMPSEPKYSHLGSLHAVLHANEHIIMSMNVPAPISLGK 380
Query: 359 NLEATVYKTGSGLCSAFLANIGTNSDVTVKFNGNSYLLPAWSVSILPDCKNVVFNTAKI- 417
NLEA V+ + SG C AFL+NI ++ D V+FNG ++ LPAWSVSIL +C ++NTA +
Sbjct: 381 NLEAHVFNSSSG-CVAFLSNIDSSVDAEVQFNGRTFELPAWSVSILHNCAFAIYNTAAVS 439
Query: 418 ---NSVTLVPSFSRQ-SLQVAADSSDAIGSG-----------WSYINEPVGISKDDA--F 460
N+ + P + ++ AAD ++ G ++ E +G ++A F
Sbjct: 440 APLNARRMTPLVVHEDAVSDAADHRRSLSKGEGQERVGAFSTFASYAETIGRRAEEAVYF 499
Query: 461 TKPGLLEQINTTADQSDYLWYSLSTNIKADEPLLEDGSKTVLHVQSLGHALHAFINGKLV 520
T P EQINTT D +DYLWY+ + N + + VL + ++ ++ ++N + V
Sbjct: 500 TSPQ--EQINTTNDTTDYLWYTTTYNSAS-------ATSQVLSISNVNDVVYVYVNRQFV 550
Query: 521 GSGYGSSSNAKVTVDFPIALAPGKNTFDLLSLTVGLQNYGAFYEKTGAGITGPVQLKGSG 580
+ S N V L G N D+LS T GLQNYG F E+ GI G V+L +
Sbjct: 551 TMSWSGSVNKAV------PLMAGTNVIDVLSTTFGLQNYGTFLEQVTRGIQGTVKLGST- 603
Query: 581 NGTNIDLSSQQWTYQTGLKGEELNF---PSGSSTQWDSKSTLPKLQPLVWYKTTFDAPAG 637
DL+ W +Q GL GEEL + S+ W + +T + L WY+++FD P
Sbjct: 604 -----DLTQNGWWHQVGLLGEELGIFLPQNASNVPWATPATTNR--GLTWYRSSFDLPQS 656
Query: 638 SE-PVAIDFTGMGKGEAWVNGQSIGRYWPTYVSQNGGCTDSCNYRGAYSSNKCLKNCGKP 696
S+ P+A+D TGMGKG WVNG ++GRYWP+ ++ + C D C+YRGAY ++C + C P
Sbjct: 657 SQAPLALDMTGMGKGFVWVNGHNLGRYWPSRIADSMAC-DDCDYRGAYDDSRCRQGCNIP 715
Query: 697 SQSLYHVPRSWLKSSGNTLVLFEEIGGDPTKISFVTKQLGSSLCSHVTDSHPLPVDMWGS 756
SQ YHVPR WL+ + N +V+ EEIGG+P IS V ++ S C V + +P
Sbjct: 716 SQRYYHVPREWLQPTNNLIVMLEEIGGNPALISLVEREEDIS-CGAVGEDYP------AD 768
Query: 757 DSKIQRKPGPVLSLECPNPNQVISSIKFASFGTPLGTCGSFSRGRCSSARSLSVVRQACV 816
D + L C +Q I ++FASFGTP+GTC FS G C++A S ++V C+
Sbjct: 769 DLSV--------VLGC-GLHQTIRRVEFASFGTPVGTCRQFSLGSCNAANSTAIVESLCL 819
Query: 817 GSKSCSIGVSVNTFGDPCKGVMKSLAVEASC 847
G ++C + V++N FGDPC K L V+ SC
Sbjct: 820 GRQACHVPVAINHFGDPCPDTTKRLFVQVSC 850
>gi|26451843|dbj|BAC43014.1| unknown protein [Arabidopsis thaliana]
gi|29029060|gb|AAO64909.1| At1g77410 [Arabidopsis thaliana]
Length = 820
Score = 715 bits (1846), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 391/838 (46%), Positives = 515/838 (61%), Gaps = 72/838 (8%)
Query: 15 FVVLATTSFG--ANVTYDHRAVVIGGKRRVLISGSIHYPRSTPEMWPDLIQKSKDGGLDV 72
V++A G ANVTYD R+++I G+ ++L SGSIHY RSTP+MWP LI K+K GG+DV
Sbjct: 11 LVLMAVIVAGDVANVTYDGRSLIIDGEHKILFSGSIHYTRSTPQMWPSLIAKAKSGGIDV 70
Query: 73 IETYVFWNLHEPVRNQYNFEGRYDLVKFVKLVAEAGLYAHLRIGPYVCAEWNFGGFPLWL 132
++TYVFWN+HEP + Q++F G D+VKF+K V GLY LRIGP++ EW++GG P WL
Sbjct: 71 VDTYVFWNVHEPQQGQFDFSGSRDIVKFIKEVKNHGLYVCLRIGPFIQGEWSYGGLPFWL 130
Query: 133 HFIPGIQFRTDNEPFKAEMQRFTAKIVDMMKQEKLYASQGGPIILSQIENEYGNIDSAYG 192
H + GI FRTDNEPFK M+R+ IV +MK E LYASQGGPIILSQIENEYG + A+
Sbjct: 131 HNVQGIVFRTDNEPFKYHMKRYAKMIVKLMKSENLYASQGGPIILSQIENEYGMVGRAFR 190
Query: 193 AAGKSYIKWAAGMALSLDTGVPWVMCQQSDAPDPIINTCNGFYCDQFT--PNSNNKPKMW 250
GKSY+KW A +A+ LDTGVPWVMC+Q DAPDP++N CNG C + PNS NKP +W
Sbjct: 191 QEGKSYVKWTAKLAVELDTGVPWVMCKQDDAPDPLVNACNGRQCGETFKGPNSPNKPAIW 250
Query: 251 TENWSGWFLSFGGAVPYRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNFDRTSGGPFIST 310
TENW+ ++ ++G R ED+AF VA F + G+F NYYMYHGGTNF R + F+ T
Sbjct: 251 TENWTSFYQTYGEEPLIRSAEDIAFHVALFIAKNGSFVNYYMYHGGTNFGRNA-SQFVIT 309
Query: 311 SYDYDAPLDEYGLIRQPKWGHLKDLHKAIKLCEAALVATDPTYPSLGPNLEATVYKTGSG 370
SY APLDEYGL+RQPKWGHLK+LH A+KLCE L++ T SLG A V+ +
Sbjct: 310 SYYDQAPLDEYGLLRQPKWGHLKELHAAVKLCEEPLLSGLQTTISLGKLQTAFVFGKKAN 369
Query: 371 LCSAFLANIGTNSDVTVKFNGNSYLLPAWSVSILPDCKNVVFNTAKINS-VTLVPSFSRQ 429
LC+A L N + TV+F +SY L SVS+LPDCKNV FNTAK+N+ +RQ
Sbjct: 370 LCAAILVN-QDKCESTVQFRNSSYRLSPKSVSVLPDCKNVAFNTAKVNAQYNTRTRKARQ 428
Query: 430 SLQVAADSSDAIGSGWSYINEPVGISKDDAFTKPGLLEQINTTADQSDYLWYSLSTNIKA 489
+L SS + W E V + + LLE +NTT D SDYLW +T +
Sbjct: 429 NL-----SSPQM---WEEFTETVPSFSETSIRSESLLEHMNTTQDTSDYLWQ--TTRFQQ 478
Query: 490 DEPLLEDGSKTVLHVQSLGHALHAFINGKLVGSGYGSSSNAKVTVDFPIALAPGKNTFDL 549
E G+ +VL V LGHALHAF+NG+ +GS +G+ + ++ ++L G N L
Sbjct: 479 SE-----GAPSVLKVNHLGHALHAFVNGRFIGSMHGTFKAHRFLLEKNMSLNNGTNNLAL 533
Query: 550 LSLTVGLQNYGAFYEKTGAGITGPVQLKGSGNGTNIDLSSQQWTYQTGLKGEELNFPS-- 607
LS+ VGL N GA E+ + G +K + ++ W YQ GLKGE+ + +
Sbjct: 534 LSVMVGLPNSGAHLERR---VVGSRSVKIWNGRYQLYFNNYSWGYQVGLKGEKFHVYTED 590
Query: 608 -GSSTQW----DSKSTLPKLQPLVWYKTTFDAPAGSEPVAIDFTGMGKGEAWVNGQSIGR 662
+ QW DSKS QPL WYK +FD P G +PVA++ MGKGEAWVNGQSIGR
Sbjct: 591 GSAKVQWKQYRDSKS-----QPLTWYKASFDTPEGEDPVALNLGSMGKGEAWVNGQSIGR 645
Query: 663 YWPTYVSQNGGCTDSCNYRGAYSSNKCLKNCGKPSQSLYHVPRSWLKSSGNTLVLF-EEI 721
YW ++ + G PSQ YH+PRS+LK + N LV+ EE
Sbjct: 646 YWVSFHTYKGN----------------------PSQIWYHIPRSFLKPNSNLLVILEEER 683
Query: 722 GGDPTKISFVTKQLGSSLCSHVTDSHPLPV--------DMWGSDSKIQRKPGPVLSLECP 773
G+P I+ T + + +C HV++++P PV + + RKP + L+CP
Sbjct: 684 EGNPLGITIDTVSV-TEVCGHVSNTNPHPVISPRKKGLNRKNLTYRYDRKPK--VQLQCP 740
Query: 774 NPNQVISSIKFASFGTPLGTCGSFSRGRCSSARSLSVVRQACVGSKSCSIGVSVNTFG 831
+ IS I FASFGTP G+CGS+S G C S SL+VV++AC+ CS+ V TFG
Sbjct: 741 TGRK-ISKILFASFGTPNGSCGSYSIGSCHSPNSLAVVQKACLKKSRCSVPVWSKTFG 797
>gi|45758292|gb|AAS76480.1| beta-galactosidase [Gossypium hirsutum]
Length = 843
Score = 711 bits (1836), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 382/836 (45%), Positives = 506/836 (60%), Gaps = 57/836 (6%)
Query: 27 VTYDHRAVVIGGKRRVLISGSIHYPRSTPEMWPDLIQKSKDGGLDVIETYVFWNLHEPVR 86
VTYD R+++I GKR +L SG+IHYPRSTP+MWPDLI+K+K GG++ IETYVFWN HEPV
Sbjct: 49 VTYDARSLIINGKRELLFSGAIHYPRSTPDMWPDLIKKAKQGGINAIETYVFWNGHEPVE 108
Query: 87 NQYNFEGRYDLVKFVKLVAEAGLYAHLRIGPYVCAEWNFGGFPLWLHFIPGIQFRTDNEP 146
QYNFEG +DLVKF+KL+ E LYA +R+GP++ AEWN GG P WL +PGI FR+DNEP
Sbjct: 109 GQYNFEGEFDLVKFIKLIHEHKLYAVVRVGPFIQAEWNHGGLPYWLREVPGIIFRSDNEP 168
Query: 147 FKAEMQRFTAKIVDMMKQEKLYASQGGPIILSQIENEYGNIDSAYGAAGKSYIKWAAGMA 206
FK M+RF IVD +KQEKL+A QGGPIIL+QIENEY I A+ G SY++WA +A
Sbjct: 169 FKKHMKRFVTLIVDKLKQEKLFAPQGGPIILAQIENEYNTIQRAFREKGDSYVQWAGKLA 228
Query: 207 LSLDTGVPWVMCQQSDAPDPIINTCNGFYCDQ--FTPNSNNKPKMWTENWSGWFLSFGGA 264
LSL+ VPW+MC+Q DAPDPIINTCNG +C + PN NKP +WTENW+ + FG
Sbjct: 229 LSLNANVPWIMCKQRDAPDPIINTCNGRHCGDTFYGPNKRNKPALWTENWTAQYRVFGDP 288
Query: 265 VPYRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNFDRTSGGPFISTSYDYDAPLDEYGLI 324
R EDLA++VARFF + G+ NYYM++GGTNF RTS F +T Y + PLDE+GL
Sbjct: 289 PSQRSAEDLAYSVARFFSKNGSMVNYYMHYGGTNFGRTSAS-FTTTRYYDEGPLDEFGLQ 347
Query: 325 RQPKWGHLKDLHKAIKLCEAALVATDPTYPSLGPNLEATVYKT-GSGLCSAFLANIGTNS 383
R+PKWGHLKD+H+A+ LC+ AL PT LGP+ +A V++ G+ C+AFLAN T
Sbjct: 348 REPKWGHLKDVHRALSLCKRALFWGFPTTLKLGPDQQAIVWQQPGTSACAAFLANNNTRL 407
Query: 384 DVTVKFNGNSYLLPAWSVSILPDCKNVVFNTAKINSVTLVPSFSRQSLQVAADSSDAIGS 443
V F G LPA S+S+LPDCK VVFNT + + +F R ++A + +
Sbjct: 408 AQHVNFRGQDIRLPARSISVLPDCKTVVFNTQLVTTQHNSRNFVRS--EIANKNFN---- 461
Query: 444 GWSYINE--PVGISKDDAFTKPGLLEQINTTADQSDYLWYSLSTNIKADEPLLEDGSKTV 501
W E PVG+ F E + T D +DY WY+ S + + ++ + V
Sbjct: 462 -WEMCREVPPVGL----GFKFDVPRELFHLTKDTTDYAWYTTSLLLGRRDLPMKKNVRPV 516
Query: 502 LHVQSLGHALHAFINGKLVGSGYGSSSNAKVTVDFPIALAPGKNTFDLLSLTVGLQNYGA 561
L V SLGH +HA++NG+ GS +GS + ++L G+N LL VGL + GA
Sbjct: 517 LRVASLGHGIHAYVNGEYAGSAHGSKVEKSFVLQRAVSLKEGENHIALLGYLVGLPDSGA 576
Query: 562 FYEKTGAGITGPVQLKGSGNGTNIDLSSQQWTYQTGLKGE--ELNFPSGS-STQWDSKST 618
+ EK AG + + G GT +D+S W +Q G+ GE +L GS S QW
Sbjct: 577 YMEKRFAGPRS-ITILGLNTGT-LDISQNGWGHQVGIDGEKKKLFTEEGSKSVQWTKPD- 633
Query: 619 LPKLQPLVWYKTTFDAPAGSEPVAIDFTGMGKGEAWVNGQSIGRYWPTYVSQNGGCTDSC 678
+ PL WYK FDAP G PVAI TGMGKG WVNG+SIGRYW Y+S
Sbjct: 634 --QGGPLTWYKGYFDAPEGDNPVAIVMTGMGKGMVWVNGRSIGRYWNNYLSP-------- 683
Query: 679 NYRGAYSSNKCLKNCGKPSQSLYHVPRSWLKSSGNTLVLFEEIGGDPTKISFVTKQLGSS 738
KP+QS YH+PR++LK N +VL EE GG+P + VT +
Sbjct: 684 --------------LKKPTQSEYHIPRAYLKPK-NLIVLLEEEGGNPKDVHIVTVNR-DT 727
Query: 739 LCSHVTDSH-PLPVDMWGSDSKIQRKPG---PVLSLECPNPNQVISSIKFASFGTPLGTC 794
+CS V++ H P P + +Q K P L+CP Q++ +++FAS+G P G C
Sbjct: 728 ICSAVSEIHPPSPRLFETKNGSLQAKVNDLKPRAELKCPGKKQIV-AVEFASYGDPFGAC 786
Query: 795 GSFSRGRCSSARSLSVVRQACVGSKSCSIGVSVNTF---GDPCKGVMKSLAVEASC 847
G++ G C++ S VV + C+G SC I + F D C + K+LAV+ C
Sbjct: 787 GAYFIGNCTAPESKQVVEKYCLGKPSCQIPLDSIPFSNQNDACTHLRKTLAVQLKC 842
>gi|326520333|dbj|BAK07425.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 841
Score = 708 bits (1827), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 378/847 (44%), Positives = 512/847 (60%), Gaps = 62/847 (7%)
Query: 21 TSFGANVTYDHRAVVIGGKRRVLISGSIHYPRSTPEMWPDLIQKSKDGGLDVIETYVFWN 80
T G +TYD R+++I G+R + SGSIHYPRS WPDLI ++K+GGL+VIE+YVFWN
Sbjct: 30 TKPGTVITYDRRSLMIDGRREIFFSGSIHYPRSPFHEWPDLIARAKEGGLNVIESYVFWN 89
Query: 81 LHEPVRNQYNFEGRYDLVKFVKLVAEAGLYAHLRIGPYVCAEWNFGGFPLWLHFIPGIQF 140
+HEP YNFEGRYD++KF KL+ E ++A +RIGP+V AEWN GG P WL +P I F
Sbjct: 90 IHEPEMGVYNFEGRYDMIKFFKLIQEHEMFAMVRIGPFVQAEWNHGGLPYWLREVPDIVF 149
Query: 141 RTDNEPFKAEMQRFTAKIVDMMKQEKLYASQGGPIILSQIENEYGNIDSAYGAAGKSYIK 200
RTDNEP+K MQ+F +V+ +K KL+ASQGGPIIL+QIENEY ++++A+ G YI
Sbjct: 150 RTDNEPYKKLMQKFVTLVVNKLKDAKLFASQGGPIILAQIENEYQHMEAAFKENGTRYID 209
Query: 201 WAAGMALSLDTGVPWVMCQQSDAPDPIINTCNGFYCDQF--TPNSNNKPKMWTENWSGWF 258
WAA MA+S TGVPW+MC+Q+ AP +I TCNG +C P NKP +WTENW+ +
Sbjct: 210 WAAKMAISTSTGVPWIMCKQTKAPAEVIPTCNGRHCGDTWPGPTDKNKPLLWTENWTAQY 269
Query: 259 LSFGGAVPYRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNFDRTSGGPFISTSYDYDAPL 318
FG R ED+AFAVARFF GG+ NYYMYHGGTNF RT G F+ Y +APL
Sbjct: 270 RVFGDPPSQRSAEDIAFAVARFFSVGGSMVNYYMYHGGTNFGRT-GASFVMPRYYDEAPL 328
Query: 319 DEYGLIRQPKWGHLKDLHKAIKLCEAALVATDPTYPSLGPNLEATVYKT-GSGLCSAFLA 377
DE+G+ ++PKWGHL+DLH A++LC+ AL+ +P+ LG EA +++ +C AFL+
Sbjct: 329 DEFGMYKEPKWGHLRDLHHALRLCKKALLRGNPSTQPLGKLYEARLFEIPEQKVCVAFLS 388
Query: 378 NIGTNSDVTVKFNGNSYLLPAWSVSILPDCKNVVFNTAKINSVTLVPSF--SRQSLQVAA 435
N T D TV F G Y +P SVSIL DCK VVF+T +N+ +F + Q+LQ
Sbjct: 389 NHNTKEDGTVTFRGQQYFVPRRSVSILADCKTVVFSTQHVNAQHNQRTFHLTDQTLQ--- 445
Query: 436 DSSDAIGSGWSYINE----PVGISKDDAFTKPGLLEQINTTADQSDYLWYSLSTNIKADE 491
+ W E P D KP LE N T D++DYLWY+ S ++A++
Sbjct: 446 ------NNVWEMYTEGDKVPTYKFTTDRSEKP--LEAYNMTKDKTDYLWYTTSFKLEAED 497
Query: 492 PLLEDGSKTVLHVQSLGHALHAFINGKLVGSGYGSSSNAKVTVDFPIALAPGKNTFDLLS 551
K VL S GHA+ AF+NGKLVG+ +G+ N +++ PI + G N +LS
Sbjct: 498 LPFRQDIKPVLEASSHGHAMVAFVNGKLVGAAHGTKMNKAFSLEKPIEVRAGINHVSILS 557
Query: 552 LTVGLQNYGAFYEKTGAGITGPVQLKGSGNGTNIDLSSQQWTYQTGLKGE--ELNFPSGS 609
T+GLQ+ GA+ E AG+ V ++G GT +DLSS W + GL GE + + G
Sbjct: 558 STLGLQDSGAYLEHRQAGVHS-VTIQGLNTGT-LDLSSNGWGHIVGLDGERKQAHMDKGG 615
Query: 610 STQWDSKSTLPKLQPLVWYKTTFDAPAGSEPVAIDFTGMGKGEAWVNGQSIGRYWPTYVS 669
QW K + L PL WY+ FD P+G +PV ID MGKG +VNG+ +GRYW +Y
Sbjct: 616 EVQW--KPAVFDL-PLTWYRRRFDMPSGEDPVVIDLNPMGKGILFVNGEGLGRYWSSYKH 672
Query: 670 QNGGCTDSCNYRGAYSSNKCLKNCGKPSQSLYHVPRSWLKSSGNTLVLFEEIGGDPTKIS 729
G+PSQ LYHVPR +LK +GN L +FEE GG P I
Sbjct: 673 A----------------------LGRPSQYLYHVPRCFLKPTGNVLTIFEEEGGRPDAIM 710
Query: 730 FVTKQLGSSLCSHVTDSHPLPVDMW-GSDSKI-----QRKPGPVLSLECPNPNQVISSIK 783
+T + ++CS +++ +P V W DS++ KP VL+ CP + I +
Sbjct: 711 ILTVKR-DNICSFISEKNPGHVRSWERKDSQLTVVADDLKPRAVLT--CPE-KKTIQQVV 766
Query: 784 FASFGTPLGTCGSFSRGRCSSARSLSVVRQACVGSKSCSIGVSVNTFGDP--CKGVMKSL 841
FAS+G PLG CG+++ G C + ++ VV +ACVG KSC + VS +G C G +L
Sbjct: 767 FASYGNPLGICGNYTVGNCHTPKAKEVVEKACVGKKSCVLAVSHEVYGGDLNCPGTTATL 826
Query: 842 AVEASCT 848
AV+A C+
Sbjct: 827 AVQAKCS 833
>gi|413926110|gb|AFW66042.1| hypothetical protein ZEAMMB73_706783 [Zea mays]
Length = 700
Score = 706 bits (1821), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 346/656 (52%), Positives = 430/656 (65%), Gaps = 65/656 (9%)
Query: 27 VTYDHRAVVIGGKRRVLISGSIHYPRSTPEMWPDLIQKSKDGGLDVIETYVFWNLHEPVR 86
V+YDHR++VI G+RR+LISGSIHYPRS PEMWP LIQK+KDGGLDV++TYVFWN HEP +
Sbjct: 40 VSYDHRSLVINGRRRILISGSIHYPRSAPEMWPGLIQKAKDGGLDVVQTYVFWNGHEPAQ 99
Query: 87 NQYNFEGRYDLVKFVKLVAEAGLYAHLRIGPYVCAEWNFGGFPLWLHFIPGIQFRTDNEP 146
QY F RYDLV+FVKLV +AGLY HLR+GPYVCAEWNFGGFP+WL ++PGI+FRTDN P
Sbjct: 100 GQYYFADRYDLVRFVKLVRQAGLYVHLRVGPYVCAEWNFGGFPVWLKYVPGIRFRTDNGP 159
Query: 147 FKAEMQRFTAKIVDMMKQEKLYASQGGPIILSQIENEYGNIDSAYGAAGKSYIKWAAGMA 206
FKA MQ+F KIV MMK E L+ QGGPII++Q+ENE+G ++S G+ GK Y WAA MA
Sbjct: 160 FKAAMQKFVEKIVSMMKSEGLFEWQGGPIIMAQVENEFGPMESVVGSGGKPYAHWAAQMA 219
Query: 207 LSLDTGVPWVMCQQSDAPDPIINTCNGFYCDQFTPNSNNKPKMWTENWSGWFLSFGGAVP 266
+ + GVPWVMC+Q DAPDP+INTCNGFYCD FTPN+ +KP MWTE W+GWF FGGA P
Sbjct: 220 VGTNAGVPWVMCKQDDAPDPVINTCNGFYCDYFTPNNKHKPTMWTEAWTGWFTKFGGAAP 279
Query: 267 YRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNFDRTSGGPFISTSYDYDAPLDEY----- 321
+RPVEDLAFAVARF Q+GG+F NYYMYHGGTNF RT+GGPFI+TSYDYDAP+DE+
Sbjct: 280 HRPVEDLAFAVARFVQKGGSFVNYYMYHGGTNFGRTAGGPFIATSYDYDAPIDEFGMQWL 339
Query: 322 --------------------------------------------GLIRQPKWGHLKDLHK 337
GL+RQPKWGHL+++H+
Sbjct: 340 LPSLINLNSHRLPRDICRKSSQCGFYLSVVHTWNFWGGGWVYIAGLLRQPKWGHLRNMHR 399
Query: 338 AIKLCEAALVATDPTYPSLGPNLEATVYKTGSGLCSAFLANIGTNSDVTVKFNGNSYLLP 397
AIK E ALV+ DPT S+G +A V+K+ +G C+AFL+N S V ++F+G Y LP
Sbjct: 400 AIKQAEPALVSGDPTIRSIGNYEKAYVFKSKNGACAAFLSNYHVKSAVRIRFDGRHYDLP 459
Query: 398 AWSVSILPDCKNVVFNTAKINSVTLVPSFSRQSLQVAADSSDAIGSGWSYINEPVGISKD 457
AWS+SILPDCK VFNTA + TL+P S + A W +E D
Sbjct: 460 AWSISILPDCKTAVFNTATVKEPTLLPKMSPVMHRFA----------WQSYSEDTNSLDD 509
Query: 458 DAFTKPGLLEQINTTADQSDYLWYSLSTNIKADEPLLEDGSKTVLHVQSLGHALHAFING 517
AF + GL+EQ++ T D+SDYLWY+ NI ++E L+ G L V S GH++ F+NG
Sbjct: 510 SAFARDGLIEQLSLTWDKSDYLWYTTHVNIGSNERFLKSGQWPQLSVYSAGHSMQVFVNG 569
Query: 518 KLVGSGYGSSSNAKVTVDFPIALAPGKNTFDLLSLTVGLQNYGAFYEKTGAGITGPVQLK 577
+ GS YG N K+T + + G N +LS VGL N G +E G+ GPV L
Sbjct: 570 RSYGSVYGGYDNPKLTFSGYVKMWQGSNKISILSSAVGLPNNGDHFELWNVGVLGPVTLS 629
Query: 578 GSGNGTNIDLSSQQWTYQTGLKGEELNFPS---GSSTQWDSKSTLPKLQPLVWYKT 630
G G DLS Q+W YQ GLKGE L + S+ +W QPL W+K
Sbjct: 630 GLNEGKR-DLSHQRWIYQVGLKGESLGLHTVTGSSAVEWAGPGG--GTQPLTWHKV 682
>gi|24417238|gb|AAN60229.1| unknown [Arabidopsis thaliana]
Length = 569
Score = 705 bits (1819), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 347/580 (59%), Positives = 421/580 (72%), Gaps = 28/580 (4%)
Query: 6 ILLLVLCWGFVVLATTSFGANVTYDHRAVVIGGKRRVLISGSIHYPRSTPEMWPDLIQKS 65
I L +LC+ ++ +T A VTYDH+A++I G+RR+LISGSIHYPRSTPEMWPDLI+K+
Sbjct: 11 IFLAILCFSSLIHSTE---AVVTYDHKALIINGQRRILISGSIHYPRSTPEMWPDLIKKA 67
Query: 66 KDGGLDVIETYVFWNLHEPVRNQYNFEGRYDLVKFVKLVAEAGLYAHLRIGPYVCAEWNF 125
K+GGLDVI+TYVFWN HEP Y F+ RYDLVKF KLV +AGLY LRIGPYVCAEWNF
Sbjct: 68 KEGGLDVIQTYVFWNGHEPSPGNYYFQDRYDLVKFTKLVHQAGLYLDLRIGPYVCAEWNF 127
Query: 126 GGFPLWLHFIPGIQFRTDNEPFKAEMQRFTAKIVDMMKQEKLYASQGGPIILSQIENEYG 185
GGFP+WL ++PG+ FRTDNEPFK MQ+FT KIVDMMK+EKL+ +QGGPIILSQIENEYG
Sbjct: 128 GGFPVWLKYVPGMVFRTDNEPFKIAMQKFTKKIVDMMKEEKLFETQGGPIILSQIENEYG 187
Query: 186 NIDSAYGAAGKSYIKWAAGMALSLDTGVPWVMCQQSDAPDPIINTCNGFYCDQFTPNSNN 245
+ GAAGK+Y KW A MAL L TGVPW+MC+Q DAP PII+TCNGFYC+ F PNS+N
Sbjct: 188 PMQWEMGAAGKAYSKWTAEMALGLSTGVPWIMCKQEDAPYPIIDTCNGFYCEGFKPNSDN 247
Query: 246 KPKMWTENWSGWFLSFGGAVPYRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNFDRTSGG 305
KPK+WTENW+GWF FGGA+P RPVED+AF+VARF Q GG+F NYYMY GGTNFDRT+ G
Sbjct: 248 KPKLWTENWTGWFTEFGGAIPNRPVEDIAFSVARFIQNGGSFMNYYMYXGGTNFDRTA-G 306
Query: 306 PFISTSYDYDAPLDEYGLIRQPKWGHLKDLHKAIKLCEAALVATDPTYPSLGPNLEATVY 365
FI+TSYDYDAP+DEYGL+R+PK+ HLK+LHK IKLCE ALV+ DPT SLG E V+
Sbjct: 307 VFIATSYDYDAPIDEYGLLREPKYSHLKELHKVIKLCEPALVSVDPTITSLGDKQEIHVF 366
Query: 366 KTGSGLCSAFLANIGTNSDVTVKFNGNSYLLPAWSVSILPDCKNVVFNTAKINSVT---- 421
K+ + C+AFL+N T+S V F G Y LP WSVSILPDCK +NTAKI + T
Sbjct: 367 KSKTS-CAAFLSNYDTSSAARVMFRGFPYDLPPWSVSILPDCKTEYYNTAKIRAPTILMK 425
Query: 422 LVPS---FSRQSLQVAADSSDAIGSGWSYINEPVGISKDDAFTKPGLLEQINTTADQSDY 478
++P+ FS +S + SS+ G+ F K GL+EQI+ T D++DY
Sbjct: 426 MIPTSTKFSWESYNEGSPSSNEAGT----------------FVKDGLVEQISMTRDKTDY 469
Query: 479 LWYSLSTNIKADEPLLEDGSKTVLHVQSLGHALHAFINGKLVGSGYGSSSNAKVTVDFPI 538
WY I +DE L+ G +L + S GHALH F+NG L G+ YG+ SN+K+T I
Sbjct: 470 FWYFTDITIGSDESFLKTGDNPLLTIFSAGHALHVFVNGLLAGTSYGALSNSKLTFSQNI 529
Query: 539 ALAPGKNTFDLLSLTVGLQNYGAFYEKTGAGITGPVQLKG 578
L+ G N LLS VGL N G YE GI GPV LKG
Sbjct: 530 KLSVGINKLALLSTAVGLPNAGVHYETWNTGILGPVTLKG 569
>gi|255558624|ref|XP_002520337.1| beta-galactosidase, putative [Ricinus communis]
gi|223540556|gb|EEF42123.1| beta-galactosidase, putative [Ricinus communis]
Length = 771
Score = 703 bits (1814), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 387/850 (45%), Positives = 506/850 (59%), Gaps = 103/850 (12%)
Query: 7 LLLVLCWGFVVLATTSF---GANVTYDHRAVVIGGKRRVLISGSIHYPRSTPEMWPDLIQ 63
+L L + F+ +A + NVTYD R+++I G+ R+L SGSIHYPRSTPE
Sbjct: 17 MLFWLGFAFLSMAIITVQGKAGNVTYDGRSLIINGEHRILFSGSIHYPRSTPE------- 69
Query: 64 KSKDGGLDVIETYVFWNLHEPVRNQYNFEGRYDLVKFVKLVAEAGLYAHLRIGPYVCAEW 123
Y+F+GR DLVKF+ V GLYA LRIGP++ EW
Sbjct: 70 -------------------------YDFDGRKDLVKFLLEVQAQGLYAALRIGPFIEGEW 104
Query: 124 NFGGFPLWLHFIPGIQFRTDNEPFKAEMQRFTAKIVDMMKQEKLYASQGGPIILSQIENE 183
+GG P WLH + GI FR+DNEPFK MQRF KIV+MMK +LYASQGGPII+SQIENE
Sbjct: 105 TYGGLPFWLHDVSGIVFRSDNEPFKKHMQRFVTKIVNMMKYNQLYASQGGPIIISQIENE 164
Query: 184 YGNIDSAYGAAGKSYIKWAAGMALSLDTGVPWVMCQQSDAPDPIINTCNGFYC-DQFT-P 241
Y N+++A+ G Y+ WAA MA+ L+TGVPWVMC+Q+DAPDP+INTCNG C + F P
Sbjct: 165 YQNVETAFHEKGSRYVHWAANMAVRLNTGVPWVMCKQTDAPDPVINTCNGMRCGETFAGP 224
Query: 242 NSNNKPKMWTENWSGWFLSFGGAVPYRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNFDR 301
NS NKP MWTENW+ ++ FGG R ED+AF VA F R G++ NYYMYHGGTNF R
Sbjct: 225 NSPNKPSMWTENWTSFYQVFGGEPYIRTAEDIAFHVALFIARNGSYVNYYMYHGGTNFGR 284
Query: 302 TSGGPFISTSYDYDAPLDEYGLIRQPKWGHLKDLHKAIKLCEAALV-ATDPTYPSLGPNL 360
T G F++TSY APLDEYGLIRQPKWGHLKDLH IK C L+ T T+P LG
Sbjct: 285 T-GSAFVTTSYYDQAPLDEYGLIRQPKWGHLKDLHAKIKSCSKTLIRGTHQTFP-LGRLQ 342
Query: 361 EATVYKTGSGLCSAFLANIGTNSDVTVKFNGNSYLLPAWSVSILPDCKNVVFNTAKINSV 420
EA V++ SG C AFL N DVTV+F SY LP S+SILPDCK++ FNTAK+N+
Sbjct: 343 EAYVFREKSGDCVAFLVNNDGRRDVTVRFQNRSYELPHKSISILPDCKSITFNTAKVNT- 401
Query: 421 TLVPSFSRQSLQVAADSSDAIGSGWSYINEPVGISKDDAFTKPGLLEQINTTADQSDYLW 480
++ +S ++ + S ++G W E V + LL+ ++TT D SDYLW
Sbjct: 402 ----QYATRSATLSQEFS-SVGK-WEEYKETVATFDSTSLRAKTLLDHLSTTKDTSDYLW 455
Query: 481 YSLSTNIKADEPLLEDGSKTVLHVQSLGHALHAFINGKLVGSGYGSSSNAKVTVDFPIAL 540
Y+ P ++ L S GH LHA++NG GS +GS + T++ + L
Sbjct: 456 YTFRFQNHFSRP------QSTLRAYSRGHVLHAYVNGVYAGSAHGSHESTSFTLENSVRL 509
Query: 541 APGKNTFDLLSLTVGLQNYGAFYEKTGAGITGPVQLKGSGNGTNIDLSSQQWTYQTGLKG 600
G N LLS+TVGL + GA+ E+ AG+ V+++ N D ++ W YQ GL G
Sbjct: 510 KNGTNNVALLSVTVGLPDSGAYLERRVAGLH-RVRIQ------NKDFTTYSWGYQVGLLG 562
Query: 601 EELNFPSGSSTQWDSKSTLP-KLQPLVWYKTTFDAPAGSEPVAIDFTGMGKGEAWVNGQS 659
E+L + + S + QPL WYKT FDAPAGS+P+A++ MGKGEAWVNGQS
Sbjct: 563 EKLQIYTDNGLNKVSWNEFRGTTQPLTWYKTQFDAPAGSDPIALNLHSMGKGEAWVNGQS 622
Query: 660 IGRYWPTYVSQNGGCTDSCNYRGAYSSNKCLKNCGKPSQSLYHVPRSWLKSSGNTLVLFE 719
IGRYW ++S++K G PSQ+ YH+P+S++K +GN LVL E
Sbjct: 623 IGRYWV-----------------SFSTSK-----GNPSQTRYHIPQSFVKPTGNLLVLLE 660
Query: 720 EIGGDPTKISFVTKQLGSSLCSHVTDSHPLPVDMWGSDSKIQRKPGPVLSLECPNPNQVI 779
E G P I+ + + S +C HV++SH V+ L CP PN+ I
Sbjct: 661 EEKGYPPGITVDSISI-SKVCGHVSESHK-----------------SVVQLSCP-PNRNI 701
Query: 780 SSIKFASFGTPLGTCGSFSRGRCSSARSLSVVRQACVGSKSCSIGVSVNTF-GDPCKGVM 838
S I F+SFGTP G C ++ G+C S+ S ++V +AC+G C I S F GDPC G+
Sbjct: 702 SRILFSSFGTPEGNCNQYAIGKCHSSNSRAIVEKACIGKTKCIILRSNRFFGGDPCPGIR 761
Query: 839 KSLAVEASCT 848
K L V+A CT
Sbjct: 762 KGLLVDAKCT 771
>gi|357133576|ref|XP_003568400.1| PREDICTED: beta-galactosidase 7-like [Brachypodium distachyon]
Length = 821
Score = 703 bits (1814), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 379/831 (45%), Positives = 516/831 (62%), Gaps = 59/831 (7%)
Query: 27 VTYDHRAVVIGGKRRVLISGSIHYPRSTPEMWPDLIQKSKDGGLDVIETYVFWNLHEPVR 86
VTYD RA+++ G RR+L SG +HY RSTPEMWP +I K++ GG+DVI+TYVFWN+HEPV+
Sbjct: 39 VTYDGRALLLNGTRRMLFSGEMHYTRSTPEMWPKIIAKARKGGIDVIQTYVFWNVHEPVQ 98
Query: 87 NQYNFEGRYDLVKFVKLVAEAGLYAHLRIGPYVCAEWNFGGFPLWLHFIPGIQFRTDNEP 146
+YNFEGRY++VKF++ + GLY LRIGP++ AEW +GGFP WLH +P I FRTDNEP
Sbjct: 99 GKYNFEGRYNIVKFIREIQAQGLYVSLRIGPFIEAEWKYGGFPFWLHEVPNITFRTDNEP 158
Query: 147 FKAEMQRFTAKIVDMMKQEKLYASQGGPIILSQIENEYGNIDSAYGAAGKSYIKWAAGMA 206
FK MQ F +V+MMK E LY QGGPII+SQIENEY ++ A+G G Y++WAA +A
Sbjct: 159 FKQHMQGFVTHMVNMMKNEGLYYPQGGPIIISQIENEYQMVEPAFGPGGPRYVQWAASLA 218
Query: 207 LSLDTGVPWVMCQQSDAPDPIINTCNGFYCDQ--FTPNSNNKPKMWTENWSGWFLSFGGA 264
+ L TGVPW+MC+Q+DAPDPIINTCNG C + PNS NKP +WTENW+ + +G
Sbjct: 219 VGLQTGVPWMMCKQNDAPDPIINTCNGLICGETFVGPNSPNKPALWTENWTTRYPIYGND 278
Query: 265 VPYRPVEDLAFAVARFFQR-GGTFQNYYMYHGGTNFDRTSGGPFISTSYDYDAPLDEYGL 323
R D+ FAVA F R GG+F +YYMYHGGTNF R + +++TSY APLDEYGL
Sbjct: 279 TKLRSTGDITFAVALFIARKGGSFVSYYMYHGGTNFGRFASS-YVTTSYYDGAPLDEYGL 337
Query: 324 IRQPKWGHLKDLHKAIKLCEAALVATDPTYPSLGPNLEATVYKTGSGLCSAFLANIGTNS 383
I QP WGHLK+LH A+KL L+ + SLG + EA V++T C AFL N +
Sbjct: 338 IWQPTWGHLKELHAAVKLSSEPLLYGTYSNFSLGEDQEAHVFETKLK-CVAFLVNFDKHQ 396
Query: 384 DVTVKFNGNSYLLPAWSVSILPDCKNVVFNTAKINSVTLVPSFSRQSLQVAADSSDAIGS 443
TV F S L S+SIL DC+ VVF T K+N+ ++ +V +D
Sbjct: 397 RPTVIFRNISLQLAPKSISILSDCRTVVFETGKVNA-----QHGSRTAEVVQSLNDT--H 449
Query: 444 GWSYINE--PVGISKDDAFTKPGLLEQINTTADQSDYLWYSLSTNIKADEPLLEDGSKTV 501
W E P ISK A+T L E ++TT D++DYLWY S + +D +
Sbjct: 450 TWKAFKESIPQDISK-AAYTGKQLFEHLSTTKDETDYLWYIASYEYRPS----DDSHLVL 504
Query: 502 LHVQSLGHALHAFINGKLVGSGYGSS-SNAKVTVDFPIALAPGKNTFDLLSLTVGLQNYG 560
L+V+S H LHAF+NG+ VGS +GS + + ++ I+L G+NT LL++ VG + G
Sbjct: 505 LNVESQAHILHAFVNGEFVGSVHGSHGARGYIILNMTISLKEGQNTISLLNVMVGSPDSG 564
Query: 561 AFYEKTGAGITGPVQLKGSGNGTNIDLSSQQWTYQTGLKGE--ELNFPSGS-STQWDSKS 617
A E+ GI V ++ + ++ L+++ W YQ GL GE + GS S +W +
Sbjct: 565 AHMERRSFGIH-KVSIQQGQHALHL-LNNELWGYQVGLFGEGNRIYTQEGSHSVEWTDVN 622
Query: 618 TLPKLQPLVWYKTTFDAPAGSEPVAIDFTGMGKGEAWVNGQSIGRYWPTYVSQNGGCTDS 677
L L PL WY+TTF P G++ V ++ T MGKGE W+NG+SIGRYW ++ + +
Sbjct: 623 NLTYL-PLTWYQTTFATPMGNDAVTLNLTSMGKGEVWINGESIGRYWVSFKTPS------ 675
Query: 678 CNYRGAYSSNKCLKNCGKPSQSLYHVPRSWLKSSGNTLVLFEEIGGDPTKISFVTKQLGS 737
G+PSQSLYH+P+ +LK++ N LVL EE+GG+P +I+ T + +
Sbjct: 676 ----------------GQPSQSLYHIPQHFLKNTDNLLVLVEEMGGNPLQITVNTVSI-T 718
Query: 738 SLCSHVTDSHPLPVDMWGSDSKIQRKPGPVLSLECPNPNQVISSIKFASFGTPLGTCGSF 797
++CS V + PV G D P + L C + IS+++FAS+G P G C +F
Sbjct: 719 TVCSSVNELSAPPVQSQGKD--------PEVRLRC-QKGKHISAVEFASYGNPAGDCRTF 769
Query: 798 SRGRCSSARSLSVVRQACVGSKSCSIGVSVNTF-GDPCKGVMKSLAVEASC 847
+ G C + S SVV+QAC+G +SCSI V +F GDPC G+ KSL V A C
Sbjct: 770 TIGSCHAESSESVVKQACIGKRSCSIPVGPGSFGGDPCPGIQKSLLVVAHC 820
>gi|357142200|ref|XP_003572492.1| PREDICTED: beta-galactosidase 11-like [Brachypodium distachyon]
Length = 823
Score = 702 bits (1813), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 378/852 (44%), Positives = 516/852 (60%), Gaps = 69/852 (8%)
Query: 21 TSFGANVTYDHRAVVIGGKRRVLISGSIHYPRSTPEMWPDLIQKSKDGGLDVIETYVFWN 80
T G +T+D R++++ G+R + SGSIHYPRS P MWPDLI ++K+GGL+VIE+YVFWN
Sbjct: 9 TKEGTAITFDRRSLMVDGRRDLFFSGSIHYPRSPPHMWPDLIARAKEGGLNVIESYVFWN 68
Query: 81 LHEPVRNQYNFEGRYDLVKFVKLVAEAGLYAHLRIGPYVCAEWNFGGFPLWLHFIPGIQF 140
HEP YNFEGRYD++KF KLV E ++A +RIGP+V AEWN GG P WL +P I F
Sbjct: 69 GHEPEMGVYNFEGRYDMIKFFKLVQEHEMFAMVRIGPFVQAEWNHGGLPYWLREVPDIIF 128
Query: 141 RTDNEPFKAEMQRFTAKIVDMMKQEKLYASQGGPIILSQIENEYGNIDSAYGAAGKSYIK 200
RT+NEPFK MQ+F IV+ +K KL+ASQGGPIIL+QIENEY ++++A+ G +YI
Sbjct: 129 RTNNEPFKKHMQKFVTMIVNKLKDAKLFASQGGPIILAQIENEYQHLEAAFKENGTTYIH 188
Query: 201 WAAGMALSLDTGVPWVMCQQSDAPDPIINTCNGFYCDQF--TPNSNNKPKMWTENWSGWF 258
WAA MA L+ GVPW+MC+Q+ AP +I TCNG +C P NKP +WTENW+ +
Sbjct: 189 WAAKMASDLNIGVPWIMCKQTKAPGEVIPTCNGRHCGDTWPGPTDKNKPLLWTENWTAQY 248
Query: 259 LSFGGAVPYRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNFDRTSGGPFISTSYDYDAPL 318
FG R ED+AFAVARF+ GGT NYYMYHGGTNF RT G F+ Y +APL
Sbjct: 249 RVFGDPPSQRSAEDIAFAVARFYSVGGTMVNYYMYHGGTNFGRT-GASFVMPRYYDEAPL 307
Query: 319 DEYGLIRQPKWGHLKDLHKAIKLCEAALVATDPTYPSLGPNLEATVYKT-GSGLCSAFLA 377
DE+GL ++PKWGHL+DLH A++LC+ A++ +P+ LG EA +++ +C AFL+
Sbjct: 308 DEFGLYKEPKWGHLRDLHHALRLCKKAILWGNPSNQPLGKLYEARLFEIPEQKICVAFLS 367
Query: 378 NIGTNSDVTVKFNGNSYLLPAWSVSILPDCKNVVFNTAKINSV--TLVPSFSRQSLQVAA 435
N T D TV F G Y +P SVSIL DCK VVF+T +NS FS Q++Q
Sbjct: 368 NHNTKEDGTVTFRGQQYFVPRRSVSILADCKTVVFSTQHVNSQHNQRTFHFSDQTVQ--- 424
Query: 436 DSSDAIGSGWSYINE----PVGISKDDAFTKPGLLEQINTTADQSDYLWYSLSTNIKADE 491
G+ W E P + KP LE N T D++DY+WY+ S ++A++
Sbjct: 425 ------GNVWEMYTESDKVPTYKFTNIRTQKP--LEAYNLTKDKTDYVWYTTSFKLEAED 476
Query: 492 PLLEDGSKTVLHVQSLGHALHAFINGKLVGSGYGSSSNAKVTVDFPIALAPGKNTFDLLS 551
VL V S GHA+ AF+NGK VG+G+G+ N T++ PI + G N +LS
Sbjct: 477 LPFRKDIWPVLEVSSHGHAMVAFVNGKYVGAGHGTKINKAFTMEKPIEVRTGINHVSILS 536
Query: 552 LTVGLQNYGAFYEKTGAGITGPVQLKGSGNGTNIDLSSQQWTYQTGLKGEELNFPS---G 608
T+G+Q+ G + E AGI G V ++G GT +DL+S W + GL+GE N + G
Sbjct: 537 TTLGMQDSGVYLEHRQAGIDG-VTIQGLNTGT-LDLTSNGWGHLVGLEGERRNAHTEKGG 594
Query: 609 SSTQWDSKSTLPKL--QPLVWYKTTFDAPAGSEPVAIDFTGMGKGEAWVNGQSIGRYWPT 666
QW +P + +PL WY+ FD P G +PV ID + MGKG +VNG+ +GRYW +
Sbjct: 595 DGVQW-----VPAVFDRPLTWYRRRFDIPTGDDPVVIDMSPMGKGVLYVNGEGLGRYWSS 649
Query: 667 YVSQNGGCTDSCNYRGAYSSNKCLKNCGKPSQSLYHVPRSWLKSSGNTLVLFEEI-GGDP 725
Y G+PSQ LYHVPR +LK +GN + +FEE GG P
Sbjct: 650 YKHA----------------------LGRPSQYLYHVPRCFLKPTGNVMTIFEEEGGGQP 687
Query: 726 TKISFVTKQLGSSLCSHVTDSHPLPVDMW-GSDSKIQR------KPGPVLSLECPNPNQV 778
I +T + ++CS +++ +P V W DS ++ KP VLS CP ++
Sbjct: 688 DGIMILTVKR-DNICSFISEKNPAHVKSWERKDSHLKSVADADLKPQAVLS--CPE-KKL 743
Query: 779 ISSIKFASFGTPLGTCGSFSRGRCSSARSLSVVRQACVGSKSCSIGVSVNTFGDP--CKG 836
I + FAS+G PLG CG+++ G C + ++ +V +ACVG KSC + VS +G C G
Sbjct: 744 IQQVVFASYGNPLGICGNYTVGNCHAPKAKEIVEKACVGKKSCVLQVSHEVYGADLNCPG 803
Query: 837 VMKSLAVEASCT 848
+LAV+A C+
Sbjct: 804 STGTLAVQAKCS 815
>gi|183604889|gb|ACC64531.1| beta-galactosidase 6 [Oryza sativa Indica Group]
Length = 811
Score = 702 bits (1812), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 389/844 (46%), Positives = 518/844 (61%), Gaps = 75/844 (8%)
Query: 22 SFGANVTYDHRAVVIGGKRRVLISGSIHYPRSTPEMWPDLIQKSKDGGLDVIETYVFWNL 81
G +TYD RA+V+ G RR+ SG +HY RSTPEMWP LI K+K+GGLDVI+TYVFWN+
Sbjct: 24 ELGREITYDGRALVVSGARRMFFSGDMHYARSTPEMWPKLIAKAKNGGLDVIQTYVFWNV 83
Query: 82 HEPVRNQYNFEGRYDLVKFVKLVAEAGLYAHLRIGPYVCAEWNFGGFPLWLHFIPGIQFR 141
HEP++ QYNFEGRYDLVKF++ + GLY LRIGP+V AEW +GGFP WLH +P I FR
Sbjct: 84 HEPIQGQYNFEGRYDLVKFIREIQAQGLYVSLRIGPFVEAEWKYGGFPFWLHDVPSITFR 143
Query: 142 TDNEPFKAEMQRFTAKIVDMMKQEKLYASQGGPIILSQIENEYGNIDSAYGAAGKSYIKW 201
+DNEPFK MQ F KIV MMK E LY QGGPII+SQIENEY I+ A+GA+G Y++W
Sbjct: 144 SDNEPFKQHMQNFVTKIVTMMKHEGLYYPQGGPIIISQIENEYQMIEPAFGASGPRYVRW 203
Query: 202 AAGMALSLDTGVPWVMCQQSDAPDPIINTCNGFYCDQ--FTPNSNNKPKMWTENWSGWFL 259
AA MA+ L TGVPW+MC+Q+DAPDP+INTCNG C + PNS NKP +WTENW+ +
Sbjct: 204 AAAMAVGLQTGVPWMMCKQNDAPDPVINTCNGLICGETFVGPNSPNKPALWTENWTSRYP 263
Query: 260 SFGGAVPYRPVEDLAFAVARFFQR-GGTFQNYYMYHGGTNFDRTSGGPFISTSYDYDAPL 318
+G R ED+AFAVA + R G+F +YYMYHGGTNF R + +++TSY APL
Sbjct: 264 IYGNDTKLRDPEDIAFAVALYIARKKGSFVSYYMYHGGTNFGRFAAS-YVTTSYYDGAPL 322
Query: 319 DEYGLIRQPKWGHLKDLHKAIKLCEAALVATDPTYPSLGPNLEATVYKTGSGLCSAFLAN 378
DEYGLI QP WGHL++LH A+K L+ + SLG EA V++T C AFL N
Sbjct: 323 DEYGLIWQPTWGHLRELHCAVKQSSEPLLFGSYSNFSLGQQQEAHVFETDFK-CVAFLVN 381
Query: 379 IGTNSDVTVKFNGNSYLLPAWSVSILPDCKNVVFNTAKINSVTLVPSFSRQSLQVAADSS 438
++ V+F S L S+S+L DC+NVVF TAK+N+ Q + ++
Sbjct: 382 FDQHNTPKVEFRNISLELAPKSISVLSDCRNVVFETAKVNA------------QHGSRTA 429
Query: 439 DAIGS-----GWSYINEPV--GISKDDAFTKPGLLEQINTTADQSDYLWYSLSTNIKADE 491
+A+ S W EPV +SK +T L EQ+ TT D++DYLWY +S +A
Sbjct: 430 NAVQSLNDINNWKAFIEPVPQDLSK-STYTGNQLFEQLPTTKDETDYLWYIVSYKNRAS- 487
Query: 492 PLLEDGSKTV-LHVQSLGHALHAFINGKLVGSGYGSSSNAK-VTVDFPIALAPGKNTFDL 549
DG++ L+V+SL H LHAF+N + VGS +GS + + ++ ++L G NT L
Sbjct: 488 ----DGNQIARLYVKSLAHILHAFVNNEYVGSVHGSHDGPRNIVLNTHMSLKEGDNTISL 543
Query: 550 LSLTVGLQNYGAFYEKTGAGI--TGPVQLKGSGNGTNIDLSSQQWTYQTGLKGEELNFPS 607
LS+ VG + GA+ E+ GI G Q + + N DL W YQ GL GE+ + +
Sbjct: 544 LSVMVGSPDSGAYMERRTFGIQTVGIQQGQQPMHLLNNDL----WGYQVGLFGEKDSIYT 599
Query: 608 G---SSTQWDSKSTLPKLQPLVWYKTTFDAPAGSEPVAIDFTGMGKGEAWVNGQSIGRYW 664
+S +W + L PL WYKTTF P G++ V ++ T MGKGE WVNG+SIGRYW
Sbjct: 600 QEGPNSVRWMDINNL-IYHPLTWYKTTFSTPPGNDAVTLNLTSMGKGEVWVNGESIGRYW 658
Query: 665 PTYVSQNGGCTDSCNYRGAYSSNKCLKNCGKPSQSLYHVPRSWLKSSGNTLVLFEEIGGD 724
++ + + G+PSQSLYH+PR +L N LVL EE+GGD
Sbjct: 659 VSFKAPS----------------------GQPSQSLYHIPRGFLTPKDNLLVLVEEMGGD 696
Query: 725 PTKISFVTKQLGSSLCSHVTDSHPLPVDMWGSDSKIQRKPGPVLSLECPNPNQVISSIKF 784
P +I+ T + +++C +V + P+ G K++ + C + ISSI+F
Sbjct: 697 PLQITVNTMSV-TTVCGNVDEFSVPPLQSRGKVPKVR--------IWCQG-GKRISSIEF 746
Query: 785 ASFGTPLGTCGSFSRGRCSSARSLSVVRQACVGSKSCSIGVSVNTF-GDPCKGVMKSLAV 843
AS+G P+G C SF G C + S SVV+Q+C+G + CSI V F GDPC G+ KSL V
Sbjct: 747 ASYGNPVGDCRSFRIGSCHAESSESVVKQSCIGRRGCSIPVMAAKFGGDPCPGIQKSLLV 806
Query: 844 EASC 847
A C
Sbjct: 807 VADC 810
>gi|357473809|ref|XP_003607189.1| Beta-galactosidase [Medicago truncatula]
gi|355508244|gb|AES89386.1| Beta-galactosidase [Medicago truncatula]
Length = 825
Score = 702 bits (1811), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 369/844 (43%), Positives = 515/844 (61%), Gaps = 46/844 (5%)
Query: 15 FVVLATTSFGANVTYDHRAVVIGGKRRVLISGSIHYPRSTPEMWPDLIQKSKDGGLDVIE 74
++ + +TYD R++++ GK + SGSIHYPRSTP+MWPD++ K++ GGL++I+
Sbjct: 16 ITIVCAQNAAQTITYDGRSLLLDGKGELFFSGSIHYPRSTPDMWPDILDKARRGGLNLIQ 75
Query: 75 TYVFWNLHEPVRNQYNFEGRYDLVKFVKLVAEAGLYAHLRIGPYVCAEWNFGGFPLWLHF 134
TYVFWN HEP +++ NFEGRYDLVKF+KLV E G+Y LRIGP++ AEWN GG P WL
Sbjct: 76 TYVFWNGHEPEKDKVNFEGRYDLVKFLKLVQEKGMYVTLRIGPFIQAEWNHGGLPYWLRE 135
Query: 135 IPGIQFRTDNEPFKAEMQRFTAKIVDMMKQEKLYASQGGPIILSQIENEYGNIDSAYGAA 194
+P I FR++NEPFK M+ + + +++ MK+EKL+A QGGPIIL+QIENEY +I AY A
Sbjct: 136 VPDIIFRSNNEPFKKYMKEYVSIVINRMKEEKLFAPQGGPIILAQIENEYNHIQLAYEAD 195
Query: 195 GKSYIKWAAGMALSLDTGVPWVMCQQSDAPDPIINTCNGFYC-DQFT-PNSNNKPKMWTE 252
G +Y++WAA MA+SL GVPWVMC+Q DAPDP+IN CNG +C D FT PN KP +WTE
Sbjct: 196 GDNYVQWAAKMAVSLYNGVPWVMCKQKDAPDPVINACNGRHCGDTFTGPNKPYKPFIWTE 255
Query: 253 NWSGWFLSFGGAVPYRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNFDRTSGGPFISTSY 312
NW+ + FG R ED+AF+VARFF + G+ NYYMYHGGTNF RT+ F +T Y
Sbjct: 256 NWTAQYRVFGDPPSQRSAEDIAFSVARFFSKHGSLVNYYMYHGGTNFGRTTSA-FTTTRY 314
Query: 313 DYDAPLDEYGLIRQPKWGHLKDLHKAIKLCEAALVATDPTYPSLGPNLEATVY-KTGSGL 371
+APLDE+GL R+PKW HL+D HKA+ LC+ +L+ PT + E VY K S L
Sbjct: 315 YDEAPLDEFGLQREPKWSHLRDAHKAVNLCKKSLLNGVPTTQKISQYHEVIVYEKKESNL 374
Query: 372 CSAFLANIGTNSDVTVKFNGNSYLLPAWSVSILPDCKNVVFNTAKINSVTLVPSFSRQSL 431
C+AF+ N T + T+ F G+ Y LP S+SILPDCK VVFNT I S S +
Sbjct: 375 CAAFITNNHTQTAKTLSFRGSDYFLPPRSISILPDCKTVVFNTQNIAS-----QHSSRHF 429
Query: 432 QVAADSSDAIGSGWSYINEPVGISKDDAFTKPGLLEQINTTADQSDYLWYSLSTNIKADE 491
+ + +D W +EP+ +K+ + E + D++DY WY+ S + ++
Sbjct: 430 EKSKTGNDF---KWEVFSEPIPSAKELPSKQKLPAELYSLLKDKTDYGWYTTSVELGPED 486
Query: 492 PLLEDGSKTVLHVQSLGHALHAFINGKLVGSGYGSSSNAKVTVDFPIALAPGKNTFDLLS 551
+ VL + SLGH+L AF+NG+ +GS +GS P+ G N +L+
Sbjct: 487 IPKKSDVAPVLRILSLGHSLQAFVNGEYIGSKHGSHEEKGFEFQKPVNFKVGVNQIAILA 546
Query: 552 LTVGLQNYGAFYEKTGAGITGPVQLKGSGNGTNIDLSSQQWTYQTGLKGEELN-FPSGSS 610
VGL + GA+ E AG + + G +GT IDL+S W +Q GL+GE + F S
Sbjct: 547 NLVGLPDSGAYMEHRYAG-PKTITILGLMSGT-IDLTSNGWGHQVGLQGENDSIFTEKGS 604
Query: 611 TQWDSKSTLPKLQPLVWYKTTFDAPAGSEPVAIDFTGMGKGEAWVNGQSIGRYWPTYVSQ 670
+ + K K + WYKT FD P G+ PVAI GM KG WVNG+SIGR+W +Y+S
Sbjct: 605 KKVEWKDGKGKGSTISWYKTNFDTPEGTNPVAIGMEGMAKGMIWVNGESIGRHWMSYLSP 664
Query: 671 NGGCTDSCNYRGAYSSNKCLKNCGKPSQSLYHVPRSWLKSSGNTLVLFEEIGGDPTKISF 730
GKP+QS YH+PRS+LK N LV+FEE P KI+
Sbjct: 665 ----------------------LGKPTQSEYHIPRSFLKPKDNLLVIFEEEAISPDKIAI 702
Query: 731 VTKQLGSSLCSHVTDSHPLPVDMWGSDSKIQRKPGPVLSLE----CPNPNQVISSIKFAS 786
+T ++CS +T++HP + + S ++ + G L+ E CP+ + I++++FAS
Sbjct: 703 LTVNR-DTICSFITENHPPNIRSFASKNQKLERVGENLTPEAFITCPDQKK-ITAVEFAS 760
Query: 787 FGTPLGTCGSFSRGRCSSARSLSVVRQACVGSKSCSIGVSVNTF---GDPCKGVMKSLAV 843
FG P G CGSF G+C++ S +V Q C+G +CS+ + TF D C V+K+LA+
Sbjct: 761 FGDPSGFCGSFIMGKCNAPSSKKIVEQLCLGKPTCSVPMVKATFTGGNDGCPDVVKTLAI 820
Query: 844 EASC 847
+ C
Sbjct: 821 QVKC 824
>gi|125536446|gb|EAY82934.1| hypothetical protein OsI_38151 [Oryza sativa Indica Group]
Length = 705
Score = 701 bits (1808), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 362/640 (56%), Positives = 428/640 (66%), Gaps = 34/640 (5%)
Query: 26 NVTYDHRAVVIGGKRRVLISGSIHYPRSTPEMWPDLIQKSKDGGLDVIETYVFWNLHEPV 85
NVTYDHRAV+IGGKRR+L+S +HYPR+TPEMWP LI K K+GG DVIETYVFWN HEP
Sbjct: 63 NVTYDHRAVLIGGKRRMLVSAGLHYPRATPEMWPSLIAKFKEGGADVIETYVFWNGHEPA 122
Query: 86 RNQYNFEGRYDLVKFVKLVAEAGLYAHLRIGPYVCAEWNFGGFPLWLHFIPGIQFRTDNE 145
+ QY FE R+DLVKF KLVA GL+ LRIGPY CAEWNFGGFP+WL IPGI+FRTDNE
Sbjct: 123 KGQYYFEERFDLVKFAKLVAAEGLFLFLRIGPYACAEWNFGGFPVWLRDIPGIEFRTDNE 182
Query: 146 PFKAEMQRFTAKIVDMMKQEKLYASQGGPIILSQIENEYGNIDSAYGAAGKSYIKWAAGM 205
PFKAEMQ F KIV +MK+EKLY+ QGGPIIL QIENEYGNI YG AGK Y++WAA M
Sbjct: 183 PFKAEMQTFVTKIVTLMKEEKLYSWQGGPIILQQIENEYGNIQGNYGQAGKRYMQWAAQM 242
Query: 206 ALSLDTGVPWVMCQQSDAPDPIINTCNGFYCDQFTPNSNNKPKMWTENWSGWFLSFGGAV 265
A+ LDTG+PWVMC+Q+DAP+ II+TCN FYCD F PNS NKP +WTE+W GW+ +GGA+
Sbjct: 243 AIGLDTGIPWVMCRQTDAPEEIIDTCNAFYCDGFKPNSYNKPTIWTEDWDGWYADWGGAL 302
Query: 266 PYRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNFDRTSGGPFISTSYDYDAPLDEYGLIR 325
P+RP ED AFAVARF+QRGG+ QNYYMY GGTNF RT+GGP TSYDYDAP+DEYG++R
Sbjct: 303 PHRPAEDSAFAVARFYQRGGSLQNYYMYFGGTNFARTAGGPLQITSYDYDAPIDEYGILR 362
Query: 326 QPKWGHLKDLHKAIKLCEAALVAT--DPTYPSLGPNLEATVYKTG-----------SGLC 372
QPKWGHLKDLH AIKLCE AL+A P Y LG EA VY TG + +C
Sbjct: 363 QPKWGHLKDLHTAIKLCEPALIAVVGSPQYIKLGSMQEAHVYSTGEVHTNGSMAGNAQIC 422
Query: 373 SAFLANIGTNSDVTVKFNGNSYLLPAWSVSILPDCKNVVFNTAKINSVTLV-------PS 425
SAFLANI + +V G SY LP WSVSILPDC+NV FNTA+I + T V PS
Sbjct: 423 SAFLANIDEHKYASVWIFGKSYSLPPWSVSILPDCENVAFNTARIGAQTSVFTVESGSPS 482
Query: 426 FS---RQSLQVAADSSDAIGSGWSYINEPVGISKDDAFTKPGLLEQINTTADQSDYLWYS 482
S + S+ + S W E +G + F G+LE +N T D SDYLWY+
Sbjct: 483 RSSRHKPSILSLTSGGPYLSSTWWTSKETIGTWGGNNFAVQGILEHLNVTKDISDYLWYT 542
Query: 483 LSTNIKADEPLLEDGSKTV---LHVQSLGHALHAFINGKLVGSGYGSSSNAKVTVDFPIA 539
NI +D + SK V L + + F+NGKL GS G V++ PI
Sbjct: 543 TRVNI-SDADVAFWSSKGVLPSLTIDKIRDVARVFVNGKLAGSQVGHW----VSLKQPIQ 597
Query: 540 LAPGKNTFDLLSLTVGLQNYGAFYEKTGAGITGPVQLKGSGNGTNIDLSSQQWTYQTGLK 599
L G N LLS VGLQNYGAF EK GAG G V L G +G ++DL++ WTYQ GLK
Sbjct: 598 LVEGLNELTLLSEIVGLQNYGAFLEKDGAGFRGQVTLTGLSDG-DVDLTNSLWTYQVGLK 656
Query: 600 GE--ELNFPSGSSTQWDSKSTLPKLQPLVWYKTTFDAPAG 637
GE + P S+ +QP WYK + G
Sbjct: 657 GEFSMIYAPEKQGCAGWSRMQKDSVQPFTWYKNICNQSVG 696
>gi|357464799|ref|XP_003602681.1| Beta-galactosidase [Medicago truncatula]
gi|355491729|gb|AES72932.1| Beta-galactosidase [Medicago truncatula]
Length = 628
Score = 700 bits (1807), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 355/625 (56%), Positives = 440/625 (70%), Gaps = 7/625 (1%)
Query: 6 ILLLVLCWGFVVLATTSFGANVTYDHRAVVIGGKRRVLISGSIHYPRSTPEMWPDLIQKS 65
IL LV L G+NV+YD R+++I G+R++LIS SIHYPRS P MWP LIQ +
Sbjct: 6 ILCLVSTSLTFTLVYGGVGSNVSYDGRSLIIDGQRKLLISASIHYPRSVPAMWPALIQTA 65
Query: 66 KDGGLDVIETYVFWNLHEPVRNQYNFEGRYDLVKFVKLVAEAGLYAHLRIGPYVCAEWNF 125
K+GG+DVIETYVFWN HE Y F GR+DLV+F K+V +AG+Y LRIGP+V AEWNF
Sbjct: 66 KEGGIDVIETYVFWNGHELSPGNYYFGGRFDLVQFAKVVQDAGMYLILRIGPFVAAEWNF 125
Query: 126 GGFPLWLHFIPGIQFRTDNEPFKAEMQRFTAKIVDMMKQEKLYASQGGPIILSQIENEYG 185
GG P+WLH+IPG FRT N+PF M++FT IV++MK+EKL+ASQGGPIILSQIENEYG
Sbjct: 126 GGVPVWLHYIPGTVFRTYNQPFMHHMEKFTTYIVNLMKKEKLFASQGGPIILSQIENEYG 185
Query: 186 NIDSAYGAAGKSYIKWAAGMALSLDTGVPWVMCQQSDAPDPIINTCNGFYCDQFTPNSNN 245
++ Y GK Y WAA MA+S +T VPW+MCQQ DAPDP+I+TCN FYCDQFTP S
Sbjct: 186 YYENYYKEDGKKYALWAAKMAVSQNTSVPWIMCQQWDAPDPVIDTCNSFYCDQFTPTSPK 245
Query: 246 KPKMWTENWSGWFLSFGGAVPYRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNFDRTSGG 305
+PKMWTENW GWF +FGG P+RPVED+AF+VARFFQ+GG+ NYYMYHGGTNF RT+GG
Sbjct: 246 RPKMWTENWPGWFKTFGGRDPHRPVEDVAFSVARFFQKGGSLNNYYMYHGGTNFGRTAGG 305
Query: 306 PFISTSYDYDAPLDEYGLIRQPKWGHLKDLHKAIKLCEAALVATDPTYPSLGPNLEATVY 365
PFI+TSYDYDAP+DEYGL R PKWGHLK+LHKAIKLCE L+ SLGP++EA +Y
Sbjct: 306 PFITTSYDYDAPIDEYGLPRLPKWGHLKELHKAIKLCEHVLLYGKSVNISLGPSVEADIY 365
Query: 366 KTGSGLCSAFLANIGTNSDVTVKFNGNSYLLPAWSVSILPDCKNVVFNTAKINSVTLVPS 425
SG C+AF++N+ +D V F SY LPAWSVSILPDCKNVVFNTAK++S T + +
Sbjct: 366 TDSSGACAAFISNVDDKNDKKVVFRNASYHLPAWSVSILPDCKNVVFNTAKVSSPTNIVA 425
Query: 426 FSRQSLQVAADSSDAIGSGWSYINEPVGISKDDAFTKPGLLEQINTTADQSDYLWYSLST 485
+ LQ + + W E GI F K G ++ INTT D +DYLW++ S
Sbjct: 426 MIPEHLQQSDKGQKTL--KWDVFKENPGIWGKADFVKNGFVDHINTTKDTTDYLWHTTSI 483
Query: 486 NIKADEPLLEDGSKTVLHVQSLGHALHAFINGKLVGSGYGSSSNAKVTVDFPIALAPGKN 545
I A+E L+ GSK L ++S GH LHAF+N K G+G G+ S++ T PI+L GKN
Sbjct: 484 LIDANEEFLKKGSKPALLIESKGHTLHAFVNQKYQGTGTGNGSHSAFTFKNPISLRAGKN 543
Query: 546 TFDLLSLTVGLQNYGAFYEKTGAGITGPVQLKGSGNGTNIDLSSQQWTYQTGLKGEELNF 605
+LSLTVGLQ G FY+ GAG+T V++ G N T IDLSS W Y+ G+ GE L+
Sbjct: 544 EIAILSLTVGLQTAGPFYDFIGAGVTS-VKIIGLNNRT-IDLSSNAWAYKIGVLGEHLSI 601
Query: 606 PSG---SSTQWDSKSTLPKLQPLVW 627
G +S +W S S PK Q L W
Sbjct: 602 YQGEGMNSVKWTSTSEPPKGQALTW 626
>gi|290782382|gb|ADD62393.1| beta-galactosidase 3 [Prunus persica]
Length = 683
Score = 700 bits (1806), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 360/694 (51%), Positives = 461/694 (66%), Gaps = 37/694 (5%)
Query: 168 YASQGGPIILSQIENEYGNIDSAYGAAGKSYIKWAAGMALSLDTGVPWVMCQQSDAPDPI 227
+ASQGGPIILSQIENEYG A GAAG +YI WAA MA++LDTGVPWVMC++ DAPDP+
Sbjct: 2 FASQGGPIILSQIENEYGPESKALGAAGHAYINWAAKMAVALDTGVPWVMCKEDDAPDPM 61
Query: 228 INTCNGFYCDQFTPNSNNKPKMWTENWSGWFLSFGGAVPYRPVEDLAFAVARFFQRGGTF 287
IN CNGFYCD F+PN KP MWTE WSGWF FGG + +RPV+DLAF+VARF Q+GG++
Sbjct: 62 INACNGFYCDGFSPNKPYKPTMWTEAWSGWFTEFGGTIHHRPVQDLAFSVARFIQKGGSY 121
Query: 288 QNYYMYHGGTNFDRTSGGPFISTSYDYDAPLDEYGLIRQPKWGHLKDLHKAIKLCEAALV 347
NYYMYHGGTNF RT+GGPFI+TSYDYD P+DEYGLIRQPK+GHLK+LHKAIKLCE ALV
Sbjct: 122 INYYMYHGGTNFGRTAGGPFITTSYDYDVPIDEYGLIRQPKYGHLKELHKAIKLCEHALV 181
Query: 348 ATDPTYPSLGPNLEATVYKTGSGLCSAFLANIGTNSDVTVKFNGNSYLLPAWSVSILPDC 407
++DPT SLG +A V+ +G C+AFL+N + + + FN Y LPAWS+SILPDC
Sbjct: 182 SSDPTVTSLGAYQQAYVFNSGPRRCAAFLSNFHS-TGARMTFNNMHYDLPAWSISILPDC 240
Query: 408 KNVVFNTAKI----NSVTLVPSFSRQSLQVAADSSDAIGSGWSYINEPVGISKDDAFTKP 463
+NVVFNTAK+ + V ++P+ SR + S +Y + + + +
Sbjct: 241 RNVVFNTAKVGVQTSRVQMIPTNSR------------LFSWQTYDEDVSSLHERSSIAAG 288
Query: 464 GLLEQINTTADQSDYLWYSLSTNIKADEPLLEDGSKTVLHVQSLGHALHAFINGKLVGSG 523
GLLEQIN T D SDYLWY + +I + E L G K L VQS GHALH F+NG+ GS
Sbjct: 289 GLLEQINVTRDTSDYLWYMTNVDISSSE--LRGGKKPTLTVQSAGHALHVFVNGQFSGSA 346
Query: 524 YGSSSNAKVTVDFPIALAPGKNTFDLLSLTVGLQNYGAFYEKTGAGITGPVQLKGSGNGT 583
+G+ + + T P+ L G N LLS+ VGL N G YE GI GPV L G G G
Sbjct: 347 FGTREHRQFTFAKPVHLRAGINKIALLSIAVGLPNVGLHYESWKTGILGPVFLDGLGQGR 406
Query: 584 NIDLSSQQWTYQTGLKGEELNFPS---GSSTQWDSKSTLPKL-QPLVWYKTTFDAPAGSE 639
DL+ Q+W + GLKGE ++ S GSS W S + Q L WYK F+AP G E
Sbjct: 407 K-DLTMQKWFNKVGLKGEAMDLVSPNGGSSVDWIRGSLATQTKQTLKWYKAYFNAPGGDE 465
Query: 640 PVAIDFTGMGKGEAWVNGQSIGRYWPTYVSQNGGCTDSCNYRGAYSSNKCLKNCGKPSQS 699
P+A+D MGKG+ W+NGQSIG+YW Y NG C+ C+Y G + KC CG+P+Q
Sbjct: 466 PLALDMRSMGKGQVWINGQSIGKYWMAYA--NGDCS-LCSYIGTFRPTKCQLGCGQPTQR 522
Query: 700 LYHVPRSWLKSSGNTLVLFEEIGGDPTKISFVTKQLGSSLCSHVTDSHP----LPVDMWG 755
YHVPRSWLK + N +V+FEE+GGDP+KI+ V + + + +C+ + + HP L +D
Sbjct: 523 WYHVPRSWLKPTQNLVVVFEELGGDPSKITLVKRSV-AGVCADLQEHHPNAEKLDIDSHE 581
Query: 756 SDSKIQRKPGPVLSLECPNPNQVISSIKFASFGTPLGTCGSFSRGRCSSARSLSVVRQAC 815
+ + + L+C P Q ISSIKFASFGTP GTCGSF +G C + S ++V + C
Sbjct: 582 ESKTLHQAQ---VHLQCV-PGQSISSIKFASFGTPTGTCGSFQQGTCHATNSHAIVEKNC 637
Query: 816 VGSKSCSIGVSVNTFG-DPCKGVMKSLAVEASCT 848
+G +SC + VS + FG DPC V+K L+VEA C+
Sbjct: 638 IGRESCLVTVSNSIFGTDPCPNVLKRLSVEAVCS 671
>gi|357464801|ref|XP_003602682.1| Beta-galactosidase [Medicago truncatula]
gi|355491730|gb|AES72933.1| Beta-galactosidase [Medicago truncatula]
Length = 719
Score = 700 bits (1806), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 373/738 (50%), Positives = 468/738 (63%), Gaps = 56/738 (7%)
Query: 9 LVLCWGFVVLATTSFGA----NVTYDHRAVVIGGKRRVLISGSIHYPRSTPEMWPDLIQK 64
+ L V + SFG VTYD R+++I G+R +L SGSIHYPRSTP+MWP LI K
Sbjct: 5 VCLMMMLVAILELSFGVKGAEEVTYDGRSLIINGQRNILFSGSIHYPRSTPQMWPGLIAK 64
Query: 65 SKDGGLDVIETYVFWNLHEPVRNQYNFEGRYDLVKFVKLVAEAGLYAHLRIGPYVCAEWN 124
+K GGLDVI+TYVFWNLHEP +Y+F GR DLV F+K + GLY LRIGP++ +EWN
Sbjct: 65 AKQGGLDVIQTYVFWNLHEPQPGKYDFSGRNDLVGFIKEIHAQGLYVSLRIGPFIESEWN 124
Query: 125 FGGFPLWLHFIPGIQFRTDNEPFKAEMQRFTAKIVDMMKQEKLYASQGGPIILSQIENEY 184
+GGFP WLH +PGI +RTDNEPFK MQ FT KIV+MMK+E LYASQGGPIILSQIENEY
Sbjct: 125 YGGFPFWLHDVPGIVYRTDNEPFKFYMQNFTTKIVNMMKEEGLYASQGGPIILSQIENEY 184
Query: 185 GNIDSAYGAAGKSYIKWAAGMALSLDTGVPWVMCQQSDAPDPIINTCNGFYC-DQFT-PN 242
GNI A+G AG Y++WAA MA+ L+TGVPWVMC+Q DAPDP+INTCNG C + FT PN
Sbjct: 185 GNIQKAFGTAGSQYVEWAAKMAVGLNTGVPWVMCKQPDAPDPVINTCNGMRCGETFTGPN 244
Query: 243 SNNKPKMWTENWSGWFLSFGGAVPY-RPVEDLAFAVARFFQRGGTFQNYYMYHGGTNFDR 301
S NKP MWTENW+ ++ +GG VPY R ED+AF V F R G+F NYYMYHGGTNF R
Sbjct: 245 SPNKPAMWTENWTSFYQVYGG-VPYIRSAEDIAFHVTLFVARNGSFVNYYMYHGGTNFGR 303
Query: 302 TSGGPFISTSYDYDAPLDEYGLIRQPKWGHLKDLHKAIKLCEAALVATDPTYPSLGPNLE 361
TS I+ YD APLDEYGL RQPKWGHLK+LH AIK C L+ SLG E
Sbjct: 304 TSSAYMITGYYD-QAPLDEYGLFRQPKWGHLKELHAAIKSCSTTLLQGVQRNFSLGELQE 362
Query: 362 ATVYKTGSGLCSAFLANIGTNSDVTVKFNGNSYLLPAWSVSILPDCKNVVFNTAKINSVT 421
V++ +G C+AFL N + VTV+FN +SY L S+SILPDC+NV FNTA +N+ +
Sbjct: 363 GYVFEEENGKCAAFLINNDKGNTVTVQFNNSSYKLLPKSISILPDCQNVAFNTAHLNTTS 422
Query: 422 LVPSF-SRQSLQVAADSSDAIGSGWSYINEPVGISKDDAFTKPGLLEQINTTADQSDYLW 480
SRQ+ D W + + D + LLEQ+NTT D+SDYLW
Sbjct: 423 NRRIITSRQNFSSVDD--------WKQFQDVIPNFDDTSLRSDSLLEQMNTTKDKSDYLW 474
Query: 481 YS--LSTNIKADEPLLEDGSKTVLHVQSLGHALHAFINGKLVGSGYGSSSNAKVTVDFPI 538
Y+ L N+ ++P +LHVQS H +AF+N +G +G+ T++ PI
Sbjct: 475 YTLRLENNLSCNDP--------ILHVQSSAHVAYAFVNNTYIGGEHGNHDVKSFTLELPI 526
Query: 539 ALAPGKNTFDLLSLTVGLQNYGAFYEKTGAGITGPVQLKGSGNGTNIDLSSQQWTYQTGL 598
L N +LS VGL + GAF EK AG+ V+L+ S + ++L++ W YQ GL
Sbjct: 527 TLNERTNNISILSGMVGLPDSGAFLEKRFAGLNN-VELQCSEQES-LNLNNSTWGYQVGL 584
Query: 599 KGEELNF---PSGSSTQWDSKSTLPKLQ-PLVWYKTTFDAPAGSEPVAIDFTGMGKGEAW 654
GE+L + + +W + + L WYKTTFD P G +P+A+D + M KGEAW
Sbjct: 585 LGEQLKVYTEQNSTDIKWTQLGNITIDEVTLTWYKTTFDTPKGDDPIALDLSSMAKGEAW 644
Query: 655 VNGQSIGRYWPTYVSQNGGCTDSCNYRGAYSSNKCLKNCGKPSQSLYHVPRSWLKSSGNT 714
VNGQSIGRYW + L + G PSQSLYHVPRS+LK S N+
Sbjct: 645 VNGQSIGRYWILF----------------------LDSKGNPSQSLYHVPRSFLKDSENS 682
Query: 715 LVLFEEIGGDPTKISFVT 732
LVL +E GG+P IS T
Sbjct: 683 LVLLDEGGGNPLDISLNT 700
>gi|357467507|ref|XP_003604038.1| Beta-galactosidase [Medicago truncatula]
gi|355493086|gb|AES74289.1| Beta-galactosidase [Medicago truncatula]
Length = 847
Score = 698 bits (1801), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 362/845 (42%), Positives = 507/845 (60%), Gaps = 54/845 (6%)
Query: 26 NVTYDHRAVVIGGKRRVLISGSIHYPRSTPEMWPDLIQKSKDGGLDVIETYVFWNLHEPV 85
NVTYD +++ + G+R +L SGSIHY RSTP+ WPD++ K++ GGL+VI+TYVFWN HEP
Sbjct: 34 NVTYDGKSLFVNGRRELLFSGSIHYTRSTPDAWPDILDKARHGGLNVIQTYVFWNAHEPE 93
Query: 86 RNQYNFEGRYDLVKFVKLVAEAGLYAHLRIGPYVCAEWNFGGFPLWLHFIPGIQFRTDNE 145
+ ++NFEG DLVKF++LV G+Y LR+GP++ AEWN GG P WL +PGI FR+DNE
Sbjct: 94 QGKFNFEGNNDLVKFIRLVQSKGMYVTLRVGPFIQAEWNHGGLPYWLREVPGIIFRSDNE 153
Query: 146 PFKAEMQRFTAKIVDMMKQEKLYASQGGPIILSQIENEYGNIDSAYGAAGKSYIKWAAGM 205
P+K M+ + +KI+ MMK EKL+A QGGPIIL+QIENEY +I AY G SY++WAA M
Sbjct: 154 PYKKYMKAYVSKIIQMMKDEKLFAPQGGPIILAQIENEYNHIQLAYEEKGDSYVQWAANM 213
Query: 206 ALSLDTGVPWVMCQQSDAPDPIINTCNGFYC-DQFT-PNSNNKPKMWTENWSGWFLSFGG 263
A++LD GVPW+MC+Q DAPDP+IN CNG +C D F+ PN KP +WTENW+ + FG
Sbjct: 214 AVALDIGVPWIMCKQKDAPDPVINACNGRHCGDTFSGPNKPYKPSLWTENWTAQYRVFGD 273
Query: 264 AVPYRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNFDRTSGGPFISTSYDYDAPLDEYGL 323
V R ED+AF+VARFF + G NYYMYHGGTNF RT+ F +T Y +APLDEYG+
Sbjct: 274 PVSQRSAEDIAFSVARFFSKNGNLVNYYMYHGGTNFGRTTSA-FTTTRYYDEAPLDEYGM 332
Query: 324 IRQPKWGHLKDLHKAIKLCEAALVATDPTYPSLGPNLEATVY-KTGSGLCSAFLANIGTN 382
RQPKW HL+D HKA+ LC A++ PT L E ++ K G+ CSAF+ N TN
Sbjct: 333 ERQPKWSHLRDAHKALLLCRKAILGGVPTVQKLNDYHEVRIFEKPGTSTCSAFITNNHTN 392
Query: 383 SDVTVKFNGNSYLLPAWSVSILPDCKNVVFNTA---------KINSVTLVPSF--SRQSL 431
T+ F G++Y LPA S+S+LPDCK VV+NT K+ S L+ S+ +
Sbjct: 393 QAATISFRGSNYFLPAHSISVLPDCKTVVYNTQNVMNQLVYYKLISSHLIIKLIVSQHNK 452
Query: 432 QVAADSSDAIGSGWSYINEPVGISKDDAFTKPGLLEQINTTADQSDYLWYSLSTNIKADE 491
+ S+ A W E + SK + LE D +DY WY+ S + ++
Sbjct: 453 RNFVKSAVANNLKWELFLEAIPSSKKLESNQKIPLELYTLLKDTTDYGWYTTSFELGPED 512
Query: 492 PLLEDGSKTVLHVQSLGHALHAFINGKLVGSGYGSSSNAKVTVDFPIALAPGKNTFDLLS 551
L S +L + SLGH L AF+NG+ +G+ +G+ + P G N +L+
Sbjct: 513 --LPKKS-AILRIMSLGHTLSAFVNGQYIGTDHGTHEEKSFEFEQPANFKVGTNYISILA 569
Query: 552 LTVGLQNYGAFYEKTGAGITGPVQLKGSGNGTNIDLSSQQWTYQTGLKGEELNF---PSG 608
TVGL + GA+ E AG + + G G ++L+ W ++ GL+GE+L
Sbjct: 570 TTVGLPDSGAYMEHRYAG-PKSISILGLNKG-KLELTKNGWGHRVGLRGEQLKVFTEEGS 627
Query: 609 SSTQWDSKSTLPKLQPLVWYKTTFDAPAGSEPVAIDFTGMGKGEAWVNGQSIGRYWPTYV 668
QWD + + + L W KT F P G PVAI TGMGKG WVNG+SIGR+W +++
Sbjct: 628 KKVQWDPVTG--ETRALSWLKTRFATPEGRGPVAIRMTGMGKGMIWVNGKSIGRHWMSFL 685
Query: 669 SQNGGCTDSCNYRGAYSSNKCLKNCGKPSQSLYHVPRSWLKSSGNTLVLFEEIGGDPTKI 728
S G+PSQ YH+PR +L + N LV+ EE G P KI
Sbjct: 686 SP----------------------LGQPSQEEYHIPRDYLNAKDNLLVVLEEEKGSPEKI 723
Query: 729 SFVTKQLGSSLCSHVTDSHPLPVDMWGSDS----KIQRKPGPVLSLECPNPNQVISSIKF 784
+ ++CS++T++ P V+ WGS + + + GP SL+CP+ +++ +++F
Sbjct: 724 EIMIVDR-DTICSYITENSPANVNSWGSKNGEFRSVGKNSGPQASLKCPSGKKIV-AVEF 781
Query: 785 ASFGTPLGTCGSFSRGRCSSARSLSVVRQACVGSKSCSIGVSVNTF-GDPCKGVMKSLAV 843
ASFG P G CG F+ G C+ + VV +AC+G + C + V+ F G C G + +LA+
Sbjct: 782 ASFGNPSGYCGDFALGNCNGGAAKGVVEKACLGKEECLVEVNRANFNGQGCAGSVNTLAI 841
Query: 844 EASCT 848
+A C+
Sbjct: 842 QAKCS 846
>gi|413925747|gb|AFW65679.1| hypothetical protein ZEAMMB73_601729 [Zea mays]
Length = 846
Score = 697 bits (1799), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 359/840 (42%), Positives = 508/840 (60%), Gaps = 48/840 (5%)
Query: 21 TSFGANVTYDHRAVVIGGKRRVLISGSIHYPRSTPEMWPDLIQKSKDGGLDVIETYVFWN 80
T G V+YD R+++I G+R + SGSIHYPRS P+MWP+LI K+K+GGL+ IETY+FWN
Sbjct: 35 TKNGTVVSYDRRSLIIDGRREIFFSGSIHYPRSPPDMWPELIAKAKEGGLNTIETYIFWN 94
Query: 81 LHEPVRNQYNFEGRYDLVKFVKLVAEAGLYAHLRIGPYVCAEWNFGGFPLWLHFIPGIQF 140
+HEP + Q++FEGRYD+V+F KL+ E +YA +R+GP++ AEWN GG P WL IP I F
Sbjct: 95 IHEPEKGQFDFEGRYDIVRFFKLIQEHNMYAMVRLGPFIQAEWNHGGLPYWLREIPDIVF 154
Query: 141 RTDNEPFKAEMQRFTAKIVDMMKQEKLYASQGGPIILSQIENEYGNIDSAYGAAGKSYIK 200
RT+NEP+K M+ F I+ +K L+ASQGGPIIL+QIENEY ++++A+ G YIK
Sbjct: 155 RTNNEPYKMHMETFVKIIIKRLKDANLFASQGGPIILAQIENEYQHLEAAFKNDGTKYIK 214
Query: 201 WAAGMALSLDTGVPWVMCQQSDAPDPIINTCNGFYCDQFTPNSNNK--PKMWTENWSGWF 258
WAA MA+S + G+PW+MC+Q+ AP +I TCNG C P NK P +WTENW+ +
Sbjct: 215 WAANMAISTNVGIPWIMCKQTKAPSDVIPTCNGRNCGDTWPGPMNKSMPLLWTENWTAQY 274
Query: 259 LSFGGAVPYRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNFDRTSGGPFISTSYDYDAPL 318
FG R ED+AFAVARFF GGT NYYMYHGGTNF RTS F+ Y +APL
Sbjct: 275 RVFGDPPSQRSAEDIAFAVARFFSVGGTMTNYYMYHGGTNFGRTSAA-FVMPKYYDEAPL 333
Query: 319 DEYGLIRQPKWGHLKDLHKAIKLCEAALVATDPTYPSLGPNLEATVYKT-GSGLCSAFLA 377
DE+GL ++PKWGHL+DLH A+KLC+ AL+ + LG EA V++ +C AFL+
Sbjct: 334 DEFGLYKEPKWGHLRDLHLALKLCKKALLWGKTSTEKLGKQFEARVFEIPEQKVCVAFLS 393
Query: 378 NIGTNSDVTVKFNGNSYLLPAWSVSILPDCKNVVFNTAKINSVTLVPSFSRQSLQVAADS 437
N T DVT+ F G SY +P S+SIL DCK VVF T +N+ ++++ A +
Sbjct: 394 NHNTKDDVTLTFRGQSYFVPRHSISILADCKTVVFGTQHVNA-----QHNQRTFHFADQT 448
Query: 438 SDAIGSGWSYINE---PVGISKDDAFTKPGLLEQINTTADQSDYLWYSLSTNIKADEPLL 494
+ + W +E P K G L N T D++DY+WY+ S ++AD+ +
Sbjct: 449 TQ--NNVWQMFDEEKVPKYKQSKIRLRKAGDL--YNLTKDKTDYVWYTSSFKLEADDMPI 504
Query: 495 EDGSKTVLHVQSLGHALHAFINGKLVGSGYGSSSNAKVTVDFPIALAPGKNTFDLLSLTV 554
KTVL V S GHA AF+N K VG G+G+ N T++ P+ L G N +L+ T+
Sbjct: 505 RRDIKTVLEVNSHGHASVAFVNTKFVGCGHGTKMNKAFTLEKPMDLKKGVNHVAVLASTM 564
Query: 555 GLQNYGAFYEKTGAGITGPVQLKGSGNGTNIDLSSQQWTYQTGLKGEELNFPSGSSTQWD 614
G+ + GA+ E AG+ VQ+KG GT +DL++ W + GL GE+ +
Sbjct: 565 GMMDSGAYLEHRLAGVDR-VQIKGLNAGT-LDLTNNGWGHIVGLVGEQKQIYTDKGMGSV 622
Query: 615 SKSTLPKLQPLVWYKTTFDAPAGSEPVAIDFTGMGKGEAWVNGQSIGRYWPTYVSQNGGC 674
+ +PL WYK FD P+G +P+ +D + MGKG +VNGQ IGRYW +Y
Sbjct: 623 TWKPAVNDRPLTWYKRHFDMPSGEDPIVLDMSTMGKGLMFVNGQGIGRYWISYKHA---- 678
Query: 675 TDSCNYRGAYSSNKCLKNCGKPSQSLYHVPRSWLKSSGNTLVLFEEIGGDPTKISFVTKQ 734
G+PSQ LYH+PRS+L+ N LVLFEE G P I +T +
Sbjct: 679 ------------------LGRPSQQLYHIPRSFLRQKDNVLVLFEEEFGRPDAIMILTVK 720
Query: 735 LGSSLCSHVTDSHPLPVDMW-GSDSKIQRKPG---PVLSLECPNPNQVISSIKFASFGTP 790
++C+ +++ +P + W DS+I P +L C +P ++I + FAS+G P
Sbjct: 721 R-DNICTFISERNPAHIKSWERKDSQITVTAADLKPRATLTC-SPKKLIQQVVFASYGNP 778
Query: 791 LGTCGSFSRGRCSSARSLSVVRQACVGSKSCSIGVSVNTFGDP--CKGVMKSLAVEASCT 848
+G CG+++ G C + R+ +V +AC+G + C++ VS + +G C G +LAV+A C+
Sbjct: 779 MGICGNYTIGSCHTPRAKELVEKACLGKRICTLPVSADVYGGDVNCPGTTATLAVQAKCS 838
>gi|108862584|gb|ABA97655.2| Beta-galactosidase precursor, putative, expressed [Oryza sativa
Japonica Group]
Length = 713
Score = 697 bits (1799), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 363/648 (56%), Positives = 429/648 (66%), Gaps = 42/648 (6%)
Query: 26 NVTYDHRAVVIGGKRRVLISGSIHYPRSTPEMWPDLIQKSKDGGLDVIETYVFWNLHEPV 85
NVTYDHRAV+IGGKRR+L+S +HYPR+TPEMWP LI K K+GG DVIETYVFWN HEP
Sbjct: 63 NVTYDHRAVLIGGKRRMLVSAGLHYPRATPEMWPSLIAKCKEGGADVIETYVFWNGHEPA 122
Query: 86 RNQYNFEGRYDLVKFVK--------LVAEAGLYAHLRIGPYVCAEWNFGGFPLWLHFIPG 137
+ QY FE R+DLVKF K LVA GL+ LRIGPY CAEWNFGGFP+WL IPG
Sbjct: 123 KGQYYFEERFDLVKFAKIDLVKFAKLVAAEGLFLFLRIGPYACAEWNFGGFPVWLRDIPG 182
Query: 138 IQFRTDNEPFKAEMQRFTAKIVDMMKQEKLYASQGGPIILSQIENEYGNIDSAYGAAGKS 197
I+FRTDNEPFKAEMQ F KIV +MK+EKLY+ QGGPIIL QIENEYGNI YG AGK
Sbjct: 183 IEFRTDNEPFKAEMQTFVTKIVTLMKEEKLYSWQGGPIILQQIENEYGNIQGNYGQAGKR 242
Query: 198 YIKWAAGMALSLDTGVPWVMCQQSDAPDPIINTCNGFYCDQFTPNSNNKPKMWTENWSGW 257
Y++WAA MA+ LDTG+PWVMC+Q+DAP+ II+TCN FYCD F PNS NKP +WTE+W GW
Sbjct: 243 YMQWAAQMAIGLDTGIPWVMCRQTDAPEEIIDTCNAFYCDGFKPNSYNKPTIWTEDWDGW 302
Query: 258 FLSFGGAVPYRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNFDRTSGGPFISTSYDYDAP 317
+ +GGA+P+RP ED AFAVARF+QRGG+ QNYYMY GGTNF RT+GGP TSYDYDAP
Sbjct: 303 YADWGGALPHRPAEDSAFAVARFYQRGGSLQNYYMYFGGTNFARTAGGPLQITSYDYDAP 362
Query: 318 LDEYGLIRQPKWGHLKDLHKAIKLCEAALVATD--PTYPSLGPNLEATVYKTG------- 368
+DEYG++RQPKWGHLKDLH AIKLCE AL+A D P Y LG EA VY TG
Sbjct: 363 IDEYGILRQPKWGHLKDLHTAIKLCEPALIAVDGSPQYIKLGSMQEAHVYSTGEVHTNGS 422
Query: 369 ----SGLCSAFLANIGTNSDVTVKFNGNSYLLPAWSVSILPDCKNVVFNTAKINSVTLV- 423
+ +CSAFLANI + +V G SY LP WSVSILPDC+NV FNTA+I + T V
Sbjct: 423 MAGNAQICSAFLANIDEHKYASVWIFGKSYSLPPWSVSILPDCENVAFNTARIGAQTSVF 482
Query: 424 ------PSFS---RQSLQVAADSSDAIGSGWSYINEPVGISKDDAFTKPGLLEQINTTAD 474
PS S + S+ + S W E +G + F G+LE +N T D
Sbjct: 483 TVESGSPSRSSRHKPSILSLTSGGPYLSSTWWTSKETIGTWGGNNFAVQGILEHLNVTKD 542
Query: 475 QSDYLWYSLSTNIKADEPLLEDGSKTV---LHVQSLGHALHAFINGKLVGSGYGSSSNAK 531
SDYLWY+ NI +D + SK V L + + F+NGKL GS G
Sbjct: 543 ISDYLWYTTRVNI-SDADVAFWSSKGVLPSLTIDKIRDVARVFVNGKLAGSQVGHW---- 597
Query: 532 VTVDFPIALAPGKNTFDLLSLTVGLQNYGAFYEKTGAGITGPVQLKGSGNGTNIDLSSQQ 591
V++ PI L G N LLS VGLQNYGAF EK GAG G V L G +G ++DL++
Sbjct: 598 VSLKQPIQLVEGLNELTLLSEIVGLQNYGAFLEKDGAGFRGQVTLTGLSDG-DVDLTNSL 656
Query: 592 WTYQTGLKGE--ELNFPSGSSTQWDSKSTLPKLQPLVWYKTTFDAPAG 637
WTYQ GLKGE + P S+ +QP WYK + G
Sbjct: 657 WTYQVGLKGEFSMIYAPEKQGCAGWSRMQKDSVQPFTWYKNICNQSVG 704
>gi|356509519|ref|XP_003523495.1| PREDICTED: beta-galactosidase 13-like [Glycine max]
Length = 844
Score = 696 bits (1796), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 358/836 (42%), Positives = 496/836 (59%), Gaps = 51/836 (6%)
Query: 26 NVTYDHRAVVIGGKRRVLISGSIHYPRSTPEMWPDLIQKSKDGGLDVIETYVFWNLHEPV 85
NVTYD +++ I G+R +L SGS+HY RSTP+MWPD++ K++ GGL+VI+TYVFWN HEP
Sbjct: 45 NVTYDGKSLFINGRREILFSGSVHYTRSTPDMWPDILDKARRGGLNVIQTYVFWNAHEPE 104
Query: 86 RNQYNFEGRYDLVKFVKLVAEAGLYAHLRIGPYVCAEWNFGGFPLWLHFIPGIQFRTDNE 145
++NF+G YDLVKF++LV G++ LR+GP++ AEWN GG P WL +PGI FR+DNE
Sbjct: 105 PGKFNFQGNYDLVKFIRLVQAKGMFVTLRVGPFIQAEWNHGGLPYWLREVPGIIFRSDNE 164
Query: 146 PFKAEMQRFTAKIVDMMKQEKLYASQGGPIILSQIENEYGNIDSAYGAAGKSYIKWAAGM 205
P+K M+ F +KI+ MMK EKL+A QGGPIIL+QIENEY +I AY G SY++WAA M
Sbjct: 165 PYKFHMKAFVSKIIQMMKDEKLFAPQGGPIILAQIENEYNHIQLAYEEKGDSYVQWAANM 224
Query: 206 ALSLDTGVPWVMCQQSDAPDPIINTCNGFYC-DQFT-PNSNNKPKMWTENWSGWFLSFGG 263
A++ D GVPW+MC+Q DAPDP+IN CNG +C D F PN KP +WTENW+ + G
Sbjct: 225 AVATDIGVPWLMCKQRDAPDPVINACNGRHCGDTFAGPNKPYKPAIWTENWTAQYRVHGD 284
Query: 264 AVPYRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNFDRTSGGPFISTSYDYDAPLDEYGL 323
R ED+AF+VARFF + G NYYMYHGGTNF RTS F +T Y +APLDEYGL
Sbjct: 285 PPSQRSAEDIAFSVARFFSKNGNLVNYYMYHGGTNFGRTS-SVFSTTRYYDEAPLDEYGL 343
Query: 324 IRQPKWGHLKDLHKAIKLCEAALVATDPTYPSLGPNLEATVY-KTGSGLCSAFLANIGTN 382
R+PKW HL+D+HKA+ LC A++ P+ L E + + G+ +C+AF+ N T
Sbjct: 344 PREPKWSHLRDVHKALLLCRRAILGGVPSVQKLNHFHEVRTFERVGTNMCAAFITNNHTM 403
Query: 383 SDVTVKFNGNSYLLPAWSVSILPDCKNVVFNTAKINSVTLVPSFSRQSLQVAADSSDAIG 442
T+ F G +Y LP S+SILPDCK VVFNT +I S ++ R S A
Sbjct: 404 EPATINFRGTNYFLPPHSISILPDCKTVVFNTQQIVSQHNSRNYER--------SPAANN 455
Query: 443 SGWSYINEPVGISKDDAFTKPGLLEQINTTADQSDYLWYSLSTNIKADEPLLEDGSKTVL 502
W NE + +K P E + D +DY WY+ S + ++ ++ G VL
Sbjct: 456 FHWEMFNEAIPTAKKMPINLPVPAELYSLLKDTTDYAWYTTSFELSQEDMSMKPGVLPVL 515
Query: 503 HVQSLGHALHAFINGKLVGSGYGSSSNAKVTVDFPIALAPGKNTFDLLSLTVGLQNYGAF 562
V SLGH++ AF+NG +VG+ +G+ P+ L G N LLS TVGL + GA+
Sbjct: 516 RVMSLGHSMVAFVNGDIVGTAHGTHEEKSFEFQTPVLLRVGTNYISLLSSTVGLPDSGAY 575
Query: 563 YEKTGAGITGPVQLKGSGNGTNIDLSSQQWTYQTGLKGEELNFPS---GSSTQWDSKSTL 619
E AG + + G GT +DL+ W ++ GLKGE S +S +W +
Sbjct: 576 MEHRYAGPKS-INILGLNRGT-LDLTRNGWGHRVGLKGEGKKVFSEEGSTSVKWKPLGAV 633
Query: 620 PKLQPLVWYKTTFDAPAGSEPVAIDFTGMGKGEAWVNGQSIGRYWPTYVSQNGGCTDSCN 679
P+ L WY+T F P G+ PVAI +GM KG WVNG +IGRYW +Y+S
Sbjct: 634 PR--ALSWYRTRFGTPEGTGPVAIRMSGMAKGMVWVNGNNIGRYWMSYLSP--------- 682
Query: 680 YRGAYSSNKCLKNCGKPSQSLYHVPRSWLKSSGNTLVLFEEIGGDPTKISFVTKQLGSSL 739
GKP+QS YH+PRS+L N LV+FEE P ++ + ++
Sbjct: 683 -------------LGKPTQSEYHIPRSFLNPQDNLLVIFEEEARVPAQVEILNVNR-DTI 728
Query: 740 CSHVTDSHPLPVDMW----GSDSKIQRKPGPVLSLECPNPNQVISSIKFASFGTPLGTCG 795
CS V + P V+ W G+ + + G S+ C +++ +++FASFG P G CG
Sbjct: 729 CSVVGERDPANVNSWVSRRGNFHPVVKSVGAAASMACATGKRIV-AVEFASFGNPSGYCG 787
Query: 796 SFSRGRCSSARSLSVVRQACVGSKSCSIGVSVNTFG----DPCKGVMKSLAVEASC 847
F+ G C++A S +V + C+G ++C++ + F D C ++K LAV+ C
Sbjct: 788 DFAMGSCNAAASKQIVERECLGQEACTLALDRAVFNNNGVDACPDLVKQLAVQVRC 843
>gi|296082606|emb|CBI21611.3| unnamed protein product [Vitis vinifera]
Length = 729
Score = 696 bits (1795), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 366/731 (50%), Positives = 463/731 (63%), Gaps = 42/731 (5%)
Query: 24 GANVTYDHRAVVIGGKRRVLISGSIHYPRSTPEMWPDLIQKSKDGGLDVIETYVFWNLHE 83
GA VTYD R+++I G R++L SGSIHYPRSTP+MW LI K+K+GG+DVI+TYVFWN HE
Sbjct: 23 GAQVTYDGRSLIIDGHRKILFSGSIHYPRSTPQMWASLIAKAKEGGVDVIQTYVFWNRHE 82
Query: 84 PVRNQYNFEGRYDLVKFVKLVAEAGLYAHLRIGPYVCAEWNFGGFPLWLHFIPGIQFRTD 143
P QY+F GRYDL KF+K + GLYA LRIGP++ +EW++GG P WLH + GI +RTD
Sbjct: 83 PQPGQYDFNGRYDLAKFIKEIQAQGLYACLRIGPFIESEWSYGGLPFWLHDVHGIVYRTD 142
Query: 144 NEPFKAEMQRFTAKIVDMMKQEKLYASQGGPIILSQIENEYGNIDSAYGAAGKSYIKWAA 203
NEPFK MQ FT KIV++MK E LYASQGGPIILSQIENEY NI++A+ G SY++WAA
Sbjct: 143 NEPFKFYMQNFTTKIVNLMKSEGLYASQGGPIILSQIENEYQNIEAAFNEKGPSYVRWAA 202
Query: 204 GMALSLDTGVPWVMCQQSDAPDPIINTCNGFYCDQ-FT-PNSNNKPKMWTENWSGWFLSF 261
MA+ L TGVPWVMC+QSDAPDP+INTCNG C Q FT PNS NKP MWTENW+ ++ F
Sbjct: 203 KMAVELQTGVPWVMCKQSDAPDPVINTCNGMRCGQTFTGPNSPNKPSMWTENWTSFYEVF 262
Query: 262 GGAVPYRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNFDRTSGGPFISTSYDYDAPLDEY 321
GG R ED+AF VA F R G++ NYYMYHGGTNF R S +I TSY APLDEY
Sbjct: 263 GGETYLRSAEDIAFHVALFIARNGSYVNYYMYHGGTNFGRASSA-YIKTSYYDQAPLDEY 321
Query: 322 GLIRQPKWGHLKDLHKAIKLCEAALVATDPTYPSLGPNLEATVYKTGSGLCSAFLANIGT 381
GLIRQPKWGHLK+LH AI LC L+ + SLG EA V++ G C AFL N
Sbjct: 322 GLIRQPKWGHLKELHAAITLCSTPLLNGVQSNISLGQLQEAYVFQEEMGGCVAFLVNNDE 381
Query: 382 NSDVTVKFNGNSYLLPAWSVSILPDCKNVVFNTAKINSVTLVPSFSRQSLQVAADSSDAI 441
++ TV F S L S+SILPDCKNV+FNTAK+ S + ++ Q L + S
Sbjct: 382 GNNSTVLFQNVSIELLPKSISILPDCKNVIFNTAKVCSSSRQSAYKIQELSRSCIQSFDA 441
Query: 442 GSGWSYINEPVGISKDDAFTKPGLLEQINTTADQSDYLWYS--LSTNIKADEPLLEDGSK 499
W + + D + +LE +N T D+SDYLWY+ N EPL
Sbjct: 442 VDRWEEYKDAIPNFLDTSLKSNMILEHMNMTKDESDYLWYTFRFQPNSSCTEPL------ 495
Query: 500 TVLHVQSLGHALHAFINGKLVGSGYGSSSNAKVTVDFPIALAPGKNTFDLLSLTVGLQNY 559
LH++SL HA+HAF+N VG+ +GS T PI+L N +LS+ VG +
Sbjct: 496 --LHIESLAHAVHAFVNNIYVGATHGSHDMKGFTFKSPISLNNEMNNISILSVMVGFPDS 553
Query: 560 GAFYEKTGAGITGPVQLKGSGNGTNIDLSSQQWTYQTGLKGEELNF---PSGSSTQWDSK 616
GA+ E AG+T V+++ + G D ++ W YQ GL GE+L+ + S+ +W K
Sbjct: 554 GAYLESRFAGLTR-VEIQCTEKGI-YDFANYTWGYQVGLSGEKLHIYKEENLSNVEW-RK 610
Query: 617 STLPKLQPLVWYKTTFDAPAGSEPVAIDFTGMGKGEAWVNGQSIGRYWPTYVSQNGGCTD 676
+ + QPL WYK F+ P+G +PVA++ + MGKGEAWVNGQSIGRYW ++ +
Sbjct: 611 TEISTNQPLTWYKIVFNTPSGDDPVALNLSTMGKGEAWVNGQSIGRYWVSFHNSK----- 665
Query: 677 SCNYRGAYSSNKCLKNCGKPSQSLYHVPRSWLKSSGNTLVLFEEIGGDPTKISFVTKQLG 736
G PSQ+LYHVPR++LK+S N LVL EE GDP IS T
Sbjct: 666 -----------------GDPSQTLYHVPRAFLKTSENLLVLLEEANGDPLHISLETIS-R 707
Query: 737 SSLCSHVTDSH 747
+ L HV H
Sbjct: 708 TDLPDHVLYHH 718
>gi|115477689|ref|NP_001062440.1| Os08g0549200 [Oryza sativa Japonica Group]
gi|75136208|sp|Q6ZJJ0.1|BGL11_ORYSJ RecName: Full=Beta-galactosidase 11; AltName: Full=Lactase 115;
Flags: Precursor
gi|42407808|dbj|BAD08952.1| putative glycosyl hydrolase family 35 (beta-galactosidase) [Oryza
sativa Japonica Group]
gi|113624409|dbj|BAF24354.1| Os08g0549200 [Oryza sativa Japonica Group]
Length = 848
Score = 695 bits (1793), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 374/847 (44%), Positives = 502/847 (59%), Gaps = 55/847 (6%)
Query: 21 TSFGANVTYDHRAVVIGGKRRVLISGSIHYPRSTPEMWPDLIQKSKDGGLDVIETYVFWN 80
T G +TYD R+++I G R + SGSIHYPRS P+ WPDLI K+K+GGL+VIE+YVFWN
Sbjct: 27 TKNGTVITYDRRSLIIDGHREIFFSGSIHYPRSPPDTWPDLISKAKEGGLNVIESYVFWN 86
Query: 81 LHEPVRNQYNFEGRYDLVKFVKLVAEAGLYAHLRIGPYVCAEWNFGGFPLWLHFIPGIQF 140
HEP + YNFEGRYDL+KF KL+ E +YA +RIGP+V AEWN GG P WL IP I F
Sbjct: 87 GHEPEQGVYNFEGRYDLIKFFKLIQEKEMYAIVRIGPFVQAEWNHGGLPYWLREIPDIIF 146
Query: 141 RTDNEPFKAEMQRFTAKIVDMMKQEKLYASQGGPIILSQIENEYGNIDSAYGAAGKSYIK 200
RT+NEPFK M++F IV+ +K+ KL+ASQGGPIIL+QIENEY +++ A+ AG YI
Sbjct: 147 RTNNEPFKKYMKQFVTLIVNKLKEAKLFASQGGPIILAQIENEYQHLEVAFKEAGTKYIN 206
Query: 201 WAAGMALSLDTGVPWVMCQQSDAPDPIINTCNGFYCDQF--TPNSNNKPKMWTENWSGWF 258
WAA MA++ +TGVPW+MC+Q+ AP +I TCNG +C P KP +WTENW+ +
Sbjct: 207 WAAKMAIATNTGVPWIMCKQTKAPGEVIPTCNGRHCGDTWPGPADKKKPLLWTENWTAQY 266
Query: 259 LSFGGAVPYRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNFDRTSGGPFISTSYDYDAPL 318
FG R ED+AF+VARFF GGT NYYMYHGGTNF R +G F+ Y +APL
Sbjct: 267 RVFGDPPSQRSAEDIAFSVARFFSVGGTMANYYMYHGGTNFGR-NGAAFVMPRYYDEAPL 325
Query: 319 DEYGLIRQPKWGHLKDLHKAIKLCEAALVATDPTYPSLGPNLEATVYKTGS-GLCSAFLA 377
DE+GL ++PKWGHL+DLH A++ C+ AL+ +P+ LG EA V++ +C AFL+
Sbjct: 326 DEFGLYKEPKWGHLRDLHHALRHCKKALLWGNPSVQPLGKLYEARVFEMKEKNVCVAFLS 385
Query: 378 NIGTNSDVTVKFNGNSYLLPAWSVSILPDCKNVVFNTAKINSVTLVPSFSRQSLQVAADS 437
N T D TV F G Y + S+SIL DCK VVF+T +NS +F AD
Sbjct: 386 NHNTKEDGTVTFRGQKYFVARRSISILADCKTVVFSTQHVNSQHNQRTFH------FADQ 439
Query: 438 SDAIGSGWSYINEPVGISKDDAFTKPGLLEQINTTADQSDYLWYSLSTNIKADEPLLEDG 497
+ Y E + + LEQ N T D++DYLWY+ S ++ D+
Sbjct: 440 TVQDNVWEMYSEEKIPRYSKTSIRTQRPLEQYNQTKDKTDYLWYTTSFRLETDDLPYRKE 499
Query: 498 SKTVLHVQSLGHALHAFINGKLVGSGYGSSSNAKVTVDFPIALAPGKNTFDLLSLTVGLQ 557
K VL V S GHA+ AF+N VG G+G+ N T++ + L G N +LS T+GL
Sbjct: 500 VKPVLEVSSHGHAIVAFVNDAFVGCGHGTKINKAFTMEKAMDLKVGVNHVAILSSTLGLM 559
Query: 558 NYGAFYEKTGAGITGPVQLKGSGNGTNIDLSSQQWTYQTGLKGEELNFPSGSSTQWDSKS 617
+ G++ E AG+ V ++G GT +DL++ W + GL GE S +
Sbjct: 560 DSGSYLEHRMAGVY-TVTIRGLNTGT-LDLTTNGWGHVVGLDGERRRVHSEQGMGAVAWK 617
Query: 618 TLPKLQPLVWYKTTFDAPAGSEPVAIDFTGMGKGEAWVNGQSIGRYWPTYVSQNGGCTDS 677
QPL WY+ FD P+G++PV ID T MGKG +VNG+ +GRYW +Y
Sbjct: 618 PGKDNQPLTWYRRRFDPPSGTDPVVIDLTPMGKGFLFVNGEGLGRYWVSYHHA------- 670
Query: 678 CNYRGAYSSNKCLKNCGKPSQSLYHVPRSWLKSSGNTLVLFEEIGGDPTKISFVTKQLGS 737
GKPSQ LYHVPRS L+ GNTL+ FEE GG P I +T +
Sbjct: 671 ---------------LGKPSQYLYHVPRSLLRPKGNTLMFFEEEGGKPDAIMILTVKR-D 714
Query: 738 SLCSHVTDSHPLPVDMWGSDSKIQR--------------KPGPVLSLECPNPNQVISSIK 783
++C+ +T+ +P V W +SK + KP VLS CP + I S+
Sbjct: 715 NICTFMTEKNPAHV-RWSWESKDSQPKAVAGAGAGAGGLKPTAVLS--CPT-KKTIQSVV 770
Query: 784 FASFGTPLGTCGSFSRGRCSSARSLSVVRQACVGSKSCSIGVSVNTFGDP--CKGVMKSL 841
FAS+G PLG CG+++ G C + R+ VV +AC+G K+CS+ VS +G C G +L
Sbjct: 771 FASYGNPLGICGNYTVGSCHAPRTKEVVEKACIGRKTCSLVVSSEVYGGDVHCPGTTGTL 830
Query: 842 AVEASCT 848
AV+A C+
Sbjct: 831 AVQAKCS 837
>gi|356518798|ref|XP_003528064.1| PREDICTED: beta-galactosidase 6-like [Glycine max]
Length = 717
Score = 694 bits (1791), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 357/711 (50%), Positives = 454/711 (63%), Gaps = 44/711 (6%)
Query: 27 VTYDHRAVVIGGKRRVLISGSIHYPRSTPEMWPDLIQKSKDGGLDVIETYVFWNLHEPVR 86
VTYD R+++I G+R++L SGSIHYPRSTP+MWPDLI K+K GGLDVI+TYVFWNLHEP
Sbjct: 27 VTYDGRSLIIDGQRKILFSGSIHYPRSTPQMWPDLIAKAKQGGLDVIQTYVFWNLHEPQP 86
Query: 87 NQYNFEGRYDLVKFVKLVAEAGLYAHLRIGPYVCAEWNFGGFPLWLHFIPGIQFRTDNEP 146
Y+F GRYDLV F+K + GLY LRIGP++ +EW +GGFP WLH +PGI +RTDNEP
Sbjct: 87 GMYDFSGRYDLVGFIKEIQAQGLYVCLRIGPFIESEWTYGGFPFWLHDVPGIVYRTDNEP 146
Query: 147 FKAEMQRFTAKIVDMMKQEKLYASQGGPIILSQIENEYGNIDSAYGAAGKSYIKWAAGMA 206
FK MQ FT KIV+MMK+E LYASQGGPIILSQIENEY NI A+G AG Y++WAA MA
Sbjct: 147 FKFYMQNFTTKIVNMMKEEGLYASQGGPIILSQIENEYQNIQKAFGTAGSQYVQWAAKMA 206
Query: 207 LSLDTGVPWVMCQQSDAPDPIINTCNGFYC-DQFT-PNSNNKPKMWTENWSGWFLSFGGA 264
+ LDTGVPW+MC+Q+DAPDP+INTCNG C + FT PNS NKP +WTENW+ ++ +GG
Sbjct: 207 VGLDTGVPWIMCKQTDAPDPVINTCNGMRCGETFTGPNSPNKPALWTENWTSFYQVYGGL 266
Query: 265 VPYRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNFDRTSGGPFISTSYDYDAPLDEYGLI 324
R ED+AF V F R G++ NYYMYHGGTNF RT G ++ T Y APLDEYGL+
Sbjct: 267 PYIRSAEDIAFHVTLFIARNGSYVNYYMYHGGTNFGRT-GSAYVITGYYDQAPLDEYGLL 325
Query: 325 RQPKWGHLKDLHKAIKLCEAALVATDPTYPSLGPNLEATVYKTGSGLCSAFLANIGTNSD 384
RQPKWGHLK LH+ IK C L+ +LG LE V++ G C AFL N ++
Sbjct: 326 RQPKWGHLKQLHEVIKSCSTTLLQGVQRNFTLGQLLEVYVFEEEKGECVAFLINNDRDNK 385
Query: 385 VTVKFNGNSYLLPAWSVSILPDCKNVVFNTAKINSVTLVPSFSRQSLQVAADSSDAIGSG 444
TV+F +SY L S+SILPDC+NV F+TA +N+ + +R+ + + S
Sbjct: 386 ATVQFRNSSYELLPKSISILPDCQNVTFSTANVNTTS-----NRRIISPKQNFSSV--DD 438
Query: 445 WSYINEPVGISKDDAFTKPGLLEQINTTADQSDYLWYSLSTNIKADEPLLEDGSKTVLHV 504
W + + + + LLEQ+NTT D+SDYLWY+L SK L V
Sbjct: 439 WQQFQDVISNFDNTSLKSDSLLEQMNTTKDKSDYLWYTLRFEYNL------SCSKPTLSV 492
Query: 505 QSLGHALHAFINGKLVGSGYGSSSNAKVTVDFPIALAPGKNTFDLLSLTVGLQNYGAFYE 564
QS H HAF+N +G +G+ T++ P+ + G N +LS+ VGL + GAF E
Sbjct: 493 QSAAHVAHAFVNNTYIGGEHGNHDVKSFTLELPVTVNQGTNNLSILSVMVGLPDSGAFLE 552
Query: 565 KTGAGITGPVQLKGSGNGTNIDLSSQQWTYQTGLKGEELNF---PSGSSTQWDSKSTLPK 621
+ AG+ V+L+ S +++L++ W YQ GL GE+L + S T W + +
Sbjct: 553 RRFAGLIS-VELQCS-EQESLNLTNSTWGYQVGLMGEQLQVYKEQNNSDTGWSQLGNVME 610
Query: 622 LQPLVWYKTTFDAPAGSEPVAIDFTGMGKGEAWVNGQSIGRYWPTYVSQNGGCTDSCNYR 681
Q L WYKTTFD P G +PV +D + MGKGEAWVNG+SIGRYW +
Sbjct: 611 -QTLFWYKTTFDTPEGDDPVVLDLSSMGKGEAWVNGESIGRYWILFHDSK---------- 659
Query: 682 GAYSSNKCLKNCGKPSQSLYHVPRSWLKSSGNTLVLFEEIGGDPTKISFVT 732
G PSQSLYHVPRS+LK SGN LVL EE GG+P IS T
Sbjct: 660 ------------GNPSQSLYHVPRSFLKDSGNVLVLLEEGGGNPLGISLDT 698
>gi|11079481|gb|AAG29193.1|AC078898_3 beta-galactosidase, putative [Arabidopsis thaliana]
Length = 780
Score = 694 bits (1791), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 387/844 (45%), Positives = 505/844 (59%), Gaps = 93/844 (11%)
Query: 25 ANVTYDHRAVVIGGKRRVLISGSIHYPRSTPEMWPDLIQKSKDGGLDVIETYVFWNLHEP 84
ANVTYD R+++I G+ ++L SGSIHY RSTP+MWP LI K+K GG+DV++TYVFWN+HEP
Sbjct: 10 ANVTYDGRSLIIDGEHKILFSGSIHYTRSTPQMWPSLIAKAKSGGIDVVDTYVFWNVHEP 69
Query: 85 VRNQYNFEGRYDLVKFVKLVAEAGLYAHLRIGPYVCAEWNFGGFPLWLHFIPGIQFRTDN 144
+ Q++F G D+VKF+K V GLY LRIGP++ EW++GG P WLH + GI FRTDN
Sbjct: 70 QQGQFDFSGSRDIVKFIKEVKNHGLYVCLRIGPFIQGEWSYGGLPFWLHNVQGIVFRTDN 129
Query: 145 EPFKAEMQRFTAKIVDMMKQEKLYASQGGPIILSQIENEYGNIDSAYGAAGKSYIKWAAG 204
EPFK M+R+ IV +MK E LYASQGGPIILSQIENEYG + A+ GKSY+KW A
Sbjct: 130 EPFKYHMKRYAKMIVKLMKSENLYASQGGPIILSQIENEYGMVGRAFRQEGKSYVKWTAK 189
Query: 205 MALSLDTGVPWVMCQQSDAPDPIINTCNGFYCDQF--TPNSNNKPKMWTENWSGWFLSFG 262
+A+ LDTGVPWVMC+Q DAPDP++N CNG C + PNS NKP +WTENW+
Sbjct: 190 LAVELDTGVPWVMCKQDDAPDPLVNACNGRQCGETFKGPNSPNKPAIWTENWTSL----- 244
Query: 263 GAVPYRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNFDRTSGGPFISTSYDYDAPLDEYG 322
ED+AF VA F + G+F NYYMYHGGTNF R + F+ TSY APLDEYG
Sbjct: 245 ------SAEDIAFHVALFIAKNGSFVNYYMYHGGTNFGRNA-SQFVITSYYDQAPLDEYG 297
Query: 323 LIRQPKWGHLKDLHKAIKLCEAALVATDPTYPSLGPNLEATVYKTGSGLCSAFLANIGTN 382
L+RQPKWGHLK+LH A+KLCE L++ T SLG A V+ + LC+A L N
Sbjct: 298 LLRQPKWGHLKELHAAVKLCEEPLLSGLQTTISLGKLQTAFVFGKKANLCAAILVN-QDK 356
Query: 383 SDVTVKFNGNSYLLPAWSVSILPDCKNVVFNTAKINS-VTLVPSFSRQSLQVAADSSDAI 441
+ TV+F +SY L SVS+LPDCKNV FNTAK+N+ +RQ+L SS +
Sbjct: 357 CESTVQFRNSSYRLSPKSVSVLPDCKNVAFNTAKVNAQYNTRTRKARQNL-----SSPQM 411
Query: 442 GSGWSYINEPVGISKDDAFTKPGLLEQINTTADQSDYLWYSLSTNIKADEPLLEDGSKTV 501
W E V + + LLE +NTT D SDYLW +T + E G+ +V
Sbjct: 412 ---WEEFTETVPSFSETSIRSESLLEHMNTTQDTSDYLWQ--TTRFQQSE-----GAPSV 461
Query: 502 LHVQSLGHALHAFINGKLVGSGYGSSSNAKVTVDFPIALAPGKNTFDLLSLTVGLQNYGA 561
L V LGHALHAF+NG+ +GS +G+ + ++ ++L G N LLS+ VGL N GA
Sbjct: 462 LKVNHLGHALHAFVNGRFIGSMHGTFKAHRFLLEKNMSLNNGTNNLALLSVMVGLPNSGA 521
Query: 562 FYEKTGAGITGPVQLKGSGNGTNIDLSSQQWTYQTGLKGEELNFPS---GSSTQW----D 614
E+ + G +K + ++ W YQ GLKGE+ + + + QW D
Sbjct: 522 HLERR---VVGSRSVKIWNGRYQLYFNNYSWGYQVGLKGEKFHVYTEDGSAKVQWKQYRD 578
Query: 615 SKSTLPKLQPLVWYKTTFDAPAGSEPVAIDFTGMGKGEAWVNGQSIGRYWPTYVSQNGGC 674
SKS QPL WYK +FD P G +PVA++ MGKGEAWVNGQSI +
Sbjct: 579 SKS-----QPLTWYKASFDTPEGEDPVALNLGSMGKGEAWVNGQSIAMF----------- 622
Query: 675 TDSCNYRGAYSSNKCLKNCGKPSQSLYHVPRSWLKSSGNTLVLF-EEIGGDPTKISFVTK 733
S YH+PRS+LK + N LV+ EE G+P I+ T
Sbjct: 623 ----------------------SYFRYHIPRSFLKPNSNLLVILEEEREGNPLGITIDTV 660
Query: 734 QLGSSLCSHVTDSHPLPV--------DMWGSDSKIQRKPGPVLSLECPNPNQVISSIKFA 785
+ + +C HV++++P PV + + RKP + L+CP + IS I FA
Sbjct: 661 SV-TEVCGHVSNTNPHPVISPRKKGLNRKNLTYRYDRKPK--VQLQCPTGRK-ISKILFA 716
Query: 786 SFGTPLGTCGSFSRGRCSSARSLSVVRQACVGSKSCSIGVSVNTF-GDPCKGVMKSLAVE 844
SFGTP G+CGS+S G C S SL+VV++AC+ CS+ V TF GD C +KSL V
Sbjct: 717 SFGTPNGSCGSYSIGSCHSPNSLAVVQKACLKKSRCSVPVWSKTFGGDSCPHTVKSLLVR 776
Query: 845 ASCT 848
A C+
Sbjct: 777 AQCS 780
>gi|222640983|gb|EEE69115.1| hypothetical protein OsJ_28192 [Oryza sativa Japonica Group]
Length = 848
Score = 693 bits (1789), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 373/847 (44%), Positives = 501/847 (59%), Gaps = 55/847 (6%)
Query: 21 TSFGANVTYDHRAVVIGGKRRVLISGSIHYPRSTPEMWPDLIQKSKDGGLDVIETYVFWN 80
T G +TYD R+++I G R + SGSIHYPRS P+ WPDLI K+K+GGL+VIE+YVFWN
Sbjct: 27 TKNGTVITYDRRSLIIDGHREIFFSGSIHYPRSPPDTWPDLISKAKEGGLNVIESYVFWN 86
Query: 81 LHEPVRNQYNFEGRYDLVKFVKLVAEAGLYAHLRIGPYVCAEWNFGGFPLWLHFIPGIQF 140
HEP + YNFEGRYDL+KF KL+ E +YA +RIGP+V AEWN GG P WL IP I F
Sbjct: 87 GHEPEQGVYNFEGRYDLIKFFKLIQEKEMYAIVRIGPFVQAEWNHGGLPYWLREIPDIIF 146
Query: 141 RTDNEPFKAEMQRFTAKIVDMMKQEKLYASQGGPIILSQIENEYGNIDSAYGAAGKSYIK 200
RT+NEPFK M++F IV+ +K+ KL+ASQGGPIIL+QIENEY +++ A+ AG YI
Sbjct: 147 RTNNEPFKKYMKQFVTLIVNKLKEAKLFASQGGPIILAQIENEYQHLEVAFKEAGTKYIN 206
Query: 201 WAAGMALSLDTGVPWVMCQQSDAPDPIINTCNGFYCDQF--TPNSNNKPKMWTENWSGWF 258
WAA MA++ +TGVPW+MC+Q+ AP +I TCNG +C P KP +WTENW+ +
Sbjct: 207 WAAKMAIATNTGVPWIMCKQTKAPGEVIPTCNGRHCGDTWPGPADKKKPLLWTENWTAQY 266
Query: 259 LSFGGAVPYRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNFDRTSGGPFISTSYDYDAPL 318
FG R ED+AF+VARFF GGT NYYMYHGGTNF R +G F+ Y +AP
Sbjct: 267 RVFGDPPSQRSAEDIAFSVARFFSVGGTMANYYMYHGGTNFGR-NGAAFVMPRYYDEAPF 325
Query: 319 DEYGLIRQPKWGHLKDLHKAIKLCEAALVATDPTYPSLGPNLEATVYKTGS-GLCSAFLA 377
DE+GL ++PKWGHL+DLH A++ C+ AL+ +P+ LG EA V++ +C AFL+
Sbjct: 326 DEFGLYKEPKWGHLRDLHHALRHCKKALLWGNPSVQPLGKLYEARVFEMKEKNVCVAFLS 385
Query: 378 NIGTNSDVTVKFNGNSYLLPAWSVSILPDCKNVVFNTAKINSVTLVPSFSRQSLQVAADS 437
N T D TV F G Y + S+SIL DCK VVF+T +NS +F AD
Sbjct: 386 NHNTKEDGTVTFRGQKYFVARRSISILADCKTVVFSTQHVNSQHNQRTFH------FADQ 439
Query: 438 SDAIGSGWSYINEPVGISKDDAFTKPGLLEQINTTADQSDYLWYSLSTNIKADEPLLEDG 497
+ Y E + + LEQ N T D++DYLWY+ S ++ D+
Sbjct: 440 TVQDNVWEMYSEEKIPRYSKTSIRTQRPLEQYNQTKDKTDYLWYTTSFRLETDDLPYRKE 499
Query: 498 SKTVLHVQSLGHALHAFINGKLVGSGYGSSSNAKVTVDFPIALAPGKNTFDLLSLTVGLQ 557
K VL V S GHA+ AF+N VG G+G+ N T++ + L G N +LS T+GL
Sbjct: 500 VKPVLEVSSHGHAIVAFVNDAFVGCGHGTKINKAFTMEKAMDLKVGVNHVAILSSTLGLM 559
Query: 558 NYGAFYEKTGAGITGPVQLKGSGNGTNIDLSSQQWTYQTGLKGEELNFPSGSSTQWDSKS 617
+ G++ E AG+ V ++G GT +DL++ W + GL GE S +
Sbjct: 560 DSGSYLEHRMAGVY-TVTIRGLNTGT-LDLTTNGWGHVVGLDGERRRVHSEQGMGAVAWK 617
Query: 618 TLPKLQPLVWYKTTFDAPAGSEPVAIDFTGMGKGEAWVNGQSIGRYWPTYVSQNGGCTDS 677
QPL WY+ FD P+G++PV ID T MGKG +VNG+ +GRYW +Y
Sbjct: 618 PGKDNQPLTWYRRRFDPPSGTDPVVIDLTPMGKGFLFVNGEGLGRYWVSYHHA------- 670
Query: 678 CNYRGAYSSNKCLKNCGKPSQSLYHVPRSWLKSSGNTLVLFEEIGGDPTKISFVTKQLGS 737
GKPSQ LYHVPRS L+ GNTL+ FEE GG P I +T +
Sbjct: 671 ---------------LGKPSQYLYHVPRSLLRPKGNTLMFFEEEGGKPDAIMILTVKR-D 714
Query: 738 SLCSHVTDSHPLPVDMWGSDSKIQR--------------KPGPVLSLECPNPNQVISSIK 783
++C+ +T+ +P V W +SK + KP VLS CP + I S+
Sbjct: 715 NICTFMTEKNPAHV-RWSWESKDSQPKAVAGAGAGAGGFKPTAVLS--CPT-KKTIQSVV 770
Query: 784 FASFGTPLGTCGSFSRGRCSSARSLSVVRQACVGSKSCSIGVSVNTFGDP--CKGVMKSL 841
FAS+G PLG CG+++ G C + R+ VV +AC+G K+CS+ VS +G C G +L
Sbjct: 771 FASYGNPLGICGNYTVGSCHAPRTKEVVEKACIGRKTCSLVVSSEVYGGDVHCPGTTGTL 830
Query: 842 AVEASCT 848
AV+A C+
Sbjct: 831 AVQAKCS 837
>gi|225438369|ref|XP_002274012.1| PREDICTED: beta-galactosidase 6-like [Vitis vinifera]
Length = 758
Score = 693 bits (1788), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 367/731 (50%), Positives = 465/731 (63%), Gaps = 49/731 (6%)
Query: 24 GANVTYDHRAVVIGGKRRVLISGSIHYPRSTPEMWPDLIQKSKDGGLDVIETYVFWNLHE 83
GA VTYD R+++I G R++L SGSIHYPRSTP+MW LI K+K+GG+DVI+TYVFWN HE
Sbjct: 59 GAQVTYDGRSLIIDGHRKILFSGSIHYPRSTPQMWASLIAKAKEGGVDVIQTYVFWNRHE 118
Query: 84 PVRNQYNFEGRYDLVKFVKLVAEAGLYAHLRIGPYVCAEWNFGGFPLWLHFIPGIQFRTD 143
P QY+F GRYDL KF+K + GLYA LRIGP++ +EW++GG P WLH + GI +RTD
Sbjct: 119 PQPGQYDFNGRYDLAKFIKEIQAQGLYACLRIGPFIESEWSYGGLPFWLHDVHGIVYRTD 178
Query: 144 NEPFKAEMQRFTAKIVDMMKQEKLYASQGGPIILSQIENEYGNIDSAYGAAGKSYIKWAA 203
NEPFK MQ FT KIV++MK E LYASQGGPIILSQIENEY NI++A+ G SY++WAA
Sbjct: 179 NEPFKFYMQNFTTKIVNLMKSEGLYASQGGPIILSQIENEYQNIEAAFNEKGPSYVRWAA 238
Query: 204 GMALSLDTGVPWVMCQQSDAPDPIINTCNGFYCDQ-FT-PNSNNKPKMWTENWSGWFLSF 261
MA+ L TGVPWVMC+QSDAPDP+INTCNG C Q FT PNS NKP MWTENW+ ++ F
Sbjct: 239 KMAVELQTGVPWVMCKQSDAPDPVINTCNGMRCGQTFTGPNSPNKPSMWTENWTSFYEVF 298
Query: 262 GGAVPYRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNFDRTSGGPFISTSYDYDAPLDEY 321
GG R ED+AF VA F R G++ NYYMYHGGTNF R S +I TSY APLDEY
Sbjct: 299 GGETYLRSAEDIAFHVALFIARNGSYVNYYMYHGGTNFGRASSA-YIKTSYYDQAPLDEY 357
Query: 322 GLIRQPKWGHLKDLHKAIKLCEAALVATDPTYPSLGPNLEATVYKTGSGLCSAFLANIGT 381
GLIRQPKWGHLK+LH AI LC L+ + SLG EA V++ G C AFL N
Sbjct: 358 GLIRQPKWGHLKELHAAITLCSTPLLNGVQSNISLGQLQEAYVFQEEMGGCVAFLVNNDE 417
Query: 382 NSDVTVKFNGNSYLLPAWSVSILPDCKNVVFNTAKINSVTLVPSFSRQSLQVAADSSDAI 441
++ TV F S L S+SILPDCKNV+FNTAKIN+ + + ++ S DA+
Sbjct: 418 GNNSTVLFQNVSIELLPKSISILPDCKNVIFNTAKINTGY------NERIATSSQSFDAV 471
Query: 442 GSGWSYINEPVGISKDDAFTKPGLLEQINTTADQSDYLWYS--LSTNIKADEPLLEDGSK 499
W + + D + +LE +N T D+SDYLWY+ N EPL
Sbjct: 472 DR-WEEYKDAIPNFLDTSLKSNMILEHMNMTKDESDYLWYTFRFQPNSSCTEPL------ 524
Query: 500 TVLHVQSLGHALHAFINGKLVGSGYGSSSNAKVTVDFPIALAPGKNTFDLLSLTVGLQNY 559
LH++SL HA+HAF+N VG+ +GS T PI+L N +LS+ VG +
Sbjct: 525 --LHIESLAHAVHAFVNNIYVGATHGSHDMKGFTFKSPISLNNEMNNISILSVMVGFPDS 582
Query: 560 GAFYEKTGAGITGPVQLKGSGNGTNIDLSSQQWTYQTGLKGEELNF---PSGSSTQWDSK 616
GA+ E AG+T V+++ + G D ++ W YQ GL GE+L+ + S+ +W K
Sbjct: 583 GAYLESRFAGLTR-VEIQCTEKGI-YDFANYTWGYQVGLSGEKLHIYKEENLSNVEW-RK 639
Query: 617 STLPKLQPLVWYKTTFDAPAGSEPVAIDFTGMGKGEAWVNGQSIGRYWPTYVSQNGGCTD 676
+ + QPL WYK F+ P+G +PVA++ + MGKGEAWVNGQSIGRYW ++ +
Sbjct: 640 TEISTNQPLTWYKIVFNTPSGDDPVALNLSTMGKGEAWVNGQSIGRYWVSFHNSK----- 694
Query: 677 SCNYRGAYSSNKCLKNCGKPSQSLYHVPRSWLKSSGNTLVLFEEIGGDPTKISFVTKQLG 736
G PSQ+LYHVPR++LK+S N LVL EE GDP IS T
Sbjct: 695 -----------------GDPSQTLYHVPRAFLKTSENLLVLLEEANGDPLHISLETIS-R 736
Query: 737 SSLCSHVTDSH 747
+ L HV H
Sbjct: 737 TDLPDHVLYHH 747
>gi|242081931|ref|XP_002445734.1| hypothetical protein SORBIDRAFT_07g024870 [Sorghum bicolor]
gi|241942084|gb|EES15229.1| hypothetical protein SORBIDRAFT_07g024870 [Sorghum bicolor]
Length = 844
Score = 691 bits (1783), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 350/837 (41%), Positives = 502/837 (59%), Gaps = 42/837 (5%)
Query: 21 TSFGANVTYDHRAVVIGGKRRVLISGSIHYPRSTPEMWPDLIQKSKDGGLDVIETYVFWN 80
T G ++YD R++++ G+R + SGSIHYPRS P+MWP+LI K+K+GGL+ IETYVFWN
Sbjct: 32 TKNGTVISYDRRSLMVDGRREIFFSGSIHYPRSPPDMWPELIAKAKEGGLNTIETYVFWN 91
Query: 81 LHEPVRNQYNFEGRYDLVKFVKLVAEAGLYAHLRIGPYVCAEWNFGGFPLWLHFIPGIQF 140
+HEP + Q+NFEGRYD+VKF KL+ E ++A +R+GP++ AEWN GG P WL IP I F
Sbjct: 92 IHEPEKGQFNFEGRYDMVKFFKLIQEHDMFAMVRLGPFIQAEWNHGGLPYWLREIPDIVF 151
Query: 141 RTDNEPFKAEMQRFTAKIVDMMKQEKLYASQGGPIILSQIENEYGNIDSAYGAAGKSYIK 200
RT+NEP+K M+ F ++ +K L+ASQGGPIIL+QIENEY ++++A+ G YI
Sbjct: 152 RTNNEPYKMHMETFVKIVIKRLKDANLFASQGGPIILAQIENEYQHLEAAFKEEGTKYIH 211
Query: 201 WAAGMALSLDTGVPWVMCQQSDAPDPIINTCNGFYCDQFTPNSNNK--PKMWTENWSGWF 258
WAA MA+ + G+PW+MC+Q+ AP +I TCNG C P NK P +WTENW+ +
Sbjct: 212 WAAQMAIGTNIGIPWIMCKQTKAPGDVIPTCNGRNCGDTWPGPMNKTMPLLWTENWTAQY 271
Query: 259 LSFGGAVPYRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNFDRTSGGPFISTSYDYDAPL 318
FG R ED+AFAVARFF GGT NYYMYHGGTNF RT+ F+ Y +APL
Sbjct: 272 RVFGDPPSQRSAEDIAFAVARFFSVGGTMTNYYMYHGGTNFGRTAAA-FVMPKYYDEAPL 330
Query: 319 DEYGLIRQPKWGHLKDLHKAIKLCEAALVATDPTYPSLGPNLEATVYKT-GSGLCSAFLA 377
DE+GL ++PKWGHL+DLH A+KLC+ AL+ P+ LG LEA V++ +C AFL+
Sbjct: 331 DEFGLYKEPKWGHLRDLHLALKLCKKALLWGKPSTEKLGKQLEARVFEIPEQKVCVAFLS 390
Query: 378 NIGTNSDVTVKFNGNSYLLPAWSVSILPDCKNVVFNTAKINSVTLVPSFSRQSLQVAADS 437
N T DVT+ F G Y +P S+SIL DCK VVF T +N+ +F AD
Sbjct: 391 NHNTKDDVTLTFRGQPYFVPRHSISILADCKTVVFGTQHVNAQHNQRTFH------FADQ 444
Query: 438 SDAIGSGWSYINEPVGISKDDAFTKPGLLEQINTTADQSDYLWYSLSTNIKADEPLLEDG 497
++ + E V K + N T D++DY+WY+ S ++ D+ +
Sbjct: 445 TNQNNVWQMFDEEKVPKYKQAKIRTRKAADLYNLTKDKTDYVWYTSSFKLEPDDMPIRRD 504
Query: 498 SKTVLHVQSLGHALHAFINGKLVGSGYGSSSNAKVTVDFPIALAPGKNTFDLLSLTVGLQ 557
KTV+ V S GHA AF+N K G G+G+ N T++ P+ L G N +L+ ++G+
Sbjct: 505 IKTVVEVNSHGHASVAFVNNKFAGCGHGTKMNKAFTLEKPMELKKGVNHVAVLASSMGMM 564
Query: 558 NYGAFYEKTGAGITGPVQLKGSGNGTNIDLSSQQWTYQTGLKGEELNFPSGSSTQWDSKS 617
+ GA+ E AG+ VQ+ G GT +DL++ W + GL GE+ + +
Sbjct: 565 DSGAYLEHRLAGVDR-VQITGLNAGT-LDLTNNGWGHIVGLVGEQKEIYTEKGMASVTWK 622
Query: 618 TLPKLQPLVWYKTTFDAPAGSEPVAIDFTGMGKGEAWVNGQSIGRYWPTYVSQNGGCTDS 677
+PL WYK FD P+G +P+ +D + MGKG +VNGQ IGRYW +Y
Sbjct: 623 PAVNDKPLTWYKRHFDMPSGEDPIVLDMSTMGKGMMYVNGQGIGRYWMSYKHA------- 675
Query: 678 CNYRGAYSSNKCLKNCGKPSQSLYHVPRSWLKSSGNTLVLFEEIGGDPTKISFVTKQLGS 737
G+PSQ LYH+PRS+L+ N LVLFEE G P I +T +
Sbjct: 676 ---------------LGRPSQQLYHIPRSFLRPKDNVLVLFEEEFGRPDAIMILTVKR-D 719
Query: 738 SLCSHVTDSHPLPVDMW-GSDSKIQRKPGPV---LSLECPNPNQVISSIKFASFGTPLGT 793
++C+++++ +P + W DS+I + +L CP P ++I + FAS+G P+G
Sbjct: 720 NICTYISERNPAHIKSWERKDSQITATADDLKARATLTCP-PKKLIQQVVFASYGNPVGI 778
Query: 794 CGSFSRGRCSSARSLSVVRQACVGSKSCSIGVSVNTFGDP--CKGVMKSLAVEASCT 848
CG+++ G C + R+ VV ++C+G ++C++ VS + +G C G +LAV+A C+
Sbjct: 779 CGNYTIGSCHTPRAKEVVEKSCLGKRTCTLPVSADVYGGDVNCPGTTATLAVQAKCS 835
>gi|356541034|ref|XP_003538988.1| PREDICTED: beta-galactosidase 13-like, partial [Glycine max]
Length = 806
Score = 688 bits (1776), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 349/835 (41%), Positives = 500/835 (59%), Gaps = 52/835 (6%)
Query: 27 VTYDHRAVVIGGKRRVLISGSIHYPRSTPEMWPDLIQKSKDGGLDVIETYVFWNLHEPVR 86
VTYD R+++I G+R +L SGSIHYPRSTPE W ++ K++ GG++V++TYVFWN+HE +
Sbjct: 9 VTYDGRSLIINGRRELLFSGSIHYPRSTPEEWAGILDKARQGGINVVQTYVFWNIHETEK 68
Query: 87 NQYNFEGRYDLVKFVKLVAEAGLYAHLRIGPYVCAEWNFGGFPLWLHFIPGIQFRTDNEP 146
+Y+ E +YD +KF+KL+ + G+Y LR+GP++ AEWN GG P WL +P I FR++NEP
Sbjct: 69 GKYSIEPQYDYIKFIKLIQKKGMYVTLRVGPFIQAEWNHGGLPYWLREVPEIIFRSNNEP 128
Query: 147 FKAEMQRFTAKIVDMMKQEKLYASQGGPIILSQIENEYGNIDSAYGAAGKSYIKWAAGMA 206
FK M+++ + ++ +K L+A QGGPIIL+QIENEY +I A+ G +Y++WAA MA
Sbjct: 129 FKKHMKKYVSTVIKTVKDANLFAPQGGPIILAQIENEYNHIQRAFREEGDNYVQWAAKMA 188
Query: 207 LSLDTGVPWVMCQQSDAPDPIINTCNGFYC-DQFT-PNSNNKPKMWTENWSGWFLSFGGA 264
+SLD GVPW+MC+Q+DAPDP+IN CNG +C D F+ PN KP +WTENW+ + FG
Sbjct: 189 VSLDIGVPWIMCKQTDAPDPVINACNGRHCGDTFSGPNKPYKPAIWTENWTAQYRVFGDP 248
Query: 265 VPYRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNFDRTSGGPFISTSYDYDAPLDEYGLI 324
R ED+AF+VARFF + G+ NYYMYHGGTNF RTS F +T Y +APLDEYG+
Sbjct: 249 PSQRSAEDIAFSVARFFSKNGSLVNYYMYHGGTNFGRTSSA-FTTTRYYDEAPLDEYGMQ 307
Query: 325 RQPKWGHLKDLHKAIKLCEAALVATDPTYPSLGPNLEATVY-KTGSGLCSAFLANIGTNS 383
R+PKW HL+D+H+A+ LC+ AL T + + E V+ K GS LC+AF+ N T
Sbjct: 308 REPKWSHLRDVHRALSLCKRALFNGASTVTKMSQHHEVIVFEKPGSNLCAAFITNNHTKV 367
Query: 384 DVTVKFNGNSYLLPAWSVSILPDCKNVVFNTAKINSVTLVPSFSRQSLQVAADSSDAIGS 443
T+ F G Y +P S+SILPDCK VVFNT I S +F R S A
Sbjct: 368 PTTISFRGTDYYMPPRSISILPDCKTVVFNTQCIASQHSSRNFKR--------SMAANDH 419
Query: 444 GWSYINEPVGISKDDAFTKPGLLEQINTTADQSDYLWYSLSTNIKADEPLLEDGSKTVLH 503
W +E + +K + +E + D SDY WY+ S ++ ++ ++ T+L
Sbjct: 420 KWEVYSETIPTTKQIPTHEKNPIELYSLLKDTSDYAWYTTSVELRPEDLPKKNDIPTILR 479
Query: 504 VQSLGHALHAFINGKLVGSGYGSSSNAKVTVDFPIALAPGKNTFDLLSLTVGLQNYGAFY 563
+ SLGH+L AF+NG+ +GS +GS P+ L G N +L+ TVGL + GA+
Sbjct: 480 IMSLGHSLLAFVNGEFIGSNHGSHEEKGFEFQKPVTLKVGVNQIAILASTVGLPDSGAYM 539
Query: 564 EKTGAGITGPVQLKGSGNGTNIDLSSQQWTYQTGLKGEELNF---PSGSSTQW-DSKSTL 619
E AG L N +DL+S W ++ G+KGE+L QW ++K
Sbjct: 540 EHRFAGPKSIFIL--GLNSGKMDLTSNGWGHEVGIKGEKLGIFTEEGSKKVQWKEAKGPG 597
Query: 620 PKLQPLVWYKTTFDAPAGSEPVAIDFTGMGKGEAWVNGQSIGRYWPTYVSQNGGCTDSCN 679
P + WYKT F P G++PVAI TGMGKG W+NG+SIGR+W +Y+S
Sbjct: 598 PAVS---WYKTNFATPEGTDPVAIRMTGMGKGMVWINGKSIGRHWMSYLSP--------- 645
Query: 680 YRGAYSSNKCLKNCGKPSQSLYHVPRSWLKSSGNTLVLFEEIGGDPTKISFVTKQLGSSL 739
G+P+QS YH+PR++ N LV+FEE +P K+ +T ++
Sbjct: 646 -------------LGQPTQSEYHIPRTYFNPKDNLLVVFEEEIANPEKVEILTVNR-DTI 691
Query: 740 CSHVTDSHPLPVDMWGSDSK----IQRKPGPVLSLECPNPNQVISSIKFASFGTPLGTCG 795
CS VT++HP V W S+ + P SL+CP+ + I +++FASFG P G CG
Sbjct: 692 CSFVTENHPPNVKSWAIKSEKFQAVVNDLVPSASLKCPH-QRTIKAVEFASFGDPAGACG 750
Query: 796 SFSRGRCSSARSLSVVRQACVGSKSCSIGVSVNTF---GDPCKGVMKSLAVEASC 847
+F+ G+C++ +V + C+G SC + + + F D C V K+LA++ C
Sbjct: 751 AFALGKCNAPAIKQIVEKQCLGKASCLVPIDKDAFTKGQDACPNVTKALAIQVRC 805
>gi|357437611|ref|XP_003589081.1| Beta-galactosidase [Medicago truncatula]
gi|355478129|gb|AES59332.1| Beta-galactosidase [Medicago truncatula]
Length = 589
Score = 688 bits (1775), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 342/600 (57%), Positives = 411/600 (68%), Gaps = 16/600 (2%)
Query: 138 IQFRTDNEPFKAEMQRFTAKIVDMMKQEKLYASQGGPIILSQIENEYGNIDSAYGAAGKS 197
+ FRTDNEPFKA MQ+FT KIV MMK E L+ +QGGPII+SQIENEYG ++ GA GK+
Sbjct: 1 MAFRTDNEPFKAAMQKFTTKIVTMMKAESLFQTQGGPIIMSQIENEYGPVEWEIGAPGKA 60
Query: 198 YIKWAAGMALSLDTGVPWVMCQQSDAPDPIINTCNGFYCDQFTPNSNNKPKMWTENWSGW 257
Y KWAA MA+ LDTGVPW MC+Q DAPDP+I+TCNG+YC+ FTPN N KPKMWTENWSGW
Sbjct: 61 YTKWAAQMAVGLDTGVPWDMCKQEDAPDPVIDTCNGYYCENFTPNENFKPKMWTENWSGW 120
Query: 258 FLSFGGAVPYRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNFDRTSGGPFISTSYDYDAP 317
+ FGGA+ +RP EDLA++VA F Q G+F NYYMYHGGTNF RTS G FI+TSYDYDAP
Sbjct: 121 YTDFGGAISHRPTEDLAYSVATFIQNRGSFVNYYMYHGGTNFGRTSSGLFIATSYDYDAP 180
Query: 318 LDEYGLIRQPKWGHLKDLHKAIKLCEAALVATDPTYPSLG-PNLEATVYKTGSGLCSAFL 376
+DEYGL +PKW HLK+LHKAIK CE AL++ DPT LG NLEA VY + +C+AFL
Sbjct: 181 IDEYGLPNEPKWSHLKNLHKAIKQCEPALISVDPTVTWLGNKNLEAHVYYVNTSICAAFL 240
Query: 377 ANIGTNSDVTVKFNGNSYLLPAWSVSILPDCKNVVFNTAKINSVTLVPSFSRQSLQVAAD 436
AN T S TV F Y LP WSVSILPDCK VVFNTA +N SF ++ V
Sbjct: 241 ANYDTKSAATVTFGNGQYDLPPWSVSILPDCKTVVFNTATVNG----HSFHKRMTPV--- 293
Query: 437 SSDAIGSGWSYINEPVGISKDDAFTKPGLLEQINTTADQSDYLWYSLSTNIKADEPLLED 496
+ SY EP S DD+ L EQIN T D SDYLWY NI E +++
Sbjct: 294 --ETTFDWQSYSEEPAYSSDDDSIIANALWEQINVTRDSSDYLWYLTDVNISPSESFIKN 351
Query: 497 GSKTVLHVQSLGHALHAFINGKLVGSGYGSSSNAKVTVDFPIALAPGKNTFDLLSLTVGL 556
G L + S GH LH F+NG+L G+ YG N KVT + L G N LLS+ VGL
Sbjct: 352 GQFPTLTINSAGHVLHVFVNGQLSGTVYGGLDNPKVTFSESVNLKVGNNKISLLSVAVGL 411
Query: 557 QNYGAFYEKTGAGITGPVQLKGSGNGTNIDLSSQQWTYQTGLKGEELNFPS---GSSTQW 613
N G +E G+ GPV+LKG GT DLS Q+W+Y+ GLKGE L+ + SS W
Sbjct: 412 PNVGLHFETWNVGVLGPVRLKGLDEGTR-DLSWQKWSYKVGLKGESLSLHTITGSSSIDW 470
Query: 614 DSKSTLPKLQPLVWYKTTFDAPAGSEPVAIDFTGMGKGEAWVNGQSIGRYWPTYVSQNGG 673
S+L K QPL WYKTTFDAP+G++PVA+D + MGKGE W+N QSIGR+WP Y++ G
Sbjct: 471 TQGSSLAKKQPLTWYKTTFDAPSGNDPVALDMSSMGKGEIWINDQSIGRHWPAYIAH-GN 529
Query: 674 CTDSCNYRGAYSSNKCLKNCGKPSQSLYHVPRSWLKSSGNTLVLFEEIGGDPTKISFVTK 733
C D CNY G +++ KC NCG+P+Q YH+PRSWL SSGN LV+ EE GGDPT IS V +
Sbjct: 530 C-DECNYAGTFTNPKCRTNCGEPTQKWYHIPRSWLSSSGNVLVVLEEWGGDPTGISLVKR 588
>gi|356507439|ref|XP_003522474.1| PREDICTED: beta-galactosidase 6-like [Glycine max]
Length = 717
Score = 687 bits (1774), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 363/738 (49%), Positives = 462/738 (62%), Gaps = 57/738 (7%)
Query: 6 ILLLVLCW----GFVVLATTSFGANVTYDHRAVVIGGKRRVLISGSIHYPRSTPEMWPDL 61
++LL++ W GF V A VTYD R+++I G+R++L SG IHYPRSTP+MWPDL
Sbjct: 7 LVLLLVFWKIREGFGVKA-----EEVTYDGRSLIIDGQRKILFSGLIHYPRSTPQMWPDL 61
Query: 62 IQKSKDGGLDVIETYVFWNLHEPVRNQYNFEGRYDLVKFVKLVAEAGLYAHLRIGPYVCA 121
I K+K GGLDVI+TYVFWNLHEP Y+F GRYDLV F+K + GLY LRIGP++ +
Sbjct: 62 IAKAKQGGLDVIQTYVFWNLHEPQPGMYDFRGRYDLVGFIKEIQAQGLYVCLRIGPFIQS 121
Query: 122 EWNFGGFPLWLHFIPGIQFRTDNEPFKAEMQRFTAKIVDMMKQEKLYASQGGPIILSQIE 181
EW +GGFP WLH +PGI +RTDNE FK MQ FT KIV+MMK+E LYASQGGPIILSQIE
Sbjct: 122 EWKYGGFPFWLHDVPGIVYRTDNESFKFYMQNFTTKIVNMMKEEGLYASQGGPIILSQIE 181
Query: 182 NEYGNIDSAYGAAGKSYIKWAAGMALSLDTGVPWVMCQQSDAPDPIINTCNGFYC-DQFT 240
NEY NI A+G AG Y++WAA MA+ L+TGVPWVMC+Q+DAPDP+INTCNG C + FT
Sbjct: 182 NEYQNIQKAFGTAGSQYVQWAAKMAVGLNTGVPWVMCKQTDAPDPVINTCNGMRCGETFT 241
Query: 241 -PNSNNKPKMWTENWSGWFLSFGGAVPYRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNF 299
PNS NKP +WTENW+ ++ +GG R ED+AF V F R G++ NYYMYHGGTNF
Sbjct: 242 GPNSPNKPALWTENWTSFYQVYGGLPYIRSAEDIAFHVTLFIARNGSYVNYYMYHGGTNF 301
Query: 300 DRTSGGPFISTSYDYDAPLDEYGLIRQPKWGHLKDLHKAIKLCEAALVATDPTYPSLGPN 359
RT+ I+ YD APLDEYGL+RQPKWGHLK LH+ IK C L+ SLG
Sbjct: 302 GRTASAYVITGYYD-QAPLDEYGLLRQPKWGHLKQLHEVIKSCSTTLLQGVQRNFSLGQL 360
Query: 360 LEATVYKTGSGLCSAFLANIGTNSDVTVKFNGNSYLLPAWSVSILPDCKNVVFNTAKINS 419
E V++ G C AFL N ++ VTV+F SY L S+SILPDC+NV FNTA +N+
Sbjct: 361 QEGYVFEEEKGECVAFLKNNDRDNKVTVQFRNRSYELLPRSISILPDCQNVAFNTANVNT 420
Query: 420 VTLVPSFSRQSLQVAADSSDAIGSGWSYINEPVGISKDDAFTKPGLLEQINTTADQSDYL 479
+ +R+ + + S W + + + + LLEQ+NTT D+SDYL
Sbjct: 421 TS-----NRRIISPKQNFSSL--DDWKQFQDVIPYFDNTSLRSDSLLEQMNTTKDKSDYL 473
Query: 480 WYSL--STNIKADEPLLEDGSKTVLHVQSLGHALHAFINGKLVGSGYGSSSNAKVTVDFP 537
WY+L N+ +P L VQS H HAFIN +G +G+ T++ P
Sbjct: 474 WYTLRFEYNLSCRKP--------TLSVQSAAHVAHAFINNTYIGGEHGNHDVKSFTLELP 525
Query: 538 IALAPGKNTFDLLSLTVGLQNYGAFYEKTGAGITGPVQLKGSGNGTNIDLSSQQWTYQTG 597
+ + G N +LS VGL + GAF E+ AG+ V+L+ S +++L++ W YQ G
Sbjct: 526 VTVNQGTNNLSILSAMVGLPDSGAFLERRFAGLIS-VELQCS-EQESLNLTNSTWGYQVG 583
Query: 598 LKGEELNF---PSGSSTQWDSKSTLPKLQPLVWYKTTFDAPAGSEPVAIDFTGMGKGEAW 654
L GE+L + S W + + Q L+WYKTTFD P G +PV +D + MGKGEAW
Sbjct: 584 LLGEQLQVYKKQNNSDIGWSQLGNIME-QLLIWYKTTFDTPEGDDPVVLDLSSMGKGEAW 642
Query: 655 VNGQSIGRYWPTYVSQNGGCTDSCNYRGAYSSNKCLKNCGKPSQSLYHVPRSWLKSSGNT 714
VN QSIGRYW + G PSQSLYHVPRS+LK +GN
Sbjct: 643 VNEQSIGRYWILFHDSK----------------------GNPSQSLYHVPRSFLKDTGNV 680
Query: 715 LVLFEEIGGDPTKISFVT 732
LVL EE GG+P IS T
Sbjct: 681 LVLVEEGGGNPLGISLDT 698
>gi|449529068|ref|XP_004171523.1| PREDICTED: beta-galactosidase 16-like [Cucumis sativus]
Length = 756
Score = 686 bits (1771), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 378/805 (46%), Positives = 492/805 (61%), Gaps = 62/805 (7%)
Query: 57 MWPDLIQKSKDGGLDVIETYVFWNLHEPVRNQYNFEGRYDLVKFVKLVAEAGLYAHLRIG 116
MWP LI K+K+GG+DVI+TYVFWNLHEP + Y F GR D+V+FVK + GLYA LRIG
Sbjct: 1 MWPSLIAKAKEGGIDVIQTYVFWNLHEPQQGTYEFSGRRDIVRFVKEIQAQGLYACLRIG 60
Query: 117 PYVCAEWNFGGFPLWLHFIPGIQFRTDNEPFKAEMQRFTAKIVDMMKQEKLYASQGGPII 176
P++ AEW++GG P WLH + GI +R+DNEPFK MQ FT KIV+MMK E LYASQGGPII
Sbjct: 61 PFIEAEWSYGGLPFWLHDVLGIVYRSDNEPFKLHMQNFTTKIVNMMKSEGLYASQGGPII 120
Query: 177 LSQIENEYGNIDSAYGAAGKSYIKWAAGMALSLDTGVPWVMCQQSDAPDPIINTCNGFYC 236
LSQIENEY +++A+G G Y++WAA MA+SL TGVPW MC+Q+DAPDP+INTCNG C
Sbjct: 121 LSQIENEYTLVEAAFGEKGPPYVQWAAKMAVSLQTGVPWSMCKQNDAPDPVINTCNGMRC 180
Query: 237 -DQFT-PNSNNKPKMWTENWSGWFLSFGGAVPYRPVEDLAFAVARFF-QRGGTFQNYYMY 293
+ FT PNS NKP +WTENW+ ++ ++G R E++AF VA F + GT+ NYYMY
Sbjct: 181 GETFTGPNSPNKPSIWTENWTSFYQTYGEEPYIRSAEEIAFHVALFIAAKNGTYVNYYMY 240
Query: 294 HGGTNFDRTSGGPFISTSYDYDAPLDEYGLIRQPKWGHLKDLHKAIKLCEAALVATDPTY 353
HGGTNF R++ I+ YD +PLDEYGL R+PKWGHLK+LH A+KLC L+ +
Sbjct: 241 HGGTNFGRSASAFMITGYYD-QSPLDEYGLTREPKWGHLKELHAAVKLCSTPLLTGTKSN 299
Query: 354 PSLGPNLEATVYKTGSGLCSAFLANIGTNSDVTVKFNGNSYLLPAWSVSILPDCKNVVFN 413
SLG ++EA V+KT S C+AFL N G D V F +Y LP S+SILPDCKNV FN
Sbjct: 300 FSLGQSVEAIVFKTESNECAAFLVNRGA-IDSNVLFQNVTYELPLGSISILPDCKNVAFN 358
Query: 414 TAKINSVTLVPSFSRQSLQVAADSSDAIGSGWSYINEPVGISKDDAFTKPGLLEQINTTA 473
T +++ V +R + A D + W EP+ D LLE + TT
Sbjct: 359 TRRVS----VQHNTRSMM--AVQKFDLL--EWEEFKEPIPNIDDTELRANELLEHMGTTK 410
Query: 474 DQSDYLWYSLSTNIKADEPLLEDGSKTVLHVQSLGHALHAFINGKLVGSGYGSSSNAKVT 533
D+SDYLWY+ ++ D P S+ L V S HALHAF+NG GS +G +
Sbjct: 411 DRSDYLWYTF--RVQQDSP----DSQQTLEVDSRAHALHAFVNGDYAGSAHGIYKEKGFS 464
Query: 534 VDFPIALAPGKNTFDLLSLTVGLQNYGAFYEKTGAGITGPVQLKGSGNGTNIDLSSQQWT 593
+ I L G N LLS+ VGL + GAF E AG+ V ++G D S Q W
Sbjct: 465 LAKNITLRNGINNISLLSVMVGLPDSGAFLETRVAGLRR-VGIQGE------DFSEQHWG 517
Query: 594 YQTGLKGE--ELNFPSGSS-TQWDSKSTLPKLQPLVWYKTTFDAPAGSEPVAIDFTGMGK 650
Y+ GL GE ++ +GSS QW QPL WYKT FDAP G +P+A++ MGK
Sbjct: 518 YKVGLSGEQSQIFLDTGSSNVQWSRLGN--SSQPLTWYKTQFDAPPGDDPIALNLGSMGK 575
Query: 651 GEAWVNGQSIGRYWPTYVSQNGGCTDSCNYRGAYSSNKCLKNCGKPSQSLYHVPRSWLKS 710
G WVNG+ IGRYW ++++ G+PSQ Y+VPRS+LK
Sbjct: 576 GAVWVNGRGIGRYWVSFLTPK----------------------GEPSQKWYNVPRSFLKP 613
Query: 711 SGNTLVLFEEIGGDPTKISFVTKQLGSSLCSHVTDSH-PLPVDMWGSDSKIQRK-----P 764
+ N LV+ EE G+P +IS + L + C V++SH PL G+ + R+
Sbjct: 614 TDNQLVILEEETGNPVEIS-LDSVLITKTCGQVSESHYPLVASWMGAKKQKVRRVKNRTR 672
Query: 765 GPVLSLECPNPNQVISSIKFASFGTPLGTCGSFSRGRCSSARSLSVVRQACVGSKSCSIG 824
P + L CP+ + IS+I FASFGTP G C S++ G C S S ++V AC+G CSI
Sbjct: 673 RPKVQLSCPSKKK-ISNILFASFGTPSGDCQSYAIGLCHSPNSRAIVEHACLGRAKCSIP 731
Query: 825 VS-VNTFGDPCKGVMKSLAVEASCT 848
+S +N GDPC V K+L V+A CT
Sbjct: 732 ISNLNFRGDPCPHVTKTLLVDAQCT 756
>gi|152013366|sp|Q9SCU8.2|BGL14_ARATH RecName: Full=Beta-galactosidase 14; Short=Lactase 14; Flags:
Precursor
Length = 887
Score = 686 bits (1770), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 365/856 (42%), Positives = 511/856 (59%), Gaps = 52/856 (6%)
Query: 6 ILLLVLCWGFVVLATTSFGANVTYDHRAVVIGGKRRVLISGSIHYPRSTPEMWPDLIQKS 65
+L++ LC VTYD +++I GKR +L SGS+HYPRSTP MWP +I K+
Sbjct: 20 LLVISLCSKASSHDDEKKKKGVTYDGTSLIINGKRELLFSGSVHYPRSTPHMWPSIIDKA 79
Query: 66 KDGGLDVIETYVFWNLHEPVRNQYNFEGRYDLVKFVKLVAEAGLYAHLRIGPYVCAEWNF 125
+ GGL+ I+TYVFWN+HEP + +Y+F+GR+DLVKF+KL+ E GLY LR+GP++ AEWN
Sbjct: 80 RIGGLNTIQTYVFWNVHEPEQGKYDFKGRFDLVKFIKLIHEKGLYVTLRLGPFIQAEWNH 139
Query: 126 GGFPLWLHFIPGIQFRTDNEPFKAEMQRFTAKIVDMMKQEKLYASQGGPIILSQIENEYG 185
GG P WL +P + FRT+NEPFK +R+ KI+ MMK+EKL+ASQGGPIIL QIENEY
Sbjct: 140 GGLPYWLREVPDVYFRTNNEPFKEHTERYVRKILGMMKEEKLFASQGGPIILGQIENEYN 199
Query: 186 NIDSAYGAAGKSYIKWAAGMALSLDTGVPWVMCQQSDAPDPIINTCNGFYC-DQFT-PNS 243
+ AY G+ YIKWAA + S++ G+PWVMC+Q+DAP +IN CNG +C D F PN
Sbjct: 200 AVQLAYKENGEKYIKWAANLVESMNLGIPWVMCKQNDAPGNLINACNGRHCGDTFPGPNR 259
Query: 244 NNKPKMWTENWSGWFLSFGGAVPYRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNFDRTS 303
++KP +WTENW+ F FG R VED+AF+VAR+F + G+ NYYMYHGGTNF RTS
Sbjct: 260 HDKPSLWTENWTTQFRVFGDPPTQRTVEDIAFSVARYFSKNGSHVNYYMYHGGTNFGRTS 319
Query: 304 GGPFISTSYDYDAPLDEYGLIRQPKWGHLKDLHKAIKLCEAALVATDPTYPSLGPNLEAT 363
F++T Y DAPLDE+GL + PK+GHLK +H+A++LC+ AL +LGP+ E
Sbjct: 320 AH-FVTTRYYDDAPLDEFGLEKAPKYGHLKHVHRALRLCKKALFWGQLRAQTLGPDTEVR 378
Query: 364 VYKT-GSGLCSAFLANIGTNSDVTVKFNGNSYLLPAWSVSILPDCKNVVFNTAKINSVTL 422
Y+ G+ +C+AFL+N T T+KF G Y+LP+ S+SILPDCK VV+NTA+I
Sbjct: 379 YYEQPGTKVCAAFLSNNNTRDTNTIKFKGQDYVLPSRSISILPDCKTVVYNTAQI----- 433
Query: 423 VPSFSRQSLQVAADSSDAIGSGWSYINEPVGISKDDAFTKPGLLEQINTTADQSDYLWYS 482
V S + + +S + N P + D PG L + T D++DY WY+
Sbjct: 434 VAQHSWRDFVKSEKTSKGLKFEMFSENIPSLLDGDSLI--PGELYYL--TKDKTDYAWYT 489
Query: 483 LSTNIKADEPLLEDGSKTVLHVQSLGHALHAFINGKLVGSGYGSSSNAKVTVDFPIALAP 542
S I D+ + G KT+L V SLGHAL ++NG+ G +G P+
Sbjct: 490 TSVKIDEDDFPDQKGLKTILRVASLGHALIVYVNGEYAGKAHGRHEMKSFEFAKPVNFKT 549
Query: 543 GKNTFDLLSLTVGLQNYGAFYEKTGAGITGPVQLKGSGNGTNIDLSSQQWTYQTGLKGE- 601
G N +L + GL + G++ E AG + + G +GT + +W + GL+GE
Sbjct: 550 GDNRISILGVLTGLPDSGSYMEHRFAGPRA-ISIIGLKSGTRDLTENNEWGHLAGLEGEK 608
Query: 602 -ELNFPSGS-STQWDSKSTLPKLQPLVWYKTTFDAPAGSEPVAIDFTGMGKGEAWVNGQS 659
E+ GS +W+ K +PL WYKT F+ P G VAI MGKG WVNG
Sbjct: 609 KEVYTEEGSKKVKWEKDG---KRKPLTWYKTYFETPEGVNAVAIRMKAMGKGLIWVNGIG 665
Query: 660 IGRYWPTYVSQNGGCTDSCNYRGAYSSNKCLKNCGKPSQSLYHVPRSWLK--SSGNTLVL 717
+GRYW +++S G+P+Q+ YH+PRS++K N LV+
Sbjct: 666 VGRYWMSFLSP----------------------LGEPTQTEYHIPRSFMKGEKKKNMLVI 703
Query: 718 FEEIGGDPTK-ISFVTKQLGSSLCSHVTDSHPLPVDMWGSDSK--IQRKPGPVLS--LEC 772
EE G + I FV ++CS+V + +P+ V W + + R L + C
Sbjct: 704 LEEEPGVKLESIDFVLVNR-DTICSNVGEDYPVSVKSWKREGPKIVSRSKDMRLKAVMRC 762
Query: 773 PNPNQVISSIKFASFGTPLGTCGSFSRGRCSSARSLSVVRQACVGSKSCSIGVSVNTFGD 832
P P + + ++FASFG P GTCG+F+ G+CS+++S VV + C+G CSI V+ TFGD
Sbjct: 763 P-PEKQMVEVQFASFGDPTGTCGNFTMGKCSASKSKEVVEKECLGRNYCSIVVARETFGD 821
Query: 833 P-CKGVMKSLAVEASC 847
C ++K+LAV+ C
Sbjct: 822 KGCPEIVKTLAVQVKC 837
>gi|6686900|emb|CAB64750.1| putative beta-galactosidase [Arabidopsis thaliana]
Length = 887
Score = 684 bits (1765), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 363/856 (42%), Positives = 511/856 (59%), Gaps = 52/856 (6%)
Query: 6 ILLLVLCWGFVVLATTSFGANVTYDHRAVVIGGKRRVLISGSIHYPRSTPEMWPDLIQKS 65
+L++ LC VTYD +++I GKR + SGS+HYPRSTP+MWP +I K+
Sbjct: 20 LLVISLCSKASSHDDEKKKKGVTYDGTSLIINGKRELFFSGSVHYPRSTPDMWPSIIDKA 79
Query: 66 KDGGLDVIETYVFWNLHEPVRNQYNFEGRYDLVKFVKLVAEAGLYAHLRIGPYVCAEWNF 125
+ GGL+ I+TYVFWN+HEP + +Y+F+GR+DLVKF+KL+ E GLY LR+GP++ AEWN
Sbjct: 80 RIGGLNTIQTYVFWNVHEPEQGKYDFKGRFDLVKFIKLIHEKGLYVTLRLGPFIQAEWNH 139
Query: 126 GGFPLWLHFIPGIQFRTDNEPFKAEMQRFTAKIVDMMKQEKLYASQGGPIILSQIENEYG 185
GG P WL +P + FRT+NEPFK +R+ KI+ MMK+EKL+ASQGGPIIL QIENEY
Sbjct: 140 GGLPYWLREVPDVYFRTNNEPFKEHTERYVRKILGMMKEEKLFASQGGPIILGQIENEYN 199
Query: 186 NIDSAYGAAGKSYIKWAAGMALSLDTGVPWVMCQQSDAPDPIINTCNGFYC-DQFT-PNS 243
+ AY G+ YIKWAA + S++ G+PWVMC+Q+DAP +IN CNG +C D F PN
Sbjct: 200 AVQLAYKENGEKYIKWAANLVESMNLGIPWVMCKQNDAPGNLINACNGRHCGDTFPGPNR 259
Query: 244 NNKPKMWTENWSGWFLSFGGAVPYRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNFDRTS 303
++KP +WTENW+ F FG R ED+AF+VAR+F + G+ NYYMYHGGTNF RTS
Sbjct: 260 HDKPSLWTENWTTQFRVFGDPPTQRTAEDIAFSVARYFSKNGSHVNYYMYHGGTNFGRTS 319
Query: 304 GGPFISTSYDYDAPLDEYGLIRQPKWGHLKDLHKAIKLCEAALVATDPTYPSLGPNLEAT 363
F++T Y DAPLDE+GL + PK+GHLK +H+A++LC+ AL +LGP+ E
Sbjct: 320 AH-FVTTRYYDDAPLDEFGLEKAPKYGHLKHVHRALRLCKKALFWGQLRAQTLGPDTEVR 378
Query: 364 VYKT-GSGLCSAFLANIGTNSDVTVKFNGNSYLLPAWSVSILPDCKNVVFNTAKINSVTL 422
Y+ G+ +C+AFL+N T T+KF G Y+LP+ S+SILPDCK VV+NTA+I
Sbjct: 379 YYEQPGTKVCAAFLSNNNTRDTNTIKFKGQDYVLPSRSISILPDCKTVVYNTAQI----- 433
Query: 423 VPSFSRQSLQVAADSSDAIGSGWSYINEPVGISKDDAFTKPGLLEQINTTADQSDYLWYS 482
V S + + +S + N P + D PG L + T D++DY WY+
Sbjct: 434 VAQHSWRDFVKSEKTSKGLKFEMFSENIPSLLDGDSLI--PGELYYL--TKDKTDYAWYT 489
Query: 483 LSTNIKADEPLLEDGSKTVLHVQSLGHALHAFINGKLVGSGYGSSSNAKVTVDFPIALAP 542
S I D+ + G KT+L V SLGHAL ++NG+ G +G P+
Sbjct: 490 TSVKIDEDDFPDQKGLKTILRVASLGHALIVYVNGEYAGKAHGRHEMKSFEFAKPVNFKT 549
Query: 543 GKNTFDLLSLTVGLQNYGAFYEKTGAGITGPVQLKGSGNGTNIDLSSQQWTYQTGLKGE- 601
G N +L + GL + G++ E AG + + G +GT + +W + GL+GE
Sbjct: 550 GDNRISILGVLTGLPDSGSYMEHRFAGPRA-ISIIGLKSGTRDLTENNEWGHLAGLEGEK 608
Query: 602 -ELNFPSGS-STQWDSKSTLPKLQPLVWYKTTFDAPAGSEPVAIDFTGMGKGEAWVNGQS 659
E+ GS +W+ + +PL WYKT F+ P G VAI GMGKG WVNG
Sbjct: 609 KEVYTEEGSKKVKWEKDG---ERKPLTWYKTYFETPEGVNAVAIRMKGMGKGLIWVNGIG 665
Query: 660 IGRYWPTYVSQNGGCTDSCNYRGAYSSNKCLKNCGKPSQSLYHVPRSWLK--SSGNTLVL 717
+GRYW +++S G+P+Q+ YH+PRS++K N LV+
Sbjct: 666 VGRYWMSFLSP----------------------LGEPTQTEYHIPRSFMKGEKKKNMLVI 703
Query: 718 FEEIGGDPTK-ISFVTKQLGSSLCSHVTDSHPLPVDMWGSDSK--IQRKPGPVLS--LEC 772
EE G + I FV ++CS+V + +P+ V W + + R L + C
Sbjct: 704 LEEEPGVKLESIDFVLVNR-DTICSNVGEDYPVSVKSWKREGPKIVSRSKDMRLKAVMRC 762
Query: 773 PNPNQVISSIKFASFGTPLGTCGSFSRGRCSSARSLSVVRQACVGSKSCSIGVSVNTFGD 832
P P + + ++FASFG P GTCG+F+ G+CS+++S VV + C+G CSI V+ TFGD
Sbjct: 763 P-PEKQMVEVQFASFGDPTGTCGNFTMGKCSASKSKEVVEKECLGRNYCSIVVARETFGD 821
Query: 833 P-CKGVMKSLAVEASC 847
C ++K+LAV+ C
Sbjct: 822 KGCPEIVKTLAVQVKC 837
>gi|219887949|gb|ACL54349.1| unknown [Zea mays]
gi|414870186|tpg|DAA48743.1| TPA: beta-galactosidase [Zea mays]
Length = 850
Score = 684 bits (1765), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 355/842 (42%), Positives = 502/842 (59%), Gaps = 50/842 (5%)
Query: 21 TSFGANVTYDHRAVVIGGKRRVLISGSIHYPRSTPEMWPDLIQKSKDGGLDVIETYVFWN 80
T G V+YD R+++ G R + +SGSIHYPRS P+MWP+LI K+K+GGL+ IETYVFWN
Sbjct: 37 TRNGTVVSYDRRSLMFDGHREIFLSGSIHYPRSPPDMWPELIAKAKEGGLNTIETYVFWN 96
Query: 81 LHEPVRNQYNFEGRYDLVKFVKLVAEAGLYAHLRIGPYVCAEWNFGGFPLWLHFIPGIQF 140
+HEP + ++NFEG+ D+V+F +L+ E +YA +R+GP++ AEWN GG P WL IP I F
Sbjct: 97 IHEPEKGEFNFEGQNDVVRFFQLIQEHDMYAMVRLGPFIQAEWNHGGLPYWLREIPDIVF 156
Query: 141 RTDNEPFKAEMQRFTAKIVDMMKQEKLYASQGGPIILSQIENEYGNIDSAYGAAGKSYIK 200
RT+NEP+K M+ F I+ +K L+ASQGGPIIL+QIENEY ++++A+ G YI
Sbjct: 157 RTNNEPYKMHMETFVKIIIKRLKDANLFASQGGPIILAQIENEYQHMEAAFKDEGTKYIN 216
Query: 201 WAAGMALSLDTGVPWVMCQQSDAPDPIINTCNGFYCDQFTPNSNNK--PKMWTENWSGWF 258
WAA MA+S + G+PW+MC+Q+ AP +I TCNG C P NK P +WTENW+ +
Sbjct: 217 WAAKMAISTNIGIPWIMCKQTKAPSDVIPTCNGRNCGDTWPGPTNKSMPLLWTENWTAQY 276
Query: 259 LSFGGAVPYRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNFDRTSGGPFISTSYDYDAPL 318
FG R ED+AFAVARFF GGT NYYMYHGGTNF RTS F+ Y +APL
Sbjct: 277 RVFGDPPSQRSAEDIAFAVARFFSVGGTLANYYMYHGGTNFGRTSAA-FVMPKYYDEAPL 335
Query: 319 DEYGLIRQPKWGHLKDLHKAIKLCEAALVATDPTYPSLGPNLEATVYKT-GSGLCSAFLA 377
DE+GL ++PKWGHL+DLH+A+KLC+ AL+ P+ LG LEA V++ +C AFL+
Sbjct: 336 DEFGLYKEPKWGHLRDLHQALKLCKKALLWGTPSTEKLGKQLEARVFEMPEQKVCVAFLS 395
Query: 378 NIGTNSDVTVKFNGNSYLLPAWSVSILPDCKNVVFNTAKINSVTLVPSFSRQSLQVAADS 437
N T D T+ F G Y +P S+S+L DC+ VVF T +N+ ++++ A +
Sbjct: 396 NHNTKDDATMTFRGRPYFVPRHSISVLADCETVVFGTQHVNA-----QHNQRTFHFADQT 450
Query: 438 SDAIGSGWSYI---NEPVGISKDDAFTKPGLLEQINTTADQSDYLWYSLSTNIKADEPLL 494
A + W N P K G L N T D++DY+WY+ S ++AD+ +
Sbjct: 451 --AQNNVWEMFDGENVPKYKQAKIRLRKAGDL--YNLTKDKTDYVWYTSSFKLEADDMPI 506
Query: 495 EDGSKTVLHVQSLGHALHAFINGKLVGSGYGSSSNAKVTVDFPIALAPGKNTFDLLSLTV 554
KTVL V S GHA AF+N K VG G+G+ N T++ P+ L G N +L+ ++
Sbjct: 507 RSDIKTVLEVNSHGHASVAFVNNKFVGCGHGTKMNKAFTLEKPMDLKKGVNHVAVLASSM 566
Query: 555 GLQNYGAFYEKTGAGITGPVQLKGSGNGTNIDLSSQQWTYQTGLKGEELNFPSGSSTQWD 614
G+ + GA+ E AG+ VQ+ G GT +DL++ W + GL GE +
Sbjct: 567 GMTDSGAYMEHRLAGVDR-VQITGLNAGT-LDLTNNGWGHIVGLVGERKQIYTDKGMGSV 624
Query: 615 SKSTLPKLQPLVWYKTTFDAPAGSEPVAIDFTGMGKGEAWVNGQSIGRYWPTYVSQNGGC 674
+ +PL WYK FD P+G +PV +D + MGKG +VNGQ IGRYW +Y
Sbjct: 625 TWKPAMNDRPLTWYKRHFDMPSGEDPVVLDMSTMGKGMMFVNGQGIGRYWISYKHA---- 680
Query: 675 TDSCNYRGAYSSNKCLKNCGKPSQSLYHVPRSWLKSSGNTLVLFEEIGGDPTKISFVTKQ 734
G+PSQ LYHVPRS+L+ N LVLFEE G P I +T +
Sbjct: 681 ------------------LGRPSQQLYHVPRSFLRQKDNMLVLFEEEFGRPDAIMILTVK 722
Query: 735 LGSSLCSHVTDSHPLPVDMW-GSDSKIQRKPG-----PVLSLECPNPNQVISSIKFASFG 788
++C+ +++ +P + W DS+I K +L CP P ++I + FAS+G
Sbjct: 723 R-DNICTFISERNPAHIMSWERKDSQITAKANADDLRARAALACP-PKKLIQQVVFASYG 780
Query: 789 TPLGTCGSFSRGRCSSARSLSVVRQACVGSKSCSIGVSVNTFGDP--CKGVMKSLAVEAS 846
P G CG+++ G C + R+ VV +AC+G + C++ V+ + +G C G +LAV+A
Sbjct: 781 NPAGICGNYTVGSCHTPRAKEVVEKACLGKRVCTLPVAADVYGGDANCSGTTATLAVQAK 840
Query: 847 CT 848
C+
Sbjct: 841 CS 842
>gi|356518551|ref|XP_003527942.1| PREDICTED: beta-galactosidase 16-like [Glycine max]
Length = 697
Score = 682 bits (1759), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 361/728 (49%), Positives = 468/728 (64%), Gaps = 54/728 (7%)
Query: 16 VVLATTSFGANVTYDHRAVVIGGKRRVLISGSIHYPRSTPEMWPDLIQKSKDGGLDVIET 75
V + TT +G NVTYD R+++I G+ ++L SGSIHYPRSTP+MWP+LI K+K+GGLDVI+T
Sbjct: 17 VFIGTTVYGGNVTYDGRSLIIDGQHKILFSGSIHYPRSTPQMWPNLIAKAKEGGLDVIQT 76
Query: 76 YVFWNLHEPVRNQYNFEGRYDLVKFVKLVAEAGLYAHLRIGPYVCAEWNFGGFPLWLHFI 135
YVFWNLHEP + QY+F G ++V+F+K + GLY LRIGPY+ +E +GG PLWLH I
Sbjct: 77 YVFWNLHEPQQGQYDFRGMRNIVRFIKEIQAQGLYVTLRIGPYIESECTYGGLPLWLHDI 136
Query: 136 PGIQFRTDNEPFKAEMQRFTAKIVDMMKQEKLYASQGGPIILSQIENEYGNIDSAYGAAG 195
PGI FR+DNE FK MQ+F+AKIV++MK L+ASQGGPIILSQIENEYGN++ A+ G
Sbjct: 137 PGIVFRSDNEQFKFHMQKFSAKIVNLMKSANLFASQGGPIILSQIENEYGNVEGAFHEKG 196
Query: 196 KSYIKWAAGMALSLDTGVPWVMCQQSDAPDPIINTCNGFYCDQFT--PNSNNKPKMWTEN 253
SYI+WAA MA+ L TGVPWVMC+Q +APDP+INTCNG C + PNS NKP +WTEN
Sbjct: 197 LSYIRWAAQMAVGLQTGVPWVMCKQDNAPDPVINTCNGMQCGKTFKGPNSPNKPSLWTEN 256
Query: 254 WSGWFLSFGGAVPY-RPVEDLAFAVARFFQRGGTFQNYYMYHGGTNFDRTSGGPFISTSY 312
W+ ++ F G VPY R ED+A+ VA F + G++ NYYMYHGGTNFDR + F+ T+Y
Sbjct: 257 WTSFYQVF-GEVPYIRSAEDIAYNVALFIAKRGSYVNYYMYHGGTNFDRIASA-FVITAY 314
Query: 313 DYDAPLDEYGLIRQPKWGHLKDLHKAIKLCEAALVATDPTYPSLGPNLEATVYKTGSGLC 372
+APLDEYGL+R+PKWGHLK+LH AIK C +++ T SLG A V+K S C
Sbjct: 315 YDEAPLDEYGLVREPKWGHLKELHAAIKSCSNSILHGTQTSFSLGTQQNAYVFKRSSIEC 374
Query: 373 SAFLANIGTNSDVTVKFNGNSYLLPAWSVSILPDCKNVVFNTAKINSVTLVPSFSRQSLQ 432
+AFL N S VT++F Y LP S+SILPDCKNV FNTAK++ + + Q
Sbjct: 375 AAFLENTEDQS-VTIQFQNIPYQLPPNSISILPDCKNVAFNTAKVS----IQNARAMKSQ 429
Query: 433 VAADSSDAIGSGWSYINEPVGISKDDAFTKPGLLEQINTTADQSDYLWYSLSTNIKADEP 492
+ +S++ W E + D + LL+QI+TT D SDYLWY+ + + P
Sbjct: 430 LEFNSAET----WKVYKEAIPSFGDTSLRANTLLDQISTTKDTSDYLWYTF--RLYDNSP 483
Query: 493 LLEDGSKTVLHVQSLGHALHAFINGKLVGSGYGSSSNAKVTVDFPIALAPGKNTFDLLSL 552
++++L S GH LHAF+NG LVGS +GS N ++ + L G N LS
Sbjct: 484 ----NAQSILSAYSHGHVLHAFVNGNLVGSIHGSHKNLSFVMENKLNLINGMNNISFLSA 539
Query: 553 TVGLQNYGAFYEKTGAGITGPVQLKGSGNGTNIDLSSQQWTYQTGLKGEELNF--PSGSS 610
TVGL N GA+ E+ AG+ LK G D ++Q W YQ GL GE+L SGSS
Sbjct: 540 TVGLPNSGAYLERRVAGLRS---LKVQGR----DFTNQAWGYQIGLLGEKLQIYTASGSS 592
Query: 611 -TQWDSKSTLPKLQPLVWYKTTFDAPAGSEPVAIDFTGMGKGEAWVNGQSIGRYWPTYVS 669
QW+S + K PL WYKTTFDAP G++PV ++ MGKG W+NGQ IGRYW ++ +
Sbjct: 593 KVQWESFQSSTK--PLTWYKTTFDAPVGNDPVVLNLGSMGKGYTWINGQGIGRYWVSFHT 650
Query: 670 QNGGCTDSCNYRGAYSSNKCLKNCGKPSQSLYHVPRSWLKSSGNTLVLFEEIGGDPTKIS 729
G PSQ YH+PRS LKS+GN LVL EE G+P I+
Sbjct: 651 PQ----------------------GTPSQKWYHIPRSLLKSTGNLLVLLEEETGNPLGIT 688
Query: 730 FVTKQLGS 737
T + S
Sbjct: 689 LDTVYITS 696
>gi|227053532|gb|ACP18874.1| beta-galactosidase pBG(b) [Carica papaya]
Length = 514
Score = 681 bits (1757), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 327/504 (64%), Positives = 382/504 (75%), Gaps = 25/504 (4%)
Query: 25 ANVTYDHRAVVIGGKRRVLISGSIHYPRSTPEMWPDLIQKSKDGGLDVIETYVFWNLHEP 84
A+V+YDH+A+ I GKRR+L+SGSIHYPRSTPEMWPDLIQK+K+GGLDVI+TYVFWN HEP
Sbjct: 19 ASVSYDHKAITINGKRRILLSGSIHYPRSTPEMWPDLIQKAKEGGLDVIQTYVFWNGHEP 78
Query: 85 VRNQYNFEGRYDLVKFVKLVAEAGLYAHLRIGPYVCAEWNFGGFPLWLHFIPGIQFRTDN 144
+Y F G YDLV+F+KLV +AGLY HLRIGPYVCAEWNFGGFP+WL +IPGI FRT+N
Sbjct: 79 SPGKYYFGGNYDLVRFIKLVKQAGLYVHLRIGPYVCAEWNFGGFPVWLKYIPGIAFRTNN 138
Query: 145 EPFKAEMQRFTAKIVDMMKQEKLYASQGGPIILSQIENEYGNIDSAYGAAGKSYIKWAAG 204
PFKA MQRFT KIVDMMK E L+ SQGGPIILSQIENEYG ++ GAAG++Y +WAA
Sbjct: 139 GPFKAYMQRFTKKIVDMMKAEGLFESQGGPIILSQIENEYGPMEYELGAAGRAYSQWAAQ 198
Query: 205 MALSLDTGVPWVMCQQSDAPDPIINTCNGFYCDQFTPNSNNKPKMWTENWSGWFLSFGGA 264
MA+ L TGVPWVMC+Q DAPDPIIN+CNGFYCD F+PN KPKMWTE W+GWF FGGA
Sbjct: 199 MAVGLGTGVPWVMCKQDDAPDPIINSCNGFYCDYFSPNKAYKPKMWTEAWTGWFTEFGGA 258
Query: 265 VPYRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNFDRTSGGPFISTSYDYDAPLDEYGLI 324
VPYRPVEDLAF+VARF Q+GG+F NYYMYHGGTNF RT+GGPFI+TSYDYDAPLDEYGL+
Sbjct: 259 VPYRPVEDLAFSVARFIQKGGSFINYYMYHGGTNFGRTAGGPFIATSYDYDAPLDEYGLV 318
Query: 325 RQPKWGHLKDLHKAIKLCEAALVATDPTYPSLGPNLEATVYKTGSGLCSAFLANIGTNSD 384
RQPKWGHLKDLH+AIKLCE ALV+ DP+ LG EA V+K+ G C+AFLAN S
Sbjct: 319 RQPKWGHLKDLHRAIKLCEPALVSGDPSVMPLGRFQEAHVFKSKYGHCAAFLANYNPRSF 378
Query: 385 VTVKFNGNSYLLPAWSVSILPDCKNVVFNTAKINS----VTLVP-----SFSRQSLQVAA 435
V F Y LP WS+SILPDCKN V+NTA++ + + +VP +FS Q+ A
Sbjct: 379 AKVAFGNMHYNLPPWSISILPDCKNTVYNTARVGAQSARMKMVPVPIHGAFSWQAYNEEA 438
Query: 436 DSSDAIGSGWSYINEPVGISKDDAFTKPGLLEQINTTADQSDYLWYSLSTNIKADEPLLE 495
SS+ + +FT GL+EQINTT D SDYLWYS I DE L+
Sbjct: 439 PSSNG----------------ERSFTTVGLVEQINTTRDVSDYLWYSTDVKIDPDEGFLK 482
Query: 496 DGSKTVLHVQSLGHALHAFINGKL 519
G L V S GHALH F+N +L
Sbjct: 483 TGKYPTLTVLSAGHALHVFVNDQL 506
>gi|356527530|ref|XP_003532362.1| PREDICTED: beta-galactosidase 6-like [Glycine max]
Length = 673
Score = 681 bits (1757), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 358/714 (50%), Positives = 448/714 (62%), Gaps = 55/714 (7%)
Query: 25 ANVTYDHRAVVIGGKRRVLISGSIHYPRSTPEMWPDLIQKSKDGGLDVIETYVFWNLHEP 84
A VTYD R+++I G+R++L SGSIHYPRSTP+MWP LI K+K+GGLDVI+TYVFWNLHEP
Sbjct: 2 AEVTYDGRSLIIDGQRKILFSGSIHYPRSTPQMWPALISKAKEGGLDVIQTYVFWNLHEP 61
Query: 85 VRNQYNFEGRYDLVKFVKLVAEAGLYAHLRIGPYVCAEWNFGGFPLWLHFIPGIQFRTDN 144
QY+F GRYDLV+F+K + GLY LRIGPY+ +EW +GGFP WLH +P I +RTDN
Sbjct: 62 QFGQYDFSGRYDLVRFIKEIQVQGLYVCLRIGPYIESEWTYGGFPFWLHDVPAIVYRTDN 121
Query: 145 EPFKAEMQRFTAKIVDMMKQEKLYASQGGPIILSQIENEYGNIDSAYGAAGKSYIKWAAG 204
+PFK MQ FT KIV MM+ E LYASQGGPIILSQIENEY N++ A+G G Y++WAA
Sbjct: 122 QPFKLYMQNFTTKIVSMMQSEGLYASQGGPIILSQIENEYQNVEKAFGEDGSRYVQWAAE 181
Query: 205 MALSLDTGVPWVMCQQSDAPDPIINTCNGFYC-DQFT-PNSNNKPKMWTENWSGWFLSFG 262
MA+ L TGVPW+MC+Q+DAPDP+INTCNG C + FT PNS NKP WTENW+ ++ +G
Sbjct: 182 MAVGLKTGVPWLMCKQTDAPDPLINTCNGMRCGETFTGPNSPNKPAFWTENWTSFYQVYG 241
Query: 263 GAVPYRPVEDLAFAVARFFQR-GGTFQNYYMYHGGTNFDRTSGGPFISTSYDYDAPLDEY 321
G R ED+AF V F R G++ NYYMYHGGTN RTS I++ YD APLDEY
Sbjct: 242 GEPYIRSAEDIAFHVTLFIARKNGSYVNYYMYHGGTNLGRTSSSYVITSYYD-QAPLDEY 300
Query: 322 GLIRQPKWGHLKDLHKAIKLCEAALVATDPTYPSLGPNLEATVYKTGSGLCSAFLANIGT 381
GL+RQPKWGHLK+LH AIK C L+ + SLG E V++ G C AFL N
Sbjct: 301 GLLRQPKWGHLKELHAAIKSCSTTLLEGKQSNFSLGQLQEGYVFEE-EGKCVAFLVNNDH 359
Query: 382 NSDVTVKFNGNSYLLPAWSVSILPDCKNVVFNTAKINSVTLVPSFSRQSLQVAADSSDAI 441
TV+F SY LP+ S+SILPDC+NV FNTA +N+ S R + + SS
Sbjct: 360 VKMFTVQFRNRSYELPSKSISILPDCQNVTFNTATVNT----KSNRRMTSTIQTFSS--- 412
Query: 442 GSGWSYINEPVGISKDDAFTKPGLLEQINTTADQSDYLWYSLSTNIKADEPLLEDGSKTV 501
W + + LLEQ+N T D+SDYLWY+L S++
Sbjct: 413 ADKWEQFQDVIPNFDQTTLISNSLLEQMNVTKDKSDYLWYTL--------------SESK 458
Query: 502 LHVQSLGHALHAFINGKLVGSGYGSSSNAKVTVDFPIALAPGKNTFDLLSLTVGLQNYGA 561
L QS H HAF +G +G +GS T P+ L G N +LS+ VGL + GA
Sbjct: 459 LTAQSAAHVTHAFADGTYLGGAHGSHDVKSFTTQVPLKLNEGTNNISILSVMVGLPDAGA 518
Query: 562 FYEKTGAGITGPVQLKGSGNGTNIDLSSQQWTYQTGLKGEELNF---PSGSSTQWDSKST 618
F E+ AG+T V+++ S + DL++ W YQ GL GE+L S SS QW
Sbjct: 519 FLERRFAGLTA-VEIQCSEE--SYDLTNSTWGYQVGLLGEQLEIYEEKSNSSIQWSPLGN 575
Query: 619 LPKLQPLVWYKTTFDAPAGSEPVAIDFTGMGKGEAWVNGQSIGRYWPTYVSQNGGCTDSC 678
Q L WYKT FD+P G EPVA++ MGKG+AWVNG+SIGRYW ++
Sbjct: 576 TCN-QTLTWYKTAFDSPKGDEPVALNLESMGKGQAWVNGESIGRYWISFHDSK------- 627
Query: 679 NYRGAYSSNKCLKNCGKPSQSLYHVPRSWLKSSGNTLVLFEEIGGDPTKISFVT 732
G+PSQ+LYHVPRS+LK GN+LVLFEE GG+P IS T
Sbjct: 628 ---------------GQPSQTLYHVPRSFLKDIGNSLVLFEEEGGNPLHISLDT 666
>gi|320170654|gb|EFW47553.1| beta-D-galactosidase [Capsaspora owczarzaki ATCC 30864]
Length = 830
Score = 681 bits (1756), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 362/854 (42%), Positives = 506/854 (59%), Gaps = 64/854 (7%)
Query: 17 VLATTSFGANVTYDHRAVVIGGKRRVLISGSIHYPRSTPEMWPDLIQKSKDGGLDVIETY 76
V+AT+++ NVTYD RA++I G+RR+L+SGSIHYPRSTP+MWP+L ++K G+DVI+TY
Sbjct: 17 VMATSAYAMNVTYDSRALLIDGRRRLLVSGSIHYPRSTPDMWPELFARAKANGIDVIQTY 76
Query: 77 VFWNLHEPVRNQYNFEGRYDLVKFVKLVAEAGLYAHLRIGPYVCAEWNFGGFPLWLHFIP 136
+FWN + P ++ R+D V+FV+L EAGLY + RIGP+VCAEW +GG P WL IP
Sbjct: 77 LFWNTNVPTPGEFVMSDRFDYVRFVQLAQEAGLYVNFRIGPFVCAEWTYGGLPAWLRQIP 136
Query: 137 GIQFRTDNEPFKAEMQRFTAKIVDMMKQEKLYASQGGPIILSQIENEYGNIDSAYGAAGK 196
I FR ++P+ + K V ++K +L A QGGPIIL QIENEYG +S Y A G
Sbjct: 137 DIMFRDYDQPWLQVAGEYITKTVQILKDNRLLAGQGGPIILLQIENEYGGTESRY-AGGP 195
Query: 197 SYIKWAAGMALSLDTGVPWVMCQQSDAPDPIINTCNGFYCDQFTPNSNNKPKMWTENWSG 256
Y++W +A +L W+MC Q DAP II TCN FYCD F P+ +P MWTENW G
Sbjct: 196 QYVEWCGQLAANLTDAAQWIMCSQPDAPANIIATCNAFYCDDFVPHP-GQPSMWTENWPG 254
Query: 257 WFLSFGGAVPYRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNFDRTSGGPFISTSYDYDA 316
WF +G P+RP +D+A+AV R++ +GG++ NYYMYHGGTNF+RT+GGPFI+T+YDYDA
Sbjct: 255 WFQKWGDPTPHRPAQDVAYAVTRYYIKGGSYMNYYMYHGGTNFERTAGGPFITTNYDYDA 314
Query: 317 PLDEYGLIRQPKWGHLKDLHKAIKLCEAALVATDPTYP-SLGPNLEATVYKTGSGLCSAF 375
LDEYG+ +PK+ HL +H + EA ++A P SLG NLEA +Y + G C AF
Sbjct: 315 SLDEYGMPNEPKYSHLGSMHAVLHDNEAIMMAVPAPKPISLGTNLEAHIYNSSVG-CVAF 373
Query: 376 LANIGTNSDVTVKFNGNSYLLPAWSVSILPDCKNVVFNTAKINSVTLVPS----FSRQSL 431
L+N +DV V+FNG +Y LPAWSVS+L C ++NTA + P +R+S
Sbjct: 374 LSNNNNKTDVEVQFNGRTYELPAWSVSVLHGCVTAIYNTAVCRAHQRAPHDAACCARESR 433
Query: 432 QV------------AADSSDAIGSGWSYINEPVG-ISKDDAFTKPGLLEQINTTADQSDY 478
+V A S I + +G + + LEQI+ T D +DY
Sbjct: 434 RVCDRLPPLRPKARAPCQSGRIRHLCLVVLTSIGPQAPATKYWNKTPLEQIDQTLDHTDY 493
Query: 479 LWYSLSTNIKADEPLLEDGSKTVLHVQSLGHALHAFINGKLVGSGYGSSSNAKVTVDFPI 538
LWYS S + + L + + + ++NGK V + + +A V+
Sbjct: 494 LWYSTSY-------VSSSATYAQLSLPQITDVAYVYVNGKFVTVSWSGNVSATVS----- 541
Query: 539 ALAPGKNTFDLLSLTVGLQNYGAFYEKTGAGITGPVQLKGSGNGTNIDLSSQQWTYQTGL 598
L G NT D+LSLT+GL N G + G+ G V L +++L+ W +QTG+
Sbjct: 542 -LVAGPNTIDILSLTMGLDNGGDILSEYNCGLLGGVYLG------SVNLTENGWWHQTGV 594
Query: 599 KGEE--LNFPSG-SSTQWDSKSTLPKLQPLVWYKTTFDAPAGSE-PVAIDFTGMGKGEAW 654
GE + P W + + L L WYK++FD P S+ P+A+D TGMGKG W
Sbjct: 595 VGERNAIFLPENLKKVAWTTPAVLNT--GLTWYKSSFDVPRDSQAPLALDLTGMGKGYVW 652
Query: 655 VNGQSIGRYWPTYVSQNGGCTDSCNYRGAYSSNKCLKNCGKPSQSLYHVPRSWLKSSGNT 714
VNG ++GRYWPT ++ N C D C+YRG Y + C + C PSQ+ YHVPR WL++ N
Sbjct: 653 VNGHNLGRYWPTILATNWPC-DVCDYRGTYDAPHCKQGCNMPSQTHYHVPREWLQAENNV 711
Query: 715 LVLFEEIGGDPTKISFVTKQLGSSLCSHVTDSHPLPVDMWGSDSKIQRKPGPVLSLECPN 774
LVL EE+GG+P+KI+ V ++ S C V + +P D + L C
Sbjct: 712 LVLLEEMGGNPSKIALVEREEYVS-CGVVGEDYP------ADDLAV--------VLGC-G 755
Query: 775 PNQVISSIKFASFGTPLGTCGSFSRGRCSSARSLSVVRQACVGSKSCSIGVSVNTFGDPC 834
+Q I+ + FAS+GTP+G+C S+ +G C ++ S +V C G ++CSI VS FG+PC
Sbjct: 756 THQTIAGVDFASYGTPMGSCRSYQQGSCHASNSTEIVLSLCHGKQACSIPVSAAMFGNPC 815
Query: 835 KGVM-KSLAVEASC 847
V K LAV+ +C
Sbjct: 816 PDVTNKRLAVQVAC 829
>gi|357463559|ref|XP_003602061.1| Beta-galactosidase [Medicago truncatula]
gi|355491109|gb|AES72312.1| Beta-galactosidase [Medicago truncatula]
Length = 694
Score = 679 bits (1753), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 371/732 (50%), Positives = 463/732 (63%), Gaps = 60/732 (8%)
Query: 6 ILLLVLCWGFVVLATTSFGANVTYDHRAVVIGGKRRVLISGSIHYPRSTPEMWPDLIQKS 65
IL + LC T GANVTYD ++VI G ++L SGSIHYPRSTP+MWPDLI K+
Sbjct: 13 ILTVSLC--------TVHGANVTYDRTSLVINGHHKILFSGSIHYPRSTPQMWPDLISKA 64
Query: 66 KDGGLDVIETYVFWNLHEPVRNQYNFEGRYDLVKFVKLVAEAGLYAHLRIGPYVCAEWNF 125
K+GGLDVI+TYVFWNLHEP + QY F GR+DLV F+K + GLY LRIGPY+ +E +
Sbjct: 65 KEGGLDVIQTYVFWNLHEPQQGQYEFNGRFDLVGFIKEIQAQGLYVTLRIGPYIESECTY 124
Query: 126 GGFPLWLHFIPGIQFRTDNEPFKAEMQRFTAKIVDMMKQEKLYASQGGPIILSQIENEYG 185
GG PLWLH +PGI FRTDN+ FK MQRFT KIV+MMK L+ASQGGPIILSQIENEYG
Sbjct: 125 GGLPLWLHDVPGIVFRTDNDQFKFHMQRFTTKIVNMMKSANLFASQGGPIILSQIENEYG 184
Query: 186 NIDSAYGAAGKSYIKWAAGMALSLDTGVPWVMCQQSDAPDPIINTCNGFYCDQ-FT-PNS 243
+I S + A G YI WAA MA+ L TGVPW+MC+Q DAPDP+IN CNG C + F PNS
Sbjct: 185 SIQSKFRANGLPYIHWAAQMAVGLQTGVPWMMCKQDDAPDPVINACNGMQCGRNFKGPNS 244
Query: 244 NNKPKMWTENWSGWFLSFGGAVPYRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNFDRTS 303
NKP +WTENW+ + +FGGA R D+A+ VA F + G++ NYYMYHGGTNFDR +
Sbjct: 245 PNKPSLWTENWTSFLQAFGGAPYMRSASDIAYNVALFIAKKGSYVNYYMYHGGTNFDRLA 304
Query: 304 GGPFISTSYDYDAPLDEYGLIRQPKWGHLKDLHKAIKLCEAALVATDPTYPSLGPNLEAT 363
FI T+Y +APLDEYGL+RQPKWGHLK+LH +IK C L+ T SLG +A
Sbjct: 305 SA-FIITAYYDEAPLDEYGLVRQPKWGHLKELHASIKSCSQPLLDGTQTTFSLGSEQQAY 363
Query: 364 VYKTGSGLCSAFLANIGTNSDVTVKFNGNSYLLPAWSVSILPDCKNVVFNTAKINSVTLV 423
V+++ S C+AFL N G DVT++F SY LP S+SILP CKNVVFNT K++ V
Sbjct: 364 VFRS-STECAAFLENSGPR-DVTIQFQNISYELPGKSISILPGCKNVVFNTGKVSIQNNV 421
Query: 424 PSFSRQSLQVAADSSDAIGSGWSYINEPVGISKDDAFTKPGLLEQINTTADQSDYLWYSL 483
+ + LQ + W E + + LL+QI+T D SDY+WY+
Sbjct: 422 RAM-KPRLQFNS------AENWKVYTEAIPNFAHTSKRADTLLDQISTAKDTSDYMWYTF 474
Query: 484 STNIKADEPLLEDGSKTVLHVQSLGHALHAFINGKLVGSGYGSSSNAKVTVDFPIALAPG 543
N K+ +K+VL + S G LH+FING L GS +GS +N +VT+ + L G
Sbjct: 475 RFNNKSPN------AKSVLSIYSQGDVLHSFINGVLTGSAHGSRNNTQVTMKKNVNLING 528
Query: 544 KNTFDLLSLTVGLQNYGAFYEKTGAGITGPVQLKGSGNGTNIDLSSQQWTYQTGLKGEEL 603
N +LS TVGL N GAF E AG+ V+++G D SS W YQ GL GE+L
Sbjct: 529 MNNISILSATVGLPNSGAFLESRVAGLR-KVEVQGR------DFSSYSWGYQVGLLGEKL 581
Query: 604 NF--PSGSS-TQWDSKSTLPKLQPLVWYKTTFDAPAGSEPVAIDFTGMGKGEAWVNGQSI 660
SGSS QW KS +PL WY+TTF APAG++PV ++ MGKG AWVNGQ I
Sbjct: 582 QIFTVSGSSKVQW--KSFQSSTKPLTWYQTTFHAPAGNDPVVVNLGSMGKGLAWVNGQGI 639
Query: 661 GRYWPTYVSQNGGCTDSCNYRGAYSSNKCLKNCGKPSQSLYHVPRSWLKSSGNTLVLFEE 720
GRYW ++ K G PSQ YH+PRS+LKS+GN LV+ EE
Sbjct: 640 GRYWVSF----------------------HKPDGTPSQQWYHIPRSFLKSTGNLLVILEE 677
Query: 721 IGGDPTKISFVT 732
G+P I+ T
Sbjct: 678 ETGNPLGITLDT 689
>gi|224083510|ref|XP_002307056.1| predicted protein [Populus trichocarpa]
gi|222856505|gb|EEE94052.1| predicted protein [Populus trichocarpa]
Length = 715
Score = 677 bits (1747), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 350/733 (47%), Positives = 466/733 (63%), Gaps = 49/733 (6%)
Query: 24 GANVTYDHRAVVIGGKRRVLISGSIHYPRSTPEMWPDLIQKSKDGGLDVIETYVFWNLHE 83
G +VTYD R+++I G+R++L SGSIHYPRSTPEMWP L+ K+++GG+DVI+TYVFWNLHE
Sbjct: 22 GGDVTYDGRSLIIDGQRKILFSGSIHYPRSTPEMWPSLVAKAREGGVDVIQTYVFWNLHE 81
Query: 84 PVRNQYNFEGRYDLVKFVKLVAEAGLYAHLRIGPYVCAEWNFGGFPLWLHFIPGIQFRTD 143
P +Y+F GR DLV+F+K + GLY LRIGP++ +EW +GGFP WLH +P I +R+D
Sbjct: 82 PRPGEYDFSGRNDLVRFIKEIQAQGLYVCLRIGPFIESEWTYGGFPFWLHDVPDIVYRSD 141
Query: 144 NEPFKAEMQRFTAKIVDMMKQEKLYASQGGPIILSQIENEYGNIDSAYGAAGKSYIKWAA 203
NEPFK MQ FT KIV+MMK E LYASQGGPIILSQIENEY N+++A+ G Y+ WAA
Sbjct: 142 NEPFKFYMQNFTTKIVNMMKSEGLYASQGGPIILSQIENEYQNVEAAFRDKGPPYVIWAA 201
Query: 204 GMALSLDTGVPWVMCQQSDAPDPIINTCNGFYCDQFT--PNSNNKPKMWTENWSGWFLSF 261
MA+ L TGVPWVMC+Q+DAPDP+INTCNG C + PNS KP +WTENW+ ++ +
Sbjct: 202 KMAVELQTGVPWVMCKQTDAPDPVINTCNGMRCGETFGGPNSPTKPSLWTENWTSFYQVY 261
Query: 262 GGAVPYRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNFDRTSGGPFISTSYDYDAPLDEY 321
GG R ED+AF V F + G++ NYYM+HGGTNF RT+ I++ YD APLDEY
Sbjct: 262 GGEPYIRSAEDIAFHVTLFIAKNGSYINYYMFHGGTNFGRTASAYVITSYYD-QAPLDEY 320
Query: 322 GLIRQPKWGHLKDLHKAIKLCEAALVATDPTYPSLGPNLEATVYKTGSGLCSAFLANIGT 381
GLIRQPKWGHLK+LH AIK C + ++ + SLG +A +++ C+AFL N
Sbjct: 321 GLIRQPKWGHLKELHAAIKSCSSTILEGVQSNFSLGQLQQAYIFEEEGAGCAAFLVNNDQ 380
Query: 382 NSDVTVKFNGNSYLLPAWSVSILPDCKNVVFNTAKINSVTLVPSFSRQSLQVAADSSDAI 441
++ TV+F ++ L S+S+LPDC+N++FNTAK+N+ +R S Q+ D+
Sbjct: 381 KNNATVEFRNITFELLPKSISVLPDCENIIFNTAKVNAKG--NEITRTSSQLFDDADR-- 436
Query: 442 GSGWSYINEPVGISKDDAFTKPGLLEQINTTADQSDYLWYSLS--TNIKADEPLLEDGSK 499
W + + D LLE +NTT D+SDYLWY+ S N EP
Sbjct: 437 ---WEAYTDVIPNFADTNLKSDTLLEHMNTTKDKSDYLWYTFSFLPNSSCTEP------- 486
Query: 500 TVLHVQSLGHALHAFINGKLVGSGYGS-SSNAKVTVDFPIALAPGKNTFDLLSLTVGLQN 558
+LHV+SL H AF+N K GS +GS + T++ PI L NT +LS VGLQ+
Sbjct: 487 -ILHVESLAHVASAFVNNKYAGSAHGSKDAKGPFTMEAPIVLNDQMNTISILSTMVGLQD 545
Query: 559 YGAFYEKTGAGITGPVQLKGSGNGTNIDLSSQQWTYQTGLKGEELNF---PSGSSTQWDS 615
GAF E+ AG+T V+++ + ++ +W YQ GL GE LN + +W S
Sbjct: 546 SGAFLERRYAGLTR-VEIRCAQQEIYNFTNNYEWGYQAGLSGESLNIYMREHLDNIEW-S 603
Query: 616 KSTLPKLQPLVWYKTTFDAPAGSEPVAIDFTGMGKGEAWVNGQSIGRYWPTYVSQNGGCT 675
+ QPL W+K FDAP G++PV ++ + MGKGEAWVNGQSIGRYW ++
Sbjct: 604 EVVSATDQPLSWFKIEFDAPTGNDPVVLNLSTMGKGEAWVNGQSIGRYWLSF-------- 655
Query: 676 DSCNYRGAYSSNKCLKNCGKPSQSLYHVPRSWLKSSGNTLVLFEEIGGDPTKISFVTKQL 735
L + G+PSQ+LYH+PR++L SSGN LVL EE GGDP IS T
Sbjct: 656 --------------LTSKGQPSQTLYHIPRAFLNSSGNLLVLLEESGGDPLHISLDTVSR 701
Query: 736 GSSLCSHVTDSHP 748
+ L H + HP
Sbjct: 702 -TGLQEHASRYHP 713
>gi|356507642|ref|XP_003522573.1| PREDICTED: beta-galactosidase 16-like [Glycine max]
Length = 696
Score = 676 bits (1745), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 365/740 (49%), Positives = 465/740 (62%), Gaps = 65/740 (8%)
Query: 12 CWGFVVLATTSF-----GANVTYDHRAVVIGGKRRVLISGSIHYPRSTPEMWPDLIQKSK 66
C+ F + F G NVTYD R+++I G+ ++L SGSIHYPRSTP+MWP+LI K+K
Sbjct: 7 CFSFAFILIRVFIGAVYGDNVTYDGRSLIIDGQHKILFSGSIHYPRSTPQMWPNLIAKAK 66
Query: 67 DGGLDVIETYVFWNLHEPVRNQYNFEGRYDLVKFVKLVAEAGLYAHLRIGPYVCAEWNFG 126
+GGLDVI+TYVFWNLHEP + QY+F G ++V+F+K + GLY LRIGPY+ +E +G
Sbjct: 67 EGGLDVIQTYVFWNLHEPQQGQYDFRGMRNIVRFIKEIQAQGLYVTLRIGPYIESECTYG 126
Query: 127 GFPLWLHFIPGIQFRTDNEPFKAEMQRFTAKIVDMMKQEKLYASQGGPIILSQIENEYGN 186
G PLWLH IPGI FR+DNE FK MQRFTAKIV++MK L+ASQGGPIILSQIENEYGN
Sbjct: 127 GLPLWLHDIPGIVFRSDNEQFKFHMQRFTAKIVNLMKSANLFASQGGPIILSQIENEYGN 186
Query: 187 IDSAYGAAGKSYIKWAAGMALSLDTGVPWVMCQQSDAPDPIINTCNGFYCDQFT--PNSN 244
++ A+ G SYI+WAA MA+ L TGVPWVMC+Q +APDP+INTCNG C + PNS
Sbjct: 187 VEGAFHEKGLSYIRWAAQMAVGLQTGVPWVMCKQDNAPDPVINTCNGMQCGKTFKGPNSP 246
Query: 245 NKPKMWTENWSGWFLSFGGAVPY-RPVEDLAFAVARFFQRGGTFQNYYMYHGGTNFDRTS 303
NKP +WTENW+ ++ F G VPY R ED+A+ VA F + G++ NYYMYHGGTNFDR +
Sbjct: 247 NKPSLWTENWTSFYQVF-GEVPYIRSAEDIAYNVALFIAKRGSYVNYYMYHGGTNFDRIA 305
Query: 304 GGPFISTSYDYDAPLDEYGLIRQPKWGHLKDLHKAIKLCEAALVATDPTYPSLGPNLEAT 363
F+ T+Y +APLDEYGL+R+PKWGHLK+LH+AIK C +L+ T SLG A
Sbjct: 306 SA-FVVTAYYDEAPLDEYGLVREPKWGHLKELHEAIKSCSNSLLYGTQTSFSLGTQQNAY 364
Query: 364 VYKTGSGLCSAFLANIGTNSDVTVKFNGNSYLLPAWSVSILPDCKNVVFNTAKINSVTLV 423
V++ S C+AFL N S VT++F Y LP S+SILPDCKNV FNTAK+ +
Sbjct: 365 VFRRSSIECAAFLENTEDRS-VTIQFQNIPYQLPPNSISILPDCKNVAFNTAKVRAQNAR 423
Query: 424 PSFSRQSLQVAADSSDAIGSGWSYINEPVGISKDDAFTKPGLLEQINTTADQSDYLWYSL 483
S Q+ +S++ W E + D + LL+QI+T D SDYLWY+
Sbjct: 424 AMKS----QLQFNSAEK----WKVYREAIPSFADTSLRANTLLDQISTAKDTSDYLWYTF 475
Query: 484 STNIKADEPLLEDGS---KTVLHVQSLGHALHAFINGKLVGSGYGSSSNAKVTVDFPIAL 540
L D S +++L S GH LHAF+NG LVGS +GS N ++ + L
Sbjct: 476 R---------LYDNSANAQSILSAYSHGHVLHAFVNGNLVGSKHGSHKNVSFVMENKLNL 526
Query: 541 APGKNTFDLLSLTVGLQNYGAFYEKTGAGITGPVQLKGSGNGTNIDLSSQQWTYQTGLKG 600
G N LS TVGL N GA+ E AG+ LK G D ++Q W YQ GL G
Sbjct: 527 ISGMNNISFLSATVGLPNSGAYLEGRVAGLRS---LKVQGR----DFTNQAWGYQVGLLG 579
Query: 601 EELNF--PSGSS-TQWDSKSTLPKLQPLVWYKTTFDAPAGSEPVAIDFTGMGKGEAWVNG 657
E+L SGSS +W+ S L +PL WYKTTFDAP G++PV ++ MGKG WVNG
Sbjct: 580 EKLQIYTASGSSKVKWE--SFLSSTKPLTWYKTTFDAPVGNDPVVLNLGSMGKGYTWVNG 637
Query: 658 QSIGRYWPTYVSQNGGCTDSCNYRGAYSSNKCLKNCGKPSQSLYHVPRSWLKSSGNTLVL 717
Q IGRYW ++ + G PSQ YH+PRS LKS+GN LVL
Sbjct: 638 QGIGRYWVSFHTPQ----------------------GTPSQKWYHIPRSLLKSTGNLLVL 675
Query: 718 FEEIGGDPTKISFVTKQLGS 737
EE G+P I+ T + S
Sbjct: 676 LEEETGNPLGITLDTVYITS 695
>gi|297836382|ref|XP_002886073.1| beta-galactosidase 13 [Arabidopsis lyrata subsp. lyrata]
gi|297331913|gb|EFH62332.1| beta-galactosidase 13 [Arabidopsis lyrata subsp. lyrata]
Length = 848
Score = 676 bits (1744), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 367/844 (43%), Positives = 502/844 (59%), Gaps = 68/844 (8%)
Query: 27 VTYDHRAVVIGGKRRVLISGSIHYPRSTPEMWPDLIQKSKDGGLDVIETYVFWNLHEPVR 86
VTYD +++I G R +L SGSIHYPRSTPEMWP++I+++K GGL+ I+TYVFWN+HEP +
Sbjct: 44 VTYDGTSLIINGNRELLYSGSIHYPRSTPEMWPNIIKRAKQGGLNTIQTYVFWNVHEPEQ 103
Query: 87 NQYNFEGRYDLVKFVKLVAEAGLYAHLRIGPYVCAEWNFGGFPLWLHFIPGIQFRTDNEP 146
++NF GR DLVKF+KL+ + G+Y LR+GP++ AEW GG P WL +PGI FRTDN P
Sbjct: 104 GKFNFSGRADLVKFIKLIEKNGMYVTLRLGPFIQAEWTHGGLPYWLREVPGIFFRTDNTP 163
Query: 147 FKAEMQRFTAKIVDMMKQEKLYASQGGPIILSQIENEYGNIDSAYGAAGKSYIKWAAGMA 206
FK +R+ I+D MK+EKL+ASQGGPIIL QIENEY + AY G +YIKWA+ +
Sbjct: 164 FKEHTERYVKVILDKMKEEKLFASQGGPIILGQIENEYSAVQRAYKEDGLNYIKWASKLV 223
Query: 207 LSLDTGVPWVMCQQSDAPDPIINTCNGFYC-DQFT-PNSNNKPKMWTENWSGWFLSFGGA 264
S+D G+PWVMC+Q+DAPDP+IN CNG +C D F PN NKP +WTENW+ F +G
Sbjct: 224 HSMDLGIPWVMCKQNDAPDPMINACNGRHCGDTFPGPNKENKPSLWTENWTTQFRVYGDP 283
Query: 265 VPYRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNFDRTSGGPFISTSYDYDAPLDEYGLI 324
R VED+A++VARFF + GT NYYMYHGGTNF RTS +++T Y DAPLDEYGL
Sbjct: 284 PAQRSVEDIAYSVARFFSKNGTHVNYYMYHGGTNFGRTSAH-YVTTRYYDDAPLDEYGLE 342
Query: 325 RQPKWGHLKDLHKAIKLCEAALVATDPTYPSLGPNLEATVYKT-GSGLCSAFLANIGTNS 383
R+PK+GHLK LH A+ LC+ AL+ P E Y+ G+ +C+AFLAN T S
Sbjct: 343 REPKYGHLKHLHNALNLCKKALLWGQPRVEKPSNETEIRYYEQPGTKVCAAFLANNNTES 402
Query: 384 DVTVKFNGNSYLLPAWSVSILPDCKNVVFNTAKINSVTLVPSFSRQS-------LQVAAD 436
+KF G Y++P S+SILPDCK VV+NT +I S +F + +V +
Sbjct: 403 AEKIKFKGKEYIIPHRSISILPDCKTVVYNTGEIISHHTSRNFMKSKKANKNFDFKVFTE 462
Query: 437 SSDAIGSGWSYINEPVGISKDDAFTKPGLLEQINTTADQSDYLWYSLSTNIKADEPLLED 496
+ + G SYI PV E T D++DY WY+ S I ++ +
Sbjct: 463 TVPSKIKGDSYI--PV--------------ELYGLTKDETDYGWYTTSFKIDDNDLSKKK 506
Query: 497 GSKTVLHVQSLGHALHAFINGKLVGSGYGSSSNAKVTVDFPIALAPGKNTFDLLSLTVGL 556
GSK L + SLGHALH ++NG+ +G+G+GS PI+L G+N +L + G
Sbjct: 507 GSKPTLRIASLGHALHVWLNGEYLGNGHGSHEEKSFVFQKPISLKEGENHLTMLGVLTGF 566
Query: 557 QNYGAFYEKTGAGITGP--VQLKGSGNGTNIDLSSQ-QWTYQTGLKGEELNFPSGSSTQW 613
+ G++ E TGP V + G G+GT +DL+ + +W + G++GE+L + +
Sbjct: 567 PDSGSYMEHR---YTGPRSVSILGLGSGT-LDLTEENKWGNKVGMEGEKLGIHAEEGLKK 622
Query: 614 DSKSTLPKLQP-LVWYKTTFDAPAGSEPVAIDFTGMGKGEAWVNGQSIGRYWPTYVSQNG 672
+P L WY+T FDAP AI GMGKG WVNG+ +GRYW +++S
Sbjct: 623 VKWQKFSGKEPGLTWYQTYFDAPESQSAAAIRMNGMGKGLIWVNGEGVGRYWMSFLSP-- 680
Query: 673 GCTDSCNYRGAYSSNKCLKNCGKPSQSLYHVPRSWLKSSGNTLVLFEEIGG-DPTKISFV 731
G+P+Q YH+PRS+LK N LV+FEE P I FV
Sbjct: 681 --------------------LGQPTQIEYHIPRSFLKPKKNLLVIFEEEPNVKPELIDFV 720
Query: 732 TKQLGSSLCSHVTDSHPLPVDMW-GSDSKIQRKPGPV---LSLECPNPNQVISSIKFASF 787
++CSH+ +++ V W + ++Q V SL+C + IS ++FASF
Sbjct: 721 IINR-DTVCSHIGENYTPSVRHWTRKNDQVQAITDDVHLTASLKCSGTKK-ISEVEFASF 778
Query: 788 GTPLGTCGSFSRGRCSSARSLSVVRQACVGSKSCSIGVSVNTF----GDPCKGVMKSLAV 843
G P GTCG+F+ G C++ S VV + C+G C I V+ +TF D C V K LAV
Sbjct: 779 GNPNGTCGNFTLGTCNAPVSKKVVEKYCLGKAECVIPVNKSTFQQDKKDSCPKVEKKLAV 838
Query: 844 EASC 847
+ C
Sbjct: 839 QVKC 842
>gi|30679742|ref|NP_179264.2| beta-galactosidase 13 [Arabidopsis thaliana]
gi|75265629|sp|Q9SCU9.1|BGL13_ARATH RecName: Full=Beta-galactosidase 13; Short=Lactase 13; Flags:
Precursor
gi|6686898|emb|CAB64749.1| putative beta-galactosidase [Arabidopsis thaliana]
gi|330251438|gb|AEC06532.1| beta-galactosidase 13 [Arabidopsis thaliana]
Length = 848
Score = 675 bits (1741), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 367/846 (43%), Positives = 505/846 (59%), Gaps = 72/846 (8%)
Query: 27 VTYDHRAVVIGGKRRVLISGSIHYPRSTPEMWPDLIQKSKDGGLDVIETYVFWNLHEPVR 86
VTYD +++I G R +L SGSIHYPRSTPEMWP++I+++K GGL+ I+TYVFWN+HEP +
Sbjct: 44 VTYDGTSLIINGNRELLYSGSIHYPRSTPEMWPNIIKRAKQGGLNTIQTYVFWNVHEPEQ 103
Query: 87 NQYNFEGRYDLVKFVKLVAEAGLYAHLRIGPYVCAEWNFGGFPLWLHFIPGIQFRTDNEP 146
++NF GR DLVKF+KL+ + GLY LR+GP++ AEW GG P WL +PGI FRTDNEP
Sbjct: 104 GKFNFSGRADLVKFIKLIEKNGLYVTLRLGPFIQAEWTHGGLPYWLREVPGIFFRTDNEP 163
Query: 147 FKAEMQRFTAKIVDMMKQEKLYASQGGPIILSQIENEYGNIDSAYGAAGKSYIKWAAGMA 206
FK +R+ ++DMMK+EKL+ASQGGPIIL QIENEY + AY G +YIKWA+ +
Sbjct: 164 FKEHTERYVKVVLDMMKEEKLFASQGGPIILGQIENEYSAVQRAYKEDGLNYIKWASKLV 223
Query: 207 LSLDTGVPWVMCQQSDAPDPIINTCNGFYC-DQFT-PNSNNKPKMWTENWSGWFLSFGGA 264
S+D G+PWVMC+Q+DAPDP+IN CNG +C D F PN +NKP +WTENW+ F FG
Sbjct: 224 HSMDLGIPWVMCKQNDAPDPMINACNGRHCGDTFPGPNKDNKPSLWTENWTTQFRVFGDP 283
Query: 265 VPYRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNFDRTSGGPFISTSYDYDAPLDEYGLI 324
R VED+A++VARFF + GT NYYMYHGGTNF RTS +++T Y DAPLDE+GL
Sbjct: 284 PAQRSVEDIAYSVARFFSKNGTHVNYYMYHGGTNFGRTSAH-YVTTRYYDDAPLDEFGLE 342
Query: 325 RQPKWGHLKDLHKAIKLCEAALVATDPTYPSLGPNLEATVYKT-GSGLCSAFLANIGTNS 383
R+PK+GHLK LH A+ LC+ AL+ P E Y+ G+ +C+AFLAN T +
Sbjct: 343 REPKYGHLKHLHNALNLCKKALLWGQPRVEKPSNETEIRYYEQPGTKVCAAFLANNNTEA 402
Query: 384 DVTVKFNGNSYLLPAWSVSILPDCKNVVFNTAKINSVTLVPSFSRQS-------LQVAAD 436
+KF G YL+P S+SILPDCK VV+NT +I S +F + +V +
Sbjct: 403 AEKIKFRGKEYLIPHRSISILPDCKTVVYNTGEIISHHTSRNFMKSKKANKNFDFKVFTE 462
Query: 437 SSDAIGSGWSYINEPVGISKDDAFTKPGLLEQINTTADQSDYLWYSLSTNIKADEPLLED 496
S + G S+I PV E T D+SDY WY+ S I ++ +
Sbjct: 463 SVPSKIKGDSFI--PV--------------ELYGLTKDESDYGWYTTSFKIDDNDLSKKK 506
Query: 497 GSKTVLHVQSLGHALHAFINGKLVGSGYGSSSNAKVTVDFPIALAPGKNTFDLLSLTVGL 556
G K L + SLGHALH ++NG+ +G+G+GS P+ L G+N +L + G
Sbjct: 507 GGKPNLRIASLGHALHVWLNGEYLGNGHGSHEEKSFVFQKPVTLKEGENHLTMLGVLTGF 566
Query: 557 QNYGAFYEKTGAGITGP--VQLKGSGNGTNIDLSSQ-QWTYQTGLKGEELNFPSGS---S 610
+ G++ E TGP V + G G+GT +DL+ + +W + G++GE L +
Sbjct: 567 PDSGSYMEHR---YTGPRSVSILGLGSGT-LDLTEENKWGNKVGMEGERLGIHAEEGLKK 622
Query: 611 TQWDSKSTLPKLQPLVWYKTTFDAPAGSEPVAIDFTGMGKGEAWVNGQSIGRYWPTYVSQ 670
+W+ S K + WY+T FDAP AI GMGKG WVNG+ +GRYW +++S
Sbjct: 623 VKWEKASG--KEPGMTWYQTYFDAPESQSAAAIRMNGMGKGLIWVNGEGVGRYWMSFLSP 680
Query: 671 NGGCTDSCNYRGAYSSNKCLKNCGKPSQSLYHVPRSWLKSSGNTLVLFEEIGG-DPTKIS 729
G+P+Q YH+PRS+LK N LV+FEE P I
Sbjct: 681 ----------------------LGQPTQIEYHIPRSFLKPKKNLLVIFEEEPNVKPELID 718
Query: 730 FVTKQLGSSLCSHVTDSHPLPVDMW-GSDSKIQRKPGPV---LSLECPNPNQVISSIKFA 785
FV ++CS++ +++ V W + ++Q V +L+C + IS+++FA
Sbjct: 719 FVIVNR-DTVCSYIGENYTPSVRHWTRKNDQVQAITDDVHLTANLKCSGTKK-ISAVEFA 776
Query: 786 SFGTPLGTCGSFSRGRCSSARSLSVVRQACVGSKSCSIGVSVNTF----GDPCKGVMKSL 841
SFG P GTCG+F+ G C++ S VV + C+G C I V+ +TF D C V K L
Sbjct: 777 SFGNPNGTCGNFTLGSCNAPVSKKVVEKYCLGKAECVIPVNKSTFEQDKKDSCPKVEKKL 836
Query: 842 AVEASC 847
AV+ C
Sbjct: 837 AVQVKC 842
>gi|147819335|emb|CAN64508.1| hypothetical protein VITISV_004610 [Vitis vinifera]
Length = 766
Score = 674 bits (1740), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 382/832 (45%), Positives = 486/832 (58%), Gaps = 95/832 (11%)
Query: 24 GANVTYDHRAVVIGGKRRVLISGSIHYPRSTPEMWPDLIQKSKDGGLDVIETYVFWNLHE 83
G +VTYD R+++I G+RR+L SGSIHYPRSTPEMWP LI K+K+GG+DVIETY FWN HE
Sbjct: 21 GGSVTYDGRSLIINGQRRLLFSGSIHYPRSTPEMWPSLISKAKEGGIDVIETYAFWNQHE 80
Query: 84 PVRNQYNFEGRYDLVKFVKLVAEAGLYAHLRIGPYVCAEWNFGGFPLWLHFIPGIQFRTD 143
P + QY+F GR D+VKF K V GLYA LRIGP++ +EWN+GG P WLH +PGI +R+D
Sbjct: 81 PKQGQYDFSGRLDIVKFFKEVQAQGLYACLRIGPFIESEWNYGGLPFWLHDVPGIIYRSD 140
Query: 144 NEPFKAEMQRFTAKIVDMMKQEKLYASQGGPIILSQIENEYGNIDSAYGAAGKSYIKWAA 203
NEPFK MQ FT KIV++MK E LYASQGGPIILSQIENEY N+++A+ G Y++WAA
Sbjct: 141 NEPFKFYMQNFTTKIVNLMKSENLYASQGGPIILSQIENEYKNVEAAFHEKGPPYVRWAA 200
Query: 204 GMALSLDTGVPWVMCQQSDAPDPIINTCNGFYCDQFTPNSNNKPKMWTENWSGWFLSFGG 263
MA+ L T + + + E+ G
Sbjct: 201 KMAVDLQTAM----------------------------------RYYGEDKRG------- 219
Query: 264 AVPYRPVEDLAFAVARFF-QRGGTFQNYYMYHGGTNFDRTSGGPFISTSYDYDAPLDEYG 322
R EDLAF VA F ++ G+F NYYMYHGGTNF RTS ++ YD APLDEYG
Sbjct: 220 ----RAAEDLAFQVALFIAKKNGSFINYYMYHGGTNFGRTSSSYVLTAYYD-QAPLDEYG 274
Query: 323 LIRQPKWGHLKDLHKAIKLCEAALVATDPTYPSLGPNLEATVYKTGSGLCSAFLANIGTN 382
LIRQPKWGHLK+LH IKLC L+ SLG EA ++K SG C+AFL N
Sbjct: 275 LIRQPKWGHLKELHAVIKLCSDTLLXGVQYNYSLGQLQEAYLFKRPSGQCAAFLVNNDKR 334
Query: 383 SDVTVKFNGNSYLLPAWSVSILPDCKNVVFNTAKINSVTLVPSFSRQSLQVAADSSDAIG 442
+VTV F +Y L A S+SILPDCK + FNTAK+++ F+ +S+Q A
Sbjct: 335 RNVTVLFQNTNYELAANSISILPDCKKIAFNTAKVST-----QFNTRSVQTRATFGST-- 387
Query: 443 SGWSYINEPVGISKDDAFTKPGLLEQINTTADQSDYLWYSLSTNIKADEPLLEDGSKTVL 502
WS E + LLE + TT D SDYLWY+L + ++ VL
Sbjct: 388 KQWSEYREGIPSFGGTPLKASMLLEHMGTTKDASDYLWYTLRFIHNSSN------AQPVL 441
Query: 503 HVQSLGHALHAFINGKLVGSGYGSSSNAKVTVDFPIALAPGKNTFDLLSLTVGLQNYGAF 562
V SL H L AF+NGK + S +GS N ++ + L G N LLS+ VGL + G +
Sbjct: 442 RVDSLAHVLLAFVNGKYIASAHGSHQNGSFSLVNKVPLNSGLNRISLLSVMVGLPDAGPY 501
Query: 563 YEKTGAGITGPVQLKGSGNGTNIDLSSQQWTYQTGLKGEELNF---PSGSSTQWDSKSTL 619
E AGI +++ G + D S W YQ GL GE+L P QW +
Sbjct: 502 LEHKVAGIR---RVEIQDGGXSKDFSKHPWGYQVGLMGEKLQIYTSPGSQKVQWYGLGSH 558
Query: 620 PKLQPLVWYKTTFDAPAGSEPVAIDFTGMGKGEAWVNGQSIGRYWPTYVSQNGGCTDSCN 679
+ PL WYKT FDAP G++PV + F MGKGEAWVNGQSIGRYW +Y++ +
Sbjct: 559 GR-GPLTWYKTLFDAPRGNDPVVLFFGSMGKGEAWVNGQSIGRYWVSYLTPS-------- 609
Query: 680 YRGAYSSNKCLKNCGKPSQSLYHVPRSWLKSSGNTLVLFEEIGGDPTKISFVTKQLGSSL 739
G+PSQ+ Y+VPR++L GN LV+ EE GDP KIS T + +++
Sbjct: 610 --------------GEPSQTWYNVPRAFLNPKGNLLVVQEEESGDPLKISIGTVSV-TNV 654
Query: 740 CSHVTDSHPLPVDMW-GSDSKIQRKPG--PVLSLECPNPNQVISSIKFASFGTPLGTCGS 796
C HVTDSHP P+ W SD + G P + L CP P+ IS I FASFGTP+G C S
Sbjct: 655 CGHVTDSHPPPIISWTTSDDGNESHHGKIPKVQLRCP-PSSNISKITFASFGTPVGGCES 713
Query: 797 FSRGRCSSARSLSVVRQACVGSKSCSIGVSVNTFG-DPCKGVMKSLAVEASC 847
++ G C S SL+V +AC+G CSI S+ +FG DPC G K+L V A C
Sbjct: 714 YAIGSCHSPNSLAVAEKACLGKNXCSIPHSLKSFGDDPCPGTPKALLVAAQC 765
>gi|4581116|gb|AAD24606.1| putative beta-galactosidase [Arabidopsis thaliana]
Length = 832
Score = 674 bits (1739), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 366/846 (43%), Positives = 505/846 (59%), Gaps = 72/846 (8%)
Query: 27 VTYDHRAVVIGGKRRVLISGSIHYPRSTPEMWPDLIQKSKDGGLDVIETYVFWNLHEPVR 86
+TYD +++I G R +L SGSIHYPRSTPEMWP++I+++K GGL+ I+TYVFWN+HEP +
Sbjct: 28 ITYDGTSLIINGNRELLYSGSIHYPRSTPEMWPNIIKRAKQGGLNTIQTYVFWNVHEPEQ 87
Query: 87 NQYNFEGRYDLVKFVKLVAEAGLYAHLRIGPYVCAEWNFGGFPLWLHFIPGIQFRTDNEP 146
++NF GR DLVKF+KL+ + GLY LR+GP++ AEW GG P WL +PGI FRTDNEP
Sbjct: 88 GKFNFSGRADLVKFIKLIEKNGLYVTLRLGPFIQAEWTHGGLPYWLREVPGIFFRTDNEP 147
Query: 147 FKAEMQRFTAKIVDMMKQEKLYASQGGPIILSQIENEYGNIDSAYGAAGKSYIKWAAGMA 206
FK +R+ ++DMMK+EKL+ASQGGPIIL QIENEY + AY G +YIKWA+ +
Sbjct: 148 FKEHTERYVKVVLDMMKEEKLFASQGGPIILGQIENEYSAVQRAYKEDGLNYIKWASKLV 207
Query: 207 LSLDTGVPWVMCQQSDAPDPIINTCNGFYC-DQFT-PNSNNKPKMWTENWSGWFLSFGGA 264
S+D G+PWVMC+Q+DAPDP+IN CNG +C D F PN +NKP +WTENW+ F FG
Sbjct: 208 HSMDLGIPWVMCKQNDAPDPMINACNGRHCGDTFPGPNKDNKPSLWTENWTTQFRVFGDP 267
Query: 265 VPYRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNFDRTSGGPFISTSYDYDAPLDEYGLI 324
R VED+A++VARFF + GT NYYMYHGGTNF RTS +++T Y DAPLDE+GL
Sbjct: 268 PAQRSVEDIAYSVARFFSKNGTHVNYYMYHGGTNFGRTSAH-YVTTRYYDDAPLDEFGLE 326
Query: 325 RQPKWGHLKDLHKAIKLCEAALVATDPTYPSLGPNLEATVYKT-GSGLCSAFLANIGTNS 383
R+PK+GHLK LH A+ LC+ AL+ P E Y+ G+ +C+AFLAN T +
Sbjct: 327 REPKYGHLKHLHNALNLCKKALLWGQPRVEKPSNETEIRYYEQPGTKVCAAFLANNNTEA 386
Query: 384 DVTVKFNGNSYLLPAWSVSILPDCKNVVFNTAKINSVTLVPSFSRQS-------LQVAAD 436
+KF G YL+P S+SILPDCK VV+NT +I S +F + +V +
Sbjct: 387 AEKIKFRGKEYLIPHRSISILPDCKTVVYNTGEIISHHTSRNFMKSKKANKNFDFKVFTE 446
Query: 437 SSDAIGSGWSYINEPVGISKDDAFTKPGLLEQINTTADQSDYLWYSLSTNIKADEPLLED 496
S + G S+I PV E T D+SDY WY+ S I ++ +
Sbjct: 447 SVPSKIKGDSFI--PV--------------ELYGLTKDESDYGWYTTSFKIDDNDLSKKK 490
Query: 497 GSKTVLHVQSLGHALHAFINGKLVGSGYGSSSNAKVTVDFPIALAPGKNTFDLLSLTVGL 556
G K L + SLGHALH ++NG+ +G+G+GS P+ L G+N +L + G
Sbjct: 491 GGKPNLRIASLGHALHVWLNGEYLGNGHGSHEEKSFVFQKPVTLKEGENHLTMLGVLTGF 550
Query: 557 QNYGAFYEKTGAGITGP--VQLKGSGNGTNIDLSSQ-QWTYQTGLKGEELNFPSGS---S 610
+ G++ E TGP V + G G+GT +DL+ + +W + G++GE L +
Sbjct: 551 PDSGSYMEHR---YTGPRSVSILGLGSGT-LDLTEENKWGNKVGMEGERLGIHAEEGLKK 606
Query: 611 TQWDSKSTLPKLQPLVWYKTTFDAPAGSEPVAIDFTGMGKGEAWVNGQSIGRYWPTYVSQ 670
+W+ S K + WY+T FDAP AI GMGKG WVNG+ +GRYW +++S
Sbjct: 607 VKWEKASG--KEPGMTWYQTYFDAPESQSAAAIRMNGMGKGLIWVNGEGVGRYWMSFLSP 664
Query: 671 NGGCTDSCNYRGAYSSNKCLKNCGKPSQSLYHVPRSWLKSSGNTLVLFEEIGG-DPTKIS 729
G+P+Q YH+PRS+LK N LV+FEE P I
Sbjct: 665 ----------------------LGQPTQIEYHIPRSFLKPKKNLLVIFEEEPNVKPELID 702
Query: 730 FVTKQLGSSLCSHVTDSHPLPVDMW-GSDSKIQRKPGPV---LSLECPNPNQVISSIKFA 785
FV ++CS++ +++ V W + ++Q V +L+C + IS+++FA
Sbjct: 703 FVIVNR-DTVCSYIGENYTPSVRHWTRKNDQVQAITDDVHLTANLKCSGTKK-ISAVEFA 760
Query: 786 SFGTPLGTCGSFSRGRCSSARSLSVVRQACVGSKSCSIGVSVNTF----GDPCKGVMKSL 841
SFG P GTCG+F+ G C++ S VV + C+G C I V+ +TF D C V K L
Sbjct: 761 SFGNPNGTCGNFTLGSCNAPVSKKVVEKYCLGKAECVIPVNKSTFEQDKKDSCPKVEKKL 820
Query: 842 AVEASC 847
AV+ C
Sbjct: 821 AVQVKC 826
>gi|6686884|emb|CAB64742.1| putative beta-galactosidase [Arabidopsis thaliana]
Length = 718
Score = 673 bits (1736), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 355/738 (48%), Positives = 462/738 (62%), Gaps = 51/738 (6%)
Query: 7 LLLVLCWGFVVLATTSF------GANVTYDHRAVVIGGKRRVLISGSIHYPRSTPEMWPD 60
L+ LC +V F VTYD R+++I G+R++L SGSIHYPRSTPEMWP
Sbjct: 6 LVFGLCLILIVGTFLEFSGGATAAKGVTYDGRSLIIDGQRKLLFSGSIHYPRSTPEMWPS 65
Query: 61 LIQKSKDGGLDVIETYVFWNLHEPVRNQYNFEGRYDLVKFVKLVAEAGLYAHLRIGPYVC 120
LI+K+K+GG+DVI+TYVFWNLHEP QY+F GR DLVKF+K + GLY LRIGP++
Sbjct: 66 LIKKTKEGGIDVIQTYVFWNLHEPKLGQYDFSGRNDLVKFIKEIRSQGLYVCLRIGPFIE 125
Query: 121 AEWNFGGFPLWLHFIPGIQFRTDNEPFKAEMQRFTAKIVDMMKQEKLYASQGGPIILSQI 180
AEWN+GG P WL +PG+ +RTDNEPFK MQ+FTAKIVD+MK E LYASQGGPIILSQI
Sbjct: 126 AEWNYGGLPFWLRDVPGMVYRTDNEPFKFHMQKFTAKIVDLMKSEGLYASQGGPIILSQI 185
Query: 181 ENEYGNIDSAYGAAGKSYIKWAAGMALSLDTGVPWVMCQQSDAPDPIINTCNGFYCDQF- 239
ENEY N++ A+ G SYIKWA MA+ L TGVPW+MC+ DAPDP+INTCNG C +
Sbjct: 186 ENEYANVEGAFHEKGASYIKWAGQMAVGLKTGVPWIMCKSPDAPDPVINTCNGMKCGETF 245
Query: 240 -TPNSNNKPKMWTENWSGWFLSFGGAVPYRPVEDLAFAVARFFQRGGTFQNYYMYHGGTN 298
PNS NKPKMWTE+W+ +F +G R ED+AF A F + G++ NYYMYHGGTN
Sbjct: 246 PGPNSPNKPKMWTEDWTSFFQVYGKEPYIRSAEDIAFHAALFVAKNGSYINYYMYHGGTN 305
Query: 299 FDRTSGGPFISTSYDYDAPLDEYGLIRQPKWGHLKDLHKAIKLCEAALVATDPTYPSLGP 358
F RTS FI+ YD APLDEYGL+RQPK+GHLK+LH AIK L+ T SLGP
Sbjct: 306 FGRTSSSYFITGYYD-QAPLDEYGLLRQPKYGHLKELHAAIKSSANPLLQGKQTILSLGP 364
Query: 359 NLEATVYKTGSGLCSAFLANIGTNSDVTVKFNGNSYLLPAWSVSILPDCKNVVFNTAKIN 418
+A V++ + C AFL N + ++F N+Y L S+ IL +CKN+++ TAK+N
Sbjct: 365 MQQAYVFEDANNGCVAFLVNNDAKAS-QIQFRNNAYSLSPKSIGILQNCKNLIYETAKVN 423
Query: 419 SVTLVPSFSRQSLQVAADSSDAIGSGWSYINEPVGISKDDAFTKPGLLEQINTTADQSDY 478
V +R + V + + W+ E + S+ LLE N T D++DY
Sbjct: 424 ----VKMNTRVTTPVQVFN---VPDNWNLFRETIPASQAHLLKTNALLEHTNLTKDKTDY 476
Query: 479 LWYSLSTNIKADEPLLEDGSKTVLHVQSLGHALHAFINGKLVGSGYGSSSNAKVTVDFPI 538
LWY +++ K D P + ++ +S GH +H F+N L GSG+GS V + P+
Sbjct: 477 LWY--TSSFKLDSPC----TNPSIYTESSGHVVHVFVNNALAGSGHGSRDIRVVKLQAPV 530
Query: 539 ALAPGKNTFDLLSLTVGLQNYGAFYEKTGAGITGPVQLKGSGNGTNIDLSSQQWTYQTGL 598
+L G+N +LS VGL + GA+ E+ G+T VQ+ G IDLS QW Y GL
Sbjct: 531 SLINGQNNISILSGMVGLPDSGAYMERRSYGLT-KVQISCGGTKP-IDLSRSQWGYSVGL 588
Query: 599 KGEELNF---PSGSSTQWD-SKSTLPKLQPLVWYKTTFDAPAGSEPVAIDFTGMGKGEAW 654
GE++ + + +W +K+ L K +PL WYKTTFD P G PV + + MGKGE W
Sbjct: 589 LGEKVRLYQWKNLNRVKWSMNKAGLIKNRPLAWYKTTFDGPNGDGPVGLHMSSMGKGEIW 648
Query: 655 VNGQSIGRYWPTYVSQNGGCTDSCNYRGAYSSNKCLKNCGKPSQSLYHVPRSWLKSSGNT 714
VNG+SIGRYW ++ L G+PSQS+YH+PR++LK SGN
Sbjct: 649 VNGESIGRYWVSF----------------------LTPAGQPSQSIYHIPRAFLKPSGNL 686
Query: 715 LVLFEEIGGDPTKISFVT 732
LV+FEE GGDP IS T
Sbjct: 687 LVVFEEEGGDPLGISLNT 704
>gi|110739416|dbj|BAF01618.1| beta-galactosidase like protein [Arabidopsis thaliana]
Length = 718
Score = 670 bits (1728), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 354/738 (47%), Positives = 461/738 (62%), Gaps = 51/738 (6%)
Query: 7 LLLVLCWGFVVLATTSF------GANVTYDHRAVVIGGKRRVLISGSIHYPRSTPEMWPD 60
L+ LC +V F VTYD R+++I G+R++L SGSIHYPRSTPEMWP
Sbjct: 6 LVFGLCLILIVGTFLEFSGGATAAKGVTYDGRSLIIDGQRKLLFSGSIHYPRSTPEMWPS 65
Query: 61 LIQKSKDGGLDVIETYVFWNLHEPVRNQYNFEGRYDLVKFVKLVAEAGLYAHLRIGPYVC 120
LI+K+K+GG+DVI+TYVFWNLHEP QY+F GR DLVKF+K + GLY LRIGP++
Sbjct: 66 LIKKAKEGGIDVIQTYVFWNLHEPKLGQYDFSGRNDLVKFIKEIRSQGLYVCLRIGPFIE 125
Query: 121 AEWNFGGFPLWLHFIPGIQFRTDNEPFKAEMQRFTAKIVDMMKQEKLYASQGGPIILSQI 180
AEWN+GG P WL +PG+ +RTDNEPFK MQ+FTAKIVD+MK E LYASQGGPIILSQI
Sbjct: 126 AEWNYGGLPFWLRDVPGMVYRTDNEPFKFHMQKFTAKIVDLMKSEGLYASQGGPIILSQI 185
Query: 181 ENEYGNIDSAYGAAGKSYIKWAAGMALSLDTGVPWVMCQQSDAPDPIINTCNGFYCDQF- 239
ENEY N++ A+ G SYIKWA MA+ L TGVPW+MC+ DAPDP+INTCNG C +
Sbjct: 186 ENEYANVEGAFHEKGASYIKWAGQMAVGLKTGVPWIMCKSPDAPDPVINTCNGMKCGETF 245
Query: 240 -TPNSNNKPKMWTENWSGWFLSFGGAVPYRPVEDLAFAVARFFQRGGTFQNYYMYHGGTN 298
PNS NKPKMWTE+W+ +F +G R ED+AF A F + G++ NYYMYHGGTN
Sbjct: 246 PGPNSPNKPKMWTEDWTSFFQVYGKEPYIRSAEDIAFHAALFVAKNGSYINYYMYHGGTN 305
Query: 299 FDRTSGGPFISTSYDYDAPLDEYGLIRQPKWGHLKDLHKAIKLCEAALVATDPTYPSLGP 358
F RTS FI+ YD APLDEYGL+RQPK+GHLK+LH AIK L+ T SLGP
Sbjct: 306 FGRTSSSYFITGYYD-QAPLDEYGLLRQPKYGHLKELHAAIKSSANPLLQGKQTILSLGP 364
Query: 359 NLEATVYKTGSGLCSAFLANIGTNSDVTVKFNGNSYLLPAWSVSILPDCKNVVFNTAKIN 418
+A V++ + C AFL N + ++F N+Y L S+ IL +CKN+++ TAK+N
Sbjct: 365 MQQAYVFEDANNGCVAFLVNNDAKAS-QIQFRNNAYSLSPKSIGILQNCKNLIYETAKVN 423
Query: 419 SVTLVPSFSRQSLQVAADSSDAIGSGWSYINEPVGISKDDAFTKPGLLEQINTTADQSDY 478
V +R + V + + W+ E + + LLE N T D++DY
Sbjct: 424 ----VKMNTRVTTPVQVFN---VPDNWNLFRETIPAFPGTSLKTNALLEHTNLTKDKTDY 476
Query: 479 LWYSLSTNIKADEPLLEDGSKTVLHVQSLGHALHAFINGKLVGSGYGSSSNAKVTVDFPI 538
LWY +++ K D P + ++ +S GH +H F+N L GSG+GS V + P+
Sbjct: 477 LWY--TSSFKLDSPC----TNPSIYTESSGHVVHVFVNNALAGSGHGSRDIRVVKLQAPV 530
Query: 539 ALAPGKNTFDLLSLTVGLQNYGAFYEKTGAGITGPVQLKGSGNGTNIDLSSQQWTYQTGL 598
+L G+N +LS VGL + GA+ E+ G+T VQ+ G IDLS QW Y GL
Sbjct: 531 SLINGQNNISILSGMVGLPDSGAYMERRSYGLT-KVQISCGGTKP-IDLSRSQWGYSVGL 588
Query: 599 KGEELNF---PSGSSTQWD-SKSTLPKLQPLVWYKTTFDAPAGSEPVAIDFTGMGKGEAW 654
GE++ + + +W +K+ L K +PL WYKTTFD P G PV + + MGKGE W
Sbjct: 589 LGEKVRLYQWKNLNRVKWSMNKAGLIKNRPLAWYKTTFDGPNGDGPVGLHMSSMGKGEIW 648
Query: 655 VNGQSIGRYWPTYVSQNGGCTDSCNYRGAYSSNKCLKNCGKPSQSLYHVPRSWLKSSGNT 714
VNG+SIGRYW ++ L G+PSQS+YH+PR++LK SGN
Sbjct: 649 VNGESIGRYWVSF----------------------LTPAGQPSQSIYHIPRAFLKPSGNL 686
Query: 715 LVLFEEIGGDPTKISFVT 732
LV+FEE GGDP IS T
Sbjct: 687 LVVFEEEGGDPLGISLNT 704
>gi|30697899|ref|NP_568978.2| beta-galactosidase 6 [Arabidopsis thaliana]
gi|75170268|sp|Q9FFN4.1|BGAL6_ARATH RecName: Full=Beta-galactosidase 6; Short=Lactase 6; Flags:
Precursor
gi|10177061|dbj|BAB10473.1| beta-galactosidase [Arabidopsis thaliana]
gi|332010416|gb|AED97799.1| beta-galactosidase 6 [Arabidopsis thaliana]
Length = 718
Score = 670 bits (1728), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 354/738 (47%), Positives = 461/738 (62%), Gaps = 51/738 (6%)
Query: 7 LLLVLCWGFVVLATTSF------GANVTYDHRAVVIGGKRRVLISGSIHYPRSTPEMWPD 60
L+ LC +V F VTYD R+++I G+R++L SGSIHYPRSTPEMWP
Sbjct: 6 LVFGLCLILIVGTFLEFSGGATAAKGVTYDGRSLIIDGQRKLLFSGSIHYPRSTPEMWPS 65
Query: 61 LIQKSKDGGLDVIETYVFWNLHEPVRNQYNFEGRYDLVKFVKLVAEAGLYAHLRIGPYVC 120
LI+K+K+GG+DVI+TYVFWNLHEP QY+F GR DLVKF+K + GLY LRIGP++
Sbjct: 66 LIKKTKEGGIDVIQTYVFWNLHEPKLGQYDFSGRNDLVKFIKEIRSQGLYVCLRIGPFIE 125
Query: 121 AEWNFGGFPLWLHFIPGIQFRTDNEPFKAEMQRFTAKIVDMMKQEKLYASQGGPIILSQI 180
AEWN+GG P WL +PG+ +RTDNEPFK MQ+FTAKIVD+MK E LYASQGGPIILSQI
Sbjct: 126 AEWNYGGLPFWLRDVPGMVYRTDNEPFKFHMQKFTAKIVDLMKSEGLYASQGGPIILSQI 185
Query: 181 ENEYGNIDSAYGAAGKSYIKWAAGMALSLDTGVPWVMCQQSDAPDPIINTCNGFYCDQF- 239
ENEY N++ A+ G SYIKWA MA+ L TGVPW+MC+ DAPDP+INTCNG C +
Sbjct: 186 ENEYANVEGAFHEKGASYIKWAGQMAVGLKTGVPWIMCKSPDAPDPVINTCNGMKCGETF 245
Query: 240 -TPNSNNKPKMWTENWSGWFLSFGGAVPYRPVEDLAFAVARFFQRGGTFQNYYMYHGGTN 298
PNS NKPKMWTE+W+ +F +G R ED+AF A F + G++ NYYMYHGGTN
Sbjct: 246 PGPNSPNKPKMWTEDWTSFFQVYGKEPYIRSAEDIAFHAALFVAKNGSYINYYMYHGGTN 305
Query: 299 FDRTSGGPFISTSYDYDAPLDEYGLIRQPKWGHLKDLHKAIKLCEAALVATDPTYPSLGP 358
F RTS FI+ YD APLDEYGL+RQPK+GHLK+LH AIK L+ T SLGP
Sbjct: 306 FGRTSSSYFITGYYD-QAPLDEYGLLRQPKYGHLKELHAAIKSSANPLLQGKQTILSLGP 364
Query: 359 NLEATVYKTGSGLCSAFLANIGTNSDVTVKFNGNSYLLPAWSVSILPDCKNVVFNTAKIN 418
+A V++ + C AFL N + ++F N+Y L S+ IL +CKN+++ TAK+N
Sbjct: 365 MQQAYVFEDANNGCVAFLVNNDAKAS-QIQFRNNAYSLSPKSIGILQNCKNLIYETAKVN 423
Query: 419 SVTLVPSFSRQSLQVAADSSDAIGSGWSYINEPVGISKDDAFTKPGLLEQINTTADQSDY 478
V +R + V + + W+ E + + LLE N T D++DY
Sbjct: 424 ----VKMNTRVTTPVQVFN---VPDNWNLFRETIPAFPGTSLKTNALLEHTNLTKDKTDY 476
Query: 479 LWYSLSTNIKADEPLLEDGSKTVLHVQSLGHALHAFINGKLVGSGYGSSSNAKVTVDFPI 538
LWY +++ K D P + ++ +S GH +H F+N L GSG+GS V + P+
Sbjct: 477 LWY--TSSFKLDSPC----TNPSIYTESSGHVVHVFVNNALAGSGHGSRDIRVVKLQAPV 530
Query: 539 ALAPGKNTFDLLSLTVGLQNYGAFYEKTGAGITGPVQLKGSGNGTNIDLSSQQWTYQTGL 598
+L G+N +LS VGL + GA+ E+ G+T VQ+ G IDLS QW Y GL
Sbjct: 531 SLINGQNNISILSGMVGLPDSGAYMERRSYGLT-KVQISCGGTKP-IDLSRSQWGYSVGL 588
Query: 599 KGEELNF---PSGSSTQWD-SKSTLPKLQPLVWYKTTFDAPAGSEPVAIDFTGMGKGEAW 654
GE++ + + +W +K+ L K +PL WYKTTFD P G PV + + MGKGE W
Sbjct: 589 LGEKVRLYQWKNLNRVKWSMNKAGLIKNRPLAWYKTTFDGPNGDGPVGLHMSSMGKGEIW 648
Query: 655 VNGQSIGRYWPTYVSQNGGCTDSCNYRGAYSSNKCLKNCGKPSQSLYHVPRSWLKSSGNT 714
VNG+SIGRYW ++ L G+PSQS+YH+PR++LK SGN
Sbjct: 649 VNGESIGRYWVSF----------------------LTPAGQPSQSIYHIPRAFLKPSGNL 686
Query: 715 LVLFEEIGGDPTKISFVT 732
LV+FEE GGDP IS T
Sbjct: 687 LVVFEEEGGDPLGISLNT 704
>gi|297793965|ref|XP_002864867.1| beta-galactosidase 6 [Arabidopsis lyrata subsp. lyrata]
gi|297310702|gb|EFH41126.1| beta-galactosidase 6 [Arabidopsis lyrata subsp. lyrata]
Length = 716
Score = 665 bits (1715), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 351/734 (47%), Positives = 464/734 (63%), Gaps = 48/734 (6%)
Query: 7 LLLVLCWGFVVLATTSFGAN-VTYDHRAVVIGGKRRVLISGSIHYPRSTPEMWPDLIQKS 65
L L+L F+V + A VTYD R+++I G+R++L SGSIHYPRSTPEMWP LI+K+
Sbjct: 9 LCLILVGMFLVFPGGATAAKGVTYDGRSLIIDGQRKLLFSGSIHYPRSTPEMWPSLIKKT 68
Query: 66 KDGGLDVIETYVFWNLHEPVRNQYNFEGRYDLVKFVKLVAEAGLYAHLRIGPYVCAEWNF 125
K+GG+DVI+TYVFWNLHEP QY+F GR DLVKF+K + GLY LRIGP++ AEWN+
Sbjct: 69 KEGGIDVIQTYVFWNLHEPKLGQYDFSGRNDLVKFIKEIRSQGLYVCLRIGPFIEAEWNY 128
Query: 126 GGFPLWLHFIPGIQFRTDNEPFKAEMQRFTAKIVDMMKQEKLYASQGGPIILSQIENEYG 185
GG P WL +PG+ +RTDNEPFK MQ+FT KIV++MK E LYASQGGPIILSQIENEY
Sbjct: 129 GGLPFWLRDVPGMVYRTDNEPFKFHMQKFTTKIVNLMKSEGLYASQGGPIILSQIENEYA 188
Query: 186 NIDSAYGAAGKSYIKWAAGMALSLDTGVPWVMCQQSDAPDPIINTCNGFYCDQF--TPNS 243
N+++A+ G SYIKWA MA+ L TGVPW+MC+ DAPDP+INTCNG C + PNS
Sbjct: 189 NVEAAFHEKGASYIKWAGQMAVGLKTGVPWIMCKSPDAPDPVINTCNGMRCGETFPGPNS 248
Query: 244 NNKPKMWTENWSGWFLSFGGAVPYRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNFDRTS 303
NKPKMWTE+W+ +F +G R ED+AF F + G++ NYYMYHGGTNF RTS
Sbjct: 249 PNKPKMWTEDWTSFFQVYGTEPYIRSAEDIAFHAVLFIAKNGSYINYYMYHGGTNFGRTS 308
Query: 304 GGPFISTSYDYDAPLDEYGLIRQPKWGHLKDLHKAIKLCEAALVATDPTYPSLGPNLEAT 363
FI+ YD APLDEYGL+RQPK+GHLK+LH AIK L+ T SLGP +A
Sbjct: 309 SSYFITGYYD-QAPLDEYGLLRQPKYGHLKELHAAIKSSANPLLQGKQTILSLGPMQQAY 367
Query: 364 VYKTGSGLCSAFLANIGTNSDVT-VKFNGNSYLLPAWSVSILPDCKNVVFNTAKINSVTL 422
V++ S C AFL N ++ V+ ++F +SY L S+ IL +CKN+++ TAK+N
Sbjct: 368 VFEDASSGCVAFLVN--NDAKVSQIQFRKSSYSLSPKSIGILQNCKNLIYETAKVN---- 421
Query: 423 VPSFSRQSLQVAADSSDAIGSGWSYINEPVGISKDDAFTKPGLLEQINTTADQSDYLWYS 482
V R + V + + W E + + LLE N T D++DYLWY
Sbjct: 422 VEKNKRVTTPVQVFN---VPEKWEGFRETIPAFSGTSLKANALLEHTNLTKDKTDYLWY- 477
Query: 483 LSTNIKADEPLLEDGSKTVLHVQSLGHALHAFINGKLVGSGYGSSSNAKVTVDFPIALAP 542
+++ K D P + ++++S GH +H F+N L GSG+GS V + P +L
Sbjct: 478 -TSSFKPDSPC----TNPSIYIESSGHVVHVFVNNALAGSGHGSRDIKVVKLQVPASLTN 532
Query: 543 GKNTFDLLSLTVGLQNYGAFYEKTGAGITGPVQLKGSGNGTNIDLSSQQWTYQTGLKGEE 602
G+N+ +LS VGL + GA+ E+ G+T VQ+ G IDLS QW Y GL GE+
Sbjct: 533 GQNSISILSGMVGLPDSGAYMERKSYGLT-KVQIS-CGGTKPIDLSGSQWGYSVGLLGEK 590
Query: 603 LNFP---SGSSTQWD-SKSTLPKLQPLVWYKTTFDAPAGSEPVAIDFTGMGKGEAWVNGQ 658
+ + + +W + + L K +PL+WYKT FD P G PV ++ + MGKGE WVNG+
Sbjct: 591 VRLQQWRNLNRVKWSMNNAGLIKNRPLIWYKTIFDGPNGDGPVGLNMSSMGKGEIWVNGE 650
Query: 659 SIGRYWPTYVSQNGGCTDSCNYRGAYSSNKCLKNCGKPSQSLYHVPRSWLKSSGNTLVLF 718
SIGRYW ++++ + G PSQS+YH+PR +LK SGN LV+F
Sbjct: 651 SIGRYWVSFLTPS----------------------GHPSQSIYHIPREFLKPSGNLLVVF 688
Query: 719 EEIGGDPTKISFVT 732
EE GGDP IS T
Sbjct: 689 EEEGGDPLGISLNT 702
>gi|297798422|ref|XP_002867095.1| beta-galactosidase 11 [Arabidopsis lyrata subsp. lyrata]
gi|297312931|gb|EFH43354.1| beta-galactosidase 11 [Arabidopsis lyrata subsp. lyrata]
Length = 844
Score = 663 bits (1711), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 358/844 (42%), Positives = 497/844 (58%), Gaps = 68/844 (8%)
Query: 27 VTYDHRAVVIGGKRRVLISGSIHYPRSTPEMWPDLIQKSKDGGLDVIETYVFWNLHEPVR 86
VTYD +++I GKR +L SGSIHYPRSTPEMWP +I+++K GGL+ I+TYVFWN+HEP +
Sbjct: 40 VTYDGTSLIIDGKRELLYSGSIHYPRSTPEMWPSIIKRAKQGGLNTIQTYVFWNVHEPQQ 99
Query: 87 NQYNFEGRYDLVKFVKLVAEAGLYAHLRIGPYVCAEWNFGGFPLWLHFIPGIQFRTDNEP 146
++NF GR DLVKF+KL+ + G+Y LR+GP++ AEW GG P WL +PGI FRTDN+P
Sbjct: 100 GKFNFSGRADLVKFIKLIEKNGMYVTLRLGPFIQAEWTHGGLPYWLREVPGIFFRTDNKP 159
Query: 147 FKAEMQRFTAKIVDMMKQEKLYASQGGPIILSQIENEYGNIDSAYGAAGKSYIKWAAGMA 206
FK +R+ I+D MK+E+L+ASQGGPIIL QIENEY + AY G +YIKWA+ +
Sbjct: 160 FKEHTERYVRMILDKMKEERLFASQGGPIILGQIENEYSAVQRAYKQDGLNYIKWASKLV 219
Query: 207 LSLDTGVPWVMCQQSDAPDPIINTCNGFYC-DQFT-PNSNNKPKMWTENWSGWFLSFGGA 264
S+ G+PWVMC+Q+DAPDP+IN CNG +C D F PN NKP +WTENW+ F FG
Sbjct: 220 DSMKLGIPWVMCKQNDAPDPMINACNGRHCGDTFPGPNKENKPSLWTENWTTQFRVFGDP 279
Query: 265 VPYRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNFDRTSGGPFISTSYDYDAPLDEYGLI 324
R VED+A++VARFF + G+ NYYMYHGGTNF RTS +++T Y DAPLDEYGL
Sbjct: 280 PTQRSVEDIAYSVARFFSKNGSHVNYYMYHGGTNFGRTSAH-YVTTRYYDDAPLDEYGLE 338
Query: 325 RQPKWGHLKDLHKAIKLCEAALVATDPTYPSLGPNLEATVYKT-GSGLCSAFLANIGTNS 383
R+PK+GHLK LH A+ LC+ L+ P G + E Y+ G+ C+AFLAN T +
Sbjct: 339 REPKYGHLKHLHSALNLCKKPLLWGQPKTEKPGKDTEIRYYEQPGTKTCAAFLANNNTEA 398
Query: 384 DVTVKFNGNSYLLPAWSVSILPDCKNVVFNTAKINSVTLVPSFSRQS-------LQVAAD 436
T+KF G Y++ S+SILPDCK VV+NTA+I S +F + +V +
Sbjct: 399 AETIKFKGREYVIAPRSISILPDCKTVVYNTAQIVSQHTSRNFMKSKKANKKFDFKVFTE 458
Query: 437 SSDAIGSGWSYINEPVGISKDDAFTKPGLLEQINTTADQSDYLWYSLSTNIKADEPLLED 496
+ + G SYI PV E T D++DY WY+ S + + +
Sbjct: 459 TLPSKLEGNSYI--PV--------------ELYGLTKDKTDYGWYTTSFKVHKNHLPTKK 502
Query: 497 GSKTVLHVQSLGHALHAFINGKLVGSGYGSSSNAKVTVDFPIALAPGKNTFDLLSLTVGL 556
G KT + + SLGHALH ++NG+ +GSG+GS + L G+N +L + G
Sbjct: 503 GVKTFVRIASLGHALHIWLNGEYLGSGHGSHEEKSFVFQKQVTLKAGENHLIMLGVLTGF 562
Query: 557 QNYGAFYEKTGAGITGPVQLKGSGNGTNIDLS-SQQWTYQTGLKGEELNFPSGS---STQ 612
+ G++ E G G V + G +GT +DL+ S +W + G++GE+L + +
Sbjct: 563 PDSGSYMEHRYTGPRG-VSILGLTSGT-LDLTESSKWGNKIGMEGEKLGIHTEEGLKKVE 620
Query: 613 WDSKSTLPKLQPLVWYKTTFDAPAGSEPVAIDFTGMGKGEAWVNGQSIGRYWPTYVSQNG 672
W K K L WY+ FDAP AI GMGKG WVNG+ +GRYW +++S
Sbjct: 621 W--KKFTGKAPGLTWYQAYFDAPESLNAAAIRMNGMGKGLIWVNGEGVGRYWQSFLSP-- 676
Query: 673 GCTDSCNYRGAYSSNKCLKNCGKPSQSLYHVPRSWLKSSGNTLVLFEEIGG-DPTKISFV 731
G+P+Q YH+PRS+LK N LV+FEE P + FV
Sbjct: 677 --------------------LGQPTQIEYHIPRSFLKPKKNLLVIFEEEPNVKPELMDFV 716
Query: 732 TKQLGSSLCSHVTDSHPLPVDMWGSDSK----IQRKPGPVLSLECPNPNQVISSIKFASF 787
++CS+V +++ V W I +L+C + I++++FASF
Sbjct: 717 IVNR-DTVCSYVGENYTPSVRHWTRKQDQVQAITDNVSLTATLKCSGTKK-IAAVEFASF 774
Query: 788 GTPLGTCGSFSRGRCSSARSLSVVRQACVGSKSCSIGVSVNTF----GDPCKGVMKSLAV 843
G P+G CG+F+ G C++ S V+ + C+G C I V+ +TF D CK V K+LAV
Sbjct: 775 GNPIGVCGNFTLGTCNAPVSKQVIEKHCLGKAECVIPVNKSTFQQDKKDSCKNVAKTLAV 834
Query: 844 EASC 847
+ C
Sbjct: 835 QVKC 838
>gi|18418558|ref|NP_567973.1| beta-galactosidase 11 [Arabidopsis thaliana]
gi|75202765|sp|Q9SCV1.1|BGL11_ARATH RecName: Full=Beta-galactosidase 11; Short=Lactase 11; Flags:
Precursor
gi|6686894|emb|CAB64747.1| putative beta-galactosidase [Arabidopsis thaliana]
gi|332661046|gb|AEE86446.1| beta-galactosidase 11 [Arabidopsis thaliana]
Length = 845
Score = 660 bits (1703), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 356/844 (42%), Positives = 496/844 (58%), Gaps = 68/844 (8%)
Query: 27 VTYDHRAVVIGGKRRVLISGSIHYPRSTPEMWPDLIQKSKDGGLDVIETYVFWNLHEPVR 86
VTYD +++I GKR +L SGSIHYPRSTPEMWP +I+++K GGL+ I+TYVFWN+HEP +
Sbjct: 41 VTYDGTSLIIDGKRELLYSGSIHYPRSTPEMWPSIIKRAKQGGLNTIQTYVFWNVHEPQQ 100
Query: 87 NQYNFEGRYDLVKFVKLVAEAGLYAHLRIGPYVCAEWNFGGFPLWLHFIPGIQFRTDNEP 146
++NF GR DLVKF+KL+ + G+Y LR+GP++ AEW GG P WL +PGI FRTDN+
Sbjct: 101 GKFNFSGRADLVKFIKLIQKNGMYVTLRLGPFIQAEWTHGGLPYWLREVPGIFFRTDNKQ 160
Query: 147 FKAEMQRFTAKIVDMMKQEKLYASQGGPIILSQIENEYGNIDSAYGAAGKSYIKWAAGMA 206
FK +R+ I+D MK+E+L+ASQGGPIIL QIENEY + AY G +YIKWA+ +
Sbjct: 161 FKEHTERYVRMILDKMKEERLFASQGGPIILGQIENEYSAVQRAYKQDGLNYIKWASNLV 220
Query: 207 LSLDTGVPWVMCQQSDAPDPIINTCNGFYC-DQFT-PNSNNKPKMWTENWSGWFLSFGGA 264
S+ G+PWVMC+Q+DAPDP+IN CNG +C D F PN NKP +WTENW+ F FG
Sbjct: 221 DSMKLGIPWVMCKQNDAPDPMINACNGRHCGDTFPGPNRENKPSLWTENWTTQFRVFGDP 280
Query: 265 VPYRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNFDRTSGGPFISTSYDYDAPLDEYGLI 324
R VED+A++VARFF + GT NYYMYHGGTNF RTS +++T Y DAPLDEYGL
Sbjct: 281 PTQRSVEDIAYSVARFFSKNGTHVNYYMYHGGTNFGRTSAH-YVTTRYYDDAPLDEYGLE 339
Query: 325 RQPKWGHLKDLHKAIKLCEAALVATDPTYPSLGPNLEATVYKT-GSGLCSAFLANIGTNS 383
++PK+GHLK LH A+ LC+ L+ P G + E Y+ G+ C+AFLAN T +
Sbjct: 340 KEPKYGHLKHLHNALNLCKKPLLWGQPKTEKPGKDTEIRYYEQPGTKTCAAFLANNNTEA 399
Query: 384 DVTVKFNGNSYLLPAWSVSILPDCKNVVFNTAKINSVTLVPSFSRQS-------LQVAAD 436
T+KF G Y++ S+SILPDCK VV+NTA+I S +F + +V +
Sbjct: 400 AETIKFKGREYVIAPRSISILPDCKTVVYNTAQIVSQHTSRNFMKSKKANKKFDFKVFTE 459
Query: 437 SSDAIGSGWSYINEPVGISKDDAFTKPGLLEQINTTADQSDYLWYSLSTNIKADEPLLED 496
+ + G SYI PV E T D++DY WY+ S + + +
Sbjct: 460 TLPSKLEGNSYI--PV--------------ELYGLTKDKTDYGWYTTSFKVHKNHLPTKK 503
Query: 497 GSKTVLHVQSLGHALHAFINGKLVGSGYGSSSNAKVTVDFPIALAPGKNTFDLLSLTVGL 556
G KT + + SLGHALHA++NG+ +GSG+GS + L G+N +L + G
Sbjct: 504 GVKTFVRIASLGHALHAWLNGEYLGSGHGSHEEKSFVFQKQVTLKAGENHLVMLGVLTGF 563
Query: 557 QNYGAFYEKTGAGITGPVQLKGSGNGTNIDLS-SQQWTYQTGLKGEELNFPSGS---STQ 612
+ G++ E G G + + G +GT +DL+ S +W + G++GE+L + +
Sbjct: 564 PDSGSYMEHRYTGPRG-ISILGLTSGT-LDLTESSKWGNKIGMEGEKLGIHTEEGLKKVE 621
Query: 613 WDSKSTLPKLQPLVWYKTTFDAPAGSEPVAIDFTGMGKGEAWVNGQSIGRYWPTYVSQNG 672
W K K L WY+T FDAP I GMGKG WVNG+ +GRYW +++S
Sbjct: 622 W--KKFTGKAPGLTWYQTYFDAPESVSAATIRMHGMGKGLIWVNGEGVGRYWQSFLSP-- 677
Query: 673 GCTDSCNYRGAYSSNKCLKNCGKPSQSLYHVPRSWLKSSGNTLVLFEEIGG-DPTKISFV 731
G+P+Q YH+PRS+LK N LV+FEE P + F
Sbjct: 678 --------------------LGQPTQIEYHIPRSFLKPKKNLLVIFEEEPNVKPELMDFA 717
Query: 732 TKQLGSSLCSHVTDSHPLPVDMWGSDSK----IQRKPGPVLSLECPNPNQVISSIKFASF 787
++CS+V +++ V W I +L+C + I++++FASF
Sbjct: 718 IVNR-DTVCSYVGENYTPSVRHWTRKKDQVQAITDNVSLTATLKCSGTKK-IAAVEFASF 775
Query: 788 GTPLGTCGSFSRGRCSSARSLSVVRQACVGSKSCSIGVSVNTF----GDPCKGVMKSLAV 843
G P+G CG+F+ G C++ S V+ + C+G C I V+ +TF D CK V+K LAV
Sbjct: 776 GNPIGVCGNFTLGTCNAPVSKQVIEKHCLGKAECVIPVNKSTFQQDKKDSCKNVVKMLAV 835
Query: 844 EASC 847
+ C
Sbjct: 836 QVKC 839
>gi|357520325|ref|XP_003630451.1| Beta-galactosidase [Medicago truncatula]
gi|355524473|gb|AET04927.1| Beta-galactosidase [Medicago truncatula]
Length = 706
Score = 660 bits (1702), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 363/742 (48%), Positives = 455/742 (61%), Gaps = 68/742 (9%)
Query: 6 ILLLVLCWGFVVLATTSFGANVTYDHRAVVIGGKRRVLISGSIHYPRSTPEMWPDLIQKS 65
IL + LC T GANVTYD ++VI G ++L SGSIHYPRSTP+MWPDLI K+
Sbjct: 13 ILTVSLC--------TVHGANVTYDRTSLVINGHHKILFSGSIHYPRSTPQMWPDLISKA 64
Query: 66 KDGGLDVIETYVFWNLHEPVRNQYNFEGRYDLVKFVKLVAEAGLYAHLRIGPYVCAEWNF 125
K+GGLDVI+TYVFWNLHEP + QY F GR+DLV F+K + GLY LRIGPY+ +E +
Sbjct: 65 KEGGLDVIQTYVFWNLHEPQQGQYEFNGRFDLVGFIKEIQAQGLYVTLRIGPYIESECTY 124
Query: 126 GGFPLWLHFIPGIQFRTDNEPFKAEMQRFTAKIVDMMKQEKLYASQGGPIILSQIENEYG 185
GG PLWLH +PGI FRTDN+ FK MQRFT KIV+MMK L+ASQGGPIILSQIENEYG
Sbjct: 125 GGLPLWLHDVPGIVFRTDNDQFKFHMQRFTTKIVNMMKSANLFASQGGPIILSQIENEYG 184
Query: 186 NIDSAYGAAGKSYIKWAAGMALSLDTGVPWVMCQQSDAPDPIINTCNGFYCDQ-FT-PNS 243
+I S + A G YI WAA MA+ L TGVPW+MC+Q DAPDP+IN CNG C + F PNS
Sbjct: 185 SIQSKFRANGLPYIHWAAQMAVGLQTGVPWMMCKQDDAPDPVINACNGMQCGRNFKGPNS 244
Query: 244 NNKPKMWTENWSGWFLSFGGAVPYRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNFDRTS 303
NKP +WTENW+ + +FGGA R D+A+ VA F + G++ NYYMYHGGTNFDR +
Sbjct: 245 PNKPSLWTENWTSFLQAFGGAPYMRSASDIAYNVALFIAKKGSYVNYYMYHGGTNFDRLA 304
Query: 304 GGPFISTSYDYDAPLDEYGLIRQPKWGHLKDLHKAIKLCEAALVATDPTYPSLGPNLEAT 363
FI T+Y +APLDEYGL+RQPKWGHLK+LH +IK C L+ T SLG +
Sbjct: 305 SA-FIITAYYDEAPLDEYGLVRQPKWGHLKELHASIKSCSQPLLDGTQTTFSLGSEQQVI 363
Query: 364 VYKTGSGLCSAFLANIGTN----------SDVTVKFNGNSYLLPAWSVSILPDCKNVVFN 413
++ + + N DVT++F SY LP S+SILP CKNVVFN
Sbjct: 364 KNESSWTYFPLMFSEVPQNVLLSWKISGPRDVTIQFQNISYELPGKSISILPGCKNVVFN 423
Query: 414 TAKINSVTLVPSFSRQSLQVAADSSDAIGSGWSYINEPVGISKDDAFTKPGLLEQINTTA 473
T K++ V + + LQ + W E + + LL+QI+T
Sbjct: 424 TGKVSIQNNVRAM-KPRLQFNS------AENWKVYTEAIPNFAHTSKRADTLLDQISTAK 476
Query: 474 DQSDYLWYSLSTNIKADEPLLEDGSKTVLHVQSLGHALHAFINGKLVGSGYGSSSNAKVT 533
D SDY+WY+ N K+ +K+VL + S G LH+FING L GS +GS +N +VT
Sbjct: 477 DTSDYMWYTFRFNNKSPN------AKSVLSIYSQGDVLHSFINGVLTGSAHGSRNNTQVT 530
Query: 534 VDFPIALAPGKNTFDLLSLTVGLQNYGAFYEKTGAGITGPVQLKGSGNGTNIDLSSQQWT 593
+ + L G N +LS TVGL N GAF E AG+ V+++G D SS W
Sbjct: 531 MKKNVNLINGMNNISILSATVGLPNSGAFLESRVAGLR-KVEVQGR------DFSSYSWG 583
Query: 594 YQTGLKGEELNF--PSGSS-TQWDSKSTLPKLQPLVWYKTTFDAPAGSEPVAIDFTGMGK 650
YQ GL GE+L SGSS QW KS +PL WY+TTF APAG++PV ++ MGK
Sbjct: 584 YQVGLLGEKLQIFTVSGSSKVQW--KSFQSSTKPLTWYQTTFHAPAGNDPVVVNLGSMGK 641
Query: 651 GEAWVNGQSIGRYWPTYVSQNGGCTDSCNYRGAYSSNKCLKNCGKPSQSLYHVPRSWLKS 710
G AWVNGQ IGRYW ++ K G PSQ YH+PRS+LKS
Sbjct: 642 GLAWVNGQGIGRYWVSF----------------------HKPDGTPSQQWYHIPRSFLKS 679
Query: 711 SGNTLVLFEEIGGDPTKISFVT 732
+GN LV+ EE G+P I+ T
Sbjct: 680 TGNLLVILEEETGNPLGITLDT 701
>gi|222424922|dbj|BAH20412.1| AT3G13750 [Arabidopsis thaliana]
Length = 625
Score = 658 bits (1697), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 338/637 (53%), Positives = 417/637 (65%), Gaps = 18/637 (2%)
Query: 216 VMCQQSDAPDPIINTCNGFYCDQFTPNSNNKPKMWTENWSGWFLSFGGAVPYRPVEDLAF 275
V+C+Q DAPDPIIN CNGFYCD F+PN KPKMWTE W+GWF FGG VPYRP ED+AF
Sbjct: 1 VLCKQDDAPDPIINACNGFYCDYFSPNKAYKPKMWTEAWTGWFTKFGGPVPYRPAEDMAF 60
Query: 276 AVARFFQRGGTFQNYYMYHGGTNFDRTSGGPFISTSYDYDAPLDEYGLIRQPKWGHLKDL 335
+VARF Q+GG+F NYYMYHGGTNF RT+GGPFI+TSYDYDAPLDEYGL RQPKWGHLKDL
Sbjct: 61 SVARFIQKGGSFINYYMYHGGTNFGRTAGGPFIATSYDYDAPLDEYGLERQPKWGHLKDL 120
Query: 336 HKAIKLCEAALVATDPTYPSLGPNLEATVYKTGSGLCSAFLANIGTNSDVTVKFNGNSYL 395
H+AIKLCE ALV+ +PT LG EA VYK+ SG CSAFLAN S V F N Y
Sbjct: 121 HRAIKLCEPALVSGEPTRMPLGNYQEAHVYKSKSGACSAFLANYNPKSYAKVSFGNNHYN 180
Query: 396 LPAWSVSILPDCKNVVFNTAKINSVTLVPSFSRQSLQVAADSSDAIGSGWSYINEPVGIS 455
LP WS+SILPDCKN V+NTA++ + T R + G W NE
Sbjct: 181 LPPWSISILPDCKNTVYNTARVGAQTSRMKMVRVPVHG--------GLSWQAYNEDPSTY 232
Query: 456 KDDAFTKPGLLEQINTTADQSDYLWYSLSTNIKADEPLLEDGSKTVLHVQSLGHALHAFI 515
D++FT GL+EQINTT D SDYLWY + A+E L +G L V S GHA+H FI
Sbjct: 233 IDESFTMVGLVEQINTTRDTSDYLWYMTDVKVDANEGFLRNGDLPTLTVLSAGHAMHVFI 292
Query: 516 NGKLVGSGYGSSSNAKVTVDFPIALAPGKNTFDLLSLTVGLQNYGAFYEKTGAGITGPVQ 575
NG+L GS YGS + K+T + L G N +LS+ VGL N G +E AG+ GPV
Sbjct: 293 NGQLSGSAYGSLDSPKLTFRKGVNLRAGFNKIAILSIAVGLPNVGPHFETWNAGVLGPVS 352
Query: 576 LKGSGNGTNIDLSSQQWTYQTGLKGE---ELNFPSGSSTQWDSKSTLPKLQPLVWYKTTF 632
L G NG DLS Q+WTY+ GLKGE + SS +W + + + QPL WYKTTF
Sbjct: 353 LNGL-NGGRRDLSWQKWTYKVGLKGESLSLHSLSGSSSVEWAEGAFVAQKQPLTWYKTTF 411
Query: 633 DAPAGSEPVAIDFTGMGKGEAWVNGQSIGRYWPTYVSQNGGCTDSCNYRGAYSSNKCLKN 692
APAG P+A+D MGKG+ W+NGQS+GR+WP Y + G C++ C+Y G + +KCL+N
Sbjct: 412 SAPAGDSPLAVDMGSMGKGQIWINGQSLGRHWPAYKAV-GSCSE-CSYTGTFREDKCLRN 469
Query: 693 CGKPSQSLYHVPRSWLKSSGNTLVLFEEIGGDPTKISFVTKQLGSSLCSHVTDSHPLPVD 752
CG+ SQ YHVPRSWLK SGN LV+FEE GGDP I+ V +++ S+C+ + + V+
Sbjct: 470 CGEASQRWYHVPRSWLKPSGNLLVVFEEWGGDPNGITLVRREV-DSVCADIYEWQSTLVN 528
Query: 753 -MWGSDSKIQRKPGPVLSLECPNPNQVISSIKFASFGTPLGTCGSFSRGRCSSARSLSVV 811
+ K+ + P L+C P Q I+++KFASFGTP GTCGS+ +G C + S
Sbjct: 529 YQLHASGKVNKPLHPKAHLQC-GPGQKITTVKFASFGTPEGTCGSYRQGSCHAHHSYDAF 587
Query: 812 RQACVGSKSCSIGVSVNTF-GDPCKGVMKSLAVEASC 847
+ CVG CS+ V+ F GDPC VMK LAVEA C
Sbjct: 588 NKLCVGQNWCSVTVAPEMFGGDPCPNVMKKLAVEAVC 624
>gi|222618606|gb|EEE54738.1| hypothetical protein OsJ_02090 [Oryza sativa Japonica Group]
Length = 713
Score = 656 bits (1692), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 334/667 (50%), Positives = 425/667 (63%), Gaps = 55/667 (8%)
Query: 27 VTYDHRAVVIGGKRRVLISGSIHYPRSTPEMWPDLIQKSKDGGLDVIETYVFWNLHEPVR 86
V+YD R++VI G+RR+++SGSIHYPRSTPEMWPDLI+K+K+GGLD IETY+FWN HEP R
Sbjct: 31 VSYDDRSLVIDGQRRIILSGSIHYPRSTPEMWPDLIKKAKEGGLDAIETYIFWNGHEPHR 90
Query: 87 NQYNFEGRYDLVKFVKLVAEAGLYAHLRIGPYVCAEWNFGGFPLWLHFIPGIQFRTDNEP 146
QYNFEG YD+V+F K + AG+YA LRIGPY+C EWN+GG P WL IPG+QFR NEP
Sbjct: 91 RQYNFEGNYDVVRFFKEIQNAGMYAILRIGPYICGEWNYGGLPAWLRDIPGMQFRLHNEP 150
Query: 147 FKAEMQRFTAKIVDMMKQEKLYASQGGPIILSQIENEYGNIDSAY--GAAGKSYIKWAAG 204
F+ EM+ FT IV+ MK K++A QGGPIIL+QIENEYGNI + YI W A
Sbjct: 151 FENEMETFTTLIVNKMKDSKMFAEQGGPIILAQIENEYGNIMGKLNNNQSASEYIHWCAD 210
Query: 205 MALSLDTGVPWVMCQQ-SDAPDPIINTCNGFYCDQFTPNSNNKPKMWTENWSGWFLSFGG 263
MA + GVPW+MCQQ D P ++NTCNGFYC + PN PK+WTENW+GWF ++
Sbjct: 211 MANKQNVGVPWIMCQQDDDVPHNVVNTCNGFYCHDWFPNRTGIPKIWTENWTGWFKAWDK 270
Query: 264 AVPYRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNFDRTSGGPFISTSYDYDAPLDEYGL 323
+R ED+AFAVA FFQ+ G+ QNYYMYHGGTNF RTSGGP+I+TSYDYDAPLDEYG
Sbjct: 271 PDFHRSAEDIAFAVAMFFQKRGSLQNYYMYHGGTNFGRTSGGPYITTSYDYDAPLDEYGN 330
Query: 324 IRQPKWGHLKDLHKAIKLCEAALVATDPTYPSLGPNLEATVYKTGSGLCSAFLANIGTNS 383
+RQPK+GHLK+LH +K E LV + + G N+ T Y S + F+ N +
Sbjct: 331 LRQPKYGHLKELHSVLKSMEKTLVHGEYFDTNYGDNITVTKYTLDSS-SACFINNRFDDK 389
Query: 384 DVTVKFNGNSYLLPAWSVSILPDCKNVVFNTAKINSVTLVPSFSRQSLQVAADSSDAIGS 443
DV V +G ++LLPAWSVSILPDCK V FN+AKI + T V + + +S
Sbjct: 390 DVNVTLDGATHLLPAWSVSILPDCKTVAFNSAKIKTQTSVMVKKPNTAEQEQESLK---- 445
Query: 444 GWSYINE---PVGISKDDAFTKPGLLEQINTTADQSDYLWYSLSTNIKADEPLLEDGSKT 500
WS++ E P + F K LLEQI T+ DQSDYLWY S N K +
Sbjct: 446 -WSWMPENLSPFMTDEKGNFRKNELLEQIVTSTDQSDYLWYRTSLNHKGE-------GSY 497
Query: 501 VLHVQSLGHALHAFINGKLVGSGYGSSSNAKVTVDFPIALAPGKNTFDLLSLTVGLQNYG 560
L+V + GH L+AF+NGKL+G + + + ++ P+ L GKN LLS TVGL+NYG
Sbjct: 498 KLYVNTTGHELYAFVNGKLIGKNHSADGDFVFQLESPVKLHDGKNYISLLSATVGLKNYG 557
Query: 561 AFYEKTGAGIT-GPVQLKGSGNGTNIDLSSQQWTYQTGLKGEELNFPSGSSTQWDSKSTL 619
+EK GI GPV+L S NGT IDLS+ W+
Sbjct: 558 PSFEKMPTGIVGGPVKLIDS-NGTAIDLSNSSWS-------------------------- 590
Query: 620 PKLQPLVWYKTTFDAPAGSEPVAIDFTGMGKGEAWVNGQSIGRYWPTYVSQNGGCTDSCN 679
YK TF+AP+G +PV +D G+ KG AWVNG ++GRYWP+Y + C+
Sbjct: 591 --------YKATFEAPSGEDPVVVDLLGLNKGVAWVNGNNLGRYWPSYTAAEMAGCHRCD 642
Query: 680 YRGAYSS 686
YRGA+ +
Sbjct: 643 YRGAFQA 649
>gi|222631666|gb|EEE63798.1| hypothetical protein OsJ_18622 [Oryza sativa Japonica Group]
Length = 765
Score = 654 bits (1686), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 370/844 (43%), Positives = 490/844 (58%), Gaps = 121/844 (14%)
Query: 22 SFGANVTYDHRAVVIGGKRRVLISGSIHYPRSTPEMWPDLIQKSKDGGLDVIETYVFWNL 81
G +TYD RA+V+ G RR+ SG +HY RSTPEMWP LI K+K+GGLDVI+TYVFWN+
Sbjct: 24 ELGREITYDGRALVVSGARRMFFSGDMHYARSTPEMWPKLIAKAKNGGLDVIQTYVFWNV 83
Query: 82 HEPVRNQYNFEGRYDLVKFVKLVAEAGLYAHLRIGPYVCAEWNFGGFPLWLHFIPGIQFR 141
HEP++ QYNFEGRYDLVKF++ + GLY LRIGP+V AEW +GGFP WLH +P I FR
Sbjct: 84 HEPIQGQYNFEGRYDLVKFIREIQAQGLYVSLRIGPFVEAEWKYGGFPFWLHDVPSITFR 143
Query: 142 TDNEPFKAEMQRFTAKIVDMMKQEKLYASQGGPIILSQIENEYGNIDSAYGAAGKSYIKW 201
+DNEPFK MQ F KIV MMK E LY QGGPII+SQIENEY I+ A+GA+G Y++W
Sbjct: 144 SDNEPFKQHMQNFVTKIVTMMKHEGLYYPQGGPIIISQIENEYQMIEPAFGASGPRYVRW 203
Query: 202 AAGMALSLDTGVPWVMCQQSDAPDPIINTCNGFYCDQ--FTPNSNNKPKMWTENWSGWFL 259
AA MA+ L TGVPW+MC+Q+DAPDP+INTCNG C + PNS NKP +WTENW+ +
Sbjct: 204 AAAMAVGLQTGVPWMMCKQNDAPDPVINTCNGLICGETFVGPNSPNKPALWTENWTSRYP 263
Query: 260 SFGGAVPYRPVEDLAFAVARFFQR-GGTFQNYYMYHGGTNFDRTSGGPFISTSYDYDAPL 318
+G R ED+AFAVA F R G+F +YYMYHGGTNF R + +++TSY APL
Sbjct: 264 IYGNDTKLRAPEDIAFAVALFIARKKGSFVSYYMYHGGTNFGRFAAS-YVTTSYYDGAPL 322
Query: 319 DEYGLIRQPKWGHLKDLHKAIKLCEAALVATDPTYPSLGPNLEATVYKTGSGLCSAFLAN 378
DEY C AFL N
Sbjct: 323 DEYDF-----------------------------------------------KCVAFLVN 335
Query: 379 IGTNSDVTVKFNGNSYLLPAWSVSILPDCKNVVFNTAKINSVTLVPSFSRQSLQVAADSS 438
++ V+F S L S+S+L DC+NVVF TAK+N+ Q + ++
Sbjct: 336 FDQHNTPKVEFRNISLELAPKSISVLSDCRNVVFETAKVNA------------QHGSRTA 383
Query: 439 DAIGS-----GWSYINEPV--GISKDDAFTKPGLLEQINTTADQSDYLWYSLSTNIKADE 491
+A+ S W EPV +SK +T L EQ+ TT D++DYLWY +S +A
Sbjct: 384 NAVQSLNDINNWKAFIEPVPQDLSK-STYTGNQLFEQLTTTKDETDYLWYIVSYKNRAS- 441
Query: 492 PLLEDGSKTV-LHVQSLGHALHAFINGKLVGSGYGSSSNAK-VTVDFPIALAPGKNTFDL 549
DG++ L+V+SL H LHAF+N + VGS +GS + + ++ ++L G NT L
Sbjct: 442 ----DGNQIAHLYVKSLAHILHAFVNNEYVGSVHGSHDGPRNIVLNTHMSLKEGDNTISL 497
Query: 550 LSLTVGLQNYGAFYEKTGAGI--TGPVQLKGSGNGTNIDLSSQQWTYQTGLKGEELNF-- 605
LS+ VG + GA+ E+ GI G Q + + N DL W YQ GL GE+ +
Sbjct: 498 LSVMVGSPDSGAYMERRTFGIQTVGIQQGQQPMHLLNNDL----WGYQVGLFGEKDSIYT 553
Query: 606 -PSGSSTQWDSKSTLPKLQPLVWYKTTFDAPAGSEPVAIDFTGMGKGEAWVNGQSIGRYW 664
+S +W + L PL WYKTTF P G++ V ++ T MGKGE WVNG+SIGRYW
Sbjct: 554 QEGTNSVRWMDINNL-IYHPLTWYKTTFSTPPGNDAVTLNLTSMGKGEVWVNGESIGRYW 612
Query: 665 PTYVSQNGGCTDSCNYRGAYSSNKCLKNCGKPSQSLYHVPRSWLKSSGNTLVLFEEIGGD 724
++ + + G+PSQSLYH+PR +L N LVL EE+GGD
Sbjct: 613 VSFKAPS----------------------GQPSQSLYHIPRGFLTPKDNLLVLVEEMGGD 650
Query: 725 PTKISFVTKQLGSSLCSHVTDSHPLPVDMWGSDSKIQRKPGPVLSLECPNPNQVISSIKF 784
P +I+ T + +++C +V + P+ G K++ + C N+ ISSI+F
Sbjct: 651 PLQITVNTMSV-TTVCGNVDEFSVPPLQSRGKVPKVR--------IWCQGGNR-ISSIEF 700
Query: 785 ASFGTPLGTCGSFSRGRCSSARSLSVVRQACVGSKSCSIGVSVNTF-GDPCKGVMKSLAV 843
AS+G P+G C SF G C + S SVV+Q+C+G + CSI V F GDPC G+ KSL V
Sbjct: 701 ASYGNPVGDCRSFRIGSCHAESSESVVKQSCIGRRGCSIPVMAAKFGGDPCPGIQKSLLV 760
Query: 844 EASC 847
A C
Sbjct: 761 VADC 764
>gi|449451942|ref|XP_004143719.1| PREDICTED: beta-galactosidase 7-like [Cucumis sativus]
Length = 613
Score = 651 bits (1680), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 334/625 (53%), Positives = 417/625 (66%), Gaps = 26/625 (4%)
Query: 57 MWPDLIQKSKDGGLDVIETYVFWNLHEPVRNQYNFEGRYDLVKFVKLVAEAGLYAHLRIG 116
MWPDLIQK+KDGGLD IETY+FW+ HEP R +Y+F GR D +KF +L+ +AGLY +RIG
Sbjct: 1 MWPDLIQKAKDGGLDAIETYIFWDRHEPQRRKYDFSGRLDFIKFFQLIQDAGLYVVMRIG 60
Query: 117 PYVCAEWNFGGFPLWLHFIPGIQFRTDNEPFKAEMQRFTAKIVDMMKQEKLYASQGGPII 176
PYVCAEWN+GGFP+WLH +PGIQ RT+N+ +K EMQ FT KIV+M KQ L+ASQGGPII
Sbjct: 61 PYVCAEWNYGGFPVWLHNMPGIQLRTNNQVYKNEMQTFTTKIVNMCKQANLFASQGGPII 120
Query: 177 LSQIENEYGNIDS-AYGAAGKSYIKWAAGMALSLDTGVPWVMCQQSDAPDPIINTCNGFY 235
L+QIENEYGN+ + AYG AGK+YI W A MA SL+ GVPW+MCQQSDAP P+INTCNGFY
Sbjct: 121 LAQIENEYGNVMTPAYGDAGKAYINWCAQMAESLNIGVPWIMCQQSDAPQPMINTCNGFY 180
Query: 236 CDQFTPNSNNKPKMWTENWSGWFLSFGGAVPYRPVEDLAFAVARFFQRGGTFQNYYMYHG 295
CD FTPN+ PKM+TENW GWF +G PYR ED+AF+VARFFQ GG F NYYMYHG
Sbjct: 181 CDNFTPNNPKSPKMFTENWVGWFKKWGDKDPYRTAEDVAFSVARFFQSGGVFNNYYMYHG 240
Query: 296 GTNFDRTSGGPFISTSYDYDAPLDEYGLIRQPKWGHLKDLHKAIKLCEAALVATDPTYPS 355
GTNF RTSGGPFI+TSYDY+APLDEYG + QPKWGHLK LH +IKL E L + + +
Sbjct: 241 GTNFGRTSGGPFITTSYDYNAPLDEYGNLNQPKWGHLKQLHASIKLGEKILTNSTRSNQN 300
Query: 356 LGPNLEATVYK---TGSGLCSAFLANIGTNSDVTVKFNGN-SYLLPAWSVSILPDCKNVV 411
G ++ T + TG C FL+N +D T+ + Y +PAWSVSIL C V
Sbjct: 301 FGSSVTLTKFSNPTTGERFC--FLSNTDGKNDATIDLQEDGKYFVPAWSVSILDGCNKEV 358
Query: 412 FNTAKINSVTLVPSFSRQSLQVAADSSDAIGSGWSYINEPVG--ISKDDAFTKPGLLEQI 469
+NTAK+NS T + + + A S W++ EP+ + + F LLEQ
Sbjct: 359 YNTAKVNSQTSMFVKEQNEKENAQLS-------WAWAPEPMKDTLQGNGKFAANLLLEQK 411
Query: 470 NTTADQSDYLWYSLSTNIKADEPLLEDGSKTVLHVQSLGHALHAFINGKLVGSGYGSSSN 529
T D SDY WY + L L V + GH LHAF+N + +GS +GS+
Sbjct: 412 RVTVDFSDYFWYMTKVDTNGTSSL----QNVTLQVNTKGHVLHAFVNKRYIGSKWGSNGQ 467
Query: 530 AKVTVDFPIALAPGKNTFDLLSLTVGLQNYGAFYEKTGAGIT-GPVQLKGSGNGTNIDLS 588
+ V + PI L G NT LLS TVGL+NY AFY+ GI GP+ L G GN T DLS
Sbjct: 468 SFV-FEKPILLKSGINTITLLSATVGLKNYDAFYDMVPTGIDGGPIYLIGDGNVT-TDLS 525
Query: 589 SQQWTYQTGLKGE--ELNFPSGSS-TQWDSKSTLPKLQPLVWYKTTFDAPAGSEPVAIDF 645
S W+Y+ GL GE ++ P S T W + + + WYKT+F PAG +PV +D
Sbjct: 526 SNLWSYKVGLNGEMKQIYNPVFSQRTNWIPLNQKSIGRRMTWYKTSFKTPAGIDPVVLDM 585
Query: 646 TGMGKGEAWVNGQSIGRYWPTYVSQ 670
GMGKG+AWVNGQSIGR+WP+++ +
Sbjct: 586 QGMGKGQAWVNGQSIGRFWPSFIXK 610
>gi|218196839|gb|EEC79266.1| hypothetical protein OsI_20049 [Oryza sativa Indica Group]
Length = 761
Score = 650 bits (1677), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 368/844 (43%), Positives = 490/844 (58%), Gaps = 121/844 (14%)
Query: 22 SFGANVTYDHRAVVIGGKRRVLISGSIHYPRSTPEMWPDLIQKSKDGGLDVIETYVFWNL 81
G +TYD RA+V+ G RR+ SG +HY RSTPEMWP LI K+K+GGLDVI+TYVFWN+
Sbjct: 20 ELGREITYDGRALVVSGARRMFFSGDMHYARSTPEMWPKLIAKAKNGGLDVIQTYVFWNV 79
Query: 82 HEPVRNQYNFEGRYDLVKFVKLVAEAGLYAHLRIGPYVCAEWNFGGFPLWLHFIPGIQFR 141
HEP++ QYNFEGRYDLVKF++ + GLY LRIGP+V AEW +GGFP WLH +P I FR
Sbjct: 80 HEPIQGQYNFEGRYDLVKFIREIQAQGLYVSLRIGPFVEAEWKYGGFPFWLHDVPSITFR 139
Query: 142 TDNEPFKAEMQRFTAKIVDMMKQEKLYASQGGPIILSQIENEYGNIDSAYGAAGKSYIKW 201
+DNEPFK MQ F KIV MMK E LY QGGPII+SQIENEY I+ A+GA+G Y++W
Sbjct: 140 SDNEPFKQHMQNFVTKIVTMMKHEGLYYPQGGPIIISQIENEYQMIEPAFGASGPRYVRW 199
Query: 202 AAGMALSLDTGVPWVMCQQSDAPDPIINTCNGFYCDQ--FTPNSNNKPKMWTENWSGWFL 259
AA MA+ L TGVPW+MC+Q+DAPDP+INTCNG C + PNS NKP +WTENW+ +
Sbjct: 200 AAAMAVGLQTGVPWMMCKQNDAPDPVINTCNGLICGETFVGPNSPNKPALWTENWTSRYP 259
Query: 260 SFGGAVPYRPVEDLAFAVARFFQR-GGTFQNYYMYHGGTNFDRTSGGPFISTSYDYDAPL 318
+G R ED+AFAVA + R G+F +YYMYHGGTNF R + +++TSY APL
Sbjct: 260 IYGNDTKLRDPEDIAFAVALYIARKKGSFVSYYMYHGGTNFGRFAAS-YVTTSYYDGAPL 318
Query: 319 DEYGLIRQPKWGHLKDLHKAIKLCEAALVATDPTYPSLGPNLEATVYKTGSGLCSAFLAN 378
DEY C AFL N
Sbjct: 319 DEYDF-----------------------------------------------KCVAFLVN 331
Query: 379 IGTNSDVTVKFNGNSYLLPAWSVSILPDCKNVVFNTAKINSVTLVPSFSRQSLQVAADSS 438
++ V+F S L S+S+L DC+NVVF TAK+N+ Q + ++
Sbjct: 332 FDQHNTPKVEFRNISLELAPKSISVLSDCRNVVFETAKVNA------------QHGSRTA 379
Query: 439 DAIGS-----GWSYINEPV--GISKDDAFTKPGLLEQINTTADQSDYLWYSLSTNIKADE 491
+A+ S W EPV +SK +T L EQ+ TT D++DYLWY +S +A
Sbjct: 380 NAVQSLNDINNWKAFIEPVPQDLSK-STYTGNQLFEQLTTTKDETDYLWYIVSYKNRA-- 436
Query: 492 PLLEDGSKTV-LHVQSLGHALHAFINGKLVGSGYGSSSNAK-VTVDFPIALAPGKNTFDL 549
DG++ L+V+SL H LHAF+N + VGS +GS + + ++ ++L G NT L
Sbjct: 437 ---SDGNQIARLYVKSLAHILHAFVNNEYVGSVHGSHDGPRNIVLNTHMSLKEGDNTISL 493
Query: 550 LSLTVGLQNYGAFYEKTGAGI--TGPVQLKGSGNGTNIDLSSQQWTYQTGLKGEELNFPS 607
LS+ VG + GA+ E+ GI G Q + + N DL W YQ GL GE+ + +
Sbjct: 494 LSVMVGSPDSGAYMERRTFGIQTVGIQQGQQPMHLLNNDL----WGYQVGLFGEKDSIYT 549
Query: 608 G---SSTQWDSKSTLPKLQPLVWYKTTFDAPAGSEPVAIDFTGMGKGEAWVNGQSIGRYW 664
+S +W + L PL WYKTTF P G++ V ++ T MGKGE WVNG+SIGRYW
Sbjct: 550 QEGPNSVRWMDINNL-IYHPLTWYKTTFSTPPGNDAVTLNLTSMGKGEVWVNGESIGRYW 608
Query: 665 PTYVSQNGGCTDSCNYRGAYSSNKCLKNCGKPSQSLYHVPRSWLKSSGNTLVLFEEIGGD 724
++ + + G+PSQSLYH+PR +L N LVL EE+GGD
Sbjct: 609 VSFKAPS----------------------GQPSQSLYHIPRGFLTPKDNLLVLVEEMGGD 646
Query: 725 PTKISFVTKQLGSSLCSHVTDSHPLPVDMWGSDSKIQRKPGPVLSLECPNPNQVISSIKF 784
P +I+ T + +++C +V + P+ G K++ + C + ISSI+F
Sbjct: 647 PLQITVNTMSV-TTVCGNVDEFSVPPLQSRGKVPKVR--------IWCQGGKR-ISSIEF 696
Query: 785 ASFGTPLGTCGSFSRGRCSSARSLSVVRQACVGSKSCSIGVSVNTF-GDPCKGVMKSLAV 843
AS+G P+G C SF G C + S SVV+Q+C+G + CSI V F GDPC G+ KSL V
Sbjct: 697 ASYGNPVGDCRSFRIGSCHAESSESVVKQSCIGRRGCSIPVMAAKFGGDPCPGIQKSLLV 756
Query: 844 EASC 847
A C
Sbjct: 757 VADC 760
>gi|449454199|ref|XP_004144843.1| PREDICTED: beta-galactosidase 13-like [Cucumis sativus]
gi|449506996|ref|XP_004162905.1| PREDICTED: beta-galactosidase 13-like [Cucumis sativus]
Length = 766
Score = 650 bits (1676), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 350/808 (43%), Positives = 475/808 (58%), Gaps = 59/808 (7%)
Query: 57 MWPDLIQKSKDGGLDVIETYVFWNLHEPVRNQYNFEGRYDLVKFVKLVAEAGLYAHLRIG 116
MW D++ K++ GGL+VI+TYVFWN+HEPV Q+NFEG YDLVKF+KL+ E +Y LR+G
Sbjct: 1 MWSDILDKARRGGLNVIQTYVFWNIHEPVEGQFNFEGNYDLVKFIKLIGEKQMYVTLRVG 60
Query: 117 PYVCAEWNFGGFPLWLHFIPGIQFRTDNEPFKAEMQRFTAKIVDMMKQEKLYASQGGPII 176
P++ AEWN GG P WL P I FR+ N FK M+++ A IVDMMK+ KL+ASQGGPI+
Sbjct: 61 PFIQAEWNHGGLPYWLREKPNIIFRSYNSQFKHYMKKYVAMIVDMMKENKLFASQGGPIV 120
Query: 177 LSQIENEYGNIDSAYGAAGKSYIKWAAGMALSLDTGVPWVMCQQSDAPDPIINTCNGFYC 236
L+QIENEY ++ AY G Y++WAA MA+ L GVPW+MC+Q DAPDP+INTCNG +C
Sbjct: 121 LAQIENEYNHVQLAYDELGVQYVQWAANMAVGLGVGVPWIMCKQKDAPDPVINTCNGRHC 180
Query: 237 -DQFT-PNSNNKPKMWTENWSGWFLSFGGAVPYRPVEDLAFAVARFFQRGGTFQNYYMYH 294
D FT PN KP +WTENW+ + FG R ED+AF+VARFF + G+ NYYMYH
Sbjct: 181 GDTFTGPNKPYKPALWTENWTAQYRVFGDPPSQRAAEDIAFSVARFFSKNGSLVNYYMYH 240
Query: 295 GGTNFDRTSGGPFISTSYDYDAPLDEYGLIRQPKWGHLKDLHKAIKLCEAALVATDPTYP 354
GGTNF RTS F +T Y +APLDE+GL R+PKWGHL+D+HKA+ LC+ L+ P
Sbjct: 241 GGTNFGRTS-AVFTTTRYYDEAPLDEFGLQREPKWGHLRDVHKALNLCKKPLLWGTPGIQ 299
Query: 355 SLGPNLEATVY-KTGSGLCSAFLANIGTNSDVTVKFNGNSYLLPAWSVSILPDCKNVVFN 413
+G LEA Y K G+ +C+AFLAN T S T+ F G +LLP S+SILPDCK VVFN
Sbjct: 300 VIGKGLEARFYEKPGTNICAAFLANNDTKSAQTINFRGREFLLPPRSISILPDCKTVVFN 359
Query: 414 TAKI----NSVTLVPSFSRQSLQVAADSSDAIGSGWSYINEPVGISKDDAFTKPGLLEQI 469
T I N+ +PS + L+ W E + + LE
Sbjct: 360 TETIVSQHNARNFIPSKNANKLK------------WKMSPESIPTVEQVPVNNKIPLELY 407
Query: 470 NTTADQSDYLWYSLSTNIKADEPLLEDGSKTVLHVQSLGHALHAFINGKLVGSGYGSSSN 529
+ D +DY WY+ S + ++ VL + SLGHA+ F+NG+ +G+ +GS
Sbjct: 408 SLLKDTTDYGWYTTSIELDKEDVSKRPDILPVLRIASLGHAMLVFVNGEYIGTAHGSHEE 467
Query: 530 AKVTVDFPIALAPGKNTFDLLSLTVGLQNYGAFYEKTGAGITGPVQLKGSGNGTNIDLSS 589
+ G N LL + VGL + GA+ E AG + + G GT +D+S
Sbjct: 468 KNFVFQGSVPFKAGVNNIALLGILVGLPDSGAYMEHRFAGPRS-ITILGLNTGT-LDISK 525
Query: 590 QQWTYQTGLKGEELN-FPSGSSTQWDSKSTLPKLQPLVWYKTTFDAPAGSEPVAIDFTGM 648
W +Q L+GE++ F G S + D + L WYKT FDAP G++PVAI GM
Sbjct: 526 NGWGHQVALQGEKVKVFTQGGSHRVDWSEIKEEKSALTWYKTYFDAPEGNDPVAIRMNGM 585
Query: 649 GKGEAWVNGQSIGRYWPTYVSQNGGCTDSCNYRGAYSSNKCLKNCGKPSQSLYHVPRSWL 708
GKG+ WVNG+SIGRYW +Y+S T QS YH+PRS++
Sbjct: 586 GKGQIWVNGKSIGRYWMSYLSPLKLST----------------------QSEYHIPRSFI 623
Query: 709 KSSGNTLVLFEEIGGDPTKISFVTKQLGSSLCSHVTDSHPLPVDMWGSDSKIQR------ 762
K S N LV+ EE P K+ + ++CS +T HP V W K R
Sbjct: 624 KPSENLLVILEEENVTPEKVEILLVNR-DTICSFITQYHPPNVKSWERKDKQFRAVVDDV 682
Query: 763 KPGPVLSLECPNPNQVISSIKFASFGTPLGTCGSFSRGRC-SSARSLSVVRQACVGSKSC 821
K G L CP+ ++ I++I+FASFG P G CG+F G+C SS+ + +V Q C+G ++C
Sbjct: 683 KTGA--HLRCPH-DKKITNIEFASFGDPSGVCGNFEHGKCHSSSDTKKLVEQHCLGKENC 739
Query: 822 SIGV-SVNTFGDPCKGVMKSLAVEASCT 848
S+ + + + F + C K+LA++A C+
Sbjct: 740 SVPMDAFDNFKNECDS--KTLAIQAKCS 765
>gi|222616997|gb|EEE53129.1| hypothetical protein OsJ_35927 [Oryza sativa Japonica Group]
Length = 740
Score = 648 bits (1672), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 349/675 (51%), Positives = 417/675 (61%), Gaps = 69/675 (10%)
Query: 26 NVTYDHRAVVIGGKRRVLISGSIHYPRSTPEMWPDLIQKSKDGGLDVIETYVFWNLHEPV 85
NVTYDHRAV+IGGKRR+L+S +HYPR+TPEMWP LI K K+GG DVIETYVFWN HEP
Sbjct: 63 NVTYDHRAVLIGGKRRMLVSAGLHYPRATPEMWPSLIAKCKEGGADVIETYVFWNGHEPA 122
Query: 86 RNQYNFEGRYDLVKFVKL-----------------------------------VAEAGLY 110
+ QY FE R+DLVKF K+ A+ Y
Sbjct: 123 KGQYYFEERFDLVKFAKIDLVKFAKLMWPSLIAKCKEGGADVIETYVFWNGHEPAKGQYY 182
Query: 111 AHLRIGPYVCAEWNFGGFPLWLHFIPGIQFRTDNEPFKAEMQRFTAKIVDMMKQEKLYAS 170
R P + GFP+WL IPGI+FRTDNEPFKAEMQ F KIV +MK+EKLY+
Sbjct: 183 FEERFDPVKFEKHVIFGFPVWLRDIPGIEFRTDNEPFKAEMQTFVTKIVTLMKEEKLYSW 242
Query: 171 QGGPIILSQIENEYGNIDSAYGAAGKSYIKWAAGMALSLDTGVPWVMCQQSDAPDPIINT 230
QGGPIIL QIENEYGNI YG AGK Y++WAA MA+ LDTG+PWVMC+Q+DAP+ II+T
Sbjct: 243 QGGPIILQQIENEYGNIQGNYGQAGKRYMQWAAQMAIGLDTGIPWVMCRQTDAPEEIIDT 302
Query: 231 CNGFYCDQFTPNSNNKPKMWTENWSGWFLSFGGAVPYRPVEDLAFAVARFFQRGGTFQNY 290
CN FYCD F PNS NKP +WTE+W GW+ +GGA+P+RP ED AFAVARF+QRGG+ QNY
Sbjct: 303 CNAFYCDGFKPNSYNKPTIWTEDWDGWYADWGGALPHRPAEDSAFAVARFYQRGGSLQNY 362
Query: 291 YMYHGGTNFDRTSGGPFISTSYDYDAPLDEYGLIRQPKWGHLKDLHKAIKLCEAALVATD 350
YMY GGTNF RT+GGP TSYDYDAP+DEYG++RQPKWGHLKDLH AIKLCE AL+A D
Sbjct: 363 YMYFGGTNFARTAGGPLQITSYDYDAPIDEYGILRQPKWGHLKDLHTAIKLCEPALIAVD 422
Query: 351 --PTYPSLGPNLEATVYKTG-----------SGLCSAFLANIGTNSDVTVKFNGNSYLLP 397
P Y LG EA VY TG + +CSAFLANI + +V G SY LP
Sbjct: 423 GSPQYIKLGSMQEAHVYSTGEVHTNGSMAGNAQICSAFLANIDEHKYASVWIFGKSYSLP 482
Query: 398 AWSVSILPDCKNVVFNTAKINSVTLV-------PSFS---RQSLQVAADSSDAIGSGWSY 447
WSVSILPDC+NV FNTA+I + T V PS S + S+ + S W
Sbjct: 483 PWSVSILPDCENVAFNTARIGAQTSVFTVESGSPSRSSRHKPSILSLTSGGPYLSSTWWT 542
Query: 448 INEPVGISKDDAFTKPGLLEQINTTADQSDYLWYSLSTNIKADEPLLEDGSKTV---LHV 504
E +G + F G+LE +N T D SDYLWY+ NI +D + SK V L +
Sbjct: 543 SKETIGTWGGNNFAVQGILEHLNVTKDISDYLWYTTRVNI-SDADVAFWSSKGVLPSLTI 601
Query: 505 QSLGHALHAFINGKLVGSGYGSSSNAKVTVDFPIALAPGKNTFDLLSLTVGLQNYGAFYE 564
+ F+NGKL GS G V++ PI L G N LLS VGLQNYGAF E
Sbjct: 602 DKIRDVARVFVNGKLAGSQVGHW----VSLKQPIQLVEGLNELTLLSEIVGLQNYGAFLE 657
Query: 565 KTGAGITGPVQLKGSGNGTNIDLSSQQWTYQTGLKGE--ELNFPSGSSTQWDSKSTLPKL 622
K GAG G V L G +G ++DL++ WTYQ GLKGE + P S+ +
Sbjct: 658 KDGAGFRGQVTLTGLSDG-DVDLTNSLWTYQVGLKGEFSMIYAPEKQGCAGWSRMQKDSV 716
Query: 623 QPLVWYKTTFDAPAG 637
QP WYK + G
Sbjct: 717 QPFTWYKNICNQSVG 731
>gi|10862896|emb|CAC13966.1| putative beta-galactosidase [Nicotiana tabacum]
Length = 715
Score = 648 bits (1671), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 333/726 (45%), Positives = 446/726 (61%), Gaps = 49/726 (6%)
Query: 27 VTYDHRAVVIGGKRRVLISGSIHYPRSTPEMWPDLIQKSKDGGLDVIETYVFWNLHEPVR 86
VTYD R++++ G+R +L SGSIHYPR PEMWPD+I+K+K+GGL++I+TYVFWN+HEPV+
Sbjct: 28 VTYDGRSMIVNGERELLFSGSIHYPRMPPEMWPDIIRKAKEGGLNLIQTYVFWNIHEPVQ 87
Query: 87 NQYNFEGRYDLVKFVKLVAEAGLYAHLRIGPYVCAEWNFGGFPLWLHFIPGIQFRTDNEP 146
Q+NFEG YD+VKF+K + E GLY LRIGPY+ AEWN GGFP WL +P I FR+ NEP
Sbjct: 88 GQFNFEGNYDVVKFIKTIGEQGLYVTLRIGPYIEAEWNQGGFPYWLREVPNITFRSYNEP 147
Query: 147 FKAEMQRFTAKIVDMMKQEKLYASQGGPIILSQIENEYGNIDSAYGAAGKSYIKWAAGMA 206
F M++++ ++D+MK+EKL+A QGGPII++QIENEY N+ AY GK Y++WAA MA
Sbjct: 148 FIHHMKKYSEMVIDLMKKEKLFAPQGGPIIMAQIENEYNNVQLAYRDNGKKYVEWAANMA 207
Query: 207 LSLDTGVPWVMCQQSDAPDPIINTCNGFYC-DQFT-PNSNNKPKMWTENWSGWFLSFGGA 264
L GVPW+MC+Q DAP +INTCNG +C D FT PN NKP +WTENW+ + +FG
Sbjct: 208 TGLYNGVPWIMCKQKDAPAQVINTCNGRHCADTFTGPNGPNKPSLWTENWTAQYRTFGDP 267
Query: 265 VPYRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNFDRTSGGPFISTSYDYDAPLDEYGLI 324
R ED+AF+VARFF + GT NYYMY+GGTN+ RT G F++T Y +APLDE+GL
Sbjct: 268 PSQRAAEDIAFSVARFFAKNGTLTNYYMYYGGTNYGRT-GSSFVTTRYYDEAPLDEFGLY 326
Query: 325 RQPKWGHLKDLHKAIKLCEAALVATDPTYPSLGPNLEATVYKTGSGLCSAFLANIGTNSD 384
R+PKW HL+DLH+A++L AL+ P+ + +LE TVY+ C+AFL N T
Sbjct: 327 REPKWSHLRDLHRALRLSRRALLWGTPSVQKINQHLEITVYEKPGTDCAAFLTNNHTTLP 386
Query: 385 VTVKFNGNSYLLPAWSVSILPDCKNVVFNTAKI----NSVTLVPSFSRQSLQVAADSSDA 440
T+KF G Y LP SVSILPDCK + NT I NS +PS ++L+
Sbjct: 387 ATIKFRGREYYLPEKSVSILPDCKLLSTNTQTIVSQHNSRNFLPSEKAKNLK-------- 438
Query: 441 IGSGWSYINEPVGISKDDAFTKPGLLEQINTTADQSDYLWYSLSTNI-KADEPLLEDGSK 499
W E V D + LE + T D SDY WYS S N + D P+ D
Sbjct: 439 ----WEMYQEKVPTISDLSLKNREPLELYSLTKDTSDYAWYSTSINFDRHDLPMRPD-IL 493
Query: 500 TVLHVQSLGHALHAFINGKLVGSGYGSSSNAKVTVDFPIALAPGKNTFDLLSLTVGLQNY 559
VL + S+GHAL AF+NG+ VG G+G++ P+ L PG NT +L+ TVG N
Sbjct: 494 PVLQIASMGHALSAFVNGEFVGFGHGNNIEKSFVFQKPVILKPGTNTISILAETVGFPNS 553
Query: 560 GAFYEKTGAGITGPVQLKGSGNGTNIDLSSQQWTYQTGLKGEELNF---PSGSSTQWDSK 616
GA+ EK AG G + ++G GT +D++ W ++ G+ GE+ +W +
Sbjct: 554 GAYMEKRFAGPRG-ITVQGLMAGT-LDITQNNWGHEVGVFGEKEQLFTEEGAKKVKW-TP 610
Query: 617 STLPKLQPLVWYKTTFDAPAGSEPVAIDFTGMGKGEAWVNGQSIGRYWPTYVSQNGGCTD 676
P + WYKT FDAP G+ PVA+ M KG WVNG S+GRYW +++S
Sbjct: 611 VNGPTKGAVTWYKTYFDAPEGNNPVALKMDKMQKGMMWVNGNSLGRYWSSFLSP------ 664
Query: 677 SCNYRGAYSSNKCLKNCGKPSQSLYHVPRSWLKSSGNTLVLFEEIGGDPTKISFVTKQLG 736
G+P+Q YH+PR++LK + N LV+FEE GG P I
Sbjct: 665 ----------------LGQPTQFEYHIPRAFLKPTNNLLVIFEETGGHPETIEVQIVNRD 708
Query: 737 SSLCSH 742
++L H
Sbjct: 709 TNLQHH 714
>gi|297724143|ref|NP_001174435.1| Os05g0428100 [Oryza sativa Japonica Group]
gi|75137607|sp|Q75HQ3.1|BGAL7_ORYSJ RecName: Full=Beta-galactosidase 7; Short=Lactase 7; Flags:
Precursor
gi|46391137|gb|AAS90664.1| putative beta-galactosidase [Oryza sativa Japonica Group]
gi|53981746|gb|AAV25023.1| putative beta-galactosidase [Oryza sativa Japonica Group]
gi|255676388|dbj|BAH93163.1| Os05g0428100 [Oryza sativa Japonica Group]
Length = 775
Score = 645 bits (1664), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 370/854 (43%), Positives = 490/854 (57%), Gaps = 131/854 (15%)
Query: 22 SFGANVTYDHRAVVIGGKRRVLISGSIHYPRSTPEMWPDLIQKSKDGGLDVIETYVFWNL 81
G +TYD RA+V+ G RR+ SG +HY RSTPEMWP LI K+K+GGLDVI+TYVFWN+
Sbjct: 24 ELGREITYDGRALVVSGARRMFFSGDMHYARSTPEMWPKLIAKAKNGGLDVIQTYVFWNV 83
Query: 82 HEPVRNQYNFEGRYDLVKFVKLVAEAGLYAHLRIGPYVCAEWNFGGFPLWLHFIPGIQFR 141
HEP++ QYNFEGRYDLVKF++ + GLY LRIGP+V AEW +GGFP WLH +P I FR
Sbjct: 84 HEPIQGQYNFEGRYDLVKFIREIQAQGLYVSLRIGPFVEAEWKYGGFPFWLHDVPSITFR 143
Query: 142 TDNEPFKAEMQRFTAKIVDMMKQEKLYASQGGPIILSQIENEYGNIDSAYGAAGKSYIKW 201
+DNEPFK MQ F KIV MMK E LY QGGPII+SQIENEY I+ A+GA+G Y++W
Sbjct: 144 SDNEPFKQHMQNFVTKIVTMMKHEGLYYPQGGPIIISQIENEYQMIEPAFGASGPRYVRW 203
Query: 202 AAGMALSLDTGVPWVMCQQSDAPDPIINTCNGFYCDQ--FTPNSNNKPKMWTENWSGW-- 257
AA MA+ L TGVPW+MC+Q+DAPDP+INTCNG C + PNS NKP +WTENW+
Sbjct: 204 AAAMAVGLQTGVPWMMCKQNDAPDPVINTCNGLICGETFVGPNSPNKPALWTENWTSRSN 263
Query: 258 --------FLSFGGAVPYRPVEDLAFAVARFFQR-GGTFQNYYMYHGGTNFDRTSGGPFI 308
+ +G R ED+AFAVA F R G+F +YYMYHGGTNF R + ++
Sbjct: 264 GQNNSAFSYPIYGNDTKLRAPEDIAFAVALFIARKKGSFVSYYMYHGGTNFGRFAAS-YV 322
Query: 309 STSYDYDAPLDEYGLIRQPKWGHLKDLHKAIKLCEAALVATDPTYPSLGPNLEATVYKTG 368
+TSY APLDEY
Sbjct: 323 TTSYYDGAPLDEYDF--------------------------------------------- 337
Query: 369 SGLCSAFLANIGTNSDVTVKFNGNSYLLPAWSVSILPDCKNVVFNTAKINSVTLVPSFSR 428
C AFL N ++ V+F S L S+S+L DC+NVVF TAK+N+
Sbjct: 338 --KCVAFLVNFDQHNTPKVEFRNISLELAPKSISVLSDCRNVVFETAKVNA--------- 386
Query: 429 QSLQVAADSSDAIGS-----GWSYINEPV--GISKDDAFTKPGLLEQINTTADQSDYLWY 481
Q + +++A+ S W EPV +SK +T L EQ+ TT D++DYLWY
Sbjct: 387 ---QHGSRTANAVQSLNDINNWKAFIEPVPQDLSK-STYTGNQLFEQLTTTKDETDYLWY 442
Query: 482 SLSTNIKADEPLLEDGSKTV-LHVQSLGHALHAFINGKLVGSGYGSSSNAK-VTVDFPIA 539
+S +A DG++ L+V+SL H LHAF+N + VGS +GS + + ++ ++
Sbjct: 443 IVSYKNRAS-----DGNQIAHLYVKSLAHILHAFVNNEYVGSVHGSHDGPRNIVLNTHMS 497
Query: 540 LAPGKNTFDLLSLTVGLQNYGAFYEKTGAGI--TGPVQLKGSGNGTNIDLSSQQWTYQTG 597
L G NT LLS+ VG + GA+ E+ GI G Q + + N DL W YQ G
Sbjct: 498 LKEGDNTISLLSVMVGSPDSGAYMERRTFGIQTVGIQQGQQPMHLLNNDL----WGYQVG 553
Query: 598 LKGEELNF---PSGSSTQWDSKSTLPKLQPLVWYKTTFDAPAGSEPVAIDFTGMGKGEAW 654
L GE+ + +S +W + L PL WYKTTF P G++ V ++ T MGKGE W
Sbjct: 554 LFGEKDSIYTQEGTNSVRWMDINNL-IYHPLTWYKTTFSTPPGNDAVTLNLTSMGKGEVW 612
Query: 655 VNGQSIGRYWPTYVSQNGGCTDSCNYRGAYSSNKCLKNCGKPSQSLYHVPRSWLKSSGNT 714
VNG+SIGRYW ++ + + G+PSQSLYH+PR +L N
Sbjct: 613 VNGESIGRYWVSFKAPS----------------------GQPSQSLYHIPRGFLTPKDNL 650
Query: 715 LVLFEEIGGDPTKISFVTKQLGSSLCSHVTDSHPLPVDMWGSDSKIQRKPGPVLSLECPN 774
LVL EE+GGDP +I+ T + +++C +V + P+ G K++ + C
Sbjct: 651 LVLVEEMGGDPLQITVNTMSV-TTVCGNVDEFSVPPLQSRGKVPKVR--------IWCQG 701
Query: 775 PNQVISSIKFASFGTPLGTCGSFSRGRCSSARSLSVVRQACVGSKSCSIGVSVNTF-GDP 833
N+ ISSI+FAS+G P+G C SF G C + S SVV+Q+C+G + CSI V F GDP
Sbjct: 702 GNR-ISSIEFASYGNPVGDCRSFRIGSCHAESSESVVKQSCIGRRGCSIPVMAAKFGGDP 760
Query: 834 CKGVMKSLAVEASC 847
C G+ KSL V A C
Sbjct: 761 CPGIQKSLLVVADC 774
>gi|357154419|ref|XP_003576777.1| PREDICTED: beta-galactosidase 12-like [Brachypodium distachyon]
Length = 835
Score = 644 bits (1662), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 347/838 (41%), Positives = 480/838 (57%), Gaps = 48/838 (5%)
Query: 21 TSFGANVTYDHRAVVIGGKRRVLISGSIHYPRSTPEMWPDLIQKSKDGGLDVIETYVFWN 80
T G V+YD R+++I GKR + SG+IHYPRS PEMWP L+ ++KDGGL+ IETYVFWN
Sbjct: 27 TKKGTVVSYDERSLMIDGKRDLFFSGAIHYPRSPPEMWPKLLDRAKDGGLNTIETYVFWN 86
Query: 81 LHEPVRNQYNFEGRYDLVKFVKLVAEAGLYAHLRIGPYVCAEWNFGGFPLWLHFIPGIQF 140
HEP +YNFEGR DL+KF+KL+ + +YA +RIGP++ AEWN GG P WL IP I F
Sbjct: 87 AHEPEPGKYNFEGRCDLIKFLKLIQDNDMYAVIRIGPFIQAEWNHGGLPYWLREIPHIIF 146
Query: 141 RTDNEPFKAEMQRFTAKIVDMMKQEKLYASQGGPIILSQIENEYGNIDSAYGAAGKSYIK 200
R +NEP+K EM++F IV +K ++ASQGGPIIL+QIENEYGNI + G Y++
Sbjct: 147 RANNEPYKKEMEKFVRFIVQKLKDADMFASQGGPIILAQIENEYGNIKKDHITDGDKYLE 206
Query: 201 WAAGMALSLDTGVPWVMCQQSDAPDPIINTCNGFYC-DQFTPNSNNKPKMWTENWSGWFL 259
WAA MALS + G+PW+MC+Q+ AP +I TCNG +C D +T NKP++WTENW+ F
Sbjct: 207 WAAEMALSTNIGIPWIMCKQTTAPGVVIPTCNGRHCGDTWTLRDKNKPRLWTENWTAQFR 266
Query: 260 SFGGAVPYRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNFDRTSGGPFISTSYDYDAPLD 319
+FG R ED+A++V RFF +GGT NYYMY+GGTNF RT G ++ T Y +AP+D
Sbjct: 267 AFGDQAAVRSAEDIAYSVLRFFAKGGTLVNYYMYYGGTNFGRT-GASYVLTGYYDEAPID 325
Query: 320 EYGLIRQPKWGHLKDLHKAIKLCEAALVATDPTYPSLGPNLEATVYKT-GSGLCSAFLAN 378
EYGL ++PK+GHL+DLHK IK A + ++ LG EA Y+ LC AF++N
Sbjct: 326 EYGLNKEPKFGHLRDLHKLIKSYHKAFLVGKQSFELLGHGYEAHNYELPEENLCLAFISN 385
Query: 379 IGTNSDVTVKFNGNSYLLPAWSVSILPDCKNVVFNTAKINSVTLVPSFSRQSLQVAADSS 438
T D TV F G Y +P+ SVSIL DC +VV+NT ++ S +S A +S+
Sbjct: 386 NNTGEDGTVMFRGKKYYIPSRSVSILADCNHVVYNTKRV-----FVQHSERSFHTADEST 440
Query: 439 DAIGSGWSYINEPVGISKDDAFTKPGLLEQINTTADQSDYLWYSLSTNIKADEPLLEDGS 498
+ W +EP+ K + LEQ N T D+SDYLWY+ S ++AD+
Sbjct: 441 K--NNVWEMYSEPIPRYKVTSVRTKEPLEQYNLTKDKSDYLWYTTSFRLEADDLPFRRDI 498
Query: 499 KTVLHVQSLGHALHAFINGKLVGSGYGSSSNAKVTVDFPIALAPGKNTFDLLSLTVGLQN 558
+ V+ V+S HA+ F+N GSG GS + + PI L G N LLS ++G+++
Sbjct: 499 RPVVQVKSSAHAMMGFVNDAFAGSGRGSKKDKGFLFEKPIDLRIGINHLALLSSSMGMKD 558
Query: 559 YGAFYEKTGAGITGPVQLKGSGNGTNIDLSSQQWTYQTGLKGE--ELNFPSGSST-QWDS 615
G + GI ++G GT +DL W ++ L GE E+ G T +W
Sbjct: 559 SGGELVEVKGGIQD-CMIQGLNTGT-LDLQGNGWGHKINLDGEDKEIYTEKGMGTVKWKP 616
Query: 616 KSTLPKLQPLVWYKTTFDAPAGSEPVAIDFTGMGKGEAWVNGQSIGRYWPTYVSQNGGCT 675
+ WY+ FD P G +PV +D + M KG +VNG+ +GRYW +Y +
Sbjct: 617 AEN---GHAVTWYRRYFDEPDGDDPVVLDMSSMSKGMIFVNGEGVGRYWTSYKTI----- 668
Query: 676 DSCNYRGAYSSNKCLKNCGKPSQSLYHVPRSWLKSSGNTLVLFEEIGGDPTKISFVTKQL 735
G PSQSLYH+PR +LKS N LV+FEE G P I T +
Sbjct: 669 -----------------AGLPSQSLYHIPRPFLKSKKNLLVVFEEEIGKPEGILIQTVRR 711
Query: 736 GSSLCSHVTDSHPLPVDMWGSD----SKIQRKPGPVLSLECPNPNQVISSIKFASFGTPL 791
+C +++ +P V W +D I L CP+ + I + FASFG P
Sbjct: 712 -DDICFLMSEHNPAQVKTWDADGGQIKLIAEDHSSRGILTCPH-KKTIEEVVFASFGNPE 769
Query: 792 GTCGSFSRGRCSSARSLSVVRQACVGSKSCSIGVSVNTFGDP--CKGVMKSLAVEASC 847
G CG+F+ G C + + V + C+G KSC + + +G C +LAV+ C
Sbjct: 770 GACGNFTAGTCHTPNAKEFVAKECLGKKSCVLPLIHTLYGADINCPTTTATLAVQVRC 827
>gi|22329242|ref|NP_195571.2| beta-galactosidase 14 [Arabidopsis thaliana]
gi|332661551|gb|AEE86951.1| beta-galactosidase 14 [Arabidopsis thaliana]
Length = 988
Score = 642 bits (1657), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 343/805 (42%), Positives = 481/805 (59%), Gaps = 52/805 (6%)
Query: 57 MWPDLIQKSKDGGLDVIETYVFWNLHEPVRNQYNFEGRYDLVKFVKLVAEAGLYAHLRIG 116
MWP +I K++ GGL+ I+TYVFWN+HEP + +Y+F+GR+DLVKF+KL+ E GLY LR+G
Sbjct: 1 MWPSIIDKARIGGLNTIQTYVFWNVHEPEQGKYDFKGRFDLVKFIKLIHEKGLYVTLRLG 60
Query: 117 PYVCAEWNFGGFPLWLHFIPGIQFRTDNEPFKAEMQRFTAKIVDMMKQEKLYASQGGPII 176
P++ AEWN GG P WL +P + FRT+NEPFK +R+ KI+ MMK+EKL+ASQGGPII
Sbjct: 61 PFIQAEWNHGGLPYWLREVPDVYFRTNNEPFKEHTERYVRKILGMMKEEKLFASQGGPII 120
Query: 177 LSQIENEYGNIDSAYGAAGKSYIKWAAGMALSLDTGVPWVMCQQSDAPDPIINTCNGFYC 236
L QIENEY + AY G+ YIKWAA + S++ G+PWVMC+Q+DAP +IN CNG +C
Sbjct: 121 LGQIENEYNAVQLAYKENGEKYIKWAANLVESMNLGIPWVMCKQNDAPGNLINACNGRHC 180
Query: 237 -DQFT-PNSNNKPKMWTENWSGWFLSFGGAVPYRPVEDLAFAVARFFQRGGTFQNYYMYH 294
D F PN ++KP +WTENW+ F FG R VED+AF+VAR+F + G+ NYYMYH
Sbjct: 181 GDTFPGPNRHDKPSLWTENWTTQFRVFGDPPTQRTVEDIAFSVARYFSKNGSHVNYYMYH 240
Query: 295 GGTNFDRTSGGPFISTSYDYDAPLDEYGLIRQPKWGHLKDLHKAIKLCEAALVATDPTYP 354
GGTNF RTS F++T Y DAPLDE+GL + PK+GHLK +H+A++LC+ AL
Sbjct: 241 GGTNFGRTSAH-FVTTRYYDDAPLDEFGLEKAPKYGHLKHVHRALRLCKKALFWGQLRAQ 299
Query: 355 SLGPNLEATVYKT-GSGLCSAFLANIGTNSDVTVKFNGNSYLLPAWSVSILPDCKNVVFN 413
+LGP+ E Y+ G+ +C+AFL+N T T+KF G Y+LP+ S+SILPDCK VV+N
Sbjct: 300 TLGPDTEVRYYEQPGTKVCAAFLSNNNTRDTNTIKFKGQDYVLPSRSISILPDCKTVVYN 359
Query: 414 TAKINSVTLVPSFSRQSLQVAADSSDAIGSGWSYINEPVGISKDDAFTKPGLLEQINTTA 473
TA+I V S + + +S + N P + D PG L + T
Sbjct: 360 TAQI-----VAQHSWRDFVKSEKTSKGLKFEMFSENIPSLLDGDSLI--PGELYYL--TK 410
Query: 474 DQSDYLWYSLSTNIKADEPLLEDGSKTVLHVQSLGHALHAFINGKLVGSGYGSSSNAKVT 533
D++DY WY+ S I D+ + G KT+L V SLGHAL ++NG+ G +G
Sbjct: 411 DKTDYAWYTTSVKIDEDDFPDQKGLKTILRVASLGHALIVYVNGEYAGKAHGRHEMKSFE 470
Query: 534 VDFPIALAPGKNTFDLLSLTVGLQNYGAFYEKTGAGITGPVQLKGSGNGTNIDLSSQQWT 593
P+ G N +L + GL + G++ E AG + + G +GT + +W
Sbjct: 471 FAKPVNFKTGDNRISILGVLTGLPDSGSYMEHRFAGPRA-ISIIGLKSGTRDLTENNEWG 529
Query: 594 YQTGLKGE--ELNFPSGS-STQWDSKSTLPKLQPLVWYKTTFDAPAGSEPVAIDFTGMGK 650
+ GL+GE E+ GS +W+ K +PL WYKT F+ P G VAI MGK
Sbjct: 530 HLAGLEGEKKEVYTEEGSKKVKWEKDG---KRKPLTWYKTYFETPEGVNAVAIRMKAMGK 586
Query: 651 GEAWVNGQSIGRYWPTYVSQNGGCTDSCNYRGAYSSNKCLKNCGKPSQSLYHVPRSWLK- 709
G WVNG +GRYW +++S G+P+Q+ YH+PRS++K
Sbjct: 587 GLIWVNGIGVGRYWMSFLSP----------------------LGEPTQTEYHIPRSFMKG 624
Query: 710 -SSGNTLVLFEEIGGDPTK-ISFVTKQLGSSLCSHVTDSHPLPVDMWGSDSK--IQRKPG 765
N LV+ EE G + I FV ++CS+V + +P+ V W + + R
Sbjct: 625 EKKKNMLVILEEEPGVKLESIDFVLVNR-DTICSNVGEDYPVSVKSWKREGPKIVSRSKD 683
Query: 766 PVLS--LECPNPNQVISSIKFASFGTPLGTCGSFSRGRCSSARSLSVVRQACVGSKSCSI 823
L + CP P + + ++FASFG P GTCG+F+ G+CS+++S VV + C+G CSI
Sbjct: 684 MRLKAVMRCP-PEKQMVEVQFASFGDPTGTCGNFTMGKCSASKSKEVVEKECLGRNYCSI 742
Query: 824 GVSVNTFGDP-CKGVMKSLAVEASC 847
V+ TFGD C ++K+LAV+ C
Sbjct: 743 VVARETFGDKGCPEIVKTLAVQVKC 767
>gi|147843186|emb|CAN82672.1| hypothetical protein VITISV_014349 [Vitis vinifera]
Length = 710
Score = 642 bits (1656), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 343/716 (47%), Positives = 439/716 (61%), Gaps = 75/716 (10%)
Query: 24 GANVTYDHRAVVIGGKRRVLISGSIHYPRSTPEMWPDLIQKSKDGGLDVIETYVFWNLHE 83
GA VTYD R+++I G R++L SGSIHYPRSTP+MW LI K+K+GG+DVI+TYVFWN HE
Sbjct: 23 GAQVTYDGRSLIIDGHRKILFSGSIHYPRSTPQMWASLIAKAKEGGVDVIQTYVFWNRHE 82
Query: 84 PVRNQYNFEGRYDLVKFVKLVAEAGLYAHLRIGPYVCAEWNFGGFPLWLHFIPGIQFRTD 143
P QY+F GRYDL KF+K + GLYA LRIGP++ +EW++GG P WLH + GI +RTD
Sbjct: 83 PQPGQYDFNGRYDLXKFIKEIQAQGLYACLRIGPFIESEWSYGGLPFWLHDVHGIVYRTD 142
Query: 144 NEPFKAEMQRFTAKIVDMMKQEKLYASQGGPIILSQIENEYGNIDSAYGAAGKSYIKWAA 203
NEPFK MQ FT KIV++MK E LYASQGGPIILSQIENEY NI++A+ G SY++WAA
Sbjct: 143 NEPFKFYMQNFTTKIVNLMKSEGLYASQGGPIILSQIENEYQNIEAAFNEKGPSYVRWAA 202
Query: 204 GMALSLDTGVPWVMCQQSDAPDPIINTCNGFYCDQ-FT-PNSNNKPKMWTENWSGWFLSF 261
MA+ L TGVPWVMC+QSDAPDP+INTCNG C Q FT PNS NKP MWTENW+ ++ F
Sbjct: 203 KMAVELQTGVPWVMCKQSDAPDPVINTCNGMRCGQTFTGPNSPNKPSMWTENWTSFYEVF 262
Query: 262 GGAVPYRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNFDRTSGGPFISTSYDYDAPLDEY 321
GG R ED+AF VA F R G++ NYYM
Sbjct: 263 GGETYLRSAEDIAFHVALFIARNGSYVNYYM----------------------------V 294
Query: 322 GLIRQPKWGHLKDLHKAIKLCEAALVATDPTYPSLGPNLEATVYKTGSGLCSAFLANIGT 381
LIRQPKWGHLK+LH AI LC L+ + SLG EA V++ G C AFL N
Sbjct: 295 SLIRQPKWGHLKELHAAITLCSTPLLNGVQSNISLGQLQEAYVFQEEMGGCVAFLVNNDE 354
Query: 382 NSDVTVKFNGNSYLLPAWSVSILPDCKNVVFNTAKINSVTLVPSFSRQSLQVAADSSDAI 441
++ TV F S L S+SILPDCKNV+FNTAKIN+ + + ++ S DA+
Sbjct: 355 GNNSTVLFQNVSIELLPKSISILPDCKNVIFNTAKINTGY------NERITTSSQSFDAV 408
Query: 442 GSGWSYINEPVGISKDDAFTKPGLLEQINTTADQSDYLWYS--LSTNIKADEPLLEDGSK 499
W + + D + +LE +N T D+SDYLWY+ N EPL
Sbjct: 409 DR-WEEYKDAIPNFLDTSLKSNMILEHMNMTKDESDYLWYTFRFQPNSSCTEPL------ 461
Query: 500 TVLHVQSLGHALHAFINGKLVGSGYGSSSNAKVTVDFPIALAPGKNTFDLLSLTVGLQNY 559
LH++SL HA+HAF+N VG+ +GS T PI+L N +LS+ VG +
Sbjct: 462 --LHIESLAHAVHAFVNNIYVGATHGSHDMKGFTFKSPISLNNEMNNISILSVMVGFPDS 519
Query: 560 GAFYEKTGAGITGPVQLKGSGNGTNIDLSSQQWTYQTGLKGEELNF---PSGSSTQWDSK 616
GA+ E AG+T V+++ + G D ++ W YQ GL GE+L+ + S+ +W K
Sbjct: 520 GAYLESRFAGLTR-VEIQCTEKGI-YDFANYTWGYQVGLSGEKLHIYKEENLSNVEW-RK 576
Query: 617 STLPKLQPLVWYKTTFDAPAGSEPVAIDFTGMGKGEAWVNGQSIGRYWPTYVSQNGGCTD 676
+ + QPL WYK F+ P+G +PVA++ + MGKGEAWVNGQSIGRYW ++ +
Sbjct: 577 TEISTNQPLTWYKIVFNTPSGDDPVALNLSTMGKGEAWVNGQSIGRYWVSFHNSK----- 631
Query: 677 SCNYRGAYSSNKCLKNCGKPSQSLYHVPRSWLKSSGNTLVLFEEIGGDPTKISFVT 732
G PSQ+LYHVPR++LK+S N LVL EE GDP IS T
Sbjct: 632 -----------------GDPSQTLYHVPRAFLKTSENLLVLLEEANGDPLHISLET 670
>gi|414888321|tpg|DAA64335.1| TPA: hypothetical protein ZEAMMB73_578897 [Zea mays]
Length = 837
Score = 639 bits (1649), Expect = e-180, Method: Compositional matrix adjust.
Identities = 338/846 (39%), Positives = 480/846 (56%), Gaps = 48/846 (5%)
Query: 13 WGFVVLATTSFGANVTYDHRAVVIGGKRRVLISGSIHYPRSTPEMWPDLIQKSKDGGLDV 72
W T G+ VTYD R+++I GKR + SG+IHYPRS PE+WP LI+++K+GGL+
Sbjct: 22 WAAAEWNLTKKGSVVTYDGRSLMIDGKRDLFFSGAIHYPRSPPEVWPKLIERAKEGGLNT 81
Query: 73 IETYVFWNLHEPVRNQYNFEGRYDLVKFVKLVAEAGLYAHLRIGPYVCAEWNFGGFPLWL 132
IETY+FWN HEP +YNFEGR+DL+K++K++ E +YA +RIGP++ AEWN GG P WL
Sbjct: 82 IETYIFWNAHEPEPGKYNFEGRFDLIKYLKMIQEHDMYAIVRIGPFIQAEWNHGGLPYWL 141
Query: 133 HFIPGIQFRTDNEPFKAEMQRFTAKIVDMMKQEKLYASQGGPIILSQIENEYGNIDSAYG 192
I I FR +N+P+K EM++F IV +K +L+ASQGGPIIL+QIENEYGNI +
Sbjct: 142 REIDHIIFRANNDPYKKEMEKFVRFIVQKLKDAELFASQGGPIILTQIENEYGNIKKDHA 201
Query: 193 AAGKSYIKWAAGMALSLDTGVPWVMCQQSDAPDPIINTCNGFYC-DQFTPNSNNKPKMWT 251
G Y++WAA MALS TGVPW+MC+QS AP +I TCNG +C D +T NKP +WT
Sbjct: 202 TDGDKYLEWAAQMALSTQTGVPWIMCKQSSAPGEVIPTCNGRHCGDTWTLRDKNKPMLWT 261
Query: 252 ENWSGWFLSFGGAVPYRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNFDRTSGGPFISTS 311
ENW+ F ++G V R ED+A+AV RFF +GG+ NYYMYHGGTNF RT G ++ T
Sbjct: 262 ENWTQQFRAYGDQVAMRSAEDIAYAVLRFFAKGGSLVNYYMYHGGTNFGRT-GASYVLTG 320
Query: 312 YDYDAPLDEYGLIRQPKWGHLKDLHKAIKLCEAALVATDPTYPSLGPNLEATVYKT-GSG 370
Y +AP+DEYG+ ++PK+GHL+DLH I+ + A + + LG EA +++
Sbjct: 321 YYDEAPMDEYGMYKEPKFGHLRDLHNVIRSYQKAFLLGKHSSEILGHGYEAHIFELPEEN 380
Query: 371 LCSAFLANIGTNSDVTVKFNGNSYLLPAWSVSILPDCKNVVFNTAKINSVTLVPSFSRQS 430
LC +FL+N T D TV F G + +P+ SVSIL CKNVV+NT ++ + +S
Sbjct: 381 LCLSFLSNNNTGEDGTVIFRGEKHYVPSRSVSILAGCKNVVYNTKRV-----FVQHNERS 435
Query: 431 LQVAADSSDAIGSGWSYINEPVGISKDDAFTKPGLLEQINTTADQSDYLWYSLSTNIKAD 490
+ +S + W +E + +D LEQ N T D SDYLWY+ S +++D
Sbjct: 436 YHTSEVTSK--NNQWEMYSEKIPKYRDTKVRMKEPLEQFNQTKDASDYLWYTTSFRLESD 493
Query: 491 EPLLEDGSKTVLHVQSLGHALHAFINGKLVGSGYGSSSNAKVTVDFPIALAPGKNTFDLL 550
+ + + VL V+S H++ F N VG GS + P+ L G N LL
Sbjct: 494 DLPFRNDIRPVLQVKSSAHSMMGFANDAFVGCARGSKQVKGFMFEKPVDLKVGVNHVVLL 553
Query: 551 SLTVGLQNYGAFYEKTGAGITGPVQLKGSGNGTNIDLSSQQWTYQTGLKGEELNFPSGS- 609
S T+G+++ G + +GI + ++G GT +DL W ++ L+GE+ S
Sbjct: 554 SSTMGMKDSGGELAEVKSGIQECL-IQGLNTGT-LDLQVNGWGHKAALEGEDKEIYSEKG 611
Query: 610 --STQWDSKSTLPKLQPLVWYKTTFDAPAGSEPVAIDFTGMGKGEAWVNGQSIGRYWPTY 667
QW + WYK FD P G +PV +D + M KG +VNG+ +GRYW +Y
Sbjct: 612 VGKVQWKPAEN---GRAATWYKRYFDEPDGDDPVVLDMSSMDKGMIFVNGEGVGRYWVSY 668
Query: 668 VSQNGGCTDSCNYRGAYSSNKCLKNCGKPSQSLYHVPRSWLKSSGNTLVLFEEIGGDPTK 727
+ G PSQ+LYH+PR +LKS N LV+FEE G P
Sbjct: 669 RTL----------------------AGTPSQALYHIPRPFLKSKDNLLVVFEEEMGKPDG 706
Query: 728 ISFVTKQLGSSLCSHVTDSHPLPVDMWGSDSK----IQRKPGPVLSLECPNPNQVISSIK 783
I V +C +++ +P + W +D I +L CP P + I +
Sbjct: 707 I-LVQTVTRDDICLFISEHNPGQIKTWDTDGDKIKLIAEDHSRRGTLMCP-PEKTIQEVV 764
Query: 784 FASFGTPLGTCGSFSRGRCSSARSLSVVRQACVGSKSCSIGVSVNTFGDP--CKGVMKSL 841
FASFG P G CG+F+ G C + + +V + C+G SC + V +G C+ +L
Sbjct: 765 FASFGNPEGMCGNFTVGTCHTPNAKQIVEKECLGKPSCMLPVDHTVYGADINCQSTTATL 824
Query: 842 AVEASC 847
V+ C
Sbjct: 825 GVQVRC 830
>gi|222642000|gb|EEE70132.1| hypothetical protein OsJ_30164 [Oryza sativa Japonica Group]
Length = 838
Score = 635 bits (1637), Expect = e-179, Method: Compositional matrix adjust.
Identities = 336/836 (40%), Positives = 477/836 (57%), Gaps = 44/836 (5%)
Query: 21 TSFGANVTYDHRAVVIGGKRRVLISGSIHYPRSTPEMWPDLIQKSKDGGLDVIETYVFWN 80
T G V+YD R+++I GKR + SG+IHYPRS PEMW L++ +K GGL+ IETYVFWN
Sbjct: 30 TKKGTVVSYDERSLMIDGKRDLFFSGAIHYPRSPPEMWDKLVKTAKMGGLNTIETYVFWN 89
Query: 81 LHEPVRNQYNFEGRYDLVKFVKLVAEAGLYAHLRIGPYVCAEWNFGGFPLWLHFIPGIQF 140
HEP +Y FEGR+DL++F+ ++ + +YA +RIGP++ AEWN GG P WL I I F
Sbjct: 90 GHEPEPGKYYFEGRFDLIRFLNVIKDNDMYAIVRIGPFIQAEWNHGGLPYWLREIGHIIF 149
Query: 141 RTDNEPFKAEMQRFTAKIVDMMKQEKLYASQGGPIILSQIENEYGNIDSAYGAAGKSYIK 200
R +NEPFK EM++F IV +K +++A QGGPIILSQIENEYGNI G Y++
Sbjct: 150 RANNEPFKREMEKFVRFIVQKLKDAEMFAPQGGPIILSQIENEYGNIKKDRKVEGDKYLE 209
Query: 201 WAAGMALSLDTGVPWVMCQQSDAPDPIINTCNGFYC-DQFTPNSNNKPKMWTENWSGWFL 259
WAA MA+S GVPWVMC+QS AP +I TCNG +C D +T NKP++WTENW+ F
Sbjct: 210 WAAEMAISTGIGVPWVMCKQSIAPGEVIPTCNGRHCGDTWTLLDKNKPRLWTENWTAQFR 269
Query: 260 SFGGAVPYRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNFDRTSGGPFISTSYDYDAPLD 319
+FG + R ED+A+AV RFF +GGT NYYMYHGGTNF RT G ++ T Y +AP+D
Sbjct: 270 TFGDQLAQRSAEDIAYAVLRFFAKGGTLVNYYMYHGGTNFGRT-GASYVLTGYYDEAPMD 328
Query: 320 EYGLIRQPKWGHLKDLHKAIKLCEAALVATDPTYPSLGPNLEATVYKT-GSGLCSAFLAN 378
EYG+ ++PK+GHL+DLH IK A + ++ LG EA Y+ LC +FL+N
Sbjct: 329 EYGMCKEPKFGHLRDLHNVIKSYHKAFLWGKQSFEILGHGYEAHNYELPEDKLCLSFLSN 388
Query: 379 IGTNSDVTVKFNGNSYLLPAWSVSILPDCKNVVFNTAKINSVTLVPSFSRQSLQVAADSS 438
T D TV F G + +P+ SVSIL DCK VV+NT ++ S +S ++S
Sbjct: 389 NNTGEDGTVVFRGEKFYVPSRSVSILADCKTVVYNTKRV-----FVQHSERSFHTTDETS 443
Query: 439 DAIGSGWSYINEPVGISKDDAFTKPGLLEQINTTADQSDYLWYSLSTNIKADEPLLEDGS 498
+ W +E + + LEQ N T D SDYLWY+ S +++D+
Sbjct: 444 K--NNVWEMYSEAIPKFRKTKVRTKQPLEQYNQTKDTSDYLWYTTSFRLESDDLPFRRDI 501
Query: 499 KTVLHVQSLGHALHAFINGKLVGSGYGSSSNAKVTVDFPIALAPGKNTFDLLSLTVGLQN 558
+ V+ ++S HA+ F N VG+G GS + P+ L G N +LS ++G+++
Sbjct: 502 RPVIQIKSTAHAMIGFANDAFVGTGRGSKREKSFVFEKPMDLRVGINHIAMLSSSMGMKD 561
Query: 559 YGAFYEKTGAGITGPVQLKGSGNGTNIDLSSQQWTYQTGLKGEELN-FPSGSSTQWDSKS 617
G + GI V ++G GT +DL W ++ L+GE+ + Q+ K
Sbjct: 562 SGGELVEVKGGIQDCV-VQGLNTGT-LDLQGNGWGHKARLEGEDKEIYTEKGMAQFQWKP 619
Query: 618 TLPKLQPLVWYKTTFDAPAGSEPVAIDFTGMGKGEAWVNGQSIGRYWPTYVSQNGGCTDS 677
L P+ WYK FD P G +P+ +D + M KG +VNG+ IGRYW ++++
Sbjct: 620 AENDL-PITWYKRYFDEPDGDDPIVVDMSSMSKGMIYVNGEGIGRYWTSFITL------- 671
Query: 678 CNYRGAYSSNKCLKNCGKPSQSLYHVPRSWLKSSGNTLVLFEEIGGDPTKISFVTKQLGS 737
G PSQS+YH+PR++LK GN L++FEE G P I T +
Sbjct: 672 ---------------AGHPSQSVYHIPRAFLKPKGNLLIIFEEELGKPGGILIQTVRR-D 715
Query: 738 SLCSHVTDSHPLPVDMWGSD----SKIQRKPGPVLSLECPNPNQVISSIKFASFGTPLGT 793
+C +++ +P + W SD I +L CP P + I + FASFG P G
Sbjct: 716 DICVFISEHNPAQIKTWESDGGQIKLIAEDTSTRGTLNCP-PKRTIQEVVFASFGNPEGA 774
Query: 794 CGSFSRGRCSSARSLSVVRQACVGSKSCSIGVSVNTFGDP--CKGVMKSLAVEASC 847
CG+F+ G C + + ++V + C+G +SC + V +G C +LAV+ C
Sbjct: 775 CGNFTAGTCHTPDAKAIVEKECLGKESCVLPVVNTVYGADINCPATTATLAVQVRC 830
>gi|152013365|sp|Q0IZZ8.2|BGL12_ORYSJ RecName: Full=Beta-galactosidase 12; Short=Lactase 12; Flags:
Precursor
Length = 911
Score = 630 bits (1626), Expect = e-178, Method: Compositional matrix adjust.
Identities = 335/833 (40%), Positives = 476/833 (57%), Gaps = 44/833 (5%)
Query: 21 TSFGANVTYDHRAVVIGGKRRVLISGSIHYPRSTPEMWPDLIQKSKDGGLDVIETYVFWN 80
T G V+YD R+++I GKR + SG+IHYPRS PEMW L++ +K GGL+ IETYVFWN
Sbjct: 30 TKKGTVVSYDERSLMIDGKRDLFFSGAIHYPRSPPEMWDKLVKTAKMGGLNTIETYVFWN 89
Query: 81 LHEPVRNQYNFEGRYDLVKFVKLVAEAGLYAHLRIGPYVCAEWNFGGFPLWLHFIPGIQF 140
HEP +Y FEGR+DL++F+ ++ + +YA +RIGP++ AEWN GG P WL I I F
Sbjct: 90 GHEPEPGKYYFEGRFDLIRFLNVIKDNDMYAIVRIGPFIQAEWNHGGLPYWLREIGHIIF 149
Query: 141 RTDNEPFKAEMQRFTAKIVDMMKQEKLYASQGGPIILSQIENEYGNIDSAYGAAGKSYIK 200
R +NEPFK EM++F IV +K +++A QGGPIILSQIENEYGNI G Y++
Sbjct: 150 RANNEPFKREMEKFVRFIVQKLKDAEMFAPQGGPIILSQIENEYGNIKKDRKVEGDKYLE 209
Query: 201 WAAGMALSLDTGVPWVMCQQSDAPDPIINTCNGFYC-DQFTPNSNNKPKMWTENWSGWFL 259
WAA MA+S GVPWVMC+QS AP +I TCNG +C D +T NKP++WTENW+ F
Sbjct: 210 WAAEMAISTGIGVPWVMCKQSIAPGEVIPTCNGRHCGDTWTLLDKNKPRLWTENWTAQFR 269
Query: 260 SFGGAVPYRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNFDRTSGGPFISTSYDYDAPLD 319
+FG + R ED+A+AV RFF +GGT NYYMYHGGTNF RT G ++ T Y +AP+D
Sbjct: 270 TFGDQLAQRSAEDIAYAVLRFFAKGGTLVNYYMYHGGTNFGRT-GASYVLTGYYDEAPMD 328
Query: 320 EYGLIRQPKWGHLKDLHKAIKLCEAALVATDPTYPSLGPNLEATVYKT-GSGLCSAFLAN 378
EYG+ ++PK+GHL+DLH IK A + ++ LG EA Y+ LC +FL+N
Sbjct: 329 EYGMCKEPKFGHLRDLHNVIKSYHKAFLWGKQSFEILGHGYEAHNYELPEDKLCLSFLSN 388
Query: 379 IGTNSDVTVKFNGNSYLLPAWSVSILPDCKNVVFNTAKINSVTLVPSFSRQSLQVAADSS 438
T D TV F G + +P+ SVSIL DCK VV+NT ++ S +S ++S
Sbjct: 389 NNTGEDGTVVFRGEKFYVPSRSVSILADCKTVVYNTKRV-----FVQHSERSFHTTDETS 443
Query: 439 DAIGSGWSYINEPVGISKDDAFTKPGLLEQINTTADQSDYLWYSLSTNIKADEPLLEDGS 498
+ W +E + + LEQ N T D SDYLWY+ S +++D+
Sbjct: 444 K--NNVWEMYSEAIPKFRKTKVRTKQPLEQYNQTKDTSDYLWYTTSFRLESDDLPFRRDI 501
Query: 499 KTVLHVQSLGHALHAFINGKLVGSGYGSSSNAKVTVDFPIALAPGKNTFDLLSLTVGLQN 558
+ V+ ++S HA+ F N VG+G GS + P+ L G N +LS ++G+++
Sbjct: 502 RPVIQIKSTAHAMIGFANDAFVGTGRGSKREKSFVFEKPMDLRVGINHIAMLSSSMGMKD 561
Query: 559 YGAFYEKTGAGITGPVQLKGSGNGTNIDLSSQQWTYQTGLKGEELN-FPSGSSTQWDSKS 617
G + GI V ++G GT +DL W ++ L+GE+ + Q+ K
Sbjct: 562 SGGELVEVKGGIQDCV-VQGLNTGT-LDLQGNGWGHKARLEGEDKEIYTEKGMAQFQWKP 619
Query: 618 TLPKLQPLVWYKTTFDAPAGSEPVAIDFTGMGKGEAWVNGQSIGRYWPTYVSQNGGCTDS 677
L P+ WYK FD P G +P+ +D + M KG +VNG+ IGRYW ++++
Sbjct: 620 AENDL-PITWYKRYFDEPDGDDPIVVDMSSMSKGMIYVNGEGIGRYWTSFITL------- 671
Query: 678 CNYRGAYSSNKCLKNCGKPSQSLYHVPRSWLKSSGNTLVLFEEIGGDPTKISFVTKQLGS 737
G PSQS+YH+PR++LK GN L++FEE G P I T +
Sbjct: 672 ---------------AGHPSQSVYHIPRAFLKPKGNLLIIFEEELGKPGGILIQTVRR-D 715
Query: 738 SLCSHVTDSHPLPVDMWGSD----SKIQRKPGPVLSLECPNPNQVISSIKFASFGTPLGT 793
+C +++ +P + W SD I +L CP P + I + FASFG P G
Sbjct: 716 DICVFISEHNPAQIKTWESDGGQIKLIAEDTSTRGTLNCP-PKRTIQEVVFASFGNPEGA 774
Query: 794 CGSFSRGRCSSARSLSVVRQACVGSKSCSIGVSVNTFGDP--CKGVMKSLAVE 844
CG+F+ G C + + ++V + C+G +SC + V +G C +LAV+
Sbjct: 775 CGNFTAGTCHTPDAKAIVEKECLGKESCVLPVVNTVYGADINCPATTATLAVQ 827
>gi|238481152|ref|NP_001154292.1| beta-galactosidase 14 [Arabidopsis thaliana]
gi|332661552|gb|AEE86952.1| beta-galactosidase 14 [Arabidopsis thaliana]
Length = 1052
Score = 630 bits (1625), Expect = e-178, Method: Compositional matrix adjust.
Identities = 343/811 (42%), Positives = 479/811 (59%), Gaps = 60/811 (7%)
Query: 53 STPEMWPDLIQKSKDGGLDVIETYVFWNLHEPVRNQYNFEGRYDLVKFVKLVAEAGLYAH 112
S MWP +I K++ GGL+ I+TYVFWN+HEP + +Y+F+GR+DLVKF+KL+ E GLY
Sbjct: 65 SRKHMWPSIIDKARIGGLNTIQTYVFWNVHEPEQGKYDFKGRFDLVKFIKLIHEKGLYVT 124
Query: 113 LRIGPYVCAEWNFGGFPLWLHFIPGIQFRTDNEPFKAEMQRFTAKIVDMMKQEKLYASQG 172
LR+GP++ AEWN GG P WL +P + FRT+NEPFK +R+ KI+ MMK+EKL+ASQG
Sbjct: 125 LRLGPFIQAEWNHGGLPYWLREVPDVYFRTNNEPFKEHTERYVRKILGMMKEEKLFASQG 184
Query: 173 GPIILSQIENEYGNIDSAYGAAGKSYIKWAAGMALSLDTGVPWVMCQQSDAPDPIINTCN 232
GPIIL QIENEY + AY G+ YIKWAA + S++ G+PWVMC+Q+DAP +IN CN
Sbjct: 185 GPIILGQIENEYNAVQLAYKENGEKYIKWAANLVESMNLGIPWVMCKQNDAPGNLINACN 244
Query: 233 GFYC-DQFT-PNSNNKPKMWTENWSGWFLSFGGAVPYRPVEDLAFAVARFFQRGGTFQNY 290
G +C D F PN ++KP +WTENW+ F FG R VED+AF+VAR+F + G+ NY
Sbjct: 245 GRHCGDTFPGPNRHDKPSLWTENWTTQFRVFGDPPTQRTVEDIAFSVARYFSKNGSHVNY 304
Query: 291 YMYHGGTNFDRTSGGPFISTSYDYDAPLDEYGLIRQPKWGHLKDLHKAIKLCEAALVATD 350
YMYHGGTNF RTS F++T Y DAPLDE+GL + PK+GHLK +H+A++LC+ AL
Sbjct: 305 YMYHGGTNFGRTSAH-FVTTRYYDDAPLDEFGLEKAPKYGHLKHVHRALRLCKKALFWGQ 363
Query: 351 PTYPSLGPNLEATVYKT-GSGLCSAFLANIGTNSDVTVKFNGNSYLLPAWSVSILPDCKN 409
+LGP+ E Y+ G+ +C+AFL+N T T+KF G Y+LP+ S+SILPDCK
Sbjct: 364 LRAQTLGPDTEVRYYEQPGTKVCAAFLSNNNTRDTNTIKFKGQDYVLPSRSISILPDCKT 423
Query: 410 VVFNTAKINSVTLVPSFSRQSLQVAADSSDAIGSGWSYINEPVGISKDDAFTKPGLLEQI 469
VV+NTA+I V S + + +S + N P + D PG L +
Sbjct: 424 VVYNTAQI-----VAQHSWRDFVKSEKTSKGLKFEMFSENIPSLLDGDSLI--PGELYYL 476
Query: 470 NTTADQSDYLWYSLSTNIKADEPLLED--GSKTVLHVQSLGHALHAFINGKLVGSGYGSS 527
T D++DY +K DE D G KT+L V SLGHAL ++NG+ G +G
Sbjct: 477 --TKDKTDY------ACVKIDEDDFPDQKGLKTILRVASLGHALIVYVNGEYAGKAHGRH 528
Query: 528 SNAKVTVDFPIALAPGKNTFDLLSLTVGLQNYGAFYEKTGAGITGPVQLKGSGNGTNIDL 587
P+ G N +L + GL + G++ E AG + + G +GT
Sbjct: 529 EMKSFEFAKPVNFKTGDNRISILGVLTGLPDSGSYMEHRFAGPRA-ISIIGLKSGTRDLT 587
Query: 588 SSQQWTYQTGLKGE--ELNFPSGS-STQWDSKSTLPKLQPLVWYKTTFDAPAGSEPVAID 644
+ +W + GL+GE E+ GS +W+ K +PL WYKT F+ P G VAI
Sbjct: 588 ENNEWGHLAGLEGEKKEVYTEEGSKKVKWEKDG---KRKPLTWYKTYFETPEGVNAVAIR 644
Query: 645 FTGMGKGEAWVNGQSIGRYWPTYVSQNGGCTDSCNYRGAYSSNKCLKNCGKPSQSLYHVP 704
MGKG WVNG +GRYW +++S G+P+Q+ YH+P
Sbjct: 645 MKAMGKGLIWVNGIGVGRYWMSFLSP----------------------LGEPTQTEYHIP 682
Query: 705 RSWLK--SSGNTLVLFEEIGGDPTK-ISFVTKQLGSSLCSHVTDSHPLPVDMWGSDSK-- 759
RS++K N LV+ EE G + I FV ++CS+V + +P+ V W +
Sbjct: 683 RSFMKGEKKKNMLVILEEEPGVKLESIDFVLVNR-DTICSNVGEDYPVSVKSWKREGPKI 741
Query: 760 IQRKPGPVLS--LECPNPNQVISSIKFASFGTPLGTCGSFSRGRCSSARSLSVVRQACVG 817
+ R L + CP P + + ++FASFG P GTCG+F+ G+CS+++S VV + C+G
Sbjct: 742 VSRSKDMRLKAVMRCP-PEKQMVEVQFASFGDPTGTCGNFTMGKCSASKSKEVVEKECLG 800
Query: 818 SKSCSIGVSVNTFGDP-CKGVMKSLAVEASC 847
CSI V+ TFGD C ++K+LAV+ C
Sbjct: 801 RNYCSIVVARETFGDKGCPEIVKTLAVQVKC 831
>gi|57283683|emb|CAG30731.1| beta-galactosidase precursor [Triticum monococcum]
Length = 839
Score = 628 bits (1620), Expect = e-177, Method: Compositional matrix adjust.
Identities = 338/841 (40%), Positives = 482/841 (57%), Gaps = 54/841 (6%)
Query: 21 TSFGANVTYDHRAVVIGGKRRVLISGSIHYPRSTPEMWPDLIQKSKDGGLDVIETYVFWN 80
T G VTYD +++I G+R + SG+IHYPRS +MWP L++ +K+GGL+ IETYVFWN
Sbjct: 32 TKKGTTVTYDKYSLMIDGRRELFFSGAIHYPRSPTQMWPKLLKTAKEGGLNTIETYVFWN 91
Query: 81 LHEPVRNQYNFEGRYDLVKFVKLVAEAGLYAHLRIGPYVCAEWNFGGFPLWLHFIPGIQF 140
HEP ++NFEGR D++KF+KL+ G+YA +RIGP++ EWN G P WL IP I F
Sbjct: 92 AHEPEPGKFNFEGRNDMIKFLKLIQSFGMYAIVRIGPFIQGEWNHGALPYWLREIPHIIF 151
Query: 141 RTDNEPFKAEMQRFTAKIVDMMKQEKLYASQGGPIILSQIENEYGNIDSAYGAAGKSYIK 200
R +NEP+K EM++F IV M+K E L+ASQGG +IL+QIENEYGNI + G Y++
Sbjct: 152 RANNEPYKREMEKFVRFIVQMLKDENLFASQGGNVILAQIENEYGNIKKDHITEGDKYLE 211
Query: 201 WAAGMALSLDTGVPWVMCQQSDAPDPIINTCNGFYC-DQFTPNSNNKPKMWTENWSGWFL 259
WAA MA+S + GVPW+MC+QS AP +I TCNG +C D + NKP +WTENW+ F
Sbjct: 212 WAAEMAISTNIGVPWIMCKQSTAPGVVIPTCNGRHCGDTWIMKDENKPHLWTENWTAQFR 271
Query: 260 SFGGAVPYRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNFDRTSGGPFISTSYDYDAPLD 319
+FG + R ED+A++V RFF +GGT NYYMY+GGTNF RT G ++ T Y + P+D
Sbjct: 272 AFGNDLAQRSAEDIAYSVLRFFAKGGTLVNYYMYYGGTNFGRT-GASYVLTGYYDEGPID 330
Query: 320 EYGLIRQPKWGHLKDLHKAIKLCEAALVATDPTYPSLGPNLEATVYKT-GSGLCSAFLAN 378
EYG+ + PK+GHL+DLH IK A + ++ LG EA ++ LC AF++N
Sbjct: 331 EYGMPKAPKYGHLRDLHNVIKSYSRAFLEGKQSFELLGQGYEARNFEIPEEKLCLAFISN 390
Query: 379 IGTNSDVTVKFNGNSYLLPAWSVSILPDCKNVVFNTAKINSVTLVPSFSRQSLQVAADSS 438
T D TV F G+ Y +P+ SVSIL DCK+VV+NT ++ S +S A ++
Sbjct: 391 NNTGEDGTVIFRGDKYYIPSRSVSILADCKHVVYNTKRV-----FVQHSERSFHKAEKAT 445
Query: 439 DAIGSGWSYINEPVGISKDDAFTKPGLLEQINTTADQSDYLWYSLSTNIKADEPLLEDGS 498
+ W +E + K LEQ N T DQSDYLWY+ S ++AD+ +
Sbjct: 446 K--NNVWEMFSELIPRYKQTTIRNKEPLEQYNQTKDQSDYLWYTTSFRLEADDLPIRGDI 503
Query: 499 KTVLHVQSLGHALHAFINGKLVGSGYGSSSNAKVTVDFPIALAPGKNTFDLLSLTVGLQN 558
+ V+ V+S HA+ F+N G+G+GS T + PI+L G N LLS ++G+++
Sbjct: 504 RPVIAVKSTAHAMVGFVNDAFAGNGHGSKKEKFFTFETPISLRLGVNHLALLSSSMGMKD 563
Query: 559 YGAFYEKTGAGITGPVQLKGSGNGTNIDLSSQQWTYQTGLKGE--ELNFPSG-SSTQWDS 615
G + GI ++G GT +DL W ++ L+GE E+ G + +W
Sbjct: 564 SGGELVELKGGIQD-CTIQGLNTGT-LDLQINGWGHKAKLEGEVKEIYTEKGMGAVKW-- 619
Query: 616 KSTLPKL--QPLVWYKTTFDAPAGSEPVAIDFTGMGKGEAWVNGQSIGRYWPTYVSQNGG 673
+P + Q + WYK FD P G +PV +D T M KG +VNG+ +GRYW +Y
Sbjct: 620 ---VPAVSGQAVTWYKRYFDEPDGDDPVVLDMTSMCKGMIFVNGEGMGRYWTSY------ 670
Query: 674 CTDSCNYRGAYSSNKCLKNCGK-PSQSLYHVPRSWLKSSGNTLVLFEEIGGDPTKISFVT 732
K GK SQ++YH+PR++LKS N LV+FEE G P I T
Sbjct: 671 -----------------KTPGKVASQAVYHIPRTFLKSKNNLLVVFEEELGKPEGILIQT 713
Query: 733 KQLGSSLCSHVTDSHPLPVDMW----GSDSKIQRKPGPVLSLECPNPNQVISSIKFASFG 788
+ +C +++ +P + W G I L CP P ++I + FASFG
Sbjct: 714 VRR-DDICVFISEHNPAQIKPWDEHGGQIKLIAEDHNTRGFLNCP-PKKIIQEVVFASFG 771
Query: 789 TPLGTCGSFSRGRCSSARSLSVVRQACVGSKSCSIGVSVNTFGDP--CKGVMKSLAVEAS 846
P+G+C +F+ G C + + +V + C+G K C + V +G C +LAV+
Sbjct: 772 NPVGSCANFTVGTCHTPNAKEIVEKECLGKKGCVLPVLHTFYGADINCPTTTATLAVQVR 831
Query: 847 C 847
C
Sbjct: 832 C 832
>gi|413949218|gb|AFW81867.1| hypothetical protein ZEAMMB73_495459 [Zea mays]
Length = 759
Score = 625 bits (1613), Expect = e-176, Method: Compositional matrix adjust.
Identities = 359/826 (43%), Positives = 477/826 (57%), Gaps = 92/826 (11%)
Query: 27 VTYDHRAVVIGGKRRVLISGSIHYPRSTPEMWPDLIQKSKDGGLDVIETYVFWNLHEPVR 86
VTY+ RA+V+ G RR+L +G +HYPRSTPEMWP LI K+K+GGLDVI+TYVFWN+HEP++
Sbjct: 18 VTYEQRALVLDGARRMLFAGEMHYPRSTPEMWPKLIAKAKEGGLDVIQTYVFWNVHEPIQ 77
Query: 87 NQYNFEGRYDLVKFVKLVAEAGLYAHLRIGPYVCAEWNFGGFPLWLHFIPGIQFRTDNEP 146
QYNFEGRYDLV+F+K + GLY LRIGP++ +EW +GGFP WLH +P I FR+DNEP
Sbjct: 78 GQYNFEGRYDLVRFIKEIQAQGLYVSLRIGPFIESEWKYGGFPFWLHDVPNITFRSDNEP 137
Query: 147 FKAEMQRFTAKIVDMMKQEKLYASQGGPIILSQIENEYGNIDSAYGAAGKSYIKWAAGMA 206
FK MQRF IV+MMK E LY QGGPII SQIENEY ++ A+G++G+ Y+ WAA MA
Sbjct: 138 FKQHMQRFVTDIVNMMKHEGLYYPQGGPIITSQIENEYQMVEPAFGSSGQRYVSWAAAMA 197
Query: 207 LSLDTGVPWVMCQQSDAPDPIINTCNGFYCDQFTPNSNNKPKMWTENWSGWFLSFGGAVP 266
+ L TGVPW MC+Q+DAPDP++ + F +S N +L +G
Sbjct: 198 VDLQTGVPWTMCKQNDAPDPVVGIHSYTIPVNFQNDSRN------------YLIYGNDTK 245
Query: 267 YRPVEDLAFAVARFFQR-GGTFQNYYMYHGGTNFDRTSGGPFISTSYDYDAPLDEYGLIR 325
R +D+ FAVA F R G++ +YYMYHGGTNF R + +++TSY APLDEYGLI
Sbjct: 246 LRSPQDITFAVALFIARKNGSYVSYYMYHGGTNFGRFASS-YVTTSYYDGAPLDEYGLIW 304
Query: 326 QPKWGHLKDLHKAIKLCEAALVATDPTYPSLGPNLEATVYKTGSGLCSAFLANIGTNSDV 385
QP WGHL++LH A+K L+ + S+G EA +++T + C AFL N +
Sbjct: 305 QPTWGHLRELHAAVKQSSEPLLFGTYSNLSIGQEQEAHIFETETQ-CVAFLVNFDQHHIS 363
Query: 386 TVKFNGNSYLLPAWSVSILPDCKNVVFNTAKINSVTLVPSFSRQSLQVAADSSDAIGSGW 445
V F S L S+SIL DCK VVF TAK+N+ SR + +V + S S W
Sbjct: 364 EVVFRNISLELAPKSISILLDCKQVVFETAKVNA----QHGSRTAEEVQSFSDI---STW 416
Query: 446 SYINEPV--GISKDDAFTKPGLLEQINTTADQSDYLWYSLSTNIKADEPLLEDGSKTVLH 503
EP+ +SK A++ L E ++TT D +DYLWY +
Sbjct: 417 KAFKEPIPQDVSK-SAYSGNRLFEHLSTTKDATDYLWYIV-------------------- 455
Query: 504 VQSLGHALHAFINGKLVGSGYGSSSNAKVTVDFPIALAPGKNTFDLLSLTVGLQNYGAFY 563
L I G++ GS G A + I+L G NT LLS VG + GA
Sbjct: 456 ------GLFLNILGRIHGSHGGP---ANIIFSTNISLQEGPNTISLLSAMVGSPDSGAHM 506
Query: 564 EKTGAGITGPVQLKGSGNGTNIDLSSQQWTYQTGLKGEELNFPSGSS--TQWDSKSTLPK 621
E+ GI V ++ N+ L+++ W YQ GL GE N + S T+W + L
Sbjct: 507 ERRVFGIR-KVSIQQGQEPENL-LNNELWGYQVGLFGERNNIYTQDSKITEWTTIDNL-T 563
Query: 622 LQPLVWYKTTFDAPAGSEPVAIDFTGMGKGEAWVNGQSIGRYWPTYVSQNGGCTDSCNYR 681
PL WYKTTF P G++ V ++ TGMGKGE WVNG+SIGRYW ++ + +
Sbjct: 564 YSPLTWYKTTFSTPVGNDAVTLNLTGMGKGEVWVNGESIGRYWVSFKAPS---------- 613
Query: 682 GAYSSNKCLKNCGKPSQSLYHVPRSWLKSSGNTLVLFEEIGGDPTKISFVTKQLGSSLCS 741
G PSQSLYH+PR +L NTLVLFEE+GG+P I+ T + S +C
Sbjct: 614 ------------GNPSQSLYHIPREFLNPQDNTLVLFEEMGGNPQLITVNTMSV-SRVCG 660
Query: 742 HVTDSHPLPVDMWGSDSKIQRKPGPVLSLECPNPNQVISSIKFASFGTPLGTCGSFSRGR 801
+V + + S + P + L CP IS+I+FAS+G P G C F GR
Sbjct: 661 NVNE--------LSAPSLQYKDKEPAVDLWCPEGKH-ISAIEFASYGGPTGDCKKFGFGR 711
Query: 802 CSSARSLSVVRQACVGSKSCSIGVSVNTF-GDPCKGVMKSLAVEAS 846
C + S SVV+QAC+G CS+ V+ F GDPC G+ KSL V A+
Sbjct: 712 CHAGSSESVVKQACLGKSGCSVPVTPIKFGGDPCPGIQKSLLVVAN 757
>gi|414888322|tpg|DAA64336.1| TPA: hypothetical protein ZEAMMB73_578897 [Zea mays]
Length = 822
Score = 623 bits (1606), Expect = e-175, Method: Compositional matrix adjust.
Identities = 335/846 (39%), Positives = 473/846 (55%), Gaps = 63/846 (7%)
Query: 13 WGFVVLATTSFGANVTYDHRAVVIGGKRRVLISGSIHYPRSTPEMWPDLIQKSKDGGLDV 72
W T G+ VTYD R+++I GKR + SG+IHYPRS PE+WP LI+++K+GGL+
Sbjct: 22 WAAAEWNLTKKGSVVTYDGRSLMIDGKRDLFFSGAIHYPRSPPEVWPKLIERAKEGGLNT 81
Query: 73 IETYVFWNLHEPVRNQYNFEGRYDLVKFVKLVAEAGLYAHLRIGPYVCAEWNFGGFPLWL 132
IETY+FWN HEP +YNFEGR+DL+K++K++ E +YA +RIGP++ AEWN GG P WL
Sbjct: 82 IETYIFWNAHEPEPGKYNFEGRFDLIKYLKMIQEHDMYAIVRIGPFIQAEWNHGGLPYWL 141
Query: 133 HFIPGIQFRTDNEPFKAEMQRFTAKIVDMMKQEKLYASQGGPIILSQIENEYGNIDSAYG 192
I I FR +N+P+K EM++F IV +K +L+ASQGGPIIL+QIENEYGNI +
Sbjct: 142 REIDHIIFRANNDPYKKEMEKFVRFIVQKLKDAELFASQGGPIILTQIENEYGNIKKDHA 201
Query: 193 AAGKSYIKWAAGMALSLDTGVPWVMCQQSDAPDPIINTCNGFYC-DQFTPNSNNKPKMWT 251
G Y++WAA MALS TGVPW+MC+QS AP +I TCNG +C D +T NKP +WT
Sbjct: 202 TDGDKYLEWAAQMALSTQTGVPWIMCKQSSAPGEVIPTCNGRHCGDTWTLRDKNKPMLWT 261
Query: 252 ENWSGWFLSFGGAVPYRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNFDRTSGGPFISTS 311
ENW+ F ++G V R ED+A+AV RFF +GG+ NYYMYHGGTNF RT G ++ T
Sbjct: 262 ENWTQQFRAYGDQVAMRSAEDIAYAVLRFFAKGGSLVNYYMYHGGTNFGRT-GASYVLTG 320
Query: 312 YDYDAPLDEYGLIRQPKWGHLKDLHKAIKLCEAALVATDPTYPSLGPNLEATVYKT-GSG 370
Y +AP+DEYG+ ++PK+GHL+DLH I+ + A + + LG EA +++
Sbjct: 321 YYDEAPMDEYGMYKEPKFGHLRDLHNVIRSYQKAFLLGKHSSEILGHGYEAHIFELPEEN 380
Query: 371 LCSAFLANIGTNSDVTVKFNGNSYLLPAWSVSILPDCKNVVFNTAKINSVTLVPSFSRQS 430
LC +FL+N T D TV F G + +P+ SVSIL CKNVV+NT ++ + +S
Sbjct: 381 LCLSFLSNNNTGEDGTVIFRGEKHYVPSRSVSILAGCKNVVYNTKRV-----FVQHNERS 435
Query: 431 LQVAADSSDAIGSGWSYINEPVGISKDDAFTKPGLLEQINTTADQSDYLWYSLSTNIKAD 490
+ +S + W +E + +D LEQ N T D SDYLWY+ S +++D
Sbjct: 436 YHTSEVTSK--NNQWEMYSEKIPKYRDTKVRMKEPLEQFNQTKDASDYLWYTTSFRLESD 493
Query: 491 EPLLEDGSKTVLHVQSLGHALHAFINGKLVGSGYGSSSNAKVTVDFPIALAPGKNTFDLL 550
+ + + VL V+S H++ F N VG GS + P+ L G N LL
Sbjct: 494 DLPFRNDIRPVLQVKSSAHSMMGFANDAFVGCARGSKQVKGFMFEKPVDLKVGVNHVVLL 553
Query: 551 SLTVGLQNYGAFYEKTGAGITGPVQLKGSGNGTNIDLSSQQWTYQTGLKGEELNFPSGS- 609
S T+G+++ G + +GI + ++G GT +DL W ++ L+GE+ S
Sbjct: 554 SSTMGMKDSGGELAEVKSGIQECL-IQGLNTGT-LDLQVNGWGHKAALEGEDKEIYSEKG 611
Query: 610 --STQWDSKSTLPKLQPLVWYKTTFDAPAGSEPVAIDFTGMGKGEAWVNGQSIGRYWPTY 667
QW + WYK FD P G +PV +D + M KG +VNG+ +GRYW +Y
Sbjct: 612 VGKVQWKPAEN---GRAATWYKRYFDEPDGDDPVVLDMSSMDKGMIFVNGEGVGRYWVSY 668
Query: 668 VSQNGGCTDSCNYRGAYSSNKCLKNCGKPSQSLYHVPRSWLKSSGNTLVLFEEIGGDPTK 727
+ G PSQ+LYH+PR +LKS N LV+FEE G P
Sbjct: 669 RTL----------------------AGTPSQALYHIPRPFLKSKDNLLVVFEEEMGKPDG 706
Query: 728 ISFVTKQLGSSLCSHVTDSHPLPVDMWGSDSK----IQRKPGPVLSLECPNPNQVISSIK 783
I V +C +++ +P + W +D I +L CP P + I +
Sbjct: 707 I-LVQTVTRDDICLFISEHNPGQIKTWDTDGDKIKLIAEDHSRRGTLMCP-PEKTIQEVV 764
Query: 784 FASFGTPLGTCGSFSRGRCSSARSLSVVRQACVGSKSCSIGVSVNTFGDP--CKGVMKSL 841
FASFG P G CG+F+ C+G SC + V +G C+ +L
Sbjct: 765 FASFGNPEGMCGNFTE---------------CLGKPSCMLPVDHTVYGADINCQSTTATL 809
Query: 842 AVEASC 847
V+ C
Sbjct: 810 GVQVRC 815
>gi|57283676|emb|CAG30724.1| putative beta-galactosidase precursor [Hordeum vulgare]
Length = 833
Score = 617 bits (1590), Expect = e-173, Method: Compositional matrix adjust.
Identities = 337/839 (40%), Positives = 477/839 (56%), Gaps = 54/839 (6%)
Query: 21 TSFGANVTYDHRAVVIGGKRRVLISGSIHYPRSTPEMWPDLIQKSKDGGLDVIETYVFWN 80
T G V+YD R+++I GKR + SG+IHYPRS P+MW L++ +KDGGL+ IETYVFWN
Sbjct: 29 TKKGTVVSYDERSLLIDGKRDLFFSGAIHYPRSPPDMWHKLLKTAKDGGLNTIETYVFWN 88
Query: 81 LHEPVRNQYNFEGRYDLVKFVKLVAEAGLYAHLRIGPYVCAEWNFGGFPLWLHFIPGIQF 140
HEP +YNFEGR DL+KF+KL+ +YA +RIGP++ AEWN GG P WL IP I F
Sbjct: 89 AHEPEPGKYNFEGRNDLIKFLKLIQSHDMYALVRIGPFIQAEWNHGGLPYWLREIPHIIF 148
Query: 141 RTDNEPFKAEMQRFTAKIVDMMKQEKLYASQGGPIILSQIENEYGNIDSAYGAAGKSYIK 200
R +NEP+K EM++F IV +K +++ASQGGP+IL+QIENEYGNI + G Y++
Sbjct: 149 RANNEPYKKEMEKFVRFIVQKLKDAEMFASQGGPVILAQIENEYGNIKKDHIVEGDKYLE 208
Query: 201 WAAGMALSLDTGVPWVMCQQSDAPDPIINTCNGFYC-DQFTPNSNNKPKMWTENWSGWFL 259
WAA MA+S +TGVPW+MC+QS AP +I TCNG +C D +T NKP++WTENW+ F
Sbjct: 209 WAAQMAISTNTGVPWIMCKQSTAPGEVIPTCNGRHCGDTWTLKDKNKPRLWTENWTAQFR 268
Query: 260 SFGGAVPYRPVEDLAFAVARFFQRGGTFQNYYM-YHGGTNFDRTSGGPFISTSYDYDAPL 318
+FG + R ED+A++V RFF +GGT NYYM Y+GGTNF RT G ++ T Y + P+
Sbjct: 269 AFGDQLALRSAEDIAYSVLRFFAKGGTLVNYYMQYYGGTNFGRT-GASYVLTGYYDEGPV 327
Query: 319 DEYGLIRQPKWGHLKDLHKAIKLCEAALVATDPTYPSLGPNLEATVYKT-GSGLCSAFLA 377
DE + + PK+GHL+DLH IK A + ++ L EA ++ LC AF++
Sbjct: 328 DE-CMPKAPKYGHLRDLHNLIKSYSRAFLEGKQSFELLAHGYEAHNFEIPEEKLCLAFIS 386
Query: 378 NIGTNSDVTVKFNGNSYLLPAWSVSILPDCKNVVFNTAKINSVTLVPSFSRQSLQVAADS 437
N T D TV F G+ Y +P+ SVSIL DCK+VV+NT ++ S +S A
Sbjct: 387 NNNTGEDGTVNFRGDKYYIPSRSVSILADCKHVVYNTKRV-----FVQHSERSFHTAQKL 441
Query: 438 SDAIGSGWSYINEPVGISKDDAFTKPGLLEQINTTADQSDYLWYSLSTNIKADEPLLEDG 497
+ + + W +EP+ K + +EQ N T D SDYL + L +AD+
Sbjct: 442 AKS--NAWEMYSEPIPRYKLTSIRNKEPMEQYNLTKDDSDYLCFRL----EADDLPFRGD 495
Query: 498 SKTVLHVQSLGHALHAFINGKLVGSGYGSSSNAKVTVDFPIALAPGKNTFDLLSLTVGLQ 557
+ V+ V+S HAL F+N G+G GS + PI L G N LLS ++G++
Sbjct: 496 IRPVVQVKSTSHALMGFVNDAFAGNGRGSKKEKGFMFETPINLRIGINHLALLSSSMGMK 555
Query: 558 NYGAFYEKTGAGITGPVQLKGSGNGTNIDLSSQQWTYQTGLKGE--ELNFPSG-SSTQWD 614
+ G + GI ++G GT +DL W ++ L+GE E+ G + +W
Sbjct: 556 DSGGELVEVKGGIQD-CTIQGLNTGT-LDLQVNGWGHKVKLEGEVKEIYTEKGMGAVKWV 613
Query: 615 SKSTLPKLQPLVWYKTTFDAPAGSEPVAIDFTGMGKGEAWVNGQSIGRYWPTYVSQNGGC 674
+T + + WYK FD P G +PV +D T MGKG +VNG+ +GRYWP+Y +
Sbjct: 614 PATT---GRAVTWYKRYFDEPDGEDPVVLDMTSMGKGMIFVNGEGMGRYWPSYRTVG--- 667
Query: 675 TDSCNYRGAYSSNKCLKNCGKPSQSLYHVPRSWLKSSGNTLVLFEEIGGDPTKISFVTKQ 734
G PSQ++YH+PR +LK N LV+FEE G P I T +
Sbjct: 668 -------------------GVPSQAMYHIPRPFLKPKNNLLVIFEEELGKPEGILIQTVR 708
Query: 735 LGSSLCSHVTDSHPLPVDMWGSD----SKIQRKPGPVLSLECPNPNQVISSIKFASFGTP 790
+C +++ +P + W D I L+CP P + I + FASFG P
Sbjct: 709 R-DDICVFISEHNPAQIKTWDKDGGQIKLIAEDHSTRGILKCP-PKKTIQEVVFASFGNP 766
Query: 791 LGTCGSFSRGRCSSARSLSVVRQACVGSKSCSIGVSVNTFGDP--CKGVMKSLAVEASC 847
G+C +F+ G C + + +V + C+G KSC + V +G C +LAV+ C
Sbjct: 767 EGSCANFTAGTCHTPNAKDIVAKECLGKKSCVLPVLHTVYGADINCPTTTATLAVQVRC 825
>gi|242090613|ref|XP_002441139.1| hypothetical protein SORBIDRAFT_09g021140 [Sorghum bicolor]
gi|241946424|gb|EES19569.1| hypothetical protein SORBIDRAFT_09g021140 [Sorghum bicolor]
Length = 784
Score = 615 bits (1586), Expect = e-173, Method: Compositional matrix adjust.
Identities = 351/828 (42%), Positives = 472/828 (57%), Gaps = 91/828 (10%)
Query: 26 NVTYDHRAVVIGGKRRVLISGSIHYPRSTPEMWPDLIQKSKDGGLDVIETYVFWNLHEPV 85
V+ D RA+V+ G RR+L +G +HY RSTPEMWP LI K+K+GGLD+I+TYVFWN+HEPV
Sbjct: 41 QVSLDARALVVDGTRRLLFAGEMHYTRSTPEMWPKLIAKAKEGGLDMIQTYVFWNVHEPV 100
Query: 86 RNQYNFEGRYDLVKFVKLVAEAGLYAHLRIGPYVCAEWNFGGFPLWLHFIPGIQFRTDNE 145
+ QYNFEGRYDLV+F+K + GLY LRIGP++ +EW +GGFP WLH +P I FR+DNE
Sbjct: 101 QGQYNFEGRYDLVRFIKEIQAQGLYVSLRIGPFIESEWKYGGFPFWLHDVPNITFRSDNE 160
Query: 146 PFKAEMQRFTAKIVDMMKQEKLYASQGGPIILSQIENEYGNIDSAYGAAGKSYIKWAAGM 205
PFK MQRF IV+MMK E LY QGGPII SQIENEY ++ A+G++G+ Y+ WAA M
Sbjct: 161 PFKQHMQRFVTDIVNMMKHEGLYYPQGGPIITSQIENEYQMVEHAFGSSGQRYVSWAAAM 220
Query: 206 ALSLDTGVPWVMCQQSDAPDPIINTCNGFYCDQFTPNSNNKPKMWTENWSGWFLSFGGAV 265
A+ TGVPW MC+Q+DAPDP++ + F S N +L +G
Sbjct: 221 AVDRQTGVPWTMCKQNDAPDPVVGIHSHTIPLDFPNASRN------------YLIYGNDT 268
Query: 266 PYRPVEDLAFAVARFFQR-GGTFQNYYMYHGGTNFDRTSGGPFISTSYDYDAPLDEYGLI 324
R ED+AFAV F R G++ +YYMYHGGTNF R + +++TSY APLDEYGLI
Sbjct: 269 KLRSPEDIAFAVVYFIARKNGSYVSYYMYHGGTNFGRFASS-YVTTSYYDAAPLDEYGLI 327
Query: 325 RQPKWGHLKDLHKAIKLCEAALVATDPTYPSLGPNLEATVYKTGSGLCSAFLANIGTNSD 384
QP WGHL++LH A+K L+ +Y SLG EA +++T S C AFL N +
Sbjct: 328 WQPTWGHLRELHAAVKQSSEPLLFGTYSYLSLGQEQEAHIFETESQ-CVAFLVNFDRHHI 386
Query: 385 VTVKFNGNSYLLPAWSVSILPDCKNVVFNTAKINSVTLVPSFSRQSLQVAADSSDAIGSG 444
V F S L S+SIL DCK VVF TAK+ + SR + +V + S +
Sbjct: 387 SEVVFRNISLELAPKSISILSDCKRVVFETAKVTA----QHGSRTAEEVQSFSDI---NT 439
Query: 445 WSYINEPVGISKDDA-FTKPGLLEQINTTADQSDYLWYSLSTNIKADEPLLEDGSKTVLH 503
W+ EP+ A ++ L E ++TT D +DYLWY +
Sbjct: 440 WTAFKEPIPQDVSKAMYSGNRLFEHLSTTKDDTDYLWYIV-------------------- 479
Query: 504 VQSLGHALHAFINGKLVGSGYGSSSNAKVTVDFPIALAPGKNTFDLLSLTVGLQNYGAFY 563
L I G++ GS G A + ++ I+L G NT LLS VG + GA
Sbjct: 480 ------GLFHNILGRIHGSHGGP---ANIILNTNISLKEGPNTISLLSAMVGSPDSGAHM 530
Query: 564 EKTGAGITGPVQLKGSGNGTNIDLSSQQWTYQTGLKGEELNF---PSGSSTQWDSKSTLP 620
E+ G+ V ++ N+ L+++ W YQ GL GE + S +W + L
Sbjct: 531 ERRVFGLQ-KVSIQQGQEPENL-LNNELWGYQVGLFGERNSIYTQEGSKSVEWTTIYNL- 587
Query: 621 KLQPLVWYKTTFDAPAGSEPVAIDFTGMGKGEAWVNGQSIGRYWPTYVSQNGGCTDSCNY 680
PL WYKTTF PAG++ V ++ TGMGKGE WVNG+SIGRYW ++ + +
Sbjct: 588 AYSPLTWYKTTFSTPAGNDAVTLNLTGMGKGEVWVNGESIGRYWVSFKAPS--------- 638
Query: 681 RGAYSSNKCLKNCGKPSQSLYHVPRSWLKSSGNTLVLFEEIGGDPTKISFVTKQLGSSLC 740
G PSQSLYH+PR +L N LVLFEE+GG+P +I+ T + + +C
Sbjct: 639 -------------GNPSQSLYHIPRQFLNPQDNILVLFEEMGGNPQQITVNTVSV-TRVC 684
Query: 741 SHVTDSHPLPVDMWGSDSKIQRKPGPVLSLECPNPNQVISSIKFASFGTPLGTCGSFSRG 800
+V + + S + P + L C Q IS+I+FAS+G P+G C G
Sbjct: 685 VNVNE--------LSAPSLQYKNKEPAVDLRCQEGKQ-ISAIEFASYGNPIGDCKKIRFG 735
Query: 801 RCSSARSLSVVRQACVGSKSCSIGVSVNTF-GDPCKGVMKSLAVEASC 847
C + S SVV+QAC+G CSI ++ F GDPC G+ KSL V A+C
Sbjct: 736 SCHAGSSESVVKQACLGKSGCSIPITPIKFGGDPCPGIKKSLLVVANC 783
>gi|449519864|ref|XP_004166954.1| PREDICTED: LOW QUALITY PROTEIN: beta-galactosidase 3-like, partial
[Cucumis sativus]
Length = 635
Score = 615 bits (1586), Expect = e-173, Method: Compositional matrix adjust.
Identities = 314/644 (48%), Positives = 412/644 (63%), Gaps = 25/644 (3%)
Query: 213 VPWVMCQQSDAPDPIINTCNGFYCDQFTPNSNNKPKMWTENWSGWFLSFGGAVPYRPVED 272
VPWVMC+Q DAPDP+INTCNGFYCD F+PN KP WTE W+ WF +FGG RPVED
Sbjct: 3 VPWVMCKQDDAPDPMINTCNGFYCDYFSPNKPYKPNFWTEAWTAWFNNFGGPNHKRPVED 62
Query: 273 LAFAVARFFQRGGTFQNYYMYHGGTNFDRTSGGPFISTSYDYDAPLDEYGLIRQPKWGHL 332
LAF VARF Q+GG+ NYYMYHGGTNF RT+GGPFI+TSYDYDAP+DEYGLIRQPK+GHL
Sbjct: 63 LAFGVARFIQKGGSLVNYYMYHGGTNFGRTAGGPFITTSYDYDAPIDEYGLIRQPKFGHL 122
Query: 333 KDLHKAIKLCEAALVATDPTYPSLGPNLEATVYKTGSGLCSAFLANIGTNSDVTVKFNGN 392
K LH A+KLCE AL+ +P +L +A V+ + SG C+AFL+N +N+ V FNG
Sbjct: 123 KRLHDAVKLCEKALLTGEPHDYTLATYQKAKVFSSSSGDCAAFLSNYHSNNTARVTFNGR 182
Query: 393 SYLLPAWSVSILPDCKNVVFNTAKINSVTLVPSFSRQSLQVAADSSDAIGSGWSYINEPV 452
Y LP WS+SILPDCK+V++NTA++ T SF ++ + W NE +
Sbjct: 183 HYTLPPWSISILPDCKSVIYNTAQVQVQTNQLSFLPTKVESFS---------WETYNENI 233
Query: 453 -GISKDDAFTKPGLLEQINTTADQSDYLWYSLSTNIKADEPLLEDGSKTVLHVQSLGHAL 511
I +D + + GLLEQ+ T D SDYLWY+ S N+ +E L G L S GH +
Sbjct: 234 SSIEEDSSMSYDGLLEQLTITKDNSDYLWYTTSVNVDPNESYLRGGKFPTLTATSKGHGM 293
Query: 512 HAFINGKLVGSGYGSSSNAKVTVDFPIALAPGKNTFDLLSLTVGLQNYGAFYEKTGAGIT 571
H FINGKL GS +G+ N+K T I L G N LLS+ GL N G YE+ G+
Sbjct: 294 HVFINGKLAGSSFGTHDNSKFTFTGRINLQAGVNKVSLLSIAGGLPNNGPHYEEREMGVL 353
Query: 572 GPVQLKGSGNGTNIDLSSQQWTYQTGLKGEELNFPSGSSTQ---WDSKSTLPK--LQPLV 626
GPV + G G +DLS Q+W+Y+ GLKGE +N S SS Q W +K +L + QPL
Sbjct: 354 GPVAIHGLDXG-KMDLSRQKWSYKVGLKGENMNLGSPSSVQAVDW-AKDSLKQENAQPLT 411
Query: 627 WYKTTFDAPAGSEPVAIDFTGMGKGEAWVNGQSIGRYWPTYVSQNGGCTDSCNYRGAYSS 686
WYK FDAP G EP+A+D M KG+ W+NGQ++GRYW ++ NG CTD C+Y G Y
Sbjct: 412 WYKAYFDAPEGDEPLALDMGSMQKGQVWINGQNVGRYWT--ITANGNCTD-CSYSGTYRP 468
Query: 687 NKCLKNCGKPSQSLYHVPRSWLKSSGNTLVLFEEIGGDPTKISFVTKQLGSSLCSHVTDS 746
KC CG+P+Q YHVPRSWL + N +V+FEE+GG+P++IS V + + +S+C+ +
Sbjct: 469 RKCQFGCGQPTQQWYHVPRSWLMPTKNLIVVFEEVGGNPSRISLVKRSV-TSICTEASQY 527
Query: 747 HPL--PVDMWGSDSKIQRKPGPVLSLECPNPNQVISSIKFASFGTPLGTCGSFSRGRCSS 804
P+ V M ++ ++ + ++L C Q IS+IKFASFGTP G CGS +G C S
Sbjct: 528 RPVIKNVHMHQNNGELNEQNVLKINLHCA-AGQFISAIKFASFGTPSGACGSHKQGTCHS 586
Query: 805 ARSLSVVRQACVGSKSCSIGVSVNTFG-DPCKGVMKSLAVEASC 847
+S V+++ CVG + C + + FG DPC + K L+ E C
Sbjct: 587 PKSDYVLQKLCVGRQRCLATIPTSIFGEDPCPNLRKKLSAEVVC 630
>gi|242045426|ref|XP_002460584.1| hypothetical protein SORBIDRAFT_02g031260 [Sorghum bicolor]
gi|241923961|gb|EER97105.1| hypothetical protein SORBIDRAFT_02g031260 [Sorghum bicolor]
Length = 803
Score = 608 bits (1569), Expect = e-171, Method: Compositional matrix adjust.
Identities = 329/835 (39%), Positives = 470/835 (56%), Gaps = 76/835 (9%)
Query: 21 TSFGANVTYDHRAVVIGGKRRVLISGSIHYPRSTPEMWPDLIQKSKDGGLDVIETYVFWN 80
T G+ VTYD R+++I GKR + SG+IHYPRS PE+WP L+ ++K+GGL+ IETY+FWN
Sbjct: 30 TKKGSVVTYDARSLLIDGKRDLFFSGAIHYPRSPPEVWPKLLDRAKEGGLNTIETYIFWN 89
Query: 81 LHEPVRNQYNFEGRYDLVKFVKLVAEAGLYAHLRIGPYVCAEWNFGGFPLWLHFIPGIQF 140
HEP +YNFEGR DLVKF+K++ E G+YA +RIGP++ AEWN GG P WL I I F
Sbjct: 90 AHEPEPGKYNFEGRLDLVKFLKMIQEHGMYAIVRIGPFIQAEWNHGGLPYWLREIDHIIF 149
Query: 141 RTDNEPFKAEMQRFTAKIVDMMKQEKLYASQGGPIILSQIENEYGNIDSAYGAAGKSYIK 200
R +N+P+K EM+++T +V +K +L+ASQGGP+IL+QIENEYGNI + G Y++
Sbjct: 150 RANNDPYKKEMEKWTRFVVQKLKDAELFASQGGPVILTQIENEYGNIKKDHKIEGDKYLE 209
Query: 201 WAAGMALSLDTGVPWVMCQQSDAPDPIINTCNGFYC-DQFTPNSNNKPKMWTENWSGWFL 259
WAA MALS TGVPW+MC+QS AP +I TCNG +C D +T NKP +WTENW+ F
Sbjct: 210 WAAQMALSTQTGVPWIMCKQSSAPGEVIPTCNGRHCGDTWTLRDKNKPMLWTENWTQQFR 269
Query: 260 SFGGAVPYRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNFDRTSGGPFISTSYDYDAPLD 319
++G + R ED+A+AV RFF +GG+ NYYMYHGGTNF RTS ++ YD +APLD
Sbjct: 270 AYGDQLAMRSAEDIAYAVLRFFAKGGSMVNYYMYHGGTNFGRTSASYVLTGYYD-EAPLD 328
Query: 320 EYGLIRQPKWGHLKDLHKAIKLCEAALVATDPTYPSLGPNLEATVYKT-GSGLCSAFLAN 378
EYG+ ++PK+GHL+DLH I+ + A ++ + LG EA +++ LC +FL+N
Sbjct: 329 EYGMYKEPKFGHLRDLHNVIRSYQKAFLSGKHSSEILGHGYEAQIFELPEENLCLSFLSN 388
Query: 379 IGTNSDVTVKFNGNSYLLPAWSVSILPDCKNVVFNTAKINSVTLVPSFSRQSLQVAADSS 438
T D TV F G + +P+ SVSIL CK+VV+NT ++ S +S + +S
Sbjct: 389 NNTGEDGTVIFRGVKHYVPSRSVSILAGCKDVVYNTKRV-----FVQHSERSYHTSEVTS 443
Query: 439 DAIGSGWSYINEPVGISKDDAFTKPGLLEQINTTADQSDYLWYSLSTNIKADEPLLEDGS 498
+ W +E V KD LEQ N T D SDYLWY+ S +++D+
Sbjct: 444 K--NNQWEMYSEMVPKYKDTKIRTKEPLEQYNQTKDASDYLWYTTSFRLESDDLPFRGDI 501
Query: 499 KTVLHVQSLGHALHAFINGKLVGSGYGSSSNAKVTVDFPIALAPGKNTFDLLSLTVGLQN 558
+ VL V+S H++ F N VGS G+ + P+ L G N LLS T+G+++
Sbjct: 502 RPVLQVKSSAHSMIGFANDAFVGSARGNKQVKGFMFEKPVDLKAGVNHVVLLSSTMGMKD 561
Query: 559 YGAFYEKTGAGITGPVQLKGSGNGTNIDLSSQQWTYQTGLKGEELNFPSGSSTQWDSKST 618
G + GI + ++G GT +DL W
Sbjct: 562 SGGELAEVKGGIQECL-IQGLNTGT-LDLQVNGWG------------------------- 594
Query: 619 LPKLQPLVWYKTTFDAPAGSEPVAIDFTGMGKGEAWVNGQSIGRYWPTYVSQNGGCTDSC 678
+K FD P G +P+ +D + M KG +VNG+ IGRYW ++ +
Sbjct: 595 ---------HKRYFDEPDGDDPIVLDMSSMSKGMIFVNGEGIGRYWVSFRTL-------- 637
Query: 679 NYRGAYSSNKCLKNCGKPSQSLYHVPRSWLKSSGNTLVLFEEIGGDPTKISFVTKQLGSS 738
G PSQ++YH+PR +LK N LV+FEE G P I V
Sbjct: 638 --------------AGTPSQAVYHIPRPFLKPKDNLLVVFEEEMGKPDGI-LVQTVTRDD 682
Query: 739 LCSHVTDSHPLPVDMWGSDS---KIQRKPGPVL-SLECPNPNQVISSIKFASFGTPLGTC 794
+C +++ +P + W +D K+ + V +L CP P ++I + FASFG P G C
Sbjct: 683 ICLLISEHNPGQIKTWDTDGVKIKLIAEDHSVRGTLMCP-PEKIIQEVVFASFGNPDGMC 741
Query: 795 GSFSRGRCSSARSLSVVRQACVGSKSCSIGVSVNTFGDP--CKGVMKSLAVEASC 847
G+F+ G C + + +V + C+G SC + V +G C+ +L V+ C
Sbjct: 742 GNFTVGTCHTPNAKQIVEKECLGKPSCMLPVDHTVYGADINCQSTTGTLGVQVRC 796
>gi|12323389|gb|AAG51670.1|AC010704_14 putative beta-galactosidase, 3' partial; 3669-1 [Arabidopsis
thaliana]
Length = 636
Score = 605 bits (1561), Expect = e-170, Method: Compositional matrix adjust.
Identities = 318/651 (48%), Positives = 414/651 (63%), Gaps = 37/651 (5%)
Query: 15 FVVLATTSFG--ANVTYDHRAVVIGGKRRVLISGSIHYPRSTPEMWPDLIQKSKDGGLDV 72
V++A G ANVTYD R+++I G+ ++L SGSIHY RSTP+MWP LI K+K GG+DV
Sbjct: 11 LVLMAVIVAGDVANVTYDGRSLIIDGEHKILFSGSIHYTRSTPQMWPSLIAKAKSGGIDV 70
Query: 73 IETYVFWNLHEPVRNQYNFEGRYDLVKFVKLVAEAGLYAHLRIGPYVCAEWNFGGFPLWL 132
++TYVFWN+HEP + Q++F G D+VKF+K V GLY LRIGP++ EW++GG P WL
Sbjct: 71 VDTYVFWNVHEPQQGQFDFSGSRDIVKFIKEVKNHGLYVCLRIGPFIQGEWSYGGLPFWL 130
Query: 133 HFIPGIQFRTDNEPFKAEMQRFTAKIVDMMKQEKLYASQGGPIILSQIENEYGNIDSAYG 192
H + GI FRTDNEPFK M+R+ IV +MK E LYASQGGPIILSQIENEYG + A+
Sbjct: 131 HNVQGIVFRTDNEPFKYHMKRYAKMIVKLMKSENLYASQGGPIILSQIENEYGMVGRAFR 190
Query: 193 AAGKSYIKWAAGMALSLDTGVPWVMCQQSDAPDPIINTCNGFYCDQFT--PNSNNKPKMW 250
GKSY+KW A +A+ LDTGVPWVMC+Q DAPDP++N CNG C + PNS NKP +W
Sbjct: 191 QEGKSYVKWTAKLAVELDTGVPWVMCKQDDAPDPLVNACNGRQCGETFKGPNSPNKPAIW 250
Query: 251 TENWSGWFLSFGGAVPYRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNFDRTSGGPFIST 310
TENW+ ++ ++G R ED+AF VA F + G+F NYYMYHGGTNF R + F+ T
Sbjct: 251 TENWTSFYQTYGEEPLIRSAEDIAFHVALFIAKNGSFVNYYMYHGGTNFGRNA-SQFVIT 309
Query: 311 SYDYDAPLDEYGLIRQPKWGHLKDLHKAIKLCEAALVATDPTYPSLGPNLEATVYKTGSG 370
SY APLDEYGL+RQPKWGHLK+LH A+KLCE L++ T SLG A V+ +
Sbjct: 310 SYYDQAPLDEYGLLRQPKWGHLKELHAAVKLCEEPLLSGLQTTISLGKLQTAFVFGKKAN 369
Query: 371 LCSAFLANIGTNSDVTVKFNGNSYLLPAWSVSILPDCKNVVFNTAKINS-VTLVPSFSRQ 429
LC+A L N + TV+F +SY L SVS+LPDCKNV FNTAK+N+ +RQ
Sbjct: 370 LCAAILVN-QDKCESTVQFRNSSYRLSPKSVSVLPDCKNVAFNTAKVNAQYNTRTRKARQ 428
Query: 430 SLQVAADSSDAIGSGWSYINEPVGISKDDAFTKPGLLEQINTTADQSDYLWYSLSTNIKA 489
+L SS + W E V + + LLE +NTT D SDYLW +T +
Sbjct: 429 NL-----SSPQM---WEEFTETVPSFSETSIRSESLLEHMNTTQDTSDYLWQ--TTRFQQ 478
Query: 490 DEPLLEDGSKTVLHVQSLGHALHAFINGKLVGSGYGSSSNAKVTVDFPIALAPGKNTFDL 549
E G+ +VL V LGHALHAF+NG+ +GS +G+ + ++ ++L G N L
Sbjct: 479 SE-----GAPSVLKVNHLGHALHAFVNGRFIGSMHGTFKAHRFLLEKNMSLNNGTNNLAL 533
Query: 550 LSLTVGLQNYGAFYEKTGAGITGPVQLKGSGNGTNIDLSSQQWTYQTGLKGEELNFPS-- 607
LS+ VGL N GA E+ + G +K + ++ W YQ GLKGE+ + +
Sbjct: 534 LSVMVGLPNSGAHLERR---VVGSRSVKIWNGRYQLYFNNYSWGYQVGLKGEKFHVYTED 590
Query: 608 -GSSTQW----DSKSTLPKLQPLVWYKTTFDAPAGSEPVAIDFTGMGKGEA 653
+ QW DSKS QPL WYK +FD P G +PVA++ MGKGEA
Sbjct: 591 GSAKVQWKQYRDSKS-----QPLTWYKASFDTPEGEDPVALNLGSMGKGEA 636
>gi|326496501|dbj|BAJ94712.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 672
Score = 604 bits (1557), Expect = e-170, Method: Compositional matrix adjust.
Identities = 314/654 (48%), Positives = 417/654 (63%), Gaps = 28/654 (4%)
Query: 24 GANVTYDHRAVVIGGKRRVLISGSIHYPRSTPEMWPDLIQKSKDGGLDVIETYVFWNLHE 83
G VTYD RA+V+ G RR+L SG +HY RSTPEMWP LI +K GGLDVI+TYVFWN+HE
Sbjct: 37 GGEVTYDGRALVVNGTRRMLFSGEMHYTRSTPEMWPKLIANAKKGGLDVIQTYVFWNVHE 96
Query: 84 PVRNQYNFEGRYDLVKFVKLVAEAGLYAHLRIGPYVCAEWNFGGFPLWLHFIPGIQFRTD 143
PV+ QYNF+GRYDLVKF++ + GLY LRIGP++ AEW +GGFP WLH +P I FRTD
Sbjct: 97 PVQGQYNFQGRYDLVKFIREIQTQGLYVSLRIGPFIEAEWKYGGFPFWLHDVPNITFRTD 156
Query: 144 NEPFKAEMQRFTAKIVDMMKQEKLYASQGGPIILSQIENEYGNIDSAYGAAGKSYIKWAA 203
NEPFK MQRF +IV+MMK E LY QGGPII+SQIENEY ++ A+G+ G Y++WAA
Sbjct: 157 NEPFKQHMQRFVTQIVNMMKHEGLYYPQGGPIIISQIENEYQMVEPAFGSGGPRYVRWAA 216
Query: 204 GMALSLDTGVPWVMCQQSDAPDPIINTCNGFYCDQ--FTPNSNNKPKMWTENWSGWFLSF 261
MA+ L TGVPW+MC+Q+DAPDPIINTCNG C + PNS KP +WTENW+ + +
Sbjct: 217 EMAVGLQTGVPWMMCKQNDAPDPIINTCNGLICGETFVGPNSPTKPALWTENWTTRYPIY 276
Query: 262 GGAVPYRPVEDLAFAVARFFQR-GGTFQNYYMYHGGTNFDRTSGGPFISTSYDYDAPLDE 320
G R ED+AFAVA F R G+F +YYMYHGGTNF R + +++TSY APLDE
Sbjct: 277 GNDTKLRSTEDIAFAVALFIARKKGSFVSYYMYHGGTNFGRFASS-YVTTSYYDGAPLDE 335
Query: 321 YGLIRQPKWGHLKDLHKAIKLCEAALVATDPTYPSLGPNLEATVYKTGSGLCSAFLANIG 380
YGLI +P WGHL++LH A+KL AL+ + SLGP EA +++T C AFL N
Sbjct: 336 YGLIWRPTWGHLRELHAAVKLSSEALLFGRYSNFSLGPEQEAHIFETELK-CVAFLVNFD 394
Query: 381 TNSDVTVKFNGNSYLLPAWSVSILPDCKNVVFNTAKINSVTLVPSFSRQSLQVAADSSDA 440
+ TV F + L S+S+L +C+ VVF TA++N+ + ++ +V +D
Sbjct: 395 KHQTPTVVFRNIYFQLAPKSISVLSECRTVVFETARVNA-----QYGSRTAEVVESLNDI 449
Query: 441 IGSGWSYINEPV--GISKDDAFTKPGLLEQINTTADQSDYLWYSLSTN-IKADEPLLEDG 497
W EP+ ISK +T L E ++ T D++DYLWY +S I +D DG
Sbjct: 450 --HTWKAFKEPIPEDISK-AVYTGNQLFEHLSMTKDETDYLWYIVSYEYIPSD-----DG 501
Query: 498 SKTVLHVQSLGHALHAFINGKLVGSGYGSSSN-AKVTVDFPIALAPGKNTFDLLSLTVGL 556
+L+V+S H LHAF+N + GS +GS + ++ I+L G+NT LLS+ VG
Sbjct: 502 QLVLLNVESRAHVLHAFVNTEYAGSVHGSHDGPGNIILNTNISLNEGQNTISLLSVMVGS 561
Query: 557 QNYGAFYEKTGAGITGPVQLKGSGNGTNIDLSSQQWTYQTGLKGEELNF---PSGSSTQW 613
+ GA E+ GI V ++ ++ L+++ W YQ GL GE SS +W
Sbjct: 562 PDSGAHMERRSFGIH-KVSIQQGQQPLHL-LNNELWAYQVGLYGEANRIYTQEESSSAEW 619
Query: 614 DSKSTLPKLQPLVWYKTTFDAPAGSEPVAIDFTGMGKGEAWVNGQSIGRYWPTY 667
+ L P WYKTTF P G++ VA++ T MGKGE WVNG+S+GRYW ++
Sbjct: 620 TEINNL-TYHPFTWYKTTFATPVGNDVVALNLTSMGKGEVWVNGESLGRYWVSF 672
>gi|110741385|dbj|BAF02242.1| putative galactosidase [Arabidopsis thaliana]
Length = 592
Score = 602 bits (1553), Expect = e-169, Method: Compositional matrix adjust.
Identities = 314/604 (51%), Positives = 390/604 (64%), Gaps = 18/604 (2%)
Query: 249 MWTENWSGWFLSFGGAVPYRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNFDRTSGGPFI 308
MWTE W+GWF FGG VPYRP ED+AF+VARF Q+GG+F NYYMYHGGTNF RT+GGPFI
Sbjct: 1 MWTEAWTGWFTKFGGPVPYRPAEDMAFSVARFIQKGGSFINYYMYHGGTNFGRTAGGPFI 60
Query: 309 STSYDYDAPLDEYGLIRQPKWGHLKDLHKAIKLCEAALVATDPTYPSLGPNLEATVYKTG 368
+TSYDYDAPLDEYGL RQPKWGHLKDLH+AIKLCE ALV+ +PT LG EA VYK+
Sbjct: 61 ATSYDYDAPLDEYGLERQPKWGHLKDLHRAIKLCEPALVSGEPTRMPLGNYQEAHVYKSK 120
Query: 369 SGLCSAFLANIGTNSDVTVKFNGNSYLLPAWSVSILPDCKNVVFNTAKINSVTLVPSFSR 428
SG CSAFLAN S V F N Y LP WS+SILPDCKN V+NTA++ + T R
Sbjct: 121 SGACSAFLANYNPKSYAKVSFGNNHYNLPPWSISILPDCKNTVYNTARVGAQTSRMKMVR 180
Query: 429 QSLQVAADSSDAIGSGWSYINEPVGISKDDAFTKPGLLEQINTTADQSDYLWYSLSTNIK 488
+ G W NE D++FT GL+EQINTT D SDYLWY +
Sbjct: 181 VPVHG--------GLSWQAYNEDPSTYIDESFTMVGLVEQINTTRDTSDYLWYMTDVKVD 232
Query: 489 ADEPLLEDGSKTVLHVQSLGHALHAFINGKLVGSGYGSSSNAKVTVDFPIALAPGKNTFD 548
A+E L +G L V S GHA+H FING+L GS YGS + K+T + L G N
Sbjct: 233 ANEGFLRNGDLPTLTVLSAGHAMHVFINGQLSGSAYGSLDSPKLTFRKGVNLRAGFNKIA 292
Query: 549 LLSLTVGLQNYGAFYEKTGAGITGPVQLKGSGNGTNIDLSSQQWTYQTGLKGE---ELNF 605
+LS+ VGL N G +E AG+ GPV L G NG DLS Q+WTY+ GLKGE +
Sbjct: 293 ILSIAVGLPNVGPHFETWNAGVLGPVSLNGL-NGGRRDLSWQKWTYKVGLKGESLSLHSL 351
Query: 606 PSGSSTQWDSKSTLPKLQPLVWYKTTFDAPAGSEPVAIDFTGMGKGEAWVNGQSIGRYWP 665
SS +W + + + QPL WYKTTF APAG P+A+D MGKG+ W+NGQS+GR+WP
Sbjct: 352 SGSSSVEWAEGAFVAQKQPLTWYKTTFSAPAGDSPLAVDMGSMGKGQIWINGQSLGRHWP 411
Query: 666 TYVSQNGGCTDSCNYRGAYSSNKCLKNCGKPSQSLYHVPRSWLKSSGNTLVLFEEIGGDP 725
Y + G C++ C+Y G + +KCL+NCG+ SQ YHVPRSWLK SGN LV+FEE GGDP
Sbjct: 412 AYKAV-GSCSE-CSYTGTFREDKCLRNCGEASQRWYHVPRSWLKPSGNLLVVFEEWGGDP 469
Query: 726 TKISFVTKQLGSSLCSHVTDSHPLPVD-MWGSDSKIQRKPGPVLSLECPNPNQVISSIKF 784
I+ V +++ S+C+ + + V+ + K+ + P L+C P Q I+++KF
Sbjct: 470 NGITLVRREV-DSVCADIYEWQSTLVNYQLHASGKVNKPLHPKAHLQC-GPGQKITTVKF 527
Query: 785 ASFGTPLGTCGSFSRGRCSSARSLSVVRQACVGSKSCSIGVSVNTF-GDPCKGVMKSLAV 843
ASFGTP GTCGS+ +G C + S + CVG CS+ V+ F GDPC VMK LAV
Sbjct: 528 ASFGTPEGTCGSYRQGSCHAHHSYDAFNKLCVGQNWCSVTVAPEMFGGDPCPNVMKKLAV 587
Query: 844 EASC 847
EA C
Sbjct: 588 EAVC 591
>gi|4467146|emb|CAB37515.1| galactosidase like protein [Arabidopsis thaliana]
gi|7270842|emb|CAB80523.1| galactosidase like protein [Arabidopsis thaliana]
Length = 1036
Score = 601 bits (1549), Expect = e-169, Method: Compositional matrix adjust.
Identities = 326/774 (42%), Positives = 456/774 (58%), Gaps = 52/774 (6%)
Query: 88 QYNFEGRYDLVKFVKLVAEAGLYAHLRIGPYVCAEWNFGGFPLWLHFIPGIQFRTDNEPF 147
QY+F+GR+DLVKF+KL+ E GLY LR+GP++ AEWN GG P WL +P + FRT+NEPF
Sbjct: 80 QYDFKGRFDLVKFIKLIHEKGLYVTLRLGPFIQAEWNHGGLPYWLREVPDVYFRTNNEPF 139
Query: 148 KAEMQRFTAKIVDMMKQEKLYASQGGPIILSQIENEYGNIDSAYGAAGKSYIKWAAGMAL 207
K +R+ KI+ MMK+EKL+ASQGGPIIL QIENEY + AY G+ YIKWAA +
Sbjct: 140 KEHTERYVRKILGMMKEEKLFASQGGPIILGQIENEYNAVQLAYKENGEKYIKWAANLVE 199
Query: 208 SLDTGVPWVMCQQSDAPDPIINTCNGFYC-DQFT-PNSNNKPKMWTENWSGWFLSFGGAV 265
S++ G+PWVMC+Q+DAP +IN CNG +C D F PN ++KP +WTENW+ F FG
Sbjct: 200 SMNLGIPWVMCKQNDAPGNLINACNGRHCGDTFPGPNRHDKPSLWTENWTTQFRVFGDPP 259
Query: 266 PYRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNFDRTSGGPFISTSYDYDAPLDEYGLIR 325
R VED+AF+VAR+F + G+ NYYMYHGGTNF RTS F++T Y DAPLDE+GL +
Sbjct: 260 TQRTVEDIAFSVARYFSKNGSHVNYYMYHGGTNFGRTSAH-FVTTRYYDDAPLDEFGLEK 318
Query: 326 QPKWGHLKDLHKAIKLCEAALVATDPTYPSLGPNLEATVYKT-GSGLCSAFLANIGTNSD 384
PK+GHLK +H+A++LC+ AL +LGP+ E Y+ G+ +C+AFL+N T
Sbjct: 319 APKYGHLKHVHRALRLCKKALFWGQLRAQTLGPDTEVRYYEQPGTKVCAAFLSNNNTRDT 378
Query: 385 VTVKFNGNSYLLPAWSVSILPDCKNVVFNTAKINSVTLVPSFSRQSLQVAADSSDAIGSG 444
T+KF G Y+LP+ S+SILPDCK VV+NTA+I V S + + +S +
Sbjct: 379 NTIKFKGQDYVLPSRSISILPDCKTVVYNTAQI-----VAQHSWRDFVKSEKTSKGLKFE 433
Query: 445 WSYINEPVGISKDDAFTKPGLLEQINTTADQSDYLWYSLSTNIKADEPLLEDGSKTVLHV 504
N P + D PG L + T D++DY WY+ S I D+ + G KT+L V
Sbjct: 434 MFSENIPSLLDGDSLI--PGELYYL--TKDKTDYAWYTTSVKIDEDDFPDQKGLKTILRV 489
Query: 505 QSLGHALHAFINGKLVGSGYGSSSNAKVTVDFPIALAPGKNTFDLLSLTVGLQNYGAFYE 564
SLGHAL ++NG+ G +G P+ G N +L + GL + G++ E
Sbjct: 490 ASLGHALIVYVNGEYAGKAHGRHEMKSFEFAKPVNFKTGDNRISILGVLTGLPDSGSYME 549
Query: 565 KTGAGITGPVQLKGSGNGTNIDLSSQQWTYQTGLKGE--ELNFPSGS-STQWDSKSTLPK 621
AG + + G +GT + +W + GL+GE E+ GS +W+ K
Sbjct: 550 HRFAGPRA-ISIIGLKSGTRDLTENNEWGHLAGLEGEKKEVYTEEGSKKVKWEKDG---K 605
Query: 622 LQPLVWYKTTFDAPAGSEPVAIDFTGMGKGEAWVNGQSIGRYWPTYVSQNGGCTDSCNYR 681
+PL WYKT F+ P G VAI MGKG WVNG +GRYW +++S
Sbjct: 606 RKPLTWYKTYFETPEGVNAVAIRMKAMGKGLIWVNGIGVGRYWMSFLSP----------- 654
Query: 682 GAYSSNKCLKNCGKPSQSLYHVPRSWLK--SSGNTLVLFEEIGGDPTK-ISFVTKQLGSS 738
G+P+Q+ YH+PRS++K N LV+ EE G + I FV +
Sbjct: 655 -----------LGEPTQTEYHIPRSFMKGEKKKNMLVILEEEPGVKLESIDFVLVNR-DT 702
Query: 739 LCSHVTDSHPLPVDMWGSDSK--IQRKPGPVLS--LECPNPNQVISSIKFASFGTPLGTC 794
+CS+V + +P+ V W + + R L + CP P + + ++FASFG P GTC
Sbjct: 703 ICSNVGEDYPVSVKSWKREGPKIVSRSKDMRLKAVMRCP-PEKQMVEVQFASFGDPTGTC 761
Query: 795 GSFSRGRCSSARSLSVVRQACVGSKSCSIGVSVNTFGDP-CKGVMKSLAVEASC 847
G+F+ G+CS+++S VV + C+G CSI V+ TFGD C ++K+LAV+ C
Sbjct: 762 GNFTMGKCSASKSKEVVEKECLGRNYCSIVVARETFGDKGCPEIVKTLAVQVKC 815
>gi|147768425|emb|CAN73625.1| hypothetical protein VITISV_026637 [Vitis vinifera]
Length = 767
Score = 600 bits (1547), Expect = e-169, Method: Compositional matrix adjust.
Identities = 339/834 (40%), Positives = 451/834 (54%), Gaps = 115/834 (13%)
Query: 27 VTYDHRAVVIGGKRRVLISGSIHYPRSTPEMWPDLIQKSKDGGLDVIETYVFWNLHEPVR 86
VTYD R++++ G+R +L SGSIHYPRSTPE
Sbjct: 32 VTYDGRSLIVNGRRELLFSGSIHYPRSTPE------------------------------ 61
Query: 87 NQYNFEGRYDLVKFVKLVAEAGLYAHLRIGPYVCAEWNFGGFPLWLHFIPGIQFRTDNEP 146
+NFEG YDLVKF+KL+ + GLYA LRIGP++ AEWN GGFP WL +P I FR+ NEP
Sbjct: 62 --FNFEGNYDLVKFIKLIGDYGLYATLRIGPFIEAEWNHGGFPYWLREVPDIIFRSYNEP 119
Query: 147 FKAEMQRFTAKIVDMMKQEKLYASQGGPIILSQIENEYGNIDSAYGAAGKSYIKWAAGMA 206
FK M++++ I++MMK+ KL+A QGGPIIL+QIENEY +I AY G Y++WA MA
Sbjct: 120 FKYHMEKYSRMIIEMMKEAKLFAPQGGPIILAQIENEYNSIQLAYKELGVQYVQWAGKMA 179
Query: 207 LSLDTGVPWVMCQQSDAPDPIINTCNGFYC-DQFT-PNSNNKPKMWTENWSGWFLSFGGA 264
+ L GVPW+MC+Q DAPDP+INTCNG +C D FT PN NKP +WTENW+ + FG
Sbjct: 180 VGLGAGVPWIMCKQKDAPDPVINTCNGRHCGDTFTGPNRPNKPSLWTENWTAQYRVFGDP 239
Query: 265 VPYRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNFDRTSGGPFISTSYDYDAPLDEYGLI 324
R EDLAF+VARF + GT NYYMYHGGTNF RT G F++T Y +APLDEYGL
Sbjct: 240 PSQRAAEDLAFSVARFISKNGTLANYYMYHGGTNFGRT-GSSFVTTRYYDEAPLDEYGLQ 298
Query: 325 RQPKWGHLKDLHKAIKLCEAALVATDPTYPSLGPNLEATVY-KTGSGLCSAFLANIGTNS 383
R+PKWGHLKDLH A++LC+ AL P LG + E Y K G+ +C+AFL N +
Sbjct: 299 REPKWGHLKDLHSALRLCKKALFTGSPGVEKLGKDKEVRFYEKPGTHICAAFLTNNHSRE 358
Query: 384 DVTVKFNGNSYLLPAWSVSILPDCKNVVFNTAKINSVTLVPSFSRQSLQVAADSSDAIGS 443
T+ F G Y LP S+SILPDCK VV+NT ++ + +F + + A+ +
Sbjct: 359 AATLTFRGEEYFLPPHSISILPDCKTVVYNTQRVVAQHNARNFVKSKI---ANKN----L 411
Query: 444 GWSYINEPVGISKDDAFTKPGLLEQINTTADQSDYLWYSLSTNIKADEPLLEDGSKTVLH 503
W EP+ + D +E D+SDY W+ S + + ++ VL
Sbjct: 412 KWEMSQEPIPVMTDMKILTKSPMELYXFLKDRSDYAWFVTSIELSNYDLPMKKDIIPVLQ 471
Query: 504 VQSLGHALHAFINGKLVGSGYGSSSNAKVTVDFPIALAPGKNTFDLLSLTVGLQNYGAFY 563
+ +LGHA+ AF+NG +GS +GS+ P+ G+N A Y
Sbjct: 472 ISNLGHAMLAFVNGNFIGSAHGSNVEKNFVFRKPVKFQ-GRNKLHC----------PAVY 520
Query: 564 EKTGAGITGPVQLKGSGNGTNIDLSSQQWTYQTGLKGEELN-FPSGSSTQWDSKSTLPKL 622
+ GI VQ+ G GT +D+++ W Q G+ GE + + G S + + K
Sbjct: 521 DSGTTGIHS-VQILGLNTGT-LDITNNGWGQQVGVNGEHVKAYTQGGSHRVQWTAAKGKG 578
Query: 623 QPLVWYKTTFDAPAGSEPVAIDFTGMGKGEAWVNGQSIGRYWPTYVSQNGGCTDSCNYRG 682
+ WYKT FD P G++PV + T M KG NG
Sbjct: 579 PAMTWYKTYFDMPEGNDPVILRMTSMAKG----NGLE----------------------- 611
Query: 683 AYSSNKCLKNCGKPSQSLYHVPRSWLKSSGNTLVLFEEIGGDPTKI--SFVTKQLGSSLC 740
YHVPR+WLK S N LV+FEE GG+P +I V + ++C
Sbjct: 612 ------------------YHVPRAWLKPSDNLLVIFEETGGNPEEIEXELVNR---DTIC 650
Query: 741 SHVTDSHPLPVDMWGS-DSKIQR---KPGPVLSLECPNPNQVISSIKFASFGTPLGTCGS 796
S VT+ HP V W DSKI+ + P L+CPN +VI + FASFG PLG CG
Sbjct: 651 SIVTEYHPPHVKSWQRHDSKIRAVVDEVKPKGHLKCPN-YKVIVKVDFASFGNPLGACGD 709
Query: 797 FSRGRCSSARSLSVVRQACVGSKSCSIGVSVNTF---GDPCKGVMKSLAVEASC 847
F G C++ S VV Q C G +C I + F C + K+LAV+ C
Sbjct: 710 FEMGNCTAPNSKKVVEQHCXGKTTCEIPMEAGIFXGNSGACSDITKTLAVQVRC 763
>gi|218202538|gb|EEC84965.1| hypothetical protein OsI_32205 [Oryza sativa Indica Group]
Length = 807
Score = 582 bits (1500), Expect = e-163, Method: Compositional matrix adjust.
Identities = 320/836 (38%), Positives = 454/836 (54%), Gaps = 75/836 (8%)
Query: 21 TSFGANVTYDHRAVVIGGKRRVLISGSIHYPRSTPEMWPDLIQKSKDGGLDVIETYVFWN 80
T G V+YD R+++I GKR + SG+IHYPRS PEMW L++ +K GGL+ IETYVFWN
Sbjct: 30 TKKGTVVSYDERSLMIDGKRDLFFSGAIHYPRSPPEMWDKLVKTAKMGGLNTIETYVFWN 89
Query: 81 LHEPVRNQYNFEGRYDLVKFVKLVAEAGLYAHLRIGPYVCAEWNFGGFPLWLHFIPGIQF 140
HEP +Y FEGR+DL++F+ ++ + +YA +RIGP++ AEWN GG P WL I I F
Sbjct: 90 GHEPEPGKYYFEGRFDLIRFLNVIKDNDMYAIVRIGPFIQAEWNHGGLPYWLREIGHIIF 149
Query: 141 RTDNEPFKAEMQRFTAKIVDMMKQEKLYASQGGPIILSQIENEYGNIDSAYGAAGKSYIK 200
R +NEPFK IENEYGNI G Y++
Sbjct: 150 RANNEPFK-------------------------------IENEYGNIKKDRKVEGDKYLE 178
Query: 201 WAAGMALSLDTGVPWVMCQQSDAPDPIINTCNGFYC-DQFTPNSNNKPKMWTENWSGWFL 259
WAA MA+S GVPWVMC+QS AP +I TCNG +C D +T NKP++WTENW+ F
Sbjct: 179 WAAEMAISTGIGVPWVMCKQSIAPGEVIPTCNGRHCGDTWTLLDKNKPRLWTENWTAQFR 238
Query: 260 SFGGAVPYRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNFDRTSGGPFISTSYDYDAPLD 319
+FG + R ED+A+AV RFF +GGT NYYMYHGGTNF RT G ++ T Y +AP+D
Sbjct: 239 TFGDQLAQRSAEDIAYAVLRFFAKGGTLVNYYMYHGGTNFGRT-GASYVLTGYYDEAPMD 297
Query: 320 EYGLIRQPKWGHLKDLHKAIKLCEAALVATDPTYPSLGPNLEATVYKT-GSGLCSAFLAN 378
EYG+ ++PK+GHL+DLH IK A + ++ LG EA Y+ LC +FL+N
Sbjct: 298 EYGMCKEPKFGHLRDLHNVIKSYHKAFLWGKQSFEILGHGYEAHNYELPEDKLCLSFLSN 357
Query: 379 IGTNSDVTVKFNGNSYLLPAWSVSILPDCKNVVFNTAKINSVTLVPSFSRQSLQVAADSS 438
T D TV F G + +P+ SVSIL DCK VV+NT ++ S +S ++S
Sbjct: 358 NNTGEDGTVVFRGEKFYVPSRSVSILADCKTVVYNTKRV-----FVQHSERSFHTTDETS 412
Query: 439 DAIGSGWSYINEPVGISKDDAFTKPGLLEQINTTADQSDYLWYSLSTNIKADEPLLEDGS 498
+ W +E + + LEQ N T D SDYLWY+ S +++D+
Sbjct: 413 K--NNVWEMYSEAIPKFRKTKVRTKQPLEQYNQTKDTSDYLWYTTSFRLESDDLPFRRDI 470
Query: 499 KTVLHVQSLGHALHAFINGKLVGSGYGSSSNAKVTVDFPIALAPGKNTFDLLSLTVGLQN 558
+ V+ ++S HA+ F N VG+G GS + P+ L G N +LS ++G+++
Sbjct: 471 RPVIQIKSTAHAMIGFANDAFVGTGRGSKREKSFVFEKPMDLRVGINHIAMLSSSMGMKD 530
Query: 559 YGAFYEKTGAGITGPVQLKGSGNGTNIDLSSQQWTYQTGLKGEELN-FPSGSSTQWDSKS 617
G + GI V ++G GT +DL ++ L+GE+ + Q+ K
Sbjct: 531 SGGELVEVKGGIQDCV-VQGLNTGT-LDLQGNGRGHKARLEGEDKEIYTEKGMAQFQWKP 588
Query: 618 TLPKLQPLVWYKTTFDAPAGSEPVAIDFTGMGKGEAWVNGQSIGRYWPTYVSQNGGCTDS 677
L P+ WYK FD P G +P+ +D + M KG +VNG+ IGRYW ++++
Sbjct: 589 AENDL-PITWYKRYFDEPDGDDPIVVDMSSMSKGMIYVNGEGIGRYWTSFITL------- 640
Query: 678 CNYRGAYSSNKCLKNCGKPSQSLYHVPRSWLKSSGNTLVLFEEIGGDPTKISFVTKQLGS 737
G PSQS+YH+PR++LK GN L++FEE G P I T +
Sbjct: 641 ---------------AGHPSQSVYHIPRAFLKPKGNLLIIFEEELGKPGGILIQTVRR-D 684
Query: 738 SLCSHVTDSHPLPVDMWGSD----SKIQRKPGPVLSLECPNPNQVISSIKFASFGTPLGT 793
+C +++ +P + W SD I +L CP P + I + FASFG P G
Sbjct: 685 DICVFISEHNPAQIKTWESDGGQIKLIAEDTSTRGTLNCP-PQRTIQEVVFASFGNPEGA 743
Query: 794 CGSFSRGRCSSARSLSVVRQACVGSKSCSIGVSVNTFGDP--CKGVMKSLAVEASC 847
CG+F+ G C + + +VV + C+G +SC + V +G C +LAV+ C
Sbjct: 744 CGNFTAGTCHTPDAKAVVEKECLGKESCVLPVVNTVYGADINCPATTATLAVQVRC 799
>gi|414870185|tpg|DAA48742.1| TPA: hypothetical protein ZEAMMB73_126543 [Zea mays]
Length = 706
Score = 582 bits (1499), Expect = e-163, Method: Compositional matrix adjust.
Identities = 294/653 (45%), Positives = 406/653 (62%), Gaps = 18/653 (2%)
Query: 21 TSFGANVTYDHRAVVIGGKRRVLISGSIHYPRSTPEMWPDLIQKSKDGGLDVIETYVFWN 80
T G V+YD R+++ G R + +SGSIHYPRS P+MWP+LI K+K+GGL+ IETYVFWN
Sbjct: 37 TRNGTVVSYDRRSLMFDGHREIFLSGSIHYPRSPPDMWPELIAKAKEGGLNTIETYVFWN 96
Query: 81 LHEPVRNQYNFEGRYDLVKFVKLVAEAGLYAHLRIGPYVCAEWNFGGFPLWLHFIPGIQF 140
+HEP + ++NFEG+ D+V+F +L+ E +YA +R+GP++ AEWN GG P WL IP I F
Sbjct: 97 IHEPEKGEFNFEGQNDVVRFFQLIQEHDMYAMVRLGPFIQAEWNHGGLPYWLREIPDIVF 156
Query: 141 RTDNEPFKAEMQRFTAKIVDMMKQEKLYASQGGPIILSQIENEYGNIDSAYGAAGKSYIK 200
RT+NEP+K M+ F I+ +K L+ASQGGPIIL+QIENEY ++++A+ G YI
Sbjct: 157 RTNNEPYKMHMETFVKIIIKRLKDANLFASQGGPIILAQIENEYQHMEAAFKDEGTKYIN 216
Query: 201 WAAGMALSLDTGVPWVMCQQSDAPDPIINTCNGFYCDQFTPNSNNK--PKMWTENWSGWF 258
WAA MA+S + G+PW+MC+Q+ AP +I TCNG C P NK P +WTENW+ +
Sbjct: 217 WAAKMAISTNIGIPWIMCKQTKAPSDVIPTCNGRNCGDTWPGPTNKSMPLLWTENWTAQY 276
Query: 259 LSFGGAVPYRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNFDRTSGGPFISTSYDYDAPL 318
FG R ED+AFAVARFF GGT NYYMYHGGTNF RTS F+ Y +APL
Sbjct: 277 RVFGDPPSQRSAEDIAFAVARFFSVGGTLANYYMYHGGTNFGRTSAA-FVMPKYYDEAPL 335
Query: 319 DEYGLIRQPKWGHLKDLHKAIKLCEAALVATDPTYPSLGPNLEATVYKT-GSGLCSAFLA 377
DE+GL ++PKWGHL+DLH+A+KLC+ AL+ P+ LG LEA V++ +C AFL+
Sbjct: 336 DEFGLYKEPKWGHLRDLHQALKLCKKALLWGTPSTEKLGKQLEARVFEMPEQKVCVAFLS 395
Query: 378 NIGTNSDVTVKFNGNSYLLPAWSVSILPDCKNVVFNTAKINSVTLVPSFSRQSLQVAADS 437
N T D T+ F G Y +P S+S+L DC+ VVF T +N+ ++++ A +
Sbjct: 396 NHNTKDDATMTFRGRPYFVPRHSISVLADCETVVFGTQHVNA-----QHNQRTFHFADQT 450
Query: 438 SDAIGSGWSYI---NEPVGISKDDAFTKPGLLEQINTTADQSDYLWYSLSTNIKADEPLL 494
A + W N P K G L N T D++DY+WY+ S ++AD+ +
Sbjct: 451 --AQNNVWEMFDGENVPKYKQAKIRLRKAGDL--YNLTKDKTDYVWYTSSFKLEADDMPI 506
Query: 495 EDGSKTVLHVQSLGHALHAFINGKLVGSGYGSSSNAKVTVDFPIALAPGKNTFDLLSLTV 554
KTVL V S GHA AF+N K VG G+G+ N T++ P+ L G N +L+ ++
Sbjct: 507 RSDIKTVLEVNSHGHASVAFVNNKFVGCGHGTKMNKAFTLEKPMDLKKGVNHVAVLASSM 566
Query: 555 GLQNYGAFYEKTGAGITGPVQLKGSGNGTNIDLSSQQWTYQTGLKGEELNFPSGSSTQWD 614
G+ + GA+ E AG+ VQ+ G GT +DL++ W + GL GE +
Sbjct: 567 GMTDSGAYMEHRLAGVD-RVQITGLNAGT-LDLTNNGWGHIVGLVGERKQIYTDKGMGSV 624
Query: 615 SKSTLPKLQPLVWYKTTFDAPAGSEPVAIDFTGMGKGEAWVNGQSIGRYWPTY 667
+ +PL WYK FD P+G +PV +D + MGKG +VNGQ IGRYW +Y
Sbjct: 625 TWKPAMNDRPLTWYKRHFDMPSGEDPVVLDMSTMGKGMMFVNGQGIGRYWISY 677
>gi|449526237|ref|XP_004170120.1| PREDICTED: beta-galactosidase 7-like, partial [Cucumis sativus]
Length = 706
Score = 579 bits (1493), Expect = e-162, Method: Compositional matrix adjust.
Identities = 303/666 (45%), Positives = 395/666 (59%), Gaps = 50/666 (7%)
Query: 180 IENEYGNIDSAYGAAGKSYIKWAAGMALSLDTGVPWVMCQQSDAPDPIINTCNGFYCDQF 239
IENE+GN++ +YG GK Y+KW A +A S + PW+MCQQ DAP PIINTCNGFYCDQF
Sbjct: 1 IENEFGNVEGSYGQEGKEYVKWCAELAQSYNLSEPWIMCQQGDAPQPIINTCNGFYCDQF 60
Query: 240 TPNSNNKPKMWTENWSGWFLSFGGAVPYRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNF 299
PN+ N PKMWTE+W+GWF +G PYR EDLAFAVARFFQ GG+ NYYMYHGGTNF
Sbjct: 61 KPNNKNSPKMWTESWAGWFKGWGERDPYRTAEDLAFAVARFFQYGGSLHNYYMYHGGTNF 120
Query: 300 DRTSGGPFISTSYDYDAPLDEYGLIRQPKWGHLKDLHKAIKLCEAALVATDPTYPSLGPN 359
R++GGP+I+TSYDY+APLDEYG + QPKWGHLK LH+ I+ E L D + G +
Sbjct: 121 GRSAGGPYITTSYDYNAPLDEYGNMNQPKWGHLKQLHELIRSMEKVLTYGDVKHIDTGHS 180
Query: 360 LEATVYKTGSGLCSAFLANIGTNSDVTVKFNGNSYLLPAWSVSILPDCKNVVFNTAKINS 419
AT Y T G S F N NSD + F Y +P WSV++LPDCK V+NTAK+N+
Sbjct: 181 TTATSY-TYKGKSSCFFGN-PENSDREITFQERKYTVPGWSVTVLPDCKTEVYNTAKVNT 238
Query: 420 VT----LVPSFSRQSLQVAADSSDAIGSGWSYINEPV------GISKDDAFTKPGLLEQI 469
T +VPS + + W + NE + G A T L++Q
Sbjct: 239 QTTIREMVPSLVGKHKKPLK---------WQWRNEKIEHLTHEGDISGSAITANSLIDQK 289
Query: 470 NTTADQSDYLWYSLSTNIKADEPLLEDGSKTVLHVQSLGHALHAFINGKLVGSGYGSSSN 529
T D SDYLWY ++ ++PL G + L V++ GH LHAF+N K +G+ +G
Sbjct: 290 MVTNDSSDYLWYLTGFHLNGNDPLF--GKRVTLRVKTRGHILHAFVNNKHIGTQFGPYGK 347
Query: 530 AKVTVDFPIA-LAPGKNTFDLLSLTVGLQNYGAFYEKTGAGITGPVQLKGSGNGTNIDLS 588
T++ + L G N LLS TVGL NYGA+YE GI GPV+L G T DLS
Sbjct: 348 YSFTLEKKVRNLRHGFNQIALLSATVGLPNYGAYYENVEVGIYGPVELIADGK-TIRDLS 406
Query: 589 SQQWTYQTGLKGEELNF--PSGSSTQWDSKSTLPKLQPLVWYKTTFDAPAGSEPVAIDFT 646
+ +W Y+ GL GE+ F P + + LP Q WYKT+F P G E V +D
Sbjct: 407 TNEWIYKVGLDGEKYEFFDPDHKFRKPWLSNNLPLNQNFTWYKTSFSTPKGREGVVVDLM 466
Query: 647 GMGKGEAWVNGQSIGRYWPTYVSQNGGCTDSCNYRGAYSSNKCLKNCGKPSQSLYHVPRS 706
GMGKG+AWVNG+SIGRYWP+Y++ GC+ SC+YRGAY +KC NCGKP+Q YH+PRS
Sbjct: 467 GMGKGQAWVNGKSIGRYWPSYLATENGCSSSCDYRGAYYGSKCATNCGKPTQRWYHIPRS 526
Query: 707 WLKS-SGNTLVLFEEIGGDPTKISFVTKQLGSSLCSHVTDSHPLPVDMWGSDSKIQRKPG 765
++ NTL+LFEE GG P I T ++ +C+ VD+ G
Sbjct: 527 YMNDGKENTLILFEEFGGMPLNIEIKTTRV-KKVCA--------KVDL-----------G 566
Query: 766 PVLSLECPNPNQVISSIKFASFGTPLGTCGSFSRGRCSSARSLSVVRQACVGSKSCSIGV 825
L L C ++ + I F FG P G C +F +G C S+ + SV+ + C+ + CSI V
Sbjct: 567 SKLELTC--HDRTVKRIIFVGFGNPKGNCNNFHKGSCHSSEAFSVIEKECLWKRKCSIEV 624
Query: 826 SVNTFG 831
+ + G
Sbjct: 625 TKDKLG 630
>gi|110737487|dbj|BAF00686.1| beta-galactosidase [Arabidopsis thaliana]
Length = 532
Score = 577 bits (1486), Expect = e-161, Method: Compositional matrix adjust.
Identities = 283/537 (52%), Positives = 363/537 (67%), Gaps = 14/537 (2%)
Query: 205 MALSLDTGVPWVMCQQSDAPDPIINTCNGFYCDQFTPNSNNKPKMWTENWSGWFLSFGGA 264
MA+S + GVPW+MCQQ DAP +I+TCNGFYCDQFTPN+ +KPK+WTENW GWF +FGG
Sbjct: 1 MAVSQNIGVPWMMCQQWDAPPTVISTCNGFYCDQFTPNTPDKPKIWTENWPGWFKTFGGR 60
Query: 265 VPYRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNFDRTSGGPFISTSYDYDAPLDEYGLI 324
P+RP ED+A++VARFF +GG+ NYYMYHGGTNF RTSGGPFI+TSYDY+AP+DEYGL
Sbjct: 61 DPHRPAEDVAYSVARFFGKGGSVHNYYMYHGGTNFGRTSGGPFITTSYDYEAPIDEYGLP 120
Query: 325 RQPKWGHLKDLHKAIKLCEAALVATDPTYPSLGPNLEATVYKTGSGLCSAFLANIGTNSD 384
R PKWGHLKDLHKAI L E L++ + +LG +LEA VY SG C+AFL+N+ +D
Sbjct: 121 RLPKWGHLKDLHKAIMLSENLLISGEHQNFTLGHSLEADVYTDSSGTCAAFLSNLDDKND 180
Query: 385 VTVKFNGNSYLLPAWSVSILPDCKNVVFNTAKINSVTLVPSFSRQSLQVAADSSDAIGSG 444
V F SY LPAWSVSILPDCK VFNTAK+ S S + + D + G
Sbjct: 181 KAVMFRNTSYHLPAWSVSILPDCKTEVFNTAKVTSK------SSKVEMLPEDLKSSSGLK 234
Query: 445 WSYINEPVGISKDDAFTKPGLLEQINTTADQSDYLWYSLSTNIKADEPLLEDGSKTVLHV 504
W +E GI F K L++ INTT D +DYLWY+ S + +E L+ GS VL +
Sbjct: 235 WEVFSEKPGIWGAADFVKNELVDHINTTKDTTDYLWYTTSITVSENEAFLKKGSSPVLFI 294
Query: 505 QSLGHALHAFINGKLVGSGYGSSSNAKVTVDFPIALAPGKNTFDLLSLTVGLQNYGAFYE 564
+S GH LH FIN + +G+ G+ ++ + P+AL G+N DLLS+TVGL N G+FYE
Sbjct: 295 ESKGHTLHVFINKEYLGTATGNGTHVPFKLKKPVALKAGENNIDLLSMTVGLANAGSFYE 354
Query: 565 KTGAGITGPVQLKGSGNGTNIDLSSQQWTYQTGLKGEELN-FPSGSS--TQWDSKSTLPK 621
GAG+T V +KG GT ++L++ +W+Y+ G++GE L F G+S +W + PK
Sbjct: 355 WVGAGLTS-VSIKGFNKGT-LNLTNSKWSYKLGVEGEHLELFKPGNSGAVKWTVTTKPPK 412
Query: 622 LQPLVWYKTTFDAPAGSEPVAIDFTGMGKGEAWVNGQSIGRYWPTYV---SQNGGCTDSC 678
QPL WYK + P+GSEPV +D MGKG AW+NG+ IGRYWP S N C C
Sbjct: 413 KQPLTWYKVVIEPPSGSEPVGLDMISMGKGMAWLNGEEIGRYWPRIARKNSPNDECVKEC 472
Query: 679 NYRGAYSSNKCLKNCGKPSQSLYHVPRSWLKSSGNTLVLFEEIGGDPTKISFVTKQL 735
+YRG + +KCL CG+PSQ YHVPRSW KSSGN LV+FEE GG+P KI +++
Sbjct: 473 DYRGKFMPDKCLTGCGEPSQRWYHVPRSWFKSSGNELVIFEEKGGNPMKIKLSKRKV 529
>gi|110739914|dbj|BAF01862.1| beta-galactosidase like protein [Arabidopsis thaliana]
Length = 578
Score = 574 bits (1480), Expect = e-161, Method: Compositional matrix adjust.
Identities = 298/585 (50%), Positives = 380/585 (64%), Gaps = 28/585 (4%)
Query: 273 LAFAVARFFQRGGTFQNYYMYHGGTNFDRTSGGPFISTSYDYDAPLDEYGLIRQPKWGHL 332
LAF VARF Q+GG+F NYYMYHGGTNF RT+GGPF++TSYDYDAP+DEYGLIRQPK+GHL
Sbjct: 1 LAFGVARFIQKGGSFVNYYMYHGGTNFGRTAGGPFVTTSYDYDAPIDEYGLIRQPKYGHL 60
Query: 333 KDLHKAIKLCEAALVATDPTYPSLGPNLEATVYKTGSGLCSAFLANIGTNSDVTVKFNGN 392
K+LH+AIK+CE ALV+ DP S+G +A VY SG CSAFLAN T S V FN
Sbjct: 61 KELHRAIKMCEKALVSADPVVTSIGNKQQAHVYSAESGDCSAFLANYDTESAARVLFNNV 120
Query: 393 SYLLPAWSVSILPDCKNVVFNTAKINSVTLVPSFSRQSLQVAADSSDAIGSGW-SYINEP 451
Y LP WS+SILPDC+N VFNTAK+ Q+ Q+ +D W SY+ +
Sbjct: 121 HYNLPPWSISILPDCRNAVFNTAKVGV---------QTSQMEMLPTDTKNFQWESYLEDL 171
Query: 452 VGISKDDAFTKPGLLEQINTTADQSDYLWYSLSTNIKADEPLLEDGSKTVLHVQSLGHAL 511
+ FT GLLEQIN T D SDYLWY S +I E L G L +QS GHA+
Sbjct: 172 SSLDDSSTFTTHGLLEQINVTRDTSDYLWYMTSVDIGDSESFLHGGELPTLIIQSTGHAV 231
Query: 512 HAFINGKLVGSGYGSSSNAKVTVDFPIALAPGKNTFDLLSLTVGLQNYGAFYEKTGAGIT 571
H F+NG+L GS +G+ N + T I L G N LLS+ VGL N G +E GI
Sbjct: 232 HIFVNGQLSGSAFGTRQNRRFTYQGKINLHSGTNRIALLSVAVGLPNVGGHFESWNTGIL 291
Query: 572 GPVQLKGSGNGTNIDLSSQQWTYQTGLKGEELN--FPSGS-STQW-DSKSTLPKLQPLVW 627
GPV L G G +DLS Q+WTYQ GLKGE +N FP+ + S W D+ T+ K QPL W
Sbjct: 292 GPVALHGLSQG-KMDLSWQKWTYQVGLKGEAMNLAFPTNTPSIGWMDASLTVQKPQPLTW 350
Query: 628 YKTTFDAPAGSEPVAIDFTGMGKGEAWVNGQSIGRYWPTYVSQNGGCTDSCNYRGAYSSN 687
+KT FDAP G+EP+A+D GMGKG+ WVNG+SIGRYW + + G C+ C+Y G Y N
Sbjct: 351 HKTYFDAPEGNEPLALDMEGMGKGQIWVNGESIGRYWTAFAT--GDCSH-CSYTGTYKPN 407
Query: 688 KCLKNCGKPSQSLYHVPRSWLKSSGNTLVLFEEIGGDPTKISFVTKQLGSSLCSHVTDSH 747
KC CG+P+Q YHVPR+WLK S N LV+FEE+GG+P+ +S V + + S +C+ V++ H
Sbjct: 408 KCQTGCGQPTQRWYHVPRAWLKPSQNLLVIFEELGGNPSTVSLVKRSV-SGVCAEVSEYH 466
Query: 748 P----LPVDMWGSDSKIQRKPGPVLSLECPNPNQVISSIKFASFGTPLGTCGSFSRGRCS 803
P ++ +G R P + L+C +P Q I+SIKFASFGTPLGTCGS+ +G C
Sbjct: 467 PNIKNWQIESYGKGQTFHR---PKVHLKC-SPGQAIASIKFASFGTPLGTCGSYQQGECH 522
Query: 804 SARSLSVVRQACVGSKSCSIGVSVNTFG-DPCKGVMKSLAVEASC 847
+A S +++ + CVG C++ +S + FG DPC V+K L VEA C
Sbjct: 523 AATSYAILERKCVGKARCAVTISNSNFGKDPCPNVLKRLTVEAVC 567
>gi|298205211|emb|CBI17270.3| unnamed protein product [Vitis vinifera]
Length = 1064
Score = 572 bits (1473), Expect = e-160, Method: Compositional matrix adjust.
Identities = 259/363 (71%), Positives = 298/363 (82%), Gaps = 2/363 (0%)
Query: 4 KEILLLVLCWGFVVLATTSFGA-NVTYDHRAVVIGGKRRVLISGSIHYPRSTPEMWPDLI 62
+ + +LC+ + SF NV+YDHRA++I GKRR+L+S IHYPR+TPEMWPDLI
Sbjct: 5 RALFAALLCFSLTIQLGVSFAPFNVSYDHRALLIDGKRRMLVSAGIHYPRATPEMWPDLI 64
Query: 63 QKSKDGGLDVIETYVFWNLHEPVRNQYNFEGRYDLVKFVKLVAEAGLYAHLRIGPYVCAE 122
KSK+GG DVI+TYVFWN HEPVR QYNFEGRYD+VKFVKLV +GLY HLRIGPYVCAE
Sbjct: 65 AKSKEGGADVIQTYVFWNGHEPVRRQYNFEGRYDIVKFVKLVGSSGLYLHLRIGPYVCAE 124
Query: 123 WNFGGFPLWLHFIPGIQFRTDNEPFKAEMQRFTAKIVDMMKQEKLYASQGGPIILSQIEN 182
WNFGGFP+WL IPGI+FRTDN PFK EMQRF KIVD+M++E L++ QGGPII+ QIEN
Sbjct: 125 WNFGGFPVWLRDIPGIEFRTDNAPFKDEMQRFVKKIVDLMQKEMLFSWQGGPIIMLQIEN 184
Query: 183 EYGNIDSAYGAAGKSYIKWAAGMALSLDTGVPWVMCQQSDAPDPIINTCNGFYCDQFTPN 242
EYGN++S++G GK Y+KWAA MAL LD GVPWVMCQQ+DAPD IIN CNGFYCD F PN
Sbjct: 185 EYGNVESSFGQRGKDYVKWAARMALELDAGVPWVMCQQADAPDIIINACNGFYCDAFWPN 244
Query: 243 SNNKPKMWTENWSGWFLSFGGAVPYRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNFDRT 302
S NKPK+WTE+W+GWF S+GG P RPVED+AFAVARFFQRGG+F NYYMY GGTNF R+
Sbjct: 245 SANKPKLWTEDWNGWFASWGGRTPKRPVEDIAFAVARFFQRGGSFHNYYMYFGGTNFGRS 304
Query: 303 SGGPFISTSYDYDAPLDEYGLIRQPKWGHLKDLHKAIKLCEAALVATD-PTYPSLGPNLE 361
SGGPF TSYDYDAP+DEYGL+ QPKWGHLK+LH AIKLCE ALVA D P Y LGP E
Sbjct: 305 SGGPFYVTSYDYDAPIDEYGLLSQPKWGHLKELHAAIKLCEPALVAVDSPQYIKLGPMQE 364
Query: 362 ATV 364
V
Sbjct: 365 VGV 367
Score = 389 bits (1000), Expect = e-105, Method: Compositional matrix adjust.
Identities = 222/502 (44%), Positives = 293/502 (58%), Gaps = 32/502 (6%)
Query: 359 NLEATVYKTGSG---LCSAFLANIGTNSDVTVKFNGNSYLLPAWSVSILPDCKNVVFNTA 415
++ ++Y T SG CSAFLANI + +V F G Y LP WSVSILPDC+ VFNTA
Sbjct: 571 RVKESLYSTQSGNGSSCSAFLANIDEHKTASVTFLGQIYKLPPWSVSILPDCRTTVFNTA 630
Query: 416 KINSVTLVPSFSRQSLQVAADSSDAIGSGWSYINEPVGISKDDAFTKPGLLEQINTTADQ 475
K+ + T + + + W + EP+ + ++ FT G+LE +N T D
Sbjct: 631 KVGAQT----------SIKTNKISYVPKTWMTLKEPISVWSENNFTIQGVLEHLNVTKDH 680
Query: 476 SDYLWYSLSTNIKADEPLL--EDGSKTVLHVQSLGHALHAFINGKLVGSGYGSSSNAKVT 533
SDYLW N+ A++ E+ L + S+ LH F+NG+L+GS G V
Sbjct: 681 SDYLWRITRINVSAEDISFWEENQVSPTLSIDSMRDILHIFVNGQLIGSVIGHW----VK 736
Query: 534 VDFPIALAPGKNTFDLLSLTVGLQNYGAFYEKTGAGITGPVQLKGSGNGTNIDLSSQQWT 593
V PI L G N LLS TVGLQNYGAF EK GAG G V+L G NG IDLS WT
Sbjct: 737 VVQPIQLLQGYNDLVLLSQTVGLQNYGAFLEKDGAGFKGQVKLTGFKNG-EIDLSEYSWT 795
Query: 594 YQTGLKGE-ELNFPSGSSTQWDSKSTLPKLQP--LVWYKTTFDAPAGSEPVAIDFTGMGK 650
YQ GL+GE + + S + + P P WYKT FDAP G PVA+D MGK
Sbjct: 796 YQVGLRGEFQKIYMIDESEKAEWTDLTPDASPSTFTWYKTFFDAPNGENPVALDLGSMGK 855
Query: 651 GEAWVNGQSIGRYWPTYVSQNGGCTDSCNYRGAYSSNKCLKNCGKPSQSLYHVPRSWLKS 710
G+AWVNG IGRYW T V+ GC C+YRG Y ++KC NCG P+Q YH+PRSWL++
Sbjct: 856 GQAWVNGHHIGRYW-TRVAPKDGC-GKCDYRGHYHTSKCATNCGNPTQIWYHIPRSWLQA 913
Query: 711 SGNTLVLFEEIGGDPTKISFVTKQLGSSLCSHVTDSHPLPVDMWGS----DSKIQRKPGP 766
S N LVLFEE GG P +IS V + ++C+ V++SH + W D + K P
Sbjct: 914 SNNLLVLFEETGGKPFEIS-VKSRSTQTICAEVSESHYPSLQNWSPSDFIDQNSKNKMTP 972
Query: 767 VLSLECPNPNQVISSIKFASFGTPLGTCGSFSRGRCSSARSLSVVRQACVGSKSCSIGVS 826
+ L+C + ISSI+FAS+GTP G+C FS+G+C + SL++V +AC G SC I +
Sbjct: 973 EMHLQC-DDGHTISSIEFASYGTPQGSCQMFSQGQCHAPNSLALVSKACQGKGSCVIRIL 1031
Query: 827 VNTF-GDPCKGVMKSLAVEASC 847
+ F GDPC+G++K+LAVEA C
Sbjct: 1032 NSAFGGDPCRGIVKTLAVEAKC 1053
>gi|15027869|gb|AAK76465.1| putative beta-galactosidase [Arabidopsis thaliana]
Length = 621
Score = 568 bits (1464), Expect = e-159, Method: Compositional matrix adjust.
Identities = 301/656 (45%), Positives = 394/656 (60%), Gaps = 48/656 (7%)
Query: 205 MALSLDTGVPWVMCQQSDAPDPIINTCNGFYCDQFTPNSNNKPKMWTENWSGWFLSFGGA 264
MA SLD GVPW+MCQQ +AP P++ TCNGFYCDQ+ P + + PKMWTENW+GWF ++GG
Sbjct: 1 MANSLDIGVPWLMCQQPNAPQPMLETCNGFYCDQYEPTNPSTPKMWTENWTGWFKNWGGK 60
Query: 265 VPYRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNFDRTSGGPFISTSYDYDAPLDEYGLI 324
PYR EDLAF+VARFFQ GGTFQNYYMYHGGTNF R +GGP+I+TSYDY APLDE+G +
Sbjct: 61 HPYRTAEDLAFSVARFFQTGGTFQNYYMYHGGTNFGRVAGGPYITTSYDYHAPLDEFGNL 120
Query: 325 RQPKWGHLKDLHKAIKLCEAALVATDPTYPSLGPNLEATVYKTGSGLCSAFLANIGTNSD 384
QPKWGHLK LH +K E +L + + LG +++AT+Y T G S F+ N+ +D
Sbjct: 121 NQPKWGHLKQLHTVLKSMEKSLTYGNISRIDLGNSIKATIYTTKEG-SSCFIGNVNATAD 179
Query: 385 VTVKFNGNSYLLPAWSVSILPDCKNVVFNTAKINSVTLVPSFSRQSLQVAADSSDAIGSG 444
V F G Y +PAWSVS+LPDC +NTAK+N+ T + + DSS
Sbjct: 180 ALVNFKGKDYHVPAWSVSVLPDCDKEAYNTAKVNTQTSI---------MTEDSSKPERLE 230
Query: 445 WSYINEPVG---ISKDDAFTKPGLLEQINTTADQSDYLWYSLSTNIKADEPLLEDGSKTV 501
W++ E + GL++Q + T D SDYLWY ++ +PL
Sbjct: 231 WTWRPESAQKMILKGSGDLIAKGLVDQKDVTNDASDYLWYMTRLHLDKKDPLWS--RNMT 288
Query: 502 LHVQSLGHALHAFINGKLVGSGYGSSSNAKVTVDFPI-ALAPGKNTFDLLSLTVGLQNYG 560
L V S H LHA++NGK VG+ + + + L G N LLS++VGLQNYG
Sbjct: 289 LRVHSNAHVLHAYVNGKYVGNQFVKDGKFDYRFERKVNHLVHGTNHISLLSVSVGLQNYG 348
Query: 561 AFYEKTGAGITGPVQLKGSGNGTNI--DLSSQQWTYQTGLKG---EELNFPSGSSTQWDS 615
F+E GI GPV L G I DLS QW Y+ GL G + + S +W +
Sbjct: 349 PFFESGPTGINGPVSLVGYKGEETIEKDLSQHQWDYKIGLNGYNDKLFSIKSVGHQKW-A 407
Query: 616 KSTLPKLQPLVWYKTTFDAPAGSEPVAIDFTGMGKGEAWVNGQSIGRYWPTYVSQNGGCT 675
LP + L WYK F AP G EPV +D G+GKGEAW+NGQSIGRYWP++ S + GC
Sbjct: 408 NEKLPTGRMLTWYKAKFKAPLGKEPVIVDLNGLGKGEAWINGQSIGRYWPSFNSSDDGCK 467
Query: 676 DSCNYRGAYSSNKCLKNCGKPSQSLYHVPRSWLKSSG-NTLVLFEEIGGDPTKISFVTKQ 734
D C+YRGAY S+KC CGKP+Q YHVPRS+L +SG NT+ LFEE+GG+P+ ++F T
Sbjct: 468 DKCDYRGAYGSDKCAFMCGKPTQRWYHVPRSFLNASGHNTITLFEEMGGNPSMVNFKTVV 527
Query: 735 LGSSLCSHVTDSHPLPVDMWGSDSKIQRKPGPVLSLECPNPNQVISSIKFASFGTPLGTC 794
+G ++C+ + + + L C N+ IS++KFASFG PLG C
Sbjct: 528 VG-TVCARAHEHNK-------------------VELSC--HNRPISAVKFASFGNPLGHC 565
Query: 795 GSFSRGRCSSAR-SLSVVRQACVGSKSCSIGVSVNTFGDP--CKGVMKSLAVEASC 847
GSF+ G C + + V + CVG +C++ VS +TFG C K LAVE C
Sbjct: 566 GSFAVGTCQGDKDAAKTVAKECVGKLNCTVNVSSDTFGSTLDCGDSPKKLAVELEC 621
>gi|2924512|emb|CAA17766.1| beta-galactosidase-like protein [Arabidopsis thaliana]
gi|7270452|emb|CAB80218.1| beta-galactosidase-like protein [Arabidopsis thaliana]
Length = 831
Score = 565 bits (1455), Expect = e-158, Method: Compositional matrix adjust.
Identities = 326/856 (38%), Positives = 464/856 (54%), Gaps = 119/856 (13%)
Query: 27 VTYDHRAVVIGGKRRVLISGSIHYPRSTPEMWPDLIQKSKDGGLDVIETYVFWNLHEPVR 86
VTYD +++I GKR +L SGSIHYPRSTPEMWP +I+++K GGL+ I+TYVFWN+HEP +
Sbjct: 54 VTYDGTSLIIDGKRELLYSGSIHYPRSTPEMWPSIIKRAKQGGLNTIQTYVFWNVHEPQQ 113
Query: 87 NQYNFEGRYDLVKFVKLVAEAGLYAHLRIGPYVCAEWNFGGFPLWLHFIPGIQFRTDNEP 146
++NF GR DLVKF+KL+ + G+Y LR+GP++ AEW G + H +R
Sbjct: 114 GKFNFSGRADLVKFIKLIQKNGMYVTLRLGPFIQAEWTHGYITRYDHKNIAGAYR----- 168
Query: 147 FKAEMQRFTAKIVDMMKQEKLYASQGGPIILSQIENEYGNIDSAYGAAGKSYIKWAAGMA 206
+IENEY + AY G +YIKWA+ +
Sbjct: 169 --------------------------------KIENEYSAVQRAYKQDGLNYIKWASNLV 196
Query: 207 LSLDTGVPWVMCQQSDAPDPIINTCNGFYC-DQFT-PNSNNKPKMWTENWSGWFLSFGGA 264
S+ G+PWVMC+Q+DAPDP+IN CNG +C D F PN NKP +WTENW+ F FG
Sbjct: 197 DSMKLGIPWVMCKQNDAPDPMINACNGRHCGDTFPGPNRENKPSLWTENWTTQFRVFGDP 256
Query: 265 VPYRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNFDRTSGGPFISTSYDYDAPLDEYGLI 324
R VED+A++VARFF + GT NYYMYHGGTNF RTS +++T Y DAPLDEYGL
Sbjct: 257 PTQRSVEDIAYSVARFFSKNGTHVNYYMYHGGTNFGRTSAH-YVTTRYYDDAPLDEYGLE 315
Query: 325 RQPKWGHLKDLHKAIKLCEAALVATDPTYPSLGPNLEATVYKT-GSGLCSAFLANIGTNS 383
++PK+GHLK LH A+ LC+ L+ P G + E Y+ G+ C+AFLAN T +
Sbjct: 316 KEPKYGHLKHLHNALNLCKKPLLWGQPKTEKPGKDTEIRYYEQPGTKTCAAFLANNNTEA 375
Query: 384 DVTVKFNGNSYLLPAWSVSILPDCKNVVFNTAKINSVTLVPSFSRQS-------LQVAAD 436
T+KF G Y++ S+SILPDCK VV+NTA+I S +F + +V +
Sbjct: 376 AETIKFKGREYVIAPRSISILPDCKTVVYNTAQIVSQHTSRNFMKSKKANKKFDFKVFTE 435
Query: 437 SSDAIGSGWSYINEPVGISKDDAFTKPGLLEQINTTADQSDYLWYSLSTNIKADEPLLED 496
+ + G SYI PV E T D++DY WY+ S + + +
Sbjct: 436 TLPSKLEGNSYI--PV--------------ELYGLTKDKTDYGWYTTSFKVHKNHLPTKK 479
Query: 497 GSKTVLHVQSLGHALHAFINGKLVGSGYGSSSNAKVTVDFPIALAPGKNTFDLLSLTVGL 556
G KT + + SLGHALHA++NG+ +GSG+GS + L G+N +L + G
Sbjct: 480 GVKTFVRIASLGHALHAWLNGEYLGSGHGSHEEKSFVFQKQVTLKAGENHLVMLGVLTGF 539
Query: 557 QNYGAFYEKTGAGITGPVQLKGSGNGTNIDLS-SQQWTYQTGLKGEELNFPSGS---STQ 612
+ G++ E G G + + G +GT +DL+ S +W + G++GE+L + +
Sbjct: 540 PDSGSYMEHRYTGPRG-ISILGLTSGT-LDLTESSKWGNKIGMEGEKLGIHTEEGLKKVE 597
Query: 613 WDSKSTLPKLQPLVWY----------KTTFDAPAGSEPVAIDFTGMGKGEAWVNGQSIGR 662
W K K L WY +T FDAP I GMGKG WVNG+ +GR
Sbjct: 598 W--KKFTGKAPGLTWYQKFSKECETLQTYFDAPESVSAATIRMHGMGKGLIWVNGEGVGR 655
Query: 663 YWPTYVSQNGGCTDSCNYRGAYSSNKCLKNCGKPSQSLYHVPRSWLKSSGNTLVLFEE-- 720
YW +++S G+P+Q YH+PRS+LK N LV+FEE
Sbjct: 656 YWQSFLSP----------------------LGQPTQIEYHIPRSFLKPKKNLLVIFEEEP 693
Query: 721 -IGGDPTKISFVTKQLGSSLCSHVTDSHPLPVDMWGSDSK----IQRKPGPVLSLECPNP 775
+ + + V + ++CS+V +++ V W I +L+C
Sbjct: 694 NVKPELMDFAIVNR---DTVCSYVGENYTPSVRHWTRKKDQVQAITDNVSLTATLKCSGT 750
Query: 776 NQVISSIKFASFGTPLGTCGSFSRGRCSSARSLSVVRQACVGSKSCSIGVSVNTF----G 831
+ I++++FASFG P+G CG+F+ G C++ S V+ + C+G C I V+ +TF
Sbjct: 751 KK-IAAVEFASFGNPIGVCGNFTLGTCNAPVSKQVIEKHCLGKAECVIPVNKSTFQQDKK 809
Query: 832 DPCKGVMKSLAVEASC 847
D CK V+K LAV+ C
Sbjct: 810 DSCKNVVKMLAVQVKC 825
>gi|449445172|ref|XP_004140347.1| PREDICTED: beta-galactosidase 7-like [Cucumis sativus]
Length = 493
Score = 564 bits (1453), Expect = e-158, Method: Compositional matrix adjust.
Identities = 276/483 (57%), Positives = 338/483 (69%), Gaps = 18/483 (3%)
Query: 8 LLVLCWGFVVLATTSF--GANVTYDHRAVVIGGKRRVLISGSIHYPRSTPEMWPDLIQKS 65
+ L W LA +F G NV+YD A++I G+RR++ SGSIHYPRST MWPDLIQK+
Sbjct: 1 MFKLQWLVATLACLTFCLGDNVSYDSNAIIINGERRIIFSGSIHYPRSTEAMWPDLIQKA 60
Query: 66 KDGGLDVIETYVFWNLHEPVRNQYNFEGRYDLVKFVKLVAEAGLYAHLRIGPYVCAEWNF 125
KDGGLD IETY+FW+ HEP R +Y+F GR D +KF +L+ +AGLY +RIGPYVCAEWN+
Sbjct: 61 KDGGLDAIETYIFWDRHEPQRRKYDFSGRLDFIKFFQLIQDAGLYVVMRIGPYVCAEWNY 120
Query: 126 GGFPLWLHFIPGIQFRTDNEPFKAEMQRFTAKIVDMMKQEKLYASQGGPIILSQIENEYG 185
GGFP+WLH +PGIQ RT+N+ +K EMQ FT KIV+M KQ L+ASQGGPIIL+QIENEYG
Sbjct: 121 GGFPVWLHNMPGIQLRTNNQVYKNEMQTFTTKIVNMCKQANLFASQGGPIILAQIENEYG 180
Query: 186 NIDS-AYGAAGKSYIKWAAGMALSLDTGVPWVMCQQSDAPDPIINTCNGFYCDQFTPNSN 244
N+ + AYG AGK+YI W A MA SL+ GVPW+MCQQSDAP PIINTCNGFYCD FTPN+
Sbjct: 181 NVMTPAYGDAGKAYINWCAQMAESLNIGVPWIMCQQSDAPQPIINTCNGFYCDNFTPNNP 240
Query: 245 NKPKMWTENWSGWFLSFGGAVPYRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNFDRTSG 304
PKM+TENW GWF +G PYR ED+AF+VARFFQ GG F NYYMYHGGTNF RTSG
Sbjct: 241 KSPKMFTENWVGWFKKWGDKDPYRTAEDVAFSVARFFQSGGVFNNYYMYHGGTNFGRTSG 300
Query: 305 GPFISTSYDYDAPLDEYGLIRQPKWGHLKDLHKAIKLCEAALVATDPTYPSLGPNLEATV 364
GPFI+TSYDY+APLDEYG + QPKWGHLK LH +IKL E L T + G ++ T
Sbjct: 301 GPFITTSYDYNAPLDEYGNLNQPKWGHLKQLHASIKLGEKILTNGTHTNQNFGSSVTLTK 360
Query: 365 Y---KTGSGLCSAFLANIGTNSDVTVKFNGN-SYLLPAWSVSILPDCKNVVFNTAKINSV 420
+ TG C FL+N +D T+ + Y +PAWSVSIL C V+NTAK+NS
Sbjct: 361 FFNPTTGERFC--FLSNTDGKNDATIDLQADGKYFVPAWSVSILDGCNKEVYNTAKVNSQ 418
Query: 421 TLVPSFSRQSLQVAADSSDAIGSGWSYINEPVG--ISKDDAFTKPGLLEQINTTADQSDY 478
T + F ++ + + W++ EP+ + + F LEQ TAD SDY
Sbjct: 419 TSM--FVKEQ-----NEKENAQLSWAWAPEPMKDTLQGNGKFAANLFLEQKRVTADFSDY 471
Query: 479 LWY 481
WY
Sbjct: 472 FWY 474
>gi|359477955|ref|XP_003632046.1| PREDICTED: LOW QUALITY PROTEIN: beta-galactosidase 10-like [Vitis
vinifera]
Length = 563
Score = 558 bits (1439), Expect = e-156, Method: Compositional matrix adjust.
Identities = 280/544 (51%), Positives = 366/544 (67%), Gaps = 6/544 (1%)
Query: 57 MWPDLIQKSKDGGLDVIETYVFWNLHEPVRNQYNFEGRYDLVKFVKLVAEAGLYAHLRIG 116
MW L++ +K+GG+DVIETYVF N HE + Y F G YDL+KFVK+V +AG+Y L IG
Sbjct: 1 MWSGLVKTAKEGGIDVIETYVFQNGHELSPSNYYFGGWYDLLKFVKIVQQAGMYLILHIG 60
Query: 117 PYVCAEWNFGGFPLWLHFIPGIQFRTDNEPFKAEMQRFTAKIVDMMKQEKLYASQGGPII 176
P+V EWNFGG P+WLH++P F+T+++PFK MQ+F IV++MK++KL+ASQGGPII
Sbjct: 61 PFVATEWNFGGVPIWLHYVPRTIFQTNSKPFKYHMQKFMTLIVNIMKKDKLFASQGGPII 120
Query: 177 LSQIENEYGNIDSAYGAAGKSYIKWAAGMALSLDTGVPWVMCQQSDAPDPIINTCNGFYC 236
L+Q+ENEYG+ Y GK Y+ WAA M LS + GVPW+MCQ + DP+INTCN FYC
Sbjct: 121 LTQVENEYGDTKRIYEDGGKPYVMWAANMVLSHNIGVPWIMCQXYASSDPMINTCNSFYC 180
Query: 237 DQFTPNSNNKPKMWTENWSGWFLSFGGAVPYRPVEDLAFAVARFFQRGGTFQNYYMYHGG 296
DQFTPNS +K +MWTENW WF +FG + +R ED+AF+VA FF NYYMYHGG
Sbjct: 181 DQFTPNSPSKAQMWTENWPRWFKTFGASNSHRLHEDIAFSVALFFFPKSX--NYYMYHGG 238
Query: 297 TNFDRTSGGPFISTSYDYDAPLDEYGLIRQPKWGHLKDLHKAIKLCEAALVATDPTYPSL 356
TNF TSGGPFI+T+Y+Y+AP+DEYGL R PK GHLK+L +AIK CE L+ +P L
Sbjct: 239 TNFGCTSGGPFITTTYNYNAPIDEYGLARLPKCGHLKELRRAIKSCEHVLLYGEPINLXL 298
Query: 357 GPNLEATVYKTGSGLCSAFLANIGTNSDVTVKFNGNSYLLPAWSVSILPDCKNVVFNTAK 416
GP+ E VY G +AF++N+ D + F SY +PAWSVSILPDCKNVVFNTAK
Sbjct: 299 GPSQEVDVYADSLGGYAAFISNVDEKEDKMIVFQNXSYHVPAWSVSILPDCKNVVFNTAK 358
Query: 417 INSVTLVPSFSRQSLQ--VAADSSDAIGSGWSYINEPVGISKDDAFTKPGLLEQINTTAD 474
+ S + LQ + + D G W E GI + F K G ++ INTT D
Sbjct: 359 VVSQISQVEMVLEDLQPSLVPSNKDLKGLXWKTFVEKAGIWGEADFVKNGFVDHINTTKD 418
Query: 475 QSDYLWYSLSTNIKADEPLLEDGSKTVLHVQSLGHALHAFINGKLVGSGYGSSSNAKVTV 534
+D LWY++S + E L++ S+ +L V+S GHALHAF+N KL GS G+ S++
Sbjct: 419 TTDXLWYTVSITVGESENFLKEISQPILLVESKGHALHAFVNQKLQGSASGNGSHSPFKF 478
Query: 535 DFPIALAPGKNTFDLLSLTVGLQNYGAFYEKTGAGITGPVQLKGSGNGTNIDLSSQQWTY 594
+ PI+L GKN +LS+TVGLQN FYE GA +T V++KG NG +DLS+ W Y
Sbjct: 479 ECPISLKAGKNEIVVLSMTVGLQNEIPFYEWVGARLTS-VKIKGLNNGI-MDLSTYPWIY 536
Query: 595 QTGL 598
++ L
Sbjct: 537 KSLL 540
>gi|33521216|gb|AAQ21370.1| beta-galactosidase [Sandersonia aurantiaca]
Length = 568
Score = 558 bits (1439), Expect = e-156, Method: Compositional matrix adjust.
Identities = 291/591 (49%), Positives = 383/591 (64%), Gaps = 33/591 (5%)
Query: 266 PYRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNFDRTSGGPFISTSYDYDAPLDEYGLIR 325
P+RP ED+AFAVARF Q+GG+F NYYMYHGGTNF RT+GGPFI+TSYDYDAP+DEYGL+R
Sbjct: 1 PHRPAEDIAFAVARFIQKGGSFVNYYMYHGGTNFGRTAGGPFIATSYDYDAPIDEYGLLR 60
Query: 326 QPKWGHLKDLHKAIKLCEAALVATDPTYPSLGPNLEATVYKTGSGLCSAFLANIGTNSDV 385
+PKWGHL+DLH+AIKLCE ALV+ DPT S+G ++ V+++ +G C+AFL+N + S
Sbjct: 61 EPKWGHLRDLHRAIKLCEPALVSGDPTVTSIGHYQQSHVFRSKAGACAAFLSNYDSGSYA 120
Query: 386 TVKFNGNSYLLPAWSVSILPDCKNVVFNTAKINSVTLVPSFSRQSLQVAADSSDAIGSGW 445
V FNG Y +P WS+SILPDCK VFNTA+I + T S+ ++ A S W
Sbjct: 121 RVVFNGIHYDIPPWSISILPDCKTTVFNTARIGAQT-----SQLKMEWAGKFS------W 169
Query: 446 SYINEPVGISKDDAFTKPGLLEQINTTADQSDYLWYSLSTNIKADEPLLEDGSKTVLHVQ 505
NE D +FTK GL+EQI+ T D +DYLWY+ NI +E L++G VL V
Sbjct: 170 ESYNEDTNSFDDRSFTKVGLVEQISMTRDNTDYLWYTTYVNIGENEGFLKNGHYPVLTVN 229
Query: 506 SLGHALHAFINGKLVGSGYGSSSNAKVTVDFPIALAPGKNTFDLLSLTVGLQNYGAFYEK 565
S GH++H +ING+L G+ YG+ N K+T + L G N +LS+ VGL N G +E
Sbjct: 230 SAGHSMHIYINGQLTGTIYGALENPKLTYTGSVKLWAGSNKISILSVAVGLPNIGGHFET 289
Query: 566 TGAGITGPVQLKGSGNGTNIDLSSQQWTYQTGLKGEELNFPS---GSSTQWDSKSTLPKL 622
G+ GPV L G G DLS Q+W YQ GLKGE LN + SS +W S +
Sbjct: 290 WNTGVLGPVTLSGLNEGKR-DLSWQKWIYQIGLKGEALNLHTLSGSSSVEWGGPS---QK 345
Query: 623 QPLVWYKTTFDAPAGSEPVAIDFTGMGKGEAWVNGQSIGRYWPTYVSQNGGCTDSCNYRG 682
Q L WYKT+F+APAG++P+A+D MGKG+ W+NGQS+GRYWP Y + G C+YRG
Sbjct: 346 QSLTWYKTSFNAPAGNDPLALDMGSMGKGQVWINGQSVGRYWPAYKAS--GSCGGCDYRG 403
Query: 683 AYSSNKCLKNCGKPSQSLYHVPRSWLKSSGNTLVLFEEIGGDPTKISFVTKQLGSSLCSH 742
Y+ KC NCG+ +Q YHVPRSWL +GN LV+FEE GGDP+ IS V +++ S+C+
Sbjct: 404 TYNEKKCQSNCGESTQRWYHVPRSWLNPTGNLLVVFEEWGGDPSGISMVRRKV-ESVCAE 462
Query: 743 VTDSHPLPVDMWGSDSKIQRKPGPVLSLECPNPNQVISSIKFASFGTPLGTCGSFSRGRC 802
+ + P +D + + + K L C P Q +++IKFASFGTP GTCG+FS G C
Sbjct: 463 IAEWQP-NMDNVHTGNYGRSKA----HLSCA-PGQKMTNIKFASFGTPQGTCGAFSEGTC 516
Query: 803 SSARSLSVVR-----QACVGSKSCSIGVSVNTF-GDPCKGVMKSLAVEASC 847
+ +S Q C+G +SC++ V+ F GDPC G MK LAVEA C
Sbjct: 517 HAHKSYDAFEKESLLQNCIGQQSCAVLVAPEVFGGDPCPGTMKKLAVEAIC 567
>gi|326500386|dbj|BAK06282.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 846
Score = 556 bits (1434), Expect = e-155, Method: Compositional matrix adjust.
Identities = 302/772 (39%), Positives = 431/772 (55%), Gaps = 48/772 (6%)
Query: 87 NQYNFEGRYDLVKFVKLVAEAGLYAHLRIGPYVCAEWNFGGFPLWLHFIPGIQFRTDNEP 146
Q FEGR DL+KF+KL+ +YA +RIGP++ AEWN GG P WL IP I FR +NEP
Sbjct: 104 RQVQFEGRNDLIKFLKLIQSHDMYALVRIGPFIQAEWNHGGLPYWLREIPHIIFRANNEP 163
Query: 147 FKAEMQRFTAKIVDMMKQEKLYASQGGPIILSQIENEYGNIDSAYGAAGKSYIKWAAGMA 206
+K EM++F IV +K +++ASQGGP+IL+QIENEYGNI + G Y++WAA MA
Sbjct: 164 YKKEMEKFVRFIVQKLKDAEMFASQGGPVILAQIENEYGNIKKDHIVEGDKYLEWAAQMA 223
Query: 207 LSLDTGVPWVMCQQSDAPDPIINTCNGFYC-DQFTPNSNNKPKMWTENWSGWFLSFGGAV 265
+S +TGVPW+MC+QS AP +I TCNG +C D +T NKP++WTENW+ F +FG +
Sbjct: 224 ISTNTGVPWIMCKQSTAPGEVIPTCNGRHCGDTWTLKDKNKPRLWTENWTAQFRAFGDQL 283
Query: 266 PYRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNFDRTSGGPFISTSYDYDAPLDEYGLIR 325
R ED+A++V RFF +GGT NYYMY+GGTNF RT G ++ T Y + P+DEYG+ +
Sbjct: 284 ALRSAEDIAYSVLRFFAKGGTLVNYYMYYGGTNFGRT-GASYVLTGYYDEGPVDEYGMPK 342
Query: 326 QPKWGHLKDLHKAIKLCEAALVATDPTYPSLGPNLEATVYKT-GSGLCSAFLANIGTNSD 384
PK+GHL+DLH IK A + ++ L EA ++ LC AF++N T D
Sbjct: 343 APKYGHLRDLHNLIKSYSRAFLEGKQSFELLAHGYEAHNFEIPEEKLCLAFISNNNTGED 402
Query: 385 VTVKFNGNSYLLPAWSVSILPDCKNVVFNTAKINSVTLVPSFSRQSLQVAADSSDAIGSG 444
TV F G+ Y +P+ SVSIL DCK+VV+NT ++ S +S A + + +
Sbjct: 403 GTVNFRGDKYYIPSRSVSILADCKHVVYNTKRV-----FVQHSERSFHTAQKLAKS--NA 455
Query: 445 WSYINEPVGISKDDAFTKPGLLEQINTTADQSDYLWYSLSTNIKADEPLLEDGSKTVLHV 504
W +EP+ K + +EQ N T D SDYLWY+ S ++AD+ + V+ V
Sbjct: 456 WEMYSEPIPRYKLTSIRNKEPMEQYNLTKDDSDYLWYTTSFRLEADDLPFRGDIRPVVQV 515
Query: 505 QSLGHALHAFINGKLVGSGYGSSSNAKVTVDFPIALAPGKNTFDLLSLTVGLQNYGAFYE 564
+S HAL F+N G+G GS + PI L G N LLS ++G+++ G
Sbjct: 516 KSTSHALMGFVNDAFAGNGRGSKKEKGFMFETPINLRIGINHLALLSSSMGMKDSGGELV 575
Query: 565 KTGAGITGPVQLKGSGNGTNIDLSSQQWTYQTGLKGE--ELNFPSG-SSTQWDSKSTLPK 621
+ GI ++G GT +DL W ++ L+GE E+ G + +W +T
Sbjct: 576 EVKGGIQD-CTIQGLNTGT-LDLQVNGWGHKVKLEGEVKEIYTEKGMGAVKWVPATT--- 630
Query: 622 LQPLVWYKTTFDAPAGSEPVAIDFTGMGKGEAWVNGQSIGRYWPTYVSQNGGCTDSCNYR 681
+ + WYK FD P G +PV +D T MGKG +VNG+ +GRYWP+Y +
Sbjct: 631 GRAVTWYKRYFDEPDGEDPVVLDMTSMGKGMIFVNGEGMGRYWPSYRTVG---------- 680
Query: 682 GAYSSNKCLKNCGKPSQSLYHVPRSWLKSSGNTLVLFEEIGGDPTKISFVTKQLGSSLCS 741
G PSQ++YH+PR +LK N LV+FEE G P I T + +C
Sbjct: 681 ------------GVPSQAMYHIPRPFLKPKNNLLVIFEEELGKPEGILIQTVRR-DDICV 727
Query: 742 HVTDSHPLPVDMWGSD----SKIQRKPGPVLSLECPNPNQVISSIKFASFGTPLGTCGSF 797
+++ +P + W D I L+CP P + I + FASFG P G+C +F
Sbjct: 728 FISEHNPAQIKTWDKDGGQIKVIAEDHSTRGILKCP-PKKTIQEVVFASFGNPEGSCANF 786
Query: 798 SRGRCSSARSLSVVRQACVGSKSCSIGVSVNTFGDP--CKGVMKSLAVEASC 847
+ G C + + +V + C+G KSC + V +G C +LAV+ C
Sbjct: 787 TAGSCHTPNAKDIVAKECLGKKSCVLPVLHTVYGADINCPTTTATLAVQVRC 838
>gi|115445061|ref|NP_001046310.1| Os02g0219200 [Oryza sativa Japonica Group]
gi|113535841|dbj|BAF08224.1| Os02g0219200, partial [Oryza sativa Japonica Group]
Length = 500
Score = 544 bits (1402), Expect = e-152, Method: Compositional matrix adjust.
Identities = 275/518 (53%), Positives = 337/518 (65%), Gaps = 23/518 (4%)
Query: 219 QQSDAPDPIINTCNGFYCDQFTPNSNNKPKMWTENWSGWFLSFGGAVPYRPVEDLAFAVA 278
+Q DAPDP+INTCNGFYCD F+PN N KP MWTE W+GWF SFGG VP+RPVEDLAFAVA
Sbjct: 1 KQDDAPDPVINTCNGFYCDYFSPNKNYKPSMWTEAWTGWFTSFGGGVPHRPVEDLAFAVA 60
Query: 279 RFFQRGGTFQNYYMYHGGTNFDRTSGGPFISTSYDYDAPLDEYGLIRQPKWGHLKDLHKA 338
RF Q+GG+F NYYMYHGGTNF RT+GGPFI+TSYDYDAP+DE+GL+RQPKWGHL+DLH+A
Sbjct: 61 RFIQKGGSFVNYYMYHGGTNFGRTAGGPFIATSYDYDAPIDEFGLLRQPKWGHLRDLHRA 120
Query: 339 IKLCEAALVATDPTYPSLGPNLEATVYKTGSGLCSAFLANIGTNSDVTVKFNGNSYLLPA 398
IK E LV+ DPT S+G +A V+K +G C+AFL+N N+ V V+FNG Y LPA
Sbjct: 121 IKQAEPVLVSADPTIESIGSYEKAYVFKAKNGACAAFLSNYHMNTAVKVRFNGQQYNLPA 180
Query: 399 WSVSILPDCKNVVFNTAKINSVTLVPSFSRQSLQVAADSSDAIGSGWSYINEPVGISKDD 458
WS+SILPDCK VFNTA + TL+P + + W +E D
Sbjct: 181 WSISILPDCKTAVFNTATVKEPTLMPKM-----------NPVVRFAWQSYSEDTNSLSDS 229
Query: 459 AFTKPGLLEQINTTADQSDYLWYSLSTNIKADEPLLEDGSKTVLHVQSLGHALHAFINGK 518
AFTK GL+EQ++ T D+SDYLWY+ NI ++ L G L V S GH++ F+NGK
Sbjct: 230 AFTKDGLVEQLSMTWDKSDYLWYTTYVNIGTND--LRSGQSPQLTVYSAGHSMQVFVNGK 287
Query: 519 LVGSGYGSSSNAKVTVDFPIALAPGKNTFDLLSLTVGLQNYGAFYEKTGAGITGPVQLKG 578
GS YG N K+T + + + G N +LS VGL N G +E G+ GPV L
Sbjct: 288 SYGSVYGGYDNPKLTYNGRVKMWQGSNKISILSSAVGLPNVGNHFENWNVGVLGPVTLS- 346
Query: 579 SGNGTNIDLSSQQWTYQTGLKGEELNFPS---GSSTQWDSKSTLPKLQPLVWYKTTFDAP 635
S NG DLS Q+WTYQ GLKGE L + S+ +W QPL W+K F+AP
Sbjct: 347 SLNGGTKDLSHQKWTYQVGLKGETLGLHTVTGSSAVEWGGPG---GYQPLTWHKAFFNAP 403
Query: 636 AGSEPVAIDFTGMGKGEAWVNGQSIGRYWPTYVSQNGGCTDSCNYRGAYSSNKCLKNCGK 695
AG++PVA+D MGKG+ WVNG +GRYW S GGC C+Y G Y +KC NCG
Sbjct: 404 AGNDPVALDMGSMGKGQLWVNGHHVGRYWSYKAS--GGC-GGCSYAGTYHEDKCRSNCGD 460
Query: 696 PSQSLYHVPRSWLKSSGNTLVLFEEIGGDPTKISFVTK 733
SQ YHVPRSWLK GN LV+ EE GGD +S T+
Sbjct: 461 LSQRWYHVPRSWLKPGGNLLVVLEEYGGDLAGVSLATR 498
>gi|281205901|gb|EFA80090.1| glycoside hydrolase family 35 protein [Polysphondylium pallidum
PN500]
Length = 727
Score = 535 bits (1377), Expect = e-149, Method: Compositional matrix adjust.
Identities = 298/720 (41%), Positives = 424/720 (58%), Gaps = 51/720 (7%)
Query: 26 NVTYDHRAVVIGGKRRVLISGSIHYPRSTPEMWPDLIQKSKDGGLDVIETYVFWNLHEPV 85
NV+YDHR+++I G+R++L+S SIHYPR+TP MW +++ +K G+D+IETY FWNLHEP
Sbjct: 42 NVSYDHRSLIINGERKLLLSASIHYPRATPSMWRPVLEATKAAGIDLIETYTFWNLHEPT 101
Query: 86 RNQYNFEGRYDLVKFVKLVAEAGLYAHLRIGPYVCAEWNFGGFPLWLHFIPGIQFRTDNE 145
YNFEG ++ F+ + AE GLY +R GPYVCAEWN+GGFP WL I GI FR N+
Sbjct: 102 PGTYNFEGNANVTAFLDICAELGLYVTVRFGPYVCAEWNYGGFPFWLKEIDGIVFRDYNQ 161
Query: 146 PFKAEMQRFTAKIVDMMKQEKLYASQGGPIILSQIENEYGNIDSAYGAAGKSYIKWAAGM 205
PF +M + IV+ ++ YAS GGPIIL+Q+ENEYG +++AYGA+G Y WAA
Sbjct: 162 PFMDQMSNWMTYIVNYLR--PYYASNGGPIILAQVENEYGWLEAAYGASGTKYALWAAQF 219
Query: 206 ALSLDTGVPWVMCQQSDAPDPIINTCNGFYC----DQFTPNSNNKPKMWTENWSGWFLSF 261
A SLD G+PW+MC Q D +INTCNGFYC D N+P WTENW GWF ++
Sbjct: 220 ANSLDIGIPWIMCSQDDIAT-VINTCNGFYCHDWIDVHWTAYPNQPAFWTENWPGWFQNW 278
Query: 262 GGAVPYRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNFDRTSGGPFISTSYDYDAPLDEY 321
G VP+RPV+D+ ++VAR+ GG+ NYYM+ GGT F R +GGPFI+TSYDYD +DEY
Sbjct: 279 EGGVPHRPVQDVLYSVARWIAYGGSMMNYYMWFGGTTFGRWTGGPFITTSYDYDGAIDEY 338
Query: 322 GLIRQPKWGHLKDLHKAIKLCEAALVATDPTYPS-LGPNLEAT-VYKTGSGLCSAFLANI 379
G +PK+ + H I E +++ +P P LG N+E + Y +G +FLAN
Sbjct: 339 GYPYEPKYSQSLEFHTIIHAYEHIILSMNPPKPILLGENVEISHFYSVETGESFSFLANF 398
Query: 380 GTNSDVTVKFNGNSYLLPAWSVSILPDCKNVVFNTAKINSVTLVPSFSRQSLQVAADSSD 439
G TV++NG ++ + WSV +L +N I + P S Q S
Sbjct: 399 GATGVQTVQWNGITFKVQPWSVQLL-------YNNVSIFDTSATPIGSPVPKQFTPIKSF 451
Query: 440 AIGSGWSYINEPVGISKDDAFTKPGLLEQINTTADQSDYLWYSLSTNIKADEPLLEDGSK 499
WS E ++ + P +EQ++ T DQ+DYLWY T I+ + G++
Sbjct: 452 ENIGQWS---ESFDLTFTNYSETP--MEQLSLTRDQTDYLWY--VTKIEVNRV----GAQ 500
Query: 500 TVLHVQSLGHALHAFINGKLVGSGYGSSSNAKVTVDFPIALAPGKNTFDLLSLTVGLQNY 559
L + ++ +H F++ + + +G G ++ +T++ I + G +T +L VGL NY
Sbjct: 501 --LSLPNISDMVHVFVDNQYIATGRGPTN---ITLNSTIGV--GGHTLQVLHTKVGLVNY 553
Query: 560 GAFYEKTGAGITGPVQLKGSGNGTNIDLSSQQWTYQTGLKGEELNF--PSGS-STQWDSK 616
E T AGI PV L ++D+SS W+ + ++GE L P+ S S QW +
Sbjct: 554 AEHMEATVAGIFEPVTLD------SVDISSNGWSMKPFVQGETLQLYNPNHSGSVQWTNV 607
Query: 617 STLPKLQPLVWYKTTFDAPAGSE-PVAIDFTGMGKGEAWVNGQSIGRYWPTYVSQNGGCT 675
+ P PL WYK F+ S +A+D GM KG +VNG +IGRYW ++ GC
Sbjct: 608 TGNP---PLTWYKFNFNLELSSNMSLALDMLGMTKGMIFVNGYNIGRYW---LALAYGC- 660
Query: 676 DSCNYRGAYSSNKCLKNCGKPSQSLYHVPRSWLKSSGNTLVLFEEIGGDPTKISFVTKQL 735
+ C Y+G YS + C CG+PSQ YHVP WL + N +V+FEE+ G+P I+ V + +
Sbjct: 661 NPCTYQGGYSPSMCQLGCGEPSQQYYHVPTDWLMNGENEIVIFEEVYGNPEAITLVQRVI 720
>gi|14517399|gb|AAK62590.1| At2g32810/F24L7.5 [Arabidopsis thaliana]
gi|25090389|gb|AAN72290.1| At2g32810/F24L7.5 [Arabidopsis thaliana]
Length = 585
Score = 519 bits (1337), Expect = e-144, Method: Compositional matrix adjust.
Identities = 285/583 (48%), Positives = 355/583 (60%), Gaps = 34/583 (5%)
Query: 292 MYHGGTNFDRTSGGPFISTSYDYDAPLDEYGLIRQPKWGHLKDLHKAIKLCEAALVATD- 350
MY GGTNF RTSGGPF TSYDYDAPLDEYGL +PKWGHLKDLH AIKLCE ALVA D
Sbjct: 1 MYFGGTNFGRTSGGPFYITSYDYDAPLDEYGLRSEPKWGHLKDLHAAIKLCEPALVAADA 60
Query: 351 PTYPSLGPNLEATVY----KTGSGLCSAFLANIGTNSDVTVKFNGNSYLLPAWSVSILPD 406
P Y LG EA +Y +TG +C+AFLANI + VKFNG SY LP WSVSILPD
Sbjct: 61 PQYRKLGSKQEAHIYHGDGETGGKVCAAFLANIDEHKSAHVKFNGQSYTLPPWSVSILPD 120
Query: 407 CKNVVFNTAKINSVTLV-------PSFSRQSLQ---VAADSSDAIGSGWSYINEPVGISK 456
C++V FNTAK+ + T V PS S+ V D+ I W + EP+GI
Sbjct: 121 CRHVAFNTAKVGAQTSVKTVESARPSLGSMSILQKVVRQDNVSYISKSWMALKEPIGIWG 180
Query: 457 DDAFTKPGLLEQINTTADQSDYLWYSLSTNIKADEPLL--EDGSKTVLHVQSLGHALHAF 514
++ FT GLLE +N T D+SDYLW+ ++ D+ ++G + + + S+ L F
Sbjct: 181 ENNFTFQGLLEHLNVTKDRSDYLWHKTRISVSEDDISFWKKNGPNSTVSIDSMRDVLRVF 240
Query: 515 INGKLVGSGYGSSSNAKVTVDFPIALAPGKNTFDLLSLTVGLQNYGAFYEKTGAGITGPV 574
+N +L GS G A P+ G N LL+ TVGLQNYGAF EK GAG G
Sbjct: 241 VNKQLAGSIVGHWVKAVQ----PVRFIQGNNDLLLLTQTVGLQNYGAFLEKDGAGFRGKA 296
Query: 575 QLKGSGNGTNIDLSSQQWTYQTGLKGEE---LNFPSGSSTQWDSKSTLPKLQPLVWYKTT 631
+L G NG ++DLS WTYQ GLKGE +W + T +WYKT
Sbjct: 297 KLTGFKNG-DLDLSKSSWTYQVGLKGEADKIYTVEHNEKAEWSTLETDASPSIFMWYKTY 355
Query: 632 FDAPAGSEPVAIDFTGMGKGEAWVNGQSIGRYWPTYVSQNGGCTDSCNYRGAYSSNKCLK 691
FD PAG++PV ++ MG+G+AWVNGQ IGRYW +SQ GC +C+YRGAY+S+KC
Sbjct: 356 FDPPAGTDPVVLNLESMGRGQAWVNGQHIGRYW-NIISQKDGCDRTCDYRGAYNSDKCTT 414
Query: 692 NCGKPSQSLYHVPRSWLKSSGNTLVLFEEIGGDPTKISFVTKQLGSSLCSHVTDSHPLPV 751
NCGKP+Q+ YHVPRSWLK S N LVLFEE GG+P KIS T G LC V++SH P+
Sbjct: 415 NCGKPTQTRYHVPRSWLKPSSNLLVLFEETGGNPFKISVKTVTAG-ILCGQVSESHYPPL 473
Query: 752 DMWGSDSKIQ-----RKPGPVLSLECPNPNQVISSIKFASFGTPLGTCGSFSRGRCSSAR 806
W + I P + L C + VISSI+FAS+GTP G+C FS G+C ++
Sbjct: 474 RKWSTPDYINGTMSINSVAPEVHLHCED-GHVISSIEFASYGTPRGSCDGFSIGKCHASN 532
Query: 807 SLSVVRQACVGSKSCSIGVSVNTF-GDPCKGVMKSLAVEASCT 848
SLS+V +AC G SC I VS F DPC G +K+LAV + C+
Sbjct: 533 SLSIVSEACKGRNSCFIEVSNTAFISDPCSGTLKTLAVMSRCS 575
>gi|255550371|ref|XP_002516236.1| beta-galactosidase, putative [Ricinus communis]
gi|223544722|gb|EEF46238.1| beta-galactosidase, putative [Ricinus communis]
Length = 775
Score = 514 bits (1325), Expect = e-143, Method: Compositional matrix adjust.
Identities = 283/637 (44%), Positives = 382/637 (59%), Gaps = 55/637 (8%)
Query: 228 INTCNGFYCDQFTPNSNNKPKMWTENWSGWFLSFGGAVPYRPVEDLAFAVARFFQRGGTF 287
INTCNG+YCD F PN+ PKM+TENWSGW+ +GG YR ED+AF+VARF Q GG F
Sbjct: 164 INTCNGYYCDTFKPNNPKSPKMFTENWSGWYKLWGGKTSYRTAEDMAFSVARFVQAGGVF 223
Query: 288 QNYYMYHGGTNFDRTSGGPFISTSYDYDAPLDEYGLIRQPKWGHLKDLHKAIKLCEAALV 347
NYYMY+GGTNF RT+GGP+I+ SYDYD+PLDEYG + QPKWGHLK LH +IKL E +
Sbjct: 224 NNYYMYYGGTNFGRTAGGPYITASYDYDSPLDEYGNLNQPKWGHLKQLHASIKLGEKIIT 283
Query: 348 ATDPTYPSLGPNLEATVYK---TGSGLCSAFLANIG-TNSDVTVKFNGNSYLLPAWSVSI 403
T + ++ T Y T C FL+NI ++ + ++ +GN Y +PAWSVSI
Sbjct: 284 NGTVTIKNFQAGVDLTAYTNNATRERFC--FLSNINIADAHIDLQQDGN-YTIPAWSVSI 340
Query: 404 LPDCKNVVFNTAKINSVTLVPSFSRQSLQVAA--DSSDAIGSGWSYINEPVG--ISKDDA 459
L +C +FNTAK+N+ T SL V ++ W + EP+ +
Sbjct: 341 LQNCSKEIFNTAKVNTQT--------SLMVKKLYENDKPTNLSWVWAPEPMKDTLLGKGR 392
Query: 460 FTKPGLLEQINTTADQSDYLWYSLSTNIKADEPLLEDGSKTVLHVQSLGHALHAFINGKL 519
F LL+Q TT D SDYLWY S ++ + + L V S GH LHA++N KL
Sbjct: 393 FRTSQLLDQKETTVDASDYLWYMTSFDMNKNTL---QWTNVTLRVTSRGHVLHAYVNKKL 449
Query: 520 VGSGYGSSSNAKVTVDFPIALAPGKNTFDLLSLTVGLQNYGAFYEKTGAGIT-GPVQLKG 578
+ G + T + P+ L PG N LLS TVGL NYG+F++KT GI GPVQL
Sbjct: 450 I-VGSQLVIQGEFTFEKPVTLKPGNNVISLLSATVGLANYGSFFDKTPVGIVDGPVQLMA 508
Query: 579 SGNGTNIDLSSQQWTYQTGLKGEELNF--PSGSSTQWDSKSTLPKLQPLVWYKTTFDAPA 636
+G +DLSS W+Y+ GL GE F P+ +W + + + +P+ WYKTTF +P+
Sbjct: 509 NGKPV-MDLSSNLWSYKIGLNGEAKRFYDPTSRHNKWSAANGVSTARPMTWYKTTFSSPS 567
Query: 637 GSEPVAIDFTGMGKGEAWVNGQSIGRYWPTYVSQNGGCTDSCNYRGAYSSNKCLKNCGKP 696
G++PV +D GMGKG AW NG+S+GRYWP+ ++ GC+ +C+YRG Y++ KC +NCG P
Sbjct: 568 GTDPVVVDLQGMGKGHAWANGKSLGRYWPSQIANANGCSGTCDYRGPYNAGKCTRNCGIP 627
Query: 697 SQSLYHVPRSWLKSSG-NTLVLFEEIGGDPTKISF--VTKQLGSSLCSHVTDSHPLPVDM 753
+Q YHVPRS+L S+G NTL+LFEE+GGDP+ ISF VT + ++C + +
Sbjct: 628 TQRWYHVPRSFLNSNGKNTLILFEEVGGDPSGISFQIVTTE---TICGNAYE-------- 676
Query: 754 WGSDSKIQRKPGPVLSLECPNPNQVISSIKFASFGTPLGTCGSFSRGRCSSARSLSVVRQ 813
G L L C + IS I+FAS+G P GTC SF +G + S+ +V++
Sbjct: 677 -----------GSTLELSCQG-GRTISEIQFASYGNPQGTCSSFKKGSFDAMNSVQMVQK 724
Query: 814 ACVGSKSCSIGVSVNTF--GDPCKGVMKSLAVEASCT 848
CVG SCSI S TF +P K LAV+A C+
Sbjct: 725 ECVGKDSCSIIASDETFMVNEPQGISNKRLAVQAHCS 761
Score = 201 bits (512), Expect = 1e-48, Method: Compositional matrix adjust.
Identities = 87/146 (59%), Positives = 111/146 (76%), Gaps = 1/146 (0%)
Query: 9 LVLCWGFVVLATTSFGANVTYDHRAVVIGGKRRVLISGSIHYPRSTPEMWPDLIQKSKDG 68
+VL +L+ S V YD A++I G+R+++ SG+IHYPRSTPEMWP+LI K+KDG
Sbjct: 8 IVLISTLALLSLCS-ATTVEYDSNALIINGERKIIFSGAIHYPRSTPEMWPELINKAKDG 66
Query: 69 GLDVIETYVFWNLHEPVRNQYNFEGRYDLVKFVKLVAEAGLYAHLRIGPYVCAEWNFGGF 128
GLD IETYVFW+ HEPVR QY+F G D+VKF +++ EAGLY LRIGPYVCAEWN+GGF
Sbjct: 67 GLDAIETYVFWDRHEPVRRQYDFSGNLDIVKFFRVIQEAGLYVILRIGPYVCAEWNYGGF 126
Query: 129 PLWLHFIPGIQFRTDNEPFKAEMQRF 154
P+WLH PG++ RTDNE +K + F
Sbjct: 127 PMWLHNTPGVELRTDNEIYKVPLLIF 152
>gi|413922056|gb|AFW61988.1| hypothetical protein ZEAMMB73_453254 [Zea mays]
Length = 326
Score = 504 bits (1299), Expect = e-140, Method: Compositional matrix adjust.
Identities = 226/306 (73%), Positives = 260/306 (84%)
Query: 17 VLATTSFGANVTYDHRAVVIGGKRRVLISGSIHYPRSTPEMWPDLIQKSKDGGLDVIETY 76
++A + A V+YDHRAVVI G+RR+LISGSIHYPRSTPEMWP L+QK+KDGGLDV++TY
Sbjct: 18 MIAPSPANAAVSYDHRAVVINGQRRILISGSIHYPRSTPEMWPGLLQKAKDGGLDVVQTY 77
Query: 77 VFWNLHEPVRNQYNFEGRYDLVKFVKLVAEAGLYAHLRIGPYVCAEWNFGGFPLWLHFIP 136
VFWN HEPVR QY F RYDLV+FVKL +AGLY HLRIGPYVCAEWNFGGFP+WL ++P
Sbjct: 78 VFWNGHEPVRGQYYFGDRYDLVRFVKLAKQAGLYVHLRIGPYVCAEWNFGGFPVWLKYVP 137
Query: 137 GIQFRTDNEPFKAEMQRFTAKIVDMMKQEKLYASQGGPIILSQIENEYGNIDSAYGAAGK 196
GI FRTDN PFKA MQ F KIV MMK E L+ QGGPIIL+Q+ENEYG ++S GA K
Sbjct: 138 GISFRTDNGPFKAAMQAFVEKIVSMMKSEGLFEWQGGPIILAQVENEYGPMESVMGAGAK 197
Query: 197 SYIKWAAGMALSLDTGVPWVMCQQSDAPDPIINTCNGFYCDQFTPNSNNKPKMWTENWSG 256
Y WAA MA++ GVPWVMC+Q DAPDP+INTCNGFYCD F+PNSN+KP MWTE W+G
Sbjct: 198 PYANWAAKMAVATGAGVPWVMCKQDDAPDPVINTCNGFYCDYFSPNSNSKPTMWTEAWTG 257
Query: 257 WFLSFGGAVPYRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNFDRTSGGPFISTSYDYDA 316
WF +FGGAVP+RPVED+AFAVARF Q+GG+F NYYMYHGGTNFDRTSGGPFI+TSYDYDA
Sbjct: 258 WFTAFGGAVPHRPVEDMAFAVARFIQKGGSFVNYYMYHGGTNFDRTSGGPFIATSYDYDA 317
Query: 317 PLDEYG 322
P+DEYG
Sbjct: 318 PIDEYG 323
>gi|414888319|tpg|DAA64333.1| TPA: hypothetical protein ZEAMMB73_578897 [Zea mays]
gi|414888320|tpg|DAA64334.1| TPA: hypothetical protein ZEAMMB73_578897 [Zea mays]
Length = 592
Score = 501 bits (1290), Expect = e-139, Method: Compositional matrix adjust.
Identities = 247/547 (45%), Positives = 343/547 (62%), Gaps = 10/547 (1%)
Query: 13 WGFVVLATTSFGANVTYDHRAVVIGGKRRVLISGSIHYPRSTPEMWPDLIQKSKDGGLDV 72
W T G+ VTYD R+++I GKR + SG+IHYPRS PE+WP LI+++K+GGL+
Sbjct: 22 WAAAEWNLTKKGSVVTYDGRSLMIDGKRDLFFSGAIHYPRSPPEVWPKLIERAKEGGLNT 81
Query: 73 IETYVFWNLHEPVRNQYNFEGRYDLVKFVKLVAEAGLYAHLRIGPYVCAEWNFGGFPLWL 132
IETY+FWN HEP +YNFEGR+DL+K++K++ E +YA +RIGP++ AEWN GG P WL
Sbjct: 82 IETYIFWNAHEPEPGKYNFEGRFDLIKYLKMIQEHDMYAIVRIGPFIQAEWNHGGLPYWL 141
Query: 133 HFIPGIQFRTDNEPFKAEMQRFTAKIVDMMKQEKLYASQGGPIILSQIENEYGNIDSAYG 192
I I FR +N+P+K EM++F IV +K +L+ASQGGPIIL+QIENEYGNI +
Sbjct: 142 REIDHIIFRANNDPYKKEMEKFVRFIVQKLKDAELFASQGGPIILTQIENEYGNIKKDHA 201
Query: 193 AAGKSYIKWAAGMALSLDTGVPWVMCQQSDAPDPIINTCNGFYC-DQFTPNSNNKPKMWT 251
G Y++WAA MALS TGVPW+MC+QS AP +I TCNG +C D +T NKP +WT
Sbjct: 202 TDGDKYLEWAAQMALSTQTGVPWIMCKQSSAPGEVIPTCNGRHCGDTWTLRDKNKPMLWT 261
Query: 252 ENWSGWFLSFGGAVPYRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNFDRTSGGPFISTS 311
ENW+ F ++G V R ED+A+AV RFF +GG+ NYYMYHGGTNF RT G ++ T
Sbjct: 262 ENWTQQFRAYGDQVAMRSAEDIAYAVLRFFAKGGSLVNYYMYHGGTNFGRT-GASYVLTG 320
Query: 312 YDYDAPLDEYGLIRQPKWGHLKDLHKAIKLCEAALVATDPTYPSLGPNLEATVYKT-GSG 370
Y +AP+DEYG+ ++PK+GHL+DLH I+ + A + + LG EA +++
Sbjct: 321 YYDEAPMDEYGMYKEPKFGHLRDLHNVIRSYQKAFLLGKHSSEILGHGYEAHIFELPEEN 380
Query: 371 LCSAFLANIGTNSDVTVKFNGNSYLLPAWSVSILPDCKNVVFNTAKINSVTLVPSFSRQS 430
LC +FL+N T D TV F G + +P+ SVSIL CKNVV+NT ++ + +S
Sbjct: 381 LCLSFLSNNNTGEDGTVIFRGEKHYVPSRSVSILAGCKNVVYNTKRV-----FVQHNERS 435
Query: 431 LQVAADSSDAIGSGWSYINEPVGISKDDAFTKPGLLEQINTTADQSDYLWYSLSTNIKAD 490
+ +S + W +E + +D LEQ N T D SDYLWY+ S +++D
Sbjct: 436 YHTSEVTSK--NNQWEMYSEKIPKYRDTKVRMKEPLEQFNQTKDASDYLWYTTSFRLESD 493
Query: 491 EPLLEDGSKTVLHVQSLGHALHAFINGKLVGSGYGSSSNAKVTVDFPIALAPGKNTFDLL 550
+ + + VL V+S H++ F N VG GS + P+ L G N LL
Sbjct: 494 DLPFRNDIRPVLQVKSSAHSMMGFANDAFVGCARGSKQVKGFMFEKPVDLKVGVNHVVLL 553
Query: 551 SLTVGLQ 557
S T+G++
Sbjct: 554 SSTMGMK 560
>gi|449468694|ref|XP_004152056.1| PREDICTED: beta-galactosidase 7-like [Cucumis sativus]
Length = 338
Score = 499 bits (1285), Expect = e-138, Method: Compositional matrix adjust.
Identities = 227/338 (67%), Positives = 270/338 (79%), Gaps = 3/338 (0%)
Query: 8 LLVLCWGFVVLATTSF--GANVTYDHRAVVIGGKRRVLISGSIHYPRSTPEMWPDLIQKS 65
+ L W LA +F G NV+YD A++I G+RR++ SGSIHYPRST MWPDLIQK+
Sbjct: 1 MFKLQWLVATLACLTFCIGDNVSYDSNALIINGERRIIFSGSIHYPRSTEAMWPDLIQKA 60
Query: 66 KDGGLDVIETYVFWNLHEPVRNQYNFEGRYDLVKFVKLVAEAGLYAHLRIGPYVCAEWNF 125
KDGGLD IETY+FW+ HEP R +Y+F GR D +KF +L+ +AGLY +RIGPYVCAEWN+
Sbjct: 61 KDGGLDAIETYIFWDRHEPQRRKYDFSGRLDFIKFFQLIQDAGLYVVMRIGPYVCAEWNY 120
Query: 126 GGFPLWLHFIPGIQFRTDNEPFKAEMQRFTAKIVDMMKQEKLYASQGGPIILSQIENEYG 185
GGFP+WLH +PGIQ RT+N+ +K EMQ FT KIV+M KQ L+ASQGGPIIL+QIENEYG
Sbjct: 121 GGFPVWLHNMPGIQLRTNNQVYKNEMQTFTTKIVNMCKQANLFASQGGPIILAQIENEYG 180
Query: 186 NIDS-AYGAAGKSYIKWAAGMALSLDTGVPWVMCQQSDAPDPIINTCNGFYCDQFTPNSN 244
N+ + AYG AGK+YI W A MA SL+ GVPW+MCQQSDAP P+INTCNGFYCD FTPN+
Sbjct: 181 NVMTPAYGDAGKAYINWCAQMAESLNIGVPWIMCQQSDAPQPMINTCNGFYCDNFTPNNP 240
Query: 245 NKPKMWTENWSGWFLSFGGAVPYRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNFDRTSG 304
PKM+TENW GWF +G PYR ED+AF+VARFFQ GG F NYYMYHGGTNF RTSG
Sbjct: 241 KSPKMFTENWVGWFKKWGDKDPYRTAEDVAFSVARFFQSGGVFNNYYMYHGGTNFGRTSG 300
Query: 305 GPFISTSYDYDAPLDEYGLIRQPKWGHLKDLHKAIKLC 342
GPFI+TSYDY+APLDEYG + QPKWGHLK LH +I +C
Sbjct: 301 GPFITTSYDYNAPLDEYGNLNQPKWGHLKQLHASIXIC 338
>gi|413954365|gb|AFW87014.1| beta-galactosidase [Zea mays]
Length = 473
Score = 498 bits (1281), Expect = e-138, Method: Compositional matrix adjust.
Identities = 258/488 (52%), Positives = 316/488 (64%), Gaps = 20/488 (4%)
Query: 249 MWTENWSGWFLSFGGAVPYRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNFDRTSGGPFI 308
MWTE W+GWF +FGGAVP+RPVED+AFAVARF Q+GG+F NYYMYHGGTNFDRTSGGPFI
Sbjct: 1 MWTEAWTGWFTAFGGAVPHRPVEDMAFAVARFIQKGGSFVNYYMYHGGTNFDRTSGGPFI 60
Query: 309 STSYDYDAPLDEYGLIRQPKWGHLKDLHKAIKLCEAALVATDPTYPSLGPNLEATVYKTG 368
+TSYDYDAP+DEYGL+RQPKWGHL+DLHKAIK E ALV+ DPT SLG +A V+K+
Sbjct: 61 ATSYDYDAPIDEYGLLRQPKWGHLRDLHKAIKQAEPALVSGDPTIQSLGNYEKAYVFKSS 120
Query: 369 SGLCSAFLANIGTNSDVTVKFNGNSYLLPAWSVSILPDCKNVVFNTAKINSVTLVPSFSR 428
G C+AFL+N T++ V FNG Y LPAWS+S+LPDCK VFNTA ++ PS
Sbjct: 121 GGACAAFLSNYHTSAAARVVFNGRRYDLPAWSISVLPDCKAAVFNTATVSE----PS--- 173
Query: 429 QSLQVAADSSDAIGSGWSYINEPVGISKDDAFTKPGLLEQINTTADQSDYLWYSLSTNIK 488
A S A G W +E AFTK GL+EQ++ T D+SDYLWY+ NI
Sbjct: 174 ----APARMSPAGGFSWQSYSEATNSLDGRAFTKDGLVEQLSMTWDKSDYLWYTTYVNIN 229
Query: 489 ADEPLLEDGSKTVLHVQSLGHALHAFINGKLVGSGYGSSSNAKVTVDFPIALAPGKNTFD 548
++E L+ G L + S GH+L F+NG+ G+ YG + K+T + + G N
Sbjct: 230 SNEQFLKSGQWPQLTIYSAGHSLQVFVNGQSYGAVYGGYDSPKLTYSGYVKMWQGSNKIS 289
Query: 549 LLSLTVGLQNYGAFYEKTGAGITGPVQLKGSGNGTNIDLSSQQWTYQTGLKGEELNFPS- 607
+LS VGL N G YE G+ GPV L G G DLS Q+WTYQ GL GE L S
Sbjct: 290 ILSAAVGLPNQGTHYETWNVGVLGPVTLSGLNEGKR-DLSDQKWTYQIGLHGESLGVQSV 348
Query: 608 --GSSTQWDSKSTLPKLQPLVWYKTTFDAPAGSEPVAIDFTGMGKGEAWVNGQSIGRYWP 665
SS +W S + QPL W+K F AP+G PVA+D MGKG+AWVNG+ IGRYW
Sbjct: 349 AGSSSVEWGSAA---GKQPLTWHKAYFSAPSGDAPVALDMGSMGKGQAWVNGRHIGRYW- 404
Query: 666 TYVSQNGGCTDSCNYRGAYSSNKCLKNCGKPSQSLYHVPRSWLKSSGNTLVLFEEIGGDP 725
+Y + + GC C+Y G YS KC CG SQ YHVPRSWL SGN LV+ EE GGD
Sbjct: 405 SYKASSSGC-GGCSYAGTYSETKCQTGCGDVSQRYYHVPRSWLNPSGNLLVMLEEFGGDL 463
Query: 726 TKISFVTK 733
+ + VT+
Sbjct: 464 SGVKLVTR 471
>gi|449436076|ref|XP_004135820.1| PREDICTED: beta-galactosidase-like [Cucumis sativus]
Length = 486
Score = 497 bits (1280), Expect = e-138, Method: Compositional matrix adjust.
Identities = 230/338 (68%), Positives = 276/338 (81%), Gaps = 6/338 (1%)
Query: 5 EILLLVLCWGFVVLATTSFGANVTYDHRAVVIGGKRRVLISGSIHYPRSTPEMWPDLIQK 64
+ +LL LC + ++ G+ VTYDH+A++I G+RR+LISGSIHYPRSTP+MWPDLIQK
Sbjct: 3 KTVLLFLC--LLTWVCSTIGS-VTYDHKAIIINGRRRILISGSIHYPRSTPQMWPDLIQK 59
Query: 65 SKDGGLDVIETYVFWNLHEPVRNQYNFEGRYDLVKFVKLVAEAGLYAHLRIGPYVCAEWN 124
+KDGGLD+IETYVFWN HEP +Y FE RYDLV+F+KLV +AGLY HLRIGPYVCAEWN
Sbjct: 60 AKDGGLDIIETYVFWNGHEPSPGKYYFEERYDLVRFIKLVQQAGLYVHLRIGPYVCAEWN 119
Query: 125 FGGFPLWLHFIPGIQFRTDNEPFKAEMQRFTAKIVDMMKQEKLYASQGGPIILSQIENEY 184
+GGFP+WL F+PGI FRTDN PFKA MQ+F KIVDMMK EKL+ +QGGPIILSQIENEY
Sbjct: 120 YGGFPIWLKFVPGIAFRTDNAPFKAAMQKFVYKIVDMMKWEKLFHTQGGPIILSQIENEY 179
Query: 185 GNIDSAYGAAGKSYIKWAAGMALSLDTGVPWVMCQQSDAPDPIINTCNGFYCDQFTPNSN 244
G ++ GA GKSY KWAA MA+ L TGVPWVMC+Q DAPDP+I+TCNGFYC+ F PN
Sbjct: 180 GPVEWEIGAPGKSYTKWAAQMAVGLKTGVPWVMCKQEDAPDPLIDTCNGFYCENFKPNQI 239
Query: 245 NKPKMWTENWSGWFLSFGGAVPYRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNFDRTSG 304
KPK+WTENWSGW+ +FGG PYRP ED+AF+VARF Q GG+ NYYMYHGGTNF RTS
Sbjct: 240 YKPKIWTENWSGWYTAFGGPTPYRPPEDVAFSVARFIQNGGSLVNYYMYHGGTNFGRTS- 298
Query: 305 GPFISTSYDYDAPLDEYGLIRQPKWG--HLKDLHKAIK 340
G F++TSYD+DAP+DEYGL+R+P G LK L++ +
Sbjct: 299 GLFVTTSYDFDAPIDEYGLLREPILGPVTLKGLNEGTR 336
Score = 170 bits (431), Expect = 2e-39, Method: Compositional matrix adjust.
Identities = 89/181 (49%), Positives = 118/181 (65%), Gaps = 12/181 (6%)
Query: 556 LQNYGAFYEKTGAGITGPVQLKGSGNGTNIDLSSQQWTYQTGLKGEELNFPS---GSSTQ 612
+ YG E I GPV LKG GT D+S +W+Y+ GL+GE LN S +S Q
Sbjct: 312 IDEYGLLREP----ILGPVTLKGLNEGTR-DMSKYKWSYKVGLRGEILNLYSVKGSNSVQ 366
Query: 613 WDSKSTLPKLQPLVWYKTTFDAPAGSEPVAIDFTGMGKGEAWVNGQSIGRYWPTYVSQNG 672
W K + K QPL WYKTTF+ PAG+EP+A+D + M KG+ WVNG+SIGRY+P Y+++
Sbjct: 367 W-MKGSFQK-QPLTWYKTTFNTPAGNEPLALDMSSMSKGQIWVNGRSIGRYFPGYIAR-- 422
Query: 673 GCTDSCNYRGAYSSNKCLKNCGKPSQSLYHVPRSWLKSSGNTLVLFEEIGGDPTKISFVT 732
G + C+Y G ++ KCL NCG PSQ YH+PR WL +GN L++ EEIGG+P IS V
Sbjct: 423 GKCNKCSYTGFFTEKKCLWNCGGPSQKWYHIPRDWLSPNGNLLIILEEIGGNPQGISLVK 482
Query: 733 K 733
+
Sbjct: 483 R 483
>gi|330804272|ref|XP_003290121.1| hypothetical protein DICPUDRAFT_48969 [Dictyostelium purpureum]
gi|325079786|gb|EGC33370.1| hypothetical protein DICPUDRAFT_48969 [Dictyostelium purpureum]
Length = 735
Score = 496 bits (1276), Expect = e-137, Method: Compositional matrix adjust.
Identities = 290/733 (39%), Positives = 412/733 (56%), Gaps = 61/733 (8%)
Query: 24 GANVTYDHRAVVIGGKRRVLISGSIHYPRSTPEMWPDLIQKSKDGGLDVIETYVFWNLHE 83
G N+TYDHR+++I G+R++L+SGS+HYPR++ W ++++ SK G+D+IETY+FWN+H+
Sbjct: 39 GLNITYDHRSLIINGERKLLVSGSVHYPRASVSKWNEILKSSKLAGVDIIETYIFWNVHQ 98
Query: 84 P-VRNQYNFEGRYDLVKFVKLVAEAGLYAHLRIGPYVCAEWNFGGFPLWLHFIPGIQFRT 142
P N++ E ++ F+ L E L+ +LRIGPYVCAEWN+GGFP+WL I GI FR
Sbjct: 99 PNTPNEFYLEDNANITLFLDLCKENELFVNLRIGPYVCAEWNYGGFPIWLKNIEGIVFRD 158
Query: 143 DNEPFKAEMQRFTAKIVDMMKQEKLYASQGGPIILSQIENEYGNIDSAYGAAGKSYIKWA 202
N+PF M + +VD K + +A GGPII++QIENEYG +++ YGA+G+ Y WA
Sbjct: 159 YNQPFMDAMSTWVTMVVD--KLQDYFAPNGGPIIIAQIENEYGWLENEYGASGREYALWA 216
Query: 203 AGMALSLDTGVPWVMCQQSDAPDPIINTCNGFYCDQFTPNSNN----KPKMWTENWSGWF 258
A SL+ G+PW+MC Q D D INTCNGFYC + N +P WTENW GWF
Sbjct: 217 INFAKSLNIGIPWIMCAQEDI-DSAINTCNGFYCHDWIDRHWNAFPDQPAFWTENWVGWF 275
Query: 259 LSFGGAVPYRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNFDRTSGGPFISTSYDYDAPL 318
++G AVP RPV+D+ F+ ARF GG+ NYYM+ GGTNF R+ GGP+I TSY+YDAPL
Sbjct: 276 ENWGQAVPKRPVQDMLFSSARFIAYGGSLFNYYMWFGGTNFGRSVGGPWIITSYEYDAPL 335
Query: 319 DEYGLIRQPKWGHLKDLHKAIKLCEAALVATDPTYPSLGPNLEATVYKTGSGLCSAFLAN 378
DE+G +PK+ H I E+ ++ DP P N+ + + G L FL N
Sbjct: 336 DEFGFPNEPKYSMSTQFHFVIHKYESIIMGMDPPTPVPLSNI-SEAHPYGEDL--VFLTN 392
Query: 379 IGTNSDVTVKFNGNSYLLPAWSVSILPDCKNVVFNTAKINSVTLVPSFSRQSLQVA-ADS 437
G D +++ G +Y L WSV I+ +VVF+T+ + + PS Q V A +
Sbjct: 393 FGLVIDY-IQWQGTNYTLQPWSVVIVY-SGSVVFDTSYVPDEYIKPSTRDQFKDVPNAIN 450
Query: 438 SDAI--GSGWSYINEPVGISKDDAFTKPGLLEQINTTADQSDYLWYSLSTNIKADEPLLE 495
D+I S W I D LEQIN T D +DYLWY +TNI +E
Sbjct: 451 YDSILSFSEWG----QSDIINDCIINNESPLEQINLTNDTTDYLWY--TTNITLNE---- 500
Query: 496 DGSKTVLHVQSLGHALHAFINGKLVGSGYGSSSNAKVTVDFPIALAPGKNTFD----LLS 551
T L ++++ H F+NG G+G+ + I L P + +L+
Sbjct: 501 ---TTTLTIENMYDFCHVFLNGAYQGNGWSPVAY--------ITLEPTNGNINYQLQILT 549
Query: 552 LTVGLQNYGAFYEKTGAGITGPVQLKGSGNGTNIDLSSQQWTYQTGLKGEEL---NFPSG 608
+T+GL+NY A E G+ G + L G N TN QW+ + G+ GE+L N S
Sbjct: 550 MTMGLENYAAHMESYSRGLLGSISL-GQTNITN-----NQWSMKPGILGEKLQIYNEYSS 603
Query: 609 SSTQWDSKSTLPKLQPLVWYKTT-----FDAPAGSEPVAIDFTGMGKGEAWVNGQSIGRY 663
S W + Q + WY+ + S ++ T M KG +VNG +IGRY
Sbjct: 604 SKVNWQPYNP-SATQSMTWYQFNISLDGLSSDPSSNAYVLNMTSMNKGFVYVNGFNIGRY 662
Query: 664 WPTYVSQNGGCTDSCNYRGAYSSNKCLKNCGKPSQSLYHVPRSWLKSSGN----TLVLFE 719
+ +Q CT +Y G Y+ + +C +PSQSLYH+P WL + T++LFE
Sbjct: 663 FLMEATQ-SNCTLKQDYIGIYTPSNNRIDCNEPSQSLYHIPLDWLFLQQDKQYATVILFE 721
Query: 720 EIGGDPTKISFVT 732
E+ GDPTKI ++
Sbjct: 722 EVNGDPTKIQLLS 734
>gi|217075793|gb|ACJ86256.1| unknown [Medicago truncatula]
Length = 268
Score = 488 bits (1256), Expect = e-135, Method: Compositional matrix adjust.
Identities = 229/267 (85%), Positives = 242/267 (90%), Gaps = 4/267 (1%)
Query: 7 LLLVLCWGFVVLATTSFGANVTYDHRAVVIGGKRRVLISGSIHYPRSTPEMWPDLIQKSK 66
++LVL W F NV YDHRA+VI GKRRVLISGSIHYPRSTP+MWPDLIQKSK
Sbjct: 6 IVLVLLW----FLPKMFCTNVDYDHRALVIDGKRRVLISGSIHYPRSTPQMWPDLIQKSK 61
Query: 67 DGGLDVIETYVFWNLHEPVRNQYNFEGRYDLVKFVKLVAEAGLYAHLRIGPYVCAEWNFG 126
DGGLDVIETYVFWNLHEPV+ QY+F+GR DLVKFVK VAEAGLY HLRIGPYVCAEWN+G
Sbjct: 62 DGGLDVIETYVFWNLHEPVKGQYDFDGRKDLVKFVKAVAEAGLYVHLRIGPYVCAEWNYG 121
Query: 127 GFPLWLHFIPGIQFRTDNEPFKAEMQRFTAKIVDMMKQEKLYASQGGPIILSQIENEYGN 186
GFPLWLHFIPGI+FRTDNEPFKAEM+RFTAKIVD+MKQEKLYASQGGPIILSQIENEYGN
Sbjct: 122 GFPLWLHFIPGIKFRTDNEPFKAEMKRFTAKIVDLMKQEKLYASQGGPIILSQIENEYGN 181
Query: 187 IDSAYGAAGKSYIKWAAGMALSLDTGVPWVMCQQSDAPDPIINTCNGFYCDQFTPNSNNK 246
IDS YG+AGKSYI WAA MA SLDTGVPWVMCQQ DAPDPIINTCNGFYCDQFTPNSN K
Sbjct: 182 IDSHYGSAGKSYINWAAKMATSLDTGVPWVMCQQGDAPDPIINTCNGFYCDQFTPNSNTK 241
Query: 247 PKMWTENWSGWFLSFGGAVPYRPVEDL 273
PKMWTENWSGWFLSFGGAVP+RPVE L
Sbjct: 242 PKMWTENWSGWFLSFGGAVPHRPVEIL 268
>gi|328872959|gb|EGG21326.1| glycoside hydrolase family 35 protein [Dictyostelium fasciculatum]
Length = 759
Score = 486 bits (1252), Expect = e-134, Method: Compositional matrix adjust.
Identities = 294/736 (39%), Positives = 402/736 (54%), Gaps = 82/736 (11%)
Query: 27 VTYDHRAVVIGGKRRVLISGSIHYPRSTPEMWPDLIQKSKDGGLDVIETYVFWNLHEPVR 86
V YD R++ I G+R+++ISGSIHYPRSTP MWP LI+KSKD G+++IETYVFWNLH+P
Sbjct: 46 VEYDQRSLKINGERKLMISGSIHYPRSTPSMWPSLIKKSKDAGINMIETYVFWNLHQPNN 105
Query: 87 NQ-YNFEGRYDLVKFVKLVAEAGLYAHLRIGPYVCAEWNFGGFPLWLHFIPGIQFRTDNE 145
+Q YNFEG ++ F+ L + GLY HLRIGPYVCAEWN+GG P WL IPGI FR N+
Sbjct: 106 SQEYNFEGNANITHFLDLCQQEGLYVHLRIGPYVCAEWNYGGIPSWLRNIPGIVFRDYNQ 165
Query: 146 PFKAEMQRFTAKIVDMMKQEKLYASQGGPIILSQIENEYGNIDSAYGAAGKSYIKWAAGM 205
P+ EM + IV+ +K +AS GGPIIL+Q+ENEYG +++ YG +GK Y +WA
Sbjct: 166 PWMTEMASWMTFIVNYLK--PYFASNGGPIILAQVENEYGWLENEYGDSGKLYAEWAISF 223
Query: 206 ALSLDTGVPWVMCQQSDAPDPIINTCNGFYCD-------QFTPNSNNKPKMWTENWSGWF 258
A SL+ G+PW MCQQ+D D INTCNGFYC Q P N+P +TENW+GW
Sbjct: 224 AKSLNIGIPWTMCQQNDIDDA-INTCNGFYCHDWIQYHFQVYP---NQPAFFTENWAGWI 279
Query: 259 LSFGGAVPYRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNFDRTSGGPFISTSYDYDAPL 318
+ VP+RP EDL ++VAR+F RGG+ NYYM+HGGT F R S F++ SYDYDA L
Sbjct: 280 QYYSEGVPHRPTEDLLYSVARWFSRGGSLMNYYMWHGGTTFARYS-STFLTNSYDYDAAL 338
Query: 319 DEYGLIRQPKWGHLKDLHKAIKLCEAALVATDPTYPSLGPN-------LEATVYKT---G 368
DEYG +PK+ L LH + L+++ + + +E Y T G
Sbjct: 339 DEYGYEAEPKYSALAQLHSVLSQYSYILLSSGEVARPVNISNITTCNTIEIIQYNTTING 398
Query: 369 SGLCSAFLANIGTNSD--VTVKFNGNSYLLPAWSVSILPDCKNVVFNTAKINSVTLVPSF 426
+ F+ N G +S V + +NG + + WSV IL + + V+ +T+ +
Sbjct: 399 TLETITFVTNFGVSSSAPVQLNWNGQTITVNPWSVLILYNNQTVI-DTSYVKQQYSAQKE 457
Query: 427 SRQSLQVAADSSDAIGSGWSYINEPVGISKDDAFTKPGL-LEQINTTADQSDYLWYSLST 485
QS +V + + S W+ EP+G+ L EQ++ T DQ+DYL
Sbjct: 458 FYQSKRV----KNVLVSSWT---EPIGVGNYSNVVTANLPSEQLDLTLDQTDYL------ 504
Query: 486 NIKADEPLLEDGSKTVLHVQSLGHALHAFINGKLVGSGYGSSSNAKVTVDFPIALAPGKN 545
AD+ ++ +I+G+ GS ++ + F I G +
Sbjct: 505 -CNADD------------------MIYIYIDGEYQSWSRGSPAHFVLDTKFGI----GTH 541
Query: 546 TFDLLSLTVGLQNYGAFYEKTGAGITGPVQLKGSGNGTNIDLSSQQWTYQTGLKGEELNF 605
+LSLT+GL +YG+ +E G+ G V L GT D+++ W+ + L GE
Sbjct: 542 KLSILSLTMGLISYGSHFESYKRGLNGTVTL-----GTQ-DITNNGWSMRPYLVGEMQGI 595
Query: 606 PSGSS-TQWDSKSTLPKLQPLVWYKTTFDAPA---GSEPVAIDFTGMGKGEAWVNGQSIG 661
S T W + L QPL WYK + + A+D GM KG VNG SIG
Sbjct: 596 QSNPHLTSWSINNELSINQPLTWYKLNLIIQSEIQDTSSFALDMIGMNKGFIIVNGNSIG 655
Query: 662 RYWPTYVSQNGGCTDSCNYRG-AYSSNKCLKNCGKPSQSLYHVPRSWLKSSGNTL---VL 717
RYW T GC CNY G Y C CG+PS+ YHVP +L N L ++
Sbjct: 656 RYWLTL---GWGCGSGCNYTGDGYQGYLCRTGCGEPSERYYHVPNDYLYLEPNQLNEIIV 712
Query: 718 FEEIGGDPTKISFVTK 733
FEE+ GDP I V +
Sbjct: 713 FEELSGDPNSIQLVQR 728
>gi|226532830|ref|NP_001140495.1| uncharacterized protein LOC100272556 precursor [Zea mays]
gi|194699714|gb|ACF83941.1| unknown [Zea mays]
gi|195659509|gb|ACG49222.1| hypothetical protein [Zea mays]
gi|414881558|tpg|DAA58689.1| TPA: hypothetical protein ZEAMMB73_223728 [Zea mays]
Length = 346
Score = 485 bits (1249), Expect = e-134, Method: Compositional matrix adjust.
Identities = 216/295 (73%), Positives = 251/295 (85%)
Query: 28 TYDHRAVVIGGKRRVLISGSIHYPRSTPEMWPDLIQKSKDGGLDVIETYVFWNLHEPVRN 87
TYD +AVV+ G+RR+L+SGSIHYPRS PEMWPDLIQK+KDGGLDV++TYVFWN HEP R
Sbjct: 30 TYDRKAVVVNGQRRILMSGSIHYPRSVPEMWPDLIQKAKDGGLDVVQTYVFWNGHEPSRR 89
Query: 88 QYNFEGRYDLVKFVKLVAEAGLYAHLRIGPYVCAEWNFGGFPLWLHFIPGIQFRTDNEPF 147
QY FEGRYDLV F+KLV +AGLY HLRIGPYVCAEWNFGGFP+WL ++PGI FRTDNEPF
Sbjct: 90 QYYFEGRYDLVHFIKLVKQAGLYVHLRIGPYVCAEWNFGGFPVWLKYVPGISFRTDNEPF 149
Query: 148 KAEMQRFTAKIVDMMKQEKLYASQGGPIILSQIENEYGNIDSAYGAAGKSYIKWAAGMAL 207
KAEMQ FT KIVDMMK E L+ QGGPIILSQIENE+G ++ G K+Y WAA MA+
Sbjct: 150 KAEMQNFTTKIVDMMKSEGLFEWQGGPIILSQIENEFGPLEWDQGEPAKAYASWAANMAV 209
Query: 208 SLDTGVPWVMCQQSDAPDPIINTCNGFYCDQFTPNSNNKPKMWTENWSGWFLSFGGAVPY 267
+L+T VPWVMC++ DAPDPIINTCNGFYCD F+PN +KP MWTE W+ W+ FG VP+
Sbjct: 210 ALNTSVPWVMCKEDDAPDPIINTCNGFYCDWFSPNKPHKPTMWTEAWTSWYTGFGIPVPH 269
Query: 268 RPVEDLAFAVARFFQRGGTFQNYYMYHGGTNFDRTSGGPFISTSYDYDAPLDEYG 322
RPVEDLA+ VA+F Q+GG+F NYYMYHGGTNF RT+GGPFI+TSYDYDAP+DEYG
Sbjct: 270 RPVEDLAYGVAKFIQKGGSFVNYYMYHGGTNFGRTAGGPFIATSYDYDAPIDEYG 324
>gi|238009746|gb|ACR35908.1| unknown [Zea mays]
Length = 346
Score = 483 bits (1243), Expect = e-133, Method: Compositional matrix adjust.
Identities = 215/295 (72%), Positives = 250/295 (84%)
Query: 28 TYDHRAVVIGGKRRVLISGSIHYPRSTPEMWPDLIQKSKDGGLDVIETYVFWNLHEPVRN 87
TYD +AVV+ G+RR+L+SGSIHYPRS PEMWPDLIQK+KDGGLDV++TYVFWN HEP R
Sbjct: 30 TYDRKAVVVNGQRRILMSGSIHYPRSVPEMWPDLIQKAKDGGLDVVQTYVFWNGHEPSRR 89
Query: 88 QYNFEGRYDLVKFVKLVAEAGLYAHLRIGPYVCAEWNFGGFPLWLHFIPGIQFRTDNEPF 147
QY FEGRYDLV F+KLV +AGLY HLRIGPYVCAEWNFGGFP+WL ++PGI RTDNEPF
Sbjct: 90 QYYFEGRYDLVHFIKLVKQAGLYVHLRIGPYVCAEWNFGGFPVWLKYVPGISLRTDNEPF 149
Query: 148 KAEMQRFTAKIVDMMKQEKLYASQGGPIILSQIENEYGNIDSAYGAAGKSYIKWAAGMAL 207
KAEMQ FT KIVDMMK E L+ QGGPIILSQIENE+G ++ G K+Y WAA MA+
Sbjct: 150 KAEMQNFTTKIVDMMKSEGLFEWQGGPIILSQIENEFGPLEWDQGEPAKAYASWAANMAV 209
Query: 208 SLDTGVPWVMCQQSDAPDPIINTCNGFYCDQFTPNSNNKPKMWTENWSGWFLSFGGAVPY 267
+L+T VPWVMC++ DAPDPIINTCNGFYCD F+PN +KP MWTE W+ W+ FG VP+
Sbjct: 210 ALNTSVPWVMCKEDDAPDPIINTCNGFYCDWFSPNKPHKPTMWTEAWTSWYTGFGIPVPH 269
Query: 268 RPVEDLAFAVARFFQRGGTFQNYYMYHGGTNFDRTSGGPFISTSYDYDAPLDEYG 322
RPVEDLA+ VA+F Q+GG+F NYYMYHGGTNF RT+GGPFI+TSYDYDAP+DEYG
Sbjct: 270 RPVEDLAYGVAKFIQKGGSFVNYYMYHGGTNFGRTAGGPFIATSYDYDAPIDEYG 324
>gi|218201568|gb|EEC83995.1| hypothetical protein OsI_30162 [Oryza sativa Indica Group]
Length = 1078
Score = 477 bits (1227), Expect = e-131, Method: Compositional matrix adjust.
Identities = 280/717 (39%), Positives = 387/717 (53%), Gaps = 94/717 (13%)
Query: 151 MQRFTAKIVDMMKQEKLYASQGGPIILSQIENEYGNIDSAYGAAGKSYIKWAAGMALSLD 210
M++F IV+ +K+ KL+ASQGGPIIL+QIENEY +++ A+ AG YI WAA MA++ +
Sbjct: 426 MKQFVTLIVNKLKEAKLFASQGGPIILAQIENEYQHLEVAFKEAGTKYINWAAKMAIATN 485
Query: 211 TGVPWVMCQQSDAPDPIINTCNGFYCDQF--TPNSNNKPKMWTENWSGWFLSFGGAVPYR 268
TGVPW+MC+Q+ AP +I TCNG +C P KP +WTENW+ + FG R
Sbjct: 486 TGVPWIMCKQTKAPGEVIPTCNGRHCGDTWPGPADKKKPLLWTENWTAQYRVFGDPPSQR 545
Query: 269 PVEDLAFAVARFFQRGGTFQNYYMYHGGTNFDRTSGGPFISTSYDYDAPLDEYGLIRQPK 328
ED+AF+VARFF GGT NYYMYHGGTNF R +G F+ Y +APLDE+GL ++PK
Sbjct: 546 SAEDIAFSVARFFSVGGTMANYYMYHGGTNFGR-NGAAFVMPRYYDEAPLDEFGLYKEPK 604
Query: 329 WGHLKDLHKAIKLCEAALVATDPTYPSLGPNLEATVYKTGS-GLCSAFLANIGTNSDVTV 387
WGHL+DLH A++ C+ AL+ +P+ LG EA V++ +C AFL+N T D TV
Sbjct: 605 WGHLRDLHHALRHCKKALLWGNPSVQPLGKLYEARVFEMKEKNVCVAFLSNHNTKEDGTV 664
Query: 388 KFNGNSYLLPAWSVSILPDCKNVVFNTAKINSVTLVPSFSRQSLQVAADSSDAIGSGWSY 447
F G Y + S+SIL DCK VVF+T +NS +F AD + Y
Sbjct: 665 TFRGQKYFVARRSISILADCKTVVFSTQHVNSQHNQRTFH------FADQTVQDNVWEMY 718
Query: 448 INEPVGISKDDAFTKPGLLEQINTTADQSDYLWYSLSTNIKADEPLLEDGSKTVLHVQSL 507
E + + LEQ N T D++DYLWY+ S ++ D+ K VL
Sbjct: 719 SEEKIPRYSKTSIRTQRPLEQYNQTKDKTDYLWYTTSFRLETDDLPYRKEVKPVLE---- 774
Query: 508 GHALHAFINGKLVGSGYGSSSNAKVTVDFPIALAPGKNTFDLLSLTVGLQNYGAFYEKTG 567
G+G G S T++ + L G N +LS T+GL + G++ E
Sbjct: 775 -------------GAGTGRRSTRSFTMEKAMDLKVGVNHVAILSSTLGLMDSGSYLEHRM 821
Query: 568 AGITGPVQLKGSGNGTNIDLSSQQWTYQTGLKGEELNFPSGSSTQWDSKSTLPKLQPLVW 627
AG+ V ++G GT +DL++ W + G QPL W
Sbjct: 822 AGVY-TVTIRGLNTGT-LDLTTNGWGHVPGKDN----------------------QPLTW 857
Query: 628 YKTTFDAPAGSEPVAIDFTGMGKGEAWVNGQSIGRYWPTYVSQNGGCTDSCNYRGAYSSN 687
Y+ FD P+G++PV ID T MGKG +VNG+ +GRYW +Y
Sbjct: 858 YRRRFDPPSGTDPVVIDLTPMGKGFLFVNGEGLGRYWVSYHHA----------------- 900
Query: 688 KCLKNCGKPSQSLYHVPRSWLKSSGNTLVLFEEIGGDPTKISFVTKQLGSSLCSHVTDSH 747
GKPSQ LYHVPRS L+ GNTL+ FEE GG P I +T + ++C+ +T+ +
Sbjct: 901 -----LGKPSQYLYHVPRSLLRPKGNTLMFFEEEGGKPDAIMILTVKR-DNICTFMTEKN 954
Query: 748 PLPVDMWGSDSKIQR--------------KPGPVLSLECPNPNQVISSIKFASFGTPLGT 793
P V W +SK + KP VLS CP + I S+ FAS+G PLG
Sbjct: 955 PAHV-RWSWESKDSQPKAVAGAGAGAGGLKPTAVLS--CPT-KKTIQSVVFASYGNPLGI 1010
Query: 794 CGSFSRGRCSSARSLSVVRQACVGSKSCSIGVSVNTFGDP--CKGVMKSLAVEASCT 848
CG+++ G C + R+ VV +AC+G K+CS+ VS +G C G +LAV+A C+
Sbjct: 1011 CGNYTVGSCHAPRTKEVVEKACIGRKTCSLVVSSEVYGGDVHCPGTTGTLAVQAKCS 1067
Score = 382 bits (980), Expect = e-103, Method: Compositional matrix adjust.
Identities = 202/453 (44%), Positives = 261/453 (57%), Gaps = 77/453 (16%)
Query: 21 TSFGANVTYDHRAVVIGGKRRVLISGSIHYPRSTPEMWPDLIQKSKDGGLDVIETYVFWN 80
T G +TYD R+++I G R + SGSIHYPRS P+ WPDLI K+K+GGL+VIE+YVFWN
Sbjct: 27 TKNGTVITYDRRSLIIDGHREIFFSGSIHYPRSPPDTWPDLISKAKEGGLNVIESYVFWN 86
Query: 81 LHEPVRNQYNFEGRYDLVKFVKLVAEAGLYAHLRIGPYVCAEWNFGGFPLWLHF----IP 136
HEP + YNFEGRYDL+KF KL+ E +YA +RIGP+V AEWN G H IP
Sbjct: 87 GHEPEQGVYNFEGRYDLIKFFKLIQEKEMYAIVRIGPFVQAEWNHG---FVCHIGSGEIP 143
Query: 137 GIQFRTDNEPFKAEMQRFTAKIVDMMKQEKLYASQGGPIILSQIENEYGNIDSAYGAAGK 196
I FRT+NEPFK M++F IV+ +K+ KL+ASQGGPIIL+QIENEY +++ A+ AG
Sbjct: 144 DIIFRTNNEPFKKYMKQFVTLIVNKLKEAKLFASQGGPIILAQIENEYQHLEVAFKEAGT 203
Query: 197 SYIKWAAGMALSLDTGVPWVMCQQSDAPDPIINTCNGFYCDQF--TPNSNNKPKMWTENW 254
YI WAA MA++ +TGVPW+MC+Q+ AP +I TCNG +C P KP +WTENW
Sbjct: 204 KYINWAAKMAIATNTGVPWIMCKQTKAPGEVIPTCNGRHCGDTWPGPADKKKPLLWTENW 263
Query: 255 SGWFLSFGGAVPYRPVEDLAFAVARFFQRGGTFQNYYM---------------------- 292
+ + FG R ED+AF+VARFF GGT NYYM
Sbjct: 264 TAQYRVFGDPPSQRSAEDIAFSVARFFSVGGTMANYYMVVLNSNSNLFLTKKRDEISDRT 323
Query: 293 ------------YHGGTNFDRTSGGPFISTSYDYDAPLDEYGLIRQPKWGHLKDLHKAIK 340
YHGGTNF R +G F+ Y +APLDE+GL ++PKWGHL+DLH A++
Sbjct: 324 DTGGFTCVNNQQYHGGTNFGR-NGAAFVMPRYYDEAPLDEFGLYKEPKWGHLRDLHHALR 382
Query: 341 LCEAALVATDPTYPSLGPNLEATVYKTGSGLCSAFLANIGTNSDVTVKFNGNSYLLPAWS 400
C+ AL+ +P+ LG G Y + S
Sbjct: 383 HCKKALLWGNPSVQPLGKLTR-----------------------------GQKYFVARRS 413
Query: 401 VSILPDCKNVV----FNTAKINSVTLVPSFSRQ 429
+SIL DCK V F T +N + F+ Q
Sbjct: 414 ISILADCKTVKYMKQFVTLIVNKLKEAKLFASQ 446
>gi|66808929|ref|XP_638187.1| glycoside hydrolase family 35 protein [Dictyostelium discoideum
AX4]
gi|74853739|sp|Q54MV6.1|BGAL2_DICDI RecName: Full=Probable beta-galactosidase 2; Short=Lactase 2;
Flags: Precursor
gi|60466604|gb|EAL64656.1| glycoside hydrolase family 35 protein [Dictyostelium discoideum
AX4]
Length = 761
Score = 475 bits (1222), Expect = e-131, Method: Compositional matrix adjust.
Identities = 278/747 (37%), Positives = 411/747 (55%), Gaps = 65/747 (8%)
Query: 27 VTYDHRAVVIGGKRRVLISGSIHYPRSTPEMWPDLIQKSKDGGLDVIETYVFWNLHEPVR 86
VTYD R+++I G+R++L SGSIHYPR++ EMWP ++++SKD G+D+I+TY+FWN+H+P
Sbjct: 40 VTYDGRSLIINGERKLLFSGSIHYPRTSEEMWPIILKQSKDAGIDIIDTYIFWNIHQPNS 99
Query: 87 -NQYNFEGRYDLVKFVKLVAEAGLYAHLRIGPYVCAEWNFGGFPLWLHFIPGIQFRTDNE 145
++Y F+G ++ KF+ L E LY +LRIGPYVCAEW +GGFP+WL IP I +R N+
Sbjct: 100 PSEYYFDGNANITKFLDLCKEFDLYVNLRIGPYVCAEWTYGGFPIWLKEIPNIVYRDYNQ 159
Query: 146 PFKAEMQRFTAKIVDMMKQEKLYASQGGPIILSQIENEYGNIDSAYGAAGKSYIKWAAGM 205
+ EM + +V + + +A GGPIIL+Q+ENEYG ++ YG G Y KW+
Sbjct: 160 QWMNEMSIWMEFVVKYL--DNYFAPNGGPIILAQVENEYGWLEQEYGINGTEYAKWSIDF 217
Query: 206 ALSLDTGVPWVMCQQSDAPDPIINTCNGFYCDQFTPNS----NNKPKMWTENWSGWFLSF 261
A SL+ G+PW+MCQQ+D + INTCNG+YC + + N+P WTENW GWF ++
Sbjct: 218 AKSLNIGIPWIMCQQNDI-ESAINTCNGYYCHDWISSHWEQFPNQPSFWTENWIGWFENW 276
Query: 262 GGAVPYRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNFDRTSGGPFISTSYDYDAPLDEY 321
G A P RPV+D+ ++ ARF GG+ NYYM+ GGTNF RTSGGP+I TSYDYDAPLDE+
Sbjct: 277 GQAKPKRPVQDILYSNARFIAYGGSLINYYMWFGGTNFGRTSGGPWIITSYDYDAPLDEF 336
Query: 322 GLIRQPKWGHLKDLHKAIKLCEAALVATDPTYPSLGPNLEATVYKTGSGLCSAFLANIGT 381
G +PK+ H+ + E+ L+ P P L + G+ +F+ N GT
Sbjct: 337 GQPNEPKFSLSSKFHQVLHAIESDLLNNQP--PKSPTFLSQFIEVHQYGINLSFITNYGT 394
Query: 382 NSD-VTVKFNGNSYLLPAWSVSILPDCK----------NVVFNTAKINSVTLVPSFSRQS 430
++ +++ +Y + WSV I+ + + N +FN IN+ + QS
Sbjct: 395 STTPKIIQWMNQTYTIQPWSVLIIYNNEILFDTSFIPPNTLFNNNTINNFKPINQNIIQS 454
Query: 431 LQVAADSSDAIGSGWSYINEPVGISKDDAFTKPGLLEQINTTADQSDYLWYSLSTNIKAD 490
+ +D + + SG G ++ +EQ+ T D SDY WY STN+
Sbjct: 455 IFQISDFN--LNSG-----GGGGDGDGNSVNSVSPIEQLLITKDTSDYCWY--STNVTTT 505
Query: 491 EPLLEDGSKTVLHVQSLGHALHAFINGKLVGSGYGSSSNAKVTVDFPIALAPGKN--TFD 548
+ L + +H FI+ + GS + S + L P N TF
Sbjct: 506 SLSYNEKGNIFLTITEFYDYVHIFIDNEYQGSAFSPSLCQ-------LQLNPINNSTTFQ 558
Query: 549 L--LSLTVGLQNYGAFYEKTGAGITGPVQLKGSGNGTNIDLSSQQWTYQTGLKGEELN-F 605
L LS+T+GL+NY + E GI G + L GS N TN QW ++GL GE + F
Sbjct: 559 LQILSMTIGLENYASHMENYTRGILGSI-LIGSQNLTN-----NQWLMKSGLIGENIKIF 612
Query: 606 PSGSSTQWDSKSTLPKL----QPLVWYKTTFD-----APAGSEPVAIDFTGMGKGEAWVN 656
+ ++ W + + +PL WYK S A+D + M KG WVN
Sbjct: 613 NNDNTINWQTSPSSSSSSLIQKPLTWYKLNISLVGLPIDISSTVYALDMSSMNKGMIWVN 672
Query: 657 GQSIGRYWPTYVSQ---NGGCTDSCNYRGAYSSNKCLKNCGKPSQSLYHVPRSWLKSSG- 712
G SIGRYW +Q N ++ +Y G Y + +C KPSQS+Y VP WL ++
Sbjct: 673 GYSIGRYWLIEATQSICNQSAIENYSYIGEYDPSNYRIDCNKPSQSIYSVPIDWLFNNNY 732
Query: 713 ----NTLVLFEEIGGDPTKISFVTKQL 735
T+++ EE+ G+P +I ++ ++
Sbjct: 733 NNQYATIIIIEELNGNPNEIQLLSNKI 759
>gi|328873276|gb|EGG21643.1| hypothetical protein DFA_01529 [Dictyostelium fasciculatum]
Length = 827
Score = 474 bits (1221), Expect = e-131, Method: Compositional matrix adjust.
Identities = 285/752 (37%), Positives = 411/752 (54%), Gaps = 61/752 (8%)
Query: 6 ILLLVLCWGFVVLATTSFGANVTYDHRAVVIGGKRRVLISGSIHYPRSTPEMWPDLIQKS 65
I L++L + VL+ V+YD+RA++I G+R++L S SIHYPRST MWPD+++++
Sbjct: 14 IFLILLIFPNYVLSDK---LTVSYDNRAIIINGERKLLYSASIHYPRSTRTMWPDILKRT 70
Query: 66 KDGGLDVIETYVFWNLHEPVRNQYNFEGRYDLVKFVKLVAEAGLYAHLRIGPYVCAEWNF 125
K G++ IETY+FWNLH+P + Y+FEG D+ F+ L E G + +R GPYVCAEWN
Sbjct: 71 KAAGINTIETYIFWNLHQPTPDTYDFEGSSDVKHFLDLCKEEGFHVIVRFGPYVCAEWNN 130
Query: 126 GGFPLWLHFIPGIQFRTDNEPFKAEMQRFTAKIVDMMKQEKLYASQGGPIILSQIENEYG 185
GG P WL +PGI +RT NEPF EM+++ IV + YA GGPII++QIENEYG
Sbjct: 131 GGLPSWLKAVPGIVYRTHNEPFMREMKKWMDYIVHYLSD--YYAPNGGPIIMAQIENEYG 188
Query: 186 NIDSAYGA-AGKSYIKWAAGMALSLDTGVPWVMCQQSDAPDPIINTCNGFYCDQFTPNSN 244
++ Y G Y+ WA +A S +TG+PW+MCQQ+ D +INTCNGFYC +
Sbjct: 189 WLEYEYREQGGPEYVDWAVKLAKSYNTGIPWIMCQQNTRSD-VINTCNGFYCHDWLQYHQ 247
Query: 245 ----NKPKMWTENWSGWFLSFGGAVPYRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNFD 300
++P +TE W+GW F P RP D+ ++ ARF+ RGG NYYM+HGGT F
Sbjct: 248 RTFPDQPAFFTELWTGWPQYFEEGFPTRPTVDVLYSAARFYSRGGGMVNYYMWHGGTTFG 307
Query: 301 RTSGGPFISTSYDYDAPLDEYGLIRQPKWGHLKDLHKAIKLCEAALVATDPTYPS--LGP 358
R + PF++TSYDYDAPLDEYG ++PK+ L LH ++ ++++ DP P + P
Sbjct: 308 RFT-SPFLTTSYDYDAPLDEYGFPQEPKYSMLTKLHVTLEKY-SSVILHDPNVPPPYVFP 365
Query: 359 N--LEATVYKTGSGLCSAFLANIGTNSDVTVKFNGNSYLLPAWSVSILPDCKNVVFNTAK 416
+ +E YK + FL N V NG + + WSV I + +VF+T +
Sbjct: 366 DNTVEMIEYKKDAE-SVVFLVNWDDTFAKQVDMNGKNVKINQWSVQIYYN-NELVFDTFE 423
Query: 417 INSVTLVPS-----FSRQSLQVAADSSDAIG-----SGWSYINEPVGISKDDAFTKPGLL 466
I + P+ ++ SL A ++ G S W NEP +A ++
Sbjct: 424 IPANLTRPNPPFKPIAKTSLDATAAATSRTGLVNLVSSW---NEPFSFLTYNASSQTP-T 479
Query: 467 EQINTTADQSDYLWYSLSTNI-KADEPLLEDGSKTVLHVQSLGHALHAFINGKLVGSGYG 525
Q+ T D SDY+WY ++ K DE +L++ + F++G+ + G
Sbjct: 480 AQLKLTGDNSDYIWYETEIDLTKTDE---------ILYLYKSYDFSYVFVDGQFLYWHRG 530
Query: 526 SSSNAKVTVDFPIALAPGKNTFDLLSLTVGLQNYGAFYEKTGAGITGPVQLKGSGNGTNI 585
S A FP+ GK+T +L +G+ +YGA E+ G+TG + L GS N
Sbjct: 531 SPIQAYFNGKFPV----GKHTLQILCAAMGVPSYGAHIEQHERGLTGDIFL-GSKN---- 581
Query: 586 DLSSQQWTYQTGLKGEELNFPSGSST-QWDSKSTLPKLQPLVWYKTTFDAPAGSE--PVA 642
++ W + L GE L + ST +W S + WYK P+ + A
Sbjct: 582 -ITDNGWKMRPFLSGELLGLHASPSTVKWSPVSKGTAGSGVTWYKFNVKTPSFEDGPAFA 640
Query: 643 IDFTGMGKGEAWVNGQSIGRYWPTYVSQNGGCTDSCNYRGAYSSNKCLKNCGKPSQSLYH 702
+D M KG +VNG SIGRYW G C + CN G Y + C +NCG+ SQ YH
Sbjct: 641 LDLKSMWKGLVFVNGNSIGRYW----VAKGWCEEKCNQTGLYDNYGCRENCGESSQRYYH 696
Query: 703 VPRSWLK-SSGNTLVLFEEIGGDPTKISFVTK 733
VP+ +LK SS N +++FEE+ GDP I V +
Sbjct: 697 VPKDFLKESSDNEVIIFEELQGDPYSIELVQR 728
>gi|414881559|tpg|DAA58690.1| TPA: hypothetical protein ZEAMMB73_223728 [Zea mays]
Length = 342
Score = 471 bits (1212), Expect = e-130, Method: Compositional matrix adjust.
Identities = 212/295 (71%), Positives = 247/295 (83%), Gaps = 4/295 (1%)
Query: 28 TYDHRAVVIGGKRRVLISGSIHYPRSTPEMWPDLIQKSKDGGLDVIETYVFWNLHEPVRN 87
TYD +AVV+ G+RR+L+SGSIHYPRS PEMWPDLIQK+KDGGLDV++TYVFWN HEP R
Sbjct: 30 TYDRKAVVVNGQRRILMSGSIHYPRSVPEMWPDLIQKAKDGGLDVVQTYVFWNGHEPSRR 89
Query: 88 QYNFEGRYDLVKFVKLVAEAGLYAHLRIGPYVCAEWNFGGFPLWLHFIPGIQFRTDNEPF 147
QY FEGRYDLV F+KLV +AGLY HLRIGPYVCAEWNFGGFP+WL ++PGI FRTDNEPF
Sbjct: 90 QYYFEGRYDLVHFIKLVKQAGLYVHLRIGPYVCAEWNFGGFPVWLKYVPGISFRTDNEPF 149
Query: 148 KAEMQRFTAKIVDMMKQEKLYASQGGPIILSQIENEYGNIDSAYGAAGKSYIKWAAGMAL 207
K FT KIVDMMK E L+ QGGPIILSQIENE+G ++ G K+Y WAA MA+
Sbjct: 150 K----NFTTKIVDMMKSEGLFEWQGGPIILSQIENEFGPLEWDQGEPAKAYASWAANMAV 205
Query: 208 SLDTGVPWVMCQQSDAPDPIINTCNGFYCDQFTPNSNNKPKMWTENWSGWFLSFGGAVPY 267
+L+T VPWVMC++ DAPDPIINTCNGFYCD F+PN +KP MWTE W+ W+ FG VP+
Sbjct: 206 ALNTSVPWVMCKEDDAPDPIINTCNGFYCDWFSPNKPHKPTMWTEAWTSWYTGFGIPVPH 265
Query: 268 RPVEDLAFAVARFFQRGGTFQNYYMYHGGTNFDRTSGGPFISTSYDYDAPLDEYG 322
RPVEDLA+ VA+F Q+GG+F NYYMYHGGTNF RT+GGPFI+TSYDYDAP+DEYG
Sbjct: 266 RPVEDLAYGVAKFIQKGGSFVNYYMYHGGTNFGRTAGGPFIATSYDYDAPIDEYG 320
>gi|217070894|gb|ACJ83807.1| unknown [Medicago truncatula]
Length = 283
Score = 459 bits (1180), Expect = e-126, Method: Compositional matrix adjust.
Identities = 227/280 (81%), Positives = 250/280 (89%), Gaps = 2/280 (0%)
Query: 205 MALSLDTGVPWVMCQQSDAPDPIINTCNGFYCDQFTPNSNNKPKMWTENWSGWFLSFGGA 264
MA SLDTGVPW+MCQQ++APDPIINTCN FYCDQFTPNS+NKPKMWTENWSGWFL+FGGA
Sbjct: 1 MATSLDTGVPWIMCQQANAPDPIINTCNSFYCDQFTPNSDNKPKMWTENWSGWFLAFGGA 60
Query: 265 VPYRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNFDRTSGGPFISTSYDYDAPLDEYGLI 324
VPYRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNF RT+GGPFISTSYDYDAP+DEYG I
Sbjct: 61 VPYRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNFGRTTGGPFISTSYDYDAPIDEYGDI 120
Query: 325 RQPKWGHLKDLHKAIKLCEAALVATDPTYPSLGPNLEATVYKTGSGLCSAFLANIGTNSD 384
RQPKWGHLKDLHKAIKLCE AL+A+DPT S GPNLE VYKTG+ +CSAFLANIG SD
Sbjct: 121 RQPKWGHLKDLHKAIKLCEEALIASDPTITSPGPNLETAVYKTGA-VCSAFLANIGM-SD 178
Query: 385 VTVKFNGNSYLLPAWSVSILPDCKNVVFNTAKINSVTLVPSFSRQSLQVAADSSDAIGSG 444
TV FNGNSY LP WSVSILPDCKNVV NTAK+N+ +++ SF+ +SL+ DS D+ SG
Sbjct: 179 ATVTFNGNSYHLPGWSVSILPDCKNVVLNTAKVNTASMISSFATESLKEKVDSLDSSSSG 238
Query: 445 WSYINEPVGISKDDAFTKPGLLEQINTTADQSDYLWYSLS 484
WS+I+EPVGIS DAFTK GLLEQINTTAD+SDYLWYSLS
Sbjct: 239 WSWISEPVGISTPDAFTKSGLLEQINTTADRSDYLWYSLS 278
>gi|3850659|emb|CAA10064.1| beta galactosidase [Carica papaya]
Length = 347
Score = 457 bits (1175), Expect = e-125, Method: Compositional matrix adjust.
Identities = 223/356 (62%), Positives = 263/356 (73%), Gaps = 10/356 (2%)
Query: 126 GGFPLWLHFIPGIQFRTDNEPFKAEMQRFTAKIVDMMKQEKLYASQGGPIILSQIENEYG 185
GGFP+WL ++PGI FRTDNEPFKA MQ+FT KIV MMK EKL+ +QGGPIILSQIENE+G
Sbjct: 1 GGFPVWLKYVPGIAFRTDNEPFKAAMQKFTEKIVSMMKAEKLFQTQGGPIILSQIENEFG 60
Query: 186 NIDSAYGAAGKSYIKWAAGMALSLDTGVPWVMCQQSDAPDPIINTCNGFYCDQFTPNSNN 245
++ GA GK+Y KWAA MA+ LDTGVPW+MC+Q DAPDP+I+TCNGFYC+ F PN +
Sbjct: 61 PVEWEIGAPGKAYTKWAAQMAVGLDTGVPWIMCKQEDAPDPVIDTCNGFYCENFKPNKDY 120
Query: 246 KPKMWTENWSGWFLSFGGAVPYRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNFDRTSGG 305
KPKMWTE W+GW+ FGGAVP RP ED+AF+VARF Q GG+F NYYMYHGGTNF RT+GG
Sbjct: 121 KPKMWTEVWTGWYTEFGGAVPTRPAEDVAFSVARFIQGGGSFLNYYMYHGGTNFGRTAGG 180
Query: 306 PFISTSYDYDAPLDEYGLIRQPKWGHLKDLHKAIKLCEAALVATDPTYPSLGPNLEATVY 365
PF++TSYDYDAPLDEYGL R+PKWGHL+DLHKAIK CE+ALV+ DP+ LG N EA V+
Sbjct: 181 PFMATSYDYDAPLDEYGLPREPKWGHLRDLHKAIKSCESALVSVDPSVTKLGSNQEAHVF 240
Query: 366 KTGSGLCSAFLANIGTNSDVTVKFNGNSYLLPAWSVSILPDCKNVVFNTAKINSVTLVPS 425
K+ S C+AFLAN V V F G Y LP WS+SILPDCK V+NTAK+ S
Sbjct: 241 KSESD-CAAFLANYDAKYSVKVSFGGGQYDLPPWSISILPDCKTEVYNTAKVGS------ 293
Query: 426 FSRQSLQVAADSSDAIGSGWSYINEPVGISKDDAFTKPGLLEQINTTADQSDYLWY 481
QS QV + S+I E + D T GL EQIN T D +DYLWY
Sbjct: 294 ---QSSQVQMTPVHSGFPWQSFIEETTSSDETDTTTLDGLYEQINITRDTTDYLWY 346
>gi|414881560|tpg|DAA58691.1| TPA: hypothetical protein ZEAMMB73_223728 [Zea mays]
Length = 655
Score = 447 bits (1150), Expect = e-122, Method: Compositional matrix adjust.
Identities = 248/544 (45%), Positives = 322/544 (59%), Gaps = 22/544 (4%)
Query: 305 GPFISTSYDYDAPLDEYGLIRQPKWGHLKDLHKAIKLCEAALVATDPTYPSLGPNLEATV 364
G + Y D L GL+R+PKWGHLK+LHKAIKLCE ALVA DP SLG +A+V
Sbjct: 132 GADVQMPYRLDHILVADGLLREPKWGHLKELHKAIKLCEPALVAGDPIVTSLGNAQQASV 191
Query: 365 YKTGSGLCSAFLANIGTNSDVTVKFNGNSYLLPAWSVSILPDCKNVVFNTAKINSVTLVP 424
+++ + C AFL N S V FNG Y LP WS+SILPDCK V+NTA + S
Sbjct: 192 FRSSTDACVAFLENKDKVSYARVSFNGMHYDLPPWSISILPDCKTTVYNTASVGS----- 246
Query: 425 SFSRQSLQVAADSSDAIGSGWSYINEPVGISKDDAFTKPGLLEQINTTADQSDYLWYSLS 484
S+ ++ A G W NE + D++F GLLEQIN T D +DYLWY+
Sbjct: 247 QISQMKMEWAG------GFTWQSYNEDINSLGDESFATVGLLEQINVTRDNTDYLWYTTY 300
Query: 485 TNIKADEPLLEDGSKTVLHVQSLGHALHAFINGKLVGSGYGSSSNAKVTVDFPIALAPGK 544
+I DE L +G +L V S GHALH F+NG+L G+ YGS + K+T + L G
Sbjct: 301 VDIAQDEQFLSNGKNPMLTVMSAGHALHIFVNGQLTGTVYGSVEDPKLTYSGNVKLWSGS 360
Query: 545 NTFDLLSLTVGLQNYGAFYEKTGAGITGPVQLKGSGNGTNIDLSSQQWTYQTGLKGEELN 604
NT LS+ VGL N G +E AGI GPV L G G DL+ Q+WTY+ GLKGE L+
Sbjct: 361 NTISCLSIAVGLPNVGEHFETWNAGILGPVTLDGLNEGRR-DLTWQKWTYKVGLKGEALS 419
Query: 605 FPSGSSTQWDSKSTLPKLQPLVWYKTTFDAPAGSEPVAIDFTGMGKGEAWVNGQSIGRYW 664
S S + + QPL WYK F+AP G EP+A+D + MGKG+ W+NGQ IGRYW
Sbjct: 420 LHSLSGSSSVEWGEPVQKQPLSWYKAFFNAPDGDEPLALDMSSMGKGQIWINGQGIGRYW 479
Query: 665 PTYVSQNGGCTDSCNYRGAYSSNKCLKNCGKPSQSLYHVPRSWLKSSGNTLVLFEEIGGD 724
P Y + G C+YRG Y KC NCG SQ YHVPRSWL +GN LV+FEE GGD
Sbjct: 480 PGY--KASGTCGICDYRGEYDEKKCQTNCGDSSQRWYHVPRSWLNPTGNLLVIFEEWGGD 537
Query: 725 PTKISFVTKQLGSSLCSHVTDSHPLPVDMWGSDSKIQRKPGPVLSLECPNPNQVISSIKF 784
PT IS V K++ S+C+ V++ P + W + + K + L+C + + ++ IKF
Sbjct: 538 PTGISMV-KRIAGSICADVSEWQPSMAN-WRTKGYEKAK----VHLQCDHGRK-MTHIKF 590
Query: 785 ASFGTPLGTCGSFSRGRCSSARSLSVVRQACVGSKSCSIGVSVNTF-GDPCKGVMKSLAV 843
ASFGTP G+CGS+S G C + +S + ++C+G + C + V + F GDPC G MK V
Sbjct: 591 ASFGTPQGSCGSYSEGGCHAHKSYDIFWKSCIGQERCGVSVVPDAFGGDPCPGTMKRAVV 650
Query: 844 EASC 847
EA C
Sbjct: 651 EAIC 654
>gi|195615772|gb|ACG29716.1| beta-galactosidase precursor [Zea mays]
Length = 450
Score = 445 bits (1145), Expect = e-122, Method: Compositional matrix adjust.
Identities = 240/464 (51%), Positives = 293/464 (63%), Gaps = 19/464 (4%)
Query: 273 LAFAVARFFQRGGTFQNYYMYHGGTNFDRTSGGPFISTSYDYDAPLDEYGLIRQPKWGHL 332
+AFAVARF Q+GG+F NYYMYHGGTNFDRTSGGPFI+TSYDYDAP+DEYGL+RQPKWGHL
Sbjct: 1 MAFAVARFIQKGGSFVNYYMYHGGTNFDRTSGGPFIATSYDYDAPIDEYGLLRQPKWGHL 60
Query: 333 KDLHKAIKLCEAALVATDPTYPSLGPNLEATVYKTGSGLCSAFLANIGTNSDVTVKFNGN 392
+DLHKAIK E ALV+ DPT SLG +A V+K+ G C+AFL+N T++ V FNG
Sbjct: 61 RDLHKAIKQAEPALVSGDPTIQSLGNYEKAYVFKSSGGACAAFLSNYHTSAAARVVFNGR 120
Query: 393 SYLLPAWSVSILPDCKNVVFNTAKINSVTLVPSFSRQSLQVAADSSDAIGSGWSYINEPV 452
Y LPAWS+S+LPDCK VFNTA ++ PS A S A G W +E
Sbjct: 121 RYDLPAWSISVLPDCKAAVFNTATVSE----PS-------APARMSPAGGFSWQSYSEAT 169
Query: 453 GISKDDAFTKPGLLEQINTTADQSDYLWYSLSTNIKADEPLLEDGSKTVLHVQSLGHALH 512
AFTK GL+EQ++ T D+SDYLWY+ NI ++E L+ G L V S GH+L
Sbjct: 170 NSLDGRAFTKDGLVEQLSMTWDKSDYLWYTTYVNINSNEQFLKSGQWPQLTVYSAGHSLQ 229
Query: 513 AFINGKLVGSGYGSSSNAKVTVDFPIALAPGKNTFDLLSLTVGLQNYGAFYEKTGAGITG 572
F+NG+ G+ YG + K+T + + G N +LS VGL N G YE G+ G
Sbjct: 230 VFVNGQSYGAVYGGYDSPKLTYSGYVKMWQGSNKISILSAAVGLPNQGTHYETWNVGVLG 289
Query: 573 PVQLKGSGNGTNIDLSSQQWTYQTGLKGEELNFPS---GSSTQWDSKSTLPKLQPLVWYK 629
PV L G G DLS+Q+WTYQ GL GE L S SS +W S + QPL W+K
Sbjct: 290 PVTLSGLNEGKR-DLSNQKWTYQIGLHGESLGVQSVAGSSSVEWGSAA---GKQPLTWHK 345
Query: 630 TTFDAPAGSEPVAIDFTGMGKGEAWVNGQSIGRYWPTYVSQNGGCTDSCNYRGAYSSNKC 689
F AP+G PVA+D MGKG+AWVNG+ IGRYW +Y + + G C+Y G YS KC
Sbjct: 346 AYFSAPSGDAPVALDMGSMGKGQAWVNGRHIGRYW-SYKASSSGGCGGCSYAGTYSETKC 404
Query: 690 LKNCGKPSQSLYHVPRSWLKSSGNTLVLFEEIGGDPTKISFVTK 733
CG SQ YHVPRSWL SGN LVL EE GGD + VT+
Sbjct: 405 QTGCGDVSQRYYHVPRSWLNPSGNLLVLLEEFGGDLPGVKLVTR 448
>gi|357483613|ref|XP_003612093.1| Beta-galactosidase [Medicago truncatula]
gi|355513428|gb|AES95051.1| Beta-galactosidase [Medicago truncatula]
Length = 504
Score = 440 bits (1132), Expect = e-120, Method: Compositional matrix adjust.
Identities = 248/514 (48%), Positives = 311/514 (60%), Gaps = 23/514 (4%)
Query: 341 LCEAALVATDPTYPSLGPNLEATVYKTGSGLCSAFLANIGTNSDVTVKFNGNSYLLPAWS 400
+CE AL++TDP SLG +A VY T SG CSAFL+N + S V FN Y LP WS
Sbjct: 1 MCEKALISTDPVVTSLGNFQQAYVYTTESGDCSAFLSNYDSKSSARVMFNNMHYNLPPWS 60
Query: 401 VSILPDCKNVVFNTAKINSVTLVPSFSRQSLQVAADSSDAIGSGWSYINEPVGISKDDAF 460
VSILPDC+N VFNTAK+ T +Q+ +S+ W E S
Sbjct: 61 VSILPDCRNAVFNTAKVGVQT-------SQMQMLPTNSERFS--WESFEEDTSSSSATTI 111
Query: 461 TKPGLLEQINTTADQSDYLWYSLSTNIKADEPLLEDGSKTVLHVQSLGHALHAFINGKLV 520
T GLLEQIN T D SDYLWY S ++ + E L G L VQS GHA+H FING+L
Sbjct: 112 TASGLLEQINVTRDTSDYLWYITSVDVGSSESFLHGGKLPSLIVQSTGHAVHVFINGRLS 171
Query: 521 GSGYGSSSNAKVTVDFPIALAPGKNTFDLLSLTVGLQNYGAFYEKTGAGITGPVQLKGSG 580
GS YG+ + + + L G NT LLS+ VGL N G +E GI GPV + G
Sbjct: 172 GSAYGTREDRRFRYTGDVNLRAGTNTIALLSVAVGLPNVGGHFETWNTGILGPVVIHGLD 231
Query: 581 NGTNIDLSSQQWTYQTGLKGEELNF--PSG-SSTQW-DSKSTLPKLQPLVWYKTTFDAPA 636
G +DLS Q+WTYQ GLKGE +N P G SS +W S + + QPL W+KT FDAP
Sbjct: 232 KG-KLDLSWQKWTYQVGLKGEAMNLASPDGISSVEWMQSAVVVQRNQPLTWHKTFFDAPE 290
Query: 637 GSEPVAIDFTGMGKGEAWVNGQSIGRYWPTYVSQNGGCTDSCNYRGAYSSNKCLKNCGKP 696
G EP+A+D GMGKG+ W+NG SIGRYW + G C D CNY G++ KC CG+P
Sbjct: 291 GEEPLALDMDGMGKGQIWINGISIGRYWTAIAT--GSCND-CNYAGSFRPPKCQLGCGQP 347
Query: 697 SQSLYHVPRSWLKSSGNTLVLFEEIGGDPTKISFVTKQLGSSLCSHVTDSHPLPVDMWGS 756
+Q YHVPRSWLK + N LV+FEE+GGDP+KIS + + SS+C+ V++ HP + W
Sbjct: 348 TQRWYHVPRSWLKQNHNLLVVFEELGGDPSKISLAKRSV-SSVCADVSEYHP-NLKNWHI 405
Query: 757 DS--KIQRKPGPVLSLECPNPNQVISSIKFASFGTPLGTCGSFSRGRCSSARSLSVVRQA 814
DS K + P + L C NP Q ISSIKFASFGTPLGTCGS+ +G C S+ S ++ Q
Sbjct: 406 DSYGKSENFRPPKVHLHC-NPGQAISSIKFASFGTPLGTCGSYEQGACHSSSSYDILEQK 464
Query: 815 CVGSKSCSIGVSVNTFG-DPCKGVMKSLAVEASC 847
C+G C + VS + FG DPC V+K L+VEA C
Sbjct: 465 CIGKPRCIVTVSNSNFGRDPCPNVLKRLSVEAVC 498
>gi|115480419|ref|NP_001063803.1| Os09g0539200 [Oryza sativa Japonica Group]
gi|113632036|dbj|BAF25717.1| Os09g0539200 [Oryza sativa Japonica Group]
Length = 446
Score = 429 bits (1103), Expect = e-117, Method: Compositional matrix adjust.
Identities = 209/414 (50%), Positives = 277/414 (66%), Gaps = 6/414 (1%)
Query: 21 TSFGANVTYDHRAVVIGGKRRVLISGSIHYPRSTPEMWPDLIQKSKDGGLDVIETYVFWN 80
T G V+YD R+++I GKR + SG+IHYPRS PEMW L++ +K GGL+ IETYVFWN
Sbjct: 30 TKKGTVVSYDERSLMIDGKRDLFFSGAIHYPRSPPEMWDKLVKTAKMGGLNTIETYVFWN 89
Query: 81 LHEPVRNQYNFEGRYDLVKFVKLVAEAGLYAHLRIGPYVCAEWNFGGFPLWLHFIPGIQF 140
HEP +Y FEGR+DL++F+ ++ + +YA +RIGP++ AEWN GG P WL I I F
Sbjct: 90 GHEPEPGKYYFEGRFDLIRFLNVIKDNDMYAIVRIGPFIQAEWNHGGLPYWLREIGHIIF 149
Query: 141 RTDNEPFKAEMQRFTAKIVDMMKQEKLYASQGGPIILSQIENEYGNIDSAYGAAGKSYIK 200
R +NEPFK EM++F IV +K +++A QGGPIILSQIENEYGNI G Y++
Sbjct: 150 RANNEPFKREMEKFVRFIVQKLKDAEMFAPQGGPIILSQIENEYGNIKKDRKVEGDKYLE 209
Query: 201 WAAGMALSLDTGVPWVMCQQSDAPDPIINTCNGFYC-DQFTPNSNNKPKMWTENWSGWFL 259
WAA MA+S GVPWVMC+QS AP +I TCNG +C D +T NKP++WTENW+ F
Sbjct: 210 WAAEMAISTGIGVPWVMCKQSIAPGEVIPTCNGRHCGDTWTLLDKNKPRLWTENWTAQFR 269
Query: 260 SFGGAVPYRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNFDRTSGGPFISTSYDYDAPLD 319
+FG + R ED+A+AV RFF +GGT NYYMYHGGTNF RT G ++ T Y +AP+D
Sbjct: 270 TFGDQLAQRSAEDIAYAVLRFFAKGGTLVNYYMYHGGTNFGRT-GASYVLTGYYDEAPMD 328
Query: 320 EYGLIRQPKWGHLKDLHKAIKLCEAALVATDPTYPSLGPNLEATVYKT-GSGLCSAFLAN 378
EYG+ ++PK+GHL+DLH IK A + ++ LG EA Y+ LC +FL+N
Sbjct: 329 EYGMCKEPKFGHLRDLHNVIKSYHKAFLWGKQSFEILGHGYEAHNYELPEDKLCLSFLSN 388
Query: 379 IGTNSDVTVKFNGNSYLLPAWSVSILPDCKNVVFNTAKINSVTLVPSFSRQSLQ 432
T D TV F G + +P+ SVSIL DCK VV+NT + V ++ F+ L+
Sbjct: 389 NNTGEDGTVVFRGEKFYVPSRSVSILADCKTVVYNTKR---VCVLHKFTENKLR 439
>gi|34481839|emb|CAD44519.1| putative beta-galactosidase [Carica papaya]
Length = 285
Score = 424 bits (1089), Expect = e-115, Method: Compositional matrix adjust.
Identities = 196/286 (68%), Positives = 228/286 (79%), Gaps = 1/286 (0%)
Query: 122 EWNFGGFPLWLHFIPGIQFRTDNEPFKAEMQRFTAKIVDMMKQEKLYASQGGPIILSQIE 181
EWNFGGFP+WL ++PGIQFRTDN PFKA+MQ+FT KIV+MMK EKL+ Q GPII+SQIE
Sbjct: 1 EWNFGGFPVWLKYVPGIQFRTDNGPFKAQMQKFTEKIVNMMKAEKLFEPQEGPIIMSQIE 60
Query: 182 NEYGNIDSAYGAAGKSYIKWAAGMALSLDTGVPWVMCQQSDAPDPIINTCNGFYCDQFTP 241
NEYG I+ GA GK+Y KWAA MA+ L TGVPW+MC+Q DAPDPII+TCNGFYC+ F P
Sbjct: 61 NEYGPIEWEIGAPGKAYTKWAAQMAVGLGTGVPWIMCKQEDAPDPIIDTCNGFYCENFMP 120
Query: 242 NSNNKPKMWTENWSGWFLSFGGAVPYRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNFDR 301
N+N KPKM+TE W+GW+ FGG VPYRP ED+A++VARF Q G+F NYYMYHGGTNF R
Sbjct: 121 NANYKPKMFTEAWTGWYTEFGGPVPYRPAEDMAYSVARFIQNRGSFINYYMYHGGTNFGR 180
Query: 302 TSGGPFISTSYDYDAPLDEYGLIRQPKWGHLKDLHKAIKLCEAALVATDPTYPSLGPNLE 361
T+GGPFI+TSYDYDAPLDEYGL R+PKWGHL+DLHK IKLCE +LV+ DP SLG N E
Sbjct: 181 TAGGPFIATSYDYDAPLDEYGLGREPKWGHLRDLHKTIKLCEPSLVSVDPKVTSLGSNQE 240
Query: 362 ATVYKTGSGLCSAFLANIGTNSDVTVKFNGNSYLLPAWSVSILPDC 407
A V+ T + C+AFLAN V V F Y LP WSVSILPDC
Sbjct: 241 AHVFWTKTS-CAAFLANYDLKYSVRVTFQNLPYDLPPWSVSILPDC 285
>gi|84468366|dbj|BAE71266.1| putative beta-galactosidase [Trifolium pratense]
Length = 425
Score = 423 bits (1088), Expect = e-115, Method: Compositional matrix adjust.
Identities = 221/429 (51%), Positives = 277/429 (64%), Gaps = 8/429 (1%)
Query: 314 YDAPLDEYGLIRQPKWGHLKDLHKAIKLCEAALVATDPTYPSLGPNLEATVYKTGSGLCS 373
YDAP+DEYGL R PKWGHLKDLHKAIKLCE L+ SLGP++EA VY SG C+
Sbjct: 1 YDAPVDEYGLPRLPKWGHLKDLHKAIKLCEHVLLYGKSVNVSLGPSVEADVYTDSSGACA 60
Query: 374 AFLANIGTNSDVTVKFNGNSYLLPAWSVSILPDCKNVVFNTAKINSVTLVPSFSRQSLQV 433
AF+AN+ +D TV+F SY +PAWSVSILPDCKNVV+NTAK+ + T + + LQ
Sbjct: 61 AFIANVDDKNDKTVEFRNASYHIPAWSVSILPDCKNVVYNTAKVTTQTNKIAMIPEKLQQ 120
Query: 434 AADSSDAIGSGWSYINEPVGISKDDAFTKPGLLEQINTTADQSDYLWYSLSTNIKADEPL 493
+ W E GI F G ++ INTT D +DYLW++ S +I +E L
Sbjct: 121 SDKGQKTF--KWDVWKENPGIWGKPDFVINGFVDHINTTKDTTDYLWHTTSISIDENEEL 178
Query: 494 LEDGSKTVLHVQSLGHALHAFINGKLVGSGYGSSSNAKVTVDFPIALAPGKNTFDLLSLT 553
L+ GSK VL ++S GHALHAF+N K G+ YG+ S++ T PI+L GKN LLSLT
Sbjct: 179 LKKGSKPVLVIESKGHALHAFVNQKYQGTAYGNGSHSAFTFKNPISLKAGKNEIALLSLT 238
Query: 554 VGLQNYGAFYEKTGAGITGPVQLKGSGNGTNIDLSSQQWTYQTGLKGEELNFPSG---SS 610
VGLQ G FY+ GAG+T V++KG N T IDLSS WTY+ G++GE L G +S
Sbjct: 239 VGLQTAGPFYDFVGAGVTS-VKIKGLNNKT-IDLSSNAWTYKIGVQGEHLKIYQGNGLNS 296
Query: 611 TQWDSKSTLPKLQPLVWYKTTFDAPAGSEPVAIDFTGMGKGEAWVNGQSIGRYWPTYVS- 669
W S S PK Q L WYK DAP G EPV +D MGKG AW+NG+ IGRYWP
Sbjct: 297 VSWTSTSEPPKGQTLTWYKAIVDAPPGDEPVGLDMLYMGKGFAWLNGEGIGRYWPRISEF 356
Query: 670 QNGGCTDSCNYRGAYSSNKCLKNCGKPSQSLYHVPRSWLKSSGNTLVLFEEIGGDPTKIS 729
+ C + C+YRG ++ +KC CG+PSQ YHVPRSW K SGN LV FEE GGDPTKI+
Sbjct: 357 KKEDCVEECDYRGKFNPDKCDTGCGEPSQKWYHVPRSWFKPSGNVLVFFEEKGGDPTKIT 416
Query: 730 FVTKQLGSS 738
FV +++ ++
Sbjct: 417 FVRRKVSTT 425
>gi|34481809|emb|CAD44190.1| putative beta-galactosidase [Mangifera indica]
gi|34481811|emb|CAD44191.1| putative beta-galactosidase [Mangifera indica]
Length = 286
Score = 422 bits (1084), Expect = e-115, Method: Compositional matrix adjust.
Identities = 192/286 (67%), Positives = 227/286 (79%)
Query: 122 EWNFGGFPLWLHFIPGIQFRTDNEPFKAEMQRFTAKIVDMMKQEKLYASQGGPIILSQIE 181
EWNFGGFP+WL F+PGI FRTDNEPFK MQ FT KIV MMK EKL+ SQGGPIILSQIE
Sbjct: 1 EWNFGGFPVWLKFVPGISFRTDNEPFKRAMQNFTQKIVQMMKDEKLFESQGGPIILSQIE 60
Query: 182 NEYGNIDSAYGAAGKSYIKWAAGMALSLDTGVPWVMCQQSDAPDPIINTCNGFYCDQFTP 241
NEY +G+AG++Y+ WAA MA L+TGVPWVMC++ DAPDP+INTCNGFYCD+F+P
Sbjct: 61 NEYEPERMKFGSAGEAYMNWAAQMATGLNTGVPWVMCKEYDAPDPVINTCNGFYCDKFSP 120
Query: 242 NSNNKPKMWTENWSGWFLSFGGAVPYRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNFDR 301
N KPK+WTE W+GWF FGG + RPVEDLAFAVARF Q GG+F NYYMYHGGTNF R
Sbjct: 121 NKPFKPKLWTEAWTGWFTEFGGPIYQRPVEDLAFAVARFIQAGGSFVNYYMYHGGTNFGR 180
Query: 302 TSGGPFISTSYDYDAPLDEYGLIRQPKWGHLKDLHKAIKLCEAALVATDPTYPSLGPNLE 361
T+GGPFI+TSYDYDAP+DEYGLIR+PK+ HLK+LH+A+KLCE AL+ DP SLG +
Sbjct: 181 TAGGPFITTSYDYDAPIDEYGLIRRPKYDHLKELHQAVKLCETALLYADPYVMSLGNYEQ 240
Query: 362 ATVYKTGSGLCSAFLANIGTNSDVTVKFNGNSYLLPAWSVSILPDC 407
A V+ + SG C+AFL+N + S V FN + LP WS+SILPDC
Sbjct: 241 AHVFSSTSGGCAAFLSNFNSKSSARVTFNRKHFYLPPWSISILPDC 286
>gi|281209972|gb|EFA84140.1| glycoside hydrolase family 35 protein [Polysphondylium pallidum
PN500]
Length = 707
Score = 421 bits (1083), Expect = e-115, Method: Compositional matrix adjust.
Identities = 243/618 (39%), Positives = 351/618 (56%), Gaps = 48/618 (7%)
Query: 27 VTYDHRAVVIGGKRRVLISGSIHYPRSTPEMWPDLIQKSKDGGLDVIETYVFWNLHEPVR 86
VTYD R+++I G+R++ +SGS+HYPRSTP +W ++ SK+ G+++I+TYVFW+LHEP R
Sbjct: 108 VTYDGRSLLINGERKLFVSGSVHYPRSTPTIWKKVLALSKNSGINMIDTYVFWDLHEPQR 167
Query: 87 NQYNFEGRYDLVKFVKLVAEAGLYAHLRIGPYVCAEWNFGGFPLWLHFIPGIQFRTDNEP 146
YNFEG +L F+ L + GL+ +LRIGPY+CAEWN+GG P+WL IPGI+ R N
Sbjct: 168 GVYNFEGNANLKHFLDLCQQNGLFVNLRIGPYICAEWNYGGLPIWLKDIPGIKMRDFNTQ 227
Query: 147 FKAEMQRFTAKIVDMMKQEKLYASQGGPIILSQIENEYGNIDSAYGAAGKSYIKWAAGMA 206
+ E++R+ IVD + +A QGGPI+L+QIENEY + Y +G+ + W A +A
Sbjct: 228 YMEEVERWMKFIVDYL--HGYFAPQGGPIVLAQIENEYNWVQWRYQESGRKFAHWCADLA 285
Query: 207 LSLDTGVPWVMCQQSDAPDPIINTCNGFYCDQFT----PNSNNKPKMWTENWSGWFLSFG 262
LD G+PW+MCQQ D P +INTCNG+YC ++ N ++P ++TENWSGWF ++
Sbjct: 286 NRLDIGIPWIMCQQDDIPT-VINTCNGYYCHEWINFHWNNFKDQPPLFTENWSGWFNNWV 344
Query: 263 GAVPYRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNFDRTSGGPFISTSYDYDAPLDEYG 322
AV +RPV DL ++ AR+F GG NYYM+HGGTNF R S GP I+ SYDYDAPL+EYG
Sbjct: 345 NAVRHRPVADLLYSAARWFASGGALMNYYMWHGGTNFGRKS-GPMIALSYDYDAPLNEYG 403
Query: 323 LIRQPKWGHLKDLHKAIKLCEAALVATDPTYPS-LGPNLEATVYKTGSGLCSAFLANIGT 381
R PK+ +D +K I E L++ P P L N+ Y+ G+ ++F+ N
Sbjct: 404 NPRNPKYSQTRDFNKLILSLEDILLSQYPPTPIFLANNISVIHYRNGNN-SASFIINSNE 462
Query: 382 NSDVTVKFNGNSYLLPAWSVSILPDCKNVVFNTAKINSVTLVPSFSRQSLQVAADSSDAI 441
N + V F G SY A+SV IL KN V SV R +S I
Sbjct: 463 NGNSKVMFEGRSYFSYAYSVQIL---KNYV-------SVFDSSQNPRNYTDTVVESEPNI 512
Query: 442 GSGWSYINEPV-GISKDDAFTKPGLLEQINTTADQSDYLWYSLSTNIKADEPLLEDGSKT 500
S I++ V +++ L+EQ+N T D++DY+WY+ N D +L+ +KT
Sbjct: 513 PFANSIISKHVERFDFEESLYDNRLMEQLNLTKDETDYIWYTTMINHDQDGEILKVINKT 572
Query: 501 VLHVQSLGHALHAFINGKLVGSGYGSSSNAKVTVDFPIALAPGKNTFDLLSLTVGLQNYG 560
+ +H F++ VG+ S++ P+ G +T LL +G+Q+Y
Sbjct: 573 DI--------VHVFVDSYYVGT---IMSDSLAITGVPL----GPSTLQLLHTKMGIQHYE 617
Query: 561 AFYEKTGAGITGPVQLKGSGNGTNIDLSSQQWTYQTGLKGEE-LNFPSGSS-TQWDSKST 618
E T AGI GPV +I++++Q W + + E+ + P S +W
Sbjct: 618 LHMENTKAGILGPVYYG------DIEITNQMWGSKPFVSSEKVITDPIQSKFVRWSPLDR 671
Query: 619 LPKL----QPLVWYKTTF 632
P PL WYK F
Sbjct: 672 KPNEVFYSVPLTWYKFIF 689
>gi|373853838|ref|ZP_09596637.1| glycoside hydrolase family 35 [Opitutaceae bacterium TAV5]
gi|372473365|gb|EHP33376.1| glycoside hydrolase family 35 [Opitutaceae bacterium TAV5]
Length = 744
Score = 409 bits (1051), Expect = e-111, Method: Compositional matrix adjust.
Identities = 265/789 (33%), Positives = 384/789 (48%), Gaps = 126/789 (15%)
Query: 26 NVTYDHRAVVIGGKRRVLISGSIHYPRSTPEMWPDLIQKSKDGGLDVIETYVFWNLHEPV 85
V++DHRA+++ G+R +++SG++HYPRSTP MWP +++ + GL+ +ETY+FWNLHE
Sbjct: 2 TVSFDHRALLLDGRRTLVLSGAVHYPRSTPAMWPRILRHMRQSGLNTVETYIFWNLHERR 61
Query: 86 RNQYNFEGRYDLVKFVKLVAEAGLYAHLRIGPYVCAEWNFGGFPLWLHFIPGIQFRTDNE 145
R +F GR DLV+F +L GL LRIGPY+CAE N+GG P WL +P I+ RTDNE
Sbjct: 62 RGVLDFSGRLDLVRFCRLAQAEGLNVILRIGPYICAETNYGGLPGWLRDVPDIRMRTDNE 121
Query: 146 PFKAEMQRFTAKIVDMMKQEKLYASQGGPIILSQIENEYGNIDSAYGAAGKSYIKWAAGM 205
FK E R+ + ++++ L A GGP+IL+QIENEY NI + YG G+ Y++W+ +
Sbjct: 122 AFKREKARWVRLVAEVIR--PLCAPNGGPVILAQIENEYDNIAATYGEDGRRYLRWSVEL 179
Query: 206 ALSLDTGVPWVMC-----QQSDAPDPI------INTCNGFYCD----QFTPNSNNKPKMW 250
A SL G+PWV C ++ D + + T N F Q +P +W
Sbjct: 180 AQSLGLGIPWVTCAAGRAAEAGEKDAVASAGDSLETLNAFRAHEIIGQHFREHPEQPALW 239
Query: 251 TENWSGWFLSFGGAVPYRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNFDRTSGGPFIST 310
TENW+GW+ ++GG +P R E+LA+A ARFF GG+ NY+++HGGTNF R G ++T
Sbjct: 240 TENWAGWYQTWGGVLPKREPEELAYATARFFAAGGSGVNYFLWHGGTNFGR-DGMYLLTT 298
Query: 311 SYDYDAPLDEYGLIRQPKWGHLKDLHKAIKLCEAALVATDPTYPSLGPNLEATVYKTGSG 370
+Y++ PLDEYGL K HL L+KA+ C ++A++ G ++ SG
Sbjct: 299 AYEFGGPLDEYGLP-TTKARHLARLNKALAACADKILASERPRAITGERNGLLKFQYSSG 357
Query: 371 LCSAFLANIGTNSDVTVKFNG-NSYLLPAWSVSILPDCKNVVFNTAKINSVTLVPSFSRQ 429
L F + + TV+ G N +L S + P V T K + V P
Sbjct: 358 LT--FWCD---DVARTVRIVGKNGEVLYDSSARVAP-----VRRTWKASGVRFAP----- 402
Query: 430 SLQVAADSSDAIGSGWSYINEPVGIS----KDDAFTKPGLLEQINTTADQSDYLWYSLST 485
W + EP+ + A T LEQ+ T D++DY WY +
Sbjct: 403 ---------------WGWRAEPLPAAWPAEAQSAVTARKPLEQLLLTKDETDYCWYETAI 447
Query: 486 NIKADEPLL---EDGSKT--------------------------------VLHVQSLGHA 510
++ +L DGS L + +
Sbjct: 448 VVEGSGDVLVAGRDGSPAGLERGALARVGRRGRRPSIAGLASEVPANTVNTLRLTRVADI 507
Query: 511 LHAFINGKLVGS-------GYGSSSNAKVTVDFPIAL-----APGKNTFDLLSLTVGL-- 556
+H FI+G V + G T F + L PGK+ LL +GL
Sbjct: 508 VHVFIDGTFVATTPTPLRERRGKMDAGLFTQTFELDLKALRITPGKHRLSLLCCALGLIK 567
Query: 557 QNYGAFYEKTG---AGITGPVQLKGSGNGTNIDLSSQQWTYQTGLKGEELNF---PSGSS 610
++ YE G+ PV NG ++ +W +Q GL GE F +GS
Sbjct: 568 GDWMIGYENMALEKKGLWAPVFW----NGKKLE---GEWRHQPGLLGERCGFADPAAGSL 620
Query: 611 TQWDSKSTLP---KLQPLVWYKTTFDAPAGSEPVAIDFTGMGKGEAWVNGQSIGRYWPTY 667
W + +PL W++TTF P G P A+D GMGKG AW+NG IGRYW
Sbjct: 621 LAWKTAKAATGRGARRPLRWWRTTFTRPKGHGPWALDLGGMGKGMAWINGHCIGRYWLLA 680
Query: 668 VSQNGGCTDSCNYRGAYSSNKCLKNCGKPSQSLYHVPRSWLKSSG--NTLVLFEEIGGDP 725
+ G + + P+Q YHVP WL++ G +TLVLFEE+GGDP
Sbjct: 681 DTDPMG-----PWMAWMKGSLTAAPSSGPTQRYYHVPDDWLRTDGGPDTLVLFEELGGDP 735
Query: 726 TKISFVTKQ 734
+ V ++
Sbjct: 736 ATVRLVRRE 744
>gi|323371174|gb|ADX59436.1| beta-galactosidase [Coffea arabica]
Length = 338
Score = 402 bits (1033), Expect = e-109, Method: Compositional matrix adjust.
Identities = 193/357 (54%), Positives = 243/357 (68%), Gaps = 33/357 (9%)
Query: 12 CWGFVVLATTSF-----GANVTYDHRAVVIGGKRRVLISGSIHYPRSTPEMWPDLIQKSK 66
C+G +++ T+ G V+YD R+++I G+R++L SGSIHYPRSTP+MWP LI K+K
Sbjct: 8 CFGLLMVMWTTTRGGVEGGQVSYDGRSLIIEGQRKLLFSGSIHYPRSTPDMWPSLISKAK 67
Query: 67 DGGLDVIETYVFWNLHEPVRNQYNFEGRYDLVKFVKLVAEAGLYAHLRIGPYVCAEWNFG 126
GGLDVIETYVFWNLHEP QY+F+GR+++V+F++ + GLYA +RIGP++ AEW +G
Sbjct: 68 HGGLDVIETYVFWNLHEPRHGQYDFKGRHNIVRFIREIQAHGLYAFIRIGPFIEAEWTYG 127
Query: 127 GFPLWLHFIPGIQFRTDNEPFKAEMQRFTAKIVDMMKQEKLYASQGGPIILSQIENEYGN 186
G P WLH +PGI +R+DNEPFK MQ FT KIV++ K E LYA QGGPIIL QIENEY N
Sbjct: 128 GLPFWLHDVPGIVYRSDNEPFKYHMQNFTTKIVNLFKSEGLYAPQGGPIILQQIENEYKN 187
Query: 187 IDSAYGAAGKSYIKWAAGMALSLDTGVPWVMCQQSDAPDPIINTCNGFYCDQ--FTPNSN 244
+ A+ G Y++WAA MA+ L TGVPWVMC+Q DAPDP+INTCNG C + PNS
Sbjct: 188 AERAFHEKGPPYVQWAAAMAVGLQTGVPWVMCKQDDAPDPVINTCNGRTCGETFVGPNSP 247
Query: 245 NKPKMWTENWSGWFLSFGGAVPYRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNFDRTSG 304
NKP +WT+NW+ + G+F NYYMYHGGTNF RT G
Sbjct: 248 NKPAIWTDNWTS-------------------------LKNGSFVNYYMYHGGTNFGRT-G 281
Query: 305 GPFISTSYDYDAPLDEYGLIRQPKWGHLKDLHKAIKLCEAALVATDPTYPSLGPNLE 361
F+ TSY +AP+DEYGLIRQPKWGHLK LH IK C L+ + LG E
Sbjct: 282 SAFVLTSYYDEAPIDEYGLIRQPKWGHLKQLHSVIKSCSQTLLHGVISVSPLGQQQE 338
>gi|19386854|dbj|BAB86232.1| putative beta-D-galactosidase [Oryza sativa Japonica Group]
Length = 774
Score = 401 bits (1030), Expect = e-109, Method: Compositional matrix adjust.
Identities = 178/288 (61%), Positives = 220/288 (76%), Gaps = 20/288 (6%)
Query: 25 ANVTYDHRAVVIGGKRRVLISGSIHYPRSTPEMWPDLIQKSKDGGLDVIETYVFWNLHEP 84
++VTYDHR+++I G+RR+LIS SIHYPRS PEMWP L+ ++KDGG D +ETYVFWN HEP
Sbjct: 36 SSVTYDHRSLIISGRRRLLISTSIHYPRSVPEMWPKLVAEAKDGGADCVETYVFWNGHEP 95
Query: 85 VRNQ--------------------YNFEGRYDLVKFVKLVAEAGLYAHLRIGPYVCAEWN 124
+ Q Y FE R+DLV+F K+V +AGLY LRIGP+V AEW
Sbjct: 96 AQGQVRAASPKFVMDLACSIRDKPYYFEERFDLVRFAKIVKDAGLYMILRIGPFVAAEWT 155
Query: 125 FGGFPLWLHFIPGIQFRTDNEPFKAEMQRFTAKIVDMMKQEKLYASQGGPIILSQIENEY 184
FGG P+WLH+ PG FRT+NEPFK+ M+RFT IVDMMK+E+ +ASQGG IIL+Q+ENEY
Sbjct: 156 FGGVPVWLHYAPGTVFRTNNEPFKSHMKRFTTYIVDMMKKEQFFASQGGHIILAQVENEY 215
Query: 185 GNIDSAYGAAGKSYIKWAAGMALSLDTGVPWVMCQQSDAPDPIINTCNGFYCDQFTPNSN 244
G+++ AYGA K Y WAA MAL+ +TGVPW+MCQQ DAPDP+INTCN FYCDQF PNS
Sbjct: 216 GDMEQAYGAGAKPYAMWAASMALAQNTGVPWIMCQQYDAPDPVINTCNSFYCDQFKPNSP 275
Query: 245 NKPKMWTENWSGWFLSFGGAVPYRPVEDLAFAVARFFQRGGTFQNYYM 292
KPK WTENW GWF +FG + P+RP ED+AF+VARFF +GG+ QNYY+
Sbjct: 276 TKPKFWTENWPGWFQTFGESNPHRPPEDVAFSVARFFGKGGSLQNYYV 323
Score = 383 bits (984), Expect = e-103, Method: Compositional matrix adjust.
Identities = 213/498 (42%), Positives = 284/498 (57%), Gaps = 58/498 (11%)
Query: 362 ATVYKTGSGLCSAFLANIGTNSDVTVKFNGNSYLLPAWSVSILPDCKNVVFNTAKINSVT 421
A VY SG C AFL+N+ + D V F SY LPAWSVSILPDCKNV FNTAK+ S T
Sbjct: 324 ADVYTDQSGGCVAFLSNVDSEKDKVVTFQSRSYDLPAWSVSILPDCKNVAFNTAKVRSQT 383
Query: 422 LVPSFSRQSLQVAADSSDAIGSGWSYINEPVGISKDDAFTKPGLLEQINTTADQSDYLWY 481
L+ V A+ + GWS E GI + + G ++ INTT D +DYLWY
Sbjct: 384 LMMDM------VPANLESSKVDGWSIFREKYGIWGNIDLVRNGFVDHINTTKDSTDYLWY 437
Query: 482 SLSTNIKADEPLLEDGSKTVLHVQSLGHALHAFINGKLVGSGYGSSSNAKVTVDFPIALA 541
+ S ++ G VLH++S GHA+ AF+N +L+GS YG+ S + +V+ P+ L
Sbjct: 438 TTSFDVDGSHLA---GGNHVLHIESKGHAVQAFLNNELIGSAYGNGSKSNFSVEMPVNLR 494
Query: 542 PGKNTFDLLSLTVGLQNYGAFYEKTGAGITGPVQLKGSGNGTNIDLSSQQWTYQTGLKGE 601
GKN LLS+TVGLQN G YE GAGIT V++ G N IDLSS +W Y
Sbjct: 495 AGKNKLSLLSMTVGLQNGGPMYEWAGAGITS-VKISGMENRI-IDLSSNKWEY------- 545
Query: 602 ELNFPSGSSTQWDSKSTLPKLQPLVWYKTTFDAPAGSEPVAIDFTGMGKGEAWVNGQSIG 661
K D P G +PV +D MGKG AW+NG +IG
Sbjct: 546 ---------------------------KVNVDVPQGDDPVGLDMQSMGKGLAWLNGNAIG 578
Query: 662 RYWPTYVSQNGGCTDSCNYRGAYSSNKCLKNCGKPSQSLYHVPRSWLKSSGNTLVLFEEI 721
RYWP + CT SC+YRG +S NKC + CG+P+Q YHVPRSW SGNTLV+FEE
Sbjct: 579 RYWPRISPVSDRCTSSCDYRGTFSPNKCRRGCGQPTQRWYHVPRSWFHPSGNTLVIFEEK 638
Query: 722 GGDPTKISFVTKQLGSSLCSHVTDSHP-LPVDMWGSDSKIQRKPGPVLSLECPNPNQVIS 780
GGDPTKI+F +++ +S+CS V++ +P + ++ W +++ + + L CP + IS
Sbjct: 639 GGDPTKITF-SRRTVASVCSFVSEHYPSIDLESWDRNTQNDGRDAAKVQLSCPK-GKSIS 696
Query: 781 SIKFASFGTPLGTCGSFSRGRCSSARSLSVV---------RQACVGSKSCSIGVSVNTFG 831
S+KF SFG P GTC S+ +G C S+SVV R+AC+ C++ +S FG
Sbjct: 697 SVKFVSFGNPSGTCRSYQQGSCHHPNSISVVEKGTLGWAHRRACLNMNGCTVSLSDEGFG 756
Query: 832 -DPCKGVMKSLAVEASCT 848
D C GV K+LA+EA C+
Sbjct: 757 EDLCPGVTKTLAIEADCS 774
>gi|391229102|ref|ZP_10265308.1| beta-galactosidase [Opitutaceae bacterium TAV1]
gi|391218763|gb|EIP97183.1| beta-galactosidase [Opitutaceae bacterium TAV1]
Length = 743
Score = 395 bits (1014), Expect = e-107, Method: Compositional matrix adjust.
Identities = 263/788 (33%), Positives = 383/788 (48%), Gaps = 125/788 (15%)
Query: 26 NVTYDHRAVVIGGKRRVLISGSIHYPRSTPEMWPDLIQKSKDGGLDVIETYVFWNLHEPV 85
V++DHRA+++ G+R +++SG++HYPRSTP MWP +++ + GL+ +ETY+FWNLHE
Sbjct: 2 TVSFDHRALLLDGRRTLVLSGAVHYPRSTPAMWPRILRHMRQSGLNTVETYIFWNLHERR 61
Query: 86 RNQYNFEGRYDLVKFVKLVAEAGLYAHLRIGPYVCAEWNFGGFPLWLHFIPGIQFRTDNE 145
R +F GR DLV+F +L GL LRIGPY+CAE N+GG P WL +P I+ RTDNE
Sbjct: 62 RGVLDFSGRLDLVRFCRLAQAEGLNVILRIGPYICAETNYGGLPGWLRDVPDIRMRTDNE 121
Query: 146 PFKAEMQRFTAKIVDMMKQEKLYASQGGPIILSQIENEYGNIDSAYGAAGKSYIKWAAGM 205
FK E R+ + ++++ L A GGP+IL+QIENEY NI + YG G+ Y++W+ +
Sbjct: 122 AFKREKARWVRLVAEVIR--PLCAPNGGPVILAQIENEYDNIAATYGEDGRRYLRWSVEL 179
Query: 206 ALSLDTGVPWVMC-----QQSDAPDPI------INTCNGFYCD----QFTPNSNNKPKMW 250
A SL G+PWV C ++ D + + T N F Q +P +W
Sbjct: 180 AQSLGLGIPWVTCAAGRAAEAGEKDAVASAGDSLETLNAFRAHEIIGQHFREHPEQPALW 239
Query: 251 TENWSGWFLSFGGAVPYRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNFDRTSGGPFIST 310
TENW+GW+ ++GG +P R E+LA+A ARFF GG+ NY+++HGGTNF R G ++T
Sbjct: 240 TENWAGWYQTWGGVLPKREPEELAYATARFFAAGGSGVNYFLWHGGTNFGR-DGMYLLTT 298
Query: 311 SYDYDAPLDEYGLIRQPKWGHLKDLHKAIKLCEAALVATDPTYPSLGPNLEATVYKTGSG 370
+Y++ PLDEYGL K HL L+ A+ C L+A++ V + SG
Sbjct: 299 AYEFGGPLDEYGLP-TTKARHLARLNAALAACAGELLASE----------RPGVVEKSSG 347
Query: 371 LCSAFLANIGTNSDVTVKFNGNSYLLPAWSVSILPDCKNVVFNTAKINSVTLVPSFSRQS 430
+ V+++ +S L V + D V K V S
Sbjct: 348 V---------------VEYHYDSGL-----VFVCDDTARAVRIVKKSGEVLYDSSVRVAP 387
Query: 431 LQVAADSSDAIGSGWSYINEPVGIS----KDDAFTKPGLLEQINTTADQSDYLWYSLSTN 486
++ A SS + W + EP+ + A T LEQ+ T D++DY WY +
Sbjct: 388 VRRAWKSSGVRFAPWGWRAEPLPAAWPAEAQSAVTARKPLEQLLPTKDETDYCWYETAIV 447
Query: 487 IKADEPLL---EDGSKT--------------------------------VLHVQSLGHAL 511
++ +L DGS L + + +
Sbjct: 448 VEGSGDVLVAGRDGSPAGLERGALARVGRRGRRPSIAGLASEVPANTVNTLRLTRVADIV 507
Query: 512 HAFINGKLVGS-------GYGSSSNAKVTVDFPIAL-----APGKNTFDLLSLTVGL--Q 557
H FI+G V + G T F + L PGK+ LL +GL
Sbjct: 508 HVFIDGTFVATTPTPLRERRGKMDAGLFTQTFELDLKALRITPGKHRLSLLCCALGLIKG 567
Query: 558 NYGAFYEKTG---AGITGPVQLKGSGNGTNIDLSSQQWTYQTGLKGEELNF---PSGSST 611
++ YE G+ PV NG ++ +W +Q GL GE F +GS
Sbjct: 568 DWMIGYENMALEKKGLWAPVFW----NGKKLE---GEWRHQPGLLGERCGFADPAAGSLL 620
Query: 612 QWDSKSTLP---KLQPLVWYKTTFDAPAGSEPVAIDFTGMGKGEAWVNGQSIGRYWPTYV 668
W + +PL W++TTF P G P A+D GMGKG W+NG IGRYW
Sbjct: 621 AWKTAKAATGRGARRPLNWWRTTFTRPKGHGPWALDLGGMGKGFCWINGHCIGRYWLLPD 680
Query: 669 SQNGGCTDSCNYRGAYSSNKCLKNCGKPSQSLYHVPRSWLKSSG--NTLVLFEEIGGDPT 726
+ G + + G P+Q YHVP WL++ G +TLVLFEE+GGDP
Sbjct: 681 TDPMG-----PWMAWMKGSLTAAPSGGPTQRYYHVPDDWLRTDGGPDTLVLFEELGGDPA 735
Query: 727 KISFVTKQ 734
+ V ++
Sbjct: 736 TVRLVRRE 743
>gi|62869849|gb|AAY18075.1| beta-galactosidase, partial [Carica papaya]
Length = 263
Score = 393 bits (1010), Expect = e-106, Method: Compositional matrix adjust.
Identities = 180/264 (68%), Positives = 214/264 (81%), Gaps = 1/264 (0%)
Query: 142 TDNEPFKAEMQRFTAKIVDMMKQEKLYASQGGPIILSQIENEYGNIDSAYGAAGKSYIKW 201
TDNEPFKA MQ+FT KIV MMK E+L+ SQGGPIILSQIENE+G ++ GA GK+Y KW
Sbjct: 1 TDNEPFKAAMQKFTEKIVSMMKAEQLFQSQGGPIILSQIENEFGPVEWEIGAPGKAYTKW 60
Query: 202 AAGMALSLDTGVPWVMCQQSDAPDPIINTCNGFYCDQFTPNSNNKPKMWTENWSGWFLSF 261
AA MA+ L+TGVPW+MC+Q DAPDP+I+TCNGFYC+ FTPN N KPKMWTE W+GW+ F
Sbjct: 61 AARMAVGLNTGVPWIMCKQEDAPDPVIDTCNGFYCENFTPNKNYKPKMWTEVWTGWYTEF 120
Query: 262 GGAVPYRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNFDRTSGGPFISTSYDYDAPLDEY 321
GGAVP RP EDLAF++AR Q+GG+F NYYMYHGGTNF RT+GGPF++TSYDYDAPLDEY
Sbjct: 121 GGAVPTRPAEDLAFSIARLIQKGGSFVNYYMYHGGTNFGRTAGGPFMATSYDYDAPLDEY 180
Query: 322 GLIRQPKWGHLKDLHKAIKLCEAALVATDPTYPSLGPNLEATVYKTGSGLCSAFLANIGT 381
GL R+PKWGHL+DLHKAIK E+ALV+ +P+ SLG + EA V+K+ SG C+AFLAN T
Sbjct: 181 GLPREPKWGHLRDLHKAIKSSESALVSAEPSVTSLGNSQEAHVFKSKSG-CAAFLANYDT 239
Query: 382 NSDVTVKFNGNSYLLPAWSVSILP 405
S V F Y LP WS+SILP
Sbjct: 240 KSSAKVSFGNGQYELPPWSISILP 263
>gi|359476803|ref|XP_003631891.1| PREDICTED: LOW QUALITY PROTEIN: beta-galactosidase 11-like [Vitis
vinifera]
Length = 722
Score = 393 bits (1009), Expect = e-106, Method: Compositional matrix adjust.
Identities = 267/746 (35%), Positives = 371/746 (49%), Gaps = 128/746 (17%)
Query: 117 PYVCAEWNFGGFPLWLHFIPGIQFRTDNEPFKAEMQRFTAKIVDMMKQEKLYASQGGPII 176
P + + GG L+ I F +EP + M+RFT I+DMM +EK ASQGGPII
Sbjct: 88 PDIIXKARHGG----LNVIHTYAFWNLHEPVQDHMKRFTRMIIDMMSKEKXIASQGGPII 143
Query: 177 LSQIENEYGNIDSAYGAAGKSYIKWAAGMALSLDTGVPWVMCQQSDAPDPIINTCNGFYC 236
L+ +++ A+ G + WA MA+ L TG+P VMC+Q DAPDP+INTC G C
Sbjct: 144 LALVDSAI-----AFKEMGTRCVHWAGTMAVGLKTGIPXVMCKQKDAPDPVINTCKGRNC 198
Query: 237 -DQFT-PNSNNKPKMWTENWSGWFLSFGGAVPYRPVEDLAFAVARFFQRGGTFQNYYMYH 294
D FT PN NK + + + G + FG R EDLAF+ F + GT NYYMY+
Sbjct: 199 GDTFTGPNRPNKRSV-SNHXLGMYRVFGDPPSQRAAEDLAFSX--FISKNGTLANYYMYY 255
Query: 295 GGTNFDRTSGGPFISTSYDYDAPLDEYGLIRQPKWGHLKDLHKAIKLCEAALVATDPTYP 354
TNF RT+ F +T Y +APLDEYGL R+ KWGHL+DLH A++L + AL+ +
Sbjct: 256 SVTNFGRTTSS-FATTCYYDEAPLDEYGLPRETKWGHLRDLHAALRLSKKALLWGVTSAQ 314
Query: 355 SLGPNLEATVY-KTGSGLCSAFLANIGTNSDVTVKFNGNSYLLPAWSVSILPDCKNVVFN 413
LG +LEA +Y K GS +C+ FL N T + T G+ Y LP S+S LPDCK VVFN
Sbjct: 315 KLGEDLEARIYEKPGSNICATFLLNNITRTPTTTTLRGSKYYLPQHSISNLPDCKTVVFN 374
Query: 414 TAKINSVTLVPSFS-RQSLQVAADSSDAIGSGWSYINEPVGISKDDAFTKPGLLEQINTT 472
T T+V +S ++LQ W + + ++ +E + T
Sbjct: 375 TQ-----TVVSQYSVNKNLQ------------WXMSQDALPTYEECPTKTKSPVELMTMT 417
Query: 473 ADQSDYLWYSLSTNI-KADEPLLEDGSKTVLHVQSLGHALHAFINGK-----LVGSGYGS 526
D +DYLWY+ + + + P +D + V V +LGH +HAF+NG+ L G+ +GS
Sbjct: 418 KDTTDYLWYTTNIELARTGLPFRKDVLR-VPQVSNLGHVMHAFLNGEYMEFYLTGTRHGS 476
Query: 527 SSNAKVTVDFPIALAPGKNTFDLLSLTVGLQNYGAFYEKTGAGITGPVQLKGSGNGTNID 586
+ + PI L G N L TVGL + G++ E AG+ V ++G N ID
Sbjct: 477 NVEKSFVFNKPITLKAGLNQIAPLGATVGLPDSGSYMEHRLAGVHN-VAIQGL-NTRTID 534
Query: 587 LSSQQWTYQTGLKGEELNFPSGSSTQWDSKSTLPKLQPLVWYKTTFDAPAGSEPVAIDFT 646
L W + K FDAP G PVA++ +
Sbjct: 535 LPKNGWGH----------------------------------KAYFDAPEGDVPVALELS 560
Query: 647 GMGKGEAWVNGQSIGRYWPTYVSQNGGCTDSCNYRGAYSSNKCLKNCGKPSQSLYHVPRS 706
M KG AW+NG+SI YW +Y+S GKPSQS+YHVPR+
Sbjct: 561 TMAKGMAWINGKSIDXYWVSYLSP----------------------LGKPSQSVYHVPRA 598
Query: 707 WLKSSGNTLVLFEEIGGDPTKISFVTKQLGSSLCSHVTDSHPLPVDMWGSDSKIQRKPGP 766
+LK+S N LVLFEE G +P I +T ++C ++++ HP V W ++
Sbjct: 599 FLKTSDNLLVLFEETGRNPDGIEILTLN-RDTICCYISEHHPTHVRSWKREA-------- 649
Query: 767 VLSLECPNPNQVISSIKFASFGTPLGTCGSFSRGRCSSARSLSVVRQACVGSKSCSIG-- 824
S I+ FG P GTC F G C++ S VV + C+G SCSI
Sbjct: 650 -------------SDIQI--FGDPTGTCXEFIPGNCAAPNSXKVVEKHCLGKSSCSIPVE 694
Query: 825 ---VSVNTFGDPCKGVMKSLAVEASC 847
VS + G+ K+LAV+ C
Sbjct: 695 QEIVSKDGISISGSGITKALAVQVLC 720
Score = 101 bits (251), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 48/97 (49%), Positives = 64/97 (65%), Gaps = 3/97 (3%)
Query: 15 FVVLATTSFGAN-VTYDHRAVVIGGKRRVLISGSIHYPRSTPEMWPDLIQKSKDGGLDVI 73
VV+ + G V+YD R +++ GKR +L SGSIHYPRS PEMWPD+I K++ GGL+VI
Sbjct: 43 LVVVRLSMVGVKGVSYDGRPLIVNGKRELLFSGSIHYPRSIPEMWPDIIXKARHGGLNVI 102
Query: 74 ETYVFWNLHEPVRNQYNFEGRY--DLVKFVKLVAEAG 108
TY FWNLHEPV++ R D++ K +A G
Sbjct: 103 HTYAFWNLHEPVQDHMKRFTRMIIDMMSKEKXIASQG 139
>gi|188501572|gb|ACD54699.1| beta-D-galactosidase [Adineta vaga]
Length = 735
Score = 390 bits (1003), Expect = e-105, Method: Compositional matrix adjust.
Identities = 259/756 (34%), Positives = 394/756 (52%), Gaps = 97/756 (12%)
Query: 26 NVTYDHRAVVIGGKRRVLISGSIHYPRSTPEMWPDLIQKSKDGGLDVIETYVFWNLHEPV 85
+V+YDHRA+ I G R +L SG IHYPRSTP MWP L+ K+K+ GL+ I+TYVFWN+HE
Sbjct: 33 HVSYDHRAITINGNRTLLFSGVIHYPRSTPAMWPYLMSKAKEQGLNTIQTYVFWNMHEQK 92
Query: 86 RNQYNFEGRYDLVKFVKLVAEAGLYAHLRIGPYVCAEWNFGGFPLWLHFIPGIQFRTDNE 145
R Y+F GR +L F++ A AGL+ +LR+GPYVCAEW++G P+WL+ IP I FR+ N+
Sbjct: 93 RGTYDFSGRANLSLFLQEAANAGLFVNLRLGPYVCAEWDYGALPVWLNNIPNIAFRSSND 152
Query: 146 PFKAEMQRFTAKIVDMMKQEKLYASQGGPIILSQIENEYGNIDSAYGAAGKSYIKWAAGM 205
+K+EM+RF + I+ + + A GGPIIL+QIENEYG D ++Y+ W +
Sbjct: 153 AWKSEMKRFLSDII--VYVDGFLAKNGGPIILAQIENEYGGND-------RAYVDWCGSL 203
Query: 206 ALS--LDTGVPWVMCQQSDAPDPIINTCNGFYC------DQFTPNSNNKPKMWTENWSGW 257
+ T +PW+MC A + I TCNG C D+ N+P ++TENW GW
Sbjct: 204 VSNDFASTQIPWIMC-NGLAANSTIETCNGCNCFDDGWMDRHRRTYPNQPLLFTENW-GW 261
Query: 258 FLSFGGAVPYRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNFDRTSGGPFISTSYDYDAP 317
F +G + R EDLA++VA +F GG + YYM+HGG ++ RT GG ++T+Y D
Sbjct: 262 FQGWGEGLGIRTPEDLAYSVAEWFANGGAYHAYYMWHGGNHYGRT-GGSGLTTAYSDDVI 320
Query: 318 LDEYGLIRQPKWGHLKDLHKAIKLCEAALVATDPTYPSLGPNLEATVYKTGSGLCS---- 373
L G +PK+ HL L + + L++ D + P + + G+
Sbjct: 321 LRADGTPNEPKFTHLNRLQRLLASQAQVLLSQDSARLPI-PYWDGKQWSVGTQQMVYSYP 379
Query: 374 ---AFLANIGTNSDVTVKFNGNSYLLPAWSVSILPDCKNVVFNTAKI-----NSVTLVPS 425
F+ N S + V FN + + SV I + +++++N+A + N+ LVP
Sbjct: 380 PSIQFVINQAAFS-LFVLFNKQNISIAGQSVQIYDNNEHLLWNSADVSGIFRNNTFLVP- 437
Query: 426 FSRQSLQVAADSSDAIGSGWSYINEPVGISKDDAFTKPGLLEQINTTADQSDYLWYSLST 485
+ D W +EP +S LEQ+N T D++ YLWY
Sbjct: 438 -------IVVGPLD-----WQVYSEPF-LSDLPVIVASTPLEQLNLTNDETIYLWY--RR 482
Query: 486 NIKADEPLLEDGSKTVLHVQS-LGHALHAFINGKLVG-----SGYGSSSNAKVTVDFPIA 539
N+ +P ++T++ VQ+ ++L F++ + VG S + N +T++
Sbjct: 483 NVSLSQP----SAQTIVQVQTRRANSLIFFMDRQFVGYFDDHSHAQGTINVNITLNLSQF 538
Query: 540 LAPGKNTFDLLSLTVGLQNY----GAFYEKTGAGITGPVQLKGSGNGTNIDLSSQQWTYQ 595
L + F++LS+++G+ N+ G+F K GI G V L G + + + W +Q
Sbjct: 539 LPNQQYLFEILSVSLGIDNFNIGPGSFEYK---GIVGNVSLGGQ---SLVGDEASIWEHQ 592
Query: 596 TGLKGE--ELNFPSGSST-QWDSKSTLPKLQPLVWYKTTFD------APAGSEPVAIDFT 646
GL GE ++ GS T +W+ + T + + W++T FD + PV +D
Sbjct: 593 KGLFGEAYQIYTEQGSKTVEWNPRWTTAINKSVTWFQTRFDLNHLVREDLNANPVLLDAF 652
Query: 647 GMGKGEAWVNGQSIGRYWPTY-VSQNGGCTDSCNYRGAYSSNKCLK---NCGKPSQSLYH 702
G+ +G A+VNG IG YW QN C CL+ NC +PSQ YH
Sbjct: 653 GLNRGHAFVNGNDIGLYWLIEGTCQNKLCC-------------CLQNQTNCQQPSQRYYH 699
Query: 703 VPRSWLKSSGNTLVLFEEIGG-DPTKISFVTKQLGS 737
+P WLK + N L +FEEIG P + V + + S
Sbjct: 700 IPSDWLKPTNNLLTVFEEIGASSPKSVGLVQRIVNS 735
>gi|62869847|gb|AAY18074.1| beta-galactosidase [Carica papaya]
Length = 263
Score = 389 bits (998), Expect = e-105, Method: Compositional matrix adjust.
Identities = 178/264 (67%), Positives = 213/264 (80%), Gaps = 1/264 (0%)
Query: 142 TDNEPFKAEMQRFTAKIVDMMKQEKLYASQGGPIILSQIENEYGNIDSAYGAAGKSYIKW 201
TDNEPFKA MQ+FT KIV MMK E+L+ SQGGPIILSQIENE+G ++ GA GK+Y KW
Sbjct: 1 TDNEPFKAAMQKFTEKIVSMMKAEQLFQSQGGPIILSQIENEFGPVEWEIGAPGKAYTKW 60
Query: 202 AAGMALSLDTGVPWVMCQQSDAPDPIINTCNGFYCDQFTPNSNNKPKMWTENWSGWFLSF 261
AA MA+ L+TGVPW+MC+Q DAPDP+I+TCNGFYC+ FTPN N KPKMWTE W+GW+ F
Sbjct: 61 AARMAVGLNTGVPWIMCKQEDAPDPVIDTCNGFYCENFTPNKNYKPKMWTEVWTGWYTEF 120
Query: 262 GGAVPYRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNFDRTSGGPFISTSYDYDAPLDEY 321
GGAVP RP EDLAF++ARF Q+GG+ NYYMYHGGTNF RT+GGPF++TSYDYDAPLDEY
Sbjct: 121 GGAVPTRPAEDLAFSIARFIQKGGSSVNYYMYHGGTNFGRTAGGPFMATSYDYDAPLDEY 180
Query: 322 GLIRQPKWGHLKDLHKAIKLCEAALVATDPTYPSLGPNLEATVYKTGSGLCSAFLANIGT 381
GL R+PKWGHL++LHKAIK E+ALV+ +P+ SLG + EA +K+ SG C+AFLAN T
Sbjct: 181 GLPREPKWGHLRNLHKAIKSSESALVSAEPSVTSLGNSQEAHAFKSKSG-CAAFLANYDT 239
Query: 382 NSDVTVKFNGNSYLLPAWSVSILP 405
S V F Y LP WS+SILP
Sbjct: 240 KSSAKVSFGNGQYELPPWSISILP 263
>gi|356544613|ref|XP_003540743.1| PREDICTED: beta-galactosidase 8-like [Glycine max]
Length = 288
Score = 389 bits (998), Expect = e-105, Method: Compositional matrix adjust.
Identities = 190/289 (65%), Positives = 227/289 (78%), Gaps = 9/289 (3%)
Query: 258 FLSFGGAVPYRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNFDRTSGGPFISTSYDYDAP 317
F+SFG VP+RPVEDLAFAVARF+QRGGTFQNYYM+HGGTNF RT+GGPFISTSYD+D P
Sbjct: 6 FVSFGDVVPHRPVEDLAFAVARFYQRGGTFQNYYMFHGGTNFGRTTGGPFISTSYDFDTP 65
Query: 318 LDEYGLIRQPKWGHLKDLHKAIKLCEAALVATDPTYPSLGPNLEATVYKTGSGLCSAFLA 377
+DEYG+IRQPKW HLK++HKAIKLCE AL+AT PT LGPN+EA VY G+ + +AFLA
Sbjct: 66 IDEYGIIRQPKWDHLKNVHKAIKLCEKALLATGPTITYLGPNIEAAVYNIGA-VSAAFLA 124
Query: 378 NIGTNSDVTVKFNGNSYLLPAWSVSILPDCKNVVFNTAKINSVTLVPSFSRQSLQVAADS 437
NI +D V FNGNSY LPAW VS LPDCK+VV NTAKINS +++ SF+ +SL+ S
Sbjct: 125 NIA-KTDAKVSFNGNSYHLPAWYVSTLPDCKSVVLNTAKINSASMISSFTTESLKEEVGS 183
Query: 438 SDAIGSGWSYINEPVGISKDDAFTKPGLLEQINTTADQSDYLWYSLSTNIKADEPLLEDG 497
D GSGWS+I+EP+GISK +F+K LLEQINTTAD+SDYLWYS S ++ A
Sbjct: 184 LDDSGSGWSWISEPIGISKAHSFSKFWLLEQINTTADRSDYLWYSSSIDLDA-------A 236
Query: 498 SKTVLHVQSLGHALHAFINGKLVGSGYGSSSNAKVTVDFPIALAPGKNT 546
++TVLH++SLGHALHAF+NGKL GSG G+ V VD PI L GKNT
Sbjct: 237 TETVLHIESLGHALHAFVNGKLAGSGTGNHEKVSVKVDIPITLVYGKNT 285
>gi|188501582|gb|ACD54708.1| beta-D-galactosidase-like protein [Adineta vaga]
Length = 735
Score = 388 bits (996), Expect = e-105, Method: Compositional matrix adjust.
Identities = 256/750 (34%), Positives = 390/750 (52%), Gaps = 87/750 (11%)
Query: 27 VTYDHRAVVIGGKRRVLISGSIHYPRSTPEMWPDLIQKSKDGGLDVIETYVFWNLHEPVR 86
V+YDHRA+ I G R +L SG IHYPRSTP MWP L+ K+K+ GL+ I+TYVFWN+HE R
Sbjct: 34 VSYDHRAITINGNRTLLFSGVIHYPRSTPAMWPYLMSKAKEQGLNTIQTYVFWNIHEQKR 93
Query: 87 NQYNFEGRYDLVKFVKLVAEAGLYAHLRIGPYVCAEWNFGGFPLWLHFIPGIQFRTDNEP 146
Y+F GR +L F++ A AGL+ +LR+GPYVCAEW++G P+WL+ IP I FR+ N+
Sbjct: 94 GTYDFSGRANLSLFLQEAANAGLFVNLRLGPYVCAEWDYGALPVWLNNIPNIAFRSSNDA 153
Query: 147 FKAEMQRFTAKIVDMMKQEKLYASQGGPIILSQIENEYGNIDSAYGAAGKSYIKWAAGMA 206
+K+EM+RF + I+ + + A GGPIIL+QIENEYG D ++Y+ W +
Sbjct: 154 WKSEMKRFLSDIIVYV--DGFLAKNGGPIILAQIENEYGGND-------RAYVDWCGSLV 204
Query: 207 LS--LDTGVPWVMCQQSDAPDPIINTCNGFYC------DQFTPNSNNKPKMWTENWSGWF 258
+ T +PW+MC A + I TCNG C D+ N+P ++TENW GWF
Sbjct: 205 SNDFASTQIPWIMC-NGLAANSTIETCNGCNCFDDGWMDRHRRTYPNQPLLFTENW-GWF 262
Query: 259 LSFGGAVPYRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNFDRTSGGPFISTSYDYDAPL 318
+G + R EDLA++VA +F GG + YYM+HGG ++ RT GG ++T+Y D L
Sbjct: 263 QGWGEGLGIRTPEDLAYSVAEWFANGGAYHAYYMWHGGNHYGRT-GGSGLTTAYSDDVIL 321
Query: 319 DEYGLIRQPKWGHLKDLHKAIKLCEAALVATDPTYPSLGPNLEATVYKTGSGLC------ 372
G +PK+ HL L + + L++ D S+ P + G+
Sbjct: 322 RADGTPNEPKFTHLNRLQRLLASQAQVLLSQDSNRLSI-PYWNGKQWTVGTQQMVYSYPP 380
Query: 373 -SAFLANIGTNSDVTVKFNGNSYLLPAWSVSILPDCKNVVFNTAKINSVTLVPSFSRQSL 431
F+ N S + V FN + + SV I +++++N+A ++ ++ +F +
Sbjct: 381 SVQFVINQAAFS-LFVLFNKQNISIAGQSVQIYDYNEHLLWNSADVSGISRNNTFLVPIV 439
Query: 432 QVAADSSDAIGSGWSYINEPVGISKDDAFTKPGLLEQINTTADQSDYLWYSLSTNIKADE 491
D W +EP S LEQ+N T D++ YLWY N+ +
Sbjct: 440 VGPLD--------WQVYSEPF-TSDLPVIVASTPLEQLNLTNDETIYLWY--RRNVSLSQ 488
Query: 492 PLLEDGSKTVLHVQS-LGHALHAFINGKLVG-----SGYGSSSNAKVTVDFPIALAPGKN 545
P ++ T++ VQ+ ++L F++ + VG S + N +T++ L +
Sbjct: 489 PSVQ----TIVQVQTRRANSLLFFMDRQFVGYFDDHSHTQGTINVNITLNLSQFLPNQQY 544
Query: 546 TFDLLSLTVGLQNY----GAFYEKTGAGITGPVQLKGSGNGTNIDLSSQQWTYQTGLKGE 601
F++LS+++G+ N+ G+F K GI G V L G + + + W +Q GL GE
Sbjct: 545 IFEILSVSLGIDNFNIGPGSFEYK---GIVGNVSLGGQ---SLVGDEASIWEHQKGLFGE 598
Query: 602 --ELNFPSGSST-QWDSKSTLPKLQPLVWYKTTFD------APAGSEPVAIDFTGMGKGE 652
++ GS T +W+ K T +P+ W++T FD + P+ +D G +G
Sbjct: 599 AHQIYTEQGSKTVEWNPKWTTVINKPVTWFQTRFDLNHLAREDLNANPILLDAFGFNRGH 658
Query: 653 AWVNGQSIGRYWPTY-VSQNGGCTDSCNYRGAYSSNKCLK---NCGKPSQSLYHVPRSWL 708
A+VNG IG YW QN C CL+ NC +PSQ YH+ WL
Sbjct: 659 AFVNGNDIGLYWLIEGTCQNNLCC-------------CLQNQTNCQQPSQRYYHISSDWL 705
Query: 709 KSSGNTLVLFEEIGG-DPTKISFVTKQLGS 737
K + N L +FEEIG P + V + + +
Sbjct: 706 KPTNNLLTVFEEIGASSPKSVGLVQRIINT 735
>gi|217075791|gb|ACJ86255.1| unknown [Medicago truncatula]
Length = 267
Score = 383 bits (983), Expect = e-103, Method: Compositional matrix adjust.
Identities = 189/277 (68%), Positives = 223/277 (80%), Gaps = 10/277 (3%)
Query: 292 MYHGGTNFDRTSGGPFISTSYDYDAPLDEYGLIRQPKWGHLKDLHKAIKLCEAALVATDP 351
MYHGGTNFDR++GGPFI+TSYDYDAP+DEYG+IRQ KWGHLKD++KAIKLCE AL+ TDP
Sbjct: 1 MYHGGTNFDRSTGGPFIATSYDYDAPIDEYGIIRQQKWGHLKDVYKAIKLCEEALITTDP 60
Query: 352 TYPSLGPNLEATVYKTGSGLCSAFLANIGTNSDVTVKFNGNSYLLPAWSVSILPDCKNVV 411
SLG NLEA VYKTGS +C+AFLAN+ T +D TV F+GNSY LPAWSVS+LPDCKNVV
Sbjct: 61 KISSLGQNLEAAVYKTGS-VCAAFLANVDTKNDKTVNFSGNSYHLPAWSVSMLPDCKNVV 119
Query: 412 FNTAKINSVTLVPSFSRQSLQVAADSSDAIGSGWSYINEPVGISKDDAFTKPGLLEQINT 471
NTAKINS + + +F + + SS S WS+INEPVGISKDD +K GLLEQINT
Sbjct: 120 LNTAKINSASAISNFVTEDISSLETSS----SKWSWINEPVGISKDDILSKTGLLEQINT 175
Query: 472 TADQSDYLWYSLSTNIKADEPLLEDGSKTVLHVQSLGHALHAFINGKLVGSGYGSSSNAK 531
TAD+SDYLWYSLS ++ AD+P GS+TVLH++SLGH LHAFINGKL G+ G+S +K
Sbjct: 176 TADRSDYLWYSLSLDL-ADDP----GSQTVLHIESLGHTLHAFINGKLAGNQAGNSDKSK 230
Query: 532 VTVDFPIALAPGKNTFDLLSLTVGLQNYGAFYEKTGA 568
+ VD PIAL GKN DLLSLTVGLQNYGAF++ GA
Sbjct: 231 LNVDIPIALVSGKNKIDLLSLTVGLQNYGAFFDTVGA 267
>gi|356503083|ref|XP_003520341.1| PREDICTED: LOW QUALITY PROTEIN: beta-galactosidase 15-like [Glycine
max]
Length = 482
Score = 380 bits (975), Expect = e-102, Method: Compositional matrix adjust.
Identities = 172/321 (53%), Positives = 219/321 (68%), Gaps = 5/321 (1%)
Query: 21 TSFGANVTYDHRAVVIGGKRRVLISGSIHYPRSTPEMWPDLIQKSKDGGLDVIETYVFWN 80
+ F V+YD + +I ++ ++ SG +HYP ST ++WP + ++ K GGLD IE+Y+FW+
Sbjct: 3 SCFATEVSYDAHSHIINEEKHIIFSGVVHYPXSTVDLWPAIFKRXKYGGLDAIESYIFWD 62
Query: 81 LHEPVRNQYNFEGRYDLVKFVKLVAEAGLYAHLRIGPYVCAEWNFGGFPLWLHFIPGIQF 140
HEPVR +Y+ G D + F+KL+ EA LY LRIGPYVC WNFGGF LWLH +P I+
Sbjct: 63 RHEPVRREYDCSGNLDFIDFLKLIQEAELYFILRIGPYVCEXWNFGGFSLWLHNMPEIEL 122
Query: 141 RTDNEPFKAEMQRFTAKIVDMMKQEKLYASQGGPIILSQIENEYGNIDSAYGAAGKSYIK 200
R DN K EMQ FT KIV+M K+ KL+A GGPIIL+ IENEYGNI + Y A K YIK
Sbjct: 123 RIDNPIXKNEMQIFTTKIVNMAKEAKLFAPXGGPIILTPIENEYGNIMTDYREARKPYIK 182
Query: 201 WAAGMALSLDTGVPWVMCQQSDAPDPIINTCNGFYCDQFTPNSNNKPKMWTENWSGWFLS 260
W A MAL+ + GVPW+MC DAP P+INTCNG YCD F PN+ KM+ F
Sbjct: 183 WCAQMALTQNIGVPWIMCXXRDAPQPMINTCNGHYCDSFXPNNPKSSKMFRX-----FQK 237
Query: 261 FGGAVPYRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNFDRTSGGPFISTSYDYDAPLDE 320
+G VP++ E+ F+VARFFQ GG NYYMYHGGTNF GGP+++ SY+YDAPLDE
Sbjct: 238 WGERVPHKSAEESTFSVARFFQSGGILNNYYMYHGGTNFGHMVGGPYMTASYEYDAPLDE 297
Query: 321 YGLIRQPKWGHLKDLHKAIKL 341
YG + +PKW H K LHK +
Sbjct: 298 YGNLNKPKWEHFKQLHKELTF 318
Score = 52.0 bits (123), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 25/50 (50%), Positives = 30/50 (60%)
Query: 777 QVISSIKFASFGTPLGTCGSFSRGRCSSARSLSVVRQACVGSKSCSIGVS 826
+ IS I+FASFG P G CGSF G + S SVV AC+G SC V+
Sbjct: 429 KTISQIQFASFGNPEGNCGSFKGGTWEATDSQSVVEVACIGRNSCGFTVT 478
Score = 51.6 bits (122), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 20/43 (46%), Positives = 30/43 (69%)
Query: 632 FDAPAGSEPVAIDFTGMGKGEAWVNGQSIGRYWPTYVSQNGGC 674
F+AP G +P+ +D GK +AWVNG+SIG YW ++++ GC
Sbjct: 363 FEAPFGIDPMVMDLQDSGKRQAWVNGKSIGCYWSSWITNTNGC 405
>gi|56550179|emb|CAE51355.1| putative beta-galactosidase [Musa acuminata]
Length = 281
Score = 369 bits (946), Expect = 5e-99, Method: Compositional matrix adjust.
Identities = 177/286 (61%), Positives = 210/286 (73%), Gaps = 5/286 (1%)
Query: 122 EWNFGGFPLWLHFIPGIQFRTDNEPFKAEMQRFTAKIVDMMKQEKLYASQGGPIILSQIE 181
EWNFGGFP+WL ++PGI FRTDN PFKA M +FT KIV MMK E L+ SQGGPIILSQIE
Sbjct: 1 EWNFGGFPVWLKYVPGINFRTDNGPFKAAMAKFTEKIVAMMKSEGLFESQGGPIILSQIE 60
Query: 182 NEYGNIDSAYGAAGKSYIKWAAGMALSLDTGVPWVMCQQSDAPDPIINTCNGFYCDQFTP 241
NEYG ++ G A K+Y+ WAA MA+ L+T VPWVMC+Q DAPDP+IN CNGFYCD F+P
Sbjct: 61 NEYGPVEYYGGTAAKNYLSWAAQMAVGLNTRVPWVMCKQDDAPDPVINACNGFYCDYFSP 120
Query: 242 NSNNKPKMWTENWSGWFLSFGGAVPYRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNFDR 301
N KP MWTE W+GWF F G V + A V R + T + GTNF R
Sbjct: 121 NKPYKPTMWTEAWTGWFTGFRGPVLTDCEDCFAVQVIRRWILVTTIVPW-----GTNFGR 175
Query: 302 TSGGPFISTSYDYDAPLDEYGLIRQPKWGHLKDLHKAIKLCEAALVATDPTYPSLGPNLE 361
T+GGPFISTSYDYDAP+DEYGL+RQPKWGHL+DLHKAIK+CE ALV+ DPT LG E
Sbjct: 176 TAGGPFISTSYDYDAPIDEYGLLRQPKWGHLRDLHKAIKMCEPALVSGDPTVTKLGNYQE 235
Query: 362 ATVYKTGSGLCSAFLANIGTNSDVTVKFNGNSYLLPAWSVSILPDC 407
A VY++ SG C+AFL+N +S +V FNG Y +P+WS+SILPDC
Sbjct: 236 AHVYRSKSGSCAAFLSNFNPHSYASVTFNGMKYNIPSWSISILPDC 281
>gi|255563859|ref|XP_002522930.1| beta-galactosidase, putative [Ricinus communis]
gi|223537857|gb|EEF39473.1| beta-galactosidase, putative [Ricinus communis]
Length = 450
Score = 368 bits (944), Expect = 8e-99, Method: Compositional matrix adjust.
Identities = 215/502 (42%), Positives = 283/502 (56%), Gaps = 67/502 (13%)
Query: 180 IENEYGNIDSAYGAAGKSYIKWAAGMALSLDTGVPWVMCQQSDAPDPIINTCNGFYCDQF 239
IENEYGNI++A+ G SY+ WAA MA+ L TGVPW+MC+Q DAPDP+INTCNG C +
Sbjct: 1 IENEYGNIEAAFHEKGSSYVHWAAKMAVDLQTGVPWIMCKQIDAPDPVINTCNGMKCGET 60
Query: 240 T--PNSNNKPKMWTENWSGWFLSFGGAVPYRPVEDLAFAVARFFQRGGTFQNYYMYHGGT 297
PNS NKP +WTENW+ ++ +GG R +D+AF VA F + G++ NYYMYHGGT
Sbjct: 61 FGGPNSPNKPSLWTENWTSFYQVYGGEPYIRSAQDIAFHVALFIAKNGSYVNYYMYHGGT 120
Query: 298 NFDRTSGGPFISTSYDYDAPLDEYGLIRQPKWGHLKDLHKAIKLCEAALVATDPTYPSLG 357
NF RT+ I+ YD APLDEYGLIRQPKWGHLK+LH IK C L+ T S+G
Sbjct: 121 NFGRTAAAYVITGYYD-QAPLDEYGLIRQPKWGHLKELHAVIKSCSTTLLEGVQTNLSVG 179
Query: 358 PNLEATVYKTGSGLCSAFLANIGTNSDVTVKFNGNSYLLPAWSVSILPDCKNVVFNTAKI 417
+A +++ G C AFL N + + TV F S+ L S+SILPDC N++FNTAK+
Sbjct: 180 QLQQAYMFEAQGGGCVAFLVN-NDSVNATVGFRNKSFELLPKSISILPDCDNIIFNTAKV 238
Query: 418 NSVTLVPSFSRQSLQVAADSSDAIGSGWSYINEPVGISKDDAFTKPGLLEQINTTADQSD 477
N+ S + SS + + YI+ S D LLE +NTT D+SD
Sbjct: 239 NA---------GSNRRITTSSKKLNTWEKYIDVIPNYS-DSTIKSDTLLEHMNTTKDKSD 288
Query: 478 YLWYSLS--TNIKADEPLLEDGSKTVLHVQSLGHALHAFINGKLVGSGYGSSSNAKV--T 533
YLWY+ S N+ +PL LHV+SL H +AF+N K GS +G S N KV
Sbjct: 289 YLWYTFSFQPNLSCTKPL--------LHVESLAHVAYAFVNNKYSGSAHG-SKNGKVPFI 339
Query: 534 VDFPIALAPG--KNTFDLLSLTVGLQNYGAFYEKTGAGITGP-VQLKGSGNGTNIDLSSQ 590
++ PI L N +LS+ VGL G+ G +QL G + L
Sbjct: 340 MEVPIVLDDDGLSNNISILSVLVGLS----------VGLLGETLQLYGKEH-----LEMV 384
Query: 591 QWTYQTGLKGEELNFPSGSSTQWDSKSTLPKLQPLVWYKTTFDAPAGSEPVAIDFTGMGK 650
+W SK+ + QPL W+K FD P G++PV ++ M K
Sbjct: 385 KW----------------------SKADISIAQPLTWFKLEFDTPKGNDPVVLNLATMSK 422
Query: 651 GEAWVNGQSIGRYWPTYVSQNG 672
GEAWVNGQSIGRYW ++++ G
Sbjct: 423 GEAWVNGQSIGRYWISFLTSKG 444
>gi|320129049|gb|ADW19770.1| beta-galactosidase [Fragaria chiloensis]
Length = 219
Score = 367 bits (943), Expect = 1e-98, Method: Composition-based stats.
Identities = 163/219 (74%), Positives = 183/219 (83%)
Query: 56 EMWPDLIQKSKDGGLDVIETYVFWNLHEPVRNQYNFEGRYDLVKFVKLVAEAGLYAHLRI 115
EMWPDLIQ++KDGGLDVI+TYVFWN HEP +Y FE YDLVKF+KLV +AGLY HLRI
Sbjct: 1 EMWPDLIQRAKDGGLDVIQTYVFWNGHEPSPGKYYFEDNYDLVKFIKLVQQAGLYVHLRI 60
Query: 116 GPYVCAEWNFGGFPLWLHFIPGIQFRTDNEPFKAEMQRFTAKIVDMMKQEKLYASQGGPI 175
GPYVCAEWNFGGFP+WL +IPGIQFRTDN PFK +MQRFT KIV+MMK E+L+ S GGPI
Sbjct: 61 GPYVCAEWNFGGFPVWLKYIPGIQFRTDNGPFKDQMQRFTTKIVNMMKAERLFESHGGPI 120
Query: 176 ILSQIENEYGNIDSAYGAAGKSYIKWAAGMALSLDTGVPWVMCQQSDAPDPIINTCNGFY 235
ILSQIENEYG ++ GA GK+Y WAA MA+ L TGVPWVMC+Q DAPDP+IN CNGFY
Sbjct: 121 ILSQIENEYGPMEYEIGAPGKAYTDWAAQMAVGLGTGVPWVMCKQDDAPDPVINACNGFY 180
Query: 236 CDQFTPNSNNKPKMWTENWSGWFLSFGGAVPYRPVEDLA 274
CD F+PN KPKMWTE W+GWF FGGAVPYRP EDLA
Sbjct: 181 CDYFSPNKAYKPKMWTEAWTGWFTEFGGAVPYRPAEDLA 219
>gi|227204157|dbj|BAH56930.1| AT4G35010 [Arabidopsis thaliana]
Length = 377
Score = 367 bits (941), Expect = 2e-98, Method: Compositional matrix adjust.
Identities = 168/292 (57%), Positives = 216/292 (73%), Gaps = 3/292 (1%)
Query: 27 VTYDHRAVVIGGKRRVLISGSIHYPRSTPEMWPDLIQKSKDGGLDVIETYVFWNLHEPVR 86
VTYD +++I GKR +L SGSIHYPRSTPEMWP +I+++K GGL+ I+TYVFWN+HEP +
Sbjct: 41 VTYDGTSLIIDGKRELLYSGSIHYPRSTPEMWPSIIKRAKQGGLNTIQTYVFWNVHEPQQ 100
Query: 87 NQYNFEGRYDLVKFVKLVAEAGLYAHLRIGPYVCAEWNFGGFPLWLHFIPGIQFRTDNEP 146
++NF GR DLVKF+KL+ + G+Y LR+GP++ AEW GG P WL +PGI FRTDN+
Sbjct: 101 GKFNFSGRADLVKFIKLIQKNGMYVTLRLGPFIQAEWTHGGLPYWLREVPGIFFRTDNKQ 160
Query: 147 FKAEMQRFTAKIVDMMKQEKLYASQGGPIILSQIENEYGNIDSAYGAAGKSYIKWAAGMA 206
FK +R+ I+D MK+E+L+ASQGGPIIL QIENEY + AY G +YIKWA+ +
Sbjct: 161 FKEHTERYVRMILDKMKEERLFASQGGPIILGQIENEYSAVQRAYKQDGLNYIKWASNLV 220
Query: 207 LSLDTGVPWVMCQQSDAPDPIINTCNGFYC-DQFT-PNSNNKPKMWTENWSGWFLSFGGA 264
S+ G+PWVMC+Q+DAPDP+IN CNG +C D F PN NKP +WTENW+ F FG
Sbjct: 221 DSMKLGIPWVMCKQNDAPDPMINACNGRHCGDTFPGPNRENKPSLWTENWTTQFRVFGDP 280
Query: 265 VPYRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNFDRTSGGPFISTSYDYDA 316
R VED+A++VARFF + GT NYYMYHGGTNF RTS +++T Y DA
Sbjct: 281 PTQRSVEDIAYSVARFFSKNGTHVNYYMYHGGTNFGRTSAH-YVTTRYYEDA 331
>gi|348687417|gb|EGZ27231.1| hypothetical protein PHYSODRAFT_553859 [Phytophthora sojae]
Length = 825
Score = 365 bits (938), Expect = 5e-98, Method: Compositional matrix adjust.
Identities = 251/769 (32%), Positives = 381/769 (49%), Gaps = 104/769 (13%)
Query: 19 ATTSFGANVTYDHRAVVIGGKRRVLISGSIHYPRSTPEMWPDLIQKSKDGGLDVIETYVF 78
A G +V+Y R I G+R +L+ GSIHYPRS+ W L++ +K GL+ IE YVF
Sbjct: 79 AKRQAGYSVSYSARGFEIDGRRTLLLGGSIHYPRSSEGEWETLLRAAKRDGLNHIEMYVF 138
Query: 79 WNLHEPVRNQYNFEGRYDLVKFVKLVAEAGLYAHLRIGPYVCAEWNFGGFPLWLHFIPGI 138
WNLHE R +NF G + +F +L AE GL+ H+R GPYVCAEW+ GG PLWL++IPG+
Sbjct: 139 WNLHEQERGVFNFAGNANATRFYELAAEVGLFLHVRFGPYVCAEWSNGGLPLWLNWIPGM 198
Query: 139 QFRTDNEPFKAEMQRFTAKIVDMMKQEKLYASQGGPIILSQIENEYGNIDSAYGAAGKSY 198
+ R+ N P++ EM+RF +V++ + A GGPII++QIENE+ D Y
Sbjct: 199 KVRSSNAPWQWEMERFVTYMVELSR--PFLAKNGGPIIMAQIENEFAMHDP-------EY 249
Query: 199 IKWAAGMALSLDTGVPWVMCQQSDAPDPIINTCNGFYCDQF----TPNSNNKPKMWTENW 254
++W + LDT +PWVMC + A + I+ +CNG C F + P +WTE+
Sbjct: 250 VEWCGDLVKRLDTSIPWVMCYANAAENTIL-SCNGNDCVDFAVKHVKERPSDPLVWTED- 307
Query: 255 SGWFLSFG----GAVP--YRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNFDRTSGGPFI 308
GWF ++ +P R ED+A+AVAR+F GG NYYMYHGG NF R + +
Sbjct: 308 EGWFQTWAKDKKNPLPNDQRTAEDMAYAVARWFAVGGAAHNYYMYHGGNNFGRAASAG-V 366
Query: 309 STSYDYDAPLDEYGLIRQPKWGHLKDLHKAIKLCEAALVATDPTY-------PSLGPNLE 361
+T Y L GL +PK HL+ LH+A+ C L+ D P+ G E
Sbjct: 367 TTKYADGVNLHSDGLSNEPKRSHLRKLHEALIDCNDILMRNDRQLLHPHELAPTHGETAE 426
Query: 362 AT-------VYKTGSGLCS-AFLANIGTNSDVTVKFNGNSYLLPAWSVSILPDCKNVVFN 413
A+ +Y G AFL N + VTV F N Y L S+ I+ D ++FN
Sbjct: 427 ASSLQQRAFIYGAEDGPNQVAFLEN-QADKKVTVVFRDNKYELAPTSMMIIKDGA-LLFN 484
Query: 414 TAKINSVTLVPSFSRQSLQVAADSSDAIGSGWSYINEPVGISKDDAFTKPGLLEQINTTA 473
TA + P ++ ++ WS +N ++ +EQ+ TA
Sbjct: 485 TADVRKS--FPGTVHRAYTPIVQAATLQWETWSELNVS-SLTPRRRVVAERPVEQLRLTA 541
Query: 474 DQSDYLWYSLSTNIK-ADEPLLEDGSKTVLHVQSL-GHALHAFINGKLVGSGYGSSSNAK 531
D+SDYL Y + + AD P+ D + + V S ++ AF++G L+G +
Sbjct: 542 DRSDYLTYETTFTVDPADTPIDIDSDASTVKVTSCEASSIIAFVDGWLIGERNLAYPGGN 601
Query: 532 VTVDFPIAL-----APGKNTFDLLSLTVGLQNYGAFYEKTGAGITGPVQLKGSGNGTNID 586
+ +F +L +++ L+S+++G+ + G+ + K G+TG V++ G
Sbjct: 602 CSKEFRFSLPTNIDVTRQHSLKLVSVSLGIYSLGSNHTK---GLTGKVRV-----GRKNL 653
Query: 587 LSSQQWTYQTGLKGEELNFPSG---SSTQWDSKSTLPKL-----QPLVWYKTT-----FD 633
QW L GE+L SS W + +P++ Q + WY T+ F+
Sbjct: 654 AKGHQWEMYPTLVGEQLEIYRPEWLSSVPW---TPVPRVVASGRQLMSWYWTSFSYPAFE 710
Query: 634 APAGSEPVA------IDFTGMGKGEAWVNGQSIGRYWPTYVSQNGGCTDSCNYRGAYSSN 687
PA ++PV+ +D G+ +G A++NG +GRYW
Sbjct: 711 LPAEADPVSEPFSILLDCIGLTRGRAYINGHDLGRYW----------------------- 747
Query: 688 KCLKNCGKPSQSLYHVPRSWL-KSSGNTLVLFEEIGGDPTKISFVTKQL 735
+ + G+ Q YHVPR WL K N LV+F+E+GG + V+ +
Sbjct: 748 -LVNDEGEFVQRYYHVPRDWLVKDQANVLVVFDELGGSVADVRLVSSSM 795
>gi|56550181|emb|CAE51356.1| putative beta-galactosidase [Musa AAB Group]
Length = 282
Score = 356 bits (913), Expect = 4e-95, Method: Compositional matrix adjust.
Identities = 175/291 (60%), Positives = 212/291 (72%), Gaps = 14/291 (4%)
Query: 122 EWNFGGFPLWLHFIPGIQFRTDNEPFKAEMQRFTAKIVDMMKQEKLYASQGGPIILSQIE 181
EWNFGGFP+WL ++PGI FRTDN PFKA M +FT KIV MMK E L+ SQGGPIILSQIE
Sbjct: 1 EWNFGGFPVWLKYVPGINFRTDNGPFKAAMAKFTEKIVAMMKSEGLFESQGGPIILSQIE 60
Query: 182 NEYGNIDSAYGAAGKSYIKWAAGMALSLDTGVPWVMCQQSDAPDPIINTCNGFYCDQFTP 241
NEYG ++ GAA K+Y+ WAA MA+ L+TGVPWVMC+Q DAPDP+IN NGFYCD F+P
Sbjct: 61 NEYGPVEYYGGAAAKNYLSWAAQMAVGLNTGVPWVMCKQDDAPDPVINAGNGFYCDYFSP 120
Query: 242 NSNNKPKMWTENWSG-----WFLSFGGAVPYRPVEDLAFAVARFFQRGGTFQNYYMYHGG 296
NS + + G W + G+ + V F V + + G F+NYYMYHGG
Sbjct: 121 NS-------LKTFFGGLKLDWLVPVSGSSSSQTVRT-GFCV-QVYTEGWIFRNYYMYHGG 171
Query: 297 TNFDRTSGGPFISTSYDYDAPLDEYGLIRQPKWGHLKDLHKAIKLCEAALVATDPTYPSL 356
TNF RT+GG FISTSYDYDAP+DEY L+RQPKWGHL+DLHKAIK+CE ALV+ DPT L
Sbjct: 172 TNFGRTAGGLFISTSYDYDAPIDEYVLLRQPKWGHLRDLHKAIKMCEPALVSGDPTVTKL 231
Query: 357 GPNLEATVYKTGSGLCSAFLANIGTNSDVTVKFNGNSYLLPAWSVSILPDC 407
G EA VY++ SG C+AFL+N +S +V FNG Y +P+WS+SILPDC
Sbjct: 232 GNYQEAHVYRSKSGSCAAFLSNFNPHSYASVTFNGMKYNIPSWSISILPDC 282
>gi|414590082|tpg|DAA40653.1| TPA: hypothetical protein ZEAMMB73_851266 [Zea mays]
Length = 580
Score = 355 bits (912), Expect = 5e-95, Method: Compositional matrix adjust.
Identities = 209/609 (34%), Positives = 310/609 (50%), Gaps = 47/609 (7%)
Query: 249 MWTENWSGWFLSFGGAVPYRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNFDRTSGGPFI 308
+WTENW+ F ++G V R ED+A+AV RFF +GG+ NYYMYHGGTNF RT G ++
Sbjct: 2 LWTENWTQQFRAYGDQVAMRSAEDIAYAVLRFFAKGGSLVNYYMYHGGTNFGRT-GASYV 60
Query: 309 STSYDYDAPLDEYGLIRQPKWGHLKDLHKAIKLCEAALVATDPTYPSLGPNLEATVYKT- 367
T Y +AP+DEYG+ ++PK+GHL+DLH I+ + A + + LG EA +++
Sbjct: 61 LTGYYDEAPMDEYGMYKEPKFGHLRDLHNVIRSYQKAFLWGQHSSEILGHGYEAHIFELP 120
Query: 368 GSGLCSAFLANIGTNSDVTVKFNGNSYLLPAWSVSILPDCKNVVFNTAKINSVTLVPSFS 427
LC +FL+N T D TV F G+ + +P+ SVSIL CKNVV+NT ++ S
Sbjct: 121 EEKLCLSFLSNNNTGEDGTVIFRGDKHYVPSRSVSILAGCKNVVYNTKRV-----FVQHS 175
Query: 428 RQSLQVAADSSDAIGSGWSYINEPVGISKDDAFTKPGLLEQINTTADQSDYLWYSLSTNI 487
+S + +S + W +E + +D LEQ N T D +DYLWY+ S +
Sbjct: 176 ERSFHTSDVTSK--NNQWEMFSETIPKYRDTKVRTKEPLEQYNQTKDDTDYLWYTTSFRL 233
Query: 488 KADEPLLEDGSKTVLHVQSLGHALHAFINGKLVGSGYGSSSNAKVTVDFPIALAPGKNTF 547
++D+ + + VL V+S HA+ F N VG G+ + P+ L G N
Sbjct: 234 ESDDLPFRNDIRPVLQVKSSAHAMMGFANDAFVGCARGNKQVKGFMFEKPVDLKVGVNHV 293
Query: 548 DLLSLTVGLQNYGAFYEKTGAGITGPVQLKGSGNGTNIDLSSQQWTYQTGLKGE--ELNF 605
LLS T+G+++ G + GI ++G GT +DL W ++ L+GE E+
Sbjct: 294 VLLSSTMGMKDSGGELAEVKGGIQ-ECLIQGLNTGT-LDLQVNGWGHKAALEGEYKEIYS 351
Query: 606 PSG-SSTQWDSKSTLPKLQPLVWYKTTFDAPAGSEPVAIDFTGMGKGEAWVNGQSIGRYW 664
G QW + WYK FD P G +PV +D + M KG +VNG+ +GRYW
Sbjct: 352 EKGLGKVQWKPAEN---DRAATWYKRYFDEPDGDDPVVLDMSSMSKGMIFVNGEGVGRYW 408
Query: 665 PTYVSQNGGCTDSCNYRGAYSSNKCLKNCGKPSQSLYHVPRSWLKSSGNTLVLFEEIGGD 724
+Y + G PSQ++YH+PR +LKS N LV+FEE G
Sbjct: 409 VSYRTL----------------------AGTPSQAVYHIPRPFLKSKDNLLVIFEEEMGK 446
Query: 725 PTKISFVTKQLGSSLCSHVTDSHPLPVDMWGSDSK----IQRKPGPVLSLECPNPNQVIS 780
P I V +C +++ +P + W +D I +L CP P + I
Sbjct: 447 PDGI-LVQTVTRDDICLFISEHNPGQIKTWDTDGDKIKLIAEDHSRRGTLTCP-PEKTIQ 504
Query: 781 SIKFASFGTPLGTCGSFSRGRCSSARSLSVVRQACVGSKSCSIGVSVNTFGDP--CKGVM 838
+ FASFG P G CG+F+ G C + + +V + C+G SC + V +G C+
Sbjct: 505 EVVFASFGNPDGMCGNFTVGTCHTPNAKQIVEKECLGKPSCMLPVDHTVYGADINCQSTT 564
Query: 839 KSLAVEASC 847
+L V+ C
Sbjct: 565 ATLGVQVRC 573
>gi|293331757|ref|NP_001169479.1| uncharacterized protein LOC100383352 [Zea mays]
gi|224029591|gb|ACN33871.1| unknown [Zea mays]
Length = 580
Score = 355 bits (910), Expect = 9e-95, Method: Compositional matrix adjust.
Identities = 209/609 (34%), Positives = 310/609 (50%), Gaps = 47/609 (7%)
Query: 249 MWTENWSGWFLSFGGAVPYRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNFDRTSGGPFI 308
+WTENW+ F ++G V R ED+A+AV RFF +GG+ NYYMYHGGTNF RT G ++
Sbjct: 2 LWTENWTQQFRAYGDQVAMRSAEDIAYAVLRFFAKGGSLVNYYMYHGGTNFGRT-GASYV 60
Query: 309 STSYDYDAPLDEYGLIRQPKWGHLKDLHKAIKLCEAALVATDPTYPSLGPNLEATVYKT- 367
T Y +AP+DEYG+ ++PK+GHL+DLH I+ + A + + LG EA +++
Sbjct: 61 LTGYYDEAPMDEYGMYKEPKFGHLRDLHNVIRSYQKAFLWGQHSSEILGHGYEAHIFELP 120
Query: 368 GSGLCSAFLANIGTNSDVTVKFNGNSYLLPAWSVSILPDCKNVVFNTAKINSVTLVPSFS 427
LC +FL+N T D TV F G+ + +P+ SVSIL CKNVV+NT ++ S
Sbjct: 121 EEKLCLSFLSNNNTGEDGTVIFRGDKHYVPSRSVSILAGCKNVVYNTKRV-----FVQHS 175
Query: 428 RQSLQVAADSSDAIGSGWSYINEPVGISKDDAFTKPGLLEQINTTADQSDYLWYSLSTNI 487
+S + +S + W +E + +D LEQ N T D +DYLWY+ S +
Sbjct: 176 ERSFHTSDVTSK--NNQWEMSSETIPKYRDTKVRTKEPLEQYNQTKDDTDYLWYTTSFRL 233
Query: 488 KADEPLLEDGSKTVLHVQSLGHALHAFINGKLVGSGYGSSSNAKVTVDFPIALAPGKNTF 547
++D+ + + VL V+S HA+ F N VG G+ + P+ L G N
Sbjct: 234 ESDDLPFRNDIRPVLQVKSSAHAMMGFANDAFVGCARGNKQVKGFMFEKPVDLKVGVNHV 293
Query: 548 DLLSLTVGLQNYGAFYEKTGAGITGPVQLKGSGNGTNIDLSSQQWTYQTGLKGE--ELNF 605
LLS T+G+++ G + GI ++G GT +DL W ++ L+GE E+
Sbjct: 294 VLLSSTMGMKDSGGELAEVKGGIQ-ECLIQGLNTGT-LDLQVNGWGHKAALEGEYKEIYS 351
Query: 606 PSG-SSTQWDSKSTLPKLQPLVWYKTTFDAPAGSEPVAIDFTGMGKGEAWVNGQSIGRYW 664
G QW + WYK FD P G +PV +D + M KG +VNG+ +GRYW
Sbjct: 352 EKGLGKVQWKPAEN---DRAATWYKRYFDEPDGDDPVVLDMSSMSKGMIFVNGEGVGRYW 408
Query: 665 PTYVSQNGGCTDSCNYRGAYSSNKCLKNCGKPSQSLYHVPRSWLKSSGNTLVLFEEIGGD 724
+Y + G PSQ++YH+PR +LKS N LV+FEE G
Sbjct: 409 VSYRTL----------------------AGTPSQAVYHIPRPFLKSKDNLLVIFEEEMGK 446
Query: 725 PTKISFVTKQLGSSLCSHVTDSHPLPVDMWGSDSK----IQRKPGPVLSLECPNPNQVIS 780
P I V +C +++ +P + W +D I +L CP P + I
Sbjct: 447 PDGI-LVQTVTRDDICLFISEHNPGQIKTWDTDGDKIKLIAEDHSRRGTLTCP-PEKTIQ 504
Query: 781 SIKFASFGTPLGTCGSFSRGRCSSARSLSVVRQACVGSKSCSIGVSVNTFGDP--CKGVM 838
+ FASFG P G CG+F+ G C + + +V + C+G SC + V +G C+
Sbjct: 505 EVVFASFGNPDGMCGNFTVGTCHTPNAKQIVEKECLGKPSCMLPVDHTVYGADINCQSTT 564
Query: 839 KSLAVEASC 847
+L V+ C
Sbjct: 565 ATLGVQVRC 573
>gi|300121971|emb|CBK22545.2| unnamed protein product [Blastocystis hominis]
Length = 721
Score = 348 bits (894), Expect = 5e-93, Method: Compositional matrix adjust.
Identities = 237/761 (31%), Positives = 363/761 (47%), Gaps = 102/761 (13%)
Query: 18 LATTSFGAN---VTYDHRAVVIGGKRRVLISGSIHYPRSTPEMWPDLIQKSKDGGLDVIE 74
LA T F VTYD R+ + GKR + ++GS+HYPR+TPEMW ++ ++ + GL++I+
Sbjct: 23 LAYTDFRGKPYKVTYDERSFFLDGKRSIFLAGSVHYPRATPEMWDTILDQAVEDGLNLIQ 82
Query: 75 TYVFWNLHEPVRNQYNFEGRYDLVKFVKLVAEAGLYAHLRIGPYVCAEWNFGGFPLWLHF 134
Y FWNLHEPV+ QYN+EG D+ F++ A+ GL+ ++RIGPYVCAEW+ GG P+W+++
Sbjct: 83 IYTFWNLHEPVKGQYNWEGIADIRLFLQKCADRGLFVNMRIGPYVCAEWDNGGIPVWVNY 142
Query: 135 IPGIQFRTDNEPFKAEMQRFTAKIVDMMKQEKLYASQGGPIILSQIENEYGNIDSAYGAA 194
+ G++ R +N+ +K EM + + D + +A +GGPII SQIENE +G A
Sbjct: 143 LDGVRLRANNDVWKKEMGDWMKVLTDYTRD--FFADRGGPIIFSQIENE------LWGGA 194
Query: 195 GKSYIKWAAGMALSLDTGVPWVMCQQSDAPDPIINTCNGFYCDQFTPNSN-------NKP 247
+ YI W A SL+ VPW+MC D + IN CNG C + + ++P
Sbjct: 195 -REYIDWCGEFAESLELNVPWMMC-NGDTSEKTINACNGNDCSSYLESHGQSGRILVDQP 252
Query: 248 KMWTENWSGWFLSFGGAVPYRP---------VEDLAFAVARFFQRGGTFQNYYMYHGGTN 298
WTEN GWF G A R ED F V +F RGG++ NYYM+ GG +
Sbjct: 253 GCWTEN-EGWFQIHGAASAERDDYEGWDARSAEDYTFNVLKFMDRGGSYHNYYMWFGGNH 311
Query: 299 FDRTSGGPFISTSYDYDAPLDEYGLIRQPKWGHLKDLHKAIKLCEAALVATDPTYPSLG- 357
+ + +G ++ Y + L +PK H +H+ + L+ +
Sbjct: 312 YGKWAGNG-MTNWYTNGVMIHSDTLPNEPKHSHTAKMHRMLANIAEVLLNDKAQVNNQKH 370
Query: 358 ---PNLEATVYKTGSGLCSAFLANIGTNSDVTVKFNGNSYLLPAWSVSILPDCKNVVFNT 414
N A Y+ G L S N G+ V + Y LPAWS+ +L + NV+F T
Sbjct: 371 LNCDNCNAFEYRYGDRLVSFVENNKGSADKVI--YRDIVYELPAWSMIVLDEYDNVLFET 428
Query: 415 AKINSVTLVPSFS-RQSLQVAADSSDAIGSGWSYINEPVGISKDDA---FTKPGLLEQIN 470
+ V + + L+ Y NEPV +A P EQ+N
Sbjct: 429 NNVKPVNKHRVYHCEEKLEF------------EYWNEPVSTLSQEAPRVVVSPKANEQLN 476
Query: 471 TTADQSDYLWYSLSTNIKADEPLLEDGSKTVLHVQSLGHALHAFINGKLVGS-GYGSSSN 529
T D +++L+Y DE L G +A A+++ VGS + +
Sbjct: 477 MTRDLTEFLYYETEVEFPQDECTLSIGGTD-------ANAFVAYVDDHFVGSDDEHTHHD 529
Query: 530 AKVTVDFPIALAPGKNTFDLLSLTVGLQN------YGAFYEKTGAGITGPVQLKGSGNGT 583
T++ + GK+ LLS ++G+ N ++ GI G ++L G+
Sbjct: 530 GWHTMNINMKSGKGKHKLVLLSESLGVSNGMDSNLDPSWASSRLKGICGWIKLCGN---- 585
Query: 584 NIDLSSQQWTYQTGLKGEELN-FPSGSSTQWDSKSTLPKLQPLVWYKTTFDAPAGSE--- 639
D+ +Q+W + GL GE F KS + L WY++TF P G +
Sbjct: 586 --DIFNQEWKHYPGLVGEAKQVFTDEGMKTVTWKSDVENADNLAWYRSTFKTPQGLKRGI 643
Query: 640 PVAIDFTGMGKGEAWVNGQSIGRYWPTYVSQNGGCTDSCNYRGAYSSNKCLKNCGKPSQS 699
V + GM +G+A+VNG +IGRYW + ++G G+ +Q
Sbjct: 644 EVLLRPEGMNRGQAYVNGHNIGRYW---MIKDGN--------------------GEYTQG 680
Query: 700 LYHVPRSWLKSSG--NTLVLFEEIGGDPTKISFVTKQLGSS 738
YH+P+ WLK G N LVL E +G ++ T + S+
Sbjct: 681 YYHIPKDWLKGEGEENVLVLGETLGASDPSVTICTTEYVSN 721
>gi|217075721|gb|ACJ86220.1| unknown [Medicago truncatula]
Length = 208
Score = 345 bits (886), Expect = 4e-92, Method: Compositional matrix adjust.
Identities = 163/203 (80%), Positives = 176/203 (86%)
Query: 7 LLLVLCWGFVVLATTSFGANVTYDHRAVVIGGKRRVLISGSIHYPRSTPEMWPDLIQKSK 66
+ VL W V SF +NVTYDH+A+VI GKRRVL+SGSIHYPRSTP+MWPDLIQKSK
Sbjct: 6 IAFVLLWFLGVYVPASFCSNVTYDHKALVIDGKRRVLMSGSIHYPRSTPQMWPDLIQKSK 65
Query: 67 DGGLDVIETYVFWNLHEPVRNQYNFEGRYDLVKFVKLVAEAGLYAHLRIGPYVCAEWNFG 126
DGG+DVIETYVFWNLHEPVR QYNFEGR DLV FVK+VA AGLY HLRIGPYVCAEWN+G
Sbjct: 66 DGGIDVIETYVFWNLHEPVRGQYNFEGRGDLVGFVKVVAAAGLYVHLRIGPYVCAEWNYG 125
Query: 127 GFPLWLHFIPGIQFRTDNEPFKAEMQRFTAKIVDMMKQEKLYASQGGPIILSQIENEYGN 186
GFPLWLHFI GI+FRT+NEPFKAEM+RFTAKIVDMMKQE LYASQGGPIILSQIENEYGN
Sbjct: 126 GFPLWLHFIAGIKFRTNNEPFKAEMKRFTAKIVDMMKQENLYASQGGPIILSQIENEYGN 185
Query: 187 IDSAYGAAGKSYIKWAAGMALSL 209
ID+ A KSYI WAA MA SL
Sbjct: 186 IDTHDARAAKSYIDWAASMATSL 208
>gi|68161830|emb|CAJ09952.1| beta-galactosidase [Mangifera indica]
Length = 362
Score = 343 bits (879), Expect = 3e-91, Method: Compositional matrix adjust.
Identities = 182/380 (47%), Positives = 233/380 (61%), Gaps = 28/380 (7%)
Query: 352 TYPSLGPNLEATVYKTGSGLCSAFLANIGTNSDVTVKFNGNSYLLPAWSVSILPDCKNVV 411
T SLG N E V+ SG C+AFLAN T S V F Y LP WS+SILPDCK V
Sbjct: 1 TVTSLGNNQEVHVFNPKSGSCAAFLANYDTTSSAKVNFQNMQYELPPWSISILPDCKTAV 60
Query: 412 FNTAKINS------VTLVPSFSRQSLQVAADSSDAIGSGWSYINEPVGISKDDAFTKPGL 465
FNTA++ + +T V +FS QS YI E S D FT GL
Sbjct: 61 FNTARLGAQSSLKQMTPVSTFSWQS----------------YIEESASSSDDKTFTTDGL 104
Query: 466 LEQINTTADQSDYLWYSLSTNIKADEPLLEDGSKTVLHVQSLGHALHAFINGKLVGSGYG 525
EQ+N T D SDYLWY + NI ++E L++G +L + S GHALH FING+L G+ YG
Sbjct: 105 WEQLNVTRDASDYLWYMTNINIDSNEGFLKNGQDPLLTIWSAGHALHVFINGQLSGTVYG 164
Query: 526 SSSNAKVTVDFPIALAPGKNTFDLLSLTVGLQNYGAFYEKTGAGITGPVQLKGSGNGTNI 585
N K+T + + G N LLS++VGLQN G +E+ G+ GPV L+G GT
Sbjct: 165 GVDNPKLTFSQNVKMRVGVNQLSLLSISVGLQNVGTHFEQWNTGVLGPVTLRGLNEGTR- 223
Query: 586 DLSSQQWTYQTGLKGEELNFPS---GSSTQWDSKSTLPKLQPLVWYKTTFDAPAGSEPVA 642
DLS QQW+Y+ GLKGE+L+ + SS +W S+L + QPL WYKTTF+APAG+EP+A
Sbjct: 224 DLSKQQWSYKIGLKGEDLSLHTVSGSSSVEWVEGSSLAQKQPLTWYKTTFNAPAGNEPLA 283
Query: 643 IDFTGMGKGEAWVNGQSIGRYWPTYVSQNGGCTDSCNYRGAYSSNKCLKNCGKPSQSLYH 702
+D + MGKG W+N QSIGR+WP Y++ G C + CNY G Y+ KC NCG+PSQ YH
Sbjct: 284 LDMSTMGKGLIWINSQSIGRHWPGYIAH-GSCGE-CNYAGTYTDKKCHTNCGQPSQRWYH 341
Query: 703 VPRSWLKSSGNTLVLFEEIG 722
VPRSWL +GN LV+ + +G
Sbjct: 342 VPRSWLNPTGNLLVVLKRVG 361
>gi|281202334|gb|EFA76539.1| glycoside hydrolase family 35 protein [Polysphondylium pallidum
PN500]
Length = 611
Score = 340 bits (872), Expect = 2e-90, Method: Compositional matrix adjust.
Identities = 209/592 (35%), Positives = 328/592 (55%), Gaps = 50/592 (8%)
Query: 153 RFTAKIVDMMKQEKLYASQGGPIILSQIENEYGNIDSAYGAAGKSYIKWAAGMALSLDTG 212
RF K + E+ +A+ GGPII+SQ+ENEYG + YG +G Y +W+A +A SL+ G
Sbjct: 6 RFITKYL-----ERHFAANGGPIIMSQVENEYGWVQERYGESGTKYAQWSARLAQSLNVG 60
Query: 213 VPWVMCQQSDAPDPIINTCNGFYCDQFTPNS----NNKPKMWTENWSGWFLSFGGAVPYR 268
VPW+MCQQ D D +INTCNGFYC + N+P +TENW GWF + + P+R
Sbjct: 61 VPWIMCQQDDI-DSVINTCNGFYCHDWIEGHWARYPNQPAFFTENWPGWFQQWKQSTPHR 119
Query: 269 PVEDLAFAVARFFQRGGTFQNYYMYHGGTNFDRTSGGPFISTSYDYDAPLDEYGLIRQPK 328
PVED+ +AV +F RGG+ NYYM+HGGTNF RTS P + SYDYDA LDEYG +PK
Sbjct: 120 PVEDVLYAVGNWFARGGSLMNYYMWHGGTNFGRTS-SPMVVNSYDYDAALDEYGNPSEPK 178
Query: 329 WGHLKDLHKAI-KLCEAALVATD-PTYPSLGPNLEATVYKTGSGLCSAFLANIGTNSDVT 386
+ H + + K L A + P LG + Y G G +FL N ++
Sbjct: 179 YSHAAKFNNLLQKYSHIFLNAPEIPRSEYLGGSSSIYHYTFG-GESLSFLINNHESALND 237
Query: 387 VKFNGNSYLLPAWSVSILPDCKNVVFNTAKINSVTLVPSFSRQSLQVAADSSDAIGSGWS 446
+ +NG ++++ WSV +L + + VF++A V+ + S++ V + ++A S W
Sbjct: 238 IVWNGQNHIIKPWSVHLLYN-NHTVFDSAATPEVSKLAMTSKRFSPVNS-FNNAYISQWV 295
Query: 447 YINEPVGISKDDAFTKPGLLEQINTTADQSDYLWYSLSTNIKADEPLLEDGSKTVLHVQS 506
E + ++ +KP LEQ++ T D++DYLWY N++ G++ +
Sbjct: 296 ---EEIDMTDSTWSSKP--LEQLSLTHDKTDYLWYVTEINLQV------RGAEVF--TTN 342
Query: 507 LGHALHAFINGKLVGSGYGSSSNAKVTVDFPIALAPGKNTFDLLSLTVGLQNYGAFYEKT 566
+ LHA+I+GK + + S++ + D P+ G + +L+ +G+Q+Y EK
Sbjct: 343 VSDVLHAYIDGKYQSTIW-SANPFNIKSDIPL----GWHKLQILNSKLGVQHYTVDMEKV 397
Query: 567 GAGITGPVQLKGSGNGTNIDLSSQQWTYQTGLKGEELNFPSGSS---TQWDSKSTLPKLQ 623
G+ G + + G+ D+++ W+ + + GE L + ++ W S S + Q
Sbjct: 398 TGGLLGNIWVGGT------DITNNGWSMKPYVNGERLAIYNPNNIFKVDWSSFSGVQ--Q 449
Query: 624 PLVWYKTTF-DAPAGSEPVAIDFTGMGKGEAWVNGQSIGRYWPTYVSQNGGCTDSCNYRG 682
PL WYK F + ++ +++ +GM KG W+NG+ + RYW +++ GC + C+Y+G
Sbjct: 450 PLTWYKINFLHELSPNKHYSLNMSGMNKGMIWLNGKHVARYW---ITKGWGC-NGCSYQG 505
Query: 683 AYSSNKCLKNCGKPSQSLYHVPRSWLKSSGNTLVLFEEIGGDPTKISFVTKQ 734
Y+ C NCG+PSQ YH+P+ WL N LV+FEE+GG+P I K+
Sbjct: 506 GYTDQLCSTNCGEPSQINYHLPQDWLIEGANLLVIFEEVGGNPKSIKLEEKE 557
>gi|325183103|emb|CCA17560.1| betagalactosidase putative [Albugo laibachii Nc14]
Length = 811
Score = 335 bits (858), Expect = 9e-89, Method: Compositional matrix adjust.
Identities = 237/737 (32%), Positives = 350/737 (47%), Gaps = 86/737 (11%)
Query: 24 GANVTYDHRAVVIGGKRRVLISGSIHYPRSTPEMWPDLIQKSKDGGLDVIETYVFWNLHE 83
G +V Y R VI GK +L+ GSIHY RSTP+ W L+ K+K+ GL++++ Y+FWN HE
Sbjct: 96 GYDVKYTKRGFVIDGKASILLGGSIHYARSTPDTWDSLLAKAKEDGLNLVQLYIFWNFHE 155
Query: 84 PVRNQYNFEGRYDLVKFVKLVAEAGLYAHLRIGPYVCAEWNFGGFPLWLHFIPGIQFRTD 143
P R + F R +L F + V GL+ HLR GPYVCAEWN GG PLWL IPG++ R++
Sbjct: 156 PRRGSFYFADRGNLTHFFERVVAHGLFVHLRFGPYVCAEWNRGGLPLWLDRIPGMKVRSN 215
Query: 144 NEPFKAEMQRFTAKIVDMMKQEKLYASQGGPIILSQIENEYGNIDSAYGAAGKSYIKWAA 203
+E ++ EM R ++++ + ++ GGPII++QIENEY D +Y+ W +
Sbjct: 216 SESWRQEMNRIILIMINLAR--PYFSVNGGPIIMAQIENEYNGHDP-------TYVAWLS 266
Query: 204 GMALSLDTGVPWVMCQQSDAPDPIINTCNGFYCDQFTPNSN----NKPKMWTEN------ 253
+ L G+PW MC + A + I+TCN C QF + ++P +WTEN
Sbjct: 267 QLVRKLGIGIPWTMCNGASAVN-TISTCNDNDCFQFAEKNAKVFPSQPLVWTENEAWYEK 325
Query: 254 WSGWFLSFGGAVPYRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNFDRTSGGPFISTSYD 313
W+ ++ G R E +A+ VAR+F GG NYYMYHGG NF RT+ ++T Y
Sbjct: 326 WATKNIAQDGQNDQRSPEQVAYVVARWFAVGGAMHNYYMYHGGNNFGRTASAG-VTTMYA 384
Query: 314 YDAPLDEYGLIRQPKWGHLKDLHKAIKLCEAALVATDPTY---PSLGPN------LEATV 364
A L GL +PK HL+ LH + C AL++ + LGP A +
Sbjct: 385 DGAILHHDGLDNEPKRSHLRKLHHTLIRCNKALLSNERQLNHAKPLGPEGKNAYTQRAYI 444
Query: 365 YKTGSGLCSAFLANIGTNSDVTVKFNGNSYLLPAWSVSILPDCKNVVFNTAKINSVTLVP 424
Y G CS FL N ++ Y LP ++ IL D NV++NT+ ++
Sbjct: 445 Y----GNCS-FLENTHAIHRACFRYQLKEYCLPPQTIVIL-DHNNVLYNTSDVSGTLGSR 498
Query: 425 SFSRQSLQVAADSSD-AIGSGWSYINEPVGISKDDAFTKPGLLEQINTTADQSDYLWYSL 483
S S + SD I S W P + D LEQ+ T D +DYL Y
Sbjct: 499 STRSFSPLIRFRKSDWKIWSEWDV--NPHNVR--DQIVNDSPLEQLLVTQDTTDYLMYQN 554
Query: 484 STNIKADEPLLEDGSKTVLHVQSL-GHALHAFINGKLVGSGYGSSSNAKVTVDFPIALAP 542
++ P ++L S ++ FING+ +G + + + F L P
Sbjct: 555 EVRWGSNGPTKNKMKSSILKFISCDANSFLVFINGEFIGEQHLAYPGDDCSNIFRFDLGP 614
Query: 543 ----GKN-TFDLLSLTVGLQNYGAFYEKTGAGITGPVQLKGSGNGTNIDLSSQQWTYQTG 597
G N T +LS+++G+ + G EK GI VQ+ + + ++W +G
Sbjct: 615 LGKYGANLTLSILSISLGIHSLG---EKHQKGIVSDVQID---ERSLVYGPHERWVMFSG 668
Query: 598 LKGEELNFPS---GSSTQWDSKST-LPKLQPLVWYKTTFDAPA----GSEPVAIDFTGMG 649
L GE L +S W + + + + WY T F V +D GM
Sbjct: 669 LIGELLKLYDPMWSNSVPWRNLNVQTDRKRTSKWYMTKFVLKQLDWDTETSVLLDCKGMN 728
Query: 650 KGEAWVNGQSIGRYWPTYVSQNGGCTDSCNYRGAYSSNKCLKNCGKPSQSLYHVPRSWLK 709
+G ++NG +GRYW S GAY Q Y +P +WL
Sbjct: 729 RGRIYLNGHDLGRYWLIRRSD-----------GAY------------VQRYYTIPVAWLH 765
Query: 710 SSG--NTLVLFEEIGGD 724
++ N LV+FEE+ +
Sbjct: 766 AANKSNYLVIFEELRNE 782
>gi|3388167|gb|AAC28739.1| beta-galactosidase [Carica papaya]
Length = 203
Score = 329 bits (843), Expect = 5e-87, Method: Composition-based stats.
Identities = 148/204 (72%), Positives = 169/204 (82%), Gaps = 1/204 (0%)
Query: 51 PRSTPEMWPDLIQKSKDGGLDVIETYVFWNLHEPVRNQYNFEGRYDLVKFVKLVAEAGLY 110
PRSTPEMWPDLIQ +K+GGLDVI+TYVFWN HEP Y FE RYD VKF+KLV +AGLY
Sbjct: 1 PRSTPEMWPDLIQNAKEGGLDVIQTYVFWNGHEPSPGNYYFEDRYDPVKFIKLVHQAGLY 60
Query: 111 AHLRIGPYVCAEWNFGGFPLWLHFIPGIQFRTDNEPFKAEMQRFTAKIVDMMKQEKLYAS 170
HLRIGPY+C EWNFGGFP+WL ++PGIQFRTDN PFKA+MQ+FT KIV+MMK EKL+
Sbjct: 61 VHLRIGPYICGEWNFGGFPVWLKYVPGIQFRTDNGPFKAQMQKFTEKIVNMMKAEKLFEP 120
Query: 171 QGGPIILSQIENEYGNIDSAYGAAGKSYIKWAAGMALSLDTGVPWVMCQQSDAPDPIINT 230
QGGP I+SQIE EYG I GA GK+Y KWAA MA+ L TGVPW+MC+Q DAPDPII+T
Sbjct: 121 QGGP-IMSQIEIEYGPIGWEIGAPGKAYTKWAAQMAVGLGTGVPWIMCKQEDAPDPIIDT 179
Query: 231 CNGFYCDQFTPNSNNKPKMWTENW 254
CNGFYC+ F PN+N KPKMWTE W
Sbjct: 180 CNGFYCENFMPNANYKPKMWTEAW 203
>gi|351722837|ref|NP_001235722.1| lectin [Glycine max]
gi|217314871|gb|ACK36970.1| lectin [Glycine max]
Length = 447
Score = 324 bits (831), Expect = 1e-85, Method: Compositional matrix adjust.
Identities = 188/420 (44%), Positives = 244/420 (58%), Gaps = 30/420 (7%)
Query: 441 IGSGWSYINEPVGISKDDAFTKPGLLEQINTTADQSDYLWYSLSTNIKADEPLL--EDGS 498
I W EP+ I +FT G+ E +N T DQSDYLWYS + + L E+
Sbjct: 31 ISKSWMTTKEPLNIWSKSSFTVEGIWEHLNVTKDQSDYLWYSTRVYVSDSDILFWEENDV 90
Query: 499 KTVLHVQSLGHALHAFINGKLVGSGYGSSSNAKVTVDFPIALAPGKNTFDLLSLTVG-LQ 557
L + + L FING+L+ + V++ GKN T G +
Sbjct: 91 HPKLTIDGVRDILRVFINGQLIVKDEQFKAVISVSI--------GKN-----DCTAGSIN 137
Query: 558 NYGAFYEKTGAGITGPVQLKGSGNGTNIDLSSQQWTYQTGLKGEELNFPSGSSTQWDSKS 617
NYGAF EK GAGI G +++ G NG +IDLS WTYQ GL+GE L F S + +
Sbjct: 138 NYGAFLEKDGAGIRGKIKITGFENG-DIDLSKSLWTYQVGLQGEFLKFYSEENENSEWVE 196
Query: 618 TLPKLQP--LVWYKTTFDAPAGSEPVAIDFTGMGKGEAWVNGQSIGRYWPTYVSQNGGCT 675
P P WYKT FD P G +PVA+DF MGKG+AWVNGQ IGRYW T VS GC
Sbjct: 197 LTPDAIPSTFTWYKTYFDVPGGIDPVALDFKSMGKGQAWVNGQHIGRYW-TRVSPKSGCQ 255
Query: 676 DSCNYRGAYSSNKCLKNCGKPSQSLYHVPRSWLKSSGNTLVLFEEIGGDPTKISFVTKQL 735
C+YRGAY+S+KC NCGKP+Q+LYHVPRSWLK++ N LV+ EE GG+P +IS V
Sbjct: 256 QVCDYRGAYNSDKCSTNCGKPTQTLYHVPRSWLKATNNLLVILEETGGNPFEIS-VKLHS 314
Query: 736 GSSLCSHVTDSHPLPV------DMWGSDSKIQRKPGPVLSLECPNPNQVISSIKFASFGT 789
+C+ V++S+ P+ D+ G + P L L C ISS+ FASFGT
Sbjct: 315 SRIICAQVSESNYPPLQKLVNADLIGEEVSANNMI-PELHLHC-QQGHTISSVAFASFGT 372
Query: 790 PLGTCGSFSRGRCSSARSLSVVRQACVGSKSCSIGVSVNTFG-DPCKGVMKSLAVEASCT 848
P G+C +FSRG C + S+S+V +AC G +SCSI +S + FG DPC GV+K+L+VEA CT
Sbjct: 373 PGGSCQNFSRGNCHAPSSMSIVSEACQGKRSCSIKISDSAFGVDPCPGVVKTLSVEARCT 432
>gi|217075719|gb|ACJ86219.1| unknown [Medicago truncatula]
Length = 200
Score = 323 bits (829), Expect = 2e-85, Method: Compositional matrix adjust.
Identities = 151/201 (75%), Positives = 176/201 (87%), Gaps = 1/201 (0%)
Query: 648 MGKGEAWVNGQSIGRYWPTYVSQNGGCTDSCNYRGAYSSNKCLKNCGKPSQSLYHVPRSW 707
MGKGEAWVNGQSIGRYWPTY+S N GCTDSCNYRG YS++KCLKNCGKPSQ+LYHVPR+W
Sbjct: 1 MGKGEAWVNGQSIGRYWPTYISPNSGCTDSCNYRGTYSASKCLKNCGKPSQTLYHVPRAW 60
Query: 708 LKSSGNTLVLFEEIGGDPTKISFVTKQLGSSLCSHVTDSHPLPVDMWGSDSKIQRKPGPV 767
LK NT VLFEE GGDPTKISF TKQ+ S+CSHVT+SHP PVD W S+++ +RK GPV
Sbjct: 61 LKPDSNTFVLFEESGGDPTKISFGTKQI-ESVCSHVTESHPPPVDTWNSNAESERKVGPV 119
Query: 768 LSLECPNPNQVISSIKFASFGTPLGTCGSFSRGRCSSARSLSVVRQACVGSKSCSIGVSV 827
LSLECP PNQ ISSIKFASFGTP TCG+++ G CSS R+LS+V++AC+GS SC+IGVS+
Sbjct: 120 LSLECPYPNQAISSIKFASFGTPRRTCGNYNHGSCSSNRALSIVQKACIGSSSCNIGVSI 179
Query: 828 NTFGDPCKGVMKSLAVEASCT 848
NTFG+PC+GV KSLAVEA+CT
Sbjct: 180 NTFGNPCRGVTKSLAVEAACT 200
>gi|217070908|gb|ACJ83814.1| unknown [Medicago truncatula]
Length = 200
Score = 317 bits (813), Expect = 1e-83, Method: Compositional matrix adjust.
Identities = 149/200 (74%), Positives = 175/200 (87%), Gaps = 1/200 (0%)
Query: 648 MGKGEAWVNGQSIGRYWPTYVSQNGGCTDSCNYRGAYSSNKCLKNCGKPSQSLYHVPRSW 707
MGKGEAWVNGQSIGRYWPTYV+ N GCTDSCNYRG Y+S+KC KNCGKPSQ+LYHVPRS+
Sbjct: 1 MGKGEAWVNGQSIGRYWPTYVASNAGCTDSCNYRGPYTSSKCRKNCGKPSQTLYHVPRSF 60
Query: 708 LKSSGNTLVLFEEIGGDPTKISFVTKQLGSSLCSHVTDSHPLPVDMWGSDSKIQRKPGPV 767
LK +GNTLVLFEE GGDPT+ISF TKQL S+CSHV+DSHP +D+W D++ K GP
Sbjct: 61 LKPNGNTLVLFEENGGDPTQISFATKQL-ESVCSHVSDSHPPQIDLWNQDTESGGKVGPA 119
Query: 768 LSLECPNPNQVISSIKFASFGTPLGTCGSFSRGRCSSARSLSVVRQACVGSKSCSIGVSV 827
L L CPN NQVISSIKFAS+GTPLGTCG+F RGRCSS ++LS+V++AC+GS+SCS+GVS
Sbjct: 120 LLLSCPNHNQVISSIKFASYGTPLGTCGNFYRGRCSSNKALSIVKKACIGSRSCSVGVST 179
Query: 828 NTFGDPCKGVMKSLAVEASC 847
+TFGDPC+GV KSLAVEA+C
Sbjct: 180 DTFGDPCRGVPKSLAVEATC 199
>gi|183604891|gb|ACC64532.1| beta-galactosidase 6 inactive isoform [Oryza sativa Indica Group]
Length = 244
Score = 312 bits (800), Expect = 4e-82, Method: Composition-based stats.
Identities = 133/205 (64%), Positives = 162/205 (79%)
Query: 23 FGANVTYDHRAVVIGGKRRVLISGSIHYPRSTPEMWPDLIQKSKDGGLDVIETYVFWNLH 82
G +TYD RA+V+ G RR+ SG +HY RSTPEMWP LI K+K+GGLDVI+TYVFWN+H
Sbjct: 25 LGREITYDGRALVVSGARRMFFSGDMHYARSTPEMWPKLIAKAKNGGLDVIQTYVFWNVH 84
Query: 83 EPVRNQYNFEGRYDLVKFVKLVAEAGLYAHLRIGPYVCAEWNFGGFPLWLHFIPGIQFRT 142
EP++ QYNFEGRYDLVKF++ + GLY LRIGP+V AEW +GGFP WLH +P I FR+
Sbjct: 85 EPIQGQYNFEGRYDLVKFIREIQAQGLYVSLRIGPFVEAEWKYGGFPFWLHDVPSITFRS 144
Query: 143 DNEPFKAEMQRFTAKIVDMMKQEKLYASQGGPIILSQIENEYGNIDSAYGAAGKSYIKWA 202
DNEPFK MQ F KIV MMK E LY QGGPII+SQIENEY I+ A+GA+G Y++WA
Sbjct: 145 DNEPFKQHMQNFVTKIVTMMKHEGLYYPQGGPIIISQIENEYQMIEPAFGASGPRYVRWA 204
Query: 203 AGMALSLDTGVPWVMCQQSDAPDPI 227
A MA+ L TGVPW+MC+Q+DAPDP+
Sbjct: 205 AAMAVGLQTGVPWMMCKQNDAPDPV 229
>gi|452819191|gb|EME26260.1| beta-galactosidase [Galdieria sulphuraria]
Length = 652
Score = 310 bits (793), Expect = 2e-81, Method: Compositional matrix adjust.
Identities = 191/528 (36%), Positives = 280/528 (53%), Gaps = 36/528 (6%)
Query: 25 ANVTYDHRAVVIGGKRRVLISGSIHYPRSTPEMWPDLIQKSKDGGLDVIETYVFWNLHEP 84
A VT+D RAVVI GKR +L GS HYP+ E WP ++ +KD GL+ +E Y+FWN+HE
Sbjct: 4 AQVTFDKRAVVIDGKRTILYCGSYHYPKIHYEHWPQALELAKDCGLNCLEVYIFWNVHEK 63
Query: 85 VRNQYNFEGRYDLVKFVKLVAEAGLYAHLRIGPYVCAEWNFGGFPLWLHFIPGIQFRTDN 144
+ Y+FE ++ +F++L E GL LR+GPY+CAE ++GGFP WL IPGI+FRT N
Sbjct: 64 KKGVYHFEREGNIFRFLQLAQERGLKVILRMGPYICAETSYGGFPYWLREIPGIEFRTYN 123
Query: 145 EPFKAEMQRFTAKIVDMMKQEKLYASQGGPIILSQIENEYGNIDSAYGAAGKSYIKWAAG 204
EPF EM+R+ I M+K+ KLY +GGPIIL QIENEY + S YGAAG+ Y+ W
Sbjct: 124 EPFMKEMKRWLTDINRMLKENKLYHQKGGPIILVQIENEYDIVSSIYGAAGQKYLHWC-- 181
Query: 205 MALSLDTGVPWVMCQQSD-----APDPIINTCNGFY----CDQFTPNSNNKPKMWTENWS 255
L + W+ + S+ + D I T N FY D ++P +WTE W
Sbjct: 182 YELYKEGASEWLTSKDSEYFRVASIDKSIETINDFYGHRRIDSLKALKPHQPLLWTEFWI 241
Query: 256 GWFLSFGGAVPYRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNFDRTSGGPFISTSYDYD 315
GW+ + GA RPV+D+ +A ARF +GG+ NYYM+HGGT+F + +T YD+D
Sbjct: 242 GWYNIWRGAQRQRPVDDVIYAAARFIAQGGSGMNYYMFHGGTHFGNLAMYG-QTTGYDFD 300
Query: 316 APLDEYGLIRQPKWGHLKDLHKAIKLCEAALVATD-PTYPSLGPNLEATVYK-TGSGLCS 373
AP+D YG + K+ LK L+ + E L++ D P L PN+ +K SG
Sbjct: 301 APVDSYGRPTE-KFERLKQLNHCLSNLEYILLSQDEPEVQKLTPNVNVYRWKDIESGDEC 359
Query: 374 AFLANIGTNSDVTVKFNGNSYLLPAWSVSILPDCKNVVFNTAKINSVTLVPSFSRQSLQV 433
+F+ N S V + L SV I + + V ++ +V+ Q
Sbjct: 360 SFVCN-DQRSQSYVIVAERAVCLKPLSVKIYLNHEEVFDSSQNSYNVS----------QK 408
Query: 434 AADSSDAIGSGWSYINEPV-GISKDD----AFTKPGLLEQINTTADQSDYLWYSLSTNIK 488
+ D + + W + P+ K D F+ P + + ++ T D++DY+WY+ I
Sbjct: 409 SYHRLDYVCNEWKTMQIPIPSKEKKDKEHFEFSFPHIPDMLHITQDETDYMWYTGVGTIY 468
Query: 489 ADEPLLEDGSKTVLHVQSLGHA---LHAFINGKLVGSGYGSSSNAKVT 533
P + + L + A +H F+N K VGS + + T
Sbjct: 469 C--PFKGENTPHCLKIHMELEAADYVHVFLNRKYVGSCRSPCYDERFT 514
>gi|10047451|gb|AAG12249.1|AF184080_1 beta-galactosidase [Prunus armeniaca]
Length = 376
Score = 308 bits (788), Expect = 1e-80, Method: Compositional matrix adjust.
Identities = 170/371 (45%), Positives = 224/371 (60%), Gaps = 14/371 (3%)
Query: 485 TNIKADEPLLEDGSKTVLHVQSLGHALHAFINGKLVGSGYGSSSNAKVTVDFPIALAPGK 544
TN+ L G K L VQS GHALH F+NG+ GS +G+ + T P+ L G
Sbjct: 1 TNVDISSSELHGGKKPTLTVQSAGHALHVFVNGQFSGSAFGTREQRQFTFAKPVHLRAGI 60
Query: 545 NTFDLLSLTVGLQNYGAFYEKTGAGITGPVQLKGSGNGTNIDLSSQQWTYQTGLKGEELN 604
N LLS+ VGL N G YE GI GPV L G G G DL+ Q+W + GLKGE ++
Sbjct: 61 NKIALLSIAVGLPNVGLHYESWKTGILGPVFLDGLGQGRK-DLTMQKWFNKVGLKGEAMD 119
Query: 605 FPS---GSSTQWDSKSTLPKL-QPLVWYKTTFDAPAGSEPVAIDFTGMGKGEAWVNGQSI 660
S GSS W S + Q L WYK F+AP G EP+A+D MGKG+ W+NGQSI
Sbjct: 120 LVSPNGGSSVDWIRGSLATQTKQTLKWYKAYFNAPGGDEPLALDMRSMGKGQVWINGQSI 179
Query: 661 GRYWPTYVSQNGGCTDSCNYRGAYSSNKCLKNCGKPSQSLYHVPRSWLKSSGNTLVLFEE 720
GRYW Y NG C+ C+Y G + KC CG+P+Q YHVPRSWLK + N +V+FEE
Sbjct: 180 GRYWMAYA--NGDCS-LCSYIGTFRPTKCQLGCGQPTQRWYHVPRSWLKPTKNLMVMFEE 236
Query: 721 IGGDPTKISFVTKQLGSSLCSHVTDSHPLPVDMWGSDSKIQRKP--GPVLSLECPNPNQV 778
+GGDP+KI+ V + + + +C+ + + HP + + DS + K + L+C P Q
Sbjct: 237 LGGDPSKITLVKRSV-AGVCADLQEHHP-NAEKFDIDSHEESKTLHQAQVHLQCV-PGQS 293
Query: 779 ISSIKFASFGTPLGTCGSFSRGRCSSARSLSVVRQACVGSKSCSIGVSVNTFG-DPCKGV 837
ISSIKFASFGTP GTCGSF +G C + S ++V + C+G +SC + VS + FG DPC V
Sbjct: 294 ISSIKFASFGTPTGTCGSFQQGTCHATNSHAIVEKNCIGRESCLVTVSNSIFGTDPCPNV 353
Query: 838 MKSLAVEASCT 848
+K L+VEA C+
Sbjct: 354 LKRLSVEAVCS 364
>gi|16649045|gb|AAL24374.1| beta-galactosidase [Arabidopsis thaliana]
gi|20260008|gb|AAM13351.1| beta-galactosidase [Arabidopsis thaliana]
Length = 420
Score = 304 bits (779), Expect = 1e-79, Method: Compositional matrix adjust.
Identities = 182/445 (40%), Positives = 249/445 (55%), Gaps = 43/445 (9%)
Query: 292 MYHGGTNFDRTSGGPFISTSYDYDAPLDEYGLIRQPKWGHLKDLHKAIKLCEAALVATDP 351
MYHGGTNF RTS FI+ YD APLDEYGL+RQPK+GHLK+LH AIK L+
Sbjct: 1 MYHGGTNFGRTSSSYFITGYYD-QAPLDEYGLLRQPKYGHLKELHAAIKSSANPLLQGKQ 59
Query: 352 TYPSLGPNLEATVYKTGSGLCSAFLANIGTNSDVTVKFNGNSYLLPAWSVSILPDCKNVV 411
T SLGP +A V++ + C AFL N + ++F N+Y L S+ IL +CKN++
Sbjct: 60 TILSLGPMQQAYVFEDANNGCVAFLVNNDAKAS-QIQFRNNAYSLSPKSIGILQNCKNLI 118
Query: 412 FNTAKINSVTLVPSFSRQSLQVAADSSDAIGSGWSYINEPVGISKDDAFTKPGLLEQINT 471
+ TAK+N V +R + V + + W+ E + + LLE N
Sbjct: 119 YETAKVN----VKMNTRVTTPVQVFN---VPDNWNLFRETIPAFPGTSLKTNALLEHTNL 171
Query: 472 TADQSDYLWYSLSTNIKADEPLLEDGSKTVLHVQSLGHALHAFINGKLVGSGYGSSSNAK 531
T D++DYLWY+ ++ K D P + ++ +S GH +H F+N L GSG+GS
Sbjct: 172 TKDKTDYLWYT--SSFKLDSPC----TNPSIYTESSGHVVHVFVNNALAGSGHGSRDIRV 225
Query: 532 VTVDFPIALAPGKNTFDLLSLTVGLQNYGAFYEKTGAGITGPVQLKGSGNGTNIDLSSQQ 591
V + P++L G+N +LS VGL + GA+ E+ G+T VQ+ G IDLS Q
Sbjct: 226 VKLQAPVSLINGQNNISILSGMVGLPDSGAYMERRSYGLT-KVQISCGGTKP-IDLSRSQ 283
Query: 592 WTYQTGLKGEELNF---PSGSSTQWD-SKSTLPKLQPLVWYKTTFDAPAGSEPVAIDFTG 647
W Y GL GE++ + + +W +K+ L K +PL WYKTTFD P G PV + +
Sbjct: 284 WGYSVGLLGEKVRLYQWKNLNRVKWSMNKAGLIKNRPLAWYKTTFDGPNGDGPVGLHMSS 343
Query: 648 MGKGEAWVNGQSIGRYWPTYVSQNGGCTDSCNYRGAYSSNKCLKNCGKPSQSLYHVPRSW 707
MGKGE WVNG+SIGRYW ++ L G+PSQS+YH+PR++
Sbjct: 344 MGKGEIWVNGESIGRYWVSF----------------------LTPAGQPSQSIYHIPRAF 381
Query: 708 LKSSGNTLVLFEEIGGDPTKISFVT 732
LK SGN LV+FEE GGDP IS T
Sbjct: 382 LKPSGNLLVVFEEEGGDPLGISLNT 406
>gi|1669595|dbj|BAA13685.1| AR782 [Arabidopsis thaliana]
Length = 206
Score = 303 bits (776), Expect = 3e-79, Method: Compositional matrix adjust.
Identities = 144/204 (70%), Positives = 173/204 (84%), Gaps = 2/204 (0%)
Query: 647 GMGKGEAWVNGQSIGRYWPTYVSQNGGCTDSCNYRGAYSSNKCLKNCGKPSQSLYHVPRS 706
G GKG AWVNGQSIGRYWPT ++ NGGCT+SC+YRG+Y +NKCLKNCGKPSQ+LYHVPRS
Sbjct: 3 GTGKGIAWVNGQSIGRYWPTSIAGNGGCTESCDYRGSYRANKCLKNCGKPSQTLYHVPRS 62
Query: 707 WLKSSGNTLVLFEEIGGDPTKISFVTKQLGSSLCSHVTDSHPLPVDMWGSDSKI--QRKP 764
WLK SGN LVLFEE+GGDPT+ISF TKQ GS+LC V+ SHP PVD W SDSKI + +
Sbjct: 63 WLKPSGNILVLFEEMGGDPTQISFATKQTGSNLCLTVSQSHPPPVDTWTSDSKISNRNRT 122
Query: 765 GPVLSLECPNPNQVISSIKFASFGTPLGTCGSFSRGRCSSARSLSVVRQACVGSKSCSIG 824
PVLSL+CP QVI SIKFASFGTP GTCGSF++G C+S+RSLS+V++AC+G +SC++
Sbjct: 123 RPVLSLKCPISTQVIFSIKFASFGTPKGTCGSFTQGHCNSSRSLSLVQKACIGLRSCNVE 182
Query: 825 VSVNTFGDPCKGVMKSLAVEASCT 848
VS FG+PC+GV+KSLAVEASC+
Sbjct: 183 VSTRVFGEPCRGVVKSLAVEASCS 206
>gi|414879451|tpg|DAA56582.1| TPA: hypothetical protein ZEAMMB73_811947 [Zea mays]
Length = 249
Score = 302 bits (774), Expect = 4e-79, Method: Compositional matrix adjust.
Identities = 132/210 (62%), Positives = 167/210 (79%)
Query: 20 TTSFGANVTYDHRAVVIGGKRRVLISGSIHYPRSTPEMWPDLIQKSKDGGLDVIETYVFW 79
TT+ VTYD RA+++ G RR+L SG +HYPRSTPEMWPDLI K+K GGLDVI+TYVFW
Sbjct: 31 TTAGRGEVTYDGRALILDGARRMLFSGDMHYPRSTPEMWPDLIAKAKKGGLDVIQTYVFW 90
Query: 80 NLHEPVRNQYNFEGRYDLVKFVKLVAEAGLYAHLRIGPYVCAEWNFGGFPLWLHFIPGIQ 139
N HEPV+ Q+NFEGRYDLVKF++ + GLY LRIGP+V +EW +GG P WL IP I
Sbjct: 91 NAHEPVQGQFNFEGRYDLVKFIREIHAQGLYVSLRIGPFVESEWKYGGLPFWLRGIPNIT 150
Query: 140 FRTDNEPFKAEMQRFTAKIVDMMKQEKLYASQGGPIILSQIENEYGNIDSAYGAAGKSYI 199
FR+DNEPFK MQ+F KIV++MK E+L+ QGGPII+SQIENEY +++A+ + G SY+
Sbjct: 151 FRSDNEPFKRHMQKFVTKIVNLMKDERLFYPQGGPIIISQIENEYKLVEAAFHSKGSSYV 210
Query: 200 KWAAGMALSLDTGVPWVMCQQSDAPDPIIN 229
WAA MA++L TGVPW+MC+Q DAPDPI++
Sbjct: 211 HWAAAMAVNLQTGVPWMMCKQDDAPDPIVS 240
>gi|452821358|gb|EME28389.1| beta-galactosidase [Galdieria sulphuraria]
Length = 1171
Score = 302 bits (773), Expect = 7e-79, Method: Compositional matrix adjust.
Identities = 159/383 (41%), Positives = 221/383 (57%), Gaps = 10/383 (2%)
Query: 41 RVLISGSIHYPRSTPEMWPDLIQKSKDGGLDVIETYVFWNLHEPVRNQYNFEGRYDLVKF 100
R+L SIHYPR P W LI+ +K+ G++ IETYVFWN HE + Y+F GR DL F
Sbjct: 476 RILFPASIHYPRCQPSDWQQLIEFAKEAGINCIETYVFWNQHEKEKGVYDFSGRLDLFGF 535
Query: 101 VKLVAEAGLYAHLRIGPYVCAEWNFGGFPLWLHFIPGIQFRTDNEPFKAEMQRFTAKIVD 160
++ +A+AGLYA LRIGPY+CAE +FGGFP WL I GI+FRT NEPF+ E R+ +V+
Sbjct: 536 IRTIAKAGLYALLRIGPYICAETHFGGFPHWLRDIDGIEFRTQNEPFQRESSRWVRFLVE 595
Query: 161 MMKQEKLYASQGGPIILSQIENEYGNIDSAYGAAGKSYIKWAAGMALSLDTGVPWVMCQQ 220
+ + SQGGPI++ Q ENEY I YG AG +Y+KW + +A L VP MC+
Sbjct: 596 KLNSNNCFYSQGGPIVMVQFENEYKLIGQNYGEAGLNYLKWCSELAKDLQLPVPLFMCKG 655
Query: 221 SDAPDPIINTCNGFYCDQFTPNSN----NKPKMWTENWSGWFLSFGGAVPYRPVEDLAFA 276
S + ++ T N FY Q N + N+P +WTE W+GW+ +G A RP +DL +A
Sbjct: 656 S--IENVLETINDFYGHQEMENHHREYPNQPAIWTECWTGWYDVWGSAHHIRPCKDLFYA 713
Query: 277 VARFFQRGGTFQNYYMYHGGTNFDRTSGGPFISTSYDYDAPLDEYGLIRQPKWGHLKDLH 336
V RFF +GG NYYM+HGGTN+D+ + +TSYDYDAP+DEYG + K+ L+ +H
Sbjct: 714 VLRFFAQGGKGINYYMFHGGTNYDQLAMY-LQTTSYDYDAPIDEYGR-KTKKYFGLQYIH 771
Query: 337 KAIK--LCEAALVATDPTYPSLGPNLEATVYKTGSGLCSAFLANIGTNSDVTVKFNGNSY 394
+ ++ AL P S N G F N S V++ Y
Sbjct: 772 RQLEQHFASLALKLEAPIAHSYEDNYVWIFIWEEQGSNCIFFCNDHPTSTKQVQWKEQEY 831
Query: 395 LLPAWSVSILPDCKNVVFNTAKI 417
L SV ++ D ++ + ++
Sbjct: 832 CLAPLSVQMVVDHHRLILKSDQL 854
>gi|301123859|ref|XP_002909656.1| beta-galactosidase, putative [Phytophthora infestans T30-4]
gi|262100418|gb|EEY58470.1| beta-galactosidase, putative [Phytophthora infestans T30-4]
Length = 706
Score = 295 bits (754), Expect = 9e-77, Method: Compositional matrix adjust.
Identities = 197/600 (32%), Positives = 296/600 (49%), Gaps = 65/600 (10%)
Query: 24 GANVTYDHRAVVIGGKRRVLISGSIHYPRSTPEMWPDLIQKSKDGGLDVIETYVFWNLHE 83
G +VTY R I GK+ +L+ GSIHYPRS+P W L++++K GL+ IE YVFWNLHE
Sbjct: 82 GYSVTYSPRGFEIDGKQTLLLGGSIHYPRSSPGEWEQLLREAKRDGLNHIEMYVFWNLHE 141
Query: 84 PVRNQYNFEGRYDLVKFVKLVAEAGLYAHLRIGPYVCAEWNFGGFPLWLHFIPGIQFRTD 143
R +NF G ++ +F +L AE GL+ H+R GPYVCAEWN GG PLWL++IPG++ R+
Sbjct: 142 QERGVFNFAGNANITRFYELAAEVGLFLHVRFGPYVCAEWNNGGLPLWLNWIPGMEVRSS 201
Query: 144 NEPFKAEMQRFTAKIVDMMKQEKLYASQGGPIILSQIENEYGNIDSAYGAAGKSYIKWAA 203
N P++ EM+RF +V++ + A GGPII++QIENE+ D YI W
Sbjct: 202 NAPWQREMERFIRYMVELSR--PFLAKNGGPIIMAQIENEFAWHDP-------EYIAWCG 252
Query: 204 GMALSLDTGVPWVMCQQSDAPDPIINTCNGFYCDQF----TPNSNNKPKMWTENWSGWFL 259
+ LDT +PWVMC + A + I+ +CN C F + P +WTE+ GWF
Sbjct: 253 NLVKQLDTSIPWVMCYANAAENTIL-SCNDDDCVDFAVKHVKERPSDPLVWTED-EGWFQ 310
Query: 260 SF----GGAVP--YRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNFDRTSGGPFISTSYD 313
++ +P R ED+A+AVAR+F GG NYYMYHGG N+ R + ++T Y
Sbjct: 311 TWQKDKKNPLPNDQRSPEDVAYAVARWFAVGGAAHNYYMYHGGNNYGRAASAG-VTTMYA 369
Query: 314 YDAPLDEYGLIRQPKWGHLKDLHKAIKLCEAALVATDPTY--PSLGPNL-EATVYKTGSG 370
L GL +PK HL+ LH+A+ C L+ D P P + E TV +
Sbjct: 370 DGVNLHSDGLSNEPKRTHLRKLHEALIECNDVLLRNDRQVLNPRELPLVDEQTVKASSQQ 429
Query: 371 LCSAFLANIGTNSDVTVKFNGNSYLLPAWSVSILPDCKNVVFNTAKINSVTLVPSFSRQS 430
+ N D ++F+TA + P ++
Sbjct: 430 RAFVYGPEAEPNQDGA-----------------------ILFDTADVRKS--FPGRQHRT 464
Query: 431 LQVAADSSDAIGSGWSYINEPVGISKDDAFTKPGLLEQINTTADQSDYLWYSLSTNIKAD 490
+S WS +N + +EQ+ TADQSDYL Y + K
Sbjct: 465 YTPLVKASALAWKAWSELNVSSTTPRRRVVADQP-IEQLRLTADQSDYLTYETTFTPKQL 523
Query: 491 EPLLEDGSKTVLHVQSLGHALHAFINGKLVGSGY----GSSSNAKVTVDFPIALAPGK-N 545
++D TV ++ A ++G L+G G + + + + P ++ G+ +
Sbjct: 524 SD-VDDDMWTVKVTSCEASSIIALVDGWLIGERNLAYPGGNCSKEFSFHLPASIEVGRQH 582
Query: 546 TFDLLSLTVGLQNYGAFYEKTGAGITGPVQLKGSGNGTNIDLSSQQWTYQTGLKGEELNF 605
L+S+++G+ + G+ + K G+TG V++ G Q+W L GE+L
Sbjct: 583 DLKLVSVSLGIYSLGSNHSK---GVTGSVRI-----GHKDLARGQRWEMYPSLIGEQLEI 634
>gi|116782829|gb|ABK22678.1| unknown [Picea sitchensis]
Length = 317
Score = 287 bits (734), Expect = 2e-74, Method: Compositional matrix adjust.
Identities = 158/322 (49%), Positives = 207/322 (64%), Gaps = 15/322 (4%)
Query: 535 DFPIALAPGKNTFDLLSLTVGLQNYGAFYEKTGAGITGPVQLKGSGNGTNIDLSSQQWTY 594
+ PI+L PG N LLS+ VGL N G +E+ AGI+ V L+G +GT DLS + WTY
Sbjct: 3 ELPISLIPGTNDIALLSVMVGLPNSGGHFERKIAGIS-TVTLRGFKDGTR-DLSQELWTY 60
Query: 595 QTGLKGEELNFPSGS---STQWDSKSTLPKLQPLVWYKTTFDAPAGSEPVAIDFTGMGKG 651
Q GL GE S S W S ST P PL WYK D P G EPV +D + MGKG
Sbjct: 61 QIGLLGEMSTIYSDVGFISVNWTSSST-PN-PPLTWYKAVIDVPDGDEPVILDLSSMGKG 118
Query: 652 EAWVNGQSIGRYWPTYVSQNGGCTDSCNYRGAYSSNKCLKNCGKPSQSLYHVPRSWLKSS 711
+AW+NG+ IGRYW ++++ G C+ C+YRG YS +KC NCG+PSQ+LYHVPRSWL+ +
Sbjct: 119 QAWINGEHIGRYWISFLAPLGDCS-KCDYRGNYSLHKCATNCGQPSQTLYHVPRSWLRPT 177
Query: 712 GNTLVLFEEIGGDPTKISFVTKQLGSSLCSHVTDSHPLPVDMWGS---DSKIQRK-PGPV 767
GN LVLFEE GGDP+K+S +T+ + S+C+H ++HP + W +S++ R+ P
Sbjct: 178 GNLLVLFEETGGDPSKVSLLTRSI-DSVCAHAFETHPPSIQSWQKTKVNSEVLRENVEPS 236
Query: 768 LSLECPNPNQVISSIKFASFGTPLGTCGSFSRGRCSSARSLSVVRQACVGSKSCSIGVSV 827
L L+C + + ISSIKFASFG P G CG+F +G C S S V +AC+G CSI S
Sbjct: 237 LQLDC-SVGRRISSIKFASFGNPKGVCGNFMKGTCHSVESEKAVEKACLGQHGCSITNSP 295
Query: 828 NTF-GDPCKGVMKSLAVEASCT 848
F GD C G +KSLAVEA+C+
Sbjct: 296 KEFGGDACVGTVKSLAVEATCS 317
>gi|3021342|emb|CAA06310.1| beta-galactosidase [Cicer arietinum]
Length = 307
Score = 286 bits (732), Expect = 3e-74, Method: Compositional matrix adjust.
Identities = 148/315 (46%), Positives = 190/315 (60%), Gaps = 13/315 (4%)
Query: 423 VPSFSRQSLQVAADSSDAIGSGWSYINE-PVGISKDDAFTKPGLLEQINTTADQSDYLWY 481
+PSF R+ V++ W NE P DD+ T LLEQI T D SDYLWY
Sbjct: 1 LPSFHRKMTPVSS------AFDWQSYNEAPASSGIDDSTTANALLEQIKVTRDSSDYLWY 54
Query: 482 SLSTNIKADEPLLEDGSKTVLHVQSLGHALHAFINGKLVGSGYGSSSNAKVTVDFPIALA 541
NI +E +++G VL S GH LH F+NG+ G+ YG N K+T + L
Sbjct: 55 MTDVNISPNEGFIKNGQYPVLTAMSAGHVLHVFVNGQFSGTAYGGLENPKLTFSNSVKLR 114
Query: 542 PGKNTFDLLSLTVGLQNYGAFYEKTGAGITGPVQLKGSGNGTNIDLSSQQWTYQTGLKGE 601
G N LLS+ VGL N G YE G+ GPV LKG GT DLS Q+W+Y+ GLKGE
Sbjct: 115 VGNNKISLLSVAVGLSNVGLHYETWNVGVLGPVTLKGLNEGTR-DLSGQKWSYKIGLKGE 173
Query: 602 ELNFPS---GSSTQWDSKSTLPKLQPLVWYKTTFDAPAGSEPVAIDFTGMGKGEAWVNGQ 658
LN + SS QW S+L + QPL WYK TFDAPAG++P+A+D + MGKGE WVNG+
Sbjct: 174 TLNLHTLIGSSSVQWTKGSSLVEKQPLTWYKATFDAPAGNDPLALDMSSMGKGEIWVNGE 233
Query: 659 SIGRYWPTYVSQNGGCTDSCNYRGAYSSNKCLKNCGKPSQSLYHVPRSWLKSSGNTLVLF 718
SIGR+WP Y+++ G CNY G ++ KC +CG+P+Q YH+PRSW+ GN LV+
Sbjct: 234 SIGRHWPAYIAR--GSCGGCNYAGTFTDKKCRTSCGQPTQKWYHIPRSWVNPRGNFLVVL 291
Query: 719 EEIGGDPTKISFVTK 733
EE GGDP+ IS V +
Sbjct: 292 EEWGGDPSGISLVKR 306
>gi|212723424|ref|NP_001132807.1| uncharacterized protein LOC100194296 [Zea mays]
gi|194695440|gb|ACF81804.1| unknown [Zea mays]
Length = 467
Score = 281 bits (718), Expect = 1e-72, Method: Compositional matrix adjust.
Identities = 172/494 (34%), Positives = 258/494 (52%), Gaps = 56/494 (11%)
Query: 371 LCSAFLANIGTNSDVTVKFNGNSYLLPAWSVSILPDCKNVVFNTAKINSVTLVPSFSRQS 430
+C AFL+N T D T+ F G Y +P S+S+L DC+ VVF T +N+ ++++
Sbjct: 6 VCVAFLSNHNTKDDATMTFRGRPYFVPRHSISVLADCETVVFGTQHVNA-----QHNQRT 60
Query: 431 LQVAADSSDAIGSGWSYI---NEPVGISKDDAFTKPGLLEQINTTADQSDYLWYSLSTNI 487
A + A + W N P K G L N T D++DY+WY+ S +
Sbjct: 61 FHFADQT--AQNNVWEMFDGENVPKYKQAKIRLRKAGDL--YNLTKDKTDYVWYTSSFKL 116
Query: 488 KADEPLLEDGSKTVLHVQSLGHALHAFINGKLVGSGYGSSSNAKVTVDFPIALAPGKNTF 547
+AD+ + KTVL V S GHA AF+N K VG G+G+ N T++ P+ L G N
Sbjct: 117 EADDMPIRSDIKTVLEVNSHGHASVAFVNNKFVGCGHGTKMNKAFTLEKPMDLKKGVNHV 176
Query: 548 DLLSLTVGLQNYGAFYEKTGAGITGPVQLKGSGNGTNIDLSSQQWTYQTGLKGEELNFPS 607
+L+ ++G+ + GA+ E AG+ VQ+ G GT +DL++ W + GL GE +
Sbjct: 177 AVLASSMGMTDSGAYMEHRLAGV-DRVQITGLNAGT-LDLTNNGWGHIVGLVGERKQIYT 234
Query: 608 GS---STQWDSKSTLPKL--QPLVWYKTTFDAPAGSEPVAIDFTGMGKGEAWVNGQSIGR 662
S W P + +PL WYK FD P+G +PV +D + MGKG +VNGQ IGR
Sbjct: 235 DKGMGSVTWK-----PAMNDRPLTWYKRHFDMPSGEDPVVLDMSTMGKGMMFVNGQGIGR 289
Query: 663 YWPTYVSQNGGCTDSCNYRGAYSSNKCLKNCGKPSQSLYHVPRSWLKSSGNTLVLFEEIG 722
YW +Y G+PSQ LYHVPRS+L+ N LVLFEE
Sbjct: 290 YWISYKHA----------------------LGRPSQQLYHVPRSFLRQKDNMLVLFEEEF 327
Query: 723 GDPTKISFVTKQLGSSLCSHVTDSHPLPVDMW-GSDSKIQRKPG-----PVLSLECPNPN 776
G P I +T + ++C+ +++ +P + W DS+I K +L CP P
Sbjct: 328 GRPDAIMILTVKR-DNICTFISERNPAHIMSWERKDSQITAKANADDLRARAALACP-PK 385
Query: 777 QVISSIKFASFGTPLGTCGSFSRGRCSSARSLSVVRQACVGSKSCSIGVSVNTFGDP--C 834
++I + FAS+G P G CG+++ G C + R+ VV +AC+G + C++ V+ + +G C
Sbjct: 386 KLIQQVVFASYGNPAGICGNYTVGSCHTPRAKEVVEKACLGKRVCTLPVAADVYGGDANC 445
Query: 835 KGVMKSLAVEASCT 848
G +LAV+A C+
Sbjct: 446 SGTTATLAVQAKCS 459
>gi|297797852|ref|XP_002866810.1| hypothetical protein ARALYDRAFT_912308 [Arabidopsis lyrata subsp.
lyrata]
gi|297312646|gb|EFH43069.1| hypothetical protein ARALYDRAFT_912308 [Arabidopsis lyrata subsp.
lyrata]
Length = 448
Score = 276 bits (706), Expect = 3e-71, Method: Compositional matrix adjust.
Identities = 132/277 (47%), Positives = 176/277 (63%), Gaps = 26/277 (9%)
Query: 27 VTYDHRAVVIGGKRRVLISGSIHYPRSTPEMWPDLIQKSKDGGLDVIETYVFWNLHEPVR 86
VTYD +++I GKR +L S S+HYPRSTP+MWP +I K++ GGL+ I+TYVFWN+HEP
Sbjct: 42 VTYDGTSLIINGKRELLFSVSVHYPRSTPDMWPSIIDKARIGGLNTIQTYVFWNVHEPEH 101
Query: 87 NQYNFEGRYDLVKFVKLVAEAGLYAHLRIGPYVCAEWNFGGFPLWLHFIPGIQFRTDNEP 146
+Y+F+GR+DLV F+KL+ E GLY LR+GP++ AEWN GG P WL +P + FRTDNEP
Sbjct: 102 RKYDFKGRFDLVTFIKLIQEKGLYVTLRLGPFIQAEWNHGGLPYWLREVPEVYFRTDNEP 161
Query: 147 FKAEMQRFTAKIVDMMKQEKLYASQGGPIILSQIENEYGNIDSAYGAAGKSYIKWAAGMA 206
FK +R+ KI+ MMK+EKL ASQ L ENE + AY G+ YIKWAA +
Sbjct: 162 FKEHTERYVRKILGMMKEEKLLASQRRSHHLG-TENECNAVQLAYKENGERYIKWAANLV 220
Query: 207 LSLDTGVPWVMCQQSDAPDPIINTCNGFYCDQFTPNSNNKPKMWTENWSGWFLSFGGAVP 266
S+ G+PWVMC+Q++A D +IN CNG +C +F G
Sbjct: 221 ESMKLGIPWVMCKQNNASDNLINACNGRHCFEF---------------------LGILQL 259
Query: 267 YRPVEDLAFAVARFFQRGGTFQNYYM----YHGGTNF 299
ED+AF+VAR+F + G+ NYYM YH +F
Sbjct: 260 IEQSEDIAFSVARYFSKNGSHVNYYMMVDRYHIPRSF 296
Score = 73.6 bits (179), Expect = 5e-10, Method: Compositional matrix adjust.
Identities = 43/120 (35%), Positives = 71/120 (59%), Gaps = 9/120 (7%)
Query: 701 YHVPRSWLKSSG--NTLVLFEEIGGDPTK-ISFVTKQLGSSLCSHVTDSHPLPVDMWGSD 757
YH+PRS++K N LV+ EE G + I FV ++CS+V + +P+ V W +
Sbjct: 290 YHIPRSFMKEEKKKNMLVILEEEPGVKLEAIDFVLVNR-DTICSYVGEDYPVSVKSWKRE 348
Query: 758 S-KIQRKPGPVL---SLECPNPNQVISSIKFASFGTPLGTCGSFSRGRCSSARSLSVVRQ 813
KI + + ++CP P + + +++FASFG P GTCG+F+ G+CS+++S VV +
Sbjct: 349 RPKIASRSKDMRLKAVMKCP-PEKQMVAVEFASFGDPTGTCGNFTMGKCSASKSKEVVEK 407
>gi|449534351|ref|XP_004174126.1| PREDICTED: beta-galactosidase-like, partial [Cucumis sativus]
Length = 154
Score = 276 bits (706), Expect = 4e-71, Method: Composition-based stats.
Identities = 119/154 (77%), Positives = 138/154 (89%)
Query: 26 NVTYDHRAVVIGGKRRVLISGSIHYPRSTPEMWPDLIQKSKDGGLDVIETYVFWNLHEPV 85
+VTYDH+A++I G+RR+LISGSIHYPRSTP+MWPDLIQK+KDGGLD+IETYVFWN HEP
Sbjct: 1 SVTYDHKAIIINGQRRILISGSIHYPRSTPQMWPDLIQKAKDGGLDIIETYVFWNGHEPS 60
Query: 86 RNQYNFEGRYDLVKFVKLVAEAGLYAHLRIGPYVCAEWNFGGFPLWLHFIPGIQFRTDNE 145
++Y FE RYDLV+F+KLV +AGLY HLRIGPYVCAEWN+GGFPLWL F+PGI FRTDN
Sbjct: 61 PDKYYFEERYDLVRFIKLVQQAGLYVHLRIGPYVCAEWNYGGFPLWLKFVPGIAFRTDNA 120
Query: 146 PFKAEMQRFTAKIVDMMKQEKLYASQGGPIILSQ 179
PFKA MQ+F KIVDMMK EKL+ +QGGPIILSQ
Sbjct: 121 PFKAAMQKFVYKIVDMMKWEKLFHTQGGPIILSQ 154
>gi|413925746|gb|AFW65678.1| hypothetical protein ZEAMMB73_601729 [Zea mays]
Length = 402
Score = 274 bits (701), Expect = 1e-70, Method: Compositional matrix adjust.
Identities = 155/383 (40%), Positives = 219/383 (57%), Gaps = 16/383 (4%)
Query: 289 NYYMYHGGTNFDRTSGGPFISTSYDYDAPLDEYGLIRQPKWGHLKDLHKAIKLCEAALVA 348
NYYMYHGGTNF RTS + YD +APLDE+GL ++PKWGHL+DLH A+KLC+ AL+
Sbjct: 3 NYYMYHGGTNFGRTSAAFVMPKYYD-EAPLDEFGLYKEPKWGHLRDLHLALKLCKKALLW 61
Query: 349 TDPTYPSLGPNLEATVYKT-GSGLCSAFLANIGTNSDVTVKFNGNSYLLPAWSVSILPDC 407
+ LG EA V++ +C AFL+N T DVT+ F G SY +P S+SIL DC
Sbjct: 62 GKTSTEKLGKQFEARVFEIPEQKVCVAFLSNHNTKDDVTLTFRGQSYFVPRHSISILADC 121
Query: 408 KNVVFNTAKINSVTLVPSFSRQSLQVAADSSDAIGSGWSYINE---PVGISKDDAFTKPG 464
K VVF T +N+ ++++ A ++ + W +E P K G
Sbjct: 122 KTVVFGTQHVNA-----QHNQRTFHFADQTTQ--NNVWQMFDEEKVPKYKQSKIRLRKAG 174
Query: 465 LLEQINTTADQSDYLWYSLSTNIKADEPLLEDGSKTVLHVQSLGHALHAFINGKLVGSGY 524
L N T D++DY+WY+ S ++AD+ + KTVL V S GHA AF+N K VG G+
Sbjct: 175 DL--YNLTKDKTDYVWYTSSFKLEADDMPIRRDIKTVLEVNSHGHASVAFVNTKFVGCGH 232
Query: 525 GSSSNAKVTVDFPIALAPGKNTFDLLSLTVGLQNYGAFYEKTGAGITGPVQLKGSGNGTN 584
G+ N T++ P+ L G N +L+ T+G+ + GA+ E AG+ VQ+KG GT
Sbjct: 233 GTKMNKAFTLEKPMDLKKGVNHVAVLASTMGMMDSGAYLEHRLAGVD-RVQIKGLNAGT- 290
Query: 585 IDLSSQQWTYQTGLKGEELNFPSGSSTQWDSKSTLPKLQPLVWYKTTFDAPAGSEPVAID 644
+DL++ W + GL GE+ + + +PL WYK FD P+G +P+ +D
Sbjct: 291 LDLTNNGWGHIVGLVGEQKQIYTDKGMGSVTWKPAVNDRPLTWYKRHFDMPSGEDPIVLD 350
Query: 645 FTGMGKGEAWVNGQSIGRYWPTY 667
+ MGKG +VNGQ IGRYW +Y
Sbjct: 351 MSTMGKGLMFVNGQGIGRYWISY 373
>gi|62529271|gb|AAX84941.1| beta-galactosidase [Prunus persica]
Length = 287
Score = 271 bits (694), Expect = 9e-70, Method: Compositional matrix adjust.
Identities = 147/297 (49%), Positives = 185/297 (62%), Gaps = 11/297 (3%)
Query: 305 GPFISTSYDYDAPLDEYGLIRQPKWGHLKDLHKAIKLCEAALVATDPTYPSLGPNLEATV 364
GPF++TSYDYDAPLDEYGL R+PKWGHL+DLHKAIK E+ALV+ +P+ SLG EA V
Sbjct: 1 GPFMATSYDYDAPLDEYGLPREPKWGHLRDLHKAIKSSESALVSAEPSVTSLGNGQEAHV 60
Query: 365 YKTGSGLCSAFLANIGTNSDVTVKFNGNSYLLPAWSVSILPDCKNVVFNTAKINSVTLVP 424
+K+ SG C+AFLAN T S V F Y LP WS+SILPDCK V+NTA++ S
Sbjct: 61 FKSKSG-CAAFLANYDTKSSAKVSFGNGQYELPPWSISILPDCKTAVYNTARLGS----- 114
Query: 425 SFSRQSLQVAADSSDAIGSGWSYINEPVGISKDDAFTKPGLLEQINTTADQSDYLWYSLS 484
QS Q+ + S++ E + D T GL EQIN T D +DYLWY
Sbjct: 115 ----QSSQMKMTPVKSALPWQSFVEESASSDESDTTTLDGLWEQINVTRDTTDYLWYMTD 170
Query: 485 TNIKADEPLLEDGSKTVLHVQSLGHALHAFINGKLVGSGYGSSSNAKVTVDFPIALAPGK 544
I DE ++ G +L + S GHALH FING+L G+ YG+ N K+T + L G
Sbjct: 171 ITISPDEGFIKRGESPLLTIYSAGHALHVFINGQLSGTVYGALENPKLTFSQNVKLRSGI 230
Query: 545 NTFDLLSLTVGLQNYGAFYEKTGAGITGPVQLKGSGNGTNIDLSSQQWTYQTGLKGE 601
N LLS++VGL N G +E AG+ GPV LKG +GT D+S +WTY+TGLKGE
Sbjct: 231 NKLALLSISVGLPNVGLHFETWNAGVLGPVTLKGLNSGT-WDMSRWKWTYKTGLKGE 286
>gi|356554933|ref|XP_003545795.1| PREDICTED: beta-galactosidase 15-like [Glycine max]
Length = 288
Score = 270 bits (689), Expect = 3e-69, Method: Compositional matrix adjust.
Identities = 134/220 (60%), Positives = 156/220 (70%), Gaps = 4/220 (1%)
Query: 172 GGPIILSQIENEYGNIDSAYGAAGKSYIKWAAGMALSLDTGVPWVMCQQSDAPDPIINTC 231
G ++L + G I++ YG GK Y KWAA ALSL GVPWVMC+Q DAP II+TC
Sbjct: 29 GIHLVLGTVSLGVGAIENEYGKGGKEYRKWAAKKALSLGVGVPWVMCRQQDAPYDIIDTC 88
Query: 232 NGFYCDQFTPNSNNKPKMWTENWSGWFLSFGGAVPYRPVEDLAFAVARFFQRGGTFQNYY 291
N +YCD F PNS+NKP MWTENW GW+ +G +P+RPVEDLAFAVA FFQRGG+FQNYY
Sbjct: 89 NAYYCDGFKPNSHNKPTMWTENWDGWYTQWGERLPHRPVEDLAFAVACFFQRGGSFQNYY 148
Query: 292 MYHGGTNFDRTSGGPFISTSYDYDAPLDEYGLIRQPKWGHLKDLHKAIKLCEAALVATD- 350
MY G TNF RT+GGP TSYDY A +DEYG +R+PKWGHLKDLH A+KLCE ALVATD
Sbjct: 149 MYFGRTNFGRTAGGPLQITSYDYVASIDEYGQLREPKWGHLKDLHAALKLCEPALVATDS 208
Query: 351 PTYPSLGPNLEATVYKTGSGLCSAFLANIGTNSDVTVKFN 390
PTY LGPN E T S L S F + G + V F+
Sbjct: 209 PTYIKLGPNQEIG---TLSMLRSRFQSLPGAFNTCLVPFD 245
>gi|357483853|ref|XP_003612213.1| Beta-galactosidase [Medicago truncatula]
gi|355513548|gb|AES95171.1| Beta-galactosidase [Medicago truncatula]
Length = 418
Score = 268 bits (686), Expect = 8e-69, Method: Compositional matrix adjust.
Identities = 139/326 (42%), Positives = 190/326 (58%), Gaps = 39/326 (11%)
Query: 46 GSIHYPRSTPEMWPDLIQKSKDGGLDVIETYVFWNLHEPVRNQYNFEGRYDLVKFVKLVA 105
GS+HYPR PEMWPD+ +K+K Q+NFEG YDL+KF+K++
Sbjct: 11 GSVHYPRCPPEMWPDIFKKAK---------------------QFNFEGNYDLIKFIKMIG 49
Query: 106 EAGLYAHLRIGPYVCAEWNFGGFPLWLHFIPGIQFRTDNEPFKAEMQRFTAKIVDMMKQE 165
HL + + P+WL IP I FR+DN+PF M++FT I+ M+ E
Sbjct: 50 IMICMQHLEL------VHSLKELPIWLREIPNIIFRSDNQPFMYHMEQFTKMIIKKMRDE 103
Query: 166 KLYASQGGPIILSQIENEYGNIDSAYGAAGKSYIKWAAGMALSLDTGVPWVMCQQSDAPD 225
K + + QIENE+ + AY G Y++W MA+ LDTGVPW+MC+Q +A
Sbjct: 104 KFFPRK-------QIENEHTAVQQAYKEHGMRYVQWEGNMAVGLDTGVPWIMCKQVNALG 156
Query: 226 PIINTCNGFYC-DQFT-PNSNNKPKMWTENWSGWFLSFGGAVPYRPVEDLAFAVARFFQR 283
P++NTCNG YC D F+ PN N+ + ++ + +FG R ED+A AVARFF +
Sbjct: 157 PVMNTCNGRYCGDTFSGPNKNSHLNIHLRHYR--YRAFGDPPSERTAEDIAIAVARFFSK 214
Query: 284 GGTFQNYYMYHGGTNFDRTSGGPFISTSYDYDAPLDEYGLIRQPKWGHLKDLHKAIKLCE 343
GT NYYMY+GGTNF RTS F++T Y +AP+ EYGL R+PKWGH +DLH A+KLC+
Sbjct: 215 KGTMANYYMYYGGTNFGRTSSS-FVTTQYYDEAPIVEYGLPREPKWGHFRDLHDALKLCQ 273
Query: 344 AALVATDPTYPSLGPNLEATVYKTGS 369
AL+ LG +LE + GS
Sbjct: 274 KALLWGTQPVQMLGKDLEVGQKQFGS 299
Score = 73.9 bits (180), Expect = 3e-10, Method: Compositional matrix adjust.
Identities = 42/126 (33%), Positives = 64/126 (50%), Gaps = 6/126 (4%)
Query: 691 KNCGKPSQSLYHVPRSWLKSSGNTLVLFEEIGGDPTKISFVTKQLGSSLCSHVTDSHPLP 750
K G LYH PR+ L+ N LV+ EE+GG I +T ++CS + +P
Sbjct: 295 KQFGSYVSMLYHTPRAILQPKNNFLVVLEEMGGKLDGIEILTVN-RDTICSIAGEHYPPN 353
Query: 751 VDMWGSDSKIQRK----PGPVLSLECPNPNQVISSIKFASFGTPLGTCGSFSRGRCSSAR 806
V+ W + R P P +L C + N+ I+ + FAS+G P+G CG F G+C++
Sbjct: 354 VETWSRYKGVIRTNVDTPKPAANLVCLD-NKTITQVDFASYGDPVGNCGHFILGKCNAPN 412
Query: 807 SLSVVR 812
S +V
Sbjct: 413 SQKIVE 418
>gi|223942939|gb|ACN25553.1| unknown [Zea mays]
Length = 199
Score = 266 bits (680), Expect = 3e-68, Method: Compositional matrix adjust.
Identities = 131/201 (65%), Positives = 160/201 (79%), Gaps = 2/201 (0%)
Query: 648 MGKGEAWVNGQSIGRYWPTYVSQNGGCTDSCNYRGAYSSNKCLKNCGKPSQSLYHVPRSW 707
MGKGEAWVNGQSIGRYWPT ++ GC +SCNYRGAYSS+KCLK CG+PSQ+LYHVPRS+
Sbjct: 1 MGKGEAWVNGQSIGRYWPTNLAPQSGCVNSCNYRGAYSSSKCLKKCGQPSQTLYHVPRSF 60
Query: 708 LKSSGNTLVLFEEIGGDPTKISFVTKQLGSSLCSHVTDSHPLPVDMWGSDSKIQRKPGPV 767
L+ N LVLFE GGDP+KISFV +Q G S+C+ V+++HP +D W S +QR GP
Sbjct: 61 LQPGSNDLVLFEHFGGDPSKISFVMRQTG-SVCAQVSEAHPAQIDSWSSQQPMQRY-GPA 118
Query: 768 LSLECPNPNQVISSIKFASFGTPLGTCGSFSRGRCSSARSLSVVRQACVGSKSCSIGVSV 827
L LECP QVISS+KFASFGTP GTCGS+S G CSS ++LS+V++AC+G SCS+ VS
Sbjct: 119 LRLECPKEGQVISSVKFASFGTPSGTCGSYSHGECSSTQALSIVQEACIGVSSCSVPVSS 178
Query: 828 NTFGDPCKGVMKSLAVEASCT 848
N FG+PC GV KSLAVEA+C+
Sbjct: 179 NYFGNPCTGVTKSLAVEAACS 199
>gi|302144233|emb|CBI23471.3| unnamed protein product [Vitis vinifera]
Length = 315
Score = 266 bits (680), Expect = 4e-68, Method: Compositional matrix adjust.
Identities = 124/179 (69%), Positives = 147/179 (82%), Gaps = 2/179 (1%)
Query: 1 MASKEILLLVLCWGFVVLATTSFGANVTYDHRAVVIGGKRRVLISGSIHYPRSTPEMWPD 60
M +K+++LLVL V + + VTYDHRA+VI GKRRVL SGSIHYPRS PE+WP+
Sbjct: 136 MGNKDLVLLVLIA--VCVFEGCYCKTVTYDHRALVIDGKRRVLQSGSIHYPRSMPEVWPE 193
Query: 61 LIQKSKDGGLDVIETYVFWNLHEPVRNQYNFEGRYDLVKFVKLVAEAGLYAHLRIGPYVC 120
+I+KSK+GGLDVIETYVFWN HEPVR +Y FEGR+DLV+FVK V EAGL HLRIGPY C
Sbjct: 194 IIRKSKEGGLDVIETYVFWNNHEPVRGEYYFEGRFDLVRFVKTVQEAGLLVHLRIGPYAC 253
Query: 121 AEWNFGGFPLWLHFIPGIQFRTDNEPFKAEMQRFTAKIVDMMKQEKLYASQGGPIILSQ 179
AEWN+GGFP+WLHFIPGIQFRT N+ FK EM+RF AKIV +MK+ L+A QGGPIIL+Q
Sbjct: 254 AEWNYGGFPVWLHFIPGIQFRTTNDLFKNEMKRFLAKIVSLMKEANLFAPQGGPIILAQ 312
>gi|359496728|ref|XP_002268994.2| PREDICTED: beta-galactosidase 6-like, partial [Vitis vinifera]
Length = 177
Score = 264 bits (674), Expect = 2e-67, Method: Compositional matrix adjust.
Identities = 124/179 (69%), Positives = 147/179 (82%), Gaps = 2/179 (1%)
Query: 1 MASKEILLLVLCWGFVVLATTSFGANVTYDHRAVVIGGKRRVLISGSIHYPRSTPEMWPD 60
M +K+++LLVL V + + VTYDHRA+VI GKRRVL SGSIHYPRS PE+WP+
Sbjct: 1 MGNKDLVLLVLI--AVCVFEGCYCKTVTYDHRALVIDGKRRVLQSGSIHYPRSMPEVWPE 58
Query: 61 LIQKSKDGGLDVIETYVFWNLHEPVRNQYNFEGRYDLVKFVKLVAEAGLYAHLRIGPYVC 120
+I+KSK+GGLDVIETYVFWN HEPVR +Y FEGR+DLV+FVK V EAGL HLRIGPY C
Sbjct: 59 IIRKSKEGGLDVIETYVFWNNHEPVRGEYYFEGRFDLVRFVKTVQEAGLLVHLRIGPYAC 118
Query: 121 AEWNFGGFPLWLHFIPGIQFRTDNEPFKAEMQRFTAKIVDMMKQEKLYASQGGPIILSQ 179
AEWN+GGFP+WLHFIPGIQFRT N+ FK EM+RF AKIV +MK+ L+A QGGPIIL+Q
Sbjct: 119 AEWNYGGFPVWLHFIPGIQFRTTNDLFKNEMKRFLAKIVSLMKEANLFAPQGGPIILAQ 177
>gi|343963202|gb|AEM72517.1| beta-galactosidase [Diospyros kaki]
Length = 172
Score = 263 bits (673), Expect = 2e-67, Method: Compositional matrix adjust.
Identities = 119/172 (69%), Positives = 136/172 (79%)
Query: 126 GGFPLWLHFIPGIQFRTDNEPFKAEMQRFTAKIVDMMKQEKLYASQGGPIILSQIENEYG 185
GGFP+WL ++PGI FRTDNEPFK MQ FT KIV++MK E L+ SQGGPIILSQIENEYG
Sbjct: 1 GGFPVWLKYVPGISFRTDNEPFKNAMQGFTEKIVNLMKSENLFESQGGPIILSQIENEYG 60
Query: 186 NIDSAYGAAGKSYIKWAAGMALSLDTGVPWVMCQQSDAPDPIINTCNGFYCDQFTPNSNN 245
G AG Y+ WAA MA+ L TGVPWVMC++ DAPDP+INTCNGFYCD F+PN
Sbjct: 61 PQGKILGDAGHKYVTWAANMAVGLGTGVPWVMCKEEDAPDPVINTCNGFYCDSFSPNRPY 120
Query: 246 KPKMWTENWSGWFLSFGGAVPYRPVEDLAFAVARFFQRGGTFQNYYMYHGGT 297
KP +WTE WSGWF FGG + RPV+DLAFAVARF Q+GG+F NYYMYHGGT
Sbjct: 121 KPTIWTEAWSGWFTEFGGPIHERPVQDLAFAVARFIQKGGSFFNYYMYHGGT 172
>gi|343963204|gb|AEM72518.1| beta-galactosidase [Diospyros kaki]
Length = 173
Score = 262 bits (670), Expect = 5e-67, Method: Compositional matrix adjust.
Identities = 117/169 (69%), Positives = 136/169 (80%)
Query: 127 GFPLWLHFIPGIQFRTDNEPFKAEMQRFTAKIVDMMKQEKLYASQGGPIILSQIENEYGN 186
GF ++PGI FRTDN PFKA MQ+FT KIV+MMK EKL+ QGGPII+SQIENEYG
Sbjct: 3 GFSCLAQYVPGIAFRTDNGPFKAAMQKFTEKIVNMMKSEKLFEPQGGPIIMSQIENEYGP 62
Query: 187 IDSAYGAAGKSYIKWAAGMALSLDTGVPWVMCQQSDAPDPIINTCNGFYCDQFTPNSNNK 246
++ GA GKSY KWAA MA+ L+TGVPW+MC+Q DAPDP+I+TCNGFYC+ F PN N K
Sbjct: 63 VEWEIGAPGKSYTKWAAQMAVGLNTGVPWIMCKQEDAPDPVIDTCNGFYCEGFRPNKNYK 122
Query: 247 PKMWTENWSGWFLSFGGAVPYRPVEDLAFAVARFFQRGGTFQNYYMYHG 295
PKMWTENW+GW+ FGG PYRPVEDLAF+VARF Q G+F NYYMYHG
Sbjct: 123 PKMWTENWTGWYTKFGGPAPYRPVEDLAFSVARFIQNNGSFVNYYMYHG 171
>gi|452825532|gb|EME32528.1| beta-galactosidase [Galdieria sulphuraria]
Length = 752
Score = 261 bits (667), Expect = 1e-66, Method: Compositional matrix adjust.
Identities = 209/765 (27%), Positives = 356/765 (46%), Gaps = 82/765 (10%)
Query: 28 TYDHRAVVIGGKRRVLISGSIHYPRSTPEMWPDLIQKSKDGGLDVIETYVFWNLHEPVRN 87
++D RA+ + GKR +L+ GS+ YP+ W + ++ +K+ GL+ ++ YVFWN+HE R
Sbjct: 8 SFDSRAITLNGKRTLLLGGSLQYPKIHHTQWNNTLKLAKECGLNFLDIYVFWNVHEKKRG 67
Query: 88 QYNFEGRYDLVKFVKLVAEAGLYAHLRIGPYVCAEWNFGGFPLWLHFIPGIQFRTDNEPF 147
+ F D+ +F+++ + GL LR+GPY+CAE ++GGFP WL IPGIQFRT N+PF
Sbjct: 68 IFTFTEEADIFRFLQMAHQHGLLVMLRLGPYICAETSYGGFPCWLREIPGIQFRTYNDPF 127
Query: 148 KAEMQRFTAKIVDMMKQEKLYASQGGPIILSQIENEYGNIDSAYGAAGKSYIKWAAGMAL 207
E++R+ I ++K+++L+ QGGPI+L Q+ENEY + + G+ Y+ W +
Sbjct: 128 MREVKRWLFYITTLLKEKRLFFPQGGPIVLVQLENEYDLVSKIQLSKGEQYLNWYNELYR 187
Query: 208 SLDTGVPWVMCQQS-------------------DAPDPIINTCNGFYCDQ----FTPNSN 244
L VP +MC+ S + + I T N FY +
Sbjct: 188 ELAFDVPLIMCRSSPEEVGEFCSCSKEPELSTIASVETCIETFNSFYGHKKIADLRRRKP 247
Query: 245 NKPKMWTENWSGWFLSFGGAVPYRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNFDRTSG 304
++P +WTE W GW+ + A R ED+ +A RF +GG +YYM+HGGT+F+ +
Sbjct: 248 HQPILWTEFWIGWYDIWTSAPRKRSTEDVIYAALRFIAQGGAGFSYYMFHGGTHFNNLAM 307
Query: 305 GPFISTSYDYDAPLDEYGLIRQPK--WGHLKDLHKAIKLCEAALVATD-PTYPSLGPNLE 361
+TSY +D+P+DEYG +P + LK ++ + + L++ D P L P +
Sbjct: 308 YS-QTTSYYFDSPIDEYG---RPSFLFYMLKRINHILHQFSSHLLSQDHPQVLHLLPQVV 363
Query: 362 ATVYKTGSGLCS-AFLANIGTNSDVTVKFNGNSYLLPAWSVSILPDCKNVVFNTAKINSV 420
A +++ S S +FL N + + F + + SV++ + ++F+++
Sbjct: 364 AFIWQEHSSQQSLSFLCN-DSEQIAYIMFQQSMMKMNPLSVAVFLE-NELLFDSSSGYDW 421
Query: 421 TLVPSFSRQSLQVAADSSDAIGSGWSYINEPVGISKDDAFTKPGLLEQINTTADQSDYLW 480
+P + L+ A + + I P S D P +L + T D++DY+W
Sbjct: 422 Q-IPFRDFKPLERAYFRE--LKTFQLDIPIPPLSSSCDFSQLPDML---SVTQDETDYMW 475
Query: 481 YSLSTNIKADEPLLEDGSKTVLHVQSLGHALHAFINGKLVGSGY---------GSSSNAK 531
Y S + E + VL + +H FIN + +GS + + +
Sbjct: 476 YISSATLPVSSK--EFTCEKVLLQIEMADLIHLFINQQYMGSSWIKIDDERFANGKNGFR 533
Query: 532 VTVDF-------PIALAPGKNTFDLLSLTVGLQN------YGAFYEKTGAGITGPVQLKG 578
+++F P+ + K +L ++GL GA EK G+ +
Sbjct: 534 FSIEFENSVYPQPVFSSNSKLYVSILVCSLGLIKGEFQLWKGATMEKEKKGLFKQPIIHF 593
Query: 579 SGNGTNIDLSSQQWTYQTGLKGEELNFPSGSSTQWDSKSTLPKL-QPL----VWYKTTF- 632
+ ++ + ++ + L+ + + + + + +PL +YK T
Sbjct: 594 VVKHSELETETIPLSFTSSWAMMPLSIMKDHQSAFVKEYNIKNVDKPLSLGPTYYKQTVI 653
Query: 633 ----DAPAGSEPVAIDFTGMGKGEAWVNGQSIGRYWPTYVSQNGGCTDSCNYRGAYSSNK 688
A + IDF+ M KG N GRY Y Q G + R +
Sbjct: 654 INKAMIDALKWGLVIDFSSMTKGIFRWNSFCCGRY---YSIQVLGKERDPSLRNSPVQED 710
Query: 689 CLKNCGKPSQSLYHVPRSWLKSSGNTLVLFEEIGGD--PTKISFV 731
L K +Q YH+P+ L+ N L +FEEIGG+ +I FV
Sbjct: 711 HL---FKSTQRYYHIPKGVLQER-NELEVFEEIGGNFMQLRILFV 751
>gi|62321607|dbj|BAD95183.1| beta-galactosidase like protein [Arabidopsis thaliana]
Length = 275
Score = 261 bits (666), Expect = 1e-66, Method: Compositional matrix adjust.
Identities = 134/272 (49%), Positives = 184/272 (67%), Gaps = 17/272 (6%)
Query: 585 IDLSSQQWTYQTGLKGEELN--FPSGS-STQW-DSKSTLPKLQPLVWYKTTFDAPAGSEP 640
+DLS Q+WTYQ GLKGE +N FP+ + S W D+ T+ K QPL W+KT FDAP G+EP
Sbjct: 1 MDLSWQKWTYQVGLKGEAMNLAFPTNTPSIGWMDASLTVQKPQPLTWHKTYFDAPEGNEP 60
Query: 641 VAIDFTGMGKGEAWVNGQSIGRYWPTYVSQNGGCTDSCNYRGAYSSNKCLKNCGKPSQSL 700
+A+D GMGKG+ WVNG+SIGRYW + + G C+ C+Y G Y NKC CG+P+Q
Sbjct: 61 LALDMEGMGKGQIWVNGESIGRYWTAFAT--GDCSH-CSYTGTYKPNKCQTGCGQPTQRW 117
Query: 701 YHVPRSWLKSSGNTLVLFEEIGGDPTKISFVTKQLGSSLCSHVTDSHP----LPVDMWGS 756
YHVPR+WLK S N LV+FEE+GG+P+ +S V + + S +C+ V++ HP ++ +G
Sbjct: 118 YHVPRAWLKPSQNLLVIFEELGGNPSTVSLVKRSV-SGVCAEVSEYHPNIKNWQIESYGK 176
Query: 757 DSKIQRKPGPVLSLECPNPNQVISSIKFASFGTPLGTCGSFSRGRCSSARSLSVVRQACV 816
R P + L+C +P Q I+SIKFASFGTPLGTCGS+ +G C +A S +++ + CV
Sbjct: 177 GQTFHR---PKVHLKC-SPGQAIASIKFASFGTPLGTCGSYQQGECHAATSYAILERKCV 232
Query: 817 GSKSCSIGVSVNTFG-DPCKGVMKSLAVEASC 847
G C++ +S + FG DPC V+K L VEA C
Sbjct: 233 GKARCAVTISNSNFGKDPCPNVLKRLTVEAVC 264
>gi|449018329|dbj|BAM81731.1| probable beta-galactosidase [Cyanidioschyzon merolae strain 10D]
Length = 777
Score = 260 bits (664), Expect = 3e-66, Method: Compositional matrix adjust.
Identities = 220/776 (28%), Positives = 348/776 (44%), Gaps = 129/776 (16%)
Query: 26 NVTYDHRAVVIGGKRRVLISGSIHYPRSTPEMWPDLIQKSKDGGLDVIETYVFWNLHE-- 83
+TYD R++ I GK +SG++HY RS P WP + + + GL+ +ETYVFW HE
Sbjct: 9 EITYDSRSLRINGKPFFCLSGAVHYVRSHPSAWPQIFRCMRRDGLNTVETYVFWGDHEFE 68
Query: 84 -----PVRNQYNFEGRYDLVKFVKLVAEAGLYAHLRIGPYVCAEWNFGGFPLWLHFI--- 135
+ +F G DLV+F++ GL A LR+GPYVCAE N+GGFP WL +
Sbjct: 69 PPEMPDAEPRADFSGPRDLVRFLRCAKLHGLNAILRLGPYVCAEVNYGGFPWWLRQVCEK 128
Query: 136 ---PGIQFRTDNEPFKAEMQRFTAKIVD-MMKQEKLYASQGGPIILSQIENEYGNIDSAY 191
++FRT + + A+++R+ +VD ++K +++A QGGP+IL+QIENEY I +Y
Sbjct: 129 GSSKPVRFRTWDPAYCAQVERWLKYLVDHVLKPARVFAPQGGPVILAQIENEYAMIAESY 188
Query: 192 GAAGKSYIKWAAGMALSLDTGVPWVMCQQSDAPDP--IINTCNGFYCDQFTPN------S 243
G G+ Y+ W A +A L GVP VMC + + +I T N FY + + +
Sbjct: 189 GPDGQQYLDWIASLANQLALGVPLVMCYGASQRESGRVIETINAFYAHEHVESLRRAQGA 248
Query: 244 NNKPKMWTENWSGWFLSFGGAVPYRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNFDRTS 303
N +P +WTE W+GW+ +G R DLA+AV RF GG NYYMY GGTN+ R +
Sbjct: 249 NPQPLLWTECWTGWYDVWGAPHHRRDAADLAYAVLRFLAAGGAGINYYMYFGGTNWRREN 308
Query: 304 GGPFISTSYDYDAPLDEYGLIRQPKWGHLKDLHKAIKLCEAALVATDPTYPSLGPNLEAT 363
+TSYDYDAPL+EY ++ K HL+ LH++I + L D ++
Sbjct: 309 TMYLQATSYDYDAPLNEY-VMETTKSRHLRRLHESI---QPFLSDRDGVL-----DMSRL 359
Query: 364 VYKTGSGLCSAFLANIGTNSDVTVKFNGNSYLLPAWSVSILPDCKNVVFNTAKINSVTLV 423
K G A L T S G++ SV + D ++ + A +V
Sbjct: 360 ELKVFEGERRAILYERSTVS-------GDADHRSEESVRCVFDSADIRVHLALELREIIV 412
Query: 424 PSFSRQSLQVAADSSDAIGSGWSYINEPVGIS---KDDAFTKPGLLEQINTTADQSDYLW 480
+ SR + Q W + EP + D + T + + ++ TA SDY W
Sbjct: 413 NAASRDTGQ---------DLRWRMLPEPPPLRAALSDTSATLATIPDLVDATAGTSDYAW 463
Query: 481 YSLSTNIKADEPLLEDGSKTVLHVQSLGHALHAFINGKLVGSG----------YGSSSNA 530
Y L LL+ L V G K V G +
Sbjct: 464 YILRCPTAQGSGLLQ------LEVADFGRVWRR----KAVDQGDDAERQPLEWAAAGPEP 513
Query: 531 KVTVDFPIALAPGKNTFDLLSL--------------TVGLQN--------YGAFYEKTGA 568
V FP A + + ++ + ++G+ YG E+ G
Sbjct: 514 PVEDRFPNAWNSTEYGYGIVEVGAIDCHEEYVVLVSSLGMVKGDWQLPPGYGMARERKG- 572
Query: 569 GITGPVQLKGSGNGTNIDLSSQQWT------YQTGLKGEEL-NFPSGSSTQ----WDSKS 617
L + +++ + +W + GL+GE + + G + W +
Sbjct: 573 -------LLRASYRSDVTFADDEWRDALVVGFAAGLRGERIRSVIEGDADAYPYLWTPQK 625
Query: 618 TLPKLQPLVW---YKTTFDAPA----GSEPVAIDF--TGMGKGEAWVNGQSIGRYWPTY- 667
+ W Y+ + P +E + +D +G+ KG ++NG+ GR+W +
Sbjct: 626 AALSGRRFSWPRWYRASLAIPPPNADETEGIILDLYESGVEKGWIYMNGEPCGRHWRVHG 685
Query: 668 -VSQNGGCTDSCNYRGAYSSNKCLKNCGKPSQSLYHVPRSWLKSSG--NTLVLFEE 720
+ +NG +G + G+P+Q +++P L + G +TLV+F+E
Sbjct: 686 TMPKNGFLR-----QGDQEAPIEQVGHGQPTQRYFYIPPWHLHAKGRPSTLVIFDE 736
>gi|218188529|gb|EEC70956.1| hypothetical protein OsI_02569 [Oryza sativa Indica Group]
Length = 480
Score = 258 bits (658), Expect = 1e-65, Method: Compositional matrix adjust.
Identities = 151/331 (45%), Positives = 194/331 (58%), Gaps = 20/331 (6%)
Query: 521 GSGYGSSSNAKVTVDFPIALAPGKNTFDLLSLTVGLQNYGAFYEKTGAGITGPVQLKGSG 580
G+ YGS + K+T + L G NT LS+ VGL N G +E AGI GPV L G
Sbjct: 165 GTVYGSVDDPKLTYTGNVKLWAGSNTISCLSIAVGLPNVGEHFETWNAGILGPVTLDGLN 224
Query: 581 NGTNIDLSSQQWTYQTGLKGEE--LNFPSGSST-QWDSKSTLPKLQPLVWYKTTFDAPAG 637
G DL+ Q+WTYQ GLKGE L+ SGSST +W F+AP G
Sbjct: 225 EGRR-DLTWQKWTYQVGLKGESTTLHSLSGSSTVEWGEPVQNASNMAF------FNAPDG 277
Query: 638 SEPVAIDFTGMGKGEAWVNGQSIGRYWPTYVSQNGGCTDSCNYRGAYSSNKCLKNCGKPS 697
EP+A+D + MGKG+ W+NGQ IGRYWP Y + +G C +C+YRG Y KC NCG S
Sbjct: 278 DEPLALDMSSMGKGQIWINGQGIGRYWPGYKA-SGNC-GTCDYRGEYDETKCQTNCGDSS 335
Query: 698 QSLYHVPRSWLKSSGNTLVLFEEIGGDPTKISFVTKQLGSSLCSHVTDSHPLPVDMWGSD 757
Q YHVPRSWL +GN LV+FEE GGDPT IS V + +G S+C+ V++ P + W +
Sbjct: 336 QRWYHVPRSWLSPTGNLLVIFEEWGGDPTGISMVKRSIG-SVCADVSEWQP-SMKNWHTK 393
Query: 758 SKIQRKPGPVLSLECPNPNQVISSIKFASFGTPLGTCGSFSRGRCSSARSLSVVRQACVG 817
+ K + L+C N Q I+ IKFASFGTP G+CGS++ G C + +S + + CVG
Sbjct: 394 DYEKAK----VHLQCDN-GQKITEIKFASFGTPQGSCGSYTEGGCHAHKSYDIFWKNCVG 448
Query: 818 SKSCSIGVSVNTF-GDPCKGVMKSLAVEASC 847
+ C + V F GDPC G MK VEA C
Sbjct: 449 QERCGVSVVPEIFGGDPCPGTMKRAVVEAIC 479
Score = 207 bits (527), Expect = 2e-50, Method: Compositional matrix adjust.
Identities = 90/143 (62%), Positives = 113/143 (79%)
Query: 151 MQRFTAKIVDMMKQEKLYASQGGPIILSQIENEYGNIDSAYGAAGKSYIKWAAGMALSLD 210
MQ+FT KIV+MMK E L+ QGGPIILSQIENE+G ++ G K+Y WAA MA++L+
Sbjct: 1 MQKFTTKIVEMMKSEGLFEWQGGPIILSQIENEFGPLEWDQGEPAKAYASWAANMAVALN 60
Query: 211 TGVPWVMCQQSDAPDPIINTCNGFYCDQFTPNSNNKPKMWTENWSGWFLSFGGAVPYRPV 270
T VPW+MC++ DAPDPIINTCNGFYCD F+PN +KP MWTE W+ W+ FG VP+RPV
Sbjct: 61 TSVPWIMCKEDDAPDPIINTCNGFYCDWFSPNKPHKPTMWTEAWTAWYTGFGIPVPHRPV 120
Query: 271 EDLAFAVARFFQRGGTFQNYYMY 293
EDLA+ VA+F Q+GG+F NYYM+
Sbjct: 121 EDLAYGVAKFIQKGGSFVNYYMF 143
>gi|294948459|ref|XP_002785761.1| beta-galactosidase, putative [Perkinsus marinus ATCC 50983]
gi|239899809|gb|EER17557.1| beta-galactosidase, putative [Perkinsus marinus ATCC 50983]
Length = 770
Score = 257 bits (657), Expect = 2e-65, Method: Compositional matrix adjust.
Identities = 232/782 (29%), Positives = 353/782 (45%), Gaps = 145/782 (18%)
Query: 26 NVTYDHRAVVIGGKRRVLISGSIHYPRSTPEMWPDLIQKSKDGGLDVIETYVFWNLHEP- 84
+VTYD RA I G R +L+ GSIHYPR + W ++++ GL+ ++ YVFWN HEP
Sbjct: 50 SVTYDSRAFKIDGVRTLLLGGSIHYPRVAVDEWEPMLEEMGRDGLNHVQLYVFWNYHEPR 109
Query: 85 ----------VRNQYNFEGRYDLVKFVKLVAEAGLYAHLRIGPYVCAEWNFGGFPLWLHF 134
+ ++Y+F GR DL+ F++ A+ L+ LRIGPYVCAEW FGG PLWL
Sbjct: 110 PPRYDQLKDRLEHKYDFSGRGDLLGFIRAAAKKDLFVSLRIGPYVCAEWAFGGLPLWLRD 169
Query: 135 IPGIQFRT--------------------DNEPFKAEMQRFTAKIVDMMKQEKLYASQGGP 174
+ G+ FR+ +P++ M F +I M+K+ L A+QGGP
Sbjct: 170 VEGMCFRSICGYNGSPGKCKPWEGGKFRSCDPWRKYMADFVMEIGRMVKEANLMAAQGGP 229
Query: 175 IILSQIENEYGNIDSAYGAAGKSYIKWAAGMALSLDTGVPWVMCQQSDAPDPIINTCNGF 234
+IL Q+ENEYG+ + AG++YI W ++ L VPWVMC A + +N CNG
Sbjct: 230 VILGQLENEYGH----HSDAGRAYIDWVGELSFGLGLDVPWVMCNGISA-NGTLNVCNGD 284
Query: 235 YC-DQFTPNSNNK----PKMWTENWSGWFLSFGGAV--PYRPVEDLAFAVARFFQRGGTF 287
C D++ + + + P WTEN GWF ++GGAV R E++A+ +A++ GG+
Sbjct: 285 DCADEYKTDHDKRWPDEPLGWTEN-EGWFDTWGGAVGNSKRSAEEMAYVLAKWVAVGGSH 343
Query: 288 QNYYMYHGGTNFDRTSGGPFISTSYDYDAPLDEYGLIRQPKWGHLKDLHKAIKLCEAALV 347
NYYM++GG + + G ++ +Y GL +PK HL+ LH+ + L+
Sbjct: 344 HNYYMWYGGNHLAQW-GAASLTNAYADGVNFHSNGLPNEPKRSHLQRLHEVLGKLNGELM 402
Query: 348 ATDPTYPSLGPNLE--ATVYKTGSGLCSAFLANIG-TNSDVTVKFNGNSYLLPAWSVSIL 404
+ + + LE VY+ +GL AFL + S V V + +Y + V ++
Sbjct: 403 QVEDRHSVMPVQLENGVEVYEWTAGL--AFLHRPACSGSPVEVHYAKATYSIACREVLVV 460
Query: 405 -PDCKNVVFNTAKINSVTLVPSFSRQSLQVAADSSDAIGSGWSYINEPV--GISKDDAFT 461
P V+F TA SV P R+ VA ++D WS E + G++ +
Sbjct: 461 DPSSSTVLFATA---SVEPPPELVRRV--VATLTADR----WSMRKEELLHGMATVEG-R 510
Query: 462 KPGLLEQINTTADQSDYLWYSLSTNIKADEPL----LEDGSKT--VLHVQSLGHALHAFI 515
+P +E + + +DY+ Y T + A E + LE S+ V HV S+ +A
Sbjct: 511 EP--VEHLRVSGLDTDYVTY--KTTVTATEGVTNVSLEIDSRISQVFHV-SVDNASSLAA 565
Query: 516 NGKLVGSGYGSSSNAKVTVDFPIALAPGKNTFDL--LSLTVGLQNYGAFYEKTGA----- 568
V G N + T + T+DL LS ++G++N G Y A
Sbjct: 566 TVMDVNKG-----NTEWTAVAQLHNLTAGRTYDLWILSESLGVEN-GMLYGAPAATEPSL 619
Query: 569 --GITGPVQLKGSGNGTNIDLSSQQWTYQTGLKGEELNFPSGSSTQWDSKSTLPKLQPL- 625
GI G ++L + +W+ GL GE G K+ LP L
Sbjct: 620 QKGIFGDIRL------NEKSIRKGRWSMVKGLDGE---VDGGQ-----GKAELPCCDSLG 665
Query: 626 -VWYKTTF-----DAPAGSEPVAIDFTGMGKGEAWVNGQSIGRYWPTYVSQNGGCTDSCN 679
W+ F + + S + + G W+NG IGR+ GG
Sbjct: 666 PAWFVAGFTLHSVRSKSISLTLPLGLPQQAGGHIWLNGVDIGRW-----RAVGG------ 714
Query: 680 YRGAYSSNKCLKNCGKPSQSLYHVPRSWLKSSGNTLVLF-------EEIGGDPTKISFVT 732
Q+ Y +P LK N L +F E GG PT +
Sbjct: 715 -----------------RQASYRLPSDVLKRGSNRLAVFSATGHWVSEQGGPPTVVEEFY 757
Query: 733 KQ 734
K+
Sbjct: 758 KK 759
>gi|16973314|emb|CAC84109.1| putative galactosidae, partial [Gossypium hirsutum]
Length = 383
Score = 256 bits (654), Expect = 4e-65, Method: Compositional matrix adjust.
Identities = 163/419 (38%), Positives = 218/419 (52%), Gaps = 45/419 (10%)
Query: 316 APLDEYGLIRQPKWGHLKDLHKAIKLCEAALVATDPTYPSLGPNLEATVYKT-GSGLCSA 374
PLDE+GL R+PKWGHLKD+H+A+ LC+ AL PT LGP+ +A V++ G+ C+A
Sbjct: 4 GPLDEFGLQREPKWGHLKDVHRALSLCKRALFWGFPTTLKLGPDQQAIVWQQPGTSACAA 63
Query: 375 FLANIGTNSDVTVKFNGNSYLLPAWSVSILPDCKNVVFNTAKINSVTLVPSFSRQSLQVA 434
LAN T V F G LPA S+S+LPDCK VVFNT + + +F R ++A
Sbjct: 64 LLANNNTRLAQHVNFRGQDIRLPARSISVLPDCKTVVFNTQLVTTQHNSRNFVRS--EIA 121
Query: 435 ADSSDAIGSGWSYINE--PVGISKDDAFTKPGLLEQINTTADQSDYLWYSLSTNIKADEP 492
+ + W E PVG+ F P E + T D +DY WY+ S + +
Sbjct: 122 NKNFN-----WEMYREVPPVGLGFK--FDVP--RELFHLTKDTTDYAWYTTSLLLGRRDL 172
Query: 493 LLEDGSKTVLHVQSLGHALHAFINGKLVGSGYGSSSNAKVTVDFPIALAPGKNTFDLLSL 552
++ + VL V SLGH +HA++NG+ GS +GS +L G+N LL
Sbjct: 173 PMKKNVRPVLRVASLGHGIHAYVNGEYAGSAHGSKVEKSFVCRELSSLKEGENHIALLGY 232
Query: 553 TVGLQNYGAFYEKTGAGITGPVQLKGSGNGTNIDLSSQQWTYQTGLKGEELNF---PSGS 609
VGL + GA+ EK AG + + G GT +D+S W +Q G GE+
Sbjct: 233 LVGLPDSGAYMEKRFAGPRS-ITILGLNTGT-LDISQNGWGHQVGTDGEKKKLFTEEGSK 290
Query: 610 STQWDSKSTLPKLQPLVWYKTTFDAPAGSEPVAIDFTGMGKGEAWVNGQSIGRYWPTYVS 669
S QW + PL WYK FDAP G PVAI TGMGKG WVNG+SIGRYW Y+S
Sbjct: 291 SVQWTKPD---QGGPLTWYKGYFDAPEGDNPVAIVMTGMGKGMVWVNGRSIGRYWNNYLS 347
Query: 670 QNGGCTDSCNYRGAYSSNKCLKNCGKPSQSLYHVPRSWLKSSGNTLVLFEEIGGDPTKI 728
KP+QS YH+PR++LK N +VL EE GG+P +
Sbjct: 348 P----------------------LKKPTQSEYHIPRAYLKPK-NLIVLLEEEGGNPKDV 383
>gi|62319263|dbj|BAD94489.1| beta-galactosidase [Arabidopsis thaliana]
Length = 172
Score = 251 bits (640), Expect = 2e-63, Method: Compositional matrix adjust.
Identities = 112/160 (70%), Positives = 133/160 (83%), Gaps = 1/160 (0%)
Query: 205 MALSLDTGVPWVMCQQSDAPDPIINTCNGFYCDQFTPNSNNKPKMWTENWSGWFLSFGGA 264
MAL L TGVPW+MC+Q DAP PII+TCNG+YC+ F PNS NKPKMWTENW+GW+ FGGA
Sbjct: 1 MALGLSTGVPWIMCKQEDAPGPIIDTCNGYYCEDFKPNSINKPKMWTENWTGWYTDFGGA 60
Query: 265 VPYRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNFDRTSGGPFISTSYDYDAPLDEYGLI 324
VPYRPVED+A++VARF Q+GG+ NYYMYHGGTNFDRT+ G F+++SYDYDAPLDEYGL
Sbjct: 61 VPYRPVEDIAYSVARFIQKGGSLVNYYMYHGGTNFDRTA-GEFMASSYDYDAPLDEYGLP 119
Query: 325 RQPKWGHLKDLHKAIKLCEAALVATDPTYPSLGPNLEATV 364
R+PK+ HLK LHKAIKL E AL++ D T SLG E T+
Sbjct: 120 REPKYSHLKALHKAIKLSEPALLSADATVTSLGAKQEVTI 159
>gi|300122832|emb|CBK23839.2| unnamed protein product [Blastocystis hominis]
Length = 601
Score = 249 bits (636), Expect = 4e-63, Method: Compositional matrix adjust.
Identities = 191/663 (28%), Positives = 297/663 (44%), Gaps = 99/663 (14%)
Query: 113 LRIGPYVCAEWNFGGFPLWLHFIPGIQFRTDNEPFKAEMQRFTAKIVDMMKQEKLYASQG 172
+RIGPYVCAEW+ GG P+W++++ G++ R +N+ +K EM + + D + +A +G
Sbjct: 1 MRIGPYVCAEWDNGGIPVWVNYLDGVRLRANNDVWKKEMGDWMKVLTDYTRD--FFADRG 58
Query: 173 GPIILSQIENEYGNIDSAYGAAGKSYIKWAAGMALSLDTGVPWVMCQQSDAPDPIINTCN 232
GPII SQIENE +G A + YI W A SL+ VPW+MC D + IN CN
Sbjct: 59 GPIIFSQIENE------LWGGA-REYIDWCGEFAESLELNVPWMMC-NGDTSEKTINACN 110
Query: 233 GFYCDQFTPNSN-------NKPKMWTENWSGWFLSFGGAVPYRP---------VEDLAFA 276
G C + + ++P WTEN GWF G A R ED F
Sbjct: 111 GNDCSSYLESHGQSGRILVDQPGCWTEN-EGWFQIHGAASAERDDYEGWDARSAEDYTFN 169
Query: 277 VARFFQRGGTFQNYYMYHGGTNFDRTSGGPFISTSYDYDAPLDEYGLIRQPKWGHLKDLH 336
V +F RGG++ NYYM+ GG ++ + +G ++ Y + L +PK H +H
Sbjct: 170 VLKFMDRGGSYHNYYMWFGGNHYGKWAGNG-MTNWYTNGVMIHSDTLPNEPKHSHTAKMH 228
Query: 337 KAIKLCEAALVATDPTYPSLG----PNLEATVYKTGSGLCSAFLANIGTNSDVTVKFNGN 392
+ + L+ + N A Y+ G L S F+ N ++D + +
Sbjct: 229 RMLANIAEVLLNDKAQVNNQKHLNCDNCNAFEYRYGDRLVS-FVENSKGSADKVI-YRDI 286
Query: 393 SYLLPAWSVSILPDCKNVVFNTAKINSVTLVPSFS-RQSLQVAADSSDAIGSGWSYINEP 451
Y LPAWS+ +L + NV+F T + V + + L+ Y NEP
Sbjct: 287 VYELPAWSMIVLDEYDNVLFETNNVKPVNKHRVYHCEEKLEF------------EYWNEP 334
Query: 452 VGISKDDA---FTKPGLLEQINTTADQSDYLWYSLSTNIKADEPLLEDGSKTVLHVQSLG 508
V +A P EQ+N T D +++L+Y DE L G
Sbjct: 335 VSTLSQEAPRVVVSPKANEQLNMTRDLTEFLYYETEVEFPQDECTLSIGGTDA------- 387
Query: 509 HALHAFINGKLVGS-GYGSSSNAKVTVDFPIALAPGKNTFDLLSLTVGLQN------YGA 561
+A A+++ VGS + + T++ + GK+ LLS ++G+ N +
Sbjct: 388 NAFVAYVDDHFVGSDDEHTHHDGWHTMNINMKSGKGKHKLVLLSESLGVSNGMDSNLDPS 447
Query: 562 FYEKTGAGITGPVQLKGSGNGTNIDLSSQQWTYQTGLKGEELN-FPSGSSTQWDSKSTLP 620
+ GI G ++L G+ D+ +Q+W + GL GE F KS +
Sbjct: 448 WASSRLKGICGWIKLCGN------DIFNQEWKHYPGLVGEAKQVFTDEGMKTVTWKSDVE 501
Query: 621 KLQPLVWYKTTFDAPAGSE---PVAIDFTGMGKGEAWVNGQSIGRYWPTYVSQNGGCTDS 677
L WY++TF P G + V + GM +G+A+ NG +IGRYW + ++G
Sbjct: 502 NADNLAWYRSTFKTPQGLKRGIEVLLRPEGMNRGQAYANGHNIGRYW---MIKDGN---- 554
Query: 678 CNYRGAYSSNKCLKNCGKPSQSLYHVPRSWLKSSG--NTLVLFEEIGGDPTKISFVTKQL 735
G+ +Q YH+P+ WLK G N LVL E +G ++ T +
Sbjct: 555 ----------------GEYTQGFYHIPKDWLKGEGEENVLVLGETLGASDPSVTICTTEY 598
Query: 736 GSS 738
S+
Sbjct: 599 VSN 601
>gi|223945899|gb|ACN27033.1| unknown [Zea mays]
Length = 296
Score = 245 bits (626), Expect = 7e-62, Method: Compositional matrix adjust.
Identities = 136/299 (45%), Positives = 172/299 (57%), Gaps = 9/299 (3%)
Query: 438 SDAIGSGWSYINEPVGISKDDAFTKPGLLEQINTTADQSDYLWYSLSTNIKADEPLLEDG 497
S A G W +E AFTK GL+EQ++ T D+SDYLWY+ NI ++E L+ G
Sbjct: 2 SPAGGFSWQSYSEATNSLDGRAFTKDGLVEQLSMTWDKSDYLWYTTYVNINSNEQFLKSG 61
Query: 498 SKTVLHVQSLGHALHAFINGKLVGSGYGSSSNAKVTVDFPIALAPGKNTFDLLSLTVGLQ 557
L + S GH+L F+NG+ G+ YG + K+T + + G N +LS VGL
Sbjct: 62 QWPQLTIYSAGHSLQVFVNGQSYGAVYGGYDSPKLTYSGYVKMWQGSNKISILSAAVGLP 121
Query: 558 NYGAFYEKTGAGITGPVQLKGSGNGTNIDLSSQQWTYQTGLKGEELNFPS---GSSTQWD 614
N G YE G+ GPV L G G DLS Q+WTYQ GL GE L S SS +W
Sbjct: 122 NQGTHYETWNVGVLGPVTLSGLNEGKR-DLSDQKWTYQIGLHGESLGVQSVAGSSSVEWG 180
Query: 615 SKSTLPKLQPLVWYKTTFDAPAGSEPVAIDFTGMGKGEAWVNGQSIGRYWPTYVSQNGGC 674
S + QPL W+K F AP+G PVA+D MGKG+AWVNG+ IGRYW +Y + + GC
Sbjct: 181 SAA---GKQPLTWHKAYFSAPSGDAPVALDMGSMGKGQAWVNGRHIGRYW-SYKASSSGC 236
Query: 675 TDSCNYRGAYSSNKCLKNCGKPSQSLYHVPRSWLKSSGNTLVLFEEIGGDPTKISFVTK 733
C+Y G YS KC CG SQ YHVPRSWL SGN LV+ EE GGD + + VT+
Sbjct: 237 -GGCSYAGTYSETKCQTGCGDVSQRYYHVPRSWLNPSGNLLVMLEEFGGDLSGVKLVTR 294
>gi|62321782|dbj|BAD95407.1| galactosidase [Arabidopsis thaliana]
Length = 270
Score = 244 bits (623), Expect = 1e-61, Method: Compositional matrix adjust.
Identities = 130/272 (47%), Positives = 174/272 (63%), Gaps = 9/272 (3%)
Query: 581 NGTNIDLSSQQWTYQTGLKGEELNFPSGSS---TQWDSKSTLPKLQPLVWYKTTFDAPAG 637
NG DLS Q+WTY+ GLKGE L+ S S +W + + + QPL WYKTTF APAG
Sbjct: 2 NGGRRDLSWQKWTYKVGLKGESLSLHSLSGSSSVEWAEGAFVAQKQPLTWYKTTFSAPAG 61
Query: 638 SEPVAIDFTGMGKGEAWVNGQSIGRYWPTYVSQNGGCTDSCNYRGAYSSNKCLKNCGKPS 697
P+A+D MGKG+ W+NGQS+GR+WP Y + G C++ C+Y G + +KCL+NCG+ S
Sbjct: 62 DSPLAVDMGSMGKGQIWINGQSLGRHWPAYKAV-GSCSE-CSYTGTFREDKCLRNCGEAS 119
Query: 698 QSLYHVPRSWLKSSGNTLVLFEEIGGDPTKISFVTKQLGSSLCSHVTDSHPLPVD-MWGS 756
Q YHVPRSWLK SGN LV+FEE GGDP I+ V +++ S+C+ + + V+ +
Sbjct: 120 QRWYHVPRSWLKPSGNLLVVFEEWGGDPNGITLVRREV-DSVCADIYEWQSTLVNYQLHA 178
Query: 757 DSKIQRKPGPVLSLECPNPNQVISSIKFASFGTPLGTCGSFSRGRCSSARSLSVVRQACV 816
K+ + P L+C P Q I+++KFASFGTP GTCGS+ +G C + S + CV
Sbjct: 179 SGKVNKPLHPKAHLQC-GPGQKITTVKFASFGTPEGTCGSYRQGSCHAHHSYDAFNKLCV 237
Query: 817 GSKSCSIGVSVNTF-GDPCKGVMKSLAVEASC 847
G CS+ V+ F GDPC VMK LAVEA C
Sbjct: 238 GQNWCSVTVAPEMFGGDPCPNVMKKLAVEAVC 269
>gi|183604893|gb|ACC64533.1| beta-galactosidase 11 [Oryza sativa Indica Group]
Length = 446
Score = 243 bits (620), Expect = 3e-61, Method: Compositional matrix adjust.
Identities = 148/472 (31%), Positives = 230/472 (48%), Gaps = 41/472 (8%)
Query: 383 SDVTVKFNGNSYLLPAWSVSILPDCKNVVFNTAKINSVTLVPSFSRQSLQVAADSSDAIG 442
D TV F G + +P+ SVSIL DCK VV+NT ++ S +S ++S
Sbjct: 1 EDGTVVFRGEKFYVPSRSVSILADCKTVVYNTKRV-----FVQHSERSFHTTDETSK--N 53
Query: 443 SGWSYINEPVGISKDDAFTKPGLLEQINTTADQSDYLWYSLSTNIKADEPLLEDGSKTVL 502
+ W +E + + LEQ N T D SDYLWY+ S +++D+ + V+
Sbjct: 54 NVWEMYSEAIPKFRKTKVRTKQPLEQYNQTKDTSDYLWYTTSFRLESDDLPFRRDIRPVI 113
Query: 503 HVQSLGHALHAFINGKLVGSGYGSSSNAKVTVDFPIALAPGKNTFDLLSLTVGLQNYGAF 562
++S HA+ F N VG+G GS + P+ L G N +LS ++G+++ G
Sbjct: 114 QIKSTAHAMIGFANDAFVGTGRGSKREKSFVFEKPMDLRVGINHIAMLSSSMGMKDSGGE 173
Query: 563 YEKTGAGITGPVQLKGSGNGTNIDLSSQQWTYQTGLKGEELN-FPSGSSTQWDSKSTLPK 621
+ GI V ++G GT +DL W ++ L+GE+ + Q+ K
Sbjct: 174 LVEVKGGIQDCV-VQGLNTGT-LDLQGNGWGHKARLEGEDKEIYTEKGMAQFQWKPAEND 231
Query: 622 LQPLVWYKTTFDAPAGSEPVAIDFTGMGKGEAWVNGQSIGRYWPTYVSQNGGCTDSCNYR 681
L P+ WYK FD P G +P+ +D + M KG +VNG+ IGRYW ++++
Sbjct: 232 L-PITWYKRYFDEPDGDDPIVVDMSSMSKGMIYVNGEGIGRYWTSFITL----------- 279
Query: 682 GAYSSNKCLKNCGKPSQSLYHVPRSWLKSSGNTLVLFEEIGGDPTKISFVTKQLGSSLCS 741
G PSQS+YH+PR++LK GN L++FEE G P I T + +C
Sbjct: 280 -----------AGHPSQSVYHIPRAFLKPKGNLLIIFEEELGKPGGILIQTVRR-DDICV 327
Query: 742 HVTDSHPLPVDMWGSD----SKIQRKPGPVLSLECPNPNQVISSIKFASFGTPLGTCGSF 797
+++ +P + W SD I +L CP P + I + FASFG P G CG+F
Sbjct: 328 FISEHNPAQIKTWESDGGQIKLIAEDTSTRGTLNCP-PKRTIQEVVFASFGNPEGACGNF 386
Query: 798 SRGRCSSARSLSVVRQACVGSKSCSIGVSVNTFGDP--CKGVMKSLAVEASC 847
+ G C + + ++V + C+G +SC + V +G C +LAV+ C
Sbjct: 387 TAGTCHTPDAKAIVEKECLGKESCVLPVVNTVYGADINCPATTATLAVQVRC 438
>gi|147778844|emb|CAN67049.1| hypothetical protein VITISV_001154 [Vitis vinifera]
Length = 317
Score = 239 bits (609), Expect = 6e-60, Method: Compositional matrix adjust.
Identities = 137/298 (45%), Positives = 179/298 (60%), Gaps = 25/298 (8%)
Query: 558 NYGAFYEKTGAGITGPVQLKGSGNGTNIDLSSQQWTYQTGLKGE-ELNFPSGSSTQWDSK 616
NYGAF EK GAG G V+L G NG IDLS WTYQ GL+GE + + S + +
Sbjct: 26 NYGAFLEKDGAGFKGQVKLTGFKNG-EIDLSEYSWTYQVGLRGEFQKIYMIDESEKAEWT 84
Query: 617 STLPKLQP--LVWYKTTFDAPAGSEPVAIDFTGMGKGEAWVNGQSIGRYWPTYVSQNGGC 674
P P WYKT FDAP G PVA+D MGKG+AWVNG IGRYW T V+ GC
Sbjct: 85 DLTPDASPSTFTWYKTFFDAPNGENPVALDLGSMGKGQAWVNGHHIGRYW-TRVAPKDGC 143
Query: 675 TDSCNYRGAYSSNKCLKNCGKPSQSLYHVPRSWLKSSGNTLVLFEEIGGDPTKISFVTKQ 734
C+YRG Y ++K YH+PRSWL++S N LVLFEE GG P +IS V +
Sbjct: 144 -GKCDYRGHYHTSK------------YHIPRSWLQASNNLLVLFEETGGKPFEIS-VKSR 189
Query: 735 LGSSLCSHVTDSHPLPVDMWGS----DSKIQRKPGPVLSLECPNPNQVISSIKFASFGTP 790
++C+ V++SH + W D + K P + L+C + ISSI+FAS+GTP
Sbjct: 190 STQTICAEVSESHYPSLQNWSPSDFIDQNSKNKMTPEMHLQC-DDGHTISSIEFASYGTP 248
Query: 791 LGTCGSFSRGRCSSARSLSVVRQACVGSKSCSIGVSVNTF-GDPCKGVMKSLAVEASC 847
G+C FS+G+C + SL++V +AC G SC I + + F GDPC+G++K+LAVEA C
Sbjct: 249 QGSCQMFSQGQCHAPNSLALVSKACQGKGSCVIRILNSAFGGDPCRGIVKTLAVEAKC 306
>gi|298205259|emb|CBI17318.3| unnamed protein product [Vitis vinifera]
Length = 337
Score = 231 bits (589), Expect = 1e-57, Method: Compositional matrix adjust.
Identities = 138/350 (39%), Positives = 188/350 (53%), Gaps = 55/350 (15%)
Query: 57 MWPDLIQKSKDGGLDVIETYVFWNLHEPVRNQYNFEGRYDLVKFVKLVAEAGLYAHLRIG 116
MW L++ +K+GG+DVIETYVF N HE + Y F G YDL+KFVK+V +AG+Y L IG
Sbjct: 1 MWSGLVKTAKEGGIDVIETYVFQNGHELSPSNYYFGGWYDLLKFVKIVQQAGMYLILHIG 60
Query: 117 PYVCAEWNFGGFPLWLHFIPGIQFRTDNEPFKAEMQRFTAKIVDMMKQEKLYASQGGPII 176
P+V EWNFG F+T+++PFK MQ+F IV++MK++KL+ASQGGPII
Sbjct: 61 PFVATEWNFGTI-----------FQTNSKPFKYHMQKFMTLIVNIMKKDKLFASQGGPII 109
Query: 177 LSQIENEYGNIDSAYGAAGKSYIKWAAGMALSLDTGVPWVMCQQSDAPDPI-INTCNGFY 235
L+Q +NEYG+ Y GK Y+ WAA M LS + GVPW+MCQ S I I G Y
Sbjct: 110 LTQAKNEYGDTKRIYEDGGKPYVMWAANMVLSHNIGVPWIMCQYSYVDIYIYIVKKEGLY 169
Query: 236 CDQF------------TPNSNNKPKMWTENWSGWFLSFGGA--VPYRPVED-LAFAVARF 280
+ + +N+ + + G + G + +R + D + +
Sbjct: 170 SLSYQYALILSTLVTHSIVTNSHQILQAKPKCGLKIGLDGLKHLGHRILTDYMKILLFLL 229
Query: 281 FQRGGTFQNYYMYHGGTNFDRTSGGPFISTSYDYDAPLDEYGLIRQPKWGHLKDLHKAIK 340
NYYMYHGGTNF TSGGPFI+T+Y+Y+AP+DEYGL R PK
Sbjct: 230 LFFFFQKVNYYMYHGGTNFGCTSGGPFITTTYNYNAPIDEYGLARLPK------------ 277
Query: 341 LCEAALVATDPTYPSLGPNLEATVYKTGSGLCSAFLANIGTNSDVTVKFN 390
C P+ E VY G +AF++N+ D + F
Sbjct: 278 -C---------------PSQEVDVYADSLGGYAAFISNVDEKEDKMIVFQ 311
>gi|359496328|ref|XP_003635211.1| PREDICTED: beta-galactosidase 6-like [Vitis vinifera]
gi|296080974|emb|CBI18606.3| unnamed protein product [Vitis vinifera]
Length = 198
Score = 225 bits (574), Expect = 7e-56, Method: Compositional matrix adjust.
Identities = 109/201 (54%), Positives = 145/201 (72%), Gaps = 4/201 (1%)
Query: 648 MGKGEAWVNGQSIGRYWPTYVSQNGGCTDSCNYRGAYSSNKCLKNCGKPSQSLYHVPRSW 707
MGKG+AWVNGQSIGRYWP Y++ + GCT +C+YRGAY ++KCL+NCG+P+Q+LYH+PR+W
Sbjct: 1 MGKGQAWVNGQSIGRYWPAYLAPSTGCTTNCDYRGAYDASKCLRNCGQPAQTLYHIPRTW 60
Query: 708 LKSSGNTLVLFEEIGGDPTKISFVTKQLGSSLCSHVTDSHPLPVDMWGSDSKIQRKPGPV 767
+ S N LVL EE+GGDP+KIS +T+ G +C+HV+++ P P D W + + + V
Sbjct: 61 VHSGKNLLVLHEELGGDPSKISLLTR-TGQEVCAHVSEADPPPADSWQPNLEFMSQSSQV 119
Query: 768 LSLECPNPNQVISSIKFASFGTPLGTCGSFSRGRCSSARSLSVVRQACVGSKSCSIGVSV 827
L C IS I FASFGTP G CG+F+ G C A LSVV+QAC+G + C+I VS
Sbjct: 120 -RLTCEQ-GWHISMINFASFGTPRGHCGTFNPGNC-HANVLSVVQQACIGQEGCAIPVST 176
Query: 828 NTFGDPCKGVMKSLAVEASCT 848
GDPC GV+KSLA+EA C+
Sbjct: 177 ARLGDPCPGVLKSLAIEALCS 197
>gi|293334807|ref|NP_001170541.1| uncharacterized protein LOC100384558 [Zea mays]
gi|238005922|gb|ACR33996.1| unknown [Zea mays]
Length = 345
Score = 224 bits (571), Expect = 1e-55, Method: Compositional matrix adjust.
Identities = 130/363 (35%), Positives = 198/363 (54%), Gaps = 33/363 (9%)
Query: 492 PLLEDGSKTVLHVQSLGHALHAFINGKLVGSGYGSSSNAKVTVDFPIALAPGKNTFDLLS 551
P+ D KTVL V S GHA AF+N K VG G+G+ N T++ P+ L G N +L+
Sbjct: 2 PIRRD-IKTVLEVNSHGHASVAFVNTKFVGCGHGTKMNKAFTLEKPMDLKKGVNHVAVLA 60
Query: 552 LTVGLQNYGAFYEKTGAGITGPVQLKGSGNGTNIDLSSQQWTYQTGLKGEELNFPSGSST 611
T+G+ + GA+ E AG+ VQ+KG GT +DL++ W + GL GE+ +
Sbjct: 61 STMGMMDSGAYLEHRLAGVD-RVQIKGLNAGT-LDLTNNGWGHIVGLVGEQKQIYTDKGM 118
Query: 612 QWDSKSTLPKLQPLVWYKTTFDAPAGSEPVAIDFTGMGKGEAWVNGQSIGRYWPTYVSQN 671
+ +PL WYK FD P+G +P+ +D + MGKG +VNGQ IGRYW +Y
Sbjct: 119 GSVTWKPAVNDRPLTWYKRHFDMPSGEDPIVLDMSTMGKGLMFVNGQGIGRYWISYKHA- 177
Query: 672 GGCTDSCNYRGAYSSNKCLKNCGKPSQSLYHVPRSWLKSSGNTLVLFEEIGGDPTKISFV 731
G+PSQ LYH+PRS+L+ N LVLFEE G P I +
Sbjct: 178 ---------------------LGRPSQQLYHIPRSFLRQKDNVLVLFEEEFGRPDAIMIL 216
Query: 732 TKQLGSSLCSHVTDSHPLPVDMW-GSDSKIQRKPG---PVLSLECPNPNQVISSIKFASF 787
T + ++C+ +++ +P + W DS+I P +L C +P ++I + FAS+
Sbjct: 217 TVKR-DNICTFISERNPAHIKSWERKDSQITVTAADLKPRATLTC-SPKKLIQQVVFASY 274
Query: 788 GTPLGTCGSFSRGRCSSARSLSVVRQACVGSKSCSIGVSVNTFGDP--CKGVMKSLAVEA 845
G P+G CG+++ G C + R+ +V +AC+G + C++ VS + +G C G +LAV+A
Sbjct: 275 GNPMGICGNYTIGSCHTPRAKELVEKACLGKRICTLPVSADVYGGDVNCPGTTATLAVQA 334
Query: 846 SCT 848
C+
Sbjct: 335 KCS 337
>gi|222616996|gb|EEE53128.1| hypothetical protein OsJ_35926 [Oryza sativa Japonica Group]
Length = 314
Score = 220 bits (561), Expect = 3e-54, Method: Compositional matrix adjust.
Identities = 109/223 (48%), Positives = 146/223 (65%), Gaps = 6/223 (2%)
Query: 629 KTTFDAPAGSEPVAIDFTGMGKGEAWVNGQSIGRYWPTYVSQNGGCTDSCNYRGAYSSNK 688
+T F P G++PVAID MGKG+AWVNG IGRYW + V+ GC+ SC Y GAY+ K
Sbjct: 83 ETMFSTPKGTDPVAIDLGSMGKGQAWVNGHLIGRYW-SLVAPESGCSSSCYYPGAYNERK 141
Query: 689 CLKNCGKPSQSLYHVPRSWLKSSGNTLVLFEEIGGDPTKISFVTKQLGSSLCSHVTDSHP 748
C NCG P+Q+ YH+PR WLK S N LVLFEE GGDP+ IS ++CS +++++
Sbjct: 142 CQSNCGMPTQNWYHIPREWLKESDNLLVLFEETGGDPSLISL-EAHYAKTVCSRISENYY 200
Query: 749 LPVDMWGSDSKIQ---RKPGPVLSLECPNPNQVISSIKFASFGTPLGTCGSFSRGRCSSA 805
P+ W S + P L L+C + VIS I FAS+GTP G C +FS+G C ++
Sbjct: 201 PPLSAWSHLSSGRASVNAATPELRLQCDD-GHVISEITFASYGTPSGGCLNFSKGNCHAS 259
Query: 806 RSLSVVRQACVGSKSCSIGVSVNTFGDPCKGVMKSLAVEASCT 848
+L +V +ACVG+ C+I VS + FGDPC+GV+K LAVEA C+
Sbjct: 260 STLDLVTEACVGNTKCAISVSNDVFGDPCRGVLKDLAVEAKCS 302
>gi|77554857|gb|ABA97653.1| Galactose binding lectin domain containing protein, expressed
[Oryza sativa Japonica Group]
Length = 317
Score = 220 bits (560), Expect = 3e-54, Method: Compositional matrix adjust.
Identities = 109/223 (48%), Positives = 146/223 (65%), Gaps = 6/223 (2%)
Query: 629 KTTFDAPAGSEPVAIDFTGMGKGEAWVNGQSIGRYWPTYVSQNGGCTDSCNYRGAYSSNK 688
+T F P G++PVAID MGKG+AWVNG IGRYW + V+ GC+ SC Y GAY+ K
Sbjct: 83 ETMFSTPKGTDPVAIDLGSMGKGQAWVNGHLIGRYW-SLVAPESGCSSSCYYPGAYNERK 141
Query: 689 CLKNCGKPSQSLYHVPRSWLKSSGNTLVLFEEIGGDPTKISFVTKQLGSSLCSHVTDSHP 748
C NCG P+Q+ YH+PR WLK S N LVLFEE GGDP+ IS ++CS +++++
Sbjct: 142 CQSNCGMPTQNWYHIPREWLKESDNLLVLFEETGGDPSLISL-EAHYAKTVCSRISENYY 200
Query: 749 LPVDMWGSDSKIQ---RKPGPVLSLECPNPNQVISSIKFASFGTPLGTCGSFSRGRCSSA 805
P+ W S + P L L+C + VIS I FAS+GTP G C +FS+G C ++
Sbjct: 201 PPLSAWSHLSSGRASVNAATPELRLQCDD-GHVISEITFASYGTPSGGCLNFSKGNCHAS 259
Query: 806 RSLSVVRQACVGSKSCSIGVSVNTFGDPCKGVMKSLAVEASCT 848
+L +V +ACVG+ C+I VS + FGDPC+GV+K LAVEA C+
Sbjct: 260 STLDLVTEACVGNTKCAISVSNDVFGDPCRGVLKDLAVEAKCS 302
>gi|125536445|gb|EAY82933.1| hypothetical protein OsI_38150 [Oryza sativa Indica Group]
Length = 314
Score = 220 bits (560), Expect = 3e-54, Method: Compositional matrix adjust.
Identities = 109/223 (48%), Positives = 146/223 (65%), Gaps = 6/223 (2%)
Query: 629 KTTFDAPAGSEPVAIDFTGMGKGEAWVNGQSIGRYWPTYVSQNGGCTDSCNYRGAYSSNK 688
+T F P G++PVAID MGKG+AWVNG IGRYW + V+ GC+ SC Y GAY+ K
Sbjct: 83 ETMFSTPKGTDPVAIDLGSMGKGQAWVNGHLIGRYW-SLVAPESGCSSSCYYPGAYNERK 141
Query: 689 CLKNCGKPSQSLYHVPRSWLKSSGNTLVLFEEIGGDPTKISFVTKQLGSSLCSHVTDSHP 748
C NCG P+Q+ YH+PR WLK S N LVLFEE GGDP+ IS ++CS +++++
Sbjct: 142 CQSNCGMPTQNWYHIPREWLKESDNLLVLFEETGGDPSLISL-EAHYAKAVCSRISENYY 200
Query: 749 LPVDMWGSDSKIQ---RKPGPVLSLECPNPNQVISSIKFASFGTPLGTCGSFSRGRCSSA 805
P+ W S + P L L+C + VIS I FAS+GTP G C +FS+G C ++
Sbjct: 201 PPLSAWSHLSSGRASVNAATPELRLQCDD-GHVISEITFASYGTPSGGCLNFSKGNCHAS 259
Query: 806 RSLSVVRQACVGSKSCSIGVSVNTFGDPCKGVMKSLAVEASCT 848
+L +V +ACVG+ C+I VS + FGDPC+GV+K LAVEA C+
Sbjct: 260 STLDLVTEACVGNTKCAISVSNDVFGDPCRGVLKDLAVEAKCS 302
>gi|15228075|ref|NP_178493.1| glycosyl hydrolase family 35 protein [Arabidopsis thaliana]
gi|20198172|gb|AAM15443.1| predicted protein [Arabidopsis thaliana]
gi|330250699|gb|AEC05793.1| glycosyl hydrolase family 35 protein [Arabidopsis thaliana]
Length = 469
Score = 217 bits (552), Expect = 3e-53, Method: Compositional matrix adjust.
Identities = 136/374 (36%), Positives = 176/374 (47%), Gaps = 80/374 (21%)
Query: 292 MYHGGTNFDRTSGGPFISTSYDYDAPLDEYGLIRQPKWGHLKDLHKAIKLCEAALVATDP 351
MYHG TNFDRT+GGPFI+T+YDYDAPLDE+G + QPK+GHLK LH E L +
Sbjct: 23 MYHGHTNFDRTAGGPFITTTYDYDAPLDEFGNLNQPKYGHLKQLHDVFHAMEKTLTYGNI 82
Query: 352 TYPSLGPNLEATVYKTGSGLCSAFLANIGTNSDVTVKFNGNSYLLPAWSVSILPDCKNVV 411
+ G + TVY+T G S F+ N+ + + F G SY +PAW VSILPDCK
Sbjct: 83 STADFGNLVMTTVYQTEEG-SSCFIGNV----NAKINFQGTSYDVPAWYVSILPDCKTES 137
Query: 412 FNTAKINSVTLVPSFSRQSLQVAADSSDAIGSGWSYINEPVGISKDDAFTKPGLLEQINT 471
+NTAK R L+ + L N
Sbjct: 138 YNTAK-----------RMKLRTS-------------------------------LRFKNV 155
Query: 472 TADQSDYLWYSLSTNIKADEPLLEDGSKTVLHVQSLGHALHAFINGKLVGSGYGSSSNAK 531
+ D+SD+LWY + N+K +P G L + S H LH F+NG+ G+ +
Sbjct: 156 SNDESDFLWYMTTVNLKEQDPAW--GKNMSLRINSTAHVLHGFVNGQHTGNYRVENGKFH 213
Query: 532 VTVDFPIALAPGKNTFDLLSLTVGLQNYGAFYEKTGAGITGPVQLKGSGNGTNIDLSSQQ 591
+ PG N LLS+TV L NYGAF+E AGITGPV
Sbjct: 214 YVFEQDAKFNPGVNVITLLSVTVDLPNYGAFFENVPAGITGPV----------------- 256
Query: 592 WTYQTGLKGEELNFPSGSSTQWDSKSTLPKLQPLVWYKTTFDAPAGSEPVAIDFTGMGKG 651
+ G G+E S+ +K T+ F AP GSEPV +D G GKG
Sbjct: 257 --FIIGRNGDETVVKYLSTHNGATKLTI------------FKAPLGSEPVVVDLLGFGKG 302
Query: 652 EAWVNGQSIGRYWP 665
+A +N GRYWP
Sbjct: 303 KASINENYTGRYWP 316
Score = 43.1 bits (100), Expect = 0.68, Method: Compositional matrix adjust.
Identities = 17/30 (56%), Positives = 20/30 (66%)
Query: 205 MALSLDTGVPWVMCQQSDAPDPIINTCNGF 234
M SLD GVPW+MCQQ DAP P+ + F
Sbjct: 1 MTNSLDVGVPWIMCQQDDAPQPMYHGHTNF 30
>gi|166092020|gb|ABY82047.1| beta-galactosidase [Hymenaea courbaril var. stilbocarpa]
Length = 138
Score = 216 bits (551), Expect = 3e-53, Method: Compositional matrix adjust.
Identities = 96/138 (69%), Positives = 113/138 (81%)
Query: 179 QIENEYGNIDSAYGAAGKSYIKWAAGMALSLDTGVPWVMCQQSDAPDPIINTCNGFYCDQ 238
QIENEYG ++ A GK+Y WAA MA+ L+TGVPWVMC+Q DAPDP+I+TCNG+YC+
Sbjct: 1 QIENEYGPVEWEIRAPGKAYTAWAAKMAVGLNTGVPWVMCKQDDAPDPVIDTCNGYYCEN 60
Query: 239 FTPNSNNKPKMWTENWSGWFLSFGGAVPYRPVEDLAFAVARFFQRGGTFQNYYMYHGGTN 298
FTPN N KPKMWTENWSGW+ +GGAVP RPVED+A++V RF Q GG+F NYYMYHGGTN
Sbjct: 61 FTPNKNYKPKMWTENWSGWYTEYGGAVPKRPVEDIAYSVTRFIQNGGSFVNYYMYHGGTN 120
Query: 299 FDRTSGGPFISTSYDYDA 316
F RT G FI+TSYDYDA
Sbjct: 121 FGRTYSGLFIATSYDYDA 138
>gi|297734971|emb|CBI17333.3| unnamed protein product [Vitis vinifera]
Length = 447
Score = 216 bits (551), Expect = 3e-53, Method: Compositional matrix adjust.
Identities = 149/399 (37%), Positives = 204/399 (51%), Gaps = 45/399 (11%)
Query: 217 MCQQSDAPDPIINTCNGFYC-DQFT-PNSNNKPKMWTENWSGWFLSFGGAVPYRPVEDLA 274
MC+Q DAPDP+INTC G C D FT PN NK + TE L + +
Sbjct: 1 MCKQKDAPDPVINTCKGRNCGDTFTGPNRPNKRSVSTEYLETPHLKGQQKILH------- 53
Query: 275 FAVARFFQRGGTFQNYYMYHGGTNFDRTSGGPFISTSYDYDAPLDEYGLIRQPKWGHLKD 334
+ F + GT NYYMY+ TNF RT+ F +T Y +APLDEYGL R+ KWGHL+D
Sbjct: 54 ---SLFISKNGTLANYYMYYSVTNFGRTTSS-FATTCYYDEAPLDEYGLPRETKWGHLRD 109
Query: 335 LHKAIKLCEAALVATDPTYPSLGPNLEATVY-KTGSGLCSAFLANIGTNSDVTVKFNGNS 393
LH A++L + AL+ + LG +LEA +Y K GS +C+ FL N T + T G+
Sbjct: 110 LHAALRLSKKALLWGVTSAQKLGEDLEARIYEKPGSNICATFLLNNITRTPTTTTLRGSK 169
Query: 394 YLLPAWSVSILPDCKNVVFNTAKINSVTLVPSFSR-QSLQVAADSSDAIGSGWSYINEPV 452
Y LP S+S LPDCK VVFNT + S L+ FS SL +DA+ +Y P
Sbjct: 170 YYLPQHSISNLPDCKTVVFNTQTVASNYLIFPFSMFDSLNEPNMKTDALP---TYEECPT 226
Query: 453 GISKDDAFTKPGLLEQINTTADQSDYLWYSLSTNIKADEPLLEDGSKTVLHVQSLGHALH 512
+E + T D +DYLWY+ ++ V V +LGH +H
Sbjct: 227 KTKSP--------VELMTMTKDTTDYLWYTTKKDVLR-----------VPQVSNLGHVMH 267
Query: 513 AFINGK------LVGSGYGSSSNAKVTVDFPIALAPGKNTFDLLSLTVGLQNYGAFYEKT 566
AF+NG+ L G+ +GS+ + PI L G N L TVGL + G++ E
Sbjct: 268 AFLNGEYVMEFYLTGTRHGSNVEKSFVFNKPITLKAGLNQIAPLGATVGLPDSGSYMEHR 327
Query: 567 GAGITGPVQLKGSGNGTNIDLSSQQWTYQTGLKGEELNF 605
AG+ V ++G N IDL W ++ GL G++L+
Sbjct: 328 LAGVHN-VAIQGL-NTRTIDLPKNGWGHKVGLNGDKLHL 364
Score = 62.4 bits (150), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 31/67 (46%), Positives = 44/67 (65%), Gaps = 2/67 (2%)
Query: 696 PSQSLYHVPRSWLKSSGNTLVLFEEIGGDPTKISFVTKQLGSSLCSHVTDSHPLPVDMWG 755
PSQS+YHVPR++LK+S N LVLFEE G +P I +T ++C ++++ HP V W
Sbjct: 369 PSQSVYHVPRAFLKTSDNLLVLFEETGRNPDGIEILTLN-RDTICCYISEHHPTHVRSWK 427
Query: 756 SD-SKIQ 761
+ S IQ
Sbjct: 428 REASDIQ 434
>gi|388518087|gb|AFK47105.1| unknown [Lotus japonicus]
Length = 220
Score = 209 bits (533), Expect = 4e-51, Method: Compositional matrix adjust.
Identities = 110/208 (52%), Positives = 142/208 (68%), Gaps = 11/208 (5%)
Query: 648 MGKGEAWVNGQSIGRYWPTYVSQNGGCTDSCNYRGAYSSNKCLKNCGKPSQSLYHVPRSW 707
MGKG+AWVNG IGRYW T VS GC C+YRGAY+S+KC NCGKP+Q+LYHVPRSW
Sbjct: 1 MGKGQAWVNGHHIGRYW-TRVSPKSGCEQVCDYRGAYNSDKCTTNCGKPTQTLYHVPRSW 59
Query: 708 LKSSGNTLVLFEEIGGDPTKISFVTKQLGSSLCSHVTDSHPLPV------DMWGSDSKIQ 761
LK+S N LV+FEE GG+P +IS V +C+ V++SH P+ D+ G +
Sbjct: 60 LKASDNLLVIFEETGGNPFRIS-VKLHSARIVCAKVSESHYQPLHKLMNADLIGHEVSAN 118
Query: 762 RKPGPVLSLECPNPNQVISSIKFASFGTPLGTCGSFSRGRCSSARSLSVVRQACVGSKSC 821
P L L C + ++ISSI FAS+G P G+C SFSRG C + S+++V +AC G +SC
Sbjct: 119 SMI-PELHLRCQD-GRIISSITFASYGNPEGSCQSFSRGNCHAPSSMAIVSKACQGKRSC 176
Query: 822 SIGVSVNTF-GDPCKGVMKSLAVEASCT 848
SI +S F GDPC+GVMK+L+VEA CT
Sbjct: 177 SIKISDTIFGGDPCQGVMKTLSVEARCT 204
>gi|219117911|ref|XP_002179741.1| beta-galactosidase [Phaeodactylum tricornutum CCAP 1055/1]
gi|217408794|gb|EEC48727.1| beta-galactosidase [Phaeodactylum tricornutum CCAP 1055/1]
Length = 951
Score = 204 bits (518), Expect = 2e-49, Method: Compositional matrix adjust.
Identities = 225/834 (26%), Positives = 344/834 (41%), Gaps = 154/834 (18%)
Query: 26 NVTYDHRAVVIGGKRRVLISGSIHYPRSTPEMWPDLIQKSKDGGLDVIETYVFWNLHEPV 85
+V+YD RA+ I KR +L+SGS+H R+T W + ++ GL++I Y+FW H+
Sbjct: 149 SVSYDERAIRINDKRVLLLSGSMHPVRATRGTWEHALDEAVYNGLNMITVYIFWGAHQSF 208
Query: 86 RNQ-YNF----------EGRYDLVKFVKLVAEAGLYAHLRIGPYVCAEWNFGGFPLWLHF 134
R++ N+ E +++L ++ A GL+ H+RIGPY C E+ +GG P WL
Sbjct: 209 RDEPLNWSLDGSSIGPKESQWELADALRSAANRGLFIHVRIGPYACGEYTYGGIPEWLPL 268
Query: 135 IPG-IQFRTDNEPFKAEMQRFTAKIVDMMKQEKLYASQGGPIILSQIENEYGN-IDSA-- 190
++ R N P+ M+ F A + + L+A QGGPI+++QIENE G+ +D +
Sbjct: 269 QSSTMRMRRLNRPWLDAMEGFVAATITYLSSFNLWAHQGGPILIAQIENELGSGVDGSAA 328
Query: 191 --------------------------YG-----------------AAGKSYIKWAAGMAL 207
YG A + Y W +
Sbjct: 329 ANYVVLERDEFNDDKHEDSHLLQLDRYGHILENASSRGMDSELRNATVQDYADWCGNLVA 388
Query: 208 SLDTGVPWVMCQQSDAPDPI--INTCNGF-YCDQFTPNSN---NKPKMWTENWSGWFLSF 261
L V W MC A + I N NG + +++ + ++P +WTE+ G+ L
Sbjct: 389 RLAPNVIWTMCNGLSAENTISTFNGNNGIDWLEKYGDSGRIQVDQPAIWTEDEGGFQL-- 446
Query: 262 GGAVPYRPVE--------DLAFAVARFFQRGGTFQNYYMYHGGTNFDRTSGGPFISTSYD 313
G P +P + +A ++F RGGT NYYM+ GG N R+S I +Y
Sbjct: 447 WGDQPSKPSDYFWGRTSRAMATDALQWFARGGTHLNYYMWWGGYNRGRSSAAG-IMNAYA 505
Query: 314 YDAPLDEYGLIRQPKWGHLKDLHKAIK-------------LCEAALVATDPTYPSLGPNL 360
DA L G R PK+ H LH I L A++ D +G N
Sbjct: 506 TDAFLCSSGQRRHPKYDHFLALHLVIADIAAILLHAPTSLLKNASVEIMDGDDWIVGDNQ 565
Query: 361 EATVYK---TGSGLCSAFLANIGTNSDVTVKFNGNS------YLLPAWSVSILPDCKNVV 411
+Y+ T FL N N+ + G +++ +S I+ D V
Sbjct: 566 RQFLYQVLDTHDSKQVIFLEN-DANTTEMARLTGAKADDSLVFVMKPYSSQIVIDGI-VA 623
Query: 412 FNTAKINSVTLVPSFSRQSLQVAADSSDAIGSGWSYINEPVGISKDD--AFTKPGLLEQI 469
F+++ I++ + SF R++L + S WS EP+ + D A LEQ
Sbjct: 624 FDSSTISTKAM--SF-RRTLHYEPAVLLHLTS-WS---EPIAGADTDQNAHVSTEPLEQT 676
Query: 470 NTTAD---QSDYLWYSLSTNIKADEPLLEDGSKTVLHV-QSLGHALHAFINGKLVGSGYG 525
N + SDY WY T++K D L S+ L++ AL FI+G +G
Sbjct: 677 NLNSKASISSDYAWY--GTDVKIDVVL----SQVKLYIGTEKATALAVFIDGAFIGEANN 730
Query: 526 SSSNAKVTV-DFPI-ALAPGKNTFDLLSLTVGLQN----YGAFYEKTGAGITGPVQLKGS 579
TV I +LA G + +L ++G N +GA GITG V +
Sbjct: 731 HQHAEGPTVLSIEIESLAAGTHRLAILCESLGYHNLIGRWGAITTAKPKGITGNVLIGSP 790
Query: 580 GNGTNIDL--SSQQWTYQTGLKGEELNFPSGSSTQ--WDSKSTLPKLQPLVWYKTTFDAP 635
NI L Q W GL E G + D+ L PL W F +P
Sbjct: 791 LLSENISLVDGRQMWWSLPGLSVERKAARHGLRRESFEDAAQAEAGLHPL-WSSVLFTSP 849
Query: 636 AGSEPVAIDFTGM--GKGEAWVNGQSIGRYWPTYVSQNGGCTDSCNYRGAYSSNKCLKNC 693
V F + G+G W+NG+ +GRYW +++ D
Sbjct: 850 QFDSTVHSLFLDLTSGRGHLWLNGKDLGRYWN--ITRGNSWNDY---------------- 891
Query: 694 GKPSQSLYHVPRSWLKSSG--NTLVLFEEIGGDPTKISFVTKQLGSSLCSHVTD 745
SQ Y +P +L G N L+LF+ +GGD + + + S S +D
Sbjct: 892 ---SQRYYFLPADFLHLDGQLNELILFDMLGGDHSAARLLLSSIEESETSKFSD 942
>gi|255550369|ref|XP_002516235.1| beta-galactosidase, putative [Ricinus communis]
gi|223544721|gb|EEF46237.1| beta-galactosidase, putative [Ricinus communis]
Length = 451
Score = 203 bits (516), Expect = 4e-49, Method: Compositional matrix adjust.
Identities = 134/421 (31%), Positives = 194/421 (46%), Gaps = 93/421 (22%)
Query: 292 MYHGGTNFDRTSGGPFISTSYDYDAPLDEYGLIRQPKWGHLKDLHKAI--KLCEAALVAT 349
MYHGGTNF R SGGP I TSYDYDAPLDEYG + QPKWGHL+DLH I L ++ +
Sbjct: 38 MYHGGTNFRRMSGGPMIVTSYDYDAPLDEYGNLNQPKWGHLRDLHVRILLHLSQSRGLGF 97
Query: 350 DPTYPSLGPNLEATVY---KTGSGLCSAFLANIGTNSDVTVKFNGNS-YLLPAWSVSILP 405
Y L T Y TG C FL+N TN D + + + +PAW
Sbjct: 98 ATVYA-----LNLTTYINNATGERFC--FLSNTKTNEDANIDLQQDGIFFVPAW------ 144
Query: 406 DCKNVVFNTAKINSVTLVPSFSRQSLQVAADSSDAIGSGWSYINEPVGISKDDAFTKPGL 465
+ + ++++ Q + +D +D + Y + +S D ++
Sbjct: 145 ----IYYYSSRVQQGNF------QQCKATSDETDYLRYITRYF-DFFTVSVKDVHSR--- 190
Query: 466 LEQINTTADQSDYLWYSLSTNIKADEPLLEDGSKTVLHVQSLGHALHAFINGKLVGSGYG 525
+Q N T + + L+ + P ++ +Q + H+++
Sbjct: 191 CQQCNNTEE------HDLACDFFGTSPACS--CQSAARLQQVFHSIY------------- 229
Query: 526 SSSNAKVTVDFPIALAPGKNTFDLLSLTVGLQNYGAFYEKTGAGITGPVQLKGSGNGTNI 585
+LT G QNYG F+++ GI G
Sbjct: 230 -------------------------NLTSGKQNYGEFFDEGPEGIAGAA----------- 253
Query: 586 DLSSQQWTYQTGLKGEELNF---PSGSSTQWDSKSTLPKLQPLVWYKTTFDAPAGSEPVA 642
DLSS QW Y+ GL GE SG + + + LP + + WYKTTF P+G++P+
Sbjct: 254 DLSSNQWAYKIGLGGEAKRLYDPNSGHRDVFRTSAILPVGRAMTWYKTTFHVPSGTDPLV 313
Query: 643 IDFTGMGKGEAWVNGQSIGRYWPTYVSQNGGCTDSCNYRGAYSSNKCLKNCGKPSQSLYH 702
++ GMGKG AWVNG S+GR+WP + G + SC+YRG Y +KCL NCG P+Q H
Sbjct: 314 LNLQGMGKGHAWVNGHSLGRFWPMQSADPTGYSGSCDYRGKYDKDKCLTNCGNPTQRWKH 373
Query: 703 V 703
+
Sbjct: 374 I 374
Score = 62.8 bits (151), Expect = 9e-07, Method: Compositional matrix adjust.
Identities = 31/58 (53%), Positives = 41/58 (70%), Gaps = 1/58 (1%)
Query: 775 PN-QVISSIKFASFGTPLGTCGSFSRGRCSSARSLSVVRQACVGSKSCSIGVSVNTFG 831
PN ++IS I+FASFG P GTCGS +G +A + V +ACVG +SCS+GVS +T G
Sbjct: 379 PNGRIISVIQFASFGNPEGTCGSLQKGDFEAAYTAFAVEKACVGKESCSLGVSESTLG 436
Score = 43.1 bits (100), Expect = 0.70, Method: Compositional matrix adjust.
Identities = 17/27 (62%), Positives = 22/27 (81%)
Query: 161 MMKQEKLYASQGGPIILSQIENEYGNI 187
M K+ KL+AS GGPI+ +QIEN+YGN
Sbjct: 1 MAKEAKLFASSGGPIVFAQIENDYGNF 27
>gi|449532986|ref|XP_004173458.1| PREDICTED: beta-galactosidase-like, partial [Cucumis sativus]
Length = 213
Score = 199 bits (505), Expect = 6e-48, Method: Compositional matrix adjust.
Identities = 105/217 (48%), Positives = 142/217 (65%), Gaps = 8/217 (3%)
Query: 522 SGYGSSSNAKVTVDFPIALAPGKNTFDLLSLTVGLQNYGAFYEKTGAGITGPVQLKGSGN 581
S YGS + ++T + L G N +LS+TVGL N G ++ AG+ GPV LKG
Sbjct: 1 SVYGSLEDPRITFSKYVNLKQGVNKLSMLSVTVGLPNVGLHFDTWNAGVLGPVTLKGLNE 60
Query: 582 GTNIDLSSQQWTYQTGLKGEELNFPS---GSSTQWDSKSTLPKLQPLVWYKTTFDAPAGS 638
GT D+S +W+Y+ GLKGE LN S +S QW K + K QPL WYKTTF+ PAG+
Sbjct: 61 GTR-DMSKYKWSYKVGLKGEILNLYSVKGSNSVQW-MKGSFQK-QPLTWYKTTFNTPAGN 117
Query: 639 EPVAIDFTGMGKGEAWVNGQSIGRYWPTYVSQNGGCTDSCNYRGAYSSNKCLKNCGKPSQ 698
EP+A+D + M KG+ WVNG+SIGRY+P Y++ +G C + C+Y G ++ KCL NCG PSQ
Sbjct: 118 EPLALDMSSMSKGQIWVNGRSIGRYFPGYIA-SGKC-NKCSYTGFFTEKKCLWNCGGPSQ 175
Query: 699 SLYHVPRSWLKSSGNTLVLFEEIGGDPTKISFVTKQL 735
YH+PR WL +GN L++ EEIGG+P IS V + +
Sbjct: 176 KWYHIPRDWLSPNGNLLIILEEIGGNPQGISLVKRTV 212
>gi|62321383|dbj|BAD94714.1| beta-galactosidase [Arabidopsis thaliana]
Length = 199
Score = 196 bits (499), Expect = 3e-47, Method: Compositional matrix adjust.
Identities = 99/199 (49%), Positives = 129/199 (64%), Gaps = 7/199 (3%)
Query: 538 IALAPGKNTFDLLSLTVGLQNYGAFYEKTGAGITGPVQLKGSGNGTNIDLSSQQWTYQTG 597
I L G N LLS+ VGL N G +E+ G GPV LKG +GT D+S +W+Y+ G
Sbjct: 4 IKLHAGVNKIALLSVAVGLPNVGTHFEQWNKGALGPVTLKGVNSGT-WDMSKWKWSYKIG 62
Query: 598 LKGEELNFPSG---SSTQWDSKSTLPKLQPLVWYKTTFDAPAGSEPVAIDFTGMGKGEAW 654
+KGE L+ + S +W S + K QPL WYK+TF PAG+EP+A+D MGKG+ W
Sbjct: 63 VKGEALSLHTNTESSGVRWTQGSFVAKKQPLTWYKSTFATPAGNEPLALDMNTMGKGQVW 122
Query: 655 VNGQSIGRYWPTYVSQNGGCTDSCNYRGAYSSNKCLKNCGKPSQSLYHVPRSWLKSSGNT 714
+NG++IGR+WP Y +Q G CNY G + + KCL NCG+ SQ YHVPRSWLKS N
Sbjct: 123 INGRNIGRHWPAYKAQ--GSCGRCNYAGTFDAKKCLSNCGEASQRWYHVPRSWLKSQ-NL 179
Query: 715 LVLFEEIGGDPTKISFVTK 733
+V+FEE+GGDP IS V +
Sbjct: 180 IVVFEELGGDPNGISLVKR 198
>gi|383128340|gb|AFG44826.1| Pinus taeda anonymous locus 2_7725_01 genomic sequence
Length = 157
Score = 196 bits (498), Expect = 4e-47, Method: Compositional matrix adjust.
Identities = 93/159 (58%), Positives = 122/159 (76%), Gaps = 6/159 (3%)
Query: 655 VNGQSIGRYWPTYVSQNGGCTDSCNYRGAYSSNKCLKNCGKPSQSLYHVPRSWLKSSGNT 714
VNG+SIGRYWP+Y++ GGCTDSC+YRGAYSS+KCL NCGKPSQ LYHVPRSW++ +GN
Sbjct: 1 VNGKSIGRYWPSYIASQGGCTDSCDYRGAYSSSKCLTNCGKPSQKLYHVPRSWIQPTGNV 60
Query: 715 LVLFEEIGGDPTKISFVTKQLGSSLCSHVTDSHPLPVDMWGSDS----KIQRKPGPVLSL 770
LVLFEE+GGDPT+ISF+ + +G ++C+ V+++H PV W S + K+ KP L L
Sbjct: 61 LVLFEELGGDPTQISFMARSVG-TVCARVSETHLPPVGSWKSSATSGLKVN-KPKGELQL 118
Query: 771 ECPNPNQVISSIKFASFGTPLGTCGSFSRGRCSSARSLS 809
CP+ +I SIKFASFGTP G CGSF+ G C++ ++S
Sbjct: 119 HCPSSGHLIKSIKFASFGTPTGHCGSFTYGHCNTNSTMS 157
>gi|376338072|gb|AFB33581.1| hypothetical protein 2_7725_01, partial [Pinus mugo]
gi|376338074|gb|AFB33582.1| hypothetical protein 2_7725_01, partial [Pinus mugo]
Length = 157
Score = 196 bits (498), Expect = 4e-47, Method: Compositional matrix adjust.
Identities = 92/159 (57%), Positives = 122/159 (76%), Gaps = 6/159 (3%)
Query: 655 VNGQSIGRYWPTYVSQNGGCTDSCNYRGAYSSNKCLKNCGKPSQSLYHVPRSWLKSSGNT 714
VNG+SIGRYWP+Y++ GCTDSC+YRGAYSS+KCL NCG+PSQ LYHVPRSW++S+GN
Sbjct: 1 VNGKSIGRYWPSYIASQSGCTDSCDYRGAYSSSKCLTNCGQPSQKLYHVPRSWIQSTGNV 60
Query: 715 LVLFEEIGGDPTKISFVTKQLGSSLCSHVTDSHPLPVDMWGSDS----KIQRKPGPVLSL 770
LVLFEE+GGDPT+ISF+ + +G ++C+ V+++H PV W S + K+ KP L L
Sbjct: 61 LVLFEELGGDPTQISFMARSVG-TVCARVSETHLPPVGSWKSSATSVLKVN-KPKAELQL 118
Query: 771 ECPNPNQVISSIKFASFGTPLGTCGSFSRGRCSSARSLS 809
CP+ +I SIKFASFGTP G CGSF+ G C++ ++S
Sbjct: 119 HCPSSGHLIKSIKFASFGTPTGRCGSFTYGHCNTNSTMS 157
>gi|376338078|gb|AFB33584.1| hypothetical protein 2_7725_01, partial [Pinus mugo]
Length = 157
Score = 196 bits (498), Expect = 5e-47, Method: Compositional matrix adjust.
Identities = 92/159 (57%), Positives = 122/159 (76%), Gaps = 6/159 (3%)
Query: 655 VNGQSIGRYWPTYVSQNGGCTDSCNYRGAYSSNKCLKNCGKPSQSLYHVPRSWLKSSGNT 714
VNG+SIGRYWP+Y++ GCTDSC+YRGAYSS+KCL NCG+PSQ LYHVPRSW++S+GN
Sbjct: 1 VNGKSIGRYWPSYIASQSGCTDSCDYRGAYSSSKCLTNCGQPSQKLYHVPRSWIQSTGNV 60
Query: 715 LVLFEEIGGDPTKISFVTKQLGSSLCSHVTDSHPLPVDMWGSDS----KIQRKPGPVLSL 770
LVLFEE+GGDPT+ISF+ + +G ++C+ V+++H PV W S + K+ KP L L
Sbjct: 61 LVLFEELGGDPTQISFMARSVG-TVCARVSETHLPPVGSWKSSATSGLKVN-KPKAELQL 118
Query: 771 ECPNPNQVISSIKFASFGTPLGTCGSFSRGRCSSARSLS 809
CP+ +I SIKFASFGTP G CGSF+ G C++ ++S
Sbjct: 119 HCPSSGHLIKSIKFASFGTPTGRCGSFTYGHCNTNSTMS 157
>gi|361068121|gb|AEW08372.1| Pinus taeda anonymous locus 2_7725_01 genomic sequence
gi|383128330|gb|AFG44821.1| Pinus taeda anonymous locus 2_7725_01 genomic sequence
gi|383128334|gb|AFG44823.1| Pinus taeda anonymous locus 2_7725_01 genomic sequence
Length = 157
Score = 196 bits (497), Expect = 6e-47, Method: Compositional matrix adjust.
Identities = 92/159 (57%), Positives = 122/159 (76%), Gaps = 6/159 (3%)
Query: 655 VNGQSIGRYWPTYVSQNGGCTDSCNYRGAYSSNKCLKNCGKPSQSLYHVPRSWLKSSGNT 714
VNG+SIGRYWP+Y++ GGCTDSC+YRGAYSS+KCL NCG+PSQ LYHVPRSW++ +GN
Sbjct: 1 VNGKSIGRYWPSYIASQGGCTDSCDYRGAYSSSKCLTNCGQPSQKLYHVPRSWIQPTGNV 60
Query: 715 LVLFEEIGGDPTKISFVTKQLGSSLCSHVTDSHPLPVDMWGSDS----KIQRKPGPVLSL 770
LVLFEE+GGDPT+ISF+ + +G ++C+ V+++H PV W S + K+ KP L L
Sbjct: 61 LVLFEELGGDPTQISFMARSVG-TVCARVSETHLPPVGSWKSSATSGLKVN-KPKAELQL 118
Query: 771 ECPNPNQVISSIKFASFGTPLGTCGSFSRGRCSSARSLS 809
CP+ +I SIKFASFGTP G CGSF+ G C++ ++S
Sbjct: 119 HCPSSGHLIKSIKFASFGTPTGRCGSFTYGHCNTNSTMS 157
>gi|376338076|gb|AFB33583.1| hypothetical protein 2_7725_01, partial [Pinus mugo]
Length = 157
Score = 196 bits (497), Expect = 6e-47, Method: Compositional matrix adjust.
Identities = 92/159 (57%), Positives = 120/159 (75%), Gaps = 6/159 (3%)
Query: 655 VNGQSIGRYWPTYVSQNGGCTDSCNYRGAYSSNKCLKNCGKPSQSLYHVPRSWLKSSGNT 714
VNG+SIGRYWP+Y++ GCTDSC+YRGAYSS+KCL NCG+PSQ LYHVPRSW++S+GN
Sbjct: 1 VNGKSIGRYWPSYIASQSGCTDSCDYRGAYSSSKCLTNCGQPSQKLYHVPRSWIQSTGNV 60
Query: 715 LVLFEEIGGDPTKISFVTKQLGSSLCSHVTDSHPLPVDMWGSDS----KIQRKPGPVLSL 770
LVLFEE+GGDPT+ISF+ + +G ++C+ V+++H PV W S + K+ KP L L
Sbjct: 61 LVLFEELGGDPTQISFMARSVG-TVCARVSETHLPPVGSWKSSATSGLKVN-KPKAELQL 118
Query: 771 ECPNPNQVISSIKFASFGTPLGTCGSFSRGRCSSARSLS 809
CP+ +I SIKFASFGTP G CGSF+ G C ++S
Sbjct: 119 HCPSSGHLIKSIKFASFGTPTGRCGSFTYGHCXXXSTMS 157
>gi|383128326|gb|AFG44819.1| Pinus taeda anonymous locus 2_7725_01 genomic sequence
gi|383128328|gb|AFG44820.1| Pinus taeda anonymous locus 2_7725_01 genomic sequence
gi|383128336|gb|AFG44824.1| Pinus taeda anonymous locus 2_7725_01 genomic sequence
gi|383128338|gb|AFG44825.1| Pinus taeda anonymous locus 2_7725_01 genomic sequence
Length = 157
Score = 196 bits (497), Expect = 6e-47, Method: Compositional matrix adjust.
Identities = 92/159 (57%), Positives = 122/159 (76%), Gaps = 6/159 (3%)
Query: 655 VNGQSIGRYWPTYVSQNGGCTDSCNYRGAYSSNKCLKNCGKPSQSLYHVPRSWLKSSGNT 714
VNG+SIGRYWP+Y++ GGCTDSC+YRGAYSS+KCL NCG+PSQ LYHVPRSW++ +GN
Sbjct: 1 VNGKSIGRYWPSYIASQGGCTDSCDYRGAYSSSKCLTNCGQPSQKLYHVPRSWIQPTGNV 60
Query: 715 LVLFEEIGGDPTKISFVTKQLGSSLCSHVTDSHPLPVDMWGSDS----KIQRKPGPVLSL 770
LVLFEE+GGDPT+ISF+ + +G ++C+ V+++H PV W S + K+ KP L L
Sbjct: 61 LVLFEELGGDPTQISFMARSVG-TVCARVSETHLPPVGSWKSSATSGLKVN-KPKAELQL 118
Query: 771 ECPNPNQVISSIKFASFGTPLGTCGSFSRGRCSSARSLS 809
CP+ +I SIKFASFGTP G CGSF+ G C++ ++S
Sbjct: 119 HCPSSGHLIKSIKFASFGTPTGHCGSFTYGHCNTNSTMS 157
>gi|383128332|gb|AFG44822.1| Pinus taeda anonymous locus 2_7725_01 genomic sequence
Length = 157
Score = 194 bits (493), Expect = 2e-46, Method: Compositional matrix adjust.
Identities = 91/159 (57%), Positives = 121/159 (76%), Gaps = 6/159 (3%)
Query: 655 VNGQSIGRYWPTYVSQNGGCTDSCNYRGAYSSNKCLKNCGKPSQSLYHVPRSWLKSSGNT 714
VNG+SIGRYWP+Y++ GGCTDSC+YRGAYSS+KCL NCG+PSQ LYHVPRSW++ +GN
Sbjct: 1 VNGKSIGRYWPSYIASQGGCTDSCDYRGAYSSSKCLTNCGQPSQKLYHVPRSWIQPTGNV 60
Query: 715 LVLFEEIGGDPTKISFVTKQLGSSLCSHVTDSHPLPVDMWGSDS----KIQRKPGPVLSL 770
LVLFEE+GGDPT+ISF+ + +G ++C+ V+++H PV W S + K+ KP L L
Sbjct: 61 LVLFEELGGDPTQISFMARSVG-TVCARVSETHLPPVGSWKSSATSGLKVN-KPKAELQL 118
Query: 771 ECPNPNQVISSIKFASFGTPLGTCGSFSRGRCSSARSLS 809
CP+ +I SIKF SFGTP G CGSF+ G C++ ++S
Sbjct: 119 HCPSSGHLIKSIKFVSFGTPTGRCGSFTYGHCNTNSTMS 157
>gi|2289790|dbj|BAA21669.1| beta-galactosidase [Bacillus circulans]
Length = 586
Score = 193 bits (491), Expect = 3e-46, Method: Compositional matrix adjust.
Identities = 124/345 (35%), Positives = 175/345 (50%), Gaps = 26/345 (7%)
Query: 25 ANVTYDHRAVVIGGKRRVLISGSIHYPRSTPEMWPDLIQKSKDGGLDVIETYVFWNLHEP 84
+ +TYD + ++ GK L+SG++HY R+ PE W D + K K G + +ETYV WNLHEP
Sbjct: 2 SQLTYDD-SFLLDGKEIRLLSGAMHYFRTVPEYWEDRLLKLKACGFNTVETYVAWNLHEP 60
Query: 85 VRNQYNFEGRYDLVKFVKLVAEAGLYAHLRIGPYVCAEWNFGGFPLWLHFIPGIQFRTDN 144
Q+ FEG D+V+F+K + GL+ +R GP++CAEW FGGFP WL +P I+ R N
Sbjct: 61 EEGQFVFEGIADIVRFIKTAEKVGLHVIVRPGPFICAEWEFGGFPYWLLTVPNIKLRCFN 120
Query: 145 EPFKAEMQRFTAKIVDMMKQEKLYASQGGPIILSQIENEYGNI--DSAYGAAGKSYIKWA 202
+P+ ++ + + + ++ L +S GGPII QIENEYG+ D Y + IK
Sbjct: 121 QPYLEKVDAYFDVLFERLR--PLLSSNGGPIIALQIENEYGSFGNDQKYLQYLRDGIKKR 178
Query: 203 AGMALSLDTGVPWVMCQQSDAPDPIINTCN-------GF-YCDQFTPNSNNKPKMWTENW 254
G L + P + I T N F Q+ PN+ P M E W
Sbjct: 179 VGNELLFTSDGPEPSMLSGGMIEGIFETVNFGSRAESAFAQLKQYQPNA---PLMCMEFW 235
Query: 255 SGWFLSFGGAVPYRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNF--------DRTSGGP 306
GWF +G R E + + ++ G+ N+YM HGGTNF + T P
Sbjct: 236 HGWFDHWGEEHHTRSAESVVETLEEILKQNGSV-NFYMAHGGTNFGFYNGANHNETDYQP 294
Query: 307 FISTSYDYDAPLDEYGLIRQPKWGHLKDLHKAIKLCEAALVATDP 351
I TSYDYD L E G + + + K K + L E L A P
Sbjct: 295 TI-TSYDYDGLLTESGDVTEKFYAVRKVFEKYVDLPELNLPAPIP 338
>gi|284030079|ref|YP_003380010.1| beta-galactosidase [Kribbella flavida DSM 17836]
gi|283809372|gb|ADB31211.1| Beta-galactosidase [Kribbella flavida DSM 17836]
Length = 582
Score = 192 bits (487), Expect = 9e-46, Method: Compositional matrix adjust.
Identities = 118/306 (38%), Positives = 162/306 (52%), Gaps = 26/306 (8%)
Query: 35 VIGGKRRVLISGSIHYPRSTPEMWPDLIQKSKDGGLDVIETYVFWNLHEPVRNQYNFEGR 94
++ G+ ++SG++HY R P++W D I K++ GL+ IETYV WN H P R ++ +G
Sbjct: 12 LLDGEPFRILSGALHYFRVHPDLWADRIDKARRMGLNTIETYVPWNAHSPRRGVFDTDGM 71
Query: 95 YDLVKFVKLVAEAGLYAHLRIGPYVCAEWNFGGFPLWLHFIPGIQFRTDNEPFKAEMQRF 154
DL +F++ VA AGLYA +R GPY+CAEW+ GG P WL PG+ R F A ++++
Sbjct: 72 LDLGRFLEQVAAAGLYAIVRPGPYICAEWDNGGLPAWLFQEPGVGVRRYEPRFLAAVEQY 131
Query: 155 TAKIVDMMKQEKLYASQGGPIILSQIENEYGNIDSAYGAAGKSYIKWAAGMALSLDTGVP 214
+++D+++ L QGGP++L Q+ENEYG A+G Y++ AGM VP
Sbjct: 132 LEQVLDLVR--PLQVDQGGPVLLLQVENEYG----AFG-NDPEYLEAVAGMIRKAGITVP 184
Query: 215 WVMCQQSDAPDPIINTCNG-FYCDQFTPNSNNK-----------PKMWTENWSGWFLSFG 262
V Q +G F S + P M E W GWF +G
Sbjct: 185 LVTVDQPTGEMLAAGGLDGVLRTGSFGSRSAERLATLREHQPTGPLMCMEFWDGWFDHWG 244
Query: 263 GAVPYRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNFDRTSG----GPF--ISTSYDYDA 316
G VED A + G + N YM+HGGTNF TSG G F TSYDYDA
Sbjct: 245 GPHHTTSVEDAARELDALLAAGASV-NIYMFHGGTNFGLTSGADDKGVFRPTVTSYDYDA 303
Query: 317 PLDEYG 322
PLDE G
Sbjct: 304 PLDEAG 309
Score = 47.0 bits (110), Expect = 0.044, Method: Compositional matrix adjust.
Identities = 74/285 (25%), Positives = 102/285 (35%), Gaps = 69/285 (24%)
Query: 448 INEPVGISKDDAFTKPGLLEQINTTADQSDYLWYSLSTNIKADEPLLEDGSKTVLHVQSL 507
+ EPV + D A P T D T + D P+L L + +
Sbjct: 347 LGEPVRLLTDPAVWGPASRHATMPTLDDLGARLALFRTELDGDGPVL-------LSIGEV 399
Query: 508 GHALHAFINGKLVGSGYGSSSNAKVTVDFPIALAPGKNTFDLLSLTVGLQNYGAFYEKTG 567
F++G VG + D + L G+ +++ G NYG +
Sbjct: 400 RDRALVFLDGDPVGV------LERDHRDRALMLPRGRGRLEIVVEDQGRVNYGPRIGEV- 452
Query: 568 AGITGPVQLKGSGNGTNIDLSSQQWTYQTGLKGEELNFPSGSSTQWDSKS--TLPKLQPL 625
G+ G VQL G ++ WT WDS + T P + P
Sbjct: 453 KGLLGDVQL-----GPDLLTDWSAWTIDL----------DAVPALWDSAAPATGPGVGPT 497
Query: 626 VWYKTTFDAPAGSEPVAIDFTGM---GKGEAWVNGQSIGRYWPTYVSQNGGCTDSCNYRG 682
W +T+F A PV F G GKG AWVNG +GRYW +RG
Sbjct: 498 AW-RTSF---AAEHPVD-HFLGTDAWGKGIAWVNGFCLGRYW---------------HRG 537
Query: 683 AYSSNKCLKNCGKPSQSLYHVPRSWLKSSGNTLVLFE-EIGGDPT 726
P +LY VP ++S N LV+ E E DPT
Sbjct: 538 -------------PQHTLY-VPAPLIRSGDNDLVVLELETMADPT 568
>gi|255550379|ref|XP_002516240.1| beta-galactosidase, putative [Ricinus communis]
gi|223544726|gb|EEF46242.1| beta-galactosidase, putative [Ricinus communis]
Length = 216
Score = 191 bits (486), Expect = 1e-45, Method: Compositional matrix adjust.
Identities = 100/206 (48%), Positives = 119/206 (57%), Gaps = 46/206 (22%)
Query: 194 AGKSYIKWAAGMALSLDTGVPWVMCQQSDAPDPIINTCNGFYCDQFTPNSNNKPKMWTEN 253
AGK+Y+ W + MA SLD GVPW++CQQ DAP P+INTC G+YCDQFTPN+ N PK WTEN
Sbjct: 56 AGKAYLDWCSDMAESLDIGVPWIICQQRDAPQPMINTCYGWYCDQFTPNTANSPKKWTEN 115
Query: 254 WSGWFLSFGGAVPYRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNFDRTSGGPF-ISTSY 312
W+GWF S+G P+R E +AFAVARFFQ FQN YMYHGGTNF RT+GGP+ +TS+
Sbjct: 116 WTGWFKSWGDKDPHRTAEGVAFAVARFFQ----FQNCYMYHGGTNFGRTAGGPYSTTTSH 171
Query: 313 DYDAPLDEYGLIRQPKWGHLKDLHKAIKLCEAALVATDPTYPSLGPNLEATVYKTGSGLC 372
DYDAPLDE H I E
Sbjct: 172 DYDAPLDE---------------HVTIHATEKE--------------------------S 190
Query: 373 SAFLANIGTNSDVTVKFNGNSYLLPA 398
S F NI SD ++F G Y +PA
Sbjct: 191 SCFFGNINETSDAVIEFRGAKYKIPA 216
>gi|229084352|ref|ZP_04216632.1| Beta-galactosidase [Bacillus cereus Rock3-44]
gi|228698892|gb|EEL51597.1| Beta-galactosidase [Bacillus cereus Rock3-44]
Length = 867
Score = 191 bits (484), Expect = 2e-45, Method: Compositional matrix adjust.
Identities = 115/326 (35%), Positives = 171/326 (52%), Gaps = 16/326 (4%)
Query: 27 VTYDHRAVVIGGKRRVLISGSIHYPRSTPEMWPDLIQKSKDGGLDVIETYVFWNLHEPVR 86
+TYD ++ I KR ++S +IHY R W D+++K+K GG + IETY+ WN HE
Sbjct: 2 ITYDKKSWKIHNKRIFILSAAIHYFRLPKAEWDDVLEKAKAGGCNTIETYIPWNFHEMKE 61
Query: 87 NQYNFEGRYDLVKFVKLVAEAGLYAHLRIGPYVCAEWNFGGFPLWLHFIPGIQFRTDNEP 146
+++F G DL F++L A GLY R GPY+CAEW+FGGFP WL IQ+R+
Sbjct: 62 GEWDFSGDKDLAHFLQLCANKGLYVIARPGPYICAEWDFGGFPWWLSTKKDIQYRSAQPS 121
Query: 147 FKAEMQRFTAKIVDMMKQEKLYASQGGPIILSQIENEYGNIDSAYGAAGKSYIKWAAGMA 206
F + ++ +++ ++ + +L ++ G +I+ QIENE+ AYG K Y+++
Sbjct: 122 FLHYVDQYFDQVISIIDEYQL--TKNGSVIMVQIENEF----QAYGKPDKKYMEYLRDGM 175
Query: 207 LSLDTGVPWVMCQQS-DAPDPIINTCNG--FYCDQFTPNSNNKPKMWTENWSGWFLSFGG 263
++ VP+V C + D N +G + ++PK E W GWF +GG
Sbjct: 176 IARGIEVPFVTCYGAVDGAVEFRNFWSGANRAAEILDERFADQPKGVMEFWIGWFEHWGG 235
Query: 264 -AVPYRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNFDRTSGGP-----FISTSYDYDAP 317
+ E L + + G T NYYMY GGTNFD G F +T+YDYD
Sbjct: 236 NKANQKTPEQLERECYQLLRNGFTTINYYMYFGGTNFDHWGGRTVSEQVFCTTTYDYDVA 295
Query: 318 LDEYGLIRQPKWGHLKDLHKAIKLCE 343
+DEY L K+ LK H +K E
Sbjct: 296 IDEY-LQPTRKYEVLKRYHLFVKWLE 320
>gi|320536152|ref|ZP_08036203.1| glycosyl hydrolase family 35 [Treponema phagedenis F0421]
gi|320147005|gb|EFW38570.1| glycosyl hydrolase family 35 [Treponema phagedenis F0421]
Length = 857
Score = 189 bits (481), Expect = 4e-45, Method: Compositional matrix adjust.
Identities = 113/334 (33%), Positives = 173/334 (51%), Gaps = 21/334 (6%)
Query: 27 VTYDHRAVVIGGKRRVLISGSIHYPRSTPEMWPDLIQKSKDGGLDVIETYVFWNLHEPVR 86
+ +D + +I GKR+ +IS ++HY R W +I+K++ GG + IETY+ WN HE
Sbjct: 2 IQFDSNSWIIDGKRKFIISAAVHYFRLPRAEWAAVIRKARLGGCNAIETYIAWNYHETAE 61
Query: 87 NQYNFEGRYDLVKFVKLVAEAGLYAHLRIGPYVCAEWNFGGFPLWLHFIPGIQFRTDNEP 146
Q++F G DL F + + G+Y +R GPY+CAEW+FGG P +L+ GI++R N
Sbjct: 62 EQWDFSGDKDLAAFFAICHDEGMYVIVRPGPYICAEWDFGGLPYYLNNTDGIEYRCSNAA 121
Query: 147 FKAEMQRFTAKIVDMMKQEKLYASQGGPIILSQIENEYGNIDSAYGAAGKSYIKWAAGMA 206
++ ++R+ +I+ ++++ +L GG II+ QIENEY A+G ++I++ +
Sbjct: 122 YEQAVRRYFERIMPIIRRYQL--GSGGSIIMVQIENEY----HAFGKKDLAHIRFLEELT 175
Query: 207 LSLDTGVPWVMCQQSDAPDPIINTCNGFYCDQ-----FTPNSNNKPKMWTENWSGWFLSF 261
VP V C A + N + + + +P E W GW +
Sbjct: 176 RGFGITVPLVSCY--GAGRNTVEMRNFWSGAERAAAVLRERQSGQPLGIMEFWIGWVEHW 233
Query: 262 GGA-VPYRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNF----DRTSGGP--FISTSYDY 314
GG ++P E + + G F NYYMY GG+NF RT G F++ SYDY
Sbjct: 234 GGEPQKHKPAEAVLSHCFEALKSGFVFFNYYMYFGGSNFGSWGGRTIGAHKIFMTQSYDY 293
Query: 315 DAPLDEYGLIRQPKWGHLKDLHKAIKLCEAALVA 348
DAPLDE+G K+ L LH I E L A
Sbjct: 294 DAPLDEFGF-ETEKYRLLAVLHTFIAWLENDLTA 326
>gi|340370414|ref|XP_003383741.1| PREDICTED: beta-galactosidase-1-like protein 2-like [Amphimedon
queenslandica]
Length = 689
Score = 189 bits (481), Expect = 5e-45, Method: Compositional matrix adjust.
Identities = 125/359 (34%), Positives = 192/359 (53%), Gaps = 40/359 (11%)
Query: 27 VTYDHRAVVIGGKRRVLISGSIHYPRSTPEMWPDLIQKSKDGGLDVIETYVFWNLHEPVR 86
++ D + I GK+ ++SGSIHY R P+ W D ++K K GL+ ++TYV WNLHEP+
Sbjct: 71 LSLDEDSFYIRGKKTHILSGSIHYFRVVPDYWTDRLKKLKAMGLNTVDTYVSWNLHEPMP 130
Query: 87 NQYNFEGRYDLVKFVKLVAEAGLYAHLRIGPYVCAEWNFGGFPLWLHFIPGIQFRTDNEP 146
+++F G ++ +F+K+ L +R GPY+C+EW+ GG P WL P ++ R++ +P
Sbjct: 131 GEFDFSGLLNIHEFIKIAHSLELNVIVRPGPYICSEWDNGGLPAWLLHDPNMKIRSNYKP 190
Query: 147 FKAEMQRFTAKIVDMMKQEKLYASQGGPIILSQIENEYGNIDSAYG---AAGKSYIKWAA 203
++ ++RF K+ +++ L +S GGPII Q+ENEY +AYG A G+ ++++ A
Sbjct: 191 YQDAVKRFFTKLFEILT--PLQSSYGGPIIAFQVENEY----AAYGPRNATGRHHMQYLA 244
Query: 204 GMALSLDTGVPWVMCQ-QSD-------APDPIINTCNGFYCD------QFTPNSNNKPKM 249
+ SL ++ Q+D AP+ + T N F D + NKP +
Sbjct: 245 NLMRSLGAVELFITSDGQNDIKASSDMAPNNALLTVN-FQNDPSEALNKLLLVQPNKPPL 303
Query: 250 WTENWSGWFLSFGGAVPYRPV--EDLAFAVARFFQRGGTFQNYYMYHGGTNF-----DRT 302
E W+GWF +G R + L + Q GG+F N YM+HGGTNF
Sbjct: 304 VMEYWTGWFDHWGRRHLERTLSPSQLIVNIGTILQMGGSF-NLYMFHGGTNFGFMNGANI 362
Query: 303 SGGPFIS--TSYDYDAPLDEYGLIRQPKWGHLKDLHKAIKLCEAALVATDPTYPSLGPN 359
GG + TSYDYDAPL E G I + K+ L++L L EA + P + PN
Sbjct: 363 EGGEYRPDVTSYDYDAPLSEAGDITK-KYTLLREL-----LKEAVPHSIPNPLPDIPPN 415
>gi|224152391|ref|XP_002337230.1| predicted protein [Populus trichocarpa]
gi|222838524|gb|EEE76889.1| predicted protein [Populus trichocarpa]
Length = 144
Score = 186 bits (473), Expect = 4e-44, Method: Compositional matrix adjust.
Identities = 83/150 (55%), Positives = 112/150 (74%), Gaps = 7/150 (4%)
Query: 1 MASKEILLLVLCWGFVVLATTSFGANVTYDHRAVVIGGKRRVLISGSIHYPRSTPEMWPD 60
MA I LC+ T F NV+YD R+++I G+R++LIS +IHYPRS P MWP+
Sbjct: 1 MALGLIFFFSLCF------TLCFAGNVSYDSRSLIINGERKLLISAAIHYPRSVPAMWPE 54
Query: 61 LIQKSKDGGLDVIETYVFWNLHEPVR-NQYNFEGRYDLVKFVKLVAEAGLYAHLRIGPYV 119
L++ +K+GG+DVIETYVFWN+H+P ++Y+F+GR+DLVKF+ +V EAG+Y LRIGP+V
Sbjct: 55 LVKTAKEGGVDVIETYVFWNVHQPTSPSEYHFDGRFDLVKFINIVQEAGMYLILRIGPFV 114
Query: 120 CAEWNFGGFPLWLHFIPGIQFRTDNEPFKA 149
AEWNFGG P+WLH++ G FRTDN FK
Sbjct: 115 AAEWNFGGIPVWLHYVNGTVFRTDNYNFKV 144
>gi|227538632|ref|ZP_03968681.1| beta-galactosidase [Sphingobacterium spiritivorum ATCC 33300]
gi|227241551|gb|EEI91566.1| beta-galactosidase [Sphingobacterium spiritivorum ATCC 33300]
Length = 638
Score = 186 bits (472), Expect = 4e-44, Method: Compositional matrix adjust.
Identities = 209/768 (27%), Positives = 308/768 (40%), Gaps = 182/768 (23%)
Query: 5 EILLLVLCWGFVVLATTSFGANVTYDHRA------------VVIGGKRRVLISGSIHYPR 52
+++ LC + VL TT A D +A V GK ++SG +HY R
Sbjct: 2 KLIKKALC--YAVLTTTFMSAIAFQDVQAQKKHTFEIKDGNFVYDGKTTRILSGEMHYAR 59
Query: 53 STPEMWPDLIQKSKDGGLDVIETYVFWNLHEPVRNQYNFEGRYDLVKFVKLVAEAGLYAH 112
+ W +Q K GL+ + TYVFWN HE +NFEG +DL F+K E GL+
Sbjct: 60 IPHQYWKHRLQMVKSMGLNTVATYVFWNFHEESPGNWNFEGDHDLAAFIKTAGEVGLHVI 119
Query: 113 LRIGPYVCAEWNFGGFPLWLHFIPGIQFRTDNEPFKAEMQRFTAKIVDMMKQE--KLYAS 170
LR GPY CAEW+FGG+P WL I G++ R DN A+ +T K +D + +E L +
Sbjct: 120 LRPGPYACAEWDFGGYPWWLQKIDGLEIRRDN----AKFLEYTKKYIDRLAKEVGSLQIT 175
Query: 171 QGGPIILSQIENEYGNIDS-----------AYGAAGKSYIKWAAGMALSLDTGVPWVMCQ 219
GGPII+ Q ENE+G+ S AY A K ++ AG + L T + +
Sbjct: 176 NGGPIIMVQAENEFGSYVSQRKDIPLEEHKAYNAKIKKQLE-EAGFNVPLFTSDGSWLFE 234
Query: 220 QSDAPD--PIINTCNGF-----YCDQFTPNSNNKPKMWTENWSGWFLSFGGAVPYRPVE- 271
P P N N DQ+ N+N P M E + GW + A P+ V+
Sbjct: 235 GGAIPGALPTANGENNISNLKKVVDQY--NNNQGPYMVAEFYPGWLDHW--AEPFAKVDA 290
Query: 272 -DLAFAVARFFQRGGTFQNYYMYHGGTNFDRTSGGPFISTSYDYDAPLDEYGLIRQPKWG 330
+A ++ Q +F NYYM HGGTN +G +
Sbjct: 291 GRIARQTEKYLQNDISF-NYYMVHGGTN----------------------FGFTSGANYN 327
Query: 331 HLKDLHKAIKLCEAALVATDPTYPSLGPNLEATVYKTGSGLCSAFLANIGTNSDVTVKFN 390
+ D+ I T Y + + A A +S TV
Sbjct: 328 NKSDIQPDI-----------------------TSYDYDAPISEAGWATPKYDSIRTVIQK 364
Query: 391 GNSYLLPAWSVSILPDCKNVVFNTAKINSVTLVPSFSRQSLQVAADSSDAIGSGWSYINE 450
Y +PA K N V +PS L A+ D SG + INE
Sbjct: 365 YADYTVPA---------------VPKANPVIEIPSIK---LTAVANVFDYAKSGKTTINE 406
Query: 451 PVGISKDDAFTKPGLLEQINTTADQSDYLWYSLSTNIKADEPLLEDGSKTVLHVQSLGHA 510
P EQ+N + Y+ YS N + L DG + V
Sbjct: 407 -----------TPLNFEQLNQA---NGYVLYSKQFNQPINGKLKIDGLRDFAVV------ 446
Query: 511 LHAFINGKLVGSGYGSSSNAKVTVDFPIALAPGKNTFDLLSLTVGLQNYGAFYEKTGAGI 570
+I+G VG N ++ +D P +T +L +G NYG+ GI
Sbjct: 447 ---YIDGTKVGELNRVFKNYEMDIDIPF-----NSTLQILVENMGRINYGSEMIHNHKGI 498
Query: 571 TGPVQLKGSGNGTNIDLSSQQWTYQTGLKGEELNFPSGSSTQ--WDSKSTLPKLQPL--- 625
PV + +++++ WT Q L +++ +G T ++K+ K+ L
Sbjct: 499 ISPVLI------NDMEITGD-WTMQQ-LPMDKVPDLAGKQTAAIQNTKTNASKIAALTGQ 550
Query: 626 -VWYKTTFDAPAGSEPVAIDFTGMGKGEAWVNGQSIGRYWPTYVSQNGGCTDSCNYRGAY 684
V Y+ TFD + ID GKG ++NG +IGRYW T
Sbjct: 551 PVLYQGTFDLKEIGDTF-IDMEKWGKGIVFINGINIGRYWKT------------------ 591
Query: 685 SSNKCLKNCGKPSQSLYHVPRSWLKSSGNTLVLFEEIGGD-PTKISFV 731
P +LY +P +LK N++V+FE++ + T++S V
Sbjct: 592 ----------GPQHTLY-IPAPYLKKGSNSIVIFEQLNDEIKTEVSTV 628
>gi|326331074|ref|ZP_08197372.1| beta-galactosidase [Nocardioidaceae bacterium Broad-1]
gi|325951115|gb|EGD43157.1| beta-galactosidase [Nocardioidaceae bacterium Broad-1]
Length = 586
Score = 184 bits (467), Expect = 2e-43, Method: Compositional matrix adjust.
Identities = 112/307 (36%), Positives = 163/307 (53%), Gaps = 28/307 (9%)
Query: 35 VIGGKRRVLISGSIHYPRSTPEMWPDLIQKSKDGGLDVIETYVFWNLHEPVRNQYNFEGR 94
++ G+ ++SG++HY R P+ W D I+K++ GL+ IETYV WN H P ++ +G
Sbjct: 12 LLDGEPFRILSGALHYFRVHPDQWADRIEKARLMGLNTIETYVPWNAHSPRPGVFDTDGI 71
Query: 95 YDLVKFVKLVAEAGLYAHLRIGPYVCAEWNFGGFPLWLHFIPGIQFRTDNEPFKAEMQRF 154
DL +F++LV +AG+YA +R GP++CAEW+ GG P WL PG+ R F E++++
Sbjct: 72 LDLPRFLRLVKDAGMYAIVRPGPFICAEWDNGGLPPWLFREPGVGIRRHEPRFLDEVEKY 131
Query: 155 TAKIVDMMKQEKLYASQGGPIILSQIENEYGNIDSAYGAAGKSYIKWAAGMALSLDTGVP 214
+++ +++ ++ GGP++L Q+ENEYG AYG + Y++ A M VP
Sbjct: 132 LHQVLALVRPHQV--DLGGPVLLVQVENEYG----AYG-DDRDYLQAVADMIRGAGIDVP 184
Query: 215 WVMCQQSDAPDPIINTCNG-FYCDQFTPNSNNK-----------PKMWTENWSGWFLSFG 262
V Q +G F +S N+ P M E W GWF +G
Sbjct: 185 LVTVDQPVDAMLAAGGLDGVLRTSSFGSDSANRLRTLRDHQPTGPLMCMEFWDGWFDHWG 244
Query: 263 GAVPYRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNFDRTSGG-------PFISTSYDYD 315
G PVE A + G + N YM+HGGTNF TSG P + TSYDYD
Sbjct: 245 GRHHTTPVEQAAEELDALLAAGASV-NVYMFHGGTNFGLTSGANDKGIYRPTV-TSYDYD 302
Query: 316 APLDEYG 322
APLDE G
Sbjct: 303 APLDEAG 309
>gi|380694789|ref|ZP_09859648.1| beta-galactosidase [Bacteroides faecis MAJ27]
Length = 781
Score = 184 bits (467), Expect = 2e-43, Method: Compositional matrix adjust.
Identities = 117/325 (36%), Positives = 166/325 (51%), Gaps = 29/325 (8%)
Query: 32 RAVVIGGKRRVLISGSIHYPRSTPEMWPDLIQKSKDGGLDVIETYVFWNLHEPVRNQYNF 91
+ ++ G+ V+ + IHYPR E W I+ SK G++ I YVFWN HEP +Y+F
Sbjct: 33 KTFLLNGEPFVVKAAEIHYPRIPKEYWEHRIKMSKALGMNTICLYVFWNFHEPEEGKYDF 92
Query: 92 EGRYDLVKFVKLVAEAGLYAHLRIGPYVCAEWNFGGFPLWLHFIPGIQFRTDNEPFKAEM 151
G+ D+ F ++ E G+Y +R GPYVCAEW GG P WL I+ R + + +
Sbjct: 93 TGQKDIAAFCRMAQENGMYVIVRPGPYVCAEWEMGGLPWWLLKKEDIKLREQDPYYMERV 152
Query: 152 QRFTAKIVDMMKQEKLYASQGGPIILSQIENEYGN--IDSAYGAAGKSYIKWAAGMALSL 209
+ F ++ + L S+GG II+ Q+ENEYG+ ID Y AA + +K A
Sbjct: 153 KLFMNEVGKQLAD--LQISKGGNIIMVQVENEYGSFGIDKPYIAAIRDMVKQAGF----- 205
Query: 210 DTGVPWVMCQ-----QSDAPDPIINTCN---GFYCDQ----FTPNSNNKPKMWTENWSGW 257
TGVP C +++A D ++ T N G DQ N P M +E WSGW
Sbjct: 206 -TGVPLFQCDWNSNFENNALDDLLWTVNFGTGANIDQQFERLKELRPNTPLMCSEFWSGW 264
Query: 258 FLSFGGAVPYRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNFDRTSGGPF-----ISTSY 312
F +G R E+L + R +F + YM HGGT+F G F TSY
Sbjct: 265 FDHWGAKHETRSAEELVKGMKEMLDRNISF-SLYMTHGGTSFGHWGGANFPNFSPTCTSY 323
Query: 313 DYDAPLDEYGLIRQPKWGHLKDLHK 337
DYDAP++E G + PK+ ++DL K
Sbjct: 324 DYDAPINESGKV-TPKFLEVRDLLK 347
>gi|256831356|ref|YP_003160083.1| beta-galactosidase [Jonesia denitrificans DSM 20603]
gi|256684887|gb|ACV07780.1| Beta-galactosidase [Jonesia denitrificans DSM 20603]
Length = 584
Score = 184 bits (467), Expect = 2e-43, Method: Compositional matrix adjust.
Identities = 114/318 (35%), Positives = 164/318 (51%), Gaps = 28/318 (8%)
Query: 32 RAVVIGGKRRVLISGSIHYPRSTPEMWPDLIQKSKDGGLDVIETYVFWNLHEPVRNQYNF 91
R + G+ +ISG+IHY R P+ W D I+K++ GL+ IETYV WN H P R++++
Sbjct: 9 RDFTLDGEPFQIISGAIHYFRVHPDSWRDRIRKARLMGLNTIETYVAWNFHAPSRDEFHT 68
Query: 92 EGRYDLVKFVKLVAEAGLYAHLRIGPYVCAEWNFGGFPLWLHFIPGIQFRTDNEPFKAEM 151
+G DL +F+ ++ E GL A +R GPY+CAEW+ GG P WL P I R+ + + E+
Sbjct: 69 DGARDLGRFLDIIQEEGLRAIVRPGPYICAEWDNGGLPTWLTATPDIVVRSSDPTYLTEV 128
Query: 152 QRFTAKIVDMMKQEKLYASQGGPIILSQIENEYGNIDSAYGAAGKSYIKWAAGMALSLDT 211
+R+ + +++ ++ + GGPIIL Q+ENEYG AYG ++Y+ + +L
Sbjct: 129 ERYLEHLAPIVEPRQI--NHGGPIILMQVENEYG----AYG-NDRAYLTHLTNVYRNLGF 181
Query: 212 GVPWV--------MCQQSDAPDPIINTCNGFYCDQ----FTPNSNNKPKMWTENWSGWFL 259
VP M PD G D+ + P M +E W GWF
Sbjct: 182 VVPLTTVDQPMDDMLAHGTLPDLHTTGSFGSRIDERLATLREHQTTGPLMCSEFWIGWFD 241
Query: 260 SFGGAVPYRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNFDRTSGG-------PFISTSY 312
+G V D A A+ R G + N YM+HGGTNF T+G P + TSY
Sbjct: 242 HWGAHHHTTDVADAANALDRLLGAGASV-NIYMFHGGTNFGFTNGANDKGVYQPLV-TSY 299
Query: 313 DYDAPLDEYGLIRQPKWG 330
DYDAPL E G + W
Sbjct: 300 DYDAPLAEDGYPTEKYWA 317
>gi|413922057|gb|AFW61989.1| hypothetical protein ZEAMMB73_453254 [Zea mays]
Length = 139
Score = 184 bits (467), Expect = 2e-43, Method: Compositional matrix adjust.
Identities = 83/110 (75%), Positives = 95/110 (86%)
Query: 17 VLATTSFGANVTYDHRAVVIGGKRRVLISGSIHYPRSTPEMWPDLIQKSKDGGLDVIETY 76
++A + A V+YDHRAVVI G+RR+LISGSIHYPRSTPEMWP L+QK+KDGGLDV++TY
Sbjct: 18 MIAPSPANAAVSYDHRAVVINGQRRILISGSIHYPRSTPEMWPGLLQKAKDGGLDVVQTY 77
Query: 77 VFWNLHEPVRNQYNFEGRYDLVKFVKLVAEAGLYAHLRIGPYVCAEWNFG 126
VFWN HEPVR QY F RYDLV+FVKL +AGLY HLRIGPYVCAEWNFG
Sbjct: 78 VFWNGHEPVRGQYYFGDRYDLVRFVKLAKQAGLYVHLRIGPYVCAEWNFG 127
>gi|410456453|ref|ZP_11310314.1| beta-galactosidase [Bacillus bataviensis LMG 21833]
gi|409928122|gb|EKN65245.1| beta-galactosidase [Bacillus bataviensis LMG 21833]
Length = 867
Score = 183 bits (465), Expect = 3e-43, Method: Compositional matrix adjust.
Identities = 133/415 (32%), Positives = 201/415 (48%), Gaps = 35/415 (8%)
Query: 27 VTYDHRAVVIGGKRRVLISGSIHYPRSTPEMWPDLIQKSKDGGLDVIETYVFWNLHEPVR 86
+TYD ++ I +R ++S +IHY R W +++ K+K GG + IETY+ WN HE
Sbjct: 2 ITYDKKSWKIHNERVFILSAAIHYFRLPRAEWNEVLDKAKAGGCNTIETYIPWNFHEMNE 61
Query: 87 NQYNFEGRYDLVKFVKLVAEAGLYAHLRIGPYVCAEWNFGGFPLWLHFIPGIQFRTDNEP 146
+++F G DL F +L A+ LY R GPY+CAEW+FGGFP WL IQ+R+
Sbjct: 62 GEWDFSGDKDLAHFFQLCADKELYVIARPGPYICAEWDFGGFPWWLSTKKDIQYRSAQPA 121
Query: 147 FKAEMQRFTAKIVDMMKQEKLYASQGGPIILSQIENEYGNIDSAYGAAGKSYIKWAAGMA 206
F + ++ +++ ++ + +L ++ G +I+ Q+ENE+ AYG K Y+++
Sbjct: 122 FLHYVDQYFDRVIPIIDEYQL--TKNGTVIMVQVENEF----QAYGKPDKPYMEYIRDGM 175
Query: 207 LSLDTGVPWVMCQQSDAPDPIINTCN--------GFYCDQFTPNSNNKPKMWTENWSGWF 258
+ VP V C A + + N D+ P ++PK E W GWF
Sbjct: 176 KARGIDVPLVTC--YGAVEGAVEFRNFWSHSKHAAAILDERFP---DQPKGVMEFWIGWF 230
Query: 259 LSFGG-AVPYRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNFD----RTSGGPFI-STSY 312
+GG + E L + G T NYYMY GGTNFD RT G + +T+Y
Sbjct: 231 EQWGGNKADQKTPEQLERECYQLLSNGFTAINYYMYFGGTNFDHWGGRTVGEQTLCTTTY 290
Query: 313 DYDAPLDEYGLIRQPKWGHLKDLHKAIKLCE-----AALVATDPTYPSLGPNLEATVYKT 367
DYD +DEY L K+ LK H +K E A VA+D PS +L++ +
Sbjct: 291 DYDVAIDEY-LQPTRKYEVLKRYHSFVKWLEPLFTDAEKVASDMKLPS---DLKSERIAS 346
Query: 368 GSGLCSAFLANIGTNSDVTVKFNGNSYLLPAWSVSILPDCKNV-VFNTAKINSVT 421
G N VK + L + ++LP +NV V N I ++T
Sbjct: 347 PYGEVIFIENNRNERIQSHVKHGYDQILFTIEANTVLPIVRNVKVGNHFTIKTLT 401
>gi|334134215|ref|ZP_08507725.1| putative beta-galactosidase [Paenibacillus sp. HGF7]
gi|333608023|gb|EGL19327.1| putative beta-galactosidase [Paenibacillus sp. HGF7]
Length = 940
Score = 183 bits (464), Expect = 4e-43, Method: Compositional matrix adjust.
Identities = 120/338 (35%), Positives = 171/338 (50%), Gaps = 26/338 (7%)
Query: 26 NVTYDHRAVVIGGKRRVLISGSIHYPRSTPEMWPDLIQKSKDGGLDVIETYVFWNLHEPV 85
V YD + +I G+R ++S ++HY R W +++ KSK+ G + IETYV WN HE
Sbjct: 5 RVQYDRNSWIIDGRRVFILSAAVHYFRLPRAEWAEVLDKSKEAGCNCIETYVPWNWHEEE 64
Query: 86 RNQYNFEGRYDLVKFVKLVAEAGLYAHLRIGPYVCAEWNFGGFPLWLHFIPGIQFRTDNE 145
Q++F G DL F+ L AE GLY +R GPY+CAEW+ GG P WL P +Q+R +
Sbjct: 65 EGQWDFSGDKDLGAFLDLCAERGLYVIVRPGPYICAEWDMGGLPYWLERKPDMQYRKFHR 124
Query: 146 PFKAEMQRFTAKIVDMMKQEKLYASQGGPIILSQIENEYGNIDSAYGAAGKSYIKWAAGM 205
F + + ++V ++ L S G +I+ Q+ENE+ A G K+Y+++
Sbjct: 125 EFLHYVDLYWDRLVPVVLPRLL--SNSGTVIMVQVENEF----QALGKPDKAYMEYLRDG 178
Query: 206 ALSLDTGVPWVMCQQSDAPDPIINTCNGF-----YCDQFTPNSNNKPKMWTENWSGWFLS 260
+ VP V C A D + N + + ++PK E W GWF
Sbjct: 179 LIERGIDVPLVTCY--GAVDGAVEFRNFWSHAEEHARTLEERFADQPKGVLEFWIGWFEQ 236
Query: 261 FGGAVPYRPVEDLAFAVAR----FFQRGGTFQNYYMYHGGTNF----DRTSG-GPFISTS 311
+GG R + A V R + G T NYYM+ GGTNF RT G F++TS
Sbjct: 237 WGGP---RANQKTASQVERKTYELIREGFTAINYYMFFGGTNFGHWGGRTIGEHTFMTTS 293
Query: 312 YDYDAPLDEYGLIRQPKWGHLKDLHKAIKLCEAALVAT 349
YDYDA LDEY L K+ LK +H ++ E L T
Sbjct: 294 YDYDAALDEY-LRPTAKYKALKLVHDFVRWMEPLLTET 330
>gi|125526285|gb|EAY74399.1| hypothetical protein OsI_02287 [Oryza sativa Indica Group]
Length = 255
Score = 182 bits (462), Expect = 7e-43, Method: Composition-based stats.
Identities = 91/199 (45%), Positives = 113/199 (56%), Gaps = 48/199 (24%)
Query: 26 NVTYDHRAVVIGGKRRVLISGSIHYPRSTPEMWPDLIQKSKDGGLDVIETYVFWNLHEPV 85
+V+YD R++VI G+RR+++SGSIHYPRSTPE
Sbjct: 29 SVSYDDRSLVIDGQRRIILSGSIHYPRSTPEE---------------------------- 60
Query: 86 RNQYNFEGRYDLVKFVKLVAEAGLYAHLRIGPYVCAEWNFGGFPLWLHFIPGIQFRTDNE 145
+ AG+YA LRIGPY+C EWN+GG P WL IPG+QFR NE
Sbjct: 61 ------------------IQNAGMYAILRIGPYICGEWNYGGLPAWLRDIPGMQFRLHNE 102
Query: 146 PFKAEMQRFTAKIVDMMKQEKLYASQGGPIILSQIENEYGNIDSAY--GAAGKSYIKWAA 203
PF+ EM+ FT IV+ MK K++A QGGPIIL+QIENEYGNI + YI W A
Sbjct: 103 PFENEMETFTTLIVNKMKDSKMFAEQGGPIILAQIENEYGNIMGKLNNNQSASEYIHWCA 162
Query: 204 GMALSLDTGVPWVMCQQSD 222
MA + GVPW+MCQQ D
Sbjct: 163 DMANKQNVGVPWIMCQQDD 181
>gi|365876141|ref|ZP_09415664.1| beta-galactosidase [Elizabethkingia anophelis Ag1]
gi|442588464|ref|ZP_21007275.1| putative exported beta-galactosidase [Elizabethkingia anophelis
R26]
gi|365756153|gb|EHM98069.1| beta-galactosidase [Elizabethkingia anophelis Ag1]
gi|442561698|gb|ELR78922.1| putative exported beta-galactosidase [Elizabethkingia anophelis
R26]
Length = 628
Score = 181 bits (460), Expect = 1e-42, Method: Compositional matrix adjust.
Identities = 121/369 (32%), Positives = 176/369 (47%), Gaps = 38/369 (10%)
Query: 6 ILLLVLCWGFVVLATTSFGANVTYDHRAVVIGGKRRVLISGSIHYPRSTPEMWPDLIQKS 65
++L +L V++ + S + + ++ GK + SG +HYPR E W +Q
Sbjct: 8 LVLFILFACNVLIFSQSRKSTFEIKNGHFLLNGKLFSIHSGEMHYPRIPQEYWKHRLQMM 67
Query: 66 KDGGLDVIETYVFWNLHEPVRNQYNFEGRYDLVKFVKLVAEAGLYAHLRIGPYVCAEWNF 125
K GL+ + TYVFWN HE ++N+ G DL KF+K E GLY +R GPYVCAEW F
Sbjct: 68 KAMGLNAVTTYVFWNYHEENPGKWNWSGEKDLKKFIKTAQEVGLYVIIRPGPYVCAEWEF 127
Query: 126 GGFPLWLHFIPGIQFRTDNEPFKAEMQRFTAKIVDMMKQEKLYASQGGPIILSQIENEYG 185
GG+P WL I G++ R DN F AE Q++ ++ + +K L + GGP+I+ Q ENE+G
Sbjct: 128 GGYPWWLQNIKGLKIREDNNLFLAETQKYITQLYNQVKD--LQITNGGPVIMVQAENEFG 185
Query: 186 NI-----------DSAYGAAGKSYIKWAAGMALSLDTGVPWVM--------CQQSDAPDP 226
+ Y A +K A + W+ ++ D
Sbjct: 186 SFVAQRKDIPLASHRTYNAKIVKQLKDAGFSVPMFTSDGSWLFEGGSVVGALPTANGEDN 245
Query: 227 IINTCNGFYCDQFTPNSNNKPKMWTENWSGWFLSFGGAVPYRPVEDLAFAVARFFQRGGT 286
I N +Q+ N+N P M E + GW + P +A ++ + +
Sbjct: 246 IENLKK--IVNQY--NNNQGPYMVAEFYPGWLAHWAEKFPRVDAGTVARQTDKYLKNDVS 301
Query: 287 FQNYYMYHGGTNFDRTSGGPFIS--------TSYDYDAPLDEYGLIRQPKWGHLKDL--- 335
F NYYM HGGTNF T+G + TSYDYDAP+ E G R PK+ L+ +
Sbjct: 302 F-NYYMVHGGTNFGFTNGANYDKNHDIQPDLTSYDYDAPITEAGW-RTPKYDSLRAVISK 359
Query: 336 HKAIKLCEA 344
H KL E
Sbjct: 360 HTKAKLPEV 368
Score = 46.6 bits (109), Expect = 0.065, Method: Compositional matrix adjust.
Identities = 54/252 (21%), Positives = 95/252 (37%), Gaps = 60/252 (23%)
Query: 487 IKADEPL----LEDGSKTVLHVQSLGHALHA-------------FINGKLVGSGYGSSSN 529
+KAD+PL L G VL+ + + +ING+ VG ++
Sbjct: 398 VKADKPLSFEDLNQGHGYVLYRRHFNQPISGTLDLKGLRDYATIYINGEKVGELNRYYNH 457
Query: 530 AKVTVDFPIALAPGKNTFDLLSLTVGLQNYGAFYEKTGAGITGPVQLKGSGNGTNIDLSS 589
+ +D P +T ++L G NYG+ + GI V++ + N +++
Sbjct: 458 YTMPIDIPF-----NSTLEILVENWGRINYGSRINENTKGIISAVKIGDTEITGNWEMTK 512
Query: 590 QQWTYQTGLKGEELNFPSGSSTQWDSKSTLPKLQPLVWYKTTFDAPAGSEPVAIDFTGMG 649
+ Q + +G Q +L Y+ F+ + ID G
Sbjct: 513 LPFPDQFASTIKAKPIDTGKQAQLKDVPSL--------YQGEFELTETGDTF-IDMQSWG 563
Query: 650 KGEAWVNGQSIGRYWPTYVSQNGGCTDSCNYRGAYSSNKCLKNCGKPSQSLYHVPRSWLK 709
KG +VNG++IGR+W P Q+LY +P WLK
Sbjct: 564 KGVIFVNGRNIGRFWKV----------------------------GPQQTLY-IPGVWLK 594
Query: 710 SSGNTLVLFEEI 721
N +++F+++
Sbjct: 595 KGKNEIIIFDQL 606
>gi|383112460|ref|ZP_09933253.1| hypothetical protein BSGG_0667 [Bacteroides sp. D2]
gi|313693132|gb|EFS29967.1| hypothetical protein BSGG_0667 [Bacteroides sp. D2]
Length = 782
Score = 181 bits (460), Expect = 1e-42, Method: Compositional matrix adjust.
Identities = 115/323 (35%), Positives = 163/323 (50%), Gaps = 29/323 (8%)
Query: 32 RAVVIGGKRRVLISGSIHYPRSTPEMWPDLIQKSKDGGLDVIETYVFWNLHEPVRNQYNF 91
+ ++ GK V+ + IHYPR E W I+ K G++ I YVFWN HEP +Y+F
Sbjct: 33 KTFLLNGKPFVVKAAEIHYPRIPKEYWEHRIKMCKALGMNTICLYVFWNFHEPEEGKYDF 92
Query: 92 EGRYDLVKFVKLVAEAGLYAHLRIGPYVCAEWNFGGFPLWLHFIPGIQFRTDNEPFKAEM 151
G+ D+ F +L E G+Y +R GPYVCAEW GG P WL I+ R + + +
Sbjct: 93 TGQKDIAAFCRLAQENGMYVIVRPGPYVCAEWEMGGLPWWLLKKKDIKLREQDPYYMERV 152
Query: 152 QRFTAKIVDMMKQEKLYASQGGPIILSQIENEYGN--IDSAYGAAGKSYIKWAAGMALSL 209
+ F ++ + L S+GG II+ Q+ENEYG+ ID Y A + +K A
Sbjct: 153 KLFMNEVGKQLTD--LQISKGGNIIMVQVENEYGSFGIDKPYIAEIRDIVKQAGF----- 205
Query: 210 DTGVPWVMCQ-----QSDAPDPIINTCN----GFYCDQFTPNSNNKPK---MWTENWSGW 257
TGVP C +++A D ++ T N DQF +P M +E WSGW
Sbjct: 206 -TGVPLFQCDWNSNFENNALDDLLWTINFGTGANIDDQFKRLQELRPDIPLMCSEFWSGW 264
Query: 258 FLSFGGAVPYRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNFDRTSGGPF-----ISTSY 312
F +G R EDL + R +F + YM HGGT+F G F TSY
Sbjct: 265 FDHWGAKHETRSAEDLVKGMKEMLDRNISF-SLYMTHGGTSFGHWGGANFPNFSPTCTSY 323
Query: 313 DYDAPLDEYGLIRQPKWGHLKDL 335
DYDAP++E G + PK+ +++L
Sbjct: 324 DYDAPINESGKV-TPKYFEVRNL 345
>gi|423295816|ref|ZP_17273943.1| hypothetical protein HMPREF1070_02608 [Bacteroides ovatus
CL03T12C18]
gi|392671544|gb|EIY65016.1| hypothetical protein HMPREF1070_02608 [Bacteroides ovatus
CL03T12C18]
Length = 782
Score = 181 bits (459), Expect = 2e-42, Method: Compositional matrix adjust.
Identities = 115/323 (35%), Positives = 163/323 (50%), Gaps = 29/323 (8%)
Query: 32 RAVVIGGKRRVLISGSIHYPRSTPEMWPDLIQKSKDGGLDVIETYVFWNLHEPVRNQYNF 91
+ ++ GK V+ + IHYPR E W I+ K G++ I YVFWN HEP +Y+F
Sbjct: 33 KTFLLNGKPFVVKAAEIHYPRIPKEYWEHRIKMCKALGMNTICLYVFWNFHEPEEGKYDF 92
Query: 92 EGRYDLVKFVKLVAEAGLYAHLRIGPYVCAEWNFGGFPLWLHFIPGIQFRTDNEPFKAEM 151
G+ D+ F +L E G+Y +R GPYVCAEW GG P WL I+ R + + +
Sbjct: 93 TGQKDIAAFCRLAQENGMYVIVRPGPYVCAEWEMGGLPWWLLKKKDIKLREQDPYYMERV 152
Query: 152 QRFTAKIVDMMKQEKLYASQGGPIILSQIENEYGN--IDSAYGAAGKSYIKWAAGMALSL 209
+ F ++ + L S+GG II+ Q+ENEYG+ ID Y A + +K A
Sbjct: 153 KLFMNEVGKQLAD--LQISKGGNIIMVQVENEYGSFGIDKPYIAEIRDIVKQAGF----- 205
Query: 210 DTGVPWVMCQ-----QSDAPDPIINTCN----GFYCDQFTPNSNNKPK---MWTENWSGW 257
TGVP C +++A D ++ T N DQF +P M +E WSGW
Sbjct: 206 -TGVPLFQCDWNSNFENNALDDLLWTINFGTGANIDDQFKRLQELRPDIPLMCSEFWSGW 264
Query: 258 FLSFGGAVPYRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNFDRTSGGPF-----ISTSY 312
F +G R EDL + R +F + YM HGGT+F G F TSY
Sbjct: 265 FDHWGAKHETRSAEDLVKGMKEMLDRNISF-SLYMTHGGTSFGHWGGANFPNFSPTCTSY 323
Query: 313 DYDAPLDEYGLIRQPKWGHLKDL 335
DYDAP++E G + PK+ +++L
Sbjct: 324 DYDAPINESGKV-TPKYFEVRNL 345
>gi|296081427|emb|CBI16778.3| unnamed protein product [Vitis vinifera]
Length = 242
Score = 180 bits (457), Expect = 3e-42, Method: Compositional matrix adjust.
Identities = 92/165 (55%), Positives = 110/165 (66%), Gaps = 15/165 (9%)
Query: 187 IDSAYGAAGKSYIKWAAGMALSLDTGVPWVMCQQSDAPDPIINTCNGFYCDQFTPNSNNK 246
+D A GK +K + +LD V + Q INTCN FYCDQFTPNS NK
Sbjct: 90 LDLAKIPQGKGLLK---ILTFNLDHNVEFFSLQ--------INTCNSFYCDQFTPNSPNK 138
Query: 247 PKMWTENWSGWFLSFGGAVPYRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNFDRTSGGP 306
PKMWTENW GW +FG P+ P ED+ F+VARFF + NYYM HGGTNF RTSGGP
Sbjct: 139 PKMWTENWPGWSKTFGALDPHGPREDIVFSVARFFWK----VNYYMDHGGTNFGRTSGGP 194
Query: 307 FISTSYDYDAPLDEYGLIRQPKWGHLKDLHKAIKLCEAALVATDP 351
FI+T+YDY+AP+DEYGL R PK GHLK+L +AIK CE L+ +P
Sbjct: 195 FITTTYDYNAPIDEYGLARLPKCGHLKELRRAIKSCEHVLLYGEP 239
>gi|323358527|ref|YP_004224923.1| beta-galactosidase [Microbacterium testaceum StLB037]
gi|323274898|dbj|BAJ75043.1| beta-galactosidase [Microbacterium testaceum StLB037]
Length = 574
Score = 180 bits (456), Expect = 3e-42, Method: Compositional matrix adjust.
Identities = 116/311 (37%), Positives = 164/311 (52%), Gaps = 36/311 (11%)
Query: 35 VIGGKRRVLISGSIHYPRSTPEMWPDLIQKSKDGGLDVIETYVFWNLHEPVRNQYNFEGR 94
++ G+ +ISG++HY R PE W D I+ +K GL+ IETYV WN HEPVR +++ G
Sbjct: 12 LLDGRPHQVISGTLHYFRIHPEHWADRIRTAKAMGLNTIETYVAWNAHEPVRGEWDATGW 71
Query: 95 YDLVKFVKLVAEAGLYAHLRIGPYVCAEWNFGGFPLWLHFIPGIQFRTDNEPFKAEMQRF 154
DL +F+ L+A GL+A +R GPY+CAEW+ GG P+WL PGI R F + +
Sbjct: 72 NDLGRFLDLIAAEGLHAIVRPGPYICAEWHNGGLPVWLTSTPGIGIRRSEPQFVEAVSEY 131
Query: 155 TAKIVDMMKQEKLYASQGGPIILSQIENEYGNIDSAYGAAGKSYIKWA------AGMALS 208
++ +++ ++ +GG ++L QIENEYG AYG + K Y++ AG+ +
Sbjct: 132 LRRVYEIVAPRQI--DRGGNVVLVQIENEYG----AYG-SDKEYLRELVRVTKDAGITVP 184
Query: 209 L---DTGVPWVMCQQSDAPDPIINTCNGFYCDQ----FTPNSNNKPKMWTENWSGWFLSF 261
L D +PW M + P+ + G + + P M +E W GWF +
Sbjct: 185 LTTVDQPMPW-MLEAGSLPELHLTGSFGSRSAERLATLREHQPTGPLMCSEFWDGWFDWW 243
Query: 262 GG----AVPYRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNFDRTSG----GPF--ISTS 311
G P DL +A G N YM HGGTNF T+G G F I TS
Sbjct: 244 GSIHHTTDPAASAHDLDVLLA-----AGASVNIYMVHGGTNFGTTNGANDKGRFDPIVTS 298
Query: 312 YDYDAPLDEYG 322
YDYDAP+DE G
Sbjct: 299 YDYDAPIDESG 309
>gi|336417631|ref|ZP_08597952.1| hypothetical protein HMPREF1017_05060 [Bacteroides ovatus
3_8_47FAA]
gi|335935372|gb|EGM97326.1| hypothetical protein HMPREF1017_05060 [Bacteroides ovatus
3_8_47FAA]
Length = 782
Score = 180 bits (456), Expect = 4e-42, Method: Compositional matrix adjust.
Identities = 114/323 (35%), Positives = 163/323 (50%), Gaps = 29/323 (8%)
Query: 32 RAVVIGGKRRVLISGSIHYPRSTPEMWPDLIQKSKDGGLDVIETYVFWNLHEPVRNQYNF 91
+ ++ GK V+ + IHYPR E W I+ K G++ I YVFWN HEP +Y+F
Sbjct: 33 KTFLLNGKPFVVKAAEIHYPRIPKEYWEHRIKMCKALGMNTICLYVFWNFHEPEEGKYDF 92
Query: 92 EGRYDLVKFVKLVAEAGLYAHLRIGPYVCAEWNFGGFPLWLHFIPGIQFRTDNEPFKAEM 151
G+ D+ F +L E G+Y +R GPYVCAEW GG P WL I+ R + + +
Sbjct: 93 TGQKDIAAFCRLAQENGMYVIVRPGPYVCAEWEMGGLPWWLLKKKDIKLREQDPYYMERV 152
Query: 152 QRFTAKIVDMMKQEKLYASQGGPIILSQIENEYGN--IDSAYGAAGKSYIKWAAGMALSL 209
+ F ++ + L ++GG II+ Q+ENEYG+ ID Y A + +K A
Sbjct: 153 KLFMNEVGKQLTD--LQINKGGNIIMVQVENEYGSFGIDKPYIAEIRDIVKQAGF----- 205
Query: 210 DTGVPWVMCQ-----QSDAPDPIINTCN----GFYCDQFTPNSNNKPK---MWTENWSGW 257
TGVP C +++A D ++ T N DQF +P M +E WSGW
Sbjct: 206 -TGVPLFQCDWNSNFENNALDDLLWTINFGTGANIDDQFKRLQELRPDIPLMCSEFWSGW 264
Query: 258 FLSFGGAVPYRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNFDRTSGGPF-----ISTSY 312
F +G R EDL + R +F + YM HGGT+F G F TSY
Sbjct: 265 FDHWGAKHETRSAEDLVKGMKEMLDRNISF-SLYMTHGGTSFGHWGGANFPNFSPTCTSY 323
Query: 313 DYDAPLDEYGLIRQPKWGHLKDL 335
DYDAP++E G + PK+ +++L
Sbjct: 324 DYDAPINESGKV-TPKYFEVRNL 345
>gi|260813304|ref|XP_002601358.1| hypothetical protein BRAFLDRAFT_114709 [Branchiostoma floridae]
gi|229286653|gb|EEN57370.1| hypothetical protein BRAFLDRAFT_114709 [Branchiostoma floridae]
Length = 638
Score = 179 bits (454), Expect = 5e-42, Method: Compositional matrix adjust.
Identities = 124/362 (34%), Positives = 177/362 (48%), Gaps = 42/362 (11%)
Query: 24 GANVTYDHRAVVIGGKRRVLISGSIHYPRSTPEMWPDLIQKSKDGGLDVIETYVFWNLHE 83
G N T D + V I +SG+IHY R E W D + K K GL+ +ETYV WNLHE
Sbjct: 15 GENFTLDGKPVQI-------LSGAIHYFRVPREYWRDRMLKLKACGLNTLETYVCWNLHE 67
Query: 84 PVRNQYNFEGRYDLVKFVKLVAEAGLYAHLRIGPYVCAEWNFGGFPLWLHFIPGIQFRTD 143
P + +++F G D+ +++ A GL+ R GPY+CAEW++GG P WL P +Q RT
Sbjct: 68 PEKGKFDFTGMLDIAAYLREAANLGLWVIFRPGPYICAEWDYGGLPSWLLRDPNMQVRTT 127
Query: 144 NEPFKAEMQRFTAKIVDMMKQEKLYASQGGPIILSQIENEYGNI--DSAYGAAGKSYIKW 201
+P+ ++RF ++ ++K +GGPII Q+ENEYG+ D Y A K I+
Sbjct: 128 YQPYMEAVERFFDALLPIVK--PFQYKEGGPIIAMQVENEYGSYARDDKYLTAVKQAIQK 185
Query: 202 AAGMALSLDTGVPWVMCQQSDAPDPIINTCNGFYCDQFTPNSN---------NKPKMWTE 252
L L + + + ++ T N F P N+P+M E
Sbjct: 186 RGIEELLLTSDGGQIERLERGCIPGVLMTANF----NFNPKKQLGALKKLQPNRPQMVME 241
Query: 253 NWSGWFLSFG---GAVPYRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNFDRTSGGPFIS 309
WSGWF +G + E L + RF + N+YM+HGGTNF +G +I+
Sbjct: 242 FWSGWFDHWGRDHHKLHVEKFEQLLGDILRF----PSSVNFYMFHGGTNFGFMNGANYIN 297
Query: 310 ------TSYDYDAPLDEYGLIRQPKWGHLKDLHKAIKLCEAALVATDPTYPSLGPNLEAT 363
TSYDYDAPL E G PK+ ++L K + A A P + P E +
Sbjct: 298 GYKPDVTSYDYDAPLSEAG-DPTPKYYKTRELLKTL----AMKGAVPSELPEVPPATEKS 352
Query: 364 VY 365
Y
Sbjct: 353 SY 354
>gi|256376699|ref|YP_003100359.1| beta-galactosidase [Actinosynnema mirum DSM 43827]
gi|255921002|gb|ACU36513.1| Beta-galactosidase [Actinosynnema mirum DSM 43827]
Length = 579
Score = 179 bits (454), Expect = 6e-42, Method: Compositional matrix adjust.
Identities = 116/315 (36%), Positives = 165/315 (52%), Gaps = 35/315 (11%)
Query: 30 DHRAVVIGGKRRVLISGSIHYPRSTPEMWPDLIQKSKDGGLDVIETYVFWNLHEPVRNQY 89
+H ++ G RVL +G++HY R P++W D I+K++ GL+ IETY WNLHEPV Y
Sbjct: 8 EHDFLLDGRPHRVL-AGALHYFRVHPDLWADRIEKARLMGLNTIETYTPWNLHEPVEGAY 66
Query: 90 NFEGRYDLVKFVKLVAEAGLYAHLRIGPYVCAEWNFGGFPLWLHFIPGIQFRTDNEPFKA 149
+F G DL +F++LVA+AG++A +R GPY+CAEW+ GG P WL+ P + R +
Sbjct: 67 DFTGMLDLERFLRLVADAGMHAIVRPGPYICAEWDNGGLPAWLYRDPEVGVRRSEPRYLG 126
Query: 150 EMQRFTAKIVDMMKQEKLYASQGGPIILSQIENEYGNIDSAYGAAGKSYIKWAAGMALSL 209
+ + ++ D++ L +GGP++L QIENEYG AYG + K Y++ +
Sbjct: 127 AVSAYLRRVYDVVT--PLQIDRGGPVVLVQIENEYG----AYG-SDKFYLRHLVDLTREC 179
Query: 210 DTGVPWVMCQQSDAPDPIINTCNGFYCDQFTPNSNNK---------------PKMWTENW 254
VP Q P + + C T + ++ P M +E W
Sbjct: 180 GITVPLTTVDQ---PTDEMLSQGSLDCLHRTGSFGSRATERLATLRRHQPTGPLMCSEFW 236
Query: 255 SGWFLSFGGAVPYRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNFDRTSGG-------PF 307
+GWF +G ED A + G + N YM+HGGTNF TSG P
Sbjct: 237 NGWFDHWGDRHHTTSAEDSAAELDALLAAGASV-NIYMFHGGTNFGLTSGANDKGVYQPT 295
Query: 308 ISTSYDYDAPLDEYG 322
I TSYDYDAPLDE G
Sbjct: 296 I-TSYDYDAPLDEAG 309
>gi|255691973|ref|ZP_05415648.1| glycosyl hydrolase [Bacteroides finegoldii DSM 17565]
gi|260622382|gb|EEX45253.1| glycosyl hydrolase family 35 [Bacteroides finegoldii DSM 17565]
Length = 782
Score = 179 bits (454), Expect = 6e-42, Method: Compositional matrix adjust.
Identities = 114/323 (35%), Positives = 162/323 (50%), Gaps = 29/323 (8%)
Query: 32 RAVVIGGKRRVLISGSIHYPRSTPEMWPDLIQKSKDGGLDVIETYVFWNLHEPVRNQYNF 91
+ ++ G V+ + IHYPR E W I+ K G++ I YVFWN HEP +Y+F
Sbjct: 33 KTFLLNGNPFVVKAAEIHYPRIPKEYWEHRIKMCKALGMNTICLYVFWNFHEPEEGKYDF 92
Query: 92 EGRYDLVKFVKLVAEAGLYAHLRIGPYVCAEWNFGGFPLWLHFIPGIQFRTDNEPFKAEM 151
G+ D+ F +L E G+Y +R GPYVCAEW GG P WL I+ R + + +
Sbjct: 93 TGQKDIAAFCRLAQENGMYVIVRPGPYVCAEWEMGGLPWWLLKKKDIKLREQDPYYMERV 152
Query: 152 QRFTAKIVDMMKQEKLYASQGGPIILSQIENEYGN--IDSAYGAAGKSYIKWAAGMALSL 209
+ F ++ + L S+GG II+ Q+ENEYG+ ID Y A + +K A
Sbjct: 153 KLFMNEVGKQLTD--LQISKGGNIIMVQVENEYGSFGIDKPYIAEIRDIVKQAGF----- 205
Query: 210 DTGVPWVMCQ-----QSDAPDPIINTCN----GFYCDQFTPNSNNKPK---MWTENWSGW 257
TGVP C +++A D ++ T N DQF +P M +E WSGW
Sbjct: 206 -TGVPLFQCDWNSNFENNALDDLLWTINFGTGANIDDQFKRLQELRPDIPLMCSEFWSGW 264
Query: 258 FLSFGGAVPYRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNFDRTSGGPF-----ISTSY 312
F +G R EDL + R +F + YM HGGT+F G F TSY
Sbjct: 265 FDHWGAKHETRSAEDLVKGMKEMLDRNISF-SLYMTHGGTSFGHWGGANFPNFSPTCTSY 323
Query: 313 DYDAPLDEYGLIRQPKWGHLKDL 335
DYDAP++E G + PK+ +++L
Sbjct: 324 DYDAPINESGKV-TPKYFEVRNL 345
>gi|319934802|ref|ZP_08009247.1| beta-galactosidase [Coprobacillus sp. 29_1]
gi|319810179|gb|EFW06541.1| beta-galactosidase [Coprobacillus sp. 29_1]
Length = 589
Score = 178 bits (451), Expect = 1e-41, Method: Compositional matrix adjust.
Identities = 115/331 (34%), Positives = 170/331 (51%), Gaps = 37/331 (11%)
Query: 35 VIGGKRRVLISGSIHYPRSTPEMWPDLIQKSKDGGLDVIETYVFWNLHEPVRNQYNFEGR 94
++ GK ++SG+IHY R P+ W D + K G + +ETY+ WNLHEP +++F+G
Sbjct: 11 IVDGKPIKILSGAIHYFRIVPKHWEDSLYNLKALGFNTVETYIPWNLHEPKEGEFDFQGI 70
Query: 95 YDLVKFVKLVAEAGLYAHLRIGPYVCAEWNFGGFPLWLHFIPGIQFRTDNEPFKAEMQRF 154
D+V F+K E L +R PY+CAEW FGG P WL + R+D + +++ +
Sbjct: 71 KDVVSFIKKAQEMELMVIVRPSPYICAEWEFGGLPAWLLTYDNLHLRSDCPRYLEKVKNY 130
Query: 155 TAKIVDMMKQEKLYASQGGPIILSQIENEYGNIDSAYGAAGKSYIKWAAGMALSLDTGVP 214
++ M+ L ++QGGPII+ Q+ENE+G+ + K+Y+K + L L VP
Sbjct: 131 YEVLLPMLT--SLQSTQGGPIIMMQVENEFGSF-----SNNKTYLKKLKKIMLDLGVEVP 183
Query: 215 -------WVMCQQSDA--PDPIINTCN-GFY-------CDQFTPNSNNK-PKMWTENWSG 256
W +S + D ++ T N G + +QF N K P M E W G
Sbjct: 184 LFTSDGSWQQALESGSLIDDDVLVTANFGSHSHENLDVLEQFMANHQKKWPLMSMEFWDG 243
Query: 257 WFLSFGGAVPYRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNFDRTSGG--------PFI 308
WF +G + R +DLA V RG N YM+HGGTNF +G P +
Sbjct: 244 WFNRWGEEIITRDAQDLANCVKELLTRGSI--NLYMFHGGTNFGFMNGCSARGQKDLPQV 301
Query: 309 STSYDYDAPLDEYGLIRQPKWGHLKDLHKAI 339
TSYDYDA L E G I + K+ +K + K +
Sbjct: 302 -TSYDYDALLTEAGDITE-KYQCVKKVMKEL 330
>gi|414880685|tpg|DAA57816.1| TPA: putative RAN GTPase activating family protein [Zea mays]
Length = 598
Score = 178 bits (451), Expect = 1e-41, Method: Compositional matrix adjust.
Identities = 116/298 (38%), Positives = 148/298 (49%), Gaps = 60/298 (20%)
Query: 291 YMYHGGTNFDRTSGGPFISTSYDYDAPLDEYGLIRQPKWGHLKDLHKAIKLCEAALVATD 350
+ YHGGTNF RTSGGP+I+TSYDYDAPLDEYG IRQPK+GHLKDLH I+ E LV
Sbjct: 308 FKYHGGTNFGRTSGGPYITTSYDYDAPLDEYGNIRQPKYGHLKDLHDLIRSMEKILVHGK 367
Query: 351 PTYPSLGPNLEATVYKTGSGLCSAFLANIGTNSDVTVKFNGNSYLLPAWSVSILPDCKNV 410
S G N I + DV V +G ++L+PAWSVSILPDCK V
Sbjct: 368 YNDTSYGKNA------------------IFVDRDVKVTLSGGTHLVPAWSVSILPDCKTV 409
Query: 411 VFNTAKINSVTLVPSFSRQSLQVAADSSDAIGSGWSYINE---PVGISKDDAFTKPGLLE 467
+NTAKI + T V S++ ++ WS++ E P D+F LLE
Sbjct: 410 AYNTAKIKTQTSVMVKKANSVEKEPEALR-----WSWMPENLKPFMTDHRDSFRHSQLLE 464
Query: 468 QINTTADQSDYLWYSLSTNIKADEPLLEDGSKTVLHVQSLGHALHAFI------------ 515
QI T+ DQSDYLWY S K +GS T L+V + GH + +
Sbjct: 465 QITTSTDQSDYLWYRTSLEHKG------EGSYT-LYVNTSGHEMAKLLGRWSVRLPAPVS 517
Query: 516 ---------------NGKLVGSGYGSSSNAKVTVDFPIALAPGKNTFDLLSLTVGLQN 558
+ + G Y + + P+ L GKN LLS TVGL++
Sbjct: 518 GEAPLRKELRFSPQRHSRTQGQNYSADGAFVFQLQSPVKLHSGKNYVSLLSGTVGLKS 575
>gi|300770171|ref|ZP_07080050.1| beta-galactosidase [Sphingobacterium spiritivorum ATCC 33861]
gi|300762647|gb|EFK59464.1| beta-galactosidase [Sphingobacterium spiritivorum ATCC 33861]
Length = 638
Score = 178 bits (451), Expect = 1e-41, Method: Compositional matrix adjust.
Identities = 126/357 (35%), Positives = 174/357 (48%), Gaps = 47/357 (13%)
Query: 35 VIGGKRRVLISGSIHYPRSTPEMWPDLIQKSKDGGLDVIETYVFWNLHEPVRNQYNFEGR 94
V GK ++SG +HY R + W +Q K GL+ + TYVFWN HE +NFEG
Sbjct: 42 VYDGKATRILSGEMHYARIPHQYWKHRLQMVKSMGLNTVATYVFWNFHEESPGNWNFEGD 101
Query: 95 YDLVKFVKLVAEAGLYAHLRIGPYVCAEWNFGGFPLWLHFIPGIQFRTDNEPFKAEMQRF 154
+DL F+K E GL+ LR GPY CAEW+FGG+P WL I G++ R DN A+ +
Sbjct: 102 HDLAAFIKTAGEVGLHVILRPGPYACAEWDFGGYPWWLQKIDGLEIRRDN----AKFLEY 157
Query: 155 TAKIVDMMKQE--KLYASQGGPIILSQIENEYGNIDS-----------AYGAAGKSYIKW 201
T K +D + +E L + GGPII+ Q ENE+G+ S AY A K ++
Sbjct: 158 TKKYIDRLAKEVGSLQITNGGPIIMVQAENEFGSYVSQRKDIPLEEHKAYNAKIKKQLE- 216
Query: 202 AAGMALSLDTGVPWVMCQQSDAPD--PIINTCNGF-----YCDQFTPNSNNKPKMWTENW 254
AG + L T + + P P N N DQ+ N+N P M E +
Sbjct: 217 EAGFNVPLFTSDGSWLFEGGAIPGALPTANGENNISNLKKVVDQY--NNNQGPYMVAEFY 274
Query: 255 SGWFLSFGGAVPYRPVE--DLAFAVARFFQRGGTFQNYYMYHGGTNFDRTSGGPFIS--- 309
GW + A P+ V+ +A ++ Q +F NYYM HGGTNF TSG + +
Sbjct: 275 PGWLDHW--AEPFAKVDAGRIARQTEKYLQNDISF-NYYMVHGGTNFGFTSGANYNNKSD 331
Query: 310 -----TSYDYDAPLDEYGLIRQPKWGHLKDLHKAIKLCEAALVATDPTYPSLGPNLE 361
TSYDYDAP+ E G PK+ ++ + + T P P P +E
Sbjct: 332 IQPDITSYDYDAPISEAGWTT-PKYDSIR------TVIQKYADYTVPAIPKANPVIE 381
Score = 52.8 bits (125), Expect = 8e-04, Method: Compositional matrix adjust.
Identities = 83/325 (25%), Positives = 127/325 (39%), Gaps = 80/325 (24%)
Query: 416 KINSVTLVPSFSRQSLQVAADSSDAIGSGWSYINEPVGISKDDAFTKPGLLEQINTTADQ 475
K N V +PS L A+ D S + INE P EQ+ DQ
Sbjct: 375 KANPVIEIPSIK---LTAVANVFDYAKSAKTTINE-----------TPLNFEQL----DQ 416
Query: 476 SD-YLWYSLSTNIKADEPLLEDGSKTVLHVQSLGHALHAFINGKLVGSGYGSSSNAKVTV 534
++ Y+ YS N + L DG + V +I+G VG N ++ +
Sbjct: 417 ANGYVLYSKQFNQPINGKLKIDGLRDFAVV---------YIDGTKVGELNRVFKNYEMDI 467
Query: 535 DFPIALAPGKNTFDLLSLTVGLQNYGAFYEKTGAGITGPVQLKGSGNGTNIDLSSQQWTY 594
D P +T +L +G NYG+ GI PV + +++++ WT
Sbjct: 468 DIPF-----NSTLQILVENMGRINYGSEIIHNHKGIISPVLI------NDMEITGD-WTM 515
Query: 595 QT-------GLKGEELNFPSGSSTQWDSKSTLPKLQPLVWYKTTFDAPAGSEPVAIDFTG 647
Q L G++ + +TL K QP V Y+ TFD + ID
Sbjct: 516 QQLPMDKVPDLAGKQTATIQNTKVNTSKIATL-KGQP-VLYQGTFDLKEIGDTF-IDMEK 572
Query: 648 MGKGEAWVNGQSIGRYWPTYVSQNGGCTDSCNYRGAYSSNKCLKNCGKPSQSLYHVPRSW 707
GKG ++NG +IGRYW T P +LY +P +
Sbjct: 573 WGKGIVFINGINIGRYWKT----------------------------GPQHTLY-IPGPY 603
Query: 708 LKSSGNTLVLFEEIGGD-PTKISFV 731
LK N++V+FE++ + T++S V
Sbjct: 604 LKKGSNSIVIFEQLNDEIKTEVSTV 628
>gi|334338180|ref|YP_004543332.1| glycoside hydrolase family protein [Isoptericola variabilis 225]
gi|334108548|gb|AEG45438.1| glycoside hydrolase family 35 [Isoptericola variabilis 225]
Length = 603
Score = 177 bits (449), Expect = 2e-41, Method: Compositional matrix adjust.
Identities = 105/306 (34%), Positives = 157/306 (51%), Gaps = 21/306 (6%)
Query: 35 VIGGKRRVLISGSIHYPRSTPEMWPDLIQKSKDGGLDVIETYVFWNLHEPVRNQYNFEGR 94
++ G+ ++SG++HY R P+ W D I+K++ GL+ +ETYV WN+H P R ++ GR
Sbjct: 12 LLDGRSLQIVSGALHYFRVHPDQWADRIRKARLLGLNTVETYVAWNVHSPERGVFDTSGR 71
Query: 95 YDLVKFVKLVAEAGLYAHLRIGPYVCAEWNFGGFPLWLHFIPGIQFRTDNEPFKAEMQRF 154
DL +F+ LVA GL+A +R GPY+CAEW GG P WL P + R F + +
Sbjct: 72 RDLARFLDLVAAEGLHAIVRPGPYICAEWTGGGLPAWLFADPEVGVRRAEPRFLEAIGEY 131
Query: 155 TAKIVDMMKQEKLYASQGGPIILSQIENEYGNIDSAYGAAGKSYIKWAAGMALSLDTGVP 214
A ++ ++ + ++ ++GGP+++ Q+ENEYG + Y++ A M + VP
Sbjct: 132 YAALLPIVAERQV--TRGGPVLMVQVENEYGAYGDDPPVERERYLRALADMIRAQGIDVP 189
Query: 215 WVMCQQSD--------APDPIINTCNGFYCDQ----FTPNSNNKPKMWTENWSGWFLSFG 262
Q++ P+ + G + + P M E W GWF S G
Sbjct: 190 LFTSDQANDHHLSRGSLPELLTTANFGSRATERLAILRKHQPTGPLMCMEFWDGWFDSAG 249
Query: 263 GAVPYRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNFDRTSG----GPF--ISTSYDYDA 316
P E A + G + N YM HGGTNF TSG G + I+TSYDYDA
Sbjct: 250 LHHHTTPPEANARDLDDLLAAGASV-NLYMLHGGTNFGLTSGANDKGVYRPITTSYDYDA 308
Query: 317 PLDEYG 322
PL E+G
Sbjct: 309 PLSEHG 314
>gi|156376589|ref|XP_001630442.1| predicted protein [Nematostella vectensis]
gi|156217463|gb|EDO38379.1| predicted protein [Nematostella vectensis]
Length = 570
Score = 177 bits (448), Expect = 3e-41, Method: Compositional matrix adjust.
Identities = 136/418 (32%), Positives = 190/418 (45%), Gaps = 57/418 (13%)
Query: 55 PEMWPDLIQKSKDGGLDVIETYVFWNLHEPVRNQYNFEGRYDLVKFVKLVAEAGLYAHLR 114
PE W D + K K GL+ +ETYV WNLHE V++ + F+ D+VKFVKL GLY +R
Sbjct: 2 PEYWKDRLVKLKAMGLNTVETYVAWNLHEQVQDNFKFKDELDIVKFVKLAQRLGLYVIIR 61
Query: 115 IGPYVCAEWNFGGFPLWLHFIPGIQFRTDNEPFKAEMQRFTAKIVDMMKQEKLYASQGGP 174
GPY+CAEW+ GG P WL P ++ RT PF + R+ K+ ++ L QGGP
Sbjct: 62 PGPYICAEWDLGGLPSWLLSDPEMKLRTSYGPFMEAVDRYFQKLFPLL--TPLQYCQGGP 119
Query: 175 IILSQIENEYGNIDSAYGAAGKSYIKWAAGMALSLDTGVPWVMC-------QQSDAPDPI 227
II QIENEY + D +Y++ M + GV ++ ++ + +
Sbjct: 120 IIAWQIENEYSSFDK---KVDMTYMELLQKMMVK--NGVTEMLLMSDNLFSMKTHPINLV 174
Query: 228 INTCN-----GFYCDQFTPNSNNKPKMWTENWSGWFLSFGGAVPYRPVEDLAFAVARFFQ 282
+ T N Q +KP M TE W GWF +G P E L + F
Sbjct: 175 LKTINLQKNVKDALLQLKEIQPDKPLMVTEFWPGWFDVWGAKHHILPTEKLIKEIKDLFS 234
Query: 283 RGGTFQNYYMYHGGTNFDRTSGGPFIS--------------TSYDYDAPLDEYGLIRQPK 328
G + N+YM+HGGTNF +G F TSYDYDAPL E G I PK
Sbjct: 235 LGASI-NFYMFHGGTNFGFMNGASFTPSGVSVLEGDYQPDITSYDYDAPLSESGDI-TPK 292
Query: 329 WGHLKDLHKAIKLCEAALVATDPTYPSLGPNLEATVYKTGSGLCSAFLANIGTNSDVTVK 388
+ K L K I+ E A P+ P++ + +YK G + +D +
Sbjct: 293 Y---KALRKFIR--EHA--------PNPFPDIPSNLYKGAYGKTMYLFNFLIEETDQKI- 338
Query: 389 FNGNSYLLPAWSVSILPDCKN-------VVFNTA-KINSVTLVPSFSRQSLQVAADSS 438
F+ V LP + V++ TA K ++ +LV R V DS
Sbjct: 339 FDQAIVSDTVKPVEFLPINNHGGQGYGFVIYQTALKHDAKSLVVEIVRDRAHVMVDSK 396
>gi|251795198|ref|YP_003009929.1| beta-galactosidase [Paenibacillus sp. JDR-2]
gi|247542824|gb|ACS99842.1| Beta-galactosidase [Paenibacillus sp. JDR-2]
Length = 584
Score = 177 bits (448), Expect = 3e-41, Method: Compositional matrix adjust.
Identities = 114/346 (32%), Positives = 172/346 (49%), Gaps = 38/346 (10%)
Query: 27 VTYDHRAVVIGGKRRVLISGSIHYPRSTPEMWPDLIQKSKDGGLDVIETYVFWNLHEPVR 86
+T + +++ + +I+G+IHY R PE W D + K K G + +ETYV WN HEP
Sbjct: 4 LTIQGKQLMLNDRPFRIIAGAIHYFRVVPEYWRDRLLKLKACGFNTVETYVPWNFHEPEE 63
Query: 87 NQYNFEGRYDLVKFVKLVAEAGLYAHLRIGPYVCAEWNFGGFPLWLHFIPGIQFRTDNEP 146
++ FEG DL KF+ L E GLYA +R PY+CAEW FGG P WL PG++ R +P
Sbjct: 64 GRFVFEGMADLEKFIALAGELGLYAIVRPSPYICAEWEFGGLPAWLLKDPGMRLRCSYKP 123
Query: 147 FKAEMQRFTAKIVDMMKQEKLYASQGGPIILSQIENEYGNIDSAYGAAGKSYIKWAAGMA 206
F + + +++ + +++GGP+I QIENEYG +YG K+Y+ +
Sbjct: 124 FLDKADAYYDELIPRLT--PFLSTKGGPLIAMQIENEYG----SYG-NDKTYLNYLKEAL 176
Query: 207 LSLDTGVPWVMCQQSDAPDPII-----------------NTCNGF-YCDQFTPNSNNKPK 248
+ GV V+ SD P+ + + F ++ P ++P
Sbjct: 177 VK--RGVD-VLLFTSDGPEDFMLQGGMVEGVWETVNFGSRSAEAFAKLQEYQP---DQPL 230
Query: 249 MWTENWSGWFLSFGGAVPYRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNFDRTSGGPFI 308
M E W+GWF +G R D+A + G + N+YM+HGGTNF SG +
Sbjct: 231 MCMEFWNGWFDHWGETHHTRGAADVALVLDEMLAAGASV-NFYMFHGGTNFGFFSGANYT 289
Query: 309 S------TSYDYDAPLDEYGLIRQPKWGHLKDLHKAIKLCEAALVA 348
TSYDYD+PL E G + + + + + K +L L A
Sbjct: 290 DRLLPTVTSYDYDSPLSESGELTEKYYAVREVIAKYAELGPLELPA 335
Score = 43.5 bits (101), Expect = 0.47, Method: Compositional matrix adjust.
Identities = 55/232 (23%), Positives = 92/232 (39%), Gaps = 54/232 (23%)
Query: 502 LHVQSLGHALHAFINGKLVGSGYGSSSNAKVTVDFPIALAPGKNTFDLLSLTVGLQNYGA 561
L++Q + FI+G G S+ + D P PG +L +G NYG
Sbjct: 397 LNLQEVHDRALIFIDGVFKGVIERSNPEHDLVFDVP----PGGVELAILVENMGRINYGP 452
Query: 562 FYEKTGAGITGPVQLKGSGNGTNIDLSSQQWTYQTGLKGEELNFPSGSSTQWDSKSTLPK 621
+ K GIT V+ QQ+ + ++ L+ S Q+ + S+ P
Sbjct: 453 -HMKDVKGITEGVRF------------GQQFLFNWTVRPLPLD--DLSKLQFSALSSQPC 497
Query: 622 LQPLVWYKTTF--DAPAGSEPVAIDFTGMGKGEAWVNGQSIGRYWPTYVSQNGGCTDSCN 679
LQP +Y+ F D PA + + G KG A++NG ++GRYW
Sbjct: 498 LQP-SFYRGEFEVDEPADT---FLSMKGWTKGVAYMNGFNLGRYWEI------------- 540
Query: 680 YRGAYSSNKCLKNCGKPSQSLYHVPRSWLKSSGNTLVLFEEIGGDPTKISFV 731
P ++LY +P L++ N +++FE + +S +
Sbjct: 541 ---------------APQETLY-IPGPLLRTGKNEIIVFELHAAESASVSLL 576
>gi|374606374|ref|ZP_09679251.1| beta-galactosidase [Paenibacillus dendritiformis C454]
gi|374388019|gb|EHQ59464.1| beta-galactosidase [Paenibacillus dendritiformis C454]
Length = 583
Score = 176 bits (446), Expect = 4e-41, Method: Compositional matrix adjust.
Identities = 119/340 (35%), Positives = 170/340 (50%), Gaps = 21/340 (6%)
Query: 27 VTYDHRAVVIGGKRRVLISGSIHYPRSTPEMWPDLIQKSKDGGLDVIETYVFWNLHEPVR 86
++YD +G + LISG+IHY R P W D ++K K G + IETYV WNLHEP
Sbjct: 4 LSYDQGQFTMGDRPIQLISGAIHYFRVVPAYWEDRLRKIKAMGCNCIETYVAWNLHEPRE 63
Query: 87 NQYNFEGRYDLVKFVKLVAEAGLYAHLRIGPYVCAEWNFGGFPLWLHFIPGIQFRTDNEP 146
+++FEG D+ +FV+L E GLY +R PY+CAEW FGG P WL ++ R ++
Sbjct: 64 GEFHFEGMSDVAEFVRLAGELGLYVIVRPSPYICAEWEFGGLPAWL-LKDDMRLRCNDPR 122
Query: 147 FKAEMQRFTAKIVDMMKQEKLYASQGGPIILSQIENEYGNI--DSAYGAAGKSYIKWAAG 204
F ++ + ++ + L A++GGPII QIENEYG+ D AY A ++ +
Sbjct: 123 FLEKVAAYYDALLPQLT--PLLATKGGPIIAVQIENEYGSYGNDQAYLQAQRAMLIERGV 180
Query: 205 MALSLDTGVPWVMCQQSDAPDPIINTCN-----GFYCDQFTPNSNNKPKMWTENWSGWFL 259
L + P Q + ++ T N D+ + P M E W+GWF
Sbjct: 181 DVLLFTSDGPQDDMLQGGMAEGVLATVNFGSRPKEAFDKLKEYQPDGPLMCMEYWNGWFD 240
Query: 260 SFGGAVPYRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNFDRTSGG-------PFISTSY 312
+ R ED A + G + N+YM HGGTNF SG P + TSY
Sbjct: 241 HWFEQHHTRDAEDAARVLDDMLGMGASV-NFYMVHGGTNFGFGSGANHSDKYEPTV-TSY 298
Query: 313 DYDAPLDEYGLIRQPKWGHLKD-LHKAIKLCEAALVATDP 351
DYDA + E G + PK+ ++ + K + L E L A P
Sbjct: 299 DYDAAISEAGDL-TPKYHAFREVIGKYVSLPEGDLPANTP 337
>gi|443684013|gb|ELT88070.1| hypothetical protein CAPTEDRAFT_181391 [Capitella teleta]
Length = 655
Score = 176 bits (445), Expect = 7e-41, Method: Compositional matrix adjust.
Identities = 116/337 (34%), Positives = 176/337 (52%), Gaps = 39/337 (11%)
Query: 33 AVVIGGKRRVLISGSIHYPRSTPEMWPDLIQKSKDGGLDVIETYVFWNLHEPVRNQYNFE 92
A + GK+ +L+SG++HY R PE W D + K K GL+ +ETYV WN HE VR ++F
Sbjct: 10 AFFLNGKKTLLLSGAVHYFRVVPEYWRDRLLKVKAAGLNCVETYVAWNAHEAVRGTFDFS 69
Query: 93 GRYDLVKFVKLVAEAGLYAHLRIGPYVCAEWNFGGFPLWLHFIPGIQFRTDNEPFKAEMQ 152
G DL +F+++ + GLY LR GPY+C+EW+FGG P WL P ++ RT P+ +
Sbjct: 70 GILDLRRFIQIAQDVGLYVLLRPGPYICSEWDFGGLPSWLLHDPEMKVRTSYPPYLEAVD 129
Query: 153 RFTAKIVDMMKQEKLYASQGGPIILSQIENEYGNI--DSAYGAAGKS-YIKWA-AGMALS 208
+ AKI+ ++ L S+GGPII Q+ENEYG+ D Y K+ +IK+ + +
Sbjct: 130 AYLAKILPLVND--LQMSKGGPIIAVQLENEYGSYGDDLDYKLFLKNQFIKYGIEELLFT 187
Query: 209 LDTGVPWVMCQQSDAPDP-IINTCN------GFYCDQFTPNSNNK--PKMWTENWSGWFL 259
D G + P P ++ T N G+ ++ N P M E WSGWF
Sbjct: 188 SDNG-----TGIQNGPIPGVLATTNFQEQEQGYLMFEYLRNIKQPGLPMMVMEFWSGWFD 242
Query: 260 SFGGAVPYRPVEDLAFA-VARFFQRGGTFQNYYMYHGGTNF-------------DRTSGG 305
+G + F V ++ G+ N+YM+HGGTNF + G
Sbjct: 243 HWGEQ--HNLCHHAEFIDVFKWILLEGSSVNFYMFHGGTNFGFMAGANEDFGATNEGGGE 300
Query: 306 PFI--STSYDYDAPLDEYGLIRQPKWGHLKDLHKAIK 340
P+ +TSYDYD P+ E G + + K+ ++++ +K
Sbjct: 301 PYAADTTSYDYDCPVSESGQLNE-KFYEIRNILSEMK 336
>gi|301763006|ref|XP_002916929.1| PREDICTED: beta-galactosidase-1-like protein 3-like [Ailuropoda
melanoleuca]
Length = 1209
Score = 175 bits (444), Expect = 7e-41, Method: Compositional matrix adjust.
Identities = 120/370 (32%), Positives = 179/370 (48%), Gaps = 32/370 (8%)
Query: 36 IGGKRRVLISGSIHYPRSTPEMWPDLIQKSKDGGLDVIETYVFWNLHEPVRNQYNFEGRY 95
+GG + ++ GSIHY R E W D + K K G + + TYV WNLHEP R +++F
Sbjct: 499 LGGHKFLIFGGSIHYFRVPREYWRDRLMKLKACGFNTLTTYVPWNLHEPERGKFDFSENL 558
Query: 96 DLVKFVKLVAEAGLYAHLRIGPYVCAEWNFGGFPLWLHFIPGIQFRTDNEPFKAEMQRFT 155
DL FV + AE GL+ LR GPY+C+E + GG P WL P + RT + F + ++
Sbjct: 559 DLEAFVLMAAEIGLWVILRPGPYICSEIDLGGLPSWLLQDPEMILRTTYKGFVEAVDKYF 618
Query: 156 AKIVDMMKQEKLYASQGGPIILSQIENEYGNIDSAYGAAGKSYIKWAAGMALSLDTGVPW 215
++ + L +GGPII Q+ENEYG+ A K Y+ + L+ G+
Sbjct: 619 DHLIS--RVVPLQYHKGGPIIAVQVENEYGSF-----AVDKDYMPYVRKAL--LERGIVE 669
Query: 216 VMCQQSDAPDPI------------INTCNGFYCDQFTPNSNNKPKMWTENWSGWFLSFGG 263
++ DA + +NT +Q + NKP M E W GWF ++GG
Sbjct: 670 LLVTSDDAENLQKGYLEGVLATINMNTFEKSAFEQLSQLQRNKPIMVMEYWVGWFDTWGG 729
Query: 264 AVPYRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNFDRTSGGPF------ISTSYDYDAP 317
ED+ V++F +F N YM+HGGTNF +G + + TSYDYDA
Sbjct: 730 KHMVNNAEDVEETVSKFITSEISF-NVYMFHGGTNFGFMNGATYFGIHRAVVTSYDYDAL 788
Query: 318 LDEYGLIRQPKWGHLKDLHK---AIKLCEAALVATDPTYPSLGPNLEATVYKTGSGLCSA 374
L E G + K+ L+ L + A+ L + YPS+ P+L ++ L
Sbjct: 789 LTEAGDYTK-KYFKLQRLFRSVLAMPLPPLPELTPKAKYPSVKPSLYLPLWDALQYLNEP 847
Query: 375 FLANIGTNSD 384
++N N +
Sbjct: 848 VISNRPVNME 857
Score = 73.2 bits (178), Expect = 6e-10, Method: Compositional matrix adjust.
Identities = 45/145 (31%), Positives = 69/145 (47%), Gaps = 25/145 (17%)
Query: 42 VLISGSIHYPRSTPEMWPDLIQKSKDGGLDVIETYVFWNLHEPVRNQYNFEGRYDLVKFV 101
++I+G+IHY R E W D + K K G + + T FV
Sbjct: 64 LIIAGTIHYFRVPREYWRDRLMKLKACGFNTVTT-----------------------AFV 100
Query: 102 KLVAEAGLYAHLRIGPYVCAEWNFGGFPLWLHFIPGIQFRTDNEPFKAEMQRFTAKIVDM 161
+ ++ GL+ L GPY+ ++ + GG P WL P ++ RT F + + KI+
Sbjct: 101 AMASDVGLWVILCPGPYIGSDLDLGGLPSWLLRDPKMKLRTTYRGFTKAVNLYFDKIIPK 160
Query: 162 MKQEKLYASQGGPIILSQIENEYGN 186
+ Q L +GGPII Q+ENEYG+
Sbjct: 161 IVQ--LQYGKGGPIIALQVENEYGS 183
>gi|410972395|ref|XP_003992645.1| PREDICTED: beta-galactosidase-1-like protein 3 [Felis catus]
Length = 664
Score = 175 bits (444), Expect = 9e-41, Method: Compositional matrix adjust.
Identities = 121/378 (32%), Positives = 182/378 (48%), Gaps = 32/378 (8%)
Query: 36 IGGKRRVLISGSIHYPRSTPEMWPDLIQKSKDGGLDVIETYVFWNLHEPVRNQYNFEGRY 95
+GG + ++ GSIHY R E W D + K K G + + TYV WNLHEP R +++F G
Sbjct: 93 LGGHKFLIFGGSIHYFRVPREYWRDRLLKLKACGFNTLTTYVPWNLHEPQRGKFDFSGNL 152
Query: 96 DLVKFVKLVAEAGLYAHLRIGPYVCAEWNFGGFPLWLHFIPGIQFRTDNEPFKAEMQRFT 155
DL FV + AE GL+ LR GPY+C+E + GG P WL P + RT + F + ++
Sbjct: 153 DLEAFVLMAAEIGLWVILRPGPYICSEMDLGGLPSWLLQDPKMILRTTYKGFVEAVNKYF 212
Query: 156 AKIVDMMKQEKLYASQGGPIILSQIENEYGNIDSAYGAAGKSYIKWAAGMALSLDTGVPW 215
++ + L + GPII Q+ENEYG+ A K Y+ + L+ G+
Sbjct: 213 DHLIS--RVVPLQYRKRGPIIAVQVENEYGSF-----AEDKDYMPYIQKAL--LERGIVE 263
Query: 216 VMCQQSDAPDPI------------INTCNGFYCDQFTPNSNNKPKMWTENWSGWFLSFGG 263
++ DA + +NT Q + NKP M E W GWF ++GG
Sbjct: 264 LLMTSDDAKHMLKGYIEGVLATINMNTFQINDFKQLSQVQRNKPIMVMEFWVGWFDTWGG 323
Query: 264 AVPYRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNFDRTSGGPF------ISTSYDYDAP 317
+ ED+ V++F +F N YM+HGGTNF +G + + TSYDYDA
Sbjct: 324 KHMIKNAEDVEDTVSKFITSEISF-NVYMFHGGTNFGFMNGATYFGKHRGVVTSYDYDAV 382
Query: 318 LDEYGLIRQPKWGHLKDLH---KAIKLCEAALVATDPTYPSLGPNLEATVYKTGSGLCSA 374
L E G + K+ L+ L A+ L ++ YP++ P+L ++ L
Sbjct: 383 LTEAGDYTE-KYFKLRKLFGSVVAVHLPPLPKLSPKAEYPAVKPSLYLPLWDVLQYLNKP 441
Query: 375 FLANIGTNSDVTVKFNGN 392
+++ N + NGN
Sbjct: 442 VISHTPVNMESLPINNGN 459
>gi|384108880|ref|ZP_10009768.1| Beta-galactosidase [Treponema sp. JC4]
gi|383869584|gb|EID85195.1| Beta-galactosidase [Treponema sp. JC4]
Length = 592
Score = 175 bits (444), Expect = 9e-41, Method: Compositional matrix adjust.
Identities = 108/319 (33%), Positives = 162/319 (50%), Gaps = 40/319 (12%)
Query: 35 VIGGKRRVLISGSIHYPRSTPEMWPDLIQKSKDGGLDVIETYVFWNLHEPVRNQYNFEGR 94
++ GK +ISGSIHY R PE W D ++K K+ G + +ETY+ WN+ EP + ++ F+G
Sbjct: 11 LLDGKPFQIISGSIHYFRVVPEYWQDRLEKLKNMGCNTVETYIPWNITEPRKGEFCFDGL 70
Query: 95 YDLVKFVKLVAEAGLYAHLRIGPYVCAEWNFGGFPLWLHFIPGIQFRTDNEPFKAEMQRF 154
D KF+ L + GLYA +R PY+CAEW GG P W+ +PG++ R NEP+ ++ +
Sbjct: 71 CDFEKFLDLAQKLGLYAIVRPSPYICAEWELGGLPSWIFTVPGLEPRCKNEPYYQNVRDY 130
Query: 155 TAKIVDMMKQEKLYASQGGPIILSQIENEYGNIDSAYGAAGKSYIKWAAGMALSLDTGVP 214
++ + ++ +GG IIL QIENEYG Y SY+ + G+ VP
Sbjct: 131 YKVLLPRLVNHQI--DKGGNIILMQIENEYG-----YYGKDMSYMHFLEGLMREGGITVP 183
Query: 215 WVMCQQSDAPDPIINTCNG-------------FYCD---QFTPNSNNKPKMWTENWSGWF 258
+V I C+G + + N P M E W GWF
Sbjct: 184 FVTSDGPWGKMFIHGQCDGALPTGNFGSHARPLFANMKRMMKKTGNRGPLMCMEFWIGWF 243
Query: 259 LSFGG-----AVPYRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNFDRTSGGPFI----- 308
++G + R ++DL + + ++G N+YM+HGGTNF +G +
Sbjct: 244 DAWGNKEHKTSKLKRNIKDLNYML----KKGNV--NFYMFHGGTNFGFMNGSNYFTKLTP 297
Query: 309 -STSYDYDAPLDEYGLIRQ 326
+TSYDYDAPL E G I +
Sbjct: 298 DTTSYDYDAPLSEDGKITE 316
>gi|84494646|ref|ZP_00993765.1| beta-galactosidase [Janibacter sp. HTCC2649]
gi|84384139|gb|EAQ00019.1| beta-galactosidase [Janibacter sp. HTCC2649]
Length = 592
Score = 175 bits (443), Expect = 9e-41, Method: Compositional matrix adjust.
Identities = 113/316 (35%), Positives = 163/316 (51%), Gaps = 36/316 (11%)
Query: 43 LISGSIHYPRSTPEMWPDLIQKSKDGGLDVIETYVFWNLHEPVRNQYNFEGRYDLVKFVK 102
++SG+IHY R P++W D +++ GL+ +ETYV WN HE VR + +F G DL +F+
Sbjct: 26 VLSGAIHYFRIHPDLWEDRLRRLAAMGLNTVETYVAWNFHERVRGEIDFTGPRDLARFIS 85
Query: 103 LVAEAGLYAHLRIGPYVCAEWNFGGFPLWLHFIPGIQFRTDNEPFKAEMQRFTAKIVDMM 162
L + GL +R GPY+CAEW+FGG P WL PGI RT + F A + + +V ++
Sbjct: 86 LAGDLGLDVIVRPGPYICAEWDFGGLPAWLMTEPGIALRTSDPAFLAAVDDWFDAVVPVI 145
Query: 163 KQEKLYASQGGPIILSQIENEYGNIDSAYGAAGKSYIKWAAGMALSLDTGVPWVMCQQSD 222
+ L + GGP++ Q+ENEYG +YG G+ LD G+ V+ SD
Sbjct: 146 R--PLLTTAGGPVVAVQVENEYG----SYGDDAAYLEHCRKGL---LDRGID-VLLFTSD 195
Query: 223 APDP----------IINTCN-GFYCD----QFTPNSNNKPKMWTENWSGWFLSFGGAVPY 267
P P ++ T N G D + P M E W+GWF +G
Sbjct: 196 GPGPDWLDNGTIPGVLATVNFGSRTDEAFAELRKVQPAGPDMVMEYWNGWFDHWGEPHHV 255
Query: 268 RPVEDLAFAVARFFQRGGTFQNYYMYHGGTNFDRTSGG--------PFISTSYDYDAPLD 319
R V+D A + + GG+ N+YM HGGTNF SG P + TSYDYDA +
Sbjct: 256 RDVDDAAGVLDDVLRAGGSV-NFYMAHGGTNFGLWSGANVEDGKLQPTV-TSYDYDAAVG 313
Query: 320 EYGLIRQPKWGHLKDL 335
E G + PK+ +++
Sbjct: 314 EAGEL-TPKFHAFREV 328
>gi|29345700|ref|NP_809203.1| beta-galactosidase [Bacteroides thetaiotaomicron VPI-5482]
gi|383123143|ref|ZP_09943828.1| hypothetical protein BSIG_0114 [Bacteroides sp. 1_1_6]
gi|29337593|gb|AAO75397.1| beta-galactosidase precursor [Bacteroides thetaiotaomicron
VPI-5482]
gi|251841761|gb|EES69841.1| hypothetical protein BSIG_0114 [Bacteroides sp. 1_1_6]
Length = 779
Score = 175 bits (443), Expect = 1e-40, Method: Compositional matrix adjust.
Identities = 123/355 (34%), Positives = 181/355 (50%), Gaps = 33/355 (9%)
Query: 3 SKEILLLVLCWGFVVLATTSFGANVTYD--HRAVVIGGKRRVLISGSIHYPRSTPEMWPD 60
K +L L++ V+ ++ S + T++ ++ G+ V+ + IHYPR E W
Sbjct: 2 KKPLLYLLILVVAVLGSSCSQSSEGTFEVGKNTFLLNGEPFVVKAAEIHYPRIPKEYWEH 61
Query: 61 LIQKSKDGGLDVIETYVFWNLHEPVRNQYNFEGRYDLVKFVKLVAEAGLYAHLRIGPYVC 120
I+ K G++ I YVFWN HEP +Y+F G+ D+ F +L E G+Y +R GPYVC
Sbjct: 62 RIKMCKALGMNTICLYVFWNFHEPEEGRYDFAGQKDIAAFCRLAQENGMYVIVRPGPYVC 121
Query: 121 AEWNFGGFPLWLHFIPGIQFRTDNEPFKAEMQRFTAKIVDMMKQ-EKLYASQGGPIILSQ 179
AEW GG P WL I+ R + +P+ M+R + ++ KQ L S+GG II+ Q
Sbjct: 122 AEWEMGGLPWWLLKKKDIKLR-EQDPYY--MERVKLFLNEVGKQLADLQISKGGNIIMVQ 178
Query: 180 IENEYG--NIDSAYGAAGKSYIKWAAGMALSLDTGVPWVMCQ-----QSDAPDPIINTCN 232
+ENEYG ID Y + + +K A TGVP C +++A D ++ T N
Sbjct: 179 VENEYGAFGIDKPYISEIRDMVKQAGF------TGVPLFQCDWNSNFENNALDDLLWTIN 232
Query: 233 ---GFYCD-QFTPNSNNKPK---MWTENWSGWFLSFGGAVPYRPVEDLAFAVARFFQRGG 285
G D QF +P M +E WSGWF +G R E+L + R
Sbjct: 233 FGTGANIDEQFKRLKELRPDTPLMCSEFWSGWFDHWGAKHETRSAEELVKGMKEMLDRNI 292
Query: 286 TFQNYYMYHGGTNFDRTSGGPF-----ISTSYDYDAPLDEYGLIRQPKWGHLKDL 335
+F + YM HGGT+F G F TSYDYDAP++E G + PK+ +++L
Sbjct: 293 SF-SLYMTHGGTSFGHWGGANFPNFSPTCTSYDYDAPINESGKV-TPKYLEVRNL 345
>gi|167856235|ref|ZP_02478970.1| beta-galactosidase [Haemophilus parasuis 29755]
gi|167852655|gb|EDS23934.1| beta-galactosidase [Haemophilus parasuis 29755]
Length = 596
Score = 175 bits (443), Expect = 1e-40, Method: Compositional matrix adjust.
Identities = 111/320 (34%), Positives = 161/320 (50%), Gaps = 40/320 (12%)
Query: 31 HRAVVIGGKRRVLISGSIHYPRSTPEMWPDLIQKSKDGGLDVIETYVFWNLHEPVRNQYN 90
+ ++ GK ++SG++HY R PE W + K G + +ETYV WNLH+P +Q+N
Sbjct: 7 EKDFLLNGKPFKILSGAVHYFRIVPEYWYKTLYNLKAMGCNTVETYVPWNLHQPQPDQFN 66
Query: 91 FEGRYDLVKFVKLVAEAGLYAHLRIGPYVCAEWNFGGFPLWLHFIPGIQFRTDNEPFKAE 150
F R DLVKF++ + GLY LR PY+CAEW FGG P WL IP I+ R ++ F AE
Sbjct: 67 FSKRADLVKFLQTAKDLGLYVILRPTPYICAEWEFGGLPAWLLNIPNIRLRQNDPLFIAE 126
Query: 151 MQRFTAKIVDMMKQEKLYASQGGPIILSQIENEYGNIDSAYGAAGKSYIKWAAGMALSLD 210
+ R+ +++ + ++ +QGG I++ QIENEYG+ + K+Y++ + L
Sbjct: 127 IDRYFQELLPRIAPYQI--TQGGNILMMQIENEYGSFGN-----DKNYLRAILALMLIHG 179
Query: 211 TGVP-------WVMCQQSDA--PDPIINTCN------------GFYCDQFTPNSNNKPKM 249
VP W ++ A D I+ T N Y D+ + + P M
Sbjct: 180 VNVPLFTSDGAWQNALEAGALIEDDILPTGNFGSRSNENLDELQRYIDK---HGKSYPLM 236
Query: 250 WTENWSGWFLSFGGAVPYRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNFD-------RT 302
E W GWF + V R +DLA +R N+YM+ GGTNF R
Sbjct: 237 CMEFWDGWFNRWKEPVIRRDAQDLADCTKELLERASI--NFYMFQGGTNFGFWNGCSARL 294
Query: 303 SGGPFISTSYDYDAPLDEYG 322
TSYDYDAP+ E+G
Sbjct: 295 DTDLPQVTSYDYDAPVHEWG 314
Score = 42.4 bits (98), Expect = 1.00, Method: Compositional matrix adjust.
Identities = 38/154 (24%), Positives = 60/154 (38%), Gaps = 19/154 (12%)
Query: 511 LHAFINGKLVGSGYGSSSNAKVTVDFPIALAPGKNTFDLLSLTVGLQNYGAFYEKTGAGI 570
+ F + + V + Y +V + P G+ DLL +G NYG +
Sbjct: 411 IQIFNDKQKVATQYQHEIGNEVMLQTP----NGEFQLDLLVENMGRVNYGG-------KL 459
Query: 571 TGPVQLKGSGNGTNIDLSSQQWTYQTGLKGEELNFPSGSSTQWDSKSTLPKLQPLVWYKT 630
Q KG G G +DL + TG + ++F +D + Q +Y+
Sbjct: 460 LAATQHKGIGAGAVLDLH-----FHTGWQHYAIDFDRLEEIDFDGEK---DSQAPSFYQF 511
Query: 631 TFDAPAGSEPVAIDFTGMGKGEAWVNGQSIGRYW 664
+ ID GKG VNG+++GRYW
Sbjct: 512 KLHLDQEPQDTFIDTRAFGKGVIVVNGENLGRYW 545
>gi|219870459|ref|YP_002474834.1| beta-galactosidase [Haemophilus parasuis SH0165]
gi|219690663|gb|ACL31886.1| beta-galactosidase, glucosyl hydrolase family protein [Haemophilus
parasuis SH0165]
Length = 596
Score = 175 bits (443), Expect = 1e-40, Method: Compositional matrix adjust.
Identities = 111/320 (34%), Positives = 161/320 (50%), Gaps = 40/320 (12%)
Query: 31 HRAVVIGGKRRVLISGSIHYPRSTPEMWPDLIQKSKDGGLDVIETYVFWNLHEPVRNQYN 90
+ ++ GK ++SG++HY R PE W + K G + +ETYV WNLH+P +Q+N
Sbjct: 7 EKDFLLNGKPFKILSGAVHYFRIVPEYWYKTLYNLKAMGCNTVETYVPWNLHQPQPDQFN 66
Query: 91 FEGRYDLVKFVKLVAEAGLYAHLRIGPYVCAEWNFGGFPLWLHFIPGIQFRTDNEPFKAE 150
F R DLVKF++ + GLY LR PY+CAEW FGG P WL IP I+ R ++ F AE
Sbjct: 67 FSKRADLVKFLQTAKDLGLYVILRPTPYICAEWEFGGLPAWLLNIPNIRLRQNDPLFIAE 126
Query: 151 MQRFTAKIVDMMKQEKLYASQGGPIILSQIENEYGNIDSAYGAAGKSYIKWAAGMALSLD 210
+ R+ +++ + ++ +QGG I++ QIENEYG+ + K+Y++ + L
Sbjct: 127 IDRYFQELLPRIAPYQI--TQGGNILMMQIENEYGSFGN-----DKNYLRAIRALMLIHG 179
Query: 211 TGVP-------WVMCQQSDA--PDPIINTCN------------GFYCDQFTPNSNNKPKM 249
VP W ++ A D I+ T N Y D+ + + P M
Sbjct: 180 VNVPLFTSDGAWQNALEAGALIEDDILPTGNFGSRSNENLDELQRYIDK---HGKSYPLM 236
Query: 250 WTENWSGWFLSFGGAVPYRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNFD-------RT 302
E W GWF + V R +DLA +R N+YM+ GGTNF R
Sbjct: 237 CMEFWDGWFNRWKEPVIRRDAQDLANCTKELLERASI--NFYMFQGGTNFGFWNGCSARL 294
Query: 303 SGGPFISTSYDYDAPLDEYG 322
TSYDYDAP+ E+G
Sbjct: 295 DTDLPQVTSYDYDAPVHEWG 314
Score = 45.4 bits (106), Expect = 0.14, Method: Compositional matrix adjust.
Identities = 39/154 (25%), Positives = 61/154 (39%), Gaps = 19/154 (12%)
Query: 511 LHAFINGKLVGSGYGSSSNAKVTVDFPIALAPGKNTFDLLSLTVGLQNYGAFYEKTGAGI 570
+ F + + V + Y +V + P G+ DLL +G NYG +
Sbjct: 411 IQIFNDKQKVATQYQHEIGNEVMLQTP----NGEFQLDLLVENMGRVNYGG-------KL 459
Query: 571 TGPVQLKGSGNGTNIDLSSQQWTYQTGLKGEELNFPSGSSTQWDSKSTLPKLQPLVWYKT 630
P Q KG G G +DL + TG + ++F +D + Q +Y+
Sbjct: 460 LAPTQHKGIGAGAVLDLH-----FHTGWQQYAIDFDRLEEIDFDGEK---DSQTPSFYQF 511
Query: 631 TFDAPAGSEPVAIDFTGMGKGEAWVNGQSIGRYW 664
+ ID GKG VNG+++GRYW
Sbjct: 512 KLHLDQEPQDTFIDTRAFGKGVIVVNGENLGRYW 545
>gi|353441134|gb|AEQ94151.1| beta-galactosidase [Elaeis guineensis]
Length = 127
Score = 174 bits (441), Expect = 2e-40, Method: Compositional matrix adjust.
Identities = 83/128 (64%), Positives = 104/128 (81%), Gaps = 1/128 (0%)
Query: 721 IGGDPTKISFVTKQLGSSLCSHVTDSHPLPVDMWGSDSKIQRKPGPVLSLECPNPNQVIS 780
+GGDPTKISF T+Q GS LC+HV++SHP P+D W S + K GPV+ LECP NQVIS
Sbjct: 1 VGGDPTKISFATRQTGS-LCAHVSESHPSPIDDWISSQRKVGKLGPVVHLECPYANQVIS 59
Query: 781 SIKFASFGTPLGTCGSFSRGRCSSARSLSVVRQACVGSKSCSIGVSVNTFGDPCKGVMKS 840
SIKFASFGTP G+CGS+++G CSS +L+VV+QAC+G+KSCS+GVS FGDPC G+ KS
Sbjct: 60 SIKFASFGTPHGSCGSYNQGNCSSDSALAVVQQACIGAKSCSVGVSTKMFGDPCTGITKS 119
Query: 841 LAVEASCT 848
LAVEA+C+
Sbjct: 120 LAVEAACS 127
>gi|399022099|ref|ZP_10724178.1| beta-galactosidase [Chryseobacterium sp. CF314]
gi|398085466|gb|EJL76124.1| beta-galactosidase [Chryseobacterium sp. CF314]
Length = 618
Score = 174 bits (440), Expect = 2e-40, Method: Compositional matrix adjust.
Identities = 114/358 (31%), Positives = 170/358 (47%), Gaps = 39/358 (10%)
Query: 15 FVVLATTSFGANVTYDHRA--------VVIGGKRRVLISGSIHYPRSTPEMWPDLIQKSK 66
++ + + F N+ Y + ++ GK + SG +HYPR E W +Q K
Sbjct: 7 YLYIILSFFSINLLYSQKGNFEIKDGHFLLSGKPFTIYSGEMHYPRVPSEYWKHRLQMMK 66
Query: 67 DGGLDVIETYVFWNLHEPVRNQYNFEGRYDLVKFVKLVAEAGLYAHLRIGPYVCAEWNFG 126
GL+ + TYVFWN HE ++NF G DL KF+K EAGLY +R GPYVCAEW FG
Sbjct: 67 SMGLNTVTTYVFWNYHEEEPGKWNFSGEKDLKKFIKTAQEAGLYVIIRPGPYVCAEWEFG 126
Query: 127 GFPLWLHFIPGIQFRTDNEPFKAEMQRFTAKIVDMMKQEKLYASQGGPIILSQIENEYGN 186
G+P WL ++ RTDN+ F + + + ++ + L + GGP+I+ Q ENE+G+
Sbjct: 127 GYPWWLQKDKNLEIRTDNKAFLKQCENYINELAKQII--PLQINNGGPVIMVQAENEFGS 184
Query: 187 IDSAYG----AAGKSYIKWAAGMALSLDTGVPWVMCQ-----QSDAPDPIINTCNGF--- 234
+ K Y + VP+ + + + + T NG
Sbjct: 185 YVAQRKDISLEQHKKYSHKIKDFLVKSGITVPFFTSDGSWLFKEGSIEGALPTANGEGDV 244
Query: 235 -----YCDQFTPNSNNKPKMWTENWSGWFLSFGGAVPYRPVEDLAFAVARFFQRGGTFQN 289
++F N+ P M E + GW + ED+ + + G +F N
Sbjct: 245 DNLRKKINEF--NNGKGPYMVAEYYPGWLDHWAEPFVKVSTEDVVKQTELYIKNGISF-N 301
Query: 290 YYMYHGGTNFDRTSGGPFIS--------TSYDYDAPLDEYGLIRQPKWGHLKDLHKAI 339
YYM HGGTNF TSG + TSYDYDAP++E G + PK+ L+D+ + I
Sbjct: 302 YYMIHGGTNFGFTSGANYDKNHDIQPDLTSYDYDAPINEAGWV-TPKFNALRDIFQKI 358
Score = 47.0 bits (110), Expect = 0.050, Method: Compositional matrix adjust.
Identities = 58/227 (25%), Positives = 84/227 (37%), Gaps = 55/227 (24%)
Query: 498 SKTVLHVQSLGHALHAFINGKLVGSGYGSSSNAKVTVDFPIALAPGKNTFDLLSLTVGLQ 557
K L ++ L + ++N + G + N K +D I + ++L +G
Sbjct: 424 QKGKLEIKGLRDYANVYVNERWQGEL--NRVNKKYDLDIEIKAG---DRLEILVENMGRI 478
Query: 558 NYGAFYEKTGAGITGPVQLKGS---GNGTNIDLSSQQWTYQTGLKGEELNFPSGSSTQWD 614
NYGA GI PV + GS GN L Q FP Q D
Sbjct: 479 NYGAEIVHNLKGIISPVIINGSEISGNWEMFPLPFDQ-------------FPKHKYQQKD 525
Query: 615 SKSTLPKLQPLVWYKTTFDAPAGSEPVAIDFTGMGKGEAWVNGQSIGRYWPTYVSQNGGC 674
+ P V + F + +D GKG ++NG++IGRYW S+ G
Sbjct: 526 IANNSP-----VISEAEFKLDETGD-TFLDMRKFGKGIVFINGRNIGRYW----SKAG-- 573
Query: 675 TDSCNYRGAYSSNKCLKNCGKPSQSLYHVPRSWLKSSGNTLVLFEEI 721
P Q+LY VP WLK N + +FE+I
Sbjct: 574 ---------------------PQQTLY-VPGVWLKKGKNGIQIFEQI 598
>gi|257869131|ref|ZP_05648784.1| 35 glycosylhydrolase [Enterococcus gallinarum EG2]
gi|257803295|gb|EEV32117.1| 35 glycosylhydrolase [Enterococcus gallinarum EG2]
Length = 584
Score = 174 bits (440), Expect = 2e-40, Method: Compositional matrix adjust.
Identities = 113/302 (37%), Positives = 159/302 (52%), Gaps = 32/302 (10%)
Query: 43 LISGSIHYPRSTPEMWPDLIQKSKDGGLDVIETYVFWNLHEPVRNQYNFEGRYDLVKFVK 102
+ISGSIHY R P W D ++K + G + +ETYV WN+HEP +++F DL +F++
Sbjct: 19 IISGSIHYFRVVPAYWRDRLEKLRLMGCNTVETYVPWNMHEPQEGKFDFSDNLDLRRFIQ 78
Query: 103 LVAEAGLYAHLRIGPYVCAEWNFGGFPLWLHFIPGIQFRTDNEPFKAEMQRFTAKIVDMM 162
L E GLY LR PY+CAEW FGG P WL P ++ R D PF ++ R+ ++ +
Sbjct: 79 LAQEVGLYVILRPAPYICAEWEFGGLPYWLLKDPFMKIRFDYPPFMEKIARYFTQLFSQV 138
Query: 163 KQEKLYASQGGPIILSQIENEYGNIDSAYGAAGKSYIKWAA------GMALSLDTGV-PW 215
L +Q GPI++ Q+ENEYG +YG KSY++ +A G+ +SL T PW
Sbjct: 139 S--DLQITQEGPILMMQVENEYG----SYG-NDKSYLRKSAELMRHNGIDVSLFTSDGPW 191
Query: 216 V-MCQQSDAPD---PIINTCNGFYCDQFTP----NSNNKPKMWTENWSGWFLSFGGAVPY 267
+ M + D P IN C + F + +P M E W GWF ++G +
Sbjct: 192 LDMLENGSIKDIALPTIN-CGSDIQENFRKLQEFHGKKQPLMVMEFWIGWFDAWGDDKHH 250
Query: 268 -RPVEDLAFAVARFFQRGGTFQNYYMYHGGTNFDRTSGGPFIS------TSYDYDAPLDE 320
V D A + + G N YM+HGGTNF +G + TSYDYDA L E
Sbjct: 251 TTSVTDAANELRDCLEAGSV--NIYMFHGGTNFGFMNGANYYEKLSPDVTSYDYDALLSE 308
Query: 321 YG 322
+G
Sbjct: 309 WG 310
>gi|329927236|ref|ZP_08281534.1| beta-galactosidase [Paenibacillus sp. HGF5]
gi|328938636|gb|EGG35019.1| beta-galactosidase [Paenibacillus sp. HGF5]
Length = 587
Score = 174 bits (440), Expect = 3e-40, Method: Compositional matrix adjust.
Identities = 109/305 (35%), Positives = 153/305 (50%), Gaps = 16/305 (5%)
Query: 31 HRAVVIGGKRRVLISGSIHYPRSTPEMWPDLIQKSKDGGLDVIETYVFWNLHEPVRNQYN 90
++ ++G + ++SG+IHY R PE W D + K + GL+ +ETY+ WNLHEP Q+
Sbjct: 9 NQQFLLGDEPIQILSGAIHYFRVVPEYWEDRLMKLRSCGLNTVETYIPWNLHEPKEGQFV 68
Query: 91 FEGRYDLVKFVKLVAEAGLYAHLRIGPYVCAEWNFGGFPLWLHFIPGIQFRTDNEPFKAE 150
F+G DL +FV++ + GL+ LR PY+CAEW FGG P WL P IQ R + + +
Sbjct: 69 FDGIADLERFVRIAGDLGLHVILRPSPYICAEWEFGGLPSWLLQNPDIQLRCMDPVYLEK 128
Query: 151 MQRFTAKIVDMMKQEKLYASQGGPIILSQIENEYGNI--DSAYGAAGKS-YIKWAAGMAL 207
+ ++ +++ + L S+GGP+I QIENEYG+ D+AY K IK + L
Sbjct: 129 VDQYYDELIPRLV--PLLTSKGGPVIAMQIENEYGSYGNDTAYLEYLKDGLIKRGVDVLL 186
Query: 208 SLDTGVPWVMCQQSDAPDPIINTCNGFYC----DQFTPNSNNKPKMWTENWSGWFLSFGG 263
G M Q P + G D+ P M E W+GWF +
Sbjct: 187 FTSDGPTDGMLQGGAVPGVLATVNFGSRTKEAFDKLREYRPEDPLMCMEYWNGWFDHWLK 246
Query: 264 AVPYRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNFDRTSGGPFIS------TSYDYDAP 317
R ED A + N+YM+HGGTNF +G F TSYDYDAP
Sbjct: 247 PHHTRDAEDAAAVFKEMLDLNASV-NFYMFHGGTNFGFYNGANFHEKYEPTLTSYDYDAP 305
Query: 318 LDEYG 322
L E G
Sbjct: 306 LSECG 310
>gi|410865123|ref|YP_006979734.1| Beta-galactosidase [Propionibacterium acidipropionici ATCC 4875]
gi|410821764|gb|AFV88379.1| Beta-galactosidase [Propionibacterium acidipropionici ATCC 4875]
Length = 591
Score = 173 bits (439), Expect = 3e-40, Method: Compositional matrix adjust.
Identities = 115/307 (37%), Positives = 156/307 (50%), Gaps = 28/307 (9%)
Query: 35 VIGGKRRVLISGSIHYPRSTPEMWPDLIQKSKDGGLDVIETYVFWNLHEPVRNQYNFEGR 94
++ G+ ++SG+IHY R P+ W D I K++ GL+ IETYV WN HEPV Q+++EG
Sbjct: 12 LLDGRPHRILSGAIHYFRIHPDQWADRIHKARLMGLNTIETYVAWNAHEPVEGQWSWEGG 71
Query: 95 YDLVKFVKLVAEAGLYAHLRIGPYVCAEWNFGGFPLWLHFIPGIQFRTDNEPFKAEMQRF 154
DL F+K VA+ G++A +R PY+CAEW+ GG P WL R D F A +Q +
Sbjct: 72 LDLAAFLKAVADEGMHAIVRPAPYICAEWDNGGLPAWLFGEKAAGVRRDEPVFMAAVQAY 131
Query: 155 TAKIVDMMKQEKLYASQGGPIILSQIENEYGNIDSAYGAAGKSYIKWAAGMALSLDTGVP 214
++ +++ E L GGP+IL QIENEYG AYG + Y++ + S VP
Sbjct: 132 LRRVYEVI--EPLQIHHGGPVILVQIENEYG----AYG-SDPEYLRKLVDITSSAGITVP 184
Query: 215 WVMCQQSDAPDPIINTCNGFY-CDQFTPNSNNK-----------PKMWTENWSGWFLSFG 262
Q + + G F S + P M E W+GWF +G
Sbjct: 185 LTTVDQPEDGMLAAGSLPGLLRTGSFGSRSPERLATLRRHQPTGPLMCMEYWNGWFDDWG 244
Query: 263 GAVPYRPVEDLAFAVARFFQRG-GTFQNYYMYHGGTNFDRTSG----GPF--ISTSYDYD 315
P+ + A A G G N YM GGTNF T+G G + I TSYDYD
Sbjct: 245 --TPHHTTDAEASAADLDALLGSGASVNLYMLCGGTNFGLTNGANDKGTYEPIVTSYDYD 302
Query: 316 APLDEYG 322
APLDE G
Sbjct: 303 APLDEAG 309
>gi|261407762|ref|YP_003244003.1| beta-galactosidase [Paenibacillus sp. Y412MC10]
gi|261284225|gb|ACX66196.1| Beta-galactosidase [Paenibacillus sp. Y412MC10]
Length = 587
Score = 173 bits (439), Expect = 3e-40, Method: Compositional matrix adjust.
Identities = 109/305 (35%), Positives = 153/305 (50%), Gaps = 16/305 (5%)
Query: 31 HRAVVIGGKRRVLISGSIHYPRSTPEMWPDLIQKSKDGGLDVIETYVFWNLHEPVRNQYN 90
++ ++G + ++SG+IHY R PE W D + K + GL+ +ETY+ WNLHEP Q+
Sbjct: 9 NQQFLLGDEPIQILSGAIHYFRVVPEYWEDRLMKLRSCGLNTVETYIPWNLHEPKEGQFV 68
Query: 91 FEGRYDLVKFVKLVAEAGLYAHLRIGPYVCAEWNFGGFPLWLHFIPGIQFRTDNEPFKAE 150
F+G DL +FV++ + GL+ LR PY+CAEW FGG P WL P IQ R + + +
Sbjct: 69 FDGIADLERFVRIAGDLGLHVILRPSPYICAEWEFGGLPSWLLQNPDIQLRCMDPVYLEK 128
Query: 151 MQRFTAKIVDMMKQEKLYASQGGPIILSQIENEYGNI--DSAYGAAGKS-YIKWAAGMAL 207
+ ++ +++ + L S+GGP+I QIENEYG+ D+AY K IK + L
Sbjct: 129 VDQYYDELIPRLV--PLLTSKGGPVIAMQIENEYGSYGNDTAYLEYLKDGLIKRGVDVLL 186
Query: 208 SLDTGVPWVMCQQSDAPDPIINTCNGFYC----DQFTPNSNNKPKMWTENWSGWFLSFGG 263
G M Q P + G D+ P M E W+GWF +
Sbjct: 187 FTSDGPTDGMLQGGAVPGVLATVNFGSRTKEAFDKLREYRPEDPLMCMEYWNGWFDHWLK 246
Query: 264 AVPYRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNFDRTSGGPFIS------TSYDYDAP 317
R ED A + N+YM+HGGTNF +G F TSYDYDAP
Sbjct: 247 PHHTRDAEDAAAVFKEMLDLNASV-NFYMFHGGTNFGFYNGANFHEKYEPTLTSYDYDAP 305
Query: 318 LDEYG 322
L E G
Sbjct: 306 LSECG 310
>gi|348508362|ref|XP_003441723.1| PREDICTED: beta-galactosidase-1-like protein 2-like [Oreochromis
niloticus]
Length = 605
Score = 173 bits (438), Expect = 4e-40, Method: Compositional matrix adjust.
Identities = 121/343 (35%), Positives = 166/343 (48%), Gaps = 17/343 (4%)
Query: 30 DHRAVVIGGKRRVLISGSIHYPRSTPEMWPDLIQKSKDGGLDVIETYVFWNLHEPVRNQY 89
D + GK ++ GS+HY R W D + K K GL+ + TYV WNLHEP R +
Sbjct: 10 DSSQFTLEGKPFRILGGSVHYFRVPRAYWEDRLLKMKACGLNTLTTYVPWNLHEPERGTF 69
Query: 90 NFEGRYDLVKFVKLVAEAGLYAHLRIGPYVCAEWNFGGFPLWLHFIPGIQFRTDNEPFKA 149
NF+ + DL +V L A+ GL+ LR GPY+CAEW+ GG P WL +Q RT F
Sbjct: 70 NFQDQLDLKAYVSLAAQLGLWVILRPGPYICAEWDLGGLPSWLLQDEEMQLRTTYPGFVN 129
Query: 150 EMQRFTAKIVDMMKQEKLYASQGGPIILSQIENEYGNI--DSAYGAAGKSYIKWAAGMAL 207
+ + K++ ++K L GGPII Q+ENEYG+ D Y K+ ++ + G+
Sbjct: 130 AVNLYFDKLISVIK--PLMFEGGGPIIAVQVENEYGSFAKDDKYMPFIKNCLQ-SRGIKE 186
Query: 208 SLDTGVPW--VMCQQSDAPDPIINTCNGFY--CDQFTPNSNNKPKMWTENWSGWFLSFGG 263
L T W + C + +N + KP M E WSGWF +G
Sbjct: 187 LLMTSDNWEGLRCGGVEGALKTVNLQRLSFGAIQHLADIQPQKPLMVMEYWSGWFDVWGE 246
Query: 264 AVPYRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNFDRTSG----GPFIS--TSYDYDAP 317
ED+ V+ RG + N YM+HGGT F +G G + S TSYDYDAP
Sbjct: 247 HHHVFYAEDMLAVVSEILDRGVSI-NLYMFHGGTTFGFMNGAMDFGTYKSQVTSYDYDAP 305
Query: 318 LDEYGLIRQPKWGHLKDLHKAIKLCEAALVATDPTYPSLGPNL 360
L E G PK+ HL++L V + P + GP L
Sbjct: 306 LSEAGDC-TPKYHHLRNLFSQYHSEHLPGVPSSPERKAYGPAL 347
>gi|429739263|ref|ZP_19273023.1| glycosyl hydrolase family 35 [Prevotella saccharolytica F0055]
gi|429157228|gb|EKX99829.1| glycosyl hydrolase family 35 [Prevotella saccharolytica F0055]
Length = 786
Score = 173 bits (438), Expect = 4e-40, Method: Compositional matrix adjust.
Identities = 119/360 (33%), Positives = 179/360 (49%), Gaps = 34/360 (9%)
Query: 1 MASKEILLLVLCWGFVVLAT-TSFGAN-----VTYDHRAVVIGGKRRVLISGSIHYPRST 54
M +K L +L F +L TSFGA + ++ GK V+ + +HYPR
Sbjct: 1 MTTKRNFLAIL---FALLTVFTSFGAPKRGGIFVAGDKTFLLNGKPFVIKAAELHYPRIP 57
Query: 55 PEMWPDLIQKSKDGGLDVIETYVFWNLHEPVRNQYNFEGRYDLVKFVKLVAEAGLYAHLR 114
W I+ K G++ I YVFWN+HE ++NF G D+ F +L + GLY +R
Sbjct: 58 RPYWEHRIRMCKALGMNTICLYVFWNIHEQQEGKFNFTGNNDVAAFCRLAQKHGLYVIVR 117
Query: 115 IGPYVCAEWNFGGFPLWLHFIPGIQFRTDNEPFKAEMQRFTAKIVDMMKQEKLYASQGGP 174
GPYVCAEW GG P WL I+ R + F ++ F ++ + + L +GGP
Sbjct: 118 PGPYVCAEWEMGGLPWWLLKKKDIRLRERDPYFMERVKVFEQQVGNQLA--PLTIDKGGP 175
Query: 175 IILSQIENEYGN--IDSAYGAAGKSYIKWAAGMALSLDTGVPWVMCQQSDAPDPIINTCN 232
II+ Q+ENEYG+ +D Y + + ++ ++G W + + D +I T N
Sbjct: 176 IIMVQVENEYGSYGVDKEYVSQIRDIVR-SSGFDKVALFQCDWASNFEKNGLDDLIWTMN 234
Query: 233 ---GFYCD-------QFTPNSNNKPKMWTENWSGWFLSFGGAVPYRPVEDLAFAVARFFQ 282
G D + P S PKM +E WSGWF +G RP +++ +
Sbjct: 235 FGTGANIDEQFKRLGELRPQS---PKMCSEFWSGWFDKWGARHETRPAKNMVAGIDEMLT 291
Query: 283 RGGTFQNYYMYHGGTNFDRTSGG--PFIS---TSYDYDAPLDEYGLIRQPKWGHLKDLHK 337
+G +F + YM HGGT+F +G P + TSYDYDAP++EYGL PK+ L+ + +
Sbjct: 292 KGISF-SLYMTHGGTSFGHWAGANSPGFAPDVTSYDYDAPINEYGLA-TPKYYELRAMMQ 349
>gi|373460889|ref|ZP_09552639.1| hypothetical protein HMPREF9944_00903 [Prevotella maculosa OT 289]
gi|371954714|gb|EHO72523.1| hypothetical protein HMPREF9944_00903 [Prevotella maculosa OT 289]
Length = 780
Score = 173 bits (438), Expect = 5e-40, Method: Compositional matrix adjust.
Identities = 139/478 (29%), Positives = 220/478 (46%), Gaps = 57/478 (11%)
Query: 6 ILLLVLCWGFVVLATTSFGANVTYDHRAVVIGGKRRVLISGSIHYPRSTPEMWPDLIQKS 65
+LLL L ++ G + T ++ G+ V+ + +HYPR W I+
Sbjct: 13 VLLLSLA------VPSARGGDFTVGKNTFLLNGRPFVIKAAELHYPRIPRPYWEQRIKMC 66
Query: 66 KDGGLDVIETYVFWNLHEPVRNQYNFEGRYDLVKFVKLVAEAGLYAHLRIGPYVCAEWNF 125
K G++ + YVFWN+HE Q++F G D+ F +L + G+Y +R GPYVCAEW
Sbjct: 67 KALGMNTLCLYVFWNIHEQREGQFDFTGNNDVAAFCRLAHKNGMYVIVRPGPYVCAEWEM 126
Query: 126 GGFPLWLHFIPGIQFRTDNEPFKAEMQRFTAKIVDMMKQEKLYASQGGPIILSQIENEYG 185
GG P WL ++ R D+ F A ++ F A++ + L GGPII+ Q+ENEYG
Sbjct: 127 GGLPWWLLKKKDVRLREDDPYFMARVKAFEAEVGRQLA--PLTIQNGGPIIMVQVENEYG 184
Query: 186 N--IDSAYGAAGKSYIKWAAGMALSLDTGVPWVMCQQSDAPDPIINTCN---GFYCD-QF 239
+ I+ Y + + +K A+G W + + D ++ T N G D QF
Sbjct: 185 SYGINKKYVSEIRDIVK-ASGFDKVTLFQCDWASNFEHNGLDDLVWTMNFGTGANIDEQF 243
Query: 240 TPNSNNKPK---MWTENWSGWFLSFGGAVPYRPVEDLAFAVARFFQRGGTFQNYYMYHGG 296
+P+ M +E WSGWF +G RP +D+ + ++G +F + YM HGG
Sbjct: 244 RRLKQLRPEAPLMCSEFWSGWFDKWGARHETRPAKDMVEGIDEMLRKGISF-SLYMTHGG 302
Query: 297 TNFDRTSGG--PFIS---TSYDYDAPLDEYGLIRQPKW-----------GHLKDLHK--- 337
T+F +G P + TSYDYDAP++EYG+ PK+ G L D+ K
Sbjct: 303 TSFGHWAGANSPGFAPDVTSYDYDAPINEYGM-PTPKFFALRNTMAKYSGKLPDVPKPAA 361
Query: 338 ------AIKLCEAALVATDPTYPSLGPN---LEATVYKTGSGLCSAFLANIGTNSDVTVK 388
+ L E + + T P+L + E G + + L I T S +++
Sbjct: 362 PVITIPKLTLAEFSPLIYSLTIPTLSRDTKTFEEMDMGWGVMVYATILPEIATRSRLSLN 421
Query: 389 FNGNSYLLPAWSVSILPDCKNV-VFNTAKINSVTLVPSFSR-QSLQVAADSSDAIGSG 444
+G+ Y I D K + V + K ++P + Q L + ++ I G
Sbjct: 422 -DGHDY------AQIFIDDKPIGVIDRVKNEKTLMLPPVKKGQHLTILVEAMGRINFG 472
>gi|334330512|ref|XP_001374407.2| PREDICTED: beta-galactosidase-1-like protein 2 [Monodelphis
domestica]
Length = 673
Score = 173 bits (438), Expect = 5e-40, Method: Compositional matrix adjust.
Identities = 120/342 (35%), Positives = 172/342 (50%), Gaps = 35/342 (10%)
Query: 35 VIGGKRRVLISGSIHYPRSTPEMWPDLIQKSKDGGLDVIETYVFWNLHEPVRNQYNFEGR 94
++ G R + GSIHY R E W D + K K GL+ + TY+ WNLHEP R ++NF G
Sbjct: 91 LLEGSRFRIFGGSIHYFRVPREYWKDRLLKLKACGLNTLTTYIPWNLHEPERGKFNFSGN 150
Query: 95 YDLVKFVKLVAEAGLYAHLRIGPYVCAEWNFGGFPLWLHFIPGIQFRTDNEPFKAEMQRF 154
D+ FV++ A+ GL+ LR GPY+C+EW+ GG P WL ++ RT F + +
Sbjct: 151 LDVEAFVQMAADIGLWVILRPGPYICSEWDLGGLPSWLLQDSSMELRTTYVGFIKAVDLY 210
Query: 155 TAKIVDMMKQEKLYASQGGPIILSQIENEYGNIDSAYGAAGKSYIKWAAGMALSLDTGVP 214
+++ + L +QGGPII Q+ENEYG+ D YIK MAL L G+
Sbjct: 211 FNQLIP--RVVPLQYTQGGPIIAVQVENEYGSYDK--DPNYMPYIK----MAL-LKRGIV 261
Query: 215 WVMCQQSDAP-------DPIINTCNGFYCDQFTPN-----SNNKPKMWTENWSGWFLSFG 262
++ + + ++ T N D N +NKP M TE W+GWF ++G
Sbjct: 262 ELLMTSDNKDGLSGGYVEGVLATINLKNVDSIIFNYLQSFQDNKPTMVTEFWTGWFDTWG 321
Query: 263 GAVPYRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNFDRTSGGPFIS------TSYDYDA 316
G +D+ +V+ Q G + N YM+HGGTNF +G + TSYDYDA
Sbjct: 322 GPHHIVDADDVMVSVSSIIQMGASL-NLYMFHGGTNFGFMNGAQHFTDYQADVTSYDYDA 380
Query: 317 PLDEYGLIRQPKWGHLKDLHKAIKLCEAALVATDPTYPSLGP 358
L E G PK+ L++ L + L P P+L P
Sbjct: 381 ILTEAG-DYTPKFFKLREYFST--LIDNPL----PQLPALKP 415
>gi|402813167|ref|ZP_10862762.1| beta-galactosidase Bga [Paenibacillus alvei DSM 29]
gi|402509110|gb|EJW19630.1| beta-galactosidase Bga [Paenibacillus alvei DSM 29]
Length = 580
Score = 172 bits (437), Expect = 5e-40, Method: Compositional matrix adjust.
Identities = 118/365 (32%), Positives = 181/365 (49%), Gaps = 29/365 (7%)
Query: 27 VTYDHRAVVIGGKRRVLISGSIHYPRSTPEMWPDLIQKSKDGGLDVIETYVFWNLHEPVR 86
++Y+ + ++ GK LISG++HY R PE W D ++K K G + +ETY+ WN+HEP
Sbjct: 4 LSYEDQHFMLEGKPIQLISGAVHYFRIVPEYWEDRLRKVKAMGCNCVETYIAWNVHEPRD 63
Query: 87 NQYNFEGRYDLVKFVKLVAEAGLYAHLRIGPYVCAEWNFGGFPLWLHFIPGIQFRTDNEP 146
Q+NF+G D+V+F+++ L +R PY+CAEW FGG P WL I+ R +
Sbjct: 64 GQFNFDGIADVVEFIRIAQRVDLLVIVRPSPYICAEWEFGGMPAWL-LKEDIRLRCSDPR 122
Query: 147 FKAEMQRFTAKIVDMMKQEKLYASQGGPIILSQIENEYGNI--DSAYGAAGKSYIKWAAG 204
F ++ + ++ +K L ++ GGPII QIENEYG+ D AY A ++ +
Sbjct: 123 FLEKVSAYYDALIPQLK--PLLSTSGGPIIAVQIENEYGSYGNDQAYLQALRNMLVERGI 180
Query: 205 MALSLDTGVPWVMCQQSDAPDPIINTCN-------GF-YCDQFTPNSNNKPKMWTENWSG 256
L + P Q + ++ T N F +++ PN+ P M E W+G
Sbjct: 181 DVLLFTSDGPADDMLQGGMTEGVLATVNFGSRPKEAFGKLEEYQPNA---PLMCMEYWNG 237
Query: 257 WFLSFGGAVPYRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNFDRTSGG-------PFIS 309
WF + R ED A + G + N+YM HGGTNF +SG P +
Sbjct: 238 WFDHWFEEHHTRSAEDAAQVLDEMLSMGASV-NFYMLHGGTNFGFSSGANHGGRYKPTV- 295
Query: 310 TSYDYDAPLDEYGLIRQPKWGHLKD-LHKAIKLCEAALVATDP--TYPSLGPNLEATVYK 366
TSYDYD+ + E G I PK+ + + K + L E + P Y + N ++
Sbjct: 296 TSYDYDSAISEAGDI-TPKYQLFRKVIGKYVSLSEDDMPQNTPKAAYGEVKVNRSVKLFD 354
Query: 367 TGSGL 371
T S +
Sbjct: 355 TLSSM 359
>gi|350588684|ref|XP_003130139.3| PREDICTED: galactosidase, beta 1-like 3 [Sus scrofa]
Length = 656
Score = 172 bits (437), Expect = 5e-40, Method: Compositional matrix adjust.
Identities = 139/452 (30%), Positives = 207/452 (45%), Gaps = 38/452 (8%)
Query: 36 IGGKRRVLISGSIHYPRSTPEMWPDLIQKSKDGGLDVIETYVFWNLHEPVRNQYNFEGRY 95
+ G +++ GSIHY R E W D + K K G + + TYV WNLHEP R +++F G
Sbjct: 84 LEGHEFLILGGSIHYFRVPRESWRDRLLKLKACGFNTVTTYVPWNLHEPERGKFDFSGNL 143
Query: 96 DLVKFVKLVAEAGLYAHLRIGPYVCAEWNFGGFPLWLHFIPGIQFRTDNEPFKAEMQRFT 155
D+ F+ L AE GL+ LR GPY+C+E + GG P L P Q RT N F + +
Sbjct: 144 DMEAFILLAAEVGLWVILRPGPYICSEIDLGGLPSRLLQDPTSQLRTTNHSFIEAVDEYL 203
Query: 156 AKIVDMMKQEKLYASQGGPIILSQIENEYGNI--DSAYGA-AGKSYIKWAAGMALSLDTG 212
++ + L +GGPII Q+ENEYG+ D AY K+ +K L
Sbjct: 204 DHLI--ARVVPLQYRKGGPIIAVQVENEYGSFHKDEAYMPYLHKALLKRGIVELLLTSDN 261
Query: 213 VPWVMCQQSDAPDPIINTCN---GFYCDQFTPNSNNKPKMWTENWSGWFLSFGGAVPYRP 269
V+ +N + G + D + SN KP + E W GWF ++G R
Sbjct: 262 TNEVLKGHIKGVLATVNMKSFKEGEFKDLYQVQSN-KPILIMEFWVGWFDTWGNKHAVRD 320
Query: 270 VEDLAFAVARFFQRGGTFQNYYMYHGGTNFDRTSGGPF------ISTSYDYDAPLDEYGL 323
D+ + F + +F N YM+HGGTNF +G + + TSYDYDA L E G
Sbjct: 321 AIDVENTIFDFIRLEISF-NVYMFHGGTNFGFMNGATYFEQHRGVVTSYDYDAVLTEAG- 378
Query: 324 IRQPKWGHLKDLHKAIKLCE-AALVATDP--TYPSLGPNLEATVYKTGSGLCSAFLANIG 380
PK+ L++L K+I + AL P YP + P+L ++ L S L+N+
Sbjct: 379 DYTPKFFKLRELFKSIFVTPLPALPEPTPKAVYPLVRPSLYLPLWDALQYLNSPVLSNVP 438
Query: 381 TNSDVTVKFNGN--SYLLPAWSVSI---------LPDCKNVVFNTAKINSV------TLV 423
N + NGN SY L + +I D V N + ++ +
Sbjct: 439 LNMENLPINNGNGQSYGLVLYQTTICSGGQLHANAQDMAQVFLNETNLGTLANGRQDVYI 498
Query: 424 PSFSR-QSLQVAADSSDAIGSGWSYINEPVGI 454
P + Q L++ ++ + W N+ G+
Sbjct: 499 PRITECQLLRILVENQGRVNFSWKIQNQQKGL 530
>gi|269794634|ref|YP_003314089.1| beta-galactosidase [Sanguibacter keddieii DSM 10542]
gi|269096819|gb|ACZ21255.1| beta-galactosidase [Sanguibacter keddieii DSM 10542]
Length = 586
Score = 172 bits (436), Expect = 8e-40, Method: Compositional matrix adjust.
Identities = 106/308 (34%), Positives = 153/308 (49%), Gaps = 30/308 (9%)
Query: 35 VIGGKRRVLISGSIHYPRSTPEMWPDLIQKSKDGGLDVIETYVFWNLHEPVRNQYNFEGR 94
++ GK ++SG++HY R P++W D I K++ GL+ IETYV WN H P R ++ +G
Sbjct: 9 LLDGKPFRILSGALHYFRVHPDLWADRIHKARLMGLNTIETYVPWNAHAPQRGEFRTDGA 68
Query: 95 YDLVKFVKLVAEAGLYAHLRIGPYVCAEWNFGGFPLWLHFIPGIQFRTDNEPFKAEMQRF 154
DL +F++LV G+ A +R GPY+CAEW+ GG P WL P + R D + + +
Sbjct: 69 LDLERFLRLVEAEGMLAIVRPGPYICAEWDNGGLPGWLFRDPAVGVRRDEPLYMEAVSEY 128
Query: 155 TAKIVDMMKQEKLYASQGGPIILSQIENEYGNIDSAYGAAGKSYIKWAAGMALSLDTG-- 212
++D++ +GGP++L Q+ENE YGA G ++ MAL+ G
Sbjct: 129 LGTVLDLVA--PFQVDRGGPVVLVQVENE-------YGAYGSDHVYLEKLMALTRSHGIT 179
Query: 213 VPWVMCQQSDAPDPIINTCNGFY-CDQFTPNSNNK-----------PKMWTENWSGWFLS 260
VP Q + +G + F S + P M E W GWF
Sbjct: 180 VPLTSIDQPSGTMLADGSIDGLHRTGSFGSRSAERLATLREHQPTGPLMCAEFWDGWFDH 239
Query: 261 FGGAVPYRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNFDRTSG----GPF--ISTSYDY 314
+G +D A + G + N YM+HGGTNF TSG G + +TSYDY
Sbjct: 240 WGAHHHTTSAQDAARELDELLAAGASV-NIYMFHGGTNFGFTSGANDKGVYQPTTTSYDY 298
Query: 315 DAPLDEYG 322
DAPL E G
Sbjct: 299 DAPLAEDG 306
>gi|300726558|ref|ZP_07060002.1| beta-galactosidase [Prevotella bryantii B14]
gi|299776172|gb|EFI72738.1| beta-galactosidase [Prevotella bryantii B14]
Length = 781
Score = 172 bits (435), Expect = 9e-40, Method: Compositional matrix adjust.
Identities = 110/321 (34%), Positives = 165/321 (51%), Gaps = 23/321 (7%)
Query: 20 TTSFGANV-TYD--HRAVVIGGKRRVLISGSIHYPRSTPEMWPDLIQKSKDGGLDVIETY 76
T S+GA+ ++D H+ ++ GK + + +HYPR W I+ K G++ I Y
Sbjct: 21 TISYGADKGSFDIGHKTFLLNGKPFTVKAAELHYPRIPRPYWEHRIKMCKALGMNAICIY 80
Query: 77 VFWNLHEPVRNQYNFEGRYDLVKFVKLVAEAGLYAHLRIGPYVCAEWNFGGFPLWLHFIP 136
VFWN+HE ++NF G D+ +F +L + G+Y +R GPYVCAEW GG P WL
Sbjct: 81 VFWNIHEQKEGEFNFTGNNDVAEFCRLAQKNGMYVIVRPGPYVCAEWEMGGLPWWLLKKK 140
Query: 137 GIQFRTDNEPFKAEMQRFTAKIVDMMKQEKLYASQGGPIILSQIENEYGN--IDSAYGAA 194
I+ R + F ++ F K+ + + L +GGPII+ Q+ENEYG+ ID Y
Sbjct: 141 DIKLRERDPYFMERVKIFEDKVAEQLA--PLTIQRGGPIIMVQVENEYGSYGIDKQYVGE 198
Query: 195 GKSYIKWAAGMALSLDTGVPWVMCQQSDAPDPIINTCN---GFYCD-QFTPNSNNKPK-- 248
+ ++ G + + W + D +I T N G D QF + +P
Sbjct: 199 IRDMLRQGWGNDVKM-FQCDWSSNFTHNGLDDLIWTMNFGTGANIDNQFKKLKSLRPDAP 257
Query: 249 -MWTENWSGWFLSFGGAVPYRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNFDRTSGG-- 305
M +E WSGWF +G RP +D+ + +G +F + YM HGGT+F +G
Sbjct: 258 LMCSEFWSGWFDKWGARHETRPAQDMVNNIDEMLSKGISF-SLYMTHGGTSFGHWAGANS 316
Query: 306 ----PFISTSYDYDAPLDEYG 322
P + TSYDYDAP++EYG
Sbjct: 317 PGFQPDV-TSYDYDAPINEYG 336
>gi|153807689|ref|ZP_01960357.1| hypothetical protein BACCAC_01971 [Bacteroides caccae ATCC 43185]
gi|149130051|gb|EDM21263.1| glycosyl hydrolase family 35 [Bacteroides caccae ATCC 43185]
Length = 775
Score = 172 bits (435), Expect = 1e-39, Method: Compositional matrix adjust.
Identities = 137/446 (30%), Positives = 209/446 (46%), Gaps = 45/446 (10%)
Query: 3 SKEILLLVLCWGFVVLATTSFGANVTYDHRAVVIGGKRRVLISGSIHYPRSTPEMWPDLI 62
S ++L L F + A +S V ++ I GK LI G +HYPR E W D +
Sbjct: 6 SNVFIMLNLIVSFFISACSSPREQVKIENGTFNINGKDVQLICGEMHYPRIPHEYWRDRL 65
Query: 63 QKSKDGGLDVIETYVFWNLHEPVRNQYNFEGRYDLVKFVKLVAEAGLYAHLRIGPYVCAE 122
+++ GL+ + YVFWN HE ++F G+ D+ +FV++ E GLY LR GPYVCAE
Sbjct: 66 HRARAMGLNTVSAYVFWNFHERQPGVFDFSGQADIAEFVRIAQEEGLYVILRPGPYVCAE 125
Query: 123 WNFGGFPLWLHFIPGIQFRTDNEPFKAEMQRFTAKIVDMMKQEKLYASQGGPIILSQIEN 182
W+FGG+P WL + +R+ + F + +R+ ++ + L + GG II+ Q+EN
Sbjct: 126 WDFGGYPSWLLKEKDLTYRSKDPRFMSYCERYIKELGKQLA--PLTINNGGNIIMVQVEN 183
Query: 183 EYGNIDSAYGAAGKSYIKWAAGMALSLDTGVPWVMCQQSDAPDP-----IINTCNGFY-- 235
EYG+ AA K Y+ M VP C + + T NG +
Sbjct: 184 EYGSY-----AADKEYLAAIRDMLQEAGFNVPLFTCDGGGQVEAGHIAGALPTLNGVFGE 238
Query: 236 -----CDQFTPNSNNKPKMWTENWSGWFLSFG---GAVPY-RPVEDLAFAVARFFQRGGT 286
D++ P P E + WF +G +V Y RP E L + + G
Sbjct: 239 DIFKIVDKYHPGG---PYFVAEFYPAWFDEWGKRHSSVAYERPAEQLDWMLGH-----GV 290
Query: 287 FQNYYMYHGGTNF-----DRTSGGPFIS-TSYDYDAPLDEYGLIRQPKWGHLKDLHKAIK 340
+ YM+HGGTNF TSGG TSYDYDAPL E+G PK+ +++ +
Sbjct: 291 SVSMYMFHGGTNFWYMNGANTSGGFRPQPTSYDYDAPLGEWGNCY-PKYHAFREIIQKY- 348
Query: 341 LCEAALVATDPTYPSLGPNLE-ATV-YKTGSGLCSAFLANIGTNSDVTVKFNGNSYLLPA 398
L E + P P+ P ATV K + L +AF I + ++++ G +
Sbjct: 349 LPEGTQL---PEVPADNPTTTFATVELKESAPLTTAFHQTIQSEDVLSMEDVGADFGYIH 405
Query: 399 WSVSI-LPDCKNVVFNTAKINSVTLV 423
+ +I P + ++ + +V LV
Sbjct: 406 YQTTIKTPGKQKLIIQDLRDYAVILV 431
>gi|332838248|ref|XP_001156615.2| PREDICTED: galactosidase, beta 1-like 3 [Pan troglodytes]
Length = 653
Score = 171 bits (434), Expect = 1e-39, Method: Compositional matrix adjust.
Identities = 135/452 (29%), Positives = 204/452 (45%), Gaps = 36/452 (7%)
Query: 36 IGGKRRVLISGSIHYPRSTPEMWPDLIQKSKDGGLDVIETYVFWNLHEPVRNQYNFEGRY 95
+ G + ++ GSIHY R E W D + K K G + + TYV WNLHEP R +++F G
Sbjct: 82 LEGHKFLIFGGSIHYFRVPREYWRDRLLKLKACGFNTVTTYVPWNLHEPERGKFDFSGNL 141
Query: 96 DLVKFVKLVAEAGLYAHLRIGPYVCAEWNFGGFPLWLHFIPGIQFRTDNEPFKAEMQRFT 155
DL FV + AE GL+ LR GPY+C+E + GG P WL P + RT N+ F ++++
Sbjct: 142 DLEAFVLMAAEIGLWVILRPGPYICSEMDLGGLPSWLLQDPRLLLRTTNKSFIEAVEKYF 201
Query: 156 AKIVDMMKQEKLYASQGGPIILSQIENEYG--NIDSAYGA-AGKSYIKWAAGMALSLDTG 212
++ + L QGGP+I Q+ENEYG N D Y K+ ++ L G
Sbjct: 202 DHLIP--RVIPLQYRQGGPVIAVQVENEYGSFNKDKTYMPYLHKALLRRGIVELLLTSDG 259
Query: 213 VPWVMCQQSDAPDPIIN--TCNGFYCDQFTPNSNNKPKMWTENWSGWFLSFGGAVPYRPV 270
V+ + IN + +Q +KP + E W GWF +G +
Sbjct: 260 EKHVLSGHTKGVLAAINLQKLHQDTFNQLHKVQRDKPLLIMEYWVGWFDRWGDKHHVKDA 319
Query: 271 EDLAFAVARFFQRGGTFQNYYMYHGGTNFDRTSGGPF------ISTSYDYDAPLDEYGLI 324
+++ AV+ F + +F N YM+HGGTNF +G + I TSYDYDA L E G
Sbjct: 320 KEVEHAVSEFIKYEISF-NVYMFHGGTNFGFMNGATYFGKHSGIVTSYDYDAVLTEAGDY 378
Query: 325 RQPKWGHLKDLHKAIKLCEAALVATDP---TYPSLGPNLEATVYKTGSGLCSAFLANIGT 381
+ K+ L+ L +++ V P YP + P+L ++ S L +
Sbjct: 379 TE-KYLKLQKLFQSVSATPLPRVPKLPPKAVYPPVRPSLYLPLWDALSYLNEPVRSRQPV 437
Query: 382 NSDVTVKFNGN--SYLLPAWSVSILP---------DCKNVVFNTAKI------NSVTLVP 424
N + NG+ SY L + SI D V + I N +P
Sbjct: 438 NMENLPINNGSGQSYGLVLYEKSICSGGRLRAHAHDMAQVFLDETMIGILNENNKDLHIP 497
Query: 425 SFSR-QSLQVAADSSDAIGSGWSYINEPVGIS 455
+ L++ ++ + W NE GI+
Sbjct: 498 ELRDCRYLRILVENQGRVNFSWQIQNEQKGIT 529
>gi|357050010|ref|ZP_09111224.1| hypothetical protein HMPREF9478_01207 [Enterococcus saccharolyticus
30_1]
gi|355382493|gb|EHG29591.1| hypothetical protein HMPREF9478_01207 [Enterococcus saccharolyticus
30_1]
Length = 584
Score = 171 bits (434), Expect = 1e-39, Method: Compositional matrix adjust.
Identities = 112/302 (37%), Positives = 158/302 (52%), Gaps = 32/302 (10%)
Query: 43 LISGSIHYPRSTPEMWPDLIQKSKDGGLDVIETYVFWNLHEPVRNQYNFEGRYDLVKFVK 102
+ISGSIHY R P W D ++K + G + +ETYV WN+HEP +++F DL +F++
Sbjct: 19 IISGSIHYFRVVPAYWRDRLEKLRLMGCNTVETYVPWNMHEPQEGKFDFSDNLDLRRFIQ 78
Query: 103 LVAEAGLYAHLRIGPYVCAEWNFGGFPLWLHFIPGIQFRTDNEPFKAEMQRFTAKIVDMM 162
L E GLY LR PY+CAEW FGG P WL P ++ R D PF ++ R+ ++ +
Sbjct: 79 LAQEVGLYVILRPAPYICAEWEFGGLPYWLLKDPFMKIRFDYPPFMEKIARYFTQLFSQV 138
Query: 163 KQEKLYASQGGPIILSQIENEYGNIDSAYGAAGKSYIKWAA------GMALSLDTGV-PW 215
L +Q GPI++ Q+ENEYG +YG KSY++ +A G+ + L T PW
Sbjct: 139 S--DLQITQEGPILMMQVENEYG----SYG-NDKSYLRKSAELMRHNGIDVPLFTSDGPW 191
Query: 216 V-MCQQSDAPD---PIINTCNGFYCDQFTP----NSNNKPKMWTENWSGWFLSFGGAVPY 267
+ M + D P IN C + F + +P M E W GWF ++G +
Sbjct: 192 LDMLENGSIKDIALPTIN-CGSDIQENFRKLQEFHGKKQPLMVMEFWIGWFDAWGDDKHH 250
Query: 268 -RPVEDLAFAVARFFQRGGTFQNYYMYHGGTNFDRTSGGPFIS------TSYDYDAPLDE 320
V D A + + G N YM+HGGTNF +G + TSYDYDA L E
Sbjct: 251 TTSVTDAANELRDCLEAGSV--NIYMFHGGTNFGFMNGANYYEKLLPDVTSYDYDALLSE 308
Query: 321 YG 322
+G
Sbjct: 309 WG 310
>gi|255652865|ref|NP_001157373.1| beta-galactosidase [Bombyx mori]
gi|239938036|gb|ACS36117.1| beta-galactosidase [Bombyx mori]
Length = 606
Score = 171 bits (434), Expect = 1e-39, Method: Compositional matrix adjust.
Identities = 117/345 (33%), Positives = 166/345 (48%), Gaps = 20/345 (5%)
Query: 24 GANVTYDHRAVVIGGKRRVLISGSIHYPRSTPEMWPDLIQKSKDGGLDVIETYVFWNLHE 83
G N++ +I GK +ISGS+HY R W D + K K GL+ + TYV W+ HE
Sbjct: 3 GHNISIVGDKFMIDGKPLHIISGSLHYFRVPAVYWRDRLHKFKAAGLNTVATYVEWSYHE 62
Query: 84 PVRNQYNFEGRYDLVKFVKLVAEAGLYAHLRIGPYVCAEWNFGGFPLW-LHFIPGIQFRT 142
P QYNFEG DLV+FV+ AE GL+ LR+GPY+CAE + GG P W L P I+ RT
Sbjct: 63 PEEKQYNFEGDRDLVRFVQTAAEVGLHVLLRVGPYICAERDLGGLPYWLLGKYPNIKLRT 122
Query: 143 DNEPFKAEMQRFTAKIVDMMKQEKLYASQGGPIILSQIENEYGNIDS--AYGAAGKSYIK 200
++ F AE + K+ + + L GGPIIL Q+ENEYG+ DS AY + I
Sbjct: 123 TDKDFIAESDIWLKKLFEQVSH--LLFGNGGPIILVQVENEYGSYDSDLAYKEKMRDLIS 180
Query: 201 WAAGMALSLDT-------GVPWVMCQQSDAPDPIINTCNGFYCDQFTPNSNNKPKMWTEN 253
G L T G + + + + + F P M +E
Sbjct: 181 AHVGDKALLYTTDGPSLVGAGMIPGVHATIDFGVTSQPTEQFDSLFHLRPAPGPLMNSEF 240
Query: 254 WSGWFLSFGGAVPYRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNFDRTSGGPFIS---- 309
+ GW +G + D+ + R N+Y++ GG+NF+ TSG F
Sbjct: 241 YPGWLTHWGERMARVGTNDIVLTL-RNMIVNKIHVNFYVFFGGSNFEFTSGANFDGTYQP 299
Query: 310 --TSYDYDAPLDEYGLIRQPKWGHLKDLHKAIKLCEAALVATDPT 352
TSYDYDAPL E G PK+ +++ K + + + P+
Sbjct: 300 DITSYDYDAPLSEAG-DPTPKYYAIRETLKQLNFVDEKIEPPQPS 343
Score = 45.1 bits (105), Expect = 0.17, Method: Compositional matrix adjust.
Identities = 20/41 (48%), Positives = 27/41 (65%), Gaps = 2/41 (4%)
Query: 627 WYKTTFDAPAGSEPVA--IDFTGMGKGEAWVNGQSIGRYWP 665
+Y+ TF P G +P+ +D TG KG WVNG ++GRYWP
Sbjct: 508 FYEGTFVLPEGQKPLDTFLDTTGWDKGYVWVNGHNLGRYWP 548
>gi|397498227|ref|XP_003819886.1| PREDICTED: beta-galactosidase-1-like protein 3 [Pan paniscus]
Length = 653
Score = 171 bits (433), Expect = 1e-39, Method: Compositional matrix adjust.
Identities = 135/452 (29%), Positives = 204/452 (45%), Gaps = 36/452 (7%)
Query: 36 IGGKRRVLISGSIHYPRSTPEMWPDLIQKSKDGGLDVIETYVFWNLHEPVRNQYNFEGRY 95
+ G + ++ GSIHY R E W D + K K G + + TYV WNLHEP R +++F G
Sbjct: 82 LEGHKFLIFGGSIHYFRVPREYWRDRLLKLKACGFNTVTTYVPWNLHEPERGKFDFSGNL 141
Query: 96 DLVKFVKLVAEAGLYAHLRIGPYVCAEWNFGGFPLWLHFIPGIQFRTDNEPFKAEMQRFT 155
DL FV + AE GL+ LR GPY+C+E + GG P WL P + RT N+ F ++++
Sbjct: 142 DLEAFVLMAAEIGLWVILRPGPYICSEMDLGGLPSWLLQDPRLLLRTTNKSFIEAVEKYF 201
Query: 156 AKIVDMMKQEKLYASQGGPIILSQIENEYG--NIDSAYGA-AGKSYIKWAAGMALSLDTG 212
++ + L QGGP+I Q+ENEYG N D Y K+ ++ L G
Sbjct: 202 DHLIP--RVIPLQYRQGGPVIAVQVENEYGSFNKDKTYMPYLHKALLRRGIVELLLTSDG 259
Query: 213 VPWVMCQQSDAPDPIIN--TCNGFYCDQFTPNSNNKPKMWTENWSGWFLSFGGAVPYRPV 270
V+ + IN + +Q +KP + E W GWF +G +
Sbjct: 260 EKHVLSGHTKGVLAAINLQKLHQDTFNQLHKIQRDKPLLIMEYWVGWFDRWGDKHHVKDA 319
Query: 271 EDLAFAVARFFQRGGTFQNYYMYHGGTNFDRTSGGPF------ISTSYDYDAPLDEYGLI 324
+++ AV+ F + +F N YM+HGGTNF +G + I TSYDYDA L E G
Sbjct: 320 KEVEHAVSEFIKYEISF-NVYMFHGGTNFGFMNGATYFGKHSGIVTSYDYDAVLTEAGDY 378
Query: 325 RQPKWGHLKDLHKAIKLCEAALVATDP---TYPSLGPNLEATVYKTGSGLCSAFLANIGT 381
+ K+ L+ L +++ V P YP + P+L ++ S L +
Sbjct: 379 TE-KYLKLQKLFQSVSATPLPRVPKLPPKAVYPPVRPSLYLPLWDALSYLNEPVRSRQPV 437
Query: 382 NSDVTVKFNGN--SYLLPAWSVSILP---------DCKNVVFNTAKI------NSVTLVP 424
N + NG+ SY L + SI D V + I N +P
Sbjct: 438 NMENLPINNGSGQSYGLVLYEKSICSGGRLRAHAHDMAQVFLDETMIGILNENNKDLHIP 497
Query: 425 SFSR-QSLQVAADSSDAIGSGWSYINEPVGIS 455
+ L++ ++ + W NE GI+
Sbjct: 498 ELRDCRYLRILVENQGRVNFSWQIQNEQKGIT 529
>gi|327283884|ref|XP_003226670.1| PREDICTED: beta-galactosidase-1-like protein 2-like [Anolis
carolinensis]
Length = 584
Score = 171 bits (433), Expect = 1e-39, Method: Compositional matrix adjust.
Identities = 110/298 (36%), Positives = 150/298 (50%), Gaps = 27/298 (9%)
Query: 43 LISGSIHYPRSTPEMWPDLIQKSKDGGLDVIETYVFWNLHEPVRNQYNFEGRYDLVKFVK 102
++ GS+HY R E W D + K K GL+ + TYV WNLHE +R +++F G DL F+K
Sbjct: 29 ILGGSLHYFRIPREYWKDRLMKMKACGLNTVTTYVPWNLHEAIRGKFDFSGNLDLQVFIK 88
Query: 103 LVAEAGLYAHLRIGPYVCAEWNFGGFPLWLHFIPGIQFRTDNEPFKAEMQRFTAKIVDMM 162
+ E GL+ LR GPY+C+EW+ GG P WL P +Q RT F + + +++ +
Sbjct: 89 MAEEVGLWVILRPGPYICSEWDLGGLPSWLLQDPEMQLRTTYRGFTEAVDNYFDRLIPQV 148
Query: 163 KQEKLYASQGGPIILSQIENEYGNIDSAYGAAGKSYIKWAAGMALSLDTGVPWVMCQQ-- 220
L GGPII Q+ENEYG+ A + +YIK MAL+ V +M
Sbjct: 149 V--PLQYKYGGPIIAVQVENEYGSY--AQDPSYMTYIK----MALTSRKIVEMLMTSDNH 200
Query: 221 ----SDAPDPIINTCNGFYCDQF------TPNSNNKPKMWTENWSGWFLSFGGAVPYRPV 270
S D + T N D T N PKM E W+GWF S+GG
Sbjct: 201 DGLVSGTVDGALATINFQKLDTAIMVFLSTDQRNKMPKMVMEYWTGWFDSWGGLHHVFDA 260
Query: 271 EDLAFAVARFFQRGGTFQNYYMYHGGTNFDRTSGGPFIS------TSYDYDAPLDEYG 322
+D+ V + + G + N YM+HGGTNF +G + TSYDYDA L E G
Sbjct: 261 DDMVQTVGKVIKLGASI-NLYMFHGGTNFGFLNGAQHSNEYKSTITSYDYDAVLTESG 317
>gi|156382804|ref|XP_001632742.1| predicted protein [Nematostella vectensis]
gi|156219802|gb|EDO40679.1| predicted protein [Nematostella vectensis]
Length = 612
Score = 171 bits (433), Expect = 1e-39, Method: Compositional matrix adjust.
Identities = 124/351 (35%), Positives = 171/351 (48%), Gaps = 33/351 (9%)
Query: 7 LLLVLCWGF------VVLATTSFGANVTYDHRAVVIGGKRRVLISGSIHYPRSTPEMWPD 60
+L+VL F VV+ + AN R + GK ++SG++HY R P+ W D
Sbjct: 20 ILVVLWMAFGSSNKRVVVRSKGLVAN----GRHFTMDGKPFTILSGAMHYFRIPPQYWED 75
Query: 61 LIQKSKDGGLDVIETYVFWNLHEPVRNQYNFEGRYDLVKFVKLVAEAGLYAHLRIGPYVC 120
I K K GL+ +ETYV WNLHE ++ +NF+ D+V+F+K + LY +R GPY+C
Sbjct: 76 RIVKLKAMGLNTVETYVSWNLHEEIQGDFNFKDGLDIVEFIKTAQKHDLYVIMRPGPYIC 135
Query: 121 AEWNFGGFPLWLHFIPGIQFRTDNEPFKAEMQRFTAKIVDMMKQEKLYASQGGPIILSQI 180
AEW+ GG P WL P I R+ + F RF +++ + + S GGPII QI
Sbjct: 136 AEWDLGGLPSWLLHNPNIYLRSLDPIFMKATLRFFDELIPRLIDYQY--SNGGPIIAWQI 193
Query: 181 ENEYGNID--SAYGAAGKSYIKWAAGMALSLDTGVPWVMCQQSDAPDP-IINTCNGFYCD 237
ENEY + D SAY + + L + W M + P ++ T N F +
Sbjct: 194 ENEYLSYDNSSAYMRKLQQEMVIRGVKELLFTSDGIWQMQIEKKYSLPGVLKTVN-FQRN 252
Query: 238 Q------FTPNSNNKPKMWTENWSGWFLSFGGAVPYRPVEDLAFAVARFFQRGGTFQNYY 291
+ N P M TE WSGWF +G VE A + + NYY
Sbjct: 253 ETNILKGLRKLQPNMPLMVTEFWSGWFDHWGEDKHVLTVEKAAERTKNILKMESSI-NYY 311
Query: 292 MYHGGTNFDRTSGG--------PFISTSYDYDAPLDEYGLIRQPKWGHLKD 334
M HGGTNF +G P I TSYDYDAP+ E G I PK+ L++
Sbjct: 312 MLHGGTNFGFMNGANAENGKYKPTI-TSYDYDAPISESGDI-TPKYRELRE 360
>gi|423217397|ref|ZP_17203893.1| hypothetical protein HMPREF1061_00666 [Bacteroides caccae
CL03T12C61]
gi|392628556|gb|EIY22582.1| hypothetical protein HMPREF1061_00666 [Bacteroides caccae
CL03T12C61]
Length = 775
Score = 171 bits (433), Expect = 2e-39, Method: Compositional matrix adjust.
Identities = 137/446 (30%), Positives = 209/446 (46%), Gaps = 45/446 (10%)
Query: 3 SKEILLLVLCWGFVVLATTSFGANVTYDHRAVVIGGKRRVLISGSIHYPRSTPEMWPDLI 62
S ++L L F + A +S V ++ I GK LI G +HYPR E W D +
Sbjct: 6 SNVFIMLNLIVSFFISACSSPREQVKIENGTFNINGKDVQLICGEMHYPRIPHEYWRDRL 65
Query: 63 QKSKDGGLDVIETYVFWNLHEPVRNQYNFEGRYDLVKFVKLVAEAGLYAHLRIGPYVCAE 122
++ GL+ + YVFWN HE ++F G+ D+ +FV++ E GLY LR GPYVCAE
Sbjct: 66 HRAHAMGLNTVSAYVFWNFHERQPGVFDFSGQADIAEFVRIAQEEGLYVILRPGPYVCAE 125
Query: 123 WNFGGFPLWLHFIPGIQFRTDNEPFKAEMQRFTAKIVDMMKQEKLYASQGGPIILSQIEN 182
W+FGG+P WL + +R+ + F + +R+ ++ + L + GG II+ Q+EN
Sbjct: 126 WDFGGYPSWLLKEKDLTYRSKDPRFMSYCERYIKELGKQLA--PLTINNGGNIIMVQVEN 183
Query: 183 EYGNIDSAYGAAGKSYIKWAAGMALSLDTGVPWVMCQ-----QSDAPDPIINTCNGFY-- 235
EYG+ AA K Y+ M VP C ++ + T NG +
Sbjct: 184 EYGSY-----AADKEYLAAIRDMLQEAGFNVPLFTCDGGGQVEAGHIAGALPTLNGVFGE 238
Query: 236 -----CDQFTPNSNNKPKMWTENWSGWFLSFG---GAVPY-RPVEDLAFAVARFFQRGGT 286
D++ P P E + WF +G +V Y RP E L + + G
Sbjct: 239 DIFKIVDKYHPGG---PYFVAEFYPAWFDEWGKRHSSVAYERPAEQLDWMLGH-----GV 290
Query: 287 FQNYYMYHGGTNF-----DRTSGGPFIS-TSYDYDAPLDEYGLIRQPKWGHLKDLHKAIK 340
+ YM+HGGTNF TSGG TSYDYDAPL E+G PK+ +++ +
Sbjct: 291 SVSMYMFHGGTNFWYMNGANTSGGFRPQPTSYDYDAPLGEWGNCY-PKYHAFREIIQKY- 348
Query: 341 LCEAALVATDPTYPSLGPNLE-ATV-YKTGSGLCSAFLANIGTNSDVTVKFNGNSYLLPA 398
L E + P P+ P ATV K + L +AF I + ++++ G +
Sbjct: 349 LPEGTQL---PEVPADNPTTTFATVELKESAPLTTAFHQTIQSEDVLSMEDVGTDFGYIH 405
Query: 399 WSVSI-LPDCKNVVFNTAKINSVTLV 423
+ +I P + ++ + +V LV
Sbjct: 406 YQTTIKTPGKQKLIIQDLRDYAVILV 431
>gi|386725149|ref|YP_006191475.1| beta-galactosidase [Paenibacillus mucilaginosus K02]
gi|384092274|gb|AFH63710.1| beta-galactosidase [Paenibacillus mucilaginosus K02]
Length = 591
Score = 171 bits (433), Expect = 2e-39, Method: Compositional matrix adjust.
Identities = 105/299 (35%), Positives = 153/299 (51%), Gaps = 18/299 (6%)
Query: 38 GKRRVLISGSIHYPRSTPEMWPDLIQKSKDGGLDVIETYVFWNLHEPVRNQYNFEGRYDL 97
G+ L SG+IHY R PE W D ++K K G + +ETYV WNLHEP ++ FEG DL
Sbjct: 15 GEELRLYSGAIHYFRIVPEYWEDRLRKLKACGFNTVETYVPWNLHEPQEGRFVFEGMADL 74
Query: 98 VKFVKLVAEAGLYAHLRIGPYVCAEWNFGGFPLWLHFIPGIQFRTDNEPFKAEMQRFTAK 157
+F++L GL+ +R PY+CAEW FGG P WL PG++ R + + +++ + +
Sbjct: 75 ERFIRLAGRLGLHVIVRPSPYICAEWEFGGLPAWLLAEPGMKLRCADPLYLSKVDAYYDE 134
Query: 158 IVDMMKQEKLYASQGGPIILSQIENEYGNI--DSAYGAAGKS-YIKWAAGMALSLDTGVP 214
++ + L + GGP+IL Q+ENEYG+ D AY + ++ + L G
Sbjct: 135 LIPRLV--PLLCTSGGPVILVQVENEYGSYGSDKAYLEHLRDGLVRRGIDVPLFTSDGPT 192
Query: 215 WVMCQQSDAPDPIINTCN--GFYCDQFTPNSNNKPK---MWTENWSGWFLSFGGAVPYRP 269
M Q P ++ T N + F +P+ M E W+GWF + R
Sbjct: 193 DAMLQGGSLPG-VLATVNFGSRTAESFAKLREYQPQGPLMCMEYWNGWFDHWMEEHHQRD 251
Query: 270 VEDLAFAVARFFQRGGTFQNYYMYHGGTNFDRTSGGPFIS------TSYDYDAPLDEYG 322
D A + G + N+YM+HGGTNF +G I TSYDYD+PL E+G
Sbjct: 252 AADAARVFGEMLEAGASV-NFYMFHGGTNFGFYNGANHIKTYEPTITSYDYDSPLTEWG 309
>gi|254384398|ref|ZP_04999740.1| beta-galactosidase [Streptomyces sp. Mg1]
gi|194343285|gb|EDX24251.1| beta-galactosidase [Streptomyces sp. Mg1]
Length = 588
Score = 171 bits (433), Expect = 2e-39, Method: Compositional matrix adjust.
Identities = 104/297 (35%), Positives = 153/297 (51%), Gaps = 26/297 (8%)
Query: 43 LISGSIHYPRSTPEMWPDLIQKSKDGGLDVIETYVFWNLHEPVRNQYNFEGRYDLVKFVK 102
++SG +HY R P +W D + K++ GL+ +ETYV WNLH+P +++ +G DL +F+
Sbjct: 25 ILSGGLHYFRVHPGLWRDRLHKARLMGLNTVETYVPWNLHQPRPDEFRMDGGLDLPRFLD 84
Query: 103 LVAEAGLYAHLRIGPYVCAEWNFGGFPLWLHFIPGIQFRTDNEPFKAEMQRFTAKIVDMM 162
L A GL+ LR GPY+CAEW GG P WL P ++ R+ + F A + + +++ +
Sbjct: 85 LAAAEGLHVLLRPGPYICAEWEGGGLPSWLLADPAMRLRSRDPNFLAAVDDYFRRLLPPL 144
Query: 163 KQEKLYASQGGPIILSQIENEYGNIDSAYGAAGKSYIKWAAGMALSLDTGVPWVMCQQ-- 220
AS+GGP++ Q+ENEYG AYG +Y++ A VP C Q
Sbjct: 145 HDR--LASRGGPVLAVQVENEYG----AYG-DDTAYLEHLADSLRRHGVDVPLFTCDQPA 197
Query: 221 ---SDAPDPIINTCN-----GFYCDQFTPNSNNKPKMWTENWSGWFLSFGGAVPYRPVED 272
A ++ T N + + P + TE W GWF +GG R E
Sbjct: 198 DLERGALAGVLATANFGSRPAAHLATLRTARPSAPLLCTEFWIGWFDRWGGNHVVRDAEQ 257
Query: 273 LAFAVARFFQRGGTFQNYYMYHGGTNF-------DRTSGGPFISTSYDYDAPLDEYG 322
+ + G + N+YM+HGGTNF D+ + P + TSYDYDAPLDE G
Sbjct: 258 ASQELDELLATGASV-NFYMFHGGTNFGFMNGANDKHTYRPTV-TSYDYDAPLDEAG 312
>gi|299148656|ref|ZP_07041718.1| beta-galactosidase (Lactase) [Bacteroides sp. 3_1_23]
gi|298513417|gb|EFI37304.1| beta-galactosidase (Lactase) [Bacteroides sp. 3_1_23]
Length = 778
Score = 171 bits (433), Expect = 2e-39, Method: Compositional matrix adjust.
Identities = 111/337 (32%), Positives = 169/337 (50%), Gaps = 33/337 (9%)
Query: 6 ILLLVLCWGFVVLATTSFGANVTYDHRAVVIGGKRRVLISGSIHYPRSTPEMWPDLIQKS 65
+++L + F++ + +S V + I GK LI G +HYPR E W D ++++
Sbjct: 11 LVILNIIVSFLISSCSSPKEQVRIGNGTFTIEGKDIQLICGEMHYPRIPHEYWRDRLKRA 70
Query: 66 KDGGLDVIETYVFWNLHEPVRNQYNFEGRYDLVKFVKLVAEAGLYAHLRIGPYVCAEWNF 125
+ GL+ + YVFWN HE +++F G+ D+ +F++ E GLY LR GPYVCAEW+F
Sbjct: 71 RAMGLNTVSAYVFWNFHERQPGEFDFSGQADIAEFIRTAQEEGLYVILRPGPYVCAEWDF 130
Query: 126 GGFPLWLHFIPGIQFRTDNEPFKAEMQRFTAKIVDMMKQ-EKLYASQGGPIILSQIENEY 184
GG+P WL + +R+ + F + +R+ I ++ KQ L + GG II+ Q+ENEY
Sbjct: 131 GGYPSWLLKEKDMTYRSKDPRFLSYCERY---IKELGKQLSPLTINNGGNIIMVQVENEY 187
Query: 185 GNIDSAYGAAGKSYIKWAAGMALSLDTGVPWVMCQ-----QSDAPDPIINTCNGFYCDQ- 238
G+ AA K Y+ M VP C ++ + + T NG + +
Sbjct: 188 GSY-----AADKGYLAAIRDMIKEAGFNVPLFTCDGGGQVEAGHTEGALPTLNGVFGEDI 242
Query: 239 ---FTPNSNNKPKMWTENWSGWFLSFG---GAVPY-RPVEDLAFAVARFFQRGGTFQNYY 291
P E + WF +G +V Y RP E L + ++ G + Y
Sbjct: 243 FKVIDKYQKGGPYFVAEFYPAWFDEWGRRHSSVAYERPAEQLDWMLSH-----GVSVSMY 297
Query: 292 MYHGGTNFDRTSGGPF------ISTSYDYDAPLDEYG 322
M+HGGTNF+ T+G TSYDYDAPL E+G
Sbjct: 298 MFHGGTNFEYTNGANTGGGYQPQPTSYDYDAPLGEWG 334
>gi|333377694|ref|ZP_08469427.1| hypothetical protein HMPREF9456_01022 [Dysgonomonas mossii DSM
22836]
gi|332883714|gb|EGK03994.1| hypothetical protein HMPREF9456_01022 [Dysgonomonas mossii DSM
22836]
Length = 630
Score = 171 bits (432), Expect = 2e-39, Method: Compositional matrix adjust.
Identities = 122/374 (32%), Positives = 179/374 (47%), Gaps = 42/374 (11%)
Query: 35 VIGGKRRVLISGSIHYPRSTPEMWPDLIQKSKDGGLDVIETYVFWNLHEPVRNQYNFEGR 94
V GK +ISG +HYPR + W +Q K GL+ + TYVFWN+HEP +++F G
Sbjct: 36 VYDGKPVRIISGEMHYPRIPHQYWRHRMQMLKAMGLNAVATYVFWNIHEPEPGKWDFTGD 95
Query: 95 YDLVKFVKLVAEAGLYAHLRIGPYVCAEWNFGGFPLWLHFIPGIQFRTDNEPFKAEMQRF 154
+L +++K+ E GL LR GPYVCAEW FGG+P WL + G++ R DNE F ++
Sbjct: 96 KNLAEYIKIAGEEGLMVILRPGPYVCAEWEFGGYPWWLQNVEGLELRRDNEQF----LKY 151
Query: 155 TAKIVDMMKQE--KLYASQGGPIILSQIENEYGNIDSA-----------YGAAGKSYIKW 201
T ++ + +E L ++GGPI++ Q ENE+G+ S Y A +K
Sbjct: 152 TQLYINRLYKEVGNLQITKGGPIVMVQAENEFGSYVSQRKDIPLEEHRRYNAKIVQQLKD 211
Query: 202 AAGMALSLDTGVPWVMCQQSDAPDPIINTCNGF--------YCDQFTPNSNNKPKMWTEN 253
A S + W+ + A + T NG D++ N P M E
Sbjct: 212 AGFDVPSFTSDGSWLF--EGGAVPGALPTANGESNIENLKKAVDKY--NGGQGPYMVAEF 267
Query: 254 WSGWFLSFGGAVPYRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNFDRTSGGPFIS---- 309
+ GW + P +A ++ Q + NYYM HGGTNF TSG +
Sbjct: 268 YPGWLAHWLEPHPQISATSIARQTEKYLQNNVSI-NYYMVHGGTNFGFTSGANYDKKHDI 326
Query: 310 ----TSYDYDAPLDEYGLIRQPKWGHLKDLHKAIKLCEAALVATDPTYPSLG-PNLEATV 364
TSYDYDAP+ E G + PK+ L+++ K K +L P + P+++
Sbjct: 327 QPDLTSYDYDAPISEAGWV-TPKYDSLRNVIK--KYVNYSLPKVPAAIPVIEIPSIKLDK 383
Query: 365 YKTGSGLCSAFLAN 378
T GL S + N
Sbjct: 384 IATLDGLNSKVVEN 397
>gi|402304595|ref|ZP_10823662.1| glycosyl hydrolase family 35 [Prevotella sp. MSX73]
gi|400380871|gb|EJP33679.1| glycosyl hydrolase family 35 [Prevotella sp. MSX73]
Length = 778
Score = 171 bits (432), Expect = 2e-39, Method: Compositional matrix adjust.
Identities = 113/353 (32%), Positives = 173/353 (49%), Gaps = 27/353 (7%)
Query: 19 ATTSFGANVTYDHRAVVIGGKRRVLISGSIHYPRSTPEMWPDLIQKSKDGGLDVIETYVF 78
+T G T + ++ GK V+ + +HYPR W I+ K G++ + YVF
Sbjct: 13 STAQKGGTFTVGDKTFLLNGKPFVVKAAELHYPRIPRPYWEHRIKMCKALGMNTVCLYVF 72
Query: 79 WNLHEPVRNQYNFEGRYDLVKFVKLVAEAGLYAHLRIGPYVCAEWNFGGFPLWLHFIPGI 138
WN+HE +++F G D+ +F +L GLY +R GPYVCAEW GG P WL I
Sbjct: 73 WNIHEQQEGKFDFTGNNDVAEFCRLAQRNGLYVIVRPGPYVCAEWEMGGLPWWLLKKKDI 132
Query: 139 QFRTDNEPFKAEMQRFTAKIVDMMKQEKLYASQGGPIILSQIENEYGNI--DSAYGAAGK 196
+ R + F ++ F K+ + + L GGPII+ Q+ENEYG+ + AY +A +
Sbjct: 133 RLREPDPYFMERVKLFERKVGEQLAS--LTIQNGGPIIMVQVENEYGSYGKNKAYVSAIR 190
Query: 197 SYIKWAAGMALSLDTGVPWVMCQQSDAPDPIINTCN---GFYCDQ-------FTPNSNNK 246
++ + ++L W + + D ++ T N G DQ PN+
Sbjct: 191 DIVRRSGFDKVTL-FQCDWASNFEKNGLDDLVWTMNFGTGADIDQQFRRLGELRPNA--- 246
Query: 247 PKMWTENWSGWFLSFGGAVPYRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNFDRTSGG- 305
P+M +E WSGWF +G RP + + + +G +F + YM HGGT+F +G
Sbjct: 247 PQMCSEFWSGWFDKWGARHETRPAKAMVEGIDEMLSKGISF-SLYMTHGGTSFGHWAGAN 305
Query: 306 -PFIS---TSYDYDAPLDEYGLIRQPKWGHLKDLHKAIKLCEAALVATDPTYP 354
P + TSYDYDAP++EYG PK+ L+ H K + + P P
Sbjct: 306 SPGFAPDVTSYDYDAPINEYGQA-TPKYWELR--HTMEKYNDGGKLPAPPKAP 355
>gi|237721434|ref|ZP_04551915.1| beta-galactosidase [Bacteroides sp. 2_2_4]
gi|293370839|ref|ZP_06617384.1| glycosyl hydrolase family 35 [Bacteroides ovatus SD CMC 3f]
gi|229449230|gb|EEO55021.1| beta-galactosidase [Bacteroides sp. 2_2_4]
gi|292634055|gb|EFF52599.1| glycosyl hydrolase family 35 [Bacteroides ovatus SD CMC 3f]
Length = 777
Score = 171 bits (432), Expect = 2e-39, Method: Compositional matrix adjust.
Identities = 111/337 (32%), Positives = 169/337 (50%), Gaps = 33/337 (9%)
Query: 6 ILLLVLCWGFVVLATTSFGANVTYDHRAVVIGGKRRVLISGSIHYPRSTPEMWPDLIQKS 65
+++L + F++ + +S V + I GK LI G +HYPR E W D ++++
Sbjct: 11 LVILNIIVSFLISSCSSPKEQVRIGNGTFTIEGKDIQLICGEMHYPRIPHEYWRDRLKRA 70
Query: 66 KDGGLDVIETYVFWNLHEPVRNQYNFEGRYDLVKFVKLVAEAGLYAHLRIGPYVCAEWNF 125
+ GL+ + YVFWN HE +++F G+ D+ +F++ E GLY LR GPYVCAEW+F
Sbjct: 71 RAMGLNTVSAYVFWNFHERQPGEFDFSGQADIAEFIRTAQEEGLYVILRPGPYVCAEWDF 130
Query: 126 GGFPLWLHFIPGIQFRTDNEPFKAEMQRFTAKIVDMMKQ-EKLYASQGGPIILSQIENEY 184
GG+P WL + +R+ + F + +R+ I ++ KQ L + GG II+ Q+ENEY
Sbjct: 131 GGYPSWLLKEKDMTYRSKDPRFLSYCERY---IKELGKQLSPLTINNGGNIIMVQVENEY 187
Query: 185 GNIDSAYGAAGKSYIKWAAGMALSLDTGVPWVMCQ-----QSDAPDPIINTCNGFYCDQ- 238
G+ AA K Y+ M VP C ++ + + T NG + +
Sbjct: 188 GSY-----AADKGYLAAIRDMIKEAGFNVPLFTCDGGGQVEAGHTEGALPTLNGVFGEDI 242
Query: 239 ---FTPNSNNKPKMWTENWSGWFLSFG---GAVPY-RPVEDLAFAVARFFQRGGTFQNYY 291
P E + WF +G +V Y RP E L + ++ G + Y
Sbjct: 243 FKVIDKYQKGGPYFVAEFYPAWFDEWGRRHSSVAYERPAEQLDWMLSH-----GVSVSMY 297
Query: 292 MYHGGTNFDRTSGGPF------ISTSYDYDAPLDEYG 322
M+HGGTNF+ T+G TSYDYDAPL E+G
Sbjct: 298 MFHGGTNFEYTNGANTGGGYQPQPTSYDYDAPLGEWG 334
>gi|345800024|ref|XP_546385.3| PREDICTED: galactosidase, beta 1-like 3 [Canis lupus familiaris]
Length = 808
Score = 170 bits (431), Expect = 2e-39, Method: Compositional matrix adjust.
Identities = 140/460 (30%), Positives = 213/460 (46%), Gaps = 52/460 (11%)
Query: 36 IGGKRRVLISGSIHYPRSTPEMWPDLIQKSKDGGLDVIETYVFWNLHEPVRNQYNFEGRY 95
+GG + + GSIHY R W D ++K K G + + TYV WNLHEP R +++F G
Sbjct: 237 LGGHKFQVFGGSIHYFRVPRAYWGDRLRKLKACGFNTVTTYVPWNLHEPERGKFDFSGNL 296
Query: 96 DLVKFVKLVAEAGLYAHLRIGPYVCAEWNFGGFPLWLHFIPGIQFRTDNEPFKAEMQRFT 155
D+ FV L AE GL+ LR GPY+C+E + GG P WL P + RT F + ++
Sbjct: 297 DMEAFVLLAAEMGLWVILRPGPYICSEIDLGGLPSWLLQDPKMVLRTTYSGFVKAVDKYF 356
Query: 156 AKIVDMMKQEKLYASQGGPIILSQIENEYGNIDSAYGAAGKSYIKWAAGMALSLDTGVPW 215
++ + L +GGPII Q+ENEYG+ A + Y+ + L+ G+
Sbjct: 357 DHLIS--RVVPLQYRRGGPIIAVQVENEYGSF-----AEDRGYMPYLQKAL--LERGIVE 407
Query: 216 VMCQQSDAPDPI----------INTCNGFYCDQFTPNS---NNKPKMWTENWSGWFLSFG 262
++ DA + + IN N F F S +NKP M E W GWF ++G
Sbjct: 408 LLVTSDDAENLLKGHIKGVLATINM-NSFQESDFKLLSYVQSNKPIMVMEFWVGWFDTWG 466
Query: 263 GAVPYRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNFDRTSGGP------FISTSYDYDA 316
+ +D+ V +F +F N YM+HGGTNF +G + TSYDYDA
Sbjct: 467 SEHKVKNPKDVEETVTKFIASEISF-NVYMFHGGTNFGFMNGATDFGIHRGVVTSYDYDA 525
Query: 317 PLDEYGLIRQPKWGHLKDLH---KAIKLCEAALVATDPTYPSLGPNLEATVYKTGSGLCS 373
L E G + K+ L+ L AI L + YPS+ P+L ++ L
Sbjct: 526 VLTEAGDYTE-KYFKLRRLFGSVSAIPLPPLPELTPKAEYPSVKPSLYLPLWDVLQYLNE 584
Query: 374 AFLANIGTN-SDVTVK-FNGNSYLLPAWSVSI---------LPDCKNVVFNTAKI----- 417
++N N ++ + NG SY + SI + D V N I
Sbjct: 585 PVMSNTPVNMENLPINGGNGQSYGFVLYETSICSGGSLRADVHDTAQVFLNEINIGHLHD 644
Query: 418 NSVTL-VPSFSR-QSLQVAADSSDAIGSGWSYINEPVGIS 455
++ TL VP+ +R Q L++ ++ + W ++ G++
Sbjct: 645 HAKTLTVPTMTRCQLLRILVENQGRVNFSWKIQDQRKGLT 684
>gi|357014284|ref|ZP_09079283.1| beta-galactosidase [Paenibacillus elgii B69]
Length = 591
Score = 170 bits (431), Expect = 3e-39, Method: Compositional matrix adjust.
Identities = 112/304 (36%), Positives = 157/304 (51%), Gaps = 22/304 (7%)
Query: 36 IGGKRRVLISGSIHYPRSTPEMWPDLIQKSKDGGLDVIETYVFWNLHEPVRNQYNFEGRY 95
+ G+ L+SG+IHY R PE W D + K K G + +ETY+ WNLHEP Q+ F+G
Sbjct: 13 LDGESIRLVSGAIHYFRVVPEYWRDRLLKLKACGFNTVETYIPWNLHEPKPGQFRFDGLA 72
Query: 96 DLVKFVKLVAEAGLYAHLRIGPYVCAEWNFGGFPLWLHFIPGIQFRTDNEPFKAEMQRFT 155
D+V+FV++ E GL+ +R PY+CAEW FGG P WL PG++ R + P+ + +
Sbjct: 73 DVVRFVEIAGEVGLHVIVRPSPYICAEWEFGGLPAWLLADPGMRVRCMHRPYLDRVDAYY 132
Query: 156 AKIVDMMKQEKLYASQGGPIILSQIENEYGNI--DSAYGAAGK-SYIKWAAGMALSLDTG 212
V + + L + GGPII QIENEYG+ D AY K + ++ + L G
Sbjct: 133 D--VLLPLLKPLLCTNGGPIIAMQIENEYGSYGNDRAYLVYLKDAMLQRGMDVLLFTSDG 190
Query: 213 VPWVMCQQSDAPDPIINTCN-GFYCDQ----FTPNSNNKPKMWTENWSGWFLSFGGAVPY 267
M Q P ++ T N G ++ + P M E W+GWF +G
Sbjct: 191 PEHFMLQGGMIPG-VLETVNFGSRAEEAFEMLRKYQPDGPIMCMEYWNGWFDHWGEQHHT 249
Query: 268 RPVEDLAFAVARFFQRGGTFQNYYMYHGGTNFDRTSGG---------PFISTSYDYDAPL 318
R +D+A + G + N+YM+HGGTNF SG P I TSYDYD PL
Sbjct: 250 RDAKDVADVFDDMLRLGASV-NFYMFHGGTNFGYMSGANCPQRDHYEPTI-TSYDYDVPL 307
Query: 319 DEYG 322
+E G
Sbjct: 308 NESG 311
>gi|383114571|ref|ZP_09935333.1| hypothetical protein BSGG_1258 [Bacteroides sp. D2]
gi|382948460|gb|EFS30558.2| hypothetical protein BSGG_1258 [Bacteroides sp. D2]
Length = 775
Score = 170 bits (431), Expect = 3e-39, Method: Compositional matrix adjust.
Identities = 111/337 (32%), Positives = 169/337 (50%), Gaps = 33/337 (9%)
Query: 6 ILLLVLCWGFVVLATTSFGANVTYDHRAVVIGGKRRVLISGSIHYPRSTPEMWPDLIQKS 65
+++L + F++ + +S V + I GK LI G +HYPR E W D ++++
Sbjct: 9 LVILNIIVSFLISSCSSPKEQVRIGNGTFTIEGKDIQLICGEMHYPRIPHEYWRDRLKRA 68
Query: 66 KDGGLDVIETYVFWNLHEPVRNQYNFEGRYDLVKFVKLVAEAGLYAHLRIGPYVCAEWNF 125
+ GL+ + YVFWN HE +++F G+ D+ +F++ E GLY LR GPYVCAEW+F
Sbjct: 69 RAMGLNTVSAYVFWNFHERQPGEFDFSGQADIAEFIRTAQEEGLYVILRPGPYVCAEWDF 128
Query: 126 GGFPLWLHFIPGIQFRTDNEPFKAEMQRFTAKIVDMMKQ-EKLYASQGGPIILSQIENEY 184
GG+P WL + +R+ + F + +R+ I ++ KQ L + GG II+ Q+ENEY
Sbjct: 129 GGYPSWLLKEKDMTYRSKDPRFLSYCERY---IKELGKQLSPLTINNGGNIIMVQVENEY 185
Query: 185 GNIDSAYGAAGKSYIKWAAGMALSLDTGVPWVMCQ-----QSDAPDPIINTCNGFYCDQF 239
G+ AA K Y+ M VP C ++ + + T NG + +
Sbjct: 186 GSY-----AADKEYLAAIRDMIKEAGFNVPLFTCDGGGQVEAGHVEGALPTLNGVFGEDI 240
Query: 240 ----TPNSNNKPKMWTENWSGWFLSFG---GAVPY-RPVEDLAFAVARFFQRGGTFQNYY 291
P E + WF +G +V Y RP E L + ++ G + Y
Sbjct: 241 FKVVDKYQKGGPYFVAEFYPAWFDEWGRRHSSVAYERPAEQLDWMLSH-----GVSVSMY 295
Query: 292 MYHGGTNFDRTSGGPF------ISTSYDYDAPLDEYG 322
M+HGGTNF+ T+G TSYDYDAPL E+G
Sbjct: 296 MFHGGTNFEYTNGANTGGGYQPQPTSYDYDAPLGEWG 332
>gi|260804659|ref|XP_002597205.1| hypothetical protein BRAFLDRAFT_203307 [Branchiostoma floridae]
gi|229282468|gb|EEN53217.1| hypothetical protein BRAFLDRAFT_203307 [Branchiostoma floridae]
Length = 608
Score = 170 bits (431), Expect = 3e-39, Method: Compositional matrix adjust.
Identities = 120/344 (34%), Positives = 170/344 (49%), Gaps = 55/344 (15%)
Query: 24 GANVTYDHRAVVIGGKRRVLISGSIHYPRSTPEMWPDLIQKSKDGGLDVIETYVFWNLHE 83
GAN T D GK L+SG++HY R PE W D + K K GL+ +ETYV WNLHE
Sbjct: 27 GANFTID-------GKPVRLLSGAMHYFRVVPEYWRDRMLKMKAAGLNTLETYVPWNLHE 79
Query: 84 PVRNQYNFEGRYDLVKFVKLVAEAGLYAHLRIGPYVCAEWNFGGFPLWLHFIPGIQFRTD 143
P + YNFEG DL +++ + E GL+ LR GPY+CAEW FGG P WL ++ RT
Sbjct: 80 PEKYTYNFEGILDLGRYLDIAHEVGLWVILRPGPYICAEWEFGGIPGWLAYVKE-HVRTT 138
Query: 144 N----EPFKAEMQRFTAKIVDMMKQEKLYASQGGPIILSQIENEYGNIDSA--YGAAGKS 197
+P + R A++V + GGPII QIENEYG ++ Y K
Sbjct: 139 RPMFIDPVEVWFGRLLAEVVPRQ------YTNGGPIIAVQIENEYGGFSNSTEYMERLKK 192
Query: 198 YIK---------WAAGMALSLDTGVPWVMCQ---QSDAPDPIINTCNGFYCDQFTPNSNN 245
++ + G + G+P V+ Q++A D + + +
Sbjct: 193 ILESRGIVELLFTSDGKGALISGGIPGVLKTVNFQNNASDKL---------QKLKEIQPD 243
Query: 246 KPKMWTENWSGWFLSFGGAVPYRPVEDLAFAVARFF-QRGGTFQNYYMYHGGTNFDRTSG 304
+P M E W+GWF +G +E +F + F+ G N+YM+HGGTNF +G
Sbjct: 244 RPMMVMEYWTGWFDHWGEDHHLYRLESESFVHSVFYILDAGASVNFYMFHGGTNFGFMNG 303
Query: 305 G-----------PFISTSYDYDAPLDEYGLIRQPKWGHLKDLHK 337
P I TSYDYDAP+ E G + PK+ ++++ K
Sbjct: 304 ANTRYKSGGRTLPTI-TSYDYDAPISETGDL-TPKYFKIREILK 345
>gi|379722393|ref|YP_005314524.1| beta-galactosidase [Paenibacillus mucilaginosus 3016]
gi|378571065|gb|AFC31375.1| beta-galactosidase [Paenibacillus mucilaginosus 3016]
Length = 591
Score = 170 bits (430), Expect = 3e-39, Method: Compositional matrix adjust.
Identities = 105/299 (35%), Positives = 153/299 (51%), Gaps = 18/299 (6%)
Query: 38 GKRRVLISGSIHYPRSTPEMWPDLIQKSKDGGLDVIETYVFWNLHEPVRNQYNFEGRYDL 97
G+ L SG+IHY R PE W D ++K K G + +ETYV WNLHEP ++ FEG DL
Sbjct: 15 GEEIRLYSGAIHYFRIVPEYWEDRLRKLKACGFNTVETYVPWNLHEPQEGRFVFEGMADL 74
Query: 98 VKFVKLVAEAGLYAHLRIGPYVCAEWNFGGFPLWLHFIPGIQFRTDNEPFKAEMQRFTAK 157
+F++L GL+ +R PY+CAEW FGG P WL PG++ R + + +++ + +
Sbjct: 75 ERFIRLAGRLGLHVIVRPSPYICAEWEFGGLPAWLLAEPGMKLRCADPLYLSKVDAYYDE 134
Query: 158 IVDMMKQEKLYASQGGPIILSQIENEYGNI--DSAYGAAGKS-YIKWAAGMALSLDTGVP 214
++ + L + GGP+IL Q+ENEYG+ D AY + ++ + L G
Sbjct: 135 LIPRLV--PLLCTSGGPVILVQVENEYGSYGSDKAYLEHLRDGLVRRGIDVPLFTSDGPT 192
Query: 215 WVMCQQSDAPDPIINTCN--GFYCDQFTPNSNNKPK---MWTENWSGWFLSFGGAVPYRP 269
M Q P ++ T N + F +P+ M E W+GWF + R
Sbjct: 193 DSMLQGGSLPG-VLATVNFGSRTAESFAKLREYQPQGPLMCMEYWNGWFDHWMEEHHQRD 251
Query: 270 VEDLAFAVARFFQRGGTFQNYYMYHGGTNFDRTSGGPFIS------TSYDYDAPLDEYG 322
D A + G + N+YM+HGGTNF +G I TSYDYD+PL E+G
Sbjct: 252 AADAARVFGEMLEAGASV-NFYMFHGGTNFGFHNGANHIKTYEPTITSYDYDSPLTEWG 309
>gi|257875465|ref|ZP_05655118.1| 35 glycosylhydrolase [Enterococcus casseliflavus EC20]
gi|257809631|gb|EEV38451.1| 35 glycosylhydrolase [Enterococcus casseliflavus EC20]
Length = 585
Score = 170 bits (430), Expect = 3e-39, Method: Compositional matrix adjust.
Identities = 120/349 (34%), Positives = 163/349 (46%), Gaps = 42/349 (12%)
Query: 43 LISGSIHYPRSTPEMWPDLIQKSKDGGLDVIETYVFWNLHEPVRNQYNFEGRYDLVKFVK 102
+ISG+IHY R PE W D ++K + G + +ETYV WNLHE Y F+G DL +F++
Sbjct: 19 VISGAIHYFRVVPEYWQDRLEKLRLMGCNTVETYVPWNLHEAQEGVYQFDGILDLRRFIQ 78
Query: 103 LVAEAGLYAHLRIGPYVCAEWNFGGFPLWLHFIPGIQFRTDNEPFKAEMQRFTAKIVDMM 162
E GLY LR PY+CAEW FGG P WL P ++ R D PF ++ R+ A + +
Sbjct: 79 TAQEVGLYVILRPAPYICAEWEFGGLPYWLLQDPMMKLRFDYPPFMEKITRYFAHLFPQV 138
Query: 163 KQEKLYASQGGPIILSQIENEYGNIDSAYGAAGKSYIKWAAGMALSLDTGVPWV------ 216
+ L +QGGPII+ Q+ENEYG+ A K Y++ P V
Sbjct: 139 RD--LQITQGGPIIMMQVENEYGSY-----ANDKEYLRKMVAAMRQHGVETPLVTSDGPW 191
Query: 217 --MCQQSDAPD---PIINTCNGFYCDQFTP----NSNNKPKMWTENWSGWFLSFGGAVPY 267
M + D P IN C + F + +P M E W GWF ++G +
Sbjct: 192 HDMLENGSIKDLALPTIN-CGSNIKENFEKLRKFHGEKRPLMVMEFWIGWFDAWGDDQHH 250
Query: 268 -RPVEDLAFAVARFFQRGGTFQNYYMYHGGTNFDRTSGGPFIS------TSYDYDAPLDE 320
++D + G N YM+HGGTNF +G + TSYDYDA L E
Sbjct: 251 TTSIQDAVKELQDCLALGSV--NIYMFHGGTNFGFMNGSNYYERLAPDVTSYDYDALLTE 308
Query: 321 YGLIRQPKWGHLKDLHKAIKLCEAALVATDPTYPSLGPNLEATVYKTGS 369
WG ++A K A A P +P L +E Y T S
Sbjct: 309 --------WGEPTAKYQAFKKV-IADYAEIPEFP-LSMKIERKAYGTFS 347
>gi|337749468|ref|YP_004643630.1| beta-galactosidase [Paenibacillus mucilaginosus KNP414]
gi|336300657|gb|AEI43760.1| Beta-galactosidase [Paenibacillus mucilaginosus KNP414]
Length = 591
Score = 170 bits (430), Expect = 3e-39, Method: Compositional matrix adjust.
Identities = 105/299 (35%), Positives = 153/299 (51%), Gaps = 18/299 (6%)
Query: 38 GKRRVLISGSIHYPRSTPEMWPDLIQKSKDGGLDVIETYVFWNLHEPVRNQYNFEGRYDL 97
G+ L SG+IHY R PE W D ++K K G + +ETYV WNLHEP ++ FEG DL
Sbjct: 15 GEEIRLYSGAIHYFRIVPEYWEDRLRKLKACGFNTVETYVPWNLHEPQEGRFVFEGMADL 74
Query: 98 VKFVKLVAEAGLYAHLRIGPYVCAEWNFGGFPLWLHFIPGIQFRTDNEPFKAEMQRFTAK 157
+F++L GL+ +R PY+CAEW FGG P WL PG++ R + + +++ + +
Sbjct: 75 ERFIRLAGRLGLHVIVRPSPYICAEWEFGGLPAWLLAEPGMKLRCADPLYLSKVDAYYDE 134
Query: 158 IVDMMKQEKLYASQGGPIILSQIENEYGNI--DSAYGAAGKS-YIKWAAGMALSLDTGVP 214
++ + L + GGP+IL Q+ENEYG+ D AY + ++ + L G
Sbjct: 135 LIPRLV--PLLCTSGGPVILVQVENEYGSYGSDKAYLEHLRDGLVRRGIDVPLFTSDGPT 192
Query: 215 WVMCQQSDAPDPIINTCN--GFYCDQFTPNSNNKPK---MWTENWSGWFLSFGGAVPYRP 269
M Q P ++ T N + F +P+ M E W+GWF + R
Sbjct: 193 DSMLQGGSLPG-VLATVNFGSRTAESFAKLREYQPQGPLMCMEYWNGWFDHWMEEHHQRD 251
Query: 270 VEDLAFAVARFFQRGGTFQNYYMYHGGTNFDRTSGGPFIS------TSYDYDAPLDEYG 322
D A + G + N+YM+HGGTNF +G I TSYDYD+PL E+G
Sbjct: 252 AADAARVFGEMLEAGASV-NFYMFHGGTNFGFYNGANHIKTYEPTITSYDYDSPLTEWG 309
>gi|257143787|emb|CAZ44333.1| beta-D-galactosidase [Paenibacillus thiaminolyticus]
Length = 583
Score = 170 bits (430), Expect = 4e-39, Method: Compositional matrix adjust.
Identities = 116/340 (34%), Positives = 169/340 (49%), Gaps = 21/340 (6%)
Query: 27 VTYDHRAVVIGGKRRVLISGSIHYPRSTPEMWPDLIQKSKDGGLDVIETYVFWNLHEPVR 86
++YD +G + LISG+IHY R P W D ++K K G + IETYV WN+HEP
Sbjct: 4 LSYDEGQFKMGDRPIQLISGAIHYFRIVPAYWEDRLRKIKAMGCNCIETYVAWNVHEPRE 63
Query: 87 NQYNFEGRYDLVKFVKLVAEAGLYAHLRIGPYVCAEWNFGGFPLWLHFIPGIQFRTDNEP 146
+++FE D+ +FV+L E GLY +R PY+CAEW FGG P WL ++ R ++
Sbjct: 64 GEFHFERMADVAEFVRLAGELGLYVIVRPSPYICAEWEFGGLPAWL-LKDDMRLRCNDPR 122
Query: 147 FKAEMQRFTAKIVDMMKQEKLYASQGGPIILSQIENEYGNI--DSAYGAAGKSYIKWAAG 204
F ++ + ++ + L A++GGPII QIENEYG+ D AY A ++ +
Sbjct: 123 FLEKVSAYYDALLPQLT--PLLATKGGPIIAVQIENEYGSYGNDQAYLQAQRAMLIERGV 180
Query: 205 MALSLDTGVPWVMCQQSDAPDPIINTCN-----GFYCDQFTPNSNNKPKMWTENWSGWFL 259
L + P Q + ++ T N D+ + P M E W+GWF
Sbjct: 181 DVLLFTSDGPQDDMLQGGMAEGVLATVNFGSRPKEAFDKLKEYQPDGPLMCMEYWNGWFD 240
Query: 260 SFGGAVPYRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNFDRTSGG-------PFISTSY 312
+ R +D A + G + N+YM HGGTNF SG P + TSY
Sbjct: 241 HWFEPHHTRDAKDAARVLDDMLGMGASV-NFYMVHGGTNFGFGSGANHSDKYEPTV-TSY 298
Query: 313 DYDAPLDEYGLIRQPKWGHLKD-LHKAIKLCEAALVATDP 351
DYDA + E G + PK+ ++ + K + L E L A P
Sbjct: 299 DYDAAISEAGDL-TPKYHAFREVIGKYVSLPEGELPANTP 337
>gi|196002910|ref|XP_002111322.1| hypothetical protein TRIADDRAFT_1215 [Trichoplax adhaerens]
gi|190585221|gb|EDV25289.1| hypothetical protein TRIADDRAFT_1215, partial [Trichoplax
adhaerens]
Length = 543
Score = 170 bits (430), Expect = 4e-39, Method: Compositional matrix adjust.
Identities = 117/313 (37%), Positives = 161/313 (51%), Gaps = 34/313 (10%)
Query: 45 SGSIHYPRSTPEMWPDLIQKSKDGGLDVIETYVFWNLHEPVRNQYNFEGRYDLVKFVKLV 104
SG+IHY R PE W D + K K GL+ +ETYV WNLHEPV Q+++ G ++ KF+ L
Sbjct: 15 SGAIHYFRVVPEYWRDRLLKMKAFGLNTVETYVPWNLHEPVPGQFDYTGILNVRKFILLA 74
Query: 105 AEAGLYAHLRIGPYVCAEWNFGGFPLWLHFIPGIQFRTDNEPFKAEMQRFTAKIVDMMKQ 164
E G Y LR GPY+CAEW FGG P WL +Q R+ +PFK + RF + +K
Sbjct: 75 QELGFYVILRPGPYICAEWEFGGMPSWLLSDKNMQVRSTYKPFKDAVNRFFDGFIPEIKS 134
Query: 165 EKLYASQGGPIILSQIENEYGNIDSAYGAAGKSYIKWAAGMALSLDTGVPWVMCQQSD-- 222
L AS+GGPII Q+ENEYG +YG + + Y+++ ++ V S+
Sbjct: 135 --LQASKGGPIIAVQVENEYG----SYG-SDEEYMQFIRDALINRGIVELLVTSDNSEGI 187
Query: 223 ----APDPIINTCN--GFYCDQFT--PNSNNKPKMWTENWSGWFLSFGGAVPYRPVEDLA 274
AP ++ T N G + + P + E WSGWF +G V +A
Sbjct: 188 KHGGAPG-VLKTYNFQGHAKSHLSILERLQDAPSIVMEFWSGWFDHWGEK--NHQVHTIA 244
Query: 275 FAVARF---FQRGGTFQNYYMYHGGTNFDRTSGGPFIS---------TSYDYDAPLDEYG 322
F +F N+Y++HGGTNF +G FI TSYDYDAPL E G
Sbjct: 245 HVTNTFKDILDCDASF-NFYVFHGGTNFGFMNGANFIDFFSYYLPTVTSYDYDAPLSEAG 303
Query: 323 LIRQPKWGHLKDL 335
I + K+ L+ +
Sbjct: 304 DITE-KYMELRKI 315
>gi|317504905|ref|ZP_07962857.1| family 35 glycosyl hydrolase [Prevotella salivae DSM 15606]
gi|315663982|gb|EFV03697.1| family 35 glycosyl hydrolase [Prevotella salivae DSM 15606]
Length = 784
Score = 170 bits (430), Expect = 4e-39, Method: Compositional matrix adjust.
Identities = 110/348 (31%), Positives = 170/348 (48%), Gaps = 20/348 (5%)
Query: 5 EILLLVLCWGFVVLATTSFGANVTYDHRAVVIGGKRRVLISGSIHYPRSTPEMWPDLIQK 64
+ ++ L + L + G + T ++ G+ V+ + +HYPR W I+
Sbjct: 8 KTIITTLLFSLSTLTALARGGDFTAGKNTFLLNGQPFVVKAAELHYPRIPRPYWDQRIKM 67
Query: 65 SKDGGLDVIETYVFWNLHEPVRNQYNFEGRYDLVKFVKLVAEAGLYAHLRIGPYVCAEWN 124
K G++ I YVFWN+HE ++Y+F G D+ F +L + G+Y +R GPYVCAEW
Sbjct: 68 CKALGMNTICLYVFWNIHEQQESKYDFTGNNDVAAFCRLAQKNGMYVIVRPGPYVCAEWE 127
Query: 125 FGGFPLWLHFIPGIQFRTDNEPFKAEMQRFTAKIVDMMKQEKLYASQGGPIILSQIENEY 184
GG P WL I+ R D+ F A ++ F A++ + L GGPII+ Q+ENEY
Sbjct: 128 MGGLPWWLLKKKDIRLREDDPYFLARVKAFEAEVGRQLA--PLTIQNGGPIIMVQVENEY 185
Query: 185 GN--IDSAYGAAGKSYIKWAAGMALSLDTGVPWVMCQQSDAPDPIINTCN---GFYCD-Q 238
G+ ++ Y + + +K A+G W + + D ++ T N G D Q
Sbjct: 186 GSYGVNKQYVSQIRDIVK-ASGFDKVTLFQCDWASNFEKNGLDDLLWTMNFGTGSNIDAQ 244
Query: 239 FTPNSNNKPK---MWTENWSGWFLSFGGAVPYRPVEDLAFAVARFFQRGGTFQNYYMYHG 295
F +P+ M +E WSGWF +G RP + + + + +F + YM HG
Sbjct: 245 FKRLKQLRPETPLMCSEFWSGWFDKWGARHETRPAKAMVEGINEMLSKNISF-SLYMTHG 303
Query: 296 GTNFDRTSG------GPFISTSYDYDAPLDEYGLIRQPKWGHLKDLHK 337
GT+F +G P + TSYDYDAP++EYG W K + K
Sbjct: 304 GTSFGHWAGANSPGFAPDV-TSYDYDAPINEYGHATPKFWELRKTMQK 350
>gi|420262409|ref|ZP_14765050.1| beta-galactosidase [Enterococcus sp. C1]
gi|394770166|gb|EJF49970.1| beta-galactosidase [Enterococcus sp. C1]
Length = 585
Score = 169 bits (429), Expect = 4e-39, Method: Compositional matrix adjust.
Identities = 123/351 (35%), Positives = 162/351 (46%), Gaps = 46/351 (13%)
Query: 43 LISGSIHYPRSTPEMWPDLIQKSKDGGLDVIETYVFWNLHEPVRNQYNFEGRYDLVKFVK 102
+ISG+IHY R PE W D ++K + G + +ETYV WNLHE Y FEG DL +F++
Sbjct: 19 VISGAIHYFRVVPEYWQDRLEKLRLMGCNTVETYVPWNLHEAQEGVYQFEGILDLRRFIQ 78
Query: 103 LVAEAGLYAHLRIGPYVCAEWNFGGFPLWLHFIPGIQFRTDNEPFKAEMQRFTAKIVDMM 162
E GLY LR PY+CAEW FGG P WL P ++ R D PF ++ R+ A + +
Sbjct: 79 TAQEVGLYVILRPAPYICAEWEFGGLPYWLLQDPMMKLRFDYPPFMEKITRYFAHLFPQV 138
Query: 163 KQEKLYASQGGPIILSQIENEYGNIDSAYGAAGKSYIKWAAGMALSLDTGVPWV------ 216
+ L +QGGPI++ Q+ENEYG+ A K Y++ P V
Sbjct: 139 RD--LQITQGGPILMMQVENEYGSY-----ANDKEYLRKMVAAMRQQGVETPLVTSDGPW 191
Query: 217 --MCQQSDAPD---PIINTCNGFYCDQFTP----NSNNKPKMWTENWSGWFLSFGGAVPY 267
M + D P IN C + F + +P M E W GWF ++G +
Sbjct: 192 HDMLENGSIKDLALPTIN-CGSNIKENFEKLRRFHGEKRPLMVMEFWIGWFDAWGDD--H 248
Query: 268 RPVEDLAFAVARF---FQRGGTFQNYYMYHGGTNFDRTSGGPFIS------TSYDYDAPL 318
A AV G N YM+HGGTNF +G + TSYDYDA L
Sbjct: 249 HHTTSTADAVKELQDCLAEGSV--NIYMFHGGTNFGFMNGSNYYERLAPDVTSYDYDALL 306
Query: 319 DEYGLIRQPKWGHLKDLHKAIKLCEAALVATDPTYPSLGPNLEATVYKTGS 369
E WG ++A K A A P +P L LE Y T S
Sbjct: 307 TE--------WGEPTAKYQAFKKV-IADYAEIPEFP-LSMKLERKAYGTFS 347
>gi|281337336|gb|EFB12920.1| hypothetical protein PANDA_005061 [Ailuropoda melanoleuca]
Length = 655
Score = 169 bits (429), Expect = 4e-39, Method: Compositional matrix adjust.
Identities = 120/376 (31%), Positives = 179/376 (47%), Gaps = 38/376 (10%)
Query: 36 IGGKRRVLISGSIHYPRSTPEMWPDLIQKSKDGGLDVIETYVFWNLHEPVRNQYNFEGRY 95
+GG + ++ GSIHY R E W D + K K G + + TYV WNLHEP R +++F
Sbjct: 78 LGGHKFLIFGGSIHYFRVPREYWRDRLMKLKACGFNTLTTYVPWNLHEPERGKFDFSENL 137
Query: 96 DLVKFVKLVAEAGLYAHLRIGPYVCAEWNFGGFPLWLHFIPGIQFRTDNEPFKAEMQRFT 155
DL FV + AE GL+ LR GPY+C+E + GG P WL P + RT + F + ++
Sbjct: 138 DLEAFVLMAAEIGLWVILRPGPYICSEIDLGGLPSWLLQDPEMILRTTYKGFVEAVDKYF 197
Query: 156 AKIVDMMKQEKLYASQGGPIILSQIENEYGNIDSAYGAAGKSYIKWAAGMALSLDTGVPW 215
++ + L +GGPII Q+ENEYG+ A K Y+ + L+ G+
Sbjct: 198 DHLIS--RVVPLQYHKGGPIIAVQVENEYGSF-----AVDKDYMPYVRKAL--LERGIVE 248
Query: 216 VMCQQSDAPDPI------------INTCNGFYCDQFTPNSNNKPKMWTENWSGWFLSFGG 263
++ DA + +NT +Q + NKP M E W GWF ++GG
Sbjct: 249 LLVTSDDAENLQKGYLEGVLATINMNTFEKSAFEQLSQLQRNKPIMVMEYWVGWFDTWGG 308
Query: 264 AVPYRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNFDRTSGGPF------ISTS------ 311
ED+ V++F +F N YM+HGGTNF +G + + TS
Sbjct: 309 KHMVNNAEDVEETVSKFITSEISF-NVYMFHGGTNFGFMNGATYFGIHRAVVTSYGKCLL 367
Query: 312 YDYDAPLDEYGLIRQPKWGHLKDLHK---AIKLCEAALVATDPTYPSLGPNLEATVYKTG 368
YDYDA L E G + K+ L+ L + A+ L + YPS+ P+L ++
Sbjct: 368 YDYDALLTEAGDYTK-KYFKLQRLFRSVLAMPLPPLPELTPKAKYPSVKPSLYLPLWDAL 426
Query: 369 SGLCSAFLANIGTNSD 384
L ++N N +
Sbjct: 427 QYLNEPVISNRPVNME 442
>gi|340346435|ref|ZP_08669560.1| family 35 glycosyl hydrolase [Prevotella dentalis DSM 3688]
gi|339611892|gb|EGQ16709.1| family 35 glycosyl hydrolase [Prevotella dentalis DSM 3688]
Length = 859
Score = 169 bits (429), Expect = 4e-39, Method: Compositional matrix adjust.
Identities = 110/327 (33%), Positives = 161/327 (49%), Gaps = 27/327 (8%)
Query: 35 VIGGKRRVLISGSIHYPRSTPEMWPDLIQKSKDGGLDVIETYVFWNLHEPVRNQYNFEGR 94
++ GK V+ + +HYPR W I+ K G++ + YVFWN+HE Q++F G+
Sbjct: 102 LLNGKPFVVKAAEVHYPRIPRPYWEQRIKMCKALGMNTLCLYVFWNIHEQREGQFDFTGQ 161
Query: 95 YDLVKFVKLVAEAGLYAHLRIGPYVCAEWNFGGFPLWLHFIPGIQFRTDNEPFKAEMQRF 154
D+ F +L + G+Y +R GPYVCAEW GG P WL I+ R + F ++ F
Sbjct: 162 NDVAAFCRLAQQNGMYVIVRPGPYVCAEWEMGGLPWWLLKKKDIRLREQDPYFMERVELF 221
Query: 155 TAKIVDMMKQEKLYASQGGPIILSQIENEYGNI--DSAYGAAGKSYIK--WA-------A 203
K+ + + L +GGPII+ Q+ENEYG+ D AY + + ++ W+
Sbjct: 222 EQKVAEQLA--PLTIRRGGPIIMVQVENEYGSYGEDKAYVSQIRDVLRRYWSLSPTGEGR 279
Query: 204 GMALS-LDTGVPWVMCQQSDAPDPIINTCN----GFYCDQFTPNSN---NKPKMWTENWS 255
G A S L W + D ++ T N DQF + PKM +E WS
Sbjct: 280 GEAASPLMFQCDWSSNFTRNGLDDLVWTMNFGTGANINDQFRRLGELRPDAPKMCSEFWS 339
Query: 256 GWFLSFGGAVPYRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNFDRTSGG--PFIS---T 310
GWF +G RP D+ + +G +F + YM HGGT+F +G P + T
Sbjct: 340 GWFDKWGARHETRPARDMVAGIDEMLSKGISF-SLYMTHGGTSFGHWAGANSPGFAPDVT 398
Query: 311 SYDYDAPLDEYGLIRQPKWGHLKDLHK 337
SYDYDAP++EYG W K + K
Sbjct: 399 SYDYDAPINEYGQATPKFWELRKTMEK 425
>gi|119588243|gb|EAW67839.1| hCG1729998, isoform CRA_d [Homo sapiens]
Length = 653
Score = 169 bits (429), Expect = 4e-39, Method: Compositional matrix adjust.
Identities = 134/452 (29%), Positives = 203/452 (44%), Gaps = 36/452 (7%)
Query: 36 IGGKRRVLISGSIHYPRSTPEMWPDLIQKSKDGGLDVIETYVFWNLHEPVRNQYNFEGRY 95
+ G + ++ GSIHY R E W D + K K G + + TYV WNLHEP R +++F G
Sbjct: 82 LEGHKFLIFGGSIHYFRVPREYWRDRLLKLKACGFNTVTTYVPWNLHEPERGKFDFSGNL 141
Query: 96 DLVKFVKLVAEAGLYAHLRIGPYVCAEWNFGGFPLWLHFIPGIQFRTDNEPFKAEMQRFT 155
DL FV + AE GL+ LR GPY+C+E + GG P WL P + RT N+ F ++++
Sbjct: 142 DLEAFVLMAAEIGLWVILRPGPYICSEMDLGGLPSWLLQDPRLLLRTTNKSFIEAVEKYF 201
Query: 156 AKIVDMMKQEKLYASQGGPIILSQIENEYG--NIDSAYGA-AGKSYIKWAAGMALSLDTG 212
++ + L Q GP+I Q+ENEYG N D Y K+ ++ L G
Sbjct: 202 DHLIP--RVIPLQYRQAGPVIAVQVENEYGSFNKDKTYMPYLHKALLRRGIVELLLTSDG 259
Query: 213 VPWVMCQQSDAPDPIIN--TCNGFYCDQFTPNSNNKPKMWTENWSGWFLSFGGAVPYRPV 270
V+ + IN + +Q +KP + E W GWF +G +
Sbjct: 260 EKHVLSGHTKGVLAAINLQKLHQDTFNQLHKVQRDKPLLIMEYWVGWFDRWGDKHHVKDA 319
Query: 271 EDLAFAVARFFQRGGTFQNYYMYHGGTNFDRTSGGPF------ISTSYDYDAPLDEYGLI 324
+++ AV+ F + +F N YM+HGGTNF +G + I TSYDYDA L E G
Sbjct: 320 KEVEHAVSEFIKYEISF-NVYMFHGGTNFGFMNGATYFGKHSGIVTSYDYDAVLTEAGDY 378
Query: 325 RQPKWGHLKDLHKAIKLCEAALVATDP---TYPSLGPNLEATVYKTGSGLCSAFLANIGT 381
+ K+ L+ L +++ V P YP + P+L ++ S L +
Sbjct: 379 TE-KYLKLQKLFQSVSATPLPRVPKLPPKAVYPPVRPSLYLPLWDALSYLNEPVRSRQPV 437
Query: 382 NSDVTVKFNGN--SYLLPAWSVSILP---------DCKNVVFNTAKI------NSVTLVP 424
N + NG+ SY L + SI D V + I N +P
Sbjct: 438 NMENLPINNGSGQSYGLVLYEKSICSGGRLRAHAHDMAQVFLDETMIGILNENNKDLHIP 497
Query: 425 SFSR-QSLQVAADSSDAIGSGWSYINEPVGIS 455
+ L++ ++ + W NE GI+
Sbjct: 498 ELRDCRYLRILVENQGRVNFSWQIQNEQKGIT 529
>gi|334138027|ref|ZP_08511451.1| beta-galactosidase [Paenibacillus sp. HGF7]
gi|333604560|gb|EGL15950.1| beta-galactosidase [Paenibacillus sp. HGF7]
Length = 601
Score = 169 bits (429), Expect = 5e-39, Method: Compositional matrix adjust.
Identities = 110/293 (37%), Positives = 147/293 (50%), Gaps = 16/293 (5%)
Query: 43 LISGSIHYPRSTPEMWPDLIQKSKDGGLDVIETYVFWNLHEPVRNQYNFEGRYDLVKFVK 102
+ISG++HY R PE W D + K K G + +ETYV WN+HEP +++F G D++ FV+
Sbjct: 20 IISGALHYFRVVPEYWRDRLLKMKACGCNTVETYVAWNVHEPEEGKFDFGGIADVIAFVE 79
Query: 103 LVAEAGLYAHLRIGPYVCAEWNFGGFPLWLHFIPGIQFRTDNEPFKAEMQRFTAKIVDMM 162
L E GL+ +R PY+CAEW FGG P WL +Q R + F A++ + V +
Sbjct: 80 LAGELGLHVIVRPSPYICAEWEFGGLPAWLLKDSEMQLRCSDPKFLAKVDAYYD--VLLP 137
Query: 163 KQEKLYASQGGPIILSQIENEYGNI--DSAY-GAAGKSYIKWAAGMALSLDTGVPWVMCQ 219
K L + GGPII Q+ENEYG+ D AY G I + L G M Q
Sbjct: 138 KFVPLLCTNGGPIIAMQVENEYGSYGNDKAYLGYLRDGMIARGIDVLLFTSDGPTDEMLQ 197
Query: 220 QSDAPDPIINTCNGFYCDQ----FTPNSNNKPKMWTENWSGWFLSFGGAVPYRPVEDLAF 275
PD + G ++ F ++P M E W+GWF + R ED A
Sbjct: 198 GGTLPDVLATVNFGSRPEESFAKFREYRPDEPLMCMEFWNGWFDHWMEEHHTRDGEDAAR 257
Query: 276 AVARFFQRGGTFQNYYMYHGGTNFDRTSGGPFIS------TSYDYDAPLDEYG 322
+ G + N+YM+HGGTNF SG I TSYDYDAPL E G
Sbjct: 258 VLDDMLGAGASV-NFYMFHGGTNFGFYSGANHIKTYEPTVTSYDYDAPLTERG 309
>gi|433651261|ref|YP_007277640.1| beta-galactosidase [Prevotella dentalis DSM 3688]
gi|433301794|gb|AGB27610.1| beta-galactosidase [Prevotella dentalis DSM 3688]
Length = 797
Score = 169 bits (429), Expect = 5e-39, Method: Compositional matrix adjust.
Identities = 110/327 (33%), Positives = 161/327 (49%), Gaps = 27/327 (8%)
Query: 35 VIGGKRRVLISGSIHYPRSTPEMWPDLIQKSKDGGLDVIETYVFWNLHEPVRNQYNFEGR 94
++ GK V+ + +HYPR W I+ K G++ + YVFWN+HE Q++F G+
Sbjct: 40 LLNGKPFVVKAAEVHYPRIPRPYWEQRIKMCKALGMNTLCLYVFWNIHEQREGQFDFTGQ 99
Query: 95 YDLVKFVKLVAEAGLYAHLRIGPYVCAEWNFGGFPLWLHFIPGIQFRTDNEPFKAEMQRF 154
D+ F +L + G+Y +R GPYVCAEW GG P WL I+ R + F ++ F
Sbjct: 100 NDVAAFCRLAQQNGMYVIVRPGPYVCAEWEMGGLPWWLLKKKDIRLREQDPYFMERVELF 159
Query: 155 TAKIVDMMKQEKLYASQGGPIILSQIENEYGNI--DSAYGAAGKSYIK--WA-------A 203
K+ + + L +GGPII+ Q+ENEYG+ D AY + + ++ W+
Sbjct: 160 EQKVAEQLA--PLTIRRGGPIIMVQVENEYGSYGEDKAYVSQIRDVLRRYWSLSPTGEGR 217
Query: 204 GMALS-LDTGVPWVMCQQSDAPDPIINTCN----GFYCDQFTPNSN---NKPKMWTENWS 255
G A S L W + D ++ T N DQF + PKM +E WS
Sbjct: 218 GEAASPLMFQCDWSSNFTRNGLDDLVWTMNFGTGANINDQFRRLGELRPDAPKMCSEFWS 277
Query: 256 GWFLSFGGAVPYRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNFDRTSGG--PFIS---T 310
GWF +G RP D+ + +G +F + YM HGGT+F +G P + T
Sbjct: 278 GWFDKWGARHETRPARDMVAGIDEMLSKGISF-SLYMTHGGTSFGHWAGANSPGFAPDVT 336
Query: 311 SYDYDAPLDEYGLIRQPKWGHLKDLHK 337
SYDYDAP++EYG W K + K
Sbjct: 337 SYDYDAPINEYGQATPKFWELRKTMEK 363
>gi|315606512|ref|ZP_07881527.1| family 35 glycosyl hydrolase [Prevotella buccae ATCC 33574]
gi|315251918|gb|EFU31892.1| family 35 glycosyl hydrolase [Prevotella buccae ATCC 33574]
Length = 787
Score = 169 bits (429), Expect = 5e-39, Method: Compositional matrix adjust.
Identities = 112/348 (32%), Positives = 171/348 (49%), Gaps = 27/348 (7%)
Query: 24 GANVTYDHRAVVIGGKRRVLISGSIHYPRSTPEMWPDLIQKSKDGGLDVIETYVFWNLHE 83
G T + ++ GK V+ + +HYPR W I+ K G++ + YVFWN+HE
Sbjct: 27 GGTFTVGDKTFLLNGKPFVVKAAELHYPRIPRPYWEHRIKMCKALGMNTVCLYVFWNIHE 86
Query: 84 PVRNQYNFEGRYDLVKFVKLVAEAGLYAHLRIGPYVCAEWNFGGFPLWLHFIPGIQFRTD 143
+++F G D+ +F +L GLY +R GPYVCAEW GG P WL I+ R
Sbjct: 87 QQEGRFDFTGNNDVAEFCRLAQRNGLYVIVRPGPYVCAEWEMGGLPWWLLKKKDIRLREP 146
Query: 144 NEPFKAEMQRFTAKIVDMMKQEKLYASQGGPIILSQIENEYGNI--DSAYGAAGKSYIKW 201
+ F ++ F K+ + + L GGPII+ Q+ENEYG+ + AY +A + ++
Sbjct: 147 DPYFMERVKLFERKVGEQLAS--LTIQNGGPIIMVQVENEYGSYGENKAYVSAIRDIVRQ 204
Query: 202 AAGMALSLDTGVPWVMCQQSDAPDPIINTCN---GFYCDQ-------FTPNSNNKPKMWT 251
+ ++L W + + D ++ T N G DQ PN+ P+M +
Sbjct: 205 SGFDKVTL-FQCDWASNFEKNGLDDLVWTMNFGTGADIDQQFRRLGELRPNA---PQMCS 260
Query: 252 ENWSGWFLSFGGAVPYRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNFDRTSGG--PFIS 309
E WSGWF +G RP + + + +G +F + YM HGGT+F +G P +
Sbjct: 261 EFWSGWFDKWGARHETRPAKAMVEGIDEMLSKGISF-SLYMTHGGTSFGHWAGANSPGFA 319
Query: 310 ---TSYDYDAPLDEYGLIRQPKWGHLKDLHKAIKLCEAALVATDPTYP 354
TSYDYDAP++EYG PK+ L+ H K + + P P
Sbjct: 320 PDVTSYDYDAPINEYGQA-TPKYWELR--HTMEKYNDGGKLPAPPKAP 364
>gi|445495533|ref|ZP_21462577.1| beta-galactosidase Bga [Janthinobacterium sp. HH01]
gi|444791694|gb|ELX13241.1| beta-galactosidase Bga [Janthinobacterium sp. HH01]
Length = 586
Score = 169 bits (429), Expect = 5e-39, Method: Compositional matrix adjust.
Identities = 120/364 (32%), Positives = 175/364 (48%), Gaps = 40/364 (10%)
Query: 36 IGGKRRVLISGSIHYPRSTPEMWPDLIQKSKDGGLDVIETYVFWNLHEPVRNQYNFEGRY 95
+ G+ ++SG++HY R PE+W D + K K GL+ +ETYV WNLHEP Q+ +EG
Sbjct: 17 LNGQPFRVLSGALHYFRVLPELWEDRLLKLKAMGLNTVETYVAWNLHEPAAGQFRYEGGL 76
Query: 96 DLVKFVKLVAEAGLYAHLRIGPYVCAEWNFGGFPLWLHFIPGIQFRTDNEPFKAEMQRFT 155
DL F++L GLY +R GP++CAEW FGG P WL P ++ R +P+ ++RF
Sbjct: 77 DLAAFIRLAESLGLYVIVRPGPFICAEWEFGGLPAWLLADPYMEVRCCYQPYLEAVRRFY 136
Query: 156 AKIVDMMKQEKLYASQGGPIILSQIENEYGNIDSAYGAAGKSYIKWAAGMALSLDTGVPW 215
++ + ++ +GGPI+ Q+ENEYG +YG + + Y+ W L LD GV
Sbjct: 137 DDLLPRLLPLQI--QRGGPILAMQVENEYG----SYG-SDQLYLTWLR--RLMLDGGVET 187
Query: 216 VMCQQSDAPDPIIN---TCNGFYCDQFTPNSNNK-----------PKMWTENWSGWFLSF 261
++ A D ++ + F + + P M E W+GWF +
Sbjct: 188 LLFTSDGATDHMLKHGTLAQVWKSANFGSRAEEEFAKLREYQPDGPLMCMEFWNGWFDHW 247
Query: 262 GGAVPYRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNFDRTSGG----------PFISTS 311
G R D A A+ R G N YM+HGGTNF +G P ++ S
Sbjct: 248 GEPHHTRDAADAADALERIMACGAHV-NVYMFHGGTNFGFMNGANTDLLTRDYQPTVN-S 305
Query: 312 YDYDAPLDEYGLIRQPKWGHLKDLHKAIKLCEAALVATDPTYPSLGPNLEATVYKTGSGL 371
YDYDAPLDE G L K ++L L A P ++A + +GL
Sbjct: 306 YDYDAPLDETGQPTAKFHAFRAVLEKHVQLPPMQLPAPAPRI-----AIDALTFDASAGL 360
Query: 372 CSAF 375
A
Sbjct: 361 WEAL 364
>gi|384513478|ref|YP_005708571.1| beta-galactosidase [Enterococcus faecalis OG1RF]
gi|430361754|ref|ZP_19426831.1| putative beta-galactosidase [Enterococcus faecalis OG1X]
gi|327535367|gb|AEA94201.1| beta-galactosidase [Enterococcus faecalis OG1RF]
gi|429512307|gb|ELA01915.1| putative beta-galactosidase [Enterococcus faecalis OG1X]
Length = 604
Score = 169 bits (429), Expect = 5e-39, Method: Compositional matrix adjust.
Identities = 116/324 (35%), Positives = 163/324 (50%), Gaps = 27/324 (8%)
Query: 35 VIGGKRRVLISGSIHYPRSTPEMWPDLIQKSKDGGLDVIETYVFWNLHEPVRNQYNFEGR 94
++ G+ ++SG+IHY R P W + K G + +ETYV WNLHEP + ++FEG
Sbjct: 21 LLNGQPFKILSGAIHYFRVDPSDWYHSLYNLKALGFNTVETYVPWNLHEPQKGTFHFEGI 80
Query: 95 YDLVKFVKLVAEAGLYAHLRIGPYVCAEWNFGGFPLWLHFIPGIQFRTDNEPFKAEMQRF 154
DL +F+KL E GLYA +R PY+CAEW FGGFP WL PG + R++N + + +
Sbjct: 81 LDLERFLKLAQELGLYAIVRPSPYICAEWEFGGFPAWLLNEPG-RMRSNNPTYLKHVAEY 139
Query: 155 TAKIVDMMKQEKLYASQGGPIILSQIENEYGNI--DSAYGAAGKSYIKWAAGMALSLDTG 212
+++ + +L GG I++ QIENEYG+ + AY A + + AL +
Sbjct: 140 YDVLMEKIVPHQL--VNGGNILMIQIENEYGSFGEEKAYLRAIRDLMIARGVTALFFTSD 197
Query: 213 VPW--VMCQQSDAPDPIINTCN---------GFYCDQFTPNSNNKPKMWTENWSGWFLSF 261
PW + S D I+ T N G F + P M E W GWF +
Sbjct: 198 GPWRATLRAGSMIEDDILVTGNFGSKAKENFGMMQAFFEEHGKKWPLMCMEFWDGWFNRW 257
Query: 262 GGAVPYRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNFDRTSGG--------PFISTSYD 313
+ R ++LA +V G N YM+HGGTNF +G P I TSYD
Sbjct: 258 KEPIIKRDPQELAESVREALALGSI--NLYMFHGGTNFGFMNGCSARGTIDLPQI-TSYD 314
Query: 314 YDAPLDEYGLIRQPKWGHLKDLHK 337
YDAPLDE G + + K LH+
Sbjct: 315 YDAPLDEQGNPTEKYFALQKMLHE 338
>gi|160887166|ref|ZP_02068169.1| hypothetical protein BACOVA_05182 [Bacteroides ovatus ATCC 8483]
gi|156107577|gb|EDO09322.1| glycosyl hydrolase family 35 [Bacteroides ovatus ATCC 8483]
Length = 777
Score = 169 bits (429), Expect = 5e-39, Method: Compositional matrix adjust.
Identities = 111/337 (32%), Positives = 168/337 (49%), Gaps = 33/337 (9%)
Query: 6 ILLLVLCWGFVVLATTSFGANVTYDHRAVVIGGKRRVLISGSIHYPRSTPEMWPDLIQKS 65
+++L + F++ + +S V + I GK LI G +HYPR E W D ++++
Sbjct: 11 LVILNIIVSFLISSCSSPKEQVRIGNGTFTIEGKDIQLICGEMHYPRIPHEYWRDRLKRA 70
Query: 66 KDGGLDVIETYVFWNLHEPVRNQYNFEGRYDLVKFVKLVAEAGLYAHLRIGPYVCAEWNF 125
GL+ + YVFWN HE +++F G+ D+ +F++ E GLY LR GPYVCAEW+F
Sbjct: 71 SAMGLNTVSAYVFWNFHERQPGEFDFSGQADIAEFIRTAQEEGLYVILRPGPYVCAEWDF 130
Query: 126 GGFPLWLHFIPGIQFRTDNEPFKAEMQRFTAKIVDMMKQ-EKLYASQGGPIILSQIENEY 184
GG+P WL + +R+ + F + +R+ I ++ KQ L + GG II+ Q+ENEY
Sbjct: 131 GGYPSWLLKEKDMTYRSKDPRFLSYCERY---IKELGKQLSPLTINNGGNIIMVQVENEY 187
Query: 185 GNIDSAYGAAGKSYIKWAAGMALSLDTGVPWVMCQ-----QSDAPDPIINTCNGFYCDQF 239
G+ AA K Y+ M VP C ++ + + T NG + +
Sbjct: 188 GSY-----AADKEYLAAIRDMIKEAGFNVPLFTCDGGGQVEAGHVEGALPTLNGVFGEDI 242
Query: 240 ----TPNSNNKPKMWTENWSGWFLSFG---GAVPY-RPVEDLAFAVARFFQRGGTFQNYY 291
P E + WF +G +V Y RP E L + ++ G + Y
Sbjct: 243 FKVVDKYQKGGPYFVAEFYPAWFDEWGRRHSSVAYERPAEQLDWMLSH-----GVSVSMY 297
Query: 292 MYHGGTNFDRTSGGPF------ISTSYDYDAPLDEYG 322
M+HGGTNF+ T+G TSYDYDAPL E+G
Sbjct: 298 MFHGGTNFEYTNGANTGGGYQPQPTSYDYDAPLGEWG 334
>gi|126347898|emb|CAJ89618.1| putative beta-galactosidase [Streptomyces ambofaciens ATCC 23877]
Length = 615
Score = 169 bits (429), Expect = 5e-39, Method: Compositional matrix adjust.
Identities = 109/338 (32%), Positives = 167/338 (49%), Gaps = 36/338 (10%)
Query: 25 ANVTYDHRAVVIGGKRRVLISGSIHYPRSTPEMWPDLIQKSKDGGLDVIETYVFWNLHEP 84
A +T+ H A + G+ ++SGS+HY R PE W D + + GL+ ++TYV WN HE
Sbjct: 23 ATLTHTHGAFLRRGRPHRVLSGSLHYFRVHPEQWADRLDRLAALGLNTVDTYVPWNFHER 82
Query: 85 VRNQYNFEGRYDLVKFVKLVAEAGLYAHLRIGPYVCAEWNFGGFPLWLHFIPGIQFRTDN 144
+ F+G DL +FV+L AGL +R GPY+CAEW+ GG P WL PG++ R +
Sbjct: 83 RPGEARFDGWRDLARFVRLAQRAGLDVMVRPGPYICAEWDNGGLPAWLTGTPGMRLRAGH 142
Query: 145 EPFKAEMQRFTAKIVDMMKQEKLYASQGGPIILSQIENEYGNIDSAYGAAGKSYIKWAAG 204
+P+ + R+ +V + + L A GGP++ QIENEYG+ + +Y++W
Sbjct: 143 QPYLDAVARWFDALVPRVAE--LQAVHGGPVVAVQIENEYGSYGDDH-----AYVRWVRD 195
Query: 205 MALSLDTGVPWVMCQQSDAPDPII---NTCNGFYCDQ------------FTPNSNNKPKM 249
+D G+ + +D P P++ T G +P +
Sbjct: 196 AL--VDRGIT-ELLYTADGPTPLMLDGGTVPGELAAATFGSRAAEAAALLRSRRPGEPFL 252
Query: 250 WTENWSGWFLSFGGAVPYRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNFDRTSGG---- 305
E W+GWF +G R + A V GG+ + YM HGGTNF +G
Sbjct: 253 CAEFWNGWFDHWGEKHHVRSRDGAAQEVEEILDAGGSV-SLYMAHGGTNFGLWAGANHDG 311
Query: 306 ----PFISTSYDYDAPLDEYGLIRQPKWGHLKDLHKAI 339
P + TSYD DAP+ E+G + PK+ L++ A+
Sbjct: 312 GVLRPTV-TSYDSDAPVSEHGAL-TPKFHALRERFAAL 347
>gi|325569852|ref|ZP_08145846.1| beta-galactosidase [Enterococcus casseliflavus ATCC 12755]
gi|325156975|gb|EGC69143.1| beta-galactosidase [Enterococcus casseliflavus ATCC 12755]
Length = 585
Score = 169 bits (428), Expect = 5e-39, Method: Compositional matrix adjust.
Identities = 123/351 (35%), Positives = 162/351 (46%), Gaps = 46/351 (13%)
Query: 43 LISGSIHYPRSTPEMWPDLIQKSKDGGLDVIETYVFWNLHEPVRNQYNFEGRYDLVKFVK 102
+ISG+IHY R PE W D ++K + G + +ETYV WNLHE Y FEG DL +F++
Sbjct: 19 VISGAIHYFRVVPEYWQDRLEKLRLMGCNTVETYVPWNLHEAQEGVYQFEGILDLRRFIQ 78
Query: 103 LVAEAGLYAHLRIGPYVCAEWNFGGFPLWLHFIPGIQFRTDNEPFKAEMQRFTAKIVDMM 162
E GLY LR PY+CAEW FGG P WL P ++ R D PF ++ R+ A + +
Sbjct: 79 TAQEVGLYVILRPAPYICAEWEFGGLPYWLLQDPMMKLRFDYPPFMEKITRYFAHLFPQV 138
Query: 163 KQEKLYASQGGPIILSQIENEYGNIDSAYGAAGKSYIKWAAGMALSLDTGVPWV------ 216
+ L +QGGPI++ Q+ENEYG+ A K Y++ P V
Sbjct: 139 RD--LQITQGGPILMMQVENEYGSY-----ANDKEYLRKMVAAMRQQGVETPLVTSDGPW 191
Query: 217 --MCQQSDAPD---PIINTCNGFYCDQFTP----NSNNKPKMWTENWSGWFLSFGGAVPY 267
M + D P IN C + F + +P M E W GWF ++G +
Sbjct: 192 HDMLENGTIKDLALPTIN-CGSNIKENFEKLRRFHGEKRPLMVMEFWIGWFDAWGDD--H 248
Query: 268 RPVEDLAFAVARF---FQRGGTFQNYYMYHGGTNFDRTSGGPFIS------TSYDYDAPL 318
A AV G N YM+HGGTNF +G + TSYDYDA L
Sbjct: 249 HHTTSTADAVKELQDCLAEGSV--NIYMFHGGTNFGFMNGSNYYERLAPDVTSYDYDALL 306
Query: 319 DEYGLIRQPKWGHLKDLHKAIKLCEAALVATDPTYPSLGPNLEATVYKTGS 369
E WG ++A K A A P +P L LE Y T S
Sbjct: 307 TE--------WGEPTAKYQAFKKV-IADYAEIPEFP-LSMKLERKAYGTFS 347
>gi|332264040|ref|XP_003281056.1| PREDICTED: beta-galactosidase-1-like protein 3 [Nomascus
leucogenys]
Length = 655
Score = 169 bits (428), Expect = 5e-39, Method: Compositional matrix adjust.
Identities = 115/341 (33%), Positives = 169/341 (49%), Gaps = 22/341 (6%)
Query: 36 IGGKRRVLISGSIHYPRSTPEMWPDLIQKSKDGGLDVIETYVFWNLHEPVRNQYNFEGRY 95
+ G + ++ GSIHY R E W D + K K G + + TYV WNLHEP R +++F G
Sbjct: 82 LEGHKFLIFGGSIHYFRVPREYWRDRLLKLKACGFNTVTTYVPWNLHEPERGKFDFSGNM 141
Query: 96 DLVKFVKLVAEAGLYAHLRIGPYVCAEWNFGGFPLWLHFIPGIQFRTDNEPFKAEMQRFT 155
DL FV + AE GL+ LR GPY+C+E + GG P WL P + RT N+ F ++++
Sbjct: 142 DLEAFVLMAAEIGLWVILRPGPYICSEMDLGGLPSWLLQDPQLLLRTTNKGFIEAVEKYF 201
Query: 156 AKIVDMMKQEKLYASQGGPIILSQIENEYG--NIDSAYGA-AGKSYIKWAAGMALSLDTG 212
++ + L QGGP+I Q+ENEYG N D Y K+ ++ L G
Sbjct: 202 DHLIP--RVIPLQYRQGGPVIAVQVENEYGSFNKDKTYMPYLHKALLRRGIVELLLTSDG 259
Query: 213 VPWVMCQQSDAPDPIINT----CNGFYCDQFTPNSNNKPKMWTENWSGWFLSFGGAVPYR 268
V+ + IN N F Q +KP + E W GWF +G +
Sbjct: 260 EKHVLSGHTKGVLAAINLQKLHQNTF--SQLHKVQRDKPLLIMEYWVGWFDRWGDKHHVK 317
Query: 269 PVEDLAFAVARFFQRGGTFQNYYMYHGGTNFDRTSGGPF------ISTSYDYDAPLDEYG 322
+++ AV+ F + +F N YM+HGGTNF +G + I TSYDYDA L E G
Sbjct: 318 DAKEVEHAVSEFIKYEISF-NVYMFHGGTNFGFMNGATYFGKHTGIVTSYDYDAVLTEAG 376
Query: 323 LIRQPKWGHLKDLHKAIK---LCEAALVATDPTYPSLGPNL 360
+ K+ L+ L +++ L + + YP + P+L
Sbjct: 377 DYTE-KYFKLQKLFESVSATPLPQVPKLTPKAVYPPMRPSL 416
>gi|423295092|ref|ZP_17273219.1| hypothetical protein HMPREF1070_01884 [Bacteroides ovatus
CL03T12C18]
gi|392673998|gb|EIY67449.1| hypothetical protein HMPREF1070_01884 [Bacteroides ovatus
CL03T12C18]
Length = 775
Score = 169 bits (428), Expect = 5e-39, Method: Compositional matrix adjust.
Identities = 111/337 (32%), Positives = 168/337 (49%), Gaps = 33/337 (9%)
Query: 6 ILLLVLCWGFVVLATTSFGANVTYDHRAVVIGGKRRVLISGSIHYPRSTPEMWPDLIQKS 65
+++L + F++ + +S V + I GK LI G +HYPR E W D ++++
Sbjct: 9 LVILNIIVSFLISSCSSPKEQVRIGNGTFTIEGKDIQLICGEMHYPRIPHEYWRDRLKRA 68
Query: 66 KDGGLDVIETYVFWNLHEPVRNQYNFEGRYDLVKFVKLVAEAGLYAHLRIGPYVCAEWNF 125
GL+ + YVFWN HE +++F G+ D+ +F++ E GLY LR GPYVCAEW+F
Sbjct: 69 SAMGLNTVSAYVFWNFHERQPGEFDFSGQADIAEFIRTAQEEGLYVILRPGPYVCAEWDF 128
Query: 126 GGFPLWLHFIPGIQFRTDNEPFKAEMQRFTAKIVDMMKQ-EKLYASQGGPIILSQIENEY 184
GG+P WL + +R+ + F + +R+ I ++ KQ L + GG II+ Q+ENEY
Sbjct: 129 GGYPSWLLKEKDMTYRSKDPRFLSYCERY---IKELGKQLSPLTINNGGNIIMVQVENEY 185
Query: 185 GNIDSAYGAAGKSYIKWAAGMALSLDTGVPWVMCQ-----QSDAPDPIINTCNGFYCDQF 239
G+ AA K Y+ M VP C ++ + + T NG + +
Sbjct: 186 GSY-----AADKEYLAAIRDMIKEAGFNVPLFTCDGGGQVEAGHVEGALPTLNGVFGEDI 240
Query: 240 ----TPNSNNKPKMWTENWSGWFLSFG---GAVPY-RPVEDLAFAVARFFQRGGTFQNYY 291
P E + WF +G +V Y RP E L + ++ G + Y
Sbjct: 241 FKVVDKYQKGGPYFVAEFYPAWFDEWGRRHSSVAYERPAEQLDWMLSH-----GVSVSMY 295
Query: 292 MYHGGTNFDRTSGGPF------ISTSYDYDAPLDEYG 322
M+HGGTNF+ T+G TSYDYDAPL E+G
Sbjct: 296 MFHGGTNFEYTNGANTGGGYQPQPTSYDYDAPLGEWG 332
>gi|297727459|ref|NP_001176093.1| Os10g0340600 [Oryza sativa Japonica Group]
gi|255679317|dbj|BAH94821.1| Os10g0340600 [Oryza sativa Japonica Group]
Length = 143
Score = 169 bits (428), Expect = 5e-39, Method: Composition-based stats.
Identities = 68/104 (65%), Positives = 89/104 (85%)
Query: 27 VTYDHRAVVIGGKRRVLISGSIHYPRSTPEMWPDLIQKSKDGGLDVIETYVFWNLHEPVR 86
V+YD R++++ G+RR++ISGSIHYPRSTPEMWPDLI+K+K+GGL+ IETYVFWN HEP R
Sbjct: 31 VSYDGRSLILDGERRIVISGSIHYPRSTPEMWPDLIKKAKEGGLNAIETYVFWNGHEPRR 90
Query: 87 NQYNFEGRYDLVKFVKLVAEAGLYAHLRIGPYVCAEWNFGGFPL 130
++NFEG YD+V+F K + AG+YA LRIGPY+C EWN+G P+
Sbjct: 91 REFNFEGNYDVVRFFKEIQNAGMYAILRIGPYICGEWNYGYMPM 134
>gi|430368510|ref|ZP_19428251.1| beta-galactosidase [Enterococcus faecalis M7]
gi|429516266|gb|ELA05760.1| beta-galactosidase [Enterococcus faecalis M7]
Length = 594
Score = 169 bits (428), Expect = 6e-39, Method: Compositional matrix adjust.
Identities = 116/324 (35%), Positives = 163/324 (50%), Gaps = 27/324 (8%)
Query: 35 VIGGKRRVLISGSIHYPRSTPEMWPDLIQKSKDGGLDVIETYVFWNLHEPVRNQYNFEGR 94
++ G+ ++SG+IHY R P W + K G + +ETYV WNLHEP + ++FEG
Sbjct: 11 LLNGQPFKILSGAIHYFRVDPSDWYHSLYNLKALGFNTVETYVPWNLHEPQKGTFHFEGI 70
Query: 95 YDLVKFVKLVAEAGLYAHLRIGPYVCAEWNFGGFPLWLHFIPGIQFRTDNEPFKAEMQRF 154
DL +F+KL E GLYA +R PY+CAEW FGGFP WL PG + R++N + + +
Sbjct: 71 LDLERFLKLAQELGLYAIVRPSPYICAEWEFGGFPAWLLNEPG-RMRSNNPTYLKHVAEY 129
Query: 155 TAKIVDMMKQEKLYASQGGPIILSQIENEYGNI--DSAYGAAGKSYIKWAAGMALSLDTG 212
+++ + +L GG I++ QIENEYG+ + AY A + + AL +
Sbjct: 130 YDVLMEKIVPHQL--VNGGNILMIQIENEYGSFGEEKAYLRAIRDLMIARGVTALFFTSD 187
Query: 213 VPW--VMCQQSDAPDPIINTCN---------GFYCDQFTPNSNNKPKMWTENWSGWFLSF 261
PW + S D I+ T N G F + P M E W GWF +
Sbjct: 188 GPWRATLRAGSMIEDDILVTGNFGSKAKENFGMMQAFFEEHGKKWPLMCMEFWDGWFNRW 247
Query: 262 GGAVPYRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNFDRTSGG--------PFISTSYD 313
+ R ++LA +V G N YM+HGGTNF +G P I TSYD
Sbjct: 248 KEPIIKRDPQELAESVREALALGSI--NLYMFHGGTNFGFMNGCSARGTIDLPQI-TSYD 304
Query: 314 YDAPLDEYGLIRQPKWGHLKDLHK 337
YDAPLDE G + + K LH+
Sbjct: 305 YDAPLDEQGNPTEKYFALQKMLHE 328
>gi|336319932|ref|YP_004599900.1| Beta-galactosidase [[Cellvibrio] gilvus ATCC 13127]
gi|336103513|gb|AEI11332.1| Beta-galactosidase [[Cellvibrio] gilvus ATCC 13127]
Length = 586
Score = 169 bits (428), Expect = 6e-39, Method: Compositional matrix adjust.
Identities = 105/306 (34%), Positives = 159/306 (51%), Gaps = 26/306 (8%)
Query: 35 VIGGKRRVLISGSIHYPRSTPEMWPDLIQKSKDGGLDVIETYVFWNLHEPVRNQYNFEGR 94
++ G+ ++SG++HY R P++W D I+K++ GL+ IETYV WN H P R ++ G
Sbjct: 12 LLDGEPLQILSGALHYFRVHPDLWADRIRKARLMGLNTIETYVAWNAHAPERGVFDLTGN 71
Query: 95 YDLVKFVKLVAEAGLYAHLRIGPYVCAEWNFGGFPLWLHFIPGIQFRTDNEPFKAEMQRF 154
DL +F+ LVA GL+A +R GPY+CAEW+ GG P WL PG+ RT + + +
Sbjct: 72 LDLGRFLDLVAAEGLHAIVRPGPYICAEWDNGGLPAWLMATPGVGVRTAEPQYLEAIAGY 131
Query: 155 TAKIVDMMKQEKLYASQGGPIILSQIENEYGNIDSAYGAAGKSYIKWAAGMALSLDTGVP 214
+I+ ++ ++ ++GGP+++ Q+ENEYG AYG Y++ M VP
Sbjct: 132 YDEILAVVAPRQV--TRGGPVLMVQVENEYG----AYG-DDADYLRALVTMMRERGIEVP 184
Query: 215 WVMCQQSDAPD------PIINTCNGF------YCDQFTPNSNNKPKMWTENWSGWFLSFG 262
C Q++ P ++ F + + P M E W GWF S+G
Sbjct: 185 LTTCDQANDEMLGRGGLPELHKTATFGSRSPERLETLRRHQPTGPLMCMEYWDGWFDSWG 244
Query: 263 GAVPYRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNFDRTSG----GPF--ISTSYDYDA 316
+ A + +G + N YM+HGGTN T+G G + I+TSYDYDA
Sbjct: 245 EQHHTTDAAEAAADLDLLLSQGAS-ANLYMFHGGTNLGFTNGANDKGTYLPITTSYDYDA 303
Query: 317 PLDEYG 322
PL E G
Sbjct: 304 PLAEDG 309
>gi|242078611|ref|XP_002444074.1| hypothetical protein SORBIDRAFT_07g006936 [Sorghum bicolor]
gi|241940424|gb|EES13569.1| hypothetical protein SORBIDRAFT_07g006936 [Sorghum bicolor]
Length = 147
Score = 169 bits (427), Expect = 7e-39, Method: Compositional matrix adjust.
Identities = 82/148 (55%), Positives = 108/148 (72%), Gaps = 1/148 (0%)
Query: 701 YHVPRSWLKSSGNTLVLFEEIGGDPTKISFVTKQLGSSLCSHVTDSHPLPVDMWGSDSKI 760
YHVP +L+ N +VLFE+ GGDP+KISFV +Q S+C+ V++ HP +D W S +
Sbjct: 1 YHVPCLFLQPGSNDIVLFEQFGGDPSKISFVIRQT-RSVCAQVSEEHPAQIDSWNSSQQT 59
Query: 761 QRKPGPVLSLECPNPNQVISSIKFASFGTPLGTCGSFSRGRCSSARSLSVVRQACVGSKS 820
++ P L LECP QVISSIKFASFGTP GTCGS+S G CSS +++SVV++AC+G +
Sbjct: 60 MQRYRPELRLECPKDGQVISSIKFASFGTPSGTCGSYSHGECSSTQAISVVQEACIGVSN 119
Query: 821 CSIGVSVNTFGDPCKGVMKSLAVEASCT 848
CS+ VS N FG+P GV KSLAVEA+C+
Sbjct: 120 CSVPVSSNYFGNPWTGVTKSLAVEAACS 147
>gi|299142590|ref|ZP_07035721.1| beta-galactosidase (Lactase) [Prevotella oris C735]
gi|298576025|gb|EFI47900.1| beta-galactosidase (Lactase) [Prevotella oris C735]
Length = 823
Score = 169 bits (427), Expect = 7e-39, Method: Compositional matrix adjust.
Identities = 108/351 (30%), Positives = 169/351 (48%), Gaps = 26/351 (7%)
Query: 5 EILLLVLCWGFVVLATTSFGANVTYDHRAVVIGGKRRVLISGSIHYPRSTPEMWPDLIQK 64
+ ++ L L + G + T ++ G+ V+ + +HYPR W I+
Sbjct: 47 KTVIATLVLSLATLTAPARGGDFTVGKNTFLLNGQPFVVKAAELHYPRIPRPYWEQRIKM 106
Query: 65 SKDGGLDVIETYVFWNLHEPVRNQYNFEGRYDLVKFVKLVAEAGLYAHLRIGPYVCAEWN 124
K G++ + YVFWN+HE +++F G D+ F +L + G+Y +R GPYVCAEW
Sbjct: 107 CKSLGMNTVCLYVFWNIHEQQEGKFDFTGNNDVAAFCRLAQKNGMYVIVRPGPYVCAEWE 166
Query: 125 FGGFPLWLHFIPGIQFRTDNEPFKAEMQRFTAKIVDMMKQEKLYASQGGPIILSQIENEY 184
GG P WL I+ R D+ F A ++ F A++ + L GGPII+ Q+ENEY
Sbjct: 167 MGGLPWWLLKKKDIRLREDDPYFMARVKAFEAEVGRQLA--PLTIQNGGPIIMVQVENEY 224
Query: 185 GN--IDSAYGAAGKSYIKWAAGMALSLDTGVPWVMCQQSDAPDPIINTCN---GFYCD-- 237
G+ ++ Y + + +K A+G W +++ D ++ T N G D
Sbjct: 225 GSYGVNKKYVSQIRDIVK-ASGFDKVTLFQCDWASNFENNGLDDLVWTMNFGTGSNIDAQ 283
Query: 238 -----QFTPNSNNKPKMWTENWSGWFLSFGGAVPYRPVEDLAFAVARFFQRGGTFQNYYM 292
Q P++ P M +E WSGWF +G RP + + + + +F + YM
Sbjct: 284 FKRLKQLRPDA---PLMCSEFWSGWFDKWGARHETRPAKAMVEGIDEMLSKNISF-SLYM 339
Query: 293 YHGGTNFDRTSG------GPFISTSYDYDAPLDEYGLIRQPKWGHLKDLHK 337
HGGT+F +G P + TSYDYDAP++EYG W K + K
Sbjct: 340 THGGTSFGHWAGANSPGFAPDV-TSYDYDAPINEYGHATPKFWELRKTMQK 389
>gi|257865837|ref|ZP_05645490.1| 35 glycosylhydrolase [Enterococcus casseliflavus EC30]
gi|257872172|ref|ZP_05651825.1| 35 glycosylhydrolase [Enterococcus casseliflavus EC10]
gi|257799771|gb|EEV28823.1| 35 glycosylhydrolase [Enterococcus casseliflavus EC30]
gi|257806336|gb|EEV35158.1| 35 glycosylhydrolase [Enterococcus casseliflavus EC10]
Length = 585
Score = 169 bits (427), Expect = 8e-39, Method: Compositional matrix adjust.
Identities = 120/349 (34%), Positives = 162/349 (46%), Gaps = 42/349 (12%)
Query: 43 LISGSIHYPRSTPEMWPDLIQKSKDGGLDVIETYVFWNLHEPVRNQYNFEGRYDLVKFVK 102
+ISG+IHY R PE W D ++K + G + +ETYV WNLHE Y F+G DL +F++
Sbjct: 19 VISGAIHYFRVVPEYWQDRLEKLRLMGCNTVETYVPWNLHEAQEGVYQFDGILDLRRFIQ 78
Query: 103 LVAEAGLYAHLRIGPYVCAEWNFGGFPLWLHFIPGIQFRTDNEPFKAEMQRFTAKIVDMM 162
E GLY LR PY+CAEW FGG P WL P ++ R D PF ++ R+ A + +
Sbjct: 79 TAQEVGLYVILRPAPYICAEWEFGGLPYWLLQDPMMKLRFDYPPFMEKITRYFAHLFPQV 138
Query: 163 KQEKLYASQGGPIILSQIENEYGNIDSAYGAAGKSYIKWAAGMALSLDTGVPWV------ 216
+ L +QGGPII+ Q+ENEYG+ A K Y++ P V
Sbjct: 139 RD--LQITQGGPIIMMQVENEYGSY-----ANDKEYLRKMVAAMRQHGVETPLVTSDGPW 191
Query: 217 --MCQQSDAPD---PIINTCNGFYCDQFTP----NSNNKPKMWTENWSGWFLSFGGAVPY 267
M + D P IN C + F + +P M E W GWF ++G +
Sbjct: 192 HDMLENGSIKDLALPTIN-CGSNIKENFEKLRRFHGEKRPLMVMEFWIGWFDAWGDDQHH 250
Query: 268 -RPVEDLAFAVARFFQRGGTFQNYYMYHGGTNFDRTSGGPFIS------TSYDYDAPLDE 320
+D + G N YM+HGGTNF +G + TSYDYDA L E
Sbjct: 251 TTSTQDAVKELQDCLALGSV--NIYMFHGGTNFGFMNGSNYYERLAPDVTSYDYDALLTE 308
Query: 321 YGLIRQPKWGHLKDLHKAIKLCEAALVATDPTYPSLGPNLEATVYKTGS 369
WG ++A K A A P +P L +E Y T S
Sbjct: 309 --------WGEPTAKYQAFKKV-IADYAEIPEFP-LSMEIERKAYGTFS 347
>gi|443697452|gb|ELT97928.1| hypothetical protein CAPTEDRAFT_112460 [Capitella teleta]
Length = 651
Score = 169 bits (427), Expect = 9e-39, Method: Compositional matrix adjust.
Identities = 110/304 (36%), Positives = 153/304 (50%), Gaps = 25/304 (8%)
Query: 36 IGGKRRVLISGSIHYPRSTPEMWPDLIQKSKDGGLDVIETYVFWNLHEPVRNQYNFEGRY 95
+ K ++SG++HY R PE W D + + K GL+ +ETYV WNLHE + ++ F G
Sbjct: 65 LDNKELRILSGAMHYFRIVPEYWLDRLTRMKAAGLNTVETYVPWNLHEEIHGEFVFTGML 124
Query: 96 DLVKFVKLVAEAGLYAHLRIGPYVCAEWNFGGFPLWLHFIPGIQFRTDNEPF----KAEM 151
D+ +FV + + GL LR GP++C+EW FGG P WL P + R+ PF ++ M
Sbjct: 125 DIRRFVAIAEKVGLLVILRPGPFICSEWEFGGLPSWLLRDPQMDVRSTYRPFMDAARSYM 184
Query: 152 QRFTAKIVDMMKQEKLYASQGGPIILSQIENEYGNI--DSAYGAAGKSYIKWAAGMALSL 209
+ +++ DM Q GGPII QIENEYG+ D Y K+ I +G+ L
Sbjct: 185 RSLISELEDMQYQ------YGGPIIAMQIENEYGSYSDDVNYMQELKN-IMTDSGVIEIL 237
Query: 210 DTGVPWVMCQQSDAPDPIINTC------NGFYCDQFTPNSNNKPKMWTENWSGWFLSFGG 263
T Q P + T G D+ KP M E WSGWF +
Sbjct: 238 FTSDNKHGLQPGRVPGVFMTTNFKNTNEGGRMFDKLHELQPGKPLMVMEFWSGWFDHWEE 297
Query: 264 AVPYRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNFDRTSGG---PFIS--TSYDYDAPL 318
+E+ A AV Q+G + N YM+HGGTNF +G P++ TSYDYD+PL
Sbjct: 298 KHHTMSLEEYASAVEYILQQGSSI-NLYMFHGGTNFGFLNGANTEPYLPTVTSYDYDSPL 356
Query: 319 DEYG 322
E G
Sbjct: 357 SEAG 360
>gi|423252157|ref|ZP_17233159.1| hypothetical protein HMPREF1066_04169 [Bacteroides fragilis
CL03T00C08]
gi|423252477|ref|ZP_17233408.1| hypothetical protein HMPREF1067_00052 [Bacteroides fragilis
CL03T12C07]
gi|392647903|gb|EIY41596.1| hypothetical protein HMPREF1066_04169 [Bacteroides fragilis
CL03T00C08]
gi|392660553|gb|EIY54162.1| hypothetical protein HMPREF1067_00052 [Bacteroides fragilis
CL03T12C07]
Length = 628
Score = 168 bits (426), Expect = 9e-39, Method: Compositional matrix adjust.
Identities = 117/359 (32%), Positives = 172/359 (47%), Gaps = 42/359 (11%)
Query: 38 GKRRVLISGSIHYPRSTPEMWPDLIQKSKDGGLDVIETYVFWNLHEPVRNQYNFEGRYDL 97
GK ++SG +HY R + W +Q K GL+ + TYVFWNLHEP +++F G +L
Sbjct: 38 GKITPVLSGEMHYARIPHQYWRHRLQMMKGMGLNTVATYVFWNLHEPEPGKWDFTGDKNL 97
Query: 98 VKFVKLVAEAGLYAHLRIGPYVCAEWNFGGFPLWLHFIPGIQFRTDNEPFKAEMQRFTAK 157
+F+K+ E G+ LR GPYVCAEW FGG+P WL + G++ R DN E ++T
Sbjct: 98 AEFIKIAGEEGMMVILRPGPYVCAEWEFGGYPWWLQNVKGMEIRRDN----PEFLKYTKA 153
Query: 158 IVDMMKQE--KLYASQGGPIILSQIENEYGNI-----------DSAYGAAGKSYIKWAAG 204
+D + +E L ++GGPI++ Q ENE+G+ AY A K + A
Sbjct: 154 YIDRLYKEVGSLQCTKGGPIVMVQCENEFGSYVAQRKDIPLEEHRAYNAKIKQQLADAGF 213
Query: 205 MALSLDTGVPWVMCQQSDAPDPIINTCNGF--------YCDQFTPNSNNKPKMWTENWSG 256
+ W+ + A + T NG DQ+ + P M E + G
Sbjct: 214 NVPLFTSDGSWLF--EGGATPGALPTANGESDIENLKKVVDQY--HDGKGPYMVAEFYPG 269
Query: 257 WFLSFGGAVPYRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNFDRTSGGPFIS------- 309
W + P +A ++ Q +F N+YM HGGTNF TSG +
Sbjct: 270 WLSHWAEPFPQIGASGIARQTEKYLQNDVSF-NFYMVHGGTNFGFTSGANYDKKRDIQPD 328
Query: 310 -TSYDYDAPLDEYGLIRQPKWGHLKD-LHKAIKLCEAALVATDPT--YPSLGPNLEATV 364
TSYDYDAP+ E G + PK+ +++ + K +K A +P PS+ N A V
Sbjct: 329 MTSYDYDAPISEAGWV-TPKYDSIRNVIKKYVKYTIPEAPAPNPVIEIPSIQLNKVADV 386
Score = 49.7 bits (117), Expect = 0.007, Method: Compositional matrix adjust.
Identities = 56/229 (24%), Positives = 87/229 (37%), Gaps = 50/229 (21%)
Query: 502 LHVQSLGHALHAFINGKLVGSGYGSSSNAKVTVDFPIALAPGKNTFDLLSLTVGLQNYGA 561
L + L +++G+ VG ++ + ++ P T +L +G NYG+
Sbjct: 428 LEIPGLRDYAVVYVDGEQVGVLNRNTKTYSMEIEVPF-----NATLQILVENMGRINYGS 482
Query: 562 FYEKTGAGITGPVQLKGSGNGTNIDLSSQQWTYQTGLKGEELNFPSGSSTQWDSKSTLP- 620
GI PVQ+ G D+ YQ + P + + D+ +P
Sbjct: 483 EIVHNTKGIISPVQIAGKEIVGGWDM------YQLPMD----EMPDLTKLKADTHKNVPS 532
Query: 621 ---KLQPL-VWYKTTFDAPAGSEPVAIDFTGMGKGEAWVNGQSIGRYWPTYVSQNGGCTD 676
KL+ V Y+ TF + +D GKG +VNG +IGRYW
Sbjct: 533 EVAKLKGCPVLYEGTFTLDKVGD-TFMDMESWGKGIVFVNGVNIGRYWKV---------- 581
Query: 677 SCNYRGAYSSNKCLKNCGKPSQSLYHVPRSWLKSSGNTLVLFEEIGGDP 725
P Q+LY VP WLK N +V+FE++ P
Sbjct: 582 ------------------GPQQTLY-VPGVWLKKGENKIVIFEQLNETP 611
>gi|319893645|ref|YP_004150520.1| beta-galactosidase 3 [Staphylococcus pseudintermedius HKU10-03]
gi|386318129|ref|YP_006014292.1| glycosyl hydrolase [Staphylococcus pseudintermedius ED99]
gi|317163341|gb|ADV06884.1| Beta-galactosidase 3 [Staphylococcus pseudintermedius HKU10-03]
gi|323463300|gb|ADX75453.1| glycosyl hydrolase, family 35 [Staphylococcus pseudintermedius
ED99]
Length = 590
Score = 168 bits (426), Expect = 9e-39, Method: Compositional matrix adjust.
Identities = 110/305 (36%), Positives = 154/305 (50%), Gaps = 34/305 (11%)
Query: 43 LISGSIHYPRSTPEMWPDLIQKSKDGGLDVIETYVFWNLHEPVRNQYNFEGRYDLVKFVK 102
++SG+IHY R + W D + K G + +ETYV WN HE + N+Y+F+G DL F++
Sbjct: 19 ILSGAIHYFRIPKDDWEDSLYNLKALGFNTVETYVPWNFHETIENEYDFKGHKDLKHFIE 78
Query: 103 LVAEAGLYAHLRIGPYVCAEWNFGGFPLWLHFIPGIQFRTDNEPFKAEMQRFTAKIVDMM 162
L A+ GLY +R PY+CAEW FGGFP WL ++ R+ +E + +++++ ++ ++
Sbjct: 79 LAAKLGLYVIVRPSPYICAEWEFGGFPAWLLNDRTMRIRSRDEKYLEKVKKYYHELFKIL 138
Query: 163 KQEKLYASQGGPIILSQIENEYGNIDSAYGAAGKSYIKWAAGMALSLDTGVP-------W 215
L QGGPII+ Q+ENEYG+ Y++ A M VP W
Sbjct: 139 T--PLQIDQGGPIIMMQVENEYGSF-----GQDHDYLRSLAHMMREEGVTVPFFTSDGAW 191
Query: 216 VMCQQSDA--PDPIINTCN-GFYCDQFTPN--------SNNKPKMWTENWSGWFLSFGGA 264
C ++ + D I+ T N G Q N S P M E W GWF +G
Sbjct: 192 DQCLRAGSLIEDDILPTGNFGSRTVQNFENLKTFQQEFSKKWPLMCMEFWDGWFNRWGEP 251
Query: 265 VPYRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNF-------DRTSGGPFISTSYDYDAP 317
V R +DLA V + G N YM+HGGTNF R + TSYDY AP
Sbjct: 252 VIKRDSDDLAEEVRDAVKLGSL--NLYMFHGGTNFGFWNGCSARGTKDLPQVTSYDYHAP 309
Query: 318 LDEYG 322
LDE G
Sbjct: 310 LDEAG 314
Score = 68.6 bits (166), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 62/230 (26%), Positives = 95/230 (41%), Gaps = 49/230 (21%)
Query: 502 LHVQSLGHALHAFINGKLVGSGYGSSSNAKVTVDFPIALAPGKNTFDLLSLTVGLQNYGA 561
L + +H F++ + V + Y ++ F + L + D+L +G NYG
Sbjct: 402 LRIVDARDRVHCFVDQQHVYTAY----QEEIGDQFEVTLTSDQPQIDVLIENMGRVNYGY 457
Query: 562 FYEKTGAGITGPVQLKGSGNGTNIDLSSQQWTYQTGLKGEELNFPSGSSTQWDSKSTLPK 621
+ P Q KG G G DL Q G + +++F ++ + K +
Sbjct: 458 -------KLLAPTQRKGLGQGLMQDLHFVQ-----GWEQFDIDFDRLTANHF--KREWSE 503
Query: 622 LQPLVWYKTTFDAPAGSEPVAIDFTGMGKGEAWVNGQSIGRYWPTYVSQNGGCTDSCNYR 681
QP +YK TFD A S ID +G GKG VNG +IGRYW
Sbjct: 504 QQP-AFYKYTFDL-AESNNTHIDVSGFGKGVVLVNGFNIGRYWEI--------------- 546
Query: 682 GAYSSNKCLKNCGKPSQSLYHVPRSWLKSSGNTLVLFEEIGGDPTKISFV 731
PSQSLY +P+++LK N +++F+ G P I +
Sbjct: 547 -------------GPSQSLY-IPKAFLKQGQNEIIVFDSEGKYPESIQLI 582
>gi|443689405|gb|ELT91801.1| hypothetical protein CAPTEDRAFT_23316, partial [Capitella teleta]
Length = 596
Score = 168 bits (426), Expect = 1e-38, Method: Compositional matrix adjust.
Identities = 114/343 (33%), Positives = 170/343 (49%), Gaps = 35/343 (10%)
Query: 36 IGGKRRVLISGSIHYPRSTPEMWPDLIQKSKDGGLDVIETYVFWNLHEPVRNQYNFEGRY 95
+ G+R + SGS HY R+ P +W D + + K GL+ + TYV WN HEP + Q+ G Y
Sbjct: 11 LDGRRFKIFSGSFHYFRTHPLLWGDRLLRMKAAGLNTVMTYVPWNFHEPRKGQFTLGGLY 70
Query: 96 DLVKFVKLVAEAGLYAHLRIGPYVCAEWNFGGFPLWLHFIPGIQFRTDN-EPFKAEMQRF 154
DLV F++ V + GLY +R GPY+CAEW FGGFP WL P + RT + P+ E++++
Sbjct: 71 DLVSFMEQVQKVGLYLIVRPGPYICAEWEFGGFPSWLLRDPKMNLRTSSYTPYLNEVKQY 130
Query: 155 TAKIVDMMKQEKLYASQGGPIILSQIENEYGNI---DSAY-GAAGKSYIKWAAGMALSLD 210
+++ ++ K GGPII Q+ENE+G+ D Y Y W L
Sbjct: 131 LSQLFAVLT--KFTYKHGGPIIAFQVENEFGSKGVHDPEYLQFLVTQYSSWNLNELLFTS 188
Query: 211 TGVPWVMCQQSDAPDPI--INTCNGFYCD-----QFTPNSNNKPKMWTENWSGWFLSFGG 263
G ++ PD + IN + D +F P +P M TE W+GWF +G
Sbjct: 189 DGKKYL--SNGTLPDVLATINLNDHAKEDLEELKEFQP---ERPLMVTEFWAGWFDHWGE 243
Query: 264 AVPYRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNFDRTSGGPFIS-------------- 309
+ +L + + N+YM+ GGTNF +G ++S
Sbjct: 244 EHHHYGTTELERELEAILSLNASV-NFYMFIGGTNFGFWNGANYLSYNKDKEASLLGPTV 302
Query: 310 TSYDYDAPLDEYGLIRQPKWGHLKDLHKAIKLCEAALVATDPT 352
TSYDYDA + E+G ++ PK+ +++L K L L PT
Sbjct: 303 TSYDYDAAVSEWGHVK-PKYNVIRNLLKKYSLTPLDLPDVPPT 344
>gi|354466872|ref|XP_003495895.1| PREDICTED: beta-galactosidase-1-like protein 3-like [Cricetulus
griseus]
Length = 761
Score = 168 bits (426), Expect = 1e-38, Method: Compositional matrix adjust.
Identities = 119/349 (34%), Positives = 171/349 (48%), Gaps = 36/349 (10%)
Query: 36 IGGKRRVLISGSIHYPRSTPEMWPDLIQKSKDGGLDVIETYVFWNLHEPVRNQYNFEGRY 95
+ G + +++ GSIHY R E W D + K + G + + TY+ WNLHE R ++F
Sbjct: 188 LDGHKFMIVGGSIHYFRVPREYWKDRLLKLQACGFNTVTTYIPWNLHEQNRGTFDFSEIL 247
Query: 96 DLVKFVKLVAEAGLYAHLRIGPYVCAEWNFGGFPLWLHFIPGIQFRTDNEPFKAEMQRFT 155
DL +V L A GL+ LR GPY+CAE + GG P WL P +Q RT + F + ++
Sbjct: 248 DLEAYVSLAATLGLWVILRPGPYICAEVDLGGLPSWLLGYPELQLRTTQQEFLDAVDKYF 307
Query: 156 AKIVDMMKQEKLYASQGGPIILSQIENEYGNI--DSAYGAAGKSYIKWAAGMALSLDT-- 211
++ + L +GGP+I QIENEYG+ D Y K ++ + L L +
Sbjct: 308 DHLIPRIL--PLQYLRGGPVIAVQIENEYGSFSKDGDYMEYIKEALQKRGIVELLLTSDN 365
Query: 212 --GVPWVMCQQSDAPDPIINTCN--GFYCDQFTP---NSNNKPKMWTENWSGWFLSFGGA 264
G+ Q+ + + T N F D F N+KP M E W+GWF ++G
Sbjct: 366 HKGI------QTGSVKGALTTINMASFEKDSFIKLLQMQNDKPIMVMEYWTGWFDTWGRE 419
Query: 265 VPYRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNFDRTSGG------PFISTSYDYDAPL 318
+ E++ + V+RF + G +F N YM+HGGTNF +G + TSYDYDA L
Sbjct: 420 HNVKSAEEIRYTVSRFIKYGISF-NMYMFHGGTNFGFINGAFHYDKHSSVVTSYDYDAVL 478
Query: 319 DEYGLIRQPKWGHLKDLHKAIKLCEAALVATDPTYPSLGPNLEATVYKT 367
E G + + K KL +A V P P L P TVY T
Sbjct: 479 TEAG-------DYTEKYFKLRKLFASASVGFLPRLPQLIPK---TVYPT 517
>gi|445497922|ref|ZP_21464777.1| beta-galactosidase BgaC [Janthinobacterium sp. HH01]
gi|444787917|gb|ELX09465.1| beta-galactosidase BgaC [Janthinobacterium sp. HH01]
Length = 624
Score = 168 bits (426), Expect = 1e-38, Method: Compositional matrix adjust.
Identities = 110/318 (34%), Positives = 167/318 (52%), Gaps = 22/318 (6%)
Query: 36 IGGKRRVLISGSIHYPRSTPEMWPDLIQKSKDGGLDVIETYVFWNLHEPVRNQYNFEGRY 95
+ G+ V+ SG +HYPR W + ++ ++ GL+ + TY FW+ HEP Q++F G+
Sbjct: 42 LDGQPFVIRSGEMHYPRIPRAAWRERLRMARAMGLNTVTTYAFWSQHEPEPGQWSFSGQN 101
Query: 96 DLVKFVKLVAEAGLYAHLRIGPYVCAEWNFGGFPLWLHFIPGIQFRTDNEPFKAEMQRFT 155
DL F+K AE GL LR GPYVCAE +FGGFP WL G++ R+ + + A R+
Sbjct: 102 DLRTFIKTAAEEGLNVVLRPGPYVCAEVDFGGFPAWLMRTQGLRVRSMDARYLAASARYF 161
Query: 156 AKIVDMMKQEKLYASQGGPIILSQIENEYGNI--DSAYGAAGKSYIKWAAGMA--LSLDT 211
++ + L +S+GGPI++ Q+ENEYG+ D Y A ++ ++ A A + D
Sbjct: 162 KRLAQEVAD--LQSSRGGPILMLQLENEYGSYGRDHDYLRAVRTQMRQAGFDAPLFTSDG 219
Query: 212 GVPWVMCQQSDAPDPIINTCNGFYCD------QFTPNSNNKPKMWTENWSGWFLSFGGAV 265
G + + A P + G D + + P+M E W+GWF +G
Sbjct: 220 GAGRLFEGGTLADVPAVVNFGGGADDAQASVQELAAWRPHGPRMAGEYWAGWFDHWGEQH 279
Query: 266 PYRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNFDRTSGGPFI--------STSYDYDAP 317
+ E+ A V R +G +F N YM+HGGT+F +G + +TSYDYDA
Sbjct: 280 HTQSPEEAARTVERMLSQGVSF-NLYMFHGGTSFGWLAGANYSGSEPYQPDTTSYDYDAA 338
Query: 318 LDEYGLIRQPKWGHLKDL 335
LDE G PK+ L+D+
Sbjct: 339 LDEAGRP-TPKYFALRDV 355
Score = 42.4 bits (98), Expect = 0.98, Method: Compositional matrix adjust.
Identities = 28/83 (33%), Positives = 39/83 (46%), Gaps = 31/83 (37%)
Query: 638 SEPVA--IDFTGMGKGEAWVNGQSIGRYWPTYVSQNGGCTDSCNYRGAYSSNKCLKNCGK 695
S+PV +D G GKG+ WVNG+ +GRYW + G
Sbjct: 547 SKPVDTFLDTRGWGKGQVWVNGRHLGRYW---------------------------HIG- 578
Query: 696 PSQSLYHVPRSWLKSSGNTLVLF 718
P Q+LY +P SWLK N +++F
Sbjct: 579 PQQTLY-LPASWLKEGANEVLVF 600
>gi|227518994|ref|ZP_03949043.1| possible beta-galactosidase [Enterococcus faecalis TX0104]
gi|227553614|ref|ZP_03983663.1| possible beta-galactosidase [Enterococcus faecalis HH22]
gi|293383402|ref|ZP_06629315.1| beta-galactosidase [Enterococcus faecalis R712]
gi|293388945|ref|ZP_06633430.1| beta-galactosidase [Enterococcus faecalis S613]
gi|312907770|ref|ZP_07766761.1| glycosyl hydrolase family 35 [Enterococcus faecalis DAPTO 512]
gi|312910388|ref|ZP_07769235.1| glycosyl hydrolase family 35 [Enterococcus faecalis DAPTO 516]
gi|422714384|ref|ZP_16771110.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX0309A]
gi|422715641|ref|ZP_16772357.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX0309B]
gi|424676529|ref|ZP_18113400.1| putative beta-galactosidase [Enterococcus faecalis ERV103]
gi|424681657|ref|ZP_18118444.1| putative beta-galactosidase [Enterococcus faecalis ERV116]
gi|424683847|ref|ZP_18120597.1| putative beta-galactosidase [Enterococcus faecalis ERV129]
gi|424686250|ref|ZP_18122918.1| putative beta-galactosidase [Enterococcus faecalis ERV25]
gi|424690479|ref|ZP_18127014.1| putative beta-galactosidase [Enterococcus faecalis ERV31]
gi|424695572|ref|ZP_18131955.1| putative beta-galactosidase [Enterococcus faecalis ERV37]
gi|424696689|ref|ZP_18133030.1| putative beta-galactosidase [Enterococcus faecalis ERV41]
gi|424699924|ref|ZP_18136135.1| putative beta-galactosidase [Enterococcus faecalis ERV62]
gi|424703062|ref|ZP_18139196.1| putative beta-galactosidase [Enterococcus faecalis ERV63]
gi|424707441|ref|ZP_18143425.1| putative beta-galactosidase [Enterococcus faecalis ERV65]
gi|424716899|ref|ZP_18146197.1| putative beta-galactosidase [Enterococcus faecalis ERV68]
gi|424720477|ref|ZP_18149578.1| putative beta-galactosidase [Enterococcus faecalis ERV72]
gi|424724025|ref|ZP_18152974.1| putative beta-galactosidase [Enterococcus faecalis ERV73]
gi|424733616|ref|ZP_18162171.1| putative beta-galactosidase [Enterococcus faecalis ERV81]
gi|424744084|ref|ZP_18172389.1| putative beta-galactosidase [Enterococcus faecalis ERV85]
gi|424750408|ref|ZP_18178472.1| putative beta-galactosidase [Enterococcus faecalis ERV93]
gi|227073566|gb|EEI11529.1| possible beta-galactosidase [Enterococcus faecalis TX0104]
gi|227177262|gb|EEI58234.1| possible beta-galactosidase [Enterococcus faecalis HH22]
gi|291079193|gb|EFE16557.1| beta-galactosidase [Enterococcus faecalis R712]
gi|291081726|gb|EFE18689.1| beta-galactosidase [Enterococcus faecalis S613]
gi|310626798|gb|EFQ10081.1| glycosyl hydrolase family 35 [Enterococcus faecalis DAPTO 512]
gi|311289661|gb|EFQ68217.1| glycosyl hydrolase family 35 [Enterococcus faecalis DAPTO 516]
gi|315575986|gb|EFU88177.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX0309B]
gi|315580706|gb|EFU92897.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX0309A]
gi|402350756|gb|EJU85654.1| putative beta-galactosidase [Enterococcus faecalis ERV116]
gi|402356541|gb|EJU91272.1| putative beta-galactosidase [Enterococcus faecalis ERV103]
gi|402364212|gb|EJU98655.1| putative beta-galactosidase [Enterococcus faecalis ERV129]
gi|402364322|gb|EJU98764.1| putative beta-galactosidase [Enterococcus faecalis ERV31]
gi|402367784|gb|EJV02121.1| putative beta-galactosidase [Enterococcus faecalis ERV25]
gi|402368267|gb|EJV02587.1| putative beta-galactosidase [Enterococcus faecalis ERV37]
gi|402375423|gb|EJV09410.1| putative beta-galactosidase [Enterococcus faecalis ERV62]
gi|402377018|gb|EJV10929.1| putative beta-galactosidase [Enterococcus faecalis ERV41]
gi|402385039|gb|EJV18580.1| putative beta-galactosidase [Enterococcus faecalis ERV65]
gi|402385067|gb|EJV18607.1| putative beta-galactosidase [Enterococcus faecalis ERV63]
gi|402386247|gb|EJV19753.1| putative beta-galactosidase [Enterococcus faecalis ERV68]
gi|402391229|gb|EJV24540.1| putative beta-galactosidase [Enterococcus faecalis ERV81]
gi|402392948|gb|EJV26178.1| putative beta-galactosidase [Enterococcus faecalis ERV72]
gi|402396006|gb|EJV29081.1| putative beta-galactosidase [Enterococcus faecalis ERV73]
gi|402399507|gb|EJV32379.1| putative beta-galactosidase [Enterococcus faecalis ERV85]
gi|402406707|gb|EJV39253.1| putative beta-galactosidase [Enterococcus faecalis ERV93]
Length = 604
Score = 168 bits (426), Expect = 1e-38, Method: Compositional matrix adjust.
Identities = 115/324 (35%), Positives = 163/324 (50%), Gaps = 27/324 (8%)
Query: 35 VIGGKRRVLISGSIHYPRSTPEMWPDLIQKSKDGGLDVIETYVFWNLHEPVRNQYNFEGR 94
++ G+ ++SG+IHY R P W + K G + +ETYV WNLHEP + ++FEG
Sbjct: 21 LLNGQSFKILSGAIHYFRVDPSDWHHSLYNLKALGFNTVETYVPWNLHEPQKGTFHFEGI 80
Query: 95 YDLVKFVKLVAEAGLYAHLRIGPYVCAEWNFGGFPLWLHFIPGIQFRTDNEPFKAEMQRF 154
DL +F+KL E GLYA +R PY+CAEW FGGFP WL PG + R++N + + +
Sbjct: 81 LDLERFLKLAQELGLYAIVRPSPYICAEWEFGGFPAWLLNEPG-RMRSNNPTYLKHVAEY 139
Query: 155 TAKIVDMMKQEKLYASQGGPIILSQIENEYGNI--DSAYGAAGKSYIKWAAGMALSLDTG 212
+++ + +L + GG I++ QIENEYG+ + AY A + + A +
Sbjct: 140 YDVLMEKIVPHQL--ANGGNILMIQIENEYGSFGEEKAYLRAIRDLMIARGVTAPFFTSD 197
Query: 213 VPW--VMCQQSDAPDPIINTCN---------GFYCDQFTPNSNNKPKMWTENWSGWFLSF 261
PW + S D I+ T N G F + P M E W GWF +
Sbjct: 198 GPWRATLRAGSMIEDDILVTGNFGSKAKENFGMMQAFFEEHGKKWPLMCMEFWDGWFNRW 257
Query: 262 GGAVPYRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNFDRTSGG--------PFISTSYD 313
+ R ++LA +V G N YM+HGGTNF +G P I TSYD
Sbjct: 258 KEPIIKRDPQELAESVREALALGSI--NLYMFHGGTNFGFMNGCSARGTIDLPQI-TSYD 314
Query: 314 YDAPLDEYGLIRQPKWGHLKDLHK 337
YDAPLDE G + + K LH+
Sbjct: 315 YDAPLDEQGNPTEKYFALQKMLHE 338
>gi|298384202|ref|ZP_06993762.1| beta-galactosidase (Lactase) [Bacteroides sp. 1_1_14]
gi|383123627|ref|ZP_09944306.1| hypothetical protein BSIG_3219 [Bacteroides sp. 1_1_6]
gi|251839745|gb|EES67828.1| hypothetical protein BSIG_3219 [Bacteroides sp. 1_1_6]
gi|298262481|gb|EFI05345.1| beta-galactosidase (Lactase) [Bacteroides sp. 1_1_14]
Length = 624
Score = 168 bits (426), Expect = 1e-38, Method: Compositional matrix adjust.
Identities = 115/337 (34%), Positives = 169/337 (50%), Gaps = 48/337 (14%)
Query: 38 GKRRVLISGSIHYPRSTPEMWPDLIQKSKDGGLDVIETYVFWNLHEPVRNQYNFEGRYDL 97
G+ ++SG +HY R + W +Q K GL+ + TYVFWNLHE +++F G +L
Sbjct: 35 GEEIPILSGEMHYARIPHQYWRHRLQMMKGMGLNTVATYVFWNLHEVEPGKWDFSGDKNL 94
Query: 98 VKFVKLVAEAGLYAHLRIGPYVCAEWNFGGFPLWLHFIPGIQFRTDNEPFKAEMQRFTAK 157
+++++ E G+ LR GPYVCAEW FGG+P WL IPG++ R DN E ++T K
Sbjct: 95 AEYIRIAGEEGMMVILRPGPYVCAEWEFGGYPWWLQNIPGMEIRRDN----TEFLKYTKK 150
Query: 158 IVDMMKQE--KLYASQGGPIILSQIENEYGNIDS-----------AYGAAGKSYIKWAAG 204
+D + +E L ++GGPII+ Q ENE+G+ S +Y A K + AG
Sbjct: 151 YIDRLYEEVGDLQCTKGGPIIMVQCENEFGSYVSQRKDIPLEEHRSYNAKIKGQLA-DAG 209
Query: 205 MALSLDTGV-PWVM---CQQSDAPDPIINTCNGF--------YCDQFTPNSNNKPKMWTE 252
+ L T W+ C P T NG +Q+ + + P M E
Sbjct: 210 FTIPLFTSDGSWLFEGGCVAGALP-----TANGESDIANLKKVVNQY--HGDKGPYMVAE 262
Query: 253 NWSGWFLSFGGAVPYRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNFDRTSGGPFIS--- 309
+SGW +G P ++A + Q +F N+YM HGGTNF TSG +
Sbjct: 263 FYSGWLSHWGEPFPQVSASEIARQTEAYLQNDVSF-NFYMVHGGTNFGFTSGANYDKKRD 321
Query: 310 -----TSYDYDAPLDEYGLIRQPKWGHLKD-LHKAIK 340
TSYDYDAP+ E G + PK+ ++ + K +K
Sbjct: 322 IQPDLTSYDYDAPISEAGWL-TPKYDSIRSVIQKYVK 357
>gi|29376349|ref|NP_815503.1| glycosyl hydrolase [Enterococcus faecalis V583]
gi|256961697|ref|ZP_05565868.1| beta-galactosidase [Enterococcus faecalis Merz96]
gi|257419527|ref|ZP_05596521.1| beta-galactosidase [Enterococcus faecalis T11]
gi|29343812|gb|AAO81573.1| glycosyl hydrolase, family 35 [Enterococcus faecalis V583]
gi|256952193|gb|EEU68825.1| beta-galactosidase [Enterococcus faecalis Merz96]
gi|257161355|gb|EEU91315.1| beta-galactosidase [Enterococcus faecalis T11]
Length = 594
Score = 168 bits (425), Expect = 1e-38, Method: Compositional matrix adjust.
Identities = 115/324 (35%), Positives = 163/324 (50%), Gaps = 27/324 (8%)
Query: 35 VIGGKRRVLISGSIHYPRSTPEMWPDLIQKSKDGGLDVIETYVFWNLHEPVRNQYNFEGR 94
++ G+ ++SG+IHY R P W + K G + +ETYV WNLHEP + ++FEG
Sbjct: 11 LLNGQSFKILSGAIHYFRVDPSDWHHSLYNLKALGFNTVETYVPWNLHEPQKGTFHFEGI 70
Query: 95 YDLVKFVKLVAEAGLYAHLRIGPYVCAEWNFGGFPLWLHFIPGIQFRTDNEPFKAEMQRF 154
DL +F+KL E GLYA +R PY+CAEW FGGFP WL PG + R++N + + +
Sbjct: 71 LDLERFLKLAQELGLYAIVRPSPYICAEWEFGGFPAWLLNEPG-RMRSNNPTYLKHVAEY 129
Query: 155 TAKIVDMMKQEKLYASQGGPIILSQIENEYGNI--DSAYGAAGKSYIKWAAGMALSLDTG 212
+++ + +L + GG I++ QIENEYG+ + AY A + + A +
Sbjct: 130 YDVLMEKIVPHQL--ANGGNILMIQIENEYGSFGEEKAYLRAIRDLMIARGVTAPFFTSD 187
Query: 213 VPW--VMCQQSDAPDPIINTCN---------GFYCDQFTPNSNNKPKMWTENWSGWFLSF 261
PW + S D I+ T N G F + P M E W GWF +
Sbjct: 188 GPWRATLRAGSMIEDDILVTGNFGSKAKENFGMMQAFFEEHGKKWPLMCMEFWDGWFNRW 247
Query: 262 GGAVPYRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNFDRTSGG--------PFISTSYD 313
+ R ++LA +V G N YM+HGGTNF +G P I TSYD
Sbjct: 248 KEPIIKRDPQELAESVREALALGSI--NLYMFHGGTNFGFMNGCSARGTIDLPQI-TSYD 304
Query: 314 YDAPLDEYGLIRQPKWGHLKDLHK 337
YDAPLDE G + + K LH+
Sbjct: 305 YDAPLDEQGNPTEKYFALQKMLHE 328
>gi|375146511|ref|YP_005008952.1| glycoside hydrolase family protein [Niastella koreensis GR20-10]
gi|361060557|gb|AEV99548.1| glycoside hydrolase family 35 [Niastella koreensis GR20-10]
Length = 920
Score = 168 bits (425), Expect = 1e-38, Method: Compositional matrix adjust.
Identities = 114/312 (36%), Positives = 155/312 (49%), Gaps = 32/312 (10%)
Query: 33 AVVIGGKRRVLISGSIHYPRSTPEMWPDLIQKSKDGGLDVIETYVFWNLHEPVRNQYNFE 92
A ++ G+ +ISG +HYPR E W D ++K+K GL+ I TYVFWNLHEP + +Y+F
Sbjct: 346 AFLLDGQPFQIISGEMHYPRVPREAWRDRMRKAKAMGLNTIGTYVFWNLHEPQKGKYDFS 405
Query: 93 GRYDLVKFVKLVAEAGLYAHLRIGPYVCAEWNFGGFPLWLHFIPGIQFRTDNEPFKAEMQ 152
G D+ FVK E GL+ LR PYVCAEW FGG+P WL I G++ R+ + +Q
Sbjct: 406 GNNDIAAFVKTAQEEGLWVILRPSPYVCAEWEFGGYPYWLQNIKGLEVRSKEPQY---LQ 462
Query: 153 RFTAKIVDMMKQ-EKLYASQGGPIILSQIENEYGNIDSAYGAAGKSYIK------WAAGM 205
+ I+ + KQ L + GG I++ Q+ENEYG AYG + + Y+ AG
Sbjct: 463 AYKNYIMQVGKQLAPLQVNHGGNILMVQVENEYG----AYG-SDREYLDINRRLFIEAGF 517
Query: 206 ALSLDTGVPWVMCQQSDAPDPIINTCNGF----YCDQFTPNSNN--KPKMWTENWSGWFL 259
L T P + + P + + NG Q +N P E + WF
Sbjct: 518 DGLLYTCDPEPFLAKGNLPGKLFTSINGLDKPARIKQLIKQNNEGKGPYFVAEWYPAWFD 577
Query: 260 SFGGAVPYRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNFDRTSGG---------PFIST 310
+G P E + G + N YM+HGGT D +G P IS
Sbjct: 578 WWGTQHHKVPAEKYTPGLDSVLSAGMSV-NMYMFHGGTTRDFMNGANYNDQNPYEPQIS- 635
Query: 311 SYDYDAPLDEYG 322
SYDYDAPLDE G
Sbjct: 636 SYDYDAPLDEAG 647
Score = 54.3 bits (129), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 66/250 (26%), Positives = 100/250 (40%), Gaps = 58/250 (23%)
Query: 494 LEDGSKTVLHVQSLGHALHAFINGKLVGSGYGSSSNAKVTVD-FPIALAPGKNTFDLLSL 552
++ G + L ++ L FINGK + S + ++ D + L K D+L
Sbjct: 725 IDGGREGALKIKDLRDYGLVFINGKRI-----SVLDRRLKQDSIWLKLPDEKIQLDILVE 779
Query: 553 TVGLQNYGAFYEKTGAGITGPVQLKGSGNGTNIDLSSQQWTYQTGLKGEELNFPSGSSTQ 612
+G NYG + K GIT V S NG + TG + +L F +S
Sbjct: 780 NLGRINYGPYLLKNKKGITEGV----SFNGKEL----------TGWQMFKLPFNDLNSVA 825
Query: 613 WDSKSTL---PKLQPLVWYKTTFDAPAGSEPVAIDFTGMGKGEAWVNGQSIGRYWPTYVS 669
+ TL P L+ K TF + ++ GKG WVNG ++GRYW
Sbjct: 826 LKNSKTLSGAPVLK-----KGTFSLQTVGD-TYLNLGNWGKGVVWVNGHNLGRYW----- 874
Query: 670 QNGGCTDSCNYRGAYSSNKCLKNCGKPSQSLYHVPRSWLKSSGNTLVLFEEIGGDPTKIS 729
N G P Q+LY VP WLK GN +++ E + + +++
Sbjct: 875 ----------------------NIG-PQQTLY-VPVEWLKKGGNEIIVLELLKPEQSQLQ 910
Query: 730 FVTKQLGSSL 739
V K + L
Sbjct: 911 AVDKPILDKL 920
>gi|355567243|gb|EHH23622.1| hypothetical protein EGK_07120 [Macaca mulatta]
Length = 653
Score = 168 bits (425), Expect = 1e-38, Method: Compositional matrix adjust.
Identities = 115/336 (34%), Positives = 162/336 (48%), Gaps = 25/336 (7%)
Query: 36 IGGKRRVLISGSIHYPRSTPEMWPDLIQKSKDGGLDVIETYVFWNLHEPVRNQYNFEGRY 95
+ G R ++ GSIHY R E W D + K + G + + TYV WNLHEP R +++F G
Sbjct: 82 LEGHRFLICGGSIHYFRVPREYWRDRLLKLRACGFNTVTTYVPWNLHEPERGKFDFSGNL 141
Query: 96 DLVKFVKLVAEAGLYAHLRIGPYVCAEWNFGGFPLWLHFIPGIQFRTDNEPFKAEMQRFT 155
DL FV + AE GL+ LR GPY+C+E + GG P WL P + RT N+ F ++++
Sbjct: 142 DLEAFVLMAAEIGLWVILRPGPYICSEMDLGGLPSWLLQDPRLLLRTTNKGFTEAVEKYF 201
Query: 156 AKIVDMMKQEKLYASQGGPIILSQIENEYG--NIDSAYGA-AGKSYIKWAAGMALSLDTG 212
++ + L QGGP+I Q+ENEYG N D Y K+ ++ L G
Sbjct: 202 DHLIP--RVIPLQYRQGGPVIAVQVENEYGSFNKDKTYMPYLHKALLRRGIVELLLTSDG 259
Query: 213 VPWVMCQQSDAPDPIIN----TCNGFYCDQFTPNSNNKPKMWTENWSGWFLSFGGAVPYR 268
V+ + IN N F +Q +KP + E W GWF +G +
Sbjct: 260 EKNVLSGHTKGVLAAINLQKVQRNTF--NQLHKVQRDKPLLVMEYWVGWFDRWGDKHHVK 317
Query: 269 PVEDLAFAVARFFQRGGTFQNYYMYHGGTNFDRTSGGPF------ISTSYDYDAPLDEYG 322
+++ AV+ F + +F N YM+HGGTNF +G I TSYDYDA L E G
Sbjct: 318 DAKEVEHAVSEFIKYEISF-NVYMFHGGTNFGFMNGATNFGKHTGIVTSYDYDAVLTEAG 376
Query: 323 LIRQPKWGHLKDLHKAIKLCEAALVATDPTYPSLGP 358
+ + K KL E+ P P L P
Sbjct: 377 -------DYTEKYFKLQKLLESVSATPLPQVPKLTP 405
>gi|422729668|ref|ZP_16786066.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX0012]
gi|315149788|gb|EFT93804.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX0012]
Length = 604
Score = 168 bits (425), Expect = 1e-38, Method: Compositional matrix adjust.
Identities = 114/316 (36%), Positives = 160/316 (50%), Gaps = 27/316 (8%)
Query: 43 LISGSIHYPRSTPEMWPDLIQKSKDGGLDVIETYVFWNLHEPVRNQYNFEGRYDLVKFVK 102
++SG+IHY R P W + K G + +ETYV WNLHEP + ++FEG DL +F+K
Sbjct: 29 ILSGAIHYFRVDPSDWHHSLYNLKALGFNTVETYVPWNLHEPQKGTFHFEGILDLERFLK 88
Query: 103 LVAEAGLYAHLRIGPYVCAEWNFGGFPLWLHFIPGIQFRTDNEPFKAEMQRFTAKIVDMM 162
L E GLYA +R PY+CAEW FGGFP WL PG + R++N + + + +++ +
Sbjct: 89 LAQELGLYAIVRPSPYICAEWEFGGFPAWLLNEPG-RMRSNNPTYLKHVAEYYDVLMEKI 147
Query: 163 KQEKLYASQGGPIILSQIENEYGNI--DSAYGAAGKSYIKWAAGMALSLDTGVPW--VMC 218
+L + GG I++ QIENEYG+ + AY A + + A + PW +
Sbjct: 148 VPHQL--ANGGNILMIQIENEYGSFGEEKAYLRAIRDLMIARGVTAPFFTSDGPWRATLR 205
Query: 219 QQSDAPDPIINTCN---------GFYCDQFTPNSNNKPKMWTENWSGWFLSFGGAVPYRP 269
S D I+ T N G F + P M E W GWF + + R
Sbjct: 206 AGSMIEDDILVTGNFGSKAKENFGMMQAFFEEHGKKWPLMCMEFWDGWFNRWKEPIIKRD 265
Query: 270 VEDLAFAVARFFQRGGTFQNYYMYHGGTNFDRTSGG--------PFISTSYDYDAPLDEY 321
++LA +V G N YM+HGGTNF+ +G P I TSYDYDAPLDE
Sbjct: 266 PQELAESVREALALGSI--NLYMFHGGTNFEFMNGCSARGTIDLPQI-TSYDYDAPLDEQ 322
Query: 322 GLIRQPKWGHLKDLHK 337
G + + K LH+
Sbjct: 323 GNPTEKYFALQKMLHE 338
>gi|242078605|ref|XP_002444071.1| hypothetical protein SORBIDRAFT_07g006925 [Sorghum bicolor]
gi|241940421|gb|EES13566.1| hypothetical protein SORBIDRAFT_07g006925 [Sorghum bicolor]
Length = 147
Score = 168 bits (425), Expect = 1e-38, Method: Compositional matrix adjust.
Identities = 82/148 (55%), Positives = 108/148 (72%), Gaps = 1/148 (0%)
Query: 701 YHVPRSWLKSSGNTLVLFEEIGGDPTKISFVTKQLGSSLCSHVTDSHPLPVDMWGSDSKI 760
YHVP +L+ N +VLFE+ GGDP+KISFV +Q GS + + V++ HP +D W S +
Sbjct: 1 YHVPCLFLQPGNNDIVLFEQFGGDPSKISFVIRQTGSVI-AQVSEEHPAQIDSWNSSQQT 59
Query: 761 QRKPGPVLSLECPNPNQVISSIKFASFGTPLGTCGSFSRGRCSSARSLSVVRQACVGSKS 820
++ GP L LECP QVISSIKFASFGTP GTC S+S G CSS +++SVV++AC+G +
Sbjct: 60 MQRYGPELRLECPKDGQVISSIKFASFGTPSGTCRSYSHGECSSIQAISVVQEACIGVSN 119
Query: 821 CSIGVSVNTFGDPCKGVMKSLAVEASCT 848
CS+ VS N FG+P GV KSLAVEA+C+
Sbjct: 120 CSVPVSSNYFGNPWTGVTKSLAVEAACS 147
>gi|229549776|ref|ZP_04438501.1| possible beta-galactosidase [Enterococcus faecalis ATCC 29200]
gi|312950913|ref|ZP_07769823.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX0102]
gi|422692785|ref|ZP_16750800.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX0031]
gi|422706430|ref|ZP_16764128.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX0043]
gi|422727290|ref|ZP_16783733.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX0312]
gi|229305045|gb|EEN71041.1| possible beta-galactosidase [Enterococcus faecalis ATCC 29200]
gi|310631062|gb|EFQ14345.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX0102]
gi|315152244|gb|EFT96260.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX0031]
gi|315156045|gb|EFU00062.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX0043]
gi|315157806|gb|EFU01823.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX0312]
Length = 604
Score = 168 bits (425), Expect = 1e-38, Method: Compositional matrix adjust.
Identities = 120/342 (35%), Positives = 168/342 (49%), Gaps = 29/342 (8%)
Query: 19 ATTSFGANVTYDH--RAVVIGGKRRVLISGSIHYPRSTPEMWPDLIQKSKDGGLDVIETY 76
A T G NV ++ G+ ++SG+IHY R P W + K G + +ETY
Sbjct: 3 AFTRKGGNVERFEIKEEFLLNGQPFKILSGAIHYFRVDPSDWHHSLYNLKALGFNTVETY 62
Query: 77 VFWNLHEPVRNQYNFEGRYDLVKFVKLVAEAGLYAHLRIGPYVCAEWNFGGFPLWLHFIP 136
V WNLHEP + ++FEG DL +F+KL E GLYA +R PY+CAEW FGGFP WL P
Sbjct: 63 VPWNLHEPQKGTFHFEGILDLERFLKLAQELGLYAIVRPSPYICAEWEFGGFPAWLLNEP 122
Query: 137 GIQFRTDNEPFKAEMQRFTAKIVDMMKQEKLYASQGGPIILSQIENEYGNI--DSAYGAA 194
G + R++N + + + +++ + +L + GG I++ QIENEYG+ + AY A
Sbjct: 123 G-RMRSNNPTYLKHVAEYYDVLMEKIVPHQL--ANGGNILMIQIENEYGSFGEEKAYLRA 179
Query: 195 GKSYIKWAAGMALSLDTGVPW--VMCQQSDAPDPIINTCN---------GFYCDQFTPNS 243
+ + A + PW + S D I+ T N G F +
Sbjct: 180 IRDLMIARGVTAPFFTSDGPWRATLRAGSMIEDDILVTGNFGSKAKENFGMMQAFFEEHG 239
Query: 244 NNKPKMWTENWSGWFLSFGGAVPYRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNFDRTS 303
P M E W GWF + + R ++LA +V G N YM+HGGTNF +
Sbjct: 240 KKWPLMCMEFWDGWFNRWKEPIIKRDPQELAESVREALALGSI--NLYMFHGGTNFGFMN 297
Query: 304 GG--------PFISTSYDYDAPLDEYGLIRQPKWGHLKDLHK 337
G P I TSYDYDAPLDE G + + K LH+
Sbjct: 298 GCSARGTIDLPQI-TSYDYDAPLDEQGNPTEKYFALQKMLHE 338
>gi|424665378|ref|ZP_18102414.1| hypothetical protein HMPREF1205_01253 [Bacteroides fragilis HMW
616]
gi|404574622|gb|EKA79370.1| hypothetical protein HMPREF1205_01253 [Bacteroides fragilis HMW
616]
Length = 624
Score = 168 bits (425), Expect = 1e-38, Method: Compositional matrix adjust.
Identities = 113/336 (33%), Positives = 164/336 (48%), Gaps = 46/336 (13%)
Query: 38 GKRRVLISGSIHYPRSTPEMWPDLIQKSKDGGLDVIETYVFWNLHEPVRNQYNFEGRYDL 97
G+ ++SG +HY R + W +Q K GL+ + TYVFWNLHE +++F G +L
Sbjct: 35 GETTPILSGEMHYARIPHQYWRHRLQMMKGMGLNTVATYVFWNLHEVEPGKWDFSGDKNL 94
Query: 98 VKFVKLVAEAGLYAHLRIGPYVCAEWNFGGFPLWLHFIPGIQFRTDNEPFKAEMQRFTAK 157
+++++ E G+ LR GPYVCAEW FGG+P WL IPG++ R DN E ++T K
Sbjct: 95 AEYIRIAGEEGMMVILRPGPYVCAEWEFGGYPWWLQNIPGMEIRRDN----TEFLKYTKK 150
Query: 158 IVDMMKQE--KLYASQGGPIILSQIENEYGNIDS-----------AYGAAGKSYIKWAAG 204
+D + QE L ++GGPII+ Q ENE+G+ S +Y A K + A
Sbjct: 151 YIDRLYQEVGPLQCTKGGPIIMVQCENEFGSYVSQRKDISFEEHRSYNAKIKGQLADAGF 210
Query: 205 MALSLDTGVPWVM---CQQSDAPDPIINTCNGF--------YCDQFTPNSNNKPKMWTEN 253
+ W+ C P T NG +Q+ + P M E
Sbjct: 211 TVPLFTSDGSWLFEGGCVAGALP-----TANGESDIANLKKVVNQY--HGGKGPYMVAEF 263
Query: 254 WSGWFLSFGGAVPYRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNFDRTSGGPFIS---- 309
+ GW +G P ++A + Q +F N+YM HGGTNF TSG +
Sbjct: 264 YPGWLSHWGEPFPQVSASEIARQTEAYLQNNVSF-NFYMVHGGTNFGFTSGANYDKKRDI 322
Query: 310 ----TSYDYDAPLDEYGLIRQPKWGHLKD-LHKAIK 340
TSYDYDAP+ E G I PK+ ++ + K +K
Sbjct: 323 QPDLTSYDYDAPISEAGWI-TPKYDSIRSVIQKYVK 357
>gi|384939972|gb|AFI33591.1| beta-galactosidase-1-like protein 3 [Macaca mulatta]
gi|387541294|gb|AFJ71274.1| beta-galactosidase-1-like protein 3 [Macaca mulatta]
Length = 653
Score = 168 bits (425), Expect = 1e-38, Method: Compositional matrix adjust.
Identities = 115/336 (34%), Positives = 162/336 (48%), Gaps = 25/336 (7%)
Query: 36 IGGKRRVLISGSIHYPRSTPEMWPDLIQKSKDGGLDVIETYVFWNLHEPVRNQYNFEGRY 95
+ G R ++ GSIHY R E W D + K + G + + TYV WNLHEP R +++F G
Sbjct: 82 LEGHRFLICGGSIHYFRVPREYWRDRLLKLRACGFNTVTTYVPWNLHEPERGKFDFSGNL 141
Query: 96 DLVKFVKLVAEAGLYAHLRIGPYVCAEWNFGGFPLWLHFIPGIQFRTDNEPFKAEMQRFT 155
DL FV + AE GL+ LR GPY+C+E + GG P WL P + RT N+ F ++++
Sbjct: 142 DLEAFVLMAAEIGLWVILRPGPYICSEMDLGGLPSWLLQDPRLLLRTTNKGFTEAVEKYF 201
Query: 156 AKIVDMMKQEKLYASQGGPIILSQIENEYG--NIDSAYGA-AGKSYIKWAAGMALSLDTG 212
++ + L QGGP+I Q+ENEYG N D Y K+ ++ L G
Sbjct: 202 DHLIP--RVIPLQYRQGGPVIAVQVENEYGSFNKDKTYMPYLHKALLRRGIVELLLTSDG 259
Query: 213 VPWVMCQQSDAPDPIIN----TCNGFYCDQFTPNSNNKPKMWTENWSGWFLSFGGAVPYR 268
V+ + IN N F +Q +KP + E W GWF +G +
Sbjct: 260 EKNVLSGHTKGVLAAINLQKVQRNTF--NQLHKVQRDKPLLVMEYWVGWFDRWGDKHHVK 317
Query: 269 PVEDLAFAVARFFQRGGTFQNYYMYHGGTNFDRTSGGPF------ISTSYDYDAPLDEYG 322
+++ AV+ F + +F N YM+HGGTNF +G I TSYDYDA L E G
Sbjct: 318 DAKEVEHAVSEFIKYEISF-NVYMFHGGTNFGFMNGATNFGKHTGIVTSYDYDAVLTEAG 376
Query: 323 LIRQPKWGHLKDLHKAIKLCEAALVATDPTYPSLGP 358
+ + K KL E+ P P L P
Sbjct: 377 -------DYTEKYFKLQKLLESVSATPLPQVPKLTP 405
>gi|422708708|ref|ZP_16766236.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX0027]
gi|315036693|gb|EFT48625.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX0027]
Length = 604
Score = 167 bits (424), Expect = 2e-38, Method: Compositional matrix adjust.
Identities = 115/324 (35%), Positives = 163/324 (50%), Gaps = 27/324 (8%)
Query: 35 VIGGKRRVLISGSIHYPRSTPEMWPDLIQKSKDGGLDVIETYVFWNLHEPVRNQYNFEGR 94
++ G+ ++SG+IHY R P W + K G + +ETYV WNLHEP + ++FEG
Sbjct: 21 LLNGQPFKILSGAIHYFRVDPSDWHHSLYNLKALGFNTVETYVPWNLHEPQKGTFHFEGI 80
Query: 95 YDLVKFVKLVAEAGLYAHLRIGPYVCAEWNFGGFPLWLHFIPGIQFRTDNEPFKAEMQRF 154
DL +F+KL E GLYA +R PY+CAEW FGGFP WL PG + R++N + + +
Sbjct: 81 LDLERFLKLAQELGLYAIVRPSPYICAEWEFGGFPAWLLNEPG-RMRSNNPTYLKHVAEY 139
Query: 155 TAKIVDMMKQEKLYASQGGPIILSQIENEYGNI--DSAYGAAGKSYIKWAAGMALSLDTG 212
+++ + +L + GG I++ QIENEYG+ + AY A + + A +
Sbjct: 140 YDVLMEKIVPHQL--ANGGNILMIQIENEYGSFGEEKAYLRAIRDLMIARGVTAPFFTSD 197
Query: 213 VPW--VMCQQSDAPDPIINTCN---------GFYCDQFTPNSNNKPKMWTENWSGWFLSF 261
PW + S D I+ T N G F + P M E W GWF +
Sbjct: 198 GPWRATLRAGSMIEDDILVTGNFGSKAKENFGMMQAFFEEHGKKWPLMCMEFWDGWFNRW 257
Query: 262 GGAVPYRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNFDRTSGG--------PFISTSYD 313
+ R ++LA +V G N YM+HGGTNF +G P I TSYD
Sbjct: 258 KEPIIKRDPQELAESVREALALGSI--NLYMFHGGTNFGFMNGCSARGTIDLPQI-TSYD 314
Query: 314 YDAPLDEYGLIRQPKWGHLKDLHK 337
YDAPLDE G + + K LH+
Sbjct: 315 YDAPLDEQGNPTEKYFALQKMLHE 338
>gi|300861196|ref|ZP_07107283.1| putative beta-galactosidase [Enterococcus faecalis TUSoD Ef11]
gi|428767294|ref|YP_007153405.1| beta-galactosidase [Enterococcus faecalis str. Symbioflor 1]
gi|300850235|gb|EFK77985.1| putative beta-galactosidase [Enterococcus faecalis TUSoD Ef11]
gi|427185467|emb|CCO72691.1| beta-galactosidase [Enterococcus faecalis str. Symbioflor 1]
Length = 594
Score = 167 bits (424), Expect = 2e-38, Method: Compositional matrix adjust.
Identities = 115/324 (35%), Positives = 163/324 (50%), Gaps = 27/324 (8%)
Query: 35 VIGGKRRVLISGSIHYPRSTPEMWPDLIQKSKDGGLDVIETYVFWNLHEPVRNQYNFEGR 94
++ G+ ++SG+IHY R P W + K G + +ETYV WNLHEP + ++FEG
Sbjct: 11 LLNGQPFKILSGAIHYFRVDPSDWHHSLYNLKALGFNTVETYVPWNLHEPQKGTFHFEGI 70
Query: 95 YDLVKFVKLVAEAGLYAHLRIGPYVCAEWNFGGFPLWLHFIPGIQFRTDNEPFKAEMQRF 154
DL +F+KL E GLYA +R PY+CAEW FGGFP WL PG + R++N + + +
Sbjct: 71 LDLERFLKLAQELGLYAIVRPSPYICAEWEFGGFPAWLLNEPG-RMRSNNPTYLKHVAEY 129
Query: 155 TAKIVDMMKQEKLYASQGGPIILSQIENEYGNI--DSAYGAAGKSYIKWAAGMALSLDTG 212
+++ + +L + GG I++ QIENEYG+ + AY A + + A +
Sbjct: 130 YDVLMEKIVPHQL--ANGGNILMIQIENEYGSFGEEKAYLRAIRDLMIARGVTAPFFTSD 187
Query: 213 VPW--VMCQQSDAPDPIINTCN---------GFYCDQFTPNSNNKPKMWTENWSGWFLSF 261
PW + S D I+ T N G F + P M E W GWF +
Sbjct: 188 GPWRATLRAGSMIEDDILVTGNFGSKAKENFGMMQAFFEEHGKKWPLMCMEFWDGWFNRW 247
Query: 262 GGAVPYRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNFDRTSGG--------PFISTSYD 313
+ R ++LA +V G N YM+HGGTNF +G P I TSYD
Sbjct: 248 KEPIIKRDPQELAESVREALALGSI--NLYMFHGGTNFGFMNGCSARGTIDLPQI-TSYD 304
Query: 314 YDAPLDEYGLIRQPKWGHLKDLHK 337
YDAPLDE G + + K LH+
Sbjct: 305 YDAPLDEQGNPTEKYFALQKMLHE 328
>gi|167524869|ref|XP_001746770.1| hypothetical protein [Monosiga brevicollis MX1]
gi|163775040|gb|EDQ88666.1| predicted protein [Monosiga brevicollis MX1]
Length = 600
Score = 167 bits (424), Expect = 2e-38, Method: Compositional matrix adjust.
Identities = 116/332 (34%), Positives = 171/332 (51%), Gaps = 35/332 (10%)
Query: 45 SGSIHYPRSTPEMWPDLIQKSKDGGLDVIETYVFWNLHEPVRNQYNFEGR-YDLVKFVKL 103
SGS+HY R E W D ++ +K GL+ I TYV WN HE ++FE +DL +F+ L
Sbjct: 70 SGSLHYFRIPAEYWLDRLEMAKHMGLNTISTYVPWNFHEVGPGSFDFETHAHDLARFLNL 129
Query: 104 VAEAGLYAHLRIGPYVCAEWNFGGFPLWLHFIPGIQFRTDNEPFKAEMQRFTAKIVDMMK 163
E GL +R PY+CAEW+FGG P L P ++ R+ N+ F E++R+ ++ +++
Sbjct: 130 AHEVGLRVLIRPSPYICAEWDFGGLPARLMANPDLELRSSNDAFLDEVERYYDALMPILR 189
Query: 164 QEKLYASQGGPIILSQIENEYGNIDSAYGAAGKSYIKWAAGMALSLDTGVPWVM--CQQS 221
L AS GGPII +ENEYG +YG A + Y++ A +A+ D G+ M C +
Sbjct: 190 --PLQASNGGPIIAFYVENEYG----SYG-ADRDYLQ--ALVAMMRDRGIVEQMFTCDNA 240
Query: 222 D-----APDPIINTCN-----GFYCDQFTPNSNNKPKMWTENWSGWFLSFGGAVPYRPVE 271
A + T N + DQ ++P M +E W+GWF G E
Sbjct: 241 QGLSRGALPGALQTINFQDNVERHLDQLAHFQPDQPLMVSEYWTGWFDHDGEEHHTFDSE 300
Query: 272 DLAFAVARFFQRGGTFQNYYMYHGGTNFDRTSGG--PFIS--TSYDYDAPLDEYGLIRQP 327
DL + + RG +F N Y++HGGT+F +G P+ TSYDYDAPL E+G + P
Sbjct: 301 DLVEGLQKILDRGASF-NLYVFHGGTSFGWNAGANSPYAPDITSYDYDAPLSEHGQV-TP 358
Query: 328 KWGHLKDLHKAIKLCEAALVATDPTYPSLGPN 359
K+ + I++ + A P P N
Sbjct: 359 KY-------EDIQMVLMSYGAEHPRRPHYHSN 383
>gi|431919325|gb|ELK17922.1| Beta-galactosidase-1-like protein 3 [Pteropus alecto]
Length = 1113
Score = 167 bits (424), Expect = 2e-38, Method: Compositional matrix adjust.
Identities = 119/371 (32%), Positives = 175/371 (47%), Gaps = 34/371 (9%)
Query: 36 IGGKRRVLISGSIHYPRSTPEMWPDLIQKSKDGGLDVIETYVFWNLHEPVRNQYNFEGRY 95
+GG + + GSIHY R E W D + K K G + + TYV WNLHEP R ++F
Sbjct: 631 LGGHKFRIFGGSIHYFRVPREYWRDRLLKLKACGFNTVTTYVPWNLHEPQRGAFDFSENL 690
Query: 96 DLVKFVKLVAEAGLYAHLRIGPYVCAEWNFGGFPLWLHFIPGIQFRTDNEPFKAEMQRFT 155
DL FV + AE GL+ LR GPY+C+E + GG P WL ++ RT ++ F + ++
Sbjct: 691 DLEAFVLMAAEIGLWVILRPGPYICSEIDLGGLPSWLLQDSNVRLRTTDQGFVEAVDKYF 750
Query: 156 AKIVDMMKQEKLYASQGGPIILSQIENEYGNIDSAYGAAGKSYIKWAAGMALSLDTGVPW 215
++ + L QGGPII Q+ENEYG+ D K Y+ + L G+
Sbjct: 751 DHLI--ARVVPLQYRQGGPIIAVQVENEYGSFDK-----DKYYMPYIQQALLK--RGIVE 801
Query: 216 VMCQQSDAPDPIIN----------TCNGFYCDQFTPNSN---NKPKMWTENWSGWFLSFG 262
++ SDA ++ F D F P N NKP + E W GWF +G
Sbjct: 802 LLL-TSDAKTEVLKGYIKGVLAAINIEKFQNDAFEPLYNIQKNKPILVMEYWVGWFDKWG 860
Query: 263 GAVPYRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNFDRTSGG------PFISTSYDYDA 316
+ +D+ V+ F + +F N YM+HGGTNF +G I+TSYDYDA
Sbjct: 861 DEHNVKDAQDVENTVSEFIKFEISF-NVYMFHGGTNFGFINGATNFGKHKSIATSYDYDA 919
Query: 317 PLDEYGLIRQPKWGHLKDLH---KAIKLCEAALVATDPTYPSLGPNLEATVYKTGSGLCS 373
L E G + K+ L+ L A+ L + YPS+ P+ ++ L
Sbjct: 920 VLTEAGDYTE-KYFKLRKLFGSVLALPLPHLPELTPKAVYPSMRPSFHLPLWDVLQYLNE 978
Query: 374 AFLANIGTNSD 384
+++ N +
Sbjct: 979 PVISDKPINME 989
Score = 100 bits (249), Expect = 4e-18, Method: Compositional matrix adjust.
Identities = 82/298 (27%), Positives = 135/298 (45%), Gaps = 37/298 (12%)
Query: 24 GANVTYDHRAVVIGGKRRVLISGSIHYPRSTPEMWPDLIQKSKDGGLDVIETYVFWNLHE 83
G+N T D G ++I+G+IHY R E W D + K K G + + +V W+ HE
Sbjct: 53 GSNFTLD-------GFPFLIIAGTIHYFRVPREYWKDRLLKLKACGFNTVTMHVPWSHHE 105
Query: 84 PVRNQYNFEGRYDLVKFVKLVAEAGLYAHLRIGPYVCAEWNFGGFPLWLHFIPGIQFRTD 143
P R+++ F G DL F+ + + GL+ L GPY+ ++ + GG P WL P ++ RT
Sbjct: 106 PQRHKFYFTGDLDLRAFISIASNEGLWVILCPGPYIGSDLDLGGLPSWLLQDPKMKLRTT 165
Query: 144 NEPFKAEMQRFTAKIVDMMKQEKLYASQGGPIILSQIENEYG--NIDSAYGAAGKSYIKW 201
+ F + ++ +++ + GPII Q+ENEYG ++D Y SY+K
Sbjct: 166 YKGFTKAVNQYFDQLIPRIA--PFQYENYGPIIAVQVENEYGSYHLDKRY----MSYVKK 219
Query: 202 A------AGMALSLDTGVPWV--MCQQSDAPDPIINTCNGFYCDQFTPNSNNKPKMW--- 250
A M ++ D G + + A + N Y + F+ + M
Sbjct: 220 ALVKRGIKAMLMTADDGQEIIRGYLNKVIATVHMKNIKKETYKNLFSIQGLSPILMMVYT 279
Query: 251 ---TENWSGWFLSFGGAVPYRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNFDRTSGG 305
+++W + V + V ++ F +F N+YM+HGGTNF G
Sbjct: 280 TSSSDSWGHSHHTLDSHVLMKNVHEM-------FNLRFSF-NFYMFHGGTNFGFIGGA 329
>gi|256959208|ref|ZP_05563379.1| beta-galactosidase [Enterococcus faecalis DS5]
gi|256949704|gb|EEU66336.1| beta-galactosidase [Enterococcus faecalis DS5]
Length = 594
Score = 167 bits (424), Expect = 2e-38, Method: Compositional matrix adjust.
Identities = 115/324 (35%), Positives = 163/324 (50%), Gaps = 27/324 (8%)
Query: 35 VIGGKRRVLISGSIHYPRSTPEMWPDLIQKSKDGGLDVIETYVFWNLHEPVRNQYNFEGR 94
++ G+ ++SG+IHY R P W + K G + +ETYV WNLHEP + ++FEG
Sbjct: 11 LLNGQPFKILSGAIHYFRVDPSDWHHSLYNLKALGFNTVETYVPWNLHEPQKGTFHFEGI 70
Query: 95 YDLVKFVKLVAEAGLYAHLRIGPYVCAEWNFGGFPLWLHFIPGIQFRTDNEPFKAEMQRF 154
DL +F+KL E GLYA +R PY+CAEW FGGFP WL PG + R++N + + +
Sbjct: 71 LDLERFLKLAQELGLYAIVRPSPYICAEWEFGGFPAWLLNEPG-RMRSNNPTYLKHVAEY 129
Query: 155 TAKIVDMMKQEKLYASQGGPIILSQIENEYGNI--DSAYGAAGKSYIKWAAGMALSLDTG 212
+++ + +L + GG I++ QIENEYG+ + AY A + + A +
Sbjct: 130 YDVLMEKIVPHQL--ANGGNILMIQIENEYGSFGEEKAYLRAIRDLMIARGVTAPFFTSD 187
Query: 213 VPW--VMCQQSDAPDPIINTCN---------GFYCDQFTPNSNNKPKMWTENWSGWFLSF 261
PW + S D I+ T N G F + P M E W GWF +
Sbjct: 188 GPWRATLRAGSMIEDDILVTGNFGSKAKENFGMMQAFFEEHGKKWPLMCMEFWDGWFNRW 247
Query: 262 GGAVPYRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNFDRTSGG--------PFISTSYD 313
+ R ++LA +V G N YM+HGGTNF +G P I TSYD
Sbjct: 248 KEPIIKRDPQELAESVREALALGSI--NLYMFHGGTNFGFMNGCSARGTIDLPQI-TSYD 304
Query: 314 YDAPLDEYGLIRQPKWGHLKDLHK 337
YDAPLDE G + + K LH+
Sbjct: 305 YDAPLDEQGNPTEKYFALQKMLHE 328
>gi|298376422|ref|ZP_06986377.1| beta-galactosidase (Lactase) [Bacteroides sp. 3_1_19]
gi|298266300|gb|EFI07958.1| beta-galactosidase (Lactase) [Bacteroides sp. 3_1_19]
Length = 768
Score = 167 bits (424), Expect = 2e-38, Method: Compositional matrix adjust.
Identities = 108/325 (33%), Positives = 161/325 (49%), Gaps = 31/325 (9%)
Query: 36 IGGKRRVLISGSIHYPRSTPEMWPDLIQKSKDGGLDVIETYVFWNLHEPVRNQYNFEGRY 95
+ GK ++SG +HYPR + W ++ + GL+ + TYVFWNLHE +++FEG
Sbjct: 39 VNGKVTPILSGEMHYPRIPHQYWRHRLRMMRAMGLNTVATYVFWNLHETEPGKWDFEGDK 98
Query: 96 DLVKFVKLVAEAGLYAHLRIGPYVCAEWNFGGFPLWLHFIPGIQFRTDNEPFKAEMQRFT 155
+L +++++ E GL LR GPYVCAEW FGG+P WL IPG++ R DN F + +
Sbjct: 99 NLAEYIRIAGEEGLMVILRPGPYVCAEWEFGGYPWWLQNIPGMEIRRDNPEFLKRTKLYI 158
Query: 156 AKIVDMMKQEKLYASQGGPIILSQIENEYGNI-----------DSAYGAAGKSYIKWAAG 204
K+ + + L S+GGPII+ Q ENE+G+ Y A K + AG
Sbjct: 159 DKLYEQVGD--LQVSKGGPIIMVQAENEFGSYVAQRKDIPLEEHRRYNAKIKRQLA-DAG 215
Query: 205 MALSLDTGVPWVMCQQSDAPDPIINTCNGFYCDQFTPNSNNK------PKMWTENWSGWF 258
+ L T + + P + T NG + N+ P M E + GW
Sbjct: 216 FNVPLFTSDGSWLFEGGSTPG-ALPTANGESNVENLKKVVNEYHGGVGPYMVAEFYPGWL 274
Query: 259 LSFGGAVPYRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNFDRTSGGPFIS--------T 310
+ + P +A + Q +F N+YM HGGTNF TSG + T
Sbjct: 275 MHWAEPFPDISDSGIARQTETYLQNDVSF-NFYMVHGGTNFGFTSGANYDKKHDIQPDLT 333
Query: 311 SYDYDAPLDEYGLIRQPKWGHLKDL 335
SYDYDAP+ E G + PK+ ++++
Sbjct: 334 SYDYDAPISEAGWV-TPKFDSIRNV 357
>gi|307275736|ref|ZP_07556876.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX2134]
gi|307277830|ref|ZP_07558914.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX0860]
gi|307291757|ref|ZP_07571629.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX0411]
gi|422685752|ref|ZP_16743965.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX4000]
gi|422720681|ref|ZP_16777290.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX0017]
gi|422739238|ref|ZP_16794421.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX2141]
gi|306497209|gb|EFM66754.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX0411]
gi|306505227|gb|EFM74413.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX0860]
gi|306507612|gb|EFM76742.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX2134]
gi|315029464|gb|EFT41396.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX4000]
gi|315032072|gb|EFT44004.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX0017]
gi|315144900|gb|EFT88916.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX2141]
Length = 604
Score = 167 bits (424), Expect = 2e-38, Method: Compositional matrix adjust.
Identities = 115/324 (35%), Positives = 163/324 (50%), Gaps = 27/324 (8%)
Query: 35 VIGGKRRVLISGSIHYPRSTPEMWPDLIQKSKDGGLDVIETYVFWNLHEPVRNQYNFEGR 94
++ G+ ++SG+IHY R P W + K G + +ETYV WNLHEP + ++FEG
Sbjct: 21 LLNGQPFKILSGAIHYFRVDPSDWHHSLYNLKALGFNTVETYVPWNLHEPQKGTFHFEGI 80
Query: 95 YDLVKFVKLVAEAGLYAHLRIGPYVCAEWNFGGFPLWLHFIPGIQFRTDNEPFKAEMQRF 154
DL +F+KL E GLYA +R PY+CAEW FGGFP WL PG + R++N + + +
Sbjct: 81 LDLERFLKLAQELGLYAIVRPSPYICAEWEFGGFPAWLLNEPG-RMRSNNPTYLKHVAEY 139
Query: 155 TAKIVDMMKQEKLYASQGGPIILSQIENEYGNI--DSAYGAAGKSYIKWAAGMALSLDTG 212
+++ + +L + GG I++ QIENEYG+ + AY A + + A +
Sbjct: 140 YDVLMEKIVPHQL--ANGGNILMIQIENEYGSFGEEKAYLRAIRDLMIARGVTAPFFTSD 197
Query: 213 VPW--VMCQQSDAPDPIINTCN---------GFYCDQFTPNSNNKPKMWTENWSGWFLSF 261
PW + S D I+ T N G F + P M E W GWF +
Sbjct: 198 GPWRATLRAGSMIEDDILVTGNFGSKAKENFGMMQAFFEEHGKKWPLMCMEFWDGWFNRW 257
Query: 262 GGAVPYRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNFDRTSGG--------PFISTSYD 313
+ R ++LA +V G N YM+HGGTNF +G P I TSYD
Sbjct: 258 KEPIIKRDPQELAESVREALALGSI--NLYMFHGGTNFGFMNGCSARGTIDLPQI-TSYD 314
Query: 314 YDAPLDEYGLIRQPKWGHLKDLHK 337
YDAPLDE G + + K LH+
Sbjct: 315 YDAPLDEQGNPTEKYFALQKMLHE 338
>gi|255975619|ref|ZP_05426205.1| beta-galactosidase [Enterococcus faecalis T2]
gi|256619294|ref|ZP_05476140.1| beta-galactosidase [Enterococcus faecalis ATCC 4200]
gi|256853354|ref|ZP_05558724.1| glycosyl hydrolase, family 35 [Enterococcus faecalis T8]
gi|421514060|ref|ZP_15960775.1| Beta-galactosidase 3 [Enterococcus faecalis ATCC 29212]
gi|255968491|gb|EET99113.1| beta-galactosidase [Enterococcus faecalis T2]
gi|256598821|gb|EEU17997.1| beta-galactosidase [Enterococcus faecalis ATCC 4200]
gi|256711813|gb|EEU26851.1| glycosyl hydrolase, family 35 [Enterococcus faecalis T8]
gi|401672857|gb|EJS79300.1| Beta-galactosidase 3 [Enterococcus faecalis ATCC 29212]
Length = 594
Score = 167 bits (424), Expect = 2e-38, Method: Compositional matrix adjust.
Identities = 115/324 (35%), Positives = 163/324 (50%), Gaps = 27/324 (8%)
Query: 35 VIGGKRRVLISGSIHYPRSTPEMWPDLIQKSKDGGLDVIETYVFWNLHEPVRNQYNFEGR 94
++ G+ ++SG+IHY R P W + K G + +ETYV WNLHEP + ++FEG
Sbjct: 11 LLNGQPFKILSGAIHYFRVDPSDWHHSLYNLKALGFNTVETYVPWNLHEPQKGTFHFEGI 70
Query: 95 YDLVKFVKLVAEAGLYAHLRIGPYVCAEWNFGGFPLWLHFIPGIQFRTDNEPFKAEMQRF 154
DL +F+KL E GLYA +R PY+CAEW FGGFP WL PG + R++N + + +
Sbjct: 71 LDLERFLKLAQELGLYAIVRPSPYICAEWEFGGFPAWLLNEPG-RMRSNNPTYLKHVAEY 129
Query: 155 TAKIVDMMKQEKLYASQGGPIILSQIENEYGNI--DSAYGAAGKSYIKWAAGMALSLDTG 212
+++ + +L + GG I++ QIENEYG+ + AY A + + A +
Sbjct: 130 YDVLMEKIVPHQL--ANGGNILMIQIENEYGSFGEEKAYLRAIRDLMIARGVTAPFFTSD 187
Query: 213 VPW--VMCQQSDAPDPIINTCN---------GFYCDQFTPNSNNKPKMWTENWSGWFLSF 261
PW + S D I+ T N G F + P M E W GWF +
Sbjct: 188 GPWRATLRAGSMIEDDILVTGNFGSKAKENFGMMQAFFEEHGKKWPLMCMEFWDGWFNRW 247
Query: 262 GGAVPYRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNFDRTSGG--------PFISTSYD 313
+ R ++LA +V G N YM+HGGTNF +G P I TSYD
Sbjct: 248 KEPIIKRDPQELAESVREALALGSI--NLYMFHGGTNFGFMNGCSARGTIDLPQI-TSYD 304
Query: 314 YDAPLDEYGLIRQPKWGHLKDLHK 337
YDAPLDE G + + K LH+
Sbjct: 305 YDAPLDEQGNPTEKYFALQKMLHE 328
>gi|423331257|ref|ZP_17309041.1| hypothetical protein HMPREF1075_01054 [Parabacteroides distasonis
CL03T12C09]
gi|409230553|gb|EKN23415.1| hypothetical protein HMPREF1075_01054 [Parabacteroides distasonis
CL03T12C09]
Length = 768
Score = 167 bits (424), Expect = 2e-38, Method: Compositional matrix adjust.
Identities = 108/325 (33%), Positives = 161/325 (49%), Gaps = 31/325 (9%)
Query: 36 IGGKRRVLISGSIHYPRSTPEMWPDLIQKSKDGGLDVIETYVFWNLHEPVRNQYNFEGRY 95
+ GK ++SG +HYPR + W ++ + GL+ + TYVFWNLHE +++FEG
Sbjct: 39 VNGKVTPILSGEMHYPRIPHQYWRHRLRMMRAMGLNTVATYVFWNLHETEPGKWDFEGDK 98
Query: 96 DLVKFVKLVAEAGLYAHLRIGPYVCAEWNFGGFPLWLHFIPGIQFRTDNEPFKAEMQRFT 155
+L +++++ E GL LR GPYVCAEW FGG+P WL IPG++ R DN F + +
Sbjct: 99 NLAEYIRIAGEEGLMVILRPGPYVCAEWEFGGYPWWLQNIPGMEIRRDNPEFLKRTKLYI 158
Query: 156 AKIVDMMKQEKLYASQGGPIILSQIENEYGNI-----------DSAYGAAGKSYIKWAAG 204
K+ + + L S+GGPII+ Q ENE+G+ Y A K + AG
Sbjct: 159 DKLYEQVGD--LQVSKGGPIIMVQAENEFGSYVAQRKDIPLEEHRRYNAKIKRQLA-DAG 215
Query: 205 MALSLDTGVPWVMCQQSDAPDPIINTCNGFYCDQFTPNSNNK------PKMWTENWSGWF 258
+ L T + + P + T NG + N+ P M E + GW
Sbjct: 216 FNVPLFTSDGSWLFEGGSTPG-ALPTANGESNVENLKKVVNEYHGGVGPYMVAEFYPGWL 274
Query: 259 LSFGGAVPYRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNFDRTSGGPFIS--------T 310
+ + P +A + Q +F N+YM HGGTNF TSG + T
Sbjct: 275 MHWAEPFPDISDSGIARQTETYLQNDVSF-NFYMVHGGTNFGFTSGANYDKKHDIQPDLT 333
Query: 311 SYDYDAPLDEYGLIRQPKWGHLKDL 335
SYDYDAP+ E G + PK+ ++++
Sbjct: 334 SYDYDAPISEAGWV-TPKFDSIRNV 357
>gi|53715536|ref|YP_101528.1| beta-galactosidase [Bacteroides fragilis YCH46]
gi|60683489|ref|YP_213633.1| beta-galactosidase [Bacteroides fragilis NCTC 9343]
gi|375360299|ref|YP_005113071.1| putative beta-galactosidase [Bacteroides fragilis 638R]
gi|423280737|ref|ZP_17259649.1| hypothetical protein HMPREF1203_03866 [Bacteroides fragilis HMW
610]
gi|52218401|dbj|BAD50994.1| beta-galactosidase precursor [Bacteroides fragilis YCH46]
gi|60494923|emb|CAH09735.1| putative beta-galactosidase [Bacteroides fragilis NCTC 9343]
gi|301164980|emb|CBW24544.1| putative beta-galactosidase [Bacteroides fragilis 638R]
gi|404583944|gb|EKA88617.1| hypothetical protein HMPREF1203_03866 [Bacteroides fragilis HMW
610]
Length = 624
Score = 167 bits (424), Expect = 2e-38, Method: Compositional matrix adjust.
Identities = 113/336 (33%), Positives = 164/336 (48%), Gaps = 46/336 (13%)
Query: 38 GKRRVLISGSIHYPRSTPEMWPDLIQKSKDGGLDVIETYVFWNLHEPVRNQYNFEGRYDL 97
G+ ++SG +HY R + W +Q K GL+ + TYVFWNLHE +++F G +L
Sbjct: 35 GETTPILSGEMHYARIPHQYWRHRLQMMKGMGLNTVATYVFWNLHEVEPGKWDFSGDKNL 94
Query: 98 VKFVKLVAEAGLYAHLRIGPYVCAEWNFGGFPLWLHFIPGIQFRTDNEPFKAEMQRFTAK 157
+++++ E G+ LR GPYVCAEW FGG+P WL IPG++ R DN E ++T K
Sbjct: 95 AEYIRIAGEEGMMVILRPGPYVCAEWEFGGYPWWLQNIPGMEIRRDN----TEFLKYTKK 150
Query: 158 IVDMMKQE--KLYASQGGPIILSQIENEYGNIDS-----------AYGAAGKSYIKWAAG 204
+D + QE L ++GGPII+ Q ENE+G+ S +Y A K + A
Sbjct: 151 YIDRLYQEVGPLQCTKGGPIIMVQCENEFGSYVSQRKDISFEEHRSYNAKIKGQLADAGF 210
Query: 205 MALSLDTGVPWVM---CQQSDAPDPIINTCNGF--------YCDQFTPNSNNKPKMWTEN 253
+ W+ C P T NG +Q+ + P M E
Sbjct: 211 TVPLFTSDGSWLFEGGCVAGALP-----TANGESDIANLKKVVNQY--HGGKGPYMVAEF 263
Query: 254 WSGWFLSFGGAVPYRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNFDRTSGGPFIS---- 309
+ GW +G P ++A + Q +F N+YM HGGTNF TSG +
Sbjct: 264 YPGWLSHWGEPFPQVSASEIARQTEAYLQNDVSF-NFYMVHGGTNFGFTSGANYDKKRDI 322
Query: 310 ----TSYDYDAPLDEYGLIRQPKWGHLKD-LHKAIK 340
TSYDYDAP+ E G I PK+ ++ + K +K
Sbjct: 323 QPDLTSYDYDAPISEAGWI-TPKYDSIRSVIQKYVK 357
>gi|255015104|ref|ZP_05287230.1| beta-glycosidase [Bacteroides sp. 2_1_7]
gi|410104527|ref|ZP_11299440.1| hypothetical protein HMPREF0999_03212 [Parabacteroides sp. D25]
gi|409234336|gb|EKN27166.1| hypothetical protein HMPREF0999_03212 [Parabacteroides sp. D25]
Length = 768
Score = 167 bits (424), Expect = 2e-38, Method: Compositional matrix adjust.
Identities = 108/325 (33%), Positives = 161/325 (49%), Gaps = 31/325 (9%)
Query: 36 IGGKRRVLISGSIHYPRSTPEMWPDLIQKSKDGGLDVIETYVFWNLHEPVRNQYNFEGRY 95
+ GK ++SG +HYPR + W ++ + GL+ + TYVFWNLHE +++FEG
Sbjct: 39 VNGKVTPILSGEMHYPRIPHQYWRHRLRMMRAMGLNTVATYVFWNLHETEPGKWDFEGDK 98
Query: 96 DLVKFVKLVAEAGLYAHLRIGPYVCAEWNFGGFPLWLHFIPGIQFRTDNEPFKAEMQRFT 155
+L +++++ E GL LR GPYVCAEW FGG+P WL IPG++ R DN F + +
Sbjct: 99 NLAEYIRIAGEEGLMVILRPGPYVCAEWEFGGYPWWLQNIPGMEIRRDNPEFLKRTKLYI 158
Query: 156 AKIVDMMKQEKLYASQGGPIILSQIENEYGNI-----------DSAYGAAGKSYIKWAAG 204
K+ + + L S+GGPII+ Q ENE+G+ Y A K + AG
Sbjct: 159 DKLYEQVGD--LQVSKGGPIIMVQAENEFGSYVAQRKDIPLEEHRRYNAKIKRQLA-DAG 215
Query: 205 MALSLDTGVPWVMCQQSDAPDPIINTCNGFYCDQFTPNSNNK------PKMWTENWSGWF 258
+ L T + + P + T NG + N+ P M E + GW
Sbjct: 216 FNVPLFTSDGSWLFEGGSTPGA-LPTANGESNVENLKKVVNEYHGGVGPYMVAEFYPGWL 274
Query: 259 LSFGGAVPYRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNFDRTSGGPFIS--------T 310
+ + P +A + Q +F N+YM HGGTNF TSG + T
Sbjct: 275 MHWAEPFPDISDSGIARQTETYLQNDVSF-NFYMVHGGTNFGFTSGANYDKKHDIQPDLT 333
Query: 311 SYDYDAPLDEYGLIRQPKWGHLKDL 335
SYDYDAP+ E G + PK+ ++++
Sbjct: 334 SYDYDAPISEAGWV-TPKFDSIRNV 357
Score = 46.2 bits (108), Expect = 0.085, Method: Compositional matrix adjust.
Identities = 66/267 (24%), Positives = 106/267 (39%), Gaps = 63/267 (23%)
Query: 463 PGLLEQINTTADQSDYLWYSLSTNIKADEPLLEDGSKTVLHVQSLGHALHAFINGKLVGS 522
P EQ+N Y+ YS N +PL K L + L +++G+ VG
Sbjct: 404 PLTFEQLNQGYG---YVLYSTHFN----QPL-----KGRLEIPGLRDYATIYVDGERVGE 451
Query: 523 GYGSSSNAKVTVDFPIALAPGKNTFDLLSLTVGLQNYGAFYEKTGAGITGPVQLKGSGNG 582
+ + +D P T D+L +G NYG + GI V++ GS
Sbjct: 452 LNRCFNQYAMEIDIPF-----NATLDILVENMGRINYGEEIVRNTKGIISSVKINGS--- 503
Query: 583 TNIDLSSQQWTYQTGLKGEELNFPSGSSTQ-WDSKSTLPKLQPL----VWYKTTFDAPAG 637
++S + Y+ + P+ S + + K+ P++ L V Y+ TF +
Sbjct: 504 ---EISDWK-MYKLPMD----RMPALVSDEPYVYKNGSPEVAALGNKPVLYEGTFHL-SD 554
Query: 638 SEPVAIDFTGMGKGEAWVNGQSIGRYWPTYVSQNGGCTDSCNYRGAYSSNKCLKNCGKPS 697
+ ID GKG ++NG +IGRYW Y G P
Sbjct: 555 TGDTFIDMEDWGKGIIFINGVNIGRYW---------------YAG-------------PQ 586
Query: 698 QSLYHVPRSWLKSSGNTLVLFEEIGGD 724
Q+LY +P WL N +V++E++ D
Sbjct: 587 QTLY-IPGVWLNKGENKIVIYEQLNND 612
>gi|150008152|ref|YP_001302895.1| beta-glycosidase [Parabacteroides distasonis ATCC 8503]
gi|149936576|gb|ABR43273.1| glycoside hydrolase family 35, candidate beta-glycosidase
[Parabacteroides distasonis ATCC 8503]
Length = 768
Score = 167 bits (424), Expect = 2e-38, Method: Compositional matrix adjust.
Identities = 108/325 (33%), Positives = 161/325 (49%), Gaps = 31/325 (9%)
Query: 36 IGGKRRVLISGSIHYPRSTPEMWPDLIQKSKDGGLDVIETYVFWNLHEPVRNQYNFEGRY 95
+ GK ++SG +HYPR + W ++ + GL+ + TYVFWNLHE +++FEG
Sbjct: 39 VNGKVTPILSGEMHYPRIPHQYWRHRLRMMRAMGLNTVATYVFWNLHETEPGKWDFEGDK 98
Query: 96 DLVKFVKLVAEAGLYAHLRIGPYVCAEWNFGGFPLWLHFIPGIQFRTDNEPFKAEMQRFT 155
+L +++++ E GL LR GPYVCAEW FGG+P WL IPG++ R DN F + +
Sbjct: 99 NLAEYIRIAGEEGLMVILRPGPYVCAEWEFGGYPWWLQNIPGMEIRRDNPEFLKRTKLYI 158
Query: 156 AKIVDMMKQEKLYASQGGPIILSQIENEYGNI-----------DSAYGAAGKSYIKWAAG 204
K+ + + L S+GGPII+ Q ENE+G+ Y A K + AG
Sbjct: 159 DKLYEQVGD--LQVSKGGPIIMVQAENEFGSYVAQRKDIPLEEHRRYNAKIKRQLA-DAG 215
Query: 205 MALSLDTGVPWVMCQQSDAPDPIINTCNGFYCDQFTPNSNNK------PKMWTENWSGWF 258
+ L T + + P + T NG + N+ P M E + GW
Sbjct: 216 FNVPLFTSDGSWLFEGGSTPGA-LPTANGESNVENLKKVVNEYHGGVGPYMVAEFYPGWL 274
Query: 259 LSFGGAVPYRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNFDRTSGGPFIS--------T 310
+ + P +A + Q +F N+YM HGGTNF TSG + T
Sbjct: 275 MHWAEPFPDISDSGIARQTETYLQNDVSF-NFYMVHGGTNFGFTSGANYDKKHDIQPDLT 333
Query: 311 SYDYDAPLDEYGLIRQPKWGHLKDL 335
SYDYDAP+ E G + PK+ ++++
Sbjct: 334 SYDYDAPISEAGWV-TPKFDSIRNV 357
Score = 45.8 bits (107), Expect = 0.095, Method: Compositional matrix adjust.
Identities = 65/266 (24%), Positives = 99/266 (37%), Gaps = 61/266 (22%)
Query: 463 PGLLEQINTTADQSDYLWYSLSTNIKADEPLLEDGSKTVLHVQSLGHALHAFINGKLVGS 522
P EQ+N Y+ YS N +PL K L + L +++G+ VG
Sbjct: 404 PLTFEQLNQGYG---YVLYSTHFN----QPL-----KGRLEIPGLRDYATIYVDGERVGE 451
Query: 523 GYGSSSNAKVTVDFPIALAPGKNTFDLLSLTVGLQNYGAFYEKTGAGITGPVQLKGSGNG 582
+ + +D P T D+L +G NYG + GI V++ GS
Sbjct: 452 LNRCFNQYAMEIDIPF-----NATLDILVENMGRINYGEEIVRNTKGIISSVKINGSEIS 506
Query: 583 TNIDLSSQQWTYQTGLKGEELNFPSGSSTQWDSKSTLPKLQPL----VWYKTTFDAPAGS 638
+ GE + +GS P++ L V Y+ TF + +
Sbjct: 507 DWKMYKLPMDRMPALVSGEPYVYKNGS----------PEVAALGNKPVLYEGTFHL-SDT 555
Query: 639 EPVAIDFTGMGKGEAWVNGQSIGRYWPTYVSQNGGCTDSCNYRGAYSSNKCLKNCGKPSQ 698
ID GKG ++NG +IGRYW Y G P Q
Sbjct: 556 GDTFIDMEDWGKGIIFINGVNIGRYW---------------YAG-------------PQQ 587
Query: 699 SLYHVPRSWLKSSGNTLVLFEEIGGD 724
+LY +P WL N +V++E++ D
Sbjct: 588 TLY-IPGVWLNKGENKIVIYEQLNND 612
>gi|265767790|ref|ZP_06095322.1| beta-galactosidase [Bacteroides sp. 2_1_16]
gi|263252462|gb|EEZ23990.1| beta-galactosidase [Bacteroides sp. 2_1_16]
Length = 628
Score = 167 bits (424), Expect = 2e-38, Method: Compositional matrix adjust.
Identities = 117/359 (32%), Positives = 171/359 (47%), Gaps = 42/359 (11%)
Query: 38 GKRRVLISGSIHYPRSTPEMWPDLIQKSKDGGLDVIETYVFWNLHEPVRNQYNFEGRYDL 97
GK ++SG +HY R + W +Q K GL+ + TYVFWNLHEP +++F G +L
Sbjct: 38 GKITPVLSGEMHYARIPHQYWRHRLQMMKGMGLNTVATYVFWNLHEPEPGKWDFTGDKNL 97
Query: 98 VKFVKLVAEAGLYAHLRIGPYVCAEWNFGGFPLWLHFIPGIQFRTDNEPFKAEMQRFTAK 157
+F+K E G+ LR GPYVCAEW FGG+P WL + G++ R DN E ++T
Sbjct: 98 AEFIKTAGEEGMMVILRPGPYVCAEWEFGGYPWWLQNVKGMEIRRDN----PEFLKYTKA 153
Query: 158 IVDMMKQE--KLYASQGGPIILSQIENEYGNI-----------DSAYGAAGKSYIKWAAG 204
+D + +E L ++GGPI++ Q ENE+G+ AY A K + A
Sbjct: 154 YIDRLYKEVGSLQCTKGGPIVMVQCENEFGSYVAQRKDIPLEEHRAYNAKIKQQLADAGF 213
Query: 205 MALSLDTGVPWVMCQQSDAPDPIINTCNGF--------YCDQFTPNSNNKPKMWTENWSG 256
+ W+ + A + T NG DQ+ + P M E + G
Sbjct: 214 NVPLFTSDGSWLF--EGGATPGALPTANGESDIENLKKVVDQY--HDGKGPYMVAEFYPG 269
Query: 257 WFLSFGGAVPYRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNFDRTSGGPFIS------- 309
W + P +A ++ Q +F N+YM HGGTNF TSG +
Sbjct: 270 WLSHWAEPFPQIGASGIARQTEKYLQNDVSF-NFYMVHGGTNFGFTSGANYDKKRDIQPD 328
Query: 310 -TSYDYDAPLDEYGLIRQPKWGHLKD-LHKAIKLCEAALVATDPT--YPSLGPNLEATV 364
TSYDYDAP+ E G + PK+ +++ + K +K A +P PS+ N A V
Sbjct: 329 MTSYDYDAPISEAGWV-TPKYDSIRNVIKKYVKYTIPEAPAPNPVIEIPSIQLNKVADV 386
Score = 50.8 bits (120), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 57/229 (24%), Positives = 87/229 (37%), Gaps = 50/229 (21%)
Query: 502 LHVQSLGHALHAFINGKLVGSGYGSSSNAKVTVDFPIALAPGKNTFDLLSLTVGLQNYGA 561
L + L +++G+ VG ++ V ++ P T +L +G NYG+
Sbjct: 428 LEIPGLRDYAVVYVDGEQVGVLNRNTKTYSVEIEVPF-----NATLQILVENMGRINYGS 482
Query: 562 FYEKTGAGITGPVQLKGSGNGTNIDLSSQQWTYQTGLKGEELNFPSGSSTQWDSKSTLP- 620
GI PVQ+ G D+ YQ + P + + D+ +P
Sbjct: 483 EIVHNTKGIISPVQIAGKEIVGGWDM------YQLPMD----EMPDLTKLKADTHKNVPS 532
Query: 621 ---KLQPL-VWYKTTFDAPAGSEPVAIDFTGMGKGEAWVNGQSIGRYWPTYVSQNGGCTD 676
KL+ V Y+ TF + +D GKG +VNG +IGRYW
Sbjct: 533 EVAKLKGCPVLYEGTFTLDKVGD-TFMDMESWGKGIVFVNGVNIGRYWKV---------- 581
Query: 677 SCNYRGAYSSNKCLKNCGKPSQSLYHVPRSWLKSSGNTLVLFEEIGGDP 725
P Q+LY VP WLK N +V+FE++ P
Sbjct: 582 ------------------GPQQTLY-VPGVWLKKGENKIVIFEQLNETP 611
>gi|375360076|ref|YP_005112848.1| putative exported beta-galactosidase [Bacteroides fragilis 638R]
gi|383119863|ref|ZP_09940600.1| hypothetical protein BSHG_4164 [Bacteroides sp. 3_2_5]
gi|251944025|gb|EES84544.1| hypothetical protein BSHG_4164 [Bacteroides sp. 3_2_5]
gi|301164757|emb|CBW24316.1| putative exported beta-galactosidase [Bacteroides fragilis 638R]
Length = 628
Score = 167 bits (424), Expect = 2e-38, Method: Compositional matrix adjust.
Identities = 117/359 (32%), Positives = 171/359 (47%), Gaps = 42/359 (11%)
Query: 38 GKRRVLISGSIHYPRSTPEMWPDLIQKSKDGGLDVIETYVFWNLHEPVRNQYNFEGRYDL 97
GK ++SG +HY R + W +Q K GL+ + TYVFWNLHEP +++F G +L
Sbjct: 38 GKITPVLSGEMHYARIPHQYWRHRLQMMKGMGLNTVATYVFWNLHEPEPGKWDFTGDKNL 97
Query: 98 VKFVKLVAEAGLYAHLRIGPYVCAEWNFGGFPLWLHFIPGIQFRTDNEPFKAEMQRFTAK 157
+F+K E G+ LR GPYVCAEW FGG+P WL + G++ R DN E ++T
Sbjct: 98 AEFIKTAGEEGMMVILRPGPYVCAEWEFGGYPWWLQNVKGMEIRRDN----PEFLKYTKA 153
Query: 158 IVDMMKQE--KLYASQGGPIILSQIENEYGNI-----------DSAYGAAGKSYIKWAAG 204
+D + +E L ++GGPI++ Q ENE+G+ AY A K + A
Sbjct: 154 YIDRLYKEVGSLQCTKGGPIVMVQCENEFGSYVAQRKDIPLEEHRAYNAKIKQQLADAGF 213
Query: 205 MALSLDTGVPWVMCQQSDAPDPIINTCNGF--------YCDQFTPNSNNKPKMWTENWSG 256
+ W+ + A + T NG DQ+ + P M E + G
Sbjct: 214 NVPLFTSDGSWLF--EGGATPGALPTANGESDIENLKKVVDQY--HDGKGPYMVAEFYPG 269
Query: 257 WFLSFGGAVPYRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNFDRTSGGPFIS------- 309
W + P +A ++ Q +F N+YM HGGTNF TSG +
Sbjct: 270 WLSHWAEPFPQIGASGIARQTEKYLQNDVSF-NFYMVHGGTNFGFTSGANYDKKRDIQPD 328
Query: 310 -TSYDYDAPLDEYGLIRQPKWGHLKD-LHKAIKLCEAALVATDPT--YPSLGPNLEATV 364
TSYDYDAP+ E G + PK+ +++ + K +K A +P PS+ N A V
Sbjct: 329 MTSYDYDAPISEAGWV-TPKYDSIRNVIKKYVKYTIPEAPAPNPVIEIPSIQLNKVADV 386
Score = 49.7 bits (117), Expect = 0.007, Method: Compositional matrix adjust.
Identities = 57/228 (25%), Positives = 86/228 (37%), Gaps = 48/228 (21%)
Query: 502 LHVQSLGHALHAFINGKLVGSGYGSSSNAKVTVDFPIALAPGKNTFDLLSLTVGLQNYGA 561
L + L +++G+ VG ++ + ++ P T +L +G NYG+
Sbjct: 428 LEIPGLRDYAVVYVDGEQVGVLNRNTKTYSMEIEVPF-----NATLQILVENMGRINYGS 482
Query: 562 FYEKTGAGITGPVQLKGS---GNGTNIDLSSQQWTYQTGLKGEELNFPSGSSTQWDSKST 618
GI PVQ+ G G L + T LK + T + S
Sbjct: 483 EIVHNTKGIISPVQIAGKEIVGGWDMYQLPMDEMPDLTKLKAD---------THKNVPSE 533
Query: 619 LPKLQPL-VWYKTTFDAPAGSEPVAIDFTGMGKGEAWVNGQSIGRYWPTYVSQNGGCTDS 677
+ KL+ V Y+ TF + +D GKG +VNG +IGRYW
Sbjct: 534 VAKLKGCPVLYEGTFTLDKVGD-TFMDMESWGKGIVFVNGVNIGRYWKV----------- 581
Query: 678 CNYRGAYSSNKCLKNCGKPSQSLYHVPRSWLKSSGNTLVLFEEIGGDP 725
P Q+LY VP WLK N +V+FE++ P
Sbjct: 582 -----------------GPQQTLY-VPGVWLKKGENKIVIFEQLNETP 611
>gi|255972505|ref|ZP_05423091.1| beta-galactosidase [Enterococcus faecalis T1]
gi|257422333|ref|ZP_05599323.1| glycosyl hydrolase [Enterococcus faecalis X98]
gi|255963523|gb|EET95999.1| beta-galactosidase [Enterococcus faecalis T1]
gi|257164157|gb|EEU94117.1| glycosyl hydrolase [Enterococcus faecalis X98]
Length = 594
Score = 167 bits (424), Expect = 2e-38, Method: Compositional matrix adjust.
Identities = 115/324 (35%), Positives = 163/324 (50%), Gaps = 27/324 (8%)
Query: 35 VIGGKRRVLISGSIHYPRSTPEMWPDLIQKSKDGGLDVIETYVFWNLHEPVRNQYNFEGR 94
++ G+ ++SG+IHY R P W + K G + +ETYV WNLHEP + ++FEG
Sbjct: 11 LLNGQPFKILSGAIHYFRVDPSDWHHSLYNLKALGFNTVETYVPWNLHEPQKGTFHFEGI 70
Query: 95 YDLVKFVKLVAEAGLYAHLRIGPYVCAEWNFGGFPLWLHFIPGIQFRTDNEPFKAEMQRF 154
DL +F+KL E GLYA +R PY+CAEW FGGFP WL PG + R++N + + +
Sbjct: 71 LDLERFLKLAQELGLYAIVRPSPYICAEWEFGGFPAWLLNEPG-RMRSNNPTYLKHVAEY 129
Query: 155 TAKIVDMMKQEKLYASQGGPIILSQIENEYGNI--DSAYGAAGKSYIKWAAGMALSLDTG 212
+++ + +L + GG I++ QIENEYG+ + AY A + + A +
Sbjct: 130 YDVLMEKIVPHQL--ANGGNILMIQIENEYGSFGEEKAYLRAIRDLMIARGVTAPFFTSD 187
Query: 213 VPW--VMCQQSDAPDPIINTCN---------GFYCDQFTPNSNNKPKMWTENWSGWFLSF 261
PW + S D I+ T N G F + P M E W GWF +
Sbjct: 188 GPWRATLRAGSMIEDDILVTGNFGSKAKENFGMMQAFFEEHGKKWPLMCMEFWDGWFNRW 247
Query: 262 GGAVPYRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNFDRTSGG--------PFISTSYD 313
+ R ++LA +V G N YM+HGGTNF +G P I TSYD
Sbjct: 248 KEPIIKRDPQELAESVREALALGSI--NLYMFHGGTNFGFMNGCSARGTIDLPQI-TSYD 304
Query: 314 YDAPLDEYGLIRQPKWGHLKDLHK 337
YDAPLDE G + + K LH+
Sbjct: 305 YDAPLDEQGNPTEKYFALQKMLHE 328
>gi|256840666|ref|ZP_05546174.1| glycoside hydrolase, family 35 [Parabacteroides sp. D13]
gi|256737938|gb|EEU51264.1| glycoside hydrolase, family 35 [Parabacteroides sp. D13]
Length = 768
Score = 167 bits (423), Expect = 2e-38, Method: Compositional matrix adjust.
Identities = 108/325 (33%), Positives = 161/325 (49%), Gaps = 31/325 (9%)
Query: 36 IGGKRRVLISGSIHYPRSTPEMWPDLIQKSKDGGLDVIETYVFWNLHEPVRNQYNFEGRY 95
+ GK ++SG +HYPR + W ++ + GL+ + TYVFWNLHE +++FEG
Sbjct: 39 VNGKVTPILSGEMHYPRIPHQYWRHRLRMMRAMGLNTVATYVFWNLHETEPGKWDFEGDK 98
Query: 96 DLVKFVKLVAEAGLYAHLRIGPYVCAEWNFGGFPLWLHFIPGIQFRTDNEPFKAEMQRFT 155
+L +++++ E GL LR GPYVCAEW FGG+P WL IPG++ R DN F + +
Sbjct: 99 NLAEYIRIAGEEGLMVILRPGPYVCAEWEFGGYPWWLQNIPGMEIRRDNPEFLKRTKLYI 158
Query: 156 AKIVDMMKQEKLYASQGGPIILSQIENEYGNI-----------DSAYGAAGKSYIKWAAG 204
K+ + + L S+GGPII+ Q ENE+G+ Y A K + AG
Sbjct: 159 DKLYEQVGD--LQVSKGGPIIMVQAENEFGSYVAQRKDIPLEEHRRYNAKIKRQLA-DAG 215
Query: 205 MALSLDTGVPWVMCQQSDAPDPIINTCNGFYCDQFTPNSNNK------PKMWTENWSGWF 258
+ L T + + P + T NG + N+ P M E + GW
Sbjct: 216 FNVPLFTSDGSWLFEGGSTPG-ALPTANGESNVENLKKVVNEYHGGVGPYMVAEFYPGWL 274
Query: 259 LSFGGAVPYRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNFDRTSGGPFIS--------T 310
+ + P +A + Q +F N+YM HGGTNF TSG + T
Sbjct: 275 MHWAEPFPDISDSGIARQTETYLQNDVSF-NFYMVHGGTNFGFTSGANYDKKHDIQPDLT 333
Query: 311 SYDYDAPLDEYGLIRQPKWGHLKDL 335
SYDYDAP+ E G + PK+ ++++
Sbjct: 334 SYDYDAPISEAGWV-TPKFDSIRNV 357
>gi|423260402|ref|ZP_17241324.1| hypothetical protein HMPREF1055_03601 [Bacteroides fragilis
CL07T00C01]
gi|423266536|ref|ZP_17245538.1| hypothetical protein HMPREF1056_03225 [Bacteroides fragilis
CL07T12C05]
gi|387774956|gb|EIK37065.1| hypothetical protein HMPREF1055_03601 [Bacteroides fragilis
CL07T00C01]
gi|392699768|gb|EIY92937.1| hypothetical protein HMPREF1056_03225 [Bacteroides fragilis
CL07T12C05]
Length = 624
Score = 167 bits (423), Expect = 2e-38, Method: Compositional matrix adjust.
Identities = 113/336 (33%), Positives = 164/336 (48%), Gaps = 46/336 (13%)
Query: 38 GKRRVLISGSIHYPRSTPEMWPDLIQKSKDGGLDVIETYVFWNLHEPVRNQYNFEGRYDL 97
G+ ++SG +HY R + W +Q K GL+ + TYVFWNLHE +++F G +L
Sbjct: 35 GETTPILSGEMHYARIPHQYWRHRLQMMKGMGLNTVATYVFWNLHEVEPGKWDFSGDKNL 94
Query: 98 VKFVKLVAEAGLYAHLRIGPYVCAEWNFGGFPLWLHFIPGIQFRTDNEPFKAEMQRFTAK 157
+++++ E G+ LR GPYVCAEW FGG+P WL IPG++ R DN E ++T K
Sbjct: 95 AEYIRIAGEEGMMVILRPGPYVCAEWEFGGYPWWLQNIPGMEIRRDN----TEFLKYTKK 150
Query: 158 IVDMMKQE--KLYASQGGPIILSQIENEYGNIDS-----------AYGAAGKSYIKWAAG 204
+D + QE L ++GGPII+ Q ENE+G+ S +Y A K + A
Sbjct: 151 YIDRLYQEVGPLQCTKGGPIIMVQCENEFGSYVSQRKDISFEEHRSYNAKIKGQLADAGF 210
Query: 205 MALSLDTGVPWVM---CQQSDAPDPIINTCNGF--------YCDQFTPNSNNKPKMWTEN 253
+ W+ C P T NG +Q+ + P M E
Sbjct: 211 TVPLFTSDGSWLFEGGCVAGALP-----TANGESDIANLKKVVNQY--HGGKGPYMVAEF 263
Query: 254 WSGWFLSFGGAVPYRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNFDRTSGGPFIS---- 309
+ GW +G P ++A + Q +F N+YM HGGTNF TSG +
Sbjct: 264 YPGWLSHWGEPFPQVSASEIARQTEAYLQNDVSF-NFYMVHGGTNFGFTSGANYDKKRDI 322
Query: 310 ----TSYDYDAPLDEYGLIRQPKWGHLKD-LHKAIK 340
TSYDYDAP+ E G I PK+ ++ + K +K
Sbjct: 323 QPDLTSYDYDAPISEAGWI-TPKYDSIRSVIQKYVK 357
>gi|307269354|ref|ZP_07550702.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX4248]
gi|306514322|gb|EFM82889.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX4248]
Length = 604
Score = 167 bits (423), Expect = 2e-38, Method: Compositional matrix adjust.
Identities = 115/324 (35%), Positives = 163/324 (50%), Gaps = 27/324 (8%)
Query: 35 VIGGKRRVLISGSIHYPRSTPEMWPDLIQKSKDGGLDVIETYVFWNLHEPVRNQYNFEGR 94
++ G+ ++SG+IHY R P W + K G + +ETYV WNLHEP + ++FEG
Sbjct: 21 LLNGQPFKILSGAIHYFRVDPSDWYHSLYNLKALGFNTVETYVPWNLHEPQKGTFHFEGI 80
Query: 95 YDLVKFVKLVAEAGLYAHLRIGPYVCAEWNFGGFPLWLHFIPGIQFRTDNEPFKAEMQRF 154
DL +F+KL E GLYA +R PY+CAEW FGGFP WL PG + R++N + + +
Sbjct: 81 LDLERFLKLAQELGLYAIVRPSPYICAEWEFGGFPAWLLNEPG-RMRSNNPTYLKHVAEY 139
Query: 155 TAKIVDMMKQEKLYASQGGPIILSQIENEYGNI--DSAYGAAGKSYIKWAAGMALSLDTG 212
+++ + +L + GG I++ QIENEYG+ + AY A + + A +
Sbjct: 140 YDVLMEKIVPHQL--ANGGNILMIQIENEYGSFGEEKAYLRAIRDLMIARGVTAPFFTSD 197
Query: 213 VPW--VMCQQSDAPDPIINTCN---------GFYCDQFTPNSNNKPKMWTENWSGWFLSF 261
PW + S D I+ T N G F + P M E W GWF +
Sbjct: 198 GPWRATLRAGSMIEDDILVTGNFGSKAKENFGMMQAFFEEHGKKWPLMCMEFWDGWFNRW 257
Query: 262 GGAVPYRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNFDRTSGG--------PFISTSYD 313
+ R ++LA +V G N YM+HGGTNF +G P I TSYD
Sbjct: 258 KEPIIKRDPQELAESVREALALGSI--NLYMFHGGTNFGFMNGCSARGTIDLPQI-TSYD 314
Query: 314 YDAPLDEYGLIRQPKWGHLKDLHK 337
YDAPLDE G + + K LH+
Sbjct: 315 YDAPLDEQGNPTEKYFALQKMLHE 338
>gi|60683238|ref|YP_213382.1| beta-galactosidase [Bacteroides fragilis NCTC 9343]
gi|60494672|emb|CAH09473.1| putative exported beta-galactosidase [Bacteroides fragilis NCTC
9343]
Length = 628
Score = 167 bits (423), Expect = 2e-38, Method: Compositional matrix adjust.
Identities = 117/359 (32%), Positives = 171/359 (47%), Gaps = 42/359 (11%)
Query: 38 GKRRVLISGSIHYPRSTPEMWPDLIQKSKDGGLDVIETYVFWNLHEPVRNQYNFEGRYDL 97
GK ++SG +HY R + W +Q K GL+ + TYVFWNLHEP +++F G +L
Sbjct: 38 GKITPVLSGEMHYARIPHQYWRHRLQMMKGMGLNTVATYVFWNLHEPEPGKWDFTGDKNL 97
Query: 98 VKFVKLVAEAGLYAHLRIGPYVCAEWNFGGFPLWLHFIPGIQFRTDNEPFKAEMQRFTAK 157
+F+K E G+ LR GPYVCAEW FGG+P WL + G++ R DN E ++T
Sbjct: 98 AEFIKTAGEEGMMVILRPGPYVCAEWEFGGYPWWLQNVKGMEIRRDN----PEFLKYTKA 153
Query: 158 IVDMMKQE--KLYASQGGPIILSQIENEYGNI-----------DSAYGAAGKSYIKWAAG 204
+D + +E L ++GGPI++ Q ENE+G+ AY A K + A
Sbjct: 154 YIDRLYKEVGSLQCTKGGPIVMVQCENEFGSYVAQRKDIPLEEHRAYNAKIKQQLADAGF 213
Query: 205 MALSLDTGVPWVMCQQSDAPDPIINTCNGF--------YCDQFTPNSNNKPKMWTENWSG 256
+ W+ + A + T NG DQ+ + P M E + G
Sbjct: 214 NVPLFTSDGSWLF--EGGATPGALPTANGESDIENLKKVVDQY--HDGKGPYMVAEFYPG 269
Query: 257 WFLSFGGAVPYRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNFDRTSGGPFIS------- 309
W + P +A ++ Q +F N+YM HGGTNF TSG +
Sbjct: 270 WLSHWAEPFPQIGASGIARQTEKYLQNDVSF-NFYMVHGGTNFGFTSGANYDKKRDIQPD 328
Query: 310 -TSYDYDAPLDEYGLIRQPKWGHLKD-LHKAIKLCEAALVATDPT--YPSLGPNLEATV 364
TSYDYDAP+ E G + PK+ +++ + K +K A +P PS+ N A V
Sbjct: 329 MTSYDYDAPISEAGWV-TPKYDSIRNVIKKYVKYTIPEAPAPNPVIEIPSIQLNKVADV 386
Score = 49.7 bits (117), Expect = 0.007, Method: Compositional matrix adjust.
Identities = 58/229 (25%), Positives = 90/229 (39%), Gaps = 50/229 (21%)
Query: 502 LHVQSLGHALHAFINGKLVGSGYGSSSNAKVTVDFPIALAPGKNTFDLLSLTVGLQNYGA 561
L + L +++G+ VG ++ + ++ P T +L +G NYG+
Sbjct: 428 LEIPGLRDYAVVYVDGEQVGVLNRNTKTYSMEIEVPF-----NATLQILVENMGRINYGS 482
Query: 562 FYEKTGAGITGPVQLKGSGNGTNIDLSSQQWTYQTGLKGEELNFPSGSSTQWDSKSTLP- 620
GI PVQ+ G D+ YQ L +E+ P + + D+ +P
Sbjct: 483 EIVHNTKGIISPVQIAGKEIVGGWDM------YQ--LPMDEM--PDLTKLKADTHKNVPS 532
Query: 621 ---KLQPL-VWYKTTFDAPAGSEPVAIDFTGMGKGEAWVNGQSIGRYWPTYVSQNGGCTD 676
KL+ V Y+ TF + +D GKG +VNG +IGRYW
Sbjct: 533 EVAKLKGCPVLYEGTFTLDKVGD-TFMDMESWGKGIVFVNGVNIGRYWKV---------- 581
Query: 677 SCNYRGAYSSNKCLKNCGKPSQSLYHVPRSWLKSSGNTLVLFEEIGGDP 725
P Q+LY VP WLK N +V+FE++ P
Sbjct: 582 ------------------GPQQTLY-VPGVWLKKGENKIVIFEQLNETP 611
>gi|312901788|ref|ZP_07761056.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX0470]
gi|311291123|gb|EFQ69679.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX0470]
Length = 604
Score = 167 bits (423), Expect = 2e-38, Method: Compositional matrix adjust.
Identities = 115/324 (35%), Positives = 163/324 (50%), Gaps = 27/324 (8%)
Query: 35 VIGGKRRVLISGSIHYPRSTPEMWPDLIQKSKDGGLDVIETYVFWNLHEPVRNQYNFEGR 94
++ G+ ++SG+IHY R P W + K G + +ETYV WNLHEP + ++FEG
Sbjct: 21 LLNGQPFKILSGAIHYFRVDPSDWYHSLYNLKALGFNTVETYVPWNLHEPQKGTFHFEGI 80
Query: 95 YDLVKFVKLVAEAGLYAHLRIGPYVCAEWNFGGFPLWLHFIPGIQFRTDNEPFKAEMQRF 154
DL +F+KL E GLYA +R PY+CAEW FGGFP WL PG + R++N + + +
Sbjct: 81 LDLERFLKLAQELGLYAIVRPSPYICAEWEFGGFPAWLLNEPG-RMRSNNPTYLKHVAEY 139
Query: 155 TAKIVDMMKQEKLYASQGGPIILSQIENEYGNI--DSAYGAAGKSYIKWAAGMALSLDTG 212
+++ + +L + GG I++ QIENEYG+ + AY A + + A +
Sbjct: 140 YDVLMEKIVPHQL--ANGGNILMIQIENEYGSFGEEKAYLRAIRDLMIARGVTAPFFTSD 197
Query: 213 VPW--VMCQQSDAPDPIINTCN---------GFYCDQFTPNSNNKPKMWTENWSGWFLSF 261
PW + S D I+ T N G F + P M E W GWF +
Sbjct: 198 GPWRATLRAGSMIEDDILVTGNFGSKAKENFGMMQAFFEEHGKKWPLMCMEFWDGWFNRW 257
Query: 262 GGAVPYRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNFDRTSGG--------PFISTSYD 313
+ R ++LA +V G N YM+HGGTNF +G P I TSYD
Sbjct: 258 KEPIIKRDPQELAESVREALALGSI--NLYMFHGGTNFGFMNGCSARGTIDLPQI-TSYD 314
Query: 314 YDAPLDEYGLIRQPKWGHLKDLHK 337
YDAPLDE G + + K LH+
Sbjct: 315 YDAPLDEQGNPTEKYFALQKMLHE 338
>gi|329962091|ref|ZP_08300102.1| putative beta-galactosidase [Bacteroides fluxus YIT 12057]
gi|328530739|gb|EGF57597.1| putative beta-galactosidase [Bacteroides fluxus YIT 12057]
Length = 632
Score = 167 bits (423), Expect = 2e-38, Method: Compositional matrix adjust.
Identities = 114/329 (34%), Positives = 164/329 (49%), Gaps = 33/329 (10%)
Query: 35 VIGGKRRVLISGSIHYPRSTPEMWPDLIQKSKDGGLDVIETYVFWNLHEPVRNQYNFEGR 94
V GK +ISG +HY R + W ++ K GL+ + TYVFWNLHEP +++F G
Sbjct: 35 VYDGKAIRIISGEMHYARIPHQYWRHRMKMLKAMGLNAVATYVFWNLHEPEPGKWDFSGD 94
Query: 95 YDLVKFVKLVAEAGLYAHLRIGPYVCAEWNFGGFPLWLHFIPGIQFRTDNEPFKAEMQRF 154
+L +++++ E GL LR GPYVCAEW FGG+P WL + G++ R DNE F ++
Sbjct: 95 RNLAEYIRIAGEEGLMVILRPGPYVCAEWEFGGYPWWLQNVEGMELRRDNEQF----LKY 150
Query: 155 TAKIVDMMKQE--KLYASQGGPIILSQIENEYGNIDS-----------AYGAAGKSYIKW 201
T ++ + +E KL +QGGPII+ Q ENE+G+ S AY A +K
Sbjct: 151 TKLYLERLYKEVGKLQITQGGPIIMVQGENEFGSYVSQRKDITLEEHRAYNAKIIKQLK- 209
Query: 202 AAGMALSLDTGVPWVMCQQSDAPD--PIINTCNGFYCDQFTPNSNN---KPKMWTENWSG 256
G + + T + + P P N N + N N P M E + G
Sbjct: 210 EVGFDVPMFTSDGSWLFEGGYVPGALPTANGENNIENLKKVVNQYNGGQGPYMVAEFYPG 269
Query: 257 WFLSFGGAVPYRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNFDRTSGGPFIS------- 309
W + P +A ++ G +F NYYM HGGTNF TSG +
Sbjct: 270 WLAHWCEPHPQVKASTIARQTEKYLANGVSF-NYYMVHGGTNFGFTSGANYDKKHDIQPD 328
Query: 310 -TSYDYDAPLDEYGLIRQPKWGHLKDLHK 337
TSYDYDAP+ E G + PK+ ++++ K
Sbjct: 329 LTSYDYDAPISEAGWV-TPKFDSIRNVIK 356
Score = 44.3 bits (103), Expect = 0.29, Method: Compositional matrix adjust.
Identities = 55/224 (24%), Positives = 88/224 (39%), Gaps = 47/224 (20%)
Query: 502 LHVQSLGHALHAFINGKLVGSGYGSSSNAKVTVDFPIALAPGKNTFDLLSLTVGLQNYGA 561
L ++ L +++G+ VG + N K ++D I P ++L +G NYG+
Sbjct: 428 LTIEGLRDYATVYVDGEFVGRL--NRYNKKYSMDIEI---PFNGNLEILVENMGRINYGS 482
Query: 562 FYEKTGAGITGPVQLKGSGNGTNIDLSSQQWTYQTGLKGEEL----NFPSGSSTQWDSKS 617
GI PV++ + + +W T L E+ P+ + T S
Sbjct: 483 EIVHNNKGIISPVKI-------DDNFIEGEWE-MTKLPMSEVPAFEKMPANTVTSIMGSS 534
Query: 618 TLPKLQPLVWYKTTFDAPAGSEPVAIDFTGMGKGEAWVNGQSIGRYWPTYVSQNGGCTDS 677
+ YK TF + +D GKG +VNG +IGRYW Q G
Sbjct: 535 ANALVGKPSLYKGTFTLQETGD-TFLDMKDWGKGIVFVNGINIGRYW-----QVG----- 583
Query: 678 CNYRGAYSSNKCLKNCGKPSQSLYHVPRSWLKSSGNTLVLFEEI 721
P Q+L+ VP WLK N +V+F+++
Sbjct: 584 ------------------PQQTLF-VPGVWLKKGINEIVIFDQL 608
>gi|336412039|ref|ZP_08592497.1| hypothetical protein HMPREF1018_04515 [Bacteroides sp. 2_1_56FAA]
gi|423261296|ref|ZP_17242197.1| hypothetical protein HMPREF1055_04474 [Bacteroides fragilis
CL07T00C01]
gi|423267821|ref|ZP_17246801.1| hypothetical protein HMPREF1056_04488 [Bacteroides fragilis
CL07T12C05]
gi|423272270|ref|ZP_17251238.1| hypothetical protein HMPREF1079_04320 [Bacteroides fragilis
CL05T00C42]
gi|423276726|ref|ZP_17255658.1| hypothetical protein HMPREF1080_04311 [Bacteroides fragilis
CL05T12C13]
gi|423283105|ref|ZP_17261990.1| hypothetical protein HMPREF1204_01528 [Bacteroides fragilis HMW
615]
gi|335939211|gb|EGN01088.1| hypothetical protein HMPREF1018_04515 [Bacteroides sp. 2_1_56FAA]
gi|387774329|gb|EIK36442.1| hypothetical protein HMPREF1055_04474 [Bacteroides fragilis
CL07T00C01]
gi|392695462|gb|EIY88674.1| hypothetical protein HMPREF1079_04320 [Bacteroides fragilis
CL05T00C42]
gi|392695591|gb|EIY88799.1| hypothetical protein HMPREF1056_04488 [Bacteroides fragilis
CL07T12C05]
gi|392696055|gb|EIY89256.1| hypothetical protein HMPREF1080_04311 [Bacteroides fragilis
CL05T12C13]
gi|404581379|gb|EKA86078.1| hypothetical protein HMPREF1204_01528 [Bacteroides fragilis HMW
615]
Length = 628
Score = 167 bits (423), Expect = 2e-38, Method: Compositional matrix adjust.
Identities = 117/359 (32%), Positives = 171/359 (47%), Gaps = 42/359 (11%)
Query: 38 GKRRVLISGSIHYPRSTPEMWPDLIQKSKDGGLDVIETYVFWNLHEPVRNQYNFEGRYDL 97
GK ++SG +HY R + W +Q K GL+ + TYVFWNLHEP +++F G +L
Sbjct: 38 GKITPVLSGEMHYARIPHQYWRHRLQMMKGMGLNTVATYVFWNLHEPEPGKWDFTGDKNL 97
Query: 98 VKFVKLVAEAGLYAHLRIGPYVCAEWNFGGFPLWLHFIPGIQFRTDNEPFKAEMQRFTAK 157
+F+K E G+ LR GPYVCAEW FGG+P WL + G++ R DN E ++T
Sbjct: 98 AEFIKTAGEEGMMVILRPGPYVCAEWEFGGYPWWLQNVKGMEIRRDN----PEFLKYTKA 153
Query: 158 IVDMMKQE--KLYASQGGPIILSQIENEYGNI-----------DSAYGAAGKSYIKWAAG 204
+D + +E L ++GGPI++ Q ENE+G+ AY A K + A
Sbjct: 154 YIDRLYKEVGSLQCTKGGPIVMVQCENEFGSYVAQRKDIPLEEHRAYNAKIKQQLADAGF 213
Query: 205 MALSLDTGVPWVMCQQSDAPDPIINTCNGF--------YCDQFTPNSNNKPKMWTENWSG 256
+ W+ + A + T NG DQ+ + P M E + G
Sbjct: 214 NVPLFTSDGSWLF--EGGATPGALPTANGESDIENLKKVVDQY--HDGKGPYMVAEFYPG 269
Query: 257 WFLSFGGAVPYRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNFDRTSGGPFIS------- 309
W + P +A ++ Q +F N+YM HGGTNF TSG +
Sbjct: 270 WLSHWAEPFPQIGASGIARQTEKYLQNDVSF-NFYMVHGGTNFGFTSGANYDKKRDIQPD 328
Query: 310 -TSYDYDAPLDEYGLIRQPKWGHLKD-LHKAIKLCEAALVATDPT--YPSLGPNLEATV 364
TSYDYDAP+ E G + PK+ +++ + K +K A +P PS+ N A V
Sbjct: 329 MTSYDYDAPISEAGWV-TPKYDSIRNVIKKYVKYTIPEAPAPNPVIEIPSIQLNKVADV 386
Score = 49.3 bits (116), Expect = 0.010, Method: Compositional matrix adjust.
Identities = 55/229 (24%), Positives = 87/229 (37%), Gaps = 50/229 (21%)
Query: 502 LHVQSLGHALHAFINGKLVGSGYGSSSNAKVTVDFPIALAPGKNTFDLLSLTVGLQNYGA 561
L + L +++G+ VG ++ + ++ P T +L +G NYG+
Sbjct: 428 LEIPGLRDYAVVYVDGEQVGVLNRNTKTYSMEIEVPF-----NATLQILVENMGRINYGS 482
Query: 562 FYEKTGAGITGPVQLKGSGNGTNIDLSSQQWTYQTGLKGEELNFPSGSSTQWDSKSTLP- 620
GI PVQ+ G D+ YQ + P + + D+ +P
Sbjct: 483 EIVHNTKGIISPVQIAGKEIVGGWDM------YQLPMD----EMPDLTKLKADTHKNVPS 532
Query: 621 ---KLQPL-VWYKTTFDAPAGSEPVAIDFTGMGKGEAWVNGQSIGRYWPTYVSQNGGCTD 676
KL+ V Y+ TF + +D GKG +VNG +IGRYW
Sbjct: 533 EVAKLKGCPVLYEGTFTLDKVGD-TFMDMESWGKGIVFVNGVNIGRYWKV---------- 581
Query: 677 SCNYRGAYSSNKCLKNCGKPSQSLYHVPRSWLKSSGNTLVLFEEIGGDP 725
P Q+LY +P WLK N +V+FE++ P
Sbjct: 582 ------------------GPQQTLY-IPGVWLKKGENKIVIFEQLNETP 611
>gi|422866702|ref|ZP_16913314.1| putative beta-galactosidase [Enterococcus faecalis TX1467]
gi|329578150|gb|EGG59560.1| putative beta-galactosidase [Enterococcus faecalis TX1467]
Length = 604
Score = 167 bits (423), Expect = 2e-38, Method: Compositional matrix adjust.
Identities = 115/324 (35%), Positives = 163/324 (50%), Gaps = 27/324 (8%)
Query: 35 VIGGKRRVLISGSIHYPRSTPEMWPDLIQKSKDGGLDVIETYVFWNLHEPVRNQYNFEGR 94
++ G+ ++SG+IHY R P W + K G + +ETYV WNLHEP + ++FEG
Sbjct: 21 LLNGQPFKILSGAIHYFRVDPSDWYHSLYNLKALGFNTVETYVPWNLHEPQKGTFHFEGI 80
Query: 95 YDLVKFVKLVAEAGLYAHLRIGPYVCAEWNFGGFPLWLHFIPGIQFRTDNEPFKAEMQRF 154
DL +F+KL E GLYA +R PY+CAEW FGGFP WL PG + R++N + + +
Sbjct: 81 LDLERFLKLAQELGLYAIVRPSPYICAEWEFGGFPAWLLNEPG-RMRSNNPTYLKHVAEY 139
Query: 155 TAKIVDMMKQEKLYASQGGPIILSQIENEYGNI--DSAYGAAGKSYIKWAAGMALSLDTG 212
+++ + +L + GG I++ QIENEYG+ + AY A + + A +
Sbjct: 140 YDVLMEKIVPHQL--ANGGNILMIQIENEYGSFGEEKAYLRAIRDLMIARGVTAPFFTSD 197
Query: 213 VPW--VMCQQSDAPDPIINTCN---------GFYCDQFTPNSNNKPKMWTENWSGWFLSF 261
PW + S D I+ T N G F + P M E W GWF +
Sbjct: 198 GPWRATLRAGSMIEDDILVTGNFGSKAKENFGMMQAFFEEHGKKWPLMCMEFWDGWFNRW 257
Query: 262 GGAVPYRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNFDRTSGG--------PFISTSYD 313
+ R ++LA +V G N YM+HGGTNF +G P I TSYD
Sbjct: 258 KEPIIKRDPQELAESVREALALGSI--NLYMFHGGTNFGFMNGCSARGTIDLPQI-TSYD 314
Query: 314 YDAPLDEYGLIRQPKWGHLKDLHK 337
YDAPLDE G + + K LH+
Sbjct: 315 YDAPLDEQGNPTEKYFALQKMLHE 338
>gi|422695218|ref|ZP_16753206.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX4244]
gi|315147501|gb|EFT91517.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX4244]
Length = 604
Score = 167 bits (423), Expect = 2e-38, Method: Compositional matrix adjust.
Identities = 115/324 (35%), Positives = 163/324 (50%), Gaps = 27/324 (8%)
Query: 35 VIGGKRRVLISGSIHYPRSTPEMWPDLIQKSKDGGLDVIETYVFWNLHEPVRNQYNFEGR 94
++ G+ ++SG+IHY R P W + K G + +ETYV WNLHEP + ++FEG
Sbjct: 21 LLNGQPFKILSGAIHYFRVDPSDWYHSLYNLKALGFNTVETYVPWNLHEPQKGTFHFEGI 80
Query: 95 YDLVKFVKLVAEAGLYAHLRIGPYVCAEWNFGGFPLWLHFIPGIQFRTDNEPFKAEMQRF 154
DL +F+KL E GLYA +R PY+CAEW FGGFP WL PG + R++N + + +
Sbjct: 81 LDLERFLKLAQELGLYAIVRPSPYICAEWEFGGFPAWLLNEPG-RMRSNNPTYLKHVAEY 139
Query: 155 TAKIVDMMKQEKLYASQGGPIILSQIENEYGNI--DSAYGAAGKSYIKWAAGMALSLDTG 212
+++ + +L + GG I++ QIENEYG+ + AY A + + A +
Sbjct: 140 YDVLMEKIVPHQL--ANGGNILMIQIENEYGSFGEEKAYLRAIRDLMIARGVTAPFFTSD 197
Query: 213 VPW--VMCQQSDAPDPIINTCN---------GFYCDQFTPNSNNKPKMWTENWSGWFLSF 261
PW + S D I+ T N G F + P M E W GWF +
Sbjct: 198 GPWRATLRAGSMIEDDILVTGNFGSKAKENFGMMQAFFEEHGKKWPLMCMEFWDGWFNRW 257
Query: 262 GGAVPYRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNFDRTSGG--------PFISTSYD 313
+ R ++LA +V G N YM+HGGTNF +G P I TSYD
Sbjct: 258 KEPIIKRDPQELAESVREALALGSI--NLYMFHGGTNFGFMNGCSARGTIDLPQI-TSYD 314
Query: 314 YDAPLDEYGLIRQPKWGHLKDLHK 337
YDAPLDE G + + K LH+
Sbjct: 315 YDAPLDEQGNPTEKYFALQKMLHE 338
>gi|301309736|ref|ZP_07215675.1| beta-galactosidase (Lactase) [Bacteroides sp. 20_3]
gi|423340209|ref|ZP_17317948.1| hypothetical protein HMPREF1059_03873 [Parabacteroides distasonis
CL09T03C24]
gi|300831310|gb|EFK61941.1| beta-galactosidase (Lactase) [Bacteroides sp. 20_3]
gi|409227644|gb|EKN20540.1| hypothetical protein HMPREF1059_03873 [Parabacteroides distasonis
CL09T03C24]
Length = 765
Score = 167 bits (423), Expect = 2e-38, Method: Compositional matrix adjust.
Identities = 108/325 (33%), Positives = 161/325 (49%), Gaps = 31/325 (9%)
Query: 36 IGGKRRVLISGSIHYPRSTPEMWPDLIQKSKDGGLDVIETYVFWNLHEPVRNQYNFEGRY 95
+ GK ++SG +HYPR + W ++ + GL+ + TYVFWNLHE +++FEG
Sbjct: 36 VNGKVTPILSGEMHYPRIPHQYWRHRLRMMRAMGLNTVATYVFWNLHETEPGKWDFEGDK 95
Query: 96 DLVKFVKLVAEAGLYAHLRIGPYVCAEWNFGGFPLWLHFIPGIQFRTDNEPFKAEMQRFT 155
+L +++++ E GL LR GPYVCAEW FGG+P WL IPG++ R DN F + +
Sbjct: 96 NLAEYIRIAGEEGLMVILRPGPYVCAEWEFGGYPWWLQNIPGMEIRRDNPEFLKRTKLYI 155
Query: 156 AKIVDMMKQEKLYASQGGPIILSQIENEYGNI-----------DSAYGAAGKSYIKWAAG 204
K+ + + L S+GGPII+ Q ENE+G+ Y A K + AG
Sbjct: 156 DKLYEQVGD--LQVSKGGPIIMVQAENEFGSYVAQRKDIPLEEHRRYNAKIKRQLA-DAG 212
Query: 205 MALSLDTGVPWVMCQQSDAPDPIINTCNGFYCDQFTPNSNNK------PKMWTENWSGWF 258
+ L T + + P + T NG + N+ P M E + GW
Sbjct: 213 FNVPLFTSDGSWLFEGGSTPG-ALPTANGESNVENLKKVVNEYHGGVGPYMVAEFYPGWL 271
Query: 259 LSFGGAVPYRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNFDRTSGGPFIS--------T 310
+ + P +A + Q +F N+YM HGGTNF TSG + T
Sbjct: 272 MHWAEPFPDISDSGIARQTETYLQNDVSF-NFYMVHGGTNFGFTSGANYDKKHDIQPDLT 330
Query: 311 SYDYDAPLDEYGLIRQPKWGHLKDL 335
SYDYDAP+ E G + PK+ ++++
Sbjct: 331 SYDYDAPISEAGWV-TPKFDSIRNV 354
>gi|307289344|ref|ZP_07569299.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX0109]
gi|422704713|ref|ZP_16762523.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX1302]
gi|306499711|gb|EFM69073.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX0109]
gi|315163744|gb|EFU07761.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX1302]
Length = 604
Score = 167 bits (423), Expect = 2e-38, Method: Compositional matrix adjust.
Identities = 115/324 (35%), Positives = 163/324 (50%), Gaps = 27/324 (8%)
Query: 35 VIGGKRRVLISGSIHYPRSTPEMWPDLIQKSKDGGLDVIETYVFWNLHEPVRNQYNFEGR 94
++ G+ ++SG+IHY R P W + K G + +ETYV WNLHEP + ++FEG
Sbjct: 21 LLNGQPFKILSGAIHYFRVDPSDWYHSLYNLKALGFNTVETYVPWNLHEPQKGTFHFEGI 80
Query: 95 YDLVKFVKLVAEAGLYAHLRIGPYVCAEWNFGGFPLWLHFIPGIQFRTDNEPFKAEMQRF 154
DL +F+KL E GLYA +R PY+CAEW FGGFP WL PG + R++N + + +
Sbjct: 81 LDLERFLKLAQELGLYAIVRPSPYICAEWEFGGFPAWLLNEPG-RMRSNNPTYLKHVAEY 139
Query: 155 TAKIVDMMKQEKLYASQGGPIILSQIENEYGNI--DSAYGAAGKSYIKWAAGMALSLDTG 212
+++ + +L + GG I++ QIENEYG+ + AY A + + A +
Sbjct: 140 YDVLMEKIVPHQL--ANGGNILMIQIENEYGSFGEEKAYLRAIRDLMIARGVTAPFFTSD 197
Query: 213 VPW--VMCQQSDAPDPIINTCN---------GFYCDQFTPNSNNKPKMWTENWSGWFLSF 261
PW + S D I+ T N G F + P M E W GWF +
Sbjct: 198 GPWRATLRAGSMIEDDILVTGNFGSKAKENFGMMQAFFEEHGKKWPLMCMEFWDGWFNRW 257
Query: 262 GGAVPYRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNFDRTSGG--------PFISTSYD 313
+ R ++LA +V G N YM+HGGTNF +G P I TSYD
Sbjct: 258 KEPIIKRDPQELAESVREALALGSI--NLYMFHGGTNFGFMNGCSARGTIDLPQI-TSYD 314
Query: 314 YDAPLDEYGLIRQPKWGHLKDLHK 337
YDAPLDE G + + K LH+
Sbjct: 315 YDAPLDEQGNPTEKYFALQKMLHE 338
>gi|257082326|ref|ZP_05576687.1| beta-galactosidase [Enterococcus faecalis E1Sol]
gi|256990356|gb|EEU77658.1| beta-galactosidase [Enterococcus faecalis E1Sol]
Length = 594
Score = 167 bits (423), Expect = 2e-38, Method: Compositional matrix adjust.
Identities = 115/324 (35%), Positives = 163/324 (50%), Gaps = 27/324 (8%)
Query: 35 VIGGKRRVLISGSIHYPRSTPEMWPDLIQKSKDGGLDVIETYVFWNLHEPVRNQYNFEGR 94
++ G+ ++SG+IHY R P W + K G + +ETYV WNLHEP + ++FEG
Sbjct: 11 LLNGQPFKILSGAIHYFRVDPSDWYHSLYNLKALGFNTVETYVPWNLHEPQKGTFHFEGI 70
Query: 95 YDLVKFVKLVAEAGLYAHLRIGPYVCAEWNFGGFPLWLHFIPGIQFRTDNEPFKAEMQRF 154
DL +F+KL E GLYA +R PY+CAEW FGGFP WL PG + R++N + + +
Sbjct: 71 LDLERFLKLAQELGLYAIVRPSPYICAEWEFGGFPAWLLNEPG-RMRSNNPTYLKHVAEY 129
Query: 155 TAKIVDMMKQEKLYASQGGPIILSQIENEYGNI--DSAYGAAGKSYIKWAAGMALSLDTG 212
+++ + +L + GG I++ QIENEYG+ + AY A + + A +
Sbjct: 130 YDVLMEKIVPHQL--ANGGNILMIQIENEYGSFGEEKAYLRAIRDLMIARGVTAPFFTSD 187
Query: 213 VPW--VMCQQSDAPDPIINTCN---------GFYCDQFTPNSNNKPKMWTENWSGWFLSF 261
PW + S D I+ T N G F + P M E W GWF +
Sbjct: 188 GPWRATLRAGSMIEDDILVTGNFGSKAKENFGMMQAFFEEHGKKWPLMCMEFWDGWFNRW 247
Query: 262 GGAVPYRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNFDRTSGG--------PFISTSYD 313
+ R ++LA +V G N YM+HGGTNF +G P I TSYD
Sbjct: 248 KEPIIKRDPQELAESVREALALGSI--NLYMFHGGTNFGFMNGCSARGTIDLPQI-TSYD 304
Query: 314 YDAPLDEYGLIRQPKWGHLKDLHK 337
YDAPLDE G + + K LH+
Sbjct: 305 YDAPLDEQGNPTEKYFALQKMLHE 328
>gi|256762786|ref|ZP_05503366.1| beta-galactosidase [Enterococcus faecalis T3]
gi|256684037|gb|EEU23732.1| beta-galactosidase [Enterococcus faecalis T3]
Length = 594
Score = 167 bits (423), Expect = 2e-38, Method: Compositional matrix adjust.
Identities = 115/324 (35%), Positives = 163/324 (50%), Gaps = 27/324 (8%)
Query: 35 VIGGKRRVLISGSIHYPRSTPEMWPDLIQKSKDGGLDVIETYVFWNLHEPVRNQYNFEGR 94
++ G+ ++SG+IHY R P W + K G + +ETYV WNLHEP + ++FEG
Sbjct: 11 LLNGQPFKILSGAIHYFRVDPSDWYHSLYNLKALGFNTVETYVPWNLHEPQKGTFHFEGI 70
Query: 95 YDLVKFVKLVAEAGLYAHLRIGPYVCAEWNFGGFPLWLHFIPGIQFRTDNEPFKAEMQRF 154
DL +F+KL E GLYA +R PY+CAEW FGGFP WL PG + R++N + + +
Sbjct: 71 LDLERFLKLAQELGLYAIVRPSPYICAEWEFGGFPAWLLNEPG-RMRSNNPTYLKHVAEY 129
Query: 155 TAKIVDMMKQEKLYASQGGPIILSQIENEYGNI--DSAYGAAGKSYIKWAAGMALSLDTG 212
+++ + +L + GG I++ QIENEYG+ + AY A + + A +
Sbjct: 130 YDVLMEKIVPHQL--ANGGNILMIQIENEYGSFGEEKAYLRAIRDLMIARGVTAPFFTSD 187
Query: 213 VPW--VMCQQSDAPDPIINTCN---------GFYCDQFTPNSNNKPKMWTENWSGWFLSF 261
PW + S D I+ T N G F + P M E W GWF +
Sbjct: 188 GPWRATLRAGSMIEDDILVTGNFGSKAKENFGMMQAFFEEHGKKWPLMCMEFWDGWFNRW 247
Query: 262 GGAVPYRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNFDRTSGG--------PFISTSYD 313
+ R ++LA +V G N YM+HGGTNF +G P I TSYD
Sbjct: 248 KEPIIKRDPQELAESVREALALGSI--NLYMFHGGTNFGFMNGCSARGTIDLPQI-TSYD 304
Query: 314 YDAPLDEYGLIRQPKWGHLKDLHK 337
YDAPLDE G + + K LH+
Sbjct: 305 YDAPLDEQGNPTEKYFALQKMLHE 328
>gi|384518826|ref|YP_005706131.1| beta-galactosidase [Enterococcus faecalis 62]
gi|323480959|gb|ADX80398.1| beta-galactosidase [Enterococcus faecalis 62]
Length = 594
Score = 167 bits (423), Expect = 2e-38, Method: Compositional matrix adjust.
Identities = 115/324 (35%), Positives = 163/324 (50%), Gaps = 27/324 (8%)
Query: 35 VIGGKRRVLISGSIHYPRSTPEMWPDLIQKSKDGGLDVIETYVFWNLHEPVRNQYNFEGR 94
++ G+ ++SG+IHY R P W + K G + +ETYV WNLHEP + ++FEG
Sbjct: 11 LLNGQPFKILSGAIHYFRVDPSDWYHSLYNLKALGFNTVETYVPWNLHEPQKGTFHFEGI 70
Query: 95 YDLVKFVKLVAEAGLYAHLRIGPYVCAEWNFGGFPLWLHFIPGIQFRTDNEPFKAEMQRF 154
DL +F+KL E GLYA +R PY+CAEW FGGFP WL PG + R++N + + +
Sbjct: 71 LDLERFLKLAQELGLYAIVRPSPYICAEWEFGGFPAWLLNEPG-RMRSNNPTYLKHVAEY 129
Query: 155 TAKIVDMMKQEKLYASQGGPIILSQIENEYGNI--DSAYGAAGKSYIKWAAGMALSLDTG 212
+++ + +L + GG I++ QIENEYG+ + AY A + + A +
Sbjct: 130 YDVLMEKIVPHQL--ANGGNILMIQIENEYGSFGEEKAYLRAIRDLMIARGVTAPFFTSD 187
Query: 213 VPW--VMCQQSDAPDPIINTCN---------GFYCDQFTPNSNNKPKMWTENWSGWFLSF 261
PW + S D I+ T N G F + P M E W GWF +
Sbjct: 188 GPWRATLRAGSMIEDDILVTGNFGSKAKENFGMMQAFFEEHGKKWPLMCMEFWDGWFNRW 247
Query: 262 GGAVPYRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNFDRTSGG--------PFISTSYD 313
+ R ++LA +V G N YM+HGGTNF +G P I TSYD
Sbjct: 248 KEPIIKRDPQELAESVREALALGSI--NLYMFHGGTNFGFMNGCSARGTIDLPQI-TSYD 304
Query: 314 YDAPLDEYGLIRQPKWGHLKDLHK 337
YDAPLDE G + + K LH+
Sbjct: 305 YDAPLDEQGNPTEKYFALQKMLHE 328
>gi|257416321|ref|ZP_05593315.1| beta-galactosidase [Enterococcus faecalis ARO1/DG]
gi|257158149|gb|EEU88109.1| beta-galactosidase [Enterococcus faecalis ARO1/DG]
Length = 594
Score = 167 bits (423), Expect = 2e-38, Method: Compositional matrix adjust.
Identities = 115/324 (35%), Positives = 163/324 (50%), Gaps = 27/324 (8%)
Query: 35 VIGGKRRVLISGSIHYPRSTPEMWPDLIQKSKDGGLDVIETYVFWNLHEPVRNQYNFEGR 94
++ G+ ++SG+IHY R P W + K G + +ETYV WNLHEP + ++FEG
Sbjct: 11 LLNGQPFKILSGAIHYFRVDPSDWYHSLYNLKALGFNTVETYVPWNLHEPQKGTFHFEGI 70
Query: 95 YDLVKFVKLVAEAGLYAHLRIGPYVCAEWNFGGFPLWLHFIPGIQFRTDNEPFKAEMQRF 154
DL +F+KL E GLYA +R PY+CAEW FGGFP WL PG + R++N + + +
Sbjct: 71 LDLERFLKLAQELGLYAIVRPSPYICAEWEFGGFPAWLLNEPG-RMRSNNPTYLKHVAEY 129
Query: 155 TAKIVDMMKQEKLYASQGGPIILSQIENEYGNI--DSAYGAAGKSYIKWAAGMALSLDTG 212
+++ + +L + GG I++ QIENEYG+ + AY A + + A +
Sbjct: 130 YDVLMEKIVPHQL--ANGGNILMIQIENEYGSFGEEKAYLRAIRDLMIARGVTAPFFTSD 187
Query: 213 VPW--VMCQQSDAPDPIINTCN---------GFYCDQFTPNSNNKPKMWTENWSGWFLSF 261
PW + S D I+ T N G F + P M E W GWF +
Sbjct: 188 GPWRATLRAGSMIEDDILVTGNFGSKAKENFGMMQAFFEEHGKKWPLMCMEFWDGWFNRW 247
Query: 262 GGAVPYRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNFDRTSGG--------PFISTSYD 313
+ R ++LA +V G N YM+HGGTNF +G P I TSYD
Sbjct: 248 KEPIIKRDPQELAESVREALALGSI--NLYMFHGGTNFGFMNGCSARGTIDLPQI-TSYD 304
Query: 314 YDAPLDEYGLIRQPKWGHLKDLHK 337
YDAPLDE G + + K LH+
Sbjct: 305 YDAPLDEQGNPTEKYFALQKMLHE 328
>gi|257079244|ref|ZP_05573605.1| beta-galactosidase [Enterococcus faecalis JH1]
gi|294780244|ref|ZP_06745615.1| glycosyl hydrolase family 35 [Enterococcus faecalis PC1.1]
gi|397700110|ref|YP_006537898.1| beta-galactosidase [Enterococcus faecalis D32]
gi|256987274|gb|EEU74576.1| beta-galactosidase [Enterococcus faecalis JH1]
gi|294452672|gb|EFG21103.1| glycosyl hydrolase family 35 [Enterococcus faecalis PC1.1]
gi|397336749|gb|AFO44421.1| beta-galactosidase [Enterococcus faecalis D32]
Length = 594
Score = 167 bits (423), Expect = 2e-38, Method: Compositional matrix adjust.
Identities = 115/324 (35%), Positives = 163/324 (50%), Gaps = 27/324 (8%)
Query: 35 VIGGKRRVLISGSIHYPRSTPEMWPDLIQKSKDGGLDVIETYVFWNLHEPVRNQYNFEGR 94
++ G+ ++SG+IHY R P W + K G + +ETYV WNLHEP + ++FEG
Sbjct: 11 LLNGQPFKILSGAIHYFRVDPSDWYHSLYNLKALGFNTVETYVPWNLHEPQKGTFHFEGI 70
Query: 95 YDLVKFVKLVAEAGLYAHLRIGPYVCAEWNFGGFPLWLHFIPGIQFRTDNEPFKAEMQRF 154
DL +F+KL E GLYA +R PY+CAEW FGGFP WL PG + R++N + + +
Sbjct: 71 LDLERFLKLAQELGLYAIVRPSPYICAEWEFGGFPAWLLNEPG-RMRSNNPTYLKHVAEY 129
Query: 155 TAKIVDMMKQEKLYASQGGPIILSQIENEYGNI--DSAYGAAGKSYIKWAAGMALSLDTG 212
+++ + +L + GG I++ QIENEYG+ + AY A + + A +
Sbjct: 130 YDVLMEKIVPHQL--ANGGNILMIQIENEYGSFGEEKAYLRAIRDLMIARGVTAPFFTSD 187
Query: 213 VPW--VMCQQSDAPDPIINTCN---------GFYCDQFTPNSNNKPKMWTENWSGWFLSF 261
PW + S D I+ T N G F + P M E W GWF +
Sbjct: 188 GPWRATLRAGSMIEDDILVTGNFGSKAKENFGMMQAFFEEHGKKWPLMCMEFWDGWFNRW 247
Query: 262 GGAVPYRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNFDRTSGG--------PFISTSYD 313
+ R ++LA +V G N YM+HGGTNF +G P I TSYD
Sbjct: 248 KEPIIKRDPQELAESVREALALGSI--NLYMFHGGTNFGFMNGCSARGTIDLPQI-TSYD 304
Query: 314 YDAPLDEYGLIRQPKWGHLKDLHK 337
YDAPLDE G + + K LH+
Sbjct: 305 YDAPLDEQGNPTEKYFALQKMLHE 328
>gi|307272985|ref|ZP_07554232.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX0855]
gi|306510599|gb|EFM79622.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX0855]
Length = 604
Score = 167 bits (423), Expect = 2e-38, Method: Compositional matrix adjust.
Identities = 120/342 (35%), Positives = 167/342 (48%), Gaps = 29/342 (8%)
Query: 19 ATTSFGANVTYDH--RAVVIGGKRRVLISGSIHYPRSTPEMWPDLIQKSKDGGLDVIETY 76
A T G NV ++ G+ ++SG+IHY R P W + K G + +ETY
Sbjct: 3 AFTRKGGNVERFEIKEEFLLNGQPFKILSGAIHYFRVDPSDWHHSLYNLKALGFNTVETY 62
Query: 77 VFWNLHEPVRNQYNFEGRYDLVKFVKLVAEAGLYAHLRIGPYVCAEWNFGGFPLWLHFIP 136
V WNLHEP + ++FEG DL +F+KL E GLYA +R PY+CAEW FGGFP WL P
Sbjct: 63 VPWNLHEPQKGTFHFEGILDLERFLKLAQELGLYAIVRPSPYICAEWEFGGFPAWLLNEP 122
Query: 137 GIQFRTDNEPFKAEMQRFTAKIVDMMKQEKLYASQGGPIILSQIENEYGNI--DSAYGAA 194
G + R++N + + + +++ + +L GG I++ QIENEYG+ + AY A
Sbjct: 123 G-RMRSNNPTYLKHVAEYYDVLMEKIVPHQL--VNGGNILMIQIENEYGSFGEEKAYLRA 179
Query: 195 GKSYIKWAAGMALSLDTGVPW--VMCQQSDAPDPIINTCN---------GFYCDQFTPNS 243
+ + A + PW + S D I+ T N G F +
Sbjct: 180 IRDLMIARGVTAPFFTSDGPWRATLRAGSMIEDDILVTGNFGSKAKENFGMMQAFFEEHG 239
Query: 244 NNKPKMWTENWSGWFLSFGGAVPYRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNFDRTS 303
P M E W GWF + + R ++LA +V G N YM+HGGTNF +
Sbjct: 240 KKWPLMCMEFWDGWFNRWKEPIIKRDPQELAESVREALALGSI--NLYMFHGGTNFGFMN 297
Query: 304 GG--------PFISTSYDYDAPLDEYGLIRQPKWGHLKDLHK 337
G P I TSYDYDAPLDE G + + K LH+
Sbjct: 298 GCSARGTIDLPQI-TSYDYDAPLDEQGNPTEKYFALQKMLHE 338
>gi|395520729|ref|XP_003764476.1| PREDICTED: beta-galactosidase-1-like protein 2 [Sarcophilus
harrisii]
Length = 704
Score = 167 bits (423), Expect = 3e-38, Method: Compositional matrix adjust.
Identities = 116/324 (35%), Positives = 165/324 (50%), Gaps = 31/324 (9%)
Query: 35 VIGGKRRVLISGSIHYPRSTPEMWPDLIQKSKDGGLDVIETYVFWNLHEPVRNQYNFEGR 94
++ G + GSIHY R E W D + K K GL+ + TY+ WNLHEP R ++NF G
Sbjct: 123 LLEGSHFQIFGGSIHYFRVPREYWRDRLLKLKACGLNTLTTYIPWNLHEPERGKFNFSGN 182
Query: 95 YDLVKFVKLVAEAGLYAHLRIGPYVCAEWNFGGFPLWLHFIPGIQFRTDNEPFKAEMQRF 154
D+ FV++ A+ GL+ LR GPY+C+EW+ GG P WL ++ RT F + R+
Sbjct: 183 LDVEAFVQMAADIGLWVILRPGPYICSEWDLGGLPSWLLQDSSMELRTTYAGFLKAVDRY 242
Query: 155 TAKIVDMMKQEKLYASQGGPIILSQIENEYGNIDSAYGAAGKSYIKWAAGMALSLDTGVP 214
++ + L QGGPII Q+ENEYG+ D + YIK A M+ ++
Sbjct: 243 FNHLIP--RVVPLQYKQGGPIIAVQVENEYGSYDK--DSNYMPYIK-KALMSRGINE--- 294
Query: 215 WVMCQQSDAPD--------PIINTCNGFYCDQFTPN-----SNNKPKMWTENWSGWFLSF 261
+ SD D ++ T N + D N NKP M TE W+GWF ++
Sbjct: 295 --LLMTSDNKDGLSGGYLEGVLATVNLKHVDSMIFNYLHSFQENKPTMVTEYWTGWFDTW 352
Query: 262 GGAVPYRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNFDRTSG----GPFIS--TSYDYD 315
GG +D+ V+ Q G + N YM+HGGTNF +G G +++ TSYDYD
Sbjct: 353 GGPHNIVDADDVVVTVSSIIQMGASL-NLYMFHGGTNFGFMNGAQHFGEYLADVTSYDYD 411
Query: 316 APLDEYGLIRQPKWGHLKDLHKAI 339
A L E G PK+ L++ I
Sbjct: 412 AILTEAG-DYTPKFFKLREFFSTI 434
>gi|422722062|ref|ZP_16778639.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX2137]
gi|424672983|ref|ZP_18109926.1| putative beta-galactosidase [Enterococcus faecalis 599]
gi|315027959|gb|EFT39891.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX2137]
gi|402352793|gb|EJU87629.1| putative beta-galactosidase [Enterococcus faecalis 599]
Length = 604
Score = 167 bits (423), Expect = 3e-38, Method: Compositional matrix adjust.
Identities = 115/324 (35%), Positives = 163/324 (50%), Gaps = 27/324 (8%)
Query: 35 VIGGKRRVLISGSIHYPRSTPEMWPDLIQKSKDGGLDVIETYVFWNLHEPVRNQYNFEGR 94
++ G+ ++SG+IHY R P W + K G + +ETYV WNLHEP + ++FEG
Sbjct: 21 LLNGQPFKILSGAIHYFRVDPSDWYHSLYNLKALGFNTVETYVPWNLHEPQKGTFHFEGI 80
Query: 95 YDLVKFVKLVAEAGLYAHLRIGPYVCAEWNFGGFPLWLHFIPGIQFRTDNEPFKAEMQRF 154
DL +F+KL E GLYA +R PY+CAEW FGGFP WL PG + R++N + + +
Sbjct: 81 LDLERFLKLAQELGLYAIVRPSPYICAEWEFGGFPAWLLNEPG-RMRSNNPTYLKHVAEY 139
Query: 155 TAKIVDMMKQEKLYASQGGPIILSQIENEYGNI--DSAYGAAGKSYIKWAAGMALSLDTG 212
+++ + +L + GG I++ QIENEYG+ + AY A + + A +
Sbjct: 140 YDVLMEKIVPHQL--ANGGNILMIQIENEYGSFGEEKAYLRAIRDLMIARGVTAPFFTSD 197
Query: 213 VPW--VMCQQSDAPDPIINTCN---------GFYCDQFTPNSNNKPKMWTENWSGWFLSF 261
PW + S D I+ T N G F + P M E W GWF +
Sbjct: 198 GPWRATLRAGSMIEDDILVTGNFGSKAKENFGMMQAFFEEHGKKWPLMCMEFWDGWFNRW 257
Query: 262 GGAVPYRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNFDRTSGG--------PFISTSYD 313
+ R ++LA +V G N YM+HGGTNF +G P I TSYD
Sbjct: 258 KEPIIKRDPQELAESVREALALGSI--NLYMFHGGTNFGFMNGCSARGTIDLPQI-TSYD 314
Query: 314 YDAPLDEYGLIRQPKWGHLKDLHK 337
YDAPLDE G + + K LH+
Sbjct: 315 YDAPLDEQGNPTEKYFALQKMLHE 338
>gi|348573621|ref|XP_003472589.1| PREDICTED: LOW QUALITY PROTEIN: beta-galactosidase-1-like protein
3-like [Cavia porcellus]
Length = 679
Score = 167 bits (422), Expect = 3e-38, Method: Compositional matrix adjust.
Identities = 121/360 (33%), Positives = 169/360 (46%), Gaps = 40/360 (11%)
Query: 25 ANVTYDHRAVVIGGKRRVLISGSIHYPRSTPEMWPDLIQKSKDGGLDVIETYVFWNLHEP 84
A+ T + G + ++ GSIHY R E W D + K K G + + TY+ WNLHEP
Sbjct: 93 ASTTKGRAHFTLEGHKFLIFGGSIHYFRVPREYWRDRLLKLKACGFNTVTTYIPWNLHEP 152
Query: 85 VRNQYNFEGRYDLVKFVKLVAEAGLYAHLRIGPYVCAEWNFGGFPLWLHFIPGIQFRTDN 144
R ++ F G DL FV L AE GL+ LR GPY+CAE + GG P WL P Q RT
Sbjct: 153 QRGKFVFSGNLDLEAFVLLAAEIGLWVILRPGPYICAEIDLGGLPSWLLQNPKTQLRTTE 212
Query: 145 EPFKAEMQRFTAKIVDMMKQEKLYASQGGPIILSQIENEYG--NIDSAYGAAGKSYIKWA 202
F + + + M + L GGP+I Q+ENEYG N D Y A Y+K A
Sbjct: 213 RTFVDAVDAYFDHL--MRRMVPLQYHHGGPVIAVQVENEYGSFNRDGQYMA----YLKEA 266
Query: 203 AGMALSLDTGVPWVMCQQSDAPDPIINTCNG--------------FYCDQFTPNSNNKPK 248
L G+ ++ D + + G FY Q ++KP
Sbjct: 267 L-----LKRGIVELLFTCDYYKDVVNGSLKGVLATVNLGSLGKNSFY--QLLQVQSHKPI 319
Query: 249 MWTENWSGWFLSFGGAVPYRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNFDR------T 302
+ E W GW+ S+G + ++A V+ F + G +F N YM+HGGTNF
Sbjct: 320 LIMEYWVGWYDSWGLPHANKSAAEVAHTVSTFIKNGISF-NVYMFHGGTNFGFINAAGIV 378
Query: 303 SGGPFISTSYDYDAPLDEYGLIRQPKWGHLKDL---HKAIKLCEAALVATDPTYPSLGPN 359
G ++TSYDYDA L E G + K+ L++L A+ L + YPS+ P+
Sbjct: 379 EGRRSVTTSYDYDAVLSEAGDYTE-KYFKLRELLGSFSAVPLPHLPEITPKTVYPSVKPS 437
>gi|281422858|ref|ZP_06253857.1| beta-galactosidase [Prevotella copri DSM 18205]
gi|281403124|gb|EFB33804.1| beta-galactosidase [Prevotella copri DSM 18205]
Length = 788
Score = 167 bits (422), Expect = 3e-38, Method: Compositional matrix adjust.
Identities = 108/327 (33%), Positives = 163/327 (49%), Gaps = 21/327 (6%)
Query: 24 GANVTYDHRAVVIGGKRRVLISGSIHYPRSTPEMWPDLIQKSKDGGLDVIETYVFWNLHE 83
G T + ++ GK V+ + +HYPR W I+ K G++ + YVFWN+HE
Sbjct: 29 GGTFTTGDKTFLLNGKPFVVKAAELHYPRIPRAYWEHRIKMCKALGMNTVCLYVFWNIHE 88
Query: 84 PVRNQYNFEGRYDLVKFVKLVAEAGLYAHLRIGPYVCAEWNFGGFPLWLHFIPGIQFRTD 143
+++F G D+ F +L + G+Y +R GPYVCAEW GG P WL I+ R
Sbjct: 89 QEEGKFDFTGNNDVAAFCRLAQKNGMYVIVRPGPYVCAEWEMGGLPWWLLKKKDIRLREQ 148
Query: 144 NEPFKAEMQRFTAKIVDMMKQEKLYASQGGPIILSQIENEYGNI--DSAYGAAGKSYIKW 201
+ F ++ F ++ + L GGPII+ Q+ENEYG+ D Y +A + ++
Sbjct: 149 DPYFMQRVEIFEKEVGKQLA--PLTIQNGGPIIMVQVENEYGSYGKDKPYVSAIRDIVRK 206
Query: 202 AAGMALSLDTGVPWVMCQQSDAPDPIINTCN---GFYCDQ----FTPNSNNKPKMWTENW 254
+ +SL W ++ D + T N G DQ N PKM +E W
Sbjct: 207 SGFDKVSL-FQCDWSSNFLNNGLDDLTWTMNFGTGANIDQQFKRLGEVRPNAPKMCSEFW 265
Query: 255 SGWFLSFGGAVPYRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNFDRTSGG------PFI 308
SGWF +G RP +D+ + +G +F + YM HGGT+F +G P +
Sbjct: 266 SGWFDKWGARHETRPAKDMVEGMDEMLSKGISF-SLYMTHGGTSFGHWAGANSPGFQPDV 324
Query: 309 STSYDYDAPLDEYGLIRQPKWGHLKDL 335
TSYDYDAP++E+GL PK+ L+ +
Sbjct: 325 -TSYDYDAPINEWGLA-TPKFYELQKM 349
>gi|257087085|ref|ZP_05581446.1| beta-galactosidase [Enterococcus faecalis D6]
gi|256995115|gb|EEU82417.1| beta-galactosidase [Enterococcus faecalis D6]
Length = 594
Score = 167 bits (422), Expect = 3e-38, Method: Compositional matrix adjust.
Identities = 115/324 (35%), Positives = 163/324 (50%), Gaps = 27/324 (8%)
Query: 35 VIGGKRRVLISGSIHYPRSTPEMWPDLIQKSKDGGLDVIETYVFWNLHEPVRNQYNFEGR 94
++ G+ ++SG+IHY R P W + K G + +ETYV WNLHEP + ++FEG
Sbjct: 11 LLNGQPFKILSGAIHYFRVDPSDWYHSLYNLKALGFNTVETYVPWNLHEPQKGTFHFEGI 70
Query: 95 YDLVKFVKLVAEAGLYAHLRIGPYVCAEWNFGGFPLWLHFIPGIQFRTDNEPFKAEMQRF 154
DL +F+KL E GLYA +R PY+CAEW FGGFP WL PG + R++N + + +
Sbjct: 71 LDLERFLKLAQELGLYAIVRPSPYICAEWEFGGFPAWLLNEPG-RMRSNNPTYLKHVAEY 129
Query: 155 TAKIVDMMKQEKLYASQGGPIILSQIENEYGNI--DSAYGAAGKSYIKWAAGMALSLDTG 212
+++ + +L + GG I++ QIENEYG+ + AY A + + A +
Sbjct: 130 YDVLMEKIVPHQL--ANGGNILMIQIENEYGSFGEEKAYLRAIRDLMIARGVTAPFFTSD 187
Query: 213 VPW--VMCQQSDAPDPIINTCN---------GFYCDQFTPNSNNKPKMWTENWSGWFLSF 261
PW + S D I+ T N G F + P M E W GWF +
Sbjct: 188 GPWRATLRAGSMIEDDILVTGNFGSKAKENFGMMQAFFEEHGKKWPLMCMEFWDGWFNRW 247
Query: 262 GGAVPYRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNFDRTSGG--------PFISTSYD 313
+ R ++LA +V G N YM+HGGTNF +G P I TSYD
Sbjct: 248 KEPIIKRDPQELAESVREALALGSI--NLYMFHGGTNFGFMNGCSARGTIDLPQI-TSYD 304
Query: 314 YDAPLDEYGLIRQPKWGHLKDLHK 337
YDAPLDE G + + K LH+
Sbjct: 305 YDAPLDEQGNPTEKYFALQKMLHE 328
>gi|228950355|ref|ZP_04112522.1| Beta-galactosidase [Bacillus thuringiensis serovar monterrey BGSC
4AJ1]
gi|228809313|gb|EEM55767.1| Beta-galactosidase [Bacillus thuringiensis serovar monterrey BGSC
4AJ1]
Length = 591
Score = 167 bits (422), Expect = 3e-38, Method: Compositional matrix adjust.
Identities = 111/317 (35%), Positives = 156/317 (49%), Gaps = 36/317 (11%)
Query: 32 RAVVIGGKRRVLISGSIHYPRSTPEMWPDLIQKSKDGGLDVIETYVFWNLHEPVRNQYNF 91
+ ++ G+ +ISG++HY R PE W + K G + +ETYV WN+HEP +NF
Sbjct: 8 KDFMLDGEPIKIISGALHYFRIVPEYWDHSLYNLKALGCNTVETYVPWNIHEPKEGVFNF 67
Query: 92 EGRYDLVKFVKLVAEAGLYAHLRIGPYVCAEWNFGGFPLWLHFIPGIQFRTDNEPFKAEM 151
EG DLVK+V+L + GL LR PY+CAEW FGG P WL I+ R++ F ++
Sbjct: 68 EGIADLVKYVQLAQKYGLMVILRPTPYICAEWEFGGLPAWLLKYKDIRVRSNTNLFLDKV 127
Query: 152 QRFTAKIVDMMKQEKLYASQGGPIILSQIENEYGNIDSAYGAAGKSYIKWAAGMALSLDT 211
+ F ++ M+ L GGPII+ Q+ENEYG+ K Y++ + LD
Sbjct: 128 ENFYKVLLPMVT--PLQVENGGPIIMMQVENEYGSF-----GNDKEYVRSIKKIMRDLDV 180
Query: 212 GVP-------WVMCQQSDA--PDPIINTCN-GFYCDQ--------FTPNSNNKPKMWTEN 253
VP W +S + D ++ T N G ++ N P M E
Sbjct: 181 TVPLFTSDGAWQEALESGSLIDDDVLVTGNFGSRSNENLNELESFIKENKKEWPLMCMEF 240
Query: 254 WSGWFLSFGGAVPYRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNFDRTSGG-------- 305
W GWF +G + R +LA V +R N+YM+ GGTNF +G
Sbjct: 241 WDGWFNRWGMEIIRRDGSELAEEVKELLKRASI--NFYMFQGGTNFGFMNGCSSRENVDL 298
Query: 306 PFISTSYDYDAPLDEYG 322
P I TSYDYDA L E+G
Sbjct: 299 PQI-TSYDYDALLTEWG 314
Score = 54.3 bits (129), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 41/154 (26%), Positives = 73/154 (47%), Gaps = 20/154 (12%)
Query: 511 LHAFINGKLVGSGYGSSSNAKVTVDFPIALAPGKNTFDLLSLTVGLQNYGAFYEKTGAGI 570
+H F+N +L+ + Y +V++D L +NT D+L +G NYGA +
Sbjct: 411 VHMFLNEQLIDTQYRDEIGREVSLD----LTKEENTLDILVENMGRVNYGA-------RL 459
Query: 571 TGPVQLKGSGNGTNIDLSSQQWTYQTGLKGEELNFPSGSSTQWDSKSTLPKLQPLVWYKT 630
P Q KG +G ID+ Q L+ + L+ + QW+ + +Y+
Sbjct: 460 LSPTQRKGISSGVMIDIHLQSNWEHYALEFDNLD-EIDFNGQWEPNTP-------SFYEY 511
Query: 631 TFDAPAGSEPVAIDFTGMGKGEAWVNGQSIGRYW 664
TF+ ++ +D + +GKG +NG ++G+YW
Sbjct: 512 TFNVQELNDTF-LDCSKLGKGFVVLNGFNLGKYW 544
>gi|300789308|ref|YP_003769599.1| beta-galactosidase [Amycolatopsis mediterranei U32]
gi|384152800|ref|YP_005535616.1| beta-galactosidase [Amycolatopsis mediterranei S699]
gi|399541188|ref|YP_006553850.1| beta-galactosidase [Amycolatopsis mediterranei S699]
gi|299798822|gb|ADJ49197.1| beta-galactosidase [Amycolatopsis mediterranei U32]
gi|340530954|gb|AEK46159.1| beta-galactosidase [Amycolatopsis mediterranei S699]
gi|398321958|gb|AFO80905.1| beta-galactosidase [Amycolatopsis mediterranei S699]
Length = 584
Score = 167 bits (422), Expect = 3e-38, Method: Compositional matrix adjust.
Identities = 108/320 (33%), Positives = 161/320 (50%), Gaps = 29/320 (9%)
Query: 35 VIGGKRRVLISGSIHYPRSTPEMWPDLIQKSKDGGLDVIETYVFWNLHEPVRNQYNFEGR 94
++ G+ ++SG++HY R P++W D I K++ GL+ IETYV WN H P ++ G
Sbjct: 12 LLDGRPFRILSGALHYFRVHPDLWADRIDKARRMGLNTIETYVAWNAHAPEPGTFDLSGG 71
Query: 95 YDLVKFVKLVAEAGLYAHLRIGPYVCAEWNFGGFPLWLHFIPGIQFRTDNEPFKAEMQRF 154
DL +F++LVA+AG+YA +R GPY+CAEW+ GG P WL P + R + ++ +
Sbjct: 72 LDLDRFLRLVADAGMYAIVRPGPYICAEWDNGGLPAWLFRDPSVGVRRYEPKYLDAVREY 131
Query: 155 TAKIVDMMKQEKLYASQGGPIILSQIENEYGNIDSAYGAAGKSYIKWAAGMALSLDTGVP 214
K+ +++ ++ +GGP++L Q+ENEYG A+G K Y+K A VP
Sbjct: 132 LTKVYEVVVPHQI--DRGGPVLLVQVENEYG----AFG-DDKRYLKALAEHTREAGVTVP 184
Query: 215 WVMCQQSDAPDPIINTCNGFYCDQ------------FTPNSNNKPKMWTENWSGWFLSFG 262
Q + +G + + P M +E W+GWF +G
Sbjct: 185 LTTVDQPTPEMLEAGSLDGLHRTASFGSGAEARLAILRAHQPTGPLMCSEFWNGWFDHWG 244
Query: 263 GAVPYRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNFDRTSGG-------PFISTSYDYD 315
D A + G + N YM+HGGTNF T+G P I TSYDYD
Sbjct: 245 AHHHTTSAADSAAELDALLAAGASV-NLYMFHGGTNFGLTNGANDKGVYQPLI-TSYDYD 302
Query: 316 APLDEYGLIRQPKWGHLKDL 335
APLDE G PK+ +D+
Sbjct: 303 APLDEAG-DPTPKYHAFRDV 321
>gi|257084951|ref|ZP_05579312.1| beta-galactosidase [Enterococcus faecalis Fly1]
gi|256992981|gb|EEU80283.1| beta-galactosidase [Enterococcus faecalis Fly1]
Length = 594
Score = 167 bits (422), Expect = 3e-38, Method: Compositional matrix adjust.
Identities = 115/324 (35%), Positives = 163/324 (50%), Gaps = 27/324 (8%)
Query: 35 VIGGKRRVLISGSIHYPRSTPEMWPDLIQKSKDGGLDVIETYVFWNLHEPVRNQYNFEGR 94
++ G+ ++SG+IHY R P W + K G + +ETYV WNLHEP + ++FEG
Sbjct: 11 LLNGQPFKILSGAIHYFRVDPSDWYHSLYNLKALGFNTVETYVPWNLHEPQKGTFHFEGI 70
Query: 95 YDLVKFVKLVAEAGLYAHLRIGPYVCAEWNFGGFPLWLHFIPGIQFRTDNEPFKAEMQRF 154
DL +F+KL E GLYA +R PY+CAEW FGGFP WL PG + R++N + + +
Sbjct: 71 LDLERFLKLAQELGLYAIVRPSPYICAEWEFGGFPAWLLNEPG-RMRSNNPTYLKHVAEY 129
Query: 155 TAKIVDMMKQEKLYASQGGPIILSQIENEYGNI--DSAYGAAGKSYIKWAAGMALSLDTG 212
+++ + +L + GG I++ QIENEYG+ + AY A + + A +
Sbjct: 130 YDVLMEKIVPHQL--ANGGNILMIQIENEYGSFGEEKAYLRAIRDLMIARGVTAPFFTSD 187
Query: 213 VPW--VMCQQSDAPDPIINTCN---------GFYCDQFTPNSNNKPKMWTENWSGWFLSF 261
PW + S D I+ T N G F + P M E W GWF +
Sbjct: 188 GPWRATLRAGSMIEDDILVTGNFGSKAKENFGMMQAFFEEHGKKWPLMCMEFWDGWFNRW 247
Query: 262 GGAVPYRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNFDRTSGG--------PFISTSYD 313
+ R ++LA +V G N YM+HGGTNF +G P I TSYD
Sbjct: 248 KEPIIKRDPQELAESVREALALGSI--NLYMFHGGTNFGFMNGCSARGTIDLPQI-TSYD 304
Query: 314 YDAPLDEYGLIRQPKWGHLKDLHK 337
YDAPLDE G + + K LH+
Sbjct: 305 YDAPLDEQGNPTEKYFALQKMLHE 328
>gi|288926246|ref|ZP_06420171.1| beta-galactosidase (Lactase) [Prevotella buccae D17]
gi|288336937|gb|EFC75298.1| beta-galactosidase (Lactase) [Prevotella buccae D17]
Length = 791
Score = 167 bits (422), Expect = 3e-38, Method: Compositional matrix adjust.
Identities = 107/327 (32%), Positives = 164/327 (50%), Gaps = 25/327 (7%)
Query: 24 GANVTYDHRAVVIGGKRRVLISGSIHYPRSTPEMWPDLIQKSKDGGLDVIETYVFWNLHE 83
G T + ++ GK V+ + +HYPR W I+ K G++ + YVFWN+HE
Sbjct: 31 GGTFTVGDKTFLLNGKPFVVKAAELHYPRIPRPYWEHRIKMCKALGMNTVCLYVFWNIHE 90
Query: 84 PVRNQYNFEGRYDLVKFVKLVAEAGLYAHLRIGPYVCAEWNFGGFPLWLHFIPGIQFRTD 143
+++F D+ +F +L GLY +R GPYVCAEW GG P WL I+ R
Sbjct: 91 QQEGKFDFTDNNDVAEFCRLAQRNGLYVIVRPGPYVCAEWEMGGLPWWLLKKKDIRLREP 150
Query: 144 NEPFKAEMQRFTAKIVDMMKQEKLYASQGGPIILSQIENEYGNI--DSAYGAAGKSYIKW 201
+ F ++ F K+ + + L GGPII+ Q+ENEYG+ + AY +A + ++
Sbjct: 151 DPYFMERVKLFERKVGEQLAS--LTIQNGGPIIMVQVENEYGSYGENKAYVSAIRDIVRQ 208
Query: 202 AAGMALSLDTGVPWVMCQQSDAPDPIINTCN---GFYCDQ-------FTPNSNNKPKMWT 251
+ ++L W + + D ++ T N G DQ PN+ P+M +
Sbjct: 209 SGFDKVTL-FQCDWASNFEKNGLDDLVWTMNFGTGADIDQQFRRLGELRPNA---PQMCS 264
Query: 252 ENWSGWFLSFGGAVPYRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNFDRTSGG--PFIS 309
E WSGWF +G RP + + + +G +F + YM HGGT+F +G P +
Sbjct: 265 EFWSGWFDKWGARHETRPAKTMVEGIDEMLSKGISF-SLYMTHGGTSFGHWAGANSPGFA 323
Query: 310 ---TSYDYDAPLDEYGLIRQPKWGHLK 333
TSYDYDAP++EYG PK+ L+
Sbjct: 324 PDVTSYDYDAPINEYGQA-TPKYWELR 349
>gi|223982755|ref|ZP_03632983.1| hypothetical protein HOLDEFILI_00257 [Holdemania filiformis DSM
12042]
gi|223965255|gb|EEF69539.1| hypothetical protein HOLDEFILI_00257 [Holdemania filiformis DSM
12042]
Length = 592
Score = 167 bits (422), Expect = 3e-38, Method: Compositional matrix adjust.
Identities = 103/308 (33%), Positives = 165/308 (53%), Gaps = 25/308 (8%)
Query: 35 VIGGKRRVLISGSIHYPRSTPEMWPDLIQKSKDGGLDVIETYVFWNLHEPVRNQYNFEGR 94
++ G+ LISG++HY R PE W D ++K K+ G + +ETY+ WN HEP + Q++F GR
Sbjct: 11 MLDGQPVKLISGALHYFRIVPEYWQDRLEKLKNMGCNCVETYIPWNYHEPKKGQFDFSGR 70
Query: 95 YDLVKFVKLVAEAGLYAHLRIGPYVCAEWNFGGFPLWLHFIPGIQFRTDNEPFKAEMQRF 154
D+ +FV+ GL+ LR PY+CAEW FGG P WL ++ R+ +P+ + +
Sbjct: 71 KDVARFVRKAQALGLWVILRPTPYICAEWEFGGLPAWLLADDSMRVRSTYQPYLDAVDAY 130
Query: 155 TAKIVDMMKQEKLYASQGGPIILSQIENEYGNI--DSAYGAAGKSYIKWAAGMALSL--- 209
A++ +++ L+ + GGP+++ QIENEYG+ D Y A K ++ G + +
Sbjct: 131 YAELFKVIR--PLFFTHGGPVLMCQIENEYGSFGNDKQYLKAIKRLME-KHGCDVPMFTS 187
Query: 210 DTGVPWVMCQQSDAPDPIINTCN-GFYCDQ--------FTPNSNNKPKMWTENWSGWFLS 260
D G V+ + + ++ T N G D+ N + P M E W GWF +
Sbjct: 188 DGGWREVLDAGTLLNEGVLPTANFGSRTDEQIGALRQFMNDNDIHGPLMCMEFWIGWFNN 247
Query: 261 FGGAVPYRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNFDRTSGGPFIS------TSYDY 314
+G + R ++ A + ++G N YM+HGGTN + +G + + TSYDY
Sbjct: 248 WGSPLKTRDAKEAADELDAMLRQGSV--NIYMFHGGTNPEFYNGCSYHNGMDPQITSYDY 305
Query: 315 DAPLDEYG 322
APL E+G
Sbjct: 306 AAPLTEWG 313
Score = 54.3 bits (129), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 55/215 (25%), Positives = 87/215 (40%), Gaps = 56/215 (26%)
Query: 512 HAFINGKLVGSGYGSS--SNAKVTVDFPIALAPGKNTFDLLSLTVGLQNYGAFYEKTGAG 569
F+N KL+ + Y + SN +T+D P N D+L +G NYGA
Sbjct: 411 QVFVNQKLIATQYKETMGSNIPLTLDHPT-----DNVIDILVENLGRINYGA-------S 458
Query: 570 ITGPVQLKGSGNGTNIDLSSQQWTYQTGLKGEELNFPSGSSTQW--DSKSTLPKLQPLVW 627
+ P Q KG G +DL + TG + L + + + + +P +
Sbjct: 459 LVSPHQRKGIKGGFMLDLH-----FHTGWQQYCLELDNVDQVDFTGEYQEGVP-----AF 508
Query: 628 YKTTFDAPAGSEPVAIDFTGMGKGEAWVNGQSIGRYWPTYVSQNGGCTDSCNYRGAYSSN 687
Y+ T D ++ ++ G GKG A++NG+++GR+W
Sbjct: 509 YQFTVDIEEPAD-TFLNLNGWGKGAAFLNGENLGRFWEL--------------------- 546
Query: 688 KCLKNCGKPSQSLYHVPRSWLKSSGNTLVLFEEIG 722
P+ LY +P LK NT+VLFE G
Sbjct: 547 -------GPTHYLY-IPAPLLKKGKNTIVLFETEG 573
>gi|354585216|ref|ZP_09004105.1| glycoside hydrolase family 35 [Paenibacillus lactis 154]
gi|353188942|gb|EHB54457.1| glycoside hydrolase family 35 [Paenibacillus lactis 154]
Length = 619
Score = 167 bits (422), Expect = 3e-38, Method: Compositional matrix adjust.
Identities = 108/319 (33%), Positives = 167/319 (52%), Gaps = 36/319 (11%)
Query: 27 VTYDHRAVVIGGKRRVLISGSIHYPRSTPEMWPDLIQKSKDGGLDVIETYVFWNLHEPVR 86
+T+ + ++ G+ +ISG++HY R PE W D + K K G + +ETY+ WN+HEP
Sbjct: 4 LTWKNGQYLLDGQPYRIISGAVHYFRVVPEYWEDRLLKLKACGFNTVETYIAWNVHEPTE 63
Query: 87 NQYNFEGRYDLVKFVKLVAEAGLYAHLRIGPYVCAEWNFGGFPLWLHFIPGIQFRTDNEP 146
++NF G D+ F++L + GL+ +R P++CAEW FGG P WL I+ R +
Sbjct: 64 GEFNFSGMADVGSFIELAGKLGLHVIVRPSPFICAEWEFGGLPGWLLGYGEIRLRCSDPL 123
Query: 147 FKAEMQRFTAKIVDMMKQEKLYASQGGPIILSQIENEYGNI--DSAYGAAGKSYIKWAAG 204
+ +++ + +++ M L +S GGPI+ Q+ENEYG+ D AY Y++ AG
Sbjct: 124 YLSKVDHYYDELIPRMV--PLLSSNGGPILAVQVENEYGSYGNDHAY----LEYLR--AG 175
Query: 205 MALSLDTGVPWVMCQQSDAP----------DPIINTCN-GFYCDQ----FTPNSNNKPKM 249
+ + GV V+ SD P D + T N G ++ + ++P M
Sbjct: 176 L---VRRGVD-VLLFTSDGPTDEMLLGGSIDHVHATVNFGSRVEESFGKYREYRTDEPLM 231
Query: 250 WTENWSGWFLSFGGAVPYRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNFDRTSGGPFI- 308
E W+GWF + R D+A + ++G + N YM+HGGTNF SG I
Sbjct: 232 VMEFWNGWFDHWMEDHHVRDAADVAGVLDEMLEKGSSI-NMYMFHGGTNFGFYSGANHIK 290
Query: 309 -----STSYDYDAPLDEYG 322
+TSYDYDAPL E+G
Sbjct: 291 TYEPTTTSYDYDAPLTEWG 309
>gi|313231409|emb|CBY08524.1| unnamed protein product [Oikopleura dioica]
Length = 493
Score = 167 bits (422), Expect = 3e-38, Method: Compositional matrix adjust.
Identities = 108/321 (33%), Positives = 150/321 (46%), Gaps = 34/321 (10%)
Query: 33 AVVIGGKRRVLISGSIHYPRSTPEMWPDLIQKSKDGGLDVIETYVFWNLHEPVRNQYNFE 92
A + G++ L+SGSIHY R E W D + K K GL+ +E YV WNLHEP ++NF
Sbjct: 62 AFWLDGEKITLVSGSIHYFRVPNEYWLDRLTKLKYAGLNTVELYVSWNLHEPYSGEFNFS 121
Query: 93 GRYDLVKFVKLVAEAGLYAHLRIGPYVCAEWNFGGFPLWLHFIPGIQFRTDNEPFKAEMQ 152
G D+V+F+++ E GL+ R GPY+CAEW +GG P WL ++ RT + ++
Sbjct: 122 GDLDVVRFIEMAGELGLHVLFRPGPYICAEWEWGGHPYWLLHDTDMKVRTTYPGYLEAVE 181
Query: 153 RFTAKIVDMMKQEKLYASQGGPIILSQIENEYGNIDSAY--GAAGKSYIKW--------- 201
+F +++ + L GGPII QIENEY A+ G ++ W
Sbjct: 182 KFYSELFGRVNH--LMYRNGGPIIAVQIENEYAGFADAFEIGPLDPGFLTWLRQTIKDQQ 239
Query: 202 AAGMALSLDTGVPWVMCQQSDAP-----DPIINTCNGFYCDQFTPNSNNKPKMWTENWSG 256
+ + D G + + P D ++ ++ + N KPKM E WSG
Sbjct: 240 CEELLFTSDGGWDFYKYELEGDPYGLNFDDVLRA--NYWLNILENNQPGKPKMVMEWWSG 297
Query: 257 WFLSFGGAVPYRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNFDRTSGGPF--------- 307
WF F G D R NYYM+HGGTNF +G F
Sbjct: 298 WF-DFWGYHHQGTTADSFEENLRAILSQNASVNYYMFHGGTNFGYMNGANFNTNDQTNDL 356
Query: 308 ----ISTSYDYDAPLDEYGLI 324
+ TSYDYD PL E G I
Sbjct: 357 EYQPVVTSYDYDCPLSEEGRI 377
>gi|198433885|ref|XP_002127100.1| PREDICTED: similar to galactosidase, beta 1-like 2 [Ciona
intestinalis]
Length = 658
Score = 167 bits (422), Expect = 3e-38, Method: Compositional matrix adjust.
Identities = 110/310 (35%), Positives = 160/310 (51%), Gaps = 16/310 (5%)
Query: 25 ANVTYDHRAVVIGGKRRVLISGSIHYPRSTPEMWPDLIQKSKDGGLDVIETYVFWNLHEP 84
+ +T + + GK +ISG++HY R E W D + K K GL+ IETYV WNLHEP
Sbjct: 56 SGLTAQGKTFKLDGKPMTIISGAVHYFRMPREYWRDRLMKMKACGLNTIETYVPWNLHEP 115
Query: 85 VRNQYNFEGRYDLVKFVKLVAEAGLYAHLRIGPYVCAEWNFGGFPLWLHFIPGIQFRTDN 144
+ +YNF G DLV F+ L + Y LR GPY+C+EW FGG P WL P ++ RT
Sbjct: 116 IPGKYNFTGDLDLVHFILLAHKLEFYVLLRPGPYICSEWEFGGLPSWLLRDPKMKVRTMY 175
Query: 145 EPFKAEMQRFTAKIVDMMKQEKLYASQGGPIILSQIENEYGNI--DSAYGAAGKSYIKWA 202
P+ A + ++ ++ +K L GGPII Q++NEYG+ D+ Y K +++
Sbjct: 176 PPYIAAVTKYFNYLLPFVK--PLQYQYGGPIIAFQLDNEYGSYFKDADYLPYLKEFLQNK 233
Query: 203 AGMALSLDTGVPWVMCQQSDAPDPIINTCNGFYCDQFTPNSNNKPK---MWTENWSGWFL 259
+ L + + QQ+ P + + FT SN +P M E W+GWF
Sbjct: 234 GIIELLFISDSIEGLRQQT-IPGVLKTVNFKRMENHFTDLSNMQPDAPLMVMEFWTGWFD 292
Query: 260 SFGGAVPYRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNFDRTSGG-----PFIS--TSY 312
+G V++ + F +GG+ N+YM+ GGTNF +G F + TSY
Sbjct: 293 WWGEKHHILTVQEFGETLNEIFSQGGSV-NFYMFFGGTNFGFMNGAYKDGTGFHADITSY 351
Query: 313 DYDAPLDEYG 322
DYDA + E G
Sbjct: 352 DYDALIAENG 361
>gi|256964894|ref|ZP_05569065.1| beta-galactosidase [Enterococcus faecalis HIP11704]
gi|256955390|gb|EEU72022.1| beta-galactosidase [Enterococcus faecalis HIP11704]
Length = 594
Score = 167 bits (422), Expect = 3e-38, Method: Compositional matrix adjust.
Identities = 115/324 (35%), Positives = 162/324 (50%), Gaps = 27/324 (8%)
Query: 35 VIGGKRRVLISGSIHYPRSTPEMWPDLIQKSKDGGLDVIETYVFWNLHEPVRNQYNFEGR 94
++ G+ ++SG+IHY R P W + K G + +ETYV WNLHEP + ++FEG
Sbjct: 11 LLNGQPFKILSGAIHYFRVDPSDWHHSLYNLKALGFNTVETYVPWNLHEPQKGTFHFEGI 70
Query: 95 YDLVKFVKLVAEAGLYAHLRIGPYVCAEWNFGGFPLWLHFIPGIQFRTDNEPFKAEMQRF 154
DL +F+KL E GLYA +R PY+CAEW FGGFP WL PG + R++N + + +
Sbjct: 71 LDLERFLKLAQELGLYAIVRPSPYICAEWEFGGFPAWLLNEPG-RMRSNNPTYLKHVAEY 129
Query: 155 TAKIVDMMKQEKLYASQGGPIILSQIENEYGNI--DSAYGAAGKSYIKWAAGMALSLDTG 212
+++ + +L GG I++ QIENEYG+ + AY A + + A +
Sbjct: 130 YDVLMEKIVPHQL--VNGGNILMIQIENEYGSFGEEKAYLRAIRDLMIARGVTAPFFTSD 187
Query: 213 VPW--VMCQQSDAPDPIINTCN---------GFYCDQFTPNSNNKPKMWTENWSGWFLSF 261
PW + S D I+ T N G F + P M E W GWF +
Sbjct: 188 GPWRATLRAGSMIEDDILVTGNFGSKAKENFGMMQAFFEEHGKKWPLMCMEFWDGWFNRW 247
Query: 262 GGAVPYRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNFDRTSGG--------PFISTSYD 313
+ R ++LA +V G N YM+HGGTNF +G P I TSYD
Sbjct: 248 KEPIIKRDPQELAESVREALALGSI--NLYMFHGGTNFGFMNGCSARGTIDLPQI-TSYD 304
Query: 314 YDAPLDEYGLIRQPKWGHLKDLHK 337
YDAPLDE G + + K LH+
Sbjct: 305 YDAPLDEQGNPTEKYFALQKMLHE 328
>gi|313241555|emb|CBY33800.1| unnamed protein product [Oikopleura dioica]
Length = 571
Score = 166 bits (421), Expect = 3e-38, Method: Compositional matrix adjust.
Identities = 107/320 (33%), Positives = 158/320 (49%), Gaps = 25/320 (7%)
Query: 27 VTYDHRAVVIGGKRRVLISGSIHYPRSTPEMWPDLIQKSKDGGLDVIETYVFWNLHEPVR 86
+T D + GK ++SG+IHY R + W +Q D GL+ I+ Y+ WNLHE R
Sbjct: 8 LTADGDTFKLDGKDFRILSGAIHYFRIPKQSWKHRLQSVVDCGLNTIDVYIPWNLHEKER 67
Query: 87 NQYNFEGRYDLVKFVKLVAEAGLYAHLRIGPYVCAEWNFGGFPLWLHFIPGIQFRTDNEP 146
++F G DLV+F + AE GL R GPY+C+EW++GG P WL P + R++
Sbjct: 68 GNFDFGGELDLVEFFTIAAEMGLKVLCRPGPYICSEWDWGGLPSWLLKDPKMHIRSNYCG 127
Query: 147 FKAEMQRFTAKIVDMMKQEKLYASQGGPIILSQIENEYGNIDSAYGAAGKSYIKWAAGMA 206
++A + + +K++ ++ L S GGPII Q+ENEYG+ Y ++ W A +
Sbjct: 128 YQAAVSSYFSKLLPLLA--PLQHSNGGPIIAFQVENEYGD----YVDKDNEHLPWLADLM 181
Query: 207 LSLDTGVPWVMCQQSDAPDPIINTCNGFYCDQFTPNS-----NNKPKMWTENWSGWFLSF 261
S + + I N + TP S NKP + TE W+GWF +
Sbjct: 182 KSHGLFELFFISDGGHT----IRKANMLKLTKSTPISLKSLQPNKPMLVTEFWAGWFDYW 237
Query: 262 GGAVPYRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNFDRTSGGPFIS--------TSYD 313
G + + +RG + N+YM+HGGTNF +G + TSYD
Sbjct: 238 GHGRNLLNNDVFEKTLKEILKRGASV-NFYMFHGGTNFGFMNGAIELEKGYYTADVTSYD 296
Query: 314 YDAPLDEYGLIRQPKWGHLK 333
YD P+DE G R KW +K
Sbjct: 297 YDCPVDESG-NRTEKWEIIK 315
>gi|422701998|ref|ZP_16759838.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX1342]
gi|315169479|gb|EFU13496.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX1342]
Length = 604
Score = 166 bits (421), Expect = 4e-38, Method: Compositional matrix adjust.
Identities = 115/324 (35%), Positives = 163/324 (50%), Gaps = 27/324 (8%)
Query: 35 VIGGKRRVLISGSIHYPRSTPEMWPDLIQKSKDGGLDVIETYVFWNLHEPVRNQYNFEGR 94
++ G+ ++SG+IHY R P W + K G + +ETYV WNLHEP + ++FEG
Sbjct: 21 LLNGQPFKILSGAIHYFRVDPSDWYHSLYNLKALGFNTVETYVPWNLHEPQKGTFHFEGI 80
Query: 95 YDLVKFVKLVAEAGLYAHLRIGPYVCAEWNFGGFPLWLHFIPGIQFRTDNEPFKAEMQRF 154
DL +F+KL E GLYA +R PY+CAEW FGGFP WL PG + R++N + + +
Sbjct: 81 LDLERFLKLAQELGLYAIVRPSPYICAEWEFGGFPAWLLNEPG-RMRSNNPTYLKHVAEY 139
Query: 155 TAKIVDMMKQEKLYASQGGPIILSQIENEYGNI--DSAYGAAGKSYIKWAAGMALSLDTG 212
+++ + +L + GG I++ QIENEYG+ + AY A + + A +
Sbjct: 140 YDVLMEKIVPHQL--ANGGNILMIQIENEYGSFGEEKAYLRAIRDLMIARGVTAPFFTSD 197
Query: 213 VPW--VMCQQSDAPDPIINTCN---------GFYCDQFTPNSNNKPKMWTENWSGWFLSF 261
PW + S D I+ T N G F + P M E W GWF +
Sbjct: 198 GPWRATLRAGSMIEDDILVTGNFGSKAKENFGMMQVFFEEHGKKWPLMCMEFWDGWFNRW 257
Query: 262 GGAVPYRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNFDRTSGG--------PFISTSYD 313
+ R ++LA +V G N YM+HGGTNF +G P I TSYD
Sbjct: 258 KEPIIKRDPQELAESVREALALGSI--NLYMFHGGTNFGFMNGCSARGTIDLPQI-TSYD 314
Query: 314 YDAPLDEYGLIRQPKWGHLKDLHK 337
YDAPLDE G + + K LH+
Sbjct: 315 YDAPLDEQGNPTEKYFALQKMLHE 338
>gi|424759896|ref|ZP_18187551.1| putative beta-galactosidase [Enterococcus faecalis R508]
gi|402403967|gb|EJV36601.1| putative beta-galactosidase [Enterococcus faecalis R508]
Length = 604
Score = 166 bits (421), Expect = 4e-38, Method: Compositional matrix adjust.
Identities = 114/316 (36%), Positives = 159/316 (50%), Gaps = 27/316 (8%)
Query: 43 LISGSIHYPRSTPEMWPDLIQKSKDGGLDVIETYVFWNLHEPVRNQYNFEGRYDLVKFVK 102
++SG+IHY R P W + K G + +ETYV WNLHEP + ++FEG DL +F+K
Sbjct: 29 ILSGAIHYFRVDPSDWHHSLYNLKALGFNTVETYVPWNLHEPQKGTFHFEGILDLERFLK 88
Query: 103 LVAEAGLYAHLRIGPYVCAEWNFGGFPLWLHFIPGIQFRTDNEPFKAEMQRFTAKIVDMM 162
L E GLYA +R PY+CAEW FGGFP WL PG + R++N + + + +++ +
Sbjct: 89 LAQELGLYAIVRPSPYICAEWEFGGFPAWLLNEPG-RMRSNNPTYLKHVAEYYDVLMEKI 147
Query: 163 KQEKLYASQGGPIILSQIENEYGNI--DSAYGAAGKSYIKWAAGMALSLDTGVPW--VMC 218
+L + GG I++ QIENEYG+ + AY A + + A + PW +
Sbjct: 148 VPHQL--ANGGNILMIQIENEYGSFGEEKAYLRAIRDLMIARGVTAPFFTSDGPWRATLR 205
Query: 219 QQSDAPDPIINTCN---------GFYCDQFTPNSNNKPKMWTENWSGWFLSFGGAVPYRP 269
S D I+ T N G F + P M E W GWF + + R
Sbjct: 206 AGSMIEDDILVTGNFGSKAKENFGMMQAFFEEHGKKWPLMCMEFWDGWFNRWKEPIIKRD 265
Query: 270 VEDLAFAVARFFQRGGTFQNYYMYHGGTNFDRTSGG--------PFISTSYDYDAPLDEY 321
++LA +V G N YM+HGGTNF +G P I TSYDYDAPLDE
Sbjct: 266 PQELAESVREALALGSI--NLYMFHGGTNFGFMNGCSARGTIDLPQI-TSYDYDAPLDEQ 322
Query: 322 GLIRQPKWGHLKDLHK 337
G + + K LH+
Sbjct: 323 GNPTEKYFALQKMLHE 338
>gi|164519026|ref|NP_001073876.2| beta-galactosidase-1-like protein 3 [Homo sapiens]
gi|269849685|sp|Q8NCI6.3|GLBL3_HUMAN RecName: Full=Beta-galactosidase-1-like protein 3
Length = 653
Score = 166 bits (421), Expect = 4e-38, Method: Compositional matrix adjust.
Identities = 133/452 (29%), Positives = 202/452 (44%), Gaps = 36/452 (7%)
Query: 36 IGGKRRVLISGSIHYPRSTPEMWPDLIQKSKDGGLDVIETYVFWNLHEPVRNQYNFEGRY 95
+ G + ++ GSIHY R E W D + K K G + + TYV WNLHEP R +++F G
Sbjct: 82 LEGHKFLIFGGSIHYFRVPREYWRDRLLKLKACGFNTVTTYVPWNLHEPERGKFDFSGNL 141
Query: 96 DLVKFVKLVAEAGLYAHLRIGPYVCAEWNFGGFPLWLHFIPGIQFRTDNEPFKAEMQRFT 155
DL FV + AE GL+ LR G Y+C+E + GG P WL P + RT N+ F ++++
Sbjct: 142 DLEAFVLMAAEIGLWVILRPGRYICSEMDLGGLPSWLLQDPRLLLRTTNKSFIEAVEKYF 201
Query: 156 AKIVDMMKQEKLYASQGGPIILSQIENEYG--NIDSAYGA-AGKSYIKWAAGMALSLDTG 212
++ + L Q GP+I Q+ENEYG N D Y K+ ++ L G
Sbjct: 202 DHLIP--RVIPLQYRQAGPVIAVQVENEYGSFNKDKTYMPYLHKALLRRGIVELLLTSDG 259
Query: 213 VPWVMCQQSDAPDPIIN--TCNGFYCDQFTPNSNNKPKMWTENWSGWFLSFGGAVPYRPV 270
V+ + IN + +Q +KP + E W GWF +G +
Sbjct: 260 EKHVLSGHTKGVLAAINLQKLHQDTFNQLHKVQRDKPLLIMEYWVGWFDRWGDKHHVKDA 319
Query: 271 EDLAFAVARFFQRGGTFQNYYMYHGGTNFDRTSGGPF------ISTSYDYDAPLDEYGLI 324
+++ AV+ F + +F N YM+HGGTNF +G + I TSYDYDA L E G
Sbjct: 320 KEVEHAVSEFIKYEISF-NVYMFHGGTNFGFMNGATYFGKHSGIVTSYDYDAVLTEAGDY 378
Query: 325 RQPKWGHLKDLHKAIKLCEAALVATDP---TYPSLGPNLEATVYKTGSGLCSAFLANIGT 381
+ K+ L+ L +++ V P YP + P+L ++ S L +
Sbjct: 379 TE-KYLKLQKLFQSVSATPLPRVPKLPPKAVYPPVRPSLYLPLWDALSYLNEPVRSRQPV 437
Query: 382 NSDVTVKFNGN--SYLLPAWSVSILP---------DCKNVVFNTAKI------NSVTLVP 424
N + NG+ SY L + SI D V + I N +P
Sbjct: 438 NMENLPINNGSGQSYGLVLYEKSICSGGRLRAHAHDVAQVFLDETMIGILNENNKDLHIP 497
Query: 425 SFSR-QSLQVAADSSDAIGSGWSYINEPVGIS 455
+ L++ ++ + W NE GI+
Sbjct: 498 ELRDCRYLRILVENQGRVNFSWQIQNEQKGIT 529
>gi|414160019|ref|ZP_11416290.1| hypothetical protein HMPREF9310_00664 [Staphylococcus simulans
ACS-120-V-Sch1]
gi|410878669|gb|EKS26539.1| hypothetical protein HMPREF9310_00664 [Staphylococcus simulans
ACS-120-V-Sch1]
Length = 597
Score = 166 bits (421), Expect = 4e-38, Method: Compositional matrix adjust.
Identities = 109/314 (34%), Positives = 156/314 (49%), Gaps = 36/314 (11%)
Query: 35 VIGGKRRVLISGSIHYPRSTPEMWPDLIQKSKDGGLDVIETYVFWNLHEPVRNQYNFEGR 94
++ GK ++SG+IHY R PE W + K G + +ETYV WN HE V +++F G
Sbjct: 11 MLDGKPLKILSGAIHYFRVLPEDWEHSLYNLKALGFNAVETYVPWNFHETVEGEFDFSGT 70
Query: 95 YDLVKFVKLVAEAGLYAHLRIGPYVCAEWNFGGFPLWLHFIPGIQFRTDNEPFKAEMQRF 154
D+ +F+ GLY +R PY+CAEW FGG P WL P ++ R+ + F ++R+
Sbjct: 71 KDIKRFIHTAEAIGLYVIIRPSPYICAEWEFGGLPAWLLTKPNLRVRSRDPQFLEYVERY 130
Query: 155 TAKIVDMMKQEKLYASQGGPIILSQIENEYGNIDSAYGAAGKSYIKWAAGMALSLDTGVP 214
++ +++ L GPI++ Q+ENEYG +YG K+Y+ A M VP
Sbjct: 131 YDRLFEILT--PLQIDHHGPILMMQVENEYG----SYG-EDKTYLSALARMMRDRGVTVP 183
Query: 215 -------WVMCQQ--SDAPDPIINTCNGFYCDQFTPNSNNK---------PKMWTENWSG 256
W C + S A II T N Q ++ +K P M E W G
Sbjct: 184 LFTSDGSWQQCLEAGSLAEADIIPTGNFGSKSQKRLDNLHKFHQQFGKTWPLMSMEFWDG 243
Query: 257 WFLSFGGAVPYRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNFDRTSGG--------PFI 308
WF +G + R ++L + +RG N YM+HGGTNF +G P +
Sbjct: 244 WFNRWGDRIITRQSDELIDEIGEVLKRGSI--NLYMFHGGTNFGFWNGCSARGRIDLPQV 301
Query: 309 STSYDYDAPLDEYG 322
TSYDYDAPLDE G
Sbjct: 302 -TSYDYDAPLDEAG 314
Score = 49.7 bits (117), Expect = 0.007, Method: Compositional matrix adjust.
Identities = 60/222 (27%), Positives = 86/222 (38%), Gaps = 51/222 (22%)
Query: 511 LHAFINGKLVGSGYGSSSNAKVTVDFPIALAPGKNTFDLLSLTVGLQNYGAFYEKTGAGI 570
+ F++ + V + Y K F +AL D+L +G NYG +
Sbjct: 411 VQLFLDNEKVYTAYQEEIGDK----FEVALKQPVVQADVLVEHMGRVNYGY-------KL 459
Query: 571 TGPVQLKGSGNGTNIDLS-SQQWTYQTGLKGEELNFPSGSSTQWDSKSTLPKLQPLVWYK 629
P Q KG G G DL QQW Q + + L W++ QP +Y+
Sbjct: 460 VAPTQRKGLGQGLMQDLHFVQQWE-QFDIDFDLLE-DKHFEQAWEAD------QP-SFYR 510
Query: 630 TTFDAPAGSEPVAIDFTGMGKGEAWVNGQSIGRYWPTYVSQNGGCTDSCNYRGAYSSNKC 689
FD E +D +G GKG VNG +IGRYW
Sbjct: 511 YQFDIET-PESTYLDVSGFGKGVVLVNGFNIGRYW------------------------- 544
Query: 690 LKNCGKPSQSLYHVPRSWLKSSGNTLVLFEEIGGDPTKISFV 731
N G P+ SLY +P + LK N +++FE G +I +
Sbjct: 545 --NIG-PTLSLY-IPGALLKQGQNEIIIFETEGQYSEEIRLL 582
>gi|422698394|ref|ZP_16756303.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX1346]
gi|315173078|gb|EFU17095.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX1346]
Length = 604
Score = 166 bits (420), Expect = 4e-38, Method: Compositional matrix adjust.
Identities = 116/324 (35%), Positives = 164/324 (50%), Gaps = 27/324 (8%)
Query: 35 VIGGKRRVLISGSIHYPRSTPEMWPDLIQKSKDGGLDVIETYVFWNLHEPVRNQYNFEGR 94
++ G+ ++SG+IHY R P W + K G + +ETYV WNLHEP + ++FEG
Sbjct: 21 LLNGQPFKILSGAIHYFRVDPSDWHHSLYNLKALGFNTVETYVPWNLHEPQKGTFHFEGI 80
Query: 95 YDLVKFVKLVAEAGLYAHLRIGPYVCAEWNFGGFPLWLHFIPGIQFRTDNEPFKAEMQRF 154
DL +F+KL E GLYA +R PY+CAEW FGGFP WL PG + R++N + + +
Sbjct: 81 LDLERFLKLAQELGLYAIVRPSPYICAEWEFGGFPAWLLNEPG-RMRSNNPTYLKHVAEY 139
Query: 155 TAKIVDMMKQEKLYASQGGPIILSQIENEYGNI--DSAYGAAGKSYIKWAAGMALSLDTG 212
+++ + +L + GG I++ QIENEYG+ + AY A + + A +
Sbjct: 140 YDVLMEKIVPHQL--ANGGNILMIQIENEYGSFGEEKAYLRAIRDLMIARGVTAPFFTSD 197
Query: 213 VPW--VMCQQSDAPDPIINTCN-------GFYCDQ--FTPNSNNKPKMWTENWSGWFLSF 261
PW + S D I+ T N F Q F + P M E W GWF +
Sbjct: 198 GPWRATLRAGSMIEDDILVTGNFGSKAKENFDMMQAFFEEHGKKWPLMCMEFWDGWFNRW 257
Query: 262 GGAVPYRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNFDRTSGG--------PFISTSYD 313
+ R ++LA +V G N YM+HGGTNF +G P I TSYD
Sbjct: 258 KEPIIKRDPQELAESVREALALGSI--NLYMFHGGTNFGFMNGCSARGTIDLPQI-TSYD 314
Query: 314 YDAPLDEYGLIRQPKWGHLKDLHK 337
YDAPLDE G + + K LH+
Sbjct: 315 YDAPLDEQGNPTEKYFALQKMLHE 338
>gi|163790001|ref|ZP_02184436.1| glycosyl hydrolase, family 35 [Carnobacterium sp. AT7]
gi|159874701|gb|EDP68770.1| glycosyl hydrolase, family 35 [Carnobacterium sp. AT7]
Length = 595
Score = 166 bits (420), Expect = 5e-38, Method: Compositional matrix adjust.
Identities = 106/316 (33%), Positives = 153/316 (48%), Gaps = 34/316 (10%)
Query: 32 RAVVIGGKRRVLISGSIHYPRSTPEMWPDLIQKSKDGGLDVIETYVFWNLHEPVRNQYNF 91
++ G+ +ISG+IHY R PE W + K G + +ETY+ WN+HE +Y+F
Sbjct: 8 EEFLLNGEPFKIISGAIHYFRILPEDWYHSLYNLKALGFNTVETYIPWNVHETKEREYDF 67
Query: 92 EGRYDLVKFVKLVAEAGLYAHLRIGPYVCAEWNFGGFPLWLHFIPGIQFRTDNEPFKAEM 151
G+ D+ +FV+ E GL+ LR PY+CAEW FGG P WL ++ R+ + F ++
Sbjct: 68 SGQLDIQRFVQTAKELGLFVILRPSPYICAEWEFGGLPAWLLTYKNMRIRSSDPQFIEKV 127
Query: 152 QRFTAKIVDMMKQEKLYASQGGPIILSQIENEYGNIDSAYGAAGKSYIKWAAGMALSLDT 211
+ K+ + + L + GGP+I+ Q+ENEYG +YG K Y+K + L L
Sbjct: 128 SSYYKKLFEQIV--PLQVTSGGPVIMMQLENEYG----SYG-EDKEYLKTLYELMLELGV 180
Query: 212 GVP-------WVMCQQSDAPDPIINTCNGFYCDQFTPNSNNK-----------PKMWTEN 253
VP W Q++ + G + Q N N P M E
Sbjct: 181 TVPIFTSDGAWKATQEAGTMTDLDILTTGNFGSQSKENFKNLKEFHESKGKNWPLMCMEY 240
Query: 254 WSGWFLSFGGAVPYRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNFDRTSG-----GPFI 308
W GWF + + R +DL V + G N YM+HGGTNF +G G +
Sbjct: 241 WGGWFNRWNDPIIKRDAQDLTNDVKEALKIGSL--NLYMFHGGTNFGFMNGCSARLGKDL 298
Query: 309 S--TSYDYDAPLDEYG 322
TSYDYDAPL+E G
Sbjct: 299 PQLTSYDYDAPLNEQG 314
Score = 45.8 bits (107), Expect = 0.10, Method: Compositional matrix adjust.
Identities = 59/230 (25%), Positives = 84/230 (36%), Gaps = 53/230 (23%)
Query: 495 EDGSKTVLHVQSLGHALHAFINGKLVGSGYGSSSNAKVTVDFPIALAPGKNTFDLLSLTV 554
+D + V +H F+N + + + Y K+ PIA G N D+L +
Sbjct: 395 KDSDEEFYRVIDGSDRVHFFLNEEKIATQYQEEIGEKIYGS-PIA---GSNQLDVLVENM 450
Query: 555 GLQNYGAFYEKTGAGITGPVQLKGSGNGTNIDLSSQQWTYQTGLKGEELNFPSGSSTQWD 614
G NYG + Q KG G DL + T + L+F + ++
Sbjct: 451 GRVNYGH-------KLLADTQQKGIRRGVMSDLH-----FITDWEQYSLDFLKPLTIDFN 498
Query: 615 S--KSTLPKLQPLVWYKTTFDAPAGSEPVAIDFTGMGKGEAWVNGQSIGRYWPTYVSQNG 672
K P YK T D P E I+ GKG VNG +IGR+W
Sbjct: 499 EEWKENAPSFYQ---YKVTIDTP---EDTFINMELFGKGIVLVNGFNIGRFW-------- 544
Query: 673 GCTDSCNYRGAYSSNKCLKNCGKPSQSLYHVPRSWLKSSGNTLVLFEEIG 722
N G P+ SLY P+S K N +++FE G
Sbjct: 545 -------------------NVG-PTLSLY-APKSLFKKGENEIIVFETEG 573
>gi|402895880|ref|XP_003911040.1| PREDICTED: beta-galactosidase-1-like protein 3 [Papio anubis]
Length = 653
Score = 166 bits (420), Expect = 5e-38, Method: Compositional matrix adjust.
Identities = 114/336 (33%), Positives = 162/336 (48%), Gaps = 25/336 (7%)
Query: 36 IGGKRRVLISGSIHYPRSTPEMWPDLIQKSKDGGLDVIETYVFWNLHEPVRNQYNFEGRY 95
+ G+R ++ GSIHY R W D + K + G + + TYV WNLHEP R +++F G
Sbjct: 82 LEGRRFLICGGSIHYFRVPRAYWRDRLLKLRACGFNTVTTYVPWNLHEPERGKFDFSGNL 141
Query: 96 DLVKFVKLVAEAGLYAHLRIGPYVCAEWNFGGFPLWLHFIPGIQFRTDNEPFKAEMQRFT 155
DL FV + AE GL+ LR GPY+C+E + GG P WL P + RT N+ F ++++
Sbjct: 142 DLEAFVLMAAEIGLWVILRPGPYICSEMDLGGLPSWLLQDPRLLLRTTNKGFTEAVEKYF 201
Query: 156 AKIVDMMKQEKLYASQGGPIILSQIENEYG--NIDSAYGA-AGKSYIKWAAGMALSLDTG 212
++ + L QGGP+I Q+ENEYG N D Y K+ ++ L G
Sbjct: 202 DHLIP--RVIPLQYRQGGPVIAVQVENEYGSFNKDKTYMPYLHKALLRRGIVELLLTSDG 259
Query: 213 VPWVMCQQSDAPDPIIN----TCNGFYCDQFTPNSNNKPKMWTENWSGWFLSFGGAVPYR 268
V+ + IN N F +Q +KP + E W GWF +G +
Sbjct: 260 EKNVLSGHTKGVLAAINLQKVQRNTF--NQLHKVQRDKPLLVMEYWVGWFDRWGDKHHVK 317
Query: 269 PVEDLAFAVARFFQRGGTFQNYYMYHGGTNFDRTSGGPF------ISTSYDYDAPLDEYG 322
+++ AV+ F + +F N YM+HGGTNF +G I TSYDYDA L E G
Sbjct: 318 DAKEVERAVSEFIKYEISF-NVYMFHGGTNFGFMNGATNFGKHTGIVTSYDYDAVLTEAG 376
Query: 323 LIRQPKWGHLKDLHKAIKLCEAALVATDPTYPSLGP 358
+ + K KL E+ P P L P
Sbjct: 377 -------DYTEKYFKLQKLLESVSATPLPQVPKLTP 405
>gi|224542300|ref|ZP_03682839.1| hypothetical protein CATMIT_01478 [Catenibacterium mitsuokai DSM
15897]
gi|224524842|gb|EEF93947.1| glycosyl hydrolase family 35 [Catenibacterium mitsuokai DSM 15897]
Length = 577
Score = 166 bits (420), Expect = 5e-38, Method: Compositional matrix adjust.
Identities = 105/316 (33%), Positives = 163/316 (51%), Gaps = 40/316 (12%)
Query: 35 VIGGKRRVLISGSIHYPRSTPEMWPDLIQKSKDGGLDVIETYVFWNLHEPVRNQYNFEGR 94
+I G++ +ISG++HY R PE W D + KD G + +ETY+ WNLHEP + +++F+G+
Sbjct: 11 IIDGQKTKIISGAVHYFRIVPEYWEDTLLDLKDMGCNAVETYIPWNLHEPYKGKFDFDGQ 70
Query: 95 YDLVKFVKLVAEAGLYAHLRIGPYVCAEWNFGGFPLWLHFIPGIQFRTDNEPFKAEMQRF 154
D+ F++L + GLY +R PY+C+EW GG P WL I+ RT++ + ++ +
Sbjct: 71 KDVCAFLELAKKLGLYVIIRPSPYICSEWELGGLPAWLLKDSDIRLRTNDSVYMKHLEEY 130
Query: 155 TAKIVDMMKQEKLYASQGGPIILSQIENEYGNIDSAYGAAGKSYIKWAAGMALSLDTGVP 214
A ++ M+ + ++ ++ G IIL+Q+ENEYG+ + K Y+K A + + + G+
Sbjct: 131 YAVLLPMIAKYQI--NREGTIILAQLENEYGSYNQ-----DKDYLK--ALLKMMREYGIE 181
Query: 215 WVMCQQSDAPDPIINTCNGFYCDQF-TPN--SNNK-----------------PKMWTENW 254
+ + + + F D F T N SN K P M E W
Sbjct: 182 VPIFTADGTWEEALEAGSLFEEDVFPTGNFGSNAKENIAVLKEFMKKHQIVAPIMCMEFW 241
Query: 255 SGWFLSFGGAVPYRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNFDRTSGG--------P 306
GWF + + R E+L + G N+YM+HGGTNF +G P
Sbjct: 242 DGWFNRWNMEIVKRDPEELVQSAKEMIDLGSI--NFYMFHGGTNFGWMNGCSARKEHDLP 299
Query: 307 FISTSYDYDAPLDEYG 322
I TSYDYDA L EYG
Sbjct: 300 QI-TSYDYDAILTEYG 314
>gi|336063700|ref|YP_004558559.1| beta-galactosidase [Streptococcus pasteurianus ATCC 43144]
gi|334281900|dbj|BAK29473.1| beta-galactosidase precursor [Streptococcus pasteurianus ATCC
43144]
Length = 595
Score = 166 bits (420), Expect = 5e-38, Method: Compositional matrix adjust.
Identities = 120/355 (33%), Positives = 171/355 (48%), Gaps = 50/355 (14%)
Query: 32 RAVVIGGKRRVLISGSIHYPRSTPEMWPDLIQKSKDGGLDVIETYVFWNLHEPVRNQYNF 91
+ + GK ++SGSIHY R P+ W + K G + +ETYV WNLHEP +++F
Sbjct: 8 ESFFLDGKPFKILSGSIHYFRIHPDDWYQSLYNLKALGFNTVETYVPWNLHEPREGEFDF 67
Query: 92 EGRYDLVKFVKLVAEAGLYAHLRIGPYVCAEWNFGGFPLWLHFIPGIQFRTDNEPFKAEM 151
G DL +F+ + E GLYA +R PY+CAEW FGG P WL G++ R+ ++ F +
Sbjct: 68 TGILDLERFLTIAQELGLYAIVRPSPYICAEWEFGGLPAWL-LEKGVRVRSQDKDFLQVV 126
Query: 152 QRFTAKIVDMMKQEKLYASQGGPIILSQIENEYGNIDSAYGAAGKSYIKWAAGMALSLDT 211
+R+ ++ + + +L QGG I++ Q+ENEYG +YG K Y++ M L L
Sbjct: 127 KRYYEALIPRLIKHQL--DQGGNILMFQVENEYG----SYG-EDKVYLRELKQMMLELGL 179
Query: 212 GVPWVMCQQSDAP------------DPIINTCN-------GFYCDQ--FTPNSNNKPKMW 250
P+ SD P D ++ T N F + F P M
Sbjct: 180 EEPFFT---SDGPWHTALRAGSLIEDDVLVTGNFGSKAKENFASMEMFFQQYGKKWPLMC 236
Query: 251 TENWSGWFLSFGGAVPYRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNFDRTSGG----- 305
E W GWF +G V R E+LA AV + G N YM+HGGTNF +G
Sbjct: 237 MEFWDGWFNRWGEPVIKRDPEELADAVMEAIEIGSI--NLYMFHGGTNFGFMNGCSARKQ 294
Query: 306 ---PFISTSYDYDAPLDEYG-------LIRQPKWGHLKDLHKAIKLCEAALVATD 350
P + TSYDYDA LDE G +++ +LH A L + + D
Sbjct: 295 TDLPQV-TSYDYDAILDEAGNPTKKFYILQHRLKNKYPELHYAAPLVKPTMAIKD 348
Score = 50.8 bits (120), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 53/213 (24%), Positives = 85/213 (39%), Gaps = 53/213 (24%)
Query: 512 HAFINGKLVGSGYGSSSNAKVTVDFPIALAPGKNTFDLLSLTVGLQNYGAFYEKTGAGIT 571
F+NG + + Y + V+F ++ D+L +G NYG +T
Sbjct: 412 QVFLNGNHIVTQYQEEIGDDIQVNF----TSEESQLDILVENMGRVNYGH-------KLT 460
Query: 572 GPVQLKGSGNGTNIDLS-SQQWTYQTGLKGEELNFPSGSSTQWDSKSTLPKLQPL-VWYK 629
P Q KG G G +DL QW +P ++ + K + P + + +Y+
Sbjct: 461 APSQHKGIGRGVMLDLHFVNQWE----------TYPLSMNSIKNLKYSSPWREGVPSFYE 510
Query: 630 TTFDAPAGSEPVAIDFTGMGKGEAWVNGQSIGRYWPTYVSQNGGCTDSCNYRGAYSSNKC 689
F E +D +G GKG A++NG ++GR+W
Sbjct: 511 FKFHC-LNPEDTYMDMSGFGKGVAFINGYNLGRFW------------------------- 544
Query: 690 LKNCGKPSQSLYHVPRSWLKSSGNTLVLFEEIG 722
N G P+ SLY +PR + NT+ +FE G
Sbjct: 545 --NIG-PTLSLY-IPRGMMVCGENTITIFETEG 573
>gi|294633111|ref|ZP_06711670.1| beta-galactosidase [Streptomyces sp. e14]
gi|292830892|gb|EFF89242.1| beta-galactosidase [Streptomyces sp. e14]
Length = 606
Score = 166 bits (420), Expect = 5e-38, Method: Compositional matrix adjust.
Identities = 113/330 (34%), Positives = 164/330 (49%), Gaps = 30/330 (9%)
Query: 25 ANVTYDHRAVVIGGKRRVLISGSIHYPRSTPEMWPDLIQKSKDGGLDVIETYVFWNLHEP 84
A +T+ ++ G+ ++SGS+HY R P W D + + GL+ ++TYV WN HE
Sbjct: 15 ATLTHAGGTLLRAGRPHRILSGSLHYFRVHPGQWADRLARLAALGLNTVDTYVPWNFHER 74
Query: 85 VRNQYNFEGRYDLVKFVKLVAEAGLYAHLRIGPYVCAEWNFGGFPLWLHFIPGIQFRTDN 144
F+G DL +FV+L E GL +R GPY+CAEW+ GG P WL PG++ RT +
Sbjct: 75 TPGDVRFDGWRDLDRFVRLAQETGLDVIVRPGPYICAEWDNGGLPAWLTGTPGMRPRTSH 134
Query: 145 EPFKAEMQRFTAKIVDMMKQEKLYASQGGPIILSQIENEYGNIDSAYGAAGKSYIKW--- 201
PF A + R+ +++ + L A +GGP++ QIENEYG+ YG G Y++W
Sbjct: 135 PPFLAAVARWFDQLIPRIA--ALQAGRGGPVVAVQIENEYGS----YGDDG-DYVRWVRD 187
Query: 202 ---AAGMALSLDT--GVPWVMCQQSDAPDPIINTCNGFYCDQ----FTPNSNNKPKMWTE 252
A G+ L T G +M + G +Q +P E
Sbjct: 188 ALTARGVTELLYTADGPTELMLDAGAVEGELAAATFGSRPEQAARLLRSRRPEEPFFCAE 247
Query: 253 NWSGWFLSFGGAVPYRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNFDRTSGG------- 305
W+GWF +G RP A V R GG+ + YM HGGTNF +G
Sbjct: 248 FWNGWFDHWGEQHHVRPARSAADDVGRILGAGGSL-SLYMAHGGTNFGLWAGANHDGDRL 306
Query: 306 -PFISTSYDYDAPLDEYGLIRQPKWGHLKD 334
P + TSYD DAP+ E+G + + K+ L+D
Sbjct: 307 QPTV-TSYDSDAPVAEHGALTE-KFFALRD 334
>gi|313149603|ref|ZP_07811796.1| glycoside hydrolase family 35 [Bacteroides fragilis 3_1_12]
gi|313138370|gb|EFR55730.1| glycoside hydrolase family 35 [Bacteroides fragilis 3_1_12]
Length = 628
Score = 166 bits (419), Expect = 6e-38, Method: Compositional matrix adjust.
Identities = 114/354 (32%), Positives = 169/354 (47%), Gaps = 47/354 (13%)
Query: 38 GKRRVLISGSIHYPRSTPEMWPDLIQKSKDGGLDVIETYVFWNLHEPVRNQYNFEGRYDL 97
GK ++SG +HY R + W +Q K GL+ + TYVFWNLHEP +++F G +L
Sbjct: 38 GKVTPVLSGEMHYARIPHQYWRHRLQMMKGMGLNTVATYVFWNLHEPEPGKWDFTGDKNL 97
Query: 98 VKFVKLVAEAGLYAHLRIGPYVCAEWNFGGFPLWLHFIPGIQFRTDNEPFKAEMQRFTAK 157
+F+K E G+ LR GPYVCAEW FGG+P WL + G++ R DN E ++T
Sbjct: 98 AEFIKTAGEEGMMVILRPGPYVCAEWEFGGYPWWLQNVKGMEIRRDN----PEFLKYTKA 153
Query: 158 IVDMMKQE--KLYASQGGPIILSQIENEYGNI-----------DSAYGAAGKSYIKWAAG 204
+D + +E L ++GGPI++ Q ENE+G+ AY A K + A
Sbjct: 154 YIDRLYKEVGNLQCTKGGPIVMVQCENEFGSYVAQRKDIPLEEHRAYNAKIKQQLADAGF 213
Query: 205 MALSLDTGVPWVMCQQSDAPDPIINTCNGF--------YCDQFTPNSNNKPKMWTENWSG 256
+ W+ + A + T NG +Q+ + P M E + G
Sbjct: 214 NVPLFTSDGSWLF--EGGATPGALPTANGESDIENLKKVVNQY--HDGKGPYMVAEFYPG 269
Query: 257 WFLSFGGAVPYRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNFDRTSGGPFIS------- 309
W + P +A ++ Q +F N+YM HGGTNF TSG +
Sbjct: 270 WLSHWAEPFPQVGASGIARQTEKYLQNDVSF-NFYMVHGGTNFGFTSGANYDKKRDIQPD 328
Query: 310 -TSYDYDAPLDEYGLIRQPKWGHLKD-LHKAIKLCEAALVATDPTYPSLGPNLE 361
TSYDYDAP+ E G + PK+ +++ + K +K T P P+ P +E
Sbjct: 329 LTSYDYDAPISEAGWV-TPKYDSIRNVIRKYVKY-------TVPEAPAPNPVIE 374
Score = 49.7 bits (117), Expect = 0.007, Method: Compositional matrix adjust.
Identities = 54/229 (23%), Positives = 90/229 (39%), Gaps = 50/229 (21%)
Query: 502 LHVQSLGHALHAFINGKLVGSGYGSSSNAKVTVDFPIALAPGKNTFDLLSLTVGLQNYGA 561
L + L +++G+ VG ++ + ++ P T +L +G NYG+
Sbjct: 428 LEIPGLRDYAVVYVDGEQVGVLNRNTKTYSMEIEVPF-----NATLQILVENMGRINYGS 482
Query: 562 FYEKTGAGITGPVQLKGSGNGTNIDLSSQQWTYQTGLKGEELNFPSGSSTQWDSKSTLP- 620
GI PV++ G +++ + YQ + P + + D+ + +P
Sbjct: 483 EIVHNTKGIISPVKIAGK------EITGEWDMYQLPMS----EMPDLAKLKADAHANVPA 532
Query: 621 ---KLQPL-VWYKTTFDAPAGSEPVAIDFTGMGKGEAWVNGQSIGRYWPTYVSQNGGCTD 676
KL+ V Y+ TF + ID GKG +VNG +IGRYW
Sbjct: 533 EAAKLKGCPVLYEGTFTLDNVGD-TFIDMENWGKGIIFVNGVNIGRYWKV---------- 581
Query: 677 SCNYRGAYSSNKCLKNCGKPSQSLYHVPRSWLKSSGNTLVLFEEIGGDP 725
P Q+LY +P WLK N +V+FE++ P
Sbjct: 582 ------------------GPQQTLY-IPGVWLKKGTNKIVIFEQLNEVP 611
>gi|53715303|ref|YP_101295.1| beta-galactosidase [Bacteroides fragilis YCH46]
gi|52218168|dbj|BAD50761.1| beta-galactosidase precursor [Bacteroides fragilis YCH46]
Length = 628
Score = 166 bits (419), Expect = 6e-38, Method: Compositional matrix adjust.
Identities = 116/359 (32%), Positives = 170/359 (47%), Gaps = 42/359 (11%)
Query: 38 GKRRVLISGSIHYPRSTPEMWPDLIQKSKDGGLDVIETYVFWNLHEPVRNQYNFEGRYDL 97
GK ++SG +HY R + W +Q K GL+ + TYVFWNLHEP +++F G +L
Sbjct: 38 GKITPVLSGEMHYARIPHQYWRHRLQMMKGMGLNTVATYVFWNLHEPEPGKWDFTGDKNL 97
Query: 98 VKFVKLVAEAGLYAHLRIGPYVCAEWNFGGFPLWLHFIPGIQFRTDNEPFKAEMQRFTAK 157
+F+K E G+ LR GPYVCAEW FGG+P WL + G++ R DN E ++T
Sbjct: 98 AEFIKTAGEEGMMVILRPGPYVCAEWEFGGYPWWLQNVKGMEIRRDN----PEFLKYTKA 153
Query: 158 IVDMMKQE--KLYASQGGPIILSQIENEYGNI-----------DSAYGAAGKSYIKWAAG 204
+D + +E L ++GGPI++ Q ENE+G+ AY A K +
Sbjct: 154 YIDRLYKEVGSLQCTKGGPIVMVQCENEFGSYVAQRKDIPLEEHRAYNAKIKQQLADVGF 213
Query: 205 MALSLDTGVPWVMCQQSDAPDPIINTCNGF--------YCDQFTPNSNNKPKMWTENWSG 256
+ W+ + A + T NG DQ+ + P M E + G
Sbjct: 214 NVPLFTSDGSWLF--EGGATPGALPTANGESDIENLKKVVDQY--HDGKGPYMVAEFYPG 269
Query: 257 WFLSFGGAVPYRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNFDRTSGGPFIS------- 309
W + P +A ++ Q +F N+YM HGGTNF TSG +
Sbjct: 270 WLSHWAEPFPQIGASGIARQTEKYLQNDVSF-NFYMVHGGTNFGFTSGANYDKKRDIQPD 328
Query: 310 -TSYDYDAPLDEYGLIRQPKWGHLKD-LHKAIKLCEAALVATDPT--YPSLGPNLEATV 364
TSYDYDAP+ E G + PK+ +++ + K +K A +P PS+ N A V
Sbjct: 329 MTSYDYDAPISEAGWV-TPKYDSIRNVIKKYVKYTIPEAPAPNPVIEIPSIQLNKVADV 386
Score = 49.7 bits (117), Expect = 0.007, Method: Compositional matrix adjust.
Identities = 56/229 (24%), Positives = 87/229 (37%), Gaps = 50/229 (21%)
Query: 502 LHVQSLGHALHAFINGKLVGSGYGSSSNAKVTVDFPIALAPGKNTFDLLSLTVGLQNYGA 561
L + L +++G+ VG ++ + ++ P T +L +G NYG+
Sbjct: 428 LEIPGLRDYAVVYVDGEQVGVLNRNTKTYSMEIEVPF-----NATLQILVENMGRINYGS 482
Query: 562 FYEKTGAGITGPVQLKGSGNGTNIDLSSQQWTYQTGLKGEELNFPSGSSTQWDSKSTLP- 620
GI PVQ+ G D+ YQ + P + + D+ +P
Sbjct: 483 EIVHNTKGIISPVQIAGKEIVGGWDM------YQLPMD----EMPDLTKLKADTHKNVPS 532
Query: 621 ---KLQPL-VWYKTTFDAPAGSEPVAIDFTGMGKGEAWVNGQSIGRYWPTYVSQNGGCTD 676
KL+ V Y+ TF + +D GKG +VNG +IGRYW
Sbjct: 533 EVAKLKGCPVLYEGTFTLDKVGD-TFMDMESWGKGIVFVNGVNIGRYWKV---------- 581
Query: 677 SCNYRGAYSSNKCLKNCGKPSQSLYHVPRSWLKSSGNTLVLFEEIGGDP 725
P Q+LY VP WLK N +V+FE++ P
Sbjct: 582 ------------------GPQQTLY-VPGVWLKKGENKIVIFEQLNETP 611
>gi|300775043|ref|ZP_07084906.1| beta-galactosidase [Chryseobacterium gleum ATCC 35910]
gi|300506858|gb|EFK37993.1| beta-galactosidase [Chryseobacterium gleum ATCC 35910]
Length = 621
Score = 166 bits (419), Expect = 7e-38, Method: Compositional matrix adjust.
Identities = 108/328 (32%), Positives = 155/328 (47%), Gaps = 27/328 (8%)
Query: 35 VIGGKRRVLISGSIHYPRSTPEMWPDLIQKSKDGGLDVIETYVFWNLHEPVRNQYNFEGR 94
++ GK + SG IHYPR W ++ K GL+ + TYVFWN HE ++NF G
Sbjct: 39 LLNGKPFTIYSGEIHYPRVPSAYWKHRLEMMKAMGLNTVTTYVFWNYHEEAPGKWNFSGE 98
Query: 95 YDLVKFVKLVAEAGLYAHLRIGPYVCAEWNFGGFPLWLHFIPGIQFRTDNEPFKAEMQRF 154
DL KF+K E GLY +R GPYVCAEW FGG+P WL ++ R DN+ F E ++
Sbjct: 99 KDLQKFIKTAQETGLYVIIRPGPYVCAEWEFGGYPWWLQKNKELEIRRDNKAFSEECWKY 158
Query: 155 TAKIVDMMKQEKLYASQGGPIILSQIENEYGNIDSAYG----AAGKSYIKWAAGMALSLD 210
+++ + ++ + GGP+I+ Q ENE+G+ + + Y M L
Sbjct: 159 ISQLAKQITPMQI--TNGGPVIMVQAENEFGSYVAQRKDIPLEEHRKYSHKIKEMLLKSG 216
Query: 211 TGVPWVMCQQSD-----APDPIINTCNGFYCDQFTPNSNNK------PKMWTENWSGWFL 259
VP S + + + T NG S N+ P M E + GW
Sbjct: 217 ISVPLFTSDGSSLFKGGSVEGALPTANGESDIDVLKKSINEYNGGKGPYMIAEYYPGWLD 276
Query: 260 SFGGAVPYRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNFDRTSGGPFIS--------TS 311
+ E++ + + G +F NYYM HGGTNF TSG + TS
Sbjct: 277 HWAEPFVKVSTEEVVKQTNLYIENGVSF-NYYMIHGGTNFGFTSGANYDKDHDIQPDLTS 335
Query: 312 YDYDAPLDEYGLIRQPKWGHLKDLHKAI 339
YDYDAP+ E G PK+ L+ + + I
Sbjct: 336 YDYDAPISEAGWA-TPKYNALRKIFQKI 362
Score = 48.9 bits (115), Expect = 0.013, Method: Compositional matrix adjust.
Identities = 57/224 (25%), Positives = 86/224 (38%), Gaps = 49/224 (21%)
Query: 498 SKTVLHVQSLGHALHAFINGKLVGSGYGSSSNAKVTVDFPIALAPGKNTFDLLSLTVGLQ 557
K +L V+ L + +INGK G + N K +D I + ++L +G
Sbjct: 428 QKGLLEVKGLRDYANVYINGKWKGEL--NRVNKKYDLDIEIKSG---DRLEILVENMGRI 482
Query: 558 NYGAFYEKTGAGITGPVQLKGSGNGTNIDLSSQQWTYQTGLKGEELNFPSGSSTQWDSKS 617
NYGA GI PV++ NGT + S W E L P + + K+
Sbjct: 483 NYGAEIVHNLKGIISPVKI----NGTEV---SGNW--------EMLPLPFDTFPKHHFKN 527
Query: 618 TLPKLQPLVWYKTTFDAPAGSEPVAIDFTGMGKGEAWVNGQSIGRYWPTYVSQNGGCTDS 677
+ V + F + +D GKG ++NG++ GRYW T
Sbjct: 528 KNIEDHSPVIQEAEFTLNETGD-TFLDMRNFGKGIVFINGRNAGRYWSTV---------- 576
Query: 678 CNYRGAYSSNKCLKNCGKPSQSLYHVPRSWLKSSGNTLVLFEEI 721
P Q+LY +P WLK N + +FE+I
Sbjct: 577 -----------------GPQQTLY-IPGVWLKKGRNKIQIFEQI 602
>gi|423280524|ref|ZP_17259436.1| hypothetical protein HMPREF1203_03653 [Bacteroides fragilis HMW
610]
gi|404583731|gb|EKA88404.1| hypothetical protein HMPREF1203_03653 [Bacteroides fragilis HMW
610]
Length = 628
Score = 166 bits (419), Expect = 7e-38, Method: Compositional matrix adjust.
Identities = 114/354 (32%), Positives = 169/354 (47%), Gaps = 47/354 (13%)
Query: 38 GKRRVLISGSIHYPRSTPEMWPDLIQKSKDGGLDVIETYVFWNLHEPVRNQYNFEGRYDL 97
GK ++SG +HY R + W +Q K GL+ + TYVFWNLHEP +++F G +L
Sbjct: 38 GKVTPVLSGEMHYARIPHQYWRHRLQMMKGMGLNTVATYVFWNLHEPEPGKWDFTGDKNL 97
Query: 98 VKFVKLVAEAGLYAHLRIGPYVCAEWNFGGFPLWLHFIPGIQFRTDNEPFKAEMQRFTAK 157
+F+K E G+ LR GPYVCAEW FGG+P WL + G++ R DN E ++T
Sbjct: 98 AEFIKTAGEEGMMVILRPGPYVCAEWEFGGYPWWLQNVKGMEIRRDN----PEFLKYTKA 153
Query: 158 IVDMMKQE--KLYASQGGPIILSQIENEYGNI-----------DSAYGAAGKSYIKWAAG 204
+D + +E L ++GGPI++ Q ENE+G+ AY A K + A
Sbjct: 154 YIDRLYKEVGDLQCTKGGPIVMVQCENEFGSYVAQRKDIPLEEHRAYNAKIKQQLADAGF 213
Query: 205 MALSLDTGVPWVMCQQSDAPDPIINTCNGF--------YCDQFTPNSNNKPKMWTENWSG 256
+ W+ + A + T NG +Q+ + P M E + G
Sbjct: 214 NVPLFTSDGSWLF--EGGATPGALPTANGESDIENLKKVVNQY--HDGKGPYMVAEFYPG 269
Query: 257 WFLSFGGAVPYRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNFDRTSGGPFIS------- 309
W + P +A ++ Q +F N+YM HGGTNF TSG +
Sbjct: 270 WLSHWAEPFPQVGASGIARQTEKYLQNDVSF-NFYMVHGGTNFGFTSGANYDKKRDIQPD 328
Query: 310 -TSYDYDAPLDEYGLIRQPKWGHLKD-LHKAIKLCEAALVATDPTYPSLGPNLE 361
TSYDYDAP+ E G + PK+ +++ + K +K T P P+ P +E
Sbjct: 329 LTSYDYDAPISEAGWV-TPKYDSIRNVIRKYVKY-------TVPEAPAPNPVIE 374
Score = 49.7 bits (117), Expect = 0.007, Method: Compositional matrix adjust.
Identities = 54/229 (23%), Positives = 90/229 (39%), Gaps = 50/229 (21%)
Query: 502 LHVQSLGHALHAFINGKLVGSGYGSSSNAKVTVDFPIALAPGKNTFDLLSLTVGLQNYGA 561
L + L +++G+ VG ++ + ++ P T +L +G NYG+
Sbjct: 428 LEIPGLRDYAVVYVDGEQVGVLNRNTKTYSMEIEVPF-----NATLQILVENMGRINYGS 482
Query: 562 FYEKTGAGITGPVQLKGSGNGTNIDLSSQQWTYQTGLKGEELNFPSGSSTQWDSKSTLP- 620
GI PV++ G +++ + YQ + P + + D+ + +P
Sbjct: 483 EIVHNTKGIISPVKIAGK------EITGEWDMYQLPMS----EMPDLAKLKADAHANVPA 532
Query: 621 ---KLQPL-VWYKTTFDAPAGSEPVAIDFTGMGKGEAWVNGQSIGRYWPTYVSQNGGCTD 676
KL+ V Y+ TF + ID GKG +VNG +IGRYW
Sbjct: 533 EAAKLKGCPVLYEGTFTLDNVGD-TFIDMENWGKGIIFVNGVNIGRYWKV---------- 581
Query: 677 SCNYRGAYSSNKCLKNCGKPSQSLYHVPRSWLKSSGNTLVLFEEIGGDP 725
P Q+LY +P WLK N +V+FE++ P
Sbjct: 582 ------------------GPQQTLY-IPGVWLKKGTNKIVIFEQLNEVP 611
>gi|294672870|ref|YP_003573486.1| beta-galactosidase [Prevotella ruminicola 23]
gi|294473700|gb|ADE83089.1| putative beta-galactosidase [Prevotella ruminicola 23]
Length = 787
Score = 165 bits (418), Expect = 8e-38, Method: Compositional matrix adjust.
Identities = 108/340 (31%), Positives = 167/340 (49%), Gaps = 29/340 (8%)
Query: 1 MASKEILLLVLCWGFVVLATTSFGANVTYDHRAVVIGGKRRVLISGSIHYPRSTPEMWPD 60
M ++L+ L F A+ + T ++ ++ G+ V+ + +HYPR W
Sbjct: 1 MKKVKLLITALLLTFAQFAS---AGDFTVGNKTFLLNGEPFVVKAAEVHYPRIPRPYWEH 57
Query: 61 LIQKSKDGGLDVIETYVFWNLHEPVRNQYNFEGRYDLVKFVKLVAEAGLYAHLRIGPYVC 120
I+ K G++ + YVFWN+HE Q++F D+ +F +L + G+Y +R GPYVC
Sbjct: 58 RIKMCKALGMNTLCIYVFWNIHEQREGQFDFTDNNDVAEFCRLAQKNGMYVIVRPGPYVC 117
Query: 121 AEWNFGGFPLWLHFIPGIQFRTDNEPFKAEMQRFTAKIVDMMKQEKLYASQGGPIILSQI 180
AEW GG P WL I+ R + F ++ F K+ + + L GGPII+ Q+
Sbjct: 118 AEWEMGGLPWWLLKKKDIRLRERDPYFLERVKIFEQKVGEQLA--PLTIQNGGPIIMVQV 175
Query: 181 ENEYGNI--DSAYGAAGKSYIKWAAGMALSLDTGVPWVMCQQSDAPDPIINTCN---GFY 235
ENEYG+ D Y + + ++ G L+L W + + D ++ T N G
Sbjct: 176 ENEYGSYGEDKPYVSEIRDCLRGIYGEKLTL-FQCDWSSNFERNGLDDLVWTMNFGTGAN 234
Query: 236 CD-------QFTPNSNNKPKMWTENWSGWFLSFGGAVPYRPVEDLAFAVARFFQRGGTFQ 288
D Q PN+ P M +E WSGWF +G RP +D+ + + +F
Sbjct: 235 IDHEFARLKQLRPNA---PLMCSEFWSGWFDKWGANHETRPAKDMVDGMDEMLSKNISF- 290
Query: 289 NYYMYHGGTNFDRTSG------GPFISTSYDYDAPLDEYG 322
+ YM HGGT+F +G P + TSYDYDAP++EYG
Sbjct: 291 SLYMTHGGTSFGHWAGANSPGFAPDV-TSYDYDAPINEYG 329
>gi|297194215|ref|ZP_06911613.1| beta-galactosidase [Streptomyces pristinaespiralis ATCC 25486]
gi|197722531|gb|EDY66439.1| beta-galactosidase [Streptomyces pristinaespiralis ATCC 25486]
Length = 590
Score = 165 bits (418), Expect = 8e-38, Method: Compositional matrix adjust.
Identities = 112/331 (33%), Positives = 160/331 (48%), Gaps = 42/331 (12%)
Query: 36 IGGKRRVLISGSIHYPRSTPEMWPDLIQKSKDGGLDVIETYVFWNLHEPVRNQYNFEGRY 95
+ G+ L+SG++HY R PE WP ++ + GLD +ETYV WNLHEP +Y+F+G
Sbjct: 11 LDGRPLRLLSGALHYFRVLPEQWPHRLRMLRAMGLDTVETYVPWNLHEPRPGEYDFDGIA 70
Query: 96 DLVKFVKLVAEAGLYAHLRIGPYVCAEWNFGGFPLWLHFIPGI-QFRTDNEPFKAEMQRF 154
DL +F+ EAGL+A +R PY+CAEW GG P WL P + R + + A + R+
Sbjct: 71 DLDRFLHATREAGLHAIVRPSPYICAEWENGGLPWWLLADPEVGALRCQDPAYLAHVDRW 130
Query: 155 TAKIVDMMKQEKLYASQGGPIILSQIENEYGNIDSAYGAAGKSYIKWAAGM-ALSLDTGV 213
+++ ++ ++ S+GG +++ Q+ENEYG+ YG AAG+ A +D
Sbjct: 131 FDRLIPVVAAHQV--SRGGNVLMVQVENEYGS----YGTDTGYLEHLAAGLRARGID--- 181
Query: 214 PWVMCQQSDAPDPIINTCNGFYCDQFTPNSNNKPK---------------MWTENWSGWF 258
V SD PD T T N ++PK M E W GWF
Sbjct: 182 --VPLFTSDGPDDFFLTGGALPGHLATVNFGSRPKEALADLARLRPDDPAMCMEFWCGWF 239
Query: 259 LSFGGAVPYRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNFDRTSGG------------P 306
+G R D A + G + N YM HGGTNF +G P
Sbjct: 240 DHWGTDHVVRDPADAAGVLEELLAAGASV-NVYMAHGGTNFSTWAGANTEDPAAGTGYRP 298
Query: 307 FISTSYDYDAPLDEYGLIRQPKWGHLKDLHK 337
+ TSYDYDAP+DE G + W + L +
Sbjct: 299 TV-TSYDYDAPVDERGAATEKFWAFREVLER 328
>gi|424665121|ref|ZP_18102157.1| hypothetical protein HMPREF1205_00996 [Bacteroides fragilis HMW
616]
gi|404574985|gb|EKA79730.1| hypothetical protein HMPREF1205_00996 [Bacteroides fragilis HMW
616]
Length = 628
Score = 165 bits (418), Expect = 8e-38, Method: Compositional matrix adjust.
Identities = 114/354 (32%), Positives = 169/354 (47%), Gaps = 47/354 (13%)
Query: 38 GKRRVLISGSIHYPRSTPEMWPDLIQKSKDGGLDVIETYVFWNLHEPVRNQYNFEGRYDL 97
GK ++SG +HY R + W +Q K GL+ + TYVFWNLHEP +++F G +L
Sbjct: 38 GKVTPVLSGEMHYARIPHQYWRHRLQMMKGMGLNTVATYVFWNLHEPEPGKWDFTGDKNL 97
Query: 98 VKFVKLVAEAGLYAHLRIGPYVCAEWNFGGFPLWLHFIPGIQFRTDNEPFKAEMQRFTAK 157
+F+K E G+ LR GPYVCAEW FGG+P WL + G++ R DN E ++T
Sbjct: 98 AEFIKTAGEEGMMVILRPGPYVCAEWEFGGYPWWLQNVKGMEIRRDN----PEFLKYTKA 153
Query: 158 IVDMMKQE--KLYASQGGPIILSQIENEYGNI-----------DSAYGAAGKSYIKWAAG 204
+D + +E L ++GGPI++ Q ENE+G+ AY A K + A
Sbjct: 154 YIDRLYKEVGDLQCTKGGPIVMVQCENEFGSYVAQRKDIPLEEHRAYNAKIKQQLADAGF 213
Query: 205 MALSLDTGVPWVMCQQSDAPDPIINTCNGF--------YCDQFTPNSNNKPKMWTENWSG 256
+ W+ + A + T NG +Q+ + P M E + G
Sbjct: 214 NVPLFTSDGSWLF--EGGATPGALPTANGESDIENLKKVVNQY--HDGKGPYMVAEFYPG 269
Query: 257 WFLSFGGAVPYRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNFDRTSGGPFIS------- 309
W + P +A ++ Q +F N+YM HGGTNF TSG +
Sbjct: 270 WLSHWAEPFPQVGASGIARQTEKYLQNDVSF-NFYMVHGGTNFGFTSGANYDKKRDIQPD 328
Query: 310 -TSYDYDAPLDEYGLIRQPKWGHLKD-LHKAIKLCEAALVATDPTYPSLGPNLE 361
TSYDYDAP+ E G + PK+ +++ + K +K T P P+ P +E
Sbjct: 329 LTSYDYDAPISEAGWV-TPKYDSIRNVIRKYVKY-------TVPEAPAPNPVIE 374
Score = 49.7 bits (117), Expect = 0.007, Method: Compositional matrix adjust.
Identities = 54/229 (23%), Positives = 90/229 (39%), Gaps = 50/229 (21%)
Query: 502 LHVQSLGHALHAFINGKLVGSGYGSSSNAKVTVDFPIALAPGKNTFDLLSLTVGLQNYGA 561
L + L +++G+ VG ++ + ++ P T +L +G NYG+
Sbjct: 428 LEIPGLRDYAVVYVDGEQVGVLNRNTKTYSMEIEVPF-----NATLQILVENMGRINYGS 482
Query: 562 FYEKTGAGITGPVQLKGSGNGTNIDLSSQQWTYQTGLKGEELNFPSGSSTQWDSKSTLP- 620
GI PV++ G +++ + YQ + P + + D+ + +P
Sbjct: 483 EIVHNTKGIISPVKIAGK------EITGEWDMYQLPMS----EMPDLAKLKADAHANVPA 532
Query: 621 ---KLQPL-VWYKTTFDAPAGSEPVAIDFTGMGKGEAWVNGQSIGRYWPTYVSQNGGCTD 676
KL+ V Y+ TF + ID GKG +VNG +IGRYW
Sbjct: 533 EAAKLKGCPVLYEGTFTLDNVGD-TFIDMENWGKGIIFVNGVNIGRYWKV---------- 581
Query: 677 SCNYRGAYSSNKCLKNCGKPSQSLYHVPRSWLKSSGNTLVLFEEIGGDP 725
P Q+LY +P WLK N +V+FE++ P
Sbjct: 582 ------------------GPQQTLY-IPGVWLKKGTNKIVIFEQLNEVP 611
>gi|312903555|ref|ZP_07762735.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX0635]
gi|422689128|ref|ZP_16747240.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX0630]
gi|422731840|ref|ZP_16788189.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX0645]
gi|310633431|gb|EFQ16714.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX0635]
gi|315162138|gb|EFU06155.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX0645]
gi|315577890|gb|EFU90081.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX0630]
Length = 604
Score = 165 bits (418), Expect = 8e-38, Method: Compositional matrix adjust.
Identities = 114/324 (35%), Positives = 163/324 (50%), Gaps = 27/324 (8%)
Query: 35 VIGGKRRVLISGSIHYPRSTPEMWPDLIQKSKDGGLDVIETYVFWNLHEPVRNQYNFEGR 94
++ G+ ++SG+IHY R P W + K G + +ETYV W+LHEP + ++FEG
Sbjct: 21 LLNGQPFKILSGAIHYFRVDPSDWYHSLYNLKALGFNTVETYVPWDLHEPQKGTFHFEGI 80
Query: 95 YDLVKFVKLVAEAGLYAHLRIGPYVCAEWNFGGFPLWLHFIPGIQFRTDNEPFKAEMQRF 154
DL +F+KL E GLYA +R PY+CAEW FGGFP WL PG + R++N + + +
Sbjct: 81 LDLERFLKLAQELGLYAIVRPSPYICAEWEFGGFPAWLLNEPG-RMRSNNPTYLKHVAEY 139
Query: 155 TAKIVDMMKQEKLYASQGGPIILSQIENEYGNI--DSAYGAAGKSYIKWAAGMALSLDTG 212
+++ + +L + GG I++ QIENEYG+ + AY A + + A +
Sbjct: 140 YDVLMEKIVPHQL--ANGGNILMIQIENEYGSFGEEKAYLRAIRDLMIARGVTAPFFTSD 197
Query: 213 VPW--VMCQQSDAPDPIINTCN---------GFYCDQFTPNSNNKPKMWTENWSGWFLSF 261
PW + S D I+ T N G F + P M E W GWF +
Sbjct: 198 GPWRATLRAGSMIEDDILVTGNFGSKAKENFGMMQAFFEEHGKKWPLMCMEFWDGWFNRW 257
Query: 262 GGAVPYRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNFDRTSGG--------PFISTSYD 313
+ R ++LA +V G N YM+HGGTNF +G P I TSYD
Sbjct: 258 KEPIIKRDPQELAESVREALALGSI--NLYMFHGGTNFGFMNGCSARGTIDLPQI-TSYD 314
Query: 314 YDAPLDEYGLIRQPKWGHLKDLHK 337
YDAPLDE G + + K LH+
Sbjct: 315 YDAPLDEQGNPTEKYFALQKMLHE 338
>gi|257090118|ref|ZP_05584479.1| beta-galactosidase [Enterococcus faecalis CH188]
gi|256998930|gb|EEU85450.1| beta-galactosidase [Enterococcus faecalis CH188]
Length = 594
Score = 165 bits (418), Expect = 8e-38, Method: Compositional matrix adjust.
Identities = 114/324 (35%), Positives = 163/324 (50%), Gaps = 27/324 (8%)
Query: 35 VIGGKRRVLISGSIHYPRSTPEMWPDLIQKSKDGGLDVIETYVFWNLHEPVRNQYNFEGR 94
++ G+ ++SG+IHY R P W + K G + +ETYV W+LHEP + ++FEG
Sbjct: 11 LLNGQPFKILSGAIHYFRVDPSDWYHSLYNLKALGFNTVETYVPWDLHEPQKGTFHFEGI 70
Query: 95 YDLVKFVKLVAEAGLYAHLRIGPYVCAEWNFGGFPLWLHFIPGIQFRTDNEPFKAEMQRF 154
DL +F+KL E GLYA +R PY+CAEW FGGFP WL PG + R++N + + +
Sbjct: 71 LDLERFLKLAQELGLYAIVRPSPYICAEWEFGGFPAWLLNEPG-RMRSNNPTYLKHVAEY 129
Query: 155 TAKIVDMMKQEKLYASQGGPIILSQIENEYGNI--DSAYGAAGKSYIKWAAGMALSLDTG 212
+++ + +L + GG I++ QIENEYG+ + AY A + + A +
Sbjct: 130 YDVLMEKIVPHQL--ANGGNILMIQIENEYGSFGEEKAYLRAIRDLMIARGVTAPFFTSD 187
Query: 213 VPW--VMCQQSDAPDPIINTCN---------GFYCDQFTPNSNNKPKMWTENWSGWFLSF 261
PW + S D I+ T N G F + P M E W GWF +
Sbjct: 188 GPWRATLRAGSMIEDDILVTGNFGSKAKENFGMMQAFFEEHGKKWPLMCMEFWDGWFNRW 247
Query: 262 GGAVPYRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNFDRTSGG--------PFISTSYD 313
+ R ++LA +V G N YM+HGGTNF +G P I TSYD
Sbjct: 248 KEPIIKRDPQELAESVREALALGSI--NLYMFHGGTNFGFMNGCSARGTIDLPQI-TSYD 304
Query: 314 YDAPLDEYGLIRQPKWGHLKDLHK 337
YDAPLDE G + + K LH+
Sbjct: 305 YDAPLDEQGNPTEKYFALQKMLHE 328
>gi|422735885|ref|ZP_16792151.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX1341]
gi|315167420|gb|EFU11437.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX1341]
Length = 604
Score = 165 bits (418), Expect = 8e-38, Method: Compositional matrix adjust.
Identities = 114/324 (35%), Positives = 162/324 (50%), Gaps = 27/324 (8%)
Query: 35 VIGGKRRVLISGSIHYPRSTPEMWPDLIQKSKDGGLDVIETYVFWNLHEPVRNQYNFEGR 94
++ G+ ++SG+IHY R P W + K G + +ETYV WNLHEP + ++FEG
Sbjct: 21 LLNGQPFKILSGAIHYFRVDPSDWYHSLYNLKALGFNTVETYVPWNLHEPQKGTFHFEGI 80
Query: 95 YDLVKFVKLVAEAGLYAHLRIGPYVCAEWNFGGFPLWLHFIPGIQFRTDNEPFKAEMQRF 154
DL +F+KL E GLYA +R PY+CAEW FGGFP WL PG + R++N + + +
Sbjct: 81 LDLERFLKLAQELGLYAIVRPSPYICAEWEFGGFPAWLLNEPG-RMRSNNPTYLKHVAEY 139
Query: 155 TAKIVDMMKQEKLYASQGGPIILSQIENEYGNI--DSAYGAAGKSYIKWAAGMALSLDTG 212
+++ + +L + GG I++ QIENEYG+ + AY A + + A +
Sbjct: 140 YDVLMEKIVPHQL--ANGGNILMIQIENEYGSFGEEKAYLRAIRDLMIARGVTAPFFTSD 197
Query: 213 VPW--VMCQQSDAPDPIINTCN---------GFYCDQFTPNSNNKPKMWTENWSGWFLSF 261
PW + S D I+ T N G F + P M E W GWF +
Sbjct: 198 GPWRATLRAGSMIEDDILVTGNFGSKAKENFGMMQAFFEEHGKKWPLMCMEFWDGWFNRW 257
Query: 262 GGAVPYRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNFDRTSGG--------PFISTSYD 313
+ R ++LA +V G N YM+HGG NF +G P I TSYD
Sbjct: 258 KEPIIKRDPQELAESVREALALGSI--NLYMFHGGINFGFMNGCSARGTIDLPQI-TSYD 314
Query: 314 YDAPLDEYGLIRQPKWGHLKDLHK 337
YDAPLDE G + + K LH+
Sbjct: 315 YDAPLDEQGNPTEKYFALQKMLHE 338
>gi|433461907|ref|ZP_20419504.1| beta-galactosidase [Halobacillus sp. BAB-2008]
gi|432189486|gb|ELK46587.1| beta-galactosidase [Halobacillus sp. BAB-2008]
Length = 579
Score = 165 bits (418), Expect = 9e-38, Method: Compositional matrix adjust.
Identities = 183/704 (25%), Positives = 278/704 (39%), Gaps = 160/704 (22%)
Query: 27 VTYDHRAVVIGGKRRVLISGSIHYPRSTPEMWPDLIQKSKDGGLDVIETYVFWNLHEPVR 86
+T ++ ++ K ++SG+IHY R+ PE W D ++K K GL+ +ETYV WNLHEP R
Sbjct: 2 LTAENGQFLLNDKPFQILSGAIHYFRTVPEHWEDRLEKLKALGLNTVETYVPWNLHEPRR 61
Query: 87 NQYNFEGRYDLVKFVKLVAEAGLYAHLRIGPYVCAEWNFGGFPLWLHFIPGIQFRTDNEP 146
++ F G D+ F++ A+ GLY +R PY+CAEW GG P WL + R+ +
Sbjct: 62 GEFEFSGLADIEGFIQTAADLGLYVIVRPAPYICAEWEMGGLPSWLLKDKDVVMRSSDPV 121
Query: 147 FKAEMQRFTAKIVDMMKQEKLYASQGGPIILSQIENEYGNIDSAYGAAGK--SYIK---W 201
+ + ++ + +++ LY + GGPII QIENEYG AYG K +++K
Sbjct: 122 YLSYVESYYKELLPKFVPH-LYQN-GGPIIAMQIENEYG----AYGNDQKYLTFLKKQYE 175
Query: 202 AAGMALSLDTGVPWVMCQQSDAPDPIINTCNGFYCDQ----FTPNSNNKPKMWTENWSGW 257
G+ L T +Q PD G +Q PKM E W GW
Sbjct: 176 QHGLDTFLFTSDGPDFIEQGSLPDVTTTLNFGSKVEQAFERLDAFKTGSPKMVAEFWIGW 235
Query: 258 FLSFGGAVPYRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNFDRTSGGPFISTSYDYDAP 317
F + G R D A +R + NF GG T++ +
Sbjct: 236 FDYWTGEHHTRDAGDAAAVFRELMERKAS----------VNFYMFHGG----TNFGFMNG 281
Query: 318 LDEYGLIRQPKWGHLKDLHKAIKLCEAALVATDPTYPSLGPNLEATVYKTGSGLCSAFLA 377
+ Y D YP++ T Y S L +
Sbjct: 282 ANHY----------------------------DVYYPTI------TSYDYDSLLTES--- 304
Query: 378 NIGTNSDVTVKFNGNSYLLPAWSVSILPDCKNVVFNTAKINSVTLVPSFSRQSLQVAADS 437
+T K+N SIL D + V A SV ++ R ++
Sbjct: 305 -----GAITEKYNAVK--------SILADYQTV---PADYESVLSSEAYGRVEVEEGVSL 348
Query: 438 SDAIGSGWSYINEPVGISKDDAFTKPGLLEQINTTADQS-DYLWYSLSTNIKADEPLLED 496
D + I K A KP +E+I DQ+ Y Y + N
Sbjct: 349 FDTLAE----------IGKRTAHIKPLSMEEI----DQAYGYTLYKTTVN---------R 385
Query: 497 GSKTVLHVQSLGHALHAFINGKLVGSGYGSSSNAKVTVDFPIALAPGKNTFDLLSLTVGL 556
+ + ++++ ++NG + Y + K T+ FP + NT ++L +G
Sbjct: 386 SGELSMGIEAVHDRAFIYVNGTYQKTIYINDEQKKTTLVFPEKI----NTLEILVENMGR 441
Query: 557 QNYGAFYEKTGAGITGPVQLKGSGNGTNIDLSSQQWTYQTGLKGEELN-FPSGSSTQWDS 615
NYG E G+T + L +Q+ ++ + EL+ P + Q DS
Sbjct: 442 ANYGEHLEDR-KGLTKNIWL------------GEQYFFEWEMYAVELDILPESYAKQEDS 488
Query: 616 KSTLPKLQPLVWYKTTFDAPAGSEPVAIDFTGMGKGEAWVNGQSIGRYWPTYVSQNGGCT 675
+ PK +++ TFDAP G ID G KG +VNG ++GRYW
Sbjct: 489 R--YPK-----FFRGTFDAP-GRHDTYIDSEGFTKGNLFVNGFNLGRYW----------- 529
Query: 676 DSCNYRGAYSSNKCLKNCGKPSQSLYHVPRSWLKSSGNTLVLFE 719
N P + +Y VP LK GN LV+ E
Sbjct: 530 ----------------NTAGPQKRIY-VPGPLLKEQGNELVILE 556
>gi|306832839|ref|ZP_07465973.1| beta-galactosidase [Streptococcus bovis ATCC 700338]
gi|304424978|gb|EFM28110.1| beta-galactosidase [Streptococcus bovis ATCC 700338]
Length = 595
Score = 165 bits (418), Expect = 9e-38, Method: Compositional matrix adjust.
Identities = 120/355 (33%), Positives = 171/355 (48%), Gaps = 50/355 (14%)
Query: 32 RAVVIGGKRRVLISGSIHYPRSTPEMWPDLIQKSKDGGLDVIETYVFWNLHEPVRNQYNF 91
+ + GK ++SGSIHY R P+ W + K G + +ETYV WNLHEP +++F
Sbjct: 8 ESFFLDGKPFKILSGSIHYFRIHPDDWYQSLYNLKALGFNTVETYVPWNLHEPREGEFDF 67
Query: 92 EGRYDLVKFVKLVAEAGLYAHLRIGPYVCAEWNFGGFPLWLHFIPGIQFRTDNEPFKAEM 151
G DL +F+ + E GLYA +R PY+CAEW FGG P WL G++ R+ ++ F +
Sbjct: 68 TGILDLERFLTIAQELGLYAIVRPSPYICAEWEFGGLPAWL-LEKGVRVRSQDKGFLQVV 126
Query: 152 QRFTAKIVDMMKQEKLYASQGGPIILSQIENEYGNIDSAYGAAGKSYIKWAAGMALSLDT 211
+R+ ++ + + +L QGG I++ Q+ENEYG +YG K Y++ M L L
Sbjct: 127 KRYYEVLIPRLIKHQL--DQGGNILMFQVENEYG----SYG-EDKVYLRELKQMMLELGL 179
Query: 212 GVPWVMCQQSDAP------------DPIINTCN-------GFYCDQ--FTPNSNNKPKMW 250
P+ SD P D ++ T N F + F P M
Sbjct: 180 EEPFFT---SDGPWHTALRAGSLIEDDVLVTGNFGSKAKENFASMEMFFQQYGKKWPLMC 236
Query: 251 TENWSGWFLSFGGAVPYRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNFDRTSGG----- 305
E W GWF +G V R E+LA AV + G N YM+HGGTNF +G
Sbjct: 237 MEFWDGWFNRWGEPVIKRDPEELADAVMEAIEIGSI--NLYMFHGGTNFGFMNGCSARKQ 294
Query: 306 ---PFISTSYDYDAPLDEYG-------LIRQPKWGHLKDLHKAIKLCEAALVATD 350
P + TSYDYDA LDE G +++ +LH A L + + D
Sbjct: 295 TDLPQV-TSYDYDAILDEAGNPTKKFYILQHRLKNKYPELHYATPLVKPTMAIKD 348
Score = 50.8 bits (120), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 53/213 (24%), Positives = 85/213 (39%), Gaps = 53/213 (24%)
Query: 512 HAFINGKLVGSGYGSSSNAKVTVDFPIALAPGKNTFDLLSLTVGLQNYGAFYEKTGAGIT 571
F+NG + + Y + V+F ++ D+L +G NYG +T
Sbjct: 412 QVFLNGNHIVTQYQEEIGDDIQVNF----TSEESQLDILVENMGRVNYGH-------KLT 460
Query: 572 GPVQLKGSGNGTNIDLS-SQQWTYQTGLKGEELNFPSGSSTQWDSKSTLPKLQPL-VWYK 629
P Q KG G G +DL QW +P ++ + K + P + + +Y+
Sbjct: 461 APSQHKGIGRGVMLDLHFVNQWE----------TYPLSMNSIKNLKYSSPWREGVPSFYE 510
Query: 630 TTFDAPAGSEPVAIDFTGMGKGEAWVNGQSIGRYWPTYVSQNGGCTDSCNYRGAYSSNKC 689
F E +D +G GKG A++NG ++GR+W
Sbjct: 511 FKFHC-LNPEDTYMDMSGFGKGVAFINGYNLGRFW------------------------- 544
Query: 690 LKNCGKPSQSLYHVPRSWLKSSGNTLVLFEEIG 722
N G P+ SLY +PR + NT+ +FE G
Sbjct: 545 --NIG-PTLSLY-IPRGMMVCGENTITIFETEG 573
>gi|354581347|ref|ZP_09000251.1| Beta-galactosidase [Paenibacillus lactis 154]
gi|353201675|gb|EHB67128.1| Beta-galactosidase [Paenibacillus lactis 154]
Length = 587
Score = 165 bits (418), Expect = 9e-38, Method: Compositional matrix adjust.
Identities = 106/305 (34%), Positives = 150/305 (49%), Gaps = 16/305 (5%)
Query: 31 HRAVVIGGKRRVLISGSIHYPRSTPEMWPDLIQKSKDGGLDVIETYVFWNLHEPVRNQYN 90
++ V+G + ++SG+IHY R PE W D + K K GL+ +ETY+ WN HEP ++N
Sbjct: 8 NQQFVLGEEAIQILSGAIHYFRVVPEYWEDRLLKLKACGLNTVETYIPWNWHEPDEGRFN 67
Query: 91 FEGRYDLVKFVKLVAEAGLYAHLRIGPYVCAEWNFGGFPLWLHFIPGIQFRTDNEPFKAE 150
F G D+ F+ L + GL+ +R PY+CAEW FGG P WL P +Q R + F +
Sbjct: 68 FSGMADIEAFITLAGKLGLHVIVRPSPYICAEWEFGGLPAWLLQDPHMQLRCLDPKFLKK 127
Query: 151 MQRFTAKIVDMMKQEKLYASQGGPIILSQIENEYGNI--DSAY-GAAGKSYIKWAAGMAL 207
+ + +++ + L ++ GGPII QIENEYG+ D+AY ++ I + L
Sbjct: 128 VDAYYDELIPRLV--PLLSTNGGPIIAVQIENEYGSYGNDTAYLQYLQEALIARGVDVLL 185
Query: 208 SLDTGVPWVMCQQSDAPDPIINTCNGFYCDQ----FTPNSNNKPKMWTENWSGWFLSFGG 263
G M Q P G + + P M E W+GWF +
Sbjct: 186 FTSDGPTDGMLQGGTVPGVTATVNFGSRPSEAFAKLREYRSEDPLMCMEYWNGWFDHWMK 245
Query: 264 AVPYRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNFDRTSGGPFIS------TSYDYDAP 317
R ED A A G + N+YM+HGGTNF +G + TSYDYDAP
Sbjct: 246 PHHTRDSEDAASVFAEMLALGASV-NFYMFHGGTNFGFYNGANYHDKYEPTITSYDYDAP 304
Query: 318 LDEYG 322
L E G
Sbjct: 305 LSECG 309
>gi|313241117|emb|CBY33414.1| unnamed protein product [Oikopleura dioica]
Length = 608
Score = 165 bits (418), Expect = 9e-38, Method: Compositional matrix adjust.
Identities = 110/331 (33%), Positives = 164/331 (49%), Gaps = 25/331 (7%)
Query: 28 TYDHRAVVIGGKRRVLISGSIHYPRSTPEMWPDLIQKSKDGGLDVIETYVFWNLHEPVRN 87
TY + + + ++SGS+HY R E W D ++K K GL+ ++TY+ WNLHEP
Sbjct: 4 TYLFKIRRLFKSKTRILSGSLHYFRVPKEYWRDRLEKLKGAGLNTVQTYIGWNLHEPREG 63
Query: 88 QYNFEGRYDLVKFVKLVAEAGLYAHLRIGPYVCAEWNFGGFPLWLHFIPGIQFR-TDNEP 146
+ FE D+ +F+K+ + GLY +R GPY+CAEW +GGFP WL + R T +E
Sbjct: 64 DFIFEDELDVSEFLKIAKDVGLYVIMRPGPYICAEWEWGGFPAWLLTKENMIVRQTKSEA 123
Query: 147 FKAEMQRFTAKIVDMMKQEKLYASQGGPIILSQIENEYG--NIDSAYGAAGKSYIKWAAG 204
+ A +Q + + ++ + S+GGPII Q+ENEY N DS Y K+ +
Sbjct: 124 YLAAVQNWFTVLFSQLRDHQW--SRGGPIISIQVENEYASYNKDSEYLPWVKNLLTDVGK 181
Query: 205 -MALSLDTGVPWVMCQQSDAPDPII-----NTCNGF-YCDQFTPNSNNKPKMWTENWSGW 257
L + + + PD + + N F D+ P N+PKM TE W+GW
Sbjct: 182 CFLLKIINETNFFLKGAHLLPDTFLTANFQSVGNAFEVLDKLQP---NRPKMVTEFWAGW 238
Query: 258 FLSFGGAVPYRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNFDRTSGGPFIS-------- 309
F +G R G+ N YM+HGGT+F +G ++S
Sbjct: 239 FDHWGQQGHSTLSPTTFNKTMREILNAGSSVNQYMFHGGTSFGWMAGSNWLSKKQRGTSD 298
Query: 310 -TSYDYDAPLDEYGLIRQPKWGHLKDLHKAI 339
TSYDYDAPL E G + + KW +++ K
Sbjct: 299 TTSYDYDAPLSESGDLTE-KWNVTREIIKEF 328
>gi|392950288|ref|ZP_10315845.1| Beta-galactosidase 3 [Lactobacillus pentosus KCA1]
gi|392434570|gb|EIW12537.1| Beta-galactosidase 3 [Lactobacillus pentosus KCA1]
Length = 588
Score = 165 bits (417), Expect = 1e-37, Method: Compositional matrix adjust.
Identities = 108/313 (34%), Positives = 156/313 (49%), Gaps = 27/313 (8%)
Query: 32 RAVVIGGKRRVLISGSIHYPRSTPEMWPDLIQKSKDGGLDVIETYVFWNLHEPVRNQYNF 91
+ ++ G+ + SG++HY R P W D ++K K GL+ +ETY+ WN+HEP Q+ F
Sbjct: 10 KEFLLNGQPFKIYSGAVHYFRIAPSEWRDTLEKLKAAGLNTVETYIPWNVHEPQEGQFVF 69
Query: 92 EGRYDLVKFVKLVAEAGLYAHLRIGPYVCAEWNFGGFPLWLHFIPGIQFRTDNEPFKAEM 151
E RYD+ KFVKL GLY LR PY+CAEW FGG P WL P + R++ F ++
Sbjct: 70 EDRYDIGKFVKLAQSIGLYVILRPSPYICAEWEFGGLPAWLLRYPDMVVRSNTPRFMEKV 129
Query: 152 QRFTAKIVDMMKQEKLYASQGGPIILSQIENEYGNI--DSAYGAAGKSYIKWAAGMALSL 209
+ + ++ L + GGP+++ Q+ENEYG+ D AY KS ++ G+ + L
Sbjct: 130 ANYYEALFKVLV--PLQITHGGPVLMMQVENEYGSFGNDKAYLRHVKSLME-TNGVDVPL 186
Query: 210 DTGV-PWVMCQQSDA--PDPIINTCNG--------FYCDQFT-PNSNNKPKMWTENWSGW 257
T W ++ + D + T N QF + N P M E W GW
Sbjct: 187 FTADGSWQQALKAGSLIEDDVFVTANFGSKSRENLAELRQFMLMHHKNWPLMCMEFWDGW 246
Query: 258 FLSFGGAVPYRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNFDRTSG--------GPFIS 309
F + + R + +A + +F N YM+ GGTNF +G P I
Sbjct: 247 FNRWQEEIVTRSADSFQTDLAELVKEQASF-NLYMFRGGTNFGFFNGCSSRQNVDYPQI- 304
Query: 310 TSYDYDAPLDEYG 322
TSYDYDA L E G
Sbjct: 305 TSYDYDAVLHEDG 317
>gi|313238883|emb|CBY13879.1| unnamed protein product [Oikopleura dioica]
Length = 601
Score = 165 bits (417), Expect = 1e-37, Method: Compositional matrix adjust.
Identities = 110/331 (33%), Positives = 164/331 (49%), Gaps = 25/331 (7%)
Query: 28 TYDHRAVVIGGKRRVLISGSIHYPRSTPEMWPDLIQKSKDGGLDVIETYVFWNLHEPVRN 87
TY + + + ++SGS+HY R E W D ++K K GL+ ++TY+ WNLHEP
Sbjct: 4 TYLFKIRRLFKSKTRILSGSLHYFRVPKEYWRDRLEKLKGAGLNTVQTYIGWNLHEPREG 63
Query: 88 QYNFEGRYDLVKFVKLVAEAGLYAHLRIGPYVCAEWNFGGFPLWLHFIPGIQFR-TDNEP 146
+ FE D+ +F+K+ + GLY +R GPY+CAEW +GGFP WL + R T +E
Sbjct: 64 DFIFEDELDVSEFLKIAKDVGLYVIMRPGPYICAEWEWGGFPAWLLTKENMIVRQTKSEA 123
Query: 147 FKAEMQRFTAKIVDMMKQEKLYASQGGPIILSQIENEYG--NIDSAYGAAGKSYIKWAAG 204
+ A +Q + + ++ + S+GGPII Q+ENEY N DS Y K+ +
Sbjct: 124 YLAAVQNWFTVLFSQLRDHQW--SRGGPIISIQVENEYASYNKDSEYLPWVKNLLTDVGK 181
Query: 205 -MALSLDTGVPWVMCQQSDAPDPII-----NTCNGF-YCDQFTPNSNNKPKMWTENWSGW 257
L + + + PD + + N F D+ P N+PKM TE W+GW
Sbjct: 182 CFLLKIINETNFFLKGAHLLPDTFLTANFQSVGNAFEVLDKLQP---NRPKMVTEFWAGW 238
Query: 258 FLSFGGAVPYRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNFDRTSGGPFIS-------- 309
F +G R G+ N YM+HGGT+F +G ++S
Sbjct: 239 FDHWGQQGHSLLSPTTFNKTMREILNAGSSVNQYMFHGGTSFGWMAGSNWLSKKQRGTSD 298
Query: 310 -TSYDYDAPLDEYGLIRQPKWGHLKDLHKAI 339
TSYDYDAPL E G + + KW +++ K
Sbjct: 299 TTSYDYDAPLSESGDLTE-KWNVTREIIKEF 328
>gi|332879232|ref|ZP_08446929.1| putative beta-galactosidase [Capnocytophaga sp. oral taxon 329 str.
F0087]
gi|357048073|ref|ZP_09109651.1| putative beta-galactosidase [Paraprevotella clara YIT 11840]
gi|332682652|gb|EGJ55552.1| putative beta-galactosidase [Capnocytophaga sp. oral taxon 329 str.
F0087]
gi|355529138|gb|EHG98592.1| putative beta-galactosidase [Paraprevotella clara YIT 11840]
Length = 786
Score = 165 bits (417), Expect = 1e-37, Method: Compositional matrix adjust.
Identities = 108/337 (32%), Positives = 169/337 (50%), Gaps = 37/337 (10%)
Query: 20 TTSFGANVTYDHRAVVIGGKRRVLISGSIHYPRSTPEMWPDLIQKSKDGGLDVIETYVFW 79
T +FG ++ ++ GK ++ + +HYPR W I+ K G++ + YVFW
Sbjct: 33 TETFGVG----NKTFLLNGKPFIIKAAEVHYPRIPRPYWEQRIKMCKALGMNTLCLYVFW 88
Query: 80 NLHEPVRNQYNFEGRYDLVKFVKLVAEAGLYAHLRIGPYVCAEWNFGGFPLWLHFIPGIQ 139
N+HE +++F G D+ +F++L E GLY +R GPYVCAEW GG P WL I+
Sbjct: 89 NIHEQEEGKFDFTGNNDVAEFIRLAQENGLYVIVRPGPYVCAEWEMGGLPWWLLKKKDIR 148
Query: 140 FRTDNEPFKAEMQRFTAKIVDMMKQEKLYASQGGPIILSQIENEYGNI--DSAYGAAGKS 197
R + F + F K+ + + L +GGPII+ Q+ENEYG+ D Y +A +
Sbjct: 149 LREQDPYFMERYRIFAQKLGEQIGD--LTIEKGGPIIMVQVENEYGSYGEDKPYVSAIRD 206
Query: 198 YIK-------------WAAGMALSLDTGVPWVMCQQSDAPDPIINTCNGF-YCDQFTPNS 243
I+ W++ + + W M + A N N F + P S
Sbjct: 207 IIRDSGFDKVTLFQCDWSSNFTKNGLDDLVWTMNFGTGA-----NIENEFKKLGELRPES 261
Query: 244 NNKPKMWTENWSGWFLSFGGAVPYRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNFDRTS 303
P+M +E WSGWF +GG R +++ + +G +F + YM HGGT++ +
Sbjct: 262 ---PQMCSEFWSGWFDKWGGRHETRGSKEMVGGLKEMLDKGISF-SLYMTHGGTSWGHWA 317
Query: 304 GG--PFIS---TSYDYDAPLDEYGLIRQPKWGHLKDL 335
G P S TSYDYDAP++E G + PK+ L+++
Sbjct: 318 GANSPGFSPDVTSYDYDAPINEAGQV-TPKYMELREM 353
Score = 55.8 bits (133), Expect = 9e-05, Method: Compositional matrix adjust.
Identities = 68/280 (24%), Positives = 111/280 (39%), Gaps = 59/280 (21%)
Query: 498 SKTVLHVQSLGHALHAFINGKLVGSGYGSSSNAKVTVDFPIALAPGKNTFDLLSLTVGLQ 557
+++VL + FINGKL+GS N + T+ P A+ G + D+L +G
Sbjct: 422 TQSVLTITDAHDFAQVFINGKLIGSI--DRRNHEKTMLLP-AMKEG-DQLDILVEAMGRI 477
Query: 558 NYGAFYEKTGAGITGPVQLKGSGN-GTNIDLSSQQWTYQTGLKGEELNFPSGSSTQWDSK 616
N+G K GIT V+L + N G+ + ++ + W T Q D K
Sbjct: 478 NFGRAI-KDFKGITEKVELSYTMNTGSQVTVNLKNWQIYT--------LSDSYQVQKDMK 528
Query: 617 STLPKLQPLV-WYKTTFDAPAGSEPVAIDFTGMGKGEAWVNGQSIGRYWPTYVSQNGGCT 675
K Q + Y+ TF+ + ++ GKG+ +VNG +IGR+W
Sbjct: 529 YVPLKDQKVPGCYRATFNLKKTGD-TFLNLETWGKGQVYVNGHAIGRFWKI--------- 578
Query: 676 DSCNYRGAYSSNKCLKNCGKPSQSLYHVPRSWLKSSGNTLVLFEEIGGDPTKISFVTKQL 735
P Q+LY +P WLK N +++ + +G T + ++K +
Sbjct: 579 -------------------GPQQTLY-MPGCWLKKGENEIIVQDIVGPQETVVEGLSKPI 618
Query: 736 GSSLCSHVTDSHPLPVDMWGSDSKIQRKPGPVLSLECPNP 775
L ++H RK G L+L P
Sbjct: 619 IDKLNVDAPNTH--------------RKEGQTLNLAGETP 644
>gi|291535092|emb|CBL08204.1| Beta-galactosidase [Roseburia intestinalis M50/1]
gi|291539606|emb|CBL12717.1| Beta-galactosidase [Roseburia intestinalis XB6B4]
Length = 581
Score = 164 bits (416), Expect = 1e-37, Method: Compositional matrix adjust.
Identities = 112/324 (34%), Positives = 158/324 (48%), Gaps = 39/324 (12%)
Query: 36 IGGKRRVLISGSIHYPRSTPEMWPDLIQKSKDGGLDVIETYVFWNLHEPVRNQYNFEGRY 95
+ GK +ISG+IHY R PE W D ++K K G + +ETY+ WN+HEP + +++FEG
Sbjct: 12 LDGKPFQIISGAIHYFRIVPEYWQDRLEKLKAMGCNTVETYIPWNMHEPKKGEFHFEGML 71
Query: 96 DLVKFVKLVAEAGLYAHLRIGPYVCAEWNFGGFPLWLHFIPGIQFRTDNEPFKAEMQRFT 155
D+ +FVK E GLY LR PY+CAEW FGG P WL G++ R PF +Q +
Sbjct: 72 DIERFVKTAQELGLYVILRPSPYICAEWEFGGLPAWLLAEDGMKLRVSYPPFLKHVQDYY 131
Query: 156 AKIVDMMKQEKLYASQGGPIILSQIENEYGNIDSAYGAAGKSYIKWAAGMALSLDTGVPW 215
++ + ++ + GGP+IL Q+ENEYG Y A + Y+ M + G
Sbjct: 132 DVLLKKIVPYQI--NYGGPVILMQVENEYG-----YYANDREYL---LAMRDKMQKGGVV 181
Query: 216 VMCQQSDAPDPIINTCNGFYCDQFTPNSN-----------------NKPKMWTENWSGWF 258
V SD P NG + + P N P M TE W GWF
Sbjct: 182 VPLVTSDG--PFEENLNGGHLEGALPTGNFGSKTEERFEVLKKYTDGGPLMCTEFWVGWF 239
Query: 259 LSFG-GAVPYRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNFDRTSGGPFIS------TS 311
+G G +E+ + + + G N YM+ GGTNF +G + TS
Sbjct: 240 DHWGNGGHMTGNLEESVKDLDKMLELGHV--NIYMFEGGTNFGFMNGSNYYDELTPDVTS 297
Query: 312 YDYDAPLDEYGLIRQPKWGHLKDL 335
YDYDA L E G I + K+ +D+
Sbjct: 298 YDYDALLTEDGQITE-KYRRYRDV 320
>gi|257413247|ref|ZP_04742461.2| beta-galactosidase [Roseburia intestinalis L1-82]
gi|257204151|gb|EEV02436.1| beta-galactosidase [Roseburia intestinalis L1-82]
Length = 588
Score = 164 bits (416), Expect = 1e-37, Method: Compositional matrix adjust.
Identities = 112/324 (34%), Positives = 158/324 (48%), Gaps = 39/324 (12%)
Query: 36 IGGKRRVLISGSIHYPRSTPEMWPDLIQKSKDGGLDVIETYVFWNLHEPVRNQYNFEGRY 95
+ GK +ISG+IHY R PE W D ++K K G + +ETY+ WN+HEP + +++FEG
Sbjct: 19 LDGKPFQIISGAIHYFRIVPEYWQDRLEKLKAMGCNTVETYIPWNMHEPKKGEFHFEGML 78
Query: 96 DLVKFVKLVAEAGLYAHLRIGPYVCAEWNFGGFPLWLHFIPGIQFRTDNEPFKAEMQRFT 155
D+ +FVK E GLY LR PY+CAEW FGG P WL G++ R PF +Q +
Sbjct: 79 DIERFVKTAQELGLYVILRPSPYICAEWEFGGLPAWLLAEDGMKLRVSYPPFLKHVQDYY 138
Query: 156 AKIVDMMKQEKLYASQGGPIILSQIENEYGNIDSAYGAAGKSYIKWAAGMALSLDTGVPW 215
++ + ++ + GGP+IL Q+ENEYG Y A + Y+ M + G
Sbjct: 139 DVLLKKIVPYQI--NYGGPVILMQVENEYG-----YYANDREYL---LAMRDKMQKGGVV 188
Query: 216 VMCQQSDAPDPIINTCNGFYCDQFTPNSN-----------------NKPKMWTENWSGWF 258
V SD P NG + + P N P M TE W GWF
Sbjct: 189 VPLVTSDG--PFEENLNGGHLEGALPTGNFGSKTEERFEVLKKYTDGGPLMCTEFWVGWF 246
Query: 259 LSFG-GAVPYRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNFDRTSGGPFIS------TS 311
+G G +E+ + + + G N YM+ GGTNF +G + TS
Sbjct: 247 DHWGNGGHMTGNLEESVKDLDKMLELGHV--NIYMFEGGTNFGFMNGSNYYDELTPDVTS 304
Query: 312 YDYDAPLDEYGLIRQPKWGHLKDL 335
YDYDA L E G I + K+ +D+
Sbjct: 305 YDYDALLTEDGQITE-KYRRYRDV 327
>gi|432894411|ref|XP_004075980.1| PREDICTED: beta-galactosidase-1-like protein 2-like [Oryzias
latipes]
Length = 640
Score = 164 bits (416), Expect = 1e-37, Method: Compositional matrix adjust.
Identities = 112/308 (36%), Positives = 149/308 (48%), Gaps = 44/308 (14%)
Query: 42 VLISGSIHYPRSTPEMWPDLIQKSKDGGLDVIETYVFWNLHEPVRNQYNFEGRYDLVKFV 101
+++ GSIHY R W D + K K GL+ + TYV WNLHEP R ++FEG DL ++
Sbjct: 63 LILGGSIHYFRVPKAYWEDRLLKLKACGLNTLTTYVPWNLHEPERGVFDFEGELDLEAYL 122
Query: 102 KLVAEAGLYAHLRIGPYVCAEWNFGGFPLWLHFIPGIQFRTDNEPFKAEMQRFTAKIVDM 161
L A G++ LR GPY+CAEW+ GG P WL ++ RT F A + + ++
Sbjct: 123 GLAASLGIWVILRPGPYICAEWDLGGLPSWLLRDQNMRLRTTYPGFTAAVDSYFDHLIK- 181
Query: 162 MKQEKLYASQGGPIILSQIENEYGN--IDSAYGAAGKSYIKWAA---------------- 203
K S+GGPII Q+ENEYG+ +D Y +IK A
Sbjct: 182 -KVAPYQYSRGGPIIAVQVENEYGSYAMDEEY----MPFIKEALLSRGITELLVTSDNKD 236
Query: 204 GMALSLDTGVPWVMCQQSDAPDPIINTCNGFYCDQFTPNSNNKPKMWTENWSGWFLSFGG 263
G+ L G + Q P+ I Y ++ P KPKM E WSGWF +GG
Sbjct: 237 GLKLGGVKGALETINFQKLDPEEIK------YLEKIQP---QKPKMVMEYWSGWFDLWGG 287
Query: 264 AVPYRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNF---------DRTSGGPFISTSYDY 314
P E++ V + + N YM+HGGTNF R S P + TSYDY
Sbjct: 288 LHHVFPAEEMMAVVTEILKLDMSI-NLYMFHGGTNFGFMSGAFAVGRPSPAPMV-TSYDY 345
Query: 315 DAPLDEYG 322
DAPL E G
Sbjct: 346 DAPLSEAG 353
Score = 51.2 bits (121), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 47/188 (25%), Positives = 75/188 (39%), Gaps = 40/188 (21%)
Query: 543 GKNTFDLLSLTVGLQNYGAFYEKTGAGITGPVQLKGSGNGTNIDLSSQQWTYQTGLKGEE 602
GK T LL G NYG ++ G+ G +QL N ++ + +K +
Sbjct: 480 GKRTLGLLVENCGRVNYGKTLDEQRKGLVGDIQL-------NANILRDFMIHSLDMKPDF 532
Query: 603 LNFPSGSSTQWDSKSTLPKLQPLVWYKTTFDAPAGSEPVAIDFTGMGKGEAWVNGQSIGR 662
++ SS QW S P +++T + + + G KG +VNG+++GR
Sbjct: 533 VSRLQ-SSAQWKSMREKPSFP--AFFQTKLYLSSSPKDTFLKLPGWSKGVVFVNGKNLGR 589
Query: 663 YWPTYVSQNGGCTDSCNYRGAYSSNKCLKNCGKPSQSLYHVPRSWLKSSGNTLVLFEEIG 722
YW P Q+LY VP +WL N +++FEE+
Sbjct: 590 YWSV----------------------------GPQQTLY-VPGAWLNRWDNEIIVFEELE 620
Query: 723 GDPTKISF 730
D K+ F
Sbjct: 621 TD-GKVQF 627
>gi|261880887|ref|ZP_06007314.1| family 35 glycosyl hydrolase [Prevotella bergensis DSM 17361]
gi|270332394|gb|EFA43180.1| family 35 glycosyl hydrolase [Prevotella bergensis DSM 17361]
Length = 789
Score = 164 bits (416), Expect = 2e-37, Method: Compositional matrix adjust.
Identities = 108/335 (32%), Positives = 163/335 (48%), Gaps = 21/335 (6%)
Query: 20 TTSFGANVTYDHRAVVIGGKRRVLISGSIHYPRSTPEMWPDLIQKSKDGGLDVIETYVFW 79
TT+ + T ++ + V+ + +HYPR W I+ K G++ I YVFW
Sbjct: 25 TTAAPGDFTVGKGTFLLNNRPFVVKAAELHYPRIPRAYWDHRIKMCKALGMNTICLYVFW 84
Query: 80 NLHEPVRNQYNFEGRYDLVKFVKLVAEAGLYAHLRIGPYVCAEWNFGGFPLWLHFIPGIQ 139
N+HE +++F G D+ F +L + G+Y +R GPYVCAEW GG P WL I+
Sbjct: 85 NIHEQREGEFDFSGNSDVAAFCRLTQKNGMYIIVRPGPYVCAEWEMGGLPWWLLKKKDIR 144
Query: 140 FRTDNEPFKAEMQRFTAKIVDMMKQEKLYASQGGPIILSQIENEYGNI--DSAYGAAGKS 197
R + F ++ F K+ + + L GGPII+ Q+ENEYG+ D Y +
Sbjct: 145 LRESDPYFMERVEIFEQKVAEQLA--PLTIQNGGPIIMVQVENEYGSYGEDKKYVGQIRD 202
Query: 198 YIK---WAAGMALSLDTGVPWVMCQQSDAPDPIINTCN---GFYCD-QFTPNSN---NKP 247
++ + G +L W + + + +I T N G D QF + P
Sbjct: 203 VLRKYWYTNGRGPAL-FQCDWASNFEKNGLEDLIWTMNFGTGANIDAQFMRLGELRPDAP 261
Query: 248 KMWTENWSGWFLSFGGAVPYRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNFDRTSGG-- 305
KM +E WSGWF +G RP +D+ + +G +F + YM HGGT+F +G
Sbjct: 262 KMCSEFWSGWFDKWGARHETRPAKDMVAGIDEMLSKGISF-SLYMTHGGTSFGHWAGANS 320
Query: 306 PFIS---TSYDYDAPLDEYGLIRQPKWGHLKDLHK 337
P + TSYDYDAP++EYG + W K + K
Sbjct: 321 PGFAPDVTSYDYDAPINEYGQVTPKFWELRKMMEK 355
>gi|297842039|ref|XP_002888901.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
gi|297334742|gb|EFH65160.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
Length = 686
Score = 164 bits (416), Expect = 2e-37, Method: Compositional matrix adjust.
Identities = 120/351 (34%), Positives = 169/351 (48%), Gaps = 39/351 (11%)
Query: 38 GKRRVLISGSIHYPRSTPEMWPDLIQKSKDGGLDVIETYVFWNLHEPVRNQYNFEGRYDL 97
G +I G +HY R PE W D + ++K GL+ I+ YV WNLHEP + FEG DL
Sbjct: 73 GNHFQIIGGDLHYFRVLPEYWEDRLLRAKALGLNTIQVYVPWNLHEPKPGKMVFEGIGDL 132
Query: 98 VKFVKLVAEAGLYAHLRIGPYVCAEWNFGGFPLWLHFI-PGIQFRTDNEPFKAEMQRFTA 156
V F+KL + LR GPY+C EW+ GGFP WL + P +Q RT + + ++R+
Sbjct: 133 VSFLKLCDKLDFMVMLRAGPYICGEWDLGGFPAWLLSVKPRLQLRTSDPAYLKLVERWWG 192
Query: 157 KIVDMMKQEKLYASQGGPIILSQIENEYGNI--DSAY---------GAAGKSYIKWAA-- 203
V + K L S GGP+I+ QIENEYG+ D AY G G I +
Sbjct: 193 --VLLPKIFPLIYSNGGPVIMVQIENEYGSYGNDKAYLRKLVSMARGHLGDDIIVYTTDG 250
Query: 204 GMALSLDTG-VPW------VMCQQSDAPDPIINTCNGFYCDQFTPNSNNKPKMWTENWSG 256
G +L+ G VP V D P PI F P S+ P + +E ++G
Sbjct: 251 GTKETLEKGTVPVDDVYSAVDFTTGDDPWPIFELQKKFNA----PGSS--PPLSSEFYTG 304
Query: 257 WFLSFGGAVPYRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNFDRTSGGPFIS------- 309
W +G + E A ++ + R G+ YM HGGTNF +G S
Sbjct: 305 WLTHWGEKIAKTDAEFTATSLEKILSRNGS-AVLYMVHGGTNFGFYNGANTGSEESDYKP 363
Query: 310 --TSYDYDAPLDEYGLIRQPKWGHLKDLHKAIKLCEAALVATDPTYPSLGP 358
TSYDYDAP+ E G I PK+ L+ + K + +++ ++ + GP
Sbjct: 364 DLTSYDYDAPIKESGDIDNPKFRALQRVIKKYNVASHSIIPSNKQRKAYGP 414
>gi|426371159|ref|XP_004052521.1| PREDICTED: beta-galactosidase-1-like protein 3 [Gorilla gorilla
gorilla]
Length = 653
Score = 164 bits (415), Expect = 2e-37, Method: Compositional matrix adjust.
Identities = 112/339 (33%), Positives = 167/339 (49%), Gaps = 18/339 (5%)
Query: 36 IGGKRRVLISGSIHYPRSTPEMWPDLIQKSKDGGLDVIETYVFWNLHEPVRNQYNFEGRY 95
+ G + ++ GSIH R E W D + K K G + + TYV WNLHEP R +++F G
Sbjct: 82 LEGHKFLIFGGSIHCFRVPREYWRDRLLKLKACGFNTVTTYVPWNLHEPERGKFDFSGNL 141
Query: 96 DLVKFVKLVAEAGLYAHLRIGPYVCAEWNFGGFPLWLHFIPGIQFRTDNEPFKAEMQRFT 155
DL FV + AE GL+ LR GPY+C+E + GG P WL P + RT N+ F ++++
Sbjct: 142 DLEAFVLMGAEIGLWVILRPGPYICSEMDLGGLPSWLLQDPRLLLRTTNKSFIEAVEKYF 201
Query: 156 AKIVDMMKQEKLYASQGGPIILSQIENEYGNI--DSAYGA-AGKSYIKWAAGMALSLDTG 212
++ + L QGGP+I Q+ENEYG+ D Y K+ ++ L G
Sbjct: 202 DHLIP--RVIPLQYRQGGPVIAVQVENEYGSFKKDKTYMLYLHKALLRRGIVELLLTSDG 259
Query: 213 VPWVMCQQSDAPDPIIN--TCNGFYCDQFTPNSNNKPKMWTENWSGWFLSFGGAVPYRPV 270
V+ + IN + +Q +KP + E W GWF +G +
Sbjct: 260 EKHVLSGHTKGVLAAINLQKLHQDTFNQLHKVQRDKPLLIMEYWVGWFDRWGDKHHVKDA 319
Query: 271 EDLAFAVARFFQRGGTFQNYYMYHGGTNFDRTSGGPF------ISTSYDYDAPLDEYGLI 324
+++ AV+ F + +F N YM+HGGTNF +G + I TSYDYDA L E G
Sbjct: 320 KEVEHAVSEFIKYEISF-NVYMFHGGTNFGFMNGATYFGKHSGIVTSYDYDAVLTEAGDY 378
Query: 325 RQPKWGHLKDLHKAIKLCEAALVATDP---TYPSLGPNL 360
+ K+ L+ L +++ V P YP + P+L
Sbjct: 379 TE-KYLKLQKLFQSVSATPLPRVPKLPPKAVYPPVRPSL 416
>gi|427385726|ref|ZP_18882033.1| hypothetical protein HMPREF9447_03066 [Bacteroides oleiciplenus YIT
12058]
gi|425726765|gb|EKU89628.1| hypothetical protein HMPREF9447_03066 [Bacteroides oleiciplenus YIT
12058]
Length = 1106
Score = 164 bits (415), Expect = 2e-37, Method: Compositional matrix adjust.
Identities = 111/319 (34%), Positives = 157/319 (49%), Gaps = 25/319 (7%)
Query: 33 AVVIGGKRRVLISGSIHYPRSTPEMWPDLIQKSKDGGLDVIETYVFWNLHEPVRNQYNFE 92
+ ++ GK V+ + +HYPR W I+ K G++ + YVFWN HEP Y+F
Sbjct: 356 SFLLNGKPFVVKAAELHYPRIPKPYWDQRIKLCKALGMNTVCLYVFWNSHEPQPGTYDFT 415
Query: 93 GRYDLVKFVKLVAEAGLYAHLRIGPYVCAEWNFGGFPLWLHFIPGIQFRTDNEPFKAEMQ 152
+ DL +F +L + +Y LR GPYVCAEW GG P WL I+ R + F +
Sbjct: 416 EQNDLAEFCRLCQQNDMYVILRPGPYVCAEWEMGGLPWWLLKKKDIRLRESDPYFIERVN 475
Query: 153 RFTAKIVDMMKQEKLYASQGGPIILSQIENEYGNI--DSAYGAAGKSYIKWAAGMALSLD 210
F + +K L + GGPII+ Q+ENEYG+ D Y + + ++ G ++L
Sbjct: 476 LFEEAVAKQVKD--LTIANGGPIIMVQVENEYGSYGADKGYVSQIRDIVRTHFGNDIAL- 532
Query: 211 TGVPWVMCQQSDAPDPIINTCN---GFYCDQ-------FTPNSNNKPKMWTENWSGWFLS 260
W + D +I T N G DQ PNS P M +E WSGWF
Sbjct: 533 FQCDWASNFTLNGLDDLIWTMNFGTGANVDQQFAKLKKLRPNS---PLMCSEFWSGWFDK 589
Query: 261 FGGAVPYRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNFDRTSGG--PFIS---TSYDYD 315
+G RP ED+ + RG +F + YM HGGTN+ +G P + TSYDYD
Sbjct: 590 WGANHETRPAEDMIKGIDDMLSRGISF-SLYMTHGGTNWGHWAGANSPGFAPDVTSYDYD 648
Query: 316 APLDEYGLIRQPKWGHLKD 334
AP+ E G PK+ L++
Sbjct: 649 APISESGQT-TPKYWKLRE 666
>gi|312903586|ref|ZP_07762766.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX0635]
gi|310633462|gb|EFQ16745.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX0635]
Length = 611
Score = 164 bits (415), Expect = 2e-37, Method: Compositional matrix adjust.
Identities = 123/354 (34%), Positives = 170/354 (48%), Gaps = 48/354 (13%)
Query: 35 VIGGKRRVLISGSIHYPRSTPEMWPDLIQKSKDGGLDVIETYVFWNLHEPVRNQYNFEGR 94
++ GK LISG+IHY R TP W D + K G + IETY+ WNLHEPV Y+FEG
Sbjct: 11 LVDGKPTKLISGAIHYFRMTPAQWEDSLYNLKALGANTIETYIPWNLHEPVEGVYDFEGM 70
Query: 95 YDLVKFVKLVAEAGLYAHLRIGPYVCAEWNFGGFPLWLHFIPGIQFRTDNEPFKAEMQRF 154
D+V FV L E GL LR Y+CAEW FGG P WL ++ R+ + F A+++ +
Sbjct: 71 KDIVAFVSLAQELGLMVILRPSVYICAEWEFGGLPAWL-LKEHVRLRSTDPRFIAKVRTY 129
Query: 155 TAKIVDMMKQEKLYASQGGPIILSQIENEYGNIDSAYGAAGKSYIKWAAGMALSLDTGVP 214
+ V + K L + GGP+I+ Q+ENEYG +YG K Y++ + VP
Sbjct: 130 FS--VLLPKLVPLQVTHGGPVIMMQVENEYG----SYGME-KEYLRQTKQVMEEFGIDVP 182
Query: 215 WVMCQQSDAPDPIINTC-------------------NGFYCDQFTPNSNNK-PKMWTENW 254
+ A + +++ N F + K P M E W
Sbjct: 183 --LFTSDGAWEEVLDVGTLIEEDVFVTGNFGSHSKENATVMKAFMAKHDKKWPIMCMEYW 240
Query: 255 SGWFLSFGGAVPYRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNFDRTSGG--------P 306
GWF +G + R +DLA V G N YM+HGGTNF +G P
Sbjct: 241 DGWFNRWGEPIIKRDGQDLANEVKDMLALGSL--NLYMFHGGTNFGFYNGCSARGVLDLP 298
Query: 307 FISTSYDYDAPLDEYGLIRQPKWGHLKDLHKAIKLCEAALVATDP---TYPSLG 357
+ TSYDYDA L E G + K+ H++ +AIK + +P T+ SLG
Sbjct: 299 QV-TSYDYDALLTEAGEPTE-KYFHVQ---RAIKEVCPEVWQAEPRRKTFGSLG 347
Score = 44.7 bits (104), Expect = 0.21, Method: Compositional matrix adjust.
Identities = 73/302 (24%), Positives = 109/302 (36%), Gaps = 67/302 (22%)
Query: 456 KDDAFTKPGLLEQINTTADQSDYLWYSLSTNIKADEPLLEDGSKTVLHVQSLGHALHAFI 515
KD T + I +S Y + S N+K + L V LH F
Sbjct: 361 KDQMMTAQETMYPITMEEAESGYGYMLYSVNLKNYH------HENKLKVVEASDRLHLFA 414
Query: 516 NGKLVGSGYGSSSNAKVTVDFPIALAPGKN--TFDLLSLTVGLQNYGAFYEKTGAGITGP 573
+G L Y + +V I P K D+L +G NYG + GP
Sbjct: 415 DGSLQTIQYQENLGEEVM----IKGTPEKEWIELDVLVENLGRVNYGF-------KLNGP 463
Query: 574 VQLKGSGNGTNIDLSSQQWTYQTGLKGEELNFPSGSSTQWDSKSTLPKLQPLVWYKTTFD 633
Q+KG G D+ Q G + L + + D + QP +Y+ F
Sbjct: 464 TQVKGIRGGIMQDIHFHQ-----GYRQYALTLSADQLKKIDYTAGKNPAQP-SFYQAEFT 517
Query: 634 APAGSEPVAIDFTGMGKGEAWVNGQSIGRYWPTYVSQNGGCTDSCNYRGAYSSNKCLKNC 693
++ ID GKG VNG ++GRY Q G
Sbjct: 518 LTDLADTF-IDCRSYGKGVVIVNGINLGRYL-----QRG--------------------- 550
Query: 694 GKPSQSLYHVPRSWLKSSGNTLVLFEEIGGDPTKISF----VTKQL--------GSSLCS 741
P SLY P+ +LK N +V+FE G + ++ F + K+L GS++ +
Sbjct: 551 --PIHSLY-CPKEFLKKGTNEIVIFETEGIEINELIFCGQPIVKKLLTNDFSEIGSNIHN 607
Query: 742 HV 743
H+
Sbjct: 608 HI 609
>gi|67078211|ref|YP_245831.1| beta-galactosidase [Bacillus cereus E33L]
gi|66970517|gb|AAY60493.1| beta-galactosidase [Bacillus cereus E33L]
Length = 598
Score = 164 bits (415), Expect = 2e-37, Method: Compositional matrix adjust.
Identities = 110/317 (34%), Positives = 155/317 (48%), Gaps = 36/317 (11%)
Query: 32 RAVVIGGKRRVLISGSIHYPRSTPEMWPDLIQKSKDGGLDVIETYVFWNLHEPVRNQYNF 91
+ ++ G+ +ISG++HY R PE W + K G + +ETYV WN+HEP +NF
Sbjct: 8 KDFMLDGEPIKIISGALHYFRIVPEYWDHSLYNLKALGCNTVETYVPWNMHEPKEGIFNF 67
Query: 92 EGRYDLVKFVKLVAEAGLYAHLRIGPYVCAEWNFGGFPLWLHFIPGIQFRTDNEPFKAEM 151
EG DLVK+V+L + GL LR PY+CAEW FGG P WL I+ R++ F ++
Sbjct: 68 EGIADLVKYVQLAQKYGLMVILRPTPYICAEWEFGGLPAWLLKYKDIRVRSNTNLFLNKV 127
Query: 152 QRFTAKIVDMMKQEKLYASQGGPIILSQIENEYGNIDSAYGAAGKSYIKWAAGMALSLDT 211
+ F ++ M+ L GGPII+ Q+ENEYG+ K Y++ + L
Sbjct: 128 ENFYKVLLPMVT--PLQVENGGPIIMMQVENEYGSF-----GNDKEYVRNIKKLMRDLGV 180
Query: 212 GVP-------WVMCQQSDA--PDPIINTCN-GFYCDQ--------FTPNSNNKPKMWTEN 253
VP W +S + D ++ T N G ++ N P M E
Sbjct: 181 TVPLFTSDGAWQEALESGSLIDDDVLVTGNFGSRSNENLNELESFIKENKKEWPLMCMEF 240
Query: 254 WSGWFLSFGGAVPYRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNFDRTSGG-------- 305
W GWF +G + R +LA V +R N+YM+ GGTNF +G
Sbjct: 241 WDGWFNRWGMEIIRRDGSELAEEVKELLKRASI--NFYMFQGGTNFGFMNGCSSRENVDL 298
Query: 306 PFISTSYDYDAPLDEYG 322
P I TSYDYDA L E+G
Sbjct: 299 PQI-TSYDYDALLTEWG 314
Score = 54.3 bits (129), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 42/154 (27%), Positives = 73/154 (47%), Gaps = 20/154 (12%)
Query: 511 LHAFINGKLVGSGYGSSSNAKVTVDFPIALAPGKNTFDLLSLTVGLQNYGAFYEKTGAGI 570
+H F+N +LV + Y +V++D L +NT D+L +G NYGA +
Sbjct: 411 VHLFLNEQLVDTQYRDEIGREVSLD----LTKEENTLDILVENMGRVNYGA-------RL 459
Query: 571 TGPVQLKGSGNGTNIDLSSQQWTYQTGLKGEELNFPSGSSTQWDSKSTLPKLQPLVWYKT 630
P Q KG +G ID+ Q L+ + L+ + QW+ + +Y+
Sbjct: 460 LSPTQRKGISSGVMIDIHLQSNWEHYALEFDNLD-EIDFNGQWEPNTP-------SFYEY 511
Query: 631 TFDAPAGSEPVAIDFTGMGKGEAWVNGQSIGRYW 664
TF+ ++ +D + +GKG +NG ++G+YW
Sbjct: 512 TFNVQELNDTF-LDCSKLGKGFVVLNGFNLGKYW 544
>gi|228918502|ref|ZP_04081945.1| Beta-galactosidase [Bacillus thuringiensis serovar pulsiensis BGSC
4CC1]
gi|228841118|gb|EEM86317.1| Beta-galactosidase [Bacillus thuringiensis serovar pulsiensis BGSC
4CC1]
Length = 591
Score = 164 bits (415), Expect = 2e-37, Method: Compositional matrix adjust.
Identities = 109/317 (34%), Positives = 155/317 (48%), Gaps = 36/317 (11%)
Query: 32 RAVVIGGKRRVLISGSIHYPRSTPEMWPDLIQKSKDGGLDVIETYVFWNLHEPVRNQYNF 91
+ ++ G+ +ISG++HY R PE W + K G + +ETYV WN+HEP +NF
Sbjct: 8 KDFMLDGEPIKIISGALHYFRIVPEYWDHSLYNLKALGCNTVETYVPWNMHEPKEGVFNF 67
Query: 92 EGRYDLVKFVKLVAEAGLYAHLRIGPYVCAEWNFGGFPLWLHFIPGIQFRTDNEPFKAEM 151
EG DLVK+V+L + GL LR PY+CAEW FGG P WL I+ R++ F ++
Sbjct: 68 EGIADLVKYVQLAQKYGLMVILRPTPYICAEWEFGGLPAWLLKYRDIRVRSNTNLFLNKV 127
Query: 152 QRFTAKIVDMMKQEKLYASQGGPIILSQIENEYGNIDSAYGAAGKSYIKWAAGMALSLDT 211
+ F ++ ++ L GGPII+ Q+ENEYG+ K Y++ + L
Sbjct: 128 ENFYKVLLPLVTS--LQVENGGPIIMMQVENEYGSF-----GNDKEYVRSIKKLMRDLGV 180
Query: 212 GVP-------WVMCQQSDA--PDPIINTCN-GFYCDQ--------FTPNSNNKPKMWTEN 253
VP W +S + D ++ T N G ++ N P M E
Sbjct: 181 TVPLFTSDGAWQEALESGSLIDDDVLVTGNFGSRSNENLNALESFIKENKKEWPLMCMEF 240
Query: 254 WSGWFLSFGGAVPYRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNFDRTSGG-------- 305
W GWF +G + R +LA V +R N+YM+ GGTNF +G
Sbjct: 241 WDGWFNRWGMEIIRRDSSELAEEVKELLKRASI--NFYMFQGGTNFGFMNGCSSRENVDL 298
Query: 306 PFISTSYDYDAPLDEYG 322
P I TSYDYDA L E+G
Sbjct: 299 PQI-TSYDYDALLTEWG 314
Score = 50.4 bits (119), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 40/154 (25%), Positives = 71/154 (46%), Gaps = 20/154 (12%)
Query: 511 LHAFINGKLVGSGYGSSSNAKVTVDFPIALAPGKNTFDLLSLTVGLQNYGAFYEKTGAGI 570
+H F+N +L+ + Y +V++D L +NT D+L +G NYGA +
Sbjct: 411 VHLFLNEQLIDTQYRDEIGREVSLD----LTKEENTLDILVENMGRVNYGA-------RL 459
Query: 571 TGPVQLKGSGNGTNIDLSSQQWTYQTGLKGEELNFPSGSSTQWDSKSTLPKLQPLVWYKT 630
Q KG +G ID+ Q L+ + L+ + QW+ + +Y+
Sbjct: 460 LSQTQRKGISSGVMIDIHLQSNWEHYALEFDNLD-EIDFNGQWEPNTP-------SFYEY 511
Query: 631 TFDAPAGSEPVAIDFTGMGKGEAWVNGQSIGRYW 664
TF+ + +D + +GKG +NG ++G+YW
Sbjct: 512 TFNVQELKDTF-LDCSKLGKGFVVLNGFNLGKYW 544
>gi|315647882|ref|ZP_07900983.1| Beta-galactosidase [Paenibacillus vortex V453]
gi|315276528|gb|EFU39871.1| Beta-galactosidase [Paenibacillus vortex V453]
Length = 587
Score = 164 bits (415), Expect = 2e-37, Method: Compositional matrix adjust.
Identities = 109/308 (35%), Positives = 156/308 (50%), Gaps = 22/308 (7%)
Query: 31 HRAVVIGGKRRVLISGSIHYPRSTPEMWPDLIQKSKDGGLDVIETYVFWNLHEPVRNQYN 90
++ V+G + ++SG++HY R PE W D + K K G + +ETY+ WNLHEP Q+
Sbjct: 9 NQQFVLGDEPIQILSGAVHYFRIVPEYWEDRLMKLKACGFNTVETYIPWNLHEPKEGQFT 68
Query: 91 FEGRYDLVKFVKLVAEAGLYAHLRIGPYVCAEWNFGGFPLWLHFIPGIQFRTDNEPFKAE 150
F+G DL FV+ GL+ LR PY+CAEW FGG P WL P I R + + +
Sbjct: 69 FDGIADLEGFVQKAGHLGLHVILRPSPYICAEWEFGGLPAWLLQYPDIHLRCMDPVYLEK 128
Query: 151 MQRFTAKIVDMMKQEKLYASQGGPIILSQIENEYGNI--DSAYGAAGKSYIKWAAGMALS 208
+ + +++ + L S+GGP+I QIENEYG+ D+AY K + A G+ +
Sbjct: 129 VDHYYDELIPRIV--PLLTSKGGPVIAIQIENEYGSYGNDTAYLEYLKDGLS-ARGVDVL 185
Query: 209 LDT--GVPWVMCQQSDAPDPIINTCN-----GFYCDQFTPNSNNKPKMWTENWSGWFLSF 261
L T G M Q P+ ++ T N G + P M E W+GWF +
Sbjct: 186 LFTSDGPTDGMLQGGTVPN-VLATVNFGSRPGEAFAKLREYRTEDPLMCMEYWNGWFDHW 244
Query: 262 GGAVPYRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNF-------DRTSGGPFISTSYDY 314
R E++A + + N+YM+HGGTNF D+ P + TSYDY
Sbjct: 245 LKPHHTRSSEEVAQVFEEMLRLNASV-NFYMFHGGTNFGFYNGANDQEKYEPTV-TSYDY 302
Query: 315 DAPLDEYG 322
DAPL E G
Sbjct: 303 DAPLSECG 310
>gi|29376389|ref|NP_815543.1| glycosyl hydrolase [Enterococcus faecalis V583]
gi|227519038|ref|ZP_03949087.1| possible beta-galactosidase [Enterococcus faecalis TX0104]
gi|227553661|ref|ZP_03983710.1| possible beta-galactosidase [Enterococcus faecalis HH22]
gi|256961654|ref|ZP_05565825.1| beta-galactosidase [Enterococcus faecalis Merz96]
gi|293383358|ref|ZP_06629271.1| beta-galactosidase [Enterococcus faecalis R712]
gi|293388990|ref|ZP_06633475.1| beta-galactosidase [Enterococcus faecalis S613]
gi|312907816|ref|ZP_07766806.1| glycosyl hydrolase family 35 [Enterococcus faecalis DAPTO 512]
gi|312910433|ref|ZP_07769280.1| glycosyl hydrolase family 35 [Enterococcus faecalis DAPTO 516]
gi|422714340|ref|ZP_16771066.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX0309A]
gi|422715597|ref|ZP_16772313.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX0309B]
gi|424676484|ref|ZP_18113355.1| putative beta-galactosidase [Enterococcus faecalis ERV103]
gi|424681702|ref|ZP_18118489.1| putative beta-galactosidase [Enterococcus faecalis ERV116]
gi|424685588|ref|ZP_18122282.1| putative beta-galactosidase [Enterococcus faecalis ERV129]
gi|424686206|ref|ZP_18122874.1| putative beta-galactosidase [Enterococcus faecalis ERV25]
gi|424690524|ref|ZP_18127059.1| putative beta-galactosidase [Enterococcus faecalis ERV31]
gi|424694932|ref|ZP_18131318.1| putative beta-galactosidase [Enterococcus faecalis ERV37]
gi|424696643|ref|ZP_18132984.1| putative beta-galactosidase [Enterococcus faecalis ERV41]
gi|424700339|ref|ZP_18136532.1| putative beta-galactosidase [Enterococcus faecalis ERV62]
gi|424703758|ref|ZP_18139884.1| putative beta-galactosidase [Enterococcus faecalis ERV63]
gi|424712611|ref|ZP_18144783.1| putative beta-galactosidase [Enterococcus faecalis ERV65]
gi|424718249|ref|ZP_18147501.1| putative beta-galactosidase [Enterococcus faecalis ERV68]
gi|424721894|ref|ZP_18150963.1| putative beta-galactosidase [Enterococcus faecalis ERV72]
gi|424723972|ref|ZP_18152924.1| putative beta-galactosidase [Enterococcus faecalis ERV73]
gi|424733572|ref|ZP_18162127.1| putative beta-galactosidase [Enterococcus faecalis ERV81]
gi|424741709|ref|ZP_18170052.1| putative beta-galactosidase [Enterococcus faecalis ERV85]
gi|424751990|ref|ZP_18179997.1| putative beta-galactosidase [Enterococcus faecalis ERV93]
gi|29343852|gb|AAO81613.1| glycosyl hydrolase, family 35 [Enterococcus faecalis V583]
gi|227073538|gb|EEI11501.1| possible beta-galactosidase [Enterococcus faecalis TX0104]
gi|227177203|gb|EEI58175.1| possible beta-galactosidase [Enterococcus faecalis HH22]
gi|256952150|gb|EEU68782.1| beta-galactosidase [Enterococcus faecalis Merz96]
gi|291079149|gb|EFE16513.1| beta-galactosidase [Enterococcus faecalis R712]
gi|291081771|gb|EFE18734.1| beta-galactosidase [Enterococcus faecalis S613]
gi|310626177|gb|EFQ09460.1| glycosyl hydrolase family 35 [Enterococcus faecalis DAPTO 512]
gi|311289706|gb|EFQ68262.1| glycosyl hydrolase family 35 [Enterococcus faecalis DAPTO 516]
gi|315575942|gb|EFU88133.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX0309B]
gi|315580774|gb|EFU92965.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX0309A]
gi|402350621|gb|EJU85522.1| putative beta-galactosidase [Enterococcus faecalis ERV116]
gi|402356496|gb|EJU91227.1| putative beta-galactosidase [Enterococcus faecalis ERV103]
gi|402358329|gb|EJU93003.1| putative beta-galactosidase [Enterococcus faecalis ERV129]
gi|402364102|gb|EJU98549.1| putative beta-galactosidase [Enterococcus faecalis ERV31]
gi|402367740|gb|EJV02077.1| putative beta-galactosidase [Enterococcus faecalis ERV25]
gi|402369105|gb|EJV03397.1| putative beta-galactosidase [Enterococcus faecalis ERV37]
gi|402374029|gb|EJV08075.1| putative beta-galactosidase [Enterococcus faecalis ERV62]
gi|402377412|gb|EJV11319.1| putative beta-galactosidase [Enterococcus faecalis ERV41]
gi|402379869|gb|EJV13650.1| putative beta-galactosidase [Enterococcus faecalis ERV65]
gi|402382152|gb|EJV15835.1| putative beta-galactosidase [Enterococcus faecalis ERV68]
gi|402384002|gb|EJV17579.1| putative beta-galactosidase [Enterococcus faecalis ERV63]
gi|402390099|gb|EJV23464.1| putative beta-galactosidase [Enterococcus faecalis ERV72]
gi|402391584|gb|EJV24885.1| putative beta-galactosidase [Enterococcus faecalis ERV81]
gi|402396442|gb|EJV29504.1| putative beta-galactosidase [Enterococcus faecalis ERV73]
gi|402401146|gb|EJV33935.1| putative beta-galactosidase [Enterococcus faecalis ERV85]
gi|402404973|gb|EJV37581.1| putative beta-galactosidase [Enterococcus faecalis ERV93]
Length = 611
Score = 164 bits (414), Expect = 2e-37, Method: Compositional matrix adjust.
Identities = 123/354 (34%), Positives = 170/354 (48%), Gaps = 48/354 (13%)
Query: 35 VIGGKRRVLISGSIHYPRSTPEMWPDLIQKSKDGGLDVIETYVFWNLHEPVRNQYNFEGR 94
++ GK LISG+IHY R TP W D + K G + IETY+ WNLHEPV Y+FEG
Sbjct: 11 LVDGKPTKLISGAIHYFRMTPAQWEDSLYNLKALGANTIETYIPWNLHEPVEGVYDFEGM 70
Query: 95 YDLVKFVKLVAEAGLYAHLRIGPYVCAEWNFGGFPLWLHFIPGIQFRTDNEPFKAEMQRF 154
D+V FV L E GL LR Y+CAEW FGG P WL ++ R+ + F A+++ +
Sbjct: 71 KDIVAFVSLAQELGLMVILRPSVYICAEWEFGGLPAWL-LKEHVRLRSTDPRFIAKVRTY 129
Query: 155 TAKIVDMMKQEKLYASQGGPIILSQIENEYGNIDSAYGAAGKSYIKWAAGMALSLDTGVP 214
+ V + K L + GGP+I+ Q+ENEYG +YG K Y++ + VP
Sbjct: 130 FS--VLLPKLVPLQVTHGGPVIMMQVENEYG----SYGME-KEYLRQTKQVMEEFGIDVP 182
Query: 215 WVMCQQSDAPDPIINTC-------------------NGFYCDQFTPNSNNK-PKMWTENW 254
+ A + +++ N F + K P M E W
Sbjct: 183 --LFTSDGAWEEVLDVGTLIEEDVFVTGNFGSHSKENATVMKAFMAKHDKKWPIMCMEYW 240
Query: 255 SGWFLSFGGAVPYRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNFDRTSGG--------P 306
GWF +G + R +DLA V G N YM+HGGTNF +G P
Sbjct: 241 DGWFNRWGEPIIKRDGQDLANEVKDMLALGSL--NLYMFHGGTNFGFYNGCSARGVLDLP 298
Query: 307 FISTSYDYDAPLDEYGLIRQPKWGHLKDLHKAIKLCEAALVATDP---TYPSLG 357
+ TSYDYDA L E G + K+ H++ +AIK + +P T+ SLG
Sbjct: 299 QV-TSYDYDALLTEAGEPTE-KYFHVQ---RAIKEVCPEVWQAEPRRKTFGSLG 347
Score = 49.7 bits (117), Expect = 0.007, Method: Compositional matrix adjust.
Identities = 74/302 (24%), Positives = 110/302 (36%), Gaps = 67/302 (22%)
Query: 456 KDDAFTKPGLLEQINTTADQSDYLWYSLSTNIKADEPLLEDGSKTVLHVQSLGHALHAFI 515
KD T + I +S Y + S N+K + L V LH F
Sbjct: 361 KDQMMTAQETMYPITMEEAESGYGYMLYSVNLKNYH------HENKLKVVEASDRLHLFA 414
Query: 516 NGKLVGSGYGSSSNAKVTVDFPIALAPGKN--TFDLLSLTVGLQNYGAFYEKTGAGITGP 573
+G L Y + +V I P K D+L +G NYG + GP
Sbjct: 415 DGSLQTIQYQENLREEVM----IKGTPEKEWIELDVLVENLGRVNYGF-------KLNGP 463
Query: 574 VQLKGSGNGTNIDLSSQQWTYQTGLKGEELNFPSGSSTQWDSKSTLPKLQPLVWYKTTFD 633
Q+KG G D+ Q G + L + + D + QP +Y+ F
Sbjct: 464 TQVKGIRGGIMQDIHFHQ-----GYRQYALTLSADQLKKIDYTAGKNPAQP-SFYQAEFT 517
Query: 634 APAGSEPVAIDFTGMGKGEAWVNGQSIGRYWPTYVSQNGGCTDSCNYRGAYSSNKCLKNC 693
++ ID GKG VNG ++GRYW Q G
Sbjct: 518 LTDLADTF-IDCRSYGKGVVIVNGINLGRYW-----QRG--------------------- 550
Query: 694 GKPSQSLYHVPRSWLKSSGNTLVLFEEIGGDPTKISF----VTKQL--------GSSLCS 741
P SLY P+ +LK N +V+FE G + ++ F + K+L GS++ +
Sbjct: 551 --PIHSLY-CPKEFLKKGTNEIVIFETEGIEINELIFCGQPIVKKLLTNDFSEIGSNIHN 607
Query: 742 HV 743
H+
Sbjct: 608 HI 609
>gi|86142033|ref|ZP_01060557.1| putative exported beta-galactosidase [Leeuwenhoekiella blandensis
MED217]
gi|85831596|gb|EAQ50052.1| putative exported beta-galactosidase [Leeuwenhoekiella blandensis
MED217]
Length = 620
Score = 164 bits (414), Expect = 2e-37, Method: Compositional matrix adjust.
Identities = 117/361 (32%), Positives = 176/361 (48%), Gaps = 41/361 (11%)
Query: 6 ILLLVLCWGFVVLATTSFGANVTYDHRAVVIGGKRRVLISGSIHYPRSTPEMWPDLIQKS 65
++L+VL +GF A A+ ++ + V GK + SG +HY R E W IQ
Sbjct: 11 LVLIVLSFGF---AQAQDDASFKIENGSFVYNGKPTPIYSGEMHYERIPKEYWRHRIQMM 67
Query: 66 KDGGLDVIETYVFWNLHEPVRNQYNFE-GRYDLVKFVKLVAEAGLYAHLRIGPYVCAEWN 124
K GL+ I TYVFWN H P ++FE G ++ +F+K+ E ++ LR GPY C EW
Sbjct: 68 KAMGLNTIATYVFWNYHNPAPGVWDFESGNRNVAEFIKIAKEEEMFVILRPGPYACGEWE 127
Query: 125 FGGFPLWLHFIPGIQFRTDNEPFKAEMQRFTAKIVDMMKQ-EKLYASQGGPIILSQIENE 183
FGG+P +L IPG++ R +N F A + + I ++ KQ L + GG II++Q+ENE
Sbjct: 128 FGGYPWFLQNIPGLKVRENNAQFLAACKEY---INELAKQVAPLQVNNGGNIIMTQVENE 184
Query: 184 YGNI-----------DSAYGAAGKSYIKWAAGMALSLDTGVPWVMCQQSDAPDPIINTCN 232
+G+ AY A +K A A + W+ + + + ++ T N
Sbjct: 185 FGSYVAQREDIAPEDHKAYKEAIFKMLKDAGFQAPFFTSDGAWLF--EGGSLEGVLPTAN 242
Query: 233 GF--------YCDQFTPNSNNKPKMWTENWSGWFLSFGGAVPYRPVEDLAFAVARFFQRG 284
G ++F N+N P M E + GW + D+A + + G
Sbjct: 243 GEGNIDNLKKVVNKF--NNNEGPYMVAEFYPGWLDHWAEPFVKISASDIAKQTEVYLKNG 300
Query: 285 GTFQNYYMYHGGTNFDRTSGGPFIS--------TSYDYDAPLDEYGLIRQPKWGHLKDLH 336
F N+YM HGGTNF TSG + TSYDYDAP+ E G + PK+ ++ L
Sbjct: 301 VNF-NFYMAHGGTNFGFTSGANYNDEHDIQPDITSYDYDAPISEAGWVT-PKYDSIRALM 358
Query: 337 K 337
+
Sbjct: 359 Q 359
Score = 53.5 bits (127), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 55/220 (25%), Positives = 86/220 (39%), Gaps = 51/220 (23%)
Query: 502 LHVQSLGHALHAFINGKLVGSGYGSSSNAKVTVDFPIALAPGKNTFDLLSLTVGLQNYGA 561
L V L ++NGK VG + + + PI + P + ++L +G NYGA
Sbjct: 431 LKVPGLRDFATVYVNGKKVGE----LNRVFNSYEMPIKI-PFNGSLEILVENMGRINYGA 485
Query: 562 FYEKTGAGITGPVQLKGSGNGTNIDLSSQQWTYQTGLKGEELNFPSGSSTQWDSKSTLPK 621
GIT PV + Y+ E P + + + +
Sbjct: 486 EIVNNLKGITAPVSIN---------------DYEITGGWEMYKAPFAEVPEVINSTEVKT 530
Query: 622 LQPLVWYKTTFDAPAGSEPVAIDFTGMGKGEAWVNGQSIGRYWPTYVSQNGGCTDSCNYR 681
+P+V Y +FD + ++ + MGKG +VNG ++GRYW
Sbjct: 531 GRPVV-YSGSFDLKKQGD-TFLNMSEMGKGIVFVNGHNLGRYWKV--------------- 573
Query: 682 GAYSSNKCLKNCGKPSQSLYHVPRSWLKSSGNTLVLFEEI 721
P Q+LY VP WLK GNT+ +FE++
Sbjct: 574 -------------GPQQTLY-VPGCWLKKKGNTITIFEQL 599
>gi|229545563|ref|ZP_04434288.1| possible beta-galactosidase [Enterococcus faecalis TX1322]
gi|256619317|ref|ZP_05476163.1| beta-galactosidase [Enterococcus faecalis ATCC 4200]
gi|256853375|ref|ZP_05558745.1| glycosyl hydrolase, family 35 [Enterococcus faecalis T8]
gi|256964870|ref|ZP_05569041.1| beta-galactosidase [Enterococcus faecalis HIP11704]
gi|257090147|ref|ZP_05584508.1| beta-galactosidase [Enterococcus faecalis CH188]
gi|294614275|ref|ZP_06694194.1| glycosyl hydrolase, family 35 [Enterococcus faecium E1636]
gi|307272958|ref|ZP_07554205.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX0855]
gi|307277803|ref|ZP_07558888.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX0860]
gi|307291733|ref|ZP_07571605.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX0411]
gi|384518848|ref|YP_005706153.1| beta-galactosidase [Enterococcus faecalis 62]
gi|422685728|ref|ZP_16743941.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX4000]
gi|422689100|ref|ZP_16747212.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX0630]
gi|422720655|ref|ZP_16777264.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX0017]
gi|422731066|ref|ZP_16787446.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX0645]
gi|422739263|ref|ZP_16794446.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX2141]
gi|430849460|ref|ZP_19467237.1| glycosyl hydrolase [Enterococcus faecium E1185]
gi|229309303|gb|EEN75290.1| possible beta-galactosidase [Enterococcus faecalis TX1322]
gi|256598844|gb|EEU18020.1| beta-galactosidase [Enterococcus faecalis ATCC 4200]
gi|256711834|gb|EEU26872.1| glycosyl hydrolase, family 35 [Enterococcus faecalis T8]
gi|256955366|gb|EEU71998.1| beta-galactosidase [Enterococcus faecalis HIP11704]
gi|256998959|gb|EEU85479.1| beta-galactosidase [Enterococcus faecalis CH188]
gi|291592934|gb|EFF24524.1| glycosyl hydrolase, family 35 [Enterococcus faecium E1636]
gi|306497185|gb|EFM66730.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX0411]
gi|306505543|gb|EFM74728.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX0860]
gi|306510572|gb|EFM79595.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX0855]
gi|315029440|gb|EFT41372.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX4000]
gi|315032046|gb|EFT43978.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX0017]
gi|315144925|gb|EFT88941.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX2141]
gi|315162898|gb|EFU06915.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX0645]
gi|315577862|gb|EFU90053.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX0630]
gi|323480981|gb|ADX80420.1| beta-galactosidase [Enterococcus faecalis 62]
gi|430537598|gb|ELA77922.1| glycosyl hydrolase [Enterococcus faecium E1185]
Length = 611
Score = 164 bits (414), Expect = 2e-37, Method: Compositional matrix adjust.
Identities = 123/354 (34%), Positives = 170/354 (48%), Gaps = 48/354 (13%)
Query: 35 VIGGKRRVLISGSIHYPRSTPEMWPDLIQKSKDGGLDVIETYVFWNLHEPVRNQYNFEGR 94
++ GK LISG+IHY R TP W D + K G + IETY+ WNLHEPV Y+FEG
Sbjct: 11 LVDGKPTKLISGAIHYFRMTPAQWEDSLYNLKALGANTIETYIPWNLHEPVEGVYDFEGM 70
Query: 95 YDLVKFVKLVAEAGLYAHLRIGPYVCAEWNFGGFPLWLHFIPGIQFRTDNEPFKAEMQRF 154
D+V FV L E GL LR Y+CAEW FGG P WL ++ R+ + F A+++ +
Sbjct: 71 KDIVAFVSLAQELGLMVILRPSVYICAEWEFGGLPAWL-LKEHVRLRSTDPRFIAKVRTY 129
Query: 155 TAKIVDMMKQEKLYASQGGPIILSQIENEYGNIDSAYGAAGKSYIKWAAGMALSLDTGVP 214
+ V + K L + GGP+I+ Q+ENEYG +YG K Y++ + VP
Sbjct: 130 FS--VLLPKLVPLQVTHGGPVIMMQVENEYG----SYGME-KEYLRQTKQVMEEFGIDVP 182
Query: 215 WVMCQQSDAPDPIINTC-------------------NGFYCDQFTPNSNNK-PKMWTENW 254
+ A + +++ N F + K P M E W
Sbjct: 183 --LFTSDGAWEEVLDVGTLIEEDVFVTGNFGSHSKENATVMKAFMAKHDKKWPIMCMEYW 240
Query: 255 SGWFLSFGGAVPYRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNFDRTSGG--------P 306
GWF +G + R +DLA V G N YM+HGGTNF +G P
Sbjct: 241 DGWFNRWGEPIIKRDGQDLANEVKDMLALGSL--NLYMFHGGTNFGFYNGCSARGVLDLP 298
Query: 307 FISTSYDYDAPLDEYGLIRQPKWGHLKDLHKAIKLCEAALVATDP---TYPSLG 357
+ TSYDYDA L E G + K+ H++ +AIK + +P T+ SLG
Sbjct: 299 QV-TSYDYDALLTEAGEPTE-KYFHVQ---RAIKEVCPEVWQAEPRRKTFGSLG 347
Score = 48.9 bits (115), Expect = 0.011, Method: Compositional matrix adjust.
Identities = 74/302 (24%), Positives = 110/302 (36%), Gaps = 67/302 (22%)
Query: 456 KDDAFTKPGLLEQINTTADQSDYLWYSLSTNIKADEPLLEDGSKTVLHVQSLGHALHAFI 515
KD T + I +S Y + S N+K + L V LH F
Sbjct: 361 KDQMMTAQETMYPITMEEAESGYGYMLYSVNLKNYH------HENKLKVVEASDRLHLFA 414
Query: 516 NGKLVGSGYGSSSNAKVTVDFPIALAPGKN--TFDLLSLTVGLQNYGAFYEKTGAGITGP 573
+G L Y + +V I P K D+L +G NYG + GP
Sbjct: 415 DGSLQTIQYQENLGEEVM----IKGTPEKEWIELDVLVENLGRVNYGF-------KLNGP 463
Query: 574 VQLKGSGNGTNIDLSSQQWTYQTGLKGEELNFPSGSSTQWDSKSTLPKLQPLVWYKTTFD 633
Q+KG G D+ Q G + L + + D + QP +Y+ F
Sbjct: 464 TQVKGIRGGIMQDIHFHQ-----GYRQYALTLSADQLKKIDYTAGKNPAQP-SFYQAEFT 517
Query: 634 APAGSEPVAIDFTGMGKGEAWVNGQSIGRYWPTYVSQNGGCTDSCNYRGAYSSNKCLKNC 693
++ ID GKG VNG ++GRYW Q G
Sbjct: 518 LTDLADTF-IDCRSYGKGVVIVNGINLGRYW-----QRG--------------------- 550
Query: 694 GKPSQSLYHVPRSWLKSSGNTLVLFEEIGGDPTKISF----VTKQL--------GSSLCS 741
P SLY P+ +LK N +V+FE G + ++ F + K+L GS++ +
Sbjct: 551 --PIHSLY-CPKEFLKKGTNEIVIFETEGIEINELIFCGQPIVKKLLTNDFSEIGSNIHN 607
Query: 742 HV 743
H+
Sbjct: 608 HI 609
>gi|307275710|ref|ZP_07556850.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX2134]
gi|306507586|gb|EFM76716.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX2134]
Length = 611
Score = 164 bits (414), Expect = 3e-37, Method: Compositional matrix adjust.
Identities = 123/354 (34%), Positives = 170/354 (48%), Gaps = 48/354 (13%)
Query: 35 VIGGKRRVLISGSIHYPRSTPEMWPDLIQKSKDGGLDVIETYVFWNLHEPVRNQYNFEGR 94
++ GK LISG+IHY R TP W D + K G + IETY+ WNLHEPV Y+FEG
Sbjct: 11 LVDGKPTKLISGAIHYFRMTPAQWEDSLYNLKALGANTIETYIPWNLHEPVEGVYDFEGM 70
Query: 95 YDLVKFVKLVAEAGLYAHLRIGPYVCAEWNFGGFPLWLHFIPGIQFRTDNEPFKAEMQRF 154
D+V FV L E GL LR Y+CAEW FGG P WL ++ R+ + F A+++ +
Sbjct: 71 KDIVAFVSLAQELGLMVILRPSVYICAEWEFGGLPAWL-LKEHVRLRSTDPRFIAKVRTY 129
Query: 155 TAKIVDMMKQEKLYASQGGPIILSQIENEYGNIDSAYGAAGKSYIKWAAGMALSLDTGVP 214
+ V + K L + GGP+I+ Q+ENEYG +YG K Y++ + VP
Sbjct: 130 FS--VLLPKLVPLQVTHGGPVIMMQVENEYG----SYGME-KEYLRQTKQVMEEFGIDVP 182
Query: 215 WVMCQQSDAPDPIINTC-------------------NGFYCDQFTPNSNNK-PKMWTENW 254
+ A + +++ N F + K P M E W
Sbjct: 183 --LFTSDGAWEEVLDVGTLIEEDVFVTGNFGSHSKENATVMKAFMAKHDKKWPIMCMEYW 240
Query: 255 SGWFLSFGGAVPYRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNFDRTSGG--------P 306
GWF +G + R +DLA V G N YM+HGGTNF +G P
Sbjct: 241 DGWFNRWGEPIIKRDGQDLANEVKDMLALGSL--NLYMFHGGTNFGFYNGCSARGVLDLP 298
Query: 307 FISTSYDYDAPLDEYGLIRQPKWGHLKDLHKAIKLCEAALVATDP---TYPSLG 357
+ TSYDYDA L E G + K+ H++ +AIK + +P T+ SLG
Sbjct: 299 QV-TSYDYDALLTEAGEPTE-KYFHVQ---RAIKEVCPEVWQAEPRRKTFGSLG 347
Score = 46.2 bits (108), Expect = 0.067, Method: Compositional matrix adjust.
Identities = 73/302 (24%), Positives = 107/302 (35%), Gaps = 67/302 (22%)
Query: 456 KDDAFTKPGLLEQINTTADQSDYLWYSLSTNIKADEPLLEDGSKTVLHVQSLGHALHAFI 515
KD T + I +S Y + S N+K + L V LH F
Sbjct: 361 KDQMMTAQETMYPITMEEAESGYGYMLYSVNLKNYH------HENKLKVVEASDRLHLFA 414
Query: 516 NGKLVGSGYGSSSNAKVTVDFPIALAPGKN--TFDLLSLTVGLQNYGAFYEKTGAGITGP 573
+G L Y + + I P K D+L +G NYG + GP
Sbjct: 415 DGSLQTIQYQENLGEEEM----IKGTPEKEWIELDVLVENLGRVNYGF-------KLNGP 463
Query: 574 VQLKGSGNGTNIDLSSQQWTYQTGLKGEELNFPSGSSTQWDSKSTLPKLQPLVWYKTTFD 633
Q+KG G D+ Q G + L + + D + QP +Y+ F
Sbjct: 464 TQVKGIRGGIMQDIHFHQ-----GYRQYALTLSADQLKKIDYTAGKNPAQP-SFYQAEFT 517
Query: 634 APAGSEPVAIDFTGMGKGEAWVNGQSIGRYWPTYVSQNGGCTDSCNYRGAYSSNKCLKNC 693
++ ID GKG VNG ++GRYW Q G
Sbjct: 518 LTDLADTF-IDCRSYGKGVVIVNGINLGRYW-----QRG--------------------- 550
Query: 694 GKPSQSLYHVPRSWLKSSGNTLVLFEEIG---------GDPTKISFVTK---QLGSSLCS 741
P SLY P+ +LK N +V+FE G G P +T ++GS++ +
Sbjct: 551 --PIHSLY-CPKEFLKKGTNEIVIFETEGIEINELIFCGQPIVKKLLTNDFSEIGSNIHN 607
Query: 742 HV 743
H+
Sbjct: 608 HI 609
>gi|312378199|gb|EFR24839.1| hypothetical protein AND_10320 [Anopheles darlingi]
Length = 639
Score = 164 bits (414), Expect = 3e-37, Method: Compositional matrix adjust.
Identities = 112/344 (32%), Positives = 169/344 (49%), Gaps = 37/344 (10%)
Query: 7 LLLVLCWGFVVLATTSFGANVTYDHRAVVIGGKRRVLISGSIHYPRSTPEMWPDLIQKSK 66
++L +C LA + Y+ V+ GK ++GS HY R+ P+ W ++ +
Sbjct: 6 IVLAVCLAIAGLAEAQRSFTIDYERDTFVMDGKDFRYVAGSFHYFRALPQTWRTKLRTLR 65
Query: 67 DGGLDVIETYVFWNLHEPVRNQYNFEGRYDLVKFVKLVAEAGLYAHLRIGPYVCAEWNFG 126
GGL+ ++ YV W+LH P Y++EG ++ ++ E LY LR GPY+CAE + G
Sbjct: 66 AGGLNAVDLYVQWSLHNPRDGVYSWEGIANVTDIIEAAIEEDLYVILRPGPYICAEIDNG 125
Query: 127 GFPLWL-HFIPGIQFRTDNEPFKAEMQRFTAKIVDMMKQEKLYASQGGPIILSQIENEYG 185
G P WL + PGIQ RT + + AE++++ ++ M + E GGPII+ QIENEYG
Sbjct: 126 GLPYWLFNKYPGIQVRTSDANYLAEVKKWYGEL--MSRMEPYMYGNGGPIIMVQIENEYG 183
Query: 186 NIDSAYGAAGKSYI--------KWAAGMALSLDTGVPW---VMCQQSDAPDPIINTCNGF 234
A+G K Y+ ++ A+ P+ + C Q D I T G
Sbjct: 184 ----AFGKCDKPYLNFLKEETNRYVQDKAVLFTVDRPYDDEIGCGQIDG--VFITTDFGL 237
Query: 235 YCDQFTPNSNNK--------PKMWTENWSGWFLSFGGAVPYRPVEDLAFAVARFFQRGGT 286
D+ K P + TE ++GW + + RP LA A R + G
Sbjct: 238 MTDEEVDTHAAKVRSYQPKGPLVNTEFYTGWLTHWQESNQRRPAGPLA-ATLRKMLKDGW 296
Query: 287 FQNYYMYHGGTNFDRTSG------GPFIS--TSYDYDAPLDEYG 322
++YMY GGTNF +G G +++ TSYDYDAP+DE G
Sbjct: 297 NVDFYMYFGGTNFGFWAGANDWGLGKYMADITSYDYDAPMDEAG 340
>gi|414879450|tpg|DAA56581.1| TPA: hypothetical protein ZEAMMB73_811947 [Zea mays]
Length = 154
Score = 164 bits (414), Expect = 3e-37, Method: Composition-based stats.
Identities = 70/107 (65%), Positives = 86/107 (80%)
Query: 20 TTSFGANVTYDHRAVVIGGKRRVLISGSIHYPRSTPEMWPDLIQKSKDGGLDVIETYVFW 79
TT+ VTYD RA+++ G RR+L SG +HYPRSTPEMWPDLI K+K GGLDVI+TYVFW
Sbjct: 31 TTAGRGEVTYDGRALILDGARRMLFSGDMHYPRSTPEMWPDLIAKAKKGGLDVIQTYVFW 90
Query: 80 NLHEPVRNQYNFEGRYDLVKFVKLVAEAGLYAHLRIGPYVCAEWNFG 126
N HEPV+ Q+NFEGRYDLVKF++ + GLY LRIGP+V +EW +G
Sbjct: 91 NAHEPVQGQFNFEGRYDLVKFIREIHAQGLYVSLRIGPFVESEWKYG 137
>gi|421514041|ref|ZP_15960756.1| Beta-galactosidase 3 [Enterococcus faecalis ATCC 29212]
gi|401672838|gb|EJS79281.1| Beta-galactosidase 3 [Enterococcus faecalis ATCC 29212]
Length = 611
Score = 164 bits (414), Expect = 3e-37, Method: Compositional matrix adjust.
Identities = 123/354 (34%), Positives = 170/354 (48%), Gaps = 48/354 (13%)
Query: 35 VIGGKRRVLISGSIHYPRSTPEMWPDLIQKSKDGGLDVIETYVFWNLHEPVRNQYNFEGR 94
++ GK LISG+IHY R TP W D + K G + IETY+ WNLHEPV Y+FEG
Sbjct: 11 LVDGKPTKLISGAIHYFRMTPAQWEDSLYNLKALGANTIETYIPWNLHEPVEGVYDFEGM 70
Query: 95 YDLVKFVKLVAEAGLYAHLRIGPYVCAEWNFGGFPLWLHFIPGIQFRTDNEPFKAEMQRF 154
D+V FV L E GL LR Y+CAEW FGG P WL ++ R+ + F A+++ +
Sbjct: 71 KDIVAFVSLAQELGLMVILRPSVYICAEWEFGGLPAWL-LKEHVRLRSTDPRFIAKVRTY 129
Query: 155 TAKIVDMMKQEKLYASQGGPIILSQIENEYGNIDSAYGAAGKSYIKWAAGMALSLDTGVP 214
+ V + K L + GGP+I+ Q+ENEYG +YG K Y++ + VP
Sbjct: 130 FS--VLLPKLVPLQVTHGGPVIMMQVENEYG----SYGME-KEYLRQTKQVMEEFGIDVP 182
Query: 215 WVMCQQSDAPDPIINTC-------------------NGFYCDQFTPNSNNK-PKMWTENW 254
+ A + +++ N F + K P M E W
Sbjct: 183 --LFTSDGAWEEVLDVGTLIEEDVFVTGNFGSHSKENATVMKAFMAKHDKKWPIMCMEYW 240
Query: 255 SGWFLSFGGAVPYRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNFDRTSGG--------P 306
GWF +G + R +DLA V G N YM+HGGTNF +G P
Sbjct: 241 DGWFNRWGEPIIKRDGQDLANEVKDMLALGSL--NLYMFHGGTNFGFYNGCSARGVLDLP 298
Query: 307 FISTSYDYDAPLDEYGLIRQPKWGHLKDLHKAIKLCEAALVATDP---TYPSLG 357
+ TSYDYDA L E G + K+ H++ +AIK + +P T+ SLG
Sbjct: 299 QV-TSYDYDALLTEAGEPTE-KYFHVQ---RAIKEVCPEVWQAEPRRKTFGSLG 347
Score = 48.9 bits (115), Expect = 0.012, Method: Compositional matrix adjust.
Identities = 74/302 (24%), Positives = 108/302 (35%), Gaps = 67/302 (22%)
Query: 456 KDDAFTKPGLLEQINTTADQSDYLWYSLSTNIKADEPLLEDGSKTVLHVQSLGHALHAFI 515
KD T + I +S Y + S N+K + L V LH F
Sbjct: 361 KDQMMTAQETMYPITMEEAESGYGYMLYSVNLKNYH------HENKLKVVEASDRLHLFA 414
Query: 516 NGKLVGSGYGSSSNAKVTVDFPIALAPGKN--TFDLLSLTVGLQNYGAFYEKTGAGITGP 573
+G L Y + +V I P K D+L +G NYG + GP
Sbjct: 415 DGSLQTIQYQENLGEEVM----IKGTPEKEWIELDVLVENLGRVNYGF-------KLNGP 463
Query: 574 VQLKGSGNGTNIDLSSQQWTYQTGLKGEELNFPSGSSTQWDSKSTLPKLQPLVWYKTTFD 633
Q+KG G D+ Q G + L + + D + QP +Y+ F
Sbjct: 464 TQVKGIRGGIMQDIHFHQ-----GYRQYALTLSADQLKKIDYTAGKNPAQP-SFYQAEFT 517
Query: 634 APAGSEPVAIDFTGMGKGEAWVNGQSIGRYWPTYVSQNGGCTDSCNYRGAYSSNKCLKNC 693
++ ID GKG VNG ++GRYW Q G
Sbjct: 518 LTDLADTF-IDCRSYGKGVVIVNGINLGRYW-----QRG--------------------- 550
Query: 694 GKPSQSLYHVPRSWLKSSGNTLVLFEEIG---------GDPTKISFVTK---QLGSSLCS 741
P SLY P+ +LK N +V+FE G G P +T ++GS++ +
Sbjct: 551 --PIHSLY-CPKEFLKKGTNEIVIFETEGIEINELIFCGQPIVKKLLTNDFLEIGSNIHN 607
Query: 742 HV 743
H+
Sbjct: 608 HI 609
>gi|328958462|ref|YP_004375848.1| beta-galactosidase [Carnobacterium sp. 17-4]
gi|328674786|gb|AEB30832.1| beta-galactosidase [Carnobacterium sp. 17-4]
Length = 589
Score = 164 bits (414), Expect = 3e-37, Method: Compositional matrix adjust.
Identities = 106/314 (33%), Positives = 154/314 (49%), Gaps = 36/314 (11%)
Query: 35 VIGGKRRVLISGSIHYPRSTPEMWPDLIQKSKDGGLDVIETYVFWNLHEPVRNQYNFEGR 94
++ G+ + SG++HY R PE W + K G + +ETY+ WN+HEP +Y F G+
Sbjct: 11 LLNGEPFKITSGAVHYFRVLPEDWYHSLYNLKALGFNTVETYIPWNVHEPKEGEYQFSGQ 70
Query: 95 YDLVKFVKLVAEAGLYAHLRIGPYVCAEWNFGGFPLWLHFIPGIQFRTDNEPFKAEMQRF 154
+D+ KFV+L E GL+ LR PY+CAEW FGG P WL + R+ + F ++ R+
Sbjct: 71 WDIKKFVQLAEELGLFVILRPSPYICAEWEFGGLPAWLLTYKDMLIRSSDPVFIEKVSRY 130
Query: 155 TAKIVDMMKQEKLYASQGGPIILSQIENEYGNIDSAYGAAGKSYIKWAAGMALSLDTGVP 214
+++ + L GGP+I+ Q+ENEYG +YG K Y++ + L L +P
Sbjct: 131 YKELLKQIT--PLQVDHGGPVIMMQLENEYG----SYG-EDKEYLRTLYELMLKLGVTIP 183
Query: 215 -------WVMCQQS-DAPDPIINTCNGF---------YCDQFTPNSNNK-PKMWTENWSG 256
W Q++ D I T F +F + K P M E W G
Sbjct: 184 IFTSDGAWRATQEAGTMTDLDILTTGNFGSRSKENFKELKEFHESKGKKWPLMCMEYWDG 243
Query: 257 WFLSFGGAVPYRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNFDRTSGG--------PFI 308
WF + + R +L V + G N YM+HGGTNF +G P +
Sbjct: 244 WFNRWNDPIIKRDALELTQDVKEALEIGSL--NLYMFHGGTNFGFMNGCSARLRKDLPQV 301
Query: 309 STSYDYDAPLDEYG 322
TSYDYDAPL+E G
Sbjct: 302 -TSYDYDAPLNEQG 314
Score = 50.1 bits (118), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 61/230 (26%), Positives = 84/230 (36%), Gaps = 53/230 (23%)
Query: 495 EDGSKTVLHVQSLGHALHAFINGKLVGSGYGSSSNAKVTVDFPIALAPGKNTFDLLSLTV 554
+D + V LH F+N + + + Y K+ PI+ G N D+L +
Sbjct: 395 KDSDEEFYRVIDGSDRLHFFLNEEKIATQYQEEIGEKIYAS-PIS---GSNQLDVLVENM 450
Query: 555 GLQNYGAFYEKTGAGITGPVQLKGSGNGTNIDLSSQQWTYQTGLKGEELNFPSGSSTQWD 614
G NYG + Q KG G DL + T + L+F S +D
Sbjct: 451 GRVNYGH-------KLLADTQQKGIRRGVMSDLH-----FITNWEQYSLDFSEPLSIDFD 498
Query: 615 S--KSTLPKLQPLVWYKTTFDAPAGSEPVAIDFTGMGKGEAWVNGQSIGRYWPTYVSQNG 672
K P YK T DAP E I+ GKG VNG +IGR+W
Sbjct: 499 KEWKENSPSFYQ---YKVTIDAP---EDTFINMELFGKGIVLVNGFNIGRFW-------- 544
Query: 673 GCTDSCNYRGAYSSNKCLKNCGKPSQSLYHVPRSWLKSSGNTLVLFEEIG 722
N G P+ SLY P S + N +++FE G
Sbjct: 545 -------------------NVG-PTLSLY-APMSLFRKGENEIIVFETEG 573
>gi|325845662|ref|ZP_08168945.1| putative beta-galactosidase [Turicibacter sp. HGF1]
gi|325488263|gb|EGC90689.1| putative beta-galactosidase [Turicibacter sp. HGF1]
Length = 589
Score = 163 bits (413), Expect = 3e-37, Method: Compositional matrix adjust.
Identities = 108/310 (34%), Positives = 157/310 (50%), Gaps = 28/310 (9%)
Query: 35 VIGGKRRVLISGSIHYPRSTPEMWPDLIQKSKDGGLDVIETYVFWNLHEPVRNQYNFEGR 94
++ GK ++SG+IHY R P+ W + K G + +ETYV WNLHE Q++F G
Sbjct: 11 LVDGKPTRIMSGAIHYFRIMPDHWEHSLYNLKALGFNTVETYVPWNLHEMREGQFDFTGG 70
Query: 95 YDLVKFVKLVAEAGLYAHLRIGPYVCAEWNFGGFPLWLHFIPGIQFRTDNEPFKAEMQRF 154
DLV FVK E GL LR GPY+CAEW GG P WL ++ R D+E F +++ +
Sbjct: 71 KDLVSFVKKAEEIGLMVILRPGPYICAEWENGGLPAWLLNYHDMKIRCDDELFLEKVENY 130
Query: 155 TAKIVDMMKQEKLYASQGGPIILSQIENEYGNI--DSAYGAAGKSYIKWAAGMALSL--- 209
++ ++ L ++GGP+I+ Q+ENEYG+ D Y A K I+ AG+ + L
Sbjct: 131 FKVLLPLIV--PLQVTKGGPVIMVQVENEYGSFSNDKLYLRALKKMIE-DAGIDVPLFTS 187
Query: 210 DTGVPWVMCQQSDAPDPIINTCN-------GFYCDQFTPNSNNK--PKMWTENWSGWFLS 260
D + + + ++ T N F Q ++K P M E W GWF
Sbjct: 188 DGAWEQALMSGTLIEEEVLVTANFGSRGNENFDVLQSFMEKHDKKWPLMCMEFWCGWFNR 247
Query: 261 FGGAVPYRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNFDRTSGG--------PFISTSY 312
+ + R +++ + QRG N YM+HGGTNF +G P + TSY
Sbjct: 248 WNEDIILRDADEVMTCMKELLQRGSL--NLYMFHGGTNFGFMNGSCAGKIGNLPQV-TSY 304
Query: 313 DYDAPLDEYG 322
DYDA L E+G
Sbjct: 305 DYDAFLTEWG 314
Score = 43.1 bits (100), Expect = 0.66, Method: Compositional matrix adjust.
Identities = 52/210 (24%), Positives = 81/210 (38%), Gaps = 52/210 (24%)
Query: 524 YGSSSNAKVTVDFPIALAPGKNTFDLLSLTVGLQNYGAFYEKTGAGITGPVQLKGSGNGT 583
Y + + ++ +F + L G++ LL +G NYGA + P Q KG G
Sbjct: 420 YLTQTQEEIGTEFNLPLQ-GEHELSLLVENMGRNNYGA-------RLLAPTQRKGIRGGV 471
Query: 584 NIDLSSQQWTYQTGLKGE---ELNFPSGSSTQWDSKSTLPKLQPLVWYKTTFDAPAGSEP 640
+D + Q L E +++F G W + +Y+ F+A E
Sbjct: 472 MVDHHFETEWVQYALSFETIGDVDFAKG----WIPNTP-------AFYEYEFEAHE-CED 519
Query: 641 VAIDFTGMGKGEAWVNGQSIGRYWPTYVSQNGGCTDSCNYRGAYSSNKCLKNCGKPSQSL 700
+D + +GKG A++N ++GRYW P Q L
Sbjct: 520 TFLDCSTLGKGVAFINDFNLGRYWSV----------------------------GPIQYL 551
Query: 701 YHVPRSWLKSSGNTLVLFEEIGGDPTKISF 730
Y +P LK N LVLFE G +I+
Sbjct: 552 Y-IPGPLLKVGINKLVLFETEGVVAERIAL 580
>gi|293376766|ref|ZP_06622988.1| glycosyl hydrolase family 35 [Turicibacter sanguinis PC909]
gi|292644632|gb|EFF62720.1| glycosyl hydrolase family 35 [Turicibacter sanguinis PC909]
Length = 589
Score = 163 bits (413), Expect = 3e-37, Method: Compositional matrix adjust.
Identities = 108/310 (34%), Positives = 157/310 (50%), Gaps = 28/310 (9%)
Query: 35 VIGGKRRVLISGSIHYPRSTPEMWPDLIQKSKDGGLDVIETYVFWNLHEPVRNQYNFEGR 94
++ GK ++SG+IHY R P+ W + K G + +ETYV WNLHE Q++F G
Sbjct: 11 LVDGKPTRIMSGAIHYFRIMPDHWEHSLYNLKALGFNTVETYVPWNLHEMREGQFDFTGG 70
Query: 95 YDLVKFVKLVAEAGLYAHLRIGPYVCAEWNFGGFPLWLHFIPGIQFRTDNEPFKAEMQRF 154
DLV FVK E GL LR GPY+CAEW GG P WL ++ R D+E F +++ +
Sbjct: 71 KDLVSFVKKAEEIGLMVILRPGPYICAEWENGGLPAWLLNYHDMKIRCDDELFLEKVENY 130
Query: 155 TAKIVDMMKQEKLYASQGGPIILSQIENEYGNI--DSAYGAAGKSYIKWAAGMALSL--- 209
++ ++ L ++GGP+I+ Q+ENEYG+ D Y A K I+ AG+ + L
Sbjct: 131 FKVLLPLIV--PLQVTKGGPVIMVQVENEYGSFSNDKLYLRALKKMIE-DAGIDVPLFTS 187
Query: 210 DTGVPWVMCQQSDAPDPIINTCN-------GFYCDQFTPNSNNK--PKMWTENWSGWFLS 260
D + + + ++ T N F Q ++K P M E W GWF
Sbjct: 188 DGAWEQALMSGTLIEEEVLVTANFGSRGNENFDVLQSFMEKHDKKWPLMCMEFWCGWFNR 247
Query: 261 FGGAVPYRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNFDRTSGG--------PFISTSY 312
+ + R +++ + QRG N YM+HGGTNF +G P + TSY
Sbjct: 248 WNEDIILRDADEVMTCMKELLQRGSL--NLYMFHGGTNFGFMNGSCAGKIGNLPQV-TSY 304
Query: 313 DYDAPLDEYG 322
DYDA L E+G
Sbjct: 305 DYDAFLTEWG 314
Score = 43.1 bits (100), Expect = 0.71, Method: Compositional matrix adjust.
Identities = 52/210 (24%), Positives = 81/210 (38%), Gaps = 52/210 (24%)
Query: 524 YGSSSNAKVTVDFPIALAPGKNTFDLLSLTVGLQNYGAFYEKTGAGITGPVQLKGSGNGT 583
Y + + ++ +F + L G++ LL +G NYGA + P Q KG G
Sbjct: 420 YLTQTQEEIGTEFNLPLQ-GEHELSLLVENMGRNNYGA-------RLLAPTQRKGIRGGV 471
Query: 584 NIDLSSQQWTYQTGLKGE---ELNFPSGSSTQWDSKSTLPKLQPLVWYKTTFDAPAGSEP 640
+D + Q L E +++F G W + +Y+ F+A E
Sbjct: 472 MVDHHFETEWVQYALSFETIGDVDFTKG----WIPNTP-------AFYEYEFEAHE-CED 519
Query: 641 VAIDFTGMGKGEAWVNGQSIGRYWPTYVSQNGGCTDSCNYRGAYSSNKCLKNCGKPSQSL 700
+D + +GKG A++N ++GRYW P Q L
Sbjct: 520 TFLDCSTLGKGVAFINDFNLGRYWSV----------------------------GPIQYL 551
Query: 701 YHVPRSWLKSSGNTLVLFEEIGGDPTKISF 730
Y +P LK N LVLFE G +I+
Sbjct: 552 Y-IPGPLLKVGINKLVLFETEGVVAERIAL 580
>gi|256396208|ref|YP_003117772.1| beta-galactosidase [Catenulispora acidiphila DSM 44928]
gi|256362434|gb|ACU75931.1| Beta-galactosidase [Catenulispora acidiphila DSM 44928]
Length = 625
Score = 163 bits (413), Expect = 3e-37, Method: Compositional matrix adjust.
Identities = 105/329 (31%), Positives = 161/329 (48%), Gaps = 25/329 (7%)
Query: 27 VTYDHRAVVIGGKRRVLISGSIHYPRSTPEMWPDLIQKSKDGGLDVIETYVFWNLHEPVR 86
+T D + GG+ ++S +IHY R P++W D +Q+ + G + +E Y+ WN H+P
Sbjct: 7 LTIDGGRFLRGGREHRIVSAAIHYFRIHPDLWRDRLQRLRAMGCNTVECYIAWNFHQPTP 66
Query: 87 NQYNFEGRYDLVKFVKLVAEAGLYAHLRIGPYVCAEWNFGGFPLWLHFIPGIQFRTDNEP 146
F+G D+ FV+L E G R GPY+CAEW+FGG P WL ++ RT +
Sbjct: 67 AAPRFDGWRDVAGFVRLAGELGFDVIARPGPYICAEWDFGGLPAWLLADENVRLRTTDPV 126
Query: 147 FKAEMQRFTAKIVDMMKQEKLYASQGGPIILSQIENEYGNI--DSAY-GAAGKSYIKWAA 203
+ A + + +++ ++ + L A++GGP++ QIENEYG+ D Y K I+
Sbjct: 127 YLAAVDAWFDELIPVLAE--LQATRGGPVVAVQIENEYGSFGADPDYLDHLRKGLIERGV 184
Query: 204 GMALSLDTGVPWVMCQQSDAPDPIINTCNGFYCDQ----FTPNSNNKPKMWTENWSGWFL 259
L G +M PD + G D+ + P + E W+GWF
Sbjct: 185 DTLLFTSDGPQELMLAGGTVPDVLATVNFGSRADEAFATLRRVRPDDPPVCMEFWNGWFD 244
Query: 260 SFGGAVPYRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNFDRTSGG-------------P 306
FG R +D A ++ GG+ N+YM HGGTNF +G P
Sbjct: 245 HFGEPHHTRSAQDAARSLDEILAAGGSV-NFYMGHGGTNFGFWAGANHSGVGTGDPGYQP 303
Query: 307 FISTSYDYDAPLDEYGLIRQPKWGHLKDL 335
I TSYDYDAP+ E G + PK+ +++
Sbjct: 304 TI-TSYDYDAPVGEAGEL-TPKFHLFREV 330
>gi|348573619|ref|XP_003472588.1| PREDICTED: beta-galactosidase-1-like protein 2-like [Cavia
porcellus]
Length = 880
Score = 163 bits (413), Expect = 3e-37, Method: Compositional matrix adjust.
Identities = 119/316 (37%), Positives = 155/316 (49%), Gaps = 37/316 (11%)
Query: 43 LISGSIHYPRSTPEMWPDLIQKSKDGGLDVIETYVFWNLHEPVRNQYNFEGRYDLVKFVK 102
+ GSIHY R E W D + K K GL+ + TYV WNLHEP R +++F G DL FV
Sbjct: 307 IFGGSIHYFRVPREYWRDRLLKLKACGLNTLTTYVPWNLHEPERGKFDFSGNLDLEAFVL 366
Query: 103 LVAEAGLYAHLRIGPYVCAEWNFGGFPLWLHFIPGIQFRTDNEPFKAEMQRFTAKIVDMM 162
L AE GL+ LR GPY+CAE + GG P WL PG++ RT + F + + + M
Sbjct: 367 LAAEIGLWVILRPGPYICAEIDLGGLPSWLLQDPGMKLRTTYQGFTEAVDLYFDHL--MS 424
Query: 163 KQEKLYASQGGPIILSQIENEYG--NIDSAYGAAGKSYIKWAAGMALSLDTGVPWVMCQQ 220
+ L GGPII Q+ENEYG N D AY YIK A D G+ ++
Sbjct: 425 RVVPLQYKHGGPIIAVQVENEYGSYNRDPAY----MPYIKKALE-----DRGIIELLL-T 474
Query: 221 SDAPDPI-----------INTCNGFYCDQFTPN----SNNKPKMWTENWSGWFLSFGGAV 265
SD D + IN + T + N+PKM E W+GWF S+GG
Sbjct: 475 SDNKDGLQKGVVHGVLATINLQSQQELQSLTTSLLSVQGNQPKMVMEYWTGWFDSWGGPH 534
Query: 266 PYRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNFDRTSGGPFIS------TSYDYDAPLD 319
++ V+ G + N YM+HGGTNF +G + TSYDYDA L
Sbjct: 535 NILDSSEVLDTVSAITNAGSSI-NLYMFHGGTNFGFINGAMHFNDYKSDVTSYDYDAVLT 593
Query: 320 EYGLIRQPKWGHLKDL 335
E G K+G L+D
Sbjct: 594 EAGDYTA-KYGKLRDF 608
>gi|330997880|ref|ZP_08321714.1| putative beta-galactosidase [Paraprevotella xylaniphila YIT 11841]
gi|329569484|gb|EGG51254.1| putative beta-galactosidase [Paraprevotella xylaniphila YIT 11841]
Length = 786
Score = 163 bits (413), Expect = 3e-37, Method: Compositional matrix adjust.
Identities = 107/337 (31%), Positives = 168/337 (49%), Gaps = 37/337 (10%)
Query: 20 TTSFGANVTYDHRAVVIGGKRRVLISGSIHYPRSTPEMWPDLIQKSKDGGLDVIETYVFW 79
T +FG ++ ++ GK ++ + +HYPR W I+ K G++ + YVFW
Sbjct: 33 TETFGVG----NKTFLLNGKPFIIKAAEVHYPRIPRPYWEQRIKMCKALGMNTLCLYVFW 88
Query: 80 NLHEPVRNQYNFEGRYDLVKFVKLVAEAGLYAHLRIGPYVCAEWNFGGFPLWLHFIPGIQ 139
N+HE +++F G D+ +F++L E GLY +R GPYVCAEW GG P WL I+
Sbjct: 89 NIHEQEEGKFDFTGNNDVAEFIRLAQENGLYVIVRPGPYVCAEWEMGGLPWWLLKKKDIR 148
Query: 140 FRTDNEPFKAEMQRFTAKIVDMMKQEKLYASQGGPIILSQIENEYGNI--DSAYGAAGKS 197
R + F + F K+ + + L +GGPII+ Q+ENEYG+ D Y + +
Sbjct: 149 LREQDPYFMERYRIFAKKLGEQIGD--LTIEKGGPIIMVQVENEYGSYGEDKPYVSGIRD 206
Query: 198 YIK-------------WAAGMALSLDTGVPWVMCQQSDAPDPIINTCNGF-YCDQFTPNS 243
I+ W++ + + W M + A N N F + P S
Sbjct: 207 IIRDSGFDKVTLFQCDWSSNFTKNGLDDLVWTMNFGTGA-----NIENEFKKLGELRPES 261
Query: 244 NNKPKMWTENWSGWFLSFGGAVPYRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNFDRTS 303
P+M +E WSGWF +GG R +++ + +G +F + YM HGGT++ +
Sbjct: 262 ---PQMCSEFWSGWFDKWGGRHETRGSKEMVGGLKEMLDKGISF-SLYMTHGGTSWGHWA 317
Query: 304 GG--PFIS---TSYDYDAPLDEYGLIRQPKWGHLKDL 335
G P S TSYDYDAP++E G + PK+ L+++
Sbjct: 318 GANSPGFSPDVTSYDYDAPINEAGQV-TPKYMELREM 353
Score = 55.8 bits (133), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 67/280 (23%), Positives = 111/280 (39%), Gaps = 59/280 (21%)
Query: 498 SKTVLHVQSLGHALHAFINGKLVGSGYGSSSNAKVTVDFPIALAPGKNTFDLLSLTVGLQ 557
++++L + FINGKL+GS N + T+ P A+ G + D+L +G
Sbjct: 422 TQSILTITDAHDFAQVFINGKLIGSI--DRRNHEKTMLLP-AMKEG-DQLDILVEAMGRI 477
Query: 558 NYGAFYEKTGAGITGPVQLKGSGN-GTNIDLSSQQWTYQTGLKGEELNFPSGSSTQWDSK 616
N+G K GIT V+L + N G+ + ++ + W T Q D K
Sbjct: 478 NFGRAI-KDFKGITEKVELSYTMNTGSQVTVNLKNWQIYT--------LSDSYQVQKDMK 528
Query: 617 STLPKLQPLV-WYKTTFDAPAGSEPVAIDFTGMGKGEAWVNGQSIGRYWPTYVSQNGGCT 675
K Q + Y+ TF+ + ++ GKG+ +VNG +IGR+W
Sbjct: 529 YVPLKDQKVPGCYRATFNLKKTGD-TFLNLETWGKGQVYVNGHAIGRFWKI--------- 578
Query: 676 DSCNYRGAYSSNKCLKNCGKPSQSLYHVPRSWLKSSGNTLVLFEEIGGDPTKISFVTKQL 735
P Q+LY +P WLK N +++ + +G T + ++K +
Sbjct: 579 -------------------GPQQTLY-MPGCWLKKGENEIIVQDIVGPQETVVEGLSKPI 618
Query: 736 GSSLCSHVTDSHPLPVDMWGSDSKIQRKPGPVLSLECPNP 775
L ++H RK G L+L P
Sbjct: 619 IDKLNVDAPNTH--------------RKEGQTLNLAGETP 644
>gi|325297293|ref|YP_004257210.1| glycoside hydrolase family protein [Bacteroides salanitronis DSM
18170]
gi|324316846|gb|ADY34737.1| glycoside hydrolase family 35 [Bacteroides salanitronis DSM 18170]
Length = 784
Score = 163 bits (413), Expect = 3e-37, Method: Compositional matrix adjust.
Identities = 109/317 (34%), Positives = 160/317 (50%), Gaps = 26/317 (8%)
Query: 35 VIGGKRRVLISGSIHYPRSTPEMWPDLIQKSKDGGLDVIETYVFWNLHEPVRNQYNFEGR 94
++ G+ V+ + +HYPR W I++ K G++ I YVFWN HE +++F G+
Sbjct: 41 LLNGEPFVVKAAELHYPRIPRAYWEHRIKQCKALGMNTICLYVFWNFHEEKPGEFDFTGQ 100
Query: 95 YDLVKFVKLVAEAGLYAHLRIGPYVCAEWNFGGFPLWLHFIPGIQFRTDNEPFKAEMQRF 154
DL +F +L + +Y LR GPYVCAEW GG P WL I+ R D+ F + F
Sbjct: 101 KDLAEFCRLCQKNDMYVILRPGPYVCAEWEMGGLPWWLLKKKDIRLREDDPYFLERVAIF 160
Query: 155 TAKIVDMMKQEKLYASQGGPIILSQIENEYGNIDSAYGAAGKSYIKWAAGMALSLDTGVP 214
++ + + L +GGPII+ Q+ENEYG +YG + K Y+ + V
Sbjct: 161 EKEVANQVA--GLTIQKGGPIIMVQVENEYG----SYGES-KEYVAKIRDIVRGNFGDVT 213
Query: 215 WVMCQ-----QSDAPDPIINTCN---GFYCD-QFTPNSNNKPK---MWTENWSGWFLSFG 262
C Q +A D ++ T N G D QF P +P M +E WSGWF +G
Sbjct: 214 LFQCDWASNFQLNALDDLVWTMNFGTGANIDEQFAPLKKVRPDSPLMCSEFWSGWFDKWG 273
Query: 263 GAVPYRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNFDRTSGG--PFIS---TSYDYDAP 317
R +D+ + +G +F + YM HGGTN+ +G P + TSYDYDAP
Sbjct: 274 ANHETRAADDMIAGIDEMLSKGISF-SLYMTHGGTNWGHWAGANSPGFAPDVTSYDYDAP 332
Query: 318 LDEYGLIRQPKWGHLKD 334
+ E G I PK+ L++
Sbjct: 333 ISESGKI-TPKYEKLRE 348
Score = 48.9 bits (115), Expect = 0.013, Method: Compositional matrix adjust.
Identities = 72/280 (25%), Positives = 112/280 (40%), Gaps = 58/280 (20%)
Query: 496 DGSKTVLHVQSLGHALHAFINGKLVGSGYGSSSNAKVTVDFPIALAPGKNTFDLLSLTVG 555
D S T+ ++ +A FI+GK +G N + +D P A A G D+L +G
Sbjct: 418 DRSATLTVTEAHDYA-QIFIDGKYIGKL--DRRNGEKQLDIP-ACAEGAQ-LDILVEAMG 472
Query: 556 LQNYGAFYEKTGAGITGPVQLKGSGNGTNI------DLSSQQWTYQTGLKGEELNFPSGS 609
N+G K GIT V+LK G T + +L + Y+ GLK E L +
Sbjct: 473 RINFGRAI-KDFKGITEKVELKNGGRTTELKGWKVYNLEDRYEGYK-GLKFEPLKSVKDA 530
Query: 610 STQWDSKSTLPKLQPLVWYKTTFDAPAGSEPVAIDFTGMGKGEAWVNGQSIGRYWPTYVS 669
Q +P Y+ TF + ++F GKG +VNG IGR W
Sbjct: 531 QGQ-----RVPGC-----YRATFHVEKPGDTF-LNFETWGKGLVYVNGYGIGRIWEI--- 576
Query: 670 QNGGCTDSCNYRGAYSSNKCLKNCGKPSQSLYHVPRSWLKSSGNTLVLFEEIGGDPTKIS 729
P Q+LY +P WLK N +++F+ +G +
Sbjct: 577 -------------------------GPQQTLY-MPGCWLKEGENEILVFDIVGPKEAR-- 608
Query: 730 FVTKQLGSSLCSHVTDSHPLPVDMWGSDSKIQRKPGPVLS 769
T+ L + + + + PL G + ++ + PVLS
Sbjct: 609 --TEGLEEPILNQLLVNKPLTHRNEGEELRLAGET-PVLS 645
>gi|323449959|gb|EGB05843.1| hypothetical protein AURANDRAFT_66064 [Aureococcus anophagefferens]
Length = 1630
Score = 163 bits (413), Expect = 3e-37, Method: Compositional matrix adjust.
Identities = 120/363 (33%), Positives = 172/363 (47%), Gaps = 48/363 (13%)
Query: 26 NVTYDHRAVVIGGKRRVLISGSIHYPRSTPEMWPDLIQKSKDGGLDVIETYVFWNLHEPV 85
++ D R++++ G R +L+SGSIHYPRSTP MWP L +++ GL+ IE+Y FWN H
Sbjct: 1037 SIARDGRSLLVNGSRVLLLSGSIHYPRSTPAMWPKLFAEARANGLNAIESYAFWNKHSAT 1096
Query: 86 R---NQYNFEGRYDLVKFVKLVAEAGLYAHLRIGPYVCAEWNFGGFP------------L 130
R Y F G DL F+ L AE L+ R GPYVCAEW GG P
Sbjct: 1097 RYGAYDYGFNGDVDL--FLSLAAEHDLFVLWRFGPYVCAEWPAGGIPARAPRRAVFASNA 1154
Query: 131 WLHFIPGIQFRTDNEPFKAEMQRFTAKIVDMMKQEKLYASQGGPIILSQIENEYGNIDSA 190
W+H +PG++ RT+N + E R+ + D + + S+ G ++IENEYG S
Sbjct: 1155 WIHDVPGMKTRTNNTAWLNETGRW---MRDHFAVIEPHLSRNG--ASNRIENEYGGSKSD 1209
Query: 191 YGAAGKSYIKWAAGMALSLDTGVPWVMCQQSD--APDPIINTCNGFYCDQ--------FT 240
A A A++ + + W+MC APD ++T NG DQ
Sbjct: 1210 AAAVAYVDALDALADAVAPE--LVWMMCGFVSLVAPD-ALHTGNGCPHDQGPASAHVVVP 1266
Query: 241 PNSNNKPKMWTENWSGWFLSFGGAVPYRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNFD 300
P P +TE+ W+ ++G RP D+A+ VA + GG N+YM+HGG ++
Sbjct: 1267 PAPGADPAWYTED-ELWYDAWGLPSLARPPADVAYGVASYVATGGAMHNFYMWHGGNHYG 1325
Query: 301 RTS------GG------PFISTSYDYDAPLDEYGLIRQPKWGHLKDLHKAIKLCEAALVA 348
S GG P Y APL G +P + HL +H + L+
Sbjct: 1326 NWSTATPDLGGASSPEPPASQVRYANAAPLRSDGSRHEPLFSHLAAVHGTLDAYAEVLLG 1385
Query: 349 TDP 351
P
Sbjct: 1386 ATP 1388
>gi|322437493|ref|YP_004219583.1| glycoside hydrolase family protein [Granulicella tundricola
MP5ACTX9]
gi|321165386|gb|ADW71089.1| glycoside hydrolase family 35 [Granulicella tundricola MP5ACTX9]
Length = 607
Score = 163 bits (413), Expect = 3e-37, Method: Compositional matrix adjust.
Identities = 102/328 (31%), Positives = 156/328 (47%), Gaps = 40/328 (12%)
Query: 26 NVTYDHRAVVIGGKRRVLISGSIHYPRSTPEMWPDLIQKSKDGGLDVIETYVFWNLHEPV 85
+T D + ++ G+ LISG +HYPR W D ++K++ GL+ + Y FWN HE
Sbjct: 25 RLTTDPQHFLLDGQPFQLISGEMHYPRIPRAAWRDRLRKARAMGLNAVTVYAFWNFHEEE 84
Query: 86 RNQYNFEGRYDLVKFVKLVAEAGLYAHLRIGPYVCAEWNFGGFPLWLHFIPGIQFRTDNE 145
++F G+ D+ +FV++ + GL+ LR GPYVCAEW+ GG+P WL P + R+ +
Sbjct: 85 EGHFDFTGQRDIAEFVRIAQQEGLFVILRPGPYVCAEWDLGGYPSWLLKSPAVNLRSLDS 144
Query: 146 PFKAEMQRFTAKIVDMMKQEKLYASQGGPIILSQIENEYGNIDSAYGAAGKSYIKWAAGM 205
+ A ++ + + L A++GGPI+ Q+ENEYG+ + ++Y+ M
Sbjct: 145 RYIAAADKWMKALGQQLA--PLQAAKGGPILAVQVENEYGSFPDSAQPNAQAYLDRVHQM 202
Query: 206 ALSLDTGVPWVMCQQSDAPDPIINTCNGFYCD--------------------QFTPNSNN 245
LD G + D D + G + D +F PN+N
Sbjct: 203 V--LDAGFKDSLLYTGDGADVL---ARGTFADLTAGIDYGTGDSARSIALYKKFRPNTN- 256
Query: 246 KPKMWT-ENWSGWFLSFGGAVPYRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNFDRTSG 304
++T E W GWF +G V GG+ + YM HGGT+F +G
Sbjct: 257 ---IYTAEYWDGWFDHWGAKHEVVDASIHLKEVHDVLTSGGSI-SLYMLHGGTSFGWMNG 312
Query: 305 GPFIS-------TSYDYDAPLDEYGLIR 325
TSYDYDAP+DE G +R
Sbjct: 313 ANIDHNHYEPDVTSYDYDAPIDEAGQLR 340
>gi|153806012|ref|ZP_01958680.1| hypothetical protein BACCAC_00257 [Bacteroides caccae ATCC 43185]
gi|149130689|gb|EDM21895.1| glycosyl hydrolase family 35 [Bacteroides caccae ATCC 43185]
Length = 774
Score = 163 bits (413), Expect = 4e-37, Method: Compositional matrix adjust.
Identities = 114/338 (33%), Positives = 164/338 (48%), Gaps = 38/338 (11%)
Query: 7 LLLVLCWGFVVLATTSFGANVTYDHRAVVIGGKRRVLISGSIHYPRSTPEMWPDLIQKSK 66
+L L F+ LA +S + D + GK LI G +HY R E W D +++++
Sbjct: 10 FILGLLMPFLFLACSS-KERIKIDGGTFNVDGKDVQLICGEMHYARIPHEYWRDRLKRAR 68
Query: 67 DGGLDVIETYVFWNLHEPVRNQYNFEGRYDLVKFVKLVAEAGLYAHLRIGPYVCAEWNFG 126
GL+ I YVFWN HE +++F G+ D+ +FV+L E GLY LR GPY CAEW+FG
Sbjct: 69 AMGLNTISVYVFWNFHERQPGEFDFSGQADVAEFVRLAQEEGLYVILRPGPYACAEWDFG 128
Query: 127 GFPLWLHFIPGIQFRTDNEPFKAEMQRFTAKIVDMMKQEKLYASQGGPIILSQIENEYGN 186
G+P WL + +R+ + F +R+ + + L + GG I++ Q+ENEYG+
Sbjct: 129 GYPSWLLKEKDMVYRSKDPRFLEYCERYIKALGKQLA--PLTVNNGGNILMVQVENEYGS 186
Query: 187 IDSAYGAAGKSYIKWAAGMALSLDTGVPWVMCQ-----QSDAPDPIINTCNGFY------ 235
AA K Y+ M VP C ++ D + T NG +
Sbjct: 187 Y-----AADKEYLAALRDMIKDAGFNVPLFTCDGGGQVEAGHIDGALPTLNGVFSEDIFK 241
Query: 236 -CDQFTPNSNNKPKMWTENWSGWFLSFG---GAVPY-RPVEDLAFAVARFFQRGGTFQNY 290
D++ P P E + WF +G V Y RP E L + + + G +
Sbjct: 242 IIDKYHPGG---PYFVAEFYPAWFDVWGQRHSTVDYKRPAEQLDWMLGQ-----GVSVSM 293
Query: 291 YMYHGGTNF-----DRTSGGPFIS-TSYDYDAPLDEYG 322
YM+HGGTNF T+GG TSYDYDAPL E+G
Sbjct: 294 YMFHGGTNFWYMNGANTAGGYRPQPTSYDYDAPLGEWG 331
>gi|423219555|ref|ZP_17206051.1| hypothetical protein HMPREF1061_02824 [Bacteroides caccae
CL03T12C61]
gi|392624760|gb|EIY18838.1| hypothetical protein HMPREF1061_02824 [Bacteroides caccae
CL03T12C61]
Length = 774
Score = 163 bits (412), Expect = 4e-37, Method: Compositional matrix adjust.
Identities = 114/338 (33%), Positives = 164/338 (48%), Gaps = 38/338 (11%)
Query: 7 LLLVLCWGFVVLATTSFGANVTYDHRAVVIGGKRRVLISGSIHYPRSTPEMWPDLIQKSK 66
+L L F+ LA +S + D + GK LI G +HY R E W D +++++
Sbjct: 10 FILGLLMPFLFLACSS-KERIKIDGGTFNVDGKDVQLICGEMHYARIPHEYWRDRLKRAR 68
Query: 67 DGGLDVIETYVFWNLHEPVRNQYNFEGRYDLVKFVKLVAEAGLYAHLRIGPYVCAEWNFG 126
GL+ I YVFWN HE +++F G+ D+ +FV+L E GLY LR GPY CAEW+FG
Sbjct: 69 AMGLNTISVYVFWNFHERQPGEFDFSGQADVAEFVRLAQEEGLYVILRPGPYACAEWDFG 128
Query: 127 GFPLWLHFIPGIQFRTDNEPFKAEMQRFTAKIVDMMKQEKLYASQGGPIILSQIENEYGN 186
G+P WL + +R+ + F +R+ + + L + GG I++ Q+ENEYG+
Sbjct: 129 GYPSWLLKEKDMVYRSKDPRFLEYCERYIKALGKQLA--PLTVNNGGNILMVQVENEYGS 186
Query: 187 IDSAYGAAGKSYIKWAAGMALSLDTGVPWVMCQ-----QSDAPDPIINTCNGFY------ 235
AA K Y+ M VP C ++ D + T NG +
Sbjct: 187 Y-----AADKEYLAALRDMIKDAGFNVPLFTCDGGGQVEAGHIDGALPTLNGVFSEDIFK 241
Query: 236 -CDQFTPNSNNKPKMWTENWSGWFLSFG---GAVPY-RPVEDLAFAVARFFQRGGTFQNY 290
D++ P P E + WF +G V Y RP E L + + + G +
Sbjct: 242 IIDKYHPGG---PYFVAEFYPAWFDVWGQRHSTVDYKRPAEQLDWMLGQ-----GVSVSM 293
Query: 291 YMYHGGTNF-----DRTSGGPFIS-TSYDYDAPLDEYG 322
YM+HGGTNF T+GG TSYDYDAPL E+G
Sbjct: 294 YMFHGGTNFWYMNGANTAGGYRPQPTSYDYDAPLGEWG 331
>gi|311281324|ref|YP_003943555.1| glycoside hydrolase [Enterobacter cloacae SCF1]
gi|308750519|gb|ADO50271.1| glycoside hydrolase family 35 [Enterobacter cloacae SCF1]
Length = 591
Score = 163 bits (412), Expect = 4e-37, Method: Compositional matrix adjust.
Identities = 99/306 (32%), Positives = 149/306 (48%), Gaps = 30/306 (9%)
Query: 38 GKRRVLISGSIHYPRSTPEMWPDLIQKSKDGGLDVIETYVFWNLHEPVRNQYNFEGRYDL 97
GK LISG+IHY R P+ W + K G + +ETY+ WN+H+P ++ F G D+
Sbjct: 14 GKPVQLISGAIHYFRLVPQYWEHSLNNLKALGANCVETYLPWNIHQPDPERFCFTGMADV 73
Query: 98 VKFVKLVAEAGLYAHLRIGPYVCAEWNFGGFPLWLHFIPGIQFRTDNEPFKAEMQRFTAK 157
+F+ L GL+ LR PY+CAEW FGG P WL P ++ R+ F ++R+ A+
Sbjct: 74 ERFIALAQRKGLFVILRPSPYICAEWEFGGLPAWLLRDPSMRVRSSQPAFLQAVERYYAE 133
Query: 158 IVDMMKQEKLYASQGGPIILSQIENEYGNIDSAYGAAGKSYIKWAAGMALSLDTGVP--- 214
++ + + +GGP+++ Q+ENEYG+ + K+Y++ A M VP
Sbjct: 134 LLPRLAPWQY--DRGGPVVMMQLENEYGSFGN-----DKAYLRTLAAMMRRYGVSVPLFT 186
Query: 215 ----WVMCQQSDA--PDPIINTCN-----GFYCDQFTPNSNNKPKMWTENWSGWFLSFGG 263
W Q+ + D ++ T N D +P M E W+GWF +G
Sbjct: 187 SDGAWQEALQAGSLCEDNVLATANFGSRSAESLDNLAAFQPERPLMCLEFWNGWFNRYGD 246
Query: 264 AVPYRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNFDRTSGGPFIS-------TSYDYDA 316
A+ R +D+ + R N YM+ GGTNF +G TSYDYDA
Sbjct: 247 AIIRRDADDVGQEIRTLLTRASI--NIYMFQGGTNFGFMNGCSVRGDKDLPQVTSYDYDA 304
Query: 317 PLDEYG 322
L E+G
Sbjct: 305 LLSEWG 310
Score = 54.3 bits (129), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 61/235 (25%), Positives = 94/235 (40%), Gaps = 57/235 (24%)
Query: 503 HVQSLGHALHAFINGKLVGSGYGSSSNAKVTVDFPIALAPGKNTFDLLSLTVGLQNYGAF 562
V G + + NG+ + + Y + ++ P AL N DLL +G NYG
Sbjct: 399 RVVDAGDRVQFYCNGEHLATQY----HEQIGEQIPFALREADNVLDLLIENMGRVNYGP- 453
Query: 563 YEKTGAGITGPVQLKGSGNGTNIDLSSQ-QW-TYQTGLKG-EELNFPSGSSTQWDSKSTL 619
+ P Q KG G IDL + W + L ++++F +G Q
Sbjct: 454 ------RLLAPTQRKGLRGGLVIDLHLETDWDIFPLPLDNIDDVDFSAGWQPQ------- 500
Query: 620 PKLQPLVW-YKTTFDAPAGSEPVAIDFTGMGKGEAWVNGQSIGRYWPTYVSQNGGCTDSC 678
QP + Y D+PA + +D +GKG A++NG ++GRYW
Sbjct: 501 ---QPAFYEYCFAIDSPADT---FLDTRSLGKGVAFINGFNLGRYW-------------- 540
Query: 679 NYRGAYSSNKCLKNCGKPSQSLYHVPRSWLKSSGNTLVLFEEIGGDPTKISFVTK 733
YRG P LY +P LK N L++FE G + ++ + K
Sbjct: 541 -YRG-------------PLGYLY-IPAPLLKQGENRLIIFETEGVEVGALALLNK 580
>gi|115361550|gb|ABI95864.1| beta-galactosidase [Planococcus sp. L4]
Length = 552
Score = 163 bits (412), Expect = 4e-37, Method: Compositional matrix adjust.
Identities = 110/313 (35%), Positives = 156/313 (49%), Gaps = 25/313 (7%)
Query: 48 IHYPRSTPEMWPDLIQKSKDGGLDVIETYVFWNLHEPVRNQYNFEGRYDLVKFVKLVAEA 107
+HY R+ PE W D +QK K GL+ +ETY+ WN HEP + Q++F G D+ F++L
Sbjct: 1 MHYFRTVPEQWEDRLQKLKALGLNTVETYIPWNFHEPKKGQFHFSGMADIEGFIELAHRL 60
Query: 108 GLYAHLRIGPYVCAEWNFGGFPLWLHFIPGIQFRTDNEPFKAEMQRFTAKIVDMMKQEKL 167
GLY LR PY+CAEW GG P WL + R+ + F ++ + A++ + K K
Sbjct: 61 GLYVILRPAPYICAEWEMGGLPSWLMKDKNLVLRSSDPAFLGHVEDYFAEL--LPKFTKH 118
Query: 168 YASQGGPIILSQIENEYGNI--DSAYGAAGKSYIKWAAGMALSLDTGVPWVMCQQSDAPD 225
GGP+I QIENEYG DSAY K+ + + P + Q S PD
Sbjct: 119 LYQNGGPVIAMQIENEYGAYGNDSAYLDFFKAQYEHHGLNTFLFTSDGPDFITQGS-MPD 177
Query: 226 PIINTCNGFYCDQ-------FTPNSNNKPKMWTENWSGWFLSFGGAVPYRPVEDLAFAVA 278
G D+ F P+S PKM E W GWF + G R +D+A
Sbjct: 178 VTTTLNFGSRVDESFQALDAFKPDS---PKMVAEFWIGWFDYWSGEHTVRSGDDVASVFK 234
Query: 279 RFFQRGGTFQNYYMYHGGTNFDRTSGG-------PFISTSYDYDAPLDEYGLIRQPKWGH 331
++ + N+YM+HGGTNF +G P I TSYDYD+ L E G I + K+
Sbjct: 235 EIMEKNISV-NFYMFHGGTNFGFMNGANHYDIYYPTI-TSYDYDSLLTEGGAITE-KYKA 291
Query: 332 LKDLHKAIKLCEA 344
+K++ + + A
Sbjct: 292 VKEVLREYREVPA 304
Score = 55.8 bits (133), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 56/206 (27%), Positives = 84/206 (40%), Gaps = 52/206 (25%)
Query: 514 FINGKLVGSGYGSSSNAKVTVDFPIALAPGKNTFDLLSLTVGLQNYGAFYEKTGAGITGP 573
+ING+ V + Y + +T+DFP A+ NT ++L +G NYG +T P
Sbjct: 381 YINGRHVATSYINDEEKMLTLDFPEAV----NTLEILVENMGRANYGEH-------LTDP 429
Query: 574 VQLKGSGNGTNIDLSSQQWTYQTGLKGEELNFPSGSSTQWDSKSTLPKLQPLVWYKTTFD 633
KG N N+ L Q + + K E P D + PK +++ +FD
Sbjct: 430 ---KGLVN--NLWLGEQYFFHWDMFKVELEQLPQSYGAGEDPR--FPK-----FFRGSFD 477
Query: 634 APAGSEPVAIDFTGMGKGEAWVNGQSIGRYWPTYVSQNGGCTDSCNYRGAYSSNKCLKNC 693
A G + +D G KG ++NG ++GRYW N
Sbjct: 478 AEEGLDSY-VDTHGFTKGNVFINGFNLGRYW---------------------------NT 509
Query: 694 GKPSQSLYHVPRSWLKSSGNTLVLFE 719
P Q LY +P LK N +V+ E
Sbjct: 510 AGPQQRLY-LPGPLLKKQHNEIVVLE 534
>gi|157106611|ref|XP_001649403.1| beta-galactosidase [Aedes aegypti]
gi|108879822|gb|EAT44047.1| AAEL004580-PA [Aedes aegypti]
Length = 656
Score = 163 bits (412), Expect = 4e-37, Method: Compositional matrix adjust.
Identities = 111/325 (34%), Positives = 161/325 (49%), Gaps = 39/325 (12%)
Query: 27 VTYDHRAVVIGGKRRVLISGSIHYPRSTPEMWPDLIQKSKDGGLDVIETYVFWNLHEPVR 86
+ YD V+ GK ++GS HY R+ P+ W ++ + GGL+ ++ YV W+LH P
Sbjct: 45 IDYDRDTFVMDGKDFRYVAGSFHYFRALPQTWRTKLKTLRAGGLNAVDLYVQWSLHNPKE 104
Query: 87 NQYNFEGRYDLVKFVKLVAEAGLYAHLRIGPYVCAEWNFGGFPLWLHF-IPGIQFRTDNE 145
NQY ++G ++ ++ EA LY LR GPY+CAE + GG P WL PGIQ RT +
Sbjct: 105 NQYVWDGIANIKDVIEAAIEADLYVILRPGPYICAEIDNGGLPYWLFTKYPGIQVRTSDA 164
Query: 146 PFKAEMQRFTAKIVDMMKQEKLYA-SQGGPIILSQIENEYGNIDSAYGAAGKSYI----- 199
+ E+ + K +M Q Y GGPII+ Q+ENEYG A+G K Y+
Sbjct: 165 NYLKEVATWYEK---LMSQLTPYMYGNGGPIIMVQLENEYG----AFGKCDKPYLNFLKE 217
Query: 200 ---KWAAGMALSLDTGVPW---VMCQQSDAPDPIINTCNGFYCDQFTPNSN--------N 245
K+ G A+ P+ + C Q P + T G D+ N
Sbjct: 218 ETEKYTQGKAVLFTVDRPYGNEMECGQ--VPGVFVTTDFGLMTDEEVDTHKAKLRSVQPN 275
Query: 246 KPKMWTENWSGWFLSFGGAVPYRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNFDRTSG- 304
P + TE ++GW + + RP E LA + + G ++YMY GGTNF +G
Sbjct: 276 GPLVNTEFYTGWLTHWQESNQRRPAEPLANTLRKMLHDGWNV-DFYMYFGGTNFGFWAGA 334
Query: 305 -----GPFIS--TSYDYDAPLDEYG 322
G +++ TSYDYDAP+DE G
Sbjct: 335 NDWGLGKYMADITSYDYDAPMDEAG 359
>gi|257899628|ref|ZP_05679281.1| glycosyl hydrolase [Enterococcus faecium Com15]
gi|257837540|gb|EEV62614.1| glycosyl hydrolase [Enterococcus faecium Com15]
Length = 595
Score = 163 bits (412), Expect = 4e-37, Method: Compositional matrix adjust.
Identities = 128/401 (31%), Positives = 188/401 (46%), Gaps = 50/401 (12%)
Query: 35 VIGGKRRVLISGSIHYPRSTPEMWPDLIQKSKDGGLDVIETYVFWNLHEPVRNQYNFEGR 94
++ G +ISG+IHY R P W + K G + +ETY+ WNLHEP ++F G
Sbjct: 11 LVDGIPTKIISGAIHYFRIPPSQWEHSLYNLKALGANTVETYIPWNLHEPQEGSFDFSGF 70
Query: 95 YDLVKFVKLVAEAGLYAHLRIGPYVCAEWNFGGFPLWLHFIPGIQFRTDNEPFKAEMQRF 154
D+V+FVK+ E L LR Y+CAEW FGG P WL P I+ R+ + F +++ +
Sbjct: 71 KDIVQFVKIAQELDLMVILRPCAYICAEWEFGGLPAWLLKEPDIRVRSTDPRFMEKLKNY 130
Query: 155 TAKIVDMMKQEKLYASQGGPIILSQIENEYGNIDSAYGAAGKSYIKWAAGMALSLDTGVP 214
V + K L +QGGP+I+ Q+ENEYG +YG KSY++ + L+ VP
Sbjct: 131 YQ--VLLPKLAPLQITQGGPVIMMQLENEYG----SYGME-KSYLRQTKELMLAHSIDVP 183
Query: 215 -------WV-MCQQSDAPDPIINTCNGF---------YCDQFTPN-SNNKPKMWTENWSG 256
W+ + D I F +F N N P M E W G
Sbjct: 184 LFTSDGAWLEVLDAGTLIDEDIFVTGNFGSHSKENAQVLKEFMQNHQKNWPIMCMEYWDG 243
Query: 257 WFLSFGGAVPYRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNFDRTSGG--------PFI 308
WF +G + R E+LA V + G N YM+HGGTNF +G P I
Sbjct: 244 WFNRWGEPIITRDPEELATEVKEMLEIGSL--NLYMFHGGTNFGFYNGCSARGNTDLPQI 301
Query: 309 STSYDYDAPLDEYGLIRQPKWGHLKDLHKAIKLCEAALVATDP---TYPSLGP---NLEA 362
TSYDYDA L+E G QP + + + IK ++ +P T +LG N
Sbjct: 302 -TSYDYDALLNEAG---QPTEKYYA-VQRVIKEVCPSVWQAEPRTKTLKNLGTYPVNKSV 356
Query: 363 TVYKTGSGLCSAFLANIGTNSDVTVKFNGNSYLLPAWSVSI 403
+++ +C I T+ +T++ N Y +S+++
Sbjct: 357 SLFHIKEQICE----EIKTDYPLTMEQASNGYGYLLYSLTL 393
Score = 46.6 bits (109), Expect = 0.059, Method: Compositional matrix adjust.
Identities = 73/284 (25%), Positives = 105/284 (36%), Gaps = 69/284 (24%)
Query: 451 PVGISKDDAFTKPGLLEQINT----TADQSD----YLWYSLSTNIKADEPLLEDGSKTVL 502
PV S K + E+I T T +Q+ YL YSL+ L G K L
Sbjct: 351 PVNKSVSLFHIKEQICEEIKTDYPLTMEQASNGYGYLLYSLT--------LKNYGHKNKL 402
Query: 503 HVQSLGHALHAFINGKLVGSGYGSSSNAKVTVDFPIALAPGKN----TFDLLSLTVGLQN 558
+ +I+GK + + + T+ + + KN T D+L +G N
Sbjct: 403 RLIETNDRAQIYIDGKY------NQTQTQETLGDEMMIEGQKNQPTITLDVLVENLGRVN 456
Query: 559 YGAFYEKTGAGITGPVQLKGSGNGTNIDLSSQQWTYQTGLKGEELNFPSGSSTQWDSKST 618
YGA + P Q KG NG D+ + G + L F + D +
Sbjct: 457 YGA-------KLNSPSQSKGIRNGVMQDIH-----FHLGYRHYPLTFEQAQLDKIDYSAG 504
Query: 619 LPKLQPLVWYKTTFDAPAGSEPVAIDFTGMGKGEAWVNGQSIGRYWPTYVSQNGGCTDSC 678
QP +Y+ FD A ID + GKG +NG ++GRYW N G
Sbjct: 505 KDPSQP-SFYQFEFDL-AEEADTYIDCSLYGKGVVIINGFNLGRYW------NHG----- 551
Query: 679 NYRGAYSSNKCLKNCGKPSQSLYHVPRSWLKSSGNTLVLFEEIG 722
P SLY P+ LK N +++FE G
Sbjct: 552 -----------------PVLSLY-CPKDVLKKGRNEVIIFETEG 577
>gi|387791561|ref|YP_006256626.1| beta-galactosidase [Solitalea canadensis DSM 3403]
gi|379654394|gb|AFD07450.1| beta-galactosidase [Solitalea canadensis DSM 3403]
Length = 619
Score = 163 bits (412), Expect = 5e-37, Method: Compositional matrix adjust.
Identities = 111/334 (33%), Positives = 167/334 (50%), Gaps = 38/334 (11%)
Query: 33 AVVIGGKRRVLISGSIHYPRSTPEMWPDLIQKSKDGGLDVIETYVFWNLHEPVRNQYNFE 92
A V GK + SG +H+ R E W ++ K GL+ + TYVFWN HE ++F+
Sbjct: 32 AFVYDGKPVQIHSGEMHFARVPQEYWRHRLKMMKAMGLNSVATYVFWNYHETAPGVWDFK 91
Query: 93 -GRYDLVKFVKLVAEAGLYAHLRIGPYVCAEWNFGGFPLWLHFIPGIQFRTDNEPFKAEM 151
G ++ +F+K+ E GL LR GPY CAEW +GG+P +L + G++ R +N F A
Sbjct: 92 TGNKNISEFIKIAGEEGLMVILRPGPYACAEWEYGGYPWFLQNVEGLEVRRNNPKFLAAC 151
Query: 152 QRFTAKIVDMMKQEKLYASQGGPIILSQIENEYGNI-----------DSAYGAAGKSYIK 200
+ + + +K +++ ++GGPII+ Q ENE+G+ AY +A K+ +
Sbjct: 152 KEYIDHLAKEVKNQQI--TKGGPIIMVQAENEFGSYVAQRKDIPLAEHKAYSSAIKAQL- 208
Query: 201 WAAGMALSLDTGV-PWVMCQQSDAPDPIINTCNGF--------YCDQFTPNSNNKPKMWT 251
AAG + L T W+ + + + + T NG DQ+ N P M
Sbjct: 209 LAAGFDVPLFTSDGSWLF--EGGSIENCLPTANGEDNIENLKKVVDQY--NGGKGPYMVA 264
Query: 252 ENWSGWFLSFGGAVPYRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNFDRTSGGPFIS-- 309
E + GW + P P ED+ ++ Q +F NYYM HGGTNF TSG +
Sbjct: 265 EFYPGWLDHWAEPFPKVPTEDVVKQTEKYLQNNVSF-NYYMVHGGTNFGYTSGANYDKNH 323
Query: 310 ------TSYDYDAPLDEYGLIRQPKWGHLKDLHK 337
TSYDYDAP+ E G PK+ +++L K
Sbjct: 324 DIQPDMTSYDYDAPISEAGW-ATPKYIAIRELMK 356
Score = 48.9 bits (115), Expect = 0.010, Method: Compositional matrix adjust.
Identities = 55/221 (24%), Positives = 80/221 (36%), Gaps = 56/221 (25%)
Query: 502 LHVQSLGHALHAFINGKLVGSGYGSSSNAKVTVDFPIALAPGKNTFDLLSLTVGLQNYGA 561
L + L ++NG+ V N +D P T D+ +G NYGA
Sbjct: 428 LELNGLRDYALVYVNGEKVAELNRYYKNYSCEIDVPF-----NATLDIFVENMGRINYGA 482
Query: 562 FYEKTGAGITGPVQLKG---SGNGTNIDLSSQQWTYQTGLKGEELNFPSGSSTQWDSKST 618
+ GI PV + G SGN + ++ +K +E+
Sbjct: 483 KITENNKGIISPVVINGTEISGNWKMYKMPLEKQEEVASIKAKEV--------------- 527
Query: 619 LPKLQPLVWYKTTFDAPAGSEPVAIDFTGMGKGEAWVNGQSIGRYWPTYVSQNGGCTDSC 678
K QP+V K TF+ + +D GKG +VNG +GRYW
Sbjct: 528 --KSQPVV-LKGTFNLTETGD-TFLDMEAWGKGIVFVNGYHLGRYW-------------- 569
Query: 679 NYRGAYSSNKCLKNCGKPSQSLYHVPRSWLKSSGNTLVLFE 719
N G P Q+LY +P WLK N + + E
Sbjct: 570 -------------NVG-PQQTLY-LPGCWLKKGANEITIVE 595
>gi|347967091|ref|XP_001689312.2| AGAP002056-PA [Anopheles gambiae str. PEST]
gi|333469762|gb|EDO63217.2| AGAP002056-PA [Anopheles gambiae str. PEST]
Length = 629
Score = 162 bits (411), Expect = 5e-37, Method: Compositional matrix adjust.
Identities = 109/334 (32%), Positives = 161/334 (48%), Gaps = 33/334 (9%)
Query: 26 NVTYDHRAVVIGGKRRVLISGSIHYPRSTPEMWPDLIQKSKDGGLDVIETYVFWNLHEPV 85
++ YD+ V+ GK ++GS HY R+ PE WP +++ + GL+ I TYV W+LH P
Sbjct: 27 SIDYDNDTFVMDGKPFQYVAGSFHYFRALPESWPSILRSMRAAGLNAITTYVEWSLHNPK 86
Query: 86 RNQYNFEGRYDLVKFVKLVAEAGLYAHLRIGPYVCAEWNFGGFPLWL-HFIPGIQFRTDN 144
+ YN++G D+ F++L AGLY LR GPY+CAE + GGFP WL H P I RT++
Sbjct: 87 EDVYNWQGMADIEHFLELADSAGLYVILRPGPYICAERDMGGFPSWLLHKYPDILLRTND 146
Query: 145 EPFKAEMQRFTAKIVDMMKQEKLYASQGGPIILSQIENEYGNIDSAYGAAGKSYIKW--- 201
+ E++ + A+++ + ++ QGGPII+ Q+ENEYG ++ A Y+ W
Sbjct: 147 LRYLREVRTWYAQLLS--RVQRFLVGQGGPIIMVQVENEYG----SFYACDHKYLNWLRD 200
Query: 202 -----AAGMALSLDTGVPWV--------MCQQSDAPDPIINTCNGFYCDQFTPNSNNKPK 248
G A+ P + + D + NGF+ P
Sbjct: 201 ETERYVMGNAVLFTNNGPGLEGCGAIEHVLSSLDFGPGTEDEINGFWS-TLRKTQPKGPL 259
Query: 249 MWTENWSGWFLSFGGAVPYRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNFDRTSGGPFI 308
+ E + GW + R F R N YM+ GGTN+ T+G +
Sbjct: 260 VNAEYYPGWLTHWQEPHMARTDTKPVVDSLDFMLRNKVNVNIYMFFGGTNYGFTAGANNM 319
Query: 309 S--------TSYDYDAPLDEYGLIRQPKWGHLKD 334
TSYDYDAPLDE G PK+ L+D
Sbjct: 320 GAGGYAADLTSYDYDAPLDESG-DPTPKYFALRD 352
>gi|348508360|ref|XP_003441722.1| PREDICTED: beta-galactosidase-1-like protein 2-like [Oreochromis
niloticus]
Length = 648
Score = 162 bits (411), Expect = 5e-37, Method: Compositional matrix adjust.
Identities = 110/304 (36%), Positives = 149/304 (49%), Gaps = 36/304 (11%)
Query: 42 VLISGSIHYPRSTPEMWPDLIQKSKDGGLDVIETYVFWNLHEPVRNQYNFEGRYDLVKFV 101
+++ GSIHY R W D + K K GL+ + TYV WNLHEP R + F+ + DL ++
Sbjct: 72 LILGGSIHYFRVPRAYWEDRLLKMKACGLNTLTTYVPWNLHEPERGVFKFDDQLDLEAYL 131
Query: 102 KLVAEAGLYAHLRIGPYVCAEWNFGGFPLWLHFIPGIQFRTDNEPFKAEMQRFTAKIVDM 161
+L A GL+ LR GPY+CAEW+ GG P WL P ++ RT F + F +++
Sbjct: 132 RLAASLGLWVILRPGPYICAEWDLGGLPSWLLRDPQMKLRTTYSGFTYAVNSFFDEVIK- 190
Query: 162 MKQEKLYASQGGPIILSQIENEYGN----------IDSAYGAAGKSYIKWAA----GMAL 207
K S+GGPII Q+ENEYG+ I A + G + + + G+ L
Sbjct: 191 -KAVPHQYSKGGPIIAVQVENEYGSYATDENYMPFIKEALLSRGITELLLTSDNKDGLKL 249
Query: 208 SLDTGVPWVMCQQSDAPDPIINTCNGFYCDQFTPNSNNKPKMWTENWSGWFLSFGGAVPY 267
G + Q PD I Y +Q P +PKM E WSGWF +GG
Sbjct: 250 GGVKGALETINFQKLDPDEIK------YLEQIQP---QQPKMVMEYWSGWFDLWGGLHHV 300
Query: 268 RPVEDLAFAVARFFQRGGTFQNYYMYHGGTNFDRTSGG---------PFISTSYDYDAPL 318
E++ V + + N YM+HGGTNF SG P + TSYDYDAPL
Sbjct: 301 YTAEEMIPVVTEILKLDMSI-NLYMFHGGTNFGFMSGAFAVGLPAPKPMV-TSYDYDAPL 358
Query: 319 DEYG 322
E G
Sbjct: 359 SEAG 362
>gi|294627330|ref|ZP_06705916.1| beta-galactosidase [Xanthomonas fuscans subsp. aurantifolii str.
ICPB 11122]
gi|292598412|gb|EFF42563.1| beta-galactosidase [Xanthomonas fuscans subsp. aurantifolii str.
ICPB 11122]
Length = 613
Score = 162 bits (411), Expect = 5e-37, Method: Compositional matrix adjust.
Identities = 130/376 (34%), Positives = 180/376 (47%), Gaps = 35/376 (9%)
Query: 1 MASKEILLLVLCWGFVV-----LATTSFGANVTYDHRAVVIGGKRRVLISGSIHYPRSTP 55
M + LVL F + A T N V GK L+SG+IH+ R
Sbjct: 1 MLRTTLAPLVLALAFALPITGAAADTERWPNFGTQGTQFVRDGKPYQLLSGAIHFQRIPR 60
Query: 56 EMWPDLIQKSKDGGLDVIETYVFWNLHEPVRNQYNFEGRYDLVKFVKLVAEAGLYAHLRI 115
W D +QK++ GL+ +ETYVFWNL EP + Q++F G D+ FV+ A GL LR
Sbjct: 61 AYWKDRLQKARALGLNTVETYVFWNLVEPQQGQFDFSGNNDVAAFVREAAAQGLNVILRP 120
Query: 116 GPYVCAEWNFGGFPLWLHFIPGIQFRTDNEPFKAEMQRFTAKIVDMMKQEKLYASQGGPI 175
GPY CAEW GG+P WL I+ R+ + F A Q + + + + + L GGPI
Sbjct: 121 GPYACAEWEAGGYPAWLFGKGNIRVRSRDPRFLAASQAYLDALANQV--QPLLNHNGGPI 178
Query: 176 ILSQIENEYGNI--DSAYGAAGKS-YIKWAAGMALSLDTGVPWVMCQQSDAPD--PIINT 230
I Q+ENEYG+ D AY A ++ Y+K AL L T M PD ++N
Sbjct: 179 IAVQVENEYGSYADDHAYMADNRAMYVKAGFDKAL-LFTSDGADMLANGTLPDTLAVVNF 237
Query: 231 CNG---FYCDQFTPNSNNKPKMWTENWSGWFLSFGGAVPYRPVEDLAFAVARFFQ---RG 284
G D+ ++P+M E W+GWF +G P+ + A A F+ R
Sbjct: 238 APGEAKSAFDKLIKFRPDQPRMVGEYWAGWFDHWGK--PHAATD--ARQQAEEFEWILRQ 293
Query: 285 GTFQNYYMYHGGTNFDRTSGGPF----------ISTSYDYDAPLDEYGLIRQPKWGHLKD 334
G N YM+ GGT+F +G F +TSYDYDA LDE G PK+ ++D
Sbjct: 294 GHSANLYMFIGGTSFGFMNGANFQNNPSDHYAPQTTSYDYDAILDEAGHP-TPKFALMRD 352
Query: 335 -LHKAIKLCEAALVAT 349
+ + + AL AT
Sbjct: 353 AIARVTGIQPPALPAT 368
Score = 39.7 bits (91), Expect = 7.2, Method: Compositional matrix adjust.
Identities = 41/167 (24%), Positives = 69/167 (41%), Gaps = 24/167 (14%)
Query: 499 KTVLHVQSLGHALHAFINGKLVGSGYGSSSNAKVTVDFPIALAPGKNTFDLLSLTVGLQN 558
K L++ + +++ + VGS + V+ P G++T D+L G N
Sbjct: 422 KGPLYLGDVRDVARVYVDQRPVGSVERRLQQVSLDVEIPA----GQHTLDVLVENSGRIN 477
Query: 559 YGAFYEKTGAGITGPVQLKGSGNGTNIDLSSQQWTYQTGLKGEELNFPSGSSTQ-WDSKS 617
YG AG+ PV L +QQ T G + L + S + W K+
Sbjct: 478 YGPRMADGRAGLVDPVVL-----------DNQQLT---GWQAFPLPMRTPDSIRGWTRKA 523
Query: 618 TLPKLQPLVWYKTTFDAPAGSEPVAIDFTGMGKGEAWVNGQSIGRYW 664
+Q +++ T ++ +D GKG AW NG ++GR+W
Sbjct: 524 ----VQGPAFHRGTLRIGTPTD-TYLDMRAFGKGFAWANGVNLGRHW 565
>gi|193695178|ref|XP_001948549.1| PREDICTED: beta-galactosidase-like [Acyrthosiphon pisum]
Length = 640
Score = 162 bits (411), Expect = 5e-37, Method: Compositional matrix adjust.
Identities = 120/355 (33%), Positives = 171/355 (48%), Gaps = 41/355 (11%)
Query: 17 VLATTSFGAN-----VTYDHRAVVIGGKRRVLISGSIHYPRSTPEMWPDLIQKSKDGGLD 71
VL TS N V Y+ + G+ +SG +HY R W D IQK K GL+
Sbjct: 16 VLCDTSNSTNNRTFIVDYEKNEFLKDGEVFRYVSGDLHYFRVPKSYWKDRIQKIKAAGLN 75
Query: 72 VIETYVFWNLHEPVRNQYNFEGRYDLVKFVKLVAEAGLYAHLRIGPYVCAEWNFGGFPLW 131
I TYV W+LHEP YNFEG DL F+KL+ + G+Y LR GPY+CAE +FGGFP W
Sbjct: 76 AITTYVEWSLHEPFPGTYNFEGMADLEYFIKLIQDEGMYLLLRPGPYICAERDFGGFPYW 135
Query: 132 -LHFIPGIQFRTDNEPFKAEMQRFTAKIVDMMKQEKLYASQGGPIILSQIENEYGNIDSA 190
L+ P RT++ +K + ++ + ++ M Q LY + GG II+ Q+ENEYG +
Sbjct: 136 LLNVTPKGSLRTNDSSYKKYVSQWFSVLMKKM-QPHLYGN-GGNIIMVQVENEYG----S 189
Query: 191 YGAAGKSYIKWAAGMALSL--DTGVPWV--MCQQSD---APDPIIN-------TCNGFYC 236
Y A Y W + D + + +C+Q D P P + + N C
Sbjct: 190 YYACDSDYKLWLRDLLKGYVEDKALLYTIDICRQRDFDCGPIPEVYATVDFGISVNAATC 249
Query: 237 DQFTPN-SNNKPKMWTENWSGWFLSFGGAVPYRPVEDLAFAVARFFQRGGTFQNYYMYHG 295
F N P + +E + GW + P +D+ + +F ++YM+HG
Sbjct: 250 FDFLKNYQKGGPSVNSEFYPGWLAHWQEPHPKVNSDDVVNHMKSMLSLNASF-SFYMFHG 308
Query: 296 GTNFDRTSGG------------PFISTSYDYDAPLDEYGLIRQPKWGHLKDLHKA 338
GTNF TSG P + TSYDYDAP+ E G + + + + L A
Sbjct: 309 GTNFGFTSGANTNESDANIGYLPQL-TSYDYDAPITEAGDLTEKYFKIKQTLENA 362
>gi|256423546|ref|YP_003124199.1| beta-galactosidase [Chitinophaga pinensis DSM 2588]
gi|256038454|gb|ACU61998.1| Beta-galactosidase [Chitinophaga pinensis DSM 2588]
Length = 610
Score = 162 bits (411), Expect = 6e-37, Method: Compositional matrix adjust.
Identities = 120/337 (35%), Positives = 161/337 (47%), Gaps = 34/337 (10%)
Query: 7 LLLVLCWGFVVLATTSFGANVTYDHRAVVIGGKRRVLISGSIHYPRSTPEMWPDLIQKSK 66
LL+V + F + T A ++ GK +ISG IHYPR E W D ++ +K
Sbjct: 9 LLIVFSYLFSIAQQQH---TFTLGDTAFLLDGKPLQMISGEIHYPRVPRECWRDRMKMAK 65
Query: 67 DGGLDVIETYVFWNLHEPVRNQYNFEGRYDLVKFVKLVAEAGLYAHLRIGPYVCAEWNFG 126
GL+ I TYVFWN+HEP + QY+F G D+ FVK+ E L+ LR PYVCAEW FG
Sbjct: 66 AMGLNTIGTYVFWNVHEPEKGQYDFSGNNDIAAFVKMAKEEDLWVVLRPSPYVCAEWEFG 125
Query: 127 GFPLWLHFIPGIQFRTDNEPFKAEMQRFTAKIVDMMKQ-EKLYASQGGPIILSQIENEYG 185
G+P WL I G++ R+ + ++ + I+ + KQ L + GG I++ QIENEYG
Sbjct: 126 GYPYWLQEIKGLKVRSKEPQY---LEAYRNYIMAVGKQLSPLLVTHGGNILMVQIENEYG 182
Query: 186 NI--DSAYGAAGKSYIKWAAGMALSLDTGVPWVMCQQSDAPD--PIINTCNGFYCDQFTP 241
+ D Y + AG L T P + P P IN + +
Sbjct: 183 SYSDDKDYLDINRKMFV-EAGFDGLLYTCDPKAAIKNGHLPGLLPAINGVDDPLQVKQLI 241
Query: 242 NSNNKPK------MWTENWSGWFLSFGGAVPYRP-VEDLAFAVARFFQRGGTFQNYYMYH 294
N N+ K W W W+ + VPYR + L +A G N YM+H
Sbjct: 242 NENHSGKGPYYIAEWYPAWFDWWGTKHHTVPYRQYLGKLDSVLA-----AGISINMYMFH 296
Query: 295 GGTNFDRTSGG---------PFISTSYDYDAPLDEYG 322
GGT +G P IS SYDYDAPLDE G
Sbjct: 297 GGTTRGFMNGANANDADPYEPQIS-SYDYDAPLDEAG 332
Score = 54.3 bits (129), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 68/272 (25%), Positives = 99/272 (36%), Gaps = 68/272 (25%)
Query: 450 EPVGISKDDAFTKPGLLEQINTTADQSDYLWYSLSTNIKADEPLLEDGSKTVLHVQSLGH 509
+PVG +K F G Y W ST L G K +L ++ L
Sbjct: 384 KPVGSAKPRTFEDLG-----------QAYGWVMYSTT-------LTGGRKGLLQLKELRD 425
Query: 510 ALHAFINGKLVGSGYGSSSNAKVTVDFPIALAPGKNTFDLLSLTVGLQNYGAFYEKTGAG 569
+NGK G S + +D P GK DLL +G N+G + G
Sbjct: 426 YCVVMVNGKRAGVLDRRSKRDSIALDLPA----GKVKLDLLVENLGRINFGPYLLSNRKG 481
Query: 570 ITGPVQLKGSGNGTNIDLSSQQWTYQTGLKGEELNFPSGSSTQWDSKSTLPKLQPLVWYK 629
IT V D + Q GL ++L P+ ++ + + +P + +
Sbjct: 482 ITEKVLF---------DRQELKGWQQYGLPFDKL--PAVAAKGIKAGANVP-----TYRQ 525
Query: 630 TTFDAPAGSEPVAIDFTGMGKGEAWVNGQSIGRYWPTYVSQNGGCTDSCNYRGAYSSNKC 689
TF + +D + GKG W+NG +GRYW Q G
Sbjct: 526 GTFTLDKTGD-TWLDMSNWGKGAVWINGHHLGRYW-----QVG----------------- 562
Query: 690 LKNCGKPSQSLYHVPRSWLKSSGNTLVLFEEI 721
P Q++Y VP WLK N +V+ E I
Sbjct: 563 ------PQQTIY-VPAEWLKKGMNDIVIMELI 587
>gi|425056292|ref|ZP_18459750.1| putative beta-galactosidase [Enterococcus faecium 505]
gi|403032128|gb|EJY43702.1| putative beta-galactosidase [Enterococcus faecium 505]
Length = 595
Score = 162 bits (411), Expect = 6e-37, Method: Compositional matrix adjust.
Identities = 128/401 (31%), Positives = 188/401 (46%), Gaps = 50/401 (12%)
Query: 35 VIGGKRRVLISGSIHYPRSTPEMWPDLIQKSKDGGLDVIETYVFWNLHEPVRNQYNFEGR 94
++ G +ISG+IHY R P W + K G + +ETY+ WNLHEP ++F G
Sbjct: 11 LVDGIPTKIISGAIHYFRIPPSQWEHSLYNLKALGANTVETYIPWNLHEPQEGSFDFSGF 70
Query: 95 YDLVKFVKLVAEAGLYAHLRIGPYVCAEWNFGGFPLWLHFIPGIQFRTDNEPFKAEMQRF 154
D+V+FVK+ E L LR Y+CAEW FGG P WL P I+ R+ + F +++ +
Sbjct: 71 KDVVQFVKIAQELDLMVILRPCAYICAEWEFGGLPAWLLKEPNIRVRSTDPRFMEKLKNY 130
Query: 155 TAKIVDMMKQEKLYASQGGPIILSQIENEYGNIDSAYGAAGKSYIKWAAGMALSLDTGVP 214
V + K L +QGGP+I+ Q+ENEYG +YG KSY++ + L+ VP
Sbjct: 131 YQ--VLLPKLAPLQITQGGPVIMMQLENEYG----SYGME-KSYLRQTKELMLAHSIDVP 183
Query: 215 -------WV-MCQQSDAPDPIINTCNGF---------YCDQFTPN-SNNKPKMWTENWSG 256
W+ + D I F +F N N P M E W G
Sbjct: 184 LFTSDGAWLEVLDAGTLIDEDIFVTGNFGSHSKENAQVLKEFMQNHQKNWPIMCMEYWDG 243
Query: 257 WFLSFGGAVPYRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNFDRTSGG--------PFI 308
WF +G + R E+LA V + G N YM+HGGTNF +G P I
Sbjct: 244 WFNRWGEPIITRDPEELATEVKEMLEIGSL--NLYMFHGGTNFGFYNGCSARGNTDLPQI 301
Query: 309 STSYDYDAPLDEYGLIRQPKWGHLKDLHKAIKLCEAALVATDP---TYPSLGP---NLEA 362
TSYDYDA L+E G QP + + + IK ++ +P T +LG N
Sbjct: 302 -TSYDYDALLNEAG---QPTEKYYA-VQRIIKEVCPSVWQAEPRTKTLKNLGTYPVNRSV 356
Query: 363 TVYKTGSGLCSAFLANIGTNSDVTVKFNGNSYLLPAWSVSI 403
+++ +C I T+ +T++ N Y +S+++
Sbjct: 357 SLFHIKEQICE----EIKTDYPLTMEQASNGYGYLLYSLTL 393
Score = 45.8 bits (107), Expect = 0.097, Method: Compositional matrix adjust.
Identities = 73/280 (26%), Positives = 103/280 (36%), Gaps = 61/280 (21%)
Query: 451 PVGISKDDAFTKPGLLEQINT----TADQSD----YLWYSLSTNIKADEPLLEDGSKTVL 502
PV S K + E+I T T +Q+ YL YSL+ L G K L
Sbjct: 351 PVNRSVSLFHIKEQICEEIKTDYPLTMEQASNGYGYLLYSLT--------LKNYGHKNKL 402
Query: 503 HVQSLGHALHAFINGKLVGSGYGSSSNAKVTVDFPIALAPGKNTFDLLSLTVGLQNYGAF 562
+ +I+GK Y + + D + G+ ++L V ++N G
Sbjct: 403 RLIETNDRAQIYIDGK-----YDQTQTQETLGD--EMMIEGQKNQPTIALDVLVENLGRV 455
Query: 563 YEKTGAGITGPVQLKGSGNGTNIDLSSQQWTYQTGLKGEELNFPSGSSTQWDSKSTLPKL 622
GA + P Q KG NG D+ + G + L F + D +
Sbjct: 456 --NYGAKLNSPSQSKGIRNGVMQDIH-----FHLGYRHYPLTFEQAQLDKIDYSAGKDPS 508
Query: 623 QPLVWYKTTFDAPAGSEPVAIDFTGMGKGEAWVNGQSIGRYWPTYVSQNGGCTDSCNYRG 682
QP +Y+ FD A ID + GKG VNG ++GRYW N G
Sbjct: 509 QP-SFYQFEFDL-AEEADTYIDCSLYGKGVVIVNGFNLGRYW------NHG--------- 551
Query: 683 AYSSNKCLKNCGKPSQSLYHVPRSWLKSSGNTLVLFEEIG 722
P SLY P+ LK N +V+FE G
Sbjct: 552 -------------PVLSLY-CPKDVLKKGRNEVVIFETEG 577
>gi|251799202|ref|YP_003013933.1| beta-galactosidase [Paenibacillus sp. JDR-2]
gi|247546828|gb|ACT03847.1| Beta-galactosidase [Paenibacillus sp. JDR-2]
Length = 604
Score = 162 bits (411), Expect = 6e-37, Method: Compositional matrix adjust.
Identities = 104/338 (30%), Positives = 164/338 (48%), Gaps = 43/338 (12%)
Query: 25 ANVTYDHRAVVIGGKRRVLISGSIHYPRSTPEMWPDLIQKSKDGGLDVIETYVFWNLHEP 84
+ +T+ + + G+ ++SG+IHY R PE W D + K K G + +ETY+ WNLHEP
Sbjct: 2 SRLTWKDQKYRLDGEEFRILSGAIHYFRVVPEYWEDRLLKLKACGFNTVETYIPWNLHEP 61
Query: 85 VRNQYNFEGRYDLVKFVKLVAEAGLYAHLRIGPYVCAEWNFGGFPLWLHFIPGIQFRTDN 144
+ F+G D+ +F++ GL+ +R PY+CAEW FGG P WL + R +
Sbjct: 62 REGSFRFDGFADVARFIETAGRLGLHVIVRPSPYICAEWEFGGLPAWL-LKSSMGLRCMD 120
Query: 145 EPFKAEMQRFTAKIVDMMKQEKLYASQGGPIILSQIENEYGNI--DSAYGAAGKSYIKWA 202
+ ++ R+ +++ + L S+GGPII Q+ENEYG+ D+AY A + +
Sbjct: 121 NEYLEKVDRYYDELIPRLL--PLLDSRGGPIIAVQVENEYGSYGNDTAYLAYLRDGL--- 175
Query: 203 AGMALSLDTGVPWVMCQQSDAPDPII--NTCNGFYCD------------QFTPNSNNKPK 248
+ GV ++ D ++ T G + ++ ++P
Sbjct: 176 ------IRRGVDCLLFTSDGPTDEMLLGGTVEGLHATVNFGSRVAESLAKYREYRQDEPL 229
Query: 249 MWTENWSGWFLSFGGAVPYRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNFDRTSGGPF- 307
M E W GWF + R D+A + ++G + N YM+HGGTNF SG +
Sbjct: 230 MVMEYWLGWFDHWRKPHHVREAGDVANVLDEMLEQGASV-NLYMFHGGTNFGFYSGANYG 288
Query: 308 -----ISTSYDYDAPLDEYGLIRQPKWGHLKDLHKAIK 340
TSYDYDAPL E WG + + +KAI+
Sbjct: 289 EHYEPTITSYDYDAPLTE--------WGDITEKYKAIR 318
>gi|346725882|ref|YP_004852551.1| beta-galactosidase [Xanthomonas axonopodis pv. citrumelo F1]
gi|346650629|gb|AEO43253.1| beta-galactosidase [Xanthomonas axonopodis pv. citrumelo F1]
Length = 611
Score = 162 bits (411), Expect = 6e-37, Method: Compositional matrix adjust.
Identities = 130/382 (34%), Positives = 183/382 (47%), Gaps = 35/382 (9%)
Query: 35 VIGGKRRVLISGSIHYPRSTPEMWPDLIQKSKDGGLDVIETYVFWNLHEPVRNQYNFEGR 94
V GK L+SG+IH+ R W D +QK++ GL+ +ETYVFWNL EP + Q++F G
Sbjct: 38 VRAGKPYQLLSGAIHFQRIPRAYWKDRLQKARALGLNTVETYVFWNLVEPQQGQFDFSGN 97
Query: 95 YDLVKFVKLVAEAGLYAHLRIGPYVCAEWNFGGFPLWLHFIPGIQFRTDNEPFKAEMQRF 154
D+ FV+ A GL LR GPY CAEW GG+P WL I+ R+ + F A Q +
Sbjct: 98 NDVAAFVREAAAQGLNVILRPGPYACAEWEAGGYPAWLFGKGNIRVRSRDPRFLAASQSY 157
Query: 155 TAKIVDMMKQEKLYASQGGPIILSQIENEYGNI--DSAYGAAGKS-YIKWAAGMALSLDT 211
+ + + L GGPII Q+ENEYG+ D AY A ++ Y+K AL L T
Sbjct: 158 LDALAKQV--QPLLNHNGGPIIAVQVENEYGSYADDHAYMADNRAMYVKAGFDKAL-LFT 214
Query: 212 GVPWVMCQQSDAPD--PIINTCNG---FYCDQFTPNSNNKPKMWTENWSGWFLSFGGAVP 266
M PD ++N G D+ ++P+M E W+GWF +G P
Sbjct: 215 SDGADMLANGTLPDTLAVVNFAPGEAKSAFDKLIKFRPDQPRMVGEYWAGWFDHWGK--P 272
Query: 267 YRPVEDLAFAVARFFQ---RGGTFQNYYMYHGGTNFDRTSGGPF----------ISTSYD 313
+ + A A F+ R G N YM+ GGT+F +G F +TSYD
Sbjct: 273 HAATD--ARQQAEEFEWILRQGHSANLYMFIGGTSFGFMNGANFQNNPSDHYAPQTTSYD 330
Query: 314 YDAPLDEYGLIRQPKWGHLKD-LHKAIKLCEAALVATDPTYPSLGPNLEATVYKTGSGLC 372
YDA LDE G PK+ ++D + + + AL A P L AT + + L
Sbjct: 331 YDAILDEAGHP-TPKFALMRDAIARVTGVQPPALPA-----PIATATLPATPLRESASLW 384
Query: 373 SAFLANIGTNSDVTVKFNGNSY 394
A I ++ ++ G Y
Sbjct: 385 DNLPAPIAIDTPQPMEQFGQDY 406
>gi|225407896|ref|ZP_03761085.1| hypothetical protein CLOSTASPAR_05117 [Clostridium asparagiforme
DSM 15981]
gi|225042575|gb|EEG52821.1| hypothetical protein CLOSTASPAR_05117 [Clostridium asparagiforme
DSM 15981]
Length = 590
Score = 162 bits (411), Expect = 6e-37, Method: Compositional matrix adjust.
Identities = 104/313 (33%), Positives = 154/313 (49%), Gaps = 36/313 (11%)
Query: 36 IGGKRRVLISGSIHYPRSTPEMWPDLIQKSKDGGLDVIETYVFWNLHEPVRNQYNFEGRY 95
+ G+ L+SG++HY R PE W D + K G + +ETY+ WN+HEP +++F G
Sbjct: 12 LDGRPVKLLSGAVHYFRLMPEYWEDCLYNLKAMGFNTVETYIPWNIHEPEEGEFDFSGSR 71
Query: 96 DLVKFVKLVAEAGLYAHLRIGPYVCAEWNFGGFPLWLHFIPGIQFRTDNEPFKAEMQRFT 155
D+ FV+L GL+ LR P++CAEW GG P WL P ++ RT+ F +++ +
Sbjct: 72 DVEAFVRLAGSMGLHVILRPSPFICAEWEMGGLPAWLLRYPDMKVRTNTPLFLVKVEAYY 131
Query: 156 AKIVDMMKQEKLYASQGGPIILSQIENEYGNIDSAYGAAGKSYIKWAAGMALSLDTGVPW 215
++ + L ++GGP+IL Q+ENEYG+ + K Y++ + VP+
Sbjct: 132 RELFRHIAD--LQITRGGPVILMQVENEYGSFGN-----DKEYLRRIKSLMERFGAEVPF 184
Query: 216 VMCQQS-DA--------PDPIINTCN-GFYCDQ--------FTPNSNNKPKMWTENWSGW 257
S DA D ++ T N G D+ F + P M E W GW
Sbjct: 185 FTSDGSWDAALEAGSLIEDGVLATANFGSRSDENLDVLEAFFKRHGRKWPLMCMEFWDGW 244
Query: 258 FLSFGGAVPYRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNFDRTSGG--------PFIS 309
F + + R EDLA V + +R N YM+ GGTNF +G P I
Sbjct: 245 FNRWREKIITRDAEDLAMEVRQLLERASI--NLYMFQGGTNFGFYNGCSARGYTDLPQI- 301
Query: 310 TSYDYDAPLDEYG 322
TSY+YDA L E+G
Sbjct: 302 TSYNYDAILTEWG 314
Score = 47.0 bits (110), Expect = 0.050, Method: Compositional matrix adjust.
Identities = 50/213 (23%), Positives = 86/213 (40%), Gaps = 52/213 (24%)
Query: 511 LHAFINGKLVGSGYGSSSNAKVTVDFPIALAPGKNTFDLLSLTVGLQNYGAFYEKTGAGI 570
+ ++NG G+ Y ++S ++ + F P +N DLL +G NYG +
Sbjct: 411 VQYYLNGMFEGTQYQNNSGEELELFF----GP-ENRLDLLVENMGRVNYGY-------KL 458
Query: 571 TGPVQLKGSGNGTNIDLSSQQWTYQTGLKGEELNFPSGSSTQWDSKSTLPKLQPLVW-YK 629
P Q KG G +D+ +++G E+ P + + D + + P + Y+
Sbjct: 459 QAPTQRKGIRTGVMVDIH-----FESGW--EQYALPLDNVNRVDFEKEWIQDTPAFYRYE 511
Query: 630 TTFDAPAGSEPVAIDFTGMGKGEAWVNGQSIGRYWPTYVSQNGGCTDSCNYRGAYSSNKC 689
D P + ++ +GKG A++NG ++GRYW
Sbjct: 512 FQVDQPKDT---FLNCRELGKGVAFINGFNLGRYWSE----------------------- 545
Query: 690 LKNCGKPSQSLYHVPRSWLKSSGNTLVLFEEIG 722
P Q LY +P L+ N L++FE G
Sbjct: 546 -----GPVQYLY-IPAPLLREGKNELIVFETEG 572
>gi|335430223|ref|ZP_08557118.1| beta-galactosidase Bga35A [Haloplasma contractile SSD-17B]
gi|334888639|gb|EGM26936.1| beta-galactosidase Bga35A [Haloplasma contractile SSD-17B]
Length = 587
Score = 162 bits (411), Expect = 6e-37, Method: Compositional matrix adjust.
Identities = 107/326 (32%), Positives = 153/326 (46%), Gaps = 30/326 (9%)
Query: 43 LISGSIHYPRSTPEMWPDLIQKSKDGGLDVIETYVFWNLHEPVRNQYNFEGRYDLVKFVK 102
+I+G +HY R+ + W D + K K G + +ETYV WN+HE + Y F G D+ F++
Sbjct: 20 IIAGGMHYFRTMKDSWKDRLIKLKAMGCNTVETYVPWNMHEAKKGVYAFNGNLDIKAFIE 79
Query: 103 LVAEAGLYAHLRIGPYVCAEWNFGGFPLWLHFIPGIQFRTDNEPFKAEMQRFTAKIVDMM 162
L L+ +R PY+CAEW FGG P WL PG++ RT +PF ++ + + ++
Sbjct: 80 LAQSLELFVIVRPSPYICAEWEFGGLPAWLLKDPGMKVRTVYKPFMKHVKEYFEVLFKIL 139
Query: 163 KQEKLYASQGGPIILSQIENEYGNIDSAYGAAGKSYIKWAAGMALSLDTGVPWVMCQ--- 219
L Q GPIIL QIENEYG Y K Y+ + T VP V
Sbjct: 140 A--PLQIDQDGPIILMQIENEYG-----YYGNDKEYLSTLLKIMRDFGTTVPVVTSDGPW 192
Query: 220 ---------QSDAPDPIINTCNGF--YCDQFTPNSNNKPKMWTENWSGWFLSFGGAVPY- 267
+D P +N G + + F NKP M E W GWF ++G +
Sbjct: 193 GEALDAGSLLADVSLPTMNFGTGAKEHIENFKEKYVNKPVMCMEFWVGWFDAWGDDRHHT 252
Query: 268 RPVEDLAFAVARFFQRGGTFQNYYMYHGGTNFDRTSGGPFIS------TSYDYDAPLDEY 321
R D A + G N YM+HGGTNF +G + TSYDYDA L E
Sbjct: 253 RDASDAANELRDILNEGSV--NIYMFHGGTNFGFMNGANDLEELKPDVTSYDYDAILTEC 310
Query: 322 GLIRQPKWGHLKDLHKAIKLCEAALV 347
G + + + K + + ++ E L+
Sbjct: 311 GDLTEKYYEFKKVISEFTEIKEVELL 336
>gi|139439964|ref|ZP_01773301.1| Hypothetical protein COLAER_02339 [Collinsella aerofaciens ATCC
25986]
gi|133774730|gb|EBA38550.1| glycosyl hydrolase family 35 [Collinsella aerofaciens ATCC 25986]
Length = 598
Score = 162 bits (411), Expect = 6e-37, Method: Compositional matrix adjust.
Identities = 105/301 (34%), Positives = 146/301 (48%), Gaps = 25/301 (8%)
Query: 43 LISGSIHYPRSTPEMWPDLIQKSKDGGLDVIETYVFWNLHEPVRNQYNFEGRYDLVKFVK 102
++SG+IHY R P W + K G + +ETYV WNLHEP ++F G DL F+
Sbjct: 19 ILSGAIHYMRVHPSDWHHSLYNLKALGFNTVETYVPWNLHEPKPGVFDFSGSIDLAAFLD 78
Query: 103 LVAEAGLYAHLRIGPYVCAEWNFGGFPLWLHFIPGIQFRTDNEPFKAEMQRFTAKIVDMM 162
A GLYA +R P++CAEW FGG P WL ++ R+ + F A + ++ ++ ++
Sbjct: 79 EAASLGLYAIVRPSPFICAEWEFGGMPAWLLREHDMRPRSSDPKFLAHVAQYYDHLMPIL 138
Query: 163 KQEKLYASQGGPIILSQIENEYGNI--DSAY-GAAGKSYIKWAAGMALSLDTGVPWVMCQ 219
++ +GG II+ Q+ENEYG+ D Y A + ++ + L G PW C
Sbjct: 139 VSRQI--DKGGNIIMMQVENEYGSYCEDKDYLRAIRRLMVERGVSVPLCTSDG-PWRGCL 195
Query: 220 QSDAPDPIINTCNGFYCDQFTPN-----------SNNKPKMWTENWSGWFLSFGGAVPYR 268
++ C G + N P M E W GWF +G V R
Sbjct: 196 RAGTLIDDDVLCTGNFGSHAKENFEALSAFHKEHGKQWPLMCMELWDGWFNRYGENVIRR 255
Query: 269 PVEDLAFAVARFFQRGGTFQNYYMYHGGTNFD-------RTSGGPFISTSYDYDAPLDEY 321
EDLA V + GG+ N YM+HGGTNF R + TSYDYDAPLDE
Sbjct: 256 DPEDLASCVREVLELGGSL-NLYMFHGGTNFGFMNGCSARHTHDLHQVTSYDYDAPLDEQ 314
Query: 322 G 322
G
Sbjct: 315 G 315
Score = 53.9 bits (128), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 63/228 (27%), Positives = 88/228 (38%), Gaps = 53/228 (23%)
Query: 514 FINGKLVGSGYGSSSNAKVTVDFPIALAPGKNTFDLLSLTVGLQNYGAFYEKTGAGITGP 573
F+NG V + Y + D L N D+L+ +G NYG +
Sbjct: 416 FVNGDKVATQY----QEHIGEDIHCVLPCEHNRLDVLTEDMGRVNYGH-------KLLAD 464
Query: 574 VQLKGSGNGTNIDLSSQQWTYQTGLKGEELNFPSGSSTQWDSKSTLPKLQPLVWYKTTFD 633
Q KG G +DL + TG + L P + D + + QP +Y+ FD
Sbjct: 465 TQHKGIRTGVCVDLH-----FVTGWEMRCL--PLDNIDNLDYSAGWVEGQP-SFYRAKFD 516
Query: 634 APAGSEPVA--IDFTGMGKGEAWVNGQSIGRYWPTYVSQNGGCTDSCNYRGAYSSNKCLK 691
SEP ID TG GKG A+VNG ++GR+W
Sbjct: 517 I---SEPADTFIDTTGFGKGVAFVNGTNVGRFWDK------------------------- 548
Query: 692 NCGKPSQSLYHVPRSWLKSSGNTLVLFEEIGGDPTKISFVTKQLGSSL 739
P +LY VP L N LV+FE G KIS ++ + +L
Sbjct: 549 ---GPIMTLY-VPHGLLHPGTNELVMFETEGVYDAKISLRSEPVIRTL 592
>gi|81889875|sp|Q5XIL5.1|GLBL3_RAT RecName: Full=Beta-galactosidase-1-like protein 3
gi|53734228|gb|AAH83665.1| Galactosidase, beta 1-like 3 [Rattus norvegicus]
Length = 631
Score = 162 bits (411), Expect = 6e-37, Method: Compositional matrix adjust.
Identities = 107/307 (34%), Positives = 152/307 (49%), Gaps = 32/307 (10%)
Query: 36 IGGKRRVLISGSIHYPRSTPEMWPDLIQKSKDGGLDVIETYVFWNLHEPVRNQYNFEGRY 95
+ G + +++ GSIHY R E W D + K + G + + TY+ WNLHE R +++F
Sbjct: 58 LEGHKFMIVGGSIHYFRVPREYWKDRLLKLQACGFNTVTTYIPWNLHEQERGKFDFSEIL 117
Query: 96 DLVKFVKLVAEAGLYAHLRIGPYVCAEWNFGGFPLWLHFIPGIQFRTDNEPFKAEMQRFT 155
DL +V L GL+ LR GPY+CAE + GG P WL PG RT N+ F + ++
Sbjct: 118 DLEAYVLLAKTLGLWVILRPGPYICAEVDLGGLPSWLLRNPGSNLRTTNKDFIEAVDKYF 177
Query: 156 AKIVDMMKQEKLYASQGGPIILSQIENEYGNI--DSAYGAAGKSYIKWAAGMALSLDTGV 213
++ K L +GGP+I Q+ENEYG+ D Y YIK A L+ G+
Sbjct: 178 DHLIP--KILPLQYRRGGPVIAVQVENEYGSFRNDKNY----MEYIKKAL-----LNRGI 226
Query: 214 PWVMCQQSDAPDPIINTCNG---------FYCDQFTP---NSNNKPKMWTENWSGWFLSF 261
++ + I + G F D F N+KP M E W+GW+ S+
Sbjct: 227 VELLLTSDNESGIRIGSVKGALATINVNSFIKDSFVKLHRMQNDKPIMIMEYWTGWYDSW 286
Query: 262 GGAVPYRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNFDRTSGGPF------ISTSYDYD 315
G + ++ + RFF G +F N YM+HGGTNF +GG + TSYDYD
Sbjct: 287 GSKHTEKSANEIRRTIYRFFSYGLSF-NVYMFHGGTNFGFINGGYHENGHTNVVTSYDYD 345
Query: 316 APLDEYG 322
A L E G
Sbjct: 346 AVLSEAG 352
>gi|164519029|ref|NP_001019529.2| beta-galactosidase-1-like protein 3 precursor [Rattus norvegicus]
Length = 644
Score = 162 bits (411), Expect = 6e-37, Method: Compositional matrix adjust.
Identities = 107/307 (34%), Positives = 152/307 (49%), Gaps = 32/307 (10%)
Query: 36 IGGKRRVLISGSIHYPRSTPEMWPDLIQKSKDGGLDVIETYVFWNLHEPVRNQYNFEGRY 95
+ G + +++ GSIHY R E W D + K + G + + TY+ WNLHE R +++F
Sbjct: 71 LEGHKFMIVGGSIHYFRVPREYWKDRLLKLQACGFNTVTTYIPWNLHEQERGKFDFSEIL 130
Query: 96 DLVKFVKLVAEAGLYAHLRIGPYVCAEWNFGGFPLWLHFIPGIQFRTDNEPFKAEMQRFT 155
DL +V L GL+ LR GPY+CAE + GG P WL PG RT N+ F + ++
Sbjct: 131 DLEAYVLLAKTLGLWVILRPGPYICAEVDLGGLPSWLLRNPGSNLRTTNKDFIEAVDKYF 190
Query: 156 AKIVDMMKQEKLYASQGGPIILSQIENEYGNI--DSAYGAAGKSYIKWAAGMALSLDTGV 213
++ K L +GGP+I Q+ENEYG+ D Y YIK A L+ G+
Sbjct: 191 DHLIP--KILPLQYRRGGPVIAVQVENEYGSFRNDKNY----MEYIKKAL-----LNRGI 239
Query: 214 PWVMCQQSDAPDPIINTCNG---------FYCDQFTP---NSNNKPKMWTENWSGWFLSF 261
++ + I + G F D F N+KP M E W+GW+ S+
Sbjct: 240 VELLLTSDNESGIRIGSVKGALATINVNSFIKDSFVKLHRMQNDKPIMIMEYWTGWYDSW 299
Query: 262 GGAVPYRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNFDRTSGGPF------ISTSYDYD 315
G + ++ + RFF G +F N YM+HGGTNF +GG + TSYDYD
Sbjct: 300 GSKHTEKSANEIRRTIYRFFSYGLSF-NVYMFHGGTNFGFINGGYHENGHTNVVTSYDYD 358
Query: 316 APLDEYG 322
A L E G
Sbjct: 359 AVLSEAG 365
>gi|260912222|ref|ZP_05918774.1| beta-galactosidase [Prevotella sp. oral taxon 472 str. F0295]
gi|260633656|gb|EEX51794.1| beta-galactosidase [Prevotella sp. oral taxon 472 str. F0295]
Length = 627
Score = 162 bits (410), Expect = 7e-37, Method: Compositional matrix adjust.
Identities = 112/325 (34%), Positives = 159/325 (48%), Gaps = 28/325 (8%)
Query: 35 VIGGKRRVLISGSIHYPRSTPEMWPDLIQKSKDGGLDVIETYVFWNLHEPVRNQYNFE-G 93
V GK L SG +HY R W ++ K GL+ + TYVFWN HE +++++ G
Sbjct: 43 VYNGKPMQLHSGEMHYARVPAPYWRHRMKMMKAMGLNAVATYVFWNYHETEPGKWDWKTG 102
Query: 94 RYDLVKFVKLVAEAGLYAHLRIGPYVCAEWNFGGFPLWLHFIPGIQFRTDNEPFKAEMQR 153
+L +FVK AE G+ LR GPY CAEW+FGG+P WL G+ R DN+PF +
Sbjct: 103 NRNLRQFVKTAAEEGMLVILRPGPYCCAEWDFGGYPWWLSKAKGLVIRADNQPFLDSCRV 162
Query: 154 FTAKIVDMMKQEKLYASQGGPIILSQIENEYGN-IDSAYGAAGKSYIKWAAGMALSL-DT 211
+ ++ M+ L ++GGPII+ Q ENE+G+ + +S+ ++A + L D
Sbjct: 163 YINQLASQMRD--LQITKGGPIIMVQAENEFGSYVAQRKDVPLESHRAYSAKIKQQLIDA 220
Query: 212 G--VPWVMCQQS--------DAPDPIINTCNGFYCDQFTPNSNN---KPKMWTENWSGWF 258
G VP S + P N N + N N P M E + GW
Sbjct: 221 GFDVPLFTSDGSWLFKGGTIEGALPTANGENDIEKLKKVVNEYNGGKGPYMVAEFYPGWL 280
Query: 259 LSFGGAVPYRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNFDRTSGGPFIS--------T 310
+ P E + A++ + G +F NYYM HGGTNF TSG + + T
Sbjct: 281 SHWAEPFPQVSTESIVKQTAKYLENGVSF-NYYMVHGGTNFGFTSGANYTTATNLQSDLT 339
Query: 311 SYDYDAPLDEYGLIRQPKWGHLKDL 335
SYDYDAP+ E G PK+ L+ L
Sbjct: 340 SYDYDAPISEAGW-NTPKYDALRAL 363
Score = 43.5 bits (101), Expect = 0.49, Method: Compositional matrix adjust.
Identities = 50/219 (22%), Positives = 86/219 (39%), Gaps = 50/219 (22%)
Query: 502 LHVQSLGHALHAFINGKLVGSGYGSSSNAKVTVDFPIALAPGKNTFDLLSLTVGLQNYGA 561
+ + L ++NG+ VG S + ++ P D+L +G NYGA
Sbjct: 437 MKIAGLADYALVYVNGQKVGELDRVSDVDSIEINMPF-----NGVLDILVENMGRINYGA 491
Query: 562 FYEKTGAGITGPVQLKGSGNGTNIDLSSQQWTYQTGLKGEELNFPSGSSTQWDSKSTLPK 621
++ GI GPV + G+ +++ Y+ + P ++ + LP
Sbjct: 492 RIPQSIKGINGPVVIDGN------EITGNWQMYKLPMN----EAPDVNALPTANNKGLPT 541
Query: 622 LQPLVWYKTTFDAPAGSEPVAIDFTGMGKGEAWVNGQSIGRYWPTYVSQNGGCTDSCNYR 681
L Y TF+ + ++ GKG ++NG ++GRYW R
Sbjct: 542 L-----YSGTFNLDTTGDTF-LNMETWGKGIVFINGFNLGRYWK---------------R 580
Query: 682 GAYSSNKCLKNCGKPSQSLYHVPRSWLKSSGNTLVLFEE 720
G P Q+LY +P +LK N +V+FE+
Sbjct: 581 G-------------PQQTLY-LPGCFLKKGENKIVVFEQ 605
>gi|402895882|ref|XP_003911041.1| PREDICTED: beta-galactosidase-1-like protein 2 [Papio anubis]
Length = 636
Score = 162 bits (410), Expect = 7e-37, Method: Compositional matrix adjust.
Identities = 114/310 (36%), Positives = 155/310 (50%), Gaps = 17/310 (5%)
Query: 43 LISGSIHYPRSTPEMWPDLIQKSKDGGLDVIETYVFWNLHEPVRNQYNFEGRYDLVKFVK 102
+ GSIHY R E W D + K K GL+ + TYV WNLHEP R +++F G DL FV
Sbjct: 63 IFGGSIHYFRVPREYWRDRLLKMKACGLNTLTTYVPWNLHEPERGKFDFSGNLDLEAFVL 122
Query: 103 LVAEAGLYAHLRIGPYVCAEWNFGGFPLWLHFIPGIQFRTDNEPFKAEMQRFTAKIVDMM 162
+ AE GL+ LR GPY+C+E + GG P WL PG++ RT + F + + + M
Sbjct: 123 MAAEIGLWVILRPGPYICSEMDLGGLPSWLLQDPGMRLRTTYKGFTEAVDLYFDHL--MS 180
Query: 163 KQEKLYASQGGPIILSQIENEYG--NIDSAYGAAGKSYIKWAAGMALSLDT----GVPWV 216
+ L +GGPII Q+ENEYG N D AY A K ++ + L L + G+
Sbjct: 181 RVVPLQYKRGGPIIAVQVENEYGSYNKDPAYMAYVKKALEDRGIVELLLTSDNKDGLSKG 240
Query: 217 MCQQSDAPDPIINTCNGFYCDQFTPN-SNNKPKMWTENWSGWFLSFGGAVPYRPVEDLAF 275
+ Q A + +T F N +PKM E W+GWF S+GG ++
Sbjct: 241 IVQGVLATINLQSTRELQLLTTFLFNVQGTQPKMVMEYWTGWFDSWGGPHNILDSSEVLK 300
Query: 276 AVARFFQRGGTFQNYYMYHGGTNFDRTSGGPFIS------TSYDYDAPLDEYGLIRQPKW 329
V+ G + N YM+HGGTNF +G TSYDYDA L E G K+
Sbjct: 301 TVSAIVDAGSSI-NLYMFHGGTNFGFMNGAMHFHDYKSDVTSYDYDAVLTEAG-DYTAKY 358
Query: 330 GHLKDLHKAI 339
L+D +I
Sbjct: 359 MKLRDFFGSI 368
>gi|373953412|ref|ZP_09613372.1| glycoside hydrolase family 35 [Mucilaginibacter paludis DSM 18603]
gi|373890012|gb|EHQ25909.1| glycoside hydrolase family 35 [Mucilaginibacter paludis DSM 18603]
Length = 610
Score = 162 bits (410), Expect = 7e-37, Method: Compositional matrix adjust.
Identities = 113/306 (36%), Positives = 154/306 (50%), Gaps = 21/306 (6%)
Query: 33 AVVIGGKRRVLISGSIHYPRSTPEMWPDLIQKSKDGGLDVIETYVFWNLHEPVRNQYNFE 92
A ++ GK +ISG +HYPR E W ++ +K GL+ I TYVFWNLHEP + ++F
Sbjct: 34 AFMLDGKPFQMISGEMHYPRVPREAWRARMKMAKAMGLNTIGTYVFWNLHEPQKGHFDFS 93
Query: 93 GRYDLVKFVKLVAEAGLYAHLRIGPYVCAEWNFGGFPLWLHFIPGIQFRTDNEPFKAEMQ 152
G D+ +FVK+ E GL+ LR PYVCAEW FGG+P WL G+ R+ + AE +
Sbjct: 94 GNNDVAEFVKIAKEEGLWVILRPSPYVCAEWEFGGYPYWLQNEKGLVVRSMEAQYIAEYR 153
Query: 153 RFTAKIVDMMKQEKLYASQGGPIILSQIENEYGNI--DSAYGAAGKSYIKWAAGMALSLD 210
++ ++ + L + GG I++ QIENEYG+ D AY A + K AAG L
Sbjct: 154 KYINEVGKQL--APLQINHGGNILMVQIENEYGSYGSDKAYLALNQQLFK-AAGFDGLLY 210
Query: 211 TGVPWVMCQQSDAPD--PIINTCNGFYCDQFTPNSNNK---PKMWTENWSGWFLSFGGAV 265
T P + P P IN + + N N+ P E + WF +G +
Sbjct: 211 TCDPGADVKNGHLPGLMPAINGVDDPAKVKKIINENHNGKGPYYIAEWYPAWFDWWGASH 270
Query: 266 PYRPVEDLAFAVARFFQRGGTFQNYYMYHGGT--------NF-DRTSGGPFISTSYDYDA 316
E + G + N YM+HGGT N+ D T P I TSYDYDA
Sbjct: 271 HTVAAEKYVGRLDTVLAAGISI-NMYMFHGGTTRAFMNGANYKDETPYEPQI-TSYDYDA 328
Query: 317 PLDEYG 322
PLDE G
Sbjct: 329 PLDEAG 334
Score = 58.9 bits (141), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 58/242 (23%), Positives = 99/242 (40%), Gaps = 49/242 (20%)
Query: 494 LEDGSKTVLHVQSLGHALHAFINGKLVGSGYGSSSNAKVTVDFPIALAPGKNTFDLLSLT 553
++ G VL + L +NGK +G+ +TV P G D+L
Sbjct: 412 IQGGKTGVLKLSDLRDYAVIMVNGKTIGTLDRRLKQDSMTVTLP----AGPVILDILVEN 467
Query: 554 VGLQNYGAFYEKTGAGITGPVQLKGSGNGTNIDLSSQQWTYQTGLKGEELNFPSGSSTQW 613
+G N+G + + GIT V G+ +++ Q + +++ F +G +
Sbjct: 468 MGRINFGKYLLENKKGITKAVFFNGA------EINKWQMFGLSLSDSKQIAFKAGVA--- 518
Query: 614 DSKSTLPKLQPLVWYKTTFDAPAGSEPVAIDFTGMGKGEAWVNGQSIGRYWPTYVSQNGG 673
+ LP + K TF+ ++ ID + GKG WVNG ++GRYW
Sbjct: 519 -AGGNLPTFK-----KGTFNLQKIAD-TYIDLSKWGKGVVWVNGHNLGRYW--------- 562
Query: 674 CTDSCNYRGAYSSNKCLKNCGKPSQSLYHVPRSWLKSSGNTLVLFEEIGGDPTKISFVTK 733
N G P Q+LY +P WLK N +++FE + + + +S + K
Sbjct: 563 ------------------NIG-PEQTLY-LPAEWLKKGANEIIVFELLKPESSNLSAIEK 602
Query: 734 QL 735
+
Sbjct: 603 PI 604
>gi|261406481|ref|YP_003242722.1| beta-galactosidase [Paenibacillus sp. Y412MC10]
gi|261282944|gb|ACX64915.1| Beta-galactosidase [Paenibacillus sp. Y412MC10]
Length = 619
Score = 162 bits (410), Expect = 7e-37, Method: Compositional matrix adjust.
Identities = 99/309 (32%), Positives = 156/309 (50%), Gaps = 16/309 (5%)
Query: 27 VTYDHRAVVIGGKRRVLISGSIHYPRSTPEMWPDLIQKSKDGGLDVIETYVFWNLHEPVR 86
+T+++ ++ G+ +ISG+IHY R PE W D + K K G + +ETY+ WN+HEP
Sbjct: 4 LTWENGQYLLDGQPYRIISGAIHYFRVVPEYWEDRLLKLKACGFNTVETYIAWNVHEPQE 63
Query: 87 NQYNFEGRYDLVKFVKLVAEAGLYAHLRIGPYVCAEWNFGGFPLWLHFIPGIQFRTDNEP 146
++NF G D+ F++L + GL+ +R P++CAEW FGG P WL I+ R +
Sbjct: 64 GEFNFSGMADVASFIELAGKLGLHVIVRPSPFICAEWEFGGLPGWLLGYGEIRLRCSDPL 123
Query: 147 FKAEMQRFTAKIVDMMKQEKLYASQGGPIILSQIENEYGNI--DSAY-GAAGKSYIKWAA 203
+ +++ + +++ + L ++ GGPI+ Q+ENEYG+ D AY + ++
Sbjct: 124 YLSKVDHYYDELIPQLV--PLLSTHGGPILAVQVENEYGSYGNDHAYLEYLREGLVRRGV 181
Query: 204 GMALSLDTGVPWVMCQQSDAPDPIINTCNGFYCDQ----FTPNSNNKPKMWTENWSGWFL 259
+ L G M D G ++ + +P M E W+GWF
Sbjct: 182 DVLLFTSDGPTDEMLLGGTLSDVHATVNFGSRVEESFRKYREYRAEEPLMVMEFWNGWFD 241
Query: 260 SFGGAVPYRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNFDRTSGGPFI------STSYD 313
+ R D+A + + G + N YM+HGGTNF SG I +TSYD
Sbjct: 242 HWMEDHHVRDAADVAGVLDEMLEMGSSM-NMYMFHGGTNFGFYSGANHIQAYEPTTTSYD 300
Query: 314 YDAPLDEYG 322
YDAPL E+G
Sbjct: 301 YDAPLTEWG 309
>gi|418519416|ref|ZP_13085468.1| beta-galactosidase [Xanthomonas axonopodis pv. malvacearum str.
GSPB2388]
gi|410704860|gb|EKQ63339.1| beta-galactosidase [Xanthomonas axonopodis pv. malvacearum str.
GSPB2388]
Length = 613
Score = 162 bits (410), Expect = 7e-37, Method: Compositional matrix adjust.
Identities = 126/360 (35%), Positives = 173/360 (48%), Gaps = 34/360 (9%)
Query: 1 MASKEILLLVLCWGFVV-----LATTSFGANVTYDHRAVVIGGKRRVLISGSIHYPRSTP 55
M + LVL F + A T N V GK L+SG+IH+ R
Sbjct: 1 MLRTTLAPLVLALAFALPITGTAAETERWPNFGTQGTQFVRDGKPYQLLSGAIHFQRIPR 60
Query: 56 EMWPDLIQKSKDGGLDVIETYVFWNLHEPVRNQYNFEGRYDLVKFVKLVAEAGLYAHLRI 115
W D +QK++ GL+ +ETYVFWNL EP + Q++F G D+ FV+ A GL LR
Sbjct: 61 AYWKDRLQKARALGLNTVETYVFWNLVEPQQGQFDFSGHNDVAAFVREAAAQGLNVILRP 120
Query: 116 GPYVCAEWNFGGFPLWLHFIPGIQFRTDNEPFKAEMQRFTAKIVDMMKQEKLYASQGGPI 175
GPY CAEW GG+P WL I+ R+ + F A Q + + + + + L GGPI
Sbjct: 121 GPYACAEWEAGGYPAWLFGKGNIRVRSRDPRFLAASQAYLDALANQV--QPLLNHNGGPI 178
Query: 176 ILSQIENEYGNI--DSAYGAAGKS-YIKWAAGMALSLDTGVPWVMCQQSDAPD--PIINT 230
I Q+ENEYG+ D AY A ++ Y+K AL L T M PD ++N
Sbjct: 179 IAVQVENEYGSYADDHAYMADNRAMYVKAGFDKAL-LFTSDGADMLANGTLPDTLAVVNF 237
Query: 231 CNG---FYCDQFTPNSNNKPKMWTENWSGWFLSFGGAVPYRPVEDLAFAVARFFQ---RG 284
G D+ ++P+M E W+GWF +G P+ + A A F+ R
Sbjct: 238 APGEAKSAFDKLIKFRPDQPRMVGEYWAGWFDHWGK--PHAATD--ARQQAEEFEWILRQ 293
Query: 285 GTFQNYYMYHGGTNFDRTSGGPF----------ISTSYDYDAPLDEYGLIRQPKWGHLKD 334
G N YM+ GGT+F +G F +TSYDYDA LDE G PK+ ++D
Sbjct: 294 GHSANLYMFIGGTSFGFMNGANFQNNPSDHYAPQTTSYDYDAILDEAGHP-TPKFALMRD 352
>gi|21243811|ref|NP_643393.1| beta-galactosidase [Xanthomonas axonopodis pv. citri str. 306]
gi|390989312|ref|ZP_10259611.1| beta-galactosidase [Xanthomonas axonopodis pv. punicae str. LMG
859]
gi|21109406|gb|AAM37929.1| beta-galactosidase [Xanthomonas axonopodis pv. citri str. 306]
gi|372556070|emb|CCF66586.1| beta-galactosidase [Xanthomonas axonopodis pv. punicae str. LMG
859]
Length = 613
Score = 162 bits (410), Expect = 7e-37, Method: Compositional matrix adjust.
Identities = 126/360 (35%), Positives = 173/360 (48%), Gaps = 34/360 (9%)
Query: 1 MASKEILLLVLCWGFVV-----LATTSFGANVTYDHRAVVIGGKRRVLISGSIHYPRSTP 55
M + LVL F + A T N V GK L+SG+IH+ R
Sbjct: 1 MLRTTLAPLVLALAFALPITGTAAETERWPNFGTQGTQFVRDGKPYQLLSGAIHFQRIPR 60
Query: 56 EMWPDLIQKSKDGGLDVIETYVFWNLHEPVRNQYNFEGRYDLVKFVKLVAEAGLYAHLRI 115
W D +QK++ GL+ +ETYVFWNL EP + Q++F G D+ FV+ A GL LR
Sbjct: 61 AYWKDRLQKARALGLNTVETYVFWNLVEPQQGQFDFSGHNDVAAFVREAAAQGLNVILRP 120
Query: 116 GPYVCAEWNFGGFPLWLHFIPGIQFRTDNEPFKAEMQRFTAKIVDMMKQEKLYASQGGPI 175
GPY CAEW GG+P WL I+ R+ + F A Q + + + + + L GGPI
Sbjct: 121 GPYACAEWEAGGYPAWLFGKGNIRVRSRDPRFLAASQAYLDALANQV--QPLLNHNGGPI 178
Query: 176 ILSQIENEYGNI--DSAYGAAGKS-YIKWAAGMALSLDTGVPWVMCQQSDAPD--PIINT 230
I Q+ENEYG+ D AY A ++ Y+K AL L T M PD ++N
Sbjct: 179 IAVQVENEYGSYADDHAYMADNRAMYVKAGFDKAL-LFTSDGADMLANGTLPDTLAVVNF 237
Query: 231 CNG---FYCDQFTPNSNNKPKMWTENWSGWFLSFGGAVPYRPVEDLAFAVARFFQ---RG 284
G D+ ++P+M E W+GWF +G P+ + A A F+ R
Sbjct: 238 APGEAKSAFDKLIKFRPDQPRMVGEYWAGWFDHWGK--PHAATD--ARQQAEEFEWILRQ 293
Query: 285 GTFQNYYMYHGGTNFDRTSGGPF----------ISTSYDYDAPLDEYGLIRQPKWGHLKD 334
G N YM+ GGT+F +G F +TSYDYDA LDE G PK+ ++D
Sbjct: 294 GHSANLYMFIGGTSFGFMNGANFQNNPSDHYAPQTTSYDYDAILDEAGHP-TPKFALMRD 352
>gi|418518035|ref|ZP_13084189.1| beta-galactosidase [Xanthomonas axonopodis pv. malvacearum str.
GSPB1386]
gi|410705285|gb|EKQ63761.1| beta-galactosidase [Xanthomonas axonopodis pv. malvacearum str.
GSPB1386]
Length = 613
Score = 162 bits (410), Expect = 8e-37, Method: Compositional matrix adjust.
Identities = 124/352 (35%), Positives = 172/352 (48%), Gaps = 32/352 (9%)
Query: 7 LLLVLCWGFVVLAT---TSFGANVTYDHRAVVIGGKRRVLISGSIHYPRSTPEMWPDLIQ 63
L+L L + + T T N V GK L+SG+IH+ R W D +Q
Sbjct: 9 LVLALAFALPITGTAAETERWPNFGTQGTQFVRDGKPYQLLSGAIHFQRIPRAYWKDRLQ 68
Query: 64 KSKDGGLDVIETYVFWNLHEPVRNQYNFEGRYDLVKFVKLVAEAGLYAHLRIGPYVCAEW 123
K++ GL+ +ETYVFWNL EP + Q++F G D+ FV+ A GL LR GPY CAEW
Sbjct: 69 KARALGLNTVETYVFWNLVEPQQGQFDFSGHNDVAAFVREAAAQGLNVILRPGPYACAEW 128
Query: 124 NFGGFPLWLHFIPGIQFRTDNEPFKAEMQRFTAKIVDMMKQEKLYASQGGPIILSQIENE 183
GG+P WL I+ R+ + F A Q + + + + + L GGPII Q+ENE
Sbjct: 129 EAGGYPAWLFGKGNIRVRSRDPRFLAASQAYLDALANQV--QPLLNHNGGPIIAVQVENE 186
Query: 184 YGNI--DSAYGAAGKS-YIKWAAGMALSLDTGVPWVMCQQSDAPD--PIINTCNG---FY 235
YG+ D AY A ++ Y+K AL L T M PD ++N G
Sbjct: 187 YGSYADDHAYMADNRAMYVKAGFDKAL-LFTSDGADMLANGTLPDTLAVVNFAPGEAKSA 245
Query: 236 CDQFTPNSNNKPKMWTENWSGWFLSFGGAVPYRPVEDLAFAVARFFQ---RGGTFQNYYM 292
D+ ++P+M E W+GWF +G P+ + A A F+ R G N YM
Sbjct: 246 FDKLIKFRPDQPRMVGEYWAGWFDHWGK--PHAATD--ARQQAEEFEWILRQGHSANLYM 301
Query: 293 YHGGTNFDRTSGGPF----------ISTSYDYDAPLDEYGLIRQPKWGHLKD 334
+ GGT+F +G F +TSYDYDA LDE G PK+ ++D
Sbjct: 302 FIGGTSFGFMNGANFQNNPSDHYAPQTTSYDYDAILDEAGHP-TPKFALMRD 352
>gi|16611713|gb|AAL27306.1|AF376481_1 BgaC [Carnobacterium maltaromaticum]
Length = 586
Score = 162 bits (410), Expect = 8e-37, Method: Compositional matrix adjust.
Identities = 107/305 (35%), Positives = 150/305 (49%), Gaps = 38/305 (12%)
Query: 43 LISGSIHYPRSTPEMWPDLIQKSKDGGLDVIETYVFWNLHEPVRNQYNFEGRYDLVKFVK 102
+ISG+IHY R PE W ++ K+ G + +ETYV WN HEP + QY F DL +F++
Sbjct: 19 IISGAIHYFRVVPEYWEHRLKLLKNMGCNTVETYVAWNQHEPKKGQYVFSDALDLRRFIQ 78
Query: 103 LVAEAGLYAHLRIGPYVCAEWNFGGFPLWLHFIPGIQFRTDNEPFKAEMQRFTAKIVDMM 162
L GL LR PY+CAE+ FGG P WL ++ R+ PF M+R ++
Sbjct: 79 LADSLGLKVILRPSPYICAEFEFGGLPAWLLKDRHMRVRSTYPPF---MERVRLYYRELF 135
Query: 163 KQE-KLYASQGGPIILSQIENEYGNIDSAYGAAGKSYIKWAAGMALSLDTGVPWV----- 216
K+ L + GGPIIL Q+ENEYG YG+ K Y++ M VP V
Sbjct: 136 KEVIDLQITSGGPIILMQVENEYG----GYGSE-KKYLQELVTMMKENGVTVPLVTSDGP 190
Query: 217 ---MCQQSDAPDPIINTCNGFYCDQFTPNSNNK---------PKMWTENWSGWFLSFGGA 264
M + + + T N C P ++ P M E W GWF ++
Sbjct: 191 WGDMLENGSLQESALPTVN---CGSAIPEHFDRLAAFKQKKGPLMVMEYWIGWFDAWQDK 247
Query: 265 VPYRP-VEDLAFAVARFFQRGGTFQNYYMYHGGTNFDRTSGGPFI------STSYDYDAP 317
+ V+ ++ +RG N+YM+HGGTNF +G + +TSYDYDAP
Sbjct: 248 KHHTTDVKSSVESLEEILKRGSV--NFYMFHGGTNFGFMNGANYYGKLLPDTTSYDYDAP 305
Query: 318 LDEYG 322
L+EYG
Sbjct: 306 LNEYG 310
>gi|148693363|gb|EDL25310.1| mCG125130, isoform CRA_b [Mus musculus]
Length = 688
Score = 162 bits (410), Expect = 8e-37, Method: Compositional matrix adjust.
Identities = 111/328 (33%), Positives = 161/328 (49%), Gaps = 36/328 (10%)
Query: 15 FVVLATTSFGANVTYDHRAVVIGGKRRVLISGSIHYPRSTPEMWPDLIQKSKDGGLDVIE 74
FV L+T + Y + G + +++ GSIHY R E W D + K + G + +
Sbjct: 80 FVGLSTKTNALGKAY----FTLEGHKFMIVGGSIHYFRVPREYWKDRLLKLQACGFNTVT 135
Query: 75 TYVFWNLHEPVRNQYNFEGRYDLVKFVKLVAEAGLYAHLRIGPYVCAEWNFGGFPLWLHF 134
TY+ WNLHE R +++F DL +V L GL+ LR GPY+CAE + GG P WL
Sbjct: 136 TYIPWNLHEQERGKFDFSEILDLEAYVLLAKTIGLWVILRPGPYICAEVDLGGLPSWLLR 195
Query: 135 IPGIQFRTDNEPFKAEMQRFTAKIVDMMKQEKLYASQGGPIILSQIENEYGNI--DSAYG 192
P RT N+ F + ++ ++ K L GGP+I Q+ENEYG+ D Y
Sbjct: 196 NPVTDLRTTNKGFIEAVDKYFDHLIP--KILPLQYRHGGPVIAVQVENEYGSFQKDRNY- 252
Query: 193 AAGKSYIKWAAGMALSLDTGVPWVMCQQSDAPDPIINTCNG----FYCDQFTPNS----- 243
+Y+K A L G+ ++ D I + NG + FT +S
Sbjct: 253 ---MNYLKKAL-----LKRGIVELLLTSDDKDGIQIGSVNGALTTINMNSFTKDSFIKLH 304
Query: 244 ---NNKPKMWTENWSGWFLSFGGAVPYRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNFD 300
++KP M E W+GW+ S+G + E++ V +F G +F N YM+HGGTNF
Sbjct: 305 KMQSDKPIMIMEYWTGWYDSWGSKHIEKSAEEIRHTVYKFISYGLSF-NMYMFHGGTNFG 363
Query: 301 RTSGGPF------ISTSYDYDAPLDEYG 322
+GG + + TSYDYDA L E G
Sbjct: 364 FINGGRYENHHISVVTSYDYDAVLSEAG 391
>gi|289768016|ref|ZP_06527394.1| beta-galactosidase [Streptomyces lividans TK24]
gi|289698215|gb|EFD65644.1| beta-galactosidase [Streptomyces lividans TK24]
Length = 595
Score = 162 bits (410), Expect = 8e-37, Method: Compositional matrix adjust.
Identities = 113/351 (32%), Positives = 167/351 (47%), Gaps = 23/351 (6%)
Query: 25 ANVTYDHRAVVIGGKRRVLISGSIHYPRSTPEMWPDLIQKSKDGGLDVIETYVFWNLHEP 84
+ ++Y ++ G+ L++GS+HY R P W D +++ GL+ ++TYV WN HE
Sbjct: 4 STLSYTDGTLLRNGRPHRLLAGSLHYFRVHPGHWADRLRRLAALGLNAVDTYVPWNFHER 63
Query: 85 VRNQYNFEGRYDLVKFVKLVAEAGLYAHLRIGPYVCAEWNFGGFPLWLHFIPGIQFRTDN 144
F+G DL +F++L E GL +R GPY+CAEW+ GG P WL PG++ RT +
Sbjct: 64 TAGDIRFDGPRDLARFIRLAQEEGLDVVVRPGPYICAEWDNGGLPAWLTGTPGMRLRTSH 123
Query: 145 EPFKAEMQRFTAKIVDMMKQEKLYASQGGPIILSQIENEYGNI--DSAYGAAGKSYIKWA 202
P+ + R+ +V + + L A +GGP++ QIENEYG+ D AY + +
Sbjct: 124 GPYLEAVDRWFDALVPRIAE--LQAGRGGPVVAVQIENEYGSYGDDRAYVRHIRDALVAR 181
Query: 203 AGMALSLDTGVPWVMCQQSDA-PDPIINTCNGFYCDQ----FTPNSNNKPKMWTENWSGW 257
L P + Q A P + G D+ +P E W+GW
Sbjct: 182 GITELLYTADGPTPLMQDGGALPGELAAATFGSRPDRAAALLRSRRPAEPFFCAEFWNGW 241
Query: 258 FLSFGGAVPYRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNFDRTSGG--------PFIS 309
F +G RP A + GG+ + YM HGGTNF +G P +
Sbjct: 242 FDHWGDKHHVRPAPSAAEDLGGILDEGGSV-SLYMAHGGTNFGLWAGANHEGGTIRPTV- 299
Query: 310 TSYDYDAPLDEYGLIRQPKWGHLKDLHKAIKLCEAAL-VATDPTYPSLGPN 359
TSYD DAP+ E G + PK+ L+D A+ A + DP P L P
Sbjct: 300 TSYDSDAPIAENGAL-TPKFFALRDRLTALGTVAARRPLPADP--PLLAPR 347
>gi|329927841|ref|ZP_08281902.1| beta-galactosidase [Paenibacillus sp. HGF5]
gi|328938242|gb|EGG34637.1| beta-galactosidase [Paenibacillus sp. HGF5]
Length = 619
Score = 162 bits (409), Expect = 8e-37, Method: Compositional matrix adjust.
Identities = 105/319 (32%), Positives = 166/319 (52%), Gaps = 36/319 (11%)
Query: 27 VTYDHRAVVIGGKRRVLISGSIHYPRSTPEMWPDLIQKSKDGGLDVIETYVFWNLHEPVR 86
+T+ + ++ G+ +ISG+IHY R PE W D + K K G + +ETY+ WN+HEP
Sbjct: 4 LTWGNGQYLLDGQPYRIISGAIHYFRVVPEYWEDRLLKLKACGFNTVETYIAWNVHEPQE 63
Query: 87 NQYNFEGRYDLVKFVKLVAEAGLYAHLRIGPYVCAEWNFGGFPLWLHFIPGIQFRTDNEP 146
+++F G D+ F++L + GL+ +R P++CAEW FGG P WL I+ R +
Sbjct: 64 GKFSFSGMADVASFIELAGKLGLHVIVRPSPFICAEWEFGGLPGWLLGYGEIRLRCSDPL 123
Query: 147 FKAEMQRFTAKIVDMMKQEKLYASQGGPIILSQIENEYGNI--DSAYGAAGKSYIKWAAG 204
+ +++ + +++ + L +S GGPI+ Q+ENEYG+ D AY Y++ AG
Sbjct: 124 YLSKVDHYYDELIPRLV--PLLSSNGGPILAVQVENEYGSYGNDHAY----LDYLR--AG 175
Query: 205 MALSLDTGVPWVMCQQSDAP-DPII--NTCNGFYC------------DQFTPNSNNKPKM 249
+ + G+ V+ SD P D ++ T N + ++ +P M
Sbjct: 176 L---VRRGID-VLLFTSDGPTDEMLLGGTLNDVHATVNFGSRVEESFRKYREYRTEEPLM 231
Query: 250 WTENWSGWFLSFGGAVPYRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNFDRTSGGPFI- 308
E W+GWF + R D+A + ++G + N YM+HGGTNF SG I
Sbjct: 232 VMEFWNGWFDHWMEDHHVRDAADVAGVLDEMLEKGSSM-NMYMFHGGTNFGFYSGANHIQ 290
Query: 309 -----STSYDYDAPLDEYG 322
+TSYDYDAPL E+G
Sbjct: 291 TYEPTTTSYDYDAPLTEWG 309
>gi|21224660|ref|NP_630439.1| beta-galactosidase [Streptomyces coelicolor A3(2)]
gi|3367753|emb|CAA20078.1| beta-galactosidase [Streptomyces coelicolor A3(2)]
Length = 595
Score = 162 bits (409), Expect = 9e-37, Method: Compositional matrix adjust.
Identities = 112/347 (32%), Positives = 165/347 (47%), Gaps = 22/347 (6%)
Query: 25 ANVTYDHRAVVIGGKRRVLISGSIHYPRSTPEMWPDLIQKSKDGGLDVIETYVFWNLHEP 84
+ ++Y ++ G+ L++GS+HY R P W D +++ GL+ ++TYV WN HE
Sbjct: 4 STLSYTDGTLLRNGRPHRLLAGSLHYFRVHPGHWADRLRRLAALGLNAVDTYVPWNFHER 63
Query: 85 VRNQYNFEGRYDLVKFVKLVAEAGLYAHLRIGPYVCAEWNFGGFPLWLHFIPGIQFRTDN 144
F+G DL +F++L E GL +R GPY+CAEW+ GG P WL PG++ RT +
Sbjct: 64 TAGDIRFDGPRDLARFIRLAQEEGLDVVVRPGPYICAEWDNGGLPAWLTGTPGMRLRTSH 123
Query: 145 EPFKAEMQRFTAKIVDMMKQEKLYASQGGPIILSQIENEYGNI--DSAYGAAGKSYIKWA 202
P+ + R+ +V + + L A +GGP++ QIENEYG+ D AY + +
Sbjct: 124 GPYLEAVDRWFDALVPRIAE--LQAGRGGPVVAVQIENEYGSYGDDRAYVRHIRDALVAR 181
Query: 203 AGMALSLDTGVPWVMCQQSDA-PDPIINTCNGFYCDQ----FTPNSNNKPKMWTENWSGW 257
L P + Q A P + G D+ +P E W+GW
Sbjct: 182 GITELLYTADGPTPLMQDGGALPGELAAATFGSRPDRAAALLRSRRPAEPFFCAEFWNGW 241
Query: 258 FLSFGGAVPYRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNFDRTSGG--------PFIS 309
F +G RP A + GG+ + YM HGGTNF +G P +
Sbjct: 242 FDHWGDKHHVRPAPSAAEDLGGILDEGGSV-SLYMAHGGTNFGLWAGANHEGGTIRPTV- 299
Query: 310 TSYDYDAPLDEYGLIRQPKWGHLKDLHKAIKLCEAALVATDPTYPSL 356
TSYD DAP+ E G + PK+ L+D + L AA P P L
Sbjct: 300 TSYDSDAPIAENGAL-TPKFFALRD--RLTALGTAATRRPLPADPPL 343
>gi|224027078|ref|ZP_03645444.1| hypothetical protein BACCOPRO_03839 [Bacteroides coprophilus DSM
18228]
gi|224020314|gb|EEF78312.1| hypothetical protein BACCOPRO_03839 [Bacteroides coprophilus DSM
18228]
Length = 783
Score = 162 bits (409), Expect = 9e-37, Method: Compositional matrix adjust.
Identities = 109/324 (33%), Positives = 159/324 (49%), Gaps = 29/324 (8%)
Query: 31 HRAVVIGGKRRVLISGSIHYPRSTPEMWPDLIQKSKDGGLDVIETYVFWNLHEPVRNQYN 90
++ ++ GK ++ + IHY R E W I+ K G++ I Y FWN+HE +++
Sbjct: 37 NKEFLLNGKPFLIKAAEIHYTRIPAEYWEHRIEMCKALGMNTICIYAFWNIHEQRPGEFD 96
Query: 91 FEGRYDLVKFVKLVAEAGLYAHLRIGPYVCAEWNFGGFPLWLHFIPGIQFRTDNEPFKAE 150
FEG+ D+ +F +L + G+Y LR GPYVC+EW GG P WL I RT + F
Sbjct: 97 FEGQNDVARFCRLAQKHGMYIMLRPGPYVCSEWEMGGLPWWLLKKKDIALRTSDPYFLER 156
Query: 151 MQRFTAKIVDMMKQEKLYASQGGPIILSQIENEYGNI--DSAYGAAGKSYIKWAAGMALS 208
+ F ++ + L A +GG II+ Q+ENEYG D Y A+ + ++ AG
Sbjct: 157 TKIFMNELGKQLAD--LQAPRGGNIIMVQVENEYGAYAEDKEYIASIRDIVR-GAGF--- 210
Query: 209 LDTGVPWVMCQ-----QSDAPDPIINTCN---GFYCD-QFTPNSNNKPK---MWTENWSG 256
T VP C Q + D ++ T N G D QF +P+ M +E WSG
Sbjct: 211 --TDVPLFQCDWASTFQRNGLDDLLWTINFGTGADIDQQFKALREARPETPLMCSEYWSG 268
Query: 257 WFLSFGGAVPYRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNFDRTSGG-----PFISTS 311
WF +G RP + + + R +F + YM HGGT F G + +S
Sbjct: 269 WFDHWGRKHETRPADVMVKGIKDMMDRNISF-SLYMTHGGTTFGHWGGANSPSYSAMCSS 327
Query: 312 YDYDAPLDEYGLIRQPKWGHLKDL 335
YDYDAP+ E G PK+ L+DL
Sbjct: 328 YDYDAPISEAGWA-TPKYYQLRDL 350
Score = 54.7 bits (130), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 78/322 (24%), Positives = 124/322 (38%), Gaps = 71/322 (22%)
Query: 458 DAFTKPGLLEQINTTADQSDYLWYSL--STNIKADEPLLEDGSKTVLHVQSLGHALHAFI 515
D KP E I +Q D W ++ T + AD +++G TVL V ++
Sbjct: 386 DNLPKPQTSEAIQPM-EQFDQGWGTILYRTTLPAD---VKEG--TVLLVDEPHDWAQVYL 439
Query: 516 NGKLVGSGYGSSSNAKVTVDFPIALAPGKNTFDLLSLTVGLQNYG-AFYEKTGAGITGPV 574
NG+L+G + + P A + D+L +G N+ A +++ GIT V
Sbjct: 440 NGQLLGRL--DRRRGENILSLPDVKAGTR--LDILVEAMGRVNFDRAIHDR--KGITDKV 493
Query: 575 QLKGSGNGTNIDLSSQQWTYQTGLK-GEELNFPSGSSTQWDSKSTLPKLQPLVWYKTTFD 633
QL G Q +++ T K + F GS K +Y+TTF
Sbjct: 494 QLLNEGCEPQTLTGWQVYSFPTDAKFAADKQFAKGS-----------KFDGPAYYRTTFT 542
Query: 634 APAGSEPVAIDFTGMGKGEAWVNGQSIGRYWPTYVSQNGGCTDSCNYRGAYSSNKCLKNC 693
+ +D + GKG WVNG ++GR+W
Sbjct: 543 LDKTGD-TFLDMSTWGKGMVWVNGHAMGRFWKI--------------------------- 574
Query: 694 GKPSQSLYHVPRSWLKSSGNTLVLFEEIGGDPTKISFVTKQLGSSLCSHVTDSHPLPVDM 753
P Q+L+ +P WLK N +V+ + +G D TKI + + + L + +H
Sbjct: 575 -GPQQTLF-MPGCWLKKGKNEIVVLDLLGPDETKIEGLKQPILDVLHNEEPVTH------ 626
Query: 754 WGSDSKIQRKPGPVLSLECPNP 775
RK G L+L+ P
Sbjct: 627 --------RKEGETLNLKGETP 640
>gi|71275091|ref|ZP_00651378.1| Beta-galactosidase [Xylella fastidiosa Dixon]
gi|170731075|ref|YP_001776508.1| beta-galactosidase [Xylella fastidiosa M12]
gi|71163900|gb|EAO13615.1| Beta-galactosidase [Xylella fastidiosa Dixon]
gi|71730559|gb|EAO32637.1| Beta-galactosidase [Xylella fastidiosa Ann-1]
gi|167965868|gb|ACA12878.1| Beta-galactosidase [Xylella fastidiosa M12]
Length = 612
Score = 162 bits (409), Expect = 9e-37, Method: Compositional matrix adjust.
Identities = 113/316 (35%), Positives = 157/316 (49%), Gaps = 23/316 (7%)
Query: 38 GKRRVLISGSIHYPRSTPEMWPDLIQKSKDGGLDVIETYVFWNLHEPVRNQYNFEGRYDL 97
G+ LISG+IH+ R W D +QK++ GL+ +ETYVFWNL E Q++F G D+
Sbjct: 39 GRPYQLISGAIHFQRIPRAYWKDRLQKARAMGLNTVETYVFWNLVELREGQFDFTGNNDI 98
Query: 98 VKFVKLVAEAGLYAHLRIGPYVCAEWNFGGFPLWLHFIPGIQFRTDNEPFKAEMQRFTAK 157
FV+ A GL LR GPYVCAEW GGFP WL P ++ R+ + F QR+
Sbjct: 99 GAFVREAASQGLNVILRPGPYVCAEWEAGGFPAWLFADPTLRVRSQDPRFLDASQRYLEA 158
Query: 158 IVDMMKQEKLYASQGGPIILSQIENEYGNIDSAYG---AAGKSYIKWAAGMALSLDTGVP 214
+ ++ L S GGPII Q+ENEYG+ +G A +IK G AL L T
Sbjct: 159 LGTQVR--PLLNSNGGPIIAMQVENEYGSYGDDHGYLQAVRALFIKAGLGGAL-LFTSDG 215
Query: 215 WVMCQQSDAPDPI--INTCNGF---YCDQFTPNSNNKPKMWTENWSGWFLSFGGAVPYRP 269
M PD + +N G D+ +P++ E W+GWF +G
Sbjct: 216 AQMLGNGTLPDVLAAVNVAPGEAKQALDKLATFHPGQPQLVGEYWAGWFDQWGKPHAQTD 275
Query: 270 VEDLAFAVARFFQRGGTFQNYYMYHGGTNFDRTSGGPF----------ISTSYDYDAPLD 319
+ A + ++G + N YM+ GGT+F +G F +TSYDYDA LD
Sbjct: 276 AKQQADEIEWMLRQGHSI-NLYMFVGGTSFGFMNGANFQGGPGDHYSPQTTSYDYDAALD 334
Query: 320 EYGLIRQPKWGHLKDL 335
E G PK+ +D+
Sbjct: 335 EAGRP-MPKFALFRDV 349
Score = 44.3 bits (103), Expect = 0.30, Method: Compositional matrix adjust.
Identities = 53/236 (22%), Positives = 80/236 (33%), Gaps = 51/236 (21%)
Query: 499 KTVLHVQSLGHALHAFINGKLVGSGYGSSSNAKVTVDFPIALAPGKNTFDLLSLTVGLQN 558
K L++ + H +++ VG V VD P G + D+L G N
Sbjct: 419 KGRLYLGEVRDDAHVYVDRLFVGRAERRRQQVWVEVDIP----SGTHRLDVLVENSGRVN 474
Query: 559 YGAFYEKTGAGITGPVQLKGSGNGTNIDLSSQQWTYQTGLKGEELNFPSGSSTQWDSKST 618
YG AG+ GPV L N ++ W E P + +T
Sbjct: 475 YGPHLADGRAGLIGPVML----NHERVN----NW--------ETFLLPLQTPEAIHGWTT 518
Query: 619 LPKLQPLVWYKTTFDAPAGSEPVAIDFTGMGKGEAWVNGQSIGRYWPTYVSQNGGCTDSC 678
P P T F G +D KG W NG +GRYW
Sbjct: 519 APMQGPAFHRGTLFIRTPGD--TFLDMEAFSKGVTWANGHMLGRYW-------------- 562
Query: 679 NYRGAYSSNKCLKNCGKPSQSLYHVPRSWLKSSGNTLVLFEEIGGDPTKISFVTKQ 734
+ G P ++LY P +W + NT+++F+ ++ V +Q
Sbjct: 563 -------------DIG-PQRALY-FPGAWQRQGENTVLVFDVSDTAAAQVRGVQQQ 603
>gi|78048770|ref|YP_364945.1| beta-galactosidase [Xanthomonas campestris pv. vesicatoria str.
85-10]
gi|78037200|emb|CAJ24945.1| beta-galactosidase [Xanthomonas campestris pv. vesicatoria str.
85-10]
Length = 650
Score = 162 bits (409), Expect = 9e-37, Method: Compositional matrix adjust.
Identities = 130/382 (34%), Positives = 183/382 (47%), Gaps = 35/382 (9%)
Query: 35 VIGGKRRVLISGSIHYPRSTPEMWPDLIQKSKDGGLDVIETYVFWNLHEPVRNQYNFEGR 94
V GK L+SG+IH+ R W D +QK++ GL+ +ETYVFWNL EP + Q++F G
Sbjct: 77 VRDGKPYQLLSGAIHFQRIPRAYWKDRLQKARALGLNTVETYVFWNLVEPQQGQFDFSGN 136
Query: 95 YDLVKFVKLVAEAGLYAHLRIGPYVCAEWNFGGFPLWLHFIPGIQFRTDNEPFKAEMQRF 154
D+ FV+ A GL LR GPY CAEW GG+P WL I+ R+ + F A Q +
Sbjct: 137 NDVAAFVREAAAQGLNVILRPGPYACAEWEAGGYPAWLFGKGNIRVRSRDPRFLAASQSY 196
Query: 155 TAKIVDMMKQEKLYASQGGPIILSQIENEYGNI--DSAYGAAGKS-YIKWAAGMALSLDT 211
+ + + L GGPII Q+ENEYG+ D AY A ++ Y+K AL L T
Sbjct: 197 LDALAKQV--QPLLNHNGGPIIAVQVENEYGSYADDHAYMADNRAMYVKAGFDKAL-LFT 253
Query: 212 GVPWVMCQQSDAPD--PIINTCNG---FYCDQFTPNSNNKPKMWTENWSGWFLSFGGAVP 266
M PD ++N G D+ ++P+M E W+GWF +G P
Sbjct: 254 SDGADMLANGTLPDTLAVVNFAPGEAKSAFDKLIKFRPDQPRMVGEYWAGWFDHWGK--P 311
Query: 267 YRPVEDLAFAVARFFQ---RGGTFQNYYMYHGGTNFDRTSGGPF----------ISTSYD 313
+ + A A F+ R G N YM+ GGT+F +G F +TSYD
Sbjct: 312 HAATD--ARQQAEEFEWILRQGHSANLYMFIGGTSFGFMNGANFQNNPSDHYAPQTTSYD 369
Query: 314 YDAPLDEYGLIRQPKWGHLKD-LHKAIKLCEAALVATDPTYPSLGPNLEATVYKTGSGLC 372
YDA LDE G PK+ ++D + + + AL A P L AT + + L
Sbjct: 370 YDAILDEAGHP-TPKFALMRDAIARVTGVQPPALPA-----PIATATLPATPLRESASLW 423
Query: 373 SAFLANIGTNSDVTVKFNGNSY 394
A I ++ ++ G Y
Sbjct: 424 DNLPAPIAIDTPQPMEQFGQDY 445
>gi|393780989|ref|ZP_10369190.1| hypothetical protein HMPREF1071_00058 [Bacteroides salyersiae
CL02T12C01]
gi|392677324|gb|EIY70741.1| hypothetical protein HMPREF1071_00058 [Bacteroides salyersiae
CL02T12C01]
Length = 776
Score = 162 bits (409), Expect = 9e-37, Method: Compositional matrix adjust.
Identities = 120/380 (31%), Positives = 178/380 (46%), Gaps = 33/380 (8%)
Query: 1 MASKEILLLVLCWGFVVLATTSFGANVTYDHRAVVIGGKRRVLISGSIHYPRSTPEMWPD 60
M +K I LL+ C + + ++ G+ ++ + +HY R W
Sbjct: 1 MKNKIIYLLLFCTCLALPGQAQQFKTFEVGKKTFLLNGEPFIVKAAELHYTRIPQPYWEH 60
Query: 61 LIQKSKDGGLDVIETYVFWNLHEPVRNQYNFEGRYDLVKFVKLVAEAGLYAHLRIGPYVC 120
I+ K G++ I YVFWN+HE Q++F G+ D+ F +L + G+Y +R GPYVC
Sbjct: 61 RIKMCKALGMNTICLYVFWNIHEQEEGQFDFTGQNDIAAFCRLAQKHGMYVIVRPGPYVC 120
Query: 121 AEWNFGGFPLWLHFIPGIQFRTDNEPFKAEMQRFTAKIVDMMKQEKLYASQGGPIILSQI 180
AEW GG P WL I RT + + + F K+ + + L ++GG II+ Q+
Sbjct: 121 AEWEMGGLPWWLLKKKDIALRTLDPYYMERVGIFMKKVGEQLV--PLQITRGGNIIMVQV 178
Query: 181 ENEYGN--IDSAYGAAGKSYIKWAAGMALSLDTGVPWVMCQQS-----DAPDPIINTCN- 232
ENEYG+ D Y +A + ++ AG T VP C S +A D ++ T N
Sbjct: 179 ENEYGSYGTDKPYVSAIRDMVR-GAGF-----TEVPLFQCDWSSNFTNNALDDLLWTVNF 232
Query: 233 --GFYCD-QFTPNSNNKPK---MWTENWSGWFLSFGGAVPYRPVEDLAFAVARFFQRGGT 286
G D QF +P+ M +E WSGWF +G RP +D+ + R +
Sbjct: 233 GTGANIDQQFKKLKELRPETPLMCSEFWSGWFDHWGRKHETRPAKDMVQGLKDMLDRNIS 292
Query: 287 FQNYYMYHGGTNFDRTSGG-----PFISTSYDYDAPLDEYGLIRQPKWGHLKDLHKAIKL 341
F + YM HGGT F G + +SYDYDAP+ E G + K+ L+DL K
Sbjct: 293 F-SLYMTHGGTTFGHWGGANNPAYSAMCSSYDYDAPISEAGWTTE-KYFLLRDLLKGYLP 350
Query: 342 CEAALVATDPTYPSLGPNLE 361
+L PT P P +E
Sbjct: 351 TGQSL----PTIPEALPVME 366
Score = 48.1 bits (113), Expect = 0.021, Method: Compositional matrix adjust.
Identities = 62/278 (22%), Positives = 102/278 (36%), Gaps = 63/278 (22%)
Query: 500 TVLHVQSLGHALHAFINGKLVGSGYGSSSNAKVTVDFPIALAPGKNTFDLLSLTVGLQNY 559
TVL + + F+N L+ + TV P AL G D+L +G N+
Sbjct: 418 TVLEITEVHDWAQVFVNNTLLARL--DRRKGEFTVTLP-ALKKG-TQLDILVEAMGRVNF 473
Query: 560 GAFYEKTGAGITGPVQLKGSGNGTNIDLSSQQWTYQTGLKGEELNFPSGSSTQWDS--KS 617
GIT V L + I + Q + +++ S+ Q+ S K
Sbjct: 474 DKSIHDR-KGITESVVLAATDGNKQIVKNWQVYNL-------PVDYAFASNKQYVSGGKQ 525
Query: 618 TLPKLQPLVWYKTTFDAPAGSEPVAIDFTGMGKGEAWVNGQSIGRYWPTYVSQNGGCTDS 677
T+P +YK TF + ++ +D + GKG WVNG ++GR+W
Sbjct: 526 TMP-----AYYKATFKL-SKTDDTFLDMSTWGKGMVWVNGHAMGRFWEI----------- 568
Query: 678 CNYRGAYSSNKCLKNCGKPSQSLYHVPRSWLKSSGNTLVLFEEIGGDPTKISFVTKQLGS 737
P Q+L+ +P WLK N +++ + G + + + K +
Sbjct: 569 -----------------GPQQTLF-MPGCWLKKGVNEIIVLDLKGPEKAMVKGLKKPILD 610
Query: 738 SLCSHVTDSHPLPVDMWGSDSKIQRKPGPVLSLECPNP 775
L ++H RK G L+L P
Sbjct: 611 VLREKAPETH--------------RKEGEHLNLSAETP 634
>gi|320106923|ref|YP_004182513.1| glycoside hydrolase family protein [Terriglobus saanensis SP1PR4]
gi|319925444|gb|ADV82519.1| glycoside hydrolase family 35 [Terriglobus saanensis SP1PR4]
Length = 633
Score = 162 bits (409), Expect = 9e-37, Method: Compositional matrix adjust.
Identities = 111/317 (35%), Positives = 155/317 (48%), Gaps = 22/317 (6%)
Query: 36 IGGKRRVLISGSIHYPRSTPEMWPDLIQKSKDGGLDVIETYVFWNLHEPVRNQYNFEGRY 95
+ G+ L+SG +HY R E W +Q +K GL+ + TY+FWN+HEP Y+F G +
Sbjct: 51 LNGEPVQLLSGEMHYARIPREYWRARLQMAKAMGLNTVATYIFWNVHEPKPGVYDFSGNH 110
Query: 96 DLVKFVKLVAEAGLYAHLRIGPYVCAEWNFGGFPLWLHFIP--GIQFRTDNEPFKAEMQR 153
D+ FVK+ E GL LR GPY CAEW FGG+P WL P G R+++E + A ++R
Sbjct: 111 DVAAFVKMAQEEGLNVILRAGPYACAEWEFGGYPSWLMKDPKMGSALRSNDEVYMAPVER 170
Query: 154 FTAKIVDMMKQEKLYASQGGPIILSQIENEYGNI--DSAYGAAGKSYIKWAA---GMALS 208
+ ++ M L S GGPI+ Q+ENEYG+ D Y A + A +
Sbjct: 171 WIKRLGQEMV--PLLISNGGPIVAVQVENEYGDFGGDKKYLAHMLEIFQNAGFKDSFLYT 228
Query: 209 LDTGVPWVMCQQSDAPDPIINTCNGFYCDQFTPNSN---NKPKMWTENWSGWFLSFGGAV 265
+D V P +N G T ++ +P +E W GWF +G
Sbjct: 229 VDPSKALVNGSLEGLPSG-VNFGVGNAERGLTALAHLRPGQPLFASEYWPGWFDHWGHPH 287
Query: 266 PYRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNFDRTSGGPFIS-------TSYDYDAPL 318
RP+ +A + N YM+HGGT+F SG + TSYDYDAPL
Sbjct: 288 ETRPIPPQLKDIAYTLDHKSSI-NIYMFHGGTSFGFMSGASWTGGEYLPDVTSYDYDAPL 346
Query: 319 DEYGLIRQPKWGHLKDL 335
DE G PK+ +DL
Sbjct: 347 DEAGH-PTPKFYAYRDL 362
>gi|156375241|ref|XP_001629990.1| predicted protein [Nematostella vectensis]
gi|156217002|gb|EDO37927.1| predicted protein [Nematostella vectensis]
Length = 578
Score = 162 bits (409), Expect = 9e-37, Method: Compositional matrix adjust.
Identities = 108/297 (36%), Positives = 150/297 (50%), Gaps = 20/297 (6%)
Query: 55 PEMWPDLIQKSKDGGLDVIETYVFWNLHEPVRNQYNFEGRYDLVKFVKLVAEAGLYAHLR 114
PE W D ++K K GL+ +ETYV WNLHE V+ + F+ D+VKFV L E GL+ +R
Sbjct: 2 PEYWADRLKKLKAMGLNTVETYVAWNLHEQVKENFKFKDEVDIVKFVNLAQELGLHVIIR 61
Query: 115 IGPYVCAEWNFGGFPLWLHFIPGIQFRTDNEPFKAEMQRFTAKIVDMMKQEKLYASQGGP 174
GPY+C+EW+ GG P WL P ++ R+ PF ++++ +K+ ++ L S+GGP
Sbjct: 62 PGPYICSEWDLGGLPSWLLNDPNMRLRSTYGPFMEAVEKYFSKLFALLT--PLQFSRGGP 119
Query: 175 IILSQIENEYGN----IDSAYGA-AGKSYIKWAAGMALSLDTGVPWVMCQQSDAPDPIIN 229
II Q+ENEY + +D+ Y K +K A L V +
Sbjct: 120 IIAWQVENEYASVQEEVDNHYMELLHKLMLKNGATELLFTSDDVGYTKRYPIKLDGGKYM 179
Query: 230 TCNGFYCDQFTPNSNNKPKMWTENWSGWFLSFGGAVPYRPVEDLAFAVARFFQRGGTFQN 289
+ N ++C F +KP M TE WSGWF +G E + G N
Sbjct: 180 SFNKWFC-LFLHFQPDKPIMVTEYWSGWFDHWGEKHHVLNTERKMINEVKDILDMGASIN 238
Query: 290 YYMYHGGTNFDRTSG----GPFIS-------TSYDYDAPLDEYGLIRQPKWGHLKDL 335
+YM+HGGTNF +G G I TSYDYDAPL E G I PK+ L+ L
Sbjct: 239 FYMFHGGTNFGFMNGANTAGNRIDDGYQPDVTSYDYDAPLSEAGDI-TPKYKALRKL 294
>gi|251798103|ref|YP_003012834.1| beta-galactosidase [Paenibacillus sp. JDR-2]
gi|247545729|gb|ACT02748.1| Beta-galactosidase [Paenibacillus sp. JDR-2]
Length = 919
Score = 162 bits (409), Expect = 9e-37, Method: Compositional matrix adjust.
Identities = 105/335 (31%), Positives = 168/335 (50%), Gaps = 23/335 (6%)
Query: 27 VTYDHRAVVIGGKRRVLISGSIHYPRSTPEMWPDLIQKSKDGGLDVIETYVFWNLHEPVR 86
V Y+ + I G++ L S +IHY R E W +++ K+K G++ ++TY WN+HEP
Sbjct: 18 VQYNAFSYNINGEQVFLNSAAIHYFRMPKEEWREVLVKAKLAGMNCVDTYFAWNVHEPEE 77
Query: 87 NQYNFEGRYDLVKFVKLVAEAGLYAHLRIGPYVCAEWNFGGFPLWLHFIPGIQFRTDNEP 146
++NFEG D F+ L E GL+ R GP++CAEW+FGGFP WL+ ++FR +
Sbjct: 78 GEWNFEGDNDCGAFLDLCHELGLWVIARPGPFICAEWDFGGFPYWLNTKKDMKFRAFDMQ 137
Query: 147 FKAEMQRFTAKIVDMMKQEKLYASQGGPIILSQIENEYGNIDSAYGAAGKSYIKWAAGMA 206
+ + R+ +I+ +++ ++ A GG +IL Q+ENEYG + A + Y+ +
Sbjct: 138 YLTYVDRYMDRIIPIIRDREINA--GGSVILVQVENEYGYL--ASDEVARDYMLHLRDVM 193
Query: 207 LSLDTGVPWVMCQQSDAPDPIINTCN-----GFYCDQFTPNSNNKPKMWTENWSGWFLSF 261
L VP + C + + N + + + PK+ TE W+GWF +
Sbjct: 194 LDRGVMVPLITC--VGGAEGTVEGANFWSGADHHYNNLVQKQPDTPKIVTEFWTGWFEHW 251
Query: 262 GGAVPYRPVEDLAFAVARFFQR---GGTFQNYYM----YHGGTNFDRTSGGP--FISTSY 312
G P + A R + G T ++YM + G RT G F+ TSY
Sbjct: 252 GA--PAATQKTAALYEKRMLESLRAGFTGVSHYMFFGGTNFGGYGGRTVGASDIFMVTSY 309
Query: 313 DYDAPLDEYGLIRQPKWGHLKDLHKAIKLCEAALV 347
DYDAPL EYG + K+ K + ++ E+ L+
Sbjct: 310 DYDAPLSEYGRVTD-KYNTAKRMSYFVQATESVLL 343
Score = 53.5 bits (127), Expect = 5e-04, Method: Compositional matrix adjust.
Identities = 37/116 (31%), Positives = 47/116 (40%), Gaps = 34/116 (29%)
Query: 626 VWYKTTFDAPAGSEPV----AIDFTGMGKGEAWVNGQSIGRYWPTYVSQNGGCTDSCNYR 681
VW+ FD P V + TGM KG W+NG +GRYW Q G D
Sbjct: 826 VWHTVQFDKPELPADVNAKLKLRLTGMSKGTLWLNGIDLGRYW-----QVGPQED----- 875
Query: 682 GAYSSNKCLKNCGKPSQSLYHVPRSWLKSSGNTLVLFEEIGGDPTKISFVTKQLGS 737
Y +P +WLK N LVLF+E G P+K+ + Q S
Sbjct: 876 -------------------YKIPMAWLKDR-NELVLFDENGASPSKVRLLYDQASS 911
>gi|333384209|ref|ZP_08475850.1| hypothetical protein HMPREF9455_04016 [Dysgonomonas gadei ATCC
BAA-286]
gi|332826788|gb|EGJ99602.1| hypothetical protein HMPREF9455_04016 [Dysgonomonas gadei ATCC
BAA-286]
Length = 632
Score = 162 bits (409), Expect = 1e-36, Method: Compositional matrix adjust.
Identities = 112/328 (34%), Positives = 158/328 (48%), Gaps = 35/328 (10%)
Query: 35 VIGGKRRVLISGSIHYPRSTPEMWPDLIQKSKDGGLDVIETYVFWNLHEPVRNQYNFEGR 94
V GK +ISG +HYPR + W +Q K GL+ + TYVFWN HEP +++F
Sbjct: 38 VYDGKPVRIISGEMHYPRIPHQYWRHRMQMLKAMGLNAVATYVFWNAHEPEPGKWDFTED 97
Query: 95 YDLVKFVKLVAEAGLYAHLRIGPYVCAEWNFGGFPLWLHFIPGIQFRTDNEPFKAEMQRF 154
+L +++K+ E GL LR GPYVCAEW FGG+P WL + ++ R DNE F ++
Sbjct: 98 KNLAEYIKIAGEEGLMVILRPGPYVCAEWEFGGYPWWLQNVEEMELRRDNEQF----LKY 153
Query: 155 TAKIVDMMKQE--KLYASQGGPIILSQIENEYGNIDSA-----------YGAAGKSYIKW 201
T ++ + QE L ++GGPII+ Q ENE+G+ S Y A +K
Sbjct: 154 TQLYINRLYQEVGNLQITKGGPIIMVQAENEFGSYVSQRKDIPLEEHRRYNAKIVQQLKT 213
Query: 202 AAGMALSLDTGVPWVMCQQSDAPDPIINTCNG-FYCDQFTP-----NSNNKPKMWTENWS 255
A S + W+ + A + T NG D N P M E +
Sbjct: 214 AGFDIPSFTSDGSWLF--EGGAVPGALPTANGESNIDNLKKVVNRYNGGQGPYMVAEFYP 271
Query: 256 GWFLSFGGAVPYRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNFDRTSGGPFIS------ 309
GW + P +A ++ Q + NYYM HGGTNF TSG +
Sbjct: 272 GWLAHWVEPHPQVSATSVARQTEKYLQNDVSI-NYYMVHGGTNFGFTSGANYDKKHDIQP 330
Query: 310 --TSYDYDAPLDEYGLIRQPKWGHLKDL 335
TSYDYDAP+ E G + PK+ L+++
Sbjct: 331 DLTSYDYDAPVSEAGWV-TPKFDSLRNV 357
Score = 48.1 bits (113), Expect = 0.020, Method: Compositional matrix adjust.
Identities = 53/205 (25%), Positives = 88/205 (42%), Gaps = 41/205 (20%)
Query: 531 KVTVDFPIALAPGKNTFDLLSLTVGLQNYGAFYEKTGAGITGPVQLKGSGNGTNIDLSSQ 590
K T+D + P +T ++L +G NYG+ GI PV++ N I+ Q
Sbjct: 456 KYTMDIDV---PFNSTLEILVENMGRINYGSEIIHNTKGIISPVRI----NDMEIEGGWQ 508
Query: 591 QWTYQTGLKGEELNFPSGSSTQWDSKSTLPKL--QPLVWYKTTFDAPAGSEPVAIDFTGM 648
+ K + + +S +++S + L +P++ YK TF+ + I+
Sbjct: 509 MISIPMD-KAPDFSKMDQASVYDNNESAIKSLAGKPVL-YKGTFNLTETGD-TFINMEDW 565
Query: 649 GKGEAWVNGQSIGRYWPTYVSQNGGCTDSCNYRGAYSSNKCLKNCGKPSQSLYHVPRSWL 708
GKG ++NG++IGRYW YV P Q+LY +P WL
Sbjct: 566 GKGIIFINGKNIGRYW--YVG--------------------------PQQTLY-IPGVWL 596
Query: 709 KSSGNTLVLFEEIGGDPTKISFVTK 733
K N +++FE++ P TK
Sbjct: 597 KKGENKIIIFEQLNDKPHTEVRTTK 621
>gi|143955283|sp|A2RSQ1.1|GLBL3_MOUSE RecName: Full=Beta-galactosidase-1-like protein 3
gi|124297651|gb|AAI32201.1| Glb1l3 protein [Mus musculus]
gi|124297899|gb|AAI32203.1| Glb1l3 protein [Mus musculus]
Length = 649
Score = 162 bits (409), Expect = 1e-36, Method: Compositional matrix adjust.
Identities = 111/328 (33%), Positives = 161/328 (49%), Gaps = 36/328 (10%)
Query: 15 FVVLATTSFGANVTYDHRAVVIGGKRRVLISGSIHYPRSTPEMWPDLIQKSKDGGLDVIE 74
FV L+T + Y + G + +++ GSIHY R E W D + K + G + +
Sbjct: 41 FVGLSTKTNALGKAY----FTLEGHKFMIVGGSIHYFRVPREYWKDRLLKLQACGFNTVT 96
Query: 75 TYVFWNLHEPVRNQYNFEGRYDLVKFVKLVAEAGLYAHLRIGPYVCAEWNFGGFPLWLHF 134
TY+ WNLHE R +++F DL +V L GL+ LR GPY+CAE + GG P WL
Sbjct: 97 TYIPWNLHEQERGKFDFSEILDLEAYVLLAKTIGLWVILRPGPYICAEVDLGGLPSWLLR 156
Query: 135 IPGIQFRTDNEPFKAEMQRFTAKIVDMMKQEKLYASQGGPIILSQIENEYGNI--DSAYG 192
P RT N+ F + ++ ++ K L GGP+I Q+ENEYG+ D Y
Sbjct: 157 NPVTDLRTTNKGFIEAVDKYFDHLIP--KILPLQYRHGGPVIAVQVENEYGSFQKDRNY- 213
Query: 193 AAGKSYIKWAAGMALSLDTGVPWVMCQQSDAPDPIINTCNG----FYCDQFTPNS----- 243
+Y+K A L G+ ++ D I + NG + FT +S
Sbjct: 214 ---MNYLKKAL-----LKRGIVELLLTSDDKDGIQIGSVNGALTTINMNSFTKDSFIKLH 265
Query: 244 ---NNKPKMWTENWSGWFLSFGGAVPYRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNFD 300
++KP M E W+GW+ S+G + E++ V +F G +F N YM+HGGTNF
Sbjct: 266 KMQSDKPIMIMEYWTGWYDSWGSKHIEKSAEEIRHTVYKFISYGLSF-NMYMFHGGTNFG 324
Query: 301 RTSGGPF------ISTSYDYDAPLDEYG 322
+GG + + TSYDYDA L E G
Sbjct: 325 FINGGRYENHHISVVTSYDYDAVLSEAG 352
>gi|218260271|ref|ZP_03475643.1| hypothetical protein PRABACTJOHN_01305, partial [Parabacteroides
johnsonii DSM 18315]
gi|218224641|gb|EEC97291.1| hypothetical protein PRABACTJOHN_01305 [Parabacteroides johnsonii
DSM 18315]
Length = 539
Score = 162 bits (409), Expect = 1e-36, Method: Compositional matrix adjust.
Identities = 112/324 (34%), Positives = 159/324 (49%), Gaps = 29/324 (8%)
Query: 31 HRAVVIGGKRRVLISGSIHYPRSTPEMWPDLIQKSKDGGLDVIETYVFWNLHEPVRNQYN 90
++ ++ GK V+ + IHY R E W IQ K G++ I Y FWN+HE +++
Sbjct: 36 NKTFLLDGKPFVIKAAEIHYTRIPAEYWEHRIQLCKALGMNTICIYAFWNIHEQKPGEFD 95
Query: 91 FEGRYDLVKFVKLVAEAGLYAHLRIGPYVCAEWNFGGFPLWLHFIPGIQFRTDNEPFKAE 150
F G+ D+ F +L + +Y LR GPYVC+EW GG P WL I+ RT++ F
Sbjct: 96 FSGQNDIAAFCRLAQKYDMYIMLRPGPYVCSEWEMGGLPWWLLKKDDIKLRTNDPYFLER 155
Query: 151 MQRFTAKIVDMMKQEKLYASQGGPIILSQIENEYGN--IDSAYGAAGKSYIKWAAGMALS 208
+ F +I + L ++GG II+ Q+ENEYG+ D Y A + +K AG
Sbjct: 156 TKLFMNEIGKQLAD--LQITKGGNIIMVQVENEYGSYATDKEYIANIRDIVK-GAGF--- 209
Query: 209 LDTGVPWVMCQ-----QSDAPDPIINTCN---GFYCD-QFTPNSN---NKPKMWTENWSG 256
T VP C Q++A D ++ T N G D QF N P M +E WSG
Sbjct: 210 --TDVPLFQCDWSSNFQNNALDDLVWTINFGTGANIDEQFKKLKEVRPNTPLMCSEFWSG 267
Query: 257 WFLSFGGAVPYRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNFDRTSGG-----PFISTS 311
WF +G R E + + RG +F + YM HGGT F G + +S
Sbjct: 268 WFDHWGRKHETRDAETMVSGLKDMLDRGISF-SLYMTHGGTTFGHWGGANSPAYSAMCSS 326
Query: 312 YDYDAPLDEYGLIRQPKWGHLKDL 335
YDYDAP+ E G PK+ L++L
Sbjct: 327 YDYDAPISEAGWT-TPKYFKLREL 349
>gi|355690250|gb|AER99094.1| galactosidase, beta 1 [Mustela putorius furo]
Length = 648
Score = 162 bits (409), Expect = 1e-36, Method: Compositional matrix adjust.
Identities = 115/349 (32%), Positives = 165/349 (47%), Gaps = 30/349 (8%)
Query: 26 NVTYDHRAVVIGGKRRVLISGSIHYPRSTPEMWPDLIQKSKDGGLDVIETYVFWNLHEPV 85
+ Y H + G+ ISGSIHY R W D + K K GL+ I+TYV WN HEP
Sbjct: 22 KIDYHHNRFLKDGQPFRYISGSIHYSRVPRFYWKDRLLKMKMAGLNAIQTYVPWNFHEPQ 81
Query: 86 RNQYNFEGRYDLVKFVKLVAEAGLYAHLRIGPYVCAEWNFGGFPLWLHFIPGIQFRTDNE 145
QY F G D+ F+KL E GL LR GPY+CAEW+ GG P WL I R+ +
Sbjct: 82 PGQYKFSGEQDVEYFIKLAHELGLLVILRPGPYICAEWDMGGLPAWLLLKESIILRSSDP 141
Query: 146 PFKAEMQRFTAKIVDMMKQEKLYASQGGPIILSQIENEYGNIDSAYGAAGKSYIKWAAGM 205
+ A + ++ ++ MK L GGPII Q+ENEYG +Y Y+++ +
Sbjct: 142 DYLAAVDKWLGVLLPRMK--PLLYQNGGPIITVQVENEYG----SYFTCDYDYLRFLQKL 195
Query: 206 ALSLDTGVPWVMCQQSDAPDPIIN--TCNGFYCD-QFTPNSN-------------NKPKM 249
G ++ A +P + G Y F P +N P +
Sbjct: 196 -FHYHLGKDVLLFTTDGALEPFLQCGALQGLYATVDFGPGANITAAFEVQRKSEPKGPLV 254
Query: 250 WTENWSGWFLSFGGAVPYRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNFDRTSGG--PF 307
+E ++GW +G E +A ++ RG N YM+ GGTNF +G P+
Sbjct: 255 NSEFYTGWLDHWGQPHSTVKTEVVASSLHDILARGANV-NLYMFIGGTNFAYWNGANMPY 313
Query: 308 IS--TSYDYDAPLDEYGLIRQPKWGHLKD-LHKAIKLCEAALVATDPTY 353
+ TSYDYDAPL E G + + K+ L+D + K K+ E + + P +
Sbjct: 314 KAQPTSYDYDAPLSEAGDLTE-KYFALRDVIRKFEKVPEGVIPPSTPKF 361
>gi|189096261|pdb|3D3A|A Chain A, Crystal Structure Of A Beta-Galactosidase From Bacteroides
Thetaiotaomicron
Length = 612
Score = 162 bits (409), Expect = 1e-36, Method: Compositional matrix adjust.
Identities = 112/320 (35%), Positives = 157/320 (49%), Gaps = 29/320 (9%)
Query: 35 VIGGKRRVLISGSIHYPRSTPEMWPDLIQKSKDGGLDVIETYVFWNLHEPVRNQYNFEGR 94
++ G+ V+ + IHYPR E W I+ K G + I YVFWN HEP +Y+F G+
Sbjct: 16 LLNGEPFVVKAAEIHYPRIPKEYWEHRIKXCKALGXNTICLYVFWNFHEPEEGRYDFAGQ 75
Query: 95 YDLVKFVKLVAEAGLYAHLRIGPYVCAEWNFGGFPLWLHFIPGIQFRTDNEPFKAEMQRF 154
D+ F +L E G Y +R GPYVCAEW GG P WL I+ R + + ++ F
Sbjct: 76 KDIAAFCRLAQENGXYVIVRPGPYVCAEWEXGGLPWWLLKKKDIKLREQDPYYXERVKLF 135
Query: 155 TAKIVDMMKQEKLYASQGGPIILSQIENEYG--NIDSAYGAAGKSYIKWAAGMALSLDTG 212
++ + L S+GG II Q+ENEYG ID Y + + +K AG TG
Sbjct: 136 LNEVGKQLAD--LQISKGGNIIXVQVENEYGAFGIDKPYISEIRDXVK-QAGF-----TG 187
Query: 213 VPWVMCQ-----QSDAPDPIINTCN---GFYCD-QFTPNSNNKPKM---WTENWSGWFLS 260
VP C +++A D ++ T N G D QF +P +E WSGWF
Sbjct: 188 VPLFQCDWNSNFENNALDDLLWTINFGTGANIDEQFKRLKELRPDTPLXCSEFWSGWFDH 247
Query: 261 FGGAVPYRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNFDRTSGGPF-----ISTSYDYD 315
+G R E+L R +F + Y HGGT+F G F TSYDYD
Sbjct: 248 WGAKHETRSAEELVKGXKEXLDRNISF-SLYXTHGGTSFGHWGGANFPNFSPTCTSYDYD 306
Query: 316 APLDEYGLIRQPKWGHLKDL 335
AP++E G + PK+ +++L
Sbjct: 307 APINESGKV-TPKYLEVRNL 325
>gi|164519028|ref|NP_001106794.1| beta-galactosidase-1-like protein 3 precursor [Mus musculus]
Length = 662
Score = 162 bits (409), Expect = 1e-36, Method: Compositional matrix adjust.
Identities = 111/328 (33%), Positives = 161/328 (49%), Gaps = 36/328 (10%)
Query: 15 FVVLATTSFGANVTYDHRAVVIGGKRRVLISGSIHYPRSTPEMWPDLIQKSKDGGLDVIE 74
FV L+T + Y + G + +++ GSIHY R E W D + K + G + +
Sbjct: 54 FVGLSTKTNALGKAY----FTLEGHKFMIVGGSIHYFRVPREYWKDRLLKLQACGFNTVT 109
Query: 75 TYVFWNLHEPVRNQYNFEGRYDLVKFVKLVAEAGLYAHLRIGPYVCAEWNFGGFPLWLHF 134
TY+ WNLHE R +++F DL +V L GL+ LR GPY+CAE + GG P WL
Sbjct: 110 TYIPWNLHEQERGKFDFSEILDLEAYVLLAKTIGLWVILRPGPYICAEVDLGGLPSWLLR 169
Query: 135 IPGIQFRTDNEPFKAEMQRFTAKIVDMMKQEKLYASQGGPIILSQIENEYGNI--DSAYG 192
P RT N+ F + ++ ++ K L GGP+I Q+ENEYG+ D Y
Sbjct: 170 NPVTDLRTTNKGFIEAVDKYFDHLIP--KILPLQYRHGGPVIAVQVENEYGSFQKDRNY- 226
Query: 193 AAGKSYIKWAAGMALSLDTGVPWVMCQQSDAPDPIINTCNG----FYCDQFTPNS----- 243
+Y+K A L G+ ++ D I + NG + FT +S
Sbjct: 227 ---MNYLKKAL-----LKRGIVELLLTSDDKDGIQIGSVNGALTTINMNSFTKDSFIKLH 278
Query: 244 ---NNKPKMWTENWSGWFLSFGGAVPYRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNFD 300
++KP M E W+GW+ S+G + E++ V +F G +F N YM+HGGTNF
Sbjct: 279 KMQSDKPIMIMEYWTGWYDSWGSKHIEKSAEEIRHTVYKFISYGLSF-NMYMFHGGTNFG 337
Query: 301 RTSGGPF------ISTSYDYDAPLDEYG 322
+GG + + TSYDYDA L E G
Sbjct: 338 FINGGRYENHHISVVTSYDYDAVLSEAG 365
>gi|227552575|ref|ZP_03982624.1| possible beta-galactosidase [Enterococcus faecium TX1330]
gi|257896912|ref|ZP_05676565.1| glycosyl hydrolase [Enterococcus faecium Com12]
gi|293379016|ref|ZP_06625170.1| glycosyl hydrolase family 35 [Enterococcus faecium PC4.1]
gi|431750982|ref|ZP_19539676.1| beta-galactosidase [Enterococcus faecium E2620]
gi|227178324|gb|EEI59296.1| possible beta-galactosidase [Enterococcus faecium TX1330]
gi|257833477|gb|EEV59898.1| glycosyl hydrolase [Enterococcus faecium Com12]
gi|292642358|gb|EFF60514.1| glycosyl hydrolase family 35 [Enterococcus faecium PC4.1]
gi|430616240|gb|ELB53164.1| beta-galactosidase [Enterococcus faecium E2620]
Length = 595
Score = 162 bits (409), Expect = 1e-36, Method: Compositional matrix adjust.
Identities = 129/396 (32%), Positives = 186/396 (46%), Gaps = 52/396 (13%)
Query: 35 VIGGKRRVLISGSIHYPRSTPEMWPDLIQKSKDGGLDVIETYVFWNLHEPVRNQYNFEGR 94
++ G +ISG+IHY R P W + K G + +ETY+ WNLHEP ++F G
Sbjct: 11 LVDGIPTKIISGAIHYFRIPPSQWEHSLYNLKALGANTVETYIPWNLHEPQEGSFDFSGF 70
Query: 95 YDLVKFVKLVAEAGLYAHLRIGPYVCAEWNFGGFPLWLHFIPGIQFRTDNEPFKAEMQRF 154
++V+FVK+ E L LR Y+CAEW FGG P WL P I+ R+ + F +++ +
Sbjct: 71 KNVVRFVKIAQELDLMVILRPCAYICAEWEFGGLPAWLLKEPNIRVRSTDPRFMEKLKNY 130
Query: 155 TAKIVDMMKQEKLYASQGGPIILSQIENEYGNIDSAYGAAGKSYIKWAAGMALSLDTGVP 214
V + K L +QGGP+I+ Q+ENEYG +YG KSY++ + L+ VP
Sbjct: 131 YQ--VLLPKLAPLQITQGGPVIMMQLENEYG----SYGME-KSYLRQTKELMLAHSIDVP 183
Query: 215 -------WV-MCQQSDAPDPIINTCNGF---------YCDQFTPN-SNNKPKMWTENWSG 256
W+ + D I F +F N N P M E W G
Sbjct: 184 LFTSDGAWLEVLDAGTLIDEDIFVTGNFGSHSKENAQVLKEFMQNHQKNWPIMCMEYWDG 243
Query: 257 WFLSFGGAVPYRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNFDRTSGG--------PFI 308
WF +G + R E+LA V + G N YM+HGGTNF +G P I
Sbjct: 244 WFNRWGEPIITRDPEELATEVKEMLEIGSL--NLYMFHGGTNFGFYNGCSARGNTDLPQI 301
Query: 309 STSYDYDAPLDEYGLIRQPKWGHLKDLHKAIKLCEAALVATDP---TYPSLGP---NLEA 362
TSYDYDA L+E G QP + + + IK ++ +P T +LG N
Sbjct: 302 -TSYDYDALLNEAG---QPTEKYYA-VQRVIKEVCPSVWQAEPRTKTLKNLGTYPVNRSV 356
Query: 363 TVYKTGSGLCSAFLANIGTNSDVTVK--FNGNSYLL 396
+++ +C I T+ +T++ NG YLL
Sbjct: 357 SLFHIKEQICE----EIKTDYPLTMEQASNGYGYLL 388
Score = 45.4 bits (106), Expect = 0.11, Method: Compositional matrix adjust.
Identities = 73/280 (26%), Positives = 103/280 (36%), Gaps = 61/280 (21%)
Query: 451 PVGISKDDAFTKPGLLEQINT----TADQSD----YLWYSLSTNIKADEPLLEDGSKTVL 502
PV S K + E+I T T +Q+ YL YSL+ L G K L
Sbjct: 351 PVNRSVSLFHIKEQICEEIKTDYPLTMEQASNGYGYLLYSLT--------LKNYGHKNKL 402
Query: 503 HVQSLGHALHAFINGKLVGSGYGSSSNAKVTVDFPIALAPGKNTFDLLSLTVGLQNYGAF 562
+ +I+GK Y + + D + G+ ++L V ++N G
Sbjct: 403 RLIETNDRAQIYIDGK-----YDQTQTQETLGD--EMMIEGQKNQPTIALDVLVENLGRV 455
Query: 563 YEKTGAGITGPVQLKGSGNGTNIDLSSQQWTYQTGLKGEELNFPSGSSTQWDSKSTLPKL 622
GA + P Q KG NG D+ + G + L F + D +
Sbjct: 456 --NYGAKLNSPSQSKGIRNGVMQDIH-----FHLGYRHYPLTFEQAQLDKIDYSAGKDPS 508
Query: 623 QPLVWYKTTFDAPAGSEPVAIDFTGMGKGEAWVNGQSIGRYWPTYVSQNGGCTDSCNYRG 682
QP +Y+ FD A ID + GKG VNG ++GRYW N G
Sbjct: 509 QP-SFYQFEFDL-AEEADAYIDCSLYGKGIVIVNGFNLGRYW------NHG--------- 551
Query: 683 AYSSNKCLKNCGKPSQSLYHVPRSWLKSSGNTLVLFEEIG 722
P SLY P+ LK N +V+FE G
Sbjct: 552 -------------PVLSLY-CPKDVLKKGRNEVVIFETEG 577
>gi|225872227|ref|YP_002753682.1| glycosyl hydrolase [Acidobacterium capsulatum ATCC 51196]
gi|225791474|gb|ACO31564.1| glycosyl hydrolase, family 35 [Acidobacterium capsulatum ATCC
51196]
Length = 664
Score = 162 bits (409), Expect = 1e-36, Method: Compositional matrix adjust.
Identities = 108/304 (35%), Positives = 147/304 (48%), Gaps = 19/304 (6%)
Query: 35 VIGGKRRVLISGSIHYPRSTPEMWPDLIQKSKDGGLDVIETYVFWNLHEPVRNQYNFEGR 94
V+ G+ +ISG +HY R W +Q +K GL+ I TYVFWNLHEP +++F G
Sbjct: 38 VLDGQPFQIISGEMHYERIPRAYWKARLQMAKAMGLNTIATYVFWNLHEPEPGKFDFSGN 97
Query: 95 YDLVKFVKLVAEAGLYAHLRIGPYVCAEWNFGGFPLWLHFIPGIQ--FRTDNEPFKAEMQ 152
DL +F++ + GL LR GPY CAEW FGGFP WL P +Q R+++ F +
Sbjct: 98 ADLAQFIRDAQQTGLKVLLRAGPYSCAEWEFGGFPAWLMKNPKMQTALRSNDPEFMKPAE 157
Query: 153 RFTAKIVDMMKQEKLYASQGGPIILSQIENEYGNI--DSAYGAAGKSYIKWAAGMALSLD 210
++ ++ + L GGPII QIENEYG+ D+AY K A L
Sbjct: 158 QWILRLGR--EVAPLQVGYGGPIIGVQIENEYGDFGGDAAYLEHLKKIFLKAGFTQSLLY 215
Query: 211 TGVPWVMCQQSDAPD--PIINTCNGFYC---DQFTPNSNNKPKMWTENWSGWFLSFGGAV 265
T P + P +N G D +P + +E W+GWF +G
Sbjct: 216 TANPSRALVRGSIPGVYSAVNFAPGHAAQALDSLAQLRAGQPLLSSEYWTGWFDHWGEPH 275
Query: 266 PYRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNFDRTSGGPFIS-------TSYDYDAPL 318
+P+ L + R G N YM+HGGT+F SG + TSYDY APL
Sbjct: 276 QSKPL-SLQVKDFNYILRHGAGVNLYMFHGGTSFGMMSGSSWTKHQFLPDVTSYDYGAPL 334
Query: 319 DEYG 322
DE G
Sbjct: 335 DEAG 338
Score = 52.4 bits (124), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 54/214 (25%), Positives = 91/214 (42%), Gaps = 36/214 (16%)
Query: 475 QSDYLWYSLSTNI--KADEPL--LEDGSKTVLHVQSLGHALH-------------AFING 517
++ LW L + K EP+ L +L+ ++L HA+ ++NG
Sbjct: 378 EASSLWRGLPKPVVTKNPEPMEWLGQSYGFILYRKTLHHAVDGDLVLNGMNDYALVYLNG 437
Query: 518 KLVGSGYGSSSNAKVTVDFPIALAPGKNTFDLLSLTVGLQNYGAFYEKTGAGITGPVQLK 577
KL G+ + +++ + + A K D+L G N G+ GPV L
Sbjct: 438 KLQGTLNRTCNDSTLMLHSNSA----KTRLDILVENSGRINSTRMMLHANKGLMGPVMLA 493
Query: 578 GSGNGTNIDLSSQQW-TYQTGLKGEELNFPSG--SSTQWDSKSTLPK-LQPLVWYKTTFD 633
G + W TY+ +K + + P G T ++ KST + + +Y+ TF
Sbjct: 494 GR--------ALHGWKTYRLPMKPDTIADPLGMPQETHFNEKSTPAQAMSGPAFYRGTFR 545
Query: 634 APAGSEPVA---IDFTGMGKGEAWVNGQSIGRYW 664
S+ + +D G+GKG W++G IGRYW
Sbjct: 546 VETKSKQIPDTFLDIRGLGKGAVWIDGHPIGRYW 579
>gi|397498763|ref|XP_003820147.1| PREDICTED: beta-galactosidase-1-like protein 2 [Pan paniscus]
Length = 720
Score = 161 bits (408), Expect = 1e-36, Method: Compositional matrix adjust.
Identities = 113/310 (36%), Positives = 155/310 (50%), Gaps = 17/310 (5%)
Query: 43 LISGSIHYPRSTPEMWPDLIQKSKDGGLDVIETYVFWNLHEPVRNQYNFEGRYDLVKFVK 102
+ GSIHY R E W D + K K GL+ + TYV WNLHEP R++++F G DL FV
Sbjct: 147 IFGGSIHYFRVPREYWRDRLLKMKACGLNTLTTYVPWNLHEPERSKFDFSGNLDLEAFVL 206
Query: 103 LVAEAGLYAHLRIGPYVCAEWNFGGFPLWLHFIPGIQFRTDNEPFKAEMQRFTAKIVDMM 162
+ AE GL+ LR GPY+C+E + GG P WL PG++ RT + F + + + M
Sbjct: 207 MAAEIGLWVILRPGPYICSEMDLGGLPSWLLQDPGMRLRTTYKGFTEAVDLYFDHL--MS 264
Query: 163 KQEKLYASQGGPIILSQIENEYG--NIDSAYGAAGKSYIKWAAGMALSLDT----GVPWV 216
+ L +GGPII Q+ENEYG N D AY K ++ + L L + G+
Sbjct: 265 RVVPLQYKRGGPIIAVQVENEYGSYNKDPAYMPYVKKALEDRGIVELLLTSDNKDGLSKG 324
Query: 217 MCQQSDAPDPIINTCNGFYCDQFTPN-SNNKPKMWTENWSGWFLSFGGAVPYRPVEDLAF 275
+ Q A + +T F N +PKM E W+GWF S+GG ++
Sbjct: 325 IVQGVLATINLQSTHELQLLTTFLFNVQGTQPKMVMEYWTGWFDSWGGPHNILDSSEVLK 384
Query: 276 AVARFFQRGGTFQNYYMYHGGTNFDRTSGGPFIS------TSYDYDAPLDEYGLIRQPKW 329
V+ G + N YM+HGGTNF +G TSYDYDA L E G K+
Sbjct: 385 TVSAIVDAGSSI-NLYMFHGGTNFGFMNGAMHFHDYKSDVTSYDYDAVLTEAG-DYTAKY 442
Query: 330 GHLKDLHKAI 339
L+D +I
Sbjct: 443 MKLRDFFGSI 452
>gi|424764212|ref|ZP_18191655.1| putative beta-galactosidase [Enterococcus faecium TX1337RF]
gi|402420907|gb|EJV53177.1| putative beta-galactosidase [Enterococcus faecium TX1337RF]
Length = 595
Score = 161 bits (408), Expect = 1e-36, Method: Compositional matrix adjust.
Identities = 129/396 (32%), Positives = 186/396 (46%), Gaps = 52/396 (13%)
Query: 35 VIGGKRRVLISGSIHYPRSTPEMWPDLIQKSKDGGLDVIETYVFWNLHEPVRNQYNFEGR 94
++ G +ISG+IHY R P W + K G + +ETY+ WNLHEP ++F G
Sbjct: 11 LVDGIPTKIISGAIHYFRIPPSQWEHSLYNLKALGANTVETYIPWNLHEPQEGSFDFSGF 70
Query: 95 YDLVKFVKLVAEAGLYAHLRIGPYVCAEWNFGGFPLWLHFIPGIQFRTDNEPFKAEMQRF 154
++V+FVK+ E L LR Y+CAEW FGG P WL P I+ R+ + F +++ +
Sbjct: 71 KNVVRFVKIAQELDLMVILRPCAYICAEWEFGGLPAWLLKEPNIRVRSTDPRFMEKLKNY 130
Query: 155 TAKIVDMMKQEKLYASQGGPIILSQIENEYGNIDSAYGAAGKSYIKWAAGMALSLDTGVP 214
V + K L +QGGP+I+ Q+ENEYG +YG KSY++ + L+ VP
Sbjct: 131 YQ--VLLPKLAPLQITQGGPVIMMQLENEYG----SYGME-KSYLRQTKELMLAHSIDVP 183
Query: 215 -------WV-MCQQSDAPDPIINTCNGF---------YCDQFTPN-SNNKPKMWTENWSG 256
W+ + D I F +F N N P M E W G
Sbjct: 184 LFTSDGAWLEVLDAGTLIDEDIFVTGNFGSHSKENAQVLKEFMQNHQKNWPIMCMEYWDG 243
Query: 257 WFLSFGGAVPYRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNFDRTSGG--------PFI 308
WF +G + R E+LA V + G N YM+HGGTNF +G P I
Sbjct: 244 WFNRWGEPIITRDPEELATEVKEMLEIGSL--NLYMFHGGTNFGFYNGCSARGNTDLPQI 301
Query: 309 STSYDYDAPLDEYGLIRQPKWGHLKDLHKAIKLCEAALVATDP---TYPSLGP---NLEA 362
TSYDYDA L+E G QP + + + IK ++ +P T +LG N
Sbjct: 302 -TSYDYDALLNEAG---QPTEKYYA-VQRVIKEVCPSVWQAEPRTKTLKNLGTYPVNRSV 356
Query: 363 TVYKTGSGLCSAFLANIGTNSDVTVK--FNGNSYLL 396
+++ +C I T+ +T++ NG YLL
Sbjct: 357 SLFHIKEQICE----EIKTDYPLTMEQASNGYGYLL 388
Score = 45.8 bits (107), Expect = 0.10, Method: Compositional matrix adjust.
Identities = 74/284 (26%), Positives = 103/284 (36%), Gaps = 69/284 (24%)
Query: 451 PVGISKDDAFTKPGLLEQINT----TADQSD----YLWYSLSTNIKADEPLLEDGSKTVL 502
PV S K + E+I T T +Q+ YL YSL+ L G K L
Sbjct: 351 PVNRSVSLFHIKEQICEEIKTDYPLTMEQASNGYGYLLYSLT--------LKNYGHKNKL 402
Query: 503 HVQSLGHALHAFINGKLVGSGYGSSSNAKVTVDFPIALAPGKN----TFDLLSLTVGLQN 558
+ +I+GK + + T+ + + KN D+L +G N
Sbjct: 403 RLIETNDRAQIYIDGKY------EQTQTQETLGDEMMIEGQKNQPTIALDILVENLGRVN 456
Query: 559 YGAFYEKTGAGITGPVQLKGSGNGTNIDLSSQQWTYQTGLKGEELNFPSGSSTQWDSKST 618
YGA + P Q KG NG D+ + G + L F + D +
Sbjct: 457 YGA-------KLNSPSQSKGIRNGVMQDIH-----FHLGYRHYPLTFEQAQLDKIDYSAG 504
Query: 619 LPKLQPLVWYKTTFDAPAGSEPVAIDFTGMGKGEAWVNGQSIGRYWPTYVSQNGGCTDSC 678
QP +Y+ FD A ID + GKG VNG ++GRYW N G
Sbjct: 505 KDPSQP-SFYQFEFDL-AEEADAYIDCSLYGKGIVIVNGFNLGRYW------NHG----- 551
Query: 679 NYRGAYSSNKCLKNCGKPSQSLYHVPRSWLKSSGNTLVLFEEIG 722
P SLY P+ LK N +V+FE G
Sbjct: 552 -----------------PVLSLY-CPKDVLKKGRNEVVIFETEG 577
>gi|365118603|ref|ZP_09337115.1| hypothetical protein HMPREF1033_00461 [Tannerella sp.
6_1_58FAA_CT1]
gi|363649320|gb|EHL88436.1| hypothetical protein HMPREF1033_00461 [Tannerella sp.
6_1_58FAA_CT1]
Length = 823
Score = 161 bits (408), Expect = 1e-36, Method: Compositional matrix adjust.
Identities = 110/323 (34%), Positives = 154/323 (47%), Gaps = 31/323 (9%)
Query: 35 VIGGKRRVLISGSIHYPRSTPEMWPDLIQKSKDGGLDVIETYVFWNLHEPVRNQYNFEGR 94
++ GK ++ + +HYPR W I+ K G++ I YVFWNLHEP +++F G+
Sbjct: 76 LLNGKPFIIRAAELHYPRIPKPYWEQRIKLCKALGMNTICLYVFWNLHEPRPGEFDFTGQ 135
Query: 95 YDLVKFVKLVAEAGLYAHLRIGPYVCAEWNFGGFPLWLHFIPGIQFRTDNEPFKAEMQRF 154
DL F +L + +Y LR GPYVCAEW GG P WL I+ R + F + F
Sbjct: 136 NDLAAFCRLCQQNDMYVILRPGPYVCAEWEMGGLPWWLLKKKDIRLREADPYFIERVNIF 195
Query: 155 TAKIVDMMKQEKLYASQGGPIILSQIENEYGNIDSAYGAAGKSYIKWAAGMALSLDTGVP 214
++ + L GGPII+ Q+ENEYG +YG + K Y+ + + V
Sbjct: 196 EQEVARQVG--GLTIQNGGPIIMVQVENEYG----SYGES-KEYVSLIRDIVRTNFGDVT 248
Query: 215 WVMCQ------QSDAPDPI--INTCNGFYCDQ-------FTPNSNNKPKMWTENWSGWFL 259
C ++ PD + IN G DQ P+S P M +E WSGWF
Sbjct: 249 LFQCDWASNFTKNALPDLLWTINFGTGANIDQQFAGLKKLRPDS---PLMCSEFWSGWFD 305
Query: 260 SFGGAVPYRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNFDRTSGG--PFIS---TSYDY 314
+G RP D+ + +G +F + YM HGGTN+ +G P + TSYDY
Sbjct: 306 KWGANHETRPASDMIAGIDEMLSKGISF-SLYMTHGGTNWGHWAGANSPGFAPDVTSYDY 364
Query: 315 DAPLDEYGLIRQPKWGHLKDLHK 337
DAP+ E G W K L K
Sbjct: 365 DAPISESGQTTPKYWALRKTLGK 387
>gi|395775444|ref|ZP_10455959.1| glycosyl hydrolase family 42 [Streptomyces acidiscabies 84-104]
Length = 587
Score = 161 bits (408), Expect = 1e-36, Method: Compositional matrix adjust.
Identities = 105/306 (34%), Positives = 147/306 (48%), Gaps = 28/306 (9%)
Query: 36 IGGKRRVLISGSIHYPRSTPEMWPDLIQKSKDGGLDVIETYVFWNLHEPVRNQYNFEGRY 95
+ G+ +ISG++HY R P+ W D ++K++ GL+ +ETYV WNLH+P +G
Sbjct: 13 LNGEPFRIISGALHYFRVHPDQWADRLRKARLMGLNTVETYVPWNLHQPEPGTLVLDGLL 72
Query: 96 DLVKFVKLVAEAGLYAHLRIGPYVCAEWNFGGFPLWLHFIPGIQFRTDNEPFKAEMQRFT 155
DL +F++L GL LR GPY+CAEW+ GG P WL +Q R+ + F A + R+
Sbjct: 73 DLPRFLRLAHAEGLKVLLRPGPYICAEWDGGGLPHWLMSESDVQLRSSDPKFTAIIDRYL 132
Query: 156 AKIVDMMKQEKLYASQGGPIILSQIENEYGNIDSAYGAAGKSYIKWAAGMALSLDTGVPW 215
++ + A GGP+I Q+ENEYG AYG + Y+K+ S
Sbjct: 133 DLLLPPLLPH--MAESGGPVIAVQVENEYG----AYGNDAE-YLKYLVEAFRSRGIEELL 185
Query: 216 VMCQQSDAPDPIINTCNGFYCD------------QFTPNSNNKPKMWTENWSGWFLSFGG 263
C Q + + G + P M E W GWF +GG
Sbjct: 186 FTCDQVNPEHQQAGSIPGVLSTGTFGGKIETALATLRAHQPEGPLMCAEFWIGWFDHWGG 245
Query: 264 AVPYRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNFDRTSG-------GPFISTSYDYDA 316
R D+A + + G + N YM+HGGTNF T+G P I TSYDYDA
Sbjct: 246 PHHTRDTADVAADLDKLLAAGASV-NIYMFHGGTNFGLTNGANHHHTYAPTI-TSYDYDA 303
Query: 317 PLDEYG 322
PL E G
Sbjct: 304 PLTENG 309
Score = 41.2 bits (95), Expect = 2.7, Method: Compositional matrix adjust.
Identities = 50/201 (24%), Positives = 81/201 (40%), Gaps = 49/201 (24%)
Query: 521 GSGYGSSSNAKVTVDFPIALAPGKNTFDLLSLTVGLQNYGAFYEKTGA--GITGPVQLKG 578
G+ G N + P+ + ++L +G NYG + GA G+ GPV G
Sbjct: 412 GAPVGVLENERRETSLPVQVHRRGAVLEVLVENMGRVNYGP---RIGAPKGLLGPVTFDG 468
Query: 579 SGNGTNIDLSSQQWTYQTGLKGEELNFPSGSSTQWDSKSTLPKLQPLVWYKTTFDAPAGS 638
+ W + ++ P G++ D++ T +P +++ TF+ +
Sbjct: 469 --------MPVTGWE----CRPLPMDAPLGAALYADAE-TEACAEP-AFHRGTFEVTDPA 514
Query: 639 EPVAIDFTGMGKGEAWVNGQSIGRYWPTYVSQNGGCTDSCNYRGAYSSNKCLKNCGKPSQ 698
+ + G KG+AWVNG S+GRYW N G P Q
Sbjct: 515 D-TFLSLPGWTKGQAWVNGFSLGRYW---------------------------NRG-PQQ 545
Query: 699 SLYHVPRSWLKSSGNTLVLFE 719
+LY VP L+ NTL++ E
Sbjct: 546 TLY-VPGPVLRPGANTLIVLE 565
>gi|257888197|ref|ZP_05667850.1| glycosyl hydrolase [Enterococcus faecium 1,141,733]
gi|431040248|ref|ZP_19492755.1| beta-galactosidase [Enterococcus faecium E1590]
gi|431763679|ref|ZP_19552228.1| beta-galactosidase [Enterococcus faecium E3548]
gi|257824251|gb|EEV51183.1| glycosyl hydrolase [Enterococcus faecium 1,141,733]
gi|430562100|gb|ELB01353.1| beta-galactosidase [Enterococcus faecium E1590]
gi|430622052|gb|ELB58793.1| beta-galactosidase [Enterococcus faecium E3548]
Length = 595
Score = 161 bits (408), Expect = 1e-36, Method: Compositional matrix adjust.
Identities = 129/396 (32%), Positives = 186/396 (46%), Gaps = 52/396 (13%)
Query: 35 VIGGKRRVLISGSIHYPRSTPEMWPDLIQKSKDGGLDVIETYVFWNLHEPVRNQYNFEGR 94
++ G +ISG+IHY R P W + K G + +ETY+ WNLHEP ++F G
Sbjct: 11 LVDGIPTKIISGAIHYFRIPPSQWEHSLYNLKALGANTVETYIPWNLHEPQEGSFDFSGF 70
Query: 95 YDLVKFVKLVAEAGLYAHLRIGPYVCAEWNFGGFPLWLHFIPGIQFRTDNEPFKAEMQRF 154
++V+FVK+ E L LR Y+CAEW FGG P WL P I+ R+ + F +++ +
Sbjct: 71 KNVVRFVKIAQELDLMVILRPCAYICAEWEFGGLPAWLLKEPNIRVRSTDPRFMEKLKNY 130
Query: 155 TAKIVDMMKQEKLYASQGGPIILSQIENEYGNIDSAYGAAGKSYIKWAAGMALSLDTGVP 214
V + K L +QGGP+I+ Q+ENEYG +YG KSY++ + L+ VP
Sbjct: 131 YQ--VLLPKLAPLQITQGGPVIMMQLENEYG----SYGME-KSYLRQTKELMLAHSIDVP 183
Query: 215 -------WV-MCQQSDAPDPIINTCNGF---------YCDQFTPN-SNNKPKMWTENWSG 256
W+ + D I F +F N N P M E W G
Sbjct: 184 LFTSDGAWLEVLDAGTLIDEDIFVTGNFGSHSKENAQVLKEFMQNHQKNWPIMCMEYWDG 243
Query: 257 WFLSFGGAVPYRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNFDRTSGG--------PFI 308
WF +G + R E+LA V + G N YM+HGGTNF +G P I
Sbjct: 244 WFNRWGEPIITRDPEELATEVKEMLEIGSL--NLYMFHGGTNFGFYNGCSARGNTDLPQI 301
Query: 309 STSYDYDAPLDEYGLIRQPKWGHLKDLHKAIKLCEAALVATDP---TYPSLGP---NLEA 362
TSYDYDA L+E G QP + + + IK ++ +P T +LG N
Sbjct: 302 -TSYDYDALLNEAG---QPTEKYYA-VQRVIKEVCPSVWQAEPRTKTLKNLGTYPVNRSV 356
Query: 363 TVYKTGSGLCSAFLANIGTNSDVTVK--FNGNSYLL 396
+++ +C I T+ +T++ NG YLL
Sbjct: 357 SLFHIKEQICE----EIKTDYPLTMEQASNGYGYLL 388
Score = 45.8 bits (107), Expect = 0.10, Method: Compositional matrix adjust.
Identities = 71/280 (25%), Positives = 104/280 (37%), Gaps = 61/280 (21%)
Query: 451 PVGISKDDAFTKPGLLEQINT----TADQSD----YLWYSLSTNIKADEPLLEDGSKTVL 502
PV S K + E+I T T +Q+ YL YSL+ L G K L
Sbjct: 351 PVNRSVSLFHIKEQICEEIKTDYPLTMEQASNGYGYLLYSLT--------LKNYGHKNKL 402
Query: 503 HVQSLGHALHAFINGKLVGSGYGSSSNAKVTVDFPIALAPGKNTFDLLSLTVGLQNYGAF 562
+ +I+GK + + ++ ++ G+ ++L V ++N G
Sbjct: 403 RLIETNDRAQIYIDGKYEQTQTQETLGDEMMIE-------GQKNQPTIALDVLVENLGRV 455
Query: 563 YEKTGAGITGPVQLKGSGNGTNIDLSSQQWTYQTGLKGEELNFPSGSSTQWDSKSTLPKL 622
GA + P Q KG NG D+ + G + L F + D +
Sbjct: 456 --NYGAKLNSPSQSKGIRNGVMQDIH-----FHLGYRHYPLTFEQAQLDKIDYSAGKDPS 508
Query: 623 QPLVWYKTTFDAPAGSEPVAIDFTGMGKGEAWVNGQSIGRYWPTYVSQNGGCTDSCNYRG 682
QP +Y+ FD A ID + GKG VNG ++GRYW N G
Sbjct: 509 QP-SFYQFEFDL-AEEADAYIDCSLYGKGIVIVNGFNLGRYW------NHG--------- 551
Query: 683 AYSSNKCLKNCGKPSQSLYHVPRSWLKSSGNTLVLFEEIG 722
P SLY P+ LK N +V+FE G
Sbjct: 552 -------------PVLSLY-CPKDVLKKGRNEVVIFETEG 577
>gi|357626884|gb|EHJ76789.1| putative carbamoyl-phosphate synthase large chain [Danaus
plexippus]
Length = 2861
Score = 161 bits (408), Expect = 1e-36, Method: Compositional matrix adjust.
Identities = 108/314 (34%), Positives = 159/314 (50%), Gaps = 38/314 (12%)
Query: 35 VIGGKRRVLISGSIHYPRSTPEMWPDLIQKSKDGGLDVIETYVFWNLHEPVRNQYNFEGR 94
++ GK ++SGS+HY R E W D ++K + GL+ + TYV W+ HE Y+FEG
Sbjct: 63 MLDGKPLRIVSGSVHYYRLPAEYWRDRLRKIRAAGLNAVSTYVEWSSHEEEEGAYSFEGD 122
Query: 95 YDLVKFVKLVAEAGLYAHLRIGPYVCAEWNFGGFPLW-LHFIPGIQFRTDNEPFKAEMQR 153
D+ +F+K+ AE LY LR GPY+CAE + GG P W L P I+ RT + F AE ++
Sbjct: 123 KDIARFLKIAAEENLYVLLRPGPYICAERDLGGLPYWLLSKYPDIKLRTTDGNFIAETKK 182
Query: 154 FTAKIVDMMKQEKLYASQGGPIILSQIENEYGNIDSAYGAAG----------KSYIKWAA 203
+ AK+ + +K GGPIIL Q+ENEYG +YGA+ KS+++ AA
Sbjct: 183 WMAKLFEEVK--PFLLGNGGPIILVQVENEYG----SYGASKEYMKQIRDIIKSHVEDAA 236
Query: 204 GMALS--------LDTGVPWVMCQQSDAP-DPIINTCNGFYCDQFTPNSNNKPKMWTENW 254
+ + +D + + P +INT + P P M +E +
Sbjct: 237 LLYTTDGPYRSYFIDGSISGTLTTIDFGPTTSVINTFKELRA--YMPVG---PLMNSEFY 291
Query: 255 SGWFLSFGGAVPYRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNFDRTSG---GPFIS-- 309
GW + + + + F + + N+Y++ GGTNF+ TSG G F
Sbjct: 292 PGWLTHWSEHIQQVSTDRVTFTLRDMLENKINL-NFYVFFGGTNFEFTSGANYGRFYQPD 350
Query: 310 -TSYDYDAPLDEYG 322
TSYDYDAPL E G
Sbjct: 351 ITSYDYDAPLSEAG 364
Score = 40.4 bits (93), Expect = 4.4, Method: Compositional matrix adjust.
Identities = 49/179 (27%), Positives = 76/179 (42%), Gaps = 30/179 (16%)
Query: 496 DGSKTVLHVQSLGHALHAFINGKLVGSGYGSSSNAKVTVDFPIALAPGKNTFDLLSLTVG 555
+G+ VL+++ + +++ KL G S + I PG +T LL G
Sbjct: 450 NGTGGVLNIKKPRDFIFVYVDKKL----QGVISRMMMLYSLSINSKPG-STLSLLVENQG 504
Query: 556 LQNYGAFYEKTGAGITGPVQLKG-------SGNGTNIDLSSQQWTYQTGLKGEELNFPSG 608
N+G GI G V L S G ++D+ K + L+ +
Sbjct: 505 RINFGNRIHDF-KGILGSVLLNNKTLEGPWSVTGYSLDVK----------KSKLLSDDNI 553
Query: 609 SSTQWDSKSTLPKLQPLVWYKTTFDAPAGSEPVA--IDFTGMGKGEAWVNGQSIGRYWP 665
S+ D+ S P + ++ F P G EP+ ID T GKG +VNG ++GRYWP
Sbjct: 554 SAFTEDALSDGPMM-----FEGQFVIPEGEEPLDTFIDTTNWGKGYIFVNGYNLGRYWP 607
>gi|410100792|ref|ZP_11295748.1| hypothetical protein HMPREF1076_04926 [Parabacteroides goldsteinii
CL02T12C30]
gi|409214073|gb|EKN07084.1| hypothetical protein HMPREF1076_04926 [Parabacteroides goldsteinii
CL02T12C30]
Length = 779
Score = 161 bits (408), Expect = 1e-36, Method: Compositional matrix adjust.
Identities = 114/350 (32%), Positives = 170/350 (48%), Gaps = 31/350 (8%)
Query: 7 LLLVLCWGFVVLATTSFGANVTYD--HRAVVIGGKRRVLISGSIHYPRSTPEMWPDLIQK 64
LL+V+ + G+N T++ + ++ GK ++ + IHY R E W IQ
Sbjct: 10 LLMVMLICVLSGCKNQSGSNGTFEIGDKTFLLNGKPFIIKAAEIHYTRIPVEYWEHRIQM 69
Query: 65 SKDGGLDVIETYVFWNLHEPVRNQYNFEGRYDLVKFVKLVAEAGLYAHLRIGPYVCAEWN 124
K G++ I Y FWN+HE +++F G+ D+ F +L + G+Y LR GPYVC+EW
Sbjct: 70 CKALGMNTICIYAFWNIHEQKPGEFDFSGQNDIAAFCRLAQKNGMYIMLRPGPYVCSEWE 129
Query: 125 FGGFPLWLHFIPGIQFRTDNEPFKAEMQRFTAKIVDMMKQEKLYASQGGPIILSQIENEY 184
GG P WL IQ RT++ F + + +I + ++ ++GG II+ Q+ENEY
Sbjct: 130 MGGLPWWLLKKEDIQLRTNDPYFIERTRIYMNEIGKQLADRQI--TRGGNIIMVQVENEY 187
Query: 185 GN--IDSAYGAAGKSYIKWAAGMALSLDTGVPWVMCQQS-----DAPDPIINTCN---GF 234
G+ D +Y A + ++ AG T VP C S +A D ++ T N G
Sbjct: 188 GSYATDKSYIAKNRDILR-DAGF-----TDVPLFQCDWSSNFLNNALDDLVWTVNFGTGA 241
Query: 235 YCD-QFTPNSN---NKPKMWTENWSGWFLSFGGAVPYRPVEDLAFAVARFFQRGGTFQNY 290
D QF N P M +E WSGWF +G R E + + R +F +
Sbjct: 242 NIDEQFKKLKEVRPNTPLMCSEFWSGWFDHWGRKHETRDAETMIAGLRDMLDRNISF-SL 300
Query: 291 YMYHGGTNFDRTSGG-----PFISTSYDYDAPLDEYGLIRQPKWGHLKDL 335
YM HGGT F G + +SYDYDAP+ E G PK+ L++
Sbjct: 301 YMTHGGTTFGHWGGANSPAYSAMCSSYDYDAPISEAGWA-TPKYHKLREF 349
Score = 50.4 bits (119), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 66/278 (23%), Positives = 106/278 (38%), Gaps = 64/278 (23%)
Query: 500 TVLHVQSLGHALHAFINGKLVGSGYGSSSNAKVTVDFPIALAPGKNTFDLLSLTVGLQNY 559
T L + + FI+GKL+G + T+ P A + D+L +G N+
Sbjct: 422 TTLLIDEVHDWAQVFIDGKLIGRL--DRRRGEFTIKLPATAAGAR--LDILIEAMGRVNF 477
Query: 560 G-AFYEKTGAGITGPVQLKGSGNGTNIDLSSQQWTYQTGLKGEELNFPSGSSTQWDSKST 618
A +++ G IT V L + + + W + N P S D K T
Sbjct: 478 DKAIHDRKG--ITNKVVLITESSSDEL----KDW--------QVYNLPVDYSFVKDKKYT 523
Query: 619 L-PKLQPLVWYKTTFDAPAGSEPVAIDFTGMGKGEAWVNGQSIGRYWPTYVSQNGGCTDS 677
K++ +Y+ TF+ + V +D GKG WVNG+++GR+W
Sbjct: 524 PGKKIEAPAYYRATFNLETPGD-VFLDMQTWGKGMVWVNGKAMGRFWEI----------- 571
Query: 678 CNYRGAYSSNKCLKNCGKPSQSLYHVPRSWLKSSGNTLVLFEEIGGDPTKISFVTKQLGS 737
P Q+L+ +P WLK N +++ + G P K S K L +
Sbjct: 572 -----------------GPQQTLF-MPGCWLKKGENEIIVLDLKG--PEKAS--VKGLKT 609
Query: 738 SLCSHVTDSHPLPVDMWGSDSKIQRKPGPVLSLECPNP 775
+ + PL RK G L+L+ P
Sbjct: 610 PILDMLRPEAPL----------TNRKEGQNLNLKNEKP 637
>gi|381169756|ref|ZP_09878919.1| beta-galactosidase [Xanthomonas citri pv. mangiferaeindicae LMG
941]
gi|380689774|emb|CCG35406.1| beta-galactosidase [Xanthomonas citri pv. mangiferaeindicae LMG
941]
Length = 613
Score = 161 bits (408), Expect = 1e-36, Method: Compositional matrix adjust.
Identities = 125/356 (35%), Positives = 174/356 (48%), Gaps = 40/356 (11%)
Query: 7 LLLVLCWGFVVLATTS-------FGANVTYDHRAVVIGGKRRVLISGSIHYPRSTPEMWP 59
L+L L + + T + FG T R GK L+SG+IH+ R W
Sbjct: 9 LVLALAFALPITGTAAETERWPNFGTQGTQFAR----DGKPYQLLSGAIHFQRIPRAYWK 64
Query: 60 DLIQKSKDGGLDVIETYVFWNLHEPVRNQYNFEGRYDLVKFVKLVAEAGLYAHLRIGPYV 119
D +QK++ GL+ +ETYVFWNL EP + Q++F G D+ FV+ A GL LR GPY
Sbjct: 65 DRLQKARALGLNTVETYVFWNLVEPQQGQFDFSGHNDVAAFVREAAAQGLNVILRPGPYA 124
Query: 120 CAEWNFGGFPLWLHFIPGIQFRTDNEPFKAEMQRFTAKIVDMMKQEKLYASQGGPIILSQ 179
CAEW GG+P WL I+ R+ + F A Q + + + + + L GGPII Q
Sbjct: 125 CAEWEAGGYPAWLFGKGNIRVRSRDPRFLAASQAYLDALANQV--QPLLNHNGGPIIAVQ 182
Query: 180 IENEYGNI--DSAYGAAGKS-YIKWAAGMALSLDTGVPWVMCQQSDAPD--PIINTCNG- 233
+ENEYG+ D AY A ++ Y+K AL L T M PD ++N G
Sbjct: 183 VENEYGSYADDHAYMADNRAMYVKAGFDKAL-LFTSDGADMLANGTLPDTLAVVNFAPGE 241
Query: 234 --FYCDQFTPNSNNKPKMWTENWSGWFLSFGGAVPYRPVEDLAFAVARFFQ---RGGTFQ 288
D+ ++P+M E W+GWF +G P+ + A A F+ R G
Sbjct: 242 AKSAFDKLIKFRPDQPRMVGEYWAGWFDHWGK--PHAATD--ARQQAEEFEWILRQGHSA 297
Query: 289 NYYMYHGGTNFDRTSGGPF----------ISTSYDYDAPLDEYGLIRQPKWGHLKD 334
N YM+ GGT+F +G F +TSYDYDA LDE G PK+ ++D
Sbjct: 298 NLYMFIGGTSFGFMNGANFQNNPSDHYAPQTTSYDYDAILDEAGHP-TPKFALMRD 352
>gi|1352080|sp|P48982.1|BGAL_XANMN RecName: Full=Beta-galactosidase; Short=Lactase; Flags: Precursor
gi|1045034|gb|AAC41485.1| beta-galactosidase [Xanthomonas axonopodis pv. manihotis]
Length = 598
Score = 161 bits (408), Expect = 1e-36, Method: Compositional matrix adjust.
Identities = 119/321 (37%), Positives = 162/321 (50%), Gaps = 29/321 (9%)
Query: 35 VIGGKRRVLISGSIHYPRSTPEMWPDLIQKSKDGGLDVIETYVFWNLHEPVRNQYNFEGR 94
V GK L+SG+IH+ R W D +QK++ GL+ +ETYVFWNL EP + Q++F G
Sbjct: 38 VRDGKPYQLLSGAIHFQRIPRAYWKDRLQKARALGLNTVETYVFWNLVEPQQGQFDFSGN 97
Query: 95 YDLVKFVKLVAEAGLYAHLRIGPYVCAEWNFGGFPLWLHFIPGIQFRTDNEPFKAEMQRF 154
D+ FVK A GL LR GPY CAEW GG+P WL I+ R+ + F A Q +
Sbjct: 98 NDVAAFVKEAAAQGLNVILRPGPYACAEWEAGGYPAWLFGKGNIRVRSRDPRFLAASQAY 157
Query: 155 TAKIVDMMKQEKLYASQGGPIILSQIENEYGNI--DSAYGAAGKS-YIKWAAGMALSLDT 211
+ + + L GGPII Q+ENEYG+ D AY A ++ Y+K AL L T
Sbjct: 158 LDALAKQV--QPLLNHNGGPIIAVQVENEYGSYADDHAYMADNRAMYVKAGFDKAL-LFT 214
Query: 212 GVPWVMCQQSDAPD--PIINTCNG---FYCDQFTPNSNNKPKMWTENWSGWFLSFGGAVP 266
M PD ++N G D+ ++P+M E W+GWF +G P
Sbjct: 215 SDGADMLANGTLPDTLAVVNFAPGEAKSAFDKLIKFRPDQPRMVGEYWAGWFDHWGK--P 272
Query: 267 YRPVEDLAFAVARFFQ---RGGTFQNYYMYHGGTNFDRTSGGPF----------ISTSYD 313
+ + A A F+ R G N YM+ GGT+F +G F +TSYD
Sbjct: 273 HAATD--ARQQAEEFEWILRQGHSANLYMFIGGTSFGFMNGANFQNNPSDHYAPQTTSYD 330
Query: 314 YDAPLDEYGLIRQPKWGHLKD 334
YDA LDE G PK+ ++D
Sbjct: 331 YDAILDEAGHP-TPKFALMRD 350
Score = 41.2 bits (95), Expect = 2.4, Method: Compositional matrix adjust.
Identities = 51/208 (24%), Positives = 83/208 (39%), Gaps = 41/208 (19%)
Query: 475 QSDYLWYSLSTNIKAD--EPLLEDGS---------------KTVLHVQSLGHALHAFING 517
+S LW +L T I D +P+ + G K L++ + +++
Sbjct: 379 ESASLWDNLPTPIAIDTPQPMEQFGQDYGYILYRTTITGPRKGPLYLGDVRDVARVYVDQ 438
Query: 518 KLVGSGYGSSSNAKVTVDFPIALAPGKNTFDLLSLTVGLQNYGAFYEKTGAGITGPVQLK 577
+ VGS + V+ P G++T D+L G NYG AG+ PV L
Sbjct: 439 RPVGSVERRLQQVSLEVEIPA----GQHTLDVLVENSGRINYGTRMADGRAGLVDPVLL- 493
Query: 578 GSGNGTNIDLSSQQWTYQTGLKGEELNFPSGSSTQ-WDSKSTLPKLQPLVWYKTTFDAPA 636
SQQ TG + L + S + W K+ +Q +++ T
Sbjct: 494 ----------DSQQ---LTGWQAFPLPMRTPDSIRGWTGKA----VQGPAFHRGTLRIGT 536
Query: 637 GSEPVAIDFTGMGKGEAWVNGQSIGRYW 664
++ +D GKG AW NG ++GR+W
Sbjct: 537 PTD-TYLDMRAFGKGFAWANGVNLGRHW 563
>gi|324507659|gb|ADY43243.1| Beta-galactosidase [Ascaris suum]
Length = 655
Score = 161 bits (408), Expect = 1e-36, Method: Compositional matrix adjust.
Identities = 114/365 (31%), Positives = 183/365 (50%), Gaps = 34/365 (9%)
Query: 1 MASKEILLLVLCW--GFVVLATTSFGANVTYDHRAVVIGGKRRVLISGSIHYPRSTPEMW 58
MA + I++L L + G V+ + ++ ++ + ++ G+ ISGSIHY R P+ W
Sbjct: 6 MADQLIIILSLLFNCGAVIDSHSAPSFSIDPQNNVFLLDGRSFRYISGSIHYFRVHPDQW 65
Query: 59 PDLIQKSKDGGLDVIETYVFWNLHEPVRNQYNFEGRYDLVKFVKLVAEAGLYAHLRIGPY 118
D + + + GL+ I+ Y+ WN HE ++ F+G ++ F++L + LYA +RIGPY
Sbjct: 66 NDRLSRMRAAGLNAIQFYIPWNFHEIYEGKHRFDGSRNITHFLQLAMQNELYALVRIGPY 125
Query: 119 VCAEWNFGGFPLWLHFIPGIQFRTDNEPFKAEMQRFTAKIVDMMKQEKLYASQGGPIILS 178
+CAEW GG P WL I+ RT ++ F ++R+ ++ ++K GGPI++
Sbjct: 126 ICAEWENGGAPWWLLKYKDIKMRTSDKRFLDAVKRWFDVLLPILKPN--LRKNGGPILML 183
Query: 179 QIENEYGNIDSAYGAAGKSYIKWAAGMALSLDTGVPWVMCQQSDAPDPIINTCN---GFY 235
Q+ENEYG+ D +++ A D V+ +D D C G Y
Sbjct: 184 QLENEYGSFDGGCDRNYTIFLRDLARRHFGDD-----VVLYTTDGGDDFYLKCGTIPGVY 238
Query: 236 CD-QFTPNSN---------------NKPKMWTENWSGWFLSFGGAVP-YRPVEDLAFAVA 278
F P S+ + P + +E + GWFL++ +PV ++
Sbjct: 239 ATVDFGPASSEAIDHCFASQRQYEPHGPLVNSEFYPGWFLTWSQKERGDQPVHNVINGSK 298
Query: 279 RFFQRGGTFQNYYMYHGGTNFDRTSGGP---FISTSYDYDAPLDEYGLIRQPKWGHLKDL 335
F++G F NYYM+HGGTNF +GG I+TSYDY APL E I K+ ++D
Sbjct: 299 YMFEKGANF-NYYMFHGGTNFAFWNGGATKTAITTSYDYFAPLSEAADITD-KYLAIRDW 356
Query: 336 HKAIK 340
K I+
Sbjct: 357 IKTIE 361
>gi|114641374|ref|XP_001157987.1| PREDICTED: galactosidase, beta 1-like 2 isoform 2 [Pan troglodytes]
Length = 636
Score = 161 bits (408), Expect = 1e-36, Method: Compositional matrix adjust.
Identities = 115/318 (36%), Positives = 158/318 (49%), Gaps = 17/318 (5%)
Query: 35 VIGGKRRVLISGSIHYPRSTPEMWPDLIQKSKDGGLDVIETYVFWNLHEPVRNQYNFEGR 94
V+ G + GSIHY R E W D + K K GL+ + TYV WNLHEP R++++F G
Sbjct: 55 VLEGSTFWIFGGSIHYFRVPREYWRDRLLKMKACGLNTLTTYVPWNLHEPERSKFDFSGN 114
Query: 95 YDLVKFVKLVAEAGLYAHLRIGPYVCAEWNFGGFPLWLHFIPGIQFRTDNEPFKAEMQRF 154
DL FV + AE GL+ LR GPY+C+E + GG P WL PG++ RT + F + +
Sbjct: 115 LDLEAFVLMAAEIGLWVILRPGPYICSEMDLGGLPSWLLQDPGMRLRTTYKGFTEAVDLY 174
Query: 155 TAKIVDMMKQEKLYASQGGPIILSQIENEYG--NIDSAYGAAGKSYIKWAAGMALSLDT- 211
+ M + L +GGPII Q+ENEYG N D AY K ++ + L L +
Sbjct: 175 FDHL--MSRVVPLQYKRGGPIIAVQVENEYGSYNKDPAYMPYVKKALEDRGIVELLLTSD 232
Query: 212 ---GVPWVMCQQSDAPDPIINTCNGFYCDQFTPN-SNNKPKMWTENWSGWFLSFGGAVPY 267
G+ + Q A + +T F N +PKM E W+GWF S+GG
Sbjct: 233 NKDGLSKGIVQGVLATINLQSTHELQLLTTFLFNVQGTQPKMVMEYWTGWFDSWGGPHNI 292
Query: 268 RPVEDLAFAVARFFQRGGTFQNYYMYHGGTNFDRTSGGPFIS------TSYDYDAPLDEY 321
++ V+ G + N YM+HGGTNF +G TSYDYDA L E
Sbjct: 293 LDSSEVLKTVSAIVDAGSSI-NLYMFHGGTNFGFMNGAMHFHDYKSDVTSYDYDAVLTEA 351
Query: 322 GLIRQPKWGHLKDLHKAI 339
G K+ L+D +I
Sbjct: 352 G-DYTAKYMKLRDFFGSI 368
>gi|431741495|ref|ZP_19530400.1| beta-galactosidase [Enterococcus faecium E2039]
gi|430601673|gb|ELB39267.1| beta-galactosidase [Enterococcus faecium E2039]
Length = 595
Score = 161 bits (408), Expect = 1e-36, Method: Compositional matrix adjust.
Identities = 127/401 (31%), Positives = 188/401 (46%), Gaps = 50/401 (12%)
Query: 35 VIGGKRRVLISGSIHYPRSTPEMWPDLIQKSKDGGLDVIETYVFWNLHEPVRNQYNFEGR 94
++ G +ISG+IHY R P W + K G + +ETY+ WNLHEP ++F G
Sbjct: 11 LVDGIPTKIISGAIHYFRIPPSQWEHSLYNLKALGANTVETYIPWNLHEPQEGSFDFSGF 70
Query: 95 YDLVKFVKLVAEAGLYAHLRIGPYVCAEWNFGGFPLWLHFIPGIQFRTDNEPFKAEMQRF 154
++V+FVK+ E L LR Y+CAEW FGG P WL P I+ R+ + F +++ +
Sbjct: 71 KNVVRFVKIAQELDLMVILRPCAYICAEWEFGGLPAWLLKEPDIRVRSTDPRFMEKLKNY 130
Query: 155 TAKIVDMMKQEKLYASQGGPIILSQIENEYGNIDSAYGAAGKSYIKWAAGMALSLDTGVP 214
V + K L +QGGP+I+ Q+ENEYG +YG KSY++ + L+ VP
Sbjct: 131 YQ--VLLPKLAPLQITQGGPVIMMQLENEYG----SYGME-KSYLRQTKELMLAHSIDVP 183
Query: 215 -------WV-MCQQSDAPDPIINTCNGF---------YCDQFTPN-SNNKPKMWTENWSG 256
W+ + D I F +F N N P M E W G
Sbjct: 184 LFTSDGAWLEVLDAGTLIDEDIFVTGNFGSHSKENAQVLKEFMQNHQKNWPIMCMEYWDG 243
Query: 257 WFLSFGGAVPYRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNFDRTSGG--------PFI 308
WF +G + R E+LA V + G N YM+HGGTNF +G P I
Sbjct: 244 WFNRWGEPIITRDPEELATEVKEMLEIGSL--NLYMFHGGTNFGFYNGCSARGNTDLPQI 301
Query: 309 STSYDYDAPLDEYGLIRQPKWGHLKDLHKAIKLCEAALVATDP---TYPSLGP---NLEA 362
TSYDYDA L+E G QP + + + IK ++ +P T +LG N
Sbjct: 302 -TSYDYDALLNEAG---QPTEKYYA-VQRVIKEVCPSVWQAEPRTKTLKNLGTYPVNKSV 356
Query: 363 TVYKTGSGLCSAFLANIGTNSDVTVKFNGNSYLLPAWSVSI 403
+++ +C I T+ +T++ N Y +S+++
Sbjct: 357 SLFHIKEQICE----EIKTDYPLTMEQASNGYGYLLYSLTL 393
Score = 44.7 bits (104), Expect = 0.21, Method: Compositional matrix adjust.
Identities = 60/226 (26%), Positives = 87/226 (38%), Gaps = 40/226 (17%)
Query: 451 PVGISKDDAFTKPGLLEQINT----TADQSD----YLWYSLSTNIKADEPLLEDGSKTVL 502
PV S K + E+I T T +Q+ YL YSL+ L G K L
Sbjct: 351 PVNKSVSLFHIKEQICEEIKTDYPLTMEQASNGYGYLLYSLT--------LKNYGHKNKL 402
Query: 503 HVQSLGHALHAFINGKLVGSGYGSSSNAKVTVDFPIALAPGKN----TFDLLSLTVGLQN 558
+ +I+GK + + + T+ + L KN D+L +G N
Sbjct: 403 RLIETNDRAQIYIDGKY------NQTQTQETLGDEMMLEGQKNQPTIALDVLVENLGRVN 456
Query: 559 YGAFYEKTGAGITGPVQLKGSGNGTNIDLSSQQWTYQTGLKGEELNFPSGSSTQWDSKST 618
YGA + P Q KG NG D+ + G + L F + D +
Sbjct: 457 YGA-------KLNSPSQSKGIRNGVMQDIH-----FHLGYRHYPLTFEQAQLDKIDYSAG 504
Query: 619 LPKLQPLVWYKTTFDAPAGSEPVAIDFTGMGKGEAWVNGQSIGRYW 664
QP +Y+ FD A ID + GKG +NG ++GRYW
Sbjct: 505 KDPSQP-SFYQFEFDL-AEEADTYIDCSLYGKGVVIINGFNLGRYW 548
>gi|423342145|ref|ZP_17319860.1| hypothetical protein HMPREF1077_01290 [Parabacteroides johnsonii
CL02T12C29]
gi|409219016|gb|EKN11981.1| hypothetical protein HMPREF1077_01290 [Parabacteroides johnsonii
CL02T12C29]
Length = 779
Score = 161 bits (408), Expect = 1e-36, Method: Compositional matrix adjust.
Identities = 112/324 (34%), Positives = 159/324 (49%), Gaps = 29/324 (8%)
Query: 31 HRAVVIGGKRRVLISGSIHYPRSTPEMWPDLIQKSKDGGLDVIETYVFWNLHEPVRNQYN 90
++ ++ GK V+ + IHY R E W IQ K G++ I Y FWN+HE +++
Sbjct: 36 NKTFLLDGKPFVIKAAEIHYTRIPAEYWEHRIQLCKALGMNTICIYAFWNIHEQKPGEFD 95
Query: 91 FEGRYDLVKFVKLVAEAGLYAHLRIGPYVCAEWNFGGFPLWLHFIPGIQFRTDNEPFKAE 150
F G+ D+ F +L + +Y LR GPYVC+EW GG P WL I+ RT++ F
Sbjct: 96 FSGQNDIAAFCRLAQKYDMYIMLRPGPYVCSEWEMGGLPWWLLKKDDIKLRTNDPYFLER 155
Query: 151 MQRFTAKIVDMMKQEKLYASQGGPIILSQIENEYGN--IDSAYGAAGKSYIKWAAGMALS 208
+ F +I + L ++GG II+ Q+ENEYG+ D Y A + +K AG
Sbjct: 156 TKLFMNEIGKQLAD--LQITKGGNIIMVQVENEYGSYATDKEYIANIRDIVK-GAGF--- 209
Query: 209 LDTGVPWVMCQ-----QSDAPDPIINTCN---GFYCD-QFTPNSN---NKPKMWTENWSG 256
T VP C Q++A D ++ T N G D QF N P M +E WSG
Sbjct: 210 --TDVPLFQCDWSSNFQNNALDDLVWTINFGTGANIDEQFKKLKEVRPNTPLMCSEFWSG 267
Query: 257 WFLSFGGAVPYRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNFDRTSGG-----PFISTS 311
WF +G R E + + RG +F + YM HGGT F G + +S
Sbjct: 268 WFDHWGRKHETRDAETMVSGLKDMLDRGISF-SLYMTHGGTTFGHWGGANSPAYSAMCSS 326
Query: 312 YDYDAPLDEYGLIRQPKWGHLKDL 335
YDYDAP+ E G PK+ L++L
Sbjct: 327 YDYDAPISEAGWT-TPKYFKLREL 349
Score = 48.9 bits (115), Expect = 0.011, Method: Compositional matrix adjust.
Identities = 64/279 (22%), Positives = 110/279 (39%), Gaps = 62/279 (22%)
Query: 498 SKTVLHVQSLGHALHAFINGKLVGSGYGSSSNAKVTVDFPIALAPGKNTFDLLSLTVGLQ 557
S T L + + + NGKL+G + ++ P ALA G D+L +G
Sbjct: 420 SGTTLLITEVHDWAQVYANGKLLGRL--DRRRGENSLKLP-ALAAG-TQLDILIEAMGRV 475
Query: 558 NYG-AFYEKTGAGITGPVQLKGSGNGTNIDLSSQQWTYQTGLKGEELNFPSGSSTQWDSK 616
N+ A +++ G IT V+L ++ SS Q + +++P ++
Sbjct: 476 NFDKAIHDRKG--ITEKVEL--------LNESSTQELKNWQVYSFPVDYPFVKEKKY--- 522
Query: 617 STLPKLQPLVWYKTTFDAPAGSEPVAIDFTGMGKGEAWVNGQSIGRYWPTYVSQNGGCTD 676
+ KL +Y+ TF+ + V +D GKG WVNG++IGR+W
Sbjct: 523 APGKKLDGPAYYRATFNLEEAGD-VFLDMQTWGKGMVWVNGKAIGRFWEI---------- 571
Query: 677 SCNYRGAYSSNKCLKNCGKPSQSLYHVPRSWLKSSGNTLVLFEEIGGDPTKISFVTKQLG 736
P Q+L+ +P WLK N +++ + +G + I + K +
Sbjct: 572 ------------------GPQQTLF-MPGCWLKKGENEIIVLDLLGPEKATIKGLDKPIL 612
Query: 737 SSLCSHVTDSHPLPVDMWGSDSKIQRKPGPVLSLECPNP 775
L + +H RK G L+L+ P
Sbjct: 613 DMLRAEAPMTH--------------RKEGENLNLKNEKP 637
>gi|329960238|ref|ZP_08298680.1| putative beta-galactosidase [Bacteroides fluxus YIT 12057]
gi|328532911|gb|EGF59688.1| putative beta-galactosidase [Bacteroides fluxus YIT 12057]
Length = 778
Score = 161 bits (408), Expect = 1e-36, Method: Compositional matrix adjust.
Identities = 117/377 (31%), Positives = 180/377 (47%), Gaps = 33/377 (8%)
Query: 15 FVVLATTSFGANVTYDHRAVVIGGKRRVLISGSIHYPRSTPEMWPDLIQKSKDGGLDVIE 74
F+ +T++ ++ ++ GK ++ + +HY R E W IQ K G++ I
Sbjct: 19 FMGCSTSNKSQTFEVGNQTFLLDGKPFIIKAAEMHYTRIPAEYWEHRIQMCKALGMNTIC 78
Query: 75 TYVFWNLHEPVRNQYNFEGRYDLVKFVKLVAEAGLYAHLRIGPYVCAEWNFGGFPLWLHF 134
Y FWN+HE +++F+G+ D+ +F +L + G+Y LR GPYVC+EW GG P WL
Sbjct: 79 IYAFWNIHEQRPGEFDFKGQNDIAEFCRLAQKNGMYIMLRPGPYVCSEWEMGGLPWWLLK 138
Query: 135 IPGIQFRTDNEPFKAEMQRFTAKIVDMMKQEKLYASQGGPIILSQIENEYGN--IDSAYG 192
IQ RT++ F + F +I + L A +GG II+ Q+ENEYG ++ Y
Sbjct: 139 KKDIQLRTNDPYFLERTKLFMNEIGKQLAD--LQAPRGGNIIMVQVENEYGGYAVNKEYI 196
Query: 193 AAGKSYIKWAAGMALSLDTGVPWVMCQ-----QSDAPDPIINTCN---GFYCD-QFTPNS 243
A + ++ AG T VP C Q + D ++ T N G D QF
Sbjct: 197 ANVRDIVR-GAGF-----TDVPLFQCDWSSTFQLNGLDDLLWTINFGTGANIDAQFKSLK 250
Query: 244 NNKPK---MWTENWSGWFLSFGGAVPYRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNFD 300
+P M +E WSGWF +G R E + + R +F + YM HGGT F
Sbjct: 251 EARPDAPLMCSEFWSGWFDHWGRKHETRDAETMVSGLKDMLDRNISF-SLYMAHGGTTFG 309
Query: 301 RTSGG---PF--ISTSYDYDAPLDEYGLIRQPKWGHLKDLHKAIKLCEAALVATDPTYPS 355
G P+ + +SYDYDAP+ E G PK+ L+++ ++ ++A V P P
Sbjct: 310 HWGGANCPPYSAMCSSYDYDAPISEAGWA-TPKYYKLREM--LMQYADSAQVI--PDVPQ 364
Query: 356 LGPNLEATVYKTGSGLC 372
P +E + C
Sbjct: 365 AYPLIEIPAIRFEETAC 381
Score = 51.6 bits (122), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 71/288 (24%), Positives = 119/288 (41%), Gaps = 70/288 (24%)
Query: 492 PLLEDGSKTVLHVQSLGHALHAFINGKLVGSGYGSSSNAKVTVDFPIALAPGKNTFDLLS 551
P +++G TVL + + + +GKL+G S +T+ AL G D+L
Sbjct: 415 PEVKEG--TVLLIDEVHDWAQVYADGKLLGRLDRRRSENSLTLP---ALKAG-TQLDILV 468
Query: 552 LTVGLQNYG-AFYEKTGAGITGPVQLKGSGNGTNIDLSSQQWTYQTGLKGEEL-NFPSGS 609
+G N+ A +++ G IT V+L L+ + + LKG ++ +FP+ +
Sbjct: 469 EAMGRVNFDYAIHDRKG--ITEKVEL----------LTEES---RKELKGWQVYSFPTDA 513
Query: 610 --STQWDSKSTLPKLQPLVWYKTTFDAPAGSEPVAIDFTGMGKGEAWVNGQSIGRYWPTY 667
+ Q D + K + +Y+ +F+ + V +D GKG WVNG++IGR+W
Sbjct: 514 DFAAQKDFRKG-NKAEGPAYYRASFNLKETGD-VFLDMQTWGKGMVWVNGKAIGRFWEI- 570
Query: 668 VSQNGGCTDSCNYRGAYSSNKCLKNCGKPSQSLYHVPRSWLKSSGNTLVLFEEIGGDPTK 727
P Q+LY +P WLK N +V+ + +G D +
Sbjct: 571 ---------------------------GPQQTLY-MPGCWLKKGKNEIVVLDLLGPDKAE 602
Query: 728 ISFVTKQLGSSLCSHVTDSHPLPVDMWGSDSKIQRKPGPVLSLECPNP 775
I K L + + PL RK G L+L+ P
Sbjct: 603 I----KGLKQPILDMLRSEEPL----------THRKEGENLNLKNEKP 636
>gi|431758215|ref|ZP_19546843.1| beta-galactosidase [Enterococcus faecium E3083]
gi|430617878|gb|ELB54742.1| beta-galactosidase [Enterococcus faecium E3083]
Length = 595
Score = 161 bits (407), Expect = 2e-36, Method: Compositional matrix adjust.
Identities = 128/396 (32%), Positives = 186/396 (46%), Gaps = 52/396 (13%)
Query: 35 VIGGKRRVLISGSIHYPRSTPEMWPDLIQKSKDGGLDVIETYVFWNLHEPVRNQYNFEGR 94
++ G +ISG+IHY R P W + K G + +ETY+ WNLHEP ++F G
Sbjct: 11 LVDGIPTKIISGAIHYFRIPPSQWEHSLYNLKALGANTVETYIPWNLHEPQEGSFDFSGF 70
Query: 95 YDLVKFVKLVAEAGLYAHLRIGPYVCAEWNFGGFPLWLHFIPGIQFRTDNEPFKAEMQRF 154
++V+FVK+ E L LR Y+CAEW FGG P WL P I+ R+ + F +++ +
Sbjct: 71 KNVVRFVKIAQELDLMVILRPCAYICAEWEFGGLPAWLLKEPNIRVRSTDPRFMEKLKNY 130
Query: 155 TAKIVDMMKQEKLYASQGGPIILSQIENEYGNIDSAYGAAGKSYIKWAAGMALSLDTGVP 214
V + K L +QGGP+I+ Q+ENEYG +YG KSY++ + L+ +P
Sbjct: 131 YQ--VLLPKLAPLQITQGGPVIMMQLENEYG----SYGME-KSYLRQTKELMLAHSIDIP 183
Query: 215 -------WV-MCQQSDAPDPIINTCNGF---------YCDQFTPN-SNNKPKMWTENWSG 256
W+ + D I F +F N N P M E W G
Sbjct: 184 LFTSDGAWLEVLDAGTLIDEDIFVTGNFGSHSKENAQVLKEFMQNHQKNWPIMCMEYWDG 243
Query: 257 WFLSFGGAVPYRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNFDRTSGG--------PFI 308
WF +G + R E+LA V + G N YM+HGGTNF +G P I
Sbjct: 244 WFNRWGEPIITRDPEELATEVKEMLEIGSL--NLYMFHGGTNFGFYNGCSARGNTDLPQI 301
Query: 309 STSYDYDAPLDEYGLIRQPKWGHLKDLHKAIKLCEAALVATDP---TYPSLGP---NLEA 362
TSYDYDA L+E G QP + + + IK ++ +P T +LG N
Sbjct: 302 -TSYDYDALLNEAG---QPTEKYYA-VQRVIKEVCPSVWQAEPRTKTLKNLGTYPVNRSV 356
Query: 363 TVYKTGSGLCSAFLANIGTNSDVTVK--FNGNSYLL 396
+++ +C I T+ +T++ NG YLL
Sbjct: 357 SLFHIKEQICE----EIKTDYPLTMEQASNGYGYLL 388
Score = 45.8 bits (107), Expect = 0.11, Method: Compositional matrix adjust.
Identities = 71/280 (25%), Positives = 104/280 (37%), Gaps = 61/280 (21%)
Query: 451 PVGISKDDAFTKPGLLEQINT----TADQSD----YLWYSLSTNIKADEPLLEDGSKTVL 502
PV S K + E+I T T +Q+ YL YSL+ L G K L
Sbjct: 351 PVNRSVSLFHIKEQICEEIKTDYPLTMEQASNGYGYLLYSLT--------LKNYGHKNKL 402
Query: 503 HVQSLGHALHAFINGKLVGSGYGSSSNAKVTVDFPIALAPGKNTFDLLSLTVGLQNYGAF 562
+ +I+GK + + ++ ++ G+ ++L V ++N G
Sbjct: 403 RLIETNDRAQIYIDGKYEQTQTQETLGDEMMIE-------GQKNQPTIALDVLVENLGRV 455
Query: 563 YEKTGAGITGPVQLKGSGNGTNIDLSSQQWTYQTGLKGEELNFPSGSSTQWDSKSTLPKL 622
GA + P Q KG NG D+ + G + L F + D +
Sbjct: 456 --NYGAKLNSPSQSKGIRNGVMQDIH-----FHLGYRHYPLTFEQAQLDKIDYSAGKDPS 508
Query: 623 QPLVWYKTTFDAPAGSEPVAIDFTGMGKGEAWVNGQSIGRYWPTYVSQNGGCTDSCNYRG 682
QP +Y+ FD A ID + GKG VNG ++GRYW N G
Sbjct: 509 QP-SFYQFEFDL-AEEADAYIDCSLYGKGIVIVNGFNLGRYW------NHG--------- 551
Query: 683 AYSSNKCLKNCGKPSQSLYHVPRSWLKSSGNTLVLFEEIG 722
P SLY P+ LK N +V+FE G
Sbjct: 552 -------------PVLSLY-CPKDVLKKGRNEVVIFETEG 577
>gi|293570811|ref|ZP_06681858.1| beta-galactosidase [Enterococcus faecium E980]
gi|430840422|ref|ZP_19458347.1| beta-galactosidase [Enterococcus faecium E1007]
gi|431064256|ref|ZP_19493603.1| beta-galactosidase [Enterococcus faecium E1604]
gi|431124630|ref|ZP_19498626.1| beta-galactosidase [Enterococcus faecium E1613]
gi|431738579|ref|ZP_19527522.1| beta-galactosidase [Enterococcus faecium E1972]
gi|291609079|gb|EFF38354.1| beta-galactosidase [Enterococcus faecium E980]
gi|430495187|gb|ELA71394.1| beta-galactosidase [Enterococcus faecium E1007]
gi|430566915|gb|ELB06003.1| beta-galactosidase [Enterococcus faecium E1613]
gi|430568897|gb|ELB07927.1| beta-galactosidase [Enterococcus faecium E1604]
gi|430597307|gb|ELB35110.1| beta-galactosidase [Enterococcus faecium E1972]
Length = 595
Score = 161 bits (407), Expect = 2e-36, Method: Compositional matrix adjust.
Identities = 127/401 (31%), Positives = 188/401 (46%), Gaps = 50/401 (12%)
Query: 35 VIGGKRRVLISGSIHYPRSTPEMWPDLIQKSKDGGLDVIETYVFWNLHEPVRNQYNFEGR 94
++ G +ISG+IHY R P W + K G + +ETY+ WNLHEP ++F G
Sbjct: 11 LVDGIPTKIISGAIHYFRIPPSQWEHSLYNLKALGANTVETYIPWNLHEPQEGSFDFSGF 70
Query: 95 YDLVKFVKLVAEAGLYAHLRIGPYVCAEWNFGGFPLWLHFIPGIQFRTDNEPFKAEMQRF 154
++V+FVK+ E L LR Y+CAEW FGG P WL P I+ R+ + F +++ +
Sbjct: 71 KNVVRFVKIAQELDLMVILRPCAYICAEWEFGGLPAWLLKEPDIRVRSTDPRFMEKLKNY 130
Query: 155 TAKIVDMMKQEKLYASQGGPIILSQIENEYGNIDSAYGAAGKSYIKWAAGMALSLDTGVP 214
V + K L +QGGP+I+ Q+ENEYG +YG KSY++ + L+ VP
Sbjct: 131 YQ--VLLPKLAPLQITQGGPVIMMQLENEYG----SYGME-KSYLRQTKELMLAHSIDVP 183
Query: 215 -------WV-MCQQSDAPDPIINTCNGF---------YCDQFTPN-SNNKPKMWTENWSG 256
W+ + D I F +F N N P M E W G
Sbjct: 184 LFTSDGAWLEVLDAGTLIDEDIFVTGNFGSHSKENAQVLKEFMQNHQKNWPIMCMEYWDG 243
Query: 257 WFLSFGGAVPYRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNFDRTSGG--------PFI 308
WF +G + R E+LA V + G N YM+HGGTNF +G P I
Sbjct: 244 WFNRWGEPIITRDPEELATEVKEMLEIGSL--NLYMFHGGTNFGFYNGCSARGNTDLPQI 301
Query: 309 STSYDYDAPLDEYGLIRQPKWGHLKDLHKAIKLCEAALVATDP---TYPSLGP---NLEA 362
TSYDYDA L+E G QP + + + IK ++ +P T +LG N
Sbjct: 302 -TSYDYDALLNEAG---QPTEKYYA-VQRVIKEVCPSVWQAEPRTKTLKNLGTYPVNKSV 356
Query: 363 TVYKTGSGLCSAFLANIGTNSDVTVKFNGNSYLLPAWSVSI 403
+++ +C I T+ +T++ N Y +S+++
Sbjct: 357 SLFHIKEQICE----EIKTDYPLTMEQASNGYGYLLYSLTL 393
Score = 45.8 bits (107), Expect = 0.10, Method: Compositional matrix adjust.
Identities = 73/284 (25%), Positives = 104/284 (36%), Gaps = 69/284 (24%)
Query: 451 PVGISKDDAFTKPGLLEQINT----TADQSD----YLWYSLSTNIKADEPLLEDGSKTVL 502
PV S K + E+I T T +Q+ YL YSL+ L G K L
Sbjct: 351 PVNKSVSLFHIKEQICEEIKTDYPLTMEQASNGYGYLLYSLT--------LKNYGHKNKL 402
Query: 503 HVQSLGHALHAFINGKLVGSGYGSSSNAKVTVDFPIALAPGKN----TFDLLSLTVGLQN 558
+ +I+GK + + + T+ + L KN D+L +G N
Sbjct: 403 RLIETNDRAQIYIDGKY------NQTQTQETLGDEMMLEGQKNQPTIALDVLVENLGRVN 456
Query: 559 YGAFYEKTGAGITGPVQLKGSGNGTNIDLSSQQWTYQTGLKGEELNFPSGSSTQWDSKST 618
YGA + P Q KG NG D+ + G + L F + D +
Sbjct: 457 YGA-------KLNSPSQSKGIRNGVMQDIH-----FHLGYRHYPLTFEQAQLDKIDYSAG 504
Query: 619 LPKLQPLVWYKTTFDAPAGSEPVAIDFTGMGKGEAWVNGQSIGRYWPTYVSQNGGCTDSC 678
QP +Y+ FD A ID + GKG +NG ++GRYW N G
Sbjct: 505 KDPSQP-SFYQFEFDL-AEEADTYIDCSLYGKGVVIINGFNLGRYW------NHG----- 551
Query: 679 NYRGAYSSNKCLKNCGKPSQSLYHVPRSWLKSSGNTLVLFEEIG 722
P SLY P+ LK N +++FE G
Sbjct: 552 -----------------PVLSLY-CPKDVLKKGRNEVIIFETEG 577
>gi|313237463|emb|CBY12650.1| unnamed protein product [Oikopleura dioica]
Length = 583
Score = 161 bits (407), Expect = 2e-36, Method: Compositional matrix adjust.
Identities = 108/330 (32%), Positives = 158/330 (47%), Gaps = 41/330 (12%)
Query: 27 VTYDHRAVVIGGKRRVLISGSIHYPRSTPEMWPDLIQKSKDGGLDVIETYVFWNLHEPVR 86
+T D + GK ++SG+IHY R + W +Q D GL+ I+ Y+ WNLHE R
Sbjct: 8 LTADGDTFKLDGKDFRILSGAIHYFRIPKQSWKHRLQSVVDCGLNTIDVYIPWNLHEKER 67
Query: 87 NQYNFEGRYDLVKFVKLVAEAGLYAHLRIGPYVCAEWNFGGFPLWLHFIPGIQFRTDNEP 146
++F G DLV+F + AE GL R GPY+C+EW++GG P WL P + R++
Sbjct: 68 GNFDFAGELDLVEFFTIAAEMGLKVLCRPGPYICSEWDWGGLPSWLLKDPKMHIRSNYCG 127
Query: 147 FKAEMQRFTAKIVDMMKQEKLYASQGGPIILSQIENEYGNIDSAYGAAGKSYIKWAAGMA 206
++A + + +K++ ++ L S GGPII Q+ENEYG+ Y ++ W A +
Sbjct: 128 YQAAVSSYFSKLLPLLA--PLQHSNGGPIIAFQVENEYGD----YVDKDNEHLPWLADLM 181
Query: 207 LSLDTGVPWVMCQQSDAPDPI-------------INT------CNGFYCDQFTPNSNNKP 247
S + + SD I +N+ F P NKP
Sbjct: 182 KSHGLFELFFI---SDGGHTIRKANMLKVRSTAQLNSGSFQLLAKAFSLKSLQP---NKP 235
Query: 248 KMWTENWSGWFLSFGGAVPYRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNFDRTSGGPF 307
+ TE W+GWF +G E + +RG + N+YM+HGGTNF +G
Sbjct: 236 MLVTEFWAGWFDYWGHGRNLLNNEVFEKTLKEILKRGASV-NFYMFHGGTNFGFMNGAIE 294
Query: 308 IS--------TSYDYDAPLDEYGLIRQPKW 329
+ TSYDYD P+DE G R KW
Sbjct: 295 LEKGYYTADVTSYDYDCPVDESG-NRTEKW 323
>gi|411007376|ref|ZP_11383705.1| beta-galactosidase [Streptomyces globisporus C-1027]
Length = 606
Score = 161 bits (407), Expect = 2e-36, Method: Compositional matrix adjust.
Identities = 110/313 (35%), Positives = 154/313 (49%), Gaps = 43/313 (13%)
Query: 38 GKRRVLISGSIHYPRSTPEMWPDLIQKSKDGGLDVIETYVFWNLHEPVRNQYNFEGRYDL 97
GK L+SG++HY R E W + GL+ +ETYV WNLHEP + G L
Sbjct: 15 GKPVRLLSGALHYFRVHEEQWEHRLAMLAAMGLNCVETYVPWNLHEPREGEVRDVG--AL 72
Query: 98 VKFVKLVAEAGLYAHLRIGPYVCAEWNFGGFPLWLHFIPGIQFRTDNEPFKAEMQRFTAK 157
+F+ V AGL+A +R GPY+CAEW GG P+W+ G + RT + ++A ++R+ +
Sbjct: 73 GRFLDAVERAGLWAIVRPGPYICAEWENGGLPVWVTGRFGRRVRTRDAEYRAVVERWFRE 132
Query: 158 IVDMMKQEKLYASQGGPIILSQIENEYGNIDSAYGAAGKSYIKWAAGMALSLDTGVPWVM 217
++ + Q ++ +GGP+IL Q ENEYG+ S Y++W AG+ VP
Sbjct: 133 LLPQVVQRQVV--RGGPVILVQAENEYGSFGSD-----AVYLEWLAGLLRECGVTVPLFT 185
Query: 218 CQQSDAPDP----------IINTCN-------GFYCDQFTPNSNNKPKMWTENWSGWFLS 260
SD P+ ++ T N GF + + P M E W GWF
Sbjct: 186 ---SDGPEDHMLTGGSVPGLLATANFGSGAREGF--EVLRRHQPKGPLMCMEFWCGWFDH 240
Query: 261 FGGAVPYRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNF----DRTSGGPF-------IS 309
+G R E+ A A+ + G + N YM HGGTNF GGP
Sbjct: 241 WGAEPVLRDAEEAAGALREILECGASV-NVYMAHGGTNFAGWAGANRGGPLQDGEFQPTV 299
Query: 310 TSYDYDAPLDEYG 322
TSYDYDAP+DEYG
Sbjct: 300 TSYDYDAPVDEYG 312
>gi|242078615|ref|XP_002444076.1| hypothetical protein SORBIDRAFT_07g006945 [Sorghum bicolor]
gi|241940426|gb|EES13571.1| hypothetical protein SORBIDRAFT_07g006945 [Sorghum bicolor]
Length = 144
Score = 161 bits (407), Expect = 2e-36, Method: Compositional matrix adjust.
Identities = 79/145 (54%), Positives = 105/145 (72%), Gaps = 1/145 (0%)
Query: 704 PRSWLKSSGNTLVLFEEIGGDPTKISFVTKQLGSSLCSHVTDSHPLPVDMWGSDSKIQRK 763
P +L+ N +VLFE+ GGDP+KISFV +Q S+C+ V++ HP +D W S + ++
Sbjct: 1 PCLFLQPGSNDIVLFEQFGGDPSKISFVIRQT-RSVCAQVSEEHPAQIDSWNSSQQTMQR 59
Query: 764 PGPVLSLECPNPNQVISSIKFASFGTPLGTCGSFSRGRCSSARSLSVVRQACVGSKSCSI 823
P L LECP QVISSIKFASFGTP GTCGS+S G CSS +++SVV++AC+G +CS+
Sbjct: 60 YRPELRLECPKDGQVISSIKFASFGTPSGTCGSYSHGECSSTQAISVVQEACIGVSNCSV 119
Query: 824 GVSVNTFGDPCKGVMKSLAVEASCT 848
VS N FG+P GV KSLAVEA+C+
Sbjct: 120 PVSSNYFGNPWTGVTKSLAVEAACS 144
>gi|18410234|ref|NP_565051.1| beta-galactosidase 17 [Arabidopsis thaliana]
gi|75163694|sp|Q93Z24.1|BGL17_ARATH RecName: Full=Beta-galactosidase 17; Short=Lactase 17; Flags:
Precursor
gi|16648842|gb|AAL25611.1| At1g72990/F3N23_19 [Arabidopsis thaliana]
gi|22655360|gb|AAM98272.1| At1g72990/F3N23_19 [Arabidopsis thaliana]
gi|332197279|gb|AEE35400.1| beta-galactosidase 17 [Arabidopsis thaliana]
Length = 697
Score = 161 bits (407), Expect = 2e-36, Method: Compositional matrix adjust.
Identities = 117/330 (35%), Positives = 157/330 (47%), Gaps = 39/330 (11%)
Query: 38 GKRRVLISGSIHYPRSTPEMWPDLIQKSKDGGLDVIETYVFWNLHEPVRNQYNFEGRYDL 97
G R +I G +HY R PE W D + ++ GL+ I+ YV WNLHEP + FEG DL
Sbjct: 74 GNRFQIIGGDLHYFRVLPEYWEDRLLRANALGLNTIQVYVPWNLHEPKPGKMVFEGIGDL 133
Query: 98 VKFVKLVAEAGLYAHLRIGPYVCAEWNFGGFPLWLHFI-PGIQFRTDNEPFKAEMQRFTA 156
V F+KL + LR GPY+C EW+ GGFP WL + P +Q RT + + ++R+
Sbjct: 134 VSFLKLCEKLDFLVMLRAGPYICGEWDLGGFPAWLLAVKPRLQLRTSDPVYLKLVERWWD 193
Query: 157 KIVDMMKQEKLYASQGGPIILSQIENEYGNI--DSAY---------GAAGKSYIKWAA-- 203
V + K L S GGP+I+ QIENEYG+ D AY G G I +
Sbjct: 194 --VLLPKVFPLLYSNGGPVIMVQIENEYGSYGNDKAYLRKLVSMARGHLGDDIIVYTTDG 251
Query: 204 GMALSLDTG-VPW------VMCQQSDAPDPIINTCNGFYCDQFTPNSNNKPKMWTENWSG 256
G +LD G VP V D P PI F P + +E ++G
Sbjct: 252 GTKETLDKGTVPVADVYSAVDFSTGDDPWPIFKLQKKFNA------PGRSPPLSSEFYTG 305
Query: 257 WFLSFGGAVPYRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNFDRTSGGPFIS------- 309
W +G + E A ++ + R G+ YM HGGTNF +G S
Sbjct: 306 WLTHWGEKITKTDAEFTAASLEKILSRNGS-AVLYMVHGGTNFGFYNGANTGSEESDYKP 364
Query: 310 --TSYDYDAPLDEYGLIRQPKWGHLKDLHK 337
TSYDYDAP+ E G I PK+ L+ + K
Sbjct: 365 DLTSYDYDAPIKESGDIDNPKFQALQRVIK 394
>gi|257067624|ref|YP_003153879.1| beta-galactosidase [Brachybacterium faecium DSM 4810]
gi|256558442|gb|ACU84289.1| beta-galactosidase [Brachybacterium faecium DSM 4810]
Length = 631
Score = 160 bits (406), Expect = 2e-36, Method: Compositional matrix adjust.
Identities = 100/300 (33%), Positives = 156/300 (52%), Gaps = 16/300 (5%)
Query: 38 GKRRVLISGSIHYPRSTPEMWPDLIQKSKDGGLDVIETYVFWNLHEPVRNQYNFEGRYDL 97
G +++SG++HY R PE W D +++ G + +ETYV WN+H+P R FEG DL
Sbjct: 16 GDPHLIVSGALHYFRIHPEQWRDRLRRLVVMGCNTVETYVAWNIHQPSREVTTFEGFADL 75
Query: 98 VKFVKLVAEAGLYAHLRIGPYVCAEWNFGGFPLWLHFIPGIQFRTDNEPFKAEMQRFTAK 157
+F+ + AE GL A +R GPY+CAEW GGFP W+ ++ R N + + + +
Sbjct: 76 GRFLDIAAEEGLDAIVRPGPYICAEWENGGFPGWILADRNLRLRNRNAAYLQLVDAWFDQ 135
Query: 158 IVDMMKQEKLYASQGGPIILSQIENEYGNI--DSAYGAAGKSYIKWAAGMALSLDTGVPW 215
++ ++ Q + A +GG +++ Q+ENEYG+ D+AY A + + L + + P
Sbjct: 136 LIPVIAQRQ--AGRGGNVVMVQVENEYGSFGDDTAYLAHLRDGLVARGIEELLVTSDGPA 193
Query: 216 VMCQQSDAPDPIINTCN-GFYCDQFTPNSN----NKPKMWTENWSGWFLSFGGAVPYRPV 270
M D + T N G + + ++P+M E W+GWF +G R
Sbjct: 194 RMWLTGGTVDGALGTVNFGSRTLEVLAMAERELPDQPQMCMEFWNGWFDHWGEEHHERTG 253
Query: 271 EDLAFAVARFFQRGGTFQNYYMYHGGTNFDRTSGGPF------ISTSYDYDAPLDEYGLI 324
D A +A + G + N+YM HGGTNF +G +TSYDYDAP+ E G +
Sbjct: 254 GDAAGELADMLEHGMSV-NFYMAHGGTNFGMQAGANHDGTLQPTTTSYDYDAPIAENGAL 312
>gi|332264034|ref|XP_003281053.1| PREDICTED: beta-galactosidase-1-like protein 2 [Nomascus
leucogenys]
Length = 679
Score = 160 bits (406), Expect = 2e-36, Method: Compositional matrix adjust.
Identities = 113/310 (36%), Positives = 154/310 (49%), Gaps = 17/310 (5%)
Query: 43 LISGSIHYPRSTPEMWPDLIQKSKDGGLDVIETYVFWNLHEPVRNQYNFEGRYDLVKFVK 102
+ GSIHY R E W D + K K GL+ + TYV WNLHEP R +++F G DL FV
Sbjct: 106 IFGGSIHYFRVPREYWRDRLLKMKACGLNTLTTYVPWNLHEPERGKFDFSGNLDLEAFVL 165
Query: 103 LVAEAGLYAHLRIGPYVCAEWNFGGFPLWLHFIPGIQFRTDNEPFKAEMQRFTAKIVDMM 162
+ AE GL+ LR GPY+C+E + GG P WL PG++ RT + F + + + M
Sbjct: 166 MAAEIGLWVILRPGPYICSELDLGGLPSWLLQDPGMRLRTTYKGFTEAVDLYFDHL--MS 223
Query: 163 KQEKLYASQGGPIILSQIENEYG--NIDSAYGAAGKSYIKWAAGMALSLDT----GVPWV 216
+ L +GGPII Q+ENEYG N D AY K ++ + L L + G+
Sbjct: 224 RVVPLQYKRGGPIIAVQVENEYGSYNKDPAYMPYVKKALEDRGIVELLLTSDNKDGLSKG 283
Query: 217 MCQQSDAPDPIINTCNGFYCDQFTPN-SNNKPKMWTENWSGWFLSFGGAVPYRPVEDLAF 275
+ Q A + +T F N +PKM E W+GWF S+GG ++
Sbjct: 284 VVQGVLATINLQSTHELQLLTTFLFNVQGTQPKMVMEYWTGWFDSWGGPHNILDSSEVLK 343
Query: 276 AVARFFQRGGTFQNYYMYHGGTNFDRTSGGPFIS------TSYDYDAPLDEYGLIRQPKW 329
V+ G + N YM+HGGTNF +G TSYDYDA L E G K+
Sbjct: 344 TVSAIVDAGSSI-NLYMFHGGTNFGFMNGAMHFHDYKSDVTSYDYDAVLTEAG-DYTAKY 401
Query: 330 GHLKDLHKAI 339
L+D +I
Sbjct: 402 MKLRDFFGSI 411
>gi|426371167|ref|XP_004052524.1| PREDICTED: beta-galactosidase-1-like protein 2 [Gorilla gorilla
gorilla]
Length = 678
Score = 160 bits (406), Expect = 2e-36, Method: Compositional matrix adjust.
Identities = 113/310 (36%), Positives = 154/310 (49%), Gaps = 17/310 (5%)
Query: 43 LISGSIHYPRSTPEMWPDLIQKSKDGGLDVIETYVFWNLHEPVRNQYNFEGRYDLVKFVK 102
+ GSIHY R E W D + K K GL+ + TYV WNLHEP R +++F G DL FV
Sbjct: 105 IFGGSIHYFRVPREYWRDRLLKMKACGLNTLTTYVPWNLHEPERGKFDFSGNLDLEAFVL 164
Query: 103 LVAEAGLYAHLRIGPYVCAEWNFGGFPLWLHFIPGIQFRTDNEPFKAEMQRFTAKIVDMM 162
+ AE GL+ LR GPY+C+E + GG P WL PG++ RT + F + + + M
Sbjct: 165 MAAEIGLWVILRPGPYICSEMDLGGLPSWLLQDPGMRLRTTYKGFTEAVDLYFDHL--MS 222
Query: 163 KQEKLYASQGGPIILSQIENEYG--NIDSAYGAAGKSYIKWAAGMALSLDT----GVPWV 216
+ L +GGPII Q+ENEYG N D AY K ++ + L L + G+
Sbjct: 223 RVVPLQYKRGGPIIAVQVENEYGSYNKDPAYMPYVKKALEDRGIVELLLTSDNKDGLSKG 282
Query: 217 MCQQSDAPDPIINTCNGFYCDQFTPN-SNNKPKMWTENWSGWFLSFGGAVPYRPVEDLAF 275
+ Q A + +T F N +PKM E W+GWF S+GG ++
Sbjct: 283 IVQGVLATINLQSTHELQLLTTFLFNVQGTQPKMVMEYWTGWFDSWGGPHNILDSSEVLK 342
Query: 276 AVARFFQRGGTFQNYYMYHGGTNFDRTSGGPFIS------TSYDYDAPLDEYGLIRQPKW 329
V+ G + N YM+HGGTNF +G TSYDYDA L E G K+
Sbjct: 343 TVSAIVDAGSSI-NLYMFHGGTNFGFMNGAMHFHDYKSDVTSYDYDAVLTEAG-DYTAKY 400
Query: 330 GHLKDLHKAI 339
L+D +I
Sbjct: 401 MKLRDFFGSI 410
>gi|325925751|ref|ZP_08187124.1| beta-galactosidase [Xanthomonas perforans 91-118]
gi|325543808|gb|EGD15218.1| beta-galactosidase [Xanthomonas perforans 91-118]
Length = 611
Score = 160 bits (406), Expect = 2e-36, Method: Compositional matrix adjust.
Identities = 129/382 (33%), Positives = 183/382 (47%), Gaps = 35/382 (9%)
Query: 35 VIGGKRRVLISGSIHYPRSTPEMWPDLIQKSKDGGLDVIETYVFWNLHEPVRNQYNFEGR 94
V GK ++SG+IH+ R W D +QK++ GL+ +ETYVFWNL EP + Q++F G
Sbjct: 38 VRDGKPYQVLSGAIHFQRIPRAYWKDRLQKARALGLNTVETYVFWNLVEPQQGQFDFSGN 97
Query: 95 YDLVKFVKLVAEAGLYAHLRIGPYVCAEWNFGGFPLWLHFIPGIQFRTDNEPFKAEMQRF 154
D+ FV+ A GL LR GPY CAEW GG+P WL I+ R+ + F A Q +
Sbjct: 98 NDVAAFVREAAAQGLNVILRPGPYACAEWEAGGYPAWLFGKGNIRVRSRDPRFLAASQSY 157
Query: 155 TAKIVDMMKQEKLYASQGGPIILSQIENEYGNI--DSAYGAAGKS-YIKWAAGMALSLDT 211
+ + + L GGPII Q+ENEYG+ D AY A ++ Y+K AL L T
Sbjct: 158 LDALAKQV--QPLLNHNGGPIIAVQVENEYGSYADDHAYMADNRAMYVKAGFDKAL-LFT 214
Query: 212 GVPWVMCQQSDAPD--PIINTCNG---FYCDQFTPNSNNKPKMWTENWSGWFLSFGGAVP 266
M PD ++N G D+ ++P+M E W+GWF +G P
Sbjct: 215 SDGADMLANGTLPDTLAVVNFAPGEAKSAFDKLIKFRPDQPRMVGEYWAGWFDHWGK--P 272
Query: 267 YRPVEDLAFAVARFFQ---RGGTFQNYYMYHGGTNFDRTSGGPF----------ISTSYD 313
+ + A A F+ R G N YM+ GGT+F +G F +TSYD
Sbjct: 273 HAATD--ARQQAEEFEWILRQGHSANLYMFIGGTSFGFMNGANFQNNPSDHYAPQTTSYD 330
Query: 314 YDAPLDEYGLIRQPKWGHLKD-LHKAIKLCEAALVATDPTYPSLGPNLEATVYKTGSGLC 372
YDA LDE G PK+ ++D + + + AL A P L AT + + L
Sbjct: 331 YDAILDEAGHP-TPKFALMRDAIARVTGVQPPALPA-----PIATATLPATPLRESASLW 384
Query: 373 SAFLANIGTNSDVTVKFNGNSY 394
A I ++ ++ G Y
Sbjct: 385 DNLPAPIAIDTPQPMEQFGQDY 406
>gi|28199702|ref|NP_780016.1| beta-galactosidase [Xylella fastidiosa Temecula1]
gi|182682446|ref|YP_001830606.1| beta-galactosidase [Xylella fastidiosa M23]
gi|386083781|ref|YP_006000063.1| Beta-galactosidase [Xylella fastidiosa subsp. fastidiosa GB514]
gi|417557800|ref|ZP_12208811.1| Beta-galactosidase [Xylella fastidiosa EB92.1]
gi|28057823|gb|AAO29665.1| beta-galactosidase [Xylella fastidiosa Temecula1]
gi|182632556|gb|ACB93332.1| Beta-galactosidase [Xylella fastidiosa M23]
gi|307578728|gb|ADN62697.1| Beta-galactosidase [Xylella fastidiosa subsp. fastidiosa GB514]
gi|338179583|gb|EGO82518.1| Beta-galactosidase [Xylella fastidiosa EB92.1]
Length = 612
Score = 160 bits (406), Expect = 2e-36, Method: Compositional matrix adjust.
Identities = 118/352 (33%), Positives = 168/352 (47%), Gaps = 25/352 (7%)
Query: 4 KEILLLVLCWGFVVLATTSFGANVTYDHRAV--VIGGKRRVLISGSIHYPRSTPEMWPDL 61
+ +L L L + V+ S + R + G+ LISG+IH+ R W D
Sbjct: 3 RHLLTLSLIFAIVLPIGVSAAPWPAFSTRGTQFIRDGRPYQLISGAIHFQRIPRAYWKDR 62
Query: 62 IQKSKDGGLDVIETYVFWNLHEPVRNQYNFEGRYDLVKFVKLVAEAGLYAHLRIGPYVCA 121
+QK++ GL+ +ETYVFWNL E Q++F G D+ FV+ A GL LR GPYVCA
Sbjct: 63 LQKARAMGLNTVETYVFWNLVELREGQFDFTGNNDIGAFVREAASQGLNVILRPGPYVCA 122
Query: 122 EWNFGGFPLWLHFIPGIQFRTDNEPFKAEMQRFTAKIVDMMKQEKLYASQGGPIILSQIE 181
EW GGFP WL P ++ R+ + F QR+ + ++ L GGPII Q+E
Sbjct: 123 EWEAGGFPAWLFADPTLRVRSQDPRFLDASQRYLEALGTQVR--PLLNGNGGPIIAVQVE 180
Query: 182 NEYGNIDSAYG---AAGKSYIKWAAGMALSLDTGVPWVMCQQSDAPDPI--INTCNGF-- 234
NEYG+ +G A +IK G AL L T M PD + +N G
Sbjct: 181 NEYGSYGDDHGYLQAVRALFIKAGLGGAL-LFTADGAQMLGNGTLPDVLAAVNVAPGEAK 239
Query: 235 -YCDQFTPNSNNKPKMWTENWSGWFLSFGGAVPYRPVEDLAFAVARFFQRGGTFQNYYMY 293
D+ +P++ E W+GWF +G + A + ++G + N YM+
Sbjct: 240 QALDKLATFHPGQPQLVGEYWAGWFDQWGKPHAQTDAKQQADEIEWMLRQGHSI-NLYMF 298
Query: 294 HGGTNFDRTSGGPF----------ISTSYDYDAPLDEYGLIRQPKWGHLKDL 335
GGT+F +G F +TSYDYDA LDE G PK+ +D+
Sbjct: 299 VGGTSFGFMNGANFQGGPSDHYSPQTTSYDYDAVLDEAGRP-MPKFALFRDV 349
Score = 45.4 bits (106), Expect = 0.12, Method: Compositional matrix adjust.
Identities = 59/266 (22%), Positives = 88/266 (33%), Gaps = 51/266 (19%)
Query: 469 INTTADQSDYLWYSLSTNIKADEPLLEDGSKTVLHVQSLGHALHAFINGKLVGSGYGSSS 528
+ TTAD Y + L K L++ + H +++ VG
Sbjct: 389 VATTADPQPMERYGQAYGYILYRTTLHGPRKGRLYLGEVRDDAHVYVDRLFVGRAERRRQ 448
Query: 529 NAKVTVDFPIALAPGKNTFDLLSLTVGLQNYGAFYEKTGAGITGPVQLKGSGNGTNIDLS 588
V VD P G + D+L G NYG AG+ GPV L N ++
Sbjct: 449 QVWVEVDIP----SGTHCLDVLVENSGRVNYGPHLADGRAGLIGPVML----NHERVN-- 498
Query: 589 SQQWTYQTGLKGEELNFPSGSSTQWDSKSTLPKLQPLVWYKTTFDAPAGSEPVAIDFTGM 648
W E P + +T P P T F G +D
Sbjct: 499 --NW--------ETFLLPLQTPEAIHGWTTAPMQGPAFHRGTLFIRTPGD--TFLDMEAF 546
Query: 649 GKGEAWVNGQSIGRYWPTYVSQNGGCTDSCNYRGAYSSNKCLKNCGKPSQSLYHVPRSWL 708
KG W NG +GRYW + G P ++LY P +W
Sbjct: 547 SKGVTWANGHMLGRYW---------------------------DIG-PQRALY-FPGAWQ 577
Query: 709 KSSGNTLVLFEEIGGDPTKISFVTKQ 734
+ NT+++F+ ++ V +Q
Sbjct: 578 RQGENTVLVFDVSDTAAAQVRGVQQQ 603
>gi|325261840|ref|ZP_08128578.1| glycosyl hydrolase, family 35 [Clostridium sp. D5]
gi|324033294|gb|EGB94571.1| glycosyl hydrolase, family 35 [Clostridium sp. D5]
Length = 581
Score = 160 bits (406), Expect = 2e-36, Method: Compositional matrix adjust.
Identities = 112/326 (34%), Positives = 159/326 (48%), Gaps = 43/326 (13%)
Query: 36 IGGKRRVLISGSIHYPRSTPEMWPDLIQKSKDGGLDVIETYVFWNLHEPVRNQYNFEGRY 95
I ++ +ISG +HY R E W D + K K G + +ETY+ WNLHE + ++ FEG
Sbjct: 12 IDNQKVKIISGGVHYFRIMAEYWKDCLLKLKAFGCNTVETYIPWNLHEKEKGEFCFEGNL 71
Query: 96 DLVKFVKLVAEAGLYAHLRIGPYVCAEWNFGGFPLWLHFIPGIQFRTDNEPFKAEMQRFT 155
D+ KFV + + GLY LR PY+CAEW FGG P WL G++ R +PF ++ +
Sbjct: 72 DITKFVHIAKDLGLYVILRPSPYICAEWEFGGLPYWLLKEDGMRLRCSYKPFLKHVEEYY 131
Query: 156 AKIVDMMKQEKLYASQGGPIILSQIENEYGNIDSAYGAAGKSYIKWAAGMALSLDTGVPW 215
++ +++ L ++GGP+I+ Q+ENEYG Y Y+K +S VP
Sbjct: 132 HRLFEVIA--PLQYTKGGPVIMMQVENEYG-----YYGNDTLYLKTLQDFMVSYGCEVPL 184
Query: 216 VMCQQSDAP----------DPIINTCN-GFYCDQ----FTPNSNNKPKMWTENWSGWFLS 260
V SD P + ++ T N G Q NKP M E W GWF S
Sbjct: 185 V---TSDGPWGDAFDCGKLEGVLQTGNFGSKSRQQLQIMRDKIGNKPLMCMEFWVGWFDS 241
Query: 261 FGGAV-----PYRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNFDRTSGGPFIS------ 309
+G P + E+L + G N YM+ GGTNF +G +
Sbjct: 242 WGQTEHKQEDPNKNAENL----DEILESGHV--NIYMFMGGTNFGFMNGSNYYDVLTPDV 295
Query: 310 TSYDYDAPLDEYGLIRQPKWGHLKDL 335
TSYDYDA L E G + PK+ LK++
Sbjct: 296 TSYDYDALLTEAGDL-TPKYELLKNV 320
>gi|220914306|ref|YP_002489615.1| beta-galactosidase [Arthrobacter chlorophenolicus A6]
gi|219861184|gb|ACL41526.1| Beta-galactosidase [Arthrobacter chlorophenolicus A6]
Length = 586
Score = 160 bits (406), Expect = 2e-36, Method: Compositional matrix adjust.
Identities = 110/312 (35%), Positives = 159/312 (50%), Gaps = 32/312 (10%)
Query: 32 RAVVIGGKRRVLISGSIHYPRSTPEMWPDLIQKSKDGGLDVIETYVFWNLHEPVRNQYNF 91
R ++ G+ ++SG+IHY R P++W D I+K++ GL+ IETYV WN H +
Sbjct: 9 RDFLLDGEPFRILSGAIHYFRVHPDLWADRIRKARLMGLNTIETYVPWNEHSSTPGAFRT 68
Query: 92 EGRYDLVKFVKLVAEAGLYAHLRIGPYVCAEWNFGGFPLWLHFIPGIQFRTDNEPFKAEM 151
+G DL +F+ LVA G+ +R GPY+CAEW+ GG P WL P I R+ + A +
Sbjct: 69 DGGLDLGRFLDLVAAEGMQGIVRPGPYICAEWDNGGLPAWLFTDPSIGVRSSEPGYLAAV 128
Query: 152 QRFTAKIVDMMKQEKLYASQGGPIILSQIENEYGNIDSAYGAAGKSYIKWAAGMALSLDT 211
F +++ ++ + ++ ++GGP+IL QIENEYG AYG+ K+Y++ A
Sbjct: 129 DGFMDRLLPIVVERQI--TRGGPVILFQIENEYG----AYGSD-KAYLQHLVDTATRAGV 181
Query: 212 GVPWVMCQQ------SDAPDPIINTCNGF--YCDQ----FTPNSNNKPKMWTENWSGWFL 259
VP C Q D P ++ F D+ + P M E W+GWF
Sbjct: 182 EVPLFTCDQPFETMIEDGSLPGLHKTGTFGSRADERLAFLRERQPDGPLMCAEFWNGWFD 241
Query: 260 SFGGAVPYRPVEDLAFAVARFFQRGGTFQ--NYYMYHGGTNFDRTSGG-------PFIST 310
++G + D A + A N YM+HGGTNF T+G P I T
Sbjct: 242 NWG---THHHTTDAAASAAELDALLAAGASVNIYMFHGGTNFGFTNGANDKGIYEPTI-T 297
Query: 311 SYDYDAPLDEYG 322
SYDYDAPL E G
Sbjct: 298 SYDYDAPLSEDG 309
>gi|296216696|ref|XP_002807336.1| PREDICTED: LOW QUALITY PROTEIN: beta-galactosidase-1-like protein
3-like [Callithrix jacchus]
Length = 652
Score = 160 bits (406), Expect = 2e-36, Method: Compositional matrix adjust.
Identities = 133/454 (29%), Positives = 201/454 (44%), Gaps = 40/454 (8%)
Query: 36 IGGKRRVLISGSIHYPRSTPEMWPDLIQKSKDGGLDVIETYVFWNLHEPVRNQYNFEGRY 95
+ G + ++ GSIHY R E W D + K K G + + TYV WNLHEP R +++F G
Sbjct: 81 LEGHKFLIFGGSIHYFRVPREYWRDRLLKLKACGFNTVTTYVPWNLHEPERGRFDFSGNL 140
Query: 96 DLVKFVKLVAEAGLYAHLRIGPYVCAEWNFGGFPLWLHFIPGIQFRTDNEPFKAEMQRFT 155
DL FV + +E GL+ LR GPY+C+E + GG P WL P + RT N+ F ++++
Sbjct: 141 DLEAFVLMASEIGLWVILRPGPYICSEIDLGGLPSWLLQDPQLLLRTTNKGFIEAVEKYF 200
Query: 156 AKIVDMMKQEKLYASQGGPIILSQIENEYG--NIDSAYGA-AGKSYIKWAAGMALSLDTG 212
++ + L QGGP+I Q+ENEYG N D Y K+ ++ L G
Sbjct: 201 DHLIP--RVIPLQYRQGGPVIAVQVENEYGSFNKDKKYMPYLHKAMLRRGIVELLLTSDG 258
Query: 213 VPWVMCQQSDAPDPIINTC----NGFYCDQFTPNSNNKPKMWTENWSGWFLSFGGAVPYR 268
V+ + IN N F Q +KP + E W GWF +
Sbjct: 259 EKNVLSGHTKGVLATINLQKLHRNTF--SQLHKVQRDKPLLNMEYWVGWFDRWXDKHHVT 316
Query: 269 PVEDLAFAVARFFQRGGTFQNYYMYHGGTNFDRTSGGPF------ISTSYDYDAPLDEYG 322
+++ V+ F + +F N YM+HGGTNF +G + + TSYDYDA L E G
Sbjct: 317 DAKEIEHTVSEFIKYEISF-NVYMFHGGTNFGFLNGATYFGKHAGVVTSYDYDAVLTEAG 375
Query: 323 LIRQPKWGHLKDL---HKAIKLCEAALVATDPTYPSLGPNLEATVYKTGSGLCSAFLANI 379
+ K+ L+ L AI L + YP + P+L ++ + L ++
Sbjct: 376 DYTE-KYFKLQKLFGSFSAIPLPRVPKLTPKAAYPPVRPSLYLRLWDVLAYLNEPVRSHQ 434
Query: 380 GTNSDVTVKFNGN--SYLLPAWSVSILP---------DCKNVVFNTAKI------NSVTL 422
N + NG+ SY L + SI D V + I N
Sbjct: 435 PINMENLPINNGSGQSYGLVLYEKSICSGGRLCAHAHDMAQVFLDETMIGILNENNQNLH 494
Query: 423 VPSFSR-QSLQVAADSSDAIGSGWSYINEPVGIS 455
+P + L++ ++ + W NE GI+
Sbjct: 495 IPELRVCRYLRILVENQGRVNFSWQIQNEQKGIT 528
>gi|298481696|ref|ZP_06999887.1| beta-galactosidase (Lactase) [Bacteroides sp. D22]
gi|298272237|gb|EFI13807.1| beta-galactosidase (Lactase) [Bacteroides sp. D22]
Length = 778
Score = 160 bits (405), Expect = 2e-36, Method: Compositional matrix adjust.
Identities = 122/362 (33%), Positives = 177/362 (48%), Gaps = 39/362 (10%)
Query: 1 MASKEILLLVLCWGFVVLATTSFGANVTYDH-----RAVVIGGKRRVLISGSIHYPRSTP 55
M +K I LLVL F V+ +S A T ++ GK V+ + +HY R
Sbjct: 1 MKNKIIALLVL---FTVILFSSAQAQTTAHKFEAGKNTFLLDGKPFVVKAAELHYTRIPQ 57
Query: 56 EMWPDLIQKSKDGGLDVIETYVFWNLHEPVRNQYNFEGRYDLVKFVKLVAEAGLYAHLRI 115
W I+ K G++ I Y+FWN+HE +++F G+ D+ F KL + G+Y +R
Sbjct: 58 AYWSHRIEMCKALGMNTICIYIFWNIHEQEEGKFDFSGQNDIAAFCKLAQQHGMYVIVRP 117
Query: 116 GPYVCAEWNFGGFPLWLHFIPGIQFRTDNEPFKAEMQRFTAKIVDMMKQ-EKLYASQGGP 174
GPYVCAEW GG P WL + RT + + M+R + ++ KQ L ++GG
Sbjct: 118 GPYVCAEWEMGGLPWWLLKKKDVALRTLDPYY---MERVGIFMKEVGKQLAPLQVNKGGN 174
Query: 175 IILSQIENEYGN--IDSAYGAAGKSYIKWAAGMALSLDTGVPWVMCQQS-----DAPDPI 227
II+ Q+ENEYG+ D Y +A + ++ +G T VP C S +A D +
Sbjct: 175 IIMVQVENEYGSYGTDKPYVSAVRDLVR-ESGF-----TDVPLFQCDWSSNFTNNALDDL 228
Query: 228 INTCN---GFYCD-QFTPNSNNKPK---MWTENWSGWFLSFGGAVPYRPVEDLAFAVARF 280
I T N G D QF +P+ M +E WSGWF +G RP +D+ +
Sbjct: 229 IWTVNFGTGANIDQQFKKLKELRPETPLMCSEFWSGWFDHWGRKHETRPAKDMVQGIKDM 288
Query: 281 FQRGGTFQNYYMYHGGTNFDRTSGG-----PFISTSYDYDAPLDEYGLIRQPKWGHLKDL 335
R +F + YM HGGT F G + +SYDYDAP+ E G + K+ L+DL
Sbjct: 289 LDRNISF-SLYMTHGGTTFGHWGGANNPAYSAMCSSYDYDAPISEAGWTTE-KFFLLRDL 346
Query: 336 HK 337
K
Sbjct: 347 LK 348
Score = 49.3 bits (116), Expect = 0.009, Method: Compositional matrix adjust.
Identities = 62/250 (24%), Positives = 94/250 (37%), Gaps = 49/250 (19%)
Query: 500 TVLHVQSLGHALHAFINGKLVGSGYGSSSNAKVTVDFPIALAPGKNTFDLLSLTVGLQNY 559
TVL + + + +GKL+ + T P AL G D+L +G N+
Sbjct: 420 TVLKITEVHDWAQIYADGKLLARL--DRRKGEFTTTLP-ALKKG-TQLDILVEAMGRVNF 475
Query: 560 GAFYEKTGAGITGPVQLKGSGNGTNIDLSSQQWTYQTGLKGEELNFPSGSSTQWDSKSTL 619
GIT V+L SGN T + WT NFP S D K
Sbjct: 476 DKSIHDR-KGITEKVELV-SGNQTK---ELKNWTV--------YNFPVDYSFIKDKKYND 522
Query: 620 PKLQPLV--WYKTTFDAPAGSEPVAIDFTGMGKGEAWVNGQSIGRYWPTYVSQNGGCTDS 677
K+ P + +YK+TF + +D + GKG WVNG ++GR+W
Sbjct: 523 TKILPAMPAYYKSTFKLDKVGDTF-LDMSTWGKGMVWVNGHAMGRFWEI----------- 570
Query: 678 CNYRGAYSSNKCLKNCGKPSQSLYHVPRSWLKSSGNTLVLFEEIGGDPTKISFVTKQLGS 737
P Q+L+ +P WLK N +++ + G I + K +
Sbjct: 571 -----------------GPQQTLF-MPGCWLKEGENEILVLDLKGPAKASIKGLKKPILD 612
Query: 738 SLCSHVTDSH 747
L ++H
Sbjct: 613 VLREKAPETH 622
>gi|156552637|ref|XP_001603160.1| PREDICTED: beta-galactosidase-like [Nasonia vitripennis]
Length = 629
Score = 160 bits (405), Expect = 2e-36, Method: Compositional matrix adjust.
Identities = 112/356 (31%), Positives = 177/356 (49%), Gaps = 34/356 (9%)
Query: 7 LLLVLCWGFVVLATTS------FGANVTYDHRAVVIGGKRRVLISGSIHYPRSTPEMWPD 60
L + F LA +S + + Y++ ++ GK +SGS HY R+ + W
Sbjct: 7 LFITYLLAFSNLAESSEHNIKNYSFAIDYENDQFLLDGKPFRYVSGSFHYFRTPRQHWRG 66
Query: 61 LIQKSKDGGLDVIETYVFWNLHEPVRNQYNFEGRYDLVKFVKLVAEAGLYAHLRIGPYVC 120
+++K + GGL+ + TYV W++HEP +Q+ ++G D+V+F+K+ E L+ LR GPY+C
Sbjct: 67 ILRKMRAGGLNAVSTYVEWSMHEPEFDQWVWDGDADIVEFIKIAQEEDLFVILRPGPYIC 126
Query: 121 AEWNFGGFPLW-LHFIPGIQFRTDNEPFKAEMQRFTAKIVDMMKQEKLYASQGGPIILSQ 179
AE +FGGFP W L +P I+ RT +E + +RF +I + + + L GGPII+ Q
Sbjct: 127 AERDFGGFPYWLLSRVPDIKLRTKDERYVFYAERFLNEI--LRRTKPLLRGNGGPIIMVQ 184
Query: 180 IENEYGNI---DSAYGAAG----KSYIKWAAGMALSLDTGVPWVMCQQSDAPDPIINTCN 232
+ENEYG+ D Y + ++K A + + + + C I+ N
Sbjct: 185 VENEYGSFYACDDQYKSKMYEIFHRHVKNDAVLFTTDGSARSMLKCGSIPGVYATIDFGN 244
Query: 233 GF-------YCDQFTPNSNNKPKMWTENWSGWFLSFGGAVPYRPVEDLAFAVARFFQRGG 285
G +F+P P + +E + GW +G + ++A +
Sbjct: 245 GANVPFNYKIMREFSPKG---PLVNSEYYPGWLTHWGESFQRVNSHNVAKTLDEMLAYNV 301
Query: 286 TFQNYYMYHGGTNFDRTSGGPFIS------TSYDYDAPLDEYGLIRQPKWGHLKDL 335
+ N YMY+GGTNF TSG TSYDYDAPL E G PK+ L+D+
Sbjct: 302 SV-NIYMYYGGTNFAFTSGANINEHYWPQLTSYDYDAPLTEAG-DPTPKYFELRDV 355
>gi|431593417|ref|ZP_19521746.1| beta-galactosidase [Enterococcus faecium E1861]
gi|430591294|gb|ELB29332.1| beta-galactosidase [Enterococcus faecium E1861]
Length = 595
Score = 160 bits (405), Expect = 2e-36, Method: Compositional matrix adjust.
Identities = 127/401 (31%), Positives = 188/401 (46%), Gaps = 50/401 (12%)
Query: 35 VIGGKRRVLISGSIHYPRSTPEMWPDLIQKSKDGGLDVIETYVFWNLHEPVRNQYNFEGR 94
++ G +ISG+IHY R P W + K G + +ETY+ WNLHEP ++F G
Sbjct: 11 LVDGIPTKIISGAIHYFRIPPSQWEHSLYNLKALGANTVETYIPWNLHEPQEGSFDFSGF 70
Query: 95 YDLVKFVKLVAEAGLYAHLRIGPYVCAEWNFGGFPLWLHFIPGIQFRTDNEPFKAEMQRF 154
++V+FVK+ E L LR Y+CAEW FGG P WL P I+ R+ + F +++ +
Sbjct: 71 KNVVRFVKIAQELDLMVILRPCAYICAEWEFGGLPAWLLKEPDIRVRSTDPRFMEKLKNY 130
Query: 155 TAKIVDMMKQEKLYASQGGPIILSQIENEYGNIDSAYGAAGKSYIKWAAGMALSLDTGVP 214
V + K L +QGGP+I+ Q+ENEYG +YG KSY++ + L+ VP
Sbjct: 131 YQ--VLLPKLAPLQITQGGPVIMMQLENEYG----SYGME-KSYLRQTKELMLAHSIDVP 183
Query: 215 -------WV-MCQQSDAPDPIINTCNGF---------YCDQFTPN-SNNKPKMWTENWSG 256
W+ + D I F +F N N P M E W G
Sbjct: 184 LFTSDGAWLEVLDAGILIDEDIFVTGNFGSHSKENAQVLKEFMQNHQKNWPIMCMEYWDG 243
Query: 257 WFLSFGGAVPYRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNFDRTSGG--------PFI 308
WF +G + R E+LA V + G N YM+HGGTNF +G P I
Sbjct: 244 WFNRWGEPIITRDPEELATEVKEMLEIGSL--NLYMFHGGTNFGFYNGCSARGNTDLPQI 301
Query: 309 STSYDYDAPLDEYGLIRQPKWGHLKDLHKAIKLCEAALVATDP---TYPSLGP---NLEA 362
TSYDYDA L+E G QP + + + IK ++ +P T +LG N
Sbjct: 302 -TSYDYDALLNEAG---QPTEKYYA-VQRVIKEVCPSVWQAEPRTKTLKNLGTYPVNKSV 356
Query: 363 TVYKTGSGLCSAFLANIGTNSDVTVKFNGNSYLLPAWSVSI 403
+++ +C I T+ +T++ N Y +S+++
Sbjct: 357 SLFHIKEQICE----EIKTDYPLTMEQASNGYGYLLYSLTL 393
Score = 46.2 bits (108), Expect = 0.073, Method: Compositional matrix adjust.
Identities = 73/284 (25%), Positives = 104/284 (36%), Gaps = 69/284 (24%)
Query: 451 PVGISKDDAFTKPGLLEQINT----TADQSD----YLWYSLSTNIKADEPLLEDGSKTVL 502
PV S K + E+I T T +Q+ YL YSL+ L G K L
Sbjct: 351 PVNKSVSLFHIKEQICEEIKTDYPLTMEQASNGYGYLLYSLT--------LKNYGHKNKL 402
Query: 503 HVQSLGHALHAFINGKLVGSGYGSSSNAKVTVDFPIALAPGKN----TFDLLSLTVGLQN 558
+ +I+GK + + + T+ + L KN D+L +G N
Sbjct: 403 RLIETNDRAQIYIDGKY------NQTQTQETLGDEMMLEGQKNQPTIALDVLVENLGRVN 456
Query: 559 YGAFYEKTGAGITGPVQLKGSGNGTNIDLSSQQWTYQTGLKGEELNFPSGSSTQWDSKST 618
YGA + P Q KG NG D+ + G + L F + D +
Sbjct: 457 YGA-------KLNSPSQSKGIRNGVMQDIH-----FHLGYRHYPLTFEQAQLDKIDYSAG 504
Query: 619 LPKLQPLVWYKTTFDAPAGSEPVAIDFTGMGKGEAWVNGQSIGRYWPTYVSQNGGCTDSC 678
QP +Y+ FD A ID + GKG +NG ++GRYW N G
Sbjct: 505 KDPSQP-SFYQFEFDL-AEEADTYIDCSLYGKGAVIINGFNLGRYW------NHG----- 551
Query: 679 NYRGAYSSNKCLKNCGKPSQSLYHVPRSWLKSSGNTLVLFEEIG 722
P SLY P+ LK N +++FE G
Sbjct: 552 -----------------PVLSLY-CPKDVLKKGRNEVIIFETEG 577
>gi|15837442|ref|NP_298130.1| beta-galactosidase [Xylella fastidiosa 9a5c]
gi|9105744|gb|AAF83650.1|AE003923_8 beta-galactosidase [Xylella fastidiosa 9a5c]
Length = 612
Score = 160 bits (405), Expect = 2e-36, Method: Compositional matrix adjust.
Identities = 112/316 (35%), Positives = 156/316 (49%), Gaps = 23/316 (7%)
Query: 38 GKRRVLISGSIHYPRSTPEMWPDLIQKSKDGGLDVIETYVFWNLHEPVRNQYNFEGRYDL 97
G+ LISG+IH+ R W D +QK++ GL+ +ETYVFWNL E Q++F G D+
Sbjct: 39 GRPYQLISGAIHFQRIPRAYWKDRLQKARAMGLNTVETYVFWNLVELREGQFDFTGNNDI 98
Query: 98 VKFVKLVAEAGLYAHLRIGPYVCAEWNFGGFPLWLHFIPGIQFRTDNEPFKAEMQRFTAK 157
FV+ A GL LR GPYVCAEW GGFP WL P ++ R+ + F QR+
Sbjct: 99 SAFVREAASQGLNVILRPGPYVCAEWEAGGFPAWLFADPTLRVRSQDPRFLDASQRYLEA 158
Query: 158 IVDMMKQEKLYASQGGPIILSQIENEYGNIDSAYG---AAGKSYIKWAAGMALSLDTGVP 214
+ ++ L GGPII Q+ENEYG+ +G A +IK G AL L T
Sbjct: 159 LGTQVR--PLLNGNGGPIIAVQVENEYGSYGDDHGYLQAVRALFIKAGLGGAL-LFTADG 215
Query: 215 WVMCQQSDAPDPI--INTCNGF---YCDQFTPNSNNKPKMWTENWSGWFLSFGGAVPYRP 269
M PD + +N G D+ +P++ E W+GWF +G
Sbjct: 216 AQMLGNGTLPDVLAAVNVAPGEAKQALDKLATFHPGQPQLVGEYWAGWFDQWGKPHAQTD 275
Query: 270 VEDLAFAVARFFQRGGTFQNYYMYHGGTNFDRTSGGPF----------ISTSYDYDAPLD 319
+ A + ++G + N YM+ GGT+F +G F +TSYDYDA LD
Sbjct: 276 AKQQADEIEWMLRQGHSI-NLYMFVGGTSFGFMNGANFQGGPSDHYSPQTTSYDYDAALD 334
Query: 320 EYGLIRQPKWGHLKDL 335
E G PK+ +D+
Sbjct: 335 EAGRP-MPKFVLFRDV 349
>gi|22760570|dbj|BAC11247.1| unnamed protein product [Homo sapiens]
Length = 636
Score = 160 bits (405), Expect = 2e-36, Method: Compositional matrix adjust.
Identities = 113/310 (36%), Positives = 154/310 (49%), Gaps = 17/310 (5%)
Query: 43 LISGSIHYPRSTPEMWPDLIQKSKDGGLDVIETYVFWNLHEPVRNQYNFEGRYDLVKFVK 102
+ GSIHY R E W D + K K GL+ + TYV WNLHEP R +++F G DL FV
Sbjct: 63 IFGGSIHYFRVPREYWRDRLLKMKACGLNTLTTYVPWNLHEPERGKFDFSGNLDLEAFVL 122
Query: 103 LVAEAGLYAHLRIGPYVCAEWNFGGFPLWLHFIPGIQFRTDNEPFKAEMQRFTAKIVDMM 162
+ AE GL+ LR GPY+C+E + GG P WL PG++ RT + F + + + M
Sbjct: 123 MAAEIGLWVILRPGPYICSEMDLGGLPSWLLQDPGMRLRTTYKGFTEAVDLYFDHL--MS 180
Query: 163 KQEKLYASQGGPIILSQIENEYG--NIDSAYGAAGKSYIKWAAGMALSLDT----GVPWV 216
+ L +GGPII Q+ENEYG N D AY K ++ + L L + G+
Sbjct: 181 RVVPLQYKRGGPIIAVQVENEYGSYNKDPAYMPYVKKALEDRGIVELLLTSDNKDGLSKG 240
Query: 217 MCQQSDAPDPIINTCNGFYCDQFTPN-SNNKPKMWTENWSGWFLSFGGAVPYRPVEDLAF 275
+ Q A + +T F N +PKM E W+GWF S+GG ++
Sbjct: 241 IVQGVLATINLQSTHELQLLTTFLFNVQGTQPKMVMEYWTGWFDSWGGPHNILDSSEVLK 300
Query: 276 AVARFFQRGGTFQNYYMYHGGTNFDRTSGGPFIS------TSYDYDAPLDEYGLIRQPKW 329
V+ G + N YM+HGGTNF +G TSYDYDA L E G K+
Sbjct: 301 TVSAIVDAGSSI-NLYMFHGGTNFGFMNGAMHFHDYKSDVTSYDYDAVLTEAG-DYTAKY 358
Query: 330 GHLKDLHKAI 339
L+D +I
Sbjct: 359 MKLRDFFGSI 368
>gi|325922356|ref|ZP_08184130.1| beta-galactosidase [Xanthomonas gardneri ATCC 19865]
gi|325547138|gb|EGD18218.1| beta-galactosidase [Xanthomonas gardneri ATCC 19865]
Length = 613
Score = 160 bits (405), Expect = 2e-36, Method: Compositional matrix adjust.
Identities = 128/363 (35%), Positives = 170/363 (46%), Gaps = 40/363 (11%)
Query: 1 MASKEILLLVLCWGFV-----VLATT----SFGANVTYDHRAVVIGGKRRVLISGSIHYP 51
M + LVL F + ATT SFG T V GK L+SG+IH+
Sbjct: 1 MLRTTLAPLVLALAFALPVTAIAATTDTWPSFGTQGT----QFVRDGKPYQLLSGAIHFQ 56
Query: 52 RSTPEMWPDLIQKSKDGGLDVIETYVFWNLHEPVRNQYNFEGRYDLVKFVKLVAEAGLYA 111
R E W D +QK++ GL+ +ETYVFWNL EP + Q++F G D+ FV+ A GL
Sbjct: 57 RIPREYWKDRLQKARALGLNTVETYVFWNLVEPQQGQFDFAGNNDVAAFVREAAAQGLNV 116
Query: 112 HLRIGPYVCAEWNFGGFPLWLHFIPGIQFRTDNEPFKAEMQRFTAKIVDMMKQEKLYASQ 171
LR GPY CAEW GG+P WL I+ R+ + F A Q + + + L
Sbjct: 117 ILRPGPYTCAEWEAGGYPAWLFGKDNIRVRSRDPRFLAASQAYLDAVSKQV--HPLLNHN 174
Query: 172 GGPIILSQIENEYGNIDS--AYGAAGKS-YIKWAAGMALSLDTGVPWVMCQQSDAPD--P 226
GGPII Q+ENEYG+ D AY A ++ Y+K AL L T M PD
Sbjct: 175 GGPIIAVQVENEYGSYDDDHAYMADNRAMYVKAGFDDAL-LFTSDGADMLANGTLPDTLA 233
Query: 227 IINTCNG---FYCDQFTPNSNNKPKMWTENWSGWFLSFGGAVPYRPVEDLAFAVARF--F 281
++N G ++ +P+M E W+GWF +G D F
Sbjct: 234 VVNFAPGEAKTAFEKLIKFRPEQPRMVGEYWAGWFDHWGKP---HASTDAKQQTEEFEWI 290
Query: 282 QRGGTFQNYYMYHGGTNFDRTSGGPF----------ISTSYDYDAPLDEYGLIRQPKWGH 331
R G N YM+ GGT+F +G F +TSYDYDA LDE G PK+
Sbjct: 291 LRQGHSANLYMFIGGTSFGFMNGANFQGNPSDHYAPQTTSYDYDAILDEAGRP-TPKFAL 349
Query: 332 LKD 334
++D
Sbjct: 350 MRD 352
>gi|403528012|ref|YP_006662899.1| beta-galactosidase GLB [Arthrobacter sp. Rue61a]
gi|403230439|gb|AFR29861.1| beta-galactosidase GLB [Arthrobacter sp. Rue61a]
Length = 598
Score = 160 bits (405), Expect = 3e-36, Method: Compositional matrix adjust.
Identities = 120/388 (30%), Positives = 174/388 (44%), Gaps = 55/388 (14%)
Query: 25 ANVTYDHRAVVIGGKRRVLISGSIHYPRSTPEMWPDLIQKSKDGGLDVIETYVFWNLHEP 84
A ++Y + G+ +++G+IHY R P++W D +++ K G + ++TYV WN H+P
Sbjct: 4 ALLSYHDAVLYRSGEPYRILAGAIHYFRVHPDLWQDRLRRLKAMGANTVDTYVAWNFHQP 63
Query: 85 VRNQY-NFEGRYDLVKFVKLVAEAGLYAHLRIGPYVCAEWNFGGFPLWLHFIPGIQFRTD 143
R++ +F G DL +F+ L AE GL +R GPY+CAEW+ GGFP WL IPGI R
Sbjct: 64 KRDEAPDFSGWQDLGRFMDLAAEEGLDVIVRPGPYICAEWDNGGFPSWLTGIPGIGLRCM 123
Query: 144 NEPFKAEMQRFTAKIVDMMKQEKLYASQGGPIILSQIENEYGNIDSAYGAAGKSYIKW-- 201
+ F A ++ + ++ ++ + S GGP++ QIENEYG+ + YI+W
Sbjct: 124 DPVFTAAIEEWFDHLLPIVASRQ--TSAGGPVVAVQIENEYGSYGDDH-----EYIRWNR 176
Query: 202 ---------------AAGMALSLDTGV---PWVMCQQSDAPDPIINTCNGFYCDQFTPNS 243
G LD G W D + T +
Sbjct: 177 RALEERGITELLFTADGGTDYFLDGGAVEGTWATATLGSRGDEAVAT--------WQRRR 228
Query: 244 NNKPKMWTENWSGWFLSFGGAVPYRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNFDRTS 303
+P E W GWF +G R ED A + GG+ YM HGGTNF S
Sbjct: 229 PGEPFFNVEFWGGWFDHWGEHHHGRDAEDAALEARKMLDLGGSL-CAYMAHGGTNFGLRS 287
Query: 304 G----GPFIS---TSYDYDAPLDEYGLIRQPKWGHLKDLHKA-----IKLCEAALVATDP 351
G G + TSYD DAP+ E G + K+ ++A + A L+A P
Sbjct: 288 GSNHDGTMLQPTVTSYDSDAPIAENGALTPKFHAFRKEFYRAQGVDDLPELPADLLADAP 347
Query: 352 TYP------SLGPNLEATVYKTGSGLCS 373
P S GP L V G + S
Sbjct: 348 VLPAQSLPLSPGPELLELVRDAGKPVSS 375
>gi|334347175|ref|XP_003341899.1| PREDICTED: beta-galactosidase-1-like protein [Monodelphis
domestica]
Length = 646
Score = 160 bits (405), Expect = 3e-36, Method: Compositional matrix adjust.
Identities = 111/300 (37%), Positives = 148/300 (49%), Gaps = 30/300 (10%)
Query: 44 ISGSIHYPRSTPEMWPDLIQKSKDGGLDVIETYVFWNLHEPVRNQYNFEGRYDLVKFVKL 103
+SGSIHY R +W D + K + GL+ ++ YV WN HEP YNF+G DLV F+K
Sbjct: 66 VSGSIHYSRVPSPLWSDRLHKMRMSGLNAVQVYVPWNYHEPQPGVYNFQGNRDLVAFLKA 125
Query: 104 VAEAGLYAHLRIGPYVCAEWNFGGFPLWLHFIPGIQFRTDNEPFKAEMQRFTAKIVDMMK 163
A L LR GPY+CAEW GG P WL P I RT + F A + + ++ M+
Sbjct: 126 AANEDLLVILRPGPYICAEWEMGGLPAWLLQNPEIVLRTSDPDFLAAVDSWFHVLMPMV- 184
Query: 164 QEKLYASQGGPIILSQIENEYGNIDSAYGAAGKSYIKWAAGMALSLDTGVPWVMCQQSDA 223
Q LY + GG II Q+ENEYG +Y A Y++ AG+ +L + +D
Sbjct: 185 QPWLYHN-GGNIISVQVENEYG----SYFACDFRYMRHLAGLFRALLGDQ--IFLFTTDG 237
Query: 224 PDPI-INTCNGFYCD-QFTPNSN-------------NKPKMWTENWSGWFLSFGGAVPYR 268
P T G Y F P+ N N P + +E ++GW +GG
Sbjct: 238 PRGFSCGTLQGLYSTVDFGPDDNMTEIFAMQQKYEPNGPLVNSEYYTGWLDYWGGNHSKW 297
Query: 269 PVEDLAFAVARFFQRGGTFQNYYMYHGGTNFDRTSGGPF------ISTSYDYDAPLDEYG 322
+ LA + + G N YM+HGGTNF SG F ++TSYDYDAPL E G
Sbjct: 298 DTKTLANGLQNMLELGANV-NMYMFHGGTNFGYWSGADFKKIYQPVTTSYDYDAPLSEAG 356
>gi|423226297|ref|ZP_17212763.1| hypothetical protein HMPREF1062_04949 [Bacteroides cellulosilyticus
CL02T12C19]
gi|392629725|gb|EIY23731.1| hypothetical protein HMPREF1062_04949 [Bacteroides cellulosilyticus
CL02T12C19]
Length = 1106
Score = 160 bits (405), Expect = 3e-36, Method: Compositional matrix adjust.
Identities = 108/322 (33%), Positives = 154/322 (47%), Gaps = 24/322 (7%)
Query: 35 VIGGKRRVLISGSIHYPRSTPEMWPDLIQKSKDGGLDVIETYVFWNLHEPVRNQYNFEGR 94
++ GK V+ + +HYPR W I+ K G++ + YVFWN HEP Y+F +
Sbjct: 358 LLNGKPFVVKAAELHYPRIPKPYWDQRIKLCKALGMNTVCLYVFWNSHEPQPGVYDFTEQ 417
Query: 95 YDLVKFVKLVAEAGLYAHLRIGPYVCAEWNFGGFPLWLHFIPGIQFRTDNEPFKAEMQRF 154
DL +F +L + +Y LR GPYVCAEW GG P WL ++ R + F + F
Sbjct: 418 NDLAEFCRLCQQNDMYVILRPGPYVCAEWEMGGLPWWLLKKKDVRLRESDPYFIERVALF 477
Query: 155 TAKIVDMMKQEKLYASQGGPIILSQIENEYGNI--DSAYGAAGKSYIKWAAGMALSLDTG 212
+ +K L + GGPII+ Q+ENEYG+ D Y + + ++ G ++L
Sbjct: 478 EEAVAKQVKD--LTIANGGPIIMVQVENEYGSYGEDKGYVSQIRDIVRANFGNGIAL-FQ 534
Query: 213 VPWVMCQQSDAPDPIINTCN---GFYCD-------QFTPNSNNKPKMWTENWSGWFLSFG 262
W + D +I T N G D Q PNS P M +E WSGWF +G
Sbjct: 535 CDWASNFTLNGLDDLIWTMNFGTGANVDQQFAKLKQLRPNS---PLMCSEFWSGWFDKWG 591
Query: 263 GAVPYRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNFDRTSGG--PFIS---TSYDYDAP 317
RP D+ + RG +F + YM HGGTN+ +G P + TSYDYDAP
Sbjct: 592 ANHETRPAADMIKGIDDMLSRGISF-SLYMTHGGTNWGHWAGANSPGFAPDVTSYDYDAP 650
Query: 318 LDEYGLIRQPKWGHLKDLHKAI 339
+ E G W + + K +
Sbjct: 651 ISESGQTTPKYWALREAMAKYM 672
>gi|336428330|ref|ZP_08608312.1| hypothetical protein HMPREF0994_04318 [Lachnospiraceae bacterium
3_1_57FAA_CT1]
gi|336005980|gb|EGN36021.1| hypothetical protein HMPREF0994_04318 [Lachnospiraceae bacterium
3_1_57FAA_CT1]
Length = 583
Score = 160 bits (405), Expect = 3e-36, Method: Compositional matrix adjust.
Identities = 108/309 (34%), Positives = 150/309 (48%), Gaps = 34/309 (11%)
Query: 36 IGGKRRVLISGSIHYPRSTPEMWPDLIQKSKDGGLDVIETYVFWNLHEPVRNQYNFEGRY 95
+ GK +ISG++HY R PE W D ++K K G + +ETYV WN+HEP + ++ FEG
Sbjct: 14 LDGKPFKIISGAVHYFRIVPEYWRDRLEKLKAMGANTVETYVPWNMHEPQKGKFVFEGML 73
Query: 96 DLVKFVKLVAEAGLYAHLRIGPYVCAEWNFGGFPLWLHFIPGIQFRTDNEPFKAEMQRFT 155
D+ +F+ L E GLY +R PY+CAEW FGG P WL G++ R EPF ++ +
Sbjct: 74 DISRFILLAQELGLYVIVRPSPYICAEWEFGGLPAWLLKEDGMRLRGCYEPFLEAVREYY 133
Query: 156 AKIVDMMKQEKLYASQGGPIILSQIENEYGNIDSAYGAAGKSYIKWAAGMALSLDTGVPW 215
+ + ++ L GGP+IL Q+ENEYG Y Y++ + L VP
Sbjct: 134 SVLFPILV--PLQIHHGGPVILMQVENEYG-----YYGDDTRYMETMKQLMLDNGAEVPL 186
Query: 216 VMCQQSDAPDPIINTCNGFYCDQFTPNSNNK---------------PKMWTENWSGWFLS 260
V SD P +C T N +K P M TE W GWF
Sbjct: 187 V---TSDGPMDESLSCGRLPGVLPTGNFGSKTEERFEVLKKYTEGGPLMCTEFWVGWFDH 243
Query: 261 FGGAVPYR-PVEDLAFAVARFFQRGGTFQNYYMYHGGTNFDRTSGGPFIS------TSYD 313
+G R +E+ + + + G N YM+ GGTNF +G + TSYD
Sbjct: 244 WGNGGHMRGNLEESTKDLDKMLEMGHV--NIYMFEGGTNFGFMNGSNYYDELTPDVTSYD 301
Query: 314 YDAPLDEYG 322
YDA L E G
Sbjct: 302 YDAVLTEAG 310
Score = 42.7 bits (99), Expect = 0.86, Method: Compositional matrix adjust.
Identities = 50/203 (24%), Positives = 71/203 (34%), Gaps = 52/203 (25%)
Query: 528 SNAKVTVDFPIALAPGKNTFDLLSLTVGLQNYGAFYEKTGAGITGPVQLKGSGNGTNIDL 587
A+V DF D+L +G N+G E GI G VQL G
Sbjct: 426 KEAEVKADFESG-----ALLDILVENMGRVNFGPLMESQRKGIAGCVQLNGH-------- 472
Query: 588 SSQQWTYQTGLKGEELNFPSGSSTQWDSKSTLPKLQPLVWYKTTFDAPAGSEPVAIDFTG 647
W T P + + D + P +YK F+ + +DF G
Sbjct: 473 MHYNWEMYT--------LPLNNLEKLDFSKGYEEGTP-GFYKFVFEVEEAGD-TFLDFGG 522
Query: 648 MGKGEAWVNGQSIGRYWPTYVSQNGGCTDSCNYRGAYSSNKCLKNCGKPSQSLYHVPRSW 707
GKG A++NG ++GR+W P + LY +P
Sbjct: 523 WGKGCAFLNGFNLGRFWEI----------------------------GPQKRLY-IPGPL 553
Query: 708 LKSSGNTLVLFEEIGGDPTKISF 730
LK N ++LFE G +IS
Sbjct: 554 LKEGRNEIILFETDGKTAPEISL 576
>gi|31543093|ref|NP_612351.2| beta-galactosidase-1-like protein 2 precursor [Homo sapiens]
gi|74728154|sp|Q8IW92.1|GLBL2_HUMAN RecName: Full=Beta-galactosidase-1-like protein 2; Flags: Precursor
gi|26251705|gb|AAH40641.1| Galactosidase, beta 1-like 2 [Homo sapiens]
gi|119588247|gb|EAW67843.1| hypothetical protein BC008326, isoform CRA_b [Homo sapiens]
gi|119588248|gb|EAW67844.1| hypothetical protein BC008326, isoform CRA_b [Homo sapiens]
Length = 636
Score = 160 bits (405), Expect = 3e-36, Method: Compositional matrix adjust.
Identities = 113/310 (36%), Positives = 154/310 (49%), Gaps = 17/310 (5%)
Query: 43 LISGSIHYPRSTPEMWPDLIQKSKDGGLDVIETYVFWNLHEPVRNQYNFEGRYDLVKFVK 102
+ GSIHY R E W D + K K GL+ + TYV WNLHEP R +++F G DL FV
Sbjct: 63 IFGGSIHYFRVPREYWRDRLLKMKACGLNTLTTYVPWNLHEPERGKFDFSGNLDLEAFVL 122
Query: 103 LVAEAGLYAHLRIGPYVCAEWNFGGFPLWLHFIPGIQFRTDNEPFKAEMQRFTAKIVDMM 162
+ AE GL+ LR GPY+C+E + GG P WL PG++ RT + F + + + M
Sbjct: 123 MAAEIGLWVILRPGPYICSEMDLGGLPSWLLQDPGMRLRTTYKGFTEAVDLYFDHL--MS 180
Query: 163 KQEKLYASQGGPIILSQIENEYG--NIDSAYGAAGKSYIKWAAGMALSLDT----GVPWV 216
+ L +GGPII Q+ENEYG N D AY K ++ + L L + G+
Sbjct: 181 RVVPLQYKRGGPIIAVQVENEYGSYNKDPAYMPYVKKALEDRGIVELLLTSDNKDGLSKG 240
Query: 217 MCQQSDAPDPIINTCNGFYCDQFTPN-SNNKPKMWTENWSGWFLSFGGAVPYRPVEDLAF 275
+ Q A + +T F N +PKM E W+GWF S+GG ++
Sbjct: 241 IVQGVLATINLQSTHELQLLTTFLFNVQGTQPKMVMEYWTGWFDSWGGPHNILDSSEVLK 300
Query: 276 AVARFFQRGGTFQNYYMYHGGTNFDRTSGGPFIS------TSYDYDAPLDEYGLIRQPKW 329
V+ G + N YM+HGGTNF +G TSYDYDA L E G K+
Sbjct: 301 TVSAIVDAGSSI-NLYMFHGGTNFGFMNGAMHFHDYKSDVTSYDYDAVLTEAG-DYTAKY 358
Query: 330 GHLKDLHKAI 339
L+D +I
Sbjct: 359 MKLRDFFGSI 368
>gi|423215069|ref|ZP_17201597.1| hypothetical protein HMPREF1074_03129 [Bacteroides xylanisolvens
CL03T12C04]
gi|392692332|gb|EIY85570.1| hypothetical protein HMPREF1074_03129 [Bacteroides xylanisolvens
CL03T12C04]
Length = 778
Score = 160 bits (405), Expect = 3e-36, Method: Compositional matrix adjust.
Identities = 122/362 (33%), Positives = 176/362 (48%), Gaps = 39/362 (10%)
Query: 1 MASKEILLLVLCWGFVVLATTSFGANVTYDH-----RAVVIGGKRRVLISGSIHYPRSTP 55
M +K I LLVL F V+ +S A T ++ GK V+ + +HY R
Sbjct: 1 MKNKIIALLVL---FTVILFSSAQAQTTAHKFEAGKNTFLLDGKPFVVKAAELHYTRIPQ 57
Query: 56 EMWPDLIQKSKDGGLDVIETYVFWNLHEPVRNQYNFEGRYDLVKFVKLVAEAGLYAHLRI 115
W I+ K G++ I Y+FWN+HE +++F G+ D+ F KL + G+Y +R
Sbjct: 58 AYWSHRIEMCKALGMNTICIYIFWNIHEQEEGKFDFAGQNDIAAFCKLAQQHGMYVIVRP 117
Query: 116 GPYVCAEWNFGGFPLWLHFIPGIQFRTDNEPFKAEMQRFTAKIVDMMKQ-EKLYASQGGP 174
GPYVCAEW GG P WL + RT + + M+R + ++ KQ L +GG
Sbjct: 118 GPYVCAEWEMGGLPWWLLKKKDVALRTLDPYY---MERVGIFMKEVGKQLAPLQVDKGGN 174
Query: 175 IILSQIENEYGN--IDSAYGAAGKSYIKWAAGMALSLDTGVPWVMCQQS-----DAPDPI 227
II+ Q+ENEYG+ D Y +A + ++ +G T VP C S +A D +
Sbjct: 175 IIMVQVENEYGSYGTDKPYVSAVRDLVR-ESGF-----TDVPLFQCDWSSNFTNNALDDL 228
Query: 228 INTCN---GFYCD-QFTPNSNNKPK---MWTENWSGWFLSFGGAVPYRPVEDLAFAVARF 280
I T N G D QF +P+ M +E WSGWF +G RP +D+ +
Sbjct: 229 IWTVNFGTGANIDQQFKKLKELRPETPLMCSEFWSGWFDHWGRKHETRPAKDMVQGIKDM 288
Query: 281 FQRGGTFQNYYMYHGGTNFDRTSGG-----PFISTSYDYDAPLDEYGLIRQPKWGHLKDL 335
R +F + YM HGGT F G + +SYDYDAP+ E G + K+ L+DL
Sbjct: 289 LDRNISF-SLYMTHGGTTFGHWGGANNPAYSAMCSSYDYDAPISEAGWTTE-KFFLLRDL 346
Query: 336 HK 337
K
Sbjct: 347 LK 348
Score = 44.7 bits (104), Expect = 0.21, Method: Compositional matrix adjust.
Identities = 60/250 (24%), Positives = 92/250 (36%), Gaps = 49/250 (19%)
Query: 500 TVLHVQSLGHALHAFINGKLVGSGYGSSSNAKVTVDFPIALAPGKNTFDLLSLTVGLQNY 559
TVL + + + +G L+ + T P AL G D+L +G N+
Sbjct: 420 TVLKITEVHDWAQIYADGTLLARL--DRRKGEFTTTLP-ALKKG-TQLDILVEAMGRVNF 475
Query: 560 GAFYEKTGAGITGPVQLKGSGNGTNIDLSSQQWTYQTGLKGEELNFPSGSSTQWDSKSTL 619
GIT V+L SGN + WT NFP S D K
Sbjct: 476 DKSIHDR-KGITEKVELV-SGNQAK---ELKNWTV--------YNFPVDYSFIKDKKYND 522
Query: 620 PKLQPLV--WYKTTFDAPAGSEPVAIDFTGMGKGEAWVNGQSIGRYWPTYVSQNGGCTDS 677
K+ P + +YK+TF + +D + GKG WVNG ++GR+W
Sbjct: 523 TKILPSMPAYYKSTFKLDKVGDTF-LDMSTWGKGMVWVNGHAMGRFWEI----------- 570
Query: 678 CNYRGAYSSNKCLKNCGKPSQSLYHVPRSWLKSSGNTLVLFEEIGGDPTKISFVTKQLGS 737
P Q+L+ +P WLK N +++ + G I + K +
Sbjct: 571 -----------------GPQQTLF-MPGCWLKEGENEILVLDLKGPAKASIKGLKKPILD 612
Query: 738 SLCSHVTDSH 747
L ++H
Sbjct: 613 VLREKAPETH 622
>gi|37182117|gb|AAQ88861.1| HYDRL-14 [Homo sapiens]
Length = 636
Score = 160 bits (405), Expect = 3e-36, Method: Compositional matrix adjust.
Identities = 113/310 (36%), Positives = 154/310 (49%), Gaps = 17/310 (5%)
Query: 43 LISGSIHYPRSTPEMWPDLIQKSKDGGLDVIETYVFWNLHEPVRNQYNFEGRYDLVKFVK 102
+ GSIHY R E W D + K K GL+ + TYV WNLHEP R +++F G DL FV
Sbjct: 63 IFGGSIHYFRVPREYWRDRLLKMKACGLNTLTTYVPWNLHEPERGKFDFSGNLDLEAFVL 122
Query: 103 LVAEAGLYAHLRIGPYVCAEWNFGGFPLWLHFIPGIQFRTDNEPFKAEMQRFTAKIVDMM 162
+ AE GL+ LR GPY+C+E + GG P WL PG++ RT + F + + + M
Sbjct: 123 MAAEIGLWVILRPGPYICSEMDLGGLPSWLLQDPGMRLRTTYKGFTEAVDLYFDHL--MS 180
Query: 163 KQEKLYASQGGPIILSQIENEYG--NIDSAYGAAGKSYIKWAAGMALSLDT----GVPWV 216
+ L +GGPII Q+ENEYG N D AY K ++ + L L + G+
Sbjct: 181 RVVPLQYKRGGPIIAVQVENEYGSYNKDPAYMPYVKKALEDRGIVELLLTSDNKDGLSKG 240
Query: 217 MCQQSDAPDPIINTCNGFYCDQFTPN-SNNKPKMWTENWSGWFLSFGGAVPYRPVEDLAF 275
+ Q A + +T F N +PKM E W+GWF S+GG ++
Sbjct: 241 IVQGVLATINLQSTHELQLLTTFLFNVQGTQPKMVMEYWTGWFDSWGGPHNILDSSEVLK 300
Query: 276 AVARFFQRGGTFQNYYMYHGGTNFDRTSGGPFIS------TSYDYDAPLDEYGLIRQPKW 329
V+ G + N YM+HGGTNF +G TSYDYDA L E G K+
Sbjct: 301 TVSAIVDAGSSI-NLYMFHGGTNFGFMNGAMHFHDYKSDVTSYDYDAVLTEAG-DYTAKY 358
Query: 330 GHLKDLHKAI 339
L+D +I
Sbjct: 359 MKLRDFFGSI 368
>gi|336404675|ref|ZP_08585368.1| hypothetical protein HMPREF0127_02681 [Bacteroides sp. 1_1_30]
gi|335941579|gb|EGN03432.1| hypothetical protein HMPREF0127_02681 [Bacteroides sp. 1_1_30]
Length = 778
Score = 160 bits (404), Expect = 3e-36, Method: Compositional matrix adjust.
Identities = 122/362 (33%), Positives = 176/362 (48%), Gaps = 39/362 (10%)
Query: 1 MASKEILLLVLCWGFVVLATTSFGANVTYDH-----RAVVIGGKRRVLISGSIHYPRSTP 55
M +K I LLVL F V+ +S A T ++ GK V+ + +HY R
Sbjct: 1 MKNKIIALLVL---FTVILFSSAQAQTTAHKFEAGKNTFLLDGKPFVVKAAELHYTRIPQ 57
Query: 56 EMWPDLIQKSKDGGLDVIETYVFWNLHEPVRNQYNFEGRYDLVKFVKLVAEAGLYAHLRI 115
W I+ K G++ I Y+FWN+HE +++F G+ D+ F KL + G+Y +R
Sbjct: 58 AYWSHRIEMCKALGMNTICIYIFWNIHEQEEGKFDFSGQNDIAAFCKLAQQHGMYVIVRP 117
Query: 116 GPYVCAEWNFGGFPLWLHFIPGIQFRTDNEPFKAEMQRFTAKIVDMMKQ-EKLYASQGGP 174
GPYVCAEW GG P WL + RT + + M+R + ++ KQ L +GG
Sbjct: 118 GPYVCAEWEMGGLPWWLLKKKDVALRTLDPYY---MERVGIFMKEVGKQLAPLQVDKGGN 174
Query: 175 IILSQIENEYGN--IDSAYGAAGKSYIKWAAGMALSLDTGVPWVMCQQS-----DAPDPI 227
II+ Q+ENEYG+ D Y +A + ++ +G T VP C S +A D +
Sbjct: 175 IIMVQVENEYGSYGTDKPYVSAVRDLVR-ESGF-----TDVPLFQCDWSSNFTNNALDDL 228
Query: 228 INTCN---GFYCD-QFTPNSNNKPK---MWTENWSGWFLSFGGAVPYRPVEDLAFAVARF 280
I T N G D QF +P+ M +E WSGWF +G RP +D+ +
Sbjct: 229 IWTVNFGTGANIDQQFKKLKELRPETPLMCSEFWSGWFDHWGRKHETRPAKDMVQGIKDM 288
Query: 281 FQRGGTFQNYYMYHGGTNFDRTSGG-----PFISTSYDYDAPLDEYGLIRQPKWGHLKDL 335
R +F + YM HGGT F G + +SYDYDAP+ E G + K+ L+DL
Sbjct: 289 LDRNISF-SLYMTHGGTTFGHWGGANNPAYSAMCSSYDYDAPISEAGWTTE-KFFLLRDL 346
Query: 336 HK 337
K
Sbjct: 347 LK 348
Score = 47.4 bits (111), Expect = 0.032, Method: Compositional matrix adjust.
Identities = 61/250 (24%), Positives = 93/250 (37%), Gaps = 49/250 (19%)
Query: 500 TVLHVQSLGHALHAFINGKLVGSGYGSSSNAKVTVDFPIALAPGKNTFDLLSLTVGLQNY 559
TVL + + + +GKL+ + T P AL G D+L +G N+
Sbjct: 420 TVLKITEVHDWAQIYADGKLLARL--DRRKGEFTTTLP-ALKKG-TQLDILVEAMGRVNF 475
Query: 560 GAFYEKTGAGITGPVQLKGSGNGTNIDLSSQQWTYQTGLKGEELNFPSGSSTQWDSKSTL 619
GIT V+L SGN + WT NFP S D K
Sbjct: 476 DKSIHDR-KGITEKVELV-SGNQAK---ELKNWTV--------YNFPVDYSFIKDKKYND 522
Query: 620 PKLQPLV--WYKTTFDAPAGSEPVAIDFTGMGKGEAWVNGQSIGRYWPTYVSQNGGCTDS 677
K+ P + +YK+TF + +D + GKG WVNG ++GR+W
Sbjct: 523 TKILPAMPAYYKSTFKLDKVGDTF-LDMSTWGKGMVWVNGHAMGRFWEI----------- 570
Query: 678 CNYRGAYSSNKCLKNCGKPSQSLYHVPRSWLKSSGNTLVLFEEIGGDPTKISFVTKQLGS 737
P Q+L+ +P WLK N +++ + G I + K +
Sbjct: 571 -----------------GPQQTLF-MPGCWLKEGENEILVLDLKGPAKASIKGLKKPILD 612
Query: 738 SLCSHVTDSH 747
L ++H
Sbjct: 613 VLREKAPETH 622
>gi|422877900|ref|ZP_16924370.1| beta-galactosidase [Streptococcus sanguinis SK1056]
gi|332358593|gb|EGJ36417.1| beta-galactosidase [Streptococcus sanguinis SK1056]
Length = 592
Score = 160 bits (404), Expect = 3e-36, Method: Compositional matrix adjust.
Identities = 108/315 (34%), Positives = 154/315 (48%), Gaps = 40/315 (12%)
Query: 36 IGGKRRVLISGSIHYPRSTPEMWPDLIQKSKDGGLDVIETYVFWNLHEPVRNQYNFEGRY 95
+ GK ++SG+I Y R P+ W D + K G + +ETY+ W LHEP Q+ EG
Sbjct: 12 LDGKPFKILSGAIQYFRLHPDQWRDTLYNLKALGFNTVETYIPWALHEPQEGQFQAEGML 71
Query: 96 DLVKFVKLVAEAGLYAHLRIGPYVCAEWNFGGFPLWLHFIPGIQFRTDNEPFKAEMQRFT 155
D + KLV E GLY +R PY+CAE++FGG P WL P ++ R ++ F ++ F
Sbjct: 72 DFEAYFKLVKEMGLYLIVRPTPYICAEFDFGGLPAWLLRYPSMRLRVNHPLFLEKVSHFY 131
Query: 156 AKIVDMMKQEKLYASQGGPIILSQIENEYGNIDSAYGAAGKSYIKWAAGMALSLDTGVP- 214
+ + + + QGGPI++ Q+ENEYG+ A K+Y++ A M VP
Sbjct: 132 DWLFPKLLPYQ--SDQGGPILMMQVENEYGSY-----AEDKAYMRSIAQMMKVRGVTVPL 184
Query: 215 ------WVMCQQSDA--PDPIINTCNGFYCDQFTPNSNNK-----------PKMWTENWS 255
W+ +S D I T N + Q N++N P M TE W
Sbjct: 185 FTSDGTWIEALESGTLIEDDIFVTGN--FGSQPKENTDNLRAFMERYGKKWPLMCTEFWD 242
Query: 256 GWFLSFGGAVPYRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNF--------DRTSGGPF 307
GWF + + +R EDLA V Q G N ++ GGTNF +T P
Sbjct: 243 GWFSRWSEEIVWREAEDLAQDVKEMLQLGSM--NLFLLRGGTNFGFISGCSARKTKDLPQ 300
Query: 308 ISTSYDYDAPLDEYG 322
I TSYD+DAP+ E+G
Sbjct: 301 I-TSYDFDAPITEWG 314
>gi|328956117|ref|YP_004373450.1| beta-galactosidase [Coriobacterium glomerans PW2]
gi|328456441|gb|AEB07635.1| Beta-galactosidase [Coriobacterium glomerans PW2]
Length = 597
Score = 160 bits (404), Expect = 3e-36, Method: Compositional matrix adjust.
Identities = 102/312 (32%), Positives = 161/312 (51%), Gaps = 34/312 (10%)
Query: 36 IGGKRRVLISGSIHYPRSTPEMWPDLIQKSKDGGLDVIETYVFWNLHEPVRNQYNFEGRY 95
+ G+ + SG+IHY R P+ W + K G + +ETY+ WN+HEP ++++
Sbjct: 12 MDGRPFQIRSGAIHYFRLHPDDWEHSLYNLKAMGFNTVETYIPWNMHEPHKDEFRITAET 71
Query: 96 DLVKFVKLVAEAGLYAHLRIGPYVCAEWNFGGFPLWLHFIPGIQFRTDNEPFKAEMQRFT 155
D +F+ L ++ GL+A +R P++CAEW FGG P WL G++ R+++ F + +
Sbjct: 72 DFERFLGLASDLGLWAIVRPSPFICAEWEFGGLPAWLLAERGMRIRSNDPRFLERLALYY 131
Query: 156 AKIVDMMKQEKLYASQGGPIILSQIENEYGNI--DSAYGAAGKSYIKWAAGMALSLDTGV 213
++ + + ++ ++G II+ QIENEYG+ DS Y + + + G+ + L T
Sbjct: 132 DMLMPHLAKHQI--TRGANIIMMQIENEYGSYCEDSDYMRSVRDLMV-ERGIDVKLCTSD 188
Query: 214 -PWVMCQQSDA--PDPIINTCN-------------GFYCDQFTPNSNNKPKMWTENWSGW 257
PW CQ++ + D ++ T N GF+ + + P M E W+GW
Sbjct: 189 GPWRACQRAGSLIEDNVLATGNFGSHATENFAALKGFHKE----HGKTWPLMCMEFWAGW 244
Query: 258 FLSFGGAVPYRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNFD-------RTSGGPFIST 310
F +G +V R E+LA +V + G N YM+HGGTNF R T
Sbjct: 245 FNRWGESVVRRDPEELARSVREALREGSI--NLYMFHGGTNFGFMNGCSARHDHDLHQIT 302
Query: 311 SYDYDAPLDEYG 322
SYDYDAPLDE G
Sbjct: 303 SYDYDAPLDEAG 314
Score = 53.5 bits (127), Expect = 5e-04, Method: Compositional matrix adjust.
Identities = 58/219 (26%), Positives = 83/219 (37%), Gaps = 52/219 (23%)
Query: 514 FINGKLVGSGYGSSSNAKVTVDFPIALAPGKNTFDLLSLTVGLQNYGAFYEKTGAGITGP 573
F+NG+LV + Y + D A PG N D+L +G NYG +
Sbjct: 415 FLNGRLVATQY----QEDIGEDILAAPKPGINQLDILVENMGRVNYGH-------KLLAS 463
Query: 574 VQLKGSGNGTNIDLSSQQWTYQTGLKGEELNFPSGSSTQWD-SKSTLPKLQPLVWYKTTF 632
Q KG G +DL + TG E P S+ + D S+ P +
Sbjct: 464 TQHKGIRTGICVDLH-----FVTGF--EVFRLPLASADKVDFSRGWTPGAPAFHRFAAVV 516
Query: 633 DAPAGSEPVAIDFTGMGKGEAWVNGQSIGRYWPTYVSQNGGCTDSCNYRGAYSSNKCLKN 692
A +D TG GKG +VNG ++GR+W
Sbjct: 517 RDTALD--THLDLTGFGKGCVFVNGFNVGRFWEK-------------------------- 548
Query: 693 CGKPSQSLYHVPRSWLKSSGNTLVLFEEIG--GDPTKIS 729
P++SLY VP L+ N +++FE G D K+S
Sbjct: 549 --GPTRSLY-VPHGLLRVGSNDIIVFETEGIYSDELKLS 584
>gi|189463987|ref|ZP_03012772.1| hypothetical protein BACINT_00322 [Bacteroides intestinalis DSM
17393]
gi|189438560|gb|EDV07545.1| glycosyl hydrolase family 35 [Bacteroides intestinalis DSM 17393]
Length = 1106
Score = 160 bits (404), Expect = 3e-36, Method: Compositional matrix adjust.
Identities = 108/322 (33%), Positives = 154/322 (47%), Gaps = 24/322 (7%)
Query: 35 VIGGKRRVLISGSIHYPRSTPEMWPDLIQKSKDGGLDVIETYVFWNLHEPVRNQYNFEGR 94
++ GK V+ + +HYPR W I+ K G++ + YVFWN HEP Y+F +
Sbjct: 358 LLNGKPFVVKAAELHYPRIPKPYWDQRIKLCKALGMNTVCLYVFWNSHEPQPGVYDFTEQ 417
Query: 95 YDLVKFVKLVAEAGLYAHLRIGPYVCAEWNFGGFPLWLHFIPGIQFRTDNEPFKAEMQRF 154
DL +F +L + +Y LR GPYVCAEW GG P WL ++ R + F + F
Sbjct: 418 NDLAEFCRLCQQNDMYVILRPGPYVCAEWEMGGLPWWLLKKKDVRLRESDPYFIERVALF 477
Query: 155 TAKIVDMMKQEKLYASQGGPIILSQIENEYGNI--DSAYGAAGKSYIKWAAGMALSLDTG 212
+ +K L + GGPII+ Q+ENEYG+ D Y + + ++ G ++L
Sbjct: 478 EEAVAKQVKD--LTIANGGPIIMVQVENEYGSYGEDKGYVSQIRDIVRANFGNDIAL-FQ 534
Query: 213 VPWVMCQQSDAPDPIINTCN---GFYCD-------QFTPNSNNKPKMWTENWSGWFLSFG 262
W + D +I T N G D Q PNS P M +E WSGWF +G
Sbjct: 535 CDWASNFTLNGLDDLIWTMNFGTGANVDQQFAKLKQLRPNS---PLMCSEFWSGWFDKWG 591
Query: 263 GAVPYRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNFDRTSGG--PFIS---TSYDYDAP 317
RP D+ + RG +F + YM HGGTN+ +G P + TSYDYDAP
Sbjct: 592 ANHETRPAADMIKGIDDMLSRGISF-SLYMTHGGTNWGHWAGANSPGFAPDVTSYDYDAP 650
Query: 318 LDEYGLIRQPKWGHLKDLHKAI 339
+ E G W + + K +
Sbjct: 651 ISESGQTTPKYWALREAMAKYM 672
>gi|398787680|ref|ZP_10550020.1| beta-galactosidase [Streptomyces auratus AGR0001]
gi|396992782|gb|EJJ03876.1| beta-galactosidase [Streptomyces auratus AGR0001]
Length = 603
Score = 160 bits (404), Expect = 4e-36, Method: Compositional matrix adjust.
Identities = 116/358 (32%), Positives = 175/358 (48%), Gaps = 25/358 (6%)
Query: 16 VVLATTSFGANVTYDHRAVVIGGKRRVLISGSIHYPRSTPEMWPDLIQKSKDGGLDVIET 75
VLA +T + ++ GK ++SG+ HY R+ P+ W D + + + GL+ +ET
Sbjct: 16 TVLAQAEGPGGLTIRGKEFLLDGKPFRILSGAFHYFRTHPQDWRDRLMRMRAMGLNTVET 75
Query: 76 YVFWNLHEPVRNQYNFEGRYDLVKFVKLVAEAGLYAHLRIGPYVCAEWNFGGFPLWLHFI 135
YV WN H+P + +F G D+V FV+ E GL +R GPY+CAEW+FGG P WL
Sbjct: 76 YVAWNFHQPDEKEADFTGWRDVVAFVRTADEVGLKVIVRPGPYICAEWDFGGLPAWLLKD 135
Query: 136 PGIQFRTDNEPFKAEMQRFTAKIVDMMKQEKLYASQGGPIILSQIENEYGNI--DSAYGA 193
R + F+ + + A++ + + L A++GGPII Q+ENEYG+ D AY
Sbjct: 136 KDAPLRRSDPAFERAVDAWFAEL--LPRFVDLQATRGGPIIAMQVENEYGSYGDDHAYLE 193
Query: 194 AGKSYIKWAAGM--ALSLDTGVPWVMCQQSDAPDPIINTCNGFYCDQFTPNSN------N 245
+ ++ A G+ L G + PD +++T N F D P + +
Sbjct: 194 HLRDTMR-AQGIDGLLFCSNGATQEALKAGSLPD-LLSTVN-FGGDPTGPFAELRAFQPD 250
Query: 246 KPKMWTENWSGWFLSFGGAVPYRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNFDRTSGG 305
KP TE W GWF +G A V + + G + N+YM GGTNF ++G
Sbjct: 251 KPLFCTEFWDGWFDHWGERHRTTDPAQTAADVEKMLEAGASI-NFYMAVGGTNFGWSAGA 309
Query: 306 PF-------ISTSYDYDAPLDEYGLIRQPKWGHLKD-LHKAIKLCEAALVATDPTYPS 355
TSYDYD+P+ E G + + K+ ++D L K L L AT P+
Sbjct: 310 NLSGSGYQPTVTSYDYDSPISESGELTE-KFHKVRDVLGKYTTLPNTPLPATPHRMPA 366
>gi|406657850|ref|ZP_11065990.1| family 35 glycosyl hydrolase [Streptococcus iniae 9117]
gi|405578065|gb|EKB52179.1| family 35 glycosyl hydrolase [Streptococcus iniae 9117]
Length = 594
Score = 160 bits (404), Expect = 4e-36, Method: Compositional matrix adjust.
Identities = 108/306 (35%), Positives = 151/306 (49%), Gaps = 37/306 (12%)
Query: 43 LISGSIHYPRSTPEMWPDLIQKSKDGGLDVIETYVFWNLHEPVRNQYNFEGRYDLVKFVK 102
++SG+IHY R P W + K G + +ETYV WNLHEP R ++NFEG DL KF+
Sbjct: 19 ILSGAIHYFRLAPGSWYKSLYNLKALGFNTVETYVPWNLHEPQRGKFNFEGLADLEKFLD 78
Query: 103 LVAEAGLYAHLRIGPYVCAEWNFGGFPLWLHFIPGIQFRTDNEPFKAEMQRFTAKIVDMM 162
L E GLYA +R PY+CAEW FGG P WL ++ R+ + + A ++ + ++ +
Sbjct: 79 LAQEMGLYAIVRPTPYICAEWEFGGLPAWL-LKENVRVRSHDAKYLAFVKDYYQVLLPKL 137
Query: 163 KQEKLYASQGGPIILSQIENEYGNIDSAYGAAGKSYIKWAAGMALSLDTGV-------PW 215
+ ++ SQGG I++ Q+ENEYG +YG K Y+K M V PW
Sbjct: 138 VKRQI--SQGGNILMFQVENEYG----SYG-EDKQYLKQLMQMMREFGISVPLFTSDGPW 190
Query: 216 VMCQQSDAPDPIINTCNGFYCDQFTPNSNNK-----------PKMWTENWSGWFLSFGGA 264
Q+ + G + Q N +N P M E W GWF +
Sbjct: 191 QSALQAGSLIDEDVLVTGNFGSQSKANFSNLRAFLDAHDKKWPLMCMEFWVGWFNRWKEP 250
Query: 265 VPYRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNFDRTSGG--------PFISTSYDYDA 316
V R +++ A+ + G N YM+HGGTNF +G P + TSYDYDA
Sbjct: 251 VIRRDPKEMVDAIMEVLEEGSI--NLYMFHGGTNFGFMNGSSARLQEDLPQV-TSYDYDA 307
Query: 317 PLDEYG 322
LDE G
Sbjct: 308 ILDEAG 313
>gi|380512533|ref|ZP_09855940.1| beta-galactosidase [Xanthomonas sacchari NCPPB 4393]
Length = 616
Score = 160 bits (404), Expect = 4e-36, Method: Compositional matrix adjust.
Identities = 116/338 (34%), Positives = 164/338 (48%), Gaps = 30/338 (8%)
Query: 30 DHRAVVIGGKRRVLISGSIHYPRSTPEMWPDLIQKSKDGGLDVIETYVFWNLHEPVRNQY 89
DH + GK +ISG+IH+ R W D +QK++ GL+ +ETYVFWNL EP Q+
Sbjct: 39 DH--FIRDGKPYQVISGAIHFQRIPRAYWKDRLQKARAMGLNTVETYVFWNLVEPRPGQF 96
Query: 90 NFEGRYDLVKFVKLVAEAGLYAHLRIGPYVCAEWNFGGFPLWLHFIPGIQFRTDNEPFKA 149
+F G D+ FV A GL LR GPYVCAEW GG+P WL PG++ R+ + F A
Sbjct: 97 DFSGNNDIAAFVDEAAAQGLNVILRPGPYVCAEWEAGGYPAWLFAEPGMRVRSQDPRFLA 156
Query: 150 EMQRFTAKIVDMMKQEKLYASQGGPIILSQIENEYGNI--DSAYGAAGKS-YIKWAAGMA 206
Q + + +K GGPI+ Q+ENEYG+ D AY ++ +++ A
Sbjct: 157 ASQAYLDALAAQVKPR--LNGNGGPIVAVQVENEYGSYGDDHAYMRLNRAMFVQAGFDKA 214
Query: 207 LSLDTGVPWVMCQQSDAPD--PIINTCNGFYCDQFTPNSN---NKPKMWTENWSGWFLSF 261
L P V+ + PD ++N G + F + +P+M E W+GWF +
Sbjct: 215 LLFTADGPDVLANGT-LPDTLAVVNFAPGDAKNAFETLAKFRPGQPQMVGEYWAGWFDQW 273
Query: 262 GGAVPYRPVEDLAFAVARF--FQRGGTFQNYYMYHGGTNFDRTSGGPF----------IS 309
G D + F R G N YM+ GGT+F +G F +
Sbjct: 274 GEK---HAATDATKQASEFEWILRQGHSANIYMFVGGTSFGFMNGANFQKNPSDHYAPQT 330
Query: 310 TSYDYDAPLDEYGLIRQPKWGHLKD-LHKAIKLCEAAL 346
TSYDYDA LDE G PK+ +D + + + AL
Sbjct: 331 TSYDYDAVLDEAGRP-TPKFTLFRDAIQRVTGIAPPAL 367
Score = 50.4 bits (119), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 57/241 (23%), Positives = 88/241 (36%), Gaps = 53/241 (21%)
Query: 499 KTVLHVQSLGHALHAFINGKLVGSGYGSSSNAKVTVDFPIALAPGKNTFDLLSLTVGLQN 558
K L++ + +++ KL GS V VD P G +T D+L G N
Sbjct: 425 KGSLYLGDVRDYARVYVDRKLAGSAERRLQQVAVDVDIPA----GTHTLDVLVENTGRIN 480
Query: 559 YGAFYEKTGAGITGPVQLKGSGNGTNIDLSSQQWTYQTGLKGEELNFPSGSS-TQWDSKS 617
YGA AG+ PV L G TG + L S T W
Sbjct: 481 YGAHLPDGRAGLVDPVLLDGK--------------QLTGWQTFPLPMDDPSKLTGW---- 522
Query: 618 TLPKLQPLVWYKTTFDAPAGSEPVAIDFTGMGKGEAWVNGQSIGRYWPTYVSQNGGCTDS 677
T K+ +++ T ++ +D GKG AW NG ++GR+W
Sbjct: 523 TTAKIDGPAFHRGTLKIGTPAD-TFLDMQAFGKGFAWANGHNLGRHWKI----------- 570
Query: 678 CNYRGAYSSNKCLKNCGKPSQSLYHVPRSWLKSSGNTLVLFEEIGGDPTKISFVTKQLGS 737
P ++LY P W + GN++++F+ + VT Q+ S
Sbjct: 571 -----------------GPQRALY-FPAPWQRKGGNSVIVFDLDSTPDASVRGVTGQVWS 612
Query: 738 S 738
+
Sbjct: 613 T 613
>gi|242077941|ref|XP_002443739.1| hypothetical protein SORBIDRAFT_07g001163 [Sorghum bicolor]
gi|241940089|gb|EES13234.1| hypothetical protein SORBIDRAFT_07g001163 [Sorghum bicolor]
Length = 111
Score = 160 bits (404), Expect = 4e-36, Method: Composition-based stats.
Identities = 63/109 (57%), Positives = 86/109 (78%)
Query: 56 EMWPDLIQKSKDGGLDVIETYVFWNLHEPVRNQYNFEGRYDLVKFVKLVAEAGLYAHLRI 115
+MWP LI K+K+GGLDVI+TYVFWN+HEPV+ QYNFEGRYD V+F+K + GLY +LRI
Sbjct: 1 QMWPKLIAKAKEGGLDVIQTYVFWNVHEPVQGQYNFEGRYDFVRFIKEIQGQGLYVNLRI 60
Query: 116 GPYVCAEWNFGGFPLWLHFIPGIQFRTDNEPFKAEMQRFTAKIVDMMKQ 164
GP++ +EW +GGFP WLH +P I FR+DNEPFK ++ ++V +++
Sbjct: 61 GPFIESEWKYGGFPFWLHDVPNITFRSDNEPFKPSVRNMLGELVSLLEH 109
>gi|224536014|ref|ZP_03676553.1| hypothetical protein BACCELL_00878 [Bacteroides cellulosilyticus
DSM 14838]
gi|224522370|gb|EEF91475.1| hypothetical protein BACCELL_00878 [Bacteroides cellulosilyticus
DSM 14838]
Length = 1106
Score = 160 bits (404), Expect = 4e-36, Method: Compositional matrix adjust.
Identities = 108/322 (33%), Positives = 154/322 (47%), Gaps = 24/322 (7%)
Query: 35 VIGGKRRVLISGSIHYPRSTPEMWPDLIQKSKDGGLDVIETYVFWNLHEPVRNQYNFEGR 94
++ GK V+ + +HYPR W I+ K G++ + YVFWN HEP Y+F +
Sbjct: 358 LLNGKPFVVKAAELHYPRIPKPYWDQRIKLCKALGMNTVCLYVFWNSHEPQPGVYDFTEQ 417
Query: 95 YDLVKFVKLVAEAGLYAHLRIGPYVCAEWNFGGFPLWLHFIPGIQFRTDNEPFKAEMQRF 154
DL +F +L + +Y LR GPYVCAEW GG P WL ++ R + F + F
Sbjct: 418 NDLAEFCRLCQQNDMYVILRPGPYVCAEWEMGGLPWWLLKKKDVRLRESDPYFIERVALF 477
Query: 155 TAKIVDMMKQEKLYASQGGPIILSQIENEYGNI--DSAYGAAGKSYIKWAAGMALSLDTG 212
+ +K L + GGPII+ Q+ENEYG+ D Y + + ++ G ++L
Sbjct: 478 EEAVAKQVKN--LTIANGGPIIMVQVENEYGSYGEDKGYVSQIRDIVRANFGNDIAL-FQ 534
Query: 213 VPWVMCQQSDAPDPIINTCN---GFYCD-------QFTPNSNNKPKMWTENWSGWFLSFG 262
W + D +I T N G D Q PNS P M +E WSGWF +G
Sbjct: 535 CDWASNFTLNGLDDLIWTMNFGTGANVDQQFAKLKQLRPNS---PLMCSEFWSGWFDKWG 591
Query: 263 GAVPYRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNFDRTSGG--PFIS---TSYDYDAP 317
RP D+ + RG +F + YM HGGTN+ +G P + TSYDYDAP
Sbjct: 592 ANHETRPAADMIKGIDDMLSRGISF-SLYMTHGGTNWGHWAGANSPGFAPDVTSYDYDAP 650
Query: 318 LDEYGLIRQPKWGHLKDLHKAI 339
+ E G W + + K +
Sbjct: 651 ISESGQTTPKYWALREAMAKYM 672
>gi|332376142|gb|AEE63211.1| unknown [Dendroctonus ponderosae]
Length = 659
Score = 159 bits (403), Expect = 4e-36, Method: Compositional matrix adjust.
Identities = 117/361 (32%), Positives = 169/361 (46%), Gaps = 42/361 (11%)
Query: 36 IGGKRRVLISGSIHYPRSTPEMWPDLIQKSKDGGLDVIETYVFWNLHEPVRNQYNFEGR- 94
+ K + SG++HY R P W D ++K + GL+ +ETYV WN+HEP ++F
Sbjct: 34 LNSKPLKIFSGALHYFRVHPLYWRDRLKKYRAAGLNCVETYVPWNIHEPEDGSFDFGEDP 93
Query: 95 --------YDLVKFVKLVAEAGLYAHLRIGPYVCAEWNFGGFPLWLHFIPGIQFRTDNEP 146
DLV+F+K+ E L+ LR GPY+CAEW FGG P WL ++ RT +
Sbjct: 94 DRNDFSLFLDLVQFLKIAQEEDLFVILRPGPYICAEWEFGGLPSWLLRHEDLKVRTSDSK 153
Query: 147 FKAEMQRFTAKIVDMMKQEKLYASQGGPIILSQIENEYGN-------IDSAYGAAGKSYI 199
F ++R+ K++ ++ E L ++GG II QIENEYGN ID AY A K I
Sbjct: 154 FLFYVERYFKKLLALV--EPLQFTKGGSIIAVQIENEYGNVKEDDKPIDIAYLEALKDII 211
Query: 200 KWAAGMALSLDTGVPWVMCQQSDAPDP-IINTCN-----GFYCDQFTPNSNNKPKMWTEN 253
K + L + P Q P ++ T N G + KP M E
Sbjct: 212 KKNGIVELLFTSDTP---TQGFHGALPGVLATANCDKDCGLELARLESYQPTKPLMVMEY 268
Query: 254 WSGWFLSFGGAVPYRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNF------------DR 301
W+GWF + + VE ++ +F N YM HGGTN+ D
Sbjct: 269 WTGWFDHYSEKHHIQTVEQFYANLSDILMGHASF-NLYMMHGGTNWGFLNGANICGATDD 327
Query: 302 TSGGPFISTSYDYDAPLDEYGLIRQPKWGHLKDLHKAI-KLCEAALVATDPTYPSLGPNL 360
SG ++SYDY APL E G K+ L+ L +LC + +PT+ + P +
Sbjct: 328 NSGFQPDTSSYDYHAPLAENGDYTD-KYVQLQQLTAEYNELCISQPAPPEPTFREIYPEI 386
Query: 361 E 361
+
Sbjct: 387 D 387
>gi|422852505|ref|ZP_16899175.1| beta-galactosidase [Streptococcus sanguinis SK150]
gi|325693831|gb|EGD35750.1| beta-galactosidase [Streptococcus sanguinis SK150]
Length = 592
Score = 159 bits (403), Expect = 4e-36, Method: Compositional matrix adjust.
Identities = 108/315 (34%), Positives = 153/315 (48%), Gaps = 40/315 (12%)
Query: 36 IGGKRRVLISGSIHYPRSTPEMWPDLIQKSKDGGLDVIETYVFWNLHEPVRNQYNFEGRY 95
+ GK ++SG+I Y R P+ W D + K G + +ETY+ W LHEP Q+ EG
Sbjct: 12 LDGKPFKILSGAIQYFRLHPDQWRDTLYNLKALGFNTVETYIPWALHEPQEGQFQAEGML 71
Query: 96 DLVKFVKLVAEAGLYAHLRIGPYVCAEWNFGGFPLWLHFIPGIQFRTDNEPFKAEMQRFT 155
D + KLV E GLY +R PY+CAE++FGG P WL P ++ R ++ F ++ F
Sbjct: 72 DFEAYFKLVEEMGLYLIVRPTPYICAEFDFGGLPAWLLRYPSMRLRVNHPLFLEKVSHFY 131
Query: 156 AKIVDMMKQEKLYASQGGPIILSQIENEYGNIDSAYGAAGKSYIKWAAGMALSLDTGVP- 214
+ + + + QGGPI++ Q+ENEYG+ A K+Y++ A M VP
Sbjct: 132 DWLFPKLLPYQ--SDQGGPILMMQVENEYGSY-----AEDKAYMRSIAQMMKVRGVTVPL 184
Query: 215 ------WVMCQQSDA--PDPIINTCNGFYCDQFTPNSNNK-----------PKMWTENWS 255
W+ +S D I T N + Q N++N P M TE W
Sbjct: 185 FTSDGTWIEALESGTLIEDDIFVTGN--FGSQPKENTDNLRAFMECYGKKWPLMCTEFWD 242
Query: 256 GWFLSFGGAVPYRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNF--------DRTSGGPF 307
GWF + + R EDLA V Q G N ++ GGTNF +T P
Sbjct: 243 GWFSRWSEEIVRREAEDLAQGVKEMLQLGSM--NLFLLRGGTNFGFISGCSARKTKDLPQ 300
Query: 308 ISTSYDYDAPLDEYG 322
I TSYD+DAP+ E+G
Sbjct: 301 I-TSYDFDAPITEWG 314
>gi|395846590|ref|XP_003795986.1| PREDICTED: beta-galactosidase-1-like protein 3 [Otolemur garnettii]
Length = 681
Score = 159 bits (403), Expect = 5e-36, Method: Compositional matrix adjust.
Identities = 131/455 (28%), Positives = 202/455 (44%), Gaps = 42/455 (9%)
Query: 36 IGGKRRVLISGSIHYPRSTPEMWPDLIQKSKDGGLDVIETYVFWNLHEPVRNQYNFEGRY 95
+ G + ++ GSIHY R E W D + K K G + + TYV WNLHEP R +++F
Sbjct: 110 LEGHKFLIFGGSIHYFRVPREYWQDRLLKLKACGFNTVTTYVPWNLHEPQRGKFDFSENL 169
Query: 96 DLVKFVKLVAEAGLYAHLRIGPYVCAEWNFGGFPLWLHFIPGIQFRTDNEPFKAEMQRFT 155
DL FV L AE GL+ LR GPY+C+E + GG P WL P ++ RT + F + ++
Sbjct: 170 DLEAFVLLAAEIGLWVILRPGPYICSEIDLGGLPSWLLQDPELKLRTTSPGFLEAVDKYF 229
Query: 156 AKIVDMMKQEKLYASQGGPIILSQIENEYGNI--DSAYGA-AGKSYIKWAAGMALSLDTG 212
++ + L SQGGP+I Q+ENEYG D Y K+ ++ L G
Sbjct: 230 DHLIP--RVIPLQYSQGGPVIALQVENEYGAYAQDVKYMPYLHKTLLQRGIVELLLTSDG 287
Query: 213 VPWVMCQQSDAPDPIIN----TCNGFYCDQFTPNSNNKPKMWTENWSGWFLSFGGAVPYR 268
V+ +N N F Q KP + E W GWF +G +
Sbjct: 288 EKEVLKGHIKGVLATVNLKKLRKNAF--SQLYEVQRGKPLLIMEFWVGWFDRWGESHHIT 345
Query: 269 PVEDLAFAVARFFQRGGTFQNYYMYHGGTNFDRTSGGPF------ISTSYDYDAPLDEYG 322
++L + V++ + +F N YM+HGGTNF +G + + TSYDYDA L E G
Sbjct: 346 NADNLEYNVSKLIKHEISF-NLYMFHGGTNFGFMNGASYMGRHVSVVTSYDYDAVLTEAG 404
Query: 323 LIRQPKWGHLKDLHKAIKLCEAALVATDPTYPSLGPNLEATVYKTGSGLCSAFLANIGTN 382
+ K+ L+ L + + + + PT P++ P ++ ++Y + S + N
Sbjct: 405 DYTE-KYFKLRKLLENVSVTPLPSLP-KPTLPAVYPPVKPSLYLPLWDVLSYLNEPVKLN 462
Query: 383 SDVTVKF------NGNSYLLPAWSVSILP---------DCKNVVFNTAKINSVT------ 421
V ++ +G SY + I D V N I +
Sbjct: 463 QPVNMENLPINNGSGQSYGFVLYETRICSGGFLWAHAHDIAEVFLNETIIGFLNEAVRGL 522
Query: 422 LVPSFSR-QSLQVAADSSDAIGSGWSYINEPVGIS 455
+P F Q L++ ++ I W NE G++
Sbjct: 523 RIPQFRDCQLLRILVENQGRINYSWKMQNEQKGLT 557
>gi|58581392|ref|YP_200408.1| beta-galactosidase [Xanthomonas oryzae pv. oryzae KACC 10331]
gi|58425986|gb|AAW75023.1| beta-galactosidase [Xanthomonas oryzae pv. oryzae KACC 10331]
Length = 651
Score = 159 bits (403), Expect = 5e-36, Method: Compositional matrix adjust.
Identities = 122/344 (35%), Positives = 167/344 (48%), Gaps = 39/344 (11%)
Query: 7 LLLVLCWGFVVLATTS-------FGANVTYDHRAVVIGGKRRVLISGSIHYPRSTPEMWP 59
L+L L + V A + FG T V GK L+SG+IH+ R W
Sbjct: 47 LVLALAFALPVTAAAADTERWPDFGTQGT----QFVRDGKPYQLLSGAIHFQRIPRAYWK 102
Query: 60 DLIQKSKDGGLDVIETYVFWNLHEPVRNQYNFEGRYDLVKFVKLVAEAGLYAHLRIGPYV 119
D +QK++ GL+ +ETYVFWNL EP + Q++F G D+ FV+ A GL LR GPY
Sbjct: 103 DRLQKARALGLNTVETYVFWNLVEPQQGQFDFSGNNDVAAFVQEAAAQGLNVILRPGPYA 162
Query: 120 CAEWNFGGFPLWLHFIPGIQFRTDNEPFKAEMQRFTAKIVDMMKQEKLYASQGGPIILSQ 179
CAEW GG+P WL I+ R+ + F A Q + + + + L GGPII Q
Sbjct: 163 CAEWEAGGYPAWLFGQGNIRVRSRDPRFLAASQAYLDAVAKQV--QPLLNHNGGPIIAVQ 220
Query: 180 IENEYGNI--DSAYGAAGKS-YIKWAAGMALSLDTGVPWVMCQQSDAPD--PIINTCNG- 233
+ENEYG+ D AY A ++ Y+K AL L T M PD ++N G
Sbjct: 221 VENEYGSYADDHAYMADNRAMYVKAGFDKAL-LFTSDGADMLANGTLPDTLAVVNFAPGE 279
Query: 234 --FYCDQFTPNSNNKPKMWTENWSGWFLSFGGAVPYRPVEDLAFAVARFFQ---RGGTFQ 288
D+ ++P+M E W+GWF +G P+ + A A F+ R G
Sbjct: 280 AKSAFDKLIAFRPDQPRMVGEYWAGWFDHWGK--PHAATD--ATQQAEEFEWILRQGHSA 335
Query: 289 NYYMYHGGTNFDRTSGGPF----------ISTSYDYDAPLDEYG 322
N YM+ GGT+F +G F +TSYDYDA +DE G
Sbjct: 336 NLYMFIGGTSFGFMNGANFQNNPSDHYAPQTTSYDYDAIVDEAG 379
Score = 41.2 bits (95), Expect = 2.6, Method: Compositional matrix adjust.
Identities = 47/169 (27%), Positives = 69/169 (40%), Gaps = 28/169 (16%)
Query: 499 KTVLHVQSLGHALHAFINGKLVGSGYGSSSNAKVTVDFPIALAPGKNTFDLLSLTVGLQN 558
K L++ + H +++ VGS VD P G +T D+L G N
Sbjct: 460 KGPLYLGDVRDVAHVYLDQTPVGSVERRLQQVSTAVDIPA----GHHTLDVLVENSGRIN 515
Query: 559 YGAFYEKTGAGITGPVQLKGSGNGTNIDLSSQQWTYQTGLKGEELNFPSGSSTQ-WDSKS 617
YG AG+ PV L GN QQ TG + L + S + W K+
Sbjct: 516 YGPRMADGRAGLVDPVLL---GN--------QQ---VTGWQAFPLPMRAPDSIRGWTRKA 561
Query: 618 TLPKLQPLVWYKTT--FDAPAGSEPVAIDFTGMGKGEAWVNGQSIGRYW 664
+Q +++ T PA + +D GKG AW NG ++GR+W
Sbjct: 562 ----VQGPAFHRGTVRIGTPADTY---LDMRAFGKGFAWANGVNLGRHW 603
>gi|199599299|ref|ZP_03212698.1| glycosyl hydrolase, family 35 [Lactobacillus rhamnosus HN001]
gi|199589801|gb|EDY97908.1| glycosyl hydrolase, family 35 [Lactobacillus rhamnosus HN001]
Length = 593
Score = 159 bits (403), Expect = 5e-36, Method: Compositional matrix adjust.
Identities = 118/364 (32%), Positives = 172/364 (47%), Gaps = 44/364 (12%)
Query: 30 DHRAVVIGGKRRVLISGSIHYPRSTPEMWPDLIQKSKDGGLDVIETYVFWNLHEPVRNQY 89
DH ++ GK ++SG+IHY R P W + K G + +ETYV WNLHE ++
Sbjct: 7 DHE-FMLDGKPFKILSGAIHYFRVHPSDWYHSLYNLKALGFNTVETYVPWNLHEYREGEF 65
Query: 90 NFEGRYDLVKFVKLVAEAGLYAHLRIGPYVCAEWNFGGFPLWLHFIPGIQFRTDNEPFKA 149
+F G D+ +F+K E GLYA +R PY+CAEW FGGFP WL ++ RTD+ + A
Sbjct: 66 DFSGILDIERFLKTAEELGLYAIVRPSPYICAEWEFGGFPAWL-LTKKMRLRTDDPTYLA 124
Query: 150 EMQRFTAKIVDMMKQEKLYASQGGPIILSQIENEYGNIDSAYGAAGKSYIKWAAGMALSL 209
+ R+ ++ + ++ + GG +I+ Q+ENEYG +YG + Y+ A +
Sbjct: 125 AIDRYYTALMPHLVDHQV--THGGNVIMMQVENEYG----SYG-EDQDYLAVVAKLMQQH 177
Query: 210 DTGVPWVMCQQSDAPDP------------IINTCN-GFYCDQ--------FTPNSNNKPK 248
VP SD P P I+ T N G D+ + + P
Sbjct: 178 GVDVPLFT---SDGPWPATLNAGSMIDAGILATGNFGSAADKNFDRLAAFHQEHGRDWPL 234
Query: 249 MWTENWSGWFLSFGGAVPYRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNFDRTSGG--- 305
M E W GWF +G + R ++ A + +RG N YM+HGGTNF +G
Sbjct: 235 MCMEFWDGWFNRWGEPIIRRDPDETAEDLRAVIKRGSV--NLYMFHGGTNFGFMNGTSAR 292
Query: 306 -----PFISTSYDYDAPLDEYGLIRQPKWGHLKDLHKAIKLCEAALVATDPTYPSLGPNL 360
P + TSYDYDAPL+E G + K +H+ + + A PT L
Sbjct: 293 KDHDLPQV-TSYDYDAPLNEQGNPTPKYFAIQKMIHEELPEVQQAKPLVKPTMAPASHPL 351
Query: 361 EATV 364
A V
Sbjct: 352 TAKV 355
Score = 42.0 bits (97), Expect = 1.5, Method: Compositional matrix adjust.
Identities = 57/238 (23%), Positives = 89/238 (37%), Gaps = 58/238 (24%)
Query: 491 EPLL---EDGSKTVLHVQSLGHALHAFINGKLVGSGYGSSSNAKVTVDFPIALAPGKNTF 547
+PL+ + G+ L V + A+++ K + + Y + + D + G +
Sbjct: 391 QPLISGTDKGTPAKLRVIDARDRVQAYLDQKWLATQYQEA----IGDDILLPEVEGHHQL 446
Query: 548 DLLSLTVGLQNYGAFYEKTGAGITGPVQLKGSGNGTNIDLSSQQWTYQTGL---KGEELN 604
DLL + NYG+ I Q KG G +DL + Q L + L
Sbjct: 447 DLLVENMSRVNYGS-------KIEAITQFKGIRTGVMVDLHFIKGYQQYPLDLNRASRLT 499
Query: 605 FPSGSSTQWDSKSTLPKLQPLVWYKTTFDAPAGSEPVAIDFTGMGKGEAWVNGQSIGRYW 664
F G W + +YK TF A + +D G GKG VNG ++GR+W
Sbjct: 500 FTEG----WQPATP-------AFYKYTFGLTAPQD-TYLDCRGFGKGVMLVNGVNVGRFW 547
Query: 665 PTYVSQNGGCTDSCNYRGAYSSNKCLKNCGKPSQSLYHVPRSWLKSSGNTLVLFEEIG 722
+ G P+ SLY VP L + N +++FE G
Sbjct: 548 -----EKG-----------------------PTLSLY-VPAGLLHAGKNDVIVFETEG 576
>gi|295086466|emb|CBK67989.1| Beta-galactosidase [Bacteroides xylanisolvens XB1A]
Length = 778
Score = 159 bits (403), Expect = 5e-36, Method: Compositional matrix adjust.
Identities = 122/362 (33%), Positives = 176/362 (48%), Gaps = 39/362 (10%)
Query: 1 MASKEILLLVLCWGFVVLATTSFGANVTYDH-----RAVVIGGKRRVLISGSIHYPRSTP 55
M +K I LLVL F V+ +S A T ++ GK V+ + +HY R
Sbjct: 1 MKNKIIALLVL---FTVILFSSAQAQTTAHKFEAGKNTFLLDGKPFVVKAAELHYTRIPQ 57
Query: 56 EMWPDLIQKSKDGGLDVIETYVFWNLHEPVRNQYNFEGRYDLVKFVKLVAEAGLYAHLRI 115
W I+ K G++ I Y+FWN+HE +++F G+ D+ F KL + G+Y +R
Sbjct: 58 AYWSHRIEMCKALGMNTICIYIFWNIHEQEEGKFDFSGQNDIAAFCKLAQQHGMYVIVRP 117
Query: 116 GPYVCAEWNFGGFPLWLHFIPGIQFRTDNEPFKAEMQRFTAKIVDMMKQ-EKLYASQGGP 174
GPYVCAEW GG P WL + RT + + M+R + ++ KQ L +GG
Sbjct: 118 GPYVCAEWEMGGLPWWLLKKKDVALRTLDPYY---MERVGIFMKEVGKQLAPLQVDKGGN 174
Query: 175 IILSQIENEYGN--IDSAYGAAGKSYIKWAAGMALSLDTGVPWVMCQQS-----DAPDPI 227
II+ Q+ENEYG+ D Y +A + ++ +G T VP C S +A D +
Sbjct: 175 IIMVQVENEYGSYGTDKPYVSAVRDLVR-ESGF-----TDVPLFQCDWSSNFTNNALDDL 228
Query: 228 INTCN---GFYCD-QFTPNSNNKPK---MWTENWSGWFLSFGGAVPYRPVEDLAFAVARF 280
I T N G D QF +P+ M +E WSGWF +G RP +D+ +
Sbjct: 229 IWTVNFGTGANIDQQFKKLKELRPETPLMCSEFWSGWFDHWGRKHETRPAKDMVQGIKDM 288
Query: 281 FQRGGTFQNYYMYHGGTNFDRTSGG-----PFISTSYDYDAPLDEYGLIRQPKWGHLKDL 335
R +F + YM HGGT F G + +SYDYDAP+ E G + K+ L+DL
Sbjct: 289 LDRNISF-SLYMTHGGTTFGHWGGANNPAYSAMCSSYDYDAPISEAGWTTE-KFFLLRDL 346
Query: 336 HK 337
K
Sbjct: 347 LK 348
Score = 47.4 bits (111), Expect = 0.032, Method: Compositional matrix adjust.
Identities = 61/250 (24%), Positives = 92/250 (36%), Gaps = 49/250 (19%)
Query: 500 TVLHVQSLGHALHAFINGKLVGSGYGSSSNAKVTVDFPIALAPGKNTFDLLSLTVGLQNY 559
TVL + + + GKL+ + T P AL G D+L +G N+
Sbjct: 420 TVLKITEVHDWAQIYAGGKLLARL--DRRKGEFTTTLP-ALKKG-TQLDILVEAMGRVNF 475
Query: 560 GAFYEKTGAGITGPVQLKGSGNGTNIDLSSQQWTYQTGLKGEELNFPSGSSTQWDSKSTL 619
GIT V+L SGN + WT NFP S D K
Sbjct: 476 DKSIHDR-KGITEKVELV-SGNQAK---ELKNWTV--------YNFPVDYSFIKDKKYND 522
Query: 620 PKLQPLV--WYKTTFDAPAGSEPVAIDFTGMGKGEAWVNGQSIGRYWPTYVSQNGGCTDS 677
K+ P + +YK+TF + +D + GKG WVNG ++GR+W
Sbjct: 523 TKILPFMPAYYKSTFKLDKVGDTF-LDMSTWGKGMVWVNGHAMGRFWEI----------- 570
Query: 678 CNYRGAYSSNKCLKNCGKPSQSLYHVPRSWLKSSGNTLVLFEEIGGDPTKISFVTKQLGS 737
P Q+L+ +P WLK N +++ + G I + K +
Sbjct: 571 -----------------GPQQTLF-MPGCWLKEGENEILVLDLKGPAKASIKGLKKPILD 612
Query: 738 SLCSHVTDSH 747
L ++H
Sbjct: 613 MLREKAPETH 622
>gi|404372285|ref|ZP_10977584.1| hypothetical protein CSBG_00400 [Clostridium sp. 7_2_43FAA]
gi|226911573|gb|EEH96774.1| hypothetical protein CSBG_00400 [Clostridium sp. 7_2_43FAA]
Length = 593
Score = 159 bits (403), Expect = 5e-36, Method: Compositional matrix adjust.
Identities = 104/313 (33%), Positives = 153/313 (48%), Gaps = 36/313 (11%)
Query: 36 IGGKRRVLISGSIHYPRSTPEMWPDLIQKSKDGGLDVIETYVFWNLHEPVRNQYNFEGRY 95
I + ++SG++HY R P W D + K G + +ETY+ WN+HEP +++FEG
Sbjct: 12 IDDNKFKILSGAVHYFRIHPSQWGDTLFNLKALGFNTVETYIPWNIHEPYEGKFDFEGIK 71
Query: 96 DLVKFVKLVAEAGLYAHLRIGPYVCAEWNFGGFPLWLHFIPGIQFRTDNEPFKAEMQRFT 155
D+ KF+K+ + GLY LR PY+CAEW FGG P WL I+ R+ ++ F +++ +
Sbjct: 72 DIEKFIKISEKLGLYVILRPTPYICAEWEFGGLPAWLLKDKEIKLRSSDDNFIEKLRNYY 131
Query: 156 AKIVDMMKQEKLYASQGGPIILSQIENEYGNIDSAYGAAGKSYIKWAAGMALSLDTGVP- 214
++ + K ++GGP+++ Q+ENEYG +YG K Y++ A + VP
Sbjct: 132 NDLLPRLV--KYQVTKGGPVLMMQVENEYG----SYGNE-KEYLRIVASIMKENGVDVPL 184
Query: 215 ------WVMCQQ--SDAPDPIINTCN---------GFYCDQFTPNSNNKPKMWTENWSGW 257
W+ + S D I + N D N P M E W GW
Sbjct: 185 FTSDGTWIEALECGSLIEDDIFVSGNFGSKSKENCDMLKDFILKNGKEWPIMCMEYWDGW 244
Query: 258 FLSFGGAVPYRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNFDRTSGG--------PFIS 309
F +G + R DLA V + G N YM+ GGTNF +G P +
Sbjct: 245 FNRWGEDIIRRDSIDLAEDVKEMLKIGSI--NLYMFRGGTNFGFMNGCSARGNNDLPQV- 301
Query: 310 TSYDYDAPLDEYG 322
TSYDYDA L E+G
Sbjct: 302 TSYDYDAILTEWG 314
>gi|12852936|dbj|BAB29584.1| unnamed protein product [Mus musculus]
Length = 586
Score = 159 bits (403), Expect = 5e-36, Method: Compositional matrix adjust.
Identities = 105/301 (34%), Positives = 151/301 (50%), Gaps = 32/301 (10%)
Query: 42 VLISGSIHYPRSTPEMWPDLIQKSKDGGLDVIETYVFWNLHEPVRNQYNFEGRYDLVKFV 101
+++ GSIHY R E W D + K + G + + TY+ WNLHE R +++F DL +V
Sbjct: 1 MIVGGSIHYFRVPREYWKDRLLKLQACGFNTVTTYIPWNLHEQERGKFDFSEILDLEAYV 60
Query: 102 KLVAEAGLYAHLRIGPYVCAEWNFGGFPLWLHFIPGIQFRTDNEPFKAEMQRFTAKIVDM 161
L GL+ LR GPY+CAE + GG P WL P RT N+ F + ++ ++
Sbjct: 61 LLAKTIGLWVILRPGPYICAEVDLGGLPSWLLRNPVTDLRTTNKGFIEAVDKYFDHLIP- 119
Query: 162 MKQEKLYASQGGPIILSQIENEYGNI--DSAYGAAGKSYIKWAAGMALSLDTGVPWVMCQ 219
K L GGP+I Q+ENEYG+ D Y +Y+K A L G+ ++
Sbjct: 120 -KILPLQYRHGGPVIAVQVENEYGSFQKDRNY----MNYLKKAL-----LKRGIVELLLT 169
Query: 220 QSDAPDPIINTCNG----FYCDQFTPNS--------NNKPKMWTENWSGWFLSFGGAVPY 267
D I + NG + FT +S ++KP M E W+GW+ S+G
Sbjct: 170 SDDKDGIQIGSVNGALTTINMNSFTKDSFIKLHKMQSDKPIMIMEYWTGWYDSWGSKHIE 229
Query: 268 RPVEDLAFAVARFFQRGGTFQNYYMYHGGTNFDRTSGGPF------ISTSYDYDAPLDEY 321
+ E++ V +F G +F N YM+HGGTNF +GG + + TSYDYDA L E
Sbjct: 230 KSAEEIRHTVYKFISYGLSF-NMYMFHGGTNFGFINGGRYENHHISVVTSYDYDAVLSEA 288
Query: 322 G 322
G
Sbjct: 289 G 289
>gi|66767541|ref|YP_242303.1| beta-galactosidase [Xanthomonas campestris pv. campestris str.
8004]
gi|66572873|gb|AAY48283.1| beta-galactosidase [Xanthomonas campestris pv. campestris str.
8004]
Length = 613
Score = 159 bits (403), Expect = 5e-36, Method: Compositional matrix adjust.
Identities = 112/331 (33%), Positives = 161/331 (48%), Gaps = 47/331 (14%)
Query: 35 VIGGKRRVLISGSIHYPRSTPEMWPDLIQKSKDGGLDVIETYVFWNLHEPVRNQYNFEGR 94
V GK ++SG+IH+ R W D +QK++ GL+ +ETYVFWNL EP + Q++F
Sbjct: 40 VRDGKPYQVLSGAIHFQRIPRTYWKDRLQKARALGLNTVETYVFWNLVEPQQGQFDFNAN 99
Query: 95 YDLVKFVKLVAEAGLYAHLRIGPYVCAEWNFGGFPLWLHFIPGIQFRTDNEPFKAEMQRF 154
D+ FV+ A GL LR GPY CAEW GG+P WL I+ R+ + F A Q +
Sbjct: 100 NDVAAFVREAAAQGLNVILRPGPYACAEWEAGGYPAWLFGKDNIRIRSRDPRFLAASQSY 159
Query: 155 TAKIVDMMKQEKLYASQGGPIILSQIENEYGNIDSAYGAAGKSYIKWAAGMALSLDTGVP 214
+ ++ L GGPII Q+ENEYG+ D + +YI A A+ + G
Sbjct: 160 LDAVAQQVR--PLLNHNGGPIIAVQVENEYGSYDDDH-----AYI--ADNRAMFVKAGFD 210
Query: 215 WVMCQQSDAPDPIIN-TCNGFYC-------------DQFTPNSNNKPKMWTENWSGWFLS 260
+ SD D + N T G D+ ++P+M E W+GWF
Sbjct: 211 KALLFTSDGADMLANGTLPGTLAVVNFAPGEAKSAFDKLIKFQPDQPRMVGEYWAGWFDH 270
Query: 261 FGGAVPY------RPVEDLAFAVARFFQRGGTFQNYYMYHGGTNFDRTSGGPF------- 307
+G P+ + E+L + + R G N YM+ GGT+F +G F
Sbjct: 271 WG--TPHASTNAKQQTEELEWIL-----RQGHSANLYMFIGGTSFGFMNGANFQGNPSDH 323
Query: 308 ---ISTSYDYDAPLDEYGLIRQPKWGHLKDL 335
+TSYDYDA LDE G PK+ ++D+
Sbjct: 324 YAPQTTSYDYDAILDEAGR-PTPKFALMRDV 353
>gi|297483826|ref|XP_002693891.1| PREDICTED: galactosidase, beta 1-like 3 [Bos taurus]
gi|296479482|tpg|DAA21597.1| TPA: galactosidase, beta 1-like [Bos taurus]
Length = 899
Score = 159 bits (403), Expect = 5e-36, Method: Compositional matrix adjust.
Identities = 121/373 (32%), Positives = 175/373 (46%), Gaps = 22/373 (5%)
Query: 36 IGGKRRVLISGSIHYPRSTPEMWPDLIQKSKDGGLDVIETYVFWNLHEPVRNQYNFEGRY 95
+ G +++ GS+HY R W D + K + G + + TYV WNLHEP R ++F G
Sbjct: 323 LEGHEFLILGGSVHYFRVPRASWRDRLLKLRACGFNTVTTYVPWNLHEPERGTFDFSGNL 382
Query: 96 DLVKFVKLVAEAGLYAHLRIGPYVCAEWNFGGFPLWLHFIPGIQFRTDNEPFKAEMQRFT 155
DL F+ L E GL+ LR GPY+C+E + GG P WL P Q RT N F + ++
Sbjct: 383 DLEAFILLAEEVGLWVILRPGPYICSEMDLGGLPSWLLQDPTSQLRTTNRSFVNAVNKYF 442
Query: 156 AKIVDMMKQEKLYASQGGPIILSQIENEYGNI--DSAYGAAGKSYI--KWAAGMALSLDT 211
++ + L QGGPII Q+ENEYG D AY + + G+ L+ D+
Sbjct: 443 DHLIP--RVALLQYLQGGPIIAVQVENEYGFFYKDEAYMPYLLQALQQRGIGGLLLTADS 500
Query: 212 GVPWVMCQQSDAPDPIINTCNGFYCDQFT---PNSNNKPKMWTENWSGWFLSFGGAVPYR 268
VM IN GF D F +KP + E W GWF ++G
Sbjct: 501 -TEEVMRGHIKGVLASIN-MKGFKVDSFKHLYKLQRHKPILIMEFWVGWFDTWGIDHRVM 558
Query: 269 PVEDLAFAVARFFQRGGTFQNYYMYHGGTNFDRTSGGPF------ISTSYDYDAPLDEYG 322
V ++ +V+ F + G +F N YM+HGGTNF +G ++TSYDYDA L E G
Sbjct: 559 GVNEVEKSVSEFIRYGISF-NVYMFHGGTNFGFMNGATSFEKHRGVTTSYDYDAVLTEAG 617
Query: 323 LIRQPKWGHLKDLHKAIKL---CEAALVATDPTYPSLGPNLEATVYKTGSGLCSAFLANI 379
K+ L+ L ++I + YPSL + +++ L +N+
Sbjct: 618 DY-TAKYFMLRSLFESILVRPLPPVPSPTPKAVYPSLKLSHYLPLWEALPYLQRPVTSNV 676
Query: 380 GTNSDVTVKFNGN 392
N + NGN
Sbjct: 677 PINMENLPINNGN 689
>gi|384420175|ref|YP_005629535.1| beta-galactosidase [Xanthomonas oryzae pv. oryzicola BLS256]
gi|353463088|gb|AEQ97367.1| beta-galactosidase [Xanthomonas oryzae pv. oryzicola BLS256]
Length = 613
Score = 159 bits (402), Expect = 5e-36, Method: Compositional matrix adjust.
Identities = 122/344 (35%), Positives = 167/344 (48%), Gaps = 39/344 (11%)
Query: 7 LLLVLCWGFVVLATTS-------FGANVTYDHRAVVIGGKRRVLISGSIHYPRSTPEMWP 59
L+L L + V A + FG T V GK L+SG+IH+ R W
Sbjct: 9 LVLALTFALPVTAAAADTERWPDFGTQGT----QFVRDGKPYQLLSGAIHFQRIPRAYWK 64
Query: 60 DLIQKSKDGGLDVIETYVFWNLHEPVRNQYNFEGRYDLVKFVKLVAEAGLYAHLRIGPYV 119
D +QK++ GL+ +ETYVFWNL EP + Q++F G D+ FV+ A GL LR GPY
Sbjct: 65 DRLQKARALGLNTVETYVFWNLVEPQQGQFDFSGNNDVAAFVQEAAAQGLNVILRPGPYA 124
Query: 120 CAEWNFGGFPLWLHFIPGIQFRTDNEPFKAEMQRFTAKIVDMMKQEKLYASQGGPIILSQ 179
CAEW GG+P WL I+ R+ + F A Q + + + + L GGPII Q
Sbjct: 125 CAEWEAGGYPAWLFGQGNIRVRSRDPRFLAASQAYLDAVAKQV--QPLLNHNGGPIIAVQ 182
Query: 180 IENEYGNI--DSAYGAAGKS-YIKWAAGMALSLDTGVPWVMCQQSDAPD--PIINTCNG- 233
+ENEYG+ D AY A ++ Y+K AL L T M PD ++N G
Sbjct: 183 VENEYGSYADDHAYMADNRAMYVKAGFDKAL-LFTSDGAEMLANGTLPDTLAVVNFAPGE 241
Query: 234 --FYCDQFTPNSNNKPKMWTENWSGWFLSFGGAVPYRPVEDLAFAVARFFQ---RGGTFQ 288
D+ ++P+M E W+GWF +G P+ + A A F+ R G
Sbjct: 242 AKSAFDKLIAFRPDQPRMVGEYWAGWFDHWGK--PHAATD--ATQQAEEFEWILRQGHSA 297
Query: 289 NYYMYHGGTNFDRTSGGPF----------ISTSYDYDAPLDEYG 322
N YM+ GGT+F +G F +TSYDYDA +DE G
Sbjct: 298 NLYMFIGGTSFGFMNGANFQNNPSDHYAPQTTSYDYDAIVDEAG 341
Score = 42.7 bits (99), Expect = 0.82, Method: Compositional matrix adjust.
Identities = 48/169 (28%), Positives = 70/169 (41%), Gaps = 28/169 (16%)
Query: 499 KTVLHVQSLGHALHAFINGKLVGSGYGSSSNAKVTVDFPIALAPGKNTFDLLSLTVGLQN 558
K L++ + H +++ VGS TVD P G +T D+L G N
Sbjct: 422 KGPLYLGDVRDVAHVYLDQTPVGSVERRLQQVSTTVDIPA----GHHTLDVLVENSGRIN 477
Query: 559 YGAFYEKTGAGITGPVQLKGSGNGTNIDLSSQQWTYQTGLKGEELNFPSGSSTQ-WDSKS 617
YG AG+ PV L GN QQ T G + L + S + W K+
Sbjct: 478 YGTRMADGRAGLVDPVLL---GN--------QQLT---GWQAFPLPMRTPDSIRGWTRKA 523
Query: 618 TLPKLQPLVWYKTT--FDAPAGSEPVAIDFTGMGKGEAWVNGQSIGRYW 664
+Q +++ T PA + +D GKG AW NG ++GR+W
Sbjct: 524 ----VQGPAFHRGTVRIGTPADTY---LDMRAFGKGFAWANGVNLGRHW 565
>gi|289664883|ref|ZP_06486464.1| beta-galactosidase [Xanthomonas campestris pv. vasculorum NCPPB
702]
Length = 582
Score = 159 bits (402), Expect = 5e-36, Method: Compositional matrix adjust.
Identities = 116/321 (36%), Positives = 163/321 (50%), Gaps = 29/321 (9%)
Query: 35 VIGGKRRVLISGSIHYPRSTPEMWPDLIQKSKDGGLDVIETYVFWNLHEPVRNQYNFEGR 94
V GK L+SG++H+ R W D +QK++ GL+ +ETYVFWNL EP + Q++F G
Sbjct: 9 VRDGKPYQLLSGAVHFQRIPRAYWKDRLQKARALGLNTVETYVFWNLVEPQQGQFDFSGN 68
Query: 95 YDLVKFVKLVAEAGLYAHLRIGPYVCAEWNFGGFPLWLHFIPGIQFRTDNEPFKAEMQRF 154
D+ FV+ A GL LR GPY CAEW GG+P WL I+ R+ + F A Q +
Sbjct: 69 NDVAAFVREAAALGLNVILRPGPYACAEWEAGGYPAWLFGKGNIRVRSRDPRFLAASQAY 128
Query: 155 TAKIVDMMKQEKLYASQGGPIILSQIENEYGNI--DSAYGAAGKS-YIKWAAGMALSLDT 211
+ + + L GGPII Q+ENEYG+ D AY A ++ Y+K AL L T
Sbjct: 129 LDALAKQV--QPLLNHNGGPIIAVQVENEYGSYADDHAYMAENRAMYVKAGFDKAL-LFT 185
Query: 212 GVPWVMCQQSDAPD--PIINTCNG---FYCDQFTPNSNNKPKMWTENWSGWFLSFGGAVP 266
M PD ++N G D+ +++P+M E W+GWF +G P
Sbjct: 186 SDGADMLANGTLPDTLAVVNFAPGEAKSAFDKLIKFRSDQPRMVGEYWAGWFDHWGK--P 243
Query: 267 YRPVEDLAFAVARFFQ---RGGTFQNYYMYHGGTNFDRTSGGPF----------ISTSYD 313
+ + A A F+ R G N YM+ GGT+F +G + +TSYD
Sbjct: 244 HAATD--ARQQADEFEWILRQGHSANLYMFIGGTSFGFMNGANYQNNPSDHYAPQTTSYD 301
Query: 314 YDAPLDEYGLIRQPKWGHLKD 334
YDA LDE G PK+ ++D
Sbjct: 302 YDAILDEAGHP-TPKFALMRD 321
Score = 41.6 bits (96), Expect = 1.9, Method: Compositional matrix adjust.
Identities = 42/166 (25%), Positives = 68/166 (40%), Gaps = 22/166 (13%)
Query: 499 KTVLHVQSLGHALHAFINGKLVGSGYGSSSNAKVTVDFPIALAPGKNTFDLLSLTVGLQN 558
K L++ + +++ K VGS VD P G++T D+L G N
Sbjct: 391 KGPLYLGDVRDVARVYLDQKPVGSVERRLQQVSTNVDIPA----GQHTLDVLVENSGRIN 446
Query: 559 YGAFYEKTGAGITGPVQLKGSGNGTNIDLSSQQWTYQTGLKGEELNFPSGSSTQWDSKST 618
YG AG+ PV L N L+S Q + ++ + S W K+
Sbjct: 447 YGPRMADGRAGLIDPVLLD------NQQLTSWQ-AFPLPMRAPD------SIRGWTRKT- 492
Query: 619 LPKLQPLVWYKTTFDAPAGSEPVAIDFTGMGKGEAWVNGQSIGRYW 664
+Q +++ T ++ +D GKG AW NG ++GR+W
Sbjct: 493 ---VQGPAFHRGTLRIGTPTD-TYLDMRAFGKGFAWANGVNLGRHW 534
>gi|71731106|gb|EAO33173.1| Beta-galactosidase [Xylella fastidiosa subsp. sandyi Ann-1]
Length = 612
Score = 159 bits (402), Expect = 6e-36, Method: Compositional matrix adjust.
Identities = 112/316 (35%), Positives = 156/316 (49%), Gaps = 23/316 (7%)
Query: 38 GKRRVLISGSIHYPRSTPEMWPDLIQKSKDGGLDVIETYVFWNLHEPVRNQYNFEGRYDL 97
G+ LISG+IH+ R W D +QK++ GL+ +ETYVFWNL E Q++F G D+
Sbjct: 39 GRPYQLISGAIHFQRIPRAYWKDRLQKARAMGLNTVETYVFWNLVELREGQFDFTGNNDI 98
Query: 98 VKFVKLVAEAGLYAHLRIGPYVCAEWNFGGFPLWLHFIPGIQFRTDNEPFKAEMQRFTAK 157
FV+ A GL LR GPYVCAEW GGFP WL P ++ R+ + F QR+
Sbjct: 99 GAFVREAASQGLNVILRPGPYVCAEWEAGGFPAWLFADPTLRVRSQDPRFLDASQRYLEA 158
Query: 158 IVDMMKQEKLYASQGGPIILSQIENEYGNIDSAYG---AAGKSYIKWAAGMALSLDTGVP 214
+ ++ L GGPII Q+ENEYG+ +G A +IK G AL L T
Sbjct: 159 LGTQVR--PLLNGNGGPIIAVQVENEYGSYGDDHGYLQAVHALFIKAGLGGAL-LFTADG 215
Query: 215 WVMCQQSDAPDPI--INTCNGF---YCDQFTPNSNNKPKMWTENWSGWFLSFGGAVPYRP 269
M PD + +N G D+ +P++ E W+GWF +G
Sbjct: 216 AQMLGNGTLPDVLAAVNFAPGEAKQALDKLATFHPGQPQLVGEYWAGWFDQWGKPHAQTD 275
Query: 270 VEDLAFAVARFFQRGGTFQNYYMYHGGTNFDRTSGGPF----------ISTSYDYDAPLD 319
+ A + ++G + N YM+ GGT+F +G F +TSYDYDA LD
Sbjct: 276 AKQQADEIEWMLRQGHSI-NLYMFVGGTSFGFMNGANFQGGPGDHYSPQTTSYDYDAVLD 334
Query: 320 EYGLIRQPKWGHLKDL 335
E G PK+ +D+
Sbjct: 335 EAGRP-MPKFALFRDV 349
Score = 43.5 bits (101), Expect = 0.51, Method: Compositional matrix adjust.
Identities = 58/266 (21%), Positives = 87/266 (32%), Gaps = 51/266 (19%)
Query: 469 INTTADQSDYLWYSLSTNIKADEPLLEDGSKTVLHVQSLGHALHAFINGKLVGSGYGSSS 528
+ TTAD Y + L K L++ + H +++ VG
Sbjct: 389 VATTADPQPMERYRQAYGYILYRTTLHGPRKGRLYLGEVRDDAHVYVDRLFVGRAERRRQ 448
Query: 529 NAKVTVDFPIALAPGKNTFDLLSLTVGLQNYGAFYEKTGAGITGPVQLKGSGNGTNIDLS 588
V VD P G + D+L G NYG AG+ GPV L N ++
Sbjct: 449 QVWVEVDIP----SGTHCLDVLVENSGRVNYGPHLADGRAGLIGPVML----NHERVN-- 498
Query: 589 SQQWTYQTGLKGEELNFPSGSSTQWDSKSTLPKLQPLVWYKTTFDAPAGSEPVAIDFTGM 648
W E P + +T P P T F G +D
Sbjct: 499 --NW--------ETFLLPLQTPEAIHGWTTAPMQGPAFHRGTLFIRTPGD--TFLDMEAF 546
Query: 649 GKGEAWVNGQSIGRYWPTYVSQNGGCTDSCNYRGAYSSNKCLKNCGKPSQSLYHVPRSWL 708
KG W N +GRYW + G P ++LY P +W
Sbjct: 547 SKGVTWANSHMLGRYW---------------------------DIG-PQRALY-FPGTWQ 577
Query: 709 KSSGNTLVLFEEIGGDPTKISFVTKQ 734
+ NT+++F+ ++ V +Q
Sbjct: 578 RQGENTVLVFDVSDTAAAQVRGVQQQ 603
>gi|422861007|ref|ZP_16907651.1| beta-galactosidase [Streptococcus sanguinis SK330]
gi|327468658|gb|EGF14137.1| beta-galactosidase [Streptococcus sanguinis SK330]
Length = 592
Score = 159 bits (402), Expect = 6e-36, Method: Compositional matrix adjust.
Identities = 108/315 (34%), Positives = 153/315 (48%), Gaps = 40/315 (12%)
Query: 36 IGGKRRVLISGSIHYPRSTPEMWPDLIQKSKDGGLDVIETYVFWNLHEPVRNQYNFEGRY 95
+ GK ++SG+I Y R P+ W D + K G + +ETY+ W LHEP Q+ EG
Sbjct: 12 LDGKPFKILSGAIQYFRLHPDQWRDTLYNLKALGFNTVETYIPWALHEPQEGQFQAEGML 71
Query: 96 DLVKFVKLVAEAGLYAHLRIGPYVCAEWNFGGFPLWLHFIPGIQFRTDNEPFKAEMQRFT 155
D + KLV E GLY +R PY+CAE++FGG P WL P ++ R ++ F ++ F
Sbjct: 72 DFEAYFKLVEEMGLYLIVRPTPYICAEFDFGGLPAWLLRYPSMRLRVNHPLFLEKVSHFY 131
Query: 156 AKIVDMMKQEKLYASQGGPIILSQIENEYGNIDSAYGAAGKSYIKWAAGMALSLDTGVP- 214
+ + + + QGGPI++ Q+ENEYG+ A K+Y++ A M VP
Sbjct: 132 DWLFPKLLPYQ--SDQGGPILMMQVENEYGSY-----AEDKAYMRSIAQMMKVRGVSVPL 184
Query: 215 ------WVMCQQSDA--PDPIINTCNGFYCDQFTPNSNNK-----------PKMWTENWS 255
W+ +S D I T N + Q N++N P M TE W
Sbjct: 185 FTSDGTWIEALESGTLIEDDIFVTGN--FGSQPKENTDNLRAFMERYGKKWPLMCTEFWD 242
Query: 256 GWFLSFGGAVPYRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNF--------DRTSGGPF 307
GWF + + R EDLA V Q G N ++ GGTNF +T P
Sbjct: 243 GWFSRWSEEIVRREAEDLAQDVKEMLQLGSM--NLFLLRGGTNFGFISGCSARKTKDLPQ 300
Query: 308 ISTSYDYDAPLDEYG 322
I TSYD+DAP+ E+G
Sbjct: 301 I-TSYDFDAPITEWG 314
>gi|289670687|ref|ZP_06491762.1| beta-galactosidase [Xanthomonas campestris pv. musacearum NCPPB
4381]
Length = 612
Score = 159 bits (402), Expect = 6e-36, Method: Compositional matrix adjust.
Identities = 116/321 (36%), Positives = 163/321 (50%), Gaps = 29/321 (9%)
Query: 35 VIGGKRRVLISGSIHYPRSTPEMWPDLIQKSKDGGLDVIETYVFWNLHEPVRNQYNFEGR 94
V GK L+SG++H+ R W D +QK++ GL+ +ETYVFWNL EP + Q++F G
Sbjct: 39 VRDGKPYQLLSGAVHFQRIPRAYWKDRLQKARALGLNTVETYVFWNLVEPQQGQFDFSGN 98
Query: 95 YDLVKFVKLVAEAGLYAHLRIGPYVCAEWNFGGFPLWLHFIPGIQFRTDNEPFKAEMQRF 154
D+ FV+ A GL LR GPY CAEW GG+P WL I+ R+ + F A Q +
Sbjct: 99 NDVAAFVREAAALGLNVILRPGPYACAEWEAGGYPAWLFGKGNIRVRSRDPRFLAASQAY 158
Query: 155 TAKIVDMMKQEKLYASQGGPIILSQIENEYGNI--DSAYGAAGKS-YIKWAAGMALSLDT 211
+ + + L GGPII Q+ENEYG+ D AY A ++ Y+K AL L T
Sbjct: 159 LDALAKQV--QPLLNHNGGPIIAVQVENEYGSYADDHAYMAENRAMYVKAGFDKAL-LFT 215
Query: 212 GVPWVMCQQSDAPD--PIINTCNG---FYCDQFTPNSNNKPKMWTENWSGWFLSFGGAVP 266
M PD ++N G D+ +++P+M E W+GWF +G P
Sbjct: 216 SDGADMLANGTLPDTLAVVNFAPGEAKSAFDKLIKFRSDQPRMVGEYWAGWFDHWGK--P 273
Query: 267 YRPVEDLAFAVARFFQ---RGGTFQNYYMYHGGTNFDRTSGGPF----------ISTSYD 313
+ + A A F+ R G N YM+ GGT+F +G + +TSYD
Sbjct: 274 HAATD--ARQQADEFEWILRQGHSANLYMFIGGTSFGFMNGANYQNNPSDHYAPQTTSYD 331
Query: 314 YDAPLDEYGLIRQPKWGHLKD 334
YDA LDE G PK+ ++D
Sbjct: 332 YDAILDEAGHP-TPKFALMRD 351
Score = 41.6 bits (96), Expect = 1.9, Method: Compositional matrix adjust.
Identities = 42/166 (25%), Positives = 68/166 (40%), Gaps = 22/166 (13%)
Query: 499 KTVLHVQSLGHALHAFINGKLVGSGYGSSSNAKVTVDFPIALAPGKNTFDLLSLTVGLQN 558
K L++ + +++ K VGS VD P G++T D+L G N
Sbjct: 421 KGPLYLGDVRDVARVYLDQKPVGSVERRLQQVSTNVDIPA----GQHTLDVLVENSGRIN 476
Query: 559 YGAFYEKTGAGITGPVQLKGSGNGTNIDLSSQQWTYQTGLKGEELNFPSGSSTQWDSKST 618
YG AG+ PV L N L+S Q + ++ + S W K+
Sbjct: 477 YGPRMADGRAGLIDPVLLD------NQQLTSWQ-AFPLPMRAPD------SIRGWTRKT- 522
Query: 619 LPKLQPLVWYKTTFDAPAGSEPVAIDFTGMGKGEAWVNGQSIGRYW 664
+Q +++ T ++ +D GKG AW NG ++GR+W
Sbjct: 523 ---VQGPAFHRGTLRIGTPTD-TYLDMRAFGKGFAWANGVNLGRHW 564
>gi|294665218|ref|ZP_06730516.1| beta-galactosidase [Xanthomonas fuscans subsp. aurantifolii str.
ICPB 10535]
gi|292605006|gb|EFF48359.1| beta-galactosidase [Xanthomonas fuscans subsp. aurantifolii str.
ICPB 10535]
Length = 613
Score = 159 bits (402), Expect = 6e-36, Method: Compositional matrix adjust.
Identities = 125/360 (34%), Positives = 173/360 (48%), Gaps = 34/360 (9%)
Query: 1 MASKEILLLVLCWGFVV-----LATTSFGANVTYDHRAVVIGGKRRVLISGSIHYPRSTP 55
M + LVL F + A T N V GK L+SG+IH+ R
Sbjct: 1 MLRTTLAPLVLALAFALPITGAAADTERWPNFGTQGTQFVRDGKPYQLLSGAIHFQRIPR 60
Query: 56 EMWPDLIQKSKDGGLDVIETYVFWNLHEPVRNQYNFEGRYDLVKFVKLVAEAGLYAHLRI 115
W D +QK++ GL+ +ETYVFWNL EP + Q++F G D+ FV+ A GL LR
Sbjct: 61 AYWKDRLQKARALGLNTVETYVFWNLVEPQQGQFDFSGNNDVAAFVREAAAQGLNIILRP 120
Query: 116 GPYVCAEWNFGGFPLWLHFIPGIQFRTDNEPFKAEMQRFTAKIVDMMKQEKLYASQGGPI 175
GPY CAEW GG+P WL I+ R+ + F A Q + + + + + L GGPI
Sbjct: 121 GPYACAEWEAGGYPAWLFGKGNIRVRSRDPRFLAASQAYLDALANQV--QPLLNHNGGPI 178
Query: 176 ILSQIENEYGNI--DSAYGAAGKS-YIKWAAGMALSLDTGVPWVMCQQSDAPD--PIINT 230
I Q+ENEYG+ D AY A ++ Y+K AL L T M PD ++N
Sbjct: 179 IAVQVENEYGSYADDHAYMADNRAMYVKAGFDKAL-LFTSDGADMLANGTLPDTLAVVNF 237
Query: 231 CNG---FYCDQFTPNSNNKPKMWTENWSGWFLSFGGAVPYRPVEDLAFAVARFFQ---RG 284
G D+ ++P+M E W+GWF +G P+ + A A F+ R
Sbjct: 238 APGEAKSAFDKLIKFRPDQPRMVGEYWAGWFDHWGK--PHAATD--ARQQAEEFEWILRQ 293
Query: 285 GTFQNYYMYHGGTNFDRTSGGPF----------ISTSYDYDAPLDEYGLIRQPKWGHLKD 334
G + YM+ GGT+F +G F +TSYDYDA LDE G PK+ ++D
Sbjct: 294 GHSASLYMFIGGTSFGFMNGANFQNNPSDHYAPQTTSYDYDAILDEAGHP-TPKFALMRD 352
Score = 39.7 bits (91), Expect = 7.3, Method: Compositional matrix adjust.
Identities = 41/167 (24%), Positives = 69/167 (41%), Gaps = 24/167 (14%)
Query: 499 KTVLHVQSLGHALHAFINGKLVGSGYGSSSNAKVTVDFPIALAPGKNTFDLLSLTVGLQN 558
K L++ + +++ + VGS + V+ P G++T D+L G N
Sbjct: 422 KGPLYLGDVRDVARVYVDQRPVGSVERRLQQVSLDVEIPA----GQHTLDVLVENSGRIN 477
Query: 559 YGAFYEKTGAGITGPVQLKGSGNGTNIDLSSQQWTYQTGLKGEELNFPSGSSTQ-WDSKS 617
YG AG+ PV L +QQ T G + L + S + W K+
Sbjct: 478 YGPRMADGRAGLVDPVVL-----------DNQQLT---GWQAFPLPMRTPDSIRGWTRKA 523
Query: 618 TLPKLQPLVWYKTTFDAPAGSEPVAIDFTGMGKGEAWVNGQSIGRYW 664
+Q +++ T ++ +D GKG AW NG ++GR+W
Sbjct: 524 ----VQGPAFHRGTLRIGTPTD-TYLDMRAFGKGFAWANGVNLGRHW 565
>gi|123788298|sp|Q3UPY5.1|GLBL2_MOUSE RecName: Full=Beta-galactosidase-1-like protein 2; Flags: Precursor
gi|74224567|dbj|BAE25259.1| unnamed protein product [Mus musculus]
Length = 636
Score = 159 bits (402), Expect = 6e-36, Method: Compositional matrix adjust.
Identities = 109/304 (35%), Positives = 148/304 (48%), Gaps = 38/304 (12%)
Query: 43 LISGSIHYPRSTPEMWPDLIQKSKDGGLDVIETYVFWNLHEPVRNQYNFEGRYDLVKFVK 102
++ GSIHY R E W D + K K GL+ + TYV WNLHEP R +++F G DL F++
Sbjct: 63 ILGGSIHYFRVPREYWRDRLLKLKACGLNTLTTYVPWNLHEPERGKFDFSGNLDLEAFIQ 122
Query: 103 LVAEAGLYAHLRIGPYVCAEWNFGGFPLWLHFIPGIQFRTDNEPFKAEMQRFTAKIVDMM 162
L A+ GL+ LR GPY+C+E + GG P WL P ++ RT F ++ + + M
Sbjct: 123 LAAKIGLWVILRPGPYICSEIDLGGLPSWLLQDPDMKLRTTYHGFTKAVELYFDHL--MS 180
Query: 163 KQEKLYASQGGPIILSQIENEYG--NIDSAYGAAGKSYIKWAA----------------G 204
+ L GGPII Q+ENEYG N D AY YIK A G
Sbjct: 181 RVVPLQYKHGGPIIAVQVENEYGSYNKDRAY----MPYIKKALEDRGIIEMLLTSDNKDG 236
Query: 205 MALSLDTGVPWVMCQQSDAPDPIINTCNGFYCDQFTPNSNNKPKMWTENWSGWFLSFGGA 264
+ + GV + QS +NT +PKM E W+GWF S+GG+
Sbjct: 237 LEKGVVDGVLATINLQSQQELMALNTV-------LLSIQGIQPKMVMEYWTGWFDSWGGS 289
Query: 265 VPYRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNFDRTSGGPFIS------TSYDYDAPL 318
++ V+ + G + N YM+HGGTNF +G + TSYDYDA L
Sbjct: 290 HNILDSSEVLQTVSAIIKDGSSI-NLYMFHGGTNFGFINGAMHFNDYKADVTSYDYDAIL 348
Query: 319 DEYG 322
E G
Sbjct: 349 TEAG 352
>gi|358415935|ref|XP_600640.6| PREDICTED: uncharacterized protein LOC522360 [Bos taurus]
Length = 1360
Score = 159 bits (402), Expect = 6e-36, Method: Compositional matrix adjust.
Identities = 121/373 (32%), Positives = 175/373 (46%), Gaps = 22/373 (5%)
Query: 36 IGGKRRVLISGSIHYPRSTPEMWPDLIQKSKDGGLDVIETYVFWNLHEPVRNQYNFEGRY 95
+ G +++ GS+HY R W D + K + G + + TYV WNLHEP R ++F G
Sbjct: 323 LEGHEFLILGGSVHYFRVPRASWRDRLLKLRACGFNTVTTYVPWNLHEPERGTFDFSGNL 382
Query: 96 DLVKFVKLVAEAGLYAHLRIGPYVCAEWNFGGFPLWLHFIPGIQFRTDNEPFKAEMQRFT 155
DL F+ L E GL+ LR GPY+C+E + GG P WL P Q RT N F + ++
Sbjct: 383 DLEAFILLAEEVGLWVILRPGPYICSEMDLGGLPSWLLQDPTSQLRTTNRSFVNAVNKYF 442
Query: 156 AKIVDMMKQEKLYASQGGPIILSQIENEYGNI--DSAYGAAGKSYI--KWAAGMALSLDT 211
++ + L QGGPII Q+ENEYG D AY + + G+ L+ D+
Sbjct: 443 DHLIP--RVALLQYLQGGPIIAVQVENEYGFFYKDEAYMPYLLQALQQRGIGGLLLTADS 500
Query: 212 GVPWVMCQQSDAPDPIINTCNGFYCDQFT---PNSNNKPKMWTENWSGWFLSFGGAVPYR 268
VM IN GF D F +KP + E W GWF ++G
Sbjct: 501 -TEEVMRGHIKGVLASIN-MKGFKVDSFKHLYKLQRHKPILIMEFWVGWFDTWGIDHRVM 558
Query: 269 PVEDLAFAVARFFQRGGTFQNYYMYHGGTNFDRTSGGPF------ISTSYDYDAPLDEYG 322
V ++ +V+ F + G +F N YM+HGGTNF +G ++TSYDYDA L E G
Sbjct: 559 GVNEVEKSVSEFIRYGISF-NVYMFHGGTNFGFMNGATSFEKHRGVTTSYDYDAVLTEAG 617
Query: 323 LIRQPKWGHLKDLHKAIKL---CEAALVATDPTYPSLGPNLEATVYKTGSGLCSAFLANI 379
K+ L+ L ++I + YPSL + +++ L +N+
Sbjct: 618 DY-TAKYFMLRSLFESILVRPLPPVPSPTPKAVYPSLKLSHYLPLWEALPYLQRPVTSNV 676
Query: 380 GTNSDVTVKFNGN 392
N + NGN
Sbjct: 677 PINMENLPINNGN 689
>gi|422871792|ref|ZP_16918285.1| beta-galactosidase [Streptococcus sanguinis SK1087]
gi|328945306|gb|EGG39459.1| beta-galactosidase [Streptococcus sanguinis SK1087]
Length = 592
Score = 159 bits (402), Expect = 6e-36, Method: Compositional matrix adjust.
Identities = 108/315 (34%), Positives = 153/315 (48%), Gaps = 40/315 (12%)
Query: 36 IGGKRRVLISGSIHYPRSTPEMWPDLIQKSKDGGLDVIETYVFWNLHEPVRNQYNFEGRY 95
+ GK ++SG+I Y R P+ W D + K G + +ETY+ W LHEP Q+ EG
Sbjct: 12 LDGKPFKILSGAIQYFRLHPDQWRDTLYNLKALGFNTVETYIPWALHEPQEGQFQAEGML 71
Query: 96 DLVKFVKLVAEAGLYAHLRIGPYVCAEWNFGGFPLWLHFIPGIQFRTDNEPFKAEMQRFT 155
D + KLV E GLY +R PY+CAE++FGG P WL P ++ R ++ F ++ F
Sbjct: 72 DFEAYFKLVEEMGLYLIVRPTPYICAEFDFGGLPAWLLRYPSMRLRVNHPLFLEKVSHFY 131
Query: 156 AKIVDMMKQEKLYASQGGPIILSQIENEYGNIDSAYGAAGKSYIKWAAGMALSLDTGVP- 214
+ + + + QGGPI++ Q+ENEYG+ A K+Y++ A M VP
Sbjct: 132 DWLFPKLLPYQ--SDQGGPILMMQVENEYGSY-----AEDKAYMRSIAQMMKVRGVTVPL 184
Query: 215 ------WVMCQQSDA--PDPIINTCNGFYCDQFTPNSNNK-----------PKMWTENWS 255
W+ +S D I T N + Q N++N P M TE W
Sbjct: 185 FTSDGTWIEALESGTLIEDDIFVTGN--FGSQPKENTDNLRAFMERYGKKWPLMCTEFWD 242
Query: 256 GWFLSFGGAVPYRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNF--------DRTSGGPF 307
GWF + + R EDLA V Q G N ++ GGTNF +T P
Sbjct: 243 GWFSRWSEEIVRREAEDLAQDVKEMLQLGSM--NLFLLRGGTNFGFISGCSARKTKDLPQ 300
Query: 308 ISTSYDYDAPLDEYG 322
I TSYD+DAP+ E+G
Sbjct: 301 I-TSYDFDAPITEWG 314
>gi|345003968|ref|YP_004806822.1| glycoside hydrolase family protein [Streptomyces sp. SirexAA-E]
gi|344319594|gb|AEN14282.1| glycoside hydrolase family 35 [Streptomyces sp. SirexAA-E]
Length = 602
Score = 159 bits (402), Expect = 6e-36, Method: Compositional matrix adjust.
Identities = 105/347 (30%), Positives = 167/347 (48%), Gaps = 29/347 (8%)
Query: 25 ANVTYDHRAVVIGGKRRVLISGSIHYPRSTPEMWPDLIQKSKDGGLDVIETYVFWNLHEP 84
A +TY ++ G+ +++G++HY R P+ W D +++ GL+ ++TY+ WN HE
Sbjct: 7 ALLTYSEGTLLRAGRPHQVLAGTLHYFRVHPDQWHDRLERLAAMGLNTVDTYIAWNFHER 66
Query: 85 VRNQYNFEGRYDLVKFVKLVAEAGLYAHLRIGPYVCAEWNFGGFPLWLHFIPGIQFRTDN 144
++ F+G D+ +FV+ GL +R GPY+CAEW+ GG P WL PG++ R+
Sbjct: 67 RTGEHRFDGWRDIERFVRTAQRTGLDVIVRPGPYICAEWDNGGLPAWLTDRPGMRPRSSY 126
Query: 145 EPFKAEMQRFTAKIVDMMKQEKLYASQGGPIILSQIENEYGNIDSAYGAAGKSYIKWA-- 202
P+ E+ R+ ++ + L A++GGP++ Q+ENEYG+ + +Y++W
Sbjct: 127 APYLDEVARWFDVLIPRIAD--LQAARGGPVVAVQVENEYGSYGDDH-----AYMRWVHD 179
Query: 203 --AGMA----LSLDTGVPWVMCQQSDAPDPIINTCNGFYCDQ----FTPNSNNKPKMWTE 252
AG L G +M P + G DQ + +P + E
Sbjct: 180 ALAGRGVTELLYTADGPTELMLDGGSLPGVLATATLGSRADQAAQLLRTRRSGEPFLCAE 239
Query: 253 NWSGWFLSFGGAVPYRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNFDRTSGGPFIS--- 309
W+GWF +G R V A A+ +GG+ + Y HGGTNF +G
Sbjct: 240 FWNGWFDHWGEKHHTRSVGSAAAALDEILAKGGSV-SLYPAHGGTNFGLWAGANHADGAL 298
Query: 310 ----TSYDYDAPLDEYGLIRQPKWGHLKD-LHKAIKLCEAALVATDP 351
TSYD DAP+ E+G PK+ +D L A E L + P
Sbjct: 299 QPTVTSYDSDAPIAEHG-APTPKFHAFRDRLLAATGAAERELPRSRP 344
>gi|346320352|gb|EGX89953.1| beta-calactosidase, putative [Cordyceps militaris CM01]
Length = 633
Score = 159 bits (401), Expect = 7e-36, Method: Compositional matrix adjust.
Identities = 105/324 (32%), Positives = 172/324 (53%), Gaps = 23/324 (7%)
Query: 20 TTSFGANVTYDHRAVVIGGKRRVLISGSIHYPRSTPEMWPDLIQKSKDGGLDVIETYVFW 79
TT + +Y+ ++ G+ +I G + R PE W ++ ++ GL+ I +Y++W
Sbjct: 22 TTHAPGSFSYNRTDFLLNGQPFQIIGGQMDPQRILPEYWTHRLKMARAMGLNTIFSYLYW 81
Query: 80 NLHEPVRNQYNFEGRYDLVKFVKLVAEAGLYAHLRIGPYVCAEWNFGGFPLWLHFIPGIQ 139
NLHEP ++F GR D+ +F +L + GL LR GPY+C E ++GGFP WL +PG+
Sbjct: 82 NLHEPRPGAWDFSGRNDVARFFRLAQQEGLRVVLRPGPYICGERDWGGFPAWLSQVPGMA 141
Query: 140 FRTDNEPFKAEMQRFTAKIVDMMKQEKLYASQGGPIILSQIENEYGNI--DSAYGAAGKS 197
R +N PF + + ++ + Q L +QGGPI+++Q+ENEYG+ D Y AA +
Sbjct: 142 VRQNNRPFLDAAKSYIDRLGKELGQ--LQITQGGPILMAQLENEYGSFGTDKTYLAALAA 199
Query: 198 YIKWAAGMALSLDT--GVPWVMCQQSDAPDPII--NTCNGFYC-DQFTPNSNN-KPKMWT 251
++ + L + G ++ Q +I ++ +GF D++ + + P++
Sbjct: 200 MLRENFDVFLYTNDGGGQSYLEGGQLHGVLAVIDGDSQSGFAARDKYVTDPTSLGPQLNG 259
Query: 252 ENWSGWFLSFGGAVPYRPV----EDLAFAVARF--FQRGGTFQNYYMYHGGTNFDRTSG- 304
E + W +G P++ + D+A AVA GG + YM+HGGTNF +G
Sbjct: 260 EYYISWIDQWGSDYPHQQIAGSQADVAKAVADLDWTLAGGYSFSIYMFHGGTNFGFENGG 319
Query: 305 ----GPF--ISTSYDYDAPLDEYG 322
GP ++TSYDY APLDE G
Sbjct: 320 IRDDGPLAAMTTSYDYGAPLDESG 343
>gi|422845798|ref|ZP_16892481.1| beta-galactosidase [Streptococcus sanguinis SK72]
gi|325688586|gb|EGD30603.1| beta-galactosidase [Streptococcus sanguinis SK72]
Length = 592
Score = 159 bits (401), Expect = 7e-36, Method: Compositional matrix adjust.
Identities = 108/315 (34%), Positives = 153/315 (48%), Gaps = 40/315 (12%)
Query: 36 IGGKRRVLISGSIHYPRSTPEMWPDLIQKSKDGGLDVIETYVFWNLHEPVRNQYNFEGRY 95
+ GK ++SG+I Y R P+ W D + K G + +ETY+ W LHEP Q+ EG
Sbjct: 12 LDGKPFKILSGAIQYFRLHPDQWRDTLYNLKALGFNTVETYIPWALHEPQEGQFQAEGML 71
Query: 96 DLVKFVKLVAEAGLYAHLRIGPYVCAEWNFGGFPLWLHFIPGIQFRTDNEPFKAEMQRFT 155
D + KLV E GLY +R PY+CAE++FGG P WL P ++ R ++ F ++ F
Sbjct: 72 DFEAYFKLVEEMGLYLIVRPTPYICAEFDFGGLPAWLLRYPSMRLRVNHPLFLEKVSHFY 131
Query: 156 AKIVDMMKQEKLYASQGGPIILSQIENEYGNIDSAYGAAGKSYIKWAAGMALSLDTGVP- 214
+ + + + QGGPI++ Q+ENEYG+ A K+Y++ A M VP
Sbjct: 132 DWLFPKLLPYQ--SDQGGPILMMQVENEYGSY-----AEDKAYMRSIAQMMKVRGVTVPL 184
Query: 215 ------WVMCQQSDA--PDPIINTCNGFYCDQFTPNSNNK-----------PKMWTENWS 255
W+ +S D I T N + Q N++N P M TE W
Sbjct: 185 FTSDGTWIEALESGTLIEDDIFVTGN--FGSQPKENTDNLRAFMERYGKKWPLMCTEFWD 242
Query: 256 GWFLSFGGAVPYRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNF--------DRTSGGPF 307
GWF + + R EDLA V Q G N ++ GGTNF +T P
Sbjct: 243 GWFSRWSEEIVRREAEDLAQDVKEMLQLGSM--NLFLLRGGTNFGFISGCSARKTKDLPQ 300
Query: 308 ISTSYDYDAPLDEYG 322
I TSYD+DAP+ E+G
Sbjct: 301 I-TSYDFDAPITEWG 314
>gi|345487997|ref|XP_001602984.2| PREDICTED: beta-galactosidase-like [Nasonia vitripennis]
Length = 638
Score = 159 bits (401), Expect = 7e-36, Method: Compositional matrix adjust.
Identities = 111/330 (33%), Positives = 166/330 (50%), Gaps = 35/330 (10%)
Query: 17 VLATTSFGANVTYDHRAVVIGGKRRVLISGSIHYPRSTPEMWPDLIQKSKDGGLDVIETY 76
V TSF + +++ ++ GK +SGS HY R+ + W D ++K + GL+ + TY
Sbjct: 24 VTNRTSFA--IDFENNQFLLDGKPFRYVSGSFHYFRTPKQYWRDRLRKMRAAGLNALSTY 81
Query: 77 VFWNLHEPVRNQYNFEGRYDLVKFVKLVAEAGLYAHLRIGPYVCAEWNFGGFPLW-LHFI 135
V W+LH+P N++ ++G DLVKF++L E L+ LR GPY+CAE FGGFP W L+ +
Sbjct: 82 VEWSLHQPEPNKWVWDGDADLVKFLQLAQEEDLFVLLRPGPYICAEREFGGFPYWLLNLV 141
Query: 136 PGIQFRTDNEPFKAEMQRFTAKIVDMMKQEKLYASQGGPIILSQIENEYGNI---DSAYG 192
PGI+ RT++ + + + +++ +K L GGPII+ Q+ENEYG+ D Y
Sbjct: 142 PGIKLRTNDTRYLEYAEEYLNQVLTRVK--PLLRGNGGPIIMVQVENEYGSFHACDKDYM 199
Query: 193 AAGKSYIKWAAGM-ALSLDTGVPW---VMCQQSDAPDPIIN-------TCNGFYCDQFTP 241
K+ I+ G AL T + + C I+ T N +F P
Sbjct: 200 TKLKNIIQNHVGTDALLYTTDGSYRQALRCGPVSGAYATIDFGTSSNVTQNFNLMREFEP 259
Query: 242 NSNNKPKMWTENWSGWFLSFGGAVPYRPVEDLAFAVARFFQRG---GTFQNYYMYHGGTN 298
P + +E + GW + P+ VE F + + G N YM++GGTN
Sbjct: 260 KG---PLVNSEFYPGWLSHW--EEPFERVE--TFKITKMLDEMLSLGASVNMYMFYGGTN 312
Query: 299 FDRTSGGPFIS------TSYDYDAPLDEYG 322
F +SG TSYDYDAPL E G
Sbjct: 313 FAFSSGANIFDNYTPDLTSYDYDAPLSEAG 342
>gi|422824944|ref|ZP_16873129.1| beta-galactosidase [Streptococcus sanguinis SK405]
gi|422827211|ref|ZP_16875390.1| beta-galactosidase [Streptococcus sanguinis SK678]
gi|422857055|ref|ZP_16903709.1| beta-galactosidase [Streptococcus sanguinis SK1]
gi|324992224|gb|EGC24146.1| beta-galactosidase [Streptococcus sanguinis SK405]
gi|324994315|gb|EGC26229.1| beta-galactosidase [Streptococcus sanguinis SK678]
gi|327459541|gb|EGF05887.1| beta-galactosidase [Streptococcus sanguinis SK1]
Length = 592
Score = 159 bits (401), Expect = 8e-36, Method: Compositional matrix adjust.
Identities = 108/315 (34%), Positives = 153/315 (48%), Gaps = 40/315 (12%)
Query: 36 IGGKRRVLISGSIHYPRSTPEMWPDLIQKSKDGGLDVIETYVFWNLHEPVRNQYNFEGRY 95
+ GK ++SG+I Y R P+ W D + K G + +ETY+ W LHEP Q+ EG
Sbjct: 12 LDGKPFKILSGAIQYFRLHPDQWRDTLYNLKALGFNTVETYIPWALHEPQEGQFKAEGML 71
Query: 96 DLVKFVKLVAEAGLYAHLRIGPYVCAEWNFGGFPLWLHFIPGIQFRTDNEPFKAEMQRFT 155
D + KLV E GLY +R PY+CAE++FGG P WL P ++ R ++ F ++ F
Sbjct: 72 DFEAYFKLVKETGLYLIVRPTPYICAEFDFGGLPAWLLRYPSMRLRVNHPLFLEKVSHFY 131
Query: 156 AKIVDMMKQEKLYASQGGPIILSQIENEYGNIDSAYGAAGKSYIKWAAGMALSLDTGVP- 214
+ + + + QGGPI++ Q+ENEYG+ A K+Y++ A M VP
Sbjct: 132 DWLFPKLLPYQ--SDQGGPILMMQVENEYGSY-----AEDKAYMRSIAQMMKVRGVTVPL 184
Query: 215 ------WVMCQQSDA--PDPIINTCNGFYCDQFTPNSNNK-----------PKMWTENWS 255
W+ +S D I T N + Q N++N P M TE W
Sbjct: 185 FTSDGTWIEALESGTLIEDDIFVTGN--FGSQPKENTDNLRAFMERYGKKWPLMCTEFWD 242
Query: 256 GWFLSFGGAVPYRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNF--------DRTSGGPF 307
GWF + + R EDLA V Q G N ++ GGTNF +T P
Sbjct: 243 GWFSRWSEEIVRREAEDLAQDVKEMLQLGSM--NLFLLRGGTNFGFISGCSARKTKDLPQ 300
Query: 308 ISTSYDYDAPLDEYG 322
I TSYD+DAP+ E+G
Sbjct: 301 I-TSYDFDAPITEWG 314
>gi|422849537|ref|ZP_16896213.1| beta-galactosidase [Streptococcus sanguinis SK115]
gi|325689511|gb|EGD31516.1| beta-galactosidase [Streptococcus sanguinis SK115]
Length = 592
Score = 159 bits (401), Expect = 8e-36, Method: Compositional matrix adjust.
Identities = 108/315 (34%), Positives = 153/315 (48%), Gaps = 40/315 (12%)
Query: 36 IGGKRRVLISGSIHYPRSTPEMWPDLIQKSKDGGLDVIETYVFWNLHEPVRNQYNFEGRY 95
+ GK ++SG+I Y R P+ W D + K G + +ETY+ W LHEP Q+ EG
Sbjct: 12 LDGKPFKILSGAIQYFRLHPDQWRDTLYNLKALGFNTVETYIPWALHEPQEGQFQAEGML 71
Query: 96 DLVKFVKLVAEAGLYAHLRIGPYVCAEWNFGGFPLWLHFIPGIQFRTDNEPFKAEMQRFT 155
D + KLV E GLY +R PY+CAE++FGG P WL P ++ R ++ F ++ F
Sbjct: 72 DFEAYFKLVEEMGLYLIVRPTPYICAEFDFGGLPAWLLRYPSMRLRVNHPLFLEKVSHFY 131
Query: 156 AKIVDMMKQEKLYASQGGPIILSQIENEYGNIDSAYGAAGKSYIKWAAGMALSLDTGVP- 214
+ + + + QGGPI++ Q+ENEYG+ A K+Y++ A M VP
Sbjct: 132 DWLFPKLLPYQ--SDQGGPILMMQVENEYGSY-----AEDKAYMRSIAQMMKVRGVTVPL 184
Query: 215 ------WVMCQQSDA--PDPIINTCNGFYCDQFTPNSNNK-----------PKMWTENWS 255
W+ +S D I T N + Q N++N P M TE W
Sbjct: 185 FTSDGTWIEALESGTLIEDDIFVTGN--FGSQPKENTDNLRAFMERYGKKWPLMCTEFWD 242
Query: 256 GWFLSFGGAVPYRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNF--------DRTSGGPF 307
GWF + + R EDLA V Q G N ++ GGTNF +T P
Sbjct: 243 GWFSRWSEEIVRREAEDLAQDVKEMLQLGSM--NLFLLRGGTNFGFISGCSARKTKDLPQ 300
Query: 308 ISTSYDYDAPLDEYG 322
I TSYD+DAP+ E+G
Sbjct: 301 I-TSYDFDAPITEWG 314
>gi|422864131|ref|ZP_16910760.1| beta-galactosidase [Streptococcus sanguinis SK408]
gi|327472954|gb|EGF18381.1| beta-galactosidase [Streptococcus sanguinis SK408]
Length = 592
Score = 159 bits (401), Expect = 8e-36, Method: Compositional matrix adjust.
Identities = 108/315 (34%), Positives = 153/315 (48%), Gaps = 40/315 (12%)
Query: 36 IGGKRRVLISGSIHYPRSTPEMWPDLIQKSKDGGLDVIETYVFWNLHEPVRNQYNFEGRY 95
+ GK ++SG+I Y R P+ W D + K G + +ETY+ W LHEP Q+ EG
Sbjct: 12 LDGKPFKILSGAIQYFRLHPDQWRDTLYNLKALGFNTVETYIPWALHEPQEGQFKAEGML 71
Query: 96 DLVKFVKLVAEAGLYAHLRIGPYVCAEWNFGGFPLWLHFIPGIQFRTDNEPFKAEMQRFT 155
D + KLV E GLY +R PY+CAE++FGG P WL P ++ R ++ F ++ F
Sbjct: 72 DFEAYFKLVKETGLYLIVRPTPYICAEFDFGGLPAWLLRYPSMRLRVNHPLFLEKVSHFY 131
Query: 156 AKIVDMMKQEKLYASQGGPIILSQIENEYGNIDSAYGAAGKSYIKWAAGMALSLDTGVP- 214
+ + + + QGGPI++ Q+ENEYG+ A K+Y++ A M VP
Sbjct: 132 DWLFPKLLPYQ--SDQGGPILMMQVENEYGSY-----AEDKAYMRSIAQMMKVRGVTVPL 184
Query: 215 ------WVMCQQSDA--PDPIINTCNGFYCDQFTPNSNNK-----------PKMWTENWS 255
W+ +S D I T N + Q N++N P M TE W
Sbjct: 185 FTSDGTWIEALESGTLIEDDIFVTGN--FGSQPKENTDNLRAFMERYGKKWPLMCTEFWD 242
Query: 256 GWFLSFGGAVPYRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNF--------DRTSGGPF 307
GWF + + R EDLA V Q G N ++ GGTNF +T P
Sbjct: 243 GWFSRWSEEIVRREAEDLAQDVKEMLQLGSM--NLFLLRGGTNFGFISGCSARKTKDLPQ 300
Query: 308 ISTSYDYDAPLDEYG 322
I TSYD+DAP+ E+G
Sbjct: 301 I-TSYDFDAPITEWG 314
>gi|422859360|ref|ZP_16906010.1| beta-galactosidase [Streptococcus sanguinis SK1057]
gi|327459140|gb|EGF05488.1| beta-galactosidase [Streptococcus sanguinis SK1057]
Length = 592
Score = 159 bits (401), Expect = 8e-36, Method: Compositional matrix adjust.
Identities = 108/315 (34%), Positives = 153/315 (48%), Gaps = 40/315 (12%)
Query: 36 IGGKRRVLISGSIHYPRSTPEMWPDLIQKSKDGGLDVIETYVFWNLHEPVRNQYNFEGRY 95
+ GK ++SG+I Y R P+ W D + K G + +ETY+ W LHEP Q+ EG
Sbjct: 12 LDGKPFKILSGAIQYFRLHPDQWRDTLYNLKALGFNTVETYIPWALHEPQEGQFQAEGML 71
Query: 96 DLVKFVKLVAEAGLYAHLRIGPYVCAEWNFGGFPLWLHFIPGIQFRTDNEPFKAEMQRFT 155
D + KLV E GLY +R PY+CAE++FGG P WL P ++ R ++ F ++ F
Sbjct: 72 DFEAYFKLVEEMGLYLIVRPTPYICAEFDFGGLPAWLLRYPSMRLRVNHPLFLEKVSHFY 131
Query: 156 AKIVDMMKQEKLYASQGGPIILSQIENEYGNIDSAYGAAGKSYIKWAAGMALSLDTGVP- 214
+ + + + QGGPI++ Q+ENEYG+ A K+Y++ A M VP
Sbjct: 132 DWLFPKLLPYQ--SDQGGPILMMQVENEYGSY-----AEDKAYMRSIAQMMKVRGVTVPL 184
Query: 215 ------WVMCQQSDA--PDPIINTCNGFYCDQFTPNSNNK-----------PKMWTENWS 255
W+ +S D I T N + Q N++N P M TE W
Sbjct: 185 FTSDGTWIEALESGTLIEDDIFVTGN--FGSQPKENTDNLRAFMERYGKEWPLMCTEFWD 242
Query: 256 GWFLSFGGAVPYRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNF--------DRTSGGPF 307
GWF + + R EDLA V Q G N ++ GGTNF +T P
Sbjct: 243 GWFSRWSEEIVRREAEDLAQDVKEMLQLGSM--NLFLLRGGTNFGFISGCSARKTKDLPQ 300
Query: 308 ISTSYDYDAPLDEYG 322
I TSYD+DAP+ E+G
Sbjct: 301 I-TSYDFDAPITEWG 314
>gi|422880263|ref|ZP_16926727.1| beta-galactosidase [Streptococcus sanguinis SK1059]
gi|422930132|ref|ZP_16963071.1| beta-galactosidase [Streptococcus sanguinis ATCC 29667]
gi|422930724|ref|ZP_16963655.1| beta-galactosidase [Streptococcus sanguinis SK340]
gi|332364839|gb|EGJ42608.1| beta-galactosidase [Streptococcus sanguinis SK1059]
gi|339614112|gb|EGQ18823.1| beta-galactosidase [Streptococcus sanguinis ATCC 29667]
gi|339620700|gb|EGQ25268.1| beta-galactosidase [Streptococcus sanguinis SK340]
Length = 592
Score = 159 bits (401), Expect = 8e-36, Method: Compositional matrix adjust.
Identities = 108/315 (34%), Positives = 153/315 (48%), Gaps = 40/315 (12%)
Query: 36 IGGKRRVLISGSIHYPRSTPEMWPDLIQKSKDGGLDVIETYVFWNLHEPVRNQYNFEGRY 95
+ GK ++SG+I Y R P+ W D + K G + +ETY+ W LHEP Q+ EG
Sbjct: 12 LDGKPFKILSGAIQYFRLHPDQWRDTLYNLKALGFNTVETYIPWALHEPQEGQFQAEGML 71
Query: 96 DLVKFVKLVAEAGLYAHLRIGPYVCAEWNFGGFPLWLHFIPGIQFRTDNEPFKAEMQRFT 155
D + KLV E GLY +R PY+CAE++FGG P WL P ++ R ++ F ++ F
Sbjct: 72 DFEAYFKLVEEMGLYLIVRPTPYICAEFDFGGLPAWLLRYPSMRLRVNHPLFLEKVSHFY 131
Query: 156 AKIVDMMKQEKLYASQGGPIILSQIENEYGNIDSAYGAAGKSYIKWAAGMALSLDTGVP- 214
+ + + + QGGPI++ Q+ENEYG+ A K+Y++ A M VP
Sbjct: 132 DWLFPKLLPYQ--SDQGGPILMMQVENEYGSY-----AEDKAYMRSIAQMMKVRGVTVPL 184
Query: 215 ------WVMCQQSDA--PDPIINTCNGFYCDQFTPNSNNK-----------PKMWTENWS 255
W+ +S D I T N + Q N++N P M TE W
Sbjct: 185 FTSDGTWIEALESGTLIEDDIFVTGN--FGSQPKENTDNLRAFMERYGKKWPLMCTEFWD 242
Query: 256 GWFLSFGGAVPYRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNF--------DRTSGGPF 307
GWF + + R EDLA V Q G N ++ GGTNF +T P
Sbjct: 243 GWFSRWSEEIVRREAEDLAQDVKEMLQLGSM--NLFLLRGGTNFGFISGCSARKTKDLPQ 300
Query: 308 ISTSYDYDAPLDEYG 322
I TSYD+DAP+ E+G
Sbjct: 301 I-TSYDFDAPITEWG 314
>gi|84623327|ref|YP_450699.1| beta-galactosidase [Xanthomonas oryzae pv. oryzae MAFF 311018]
gi|188577369|ref|YP_001914298.1| beta-galactosidase [Xanthomonas oryzae pv. oryzae PXO99A]
gi|84367267|dbj|BAE68425.1| beta-galactosidase [Xanthomonas oryzae pv. oryzae MAFF 311018]
gi|188521821|gb|ACD59766.1| beta-galactosidase [Xanthomonas oryzae pv. oryzae PXO99A]
Length = 613
Score = 159 bits (401), Expect = 8e-36, Method: Compositional matrix adjust.
Identities = 122/344 (35%), Positives = 167/344 (48%), Gaps = 39/344 (11%)
Query: 7 LLLVLCWGFVVLATTS-------FGANVTYDHRAVVIGGKRRVLISGSIHYPRSTPEMWP 59
L+L L + V A + FG T V GK L+SG+IH+ R W
Sbjct: 9 LVLALAFALPVTAAAADTERWPDFGTQGT----QFVRDGKPYQLLSGAIHFQRIPRAYWK 64
Query: 60 DLIQKSKDGGLDVIETYVFWNLHEPVRNQYNFEGRYDLVKFVKLVAEAGLYAHLRIGPYV 119
D +QK++ GL+ +ETYVFWNL EP + Q++F G D+ FV+ A GL LR GPY
Sbjct: 65 DRLQKARALGLNTVETYVFWNLVEPQQGQFDFSGNNDVAAFVQEAAAQGLNVILRPGPYA 124
Query: 120 CAEWNFGGFPLWLHFIPGIQFRTDNEPFKAEMQRFTAKIVDMMKQEKLYASQGGPIILSQ 179
CAEW GG+P WL I+ R+ + F A Q + + + + L GGPII Q
Sbjct: 125 CAEWEAGGYPAWLFGQGNIRVRSRDPRFLAASQAYLDAVAKQV--QPLLNHNGGPIIAVQ 182
Query: 180 IENEYGNI--DSAYGAAGKS-YIKWAAGMALSLDTGVPWVMCQQSDAPD--PIINTCNG- 233
+ENEYG+ D AY A ++ Y+K AL L T M PD ++N G
Sbjct: 183 VENEYGSYADDHAYMADNRAMYVKAGFDKAL-LFTSDGADMLANGTLPDTLAVVNFAPGE 241
Query: 234 --FYCDQFTPNSNNKPKMWTENWSGWFLSFGGAVPYRPVEDLAFAVARFFQ---RGGTFQ 288
D+ ++P+M E W+GWF +G P+ + A A F+ R G
Sbjct: 242 AKSAFDKLIAFRPDQPRMVGEYWAGWFDHWGK--PHAATD--ATQQAEEFEWILRQGHSA 297
Query: 289 NYYMYHGGTNFDRTSGGPF----------ISTSYDYDAPLDEYG 322
N YM+ GGT+F +G F +TSYDYDA +DE G
Sbjct: 298 NLYMFIGGTSFGFMNGANFQNNPSDHYAPQTTSYDYDAIVDEAG 341
Score = 40.4 bits (93), Expect = 4.0, Method: Compositional matrix adjust.
Identities = 47/169 (27%), Positives = 69/169 (40%), Gaps = 28/169 (16%)
Query: 499 KTVLHVQSLGHALHAFINGKLVGSGYGSSSNAKVTVDFPIALAPGKNTFDLLSLTVGLQN 558
K L++ + H +++ VGS VD P G +T D+L G N
Sbjct: 422 KGPLYLGDVRDVAHVYLDQTPVGSVERRLQQVSTAVDIPA----GHHTLDVLVENSGRIN 477
Query: 559 YGAFYEKTGAGITGPVQLKGSGNGTNIDLSSQQWTYQTGLKGEELNFPSGSSTQ-WDSKS 617
YG AG+ PV L GN QQ TG + L + S + W K+
Sbjct: 478 YGPRMADGRAGLVDPVLL---GN--------QQ---VTGWQAFPLPMRAPDSIRGWTRKA 523
Query: 618 TLPKLQPLVWYKTT--FDAPAGSEPVAIDFTGMGKGEAWVNGQSIGRYW 664
+Q +++ T PA + +D GKG AW NG ++GR+W
Sbjct: 524 ----VQGPAFHRGTVRIGTPADTY---LDMRAFGKGFAWANGVNLGRHW 565
>gi|239986962|ref|ZP_04707626.1| putative beta-galactosidase [Streptomyces roseosporus NRRL 11379]
Length = 606
Score = 159 bits (401), Expect = 9e-36, Method: Compositional matrix adjust.
Identities = 111/323 (34%), Positives = 158/323 (48%), Gaps = 39/323 (12%)
Query: 26 NVTYDHRAVVIGGKRRVLISGSIHYPRSTPEMWPDLIQKSKDGGLDVIETYVFWNLHEPV 85
+ T D GK L+SG++HY R E W + GL+ +ETYV WNLHEP
Sbjct: 3 DFTVDDDGFRFDGKPVRLLSGALHYFRVHEEQWGHRLAVLAAMGLNCVETYVPWNLHEPR 62
Query: 86 RNQYNFEGRYDLVKFVKLVAEAGLYAHLRIGPYVCAEWNFGGFPLWLHFIPGIQFRTDNE 145
+ G L +F+ V AGL+A +R GPY+CAEW GG P+W+ G + RT +
Sbjct: 63 EGEVRDVG--ALGRFLDAVERAGLWAIVRPGPYICAEWENGGLPVWVTGRFGRRVRTRDA 120
Query: 146 PFKAEMQRFTAKIVDMMKQEKLYASQGGPIILSQIENEYGNIDSAYGAAGKSYIKWAAGM 205
++A ++R+ +++ + + ++ +GGP+IL Q ENEYG+ S Y++W AG+
Sbjct: 121 EYRAVVERWFRELLPQVVERQVV--RGGPVILVQAENEYGSFGSD-----AVYLEWLAGL 173
Query: 206 ALSLDTGVPWVMCQQSDAPDP----------IINTCN--GFYCDQFTPNSNNKPK---MW 250
VP SD P+ ++ T N + F ++PK M
Sbjct: 174 LRECGVTVPLFT---SDGPEDHMLTGGSVPGLLATANFGSGAREGFAVLRRHQPKGPLMC 230
Query: 251 TENWSGWFLSFGGAVPYRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNF----DRTSGGP 306
E W GWF +G R E+ A A+ + G + N YM HGGTNF GGP
Sbjct: 231 MEFWCGWFDHWGAEPVLRDAEEAAGALREILECGASV-NIYMAHGGTNFAGWAGANRGGP 289
Query: 307 F-------ISTSYDYDAPLDEYG 322
TSYDYDAP+DEYG
Sbjct: 290 LQDGEFQPTVTSYDYDAPVDEYG 312
>gi|24418925|ref|NP_722498.1| beta-galactosidase-1-like protein 2 [Mus musculus]
gi|23512349|gb|AAH38479.1| Galactosidase, beta 1-like 2 [Mus musculus]
gi|148693361|gb|EDL25308.1| cDNA sequence BC038479, isoform CRA_b [Mus musculus]
Length = 652
Score = 159 bits (401), Expect = 9e-36, Method: Compositional matrix adjust.
Identities = 109/304 (35%), Positives = 147/304 (48%), Gaps = 38/304 (12%)
Query: 43 LISGSIHYPRSTPEMWPDLIQKSKDGGLDVIETYVFWNLHEPVRNQYNFEGRYDLVKFVK 102
++ GSIHY R E W D + K K GL+ + TYV WNLHEP R +++F G DL F++
Sbjct: 79 ILGGSIHYFRVPREYWRDRLLKLKACGLNTLTTYVPWNLHEPERGKFDFSGNLDLEAFIQ 138
Query: 103 LVAEAGLYAHLRIGPYVCAEWNFGGFPLWLHFIPGIQFRTDNEPFKAEMQRFTAKIVDMM 162
L A+ GL+ LR GPY+C+E + GG P WL P ++ RT F + + + M
Sbjct: 139 LAAKIGLWVILRPGPYICSEIDLGGLPSWLLQDPDMKLRTTYHGFTKAVDLYFDHL--MS 196
Query: 163 KQEKLYASQGGPIILSQIENEYG--NIDSAYGAAGKSYIKWAA----------------G 204
+ L GGPII Q+ENEYG N D AY YIK A G
Sbjct: 197 RVVPLQYKHGGPIIAVQVENEYGSYNKDRAY----MPYIKKALEDRGIIEMLLTSDNKDG 252
Query: 205 MALSLDTGVPWVMCQQSDAPDPIINTCNGFYCDQFTPNSNNKPKMWTENWSGWFLSFGGA 264
+ + GV + QS +NT +PKM E W+GWF S+GG+
Sbjct: 253 LEKGVVDGVLATINLQSQQELMALNTV-------LLSIQGIQPKMVMEYWTGWFDSWGGS 305
Query: 265 VPYRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNFDRTSGGPFIS------TSYDYDAPL 318
++ V+ + G + N YM+HGGTNF +G + TSYDYDA L
Sbjct: 306 HNILDSSEVLQTVSAIIKDGSSI-NLYMFHGGTNFGFINGAMHFNDYKADVTSYDYDAIL 364
Query: 319 DEYG 322
E G
Sbjct: 365 TEAG 368
>gi|125717147|ref|YP_001034280.1| glycosyl hydrolase family protein [Streptococcus sanguinis SK36]
gi|125497064|gb|ABN43730.1| Glycosylhydrolase, family 35, putative [Streptococcus sanguinis
SK36]
Length = 592
Score = 159 bits (401), Expect = 9e-36, Method: Compositional matrix adjust.
Identities = 108/315 (34%), Positives = 153/315 (48%), Gaps = 40/315 (12%)
Query: 36 IGGKRRVLISGSIHYPRSTPEMWPDLIQKSKDGGLDVIETYVFWNLHEPVRNQYNFEGRY 95
+ GK ++SG+I Y R P+ W D + K G + +ETY+ W LHEP Q+ EG
Sbjct: 12 LDGKPFKILSGAIQYFRLHPDQWRDTLYNLKALGFNTVETYIPWALHEPQEGQFKAEGML 71
Query: 96 DLVKFVKLVAEAGLYAHLRIGPYVCAEWNFGGFPLWLHFIPGIQFRTDNEPFKAEMQRFT 155
D + KLV E GLY +R PY+CAE++FGG P WL P ++ R ++ F ++ F
Sbjct: 72 DFEAYFKLVKEMGLYLIVRPTPYICAEFDFGGLPAWLLRYPSMRLRVNHPLFLEKVSHFY 131
Query: 156 AKIVDMMKQEKLYASQGGPIILSQIENEYGNIDSAYGAAGKSYIKWAAGMALSLDTGVP- 214
+ + + + QGGPI++ Q+ENEYG+ A K+Y++ A M VP
Sbjct: 132 DWLFPKLLPYQ--SDQGGPILMMQVENEYGSY-----AEDKAYMRSIAQMMKVRGVTVPL 184
Query: 215 ------WVMCQQSDA--PDPIINTCNGFYCDQFTPNSNNK-----------PKMWTENWS 255
W+ +S D I T N + Q N++N P M TE W
Sbjct: 185 FTSDGTWIEALESGTLIEDDIFVTGN--FGSQPKENTDNLRAFMERYGKKWPLMCTEFWD 242
Query: 256 GWFLSFGGAVPYRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNF--------DRTSGGPF 307
GWF + + R EDLA V Q G N ++ GGTNF +T P
Sbjct: 243 GWFSRWSEEIVRREAEDLAQDVKEMLQLGSM--NLFLLRGGTNFGFISGCSARKTKDLPQ 300
Query: 308 ISTSYDYDAPLDEYG 322
I TSYD+DAP+ E+G
Sbjct: 301 I-TSYDFDAPITEWG 314
>gi|422864548|ref|ZP_16911173.1| beta-galactosidase [Streptococcus sanguinis SK1058]
gi|327490742|gb|EGF22523.1| beta-galactosidase [Streptococcus sanguinis SK1058]
Length = 592
Score = 159 bits (401), Expect = 9e-36, Method: Compositional matrix adjust.
Identities = 108/315 (34%), Positives = 153/315 (48%), Gaps = 40/315 (12%)
Query: 36 IGGKRRVLISGSIHYPRSTPEMWPDLIQKSKDGGLDVIETYVFWNLHEPVRNQYNFEGRY 95
+ GK ++SG+I Y R P+ W D + K G + +ETY+ W LHEP Q+ EG
Sbjct: 12 LDGKPFKILSGAIQYFRLHPDQWRDTLYNLKALGFNTVETYIPWALHEPQEGQFQAEGML 71
Query: 96 DLVKFVKLVAEAGLYAHLRIGPYVCAEWNFGGFPLWLHFIPGIQFRTDNEPFKAEMQRFT 155
D + KLV E GLY +R PY+CAE++FGG P WL P ++ R ++ F ++ F
Sbjct: 72 DFEAYFKLVEEMGLYLIVRPTPYICAEFDFGGLPAWLLRYPSMRLRVNHPLFLEKVSHFY 131
Query: 156 AKIVDMMKQEKLYASQGGPIILSQIENEYGNIDSAYGAAGKSYIKWAAGMALSLDTGVP- 214
+ + + + QGGPI++ Q+ENEYG+ A K+Y++ A M VP
Sbjct: 132 DWLFPKLLPYQ--SDQGGPILMMQVENEYGSY-----AEDKAYMRSIAQMMKVRGVTVPL 184
Query: 215 ------WVMCQQSDA--PDPIINTCNGFYCDQFTPNSNNK-----------PKMWTENWS 255
W+ +S D I T N + Q N++N P M TE W
Sbjct: 185 FTSDGTWIEALESGTLIEDDIFVTGN--FGSQPKENTDNLRAFMERYGKKWPLMCTEFWD 242
Query: 256 GWFLSFGGAVPYRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNF--------DRTSGGPF 307
GWF + + R EDLA V Q G N ++ GGTNF +T P
Sbjct: 243 GWFSRWSEEIVRREAEDLAQDVKEMLQLGSM--NLFLLRGGTNFGFISGCSARKTKDLPQ 300
Query: 308 ISTSYDYDAPLDEYG 322
I TSYD+DAP+ E+G
Sbjct: 301 I-TSYDFDAPITEWG 314
>gi|188990653|ref|YP_001902663.1| beta-galactosidase [Xanthomonas campestris pv. campestris str.
B100]
gi|167732413|emb|CAP50607.1| exported beta-galactosidase [Xanthomonas campestris pv. campestris]
Length = 680
Score = 159 bits (401), Expect = 9e-36, Method: Compositional matrix adjust.
Identities = 114/322 (35%), Positives = 159/322 (49%), Gaps = 31/322 (9%)
Query: 35 VIGGKRRVLISGSIHYPRSTPEMWPDLIQKSKDGGLDVIETYVFWNLHEPVRNQYNFEGR 94
V GK ++SG+IH+ R W D +QK++ GL+ +ETYVFWNL EP + Q++F
Sbjct: 107 VRDGKPYQVLSGAIHFQRIPRAYWKDRLQKARALGLNTVETYVFWNLVEPQQGQFDFNAN 166
Query: 95 YDLVKFVKLVAEAGLYAHLRIGPYVCAEWNFGGFPLWLHFIPGIQFRTDNEPFKAEMQRF 154
D+ FV+ A GL LR GPY CAEW GG+P WL I+ R+ + F A Q +
Sbjct: 167 NDVAAFVREAAAQGLNVILRPGPYACAEWETGGYPAWLFGKDNIRVRSRDPRFLAASQAY 226
Query: 155 TAKIVDMMKQEKLYASQGGPIILSQIENEYGNIDS--AYGAAGKS-YIKWAAGMALSLDT 211
+ + L GGPII Q+ENEYG+ D AY A ++ Y+K AL L T
Sbjct: 227 LDAVSKQV--HPLLNHNGGPIIAVQVENEYGSYDDDHAYMADNRAMYVKAGFDDAL-LFT 283
Query: 212 GVPWVMCQQSDAPD--PIINTCNG---FYCDQFTPNSNNKPKMWTENWSGWFLSFG---- 262
M PD ++N G D+ ++P+M E W+GWF +G
Sbjct: 284 SDGADMLANGTLPDTLAVVNFAPGEAKSAFDKLIKFRPDQPRMVGEYWAGWFDHWGKPHA 343
Query: 263 GAVPYRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNFDRTSGGPF----------ISTSY 312
+ E+L + + R G N YM+ GGT+F +G F +TSY
Sbjct: 344 STDAKQQTEELEWIL-----RQGHSANLYMFIGGTSFGFMNGANFQGNPSDHYAPQTTSY 398
Query: 313 DYDAPLDEYGLIRQPKWGHLKD 334
DYDA LDE G PK+ ++D
Sbjct: 399 DYDAILDEAGRA-TPKFALMRD 419
>gi|258507331|ref|YP_003170082.1| beta-galactosidase (GH35) [Lactobacillus rhamnosus GG]
gi|385827042|ref|YP_005864814.1| beta-galactosidase [Lactobacillus rhamnosus GG]
gi|257147258|emb|CAR86231.1| Beta-galactosidase (GH35) [Lactobacillus rhamnosus GG]
gi|259648687|dbj|BAI40849.1| beta-galactosidase [Lactobacillus rhamnosus GG]
Length = 593
Score = 159 bits (401), Expect = 9e-36, Method: Compositional matrix adjust.
Identities = 117/364 (32%), Positives = 172/364 (47%), Gaps = 44/364 (12%)
Query: 30 DHRAVVIGGKRRVLISGSIHYPRSTPEMWPDLIQKSKDGGLDVIETYVFWNLHEPVRNQY 89
DH ++ GK ++SG+IHY R P W + K G + +ETYV WNLHE ++
Sbjct: 7 DHE-FMLDGKPFKILSGAIHYFRVHPSDWYHSLYNLKALGFNTVETYVPWNLHEYREGEF 65
Query: 90 NFEGRYDLVKFVKLVAEAGLYAHLRIGPYVCAEWNFGGFPLWLHFIPGIQFRTDNEPFKA 149
+F G D+ +F+K + GLYA +R PY+CAEW FGGFP WL ++ RTD+ + A
Sbjct: 66 DFSGILDIERFLKTAEDLGLYAIVRPSPYICAEWEFGGFPAWL-LTKKMRLRTDDPAYLA 124
Query: 150 EMQRFTAKIVDMMKQEKLYASQGGPIILSQIENEYGNIDSAYGAAGKSYIKWAAGMALSL 209
+ R+ ++ + ++ + GG +I+ Q+ENEYG +YG + Y+ A +
Sbjct: 125 AIDRYYTALMPHLVDHQV--THGGNVIMMQVENEYG----SYG-EDQDYLAAVAKLMQQH 177
Query: 210 DTGVPWVMCQQSDAPDP------------IINTCN-GFYCDQ--------FTPNSNNKPK 248
VP SD P P I+ T N G D+ + + P
Sbjct: 178 GVDVPLFT---SDGPWPATLNAGSMIDAGILATGNFGSAADKNFDRLAAFHQEHGRDWPL 234
Query: 249 MWTENWSGWFLSFGGAVPYRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNFDRTSGG--- 305
M E W GWF +G + R ++ A + +RG N YM+HGGTNF +G
Sbjct: 235 MCVEFWDGWFNRWGEPIIRRDPDETAEDLRAVIKRGSV--NLYMFHGGTNFGFMNGTSAR 292
Query: 306 -----PFISTSYDYDAPLDEYGLIRQPKWGHLKDLHKAIKLCEAALVATDPTYPSLGPNL 360
P + TSYDYDAPL+E G + K +H+ + + A PT L
Sbjct: 293 KDHDLPQV-TSYDYDAPLNEQGNPTPKYFAIQKMIHEELPEVQQAKPLVKPTMAPASHPL 351
Query: 361 EATV 364
A V
Sbjct: 352 TAKV 355
Score = 46.6 bits (109), Expect = 0.060, Method: Compositional matrix adjust.
Identities = 66/271 (24%), Positives = 99/271 (36%), Gaps = 62/271 (22%)
Query: 458 DAFTKPGLLEQINTTADQSDYLWYSLSTNIKADEPLL---EDGSKTVLHVQSLGHALHAF 514
D TKP T Y Y+L +PL+ + G+ L V + A+
Sbjct: 362 DQLTKPIAASYPQTQEFLGQYTGYTLYRT----QPLISGTDKGTPAKLRVIDARDRVQAY 417
Query: 515 INGKLVGSGYGSSSNAKVTVDFPIALAPGKNTFDLLSLTVGLQNYGAFYEKTGAGITGPV 574
++ K + + Y + + D + G + DLL + NYG+ I
Sbjct: 418 LDQKWLATQYQEA----IGDDILLPEVEGHHQLDLLVENMSRVNYGS-------KIEAIT 466
Query: 575 QLKGSGNGTNIDLSSQQWTYQTGL---KGEELNFPSGSSTQWDSKSTLPKLQPLVWYKTT 631
Q KG G +DL + Q L + L F G W + +YK T
Sbjct: 467 QFKGIRTGVMVDLHFIKGYQQYPLDLNRASRLTFTEG----WQPATP-------AFYKYT 515
Query: 632 FDAPAGSEPVAIDFTGMGKGEAWVNGQSIGRYWPTYVSQNGGCTDSCNYRGAYSSNKCLK 691
FD A + +D G GKG VNG ++GR+W + G
Sbjct: 516 FDLTAPQD-TYLDCRGFGKGVMLVNGVNVGRFW-----EKG------------------- 550
Query: 692 NCGKPSQSLYHVPRSWLKSSGNTLVLFEEIG 722
P+ SLY VP L + N +++FE G
Sbjct: 551 ----PTLSLY-VPAGLLHAGKNDVIVFETEG 576
>gi|281352249|gb|EFB27833.1| hypothetical protein PANDA_007660 [Ailuropoda melanoleuca]
Length = 626
Score = 158 bits (400), Expect = 9e-36, Method: Compositional matrix adjust.
Identities = 116/352 (32%), Positives = 168/352 (47%), Gaps = 38/352 (10%)
Query: 27 VTYDHRAVVIGGKRRVLISGSIHYPRSTPEMWPDLIQKSKDGGLDVIETYVFWNLHEPVR 86
+ Y H + G+ ISGSIHY R W D + K K GL+ I++YV WN HEP
Sbjct: 8 IDYSHNRFLKDGRPFRYISGSIHYFRVPRFYWKDRLLKMKMAGLNAIQSYVPWNFHEPQP 67
Query: 87 NQYNFEGRYDLVKFVKLVAEAGLYAHLRIGPYVCAEWNFGGFPLWLHFIPGIQFRTDNEP 146
QY F G +D+ F+KL E GL LR GPY+CAEW+ GG P WL I R+ +
Sbjct: 68 GQYQFSGEHDVEYFIKLAHELGLLVILRPGPYICAEWDMGGLPAWLLLKESIILRSSDPD 127
Query: 147 FKAEMQRFTAKIVDMMKQEKLYASQGGPIILSQIENEYGNIDSAYGAAGKSYIKWAAGM- 205
+ A + ++ ++ MK L GGPII Q+ENEYG +Y + ++++ +
Sbjct: 128 YLAAVDKWLGVLLPKMK--PLLYQNGGPIITVQVENEYG----SYFSCDYDHLRFLQKLF 181
Query: 206 --ALSLDTGVPWVMCQQSDAPDPIINTC---NGFYCD-QFTPNSN-------------NK 246
L D V+ +D + C G Y F P +N
Sbjct: 182 HYHLGND-----VLLFTTDGAHEMFLKCGALQGLYATVDFGPGANITAAFEIQRKSEPRG 236
Query: 247 PKMWTENWSGWFLSFGGAVPYRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNFDRTSGG- 305
P + +E ++GW +G E +A A+ RG N YM+ GGTNF +G
Sbjct: 237 PLVNSEFYTGWLDHWGQPHSTAKTEVVASALHEILSRGANV-NLYMFIGGTNFAYWNGAN 295
Query: 306 -PFIS--TSYDYDAPLDEYGLIRQPKWGHLKD-LHKAIKLCEAALVATDPTY 353
P+ + TSYDYDAPL E G + + K+ L+D + K K+ E + + P +
Sbjct: 296 MPYQAQPTSYDYDAPLSEAGDLTE-KYFALRDVIRKFEKVPEGFIPPSTPKF 346
>gi|297204198|ref|ZP_06921595.1| beta-galactosidase [Streptomyces sviceus ATCC 29083]
gi|197714112|gb|EDY58146.1| beta-galactosidase [Streptomyces sviceus ATCC 29083]
Length = 588
Score = 158 bits (400), Expect = 1e-35, Method: Compositional matrix adjust.
Identities = 109/309 (35%), Positives = 160/309 (51%), Gaps = 32/309 (10%)
Query: 35 VIGGKRRVLISGSIHYPRSTPEMWPDLIQKSKDGGLDVIETYVFWNLHEPVRNQYNFEGR 94
++ G+ +ISG++HY R P+ W D ++K++ GL+ IETY+ WNLHEP +G
Sbjct: 15 LLHGEPFRIISGAMHYFRIHPDQWTDRLRKARLMGLNTIETYLPWNLHEPEPGTLVLDGF 74
Query: 95 YDLVKFVKLVAEAGLYAHLRIGPYVCAEWNFGGFPLWLHFIPGIQFRTDNEPFKAEMQRF 154
DL ++++L + GL+ LR GP++CAEW+ GG P WL P I+ R+ + F +
Sbjct: 75 LDLPRWLRLAQDEGLHVLLRPGPFICAEWDDGGLPAWLLADPDIRLRSSDPRFTGAFDGY 134
Query: 155 TAKIVDMMKQEKLYASQGGPIILSQIENEYGNIDSAYGAAGKSYIKWAAGMALSLDTGVP 214
+++ ++ A+ GGP+I Q+ENEYG AYG +Y+K AL D GV
Sbjct: 135 LDQLLPALR--PFMAAHGGPVIAVQVENEYG----AYG-DDTAYLK-HVHQALR-DRGVE 185
Query: 215 WVM--CQQSDA--------PDPIINTCNGFYCDQ----FTPNSNNKPKMWTENWSGWFLS 260
++ C Q+ A P + G ++ + P M +E W GWF
Sbjct: 186 ELLYTCDQASAEHLAAGTLPGTLATATFGSRVEENLAALRTHQPEGPLMCSEFWVGWFDH 245
Query: 261 FGGAVPYRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNFDRTSGG-------PFISTSYD 313
+GG R D A + R G + N YM+HGGTNF T+G P + TSYD
Sbjct: 246 WGGPHHVRSAADAAADLDRLLSAGASV-NIYMFHGGTNFGFTNGANHKHAYEPTV-TSYD 303
Query: 314 YDAPLDEYG 322
YDAPL E G
Sbjct: 304 YDAPLTESG 312
Score = 42.7 bits (99), Expect = 0.90, Method: Compositional matrix adjust.
Identities = 62/235 (26%), Positives = 91/235 (38%), Gaps = 59/235 (25%)
Query: 507 LGHALHAFINGKLVGSGYGSSSNAKVTVDFPIALAPGKNTFDLLSLTVGLQNYGAFYEKT 566
+G F++G VG + ++V P A A ++L +G NYG +
Sbjct: 404 VGDRAQVFVDGASVGVLERERHDETLSVRVPHAGA----VLEVLVENMGGVNYGP---RI 456
Query: 567 GA--GITGPVQLKGSGNGTNIDLSSQQWTYQTGLKGEELNFPSGSSTQWDSKSTLPKLQP 624
GA G+ GPV +G+ + W + + P G ST + +P
Sbjct: 457 GAPKGLLGPVSFQGT--------ELRGWECRPVPLDDLAAVPFGPSTA--TTDAVP---- 502
Query: 625 LVWYKTTF--DAPAGSEPVAIDFTGMGKGEAWVNGQSIGRYWPTYVSQNGGCTDSCNYRG 682
+++ TF D+PA + + G KG+AWVNG +GRYW
Sbjct: 503 -AFHRGTFEVDSPADT---FLSLPGWTKGQAWVNGFHLGRYW------------------ 540
Query: 683 AYSSNKCLKNCGKPSQSLYHVPRSWLKSSGNTLVLFEEIGGDPTKISFV-TKQLG 736
N G P +LY VP L+ N LVL E T+ F T LG
Sbjct: 541 ---------NRG-PQHTLY-VPAPVLRPGANELVLLELHATTGTRAQFTDTPDLG 584
>gi|156408171|ref|XP_001641730.1| predicted protein [Nematostella vectensis]
gi|156228870|gb|EDO49667.1| predicted protein [Nematostella vectensis]
Length = 647
Score = 158 bits (400), Expect = 1e-35, Method: Compositional matrix adjust.
Identities = 110/352 (31%), Positives = 165/352 (46%), Gaps = 22/352 (6%)
Query: 22 SFGANVTYDHRAVVIGGKRRVLISGSIHYPRSTPEMWPDLIQKSKDGGLDVIETYVFWNL 81
S ++ YD+ + GK ISG +HY R W D + K K G++ ++TYV WNL
Sbjct: 17 SLSFSIDYDNNCFMKDGKPFRYISGGMHYFRVPQYYWKDRLLKLKASGMNTVQTYVPWNL 76
Query: 82 HEPVRNQYNFEGRYDLVKFVKLVAEAGLYAHLRIGPYVCAEWNFGGFPLWLHFIPGIQFR 141
HEP+ QYNF G +L F+++ L LR GPY+CAEW+FGG P WL P I R
Sbjct: 77 HEPIPKQYNFAGNANLTSFLEIAQSLDLLVILRPGPYICAEWDFGGLPGWLLKDPSIVIR 136
Query: 142 TDNEPFKAEMQRFTAKIVDMMKQEKLYASQ-GGPIILSQIENEYGNI---DSAYGAAGKS 197
+ KA M+ A + ++ K + + GGP+I+ Q+ENEYG+ D Y +
Sbjct: 137 SSQG--KAYMEAVDAWMSVLLPLVKPFLYENGGPVIMVQVENEYGDYIHCDHQYMLHLQQ 194
Query: 198 YIKWAAG---MALSLDTGVPWVMCQQSDAPDPIINTCNGFYCDQFTPNSNNK------PK 248
++ + + D G + P G D P +N + P
Sbjct: 195 LFRYHLTDDIILFTTDDGSNLTAIECGTLPSLYTTVDFGANTDPSIPFANQRKLQQKGPL 254
Query: 249 MWTENWSGWFLSFGGAVPYRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNFDRTSGGPF- 307
+ +E ++GW +G R + +A A+ + + N YM+ GGTNF SG F
Sbjct: 255 VNSEFYTGWLDYWGTPHQTRTSKVVADALDKILALNASV-NLYMFEGGTNFGFWSGADFH 313
Query: 308 -----ISTSYDYDAPLDEYGLIRQPKWGHLKDLHKAIKLCEAALVATDPTYP 354
+ TSYDYDAPL E G + + + + K + L + + P YP
Sbjct: 314 GQYQPVPTSYDYDAPLTEAGDLTEKYHAIREVIGKYLTLPDIPIPPATPKYP 365
>gi|421767985|ref|ZP_16204697.1| Beta-galactosidase 3 [Lactobacillus rhamnosus LRHMDP2]
gi|421773235|ref|ZP_16209883.1| Beta-galactosidase 3 [Lactobacillus rhamnosus LRHMDP3]
gi|411182327|gb|EKS49478.1| Beta-galactosidase 3 [Lactobacillus rhamnosus LRHMDP3]
gi|411186672|gb|EKS53794.1| Beta-galactosidase 3 [Lactobacillus rhamnosus LRHMDP2]
Length = 656
Score = 158 bits (400), Expect = 1e-35, Method: Compositional matrix adjust.
Identities = 117/372 (31%), Positives = 173/372 (46%), Gaps = 44/372 (11%)
Query: 22 SFGANVTYDHRAVVIGGKRRVLISGSIHYPRSTPEMWPDLIQKSKDGGLDVIETYVFWNL 81
F + DH ++ GK ++SG+IHY R P W + K G + +ETYV WNL
Sbjct: 62 KFVTTFSIDHE-FMLDGKPFKILSGAIHYFRVHPSDWYHSLYNLKALGFNTVETYVPWNL 120
Query: 82 HEPVRNQYNFEGRYDLVKFVKLVAEAGLYAHLRIGPYVCAEWNFGGFPLWLHFIPGIQFR 141
HE +++F G D+ +F+K + GLYA +R PY+CAEW FGGFP WL ++ R
Sbjct: 121 HEYREGEFDFSGILDIERFLKTAEDLGLYAIVRPSPYICAEWEFGGFPAWL-LTKKMRLR 179
Query: 142 TDNEPFKAEMQRFTAKIVDMMKQEKLYASQGGPIILSQIENEYGNIDSAYGAAGKSYIKW 201
TD+ + + R+ ++ + ++ + GG +I+ Q+ENEYG +YG + Y+
Sbjct: 180 TDDPAYLVAIDRYYTALMPHLVDHQV--THGGNVIMMQVENEYG----SYG-EDQDYLAA 232
Query: 202 AAGMALSLDTGVPWVMCQQSDAPDP------------IINTCN-GFYCDQ--------FT 240
A + VP SD P P I+ T N G D+
Sbjct: 233 VAKLMQQHGVDVPLFT---SDGPWPATLNAGSMIDAGILATGNFGSAADKNFDRLAAFHQ 289
Query: 241 PNSNNKPKMWTENWSGWFLSFGGAVPYRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNFD 300
+ + P M E W GWF +G + R ++ A + +RG N YM+HGGTNF
Sbjct: 290 EHGRDWPLMCMEFWDGWFNRWGEPIIRRDPDETAEDLRAVIKRGSV--NLYMFHGGTNFG 347
Query: 301 RTSGG--------PFISTSYDYDAPLDEYGLIRQPKWGHLKDLHKAIKLCEAALVATDPT 352
+G P + TSYDYDAPL+E G + K +H+ + + A PT
Sbjct: 348 FMNGTSARKDHDLPQV-TSYDYDAPLNEQGNPTPKYFAIQKMIHEELPEVQQAKPLVKPT 406
Query: 353 YPSLGPNLEATV 364
L A V
Sbjct: 407 MAPASHPLTAKV 418
Score = 46.2 bits (108), Expect = 0.086, Method: Compositional matrix adjust.
Identities = 58/238 (24%), Positives = 91/238 (38%), Gaps = 58/238 (24%)
Query: 491 EPLL---EDGSKTVLHVQSLGHALHAFINGKLVGSGYGSSSNAKVTVDFPIALAPGKNTF 547
+PL+ + G+ L V + A+++ K + + Y + + D + G +
Sbjct: 454 QPLISGTDKGTPAKLRVIDARDRVQAYLDQKWLATQYQEA----IGDDILLPEVEGHHQL 509
Query: 548 DLLSLTVGLQNYGAFYEKTGAGITGPVQLKGSGNGTNIDLSSQQWTYQTGL---KGEELN 604
DLL + NYG+ I Q KG G +DL + Q L + +L
Sbjct: 510 DLLVENMSRVNYGS-------KIEAITQFKGIRTGVMVDLHFIKGYQQYPLDLNRASQLT 562
Query: 605 FPSGSSTQWDSKSTLPKLQPLVWYKTTFDAPAGSEPVAIDFTGMGKGEAWVNGQSIGRYW 664
F G W + +YK TFD A + +D G GKG VNG ++GR+W
Sbjct: 563 FTEG----WQPATP-------AFYKYTFDLTAPQD-TYLDCRGFGKGVMLVNGVNVGRFW 610
Query: 665 PTYVSQNGGCTDSCNYRGAYSSNKCLKNCGKPSQSLYHVPRSWLKSSGNTLVLFEEIG 722
+ G P+ SLY VP L + N +++FE G
Sbjct: 611 -----EKG-----------------------PTLSLY-VPAGLLHAGKNDVIVFETEG 639
>gi|422852902|ref|ZP_16899566.1| beta-galactosidase [Streptococcus sanguinis SK160]
gi|325697836|gb|EGD39720.1| beta-galactosidase [Streptococcus sanguinis SK160]
Length = 592
Score = 158 bits (400), Expect = 1e-35, Method: Compositional matrix adjust.
Identities = 107/315 (33%), Positives = 153/315 (48%), Gaps = 40/315 (12%)
Query: 36 IGGKRRVLISGSIHYPRSTPEMWPDLIQKSKDGGLDVIETYVFWNLHEPVRNQYNFEGRY 95
+ GK ++SG+I Y R P+ W D + K G + +ETY+ W LHEP Q+ EG
Sbjct: 12 LDGKPFKILSGAIQYFRLHPDQWRDTLYNLKALGFNTVETYIPWALHEPQEGQFKAEGML 71
Query: 96 DLVKFVKLVAEAGLYAHLRIGPYVCAEWNFGGFPLWLHFIPGIQFRTDNEPFKAEMQRFT 155
D + KLV E GLY +R PY+CAE++FGG P WL P ++ R ++ F ++ F
Sbjct: 72 DFEAYFKLVKETGLYLIVRPTPYICAEFDFGGLPAWLLRYPSMRLRVNHPLFLEKVSHFY 131
Query: 156 AKIVDMMKQEKLYASQGGPIILSQIENEYGNIDSAYGAAGKSYIKWAAGMALSLDTGVP- 214
+ + + + QGGPI++ Q+ENEYG+ A K+Y++ A M +P
Sbjct: 132 DWLFPKLLPYQ--SDQGGPILMMQVENEYGSY-----AEDKAYMRSIAQMMKVRGVTIPL 184
Query: 215 ------WVMCQQSDA--PDPIINTCNGFYCDQFTPNSNNK-----------PKMWTENWS 255
W+ +S D I T N + Q N++N P M TE W
Sbjct: 185 FTSDGTWIEALESGTLIEDDIFVTGN--FGSQPKENTDNLRAFMERYGKKWPLMCTEFWD 242
Query: 256 GWFLSFGGAVPYRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNF--------DRTSGGPF 307
GWF + + R EDLA V Q G N ++ GGTNF +T P
Sbjct: 243 GWFSRWSEEIVRREAEDLAQDVKEMLQLGSM--NLFLLRGGTNFGFISGCSARKTKDLPQ 300
Query: 308 ISTSYDYDAPLDEYG 322
I TSYD+DAP+ E+G
Sbjct: 301 I-TSYDFDAPITEWG 314
>gi|423346501|ref|ZP_17324189.1| hypothetical protein HMPREF1060_01861 [Parabacteroides merdae
CL03T12C32]
gi|409219652|gb|EKN12612.1| hypothetical protein HMPREF1060_01861 [Parabacteroides merdae
CL03T12C32]
Length = 780
Score = 158 bits (400), Expect = 1e-35, Method: Compositional matrix adjust.
Identities = 117/354 (33%), Positives = 168/354 (47%), Gaps = 33/354 (9%)
Query: 35 VIGGKRRVLISGSIHYPRSTPEMWPDLIQKSKDGGLDVIETYVFWNLHEPVRNQYNFEGR 94
++ GK V+ + IHY R E W IQ K G++ I Y FWN+HE +++F+G+
Sbjct: 41 LLDGKPFVIKAAEIHYTRIPAEYWQHRIQMCKALGMNTICIYAFWNIHEQKPGEFDFKGQ 100
Query: 95 YDLVKFVKLVAEAGLYAHLRIGPYVCAEWNFGGFPLWLHFIPGIQFRTDNEPFKAEMQRF 154
D+ F +L + G+Y LR GPYVC+EW GG P WL I+ RT++ F + F
Sbjct: 101 NDIAAFCRLAQKEGMYIMLRPGPYVCSEWEMGGLPWWLLKKEDIKLRTNDPYFLERTKLF 160
Query: 155 TAKIVDMMKQEKLYASQGGPIILSQIENEYG--NIDSAYGAAGKSYIKWAAGMALSLDTG 212
+I + L ++GG II+ Q+ENEYG D AY A + +K AAG T
Sbjct: 161 MNEIGKQLAD--LQVTRGGNIIMVQVENEYGAYATDKAYIANIRDAVK-AAGF-----TD 212
Query: 213 VPWVMCQ-----QSDAPDPIINTCN---GFYCD-QFTPNSNNKPK---MWTENWSGWFLS 260
VP C Q + D ++ T N G D QF +P M +E WSGWF
Sbjct: 213 VPLFQCDWSSTFQLNGLDDLVWTINFGTGANIDAQFKKLKEARPDAPLMCSEFWSGWFDH 272
Query: 261 FGGAVPYRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNFDRTSGG-----PFISTSYDYD 315
+G R + + R +F + YM HGGT F G + +SYDYD
Sbjct: 273 WGRKHETRDAGVMVSGIKDMLDRHISF-SLYMAHGGTTFGHWGGANSPAYSAMCSSYDYD 331
Query: 316 APLDEYGLIRQPKWGHLKDLHKAIKLCEAALVATDPTYPSLGPNLEATVYKTGS 369
AP+ E G PK+ L++L + ++ V P P+ P +E + G
Sbjct: 332 APISEAGWA-TPKYYKLREL--LTQYADSGQVI--PDVPAAYPLIEIPAFTVGE 380
Score = 48.9 bits (115), Expect = 0.013, Method: Compositional matrix adjust.
Identities = 60/258 (23%), Positives = 104/258 (40%), Gaps = 52/258 (20%)
Query: 492 PLLEDGSKTVLHVQSLGHALHAFINGKLVGSGYGSSSNAKVTVDFPIALAPGKNTFDLLS 551
P +++G T L + + F +GKL+G + TV P ALA G D+L
Sbjct: 417 PAVKEG--TTLLIDEVHDWAQVFADGKLLGRL--DRRRGESTVVLP-ALAAG-TRLDILV 470
Query: 552 LTVGLQNYG-AFYEKTGAGITGPVQLKGSGNGTNIDLSSQQWTYQTGLKGEELNFPSGSS 610
+G N+ A +++ G IT V+L + + W + +FP +
Sbjct: 471 EAMGRVNFDVAIHDRKG--ITDKVELISDTGRQEL----EDW--------QVYSFPVDYA 516
Query: 611 TQWDSK-STLPKLQPLVWYKTTFDAPAGSEPVAIDFTGMGKGEAWVNGQSIGRYWPTYVS 669
D K + KL +Y+TTF+ + V +D GKG WVNG+++GR+W
Sbjct: 517 FVQDKKYAAGDKLDGPAYYRTTFELDEVGD-VFLDMQTWGKGMVWVNGKAMGRFWEI--- 572
Query: 670 QNGGCTDSCNYRGAYSSNKCLKNCGKPSQSLYHVPRSWLKSSGNTLVLFEEIGGDPTKIS 729
P Q+L+ +P WLK N +++ + +G + +
Sbjct: 573 -------------------------GPQQTLF-MPGCWLKKGKNEIIILDLLGPEKAVVE 606
Query: 730 FVTKQLGSSLCSHVTDSH 747
+ + L + +H
Sbjct: 607 GRKEPILDMLRAEAPATH 624
>gi|258538519|ref|YP_003173018.1| beta-galactosidase [Lactobacillus rhamnosus Lc 705]
gi|385834266|ref|YP_005872040.1| beta-galactosidase family protein [Lactobacillus rhamnosus ATCC
8530]
gi|257150195|emb|CAR89167.1| Beta-galactosidase (GH35) [Lactobacillus rhamnosus Lc 705]
gi|355393757|gb|AER63187.1| beta-galactosidase family protein [Lactobacillus rhamnosus ATCC
8530]
Length = 593
Score = 158 bits (400), Expect = 1e-35, Method: Compositional matrix adjust.
Identities = 117/364 (32%), Positives = 172/364 (47%), Gaps = 44/364 (12%)
Query: 30 DHRAVVIGGKRRVLISGSIHYPRSTPEMWPDLIQKSKDGGLDVIETYVFWNLHEPVRNQY 89
DH ++ GK ++SG+IHY R P W + K G + +ETYV WNLHE ++
Sbjct: 7 DHE-FMLDGKPFKILSGAIHYFRVHPSDWYHSLYNLKALGFNTVETYVPWNLHEYREGEF 65
Query: 90 NFEGRYDLVKFVKLVAEAGLYAHLRIGPYVCAEWNFGGFPLWLHFIPGIQFRTDNEPFKA 149
+F G D+ +F+K + GLYA +R PY+CAEW FGGFP WL ++ RTD+ + A
Sbjct: 66 DFSGILDIERFLKTAEDLGLYAIVRPSPYICAEWEFGGFPAWL-LTKKMRLRTDDPAYLA 124
Query: 150 EMQRFTAKIVDMMKQEKLYASQGGPIILSQIENEYGNIDSAYGAAGKSYIKWAAGMALSL 209
+ R+ ++ + ++ + GG +I+ Q+ENEYG +YG + Y+ A +
Sbjct: 125 AIDRYYTALMPHLVDHQV--THGGNVIMMQVENEYG----SYG-EDQDYLAAVAKLMQQH 177
Query: 210 DTGVPWVMCQQSDAPDP------------IINTCN-GFYCDQ--------FTPNSNNKPK 248
VP SD P P I+ T N G D+ + + P
Sbjct: 178 GVDVPLFT---SDGPWPATLNAGSMIDAGILATGNFGSAADKNFDRLAAFHQEHGRDWPL 234
Query: 249 MWTENWSGWFLSFGGAVPYRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNFDRTSGG--- 305
M E W GWF +G + R ++ A + +RG N YM+HGGTNF +G
Sbjct: 235 MCMEFWDGWFNRWGEPIIRRDPDETAEDLRAVIKRGSV--NLYMFHGGTNFGFMNGTSAR 292
Query: 306 -----PFISTSYDYDAPLDEYGLIRQPKWGHLKDLHKAIKLCEAALVATDPTYPSLGPNL 360
P + TSYDYDAPL+E G + K +H+ + + A PT L
Sbjct: 293 KDHDLPQV-TSYDYDAPLNEQGNPTPKYFAIQKMIHEELPEVQQAKPLVKPTMAPASHPL 351
Query: 361 EATV 364
A V
Sbjct: 352 TAKV 355
Score = 44.7 bits (104), Expect = 0.20, Method: Compositional matrix adjust.
Identities = 58/238 (24%), Positives = 90/238 (37%), Gaps = 58/238 (24%)
Query: 491 EPLL---EDGSKTVLHVQSLGHALHAFINGKLVGSGYGSSSNAKVTVDFPIALAPGKNTF 547
+PL+ + G+ L V + A+++ K + + Y + + D + G +
Sbjct: 391 QPLISGTDKGTPAKLRVIDARDRVQAYLDQKWLATQYQEA----IGDDILLPEVEGHHQL 446
Query: 548 DLLSLTVGLQNYGAFYEKTGAGITGPVQLKGSGNGTNIDLSSQQWTYQTGL---KGEELN 604
DLL + NYG+ I Q KG G +DL + Q L + L
Sbjct: 447 DLLVENMSRVNYGS-------KIEAITQFKGIRTGVMVDLHFIKGYQQYPLDLNRASRLT 499
Query: 605 FPSGSSTQWDSKSTLPKLQPLVWYKTTFDAPAGSEPVAIDFTGMGKGEAWVNGQSIGRYW 664
F G W + +YK TFD A + +D G GKG VNG ++GR+W
Sbjct: 500 FTEG----WQPATP-------AFYKYTFDLTAPQD-TYLDCHGFGKGVMLVNGVNVGRFW 547
Query: 665 PTYVSQNGGCTDSCNYRGAYSSNKCLKNCGKPSQSLYHVPRSWLKSSGNTLVLFEEIG 722
+ G P+ SLY VP L + N +++FE G
Sbjct: 548 -----EKG-----------------------PTLSLY-VPAGLLHAGKNDVIVFETEG 576
>gi|301767332|ref|XP_002919083.1| PREDICTED: beta-galactosidase-like [Ailuropoda melanoleuca]
Length = 668
Score = 158 bits (400), Expect = 1e-35, Method: Compositional matrix adjust.
Identities = 116/352 (32%), Positives = 168/352 (47%), Gaps = 38/352 (10%)
Query: 27 VTYDHRAVVIGGKRRVLISGSIHYPRSTPEMWPDLIQKSKDGGLDVIETYVFWNLHEPVR 86
+ Y H + G+ ISGSIHY R W D + K K GL+ I++YV WN HEP
Sbjct: 35 IDYSHNRFLKDGRPFRYISGSIHYFRVPRFYWKDRLLKMKMAGLNAIQSYVPWNFHEPQP 94
Query: 87 NQYNFEGRYDLVKFVKLVAEAGLYAHLRIGPYVCAEWNFGGFPLWLHFIPGIQFRTDNEP 146
QY F G +D+ F+KL E GL LR GPY+CAEW+ GG P WL I R+ +
Sbjct: 95 GQYQFSGEHDVEYFIKLAHELGLLVILRPGPYICAEWDMGGLPAWLLLKESIILRSSDPD 154
Query: 147 FKAEMQRFTAKIVDMMKQEKLYASQGGPIILSQIENEYGNIDSAYGAAGKSYIKWAAGM- 205
+ A + ++ ++ MK L GGPII Q+ENEYG +Y + ++++ +
Sbjct: 155 YLAAVDKWLGVLLPKMK--PLLYQNGGPIITVQVENEYG----SYFSCDYDHLRFLQKLF 208
Query: 206 --ALSLDTGVPWVMCQQSDAPDPIINTC---NGFYCD-QFTPNSN-------------NK 246
L D V+ +D + C G Y F P +N
Sbjct: 209 HYHLGND-----VLLFTTDGAHEMFLKCGALQGLYATVDFGPGANITAAFEIQRKSEPRG 263
Query: 247 PKMWTENWSGWFLSFGGAVPYRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNFDRTSGG- 305
P + +E ++GW +G E +A A+ RG N YM+ GGTNF +G
Sbjct: 264 PLVNSEFYTGWLDHWGQPHSTAKTEVVASALHEILSRGANV-NLYMFIGGTNFAYWNGAN 322
Query: 306 -PFIS--TSYDYDAPLDEYGLIRQPKWGHLKD-LHKAIKLCEAALVATDPTY 353
P+ + TSYDYDAPL E G + + K+ L+D + K K+ E + + P +
Sbjct: 323 MPYQAQPTSYDYDAPLSEAGDLTE-KYFALRDVIRKFEKVPEGFIPPSTPKF 373
>gi|154490061|ref|ZP_02030322.1| hypothetical protein PARMER_00290 [Parabacteroides merdae ATCC
43184]
gi|423723056|ref|ZP_17697209.1| hypothetical protein HMPREF1078_01269 [Parabacteroides merdae
CL09T00C40]
gi|154089210|gb|EDN88254.1| glycosyl hydrolase family 35 [Parabacteroides merdae ATCC 43184]
gi|409241481|gb|EKN34249.1| hypothetical protein HMPREF1078_01269 [Parabacteroides merdae
CL09T00C40]
Length = 780
Score = 158 bits (400), Expect = 1e-35, Method: Compositional matrix adjust.
Identities = 117/354 (33%), Positives = 168/354 (47%), Gaps = 33/354 (9%)
Query: 35 VIGGKRRVLISGSIHYPRSTPEMWPDLIQKSKDGGLDVIETYVFWNLHEPVRNQYNFEGR 94
++ GK V+ + IHY R E W IQ K G++ I Y FWN+HE +++F+G+
Sbjct: 41 LLDGKPFVIKAAEIHYTRIPAEYWQHRIQMCKALGMNTICIYAFWNIHEQKPGEFDFKGQ 100
Query: 95 YDLVKFVKLVAEAGLYAHLRIGPYVCAEWNFGGFPLWLHFIPGIQFRTDNEPFKAEMQRF 154
D+ F +L + G+Y LR GPYVC+EW GG P WL I+ RT++ F + F
Sbjct: 101 NDIAAFCRLAQKEGMYIMLRPGPYVCSEWEMGGLPWWLLKKEDIKLRTNDPYFLERTKLF 160
Query: 155 TAKIVDMMKQEKLYASQGGPIILSQIENEYG--NIDSAYGAAGKSYIKWAAGMALSLDTG 212
+I + L ++GG II+ Q+ENEYG D AY A + +K AAG T
Sbjct: 161 MNEIGKQLAD--LQVTRGGNIIMVQVENEYGAYATDKAYIANIRDAVK-AAGF-----TD 212
Query: 213 VPWVMCQ-----QSDAPDPIINTCN---GFYCD-QFTPNSNNKPK---MWTENWSGWFLS 260
VP C Q + D ++ T N G D QF +P M +E WSGWF
Sbjct: 213 VPLFQCDWSSTFQLNGLDDLVWTINFGTGANIDAQFKKLKEARPDAPLMCSEFWSGWFDH 272
Query: 261 FGGAVPYRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNFDRTSGG-----PFISTSYDYD 315
+G R + + R +F + YM HGGT F G + +SYDYD
Sbjct: 273 WGRKHETRDAGVMVSGIKDMLDRHISF-SLYMAHGGTTFGHWGGANSPAYSAMCSSYDYD 331
Query: 316 APLDEYGLIRQPKWGHLKDLHKAIKLCEAALVATDPTYPSLGPNLEATVYKTGS 369
AP+ E G PK+ L++L + ++ V P P+ P +E + G
Sbjct: 332 APISEAGWA-TPKYYKLREL--LTQYADSGQVI--PDVPAAYPLIEIPAFTVGE 380
Score = 48.5 bits (114), Expect = 0.015, Method: Compositional matrix adjust.
Identities = 60/258 (23%), Positives = 104/258 (40%), Gaps = 52/258 (20%)
Query: 492 PLLEDGSKTVLHVQSLGHALHAFINGKLVGSGYGSSSNAKVTVDFPIALAPGKNTFDLLS 551
P +++G T L + + F +GKL+G + TV P ALA G D+L
Sbjct: 417 PAVKEG--TTLLIDEVHDWAQVFADGKLLGRL--DRRRGENTVVLP-ALAAG-TRLDILV 470
Query: 552 LTVGLQNYG-AFYEKTGAGITGPVQLKGSGNGTNIDLSSQQWTYQTGLKGEELNFPSGSS 610
+G N+ A +++ G IT V+L + + W + +FP +
Sbjct: 471 EAMGRVNFDVAIHDRKG--ITDKVELISDTGRQEL----EDW--------QVYSFPVDYA 516
Query: 611 TQWDSK-STLPKLQPLVWYKTTFDAPAGSEPVAIDFTGMGKGEAWVNGQSIGRYWPTYVS 669
D K + KL +Y+TTF+ + V +D GKG WVNG+++GR+W
Sbjct: 517 FVQDKKYAAGDKLDGPAYYRTTFELDEVGD-VFLDMQTWGKGMVWVNGKAMGRFWEI--- 572
Query: 670 QNGGCTDSCNYRGAYSSNKCLKNCGKPSQSLYHVPRSWLKSSGNTLVLFEEIGGDPTKIS 729
P Q+L+ +P WLK N +++ + +G + +
Sbjct: 573 -------------------------GPQQTLF-MPGCWLKKGKNEIIILDLLGPEKAVVE 606
Query: 730 FVTKQLGSSLCSHVTDSH 747
+ + L + +H
Sbjct: 607 GRKEPILDMLRAEAPATH 624
>gi|21232326|ref|NP_638243.1| beta-galactosidase [Xanthomonas campestris pv. campestris str. ATCC
33913]
gi|21114096|gb|AAM42167.1| beta-galactosidase [Xanthomonas campestris pv. campestris str. ATCC
33913]
Length = 613
Score = 158 bits (400), Expect = 1e-35, Method: Compositional matrix adjust.
Identities = 111/331 (33%), Positives = 161/331 (48%), Gaps = 47/331 (14%)
Query: 35 VIGGKRRVLISGSIHYPRSTPEMWPDLIQKSKDGGLDVIETYVFWNLHEPVRNQYNFEGR 94
V GK ++SG+IH+ R W D +QK++ GL+ +ETYVFWNL EP + Q++F
Sbjct: 40 VRDGKPYQVLSGAIHFQRIPRTYWKDRLQKARALGLNTVETYVFWNLVEPQQGQFDFNAN 99
Query: 95 YDLVKFVKLVAEAGLYAHLRIGPYVCAEWNFGGFPLWLHFIPGIQFRTDNEPFKAEMQRF 154
D+ FV+ A GL LR GPY CAEW GG+P WL I+ R+ + F A Q +
Sbjct: 100 NDVAAFVREAAAQGLNVILRPGPYACAEWEAGGYPAWLFGKDNIRIRSRDPRFLAASQSY 159
Query: 155 TAKIVDMMKQEKLYASQGGPIILSQIENEYGNIDSAYGAAGKSYIKWAAGMALSLDTGVP 214
+ ++ L GGPII Q+ENEYG+ D + +Y+ A A+ + G
Sbjct: 160 LDAVAQQVR--PLLNHNGGPIIAVQVENEYGSYDDDH-----AYM--ADNRAMFVKAGFD 210
Query: 215 WVMCQQSDAPDPIIN-TCNGFYC-------------DQFTPNSNNKPKMWTENWSGWFLS 260
+ SD D + N T G D+ ++P+M E W+GWF
Sbjct: 211 KALLFTSDGADMLANGTLPGTLAVVNFAPGEAKSAFDKLIKFQPDQPRMVGEYWAGWFDH 270
Query: 261 FGGAVPY------RPVEDLAFAVARFFQRGGTFQNYYMYHGGTNFDRTSGGPF------- 307
+G P+ + E+L + + R G N YM+ GGT+F +G F
Sbjct: 271 WG--TPHASTNAKQQTEELEWIL-----RQGHSANLYMFIGGTSFGFMNGANFQGNPSDH 323
Query: 308 ---ISTSYDYDAPLDEYGLIRQPKWGHLKDL 335
+TSYDYDA LDE G PK+ ++D+
Sbjct: 324 YAPQTTSYDYDAILDEAGR-PTPKFALMRDV 353
>gi|328721397|ref|XP_003247292.1| PREDICTED: beta-galactosidase-like [Acyrthosiphon pisum]
Length = 628
Score = 158 bits (400), Expect = 1e-35, Method: Compositional matrix adjust.
Identities = 115/341 (33%), Positives = 165/341 (48%), Gaps = 47/341 (13%)
Query: 16 VVLATTSFGANVTYDHRAVVIGGKRRVLISGSIHYPRSTPEMWPDLIQKSKDGGLDVIET 75
+VL T+ V Y+ + G+ +SGS+HY R W D IQK K GL+ I T
Sbjct: 6 IVLRTSKPTFTVDYERNEFLKDGQVFRYVSGSLHYFRVPKPYWKDRIQKMKAAGLNAIST 65
Query: 76 YVFWNLHEPVRNQYNFEGRYDLVKFVKLVAEAGLYAHLRIGPYVCAEWNFGGFPLWL-HF 134
YV W+LHEP +YNF+ DL F++LV + G+Y LR GPY+CAE +FGGFP WL +
Sbjct: 66 YVEWSLHEPYPGEYNFDDIADLEYFLQLVKDEGMYLLLRPGPYICAERDFGGFPFWLLNV 125
Query: 135 IPGIQFRTDNEPFKAEMQRFTAKIVDMMKQEKLYASQGGPIILSQIENEYGNIDSAYGAA 194
+P + RT++ +K + ++ V M K ++ GG II+ Q+ENEYG +Y A
Sbjct: 126 VPKKRLRTNDPSYKHYVTKWFN--VLMPKIDRFLYGNGGNIIMVQVENEYG----SYNAC 179
Query: 195 GKSYIKWAAGM--------ALSLDT---GVPWVMCQQSDAPDPIINTCNGF------YCD 237
+ Y+ W + AL T G + C PD G C
Sbjct: 180 DQEYMLWLRDLYKRYVGYKALLYTTDGCGYSYFTC--GAIPDVYATVDFGASVKDVSQCF 237
Query: 238 QFTPNSNNK-PKMWTENWSGWFLSFGGAVP----YRPVEDLAFAVARFFQRGGTFQNYYM 292
++ + + P + +E ++GW + P Y VE + +A N+YM
Sbjct: 238 KYMRTTQKRGPLVNSEYYAGWLSHWREPSPVISSYEVVETMKDMLAL-----NASINFYM 292
Query: 293 YHGGTNFDRTSGGPFIS-----------TSYDYDAPLDEYG 322
+HGGTNF TSG TSYDY++PLDE G
Sbjct: 293 FHGGTNFGFTSGANKYESLKNPDYLPQLTSYDYNSPLDEAG 333
Score = 42.4 bits (98), Expect = 1.2, Method: Compositional matrix adjust.
Identities = 22/43 (51%), Positives = 30/43 (69%), Gaps = 3/43 (6%)
Query: 627 WYKTTFDAPAG-SEPV--AIDFTGMGKGEAWVNGQSIGRYWPT 666
+YKT F P G ++P+ +D TG KG A+VNG +IGRYWP+
Sbjct: 530 FYKTQFKLPDGLTKPLDTYLDVTGWKKGVAFVNGINIGRYWPS 572
>gi|170034404|ref|XP_001845064.1| beta-galactosidase [Culex quinquefasciatus]
gi|167875697|gb|EDS39080.1| beta-galactosidase [Culex quinquefasciatus]
Length = 650
Score = 158 bits (400), Expect = 1e-35, Method: Compositional matrix adjust.
Identities = 112/325 (34%), Positives = 160/325 (49%), Gaps = 39/325 (12%)
Query: 27 VTYDHRAVVIGGKRRVLISGSIHYPRSTPEMWPDLIQKSKDGGLDVIETYVFWNLHEPVR 86
+ YD V+ GK +SGS HY R+ P+ W ++ + GGL+ ++ YV W+LH P
Sbjct: 37 IDYDRDTFVMDGKDFRYVSGSFHYFRALPQTWRSKLRTMRAGGLNAVDLYVQWSLHNPKD 96
Query: 87 NQYNFEGRYDLVKFVKLVAEAGLYAHLRIGPYVCAEWNFGGFPLWL-HFIPGIQFRTDNE 145
NQY ++G ++ ++ E LY LR GPY+CAE + GG P WL + PGIQ R +
Sbjct: 97 NQYVWDGIANITDVIEAAIEEDLYVILRPGPYICAEIDNGGLPYWLFNKYPGIQVRISDA 156
Query: 146 PFKAEMQRFTAKIVDMMKQEKLYA-SQGGPIILSQIENEYGNIDSAYGAAGKSYI----- 199
+ E++ + K +M Q Y GGPII+ Q+ENEYG A+G K Y+
Sbjct: 157 NYIKEVKIWYEK---LMSQLTPYMYGNGGPIIMVQLENEYG----AFGKCDKQYLNVLKE 209
Query: 200 ---KWAAGMALSLDTGVPW---VMCQQSDAPDPIINTCNGFYCDQFTPNSNNK------- 246
K+ G A+ P+ ++C Q P I T G D K
Sbjct: 210 ETEKYTQGKAVLFTVDRPYDDELVCGQ--IPGVFITTDFGLMTDDEVDTHAAKVRSIQPK 267
Query: 247 -PKMWTENWSGWFLSFGGAVPYRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNFDRTSG- 304
P + TE ++GW + RP LA A R + G ++YMY GGTNF +G
Sbjct: 268 GPLVNTEFYTGWLTHWQEKNQRRPAGPLA-ATLRKMLKDGWNVDFYMYFGGTNFGFWAGA 326
Query: 305 -----GPFIS--TSYDYDAPLDEYG 322
G +++ TSYDYDAP+DE G
Sbjct: 327 NDWGLGKYMADITSYDYDAPMDEAG 351
>gi|158301280|ref|XP_550752.3| AGAP002055-PA [Anopheles gambiae str. PEST]
gi|157012394|gb|EAL38488.3| AGAP002055-PA [Anopheles gambiae str. PEST]
Length = 657
Score = 158 bits (399), Expect = 1e-35, Method: Compositional matrix adjust.
Identities = 108/324 (33%), Positives = 161/324 (49%), Gaps = 37/324 (11%)
Query: 27 VTYDHRAVVIGGKRRVLISGSIHYPRSTPEMWPDLIQKSKDGGLDVIETYVFWNLHEPVR 86
+ Y+ V+ GK ++GS HY R+ PE W ++ + GGL+ ++ YV W+LH P
Sbjct: 45 IDYERDTFVMDGKDFRYVAGSFHYFRALPETWRTKLRTLRAGGLNAVDLYVQWSLHNPRD 104
Query: 87 NQYNFEGRYDLVKFVKLVAEAGLYAHLRIGPYVCAEWNFGGFPLWL-HFIPGIQFRTDNE 145
YN+EG ++ ++ E LY LR GPY+CAE + GG P WL + PGI RT +
Sbjct: 105 GVYNWEGIANVTDIIEAAIEEDLYVILRPGPYICAEIDNGGLPYWLFNKYPGIAVRTSDA 164
Query: 146 PFKAEMQRFTAKIVDMMKQEKLYASQGGPIILSQIENEYGNIDSAYGAAGKSYI------ 199
+ E++++ ++ M + E GGPII+ QIENEYG A+G K Y+
Sbjct: 165 NYLEEVRKWYGEL--MSRMEPYMYGNGGPIIMVQIENEYG----AFGKCDKPYLNFLKQQ 218
Query: 200 --KWAAGMALSLDTGVPW---VMCQQSDAPDPIINTCNGFYCDQFTPNSNNK-------- 246
++ A+ P+ + C Q D I T G ++ K
Sbjct: 219 TERYVQDKAVLFTVDRPYDDEIGCGQIDG--VFITTDFGLMTEEEVDTHAAKVRSYQPKG 276
Query: 247 PKMWTENWSGWFLSFGGAVPYRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNFDRTSG-- 304
P + TE ++GW + + RP + LA A R R G ++YMY GGTNF +G
Sbjct: 277 PLVNTEFYTGWLTHWQESNQRRPAQPLA-ATLRKMLRDGWNVDFYMYFGGTNFGFWAGAN 335
Query: 305 ----GPFIS--TSYDYDAPLDEYG 322
G +++ TSYDYDAP+DE G
Sbjct: 336 DWGLGKYMADITSYDYDAPMDEAG 359
>gi|332375542|gb|AEE62912.1| unknown [Dendroctonus ponderosae]
Length = 454
Score = 158 bits (399), Expect = 1e-35, Method: Compositional matrix adjust.
Identities = 114/358 (31%), Positives = 172/358 (48%), Gaps = 50/358 (13%)
Query: 20 TTSFGANVTYDHRAVVIGGKRRVLISGSIHYPRSTPEMWPDLIQKSKDGGLDVIETYVFW 79
+T + + + + K + SG++HY R W D ++K + GL+ +ETYV W
Sbjct: 20 STGINSGLNANQSFFTLNDKLIKIYSGAMHYFRVPRPYWRDRLRKIRAAGLNTVETYVPW 79
Query: 80 NLHEPVRNQYNF-------EGRYDLVKFVKLVAEAGLYAHLRIGPYVCAEWNFGGFPLWL 132
NLHEP +++F E L +F+ E L+ LR GPY+C+E+N GGFP WL
Sbjct: 80 NLHEPENGKFDFGEGGSEFEDFLHLEEFLNAAKEEDLFVILRTGPYICSEYNSGGFPSWL 139
Query: 133 HFIPGIQFRTDNEPFKAEMQRFTAKIVDMMKQEKLYASQGGPIILSQIENEYGNIDSAYG 192
+ FRT E + + RF ++ ++ + GGP+I Q+ENEYGN+++ G
Sbjct: 140 LREKPMGFRTSEENYMKFVTRFFNVVLTLLAAFQF--QLGGPVIAFQVENEYGNLEN--G 195
Query: 193 AA---GKSYIKWAAGMAL---------SLDT--------GVPWVMCQQSDAPDPIINTCN 232
AA K Y++ + L S D+ +P + Q ++ D +N N
Sbjct: 196 AAFQPDKVYMEELRQLFLKNGIVELLTSADSPLWKGTSGTLPGELFQTANFGDNAVNQLN 255
Query: 233 GFYCDQFTPNSNNKPKMWTENWSGWFLSFGGAVPYRPVEDLAFAVARFFQRGGTFQNYYM 292
++F P +P M E W GWF + GG + ED + F + +F N YM
Sbjct: 256 K--LEEFQP---GRPLMVMEYWIGWFDNVGGEHSVKSDEDSRRVLEDIFSKNASF-NAYM 309
Query: 293 YHGGTNF------------DRTSGGPFISTSYDYDAPLDEYGLIRQPKWGHLKDLHKA 338
+HGGTNF SG I+TSYDYDAP+ E G R K+ +K+L A
Sbjct: 310 FHGGTNFWFNNGANLDNDLMDNSGYTAITTSYDYDAPISESGGYRN-KYFIVKELVAA 366
>gi|327282153|ref|XP_003225808.1| PREDICTED: beta-galactosidase-like [Anolis carolinensis]
Length = 649
Score = 158 bits (399), Expect = 1e-35, Method: Compositional matrix adjust.
Identities = 118/359 (32%), Positives = 166/359 (46%), Gaps = 32/359 (8%)
Query: 16 VVLATTSFGANVTYDHRAVVIGGKRRVLISGSIHYPRSTPEMWPDLIQKSKDGGLDVIET 75
V+ + +FG + Y H + G+ ISGSIHY R W D + K K GLD I+T
Sbjct: 23 VITSQRTFG--IDYGHNCFLKDGQPFRYISGSIHYSRIPRYYWKDRLLKMKMAGLDAIQT 80
Query: 76 YVFWNLHEPVRNQYNFEGRYDLVKFVKLVAEAGLYAHLRIGPYVCAEWNFGGFPLWLHFI 135
YV WN HEP R YNF G DL F++L E GL LR GPY+CAEW+ GG P WL
Sbjct: 81 YVPWNFHEPERGVYNFTGDRDLEYFLQLAQEVGLLVILRAGPYICAEWDMGGLPAWLLEK 140
Query: 136 PGIQFRTDNEPFKAEMQRFTAKIVDMMKQEKLYASQGGPIILSQIENEYGNIDSAYGAAG 195
I R+ + + + + + MK LY GGPII+ Q+ENEYG +Y A
Sbjct: 141 ESIVLRSSDPDYLTAVGSWMGIFLPKMKPH-LY-QNGGPIIMVQVENEYG----SYFACD 194
Query: 196 KSYIKWAAGMALSLDTGVPWVMCQQSDAPDPIINTC---NGFYCD-QFTPNSN------- 244
Y+++ + V+ +D C G Y F P N
Sbjct: 195 FDYLRYLQNLFRQYLGDE--VVLFTTDGASMFYLRCGALQGLYSTVDFGPGRNVTAAFST 252
Query: 245 ------NKPKMWTENWSGWFLSFGGAVPYRPVEDLAFAVARFFQRGGTFQNYYMYHGGTN 298
P + +E ++GW +G P +A +++ G N YM+ GGTN
Sbjct: 253 QRHTEPKGPLVNSEFYTGWLDHWGHRHITVPASIVAKSLSEILASGANV-NMYMFIGGTN 311
Query: 299 FDRTSGG--PFIS--TSYDYDAPLDEYGLIRQPKWGHLKDLHKAIKLCEAALVATDPTY 353
F +G P+++ TSYDYDAPL E G + + + + + KL E + T P +
Sbjct: 312 FGYWNGANMPYMAQPTSYDYDAPLSEAGDLTEKYFAIREVIGMFKKLPEGPIPPTTPKF 370
>gi|417923406|ref|ZP_12566873.1| glycosyl hydrolase family 35 [Streptococcus mitis SK569]
gi|342837055|gb|EGU71256.1| glycosyl hydrolase family 35 [Streptococcus mitis SK569]
Length = 595
Score = 158 bits (399), Expect = 1e-35, Method: Compositional matrix adjust.
Identities = 116/345 (33%), Positives = 170/345 (49%), Gaps = 40/345 (11%)
Query: 36 IGGKRRVLISGSIHYPRSTPEMWPDLIQKSKDGGLDVIETYVFWNLHEPVRNQYNFEGRY 95
+ GK ++SG+IHY R PE W + K G + +ETYV WNLHEP ++NFEG
Sbjct: 12 LDGKPFKILSGAIHYFRIPPEDWSHSLYNLKALGFNTVETYVAWNLHEPREGEFNFEGAL 71
Query: 96 DLVKFVKLVAEAGLYAHLRIGPYVCAEWNFGGFPLWLHFIPGIQFRTDNEPFKAEMQRFT 155
DL +F+++ + GLYA +R P++CAEW FGG P WL ++ R+ + + + R+
Sbjct: 72 DLERFLQIAQDLGLYAIVRPSPFICAEWEFGGLPAWL-LTKDMRIRSSDPAYIEAVGRYY 130
Query: 156 AKIVDMMKQEKLYASQGGPIILSQIENEYGNI--DSAYGAAGKSYIKWAAGMALSLDTGV 213
+++ + L +GG I++ Q+ENEYG+ D AY A + ++ +
Sbjct: 131 DQLLSRLVPHLL--DKGGNILMMQVENEYGSYGEDKAYLRAIRHLMEERGVTCPLFTSDG 188
Query: 214 PWVMCQQSDA--PDPIINTCN-------GFYCDQ--FTPNSNNKPKMWTENWSGWFLSFG 262
PW ++ D + T N F Q F + P M E W GWF +
Sbjct: 189 PWRATLKAGTLIEDDLFVTGNFGSKAPYNFSQMQEFFDEHGKKWPLMCMEFWDGWFNRWK 248
Query: 263 GAVPYRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNFDRTSGG--------PFISTSYDY 314
+ R ++LA AV ++G N YM+HGGTNF +G P + TSYDY
Sbjct: 249 EPIITRDPKELADAVREVLEQGSI--NLYMFHGGTNFGFMNGCSARGTLDLPQV-TSYDY 305
Query: 315 DAPLDEYGLIRQPKWGHLKDLHKAIKLCEAALVATD-PTYPSLGP 358
DA LDE G P +L A+K ++AT P YP L P
Sbjct: 306 DALLDEEG---NPTAKYL-----AVK----KMMATHFPEYPQLEP 338
>gi|22760724|dbj|BAC11309.1| unnamed protein product [Homo sapiens]
Length = 636
Score = 158 bits (399), Expect = 1e-35, Method: Compositional matrix adjust.
Identities = 112/310 (36%), Positives = 153/310 (49%), Gaps = 17/310 (5%)
Query: 43 LISGSIHYPRSTPEMWPDLIQKSKDGGLDVIETYVFWNLHEPVRNQYNFEGRYDLVKFVK 102
+ GSIHY R E W D + K K GL+ + TYV WNLHEP R +++F G D FV
Sbjct: 63 IFGGSIHYFRVPREYWRDRLLKMKACGLNTLTTYVPWNLHEPERGKFDFSGNLDQEAFVL 122
Query: 103 LVAEAGLYAHLRIGPYVCAEWNFGGFPLWLHFIPGIQFRTDNEPFKAEMQRFTAKIVDMM 162
+ AE GL+ LR GPY+C+E + GG P WL PG++ RT + F + + + M
Sbjct: 123 MAAEIGLWVILRPGPYICSEMDLGGLPSWLLQDPGMRLRTTYKGFTEAVDLYFDHL--MS 180
Query: 163 KQEKLYASQGGPIILSQIENEYG--NIDSAYGAAGKSYIKWAAGMALSLDT----GVPWV 216
+ L +GGPII Q+ENEYG N D AY K ++ + L L + G+
Sbjct: 181 RVVPLQYKRGGPIIAVQVENEYGSYNKDPAYMPYVKKALEDRGIVELLLTSDNKDGLSKG 240
Query: 217 MCQQSDAPDPIINTCNGFYCDQFTPN-SNNKPKMWTENWSGWFLSFGGAVPYRPVEDLAF 275
+ Q A + +T F N +PKM E W+GWF S+GG ++
Sbjct: 241 IVQGVLATINLQSTHELQLLTTFLFNVQGTQPKMVMEYWTGWFDSWGGPHNILDSSEVLK 300
Query: 276 AVARFFQRGGTFQNYYMYHGGTNFDRTSGGPFIS------TSYDYDAPLDEYGLIRQPKW 329
V+ G + N YM+HGGTNF +G TSYDYDA L E G K+
Sbjct: 301 TVSAIVDAGSSI-NLYMFHGGTNFGFMNGAMHFHDYKSDVTSYDYDAVLTEAG-DYTAKY 358
Query: 330 GHLKDLHKAI 339
L+D +I
Sbjct: 359 MKLRDFFGSI 368
>gi|288928311|ref|ZP_06422158.1| beta-galactosidase (Lactase) [Prevotella sp. oral taxon 317 str.
F0108]
gi|288331145|gb|EFC69729.1| beta-galactosidase (Lactase) [Prevotella sp. oral taxon 317 str.
F0108]
Length = 674
Score = 158 bits (399), Expect = 1e-35, Method: Compositional matrix adjust.
Identities = 112/330 (33%), Positives = 159/330 (48%), Gaps = 38/330 (11%)
Query: 35 VIGGKRRVLISGSIHYPRSTPEMWPDLIQKSKDGGLDVIETYVFWNLHEPVRNQYNFE-G 93
V GK L SG +HY R W ++ K GL+ + TYVFWN HE +++++ G
Sbjct: 90 VYNGKPMQLHSGEMHYARVPAPYWRHRMKMMKAMGLNAVATYVFWNYHETEPGKWDWKTG 149
Query: 94 RYDLVKFVKLVAEAGLYAHLRIGPYVCAEWNFGGFPLWLHFIPGIQFRTDNEPFKAEMQR 153
+L +FVK AE G+ LR GPY CAEW FGG+P WL G+ R DN+PF +
Sbjct: 150 NRNLRQFVKTAAEEGMLVILRPGPYCCAEWEFGGYPWWLSKAKGLVIRADNQPFLDSCRV 209
Query: 154 FTAKIVDMMKQEKLYASQGGPIILSQIENEYGN-----------IDSAYGAAGKSYIKWA 202
+ ++ M+ L ++GGPII+ Q ENE+G+ AY A K +
Sbjct: 210 YINQLASQMRD--LQITKGGPIIMVQAENEFGSYVAQRKDIPLETHRAYSAKIKQQL-LD 266
Query: 203 AGMALSLDTGV-PWVMCQQSDAPDPIINTCNGF--------YCDQFTPNSNNKPKMWTEN 253
AG + L T W+ + + + T NG +++ N P M E
Sbjct: 267 AGFDVPLFTSDGSWLF--KGGTIEGALPTANGESDIEKLKKVVNEY--NGGKGPYMVAEF 322
Query: 254 WSGWFLSFGGAVPYRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNFDRTSGGPFIS---- 309
+ GW + P E + A++ + G +F NYYM HGGTNF TSG + +
Sbjct: 323 YPGWLSHWAEPFPQVSTESIVKQTAKYLENGISF-NYYMVHGGTNFGFTSGANYTTATNL 381
Query: 310 ----TSYDYDAPLDEYGLIRQPKWGHLKDL 335
TSYDYDAP+ E G PK+ L+ L
Sbjct: 382 QPDLTSYDYDAPISEAGW-NTPKYDALRAL 410
Score = 47.0 bits (110), Expect = 0.043, Method: Compositional matrix adjust.
Identities = 54/225 (24%), Positives = 88/225 (39%), Gaps = 50/225 (22%)
Query: 501 VLHVQSLGHALHAFINGKLVGSGYGSSSNAKVTVDFPIALAPGKNTFDLLSLTVGLQNYG 560
+L V L ++NG+ VG S + ++ P D+L +G NYG
Sbjct: 483 MLKVAGLADYALVYVNGQKVGELDRVSDVDSIEINVPF-----NGVLDILVENMGRINYG 537
Query: 561 AFYEKTGAGITGPVQLKGSGNGTNIDLSSQQWTYQTGLKGEELNFPSGSSTQWDSKSTLP 620
A ++ GI GPV + G+ +++ Y+ + P ++ + LP
Sbjct: 538 ARITQSIKGINGPVVIDGN------EITGNWQMYKLPMN----EVPDVNALPTANNKGLP 587
Query: 621 KLQPLVWYKTTFDAPAGSEPVAIDFTGMGKGEAWVNGQSIGRYWPTYVSQNGGCTDSCNY 680
L Y TF+ + ++ GKG +VNG ++GRYW
Sbjct: 588 TL-----YSGTFNLDTTGD-TFLNMETWGKGIVFVNGINLGRYWK--------------- 626
Query: 681 RGAYSSNKCLKNCGKPSQSLYHVPRSWLKSSGNTLVLFEEIGGDP 725
RG P Q+LY +P +LK N +V+FE+ P
Sbjct: 627 RG-------------PQQTLY-LPGCFLKKGENKIVVFEQQNDTP 657
>gi|413954159|gb|AFW86808.1| putative RAN GTPase activating family protein [Zea mays]
Length = 449
Score = 158 bits (399), Expect = 1e-35, Method: Compositional matrix adjust.
Identities = 101/241 (41%), Positives = 132/241 (54%), Gaps = 18/241 (7%)
Query: 322 GLIRQPKWGHLKDLHKAIKLCEAALVATDPTYPSLGPNLEATVYKTG-SGLCSAFLANIG 380
G IRQPK+GHLKDLH I+ E LV S G N T Y G S +C F+ N
Sbjct: 200 GNIRQPKYGHLKDLHDLIRSMEKILVHGKYNDTSYGKNAIVTKYTYGGSSVC--FINNQF 257
Query: 381 TNSDVTVKFNGNSYLLPAWSVSILPDCKNVVFNTAKINSVTLVPSFSRQSLQVAADSSDA 440
+ DV V G ++L+PAWSVSILPDCK V +NTAKI + T V S++ ++
Sbjct: 258 VDRDVKVTLGGGTHLVPAWSVSILPDCKTVAYNTAKIKTQTSVMVKKANSVEKELEALR- 316
Query: 441 IGSGWSYINE---PVGISKDDAFTKPGLLEQINTTADQSDYLWYSLSTNIKADEPLLEDG 497
WS++ E P D+F + LLEQI T+ DQSDYLWY S K +G
Sbjct: 317 ----WSWMPENLKPFMTDHRDSFRQSQLLEQIATSTDQSDYLWYRTSLEHKG------EG 366
Query: 498 SKTVLHVQSLGHALHAFINGKLVGSGYGSSSNAKVTVDFPIALAPGKNTFDLLSLTVGLQ 557
S T L+V + GH ++ F+NG+LVG Y + + P+ L GKN LLS TVGL+
Sbjct: 367 SYT-LYVNTSGHEMYVFVNGRLVGQNYSADGAFVFQLQSPVKLHSGKNYVSLLSGTVGLK 425
Query: 558 N 558
+
Sbjct: 426 S 426
>gi|325567414|ref|ZP_08144081.1| beta-galactosidase [Enterococcus casseliflavus ATCC 12755]
gi|325158847|gb|EGC70993.1| beta-galactosidase [Enterococcus casseliflavus ATCC 12755]
Length = 591
Score = 158 bits (399), Expect = 1e-35, Method: Compositional matrix adjust.
Identities = 120/334 (35%), Positives = 162/334 (48%), Gaps = 45/334 (13%)
Query: 35 VIGGKRRVLISGSIHYPRSTPEMWPDLIQKSKDGGLDVIETYVFWNLHEPVRNQYNFEGR 94
++ GK LISG+IHY R TP W D + K G + +ETY+ WNLHEP Y+FEG
Sbjct: 11 LLDGKPIKLISGAIHYFRMTPAQWTDSLYNLKALGANTVETYIPWNLHEPREGVYDFEGM 70
Query: 95 YDLVKFVKLVAEAGLYAHLRIGPYVCAEWNFGGFPLWLHFIPGIQFRTDNEPFKAEMQRF 154
D+ FVK GL LR Y+CAEW FGG P WL P ++ R+ + F A+++ +
Sbjct: 71 KDICAFVKQAQTLGLMVILRPSVYICAEWEFGGLPAWLLNEP-MRLRSTDPRFMAKVRNY 129
Query: 155 TAKIVDMMKQEKLYASQGGPIILSQIENEYGNIDSAYGAAGKSYIKWAAGMALSLDTGVP 214
V + K L + GGP+I+ Q+ENEYG +YG K+Y++ + VP
Sbjct: 130 FQ--VLLPKLVPLQITHGGPVIMMQVENEYG----SYGME-KAYLRQTKELMEEYGIDVP 182
Query: 215 -------W--VMCQQSDAPDPIINTCN---------GFYCDQFTPNSNNKPKMWTENWSG 256
W V+ + D I T N + + N P M E W G
Sbjct: 183 LFTSDGAWEEVLDAGTLIEDDIFVTGNFGSRSKENAAVMKEFMAKHGKNWPIMCMEYWDG 242
Query: 257 WFLSFGGAVPYRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNFDRTSGG--------PFI 308
WF +G + R +DLA V G N YM+HGGTNF +G P +
Sbjct: 243 WFNRWGEPIIKRDGQDLANEVKEMLAVGSL--NLYMFHGGTNFGFYNGCSARGALDLPQV 300
Query: 309 STSYDYDAPLDEYGLIRQP--KWGHLKDLHKAIK 340
S SYDYDA L E G +P K+ H++ KAIK
Sbjct: 301 S-SYDYDALLTEAG---EPTDKYYHVQ---KAIK 327
Score = 52.4 bits (124), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 65/233 (27%), Positives = 87/233 (37%), Gaps = 53/233 (22%)
Query: 502 LHVQSLGHALHAFINGKLVGSGYGSSSNAKVTVDFPIALAPGKNT--FDLLSLTVGLQNY 559
L V LH F +G+L Y + ++ I P K T D+L +G NY
Sbjct: 401 LKVVEASDRLHIFTDGQLQAIQYQETLGEELL----IQGTPDKETIELDVLVENLGRVNY 456
Query: 560 GAFYEKTGAGITGPVQLKGSGNGTNIDLSSQQ--WTYQTGLKGEELNFPSGSSTQWDSKS 617
G + GP Q KG G D+ Q Y L E+L Q
Sbjct: 457 GF-------KLNGPTQAKGIRGGIMQDIHFHQGYRHYPLMLSAEQLQ---AIDYQAGKNP 506
Query: 618 TLPKLQPLVWYKTTFDAPAGSEPVAIDFTGMGKGEAWVNGQSIGRYWPTYVSQNGGCTDS 677
T P +Y+TTF + ID G GKG VNG ++GRYW Q G
Sbjct: 507 THPS-----FYQTTFRLTEVGDTF-IDCRGYGKGVVIVNGINLGRYW-----QRG----- 550
Query: 678 CNYRGAYSSNKCLKNCGKPSQSLYHVPRSWLKSSGNTLVLFEEIGGDPTKISF 730
P SLY P+ +LK N +V+FE G + ++ F
Sbjct: 551 ------------------PVHSLY-CPKEFLKKGSNEVVVFETDGVEIKELVF 584
>gi|229553373|ref|ZP_04442098.1| beta-galactosidase [Lactobacillus rhamnosus LMS2-1]
gi|229313254|gb|EEN79227.1| beta-galactosidase [Lactobacillus rhamnosus LMS2-1]
Length = 583
Score = 158 bits (399), Expect = 2e-35, Method: Compositional matrix adjust.
Identities = 115/359 (32%), Positives = 170/359 (47%), Gaps = 43/359 (11%)
Query: 35 VIGGKRRVLISGSIHYPRSTPEMWPDLIQKSKDGGLDVIETYVFWNLHEPVRNQYNFEGR 94
++ GK ++SG+IHY R P W + K G + +ETYV WNLHE +++F G
Sbjct: 1 MLDGKPFKILSGAIHYFRVHPSDWYHSLYNLKALGFNTVETYVPWNLHEYREGEFDFSGI 60
Query: 95 YDLVKFVKLVAEAGLYAHLRIGPYVCAEWNFGGFPLWLHFIPGIQFRTDNEPFKAEMQRF 154
D+ +F+K + GLYA +R PY+CAEW FGGFP WL ++ RTD+ + A + R+
Sbjct: 61 LDIERFLKTAEDLGLYAIVRPSPYICAEWEFGGFPAWL-LTKKMRLRTDDPAYLAAIDRY 119
Query: 155 TAKIVDMMKQEKLYASQGGPIILSQIENEYGNIDSAYGAAGKSYIKWAAGMALSLDTGVP 214
++ + ++ + GG +I+ Q+ENEYG +YG + Y+ A + VP
Sbjct: 120 YTALMPHLVDHQV--THGGNVIMMQVENEYG----SYG-EDQDYLAAVAKLMQQHGVDVP 172
Query: 215 WVMCQQSDAPDP------------IINTCN-GFYCDQ--------FTPNSNNKPKMWTEN 253
SD P P I+ T N G D+ + + P M E
Sbjct: 173 LFT---SDGPWPATLNAGSMIDAGILATGNFGSAADKNFDRLAAFHQEHGRDWPLMCMEF 229
Query: 254 WSGWFLSFGGAVPYRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNFDRTSGG-------- 305
W GWF +G + R ++ A + +RG N YM+HGGTNF +G
Sbjct: 230 WDGWFNRWGEPIIRRDPDETAEDLRAVIKRGSV--NLYMFHGGTNFGFMNGTSARKDHDL 287
Query: 306 PFISTSYDYDAPLDEYGLIRQPKWGHLKDLHKAIKLCEAALVATDPTYPSLGPNLEATV 364
P + TSYDYDAPL+E G + K +H+ + + A PT L A V
Sbjct: 288 PQV-TSYDYDAPLNEQGNPTPKYFAIQKMIHEELPEVQQAKPLVKPTMAPASHPLTAKV 345
Score = 45.1 bits (105), Expect = 0.19, Method: Compositional matrix adjust.
Identities = 58/238 (24%), Positives = 90/238 (37%), Gaps = 58/238 (24%)
Query: 491 EPLL---EDGSKTVLHVQSLGHALHAFINGKLVGSGYGSSSNAKVTVDFPIALAPGKNTF 547
+PL+ + G+ L V + A+++ K + + Y + + D + G +
Sbjct: 381 QPLISGTDKGTPAKLRVIDARDRVQAYLDQKWLATQYQEA----IGDDILLPEVEGHHQL 436
Query: 548 DLLSLTVGLQNYGAFYEKTGAGITGPVQLKGSGNGTNIDLSSQQWTYQTGL---KGEELN 604
DLL + NYG+ I Q KG G +DL + Q L + L
Sbjct: 437 DLLVENMSRVNYGS-------KIEAITQFKGIRTGVMVDLHFIKGYQQYPLDLNRASRLT 489
Query: 605 FPSGSSTQWDSKSTLPKLQPLVWYKTTFDAPAGSEPVAIDFTGMGKGEAWVNGQSIGRYW 664
F G W + +YK TFD A + +D G GKG VNG ++GR+W
Sbjct: 490 FTEG----WQPATP-------AFYKYTFDLTAPQD-TYLDCHGFGKGVMLVNGVNVGRFW 537
Query: 665 PTYVSQNGGCTDSCNYRGAYSSNKCLKNCGKPSQSLYHVPRSWLKSSGNTLVLFEEIG 722
+ G P+ SLY VP L + N +++FE G
Sbjct: 538 -----EKG-----------------------PTLSLY-VPAGLLHAGKNDVIVFETEG 566
>gi|380693434|ref|ZP_09858293.1| beta-galactosidase [Bacteroides faecis MAJ27]
Length = 778
Score = 158 bits (399), Expect = 2e-35, Method: Compositional matrix adjust.
Identities = 126/386 (32%), Positives = 183/386 (47%), Gaps = 43/386 (11%)
Query: 1 MASKEILLLVLCWGFVVLATTSFGANVTYDH-----RAVVIGGKRRVLISGSIHYPRSTP 55
M +K I LLVL F V+ +S A T ++ GK V+ + +HY R
Sbjct: 1 MKNKLIALLVL---FTVIFFSSAEAQTTARKFEAGKNTFLLDGKPFVVKAAELHYTRIPQ 57
Query: 56 EMWPDLIQKSKDGGLDVIETYVFWNLHEPVRNQYNFEGRYDLVKFVKLVAEAGLYAHLRI 115
W I+ K G++ I Y+FWN+HE +++F G+ D+ F + + G+Y +R
Sbjct: 58 AYWDHRIEMCKALGMNTICIYIFWNIHEQEEGKFDFTGQNDIAAFCRAAQKHGMYVIVRP 117
Query: 116 GPYVCAEWNFGGFPLWLHFIPGIQFRTDNEPFKAEMQRFTAKIVDMMKQ-EKLYASQGGP 174
GPYVCAEW GG P WL + RT + + M+R + ++ KQ L ++GG
Sbjct: 118 GPYVCAEWEMGGLPWWLLKKKDVALRTLDPYY---MERVGIFMKEVGKQLAPLQVNKGGN 174
Query: 175 IILSQIENEYGN--IDSAYGAAGKSYIKWAAGMALSLDTGVPWVMCQQS-----DAPDPI 227
II+ Q+ENEYG+ D Y +A + ++ +G T VP C S +A D +
Sbjct: 175 IIMVQVENEYGSYGTDKPYVSAVRDLVR-ESGF-----TDVPLFQCDWSSNFTRNALDDL 228
Query: 228 INTCN---GFYCD-QFTPNSNNKPK---MWTENWSGWFLSFGGAVPYRPVEDLAFAVARF 280
I T N G D QF +P+ M +E WSGWF +G RP +D+ +
Sbjct: 229 IWTINFGTGANIDQQFKKLKELRPETPLMCSEFWSGWFDHWGRKHETRPAKDMVQGIKEM 288
Query: 281 FQRGGTFQNYYMYHGGTNFDRTSGG-----PFISTSYDYDAPLDEYGLIRQPKWGHLKDL 335
R +F + YM HGGT F G + +SYDYDAP+ E G + K+ L+DL
Sbjct: 289 LDRNISF-SLYMTHGGTTFGHWGGANNPAYSAMCSSYDYDAPISEAGWTTE-KYFLLRDL 346
Query: 336 HKAIKLCEAALVATDPTYPSLGPNLE 361
K AL P P P +E
Sbjct: 347 LKTYLPAGEAL----PEVPDALPVIE 368
>gi|422694237|ref|ZP_16752232.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX4244]
gi|315148319|gb|EFT92335.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX4244]
Length = 593
Score = 158 bits (399), Expect = 2e-35, Method: Compositional matrix adjust.
Identities = 114/351 (32%), Positives = 163/351 (46%), Gaps = 44/351 (12%)
Query: 35 VIGGKRRVLISGSIHYPRSTPEMWPDLIQKSKDGGLDVIETYVFWNLHEPVRNQYNFEGR 94
++ G+ +ISG+IHY R TP W D + K G + +ETY+ WN+HEP Y+FEG
Sbjct: 12 LLNGQPIKIISGAIHYFRMTPRQWEDSLYNLKALGANTVETYIPWNIHEPEEGVYDFEGM 71
Query: 95 YDLVKFVKLVAEAGLYAHLRIGPYVCAEWNFGGFPLWLHFIPGIQFRTDNEPFKAEMQRF 154
++ FV+L + L LR Y+CAEW FGG P WL G++ R+ + F +++ +
Sbjct: 72 KNIEAFVRLAEKLNLLVILRPSAYICAEWEFGGLPAWLLKEKGVRLRSTDPIFMTKVRNY 131
Query: 155 TAKIVDMMKQEKLYASQGGPIILSQIENEYGNIDSAYGAAGKSYIKWAAGMALSLDTGVP 214
V + K L +QGGP+I+ Q+ENEYG +YG K+Y++ + L VP
Sbjct: 132 FQ--VLLPKLAPLQITQGGPVIMMQVENEYG----SYGME-KAYLRQTKQIMEELGIEVP 184
Query: 215 WVMCQQSDAPDPIINTCNGFYCDQF--------------------TPNSNNKPKMWTENW 254
+ A + +++ D F T + P M E W
Sbjct: 185 --LFTSDGAWEEVLDAGTLIEEDVFVTGNFGSHSKENAAVLKKFMTRHGKKWPLMCMEYW 242
Query: 255 SGWFLSFGGAVPYRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNFDRTSGG--------P 306
GWF +G V +R DLA V G N YM+HGGTNF +G P
Sbjct: 243 DGWFNRWGEPVIHREGTDLAKEVKDMLAVGSL--NLYMFHGGTNFGFYNGCSARGEKDLP 300
Query: 307 FISTSYDYDAPLDEYGLIRQPKWGHLKDLHKAIKLCEAALVATDPTYPSLG 357
+ TSYDYDA L E G + + + KAIK + P LG
Sbjct: 301 QV-TSYDYDALLTEAGEPTEKYYA----VQKAIKEVCPEVWQAQPRTKKLG 346
Score = 57.0 bits (136), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 58/232 (25%), Positives = 88/232 (37%), Gaps = 45/232 (19%)
Query: 502 LHVQSLGHALHAFINGKLVGSGYGSSSNAKVTVDFPIALAPGKNTFDLLSLTVGLQNYGA 561
L V LH +++G L + Y + ++ L G+ D L+L + ++N G
Sbjct: 403 LKVVEASDRLHIYVDGDLAATQYQETVGEEL-------LILGQTEKDTLALDILVENLGR 455
Query: 562 FYEKTGAGITGPVQLKGSGNGTNIDLSSQQWTYQTGLKGEELNFPSGSSTQWDSKSTLPK 621
G + P Q KG G D+ Q G + L F + D +
Sbjct: 456 V--NYGFKLNNPTQSKGIRGGVMQDIHFHQ-----GYQHYPLTFSQEQLAKIDYTAGKNP 508
Query: 622 LQPLVWYKTTFDAPAGSEPVAIDFTGMGKGEAWVNGQSIGRYWPTYVSQNGGCTDSCNYR 681
LQP +Y+ TF+ ++ ID G GKG VNG +GRYW
Sbjct: 509 LQP-SFYQVTFELEQLAD-TYIDCRGYGKGFVVVNGHHLGRYWEI--------------- 551
Query: 682 GAYSSNKCLKNCGKPSQSLYHVPRSWLKSSGNTLVLFEEIGGDPTKISFVTK 733
P SLY P+ +L+ N +V+FE G D + F +
Sbjct: 552 -------------GPIHSLY-CPKEFLQQGQNEVVIFETEGIDIEYLKFTNQ 589
>gi|303233304|ref|ZP_07319975.1| beta-galactosidase family protein [Atopobium vaginae PB189-T1-4]
gi|302480604|gb|EFL43693.1| beta-galactosidase family protein [Atopobium vaginae PB189-T1-4]
Length = 643
Score = 157 bits (398), Expect = 2e-35, Method: Compositional matrix adjust.
Identities = 106/309 (34%), Positives = 160/309 (51%), Gaps = 29/309 (9%)
Query: 36 IGGKRRVLISGSIHYPRSTPEMWPDLIQKSKDGGLDVIETYVFWNLHEPVRNQYNFEGRY 95
+ GK ++SG+IHY R P W + K G + +ETY+ WN+HEP+ + F+G
Sbjct: 14 LNGKPWKILSGAIHYFRIHPSDWEHSLYNLKALGFNTVETYIPWNIHEPIPGTFMFDGMC 73
Query: 96 DLVKFVKLVAEAGLYAHLRIGPYVCAEWNFGGFPLWLHFIPGIQFRTDNEPFKAEMQRFT 155
++ F++L A GLYA +R PY+CAEW GG P WL G++ R+ + + + +Q +
Sbjct: 74 NIEHFLELAAACGLYAIVRPSPYICAEWEMGGLPAWL-LTKGVRLRSSDPAYLSYVQSYY 132
Query: 156 AKIVDMMKQEKLYASQGGPIILSQIENEYGNI--DSAYGAAGKSYIKWAAGMALSLDTGV 213
+++ + +L S GG I++ Q+ENEYG+ DS+Y + + ++AG+ + L T
Sbjct: 133 DELLPRLVPHQL--SCGGNILMFQVENEYGSYGEDSSY-LTSLANMMYSAGITMPLCTSD 189
Query: 214 -PWVMCQQSDA--PDPIINTCN-GFYCDQ--------FTPNSNNKPKMWTENWSGWFLSF 261
PW C +S + I+ T N G + + F ++ P M E W GWF +
Sbjct: 190 GPWDACLESGSLIDSNILPTGNFGSHAHENFAAMRRFFARHNKVFPIMCMEFWDGWFSRW 249
Query: 262 GGAVPYRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNFDRTSGG--------PFISTSYD 313
V R V D V + G N YM+HGGTNF +G P I TSYD
Sbjct: 250 NEDVVTRKVTDFTEDVRETMEEGSI--NLYMFHGGTNFSCMNGCSARYDSDLPQI-TSYD 306
Query: 314 YDAPLDEYG 322
Y APL+E G
Sbjct: 307 YGAPLNEQG 315
>gi|374312360|ref|YP_005058790.1| glycoside hydrolase family protein [Granulicella mallensis
MP5ACTX8]
gi|358754370|gb|AEU37760.1| glycoside hydrolase family 35 [Granulicella mallensis MP5ACTX8]
Length = 627
Score = 157 bits (398), Expect = 2e-35, Method: Compositional matrix adjust.
Identities = 116/358 (32%), Positives = 166/358 (46%), Gaps = 37/358 (10%)
Query: 6 ILLLVLCWGFVV-LATTSFGANVTYDHRAVVIGGKRRVLISGSIHYPRSTPEMWPDLIQK 64
I LL L G V T+ A +T ++ K ++SG + Y R W D ++K
Sbjct: 17 ITLLPLLSGAVRGQVATASAAPLTVGTSGFLLKDKPFRIVSGELEYARIPRPYWRDRLRK 76
Query: 65 SKDGGLDVIETYVFWNLHEPVRNQYNFEGRYDLVKFVKLVAEAGLYAHLRIGPYVCAEWN 124
+ GL+ I YVFWN+HEP Y+F G+ D+ +FV+ + GLY LR GPYVCAEW+
Sbjct: 77 AHAMGLNAITIYVFWNIHEPTPEVYDFSGQNDVAEFVREAQQEGLYVILRPGPYVCAEWD 136
Query: 125 FGGFPLWLHFIPGIQFRTDNEPFKAEMQRFTAKIVDMMKQEKLYASQGGPIILSQIENEY 184
GG+P WL ++ R+ FKA R+ ++ + L AS+GGPI+ Q+ENEY
Sbjct: 137 LGGYPAWLLKDHEMKLRSLQPEFKAAATRWMLRLGQELT--PLQASRGGPILAVQVENEY 194
Query: 185 GNIDSAYGAAGKSYIKWAAGMALS-------LDTGVPWVMCQQSDAPDPIINTCNGF--- 234
G+ + Y+KW + L L TG + +Q P G
Sbjct: 195 GSFGDDH-----EYMKWVHELVLQAGFGGSLLYTGDGADVLKQGTLPSVFAGIDFGTGDA 249
Query: 235 -----YCDQFTPNSNNKPKMWTENWSGWFLSFGGAVPYRPVEDLAFAVARFFQRGGTFQN 289
F P + P E W GWF +G + ++G + +
Sbjct: 250 ARSIKLYKAFRPQT---PVYVAEYWDGWFDHWGEKHQLTDAAKQETEIRSMLEQGDSI-S 305
Query: 290 YYMYHGGTNFDRTSGG--------PFISTSYDYDAPLDEYGLIRQPKWGHLKDLHKAI 339
YM HGGT+F +G P +S SYDYDAPLDE G R PK+ L+++ I
Sbjct: 306 LYMVHGGTSFGWMNGANNDHDGYQPDVS-SYDYDAPLDESGRPR-PKYFRLRNIINEI 361
>gi|384428898|ref|YP_005638258.1| beta-galactosidase [Xanthomonas campestris pv. raphani 756C]
gi|341938001|gb|AEL08140.1| beta-galactosidase [Xanthomonas campestris pv. raphani 756C]
Length = 613
Score = 157 bits (398), Expect = 2e-35, Method: Compositional matrix adjust.
Identities = 111/331 (33%), Positives = 161/331 (48%), Gaps = 47/331 (14%)
Query: 35 VIGGKRRVLISGSIHYPRSTPEMWPDLIQKSKDGGLDVIETYVFWNLHEPVRNQYNFEGR 94
V GK ++SG+IH+ R W D +QK++ GL+ +ETYVFWNL EP + Q++F
Sbjct: 40 VRDGKPYQVLSGAIHFQRIPRAYWKDRLQKARALGLNTVETYVFWNLVEPQQGQFDFNAN 99
Query: 95 YDLVKFVKLVAEAGLYAHLRIGPYVCAEWNFGGFPLWLHFIPGIQFRTDNEPFKAEMQRF 154
D+ FV+ A GL LR GPY CAEW GG+P WL I+ R+ + F A Q +
Sbjct: 100 NDVAAFVREAAAQGLNVILRPGPYACAEWEAGGYPAWLFGKDNIRVRSRDPRFLAASQSY 159
Query: 155 TAKIVDMMKQEKLYASQGGPIILSQIENEYGNIDSAYGAAGKSYIKWAAGMALSLDTGVP 214
+ ++ L GGPII Q+ENEYG+ D + +Y+ A A+ + G
Sbjct: 160 LDAVAQQVR--PLLNHNGGPIIAVQVENEYGSYDDDH-----AYM--ADNRAMFVKAGFD 210
Query: 215 WVMCQQSDAPDPIIN-TCNGFYC-------------DQFTPNSNNKPKMWTENWSGWFLS 260
+ SD D + N T G D+ ++P+M E W+GWF
Sbjct: 211 KALLFTSDGADMLANGTLPGTLAVVNFAPGEAKSAFDKLIKFQPDQPRMVGEYWAGWFDH 270
Query: 261 FGGAVPY------RPVEDLAFAVARFFQRGGTFQNYYMYHGGTNFDRTSGGPF------- 307
+G P+ + E+L + + R G N YM+ GGT+F +G F
Sbjct: 271 WG--TPHASTNAKQQTEELEWIL-----RQGHSANLYMFIGGTSFGFMNGANFQGNPSDH 323
Query: 308 ---ISTSYDYDAPLDEYGLIRQPKWGHLKDL 335
+TSYDYDA LDE G PK+ ++D+
Sbjct: 324 YAPQTTSYDYDAILDEAGR-PTPKFALMRDV 353
>gi|32709094|gb|AAP86763.1| beta-galactosidase Gal35I [Xanthomonas campestris pv. campestris]
Length = 613
Score = 157 bits (398), Expect = 2e-35, Method: Compositional matrix adjust.
Identities = 114/322 (35%), Positives = 159/322 (49%), Gaps = 31/322 (9%)
Query: 35 VIGGKRRVLISGSIHYPRSTPEMWPDLIQKSKDGGLDVIETYVFWNLHEPVRNQYNFEGR 94
V GK ++SG+IH+ R W D +QK++ GL+ +ETYVFWNL EP + Q++F
Sbjct: 40 VRDGKPYQVLSGAIHFQRIPRAYWKDRLQKARALGLNTVETYVFWNLVEPQQGQFDFNAN 99
Query: 95 YDLVKFVKLVAEAGLYAHLRIGPYVCAEWNFGGFPLWLHFIPGIQFRTDNEPFKAEMQRF 154
D+ FV+ A GL LR GPY CAEW GG+P WL I+ R+ + F A Q +
Sbjct: 100 NDVAAFVREAAAQGLNVILRPGPYACAEWETGGYPAWLFGKDNIRVRSRDPRFLAASQAY 159
Query: 155 TAKIVDMMKQEKLYASQGGPIILSQIENEYGNIDS--AYGAAGKS-YIKWAAGMALSLDT 211
+ + L GGPII Q+ENEYG+ D AY A ++ Y+K AL L T
Sbjct: 160 LDAVSKQV--HPLLNHNGGPIIAVQVENEYGSYDDDHAYMADNRAMYVKAGFDDAL-LFT 216
Query: 212 GVPWVMCQQSDAPD--PIINTCNG---FYCDQFTPNSNNKPKMWTENWSGWFLSFG---- 262
M PD ++N G D+ ++P+M E W+GWF +G
Sbjct: 217 SDGADMLANGTLPDTLAVVNFAPGEAKSAFDKLIKFRPDQPRMVGEYWAGWFDHWGKPHA 276
Query: 263 GAVPYRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNFDRTSGGPF----------ISTSY 312
+ E+L + + R G N YM+ GGT+F +G F +TSY
Sbjct: 277 STDAKQQTEELEWIL-----RQGHSANLYMFIGGTSFGFMNGANFQGNPSDHYAPQTTSY 331
Query: 313 DYDAPLDEYGLIRQPKWGHLKD 334
DYDA LDE G PK+ ++D
Sbjct: 332 DYDAILDEAGRA-TPKFALMRD 352
>gi|401681814|ref|ZP_10813709.1| glycosyl hydrolase family 35 [Streptococcus sp. AS14]
gi|400185120|gb|EJO19350.1| glycosyl hydrolase family 35 [Streptococcus sp. AS14]
Length = 592
Score = 157 bits (398), Expect = 2e-35, Method: Compositional matrix adjust.
Identities = 107/315 (33%), Positives = 153/315 (48%), Gaps = 40/315 (12%)
Query: 36 IGGKRRVLISGSIHYPRSTPEMWPDLIQKSKDGGLDVIETYVFWNLHEPVRNQYNFEGRY 95
+ G+ ++SG+I Y R P+ W D + K G + +ETY+ W LHEP Q+ EG
Sbjct: 12 LDGQPFKILSGAIQYFRLHPDQWRDTLYNLKALGFNTVETYIPWALHEPQEGQFQAEGML 71
Query: 96 DLVKFVKLVAEAGLYAHLRIGPYVCAEWNFGGFPLWLHFIPGIQFRTDNEPFKAEMQRFT 155
D + KLV E GLY +R PY+CAE++FGG P WL P ++ R ++ F ++ F
Sbjct: 72 DFEAYFKLVEEMGLYLIVRPTPYICAEFDFGGLPAWLLRYPSMRLRVNHPLFLEKVSHFY 131
Query: 156 AKIVDMMKQEKLYASQGGPIILSQIENEYGNIDSAYGAAGKSYIKWAAGMALSLDTGVP- 214
+ + + + QGGPI++ Q+ENEYG+ A K+Y++ A M VP
Sbjct: 132 DWLFPKLLPYQ--SDQGGPILMMQVENEYGSY-----AEDKAYMRSIAQMMKVRGVTVPL 184
Query: 215 ------WVMCQQSDA--PDPIINTCNGFYCDQFTPNSNNK-----------PKMWTENWS 255
W+ +S D I T N + Q N++N P M TE W
Sbjct: 185 FTSDGTWIEALESGTLIEDDIFVTGN--FGSQPKENTDNLRSFMERYGKKWPLMCTEFWD 242
Query: 256 GWFLSFGGAVPYRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNF--------DRTSGGPF 307
GWF + + R EDLA V Q G N ++ GGTNF +T P
Sbjct: 243 GWFSRWSEEIVRREAEDLAQDVKEMLQLGSM--NLFLLRGGTNFGFISGCSARKTKDLPQ 300
Query: 308 ISTSYDYDAPLDEYG 322
I TSYD+DAP+ E+G
Sbjct: 301 I-TSYDFDAPITEWG 314
>gi|119962102|ref|YP_948531.1| beta-galactosidase [Arthrobacter aurescens TC1]
gi|119948961|gb|ABM07872.1| beta-galactosidase [Arthrobacter aurescens TC1]
Length = 598
Score = 157 bits (398), Expect = 2e-35, Method: Compositional matrix adjust.
Identities = 120/388 (30%), Positives = 174/388 (44%), Gaps = 55/388 (14%)
Query: 25 ANVTYDHRAVVIGGKRRVLISGSIHYPRSTPEMWPDLIQKSKDGGLDVIETYVFWNLHEP 84
A ++Y + G+ +++G+IHY R P++W D +++ K G + ++TYV WN H+P
Sbjct: 4 ALLSYHDAVLYRSGEPYRILAGAIHYFRVHPDLWQDRLRRLKAMGANTVDTYVAWNFHQP 63
Query: 85 VRNQY-NFEGRYDLVKFVKLVAEAGLYAHLRIGPYVCAEWNFGGFPLWLHFIPGIQFRTD 143
R++ +F G DL +F+ L AE GL +R GPY+CAEW+ GGFP L IPGI R
Sbjct: 64 KRDEAPDFSGWRDLGRFMDLAAEEGLDVIVRPGPYICAEWDNGGFPSCLTGIPGIGLRCM 123
Query: 144 NEPFKAEMQRFTAKIVDMMKQEKLYASQGGPIILSQIENEYGNIDSAYGAAGKSYIKW-- 201
+ F A ++ + ++ ++ + S GGP++ QIENEYG+ + YI+W
Sbjct: 124 DPVFTAAIEEWFDHLLPIVASRQ--TSAGGPVVAVQIENEYGSYGDDH-----EYIRWNR 176
Query: 202 ---------------AAGMALSLDTGV---PWVMCQQSDAPDPIINTCNGFYCDQFTPNS 243
G LD G W D + T +
Sbjct: 177 RALEERGITELLFTADGGTDYFLDGGAVEGTWATATLGSRGDEAVAT--------WQRRR 228
Query: 244 NNKPKMWTENWSGWFLSFGGAVPYRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNFDRTS 303
+P E W GWF +G R ED A + GG+ YM HGGTNF S
Sbjct: 229 PGEPFFNVEFWGGWFDHWGEHHHGRDAEDAALEARKMLDLGGSL-CAYMAHGGTNFGLRS 287
Query: 304 G----GPFIS---TSYDYDAPLDEYGLIRQPKWGHLKDLHKA-----IKLCEAALVATDP 351
G G + TSYD DAP+ E G + K+ ++A + AAL+A P
Sbjct: 288 GSNHDGTMLQPTVTSYDSDAPIAENGALTPKFHAFRKEFYRAQGVDDLPELPAALLADAP 347
Query: 352 TYP------SLGPNLEATVYKTGSGLCS 373
P S GP L V G + S
Sbjct: 348 VLPAQSLPLSPGPELLELVRDAGKPVSS 375
>gi|390469877|ref|XP_002807335.2| PREDICTED: LOW QUALITY PROTEIN: beta-galactosidase-1-like protein
2-like [Callithrix jacchus]
Length = 718
Score = 157 bits (398), Expect = 2e-35, Method: Compositional matrix adjust.
Identities = 110/310 (35%), Positives = 153/310 (49%), Gaps = 17/310 (5%)
Query: 43 LISGSIHYPRSTPEMWPDLIQKSKDGGLDVIETYVFWNLHEPVRNQYNFEGRYDLVKFVK 102
+ GSIHY R E W D + K K GL+ + TYV WNLHEP R +++F G DL F+
Sbjct: 145 IFGGSIHYFRVPKEYWRDRLLKMKACGLNTLTTYVPWNLHEPERGKFDFSGNLDLEAFIL 204
Query: 103 LVAEAGLYAHLRIGPYVCAEWNFGGFPLWLHFIPGIQFRTDNEPFKAEMQRFTAKIVDMM 162
+ +E GL+ LR GPY+C+E + GG P WL PG++ RT + F + + + M
Sbjct: 205 MASEIGLWXILRPGPYICSEIDLGGLPSWLLQDPGMRLRTTYKGFTEAVDLYFDHL--MS 262
Query: 163 KQEKLYASQGGPIILSQIENEYG--NIDSAYGAAGKSYIKWAAGMALSLDT----GVPWV 216
+ L +GGPII Q+ENEYG N D AY K ++ + L L + G+
Sbjct: 263 RVVPLQYKRGGPIIAVQVENEYGSYNKDPAYMPYVKKALEDRGIVELLLTSDNKDGLSKG 322
Query: 217 MCQQSDAPDPIINTCNGFYCDQFTPN-SNNKPKMWTENWSGWFLSFGGAVPYRPVEDLAF 275
+ A + +T F N +PKM E W+GWF S+GG ++
Sbjct: 323 IVHGVLATINLQSTHELQLLTTFLFNVQGTQPKMVMEYWTGWFDSWGGPHNILDSSEVLK 382
Query: 276 AVARFFQRGGTFQNYYMYHGGTNFDRTSGGPFIS------TSYDYDAPLDEYGLIRQPKW 329
V+ G + N YM+HGGTNF +G TSYDYDA L E G K+
Sbjct: 383 TVSAIVDAGSSI-NLYMFHGGTNFGFMNGAMHFHDYKSDVTSYDYDAVLTEAG-DYTAKY 440
Query: 330 GHLKDLHKAI 339
L+D +I
Sbjct: 441 MKLRDFFGSI 450
>gi|320162379|ref|YP_004175604.1| beta-galactosidase [Anaerolinea thermophila UNI-1]
gi|319996233|dbj|BAJ65004.1| beta-galactosidase [Anaerolinea thermophila UNI-1]
Length = 583
Score = 157 bits (398), Expect = 2e-35, Method: Compositional matrix adjust.
Identities = 105/316 (33%), Positives = 164/316 (51%), Gaps = 21/316 (6%)
Query: 36 IGGKRRVLISGSIHYPRSTPEMWPDLIQKSKDGGLDVIETYVFWNLHEPVRNQYNFEGRY 95
+ G+ +++G++HY R P W D + K K GL+ +ETYV WNLHEP +++F
Sbjct: 13 LDGEPFRILAGAMHYFRVHPAYWKDRLLKLKAMGLNTVETYVAWNLHEPHEGEFHFGDWL 72
Query: 96 DLVKFVKLVAEAGLYAHLRIGPYVCAEWNFGGFPLWLHFIPGIQFRTDNEPFKAEMQRFT 155
++ ++++L E GLY +R GPY+CAEW GG P WL P ++ R +P+ + +
Sbjct: 73 NIERYIELAGELGLYVIVRPGPYICAEWEMGGLPAWLLKDPQMKLRCMYQPYLDAVGEYF 132
Query: 156 AKIVDMMKQEKLYASQGGPIILSQIENEYGNI--DSAYGAAGKSYIKWAAGMALSLDT-- 211
+++ M + L +++GGPII Q+ENEYG+ D+ Y + ++ G+ + L T
Sbjct: 133 SQL--MHRLVPLQSTRGGPIIAMQVENEYGSYGNDTRYLKYLEELLR-QCGVDVLLFTAD 189
Query: 212 GVPWVMCQQSDAPDPI--INTCN--GFYCDQFTPNSNNKPKMWTENWSGWFLSFGGAVPY 267
GV M Q P +N N G ++ P + E W GWF +G
Sbjct: 190 GVADEMMQYGSLPHLFKAVNFGNRPGDAFEKLREYQTGGPLLVAEFWDGWFDHWGERHHT 249
Query: 268 RPVEDLAFAVARFFQRGGTFQNYYMYHGGTNFDRTSGG-----PFIS---TSYDYDAPLD 319
R ++A + G + N YM+HGGTNF +G P + TSYDYDAPL
Sbjct: 250 RSAGEVARVLDDLLSEGASV-NLYMFHGGTNFGFMNGANAFPSPHYTPTVTSYDYDAPLS 308
Query: 320 EYGLIRQPKWGHLKDL 335
E G I PK+ ++++
Sbjct: 309 ECGNI-TPKYEAMREV 323
>gi|257870316|ref|ZP_05649969.1| glycosyl hydrolase [Enterococcus gallinarum EG2]
gi|257804480|gb|EEV33302.1| glycosyl hydrolase [Enterococcus gallinarum EG2]
Length = 593
Score = 157 bits (398), Expect = 2e-35, Method: Compositional matrix adjust.
Identities = 113/302 (37%), Positives = 153/302 (50%), Gaps = 29/302 (9%)
Query: 43 LISGSIHYPRSTPEMWPDLIQKSKDGGLDVIETYVFWNLHEPVRNQYNFEGRYDLVKFVK 102
L+SG+IHY R P+ W + K G + +ETYV WNLHEP + + FEG DL +F+
Sbjct: 19 LLSGAIHYFRVHPDDWEHSLYNLKALGFNTVETYVPWNLHEPHKGLFQFEGILDLERFLS 78
Query: 103 LVAEAGLYAHLRIGPYVCAEWNFGGFPLWLHFIPGIQFRTDNEPFKAEMQRFTAKIVDMM 162
L E GLY LR PY+CAEW FGG P WL G + R + + A + + ++ +
Sbjct: 79 LAQELGLYVILRPSPYICAEWEFGGLPAWLLKESG-RLRACDPSYLAHVAEYYDVLLPKI 137
Query: 163 KQEKLYASQGGPIILSQIENEYGNI--DSAYGAAGKSY-IKWAAGMALSLDTGVPWVMCQ 219
+L S GG I++ Q+ENEYG+ + AY A K I M L G PW
Sbjct: 138 IPYQL--SHGGNILMIQVENEYGSYGEEKAYLRAIKEMLINRGIDMPLFTSDG-PWQAAL 194
Query: 220 QSDA--PDPIINTCN-------GFYCDQFTPNSNNK--PKMWTENWSGWFLSFGGAVPYR 268
++ + D ++ T N F Q + +NK P M E W GWF + + R
Sbjct: 195 RAGSLIEDDVLVTGNFGSRAKENFAAMQDFFDQHNKKWPLMCMEFWDGWFNRWNEPIIRR 254
Query: 269 PVEDLAFAVARFFQRGGTFQNYYMYHGGTNFDRTSGG--------PFISTSYDYDAPLDE 320
+DLA +V + G N YM+HGGTNF +G P + TSYDYDAPLDE
Sbjct: 255 DPDDLAESVKEALEIGSV--NLYMFHGGTNFGFMNGCSARGAVDLPQV-TSYDYDAPLDE 311
Query: 321 YG 322
G
Sbjct: 312 QG 313
>gi|76636681|ref|XP_597358.2| PREDICTED: galactosidase, beta 1-like 2 [Bos taurus]
gi|297483828|ref|XP_002693892.1| PREDICTED: galactosidase, beta 1-like 2 [Bos taurus]
gi|296479483|tpg|DAA21598.1| TPA: galactosidase, beta 1-like [Bos taurus]
Length = 758
Score = 157 bits (398), Expect = 2e-35, Method: Compositional matrix adjust.
Identities = 106/302 (35%), Positives = 145/302 (48%), Gaps = 34/302 (11%)
Query: 43 LISGSIHYPRSTPEMWPDLIQKSKDGGLDVIETYVFWNLHEPVRNQYNFEGRYDLVKFVK 102
+ GS+HY R W D + K + GL+ + TYV WNLHEP R ++F G DL F+
Sbjct: 185 IFGGSVHYFRVPRAYWRDRLLKLRACGLNTLTTYVPWNLHEPERGTFDFSGNLDLEAFIL 244
Query: 103 LVAEAGLYAHLRIGPYVCAEWNFGGFPLWLHFIPGIQFRTDNEPFKAEMQRFTAKIVDMM 162
L AE GL+ LR GPY+C+E + GG P WL P ++ RT + F + + + M+
Sbjct: 245 LAAEVGLWVILRPGPYICSEVDLGGLPSWLLRDPDMRLRTTYKGFTEAVDLYFDHL--ML 302
Query: 163 KQEKLYASQGGPIILSQIENEYG--NIDSAYGAAGKSYIKWAAGMALSLDTGVPWVMCQ- 219
+ L GGPII Q+ENEYG N D AY YIK A D G+ ++
Sbjct: 303 RVVPLQYKHGGPIIAVQVENEYGSYNKDPAY----MPYIKKALQ-----DRGIAELLLTS 353
Query: 220 ------QSDAPDPIINTCN-------GFYCDQFTPNSNNKPKMWTENWSGWFLSFGGAVP 266
+S D ++ T N + ++PKM E W+GWF S+GG
Sbjct: 354 DNQGGLKSGVLDGVLATINLQSQSELQLFTTILLGAQGSQPKMVMEYWTGWFDSWGGPHY 413
Query: 267 YRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNFDRTSGGPFIS------TSYDYDAPLDE 320
++ V+ + G + N YM+HGGTNF G TSYDYDA L E
Sbjct: 414 ILDSSEVLNTVSAIVKAGSSI-NLYMFHGGTNFGFIGGAMHFQDYKPDVTSYDYDAVLTE 472
Query: 321 YG 322
G
Sbjct: 473 AG 474
>gi|339640120|ref|ZP_08661564.1| glycosyl hydrolase family 35 [Streptococcus sp. oral taxon 056 str.
F0418]
gi|339453389|gb|EGP66004.1| glycosyl hydrolase family 35 [Streptococcus sp. oral taxon 056 str.
F0418]
Length = 595
Score = 157 bits (397), Expect = 2e-35, Method: Compositional matrix adjust.
Identities = 106/316 (33%), Positives = 157/316 (49%), Gaps = 40/316 (12%)
Query: 36 IGGKRRVLISGSIHYPRSTPEMWPDLIQKSKDGGLDVIETYVFWNLHEPVRNQYNFEGRY 95
+ GK ++SG+I Y R P+ W + + K G + +ETY+ W+LHEP Q+ +G
Sbjct: 12 LDGKPFKILSGAIQYFRLHPDQWRETLHNLKALGYNTVETYIPWSLHEPQEGQFVTDGLL 71
Query: 96 DLVKFVKLVAEAGLYAHLRIGPYVCAEWNFGGFPLWLHFIPGIQFRTDNEPFKAEMQRFT 155
D + LV E GL+ +R PY+CAE++FGG P WL PG++FR ++ F ++ RF
Sbjct: 72 DFEAYFDLVQEMGLHLIVRPTPYICAEFDFGGMPPWLLNYPGMRFRVNDALFLEKVSRFY 131
Query: 156 AKIVDMMKQEKLYASQGGPIILSQIENEYGNIDSAYGAAGKSYIKWAAGMALSLDTGVP- 214
+ + + ++GGPI++ Q+ENEYG+ A K Y++ A M VP
Sbjct: 132 DWLFPKLLPYQF--TEGGPILMMQVENEYGSY-----AEDKEYMRNIAKMMRDRGVSVPL 184
Query: 215 ------WVMCQQSDA--PDPIINTCNGFYCDQFTPNSNNK-----------PKMWTENWS 255
W+ +S D I T N + Q N++N P M TE W
Sbjct: 185 FTSDGTWIEALESGTLIEDDIFVTGN--FGSQAKENTDNLRAFMERHGKKWPLMCTEFWD 242
Query: 256 GWFLSFGGAVPYRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNF--------DRTSGGPF 307
GWF +G + R EDLA V + G N ++ GGTNF +T P
Sbjct: 243 GWFSRWGEEIVRRDAEDLAQDVKEMMRIGSM--NLFLLRGGTNFGFISGCSARKTRDLPQ 300
Query: 308 ISTSYDYDAPLDEYGL 323
I TSYD+DAP+ E+G+
Sbjct: 301 I-TSYDFDAPVTEWGV 315
>gi|423212381|ref|ZP_17198910.1| hypothetical protein HMPREF1074_00442 [Bacteroides xylanisolvens
CL03T12C04]
gi|392694827|gb|EIY88053.1| hypothetical protein HMPREF1074_00442 [Bacteroides xylanisolvens
CL03T12C04]
Length = 725
Score = 157 bits (397), Expect = 2e-35, Method: Compositional matrix adjust.
Identities = 103/295 (34%), Positives = 151/295 (51%), Gaps = 33/295 (11%)
Query: 48 IHYPRSTPEMWPDLIQKSKDGGLDVIETYVFWNLHEPVRNQYNFEGRYDLVKFVKLVAEA 107
+HYPR E W D +++++ GL+ + YVFWN HE +++F G+ D+ +FV+ E
Sbjct: 1 MHYPRIPHEYWRDRLKRARAMGLNTVSAYVFWNFHERQPGEFDFTGQADIAEFVRTAQEE 60
Query: 108 GLYAHLRIGPYVCAEWNFGGFPLWLHFIPGIQFRTDNEPFKAEMQRFTAKIVDMMKQ-EK 166
GLY LR GPYVCAEW+FGG+P WL + +R+ + F + +R+ I ++ KQ
Sbjct: 61 GLYVILRPGPYVCAEWDFGGYPSWLLKEKDMIYRSKDPRFLSYCERY---IKELGKQLSS 117
Query: 167 LYASQGGPIILSQIENEYGNIDSAYGAAGKSYIKWAAGMALSLDTGVPWVMCQ-----QS 221
L + GG II+ Q+ENEYG+ AA K Y+ M VP C ++
Sbjct: 118 LTINNGGNIIMVQVENEYGSY-----AADKEYLAAIRDMIKEAGFNVPLFTCDGGGQVEA 172
Query: 222 DAPDPIINTCNGFYCDQFTPNSNNK----PKMWTENWSGWFLSFG---GAVPY-RPVEDL 273
+ + T NG + + +N P E + WF +G +V Y RP E L
Sbjct: 173 GHIEGALPTLNGVFGEDIFKVVDNYHKGGPYFVAEFYPAWFDEWGKRHSSVAYERPAEQL 232
Query: 274 AFAVARFFQRGGTFQNYYMYHGGTNFDRTSGGPF------ISTSYDYDAPLDEYG 322
+ ++ G + YM+HGGTNF T+G TSYDYDAPL E+G
Sbjct: 233 DWMLSH-----GVSVSMYMFHGGTNFWYTNGANTGGGYQPQPTSYDYDAPLGEWG 282
>gi|329960218|ref|ZP_08298660.1| beta-galactosidase domain protein [Bacteroides fluxus YIT 12057]
gi|328532891|gb|EGF59668.1| beta-galactosidase domain protein [Bacteroides fluxus YIT 12057]
Length = 1104
Score = 157 bits (397), Expect = 2e-35, Method: Compositional matrix adjust.
Identities = 108/322 (33%), Positives = 153/322 (47%), Gaps = 25/322 (7%)
Query: 35 VIGGKRRVLISGSIHYPRSTPEMWPDLIQKSKDGGLDVIETYVFWNLHEPVRNQYNFEGR 94
++ GK V+ + +HYPR W I+ K G++ I YVFWN HEP ++F G+
Sbjct: 357 LLNGKPFVVKAAELHYPRIPKAYWDQRIKLCKALGMNTICLYVFWNSHEPQPGVFDFTGQ 416
Query: 95 YDLVKFVKLVAEAGLYAHLRIGPYVCAEWNFGGFPLWLHFIPGIQFRTDNEPFKAEMQRF 154
DL +F +L + +Y LR GPYVCAEW GG P WL I+ R + F + F
Sbjct: 417 NDLAEFCRLCRQNDMYVILRPGPYVCAEWEMGGLPWWLLKKKDIRLRESDPYFIERVGIF 476
Query: 155 TAKIVDMMKQEKLYASQGGPIILSQIENEYGNIDSAYGAAGKSYIKWAAGMALSLDTGVP 214
+ + + + GGPII+ Q+ENEYG +YG K Y+ + + GV
Sbjct: 477 EKAVAEQVAD--MTIQNGGPIIMVQVENEYG----SYG-EDKGYVSQIRDIVRANYPGVT 529
Query: 215 WVMCQ------QSDAPDPI--INTCNGFYCD-QFTPNSNNKPK---MWTENWSGWFLSFG 262
C ++ D + +N G D QF P +P M +E WSGWF +G
Sbjct: 530 LFQCDWASNFTKNGLHDLVWTMNFGTGANIDQQFAPLKKLRPDSPLMCSEFWSGWFDKWG 589
Query: 263 GAVPYRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNFDRTSGG--PFIS---TSYDYDAP 317
RP D+ + +G +F + YM HGGTN+ +G P + TSYDYDAP
Sbjct: 590 ANHETRPAADMIAGIDEMLSKGISF-SLYMTHGGTNWGHWAGANSPGFAPDVTSYDYDAP 648
Query: 318 LDEYGLIRQPKWGHLKDLHKAI 339
+ E G W K L K +
Sbjct: 649 ISESGQTTPKYWELRKTLSKYM 670
>gi|336424850|ref|ZP_08604882.1| hypothetical protein HMPREF0994_00888 [Lachnospiraceae bacterium
3_1_57FAA_CT1]
gi|336013315|gb|EGN43197.1| hypothetical protein HMPREF0994_00888 [Lachnospiraceae bacterium
3_1_57FAA_CT1]
Length = 596
Score = 157 bits (397), Expect = 2e-35, Method: Compositional matrix adjust.
Identities = 105/319 (32%), Positives = 155/319 (48%), Gaps = 45/319 (14%)
Query: 36 IGGKRRVLISGSIHYPRSTPEMWPDLIQKSKDGGLDVIETYVFWNLHEPVRNQYNFEGRY 95
+ G+ +ISG IHY R PE W D +QK K+ G + +ETY+ WN+HEPV+ +++F G +
Sbjct: 16 LNGEPFQIISGGIHYFRILPEYWEDRLQKLKELGCNTVETYIPWNMHEPVKGKFDFYGEH 75
Query: 96 -----DLVKFVKLVAEAGLYAHLRIGPYVCAEWNFGGFPLWLHFIPGIQFRTDNEPFKAE 150
D+V FV+ GL+ LR PY+CAEW+FGG P WL + RT +E +
Sbjct: 76 VHGMLDVVSFVRTAQRLGLWVILRPSPYICAEWDFGGLPFWLMAGEEMDLRTSDERYLRH 135
Query: 151 MQRFTAKIVDMMKQEKLYASQGGPIILSQIENEYGNIDSAYGAAGKSYIKWAAGMALSLD 210
++ + +++ ++ L QGGP+++ Q+ENEYG+ K Y++ M
Sbjct: 136 VRDYYDRLMPLLA--PLQIDQGGPVLMLQVENEYGSF-----GNDKKYLESLRDMMRERG 188
Query: 211 TGVPWVMCQQSDAPD-------------PIINTCNGF-----YCDQFTPNSNNKPKMWTE 252
VP SD PD P N +G +++T + P M TE
Sbjct: 189 ITVPLF---ASDGPDHNMLANTKTEGIFPTANFGSGASKAFSILEEYT---DGGPCMCTE 242
Query: 253 NWSGWFLSFGGAVPYR-PVEDLAFAVARFFQRGGTFQNYYMYHGGTNFDRTSGGPFIS-- 309
W GWF ++ V + E + + G N YM+ GGTNF +G +
Sbjct: 243 FWIGWFDAWHDEVHHEGDTETAVKELENILELGNV--NIYMFEGGTNFGFMNGSNYSDHL 300
Query: 310 ----TSYDYDAPLDEYGLI 324
TSYDYDA L E G I
Sbjct: 301 TADVTSYDYDALLTEDGQI 319
>gi|298386767|ref|ZP_06996322.1| beta-galactosidase (Lactase) [Bacteroides sp. 1_1_14]
gi|298260441|gb|EFI03310.1| beta-galactosidase (Lactase) [Bacteroides sp. 1_1_14]
Length = 778
Score = 157 bits (397), Expect = 2e-35, Method: Compositional matrix adjust.
Identities = 125/387 (32%), Positives = 183/387 (47%), Gaps = 43/387 (11%)
Query: 1 MASKEILLLVLCWGFVVLATTSFGANVTYDH-----RAVVIGGKRRVLISGSIHYPRSTP 55
M + I LLVL F V+ +S A T ++ GK V+ + +HY R
Sbjct: 1 MRHRFIALLVL---FTVIFFSSAEAQTTARKFEAGKNTFLLDGKPFVVKAAELHYTRIPQ 57
Query: 56 EMWPDLIQKSKDGGLDVIETYVFWNLHEPVRNQYNFEGRYDLVKFVKLVAEAGLYAHLRI 115
W I+ K G++ I Y+FWN+HE +++F G+ D+ F + + G+Y +R
Sbjct: 58 AYWDHRIEMCKALGMNTICIYIFWNIHEQEEGKFDFTGQNDIAAFCRAAQKHGMYVIVRP 117
Query: 116 GPYVCAEWNFGGFPLWLHFIPGIQFRTDNEPFKAEMQRFTAKIVDMMKQ-EKLYASQGGP 174
GPYVCAEW GG P WL + RT + + M+R + ++ KQ L ++GG
Sbjct: 118 GPYVCAEWEMGGLPWWLLKKKDVALRTLDPYY---MERVGIFMKEVGKQLAPLQVNKGGN 174
Query: 175 IILSQIENEYGN--IDSAYGAAGKSYIKWAAGMALSLDTGVPWVMCQQS-----DAPDPI 227
II+ Q+ENEYG+ D Y +A + ++ +G T VP C S +A D +
Sbjct: 175 IIMVQVENEYGSYGTDKPYVSAVRDLVR-ESGF-----TDVPLFQCDWSSNFTRNALDDL 228
Query: 228 INTCN---GFYCD-QFTPNSNNKPK---MWTENWSGWFLSFGGAVPYRPVEDLAFAVARF 280
I T N G D QF +P+ M +E WSGWF +G RP +D+ +
Sbjct: 229 IWTINFGTGANIDQQFKKLKELRPETPLMCSEFWSGWFDHWGRKHETRPAKDMVQGIKEM 288
Query: 281 FQRGGTFQNYYMYHGGTNFDRTSGG-----PFISTSYDYDAPLDEYGLIRQPKWGHLKDL 335
R +F + YM HGGT F G + +SYDYDAP+ E G + K+ L+DL
Sbjct: 289 LDRNISF-SLYMTHGGTTFGHWGGANNPAYSAMCSSYDYDAPISEAGWTTE-KYYLLRDL 346
Query: 336 HKAIKLCEAALVATDPTYPSLGPNLEA 362
K AL P P+ P +E
Sbjct: 347 LKTYLPAGEAL----PEVPAAMPVIEV 369
>gi|29349062|ref|NP_812565.1| beta-galactosidase [Bacteroides thetaiotaomicron VPI-5482]
gi|383124327|ref|ZP_09944991.1| hypothetical protein BSIG_3645 [Bacteroides sp. 1_1_6]
gi|29340969|gb|AAO78759.1| beta-galactosidase precursor [Bacteroides thetaiotaomicron
VPI-5482]
gi|251839176|gb|EES67260.1| hypothetical protein BSIG_3645 [Bacteroides sp. 1_1_6]
Length = 778
Score = 157 bits (397), Expect = 2e-35, Method: Compositional matrix adjust.
Identities = 125/387 (32%), Positives = 183/387 (47%), Gaps = 43/387 (11%)
Query: 1 MASKEILLLVLCWGFVVLATTSFGANVTYDH-----RAVVIGGKRRVLISGSIHYPRSTP 55
M + I LLVL F V+ +S A T ++ GK V+ + +HY R
Sbjct: 1 MRHRFIALLVL---FTVIFFSSAEAQTTARKFEAGKNTFLLDGKPFVVKAAELHYTRIPQ 57
Query: 56 EMWPDLIQKSKDGGLDVIETYVFWNLHEPVRNQYNFEGRYDLVKFVKLVAEAGLYAHLRI 115
W I+ K G++ I Y+FWN+HE +++F G+ D+ F + + G+Y +R
Sbjct: 58 AYWDHRIEMCKALGMNTICIYIFWNIHEQEEGKFDFTGQNDIAAFCRAAQKHGMYVIVRP 117
Query: 116 GPYVCAEWNFGGFPLWLHFIPGIQFRTDNEPFKAEMQRFTAKIVDMMKQ-EKLYASQGGP 174
GPYVCAEW GG P WL + RT + + M+R + ++ KQ L ++GG
Sbjct: 118 GPYVCAEWEMGGLPWWLLKKKDVALRTLDPYY---MERVGIFMKEVGKQLAPLQVNKGGN 174
Query: 175 IILSQIENEYGN--IDSAYGAAGKSYIKWAAGMALSLDTGVPWVMCQQS-----DAPDPI 227
II+ Q+ENEYG+ D Y +A + ++ +G T VP C S +A D +
Sbjct: 175 IIMVQVENEYGSYGTDKPYVSAVRDLVR-ESGF-----TDVPLFQCDWSSNFTRNALDDL 228
Query: 228 INTCN---GFYCD-QFTPNSNNKPK---MWTENWSGWFLSFGGAVPYRPVEDLAFAVARF 280
I T N G D QF +P+ M +E WSGWF +G RP +D+ +
Sbjct: 229 IWTINFGTGANIDQQFKKLKELRPETPLMCSEFWSGWFDHWGRKHETRPAKDMVQGIKEM 288
Query: 281 FQRGGTFQNYYMYHGGTNFDRTSGG-----PFISTSYDYDAPLDEYGLIRQPKWGHLKDL 335
R +F + YM HGGT F G + +SYDYDAP+ E G + K+ L+DL
Sbjct: 289 LDRNISF-SLYMTHGGTTFGHWGGANNPAYSAMCSSYDYDAPISEAGWTTE-KYYLLRDL 346
Query: 336 HKAIKLCEAALVATDPTYPSLGPNLEA 362
K AL P P+ P +E
Sbjct: 347 LKTYLPAGEAL----PEVPAAMPVIEV 369
>gi|257876100|ref|ZP_05655753.1| glycosyl hydrolase [Enterococcus casseliflavus EC20]
gi|257810266|gb|EEV39086.1| glycosyl hydrolase [Enterococcus casseliflavus EC20]
Length = 591
Score = 157 bits (397), Expect = 2e-35, Method: Compositional matrix adjust.
Identities = 112/314 (35%), Positives = 151/314 (48%), Gaps = 37/314 (11%)
Query: 35 VIGGKRRVLISGSIHYPRSTPEMWPDLIQKSKDGGLDVIETYVFWNLHEPVRNQYNFEGR 94
++ GK LISG+IHY R TP W D + K G + +ETY+ WNLHEP Y+FEG
Sbjct: 11 LLDGKPIKLISGAIHYFRMTPAQWTDSLYNLKALGANTVETYIPWNLHEPREGVYDFEGM 70
Query: 95 YDLVKFVKLVAEAGLYAHLRIGPYVCAEWNFGGFPLWLHFIPGIQFRTDNEPFKAEMQRF 154
D+ FVK GL LR Y+CAEW FGG P WL P ++ R+ + F A+++ +
Sbjct: 71 KDICAFVKQAQALGLMVILRPSVYICAEWEFGGLPAWLLNEP-MRLRSTDPRFMAKVRNY 129
Query: 155 TAKIVDMMKQEKLYASQGGPIILSQIENEYGNIDSAYGAAGKSYIKWAAGMALSLDTGVP 214
V + K L + GGP+I+ Q+ENEYG +YG K+Y++ + VP
Sbjct: 130 FQ--VLLPKLVPLQITHGGPVIMMQVENEYG----SYGME-KAYLRQTKELMEEYGIDVP 182
Query: 215 -------W--VMCQQSDAPDPIINTCN---------GFYCDQFTPNSNNKPKMWTENWSG 256
W V+ + D + T N + + N P M E W G
Sbjct: 183 LFTSDGAWEEVLDAGTLIEDDVFVTGNFGSRSKENAAVMKEFMAKHGKNWPIMCMEYWDG 242
Query: 257 WFLSFGGAVPYRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNFDRTSGG--------PFI 308
WF +G + R +DLA V G N YM+HGGTNF +G P +
Sbjct: 243 WFNRWGEPIIKRAGQDLANEVKEMLAVGSL--NLYMFHGGTNFGFYNGCSARGALDLPQV 300
Query: 309 STSYDYDAPLDEYG 322
S SYDYDA L E G
Sbjct: 301 S-SYDYDALLTEAG 313
Score = 52.4 bits (124), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 65/233 (27%), Positives = 87/233 (37%), Gaps = 53/233 (22%)
Query: 502 LHVQSLGHALHAFINGKLVGSGYGSSSNAKVTVDFPIALAPGKNT--FDLLSLTVGLQNY 559
L V LH F +G+L Y + ++ I P K T D+L +G NY
Sbjct: 401 LKVVEASDRLHIFTDGQLQAIQYQETLGEELL----IQGTPDKETIELDVLVENLGRVNY 456
Query: 560 GAFYEKTGAGITGPVQLKGSGNGTNIDLSSQQ--WTYQTGLKGEELNFPSGSSTQWDSKS 617
G + GP Q KG G D+ Q Y L E+L Q
Sbjct: 457 GF-------KLNGPTQAKGIRGGIMQDIHFHQGYRHYPLTLSAEQLQ---AIDYQAGKNP 506
Query: 618 TLPKLQPLVWYKTTFDAPAGSEPVAIDFTGMGKGEAWVNGQSIGRYWPTYVSQNGGCTDS 677
T P +Y+TTF + ID G GKG VNG ++GRYW Q G
Sbjct: 507 THPS-----FYQTTFTLTEVGDTF-IDCRGYGKGVVIVNGINLGRYW-----QRG----- 550
Query: 678 CNYRGAYSSNKCLKNCGKPSQSLYHVPRSWLKSSGNTLVLFEEIGGDPTKISF 730
P SLY P+ +LK N +V+FE G + ++ F
Sbjct: 551 ------------------PVHSLY-CPKEFLKKGSNEVVVFETDGVEIKELVF 584
>gi|392987629|ref|YP_006486222.1| glucosyl hydrolase family protein [Enterococcus hirae ATCC 9790]
gi|392335049|gb|AFM69331.1| glucosyl hydrolase family protein [Enterococcus hirae ATCC 9790]
Length = 592
Score = 157 bits (397), Expect = 3e-35, Method: Compositional matrix adjust.
Identities = 111/317 (35%), Positives = 155/317 (48%), Gaps = 43/317 (13%)
Query: 35 VIGGKRRVLISGSIHYPRSTPEMWPDLIQKSKDGGLDVIETYVFWNLHEPVRNQYNFEGR 94
++ GK ++SG+IHY R W + K G + +ETYV WNLHEP + ++FEG
Sbjct: 11 LLNGKPFKILSGAIHYFRVDSADWYHSLYNLKALGFNTVETYVPWNLHEPKKGDFHFEGI 70
Query: 95 YDLVKFVKLVAEAGLYAHLRIGPYVCAEWNFGGFPLWLHFIPGIQFRTDNEPFKAEMQRF 154
DL F+ + E GLYA +R PY+CAEW FGGFP WL G + RT+ + + +
Sbjct: 71 LDLEHFLSIAEELGLYAIVRPSPYICAEWEFGGFPAWL-LNEGTRIRTNETVYLNHVADY 129
Query: 155 TAKIVDMMKQEKLYASQGGPIILSQIENEYGNIDSAYGAAGKSYIKWAAGMALSLDTGVP 214
++ + +L + GG I++ QIENEYG +YG K Y++ + L VP
Sbjct: 130 YDVLIKKIVPHQL--TNGGNILMIQIENEYG----SYGEE-KDYLRSIRDLMLDRGITVP 182
Query: 215 WVMCQQSDAP------------DPIINTCN-GFYCDQ--------FTPNSNNKPKMWTEN 253
+ SD P + I+ T N G ++ F + P M E
Sbjct: 183 FFT---SDGPWRATLRAGSMIDEDILVTGNFGSKAEENFSSMEAFFNEHGKKWPLMCMEF 239
Query: 254 WSGWFLSFGGAVPYRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNFDRTSGG-------- 305
W GWF + + R ++LA A+ RG N YM+HGGTNF +G
Sbjct: 240 WDGWFNRWKEPIVQRDAKELAEAIKEVVLRGSI--NLYMFHGGTNFGFMNGCSARGVIDL 297
Query: 306 PFISTSYDYDAPLDEYG 322
P I TSYDY APLDE G
Sbjct: 298 PQI-TSYDYGAPLDEQG 313
>gi|420261585|ref|ZP_14764229.1| glycosyl hydrolase [Enterococcus sp. C1]
gi|394771519|gb|EJF51280.1| glycosyl hydrolase [Enterococcus sp. C1]
Length = 591
Score = 157 bits (397), Expect = 3e-35, Method: Compositional matrix adjust.
Identities = 113/314 (35%), Positives = 151/314 (48%), Gaps = 37/314 (11%)
Query: 35 VIGGKRRVLISGSIHYPRSTPEMWPDLIQKSKDGGLDVIETYVFWNLHEPVRNQYNFEGR 94
++ GK LISG+IHY R TP W D + K G + +ETY+ WNLHEP Y+FEG
Sbjct: 11 LLDGKPIKLISGAIHYFRMTPVQWTDSLYNLKALGANTVETYIPWNLHEPREGVYDFEGM 70
Query: 95 YDLVKFVKLVAEAGLYAHLRIGPYVCAEWNFGGFPLWLHFIPGIQFRTDNEPFKAEMQRF 154
D+ FVK GL LR Y+CAEW FGG P WL P ++ R+ + F A+++ +
Sbjct: 71 KDICAFVKQAQTIGLMVILRPSVYICAEWEFGGLPAWLLNEP-MRLRSTDPRFMAKVRNY 129
Query: 155 TAKIVDMMKQEKLYASQGGPIILSQIENEYGNIDSAYGAAGKSYIKWAAGMALSLDTGVP 214
V + K L + GGP+I+ Q+ENEYG +YG K+Y++ + VP
Sbjct: 130 FQ--VLLPKLVPLQITHGGPVIMMQVENEYG----SYGME-KAYLRQTKELMEEYGIDVP 182
Query: 215 -------W--VMCQQSDAPDPIINTCN---------GFYCDQFTPNSNNKPKMWTENWSG 256
W V+ + D I T N + + N P M E W G
Sbjct: 183 LFTSDGAWEEVLDAGTLIEDDIFVTGNFGSRSKENAAVMKEFMAKHGKNWPIMCMEYWDG 242
Query: 257 WFLSFGGAVPYRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNFDRTSGG--------PFI 308
WF +G + R +DLA V G N YM+HGGTNF +G P +
Sbjct: 243 WFNRWGEPIIKRDGQDLANEVKEMLAVGSL--NLYMFHGGTNFGFYNGCSARGALDLPQV 300
Query: 309 STSYDYDAPLDEYG 322
S SYDYDA L E G
Sbjct: 301 S-SYDYDALLTEAG 313
Score = 52.4 bits (124), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 65/233 (27%), Positives = 87/233 (37%), Gaps = 53/233 (22%)
Query: 502 LHVQSLGHALHAFINGKLVGSGYGSSSNAKVTVDFPIALAPGKNT--FDLLSLTVGLQNY 559
L V LH F +G+L Y + ++ I P K T D+L +G NY
Sbjct: 401 LKVVEASDRLHIFTDGQLQAIQYQETLGEELL----IQGTPDKETIELDVLVENLGRVNY 456
Query: 560 GAFYEKTGAGITGPVQLKGSGNGTNIDLSSQQ--WTYQTGLKGEELNFPSGSSTQWDSKS 617
G + GP Q KG G D+ Q Y L E+L Q
Sbjct: 457 GF-------KLNGPTQAKGIRGGIMQDIHFHQGYRHYPLTLSAEQLQ---AIDYQAGKNP 506
Query: 618 TLPKLQPLVWYKTTFDAPAGSEPVAIDFTGMGKGEAWVNGQSIGRYWPTYVSQNGGCTDS 677
T P +Y+TTF + ID G GKG VNG ++GRYW Q G
Sbjct: 507 THPS-----FYQTTFTLTEVGDTF-IDCRGYGKGVVIVNGINLGRYW-----QRG----- 550
Query: 678 CNYRGAYSSNKCLKNCGKPSQSLYHVPRSWLKSSGNTLVLFEEIGGDPTKISF 730
P SLY P+ +LK N +V+FE G + ++ F
Sbjct: 551 ------------------PVHSLY-CPKEFLKKGSNEVVVFETDGVEIKELVF 584
>gi|225868140|ref|YP_002744088.1| beta-galactosidase precursor [Streptococcus equi subsp.
zooepidemicus]
gi|225701416|emb|CAW98512.1| putative beta-galactosidase precursor [Streptococcus equi subsp.
zooepidemicus]
Length = 601
Score = 157 bits (396), Expect = 3e-35, Method: Compositional matrix adjust.
Identities = 105/308 (34%), Positives = 156/308 (50%), Gaps = 28/308 (9%)
Query: 36 IGGKRRVLISGSIHYPRSTPEMWPDLIQKSKDGGLDVIETYVFWNLHEPVRNQYNFEGRY 95
+ G+ ++SG+IHY R P+ W + K G + +ETY+ WNLHE Y+F G+
Sbjct: 14 LDGRPLQILSGAIHYFRIHPDDWYQSLYNLKALGFNTVETYIPWNLHEAKEGSYDFSGQL 73
Query: 96 DLVKFVKLVAEAGLYAHLRIGPYVCAEWNFGGFPLWLHFIPGIQFRTDNEPFKAEMQRFT 155
D+ F+ L + GLYA +R PY+CAEW FGG P WL R+ + + A ++R+
Sbjct: 74 DVEAFLTLAQQLGLYAIVRPSPYICAEWEFGGLPAWL-LTKNCHIRSSDPAYLAYVRRYY 132
Query: 156 AKIVDMMKQEKLYASQGGPIILSQIENEYGNI--DSAYGAAGKSYIKWAAGMALSLDTGV 213
+++ + + + QGG I++ Q+ENEYG+ D AY A K +++ L G
Sbjct: 133 EELLPRLARHEW--QQGGNILMFQLENEYGSYGEDKAYLTAVKGFMEEHLSAPLFTADG- 189
Query: 214 PWVMCQQSDA--PDPIINTCN------GFYCDQ---FTPNSNNKPKMWTENWSGWFLSFG 262
PW ++ + D + T N + D F+ + + P M E W GWF +
Sbjct: 190 PWRATLRAGSLIEDDVFVTGNFGSRARDNFADMQAFFSEHGKHWPLMCMEFWDGWFNRWN 249
Query: 263 GAVPYRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNFDRTSGG--------PFISTSYDY 314
+ R E+LA AV +G N YM+HGGTNF +G P + TSYDY
Sbjct: 250 EPIIKRDPEELADAVMEVLAQGSI--NLYMFHGGTNFGFMNGCSARKQLDLPQV-TSYDY 306
Query: 315 DAPLDEYG 322
DA LDE G
Sbjct: 307 DAILDEAG 314
Score = 45.4 bits (106), Expect = 0.13, Method: Compositional matrix adjust.
Identities = 55/225 (24%), Positives = 81/225 (36%), Gaps = 55/225 (24%)
Query: 512 HAFINGKLVGSGYGSSSNAKVTVDFPIALAPGKNTFDLLSLTVGLQNYGAFYEKTGAGIT 571
F++G+ V + Y + + ++ AL+ D+L +G NYG +T
Sbjct: 413 QVFLDGQRVATQYQETIGDDIIINQQHALS----QVDVLIENMGRVNYGH-------KLT 461
Query: 572 GPVQLKGSGNGTNIDLS-SQQWTYQTGLKGEELNFPSGSSTQWDSKSTLPKLQPLVWYKT 630
P Q KG G G DL W E P +Q + QP +Y
Sbjct: 462 APSQCKGLGRGMMADLHFVTNW--------EMYCLPLDDLSQLRFDGDFYEGQP-GFYHY 512
Query: 631 TFDAPAGSEPVA--IDFTGMGKGEAWVNGQSIGRYWPTYVSQNGGCTDSCNYRGAYSSNK 688
F+ EP A ID TG GKG ++N IGR+W
Sbjct: 513 QFEC---HEPEASYIDMTGFGKGCVFINNHPIGRFWEV---------------------- 547
Query: 689 CLKNCGKPSQSLYHVPRSWLKSSGNTLVLFEEIGGDPTKISFVTK 733
P +LY +P+ + N +V+FE G I V +
Sbjct: 548 ------GPLLTLY-IPKGYFNKGLNDIVIFETEGVYQDSIRLVDR 585
>gi|237734327|ref|ZP_04564808.1| beta-galactosidase [Mollicutes bacterium D7]
gi|365831197|ref|ZP_09372750.1| hypothetical protein HMPREF1021_01514 [Coprobacillus sp. 3_3_56FAA]
gi|374624872|ref|ZP_09697289.1| hypothetical protein HMPREF0978_00609 [Coprobacillus sp.
8_2_54BFAA]
gi|229382557|gb|EEO32648.1| beta-galactosidase [Coprobacillus sp. D7]
gi|365262188|gb|EHM92085.1| hypothetical protein HMPREF1021_01514 [Coprobacillus sp. 3_3_56FAA]
gi|373916155|gb|EHQ47903.1| hypothetical protein HMPREF0978_00609 [Coprobacillus sp.
8_2_54BFAA]
Length = 584
Score = 157 bits (396), Expect = 3e-35, Method: Compositional matrix adjust.
Identities = 107/323 (33%), Positives = 148/323 (45%), Gaps = 46/323 (14%)
Query: 31 HRAVVIGGKRRVLISGSIHYPRSTPEMWPDLIQKSKDGGLDVIETYVFWNLHEPVRNQYN 90
++ I G + +ISG++HY R PE W D + K G + +ETYV WNLHEP + +Y+
Sbjct: 7 NKEFFINGNKVKIISGAVHYFRIVPEYWRDTLLDLKAMGCNTVETYVPWNLHEPYQGKYD 66
Query: 91 FEGRYDLVKFVKLVAEAGLYAHLRIGPYVCAEWNFGGFPLWLHFIPGIQFRTDNEPFKAE 150
F G D+ F+KL E L+ LR PY+CAEW GG P WL P I+ RT+++ +
Sbjct: 67 FSGIKDIETFLKLAEELELFVILRASPYICAEWEMGGLPAWLLKYPRIRLRTNDKQYLKC 126
Query: 151 MQRFTAKIVDMMKQEKLYASQGGPIILSQIENEYGNIDSAYGAAGKSYIKWAAGMALSLD 210
+ ++ + + + K K +Q GPIIL+Q+ENEYG +YG K Y+ M
Sbjct: 127 LDQYFS--ILLPKLSKYQITQNGPIILAQLENEYG----SYG-EDKEYLLAVYQMMRKYG 179
Query: 211 TGVPWVMCQ-----------------------QSDAPDPIINTCNGFYCDQFTPNSNNKP 247
VP S A + I Q T P
Sbjct: 180 IEVPLFTADGTWHEALNAGSLLEKKVFPTGNFGSQAKENITVLKKFMESHQITA-----P 234
Query: 248 KMWTENWSGWFLSFGGAVPYRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNFDRTSGG-- 305
M E W GWF + + R ++ + G N+YM+ GGTNF +G
Sbjct: 235 LMCMEFWDGWFNRWNQEIIKRDPQEFVNSAQEMLSLGSV--NFYMFQGGTNFGWMNGCSA 292
Query: 306 ------PFISTSYDYDAPLDEYG 322
P I TSYDYDA L EYG
Sbjct: 293 RKEHDLPQI-TSYDYDAILTEYG 314
>gi|384512509|ref|YP_005707602.1| beta-galactosidase [Enterococcus faecalis OG1RF]
gi|430358961|ref|ZP_19425649.1| beta-galactosidase [Enterococcus faecalis OG1X]
gi|327534398|gb|AEA93232.1| beta-galactosidase [Enterococcus faecalis OG1RF]
gi|429513519|gb|ELA03099.1| beta-galactosidase [Enterococcus faecalis OG1X]
Length = 592
Score = 157 bits (396), Expect = 3e-35, Method: Compositional matrix adjust.
Identities = 114/351 (32%), Positives = 162/351 (46%), Gaps = 44/351 (12%)
Query: 35 VIGGKRRVLISGSIHYPRSTPEMWPDLIQKSKDGGLDVIETYVFWNLHEPVRNQYNFEGR 94
++ G+ +ISG+IHY R TP W D + K G + +ETY+ WN+HEP Y+FEG
Sbjct: 11 LLNGQPIKIISGAIHYFRMTPSQWEDSLYNLKALGANTVETYIPWNIHEPEEGVYDFEGM 70
Query: 95 YDLVKFVKLVAEAGLYAHLRIGPYVCAEWNFGGFPLWLHFIPGIQFRTDNEPFKAEMQRF 154
++ FV+L + L LR Y+CAEW FGG P WL G++ R+ + F +++ +
Sbjct: 71 KNIEAFVRLAEKLNLLVILRPSAYICAEWEFGGLPAWLLKKKGVRLRSTDPIFMTKVRNY 130
Query: 155 TAKIVDMMKQEKLYASQGGPIILSQIENEYGNIDSAYGAAGKSYIKWAAGMALSLDTGVP 214
V + K L +QGGP+I+ Q+ENEYG +YG K+Y++ + L VP
Sbjct: 131 FQ--VLLPKLAPLQITQGGPVIMMQVENEYG----SYGME-KAYLRQTKQIMEELGIEVP 183
Query: 215 WVMCQQSDAPDPIINTCNGFYCDQF--------------------TPNSNNKPKMWTENW 254
+ A + +++ D F T + P M E W
Sbjct: 184 --LFTSDGAWEEVLDAGTLIEEDVFVTGNFGSHSKENAAVLKKFMTRHGKKWPLMCMEYW 241
Query: 255 SGWFLSFGGAVPYRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNFDRTSGG--------P 306
GWF +G V R DLA V G N YM+HGGTNF +G P
Sbjct: 242 DGWFNRWGEPVIQREGTDLAKEVKDMLAVGSL--NLYMFHGGTNFGFYNGCSARGAKDLP 299
Query: 307 FISTSYDYDAPLDEYGLIRQPKWGHLKDLHKAIKLCEAALVATDPTYPSLG 357
+ TSYDYDA L E G + + + KAIK + P LG
Sbjct: 300 QV-TSYDYDALLTEAGEPTEKYYA----VQKAIKEVCPEVWQAQPRTKKLG 345
Score = 55.8 bits (133), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 57/232 (24%), Positives = 88/232 (37%), Gaps = 45/232 (19%)
Query: 502 LHVQSLGHALHAFINGKLVGSGYGSSSNAKVTVDFPIALAPGKNTFDLLSLTVGLQNYGA 561
L V LH +++G L + Y + ++ L G+ D L+L + ++N G
Sbjct: 402 LKVVEASDRLHIYVDGDLAATQYQETVGEEL-------LISGQTEKDTLALDILVENLGR 454
Query: 562 FYEKTGAGITGPVQLKGSGNGTNIDLSSQQWTYQTGLKGEELNFPSGSSTQWDSKSTLPK 621
G + P Q KG G D+ Q G + L F + D +
Sbjct: 455 V--NYGFKLNNPTQSKGIRGGVMQDIHFHQ-----GCQHYPLTFSQEQLAKIDYTAGKNP 507
Query: 622 LQPLVWYKTTFDAPAGSEPVAIDFTGMGKGEAWVNGQSIGRYWPTYVSQNGGCTDSCNYR 681
LQP +Y+ TF+ ++ ID G GKG VNG +GRYW
Sbjct: 508 LQP-SFYQVTFELEQLAD-TYIDCRGYGKGFVVVNGHHLGRYWEI--------------- 550
Query: 682 GAYSSNKCLKNCGKPSQSLYHVPRSWLKSSGNTLVLFEEIGGDPTKISFVTK 733
P SLY P+ +L+ N +V+FE G + + F +
Sbjct: 551 -------------GPIHSLY-CPKEFLQQGQNEVVIFETEGIEIEYLKFTNQ 588
>gi|69247392|ref|ZP_00604336.1| Beta-galactosidase [Enterococcus faecium DO]
gi|256619331|ref|ZP_05476177.1| beta-galactosidase [Enterococcus faecalis ATCC 4200]
gi|384518861|ref|YP_005706166.1| beta-galactosidase [Enterococcus faecalis 62]
gi|389870025|ref|YP_006377575.1| beta-galactosidase [Enterococcus faecium DO]
gi|68194864|gb|EAN09337.1| Beta-galactosidase [Enterococcus faecium DO]
gi|256598858|gb|EEU18034.1| beta-galactosidase [Enterococcus faecalis ATCC 4200]
gi|309385841|gb|ADO66768.1| beta-galactosidase [Enterococcus faecium]
gi|323480994|gb|ADX80433.1| beta-galactosidase [Enterococcus faecalis 62]
gi|388535404|gb|AFK60593.1| beta-galactosidase [Enterococcus faecium DO]
Length = 592
Score = 157 bits (396), Expect = 3e-35, Method: Compositional matrix adjust.
Identities = 118/368 (32%), Positives = 170/368 (46%), Gaps = 54/368 (14%)
Query: 35 VIGGKRRVLISGSIHYPRSTPEMWPDLIQKSKDGGLDVIETYVFWNLHEPVRNQYNFEGR 94
++ GK ++SG+IHY R P W + K G + +ETYV WNLHEP + +++FEG
Sbjct: 11 LLKGKTFKILSGAIHYFRIPPCDWEHSLYNLKALGFNTVETYVPWNLHEPQKGEFHFEGI 70
Query: 95 YDLVKFVKLVAEAGLYAHLRIGPYVCAEWNFGGFPLWLHFIPGIQFRTDNEPFKAEMQRF 154
DL +F+ + + GLYA +R PY+CAEW FGGFP WL P I R + + + +
Sbjct: 71 LDLERFLTIAQDLGLYAIVRPSPYICAEWEFGGFPSWLLREP-IHIRRNEIAYLEHVADY 129
Query: 155 TAKIVDMMKQEKLYASQGGPIILSQIENEYGNIDSAYGAAGKSYIKWAAGMALSLDTGVP 214
++ + +L + GG I++ QIENEYG+ K Y++ + + VP
Sbjct: 130 YDVLMKRIVPHQL--NNGGNILMIQIENEYGSF-----GEEKEYLRAIRDLMIKRGVTVP 182
Query: 215 WVMCQQSDAP------------DPIINTCN-------GFYCDQ--FTPNSNNKPKMWTEN 253
+ SD P D I+ T N F + F N P M E
Sbjct: 183 FFT---SDGPWRATLRAGSMIEDDILVTGNFGSKAKDNFNSMKQFFKEYDKNWPLMCMEF 239
Query: 254 WSGWFLSFGGAVPYRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNFDRTSGG-------- 305
W GWF + + R ++LA AV ++G N YM+HGGTNF +G
Sbjct: 240 WDGWFNRWKEPIIQRDPQELAEAVKEVLEQGSI--NLYMFHGGTNFGFMNGCSARGVIDL 297
Query: 306 PFISTSYDYDAPLDEYGLIRQPKWGHLKDLHKAIKLCEAALVATDPTYPSLGPNLEATVY 365
P I TSYDY APLDE G + + K +H P L P ++ T+
Sbjct: 298 PQI-TSYDYGAPLDEQGNPTEKYYALRKMIHDNY-----------PEIKQLDPVIKPTIE 345
Query: 366 KTGSGLCS 373
K L +
Sbjct: 346 KKKISLTN 353
>gi|325914137|ref|ZP_08176490.1| beta-galactosidase [Xanthomonas vesicatoria ATCC 35937]
gi|325539640|gb|EGD11283.1| beta-galactosidase [Xanthomonas vesicatoria ATCC 35937]
Length = 635
Score = 157 bits (396), Expect = 3e-35, Method: Compositional matrix adjust.
Identities = 111/330 (33%), Positives = 159/330 (48%), Gaps = 47/330 (14%)
Query: 35 VIGGKRRVLISGSIHYPRSTPEMWPDLIQKSKDGGLDVIETYVFWNLHEPVRNQYNFEGR 94
V GK ++SG+IH+ R W D +QK++ GL+ +ETYVFWNL EP + Q++F
Sbjct: 62 VRDGKPYQILSGAIHFQRIPRAYWKDRLQKARALGLNTVETYVFWNLVEPQQGQFDFSAN 121
Query: 95 YDLVKFVKLVAEAGLYAHLRIGPYVCAEWNFGGFPLWLHFIPGIQFRTDNEPFKAEMQRF 154
D+ FV+ A GL LR GPY CAEW GG+P WL I+ R+ + F A Q +
Sbjct: 122 NDVAAFVREAAAQGLNVILRPGPYACAEWEAGGYPAWLFGKDNIRVRSRDPRFLAASQAY 181
Query: 155 TAKIVDMMKQEKLYASQGGPIILSQIENEYGNIDSAYGAAGKSYIKWAAGMALSLDTGVP 214
+ + + L GGPII Q+ENEYG+ D + +Y+ A A+ + G
Sbjct: 182 LDAVAKQV--QPLLNHNGGPIIAVQVENEYGSYDDDH-----AYM--ADNRAMFVKAGFD 232
Query: 215 WVMCQQSDAPDPIIN-TCNGFYC-------------DQFTPNSNNKPKMWTENWSGWFLS 260
+ SD D + N T G D+ +P+M E W+GWF
Sbjct: 233 KALLFTSDGADMLANGTLPGTLAVVNFAPGEAKSAFDKLIKFRPEQPRMVGEYWAGWFDH 292
Query: 261 FGGAVPY------RPVEDLAFAVARFFQRGGTFQNYYMYHGGTNFDRTSGGPF------- 307
+G P+ + E+L + + R G N YM+ GGT+F +G F
Sbjct: 293 WG--TPHASTDAKQQTEELEWIL-----RQGHSANLYMFIGGTSFGFMNGANFQGNPSDH 345
Query: 308 ---ISTSYDYDAPLDEYGLIRQPKWGHLKD 334
+TSYDYDA LDE G PK+ ++D
Sbjct: 346 YAPQTTSYDYDAILDEAGHP-TPKFALMRD 374
Score = 40.4 bits (93), Expect = 4.2, Method: Compositional matrix adjust.
Identities = 45/167 (26%), Positives = 68/167 (40%), Gaps = 24/167 (14%)
Query: 499 KTVLHVQSLGHALHAFINGKLVGSGYGSSSNAKVTVDFPIALAPGKNTFDLLSLTVGLQN 558
K L++ + +++ K VGS VD P G++T D+L G N
Sbjct: 444 KGSLYLGEVRDVARVYVDQKPVGSVERRLQQVATDVDIPA----GQHTLDVLVENSGRIN 499
Query: 559 YGAFYEKTGAGITGPVQLKGSGNGTNIDLSSQQWTYQTGLKGEELNFPSGSSTQ-WDSKS 617
YG AG+ PV L GN QQ T G + L S S + W K+
Sbjct: 500 YGPRMADGRAGLVDPVLL---GN--------QQLT---GWQAFPLPMRSPDSLRGWTRKA 545
Query: 618 TLPKLQPLVWYKTTFDAPAGSEPVAIDFTGMGKGEAWVNGQSIGRYW 664
+Q +++ ++ +D GKG AW NG ++GR+W
Sbjct: 546 ----VQGPAFHRGNLRIGTPTD-TYLDMRAFGKGIAWANGVNLGRHW 587
>gi|424687003|ref|ZP_18123658.1| putative beta-galactosidase [Enterococcus faecalis ERV25]
gi|402366194|gb|EJV00591.1| putative beta-galactosidase [Enterococcus faecalis ERV25]
Length = 593
Score = 157 bits (396), Expect = 3e-35, Method: Compositional matrix adjust.
Identities = 114/351 (32%), Positives = 162/351 (46%), Gaps = 44/351 (12%)
Query: 35 VIGGKRRVLISGSIHYPRSTPEMWPDLIQKSKDGGLDVIETYVFWNLHEPVRNQYNFEGR 94
++ G+ +ISG+IHY R TP W D + K G + +ETY+ WN+HEP Y+FEG
Sbjct: 12 LLNGQPIKIISGAIHYFRMTPRQWEDSLYNLKALGANTVETYIPWNIHEPEEGVYDFEGM 71
Query: 95 YDLVKFVKLVAEAGLYAHLRIGPYVCAEWNFGGFPLWLHFIPGIQFRTDNEPFKAEMQRF 154
++ FV+L + L LR Y+CAEW FGG P WL G++ R+ + F +++ +
Sbjct: 72 KNIEAFVRLAEKLNLLVILRPSAYICAEWEFGGLPAWLLKEKGVRLRSTDPIFMTKVRNY 131
Query: 155 TAKIVDMMKQEKLYASQGGPIILSQIENEYGNIDSAYGAAGKSYIKWAAGMALSLDTGVP 214
V + K L +QGGP+I+ Q+ENEYG +YG K+Y++ + L VP
Sbjct: 132 FQ--VLLPKLAPLQITQGGPVIMMQVENEYG----SYGME-KAYLRQTKQIMEELGIEVP 184
Query: 215 WVMCQQSDAPDPIINTCNGFYCDQF--------------------TPNSNNKPKMWTENW 254
+ A + +++ D F T + P M E W
Sbjct: 185 --LFTSDGAWEEVLDAGTLIEEDVFVTGNFGSHSKENAAVLKKFMTRHGKKWPLMCMEYW 242
Query: 255 SGWFLSFGGAVPYRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNFDRTSGG--------P 306
GWF +G V R DLA V G N YM+HGGTNF +G P
Sbjct: 243 DGWFNRWGEPVIQREGTDLAKEVKDMLTVGSL--NLYMFHGGTNFGFYNGCSARGAKDLP 300
Query: 307 FISTSYDYDAPLDEYGLIRQPKWGHLKDLHKAIKLCEAALVATDPTYPSLG 357
+ TSYDYDA L E G + + + KAIK + P LG
Sbjct: 301 QV-TSYDYDALLTEAGEPTEKYYA----VQKAIKEVCPEVWQAQPRTKKLG 346
Score = 55.8 bits (133), Expect = 9e-05, Method: Compositional matrix adjust.
Identities = 57/232 (24%), Positives = 88/232 (37%), Gaps = 45/232 (19%)
Query: 502 LHVQSLGHALHAFINGKLVGSGYGSSSNAKVTVDFPIALAPGKNTFDLLSLTVGLQNYGA 561
L V LH +++G L + Y + ++ L G+ D L+L + ++N G
Sbjct: 403 LKVVEASDRLHIYVDGDLAATQYQETVGEEL-------LISGQTEKDTLALDILVENLGR 455
Query: 562 FYEKTGAGITGPVQLKGSGNGTNIDLSSQQWTYQTGLKGEELNFPSGSSTQWDSKSTLPK 621
G + P Q KG G D+ Q G + L F + D +
Sbjct: 456 V--NYGFKLNNPTQSKGIRGGVMQDIHFHQ-----GYQHYPLTFSQEQLAKIDYTAGKNP 508
Query: 622 LQPLVWYKTTFDAPAGSEPVAIDFTGMGKGEAWVNGQSIGRYWPTYVSQNGGCTDSCNYR 681
LQP +Y+ TF+ ++ ID G GKG VNG +GRYW
Sbjct: 509 LQP-SFYQVTFELEQLAD-TYIDCRGYGKGFVVVNGHHLGRYWEI--------------- 551
Query: 682 GAYSSNKCLKNCGKPSQSLYHVPRSWLKSSGNTLVLFEEIGGDPTKISFVTK 733
P SLY P+ +L+ N +V+FE G + + F +
Sbjct: 552 -------------GPIHSLY-CPKEFLQQGQNEVVIFETEGIEIEYLKFTNQ 589
>gi|424760912|ref|ZP_18188500.1| putative beta-galactosidase [Enterococcus faecalis R508]
gi|402402633|gb|EJV35336.1| putative beta-galactosidase [Enterococcus faecalis R508]
Length = 593
Score = 157 bits (396), Expect = 3e-35, Method: Compositional matrix adjust.
Identities = 114/351 (32%), Positives = 162/351 (46%), Gaps = 44/351 (12%)
Query: 35 VIGGKRRVLISGSIHYPRSTPEMWPDLIQKSKDGGLDVIETYVFWNLHEPVRNQYNFEGR 94
++ G+ +ISG+IHY R TP W D + K G + +ETY+ WN+HEP Y+FEG
Sbjct: 12 LLNGQPIKIISGAIHYFRMTPRQWEDSLYNLKALGANTVETYIPWNIHEPEEGVYDFEGM 71
Query: 95 YDLVKFVKLVAEAGLYAHLRIGPYVCAEWNFGGFPLWLHFIPGIQFRTDNEPFKAEMQRF 154
++ FV+L + L LR Y+CAEW FGG P WL G++ R+ + F +++ +
Sbjct: 72 KNIEAFVRLAEKLNLLVILRPSAYICAEWEFGGLPAWLLKEKGVRLRSTDPIFMTKVRNY 131
Query: 155 TAKIVDMMKQEKLYASQGGPIILSQIENEYGNIDSAYGAAGKSYIKWAAGMALSLDTGVP 214
V + K L +QGGP+I+ Q+ENEYG +YG K+Y++ + L VP
Sbjct: 132 FQ--VLLPKLAPLQITQGGPVIMMQVENEYG----SYGME-KAYLRQTKQIMEELGIEVP 184
Query: 215 WVMCQQSDAPDPIINTCNGFYCDQF--------------------TPNSNNKPKMWTENW 254
+ A + +++ D F T + P M E W
Sbjct: 185 --LFTSDGAWEEVLDAGTLIEEDVFVTGNFGSHSKENAAVLKKFMTRHGKKWPLMCMEYW 242
Query: 255 SGWFLSFGGAVPYRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNFDRTSGG--------P 306
GWF +G V R DLA V G N YM+HGGTNF +G P
Sbjct: 243 DGWFNRWGEPVIQREGTDLAKEVKDMLTVGSL--NLYMFHGGTNFGFYNGCSARGAKDLP 300
Query: 307 FISTSYDYDAPLDEYGLIRQPKWGHLKDLHKAIKLCEAALVATDPTYPSLG 357
+ TSYDYDA L E G + + + KAIK + P LG
Sbjct: 301 QV-TSYDYDALLTEAGEPTEKYYA----VQKAIKEVCPEVWQAQPRTKKLG 346
Score = 55.8 bits (133), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 57/232 (24%), Positives = 88/232 (37%), Gaps = 45/232 (19%)
Query: 502 LHVQSLGHALHAFINGKLVGSGYGSSSNAKVTVDFPIALAPGKNTFDLLSLTVGLQNYGA 561
L V LH +++G L + Y + ++ L G+ D L+L + ++N G
Sbjct: 403 LKVVEASDRLHIYVDGDLAATQYQETVGEEL-------LISGQTEKDTLALDILVENLGR 455
Query: 562 FYEKTGAGITGPVQLKGSGNGTNIDLSSQQWTYQTGLKGEELNFPSGSSTQWDSKSTLPK 621
G + P Q KG G D+ Q G + L F + D +
Sbjct: 456 V--NYGFKLNNPTQSKGIRGGVMQDIHFHQ-----GYQHYPLTFSQEQLAKIDYTAGKNP 508
Query: 622 LQPLVWYKTTFDAPAGSEPVAIDFTGMGKGEAWVNGQSIGRYWPTYVSQNGGCTDSCNYR 681
LQP +Y+ TF+ ++ ID G GKG VNG +GRYW
Sbjct: 509 LQP-SFYQVTFELEQLAD-TYIDCRGYGKGFVVVNGHHLGRYWEI--------------- 551
Query: 682 GAYSSNKCLKNCGKPSQSLYHVPRSWLKSSGNTLVLFEEIGGDPTKISFVTK 733
P SLY P+ +L+ N +V+FE G + + F +
Sbjct: 552 -------------GPIHSLY-CPKEFLQQGQNEVVIFETEGIEIEYLKFTNQ 589
>gi|229548754|ref|ZP_04437479.1| possible beta-galactosidase [Enterococcus faecalis ATCC 29200]
gi|257421063|ref|ZP_05598053.1| glycosyl hydrolase [Enterococcus faecalis X98]
gi|312951816|ref|ZP_07770707.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX0102]
gi|422691033|ref|ZP_16749073.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX0031]
gi|422707894|ref|ZP_16765431.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX0043]
gi|229306094|gb|EEN72090.1| possible beta-galactosidase [Enterococcus faecalis ATCC 29200]
gi|257162887|gb|EEU92847.1| glycosyl hydrolase [Enterococcus faecalis X98]
gi|310630219|gb|EFQ13502.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX0102]
gi|315154243|gb|EFT98259.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX0031]
gi|315154885|gb|EFT98901.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX0043]
Length = 593
Score = 157 bits (396), Expect = 3e-35, Method: Compositional matrix adjust.
Identities = 114/351 (32%), Positives = 162/351 (46%), Gaps = 44/351 (12%)
Query: 35 VIGGKRRVLISGSIHYPRSTPEMWPDLIQKSKDGGLDVIETYVFWNLHEPVRNQYNFEGR 94
++ G+ +ISG+IHY R TP W D + K G + +ETY+ WN+HEP Y+FEG
Sbjct: 12 LLNGQPIKIISGAIHYFRMTPRQWEDSLYNLKALGANTVETYIPWNIHEPEEGVYDFEGM 71
Query: 95 YDLVKFVKLVAEAGLYAHLRIGPYVCAEWNFGGFPLWLHFIPGIQFRTDNEPFKAEMQRF 154
++ FV+L + L LR Y+CAEW FGG P WL G++ R+ + F +++ +
Sbjct: 72 KNIEAFVRLAEKLNLLVILRPSAYICAEWEFGGLPAWLLKEKGVRLRSTDPIFMTKVRNY 131
Query: 155 TAKIVDMMKQEKLYASQGGPIILSQIENEYGNIDSAYGAAGKSYIKWAAGMALSLDTGVP 214
V + K L +QGGP+I+ Q+ENEYG +YG K+Y++ + L VP
Sbjct: 132 FQ--VLLPKLAPLQITQGGPVIMMQVENEYG----SYGME-KAYLRQTKQIMEELGIEVP 184
Query: 215 WVMCQQSDAPDPIINTCNGFYCDQF--------------------TPNSNNKPKMWTENW 254
+ A + +++ D F T + P M E W
Sbjct: 185 --LFTSDGAWEEVLDAGTLIEEDVFVTGNFGSHSKENAAVLKKFMTRHGKKWPLMCMEYW 242
Query: 255 SGWFLSFGGAVPYRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNFDRTSGG--------P 306
GWF +G V R DLA V G N YM+HGGTNF +G P
Sbjct: 243 DGWFNRWGEPVIQREGTDLAKEVKDMLAVGSL--NLYMFHGGTNFGFYNGCSARGAKDLP 300
Query: 307 FISTSYDYDAPLDEYGLIRQPKWGHLKDLHKAIKLCEAALVATDPTYPSLG 357
+ TSYDYDA L E G + + + KAIK + P LG
Sbjct: 301 QV-TSYDYDALLTEAGEPTEKYYA----VQKAIKEVCPEVWQAQPRTKKLG 346
Score = 55.8 bits (133), Expect = 9e-05, Method: Compositional matrix adjust.
Identities = 57/232 (24%), Positives = 88/232 (37%), Gaps = 45/232 (19%)
Query: 502 LHVQSLGHALHAFINGKLVGSGYGSSSNAKVTVDFPIALAPGKNTFDLLSLTVGLQNYGA 561
L V LH +++G L + Y + ++ L G+ D L+L + ++N G
Sbjct: 403 LKVVEASDRLHIYVDGDLAATQYQETVGEEL-------LISGQTEKDTLTLDILVENLGR 455
Query: 562 FYEKTGAGITGPVQLKGSGNGTNIDLSSQQWTYQTGLKGEELNFPSGSSTQWDSKSTLPK 621
G + P Q KG G D+ Q G + L F + D +
Sbjct: 456 V--NYGFKLNNPTQSKGIRGGVMQDIHFHQ-----GYQHYPLTFSQEQLAKIDYTAGKNP 508
Query: 622 LQPLVWYKTTFDAPAGSEPVAIDFTGMGKGEAWVNGQSIGRYWPTYVSQNGGCTDSCNYR 681
LQP +Y+ TF+ ++ ID G GKG VNG +GRYW
Sbjct: 509 LQP-SFYQVTFELEQLAD-TYIDCRGYGKGFVVVNGHHLGRYWEI--------------- 551
Query: 682 GAYSSNKCLKNCGKPSQSLYHVPRSWLKSSGNTLVLFEEIGGDPTKISFVTK 733
P SLY P+ +L+ N +V+FE G + + F +
Sbjct: 552 -------------GPIHSLY-CPKEFLQQGQNEVVIFETEGIEIEYLKFTNQ 589
>gi|227517783|ref|ZP_03947832.1| possible beta-galactosidase [Enterococcus faecalis TX0104]
gi|424678087|ref|ZP_18114931.1| putative beta-galactosidase [Enterococcus faecalis ERV103]
gi|424681129|ref|ZP_18117923.1| putative beta-galactosidase [Enterococcus faecalis ERV116]
gi|424685648|ref|ZP_18122340.1| putative beta-galactosidase [Enterococcus faecalis ERV129]
gi|424689662|ref|ZP_18126226.1| putative beta-galactosidase [Enterococcus faecalis ERV31]
gi|424693525|ref|ZP_18129955.1| putative beta-galactosidase [Enterococcus faecalis ERV37]
gi|424698239|ref|ZP_18134537.1| putative beta-galactosidase [Enterococcus faecalis ERV41]
gi|424701365|ref|ZP_18137539.1| putative beta-galactosidase [Enterococcus faecalis ERV62]
gi|424702750|ref|ZP_18138894.1| putative beta-galactosidase [Enterococcus faecalis ERV63]
gi|424711867|ref|ZP_18144074.1| putative beta-galactosidase [Enterococcus faecalis ERV65]
gi|424717978|ref|ZP_18147248.1| putative beta-galactosidase [Enterococcus faecalis ERV68]
gi|424722429|ref|ZP_18151489.1| putative beta-galactosidase [Enterococcus faecalis ERV72]
gi|424723619|ref|ZP_18152577.1| putative beta-galactosidase [Enterococcus faecalis ERV73]
gi|424733091|ref|ZP_18161660.1| putative beta-galactosidase [Enterococcus faecalis ERV81]
gi|424746203|ref|ZP_18174452.1| putative beta-galactosidase [Enterococcus faecalis ERV85]
gi|424755204|ref|ZP_18183090.1| putative beta-galactosidase [Enterococcus faecalis ERV93]
gi|227074744|gb|EEI12707.1| possible beta-galactosidase [Enterococcus faecalis TX0104]
gi|402351976|gb|EJU86842.1| putative beta-galactosidase [Enterococcus faecalis ERV116]
gi|402352513|gb|EJU87362.1| putative beta-galactosidase [Enterococcus faecalis ERV103]
gi|402358223|gb|EJU92905.1| putative beta-galactosidase [Enterococcus faecalis ERV129]
gi|402367111|gb|EJV01460.1| putative beta-galactosidase [Enterococcus faecalis ERV31]
gi|402371797|gb|EJV05943.1| putative beta-galactosidase [Enterococcus faecalis ERV62]
gi|402373001|gb|EJV07093.1| putative beta-galactosidase [Enterococcus faecalis ERV41]
gi|402373959|gb|EJV08006.1| putative beta-galactosidase [Enterococcus faecalis ERV37]
gi|402382684|gb|EJV16335.1| putative beta-galactosidase [Enterococcus faecalis ERV65]
gi|402383232|gb|EJV16843.1| putative beta-galactosidase [Enterococcus faecalis ERV68]
gi|402386182|gb|EJV19689.1| putative beta-galactosidase [Enterococcus faecalis ERV63]
gi|402388743|gb|EJV22170.1| putative beta-galactosidase [Enterococcus faecalis ERV72]
gi|402392403|gb|EJV25665.1| putative beta-galactosidase [Enterococcus faecalis ERV81]
gi|402397550|gb|EJV30559.1| putative beta-galactosidase [Enterococcus faecalis ERV73]
gi|402397571|gb|EJV30579.1| putative beta-galactosidase [Enterococcus faecalis ERV85]
gi|402401167|gb|EJV33955.1| putative beta-galactosidase [Enterococcus faecalis ERV93]
Length = 593
Score = 157 bits (396), Expect = 3e-35, Method: Compositional matrix adjust.
Identities = 114/351 (32%), Positives = 162/351 (46%), Gaps = 44/351 (12%)
Query: 35 VIGGKRRVLISGSIHYPRSTPEMWPDLIQKSKDGGLDVIETYVFWNLHEPVRNQYNFEGR 94
++ G+ +ISG+IHY R TP W D + K G + +ETY+ WN+HEP Y+FEG
Sbjct: 12 LLNGQPIKIISGAIHYFRMTPRQWEDSLYNLKALGANTVETYIPWNIHEPEEGVYDFEGM 71
Query: 95 YDLVKFVKLVAEAGLYAHLRIGPYVCAEWNFGGFPLWLHFIPGIQFRTDNEPFKAEMQRF 154
++ FV+L + L LR Y+CAEW FGG P WL G++ R+ + F +++ +
Sbjct: 72 KNIEAFVRLAEKLNLLVILRPSAYICAEWEFGGLPAWLLKEKGVRLRSTDPIFMTKVRNY 131
Query: 155 TAKIVDMMKQEKLYASQGGPIILSQIENEYGNIDSAYGAAGKSYIKWAAGMALSLDTGVP 214
V + K L +QGGP+I+ Q+ENEYG +YG K+Y++ + L VP
Sbjct: 132 FQ--VLLPKLAPLQITQGGPVIMMQVENEYG----SYGME-KAYLRQTKQIMEELGIEVP 184
Query: 215 WVMCQQSDAPDPIINTCNGFYCDQF--------------------TPNSNNKPKMWTENW 254
+ A + +++ D F T + P M E W
Sbjct: 185 --LFTSDGAWEEVLDAGTLIEEDVFVTGNFGSHSKENAAVLKKFMTRHGKKWPLMCMEYW 242
Query: 255 SGWFLSFGGAVPYRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNFDRTSGG--------P 306
GWF +G V R DLA V G N YM+HGGTNF +G P
Sbjct: 243 DGWFNRWGEPVIQREGTDLAKEVKDMLTVGSL--NLYMFHGGTNFGFYNGCSARGAKDLP 300
Query: 307 FISTSYDYDAPLDEYGLIRQPKWGHLKDLHKAIKLCEAALVATDPTYPSLG 357
+ TSYDYDA L E G + + + KAIK + P LG
Sbjct: 301 QV-TSYDYDALLTEAGEPTEKYYA----VQKAIKEVCPEVWQAQPRTKKLG 346
Score = 55.8 bits (133), Expect = 9e-05, Method: Compositional matrix adjust.
Identities = 57/232 (24%), Positives = 88/232 (37%), Gaps = 45/232 (19%)
Query: 502 LHVQSLGHALHAFINGKLVGSGYGSSSNAKVTVDFPIALAPGKNTFDLLSLTVGLQNYGA 561
L V LH +++G L + Y + ++ L G+ D L+L + ++N G
Sbjct: 403 LKVVEASDRLHIYVDGDLAATQYQETVGEEL-------LISGQTEKDTLALDILVENLGR 455
Query: 562 FYEKTGAGITGPVQLKGSGNGTNIDLSSQQWTYQTGLKGEELNFPSGSSTQWDSKSTLPK 621
G + P Q KG G D+ Q G + L F + D +
Sbjct: 456 V--NYGFKLNNPTQSKGIRGGVMQDIHFHQ-----GYQHYPLTFSQEQLAKIDYTAGKNP 508
Query: 622 LQPLVWYKTTFDAPAGSEPVAIDFTGMGKGEAWVNGQSIGRYWPTYVSQNGGCTDSCNYR 681
LQP +Y+ TF+ ++ ID G GKG VNG +GRYW
Sbjct: 509 LQP-SFYQVTFELEQLAD-TYIDCRGYGKGFVVVNGHHLGRYWEI--------------- 551
Query: 682 GAYSSNKCLKNCGKPSQSLYHVPRSWLKSSGNTLVLFEEIGGDPTKISFVTK 733
P SLY P+ +L+ N +V+FE G + + F +
Sbjct: 552 -------------GPIHSLY-CPKEFLQQGQNEVVIFETEGIEIEYLKFTNQ 589
>gi|257415380|ref|ZP_05592374.1| beta-galactosidase [Enterococcus faecalis ARO1/DG]
gi|257157208|gb|EEU87168.1| beta-galactosidase [Enterococcus faecalis ARO1/DG]
Length = 593
Score = 156 bits (395), Expect = 3e-35, Method: Compositional matrix adjust.
Identities = 114/351 (32%), Positives = 162/351 (46%), Gaps = 44/351 (12%)
Query: 35 VIGGKRRVLISGSIHYPRSTPEMWPDLIQKSKDGGLDVIETYVFWNLHEPVRNQYNFEGR 94
++ G+ +ISG+IHY R TP W D + K G + +ETY+ WN+HEP Y+FEG
Sbjct: 12 LLNGQPIKIISGAIHYFRMTPRQWEDSLYNLKALGANTVETYIPWNIHEPEEGVYDFEGM 71
Query: 95 YDLVKFVKLVAEAGLYAHLRIGPYVCAEWNFGGFPLWLHFIPGIQFRTDNEPFKAEMQRF 154
++ FV+L + L LR Y+CAEW FGG P WL G++ R+ + F +++ +
Sbjct: 72 KNIEAFVRLAEKLNLLVILRPSAYICAEWEFGGLPAWLLKEKGVRLRSTDPIFMTKVRNY 131
Query: 155 TAKIVDMMKQEKLYASQGGPIILSQIENEYGNIDSAYGAAGKSYIKWAAGMALSLDTGVP 214
V + K L +QGGP+I+ Q+ENEYG +YG K+Y++ + L VP
Sbjct: 132 FQ--VLLPKLAPLQITQGGPVIMMQVENEYG----SYGME-KAYLRQTKQIMEELGIEVP 184
Query: 215 WVMCQQSDAPDPIINTCNGFYCDQF--------------------TPNSNNKPKMWTENW 254
+ A + +++ D F T + P M E W
Sbjct: 185 --LFTSDGAWEEVLDAGTLIEEDVFVTGNFGSHSKENAAVLKKFMTRHGKKWPLMCMEYW 242
Query: 255 SGWFLSFGGAVPYRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNFDRTSGG--------P 306
GWF +G V R DLA V G N YM+HGGTNF +G P
Sbjct: 243 DGWFNRWGEPVIQREGTDLAKEVKDMLAVGSL--NLYMFHGGTNFGFYNGCSARGAKDLP 300
Query: 307 FISTSYDYDAPLDEYGLIRQPKWGHLKDLHKAIKLCEAALVATDPTYPSLG 357
+ TSYDYDA L E G + + + KAIK + P LG
Sbjct: 301 QV-TSYDYDALLTEAGEPTEKYYA----VQKAIKEVCPEVWQAQPRTKKLG 346
Score = 55.5 bits (132), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 57/232 (24%), Positives = 88/232 (37%), Gaps = 45/232 (19%)
Query: 502 LHVQSLGHALHAFINGKLVGSGYGSSSNAKVTVDFPIALAPGKNTFDLLSLTVGLQNYGA 561
L V LH +++G L + Y + ++ L G+ D L+L + ++N G
Sbjct: 403 LKVVEASDRLHIYVDGDLAATQYQETVGEEL-------LISGQTEKDTLALDILVENLGR 455
Query: 562 FYEKTGAGITGPVQLKGSGNGTNIDLSSQQWTYQTGLKGEELNFPSGSSTQWDSKSTLPK 621
G + P Q KG G D+ Q G + L F + D +
Sbjct: 456 V--NYGFKLNNPTQSKGIRGGVMQDIHFHQ-----GYQHYPLTFSQEQLAKIDYTAGKNP 508
Query: 622 LQPLVWYKTTFDAPAGSEPVAIDFTGMGKGEAWVNGQSIGRYWPTYVSQNGGCTDSCNYR 681
LQP +Y+ TF+ ++ ID G GKG VNG +GRYW
Sbjct: 509 LQP-SFYQVTFELEQLAD-TYIDCRGYGKGFVVVNGHHLGRYWEI--------------- 551
Query: 682 GAYSSNKCLKNCGKPSQSLYHVPRSWLKSSGNTLVLFEEIGGDPTKISFVTK 733
P SLY P+ +L+ N +V+FE G + + F +
Sbjct: 552 -------------GPIHSLY-CPKEFLQQGQNEVVIFETEGINIEYLKFTNQ 589
>gi|422727867|ref|ZP_16784288.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX0012]
gi|315151617|gb|EFT95633.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX0012]
Length = 593
Score = 156 bits (395), Expect = 4e-35, Method: Compositional matrix adjust.
Identities = 114/351 (32%), Positives = 162/351 (46%), Gaps = 44/351 (12%)
Query: 35 VIGGKRRVLISGSIHYPRSTPEMWPDLIQKSKDGGLDVIETYVFWNLHEPVRNQYNFEGR 94
++ G+ +ISG+IHY R TP W D + K G + +ETY+ WN+HEP Y+FEG
Sbjct: 12 LLNGQPIKIISGAIHYFRMTPRQWEDSLYNLKALGANTVETYIPWNIHEPEEGVYDFEGM 71
Query: 95 YDLVKFVKLVAEAGLYAHLRIGPYVCAEWNFGGFPLWLHFIPGIQFRTDNEPFKAEMQRF 154
++ FV+L + L LR Y+CAEW FGG P WL G++ R+ + F +++ +
Sbjct: 72 KNIEAFVRLAEKLNLLVILRPSAYICAEWEFGGLPAWLLKEKGVRLRSTDPIFMTKVRNY 131
Query: 155 TAKIVDMMKQEKLYASQGGPIILSQIENEYGNIDSAYGAAGKSYIKWAAGMALSLDTGVP 214
V + K L +QGGP+I+ Q+ENEYG +YG K+Y++ + L VP
Sbjct: 132 FQ--VLLPKLAPLQITQGGPVIMMQVENEYG----SYGME-KAYLRQTKQIMEELGIEVP 184
Query: 215 WVMCQQSDAPDPIINTCNGFYCDQF--------------------TPNSNNKPKMWTENW 254
+ A + +++ D F T + P M E W
Sbjct: 185 --LFTSDGAWEEVLDAGTLIEEDVFVTGNFGSHSKENAAVLKKFMTRHGKKWPLMCMEYW 242
Query: 255 SGWFLSFGGAVPYRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNFDRTSGG--------P 306
GWF +G V R DLA V G N YM+HGGTNF +G P
Sbjct: 243 DGWFNRWGEPVIQREGTDLAKEVKDMLAVGSL--NLYMFHGGTNFGFYNGCSARGAKDLP 300
Query: 307 FISTSYDYDAPLDEYGLIRQPKWGHLKDLHKAIKLCEAALVATDPTYPSLG 357
+ TSYDYDA L E G + + + KAIK + P LG
Sbjct: 301 QV-TSYDYDALLTEAGEPTEKYYA----VQKAIKEVCPEVWQAQPRTKKLG 346
Score = 52.8 bits (125), Expect = 7e-04, Method: Compositional matrix adjust.
Identities = 56/232 (24%), Positives = 87/232 (37%), Gaps = 45/232 (19%)
Query: 502 LHVQSLGHALHAFINGKLVGSGYGSSSNAKVTVDFPIALAPGKNTFDLLSLTVGLQNYGA 561
L V LH +++G L + Y + ++ L G+ D +L + ++N G
Sbjct: 403 LKVVEASDRLHIYVDGDLAATQYQETVGEEL-------LISGQTEKDTHALDILVENLGR 455
Query: 562 FYEKTGAGITGPVQLKGSGNGTNIDLSSQQWTYQTGLKGEELNFPSGSSTQWDSKSTLPK 621
G + P Q KG G D+ Q G + L F + D +
Sbjct: 456 V--NYGFKLNNPTQSKGIRGGVMQDIHFHQ-----GYQHYPLTFSQEQLAKIDYTAGKNP 508
Query: 622 LQPLVWYKTTFDAPAGSEPVAIDFTGMGKGEAWVNGQSIGRYWPTYVSQNGGCTDSCNYR 681
LQP +Y+ TF+ ++ ID G GKG VNG +GRYW
Sbjct: 509 LQP-SFYQVTFELEQLAD-TYIDCRGYGKGFVVVNGHHLGRYWEI--------------- 551
Query: 682 GAYSSNKCLKNCGKPSQSLYHVPRSWLKSSGNTLVLFEEIGGDPTKISFVTK 733
P SLY P+ +L+ N +V+FE G + + F +
Sbjct: 552 -------------GPIHSLY-CPKEFLQQGQNEVVIFETEGINIEYLKFTNQ 589
>gi|148231352|ref|NP_001080304.1| galactosidase, beta 1-like 2 [Xenopus laevis]
gi|28422231|gb|AAH46858.1| Loc89944-prov protein [Xenopus laevis]
Length = 634
Score = 156 bits (395), Expect = 4e-35, Method: Compositional matrix adjust.
Identities = 104/299 (34%), Positives = 152/299 (50%), Gaps = 30/299 (10%)
Query: 43 LISGSIHYPRSTPEMWPDLIQKSKDGGLDVIETYVFWNLHEPVRNQYNFEGRYDLVKFVK 102
++ GS+HY R W D ++K K G++ + TYV WNLHEP + +++F D+ +F+
Sbjct: 60 ILGGSMHYFRVPMPYWRDRMKKMKACGINTLTTYVPWNLHEPRKGKFDFSKDLDISEFLA 119
Query: 103 LVAEAGLYAHLRIGPYVCAEWNFGGFPLWLHFIPGIQFRTDNEPFKAEMQRFTAKIVDMM 162
+ +E GL+ LR GPY+CAEW+ GG P WL ++ RT F + + +++
Sbjct: 120 IASEMGLWVILRPGPYICAEWDLGGLPSWLLRDKDMKLRTTYRGFTEATEAYLDELIP-- 177
Query: 163 KQEKLYASQGGPIILSQIENEYGNIDSAYGAAGKSYIKWAAGMALSLDTGVPWVMCQQSD 222
+ K S GGPII Q+ENEYG+ A A +IK A ++ G+ ++ SD
Sbjct: 178 RIAKYQYSNGGPIIAVQVENEYGSY--AKDANYMEFIKNAL-----VEKGIVELLL-TSD 229
Query: 223 APD-----PIINTCNGFYCDQFTP------NS--NNKPKMWTENWSGWFLSFGGAVPYRP 269
D + N + P NS +NKP M E W+GWF +GG
Sbjct: 230 NKDGLSSGSLENVLATVNFQKIEPVLFSYLNSIQSNKPVMVMEFWTGWFDYWGGKHHIFD 289
Query: 270 VEDLAFAVARFFQRGGTFQNYYMYHGGTNFDRTSGGPFIS------TSYDYDAPLDEYG 322
V+++ V+ RG + N YM+HGGTNF +G TSYDYDAPL E G
Sbjct: 290 VDEMISTVSEVLNRGASI-NLYMFHGGTNFGFMNGALHFHEYRPDITSYDYDAPLTEAG 347
>gi|255971270|ref|ZP_05421856.1| beta-galactosidase [Enterococcus faecalis T1]
gi|255962288|gb|EET94764.1| beta-galactosidase [Enterococcus faecalis T1]
Length = 593
Score = 156 bits (395), Expect = 4e-35, Method: Compositional matrix adjust.
Identities = 114/351 (32%), Positives = 162/351 (46%), Gaps = 44/351 (12%)
Query: 35 VIGGKRRVLISGSIHYPRSTPEMWPDLIQKSKDGGLDVIETYVFWNLHEPVRNQYNFEGR 94
++ G+ +ISG+IHY R TP W D + K G + +ETY+ WN+HEP Y+FEG
Sbjct: 12 LLNGQPIKIISGAIHYFRMTPRQWEDSLYNLKALGANTVETYIPWNIHEPEEGVYDFEGM 71
Query: 95 YDLVKFVKLVAEAGLYAHLRIGPYVCAEWNFGGFPLWLHFIPGIQFRTDNEPFKAEMQRF 154
++ FV+L + L LR Y+CAEW FGG P WL G++ R+ + F +++ +
Sbjct: 72 KNIEAFVRLAEKLNLLVILRPSAYICAEWEFGGLPAWLLKEKGVRLRSTDPIFMTKVRNY 131
Query: 155 TAKIVDMMKQEKLYASQGGPIILSQIENEYGNIDSAYGAAGKSYIKWAAGMALSLDTGVP 214
V + K L +QGGP+I+ Q+ENEYG +YG K+Y++ + L VP
Sbjct: 132 FQ--VLLPKLAPLQITQGGPVIMMQVENEYG----SYGME-KAYLRQTRQIMEELGIEVP 184
Query: 215 WVMCQQSDAPDPIINTCNGFYCDQF--------------------TPNSNNKPKMWTENW 254
+ A + +++ D F T + P M E W
Sbjct: 185 --LFTSDGAWEEVLDAGTLIEEDVFVTGNFGSHSKENAAVLKKFMTRHGKKWPLMCMEYW 242
Query: 255 SGWFLSFGGAVPYRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNFDRTSGG--------P 306
GWF +G V R DLA V G N YM+HGGTNF +G P
Sbjct: 243 DGWFNRWGEPVIQREGTDLAKEVKDMLTVGSL--NLYMFHGGTNFGFYNGCSARGAKDLP 300
Query: 307 FISTSYDYDAPLDEYGLIRQPKWGHLKDLHKAIKLCEAALVATDPTYPSLG 357
+ TSYDYDA L E G + + + KAIK + P LG
Sbjct: 301 QV-TSYDYDALLTEAGEPTEKYYA----VQKAIKEVCPEVWQAQPRTKKLG 346
Score = 55.5 bits (132), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 57/232 (24%), Positives = 88/232 (37%), Gaps = 45/232 (19%)
Query: 502 LHVQSLGHALHAFINGKLVGSGYGSSSNAKVTVDFPIALAPGKNTFDLLSLTVGLQNYGA 561
L V LH +++G L + Y + ++ L G+ D L+L + ++N G
Sbjct: 403 LKVVEASDRLHIYVDGDLAATQYQETVGEEL-------LISGQTEKDTLALDILVENLGR 455
Query: 562 FYEKTGAGITGPVQLKGSGNGTNIDLSSQQWTYQTGLKGEELNFPSGSSTQWDSKSTLPK 621
G + P Q KG G D+ Q G + L F + D +
Sbjct: 456 V--NYGFKLNNPTQSKGIRGGVMQDIHFHQ-----GYQHYPLTFSQEQLAKIDYTAGKNP 508
Query: 622 LQPLVWYKTTFDAPAGSEPVAIDFTGMGKGEAWVNGQSIGRYWPTYVSQNGGCTDSCNYR 681
LQP +Y+ TF+ ++ ID G GKG VNG +GRYW
Sbjct: 509 LQP-SFYQVTFELEQLAD-TYIDCRGYGKGFVVVNGHHLGRYWEI--------------- 551
Query: 682 GAYSSNKCLKNCGKPSQSLYHVPRSWLKSSGNTLVLFEEIGGDPTKISFVTK 733
P SLY P+ +L+ N +V+FE G + + F +
Sbjct: 552 -------------GPIHSLY-CPKEFLQQGQNEVVIFETEGINIEYLKFTNQ 589
>gi|255973889|ref|ZP_05424475.1| beta-galactosidase [Enterococcus faecalis T2]
gi|307284354|ref|ZP_07564519.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX0860]
gi|255966761|gb|EET97383.1| beta-galactosidase [Enterococcus faecalis T2]
gi|306503294|gb|EFM72546.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX0860]
Length = 593
Score = 156 bits (395), Expect = 4e-35, Method: Compositional matrix adjust.
Identities = 114/351 (32%), Positives = 162/351 (46%), Gaps = 44/351 (12%)
Query: 35 VIGGKRRVLISGSIHYPRSTPEMWPDLIQKSKDGGLDVIETYVFWNLHEPVRNQYNFEGR 94
++ G+ +ISG+IHY R TP W D + K G + +ETY+ WN+HEP Y+FEG
Sbjct: 12 LLNGQPIKIISGAIHYFRMTPRQWEDSLYNLKALGANTVETYIPWNIHEPEEGVYDFEGM 71
Query: 95 YDLVKFVKLVAEAGLYAHLRIGPYVCAEWNFGGFPLWLHFIPGIQFRTDNEPFKAEMQRF 154
++ FV+L + L LR Y+CAEW FGG P WL G++ R+ + F +++ +
Sbjct: 72 KNIEAFVRLAEKLNLLVILRPSAYICAEWEFGGLPAWLLKEKGVRLRSTDPIFMTKVRNY 131
Query: 155 TAKIVDMMKQEKLYASQGGPIILSQIENEYGNIDSAYGAAGKSYIKWAAGMALSLDTGVP 214
V + K L +QGGP+I+ Q+ENEYG +YG K+Y++ + L VP
Sbjct: 132 FQ--VLLPKLAPLQITQGGPVIMMQVENEYG----SYGME-KAYLRQTRQIMEELGIEVP 184
Query: 215 WVMCQQSDAPDPIINTCNGFYCDQF--------------------TPNSNNKPKMWTENW 254
+ A + +++ D F T + P M E W
Sbjct: 185 --LFTSDGAWEEVLDAGTLIEEDVFVTGNFGSHSKENAAVLKKFMTRHGKKWPLMCMEYW 242
Query: 255 SGWFLSFGGAVPYRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNFDRTSGG--------P 306
GWF +G V R DLA V G N YM+HGGTNF +G P
Sbjct: 243 DGWFNRWGEPVIQREGTDLAKEVKDMLTVGSL--NLYMFHGGTNFGFYNGCSARGAKDLP 300
Query: 307 FISTSYDYDAPLDEYGLIRQPKWGHLKDLHKAIKLCEAALVATDPTYPSLG 357
+ TSYDYDA L E G + + + KAIK + P LG
Sbjct: 301 QV-TSYDYDALLTEAGEPTEKYYA----VQKAIKEVCPEVWQAQPRTKKLG 346
Score = 57.8 bits (138), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 58/232 (25%), Positives = 88/232 (37%), Gaps = 45/232 (19%)
Query: 502 LHVQSLGHALHAFINGKLVGSGYGSSSNAKVTVDFPIALAPGKNTFDLLSLTVGLQNYGA 561
L V LH +++G L + Y + ++ L G+ D L+L + ++N G
Sbjct: 403 LKVVEASDRLHIYVDGDLAATQYQETVGEEL-------LISGQTEKDTLALDILVENLGR 455
Query: 562 FYEKTGAGITGPVQLKGSGNGTNIDLSSQQWTYQTGLKGEELNFPSGSSTQWDSKSTLPK 621
G + P Q KG G D+ Q G + L F + D +
Sbjct: 456 V--NYGFKLNNPTQSKGIRGGVMQDIHFHQ-----GYQHYPLTFSQEQLAKIDYTAGKNP 508
Query: 622 LQPLVWYKTTFDAPAGSEPVAIDFTGMGKGEAWVNGQSIGRYWPTYVSQNGGCTDSCNYR 681
LQP +Y+ TF+ ++ ID G GKG VNG +GRYW
Sbjct: 509 LQP-SFYQVTFELEQLAD-TYIDCRGYGKGFVVVNGHHLGRYWEI--------------- 551
Query: 682 GAYSSNKCLKNCGKPSQSLYHVPRSWLKSSGNTLVLFEEIGGDPTKISFVTK 733
P SLY P+ +L+ N +V+FE G D + F +
Sbjct: 552 -------------GPIHSLY-CPKEFLQQGQNEVVIFETEGIDIEYLKFTNQ 589
>gi|449493221|ref|XP_002196735.2| PREDICTED: beta-galactosidase [Taeniopygia guttata]
Length = 636
Score = 156 bits (395), Expect = 4e-35, Method: Compositional matrix adjust.
Identities = 119/356 (33%), Positives = 162/356 (45%), Gaps = 38/356 (10%)
Query: 22 SFGANVTYDHRAVVIGGKRRVLISGSIHYPRSTPEMWPDLIQKSKDGGLDVIETYVFWNL 81
SFG + YD V GK ISGSIHY R P W D + K K GLD I+TYV WN
Sbjct: 8 SFG--IDYDSNCFVKDGKPFRYISGSIHYSRVPPYYWKDRLLKMKMAGLDAIQTYVPWNY 65
Query: 82 HEPVRNQYNFEGRYDLVKFVKLVAEAGLYAHLRIGPYVCAEWNFGGFPLWLHFIPGIQFR 141
HEP Y+F G DL F++L + GL LR GPY+CAEW+ GG P WL I R
Sbjct: 66 HEPQMGTYDFFGGKDLQYFLQLANDTGLLVILRAGPYICAEWDMGGLPAWLLEKKSIVLR 125
Query: 142 TDNEPFKAEMQRFTAKIVDMMKQEKLYASQGGPIILSQIENEYGNI---DSAYGAAGKSY 198
+ + + ++R+ ++ M+ LY GGPII+ Q+ENEYG+ D Y
Sbjct: 126 SSDSDYLEAVERWMGVLLPKMR-PYLY-QNGGPIIMVQVENEYGSYFACDYNYLRFLLKL 183
Query: 199 IKWAAGMALSLDTGVPWVMCQQSDAPDPIINTC---NGFYCD-QFTPNSN---------- 244
+ G + L T +D C G Y F P +N
Sbjct: 184 FRLHLGDEVVLFT---------TDGASQFHLKCGALQGLYATVDFAPGANVTAAFLAQRS 234
Query: 245 ---NKPKMWTENWSGWFLSFGGAVPYRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNFDR 301
P + +E ++GW +G P + +A + G N YM+ GGTNF
Sbjct: 235 SEPKGPLVNSEFYTGWLDHWGHHHSVVPAQTIAKTLNEILASGANV-NLYMFIGGTNFAY 293
Query: 302 TSGG--PFI--STSYDYDAPLDEYGLIRQPKWGHLKDLHKAIKLCEAALVATDPTY 353
+G P++ TSYDYDAPL E G + + + K + +L E T P +
Sbjct: 294 WNGANMPYMPQPTSYDYDAPLSEAGDLTEKYFALRKVIGMYKQLPEGLTPPTTPKF 349
Score = 41.6 bits (96), Expect = 1.9, Method: Compositional matrix adjust.
Identities = 38/138 (27%), Positives = 57/138 (41%), Gaps = 38/138 (27%)
Query: 627 WYKTTFDAPAG----SEPVAIDFTGMGKGEAWVNGQSIGRYWPTYVSQNGGCTDSCNYRG 682
+Y T P G + ++F G KG+ W+NG ++GRYWP RG
Sbjct: 523 FYTGTLSIPGGIPDLPQDTYVNFPGWTKGQIWINGFNLGRYWPA--------------RG 568
Query: 683 AYSSNKCLKNCGKPSQSLYHVPRSWLKSSGNTLVLFEEIGGDP-----TKISFVTKQLGS 737
P +LY VPR+ L +S + E+ P +I FV + +
Sbjct: 569 -------------PQLTLY-VPRNVLVASAPNNITVLELERSPCSTQACEIEFVDEPNIN 614
Query: 738 SLCSHVTDSHPLPV-DMW 754
+ H TD PL V ++W
Sbjct: 615 ATLQHETDKPPLFVRELW 632
>gi|312901648|ref|ZP_07760918.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX0470]
gi|311291259|gb|EFQ69815.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX0470]
Length = 593
Score = 156 bits (395), Expect = 4e-35, Method: Compositional matrix adjust.
Identities = 114/351 (32%), Positives = 162/351 (46%), Gaps = 44/351 (12%)
Query: 35 VIGGKRRVLISGSIHYPRSTPEMWPDLIQKSKDGGLDVIETYVFWNLHEPVRNQYNFEGR 94
++ G+ +ISG+IHY R TP W D + K G + +ETY+ WN+HEP Y+FEG
Sbjct: 12 LLNGQPIKIISGAIHYFRMTPSQWEDSLYNLKALGANTVETYIPWNIHEPEEGVYDFEGM 71
Query: 95 YDLVKFVKLVAEAGLYAHLRIGPYVCAEWNFGGFPLWLHFIPGIQFRTDNEPFKAEMQRF 154
++ FV+L + L LR Y+CAEW FGG P WL G++ R+ + F +++ +
Sbjct: 72 KNIEAFVRLAEKLNLLVILRPSAYICAEWEFGGLPAWLLKEKGVRLRSTDPIFMTKVRNY 131
Query: 155 TAKIVDMMKQEKLYASQGGPIILSQIENEYGNIDSAYGAAGKSYIKWAAGMALSLDTGVP 214
V + K L +QGGP+I+ Q+ENEYG +YG K+Y++ + L VP
Sbjct: 132 FQ--VLLPKLAPLQITQGGPVIMMQVENEYG----SYGME-KAYLRQTKQIMEELGIEVP 184
Query: 215 WVMCQQSDAPDPIINTCNGFYCDQF--------------------TPNSNNKPKMWTENW 254
+ A + +++ D F T + P M E W
Sbjct: 185 --LFTSDGAWEEVLDAGTLIEEDVFVTGNFGSHSKENAAVLKKFMTRHGKKWPLMCMEYW 242
Query: 255 SGWFLSFGGAVPYRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNFDRTSGG--------P 306
GWF +G V R DLA V G N YM+HGGTNF +G P
Sbjct: 243 DGWFNRWGEPVIQREGTDLAKEVKDMLAVGSL--NLYMFHGGTNFGFYNGCSARGAKDLP 300
Query: 307 FISTSYDYDAPLDEYGLIRQPKWGHLKDLHKAIKLCEAALVATDPTYPSLG 357
+ TSYDYDA L E G + + + KAIK + P LG
Sbjct: 301 QV-TSYDYDALLTEAGEPTEKYYA----VQKAIKEVCPEVWQAQPRTKKLG 346
Score = 55.5 bits (132), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 57/232 (24%), Positives = 88/232 (37%), Gaps = 45/232 (19%)
Query: 502 LHVQSLGHALHAFINGKLVGSGYGSSSNAKVTVDFPIALAPGKNTFDLLSLTVGLQNYGA 561
L V LH +++G L + Y + ++ L G+ D L+L + ++N G
Sbjct: 403 LKVVEASDRLHIYVDGDLAATQYQETVGEEL-------LISGQTEKDTLALDILVENLGR 455
Query: 562 FYEKTGAGITGPVQLKGSGNGTNIDLSSQQWTYQTGLKGEELNFPSGSSTQWDSKSTLPK 621
G + P Q KG G D+ Q G + L F + D +
Sbjct: 456 V--NYGFKLNNPTQSKGIRGGVMQDIHFHQ-----GYQHYPLTFSQEQLAKIDYTAGKNP 508
Query: 622 LQPLVWYKTTFDAPAGSEPVAIDFTGMGKGEAWVNGQSIGRYWPTYVSQNGGCTDSCNYR 681
LQP +Y+ TF+ ++ ID G GKG VNG +GRYW
Sbjct: 509 LQP-SFYQVTFELEQLAD-TYIDCRGYGKGFVVVNGHHLGRYWEI--------------- 551
Query: 682 GAYSSNKCLKNCGKPSQSLYHVPRSWLKSSGNTLVLFEEIGGDPTKISFVTK 733
P SLY P+ +L+ N +V+FE G + + F +
Sbjct: 552 -------------GPIHSLY-CPKEFLQQGQNEVVIFETEGINIEYLKFTNQ 589
>gi|322703307|gb|EFY94918.1| beta-calactosidase, putative [Metarhizium anisopliae ARSEF 23]
Length = 645
Score = 156 bits (395), Expect = 4e-35, Method: Compositional matrix adjust.
Identities = 109/328 (33%), Positives = 162/328 (49%), Gaps = 43/328 (13%)
Query: 26 NVTYDHRAVVIGGKRRVLISGSIHYPRSTPEMWPDLIQKSKDGGLDVIETYVFWNLHEPV 85
N TYD ++ G LI G + R P W +Q +K GL+ I +YVFWN EP
Sbjct: 33 NFTYDRHNFLLDGVPIQLIGGQMDPQRIPPAYWTQRLQMAKAMGLNTIFSYVFWNNIEPT 92
Query: 86 RNQYNFEGRYDLVKFVKLVAEAGLYAHLRIGPYVCAEWNFGGFPLWLHFIPGIQFRTDNE 145
++F+GR D+ +F++L + GLY LR GPY+C E +GGFP WL IPG+ R +N+
Sbjct: 93 EGSWDFDGRNDIARFLRLAQQEGLYVVLRPGPYICGEHEWGGFPSWLAQIPGMAVRQNNK 152
Query: 146 PFKAEMQRFTAKIVDMMKQEKLYASQGGPIILSQIENEYGNI--DSAYGAAGKSYIKW-- 201
PF + + ++ + + SQGGP++++Q+ENEYG+ D AY A +K
Sbjct: 153 PFLDASRNYLEQLGKHLAATHI--SQGGPVLMTQLENEYGSFGKDKAYLRAMADMLKANF 210
Query: 202 -------AAGMALSLDTGVPWVMCQQSDAPDPIINTCNGFYC-DQFTPNSNN-KPKMWTE 252
G LD G + ++D DP GF DQ+ + P++ E
Sbjct: 211 DGFLYTNDGGGKSYLDGGSLHGILAETDG-DP----KTGFAARDQYVTDPTMLGPQLDGE 265
Query: 253 NWSGWFLSFGGAVPY-----------RPVEDLAFAVARFFQRGGTFQNYYMYHGGTNFDR 301
+ W + PY R ++DL + +A G + YM+HGGTN+
Sbjct: 266 YYVTWIDDWSSNSPYQYTSGRPDATKRVLDDLDWILA-----GNNSFSIYMFHGGTNWGF 320
Query: 302 TSGGPF-------ISTSYDYDAPLDEYG 322
+GG + ++TSYDY APLDE G
Sbjct: 321 ENGGIWVDNRLNAVTTSYDYGAPLDESG 348
Score = 39.3 bits (90), Expect = 9.2, Method: Compositional matrix adjust.
Identities = 47/164 (28%), Positives = 68/164 (41%), Gaps = 35/164 (21%)
Query: 514 FINGKLVGS-GYGSSSNAKVTVDFPIALAPGKNTFDLLSLTVGLQNYGAFYEKTGAGITG 572
++NG VG ++ A V+VD + LL +G +YG + GI G
Sbjct: 448 YVNGARVGVVDKTHAAPASVSVDLKQG-----DVLQLLVENLGRIDYGQQLREQQKGIVG 502
Query: 573 PVQLKGSGNGTNIDLSSQQWT-YQTGLKGEELNFPSGSSTQWDSKSTLPKLQ---PLVWY 628
V + G D + W+ Y L + P+ + D S P+++ V+Y
Sbjct: 503 NVTVGG-------DAILEGWSAYSLPLT----DLPAALA---DENSETPEIKDGGAPVFY 548
Query: 629 KTTFDAPAGSEPVAIDFTGMG--------KGEAWVNGQSIGRYW 664
K TF PAG V D +G KG WVNG +GRYW
Sbjct: 549 KGTFGLPAG---VGNDLSGDTFLSLPNGVKGSVWVNGHHLGRYW 589
>gi|227554928|ref|ZP_03984975.1| possible beta-galactosidase [Enterococcus faecalis HH22]
gi|422713751|ref|ZP_16770500.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX0309A]
gi|422716430|ref|ZP_16773136.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX0309B]
gi|227175936|gb|EEI56908.1| possible beta-galactosidase [Enterococcus faecalis HH22]
gi|315575268|gb|EFU87459.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX0309B]
gi|315581351|gb|EFU93542.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX0309A]
Length = 593
Score = 156 bits (395), Expect = 4e-35, Method: Compositional matrix adjust.
Identities = 114/351 (32%), Positives = 162/351 (46%), Gaps = 44/351 (12%)
Query: 35 VIGGKRRVLISGSIHYPRSTPEMWPDLIQKSKDGGLDVIETYVFWNLHEPVRNQYNFEGR 94
++ G+ +ISG+IHY R TP W D + K G + +ETY+ WN+HEP Y+FEG
Sbjct: 12 LLNGQPIKIISGAIHYFRMTPSQWEDSLYNLKALGANTVETYIPWNIHEPEEGVYDFEGM 71
Query: 95 YDLVKFVKLVAEAGLYAHLRIGPYVCAEWNFGGFPLWLHFIPGIQFRTDNEPFKAEMQRF 154
++ FV+L + L LR Y+CAEW FGG P WL G++ R+ + F +++ +
Sbjct: 72 KNIEAFVRLAEKLNLLVILRPSAYICAEWEFGGLPAWLLKEKGVRLRSTDPIFMTKVRNY 131
Query: 155 TAKIVDMMKQEKLYASQGGPIILSQIENEYGNIDSAYGAAGKSYIKWAAGMALSLDTGVP 214
V + K L +QGGP+I+ Q+ENEYG +YG K+Y++ + L VP
Sbjct: 132 FQ--VLLPKLAPLQITQGGPVIMMQVENEYG----SYGME-KAYLRQTKQIMEELGIEVP 184
Query: 215 WVMCQQSDAPDPIINTCNGFYCDQF--------------------TPNSNNKPKMWTENW 254
+ A + +++ D F T + P M E W
Sbjct: 185 --LFTSDGAWEEVLDAGTLIEEDVFVTGNFGSHSKENAAVLKKFMTRHGKKWPLMCMEYW 242
Query: 255 SGWFLSFGGAVPYRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNFDRTSGG--------P 306
GWF +G V R DLA V G N YM+HGGTNF +G P
Sbjct: 243 DGWFNRWGEPVIQREGTDLAKEVKDMLAVGSL--NLYMFHGGTNFGFYNGCSARGAKDLP 300
Query: 307 FISTSYDYDAPLDEYGLIRQPKWGHLKDLHKAIKLCEAALVATDPTYPSLG 357
+ TSYDYDA L E G + + + KAIK + P LG
Sbjct: 301 QV-TSYDYDALLTEAGEPTEKYYA----VQKAIKEVCPEVWQAQPRTKKLG 346
Score = 55.8 bits (133), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 57/232 (24%), Positives = 88/232 (37%), Gaps = 45/232 (19%)
Query: 502 LHVQSLGHALHAFINGKLVGSGYGSSSNAKVTVDFPIALAPGKNTFDLLSLTVGLQNYGA 561
L V LH +++G L + Y + ++ L G+ D L+L + ++N G
Sbjct: 403 LKVVEASDRLHIYVDGDLAATQYQETVGEEL-------LISGQTEKDTLALDILVENLGR 455
Query: 562 FYEKTGAGITGPVQLKGSGNGTNIDLSSQQWTYQTGLKGEELNFPSGSSTQWDSKSTLPK 621
G + P Q KG G D+ Q G + L F + D +
Sbjct: 456 V--NYGFKLNNPTQSKGIRGGVMQDIHFHQ-----GYQHYPLTFSQEQLAKIDYTAGKNP 508
Query: 622 LQPLVWYKTTFDAPAGSEPVAIDFTGMGKGEAWVNGQSIGRYWPTYVSQNGGCTDSCNYR 681
LQP +Y+ TF+ ++ ID G GKG VNG +GRYW
Sbjct: 509 LQP-SFYQVTFELEQLAD-TYIDCRGYGKGFVVVNGHHLGRYWEI--------------- 551
Query: 682 GAYSSNKCLKNCGKPSQSLYHVPRSWLKSSGNTLVLFEEIGGDPTKISFVTK 733
P SLY P+ +L+ N +V+FE G + + F +
Sbjct: 552 -------------GPIHSLY-CPKEFLQQGQNEVVIFETEGIEIEYLKFTNQ 589
>gi|257418414|ref|ZP_05595408.1| beta-galactosidase [Enterococcus faecalis T11]
gi|257160242|gb|EEU90202.1| beta-galactosidase [Enterococcus faecalis T11]
Length = 592
Score = 156 bits (395), Expect = 4e-35, Method: Compositional matrix adjust.
Identities = 114/351 (32%), Positives = 162/351 (46%), Gaps = 44/351 (12%)
Query: 35 VIGGKRRVLISGSIHYPRSTPEMWPDLIQKSKDGGLDVIETYVFWNLHEPVRNQYNFEGR 94
++ G+ +ISG+IHY R TP W D + K G + +ETY+ WN+HEP Y+FEG
Sbjct: 11 LLNGQPIKIISGAIHYFRMTPSQWEDSLYNLKALGANTVETYIPWNIHEPEEGVYDFEGM 70
Query: 95 YDLVKFVKLVAEAGLYAHLRIGPYVCAEWNFGGFPLWLHFIPGIQFRTDNEPFKAEMQRF 154
++ FV+L + L LR Y+CAEW FGG P WL G++ R+ + F +++ +
Sbjct: 71 KNIEAFVRLAEKLNLLVILRPSAYICAEWEFGGLPAWLLKEKGVRLRSTDPIFMTKVRNY 130
Query: 155 TAKIVDMMKQEKLYASQGGPIILSQIENEYGNIDSAYGAAGKSYIKWAAGMALSLDTGVP 214
V + K L +QGGP+I+ Q+ENEYG +YG K+Y++ + L VP
Sbjct: 131 FQ--VLLPKLAPLQITQGGPVIMMQVENEYG----SYGME-KAYLRQTKQIMEELGIEVP 183
Query: 215 WVMCQQSDAPDPIINTCNGFYCDQF--------------------TPNSNNKPKMWTENW 254
+ A + +++ D F T + P M E W
Sbjct: 184 --LFTSDGAWEEVLDAGTLIEEDVFVTGNFGSHSKENAAVLKKFMTRHGKKWPLMCMEYW 241
Query: 255 SGWFLSFGGAVPYRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNFDRTSGG--------P 306
GWF +G V R DLA V G N YM+HGGTNF +G P
Sbjct: 242 DGWFNRWGEPVIQREGTDLAKEVKDMLAVGSL--NLYMFHGGTNFGFYNGCSARGAKDLP 299
Query: 307 FISTSYDYDAPLDEYGLIRQPKWGHLKDLHKAIKLCEAALVATDPTYPSLG 357
+ TSYDYDA L E G + + + KAIK + P LG
Sbjct: 300 QV-TSYDYDALLTEAGEPTEKYYA----VQKAIKEVCPEVWQAQPRTKKLG 345
Score = 55.8 bits (133), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 57/232 (24%), Positives = 88/232 (37%), Gaps = 45/232 (19%)
Query: 502 LHVQSLGHALHAFINGKLVGSGYGSSSNAKVTVDFPIALAPGKNTFDLLSLTVGLQNYGA 561
L V LH +++G L + Y + ++ L G+ D L+L + ++N G
Sbjct: 402 LKVVEASDRLHIYVDGDLAATQYQETVGEEL-------LISGQTEKDTLALDILVENLGR 454
Query: 562 FYEKTGAGITGPVQLKGSGNGTNIDLSSQQWTYQTGLKGEELNFPSGSSTQWDSKSTLPK 621
G + P Q KG G D+ Q G + L F + D +
Sbjct: 455 V--NYGFKLNNPTQSKGIRGGVMQDIHFHQ-----GYQHYPLTFSQEQLAKIDYTAGKNP 507
Query: 622 LQPLVWYKTTFDAPAGSEPVAIDFTGMGKGEAWVNGQSIGRYWPTYVSQNGGCTDSCNYR 681
LQP +Y+ TF+ ++ ID G GKG VNG +GRYW
Sbjct: 508 LQP-SFYQVTFELEQLAD-TYIDCRGYGKGFVVVNGHHLGRYWEI--------------- 550
Query: 682 GAYSSNKCLKNCGKPSQSLYHVPRSWLKSSGNTLVLFEEIGGDPTKISFVTK 733
P SLY P+ +L+ N +V+FE G + + F +
Sbjct: 551 -------------GPIHSLY-CPKEFLQQGQNEVVIFETEGIEIEYLKFTNQ 588
>gi|29375402|ref|NP_814556.1| glycosyl hydrolase [Enterococcus faecalis V583]
gi|29342862|gb|AAO80626.1| glycosyl hydrolase, family 35 [Enterococcus faecalis V583]
Length = 592
Score = 156 bits (395), Expect = 4e-35, Method: Compositional matrix adjust.
Identities = 114/351 (32%), Positives = 162/351 (46%), Gaps = 44/351 (12%)
Query: 35 VIGGKRRVLISGSIHYPRSTPEMWPDLIQKSKDGGLDVIETYVFWNLHEPVRNQYNFEGR 94
++ G+ +ISG+IHY R TP W D + K G + +ETY+ WN+HEP Y+FEG
Sbjct: 11 LLNGQPIKIISGAIHYFRMTPSQWEDSLYNLKALGANTVETYIPWNIHEPEEGVYDFEGM 70
Query: 95 YDLVKFVKLVAEAGLYAHLRIGPYVCAEWNFGGFPLWLHFIPGIQFRTDNEPFKAEMQRF 154
++ FV+L + L LR Y+CAEW FGG P WL G++ R+ + F +++ +
Sbjct: 71 KNIEAFVRLAEKLNLLVILRPSAYICAEWEFGGLPAWLLKEKGVRLRSTDPIFMTKVRNY 130
Query: 155 TAKIVDMMKQEKLYASQGGPIILSQIENEYGNIDSAYGAAGKSYIKWAAGMALSLDTGVP 214
V + K L +QGGP+I+ Q+ENEYG +YG K+Y++ + L VP
Sbjct: 131 FQ--VLLPKLAPLQITQGGPVIMMQVENEYG----SYGME-KAYLRQTKQIMEELGIEVP 183
Query: 215 WVMCQQSDAPDPIINTCNGFYCDQF--------------------TPNSNNKPKMWTENW 254
+ A + +++ D F T + P M E W
Sbjct: 184 --LFTSDGAWEEVLDAGTLIEEDVFVTGNFGSHSKENAAVLKKFMTRHGKKWPLMCMEYW 241
Query: 255 SGWFLSFGGAVPYRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNFDRTSGG--------P 306
GWF +G V R DLA V G N YM+HGGTNF +G P
Sbjct: 242 DGWFNRWGEPVIQREGTDLAKEVKDMLAVGSL--NLYMFHGGTNFGFYNGCSARGAKDLP 299
Query: 307 FISTSYDYDAPLDEYGLIRQPKWGHLKDLHKAIKLCEAALVATDPTYPSLG 357
+ TSYDYDA L E G + + + KAIK + P LG
Sbjct: 300 QV-TSYDYDALLTEAGEPTEKYYA----VQKAIKEVCPEVWQAQPRTKKLG 345
Score = 55.8 bits (133), Expect = 9e-05, Method: Compositional matrix adjust.
Identities = 57/232 (24%), Positives = 88/232 (37%), Gaps = 45/232 (19%)
Query: 502 LHVQSLGHALHAFINGKLVGSGYGSSSNAKVTVDFPIALAPGKNTFDLLSLTVGLQNYGA 561
L V LH +++G L + Y + ++ L G+ D L+L + ++N G
Sbjct: 402 LKVVEASDRLHIYVDGDLAATQYQETVGEEL-------LISGQTEKDTLALDILVENLGR 454
Query: 562 FYEKTGAGITGPVQLKGSGNGTNIDLSSQQWTYQTGLKGEELNFPSGSSTQWDSKSTLPK 621
G + P Q KG G D+ Q G + L F + D +
Sbjct: 455 V--NYGFKLNNPTQSKGIRGGVMQDIHFHQ-----GYQHYPLTFSQEQLAKIDYTAGKNP 507
Query: 622 LQPLVWYKTTFDAPAGSEPVAIDFTGMGKGEAWVNGQSIGRYWPTYVSQNGGCTDSCNYR 681
LQP +Y+ TF+ ++ ID G GKG VNG +GRYW
Sbjct: 508 LQP-SFYQVTFELEQLAD-TYIDCRGYGKGFVVVNGHHLGRYWEI--------------- 550
Query: 682 GAYSSNKCLKNCGKPSQSLYHVPRSWLKSSGNTLVLFEEIGGDPTKISFVTK 733
P SLY P+ +L+ N +V+FE G + + F +
Sbjct: 551 -------------GPIHSLY-CPKEFLQQGQNEVVIFETEGIEIEYLKFTNQ 588
>gi|256761574|ref|ZP_05502154.1| beta-galactosidase [Enterococcus faecalis T3]
gi|422736227|ref|ZP_16792491.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX1341]
gi|256682825|gb|EEU22520.1| beta-galactosidase [Enterococcus faecalis T3]
gi|315166978|gb|EFU10995.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX1341]
Length = 593
Score = 156 bits (395), Expect = 4e-35, Method: Compositional matrix adjust.
Identities = 114/351 (32%), Positives = 162/351 (46%), Gaps = 44/351 (12%)
Query: 35 VIGGKRRVLISGSIHYPRSTPEMWPDLIQKSKDGGLDVIETYVFWNLHEPVRNQYNFEGR 94
++ G+ +ISG+IHY R TP W D + K G + +ETY+ WN+HEP Y+FEG
Sbjct: 12 LLNGQPIKIISGAIHYFRMTPSQWEDSLYNLKALGANTVETYIPWNIHEPEEGVYDFEGM 71
Query: 95 YDLVKFVKLVAEAGLYAHLRIGPYVCAEWNFGGFPLWLHFIPGIQFRTDNEPFKAEMQRF 154
++ FV+L + L LR Y+CAEW FGG P WL G++ R+ + F +++ +
Sbjct: 72 KNIEAFVRLAEKLNLLVILRPSAYICAEWEFGGLPAWLLKEKGVRLRSTDPIFMTKVRNY 131
Query: 155 TAKIVDMMKQEKLYASQGGPIILSQIENEYGNIDSAYGAAGKSYIKWAAGMALSLDTGVP 214
V + K L +QGGP+I+ Q+ENEYG +YG K+Y++ + L VP
Sbjct: 132 FQ--VLLPKLAPLQITQGGPVIMMQVENEYG----SYGME-KAYLRQTKQIMEELGIEVP 184
Query: 215 WVMCQQSDAPDPIINTCNGFYCDQF--------------------TPNSNNKPKMWTENW 254
+ A + +++ D F T + P M E W
Sbjct: 185 --LFTSDGAWEEVLDAGTLIEEDVFVTGNFGSHSKENAAVLKKFMTRHGKKWPLMCMEYW 242
Query: 255 SGWFLSFGGAVPYRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNFDRTSGG--------P 306
GWF +G V R DLA V G N YM+HGGTNF +G P
Sbjct: 243 DGWFNRWGEPVIQREGTDLAKEVKDMLAVGSL--NLYMFHGGTNFGFYNGCSARGAKDLP 300
Query: 307 FISTSYDYDAPLDEYGLIRQPKWGHLKDLHKAIKLCEAALVATDPTYPSLG 357
+ TSYDYDA L E G + + + KAIK + P LG
Sbjct: 301 QV-TSYDYDALLTEAGEPTEKYYA----VQKAIKEVCPEVWQAQPRTKKLG 346
Score = 55.8 bits (133), Expect = 9e-05, Method: Compositional matrix adjust.
Identities = 57/232 (24%), Positives = 88/232 (37%), Gaps = 45/232 (19%)
Query: 502 LHVQSLGHALHAFINGKLVGSGYGSSSNAKVTVDFPIALAPGKNTFDLLSLTVGLQNYGA 561
L V LH +++G L + Y + ++ L G+ D L+L + ++N G
Sbjct: 403 LKVVEASDRLHIYVDGDLAATQYQETVGEEL-------LISGQTEKDTLALDILVENLGR 455
Query: 562 FYEKTGAGITGPVQLKGSGNGTNIDLSSQQWTYQTGLKGEELNFPSGSSTQWDSKSTLPK 621
G + P Q KG G D+ Q G + L F + D +
Sbjct: 456 V--NYGFKLNNPTQSKGIRGGVMQDIHFHQ-----GYQHYPLTFSQEQLAKIDYTAGKNP 508
Query: 622 LQPLVWYKTTFDAPAGSEPVAIDFTGMGKGEAWVNGQSIGRYWPTYVSQNGGCTDSCNYR 681
LQP +Y+ TF+ ++ ID G GKG VNG +GRYW
Sbjct: 509 LQP-SFYQVTFELEQLAD-TYIDCRGYGKGFVVVNGHHLGRYWEI--------------- 551
Query: 682 GAYSSNKCLKNCGKPSQSLYHVPRSWLKSSGNTLVLFEEIGGDPTKISFVTK 733
P SLY P+ +L+ N +V+FE G + + F +
Sbjct: 552 -------------GPIHSLY-CPKEFLQQGQNEVVIFETEGIEIEYLKFTNQ 589
>gi|397699203|ref|YP_006536991.1| beta-galactosidase [Enterococcus faecalis D32]
gi|397335842|gb|AFO43514.1| beta-galactosidase [Enterococcus faecalis D32]
Length = 593
Score = 156 bits (395), Expect = 4e-35, Method: Compositional matrix adjust.
Identities = 114/351 (32%), Positives = 162/351 (46%), Gaps = 44/351 (12%)
Query: 35 VIGGKRRVLISGSIHYPRSTPEMWPDLIQKSKDGGLDVIETYVFWNLHEPVRNQYNFEGR 94
++ G+ +ISG+IHY R TP W D + K G + +ETY+ WN+HEP Y+FEG
Sbjct: 12 LLNGQPIKIISGAIHYFRMTPSQWEDSLYNLKALGANTVETYIPWNIHEPEEGVYDFEGM 71
Query: 95 YDLVKFVKLVAEAGLYAHLRIGPYVCAEWNFGGFPLWLHFIPGIQFRTDNEPFKAEMQRF 154
++ FV+L + L LR Y+CAEW FGG P WL G++ R+ + F +++ +
Sbjct: 72 KNIEAFVRLAEKLNLLVILRPSAYICAEWEFGGLPAWLLKEKGVRLRSTDPIFMTKVRNY 131
Query: 155 TAKIVDMMKQEKLYASQGGPIILSQIENEYGNIDSAYGAAGKSYIKWAAGMALSLDTGVP 214
V + K L +QGGP+I+ Q+ENEYG +YG K+Y++ + L VP
Sbjct: 132 FQ--VLLPKLAPLQITQGGPVIMMQVENEYG----SYGME-KAYLRQTKQIMEELGIEVP 184
Query: 215 WVMCQQSDAPDPIINTCNGFYCDQF--------------------TPNSNNKPKMWTENW 254
+ A + +++ D F T + P M E W
Sbjct: 185 --LFTSDGAWEEVLDAGTLIEEDVFVTGNFGSHSKENAAVLRKFMTRHGKKWPLMCMEYW 242
Query: 255 SGWFLSFGGAVPYRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNFDRTSGG--------P 306
GWF +G V R DLA V G N YM+HGGTNF +G P
Sbjct: 243 DGWFNRWGEPVIQREGTDLAKEVKDMLAVGSL--NLYMFHGGTNFGFYNGCSARGAKDLP 300
Query: 307 FISTSYDYDAPLDEYGLIRQPKWGHLKDLHKAIKLCEAALVATDPTYPSLG 357
+ TSYDYDA L E G + + + KAIK + P LG
Sbjct: 301 QV-TSYDYDALLTEAGEPTEKYYA----VQKAIKEVCPEVWQAQPRTKKLG 346
Score = 55.8 bits (133), Expect = 9e-05, Method: Compositional matrix adjust.
Identities = 57/232 (24%), Positives = 88/232 (37%), Gaps = 45/232 (19%)
Query: 502 LHVQSLGHALHAFINGKLVGSGYGSSSNAKVTVDFPIALAPGKNTFDLLSLTVGLQNYGA 561
L V LH +++G L + Y + ++ L G+ D L+L + ++N G
Sbjct: 403 LKVVEASDRLHIYVDGDLAATQYQETVGEEL-------LISGQTEKDTLALDILVENLGR 455
Query: 562 FYEKTGAGITGPVQLKGSGNGTNIDLSSQQWTYQTGLKGEELNFPSGSSTQWDSKSTLPK 621
G + P Q KG G D+ Q G + L F + D +
Sbjct: 456 V--NYGFKLNNPTQSKGIRGGVMQDIHFHQ-----GYQHYPLTFSQEQLAKIDYTAGKNP 508
Query: 622 LQPLVWYKTTFDAPAGSEPVAIDFTGMGKGEAWVNGQSIGRYWPTYVSQNGGCTDSCNYR 681
LQP +Y+ TF+ ++ ID G GKG VNG +GRYW
Sbjct: 509 LQP-SFYQVTFELEQLAD-TYIDCRGYGKGFVVVNGHHLGRYWEI--------------- 551
Query: 682 GAYSSNKCLKNCGKPSQSLYHVPRSWLKSSGNTLVLFEEIGGDPTKISFVTK 733
P SLY P+ +L+ N +V+FE G + + F +
Sbjct: 552 -------------GPIHSLY-CPKEFLQQGQNEVVIFETEGIEIEYLKFTNQ 589
>gi|395846556|ref|XP_003795969.1| PREDICTED: beta-galactosidase-1-like protein 2-like [Otolemur
garnettii]
Length = 633
Score = 156 bits (395), Expect = 4e-35, Method: Compositional matrix adjust.
Identities = 115/320 (35%), Positives = 154/320 (48%), Gaps = 37/320 (11%)
Query: 43 LISGSIHYPRSTPEMWPDLIQKSKDGGLDVIETYVFWNLHEPVRNQYNFEGRYDLVKFVK 102
+ GSIHY R E W D + K K GL+ + TYV WNLHEP R +++F G DL FV
Sbjct: 63 IFGGSIHYFRVPKEYWRDRLLKMKACGLNTLTTYVPWNLHEPQRGKFDFSGNLDLEAFVL 122
Query: 103 LVAEAGLYAHLRIGPYVCAEWNFGGFPLWLHFIPGIQFRTDNEPFKAEMQRFTAKIVDMM 162
L AE GL+ LR GPY+C+E + GG P WL PG++ RT + F + + + M
Sbjct: 123 LAAEIGLWVILRPGPYICSEIDLGGLPSWLLQDPGMRLRTTYKGFTEAVDLYFDHL--MS 180
Query: 163 KQEKLYASQGGPIILSQIENEYGNI--DSAYGAAGKSYIKWAAGMALSLDTGVPWVMCQQ 220
+ L GGPII Q+ENEYG+ D AY Y+K A D G+ ++
Sbjct: 181 RVVPLQYKHGGPIIAVQVENEYGSYYKDPAY----MPYVKKALE-----DRGIVELLF-T 230
Query: 221 SDAPD--------PIINTCN-------GFYCDQFTPNSNNKPKMWTENWSGWFLSFGGAV 265
SD D ++ T N +PKM TE W+GWF S+GG
Sbjct: 231 SDNKDGLRKGIIHGVLATINLQSPQELQLLTTLLVSIQGVQPKMVTEYWTGWFDSWGGPH 290
Query: 266 PYRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNFDRTSGGPFIS------TSYDYDAPLD 319
++ V+ G + N YM+HGGTNF +G TSYDYDA L
Sbjct: 291 NILDSSEVLKTVSAIVDTGSSI-NLYMFHGGTNFGFINGAMHFQDYRSDITSYDYDAVLT 349
Query: 320 EYGLIRQPKWGHLKDLHKAI 339
E G PK+ L+D ++
Sbjct: 350 EAG-DYTPKYIKLRDFFDSL 368
>gi|365860016|ref|ZP_09399844.1| putative beta-galactosidase [Streptomyces sp. W007]
gi|364010544|gb|EHM31456.1| putative beta-galactosidase [Streptomyces sp. W007]
Length = 645
Score = 156 bits (395), Expect = 4e-35, Method: Compositional matrix adjust.
Identities = 110/315 (34%), Positives = 155/315 (49%), Gaps = 43/315 (13%)
Query: 36 IGGKRRVLISGSIHYPRSTPEMWPDLIQKSKDGGLDVIETYVFWNLHEPVRNQYNFEGRY 95
+ GK L+SG++HY R W + GL+ +ETYV WNLHEP + G
Sbjct: 13 LDGKPVRLLSGALHYFRVHEAQWEHRLAMLAAMGLNCVETYVPWNLHEPREGEVRDVG-- 70
Query: 96 DLVKFVKLVAEAGLYAHLRIGPYVCAEWNFGGFPLWLHFIPGIQFRTDNEPFKAEMQRFT 155
L +F+ V AGL+A +R GPY+CAEW GG P+W+ G + RT + ++A ++R+
Sbjct: 71 ALGRFLDAVERAGLWAIVRPGPYICAEWENGGLPVWVTGRFGRRVRTRDAAYRAVVERWF 130
Query: 156 AKIVDMMKQEKLYASQGGPIILSQIENEYGNIDSAYGAAGKSYIKWAAGMALSLDTGVPW 215
+++ + Q ++ S+GGP+IL Q ENEYG+ S Y++W AG+ VP
Sbjct: 131 RELLPQVVQRQV--SRGGPVILVQAENEYGSYGSD-----AVYLEWLAGLLRQCGVTVPL 183
Query: 216 VMCQQSDAPDP----------IINTCN-------GFYCDQFTPNSNNKPKMWTENWSGWF 258
SD P+ ++ T N GF + + P M E W GWF
Sbjct: 184 FT---SDGPEDHMLTGGSVPGLLATANFGSGAREGF--EVLLRHQPRGPLMCMEFWCGWF 238
Query: 259 LSFGGAVPYRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNFDRTSG----GPF------- 307
+G R E A A+ + G + N YM HGGTNF +G GP
Sbjct: 239 DHWGAEPVRRDPEQAAGALREVLECGASV-NIYMAHGGTNFGGWAGANRSGPHQDESFQP 297
Query: 308 ISTSYDYDAPLDEYG 322
TSYDYDAP+DEYG
Sbjct: 298 TVTSYDYDAPVDEYG 312
>gi|422881390|ref|ZP_16927846.1| beta-galactosidase [Streptococcus sanguinis SK355]
gi|332364328|gb|EGJ42102.1| beta-galactosidase [Streptococcus sanguinis SK355]
Length = 592
Score = 156 bits (395), Expect = 5e-35, Method: Compositional matrix adjust.
Identities = 107/315 (33%), Positives = 152/315 (48%), Gaps = 40/315 (12%)
Query: 36 IGGKRRVLISGSIHYPRSTPEMWPDLIQKSKDGGLDVIETYVFWNLHEPVRNQYNFEGRY 95
+ GK ++SG+I Y R P+ W D + K G + +ETY+ W LHEP Q+ EG
Sbjct: 12 LDGKPFKILSGAIQYFRLHPDQWRDTLYNLKALGFNTVETYIPWALHEPQEGQFQAEGML 71
Query: 96 DLVKFVKLVAEAGLYAHLRIGPYVCAEWNFGGFPLWLHFIPGIQFRTDNEPFKAEMQRFT 155
D + KLV E GLY +R PY+CAE++FGG P WL P ++ R ++ F ++ F
Sbjct: 72 DFEAYFKLVEEMGLYLIVRPTPYICAEFDFGGLPAWLLRYPSMRLRVNHPLFLEKVSHFY 131
Query: 156 AKIVDMMKQEKLYASQGGPIILSQIENEYGNIDSAYGAAGKSYIKWAAGMALSLDTGVP- 214
+ + + + Q GPI++ Q+ENEYG+ A K+Y++ A M VP
Sbjct: 132 DWLFPKLLPYQ--SDQDGPILMMQVENEYGSY-----AEDKAYMRSIAQMMKVRGVTVPL 184
Query: 215 ------WVMCQQSDA--PDPIINTCNGFYCDQFTPNSNNK-----------PKMWTENWS 255
W+ +S D I T N + Q N++N P M TE W
Sbjct: 185 FTSDGTWIEALESGTLIEDDIFVTGN--FGSQPKENTDNLRAFMERYGKKWPLMCTEFWD 242
Query: 256 GWFLSFGGAVPYRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNF--------DRTSGGPF 307
GWF + + R EDLA V Q G N ++ GGTNF +T P
Sbjct: 243 GWFSRWSEEIVRREAEDLAQDVKEMLQLGSM--NLFLLRGGTNFGFISGCSARKTKDLPQ 300
Query: 308 ISTSYDYDAPLDEYG 322
I TSYD+DAP+ E+G
Sbjct: 301 I-TSYDFDAPITEWG 314
>gi|257083732|ref|ZP_05578093.1| beta-galactosidase [Enterococcus faecalis Fly1]
gi|256991762|gb|EEU79064.1| beta-galactosidase [Enterococcus faecalis Fly1]
Length = 593
Score = 156 bits (394), Expect = 5e-35, Method: Compositional matrix adjust.
Identities = 114/351 (32%), Positives = 162/351 (46%), Gaps = 44/351 (12%)
Query: 35 VIGGKRRVLISGSIHYPRSTPEMWPDLIQKSKDGGLDVIETYVFWNLHEPVRNQYNFEGR 94
++ G+ +ISG+IHY R TP W D + K G + +ETY+ WN+HEP Y+FEG
Sbjct: 12 LLNGQPIKIISGAIHYFRMTPRQWEDSLYNLKALGANTVETYIPWNIHEPEEGVYDFEGM 71
Query: 95 YDLVKFVKLVAEAGLYAHLRIGPYVCAEWNFGGFPLWLHFIPGIQFRTDNEPFKAEMQRF 154
++ FV+L + L LR Y+CAEW FGG P WL G++ R+ + F +++ +
Sbjct: 72 KNIEAFVRLAEKLNLLVILRPSAYICAEWEFGGLPAWLLKEKGVRLRSTDPIFMTKVRNY 131
Query: 155 TAKIVDMMKQEKLYASQGGPIILSQIENEYGNIDSAYGAAGKSYIKWAAGMALSLDTGVP 214
V + K L +QGGP+I+ Q+ENEYG +YG K+Y++ + L VP
Sbjct: 132 FQ--VLLPKLSPLQITQGGPVIMMQVENEYG----SYGME-KAYLQQTKQIMEELGIEVP 184
Query: 215 WVMCQQSDAPDPIINTCNGFYCDQF--------------------TPNSNNKPKMWTENW 254
+ A + +++ D F T + P M E W
Sbjct: 185 --LFTSDGAWEEVLDAGTLIEEDVFVTGNFGSHSKENAAVLRKFMTRHGKKWPLMCMEYW 242
Query: 255 SGWFLSFGGAVPYRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNFDRTSGG--------P 306
GWF +G V R DLA V G N YM+HGGTNF +G P
Sbjct: 243 DGWFNRWGEPVIQREGTDLAKEVKDMLAVGSL--NLYMFHGGTNFGFYNGCSARGAKDLP 300
Query: 307 FISTSYDYDAPLDEYGLIRQPKWGHLKDLHKAIKLCEAALVATDPTYPSLG 357
+ TSYDYDA L E G + + + KAIK + P LG
Sbjct: 301 QV-TSYDYDALLTEAGEPTEKYYA----VQKAIKEVCPEVWQAQPRTKKLG 346
Score = 55.5 bits (132), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 57/232 (24%), Positives = 88/232 (37%), Gaps = 45/232 (19%)
Query: 502 LHVQSLGHALHAFINGKLVGSGYGSSSNAKVTVDFPIALAPGKNTFDLLSLTVGLQNYGA 561
L V LH +++G L + Y + ++ L G+ D L+L + ++N G
Sbjct: 403 LKVVEASDRLHIYVDGDLAATQYQETVGEEL-------LISGQTEKDTLALDILVENLGR 455
Query: 562 FYEKTGAGITGPVQLKGSGNGTNIDLSSQQWTYQTGLKGEELNFPSGSSTQWDSKSTLPK 621
G + P Q KG G D+ Q G + L F + D +
Sbjct: 456 V--NYGFKLNNPTQSKGIRGGVMQDIHFHQ-----GYQHYPLTFSQEQLAKIDYTAGKNP 508
Query: 622 LQPLVWYKTTFDAPAGSEPVAIDFTGMGKGEAWVNGQSIGRYWPTYVSQNGGCTDSCNYR 681
LQP +Y+ TF+ ++ ID G GKG VNG +GRYW
Sbjct: 509 LQP-SFYQVTFELEQLAD-TYIDCRGYGKGFVVVNGHHLGRYWEI--------------- 551
Query: 682 GAYSSNKCLKNCGKPSQSLYHVPRSWLKSSGNTLVLFEEIGGDPTKISFVTK 733
P SLY P+ +L+ N +V+FE G + + F +
Sbjct: 552 -------------GPIHSLY-CPKEFLQQGQNEVVIFETEGINIEYLKFTNQ 589
>gi|419447987|ref|ZP_13987985.1| beta-galactosidase family protein [Streptococcus pneumoniae
4075-00]
gi|379624799|gb|EHZ89427.1| beta-galactosidase family protein [Streptococcus pneumoniae
4075-00]
Length = 595
Score = 156 bits (394), Expect = 5e-35, Method: Compositional matrix adjust.
Identities = 117/345 (33%), Positives = 168/345 (48%), Gaps = 40/345 (11%)
Query: 36 IGGKRRVLISGSIHYPRSTPEMWPDLIQKSKDGGLDVIETYVFWNLHEPVRNQYNFEGRY 95
+ GK ++SG+IHY R PE W + K G + +ETYV WNLHEP ++NFEG
Sbjct: 12 LDGKPFKILSGAIHYFRIPPEDWYHSLYNLKALGFNTVETYVAWNLHEPSEGEFNFEGAL 71
Query: 96 DLVKFVKLVAEAGLYAHLRIGPYVCAEWNFGGFPLWLHFIPGIQFRTDNEPFKAEMQRFT 155
DL KF++ + GLYA +R P++CAEW FGG P WL ++ R+ + + + R+
Sbjct: 72 DLEKFLQTAQDLGLYAIVRPSPFICAEWEFGGLPAWL-LTKDMRIRSSDPAYIEAVGRYY 130
Query: 156 AKIVDMMKQEKLYASQGGPIILSQIENEYGNI--DSAYGAAGKSYIKWAAGMALSLDTGV 213
+++ + L GG I++ Q+ENEYG+ D AY A + ++ +
Sbjct: 131 DQLLARLVPHLL--DNGGNILMMQVENEYGSYGEDKAYLRAIRQLMEERGVTCPLFTSDG 188
Query: 214 PWVMCQQSDA--PDPIINTCN-------GFYCDQ--FTPNSNNKPKMWTENWSGWFLSFG 262
PW ++ D + T N F Q F + P M E W GWF +
Sbjct: 189 PWRATLKAGTLIEDDLFVTGNFGSKAPYNFSQMQEFFDEHGKKWPLMCMEFWDGWFNRWK 248
Query: 263 GAVPYRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNFDRTSGG--------PFISTSYDY 314
+ R ++LA AV ++G N YM+HGGTNF +G P + TSYDY
Sbjct: 249 EPIITRDPKELAEAVREVLEQGSI--NLYMFHGGTNFGFMNGCSARGTLDLPQV-TSYDY 305
Query: 315 DAPLDEYGLIRQPKWGHLKDLHKAIKLCEAALVATD-PTYPSLGP 358
DA LDE G P +L A+K ++AT P YP L P
Sbjct: 306 DALLDEEG---NPTAKYL-----AVK----KMMATHFPEYPQLEP 338
>gi|323353539|ref|ZP_08088072.1| beta-galactosidase [Streptococcus sanguinis VMC66]
gi|322121485|gb|EFX93248.1| beta-galactosidase [Streptococcus sanguinis VMC66]
Length = 592
Score = 156 bits (394), Expect = 5e-35, Method: Compositional matrix adjust.
Identities = 107/315 (33%), Positives = 153/315 (48%), Gaps = 40/315 (12%)
Query: 36 IGGKRRVLISGSIHYPRSTPEMWPDLIQKSKDGGLDVIETYVFWNLHEPVRNQYNFEGRY 95
+ GK ++SG+I Y R P+ W D + K G + +ETY+ W LHEP Q+ EG
Sbjct: 12 LDGKPFKILSGAIQYFRLHPDQWRDTLYNLKALGFNTVETYIPWALHEPQEGQFQAEGML 71
Query: 96 DLVKFVKLVAEAGLYAHLRIGPYVCAEWNFGGFPLWLHFIPGIQFRTDNEPFKAEMQRFT 155
D + KLV E GLY +R PY+CAE++FGG P WL P ++ R ++ F ++ F
Sbjct: 72 DFEAYFKLVEEMGLYLIVRPTPYICAEFDFGGLPAWLLRYPSMRLRVNHPLFLEKVSHFY 131
Query: 156 AKIVDMMKQEKLYASQGGPIILSQIENEYGNIDSAYGAAGKSYIKWAAGMALSLDTGVP- 214
+ + + + QGG I++ Q+ENEYG+ A K+Y++ A M VP
Sbjct: 132 DWLFPKLLPYQ--SDQGGTILMMQVENEYGSY-----AEDKAYMRSIAQMMKVRGVTVPL 184
Query: 215 ------WVMCQQSDA--PDPIINTCNGFYCDQFTPNSNNK-----------PKMWTENWS 255
W+ +S D I T N + Q N++N P M TE W
Sbjct: 185 FTSDGTWIEALESGTLIEDDIFVTGN--FGSQPKENTDNLRAFMERYGKKWPLMCTEFWD 242
Query: 256 GWFLSFGGAVPYRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNF--------DRTSGGPF 307
GWF + + R EDLA V + Q G N ++ GGTNF +T P
Sbjct: 243 GWFSRWSEEIVRREAEDLAQDVKKMLQLGSM--NLFLLRGGTNFGFISGCSARKTKDLPQ 300
Query: 308 ISTSYDYDAPLDEYG 322
I TSYD+DAP+ E+G
Sbjct: 301 I-TSYDFDAPITEWG 314
>gi|256959941|ref|ZP_05564112.1| beta-galactosidase [Enterococcus faecalis Merz96]
gi|293384307|ref|ZP_06630193.1| beta-galactosidase [Enterococcus faecalis R712]
gi|293388457|ref|ZP_06632963.1| beta-galactosidase [Enterococcus faecalis S613]
gi|312907112|ref|ZP_07766105.1| glycosyl hydrolase family 35 [Enterococcus faecalis DAPTO 512]
gi|312979309|ref|ZP_07791007.1| glycosyl hydrolase family 35 [Enterococcus faecalis DAPTO 516]
gi|256950437|gb|EEU67069.1| beta-galactosidase [Enterococcus faecalis Merz96]
gi|291078380|gb|EFE15744.1| beta-galactosidase [Enterococcus faecalis R712]
gi|291082147|gb|EFE19110.1| beta-galactosidase [Enterococcus faecalis S613]
gi|310626889|gb|EFQ10172.1| glycosyl hydrolase family 35 [Enterococcus faecalis DAPTO 512]
gi|311287903|gb|EFQ66459.1| glycosyl hydrolase family 35 [Enterococcus faecalis DAPTO 516]
Length = 593
Score = 156 bits (394), Expect = 5e-35, Method: Compositional matrix adjust.
Identities = 114/351 (32%), Positives = 162/351 (46%), Gaps = 44/351 (12%)
Query: 35 VIGGKRRVLISGSIHYPRSTPEMWPDLIQKSKDGGLDVIETYVFWNLHEPVRNQYNFEGR 94
++ G+ +ISG+IHY R TP W D + K G + +ETY+ WN+HEP Y+FEG
Sbjct: 12 LLNGQPIKIISGAIHYFRMTPRQWEDSLYNLKALGANTVETYIPWNIHEPEEGVYDFEGM 71
Query: 95 YDLVKFVKLVAEAGLYAHLRIGPYVCAEWNFGGFPLWLHFIPGIQFRTDNEPFKAEMQRF 154
++ FV+L + L LR Y+CAEW FGG P WL G++ R+ + F +++ +
Sbjct: 72 KNIEAFVRLAEKLNLLVILRPSAYICAEWEFGGLPAWLLKEKGVRLRSTDPIFMTKVRNY 131
Query: 155 TAKIVDMMKQEKLYASQGGPIILSQIENEYGNIDSAYGAAGKSYIKWAAGMALSLDTGVP 214
V + K L +QGGP+I+ Q+ENEYG +YG K+Y++ + L VP
Sbjct: 132 FQ--VLLPKLAPLQITQGGPVIMMQVENEYG----SYGME-KAYLQQTKQIMEELGIEVP 184
Query: 215 WVMCQQSDAPDPIINTCNGFYCDQF--------------------TPNSNNKPKMWTENW 254
+ A + +++ D F T + P M E W
Sbjct: 185 --LFTSDGAWEEVLDAGTLIEEDVFVTGNFGSHSKENAAVLRKFMTRHGKKWPLMCMEYW 242
Query: 255 SGWFLSFGGAVPYRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNFDRTSGG--------P 306
GWF +G V R DLA V G N YM+HGGTNF +G P
Sbjct: 243 DGWFNRWGEPVIQREGTDLAKEVKDMLAVGSL--NLYMFHGGTNFGFYNGCSARGAKDLP 300
Query: 307 FISTSYDYDAPLDEYGLIRQPKWGHLKDLHKAIKLCEAALVATDPTYPSLG 357
+ TSYDYDA L E G + + + KAIK + P LG
Sbjct: 301 QV-TSYDYDALLTEAGEPTEKYYA----VQKAIKEVCPEVWQAQPRTKKLG 346
Score = 55.5 bits (132), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 57/232 (24%), Positives = 88/232 (37%), Gaps = 45/232 (19%)
Query: 502 LHVQSLGHALHAFINGKLVGSGYGSSSNAKVTVDFPIALAPGKNTFDLLSLTVGLQNYGA 561
L V LH +++G L + Y + ++ L G+ D L+L + ++N G
Sbjct: 403 LKVVEASDRLHIYVDGDLAATQYQETVGEEL-------LISGQTEKDTLALDILVENLGR 455
Query: 562 FYEKTGAGITGPVQLKGSGNGTNIDLSSQQWTYQTGLKGEELNFPSGSSTQWDSKSTLPK 621
G + P Q KG G D+ Q G + L F + D +
Sbjct: 456 V--NYGFKLNNPTQSKGIRGGVMQDIHFHQ-----GYQHYPLTFSQEQLAKIDYTAGKNP 508
Query: 622 LQPLVWYKTTFDAPAGSEPVAIDFTGMGKGEAWVNGQSIGRYWPTYVSQNGGCTDSCNYR 681
LQP +Y+ TF+ ++ ID G GKG VNG +GRYW
Sbjct: 509 LQP-SFYQVTFELEQLAD-TYIDCRGYGKGFVVVNGHHLGRYWEI--------------- 551
Query: 682 GAYSSNKCLKNCGKPSQSLYHVPRSWLKSSGNTLVLFEEIGGDPTKISFVTK 733
P SLY P+ +L+ N +V+FE G + + F +
Sbjct: 552 -------------GPIHSLY-CPKEFLQQGQNEVVIFETEGINIEYLKFTNQ 589
>gi|422822094|ref|ZP_16870287.1| beta-galactosidase [Streptococcus sanguinis SK353]
gi|324990399|gb|EGC22337.1| beta-galactosidase [Streptococcus sanguinis SK353]
Length = 592
Score = 156 bits (394), Expect = 5e-35, Method: Compositional matrix adjust.
Identities = 107/315 (33%), Positives = 152/315 (48%), Gaps = 40/315 (12%)
Query: 36 IGGKRRVLISGSIHYPRSTPEMWPDLIQKSKDGGLDVIETYVFWNLHEPVRNQYNFEGRY 95
+ GK ++SG+I Y R P+ W D + K G + +ETY+ W LHEP Q+ E
Sbjct: 12 LDGKPFKILSGAIQYFRLHPDQWRDTLYNLKALGFNTVETYIPWALHEPQEGQFQAEEML 71
Query: 96 DLVKFVKLVAEAGLYAHLRIGPYVCAEWNFGGFPLWLHFIPGIQFRTDNEPFKAEMQRFT 155
D + KLV E GLY +R PY+CAE++FGG P WL P ++ R ++ F ++ F
Sbjct: 72 DFEAYFKLVEEMGLYLIVRPTPYICAEFDFGGLPAWLLRYPSMRLRVNHPLFLEKVSHFY 131
Query: 156 AKIVDMMKQEKLYASQGGPIILSQIENEYGNIDSAYGAAGKSYIKWAAGMALSLDTGVP- 214
+ + + + QGGPI++ Q+ENEYG+ A K+Y++ A M VP
Sbjct: 132 DWLFPKLLPYQ--SDQGGPILMMQVENEYGSY-----AEDKAYMRSIAQMMKVRGVTVPL 184
Query: 215 ------WVMCQQSDA--PDPIINTCNGFYCDQFTPNSNNK-----------PKMWTENWS 255
W+ +S D I T N + Q N++N P M TE W
Sbjct: 185 FTSDGTWIEALESGTLIEDDIFVTGN--FGSQPKENTDNLRAFMERYGKKWPLMCTEFWD 242
Query: 256 GWFLSFGGAVPYRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNF--------DRTSGGPF 307
GWF + + R EDLA V Q G N ++ GGTNF +T P
Sbjct: 243 GWFSRWSEEIVRREAEDLAQDVKEMLQLGSM--NLFLLRGGTNFGFISGCSARKTKDLPQ 300
Query: 308 ISTSYDYDAPLDEYG 322
I TSYD+DAP+ E+G
Sbjct: 301 I-TSYDFDAPITEWG 314
>gi|170782982|ref|YP_001711316.1| beta-galactosidase [Clavibacter michiganensis subsp. sepedonicus]
gi|169157552|emb|CAQ02748.1| beta-galactosidase [Clavibacter michiganensis subsp. sepedonicus]
Length = 615
Score = 156 bits (394), Expect = 5e-35, Method: Compositional matrix adjust.
Identities = 105/305 (34%), Positives = 150/305 (49%), Gaps = 26/305 (8%)
Query: 36 IGGKRRVLISGSIHYPRSTPEMWPDLIQKSKDGGLDVIETYVFWNLHEPVRNQYNFEGRY 95
+ G+ +I+G++HY R P+ W D I+K++ GLD IETYV WN H P R ++
Sbjct: 37 LDGRPHRVIAGALHYFRVHPDQWADRIRKARLMGLDTIETYVAWNAHSPERGTFDTSAGL 96
Query: 96 DLVKFVKLVAEAGLYAHLRIGPYVCAEWNFGGFPLWLHFIPGIQFRTDNEPFKAEMQRFT 155
DL +F+ LV G++A +R GPY+CAEW+ GG P WL P + R + A + F
Sbjct: 97 DLGRFLDLVHAEGMHAIVRPGPYICAEWDGGGLPGWLFGDPAVGVRRSEPLYLAAVDEFL 156
Query: 156 AKIVDMMKQEKLYASQGGPIILSQIENEYGNIDSAYGAAGKSYIKWAAGMALSLDTGVPW 215
++ +++ ++ GGP+IL QIENEYG AYG + Y++ + VP
Sbjct: 157 RRVYEIVAPRQI--DMGGPVILVQIENEYG----AYGDDAE-YLRHLVDLTRESGIIVPL 209
Query: 216 VMCQQ-------SDAPDPIINTCN-----GFYCDQFTPNSNNKPKMWTENWSGWFLSFGG 263
Q + D + T + + + P M +E W GWF + G
Sbjct: 210 TTVDQPTDEMLSRGSLDELHRTGSFGSRAAERLETLRRHQRTGPLMCSEFWDGWFDHW-G 268
Query: 264 AVPYRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNFDRTSG----GPFIS--TSYDYDAP 317
+ A A G N YM+HGGTNF T+G G + S TSYDYDAP
Sbjct: 269 EHHHTTSAADAAAELDALLAAGASVNIYMFHGGTNFGFTNGANHKGTYQSHVTSYDYDAP 328
Query: 318 LDEYG 322
LDE G
Sbjct: 329 LDETG 333
>gi|157106609|ref|XP_001649402.1| beta-galactosidase [Aedes aegypti]
gi|108879821|gb|EAT44046.1| AAEL004575-PA [Aedes aegypti]
Length = 648
Score = 156 bits (394), Expect = 5e-35, Method: Compositional matrix adjust.
Identities = 111/346 (32%), Positives = 171/346 (49%), Gaps = 43/346 (12%)
Query: 9 LVLCW---GFVVLATTSFGANVTYDHRAVVIGGKRRVLISGSIHYPRSTPEMWPDLIQKS 65
+VLC+ G +L + Y++ ++ G I+GS HY R+ P+ W +++
Sbjct: 15 VVLCYHVNGQRLLDNRQRTFTIDYENNTFLLDGAPFQYIAGSFHYFRALPQAWGPILKSM 74
Query: 66 KDGGLDVIETYVFWNLHEPVRNQYNFEGRYDLVKFVKLVAEAGLYAHLRIGPYVCAEWNF 125
+ GL+ + TYV W+LH P + YN++G D+ +FV+L L LR GPY+CAE +
Sbjct: 75 RAAGLNAVTTYVEWSLHNPKKGVYNWDGMADIERFVQLAQNEDLLVILRPGPYICAERDM 134
Query: 126 GGFPLW-LHFIPGIQFRTDNEPFKAEMQRFTAKIVDMMKQEKLYASQGGPIILSQIENEY 184
GGFP W L+ PGIQ RT + + E++ + A++ + E + GGPII+ Q+ENEY
Sbjct: 135 GGFPYWLLNKYPGIQLRTADVAYLREVRTWYAELFSRL--EPYFYGNGGPIIMVQVENEY 192
Query: 185 GNIDSAYGAAGKSYIKWAA-----------------GMALSLDTGVPWVMCQQSDAPDPI 227
G ++ A Y+KW G L+ G+ V+ P
Sbjct: 193 G----SFFACDYKYMKWLRDETERYVRGKAVLFTNNGPGLTQCGGIDGVLSTLDFGPGTA 248
Query: 228 INTCNGFYCD--QFTPNSNNKPKMWTENWSGWFLSFGGAVPYR-PVEDLAFAVARFFQRG 284
+ +G++ D + P P + E + GW + R P+E + ++ R+
Sbjct: 249 LE-IDGYWKDLRKLQPKG---PLVNAEYYPGWLTHWQEQQMARSPIEPVVTSL-RYMLSS 303
Query: 285 GTFQNYYMYHGGTNFDRTSG------GPFIS--TSYDYDAPLDEYG 322
N YM++GGTNF T+G G FI TSYDYDAPLDE G
Sbjct: 304 KVNVNIYMFYGGTNFGFTAGANEQGPGRFIPDITSYDYDAPLDESG 349
>gi|357050580|ref|ZP_09111778.1| hypothetical protein HMPREF9478_01761 [Enterococcus saccharolyticus
30_1]
gi|355381233|gb|EHG28360.1| hypothetical protein HMPREF9478_01761 [Enterococcus saccharolyticus
30_1]
Length = 593
Score = 156 bits (394), Expect = 5e-35, Method: Compositional matrix adjust.
Identities = 113/302 (37%), Positives = 152/302 (50%), Gaps = 29/302 (9%)
Query: 43 LISGSIHYPRSTPEMWPDLIQKSKDGGLDVIETYVFWNLHEPVRNQYNFEGRYDLVKFVK 102
L+SG+IHY R P+ W + K G + +ETYV WNLHEP + + FEG DL F+
Sbjct: 19 LLSGAIHYFRVHPDDWRHSLYNLKALGFNTVETYVPWNLHEPHKGLFQFEGILDLEHFLS 78
Query: 103 LVAEAGLYAHLRIGPYVCAEWNFGGFPLWLHFIPGIQFRTDNEPFKAEMQRFTAKIVDMM 162
L E GLY LR PY+CAEW FGG P WL G + R + + A + + ++ +
Sbjct: 79 LAQELGLYVILRPSPYICAEWEFGGLPAWLLKESG-RLRACDPSYLAHVAEYYDVLLPKI 137
Query: 163 KQEKLYASQGGPIILSQIENEYGNI--DSAYGAAGKSY-IKWAAGMALSLDTGVPWVMCQ 219
+L S GG I++ Q+ENEYG+ + AY A K I M L G PW
Sbjct: 138 IPYQL--SHGGNILMIQVENEYGSYGEEKAYLRAIKEMLINRGIDMPLFTSDG-PWQAAL 194
Query: 220 QSDA--PDPIINTCN-------GFYCDQFTPNSNNK--PKMWTENWSGWFLSFGGAVPYR 268
++ + D ++ T N F Q + +NK P M E W GWF + + R
Sbjct: 195 RAGSLIEDDVLVTGNFGSRAKENFAAMQDFFDQHNKKWPLMCMEFWDGWFNRWNEPIIRR 254
Query: 269 PVEDLAFAVARFFQRGGTFQNYYMYHGGTNFDRTSGG--------PFISTSYDYDAPLDE 320
+DLA +V + G N YM+HGGTNF +G P + TSYDYDAPLDE
Sbjct: 255 DPDDLAESVKEALEIGSV--NLYMFHGGTNFGFMNGCSARGAVDLPQV-TSYDYDAPLDE 311
Query: 321 YG 322
G
Sbjct: 312 QG 313
>gi|373953405|ref|ZP_09613365.1| glycoside hydrolase family 35 [Mucilaginibacter paludis DSM 18603]
gi|373890005|gb|EHQ25902.1| glycoside hydrolase family 35 [Mucilaginibacter paludis DSM 18603]
Length = 608
Score = 156 bits (394), Expect = 5e-35, Method: Compositional matrix adjust.
Identities = 109/339 (32%), Positives = 154/339 (45%), Gaps = 35/339 (10%)
Query: 6 ILLLVLCWGFVVLATTSFGANVTYDHRAVVIGGKRRVLISGSIHYPRSTPEMWPDLIQKS 65
I LL+L FV A A ++ GK +ISG +HYPR E W ++ +
Sbjct: 8 IALLMLL--FVFPAVGQVNHTFALGDEAFLLDGKPFQMISGEMHYPRVPRESWRARMKMA 65
Query: 66 KDGGLDVIETYVFWNLHEPVRNQYNFEGRYDLVKFVKLVAEAGLYAHLRIGPYVCAEWNF 125
K GL+ I TYVFWNLHEP + +++F G D+ +FV++ + GL+ LR PYVCAEW F
Sbjct: 66 KAMGLNTIGTYVFWNLHEPQKGKFDFTGNNDVAEFVRIAKQEGLWVILRPSPYVCAEWEF 125
Query: 126 GGFPLWLHFIPGIQFRTDNEPFKAEMQRFTAKIVDMMKQEKLYASQGGPIILSQIENEYG 185
GG+P WL G+ R+ + E + + ++ + L + GG I++ QIENEYG
Sbjct: 126 GGYPYWLQNEKGLVVRSKEAQYLKEYESYIKEVGKQL--APLQINHGGNILMVQIENEYG 183
Query: 186 NI--DSAYGAAGKSYIKWAA--GMALSLDTGVPWV---------MCQQSDAPDPIINTCN 232
+ D Y A + K A G+ + D V D PD +
Sbjct: 184 SYGSDKDYLAINQKLFKEAGFDGLLYTCDPAADLVNGHLPGLLPAVNGIDNPDKVKQII- 242
Query: 233 GFYCDQFTPNSNNKPKMWTENW-SGWFLSFGGAVPYRPVEDLAFAVARFFQRGGTFQNYY 291
+ N N K + W WF +G P + + G + N Y
Sbjct: 243 -------SQNHNGKGPYYIAEWYPAWFDWWGTKHHTVPAAEYTGRLDSVLAAGISI-NMY 294
Query: 292 MYHGGTNFDRTSGGPFIST--------SYDYDAPLDEYG 322
M+HGGT +G + T SYDYDAPLDE G
Sbjct: 295 MFHGGTTRGFMNGANYKDTSPYEPQVSSYDYDAPLDEAG 333
Score = 54.7 bits (130), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 60/240 (25%), Positives = 94/240 (39%), Gaps = 49/240 (20%)
Query: 494 LEDGSKTVLHVQSLGHALHAFINGKLVGSGYGSSSNAKVTVDFPIALAPGKNTFDLLSLT 553
L+ G +L ++ L +NGK VG+ + + + P+ G D+L
Sbjct: 411 LKGGKSGLLKIKELRDYAVVMLNGKTVGTLDRRLNQDSLQIKLPV----GAVVLDILVEN 466
Query: 554 VGLQNYGAFYEKTGAGITGPVQLKGSGNGTNIDLSSQQWTYQTGLKGEELNFPSGSSTQW 613
+G N+G + + GIT V L + N + S + + E +N SGSST
Sbjct: 467 LGRINFGKYLLQNKKGITEKV-LFNTQQVNNWQMYSLPFNH-----AEAINLKSGSSTM- 519
Query: 614 DSKSTLPKLQPLVWYKTTFDAPAGSEPVAIDFTGMGKGEAWVNGQSIGRYWPTYVSQNGG 673
T P + K+ + + +D GKG WVNG ++GRYW Q G
Sbjct: 520 ---GTAPVI------KSGYFNLQKTGDTYLDMRKWGKGLVWVNGHNLGRYW-----QVG- 564
Query: 674 CTDSCNYRGAYSSNKCLKNCGKPSQSLYHVPRSWLKSSGNTLVLFEEIGGDPTKISFVTK 733
P Q+LY VP WLK N + + E + + +S + K
Sbjct: 565 ----------------------PQQTLY-VPAEWLKKGQNEVRVLELLKPEQNTLSALDK 601
>gi|167755577|ref|ZP_02427704.1| hypothetical protein CLORAM_01091 [Clostridium ramosum DSM 1402]
gi|167704516|gb|EDS19095.1| glycosyl hydrolase family 35 [Clostridium ramosum DSM 1402]
Length = 584
Score = 156 bits (394), Expect = 5e-35, Method: Compositional matrix adjust.
Identities = 107/323 (33%), Positives = 148/323 (45%), Gaps = 46/323 (14%)
Query: 31 HRAVVIGGKRRVLISGSIHYPRSTPEMWPDLIQKSKDGGLDVIETYVFWNLHEPVRNQYN 90
++ I G + +ISG++HY R PE W D + K G + +ETYV WNLHEP + +Y+
Sbjct: 7 NKEFFINGNKVKIISGAVHYFRIVPEYWRDTLLDLKAMGCNTVETYVPWNLHEPYQGKYD 66
Query: 91 FEGRYDLVKFVKLVAEAGLYAHLRIGPYVCAEWNFGGFPLWLHFIPGIQFRTDNEPFKAE 150
F G D+ F+KL E L+ LR PY+CAEW GG P WL P I+ RT+++ +
Sbjct: 67 FSGIKDIETFLKLAEELELFVILRASPYICAEWEMGGLPAWLLKYPRIRLRTNDKQYLKC 126
Query: 151 MQRFTAKIVDMMKQEKLYASQGGPIILSQIENEYGNIDSAYGAAGKSYIKWAAGMALSLD 210
+ ++ + + + K K +Q GPIIL+Q+ENEYG +YG K Y+ M
Sbjct: 127 LDQYFS--ILLPKLSKYQITQNGPIILAQLENEYG----SYG-EDKEYLLAVYQMMRKYG 179
Query: 211 TGVPWVMCQ-----------------------QSDAPDPIINTCNGFYCDQFTPNSNNKP 247
VP S A + I Q T P
Sbjct: 180 IEVPLFTADGTWHEALNAGSLLEKKVFPTGNFGSQAKENITVLKKFMESYQITA-----P 234
Query: 248 KMWTENWSGWFLSFGGAVPYRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNFDRTSGG-- 305
M E W GWF + + R ++ + G N+YM+ GGTNF +G
Sbjct: 235 LMCMEFWDGWFNRWNQEIIKRDPQEFVNSAQEMLSLGSV--NFYMFQGGTNFGWMNGCSA 292
Query: 306 ------PFISTSYDYDAPLDEYG 322
P I TSYDYDA L EYG
Sbjct: 293 RKEHDLPQI-TSYDYDAILTEYG 314
>gi|417991864|ref|ZP_12632235.1| beta-galactosidase 3 [Lactobacillus casei CRF28]
gi|410534805|gb|EKQ09440.1| beta-galactosidase 3 [Lactobacillus casei CRF28]
Length = 598
Score = 156 bits (394), Expect = 6e-35, Method: Compositional matrix adjust.
Identities = 111/315 (35%), Positives = 159/315 (50%), Gaps = 30/315 (9%)
Query: 30 DHRAVVIGGKRRVLISGSIHYPRSTPEMWPDLIQKSKDGGLDVIETYVFWNLHEPVRNQY 89
DH ++ G ++L SG+IHY R P W + K G + +ETYV WNLHE +
Sbjct: 7 DHEFMLDGQPFKIL-SGAIHYFRVHPSDWYHSLYNLKALGFNTVETYVPWNLHEYNEGDF 65
Query: 90 NFEGRYDLVKFVKLVAEAGLYAHLRIGPYVCAEWNFGGFPLWLHFIPGIQFRTDNEPFKA 149
+F G D+ F+ + GLYA +R PY+CAEW FGGFP WL ++ RTD+ +
Sbjct: 66 DFSGILDIEHFLNTAKDLGLYAIVRPSPYICAEWEFGGFPAWL-LTKKMRLRTDDSAYLQ 124
Query: 150 EMQRFTAKIVDMMKQEKLYASQGGPIILSQIENEYGNI--DSAYGAAGKSYIKWAAGMAL 207
+ R+ ++ + ++ + GG +I+ Q+ENEYG+ D Y AA +K G+ +
Sbjct: 125 AIDRYYTALMPHLVGHQV--THGGNVIMMQVENEYGSYGEDKDYLAAVAELMK-KHGVDV 181
Query: 208 SLDTGV-PW--VMCQQSDAPDPIINTCN-GFYCDQ--------FTPNSNNKPKMWTENWS 255
L T PW + S A I+ T N G + D + ++ P M E W
Sbjct: 182 PLFTSDGPWPATLNAGSMADAGILTTGNFGSHADMNFDRLAAFNQAHGHDWPLMCMEFWD 241
Query: 256 GWFLSFGGAVPYRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNFDRTSGG--------PF 307
GWF +G + R E+ A + QRG N YM+HGGTNF +G P
Sbjct: 242 GWFNRWGEPIIRRDPEETAEDLRAVIQRGSV--NLYMFHGGTNFGFMNGTSARKDHDLPQ 299
Query: 308 ISTSYDYDAPLDEYG 322
+ TSYDYDAPL+E G
Sbjct: 300 V-TSYDYDAPLNEQG 313
Score = 45.8 bits (107), Expect = 0.11, Method: Compositional matrix adjust.
Identities = 59/250 (23%), Positives = 94/250 (37%), Gaps = 55/250 (22%)
Query: 495 EDGSKTVLHVQSLGHALHAFINGKLVGSGYGSSSNAKVTVDFPIALAPGKNTFDLLSLTV 554
+ G+ L V + AF +GK + + Y + + D + G++ DLL +
Sbjct: 398 DKGTPAKLRVIDARDRVQAFFDGKSLATQYQEA----IGDDILLPEVEGRHQLDLLVENM 453
Query: 555 GLQNYGAFYEKTGAGITGPVQLKGSGNGTNIDLSSQQWTYQTGL---KGEELNFPSGSST 611
NYG+ I Q KG G +DL + Q L K +L+F +
Sbjct: 454 SRVNYGS-------KIEAITQFKGIRTGVMVDLHFIKDYLQYPLDLNKAPQLDF----TG 502
Query: 612 QWDSKSTLPKLQPLVWYKTTFDAPAGSEPVAIDFTGMGKGEAWVNGQSIGRYWPTYVSQN 671
W + + +Y+ FD + +D G GKG VNG +IGR+W +
Sbjct: 503 DWQAGTP-------AFYQYGFDV-VKPQDTYLDCRGFGKGVMLVNGVNIGRFW-----EK 549
Query: 672 GGCTDSCNYRGAYSSNKCLKNCGKPSQSLYHVPRSWLKSSGNTLVLFEEIGGDPTKISFV 731
G P+ SLY VP L + N +++FE G I+ V
Sbjct: 550 G-----------------------PTLSLY-VPAGLLHTGHNEVIVFETEGQYAEAINLV 585
Query: 732 TKQLGSSLCS 741
+ L +
Sbjct: 586 DHPIFKELNT 595
>gi|403304858|ref|XP_003942999.1| PREDICTED: beta-galactosidase-1-like protein 2 [Saimiri boliviensis
boliviensis]
Length = 636
Score = 156 bits (394), Expect = 6e-35, Method: Compositional matrix adjust.
Identities = 110/310 (35%), Positives = 153/310 (49%), Gaps = 17/310 (5%)
Query: 43 LISGSIHYPRSTPEMWPDLIQKSKDGGLDVIETYVFWNLHEPVRNQYNFEGRYDLVKFVK 102
+ GSIHY R E W D + K K GL+ + TYV WNLHEP R +++F G DL F+
Sbjct: 63 IFGGSIHYFRVPKEYWRDRLLKMKACGLNTLTTYVPWNLHEPERGKFDFSGNLDLEAFIL 122
Query: 103 LVAEAGLYAHLRIGPYVCAEWNFGGFPLWLHFIPGIQFRTDNEPFKAEMQRFTAKIVDMM 162
+ +E GL+ LR GPY+C+E + GG P WL PG++ RT + F + + + M
Sbjct: 123 MASEIGLWVILRPGPYICSEIDLGGLPSWLLQDPGMRLRTTYKGFTEAVDLYFDHL--MS 180
Query: 163 KQEKLYASQGGPIILSQIENEYG--NIDSAYGAAGKSYIKWAAGMALSLDT----GVPWV 216
+ L +GGPII Q+ENEYG N D AY K ++ + L L + G+
Sbjct: 181 RVVPLQYKRGGPIIAVQVENEYGSYNKDPAYMPYVKKALEDRGIVELLLTSDNKDGLSKG 240
Query: 217 MCQQSDAPDPIINTCNGFYCDQFTPN-SNNKPKMWTENWSGWFLSFGGAVPYRPVEDLAF 275
+ A + +T F N +PKM E W+GWF S+GG ++
Sbjct: 241 IVHGVLATINLQSTHELQLLTTFLFNVQGTQPKMVMEYWTGWFDSWGGPHNILDSSEVLK 300
Query: 276 AVARFFQRGGTFQNYYMYHGGTNFDRTSGGPFIS------TSYDYDAPLDEYGLIRQPKW 329
V+ G + N YM+HGGTNF +G TSYDYDA L E G K+
Sbjct: 301 TVSAIVDAGSSI-NLYMFHGGTNFGFMNGAMHFHDYKSDVTSYDYDAVLTEAG-DYTAKY 358
Query: 330 GHLKDLHKAI 339
L+D +I
Sbjct: 359 MKLRDFFGSI 368
>gi|423301385|ref|ZP_17279409.1| hypothetical protein HMPREF1057_02550 [Bacteroides finegoldii
CL09T03C10]
gi|408471986|gb|EKJ90515.1| hypothetical protein HMPREF1057_02550 [Bacteroides finegoldii
CL09T03C10]
Length = 779
Score = 155 bits (393), Expect = 6e-35, Method: Compositional matrix adjust.
Identities = 126/386 (32%), Positives = 184/386 (47%), Gaps = 42/386 (10%)
Query: 1 MASKEILLLVLCWGFVVLATTSFGANVTYDH-----RAVVIGGKRRVLISGSIHYPRSTP 55
M ++ I LLVL V +S A T ++ GK V+ + +HY R
Sbjct: 1 MKNRLIALLVLF--TVTFFVSSAQAQTTARKFEAGKNTFLLDGKPFVVKAAELHYTRIPQ 58
Query: 56 EMWPDLIQKSKDGGLDVIETYVFWNLHEPVRNQYNFEGRYDLVKFVKLVAEAGLYAHLRI 115
W I+ K G++ I Y+FWN+HE +++F G+ D+ F + + G+Y +R
Sbjct: 59 AYWEHRIEMCKALGMNTICIYIFWNIHEQEEGKFDFTGQNDIAAFCRAAQKHGMYVIVRP 118
Query: 116 GPYVCAEWNFGGFPLWLHFIPGIQFRTDNEPFKAEMQRFTAKIVDMMKQ-EKLYASQGGP 174
GPYVCAEW GG P WL I RT + + M+R + ++ KQ L ++GG
Sbjct: 119 GPYVCAEWEMGGLPWWLLKKKDIALRTLDPYY---MERVGIFMKEVGKQLAPLQVNKGGN 175
Query: 175 IILSQIENEYGN--IDSAYGAAGKSYIKWAAGMALSLDTGVPWVMCQQS-----DAPDPI 227
II+ Q+ENEYG+ I+ Y +A + ++ +G T VP C S +A D +
Sbjct: 176 IIMVQVENEYGSYGINKPYVSAVRDLVR-ESGF-----TDVPLFQCDWSSNFTNNALDDL 229
Query: 228 INTCN---GFYCD-QFTPNSNNKPK---MWTENWSGWFLSFGGAVPYRPVEDLAFAVARF 280
I T N G D QF +P+ M +E WSGWF +G RP +D+ +
Sbjct: 230 IWTVNFGTGANIDQQFKKLKELRPETPLMCSEFWSGWFDHWGRKHETRPAKDMVQGIKDM 289
Query: 281 FQRGGTFQNYYMYHGGTNFDRTSGG-----PFISTSYDYDAPLDEYGLIRQPKWGHLKDL 335
R +F + YM HGGT F G + +SYDYDAP+ E G + K+ L+DL
Sbjct: 290 LDRNISF-SLYMTHGGTTFGHWGGANNPAYSAMCSSYDYDAPISEAGWTTE-KYFLLRDL 347
Query: 336 HKAIKLCEAALVATDPTYPSLGPNLE 361
K AAL P P+ P +E
Sbjct: 348 LKNYLPAGAAL----PEVPAALPVME 369
Score = 48.1 bits (113), Expect = 0.019, Method: Compositional matrix adjust.
Identities = 61/251 (24%), Positives = 97/251 (38%), Gaps = 51/251 (20%)
Query: 500 TVLHVQSLGHALHAFINGKLVGSGYGSSSNAKVTVDFPIALAPGKNTFDLLSLTVGLQNY 559
TVL + + +++GKL+ + T P+ L G D+L +G N+
Sbjct: 421 TVLKITEVHDWAQVYVDGKLLARL--DRRKGEFTTTLPV-LKKG-TQLDILIEAMGRVNF 476
Query: 560 G-AFYEKTGAGITGPVQLKGSGNGTNIDLSSQQWTYQTGLKGEELNFPSGSSTQWDSKST 618
+ +++ GIT V+L SGN T + WT NFP S D K
Sbjct: 477 DKSIHDR--KGITEKVELI-SGNQTK---ELKNWTV--------YNFPVDYSFIKDKKYN 522
Query: 619 LPKLQPLV--WYKTTFDAPAGSEPVAIDFTGMGKGEAWVNGQSIGRYWPTYVSQNGGCTD 676
K P + +YK TF + +D + GKG WVNG ++GR+W
Sbjct: 523 ETKQLPTMPAYYKGTFKLDKVGDTF-LDMSTWGKGMVWVNGHAMGRFWEI---------- 571
Query: 677 SCNYRGAYSSNKCLKNCGKPSQSLYHVPRSWLKSSGNTLVLFEEIGGDPTKISFVTKQLG 736
P Q+L+ +P WLK N +++ + G I + K +
Sbjct: 572 ------------------GPQQTLF-MPGCWLKKGENEILVLDLKGPAKASIKGLKKPIL 612
Query: 737 SSLCSHVTDSH 747
L ++H
Sbjct: 613 DVLREKAPETH 623
>gi|319900291|ref|YP_004160019.1| Beta-galactosidase [Bacteroides helcogenes P 36-108]
gi|319415322|gb|ADV42433.1| Beta-galactosidase [Bacteroides helcogenes P 36-108]
Length = 629
Score = 155 bits (393), Expect = 6e-35, Method: Compositional matrix adjust.
Identities = 104/330 (31%), Positives = 159/330 (48%), Gaps = 37/330 (11%)
Query: 36 IGGKRRVLISGSIHYPRSTPEMWPDLIQKSKDGGLDVIETYVFWNLHEPVRNQYNFEGRY 95
+ GK+ ++SG +HY R + W +Q K GL+ + TYVFWN HE +++F G
Sbjct: 38 LNGKQTPILSGEMHYARIPHQYWRHRLQMMKGMGLNAVATYVFWNHHETEPGKWDFTGDK 97
Query: 96 DLVKFVKLVAEAGLYAHLRIGPYVCAEWNFGGFPLWLHFIPGIQFRTDNEPFKAEMQRFT 155
+L +++K E G+ LR GPYVCAEW FGG+P WL +PG++ R DN F + +
Sbjct: 98 NLAEYIKTAGEEGMMVILRPGPYVCAEWEFGGYPWWLQNVPGMEIRRDNPQFLKHTEAYI 157
Query: 156 AKIVDMMKQEKLYASQGGPIILSQIENEYGNI-----------DSAYGAAGKSYIKWAAG 204
++ + L ++GGPI++ Q ENE+G+ AY A K + AG
Sbjct: 158 QRLYKEVGH--LQCTKGGPIVMVQCENEFGSYVAQRKDITLQEHRAYNAKIKQQLA-DAG 214
Query: 205 MALSLDTGV-PWVMCQQSDAPDPIINTCNGF--------YCDQFTPNSNNKPKMWTENWS 255
+ L T W+ + + + + T NG +Q+ + P M E +
Sbjct: 215 FDVPLFTSDGSWLF--EGGSTEGALPTANGETDIANLKKVVNQY--HGGQGPYMVAEFYP 270
Query: 256 GWFLSFGGAVPYRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNFDRTSGGPFIS------ 309
GW + P +A + + +F N YM HGGTNF TSG +
Sbjct: 271 GWLSHWAEPFPQVSASSVARTTESYLKNDVSF-NVYMVHGGTNFGFTSGANYDKKRDIQP 329
Query: 310 --TSYDYDAPLDEYGLIRQPKWGHLKDLHK 337
TSYDYDAP+ E G + PK+ ++ + K
Sbjct: 330 DLTSYDYDAPISEAGWV-TPKYDSIRAVIK 358
>gi|385261583|ref|ZP_10039703.1| glycosyl hydrolase family 35 [Streptococcus sp. SK643]
gi|385192786|gb|EIF40181.1| glycosyl hydrolase family 35 [Streptococcus sp. SK643]
Length = 595
Score = 155 bits (393), Expect = 6e-35, Method: Compositional matrix adjust.
Identities = 125/372 (33%), Positives = 181/372 (48%), Gaps = 59/372 (15%)
Query: 36 IGGKRRVLISGSIHYPRSTPEMWPDLIQKSKDGGLDVIETYVFWNLHEPVRNQYNFEGRY 95
+ GK ++SG+IHY R E W + K G + +ETYV WNLHEPV ++NFEG
Sbjct: 12 LDGKLFKILSGAIHYFRIPAEDWYHSLYNLKALGFNTVETYVAWNLHEPVEGEFNFEGAL 71
Query: 96 DLVKFVKLVAEAGLYAHLRIGPYVCAEWNFGGFPLWLHFIPGIQFRTDNEPFKAEMQRFT 155
DL +F+++ + GLYA +R P++CAEW FGG P WL ++ R+ + + + R+
Sbjct: 72 DLERFLQIAQDLGLYAIVRPSPFICAEWEFGGLPAWL-LTKDMRIRSSDPSYIEAVGRYY 130
Query: 156 AKIVDMMKQEKLYASQGGPIILSQIENEYGNIDSAYGAAGKSYIKWAAGMALSLDTGV-- 213
+++ + L +GG I++ Q+ENEYG +YG KSY++ A L + G+
Sbjct: 131 DQLLPRLIPHLL--DKGGNILMMQVENEYG----SYGE-DKSYLR--AIRKLMEERGIDC 181
Query: 214 -------PWVMCQQSDA--PDPIINTCN-------GFYCDQ--FTPNSNNKPKMWTENWS 255
PW ++ D + T N F Q F + P M E W
Sbjct: 182 PLFTSDGPWRATLKAGTLIEDDLFVTGNFGSKAPYNFSQMQEFFDEHGKKWPLMCMEFWD 241
Query: 256 GWFLSFGGAVPYRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNFDRTSGG--------PF 307
GWF + + R ++LA AV ++G N YM+HGGTNF +G P
Sbjct: 242 GWFNRWKEPIITRDPKELADAVREVLEQGSI--NLYMFHGGTNFGFMNGCSARGTLDLPQ 299
Query: 308 ISTSYDYDAPLDEYGLIRQPKWGHLKDLHKAIKLCEAALVATD-PTYPSLGPNLEATVYK 366
+ TSYDYDA LDE G P +L A+K ++AT P YP L P +YK
Sbjct: 300 V-TSYDYDALLDEEG---NPTAKYL-----AVK----KMMATHFPEYPQLEP-----LYK 341
Query: 367 TGSGLCSAFLAN 378
+ S LA
Sbjct: 342 ESMEMDSISLAE 353
>gi|307707961|ref|ZP_07644436.1| beta-galactosidase [Streptococcus mitis NCTC 12261]
gi|307616026|gb|EFN95224.1| beta-galactosidase [Streptococcus mitis NCTC 12261]
Length = 595
Score = 155 bits (393), Expect = 6e-35, Method: Compositional matrix adjust.
Identities = 115/345 (33%), Positives = 169/345 (48%), Gaps = 40/345 (11%)
Query: 36 IGGKRRVLISGSIHYPRSTPEMWPDLIQKSKDGGLDVIETYVFWNLHEPVRNQYNFEGRY 95
+ GK ++SG+IHY R PE W + K G + +ETYV WNLHEP +++FEG
Sbjct: 12 LDGKSFKILSGAIHYFRVPPEDWYHSLYNLKALGFNTVETYVAWNLHEPREGEFHFEGAL 71
Query: 96 DLVKFVKLVAEAGLYAHLRIGPYVCAEWNFGGFPLWLHFIPGIQFRTDNEPFKAEMQRFT 155
DL +F++ + GLYA +R P++CAEW FGG P WL ++ R+ + + + R+
Sbjct: 72 DLERFLQTAQDLGLYAIVRPSPFICAEWEFGGLPAWL-LTKDMRLRSSDPAYIEAVGRYY 130
Query: 156 AKIVDMMKQEKLYASQGGPIILSQIENEYGNI--DSAYGAAGKSYIKWAAGMALSLDTGV 213
+++ + L +GG I++ Q+ENEYG+ D AY A + ++ +
Sbjct: 131 DQLLSRLVPHLL--DKGGNILMMQVENEYGSYGEDKAYLRAIRQLMEERGVTCPLFTSDG 188
Query: 214 PWVMCQQSDA--PDPIINTCN-------GFYCDQ--FTPNSNNKPKMWTENWSGWFLSFG 262
PW ++ D + T N F Q F + P M E W GWF +
Sbjct: 189 PWRATLKAGTLIEDDLFVTGNFGSKAPYNFSQMQEFFDEHGKKWPLMCMEFWDGWFNRWK 248
Query: 263 GAVPYRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNFDRTSGG--------PFISTSYDY 314
+ R ++LA AV ++G N YM+HGGTNF +G P + TSYDY
Sbjct: 249 EPIITRDPKELADAVREVLEQGSI--NLYMFHGGTNFGFMNGCSARGTLDLPQV-TSYDY 305
Query: 315 DAPLDEYGLIRQPKWGHLKDLHKAIKLCEAALVATD-PTYPSLGP 358
DA LDE G P +L A+K ++AT P YP L P
Sbjct: 306 DALLDEEG---NPTAKYL-----AVK----KMMATHFPEYPQLEP 338
>gi|433679946|ref|ZP_20511609.1| beta-galactosidase [Xanthomonas translucens pv. translucens DSM
18974]
gi|430814938|emb|CCP42238.1| beta-galactosidase [Xanthomonas translucens pv. translucens DSM
18974]
Length = 615
Score = 155 bits (393), Expect = 6e-35, Method: Compositional matrix adjust.
Identities = 108/321 (33%), Positives = 152/321 (47%), Gaps = 35/321 (10%)
Query: 38 GKRRVLISGSIHYPRSTPEMWPDLIQKSKDGGLDVIETYVFWNLHEPVRNQYNFEGRYDL 97
GK +ISG+IH+ R W D +QK++ GL+ +ETYVFWNL EP + Q++F G DL
Sbjct: 44 GKPYQIISGAIHFQRIPRAYWKDRLQKARAMGLNTVETYVFWNLVEPRQGQFDFSGNNDL 103
Query: 98 VKFVKLVAEAGLYAHLRIGPYVCAEWNFGGFPLWLHFIPGIQFRTDNEPFKAEMQRFTAK 157
F+ A GL LR GPYVCAEW GG+P WL PG++ R+ + F A Q +
Sbjct: 104 AAFIDAAAAQGLNVILRPGPYVCAEWEAGGYPAWLFAQPGLRVRSQDPRFLAASQAYLDA 163
Query: 158 IVDMMKQEKLYASQGGPIILSQIENEYGNIDSAYGAAGKSYIKWAAGMALSLDTGVPWVM 217
+ +K + GGP+I Q+ENEYG+ D ++ A + + G +
Sbjct: 164 VAAQVKPK--LNRNGGPVIAVQVENEYGSYDD-------DHVYMQANRTMFVKAGFDKAL 214
Query: 218 CQQSDAPDPIINTC--NGFYCDQFTPNSNNK------------PKMWTENWSGWFLSFGG 263
+D D + N + F P K P+M E W+GWF +G
Sbjct: 215 LFTADGADVLANGTLPDTLAVVNFGPGDAEKAFQTLSKFRPGQPQMVGEYWAGWFDQWGD 274
Query: 264 AVPYRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNFDRTSGGPF----------ISTSYD 313
+ A + + R G N YM+ GGT+F +G F +TSYD
Sbjct: 275 KHANTDAKKQA-SEFEWILRQGHSANIYMFVGGTSFGFMNGANFQKNASDHYAPQTTSYD 333
Query: 314 YDAPLDEYGLIRQPKWGHLKD 334
YDA LDE G PK+ +D
Sbjct: 334 YDAVLDEAGRP-TPKFALFRD 353
Score = 42.7 bits (99), Expect = 0.73, Method: Compositional matrix adjust.
Identities = 55/241 (22%), Positives = 89/241 (36%), Gaps = 53/241 (21%)
Query: 499 KTVLHVQSLGHALHAFINGKLVGSGYGSSSNAKVTVDFPIALAPGKNTFDLLSLTVGLQN 558
K L++ + +++ L GS V VD P G +T D+L G N
Sbjct: 424 KGSLYLGDVRDYARVYVDRSLAGSAERRLQQVAVDVDIPA----GPHTVDVLVENGGRIN 479
Query: 559 YGAFYEKTGAGITGPVQLKGSGNGTNIDLSSQQWTYQTGLKGEELNFPSGSS-TQWDSKS 617
YG AG+ PV L G TG + L S T W
Sbjct: 480 YGTHLPDGRAGLVDPVLLNGKP--------------LTGWQTFSLPMDDPSKLTGW---- 521
Query: 618 TLPKLQPLVWYKTTFDAPAGSEPVAIDFTGMGKGEAWVNGQSIGRYWPTYVSQNGGCTDS 677
T K++ +++ T ++ +D GKG AW NG ++GR+W
Sbjct: 522 TTAKVEGPAFHRGTVKIATPTD-TFLDMQAFGKGVAWANGHNLGRHW------------- 567
Query: 678 CNYRGAYSSNKCLKNCGKPSQSLYHVPRSWLKSSGNTLVLFEEIGGDPTKISFVTKQLGS 737
N G P ++LY VP + + N++++F+ + V +Q+ S
Sbjct: 568 --------------NIG-PQRALY-VPAPFQRKGENSVIVFDLDSAAEASVRGVKEQVWS 611
Query: 738 S 738
+
Sbjct: 612 A 612
>gi|348529664|ref|XP_003452333.1| PREDICTED: beta-galactosidase-like [Oreochromis niloticus]
Length = 651
Score = 155 bits (393), Expect = 6e-35, Method: Compositional matrix adjust.
Identities = 117/363 (32%), Positives = 172/363 (47%), Gaps = 20/363 (5%)
Query: 6 ILLLVLCWGFVVLATTSFGANVTYDHRAVVIGGKRRVLISGSIHYPRSTPEMWPDLIQKS 65
+LLL++ +G + + SF V Y + G++ ISGSIHY R W D + K
Sbjct: 9 VLLLLMLFGRSLGESPSF--TVDYQNDCFRKDGEKFQYISGSIHYNRIPRVYWKDRLLKM 66
Query: 66 KDGGLDVIETYVFWNLHEPVRNQYNFEGRYDLVKFVKLVAEAGLYAHLRIGPYVCAEWNF 125
GL+ I+TYV WN HE V YNF G DL F+KL + GL LR GPY+CAEW+
Sbjct: 67 YMAGLNAIQTYVPWNYHEEVPGLYNFSGDRDLEHFLKLAQDVGLLVILRPGPYICAEWDM 126
Query: 126 GGFPLWLHFIPGIQFRTDNEPFKAEMQRFTAKIVDMMKQEKLYASQGGPIILSQIENEYG 185
GG P WL I R+ + + A + ++ K++ M+K LY GGPII Q+ENEYG
Sbjct: 127 GGLPAWLLKKKDIVLRSTDPDYIAAVDKWMGKLLPMIK-PYLY-QNGGPIITVQVENEYG 184
Query: 186 -------NIDSAYGAAGKSYIKWAAGMALSLDTGVPWVMCQQSDAPDPIINTCNGF-YCD 237
N +SY+ + + G+ ++ C ++ G
Sbjct: 185 SYFACDYNYMRHLSKLFRSYLGDEVVLFTTDGAGLGYLKCGSIQDLYATVDFGPGANVTA 244
Query: 238 QFTPNSNNKPK---MWTENWSGWFLSFGGAVPYRPVEDLAFAVARFFQRGGTFQNYYMYH 294
F P +P + +E ++GW +G +A A++ G N YM+
Sbjct: 245 AFEPQRQVQPHGPLVNSEFYTGWLDHWGSRHSVVSPTQVAKALSEMLLMGANV-NLYMFI 303
Query: 295 GGTNFDRTSGG--PFIS--TSYDYDAPLDEYGLIRQPKWGHLKDLHKAIKLCEAALVATD 350
GGTNF +G P+ + TSYDYDAPL E G + + + + + K+ E + T
Sbjct: 304 GGTNFGYWNGANTPYAAQPTSYDYDAPLTEAGDLTEKYFAIREVIKMYSKVPEGPIPPTT 363
Query: 351 PTY 353
P Y
Sbjct: 364 PKY 366
>gi|419767276|ref|ZP_14293433.1| glycosyl hydrolase family 35 [Streptococcus mitis SK579]
gi|383353272|gb|EID30895.1| glycosyl hydrolase family 35 [Streptococcus mitis SK579]
Length = 595
Score = 155 bits (393), Expect = 6e-35, Method: Compositional matrix adjust.
Identities = 116/345 (33%), Positives = 169/345 (48%), Gaps = 40/345 (11%)
Query: 36 IGGKRRVLISGSIHYPRSTPEMWPDLIQKSKDGGLDVIETYVFWNLHEPVRNQYNFEGRY 95
+ GK ++SG+IHY R PE W + K G + +ETYV WNLHEP ++NFEG
Sbjct: 12 LDGKPFKILSGAIHYFRIPPEDWSHSLYNLKALGFNTVETYVAWNLHEPREGEFNFEGAL 71
Query: 96 DLVKFVKLVAEAGLYAHLRIGPYVCAEWNFGGFPLWLHFIPGIQFRTDNEPFKAEMQRFT 155
DL +F+++ + GLYA +R P++CAEW FGG P WL ++ R+ + + + R+
Sbjct: 72 DLERFLQIAQDLGLYAIVRPSPFICAEWEFGGLPAWL-LTKDMRIRSSDPAYIEAVGRYY 130
Query: 156 AKIVDMMKQEKLYASQGGPIILSQIENEYGNI--DSAYGAAGKSYIKWAAGMALSLDTGV 213
+++ + L +GG I++ Q+ENEYG+ D AY A + ++ +
Sbjct: 131 DQLLPRLVPHLL--DKGGNILMMQVENEYGSYGEDKAYLRAIRQLMEDRGVTCPLFTSDG 188
Query: 214 PWVMCQQSDA--PDPIINTCN-------GFYCDQ--FTPNSNNKPKMWTENWSGWFLSFG 262
PW ++ D + T N F Q F P M E W GWF +
Sbjct: 189 PWRATLKAGTLIEDDLFVTGNFGSKAPYNFSQMQEFFDEYGKKWPLMCMEFWDGWFNRWK 248
Query: 263 GAVPYRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNFDRTSGG--------PFISTSYDY 314
+ R ++LA AV ++G N YM+HGGTNF +G P + TSYDY
Sbjct: 249 EPIITRDPKELADAVREVLEQGSI--NLYMFHGGTNFGFMNGCSARGTLDLPQV-TSYDY 305
Query: 315 DAPLDEYGLIRQPKWGHLKDLHKAIKLCEAALVATD-PTYPSLGP 358
DA LDE G P +L A+K ++AT P YP L P
Sbjct: 306 DALLDEEG---NPTAKYL-----AVK----KMMATHFPEYPQLEP 338
>gi|291410639|ref|XP_002721600.1| PREDICTED: galactosidase, beta 1-like [Oryctolagus cuniculus]
Length = 635
Score = 155 bits (393), Expect = 7e-35, Method: Compositional matrix adjust.
Identities = 109/314 (34%), Positives = 150/314 (47%), Gaps = 25/314 (7%)
Query: 43 LISGSIHYPRSTPEMWPDLIQKSKDGGLDVIETYVFWNLHEPVRNQYNFEGRYDLVKFVK 102
+ GS+HY R E W D + K K GL+ + TYV WNLHEP R +++F G DL FV
Sbjct: 63 IFGGSMHYFRVPKEYWRDRLLKMKACGLNTLTTYVPWNLHEPERGKFDFSGNLDLEAFVL 122
Query: 103 LVAEAGLYAHLRIGPYVCAEWNFGGFPLWLHFIPGIQFRTDNEPFKAEMQRFTAKIVDMM 162
+ AE GL+ LR GPY+C+E + GG P WL G++ RT + F + + + M
Sbjct: 123 MAAEIGLWVILRPGPYICSEIDLGGLPSWLLQDSGMRLRTTYKGFTEAVDLYFDHL--MS 180
Query: 163 KQEKLYASQGGPIILSQIENEYG--NIDSAYGAAGKSYIKWAAGMALSLDTG-------- 212
+ L GGPII Q+ENEYG N D AY K ++ + L L +
Sbjct: 181 RVVPLQYKHGGPIIAVQVENEYGSYNKDPAYMPYIKRALEDRGIVELLLTSDNKDGLSKG 240
Query: 213 -VPWVMCQQSDAPDPIINTCNGFYCDQFTPNSNNKPKMWTENWSGWFLSFGGAVPYRPVE 271
VP VM + + + F +PKM E W+GWF S+GG
Sbjct: 241 VVPGVMATINLQSHAELQSLTTFLLSV----KGIQPKMVMEYWTGWFDSWGGPHNILDSS 296
Query: 272 DLAFAVARFFQRGGTFQNYYMYHGGTNFDRTSGGPFIS------TSYDYDAPLDEYGLIR 325
++ V+ G + N YM+HGGTNF +G TSYDYDA L E G
Sbjct: 297 EVLQTVSAIVDAGASI-NLYMFHGGTNFGFINGAMHFQEYKSDVTSYDYDAVLTEAG-DY 354
Query: 326 QPKWGHLKDLHKAI 339
K+ L+D ++
Sbjct: 355 TAKYSKLRDFFGSV 368
>gi|301065438|ref|YP_003787461.1| glycosyl hydrolase, family 35 [Lactobacillus casei str. Zhang]
gi|300437845|gb|ADK17611.1| glycosyl hydrolase, family 35 [Lactobacillus casei str. Zhang]
Length = 598
Score = 155 bits (393), Expect = 7e-35, Method: Compositional matrix adjust.
Identities = 110/315 (34%), Positives = 159/315 (50%), Gaps = 30/315 (9%)
Query: 30 DHRAVVIGGKRRVLISGSIHYPRSTPEMWPDLIQKSKDGGLDVIETYVFWNLHEPVRNQY 89
DH ++ G ++L SG+IHY R P W + K G + +ETYV WNLHE +
Sbjct: 7 DHEFMLDGQPFKIL-SGAIHYFRVHPSDWYHSLYNLKALGFNTVETYVPWNLHEYNEGDF 65
Query: 90 NFEGRYDLVKFVKLVAEAGLYAHLRIGPYVCAEWNFGGFPLWLHFIPGIQFRTDNEPFKA 149
+F G D+ +F+ + GLYA +R PY+CAEW FGGFP WL ++ RTD+ +
Sbjct: 66 DFSGILDIERFLNTAKDLGLYAIVRPSPYICAEWEFGGFPAWL-LTKKMRLRTDDSAYLQ 124
Query: 150 EMQRFTAKIVDMMKQEKLYASQGGPIILSQIENEYGNI--DSAYGAAGKSYIKWAAGMAL 207
+ R+ ++ + ++ + GG +I+ Q+ENEYG+ D Y AA +K G+ +
Sbjct: 125 AIDRYYTALMPHLVGHQV--THGGNVIMMQVENEYGSYGEDKDYLAAVAELMK-KHGVDV 181
Query: 208 SLDTGV-PW--VMCQQSDAPDPIINTCN-----GFYCDQFT----PNSNNKPKMWTENWS 255
L T PW + S A I+ T N D+ + ++ P M E W
Sbjct: 182 PLFTSDGPWPATLNAGSMADAGILTTGNFGSRADMNFDRLAAFNQAHGHDWPLMCMEFWD 241
Query: 256 GWFLSFGGAVPYRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNFDRTSGG--------PF 307
GWF +G + R E+ A + QRG N YM+HGGTNF +G P
Sbjct: 242 GWFNRWGEPIIRRDPEETAEDLRAVIQRGSV--NLYMFHGGTNFGFMNGTSARKDHDLPQ 299
Query: 308 ISTSYDYDAPLDEYG 322
+ TSYDYDAPL+E G
Sbjct: 300 V-TSYDYDAPLNEQG 313
Score = 45.8 bits (107), Expect = 0.10, Method: Compositional matrix adjust.
Identities = 59/250 (23%), Positives = 94/250 (37%), Gaps = 55/250 (22%)
Query: 495 EDGSKTVLHVQSLGHALHAFINGKLVGSGYGSSSNAKVTVDFPIALAPGKNTFDLLSLTV 554
+ G+ L V + AF +GK + + Y + + D + G++ DLL +
Sbjct: 398 DKGTPAKLRVIDARDRVQAFFDGKSLATQYQEA----IGDDILLPEVEGRHQLDLLVENM 453
Query: 555 GLQNYGAFYEKTGAGITGPVQLKGSGNGTNIDLSSQQWTYQTGL---KGEELNFPSGSST 611
NYG+ I Q KG G +DL + Q L K +L+F +
Sbjct: 454 SRVNYGS-------KIEAITQFKGIRTGVMVDLHFIKDYLQYPLDLNKAPQLDF----TG 502
Query: 612 QWDSKSTLPKLQPLVWYKTTFDAPAGSEPVAIDFTGMGKGEAWVNGQSIGRYWPTYVSQN 671
W + + +Y+ FD + +D G GKG VNG +IGR+W +
Sbjct: 503 DWQAGTP-------AFYQYGFDV-VKPQDTYLDCRGFGKGVMLVNGVNIGRFW-----EK 549
Query: 672 GGCTDSCNYRGAYSSNKCLKNCGKPSQSLYHVPRSWLKSSGNTLVLFEEIGGDPTKISFV 731
G P+ SLY VP L + N +++FE G I+ V
Sbjct: 550 G-----------------------PTLSLY-VPAGLLHTGHNEVIVFETEGQYAEAINLV 585
Query: 732 TKQLGSSLCS 741
+ L +
Sbjct: 586 DHPIFKELNT 595
>gi|237719727|ref|ZP_04550208.1| beta-galactosidase [Bacteroides sp. 2_2_4]
gi|229450996|gb|EEO56787.1| beta-galactosidase [Bacteroides sp. 2_2_4]
Length = 778
Score = 155 bits (393), Expect = 7e-35, Method: Compositional matrix adjust.
Identities = 120/362 (33%), Positives = 176/362 (48%), Gaps = 39/362 (10%)
Query: 1 MASKEILLLVLCWGFVVLATTSFGANVTYDH-----RAVVIGGKRRVLISGSIHYPRSTP 55
M ++ I LLVL F V+ ++ A T ++ GK V+ + +HY R
Sbjct: 1 MKNRLIALLVL---FTVIFFSTAQAQTTARKFEAGKNTFLLDGKPFVVKAAELHYTRIPQ 57
Query: 56 EMWPDLIQKSKDGGLDVIETYVFWNLHEPVRNQYNFEGRYDLVKFVKLVAEAGLYAHLRI 115
W I+ K G++ I Y+FWN+HE +++F G+ D+ F + + G+Y +R
Sbjct: 58 AYWEHRIEMCKALGMNTICIYIFWNIHEQEEGKFDFSGQNDIATFCRAAQKHGMYVIVRP 117
Query: 116 GPYVCAEWNFGGFPLWLHFIPGIQFRTDNEPFKAEMQRFTAKIVDMMKQ-EKLYASQGGP 174
GPYVCAEW GG P WL I RT + + M+R + ++ KQ L ++GG
Sbjct: 118 GPYVCAEWEMGGLPWWLLKKKDIALRTLDPYY---MERVGIFMKEVGKQLAPLQVNKGGN 174
Query: 175 IILSQIENEYGN--IDSAYGAAGKSYIKWAAGMALSLDTGVPWVMCQQS-----DAPDPI 227
II+ Q+ENEYG+ ID Y +A + ++ +G T VP C S +A D +
Sbjct: 175 IIMVQVENEYGSYGIDKPYVSAVRDLVR-ESGF-----TDVPLFQCDWSSNFTNNALDDL 228
Query: 228 INTCN---GFYCD-QFTPNSNNKPK---MWTENWSGWFLSFGGAVPYRPVEDLAFAVARF 280
I T N G D QF +P+ M +E WSGWF +G RP +D+ +
Sbjct: 229 IWTVNFGTGANIDQQFKKLKELRPETPLMCSEFWSGWFDHWGRKHETRPAKDMVQGIKDM 288
Query: 281 FQRGGTFQNYYMYHGGTNFDRTSGG-----PFISTSYDYDAPLDEYGLIRQPKWGHLKDL 335
R +F + YM HGGT F G + +SYDYDAP+ E G K+ L+DL
Sbjct: 289 LDRNISF-SLYMTHGGTTFGHWGGANNPAYSAMCSSYDYDAPISEPGWTTD-KFFLLRDL 346
Query: 336 HK 337
K
Sbjct: 347 LK 348
Score = 47.4 bits (111), Expect = 0.030, Method: Compositional matrix adjust.
Identities = 61/251 (24%), Positives = 98/251 (39%), Gaps = 51/251 (20%)
Query: 500 TVLHVQSLGHALHAFINGKLVGSGYGSSSNAKVTVDFPIALAPGKNTFDLLSLTVGLQNY 559
T L + + + +GKL+ + T P AL G D+L +G N+
Sbjct: 420 TTLKITEVHDWAQIYADGKLLARL--DRRKGEFTTTLP-ALKKG-TQLDILVEAMGRVNF 475
Query: 560 G-AFYEKTGAGITGPVQLKGSGNGTNIDLSSQQWTYQTGLKGEELNFPSGSSTQWDSKST 618
+ +++ GIT V+L SGN T + WT NFP S D K +
Sbjct: 476 DKSIHDR--KGITEKVELI-SGNQTK---ELKNWTV--------YNFPVDYSFIKDKKYS 521
Query: 619 LPKLQPLV--WYKTTFDAPAGSEPVAIDFTGMGKGEAWVNGQSIGRYWPTYVSQNGGCTD 676
K+ P + +YK+TF + +D + GKG WVNG ++GR+W
Sbjct: 522 DTKILPTMPAYYKSTFTLDKVGDTF-LDMSTWGKGMVWVNGHAMGRFWEI---------- 570
Query: 677 SCNYRGAYSSNKCLKNCGKPSQSLYHVPRSWLKSSGNTLVLFEEIGGDPTKISFVTKQLG 736
P Q+L+ +P WLK N +++ + G I + K +
Sbjct: 571 ------------------GPQQTLF-MPGCWLKEGENEILVLDLKGPTRASIKGLKKPIL 611
Query: 737 SSLCSHVTDSH 747
L ++H
Sbjct: 612 DVLREKAPETH 622
>gi|334348881|ref|XP_001378605.2| PREDICTED: beta-galactosidase-like [Monodelphis domestica]
Length = 658
Score = 155 bits (393), Expect = 7e-35, Method: Compositional matrix adjust.
Identities = 113/347 (32%), Positives = 166/347 (47%), Gaps = 28/347 (8%)
Query: 27 VTYDHRAVVIGGKRRVLISGSIHYPRSTPEMWPDLIQKSKDGGLDVIETYVFWNLHEPVR 86
+ Y+ + GK ISGSIHY R W D + K K GL+ I+TYV WN HEP+
Sbjct: 50 IDYERDQFLKDGKPFRYISGSIHYSRIPRFYWKDRLLKMKMAGLNAIQTYVPWNFHEPLP 109
Query: 87 NQYNFEGRYDLVKFVKLVAEAGLYAHLRIGPYVCAEWNFGGFPLWLHFIPGIQFRTDNEP 146
Y F YDL F++L E GL LR GPY+CAEW+ GG P WL I R+ +
Sbjct: 110 GVYRFSDDYDLEYFLQLAHEIGLLVILRPGPYICAEWDMGGLPAWLLTKKSIVLRSSDPD 169
Query: 147 FKAEMQRFTAKIVDMMKQEKLYASQGGPIILSQIENEYGNIDSAYGAAGKSYIKWAAGMA 206
+ AE +++ ++ MK LY + GGPII Q+ENEYG +Y +Y+++ +
Sbjct: 170 YLAETEKWLGVLLPKMK-PYLYQN-GGPIITVQVENEYG----SYFTCDYNYLRFLQQL- 222
Query: 207 LSLDTGVPWVMCQQSDAPDPIIN--TCNGFYC-----------DQFTPNSNNKPK---MW 250
G V+ A + + T G Y + F +PK +
Sbjct: 223 FHKHLGEEVVLFTTDGASEDYLKCGTLQGLYATVDFGTNHNITEAFQSQRKTEPKGPLVN 282
Query: 251 TENWSGWFLSFGGAVPYRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNFDRTSGG--PFI 308
+E ++GW +G A + + ++ +G N YM+ GGTNF +G P+
Sbjct: 283 SEFYTGWLDHWGEAHETVDTKAIISSLNDMLSQGANV-NMYMFIGGTNFGFWNGANIPYA 341
Query: 309 S--TSYDYDAPLDEYGLIRQPKWGHLKDLHKAIKLCEAALVATDPTY 353
+ TSYDYDAPL E G + + + + + K KL E + T P +
Sbjct: 342 AQPTSYDYDAPLSEAGDLTEKYFALRELIGKFEKLPEGLIPPTTPKF 388
>gi|307710114|ref|ZP_07646558.1| beta-galactosidase [Streptococcus mitis SK564]
gi|307619094|gb|EFN98226.1| beta-galactosidase [Streptococcus mitis SK564]
Length = 595
Score = 155 bits (393), Expect = 7e-35, Method: Compositional matrix adjust.
Identities = 115/345 (33%), Positives = 168/345 (48%), Gaps = 40/345 (11%)
Query: 36 IGGKRRVLISGSIHYPRSTPEMWPDLIQKSKDGGLDVIETYVFWNLHEPVRNQYNFEGRY 95
+ GK ++SG+IHY R PE W + K G + +ETYV WNLHEP +++FEG
Sbjct: 12 LDGKSFKILSGAIHYFRVPPEDWYHSLYNLKALGFNTVETYVAWNLHEPREGEFHFEGAL 71
Query: 96 DLVKFVKLVAEAGLYAHLRIGPYVCAEWNFGGFPLWLHFIPGIQFRTDNEPFKAEMQRFT 155
DL +F++ + GLYA +R P++CAEW FGG P WL ++ R+ + + + R+
Sbjct: 72 DLERFLQTAQDLGLYAIVRPSPFICAEWEFGGLPAWL-LTKNMRLRSSDPAYIEAVGRYY 130
Query: 156 AKIVDMMKQEKLYASQGGPIILSQIENEYGNI--DSAYGAAGKSYIKWAAGMALSLDTGV 213
+++ + L GG I++ Q+ENEYG+ D AY A + ++ +
Sbjct: 131 DQLLSRLVPHLL--DNGGNILMMQVENEYGSYGEDKAYLRAIRQLMEERGVTCPLFTSDG 188
Query: 214 PWVMCQQSDA--PDPIINTCN-------GFYCDQ--FTPNSNNKPKMWTENWSGWFLSFG 262
PW ++ D + T N F Q F + P M E W GWF +
Sbjct: 189 PWRATLKAGTLIEDDLFVTGNFGSKAPYNFSQMQEFFDEHGKKWPLMCMEFWDGWFNRWK 248
Query: 263 GAVPYRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNFDRTSGG--------PFISTSYDY 314
+ R ++LA AV ++G N YM+HGGTNF +G P + TSYDY
Sbjct: 249 EPIITRDPKELAEAVREVLEQGSI--NLYMFHGGTNFGFMNGCSARGTLDLPQV-TSYDY 305
Query: 315 DAPLDEYGLIRQPKWGHLKDLHKAIKLCEAALVATD-PTYPSLGP 358
DA LDE G P +L A+K ++AT P YP L P
Sbjct: 306 DALLDEEG---NPTAKYL-----AVK----KMMATHFPEYPQLEP 338
>gi|315499712|ref|YP_004088515.1| glycoside hydrolase family 35 [Asticcacaulis excentricus CB 48]
gi|315417724|gb|ADU14364.1| glycoside hydrolase family 35 [Asticcacaulis excentricus CB 48]
Length = 613
Score = 155 bits (393), Expect = 7e-35, Method: Compositional matrix adjust.
Identities = 113/371 (30%), Positives = 175/371 (47%), Gaps = 39/371 (10%)
Query: 14 GFVVLATTSFGANVTYDHRAVVIGGKRRVLISGSIHYPRSTPEMWPDLIQKSKDGGLDVI 73
GF ++ + T ++ G+ L++G +HYPR E+W D ++K K GL+ +
Sbjct: 19 GFSAGDASAAPSRFTIKDDQFLLDGQPLHLMAGEMHYPRIPRELWRDRLRKLKALGLNTL 78
Query: 74 ETYVFWNLHEPVRNQYNFEGRYDLVKFVKLVAEAGLYAHLRIGPYVCAEWNFGGFPLWLH 133
TY FW+ HE Y+F G D+ +VK+ E GL+ LR GPY CAEW+ GG+P W
Sbjct: 79 STYTFWSAHEKKPGVYDFSGNLDVAAWVKMAQEEGLHVLLRPGPYACAEWDNGGYPAWFL 138
Query: 134 FIPGIQFRTDNEPFKAEMQRFTAKIVDMMKQEKLYASQGGPIILSQIENEYGNI--DSAY 191
P I+ R+ + + ++ ++ + L +GGP++++QIENEYG+ D Y
Sbjct: 139 NDPDIRPRSLDPRYMGPSGQWLKRLGQEVAH--LEIDKGGPVLMTQIENEYGSYGNDLNY 196
Query: 192 GAAGKSYIKWAAGMALSLDTGVPWVMCQQSDAPDPIINTCNGFYCD-------QFTPNSN 244
A + ++ AAG + L T + + P+ + N N D ++
Sbjct: 197 MRAVRDQVR-AAGFSGQLYTVDGAAVIENGALPE-LFNGINFGTYDKAEGEFARYAKFKT 254
Query: 245 NKPKMWTENWSGWFLSFGGAVPYRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNFDRTSG 304
P+M TE W GWF FG + L ++ +F ++YM HGGT+F +G
Sbjct: 255 KGPRMCTELWGGWFDHFGEVHSNMEISPLMESLKWMLDNRISF-SFYMLHGGTSFAFDAG 313
Query: 305 GPFIST--------SYDYDAPLDEYGLIRQPKWGHLKDL----------------HKAIK 340
F T SYDYDA LDE G + PK+ ++L KA+K
Sbjct: 314 ANFHKTHGYQPDISSYDYDAMLDEAGRV-TPKYEAARELFRRYLPPERFTALPEPEKALK 372
Query: 341 LCEAALVATDP 351
+ AL T P
Sbjct: 373 IERFALRETAP 383
Score = 47.0 bits (110), Expect = 0.041, Method: Compositional matrix adjust.
Identities = 42/149 (28%), Positives = 65/149 (43%), Gaps = 23/149 (15%)
Query: 519 LVGSG---YGSSSNAKVTVDFPIALAPGKNTFDLLSLTVGLQNYGAFYEKTGAGITGPVQ 575
LV +G +G+ + ++L G +T DLL +G NYG K G+ GPV
Sbjct: 434 LVSAGQTRFGTLDRRLKETEIEVSLKAG-DTLDLLIDAMGHVNYGDQIGKDQKGLIGPVT 492
Query: 576 LKGSGNGTNIDLSSQQWTYQTGLKGEELNFPSGSSTQWDSKSTLPKLQPLVWYKTTFDAP 635
L G WT+Q G+ ++L+ + ++ +Y+ TF+
Sbjct: 493 LNGK--------PLTGWTHQ-GVPLDDLSV---------LRFKRQRVNGPAFYRGTFETS 534
Query: 636 AGSEPVAIDFTGMGKGEAWVNGQSIGRYW 664
+D G GKG WVNG ++GRYW
Sbjct: 535 EAGFTF-LDLRGWGKGYVWVNGHNLGRYW 562
>gi|322378066|ref|ZP_08052553.1| glycosyl hydrolase, family 35 [Streptococcus sp. M334]
gi|321281048|gb|EFX58061.1| glycosyl hydrolase, family 35 [Streptococcus sp. M334]
Length = 595
Score = 155 bits (393), Expect = 7e-35, Method: Compositional matrix adjust.
Identities = 115/345 (33%), Positives = 170/345 (49%), Gaps = 40/345 (11%)
Query: 36 IGGKRRVLISGSIHYPRSTPEMWPDLIQKSKDGGLDVIETYVFWNLHEPVRNQYNFEGRY 95
+ GK ++SG+IHY R PE W + K G + +ETYV WNLHEP +++FEG
Sbjct: 12 LDGKSFKILSGAIHYFRVPPEDWYHSLYNLKALGFNTVETYVAWNLHEPREGEFHFEGAQ 71
Query: 96 DLVKFVKLVAEAGLYAHLRIGPYVCAEWNFGGFPLWLHFIPGIQFRTDNEPFKAEMQRFT 155
DL +F+++ + GLYA +R P++CAEW FGG P WL ++ R+ + + + R+
Sbjct: 72 DLERFLQIAQDLGLYAIVRPSPFICAEWEFGGLPAWL-LTKDMRIRSSDPAYIEAVGRYY 130
Query: 156 AKIVDMMKQEKLYASQGGPIILSQIENEYGNI--DSAYGAAGKSYIKWAAGMALSLDTGV 213
+++ + L +GG I++ Q+ENEYG+ D AY A + ++ +
Sbjct: 131 DQLLPRLVPHLL--DKGGNILMMQVENEYGSYGEDKAYLRAIRQLMEERGVTCPLFTSDG 188
Query: 214 PWVMCQQSDA--PDPIINTCN-------GFYCDQ--FTPNSNNKPKMWTENWSGWFLSFG 262
PW ++ D + T N F Q F + P M E W GWF +
Sbjct: 189 PWRATLKAGTLIEDDLFVTGNFGSKAPYNFSQMQEFFDEHGKKWPLMCMEFWDGWFNRWK 248
Query: 263 GAVPYRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNFDRTSGG--------PFISTSYDY 314
+ R ++LA AV ++G N YM+HGGTNF +G P + TSYDY
Sbjct: 249 EPIITRDPKELADAVREVLEQGSI--NLYMFHGGTNFGFMNGCSARGTLDLPQV-TSYDY 305
Query: 315 DAPLDEYGLIRQPKWGHLKDLHKAIKLCEAALVATD-PTYPSLGP 358
DA LDE G P +L A+K ++AT P YP L P
Sbjct: 306 DALLDEEG---NPTAKYL-----AVK----KMMATHFPEYPQLEP 338
>gi|418004004|ref|ZP_12644053.1| beta-galactosidase 3 [Lactobacillus casei UW1]
gi|410551057|gb|EKQ25134.1| beta-galactosidase 3 [Lactobacillus casei UW1]
Length = 598
Score = 155 bits (393), Expect = 7e-35, Method: Compositional matrix adjust.
Identities = 110/315 (34%), Positives = 159/315 (50%), Gaps = 30/315 (9%)
Query: 30 DHRAVVIGGKRRVLISGSIHYPRSTPEMWPDLIQKSKDGGLDVIETYVFWNLHEPVRNQY 89
DH ++ G ++L SG+IHY R P W + K G + +ETYV WNLHE +
Sbjct: 7 DHEFMLDGQPFKIL-SGAIHYFRVHPSDWYHSLYNLKALGFNTVETYVPWNLHEYNEGDF 65
Query: 90 NFEGRYDLVKFVKLVAEAGLYAHLRIGPYVCAEWNFGGFPLWLHFIPGIQFRTDNEPFKA 149
+F G D+ +F+ + GLYA +R PY+CAEW FGGFP WL ++ RTD+ +
Sbjct: 66 DFSGILDIERFLNTAKDLGLYAIVRPSPYICAEWEFGGFPAWL-LTKKMRLRTDDSAYLQ 124
Query: 150 EMQRFTAKIVDMMKQEKLYASQGGPIILSQIENEYGNI--DSAYGAAGKSYIKWAAGMAL 207
+ R+ ++ + ++ + GG +I+ Q+ENEYG+ D Y AA +K G+ +
Sbjct: 125 AIDRYYTALMPHLVGHQV--THGGNVIMMQVENEYGSYGEDKDYLAAVAELMK-KHGVDV 181
Query: 208 SLDTGV-PW--VMCQQSDAPDPIINTCN-----GFYCDQFT----PNSNNKPKMWTENWS 255
L T PW + S A I+ T N D+ + ++ P M E W
Sbjct: 182 PLFTSDGPWPATLNAGSMADAGILTTGNFGSRADMNFDRLAAFNQAHGHDWPLMCMEFWD 241
Query: 256 GWFLSFGGAVPYRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNFDRTSGG--------PF 307
GWF +G + R E+ A + QRG N YM+HGGTNF +G P
Sbjct: 242 GWFNRWGEPIIRRDPEETAEDLRAVIQRGSV--NLYMFHGGTNFGFMNGTSARKDHDLPQ 299
Query: 308 ISTSYDYDAPLDEYG 322
+ TSYDYDAPL+E G
Sbjct: 300 V-TSYDYDAPLNEQG 313
Score = 45.8 bits (107), Expect = 0.100, Method: Compositional matrix adjust.
Identities = 59/250 (23%), Positives = 94/250 (37%), Gaps = 55/250 (22%)
Query: 495 EDGSKTVLHVQSLGHALHAFINGKLVGSGYGSSSNAKVTVDFPIALAPGKNTFDLLSLTV 554
+ G+ L V + AF +GK + + Y + + D + G++ DLL +
Sbjct: 398 DKGTPAKLRVIDARDRVQAFFDGKSLATQYQEA----IGDDILLPEVEGRHQLDLLVENM 453
Query: 555 GLQNYGAFYEKTGAGITGPVQLKGSGNGTNIDLSSQQWTYQTGL---KGEELNFPSGSST 611
NYG+ I Q KG G +DL + Q L K +L+F +
Sbjct: 454 SRVNYGS-------KIEAITQFKGIRTGVMVDLHFIKDYLQYPLDLNKAPQLDF----TG 502
Query: 612 QWDSKSTLPKLQPLVWYKTTFDAPAGSEPVAIDFTGMGKGEAWVNGQSIGRYWPTYVSQN 671
W + + +Y+ FD + +D G GKG VNG +IGR+W +
Sbjct: 503 DWQAGTP-------AFYQYGFDV-VKPQDTYLDCRGFGKGVMLVNGVNIGRFW-----EK 549
Query: 672 GGCTDSCNYRGAYSSNKCLKNCGKPSQSLYHVPRSWLKSSGNTLVLFEEIGGDPTKISFV 731
G P+ SLY VP L + N +++FE G I+ V
Sbjct: 550 G-----------------------PTLSLY-VPAGLLHTGHNEVIVFETEGQYAEAINLV 585
Query: 732 TKQLGSSLCS 741
+ L +
Sbjct: 586 DHPIFKELNT 595
>gi|319945941|ref|ZP_08020191.1| beta-galactosidase [Streptococcus australis ATCC 700641]
gi|417919516|ref|ZP_12563047.1| glycosyl hydrolase family 35 [Streptococcus australis ATCC 700641]
gi|319748006|gb|EFW00250.1| beta-galactosidase [Streptococcus australis ATCC 700641]
gi|342832897|gb|EGU67186.1| glycosyl hydrolase family 35 [Streptococcus australis ATCC 700641]
Length = 595
Score = 155 bits (393), Expect = 7e-35, Method: Compositional matrix adjust.
Identities = 118/375 (31%), Positives = 176/375 (46%), Gaps = 56/375 (14%)
Query: 43 LISGSIHYPRSTPEMWPDLIQKSKDGGLDVIETYVFWNLHEPVRNQYNFEGRYDLVKFVK 102
++SG+IHY R E W + K G + +ETYV WN HEP R ++FEG DL F++
Sbjct: 19 ILSGAIHYFRIDREDWYHSLYNLKALGFNTVETYVPWNAHEPQRGHFHFEGNLDLEHFIQ 78
Query: 103 LVAEAGLYAHLRIGPYVCAEWNFGGFPLWLHFIPGIQFRTDNEPFKAEMQRFTAKIVDMM 162
+ E LY LR P++C+EW FGG P WL ++ R+ + F E+ R+ +++ +
Sbjct: 79 VAQELDLYVILRPSPFICSEWEFGGLPAWL-IEKDLRIRSSDPAFLEEVARYYDELLPRV 137
Query: 163 KQEKLYASQGGPIILSQIENEYGNIDSAYGAAGKSYIKWAAGMALSLDTGVPWVMCQQSD 222
+ +L +GG I++ Q+ENEYG +YG K+Y++ + + D P SD
Sbjct: 138 AKYQL--DRGGNILMMQVENEYG----SYG-EDKAYLRAIRDLMIERDITCPLFT---SD 187
Query: 223 AP------------DPIINTCN-----GFYCDQ----FTPNSNNKPKMWTENWSGWFLSF 261
P D + T N + Q F + P M E W GWF +
Sbjct: 188 GPWRATLRAGTLIEDGLFVTGNFGSRANYNFSQMKEFFAEHDRKWPLMCMEFWDGWFNRW 247
Query: 262 GGAVPYRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNFDRTSGG--------PFISTSYD 313
+ R E+LA AV Q G N YM+HGGTNF +G P + TSYD
Sbjct: 248 KEPIIKRDPEELAEAVHEVLQEGSI--NLYMFHGGTNFGFMNGCSARGTVDLPQV-TSYD 304
Query: 314 YDAPLDEYGLIRQPKWGHLKDLHKAIKLCEAALVATDPTYPSLGPNLEATVYKTGSGLC- 372
YDA LDE G PK+ +K + K P YP P +++++ + L
Sbjct: 305 YDALLDEQG-NPTPKYDAVKKMMKTYY----------PEYPQSEPLVKSSLSERTLELTQ 353
Query: 373 -SAFLANIGTNSDVT 386
++ N+ + VT
Sbjct: 354 KTSLFGNLNEIAQVT 368
>gi|291530918|emb|CBK96503.1| Beta-galactosidase [Eubacterium siraeum 70/3]
Length = 579
Score = 155 bits (393), Expect = 7e-35, Method: Compositional matrix adjust.
Identities = 107/314 (34%), Positives = 149/314 (47%), Gaps = 36/314 (11%)
Query: 36 IGGKRRVLISGSIHYPRSTPEMWPDLIQKSKDGGLDVIETYVFWNLHEPVRNQYNFEGRY 95
+ GK +ISGSIHY R+ PE W D ++K + G + +ETY+ WN HE + +N++G +
Sbjct: 12 LDGKPFKVISGSIHYFRTVPEYWQDRLEKLVNIGCNTVETYIPWNFHETEKGNFNWDGMH 71
Query: 96 DLVKFVKLVAEAGLYAHLRIGPYVCAEWNFGGFPLWLHFIPGIQFRTDNEPFKAEMQRFT 155
D+ +F++L + GLY +R PY+C+EW FGG P WL ++ R +P+ + +
Sbjct: 72 DICRFIELADKLGLYMIIRPSPYICSEWEFGGLPAWLLKDRSMRLRCSYKPYLNAVDNYY 131
Query: 156 AKIVDMMKQEKLYASQGGPIILSQIENEYGNIDSAYGAAGKSYIKWAAGMALSLDTGVPW 215
+ V M K GG II+ QIENEYG Y SY+++ VP+
Sbjct: 132 S--VLMPKLAPYQIDNGGNIIMMQIENEYG-----YYGNDTSYLEFLRDTMRKYGITVPF 184
Query: 216 VMCQQSDAPDPIINTCNGFYCDQFTPNSN------------------NKPKMWTENWSGW 257
V SD P +G D P N KP M E W+GW
Sbjct: 185 V---TSDGPWSEFVFKSGM-VDGALPTGNFGSSAEWQLGEMRRFIGEGKPLMCMEFWNGW 240
Query: 258 FLSFGGAVPYRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNFDRTSG-----GPFISTSY 312
F +G E A + + G N+YM+ GGTNF SG I TSY
Sbjct: 241 FDVWGEEHNITAPEKAAQELDTLLKNGS--MNFYMFEGGTNFGFMSGKNNEKKTGIVTSY 298
Query: 313 DYDAPLDEYGLIRQ 326
DYDAPL E G I +
Sbjct: 299 DYDAPLTEDGRITE 312
>gi|71896501|ref|NP_001026163.1| beta-galactosidase precursor [Gallus gallus]
gi|53129216|emb|CAG31369.1| hypothetical protein RCJMB04_5i4 [Gallus gallus]
Length = 385
Score = 155 bits (393), Expect = 7e-35, Method: Compositional matrix adjust.
Identities = 113/322 (35%), Positives = 154/322 (47%), Gaps = 32/322 (9%)
Query: 22 SFGANVTYDHRAVVIGGKRRVLISGSIHYPRSTPEMWPDLIQKSKDGGLDVIETYVFWNL 81
+FG + YD V G ISGSIHY R W D + K K GL+ I+TYV WN
Sbjct: 24 TFG--IDYDCNCFVKDGHPFRYISGSIHYSRVPRYYWKDRLLKMKMAGLNAIQTYVPWNY 81
Query: 82 HEPVRNQYNFEGRYDLVKFVKLVAEAGLYAHLRIGPYVCAEWNFGGFPLWLHFIPGIQFR 141
HEP Y+F G DL F++L +E GL LR GPY+CAEW+ GG P WL I R
Sbjct: 82 HEPQMGVYDFSGDRDLEYFLQLASETGLLVILRAGPYICAEWDMGGLPAWLLEKESIVLR 141
Query: 142 TDNEPFKAEMQRFTAKIVDMMKQEKLYASQGGPIILSQIENEYGNIDSAYGAAGKSYIKW 201
+ + + ++++ ++ MK LY GGPII+ Q+ENEYG +Y A Y++
Sbjct: 142 SSDSDYLTAVEKWMGVLLPKMKPH-LY-HNGGPIIMVQVENEYG----SYFACDYDYLR- 194
Query: 202 AAGMALSLDTGVPWVMCQQSDAPDPIINTC---NGFYCD-QFTPNSN------------- 244
+ + + V+ +D C G Y F P N
Sbjct: 195 -SLLKIFRQHLGDEVVLFTTDGASQFHLKCGALQGLYATVDFAPGGNVTAAFLAQRSSEP 253
Query: 245 NKPKMWTENWSGWFLSFGGAVPYRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNFDRTSG 304
P + +E ++GW +G P E +A + RG N YM+ GGTNF +G
Sbjct: 254 TGPLVNSEFYTGWLDHWGHRHIVVPSETIAKTLNEILARGANV-NLYMFIGGTNFAYWNG 312
Query: 305 G--PFIS--TSYDYDAPLDEYG 322
P++S TSYDYDAPL E G
Sbjct: 313 ANMPYMSQPTSYDYDAPLSEAG 334
>gi|327260596|ref|XP_003215120.1| PREDICTED: beta-galactosidase-like [Anolis carolinensis]
Length = 679
Score = 155 bits (393), Expect = 7e-35, Method: Compositional matrix adjust.
Identities = 107/319 (33%), Positives = 152/319 (47%), Gaps = 29/319 (9%)
Query: 26 NVTYDHRAVVIGGKRRVLISGSIHYPRSTPEMWPDLIQKSKDGGLDVIETYVFWNLHEPV 85
++ Y + + G + ISGSIHY R W D + K GL+ ++ Y+ WN HEP+
Sbjct: 72 SIDYTDKCFLKDGVKFRYISGSIHYFRIPRAYWKDRLLKMYMSGLNAVQIYIPWNYHEPL 131
Query: 86 RNQYNFEGRYDLVKFVKLVAEAGLYAHLRIGPYVCAEWNFGGFPLWLHFIPGIQFRTDNE 145
YNF+G DL F+ L A L LR GPY+CAEW GG P WL P I RT +
Sbjct: 132 SGVYNFDGDRDLEGFLDLAANFDLLVILRPGPYICAEWEMGGIPSWLLAKPNIILRTSDP 191
Query: 146 PFKAEMQRFTAKIVDMMKQEKLYASQGGPIILSQIENEYGNIDSAYGAAGKSYIKWAAGM 205
F + ++ + ++ +K LY + GG II Q+ENEYG +Y A Y++ +
Sbjct: 192 DFLQAVDKWFSVLLPKIKPH-LYIN-GGNIISVQVENEYG----SYYACDYDYLRHLEAV 245
Query: 206 ALS-LDTGVPWVMCQQSDAPDPIINTCNGFYCD-QFTPNSN-------------NKPKMW 250
S L V + + + T +G Y F P N N P +
Sbjct: 246 FRSYLGKKVVLFTTDGTKESELLCGTLHGLYTTVDFGPEENVTEAFEKQRIHEPNGPLVN 305
Query: 251 TENWSGWFLSFGGAVPYRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNFDRTSGGPF--- 307
+E ++GW +G + ED+A + + + G N YM+ GGTNF SG +
Sbjct: 306 SEYYTGWLDYWGEPHSTKSAEDVARGLEKMLELGANV-NMYMFQGGTNFGYWSGADYNNG 364
Query: 308 ----ISTSYDYDAPLDEYG 322
I+TSYDYDAPL E G
Sbjct: 365 IYNPITTSYDYDAPLSEAG 383
Score = 39.7 bits (91), Expect = 7.6, Method: Compositional matrix adjust.
Identities = 52/202 (25%), Positives = 81/202 (40%), Gaps = 47/202 (23%)
Query: 545 NTFDLLSLTVGLQNYGAFYEKTGAGITGPVQLKGSGNGTNIDLSSQQWTYQTGLKGEELN 604
+T DLL ++G N+GA E G+ + L G+NI + Y G+
Sbjct: 513 DTLDLLVESMGHINFGA-NESDFKGLVKNLTL-----GSNI--VTDWLIYPLGID----- 559
Query: 605 FPSGSSTQWDSKSTLPKLQPLVWYKTTFDAPAGSEPVAIDFTGMGKGEAWVNGQSIGRYW 664
S W + L +Y F ++ + F G KG+ W+NG ++GR+W
Sbjct: 560 --KAVSHSWPPVAPLSNGTGPAFYTGFFTTLGIAQDSFVKFPGWNKGQIWINGFNLGRFW 617
Query: 665 PTYVSQNGGCTDSCNYRGAYSSNKCLKNCGKPSQSLYHVPRSWLKSSGNTLVLFEEIGGD 724
PT RG P Q+L+ VP S L SS V+ E+
Sbjct: 618 PT--------------RG-------------PQQTLF-VPGSILSSSTINTVVVLELQNA 649
Query: 725 PT--KISFVTKQL--GSSLCSH 742
P K+ F+ + L G+ + +H
Sbjct: 650 PEKPKLLFLDRPLLNGTQISTH 671
>gi|384209874|ref|YP_005595594.1| beta-galactosidase [Brachyspira intermedia PWS/A]
gi|343387524|gb|AEM23014.1| beta-galactosidase [Brachyspira intermedia PWS/A]
Length = 592
Score = 155 bits (393), Expect = 7e-35, Method: Compositional matrix adjust.
Identities = 104/309 (33%), Positives = 148/309 (47%), Gaps = 26/309 (8%)
Query: 35 VIGGKRRVLISGSIHYPRSTPEMWPDLIQKSKDGGLDVIETYVFWNLHEPVRNQYNFEGR 94
++ GK L+SG+IHY R E W D + K G + +ETY+ WN+HE ++F G
Sbjct: 11 ILNGKPIKLLSGAIHYFRFVEEYWEDCLYNLKAAGFNTVETYIPWNIHEIDEGVFDFSGN 70
Query: 95 YDLVKFVKLVAEAGLYAHLRIGPYVCAEWNFGGFPLWLHFIPGIQFRTDNEPFKAEMQRF 154
D+ F+KL + L LR PY+CAEW FGG P WL ++ RT+ E F +++ +
Sbjct: 71 KDIASFIKLAQKMDLLVILRPTPYICAEWEFGGLPAWLLRYDNMKVRTNTELFLSKVDAY 130
Query: 155 TAKIVDMMKQEKLYASQGGPIILSQIENEYGNI--DSAYGAAGKS-YIKWAAGMALSLDT 211
++ + L ++ GP+I+ QIENEYG+ D Y A K+ +K A + L
Sbjct: 131 YKELFKQIAD--LQITRNGPVIMMQIENEYGSFGNDKEYLKALKNLMVKHGAEVPLFTSD 188
Query: 212 GVPW--VMCQQSDAPDPIINTCN-------GFYCDQ--FTPNSNNKPKMWTENWSGWFLS 260
G W V+ + D I+ T N F + F P M E W GWF
Sbjct: 189 GA-WDAVLEAGTLVDDGILATVNFGSQAKESFDATEKFFERKGIKNPLMCMEFWDGWFNL 247
Query: 261 FGGAVPYRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNFDRTSGGPFIS-------TSYD 313
+ + R +D V +RG N YM+ GGTNF +G TSYD
Sbjct: 248 WKEPIIKRDADDFIMEVKEIIKRGSI--NLYMFIGGTNFGFYNGTSVTGYTDFPQITSYD 305
Query: 314 YDAPLDEYG 322
YDA L E+G
Sbjct: 306 YDAVLTEWG 314
Score = 46.6 bits (109), Expect = 0.059, Method: Compositional matrix adjust.
Identities = 43/165 (26%), Positives = 76/165 (46%), Gaps = 22/165 (13%)
Query: 502 LHVQSLGHA--LHAFINGKLVGSGYGSSSNAKVTVDFPIALAPGKNTFDLLSLTVGLQNY 559
++V+++G + +H ++NG+ G Y + + F G N +LL VG NY
Sbjct: 400 MNVRAVGASDRVHFYLNGEYKGVKYQDELIEPIEMHF----NNGDNVLELLVENVGRVNY 455
Query: 560 GAFYEKTGAGITGPVQLKGSGNGTNIDLSSQQWTYQTGLKGEELNFPSGSSTQWDSKSTL 619
G ++ Q+KG G D+ ++TG E+ P + D S
Sbjct: 456 GYKLQECS-------QVKGIRIGVMADIH-----FETGW--EQYALPLDNIKDVDFSSKW 501
Query: 620 PKLQPLVWYKTTFDAPAGSEPVAIDFTGMGKGEAWVNGQSIGRYW 664
+ P +Y+ FD ++ +D + +GKG A++NG ++GRYW
Sbjct: 502 IENTP-SFYRYEFDVKEPADTF-LDCSKLGKGAAFINGFNLGRYW 544
>gi|417988603|ref|ZP_12629136.1| beta-galactosidase 3 [Lactobacillus casei A2-362]
gi|417997907|ref|ZP_12638140.1| beta-galactosidase 3 [Lactobacillus casei T71499]
gi|418015108|ref|ZP_12654689.1| beta-galactosidase 3 [Lactobacillus casei Lpc-37]
gi|410541233|gb|EKQ15720.1| beta-galactosidase 3 [Lactobacillus casei A2-362]
gi|410542248|gb|EKQ16704.1| beta-galactosidase 3 [Lactobacillus casei T71499]
gi|410552187|gb|EKQ26219.1| beta-galactosidase 3 [Lactobacillus casei Lpc-37]
Length = 598
Score = 155 bits (392), Expect = 8e-35, Method: Compositional matrix adjust.
Identities = 110/315 (34%), Positives = 159/315 (50%), Gaps = 30/315 (9%)
Query: 30 DHRAVVIGGKRRVLISGSIHYPRSTPEMWPDLIQKSKDGGLDVIETYVFWNLHEPVRNQY 89
DH ++ G ++L SG+IHY R P W + K G + +ETYV WNLHE +
Sbjct: 7 DHEFMLDGQPFKIL-SGAIHYFRVHPSDWYHSLYNLKALGFNTVETYVPWNLHEYSEGDF 65
Query: 90 NFEGRYDLVKFVKLVAEAGLYAHLRIGPYVCAEWNFGGFPLWLHFIPGIQFRTDNEPFKA 149
+F G D+ +F+ + GLYA +R PY+CAEW FGGFP WL ++ RTD+ +
Sbjct: 66 DFSGILDIERFLNTAKDLGLYAIVRPSPYICAEWEFGGFPAWL-LTKKMRLRTDDPAYLQ 124
Query: 150 EMQRFTAKIVDMMKQEKLYASQGGPIILSQIENEYGNI--DSAYGAAGKSYIKWAAGMAL 207
+ R+ ++ + ++ + GG +I+ Q+ENEYG+ D Y AA +K G+ +
Sbjct: 125 AIDRYYTALMPHLVGHQV--THGGNVIMMQVENEYGSYGEDKDYLAAVAELMK-KHGVDV 181
Query: 208 SLDTGV-PW--VMCQQSDAPDPIINTCN-----GFYCDQFT----PNSNNKPKMWTENWS 255
L T PW + S A I+ T N D+ + ++ P M E W
Sbjct: 182 PLFTSDGPWPATLNAGSMADAGILTTGNFGSRADMNFDRLAAFNQAHGHDWPLMCMEFWD 241
Query: 256 GWFLSFGGAVPYRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNFDRTSGG--------PF 307
GWF +G + R E+ A + QRG N YM+HGGTNF +G P
Sbjct: 242 GWFNRWGEPIIRRDPEETAENLRAVIQRGSV--NLYMFHGGTNFGFMNGTSARKDHDLPQ 299
Query: 308 ISTSYDYDAPLDEYG 322
+ TSYDYDAPL+E G
Sbjct: 300 V-TSYDYDAPLNEQG 313
Score = 45.8 bits (107), Expect = 0.092, Method: Compositional matrix adjust.
Identities = 59/250 (23%), Positives = 94/250 (37%), Gaps = 55/250 (22%)
Query: 495 EDGSKTVLHVQSLGHALHAFINGKLVGSGYGSSSNAKVTVDFPIALAPGKNTFDLLSLTV 554
+ G+ L V + AF +GK + + Y + + D + G++ DLL +
Sbjct: 398 DKGTPAKLRVIDARDRVQAFFDGKSLATQYQEA----IGDDILLPEVEGRHQLDLLVENM 453
Query: 555 GLQNYGAFYEKTGAGITGPVQLKGSGNGTNIDLSSQQWTYQTGL---KGEELNFPSGSST 611
NYG+ I Q KG G +DL + Q L K +L+F +
Sbjct: 454 SRVNYGS-------KIEAITQFKGIRTGVMVDLHFIKDYLQYPLDLNKAPQLDF----TG 502
Query: 612 QWDSKSTLPKLQPLVWYKTTFDAPAGSEPVAIDFTGMGKGEAWVNGQSIGRYWPTYVSQN 671
W + + +Y+ FD + +D G GKG VNG +IGR+W +
Sbjct: 503 DWQAGTP-------AFYQYGFDV-VKPQDTYLDCRGFGKGVMLVNGVNIGRFW-----EK 549
Query: 672 GGCTDSCNYRGAYSSNKCLKNCGKPSQSLYHVPRSWLKSSGNTLVLFEEIGGDPTKISFV 731
G P+ SLY VP L + N +++FE G I+ V
Sbjct: 550 G-----------------------PTLSLY-VPAGLLHTGHNEVIVFETEGQYAEAINLV 585
Query: 732 TKQLGSSLCS 741
+ L +
Sbjct: 586 DHPIFKELNT 595
>gi|417994975|ref|ZP_12635282.1| beta-galactosidase 3 [Lactobacillus casei M36]
gi|410539221|gb|EKQ13758.1| beta-galactosidase 3 [Lactobacillus casei M36]
Length = 598
Score = 155 bits (392), Expect = 8e-35, Method: Compositional matrix adjust.
Identities = 110/315 (34%), Positives = 159/315 (50%), Gaps = 30/315 (9%)
Query: 30 DHRAVVIGGKRRVLISGSIHYPRSTPEMWPDLIQKSKDGGLDVIETYVFWNLHEPVRNQY 89
DH ++ G ++L SG+IHY R P W + K G + +ETYV WNLHE +
Sbjct: 7 DHEFMLDGQPFKIL-SGAIHYFRVHPSDWYHSLYNLKALGFNTVETYVPWNLHEYSEGDF 65
Query: 90 NFEGRYDLVKFVKLVAEAGLYAHLRIGPYVCAEWNFGGFPLWLHFIPGIQFRTDNEPFKA 149
+F G D+ +F+ + GLYA +R PY+CAEW FGGFP WL ++ RTD+ +
Sbjct: 66 DFSGILDIERFLNTAKDLGLYAIVRPSPYICAEWEFGGFPAWL-LTKKMRLRTDDPAYLQ 124
Query: 150 EMQRFTAKIVDMMKQEKLYASQGGPIILSQIENEYGNI--DSAYGAAGKSYIKWAAGMAL 207
+ R+ ++ + ++ + GG +I+ Q+ENEYG+ D Y AA +K G+ +
Sbjct: 125 AIDRYYTALMPHLVGHQV--THGGNVIMMQVENEYGSYGEDKDYLAAVAELMK-KHGVDV 181
Query: 208 SLDTGV-PW--VMCQQSDAPDPIINTCN-----GFYCDQFT----PNSNNKPKMWTENWS 255
L T PW + S A I+ T N D+ + ++ P M E W
Sbjct: 182 PLFTSDGPWPATLNAGSMADAGILTTGNFGSRADMNFDRLAAFNQAHGHDWPLMCMEFWD 241
Query: 256 GWFLSFGGAVPYRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNFDRTSGG--------PF 307
GWF +G + R E+ A + QRG N YM+HGGTNF +G P
Sbjct: 242 GWFNRWGEPIIRRDPEETAENLRAVIQRGSV--NLYMFHGGTNFGFMNGTSARKDHDLPQ 299
Query: 308 ISTSYDYDAPLDEYG 322
+ TSYDYDAPL+E G
Sbjct: 300 V-TSYDYDAPLNEQG 313
Score = 45.8 bits (107), Expect = 0.091, Method: Compositional matrix adjust.
Identities = 59/250 (23%), Positives = 94/250 (37%), Gaps = 55/250 (22%)
Query: 495 EDGSKTVLHVQSLGHALHAFINGKLVGSGYGSSSNAKVTVDFPIALAPGKNTFDLLSLTV 554
+ G+ L V + AF +GK + + Y + + D + G++ DLL +
Sbjct: 398 DKGTPAKLRVIDARDRVQAFFDGKSLATQYQEA----IGDDILLPEVEGRHQLDLLVENM 453
Query: 555 GLQNYGAFYEKTGAGITGPVQLKGSGNGTNIDLSSQQWTYQTGL---KGEELNFPSGSST 611
NYG+ I Q KG G +DL + Q L K +L+F +
Sbjct: 454 SRVNYGS-------KIEAITQFKGIRTGVMVDLHFIKDYLQYPLDLNKAPQLDF----TG 502
Query: 612 QWDSKSTLPKLQPLVWYKTTFDAPAGSEPVAIDFTGMGKGEAWVNGQSIGRYWPTYVSQN 671
W + + +Y+ FD + +D G GKG VNG +IGR+W +
Sbjct: 503 DWQAGTP-------AFYQYGFDV-VKPQDTYLDCRGFGKGVMLVNGVNIGRFW-----EK 549
Query: 672 GGCTDSCNYRGAYSSNKCLKNCGKPSQSLYHVPRSWLKSSGNTLVLFEEIGGDPTKISFV 731
G P+ SLY VP L + N +++FE G I+ V
Sbjct: 550 G-----------------------PTLSLY-VPAGLLHTGHNEVIVFETEGQYAEAINLV 585
Query: 732 TKQLGSSLCS 741
+ L +
Sbjct: 586 DHPIFKELNT 595
>gi|395541292|ref|XP_003772579.1| PREDICTED: beta-galactosidase [Sarcophilus harrisii]
Length = 673
Score = 155 bits (392), Expect = 8e-35, Method: Compositional matrix adjust.
Identities = 114/347 (32%), Positives = 167/347 (48%), Gaps = 28/347 (8%)
Query: 27 VTYDHRAVVIGGKRRVLISGSIHYPRSTPEMWPDLIQKSKDGGLDVIETYVFWNLHEPVR 86
+ Y+ + GK ISGSIHY R W D + K K GL+ IETYV WN HEP
Sbjct: 63 IDYEGDQFLKDGKPFRYISGSIHYSRIPRFYWKDRLFKMKMAGLNAIETYVPWNFHEPFP 122
Query: 87 NQYNFEGRYDLVKFVKLVAEAGLYAHLRIGPYVCAEWNFGGFPLWLHFIPGIQFRTDNEP 146
QY F G DL F++LV E GL LR GPY+CAEW+ GG P+WL I R+ +
Sbjct: 123 GQYQFSGEQDLEYFLQLVHEVGLLVILRPGPYICAEWDMGGLPVWLLEKKSIFLRSSDPD 182
Query: 147 FKAEMQRFTAKIVDMMKQEKLYASQGGPIILSQIENEYGNIDSAYGAAGKSYIKWAAGMA 206
+ + ++ ++ MK LY + GGPII Q+ENEYG +Y A +Y+++ +
Sbjct: 183 YLKAVDKWLEVLLPKMK-PYLYQN-GGPIITVQVENEYG----SYFACDYNYLRFLLKV- 235
Query: 207 LSLDTGVPWVMCQQSDAPDPIIN--TCNGFYCD-QFTPNSN-------------NKPKMW 250
G V+ A + + T Y F +SN P +
Sbjct: 236 FRQHLGEEVVLFTTDGAGENYLKCGTLQDLYATVDFGTSSNITQAFMIQRKVEPKGPLVN 295
Query: 251 TENWSGWFLSFGGAVPYRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNFDRTSGG--PFI 308
+E ++GW +G + +++ ++ RG N YM+ GGTNF +G P++
Sbjct: 296 SEFYTGWLDHWGESHQTVSTKNIVASLTDMLSRGANV-NLYMFIGGTNFGFWNGANMPYL 354
Query: 309 --STSYDYDAPLDEYGLIRQPKWGHLKDLHKAIKLCEAALVATDPTY 353
TSYDYDAPL E G + + + + + K KL E + + P +
Sbjct: 355 PQPTSYDYDAPLSEAGDLTEKYYAVREAIGKFEKLPEGPIPPSTPKF 401
>gi|423294349|ref|ZP_17272476.1| hypothetical protein HMPREF1070_01141 [Bacteroides ovatus
CL03T12C18]
gi|392675540|gb|EIY68981.1| hypothetical protein HMPREF1070_01141 [Bacteroides ovatus
CL03T12C18]
Length = 778
Score = 155 bits (392), Expect = 8e-35, Method: Compositional matrix adjust.
Identities = 120/362 (33%), Positives = 176/362 (48%), Gaps = 39/362 (10%)
Query: 1 MASKEILLLVLCWGFVVLATTSFGANVTYDH-----RAVVIGGKRRVLISGSIHYPRSTP 55
M ++ I LLVL F V+ ++ A T ++ GK V+ + +HY R
Sbjct: 1 MKNRLIALLVL---FTVIFFSTAQAQTTARKFEAGKNTFLLDGKPFVVKAAELHYTRIPQ 57
Query: 56 EMWPDLIQKSKDGGLDVIETYVFWNLHEPVRNQYNFEGRYDLVKFVKLVAEAGLYAHLRI 115
W I+ K G++ I Y+FWN+HE +++F G+ D+ F + + G+Y +R
Sbjct: 58 AYWEHRIEMCKALGMNTICIYIFWNIHEQEEGKFDFSGQNDIAAFCRAAQKHGMYVIVRP 117
Query: 116 GPYVCAEWNFGGFPLWLHFIPGIQFRTDNEPFKAEMQRFTAKIVDMMKQ-EKLYASQGGP 174
GPYVCAEW GG P WL I RT + + M+R + ++ KQ L ++GG
Sbjct: 118 GPYVCAEWEMGGLPWWLLKKKDIALRTLDPYY---MERVGIFMKEVGKQLAPLQVNKGGN 174
Query: 175 IILSQIENEYGN--IDSAYGAAGKSYIKWAAGMALSLDTGVPWVMCQQS-----DAPDPI 227
II+ Q+ENEYG+ ID Y +A + ++ +G T VP C S +A D +
Sbjct: 175 IIMVQVENEYGSYGIDKPYVSAVRDLVR-ESGF-----TDVPLFQCDWSSNFTNNALDDL 228
Query: 228 INTCN---GFYCD-QFTPNSNNKPK---MWTENWSGWFLSFGGAVPYRPVEDLAFAVARF 280
I T N G D QF +P+ M +E WSGWF +G RP +D+ +
Sbjct: 229 IWTVNFGTGANIDQQFKKLKELRPETPLMCSEFWSGWFDHWGRKHETRPAKDMVQGIKDM 288
Query: 281 FQRGGTFQNYYMYHGGTNFDRTSGG-----PFISTSYDYDAPLDEYGLIRQPKWGHLKDL 335
R +F + YM HGGT F G + +SYDYDAP+ E G K+ L+DL
Sbjct: 289 LDRNISF-SLYMTHGGTTFGHWGGANNPAYSAMCSSYDYDAPISEPGWTTD-KFFLLRDL 346
Query: 336 HK 337
K
Sbjct: 347 LK 348
Score = 47.4 bits (111), Expect = 0.032, Method: Compositional matrix adjust.
Identities = 61/251 (24%), Positives = 98/251 (39%), Gaps = 51/251 (20%)
Query: 500 TVLHVQSLGHALHAFINGKLVGSGYGSSSNAKVTVDFPIALAPGKNTFDLLSLTVGLQNY 559
T L + + + +GKL+ + T P AL G D+L +G N+
Sbjct: 420 TTLKITEVHDWAQIYADGKLLARL--DRRKGEFTTTLP-ALKKG-TQLDILVEAMGRVNF 475
Query: 560 G-AFYEKTGAGITGPVQLKGSGNGTNIDLSSQQWTYQTGLKGEELNFPSGSSTQWDSKST 618
+ +++ GIT V+L SGN T + WT NFP S D K +
Sbjct: 476 DKSIHDR--KGITEKVELI-SGNQTK---ELKNWTV--------YNFPVDYSFIKDKKYS 521
Query: 619 LPKLQPLV--WYKTTFDAPAGSEPVAIDFTGMGKGEAWVNGQSIGRYWPTYVSQNGGCTD 676
K+ P + +YK+TF + +D + GKG WVNG ++GR+W
Sbjct: 522 DTKILPTMPAYYKSTFTLDKVGDTF-LDMSTWGKGMVWVNGHAMGRFWEI---------- 570
Query: 677 SCNYRGAYSSNKCLKNCGKPSQSLYHVPRSWLKSSGNTLVLFEEIGGDPTKISFVTKQLG 736
P Q+L+ +P WLK N +++ + G I + K +
Sbjct: 571 ------------------GPQQTLF-MPGCWLKEGENEILVLDLKGPTRASIKGLKKPIL 611
Query: 737 SSLCSHVTDSH 747
L ++H
Sbjct: 612 DVLREKAPETH 622
>gi|389856131|ref|YP_006358374.1| beta-galactosidase [Streptococcus suis ST1]
gi|353739849|gb|AER20856.1| Beta-galactosidase [Streptococcus suis ST1]
Length = 590
Score = 155 bits (392), Expect = 8e-35, Method: Compositional matrix adjust.
Identities = 104/306 (33%), Positives = 155/306 (50%), Gaps = 37/306 (12%)
Query: 43 LISGSIHYPRSTPEMWPDLIQKSKDGGLDVIETYVFWNLHEPVRNQYNFEGRYDLVKFVK 102
++SG+IHY R P+ W + K G + +ETYV WN+HEP + ++ +EG D+ +F+K
Sbjct: 19 ILSGAIHYFRVHPDDWYHSLYNLKALGFNTVETYVPWNMHEPRKGEFCYEGILDIERFLK 78
Query: 103 LVAEAGLYAHLRIGPYVCAEWNFGGFPLWLHFIPGIQFRTDNEPFKAEMQRFTAKIVDMM 162
L E GLYA +R PY+CAEW +GG P WL ++ R+ + + + + A ++
Sbjct: 79 LAQELGLYAIVRPSPYICAEWEWGGLPAWL-MKEELRVRSSDSVYLQHLDEYYASLIP-- 135
Query: 163 KQEKLYASQGGPIILSQIENEYGNIDSAYGAAGKSYIKWAAGMALSLDTGVP-------W 215
K KL +QGG +++ Q+ENEYG +YG K+Y++ AG+ P W
Sbjct: 136 KLAKLQLAQGGNVLMFQVENEYG----SYGEE-KAYLRAVAGLMRKHGLTAPLFTSDGSW 190
Query: 216 VMCQQSDA--PDPIINTCN-GFYCDQ--------FTPNSNNKPKMWTENWSGWFLSFGGA 264
++ D + T N G + F + N P M E W GWF +G
Sbjct: 191 RATLRAGTLIEDDVFVTGNFGSKARENFANMTAFFNEHQKNWPLMCMEFWDGWFNRWGDE 250
Query: 265 VPYRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNFDRTSGG--------PFISTSYDYDA 316
+ R E++ +V + G N YM+HGGTNF +G P + TSYDYDA
Sbjct: 251 IIRREPEEMVDSVMECIELGSL--NLYMFHGGTNFGFMNGCSARGQIDLPQV-TSYDYDA 307
Query: 317 PLDEYG 322
LDE G
Sbjct: 308 ILDEAG 313
Score = 60.8 bits (146), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 60/226 (26%), Positives = 89/226 (39%), Gaps = 55/226 (24%)
Query: 511 LHAFINGKLVGSGYGSSSNAKVTVDFPIALAPGKNTFDLLSLTVGLQNYGAFYEKTGAGI 570
+ + +GK V + Y + V +DF K T D+L +G NYG +
Sbjct: 411 IQIYADGKFVATQYQTEIGDDVELDFK----DDKLTLDILVENMGRVNYGH-------KL 459
Query: 571 TGPVQLKGSGNGTNIDLS-SQQW-TYQTGLKG-EELNFPSGSSTQWDSKSTLPKLQPLVW 627
T P Q KG G G DL W TY L+ E+L+F G W+ +
Sbjct: 460 TAPTQSKGLGRGAMADLHFIGHWATYPLHLESVEDLDFSKG----WEEGQA-------AF 508
Query: 628 YKTTFDAPAGSEPVAIDFTGMGKGEAWVNGQSIGRYWPTYVSQNGGCTDSCNYRGAYSSN 687
Y+ F+ ++ +D TG GKG +VN +IGR+W
Sbjct: 509 YRYQFELDELAD-TYLDMTGFGKGVVFVNNVNIGRFWEK--------------------- 546
Query: 688 KCLKNCGKPSQSLYHVPRSWLKSSGNTLVLFEEIGGDPTKISFVTK 733
P LY +P+ +LK N +++FE G KI F +
Sbjct: 547 -------GPILYLY-IPKGYLKKGANEIIVFETEGKYREKIHFSQR 584
>gi|422700666|ref|ZP_16758509.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX1342]
gi|315170851|gb|EFU14868.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX1342]
Length = 593
Score = 155 bits (392), Expect = 8e-35, Method: Compositional matrix adjust.
Identities = 113/351 (32%), Positives = 162/351 (46%), Gaps = 44/351 (12%)
Query: 35 VIGGKRRVLISGSIHYPRSTPEMWPDLIQKSKDGGLDVIETYVFWNLHEPVRNQYNFEGR 94
++ G+ +ISG+IHY R TP W D + K G + +ETY+ WN+HEP Y+FEG
Sbjct: 12 LLNGQPIKIISGAIHYFRMTPRQWEDSLYNLKALGANTVETYIPWNIHEPEEGVYDFEGM 71
Query: 95 YDLVKFVKLVAEAGLYAHLRIGPYVCAEWNFGGFPLWLHFIPGIQFRTDNEPFKAEMQRF 154
++ FV+L + L LR Y+CAEW FGG P WL G++ R+ + F +++ +
Sbjct: 72 KNIEAFVRLAEKLNLLVILRPSAYICAEWEFGGLPAWLLKEKGVRLRSTDPIFMTKVRNY 131
Query: 155 TAKIVDMMKQEKLYASQGGPIILSQIENEYGNIDSAYGAAGKSYIKWAAGMALSLDTGVP 214
V + K + +QGGP+I+ Q+ENEYG +YG K+Y++ + L VP
Sbjct: 132 FQ--VLLPKLAPMQITQGGPVIMMQVENEYG----SYGME-KAYLQQTKQIMEELGIEVP 184
Query: 215 WVMCQQSDAPDPIINTCNGFYCDQF--------------------TPNSNNKPKMWTENW 254
+ A + +++ D F T + P M E W
Sbjct: 185 --LFTSDGAWEEVLDAGTLIEEDVFVTGNFGSHSKENAAVLKKFMTRHGKKWPLMCMEYW 242
Query: 255 SGWFLSFGGAVPYRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNFDRTSGG--------P 306
GWF +G V R DLA V G N YM+HGGTNF +G P
Sbjct: 243 DGWFNRWGEPVIQREGTDLAKEVKDMLAVGSL--NLYMFHGGTNFGFYNGCSARGAKDLP 300
Query: 307 FISTSYDYDAPLDEYGLIRQPKWGHLKDLHKAIKLCEAALVATDPTYPSLG 357
+ TSYDYDA L E G + + + KAIK + P LG
Sbjct: 301 QV-TSYDYDALLTEAGEPTEKYYA----VQKAIKEVCPEVWQAQPRTKKLG 346
Score = 54.3 bits (129), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 56/232 (24%), Positives = 87/232 (37%), Gaps = 45/232 (19%)
Query: 502 LHVQSLGHALHAFINGKLVGSGYGSSSNAKVTVDFPIALAPGKNTFDLLSLTVGLQNYGA 561
L V LH +++G L + Y + ++ L G+ D L+L + ++N G
Sbjct: 403 LKVVEASDRLHIYVDGDLAATQYQETVGEEL-------LISGQTEKDTLALDILVENLGR 455
Query: 562 FYEKTGAGITGPVQLKGSGNGTNIDLSSQQWTYQTGLKGEELNFPSGSSTQWDSKSTLPK 621
G + P Q KG G D+ Q G + L F + D +
Sbjct: 456 V--NYGFKLNNPTQSKGIRGGVMQDIHFHQ-----GYQHYPLTFSQEQLAKIDYTAGKNP 508
Query: 622 LQPLVWYKTTFDAPAGSEPVAIDFTGMGKGEAWVNGQSIGRYWPTYVSQNGGCTDSCNYR 681
LQP +Y+ TF+ ++ ID G GKG VNG +GRYW
Sbjct: 509 LQP-SFYQVTFELEQLAD-TYIDCRGYGKGFVVVNGHHLGRYWEI--------------- 551
Query: 682 GAYSSNKCLKNCGKPSQSLYHVPRSWLKSSGNTLVLFEEIGGDPTKISFVTK 733
P SLY P+ + + N +V+FE G + + F +
Sbjct: 552 -------------GPIHSLY-CPKEFFQQGQNEVVIFETEGIEIEYLKFTNQ 589
>gi|255692586|ref|ZP_05416261.1| beta-galactosidase [Bacteroides finegoldii DSM 17565]
gi|260621643|gb|EEX44514.1| glycosyl hydrolase family 35 [Bacteroides finegoldii DSM 17565]
Length = 779
Score = 155 bits (392), Expect = 9e-35, Method: Compositional matrix adjust.
Identities = 126/386 (32%), Positives = 184/386 (47%), Gaps = 42/386 (10%)
Query: 1 MASKEILLLVLCWGFVVLATTSFGANVTYDH-----RAVVIGGKRRVLISGSIHYPRSTP 55
M ++ I LLVL V +S A T ++ GK V+ + +HY R
Sbjct: 1 MKNRLIALLVLF--TVTFFVSSAQAQTTARKFEAGKNTFLLDGKPFVVKAAELHYTRIPQ 58
Query: 56 EMWPDLIQKSKDGGLDVIETYVFWNLHEPVRNQYNFEGRYDLVKFVKLVAEAGLYAHLRI 115
W I+ K G++ I Y+FWN+HE +++F G+ D+ F + + G+Y +R
Sbjct: 59 AYWEHRIEMCKALGMNTICIYIFWNIHEQEEGKFDFTGQNDIAAFCRAAQKHGMYVIVRP 118
Query: 116 GPYVCAEWNFGGFPLWLHFIPGIQFRTDNEPFKAEMQRFTAKIVDMMKQ-EKLYASQGGP 174
GPYVCAEW GG P WL I RT + + M+R + ++ KQ L ++GG
Sbjct: 119 GPYVCAEWEMGGLPWWLLKKRDIALRTLDPYY---MERVGIFMKEVGKQLAPLQVNKGGN 175
Query: 175 IILSQIENEYGN--IDSAYGAAGKSYIKWAAGMALSLDTGVPWVMCQQS-----DAPDPI 227
II+ Q+ENEYG+ I+ Y +A + ++ +G T VP C S +A D +
Sbjct: 176 IIMVQVENEYGSYGINKPYVSAVRDLVR-ESGF-----TDVPLFQCDWSSNFTNNALDDL 229
Query: 228 INTCN---GFYCD-QFTPNSNNKPK---MWTENWSGWFLSFGGAVPYRPVEDLAFAVARF 280
I T N G D QF +P+ M +E WSGWF +G RP +D+ +
Sbjct: 230 IWTVNFGTGANIDQQFKKLKELRPETPLMCSEFWSGWFDHWGRKHETRPAKDMVQGIKDM 289
Query: 281 FQRGGTFQNYYMYHGGTNFDRTSGG-----PFISTSYDYDAPLDEYGLIRQPKWGHLKDL 335
R +F + YM HGGT F G + +SYDYDAP+ E G + K+ L+DL
Sbjct: 290 LDRNISF-SLYMTHGGTTFGHWGGANNPAYSAMCSSYDYDAPISEAGWTTE-KYFLLRDL 347
Query: 336 HKAIKLCEAALVATDPTYPSLGPNLE 361
K AAL P P+ P +E
Sbjct: 348 LKNYLPAGAAL----PEVPAALPVIE 369
Score = 47.4 bits (111), Expect = 0.038, Method: Compositional matrix adjust.
Identities = 62/250 (24%), Positives = 92/250 (36%), Gaps = 49/250 (19%)
Query: 500 TVLHVQSLGHALHAFINGKLVGSGYGSSSNAKVTVDFPIALAPGKNTFDLLSLTVGLQNY 559
TVL + + + +GKL+ + T PI L G D+L +G N+
Sbjct: 421 TVLKITEVHDWAQVYADGKLLARL--DRRKGEFTTTLPI-LKKG-TQLDILIEAMGRVNF 476
Query: 560 GAFYEKTGAGITGPVQLKGSGNGTNIDLSSQQWTYQTGLKGEELNFPSGSSTQWDSKSTL 619
GIT V+L SGN T + WT NFP S D K
Sbjct: 477 DKSIHDR-KGITEKVELI-SGNQTK---ELKNWTV--------YNFPVDYSFIKDKKYNE 523
Query: 620 PKLQPLV--WYKTTFDAPAGSEPVAIDFTGMGKGEAWVNGQSIGRYWPTYVSQNGGCTDS 677
K P + +YK TF + +D + GKG WVNG ++GR+W
Sbjct: 524 TKQLPTMPAYYKGTFKLDKVGDTF-LDMSTWGKGMVWVNGHAMGRFWEI----------- 571
Query: 678 CNYRGAYSSNKCLKNCGKPSQSLYHVPRSWLKSSGNTLVLFEEIGGDPTKISFVTKQLGS 737
P Q+L+ +P WLK N +++ + G I + K +
Sbjct: 572 -----------------GPQQTLF-MPGCWLKKGENEILVLDLKGPAKASIKGLKKPILD 613
Query: 738 SLCSHVTDSH 747
L ++H
Sbjct: 614 VLREKAPETH 623
>gi|384248639|gb|EIE22122.1| hypothetical protein COCSUDRAFT_1093, partial [Coccomyxa
subellipsoidea C-169]
Length = 632
Score = 155 bits (392), Expect = 9e-35, Method: Compositional matrix adjust.
Identities = 111/332 (33%), Positives = 160/332 (48%), Gaps = 45/332 (13%)
Query: 36 IGGKRRVLISGSIHYPRSTPEMWPDLIQKSKDGGLDVIETYVFWNLHEPVRNQYNFEGRY 95
+ GK +ISGS+HY R P W D + ++K GL+ + YV WNLHEP QYN++G
Sbjct: 28 MDGKPFRIISGSLHYHRIHPAQWKDRMLRTKALGLNTLSVYVPWNLHEPFPGQYNWDGFA 87
Query: 96 DLVKFVKLVAEAGLYAHLRIGPYVCAEWNFGGFPLWLHFIPG---------IQFRTDNEP 146
DL ++ L E GLY LR GPY+CAEW+FGGFP WL + R+D+
Sbjct: 88 DLEAYLALAQEQGLYVLLRPGPYICAEWDFGGFPWWLASSKAGLCSTSSHSVTLRSDDPA 147
Query: 147 FKAEMQRFTAKIVDMMKQEKLYASQGGPIILSQIENEYGNIDSAYGAAGKSYIKWAAGM- 205
+ + R+ V + K + S+GG I++ Q+ENE+G + + Y++ G
Sbjct: 148 YLELVDRWWK--VLLPKIGRFLYSRGGNILMVQVENEFGFV-----GPNEKYMRHLVGTV 200
Query: 206 -------ALSLDTGVPWVMCQQSDAPDPIINTC---------NGFYCDQFTPNSNNK-PK 248
AL T P + + + D +++ N + Q N+ K P
Sbjct: 201 RASLGDDALIYTTDPPPNIAKGTLPGDEVLSVVDFGAGWFDLNWAFSQQRAMNAPGKSPP 260
Query: 249 MWTENWSGWFLSFGGAVPYRPVE---DLAFAVARFFQRGGTFQNYYMYHGGTNFDRTSGG 305
M +E ++GW +G + V+ D V F G+ N YM HGGTNF T+GG
Sbjct: 261 MCSEFYTGWLTRWGEKMANTSVDQFLDTLHGVLGFANNTGSV-NLYMVHGGTNFGFTAGG 319
Query: 306 PFIS-------TSYDYDAPLDEYGLIRQPKWG 330
+ TSYDYDAP+ E G QP G
Sbjct: 320 SIDNGVYWACITSYDYDAPISEAGDTGQPGIG 351
>gi|311264379|ref|XP_003130137.1| PREDICTED: galactosidase, beta 1-like 2 [Sus scrofa]
Length = 635
Score = 155 bits (392), Expect = 9e-35, Method: Compositional matrix adjust.
Identities = 110/319 (34%), Positives = 150/319 (47%), Gaps = 35/319 (10%)
Query: 43 LISGSIHYPRSTPEMWPDLIQKSKDGGLDVIETYVFWNLHEPVRNQYNFEGRYDLVKFVK 102
+ GS+HY R W D + K K GL+ + TYV WNLHEP R +++F G D+ F+
Sbjct: 62 IFGGSVHYFRVPRAYWRDRLLKMKACGLNTLTTYVPWNLHEPERGKFDFSGNLDMEAFIL 121
Query: 103 LVAEAGLYAHLRIGPYVCAEWNFGGFPLWLHFIPGIQFRTDNEPFKAEMQRFTAKIVDMM 162
L AE GL+ LR GPY+C+E + GG P WL ++ RT E F + + + M
Sbjct: 122 LAAEVGLWVILRPGPYICSEIDLGGLPSWLLQDSSMKLRTTYEGFTKAVDLYFDHL--MA 179
Query: 163 KQEKLYASQGGPIILSQIENEYG--NIDSAYGAAGKSYIKWAAGMALSLDTGVPWVMCQQ 220
+ L GGPII Q+ENEYG N D AY YIK A D G+ ++
Sbjct: 180 RVVPLQYKNGGPIIAVQVENEYGSYNKDPAY----MPYIKKALE-----DRGIVELLLTS 230
Query: 221 SDAPDPIINTCNGFYCDQFTPNSNN--------------KPKMWTENWSGWFLSFGGAVP 266
+ T +G + N +PKM E W+GWF S+GG
Sbjct: 231 DNEDGLSKGTVDGVLATINLQSQNELRLLHNFLQSVQGVRPKMVMEYWTGWFDSWGGPHH 290
Query: 267 YRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNFDRTSGGPFIS------TSYDYDAPLDE 320
++ V+ G + N YM+HGGTNF +G TSYDYDA L E
Sbjct: 291 ILDTSEVLRTVSAIIDAGASI-NLYMFHGGTNFGFINGAMHFQDYMSDVTSYDYDAVLTE 349
Query: 321 YGLIRQPKWGHLKDLHKAI 339
G PK+ L++L +I
Sbjct: 350 AG-DYTPKYIRLRELFGSI 367
>gi|302670302|ref|YP_003830262.1| beta-galactosidase Bga35A [Butyrivibrio proteoclasticus B316]
gi|302394775|gb|ADL33680.1| beta-galactosidase Bga35A [Butyrivibrio proteoclasticus B316]
Length = 622
Score = 155 bits (392), Expect = 9e-35, Method: Compositional matrix adjust.
Identities = 111/338 (32%), Positives = 159/338 (47%), Gaps = 40/338 (11%)
Query: 36 IGGKRRVLISGSIHYPRSTPEMWPDLIQKSKDGGLDVIETYVFWNLHEPVRNQYNFEGRY 95
+ G+ +ISGS HY R+ PE W D ++K K G + +ETY+ WNL EP + ++NFEG
Sbjct: 12 LNGEPFKVISGSFHYFRTVPEYWVDRLEKLKALGCNTVETYIPWNLTEPKKGEFNFEGFC 71
Query: 96 DLVKFVKLVAEAGLYAHLRIGPYVCAEWNFGGFPLWLHFIPGIQFRTDNEPFKAEMQRFT 155
D+ KF++ E GLY +R PY+CAEW FGG P WL ++ R +PF ++ +
Sbjct: 72 DVEKFIQTATELGLYIIIRPSPYICAEWEFGGLPAWLLKDRNMRLRVSYKPFLDAVEDYY 131
Query: 156 AKIVDMMKQEKLYASQGGPIILSQIENEYGNIDSAYGAAGKSYIKWAAGMALSLDTGVPW 215
V M K K GG +IL QIENEYG Y A Y+K+ + + VP
Sbjct: 132 K--VLMPKITKYQIDNGGNVILMQIENEYG-----YYANDHEYMKFMHDLMVKYGVTVPL 184
Query: 216 VMCQQSDAPDPIINTCNGFYCDQFTPN-----------------SNNKPKMWTENWSGWF 258
+ SD P + G Y + P +N P M E W GWF
Sbjct: 185 I---TSDG--PYHESYRGGYAEGAHPTGNFGSKTEERFDVIKDYTNGGPLMCAEFWVGWF 239
Query: 259 LSF--GGAVPYRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNFDRTSGGPFIS------T 310
+ GG + V+ A + + + G + YM+ GGTNF +G + T
Sbjct: 240 DHWGNGGHMKGNLVQS-AEDLDKMLELGNV--SIYMFQGGTNFGFMNGSNYYDALTPDVT 296
Query: 311 SYDYDAPLDEYGLIRQPKWGHLKDLHKAIKLCEAALVA 348
SYDYD L E G I + + + + K + + E L
Sbjct: 297 SYDYDGILTEDGQITEKYRKYQEIIGKYVDVPEVELTT 334
Score = 42.7 bits (99), Expect = 0.85, Method: Compositional matrix adjust.
Identities = 44/176 (25%), Positives = 65/176 (36%), Gaps = 47/176 (26%)
Query: 547 FDLLSLTVGLQNYGAFYEKTGAGITGPVQLKGSGNGTNIDLSSQQWTYQTGLKGEELNFP 606
FD+L +G N+G E GI VQ+ G W T P
Sbjct: 479 FDILVENMGRVNFGPRMETQRKGIGRCVQINGH--------IHNDWDIYT--------LP 522
Query: 607 SGSSTQWDSKSTLPKLQPLVWYKTTFDAPAGSEPVAIDFTGMGKGEAWVNGQSIGRYWPT 666
+ + D + P +YK TF+ + +DFTG GKG A++NG ++GR+W
Sbjct: 523 LDNVDKVDFSGDYKEGAP-AFYKFTFNVDEKGDTF-LDFTGWGKGVAFINGFNLGRFWEI 580
Query: 667 YVSQNGGCTDSCNYRGAYSSNKCLKNCGKPSQSLYHVPRSWLKSSGNTLVLFEEIG 722
P + LY +P LK N +++FE G
Sbjct: 581 ----------------------------GPQKRLY-IPAPLLKDGENEIIIFETEG 607
>gi|225870912|ref|YP_002746859.1| beta-galactosidase precursor [Streptococcus equi subsp. equi 4047]
gi|225700316|emb|CAW94604.1| putative beta-galactosidase precursor [Streptococcus equi subsp.
equi 4047]
Length = 599
Score = 155 bits (392), Expect = 9e-35, Method: Compositional matrix adjust.
Identities = 105/308 (34%), Positives = 155/308 (50%), Gaps = 28/308 (9%)
Query: 36 IGGKRRVLISGSIHYPRSTPEMWPDLIQKSKDGGLDVIETYVFWNLHEPVRNQYNFEGRY 95
+ G+ ++SG+IHY R P+ W + K G + +ETYV WNLHE Y+F G+
Sbjct: 12 LDGRSLQILSGAIHYFRIHPDDWYHSLYNLKALGFNTVETYVPWNLHEAREESYDFSGQL 71
Query: 96 DLVKFVKLVAEAGLYAHLRIGPYVCAEWNFGGFPLWLHFIPGIQFRTDNEPFKAEMQRFT 155
D+ F+ L + GLYA +R PY+CAEW FGG P WL R+ + + A ++R+
Sbjct: 72 DVEAFLTLAQQLGLYAIVRPSPYICAEWEFGGLPAWL-LTKNCHIRSSDPVYLAYVRRYY 130
Query: 156 AKIVDMMKQEKLYASQGGPIILSQIENEYGNI--DSAYGAAGKSYIKWAAGMALSLDTGV 213
+++ + + + QGG I++ Q+ENEYG+ D AY A K +++ L G
Sbjct: 131 EELLPRLARHEW--QQGGNILMFQLENEYGSYGEDKAYLTAVKGFMEEHLSAPLFTADG- 187
Query: 214 PWVMCQQSDA--PDPIINTCN------GFYCDQ---FTPNSNNKPKMWTENWSGWFLSFG 262
PW ++ + D + T N + D F+ + + P M E W GWF +
Sbjct: 188 PWRATLRAGSLIEDDVFVTGNFGSRAQENFADMQAFFSEHGKHWPLMCMEFWDGWFNRWH 247
Query: 263 GAVPYRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNFDRTSGG--------PFISTSYDY 314
+ R E+ A AV +G N YM+HGGTNF +G P + TSYDY
Sbjct: 248 EPIIKRDPEERADAVMEVLAQGSI--NLYMFHGGTNFGFMNGCSARKQLDLPQV-TSYDY 304
Query: 315 DAPLDEYG 322
DA LDE G
Sbjct: 305 DAILDEAG 312
Score = 43.9 bits (102), Expect = 0.42, Method: Compositional matrix adjust.
Identities = 55/225 (24%), Positives = 81/225 (36%), Gaps = 55/225 (24%)
Query: 512 HAFINGKLVGSGYGSSSNAKVTVDFPIALAPGKNTFDLLSLTVGLQNYGAFYEKTGAGIT 571
F++G+ V + Y + + ++ AL+ D+L +G NYG +T
Sbjct: 411 QVFLDGQKVATQYQETIGDDIIINQQHALS----QVDVLIENMGRVNYGH-------KLT 459
Query: 572 GPVQLKGSGNGTNIDLS-SQQWTYQTGLKGEELNFPSGSSTQWDSKSTLPKLQPLVWYKT 630
P Q KG G G DL W E P +Q + QP +Y
Sbjct: 460 APSQSKGLGRGMMADLHFVTNW--------EMYCLPLDDLSQLCFDGDFYEGQP-GFYHY 510
Query: 631 TFDAPAGSEPVA--IDFTGMGKGEAWVNGQSIGRYWPTYVSQNGGCTDSCNYRGAYSSNK 688
F+ EP A ID TG GKG ++N IGR+W
Sbjct: 511 QFEC---HEPEASYIDMTGFGKGCVFINNHPIGRFWEV---------------------- 545
Query: 689 CLKNCGKPSQSLYHVPRSWLKSSGNTLVLFEEIGGDPTKISFVTK 733
P +LY +P+ + N +V+FE G I V +
Sbjct: 546 ------GPLLTLY-IPKGYFNKGLNDIVIFETEGVYQDSIRLVDR 583
>gi|387878583|ref|YP_006308886.1| Beta-galactosidase 3 [Streptococcus parasanguinis FW213]
gi|386792040|gb|AFJ25075.1| Beta-galactosidase 3 [Streptococcus parasanguinis FW213]
Length = 595
Score = 155 bits (392), Expect = 9e-35, Method: Compositional matrix adjust.
Identities = 102/308 (33%), Positives = 152/308 (49%), Gaps = 27/308 (8%)
Query: 36 IGGKRRVLISGSIHYPRSTPEMWPDLIQKSKDGGLDVIETYVFWNLHEPVRNQYNFEGRY 95
+ G+ ++SG+IHY R P W + K G + +ETYV WN+HEP + Q++F GR
Sbjct: 12 LKGQPFKILSGAIHYFRIDPADWYHSLFNLKALGFNTVETYVPWNVHEPRKGQFDFSGRL 71
Query: 96 DLVKFVKLVAEAGLYAHLRIGPYVCAEWNFGGFPLWLHFIPGIQFRTDNEPFKAEMQRFT 155
DL +F+++ GLY +R P++CAEW FGG P WL ++ R+ + F + R+
Sbjct: 72 DLERFIQIAQSLGLYMIVRPSPFICAEWEFGGLPAWL-LEEDMRIRSSDPAFIEAVDRYY 130
Query: 156 AKIVDMMKQEKLYASQGGPIILSQIENEYGNI--DSAYGAAGKSYIKWAAGMALSLDTGV 213
++ ++ + ++ QGGPI++ Q+ENEYG+ D Y A + +K +
Sbjct: 131 DHLLGLLTRYQV--DQGGPILMMQVENEYGSYGEDKVYLRAIRDLMKKKGVTCPLFTSDG 188
Query: 214 PWVMCQQSDA--PDPIINTCN---------GFYCDQFTPNSNNKPKMWTENWSGWFLSFG 262
PW ++ D + T N G + F P M E W GWF +
Sbjct: 189 PWRATLRAGTLIEDDLFVTGNFGSKAAYNFGQMQEFFDEYGKKWPLMCMEFWDGWFTRWK 248
Query: 263 GAVPYRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNFDRTSGG--------PFISTSYDY 314
V R E+LA AV + G N YM+HGGTNF +G P + TSYDY
Sbjct: 249 EPVIQREPEELAEAVHEVLELGSI--NLYMFHGGTNFGFMNGCSARGTLDLPQV-TSYDY 305
Query: 315 DAPLDEYG 322
A L+E G
Sbjct: 306 GALLNEQG 313
>gi|417985674|ref|ZP_12626256.1| beta-galactosidase 3 [Lactobacillus casei 32G]
gi|410527574|gb|EKQ02437.1| beta-galactosidase 3 [Lactobacillus casei 32G]
Length = 598
Score = 155 bits (392), Expect = 9e-35, Method: Compositional matrix adjust.
Identities = 110/315 (34%), Positives = 159/315 (50%), Gaps = 30/315 (9%)
Query: 30 DHRAVVIGGKRRVLISGSIHYPRSTPEMWPDLIQKSKDGGLDVIETYVFWNLHEPVRNQY 89
DH ++ G ++L SG+IHY R P W + K G + +ETYV WNLHE +
Sbjct: 7 DHEFMLDGQPFKIL-SGAIHYFRVHPSDWYHSLYNLKALGFNTVETYVPWNLHEYNEGDF 65
Query: 90 NFEGRYDLVKFVKLVAEAGLYAHLRIGPYVCAEWNFGGFPLWLHFIPGIQFRTDNEPFKA 149
+F G D+ +F+ + GLYA +R PY+CAEW FGGFP WL ++ RTD+ +
Sbjct: 66 DFSGILDIERFLNTAKDLGLYAIVRPSPYICAEWEFGGFPAWL-LTKKMRLRTDDPAYLQ 124
Query: 150 EMQRFTAKIVDMMKQEKLYASQGGPIILSQIENEYGNI--DSAYGAAGKSYIKWAAGMAL 207
+ R+ ++ + ++ + GG +I+ Q+ENEYG+ D Y AA +K G+ +
Sbjct: 125 AIDRYYTALMPHLVGHQV--THGGNVIMMQVENEYGSYGEDKDYLAAVAELMK-KHGVDV 181
Query: 208 SLDTGV-PW--VMCQQSDAPDPIINTCN-----GFYCDQFT----PNSNNKPKMWTENWS 255
L T PW + S A I+ T N D+ + ++ P M E W
Sbjct: 182 PLFTSDGPWPATLNAGSMADAGILTTGNFGSRADMNFDRLAAFNQAHGHDWPLMCMEFWD 241
Query: 256 GWFLSFGGAVPYRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNFDRTSGG--------PF 307
GWF +G + R E+ A + QRG N YM+HGGTNF +G P
Sbjct: 242 GWFNRWGEPIIRRDPEETAEDLRAVIQRGSV--NLYMFHGGTNFGFMNGTSARKDHDLPQ 299
Query: 308 ISTSYDYDAPLDEYG 322
+ TSYDYDAPL+E G
Sbjct: 300 V-TSYDYDAPLNEQG 313
Score = 45.8 bits (107), Expect = 0.100, Method: Compositional matrix adjust.
Identities = 59/250 (23%), Positives = 94/250 (37%), Gaps = 55/250 (22%)
Query: 495 EDGSKTVLHVQSLGHALHAFINGKLVGSGYGSSSNAKVTVDFPIALAPGKNTFDLLSLTV 554
+ G+ L V + AF +GK + + Y + + D + G++ DLL +
Sbjct: 398 DKGTPAKLRVIDARDRVQAFFDGKSLATQYQEA----IGDDILLPEVEGRHQLDLLVENM 453
Query: 555 GLQNYGAFYEKTGAGITGPVQLKGSGNGTNIDLSSQQWTYQTGL---KGEELNFPSGSST 611
NYG+ I Q KG G +DL + Q L K +L+F +
Sbjct: 454 SRVNYGS-------KIEAITQFKGIRTGVMVDLHFIKDYLQYPLDLNKAPQLDF----TG 502
Query: 612 QWDSKSTLPKLQPLVWYKTTFDAPAGSEPVAIDFTGMGKGEAWVNGQSIGRYWPTYVSQN 671
W + + +Y+ FD + +D G GKG VNG +IGR+W +
Sbjct: 503 DWQAGTP-------AFYQYGFDV-VKPQDTYLDCRGFGKGVMLVNGVNIGRFW-----EK 549
Query: 672 GGCTDSCNYRGAYSSNKCLKNCGKPSQSLYHVPRSWLKSSGNTLVLFEEIGGDPTKISFV 731
G P+ SLY VP L + N +++FE G I+ V
Sbjct: 550 G-----------------------PTLSLY-VPAGLLHTGHNEVIVFETEGQYAEAINLV 585
Query: 732 TKQLGSSLCS 741
+ L +
Sbjct: 586 DHPIFKELNT 595
>gi|191637109|ref|YP_001986275.1| beta-galactosidase 3 [Lactobacillus casei BL23]
gi|385818812|ref|YP_005855199.1| galactosidase, beta 1-like protein [Lactobacillus casei LC2W]
gi|385821988|ref|YP_005858330.1| galactosidase, beta 1-like protein [Lactobacillus casei BD-II]
gi|409995961|ref|YP_006750362.1| beta-galactosidase 17 [Lactobacillus casei W56]
gi|190711411|emb|CAQ65417.1| Beta-galactosidase 3 [Lactobacillus casei BL23]
gi|327381139|gb|AEA52615.1| galactosidase, beta 1-like protein [Lactobacillus casei LC2W]
gi|327384315|gb|AEA55789.1| galactosidase, beta 1-like protein [Lactobacillus casei BD-II]
gi|406356973|emb|CCK21243.1| Beta-galactosidase 17 [Lactobacillus casei W56]
Length = 598
Score = 155 bits (392), Expect = 9e-35, Method: Compositional matrix adjust.
Identities = 110/315 (34%), Positives = 159/315 (50%), Gaps = 30/315 (9%)
Query: 30 DHRAVVIGGKRRVLISGSIHYPRSTPEMWPDLIQKSKDGGLDVIETYVFWNLHEPVRNQY 89
DH ++ G ++L SG+IHY R P W + K G + +ETYV WNLHE +
Sbjct: 7 DHEFMLDGQPFKIL-SGAIHYFRVHPSDWYHSLYNLKALGFNTVETYVPWNLHEYNEGDF 65
Query: 90 NFEGRYDLVKFVKLVAEAGLYAHLRIGPYVCAEWNFGGFPLWLHFIPGIQFRTDNEPFKA 149
+F G D+ +F+ + GLYA +R PY+CAEW FGGFP WL ++ RTD+ +
Sbjct: 66 DFSGILDIERFLNTAKDLGLYAIVRPSPYICAEWEFGGFPAWL-LTKKMRLRTDDPAYLQ 124
Query: 150 EMQRFTAKIVDMMKQEKLYASQGGPIILSQIENEYGNI--DSAYGAAGKSYIKWAAGMAL 207
+ R+ ++ + ++ + GG +I+ Q+ENEYG+ D Y AA +K G+ +
Sbjct: 125 AIDRYYTALMPHLVGHQV--THGGNVIMMQVENEYGSYGEDKDYLAAVAELMK-KHGVDV 181
Query: 208 SLDTGV-PW--VMCQQSDAPDPIINTCN-----GFYCDQFT----PNSNNKPKMWTENWS 255
L T PW + S A I+ T N D+ + ++ P M E W
Sbjct: 182 PLFTSDGPWPATLNAGSMADAGILTTGNFGSRADMNFDRLAAFNQAHGHDWPLMCMEFWD 241
Query: 256 GWFLSFGGAVPYRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNFDRTSGG--------PF 307
GWF +G + R E+ A + QRG N YM+HGGTNF +G P
Sbjct: 242 GWFNRWGEPIIRRDPEETAEDLRAVIQRGSV--NLYMFHGGTNFGFMNGTSARKDHDLPQ 299
Query: 308 ISTSYDYDAPLDEYG 322
+ TSYDYDAPL+E G
Sbjct: 300 V-TSYDYDAPLNEQG 313
Score = 46.2 bits (108), Expect = 0.072, Method: Compositional matrix adjust.
Identities = 60/251 (23%), Positives = 94/251 (37%), Gaps = 57/251 (22%)
Query: 495 EDGSKTVLHVQSLGHALHAFINGKLVGSGYGSSSNAKVTVDFPIALAPGKNTFDLLSLTV 554
+ G+ L V + AF +GK + + Y + + D + G++ DLL +
Sbjct: 398 DKGTPAKLRVIDARDRVQAFFDGKSLATQYQEA----IGDDILLPEVEGRHQLDLLVENM 453
Query: 555 GLQNYGAFYEKTGAGITGPVQLKGSGNGTNIDLSSQQWTYQTGL---KGEELNFPSGSST 611
NYG+ I Q KG G +DL + Q L K +L+F +
Sbjct: 454 SRVNYGS-------KIEAITQFKGIRTGVMVDLHFIKDYLQYPLDLNKAPQLDF----TG 502
Query: 612 QWDSKSTLPKLQPLVWYKTTFDAPAGSEPVAIDFTGMGKGEAWVNGQSIGRYWPTYVSQN 671
W + + +Y+ FD + +D G GKG VNG +IGR+W
Sbjct: 503 DWQAGTP-------AFYQYGFDV-VKPQDTYLDCRGFGKGVMLVNGVNIGRFW------- 547
Query: 672 GGCTDSCNYRGAYSSNKCLKNCGK-PSQSLYHVPRSWLKSSGNTLVLFEEIGGDPTKISF 730
GK P+ SLY VP L + N +++FE G I+
Sbjct: 548 ----------------------GKGPTLSLY-VPAGLLHTGHNEVIVFETEGQYAEAINL 584
Query: 731 VTKQLGSSLCS 741
V + L +
Sbjct: 585 VDHPIFKELNT 595
>gi|383110805|ref|ZP_09931623.1| hypothetical protein BSGG_1915 [Bacteroides sp. D2]
gi|313694380|gb|EFS31215.1| hypothetical protein BSGG_1915 [Bacteroides sp. D2]
Length = 778
Score = 155 bits (392), Expect = 1e-34, Method: Compositional matrix adjust.
Identities = 120/362 (33%), Positives = 176/362 (48%), Gaps = 39/362 (10%)
Query: 1 MASKEILLLVLCWGFVVLATTSFGANVTYDH-----RAVVIGGKRRVLISGSIHYPRSTP 55
M ++ I LLVL F V+ ++ A T ++ GK V+ + +HY R
Sbjct: 1 MKNRLIALLVL---FTVIFFSTAQAQTTARKFEAGKNTFLLDGKPFVVKAAELHYTRIPQ 57
Query: 56 EMWPDLIQKSKDGGLDVIETYVFWNLHEPVRNQYNFEGRYDLVKFVKLVAEAGLYAHLRI 115
W I+ K G++ I Y+FWN+HE +++F G+ D+ F + + G+Y +R
Sbjct: 58 AYWEHRIEMCKALGMNTICIYIFWNIHEQEEGKFDFSGQNDIAAFCRAAQKHGMYVIVRP 117
Query: 116 GPYVCAEWNFGGFPLWLHFIPGIQFRTDNEPFKAEMQRFTAKIVDMMKQ-EKLYASQGGP 174
GPYVCAEW GG P WL I RT + + M+R + ++ KQ L ++GG
Sbjct: 118 GPYVCAEWEMGGLPWWLLKKKDIALRTLDPYY---MERVGIFMKEVGKQLAPLQVNKGGN 174
Query: 175 IILSQIENEYGN--IDSAYGAAGKSYIKWAAGMALSLDTGVPWVMCQQS-----DAPDPI 227
II+ Q+ENEYG+ ID Y +A + ++ +G T VP C S +A D +
Sbjct: 175 IIMVQVENEYGSYGIDKPYVSAVRDLVR-ESGF-----TDVPLFQCDWSSNFTNNALDDL 228
Query: 228 INTCN---GFYCD-QFTPNSNNKPK---MWTENWSGWFLSFGGAVPYRPVEDLAFAVARF 280
I T N G D QF +P+ M +E WSGWF +G RP +D+ +
Sbjct: 229 IWTVNFGTGANIDQQFKKLKELRPETPLMCSEFWSGWFDHWGRKHETRPAKDMVQGIKDM 288
Query: 281 FQRGGTFQNYYMYHGGTNFDRTSGG-----PFISTSYDYDAPLDEYGLIRQPKWGHLKDL 335
R +F + YM HGGT F G + +SYDYDAP+ E G K+ L+DL
Sbjct: 289 LDRNISF-SLYMTHGGTTFGHWGGANNPAYSAMCSSYDYDAPISEPGWTTD-KFFLLRDL 346
Query: 336 HK 337
K
Sbjct: 347 LK 348
Score = 47.4 bits (111), Expect = 0.034, Method: Compositional matrix adjust.
Identities = 61/250 (24%), Positives = 94/250 (37%), Gaps = 49/250 (19%)
Query: 500 TVLHVQSLGHALHAFINGKLVGSGYGSSSNAKVTVDFPIALAPGKNTFDLLSLTVGLQNY 559
T L + + + +GKL+ + T P AL G D+L +G N+
Sbjct: 420 TTLQITEVHDWAQIYADGKLLARL--DRRKGEFTTILP-ALKKG-TQLDILVEAMGRVNF 475
Query: 560 GAFYEKTGAGITGPVQLKGSGNGTNIDLSSQQWTYQTGLKGEELNFPSGSSTQWDSKSTL 619
GIT V+L SGN T + WT NFP S D K +
Sbjct: 476 DKSIHDR-KGITEKVELI-SGNQTK---ELKNWTV--------YNFPVDYSFIKDKKYSD 522
Query: 620 PKLQPLV--WYKTTFDAPAGSEPVAIDFTGMGKGEAWVNGQSIGRYWPTYVSQNGGCTDS 677
K+ P + +YK+TF + +D + GKG WVNG ++GR+W
Sbjct: 523 KKILPTMPAYYKSTFTLDKVGDTF-LDMSTWGKGMVWVNGHAMGRFWEI----------- 570
Query: 678 CNYRGAYSSNKCLKNCGKPSQSLYHVPRSWLKSSGNTLVLFEEIGGDPTKISFVTKQLGS 737
P Q+L+ +P WLK N +++ + G I + K +
Sbjct: 571 -----------------GPQQTLF-MPGCWLKEGENEILVLDLKGPTRASIKGLKKPILD 612
Query: 738 SLCSHVTDSH 747
L ++H
Sbjct: 613 VLREKAPETH 622
>gi|418000981|ref|ZP_12641151.1| beta-galactosidase 3 [Lactobacillus casei UCD174]
gi|418009807|ref|ZP_12649594.1| beta-galactosidase 3 [Lactobacillus casei Lc-10]
gi|410548851|gb|EKQ23035.1| beta-galactosidase 3 [Lactobacillus casei UCD174]
gi|410554934|gb|EKQ28899.1| beta-galactosidase 3 [Lactobacillus casei Lc-10]
Length = 598
Score = 155 bits (392), Expect = 1e-34, Method: Compositional matrix adjust.
Identities = 110/315 (34%), Positives = 159/315 (50%), Gaps = 30/315 (9%)
Query: 30 DHRAVVIGGKRRVLISGSIHYPRSTPEMWPDLIQKSKDGGLDVIETYVFWNLHEPVRNQY 89
DH ++ G ++L SG+IHY R P W + K G + +ETYV WNLHE +
Sbjct: 7 DHEFMLDGQPFKIL-SGAIHYFRVHPSDWYHSLYNLKALGFNTVETYVPWNLHEYNEGDF 65
Query: 90 NFEGRYDLVKFVKLVAEAGLYAHLRIGPYVCAEWNFGGFPLWLHFIPGIQFRTDNEPFKA 149
+F G D+ +F+ + GLYA +R PY+CAEW FGGFP WL ++ RTD+ +
Sbjct: 66 DFSGILDIERFLNTAKDLGLYAIVRPSPYICAEWEFGGFPAWL-LTKKMRLRTDDPAYLQ 124
Query: 150 EMQRFTAKIVDMMKQEKLYASQGGPIILSQIENEYGNI--DSAYGAAGKSYIKWAAGMAL 207
+ R+ ++ + ++ + GG +I+ Q+ENEYG+ D Y AA +K G+ +
Sbjct: 125 AIDRYYTALMPHLVGHQV--THGGNVIMMQVENEYGSYGEDKDYLAAVAELMK-KHGVDV 181
Query: 208 SLDTGV-PW--VMCQQSDAPDPIINTCN-----GFYCDQFT----PNSNNKPKMWTENWS 255
L T PW + S A I+ T N D+ + ++ P M E W
Sbjct: 182 PLFTSDGPWPATLNAGSMADAGILTTGNFGSRADMNFDRLAAFNQAHGHDWPLMCMEFWD 241
Query: 256 GWFLSFGGAVPYRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNFDRTSGG--------PF 307
GWF +G + R E+ A + QRG N YM+HGGTNF +G P
Sbjct: 242 GWFNRWGEPIIRRDPEETAEDLRAVIQRGSV--NLYMFHGGTNFGFMNGTSARKDHDLPQ 299
Query: 308 ISTSYDYDAPLDEYG 322
+ TSYDYDAPL+E G
Sbjct: 300 V-TSYDYDAPLNEQG 313
Score = 46.2 bits (108), Expect = 0.085, Method: Compositional matrix adjust.
Identities = 59/250 (23%), Positives = 94/250 (37%), Gaps = 55/250 (22%)
Query: 495 EDGSKTVLHVQSLGHALHAFINGKLVGSGYGSSSNAKVTVDFPIALAPGKNTFDLLSLTV 554
+ G+ L V + AF +GK + + Y + + D + G++ DLL +
Sbjct: 398 DKGTPAKLRVIDARDRVQAFFDGKSLATQYQEA----IGDDILLPEVEGRHQLDLLVENM 453
Query: 555 GLQNYGAFYEKTGAGITGPVQLKGSGNGTNIDLSSQQWTYQTGL---KGEELNFPSGSST 611
NYG+ I Q KG G +DL + Q L K +L+F +
Sbjct: 454 SRVNYGS-------KIEAITQFKGIRTGVMVDLHFIKDYLQYPLDLNKAPQLDF----TG 502
Query: 612 QWDSKSTLPKLQPLVWYKTTFDAPAGSEPVAIDFTGMGKGEAWVNGQSIGRYWPTYVSQN 671
W + + +Y+ FD + +D G GKG VNG +IGR+W +
Sbjct: 503 DWQAGTP-------AFYQYGFDV-VKPQDTYLDCRGFGKGVMLVNGVNIGRFW-----EK 549
Query: 672 GGCTDSCNYRGAYSSNKCLKNCGKPSQSLYHVPRSWLKSSGNTLVLFEEIGGDPTKISFV 731
G P+ SLY VP L + N +++FE G I+ V
Sbjct: 550 G-----------------------PTLSLY-VPAGLLHTGHNEVIVFETEGQYAEAINLV 585
Query: 732 TKQLGSSLCS 741
+ L +
Sbjct: 586 DHPIFKELNT 595
>gi|417092513|ref|ZP_11957129.1| Beta-galactosidase [Streptococcus suis R61]
gi|353532192|gb|EHC01864.1| Beta-galactosidase [Streptococcus suis R61]
Length = 590
Score = 155 bits (392), Expect = 1e-34, Method: Compositional matrix adjust.
Identities = 104/306 (33%), Positives = 154/306 (50%), Gaps = 37/306 (12%)
Query: 43 LISGSIHYPRSTPEMWPDLIQKSKDGGLDVIETYVFWNLHEPVRNQYNFEGRYDLVKFVK 102
++SG+IHY R P+ W + K G + +ETYV WN+HEP + ++ +EG D+ +F+K
Sbjct: 19 ILSGAIHYFRVHPDDWHHSLYNLKALGFNTVETYVPWNMHEPRKGEFCYEGILDIERFLK 78
Query: 103 LVAEAGLYAHLRIGPYVCAEWNFGGFPLWLHFIPGIQFRTDNEPFKAEMQRFTAKIVDMM 162
L E GLYA +R PY+CAEW +GG P WL ++ R+ + + + + A ++
Sbjct: 79 LAQELGLYAIVRPSPYICAEWEWGGLPAWL-MKEELRVRSSDSVYLQHLDEYYASLIP-- 135
Query: 163 KQEKLYASQGGPIILSQIENEYGNIDSAYGAAGKSYIKWAAGMALSLDTGVP-------W 215
K KL +QGG +++ Q+ENEYG +YG K Y++ AG+ P W
Sbjct: 136 KLAKLQLAQGGNVLMFQVENEYG----SYGEE-KEYLRSVAGLMRKHGLTAPLFTSDGSW 190
Query: 216 VMCQQSDA--PDPIINTCN-GFYCDQ--------FTPNSNNKPKMWTENWSGWFLSFGGA 264
++ D + T N G + F + N P M E W GWF +G
Sbjct: 191 RATLRAGTLIEDDVFVTGNFGSKARENFANMTAFFNEHQKNWPLMCMEFWDGWFNRWGDE 250
Query: 265 VPYRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNFDRTSGG--------PFISTSYDYDA 316
+ R E++ +V + G N YM+HGGTNF +G P + TSYDYDA
Sbjct: 251 IIRREPEEMVDSVMECIELGSL--NLYMFHGGTNFGFMNGCSARGQIDLPQV-TSYDYDA 307
Query: 317 PLDEYG 322
LDE G
Sbjct: 308 ILDEAG 313
Score = 62.0 bits (149), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 62/223 (27%), Positives = 89/223 (39%), Gaps = 55/223 (24%)
Query: 511 LHAFINGKLVGSGYGSSSNAKVTVDFPIALAPGKNTFDLLSLTVGLQNYGAFYEKTGAGI 570
+ + +GK V + Y + V +DF K T D+L +G NYG +
Sbjct: 411 IQIYADGKFVATQYQTEIGDDVELDF----KDDKLTLDILVENMGRVNYGH-------KL 459
Query: 571 TGPVQLKGSGNGTNIDLSS-QQW-TYQTGLKG-EELNFPSGSSTQWDSKSTLPKLQPLVW 627
T P Q KG G G DL W TY L+ E+L+F G W+ +
Sbjct: 460 TAPTQSKGLGRGAMADLHFIGHWETYPLHLESVEDLDFSKG----WEEGQA-------AF 508
Query: 628 YKTTFDAPAGSEPVAIDFTGMGKGEAWVNGQSIGRYWPTYVSQNGGCTDSCNYRGAYSSN 687
Y+ F+ ++ +D TG GKG +VN +IGR+W
Sbjct: 509 YRYQFELDELAD-TYLDMTGFGKGVVFVNNVNIGRFWEK--------------------- 546
Query: 688 KCLKNCGKPSQSLYHVPRSWLKSSGNTLVLFEEIGGDPTKISF 730
P LY +P+ +LK N +V+FE G KISF
Sbjct: 547 -------GPILYLY-IPKGYLKKGENEIVVFETEGKYREKISF 581
>gi|167750408|ref|ZP_02422535.1| hypothetical protein EUBSIR_01382 [Eubacterium siraeum DSM 15702]
gi|167656559|gb|EDS00689.1| glycosyl hydrolase family 35 [Eubacterium siraeum DSM 15702]
Length = 579
Score = 155 bits (392), Expect = 1e-34, Method: Compositional matrix adjust.
Identities = 107/314 (34%), Positives = 149/314 (47%), Gaps = 36/314 (11%)
Query: 36 IGGKRRVLISGSIHYPRSTPEMWPDLIQKSKDGGLDVIETYVFWNLHEPVRNQYNFEGRY 95
+ GK +ISGSIHY R+ PE W D ++K + G + +ETY+ WN HE + +N+ G +
Sbjct: 12 LDGKPFKVISGSIHYFRTVPEYWQDRLEKLVNIGCNTVETYIPWNFHETEKGNFNWNGMH 71
Query: 96 DLVKFVKLVAEAGLYAHLRIGPYVCAEWNFGGFPLWLHFIPGIQFRTDNEPFKAEMQRFT 155
D+ +F++L + GLY +R PY+C+EW FGG P WL ++ R +P+ + +
Sbjct: 72 DICRFIELADKLGLYMIIRPSPYICSEWEFGGLPAWLLKDRSMRLRCSYKPYLNAVDSYY 131
Query: 156 AKIVDMMKQEKLYASQGGPIILSQIENEYGNIDSAYGAAGKSYIKWAAGMALSLDTGVPW 215
+ V M K GG II+ QIENEYG Y SY+++ VP+
Sbjct: 132 S--VLMPKLAPYQIDNGGNIIMMQIENEYG-----YYGNDTSYLEFLRDTMRKYGITVPF 184
Query: 216 VMCQQSDAPDPIINTCNGFYCDQFTPNSN------------------NKPKMWTENWSGW 257
V SD P +G D P N +KP M E W+GW
Sbjct: 185 V---TSDGPWSEFVFKSGM-VDGALPTGNFGSSAEWQFGEMRRFIGEDKPLMCMEFWNGW 240
Query: 258 FLSFGGAVPYRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNFDRTSG-----GPFISTSY 312
F +G E A + + G N+YM+ GGTNF SG I TSY
Sbjct: 241 FDVWGEEHNITAPEKAAQELDILLKNGS--MNFYMFEGGTNFGFMSGKNNEKKTGIVTSY 298
Query: 313 DYDAPLDEYGLIRQ 326
DYDAPL E G I +
Sbjct: 299 DYDAPLTEDGRITE 312
>gi|227533108|ref|ZP_03963157.1| beta-galactosidase 3, partial [Lactobacillus paracasei subsp.
paracasei ATCC 25302]
gi|227189289|gb|EEI69356.1| beta-galactosidase 3 [Lactobacillus paracasei subsp. paracasei ATCC
25302]
Length = 578
Score = 155 bits (391), Expect = 1e-34, Method: Compositional matrix adjust.
Identities = 110/315 (34%), Positives = 159/315 (50%), Gaps = 30/315 (9%)
Query: 30 DHRAVVIGGKRRVLISGSIHYPRSTPEMWPDLIQKSKDGGLDVIETYVFWNLHEPVRNQY 89
DH ++ G ++L SG+IHY R P W + K G + +ETYV WNLHE +
Sbjct: 14 DHEFMLDGQPFKIL-SGAIHYFRVHPSDWYHSLYNLKALGFNTVETYVPWNLHEYNEGDF 72
Query: 90 NFEGRYDLVKFVKLVAEAGLYAHLRIGPYVCAEWNFGGFPLWLHFIPGIQFRTDNEPFKA 149
+F G D+ +F+ + GLYA +R PY+CAEW FGGFP WL ++ RTD+ +
Sbjct: 73 DFSGILDIERFLNTAKDLGLYAIVRPSPYICAEWEFGGFPAWL-LTKKMRLRTDDPAYLQ 131
Query: 150 EMQRFTAKIVDMMKQEKLYASQGGPIILSQIENEYGNI--DSAYGAAGKSYIKWAAGMAL 207
+ R+ ++ + ++ + GG +I+ Q+ENEYG+ D Y AA +K G+ +
Sbjct: 132 AIDRYYTALMPHLVGHQV--THGGNVIMMQVENEYGSYGEDKDYLAAVAELMK-KHGVDV 188
Query: 208 SLDTGV-PW--VMCQQSDAPDPIINTCN-----GFYCDQFT----PNSNNKPKMWTENWS 255
L T PW + S A I+ T N D+ + ++ P M E W
Sbjct: 189 PLFTSDGPWPATLNAGSMADAGILTTGNFGSRADMNFDRLAAFNQAHGHDWPLMCMEFWD 248
Query: 256 GWFLSFGGAVPYRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNFDRTSGG--------PF 307
GWF +G + R E+ A + QRG N YM+HGGTNF +G P
Sbjct: 249 GWFNRWGEPIIRRDPEETAEDLRAVIQRGSV--NLYMFHGGTNFGFMNGTSARKDHDLPQ 306
Query: 308 ISTSYDYDAPLDEYG 322
+ TSYDYDAPL+E G
Sbjct: 307 V-TSYDYDAPLNEQG 320
Score = 44.3 bits (103), Expect = 0.31, Method: Compositional matrix adjust.
Identities = 44/173 (25%), Positives = 70/173 (40%), Gaps = 26/173 (15%)
Query: 495 EDGSKTVLHVQSLGHALHAFINGKLVGSGYGSSSNAKVTVDFPIALAPGKNTFDLLSLTV 554
+ G+ L V + AF +GK + + Y + + D + G++ DLL +
Sbjct: 405 DKGTPAKLRVIDARDRVQAFFDGKSLATQYQEA----IGDDILLPEVEGRHQLDLLVENM 460
Query: 555 GLQNYGAFYEKTGAGITGPVQLKGSGNGTNIDLSSQQWTYQTGL---KGEELNFPSGSST 611
NYG+ I Q KG G +DL + Q L K +L+F +
Sbjct: 461 SRVNYGS-------KIEAITQFKGIRTGVMVDLHFIKDYLQYPLDLNKAPQLDF----TG 509
Query: 612 QWDSKSTLPKLQPLVWYKTTFDAPAGSEPVAIDFTGMGKGEAWVNGQSIGRYW 664
W + + +Y+ FD + +D G GKG VNG +IGR+W
Sbjct: 510 DWQAGTP-------AFYQYGFDV-VKPQDTYLDCRGFGKGVMLVNGVNIGRFW 554
>gi|257866484|ref|ZP_05646137.1| glycosyl hydrolase [Enterococcus casseliflavus EC30]
gi|257873001|ref|ZP_05652654.1| glycosyl hydrolase [Enterococcus casseliflavus EC10]
gi|257800442|gb|EEV29470.1| glycosyl hydrolase [Enterococcus casseliflavus EC30]
gi|257807165|gb|EEV35987.1| glycosyl hydrolase [Enterococcus casseliflavus EC10]
Length = 591
Score = 155 bits (391), Expect = 1e-34, Method: Compositional matrix adjust.
Identities = 111/314 (35%), Positives = 151/314 (48%), Gaps = 37/314 (11%)
Query: 35 VIGGKRRVLISGSIHYPRSTPEMWPDLIQKSKDGGLDVIETYVFWNLHEPVRNQYNFEGR 94
++ GK LISG+IHY R T W D + K G + +ETY+ WNLHEP Y+FEG
Sbjct: 11 LLDGKPIKLISGAIHYFRMTSAQWADSLYNLKALGANTVETYIPWNLHEPREGVYDFEGM 70
Query: 95 YDLVKFVKLVAEAGLYAHLRIGPYVCAEWNFGGFPLWLHFIPGIQFRTDNEPFKAEMQRF 154
D+ FVK GL LR Y+CAEW FGG P WL P ++ R+ + F A+++ +
Sbjct: 71 KDIFAFVKQAQALGLMVILRPSVYICAEWEFGGLPAWLLNEP-MRLRSTDPRFMAKVRNY 129
Query: 155 TAKIVDMMKQEKLYASQGGPIILSQIENEYGNIDSAYGAAGKSYIKWAAGMALSLDTGVP 214
V + K L + GGP+I+ Q+ENEYG +YG K+Y++ + VP
Sbjct: 130 FQ--VLLPKLVPLQITHGGPVIMMQVENEYG----SYGME-KAYLRQTKELMEECGIDVP 182
Query: 215 -------W--VMCQQSDAPDPIINTCN---------GFYCDQFTPNSNNKPKMWTENWSG 256
W V+ + D + T N + + N P M E W G
Sbjct: 183 LFTSDGAWEEVLDAGTLIEDDVFVTGNFGSRSKENAAVMKEFMAKHGKNWPIMCMEYWDG 242
Query: 257 WFLSFGGAVPYRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNFDRTSGG--------PFI 308
WF +G + R +DLA V G N YM+HGGTNF ++G P +
Sbjct: 243 WFNRWGEPIIKRDGQDLANEVKEMLAVGSL--NLYMFHGGTNFGFSNGCSARGALDLPQV 300
Query: 309 STSYDYDAPLDEYG 322
S SYDYDA L E G
Sbjct: 301 S-SYDYDALLTEAG 313
Score = 53.9 bits (128), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 66/233 (28%), Positives = 88/233 (37%), Gaps = 53/233 (22%)
Query: 502 LHVQSLGHALHAFINGKLVGSGYGSSSNAKVTVDFPIALAPGKNT--FDLLSLTVGLQNY 559
L V LH F +G+L Y + ++ I AP K T D+L +G NY
Sbjct: 401 LKVVEASDRLHIFTDGQLQAIQYQETLGEELL----IQGAPDKETIELDVLVENLGRVNY 456
Query: 560 GAFYEKTGAGITGPVQLKGSGNGTNIDLSSQQ--WTYQTGLKGEELNFPSGSSTQWDSKS 617
G + GP Q KG G D+ Q Y L E+L Q
Sbjct: 457 GF-------KLNGPTQAKGIRGGIMQDIHFHQGYHHYPLTLSAEQL---QAIDYQAGKNP 506
Query: 618 TLPKLQPLVWYKTTFDAPAGSEPVAIDFTGMGKGEAWVNGQSIGRYWPTYVSQNGGCTDS 677
T P +Y+TTF + ID G GKG VNG ++GRYW Q G
Sbjct: 507 THPS-----FYQTTFTLTEVGDTF-IDCRGYGKGVVIVNGINLGRYW-----QRG----- 550
Query: 678 CNYRGAYSSNKCLKNCGKPSQSLYHVPRSWLKSSGNTLVLFEEIGGDPTKISF 730
P SLY P+ +LK N +V+FE G + ++ F
Sbjct: 551 ------------------PVHSLY-CPKEFLKKGSNEVVVFETDGVEIKELVF 584
>gi|375359947|ref|YP_005112719.1| putative glycosyl hydrolase [Bacteroides fragilis 638R]
gi|301164628|emb|CBW24187.1| putative glycosyl hydrolase [Bacteroides fragilis 638R]
Length = 769
Score = 155 bits (391), Expect = 1e-34, Method: Compositional matrix adjust.
Identities = 112/332 (33%), Positives = 158/332 (47%), Gaps = 31/332 (9%)
Query: 26 NVTYDHRAVVIGGKRRVLISGSIHYPRSTPEMWPDLIQKSKDGGLDVIETYVFWNLHEPV 85
N T ++ GK + + +HY R W I+ K G++ I YVFWN+HE
Sbjct: 20 NFTIGKSTFLLNGKPFTVKAAELHYTRIPAPYWEHRIEMCKALGMNTICLYVFWNIHEQT 79
Query: 86 RNQYNFEGRYDLVKFVKLVAEAGLYAHLRIGPYVCAEWNFGGFPLWLHFIPGIQFRTDNE 145
Q++F G+ D+ F +L + G+Y +R GPYVCAEW GG P WL I RT +
Sbjct: 80 EGQFDFTGQNDIAAFCRLAQKHGMYVIVRPGPYVCAEWEMGGLPWWLLKKKDIVLRTLDP 139
Query: 146 PFKAEMQRFTAKIVDMMKQ-EKLYASQGGPIILSQIENEYG--NIDSAYGAAGKSYIKWA 202
F M+R + ++ KQ L ++GG II+ Q+ENEYG +D Y +A + +K +
Sbjct: 140 YF---MERTAIFMKEVGKQLAPLQITRGGNIIMVQVENEYGAYAVDKPYVSAIRDIVK-S 195
Query: 203 AGMALSLDTGVPWVMCQQSDAPDP--------IINTCNGFYCD-QFTPNSNNKPK---MW 250
AG T VP C S D IN G + QF +P+ M
Sbjct: 196 AGF-----TEVPLFQCDWSSTFDRNGLDDLLWTINFGTGANIEQQFKRLKEARPETPLMC 250
Query: 251 TENWSGWFLSFGGAVPYRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNFDRTSGG----- 305
+E WSGWF +G RP + + + R +F + YM HGGT F G
Sbjct: 251 SEFWSGWFDHWGRKHETRPAKSMVQGIKDMLDRNISF-SLYMAHGGTTFGHWGGANNPSY 309
Query: 306 PFISTSYDYDAPLDEYGLIRQPKWGHLKDLHK 337
+ +SYDYDAP+ E G K+ L+DL K
Sbjct: 310 SAMCSSYDYDAPISEPGWTTD-KYFQLRDLLK 340
>gi|60683116|ref|YP_213260.1| glycosyl hydrolase [Bacteroides fragilis NCTC 9343]
gi|60494550|emb|CAH09349.1| putative glycosyl hydrolase [Bacteroides fragilis NCTC 9343]
Length = 769
Score = 155 bits (391), Expect = 1e-34, Method: Compositional matrix adjust.
Identities = 112/332 (33%), Positives = 158/332 (47%), Gaps = 31/332 (9%)
Query: 26 NVTYDHRAVVIGGKRRVLISGSIHYPRSTPEMWPDLIQKSKDGGLDVIETYVFWNLHEPV 85
N T ++ GK + + +HY R W I+ K G++ I YVFWN+HE
Sbjct: 20 NFTIGKSTFLLNGKPFTVKAAELHYTRIPAPYWEHRIEMCKALGMNTICLYVFWNIHEQT 79
Query: 86 RNQYNFEGRYDLVKFVKLVAEAGLYAHLRIGPYVCAEWNFGGFPLWLHFIPGIQFRTDNE 145
Q++F G+ D+ F +L + G+Y +R GPYVCAEW GG P WL I RT +
Sbjct: 80 EGQFDFTGQNDIAAFCRLAQKHGMYVIVRPGPYVCAEWEMGGLPWWLLKKKDIVLRTLDP 139
Query: 146 PFKAEMQRFTAKIVDMMKQ-EKLYASQGGPIILSQIENEYG--NIDSAYGAAGKSYIKWA 202
F M+R + ++ KQ L ++GG II+ Q+ENEYG +D Y +A + +K +
Sbjct: 140 YF---MERTAIFMKEVGKQLAPLQITRGGNIIMVQVENEYGAYAVDKPYVSAIRDIVK-S 195
Query: 203 AGMALSLDTGVPWVMCQQSDAPDP--------IINTCNGFYCD-QFTPNSNNKPK---MW 250
AG T VP C S D IN G + QF +P+ M
Sbjct: 196 AGF-----TEVPLFQCDWSSTFDRNGLDDLLWTINFGTGANIEQQFKRLKEARPETPLMC 250
Query: 251 TENWSGWFLSFGGAVPYRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNFDRTSGG----- 305
+E WSGWF +G RP + + + R +F + YM HGGT F G
Sbjct: 251 SEFWSGWFDHWGRKHETRPAKSMVQGIKDMLDRNISF-SLYMAHGGTTFGHWGGANNPSY 309
Query: 306 PFISTSYDYDAPLDEYGLIRQPKWGHLKDLHK 337
+ +SYDYDAP+ E G K+ L+DL K
Sbjct: 310 SAMCSSYDYDAPISEPGWTTD-KYFQLRDLLK 340
Database: nr
Posted date: Mar 3, 2013 10:45 PM
Number of letters in database: 999,999,864
Number of sequences in database: 2,912,245
Database: /local_scratch/syshi//blastdatabase/nr.01
Posted date: Mar 3, 2013 10:52 PM
Number of letters in database: 999,999,666
Number of sequences in database: 2,912,720
Database: /local_scratch/syshi//blastdatabase/nr.02
Posted date: Mar 3, 2013 10:58 PM
Number of letters in database: 999,999,938
Number of sequences in database: 3,014,250
Database: /local_scratch/syshi//blastdatabase/nr.03
Posted date: Mar 3, 2013 11:03 PM
Number of letters in database: 999,999,780
Number of sequences in database: 2,805,020
Database: /local_scratch/syshi//blastdatabase/nr.04
Posted date: Mar 3, 2013 11:08 PM
Number of letters in database: 999,999,551
Number of sequences in database: 2,816,253
Database: /local_scratch/syshi//blastdatabase/nr.05
Posted date: Mar 3, 2013 11:13 PM
Number of letters in database: 999,999,897
Number of sequences in database: 2,981,387
Database: /local_scratch/syshi//blastdatabase/nr.06
Posted date: Mar 3, 2013 11:18 PM
Number of letters in database: 999,999,649
Number of sequences in database: 2,911,476
Database: /local_scratch/syshi//blastdatabase/nr.07
Posted date: Mar 3, 2013 11:24 PM
Number of letters in database: 999,999,452
Number of sequences in database: 2,920,260
Database: /local_scratch/syshi//blastdatabase/nr.08
Posted date: Mar 3, 2013 11:25 PM
Number of letters in database: 64,230,274
Number of sequences in database: 189,558
Lambda K H
0.317 0.134 0.422
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 14,758,495,790
Number of Sequences: 23463169
Number of extensions: 690461723
Number of successful extensions: 1355426
Number of sequences better than 100.0: 1000
Number of HSP's better than 100.0 without gapping: 2338
Number of HSP's successfully gapped in prelim test: 286
Number of HSP's that attempted gapping in prelim test: 1341720
Number of HSP's gapped (non-prelim): 5644
length of query: 848
length of database: 8,064,228,071
effective HSP length: 152
effective length of query: 696
effective length of database: 8,792,793,679
effective search space: 6119784400584
effective search space used: 6119784400584
T: 11
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)
S2: 82 (36.2 bits)